GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS ECOFABE 1229 bp ds-DNA BCT 22-FEB-1990 DEFINITION E.coli biotin carboxyl carrier protein (fabE) gene, complete cds. ACCESSION M32214 KEYWORDS biotin carboxyl carrier protein. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1229) AUTHORS Alix,J.-H. TITLE A rapid procedure for cloning genes from lambda libraries by complementation of E.coli defective mutants: Application to the fabE region of the E.coli chromosome JOURNAL DNA 8, 779-789 (1989) STANDARD simple staff_review FEATURES from to/span description pept 269 739 biotin carboxyl carrier protein (fabE) BASE COUNT 321 a 324 c 318 g 266 t ORIGIN 1 cgacctcgtc ctccctgacg cagtttttgc gctgcggaaa aggtgacatt ggcgcaacga 61 aggtatattt tgttttttgc cggaggatag cagcagatcg ctgcacaatg tccgtcaagt 121 ctaacattga cactctgggg caaaatagac cggcgtcccg gcctgctgga atttatcgct 181 atgcatacag ctgtcggggc atacgcttta cagacggcgg tgaaacgcct gtcacaatca 241 cactaaacaa agagtacgga acccactcat ggatattcgt aagattaaaa aactgatcga 301 gctggttgaa gaatcaggca tctccgaact ggaaatttct gaaggcgaag agtcagtacg 361 cattagccgt gcagctcctg ccgcaagttt ccctgtgatg caacaagctt acgctgcacc 421 aatgatgcag cagccagctc aatctaacgc agccgctccg gcgaccgttc cttccatgga 481 agcgccagca gcagcggaaa tcagtggtca catcgtacgt tccccgatgg ttggtacttt 541 ctaccgcacc ccaagcccgg acgcaaaagc gttcatcgaa gtgggtcaga aagtcaacgt 601 gggcgatacc ctgtgcatcg ttgaagccat gaaaatgatg aaccagatcg aagcggacaa 661 atccggtacc gtgaaagcaa ttctggtcga aagtggacaa ccggtagaat ttgacgagcc 721 gctggtcgtc atcgagtaac gaggcgaaca tgctggataa aattgttatt gccaaccgcg 781 gcgagattgc attgcgtatt cttcgtgcct gtaaagaact gggcatcaag actgtcgctg 841 tgcactccag cgcggatcgc gatctaaaac acgtattact ggcagatgaa acggtctgta 901 ttggccctgc tccgtcagta aaaagttatc tgaacatccc ggcaatcatc agcgccgctg 961 aaatcaccgg cgcagtagca atccatccgg gttacggctt cctctccgag aacgccaact 1021 ttgccgagca ggttgaacgc tccggcttta tcttcattgg cccgaaagca gaaaccattc 1081 gcctgatggg cgacaaagta tccgcaatcg cggcgatgaa aaaagcgggc gtcccttgcg 1141 taccgggttc tgacgcgccg ctgggcgacg atatggataa aaaccgtgcc attgctaaac 1201 gcattggtta tccggtgatt atcaaaccg //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS VACWRA 2955 bp ds-DNA VRL 13-FEB-1990 DEFINITION Vaccinia virus abortive late protein gene, complete cds. ACCESSION M32064 KEYWORDS abortive late protein. SOURCE Vaccinia virus (strain WR) DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus. REFERENCE 1 (bases 1 to 2955) AUTHORS Pacha,R.F., Meis,R.J. and Condit,R.C. TITLE Structure and expression of the vaccinia virus gene which prevents virus-induced breakdown of RNA JOURNAL Unpublished (1990) D. of Biochem., SUNY-Buffalo, Buffalo, NY 14214 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.C.Condit, 12-FEB-1990. FEATURES from to/span description pept 825 < 1 (c) 32 kD protein pept 1439 828 (c) 23 kD protein pept 1454 2935 56 kD abortive late protein mRNA 1028 > 2955 56 kD abortive late protein early mRNA mRNA < 2955 1442 (c) 23 kD protein late mRNA BASE COUNT 1019 a 536 c 474 g 926 t ORIGIN About 120 kb from left end of genome. 1 tctagattcg tcggtgcact catgtaacca acatacccta ggtcctaaat atttatctcc 61 ggaattagat tttggataat tcgcgcacca acaatttcta tttcctttat gatcgttaca 121 aaagacgtat aatgccgtat ccccaaaagt aaaataatca ggacgaataa ttctaataaa 181 ctcagaacaa tatctcgcat ccatatgttt ggagcaaata tcggaataag tagacatagc 241 cggtttccgt tttgcacgta accattctaa acaattgggg tttccaggat cgtttctaca 301 aaatccagtc atgaaatcgt cacaatgttc tgtcttgtaa ttattattaa atatttttgg 361 acagtgtttg gtatttgtct tagaacaaca ttttgccacg ctatcactat cgcccaggag 421 ataatccttt tttataaaat gacatcgttg cccggatgct atataatcag tagcgtgttt 481 taaatcctta atatattcag gagttacctc gttctgataa tagattaatg atccaggacg 541 aaatttgaaa gaactacatg gttctccatg aattaataca tattgtttag caaattcagg 601 aactataaaa ctactacaat gatctatcga cataccatct atcaaacaaa acttgggttt 661 aatttctccc ggagatgttt cataatagta cgtataactt tcttctgcaa acttaacagc 721 tctattatat tcaggataat taaaacctaa ttccatatat ttgtctcgta tatctgctat 781 tcctggtgct attttgattc tattaagagt aacagctgcc cccattctta ataatcgtca 841 gtatttaaac tgttaaatgt tggtatatca acatctacct tatttcccgc agtataaggt 901 ttgttgcagg tatactgttc aggaatggtt acatttatac ttcttctata gtcctgtctt 961 tcgatgttca tcacatatgc aaagaacaga ataaacaaaa taatgtaaga aataatatta 1021 aatatctgtg aattcgtaaa tacattgatt gccataataa ttacagcagc tacaatacac 1081 acaatagaca ttcccacagt gttgccatta cctccacgat acatttgagt tactaagcaa 1141 taggtaataa ctaagctagt aagaggcaat agaaaagatg agataaatat catcaatata 1201 gagattagag gagggctata tagagccaag acgaacaaaa tcaaaccgag taacgttcta 1261 acatcattat ttttgaagat tcccaaataa tcattcattc ctccataatc gttttgcatc 1321 atacctccat ctttaggcat aaacgattgc tgctgttcct ctgtaaataa atctttatca 1381 agcactccag cacccgcaga gaagtcgtca agcatattgt aatatcttaa ataactcatt 1441 tatatattaa aaaatgtcac tattaaagat ggagtataat ctttatgccg aactaaaaaa 1501 aatgacttgt ggtcaacccc taagtctttt taacgaagac ggggatttcg tagaagttga 1561 accgggatca tcctttaagt ttctgatacc taagggattt tacgcctctc cttccgtaaa 1621 gacgagtcta gtatttgaaa cattaacaac gaccgataat aaaatcacta gtatcaatcc 1681 aacaaatgcg ccaaagttat atcctcttca acgcaaagtc gtatctgaag tagtttctaa 1741 tatgaggaaa atgatcgaat caaaacgtcc tctatacatt actcttcact tggcgtgtgg 1801 atttggtaag actattacca cgtgttatct tatggctaca cacggtagaa aaaccgtcat 1861 ttgcgtaccc aataaaatgt taatacatca atggaagaca caggtagagg cagtcggatt 1921 ggaacataag atatccatag atggagtaag tagtctatta aaggaactaa agactcaaag 1981 tccggatgta ttaatagtag tcagtagaca tctgacaaac gatgcctttt gtaaatatat 2041 caataagcat tatgatttgt tcatcttgga tgaatcacat acgtataatc tgatgaacaa 2101 tacagcagtt acaagatttt tagcgtatta tcctccgatg atgtgttatt ttttaactgc 2161 tacacctaga ccagctaacc gaatttattg taacagtatt attaatattg ccaagttatc 2221 cgatctaaaa aaaactatct atgcggtaga tagttttttt gagccatatt ccacagacaa 2281 tattagacat atggtaaaac gactagatgg accatctaat aaatatcata tatataccga 2341 gaagttatta tctgtagacg agcctagaaa tcaacttatt cttaataccc tggtagaaga 2401 attcaagtca ggaactatta atcgcatttt agttattact aaactacgtg aacatatggt 2461 attattctac aaacgattat tagatctttt cggaccagag gttgtattta taggagacgc 2521 ccaaaataga cgtactccag atatggtcaa atcaatcaag gaactaaata gatttatatt 2581 cgtatccacc ttattttatt ccggtactgg tttagatatt cctagtttgg attcgttgtt 2641 catttgctcg gcagtaatca acaatatgca aatagagcaa ttactaggga gggtatgtcg 2701 agaaacagaa ctattagata ggacggtata tgtatttcct aacacatcca tcaaagaaat 2761 aaagtacatg ataggaaatt tcatgcaacg aattattagt ctgtctgtag ataaactagg 2821 atttaaacaa gaaagttatc ggaaacatca agaatccgat cccacttctg tatgtacaac 2881 atcctccaga gaagaacgtg tattaaatag aatatttaac tcgcaaaatc gttaagaagt 2941 ttaagcgacg atccg //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS PSECAMAB 3825 bp ds-DNA BCT 13-FEB-1990 DEFINITION P.putida putidaredoxin reductase (camA) and putidaredoxin (camB) genes, complete cds. ACCESSION J05406 KEYWORDS electron transfer protein; flavoprotein reductase; iron-sulfur protein; putidaredoxin; putidaredoxin reductase. SOURCE P.putida (strain ATCC 17453) DNA, clone pBS5. ORGANISM Pseudomonas putida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Acidiphilium cryptum; Pseudomonadaceae. REFERENCE 1 (bases 1 to 3825) AUTHORS Peterson,J.A., Lorence,M.C. and Amarneh,B. TITLE Putidaredoxin reductase and putidaredoxin: Cloning, sequence determination, and heterologous expression of the proteins JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Peterson, 07-FEB-1990. FEATURES from to/span description pept 47 1315 putidaredoxin reductase pept 1371 1694 putidaredoxin mRNA < 1 1842 camA/camB mRNA (3' end +/- 11 bp) binding 32 36 ribosome binding site binding 77 106 FAD binding domain (pot.) binding 511 541 NADH binding domain (pot.) binding 1358 1363 ribosome binding site BASE COUNT 855 a 1086 c 1135 g 749 t ORIGIN 1 tccggcgact accaaagcgg tataaacaca tgggagtgcg tgctaagtga acgcaaacga 61 caacgtggtc atcgtcggta ccggactggc tggcgttgag gtcgccttcg gcctgcgcgc 121 cagcggctgg gaaggcaata tccggttggt gggggatgcg acggtaattc cccatcacct 181 accaccgcta tccaaagctt acttggccgg caaagccaca gcggaaagcc tgtacctgag 241 aaccccagat gcctatgcag cgcagaacat ccaactactc ggaggcacac aggtaacggc 301 tatcaaccgc gaccgacagc aagtaatcct atcggatggc cgggcactgg attacgaccg 361 gctggtattg gctaccggag ggcgtccaag acccctaccg gtggccagtg gcgcagttgg 421 aaaggcgaac aactttcgat acctgcgcac actcgaggac gccgagtgca ttcgccggca 481 gctgattgcg gataaccgtc tggtggtgat tggtggcggc tacattggcc ttgaagtggc 541 tgccaccgcc atcaaggcga acatgcacgt caccctgctt gatacggcag cccgggttct 601 ggagcgggtt accgccccgc cggtatcggc cttttacgag cacctacacc gcgaagccgg 661 cgttgacata cgaaccggca cgcaggtgtg cgggttcgag atgtcgaccg accaacagaa 721 ggttactgcc gtcctctgcg aggacggcac aaggctgcca gcggatctgg taatcgccgg 781 gattggcctg ataccaaact gcgagttggc cagtgcggcc ggcctgcagg ttgataacgg 841 catcgtgatc aacgaacaca tgcagacctc tgatcccttg atcatggccg tcggcgactg 901 tgcccgattt cacagtcagc tctatgaccg ctgggtgcgt atcgaatcgg tgcccaatgc 961 cttggagcag gcacgaaaga tcgccgccat cctctgtggc aaggtgccac gcgatgaggc 1021 ggcgccctgg ttctggtccg atcagtatga gatcggattg aagatggtcg gactgtccga 1081 agggtacgac cggatcattg tccgcggctc tttggcgcaa cccgacttca gcgttttcta 1141 cctgcaggga gaccgggtat tggcggtcga tacagtgaac cgtccagtgg agttcaacca 1201 gtcaaaacaa ataatcacgg atcgtttgcc ggttgaacca aacctactcg gtgacgaaag 1261 cgtgccgtta aaggaaatca tcgccgccgc caaagctgaa ctgagtagtg cctgaaatct 1321 atacccacaa taaatcaccg ttttgcccca tagcgtgtga ggataaacag atgtctaaag 1381 tagtgtatgt gtcacatgat ggaacgcgtc gcgaactgga tgtggcggat ggcgtcagcc 1441 tgatgcaggc tgcagtctcc aatggtatct acgatattgt cggtgattgt ggcggcagcg 1501 ccagctgtgc cacctgccat gtctatgtga acgaagcgtt cacggacaag gtgcccgccg 1561 ccaacgagcg ggaaatcggc atgctggagt gcgtcacggc cgaactgaag ccgaacagca 1621 ggctctgctg ccagatcatc atgacgcccg agctggatgg catcgtggtc gatgttcccg 1681 ataggcaatg gtaaaccaca atggtaaacc actgcgagcc aaaacagccg agcaggagcg 1741 cagtccggca acaccttatt aagcacatgc cgaaccctat ttgcagcgct tcatgcctgc 1801 aaagtcccga ttgatgaaat ccgggctcca agcaaggagc ccggaatctc tcaccgccac 1861 gaaatcaatg gccaatcccg ggacttggtt gagacgtccg tttctccact acgactttgt 1921 cgcaatgatc tcctcatgct cagatccgaa ggatctctat agatcataca gccaggctgt 1981 atctggagaa gtccaaggtt catctggtct gtgatgccaa tgactcgccc tcaagcctcg 2041 ttttgtcagt gagagtagtc acctgatgca acagcgtctc actcaacgcc tgcgccgcgc 2101 tggatagctg atgcccagcc tggtgcagca ggccgacgcg acgcgacacc tgtggctcgc 2161 tcagcggtag gcagcgggcg cccagttctt ccatctgctg ccggcacagt tgcggcacgg 2221 cgctgacgcc gagtgaacgc aaacgacaac gtggtcatcg tcggtaccgg actggctggc 2281 gttgaggtcg ccttcggcct gcgcgccagc ggctgggaag gcaatatccg gttggtgggg 2341 gatgcgacgg taattcccca tcacctacca ccgctatcca aagcttactt ggccggcaaa 2401 gccacagcgg aaagcctgta cctgagaacc ccagatgcct atgcagcgca gaacatccaa 2461 ctactcggag gcacacaggt aacggctatc aaccgcgacc gacagcaagt aatcctatcg 2521 gatggccggg cactggatta cgaccggctg gtattggcta ccggagggcg tccaagaccc 2581 ctaccggtgg ccagtggcgc agttggaaag gcgaacaact ttcgatacct gcgcacactc 2641 gaggacgccg agtgcattcg ccggcagctg attgcggata accgtctggt ggtgattggt 2701 ggcggctaca ttggccttga agtggctgcc accgccatca aggcgaacat gcacgtcacc 2761 ctgcttgata cggcagcccg ggttctggag cgggttaccg ccccgccggt atcggccttt 2821 tacgagcacc tacaccgcga agccggcgtt gacatacgaa ccggcacgca ggtgtgcggg 2881 ttcgagatgt cgaccgacca acagaaggtt actgccgtcc tctgcgagga cggcacaagg 2941 ctgccagcgg atctggtaat cgccgggatt ggcctgatac caaactgcga gttggccagt 3001 gcggccggcc tgcaggttga taacggcatc gtgatcaacg aacacatgca gacctctgat 3061 cccttgatca tggccgtcgg cgactgtgcc cgatttcaca gtcagctcta tgaccgctgg 3121 gtgcgtatcg aatcggtgcc caatgccttg gagcaggcac gaaagatcgc cgccatcctc 3181 tgtggcaagg tgccacgcga tgaggcggcg ccctggttct ggtccgatca gtatgagatc 3241 ggattgaaga tggtcggact gtccgaaggg tacgaccgga tcattgtccg cggctctttg 3301 gcgcaacccg acttcagcgt tttctacctg cagggagacc gggtattggc ggtcgataca 3361 gtgaaccgtc cagtggagtt caaccagtca aaacaaataa tcacggatcg tttgccggtt 3421 gaaccaaacc tactcggtga cgaaagcgtg ccgttaaagg aaatcatcgc cgccgccaaa 3481 gctgaactga gtagtgcctg aatgtctaaa gtagtgtatg tgtcacatga tggaacgcgt 3541 cgcgaactgg atgtggcgga tggcgtcagc ctgatgcagg ctgcagtctc caatggtatc 3601 tacgatattg tcggtgattg tggcggcagc gccagctgtg ccacctgcca tgtctatgtg 3661 aacgaagcgt tcacggacaa ggtgcccgcc gccaacgagc gggaaatcgg catgctggag 3721 tgcgtcacgg ccgaactgaa gccgaacagc aggctctgct gccagatcat catgacgccc 3781 gagctggatg gcatcgtggt cgatgttccc gataggcaat ggtaa //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS CHKAK1 6300 bp ds-DNA VRT 15-MAR-1989 DEFINITION Chicken cytosolic adenylate kinase (AK1) gene, complete cds and flanks. ACCESSION D00251 KEYWORDS AK1 gene; adenylate kinase; cytosolic adenylate kinase. SOURCE Chicken liver DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 6300) AUTHORS Suminami,Y., Kishi,F., Torigoe,T. and Nakazawa,A. TITLE Structure and complete nucleotide sequence of the gene encoding chicken cytosolic adenylate kinase JOURNAL J. Biochem. 103, 611-617 (1988) STANDARD full staff_entry COMMENT The chicken AK1 gene spanned about 6 kbp and consisted of 7 exons. The transcription initiation site was determined by the S1 nuclease mapping analysis. Evolution of the AK1 gene is discussed in comparison with the isozyme AK2. corresponds to DDBJ (rel4.0) entry <CHKAK1>(D00251) dated 12-SEP-1988 FEATURES from to/span description pept 2612 2621 adenylate kinase (a part of exon 2) 4056 4091 adenylate kinase (exon 3) 4478 4641 adenylate kinase (exon 4) 4754 4870 adenylate kinase (exon 5) 5382 5573 adenylate kinase (exon 6) 5689 5754 adenylate kinase (exon 7) pre-msg 1070 5914 AK1 mRNA and introns IVS 1121 2578 AK1 intron a IVS 2622 4055 AK1 intron b IVS 4092 4477 AK1 intron c IVS 4642 4753 AK1 intron d IVS 4871 5381 AK1 intron e IVS 5574 5688 AK1 intron f signal 973 976 CAT box signal 1041 1047 TATA box signal 5896 5901 polyadenylation signal BASE COUNT 998 a 2011 c 1904 g 1387 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccgat tgcttcagag tccccttggt ttggggtccg ggctgcactg tgcaccccag 61 ggctgataca gcatcagggc cgtgtgcacc gctgctgccc atgggtctgc acagccacga 121 catggcaggt tttagggtgt ctcatcccct cctgcaggac gtgggaaccc tgcatggtcc 181 caactcagaa cctcccaggc tgcgttccat caccctgaga gcacacgggg ggtaggtgaa 241 tggtcccctt tggctggcat gggggtcacg ttgctttggg ggcaaaattt tagccccacc 301 cccccccccc acctgtcccc cagcacaaat cctcacattt tcagtgtcac catccccaga 361 gcagagacct gagagccacc ccagttagac caaggggatg acactgccaa cgtggtgccg 421 ggagggggga ggtggggagg aggggggggg caataccaca cagcagctgc cggaaccttt 481 tgtaacctgg ccatggggcg tggggggatt ttttccctcc tggtcccggt gtttggattc 541 ccactgtgga gcccacgatg ctcatcactc agcagcccct cacgaaccat tcagaagggt 601 ttggacgctg agccccattg ggtggcagcc ccgtgcctca gtttccccct cggccgtggc 661 agcggggggg gggggcgtcc ggtggggtca caccttgggg ggaggggggg gctcagcacc 721 gtgttttgta tcgatttgta caggaataaa cctcgctgca gtctgttgcc ttggaccgtg 781 ctgcgggatg catctctggg gtccgggggg gagggaaagg gaccccgtag gacccccccc 841 catcccgagc ggggctgagg tgggcggttg ggggctgcag gacctcccag ctctacccgt 901 catccgcggg ctcctcccgt tgcgggggcg cggtaccggg gggctgcggg ggggctgcgg 961 gcgggcgggc ggcattgccc ccccccacag tgccccccct ccccgctgcc cgctcccggc 1021 atgccccgcg ccgccagcat tatatatatc tagcggtacg gcagaagctg ccggccagca 1081 gcagccgccc cccccgtatt ctgcaccgcc ccccccgcag gtcagtgccg ccccccccag 1141 ccctgctgct ggtttgcacc gcggttcgtg tcccaacccc cccccccaga tctcccccct 1201 ggtccccccc ggcacagtct cgcggtgacc ttggggaggg cgttccggca tccggcttcg 1261 gggccccccc cgctgccccc cgcccgacca ttctcgcgtg ggggcgagac gcgatcggtc 1321 cgtccccccc tctccgtgcg ttgagctgtg gggatggggg tgcggggggg gggatgtgaa 1381 catcgctgcg ggtgtgcggg acccgggggg ggagaaggga gaggagcggt ggggagcacc 1441 ggggggcatt gcaggggtgc tgtgctattg cggggggtgc tgagctatcg gggggcgccg 1501 gggtgcggcc cttgggctgg agtccccaca tgggaggtgg gtgcgggcaa agcgtgcggt 1561 tggagctcct ggtgggctgc agttctgggg agcaccctgg ggtgcagcac ggcgtgggct 1621 gtgctgggga gtgctgtgcc gcccgctccg tgggcacagc gccacgctgg atcaggccgt 1681 gagcccacca agggctgctg ccaggtgtgc tcggtgcccc cgcgccaggg acccccaggt 1741 tgtgcccttc ctatggggcg atgcagggct cagcctcaaa acgcaagcag gacgtgaccc 1801 ggtgcagatt tcagcccgtc catacttagc ctctgccctt tcccggctgt aacacgacct 1861 gatggatggc cacggcccct tcggccccgg tctggccggt ttggggacag cggggtgacg 1921 tggtgtggct gagcaggtgg ggtgctgcgg ggctgggtgc gggcgggcgg ttatgggtat 1981 ggccaaaccc acagcgagga actggaaggg cagtggggtg cagcagcccc cccttgggca 2041 ccgagaacag gctgacccct taaaaagcat cccctccagc acatggggag ctggaggtac 2101 gtgggatact gcctccgtcc ctcagtgtcc tgctgccatg tgtcaccccg agccaatttc 2161 cggctgccag gcagcacccc gggcatgggg atgtggtcag ggcaggggag ggtgctggga 2221 ttgtcaccgt gcgtcagccc agtgccggca cagggcgggt ggctgccggc agcgaggtag 2281 gcgacggccg caggaagtga ggtcccagct cctggagcca tggaattcag gcagcattgt 2341 gcctttctcg tgtattttag taattttata tatttaagag ccaagctggg gaacggtgct 2401 gctagcaggg gggaaacccc gtcccccagc gtggccctag agctcaattt gagatctgct 2461 ccatcccatc gtggaagcaa agggagagct gttgtcctgg caacgtcctc cccaggccgc 2521 agtgccgggc cccgtgtgtc cctgccgtgc ctctcactct gctctgcttc ccttgcagcc 2581 tgaagtgctc ctccacccag acagcagcag catgtcgaca ggtgagtgct ttgcatctgt 2641 gcctcctgca tccattccct ccatcccctt tgcattcacc cccccttgca cctgatccct 2701 caatccccct tggcatccat cctctccaac ccctgttgcc tccatccctt ttatctcatt 2761 gtgtgtactc aatttttcct ccttgtgtcc atctcctgcg tcctttttgc atttgtcctc 2821 tccgtccctc ttgcatctaa acccttgggc cccctcgtgt ccatcctctg tgtccatctg 2881 ttccctctgt tcctcttgca tctaatccct tagttccctt tgcatccatc catctgtccc 2941 cccatcctcc tatgttgtcc atttgtgtcc atcacttgtg cctcgacatg tccatccctt 3001 ttctccccct tacatctgat ctctcagccc tctgtgcctg tctcccatgt cctctctgta 3061 tctgtcacct ctgtccctct tgcatctgat ccttaggtcc ccttttgtgt ctgtctcatg 3121 ttctctttcc atctatctcc tctgtttccc ttgtatccaa gtcctgaatc tctcatccaa 3181 tccccagagt cccccccttg tgtccatctt ctatgttctc cagtagtgtc tgtctcctct 3241 gtctccccct cgtgtccatt ccgtgtcctc tttccatccc ttccctccat tccccttacc 3301 tgtccccccg ttgcctctgt cccatttctc ccccttgctt ccctccagcc cccacgtact 3361 ccatgtcttc tgcccacccc ccttccccac cagcctcttg gctccctccc atcccggggc 3421 tcaccggctg ctggaggagc tgaaatttca gaagactgag atctggagat ggcagcacgg 3481 ccccagggtg gctcatgtta tttttagcgt cctccctctg cgctgtcaca ccccaacgcg 3541 tggcagggac agatcctgcg gggggggggg gggaatcctg cagggaaaac ccccccgtcc 3601 tatgggctgg agtgctctga gtggatgggg ctatgccatt tctggggaca gtcagatccc 3661 acagggttgt gccatgccgt gccatgctgc accaagcatg actaggagct gaccatgggg 3721 atggttgtgg ctgtccctgc tggttcctat ctctcagtgg ggtcggagaa atgaccgtct 3781 ccaggatgca gtggagttcc aacccctcag ctcaacccct cgtctgccag gacaccatgg 3841 ggtctcagtg cacgagcatg tgcagcttca atcctggact gtctaggtga caggatgaca 3901 ggatgagtgc ttcctactct gccctccctg gggttggggt tggtccccac cactcctcgt 3961 tcagtcctgc cactggggac agttttctcc ctggtgtaac ttctggcttt tagatttatt 4021 gttgtttgtt ttttttttct ctctctctcc accagaaaaa ctgaagcacc acaagatcat 4081 cttcgtggtg ggtaagtggg tgtgggacag aggtggccgg tgtggggtgt gctgggacca 4141 gcagggtcta gcaaagcttg ggatgccagc atccaatgtg atgtgatggc aaactcctgg 4201 gggtggtgtc atgggtgctt gatggggcgg tgatgtccag ggtaccatca ggagagtgat 4261 gtctgggata ttggtcatgg tgcctctggt gccatcagag caatgatgcc caggtgccaa 4321 tggccaatca gccacaccag cagcccatgc cgccccaaca gctctttcca tccatcctgt 4381 ctcctctcat caggctttga tgggtctggg ctgccgctcc tcctaccccc actagatggg 4441 gcatgggaca tgacggtggc cctggtgtga cttgcaggtg gccccggctc agggaagggg 4501 acgcaatgcg agaagattgt gcacaagtat gggtacactc acctctccac tggggacctg 4561 ctccgggcag aggtcagctc gggctcagag cggggcaaga agctgcaagc catcatggag 4621 aagggcgagc tggttcccct ggtgagtttg tggtcatgaa cacagggtga tgtggggaca 4681 ctgtggtgat gcagcagcag gaatggggca cgtcgtccca catcaccctg acagctgtat 4741 gccccctccc caggacacgg tgctggacat gctgcgggac gccatgttgg ctaaggcaga 4801 cacctccaag ggtttcctca ttgacggcta ccctcgcgag gtgaagcagg gagaggagtt 4861 tgaaaagaag gtgagggccg ttcccctggt gagtgccgtg cggtgttatg ttgtgccacc 4921 ctccgtctgc ggcctatgcg gtgctggcca atggtgaggc cttgtcggct tcactgcagc 4981 ctcagcccag ccctggcgtc agccacagca gggcacggtg gcaggggaca tctatccagg 5041 ctgccctcag gcctggggca gtggcagaga gccaaggcct gtcccctggt gggtgccatg 5101 ccacagtgtg ttgtgtcatc ctccatctgc ggcctacatg gtgatggcac tatgtgaggc 5161 ctagtcaggc ttcactgcgg cctacaggcc tacgccctgg ggacccccta tagtgggatg 5221 cggtggcagg ggacgtctgt ctgggctgca cccagggctg gcagagggtg actggggtga 5281 tggccacgct gtcccctcct tgccggtgtc acacctccag ccccacctgg aggagacgtg 5341 gggacaccca gacccacatg gtgacagctg tgtccccaca gattgccccc cccacgctgc 5401 tgctctacgt ggatgcgggg aaggagacga tggtgaagcg gctgctgaag cggggagaga 5461 ccagcgggcg ggtggacgac aacgaggaga ccatcaagaa gcgcttggag acgtactaca 5521 aggctaccga acccgtcatc gccttctaca agggccgggg catcgtccgc caggtgagcg 5581 gggtacaggg aggggggaag caatacatgg gggtaacaga gctgccagct gccccccctt 5641 ccccccagca ttgctgaggg aaccagcacc ccgtcactgt ccccacagct gaatgccgag 5701 ggcaccgtgg atgaggtttt ccagcaggtc tgctcctacc tcgacaagct gtgacggccc 5761 cgcgcccccc accctgcgct ccccccacac gcgcagggca gagacagcgg aagtggcctt 5821 atcctgtttt cgtggacaga gccgcgcgaa ggaaatttca aggacattgt gtttggctct 5881 ttcccgtctc tccccagtaa agttcacttt aatgagccca gactttatct ttttcttctg 5941 tcgcaggaaa tgagtttttc tttccagaga tttttgtttc tagcctcccc ccaccccccc 6001 tcagccagcc cctctctgaa gctgataaag ggcaggaagc gggtgtttat cccggctgag 6061 atgcagcggg cccgggggat gcttggagga aggcaggaga tcccggctgc ttcgcctggg 6121 gccttggtgt cctggccctg tcctgctccc cccaacccga tctcattcct gtgctcctgt 6181 ctctgcatgc gcctggggca gcatcccacc ccatccctcc tcctcatccc atcagatggt 6241 gatttccacc ccttccacca cttcctcgct gcttttgtcc attggggtgt tttgggatcc // LOCUS CHKBGBL3 192 bp ds-DNA VRT 15-MAR-1990 DEFINITION Chick 14K beta-galactoside-binding lectin gene, exon 3. ACCESSION D00310 KEYWORDS 14K lectin; beta-galactoside-binding lectin; chick 14K beta-galactoside-binding lectin; lectin. SEGMENT 3 of 4 SOURCE A library of chick genomic fragments in lambda-EMBL3 provided by J. D. Engel (Northwestern University). ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 192) AUTHORS Ohyama,Y. and Kasai,K. TITLE Isolation and characterization of the chick 14K beta-galactoside- binding lectin gene JOURNAL J. Biochem. 104, 173-177 (1988) STANDARD full staff_entry COMMENT The mRNA for the lectin is encoded by four exons. The first exon encodes only three amino acid residues and the other three exons encode, respectively, one of the three repeating sequences found in this lectin. FEATURES from to/span description pept + 11 + 182 14K beta-galactoside-binding lectin, exon 3 (AA 31 at 12) IVS < 1 10 14K beta-galactoside-binding lectin, intron 2 IVS 183 > 192 14K beta-galactoside-binding lectin, intron 3 BASE COUNT 49 a 48 c 52 g 43 t ORIGIN About 1.2 kb after segment 2. 1 ttctctgcag ctttgtgatg aatctgggca aggactcaac ccacctgggg cttcacttca 61 acccccgctt tgatgctcac ggtgatgtga acctcattgt gtgcaactca aagaaaatgg 121 aagagtgggg taccgagcaa agggagacgg tcttcccttt ccagaaggga gccccaatag 181 aggtaaagcc tg // LOCUS XELPRL5B 986 bp ss-mRNA VRT 15-MAR-1990 DEFINITION X.laevis ribosomal protein L5b 60 S subunit mRNA, complete cds. ACCESSION M29033 KEYWORDS 5S ribosomal RNA binding protein; nucleolar protein; ribosomal protein. SOURCE X.laevis stage 17 embryo, cDNA to mRNA, clone L5b. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 986) AUTHORS Wormington,W.M. TITLE Developmental expression and 5S rRNA-binding activity of Xenopus laevis ribosomal protein L5 JOURNAL Mol. Cell. Biol. 9, 5281-5288 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Wormington, 14-OCT-1989. FEATURES from to/span description pept 53 943 L5b ribosomal protein mRNA < 1 > 986 L5b mRNA BASE COUNT 309 a 200 c 252 g 225 t ORIGIN 1 gttagggctg tgactctccg gtcctagccc cgacggctca aaatagagca aaatggggtt 61 cgtaaaggtc gtcaagaaca aggcttattt taagaggtac caggtcaagt tccgcagaag 121 gagagagggc aagaccgatt actatgctcg caagcgactg gtgatccagg ataagaacaa 181 gtacaatact cccaagtaca ggatgattgt acgtgtcacc aacagagaca tcatctgcca 241 gattgcttat gctaggattg agggagacat gattgtatgt gctgcttatg ctcatgagct 301 tcccaagtat ggtgtcaaag ttgggctgac aaactatgct gcagcatatt gtactggcct 361 gctgctggct cgtaggcttc tcaacaaatt cggtctggac aaagtatacg aaggtcaagt 421 ggaagtaaca ggagatgagt acaatgtaga gagcgttgat ggtgaacctg gtgcctttac 481 atgctacctt gatgccggcc tcaccagaac caccactgga aacaaagttt ttggtgctct 541 taagggagct gtggatggag gtttatctat tccacacagt accaagcgat tccctggcta 601 tgactctgaa agcaaagaat tcaatcctga ggtccaccgc aagcacatct tcgcccagaa 661 tattgcagag tacatgcgtc ttctgatgga agaagatgaa gatgcatata aaaaacagtt 721 ctcccagtac ataaagaatg gtgtcgcagc agatcagttg gaagacatat acaaaaaggc 781 tcatgcaggt attcgggaga atccagtgca cgaaaagaaa cccaagaagg aagtcaaaaa 841 gaagaggtgg aaccgtgcca aattgtctct ggaacagaaa aaggaccgtg ttgctcagaa 901 aaaagccagt ttcctcagag ctcaggagaa ggcagacagc taaatggatt atgttgcgat 961 tttctaagtg ttgtcagcaa aaaaat //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS DRMMTTRND 976 bp ds-DNA ORG 15-JUN-1989 DEFINITION D.mauritania Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase subunit 2 (3' end) cytochrome oxidase subunit 1 (5' end) genes. ACCESSION M18073 KEYWORDS NADH dehydrogenase; cytochrome oxidase; transfer RNA-Cys; transfer RNA-Trp; transfer RNA-Tyr. SOURCE D.mauritania (adult, from Mauritius Island) mitochondrial DNA. ORGANISM Mitochondrion Drosophila mauritania Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae; Drosophila mauritiana. REFERENCE 1 (bases 1 to 976) AUTHORS Satta,Y., Ishiwa,H. and Chigusa,S.I. TITLE Analysis of nucleotide substitutions of mitochondrial DNAs in Drosophila melanogaster and its sibling species JOURNAL Mol. Biol. Evol. 4, 638-650 (1987) STANDARD simple staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by Y.Satta, 04-FEB-1988. FEATURES from to/span description ORF < 1 461 NADH dehydrogenase subunit 2 (AA at 1; ta stop codon) ORF 668 > 976 cytochrome oxidase subunit 1 (gta start codon) tRNA 461 526 Trp-tRNA tRNA 580 519 (c) Cys-tRNA tRNA 666 601 (c) Tyr-tRNA BASE COUNT 348 a 105 c 96 g 427 t ORIGIN Unreported. 1 tctttaataa ttagagaatc aatttgactt atttattttt ttttttattc atttttatca 61 tttgtattaa catttatatt taatattttt aaattatttc atttaaatca attattttct 121 tgatttgtaa acagaaaaat tttaaaattt acattattta taaatttttt atcattagga 181 ggattacctc catttttagg atttttacct aaatgacttg taattcaaca attaacatta 241 tgtaatcaat attttttatt attattaata ataatatcaa ctttaattac attatttttt 301 tatttacgaa tctgttattc tgcttttata ataaattatt tcgaaaataa ttgaatcata 361 aagataaata taattagtag taatactaat atatatttaa ttataacttt tttttcaatt 421 tttggattat ttataatttc tttattttat tttatatttt aaggctttaa gttaataaaa 481 ctaataacct tcaaagctat aaataaagaa atttctttaa gccttagtaa aaattactcc 541 ttcaaaattg cagtttgata tcattattga ctataagacc taataaaatt tgtccttatt 601 tgattaagaa gaataaatct tataaataga tttacaatct atcgcctaaa cttcagccac 661 ttaatcagta accgcgacaa tgattatttt ctacaaatca taaagatatt ggaactttat 721 attttatttt tggagcttga gctggaatag ttggaacatc attaagaatt ttaattcgag 781 ccgaattagg acatcctgga gcattaattg gagatgacca aatttataat gtaattgtaa 841 ctgcacatgc ttttattata atttttttta tagttatacc tattataatt ggtggatttg 901 gaaattgatt agtaccttta atattaggtg ctcctgatat agcattccca cgaataaata 961 atataagatt ttgatt // LOCUS DROMTTRND 975 bp ds-DNA ORG 15-JUN-1989 DEFINITION D.melanogaster Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase subunit 2 (3' end) cytochrome oxidase subunit 1 (5' end) genes. ACCESSION M18022 KEYWORDS NADH dehydrogenase; cytochrome oxidase; transfer RNA-Cys; transfer RNA-Trp; transfer RNA-Tyr. SOURCE D.melanogaster (adult, from Sapporo, Japan) mitochondrial DNA. ORGANISM Mitochondrion Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae; Drosophila melanogaster. REFERENCE 1 (bases 1 to 975) AUTHORS Satta,Y., Ishiwa,H. and Chigusa,S.I. TITLE Analysis of nucleotide substitutions of mitochondrial DNAs in Drosophila melanogaster and its sibling species JOURNAL Mol. Biol. Evol. 4, 638-650 (1987) STANDARD simple staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by Y.Satta, 04-FEB-1988. FEATURES from to/span description ORF < 1 461 NADH dehydrogenase subunit 2 (AA at 1; ta stop codon) ORF 667 > 975 cytochrome oxidase subunit 1 (ata start codon) tRNA 461 526 Trp-tRNA tRNA 580 519 (c) Cys-tRNA tRNA 665 600 (c) Tyr-tRNA BASE COUNT 345 a 101 c 97 g 432 t ORIGIN Unreported. 1 tctttaataa ttagagaatc aatttgatta atttattttt ttttttattc atttttatca 61 tttgtattaa catttatatt taatattttt aaattatttc atttaaatca attattttct 121 tgatttgtta atagaaaaat tttgaaattt acattattta taaatttttt atcattagga 181 ggattacctc catttttagg atttttacca aaatgacttg taattcaaca attaacatta 241 tgtaatcaat attttatatt aacaattata ataatatcaa ctttaattac attatttttt 301 tatttacgaa tttgttattc cgcttttata ataaattatt ttgaaaataa ttgaatcata 361 aagataaata taaatagtat taattataat atatatataa ttataacttt tttttcaatt 421 tttggattat ttttaatttc tttattttat tttatatttt aaggctttaa gttaataaaa 481 ctaataacct tcaaagctat aaataaagaa atttctttaa gccttagtaa aacttactcc 541 ttcaaaattg cagtttgata tcattattga ctataagacc taattaattt gtccttattt 601 gattaagaag aataaatctt atatatagat ttacaatcta tcgcctaaac ttcagccact 661 taatcaataa tcgcgacaat gattattttc tacaaatcat aaagatattg gaactttata 721 ttttattttt ggagcttgag ctggaatagt tggaacatct ttaagaattt taattcgagc 781 tgaattagga catcctggag cattaattgg agatgatcaa atttataatg taattgtaac 841 tgcacatgct tttattataa ttttttttat ggttatacct attataattg gtggatttgg 901 aaattgatta gtgcctttaa tattaggtgc tcctgatata gcattcccac gaataaataa 961 tataagattt tgact // LOCUS DRSMTTRND 975 bp ds-DNA ORG 15-JUN-1989 DEFINITION D.simulans Trp-tRNA, Cys-tRNA, Tyr-tRNA, NADH dehydrogenase subunit 2 (3' end) cytochrome oxidase subunit 1 (5' end) genes. ACCESSION M18072 KEYWORDS NADH dehydrogenase; cytochrome oxidase; transfer RNA-Cys; transfer RNA-Trp; transfer RNA-Tyr. SOURCE D.simulans (adult from Nairobi, Kenya) mitochondrial DNA. ORGANISM Mitochondrion Drosophila simulans Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae; Drosophila simulans. REFERENCE 1 (bases 1 to 975) AUTHORS Satta,Y., Ishiwa,H. and Chigusa,S.I. TITLE Analysis of nucleotide substitutions of mitochondrial DNAs in Drosophila melanogaster and its sibling species JOURNAL Mol. Biol. Evol. 4, 638-650 (1987) STANDARD simple staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by Y.Satta, 04-FEB-1988. FEATURES from to/span description ORF < 1 461 NADH dehydrogenase subunit 2 (AA at 1; ta stop codon) ORF 667 > 975 cytochrome oxidase subunit 1 (gtg start codon) tRNA 461 526 Trp-tRNA tRNA 580 519 (c) Cys-tRNA tRNA 665 600 (c) Tyr-tRNA BASE COUNT 343 a 107 c 102 g 423 t ORIGIN Unreported. 1 tctttaataa ttagagaatc agtttggttt atttattttt ttttttattc atttttatca 61 tttgtattaa catttatatt taatattttt aaattatttc atttaaatca attattttct 121 tgatttgtaa atagaaaaat tttaaaattt acattattta taaatttttt atcattggga 181 ggattacccc catttttagg atttttacca aaatgacttg taatccaaca attaacatta 241 tgtaatcaat attttttatt attaataata ataatatcaa ctttaattac attatttttt 301 tatttacgaa tttgttattc tgcttttata ataaattatt tcgaaaataa ttgaatcata 361 aagataaata taattagtat taatactaat atatatttaa ttataacttt tttttcaatt 421 tttggattat ttataatttc tttattttat tttatatttt aaggctttaa gttaataaaa 481 ctaataacct tcaaagctat aaataaagaa atttctttaa gccttagtaa aaattactcc 541 ttcaaaattg cagtttgata tcattattga ctataagacc taataaaatt gtccttattt 601 gattaagaag aataaatctt ataaatagat ttacaatcta tcgcctaaac ttcagccact 661 taatcagtga ccgcgacaat gattattttc tacaaatcat aaagatattg gaactttata 721 ttttatcttt ggagcttgag ctgggatagt cggaacatca ttaagaattt taattcgagc 781 cgaattagga catcctggag cattaatcgg agatgatcaa atttataatg taattgtaac 841 tgcacatgct tttattataa ttttttttat agttatacct attataattg gtggatttgg 901 aaattgatta gtgcctttaa tattaggtgc ccctgatata gcattcccgc gaataaataa 961 tataagattt tgatt // LOCUS RICMTB1 2135 bp ds-DNA circular ORG 15-MAR-1989 DEFINITION Oryza sativa (rice) Mitochondrial plasmid-like DNA (B1), complete sequence. ACCESSION D00293 KEYWORDS cytoplasmic male sterility; mitochondrial DNA; mitochondrial plasmid. SOURCE Cytoplasmic male-sterile rice line (A-58CMS), mitochondrial plasmid-like DNA (B1). ORGANISM Mitochondrion Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Oryza sativa. REFERENCE 1 (bases 1 to 2135) AUTHORS Shikanai,T., Yang,Z.-Q. and Yamada,Y. TITLE Properties of the circular plasmid-like DNA B1 from mitochondria of cytoplasmic male-sterile rice JOURNAL Plant Cell Physiol. 28, 1243-1251 (1987) STANDARD full staff_entry COMMENT The complete nucleotide sequence of B1 (2,135bp) which is a plasmid-like minicircular DNA in the mitochondria of Oryza sativa A-58CMS was found to contain sequences homologous to those in the 1,913bp plasmid-like DNA of maize, and therefore, this sequence may have originated from the nuclear genome. The number 2014 described under the result section as the beginning of ORF seems to be wrong number. According to the sequence map, correct number should be 2019. corresponds to DDBJ (rel4.0) entry <RICMTB1>(D00293) dated ORG FEATURES from to/span description ORF 1793 2035 open reading frame ORF 2019 1732 (c) open reading frame (comp strand) BASE COUNT 545 a 494 c 449 g 647 t ORIGIN 3 bp downstream of EcoRI site. 1 ttcaggattg gctaaggctc tagccattaa atgtgactgg ggttgatcca agggctggtc 61 tcatggtttt gactaggtgt ttctcaaatt tctgatcata tttcctgtcg ttataacccc 121 caaaatacgc taactcgcag ctcgaagttc ccagttacat gggcaaccct actcactata 181 catagggtgt gaagagacgg acaacaatag tctttctctg ctttcactcg agccaaacta 241 agtgttttct aggattggcc ccaaagccat actatggtca caacggtaac cccctcgatt 301 tagtttatcc aatcaatcga gaggcagcat ctctatcaat tactcggatc ggcaacctcg 361 gttctggaat gtggacttca ctgaccagcc gcctcttagt ccacacccaa cttggaagtg 421 ttattgattc gttagtaacc ttcgccatcc taatacccga gactgtagtg ttctaagttg 481 gttaaagccg tcgtctcctg ggatgcccgg ctggtcgggc aagtattagt gaaacggctc 541 cgcgcaatca ataactaccc tgcttaggta tgagcttcca agtcccgaat ccttttgtct 601 caagtctgcc ttggtaatgt gaccacacat cctttgatga tggaagtgtt gtttcccaac 661 aatccacccc ggaatcctag ttggagtaat gtatccctac atactatcgc gctaatgtaa 721 tcgggttctc agatcatact caatgagggg tggctagtat cccgaaactt tgaagaacat 781 gagttaagta gttcattcac ttggtttcgc caccagtcaa aacgccctaa aacccagtag 841 gttttctcga tcaccattct acgattttat ttccacctgt tttagaacac ataaattgtg 901 ctacccttaa gagtttatca ctaaaatcaa gaactttatt tatgacctgt aagttgcctc 961 ataaaaagta gatgttagta cccgactaaa aggagggaat caaataaaag gtaaagctct 1021 caattaaaat aaaaagctat cgtttacttt gttaagggta tattgcccgt agcgaggtta 1081 aggtaaaccc ttgaccgagt gaaggcgagc ccggtaaagc actaagtgta ttggctagct 1141 tgtactttat tagctatcaa gtgtttgatg taccaattac tttattgtgt caactagttt 1201 attgaatgca aaggtaattc cctcgataat tgatttaatt atcaagaaag agaattagtt 1261 accctccttc cctcgagctg agctatttct tgtgtattca atcgtatgca cgagggctgt 1321 cctctcctct gctctatctc ccggacaggt atgaaagtcg taattctttg ggtttttgta 1381 gtcaatggaa tgagcggagg gagacacatc cttccatcca aaggtgtgtg tgtcccgtag 1441 cgagtggaat ggctggttcg gtctagtcgt atagaacggg ttgagttctt tttactgagt 1501 cctttcgagg ctggaagtgt ttcttacagt ttccttttcc gaagggaaag tcaagtagta 1561 taaactagta ctaagtccaa gaagaagggg tgaaaagtga ggagggcgat tagcccgatc 1621 cacttgaggg gaggaaggcc ttccttccct actaatagta ccaattcata attcctattc 1681 gtaataaatt catatttatt tggaaaaata tgaaaactaa ttatctttat tctagttccc 1741 taaaaccctt tccgtgccac tcctttctgt tccactcatc caacattgtt ggatgccaaa 1801 gactttgacg aagactttcg tacctgtatt tctgtccggt atgacgtcgc tttcgtgtcg 1861 gcatagcgtg ccccagttgc cacagttgct aggctgttca gagtcctctc ggaccccggc 1921 taggcatcct cccggtaatg ccaggggctg cttcaagaaa cgggtgatgt gggttccaat 1981 cattcttggt ttggttcctc ggcaccaagc caatcccatt cctttccgct ggtgaccgcc 2041 ttcttgttgc ctaccttttt aacccctgta gtaaccccaa gtccaagtac aaggcatacg 2101 tgctccttta atatgcttgc tcctcttttc ttgaa // LOCUS TOBCPNPLR 93 bp ds-DNA ORG 15-MAR-1990 DEFINITION Tobacco chloroplast 23S rDNA and lincomycin resistant mutants. ACCESSION X13526 KEYWORDS 23S ribosomal RNA; antibiotic resistance; lincomycin resistance; ribosomal RNA. SOURCE tobacco (Chloroplast Nicotiana plumbaginifolia). ORGANISM Chloroplast Nicotiana plumbaginifolia Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae; Nicotiana plumbaginifolia. REFERENCE 1 (bases 1 to 93) AUTHORS Cseploe,A., Etzold,T., Schell,J. and Schreier,P.H. TITLE Point mutations in the 23 S rRNA genes of four lincomycin resistant Nicotiana plumbaginifolia mutants could provide new selectable markers for chloroplast transformation JOURNAL Mol. Gen. Genet. 214, 295-299 (1988) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description MUTANT 22 22 g is a in LR400 MUTANT 48 48 a is g in LR415 MUTANT 49 49 a is g in LR421 MUTANT 49 49 a is g in LR446 FEATURES from to/span description rRNA < 1 > 93 23S ribosomal RNA BASE COUNT 23 a 22 c 25 g 23 t ORIGIN 1 tgaaatagac atgtctgtga agatgcggac tacctgcacc tggacagaaa gaccctatga 61 agcttcactg ttccctggga ttggctttgg gcc // LOCUS VFACPTG 1409 bp ds-DNA ORG 15-SEP-1989 DEFINITION Broad bean (V.faba) chloroplast Leu-tRNA-CAA, Leu-tRNA-UAA and Phe-tRNA genes. ACCESSION M23771 KEYWORDS transfer RNA-Leu; transfer RNA-Phe. SOURCE Broad bean (V.faba) chloroplast DNA, clone Bam6a. ORGANISM Chloroplast Vicia faba Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea; Vicia faba. REFERENCE 1 (bases 1 to 1409) AUTHORS Bonnard,G., Weil,J.-H. and Steinmetz,A. TITLE The intergenic region between the Vicia faba chloroplast Leu-tRNA-CAA and Leu-tRNA-UAA genes contains a partial copy of the split Leu-tRNA-UAA gene JOURNAL Curr. Genet. 9, 417-422 (1985) STANDARD simple staff_entry FEATURES from to/span description tRNA 15 95 Leu-tRNA-CAA tRNA 539 573 Leu-tRNA-TAA, exon 1 1025 1074 Leu-tRNA-TAA, exon 2 tRNA 1185 1257 Phe-tRNA BASE COUNT 506 a 240 c 238 g 425 t ORIGIN 1 bp upstream of BglII site. 1 agatctcaat attatgcctt gaagaggact cgaacctcca cgctgtttag cacgagattt 61 tgagtctcgc gtgtctacca tttcaccacc aaggcatcct taaagggaat catattctct 121 gaatatgata tctatcttat gtcatggttc aatattatta tactccaatt ctaatccaat 181 tgcaacatag aatacaacat ctcaaataga atacaacatc tcaaatagaa tacggtacga 241 tataatacag tatctactat ataatacagc gagaatagaa ttctttctcg actaaataaa 301 tatagcttct agatgaacga aaagctagtc aaagggtcga ttcaaaattg gtagacgcta 361 cggacttaat tgtattgagc tttggtatgg aaacatagta atgttaagtg aaaacataac 421 ataatataat acagcatcta ctatataata caacatgtac tatataatac agcgagaaga 481 gaattctttc tcgactaaat aaatatagct tctagatgaa cgaaaagcaa gtcaaacggg 541 ggatatggcg aaattggtag acgctacgga cttaattgta ttgagccttg gtatggaaac 601 atattaagtg aaaactttca aattcagaga aaccctagaa ttttaaatgg gcaatcctga 661 gccaaatcct tctttccgaa aacaaaaaac taaaagttca gaaaaaaagg ataggtgcag 721 agactcaatg gaagctattc taacaaatgg agttgacaac attcaattga ttaatgaaga 781 tttctaactt ctatttgtaa attgtaacta ttttttttag ttttgattcg atcacaattg 841 aaacgaaaca ttagaatcaa atcaattcca actggaagaa aaaatggtga atattcattg 901 atcaaatcag tcactccacc aaaatatgat ggatcttttg aataactgat taatcagacg 961 agaataaaga tagagtccca ttctacatgt caataccgac atcaatgaaa gttttagtaa 1021 tcggaaaatc cgtcgacttt aaaaatcgtg agggttcaag tccctctatc cccaaaagcc 1081 gattgaattt gctcattttt atggcccttt cccttttttt tattgagttg acatagactc 1141 aagtaatttc cctcaaatta ggatggtgcg tcaagatttt aatggtcggg atagctcagc 1201 cggtagagca aaggactgaa aatccttgtg tcaccagttc aaatctggtt cccggcgact 1261 cattatgtat gagtatctat tcccatattt cttttcaaaa atcaaatatt gggaggaata 1321 gatacatatt gattagtggt cttaataatt atacataatt tatctaggcg tacagagata 1381 tgctctttct agatgaaaaa ggaatcgat // LOCUS WHTCPTAREP 781 bp ds-DNA ORG 15-MAR-1990 DEFINITION Wheat chloroplast dispersed repeat (9) DNA for nonfunctional rpl23' homologue segment (located in large single-copy region LSCR). ACCESSION X12849 KEYWORDS pseudogene; rbcL gene; repetitive sequence; ribosomal protein; ribosomal protein L23; ribulose bisphosphate carboxylase. SOURCE wheat (Chloroplast Triticum aestivum). ORGANISM Chloroplast Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Triticum aestivum. REFERENCE 1 (bases 1 to 781) AUTHORS Bowman,C.M., Barker,R.F. and Dyer,T.A. TITLE In wheat ctDNA, segments of ribosomal protein genes are dispersed repeats, probably conserved by nonreciprocal recombination JOURNAL Curr. Genet. 14, 127-136 (1988) STANDARD simple automatic COMMENT *source: clone=pTacE8. see x12850 for repeat 9 in functional rpl23 gene; see x12848 for maize chloroplast repeat 9 in rpl23' homologue EMBL features not translated to GenBank features: key from to description INVREP 214 228 inverted repeat A INVREP 234 249 inverted repeat A' RPT 268 276 direct repeat 1 RPT 297 301 direct repeat 2 RPT 302 306 direct repeat 2 RPT 313 318 direct repeat 3 SITE 377 678 rpl23' nonfunctional segment INVREP 377 384 inverted repeat B INVREP 385 392 inverted repeat B' RPT 606 614 direct repeat 1 RPT 631 635 direct repeat 2 RPT 636 640 direct repeat 2 RPT 645 650 direct repeat 3 FEATURES from to/span description pept < 1 93 rbcL gene product (AA at 1) BASE COUNT 302 a 105 c 141 g 233 t ORIGIN 1 gcttgcaaat ggagtcctga actagccgca gcttgtgaag tatggaaggc gatcaaattc 61 gagttcgagc cggtagatac tattgataag tagataaaac taaagataaa gaagagaagg 121 tataaataaa aaagaaacga aataaaaaga gaaaaaaata agttatgaaa tgcagtaatc 181 ttctttattc ttctaattga ttgcaattaa actcggctca atcttttttt tctaaaaaag 241 attgagccga ataaaaatag atcatgatat gatcatgaga cttgacaaat cgagattcgt 301 ctattctata tatctagaat atatatatta aggtataata caataatgaa atacaaataa 361 aataataaaa tatagtatta tcatatgata attgaatcaa atacgcagta tttactgaaa 421 taagtcttcg tttattgggt ttgaatcaat atacttttaa tgtcgaatgg gattcactaa 481 gactgaaata aagcattggg tcgtactctt ctttggtgtt aaggtggtag ctgtgtatag 541 ccatcgacta cctagaaagg taaaagaata ggacctattc tgggccatac aatgcattac 601 agacgtatga tcattaccct tcaaccgggt tattctattc cacttctaga tagagaaaaa 661 aactaaagga gaatgaatga aaaaagacat agtttggaag ttagaccttt ttataggact 721 ctctttcaat ttcaaaaaag aggacgtttg aaacttttaa caggcgtaat cgtgagtcaa 781 c //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS ACYFIMBA 1850 bp ds-DNA BCT 13-FEB-1990 DEFINITION A.viscosus fimbrial structural protein type 1 subunit gene, complete cds. ACCESSION M32067 KEYWORDS fimbrial structural protein. SOURCE A.viscosus (strain T14V) DNA, clone MY3833. ORGANISM Actinomyces viscosus Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 31 to 1850) AUTHORS Yeung,M.K. and Cisar,J.O. TITLE Sequence homology between the subunits of two immunologically and functionally distinct types of Actinomyces spp JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 30) AUTHORS Yeung,M.K. JOURNAL Unpublished (1990) Univ. of Texas, San Antonio, Texas 78284 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.K.Yeung, 12-FEB-1990. FEATURES from to/span description pept 124 1725 fimbrial structural protein type 1 subunit precursor sigp 124 213 fimbrial structural protein type 1 subunit signal peptide matp 214 1722 fimbrial structural protein type 1 subunit BASE COUNT 395 a 714 c 515 g 226 t ORIGIN 1 caagcttgca tgcctgcagg tcgactctag aggaacgcga gaagaaccca cgactccgct 61 ccgtcgcgcg cgggtcaccc cttctatccc tcattcatcc caacaacaca ggagagaaca 121 gcaatgcact ccctcaacac gcgccggggc ctcggcctcg ccgctgcgat gacgctcgcc 181 gccggcgcgc tcgtcgcccc gaccggcgcg gcggcacccg ctgacccgaa cggctccacc 241 atcgaccccg acgcggccac cacgctgacc gtccacaagt gcgagcagac cgacaccaac 301 ggcgtcaagg aggggaccgg taacgaggac ccccaggccg agtgcaagcc cgtctccgac 361 gtcgagttca ccatcaccaa gctgaacgtg gacctgacga cctacgacgg ctggaagacc 421 ctcgcggacc tcaagggtga cgtggtcaag gccggtgccc tcaagagcac caccgtccag 481 aagatcacca ccggtgccaa cggtctggcc tccttcaccg acgcccagac cgaggtcggc 541 gcctacctcg tcagcgagac gcgcaccccc gacaaggtca tcccggccga ggacttcgtc 601 gtcaccctgc cgatgaccaa cccgcaggac accgccaagt ggaactacaa cgtccacgtc 661 taccccaaga acaccctctc cggtgtggac aagcaggtga ccgacaagcc ggcccccggc 721 tccgggcgcg acatcaccta caccatcacc acctccatcc cgaaggtgga ctaccccggc 781 ggtgcgcgca tcaagcgcta cgaggtcgtt gaccgcctcg acaagcgcat caagaaggaa 841 gccctgaccc cggtcgtcaa gatcgtcggt cagaacgagg tgaccctggc ggagaccacc 901 gactacaccc tcatcactgc cgagggtaag gaccacaact gggccaccat ccagctcacc 961 gaggagggcc gccgcaaggc ctccgaggcg cgctacaacg gcaacggcga gaccaagctc 1021 caggtgaccc tgaacgccaa gttcgacgcc gccgtcaacc tcgagggcga cctgtccaac 1081 accgcgggcc tcatccccaa cgacagcccc aacttcacct gggacccgaa caaccccggc 1141 accaccacgg acatccccgg catccccacc acccccgtgc tctccaagta cggcaaggtg 1201 gttctcacca agaccggtac ggacgacctg gccgacaaga ccaagtacaa cggcgcccag 1261 ttccaggtct acgagtgcac caagaccgcc agcggtgcca cgctgcgtga ctccgacccc 1321 agcacccaga ccgtcgaccc gctgaccatc ggtggggaga agaccttcac caccgccggc 1381 cagggcaccg tggagatcaa ctacctgcgc gccaacgact acgtcaacgg tgcgaagaag 1441 gaccagctga ccgacgagga ctactactgc ctcgtggaga ccaaggcccc cgagggctac 1501 aacctccagg ccgacccgct ccccttccgg gtcctggctg agaaggccga gaagaaggcc 1561 gcgaccgagg tgaccgtcac cgacatcccg aagaacgccg gcttccgcct gccgctgacc 1621 ggtgccaacg gcgtcatctt cctgaccatc gccggcgccc tgctggtggc cggtggcgcg 1681 gtggtcgcct acgccaacaa gcgccgtcac gttgccaagc actgacagcc tctggcagcc 1741 actggcagcc actgaggtga ccggccccag ccggccgtca tcacgctgac tgcttccttg 1801 tgggggaggc gggagaaccc gtctccccca cacgggcgtc gcagggatcc //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS BOVTCRA19 123 bp ss-mRNA MAM 15-MAR-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90028 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA41. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 123) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Department of Veterinary Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 FEATURES from to/span description pept < 1 > 123 T-cell receptor alpha chain, variable joining region. (AA at 1) BASE COUNT 23 a 34 c 32 g 34 t ORIGIN 1 tttttccttt ttgcggacca cagactcagg cacttacttc tgtgctgtgc agctggaagg 61 ggggtgctgc caaccagctc atctttggga ctggcacgct gctttctgtc aagccaagca 121 caa // LOCUS CATSIS1 1125 bp ds-DNA MAM 15-SEP-1989 DEFINITION Feline c-sis proto-oncogene, segment 1. ACCESSION M25353 KEYWORDS proto-oncogene. SEGMENT 1 of 7 SOURCE Feline placenta DNA, clone pPHS1. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 1125) AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A., Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Genetic organization of the c-sis transcription unit JOURNAL Nucleic Acids Res. 15, 959-970 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 1043 + 1105 c-sis proto-oncogene pre-msg 51 > 1125 c-sis, mRNA and introns IVS 1106 > 1125 c-sis, intron A BASE COUNT 171 a 430 c 362 g 162 t ORIGIN 780 bp upstream of BamHI site. 1 tagcactctc ccttctcctt tataaaggcc ggaacagctg aaagggtggc aacttctcct 61 cctgccgccg ggagcagccc gcctgtctcc ccgcgcgccc gcagcctccc ccgctgcctc 121 cctgagggct cccctctggc cgccagcgcc catctttcat ttccgagatt gcgatatttt 181 gcgcacacac gcatacacac acgcgcaaaa agggggggaa aaaagggccc accctccagc 241 ctcgctgcaa agagaagccg gagcagccgc agctcacagc ccagaggacg cccagagcgg 301 cagcgcgcag acagacggac cgacggactc cgccgcgtcc acctgtctgc ccggcccggc 361 actgcgcgca cgggcacggc cgagcgcgga gcagccgtgc ccgccgcccg ggcccgccaa 421 ggcgcacacg ctccggcccc ccccacccgg cccgggcggg agtttgcacc tctccctgcc 481 cgggtactcg ggccgccgct gcaaagccaa ctttggaaaa agtttttggg gggagacttt 541 ggccttgagg tgcccggctc tgcactttcc gacttcgggg gcctttctag aaaatgttgc 601 aaaaaagcta agccagcggg cagaggaaaa cgcctatagc cggcgagtga agacgaacca 661 tcggctgccg tgttcttttt cctcttggag gttggagtcc cctgggcgcc cccacacggc 721 tagacgcctc ggctggttcg cgacgcagac ccccggccgt ggatgctcgc tcgggctcgg 781 gatccgccca ggtagcggcc ggaccctggt cccgcgccca ggccctcccc agccccccaa 841 cgacggagcc ggggccgggg gcggcggcgc cgggggccat gcgggtgagc cgggcggcgc 901 ctgcagcggc ctgagcgcct gatcgccgca gacccgagcc gagcccacct ccttccccag 961 cccccgcccc ccaccctggc cgcgggggcg gcgcgctcgg tccacgcgtc cggggccccg 1021 cggggccggg cccggagtcg gcatgaatcg ctgctgggcg ctcttcctgt ctctctgctg 1081 ctacctgcgt ctggtcagcg ccgaggtgag ttgcgacggc ggcgg // LOCUS CATSIS2 137 bp ds-DNA MAM 15-SEP-1989 DEFINITION Feline c-sis proto-oncogene, segment 2. ACCESSION M25354 KEYWORDS proto-oncogene. SEGMENT 2 of 7 SOURCE Feline placenta DNA, clone pPHS1. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 137) AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A., Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Genetic organization of the c-sis transcription unit JOURNAL Nucleic Acids Res. 15, 959-970 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 117 c-sis proto-oncogene pre-msg < 1 > 137 c-sis, mRNA and introns IVS < 1 20 c-sis, intron A IVS 118 > 137 c-sis, intron B BASE COUNT 29 a 44 c 34 g 30 t ORIGIN About 7.5 kb after segment 1. 1 ccatggtgtc tccctggcag ggggacccca ttcccgagga actctacaag atgctgagtg 61 accactcaat ccgctccttc gatgacctcc agcgcctgct acacggagac tccgtaggta 121 aattgaacct tgttcgg // LOCUS CATSIS3 142 bp ds-DNA MAM 15-SEP-1989 DEFINITION Feline c-sis proto-oncogene, segment 3. ACCESSION M25355 KEYWORDS proto-oncogene. SEGMENT 3 of 7 SOURCE Feline placenta DNA, clone pPHS1. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 142) AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A., Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Genetic organization of the c-sis transcription unit JOURNAL Nucleic Acids Res. 15, 959-970 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 122 c-sis proto-oncogene pre-msg < 1 > 142 c-sis, mRNA and introns IVS < 1 20 c-sis, intron B IVS 123 > 142 c-sis, intron C BASE COUNT 31 a 35 c 45 g 31 t ORIGIN About 2.4 kb after segment 2. 1 tgccttgcct tgccttgcag acgaagacag agccgagttg gacctgaatt cgactcgatc 61 ccattgtgga ggcgagctgg agagcttatc ccgagggaga aggagcctag gtgaggctgc 121 aggtaagaat cctcctttct cg // LOCUS CATSIS4 246 bp ds-DNA MAM 15-SEP-1989 DEFINITION Feline c-sis proto-oncogene, segment 4. ACCESSION M25356 KEYWORDS proto-oncogene. SEGMENT 4 of 7 SOURCE Feline placenta DNA, clone pPHS1. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 246) AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A., Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Genetic organization of the c-sis transcription unit JOURNAL Nucleic Acids Res. 15, 959-970 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 226 c-sis proto-oncogene pre-msg < 1 > 246 c-sis, mRNA and introns IVS < 1 20 c-sis, intron C IVS 227 > 246 c-sis, intron D BASE COUNT 37 a 90 c 80 g 39 t ORIGIN About 1.5 kb after segment 3. 1 tgaggccccc ctcccttcag gttccccgac ggtcgccgag ccagccatga tcgccgagtg 61 caagacacgc acggaggtgt tcgaggtgtc ccggcgcctc atagaccgca ccaacgccaa 121 cttcctggtg tggccgccct gcgtggaggt gcagcgctgc tccggctgct gtaacaaccg 181 caacgtgcag tgccgtccca cccaggtgca gctgcggctt gtccaggtgc gcgggggggc 241 ccgacc // LOCUS CATSIS5 185 bp ds-DNA MAM 15-SEP-1989 DEFINITION Feline c-sis proto-oncogene, segment 5. ACCESSION M25357 KEYWORDS proto-oncogene. SEGMENT 5 of 7 SOURCE Feline placenta DNA, clone pPHS1. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 185) AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A., Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Genetic organization of the c-sis transcription unit JOURNAL Nucleic Acids Res. 15, 959-970 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 165 c-sis proto-oncogene pre-msg < 1 > 185 c-sis, mRNA and introns IVS < 1 20 c-sis, intron D IVS 166 > 185 c-sis, intron E BASE COUNT 40 a 47 c 67 g 31 t ORIGIN About 1.4 kb after segment 4. 1 tcgatggttc tgtcctccag gtgagaaaaa tcgagattgt gcggaagagg ccagtcttta 61 agaaggccac ggtgaccctg gaggaccacc tggcgtgcaa gtgtgagacg gtggtggctg 121 cacgacccgt gacccgaagc ccggggagct cccaagagca gcgaggtaag cgctggtccg 181 gggtc // LOCUS CATSIS6 187 bp ds-DNA MAM 15-SEP-1989 DEFINITION Feline c-sis proto-oncogene, segment 6. ACCESSION M25358 KEYWORDS proto-oncogene. SEGMENT 6 of 7 SOURCE Feline placenta DNA, clone pPHS1. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 187) AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A., Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Genetic organization of the c-sis transcription unit JOURNAL Nucleic Acids Res. 15, 959-970 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 40 c-sis proto-oncogene pre-msg < 1 > 187 c-sis, mRNA and introns IVS < 1 20 c-sis, intron E IVS 168 > 187 c-sis, intron F BASE COUNT 44 a 57 c 64 g 22 t ORIGIN About 2.1 kb after segment 6. 1 tctttctctc gagcagccag gacaccccag actcgggtga ccattcggac ggtgcgagtc 61 cgccggcccc ccaaggggaa gcaccagaag ttcaagcaca cgcatgacaa gaaggcactg 121 aaggagaccc tcggagccta ggggcatctg caggagcgtg cgggcaggtg aggccaggcg 181 gggaacc // LOCUS CATSIS7 1643 bp ds-DNA MAM 15-SEP-1989 DEFINITION Feline c-sis proto-oncogene, segment 7. ACCESSION M25359 KEYWORDS proto-oncogene. SEGMENT 7 of 7 SOURCE Feline placenta DNA, clone pPHS1. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 1643) AUTHORS Van den Ouweland,A.M.W., Groningen,J.J.M., Schalken,J.A., Van Neck,H.W., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Genetic organization of the c-sis transcription unit JOURNAL Nucleic Acids Res. 15, 959-970 (1987) STANDARD simple staff_entry FEATURES from to/span description pre-msg < 1 1623 c-sis proto-oncogene, mRNA and introns IVS < 1 20 c-sis proto-oncogene, intron F BASE COUNT 310 a 513 c 437 g 383 t ORIGIN About 300 bp after segment 6. 1 tccctctggt tggcctacag ggttatttaa tacggtattt gctgtatcgc ccccatgggg 61 tccttggagt gataatattg ttcccctcgt ccgtctgtct cgatgcctga ttcggacggc 121 caatggtgct tccccccctc ccacgcgcct gtccacccac cagcgggtcc cctcattggc 181 ctccagcgcc ttgcccggga gctcgagaag gacacgaagg acctgaactc cactgctgtc 241 ttcctccgct gaccccgaga acgtggcata gaagtgtaag agagacccat gggatcgccg 301 ttcggggacc ggaggagctt cccttcccca gctcctggcc cgggtcacac ctgagcgccg 361 tggactggcc tgaaaccctg cacgtggccc gcccggtccc aggacccctg gcccgctctg 421 aagggagcac ccccaggcag gccagggtgc ctcgtcctcc tgtggctggg accacgcgtt 481 ggaccgtaga ctggagagaa atcctcccac ggcgcccagg cccagtcact tctttctccc 541 cggtcacctc tgcttatagt ggcttccttt cgttttctat atatgaaatc ttcgaagatg 601 tggactcctc tgggtgggcg tggcccgagt acccggcagc caagtgcccc gtcagatggg 661 ttagagatga agtttgctct ggagccacca tggatggtga tctgggcgct cactgcctcc 721 ttccactccc ctgccttcac ctcctctgtt tcatctctct acctccaccc tgcgtcttcc 781 tcttgtcccg gccttcagtc cgctccaccg agggactcct tggaccccgt gctgagaccg 841 taaataaccc cactcactgc tccattaggc cgaagaccag gggtcagggc agcagggggc 901 ctgcgcttcg tatcccagcc cggccgagac tgccgtgtag gttcgtgcag ggtgtgcact 961 gtgcaaggca tgcgtgcagg ggtcgttgtt catgccgtag aacatggccc gctggtgtat 1021 ttgttaggac agttttctgg cagatggagc taaaatgtct tgagcaaagg gatcctgttc 1081 ctaatcgaca tgaaggttcc atgtggactg gctgtgtccc gacccagcca atggcttggg 1141 gtagttggac aggagagtga gcccacgttg ggtggggacg ccacgcgttg tccagctggc 1201 ctctattttc ctgagtcctc agctcaagca gttccccttc caggcgggcg tgaaaaaccc 1261 aagagagaat ctccaaggga gggggcagcc ctcgctctcc ccgggcacgc taccccccag 1321 agtagattct tttgagatcc gcccgtggtg gctcctccca ggaaaccagc tcacgggctg 1381 ggagtggggg agaaaaggga aaagatcccc aggggtccct ggggtggggt ctgagctccc 1441 acctccctcc cctccacccc actgcacttt cccccttacc ccccccctcc aaaatctgct 1501 ctcttcagtt tgtaaagttg gtgattatat ttttgggggc tttcctttta ttttttaaat 1561 gtaaatttat ttatattccg tatttaaagt tgtaaaaaaa aaaaaataac cacaaacaaa 1621 accaaatgaa tgaatccacc ggc // LOCUS PIGMDHA 1080 bp ss-mRNA MAM 15-MAR-1990 DEFINITION Pig malate dehydrogenase mRNA, 3' end. ACCESSION M29463 KEYWORDS cytosolic malate dehydrogenase; malate dehydrogenase. SOURCE Pig liver, cDNA to mRNA, clone ppcMDH-1. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1080) AUTHORS Joh,T., Takeshima,H., Tsuzuki,T., Setoyama,C., Shimada,K., Tanase,S., Kuramitsu,S., Kagamiyama,H. and Morino,Y. TITLE Cloning and sequence analysis of cDNAs encoding mammalian cytosolic malate dehydrogenase JOURNAL J. Biol. Chem. 262, 15127-15131 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 870 malate dehydrogenase (AA at 1; EC 1.1.1.37) BASE COUNT 320 a 232 c 240 g 288 t ORIGIN 1 atgatgggtg tcctggatgg tgtcctgatg gagctgcaag actgtgccct tcccctcctg 61 aaagatgtca ttgcaacaga taaagaagag attgccttca aagacctgga cgttgccatt 121 ctcgtgggtt ccatgccaag aagggatggc atggagagga aagatttact caaagcaaat 181 gtgaaaatct tcaaatgcca gggtgctgcc ttggacaaat atgctaagaa gtcagttaag 241 gttatcgtgg tgggaaaccc agccaatacc aactgcctga ctgcctccaa gtcggctcca 301 tccattccca aggagaactt cagctgcttg actcgtttgg atcacaaccg agctaaagca 361 cagattgctc tcaaacttgg tgtgacttct gatgatgtca agaatgtcat catctgggga 421 aaccattcct caactcagta tccagatgtc aaccatgcca aggtgaaact gcaggcaaag 481 gaagttggtg tttatgaagc tgtgaaggat gacagctggc tcaagggaga gttcatcacg 541 actgtgcagc agcgcggtgc tgctgtcatc aaggctcaga aactgtccag tgcaatgtct 601 gcggcaaaag ccatttgtga ccacgtcaga gacatctggt ttggaacccc agagggagag 661 tttgtgtcca tgggcattat ctctgatggc aactcctatg gtgttcctga tgatctgctc 721 tactcattcc ctgttacaat caaggataag acctggaaaa ttgttgaagg tctccctatt 781 aatgatttct ctcgtgagaa gatggacctc actgcaaagg aactagcaga agaaaaagaa 841 actgcatttg aatttctttc ctctgcctga ttaaacaatc atttcaatgt actaaatgcc 901 ccaaagctga aaaatctaaa tgtccttttt gaccctagta ccaaataata ataatgctaa 961 ccttaaatta ctagtgaaaa ataacacact ttaaagattg tgtgcttctt ggtacaaatt 1021 tgtgacagtt tatcatcatg ctgttagtgc cacattctaa aataaatata tatccaaatg // LOCUS PIGUFG 996 bp ds-DNA MAM 15-MAR-1990 DEFINITION Pig uteroferrin gene, 5' end. ACCESSION M30283 KEYWORDS iron-transport protein; transport protein; uteroferrin. SOURCE Pig adult liver DNA. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 996) AUTHORS Simmen,R.C.M., Srinivas,V. and Roberts,R.M. TITLE cDNA sequence, gene organization, and progesterone induction of mRNA for uteroferrin, a porcine uterine iron transport protein JOURNAL DNA 8, 543-554 (1989) STANDARD simple staff_review COMMENT See GenBank entry M30284 for sequence of uteroferrin mRNA. FEATURES from to/span description pept 887 > 996 uteroferrin precursor, exon 2 (first expressed exon) sigp 887 946 uteroferrin signal peptide matp 947 > 996 uteroferrin pre-msg 651 > 996 UF mRNA and introns IVS 745 886 UF intron A BASE COUNT 239 a 333 c 265 g 159 t ORIGIN 1 aatgggggtc aggcaggact tcctggaaaa agggagcccc agctgtttgg gggaggggag 61 gtgaccctgg gacactggct gggatgttgt caaacacagc tcacacacag gacacacagc 121 tcagctgaga aacacaacag agacacacaa cttgggcaca aaacaggcac acacaagtcg 181 ccttcccccc tacacacagt tccacttgag gactcaggtg cacgtcacag acatgccatg 241 caggccttgc cagctgagta tacacagcca ctggaacact ggctcccagg ctcacaacca 301 acatgtgact gaggcaggca cgcttggcta cacgcagaaa acagctcaga cacacccaga 361 cacaggcaga caaacaccac ctctgagaac accgacacac aggaggtgcc tggcccgggc 421 attcggacac agctgcctac acagcacaca cgggccccga gacacgaact gctcctgtgg 481 gcagacgcag gggctgtggg gcaggcgcag ctgcactggt gcaccagaca gccacctcaa 541 atcccgtgtc cagcgctggg gaactaactc caaggctcac atgacccgag gggagggctt 601 ttggaacagt tggcggagaa actgcatcat cctcactatg atcatttccg acccagggaa 661 taaaggctca gtgaccgcca gttctgcact ggagctcaac ctgcctctct gagcccccag 721 cgaccggctc aagcctcccc tagggtgagt gctgggccct gcatgaccac cctttcccct 781 aagccccatg gctggaccct atcctcccac taaccctggg gataaggggg ggtgggtctc 841 gggtgcctgc tgcccacatc ccagacactt gtctctcctc tcccagatgg acacgtggac 901 ggtactgctc atcctgcaag cctccctggt gctccccggg gccgtgggca ccagaaccaa 961 caccaggact gcccccaccc ccatcctgcg ttttgt // LOCUS PIGUFMR 1424 bp ss-mRNA MAM 15-MAR-1990 DEFINITION Pig uteroferrin mRNA, complete cds. ACCESSION M30284 KEYWORDS iron-transport protein; transport protein; uteroferrin. SOURCE Pig (60-day pregnant) endometrial, cDNA to mRNA. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1424) AUTHORS Simmen,R.C.M., Srinivas,V. and Roberts,R.M. TITLE cDNA sequence, gene organization, and progesterone induction of mRNA for uteroferrin, a porcine uterine iron transport protein JOURNAL DNA 8, 543-554 (1989) STANDARD simple staff_review COMMENT See GenBank entry M30283 for sequence of the 5' end of the uteroferrin gene. FEATURES from to/span description pept 95 1111 uteroferrin precursor sigp 95 154 uteroferrin signal peptide matp 155 1108 uteroferrin mRNA 1 1424 UF mRNA BASE COUNT 299 a 446 c 411 g 268 t ORIGIN 1 acccagggaa taaaggctca gtgaccgcca gttctgcact ggagctcaac ctgcctctct 61 gagcccccag cgaccggctc aagcctcccc taggatggac acgtggacgg tactgctcat 121 cctgcaagcc tccctggtgc tccccggggc cgtgggcacc agaaccaaca ccaggactgc 181 ccccaccccc atcctgcgtt ttgtggccgt gggggactgg ggaggggtcc ccaatgcccc 241 gttccacaca gcccgggaaa tggccaatgc caaggcgatt gccacaactg tgaagaccct 301 gggcgcagac ttcatcctgt ccctggggga caatttctac ttcactgggg tgcatgatgc 361 caaagacaag aggttccagg agacctttga ggatgtgttt tctgacccct ccctccgcaa 421 tgtgccctgg cacgtgctgg ctggcaacca cgaccatctg gggaacgtct cggcacagat 481 agcctattct aagatctcca agcgctggaa cttccccagc ccttactacc gcctgcgctt 541 caagatccca cggtccaatg tgtctgtggc catcttcatg ctggacacgg tgacgctgtg 601 tggcaactcg gacgactttg tcagccagca accggagcgc aacctggcgc tggcccgcac 661 acagctggcc tggatcaaga agcagctggc ggcagcaaag gaggactatg tgctggtggc 721 cggccactat cctgtgtggt ccatcgccga gcatggaccc acccactgcc tagtcaagca 781 gttgctgcca ctgctgacca cacacaaggt cactgcctac ctgtgcggcc atgaccacaa 841 cctgcagtac cttcaggatg agaatggctt gggctttgtg ctgagcgggg ccgggaactt 901 catggacccc tccaagaagc acctgcgcaa ggtccccaac ggctacctgc gcttccactt 961 tggggctgag aactcactag gtggcttcgc ctacgtggag atcaccccca aagagatgag 1021 cgtcacttac attgaagcct cgggcaagtc tctcttcaag accaagttgc caaggcgagc 1081 caggtctgag caccagcacc gacgggcctg agaagtgcct ccctgctggc gggtgggtgg 1141 gtggcccact gggccatccc ccccagccct taagcaggct ttcctgggta cctgtggtgc 1201 tacggcagag cgggaaggag atggacagat gaggaaatgt gatgccaagt ggcccttgtg 1261 acaaggatgc ccacagaggt gaaagaagca tggacacatg taccggccag agtgccaagg 1321 ccccatggcc gggctcgccc agcctgagct tctgggcaat ggggggtgag gaagaaagct 1381 ctctcctggg tcaggcatcc ttctgtcact gccaaatagt taat // LOCUS RABBGLOB 44594 bp ds-DNA MAM 15-MAR-1990 DEFINITION Rabbit beta-like globin gene cluster encoding the epsilon, gamma, delta (pseudogene) and beta globin polypeptides, complete cds. ACCESSION M18818 X07786 X07787 X07788 KEYWORDS beta-globin; delta globin; epsilon globin; gamma globin; globin gene cluster; hemoglobin alpha-2 beta-2; pseudogene. SOURCE Rabbit (strain New Zealand White, haplotype short) liver DNA, clones lambda R beta G[3,5,8]. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 44594) AUTHORS Margot,J., Demers,G.W. and Hardison,R. TITLE Complete nucleotide sequence of the rabbit beta-like globin gene cluster: analysis of intergenic sequences and comparison with the human beta-like globin gene cluster JOURNAL J. Mol. Biol. (1988) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.Hardison, 02-MAY-1988. The epsilon and gamma genes are only transcribed in embryonic red cells derived from the yolk-sac. Both genes encode abundant amounts of embryonic globin polypeptides. The delta globin is a pseudogene, and is not transcribed in any erythroid tissue. It is the product of a gene conversion between delta and beta, so that the 5' end of the delta globin is derived from the beta gene. Finally, the beta gene is transcribed in red cells derived from both fetal liver and adult bone marrow. FEATURES from to/span description pept 5999 6090 epsilon globin, exon 1 6212 6434 epsilon globin, exon 2 7237 7365 epsilon globin, exon 3 pre-msg 5942 > 7489 E-G mRNA + introns IVS 6091 6211 E-G, intron A IVS 6435 7236 E-G, intron B pept 15525 15616 gamma globin, exon 1 15741 15963 gamma globin, exon 2 16781 16909 gamma globin, exon 3 pre-msg 15463 > 16997 G-G mRNA + introns IVS 15617 15740 G-G, intron A IVS 15964 16780 G-G, intron B pept.ps 22252 22342 delta pseudo-globin, exon 1 22445 22665 delta pseudo-globin, exon 2 23421 23548 delta pseudo-globin, exon 3 pre-msg 22204 > 23676 D-pG mRNA + introns IVS 22343 22444 D-pG, intron A IVS 22666 23420 D-pG, intron B (no splice consensus) pept 30970 31061 beta globin, exon 1 31188 31410 beta globin, exon 2 31984 32112 beta globin, exon 3 pre-msg 30917 > 32204 B-G mRNA + introns rpt 1201 2586 direct repeat L10c-1 rpt 2817 3137 direct repeat C1 rpt 3553 3827 direct repeat C2 rpt 8895 9206 direct repeat C3 rpt 10039 10390 direct repeat C4 rpt 10444 10808 direct repeat C5 rpt 10866 11186 direct repeat C6 rpt 11815 12184 direct repeat C7 rpt 18604 18936 direct repeat C8 rpt 18937 19059 direct repeat C9 (approx. 5' and 3' ends) rpt 19162 19470 direct repeat C10 rpt 20027 20379 direct repeat C11 rpt 20382 20743 direct repeat C12 rpt 24175 24573 direct repeat C13 rpt 25864 26076 direct repeat L10c-2 rpt 26206 26587 direct repeat C14 rpt 28578 29020 direct repeat L10c-3 rpt 32898 33190 direct repeat C15 rpt 33724 38097 direct repeat L10c-4 rpt 38098 44594 direct repeat L10c-5 BASE COUNT 12729 a 8346 c 9035 g 14484 t ORIGIN 1 agctacggcg ctggcttgct agacatttta atgatagtca catagctgta acacacatac 61 atccatatag atgcagacaa agacatactc tgacacacac agatagatga tatagatggt 121 ccagaatgta ttttcagagg atttctgagc aattaaccac acaggaccct taaatcctaa 181 cccacaccac tttgccttca gtatgctacc tgtgtgggaa ccagggatgg ggacacagaa 241 gtcagaagca ctccaggttt ctgttgttag ccagggccct cccttaccat agcttcgtgt 301 cacactgagc cttatctata cccacagttt ctgaatccac ctggctatca ccagctcagt 361 ccttaggaga agggaaaaca aacactgagt ttgtctctgc ttataaaaac aaagcaaaat 421 aaaattaaat ttcaatccca tcccaaacaa atgaataact aagcaaaaac aaaaaggaga 481 agtggaaggg aagtggagga ggaagaagaa gagggtaaga aaacagttgc agaaagaggc 541 tggagatgtg tgcgtctgaa ggagggaaga gccccagtag cctcacaccg gcctagggac 601 ctgcggaagg agcagcttct ttcctctaca aaatgctcca gatttgataa gctaggaggt 661 ggggtaattt actaagagaa gggacacatt tggggctttt gttattcaag acttaaaaat 721 tttgggtagt gaatggtctt gctaatggat ggggatctat cttcttccac ttaagccctg 781 cacacttgcc attccttaat tttgtttcag ggagatactt tttttgtttt gttttgtttt 841 gttttagatt caggaaactt caatgtcact gaaaacagca tttctaggaa agaaagagtt 901 atttactttt gtaagtcaaa aatgatgaaa tcatggatga ctttctagca gcagaggtag 961 aatattatct tataattgga tattttccta caaattatag tacaaggttt tatgctctat 1021 atatctaaga gtctcatttg aagcaactct gtaaaaattt tccagtattg agtttatgta 1081 tattttagaa gtgctgcatt tgaaggaaac ttgtaaacta ttacttaagt cttgggcaag 1141 agcaatgaga tttgtctcaa tgacctaagg tagtaaactg gagggaagga agattaaaac 1201 atactcattc tgaacaccaa aaaggaaatc tgttgaagtg aaatggacac tatgagaaac 1261 ggtgacttga tcagcagagc cctgactgtt aatgaacaac ttaatacatt atccctctta 1321 ttagtttttt gtctgttcta cttaatatga ctggtttaat tctgtaattt atacacagtt 1381 attcttaagt gttgaaaatc aactaaaatg tgatccctgt taaacataag agtgggaata 1441 agagagggaa gagatgtaca atttgggaca tgctcaagct gacttgcccc aaatggtaga 1501 gttagaaaca taccagggga ttccaattca atcccatcaa ggtggcatgt accaatgcca 1561 tctcctagtc caagtgatca atttcacttc acaattgatc ataatgaaag gactaagagt 1621 caaagggagc acataaacaa gtctagtacc tgctaacact aaccgataga ataaataaag 1681 gggagagtga tccaacatgg gaagtgaggt actcagcaga ctcatagaat ggcagatgtc 1741 ctaaatagca ctctggcctc agaatcagcc ctaaaggcat tccgatctgg ctgaaaagcc 1801 catgagagta tttcaggcat ggaaagccaa gacactctgg caaaagatct ctgtgagtga 1861 gatcccagtg gaaagaacag gtcttcaaag aaggaggtac ctttctctga agggaggaga 1921 gaacctccac tttgactatg accttgtcta aataagataa gagtcggaga actcagaggg 1981 cttccatagc cttggaaact catgactgga gcatagggag attactgatg ccatagacag 2041 gagtgtcaat tggtaaagtc aacaacagga gtcactgtgc acttactcct catgtaggat 2101 ctctgtcctt aatgtgctgt acattgagac ttaatgctat aacgagtact caaacaatat 2161 atttcacttt gtgtttctat gggggtgcaa actgttgaaa tctttactta atgcatacta 2221 aactgatcct ctgtaaaaaa aaaaaaagaa attatcaact cccaacttga ctctcactgg 2281 gattaaacat gacaataggt ctgatctgat ttcatcatca tttaaaaaaa atcatctatt 2341 atttttcact ttatgtttct gtgtgggagc aaactgttga aatccttact taatgtatac 2401 taagctgatc ttctgtatat taagataatc gaaaatgaat cttgatgtga atggaagggg 2461 agagggagtg ggaaagggga gggttgtggg tgggagggac ggtatggggg ggaaaccatt 2521 gtaatccata aatcgtattt ggaaatttat attcattaaa taaaagttaa aaaaaatact 2581 cattctaata aaaatggcat cccttctatt tcctaacatt ttatttctat gttgctttta 2641 attcatggac tatgttagaa aaattattta acacaaacac agaaaaataa gaatttacca 2701 gtattttaat catgaaataa tctaacactt atgtatagga tcagaaaatc ccaaagaaac 2761 caaaaaaagt aaggccatat ttcagataaa aggccatatt tcagataaat ggtaaccaaa 2821 tgcaagaatg ttttaaacat ttaattaacc acttagaaag ggacacacag gcacacacac 2881 atacacagaa tcagagagag ctacctgctt gttcactccc aaatgccccc caaaaaccca 2941 ggataggtta ggcagaaacc aggagcccag actcaatctg gatctcccac aggtgtggca 3001 gggacccccc atacctgagt tctcacttgc tgcctcttag gaagctagaa tggggagcag 3061 agccaggacc caagctcaag ccttctgata tagaatacgg gcatctcagg cagcatcaca 3121 actggtgact tcaaaagttg tccctaattt caacttctaa tttccaagct ttatccttca 3181 gaaaagtgtg acgtgaaatg aatgattctg cctttccact ccttcttccc agtaaattta 3241 atttgcattc acagatctat ggttatgctt tagggccatt gtccacaagg tccaaagtct 3301 gtgaactaag ctactaagag ataaatcagt ctgagaatga gacctaattt tgactcaaat 3361 tcatctacaa cctttgtaag aaatgaggca gacgcctgat gtgttcacag cagcagcatt 3421 gtatgtccac tagtagtggt ttagtctcac tttttgcagc tacaccagaa aatattctta 3481 agacttatga aagcagccac cagaaattct tgttttttgt taccaaagtc attgccagct 3541 aaggcaaaac ataagagaaa cttctgctgt ggccagggag tgcagtggag gatggcctag 3601 gtgcttgggc cctgcacccc atgggagacc aggaaaagca cctggctcct ggctcctgcc 3661 atcggatcac gcggtgcgcc ggccgcagcg cgccggccgc ggcggccatt ggagggtgaa 3721 ccaacggcaa aggaagacct ttctctctgt ctctctctct cactgttcac tctgcctgtc 3781 aaaaaataaa aaaaaataaa aaaaataaaa aaaaaaggaa agagaaaaag agagagagag 3841 gaagagagga tatcggggaa ttcaatctgg gtctcctata ttggtagcag agaaccaagt 3901 actagagcca tcaccagatg tgcccctcca ccccaggttt gcattaacag gaagctggaa 3961 ttgacaacaa agctaagact tgaacctagg aaatctgata ttggatgtgg gtatcccaat 4021 caatcatcaa cttgtcatca tttgacatca actatgtcaa atgcttggtg ttcagatatc 4081 ttgaagtaat atttgtctca atacttggat attaccctca ttgtgaaatc accactagat 4141 ctaatgtatt cataatgaag tatatgtaac actacactac atacttgata tttcatgttt 4201 ggttatttta tctcaataca ttttatcctt agcaatatta cgtatattaa ttcatgcatt 4261 aacacattat ttagtgtttc cattctacct aattagtcca tgatatagaa gaagagtagg 4321 aatttctgac tttatctatc ttcttcacta aaagcaagaa agggaggtag agattcaata 4381 atctggaata atactagttt gaggggagtc atttaatctg atgaagcttc tactattaga 4441 gtataaagga agataaagta gtctcatgtg ctagaaatgt ttgcattaca cctttaggaa 4501 gtaatcataa gtaccagcgt agggttataa acaagagaag agttctgggt aaacttagaa 4561 cattgagatg caaggcactg gactagggag acaaaagcca tagtgcaaaa caaaaaggac 4621 ccagaagcac ccatgtcacc agaatgcaag aaactatagg aaaagaaatt tccatgtata 4681 gcaagagcaa aggaagaact agaggaaaga aaggatagaa acatacgggg aattagccag 4741 atataagaac tagttaagga atactagata atgattatgt agtggaatgt ttctggaatt 4801 aaaaaggctg gaatttctgc agttctcttc cattatttct ctgatcatca actttttcct 4861 tttgacaatt atataacttc tgtgacctct ctttctttat tccattccgt ggataatgcc 4921 aattgtatga ctcagtacag agcagtgaaa taacaaaggg aatatctttc atgaacatca 4981 ataattcact aatacatata aatgtggatt tacagttttg tctccaacat gcaaagaatt 5041 agccagtcat cacatcaggg ggcttcaaaa acaagggcac ttagaagaat ttgaacctct 5101 accctggtga ctacagcaca cactagacaa atcaattaac ttattcctca gtttgattat 5161 aaatcctggg tttaaataat tttagtgtca tctgcctcat ggattaaagt ataagtagag 5221 cagacaagcc aaaaaagaga gtcagggaaa tatccctgga gtgctctctg taagcacagg 5281 agataacctt gctgtctgtc ctgtctagga aatcactcct agactgtact ttggaattga 5341 gtcttagtcc tactgcagat cccttgtcac tctcttcagc ttccaggagt aaccatgcat 5401 ggcccctgtc tcatggaaaa actaagatct caaaagtgtt tgcagttcac tcagggaagc 5461 tcaagtggat gatgttctgc ctcaaccact ttcatttttt gtgtgtgtgt gtgctgtgtt 5521 gctttgtttt ccttgacctt ctttttgtca ccaaccttga cccaaatatg aaactacttt 5581 agccctaaca ctttccagtc ttatgcttag ctgctctcta tgaatctttc ttggaaaagg 5641 ggaatgggaa tgatggatgc catcttggaa agattacatg gaaaaagaag agagtgaaaa 5701 catcgataat gtatgacaaa ttgtgttctg cttggaggag gaaccaacag agtagggatg 5761 tctcacgatc cgaaagccat tatcacaaac tcagtgaacc accactgacc atctggtggc 5821 ctggctccac ccatgaagga cacattccag tcttgaccaa tgacttcaaa gtatgaaaga 5881 gaacaagggg ccagaccatc agcagtgggg aataaaaggc caagccttga agcagcagca 5941 caaagctgct tctgacacat ttgtgatcga tcaccagcaa gctcccagac gtgacaccat 6001 ggtgcacttt actcctgagg aaaagtgcat tatcagtaag cagtggggcc aggtgaacat 6061 cgatgagact ggaggtgaag ctttgggcag gtaagtattg gttctcaata cttggtagag 6121 aaaggcaaat atgaccctgg tagactgacc aggaaattgc tcaaagtttt tgtatctctg 6181 attttctatt tgctgttgtc ccatcatata ggctccttgt tgtctacccc tggacccaaa 6241 gattctttga caactttggc aacttgtcct cttcctctgc catcatggga aaccccaagg 6301 tcaaggcaca tggcaagaag gtgctgacct cctttggaga tgccatcaag aacatggaca 6361 accttaaggg tgcctttgct aagctgagtg agctgcactg tgacaaactg catgtggatc 6421 ctgagaactt caaagtgagt ttaggagatg ctcatgagct ctttctggtt ttataactta 6481 gcacaataat gggatataag actatgattg gagagatcaa caaagatttc agaagtcata 6541 gatatgtttt gttgttagag atctccaatg gtcatcaaca agaccaattt aattcagaaa 6601 tagtaaggta aaaggctatt aacagtctaa tttttcagaa aatgatgtgt ttatcctgag 6661 aataatgtta ttgaattgta gctttgtaca aggaaaaata gaatttgtct gttagtagat 6721 gaaggcctat ctcaatgaag gagaaatgca atgcttacat tatatttttt ctgatgatga 6781 actaggactt gattagggac gatttatgcg agatctcttg aaaaaaaata caacaaattt 6841 tctgaaaaat attaaattct ctcagtattg tgaccaaata taaacttgtt cctgctttgg 6901 ttggagactt taaactccaa gacccttgca acactggcat tctatgttta ctgaaactaa 6961 acaattatta aatgagaaag aaaaaaagaa agaagataaa tatggtgaaa tgtggaagga 7021 ggataagata taggtagtca gaatgttgaa cggaagtctc ataaaaatga aattgaatcc 7081 attcttgtct tggtttaagg gtaggcacaa tccttggagt aatttgaggt cagagttttc 7141 agggaatgta ggtacatagt catttctttc tttgtaaatt cttttgggaa cacctactca 7201 atatgcctat gtatcatttt gtcttttacc taacagctcc tgggaaacgt gctgctgatc 7261 gttctggcca ctcattttgg caaagaattc actccggagg tgcaggctgc ttggcagaag 7321 ctcgtgtctg gtgttgccat tgctctggcc cacaagtatc actgaactct cttcccagtt 7381 tctttagtgc ccctactcat ccccagcgtc ctccttctgt accttgggga ctaggctcag 7441 ccttgatggc acagtttctg tttaataaaa tacattctat tcagtaatca aaaaatgatg 7501 tctcatcttc ttcgtatttt actcatgtgt ttaaaaaaaa aaaaccacac atgttcatga 7561 ttgagacaga gagagacata ggaagaaata agaatctttg agtagtatat taggatttat 7621 caaataaagg gaggtgttat tggagggtac aggagaatcc agagaggaca ccaagatggg 7681 gaagaagttc tcttgggtgg ggcaaaggct tcattaagtg gactgagtgt ggcaaaaaca 7741 tacggcagag ggaagggaat gagggaccaa agtggtatct gcgaaatcag atttatggtg 7801 aatgttgttt cactcaaaag aaagatttgt agtgatcaaa actctagaca tggtaatact 7861 attttaatgt acttgaacat acacattgaa gaaattagct tgatttatat cttgagaaat 7921 tggtttgaat ataaagaaat gagttaaagt atgagttgga aatttccaga aaaaaatgtt 7981 ttcatgaaca aactgcagtt tgttctttcc tgaagggttc tgttgtctca atgggatgcc 8041 aagactgaga catattacat cattgtgcat tatcttctga ttgaatgctg taaaataaag 8101 ctagtttcaa cacaaatgaa aactttcatg ggaaagataa agatttcatg tagaaaataa 8161 aatactagca aatgtgcctg gtcctaaatc agcaaggtaa aacttggtgt catagatgga 8221 actggaattc tggaagtaga ggggagtatt aggaaagatt agatagtgct gagctttgac 8281 tataaaaaat aaaatctact actaggctat tagcgacaat gagttataaa tattcttagg 8341 atgctaagtg actccatgga taagattttg ggggaaataa tcttcttatg catgctggat 8401 gaattcaaat tttcagagac tagacatgac tgattattca aaaggacaga tataatgatt 8461 taaacctgag gggggcagag agatctacaa aagtgaatga tgtacaatgt cgagggagga 8521 agagatggta tctgattgac ggaggtgagt gagaaggaag gaggagctgg acaatgactg 8581 acaggtccca gactggctga atgtaacagg aggaggccag gcactggcct ggagaccctc 8641 ctataggagt gtgattaaat gattcactct tgagggagtg ctggtttgct cagggaatct 8701 tgacattcag gtcctgagtc cacctctcaa atcgggagaa tttacaagtt agaatgctgt 8761 atcttcacaa atgaaacttt ctcagatatt cattgccccc acacattgat cagaatttgt 8821 cctatgaatt atttttctct tttctgaata tattaattgt aaaattgtaa taaaacaagt 8881 aacaaaatgc atgtcagtat acttcttttt atttatttga cagatagagt tagacagtga 8941 gagagacaga gagaaaggtc ttccttccat tggttcaccc cctaaatggc tgccacggcc 9001 ggaactaccc cgatctgaag ccaggagcca ggtgcttttt cctacgcgag tgcaggcaca 9061 caatcacttg ggccatcctc cactgccttc ctaggccaca gcagagagct ggattggaag 9121 aggagcaacc gggactagaa cctggcgccc atatgggatg ccggcaccgc aggtggagga 9181 ttaaccaaga gagccacagt atacttaaca acataataaa tttgaaggca ttctatcttc 9241 catcctcatg aattccttaa aatttagtta attgaataat agtgtctaac agtattagaa 9301 aaagattact taattttgat agaaaccaga tattttgtga atattgttgc tactcaagga 9361 gaaaaaaatg caaaaaacac caaatgaaag ttttcttctt tttcttcaac attttgggaa 9421 tcattgtttt cagagtgagt taaatcaatt attccatgtg actttttgaa ttattgatgt 9481 ttgttgaagt ggaaacaaga atgtcgaatc taagaaattt gaaaggtatt ggctaggttt 9541 tccacctgta tattaaagga tattttcact agatataaaa tgatatgttg gcaatatatt 9601 tttatgaaag ataaaatatc attccattgt tctatgtttt cttctttgag ttttagcagt 9661 ttgactatgg tgtgcattgg atcgtttttc tcttctgttt aaaccttgct taaagtttcc 9721 agtcttggat atgtggaaca tttatgagat tttaaaagat tttgctcaat atttattggg 9781 tttccttctg tcctcctttg ctccaggctc ctctatcagc tgctcagttt tttttttttt 9841 cattctatct gtgcttcaat ttgagtattt caattgagta tttcctattg ctctttgagt 9901 ctgctaatct tttttatgtc atttctgtct gctattagct aatgtactga attcttcatc 9961 ttaaatatta caattttcag cacaggagtt ttatttgatt ttcttctata ctccaattct 10021 atgaaaatat gctgtttgct cattctctac cttttaaatt tatttttaat gattatctac 10081 tttctatgtt taggttttat ttttatgttt gtttttcatt tgaaaggcac agggacagaa 10141 aaagagatag aggtagagat agacccagag agagagaaag agatctttct gctggttcac 10201 tactcaaatt tctgcagtag ccataatagt atcagcctaa aaccaggaac tagggctcaa 10261 tctagatctc ttatgccgca ttccagggta catgttaact ggaaggtgta atagaaaaca 10321 cagttaggac ttgaaccaag cactctgata ggagatgtgg gcatcccaaa tggtgtcata 10381 accactgcat caaacactca cctcattaaa aaagtttttg attaacatat taaacgtaca 10441 tatttatgtt caaatattta tttgtgtatt tgaaagtcag agcaacagag agagagagac 10501 agagacagag acagaaacct tctataggct ggctcactcc ccaaatggca gatatggcca 10561 ggacagtgtc aggccaaagc caggagccaa gaactctatc caggtcattc atgtgggtgg 10621 cagaagttca agtacttggg aaatcttcag atgcttctcc aaatacgtta gcatgcagct 10681 ggatctgaag tggaacagcc aggactcgaa ccagtgctcc aatatgggat gccagtgttg 10741 caagcagtgt cttaaattgc tgcaccacaa cactggcccc aataattgta tattctaatt 10801 atgttaaatg cagtatccaa cacaagtatt tagcataaac tcatcaaata ggtaattagc 10861 ctttccatta tctttccttt tttgaagatt tattttattt atttgaaaga caaagtttca 10921 tagaggacaa gacaaagaga ggtctttcat ctgctggttc acttcccaaa tgactacaac 10981 aactgggtct gccagcccaa gccagaaaca ggagcttctt ctgggtcttc catgagggta 11041 gaggggtgga agcacttgga tctctgctgc tttcccagcc acattagcag agaatagata 11101 gaagtagaca tgaaccggct cccatatggg atgctgatga tgcagatagt ggttttacct 11161 gctacaccac aacaccaact ttatctcttc tttgtgtttg gagcctgtga gatcctctct 11221 ttcagttctt tgcagaatag ataacggtat tatgaactat aatagctcta ttctcctatg 11281 aaacactaga atttatttat ttggtctaac tgttttggta tctgttattc tacctccctc 11341 tatcctgctc cccatacctt ctctatcttc tagtaatcat tatgctaaag ccagtttttt 11401 gagacttcca tatatgagac agaacacata gtagttgtct ttctgtgtct ggtttatttc 11461 acttaacata aagagttcca gttccattca ttttgctcca aatgtcaaaa tttcattctt 11521 tttatggctg aataatattt cattgtatgc acatcacatt ttaattttta ttaatataat 11581 gaaaacagac ttaatgtagc tcatagatac aactctaata atataatgat acttcccttc 11641 ctccctccct ctccttttct ttcttctttt aattattgat ataacatatt tgtaacttac 11701 attaactttg acattatagt caaagactta atgctccatt aaataaagag ttagataagg 11761 ccatagttca tcagacatat aatcaagaac aataaacaat aatcaaaaga aattatgtta 11821 attttactca tatacattaa ttttttaaaa agatttattt atttattcga aggtcagagt 11881 aacacagaga gaaaaggagg ggcagagaga gagagtggtt ttccatccaa tggttcactc 11941 ccaaattgct gcaacagccg gaactgtgtt gatccaaagc caggagccag gagcttcttc 12001 caggtctccc acgtaagtac agggcccaag gacttgggac atcttctact gctttcccag 12061 gccatagcag agagctggat tgtaagtgga gcagccgggt cttgaactgg cacccatatg 12121 ggatggcggc actttgggcc agggtgttaa tccgctgtgc cacagcacca gccccatgta 12181 aattacattt taaaataatc ataggtcatt taaactacag tagtaaatca ttcttaaaca 12241 tttttgtaca aagattcttt tttttacatg acattttctt tatctatcaa ttgggcacct 12301 agatcaattc catattttgg ctgttgtgaa cagccaaatg aatatgacag ggaaggtatt 12361 tttatgatat actgttttta ttccattgga tatatatcca gaagagagat gctggataat 12421 atatggttaa tatattttta gttttttgag aaaactaaat accccatgat gactgcccta 12481 atttccattg tgaatgactt cttaatttta tgttaaatct ctatactatg gataaatgaa 12541 caacgattat tgtagttttg ggttatcagt ggcctctttg agttcagtat gaccctaaac 12601 agagtaaaca ggggttattc atgccactaa tgtttttaag caatcataaa tgagatgaag 12661 gtgaattttt aatcaatcaa ttttttcctc tcttgttccc aactattatt ctggctctgc 12721 tagaaggcat caaggatggt gactatggag ctatagacag gtaacacact cttcatccct 12781 aagagggggg tagatctaaa tctagatagt actttcccaa cataacataa ttggaataaa 12841 tttttaattc acgcatacac aactcagaga caggtgtaaa tgatacacaa aagtgtaagt 12901 aattcatgtc aatgtatagg caataagttt tctgaacaaa atagtgttgc aacaaccaga 12961 atattctgac aacagtaatg taaccaggaa tacagagagc ttagagatta tgccattacg 13021 aagattaaaa tcctgtaacg acctctcact cccaccagcg gaagacaggt tccatctctt 13081 tagacctgtg cctcttctgc aacaatgagt caatgttctc cacttcccag gcccaccatt 13141 tatctctttg gcattccaag agttgctttt ctattgcttc agagctttga atttgctatt 13201 tcatcttcta agttctcttt cctaatatct tcccatgaca gattcctgtt tgtcagtctg 13261 aagcagctca agtacttcct ctttggaaaa agctctttct gacacaatca tgcttcattc 13321 ttaccaagtc cagataataa ttttcttttt agaaagcagt gatatctgaa attattttgt 13381 taatcattta cttattcatt ttctgtatct tctgccccta aattttaaac cctcagagaa 13441 aaatttattt ccttgatcag ttgtcttaaa gccagagttt ctagaacaat tataacacaa 13501 ttattcatgt tattatgtaa atgtttaata gtacatggga tggcatttat gaaaacacaa 13561 ctttataggt gaggtcttag gtatatcata agtaatgacg tggggaaaat ggaaggataa 13621 aacatctatg gacaaggaga aaattggaaa tattacctca aaaagatagt gggggggggg 13681 attttcttgt tagcgatgac tcttgaatca ctgggtttct agaaagatag ggaagttgct 13741 tgcagatgtg agggatcata gatatccctg agtctgagag ggagtgatta aaacaccctc 13801 atttgttaag acttttggag tcttagatat tagcctctga gtcatcctcc aaggtcgagt 13861 tctataaact gaccacaata ggactattaa tgtggtcaaa gtgctttctt tataaatcta 13921 cttcacaggc ttcatttctt aaggagactg taacctgaat tcctttatat cagtagaaat 13981 gtcctctact cacccttagc ttgagaaaag tcataaatta ttttggttat actggaaaat 14041 tactgacatc ttgaacaaat ctgatcttcc tagtttttta atcattatca acagtgaaca 14101 aatacaggtt gtatgtggca tcaagaaaag tagaaggatt ttagctgtat tattttaatt 14161 gctgttgatc tgaaatgtgt aatttcataa cctgtatttt tatatgatca ttgagacttg 14221 ccacagggac aaataataat taagccagac ttccaaaact tcgaatatta tgaactagca 14281 aggtatgaaa ccatggactg ttttcagaga caataaaaca aaatagcata aacttaaaaa 14341 aaggactctt cctgccttat tcaaagtgtg acatgtcaga gacagaagtc aattaaagat 14401 aaaagtgtct aaaagtttag ttatgtccca gctgtgacag tacatgaatg atgagtcaat 14461 ggaaatagct cctgcctttt gagctatttc aaacacacat aagaagagaa ggagatgctc 14521 cctgatgttt tgtcttataa taaaaattgc tgaaaatttt ccctggcatt tatcagagat 14581 aatttactca gagccacact gagtacagaa gagcaacacc aatactctgg gtcaccagtc 14641 agcatacaca cttatccagt gggaagcaca cattatgtgg tgcctacaga gcaagacatt 14701 catgtggatg aaaatatatt ttgctcacgc agtacagaca tacttatttc aaacagatac 14761 aatacatgta agtagaaaat aaagtagata catgtaagta gaaaataaag taagagttca 14821 attctgtatt atgctactta atactaacat atttatgcat ggagagttat gctttatggg 14881 agttgttttc cctacacagt tgtgaattag gaatatatct cctaaagcca caaaggatat 14941 attcacaatc agtataggac ttctcccctg gccattccta actgaatgag atttatactt 15001 gtctcatcag aaatcttgga tctgtgccca aaattatctg aatacttttc atatgtttgc 15061 ttatgcttta gctataattt gcaggcatct atggtgttag tgactaaaat ataggaaatg 15121 ctggcaaaaa aagtcctaga ctatttaatg gtgaggaaac aagaaaaata gctaaataga 15181 aaaataaatg ggagaactct ttaaaaaaat gaattgaaaa gggacaggac taacaaaact 15241 tgaggagcag agcccccttg agggggcctc tctgccacta tctcaatata tagtctgtgt 15301 gaagcttctc tgcctaaacc tcacccctgc gctgaccagc cttgccttga ccaatagtcg 15361 ttacacaaaa acactgacca atagcctcag agaacacggc gaaacaaggg gccagatgtc 15421 cagcgaggaa gaataaaagg acgagcctta gagcagtttc acatacttgc ttctgagaca 15481 tctgagacta tcagcaagct cagcgagctc ctagaccaga catcatggtt catttcacgg 15541 ctgaggaaaa ggctgctatc acaagcacat ggaagctggt ggacgtagaa gatgccggag 15601 cagaagccct cggaaggtag gccctggggt ccaggacaag gcagaggagg aatgaaactg 15661 agcctggcag gactccaggc cctttctcag gacttgtgaa gctctctgat actcccacca 15721 ctattgttct gtctctatag gctcctggtc gtctatcctt ggacccagag attcttcgat 15781 agctttggaa acctgtcctc ttcctctgcc atcatgggaa accccaaagt gaaggcccat 15841 ggcaagaagg tgttgactgc ctttggagat gctgtaaaga atgtggatga tctcaaaaac 15901 acattcgccc acctgagtga gctgcactgt gacaggctgc atgtggatcc cgagaacttc 15961 aaagtaagtc caagacaagc tcaagccctc ttgcatttag tcatgggtga aggtaggaca 16021 ttaagtattg gccttctgca gcataggaga tatttggttg ggagataaga aagctctaga 16081 ggaactgaac tgagactaag tggtaatgca ttagggtcta aacaatactt tcaggagtgt 16141 agagggattc gtctatactc aagtgtattt tagaagaggc ttctgtagag atgaggaaac 16201 agacttttgt taatttgact tttgaagaaa taactccttc atttctcccc caactcactt 16261 tccttaaaaa agtttaaaac cctaatgaag gcagaaaagg agtgacaatg ggaaagataa 16321 atgagggcag ccataaggcg agacttacgt gattttgtgt agagtgccat gcatattcct 16381 ctaatggcta atacttgttt aaaatgctgc aatgcccatt tctaacctaa attatacact 16441 tccagattga ggattatgtg tctctgggag agaatattgg aacatttggt ccgaagtggt 16501 tttgaaaaga taccgtgagg tgttaccttt tcccctcagc ccacaaatgg ggagttgtgt 16561 tagatgatga cagaaatatt taaattatga ccaatgatta gtttggacag gggaccaaac 16621 tacttacatg taacaactgg gaggggggga agatcagcct tggaggtaaa ttaacaaagg 16681 cttggttttg aggggtattc agtgtccaga tagcgggaga ctagcctgaa ctctctatta 16741 actatgtgtt tgtgtttgtc ttctcttttc acctccacag ctcttaggaa atgtgctagt 16801 aattgtcttg gcgaagtatt ttggcaagga attcactccc caggtccaga gtgcctggca 16861 gaaattggtg gctggagtgg ccactgccct ggcccacaag tatcactgaa ccccttcact 16921 gtaggacaga gcttctagga agaagcttta tccctcaaat aataatgaaa ataataaaac 16981 tactctaaga aattatttgt gatggtattg agtttatttt ccttgtactt ttaaatatat 17041 gatcctcaag ggatttattt tttttattgt tgagggatgt gtgtgtgtgt gtgtgtgtgc 17101 ctgtgtacta tgtgtgtata ctcttcattt acttttgctt gagcggtgga tttctgatga 17161 gaaaacagaa gaacaaggca acaaaagggc atgcatgtga gaattctgca cgggaaacaa 17221 ggagtgggga tttcgggtga ggaggaagtg ctttatggga caagaattca gaacaggaca 17281 gtgtagagga ggggatgaag tcttcaggga agtcctcagg ggctgacttg aggcagagta 17341 gtgatggggc tcaatggaaa gaagaaatta cagaccttag taggtctaag tctgggcaag 17401 agttctaatt tcagtctatt tggtgcaaca caattcccac atgcaccata ctttccacaa 17461 catttgtggg gctagatagg aggttggaga tcttgaagaa cacagcaggg ggtagttctt 17521 ttccaagtta accacaaata gctgatcctg aagagattta tgctttaagt gttgaggctt 17581 gctaggatag ttaatttgat cccatgcttc ctgggcagac aatgtccaag ggtacacata 17641 gaatacagta cttcaaggaa aaagtatcat tgtgagtagc tagaagggat gagtatcttt 17701 gtgggggaaa taaatcatca gaaggttaga aataagaatg catgtgatat gtgatcaatg 17761 gagcagcact gggcggtcag agggaaacag gagggatttg tattgattag tttgcagtga 17821 aggtgttgta tgactaaaag ttaatccatg tttataggca ggtgtacaat acaatgacca 17881 tggaattagg aagattagcc tggctgtggg tgctaatcat gctaatgtta ggtgactatc 17941 attgcttttc ttgccttgga attcaatctc atctcaattt tcatctataa tattccaata 18001 atacacctga gcccatttct ctactttcaa tccttttgtt cctcccttaa tacattttac 18061 tctgatgttc taatttattt tttctattat gaaatgccaa aaattcttgg ttcagcttgt 18121 ttttttcttc tgcagtttca gagttttgca ctgaaaattc ctgctgcttc catggactta 18181 tttctctaac ttcttatatc tggctctatc ttgtcattag gttcctaatt caactctgac 18241 ttcttttact aaagctattc tacgttattt ttctatagcc agttgctgct gtatcatttt 18301 atgcattttc ttctgtcata caatacctca taaaattatc ttattcagtt atttgtttag 18361 ttgtgtgcat tcattgcctt ttctattagc tgtgaccttc atgagacttc ataagttttc 18421 ttgtctgttc attcaaaatt tccaacattt agaggtgagc atttagccta ggagttaaga 18481 tgctggttaa aatgtctgtg tctcacacta gagtcacagg gtttaatacc cagctatgcc 18541 cacaactcta gtttattggt aatgaaagcc atgggaagca atgatgattg ttcaaataat 18601 tgcttttctg tggccagcac acggctcact tggttaatcc tccgctgcgg tgccggcatc 18661 ccatatgggc ccagggttct agttccggtt gctcctcttc cagtccagct ctctgctgtg 18721 gcctggaggg caatggagga tggcccaagt gcttggcccc tgcacctgca tgacagacca 18781 ggaagaagca cctggctcct ggctttagat cggcatacct ccagctgtag cagtcatttg 18841 gggagtgaac caatggaagg aaaacctttc tctctgcctc tctctcactg tctattactc 18901 tacctgtcca ataaataaaa aaaaaatgtt ttctgtcacc tctgtgggag accaggattg 18961 agttcctacc ttggctttgg atcctatctc cctcttggcc attgcaggac ttagaggagt 19021 aaaccagtgg atggaaactg tttctctcta cctctcaaat ctatttttta agtttccagc 19081 attttaaaca atgtatagat tatttggcta agtgaatgtt ttgctttaaa aaatgatggg 19141 agcatgaatt tggaagattg gttatgtagc tgcttagaat actcacatcc catattggag 19201 tgcttccttc aaggcccagc tcctctgctt ctgatccaac ttcctgctaa cacataccct 19261 gggaggccac aggtgatagc ttaagtgctt ggttccctgt caccaatgtg ggagacctgg 19321 attgagcttc agactcctgg cttcaatctg gccctgctct ggctgctgta gccatttgaa 19381 gagttaacaa gataaaatat ctctctatct tctctgggtc tctacctttc agataaaaag 19441 taaaataaaa taataaaata aaacttatgt gatgagtcag atgattgtga tgatgttcta 19501 gcactgatgt aaagagggat caaatgtcag tggaggccat gcatgtgaaa taaattggaa 19561 gtgtggaaca ggtatgaaca agaaagaatt aggaagcctt aaaaagtgat ttacaaagag 19621 ggtcaaaaca gcatttggaa gaaactgcag taataaatgc ctgtccagct ggtaaggtgg 19681 ggagaacatt cgctgacctg cagagtgatg aatacccctg aatcagcagg gatggcagtg 19741 tgttcacagt gtttgaggga taagcattac tgccattaga agctagtgcc ctaaaccatt 19801 tcatcccagc ccacaaagga gtttaagtga gagccccttg gcttctatct ttcactaact 19861 tctactctca aaagggctcc tttgggtcga ctcaaactct tccttcaggg ctatataaca 19921 aagcacagaa tctttccttt gcagctctgc accagtccaa aaggcactga gaggtcagag 19981 agaattcagt ttgttgggat gaattagctt cttgaagctt agcaattgtt attcatgttt 20041 tatatatttt aagatttatt tacttgaaaa gcagagttac agagagagat ggagacacac 20101 acacacacac acacacacac ggatagaatc ttccattcac tggttcacta cccaaatgcc 20161 catgacagcc aggtctaggc caaagccagg agccaggaac tccatctggg tcatcccctt 20221 gacggcaagg ccccaaacac ttgggtgatt atctgctgct ttcccaggct tattatcagg 20281 aagctggatt gggagtggag cactagggac tcaaaacggc attcctacat gggacatctg 20341 ttttgcaagc agcaactgaa cctgctgtac catggtattg gcccaatgga tgcattattt 20401 tttttttact tgacagagtt tgacagtgag aaagagagag agagagacat agagaaagga 20461 gaaagttctt ccttctgttg gttcaccccc aaaatggccg ccacggcagg cacgctgcac 20521 tgatctgaag ccaggagcca ggtgcttcct cctgatttcg catgcaggtg caggagccca 20581 agcaattggg ccattctcca ctgcccttct gggccacagc agagagctgg attggaagag 20641 gagcaactgg gactagaacc caacccccat aagggatgcc tgagctgcag ggggatgatt 20701 aaccaagcga gccatggtgc tggctcccaa tggatgcatt attaatagta atcactcaca 20761 atggaacaat gctcagttct tatcaaacat aatcaattct tagttgtgtt tccaaacaga 20821 aggtagatag aaatcctttt aattttaaaa agcaaatcaa aaaccaagaa aaaattttct 20881 gtacccttat cttaaagtgt gaattacaat taggaaggtc aagaattata aatctattga 20941 gcattgggtc ggatgctgct atttttaagc taactttaaa caaaatactc tgataatttt 21001 tgaatgtatt tctgaagttt gttgagctta gactcaagac taggaaaaga gaaataaccc 21061 tcttcaatgg aggttttcac acccagcaat gctgaaatgc ttgcatatac atggtgtgag 21121 gttctttttc ataggagcat atctacacaa tataaaagtt tctagtatct gtggcctaga 21181 ctagatattt tagtctcctt ggattaccaa cccttcttta taaaagaggg cagtctctag 21241 aggagcgaga aaatcaacca cagtgttctg tcatagtgaa ttagaaggtg gtggttaaaa 21301 tgctggttgg gaggcccaga ctccacatta cactgcttgc ctttgagtct tggctctgat 21361 cgtgattcca gcctccacac acagctaatg tgtgtcctgg gaggatgctg gtgatggcac 21421 aaataataga gttcctgtca agtcatattc tttctctgct tccctagaca cttccccagt 21481 tcagttcaga cagaaataat atcaacatgc aatgttgata tcccatttga aagcattgct 21541 tctcccagtt tttcaacttt ggctggtaat tgggttttgg tgtgtttgta aattttacat 21601 ctgaagtctc catgtttcca tttctgcatt gtaaaaatct aataagtatc acacagaaca 21661 aacttgttcc agatttattc tatttttcaa acattattgt ttgaatgaat atgtattttc 21721 ttaccagaag gattaatgca aagaagagag aagacatact tagaaatatg gcagagatat 21781 catccattct ttctaatttt ttttgcatgt tcttgagaca caggaagaga tccatatgta 21841 tacttttaaa ttatccaaat atgcaaacct ctactttgtg tgctttcttt cttcatttgt 21901 gtaataaaaa aagttgtgga agtaatttct cataggcttg tgatatcaga aaatttacat 21961 aaaactgcat aaataaaact gctaaggagg gtgattttgg taacaatgaa tattttctgg 22021 gaggggtcct ggagatacat agagggaaaa gtctgaattc agactcataa gccagtgcca 22081 gagctgacca tgaccagtgt tctgtcactc ttctggcctc accctggcat tggtgactca 22141 ccagggtagg cagtgcaggg gccactactg ggcataaaag gcagagtggg acagctgctg 22201 cttatgcttg tatttgacac aactgtgttt actagcaata ttccaaatag aatggtgcat 22261 ctctctggtg aagagaagga tacattcact gccctgtggg gcaaggttga tgtgaagaag 22321 ttggtgatga ggccctgaga gggttgcttt tcaagttata gggcagtcta ttgaggcaga 22381 tggaagctgg gcttgtgtag acagagctgg ccccagggtt tctgccaggc actgaatccc 22441 ccagtgctga ttgtctaccc aagaacccag aggttcttta taccgtttgg ggacctgtcc 22501 tctgaacctg ctgttatgag taatgtcaag gtgacggccc attgcaagaa ggtgctggct 22561 accttcagtg agggtctgaa tcacctggac aacctcgggg aaaattttgt ccagctgagt 22621 gagctgcact gtcacaagct ccatgtggat cctgagaact tcagtgttag tctaggcaat 22681 gttccacttt tttcttttca cattctcatc ttcttttttt gttgtttgtt tgttaagcaa 22741 cctgtctttg tcaacatttc tatctaattc cctatatttc gtcatttatt gcttttcaaa 22801 aattttcatt tttctttagt atttcgtttt ttcattcctc acagcattgt ttttttaatg 22861 tttctttatt aatatcctat attttccttg ctctccactc cctgcccaat atcctttcct 22921 ctaattgcta tctttgttat gcattctaga ttccatatgc taattctgca cctggaaaaa 22981 accttctaac cctccatgga gataggagag acttctcaaa aaggagaagc acaggatatg 23041 tctataggct ataaatcatt tggaacatga agaataattt gaattttata aattcaagaa 23101 taaatgaaaa taaaaggaag gtaactatct atctgaggac aatagatcaa aaggtcacat 23161 tgtaggtggg gttgcagatg ttgctaagaa gcaaccacta ttacatcaat taagggatta 23221 attagtaata ttattatata tgcatataca tgtacatacc tatgtgatta tacacttata 23281 tatatattgg ttgttttaaa ttgagactct atttagatta agatgtagga agaaaataat 23341 tagtgttggc ccaatttcag aatacaagct tgatttctgt gctaaccata cgtaaaatat 23401 ctccatagct ctttctcaca ctccagggca atgggctggt gattgtgctg gccttctact 23461 tcggcaaaga atttacccca tagctgaagg tgcctatcac agagtggcag ctggtgtggc 23521 caatgcctgg gctcacaagt acccctaaga tcttgacttg ttctctactt accctgttcc 23581 atggaaaatc ttttctatat ttcatctatt gaacatgaag acataatacc caacctaagg 23641 gtgtgagcct aataaaagat catttatctt aacttcatgg ttcatatcat ttattttatt 23701 tttctcacca tggatatgtg gtgggggaga atccctgatg gcttacagat aggaacctct 23761 tgtttcttat aaataaatca agaaaatgag aaaaagaaga gggccatata ttcctactaa 23821 catttgcact tctatctcca aagcatagag attgtcacaa gtgtttcata aagggaggat 23881 tctgtattac agagattcca caagagacct tcaactaaga tatggcctca taaccacttt 23941 caaaagttac tgagaaaata atgtgctcaa gggcataaat ttttttgctc cagactgtgg 24001 tgtttccttt ctttctcttt ccaatattta tacactgtaa ctcgtattac taattgccaa 24061 acatgtattc ctgctctaag aaccttcctg ccttcctgct cccaaaacct cttcagaata 24121 atgcttcttt atccagtcct caatttccag tacagtttag tactctctca tcactgctga 24181 ggtctgtgga gctcagcctt ggggaatttt tatttttaat ttttaagatt tatttattta 24241 atttaaaatt ggtattatag agaaagggag agagagagag ggagagagag agagagagag 24301 agatcttcca ttcactggtt tactctacaa atggccatat tggctaggct gaagctagga 24361 gctttatcca gatcccctat gtgggtggca aggacccaca cattaggcca tcttctgctg 24421 cttttcctgt gccattagca gggagttggg tcagaagcgg agcagctggg acacaaacta 24481 gtgctcatat gagatgtcag catcagaggt ggcagcttta cctcgtacgc cacaatactg 24541 gctccctcag tgaacttttt aatgttgctg aggaaattct ttccaaaccc ttcctgtata 24601 ttcatttttc acaactttac agctcttctt atgtcacaca cacacacaca cacacacacg 24661 agcaaataca tacatacacc ccacatatat tatgcttgat aaatatttct gtctattctg 24721 cactaatttg caggcctact gtgccctgta tcaaattctt ctatagataa taagtggaaa 24781 ttttttcaat ggaataataa gtgaatgaaa gtaattagga ctcctagctc ttgagggaaa 24841 taagaatatg ttagatcatt gcgtggagtt cttttgatgt aaatactgca atatacaatg 24901 aggaagcaag caggaaagta aggtactaaa gataaatctt ttgctttata atacaggctg 24961 ccaaatttat tcttgttgaa agtaattttg ttatgtgtaa gattagaaga atttttagaa 25021 ccctctaaca tatgaatttc ttgaaacaca acacattctg gttaattaac ttggacttac 25081 ttgaatacct aaatagaacc tggtgctttc ttctatatca acatgattct gtaagaactc 25141 tgcatatact gccaggattt accaacatgc cactgtgtga gggtaaaggg ttttttatgt 25201 acattttttt cttctcagat gattgcacat agtgtgacca aggaagtctt tatatacaaa 25261 taaaattttt taaggactac agacttggga tatagatttg acataattgc tatgatgctt 25321 tattgctatg attgccaact gctagatacc tatatcctat agtggagtgc ccgaatttga 25381 ttcccagctc tgtttcctgt ttccagcttt tggtaatgca gaccagagag gcagcagtga 25441 tagttcaaga gattgtgatc agtacatcca tgcgggagaa ttgcattaag ttcccagctc 25501 atagctcctt ttctgttcat ttgtgttgtg aaccaattga tgggaacaaa taacaatgat 25561 tattaaagaa acacagactc acatctcaga aaccaaatta ctccaggaaa aatcacagga 25621 agatcacttt tatcctcctg aacataggaa acccctccta agaatcagat aaaaggataa 25681 gcgatttctg aaggatctga aagagggtaa gagaatgtgg actttatgct cttgaaatag 25741 ggaaacaact gacctttttt tcctcaacat ggtcaagaaa agaacaaaaa tattttatta 25801 tattgtgttc catttttctc catctgaaag tggcaaatca atttaggaag aaatttggcc 25861 tacaacttac tatgaactga tcttctgtaa ataaagagaa ttgaaaatga attttgatgt 25921 gaatagaaga ggagagggag cgggaaaagg ggagggttgt gggtgggagg gaagttatgg 25981 gttgggaagt cactgtaatc cataagctgc actttggaaa tttatattca ttaaataaaa 26041 gttaaaaaaa aaaaagcaaa aaaaaaaaag aaacttgtga caagcataag taattactgt 26101 gggcaagcat tactgccttt ctattgtggt tgaggtggtc aagagccagt agagacttgg 26161 cagcttgtta ttatagtctc atttattggt agacaataat tacttgaaaa tcacagcaga 26221 ggctggtgct gcggcttact aggctaatcc tctgctctgc ggcgccggca ctccaggttc 26281 tagttccggt tggggcaccg gattctgtcc cggttgcccc tcttccaggc cagctctctg 26341 ctgtggcctg ggagtgcagt ggaggatggc ccaagtgctt gggccctgca cccccatggg 26401 agaccaggag gaagctcctg actcctggct tcggatcggc gcagtgcgcc ggccgtagcg 26461 accatctagg gggtgaacca acagaaaagg aagacctttt tctctgtctc tctctgttta 26521 actctgcctg tcaaaaataa gtaaataaat aaatgtgcac ttccctattt acaaaaaaag 26581 aaaatcatag aagaggaatt aattccttgt gatttgactt cacatttatt tgataacttc 26641 ccatacagtt atttctattt tcccccatga tgttgcttta taggtgattt cattacatga 26701 agatgggaaa gatagagatg ggaggaggga gggaagaagg gagggagaga aagagagaga 26761 ttgattgtag gtttgatttc aaaaacattt aaaaataata ttatttaatt aaaccaagtt 26821 gaatctgaca gaaaattatc ctaagtatcc atcttttgag ctcagagacc tactctgtag 26881 attttaactt gtgaatcaat atctacttgt atttataggg tgcccagact taagccaaga 26941 ttctgtctct aaaaaggtga aatctccaat ttaaccttgg aggagtttgc accattgctt 27001 tcaaggaatg gcccactcag tgaggtgaat aaaatctaat tctgagtgaa aatctgagaa 27061 aacatcaaag taaaagtcat acactgaaat accaattgct ctataataat tagaaatata 27121 taagttgtat tcaaaaaagt tcatggaaat gcattttatg aaaagcatac atgaaaatat 27181 gtttttctac caagatcaat ttatctatta atttcatttt tcaatggagt ttttaaagaa 27241 tccctgcact ctagcaaagt ggtatctggg ttaactttac actttttgat gagataaata 27301 aattcttcct aataaataaa gactacaata ttccctaatt cttcaactat tttgtgtatg 27361 cccctctttc tatgagttct gattttctta gcgctatttt attttactta tccttagaag 27421 atgctaagtt gaagtagctg aaaaattgtg cttctgggaa acaaaactac agcaaacagc 27481 tgaactatgt ttagtcaaga aagcagacag tagattgctg aaatgtcaca catgaggaac 27541 ttaatataga acaccaaata tattcagaaa cattcatgct gaaggccatt tttgaactgt 27601 acagcttcac attcacctag atgggtgtca acaggtggat tctgttgacg gagcttcatg 27661 ttcatccact taattatact ctgattagga ccactttcgt atagaacatt aggaaacagg 27721 aaagcctgcc agaatggagt caaggaagga aacttgtgcc tctgcctctt tctgagaaca 27781 catattttgt gtatgtgtat gtgtgtgtgt gtgagtgtgt gtatgtgagt gtatgtgtgt 27841 gtgtatgtgt gttgcagtgc ttccgggagc tggaggcaaa gtagagcaga ctatctgatc 27901 cctacaggaa agaagcaggc tcatgctgtg aggcagcagt gaatctcagt tactttctgg 27961 gaactttgga tggtgtagat ttgagactga ggctctggac ataaccacat cgatcgctat 28021 ttatcctgtc ttgaaactta ctatgtccaa tacatcagaa cttttgatag tcctatgact 28081 tgtaggttat ctccacaggg ttatgggtaa cagacggaaa agtctgtttc acattctaat 28141 cttgctgggg cagcagcaaa ccctcccctg aacccaagat atagcactgg tgctggtgcc 28201 agtacaagac agtgacaaaa tgtaacttag gggatctagt ttgaacttga gaaatggcgg 28261 aggaggtatt tagaaaggtg ccaatttctg ctaaccaaga ctgcaaagtt gtagtcttga 28321 caattctgct ctttacataa aattgaagca ttttctgtgt tgttctttag taagttattt 28381 aatttttcta tgaaaaagaa ttcttgaaaa ggatttaccc aaacttccta gaacaccttc 28441 tttcatatat atacacctga catcagataa aataggtatg aaagaggata tttaaaagtc 28501 tccttgtaaa gtgatgacta aactaatgac tttaaattta ataaaatgaa aaccaaaggc 28561 aaaattgcta aagccaagtt aaaaaaagaa acttaaaact caaaaggctt ccatagtctt 28621 ggcaactcat gacaagagcc tagggtgatt actgatgcca taaacaagag tgtcaatttg 28681 ttaagtcaac aacaggagtc accgtgcact tactcctcat gtaggatctc tgtccttaat 28741 gtgctgtaca ttatgattta atgatataac tagtactcag tgtttttcac tttgtgtttc 28801 tatgtgggtg caaactgttg aaatctttac ttaatatatg ctaaactgat cttctgtata 28861 taaagagaat tgaaaatgaa tcttgatgtg aatggaaggg gagagggagt gggaaagggg 28921 agggttgcgg gtgggaggga agttatggtg ggggggaagc cattgtaatc cataagctgt 28981 actttggaaa tttatattca ttaaataaat gttaaaaaaa aaacatttgg aagacctctt 29041 tccccagtat tcagcatttg aaatgccttc ctaggggtaa attatatgct cattgctcca 29101 gtctttgaaa gaggaataag tataaagtat ttaaattctg tcccttttac cccatgatca 29161 tatcctgaac acataattgt tacatatttc cgtataccca gagaagaatc aggccttccc 29221 ataagtgaaa gaatgagcag aatgcagttt gttcttgagt acctgatggg tgtctgagca 29281 gtttaaaatg gtgagaaata actgatggat ttagcagtga tggggacagt aatttggaga 29341 atagcatgat agaaagggaa ttttgtgaat ttttctggaa aaactggaaa tagagaattt 29401 gttttttttt ttttctggct cactgaaaac acattaatgt aattggagtt tcagagtgaa 29461 atgacaagag aattctattt gacatttgtt gccagtaatg gcaaaggaga gagaaagatg 29521 tttctagaca ttatggtacc tgcattagaa aatggttcta tggtaaggaa aagtatggct 29581 ctttccatac tggaaggaat ggcagtttag gggagggata tactgatgag tcagggatga 29641 tgcaaagagg ggagaatggg cagcagggtc tgtgggaatg ctgagatcct ccactgagct 29701 cagaagcact tcctcctttg cttttttcat tgcactttta acaatgacaa gcagaactcc 29761 acactccaca cttcagaagt ctttcctagc cggagaagtt agcaatttat tatgtgggag 29821 aaaagatgag aaagagaaag acattgggaa aagctataaa gatagaaggg taaactttag 29881 taaaagcaat caagtaaaca aaaaatgtat agagaaaaca gggctgttgg tggagagttt 29941 ggatcaaagg aatgtttcta aaagatagca ttcttagtgg tttagaaggg atgaaacagc 30001 tcaggctgta aatgtcttcc catgctccta ccaccacctt caagatccag aaacatcccc 30061 gtttttccct agacactttc ccagatcacc aaggcagagc taattaggca tccagtgtgg 30121 gtatggcatt gtaagccatt gcttttccta gcgttacagc actggcatgt agctggtttt 30181 gacacatgtg tagttttcac ctcattgaga ggtccctgtt tccattgctt caccccccaa 30241 accataacag agtaaaaatt ccacaaaata aatttgattc aaatttattc tattttaaaa 30301 gtttatttat ttgattatat gataaaatta aatgagtaac aacaacaata acatataatg 30361 aataaattaa taagtgtttt tcttaccagg aagatttact ccaaagcaga aggaagtcgt 30421 acttataact atggcacagg tttcatccat tctgtttttt gtaattattt tggatattct 30481 gaagacataa agagatctct ctctctctct ctctctctct ctacctatct atttatctat 30541 ttaagtggat ttcaacacac aaatcttctc ccttttctgt gccttaaatc ctcatttgta 30601 tgataaataa ttgcagagaa aatttttcat aggcttacca ggctctaata acaaaaatta 30661 tataaataaa tttggcaaga aaggtgtttt cagtagcaat tagtactgct ggtatgggtc 30721 tgggagatac atagaaggaa ggctgagtct gtcagactcc taagccattg ccataactgc 30781 caaggacagg ggtgctgtca tcacccagac ctcaccctgc agagccacac cctggtgttg 30841 gccaatctac acacggggta gggattacat agttcaggac ttgggcataa aaggcagagc 30901 agggcagctg ctgcttacac ttgcttttga cacaactgtg tttacttgca atcccccaaa 30961 acagacagaa tggtgcatct gtccagtgag gagaagtctg cggtcactgc cctgtggggc 31021 aaggtgaatg tggaagaagt tggtggtgag gccctgggca ggttggtatc ctttttacag 31081 cacaacttaa tgagacagat agaaactggt cttgtagaaa cagagtagtc gcctgctttt 31141 ctgccaggtg ctgacttctc tcccctgggc tgttttcatt ttctcaggct gctggttgtc 31201 tacccatgga cccagaggtt cttcgagtcc tttggggacc tgtcctctgc acatgctgtt 31261 atgagcaatc ctaaggtgaa ggctcatggc aagaaggtgc tggctgcctt cagtgagggt 31321 ctgaatcacc tggacaacct caaaggcacc tttgctaagc tgagtgaact gcactgtgac 31381 aagctgcacg tggatcctga gaacttcagg gtgagtttgg ggacccttga ttgttctttc 31441 tttttcgcta ttgtaaaatt catgttatat ggagggggca aagttttcag ggtgttgttt 31501 agaatgggaa gatgtccctt gtatcaccat ggaccctcat gataattttg tttctttcac 31561 tttctactct gttgacaacc attgtctcct cttattttct tttcattttc tgtaactttt 31621 tcgttaaact ttagcttgca tttgtaacga atttttaaat tcacttttgt ttatttgtca 31681 gattgtaagt actttctcta atcacttttt tttcaaggca atcagggtat attatattgt 31741 acttcagcac agttttagag aacaattgtt ataattaaat gataaggtag aatatttctg 31801 catataaatt ctggctggcg tggaaatatt cttattggta gaaacaacta catcctggtc 31861 atcatcctgc ctttctcttt atggttacaa tgatatacac tgtttgagat gaggataaaa 31921 tactctgagt ccaaaccggg cccctctgct aaccatgttc atgccttctt ctttttccta 31981 cagctcctgg gcaacgtgct ggttgttgtg ctgtctcatc attttggcaa agaattcact 32041 cctcaggtgc aggctgccta tcagaaggtg gtggctggtg tggccaatgc cctggctcac 32101 aaataccact gagatctttt tccctctgcc aaaaattatg gggacatcat gaagcccctt 32161 gagcatctga cttctggcta ataaaggaaa tttattttca ttgcaatagt gtgttggaat 32221 tttttgtgtc tctcactcgg aaggacatat gggagggcaa atcatttaaa acatcagaat 32281 gagtatttgg tttagagttt ggcaacatat gccatatgct ggctgccatg aacaaaggtg 32341 gctataaaga ggtcatcagt atatgaaaca gccccctgct gtccattcct tattccatag 32401 aaaagccttg acttgaggtt agattttttt tatattttgt tttgtgttat ttttttcttt 32461 aacatcccta aaattttcct tacatgtttt actagccaga tttttcctcc tctcctgact 32521 actcccagtc atagctgtcc ctcttctctt atgaagatct tattaaagca gctgggacag 32581 ggacagaaaa agggctttga ctgcctttct cttgagccct tttcctgatc tccacaactc 32641 actgatacca ctggtctcat tggaaggggt gggctgttaa cagtgtgaca aatgtaggaa 32701 taaactggat gcaaaagggg gctttgtgca gctttatatt cactgttgtc ttaaaccctt 32761 tttatggact caaatcaaat gacagtccct caggatgtta gcttctgaat tcagaaagtg 32821 attgcagagt tgcccactcc tttatcctgt gtctgatggt tttgctgtct ctgtagtgat 32881 tagcttatgt caccatttcc tcattcaata ggcactaggt ggatgaaagg ttctggttca 32941 ctccccaaat acctgcaaca gtcaggagtg tgtcaggcca aaaccagaaa acaggaattg 33001 ccatggggtc tccatgatgg gtggcaggga ctcaagtaca tgagccatat tcggctgctt 33061 ccaggtacat tagcagaaaa ctagatcaga agtggagctg tggggaccag aataaacact 33121 ttgatatggg atgttggtgt ctcaagtagc aacttaaccc cctgctcact aaaacactct 33181 aatcctcatt acctaggagc aactgagcct gagggctatc taatatagct ggtgacacag 33241 agatcatata ccctggctaa aagcatggct gaatccatga aagaaaatat atgctcaaaa 33301 taggaataga atacacagat ttatgcacag atgcttacaa attttagcca atcctgatga 33361 catggttaac ttggagatct agatcagttc ttgccagcat gcccagagaa tagtacatgg 33421 gaaaatttat agagatgatg agttagagac aaagtgagtg ataatgacat tgcctgggat 33481 tgctgctagg tacactgaaa aatcagggag gaagatccaa taaatgaccc attcaaaatc 33541 tagaaaacct gtcaacagga actttggaaa cttatttcta atgtatctga acatcaaggc 33601 agcaataagt ctttctgtaa aatcattaaa tatgcccaaa tgtcaagttc tatgtgagtc 33661 atgaaggtaa cttgataatg ctctacactt catattttgt tcattgttta atacaaaacg 33721 caatttttat tttatttatt taatttttaa ctgtttattt aataaatata aatttccaaa 33781 ttacagctta tagattacaa tggcttcatc ctcataactt gccttgccaa cctgcaaccc 33841 tcccatctcc tgctccctct cccattccat tcacatcaag attcattttc aattatcttt 33901 atatacagaa gatcaattta gtatatatta agtaaagatt ttaacagttt gcacccacac 33961 agaacataaa gtataaatac tgtttgagta ctagttatag cattaattca cattgaacaa 34021 cacattaagg acagagatcc tacatgagga gtaagtgcac agcgactcct gtcgttgact 34081 taacaaattg acattcttgt ttagggggtc agttatctcc ccaggctcct gtcatgagtt 34141 accaaggcta tggaggcctt ttgagttcac tgacttcgat cttatttaga caaggtcata 34201 gtgaaagtgg aagtccactc ctccctttag agaacggtac ctccttcctc aatggcccat 34261 tctttcaact gggatctcgc tcacagagat ctttcattta gctcatttaa ctcctttttt 34321 tttttttttt tctagagcat cttacctttc cattgcctga aatactttca tgggctcttc 34381 agccagatgt gaatgcctta agggctgatt ctgaggccag agtgctgttt aggacatgtg 34441 ccattctatg agtctgatgt gtatcccatt tcccatgttg gaatgttctc tccattttta 34501 attctgtcag ttagtattag cagacactag tcttgtttat gtgatccctc tgactcttat 34561 gcctatcatt acgatcaatt gtgaacagaa attgatcact gggactagtg agatggcatt 34621 ggaacatggc cacctcaatg ggattgaatt cgaatcccct ggtctgtttc taactctacc 34681 atttgaggta agtcagtttg agcatgtccc gaattgcaca tctcttccct ctcttattcc 34741 cactcttata tttaacaggg attacttttc agttaaattt aaacacctaa gaataattgt 34801 gtgttaatta cagagttcaa ccaatagtat taagtagaac aaccaaaaaa tactaaaagg 34861 gataaagtat tacattgtac atcaacagtc aggacaaggg ctgttcaagt cactgtttct 34921 catagtgttc atttcacttt gacaggtttc ctttttggtg ctgggtcagt tgtcactgat 34981 cagggagaac atatgatatt tgtccctttg ggactggctt atttcactca gcatgatgtg 35041 ttccagattc ctccattttg ttgcaaatga ccggatttca ttgttttttt ttgcttctat 35101 atagtattct atagagtaca tgtcccataa tttcttcctc cagtctactg ttgatgggca 35161 tttgggttgg ttccaggtct tagctattgt gaagtgagcc gcaataaaca ttgaggtgca 35221 gacagcttgt ttgtttgcca atttaatttc ctttgggtaa attccaggag cgggatggct 35281 gggttgtatg gtagggttat attcaggttt ctgaggatct ccagactgac ttccataggg 35341 gcttaaccag tttgcattcc caccaacagt gggttagtgt ccctttctcc ccacatcctt 35401 tccagcatct attgttggta gatttctgta tgtgagccat tctaagcggg gtgaggtgaa 35461 acctcattgt ggttttgatt tgcatttccc tgattgctag cgatcttgaa catttcttca 35521 tgtggatgtt ggccatttgg atttcctctt ttcaaaaatg gcaagtgagg tccttggccc 35581 atctcttaag tgggttgttt gttttgatgc tgtggagttt ctttatgtct ttgtggattc 35641 tagctattaa tgctttatct gttgcttagt ttgcaaatat tttttcccat tctgtcagtt 35701 gcctcttcac ttcctgactg cttcttttgc agtacagaac ttctcaattt gatgtaatct 35761 caatagttaa ttttggcttt gactgcctgt gcctccaggg tcttttccaa gaagtctttg 35821 cggtgccaat atcttgcagg gtttctccaa tgttctctaa taacttcatg gtgtcgggtc 35881 atagatttag gtctttaatc catgttgagt ggatttttgt gtaaggtgta aggtaggggt 35941 ctttcttcat gcttcagcac gtggaaatcc cagcaccatt tattgaatag actgtccttg 36001 ctccaggaat tggttttaga ttcctgatca aatataagta ggctcttgta tcccttcaat 36061 ttctttttct tgcctaacag ctctggctaa agcctccaga aatatactga atagcagtgg 36121 tgagaatgga tatccctgta tggtaccaga tctcagtgga aatgcttcca actttttccc 36181 attcaatagg atgctggtcg tgggtttttc ataaattgct ttgattgtat tgaggaacat 36241 tccttctata cccagtttac ttagagtttt caccatgaaa gggtcttgtg ttttattgaa 36301 tgctttctct gcatctattg agataatcat atggtttttc ttctgcagtc tgttaatgtg 36361 gtgtatcaca tttgcaaaca cttgaaccat ccctgcatac cagggttata tcccacttgg 36421 tctgggtgaa tgatctttct gaaatgttgt tgcactccgt tggccagaat tttattgaga 36481 atttttgagt ctatgttcat taggtatatt gttctgtaat tttctttcaa tgctgcatct 36541 ttttccggct taggaattaa ggtgatgctg gattcataga aagattttgg gaggattccc 36601 tctttttcaa ttgttctgaa tagtttgaga agaattgagt tagttcttct ttaaatttct 36661 ggtagaattc agtagtgaat ccatctggtc ctgggctttt ctttgttggg agggccttta 36721 ttactgtttc aatttctgcc tcagttatgg gtttgtttag gctttcgatg tcttcctggt 36781 tcaatgtagg taggttgcag gtgtccagga atctatgcat ttctgataga tttccctgtt 36841 tgctggcata cagtccttgt agtaatttct gatgattctt ttcatttctg tggtgtctgc 36901 tgttacattt cctatttcat ctctgatttt attgatttgg tctcttcttc ttttagttag 36961 ttgagctaat gcggtatcaa ttttgtttat tttttcaaaa aaccagctcc ccatttggct 37021 gatttttggt aatttttttg gattcaatcc tgttgatttc ttctctgatt ttaattattt 37081 ctcttctcct actagatttg ggtctgcttt gctgcagttt ttctagatcc ttgaggtgat 37141 ttgaaagctc atctatttgg tgcctttcca atttcttgat gtaggcacct attgatataa 37201 acttttctct taacactgct ttcgctgcat ctcatacatt ttggtatgtt gtgctgttat 37261 cctcatttac ttccagaaag tttttgattt ctcttttgat ttctttgatg acctagtgtt 37321 cattcaggag catgttgttc actctccatg tgtttgcata tgctgtaggg attcctgagt 37381 tgctaatttc cgacttcatt ctattatggt ctgagaagct gcatcatatg attctaattc 37441 ttttgaatgt gctgagactt gctttatggc ctagtatgtg gttaatctta gagtaggttc 37501 catgtactgc tgagaagaat gtaaattctt taagtgcagg atgaaaagtt ctgtagatat 37561 gtgtcagatc catctgggct atagtatcct ttgaatgtac tgtttcctgt agtcttctgt 37621 cctgtgatct gtctatttct gagagtggag tattgaagtc ccccagtact attgtattgg 37681 agtctaagtc tccctttaac tctcttaaca aatcttttaa ataaaccggt gccctgtaat 37741 taggtgcata tacattgata atcgttatat tttttctgtt gaattcatcc cttaatcatt 37801 atgtagtgcc cctctttgtc tctcttaaca gtttttgtgc taaagtttat tttgtctgat 37861 attaagatgg ttatgcctgc tcttttttca tttctgttgg catggaatat ctttctccag 37921 cctttcacat tcagtctgga tgcatctttg ttggaaagat gtgtttctgt aagcagcaaa 37981 tagatgggtt ttgttcctta acccaatcag ccaatctatg tcttttaact ggacagttca 38041 ggccattaac gttcagtgtg actattgata agtgtaattt tgccctgcca ttttcccaaa 38101 gatatatttt ttttctccaa aattgtattt attttacatt tattcttttt ttttatcttt 38161 tatttaatga atataaattt ccaaagtacg actcatgggt tacaatggct tcccccccat 38221 accgtccctc ccacccacaa ccctcccctt tcccactccc tctccccttc cattcacatc 38281 aagattcatt tttattatct taatatgcag aagatcagct tagtatacct taagtaagga 38341 tttcaacagt ttgctcccac acagaaacat aaagtgaaaa ataatagatg attttttttt 38401 aaatgatgat gaaatcagat gagacctatt gtcatgttta atcccagtga gagtcaagtt 38461 gggaattgat aatttctttt ttttttcttt tctttttttt tttacagaag atcagtttag 38521 tgtacattaa gtaaagattt caatcgtttg cacccccata gaaacacaaa gtgaaatata 38581 ctgtttgagt actcgttata gcattaagct tcagtgtaca gcacattaag gacagagatc 38641 ctacatgagg agtaagtgca cagtgactcc tgttgttgac tttacaaatt gacactcctg 38701 tttatggcat cagtaatctc cctatgcacc agtcatgagt ttccaaggct atggaagccc 38761 cttgagttct ccgactctta tcttgtttag acacggtcat agtcaaagtg gaggttctct 38821 cctcccttca gagaaaggca cctccctctt tgaagacctg ttctttccac tgggatctca 38881 ttcacagaga tctttttgcc agagtgtctt ggctttccat gcctgaaata ctctcatggg 38941 cttttcagcc agatccgagt gcctttaggg ctgattctga ggccagagtg ctatttagga 39001 cactccgcca ttctatgagt ctgctgagta tctcacttcc catgttggat cactctcccc 39061 tttatttatt ctatcggtta gtgttagcag atactagact tgtttatgtg ctccctttga 39121 ctcttagtcc tttcattatg atcaattgtg aactgaaatt gatcacttgg aatagtgaga 39181 tggcattggc acatgccacc ttgatgggat tgaattggaa tcccctggta tgtttccaac 39241 tctaccaatt ggggcaagtc agcctgagca tgtcccaaat tatacatctc ttccctctct 39301 tattgccact cttatgttta acagggatca catttcagtt aattttcaac acttaagaat 39361 aactgtgtga taattacaga attaaaccag tcatattaag tagaacagac aaaaaaaata 39421 ctatgaggga taatgtatta agttgtccat tagcagtcag ggctatgctg atcaagtcac 39481 catttctcat agtgtccatt tcacttcagg aggtttcctt tttggtgttc agtcagttgt 39541 caccgatcag ggagaacata tggtatttgt ccctttggga ctggcttact tcactcagca 39601 tgatgtgttc cagattcctc cattttgttg caaatgactg gatttcgttg tttcttactg 39661 cggtatagta ttctaaagaa tacatatccc ataatttctt tatccagtct accattgatg 39721 ggcatttagg ttggttccag gtgttggcta ttgtgaattg tgctgcaaca aacattaggg 39781 tgcagaccgc ttttttcttt atcaatttaa actcctttgg gtaaattcca aggagtggga 39841 tggctgggtc gaacggtagg gttatattca ggtttctgag gaatctccag actgatttcc 39901 ataggggctt gaccagtttg cattcccacc aacagtgggt tagtgtccct ttttccccac 39961 atcctcacca gcatctgttg ttggtagatt tctgcatgtg aaccattcta accggggtga 40021 ggtgaaacct cattgtggtt ttgatttgca tttccctgat tgctagtgac cttgaacatt 40081 ttttcatgtg cctgttggcc atttggattt cctcttttga aaaatgtcta ttgaggtcct 40141 tggtccatct cttaagtggg ttgtttgttt tgatgttgtg gagtttcttg atttctttgt 40201 agattctggt tattaaccct ttatctgttg catagtttgc aaatattttt tcccattctg 40261 ttggttgtct cttcactctc ctgactgttt cttttgcagt acagaaactt ctcaatttga 40321 tgcaatccca atagttgatt ttggctttga ctgcctgtgc ctcccgggtc ttttccagaa 40381 attctttgcc tgtgccaata tcttgaaggg tttctccaat gttctctaat aacttaatgg 40441 tgtcaggtcg tggatttagg tctttaatcc atgttgagtg gatttttgtg tgagatgtaa 40501 ggtaggggtc ttgcttcatg cttctgcacg tggaaatcca gttttcccaa caccatttat 40561 tgaatagact gtccttgctc caggaattag ttttagatcc ttgatcaaat aaaagttggc 40621 tgtagatgtt tggattgatt tctggtgttt caattctgtt ccattggtct atccatctgt 40681 ttctgtacca gtaccatgct gttttgataa ctactgccct gtagtatgtc ctgaagtctg 40741 gtattgtgat gcctccggct ttgtttttgt tgtacaagat tgctttagct attcgaggtc 40801 tcttgtgcct ccatatgaat ttcagcatca ttttttctag atcatagaag aatgtctttg 40861 gtatcttgat tggtattgca ttgaatctat aaattgcttt tgggagaatg gacattttga 40921 tgatgttgat cttccaatcc atgagcatgg aagatttttc cattttttgg tatcctcttc 40981 tatttctttc tttaaggttt tgtaattttc atcgtagaga tctttaacgt ccttggttaa 41041 gtttattcca aggtatttga ttgtttttgt agctattgtg aatgggattg atcttagcag 41101 ttctttctca gccatggcat tgcttgtgta tacaaaggct gttgattttt gtgcattgat 41161 tttatatcct gccactttgc caaactcctc tatgagttcc aatagtctct tagtagagtt 41221 ctttggatcc tctaagtaca gaatcatatc gtctgcaaag agggatagtt tgacttcttc 41281 cttcttgatt tgtattcctt tgatttcttt ttcttgtctg atggctctgg ctaaaacatc 41341 cagaactatg ttaaatagca gtggtgagag tgggcatccc tgcctggtgc cagattttag 41401 tggaaatgct tccaactttt ccccattcaa taggatgctg gctgtgggtt ttttataaat 41461 tgctttgatt atattgagga atgttccttc tatacccaat ttgcttagag ttttcatcat 41521 gaaagggtgt tgaattttat caaatgcttt ctctgcatca attgagataa ccatatggtt 41581 tttcttctgc agtctgttaa tgtggtgaat cacattgatt gatttgcgaa cattgaacca 41641 tccctgcata ccagggatga atcccacttg gtctgggtgg atgattttcc tgatgtgttg 41701 ttgtattcta ttggccagaa ttttattgag gatttttgca tctatgttca tcagggatat 41761 tggtctgtaa ttttctttca gtcctgcatc tttctctggc ttagggatta aggtgatgct 41821 ggcttcatag aaagaatttg ggaggattcc ctctttttcg attgttctga atagtttgag 41881 aataatggga ttagttcttc tttaaatgtc tggtagaatt cagcagtgaa tccatctggt 41941 cctgggcttt tctttgttgg gagggccttt attactgttt caatttctgt ttcagttatg 42001 ggtctattta ggttttcgat gtcttcatgg ttcaattttg gtagattgca tgtgtccagg 42061 aatctatcca tttctgatag gttttcctgt ttgctggcat acaggtcctt gtagtaattt 42121 ctgatgattc tttttatttc tgtggtgtct gttgttacgt ttactttttc atctctgatt 42181 ttattgattt gggtcttttc tcttcttttt ttagttagtt gggccaatgg ggtgtcaatt 42241 ttgtttattt tttcaaaaaa ccagcttctc gcttggctga ttttttgtaa tgtttttttt 42301 ttttttgatt caatcctgtt aatttcttct ctggttttaa ttatttctct tctcctacta 42361 gatttgggtt tggtttgctg cagtttttct aggtccttga ggtgcactga aagctcattt 42421 atttggtgcc tttccaattt cttgatatag gcacctattg ctataaattt gcctctcaat 42481 actgcttttg ctgtatccca taagttttga tatgttgtgt tgttgtcttc atttacttcc 42541 agaaagtttt tgatttctct tttgatttct tgaatgaccc agtgttcatt caggagcatg 42601 ttgttcagtc tccatgtgtt tgcatacttt cttgggtttc ctgagttgct aatttccagc 42661 ttcatcccac tgtggtctga gaagctgcat ggtatgattc taattctttt aaatttgctg 42721 agacttgctt tatggcctag tatgtgatca atcctagaga aggtcccatg cgctgctgag 42781 aagaatgtga agtctgtaga tgtagggttg aaagttctgt agatatctgt tagatccatt 42841 tgggcaatag tgtcaattaa atctgctgtt tccttgttga tcttctgtcc ggatgatctg 42901 tctatttctg agagtggagt attgaagtcc cccagtacta ttgtattgga atctaagtct 42961 ccctttaagt cccttaacat atcttttaaa tagaccggtg ccctgtaatt aggtgcatat 43021 acatttataa tagttacatc ttcctgttga attgaaccct taatcattat atagtgtccc 43081 tctttgtctc tcttaacagt ttttgtatta aagtttattt tgtctgatat taatatggct 43141 acacctgctt tttttggttt ctgttggcat ggaatatctt tttccaacct ttcactttca 43201 gtctgcatgc ctctttgtta gagagatgtg tttcttgtag gcaacaaata gttgggttgt 43261 gttctgtgag ccagtcatcc aaacggtgtc ttttaactga agaattcaga ccattaatgt 43321 tcaatgtgac aattgatacg tagtgacttt gccctgccat tttcccggaa atattttcta 43381 gtatatgctt tgagcttccc atgctctttt actggtaggt gttcttcctt tcccttcttt 43441 catattgatg gccgtgtttc tgtgtttctg agtgtagcac atctttaagt atcttttgca 43501 gggccggaca agtggccaca aagtctttca atttctgttt gctatgaaag gtctttattt 43561 caccttcatt cacaaatgag agcttggcag gatataatat tctgggctgg caatttttct 43621 ctcttagcac ctgtgctatg tctcgccatt ccctcctagc ttgtagggtt tctgatgaga 43681 agtctgctgt gagtctgatt ggagatcctc tgagagtaat ctgatgtttc tctcttgcac 43741 attttaggat cttttcttta tgtttcactg tggtaagttt aattaccacg tgtcgtgtga 43801 ggatctcttt tggtcatgtt tattgggggt tctatgagct tcctgtacta ggatatctct 43861 gtccttctcc aaacctggaa agttctctgc tagtatctca ctaaaaaggc cttccaatcc 43921 tttctctctc tccatgcctt caggaactcc tagaactcga atgttggttt ttttaatagt 43981 atcctgtaga ttcccaacaa tattttttag atttctaatt tcctcttctt ttctttggtt 44041 tgactgtata ctttcctgtg ctctgtcttc taagtccaat attctgtctt ctgcttcacc 44101 cattctgttt ttaaggcttt ctaatgtgtt tgccatttga tctattgagc tcttcatttc 44161 attttgattt ctcttcacta ttacactttc ctgttctact agtttctgag tttcattttg 44221 actcttcctt aaaatttcat tttcacgaga gagattttca atcttgtcca ttaaggattt 44281 ctgtagttca aggatttgct tttgaaaact tctaaatgtt cttatcataa attttttgaa 44341 atccgtatct tgcatttctt ctatctcatc atcttcatac tcttggcttg gggtgttttg 44401 cttatttgga ggcatcatag tgtcatcgtt gatcttgctc cctctatttc tgtgtttgtt 44461 actcggcata gttaattctt cttgtgtcac tgtgcgtttt ttttttttct tttttttttt 44521 tttttatact gtgtccatgt taagtggact gcctgctgtt ggaggagcct tggaggcttg 44581 agatgggtgc ggcc //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS YSCPROMA 162 bp ds-DNA PLN 13-FEB-1990 DEFINITION S.cerevisiae promoter region, clone pUT340. ACCESSION M32023 KEYWORDS promoter. SOURCE S.cerevisiae (strain FL200) DNA, clone pUT340. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 162) AUTHORS Gatignol,A., Dassain,M. and Tiraby,G. TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector based on phleomycin resistance JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.L.Stassi, 08-FEB-1990. BASE COUNT 46 a 25 c 29 g 62 t ORIGIN 1 gatctttttt tttttttttt ttttttttgt gctgcaaagt ttcttaaagc cttcgggctt 61 acgaaatcct ttatcaccga aaggggaccg cttcgaaaag tggatataaa acaaggtatt 121 tattttttat agacaatgac caaatgacag gatagatcca tg // LOCUS YSCPROMB 270 bp ds-DNA PLN 13-FEB-1990 DEFINITION S.cerevisiae promoter region, clone pUT341. ACCESSION M32024 KEYWORDS promoter. SOURCE S.cerevisiae (strain FL200) DNA, clone pUT341. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 270) AUTHORS Gatignol,A., Dassain,M. and Tiraby,G. TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector based on phleomycin resistance JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.L.Stassi, 08-FEB-1990. BASE COUNT 74 a 47 c 55 g 94 t ORIGIN 1 gatatatttt cgcggctcgc gttttgtttg cttcttattt tacactgagt tttcgtgccg 61 caaacgtgga gatgggaaaa agaaaagtcg ggaaaataat gagaaatttc tacttttggt 121 attcctcata cagcctgcgc ggtttattag taaaataccc gataatcctc gaggtttgaa 181 aaacttttcc ctctactact gttgacacgg atttttttat ttaagaggaa aagtcgtggt 241 tgttttcctc gaacaaatta gatatccatg // LOCUS YSCPROMC 343 bp ds-DNA PLN 13-FEB-1990 DEFINITION S.cerevisiae promoter region, clone pUT342. ACCESSION M32025 KEYWORDS promoter. SOURCE S.cerevisiae (strain FL200) DNA, clone pUT342. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 343) AUTHORS Gatignol,A., Dassain,M. and Tiraby,G. TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector based on phleomycin resistance JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.L.Stassi, 08-FEB-1990 BASE COUNT 87 a 81 c 64 g 111 t ORIGIN 1 ctaaaataca gcaaacttgc attttggtca tttatggtta gcaagtttaa gtgctactac 61 attacatatt acctgaccct gaaaacgggc ttaaaagccc atctactata gtgccgctgc 121 gggtaaccgt agcttcttgc tttttttccc accgtcattt attttctagt cttctttccg 181 agaaatgccg gcggcgtttg ccgttccgtt cgcaatctct cgagaagttc gagagatgaa 241 aaattttctt gggaaagcca agaaggtacg atcgtaggca tgattcatat ataaactctt 301 tcccaatctt tttggttacg tacatactcc cccctgatcc atg // LOCUS YSCPROMD 328 bp ds-DNA PLN 13-FEB-1990 DEFINITION S.cerevisiae promoter region, clone pUT343. ACCESSION M32026 KEYWORDS promoter. SOURCE S.cerevisiae (strain FL200) DNA, clone pUT343. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 328) AUTHORS Gatignol,A., Dassain,M. and Tiraby,G. TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector based on phleomycin resistance JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.L.Stassi, 08-FEB-1990. BASE COUNT 95 a 80 c 63 g 90 t ORIGIN 1 gtcatatata accataacca agtaatacat attcaaaatg ggtgttgaac aaatcttaaa 61 gagaaagacc ggtgtcatcg ttggtgaaga tgtccacaac ttattcactt acgctaagga 121 acacaagttc gctattccag ctattaacgt cacctcttct tctactgccg tcgctgcttt 181 agaagctgct agagacagca agtccccaat cattttgcaa acctctaacg gtggtgctgc 241 ttacttcgct ggtaagggta tctctaacga aggtcaaaat gcttccatca agggtgctat 301 tgccgctgcc cactacatca gatccatg // LOCUS YSCPROME 123 bp ds-DNA PLN 13-FEB-1990 DEFINITION S.cerevisiae promoter region, clone pUT344. ACCESSION M32027 KEYWORDS promoter. SOURCE S.cerevisiae (strain FL200) DNA, clone pUT344. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 123) AUTHORS Gatignol,A., Dassain,M. and Tiraby,G. TITLE Cloning of Saccharomyces cerevisiae promoters using a probe vector based on phleomycin resistance JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.L.Stassi, 08-FEB-1990. BASE COUNT 41 a 15 c 21 g 46 t ORIGIN 1 gatcattatt gatagcttac ttctttgagc ggaatatata gagattttgg actcgaagaa 61 tgctaaatat ttcttagagt taatttttgc aaaaatgaga tatcaatcat ttcatgatcc 121 atg //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS MUSACTMEL 1857 bp ss-mRNA ROD 15-MAR-1989 DEFINITION Mouse A-X actin mRNA, complete cds. ACCESSION J04181 KEYWORDS actin. SOURCE Mouse (strain C57BL/6) B16 melanoma F-1, cDNA to mRNA, clone pMA30. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1857) AUTHORS Sadano,H., Taniguchi,S., Kakunaga,T. and Baba,T. TITLE cDNA cloning and sequence of a new type of actin in mouse B16 melanoma JOURNAL J. Biol. Chem. 263, 15868-15871 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.Sadano, 08-SEP-1988. FEATURES from to/span description pept 63 1190 A-X actin mRNA < 1 1857 actin mRNA BASE COUNT 410 a 498 c 453 g 496 t ORIGIN 794 bp upstream of BglII site. 1 cccgcgagca cagcttcttt gcagctcctt cgttgccggt ccactcccgc caccagttcg 61 ccatggatga cgatatcgct gcgctggtcg tcgacaacgg ctccggcatg tgcaaagccg 121 gcttcgcggg cgacgatgct cccctggctg tattcccctc catcgtgggc cgccctaggc 181 accagggtgt gatggtggga atgggtcaga aggactccta tgtgggtgac gaggcccaga 241 gcaagagagg tatcctgacc ctgaagtacc ccattgaaca tggcattgtt accaactggg 301 acgacatgga gaagatctgg caccacacct tctacaatga gctgcgtgtg gcccctgagg 361 agcaccctgt gctgctcacc gaggcccccc tgaaccctaa ggccaaccgt gaaaagatga 421 cccagatcat gtttgagacc ttcaacaccc cagccatgta cgtagccatc caggctgtgc 481 tgtccctgta tgcctctggt cgtaccacag gcattgtgat ggactccgga gacggggtca 541 cccacactgt gcccatctac gagggctatg ctctccctca cgccatcctg cgtctggacc 601 tggctggccg ggacctgaca gactacctca tgaagatcct gaccgagcgt ggctacagct 661 tcaccaccac agctgagagg gaaatcgtgc gtgacatcaa agagaagctg tgctatgttg 721 ctctagactt cgagcaggag atggccactg ccgcatcctc ttcctccctg gagaagagct 781 atgagctgcc tgacggccag gtcatcacta ttggcaacga gcggttccga tgccctgagg 841 ctcttttcca gccttccttc ttgggtatgg aatcctgtgg catccatgaa actacattca 901 attccatcat gaagtgtgac gttgacatcc gtaaagacct ctatgccaac acagtgctgt 961 ctggtggtac caccatgtac ccaggcattg ctgacaggat gcagaaggag attactgctc 1021 tggctcctag caccatgaag atcaagatca ttgctcctcc tgagcgcaag tactctgtgt 1081 ggatcggtgg ctccatcctg gcctcactgt ccaccttcca gcagatgtgg atcagcaagc 1141 aggagtacga tgagtccggc ccctccatcg tgcaccgcaa gtgcttctag gcggactgtt 1201 actgagctgc gttttacacc ctttctttga caaaacctaa cttgcgcagg aaaaaaaaaa 1261 taagagacaa cattggcatg gctttgtttt tttaaatttt ttttaaagtt tttttttttt 1321 ttttttttta agtttttttg ttttgttttg gcgcttttga ctcaggattt aaaaactgga 1381 acggtgaagg cgacagcagt tggttggagc aaacatcccc caaagttcta caaatgtggc 1441 tgaggacttt gtacattgtt ttgttttttt ttttttggtt ttgtcttttt ttaatagtca 1501 ttccaagtat ccatgaaata agtggttaca ggaagtccct caccctccca aaagccaccc 1561 ccactcctaa gaggaggatg gtcgcgtcca tgccctgagt ccaccccggg gaaggtgaca 1621 gcattgcttc tgtgtaaatt atgtactgca aaaatttttt taaatcttcc gccttaatac 1681 ttcatttttg tttttaattt ctgaatggcc caggtctgag gcctcccttt tttttgtccc 1741 cccaacttga tgtatgaagg ctttggtctc cctgggaggg ggttgaggtg ttgaggcagc 1801 cagggctggc ctgtacactg acttgagacc aataaaagtg cacaccttac cttacac // LOCUS MUSIGHMM 369 bp ds-DNA ROD 15-DEC-1989 DEFINITION Mouse active Ig mu-chain VJ3-region mRNA, partial cds, from anti-alpha(1->6)dextran hybridoma 27.7.2. ACCESSION M24320 KEYWORDS immunoglobulin heavy chain; joining exon; mu immunoglobulin; variable region. SOURCE Mouse (BALB/c) anti-alpha(1->6)dextran hybridoma 27.7.2, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 369) AUTHORS Sikder,S.K., Borden,P., Gruezo,F., Akolkar,P.N., Bhattacharya,S.B., Morrison,S.L. and Kabat,E.A. TITLE Amino acid substitutions in V-H CDR2 change the idiotype but noT the antigen-binding of monoclonal antibodies to alpha(1->6)dextrans JOURNAL J. Immunol. 142, 888-893 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Kabat, 15-MAR-1989. FEATURES from to/span description pept < 1 > 369 Ig mu chain VJ3-region precursor (AA at 1) matp < 1 > 369 Ig mu chain VJ3-region recomb 321 322 V-region end/J3-region start BASE COUNT 92 a 85 c 96 g 84 t 12 others ORIGIN 1 caggttcagc tgcagcagtc tggagctgag ctgatgaagc ctggggcctc agtgaagata 61 tcctgcaagg ctactggcta cacattcagt agctactgga tagagnnnnn ntgggtaaag 121 cagaggcctg gacatggcct tgagtggatt ggagagattt tacctggaag tggtagtact 181 aactacaatg agaagttcaa gggcaaggcc acattcactg cagatacatc ctccaacaca 241 gcctacatgc aactcagcag cctgacatct gaggactctg ccgtctatta ctgtgcaaga 301 cattactacg gtagtagctc ctttnnnnnn gcttactggg gccaagggac tctggtcact 361 gtctctgca // LOCUS MUSIGHMN 369 bp ds-DNA ROD 15-DEC-1989 DEFINITION Mouse active Ig mu-chain VJ3-region mRNA, partial cds, from anti-alpha(1->6)dextran hybridoma 27.4b.2. ACCESSION M24321 KEYWORDS immunoglobulin heavy chain; joining exon; mu immunoglobulin; variable region. SOURCE Mouse (BALB/c) anti-alpha(1->6)dextran hybridoma 27.4b.2, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 369) AUTHORS Sikder,S.K., Borden,P., Gruezo,F., Akolkar,P.N., Bhattacharya,S.B., Morrison,S.L. and Kabat,E.A. TITLE Amino acid substitutions in V-H CDR2 change the idiotype but noT the antigen-binding of monoclonal antibodies to alpha(1->6)dextrans JOURNAL J. Immunol. 142, 888-893 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Kabat, 15-MAR-1989. FEATURES from to/span description pept < 1 > 369 Ig mu chain VJ3-region precursor (AA at 1) matp < 1 > 369 Ig mu chain VJ3-region recomb 321 322 V-region end/J3-region start BASE COUNT 90 a 82 c 97 g 88 t 12 others ORIGIN 1 caggttcagc tgcagcagtc tggagctgag ctgatgaagc ctggggcctc agtgaagata 61 tcctgcaagg ctactggcta cacattcagt agttactgga tagagnnnnn ntgggtaaaa 121 cagaggcctg gacatggcct tgagtggatt ggagagattt ttcctggaag tggtagttct 181 aactacagtg agaagttcaa gggtaaggcc acattcactg cagatacatc ctccaacaca 241 gcctacatgc aactcagcag cctgacatct gaggactctg ccgtctatta ctgtgcaaga 301 cattactacg gtagtagctc gtttnnnnnn gcttactggg gccaagggac tctggtcact 361 gtctctgca // LOCUS MUSIGHMO 369 bp ds-DNA ROD 15-DEC-1989 DEFINITION Mouse active Ig mu-chain VJ3-region mRNA, partial cds, from anti-alpha(1->6)dextran hybridoma 27.10.2. ACCESSION M28529 M24322 KEYWORDS immunoglobulin heavy chain; joining exon; mu immunoglobulin; variable region. SOURCE Mouse (BALB/c) anti-alpha(1->6)dextran hybridoma 27.10.2, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 369) AUTHORS Sikder,S.K., Borden,P., Gruezo,F., Akolkar,P.N., Bhattacharya,S.B., Morrison,S.L. and Kabat,E.A. TITLE Amino acid substitutions in V-H CDR2 change the idiotype but noT the antigen-binding of monoclonal antibodies to alpha(1->6)dextrans JOURNAL J. Immunol. 142, 888-893 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Kabat, 15-MAR-1989. FEATURES from to/span description pept < 1 > 369 Ig mu chain VJ3-region precursor (AA at 1) matp < 1 > 369 Ig mu chain VJ3-region recomb 321 322 V-region end/J3-region start BASE COUNT 90 a 84 c 98 g 85 t 12 others ORIGIN 1 caggttcagc tgcagcagtc tggagctgag ctgatgaagc ctggggcctc agtgaagata 61 tcctgcaagg ctactggcta cacattcagt agctactgga tagagnnnnn ntgggtaaag 121 cagaggcctg gacatggcct tgagtggatt ggagagattt ttcctggaag tggtagtact 181 aagtacaatg agaagttcaa gggcaaggcc acattcactg cagatacatc ctccaacaca 241 gcctacatgc agctcagcag cctgacatct gaggactctg ccgtctatta ctgtgcaaga 301 cattactacg gtagtagctc ctttnnnnnn gcttactggg gccaagggac tctggtcact 361 gtctctgca // LOCUS MUSIL71 1926 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse interleukin 7 (IL-7) gene, exon 1. ACCESSION M29054 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 1 of 4 SOURCE Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1926) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept 1897 + 1906 interleukin 7, exon 1 pre-msg 1350 > 1926 IL7 mRNA and introns IVS 1907 > 1926 IL7 intron A BASE COUNT 498 a 478 c 430 g 520 t ORIGIN 156 bp upstream of XbaI site. 1 aatccctggt ttctttctct ttcctctctt tccttttttt tttttttttt tttttttttg 61 gtggacattt ttccatgttc catcgttgaa atgctttctc accacagtag ctctcttcca 121 tcctgtcata ctgcaacgct ggagcctaat cctattctag atagcccttc tctgtggatg 181 ggagctgtct ggagaccagg gctgcagtga agataacaag tactgtcaaa tctcctgatg 241 ctttgagcag gatgacacca agagagacac cagagaggcc tttaaacacc cctgcaggca 301 ccgggtagga tttattttgg aagtgcacta agacaacagg ttttacattg aatgtggaaa 361 gtgagagtcc ctgcagagtc cacaggccag tgcccagtta gaactcctat ggtccacact 421 ctaaatgcac agacagtacc tctctgcctc ttttcactgt taacagacag aatctgaagc 481 tcgtgtcctg atgtccgatt aggaaccgcc agctccagac ccctttctga ggttccccag 541 agcgttaaga gcacacccac caccctaggg cgtcttctta gtctactgta tcctccatta 601 tttacaaggt tctcttccac aagagattcc aagcagactc tttgaagcca gcctgcaggt 661 gttttttatc caaattcagc cagtgaactt agctctgtta cttaaggaag cctctgttcc 721 catgtgtatc tctcaactgc agataagaaa acagattgcc tcggcagaaa actccagatg 781 ccaaaataca tgtaaattcc tgaccaagag tcctggcata gagcaaatac tagatatatg 841 agagatatgg tcctgatctc agatctctct tgttactggt ttgcacaacc tgccttcacc 901 taagcagtgt ttctcctgat tataagaaac aacactaaat caggaatccc ctcggtaatt 961 caggcctatt attaaaacaa aacaaaacac tttaatggaa acaatccaca atgggtggat 1021 tagctatgtc aggagataac tgtgggagat agttactatg cagaacttct atgtgccccc 1081 aactaatacg agttttggtt ttctgaagct accacttaca tactagactg gatggactca 1141 caagaattat ttgttcttaa agtcctacct gagcaggtgc attgcacaga tgacagggag 1201 gaaaaaaaaa aaaaaacaaa ctagagaacc tagaggggca gagggcgtga ccctcttaat 1261 catccttcac ttcctttttt tttttttttt ttttaagatg acttacatcg gcggcagcag 1321 aagcagcgac gcctcctcag tggaactgac ttgcaatagc gagctttctc tgctgcacat 1381 ttgtggcttc tgtggacata ttagtaacca gcggttttag ctcccagtct cacagagttg 1441 ccagagaggt tagaagtcat ttgaaaagcc tattagccga atctttctga tccagaaggc 1501 cagctggctt ctcctgagct actttcaatt cgcagcaacc actgatcctg gtccaggtga 1561 ctggggaaga cgctgagggt ataaacccaa acattgaacc tgaagaccca gcgcaaagta 1621 gaaactgaaa gtaccctgct tactctgccg gcagatccta cggaagttat ggcaaagcca 1681 gagcgcctgg gtggccggtg atgcatgcgg cccctcttgg gatggatgga ccaggcgtgg 1741 cgtgggtgag aggagtcagc tgcctgaact gccctgccca gcaccggttt gcggccaccc 1801 ggtggatgac cggggtcctg ggagtgatta tgggtggtga gagccggctc ctgctgcagt 1861 cccagtcatc atgactacac ccacctcccg cagaccatgt tccatggtaa gcgctgctct 1921 ctggtg // LOCUS MUSIL72 174 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse interleukin 7 (IL-7) gene, exon 2. ACCESSION M29055 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 2 of 4 SOURCE Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 174) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept + 21 + 154 interleukin 7, exon 2 pre-msg < 1 > 174 IL7 mRNA and introns IVS < 1 20 IL7 intron A IVS 155 > 174 IL7 intron B BASE COUNT 44 a 31 c 35 g 64 t ORIGIN Undetermined number of bp after segment 1. 1 tcattaactg ctgggttcag tttcttttag atatatcttt ggaattcctc cactgatcct 61 tgttctgctg cctgtcacat catctgagtg ccacattaaa gacaaagaag gtaaagcata 121 tgagagtgta ctgatgatca gcatcgatga attggtacgt gttgttttat tttt // LOCUS MUSIL73 121 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse interleukin 7 (IL-7) gene, exon 3. ACCESSION M29056 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 3 of 4 SOURCE Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 121) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept + 21 + 101 interleukin 7, exon 3 pre-msg < 1 > 121 IL7 mRNA and introns IVS < 1 20 IL7 intron B IVS 102 > 121 IL7 intron C BASE COUNT 46 a 13 c 20 g 42 t ORIGIN Undetermined number of bp after segment 2. 1 attgttttat attattacag gacaaaatga caggaactga tagtaattgc ccgaataatg 61 aaccaaactt ttttagaaaa catgtatgtg atgatacaaa ggtaatggta atttttactt 121 t // LOCUS MUSIL74 172 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse interleukin 7 (IL-7) gene, exon 4. ACCESSION M29057 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 4 of 4 SOURCE Mouse (BALB/c) DNA, clones lambda-mIL7.[1,2,3]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 172) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept + 21 / 152 interleukin 7, exon 4 pre-msg < 1 > 172 IL7 mRNA and introns IVS < 1 20 IL7 intron C IVS 153 > 172 IL7 intron D BASE COUNT 56 a 34 c 32 g 50 t ORIGIN About 1.5 kb after segment 3. 1 tgactttttt cttctaatag gaagctgctt ttctaaatcg tgctgctcgc aagttgaagc 61 aatttcttaa aatgaatatc agtgaagaat tcaatgtcca cttactaaca gtatcacaag 121 gcacacaaac actggtgaac tgcacaagta aggtgagagt ggactactca ct // LOCUS MUSMFG1A 508 bp ss-mRNA ROD 15-MAR-1990 DEFINITION Mouse zinc finger protein (mfg1) mRNA, partial cds. ACCESSION M28513 KEYWORDS zinc finger protein. SOURCE Mouse (strain CD-1) adult male skeletal muscle, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 508) AUTHORS Passananti,C., Felsani,A., Caruso,M. and Amati,P. TITLE Mouse genes coding for "zinc-finger"-containing proteins: Characterization and expression in differentiated cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9417-9421 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Felsani, 29-SEP-1989. FEATURES from to/span description pept < 1 > 508 zinc finger protein mfg1 mRNA (AA at 1; put.) BASE COUNT 191 a 99 c 97 g 121 t ORIGIN 1 attcgtacag gggagaaagc cttcaagtgt gaagaatgtg gtaaggcctt taataatcgt 61 tcaaaccttt ctgaacacaa gagaatacat actggagaaa aaccctacaa atgtgaagaa 121 tgtggtaagg cctttcgtat tcgctcaaaa ctttctacac accagagagt tcatacggga 181 gaaaaaccct acaaatgtga agaatgtggt aaggctttta atagtcattc aaacctttct 241 gaacacaaga gaatccatac tggagagaaa ccttacaaat gtgaagaatg tggtaaggct 301 ttcagtactt catcaaacct ttctgaacac aagagaatcc atactggaga aaaaccgatc 361 aaatgtgaag aatgtggcaa ggccttcagt actcgctcaa catattatag acaccagaaa 421 aaccatactg gaaaaaaacc atacaagtgt gaagaatgcg ccaaagaatt ttcctaccct 481 tcattgctta aggtacatca aagaattc // LOCUS MUSMFG2A 1218 bp ss-mRNA ROD 15-MAR-1990 DEFINITION Mouse zinc finger protein (mfg2) mRNA, partial cds. ACCESSION M28514 KEYWORDS zinc finger protein. SOURCE Mouse (strain CD-1) adult male skeletal muscle, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1218) AUTHORS Passananti,C., Felsani,A., Caruso,M. and Amati,P. TITLE Mouse genes coding for "zinc-finger"-containing proteins: Characterization and expression in differentiated cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9417-9421 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Felsani, 29-SEP-1989. FEATURES from to/span description pept < 1 > 1218 zinc finger protein mfg2 mRNA (AA at 1; put.) BASE COUNT 359 a 259 c 282 g 318 t ORIGIN 1 tatgaatgta aggagtctgg gaaggccttc agactccccc aacagctcac aaggcatcag 61 aagtgtcaca gtggtgagaa gcctttcagt cataatgaag gcagacaggc ttttcaacat 121 cccaacctac tcaagtaccc taaagccatt catacaggtg ccaaagcatt tgcatgcagg 181 gaatgtggga agtccttcaa ccgtgtctcc agccttgttg aacatgggct tattcatgct 241 gatgtgaaac catatgaatg taatgagtgt gggaaagcct ttaaacgaca cagaagtttt 301 gtgcgccatc agaaaatcca ttctggtgag agaccctttc agtgtaagga ttgtgggaag 361 ggcttcattg ttctggctca cctcactcgg caccagagca gtcatagtga agagaagcca 421 tttgaatgtg aggaatgtgg caagaaattt agaactgcca gacaccttgt taagcaccag 481 cgcattcata gcggcgagaa accctttgaa tgcaatgtat gtgggagtgc tttccgcctt 541 caactatacc tttctgagca tcagaaaact cacatggaag agaaatactt ggaatgcaat 601 gtgtgtggga aggcttttcg gcttcaggat atcctttctg agcacttgaa aactcacact 661 gaagagaatc ctttcaagtg taagctctgt gggtcatcct tcccacataa gtaccaactt 721 aataagcatc tcacagttca cactgatggg aaaccctatc agtgcaagga atgtggcaaa 781 tgctttcgtc aaaggtcaaa actaacggaa catgagagta ttcacactgg caagaaaccc 841 ttccagtgtg aagcatgtgg gaagtcttta gctaacacac ttctcattca tcatcagaag 901 tctcacagtg gtgagagacc ttttgaatgt aaagaatgcg gaaaggcttt ccttcttccc 961 agtcaactta atagccataa aattgttcat acaagtaaaa gaccctttga atgcaaggta 1021 tgtgggaagt ccttcaagcg tgaatccaat ctcattcaac atggcgctgt tcatgctggt 1081 gtgaaatcat atgaatgcag tgagtgtggg aaaggtttta ttgaccgctc aagcctgttc 1141 caccatcgga aaattcattc tgatgagaaa ccatttaagt gtcaggagtg tggaaaggcc 1201 ttcgttgttc tggcgtat // LOCUS MUSMFG3A 916 bp ss-mRNA ROD 15-MAR-1990 DEFINITION Mouse zinc finger protein (mfg3) mRNA, partial cds. ACCESSION M28515 KEYWORDS zinc finger protein. SOURCE Mouse (strain CD-1) adult male skeletal muscle, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 916) AUTHORS Passananti,C., Felsani,A., Caruso,M. and Amati,P. TITLE Mouse genes coding for "zinc-finger"-containing proteins: Characterization and expression in differentiated cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9417-9421 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Felsani, 29-SEP-1989. FEATURES from to/span description pept < 1 > 916 zinc finger protein mfg3 mRNA (AA at 1; put.) BASE COUNT 281 a 186 c 200 g 249 t ORIGIN 1 cttactgtgc tggctcagct tactcggcac gagaacattc acactgaaga caaatcgttt 61 gaatgtaagc agtgtggtaa gatatttagt aatggctctt acctcttacg acactacgat 121 actcacacca atgagaaacc ctttgagtgt aatatatgtg gcaaggcttt taggcttcat 181 ctataccttt ctgagcatca gaaaactcac accgatgaga agcctttcaa gtgtaagctg 241 tgtgagtcag ccttcagacg taagtaccag ctcagtgaac atcagagaat tcacactgat 301 ggtaaaccct atcagtgcaa ggattgttgg gaattctttc gtcgacgatc aaattttatc 361 gaacatcaga gtattcacac tggaaagaaa ccctttgagt gtaaggactg tgggaaggtc 421 tttagactaa atatacatct cattcgacat cagagatttc atagtgatga gaagcccttt 481 gaatgtaaag aatgtggaaa ggcttttcat ttttccagcc agcttaataa ccataaaacc 541 agtcatacag gtcagacacc ttttgaatgt aaagaatgtg ggaagtcgtt taagcgtgtc 601 tccagccttg tggaacatag gattattcac agtggtgtga aaccatataa atgtaatgcg 661 tgtggaagag ccttcaatcg tcgctcaaac ctcatgcaac atgagaaaat tcattctgat 721 gagagaccct ttgagtgtaa ggactgtgga aaggccttca ctgttctggc tcagctcaca 781 cggcaccaga cgattcataa tggaaagaaa tcatatgaat gtgagcagtg tgggtcagcc 841 ttcagactcc cgtaccaact cactcaacat cagagaattc attatgatgt gaagccattt 901 cagtgcaagg atgtgg // LOCUS MUSMHSLPC 5362 bp ss-mRNA ROD 15-SEP-1989 DEFINITION Mouse MHC sex-limited protein (Slp) mRNA, complete cds. ACCESSION M21576 M12385 KEYWORDS antigen; major histocompatibility complex; sex-limited protein. SOURCE Mouse (strain B10.WR) liver, cDNA to mRNA, clone pSlp20B. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5362) AUTHORS Ogata,R.T. and Sepich,D.S. TITLE Murine sex-limited protein: Complete cDNA sequence and comparison with murine fourth complement component JOURNAL J. Immunol. 135, 4239-4244 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 52 5259 sex-limited protein precursor sigp 52 108 sex-limited protein signal peptide matp 109 5256 sex-limited protein mRNA < 1 5362 Slp mRNA BASE COUNT 1219 a 1494 c 1462 g 1187 t ORIGIN 39 bp upstream of BamHI site. 1 cgagcagaca gtcagaccag acaggtctga cctttcctgg atcctccagc catgcggctc 61 ctctgggggc tggcctgggt gttcagcttc tgtgcctcat ccctgcagaa gcccaggttg 121 ctcctgtttt ccccttctgt ggttaatttg gggacccccc tgtcggtggg ggtacagctc 181 ctggatgccc ctccaggaca ggaggtaaaa ggatcagtgt tcctcagaaa cccaaagggt 241 ggttcctgct ccccaaagaa ggactttaag ctgagctcgg gagatgactt tgtgctgctc 301 agccttgagg tcccactgga agatgtgagg agctgtggcc tctttgacct gcgcagagcc 361 ccccacatcc agctggtagc tcagtctccg tggctaagga acacagcttt caaagccaca 421 gagactcagg gtgtcaactt gctcttctct tcccgacgag gccacatctt tgtgcagacc 481 gatcagccta tctataatcc agggcagcgg gttcgttatc gggtctttgc actggatcaa 541 aagatgcgcc catccactga tttcctcacc atcacagtgg agaactccca tggcctcagt 601 gtactcaaga aggagatatt tacttccaca cccatcttac aagatacctt catcattcca 661 gacatctcag agcctgggac ctggaagatc tcagctaggt tctcagatgg actggagtcc 721 aataggagca cccactttga agtgaagagg tatgtccttc ccaacttcga ggtgaagatt 781 actccttgga agccatatat cctgatggtg cccagcaaca gtgatgaaat ccaattagac 841 atccaggcca ggtacatcta tgggaagccc gtgcagggcg tggcatacac acggtttgcg 901 ctcatggatg agcaagggaa gaggactttc cttcagggcc tagagaccca ggccaagttg 961 gtggaaggcc ggacccacat ttccatctca aaggaccagt tccaggctgc cctggataaa 1021 atcaatattg gggtcagaga cctggagggg ctgcgtctct atgctgctac agctgtcatt 1081 gagtctccag gaggagagat ggaggaggca gaactcacgt cctggcgctt tgtatcatct 1141 gccttttcct tggatctcag ccgcactaag cggcatctcg tgcctggagc ccacttcctg 1201 ctgcaggcct tggtcagaga aatatcaggc tctgaagcct ctaacgttat tgtcaaagtt 1261 tctgccacat tggtgtcagg ctctgattcc caagtcctca acgttcaaca gagcaccaat 1321 agaattggcc aagtcagcat ttccttcccc atcccaccaa ccgtcacaga acttcggctc 1381 ttggtgtctg cgggctccct ctacccaacg atagccaggc tcaccgtgca atccccacct 1441 tcaagaggca ctggctttct ttctattgag ccactagacc ctcggtcccc tcgtgtgggg 1501 gacaccttta tcctaaactt acaagctgtg ggcatccctg cacctacctt ctctcattac 1561 tactacatga tcatctccag aggccagatc atggctatga gtcgggaggc ccggaggacc 1621 gtgacctctg tctccgtgtt ggtggaccat cagctggctc cttcattcta cttcatggct 1681 tacttctatc accaaggaca cccggtggcc aactctctgc tcatcaacat ccagcccagg 1741 gactgtgagg gcaagctgca attgaaggta gttggtgcca aggagtatca taatggggac 1801 atgatgaagc tccaaattca aactgactcc aaagccctgg tggcactggg agctgtggac 1861 accgctctgt atgctgtggg cggttggtct cacaaacccc tcgacatgag caaggtcttt 1921 gaagtaatca acagctacaa ccttggctgt ggtcctggag gtggggatga tgcccttcag 1981 gtgttccagg atgctggtct ggccttttct gatggtgatc gactaactca aaccagagag 2041 gacctgagct gtcccaagga gaagaaaagt cggcaaaaga gaaatgttaa cttcctgaag 2101 gctcttagtg agaagttggg ccagtattct tctccagatg ccaagcgctg ctgccaagac 2161 gggatgacga agctgcccat gaagcgtacc tgtgagcagc gggctgcccg tgtgcctcag 2221 caggcctgcc gtgagccctt cttgtcctgt tgcaagtttg ctgaggacct tcgcaggaac 2281 cagaccagga gccaggcacc ccttgcccga aaggtccggg acatggtgaa tttgatagag 2341 gaagatgaca ttcttgtgcg cacctccttc ccagagaact ggctctggag agtggaacct 2401 gtagacagct ccaaactgtt gacagtgcgg cttcctgatt ctatgaccac atgggagatt 2461 catggtgtga gcctgtccaa aagcaaaggt ctgtgtgtag ccaagccaac tcgtgttcga 2521 gtgttcagaa aattccacct tcacctgcgc ctgcccatct ccgtccgccg ctttgagcag 2581 cttgaattac ggcctgttct ctacaactat ctgaatgatg acaagaacgt gagtgtccat 2641 gtgaccccag tggagggcct gtgcatggct ggtggtggaa tgatggccca gtgggtgata 2701 gtgcctgcag gctctgccct gcctgtggcc ttctctgtgg tacccacagc ttctaccaac 2761 gtgcccctga aattggtggc taaagggact ttggattcaa gcgattctgt gtctaagatt 2821 ctccaaattg agaaggaagg agccatccac agagaagaga tagtctacaa cctcgacccc 2881 ctaaataacc taggtcagat gttggagata cctggcagct cggatcccaa catagtccct 2941 gaaggagact tcagcacctt tgtcaaggtt acagcctctg aacccttgga gactctgggc 3001 tctgaaggag ctttgtcccc aggaggcgtg gcctcccttc tgaggcttcc ccggagctgc 3061 gcagagcaaa ccatgatcta tttggctcct accctgactg cttccaacta cctggacagg 3121 acaaaacagt ggagcaaact gtcccctgag accaaggacc atgctgtgca tctgatccag 3181 aaaggccacg ttcggatcca gcagtttcgg aagaaagatg gctcctttgg ggcttggtta 3241 caccgggaca gcagcacctg gctgactgcc tttgtgctga agattctgag tttggcccag 3301 gaacaggtgg gcaactcccc ggagaagctg caggagacgg ctagctggct gctggcccag 3361 cagctgggtg atggctcctt ccacgaccca tgtccagtca tccacagagc aatgcagggg 3421 ggcttggtgg ggtccaatga gacggtggca ctgaccgcct ttgtggtcat tgcccttcac 3481 cacgggctga acgtcttccg ggaggggcat gcgaagcagc tgaagaacag agtggaagcc 3541 tccatcacca aggcaaactc tttcttgggg cagaaggcaa gtgctgggct cctgggtgcc 3601 catgccgccg ccatcacagc ctatgccctt acgctgacca aggcctcgga ggacctgcgg 3661 aatgttgccc acaacagcct gatggccatg gctgaggaaa cgggggaaaa cctctactgg 3721 ggcttagtcc ttggctctca ggacaaagtt gtgttgcgcc ccgcagcccc ccgtagccca 3781 acagaacctg tgccccaggc cccagccttg tggatcgaaa ccacagccta tgccctgctc 3841 cacctgcttc tgcgggaggg aaagggaaaa atggctgaca aggctgcatc ctggctcacc 3901 caccagggaa acttccacgg ggcattccgc agtacccagg acactgtggt caccctggat 3961 gccctgtctg cctactggat cgcttcgcac accactgagg agaaagcact gaacgtgacg 4021 ctcagctcca tgggccgcaa tgggctcaaa acccacgtgc tacacttgaa caaccaccaa 4081 gtcaagggcc tggaggagga gctgaagttc tccctgggca gcacaatcag tgtcaaggtg 4141 gaaggaaaca gcaaaggcac cttgaagatc cttcgtacct acaacgtcct ggacatgaag 4201 aacaccacat gccaggacct tcagatagaa gtgaaggtca cagacgctgt ggaatatgca 4261 tggagcgcct acgaagacta tgaagacgac tataacatgc cagccacaga tgatcccagc 4321 gttcccttgc agcctgtcac gcccctgcag ctatttgagg gtcgttggag ccgccgcagg 4381 agggaggccc ccaaggtggc tgaagagcgg gagtccagag ttcactacac tgtgtgtatc 4441 tggcacaatg gcaagctggg gctgtctggc atggccatcg cagacatcac cctcctgagt 4501 ggattccacg ccctgagggc tgacctggag aagctgacct ctctctctga ccgttatgtg 4561 agtcactttg agactgacgg gccccatgtc ctgttgtact ttgactcggt ccctaccacc 4621 cgggagtgtg tgggctttgg agcctcacag gaggtggttg tgggactggt gcagccagcc 4681 agtgctgtcc tgtatgacta ctacagccct gatcacaagt gctctgtgtt ttatgctgca 4741 cccaccaaga gccagctcct ggccacactg tgctctggag atgtatgcca gtgtgccgag 4801 gggaagtgcc ctcgactgct aaggtcactg gagcgaaggg tggaggacaa ggacggctac 4861 cggatgaggt tcgcctgcta ttatcaccaa gtggagtatg gcttcacggt taaggttctt 4921 cgagaagatg gcagagctgc cttccgtctc tttgagtcca agatcaccca agtcctgcat 4981 ttcagaacgg acaccatggc ctccataggt cagacccgca acttcctgag ccgggcctct 5041 tgccgccttc gtttggagcc taacaaagag tacttgatca tggggatgga cggggaaacc 5101 agtgacaaca agggagaccc ccagtacttg ctggactcaa atacctggat tgaggagatg 5161 ccttcagaac aaatgtgcaa gagcacccgc catcgggcag cctgtttcca gctcaaagat 5221 ttcctgatgg agttcagcag ccgggggtgc caggtgtgag gccttaggac tctggctctc 5281 tgagctcagc tcagggtcag ggcctcgctg gatgaggggc tctgctctac agggtaaata 5341 aaagaaaagc tttttgacag cc // LOCUS MUSNCAML1 3783 bp ss-mRNA ROD 15-MAR-1990 DEFINITION Mouse mRNA for neural cell adhesion molecule L1 (N-CAM L1). ACCESSION X12875 KEYWORDS glycoprotein; neural cell adhesion molecule; transmembrane protein. SOURCE house mouse (Mus musculus). ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3783) AUTHORS Moos,M., Tacke,R., Scherer,H., Teplow,D., Frueh,K. and Schachner,M. TITLE Neural adhesion molecule L1 as a member of the immunoglobulin superfamily with binding domains similar to fibronectin JOURNAL Nature 334, 701-703 (1988) STANDARD simple automatic COMMENT *source: developmental stage=8-day old; tissue=brain; *source: library=lambda gt10 and lambda gt11. FEATURES from to/span description pept 1 3783 N-CAM L1 (AA 1 - 1260) BASE COUNT 982 a 1038 c 997 g 766 t ORIGIN 1 atggtcgtga tgctgcggta cgtgtggcct ctcctcctct gcagcccctg cctgctcata 61 cagattccag acgaatataa aggacaccat gtgctagagc cacctgtcat cacggaacag 121 tctccacggc gcctggttgt cttcccaaca gatgacataa gcctgaaatg tgaagccaga 181 ggcagacccc aagtggagtt ccgctggacg aaagatggca tccacttcaa acccaaggaa 241 gaattgggtg tagtggtgca tgaggcaccc tattctggct ccttcaccat cgaaggcaac 301 aacagctttg cccagaggtt tcagggcatc tatcgctgct atgccagcaa taagctagga 361 actgccatgt cgcatgagat ccagctcgtg gccgagggtg cccccaagtg gccgaaggag 421 actgtaaaac ctgtggaagt ggaggaagga gaatcagtag ttctgccttg caaccctcca 481 cccagtgcag ccccacctag gatctactgg atgaacagca agattttcga catcaaacaa 541 gatgagcggg tgtccatggg ccagaatgga gacctatatt ttgccaatgt gcttacctca 601 gacaatcatt cagactacat ctgcaatgcc cacttccctg gtacccggac catcattcaa 661 aaggaaccta ttgacctccg ggtcaagccc accaacagca tgattgaccg gaagccacgt 721 ctgctctttc ccacaaactc cagcagccgc ctggtagcct tgcagggcca gtcattgatc 781 ctggagtgca ttgctgaggg attccctaca cccaccatca agtggctgca ccccagtgac 841 ccaatgccaa cagaccgtgt tatctaccaa aaccacaaca agaccctgca actactcaat 901 gtgggcgaag aggacgatgg cgagtatacc tgccttgctg agaactcgct gggcagtgcc 961 cggcatgcct actatgttac tgtggaagct gccccatatt ggctgcagaa gccccagagc 1021 catttgtatg gtccaggaga gactgcccgc ctagactgcc aagtccaggg caggccccaa 1081 ccagagatca cttggagaat caacggaatg tctatggaga cggtgaacaa ggaccagaag 1141 taccggattg agcaggggtc tctgatcttg agtaacgtgc agccaactga cacaatggtg 1201 acccagtgtg aagcccgcaa ccagcatggg ctcctgctag ccaatgccta catttatgtt 1261 gtccagctgc cagccaggat cctaacaaaa gacaatcaga catacatggc agttgagggc 1321 agtactgctt acttgctgtg caaagccttt ggagctcctg ttcccagtgt ccagtggctg 1381 gatgaagaag gaaccacagt gcttcaggat gaacgatttt tcccctatgc caatggaacg 1441 ctgagcatca gagacctcca ggccaatgac actggacgct atttctgcca ggctgccaat 1501 gaccagaaca atgtgaccat tttggctaac ctacaggtta aagaagcaac ccagatcaca 1561 caggggcccc ggagcgcaat tgagaagaaa ggtgcaaggg tgacattcac gtgccaggcc 1621 tcctttgacc cctctttgca ggccagcatc acttggcgtg gagatgggag agacctacag 1681 gaacgtgggg acagtgacaa gtatttcata gaagatggga aactagtcat ccagagcctg 1741 gactacagtg accagggcaa ctacagttgt gtggccagca ctgaactgga tgaggtggag 1801 agcagggcac agctcttagt ggtggggagc cctgggccag tgcctcacct ggagctgtcc 1861 gaccgccacc tgctgaagca gagccaggtg cacttgtctt ggagccctgc tgaagaccac 1921 aactctccca ttgagaagta tgacattgaa tttgaggaca aggaaatggc tcctgagaaa 1981 tggttcagtc tgggcaaggt gccaggaaat cagacctcta ctaccctcaa gctgtccccc 2041 tatgtccact acacctttcg ggtcactgcc attaacaaat atggtcctgg agaacccagc 2101 cctgtctctg agagtgtggt cacacctgag gcagccccag agaagaaccc tgtggatgtg 2161 agaggggaag ggaatgagac caacaatatg gtcatcacat ggaagcccct tcggtggatg 2221 gattggaatg ccccccagat tcagtaccgt gtacagtggc gtccacaggg caagcaggag 2281 acctggagga aacagaccgt gagcgaccct ttcctggtgg tgtctaacac ttccacattt 2341 gtgccttatg agatcaaagt ccaggcagtg aacaaccagg gcaagggccc tgagccccag 2401 gtcaccattg gctattcagg ggaagactac ccccaggtga gccctgaact tgaagacatc 2461 acaatcttca actcaagtac tgtgcttgtc aggtggaggc ctgtggactt ggcccaggtt 2521 aagggccacc tcaagggata caatgtaaca tactggtgga agggcagcca gagaaagcac 2581 agcaagaggc atatccacaa aagccacata gtggtacctg caaataccac cagtgccatc 2641 ctcagtggtt tgcgccctta cagctcttac catgtggagg tacaggcctt taatgggcgg 2701 ggcttggggc ctgcgagtga atggaccttc agcaccccag agggagtgcc tggccaccct 2761 gaggcattac acctggagtg tcagtcggac actagtctgc tactgcactg gcagccacca 2821 ctcagccaca atggagtgct cactggctac ctgctctctt accatcccgt ggaaggggaa 2881 agcaaagagc agttgttctt caacctttcg gacccagaac tccggactca taatctgacc 2941 aacctcaacc ctgatctaca gtaccgcttc cagcttcagg ccaccaccca acaggggggt 3001 cctggtgagg ccatcgtgcg tgaaggaggc accatggccc tgtttggcaa gccagatttt 3061 ggcaacatct cagccacagc aggtgaaaac tacagcgtgg tctcctgggt ccctcggaag 3121 ggccagtgca atttcaggtt ccatatcttg ttcaaagcct taccagaagg gaaagtgagc 3181 cctgatcacc agcctcagcc tcagtatgtc agctacaatc agagctccta cacacaatgg 3241 aacctacagc ctgacaccaa atatgagatc cacctgataa aggagaaggt cctcctgcac 3301 catctggatg tgaagactaa tggaactggc cctgtgcgag tttctactac agggagcttt 3361 gcctccgagg gctggttcat cgcctttgtc agcgctatca ttctcttgct cctcatcctg 3421 ctcatcctct gcttcatcaa acgcagcaag ggtggcaaat actcagtgaa ggacaaggag 3481 gacactcagg tagattccga ggcccggccc atgaaagacg agaccttcgg cgagtacagg 3541 tccctggaga gtgacaatga agagaaggcc tttggcagca gccagccatc tctcaacgga 3601 gacatcaaac ccctaggcag tgatgacagc ctggctgatt atgggggcag tgtggacgtc 3661 cagttcaatg aggatggctc tttcatcggc cagtacagtg gcaagaaaga gaaggaggca 3721 gcaggaggca atgacagttc aggggctacc tctcctatca atcctgcagt agccctagaa 3781 tag // LOCUS MUSPBGD1 2663 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 1. ACCESSION M29949 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 1 of 4 SOURCE Mouse (C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2663) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene JOURNAL J. Biol. Chem. (1989) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept 505 537 porphobilinogen deaminase (housekeeping) exon 1 (EC 4.3.1.8) 2191 2307 porphobilinogen deaminase (housekeeping) exon 2 (EC 4.3.1.8) 2457 + 2510 porphobilinogen deaminase (housekeeping) exon 3 (EC 4.3.1.8) pep$ 2475 + 2510 porphobilinogen deaminase (erythroid sp.) exon 1 pre-msg 341 > 2663 PGB deaminase (hk) mRNA and introns IVS 538 2190 PGB deaminase (hk) intron A (no splice consensus) IVS 2308 2456 PGB deaminase (hk) intron B IVS 2511 > 2663 PGB deaminase (hk) intron C pre-msg 2192 > 2663 PGB deaminase (ery.sp.) mRNA and introns IVS 2511 > 2663 PGB deaminase (ery.sp.) intron A binding 1126 1133 NPE binding site binding 312 317 Sp1 binding site binding 329 334 Sp1 binding site site 2101 2107 CACCC box site 2118 2124 CACCC box BASE COUNT 622 a 661 c 707 g 673 t ORIGIN Chromosome 9 1 ccacccccac cccacacaca cacacaaagt aaatagggct ggagagctta gtggttaaga 61 gcactgactg ctctttcaga ggtcctgagt tcaattccca gaaaccacat ggtgctcaca 121 accatctgca atagggtctg atgccctttt ctggtgtgtc taaagaagag agcaatggtg 181 tactcatata cataaaataa ttttttttaa aaagtaaaag ataataaaaa ttgaaaagga 241 aaaaaatctt tttgagttgt tctgtgcagt ggacttgagc gaaaaggctg gctatgtcgc 301 aatcctaatt cccgcccaga ggaaggcacc gccccgttga gggagggcag cggacgtgac 361 gcagagctca gcaggtcctg cagccggagt gaagtgcggg ctcgggcccc atgtgccttc 421 agtcccggcc ggcccaggtc gtcggcttct gcagacacca ggggaccgca gcggcactgc 481 cgcgcctgcg ccctgggcgg agtcatgtcc ggtaacggcg gcgcggccac aaccgcggtg 541 agttctgagc cggtgaccga tgacccgcac ttctcggggc tttctgggtg caacgattgg 601 ccccgggttg ccatgttctc gtcgtctatt ggtcggaata gttagctgtc atttttcccc 661 ccccacacct caaggttttt tttaaagggc cagtaactag gttgccctaa ggcagggaag 721 gagtgatctc gagcagtggg ggcggggttg tgagtggaaa ggtggtccgc cctgggattc 781 catccctgta ggctctggct ggatctctgt tgttcccgac cagtaaagga ttatgcacag 841 acaagatcct tttcacgaag aaggggctga ggcaaatcca gctatctcgg aatacgatcc 901 acttcattca ggggagagca caccccactt cttaaaactg tatacaaaca tcttggaggt 961 tacacgcctt ctcccgttct ccgttatgaa gtcacccagc cttagccacc cacaaaagtc 1021 ctagtagaga cacacctgaa ttgctattgt gagcggggga acccacccct gggccttgtc 1081 atttctggcc tgcctggaaa gttctgaact tgtgggcagg ctgcctgaga taaggctgag 1141 ctgggaagct tgcttatctc ctgcccaggc agtaagcagt agtcttggct atgaaaacat 1201 ttttagagca ctgggttagg gtaggaaggc ctggatttca gcacccactt tctgtctgtt 1261 catagctgtg agatgtttag acagtaattt gaccactctg catctttgct tctgtgacac 1321 gggtggaagt acctaccctg tctaacctag tagggttgtt gcaaggacaa tatgcagaca 1381 ctgctcaaat gctgttctgg gtcaatcaat taaaaaacaa attgtttgaa cttagcaatt 1441 cctttctatg ggctccctgt tgtccgaaat ttctgtgtta tttcaagccc agctaaattg 1501 caaaggctat ctcagagtcg tttgttggag gaatcttcgc agtggagtag actggagtcc 1561 aagagcaagt tttcaccttc agtgaccaag aacttgagtg tctggttata gaagaacctg 1621 tgagatgagg aacctggtgc agggaagggg gacaatctgt acagtgactc ctgtcccctt 1681 tgtatcagac tgcagaaccc agttctacct gcttggccct agacaccttt atccaaggcg 1741 ccttaacaaa agaaagaggt gtgtcctttt gagctcttgg ctctggctta agacaccaga 1801 ggaaacccgt aggcaatgac tgttaggcag tttattcttg tagtcttctg ggacttcttg 1861 aggcatgagg tggcctttaa tttaacaagc ccttgatggg atgatgttcc caaagtcacc 1921 caccaagggc atgaaagggc tgtacattag cttggttgat ttcagtcctt gttaggagta 1981 catcctggtg tctcacccag ggcttagtga ggccttctca agtgcctgag ttgttgtgga 2041 cagtgagctt gttctctagc aatgggaggc ttcagctgtc ctgccccagc ttctgtaggc 2101 cccaccctcc agcagggccc accctcactg tgccgaggct gatgggcctt atcattttgc 2161 ccacctggct gtgtgcagcc ctcccactca gaacctcctt ggccaggctg ggctttgggg 2221 ctcagtgtcc tgttgctgct gccacaacag atcctattac agcttttctt ctggtcttgc 2281 ttctctggat cccgtagagg gcagaaggta ccaaggaaga ttcaaggacc agtcctggga 2341 gtctctcctt cctagcagcc tcacctgcct aggacccggg agtcctctct cctaagcctg 2401 tgatcctagt tctttgaatg aggaaaagat cgtaacctag ggactttctt ctgcaggaag 2461 aaaacggctc aaagatgagg gtgattcgag tgggcacccg taagagccag gtgagtacag 2521 acatagcgcg ttgcctcaag aattgtaatg ctcacgggtc actagtggga accaaaggct 2581 agcatcgagc aaataagagt gtgtgagagt cgatttcatg ggggatggca gctcacttcc 2641 tctgaaaaga gagtctctgg agc // LOCUS MUSPBGD2 2761 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 2. ACCESSION M29950 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 2 of 4 SOURCE Mouse (C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2761) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene JOURNAL J. Biol. Chem. (1989) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept + 50 122 porphobilinogen deaminase (housekeeping), exon 4 (EC 4.3.1.8) 485 534 porphobilinogen deaminase (housekeeping), exon 5 (EC 4.3.1.8) 617 672 porphobilinogen deaminase (housekeeping), exon 6 (EC 4.3.1.8) 981 1058 porphobilinogen deaminase (housekeeping), exon 7 (EC 4.3.1.8) 1969 2046 porphobilinogen deaminase (housekeeping), exon 8 (EC 4.3.1.8) 2141 + 2216 porphobilinogen deaminase (housekeeping), exon 9 (EC 4.3.1.8) pep$ + 50 122 porphobilinogen deaminase (erythroid sp.) exon 2 485 534 porphobilinogen deaminase (erythroid sp.) exon 3 617 672 porphobilinogen deaminase (erythroid sp.) exon 4 981 1058 porphobilinogen deaminase (erythroid sp.) exon 5 1969 2046 porphobilinogen deaminase (erythroid sp.) exon 6 2141 + 2216 porphobilinogen deaminase (erythroid sp.) exon 7 pre-msg < 1 > 2761 PGB deaminase (hk and ery.sp.) mRNA and introns IVS < 1 49 PGB deaminase (hk) intron C; ery.sp. intron A IVS 123 484 PGB deaminase (hk) intron D; ery.sp. intron B IVS 535 616 PGB deaminase (hk) intron E; ery.sp. intron C IVS 673 980 PGB deaminase (hk) intron F; ery.sp. intron D IVS 1059 1968 PGB deaminase (hk) intron G; ery.sp. intron E IVS 2047 2140 PGB deaminase (hk) intron H; ery.sp. intron F IVS 2217 > 2761 PGB deaminase (hk) intron I; ery.sp. intron G BASE COUNT 703 a 605 c 725 g 728 t ORIGIN Chromosome 9; approximately 147 bp upstream of segment 1. 1 aaggcagtgg ccaggggagg tgagaaacca tctgactctc tttccccagc tggctcgcat 61 acagaccgag actgtggtgg cgatgctgaa agccttgtac cctggcatac agtttgaaat 121 cagtaagttt tcttgagagg agtgattggt agtgaacggg aagccagtga accggaggac 181 agggcatctc tcgtttgcct gtggtcaaag cctgccttgt aagactattc tggctgcttg 241 tgaagggaaa gaaagattgt ctcctgtgca catctcctcc agctgcccgg gctagcctga 301 catttccata ctttctgctt tgggttcttt tatgagtatg tctgcttttt ctgtcggtgt 361 gtgtatctga gagagttagg ggctgggtct tctatgcctc agactccact gtgaatccag 421 tcaaggcctg aacgaggggt gactcagtag gtgttaatgg gtatctgatt gactctctcc 481 tcagttgcta tgtccaccac gggagacaag attgttgata ctgcactctc taaggtaacg 541 ccagtccttg tcccattctt cttgtccctc tcccacgtgt aaggggttca ctctgaggct 601 ctctcttgcc tggcagattg gagagaagag cctgtttacc aaggagctag aaaacgccct 661 ggaaaaaaac gagtgagtga ggatggagga atgtggtacc ccgagcctag aaccccaaag 721 tggctctcca atattggcag gattgtcggg ttagactgtg gagctcacag gctttcacag 781 agaagagagc cttgccttgg agtagcctaa ctacctgggg aatcagactg ccgggggaaa 841 ggggtagagt agttgagaag agaccaggtc ttagatctta agatgctatc ttcctgaacg 901 gtcaaggatg ctggggtggg tggtggagat aaggtcacct actcaaagcc tctctctgtg 961 cctccccctg ccgtctccag agtggacctg gttgttcact ccctgaagga tgtgcctacc 1021 atactacctc ctggctttac tattggagcc atctgcaagt aagcggggag gacatgcatg 1081 ggacggaggg ccctgggcag gattaatcct actgtgggaa tctttgagtt tttttttttt 1141 ttttttccat ttggaactta accgcttagc cgtctgtttt gaaggttctc agacatagtg 1201 tggcaggaaa gccaattggt tgacttggtt gactatttag agtttgtgga gttgggctca 1261 gtggcacgga cctgaaatcc cagctactgg gaggctaaga caggatcaga gattctgggc 1321 cagcctgggc tacagaggga tttgaaccag cctgaggaac ttagattgtg ccttaggggc 1381 acagaaggct ggcttacagt ggcttaggtg gtaaaggcat tttttgctgt caagccaatg 1441 acctgagttc agtccgtggg gtgcacttgg tgaaagaaga gggttgaatc ccacaagttg 1501 tcatctgact catgcataca tgctgtagaa tgtttatgct cctcatccct caatgaaaat 1561 ggaaacaatc aaggaaatga aatataaaac ctgctgggtg gtggtgcgca cgcctataat 1621 cccagcactt gggaggcaga ggcaggtgaa ttcaacctgg tctacaaagt gagttccagg 1681 actatacaga gaaacccagt cttaaaaaca aaacaaaact aaacaacaac aacaacaaca 1741 acaaaaaaga aaaaacaaag aaagaaatat aaaacctttc caaagaaaat aaaatgaatt 1801 tggcctggtg gctcatgcta taatctcagc attcagagag ctgaggcagg agggttattg 1861 tgagttaaag gctagctggg gtacagagaa aattttaggt cacctgggct agagttaacc 1921 ctatctccaa atgctaatac ctttatttca tcatcatttg ctttgcagac ggcaaaaccc 1981 ttgtgatgct gttgtctttc acccaaagtt tattggaaag accctggaaa ccttgccaga 2041 gaaaaggtga gtgggcctag tgtgcggggg agagaggcct ggacagtgga gaacagttgg 2101 cagcctgggt taagtttaat tctaaactct ctctgagcag tgccgtggga accagctctc 2161 tgaggagagt ggctcagcta cagagaaagt tccccaacct ggaattcaag agtattgtat 2221 cctttcagaa gaaggagggg aaaaagaggg aaagaaggac cttccgaagc aagtggtcca 2281 tgcggtcagg gggtcgtctt tccatctgtc cgtccaccca cccacccacc catccatcca 2341 tccatccaca catccacagt cctttaatgt tttgcttttt tttttttttc ctgagacagg 2401 gtttctctgt gtagcctggc tgtcctggaa ctcactttgt agaccaagct ggcctgcaaa 2461 gtgagaaatc cgcctgcctc tgcctcctga gtgctgggat taaaggtgcg ccaccactgc 2521 ctggcaccct aatgtttttt aaactcaggc ctggcaatga ggacaatttg caaaacaaac 2581 atggttcctt gttctataca gctgacatgt tagacagaca ggcaggcact gcagatactg 2641 accggtgacc actcctggtg cagggacaga ggcgcttctg cttttacttt ctgtgctgct 2701 aagtggtttt ggtttttaca gtgaatatgt gatatgtttc ataaaagtaa tttttttttc 2761 t // LOCUS MUSPBGD3 800 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 5. ACCESSION M29951 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 3 of 4 SOURCE Mouse (C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 800) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene JOURNAL J. Biol. Chem. (1989) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept + 402 515 porphobilinogen deaminase (housekeeping), exon 10 (EC 4.3.1.8) 739 + 777 porphobilinogen deaminase (housekeeping), exon 11 (EC 4.3.1.8) pep$ + 402 515 porphobilinogen deaminase (erythroid sp.) exon 8 739 + 777 porphobilinogen deaminase (erythroid sp.) exon 9 pre-msg < 1 > 790 PGB deaminase (hk and ery.sp.) mRNA and introns IVS < 1 401 PGB deaminase (hk) intron I; ery.sp. intron G IVS 516 738 PGB deaminase (hk) intron J; ery.sp. intron H IVS 778 > 790 PGB deaminase (hk) intron K; ery.sp. intron I BASE COUNT 181 a 201 c 182 g 236 t ORIGIN Chromosome 9; approximately 477 bp upstream of segment 2. 1 ctgtacccca gctagccttt aactcacaat aaccctcctg cctcagctct ctgaatgctg 61 agattatagc catgagccac caggccaaat tcattttata tttctttctt tcttttttct 121 tttttgttgt tgttgttgtt gttgtttagt tttgttttgt ttttaagact gggaaactct 181 gtatagtcct ggaactcact ttgtagacca gatttagcct tgaattcatg gagatctgta 241 tctgcctcca gtgctgggat ttaaaggtgt atacaccacc actcaacaaa aacacaacaa 301 aaacaaaagt tttttaaaag ttagctagag gggggaaaag agactgtggg gcagagggtg 361 cactgggtag gtcttgactt ctccttagca acgctccaca gcggggaaac ctcaacaccc 421 gccttcggaa gctggatgag ctgcaggaat tcagtgccat tgtcctggct gtggctggcc 481 tacagcgcat gggctggcag aaccgggtgg gccaggtagg agctgccctg ttctgcttcc 541 cattgaatct gcctctctcc tgccttgatt tcttggtgac cattctgcca acaacactac 601 aaccagaagc ccaggctagg gatattggga ctcattgctg gatttcctac ctgtgccttc 661 cccaggcttc ctagattgca aaccctagct cactgccttt gaacatcccc tatcccacca 721 tcttgtctct ctccacagat tttgcaccca gaggaatgca tgtatgctgt gggtcaggta 781 ggtaggtttg cctggagaga // LOCUS MUSPBGD4 1386 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 4. ACCESSION M29952 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 4 of 4 SOURCE Mouse (C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1386) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene JOURNAL J. Biol. Chem. (1989) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept + 154 273 porphobilinogen deaminase (housekeeping), exon 12 (EC 4.3.1.8) 459 512 porphobilinogen deaminase (housekeeping), exon 13 (EC 4.3.1.8) 618 704 porphobilinogen deaminase (housekeeping), exon 14 (EC 4.3.1.8) 783 956 porphobilinogen deaminase (housekeeping), exon 15 (EC 4.3.1.8) pep$ + 154 273 porphobilinogen deaminase (erythroid sp.) exon 10 459 512 porphobilinogen deaminase (erythroid sp.) exon 11 618 704 porphobilinogen deaminase (erythroid sp.) exon 12 783 956 porphobilinogen deaminase (erythroid sp.) exon 13 pre-msg < 1 1279 PGB deaminase (hk and ery.sp.) mRNA and introns IVS < 1 153 PGB deaminase (hk) intron K; ery.sp. intron I IVS 274 458 PGB deaminase (hk) intron L; ery.sp. intron J IVS 513 617 PGB deaminase (hk) intron M; ery.sp. intron K IVS 705 782 PGB deaminase (hk) intron N; ery.sp. intron L BASE COUNT 372 a 305 c 343 g 366 t ORIGIN Chromosome 9; approximately 77 bp upstream of segment 3. 1 atcagtagtt cctgaaacct gttcatacct tgcacctcta tccatcaata atgttaaaga 61 caggtttgtt gttatgcata acccaggaag cagtagaggt gtgtttctca tcttagctct 121 attactagag aagaacagcc tgttgttctt tagggggccc tagccgtgga agtccgagcc 181 aaggaccagg atatcttgga cctagtgagt gtgttgcacg atcctgaaac tctgcttcgc 241 tgcattgctg aaagggcttt tctgaggcac ctggtaagat gggctcctcc catggtgttg 301 tggggaaacc aggaagggca gtagggaggg agatttgtca agtactcagt atgtaatgtt 361 ttgtatgtat ggagaggacc ttgatctggc ctcttgaggt ctgtggtcaa aagtggtgtt 421 aaaggccctt agagctcaaa ggaacaatat cattgcagga aggaggctgc agcgtgcccg 481 tagcagtgca tacagtgata aaggatgggc aagtaagcca gggaaatgga tgaggggagg 541 gactgtcatt tccatgtgca cccaaacatc taagtaactt tctttaaaca tcctggtaca 601 aacattttat ttcctagctg tacctgactg gtggagtatg gagtctagat ggctcagata 661 gcatgcaaga gactatgcag gccaccatcc aggtccctgt tcaggtattg actgggagat 721 gaggaggaat aaatagaact cttgtaatct tcctcttacc aaaattgtaa cctgtcatcc 781 agcaagaaga tggtccagaa gatgacccac aactggttgg aatcactgcc cggaacattc 841 caagaggagc ccagctagct gctgagaacc tgggcatcag cctggccagc ttgctgctca 901 acaaaggagc caagaacatc ctggatgttg cacggcagct taatgatgtg cgctaactgg 961 tctgtagggc acaggaaccc tggctgccac tccagtgcct acttctggct tccaagtgcc 1021 ctgtgctcca tccctagggg tgtgattatc ccaggaaatt gaaccacagg gttgttgaga 1081 cttccacttt ggaagatatg cctcaccttg gggcctccat atctgccttt ccctcagtag 1141 ttgggggctt catctcttta gagaaagtcc atgccaatct ttgaatgtaa ccaataccac 1201 taataaacca gtttagaatg tggttcttct gatagagttg gggaagatat gaataaaccc 1261 aaagcccttt taaacttgaa tgagtctgag acctttctgt tgtaaaacac gctgtgattt 1321 gcctcatgtt ctcaaaaaaa aaaaaaaaaa tcagccttta attcctacag cctgtcttca 1381 gtcgac // LOCUS MUSPHKGB 1012 bp ss-mRNA ROD 15-MAR-1990 DEFINITION Mouse skeletal muscle phosphorylase kinase alpha subunit mRNA, partial cds. ACCESSION M28867 KEYWORDS phosphorylase kinase; protein kinase. SOURCE Mouse (strain ICR Swiss White, adult) mixed fiber skeletal muscle from hind limb, cDNA to mRNA, clone alpha-1.0Kb. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1012) AUTHORS Bender,P.K. and Lalley,P.A. TITLE The I/Lyn mouse phosphorylase kinase deficiency: Mutation disrupts expression of the alpha/alpha' subunit mRNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9996-10000 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Bender, 05-OCT-1989. FEATURES from to/span description pept < 1 > 1012 phosphorylase kinase alpha subunit (AA at 1) (EC 2.7.1.3.8) BASE COUNT 280 a 236 c 229 g 267 t ORIGIN Chromosome X. 1 aattcactac tgcccagggc ttcaacatcc aaagaagttg atgccagtct gctctcagtg 61 gtctctttcc cagcctttgc tgtagaggac agccatttgg tggagctcac caaacaggag 121 atcatcacca agcttcaggg tcgttatggt tgctgtcgtt ttctgcgaga tggatataaa 181 actcctaaag aggatcccca acgcctatac tataacccag ctgagctgaa gctatttgaa 241 aacattgagt gcgaatggcc attgttctgg acatacttta tccttgatgg gatcttcagt 301 ggcaacgtag aacaggttca agaatataga gaggctcttg atgcagtcct catcaagggc 361 aaaaatggag tccctcttct tccagagctg tacagtgtcc ctcctgacag ggttgatgaa 421 gagtatcaaa atccccacac tgtggatcga gtccctatgg gaaaattgcc tcacatgtgg 481 ggtcagtctc tatacatttt aggaagcttg atggcagagg gatttttagc tcctggagaa 541 attgatcccc tgaatcgtag gttttctact gtgccaaagc cagatgtggt ggttcaagtc 601 tccattctgg ctgaaacaga agaaatcaag gccattttga aggacaaagg aattgatgtg 661 gagaccattg ctgaagtgta ccccataaga gtacagccag ctcgtattct cagccatatt 721 tattctagtc taggatgcaa cagtagaatg aaactcagtg gacgacccta caggctcatg 781 ggtgtgcttg gaacatcaaa actttatgac attcgcaaaa ctatctttac tttcactcca 841 cagtttatag accagcaaca gttctacctg gctctggaca accagatgat agtagaaatg 901 ctcagaacag acctttccta cctctgtagc cgctggagga tgacaggcca gcccacgatc 961 actttcccta tctcgcacac catgcttgat gaagatggaa ccagcttgaa tt // LOCUS MUSTCBVE 389 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse germline T cell receptor V-beta-13g gene V-region, 2 partial exons. ACCESSION M25914 SOURCE Mouse T lymphocyte DNA, clone V-beta-13g. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 389) AUTHORS Morahan,G., Allison,J., Peterson,M.G. and Malcolm,L. TITLE Sequence of the V-beta-13 gene used by an influenza-specific T cell JOURNAL Immunogenetics 30, 311-313 (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by Grant Morahan, 05-JUL-1989. FEATURES from to/span description pept < 1 23 T cell receptor V-region, exon 1 (AA at 2) 350 > 389 T cell receptor V-region, exon 2 IVS 24 349 TCR intron BASE COUNT 87 a 91 c 89 g 122 t ORIGIN 1 cagtgttctg ttccttgaca caggtaagtt tttagaaaaa cagtctccgt tttgcatttc 61 tctggggtgt gtgtgtgtga ttatttcaat tgttcccatt ttgaaacagt ctgatctttt 121 cccacagtac tgtctgaagc tggagtcacc cagtctccca gatatgcagt cctacaggaa 181 gggcaagctg tttccttttg gtgtgaccct atttctggac atgataccct ttactggtat 241 cagcagccca gagaccaggg gccccagctt ctagtttact ttcgggatga ggctgttata 301 gataattcac agttgccctc ggatcgattt tctgctgtga ggcctaaagg aactaactcc 361 actctcaaga tccagtctgc aaagcaggg // LOCUS MUSTYR 1838 bp ss-mRNA ROD 15-MAR-1988 DEFINITION Mouse tyrosinase mRNA, complete cds. ACCESSION D00131 KEYWORDS melanin; melanocyte; tyrosinase. SOURCE Mouse melanocytes cell line TM10 mRNA, cDNA to mRNA, clones Tyrs-33, Tyrs-J2, and Tyrs-J19. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1838) AUTHORS Yamamoto,H., Takeuchi,S., Kudo,T., Makino,K., Nakata,A., Shinoda,T. and Takeuchi,T. TITLE Cloning and sequencing of mouse tyrosinase cDNA JOURNAL Jpn. J. Genet. 62, 271-274 (1987) STANDARD simple automatic COMMENT Tyrosinase is the key enzyme for melanin,synthesis in mammalian melanocytes and has been considered to be a unique marker for the study of melanocyte differentiation. A cDNA library was constructed from poly(A)+ mRNA from mouse melanocytes and screened using anti- tyrosinase antiserum and oligonucleotide probes corresponding to amino acid sequence of tyrosinase. sequencing of some cDNA clones positive in these screenings gave a nucleotide sequence of 1838 nucleotides including a open reading frame of 1344 nucleotides that was found to correspond exactly to the amino acid sequence of the cyanogen bromide fragments of tyrosinase. FEATURES from to/span description pept 65 1411 mouse tyrosinase signal 1735 1740 polyadenylation signal BASE COUNT 505 a 397 c 399 g 537 t ORIGIN 64 bp upstream of the tyrosinase initiation codon. 1 ggaaaagaag tctgtgacac tcattaacct attggtgcag attttgtatg atctaaagga 61 gaaaatgttc ttggctgttt tgtattgcct tctgtggagt ttccagatct ctgatggcca 121 ttttcctcga gcctgtgcct cctctaagaa cttgttggca aaagaatgct gcccaccatg 181 gatgggtgat gggagtccct gcggccagct ttcaggcaga ggttcctgcc aggatatcct 241 tctgtccagt gcaccatctg gacctcagtt ccccttcaaa ggggtggatg accgtgagtc 301 ctggccctct gtgttttata ataggacctg ccagtgctca ggcaacttca tgggtttcaa 361 ctgcggaaac tgtaagtttg gatttggggg cccaaattgt acagagaagc gagtcttgat 421 tagaagaaac atttttgatt tgagtgtctc cgaaaagaat aagttctttt cttacctcac 481 tttagcaaaa catactatca gctcagtcta tgtcatcccc acaggcacct atggccaaat 541 gaacaatggg tcaacaccca tgtttaatga tatcaacatc tacgacctct ttgtatggat 601 gcattactat gtgtcaaggg acacactgct tgggggctct gaaatatgga gggacattga 661 ttttgcccat gaagcaccag ggtttctgcc ttggcacaga cttttcttgt tattgtggga 721 acaagaaatt cgagaactaa ctggggatga gaacttcact gttccatact gggattggag 781 agatgcagaa aactgtgaca tttgcacaga tgagtacttg ggaggtcgtc accctgaaaa 841 tcctaactta ctcagcccag catccttctt ctcctcctgg cagatcattt gtagcagatc 901 agaagagtat aatagccatc aggttttatg cgatggaaca cctgagggac cactattacg 961 taatcctgga aaccatgaca aagccaaaac ccccaggctc ccatcttcag cagatgtgga 1021 attttgtctg agtttgaccc agtatgaatc tggatcaatg gatagaactg ccaatttcag 1081 ctttagaaac acactggaag tatttttgtt tgaacacaat ggctgcgaag ggcaccgccc 1141 tcttttggaa gtttacccag aagccaatgc acctatcggc cataacagag actcttacat 1201 ggttcctttc ataccgctct atagaaatgg tgatttcttc ataacatcca aggatctggg 1261 atatgactac agctacctcc aagagtcaga tccaggcttt tacagaaatt atattgagcc 1321 ttacttggaa caaggccagt cgtatctggc catggcttct tggggcagca ctggtgggag 1381 ctgttattgc tgcagctctc tctgggctta gcagtaggct atgccttcag aagaagaaga 1441 agaagaagca accccaggag gaaaggcagc cactcctcat ggacaaagac gactaccaca 1501 gcttgctgta tcagagccat ctgtgaacat cctaggaaac agagtggact gaaaggtttt 1561 acctcactcg acctatttgt tggtgtttct acaaatttaa actagcataa aacatagacc 1621 atagctgttt ggcttttttt cagacccatg ttttttccta agtcctagtt tctaagaaat 1681 gactgggatt tgctaaaata tatatatata taaataataa cttactaata gctaaataaa 1741 atttcctctt acaactaatt gagtggtttt tatgaatgtg tcttaattat ttaaacttga 1801 ggcacatttt tgttttcctt acttcattgt gaatttcc // LOCUS RATALDCPR 1294 bp ds-DNA ROD 15-MAR-1990 DEFINITION Rat aldolase gene 5'flank promoter region and exon 1 (EC 4.1.2.13). ACCESSION X07845 KEYWORDS aldolase C; fructose 1,6-bisphosphate aldolase; fructose bisphosphate aldolase; promoter. SOURCE rat (Rattus rattus). ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1294) AUTHORS Skala,H. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 1294) AUTHORS Vibert,M., Henry,A., Kahn,A. and Skala,H. TITLE ; JOURNAL Eur. J. Biochem. 181, 33-39 (1989) STANDARD simple automatic COMMENT *source: library=Charon 4A; strain=Buffalo; see X06984 for rat aldolase C cDNA EMBL features not translated to GenBank features: key from to description SITE 1 640 promoter region SITE 641 855 multiple transcription start sites MSG 641 901 exon 1 IVS 902 >1294 intron I [1] Author address Skala H., INSERM U. 129, 24 rue du Faubourg Saint Jacques, 75014 Paris, France. Submitted (06-JUN-1988) on tape to the EMBL data library BASE COUNT 261 a 401 c 284 g 348 t ORIGIN 1 ggatccctta tcccaataat aaaccaattg gcatggagct gactatgttt tatgctattt 61 aaataaagtg tatttaatgt gtccttccag tccactgaat ctaattttgg gagtaggggt 121 gggcaggtga aatcatggag gcagatcagc cctccagcct aggcagcctt cacttcctgc 181 ggcagctaaa gccagggctg actgcttttc ttcgcttgat tctcttgctt ccactctccc 241 ctccccccca aaaaaagggc aatggactgc ctgttccctg gatcactgcc tgtggggaac 301 agaaaagact tcattcactt actttcccca ttccgcccac ccgcccccgc cccccggggc 361 aggcactggt aaatatgggg cagagcatag aaactcaaat ccattattcc atgccttgaa 421 gtgggcaggc agcctgttag gtggcctctg tcctacctcc catctactga tgtttagatg 481 tctctcctct aaaccaggct taatccccaa cattccgcat aatgtgaaag catttgcctt 541 tcacagcaaa agactagtta tggatagagc aaacacaagt ccatctcttt cctcatgggt 601 tctagtggca gggttccatg atgagggagg ggcatcgtct tttacaaacc ttggggaccc 661 tggcagagaa gtcactccac ctcactactc cacccctgaa gcgtggcctt accctagacc 721 agtcctgggg agagggcggg accaggatgg gaggtgtctg tcacgccccc agggagtcac 781 gtagctctgc ggcatctgct gctgccttat ttactccagc ttggactgag ctactgtctg 841 ttgctaacca gaccaaagtg agctgtgcct gtggctccgg ccgctgactg actgagtata 901 ggtaaggtcc cgggtgaggc ggagatgccc cgtccccgtt tgtcttcccc tgccgtcctt 961 gccctgctcc ctaggaccct agctcctctg ctttctgtgg ctcaggaaaa ccctgcttct 1021 ctccttcccc ctcgctctcc tttccatttc caagtcttcc tctgcctcca gaacttcact 1081 ggggtcggtg tatcccacct agggtatttt aatctcagcc ccttcctggt ccttctaggc 1141 ttgtccccag gtctaggaag agcatcagga ctggttaacc tgtatcacac cactatctag 1201 ctcatctctc ccaggtctcc ttcctgcttt tcctgcatct ctccctcttt cccataatcc 1261 actcactcag ccttaattcc tcacctctgg atcc // LOCUS RATCNRAB 2267 bp ss-RNA ROD 15-MAR-1990 DEFINITION Rat calcineurin A beta mRNA. ACCESSION D90036 KEYWORDS calcineurin; calcineurin A beta; calmodulin-binding protein; calmodulin-dependent protein phosphatase; isoform. SOURCE Rat brain, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2267) AUTHORS Kuno,T., Takeda,T., Hirai,M., Ito,A., Mukai,H. and Tanaka,C. TITLE Evidence for a Second Isoform of the Catalytic Subunit of Calmodulin-Dependent Protein Phosphatase (Calcineurin A) JOURNAL Biochem. Biophys. Res. Commun. (19xx) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Takayoshi Kuno Department of Pharmacology Kobe University School of Medicine 7-5-1 Kusunoki-cho,Chuo-ku Kobe 650 Japan Phone: 078-341-7451 x3273 Fax: 078-351-6531 Peptides, 87-338 and 401-424, seem to be putative catalytic domain and calmodulin binding domain, respectively. FEATURES from to/span description ORF 40 1617 ORF (525 AA) BASE COUNT 639 a 481 c 525 g 622 t ORIGIN 1 cccgcccggc ccggcaccta gccgagcccc gggcccagca tggccgcccc ggagccggcc 61 cgggccgcgc cgccccctcc cccgcccccg ccgccccccc tcggggccga ccgcgtcgtc 121 aaagccgttc cttttccccc aactcaccgg ttgacatctg aagaagtgtt tgatatggat 181 gggataccca gggttgatgt tctgaagaac catctggtaa aagaaggtcg ggtagatgaa 241 gaaattgcac taagaattat caatgagggt gctgccatac ttaggcggga gaaaaccatg 301 atagaagtag aagctccaat tacagtgtgt ggtgacatcc atggccaatt ttttgatctg 361 atgaaacttt ttgaagtagg aggatcacct gctaatacac gatacctttt tcttggtgat 421 tatgtggaca gaggttattt tagtatagag tgtgtcttgt atttatgggt cttgaagatt 481 ctatacccaa gcacattatt ccttctgaga ggcaaccatg aatgcagaca cctcactgag 541 tattttacct ttaagcagga atgtaaaatt aagtattcag aaagggtcta tgaagcttgt 601 atggaggctt ttgacagctt gccccttgct gcacttctaa accaacaatt tctttgtgtt 661 catggtggac tttcaccaga gatacacaca ctggatgata ttaggagatt agatagattt 721 aaagagccac ctgcatttgg accaatgtgt gacttgctgt ggtctgatcc ttctgaagac 781 tttggaaatg aaaaatcaca agaacatttt agtcataata cggttcgagg atgttcttat 841 ttttataact atccagcagt gtgtgaattt ttgcaaaaca ataatttgtt atcgattatt 901 agagctcatg aagctcaaga tgcaggctat agaatgtaca gaaaaagtca aactacaggg 961 tttccttcat taataacaat tttttcggca cctaattact tagatgtcta caataataaa 1021 gctgctgtac taaagtatga gaacaatgtg atgaacatcc gccagtttaa ctgttctcca 1081 catccttact ggttgcccaa ttttatggat gtctttacat ggtctttacc atttgttgga 1141 gaaaaagtca cagagatgtt ggtaaatgtt ctcagtattt gctctgatga tgaactaatg 1201 acagaagggg aagaccagtt tgatgtaggt tcagctgcag cccggaaaga aatcataaga 1261 aacaagatcc gagcaattgg caagatggca agagtcttct ctgttctcag ggaggagagt 1321 gaaagcgtgc tgacactcaa gggcctgact cccacaggga tgttgcctag tggagtgttg 1381 gctggaggac ggcagacctt gcaaagtgcc acagttgagg ctattgaggc tgaaaaagca 1441 atacgaggat cctctccacc acatagaatc tgcagttttg aagaggcaaa gggtttggat 1501 aggatcaatg agagaatgcc accccggaaa gatgctgtgc agcaagatgg tttcaattcc 1561 ctgaacaccg cacataccac tgagaaccac gggactggca accatagtgc ccagtgacca 1621 gcggcttccc agggatggac agatcactaa ggagccggag gggtcggccg agctgatgat 1681 aaatgtcata atctctctga agaaacaaac cattgtgctt tttgagaccc ttgccccctt 1741 cccggatgga ggcttgaggg ccttgggact tgtgctgtcc ataagattgg ggaatcgctg 1801 ccacagcgga gagcagtgag caaggggctt ggggcaaatt ccagtggagg aagcccaaac 1861 ctccatttat gcttgtggtt cacacattta cgtttacaaa tgagatttcc tttgttttct 1921 ccctcagtag aattagattt ttttcaacca tgactttaaa tgcaatcttt agagttaatg 1981 tggaacctcc ctcaccccca ccccatgaaa tgtctttaag aggatggatt agcatggtct 2041 taaaatatat ttctgaggtt actagatgta ttttgaattg tagacaaaat ctgagaaacc 2101 cagttggtgt ttatataaaa acgctgacct caggtcatag ttcttaaatg tggctaattc 2161 tgtaacatag tcttggtatt tttcaattat gaatgcataa actatttcta agaagactct 2221 tacttgaaca agatccaaaa aaccaattta gatccttttt gcccccg // LOCUS RATGSTB 1038 bp ss-mRNA ROD 25-SEP-1987 DEFINITION Rat liver glutathione S-transferase Yb-1 subunit mRNA, complete cds. ACCESSION M11719 KEYWORDS glutathione transferase. SOURCE Rat, cDNA to mRNA, clone pGTA/C44. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1038) AUTHORS Ding,G.J.-F., Lu,A.Y.H. and Pickett,C.B. TITLE Rat liver glutathione S-transferases: Nucleotide sequence analysis of a Yb-1 cDNA clone and prediction of the complete amino acid sequence of the Yb-1 subunit JOURNAL J. Biol. Chem. 260, 13268-13271 (1985) STANDARD full staff_review FEATURES from to/span description pept 38 694 glutathione S-transferase Yb-1 subunit (EC 2.5.1.18) BASE COUNT 243 a 313 c 233 g 249 t ORIGIN Unreported. 1 gctgaagcca aattgagaag accacagcgc cagaaccatg cctatgatac tgggatactg 61 gaacgtccgc gggctgacac acccgatccg cctgctcctg gaatacacag actcaagcta 121 tgaggagaag agatacgcca tgggcgacgc tcccgactat gacagaagcc agtggctgaa 181 tgagaagttc aaactgggcc tggacttccc caatctgccc tacttaattg atggatcgcg 241 caagattacc cagagcaatg ccataatgcg ctaccttgcc cgcaagcacc acctgtgtgg 301 agagacagag gaggagcgga ttcgtgccga cattgtggag aaccaggtca tggacaaccg 361 catgcagctc atcatgcttt gttacaaccc cgactttgag aagcagaagc cagagttctt 421 gaagaccatc cctgagaaga tgaagctcta ctctgagttc ctgggcaagc gaccatggtt 481 tgcaggggac aaggtcacct atgtggattt ccttgcttat gacattcttg accagtacca 541 catttttgag cccaagtgcc tggacgcctt cccaaacctg aaggacttcc tggcccgctt 601 tgagggcctg aagaagatct ctgcctacat gaattgcagc cgctacctct caacacctat 661 attttcgaag ttggcccaat ggagtaacaa gtaggccctt gctacactgg cactcacaga 721 gaggacctgt ccacattgga tcctgcaggc accctggcct tctgcactgt ggttctctct 781 ccttcctgct cccttctcca gctttgtcag ccccatctcc tcaacctcac cccagtcatg 841 cccacatagt cttcattctc cccactttct ttcatagtgg tccccttctt tattgacacc 901 ttaacacaac ctcacagtcc ttttctgtga tttgaggtct gccctgaact cagtctccct 961 agacttaccc caaatgtaac actgtctcag tgccagcctg ttcctggtgg gggagctgcc 1021 ccaggcctgt ctcatctt // LOCUS RATRHLA 1116 bp ss-mRNA ROD 15-MAR-1989 DEFINITION Rat asialoglycoprotein receptor (RHL2) mRNA, complete cds. ACCESSION M16347 KEYWORDS alternative splicing; asialoglycoprotein receptor. SOURCE Rat (Sprague Dawley) liver RNA, clone 13D. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1116) AUTHORS McPhaul,M. and Berg,P. TITLE Identification and characterization of cDNA clones encoding two homologous proteins that are part of the asialoglycoprotein receptor JOURNAL Mol. Cell. Biol. 7, 1841-1847 (1987) STANDARD simple staff_review FEATURES from to/span description pept 154 1059 asialoglycoprotein receptor (RHL2) BASE COUNT 266 a 305 c 288 g 257 t ORIGIN 1 ggtgcctaga ttagccccct cctccttctc gcctgctgtc ctgctgtccc aggtttaacc 61 ccctttttct ccttggactc aggctgcctc cggaagcaga gtagctctct atacatttaa 121 cagtcccaga tctgtctcca gcctagggcc atcatggaga aggactttca agatatccag 181 cagctggact ctgaggaaaa cgaccatcag ctcattggcg atgaggaaca aggctctcat 241 gtgcagaatc ttaggaccga aaatccacgt tggggaggac agcctccttc caggcccttt 301 ccacagcgcc tctgctccaa gttccgcctc agtctgctcg ccctggcctt caacattctc 361 ctgctggtgg tcatctgtgt ggtttcatcc caaagcatgc agctgcaaaa ggagttctgg 421 accctgaaag aaaccttgag caacttctcc accaccaccc tgatggagtt caaggctctg 481 gactcccacg gaggtagcag gaatgacaac ttgacttctt gggaaacaat actggagaaa 541 aagcagaagg acataaaagc agatcactcc acgctgctct tccacctgaa gcacttcccc 601 ctggatctgg caaccctgac ctgtcagctg gcgttcttcc tgagcaacgg cacagaatgc 661 tgccccgtta actgggtgga gtttggtgga agctgctact ggttttctcg ggatgggctc 721 acctgggctg aggctgacca gtactgccaa atggagattg cccatctgct ggtcatcaac 781 tcaagggagg agcaggaatt cgttgtaaag cacaggggcg cgtttcacat ttggataggt 841 ctcaccgaca aggatggctc ctggaaatgg gtggatggga cggaatatag aagtaacttc 901 aagaattggg ctttcactca gccagataac tgccaggggc atgaagaggg gggaagtgaa 961 gactgtgctg aaatcctgtc agatggcctc tggaatgaca acttctgcca gcaggtgaac 1021 cgctgggctt gtgaaaggaa acgggacatc acctactagg agtctgctct actatgtctt 1081 tgtcaccctc cgggaacccc gcatcactca ttagga //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS ECOPHNAQ 15611 bp ds-DNA BCT 02-APR-1990 DEFINITION E.coli psiD locus containing alkylphosphonate uptake (phn) genes A through Q, complete cds. ACCESSION J05260 KEYWORDS C-P lyase; HisM-like integral membrane protein; HisP-like nucleotide binding protein; inverted repeats. SOURCE E.coli (strains B) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 15611) AUTHORS Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T. TITLE Molecular biology of carbon-phosphorus bond cleavage: Cloning and sequencing of the phn (psiD) genes involved in alkylphosphonate uptake and C-P lyase activity in Escherichia coli B JOURNAL J. Biol. Chem. 265, 4461-4471 (1990) STANDARD full staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by C.-M.Chen, 01-MAR-1990, for release after publication. FEATURES from to/span description pept 2874 3209 phnA protein pept 3767 4210 phnB protein pept 4343 5131 HisP-like nucleotide binding protein (phnC) pept 5156 6172 phnD protein pept 6227 7057 phnE protein pept 7078 7803 phnF protein pept 7804 8256 phnG protein pept 8253 8837 phnH protein pept 8837 9901 phnI protein pept 9894 10739 phnJ protein pept 10736 11494 HisP-like nucleotide binding protein (phnK) pept 11605 12285 HisP-like nucleotide binding protein (phnL) pept 12282 13418 HisM-like integral membrane protein (phnM) pept 13418 13975 HisP-like nucleotide binding protein (phnN) pept 13962 14396 phnO protein pept 14398 15156 phnP protein pept 15131 15496 phnQ protein pept 2473 245 (c) ORF742 pept 7470 7030 (c) ORF146 pept 9298 8489 (c) ORF269 pept 12264 11920 (c) ORF114 pept 14760 14380 (c) ORF126 rpt 3224 3257 inverted repeat a rpt 3271 3306 inverted repeat b rpt 3324 3357 inverted repeat c rpt 3371 3406 inverted repeat d rpt 3424 3457 inverted repeat e rpt 3471 3506 inverted repeat f rpt 3524 3557 inverted repeat g rpt 3571 3606 inverted repeat h rpt 3624 3657 inverted repeat i rpt 3671 3706 inverted repeat j rpt 6176 6211 inverted repeat k rpt 6229 6262 inverted repeat l rpt 11501 11534 inverted repeat m rpt 11548 11583 inverted repeat n signal 2622 2627 -35 region signal 2645 2650 -10 region signal 2738 2743 -35 region signal 2771 2776 -10 region signal 4074 4079 -35 region signal 4096 4101 -10 region signal 2084 2079 (c) -10 region signal 2108 2103 (c) -35 region signal 2720 2715 (c) -10 region signal 2742 2737 (c) -35 region signal 12056 12051 (c) -10 region signal 12080 12075 (c) -35 region signal 2737 2754 phosphate (pho) box BASE COUNT 3466 a 4625 c 4306 g 3214 t ORIGIN 1 ggatccagca tcgacgccag tttttccacc attgtcagtc gcaggctaag cggcgcattt 61 aacatgccgc cgttcgtcca tgtctgaagc tgcacacgcg aaagaagttc ctgcatcagt 121 cgttcacgaa actgctgctg atgggcttgt ggaaggcggg catcatcgcc ctgcgccaga 181 tccactaaaa agcggggata aaccgactcc agcacgcgac cggggccgtc cagtaacgtc 241 ttggtcaata tcgttctgcc gtgaaaagtg tttgaatatc atcgcgtaac agctgggcgt 301 cggtgtaaat ccagccgtga gtcatcacag tctgctgcaa ttgctgctgc atcagcctga 361 ccaccgattc attttgttga cgcagagcca ggctttcgcg taaacgcgtc tgtaattccg 421 tcaaacatga agcgaactca gcgaaaaaag tattcatgcc tgccgtaaca gattcatcga 481 cctgctctgc cagaacttta gccatttgtt ggcaataaag atcgacttct gcgcttaatg 541 ctcgttgcaa cacactgtaa tcaaccgttt ctgtcgggga tttctcattt ccccgtcccc 601 agtcgggctg attcaaccag cgcgaaaaag tctcacgcac aacgcctaaa cgcgtgctct 661 gctcgtccgt tgcatcctgg cgcgaaatga ctgcactgaa cagctggcga gtgttgaagt 721 ggggaactac gccgtgaaaa acaggaaaat gaaacccagg acgaaaccct gactcgctca 781 attccatttt gacttgttgc tcaatggggc gaataacatc ggttaacact cggcaaaggg 841 tggattccag ctcggcaaaa cgcagcgtaa agtcgcgact gatggtgttc tgcgccgtct 901 gtaacagtgt ctcacagcgg gtacgcatct cgcttaacgg ctccgaatca tcctgaaaca 961 aggcggctaa ctgcgcattc agcgcatctt gttgttgacg cagaaagtgg ttggcggagg 1021 tcagggccag ctcgatttca tgtttaatct cgccgctcac ctgcgcctga ttgagttgca 1081 atagctgcaa actttcttcg acctgatgga tattttgccg caattgttca caagcgacgt 1141 ttaacccgtg cgcacgaaaa tccaggtatt cccgcgcctg ctgcgcgtaa ttcaacagtt 1201 tatgcgcagc agatcgcaaa gcatacaacg aggcgttagc gtaagcggca tgaagcaacg 1261 cctgaattgg ctgggcgaac agcgaatctt cccacaactg atcggcagca tgacgaatat 1321 gttcgaggtc cgccagatcg gcatgacgcc agcgcctgcc gagcgcggca tgggcaaaat 1381 cttccaccca gcgttgttgc tctggcgctg gtaacttacc gttgttggct aactcatggc 1441 gcgcccgatt cgccaggtag ccccacatcg acgacaccgg aaatatctgc tgtggcgtaa 1501 tacagccttt catcagcgtc ccggaaatca gtgcccgcac ctggtcggcg tcgtcactgt 1561 tacgatcctg ttgatcgaac ttattgacca gcacatacag cggcaccgat tgccccaccg 1621 ccaaaatcgc ctcacggacc tcttcatcgg agatcgattt cagttgcgta taatccagca 1681 ccgccagtac cgccgaggcg cgtgccagct gctggttaag cattttttgc agatgcggtt 1741 gcccggcttc atttggcccg ggggtatcca gtaacgtcaa ctgaccggga taactctcca 1801 gccccgccag atggacaaac tccacttcaa tcacgggaat atgctcaatg gcggcgtaag 1861 cagaaaaagg aaaatcgacg tccagcgcct tcgccagtcg cactaaatca ttcaaacttt 1921 tcagacaatg aaaaataggc tgggcaccca gataatattt ttcgaaagcg acgccatttt 1981 cgatccgctg cataagcgca cgcatatctt tatctatttc cagcacatcg gtcagatgct 2041 taatatcgca atcacgcagg cgctgttgta attgttgaat taaacaatcg attggcgcga 2101 catgtgaaaa atgcagtacc ggttcctttt gcccgggcgt atggcgaata agcgtcggca 2161 gcgcagtcat tgggcgatta cgattaggca gaacctccgt accaacaatg gcattaatgg 2221 tggttgattt ccctgctttc atggtaccga caattgcaag caccatttcc agtcgggaaa 2281 ttttacgcaa ctcattattc agcatcgcgt gacgttcggc gatattaggc tgactccagg 2341 gtaaagccag ttgtggcgcg tcgtctccgg gtacagagag aggcattttt tccagtaact 2401 gcaactgttg gcgagaaagc tgtaacaggc gttcagcctc ctgacttaac tcatacaggg 2461 tctgtgtgta catagaaaat tcttccttaa agcaaatttt gttattttat ttagccagat 2521 tgtttttgag ttctgttttc ggcttttata attactgcaa gaaataattt tatatttagt 2581 gtgttgtttt ttatcagaat aaataacgtc ttctgatacg tttaaaacgt cagaaagata 2641 aaaatatcat gtgaattaaa aaaagaacaa gtagagcatt aacattatct taaataataa 2701 atagaggcaa aaagattatt ttctttttgc gtttcctttc aaatgaaaac gatcgtcgtc 2761 taaaatcagc agtacccccg acaaactcag ggattttgtg tataattgcg gcctttttcg 2821 gcaatctgcc gttttttggc gcttttgccc tgctgacttt tgaggaaatc cacatgtcat 2881 taccacactg cccaaaatgc aactccgaat acacttacga agataacggc atgtacatct 2941 gcccggaatg tgcctacgaa tggaacgacg cagaacctgc acaggaaagc gacgagctga 3001 tcgttaaaga tgctaacggc aatctgctgg ctgacggcga cagcgttacc atcattaaag 3061 atctgaaggt gaaaggtagc tcttcgatgc tgaaaattgg caccaaagtg aaaaacatcc 3121 gcctggttga aggcgaccat aacatcgatt gcaaaatcga cggttttggt ccgatgaaac 3181 tgaaatctga gtttgtgaaa aagaactgat tgtattgtga tcggtaagcc ggataaggcg 3241 ctcgcgccgc atccggcaac ggtgccagat gcctgatgcg acgcttgcgc gtcttatcag 3301 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac 3361 ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc 3421 tccgtaggcc ggataaggcg tttacgccgc atccggcaac agtgccaact gcctgatgcg 3481 acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg 3541 tttacgccgc atccggcaat ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag 3601 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac 3661 agtgccgact gcctgatgcg acgctcgcgc gtcttatcag gccgcctctc atctgtataa 3721 atttcgaact acacttaact ggcttctctt aactgaggtc accatcatgc cgttaagtcc 3781 ctacctctct tttgccggta actgttccga cgcgattgcc tattatcaac gtacgttggg 3841 cgcggaactg ctctataaaa tcagcttcgg cgaaatgcca aaatcagcgc aggacagcgc 3901 cgagaactgc ccttccggaa tgcaatttcc cgataccgcc atcgctcatg ccaacgtgcg 3961 cattgccgga agcgacatca tgatgagcga tgccatgccg tcaggaaaag ccagctactc 4021 cggctttacg ctggtgctcg attcgcaaca ggtcgaagaa ggaaaacgct ggtttgacaa 4081 tcttgccgct aacggaaaaa tcgaaatggc ctggcaggaa actttctggg cgcatggctt 4141 tggcaaagtc accgataaat ttggcgtacc gtggatgatt aatgtcgtca aacaacaacc 4201 aacgcaataa cccgccggga ggcccgccct cccgcactgt catcgaattc ccgttaactc 4261 ttcatctgtt agtcactttt aattaaccaa atcgtcacaa taatccgcca cgatggagcc 4321 acttttttag ggaggctgca tcatgcaaac gattatccgt gtcgagaagc tcgccaaaac 4381 cttcaatcag catcaggcgc tgcatgcggt tgatctgaac attcatcacg gtgaaatggt 4441 ggctctgctt gggccgtcgg gttccggcaa atccaccctt ttacgtcact taagcggttt 4501 gattaccggc gataaatccg ccggcagcca tatcgagctg ctgggccgca cagtccagcg 4561 cgaaggccgt ctggcgcgcg atatccgcaa aagccgcgcc aacaccggct acatcttcca 4621 acaattcaac ctggtgaacc gcctgagcgt actggagaac gtgctgattg gcgcgctcgg 4681 cagcacgccg ttctggcgca cctgttttag ctggtttacc cgcgagcaga aacaacgcgc 4741 gttacaggcg ctgacccgcg ttggcatggt gcattttgcc catcaacgcg tttccaccct 4801 ctccggcgga cagcagcagc gtgtggcgat tgcccgcgcg ctgatgcagc aggcgaaggt 4861 gattctggcc gatgaaccca tcgcctcgct ggacccggaa tccgcccgca tcgtgatgga 4921 caccctgcgc gacatcaatc agaacgacgg catcaccgtg gtcgtcacgc tgcatcaggt 4981 ggattacgcc ctgcgctact gcgaacgcat cgtcgccctg cgccaggggc acgttttcta 5041 cgacggcagc agccaacagt ttgataacga acgttttgac catctctacc gcagcattaa 5101 tcgcatcgaa gagaacgcga aagctgcctg acatccccat cattgaggaa aacgaatgaa 5161 cgctaagata attgcctcgc tggccttcac cagcatgttc agcctcagca ccctgttaag 5221 cccggcacac gccgaagagc aggaaaaggc gctgaatttc ggcattattt caacggaatc 5281 acagcaaaac ctgaaaccgc aatggacgcc attcttacag gatatggaga agaagctggg 5341 cgtgaaggtg aacgccttct ttgccccaga ctacgcaggc attatccagg gaatgcgctt 5401 caataaagtg gatatcgcct ggtacggcaa cctgtcggca atggaagcgg tggatcgcgc 5461 caacggccag gtcttcgccc agacggtcgc ggcggatgga tcgccaggtt actggagcgt 5521 gttgatcgtc aacaaagata gtccgatcaa caacctgaac gatctgctgg cgaagcggaa 5581 agatctcacc ttcggcaatg gcgatcctaa ctccacctct ggcttcctcg tccccggtta 5641 ctacgtcttc gccaaaaaca atatctccgc cagcgacttc aagcgcaccg tcaacgccgg 5701 gcatgaaacc aacgcgctgg ccgtcgccaa caagcaggtg gatgtggcga ccaacaacac 5761 cgaaaacctc gacaagctga aaacctccgc gccggagaag ctgaaagaac tgaaagtgat 5821 ctggaaatcg ccgctgatcc caggcgatcc gatcgtctgg cgtaaaaatc tttccgaaac 5881 caccaaagac aagatctacg acttctttat gaattacggc aaaacgccgg aagagaaagc 5941 ggtgctggaa cgcctgggct gggcgccgtt ccgcgcctcc agcgacctgc aactggtgcc 6001 gattcgccag ctcgcactgt ttaaagagat gcagggcgtg aaaagcaata aaggactgaa 6061 tgagcaggac aagctggcaa aaaccaccgc gattcaggcg caactggatg acctggaccg 6121 cctgaacaac gcgctaagcg cgatgagttc ggtgagtaaa gcggtgcagt aaatcgtagg 6181 tcggataaga cgccccggcg tcgcatccga caatgtgcag gcgttgatgc cggatgcggt 6241 gcaagcacct tatccggcct acagaccgga gccaaacatg caaaccatca ccatcgcccc 6301 acccaagcgc agctggttct cgcttctgag ctgggccgtt gttctcgccg tgctggtcgt 6361 ctcgtggcag ggcgcggaaa tggccccgct cacgctgatt aaagacggcg gcaacatggc 6421 aaccttcgct gccgacttct tcccgcccga tttcagccag tggcaggatt acctcaccga 6481 aatggccgtc acgctgcaaa tcgccgtctg gggcaccgcg ctggcggtgg ttctctccat 6541 cccctttggc ctgatgagcg ccgaaaacct ggtgccgtgg tgggtttacc agcccgttcg 6601 ccgcctgatg gacgcctgcc gcgccattaa cgaaatggtc ttcgccatgc tgttcgtggt 6661 cgccgtcggt ctcggaccgt tcgctggcgt gctggcgcta tttatccaca ccaccggcgt 6721 gctctccaag ctgctttccg aagcggtaga agcaattgaa cctggcccgg tggaaggcat 6781 tcgcgccacc ggtgccaaca agctcgaaga gatcctctac ggcgtgctgc cgcaggtgat 6841 gccgctgctg atctcctact ccctctatcg cttcgaatcc aacgtccgct cggcgaccgt 6901 cgtcggcatg gtcggcgcgg gcgggatcgg cgtcaccctg tgggaagcga ttcgcggttt 6961 ccagttccaa caaacctgcg ccctgatggt gcttatcatc gtcacggtca gcctgctgga 7021 tttcctctct caacggttgc gtaagcactt tatctgataa gcgaggcatt gatatctatg 7081 cacttgtcta cacatccgac cagctaccca acacgctatc aagagatagc cgcaaaactt 7141 gagcaggagc ttcgtcaaca ctaccgctgc ggcgactatc ttcccgccga gcagcaactg 7201 gcagcgcgct ttgaggtgaa tcgccacacc ctgcgccgcg ccatcgacca actggtggaa 7261 aaaggctggg tacagcgccg tcagggcgtc ggcgtgctgg tgctgatgcg cccgttcgat 7321 tacccgctca acgcccaggc gcgttttagc cagaatctgc tggatcaggg cagccatccc 7381 accagcgaaa aactgctttc ggtattgcgc cccgcgtccg gccacgtcgc tgacgcactg 7441 gggattaccg agggggagaa cgtcatccac ctgcgcaccc tgcgtcgggt caacggcgtc 7501 gcgctctgtt taatcgacca ctacttcgcg gacctcaccc tctggccgac gctgcaacgc 7561 ttcgacagcg gctcgctgca cgattttctg cgcgagcaaa ccggaattgc gctgcgccgc 7621 agccagacgc ggatcagcgc ccgccgcgcc caggccaaag agtgccagcg tcttgaaatc 7681 ccgaatatgt cgccgctgct gtgcgtgcgc acccttaacc accgtgacgg tgaaagcagc 7741 ccggcggagt actccgtcag cctgacgcgc gccgacatga ttgaattcac tatggagcac 7801 tgaatgcacg cagataccgc gacccgccag cactggatgt ccgtgctggc gcacagccaa 7861 ccggctgaac tggcagcacg cctgaacgcg ctaaacatca ccgccgacta tgaggtgatc 7921 cgcgccgctg aaactggcct ggtacagatt caggcgcgga tgggcggcac cggcgaacgt 7981 ttttttgccg gcgacgccac gctgacccgc gccgccgtgc gcctgactga cggcacgctc 8041 ggctacagct gggtgctggg gcgtgataaa cagcacgccg aacgctgcgc gctgattgac 8101 gcgctgatgc agcaatctcg ccactttcaa aacttatcag aaacccttat tgccccgctg 8161 gacgctgacc gtatggcacg cattgccgca cgccaggccg aagtgaacgc cagccgggtc 8221 gacttcttta cgatggttcg cggagacaac gcatgaccct ggaaaccgct tttatgcttc 8281 ccgtgcagga tgcccagcac agttttcgtc gcctgttaaa ggccatgagc gagccgggcg 8341 tgattgtcgc cctgcatcag ctcaaacgcg gctggcaacc gctgaatatc gccaccacca 8401 gcgtgctgct gacgctggcc gataacgaca cgccggtgtg gctttctacc ccattaaata 8461 acgatatcgt caaccagagc ctgcgttttc ataccaacgc gccgctggtc agccagccgg 8521 aacaggcgac cttcgcggtg acggatgagg cgatttccag cgaacagctc aacgcccttt 8581 ccaccggcac cgccgttgcg ccggaagcgg gcgcgacgct gattttacag gtcgccagcc 8641 tgagcggcgg gcgcatgttg cgtctcaccg gcgcgggtat tgccgaagaa cgaatgatcg 8701 ctccgcagct gccggagtgc attctgcacg aactcaccga gcgcccgcac ccgttcccgc 8761 tcggcatcga cctgatcctg acctgcggcg aacgcctgct ggctattccg cgaaccacgc 8821 atgtggaggt gtgctgatgt acgttgccgt aaaagggggc gaaaaggcga tcgacgccgc 8881 ccacgccctg caagagagcc gacgccgggg cgataccgat ttgcctgaac tgagcgtcgc 8941 ccagattgaa cagcagctta acctcgcggt agatcgcgtg atgaccgaag gcggcattgc 9001 cgaccgcgaa ctggcggcgc tggcgctgaa acaggccagc ggcgataacg ttgaagcgat 9061 tttcctgctg cgcgcctacc gcaccacgtt ggcgaagctg gcggtaagcg agccgctcga 9121 caccaccggg atgcgtctcg aacgccgtat ctccgccgtt tataaagaca ttcccggcgg 9181 ccagctgctt ggcccaacct acgactacac ccatcgcctg ctcgatttta ccctgctggc 9241 aaacggcgaa gcgccgacgc tgaccaccgc cgacagcgaa caacagccgt cgccgcacgt 9301 tttcagcctg ctggcgcgtc aggggctggc gaagtttgaa gaggatagcg gcgcacagcc 9361 ggatgacatc acccgcacgc cgccggttta cccctgctca cgttcttccc gtttgcagca 9421 gttgatgcgc ggcgacgaag gctatttgct ggcgctggcc tactccaccc agcgtggtta 9481 cggacgcaat cacccgttcg cgggcgagat ccgcagtggt tacatcgacg tgtcgattgt 9541 gccggaagag ctgggatttg cggtaaacgt cggcgaacta ctgatgaccg agtgtgaaat 9601 ggtcaacggt tttatcgacc cgccggatga gccgccgcac ttcacgcgcg gctacgggct 9661 ggtattcggc atgagcgagc gcaaagcgat ggcaatggcg ctggtcgatc gtgcgttgca 9721 ggctccggaa tacggcgagc acgcgacagg cccggcgcag gatgaagagt ttgtgctggc 9781 acatgccgac aacgtcgaag ccgcaggctt tgtctcgcac ctcaaactcc cccactacgt 9841 cgatttccag gccgaactgg agctactcaa acgtctgcaa caggagaaga accatggcta 9901 atctgagcgg ctacaacttt gcctacctcg acgagcagac caaacgcatg atccgccgcg 9961 ccatcttaaa agcggtggcg atccccggtt atcaggtgcc gtttggcggg cgcgagatgc 10021 cgatgccata cggctgggga accggcggca tacagctcac cgccagcgtg attggcgaaa 10081 gcgacgtgct aaaggtgatt gaccagggtg cggatgacac caccaacgcc gtgtcgattc 10141 gcaacttctt taagcgcgtg accggggtaa acaccactga acgtacggac gatgcgacgc 10201 ttatccagac gcgtcaccgc atccccgaaa cgccgctgac cgaagatcag atcattatct 10261 tccaggtgcc aatcccggaa ccgctgcgct ttatcgagcc gcgcgaaacg gaaacccgca 10321 ccatgcacgc gctggaagag tacggcgtga tgcaggtgaa actgtatgaa gatatcgccc 10381 gcttcggtca tatcgccact acctacgcct atccggtgaa ggtgaacggg cgctacgtaa 10441 tggacccgtc gccgatcccg aaattcgata acccaaaaat ggacatgatg cccgccctgc 10501 aactgttcgg cgcggggcgc gagaagcgca tctatgcggt gccgccgttt acccgcgtgg 10561 aaagtctcga tttcgacgat cacccgttca ccgttcagca gtgggatgag ccatgcgcca 10621 tctgcggatc gacccacagc tatcttgatg aagtggtgct ggatgacgcc ggaaaccgca 10681 tgtttgtctg ctccgatacc gattattgcc gccaacagag cgaggcaaaa aaccaatgaa 10741 tcaaccgtta ctttcggtca ataacctgac ccacctttac gcgccgggca aaggctttag 10801 cgatgtctct tttgatttat ggccggggga agtgctgggc attgtcgggg aatccggctc 10861 cgggaagacc acgctgctga agtcgatctc cgcgcgcctg acgccgcagc agggggaaat 10921 tcactacgag aaccgttcgc tgtatgcaat gagcgaggcc gaccgccgtc gcctgctgcg 10981 taccgaatgg ggcgtggtgc atcagcatcc actcgacggc ctgcgccgcc aggtgtcggc 11041 aggcggcaat atcggcgagc ggctgatggc gaccggggca cgtcattacg gcgatattcg 11101 tgccaccgcg cagaagtggc tggaagaggt ggagattccc gccaaccgga tcgacgacct 11161 gccgaccacc ttttccggcg gtatgcagca gcgtttgcag attgcccgca acctggtgac 11221 gcatccgaag ctggtgttta tggatgaacc gaccggcggg ctggatgtgt cggtgcaggc 11281 ccgcctgctc gacctgctgc gcggcctggt ggtggagctg aacctcgcgg tggtgattgt 11341 cacccatgat ttaggcgtcg cccgcctgct ggcggaccgt ttgctggtga tgaagcaggg 11401 gcaagtggtg gagagtgggt taaccgaccg cgtgctcgac gacccgcatc atccgtatac 11461 acagctgctg gtgtcatcgg ttttgcagaa ttgagccggt gccggatgcg gcgtaaacgc 11521 cttatccggc ctacaaatgc gctccccgta ggtcggataa gacgcgtcag cgtcgcatcc 11581 gacacccgaa ccacgaggcg aaaaatgatt aacgtacaaa acgtcagtaa aaccttcatc 11641 ctgcaccagc aaaacggcgt gcgcctgccc gtcctcaatc gcgcctcgct caccgtcaac 11701 gcgggcgaat gcgtggtgct ccacggccat tccggcagcg gcaaatcaac tctgctacgc 11761 tcgctgtacg ccaactatct acccgacgaa ggtcaaatcc agatcaaaca cggtgacgag 11821 tgggtagacc tggtcaccgc gccagcgcgc aaagtggtgg aaatccgcaa aaccaccgtc 11881 ggctgggtga gccagtttct gcgcgtcatc ccgcgtatct cagcactgga agtggtgatg 11941 cagccgctgc tcgataccgg cgttccgcgt gaagcctgcg ccgctaaagc cgcgcgtctt 12001 ctcacccgcc tgaacgtgcc ggaacgcctg tggcacctgg caccatcgac attttccggt 12061 ggcgaacagc agcgcgtcaa catcgcccgc ggctttatcg tcgactaccc cattctgctg 12121 cttgacgaac ctaccgcctc gctggacgcc aaaaacagcg ccgcggtggt ggaactgatt 12181 cgcgaagcca aaacccgtgg cgcagccatc gtaggcatct tccatgacga agctgtacgt 12241 aatgacgtcg ccgaccgcct gcacccaatg ggagcctctt catgattatc aataacgtta 12301 agctggtgct ggaaaacgag gtggtaagcg gttcgctgga ggtgcagaac ggcgaaatcc 12361 gcgcctttgc cgaaagccag agccgcctgc cggaggcgat ggacggcgaa ggcggctggc 12421 tgctgccggg gctgattgag ctgcataccg ataatctgga taaattcttc accccgcgcc 12481 cgaaagttga ctggcctgcc cactcggcga tgagcagcca cgacgcgctg atggtggcga 12541 gcggcatcac caccgtactg gatgccgtgg caattggcga cgtgcgcgac ggcggcgatc 12601 ggctggagaa tctggagaag atgatcaacg ccatcgaaga gacgcagaaa cgcggcgtca 12661 accgcgccga gcaccgtctg catctgcgct gcgaactgcc gcatcacacc acgctgccgc 12721 tgtttgaaaa actggtgcag cgcgagccgg tgacgctggt gtcgctgatg gaccactcgc 12781 cgggccagcg ccagttcgcc aaccgcgaga agtatcgcga atattatcag ggcaaatact 12841 ccctcactga tgcgcagatg cagcagtacg aagaagagca actggcgctc gccgcacgct 12901 ggtcgcagcc gaatcgcgaa tccatcgccg ccctgtgccg cgcgcgaaaa attgcgcttg 12961 ccagccacga tgacgccacc cacgcccacg ttgctgaatc tcaccagctt ggcagcgtga 13021 tcgccgaatt tcccaccacg ttcgaagcgg cggaagcctc gcgcaagcat ggcatgaacg 13081 tgctgatggg cgcgccgaat attgtgcgcg gcggctcgca ctccggcaac gtggcggcca 13141 gtgaactggc gcagcttggc ctgctggata tcctctcttc cgactactac cccgccagcc 13201 tgctcgatgc ggcatttcgc gtcgccgatg acgagagcaa ccgctttacg ctgccgcagg 13261 cggtgaagct ggtgactaaa aatccagcgc aggcgcttaa tctccaggat cgcggggtga 13321 ttggcgaggg caaacgcgcc gacctggtgc tggcgcatcg caaggacaat catattcata 13381 tcgaccacgt ctggcgtcag ggtaaaaggg tgttctgatg atgggaaaac tgatttggtt 13441 aatggggccg tccggctccg ggaaagacag cctgctggcg gaactccgcc tgcgggaaca 13501 aactcagtta ctggtggcgc atcgctacat cacgcgcgat gccagcgccg gaagtgaaaa 13561 ccatatcgcc ctgagcgagc aggagttttt tacccgcgcg gggcaaaatc tgttggcctt 13621 aagctggcac gctaacggtc tgtattatgg cgtcggcgtc gagattgatc tctggctgca 13681 cgccggattc gacgtgctgg tcaacggctc acgcgcccat ctgccgcagg cgcgggcgcg 13741 ctatcaatcg gcgctgctgc ccgtctgttt acaggtttcg ccggagatcc tccgccagcg 13801 cctggaaaac cgtggccgtg aaaacgccag tgaaattaac gcccgcctgg cgcgcgccgc 13861 ccgctatact ccacaggatt gccatacgct caacaatgac ggcagcctgc gccagtcggt 13921 cgacacgctg ctgacgctga tccatcagaa ggagaaacac catgcctgct tgtgagcttc 13981 gcccggccac gcagtacgac accgacgcgg tttacgcgct gatttgtgag ctaaaacagg 14041 cggagtttga ccaccacgcg tttcgcgtgg gttttaacgc caatctgcgc gacccaaaca 14101 tgcgctacca tctggcgctg cttgatggcg aagttgtcgg catgatcggc ctgcatttgc 14161 agtttcatct gcatcatgtc aactggatcg gcgaaattca ggagttggtg gtaatgccgc 14221 aggcgcgcgg tctgaacgtc ggcagtaagt tactggcgtg ggcagaagaa gaagcccgcc 14281 aggccggggc cgaaatgacc gaactttcga ccaacgtgaa gcgccacgac gcgcaccgtt 14341 tctatctgcg cgaaggctac gagcagagcc acttccgctt caccaaggcg ctgtaacatg 14401 agcctgaccc tcacgctcac cggcaccggc ggcgcacagg gcgttccggc atggggctgc 14461 gagtgtgcgg cctgcgccag agcgcggcgc tcgccgcagt atcgccgcca accgtgcagc 14521 ggcgtagtga agtttaacga cgcaatcacc ctgatcgacg ccgggctgca cgatctcgcc 14581 gatcgctggt cgcccggatc gttccagcag tttttgctga cgcattatca tatggatcac 14641 gtccaggggc tgtttccgct gcgctggggc gttggcgatc cgatcccggt ttacggcccg 14701 ccggatgaac agggctgcga cgatctgttt aaacatccgg gcctgcttga tttcagccac 14761 acggtggaac cgtttgtggt gtttgatttg caggggttac aggtcacgcc cctgccgctc 14821 aaccactcaa aactgacctt cggttatctg ctggaaacgg cacacagccg ggtggcgtgg 14881 ctgtctgaca ccgcaggctt gccggaaaaa acgctgaaat ttttacgcaa taatcagccg 14941 caggtaatgg tgatggattg cagtcacccg ccgcgcgcgg atgcaccgcg taatcactgt 15001 gatttaaata ccgtgcttgc gctgaatcag gttatccgct cgccacgggt gattctgacc 15061 catatcagcc accagtttga tgcgtggctg atggaaaacg cactaccgtc agggtttgag 15121 gtggggtttg atgggatgga gattggggtg gcgtgatgag agggaatgtg cgcgctggcc 15181 ccctcaccct aaccctctcc ccagaggggc gaggggaccg attgtgctcg atattgaata 15241 ttgcgctcgt tttctccctc tccccattgg ggtgaggggc gatgcctgct ccatacccaa 15301 cctcatcgcc catactcatc ttccattctc cgctcttcat cctccagttg ccgacgctcc 15361 tgatcaagct ggcgctggcg atcgtccagc tgcctgcggc gatcttcaaa ctggcggcgg 15421 cggtcgtcat attgtctgcg ccgatcgtcg ctcacttcac gctgccagcc gtggtcgcgc 15481 gaatcttcat agttgaagcg gcgcacgaaa aacgcgaaag cgtttcacga taaatgcgaa 15541 aactttagct ttcgcgcttc aaatgaaaca gatgtattaa ttactgcttt ttattcatta 15601 catggggatc c //
GenBank-Updates@genbank.bio.net (04/06/90)
LOCUS ECOPHNAQ 15611 bp ds-DNA BCT 02-APR-1990 DEFINITION E.coli psiD locus containing alkylphosphonate uptake (phn) genes A through Q, complete cds. ACCESSION J05260 KEYWORDS C-P lyase; HisM-like integral membrane protein; HisP-like nucleotide binding protein; inverted repeats. SOURCE E.coli (strain B) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 15611) AUTHORS Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T. TITLE Molecular biology of carbon-phosphorus bond cleavage: Cloning and sequencing of the phn (psiD) genes involved in alkylphosphonate uptake and C-P lyase activity in Escherichia coli B JOURNAL J. Biol. Chem. 265, 4461-4471 (1990) STANDARD full staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by C.-M.Chen, 01-MAR-1990, for release after publication. FEATURES from to/span description pept 2874 3209 phnA protein pept 3767 4210 phnB protein pept 4343 5131 HisP-like nucleotide binding protein (phnC) pept 5156 6172 phnD protein pept 6227 7057 phnE protein pept 7078 7803 phnF protein pept 7804 8256 phnG protein pept 8253 8837 phnH protein pept 8837 9901 phnI protein pept 9894 10739 phnJ protein pept 10736 11494 HisP-like nucleotide binding protein (phnK) pept 11605 12285 HisP-like nucleotide binding protein (phnL) pept 12282 13418 HisM-like integral membrane protein (phnM) pept 13418 13975 HisP-like nucleotide binding protein (phnN) pept 13962 14396 phnO protein pept 14398 15156 phnP protein pept 15131 15496 phnQ protein pept 2473 245 (c) ORF742 pept 7470 7030 (c) ORF146 pept 9298 8489 (c) ORF269 pept 12264 11920 (c) ORF114 pept 14760 14380 (c) ORF126 rpt 3224 3257 inverted repeat a rpt 3271 3306 inverted repeat b rpt 3324 3357 inverted repeat c rpt 3371 3406 inverted repeat d rpt 3424 3457 inverted repeat e rpt 3471 3506 inverted repeat f rpt 3524 3557 inverted repeat g rpt 3571 3606 inverted repeat h rpt 3624 3657 inverted repeat i rpt 3671 3706 inverted repeat j rpt 6176 6211 inverted repeat k rpt 6229 6262 inverted repeat l rpt 11501 11534 inverted repeat m rpt 11548 11583 inverted repeat n signal 2622 2627 -35 region signal 2645 2650 -10 region signal 2738 2743 -35 region signal 2771 2776 -10 region signal 4074 4079 -35 region signal 4096 4101 -10 region signal 2084 2079 (c) -10 region signal 2108 2103 (c) -35 region signal 2720 2715 (c) -10 region signal 2742 2737 (c) -35 region signal 12056 12051 (c) -10 region signal 12080 12075 (c) -35 region signal 2737 2754 phosphate (pho) box BASE COUNT 3466 a 4625 c 4306 g 3214 t ORIGIN 1 ggatccagca tcgacgccag tttttccacc attgtcagtc gcaggctaag cggcgcattt 61 aacatgccgc cgttcgtcca tgtctgaagc tgcacacgcg aaagaagttc ctgcatcagt 121 cgttcacgaa actgctgctg atgggcttgt ggaaggcggg catcatcgcc ctgcgccaga 181 tccactaaaa agcggggata aaccgactcc agcacgcgac cggggccgtc cagtaacgtc 241 ttggtcaata tcgttctgcc gtgaaaagtg tttgaatatc atcgcgtaac agctgggcgt 301 cggtgtaaat ccagccgtga gtcatcacag tctgctgcaa ttgctgctgc atcagcctga 361 ccaccgattc attttgttga cgcagagcca ggctttcgcg taaacgcgtc tgtaattccg 421 tcaaacatga agcgaactca gcgaaaaaag tattcatgcc tgccgtaaca gattcatcga 481 cctgctctgc cagaacttta gccatttgtt ggcaataaag atcgacttct gcgcttaatg 541 ctcgttgcaa cacactgtaa tcaaccgttt ctgtcgggga tttctcattt ccccgtcccc 601 agtcgggctg attcaaccag cgcgaaaaag tctcacgcac aacgcctaaa cgcgtgctct 661 gctcgtccgt tgcatcctgg cgcgaaatga ctgcactgaa cagctggcga gtgttgaagt 721 ggggaactac gccgtgaaaa acaggaaaat gaaacccagg acgaaaccct gactcgctca 781 attccatttt gacttgttgc tcaatggggc gaataacatc ggttaacact cggcaaaggg 841 tggattccag ctcggcaaaa cgcagcgtaa agtcgcgact gatggtgttc tgcgccgtct 901 gtaacagtgt ctcacagcgg gtacgcatct cgcttaacgg ctccgaatca tcctgaaaca 961 aggcggctaa ctgcgcattc agcgcatctt gttgttgacg cagaaagtgg ttggcggagg 1021 tcagggccag ctcgatttca tgtttaatct cgccgctcac ctgcgcctga ttgagttgca 1081 atagctgcaa actttcttcg acctgatgga tattttgccg caattgttca caagcgacgt 1141 ttaacccgtg cgcacgaaaa tccaggtatt cccgcgcctg ctgcgcgtaa ttcaacagtt 1201 tatgcgcagc agatcgcaaa gcatacaacg aggcgttagc gtaagcggca tgaagcaacg 1261 cctgaattgg ctgggcgaac agcgaatctt cccacaactg atcggcagca tgacgaatat 1321 gttcgaggtc cgccagatcg gcatgacgcc agcgcctgcc gagcgcggca tgggcaaaat 1381 cttccaccca gcgttgttgc tctggcgctg gtaacttacc gttgttggct aactcatggc 1441 gcgcccgatt cgccaggtag ccccacatcg acgacaccgg aaatatctgc tgtggcgtaa 1501 tacagccttt catcagcgtc ccggaaatca gtgcccgcac ctggtcggcg tcgtcactgt 1561 tacgatcctg ttgatcgaac ttattgacca gcacatacag cggcaccgat tgccccaccg 1621 ccaaaatcgc ctcacggacc tcttcatcgg agatcgattt cagttgcgta taatccagca 1681 ccgccagtac cgccgaggcg cgtgccagct gctggttaag cattttttgc agatgcggtt 1741 gcccggcttc atttggcccg ggggtatcca gtaacgtcaa ctgaccggga taactctcca 1801 gccccgccag atggacaaac tccacttcaa tcacgggaat atgctcaatg gcggcgtaag 1861 cagaaaaagg aaaatcgacg tccagcgcct tcgccagtcg cactaaatca ttcaaacttt 1921 tcagacaatg aaaaataggc tgggcaccca gataatattt ttcgaaagcg acgccatttt 1981 cgatccgctg cataagcgca cgcatatctt tatctatttc cagcacatcg gtcagatgct 2041 taatatcgca atcacgcagg cgctgttgta attgttgaat taaacaatcg attggcgcga 2101 catgtgaaaa atgcagtacc ggttcctttt gcccgggcgt atggcgaata agcgtcggca 2161 gcgcagtcat tgggcgatta cgattaggca gaacctccgt accaacaatg gcattaatgg 2221 tggttgattt ccctgctttc atggtaccga caattgcaag caccatttcc agtcgggaaa 2281 ttttacgcaa ctcattattc agcatcgcgt gacgttcggc gatattaggc tgactccagg 2341 gtaaagccag ttgtggcgcg tcgtctccgg gtacagagag aggcattttt tccagtaact 2401 gcaactgttg gcgagaaagc tgtaacaggc gttcagcctc ctgacttaac tcatacaggg 2461 tctgtgtgta catagaaaat tcttccttaa agcaaatttt gttattttat ttagccagat 2521 tgtttttgag ttctgttttc ggcttttata attactgcaa gaaataattt tatatttagt 2581 gtgttgtttt ttatcagaat aaataacgtc ttctgatacg tttaaaacgt cagaaagata 2641 aaaatatcat gtgaattaaa aaaagaacaa gtagagcatt aacattatct taaataataa 2701 atagaggcaa aaagattatt ttctttttgc gtttcctttc aaatgaaaac gatcgtcgtc 2761 taaaatcagc agtacccccg acaaactcag ggattttgtg tataattgcg gcctttttcg 2821 gcaatctgcc gttttttggc gcttttgccc tgctgacttt tgaggaaatc cacatgtcat 2881 taccacactg cccaaaatgc aactccgaat acacttacga agataacggc atgtacatct 2941 gcccggaatg tgcctacgaa tggaacgacg cagaacctgc acaggaaagc gacgagctga 3001 tcgttaaaga tgctaacggc aatctgctgg ctgacggcga cagcgttacc atcattaaag 3061 atctgaaggt gaaaggtagc tcttcgatgc tgaaaattgg caccaaagtg aaaaacatcc 3121 gcctggttga aggcgaccat aacatcgatt gcaaaatcga cggttttggt ccgatgaaac 3181 tgaaatctga gtttgtgaaa aagaactgat tgtattgtga tcggtaagcc ggataaggcg 3241 ctcgcgccgc atccggcaac ggtgccagat gcctgatgcg acgcttgcgc gtcttatcag 3301 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac 3361 ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc 3421 tccgtaggcc ggataaggcg tttacgccgc atccggcaac agtgccaact gcctgatgcg 3481 acgcttgcgc gtcttatcag gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg 3541 tttacgccgc atccggcaat ggtgccgact gcctgatgcg acgcttgcgc gtcttatcag 3601 gcctacaaat tcccgcaccc tccgtaggcc ggataaggcg tttacgccgc atccggcaac 3661 agtgccgact gcctgatgcg acgctcgcgc gtcttatcag gccgcctctc atctgtataa 3721 atttcgaact acacttaact ggcttctctt aactgaggtc accatcatgc cgttaagtcc 3781 ctacctctct tttgccggta actgttccga cgcgattgcc tattatcaac gtacgttggg 3841 cgcggaactg ctctataaaa tcagcttcgg cgaaatgcca aaatcagcgc aggacagcgc 3901 cgagaactgc ccttccggaa tgcaatttcc cgataccgcc atcgctcatg ccaacgtgcg 3961 cattgccgga agcgacatca tgatgagcga tgccatgccg tcaggaaaag ccagctactc 4021 cggctttacg ctggtgctcg attcgcaaca ggtcgaagaa ggaaaacgct ggtttgacaa 4081 tcttgccgct aacggaaaaa tcgaaatggc ctggcaggaa actttctggg cgcatggctt 4141 tggcaaagtc accgataaat ttggcgtacc gtggatgatt aatgtcgtca aacaacaacc 4201 aacgcaataa cccgccggga ggcccgccct cccgcactgt catcgaattc ccgttaactc 4261 ttcatctgtt agtcactttt aattaaccaa atcgtcacaa taatccgcca cgatggagcc 4321 acttttttag ggaggctgca tcatgcaaac gattatccgt gtcgagaagc tcgccaaaac 4381 cttcaatcag catcaggcgc tgcatgcggt tgatctgaac attcatcacg gtgaaatggt 4441 ggctctgctt gggccgtcgg gttccggcaa atccaccctt ttacgtcact taagcggttt 4501 gattaccggc gataaatccg ccggcagcca tatcgagctg ctgggccgca cagtccagcg 4561 cgaaggccgt ctggcgcgcg atatccgcaa aagccgcgcc aacaccggct acatcttcca 4621 acaattcaac ctggtgaacc gcctgagcgt actggagaac gtgctgattg gcgcgctcgg 4681 cagcacgccg ttctggcgca cctgttttag ctggtttacc cgcgagcaga aacaacgcgc 4741 gttacaggcg ctgacccgcg ttggcatggt gcattttgcc catcaacgcg tttccaccct 4801 ctccggcgga cagcagcagc gtgtggcgat tgcccgcgcg ctgatgcagc aggcgaaggt 4861 gattctggcc gatgaaccca tcgcctcgct ggacccggaa tccgcccgca tcgtgatgga 4921 caccctgcgc gacatcaatc agaacgacgg catcaccgtg gtcgtcacgc tgcatcaggt 4981 ggattacgcc ctgcgctact gcgaacgcat cgtcgccctg cgccaggggc acgttttcta 5041 cgacggcagc agccaacagt ttgataacga acgttttgac catctctacc gcagcattaa 5101 tcgcatcgaa gagaacgcga aagctgcctg acatccccat cattgaggaa aacgaatgaa 5161 cgctaagata attgcctcgc tggccttcac cagcatgttc agcctcagca ccctgttaag 5221 cccggcacac gccgaagagc aggaaaaggc gctgaatttc ggcattattt caacggaatc 5281 acagcaaaac ctgaaaccgc aatggacgcc attcttacag gatatggaga agaagctggg 5341 cgtgaaggtg aacgccttct ttgccccaga ctacgcaggc attatccagg gaatgcgctt 5401 caataaagtg gatatcgcct ggtacggcaa cctgtcggca atggaagcgg tggatcgcgc 5461 caacggccag gtcttcgccc agacggtcgc ggcggatgga tcgccaggtt actggagcgt 5521 gttgatcgtc aacaaagata gtccgatcaa caacctgaac gatctgctgg cgaagcggaa 5581 agatctcacc ttcggcaatg gcgatcctaa ctccacctct ggcttcctcg tccccggtta 5641 ctacgtcttc gccaaaaaca atatctccgc cagcgacttc aagcgcaccg tcaacgccgg 5701 gcatgaaacc aacgcgctgg ccgtcgccaa caagcaggtg gatgtggcga ccaacaacac 5761 cgaaaacctc gacaagctga aaacctccgc gccggagaag ctgaaagaac tgaaagtgat 5821 ctggaaatcg ccgctgatcc caggcgatcc gatcgtctgg cgtaaaaatc tttccgaaac 5881 caccaaagac aagatctacg acttctttat gaattacggc aaaacgccgg aagagaaagc 5941 ggtgctggaa cgcctgggct gggcgccgtt ccgcgcctcc agcgacctgc aactggtgcc 6001 gattcgccag ctcgcactgt ttaaagagat gcagggcgtg aaaagcaata aaggactgaa 6061 tgagcaggac aagctggcaa aaaccaccgc gattcaggcg caactggatg acctggaccg 6121 cctgaacaac gcgctaagcg cgatgagttc ggtgagtaaa gcggtgcagt aaatcgtagg 6181 tcggataaga cgccccggcg tcgcatccga caatgtgcag gcgttgatgc cggatgcggt 6241 gcaagcacct tatccggcct acagaccgga gccaaacatg caaaccatca ccatcgcccc 6301 acccaagcgc agctggttct cgcttctgag ctgggccgtt gttctcgccg tgctggtcgt 6361 ctcgtggcag ggcgcggaaa tggccccgct cacgctgatt aaagacggcg gcaacatggc 6421 aaccttcgct gccgacttct tcccgcccga tttcagccag tggcaggatt acctcaccga 6481 aatggccgtc acgctgcaaa tcgccgtctg gggcaccgcg ctggcggtgg ttctctccat 6541 cccctttggc ctgatgagcg ccgaaaacct ggtgccgtgg tgggtttacc agcccgttcg 6601 ccgcctgatg gacgcctgcc gcgccattaa cgaaatggtc ttcgccatgc tgttcgtggt 6661 cgccgtcggt ctcggaccgt tcgctggcgt gctggcgcta tttatccaca ccaccggcgt 6721 gctctccaag ctgctttccg aagcggtaga agcaattgaa cctggcccgg tggaaggcat 6781 tcgcgccacc ggtgccaaca agctcgaaga gatcctctac ggcgtgctgc cgcaggtgat 6841 gccgctgctg atctcctact ccctctatcg cttcgaatcc aacgtccgct cggcgaccgt 6901 cgtcggcatg gtcggcgcgg gcgggatcgg cgtcaccctg tgggaagcga ttcgcggttt 6961 ccagttccaa caaacctgcg ccctgatggt gcttatcatc gtcacggtca gcctgctgga 7021 tttcctctct caacggttgc gtaagcactt tatctgataa gcgaggcatt gatatctatg 7081 cacttgtcta cacatccgac cagctaccca acacgctatc aagagatagc cgcaaaactt 7141 gagcaggagc ttcgtcaaca ctaccgctgc ggcgactatc ttcccgccga gcagcaactg 7201 gcagcgcgct ttgaggtgaa tcgccacacc ctgcgccgcg ccatcgacca actggtggaa 7261 aaaggctggg tacagcgccg tcagggcgtc ggcgtgctgg tgctgatgcg cccgttcgat 7321 tacccgctca acgcccaggc gcgttttagc cagaatctgc tggatcaggg cagccatccc 7381 accagcgaaa aactgctttc ggtattgcgc cccgcgtccg gccacgtcgc tgacgcactg 7441 gggattaccg agggggagaa cgtcatccac ctgcgcaccc tgcgtcgggt caacggcgtc 7501 gcgctctgtt taatcgacca ctacttcgcg gacctcaccc tctggccgac gctgcaacgc 7561 ttcgacagcg gctcgctgca cgattttctg cgcgagcaaa ccggaattgc gctgcgccgc 7621 agccagacgc ggatcagcgc ccgccgcgcc caggccaaag agtgccagcg tcttgaaatc 7681 ccgaatatgt cgccgctgct gtgcgtgcgc acccttaacc accgtgacgg tgaaagcagc 7741 ccggcggagt actccgtcag cctgacgcgc gccgacatga ttgaattcac tatggagcac 7801 tgaatgcacg cagataccgc gacccgccag cactggatgt ccgtgctggc gcacagccaa 7861 ccggctgaac tggcagcacg cctgaacgcg ctaaacatca ccgccgacta tgaggtgatc 7921 cgcgccgctg aaactggcct ggtacagatt caggcgcgga tgggcggcac cggcgaacgt 7981 ttttttgccg gcgacgccac gctgacccgc gccgccgtgc gcctgactga cggcacgctc 8041 ggctacagct gggtgctggg gcgtgataaa cagcacgccg aacgctgcgc gctgattgac 8101 gcgctgatgc agcaatctcg ccactttcaa aacttatcag aaacccttat tgccccgctg 8161 gacgctgacc gtatggcacg cattgccgca cgccaggccg aagtgaacgc cagccgggtc 8221 gacttcttta cgatggttcg cggagacaac gcatgaccct ggaaaccgct tttatgcttc 8281 ccgtgcagga tgcccagcac agttttcgtc gcctgttaaa ggccatgagc gagccgggcg 8341 tgattgtcgc cctgcatcag ctcaaacgcg gctggcaacc gctgaatatc gccaccacca 8401 gcgtgctgct gacgctggcc gataacgaca cgccggtgtg gctttctacc ccattaaata 8461 acgatatcgt caaccagagc ctgcgttttc ataccaacgc gccgctggtc agccagccgg 8521 aacaggcgac cttcgcggtg acggatgagg cgatttccag cgaacagctc aacgcccttt 8581 ccaccggcac cgccgttgcg ccggaagcgg gcgcgacgct gattttacag gtcgccagcc 8641 tgagcggcgg gcgcatgttg cgtctcaccg gcgcgggtat tgccgaagaa cgaatgatcg 8701 ctccgcagct gccggagtgc attctgcacg aactcaccga gcgcccgcac ccgttcccgc 8761 tcggcatcga cctgatcctg acctgcggcg aacgcctgct ggctattccg cgaaccacgc 8821 atgtggaggt gtgctgatgt acgttgccgt aaaagggggc gaaaaggcga tcgacgccgc 8881 ccacgccctg caagagagcc gacgccgggg cgataccgat ttgcctgaac tgagcgtcgc 8941 ccagattgaa cagcagctta acctcgcggt agatcgcgtg atgaccgaag gcggcattgc 9001 cgaccgcgaa ctggcggcgc tggcgctgaa acaggccagc ggcgataacg ttgaagcgat 9061 tttcctgctg cgcgcctacc gcaccacgtt ggcgaagctg gcggtaagcg agccgctcga 9121 caccaccggg atgcgtctcg aacgccgtat ctccgccgtt tataaagaca ttcccggcgg 9181 ccagctgctt ggcccaacct acgactacac ccatcgcctg ctcgatttta ccctgctggc 9241 aaacggcgaa gcgccgacgc tgaccaccgc cgacagcgaa caacagccgt cgccgcacgt 9301 tttcagcctg ctggcgcgtc aggggctggc gaagtttgaa gaggatagcg gcgcacagcc 9361 ggatgacatc acccgcacgc cgccggttta cccctgctca cgttcttccc gtttgcagca 9421 gttgatgcgc ggcgacgaag gctatttgct ggcgctggcc tactccaccc agcgtggtta 9481 cggacgcaat cacccgttcg cgggcgagat ccgcagtggt tacatcgacg tgtcgattgt 9541 gccggaagag ctgggatttg cggtaaacgt cggcgaacta ctgatgaccg agtgtgaaat 9601 ggtcaacggt tttatcgacc cgccggatga gccgccgcac ttcacgcgcg gctacgggct 9661 ggtattcggc atgagcgagc gcaaagcgat ggcaatggcg ctggtcgatc gtgcgttgca 9721 ggctccggaa tacggcgagc acgcgacagg cccggcgcag gatgaagagt ttgtgctggc 9781 acatgccgac aacgtcgaag ccgcaggctt tgtctcgcac ctcaaactcc cccactacgt 9841 cgatttccag gccgaactgg agctactcaa acgtctgcaa caggagaaga accatggcta 9901 atctgagcgg ctacaacttt gcctacctcg acgagcagac caaacgcatg atccgccgcg 9961 ccatcttaaa agcggtggcg atccccggtt atcaggtgcc gtttggcggg cgcgagatgc 10021 cgatgccata cggctgggga accggcggca tacagctcac cgccagcgtg attggcgaaa 10081 gcgacgtgct aaaggtgatt gaccagggtg cggatgacac caccaacgcc gtgtcgattc 10141 gcaacttctt taagcgcgtg accggggtaa acaccactga acgtacggac gatgcgacgc 10201 ttatccagac gcgtcaccgc atccccgaaa cgccgctgac cgaagatcag atcattatct 10261 tccaggtgcc aatcccggaa ccgctgcgct ttatcgagcc gcgcgaaacg gaaacccgca 10321 ccatgcacgc gctggaagag tacggcgtga tgcaggtgaa actgtatgaa gatatcgccc 10381 gcttcggtca tatcgccact acctacgcct atccggtgaa ggtgaacggg cgctacgtaa 10441 tggacccgtc gccgatcccg aaattcgata acccaaaaat ggacatgatg cccgccctgc 10501 aactgttcgg cgcggggcgc gagaagcgca tctatgcggt gccgccgttt acccgcgtgg 10561 aaagtctcga tttcgacgat cacccgttca ccgttcagca gtgggatgag ccatgcgcca 10621 tctgcggatc gacccacagc tatcttgatg aagtggtgct ggatgacgcc ggaaaccgca 10681 tgtttgtctg ctccgatacc gattattgcc gccaacagag cgaggcaaaa aaccaatgaa 10741 tcaaccgtta ctttcggtca ataacctgac ccacctttac gcgccgggca aaggctttag 10801 cgatgtctct tttgatttat ggccggggga agtgctgggc attgtcgggg aatccggctc 10861 cgggaagacc acgctgctga agtcgatctc cgcgcgcctg acgccgcagc agggggaaat 10921 tcactacgag aaccgttcgc tgtatgcaat gagcgaggcc gaccgccgtc gcctgctgcg 10981 taccgaatgg ggcgtggtgc atcagcatcc actcgacggc ctgcgccgcc aggtgtcggc 11041 aggcggcaat atcggcgagc ggctgatggc gaccggggca cgtcattacg gcgatattcg 11101 tgccaccgcg cagaagtggc tggaagaggt ggagattccc gccaaccgga tcgacgacct 11161 gccgaccacc ttttccggcg gtatgcagca gcgtttgcag attgcccgca acctggtgac 11221 gcatccgaag ctggtgttta tggatgaacc gaccggcggg ctggatgtgt cggtgcaggc 11281 ccgcctgctc gacctgctgc gcggcctggt ggtggagctg aacctcgcgg tggtgattgt 11341 cacccatgat ttaggcgtcg cccgcctgct ggcggaccgt ttgctggtga tgaagcaggg 11401 gcaagtggtg gagagtgggt taaccgaccg cgtgctcgac gacccgcatc atccgtatac 11461 acagctgctg gtgtcatcgg ttttgcagaa ttgagccggt gccggatgcg gcgtaaacgc 11521 cttatccggc ctacaaatgc gctccccgta ggtcggataa gacgcgtcag cgtcgcatcc 11581 gacacccgaa ccacgaggcg aaaaatgatt aacgtacaaa acgtcagtaa aaccttcatc 11641 ctgcaccagc aaaacggcgt gcgcctgccc gtcctcaatc gcgcctcgct caccgtcaac 11701 gcgggcgaat gcgtggtgct ccacggccat tccggcagcg gcaaatcaac tctgctacgc 11761 tcgctgtacg ccaactatct acccgacgaa ggtcaaatcc agatcaaaca cggtgacgag 11821 tgggtagacc tggtcaccgc gccagcgcgc aaagtggtgg aaatccgcaa aaccaccgtc 11881 ggctgggtga gccagtttct gcgcgtcatc ccgcgtatct cagcactgga agtggtgatg 11941 cagccgctgc tcgataccgg cgttccgcgt gaagcctgcg ccgctaaagc cgcgcgtctt 12001 ctcacccgcc tgaacgtgcc ggaacgcctg tggcacctgg caccatcgac attttccggt 12061 ggcgaacagc agcgcgtcaa catcgcccgc ggctttatcg tcgactaccc cattctgctg 12121 cttgacgaac ctaccgcctc gctggacgcc aaaaacagcg ccgcggtggt ggaactgatt 12181 cgcgaagcca aaacccgtgg cgcagccatc gtaggcatct tccatgacga agctgtacgt 12241 aatgacgtcg ccgaccgcct gcacccaatg ggagcctctt catgattatc aataacgtta 12301 agctggtgct ggaaaacgag gtggtaagcg gttcgctgga ggtgcagaac ggcgaaatcc 12361 gcgcctttgc cgaaagccag agccgcctgc cggaggcgat ggacggcgaa ggcggctggc 12421 tgctgccggg gctgattgag ctgcataccg ataatctgga taaattcttc accccgcgcc 12481 cgaaagttga ctggcctgcc cactcggcga tgagcagcca cgacgcgctg atggtggcga 12541 gcggcatcac caccgtactg gatgccgtgg caattggcga cgtgcgcgac ggcggcgatc 12601 ggctggagaa tctggagaag atgatcaacg ccatcgaaga gacgcagaaa cgcggcgtca 12661 accgcgccga gcaccgtctg catctgcgct gcgaactgcc gcatcacacc acgctgccgc 12721 tgtttgaaaa actggtgcag cgcgagccgg tgacgctggt gtcgctgatg gaccactcgc 12781 cgggccagcg ccagttcgcc aaccgcgaga agtatcgcga atattatcag ggcaaatact 12841 ccctcactga tgcgcagatg cagcagtacg aagaagagca actggcgctc gccgcacgct 12901 ggtcgcagcc gaatcgcgaa tccatcgccg ccctgtgccg cgcgcgaaaa attgcgcttg 12961 ccagccacga tgacgccacc cacgcccacg ttgctgaatc tcaccagctt ggcagcgtga 13021 tcgccgaatt tcccaccacg ttcgaagcgg cggaagcctc gcgcaagcat ggcatgaacg 13081 tgctgatggg cgcgccgaat attgtgcgcg gcggctcgca ctccggcaac gtggcggcca 13141 gtgaactggc gcagcttggc ctgctggata tcctctcttc cgactactac cccgccagcc 13201 tgctcgatgc ggcatttcgc gtcgccgatg acgagagcaa ccgctttacg ctgccgcagg 13261 cggtgaagct ggtgactaaa aatccagcgc aggcgcttaa tctccaggat cgcggggtga 13321 ttggcgaggg caaacgcgcc gacctggtgc tggcgcatcg caaggacaat catattcata 13381 tcgaccacgt ctggcgtcag ggtaaaaggg tgttctgatg atgggaaaac tgatttggtt 13441 aatggggccg tccggctccg ggaaagacag cctgctggcg gaactccgcc tgcgggaaca 13501 aactcagtta ctggtggcgc atcgctacat cacgcgcgat gccagcgccg gaagtgaaaa 13561 ccatatcgcc ctgagcgagc aggagttttt tacccgcgcg gggcaaaatc tgttggcctt 13621 aagctggcac gctaacggtc tgtattatgg cgtcggcgtc gagattgatc tctggctgca 13681 cgccggattc gacgtgctgg tcaacggctc acgcgcccat ctgccgcagg cgcgggcgcg 13741 ctatcaatcg gcgctgctgc ccgtctgttt acaggtttcg ccggagatcc tccgccagcg 13801 cctggaaaac cgtggccgtg aaaacgccag tgaaattaac gcccgcctgg cgcgcgccgc 13861 ccgctatact ccacaggatt gccatacgct caacaatgac ggcagcctgc gccagtcggt 13921 cgacacgctg ctgacgctga tccatcagaa ggagaaacac catgcctgct tgtgagcttc 13981 gcccggccac gcagtacgac accgacgcgg tttacgcgct gatttgtgag ctaaaacagg 14041 cggagtttga ccaccacgcg tttcgcgtgg gttttaacgc caatctgcgc gacccaaaca 14101 tgcgctacca tctggcgctg cttgatggcg aagttgtcgg catgatcggc ctgcatttgc 14161 agtttcatct gcatcatgtc aactggatcg gcgaaattca ggagttggtg gtaatgccgc 14221 aggcgcgcgg tctgaacgtc ggcagtaagt tactggcgtg ggcagaagaa gaagcccgcc 14281 aggccggggc cgaaatgacc gaactttcga ccaacgtgaa gcgccacgac gcgcaccgtt 14341 tctatctgcg cgaaggctac gagcagagcc acttccgctt caccaaggcg ctgtaacatg 14401 agcctgaccc tcacgctcac cggcaccggc ggcgcacagg gcgttccggc atggggctgc 14461 gagtgtgcgg cctgcgccag agcgcggcgc tcgccgcagt atcgccgcca accgtgcagc 14521 ggcgtagtga agtttaacga cgcaatcacc ctgatcgacg ccgggctgca cgatctcgcc 14581 gatcgctggt cgcccggatc gttccagcag tttttgctga cgcattatca tatggatcac 14641 gtccaggggc tgtttccgct gcgctggggc gttggcgatc cgatcccggt ttacggcccg 14701 ccggatgaac agggctgcga cgatctgttt aaacatccgg gcctgcttga tttcagccac 14761 acggtggaac cgtttgtggt gtttgatttg caggggttac aggtcacgcc cctgccgctc 14821 aaccactcaa aactgacctt cggttatctg ctggaaacgg cacacagccg ggtggcgtgg 14881 ctgtctgaca ccgcaggctt gccggaaaaa acgctgaaat ttttacgcaa taatcagccg 14941 caggtaatgg tgatggattg cagtcacccg ccgcgcgcgg atgcaccgcg taatcactgt 15001 gatttaaata ccgtgcttgc gctgaatcag gttatccgct cgccacgggt gattctgacc 15061 catatcagcc accagtttga tgcgtggctg atggaaaacg cactaccgtc agggtttgag 15121 gtggggtttg atgggatgga gattggggtg gcgtgatgag agggaatgtg cgcgctggcc 15181 ccctcaccct aaccctctcc ccagaggggc gaggggaccg attgtgctcg atattgaata 15241 ttgcgctcgt tttctccctc tccccattgg ggtgaggggc gatgcctgct ccatacccaa 15301 cctcatcgcc catactcatc ttccattctc cgctcttcat cctccagttg ccgacgctcc 15361 tgatcaagct ggcgctggcg atcgtccagc tgcctgcggc gatcttcaaa ctggcggcgg 15421 cggtcgtcat attgtctgcg ccgatcgtcg ctcacttcac gctgccagcc gtggtcgcgc 15481 gaatcttcat agttgaagcg gcgcacgaaa aacgcgaaag cgtttcacga taaatgcgaa 15541 aactttagct ttcgcgcttc aaatgaaaca gatgtattaa ttactgcttt ttattcatta 15601 catggggatc c //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMJNLTRA 765 bp ss-mRNA PRI 22-FEB-1990 DEFINITION Human DNA/endogenous retroviral long terminal repeat (LTR) junction mRNA, clone lambda-LTR8. ACCESSION M32219 KEYWORDS Alu repetitive sequence; long terminal repeat. SOURCE Human, cDNA to mRNA, clone lambda-LTR8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 765) AUTHORS Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and Matsubara,K. TITLE Transcription of human endogenous retroviral long terminal repeat (LTR) sequence in a lung cancer cell line JOURNAL Biochem. Biophys. Res. Commun. 166, 1-10 (1990) STANDARD simple staff_review FEATURES from to/span description recomb 223 224 human DNA end; endogenous retrovirus DNA start rpt 224 342 Alu repeat LTR 343 765 long terminal repeat BASE COUNT 252 a 158 c 148 g 207 t ORIGIN 1 ggaaccccaa ccttacatgt aatacaaact taactcaaaa tggatcatat atctaaatgt 61 aaaatggaaa gctataaaac tgaaaacaga ctatctttac aacctaggcg taggtatagt 121 ttttagacat tacaccaaaa gcacatgccg taaaagaaaa aatagataaa ttggtggatt 181 tcattaaaat taaaaaactt tttctctctg aaaaatcctg ttaagctggg cgctgtggtt 241 catgcctgta atcccagcac tttgggaggc tgagttggga agaaattaat agcttgaggc 301 caggagttca agatcatcct gggcagcaaa gtcatacact cttgagggaa gagagagacc 361 ttctcatatt gttttatatt gttttatact cagtacctgt tttaagaaaa aaacaaggaa 421 gtgaaatcaa agacaggcag cccggcacca ggcctgaaac cagccctggg cctgcctggc 481 ctaaacctag tagttaaaaa tcaacttacg acttagaacc tgatgttatc cgtagattcc 541 aagcattgta taaaaaaatt gtgaaactcc ctgttgtgtt ctgtaccagt gcatgaaacc 601 cctgtcacat atcccctaga ttgctcaatc aatcacgacc ctttcatgtg aaatctttag 661 tgttgtgagc ccttaaaagg gacagaaatt gtgcacttga ggagctcaga ttttaaggct 721 gtagcttgcc gatgctccca gctgaataaa gcccttcctt ctaca // LOCUS HUMJNLTRB 551 bp ss-mRNA PRI 22-FEB-1990 DEFINITION Human DNA/endogenous retroviral long terminal repeat (LTR) junction mRNA, clone lambda-LTR22. ACCESSION M32220 KEYWORDS long terminal repeat. SOURCE Human, cDNA to mRNA, clone lambda-LTR22. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 551) AUTHORS Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and Matsubara,K. TITLE Transcription of human endogenous retroviral long terminal repeat (LTR) sequence in a lung cancer cell line JOURNAL Biochem. Biophys. Res. Commun. 166, 1-10 (1990) STANDARD simple staff_review FEATURES from to/span description recomb 120 121 human DNA end; endogenous retrovirus DNA start LTR 121 551 long terminal repeat BASE COUNT 146 a 146 c 129 g 130 t ORIGIN 1 ggcttgagct gcagtcacag gccttggctg gaccagggat ggcccccagc tcccaggagg 61 gcccactgac cctgcagctc cagccttctc catacttcaa caaagaatga gttgtggcaa 121 tgagggaaga gagaccctct catagtgttt tatactcagt acctgtttta agaaaaaaca 181 acaaggaagt aaaaccaaag acaggcaggc agcctggcgc taggcccgaa accaggcctg 241 cgcctgcctg gcctaaaccc agtagttgaa aatcaattca taacttagaa accgatgtta 301 ttcatagatt ccagacattg tatagaagaa catttgtgaa actccctgcc gtgttctgtt 361 tctctctgac cgccggtgca tgcagcgcct gtcacgtacc gcctgcttgc tcaaatcaat 421 gacgaccctt tcatgtgaaa tcttcgtgtt gtgagccctt aaaagggaca gaaattgtgc 481 acttggggag ctcggatttt aaggcagtag cttgccgatg ctcccagctg aataaagccc 541 ttccttctac a //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMJNLTRA 765 bp ss-mRNA PRI 22-FEB-1990 DEFINITION Human DNA/endogenous retroviral long terminal repeat (LTR) junction mRNA, clone lambda-LTR8. ACCESSION M32219 KEYWORDS Alu repetitive sequence; long terminal repeat. SOURCE Human lung cancer cell line, cDNA to mRNA, clone lambda-LTR8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 765) AUTHORS Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and Matsubara,K. TITLE Transcription of human endogenous retroviral long terminal repeat (LTR) sequence in a lung cancer cell line JOURNAL Biochem. Biophys. Res. Commun. 166, 1-10 (1990) STANDARD simple staff_review FEATURES from to/span description recomb 223 224 human DNA end; endogenous retrovirus DNA start rpt 224 342 Alu repeat LTR 343 765 long terminal repeat BASE COUNT 252 a 158 c 148 g 207 t ORIGIN 1 ggaaccccaa ccttacatgt aatacaaact taactcaaaa tggatcatat atctaaatgt 61 aaaatggaaa gctataaaac tgaaaacaga ctatctttac aacctaggcg taggtatagt 121 ttttagacat tacaccaaaa gcacatgccg taaaagaaaa aatagataaa ttggtggatt 181 tcattaaaat taaaaaactt tttctctctg aaaaatcctg ttaagctggg cgctgtggtt 241 catgcctgta atcccagcac tttgggaggc tgagttggga agaaattaat agcttgaggc 301 caggagttca agatcatcct gggcagcaaa gtcatacact cttgagggaa gagagagacc 361 ttctcatatt gttttatatt gttttatact cagtacctgt tttaagaaaa aaacaaggaa 421 gtgaaatcaa agacaggcag cccggcacca ggcctgaaac cagccctggg cctgcctggc 481 ctaaacctag tagttaaaaa tcaacttacg acttagaacc tgatgttatc cgtagattcc 541 aagcattgta taaaaaaatt gtgaaactcc ctgttgtgtt ctgtaccagt gcatgaaacc 601 cctgtcacat atcccctaga ttgctcaatc aatcacgacc ctttcatgtg aaatctttag 661 tgttgtgagc ccttaaaagg gacagaaatt gtgcacttga ggagctcaga ttttaaggct 721 gtagcttgcc gatgctccca gctgaataaa gcccttcctt ctaca // LOCUS HUMJNLTRB 551 bp ss-mRNA PRI 22-FEB-1990 DEFINITION Human DNA/endogenous retroviral long terminal repeat (LTR) junction mRNA, clone lambda-LTR22. ACCESSION M32220 KEYWORDS long terminal repeat. SOURCE Human lung cancer cell line, cDNA to mRNA, clone lambda-LTR22. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 551) AUTHORS Tomita,N., Horii,A., Doi,S., Yokouchi,H., Ogawa,M., Mori,T. and Matsubara,K. TITLE Transcription of human endogenous retroviral long terminal repeat (LTR) sequence in a lung cancer cell line JOURNAL Biochem. Biophys. Res. Commun. 166, 1-10 (1990) STANDARD simple staff_review FEATURES from to/span description recomb 120 121 human DNA end; endogenous retrovirus DNA start LTR 121 551 long terminal repeat BASE COUNT 146 a 146 c 129 g 130 t ORIGIN 1 ggcttgagct gcagtcacag gccttggctg gaccagggat ggcccccagc tcccaggagg 61 gcccactgac cctgcagctc cagccttctc catacttcaa caaagaatga gttgtggcaa 121 tgagggaaga gagaccctct catagtgttt tatactcagt acctgtttta agaaaaaaca 181 acaaggaagt aaaaccaaag acaggcaggc agcctggcgc taggcccgaa accaggcctg 241 cgcctgcctg gcctaaaccc agtagttgaa aatcaattca taacttagaa accgatgtta 301 ttcatagatt ccagacattg tatagaagaa catttgtgaa actccctgcc gtgttctgtt 361 tctctctgac cgccggtgca tgcagcgcct gtcacgtacc gcctgcttgc tcaaatcaat 421 gacgaccctt tcatgtgaaa tcttcgtgtt gtgagccctt aaaagggaca gaaattgtgc 481 acttggggag ctcggatttt aaggcagtag cttgccgatg ctcccagctg aataaagccc 541 ttccttctac a //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS SHPPACAP 1763 bp ss-mRNA MAM 23-FEB-1990 DEFINITION Ovine pituitary adenylate cyclase activating polypeptide (PACAP38) mRNA, complete cds. ACCESSION M32216 KEYWORDS pituitary adenylate cyclase activating polypeptide. SOURCE Ovine hypothalamus, cDNA to mRNA, clone pOH38P7. ORGANISM Ovis sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1763) AUTHORS Kimura,C., Ohkubo,S., Ogi,K., Hosoya,M., Itoh,Y., Onda,H., Miyata,A., Jiang,L., Dahl,R.R., Stibbs,H.H., Arimura,A. and Fujino,M. TITLE A novel peptide which stimulates adenylate cyclase: Molecular cloning and characterization of the ovine and human cDNAs JOURNAL Biochem. Biophys. Res. Commun. 166, 81-89 (1990) STANDARD simple staff_review FEATURES from to/span description pept 577 1107 pituitary adenylate cyclase activating polypeptide precursor sigp 577 648 pituitary adenylate cyclase activating signal peptide matp 970 1083 pituitary adenylate cyclase activating peptide BASE COUNT 411 a 490 c 418 g 444 t ORIGIN 1 ctgctaactg cccagataaa taggagcaga gggctggtca cctctgtaat aaccaccggc 61 agcagtagaa gaaaccgcag cttcagaagc agccagagag acttctgagc agcgaaggcg 121 ctgcctgctc gagctgcctg gccgggcggc tgccccagac gccgacttcg ccgaggccct 181 ctctctctct ctctctctct ctctctctct ctctctctct ctctgcttct ttccttatca 241 ctcctttctt ctcagtggac ttcaggccac tttgtctccc acccccactc agctcgtcgc 301 ctcctccgtc ttccttctcc atctctcctc tcgcccccct tctctcagtg tcacgctccg 361 tcctagttcc gagcgtcgtc aaacttttga acagaataac aggactcagc aaacaagtcc 421 tccagctcct cccgcggctc cggctcgttc ctgcggctcc tgctcagaca ctaacgccag 481 acggcgatgc ctcttgggtt gtgactacag cgcacaaact tggagaagct ctttgcccgc 541 cgtcctactt ggcagcaaat cctctcctgg cagcgaatga ccatgtgtag cggagcgagg 601 ctggccctgc tcgtttacgg gatactgatg cacagcagcg tctacggctc acctgccgcc 661 tccggactcc ggttcccggg gatcaggccg gagaacgagg cgtacgacga ggacggaaac 721 ccgcagcagg acttctacga ctcggagccg ccaggcgtgg ggagccccgc ctccgcgctg 781 cgcgatgcct acgcgctcta ctacccggcg gaggaaagag atgtcgccca cgggatcctt 841 gataaggcct accgcaaagt gctggaccag ctgtccgcca ggagatacct gcagacgctc 901 atggccaagg gcttgggtgg gaccccgggc ggcggcgcgg acgacgactc ggagccgctc 961 tccaagcgcc actcggacgg catcttcact gacagctaca gccgctaccg gaagcaaatg 1021 gctgttaaga aatacttggc ggctgtccta gggaaaaggt ataaacaaag ggttaagaac 1081 aaaggacggc gaataccgta cttgtagcga cgagttacca gctatcctgt gtatacagcc 1141 ctgacacaat gagaagtcgt ttttcccaac tgactgaact gtcatcgctg ctgtgttctg 1201 tcccacatgt atttatgtat gaagtcaagc cattaaatga atattttgat aataatattg 1261 tttttctttt tacgaagcac tggagaatgc acagatatac tttgtggacc aattattgat 1321 attgacatat atattacgaa tatataaaga gtatatatat atatatataa gtataataga 1381 gagccgttca tacagtgtgc acaaggactg aagattcgcc tgagctgttt gtttttatat 1441 aaaataaata gaaaaataga caatcattgt tttgaatatt actcctattt ttgtaaactg 1501 gaattaaaag gatagtattt ttatccacaa taggcctgaa gatattaatc ctgaccattt 1561 gctactgtac ataaacagtg atgccctgct ccagggagac tttgaggtaa tgatttggga 1621 ggattgctga aggtctctct ttcccaggga gtctctgggg caggctgctt caatcccagc 1681 tgaactcgac tgaggctctg tctacccctt gctgggtggc aatgccaata cttccgcttt 1741 ctttgattct atttttatgt gta //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMZFX 3000 bp ss-mRNA PRI 04-DEC-1989 DEFINITION Human zinc finger protein X-linked (ZFX) mRNA, complete cds. ACCESSION M30608 KEYWORDS zinc finger protein. SOURCE Human HeLa cell line, cDNA to mRNA, clone pCD5.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3000) AUTHORS Palmer,M.S., Berta,P., Sinclair,A.H., Pym,B. and Goodfellow,P.N. TITLE Comparison of human ZFY and ZFX transcripts JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 1681-1685 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.S.Palmer, 08-DEC-1989. FEATURES from to/span description pept 259 2676 zinc finger protein /nomgen="ZFX" /hgml_locus_uid="LS0126V" /map="Xp22.1-p21.3" BASE COUNT 918 a 588 c 693 g 801 t ORIGIN 1 gtccgtccgg ttcttgctat attgccccag gctggtctcg aactcctggg ctcaagccgt 61 tctcccgcct cccactgccc gcctgtcact gccgtctgtt ccctgagctg tgctttacgc 121 tgggaaagac atagaagaac tattaagaag atagaattgt tttgctgcgc agtacagcaa 181 cagtggatgt tcaagattaa gattagagtc aagttgtgtg attaagacag gagctgtgac 241 tgatgagaat taaaggccat ggatgaagat gggcttgaat tacaacaaga gccaaactca 301 ttttttgatg caacaggagc tgatggtaca cacatggatg gtgatcaaat tgttgtggaa 361 gtacaagaaa ctgtttttgt ttcagatgtt gtggattcag acataactgt gcataacttt 421 gttcctgatg acccagattc agttgtaatc caagatgtta ttgaggacgt tgttatagaa 481 gatgttcagt gcccagatat catggaagaa gcagatgtgt ctgaaacggt catcattcct 541 gagcaagtgc tggactcaga tgtaactgaa gaagtttctt tagcacattg cacagtccca 601 gatgatgttt tagcttctga cattacttca gcctcaatgt ctatgccaga acacgtcttg 661 acgggtgatt ctatacatgt gtctgacgtt ggacatgttg gacatgttgg acatgttgaa 721 catgtggttc atgatagtgt agtggaagca gaaattgtca ctgatcctct gactaccgac 781 gtagtttcag aagaagtatt ggtagcagac tgtgcctctg aagcagtcat agatgccaat 841 gggatccctg tggaccagca ggatgatgac aaaggcaact gtgaggacta ccttatgatt 901 tccttggatg atgctggcaa aatagaacac gatggttctt ctggaatgac catggacaca 961 gagtcggaaa ttgatccttg taaagtggat ggcacttgcc ctgaggtcat caaggtgtac 1021 atttttaaag ctgaccctgg agaagatgac ttaggtggaa ctgtagacat tgtggagagt 1081 gagcctgaga atgatcatgg agttgaactg cttgatcaga acagcagtat tcgtgttccc 1141 agggaaaaga tggtttatat gactgtcaat gactctcagc cagaagatga agatttaaat 1201 gttgctgaaa tcgctgacga agtttatatg gaagtgatcg taggagagga ggatgctgca 1261 gcagcacggg cagccgccgc cgtgcacgag cagcaaatgg atgacaatga aatcaaaacc 1321 ttcatgccga ttgcatgggc agcagcttat ggtaataatt ctgatggaat tgaaaaccgg 1381 aatggcactg caagtgccct cttgcacata gatgagtctg ctggcctcgg cagactggct 1441 aaacaaaaac caaagaaaag gagaagacct gattccaggc agtaccaaac agcaataatt 1501 attggccctg atggacatcc tttgactgtc tatccttgca tgatttgtgg gaagaagttt 1561 aagtcgagag gttttttgaa aaggcacatg aaaaaccatc ccgaacacct tgccaagaag 1621 aaataccgct gtactgactg tgattacact accaacaaga agataagttt acacaaccac 1681 ctggagagcc acaagctgac cagcaaggca gagaaggcca ttgaatgcga tgagtgtggg 1741 aagcatttct ctcatgcagg ggctttgttt actcacaaaa tggtgcataa ggaaaaagga 1801 gccaacaaaa tgcacaagtg taaattctgt gaatacgaga cagctgaaca agggttattg 1861 aatcgccacc tcttggcagt ccacagcaag aactttcctc atatttgtgt ggagtgtggt 1921 aagggttttc gtcacccgtc agagctcaaa aagcacatga gaatccatac tggggagaag 1981 ccgtaccaat gccagtactg cgaatatagg tctgcagact cttctaactt gaaaacgcat 2041 gtcaaaacta agcatagtaa agagatgcca ttcaagtgtg acatttgtct tctgactttc 2101 tcggatacca aagaggtgca gcaacatgct cttatccacc aagaaagcaa aacacaccag 2161 tgtttgcatt gcgaccacaa gagttcgaac tcaagtgatt tgaaacgaca cataatttca 2221 gttcacacga aagactaccc ccataagtgt gacatgtgtg ataaaggctt tcacaggcct 2281 tcagaactca agaaacacgt ggctgcccac aagggcaaaa aaatgcacca gtgtagacat 2341 tgtgacttta agattgcaga tccatttgtt ctaagtcgcc atattctctc agttcacaca 2401 aaggatcttc catttaggtg caagagatgt agaaagggat ttaggcaaca gagtgagctt 2461 aaaaagcata tgaagacaca cagtggcagg aaagtgtatc agtgtgagta ctgtgagtat 2521 agcactacag atgcctcagg ctttaaacgg cacgttattt ccattcacac gaaagactat 2581 cctcaccggt gtgagtactg caagaaaggc ttccgaagac cttcagaaaa gaaccagcac 2641 ataatgcgac atcataaaga agttggcctg ccctaacaat acttctacag aacgtttgta 2701 gagatattgg ccttgaagca gaaaattcat tttaaagcca atcagtctca ttcacataca 2761 atactgtata ttgatttatg ctgtgtacaa atagaattat tacttctagt tgactttttt 2821 ttaaatatac attttgctca gtagtgtgtt ctgaattcta ttcagtttgt ttaataaata 2881 gggaaaactg gcaacatgct agttactttt aataaagtaa tccctgattc tataccgaag 2941 ttttatatct tagaatttta tatttattta aatatttacc ttgcttacct tgatggtact // LOCUS HUMZFY 2659 bp ss-mRNA PRI 04-DEC-1989 DEFINITION Human zinc finger protein Y-linked (ZFY) mRNA, complete cds. ACCESSION M30607 KEYWORDS zinc finger protein. SOURCE Human foreskin fibroblast, cDNA to mRNA, (pCD2Bassing library of H.Okayama), clones pMF-1 and p5.8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2659) AUTHORS Palmer,M.S., Berta,P., Sinclair,A.H., Pym,B. and Goodfellow,P.N. TITLE Comparison of human ZFY and ZFX transcripts JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 1681-1685 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.S.Palmer, 08-DEC-1989. FEATURES from to/span description pept 1 2406 zinc finger protein /nomgen="ZFY" /hgml_locus_uid="LU0028V" /map="Yp11.3" mRNA < 259 2630 ZFY mRNA (pot.) BASE COUNT 861 a 488 c 597 g 713 t ORIGIN 1 atggatgaag atgaatttga attgcagcca caagagccaa actcattttt tgatggaata 61 ggagctgatg ctacacacat ggatggtgat cagattgttg tggaaataca agaagcagtt 121 tttgtttcta atattgtgga ttctgacata actgtgcata actttgttcc tgatgaccca 181 gactcagttg taatccaaga tgttgttgaa gatgttgtca tagaggagga tgttcagtgc 241 tcagatatct tagaagaggc agatgtatct gaaaatgtca tcattcctga gcaagtgctg 301 gactcagatg taactgaaga agtttcttta ccacactgca cagtcccaga tgatgtttta 361 gcttctgaca ttacttcaac ctcaatgtct atgccagaac atgttttaac gagtgaatcc 421 atgcatgtgt gtgacattgg acatgttgaa catatggtgc atgatagtgt agtggaagca 481 gaaatcatta ctgatcctct gacgagtgac atagtttcag aagaagtatt ggtagcagac 541 tgtgcccctg aagcagtcat agatgccagc gggatctcag tggaccagca agataatgac 601 aaagccagct gtgaggacta cctaatgatt tcgttggatg atgctggcaa aatagaacat 661 gatggttcca ctggagtgac catcgatgca gaatcagaaa tggatccttg taaagtggat 721 agcacttgtc ctgaagtcat caaggtgtac atttttaaag ctgaccctgg agaagatgac 781 ttaggtggaa ctgtagacat tgtggagagt gaacctgaaa atgatcatgg agttgaacta 841 cttgatcaga acagcagtat tcgtgttccc agggaaaaga tggtttatat gactgtcaat 901 gactctcaac aagaagatga agatttaaat gttgctgaaa ttgctgatga agtttatatg 961 gaagtgatcg taggagagga ggatgctgct gttgcagcag cagcagctgc tgtgcatgag 1021 cagcaaattg atgaggatga aatgaaaacc ttcgtaccaa ttgcatgggc agcagcttat 1081 ggtaataatt ctgatggaat tgaaaaccgg aatggcactg caagtgccct cttgcacata 1141 gatgagtctg ctggccttgg cagactggct aaacagaaac caaagaaaaa gagaagacct 1201 gattccaggc agtaccaaac agcaataatt attggccctg atggtcatcc tttgactgtc 1261 tatccttgca tgatttgtgg gaagaagttt aagtcgaggg gttttttgaa aagacacatg 1321 aaaaaccatc ctgaacacct tgccaagaag aagtaccact gtactgactg tgattacact 1381 accaataaga agataagttt acataaccac ctggagagcc acaagctgac cagcaaggca 1441 gagaaggcca ttgaatgtga tgagtgtggg aagcattttt ctcatgcagg ggctttgttt 1501 actcacaaaa tggtgcataa ggaaaaaggg gccaacaaaa tgcacaagtg taaattctgt 1561 gaatatgaga cagctgaaca ggggttattg aatcgccacc tcttggcagt ccacagcaag 1621 aactttcctc atatttgtgt ggagtgtggt aaaggtttcc gatacccgtc ggaactgaga 1681 aagcacatgc gaatccatac cggcgagaag ccataccaat gccagtactg tgaatatagg 1741 tctgcagact cttctaactt gaaaacacat ataaaaacaa agcatagtaa agagatgcca 1801 ttcaagtgtg acatttgtct tctgactttc tcagatacca aagaagtgca gcaacatact 1861 cttgtccacc aagaaagcaa aacacatcag tgtttgcatt gcgaccacaa gagttcaaac 1921 tcaagtgatt tgaaacgaca tgtaatttca gttcatacga aagactatcc tcataagtgt 1981 gagatgtgcg agaaaggctt tcacaggcct tcagaactta agaaacatgt ggctgtccac 2041 aaaggtaaaa aaatgcacca atgtagacat tgtgacttta agattgcaga cccatttgtt 2101 ctaagtcgcc atattctctc agttcacaca aaggatcttc catttaggtg taagagatgt 2161 agaaagggat ttaggcaaca aaatgagctt aaaaagcata tgaagacaca cagtggcagg 2221 aaagtatatc agtgtgagta ctgtgagtat agcactacag atgcctcagg ctttaaacgg 2281 cacgttattt ccattcatac aaaagactat cctcatcggt gtgagtactg caagaaaggc 2341 ttccgaagac cttcagaaaa gaaccagcac ataatgagac accataaaga agttggtctg 2401 ccctaacagt gtgtctacaa gcttgtaaag atgttggcct tgaagcagaa aattcatttt 2461 ttaaagccag tcttgttcac atccattact atacattgaa ttatgctgtg taaaaataga 2521 attattgctt ctagtccact tttctttaca ttttattcaa tacgctgtcc tgaatcctat 2581 tcagtttctt taatagatga gtaaaaatag caacaagcaa gttgcttata ataaaataat 2641 ttgtgattct cccgaattc //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMANTCD36 1870 bp ss-mRNA PRI 15-DEC-1989 DEFINITION Human CD36 antigen mRNA, complete cds. ACCESSION M24795 KEYWORDS cell surface antigen; cell surface receptor; erythrocyte antigen; monocyte antigen; platelet antigen. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1870) AUTHORS Oquendo,P., Hundt,E., Lawler,J. and Seed,B. TITLE CD36 directly mediates cytoadherence of Plasmodium falciparum parasitized erythrocytes JOURNAL Cell 58, 95-101 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Seed, 12-MAY-1989. FEATURES from to/span description pept 211 1629 CD36 antigen BASE COUNT 599 a 343 c 356 g 572 t ORIGIN Unreported. 1 gaaaaatcct tcttagccat tttaaagata gctttccaat gattagacga attgattctt 61 tctgtgactc atcagttcct ttcctgtaaa attcatgtct tgctgttgat ttgtgaataa 121 gaaccagagc ttgtagaaac cactttaatc atatccagga gtttgcaaga aacaggtgct 181 taacactaat tcacctcctg aacaagaaaa atgggctgtg accggaactg tgggctcatc 241 gctggggctg tcattggtgc tgtcctggct gtgtttggag gtattctaat gccagttgga 301 gacctgctta tccagaagac aattaaaaag caagttgtcc tcgaagaagg tacaattgct 361 tttaaaaatt gggttaaaac aggcacagaa gtttacagac agttttggat ctttgatgtg 421 caaaatccac aggaagtgat gatgaacagc agcaacattc aagttaagca aagaggtcct 481 tatacgtaca gagttcgttt tctagccaag gaaaatgtaa cccaggacgc tgaggacaac 541 acagtctctt tcctgcagcc caatggtgcc atcttcgaac cttcactatc agttggaaca 601 gaggctgaca acttcacagt tctcaatctg gctgtggcag ctgcatccca tatctatcaa 661 aatcaatttg ttcaaatgat cctcaattca cttattaaca agtcaaaatc ttctatgttc 721 caagtcagaa ctttgagaga actgttatgg ggctataggg atccattttt gagtttggtt 781 ccgtaccctg ttactaccac agttggtctg ttttatcctt acaacaatac tgcagatgga 841 gtttataaag ttttcaatgg aaaagataac ataagtaaag ttgccataat cgacacatat 901 aaaggtaaaa ggaatctgtc ctattgggaa agtcactgcg acatgattaa tggtacagat 961 gcagcctcat ttccaccttt tgttgagaaa agccaggtat tgcagttctt ttcttctgat 1021 atttgcaggt caatctatgc tgtatttgaa tccgacgtta atctgaaagg aatccctgtg 1081 tatagatttg ttcttccatc caaggccttt gcctctccag ttgaaaaccc agacaactat 1141 tgtttctgca cagaaaaaat tatctcaaaa aattgtacat catatggtgt gctagacatc 1201 agcaaatgca aagaagggag acctgtgtac atttcacttc ctcattttct gtatgcaagt 1261 cctgatgttt cagaacctat tgatggatta aacccaaatg aagaagaaca taggacatac 1321 ttggatattg aacctataac tggattcact ttacaatttg caaaacggct gcaggtcaac 1381 ctattggtca agccatcaga aaaaattcaa gtattaaaga atctgaagag gaactatatt 1441 gtgcctattc tttggcttaa tgagactggg accattggtg atgagaaggc aaacatgttc 1501 agaagtcaag taactggaaa aataaacctc cttggcctga tagaaatgat cttactcagt 1561 gttggtgtgg tgatgtttgt tgcttttatg atttcatatt gtgcatgcag atcgaaaaca 1621 ataaaataag tatgtaccaa aaaatattgc ttcaataata ttagcttata tattacttgt 1681 tttcacttta tcaaagagaa gttacatatt aggccatata tatttctaga catgtctagc 1741 cactgatcat ttttaaatat aggtaaataa acctataaat attatcacgc agatcactaa 1801 agtatatctt taattctggg agaaatgaga taaaagatgt acttgtgacc attgtaacaa 1861 tagcacaaat // LOCUS HUMARB3A 1270 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human beta-3-adrenergic receptor gene. ACCESSION M29932 KEYWORDS adrenergic receptor; beta-3-adrenergic receptor. SOURCE Human placenta DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1270) AUTHORS Emorine,L.J., Marullo,S., Briend-Sutren,M.-M., Patey,G., Tate,K., Delavier-Klutchko,C. and Strosberg,A.D. TITLE Molecular characterization of the human beta-3-adrenergic receptor JOURNAL Science 245, 1118-1121 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.J.Emorine 17-NOV-1989. FEATURES from to/span description pept 38 1246 beta-3-adrenergic receptor BASE COUNT 132 a 484 c 405 g 249 t ORIGIN 33 bp upstream of SmaI site. 1 ttccttcttt ccctaccgcc ccacgcgcga cccggggatg gctccgtggc ctcacgagaa 61 cagctctctt gccccatggc cggacctccc caccctggcg cccaataccg ccaacaccag 121 tgggctgcca ggggttccgt gggaggcggc cctagccggg gccctgctgg cgctggcggt 181 gctggccacc gtgggaggca acctgctggt catcgtggcc atcgcctgga ctccgagact 241 ccagaccatg accaacgtgt tcgtgacttc gctggccgca gccgacctgg tgatgggact 301 cctggtggtg ccgccggcgg ccaccttggc gctgactggc cactggccgt tgggcgccac 361 tggctgcgag ctgtggacct cggtggacgt gctgtgtgtg accgccagca tcgaaaccct 421 gtgcgccctg gccgtggacc gctacctggc tgtgaccaac ccgctgcgtt acggcgcact 481 ggtcaccaag cgctgcgccc ggacagctgt ggtcctggtg tgggtcgtgt cggccgcggt 541 gtcgtttgcg cccatcatga gccagtggtg gcgcgtaggg gccgacgccg aggcgcagcg 601 ctgccactcc aacccgcgct gctgtgcctt cgcctccaac atgccctacg tgctgctgtc 661 ctcctccgtc tccttctacc ttcctcttct cgtgatgctc ttcgtctacg cgcgggtttt 721 cgtggtggct acgcgccagc tgcgcttgct gcgcggggag ctgggccgct ttccgcccga 781 ggagtctccg ccggcgccgt cgcgctctct ggccccggcc ccggtgggga cgtgcgctcc 841 gcccgaaggg gtgcccgcct gcggccggcg gcccgcgcgc ctcctgcctc tccgggaaca 901 ccgggccctg tgcaccttgg gtctcatcat gggcaccttc actctctgct ggttgccctt 961 ctttctggcc aacgtgctgc gcgccctggg gggcccctct ctagtcccgg gcccggcttt 1021 ccttgccctg aactggctag gttatgccaa ttctgccttc aacccgctca tctactgccg 1081 cagcccggac tttcgcagcg ccttccgccg tcttctgtgc cgctgcggcc gtcgcctgcc 1141 tccggagccc tgcgccgccg cccgcccggc cctcttcccc tcgggcgttc ctgcggcccg 1201 gagcagccca gcgcagccca ggctttgcca acggctcgac gggtaggtaa ccggggcaga 1261 gggaccggcg // LOCUS HUMCY4ARO 2736 bp ss-mRNA PRI 15-SEP-1989 DEFINITION Human aromatase system cytochrome P-450 (P450XIX) mRNA, complete cds. ACCESSION J04127 KEYWORDS aromatase cytochrome P-450; microsomal protein. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2736) AUTHORS Corbin,C.J., Graham-Lorence,S., McPhaul,M., Mason,J.I., Mendelson,C.R. and Simpson,E.R. TITLE Isolation of a full-length cDNA insert encoding human aromatase system cytochrome P-450 and its expression in nonsteroidogenic cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 8948-8952 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by E.R.Simpson, 03-MAR-1989. FEATURES from to/span description pept 28 1539 cytochrome P-450-Arom /nomgen="CYP19" /map="15q21" /hgml_locus_uid="LL0035S" mRNA < 1 2385 CYP19 mRNA (alt.) mRNA < 1 2736 CYP19 mRNA (alt.) BASE COUNT 801 a 599 c 591 g 745 t ORIGIN Chromosome 15q21. 1 gccccctctg aggtcaagga acacaagatg gttttggaaa tgctgaaccc gatacattat 61 aacatcacca gcatcgtgcc tgaagccatg cctgctgcca ccatgccagt cctgctcctc 121 actggccttt ttctcttggt gtggaattat gagggcacat cctcaatacc aggtcctggc 181 tactgcatgg gaattggacc cctcatctcc cacggcagat tcctgtggat ggggatcggc 241 agtgcctgca actactacaa ccgggtgtat ggagaattca tgcgagtctg gatctctgga 301 gaggaaacac tcattatcag caagtcctca agtatgttcc acataatgaa gcacaatcat 361 tacagctctc gattcggcag caaacttggg ctgcagtgca tcggtatgca tgagaaaggc 421 atcatattta acaacaatcc agagctctgg aaaacaactc gacccttctt tatgaaagct 481 ctgtcaggcc ccggccttgt tcgtatggtc acagtctgtg ctgaatccct caaaacacat 541 ctggacaggt tggaggaggt gaccaatgaa tcgggctatg tggacgtgtt gacccttctg 601 cgtcgtgtca tgctggacac ctctaacacg ctcttcttga ggatcccttt ggacgaaagt 661 gctatcgtgg ttaaaatcca aggttatttt gatgcatggc aagctctcct catcaaacca 721 gacatcttct ttaagatttc ttggctatac aaaaagtatg agaagtctgt caaggatttg 781 aaagatgcca tagaagttct gatagcagaa aaaagatgca ggatttccac agaagagaaa 841 ctggaagaat gtatggactt tgccactgag ttgattttag cagagaaacg tggtgacctg 901 acaagagaga atgtgaacca gtgcatattg gaaatgctga tcgcagctcc tgacaccatg 961 tctgtctctt tgttcttcat gctatttctc attgcaaagc accctaatgt tgaagaggca 1021 ataataaagg aaatccagac tgttattggt gagagagaca taaagattga tgatatacaa 1081 aaattaaaag tgatggaaaa cttcatttat gagagcatgc ggtaccagcc tgtcgtggac 1141 ttggtcatgc gcaaagcctt agaagatgat gtaatcgatg gctacccagt gaaaaagggg 1201 acaaacatta tcctgaatat tggaaggatg cacagactcg agtttttccc caaacccaat 1261 gaatttactc ttgaaaattt tgcaaagaat gttccttata ggtactttca gccatttggc 1321 tttgggcccc gtggctgtgc aggaaagtac atcgccatgg tgatgatgaa agccatcctc 1381 gttacacttc tgagacgatt ccacgtgaag acattgcaag gacagtgtgt tgagagcata 1441 cagaagatac acgacttgtc cttgcaccca gatgagacta aaaacatgct ggaaatgatc 1501 tttaccccaa gaaactcaga caggtgtctg gaacactaga gaaggctggt cagtacccac 1561 tctggagcat tttctcatca gtagttcaca tacaaatcat ccatccttgc caatagtgtc 1621 atcctcacag tgaacactca gtggcccatg gaattttata ggcatacctc ctatggttgt 1681 caccaagcta ggtgctattg gtcatctgct cctgttcaca ccagagaacc aggctacaag 1741 agaaaaagca gaggccaaga gtttgagggg agaaatagtc ggtgaagaaa ccgtatccat 1801 aaagacccga ttccaccaaa tgtgctttga gaaggatagg ccttcattaa caaaatgtat 1861 gtctggttcc ccagtagagc tctactgcct caaccaaggg gatttttatg tctggggcag 1921 aaacactcaa gttgattaga aagaccaggc caatgtcagg gtacctgggg ccaaacccac 1981 ctgctagtgt gaattaaagt actttaattt tgttttctgt ggaggtggaa aagcaacatt 2041 catagtcttt ggagaaatgc ttagaaattc agcatttgac ccttgctgtg aattaagccc 2101 aattaattcc tgtttgtcta catatgatct gtctgtggca aaagtttaat cagaggaaat 2161 tctttcccag tctgtcgatt tatgcctcag ccacttgcct gtgctacaat tcattgtgtt 2221 acctgtagat tcaggtaata caaaccatat ataatcatca agtaatacaa actaatttag 2281 taatagcctg ggttaagtat tattagggcc ctgtgtctgc atgtagaaaa aaaaattcac 2341 atgatgcact tcaaattcaa ataaaaatcc ttttggcatg ttcccatttt tgcttagctc 2401 aattagtgtg gctaaccaag agataactgt aaatgtgaca ttgatttgct cttactacag 2461 ctacagtgat tgggggagga aaagtcccaa cccaatgggc tcaaacttct aaggggtact 2521 cctctcatcc ccttatcctt ctccctcgac attttctccc tctttcttcc catgacccca 2581 aagccaaggg caacagatca gtaaagaacg tggtcagagt agaacccctg aagtattttt 2641 taatcctacc tcaaaattta acagttacct gagagattta acattatcta gttcattgaa 2701 tcattgtatg tggtcatgga taaattgcac acttgg // LOCUS HUMFCRII 1403 bp ss-mRNA PRI 30-SEP-1988 DEFINITION Human FcRII mRNA for immunoglobulin G receptor. ACCESSION Y00644 KEYWORDS IgG receptor; cell surface glycoprotein; immunoglobulin receptor. SOURCE human (Homo sapiens). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1403) AUTHORS Moore,K.W. and Martens,C.L. JOURNAL Unpublished (1987) DNAX, Palo Alto, CA, 94304 USA. STANDARD simple automatic REFERENCE 2 (bases 1 to 1403) AUTHORS Stuart,S.G. TITLE Isolation and Expression of cDNA clones encoding a JOURNAL J. Exp. Med. 166, 1668-1684 (1987) STANDARD simple automatic COMMENT *source: cell line=K937; library=pcD vector; clone=16.2; EMBL features not translated to GenBank features: key from to description SITE 308 316 N-glycosylation site SITE 551 559 N-glycosylation site SITE 656 742 transmembrane domain FEATURES from to/span description pept 20 973 precursor polypeptide (AA -34 to 287) sigp 20 121 signal peptide (AA -34 to -1) matp 122 970 mature IgG receptor (AA 1-287) BASE COUNT 410 a 372 c 305 g 316 t ORIGIN 1 ggggggggac agtgctggga tgactatgga gacccaaatg tctcagaatg tatgtcccag 61 aaacctgtgg ctgcttcaac cattgacagt tttgctgctg ctggcttctg cagacagtca 121 agctgcagct cccccaaagg ctgtgctgaa acttgagccc ccgtggatca acgtgctcca 181 ggaggactct gtgactctga catgccaggg ggctcgcagc cctgagagcg actccattca 241 gtggttccac aatgggaatc tcattcccac ccacacgcag cccagctaca ggttcaaggc 301 caacaacaat gacagcgggg agtacacgtg ccagactggc cagaccagcc tcagcgaccc 361 tgtgcatctg actgtgcttt ccgaatggct ggtgctccag acccctcacc tggagttcca 421 ggagggagaa accatcatgc tgaggtgcca cagctggaag gacaagcctc tggtcaaggt 481 cacattcttc cagaatggaa aatcccagaa attctcccgt ttggatccca ccttctccat 541 cccacaagca aaccacagtc acagtggtga ttaccactgc acaggaaaca taggctacac 601 gctgttctca tccaagcctg tgaccatcac tgtccaagtg cccagcatgg gcagctcttc 661 accaatgggg atcattgtgg ctgtggtcat tgcgactgct gtagcagcca ttgttgctgc 721 tgtagtggcc ttgatctact gcaggaaaaa gcggatttca gccaattcca ctgatcctgt 781 gaaggctgcc caatttgagc cacctggacg tcaaatgatt gccatcagaa agagacaact 841 tgaagaaacc aacaatgact atgaaacagc tgacggcggc tacatgactc tgaaccccag 901 ggcacctact gacgatgata aaaacatcta cctgactctt cctcccaacg accatgtcaa 961 cagtaataac taaagagtaa cgttatgcca tgtggtcata ctctcagctt gcgtatggat 1021 gcaaaaaaga ggggaattgt taaaggaaaa tttaaatgga gactggaaaa atcctgagca 1081 aacaaaacca cctggccctt agaaatagct ttaactttgc ttaaactaca aacacaagca 1141 aaacttcacg gggtcatact acatacaagc ataagcaaaa cttaacttgg atcatttctg 1201 gtaaatgctt atgttagaaa taagacaacc ccagccaatc acaagcagcc tactaacata 1261 taattaggtg actagggact ttctaagaag atacctaccc ccaaaaaaca acttatgtaa 1321 ttgaaaacca accgattgcc tttattttgc ttccacattt tcccaataaa tacttgcctg 1381 tgacattttg ccactggaac act // LOCUS HUMGNAT1 1292 bp ss-mRNA PRI 15-MAR-1990 DEFINITION Human GNAT1 mRNA for transducin alpha-chain. ACCESSION X15088 KEYWORDS guanine nucleotide-binding protein; transducin; transducin alpha-chain. SOURCE human (Homo sapiens). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1292; enum. 1 to 1292) AUTHORS Van Dop,C. TITLE ; JOURNAL Unpublished (1989) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 1292) AUTHORS Van Dop,C., Medynski,D.C. and Apone,L.M. TITLE Nucleotide sequence for a cDNA encoding the alpha subunit of retinal transducin (GNAT1) isolated from the human eye JOURNAL Nucleic Acids Res. 17, 4887-4887 (1989) STANDARD simple automatic COMMENT *source: tissue=eye; clone=UO50; EMBL features not translated to GenBank features: key from to description SITE 1253 1258 pot. polyA site [1] Author address Van Dop C., Howard Hughes Medical Institute, Children's Hospital Enders 1107, 300 Longwood Avenue, Boston MA 02115, U S A. Submitted (21-APR-1989) on tape to the EMBL data library FEATURES from to/span description pept < 1 1156 transducin alpha-chain (AA at 2) /hgml_locus_uid="LE0116J" /map="3" /nomgen="GNAT1" BASE COUNT 311 a 387 c 362 g 232 t ORIGIN 1 aggtcctcct gggccagaag ggttcctggg agccaggttc tgggatcccc tccatccaga 61 agaaccacct gctcactctg tcccttcgcc tgctgctggg accatggggg ctggggccag 121 tgctgaggag aagcactcca gggagctgga aaagaagctg aaagaggacg ctgagaagga 181 tgctcgaacc gtgaagctgc tgcttctggg tgccggtgag tccgggaaga gcaccatcgt 241 caagcagatg aagattatcc accaggacgg gtactcgctg gaagagtgcc tcgagtttat 301 cgccatcatc tacggcaaca cgttgcagtc catcctggcc atcgtacgcg ccatgaccac 361 actcaacatc cagtacggag actctgcacg ccaggacgac gcccggaagc tgatgcacat 421 ggcagacact atcgaggagg gcacgatgcc caaggagatg tcggacatca tccagcggct 481 gtggaaggac tccggtatcc aggcctgttt tgagcgcgcc tcggagtacc agctcaacga 541 ctcggcgggc tactacctct ccgacctgga gcgcctggta accccgggct acgtgcccac 601 cgagcaggac gtgctgcgct cgcgagtcaa gaccactggc atcatcgaga cgcagttctc 661 cttcaaggat ctcaacttcc ggatgttcga tgtgggcggg cagcgctcgg agccgaagaa 721 gtggatccac tgcttcgagg gcgtgacctg catcatcttc atcgcggcgc tgaccgcgta 781 cgacatggtg ctagtggagg acgacgaagt gaaccgcatg cacgagagcc tgcacctgtt 841 caacagcatc tgcaaccacc gctacttcgc cacgacgtcc atcgtgctct tccttaacaa 901 gaaggacgtc ttcttcgaga aggtcaagaa ggcgcacctc agcatctgtt tcccggacta 961 cgatggaccc aacacctacg aggacgccgg caactacatc aaggtgcagt tcctcgagct 1021 caacatgcgg cgcgacgtga aggagatcta ttcccacatg acgtgcgcca ccgacacgca 1081 gaacgtcaaa ttctgcttcg acgctgtcac cgacatcatc atcaaggaga acctcaaaga 1141 ctgtggcctc ttctgagcca gggcctgtgc tgcagtcggg gacaaggagc ttccgtctgg 1201 caaggccggg gcacaatttg cactcccctc agctagacgc agcagactca gcaataaacc 1261 tttgcatcag gcaaaaaaaa aaaaacaaaa aa // LOCUS HUMGSHPXG 1733 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human glutathione peroxidase gene, complete cds. ACCESSION Y00483 KEYWORDS glutathione peroxidase. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1733) AUTHORS Ishida,K. JOURNAL Unpublished (1987) Nippon Kayaku Co.,Ltd, Kita-ku, Tokyo 115, Japan STANDARD simple automatic REFERENCE 2 (bases 1 to 1733) AUTHORS Ishida,K., Morino,T., Takagi,K. and Sukenaga,Y. TITLE Nucleotide sequence of a human gene for glutathione peroxidase JOURNAL Nucleic Acids Res. 15, 10051-10051 (1987) STANDARD simple automatic COMMENT An in-phase "tga" codon is located at positions 612-614 and encodes selenocysteine. FEATURES from to/span description pept 474 611 glutathione peroxidase, exon 1 (EC 1.11.1.9). /nomgen="GPX1" /map="3q11-q12" /hgml-locus_uid="LE0062E" 615 719 glutathione peroxidase, exon 1 998 1357 glutathione peroxidase, exon 2 pre-msg 155 1566 GTP mRNA and intron IVS 720 997 GTP intron BASE COUNT 277 a 529 c 526 g 401 t ORIGIN 1 aacctagatc cctctgctgt cccctgcact gccggtaaca tggcacagca gagcagggtt 61 gtttgtgcac gggcagctcc tgcagctgct gccgtcgccc accagcctcc tatgccaaac 121 cccacatcct aactcaggaa cctctgagaa aaaacggagc cctcgagggg cccagccttg 181 gaagggtaac tggaccgctg ccgcctggtt gcctgggcca gaccagacat gcctgctgct 241 ccttccggct taggaggagc acgcgtcccg ctcgcgcgca ctctccagcc ttttcctggc 301 tgaggagggg ccgagcctcc ggtagggcgg gggccggatg aggcgggacc tcaggcccgg 361 aaaactgcct gtgccacgtg acccgccgcc ggccagttaa aaggaggcgc ctgctggcct 421 ccccttacag tgcttgttcg gggcgctccg ctggcttctt ggacaattgc gccatgtgtg 481 ctgctcggct agcggcggcg gcggcccagt cggtgtatgc cttctcggcg cgcccgttgg 541 ccggcgggga gcctgtgagc ctgggctccc tgcggggcaa ggtactactt atcgagaatg 601 tggcgtccct ctgaggcacc acggtccggg actacaccca gatgaacgag ctgcagcggc 661 gcctcggacc ccggggcctg gtggtgctcg gcttcccgtg caaccagttt gggcatcagg 721 tgcgccgggc ggagcgggac gggacggggg cggacgtgca gtagtggctg ggggcgccgg 781 cggtgtggtg gtgggtgcgt cggctccatg cgcggagagt ctggctactc tctcgtttcc 841 tttctgttgc tcgtagctgc tgaaattcct ctccgccctt gggattgcgc atggagggaa 901 aaatcccggt gactcataga aaatctcccc tgtttgtggt tagaacgttt ctctcctcct 961 cttgaccccg ggttctagct gcccttctct cctgtaggag aacgccaaga acgaagagat 1021 tctgaattcc ctcaagtacg tccggcctgg tggtgggttc gagcccaact tcatgctctt 1081 cgagaagtgc gaggtgaacg gtgcgggggc gcaccctctc ttcgccttcc tgcgggaggc 1141 cctgccagct cccagcgacg acgccaccgc gcttatgacc gaccccaagc tcatcacctg 1201 gtctccggtg tgtcgcaacg atgttgcctg gaactttgag aagttcctgg tgggccctga 1261 cggtgtgccc ctacgcaggt acagccgccg cttccagacc attgacatcg agcctgacat 1321 cgaagccctg ctgtctcaag ggcccagctg tgcctagggc gcccctccta ccccggctgc 1381 ttggcagttg cagtgctgct gtctcggggg ggttttcatc tatgagggtg tttcctctaa 1441 acctacgagg gaggaacacc ttgatcttac agaaaatacc acctcgagat gggtgctggt 1501 cctgttgatc ccagtctctg ccagaccaag gctagtttcc ccactaataa agtgccgggt 1561 gtcagcagac tgtgtgtatg tcctgtgtca ttgtcatttg ggaattcttt ttcttttctt 1621 tttttttttt tttttttgag acggagtttt ttgctctatt gcccaggctt gagtgcagtg 1681 gcgcaatcta ggctcactgc aagctccgcc tcccgggttc agcatttctg cta // LOCUS HUMHBB3E 213 bp ss-mRNA PRI 15-MAR-1990 DEFINITION Human beta-globin mRNA, 3' end. ACCESSION M11428 KEYWORDS beta-globin; globin. SOURCE Human cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 213) AUTHORS Lang,K.M. and Spritz,R.A. TITLE Cloning specific complete polyadenylylated 3'-terminal cDNA segments JOURNAL Gene 33, 191-196 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 81 beta globin /nomgen="HBB" /map="11p11.5" /hgml_locus_uid="LQ0007V" BASE COUNT 53 a 50 c 45 g 65 t ORIGIN 1 gaattcaccc caccagtgca ggctgcctat cagaaagtgg tggctggtgt ggctaatgcc 61 ctggcccaca agtatcacta agctcgcttt cttgctgtcc aatttctatt aaaggttcct 121 ttgttccgta agtccaacta ctaaactggg ggatattatg aagggccttg agcatctgga 181 ttctgcctaa taaaaaacat ttattttcat tgc // LOCUS HUMHK1A 4134 bp ss-mRNA PRI 15-DEC-1989 DEFINITION Human calcium-ATPase (HK1) mRNA, complete cds. ACCESSION M23114 J04025 KEYWORDS ATPase; Ca2+ ATPase; alternative splicing; calcium-ATPase. SOURCE Human kidney cortex, cDNA to mRNA, clones lambda HK[1a,1b]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4134) AUTHORS Lytton,J. and MacLennan,D.H. TITLE Molecular cloning of cDNAs from human kidney coding for two alternatively spliced products of the cardiac Ca2+-ATPase gene JOURNAL J. Biol. Chem. 263, 15024-15031 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Lytton, 14-MAR-1989. Two alternative splicing products, HK1 and HK2, are realized in human kidney cDNAs. HK2 codes for a protein identical to rabbit cardiac muscle Ca2+ ATPase, with the exception of 6 scattered amino acid replacements, whereas HK1 codes for a protein identical to that encoded by HK2, but with the carboxyl-terminal 4 amino acids replaced by an extended sequence of 49 amino acids. See accession M23115 and J04703. FEATURES from to/span description pept 164 3292 calcium-ATPase (EC 3.6.1.3) /nomgen="ATP2B" /map="12" /hgml_locus_uid="LP0123P" mRNA 1 4134 calcium-ATPase mRNA site 3143 3144 alternative splice site BASE COUNT 1056 a 938 c 1012 g 1128 t ORIGIN 254 bp upstream of HindIII site. 1 gggtgattca gcgcccggcg aggcggaacg ggccgcaaga ggaggagggg agagcccgtc 61 cgcgcctggg ctcccggggt ggcacgagcc cgcggccgga gtgcgaggcg gaggcgagga 121 ggccgcgggg acgggaggcg aggccggccg ggcccccgaa gccatggaga acgcgcacac 181 caagacggtg gaggaggtgc tgggccactt cggcgtcaac gagagtacgg ggctgagcct 241 ggaacaggtc aagaagctta aggagagatg gggctccaac gagttaccgg ctgaagaagg 301 aaaaaccttg ctggaacttg tgattgagca gtttgaagac ttgctagtta ggattttatt 361 actggcagca tgtatatctt ttgttttggc ttggtttgaa gaaggtgaag aaacaattac 421 agcctttgta gaaccttttg taattttact catattagta gccaatgcaa ttgtgggtgt 481 atggcaggaa agaaatgctg aaaatgccat cgaagccctt aaggaatatg agcctgaaat 541 gggcaaagtg tatcgacagg acagaaagag tgtgcagcgg attaaagcta aagacatagt 601 tcctggtgat attgtagaaa ttgctgttgg tgacaaagtt cctgctgata taaggttaac 661 ttccatcaaa tctaccacac taagagttga ccagtcaatt ctcacaggtg aatctgtctc 721 tgtcatcaag cacactgatc ccgtccctga cccacgagct gtcaaccaag ataaaaagaa 781 catgctgttt tctggtacaa acattgctgc tgggaaagct atgggagtgg tggtagcaac 841 tggagttaac accgaaattg gcaagatccg ggatgaaatg gtggcaacag aacaggagag 901 aacacccctt cagcaaaaac tagatgaatt tggggaacag ctttccaaag tcatctccct 961 tatttgcatt gcagtctgga tcataaatat tgggcacttc aatgacccgg ttcatggagg 1021 gtcctggatc agaggtgcta tttactactt taaaattgca gtggccctgg ctgtagcagc 1081 cattcctgaa ggtctgcctg cagtcatcac cacctgcctg gctcttggaa ctcgcagaat 1141 ggcaaagaaa aatgccattg ttcgaagcct cccgtctgtg gaaacccttg gttgtacttc 1201 tgttatctgc tcagacaaga ctggtacact tacaacaaac cagatgtcag tctgcaggat 1261 gttcattctg gacagagtgg aaggtgatac ttgttccctt aatgagttta ccataactgg 1321 atcaacttat gcacctattg gagaagtgca taaagatgat aaaccagtga attgtcacca 1381 gtatgatggt ctggtagaat tagcaacaat ttgtgctctt tgtaatgact ctgctttgga 1441 ttacaatgag gcaaagggtg tgtatgaaaa agttggagaa gctacagaga ctgctctcac 1501 ttgcctagta gagaagatga atgtatttga taccgaattg aagggtcttt ctaaaataga 1561 acgtgcaaat gcctgcaact cagtcattaa acagctgatg aaaaaggaat tcactctaga 1621 gttttcacgt gacagaaagt caatgtcggt ttactgtaca ccaaataaac caagcaggac 1681 atcaatgagc aagatgtttg tgaagggtgc tcctgaaggt gtcattgaca ggtgcaccca 1741 cattcgagtt ggaagtacta aggttcctat gacctctgga gtcaaacaga agatcatgtc 1801 tgtcattcga gagtggggta gtggcagcga cacactgcga tgcctggccc tggccactca 1861 tgacaaccca ctgagaagag aagaaatgca ccttgaggac tctgccaact ttattaaata 1921 tgagaccaat ctgaccttcg ttggctgcgt gggcatgctg gatcctccga gaatcgaggt 1981 ggcctcctcc gtgaagctgt gccggcaagc aggcatccgg gtcatcatga tcactgggga 2041 caacaagggc actgctgtgg ccatctgtcg ccgcatcggc atcttcgggc aggatgagga 2101 cgtgacgtca aaagctttca caggccggga gtttgatgaa ctcaacccct ccgcccagcg 2161 agacgcctgc ctgaacgccc gctgttttgc tcgagttgaa ccctcccaca agtctaaaat 2221 cgtagaattt cttcagtctt ttgatgagat tacagctatg actggcgatg gcgtgaacga 2281 tgctcctgct ctgaagaaag ccgagattgg cattgctatg ggctctggca ctgcggtggc 2341 taaaaccgcc tctgagatgg tcctggcgga tgacaacttc tccaccattg tggctgccgt 2401 tgaggagggg cgggcaatct acaacaacat gaaacagttc atccgctacc tcatctcgtc 2461 caacgtcggg gaagttgtct gtattttcct gacagcagcc cttggatttc ccgaggcttt 2521 gattcctgtt cagctgctct gggtcaatct ggtgacagat ggcctgcctg ccactgcact 2581 ggggttcaac cctcctgatc tggacatcat gaataaacct ccccggaacc caaaggaacc 2641 attgatcagc gggtggctct ttttccgtta cttggctatt ggctgttacg tcggcgctgc 2701 taccgtgggt gctgctgcat ggtggttcat tgctgctgac ggtggtccaa gagtgtcctt 2761 ctaccagctg agtcatttcc tacagtgtaa agaggacaac ccggactttg aaggcgtgga 2821 ttgtgcaatc tttgaatccc catacccgat gacaatggcg ctctctgttc tagtaactat 2881 agaaatgtgt aacgccctca acagcttgtc cgaaaaccag tccttgctga ggatgccccc 2941 ctgggagaac atctggctcg tgggctccat ctgcctgtcc atgtcactcc acttcctgat 3001 cctctatgtc gaacccttgc cactcatctt ccagatcaca ccgctgaacg tgacccagtg 3061 gctgatggtg ctgaaaatct ccttgcccgt gattctcatg gatgagacgc tcaagtttgt 3121 ggcccgcaac tacctggaac ctggtaaaga gtgtgtgcag cctgccacca aatcctgctc 3181 gttctcggca tgcaccgatg ggatttcctg gccgtttgtg ctgctcataa tgcccctggt 3241 gatctgggtc tatagcacag acactaactt tagcgatatg ttctggtctt gactgacagt 3301 tttccataaa gaagatgttt aacttaatca attaattttt ttattgttta aagcaactgt 3361 ctatttctgc tgaattttca catgaacata ctggctggtg atggaggttt catactctag 3421 attttgtttt gctttttctg actccagtgg ggcaagattt tcctttttta tacacataat 3481 taaagtgtcc attgacatgt acagagaact aacactattt tatgcaaata tttttttgta 3541 gatgaaaaag catgtacagt gttctgttta atactcatcc ttgtataaaa aaaatagttg 3601 agccagcaga cattgtcagc aaattaattg gcagcagatt ttaggaaatg aatgtgtgtg 3661 gttttttttc taaaactaaa tagcatgtat tgtgtctttt gcatgatgat ccggatttaa 3721 tttgatatca cagtctaatt tttattcata agccaatttt tctgcactga gcagagtctt 3781 gctacctcag tcagtattgt tttggtttgc tacttccctc acccactttg gcctccgttc 3841 accccacccc accccacctc tccccacctt acccccgccc cgcttggctt cttctttagg 3901 attgtgatgg ttcgttctgt ttacatcagt tttaacgaga ggtatgcctg tactcgcttg 3961 tgcagaaaac attgttccag attcaatcga ctgggtttat gtcccttcac atagttttta 4021 aggttattta tttaaatgtc taatgtattt tattgtaaca gacattgttt tgccaacatt 4081 gcctatttca gtggcacgtc atctagtttt aaaaaaataa aacattttaa aaag // LOCUS HUMHK2A 3781 bp ss-mRNA PRI 15-DEC-1989 DEFINITION Human calcium-ATPase (HK2) mRNA, complete cds. ACCESSION M23115 J04025 KEYWORDS ATPase; Ca2+ ATPase; alternative splicing; calcium-ATPase. SOURCE Human kidney cortex, cDNA to mRNA, clones lambda HK2[a,b]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3781) AUTHORS Lytton,J. and MacLennan,D.H. TITLE Molecular cloning of cDNAs from human kidney coding for two alternatively spliced products of the cardiac Ca2+-ATPase gene JOURNAL J. Biol. Chem. 263, 15024-15031 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Lytton, 14-MAR-1989. Two alternative splicing products, HK1 and HK2, are realized in human kidney cDNAs. HK2 codes for a protein identical to rabbit cardiac muscle Ca2+ ATPase, with the exception of 6 scattered amino acid replacements, whereas HK1 codes for a protein identical to that encoded by HK2, but with the carboxyl-terminal 4 amino acids replaced by an extended sequence of 49 amino acids. See accession M23114 and J04703. FEATURES from to/span description pept 4 2997 calcium-ATPase (EC 3.6.1.3) /nomgen="ATP2B" /map="12" /hgml_locus_uid="LP0123P" mRNA 1 3781 calcium-ATPase mRNA site 2983 2984 alternative splice site BASE COUNT 1018 a 829 c 913 g 1021 t ORIGIN 94 bp upstream of HindIII site. 1 gccatggaga acgcgcacac caagacggtg gaggaggtgc tgggccactt cggcgtcaac 61 gagagtacgg ggctgagcct ggaacaggtc aagaagctta aggagagatg gggctccaac 121 gagttaccgg ctgaagaagg aaaaaccttg ctggaacttg tgattgagca gtttgaagac 181 ttgctagtta ggattttatt actggcagca tgtatatctt ttgttttggc ttggtttgaa 241 gaaggtgaag aaacaattac agcctttgta gaaccttttg taattttact catattagta 301 gccaatgcaa ttgtgggtgt atggcaggaa agaaatgctg aaaatgccat cgaagccctt 361 aaggaatatg agcctgaaat gggcaaagtg tatcgacagg acagaaagag tgtgcagcgg 421 attaaagcta aagacatagt tcctggtgat attgtagaaa ttgctgttgg tgacaaagtt 481 cctgctgata taaggttaac ttccatcaaa tctaccacac taagagttga ccagtcaatt 541 ctcacaggtg aatctgtctc tgtcatcaag cacactgatc ccgtccctga cccacgagct 601 gtcaaccaag ataaaaagaa catgctgttt tctggtacaa acattgctgc tgggaaagct 661 atgggagtgg tggtagcaac tggagttaac accgaaattg gcaagatccg ggatgaaatg 721 gtggcaacag aacaggagag aacacccctt cagcaaaaac tagatgaatt tggggaacag 781 ctttccaaag tcatctccct tatttgcatt gcagtctgga tcataaatat tgggcacttc 841 aatgacccgg ttcatggagg gtcctggatc agaggtgcta tttactactt taaaattgca 901 gtggccctgg ctgtagcagc cattcctgaa ggtctgcctg cagtcatcac cacctgcctg 961 gctcttggaa ctcgcagaat ggcaaagaaa aatgccattg ttcgaagcct cccgtctgtg 1021 gaaacccttg gttgtacttc tgttatctgc tcagacaaga ctggtacact tacaacaaac 1081 cagatgtcag tctgcaggat gttcattctg gacagagtgg aaggtgatac ttgttccctt 1141 aatgagttta ccataactgg atcaacttat gcacctattg gagaagtgca taaagatgat 1201 aaaccagtga attgtcacca gtatgatggt ctggtagaat tagcaacaat ttgtgctctt 1261 tgtaatgact ctgctttgga ttacaatgag gcaaagggtg tgtatgaaaa agttggagaa 1321 gctacagaga ctgctctcac ttgcctagta gagaagatga atgtatttga taccgaattg 1381 aagggtcttt ctaaaataga acgtgcaaat gcctgcaact cagtcattaa acagctgatg 1441 aaaaaggaat tcactctaga gttttcacgt gacagaaagt caatgtcggt ttactgtaca 1501 ccaaataaac caagcaggac atcaatgagc aagatgtttg tgaagggtgc tcctgaaggt 1561 gtcattgaca ggtgcaccca cattcgagtt ggaagtacta aggttcctat gacctctgga 1621 gtcaaacaga agatcatgtc tgtcattcga gagtggggta gtggcagcga cacactgcga 1681 tgcctggccc tggccactca tgacaaccca ctgagaagag aagaaatgca ccttgaggac 1741 tctgccaact ttattaaata tgagaccaat ctgaccttcg ttggctgcgt gggcatgctg 1801 gatcctccga gaatcgaggt ggcctcctcc gtgaagctgt gccggcaagc aggcatccgg 1861 gtcatcatga tcactgggga caacaagggc actgctgtgg ccatctgtcg ccgcatcggc 1921 atcttcgggc aggatgagga cgtgacgtca aaagctttca caggccggga gtttgatgaa 1981 ctcaacccct ccgcccagcg agacgcctgc ctgaacgccc gctgttttgc tcgagttgaa 2041 ccctcccaca agtctaaaat cgtagaattt cttcagtctt ttgatgagat tacagctatg 2101 actggcgatg gcgtgaacga tgctcctgct ctgaagaaag ccgagattgg cattgctatg 2161 ggctctggca ctgcggtggc taaaaccgcc tctgagatgg tcctggcgga tgacaacttc 2221 tccaccattg tggctgccgt tgaggagggg cgggcaatct acaacaacat gaaacagttc 2281 atccgctacc tcatctcgtc caacgtcggg gaagttgtct gtattttcct gacagcagcc 2341 cttggatttc ccgaggcttt gattcctgtt cagctgctct gggtcaatct ggtgacagat 2401 ggcctgcctg ccactgcact ggggttcaac cctcctgatc tggacatcat gaataaacct 2461 ccccggaacc caaaggaacc attgatcagc gggtggctct ttttccgtta cttggctatt 2521 ggctgttacg tcggcgctgc taccgtgggt gctgctgcat ggtggttcat tgctgctgac 2581 ggtggtccaa gagtgtcctt ctaccagctg agtcatttcc tacagtgtaa agaggacaac 2641 ccggactttg aaggcgtgga ttgtgcaatc tttgaatccc catacccgat gacaatggcg 2701 ctctctgttc tagtaactat agaaatgtgt aacgccctca acagcttgtc cgaaaaccag 2761 tccttgctga ggatgccccc ctgggagaac atctggctcg tgggctccat ctgcctgtcc 2821 atgtcactcc acttcctgat cctctatgtc gaacccttgc cactcatctt ccagatcaca 2881 ccgctgaacg tgacccagtg gctgatggtg ctgaaaatct ccttgcccgt gattctcatg 2941 gatgagacgc tcaagtttgt ggcccgcaac tacctggaac ctgcaatact ggagtaaccg 3001 cttcctaaac cattttgcag aaatgtaagg gtgttcggtt gcgtgcatgt gcgtttttag 3061 caacacatct accaaccctg tgcatgactg atgttgggga aaaagaaaag taaaaaactt 3121 cccaactcac tttgtgttat gtggaggaaa tgtgtattac caatggggtt gttagctttt 3181 aaatcaaaat actgattaca gatgtacaat ttagcttaat cagaaagcct ctccagagaa 3241 gtttggtttc tttgctgcaa gaggaatgag gctctgtaac cttatctaag aacttggaag 3301 ccgtcagcca agtcgccaca tttctctgca aaatgtcata gcttatataa atgtacagta 3361 ttcaattgta atgcatgctt cggttgtaag tagccagatc cctctccagt gacattggaa 3421 catgctactt tttaattggc cctgtacagt ttgcttattt ataaattcat taaaaacact 3481 acaggtgttg aatggttaaa atgtaggcct ccagttcatt ttcagttatt ttctgagtgt 3541 gcagacagct atttcgcact gtattaaatg taacttattt aatgaaatca gaagcagtag 3601 acagatgttg gtgcaataca aatattgtga tgcatttatc ttaataaaat gctaaatgtc 3661 aatttatcac tgcgcatgtt tgactttaga ctgtaaatag agatcagttt gtttctttct 3721 gtgctggtaa caatgagcgt cgcacagaca tggtttcagg taaataaatc tattctatga 3781 t // LOCUS HUMHMG17G 7195 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human HMG-17 gene for non histone chromosomal protein HMG-17. ACCESSION X13546 KEYWORDS HMG-17 chromosomal protein; HMG-17 gene; non histone protein. SOURCE human (Homo sapiens). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 7195; enum. 1 to 7195) AUTHORS Landsman,D. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (sites; for [1]) AUTHORS Landsman,D., McBride,O.W. and Bustin,M. TITLE Human non-histone chromosomal protein HMG-17: identification, characterization, chromosome localization and RFLP's of a functional gene from the large multigene family JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic COMMENT *source: cell line=MCF-7; library=EMBL3; clone=pH17g; **map: chromosomal location=1p12-34. See also HSHMG17 (M12623) for the cDNA sequence and HSHMG17P (X06444) for a retropseudogene HMG-17, published by Landsman D. and others. EMBL features not translated to GenBank features: key from to description PRM 2315 2319 CAAT-box PRM 2372 2377 TATA-box TRANSCR 2402 5851 primary transcript MSG 2402 2523 exon 1 IVS 2524 3369 intron I MSG 3370 3414 exon 2 IVS 3415 3589 intron II MSG 3590 3619 exon 3 IVS 3620 3969 intron III MSG 3970 4020 exon 4 IVS 4021 4456 intron IV MSG 4457 4552 exon 5 IVS 4553 4997 intron V MSG 4998 5851 exon 6 [1] Author address Landsman D., Laboratory of Molecular Carcinogenesis, National Institutes of Health, Building 37, Room 3D20, Bethesda, MD 20892, U.S.A__ Submitted (12-NOV-1988) on tape to the EMBL data library by: FEATURES from to/span description pept 2509 2523 put. HMG-17 protein (AA 1-5) /hgml_locus_uid="LY0033Z" /map="1p36.1-p35" /nomgen="HMG17" 3370 3414 put. HMG-17 protein (AA 6-20) 3590 3619 put. HMG-17 protein (AA 21-30) 3970 4020 put. HMG-17 protein (AA 31-47) 4457 4552 put. HMG-17 protein (AA 48-79) 4998 5033 put. HMG-17 protein (AA 80-90) BASE COUNT 1664 a 1754 c 1809 g 1968 t ORIGIN 1 aagcttgttc taggtcctct gcttgtgagg gtcaaagctg tgtcctttcc cttacctccc 61 tctgccagga cttgctgcag agctgctgag aggattagtg cctttgaaga gctgtctgcc 121 tgagcaactc tatttcaggt gccccacacc ggcaagtacc agccagcaac accaaccaaa 181 tgctactctc tttaaagtcc attttccttc ttcttttttt tttttttttt ttggagacag 241 catctcaccc tgtcctggct ggtctcgaac tcctgacctc aggtgatccg ccggcctcac 301 cctcccaaag tgctgggatt acaggcatga gccaccatgc cagacttccc attttacttt 361 ctgcaagctg tttccctagc agctccctct aggggagagg tgaaatcttg caagttgtag 421 caagagcaca caggaaaccc ctaactttcc tataccccac ccgcctcttc ccctttctgt 481 cccgggatac tggcggcaag agacttcttg gctattgtcc atgctcccag aatcaagcat 541 aaatgccaga cacggcgatt gagaagccaa tcagtgaacc ctttgcaaag ccccatccac 601 acctggcact cccctctacc aatccctggc acagggttcc tggagagcag gtgctgtaca 661 ttttacagct ttacaatggg gctgttgaca gccataatta gggaggcatg aattatgcgg 721 ctataatgca gagccctaca attaaggcgg gaatgagggg ctggaggcag caaacggaat 781 ctgccctatg agcgtggctg ttgagtcctg tctcctgggt ctgactttcc gtaatatgat 841 tggggtacag tagaggtgat taatggggct ggcatctctc tttggcctga ggttctgtat 901 tctgggaaag gtatacaggg tggagtaggg agaagctgcc ccaggaggcg atgtagtggt 961 ggaaagaaga ggcagagagg tcgtcgtcgt cgcccagcag caagggctgc aaaatagtag 1021 aactcgtggt tgctttggac aggtgtgatt tgtgcaagcc aggttcaacc cttgcctcaa 1081 gaaatcagat gggaccaatt tagtgtcctt ccacctgtga gccaagcccc catttgagga 1141 catctatcgt attcttgtgt gctgggtctc aaatagaatt tttaaagatt cttagatgta 1201 aaacttgttt gctaactgca acggaagaac accttctttt gctgctgcaa ttgcctaccc 1261 tccctcaggt gcacttggct agctccaaga ccctggcttt ggactagggg gcaccggaag 1321 agcccttttg ccatcacacc acctgttcct ctacttgttt cctaatcccc gggcgctttc 1381 ctccaccctc ctttcctcgc gcaagtgaaa ggcagccagc ttgacagtgt ggaaggaggg 1441 atttcgccag ctgcggctgc agtgcccagg cttggtggcc gcgtcgggtt gggcagctgg 1501 gcaaaatgtc tccgagaggg gtagagtgta gggggaagtg agtgctggta gacctcgacg 1561 gccgctctcc accatactgt ggcggggtct ggaggtttgg gaatacaggg accttaaggg 1621 agccacccga gaggagttgg gggtgggggc ttcgatagca tttggggagg cggctgcgca 1681 acgggtgctg aaaggacagc tcctacctga cccacggcac ccaaaaggca cctctgcatg 1741 gaaagtctcc ccctgaaacc cctagcccac ctcgagtggg accgtctttc aggtgaggtg 1801 cacgaaaggc ttctccgaag tggcagctcg gaaggatgca cgtccggctg cgccaagctg 1861 gtttccccgc ctttttccca cttcggcgtc agcctgggct ctgggaactg cagcctgtgc 1921 ctccagccgc gcgcttcctc ctgcttccag aggagcaggg cagtgcagcc aggcctcgca 1981 aacccctctt ctggggcccc cagcgggagc aggaaggaaa gccgctgaga tgcgccgtga 2041 gtgtcgcacg gcttggagaa gtgggttgct gggttgcata aggtggaagg aacttgctag 2101 tttgcaaaag acagctcttg gcgggagccg gaacccggag cccgccgact cgcgcacccc 2161 tcctcattta ctgccttcga cgcgcgccac cccgactgcg cccccacccc ctcggcgcgc 2221 cgccgtctcc gcgcactacc ccccccacac acccccctcc tcactccctc cagaggaggt 2281 gagtttaaac cccgcccacg tgaccccagc tgggccaatg aacggcggcg ggaggtgaaa 2341 tcggttctaa ccggtccggg gctcccagcg ctataaaaac tttataaacc ccccggagcc 2401 cgagcagtgt gaagaagagg cgagaacgac ccccggaccg accaaagccc gcgcgccgct 2461 gcatcccgcg tccagcacct acgtcccgct gccgtcgccg ccgccaccat gcccaagaga 2521 aaggtacgtg gcgcgagggc cccaggcgcc gggccaccac tgccgccacc gccgccgccg 2581 cctccctggt gcagggagcg agaatcggcg ccgagcagga gccagcgcac cgtccccgcg 2641 cgggggctgg agacggtgtc gggcagctcg gggctaaccc tgagcggctc ggctgcccgc 2701 gggcgccaga ggccatattg gaggagcggc ggccgcggcg ggaggagcca tgttggcggc 2761 tgtttatccc gctctcctcg ctcgccgccc ccgccccgtg ccccctcccc catcgcgagg 2821 gccccttccc tccctctgcg ggcgggcaat tcaaacccga aagggcggga aggcggcgct 2881 cggggttggc gggcggggga aagcgtgccg ccaaaaaacc gccgccgtga ggcggggctt 2941 gtgcggtatg gccccgcccc ctcgcccacg ttccccgcgc acgagacgcg cgctgtcgcc 3001 gcccacgagt tccccgggct gcgcgcgcct ccctctcccg ccctcgacgg ctgccatagc 3061 aacggcgctg ggctccgctc cggagggggt ttgtttgcgc catctgcagc tgttgctcct 3121 gcctgtcgcg gtggtgcggg ctccgctgcc ctccccggct gcgctccggt ccagccctcg 3181 cttctcgggg tcggcgagcc ggagctcctg cgcgcgcttc gttcttatac gaacgtcggg 3241 ctcactcatt tatgtcctag aaaagttgtg tggacgactg ctttaatttt catttttagc 3301 actctaaagt ttgggaagta attaagaacc acctcaaaat ctgcagtttt ttgttcttgt 3361 ttcttatagg ctgaagggga tgctaaggga gataaagcaa aggtgaagga cgaagtaagt 3421 cattctctct tcaagggtca aagccttgga ctagcagagg ccactggact cggtgattaa 3481 ccgtaacctg tgtcctgaat ttacactcct ataatctaga gcaaattgat accaaacttt 3541 caaagcgact tacctgtcct atttctaact ttctcgttgt ctttaatagc cacagagaag 3601 atccgcgagg ttgtctgctg taagtgtatg cttttgaatt ttcgtgcttg tccctgaaac 3661 taaaaaacat caaaaaacaa ttccctttgc ttccatgaat tatggttagt gcctggtttt 3721 gaatcattgc ctctacttgg gactcttgcc ctttgggttt tgctggttct gaaattctga 3781 tgcctgtagc caaagtggga catttgagtg ggcttctgga aatccaacat tctagaagaa 3841 agccaaccac aaaaactttg aggaggagga gaaacttctc tacccttggt acttttggtt 3901 ggttgtgggt ggttttcttc agtccattgt actgatgttc actttttcct ctcttcctgc 3961 caaaaaaaga aacctgctcc tccaaagcca gagcccaagc ctaaaaaggc ccctgcaaag 4021 gtaagtgcta acattggaac tgatcatttt cacagaatga ggactgtcct tagtgcctta 4081 acttaattag cataatggtg cctccattaa tggaggttat aaactgtgtg gatagcttac 4141 ctcgtccgtg tcattcataa cgttggtttc ctgatcaaga attctgttgt tagtttatag 4201 gaaaatctgc atttaaatat taaatacccc agggtttaaa ctgatttatt cttctgttac 4261 ttaggaaatt ctaagtacta aagattatag caaattacag ataatttagc ctagttttga 4321 tcagttgttc ttgtcaaatg ggtgagggtt tgtgttttat cttgagtaaa caatcctacc 4381 ttgtgcagaa ctttgcagac cataccttgg taatacgaaa tggggagaaa cagttctatt 4441 ttttcccctt tttcagaagg gagagaaggt acccaaaggg aaaaagggaa aagctgatgc 4501 tggcaaggag gggaataacc ctgcagaaaa tggagatgcc aaaacagacc aggtataact 4561 gctgtttcac cctttgttag atttgttcat tcagttagtt gctgatatca aaaatttaat 4621 tgcccttttc ttgtatcact ccaaatgtac catttggtcc agtgtgcttg tggctttcct 4681 gttaacttaa atcctggatt cttgaaatct ctactgtcag ctgaagggca ttgtgtttta 4741 tactacatga agttttcaag ctaggagtaa cctcaggtct cttattctgg tgttcttcct 4801 acctcaattt tacagtgagg cccagaggag aggtgactta gcaaagttac ctagtaagtc 4861 attctcagaa gaaatactga gtctcagtac ctgaaacctg aactttggcc tgggagcaaa 4921 gtgatgctgt aggtggaatg tgaacacaga tagttttgaa atctacgcat tgcattaatt 4981 tgtctgtttt cttttaggca cagaaagctg aaggtgctgg agatgccaag tgaagtgtgt 5041 gcatttttga taactgtgta cttctggtga ctgtacagtt tgaaatacta ttttttatca 5101 agttttataa aaatgcagaa ttttgtttta cttttttttt ttttttaaaa gctatgttgt 5161 tagcacacag aacacttcat tgttgttttt gggggaaggg gcatatgtca ctaatagaat 5221 gtctccaaag ctggattgat gtggagaaaa cacctttccc ttctagtttt gagagacttc 5281 ctcttggctc ccaggaggag ggattccctg actttgacac acatggccac cttggcacaa 5341 aagccttgtg gtatagaaaa acaaatttgt ttttatgtcc tcttctccct ttccatcttt 5401 cagcatagac ttaactccct taagcccaga catctgttga gacctgaccc ctagtcattg 5461 gttaccagtg tgtcaggcaa tctggacttt ccagtgatgc cactgagatg gcacctgtca 5521 aaagagcagt ggttccattt ctagattgtg gatcttcaga taaattctgc cattttcatt 5581 tcacttcctg aaagtcaggg tcggcttgtg aaaagttgtt aaacaacatg ctaaatgtga 5641 aatgtcaacc ctcactctaa actttccctg ttcagagcat cagatgaaga cttcattggg 5701 ttttatagtg gctttctgat ttttggtagt ccattgaaga agggagtttg aaagttgttg 5761 tatactgtta acgattgtct gcccatgtcc tgcctgaaat accatgattg tttatggaaa 5821 gtatctttaa taaagctgga tacagtttgg cttggaatgc tgcctctgat cttttccaca 5881 agtggggtaa cctggtttat ccaagtctct tggaacaggg tacgttctgc tttgaggtac 5941 tccatgtaca gtctaatgat gatctttcac tgatttatag tcagcttcca aaacacacag 6001 tgctaggtgc agttaggaag gattccagga gtagacttag ctgatggtga aaggctggga 6061 gtatggagtg atttctgtac ttggttatgc agcattccct gcagtgtttg ttttttgcat 6121 gactccgaat acatgaagtg tattaaatgc agatttttgg tagatctgcg gttttggctg 6181 ggtagctgac aggagcctct ctacttccat ctgcatttgc aacttctata aaattagcaa 6241 attaatagac tcatgtcaga gagtgtctct gtgaatgtgt acacagttgt ccatttgtaa 6301 ggtagatgac actttgctcc taagtagacc tgcaaacaaa gacaatgggg tctttcctga 6361 ttcctttagt ttggagatgc agagaacccc acgtttctct aagtgaaatg gcaagaagtt 6421 atttgttaga atatggaact ggtaacaaat tcatgtttgc accttgaatt tgctttcaca 6481 gacctggctg tattgttgga caggaataaa gtaaattgga gactaattgc tttgtccatt 6541 cccttagtca aaaaagtatc gctgttaagt tcttgaggtt ttttgtgtat tatctctcaa 6601 gtgttgagat gtttgtgtaa tatatgtatc acactcatgg atgaggaatc aggcacagat 6661 gaaacttgtt gaaggttatg tgtacattta atgagcagca gagtcaggat tcagaccctc 6721 aactagacaa aagtcatctc tgacagactt ttattcagga taaaatacca tttgtccatt 6781 tagccagtta ccagccattt actttcatta ttgctgaagt cattcattct aggttgtaga 6841 cgggtaattt ttattctaaa agcggtgtcc tcttccagag gaaacttaag agacatctta 6901 gaatggtcag gggtagtggg ctggtatttg agcagaggtt gcagaatttg agaaagttac 6961 agctgaggtt ctttgttagt ggcaagtctg ctaagccctg ggctatgcta acagctctgc 7021 ctcatgctct ttaatctttg agtcagacct atgagataca ggtgtagccc cattttgtag 7081 tcgagagtga ggctcagatg tgaagcgact ggctgaggtt aaacttgata gtctgtttac 7141 ttattttctg gccctcttct tgttgccata taattaggat gagtagggta agctt // LOCUS HUMIGLVB 464 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human Ig germline lambda-chain (autoantibody Humv117) gene, V-region, partial cds. ACCESSION M30446 KEYWORDS V-region; anti-DNA antibody; autoantibody; germline; immunoglobulin light chain; lambda-immunoglobulin. SOURCE Human retinoblastoma cell line Y79 DNA, (library of W.-H.Lee). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 464) AUTHORS Siminovitch,K.A., Misener,V., Kwong,P.C., Song,Q.-L. and Chen,P.P. TITLE A natural autoantibody is encoded by germline heavy and lambda light chain variable region genes without somatic mutation JOURNAL J. Clin. Invest. 84, 1675-1678 (1989) STANDARD simple staff_entry FEATURES from to/span description pept / 44 / 349 Ig lambda chain V-region precursor (AA at 46) sigp / 44 54 Ig lambda chain signal peptide (AA at 46) matp 55 / 349 Ig lambda chain IVS < 1 43 Ig lambda-chain intron iDNA 350 > 464 intervening DNA (5' end approx.) signal 351 357 7 mer recombination signal signal 381 389 9 mer recombination signal BASE COUNT 99 a 150 c 118 g 97 t ORIGIN 1 tctagaccaa gaatcaccgt gtctgtgtct ctcctgcttc cagggtcctg ggcccagtct 61 gtgttgacgc agccgccctc agtgtctgcg gccccaggac agaaggtcac catctcctgc 121 tctggaagca gctccaacat tgggaataat tatgtatcct ggtaccagca gctcccagga 181 acagccccca aactcctcat ctatgaaaat aataagcgac cctcagggat tcctgaccga 241 ttctctggct ccaagtctgg cacgtcagcc accctgggca tcaccggact ccagactggg 301 gacgaggccg attattactg cggaacatgg gatagcagcc tgagtgctgg cacagtgctc 361 cagcccaatg gggaactgag acaagaaccc ccttcttcct cccccaggag ggtgagtgcc 421 gccagctgct gctcacgcct gacctgtagc ttctgctgct gcag // LOCUS HUMIL71 2195 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human interleukin 7 (IL7) gene, exon 1. ACCESSION M29048 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 1 of 6 SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA to mRNA, clone 3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2195) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept 2167 + 2176 interleukin 7, exon 1 pre-msg < 1783 > 2195 IL7 mRNA and introns IVS 2177 > 2195 IL7 intron A BASE COUNT 570 a 543 c 498 g 584 t ORIGIN 1783 bp upstream of EcoRI site. 1 ctaggcgtgt ttcattatag aaatcatttt taaaactcac tccagtgtca tgtctgtcct 61 cctccccacc tcccactcta ggttccagtc aaatggaaat atgtggcttt tctcactcca 121 ttctgtctct cttttccatt tgcttacccc cttcgcttcc tctttcattc tatttccttc 181 tgctttcatg aactttctct ttcacacata ttcactgatg cattcccact cacctgttct 241 ctcttactct gtcacccccc agtacttact gaatgccagc catgggctaa agaccatagt 301 aagtggacgc tgggcaggaa gagaaaggcc attgtggctg gaggatagtg atgtggtacc 361 acgaaaacgt cagagagggc agtggggccc agatcaccag ggcactgagg gcctggtacc 421 cagtgggaag tgaccagaag gtggggcaca aaatagaatc agatatcctg ctgctctgag 481 aagaatgaag gggttgtggg caaaagtgaa agctggaaga tcccttagac agtctctata 541 gtcaccttag ggttaatttt ggaggtgggg cagacaggac ttgtctatgg atttgatgtg 601 ggaaatgaga ccaaaaaagg aatcagagac ccaagtttca ggcttgagca actctgttct 661 ccctagttca ttgccatcct tttttttcct ttctccttcc ctcctcttcc ttgttcacgc 721 tatagcagaa tgtgacatac tccttaccag ccccaagttc attcctcttc aagtgtcagt 781 ttagaagtca cagactccag gaaactttcc tgaagtcccc acactggggc aaagccacac 841 acgtactgtc atctccctcc cggcatcagt gtgcacatta ttactttatt cacaggagtt 901 ccaagctgga gtttctacag cccaaatgca gcaagtctgt gtctaagctc agttagctaa 961 gtgaaatttg gaaagttact taaggaagtc tcagtcccca catctgtaaa gtgcagataa 1021 gaaaatggtc tacctcacac aaagttttgc atgagatgat atacataaaa tgcttagcac 1081 aaaactggca gagagcaaat gctagataaa tgatagtcgt tattattatc gctgttgtta 1141 ctggtttaca ttatccacct tcatctaagc accctttctg cagaatagca gaaaccaaac 1201 taatgtagca aataagctac ataattcaag cccaggaaaa agttaacatt tcagtggcat 1261 gcattcaaga cgaatagttt gatttattag ccaattcaga taaatgtgca cgtggaagtc 1321 atagttaaat attatcgtca gtttccacgt cctgcgttta atttggggtt tgattttcca 1381 aatacaacac ttaccagatt aggtggaccc acaggattat ttttccttga ggtctcacct 1441 gagcaggtgc atgtacagca gacggagcag aaagagactg attagagagg ttggagtggt 1501 agagggcgtg accctcttaa tcattcttca cttccttttt taaaagacga cttggcatcg 1561 tccaccacat ccgcggcaac gcctccttgg tgtcgtccgc ttccaataac ccagcttgcg 1621 tcctgcacac ttgtggcttc cgtgcacaca ttaacaactc atgggtctag ctcccagtcg 1681 ccaagcgttg ccaaggcgtt gagagatcat ctgggaagtc ttttacccag aattgctttg 1741 attcaggcca gctggttttt ctgcggtgat tcggaaattc gcgaattcct ctggtcctca 1801 tccaggtgcg cgggaagcag gtgcccagga gagaggggat aatgaagatt ccatgctgat 1861 gatcccaaag attgaacctg cagaccaagc gcaaagtaga aactgaaagt acactgctgg 1921 cggatcctac ggaagttatg gaaaaggcaa agcgcagagc cacgccgtag tgtgtgccgc 1981 cccccttggg atggatgaaa ctgcagtcgc ggcgtgggta agaggaacca gctgcagaga 2041 tcaccctgcc caacacagac tcggcaactc cgcggaagac cagggtcctg ggagtgacta 2101 tgggcggtga gagcttgctc ctgctccagt tgcggtcatc atgactacgc ccgcctcccg 2161 cagaccatgt tccatggtaa gcgctcttct ccctt // LOCUS HUMIL72 177 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human interleukin 7 (IL7) gene, exon 2. ACCESSION M29049 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 2 of 6 SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA to mRNA, clone 3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 177) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept + 21 + 157 interleukin 7, exon 2 pre-msg < 1 > 177 IL7 mRNA and introns IVS < 1 20 IL7 intron A IVS 158 > 177 IL7 intron B BASE COUNT 43 a 26 c 34 g 74 t ORIGIN About 3 kb after segment 1. 1 ttatttactg tcaaatttag tttcttttag gtatatcttt ggacttcctc ccctgatcct 61 tgttctgttg ccagtagcat catctgattg tgatattgaa ggtaaagatg gcaaacaata 121 tgagagtgtt ctaatggtca gcatcgatca attattggta tgtgattatt ttgtttt // LOCUS HUMIL73 121 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human interleukin 7 (IL7) gene, exon 3. ACCESSION M29050 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 3 of 6 SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA to mRNA, clone 3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 121) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept + 21 + 101 interleukin 7, exon 3 pre-msg < 1 > 121 IL7 mRNA and introns IVS < 1 20 IL7 intron B IVS 102 > 121 IL7 intron C BASE COUNT 44 a 10 c 22 g 45 t ORIGIN Undetermined number of bp after segment 2. 1 ttttatgtta tttattacag gacagcatga aagaaattgg tagcaattgc ctgaataatg 61 aatttaactt ttttaaaaga catatctgtg atgctaataa ggtaatgata attatttgga 121 g // LOCUS HUMIL74 157 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human interleukin 7 (IL7) gene, exon 4. ACCESSION M29051 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 4 of 6 SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA to mRNA, clone 3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 157) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept + 21 + 152 interleukin 7, exon 4 pre-msg < 1 > 157 IL7 mRNA and introns IVS < 1 20 IL7 intron C IVS 153 > 157 IL7 intron D BASE COUNT 45 a 28 c 32 g 52 t ORIGIN About 1.5 kb after segment 3. 1 ctgacttttt cctataatag gaaggtatgt ttttattccg tgctgctcgc aagttgaggc 61 aatttcttaa aatgaatagc actggtgatt ttgatctcca cttattaaaa gtttcagaag 121 gcacaacaat actgttgaac tgcactggcc aggtaag // LOCUS HUMIL75 94 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human interleukin 7 (IL7) gene, exon 5. ACCESSION M29052 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 5 of 6 SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA to mRNA, clone 3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 94) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept + 21 + 74 interleukin 7, exon 5 pre-msg < 1 > 94 IL7 mRNA and introns IVS < 1 20 IL7 intron D IVS 75 > 94 IL7 intron E BASE COUNT 33 a 13 c 22 g 26 t ORIGIN About 1 kb after segment 4. 1 aatgtgactt tgtttttaag gttaaaggaa gaaaaccagc tgccctgggt gaagcccaac 61 caacaaagag tttggtgaga ataattgtat aatt // LOCUS HUMIL76 1672 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human interleukin 7 (IL7) gene, exon 6. ACCESSION M29053 KEYWORDS hemopoietic growth factor; interleukin; lymphopoietin; pre-B cell growth factor. SEGMENT 6 of 6 SOURCE Human leukocyte DNA, clones lambda-hIL7.[1,13,17,20b,20g], and cDNA to mRNA, clone 3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1672) AUTHORS Lupton,S.D., Gimpel,S., Jerzy,R., Brunton,L.L., Hjerrild,K.A., Cosman,D. and Goodwin,R.G. TITLE Characterization of the human and murine interleukin-7 genes JOURNAL Unpublished (1989) IMMUNEX Corporation, Seattle, Washington 98101 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Lupton, 26-SEP-1989. FEATURES from to/span description pept + 21 140 interleukin 7, exon 6 pre-msg < 1 811 IL7 mRNA and introns IVS < 1 20 IL7 intron E signal 753 758 polyA signal signal 1060 1065 polyA signal BASE COUNT 586 a 266 c 258 g 562 t ORIGIN About 0.2 kb after segment 5. 1 tttaaaactc tattctctag gaagaaaata aatctttaaa ggaacagaaa aaactgaatg 61 acttgtgttt cctaaagaga ctattacaag agataaaaac ttgttggaat aaaattttga 121 tgggcactaa agaacactga aaaatatgga gtggcaatat agaaacacga actttagctg 181 catcctccaa gaatctatct gcttatgcag tttttcagag tggaatgctt cctagaagtt 241 actgaatgca ccatggtcaa aacggattag ggcatttgag aaatgcatat tgtattacta 301 gaagatgaat acaaacaatg gaaactgaat gctccagtca acaaactatt tcttatatat 361 gtgaacattt atcaatcagt ataattctgt actgattttt gtaagacaat ccatgtaagg 421 tatcagttgc aataatactt ctcaaacctg tttaaatatt tcaagacatt aaatctatga 481 agtatataat ggtttcaaag attcaaaatt gacattgctt tactgtcaaa ataattttat 541 ggctcactat gaatctatta tactgtatta agagtgaaaa ttgtcttctt ctgtgctgga 601 gatgttttag agttaacaat gatatatgga taatgccggt gagaataaga gagtcataaa 661 ccttaagtaa gcaacagcat aacaaggtcc aagataccta aaagagattt caagagattt 721 aattaatcat gaatgtgtaa cacagtgcct tcaataaatg gtatagcaaa tgttttgaca 781 tgaaaaaagg acaatttcaa aaaaataaaa taaaataaaa ataaattcac ctagtctaag 841 gatgctaaac cttagtactg agttacattg tcatttatat agattataac tgtctaaata 901 agtttgcaat ttgggagata tatttttaag ataataatat atgtttacct tttaattaat 961 gaaatatctg tatttaattt tgacactata tctgtatata aaatattttc atacagcatt 1021 acaaattgct tactttggaa tacatttctc ctttgataaa ataaatgagc tatgtattaa 1081 cactgccaga ttcagttaat aaatctcaac agaattttta aggtgagatt tttaatactt 1141 cactgctctt taattttcta ctttcattga atatatgacc tttaatagcc tattaccaac 1201 aatgaatatg agacattacc ttctatgtta aattttagtc ttttaaaaga aacttgtctg 1261 gaaatgtctg cagtcttcat tcactgataa agggaaatgg aaacactatt ctaagcaggc 1321 cagaatctac aaatagctgt cagtttgcat tgcactgaac tcacatcctt catgctctct 1381 cagcctcatc accatctagt tatgtgctga tgggctttat ttgatctaca gaagatgctc 1441 tggtgacacc ctcagtgtgt gttggtaaca ccttcctgcc ttctctcact ttccttgacc 1501 catatgacat ttcaccttca ctggtgcgat agctgtatag aattgtatac acttccatac 1561 aatttactga actgccccat taacaggact ttgagtcaag tatttttttc tgtttggtta 1621 tcactgttaa ctccagaaag atgagttgac taacattttt tagaagcaaa tg // LOCUS HUMINSRMUT 349 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human insulin receptor gene allele C, exon 17, with an Alu insertion. ACCESSION M27196 KEYWORDS Alu repetitive element; insulin receptor. SOURCE Human (patient 1, haplotype C) white bloodcell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 349) AUTHORS Taira,M., Taira,M., Hashimoto,N., Shimada,F., Suzuki,Y., Kanatsuka,A., Nakamura,F., Ebina,Y., Tatibana,M., Makino,H. and Yoshida,S. TITLE Human diabetes associated with a deletion of the tyrosine kinase domain of the insulin receptor JOURNAL Science 245, 63-66 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.Taira, 02-SEP-1989. FEATURES from to/span description pept.ps / 40 341 pseudo-insulin receptor, exon 17 (AA at 42) /nomgen="INSR" /map="19p13.1-p13" /hgml_locus_uid="LG0007M" IVS < 1 39 INSR intron P rpt 145 231 Alu repeat (partial) rpt 233 349 Alu repeat (partial) BASE COUNT 99 a 81 c 89 g 80 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcctcca aggatgctgt gtagataagt aagaagtagt gtttccatgc tctgtgtacg 61 tgccggacca gtgggaggtg tctcgagaga agatcaccct ccttcgagag ctggggcagg 121 gctccttcgg catggtgtat gaggctggtc tcgaactccc cacctcaggt gatccgcgtg 181 ccttggcctc ccaaagtggt gggattacag gcgtgagcca ccacgcccag cccctgcact 241 gtggttttta tcaccacagt gaagggaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaagct 301 cttttaactc tatgctttgt gtccactttc caaaagaatg aaaatagat // LOCUS HUMPDGFRB 400 bp ss-mRNA PRI 15-MAR-1990 DEFINITION Human platelet-derived growth factor beta mRNA fragment. ACCESSION M30493 KEYWORDS platelet-derived growth factor. SOURCE Human embryonic lung fibroblast, cDNA to mRNA, clone HPR5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 400) AUTHORS Miki,T., Matsui,T., Heidaran,M.A. and Aaronson,S.A. TITLE An efficient directional cloning system to construct cDNA libraries containing full-length inserts at high frequency JOURNAL Gene 83, 137-146 (1989) STANDARD simple staff_entry FEATURES from to/span description mRNA < 1 > 400 platelet-derived growth factor beta mRNA /nomgen="PDGFRB" /map="5q33-q35" /hgml_locus_uid="LW0046Z" BASE COUNT 70 a 144 c 112 g 74 t ORIGIN Chromosome 5q33-q35. 1 gccggcccct cagccctgct gcccagcacg agcctgtgct cgccctgccc aacgcagaca 61 gccagaccca gggcggcccc tctggcggct ctgctcctcc cgaaggatgc ttggggagtg 121 aggcgaagct gggcgctcct ctcccctaca gcagccccct tcctccatcc ctctgttctc 181 ctgagccttc aggagcctgc accagtcctg cctgtccttc tactcagctg ttaccaactc 241 tgggaccagc agtctttctg ataactggga gagggcagta aggaggactt cctggagggg 301 gtgactgtcc agagcctgga actgtgccca caccagaagc catcagcagc aaggacacca 361 tgcggcttcc gggtgcgatg ccagctctgg ccctcaaagg // LOCUS HUMPDGFRC 150 bp ss-mRNA PRI 15-MAR-1990 DEFINITION Human platelet-derived growth factor alpha mRNA fragment. ACCESSION M30494 KEYWORDS platelet-derived growth factor. SOURCE Human embryonic lung fibroblast, cDNA to mRNA, clone TR4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 150) AUTHORS Miki,T., Matsui,T., Heidaran,M.A. and Aaronson,S.A. TITLE An efficient directional cloning system to construct cDNA libraries containing full-length inserts at high frequency JOURNAL Gene 83, 137-146 (1989) STANDARD simple staff_entry FEATURES from to/span description mRNA < 1 > 150 platelet-derived growth factor alpha mRNA /nomgen="PDGFRA" /map="4q11-q13" /hgml_locus_uid="LR0223Z" BASE COUNT 40 a 30 c 47 g 33 t ORIGIN Chromosome 4q11-q13. 1 ccattactgt tggagctaca gggagagaaa caggaggaga ctgcaagaga tcatttggga 61 aggccgtggg cacgctcttt actccatgtg tgggacattc attgcggaat aacatcggag 121 gagaagtttc ccagagctat ggggacttcc // LOCUS HUMPSAA 7130 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human prostate specific antigen gene, complete cds. ACCESSION M27274 KEYWORDS Alu repetitive element; kallikrein; prostate specific antigen. SOURCE Human peripheral leukocyte DNA, (library of R.Wydro), clones lambda-PSA.[24A,25A,25B]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 7130) AUTHORS Lundwall,A. TITLE Characterization of the gene for prostate-specific antigen, a human glandular kallikrein JOURNAL Biochem. Biophys. Res. Commun. 161, 1151-1159 (1989) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Lundwall, 03-NOV-1989. FEATURES from to/span description pept 675 720 prostate specific antigen, exon 1 /nomgen="APS" /map="19q13.3-qter" /hgml_locus_uid="LN0098S" 1959 2118 prostate specific antigen, exon 2 3756 4042 prostate specific antigen, exon 3 4186 4322 prostate specific antigen, exon 4 5699 5854 prostate specific antigen, exon 5 pre-msg 634 6491 PSA mRNA and introns IVS 721 1958 PSA intron A IVS 2119 3755 PSA intron B IVS 4043 4185 PSA intron C IVS 4323 5698 PSA intron D rpt 2583 2935 Alu repeat signal 503 512 CACCC box signal 605 611 TATA box binding 581 586 SpI binding site BASE COUNT 1530 a 2024 c 1867 g 1709 t ORIGIN 1 bp upstream of EcoRI site; chromosome 19q13.3-qter. 1 gaattccaca ttgtttgctg cacgttggat tttgaaatgc tagggaactt tgggagactc 61 atatttctgg gctagaggat ctgtggacca caagatcttt ttatgatgac agtagcaatg 121 tatctgtgga gctggattct gggttgggag tgcaaggaaa agaatgtact aaatgccaag 181 acatctattt caggagcatg aggaataaaa gttctagttt ctggtctcag agtggtgcag 241 ggatcaggga gtctcacaat ctcctgagtg ctggtgtctt agggcacact gggtcttgga 301 gtgcaaagga tctaggcacg tgaggctttg tatgaagaat cggggatcgt acccaccccc 361 tgtttctgtt tcatcctggg catgtctcct ctgcctttgt cccctagatg aagtctccat 421 gagctacaag ggcctggtgc atccagggtg atctagtaat tgcagaacag caagtgctag 481 ctctccctcc ccttccacag ctctgggtgt gggagggggt tgtccagcct ccagcagcat 541 ggggagggcc ttggtcagcc tctgggtgcc agcagggcag gggcggagtc ctggggaatg 601 aaggttttat agggctcctg ggggaggctc cccagcccca agcttaccac ctgcacccgg 661 agagctgtgt caccatgtgg gtcccggttg tcttcctcac cctgtccgtg acgtggattg 721 gtgagagggg ccatggttgg ggggatgcag gagagggagc cagccctgac tgtcaagctg 781 aggctctttc ccccccaacc cagcacccca gcccagacag ggagctgggc tcttttctgt 841 ctctcccagc cccacttcaa gcccataccc ccagcccctc catattgcaa cagtcctcac 901 tcccacacca ggtccccgct ccctcccact taccccagaa ctttctcccc attgcccagc 961 cagctccctg ctcccagctg ctttactaaa ggggaagttc ctgggcatct ccgtgtttct 1021 ctttgtgggg ctcaaaacct ccaaggacct ctctcaatgc cattggttcc ttggaccgta 1081 tcactggtcc atctcctgag cccctcaatc ctatcacagt ctactgactt ttcccattca 1141 gctgtgagtg tccaacccta tcccagagac cttgatgctt ggcctcccaa tcttgcccta 1201 ggatacccag atgccaacca gacacctcct tcttcctagc caggctatct ggcctgagac 1261 aacaaatggg tccctcagtc tggcaatggg actctgagaa ctcctcattc cctgactctt 1321 agccccagac tcttcattca gtggcccaca ttttccttag gaaaaacatg agcatcccca 1381 gccacaactg ccagctctct gattccccaa atctgcatcc ttttcaaaac ctaaaaacaa 1441 aaagaaaaac aaataaaaca aaaccaactc agaccagaac tgttttctca acctgggact 1501 tcctaaactt tccaaaacct tcctcttcca gcaactgaac ctggccataa ggcacttatc 1561 cctggttcct agcacccctt atcccctcag aatccacaac ttgtaccaag tttcccttct 1621 cccagtccaa gaccccaaat caccacaaag gacccaatcc ccagactcaa gatatggtct 1681 gggcgctgtc ttgtgtctcc taccctgatc cctgggttca actctgctcc cagagcatga 1741 agcctctcca ccagcaccag ccaccaacct gcaaacctag ggaagattga cagaattccc 1801 agcctttccc agctccccct gcccatgtcc caggactccc agccttggtt ctctgccccc 1861 gtgtcttttc aaacccacat cctaaatcca tctcctatcc gagtccccca gttccccctg 1921 tcaaccctga ttcccctgat ctagcacccc ctctgcaggc gctgcgcccc tcatcctgtc 1981 tcggattgtg ggaggctggg agtgcgagaa gcattcccaa ccctggcagg tgcttgtggc 2041 ctctcgtggc agggcagtct gcggcggtgt tctggtgcac ccccagtggg tcctcacagc 2101 tgcccactgc atcaggaagt gagtaggggc ctggggtctg gggagcaggt gtctgtgtcc 2161 cagaggaata acagctgggc attttcccca ggataacctc taaggccagc cttgggactg 2221 ggggagagag ggaaagttct ggttcaggtc acatggggag gcagggttgg ggctggacca 2281 ccctccccat ggctgcctgg gtctccatct gtgtccctct atgtctcttt gtgtcgcttt 2341 cattatgtct cttggtaact ggcttcggtt gtgtctctcc gtgtgactat tttgttctct 2401 ctctccctct cttctctgtc ttcagtctcc atatctcccc ctctctctgt ccttctctgg 2461 tccctctcta gccagtgtgt ctcaccctgt atctctctgc caggctctgt ctctcggtct 2521 ctgtctcacc tgtgccttct ccctactgaa cacacgcacg ggatgggcct ggggggaccc 2581 tgagaaaagg aagggctttg gctgggcgcg gtggctcaca cctgtaatcc cagcactttg 2641 ggaggccaag gcaggtagat cacctgaggt caggagttcg agaccagcct ggccaactgg 2701 tgaaacccca tctctactaa aaatacaaaa aattagccag gcgtggtggc gcatgcctgt 2761 agtcccagct actcaggagg ctgagggagg agaattgctt gaacctggga ggttgaggtt 2821 gcagtgagcc gagaccgtgc cactgcactc cagcctgggt gacagagtga gactccgcct 2881 caaaaaaaaa aaaaaaaaaa aaaaaaaaaa agaaaagaaa agaaaagaaa aggaatcttt 2941 tatccctgat gtgtgtgggt atgagggtat gagagggccc ctctcactcc attccttctc 3001 caggacatcc ctccactctt gggagacaca gagaagggct ggttccagct ggagctggga 3061 ggggcaattg agggaggagg aaggagaagg gggaaggaaa acagggtatg ggggaaagga 3121 ccctggggag cgaagtggag gatacaacct tgggcctgca ggccaggcta cctacccact 3181 tggaaaccca cgccaaagcc gcatctacag ctgagccact ctgaggcctc ccctccccgg 3241 cggtccccac tcagctccaa agtctctctc ccttttctct cccacacttt atcatccccc 3301 ggattcctct ctacttggtt ctcattcttc ctttgacttc ctgcttccct ttctcattca 3361 tctgtttctc actttctgcc tggttttgtt cttctctctc tctttctctg gcccatgtct 3421 gtttctctat gtttctgtct tttctttctc atcctgtgta ttttcggctc accttgtttg 3481 tcactgttct cccctctgcc ctttcattct ctctgtcctt ttaccctctt cctttttccc 3541 ttggtttctc tcagtttctg tatctgccct tcaccctctc acactgctgt ttcccaactc 3601 gttgtctgta tttttggcct gaactgtgtc ttccccaacc ctgtgttttt ctcactgttt 3661 ctttttctct tttggagcct cctccttgct cctctgtccc ttctctcttt ccttatcatc 3721 ctcgctcctc attcctgcgt ctgcttcctc cccagcaaaa gcgtgatctt gctgggtcgg 3781 cacagcctgt ttcatcctga agacacaggc caggtatttc aggtcagcca cagcttccca 3841 cacccgctct acgatatgag cctcctgaag aatcgattcc tcaggccagg tgatgactcc 3901 agccacgacc tcatgctgct ccgcctgtca gagcctgccg agctcacgga tgctgtgaag 3961 gtcatggacc tgcccaccca ggagccagca ctggggacca cctgctacgc ctcaggctgg 4021 ggcagcattg aaccagagga gtgtacgcct gggccagatg gtgcagccgg gagcccagat 4081 gcctgggtct gagggaggag gggacaggac tcctgggtct gagggaggag ggccaaggaa 4141 ccaggtgggg tccagcccac aacagtgttt ttgcctggcc cgtagtcttg accccaaaga 4201 aacttcagtg tgtggacctc catgttattt ccaatgacgt gtgtgcgcaa gttcaccctc 4261 agaaggtgac caagttcatg ctgtgtgctg gacgctggac agggggcaaa agcacctgct 4321 cggtgagtca tccctactcc caagatcttg aggggaaagg tgagtgggga ccttaattct 4381 gggctggggt ctagaagcca acaaggcgtc tgcctcccct gctccccagc tgtagccatg 4441 ccacctcccc gtgtctcatc tcattccctc cttccctctt ctttgactcc ctcaaggcaa 4501 taggttattc ttacagcaca actcatctgt tcctgcgttc agcacacggt tactaggcac 4561 ctgctatgca cccagcactg ccctagagcc tgggacatag cagtgaacag acagagagca 4621 gcccctccct tctgtagccc ccaagccagt gaggggcaca ggcaggaaca gggaccacaa 4681 cacagaaaag ctggagggtg tcaggaggtg atcaggctct cggggaggga gaaggggtgg 4741 ggagtgtgac tgggaggaga catcctgcag aaggtgggag tgagcaaaca cctgccgcag 4801 gggaggggag ggccctgcgg cacctggggg agcagaggga acagcatctg gccaggcctg 4861 ggaggagggg cctagagggc gtcaggagca gagaggaggt tgcctggctg gagtgaagga 4921 tcggggcagg gtgcgagagg gaagaaagga cccctcctgc agggcctcac ctgggccaca 4981 ggaggacact gcttttcctc tgaggagtca ggaactgtgg atggtgctgg acagaagcag 5041 gacagggcct ggctcaggtg tccagaggct gccgctggcc tccctatggg atcagactgc 5101 agggagggag ggcagcaggg atgtggaggg agtgatgatg gggctgacct gggggtggct 5161 ccaggcattg tccccacctg ggcccttacc cagcctccct cacaggctcc tggccctcag 5221 tctctcccct ccactccatt ctccacctac ccacagtggg tcattctgat caccgaactg 5281 accatgccag ccctgccgat ggtcctccat ggctccctag tgccctggag aggaggtgtc 5341 tagtcagaga gtagtcctgg aaggtggcct ctgtgaggag ccacggggac agcatcctgc 5401 agatggtcct ggcccttgtc ccaccgacct gtctacaagg actgtcctcg tggaccctcc 5461 cctctgcaca ggagctggac cctgaagtcc cttccctacc ggccaggact ggagccccta 5521 cccctctgtt ggaatccctg cccaccttct tctggaagtc ggctctggag acatttctct 5581 cttcttccaa agctgggaac tgctatctgt tatctgcctg tccaggtctg aaagatagga 5641 ttgcccaggc agaaactggg actgacctat ctcactctct ccctgctttt acccttaggg 5701 tgattctggg ggcccacttg tctgtaatgg tgtgcttcaa ggtatcacgt catggggcag 5761 tgaaccatgt gccctgcccg aaaggccttc cctgtacacc aaggtggtgc attaccggaa 5821 gtggatcaag gacaccatcg tggccaaccc ctgagcaccc ctatcaactc cctattgtag 5881 taaacttgga accttggaaa tgaccaggcc aagactcaag cctccccagt tctactgacc 5941 tttgtcctta ggtgtgaggt ccagggttgc taggaaaaga aatcagcaga cacaggtgta 6001 gaccagagtg tttcttaaat ggtgtaattt tgtcctctct gtgtcctggg gaatactggc 6061 catgcctgga gacatatcac tcaatttctc tgaggacaca gataggatgg ggtgtctgtg 6121 ttatttgtgg gatacagaga tgaaagaggg gtgggatcca cactgagaga gtggagagtg 6181 acatgtgctg gacactgtcc atgaagcact gagcagaagc tggaggcaca acgcaccaga 6241 cactcacagc aaggatggag ctgaaaacat aacccactct gtcctggagg cactgggaag 6301 cctagagaag gctgtgagcc aaggagggag ggtcttcctt tggcatggga tggggatgaa 6361 gtaaggagag ggactggacc ccctggaagc tgattcacta tggggggagg tgtattgaag 6421 tcctccagac aaccctcaga tttgatgatt tcctagtaga actcacagaa ataaagagct 6481 cttatactgt ggtttattct ggtttgttac attgacagga gacacactga aatcagcaaa 6541 ggaaacaggc atctaagtgg ggatgtgaag aaaacaggga aaatctttca gttgttttct 6601 cccagtgggg tgttgtggac agcacttaaa tcacacagaa gtgatgtgtg accttgtgta 6661 tgaagtattt ccaactaagg aagctcacct gagccttagt gtccagagtt cttattgggg 6721 gtctgtagga taggcatggg gtactggaat agctgacctt aacttctcag acctgaggtt 6781 cccaagagtt caagcagata cagcatggcc tagagcctca gatgtacaaa aacaggcatt 6841 catcatgaat cgcactgtta gcatgaatca tctggcacgg cccaaggccc caggtatacc 6901 aaggcacttg ggccgaatgt tccaagggat taaatgtcat ctcccaggag ttattcaagg 6961 gtgagccctg tacttggaac gttcaggctt tgagcagtgc agggctgctg agtcaacctt 7021 ttactgtaca ggggggtgag ggaaagggag aagatgagga aaccgcctag ggatctggtt 7081 ctgtcttgtg gccgagtgga ccatggggct atcccaagaa ggaggaattc // LOCUS HUMRPS24A 659 bp ss-mRNA PRI 15-MAR-1990 DEFINITION Human ribosomal protein S24 mRNA. ACCESSION M31520 KEYWORDS ribosomal protein S24. SOURCE Human male lymphoblast from lymphoid tumor cell line HT1080 (ATCC 121) cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 133 to 641) AUTHORS Brown,S.J., Jewell,A., Maki,C.G. and Roufa,D.J. TITLE A cDNA encoding human ribosomal protein S24 JOURNAL Unpublished (1990) Kansas St. Univ. Manhattan, KS 66506 STANDARD full staff_entry REFERENCE 2 (bases 1 to 659) AUTHORS Brown,S.J. and Roufa,D.J. JOURNAL Unpublished (1990) Kansas St. Univ. Manhattan, KS 66506 STANDARD full staff_entry COMMENT Authorin Submission [2] kindly submitted by Roufa,D.J., 22-JAN-1990. FEATURES from to/span description pept / 132 51 (c) unknown protein (AA at 131) /evidence=EXPERIMENTAL pept 143 544 ribosomal protein S24 /function="component of human 40S ribosomal subunit" /evidence=EXPERIMENTAL /gene="HUMRPS24" /standard_name="rpS24" mRNA 132 1 (c) unknown mRNA /evidence=EXPERIMENTAL /note="800 nt. anonymous human transcript detected on Northern blots of HT1080 and HeLa cell cytoplasmic mRNAs" [2] signal 25 20 (c) poly-A signal /note="detected by pattern only" [2] signal 598 603 poly-A signal /evidence=EXPERIMENTAL [1] recomb 133 142 misc_recomb site /note="site of artificial recombination between 2 cDNAs which occurred during reverse transcription or recombinant DNA cloning operations" /evidence=EXPERIMENTAL [2] BASE COUNT 218 a 144 c 144 g 153 t ORIGIN 1 gggtttatcg gaaaatgtgt ttattgagat ggtttcccac tcatcttgac tcagagtgct 61 tttagtgctg cttcctcctg aaggaacatc cttctgtaag ccttgctttt cctccttggc 121 tgtctgaaga tagatcgcca tcatgaacga caccgtaact atccgcacta gaaagttcat 181 gaccaaccga ctacttcaga ggaaacaaat ggtcattgat gtccttcacc ccgggaaggc 241 gacagtgcct aagacagaaa ttcgggaaaa actagccaaa atgtacaaga ccacaccgga 301 tgtcatcttt gtatttggat tcagaactca ttttggtggt ggcaagacaa ctggctttgg 361 catgatttat gattccctgg attatgcaaa gaaaaatgaa cccaaacata gacttgcaag 421 acatggcctg tatgagaaga aaaagacctc aagaaagcaa cgaaaggaac gcaagaacag 481 aatgaagaaa gtcaggggga ctgcaaaggc caatgttggt gctggcaaaa agccgaagga 541 gtaaaggtgc tgcaatgatg ttagctgtgg ccactgtgga tttttcgcaa gaacattaat 601 aaactaaaaa cttcatgtgt aaaaaaaaaa aaaaaaaaaa accccccccc ccccccccc //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS MUSMK1A 2222 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse intronless potassium channel gene MK1. ACCESSION M30439 KEYWORDS potassium channel protein. SOURCE Mouse (strain AKR) cell line L47.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites; for [2]) AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A., Ghanshani,S., Tempel,B.L. and Gutman,G.A. TITLE A family of three mouse potassium channel genes with intronless coding regions JOURNAL Science 247, 943-975 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2222) AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A., Ghanshani,S., Tempel,B.L. and Gutman,G.A. JOURNAL Unpublished (1990) Univ. of Calif. Irvine, CA 92717 STANDARD full staff_entry COMMENT Authorin Submission [2] kindly submitted by Gutman,G.A., 12-DEC-1989 MK1, MK2 and MK3 represent three members of a family of mouse genes encoding potassium channel proteins, related to the Drosophila shaker locus. Each mouse protein is encoded by a single, uninterrupted exon, although one (or more) introns may be present in the 5' untranslated region (MK1, MK2). The sequence of MK1 is included within, and identical to, the published sequence of MBK1 cDNA (Tempel, Jan & Jan, Nature 232:837-9, 1988) with which it overlaps, except for two additional G's, one each at MK1 positions 162 and 442. MK1 closely resemble the published rat cDNA sequence RCK1 (Baumann et al., EMBO J. 7:2457, 1988). FEATURES from to/span description pept 584 2071 potassium channel protein IVS < 1 52 intron BASE COUNT 517 a 613 c 598 g 494 t ORIGIN 1 atcagaatgc ttctgatttt ctaccctcgt atcacttttt atttctctgc agcgtccatc 61 gatcgccccg gtgggagctt agaaggcggc aggcgaagag gggtaggagg gggggagagc 121 caaggagcag caaagtgggt gacaggcctg gggagctgct ggagttggca ctgcaccgtg 181 tcctagaagg gctgcaggag gggaggggag ccggggcgag ccccggaagc gatggcaggt 241 gtgcattaag gtgatcagag caggaagaaa aacccattca ccaacacaaa aacagagcac 301 ccccccccaa aaaatacccc aagagtagca aaaaagcttc actgggggcc ctcaggcgag 361 aggggtttcc aaattgcgta aaaggcagag tggaggaggg gaggtgataa ctagcaaagt 421 tgcagacttc tgaaccttct ggctctgaag cccctccctg tgagcgtggg gaagactcac 481 tttccggtgg gggggccgct tgggtccccc ccacccctac tccctcgctc cttcacaacc 541 cgggctctct cctggcctcc tacccctgca ccctgctttc atcatgacgg tgatgtcggg 601 ggagaatgcg gacgaggctt cgaccgctcc aggtcacccc caggatggca gctacccgag 661 gcaggcggac cacgacgacc acgaatgctg cgagcgcgta gtaatcaaca tctccgggct 721 gcgcttcgaa acgcagctca agactctggc acagttcccc aacacgctgc tgggcaaccc 781 gaagaaacgc atgcgctact ttgaccccct gaggaacgag tacttctttg accgcaaccg 841 gcccagcttc gatgccatcc tttattacta ccagtccggg ggccgcctgc gcaggccggt 901 caacgtgccc ctggacatgt tctccgagga gattaaattt tacgagttgg gcgaggaagc 961 catggagaag ttccgggaag atgagggctt catcaaggaa gaggagcgcc ccctacccga 1021 gaaggagtac cagcgccagg tgtggctgct ctttgagtat ccggagagct caggacctgc 1081 ccgggttatt gccattgtgt cggtcatggt catcctcatc tccatagtca tcttttgcct 1141 ggagactctc cctgagctga aggacgacaa ggacttcacg ggcaccatcc accgcatcga 1201 caacaccaca gtcatctata cttccaacat cttcacagac cctttcttca ttgtggaaac 1261 cttgtgtatc atctggttct cttttgagct ggtggtgcgc ttcttcgcct gccccagcaa 1321 gacagacttc tttaagaaca tcatgaactt catcgacatt gtggccatca tcccttattt 1381 cattaccctg ggcacggaga tagctgagca ggagggaaat cagaagggcg agcaggccac 1441 ttccctggcc atcctcaggg tcatccgctt ggtaagggtg ttcagaatct tcaaactctc 1501 ccgccactcc aagggccttc agatcctggg ccagaccctc aaagctagta tgagggagtt 1561 agggctgctc atctttttcc tcttcattgg ggtcatactg ttttctagcg cagtgtactt 1621 tgcggaggcg gaagaagctg agtcgcactt ctccagtatc cccgatgctt tctggtgggc 1681 ggtggtgtcc atgaccactg tgggatacgg tgacatgtac cctgtgacaa ttggaggcaa 1741 gatcgtgggc tccttgtgtg ccatcgctgg tgtgctgaca attgccctgc ccgtacctgt 1801 cattgtgtcc aatttcaact atttctacca ccgagaaact gagggggaag agcaggctca 1861 gttgctccat gttagttctc ctaacttagc ctctgacagt gacctcagcc gccgcagctc 1921 ctctactatc agcaagtctg agtacatgga gatcgaagag gatatgaaca atagcatagc 1981 ccattacaga caggctaata tcagaactgg taactgcacc acagctgatc aaaactgcgt 2041 taataagagc aagctcctga ccgatgttta aaaagcaaca ggcaagccaa caaaagcccc 2101 aaacaaaccc cttggcgact cctgtcccgc tctgtagata ctttactaag cagtagtctt 2161 tgaatgcttt attttgctgg caatgcattg ttgcattgtg aatttgggga gtggcgaacc 2221 tg // LOCUS MUSMK2A 3383 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse intronless potassium channel gene MK2. ACCESSION M30440 KEYWORDS potassium channel protein. SOURCE Mouse (strain AKR) cell line L47.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites; for [2]) AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A., Ghanshani,S., Tempel,B.L. and Gutman,G.A. TITLE A family of three mouse potassium channel genes with intronless coding regions JOURNAL Science 247, 943-975 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3383) AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A., Ghanshani,S., Tempel,B.L. and Gutman,G.A. JOURNAL Unpublished (1990) Univ. of Calif. Irvine, CA 92717 STANDARD full staff_entry COMMENT Authorin Submission [2] kindly submitted by Gutman,G.A., 05-DEC-1989 MK1, MK2 and MK3 represent three members of a family of mouse genes encoding potassium channel proteins, related to the Drosophila Shaker locus. Each mouse protein is encoded by a single, uninterrupted exon, although one (or more) introns may be present in the 5' untranslated region (MK1, MK2). MK2 closely resembles the published rat cDNA sequence RBK2 (McKinnon, J. Biol. Chem. 264:8230, 1989), and the deduced amino acid sequence of the rat protein RCK5 (Stuhmer et al., EMBO J. 8:3235, 1989). FEATURES from to/span description pept 1118 2617 potassium channel protein IVS 527 959 intron /note="putative intron by comparison with RBK2 cDNAs" BASE COUNT 818 a 881 c 791 g 881 t 12 others ORIGIN 1 cctgcaggca tcgaagctta tggttatcca ggagaggtta ggaaaaggct cctgtggtgg 61 atctcattca caagtgtcat ggaaagtaga tgtgggccnn nntnnacttc ttgccctgag 121 gggatctccc agagccctta aaagtcttng tctcagctgc ctggaancct tggtcccacc 181 ttgctgctgt gttttccagg caacgtcaca cctccggagg acagccagga ggactcaggc 241 ttttgctgag ctgtgcatct tgcctccttc cttccaaagg cttcagaccc aagctccact 301 ctcctcctca agtcgtgcgc tgtgcccctg gctaagcatc tcactgtccc tcgagcccca 361 tccagtccta gagcacacct ccaggacctg ccctcctgct cagaatgact caccattatt 421 tccagctcaa gtgagaagat gtgacgggga ctgagctgcc tatttgtgta tctgtgccat 481 cagatctgct cttcgaagaa gagaacctca gctcctgcct ggccaggtaa atcaaacaat 541 cacttagtag cagtcatata gaggcaatgg gggtggcatc agtgaccaag gnncaggtcc 601 caaacaccct accatgaata cntcctttgg tgcaggtatt gccaaaacat tagcacagtg 661 cctggcagag ggctcccttc tccactcctc ctgtgcagct atgtccccag catgtgtgtg 721 gtacccaaga tcctgtctca gccncctgcc cctagcgcag gccattctat agcttccttg 781 aaagattgtt tgttccctga accctactga cccctagaca agtgacaagg tgataatagt 841 tttagtggca gaagtgatgg tacagtcatg tttctgaact cttggggaca tcaagtgcac 901 ctcctgtcag gttgccaatg actggtgatc tcatgtcctt acctgcctct gtctttcagc 961 cgagtggggc tgtgtctcca actcacaaag ggctctgggg ctttcctgca gaagctcaga 1021 cttatgcttc tgtgtttcct gggccccagt tatcagcacc gagttccctg ccgcagctgg 1081 aaggcttaaa gtaaagcaca cttctcaagc cccaattatg acagtggcta ccggagaccc 1141 agtggacgag gctgctgccc tccctgggca cccacaagac acctatgacc cagaggcaga 1201 ccatgaatgc tgtgagagag tggtcatcaa catctcaggc ctgcggttcg aaactcagct 1261 aaagacctta gcccagttcc cagagaccct cttaggggac cccaagaaac ggatgaggta 1321 cttcgatccc ctccgaaatg agtacttttt tgatcgcaac cgccctagct ttgatgccat 1381 tttgtactac tatcagtctg ggggcaggtt gaggcgacct gtgaacgtgc ccttagatat 1441 cttctcggaa gaaatccggt tttatgagct aggagaagaa gcaatggaga tgtttcggga 1501 ggatgaaggc tacatcaagg aagaagagcg tcctctgcct gaaaatgagt ttcagagaca 1561 ggtgtggctt ctctttgaat accctgagag ctcagggcct gccaggatta tagccattgt 1621 atctgtgatg gtcattctga tctccatcgt cagcttctgt ctggaaacct tgcccatctt 1681 ccgggatgag aatgaggaca tgcatggtgg cggggtgacc ttccacacct attccaacag 1741 caccatcggg taccagcagt ccacctcctt caccgaccct ttcttcattg tagagactct 1801 ctgcatcatc tggttctcct ttgagtttct ggttagattc tttgcctgtc ccagcaaagc 1861 tggcttcttc accaacatca tgaacatcat tgacattgtg gctatcatcc cttactttat 1921 caccctgggg acagagttag ctgagaagcc agaggatgcc cagcaaggcc agcaggccat 1981 gtcactggcc attctccgtg tcatccggtt ggtaagagtc tttaggattt tcaagttgtc 2041 cagacactcc aaaggtctac agattctagg tcagaccctc aaagctagca tgagggaatt 2101 gggcctcctg atattcttcc tcttcattgg ggtcatcctc ttctctagtg ctgtctattt 2161 tgcagaagct gatgagagag attcccagtt ccccagcatc ccggatgctt tctggtgggc 2221 agtcgtctcc atgacaactg taggctatgg agacatggtt ccaactacca ttggggggaa 2281 gatagtgggt tctctgtgtg caattgcagg tgtgttaacc attgccttac cagtccctgt 2341 catagtgtct aatttcaact acttctacca ccgggagaca gagggagagg agcaggccca 2401 gtacttgcaa gtgacaagct gtccaaagat cccgtcctcc cctgacctaa agaaaagtag 2461 aagtgcctct accataagta agtctgatta catggagata caggagggag ttaacaacag 2521 caatgaggac tttagagagg agaacttaaa aacagccaac tgtaccttgg ctaacacaaa 2581 ctatgtgaat attaccaaaa tgttaactga tgtctgattg aagcctacta atgtactcac 2641 agctcaacag gactgatgca gatgttgcat aatagcctgc attgtagtca gtgttctaca 2701 gtgttctaca gcatctggtt ctgcatggaa agcaatagtt cgtgcaagtg acttttgatc 2761 ttttgatttt tgatttagaa cacagaatat ttatcatggc tttcatgaaa tcttcatcac 2821 cgacttacag gtttccaaag aggagagtca cccatggagc cagcatctca gaaaggcaca 2881 atgcagcccc ctcacaccca cacacaacct accatagcaa cactatccta tcttcctacc 2941 taaacatgag cacacctggg aggtgcattc cctccacccc acccctacca ccctatttca 3001 gcccatctgc cccttcacag aggaacatac cattgtggct tagttctaaa gtctggtgac 3061 tactcaaagg tcattctcgg ttttgtgttg agaagaacac acagtcctgt gtgttggaac 3121 tactttctgt gtcacaggct ggggtttgtg aattgcagtt gccaactaga tgctccggag 3181 gcttatgttt cgtaacggaa aaatgctgca ttctgctttt tctctgcagt gtcgatgtga 3241 gggaagccca ggggggtggg ggagggacag ttaatatgac caaaatatga attgccaagt 3301 ttcacatttg ttcccttagg cctatgggga gaagctacct aattcaggga cttctcagct 3361 tcagatacca ttttgcaagc ttg // LOCUS MUSMK3A 1994 bp ds-DNA ROD 15-MAR-1990 DEFINITION Mouse intronless potassium channel gene MK3. ACCESSION M30441 KEYWORDS potassium channel protein. SOURCE Mouse (strain AKR) cell line L47.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites; for [2]) AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A., Ghanshani,S., Tempel,B. and Gutman,G.A. TITLE A family of three mouse potassium channel genes with intronless coding regions JOURNAL Science 247, 943-975 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1994) AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A., Ghanshani,S., Tempel,B.L. and Gutman,G.A. JOURNAL Unpublished (1990) Univ. of Calif. Irvine, CA 92717 STANDARD full staff_entry COMMENT Authorin Submission [2] kindly submitted by Gutman,G.A., 05-DEC-1989 MK1, MK2 and MK3 represent three members of a family of mouse genes encoding potassium channel proteins, related to the Drosophila shaker locus. Each mouse protein is encoded by a single, uninterrupted exon, although one (or more) introns may be present in the 5' untranslated region (MK1, MK2). MK3 closely resembles the published rat cDNA sequence RCK3 (Stuhmer et al., EMBO J. 8:3235, 1989). FEATURES from to/span description pept 150 1736 potassium channel protein BASE COUNT 394 a 612 c 534 g 454 t ORIGIN 1 agccgccgct agggaaggaa agcaccgccg cctcccgcgc tcgaccgccg cagccctcca 61 cccatcaccg cgcccaccct gcaccggacc ccgcaggagg cggcgcgcgc atcctgcaga 121 gccccggcca cgccgagctg ccgccagaca tgaccgtggt gcccggggac cacctgctgg 181 agccagaggc ggcgggaggc ggtggcgggg acccgcctca gggaggctgt ggcagtggcg 241 gcggcggtgg cggctgcgac cgctacgagc cactgccacc cgcgctgccc gccgcgggcg 301 agcaagattg ctgcggcgag cgtgtggtca tcaacatctc cgggctgcgc ttcgagacgc 361 agctcaagac cctctgccag ttccccgaga cactgctggg cgaccccaag cggcgcatgc 421 ggtactttga cccactccgc aatgagtact tcttcgaccg caaccgaccc agcttcgacg 481 ccatcctcta ctactaccag tccgggggcc gcattcgccg gccggtcaac gtgcccatcg 541 acatcttctc cgaggagatc cgcttttacc agctgggtga ggaggccatg gaaaagttcc 601 gtgaggatga gggcttcctg cgggaggagg agcgacccct gccccgccgt gacttccagc 661 gccaggtgtg gctgctcttc gaatatccgg agagctccgg gccggcccgg ggcattgcca 721 ttgtgtcagt gctggtcatt ctcatctcca ttgtcatctt ctgcttggag acgcttcccg 781 agtttcgcga tgagaaagac tatcccgcct ccccgtcgca ggacgtgttt gaggctgcca 841 acaacagcac gtcgggggcc ccttctggag cctccagctt ctcggacccc ttcttcgtgg 901 tggagacctt gtgcatcatc tggttctcct ttgagcttct ggtgcggttc tttgcttgcc 961 ccagtaaagc caccttctcc agaaatatca tgaacttgat agacattgtg gccatcattc 1021 cttattttat cactctgggc actgagctgg ctgaacgaca aggtaatggg cagcaggcca 1081 tgtcgctggc catcctaaga gtcatccgcc tagtaagggt tttccgcatc ttcaagctct 1141 cccgccattc taaggggctg cagatcctag gacagacgct gaaggcttcc atgcgggagc 1201 tggggctgct catattcttc ctcttcattg gggtcatcct tttctccagt gcagcttact 1261 ttgctgaggc agacgaccct tcttcgggtt ttaacagtat cccggatgcc ttctggtggg 1321 cagtagtaac catgacaact gttggttatg gtgatatgca cccagtgacc ataggaggca 1381 agattgtggg ctctctttgt gccatcgcag gtgtcttgac cattgcattg ccagttcctg 1441 tgattgtttc caacttcaac tacttctacc accgggagac agaaggggaa gagcaagccc 1501 agtacatgca cgtgggcagt tgccagcacc tctcctcttc agccgaggag ctccgaaaag 1561 cccggagtaa ctccactctg agtaagtcgg agtatatggt gatcgaagag gggggtatga 1621 accagagcgc cttcccgcag acccccttca aaacgggcaa ctccacagcc acttgcacca 1681 cgaacaataa ccccaactcc tgtgtcaaca tcaagaagat attcactgat gtctaatata 1741 tgatacggtt gccaattctg tgcccagtat tgtgtggaac atgccccctt ggtctgtgta 1801 tgcccttgat ttatacattt ccagaccact catcaaggaa agtacaagaa gtgaggaagc 1861 acacttcatt ctccctattg cttcatactg aaacaggtgc ctgtttttgc aagtgggctg 1921 cattctctca gctctttttt tctctctctc cctgtctctt aattttgtga ccaacaaact 1981 tacattaagc gtgg //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS YSCMTSPR1A 81 bp ds-DNA ORG 21-FEB-1990 DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel junction. ACCESSION M31839 KEYWORDS . SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 81) AUTHORS Clark-Walker,G.D. TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990) STANDARD full staff_review FEATURES from to/span description rpt 21 60 40 bp direct repeat BASE COUNT 30 a 13 c 25 g 13 t ORIGIN 1 gaaaccggga cctcggagac gtaatagggg gagggggtgg gtgataataa ccagaatatt 61 caataaatac agagcacaca t // LOCUS YSCMTSPR1B 81 bp ds-DNA ORG 21-FEB-1990 DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel junction. ACCESSION M31840 KEYWORDS . SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 81) AUTHORS Clark-Walker,G.D. TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990) STANDARD full staff_review FEATURES from to/span description rpt 21 60 40 bp direct repeat BASE COUNT 35 a 7 c 18 g 21 t ORIGIN 1 tatatatttc aaatatataa gtaatagggg gagggggtgg gtgataataa ccagaatatt 61 aaataaatac agagcacaca t // LOCUS YSCMTSPR1C 80 bp ds-DNA ORG 21-FEB-1990 DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel junction. ACCESSION M31841 KEYWORDS . SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 80) AUTHORS Clark-Walker,G.D. TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990) STANDARD full staff_review FEATURES from to/span description rpt 16 66 51 bp direct repeat BASE COUNT 25 a 15 c 17 g 23 t ORIGIN 1 atttattata atatttatat taatagttcc ggggcccggc cacgggagcc ggaaccccga 61 aaggagtatt attaaacatt // LOCUS YSCMTSPR1D 80 bp ds-DNA ORG 21-FEB-1990 DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel junction. ACCESSION M31842 KEYWORDS . SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 80) AUTHORS Clark-Walker,G.D. TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990) STANDARD full staff_review FEATURES from to/span description rpt 16 66 51 bp direct repeat BASE COUNT 28 a 15 c 19 g 18 t ORIGIN 1 ttataattca atttatatat taatagttcc ggggcccggc cacgggagcc ggaaccccga 61 aaggaggaat aagataaata // LOCUS YSCMTSPR1E 80 bp ds-DNA ORG 21-FEB-1990 DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel junction. ACCESSION M31843 KEYWORDS . SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 80) AUTHORS Clark-Walker,G.D. TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990) STANDARD full staff_review FEATURES from to/span description rpt 17 65 49 bp direct repeat BASE COUNT 29 a 14 c 17 g 20 t ORIGIN 1 tattattaaa tattatatag ttccggggcc cggccacggg agccggaacc ccgaaaggag 61 aaatattaat atattataaa // LOCUS YSCMTSPR1F 80 bp ds-DNA ORG 21-FEB-1990 DEFINITION S.cerevisiae mitochondrial DNA involved in the formation of a novel junction. ACCESSION M31844 KEYWORDS . SOURCE S.cerevisiae (strain hfp 15.19S) mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 80) AUTHORS Clark-Walker,G.D. TITLE In vivo rearrangement of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8847-8851 (1990) STANDARD full staff_review FEATURES from to/span description rpt 17 65 49 bp direct repeat BASE COUNT 32 a 14 c 17 g 17 t ORIGIN 1 atattattta ttattaatag ttccggggcc cggccacggg agccggaacc ccgaaaggag 61 aaataaataa aataaaaaat //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMBAT2A 6704 bp ss-mRNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 2 (BAT2) mRNA, complete cds. ACCESSION M33509 M31293 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SOURCE Human T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 6704) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept 102 6530 HLA-B-associated transcript 2 (BAT2) mRNA 1 6704 BAT2 mRNA signal 6692 6697 poly-A signal BASE COUNT 1435 a 2224 c 1897 g 1148 t ORIGIN Chromosome 6p21.3. 1 cctaggcccg ggtcccggat ccccgcgcac ccggccaggc tctggcacgt tttgggggag 61 gtgcctgcag gacccaacat actcaatgag cttccagcgc aatgtccgat cgctcggggc 121 cgactgccaa gggaaaggat ggaaagaagt attcctcgct caacctgttt gatacgtata 181 agggcaagtc cttagagatc cagaaacccg cctgttgccc ctcgccatgg cctgcagagt 241 ctcgggaaag ttgccattgc ccggcgtatc gacctccagc caaccttcca agcctgaaag 301 ccgagaacaa aggcaatgac cccaatgtct cactagtgcc aaaagacgga acaggatggg 361 caagcaaaca ggagcagtcc gaccccaaga gttccgatgc ctcaaccgct cagccgccgg 421 aatcgcagcc actgccggct tcacagacgc ctgcctccaa ccagccgaaa cgacccccag 481 cagcccccga gaacactcct ttggttccaa gcggggtaaa gtcctgggca caagccagcg 541 tcacccatgg agcacatgga gatggtggaa gggcatcaag cctactgtca cgattctctc 601 gagaggaatt tccgaccctg caggcggctg gcgaccagga caaggctgcc aaggaaaggg 661 agtctgccga acagtcgtct gggcccggac caagcctccg cccccaaaat tctacaactt 721 ggagggacgg aggtgggcgt ggccctgatg agctggaggg cccggactcc aaacttcatc 781 atggtcatga tccccggggt gggctacagc cttcaggccc accccagttc cctccctacc 841 gcggaatgat gccgcctttc atgtatcccc catatctccc gttccctccg ccctatggac 901 cccaggggcc ttaccgatac cccactcctg atgggcccag ccgttttccc cgtgtggcgg 961 gcccccgagg ctcagggcca ccaatgcgct tagtagagcc tgtgggtcgt ccctctattc 1021 tcaaagagga taatctcaaa gagtttgatc agttggatca ggagaatgat gatggttggg 1081 caggggccca tgaagaggtt gactacactg aaaagctcaa gttcagcgat gaggaagatg 1141 ggcgagactc tgatgaggag ggagctgagg gccacaggga ttcccaatca gcttctggtg 1201 aggaacggcc ccctgaagca gatggcaaaa agggcaactc ccccaacagc gaaccgccca 1261 ctcctaagac ggcctgggca gaaacctctc ggcctccaga gacagagccg ggacctcctg 1321 ccccaaagcc tcccctaccc cctggggact acccagatcg tgggggtcct ccctgcaagc 1381 ccccagcacc tgaagatgag gatgaggcat ggcggcagcg acgaaagcag tcgtcatctg 1441 agatttccct ggcagtggag cgggcccggc gacggcgaga agaagaggag cggcgcatgc 1501 aagaagagcg ccgggcagcc tgtgctgaga agctcaagcg actcgatgaa aagtttgggg 1561 cacctgacaa gcggctcaaa gcagagcctg ctgccccacc tgctgcccct tctaccccag 1621 ccccaccacc tgcagtccct aaagaactcc ctgcacctcc agctccacct ccagcatcag 1681 ccccaacacc agagacagaa cctgaagagc cagcacaggc ccctcctgcc caatctactc 1741 ctactccagg tgtggctgcg gctcccactc tggtgagtgg tggtggcagt accagtagca 1801 ccagcagtgg cagcttcgaa gccagcccag tggaaccaca actgccctca aaagagggtc 1861 ctgaaccacc agaagaggtt cctcctccta ccacaccccc agttccaaag gtggaaccca 1921 agggtgatgg gattggtccc acccgccagc cccctagtca gggcttgggc taccccaaat 1981 atcagaagtc gttgcctcct cgtttccagc ggcagcagca ggagcagctc ctgaagcagc 2041 agcagcagca ccagtggcag cagcatcaac agggctctgc ccctcctacc ccagtgcccc 2101 catcaccacc acagcctgtg accctggggg ctgtgccagc tccacaggct ccacccccgc 2161 cccccaaggc cctgtaccca ggtgctctgg gccggccccc acccatgccc ccaatgaact 2221 ttgatccccg atggatgatg attcctcctt atgtggaccc ccggctcctc cagggtcgtc 2281 cccctctaga gttctaccct cctggtgtgc atccctctgg cctagttccc cgagagcgtt 2341 cagacagtct ggggctcagc tcagagccat ttgaccgtca tgcacctgct atgttacggg 2401 aacggggcac tccaccggtg gatccaaagt tggcctgggt aggagatgtc ttcaccgcca 2461 cacccgctga accccgccca cttacctcac ctctgcgcca ggctgcggat gaggatgaca 2521 aggggatgag gagcgagact cctccagtac ctcccccacc accctatctg gccagttatc 2581 caggctttcc tgagaatgga gcccctgggc ccccaatctc tcgctttcct ctggaggaac 2641 cagggccccg tccactcccc tggcccccag gcagtgatga agtggccaag atacaaactc 2701 caccacccaa gaaggagccc cctaaggagg agactgcaca gctgacgggg ccagaagcag 2761 gccgaaagct gcccgcgagt cggagtggag caggcccccc accaccacgc agagagagtc 2821 gcacagagac ccgctggggc cctcgtccag ggagcagtcg tcgtggaatc cctccagagg 2881 agccaggggc cccaccccgc cgggctgggc ctataaagaa acctccacca cctacaaaag 2941 tagaagagct gcctcccaag cccctcgaac agggggatga aacccccaaa cccccaaagc 3001 cagacccact caagataacc aaggggaagc tagggggccc caaggagacc ccacccaatg 3061 gaaatctttc ccctgcccca aggcttcgga gggactattc gtatgaaaga gtgggtccta 3121 cctcttgccg gggtcggggc cgaggcgagt attttgccag agggaggggt tttcggggga 3181 cctatggggg acgagggcgg ggaggccaag cgaattccgc agttaccgag agtttcgagg 3241 agatgatggg cgtggaggtg ggacaggggg accaaaccac cctcctgctc cccgaggccg 3301 ccatgccagc gagacacgga gcgagggttc agagtatgag gaaatcccca agcggtgccg 3361 gcagcggggc tcagaaacag gcagcgagac ccatgagagt gatctggctc cttcagacaa 3421 ggaggctccc acacccaagg agggaacact cacccaggtc ctctcgctcc cccaccacca 3481 ggagccccac ccttcaccga gcgccagccc gcttcacgtg cccgggggtc ggcgagtctt 3541 cactcccaga gggtgccatc tcgccggggc cgaggaggag ggaggcccct cctcaagttt 3601 gcccaggctg gagccctcca gccaagtctc tggctcccaa gaaacctccc acaggccctt 3661 tgccaccaag taaggagcct ttgaaagaga agttgatccc agggcctctg tcccctgtgg 3721 cgcgcggagg cagcaatgga ggtagcaatg tgggcatgga agatggggag cgaccccgaa 3781 ggaggcgaca tgggagggct cagcagcagg ataaaccgcc tcgtttccgg aggctgaagc 3841 aggaacggga gaatgccgca agggggtctg agggcaagcc ctccctaacc cttccagcct 3901 ccgctcctgg acctgaggag gccctcacaa cagtcacagt ggccccagca cctccgcggg 3961 cagctgccaa gtctcctgat ctgtcaaacc agaactcaga ccaagccaat gaggaatggg 4021 agactgcatc agagagcagt gacttcacca gtgagcgccg aggggacaaa gaggcacccc 4081 caccagtact gctgacaccc aaggctgtgg gaactcctgg gggaggtgga ggtggagccg 4141 taccaggtat ttcagccatg tcccgcggag atctgagcca gagagccaag gatttgagta 4201 aacggagctt ctcaagtcag cggccaggca tggaacggca gaatcggcgc cctggcccag 4261 ggggcaaggc tggcagcagt ggcagcagca gtggaggagg cggtgggggt cctggaggaa 4321 ggaccgggcc aggacgaggc gacaagagga gctggccctc tcccaagaac cgaagtcgtc 4381 ctccagagga gcgtcccccg gggcttcccc tgcctccccc acctcccagc agttctgctg 4441 tcttccgcct ggaccaagtt atccacagca accctgctgg catccaacag gctctggccc 4501 agcttagtag ccgtcaaggg agtgtaactg caccaggggg tcatccaagg cacaagcctg 4561 ggcctcccca agcccctcag ggcccctctc ctaggccccc aacccgatac gagccccaga 4621 gggtcaacag cggcctcagt tctgaccccc actttgagga gccggggcca atggtgagag 4681 gggtgggtgg gactcctcgg gactctgccg gggttagtcc ctttccccct aaacgtcggg 4741 agcggcctcc cagaaaacca gagctgctac aggaggaatc tttgccacct cctcatagct 4801 ctggattctt gggctctaag cctgagggcc caggccctca ggcagagtcc agagatacag 4861 gcacagaggc cctgacccct cacatctgga accgtttaca tactgccact agccgaaaga 4921 gttaccggcc cacgtccatg gagccttgga tggagcccct gagtcctttt gaggatgtgg 4981 ctggcacaga aatgagtcag tctgacagtg gggtggacct gagtggggat tctcaggtgt 5041 catcaggtcc ctgcagccag cgaagttccc ctgatggagg actcaagggg gcagcagagg 5101 gaccccccaa gaggcctgga ggctcctcac ccctgaatgc tgttccttgt gagggtccac 5161 ctggctctga acctcctagg agaccaccac ctgcccccca cgatggggac agaaaggagc 5221 tgccccggga gcagcctctg ccccctggcc ccattggcac agaacgatca cagcgtacag 5281 accgaggcac agagcctggc cccattcggc catcccatcg acctggtccc ccagtccagt 5341 ttggcactag tgacaaggac tcagacttac gcctagtggt aggagacagc ttgaaagcag 5401 agaaggagct aacagcatca gtcactgagg ccattcctgt atcacgagac tgggagctgc 5461 ttcccagtgc tgctgcctct gctgagccac aatccaagaa cctggattct gggcactgtg 5521 tcccggagcc cagctcctca ggccagcgcc tgtatcctga ggttttctat ggcagtgctg 5581 ggccttccag ttctcagatc tctgggggga gccatggact ctcaattaca tccaaacagt 5641 ggaggcttcg ccctgggaca ccctcactgc acccttacag atcacagccc ctatacctac 5701 ccccgggccc agcccctccc tcagcactgc tctctggggt agctctcaag ggccagtttc 5761 tggatttctc cacaatgcaa gctacagagc tggggaagtt gccggctgga ggagttctct 5821 accctccacc ttccttcctc tactctccgg ctttctgccc cagtcctttg cctgacacat 5881 cgttgcttca ggtacgccag gatctgccat ccccttcgga tttttattct actcctctgc 5941 agcctggtgg ccaaagtggc tttctccctt caggggctcc tgcccagcag atgcttctac 6001 ccatggtaga ctcacagctg cctgtggtga actttggctc cctgccgcca gcaccacctc 6061 ctgccccacc tcccctttct ctgttacctg tgggccctgc tctgcagccc cccagcctgg 6121 ctgtgcggcc cccacctgct cctgctactc gggtgctgcc ttcacctgcc aggcccttcc 6181 ccgctagctt ggggcgagca gagctgcatc cagtggaact aaagccgttc caggattatc 6241 aaaaactgag cagcaacctt gggggacctg gatcatcacg gactccccca actggaaggt 6301 ccttctctgg cctcaattcc cgtctcaagg ccacgccttc cacctacagt ggagtcttcc 6361 gcacccagcg cgtcgacctt taccagcagg cctccccacc agatgccctg cgctggatac 6421 ctaagccttg ggagcggaca gggccgccac ctcgagaagg gccctcccga cgggcagagg 6481 agcctgggtc ccgaggggac aaggagcctg ggttgccccc accccgctga gggagttcct 6541 cttgccccct acccccgggg cttgtatata gattataaat atataagggg gaaaggggtg 6601 ggcggggagg ggttgtgggg ctggggcctc acttcccctc ctcccccttc ccctggtccc 6661 ctgtccctgg ggctgtttgt taaaaaagag taataaaagg attt // LOCUS HUMBAT2B1 336 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 2 (BAT2) gene, 5' flank. ACCESSION M33510 M31293 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SEGMENT 1 of 4 SOURCE Human T-cell line MANN DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites; for [2]) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 336) AUTHORS Banerji,J. JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. BASE COUNT 108 a 60 c 65 g 101 t 2 others ORIGIN Chromosome 6p21.3. 1 tctagaatcg ggtagtaaga gacaaaggag ggtaacagta ctgcatttca caaaatgaaa 61 cccattgtta agaaattaca aattcccaat aatttcaaat ataaaaattt attcatgaaa 121 attataggtt ataaaattaa atgtccgtct tagtcgatgg ttgcccatat tttgatgaac 181 gagtcattcc tagcctatct ttgttcaaat gatttgcata cttatgcaaa taggtagaac 241 tgcccgaaga atgcctacnt gcgtggtgcg gacgaaacgc ttgccgggsc ctttggattg 301 gtctgtctag ccacctcatt tgcatgacgt aatata // LOCUS HUMBAT2B2 188 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 2 (BAT2) gene, 5' end. ACCESSION M33511 M31293 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SEGMENT 2 of 4 SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 154 to 188) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 188) AUTHORS Banerji,J. JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pre-msg 154 188 BAT2 mRNA BASE COUNT 20 a 64 c 59 g 18 t 27 others ORIGIN About 500 bp after segment 1; chromosome 6p21.3. 1 gtgcthhhng gggcggcggt tccgcggatg ggccgttagt cgggstcagc cgcggagtga 61 gngagggaga cgnnaggasg aacccggcca tccgccgcca tcctcccccg ccccaccgcc 121 atccgtcccg gggacnnnnn nnnnnnnnnn nnncctaggc ccgggtcccg gatccccgcg 181 cacccggc // LOCUS HUMBAT2B3 3090 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 2 (BAT2) gene, exons 2 through 4. ACCESSION M33512 M31293 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SEGMENT 3 of 4 SOURCE Human T-cell line MANN DNA (introns), and T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1460 to 1572; 2396 to 2572; and 2919 to 3090) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 3090) AUTHORS Banerji,J. JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept 1460 1572 HLA-B-associated transcript 2 (BAT2), exon 2 (first expressed exon) 2396 2572 HLA-B-associated transcript 2, exon 3 2919 + 3090 HLA-B-associated transcript 2, exon 4 pre-msg < 1 > 3090 BAT2 mRNA and introns IVS < 1 1459 BAT2 intron A (no splice consensus) IVS 1573 2395 BAT2 intron B IVS 2573 2918 BAT2 intron C BASE COUNT 747 a 709 c 730 g 901 t 3 others ORIGIN About 500 bp after segment 2; chromosome 6p21.3. 1 tctagaatcc tgcttttatc ccagcatctt tgctttctat gttgctcagt cgccctatgt 61 ctgctttttc atttttcctg ttcctcgtct cctttctccc ccaaccccgt ttttcttctt 121 gggcctctgc cccttacttc gttgtctaca tccttttttt ttttgccatt cctgtttcca 181 tatattttcc acctgctttc gtattcatta ttttctgtta gttttggact attcgctaca 241 tgactcttgt attcgttttc ccttcatata tttatcttca cagattggcc tcctcaaaca 301 cctacgaagc aacatccatc ttatgtgtag cttgtcataa agttctttct ccccaatttt 361 agctttcatt ctgggcctgt ctggatttcc ctgctttctt ccccactatt tctcatctct 421 ttacactgtt cccgaccata aacgaatgcc tggtcactct ggaatggact gagagacctg 481 tcgtccggct tgcttaggga gctggaggta tcgagtaaag aaacactggt gatggacatt 541 tttaatcagg ataggaaaac gaagatggct ctgccttggc cctctgtttt ctggcccatg 601 gttacagggt gctaaggtgg ctccataatg ctttttctca gttcttcata tggtaaaaca 661 gtatttcatc tggaggcgat tttttccagg agccaataca ggagcaagtt taccaaaaga 721 tgggatattt caaatacttg aggttcctat agcctgggag tatgtacagc cctagttgtt 781 ctatgaggat ttctctggta ccaaccccca ttccngctga gcaagctcat aaaatcctta 841 aactcccagc ataccttnct gcaaaccttc ccagatggac acgaggctgc tgggctggga 901 gctggggtac agggccctgg gggcatgatt agggagcttg tgtccaataa acagggaatc 961 taaagtgttg tttcttcttc tctgatggaa ttgtatgctt cttttttagt tttctcttag 1021 cttgaatttg tcctgttgta agtctctgaa acgattttgg tggagagaga agagattatt 1081 acttgtaggg aattactctt tngtagacag gcacaaaggg cagagtgttt atactaggag 1141 gatgctggat ttttacttag atttccttgt aacaaaggtc gtctggggcc aaggagggaa 1201 catggcattt gagctatgag ggagctaagt agatcatggt tggactttaa gaagagtggg 1261 cagtttacat agactggagg aaaagacacc agagggactc atatctgagt ccctaatgat 1321 aatgcaatgg agtttttaag tttctgttat ggtctgtaca gggacagaga ctgagacact 1381 tgcgtctggc ccacaggctc tggcacgttt tgggggaggt gcctgcagga cccaacatac 1441 tcaatgagct tccagcgcaa tgtccgatcg ctcggggccg actgccaagg gaaaggatgg 1501 aaagaagtat tcctcgctca acctgtttga tacgtataag ggcaagtcct tagagatcca 1561 gaaacccgcc tggtgagagt cctgcaaaga tgcttctgat ggttgaaaag ctaggcatgc 1621 atggggcata cgttttagag ctctaaagga agtggctgta gtagaaatac caaaagacta 1681 gaggagattt cccaacttac actgggtcct ttaaaggggg tgtgggctct gggtgaacac 1741 cagttatcct cctacaaagg cgtgtctgtg gttccctgtc tttggacacg taagaattgg 1801 aggaaataaa tgtggatttg ggaaactttg aggccagctt gcttcttgca ggctcatgat 1861 caaccaatct cacataaaag tattgaatgt tacatatctc agccttcttg atagggattt 1921 actagatttt tttttttttt tttttttttt ttttttgaga ccaagtttag ctcctgttgc 1981 ccaggctgga gtgcaatggt gtgatcttga cttaccacaa cctccaccgc ctgggtttaa 2041 gcgattatcc tgcctcagcc tcctgagtag ctgggattac aggcatgcac cccggctaat 2101 tttgtgtttt tagtagagac agggtttctc cattttggtc aagctggtct tgaactcctg 2161 acctcaggtg atccgcctcc ctcggcctgc caaagtgctg ggattgcaaa gtgtgagcca 2221 ccacaatcag cgcgatttca gagattatta aggcagggga aggaatccct tctaagagaa 2281 gtttggagga agtaggtaat aaaatattca acatgtataa atgtgtccca ggataggagg 2341 ccatcagatc tcccacatga ggcattttcg accctctctc cgtcttgttc tccagttgcc 2401 cctcgccatg gcctgcagag tctcgggaaa gttgccattg cccggcgtat cgacctccag 2461 ccaaccttcc aagcctgaaa gccgagaaca aaggcaatga ccccaatgtc tcactagtgc 2521 caaaagacgg aacaggatgg gcaagcaaac aggagcagtc cgaccccaag aggtagacag 2581 aggcttgggg gacctagagt gatgggtatt ttaacttgaa cttcagggag cattggggct 2641 tggtttagtc cagccacgtc tgaagagacg aagaggtccc tttcttacct attgcaggtt 2701 ccttgttaaa tgactaagga atggtactaa actttagctt tttgtcttgg agagagagca 2761 tgaaaaaata gacaacaggt acaaggatga caaaattaat ttgtccttat atttgtaaat 2821 ggtagcaatg ggcatgattt cagtcctgag tctccaccag ttggagaagt cagggaggca 2881 tctcaggtgt gaataacctt cccattctgt cccctcagtt ccgatgcctc aaccgctcag 2941 ccgccggaat cgcagccact gccggcttca cagacgcctg cctccaacca gccgaaacga 3001 cccccagcag cccccgagaa cactcctttg gttccaagcg gggtaaagtc ctgggcacaa 3061 gccagcgtca cccatggagc acatggagat // LOCUS HUMBAT2B4 6349 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 2 (BAT2) gene, 3' end. ACCESSION M33518 M31293 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SEGMENT 4 of 4 SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 209 to 6349) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 6349) AUTHORS Banerji,J. JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept + 209 6175 HLA-B-associated transcript 2 (BAT2), exon 5 pre-msg < 1 6349 BAT2 mRNA and introns IVS < 1 208 BAT2 intron D (no splice consensus) signal 6337 6342 poly-A signal BASE COUNT 1334 a 2095 c 1798 g 1121 t 1 others ORIGIN About 370 bp after segment 3; chromosome 6p21.3. 1 agctaatttg tgtgtgttta gtagagatgg gttcacatgt tggcagatgg tctcgatctc 61 ttgacctctg tgatccgccc gcctcagccg gtcccagagt gctgggatta caggcgtgag 121 ccaccgcgcc cagccagagt cttccacttt tatnagcatg tcctcaggaa atgtcttctg 181 tctcctgttc tgcatcccca tcctaatagg tggaagggca tcaagcctac tgtcacgatt 241 ctctcgagag gaatttccga ccctgcaggc ggctggcgac caggacaagg ctgccaagga 301 aagggagtct gccgaacagt cgtctgggcc cggaccaagc ctccgccccc aaaattctac 361 aacttggagg gacggaggtg ggcgtggccc tgatgagctg gagggcccgg actccaaact 421 tcatcatggt catgatcccc ggggtgggct acagccttca ggcccacccc agttccctcc 481 ctaccgcgga atgatgccgc ctttcatgta tcccccatat ctcccgttcc ctccgcccta 541 tggaccccag gggccttacc gataccccac tcctgatggg cccagccgtt ttccccgtgt 601 ggcgggcccc cgaggctcag ggccaccaat gcgcttagta gagcctgtgg gtcgtccctc 661 tattctcaaa gaggataatc tcaaagagtt tgatcagttg gatcaggaga atgatgatgg 721 ttgggcaggg gcccatgaag aggttgacta cactgaaaag ctcaagttca gcgatgagga 781 agatgggcga gactctgatg aggagggagc tgagggccac agggattccc aatcagcttc 841 tggtgaggaa cggccccctg aagcagatgg caaaaagggc aactccccca acagcgaacc 901 gcccactcct aagacggcct gggcagaaac ctctcggcct ccagagacag agccgggacc 961 tcctgcccca aagcctcccc taccccctgg ggactaccca gatcgtgggg gtcctccctg 1021 caagccccca gcacctgaag atgaggatga ggcatggcgg cagcgacgaa agcagtcgtc 1081 atctgagatt tccctggcag tggagcgggc ccggcgacgg cgagaagaag aggagcggcg 1141 catgcaagaa gagcgccggg cagcctgtgc tgagaagctc aagcgactcg atgaaaagtt 1201 tggggcacct gacaagcggc tcaaagcaga gcctgctgcc ccacctgctg ccccttctac 1261 cccagcccca ccacctgcag tccctaaaga actccctgca cctccagctc cacctccagc 1321 atcagcccca acaccagaga cagaacctga agagccagca caggcccctc ctgcccaatc 1381 tactcctact ccaggtgtgg ctgcggctcc cactctggtg agtggtggtg gcagtaccag 1441 tagcaccagc agtggcagct tcgaagccag cccagtggaa ccacaactgc cctcaaaaga 1501 gggtcctgaa ccaccagaag aggttcctcc tcctaccaca cccccagttc caaaggtgga 1561 acccaagggt gatgggattg gtcccacccg ccagccccct agtcagggct tgggctaccc 1621 caaatatcag aagtcgttgc ctcctcgttt ccagcggcag cagcaggagc agctcctgaa 1681 gcagcagcag cagcaccagt ggcagcagca tcaacagggc tctgcccctc ctaccccagt 1741 gcccccatca ccaccacagc ctgtgaccct gggggctgtg ccagctccac aggctccacc 1801 cccgcccccc aaggccctgt acccaggtgc tctgggccgg cccccaccca tgcccccaat 1861 gaactttgat ccccgatgga tgatgattcc tccttatgtg gacccccggc tcctccaggg 1921 tcgtccccct ctagagttct accctcctgg tgtgcatccc tctggcctag ttccccgaga 1981 gcgttcagac agtctggggc tcagctcaga gccatttgac cgtcatgcac ctgctatgtt 2041 acgggaacgg ggcactccac cggtggatcc aaagttggcc tgggtaggag atgtcttcac 2101 cgccacaccc gctgaacccc gcccacttac ctcacctctg cgccaggctg cggatgagga 2161 tgacaagggg atgaggagcg agactcctcc agtacctccc ccaccaccct atctggccag 2221 ttatccaggc tttcctgaga atggagcccc tgggccccca atctctcgct ttcctctgga 2281 ggaaccaggg ccccgtccac tcccctggcc cccaggcagt gatgaagtgg ccaagataca 2341 aactccacca cccaagaagg agccccctaa ggaggagact gcacagctga cggggccaga 2401 agcaggccga aagctgcccg cgagtcggag tggagcaggc cccccaccac cacgcagaga 2461 gagtcgcaca gagacccgct ggggccctcg tccagggagc agtcgtcgtg gaatccctcc 2521 agaggagcca ggggccccac cccgccgggc tgggcctata aagaaacctc caccacctac 2581 aaaagtagaa gagctgcctc ccaagcccct cgaacagggg gatgaaaccc ccaaaccccc 2641 aaagccagac ccactcaaga taaccaaggg gaagctaggg ggccccaagg agaccccacc 2701 caatggaaat ctttcccctg ccccaaggct tcggagggac tattcgtatg aaagagtggg 2761 tcctacctct tgccggggtc ggggccgagg cgagtatttt gccagaggga ggggttttcg 2821 ggggacctat gggggacgag ggcggggagg ccaagcgaat tccgcagtta ccgagagttt 2881 cgaggagatg atgggcgtgg aggtgggaca gggggaccaa accaccctcc tgctccccga 2941 ggccgccatg ccagcgagac acggagcgag ggttcagagt atgaggaaat ccccaagcgg 3001 tgccggcagc ggggctcaga aacaggcagc gagacccatg agagtgatct ggctccttca 3061 gacaaggagg ctcccacacc caaggaggga acactcaccc aggtcctctc gctcccccac 3121 caccaggagc cccacccttc accgagcgcc agcccgcttc acgtgcccgg gggtcggcga 3181 gtcttcactc ccagagggtg ccatctcgcc ggggccgagg aggagggagg cccctcctca 3241 agtttgccca ggctggagcc ctccagccaa gtctctggct cccaagaaac ctcccacagg 3301 ccctttgcca ccaagtaagg agcctttgaa agagaagttg atcccagggc ctctgtcccc 3361 tgtggcgcgc ggaggcagca atggaggtag caatgtgggc atggaagatg gggagcgacc 3421 ccgaaggagg cgacatggga gggctcagca gcaggataaa ccgcctcgtt tccggaggct 3481 gaagcaggaa cgggagaatg ccgcaagggg gtctgagggc aagccctccc taacccttcc 3541 agcctccgct cctggacctg aggaggccct cacaacagtc acagtggccc cagcacctcc 3601 gcgggcagct gccaagtctc ctgatctgtc aaaccagaac tcagaccaag ccaatgagga 3661 atgggagact gcatcagaga gcagtgactt caccagtgag cgccgagggg acaaagaggc 3721 acccccacca gtactgctga cacccaaggc tgtgggaact cctgggggag gtggaggtgg 3781 agccgtacca ggtatttcag ccatgtcccg cggagatctg agccagagag ccaaggattt 3841 gagtaaacgg agcttctcaa gtcagcggcc aggcatggaa cggcagaatc ggcgccctgg 3901 cccagggggc aaggctggca gcagtggcag cagcagtgga ggaggcggtg ggggtcctgg 3961 aggaaggacc gggccaggac gaggcgacaa gaggagctgg ccctctccca agaaccgaag 4021 tcgtcctcca gaggagcgtc ccccggggct tcccctgcct cccccacctc ccagcagttc 4081 tgctgtcttc cgcctggacc aagttatcca cagcaaccct gctggcatcc aacaggctct 4141 ggcccagctt agtagccgtc aagggagtgt aactgcacca gggggtcatc caaggcacaa 4201 gcctgggcct ccccaagccc ctcagggccc ctctcctagg cccccaaccc gatacgagcc 4261 ccagagggtc aacagcggcc tcagttctga cccccacttt gaggagccgg ggccaatggt 4321 gagaggggtg ggtgggactc ctcgggactc tgccggggtt agtccctttc cccctaaacg 4381 tcgggagcgg cctcccagaa aaccagagct gctacaggag gaatctttgc cacctcctca 4441 tagctctgga ttcttgggct ctaagcctga gggcccaggc cctcaggcag agtccagaga 4501 tacaggcaca gaggccctga cccctcacat ctggaaccgt ttacatactg ccactagccg 4561 aaagagttac cggcccacgt ccatggagcc ttggatggag cccctgagtc cttttgagga 4621 tgtggctggc acagaaatga gtcagtctga cagtggggtg gacctgagtg gggattctca 4681 ggtgtcatca ggtccctgca gccagcgaag ttcccctgat ggaggactca agggggcagc 4741 agagggaccc cccaagaggc ctggaggctc ctcacccctg aatgctgttc cttgtgaggg 4801 tccacctggc tctgaacctc ctaggagacc accacctgcc ccccacgatg gggacagaaa 4861 ggagctgccc cgggagcagc ctctgccccc tggccccatt ggcacagaac gatcacagcg 4921 tacagaccga ggcacagagc ctggccccat tcggccatcc catcgacctg gtcccccagt 4981 ccagtttggc actagtgaca aggactcaga cttacgccta gtggtaggag acagcttgaa 5041 agcagagaag gagctaacag catcagtcac tgaggccatt cctgtatcac gagactggga 5101 gctgcttccc agtgctgctg cctctgctga gccacaatcc aagaacctgg attctgggca 5161 ctgtgtcccg gagcccagct cctcaggcca gcgcctgtat cctgaggttt tctatggcag 5221 tgctgggcct tccagttctc agatctctgg ggggagccat ggactctcaa ttacatccaa 5281 acagtggagg cttcgccctg ggacaccctc actgcaccct tacagatcac agcccctata 5341 cctacccccg ggcccagccc ctccctcagc actgctctct ggggtagctc tcaagggcca 5401 gtttctggat ttctccacaa tgcaagctac agagctgggg aagttgccgg ctggaggagt 5461 tctctaccct ccaccttcct tcctctactc tccggctttc tgccccagtc ctttgcctga 5521 cacatcgttg cttcaggtac gccaggatct gccatcccct tcggattttt attctactcc 5581 tctgcagcct ggtggccaaa gtggctttct cccttcaggg gctcctgccc agcagatgct 5641 tctacccatg gtagactcac agctgcctgt ggtgaacttt ggctccctgc cgccagcacc 5701 acctcctgcc ccacctcccc tttctctgtt acctgtgggc cctgctctgc agccccccag 5761 cctggctgtg cggcccccac ctgctcctgc tactcgggtg ctgccttcac ctgccaggcc 5821 cttccccgct agcttggggc gagcagagct gcatccagtg gaactaaagc cgttccagga 5881 ttatcaaaaa ctgagcagca accttggggg acctggatca tcacggactc ccccaactgg 5941 aaggtccttc tctggcctca attcccgtct caaggccacg ccttccacct acagtggagt 6001 cttccgcacc cagcgcgtcg acctttacca gcaggcctcc ccaccagatg ccctgcgctg 6061 gatacctaag ccttgggagc ggacagggcc gccacctcga gaagggccct cccgacgggc 6121 agaggagcct gggtcccgag gggacaagga gcctgggttg cccccacccc gctgagggag 6181 ttcctcttgc cccctacccc cggggcttgt atatagatta taaatatata agggggaaag 6241 gggtgggcgg ggaggggttg tggggctggg gcctcacttc ccctcctccc ccttcccctg 6301 gtcccctgtc cctggggctg tttgttaaaa aagagtaata aaaggattt // LOCUS HUMBAT3A 3740 bp ss-mRNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 3 (BAT3) mRNA, complete cds. ACCESSION M33519 M31294 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SOURCE Human T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3740) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept 250 3648 HLA-B-associated transcript 3 (BAT3) mRNA 1 3740 BAT3 mRNA BASE COUNT 744 a 1182 c 1057 g 757 t ORIGIN Chromosome 6p21.3. 1 ggcgacagcg gtggcggctc ctcggggtgc tcggctccct cccacctagg ccggccccgg 61 cccgactcgc cctcagaaac tcactgtttg gggctgcgga ctttctcgtc gtgccccaca 121 aaagtaaagc ttggggacct ggggggagcc ggaagtatcg cttcgagatc cccaaatact 181 atcggggaaa cggaagtggc cgtcggtggc aggtttgggg gagaccggaa gtgacgagac 241 ctgtcggcca tggagcctaa tgatagtacc agtaccgctg tggaggagcc tgacagcttg 301 gaggtgttgg tgaagacctt ggactctcaa actcgtacct ttattgtggg ggcccagatg 361 aatgtaaaag agtttaagga gcacattcgt gcctctgtca gcatcccatc tgaaaaacaa 421 cggctcattt accagggacg agttctgcaa gatgataaga agcttcagga atacaatgtt 481 gggggaaagg ttatccacct ggtggaacgg gctcctcctc agactcacct cccttctggg 541 gcatcttctg ggacggggtc tgcctcagcc actcatggtg ggggatcccc ccctggtact 601 cgggggcctg gggcctctgt tcatgaccgg aatgccaaca gctatgtcat ggttggaacc 661 ttcaatcttc ctagtgacgg ctctgctgtg gatgttcaca tcaacatgga acaggccccg 721 attcagagtg agccccgggt acggctggtg atggctcagc acatgatcag ggatatacag 781 accttactat cccggatgga gactctcccc taccttcagt gtcgaggagg gccccaaccg 841 cagcacagtc agccgccccc gcagccaccg gctgtgaccc cggagccagt agccttgagc 901 tctcaaacat cagaaccagt tgaaagtgaa gcacctcccc gggagcccat ggaggcagaa 961 gaagtggagg agcgtgcccc agcccagaac ccggagctca ctcctggccc agccccagcg 1021 ggcccaacac ctgccccgga aacaaatgca cccaaccatc cttcccctgc ggagtatgtc 1081 gaggtgctcc aggagctaca gcggctggag agtcgcctcc agcccttctt gcagcgctac 1141 tacgaggttc tgggtgctgc tgccaccacg gactacaata acaatcacga gggccgggag 1201 gaggatcagc ggttgatcaa cttggtaggg gagagcctgc gactgctggg caacaccttt 1261 gttgcactgt ctgacctgcg ctgcaatctg gcctgcacgc ccccacgaca cctgcatgtg 1321 gtccggccta tgtctcacta caccaccccc atggtgctcc agcaggcagc cattcccata 1381 cagatcaatg tgggaaccac tgtgaccatg acaggaaatg ggactcggcc ccccccaact 1441 cccaatgcag aggcacctcc ccctggtcct gggcaggcct catccgtggc tccgtcttct 1501 accaatgtcg agtcctcagc tgagggggct cccccgccag gtccagctcc cccgccagcc 1561 accagccacc cgagggtcat ccggatttcc caccagagtg tggaacccgt ggtcatgatg 1621 cacatgaaca ttcaagattc tggcacacag cctggtggtg ttccgagtgc tcccactggc 1681 cccctgggac cccctggtca tggccaaacc ctgggacagc aggtgccagg cttcccaaca 1741 gctccaaccc gggtggtgat tgcccggccc actcctccac aggctcggcc ttcccatcct 1801 ggagggcccc cagtctctgg gacactgcag ggcgccggtc tgggtaccaa tgcctcgttg 1861 gcccagatgg tgagcggcct tgtggggcag cttcttatgc agccagtcct tgtggctcag 1921 gggaccccag gtatggctcc accgccagcc cctgccactg cttctgccag tgctggcacc 1981 accaacacag ctaccacagc tggccccgct cctggggggc ctgcccagcc tccacccacc 2041 cctcaaccct ccatggctga tcttcagttc tctcagcttc tggggaacct gctagggcct 2101 gcagggccag gggctggagg gcctggtgtg gcttctccca ccatcactgt ggcgatgcct 2161 ggtgtccctg cctttctcca aggcatgact gacttcttgc aggcaacaca gacagcccct 2221 ccaccacccc cacctcctcc acccccacca cctgccccag agcagcagac catgccccca 2281 ccaggctccc cttctggtgg cgcagggagt cctggaggcc tgggtcttga gagcctgtca 2341 ccggagtttt ttacctcagt ggtgcagggt gtgctcagct ccctgctggg ctccctgggg 2401 gctcgggctg gcagcagtga aagtattgct gccttcatac aacgcctcag tggatccagc 2461 aacatctttg agcctggagc tgatggggcc cttggattct ttggggcctt gctttctctt 2521 ctgtgccaga acttctctat ggtggacgta gtgatgcttc tccatgggca tttccagcca 2581 ctacaacggc tccagcccca gctgcgatcc ttcttccacc agcactacct gggtggtcag 2641 gagcccacac ccagtaacat ccggatggca acccacacat tgatcacggg gctagaagag 2701 tatgtgcggg agagtttttc cttggtgcag gttcagccag gtgtggacat catccggaca 2761 aacctggaat ttctccaaga gcagtttaat agcattgctg cgcatgtgct gcattgcaca 2821 gatagtggat ttggggcccg gttgctggag ttgtgtaacc aaggcctgtt tgaatgcctg 2881 gccctaaacc tgcactgctt ggggggacag cagatggagc ttgctgctgt tatcaatggc 2941 cgaattcgtc gtatgtctcg tggggtgaat ccctccttgg tgagctggct gaccactatg 3001 atgggactga ggcttcaggt ggtactggag cacatgcctg taggccctga tgccattctc 3061 agatacgttc gcagggttgg tgatcccccc cagccacttc ctgaggagcc aatggaagtt 3121 cagggagcag aaagagcttc ccctgagcct cagcgggaga atgcttcccc agcccctgga 3181 acaacagcag aagaggccat gtcccgaggt ccacctcctg ctcctgaggg gggctcccgg 3241 gatgaacagg atggagcttc agctgagaca gaaccttggg cagctgcagt ccccccagaa 3301 tgggtcccta ttatccagca ggacattcag agccagcgga aggtgaaacc gcagccccct 3361 ctgagtgatg cctacctcag tggtatgcct gccaagagac gcaagacgat gcagggtgag 3421 ggcccccagc tgcttctctc agaggctgtg agccgggcag ctaaggcagc cggagctcgg 3481 cccctgacga gccccgagag cctgagccgg gacctggagg caccagaggt tcaggagagc 3541 tacaggcagc agctccggtc tgatatacaa aaacgactgc aggaagaccc caactacagt 3601 ccccagcgct tccccaatgc ccagcgggcc tttgctgatg atccttagct ctttgctcta 3661 tggcccttcc tcatcagggg accgtttccc ccctcttcct tcacagtatt taagaaataa 3721 aagtcggatt ttttctggcc // LOCUS HUMBAT3B1 785 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 3 (BAT3) gene, 5' end. ACCESSION M33520 M31294 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SEGMENT 1 of 2 SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 333 to 689) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 785) AUTHORS Banerji,J. JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept 582 + 689 HLA-B-associated transcript 3 (BAT3), exon 1 pre-msg 333 > 785 BAT3 mRNA and introns IVS 690 > 785 BAT3 intron A BASE COUNT 170 a 210 c 239 g 165 t 1 others ORIGIN Chromosome 6p21.3. 1 aaggcgcagc gagggcaata gggtggagaa gagttttagc tgctagacag tgccgcctga 61 aattatcagc ctgccaagat ttaaacatag atgaatgtgg cataatcccc catctccaaa 121 gtccaagtcc atacgaccgt ccatagcctc tcgaggcagt ggtagagtcc cagctggtga 181 ctgtttttca ggcatttacg gtagccacct caatcttcta gcgctcaacg cgcgcacaga 241 cgtgaacgcc gccagagggg ggagggggtg gggcgatgct taagtgtcca cgcatcccgt 301 agtgcgacgg cacagcgtag taggtncccc cgggcgacag cggtggcggc tcctcggggt 361 gctcggctcc ctcccaccta ggccggcccc ggcccgactc gccctcagaa actcactgtt 421 tggggctgcg gactttctcg tcgtgcccca caaaagtaaa gcttggggac ctggggggag 481 ccggaagtat cgcttcgaga tccccaaata ctatcgggga aacggaagtg gccgtcggtg 541 gcaggtttgg gggagaccgg aagtgacgag acctgtcggc catggagcct aatgatagta 601 ccagtaccgc tgtggaggag cctgacagct tggaggtgtt ggtgaagacc ttggactctc 661 aaactcgtac ctttattgtg ggggcccagg tgagacacct cactagttct ggaagacacc 721 tttagctttt ccacgtttag gccccttagc ctgagagatg agcttgattt ttctaggtca 781 ccaga // LOCUS HUMBAT3B2 4227 bp ds-DNA PRI 18-JAN-1990 DEFINITION Human HLA-B-associated transcript 3 (BAT3) gene, 3' end. ACCESSION M33521 M31294 KEYWORDS class III gene; major histocompatibility complex; proline-rich protein. SEGMENT 2 of 2 SOURCE Human T-cell line MANN DNA, and T-cell line HPB-All, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 731 to 848; 963 to 4227) AUTHORS Banerji,J., Sands,J., Strominger,J.L. and Spies,T. TITLE A gene pair from the human major histocompatibility complex encodes large proline-rich proteins with multiple repeated motifs and a single ubiquitin-like domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2374-2378 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 4227) AUTHORS Banerji,J. JOURNAL Unpublished (1990) 7 Divinity Ave., Cambridge, MA 02138 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Banerji, 11-JAN-1990, for release after publication. FEATURES from to/span description pept + 731 848 HLA-B-associated transcript 3 (BAT3), exon 2 963 4135 HLA-B-associated transcript 3, exon 3 pre-msg < 1 4227 BAT3 mRNA and introns IVS < 1 730 BAT3 intron A IVS 849 962 BAT3 intron B BASE COUNT 842 a 1276 c 1116 g 987 t 6 others ORIGIN About 1.1 kb after segment 1; chromosome 6p21.3. 1 ttatcttntt agatcatttc cttccacctt aacctatacc agacccactc cttctttgcc 61 attttttaat cttggaaatc acaggagngt ctgtaaatna ctggatcatc ttgtgtttgg 121 aaggggtact gatgtctcta gacacatacn cccttggatg ccagacagat aatataattt 181 ccatgtgttt tttttttgtt tttcatccgt gttatttttc ctggatctat aacctgagct 241 tcattaagtt tatttattta attttttcga gatggagtcc cacnctttca cccaggctag 301 agtgtagtga tgcgatctcg gctcactgca acctccgcct cccgaattca agtgattctc 361 ttgcttcagc ctccctagta gctgggatta caggcgacca ccatgcctgg cttatttttt 421 gtatttttgg taaaaagggg ttttacatgt tggccaggct ggtctcgaac tctgacctaa 481 gtgatctgcc tgccttggcc tcccaagtgc tggattacag tgtgagacca ccgctccagc 541 caatatgtct gtatttttga cacgtgttac tttagttaag ggtttgcaca gtaatgatct 601 cacggtcaag acaaacgggt agtgattdct gtggtggttt ttacccctca cctccacaac 661 tcggttgtct gtctttgttc ttcctctttc ctccattctt tccattcctg tgcatgcctc 721 ttcttttcag atgaatgtaa aagagtttaa ggagcacatt cgtgcctctg tcagcatccc 781 atctgaaaaa caacggctca tttaccaggg acgagttctg caagatgata agaagcttca 841 ggaatacagt aagggggctg gggaggcagt tcagaggttg gggctactgt ctggagggat 901 gaactgaggc catgggttta cctgttcata ctatgttttg gtgtgtgtct atttttctgc 961 agatgttggg ggaaaggtta tccacctggt ggaacgggct cctcctcaga ctcacctccc 1021 ttctggggca tcttctggga cggggtctgc ctcagccact catggtgggg gatccccccc 1081 tggtactcgg gggcctgggg cctctgttca tgaccggaat gccaacagct atgtcatggt 1141 tggaaccttc aatcttccta gtgacggctc tgctgtggat gttcacatca acatggaaca 1201 ggccccgatt cagagtgagc cccgggtacg gctggtgatg gctcagcaca tgatcaggga 1261 tatacagacc ttactatccc ggatggagac tctcccctac cttcagtgtc gaggagggcc 1321 ccaaccgcag cacagtcagc cgcccccgca gccaccggct gtgaccccgg agccagtagc 1381 cttgagctct caaacatcag aaccagttga aagtgaagca cctccccggg agcccatgga 1441 ggcagaagaa gtggaggagc gtgccccagc ccagaacccg gagctcactc ctggcccagc 1501 cccagcgggc ccaacacctg ccccggaaac aaatgcaccc aaccatcctt cccctgcgga 1561 gtatgtcgag gtgctccagg agctacagcg gctggagagt cgcctccagc ccttcttgca 1621 gcgctactac gaggttctgg gtgctgctgc caccacggac tacaataaca atcacgaggg 1681 ccgggaggag gatcagcggt tgatcaactt ggtaggggag agcctgcgac tgctgggcaa 1741 cacctttgtt gcactgtctg acctgcgctg caatctggcc tgcacgcccc cacgacacct 1801 gcatgtggtc cggcctatgt ctcactacac cacccccatg gtgctccagc aggcagccat 1861 tcccatacag atcaatgtgg gaaccactgt gaccatgaca ggaaatggga ctcggccccc 1921 cccaactccc aatgcagagg cacctccccc tggtcctggg caggcctcat ccgtggctcc 1981 gtcttctacc aatgtcgagt cctcagctga gggggctccc ccgccaggtc cagctccccc 2041 gccagccacc agccacccga gggtcatccg gatttcccac cagagtgtgg aacccgtggt 2101 catgatgcac atgaacattc aagattctgg cacacagcct ggtggtgttc cgagtgctcc 2161 cactggcccc ctgggacccc ctggtcatgg ccaaaccctg ggacagcagg tgccaggctt 2221 cccaacagct ccaacccggg tggtgattgc ccggcccact cctccacagg ctcggccttc 2281 ccatcctgga gggcccccag tctctgggac actgcagggc gccggtctgg gtaccaatgc 2341 ctcgttggcc cagatggtga gcggccttgt ggggcagctt cttatgcagc cagtccttgt 2401 ggctcagggg accccaggta tggctccacc gccagcccct gccactgctt ctgccagtgc 2461 tggcaccacc aacacagcta ccacagctgg ccccgctcct ggggggcctg cccagcctcc 2521 acccacccct caaccctcca tggctgatct tcagttctct cagcttctgg ggaacctgct 2581 agggcctgca gggccagggg ctggagggcc tggtgtggct tctcccacca tcactgtggc 2641 gatgcctggt gtccctgcct ttctccaagg catgactgac ttcttgcagg caacacagac 2701 agcccctcca ccacccccac ctcctccacc cccaccacct gccccagagc agcagaccat 2761 gcccccacca ggctcccctt ctggtggcgc agggagtcct ggaggcctgg gtcttgagag 2821 cctgtcaccg gagtttttta cctcagtggt gcagggtgtg ctcagctccc tgctgggctc 2881 cctgggggct cgggctggca gcagtgaaag tattgctgcc ttcatacaac gcctcagtgg 2941 atccagcaac atctttgagc ctggagctga tggggccctt ggattctttg gggccttgct 3001 ttctcttctg tgccagaact tctctatggt ggacgtagtg atgcttctcc atgggcattt 3061 ccagccacta caacggctcc agccccagct gcgatccttc ttccaccagc actacctggg 3121 tggtcaggag cccacaccca gtaacatccg gatggcaacc cacacattga tcacggggct 3181 agaagagtat gtgcgggaga gtttttcctt ggtgcaggtt cagccaggtg tggacatcat 3241 ccggacaaac ctggaatttc tccaagagca gtttaatagc attgctgcgc atgtgctgca 3301 ttgcacagat agtggatttg gggcccggtt gctggagttg tgtaaccaag gcctgtttga 3361 atgcctggcc ctaaacctgc actgcttggg gggacagcag atggagcttg ctgctgttat 3421 caatggccga attcgtcgta tgtctcgtgg ggtgaatccc tccttggtga gctggctgac 3481 cactatgatg ggactgaggc ttcaggtggt actggagcac atgcctgtag gccctgatgc 3541 cattctcaga tacgttcgca gggttggtga tcccccccag ccacttcctg aggagccaat 3601 ggaagttcag ggagcagaaa gagcttcccc tgagcctcag cgggagaatg cttccccagc 3661 ccctggaaca acagcagaag aggccatgtc ccgaggtcca cctcctgctc ctgagggggg 3721 ctcccgggat gaacaggatg gagcttcagc tgagacagaa ccttgggcag ctgcagtccc 3781 cccagaatgg gtccctatta tccagcagga cattcagagc cagcggaagg tgaaaccgca 3841 gccccctctg agtgatgcct acctcagtgg tatgcctgcc aagagacgca agacgatgca 3901 gggtgagggc ccccagctgc ttctctcaga ggctgtgagc cgggcagcta aggcagccgg 3961 agctcggccc ctgacgagcc ccgagagcct gagccgggac ctggaggcac cagaggttca 4021 ggagagctac aggcagcagc tccggtctga tatacaaaaa cgactgcagg aagaccccaa 4081 ctacagtccc cagcgcttcc ccaatgccca gcgggccttt gctgatgatc cttagctctt 4141 tgctctatgg cccttcctca tcaggggacc gtttcccccc tcttccttca cagtatttaa 4201 gaaataaaag tcggattttt tctggcc //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS CHBSSRNA 1504 bp ss-RNA RNA 01-FEB-1990 DEFINITION C.limicola ribosomal RNA small subunit. ACCESSION M31769 KEYWORDS ribosomal RNA; ribosomal RNA small subunit. SOURCE C.limicola (strain ATCC 8327) RNA. ORGANISM Chlorobium limicola Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Green bacteria; Green sulfur bacteria. REFERENCE 1 (bases 1 to 1504) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) Microbiol. Dept. U. of IL., Urbana, IL. 61801 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 01-FEB-1990. FEATURES from to/span description rRNA 1 1504 ribosomal RNA small subunit BASE COUNT 396 a 334 c 437 g 311 t 26 others ORIGIN 1 nnncaacgga gagtttgatc ctggctcagg acgaacgctg gcggcgtgcc taacacatgc 61 aagtcaaagg aaagtttctt cggaaataag tacttggcgc aagggtgagt aaggcatagg 121 taatctgccc tttggactgg cataaccccg agaaatcggg gacaatacca gatgatgcag 181 cggaaccgca tggttatgtt gttaaatgat ttatcgccaa aggatgagcc tatgttccat 241 caggtagttg gtagggtaac ggcctaccaa gccaacgacg gatagctggt ctgagaggat 301 gatcagccac attggaactg agacacggtc cagactccta cgggaggcag cagtgaggaa 361 tattgcgcaa tgggcgaaag cctgacgcag caacgccgcg tggatgatga agttcttcgg 421 aatgtaaaat ccttttgctg gggacgaata ggtcggcttg ccgactctga cggtacccag 481 cgaataagcc acggctaact ctgtgccagc agccgcggtg atacaggggt ggcaagcgtt 541 gtccggattt actgggtgta aagggtgcgc aggcggactg ataagtcggg ggttaaatcc 601 atgtgctcaa cacatgcacg gcttccgata ctgtcagtct tgagtctcga agaggaagat 661 ggaatttccg gtgtaacggt ggaatgtgta gatatcggaa agaacaccag tggcgaaggc 721 agtcttctgg tcgagtactg acgctcaggc acgaaagcgt ggggagcaaa caggattaga 781 taccctggta gtccacgccg taaacgatga atactagatg ttggtcatat tgatcagtgt 841 cgcagctaac gcattaagta ttccacctgg gaagtacgcc cgcaagggtg aaactcaaag 901 gaattgacgn nnnccngcac aagcggtgga tcatgtggtt taattcgatg caacgcgaag 961 aaccttacct aggcttgaaa tgttagctaa agctcctgaa agggagcgtc cttcggggag 1021 ctagcacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc 1081 gcaacgagcg caacccctac aattagttac taacaggtta agctgaggac tctaattgaa 1141 ctgcctacgc aagtagagag gaagganggg atgacgtcaa gtcctcatgg cccttacgcc 1201 tagggccaca cacgtgatac aatggcgact acagagggca aagccgcgag gcagaggaaa 1261 tcccttaaaa gtcgtctcag tccggatcgg agtctgcaac tcgactccgt gaagttggaa 1321 tcgctagtaa tcgcggatca gcatgccgcg gtgaatgtgt tcccgggcct tgtacacacc 1381 gcccgtcaag tcatggaagt caggagtacc caaagacgtc cgcgcgttta aggtaagact 1441 ggtaactggg actaagnngt aacaagnnnn nngtaccgga agnnnnnnnn ngatcacctc 1501 cttt // LOCUS ECOSPEA 3236 bp ds-DNA BCT 01-FEB-1990 DEFINITION E.coli arginine decarboxylase (speA) gene, complete cds, agmatinase (speB) and methionine adenosyltransferase (metK) genes, 5' end. ACCESSION M31770 KEYWORDS agmatinase; arginine decarboxylase; metK gene; methionine adenosyltransferase; speA gene; speB gene. SOURCE E.coli (strain K12) DNA, clones pLC2-5 and lambda-[1H10,23G45]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3236) AUTHORS Moore,R.C. and Boyle,S.M. TITLE Nucleotide sequence and analysis of the speA gene encoding arginine decarboxylase of Escherichia coli JOURNAL Unpublished (1990) Virginia S. Uni. Vet. Med., Blacksburg, VA 24061 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by R.C.Moore, 02-FEB-1990. FEATURES from to/span description pept 192 < 1 (c) methionine adenosyltransferase (metK) pept 987 2963 arginine decarboxylase (speA) (EC 4.1.1.19) pept 3101 > 3236 agmatinase (speB) signal 3009 3022 rho-independent transcription terminator signal 811 816 -35 signal signal 839 844 -10 signal binding 977 980 ribosome binding site site 1878 1886 pyridoxal phosphate binding site (put.) BASE COUNT 743 a 784 c 860 g 849 t ORIGIN 62.9 min on K12 map. 1 tacccaaggt cgctggtggt gatttcgccg ccaactaaaa ccaatgccgg tttttacgta 61 ggtttcgcaa gcaacgcgtg ctttcggatc ctgttcgagg atcgcgtcta aaacggcatc 121 agaaatttgg tcagcaattt tgtcaggatg cccttcagag acggactcgg acgtaaaaag 181 gtgttttgcc atatttaata tcacctaaag agaatttggt tagctcaaac tgttgtgtgg 241 attttctgtg gtagcggatc ctaccacgac tctgcaggtt aaaaacactg gcagtctgag 301 tgttaatcgg tatggatgga ttaacatctg gatggctatt ttaggtcaat tcttcaccct 361 atttccactt ttttttgaat cgtgtctcat tctgttaaaa acgtggctgg aaatttttcc 421 tgacaatgcc ggcattctgc gtatttatct tttgcaattt tctgccattg tggggtataa 481 aacgcggcgc gcggcttaaa taaaaagcac acgacgtttc tttcgtgttg ccacttccag 541 ccgggttcaa atcagagttt tggcttgtgg gttcgtctta acaggcggcc gtggaggtga 601 tacgaaataa tgaaccgttg tctgctgctt aacctgtctc accgttctgg tgaagattcg 661 ttccccgcac tctgcatctc tgctttgcat acctgccgat gttataccca tctcggcgct 721 tctcaggatt caagagctgg ttacagttac tgaggactga acaagggcgc tcttgtaaaa 781 acaagagttt tctcgtggtt tcgccgaact ttcacactta cgttcggtta tgtgcttaat 841 aatgttatga aaaagaaacc ggttgcgcag ttggagcgtc agcattcact gctggaaaat 901 ccatgtgctt atgggttgtt atcgcagttc caggctgcga tagtcgttaa ctgttttaca 961 cttaataaaa taatttgagg ttcgctatgt ctgacgacat gtctatgggt ttgccttcgt 1021 cagcgggcga acacggtgta ctacgctcca tgcaggaggt tgcaatgagc tcccaggaag 1081 ccagcaagat gctgcgtact tacaatattg cctggtgggg caataactac tatgacgtta 1141 acgagctggg ccacattagc gtgtgcccgg acccggacgt cccggaagct cgcgtcgatc 1201 tcgcgcagtt agtgaaaact cgtgaagcac agggccagcg tctgcctgca ctgttctgtt 1261 tcccacagat cctgcagcac cgtttgcgtt ccattaacgc cgcgttcaaa cgtgcgaggg 1321 aatcctacgg ctataacggc gattacttcc ttgtttatcc gatcaaagtt aaccagcacc 1381 gccgcgtgat tgagtccctg attcattcgg gcgaaccgct gggtctggaa gccggttcca 1441 aagccgagtt gatggcagta ctggcacatg ctggcatgac ccgtagcgtc atcgtctgca 1501 acggttataa agaccgcgaa tatatccgcc tggcattaat tggcgagaag atggggcaca 1561 aggtctatct ggtcattgag aagatgtcag aaatcgccat tgtgctggat gaagcagaac 1621 gtctgaatgt cgttcctcgt ctgggcgtgc gtgcacgtct gcgttcgcag ggttcgggta 1681 aatggcagtc ctccggcggg gaaaaatcga agttcggcct ggctgcgact caggtactgc 1741 aactggttga aaccctgcgt gaagccgggc gtctcgacag cctgcaacta ctgcacttcc 1801 acctcggttc gcagatggcg aatattcgcg atatcgcgac aggcgttcgt gaatccgcgc 1861 gtttctatgt ggaactgcac aagctgggcg tcaatattca gtgcttcgac gtcggcggcg 1921 gtctgggcgt ggattatgaa ggtactcgtt cgcagtccga ctgttcggtg aactacggcc 1981 tcaatgaata cgccaacaac attatctggg cgattggcga tgcgtgtgaa gaaaacggtc 2041 tgccgcatcc gacggtaatc accgaatcgg gtcgtgcggt gactgcgcat cacaccgtgc 2101 tggtgtctaa tatcatcggc gtggaacgta acgaatacac ggtgccgacc gcgcctgcag 2161 aagatgcgcc gcgcgcgctg caaagcatgt gggaaacctg gcaggagatg cacgaaccgg 2221 gaactcgccg ttctctgcgt gaatggttac acgacagtca gatggatctg cacgacattc 2281 atatcggcta ctcttccggc atctttagcc tgcaagaacg tgcatgggct gagcagcttt 2341 atttgagcat gtgccatgaa gtgcaaaagc agctggatcc gcaaaaccgt gctcatcgtc 2401 cgattatcga cgagctgcag gaacgtatgg cggacaaaat gtacgtcaac ttctcgctgt 2461 tccagtcgat gccggacgca tgggggatcg accagttgtt cccggttctg ccgctggaag 2521 ggctggatca agtgccggaa cgtcgcgctg tgctgctgga tattacctgt gactctgacg 2581 gtgctatcga ccactatatt gatggtgacg gtattgccac gacaatgcca atgccggagt 2641 acgatccaga gaatccgccg atgctcggtt tctttatggt cggcgcatat caggagatcc 2701 tcggcaacat gcacaacctg ttcggtgata ccgaagcggt tgacgtgttc gtcttccctg 2761 acggtagcgt agaagtagaa ctgtctgacg aaggcgatac cgtggcggac atgctgcaat 2821 atgtacagct cgatccgaaa acgctgttaa cccagttccg cgatcaagtg aagaaaaccg 2881 atcttgatgc tgaactgcaa caacagttcc ttgaagagtt cgaggcaggt ttgtacggtt 2941 atacttatct tgaagatgag taagtcctgt gttacttgaa tccgcttaat ttagcggtga 3001 taatccgcca caatttattg tgacaaatcc aacccttcct cgtcgggcct aacgacgcgg 3061 aagggttttt ttatatcgac tttgtaatag gagtccatcc atgagcacct taggtcatca 3121 atacgataac tcactggttt ccaatgcctt tggtttttta cgcctgccga tgaacttcca 3181 gccgtatgac agcgatgcag actgggtgat tactggcgtg ccgttcgata tggcca //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS MUSIGHRT 314 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma 68.2D8, partial cds. ACCESSION M32044 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 68.2D8. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 314) AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J. TITLE Mouse antibody response to group a streptococcal carbohydrate JOURNAL J. Immunol. 143, 4213-4220 (1989) STANDARD full staff_review FEATURES from to/span description pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13) IVS < 1 11 Ig H-chain V-region intron BASE COUNT 88 a 58 c 86 g 82 t ORIGIN Chromosome 12. 1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga 61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg 121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat 181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat 241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat 301 tactgcacag atct // LOCUS MUSIGHRU 314 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma 68.3D3, partial cds. ACCESSION M32045 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 68.3D3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 314) AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J. TITLE Mouse antibody response to group a streptococcal carbohydrate JOURNAL J. Immunol. 143, 4213-4220 (1989) STANDARD full staff_review FEATURES from to/span description pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13) IVS < 1 11 Ig H-chain V-region intron BASE COUNT 88 a 58 c 85 g 82 t 1 others ORIGIN Chromosome 12. 1 agggtccaga gtnaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga 61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg 121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat 181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat 241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat 301 tactgcacag atct // LOCUS MUSIGHRV 306 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma 80.1B5, partial cds. ACCESSION M32046 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain 80.1B5), cDNA to mRNA, anti-GAC hybridoma 80.1B5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 306) AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J. TITLE Mouse antibody response to group a streptococcal carbohydrate JOURNAL J. Immunol. 143, 4213-4220 (1989) STANDARD full staff_review FEATURES from to/span description pept / 12 > 306 Ig H-chain V-region, exon X (AA at 13) IVS < 1 11 Ig H-chain V-region intron BASE COUNT 85 a 56 c 85 g 80 t ORIGIN Chromosome 12. 1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga 61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg 121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat 181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat 241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat 301 tactgc // LOCUS MUSIGHRW 314 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma 68.2C4, partial cds. ACCESSION M32047 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 68.2C4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 314) AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J. TITLE Mouse antibody response to group a streptococcal carbohydrate JOURNAL J. Immunol. 143, 4213-4220 (1989) STANDARD full staff_review FEATURES from to/span description pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13) IVS < 1 11 Ig H-chain V-region intron BASE COUNT 89 a 57 c 84 g 84 t ORIGIN Chromosome 12. 1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga 61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtactgg 121 gtccgccagt ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat 181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat 241 tccaaaagta atgtctacct tcaaatgaac agcttaaggg ctgaagacac tggaatttat 301 tactgcacag attt // LOCUS MUSIGHRX 314 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma 65.1B3, partial cds. ACCESSION M32048 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain A/J),cDNA to mRNA, anti-GAC hybridoma 65.1B3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 314) AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J. TITLE Mouse antibody response to group a streptococcal carbohydrate JOURNAL J. Immunol. 143, 4213-4220 (1989) STANDARD full staff_review FEATURES from to/span description pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13) IVS < 1 11 Ig H-chain V-region intron BASE COUNT 91 a 58 c 84 g 81 t ORIGIN Chromosome 12. 1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga 61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgaactgg 121 gtccgccagt ctccagagaa gggacttgaa tgggttactg aaattagatt gaaatctgat 181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat 241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat 301 tactgcacag atct // LOCUS MUSIGHRY 313 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma 66.4A10, partial cds. ACCESSION M32049 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 66.4A10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 313) AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J. TITLE Mouse antibody response to group a streptococcal carbohydrate JOURNAL J. Immunol. 143, 4213-4220 (1989) STANDARD full staff_review FEATURES from to/span description pept / 12 > 313 Ig H-chain V-region, exon X (AA at 13) IVS < 1 11 Ig H-chain V-region intron BASE COUNT 86 a 59 c 84 g 83 t 1 others ORIGIN Chromosome 12. 1 agggtncaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga 61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtcctgg 121 gtccgccagt ctccagagaa gggactttat tgggttgctg aaattagatt gaaatctgat 181 aattctgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat 241 tccaaaagta gtgtctacct gcaaatgaac agcttaaggg ctgaagacac tggaatttat 301 tactgcacag atc // LOCUS MUSIGHRZ 314 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma 66.5A3, partial cds. ACCESSION M32050 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 66.5A3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 314) AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J. TITLE Mouse antibody response to group a streptococcal carbohydrate JOURNAL J. Immunol. 143, 4213-4220 (1989) STANDARD full staff_review FEATURES from to/span description pept / 12 > 314 Ig H-chain V-region, exon X (AA at 13) IVS < 1 11 Ig H-chain V-region intron BASE COUNT 91 a 58 c 84 g 81 t ORIGIN Chromosome 12. 1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga 61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaaccactg gatgtactgg 121 gtccgccaat ctccagagaa gggacttgag tgggttgctg aaattagatt gaaatctgat 181 aattatgtaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat 241 tccaaaagta gtgtctacct gcaaatgaac aacttaaggg ctgaagacac tggaatttat 301 tacagcacag atct // LOCUS MUSIGHSA 305 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma 66.3E3, partial cds. ACCESSION M32051 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 66.3E3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 305) AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J. TITLE Mouse antibody response to group a streptococcal carbohydrate JOURNAL J. Immunol. 143, 4213-4220 (1989) STANDARD full staff_review FEATURES from to/span description pept / 12 > 305 Ig H-chain V-region, exon X (AA at 13) IVS < 1 11 Ig H-chain V-region intron BASE COUNT 85 a 55 c 85 g 80 t ORIGIN Chromosome 12. 1 agggtccaga gtgaagtgaa gcttgaggag tctggaggag gcttggtgca acctggagga 61 tccatgaaac tctcctgtgt tgcctctgga ttcactttca gtaactactg gatgtgctgg 121 gtccgccagt ctccagagaa ggggcttgag tgggttgctg aaattagatt gaaatctaat 181 aattatgcaa cacattatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat 241 tccaaaagta gtgtctacct gcaaatgaac aacttagagg ctgaagacac tggaatttat 301 tactg // LOCUS MUSIGHSB 305 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse Ig active H-chain (GAC1) mRNA V-region, from hybridoma 65.1D2, partial cds. ACCESSION M32052 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, anti-GAC hybridoma 65.1D2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 305) AUTHORS Jarvis,C.D.., Cannon,L.E. and Stavnezer,J. TITLE Mouse antibody response to group a streptococcal carbohydrate JOURNAL J. Immunol. 143, 4213-4220 (1989) STANDARD full staff_review FEATURES from to/span description pept / 12 > 305 Ig H-chain V-region, exon X (AA at 13) IVS < 1 11 Ig H-chain V-region intron BASE COUNT 85 a 55 c 86 g 79 t ORIGIN Chromosome 12. 1 agggtccaga gtgaagtgaa acttgaggag tctggaggag gcttggtgca acctggagga 61 tccatgaaac tctcctgtgc tgcctctgga ttcactttca ggaactcctg gatatactgg 121 gtccgccagt ctccagagaa ggggcttgag tgggttggtg aaattagatt gaagtcagat 181 aattttgcaa cacaatatgc ggagtctgtg aaagggaggt tcaccatctc aagagatgat 241 tccaaaagta gtgtctactt gcaaatgaac agcttaaggg ttgaagacac tgcaatttat 301 tactg //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS BSPCGTD 2648 bp ds-DNA BCT 15-DEC-1989 DEFINITION Bacillus sp. cyclomaltodextrin glucanotransferase gene, complete cds. ACCESSION M28053 KEYWORDS cyclomaltodextrin glucanotransferase. SOURCE Bacillus sp. (strain 17-1; ATCC 31007) DNA. ORGANISM Bacillus sp. Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2648) AUTHORS Kaneko,T., Song,K.-B., Hamamoto,T., Kudo,T. and Horikoshi,K. TITLE Construction of a chimeric series of Bacillus cyclomaltodextrin glucanotransferases and analysis of the thermal stabilities and pH optima of the enzymes JOURNAL J. Gen. Microbiol. 135, 3447-3457 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by T.Kaneko, 18-SEP-1989. FEATURES from to/span description pept 224 2365 cyclomaltodextrin glucanotransferase precursor (EC 2.4.1.19) sigp 224 304 cyclomaltodextrin glucanotransferase signal peptide matp 305 2362 cyclomaltodextrin glucanotransferase BASE COUNT 692 a 777 c 652 g 527 t ORIGIN 1 gaattcgctt tcatttacaa tattgttgaa atcaaacact cgcataattt aagggccatg 61 cattccgtca ccgcacaccc ggtatggaac aaccccggtt tctcttcagg agatgccggg 121 gttttttgtt gccctatatt tcgtacagga ggtgattccc cgcagcttcg gtttgatcat 181 tgaaccaacc cccattctat aactgctaca taggaggaat acgatgaaaa aaatctccaa 241 acttacaacc gctcttgccc ttagcttgtc cctggcgcta agcttgctcg gcccagccca 301 tgcagccccg gatacctcgg tatccaacaa gcaaaatttc agcacggatg tcatttatca 361 aatctttacg gaccgtttct ccgacggcaa tcctgccaac aacccgaccg gcccggcctt 421 tgacggtacc tgcaccaacc tgaggcttta ctgcggcggc gactggcaag ggatcatcaa 481 caaaatcaac gacggctacc tgaccggcat gggcgttacc gccatctgga tctcccagcc 541 ggtcgagaac atctacagcg tcatcaatta ttccggcgtg aataacaccg cataccacgg 601 ctattgggct agagacttca agaaaacgaa cccggcttac ggcacgattg ccgatttcca 661 gaatctcatc gctgccgccc atgccaagaa catcaaagtc atcatcgact tcgcaccgaa 721 ccacacgtct cccgcttcct tggaccagcc gtcgtttgcg gagaacggca agctgtacaa 781 caacggcagg gacgaaggcg gatacacgaa cgatacgcac aatttattcc atcataacgg 841 gggcaccgat ttctcgacga ccgagaacgg gatttacaaa aacctgtacg atcttgccga 901 tctgaaccac aacaacagca cagtggatac ctacctgaag gatgccatta aaatgtggct 961 cgatctgggc attgacggca ttcgaatgga cgcggtaaaa catatgccgt tcggctggca 1021 gaagagcttc atggccaccg tcaacaatta caagccggtc ttcaccttcg gcgaatggtt 1081 cctcggggta aacgaagtga gtgccgagaa tcataagttc gccaacgttt ccggcatgag 1141 cctgctggac ttccgcttcg cgcagaaggt tcgccaggta ttcaaggaca acaccgacaa 1201 tatgtacgga ctgaaatcca tgctagaggg ctctgcgacc gattacgcgc agatggagga 1261 tcaggtcacg ttcatcgata accacgacat ggaacgtttc cacaataaca gcgccaaccg 1321 ccggaagctg gagcaagcgc tcgccttcac gttaacttcc cgcggcgtgc cagccatcta 1381 ttatggaacc gagcaataca tgtccggcgg gaacgatccc gacaaccggg ctcgaattcc 1441 ttctttctct accacgacta cggcttatca ggtcagtaaa aagctggcgc ctctgcgtaa 1501 atccaatccg gccatcgctt atgggacgac gcaggagcgc tggataaaca acgatgtgtt 1561 gatctatgag cgcaagttcg gcaacaacgt agccgtcatc gccgtcaacc gcaacgtgaa 1621 cacgtcggcc tccattacgg gactggtaac ctcactgccg gccggcagct acaccgacgt 1681 gctgggcggc ctcttgaacg gtaacaattt gaccgtcggc tccggcggat ccgcttccat 1741 cttcacgctg gcagcgggcg gtacggccgt gtggcaatac accacggccg tgactgcgcc 1801 aaccatcggg cacgtaggac cgatgatggc caaaccaggc gcagccgtca ccattgacgg 1861 ccgcggtttc ggtgcaacca aaggcacggt atacttcggc acgacggcag ttaccggcgc 1921 taatattacg gcttgggaag acacgcagat caaagtgaaa atccctgccg ttgccggagg 1981 cgtatacaat atcaaaatcg ccaacagtgc cggaacctca agcaatgtgc atgacaactt 2041 cgaagtgctg agcggggatc aggtcagcgt gcgctttgtg gtgaacaacg ccaccaccgc 2101 gctcggccag aacgtatatc tggcaggcag cgtaagcgag ctcggcaact gggacccggc 2161 caaagccatc ggtccgttgt acaaccaggt catctaccaa tacccaacct ggtactatga 2221 cgtcaccgtt cccgccggca aaacgatcga atttaaattc ctgaaaaaac agggctcgac 2281 ggtaacgtgg gaaggcggct cgaaccatac cttcaccgca ccaacgagcg gcacggccac 2341 cataaacgtg aactggcaac cgtaagttca agaacgacta gcttctgccg aagaacggcc 2401 cctaaacaaa accagctccg gatatgatcc cggagctggt tttgttcatg caggtcataa 2461 ttccagtagc aatagcccgt tacgcgaagc atctcacagt tcctctgaac tgcatgagcg 2521 attcccaaag agagagcgtg ctcagcaggc agtaactcag cattcctctg cattgcatga 2581 gcgattccca aagggagagc gtctcaacaa gtagtgactc agcaagcaga cgggacaact 2641 caacgatc // LOCUS BSUDIV 1403 bp ds-DNA BCT 14-FEB-1990 DEFINITION B.subtilis division initiation protein (divIB) gene, 5' end. ACCESSION M31800 KEYWORDS division initiation protein. SOURCE B.subtilis (strain 168; prototroph SB19) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1403) AUTHORS Harry,E.J. and Wake,R.G. TITLE Cloning and expression of a Bacillus subtilis division initiation gene for which a homolog has not been identified in another organism JOURNAL J. Bacteriol. 171, 6835-6839 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 165 ORF1 (AA at 1) pept 295 1086 division initiation protein pept 1083 > 1403 ORF3 (ttg start codon) signal 213 240 transcription terminator BASE COUNT 521 a 229 c 313 g 340 t ORIGIN 1 ggcggtgcaa agatatcgga gatgcacgga aacttcatcg tcaatgcggg gggagcatca 61 gcaaaagatg tgcttgatct cattgaccat gtgaaaaaga caatccgtga aaaatacgag 121 attgatatgc acacagaggt tgaaatcatc ggcggaaatc gctgattcaa gttctgactg 181 aagctgttca tatgatatac tgtaagcaaa cgacaaacgg catcatagta tgccgtttgt 241 tttggaatag acagactttt aacagctgtt tcatttgaat gaggtgaaca ggcaatgaac 301 ccgggtcaag accgagaaaa aatcgttaac attgaagagc ggatccctaa aattaaagag 361 cagaggaaac aaaaggcaaa ccgccgtctc atttcattta ttatgctctt ttttattatg 421 gtgctgatca ttgtgtacct gcaaacacca atcagtaaag tatcaacaat ctctgttaca 481 gggaatgaaa atgtatcaaa aaaggaaatt atcgaccttt ccgatattaa cagcggagat 541 actgagtttt ggagtttgga caagcaaaaa acagaaaaga agattcagca aaataaatta 601 gtgaaaaaag cggagatcag caaatcgctt cctaataaaa tcaacattgc aatcgaagaa 661 tataaagcga ttgcttattt agaaaaagac gatgtgtact atgaagtgct tgaaaacggc 721 tcagttctgc caaacgaagt aacgccggat gatgccggac cgattttagt gaattggaca 781 aacgcgaaga agcggtctca aatggccaag cagctcgatg cgttgtcaaa ttcgttaaaa 841 cagtcgattt cagaaattta ttatacgcct gtaaaaatgg atgaaaaccg tattaagttg 901 tacatgaacg acggttatgt tgttacggca tctattaaaa cgtttgcaga ccgaatgaag 961 acgtatcctt ctatcatctc acagctaagc agcaataaaa aaggaatcat tcatttagaa 1021 gttgccacct attttgaaga gtttgggaaa aacgataaag ctgctaaaaa ggaagatgaa 1081 aattgagggg caaatcagca gtccttcttt ccttgattat gcttatcgca ggttttttga 1141 tttcattttc ttttcaaatg acaaaagaaa acaacaaaag cgcggctgag acggaagagt 1201 ggaaaaaaga atatgcgctg agagatgaac tgctgaaaca agagaaagaa aataagaagt 1261 ttgaaaaaga attatatcaa aagcaaaaca aagtccgcca ggcggaaaat aagctgaaga 1321 aagagaagtc ggaatactat aatgtcctcg aagatacaga aaaatacagg atgtatattg 1381 gggaagtcgg tgtgcaggga gaa // LOCUS HUMDF3AA 572 bp ss-mRNA PRI 16-FEB-1990 DEFINITION Human breast carcinoma-associated antigen DF3 mRNA, 5' end. ACCESSION M31823 KEYWORDS antigen DF3; glycoprotein. SOURCE Human breast carcinoma cell line MCF-7, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 572) AUTHORS Abe,M., Siddiqui,J. and Kufe,D. TITLE Sequence analysis of the 5' region of the human DF3 breast carcinoma-associated antigen gene JOURNAL Biochem. Biophys. Res. Commun. 165, 644-649 (1989) STANDARD simple staff_review FEATURES from to/span description pept 67 > 572 DF3 antigen mRNA 1 > 572 DF3 mRNA BASE COUNT 114 a 236 c 129 g 93 t ORIGIN 1 acctctcaag cagccagcgc ctgcctgaat ctgttctgcc ccctccccac ccatttcacc 61 accaccatga caccgggcac ccagtctcct ttcttcctgc tgctgctcct cacagtgctt 121 acagctacca cagcccctaa acccgcaaca gttgttacgg gttctggtca tgcaagctct 181 accccaggtg gagaaaagga gacttcggct acccagagaa gttcagtgcc cagctctact 241 gagaagaatg ctgtgagtat gaccagcagc gtactctcca gccacagccc cggttcaggc 301 tcctccacca ctcagggaca ggatgtcact ctggccccgg ccacggaacc agcttcaggt 361 tcagctgcca cctggggaca ggatgtcacc tcggtcccag tcaccaggcc agccctgggc 421 tccaccaccc cgccagccca cgatgtcacc tcagccccgg acaacaagcc agccccgggc 481 tccaccgccc cccaagccca cggtgtcacc tcggccccgg acaccaggcc ggccccgggc 541 tccaccgccc cccaagccca cggtgtcacc tc // LOCUS MUSH2LINK 2252 bp ss-mRNA ROD 14-FEB-1990 DEFINITION Mouse MHC H-2K/t-w5-linked open reading frame mRNA, complete cds. ACCESSION M32010 KEYWORDS . SOURCE Mouse embryo carcinoma cell line F9, cDNA to mRNA, clone 10.2a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2252) AUTHORS Han,St.-Jacques.B., MacMurray,T.-H. and Shin,A.H.-S. TITLE A putative transmembrane protein with histidine-rich charge clusters encoded in the H-2K/t-w5 region of mice JOURNAL Mol. Cell. Biol. 10, 138-145 (1990) STANDARD simple staff_review FEATURES from to/span description pept 301 1611 MHC H-2K/t-w5-linked ORF precursor sigp 301 396 MHC H-2K/t-w5-linked ORF signal peptide (put.) matp 397 1608 MHC H-2K/t-w5-linked ORF mRNA < 1 2252 MHC H-2K/t-w5-linked ORF mRNA BASE COUNT 491 a 586 c 702 g 472 t 1 others ORIGIN 1 tcccggagcc ggtgagaggt ccctgctgct cccttacggc gctttctagg cctttacccc 61 aacgagtggg ccatagagac gcgggcccag agagaccgta aagttgctga tcaaaggcta 121 gagcggtgtc gggggtgggg ggctgcatcc aggaagggtg ttggggatga ggtggaccgg 181 ccttggggac aatgtaagag cggagcaagt agtatagagg aagggcttca agggacgcgg 241 atcccgaata ggtagattga gagtcaagtc gagtcgtctc ttgttcctcc ggtcagcgtg 301 atgaccatgg gcctgcgggc cccccactgg gtggctgtgg gactgctgac ctgggcggct 361 ttggggctgc tggtggccgg acacgagggt catggtgacc tgcacaaaga tgtggaagag 421 gacttccatg gccacagtca cgggcactca catgaagatt tccaccatgg ccacagccac 481 gggcactcgc atgaagattt ccaccatggc cacggccaca ctcatgaaag catctggcat 541 gggcatgccc acagccacga ccatggacac tcacgtgagg aattacacca tggccatagc 601 catggccact cccacgatag cctccaccac ggaggacatg gacatgccca tcgtgaacat 661 agccatggga cttctaggga ggctggggct ccaggcatca aacaccacct ggacactgtc 721 accctctggg cctacgcact gggggccaca gtgctgatct ccgcagctcc gttcttcgtg 781 ctgttcctca tcccagtaga atctaactct cccaggcacc gctctctgct ccagatcctg 841 ctcagttttg cttccggggg gctcctgggt gatgcgttcc tccacctcat cccgcatgcc 901 ttggaacctc attctcacca cgctccagag cagcctganc atggacactc ccacagtggc 961 cagggcccca ttctctctgt ggggctgtgg gttctcagtg ggattgtcgc cttcctcgtg 1021 gtggagaaat ttgtgagaca cgtgaaagga ggacatggac acagtcacgg acacggagac 1081 aggcatgcgc atggagacag tcacacccat ggagatagac atgagtgttc ttcaaaggaa 1141 aagcccagca cagaggaaga gaaggaagtg ggcgggttgc ggaaaaggag aggaggaaac 1201 actgggccca gagatggccc ggtgaaacct cagagccctg aagaagaaaa agcaggctca 1261 gacctgcgtg tgtctgggta cctgaatctg gctgctgact tggcacacaa cttcacagac 1321 ggtctggcca ttggtgcttc ctttcgtggg ggccgagggc tagggatcct gaccacaatg 1381 acagtcctgc tgcacgaact gcctcatgag gttggggatt ttgccatcct ggtccagtct 1441 ggctgcagca agaagcaggc gatgcgtctg caactcgtga ctgcaattgg agcattggca 1501 ggcactcgtg tgcccttctc accgagggag gggcagtgga cagtgacgtg gcaggtggtg 1561 caggtcctgg ctgggtcctg ccattcactg caggcggatt tatctacgta gcaacagtgt 1621 ctgtgctgcc tgagctattg agagaccgat ctccactgca gtcactgttg gaggtgctgg 1681 ggctgctggg gggtgttgcc atgatggtac tgattgccca tcttgagtga ggggtgaggt 1741 gacctgtcct tcccctaact cctaataaag gcagttggta gtcctggcca gtgctaatgc 1801 cagaaggagt gtcagccttg gagattagca gagcctctgt ttgaggcctt agaggtatga 1861 ggatcatacg gagcatgaga ggccagaagg accacggagt gggagactgc ccagcagcgt 1921 tgttgctttt ggaaaaatca atgggaccat gaagaagact gggaagacgg tagcctaccc 1981 gatggtccct aatctacgat ttctcagcgc cacgtgccgc aggggtccct ccaaggccca 2041 tctccctctg aactagttag tgatggcttc agggaagacc tggcagaagg actggggtag 2101 acatcaatcg tgtgtcctga tttgaagggg gaggggggct ccttgggaag atgtctcagc 2161 ctgatttttt gtctctactc attttatacc actgtttgaa tgggtgcgga ggaacggtga 2221 ccaggaataa aagaccttgg atcttcggcc cc // LOCUS PSEMVAAB 1117 bp ds-DNA BCT 16-FEB-1990 DEFINITION P.mevalonii HMG-CoA-reductase (mvaA) gene, 3' end, and HMG-CoA-lyase (mvbB) gene, complete cds. ACCESSION M31807 KEYWORDS 3-hydroxy-3-methyl glutaryl CoA-lyase; 3-hydroxy-3-methyl glutaryl CoA-reductase; HMG-CoA-lyase; HMG-CoA-reductase. SOURCE P.mevalonii DNA. ORGANISM Pseudomonas mevalonii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Acidiphilium cryptum; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1117) AUTHORS Anderson,D.H. and Rodwell,V.W. TITLE Nucleotide sequence and expression in Esherichia coli of the 3-hydroxy-3-methylglutaryl coenzyme A lyase gene of Pseudomonas mevalonii JOURNAL J. Bacteriol. 171, 6468-6472 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 135 HMG-CoA-reductase (mvaA; AA at 1) pept 147 1052 HMG-CoA-lyase (mvaB) BASE COUNT 188 a 362 c 380 g 187 t ORIGIN 1 catgcgcgca atattgccgt ggtggcgggc gcccgaggcg atgaggtgga ctgggttgcc 61 cggcagttgg tggaatacca cgacgtgcgc gccgaccgcg ccgtagcagt gctgaaacaa 121 aagcgcggcc aatgagcgtg gtcccgatgc aagcggtaaa ggtctttgaa gtcggccccc 181 gtgacggcct gcagaacgaa cgccagccgc tgtcggtggc cgcccgtgtg ggcttgatcg 241 gcgaactggc tggcaccggc ctgcggcata tcgaagccgg cgccttcgtg tcgccgcgct 301 gggtgccgca gatggccggc agcgacgagg tgttgcgcca gttgcccagc aacgacgggg 361 tcagttacac ggccctggtg cccaaccggc aaggcttcga ggccgcgcaa cgggctggct 421 gccgcgaggt agcggtgttc gccgccgcct ccgaggcgtt ttcgcgcaac aacatcaatt 481 gctccatcga tgaaagcttc gagcgcttca ccccggtgtt gcgcgccgcc aacgaagcct 541 ctatccgggt gcgcggttat gtatcctgcg tgctcggttg cccgttcagt ggggccgttg 601 cgccggaggc tgtggccaag gtcgcacgcc gcctgtacga actgggctgc tacgaaatca 661 gcctgggtga caccattggc gccggccgcc cggatgaaac ggctcaattg ttcgagctct 721 gcgcacggca actgccggtc gcggcactgg ccggccactt ccacgatacc tggggcatgg 781 ccatcgccaa tgtgcatgcc gcactcgcgc agggtgtacg caccttcgac agctcggtcg 841 cgggcctcgg cggctgcccc tactcgccgg gtgccagcgg taacgtggcc acggaagatc 901 tgttgtacct gctgcacggc ctgggctaca gcaccggtgt cgacctggag gcggtggcac 961 aggttggtgt gcgcatcagc gcgcagctgg gcaccgccaa ccgctcccgt gccggccttg 1021 ccctggcagc aaggagcgcc cgcgaacact gatagcgcct gtggctcccg aacaaaaaca 1081 acaagcccgc aagggcagga gaatgccaat gaccttt //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS HUMFAPS 1148 bp ss-mRNA PRI 12-JAN-1990 DEFINITION Human farnesyl pyrophosphate synthetase mRNA, complete cds. ACCESSION J05262 KEYWORDS farnesyl pyrophosphate synthetase. SOURCE Human hepatoma cell line HepG2, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1138) AUTHORS Wilkin,D.J., Kutsunai,S.Y. and Edwards,P.A. TITLE Isolation and sequence of the human farnesyl pyrophosphate synthetase cDNA: Coordinate regulation of the mRNAs for farnesyl pyrophosphate synthetase, 3-hydroxy-3-methylglutaryl coenzyme A reductase, and 3-hydroxy-3-methylglutaryl coenzyme A synthase by phorbol ester JOURNAL J. Biol. Chem. 265, 4607-4614 (1990) STANDARD full staff_review REFERENCE 2 (bases 1139 to 1148) AUTHORS Wilkin,D.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.J.Wilkin, 12-JAN-1990, for release after publication. FEATURES from to/span description pept 7 1068 farnesyl pyrophosphate synthetase (EC 2.5.1.1) BASE COUNT 302 a 272 c 324 g 250 t ORIGIN 1 cacagaatga acggagacca gaattcagat gtttatgccc aagaaaagca ggatttcgtt 61 cagcacttct cccagatcgt tagggtgctg actgaggatg agatggggca cccagagata 121 ggagatgcta ttgcccggct caaggaggtc ctggagtaca atgccattgg aggcaagtat 181 aaccggggtt tgacggtggt agtagcattc cgggagctgg tggagccaag gaaacaggat 241 gctgatagtc tccagcgggc ctggactgtg ggctggtgtg tggaactgct gcaagctttc 301 ttcctggtgg cagatgacat catggattca tcccttaccc gccggggaca gacctgctgg 361 tatcagaagc cgggcgtggg tttggatgcc atcaatgatg ctaacctcct ggaagcatgt 421 atctaccgcc tgctgaagct ctattgccgg gagcagccct attacctgaa cctgatcgag 481 ctcttcctgc agagttccta tcagactgag attgggcaga ccctggacct cctcacagcc 541 ccccagggca atgtggatct tgtcagattc actgaaaaga ggtacaaatc tattgtcaag 601 tacaagacag ctttctactc cttctacctt cctatagctg cagccatgta catggcagga 661 attgatggcg agaaggagca cgccaatgcc aagaagatcc tgctggagat gggggagttc 721 tttcagattc aggatgatta ccttgacctc tttggggacc ccagtgtgac cggcaaaatt 781 ggcactgaca tccaggacaa caaatgcagc tggctggtgg ttcagtgtct gcaacgggcc 841 actccagaac agtaccagat cctgaaggaa aattacgggc agaaggaggc tgagaaagtg 901 gcccgggtga aggcgctata tgaggagctg gatctgccag cagtgttctt gcaatatgag 961 gaagacagtt acagccacat tatggctctc attgaacagt acgcagcacc cctgccccca 1021 gccgtctttc tggggcttgc gcgcaaaatc tacaagcgga gaaagtgacc tagagattgc 1081 aagggcgggg agaggaggct ctcaataaat aatcgtgtaa ccttaaaaaa aaaaaaaacc 1141 tcgacgat //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS MUSPROA 1987 bp ss-mRNA ROD 23-JAN-1990 DEFINITION Mouse protective protein (Mo54) mRNA, complete cds. ACCESSION J05261 KEYWORDS glycoprotein; protective protein; serine carboxypeptidase. SOURCE Mouse (strain BCBA) testis and brain, cDNA to mRNA, clones lambda-B1 and lambda-T2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1987) AUTHORS Galjart,N.J., Gillemans,N., Meijer,D. and d'Azzo,A. TITLE Mouse "protective protein": cDNA cloning, sequence comparison, and expression JOURNAL J. Biol. Chem. 265, 4678-4684 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.d'Azzo, 22-JAN-1990, for release after publication. FEATURES from to/span description pept 214 1638 protective protein precursor sigp 214 282 protective protein signal peptide matp 283 1635 protective protein signal 1966 1971 poly-A signal BASE COUNT 453 a 580 c 522 g 432 t ORIGIN 1 ggaactgctc gatgcgcaga tagggttcaa ccagtagggc ttggggacca ctggggaccg 61 ctgcctccgc ttctggttga ccttcccggg gcacacggtt gtggtggcgt gtgatagcga 121 agacccaact gtggccaagg ctgaccaagt caggcagtga aaactcgggt acagcggcca 181 gactggcggg atccagagcg gtcaggccaa gacatgcccg gaaccgcgct gtctccactg 241 ctcttgttgc tgctcctgtc ctgggcgtcc cggaacgaag cagctccgga ccaggatgaa 301 atcgattgtc tccccggcct ggccaagcag ccctctttcc ggcaatactc cggctacctc 361 agagcatcgg actccaagca cttccactac tggtttgtgg agtcgcagaa cgacccaaag 421 aacagccccg tggtgctttg gcttaacggg ggtcccggct gcagctcgct cgatgggctg 481 cttacagagc acggcccctt tctgatccag ccagatggtg tcaccctgga gtacaacccc 541 tatgcttgga acctgattgc caacgtgctg tatatcgagt ccccagctgg ggtgggcttc 601 tcctactcgg atgacaagat gtacgtgacc aatgacacag aggtggcgga gaacaattat 661 gaagccctta aagacttctt ccgcctcttt ccggaataca aggacaacaa acttttcctg 721 acaggagaga gctatgctgg catctacatc cccaccttgg ctgtactggt catgcaggat 781 cctagcatga atcttcaggg gctggctgtg ggcaatggac ttgcctccta tgagcagaac 841 gacaactccc tggtctactt tgcctactac catggccttc tggggaacag actttggact 901 tcactgcaga cccactgctg cgctcagaac aagtgtaact tctatgacaa caaagaccca 961 gagtgtgtaa acaatctcct ggaagtgtct cgaattgtgg gcaaatctgg cctcaacatc 1021 tacaatctct atgctccgtg tgctggtggg gtgcccggca gacatagata tgaggacaca 1081 cttgtagtcc aggattttgg caacatcttc actcgcctgc cacttaagcg gagatttcct 1141 gaggcactga tgcgttctgg ggacaaggta cgcttggatc ctccctgcac caacaccaca 1201 gccccttcca actacctcaa caacccctat gttcggaagg ctctccacat ccccgagtcg 1261 ctgccccgct gggacatgtg caacttcttg gtgaatttac agtaccgccg cctctaccaa 1321 agcatgaact cccagtacct gaagctgctc agttcacaga aataccagat cctgctctac 1381 aacggagatg tggacatggc ctgcaacttc atgggcgatg agtggtttgt ggattcgctc 1441 aaccagaaga tggaggtgca gcgccggccc tggctagtgg actacgggga gagcggagaa 1501 caggtagctg gtttcgtgaa ggagtgttca cacatcacct tcctcaccat caagggtgcc 1561 ggacacatgg tccccacgga caagcctcga gctgctttta ccatgttctc gaggttcctg 1621 aacaaagagc cttactgata gcttccgctg caacccatgg aaatcctcag tacctgatca 1681 gcccctctcg gcctctctca gcagagagga gtcctccgct aatgctcccg tggggcaagt 1741 cctgtactaa aactgcccag acttctcaca gcctggtgca tcccagcctg ggcctcaggg 1801 cgtcacagac agcctaggac caagtgagca ctttgttccg gccatcgtgg ggtggcctga 1861 cctccccgac tgcataaaaa ctgaccaccc caggacccca acagagctag agaaacccaa 1921 ggaggggggt ggatggattc taattgatgg ttgactaatt ctggaattaa atcgggtata 1981 gcttcca //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS RATCAA 1150 bp ds-DNA ROD 07-FEB-1990 DEFINITION Rat cytosolic aspartate aminotransferase gene, exon 1. ACCESSION J05263 KEYWORDS cytosolic aspartate aminotransferase. SOURCE Rat (strain Wistar) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1150) AUTHORS Pave-Preux,M., Aggerbeck,M., Veyssier,C., Bousquet-Lemercier,B., Hanoune,J. and Barouki,R. TITLE Hormonal discrimination among transcription start sites of aspartate aminotransferase JOURNAL J. Biol. Chem. 265, 4444-4448 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Barouki, 05-FEB-1990, for release after publication. FEATURES from to/span description pept 684 / 801 cytosolic aspartate aminotransferase, exon 1 (EC 2.6.1.1) pre-msg 592 > 1150 cytosolic aspartate aminotransferase mRNA and intron (alt.) pre-msg 614 > 1150 cytosolic aspartate aminotransferase mRNA and intron (alt.) pre-msg 618 > 1150 cytosolic aspartate aminotransferase mRNA and intron (alt.) pre-msg 620 > 1150 cytosolic aspartate aminotransferase mRNA and intron (alt.) pre-msg 628 > 1150 cytosolic aspartate aminotransferase mRNA and intron (alt.) pre-msg 630 > 1150 cytosolic aspartate aminotransferase mRNA and intron (alt.) pre-msg 635 > 1150 cytosolic aspartate aminotransferase mRNA and intron (alt.) pre-msg 649 > 1150 cytosolic aspartate aminotransferase mRNA and intron (alt.) IVS 802 > 1150 cytosolic aspartate aminotransferase intron A BASE COUNT 232 a 334 c 338 g 246 t ORIGIN 1 ctgcagggtc tctcctgcca gcgatcaagg tagacaacac ccccaggttt ctttctgcgg 61 aagcactgag tggagatgga gaaggtcacc cgcgatcggg tacagcgtgt ccaggcggac 121 cactagagtc cagtgtcggg gacttggctg tcagcccggc ctggctaacg caggaacgtg 181 cgcaggaacg gagcctggcg gggacagcca cctgctcagc tggtacagaa agacctgttc 241 tctactattc ccgagggcgg gccaccgccc caggtctcca ggacccagaa ggtcaggagg 301 ctgtcctcta gggactcccg ccacccgcat catcctcatc accctaagga ataaaagcct 361 cagcgattgg aacacgctat gccaatcatc ctagtctttg cccacgcctc ctccggtgtc 421 cactgcattg gttgcatcat acaagcctcc gataagatcc tcctcttgaa ttggctaata 481 gagccttgtc ccgcccaacc cttctttgtc ggaaggttgt gattggttaa gcgcgttgcc 541 aagctccggg cgcggctcgg tcattggaag ccgcgggctg ccccgcccgc cggtgaaggt 601 gagagtctcc tccagtcaca ggtccccgca ccgcacggag cagctcccag ctcgtgctcg 661 aggtcccctc agattccatc gcgatggccc ctccatcatt ctttgcccag gttccacagg 721 ccccgccggt tctggtcttt aagctcattg cggacttccg ggatgatccc gatccccgca 781 aggttaacct cggcgtggga ggtaaggatg cagtgccctg gaatgcgagc agctttgtgg 841 ggggaggtgg gcttaggagg aactgaggct tagtttggtg cctccctgag aggggactgg 901 ggagagaagg ctgggaccac gagggcatca ccagcctacc catttcacag atggcgatac 961 agagtccttg gaaagaggcg gtgttcaagg tcacacagga caggtgtaac atctgggaca 1021 catgccgttt ttgtttcttc caccctgtct tctatatagt acttgttcta aagcaaccat 1081 ggggtggtaa ggagagagaa gtctggtagc tttggcgtca tggtcagctg ttagcctaat 1141 ggagctgcag //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS RATSKR 2979 bp ss-mRNA ROD 16-FEB-1990 DEFINITION Rat substance K receptor mRNA, complete cds. ACCESSION M31838 KEYWORDS substance K receptor. SOURCE Rat stomach, cDNA to mRNA, clone prTKR1-1. ORGANISM Rattus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2979) AUTHORS Sasai,Y. and Nakanishi,S. TITLE Molecular characterization of rat substance K receptor and its mRNAs JOURNAL Biochem. Biophys. Res. Commun. 165, 695-702 (1989) STANDARD simple staff_review FEATURES from to/span description pept 449 1621 substance K receptor signal 2957 2962 poly-A signal BASE COUNT 721 a 859 c 729 g 670 t ORIGIN 1 ggcccacaga cttggcacac tgtgcacctc cgtcatctgg gatgtaccct gaatagcatt 61 aagagagatc tgaattcaaa ttcgggtttg tcaccaactt ccggaatgcc tgttgctgga 121 cgcgtctttt ctctgactct cagcgttctg actctggtgt gaaagagagg agccgatcca 181 acgcgctatc aggatgcctc ctgctctgag actggagttt gagttctcca tctccaagag 241 atgcttgaac agtctgattg ggaccttgct tcctgctcag cctgactcat ttctgcccga 301 gctgtaggcg tggccctctg gctacacgtg tctgcttttg aatcgtcctt ccccccttaa 361 atctacacac cgaagaggat ttctcttctg tggcaattcg agacggaact gcagggccag 421 accctgtgtt ccaggcccag aatcagtcat ggggacccgt gccattgttt ctgacgccaa 481 catcttgtct ggcctcgaga gcaatgccac gggtgttaca gccttctcca tgcctggctg 541 gcagctggcg ttatgggcca cagcctacct ggccctggtg ctggtggctg tgacaggcaa 601 tgccacagtc atctggatca ttctggccca cgagaggatg cgcacagtca ccaactattt 661 catcatcaac ctggccttgg cggacctctg catggcagcc ttcaacgcga ccttcaactt 721 catctacgcc agtcacaaca tctggtactt cggccgtgcc ttctgctatt tccagaacct 781 ctttcccatc acagccatgt tcgtcagcat ctactccatg accgccattg ctgctgacag 841 gtacatggcc atcgttcacc ccttccagcc acggctctcg gcccccagca ccaaggcgat 901 tatcgctggc atctggctgg tagccctggc tctcgcctcc ccgcaatgct tctactccac 961 catcactgtg gacgaggggg ccaccaagtg tgtggtggcc tggcccaatg acaacggagg 1021 caagatgctc ctactgtatc atctggtcgt gtttgtcctc atctacttcc tgcctctcct 1081 ggtgatgttc ggggcttaca gtgtcatcgg cctcacactg tggaagcgcg cggtacccag 1141 acaccaggct cacggagcca acttacgcca tctacaggcc aagaagaagt ttgtgaaggc 1201 catggtactg gtggtgctga catttgccat ctgctggctg ccctaccacc tctacttcat 1261 cctggggacc ttccaagagg acatctacta ccacaagttt atccagcagg tctacctggc 1321 gctcttctgg ctggccatga gctccacaat gtacaaccct atcatttatt gctgccttaa 1381 ccacaggttt cgctctggat tccggcttgc tttccggtgc tgcccctggg tgacaccaac 1441 tgaggaagac aggctggagc tgactcacac gccatccctc tccaggagag tcaaccggtg 1501 tcataccaag gagactttgt tcatgacggg ggacatgacc cactctgagg ctaccaatgg 1561 acaggttggg agtccccagg atggggagcc tgctggaccc atctgcaagg cccaggctta 1621 gcccctttga gaagcagctg acggaaaagc ccctacataa tagcttgtcc cacaccccat 1681 tcttcaacac accaacagaa aaaacagaat ggtgctcgga atgccacagg caatacaggt 1741 gtcaacaaag aactctgagc acagacacct gtcacaggca atccagtgat ggaggaaggc 1801 gctgggagga ctgggaagcc acacttggcc aacctagaag ctgcaaagtg agatgtctcc 1861 caggagtata gagagagaag gaaaactgga tcccagcgaa gaaaactgca gttgtaatga 1921 gaagggcagc tgaaggacag agagatggct tggcgaggtg gaaaagcctc tgctaagtaa 1981 gcagtggaaa agcctctgct aagccttgtg acctaagtgc agtacctgca acccacaggg 2041 aaggtaagaa ccaaagccta caagttgtcc tctgacttcc tccaatgtat gcatgcatca 2101 tgtatgcatc tgtgtgtgca catacatctg cacatgcacg cacatacaaa atatgcgcat 2161 gcacgtgcac acttacaatg ttaaagagga agaaggcgac agctcccaca ggactgtaac 2221 agattgcaga ggacaagcgc ccaaagggaa acccagctgc ttcgctcgct ctggaacaca 2281 attgccaggt tttccgaata aagaaagcct aagatccaaa tatatatgta tgggagctcc 2341 gggcttctaa atttggttca tgaaaacgct tagagacact ggctaaccca gagcggtgca 2401 catccacacc accacctcct cgccaaagac aagccacaaa ggaagtcacc tatttggctg 2461 cgaatggatc catggttttt ctaccttagg atctaaaagc tcccatccag cccagagggg 2521 agggcagtta acagtccaga ggagcttcag ctgagatgta agatactctg accccctgct 2581 tgatggcagg attgccgtga ggcttgttgg aaacgatgcc ttcaaaccag caggccacat 2641 ccagcgaaat gctcaataaa tagtaaccac gattcttcgc acggcgtcag cttttaagac 2701 acaaaagcca cctacaagtt aggagaatgg ctccccctct cacttctgca gtgaggagct 2761 ggcaggcaga gcacaccgat ggtcccagtt ctaagaacgc ctggcctagg ccgctccagg 2821 ccagtccgtc tccctgcttg gtggtcactc atttgctgtc taccccgtgc cctctgaaag 2881 ccaaagacgc tgtaaacacg tgttcaaagt ttaatgtcac ctgagctttc ccagagttca 2941 ctgcgccagt acccccaata aagggttctg atgtcactt //
GenBank-Updates@genbank.bio.net (04/07/90)
LOCUS RATIGFBP3 2352 bp ss-mRNA ROD 16-FEB-1990 DEFINITION Rat insulin-like growth factor-binding protein (IGF-BP3) mRNA, complete cds. ACCESSION M31837 KEYWORDS insulin-like growth factor-binding protein. SOURCE Rat ovary, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2352) AUTHORS Shimasaki,S., Koba,A., Mercado,M., Shimonaka,M. and Ling,N. TITLE Complementary DNA structure of the high molecular weight rat insulin-like growth factor binding protein (IGF-BP3) and tissue distribution of its mRNA JOURNAL Biochem. Biophys. Res. Commun. 165, 907-912 (1989) STANDARD simple staff_review FEATURES from to/span description pept 57 935 insulin-like growth factor binding protein precursor sigp 57 137 insulin-like growth factor binding protein signal peptide matp 138 932 insulin-like growth factor binding protein mRNA < 1 2352 IGF-BP3 mRNA BASE COUNT 573 a 623 c 606 g 550 t ORIGIN 1 gggagcagcg agcaagcagg tcctcagcgt ccagtcaccg ctctaagcca ggcgccatgc 61 atcccgcgcg ccccgcgctc tgggcggctg cgctcaccgc cctcactctg ctccgcggac 121 cgccagtggc gcgggccggc gcgggcgcgg tgggcgcggg ccccgtggtg cgctgcgaac 181 cgtgcgacgc gcgtgcgctg gcccagtgcg cgcctccgcc caccgcgccc gcgtgcacgg 241 agctggtgcg agaacccggc tgcggctgct gcctgacttg cgcgctgcgc gaaggcgacg 301 cgtgcggcgt ctacacggag cgctgtggca ccggcctccg ctgccagccg cgaccggccg 361 agcagtatcc cctgaaggcg ctgctgaatg gccgcgggtt ctgcgccaac gccagcgccg 421 ccagcaacct gagtgcctac ctcccctccc agccgtctcc tggaaacacc actgagtctg 481 aggaggacca caatgctggg agtgtggaaa gccaggttgt ccccagcaca catcgcgtga 541 ctgattccaa gttccatcca ctccattcaa agatggaggt catcataaaa ggccaggcta 601 gggacagcca gcgctacaaa gttgactatg agtcccagag cacagacacc cagaacttct 661 cctccgagtc taagcgggag acagaatatg gtccctgccg cagagaaatg gaggacacac 721 tgaatcatct gaagttcctc aatgtgctga gtcccagggg cgtccacatc ccaaactgtg 781 acaagaaggg gttctataag aagaaacagt gtcgcccttc caaaggcaga aagcggggct 841 tctgctggtg cgtggacaag tacgggcagc cattgccagg ctatgacacc aaggggaaag 901 acgacgtgca ttgcctcagc gtgcagagcc agtagatacc gctgtgccac ttaacgtgga 961 gctcaaatac gccttatttt gcacaaaaga ctgccaacaa cgtgatcagc agctggctat 1021 accttgattt atatttctct ctctctctct ctctctctct ctctctctct ctctctcttt 1081 tgtggtgaac tgaataaaaa caaacaaaac acatacaaaa acaaaaacaa aaaaaaaagc 1141 caagtttaga cagatttctg aaatgcctct ggttgtttaa atagtgaact tggtcatctt 1201 tgtatctcgc agtagtcaac caaaagcagt ttgaattttc ttgttgcttc ctatgaaaac 1261 cacacgtgta ctccaggcca cggatgccgt cgccccctaa ctcacccacc cactgtgggc 1321 ttcagtgctg ctggccctct gccttcttga tttcagaggc tctgttgctg atagagaaaa 1381 accctctttc catcccctgt aagtaagtgc aggcactgtg gagaatgggg aagcctggaa 1441 cccagtgacc cggacgtctg gaagcatcct cctgaggcct ctggtcctta ttgtgccatc 1501 tctgaatcaa gggcctggcc ctgtatctgc aagtggcctg acctacttgg gaactgtggg 1561 agagaaaaat gtgttgtctc tcttactaaa aatgactaag aatgttctag ggcgctccga 1621 gagcccataa agacaaggac aaggaccttc ctttgtcagg cagcttcctg atgacttggc 1681 ccagcagaaa tatcaaactc catgtgcaga gatgtcgcaa ataacggtgc gcttagttct 1741 ccggatgact tcaagaaaac agtgttttct ggcccagcct ctcaaaataa aatttgttgt 1801 ggggtggggc tgaggggagg cagctttcaa aagagagaag gttttcatct tccttgttgg 1861 agaccctggt aagaacatgg agagaatcac ctgtttgttg atcttggggt ccttctcaaa 1921 ctttctttat aattcatgcg tatatgcaga caaaatatgt tcttaattgt taacattgta 1981 tacaacatag cccaaatata ttagaatctg tactagataa tcctagataa aaggttagag 2041 atgctaggtg atgtaaccac agacacgccc gaggaaagga gcctgtgtct ggaggctggg 2101 ccgctttccc cgaggccaag gccatggtgg tctggcaata cagggtgtga ggagactgta 2161 ctgcatccca cggggtggac atgcgctgta cagagctttc cttgagagca caaaggaatc 2221 ttgagacatt ctgcctgcct gtcagctttt ctttattttt ttaattaagt ttttggggga 2281 aaaatgtatt tttgaaaagt ttgtcttgca atgtatttat aaatagtaaa taaagttttt 2341 ttactattta ag //
GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS HUMTSHRX 2415 bp ss-mRNA PRI 22-FEB-1990 DEFINITION Human thyroid stimulatory hormone receptor (hTSHR) mRNA, complete cds. ACCESSION M32215 KEYWORDS thyroid stimulatory hormone receptor. SOURCE Human thyroid, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2415) AUTHORS Misrahi,M., Loosfelt,H., Atger,M., Sar,S., Guiochon-Mantel,A. and Milgrom,E. TITLE Cloning, sequencing and expression of human TSH receptor JOURNAL Biochem. Biophys. Res. Commun. 166, 394-403 (1990) STANDARD simple staff_review FEATURES from to/span description pept 45 2339 thyroid stimulatory hormone receptor /hgml_locus_uid="TSHR" /map="22q11-q13" /nomgen="LY0058G" BASE COUNT 638 a 624 c 527 g 626 t ORIGIN 1 ggcgatttcg gaggatggag aaatagcccc gagtcccgtg gaaaatgagg ccggcggact 61 tgctgcagct ggtgctgctg ctcgacctgc ccagggacct gggcggaatg gggtgttcgt 121 ctccaccctg cgagtgccat caggaggagg acttcagagt cacctgcaag gatattcaac 181 gcatccccag cttaccgccc agtacgcaga ctctgaagct tattgagact cacctgagaa 241 ctattccaag tcatgcattt tctaatctgc ccaatatttc cagaatctac gtatctatag 301 atgtgactct gcagcagctg gaatcacact ccttctacaa tttgagtaaa gtgactcaca 361 tagaaattcg gaataccagg aacttaactt acatagaccc tgatgccctc aaagagctcc 421 ccctcctaaa gttccttggc attttcaaca ctggacttaa aatgttccct gacctgacca 481 aagtttattc cactgatata ttctttatac ttgaaattac agacaaccct tacatgacgt 541 caatccctgt gaatgctttt cagggactat gcaatgaaac cttgacactg aagctgtaca 601 acaacggctt tacttcagtc caaggatatg ctttcaatgg gacaaagctg gatgctgttt 661 acctaaacaa gaataaatac ctgacagtta ttgacaaaga tgcatttgga ggagtataca 721 gtggaccaag cttgctggac gtgtctcaaa ccagtgtcac tgcccttcca tccaaaggcc 781 tggagcacct gaaggaactg atagcaagaa acacctggac tcttaagaaa cttccacttt 841 ccttgagttt ccttcacctc acacgggctg acctttctta cccaagccac tgctgtgcct 901 ttaagaatca gaagaaaatc agaggaatcc ttgagtcctt gatgtgtaat gagagcagta 961 tgcagagctt gcgccagaga aaatctgtga atgccttgaa tagccccctc caccaggaat 1021 atgaagagaa tctgggtgac agcattgttg ggtacaagga aaagtccaag ttccaggata 1081 ctcataacaa cgctcattat tacgtcttct ttgaagaaca agaggatgag atcattggtt 1141 ttggccagga gctcaaaaac ccccaggaag agactctaca agcttttgac agccattatg 1201 actacaccat atgtggggac agtgaagaca tggtgtgtac ccccaagtcc gatgagttca 1261 acccgtgtga agacataatg ggctacaagt tcctgagaat tgtggtgtgg ttcgttagtc 1321 tgctggctct cctgggcaat gtctttgtcc tgcttattct cctcaccagc cactacaaac 1381 tgaacgtccc ccgctttctc atgtgcaacc tggcctttgc ggatttctgc atggggatgt 1441 acctgctcct catcgcctct gtagacctct acactcactc tgagtactac aaccatgcca 1501 tcgactggca gacaggccct gggtgcaaca cggctggttt cttcactgtc tttgcaagcg 1561 agttatcggt gtatacgctg acggtcatca ccctggagcg ctggtatgcc atcaccttcg 1621 ccatgcgcct ggaccggaag atccgcctca ggcacgcatg tgccatcatg gttgggggct 1681 gggtttgctg cttccttctc gccctgcttc ctttggtggg aataagtagc tatgccaaag 1741 tcagtatctg cctgcccatg gacaccgaga cccctcttgc tctggcatat attgtttttg 1801 ttctgacgct caacatagtt gccttcgtca tcgtctgctg ctgttatgtg aagatctaca 1861 tcacagtccg aaatccgcag tacaacccag gggacaaaga taccaaaatt gccaagagga 1921 tggctgtgtt gatcttcacc gacttcatat gcatggcccc aatctcattc tatgctctgt 1981 cagcaattct gaacaagcct ctcatcactg ttagcaactc caaaatcttg ctggtactct 2041 tctatccact taactcctgt gccaatccat tcctctatgc tattttcacc aaggccttcc 2101 agagggatgt gttcatccta ctcagcaagt ttggcatctg taaacgccag gctcaggcat 2161 accgggggca gagggttcct ccaaagaaca gcactgatat tcaggttcaa aaggttaccc 2221 acgagatgag gcagggtctc cacaacatgg aagatgtcta tgaactgatt gaaaagtccc 2281 atctaacccc aaagaagcaa ggccaaatct cagaagagta tatgcaaacg gttttgtaag 2341 ttaacactac actactcaca atggtagggg aacttacaaa ataatagttt cttgaatatg 2401 cattccaatc ccatg //
GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS MUSHSP70 2249 bp ss-mRNA ROD 23-FEB-1990 DEFINITION Mouse heat shock-related protein (hcs70t) mRNA, complete cds. ACCESSION M32218 KEYWORDS heat shock-related protein. SOURCE Mouse male testicular germ cell, cDNA to mRNA, clones GE6 and HS2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2249) AUTHORS Matsumoto,M. and Fujimoto,H. TITLE Cloning of a hsp70-related gene expressed in mouse spermatids JOURNAL Biochem. Biophys. Res. Commun. 166, 43-49 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1 1893 heat shock-related protein BASE COUNT 612 a 569 c 583 g 484 t 1 others ORIGIN 1 atggctgcta ataaaggaat ggcgatcggc atcgacctgg gcaccaccta ctcgtgcgtg 61 ggcgtgttcc aacacggcaa ggtggagatc atcgccgact accaggtcaa ccgcaccacc 121 cctgactacg tggccttcac cgacaccagc cgcctcatcg agcgctccaa gaaccaggtg 181 gccatgaatc cccagaacac tgtttttgat gccaaacgtc taattggcag gaagtttaat 241 gatcctgttg tgcagtcaaa tatggaagct ttggccattt caatgatcaa tgaagccggc 301 aaacccaagg tgatggtgtc ctataaagga gagaagaaag ccttctaccc agaggagatc 361 tcatccatgg tactgacgaa gatgaaggag actgcagagg ttttttgggc accaatgtca 421 caacgtgtga tcacggtgcc agcctatttc aatgactctc agcggcaagc caccaaggat 481 gcaggtgtca tcgcaggact caatgtgctg agaataatca atgagcccac ggcggtcgca 541 tgggctaggt tggataaagg aagtcacgca gagcggcacg tgctcatctt cgacctgggg 601 ggtggcacgt tcgacttgtc catcctgacg atcgacgacg catcttcagt gaagccacgc 661 ggcgacacgc acctgggagg ggaggacttc gacaaccggc tggtgagcca cttcgtgaga 721 gtcaagagga agcacaagaa ggacaatcaa gccaagaaca agcgcgcggt gcggcggctg 781 cgcacggcgt gtgagagggc caagaggacg ctgtcgtcca gcacccaggv aaacctggag 841 atcgactctt tatatgaggg catcgacttc tacacgtcca tcactagagc acggtttgaa 901 gagctgtgtg cagacctatt tagaggcaca cttgagcccg tggaaaagtc tcttcgggat 961 gccaagatgg ataaggctaa aatccatgac attgttctag tagggggctc cacccgcatc 1021 ccaaaggtgc aaaaaatgct tcaggactac tttaatggac gggatctcaa caagagtatc 1081 aatcccgatg aggcagtcgc ctacggagct gcagtccagg gagctatttt aatgggcgac 1141 aaatctgaaa aagtacagga tttgcttttg ttggacgtag ctcccctgtc tctaggattg 1201 gagacagctg ggggtgtgat gactgtattg atcaagcgga actccaccat ccccaccaag 1261 cagacgaaga tcttcaccac ctactcggac aaccagcccg gggtattgat ccagtgtacg 1321 aggcaaacca tacccgacaa caacctggtg gggccctttg acttgactgg aatacctcct 1381 gcacctagcg tgccacagat cgaggtgacc ttcgacatcg cgcgcaacgg tatcctgaac 1441 gttacggcca tggacaagag caccggcaag gccaacaaga tcaccatcac caacgacaag 1501 ggtcgcctga gcaaggagga gattgactcg atcgagagcc ggactgctta caaacgtgag 1561 gatgagggtc agagggagaa aatcgctgcc aaaaatgcct tagaatcgta cgcctttaat 1621 atgaagagcg cttgtggtga tgagggtctg aaggacaaga tcacggagtc cgataaaaag 1681 aaaatactgg ataaatgcaa tgaggttcct ttcctgctga gtcaaaccca gctggctgag 1741 aaagtagagt ttgatcataa aagaaaagaa ctggaaaata tgtgtaatcc gatcatcaca 1801 aaactgtacc agagcggatg caccgggccc actgtacgcc agggtatact ccggcagggc 1861 tccacaggcc ctaccatcga ggaagtagat tagccttttc cagaattgca gggtgctagg 1921 gtgcctctag gcgaatttta ttcatcttca aacatcaata tgattcttga actgactgga 1981 ctcagcctac gtcaccagtc ctttggatcc cgatggagaa gctcgaacgc accttttcac 2041 acccccaccc gtcgtcagtc tatgatcctg aactggacct ttaggaaaac caggcccctc 2101 tttagagcca tttgagaatt tgatgtctgt atttattatc cacaccccac ctttctcctt 2161 cctgtgtgtt ggttatttgt ctctcaataa atttgttccc aaaggaaaca actctgcact 2221 tttattttag caggccttac tgatatgag //
GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS MUSGAS3A 1799 bp ss-mRNA ROD 26-FEB-1990 DEFINITION Mouse growth arrest specific (gas3) mRNA, complete cds. ACCESSION M32240 KEYWORDS growth arrest specific protein. SOURCE Mouse fibroblast cell line NIH-3T3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1799) AUTHORS Manfioletti,G., Ruaro,M.E., Del Sal,G., Philipson,L. and Schneider,C. TITLE A growth arrest specific (gas) gene codes for a membrane protein JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Philipson, 15-FEB-1990. FEATURES from to/span description pept 187 621 growth arrest specific protein (gas3) mRNA < 1 1799 gas3 mRNA BASE COUNT 444 a 487 c 388 g 480 t ORIGIN 1 ccggagcctc ccactgcccc cttgctttgc gcgcgcgtga cccgcagcac agctgtcttt 61 ggggacgcca gcaacccagt ggacgcaccg gagtttgtgc ctgaggctaa tctgctctga 121 gatagctgtc cctttgaact gaaacaggca ccgctcctct gatcccgagc ccaactccca 181 gccaccatgc tcctactctt gttggggatc ctgttcctgc acatcgcggt gctagtgttg 241 ctcttcgtct ccaccatcgt cagccaatgg ctcgtgggta atggacacac gactgatctc 301 tggcagaact gtaccacatc cgccttggga gccgtccaac actgctactc ctcatcagtg 361 agcgaatggc tgcagtctgt ccaggccacc atgatcctgt ctgtcatctt cagcgtcctg 421 gctctgttcc tgttcttctg ccagctcttc actctcacca aaggcggccg gttttacatc 481 actggattct tccaaatcct tgctggtctg tgcgtgatga gtgcagcggc catctacaca 541 gtgaggcaca gtgagtggca tgtcaacact gactactcct atggcttcgc tacatcctgg 601 cctgggtggc ctttccccta gccctcctca gtggtatcat ctatgtgatc ctgcggaaac 661 gcgaatgagg cgcccgacga cgcaccgtcc gtctaggctc tgagcgcgca tagggtccac 721 agggagggag gaaggaaacc agagaacaaa accaaccaac caaaaaagag ctagccccaa 781 acccaaacgc aagccaaacc aaacagaacg cagttgagtg gggattgctg ttgattgaag 841 atgtatataa tatctatggt ttataaaacc tatttataac actttttaca tatatgtaca 901 taggattgtt ttgcttttta tgttgaccgt cagcctcgtg ttgaatctta aacaacttta 961 catcctaaca ctataaccaa gctcagtatc tttgttttgt ttcgtttttt tttttaatct 1021 ttttgttttg ctcagacata aaaactccac gtggccccct ttcatctgaa agcagatacc 1081 tccctcccac tcaacctcat aggataacca aagtgtgggg acaaacccca gacagttgaa 1141 gacctttaca ctatgggtga cccagtgcat ttagcaggag tatccactgc ccgaatccat 1201 gtgtgaagcc ctaagcactc acagacgaaa agccctgacc ggaaccctct gcaaaaacag 1261 taatagctgg tggctcctga acacttgacc ctgtagacgg agtactgggg ccacacgttt 1321 aaatgagaag tcagagacaa gcaatctgtg aaatggtgct atagatttac cattccttgt 1381 tattactaat cgtttaaacc actcactgga aactcaatta acagttttat gcgatacagc 1441 agaatggaga cccgatacaa acggttcata actgctttca tacctagcta ggctgttgtt 1501 attactacaa taaataaatc tcaaagcctt cgtcagtccc acagttttct cacggtcgga 1561 gcatcaggac gagcatctag acccttggga ctagcgagtt ccctggcttt ctgggtctag 1621 agtgttctgt gcctccaagg actgtctggc gatgacttgt attggccacc aactgtagat 1681 gtatatacgg tgtccttctg atgctaagac tccagacctt tcttgttttt gcttgctttc 1741 tctgatttta taccaactgt gtggactaag atgcatcaaa ataaacatca gagtaactc //
GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS RATGFGDVA 630 bp ss-mRNA ROD 26-FEB-1990 DEFINITION Rat glioma-derived vascular endothelial cell growth factor mRNA, complete cds. ACCESSION M32167 KEYWORDS endothelial cell growth factor protein. SOURCE Rat cell line GS-9L, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 630) AUTHORS Conn,G., Bayne,M., Soderman,D., Kwok,P., Sullivan,K., Palisi,T., Hope,D. and Thomas,K. TITLE Amino acid and cDNA sequences of a vascular endothelial cell mitogen that is homologous to platelet-derived growth factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.L.Bayne, 16-FEB-1900. FEATURES from to/span description pept 5 577 endothelial cell growth factor protein precursor sigp 5 82 endothelial cell growth factor protein signal peptide matp 83 574 endothelial cell growth factor protein BASE COUNT 169 a 163 c 172 g 126 t ORIGIN 1 aaccatgaac tttctgctct cttgggtgca ctggaccctg gctttactgc tgtacctcca 61 ccatgccaag tggtcccagg ctgcacccac gacagaaggg gagcagaaag cccatgaagt 121 ggtgaagttc atggacgtct accagcgcag ctattgccgt ccgattgaga ccctggtgga 181 catcttccag gagtaccccg atgagataga gtatatcttc aagccgtcct gtgtgcccct 241 aatgcggtgt gcgggctgct gcaatgatga agccctggag tgcgtgccca cgtcggagag 301 caacgtcact atgcagatca tgcggatcaa acctcaccaa agccagcaca taggagagat 361 gagcttcctg cagcatagca gatgtgaatg cagaccaaag aaagatagaa caaagccaga 421 aaatcactgt gagccttgtt cagagcggag aaagcatttg tttgtccaag atccgcagac 481 gtgtaaatgt tcctgcaaaa acacagactc gcgttgcaag gcgaggcagc ttgagttaaa 541 cgaacgtact tgcagatgtg acaagccaag gcggtgagcc aggctgcagg aaggagcctc 601 ctcagggttt cgggaactag acctctcacc //
GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS HUMSHBGA 6087 bp ds-DNA PRI 31-JAN-1990 DEFINITION Human human sex hormone-binding globulin (SHBG) gene, complete cds. ACCESSION M31651 KEYWORDS human sex hormone-binding globulin. SOURCE Human adult testis DNA, clone hgSH!BG-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 6087) AUTHORS Hammond,G.L., Underhill,D.A., Rykse,H.M. and Smith,C.L. TITLE The human sex hormone-binding globulin gene contains exons for androgen-binding protein and two other testicular messenger RNAs JOURNAL Mol. Endocrinol. 3, 1869-1876 (1989) STANDARD simple staff_review COMMENT A region of SHBGr-3 mRNA contains an inverted exon. However, the paper does not state the boundaries. This exon is related to a palindromic sequence "atcttggctcagtctccacctccaagat" located at positions 4455-4482. FEATURES from to/span description pept 2643 2753 sex hormone-binding globulin, exon 1 2886 2977 sex hormone-binding globulin, exon 2 3150 3339 sex hormone-binding globulin, exon 3 3670 3831 sex hormone-binding globulin, exon 4 4059 4218 sex hormone-binding globulin, exon 5 4349 4485 sex hormone-binding globulin, exon 6 5218 5425 sex hormone-binding globulin, exon 7 5670 5818 sex hormone-binding globulin, exon 8 sigp 2643 2732 sex hormone-binding globulin signal peptide matp 2733 2753 sex hormone-binding globulin 2886 2977 sex hormone-binding globulin 3150 3339 sex hormone-binding globulin 3670 3831 sex hormone-binding globulin 4059 4218 sex hormone-binding globulin 4349 4485 sex hormone-binding globulin 5218 5425 sex hormone-binding globulin 5670 5815 sex hormone-binding globulin IVS 2754 2885 SHBG intron A IVS 2978 3149 SHBG intron B IVS 3340 3669 SHBG intron C IVS 3832 4058 SHBG intron D IVS 4219 4348 SHBG intron E IVS 4486 5217 SHBG intron F IVS 5426 5669 SHBG intron G mRNA 656 751 SHBG-2 mRNA, exon 1 (5' end unsure) < 2643 2753 SHBG-2 mRNA, exon 2 2886 2977 SHBG-2 mRNA, exon 3 3150 3339 SHBG-2 mRNA, exon 4 3670 3831 SHBG-2 mRNA, exon 5 4059 4218 SHBG-2 mRNA, exon 6 4349 4485 SHBG-2 mRNA, exon 7 5670 > 5818 SHBG-2 mRNA, exon 8 mRNA 540 596 SHBG-3 mRNA, exon 1 rpt 835 1223 Alu repeat rpt 1359 2002 Alu repeat rpt 4619 4928 Alu repeat BASE COUNT 1400 a 1632 c 1640 g 1415 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcggct agctcctaag gcgtgggtac ggaagctaga ttagagcaga agggccccgc 61 tgctccccga gcaggttccc aaggcgagcc cctccccctg cccccgcctc ctacgacccc 121 gctctggccg cgccactctg acccccgggt taccggcctg cagtcttcac ccgaatcagc 181 ctcaggatat ctccacagtc tccctccttg gcctctcgga tccgcacgga agccatccgg 241 atccccgctg tctgggacca aagtcccagg gcctcgcaaa cggcaactag accccttaaa 301 gggcctacgg acttggatcc tgaagagcct gagagagcgg ggtggcggga gtcggggggg 361 acggcggggt agccgcggcc tggtaagtgg agctgggatt ccggcgccgt acgggaggag 421 agagtaggcc agcgaggcga tcctctgtcc ggcatagccc caccccctcg aattctgtcg 481 cagcaggggg cacaactgtc agccaatcag cttggagaac aggcacggcc gcgtcccccc 541 caagccccac ccccgacagc tggatcttgt gactgggctc ctgggtagag ttcaaggttg 601 gagtgaagcg gcttccttgc ggttgtgtgg gtgtcccaac ctgggtcgag ataccccgcg 661 gttcaaaggc tcccccgcag tgctttttaa attgacatat gcagtgataa cctgctttag 721 cctcaggctc actcacccgc ccagaccctg ggtaagcctt aagaccctca gctctgaaag 781 ctgtttcctg cagctcttga gtagcatgaa gtgttacctc ttgggggcat ttgcattttt 841 aaatgtttta ttttatattt atttatttat ttatttttgg agatggtgta ttgctttgtc 901 gcccaggctg gggtgtagtg gcgcgatctc tgctcactgc agcctccacc tcccgagttc 961 aagcgatcct cctgtctcag cctccggagt agctggaact acagtcgcgc accggcacgc 1021 ccggctaatt tttttttctt ttttcttctc tttttttttg ttttaacgga ttctcactct 1081 gtcactcagg ctggagtgca gtggcgcgat ctcggctcac tgcaacctct ccctcctggg 1141 ttcaagcgat tctcctgcct caccctggag atagctggga ttacaggcat gggcaaccat 1201 acctggctaa tttttgtatt tttagtagag acggggtttc actatgttgg ccaggctggt 1261 ctggaactcc tgacctcaag tgatctgccc gcctcagcct tctaaagtgc ggggattaca 1321 gatgtgaccc accaagcccg gtctgtcatt tgcattttaa aatgggtcat ggggtgggca 1381 cagtggctca cacctgtaat cccagcattt tggggaggca gaggcaggcg gatcactgag 1441 atcaggaatt tgagaccagc ctgaccaaca tggtgaaacc cgtctctact aaaatacaaa 1501 attagacagg tgtggtggcg catgccttta ctcccagcta cacgggaggc tgagacagga 1561 gaatcgcttg aacctgggag gtggaggtta cagtgagccg agatcgtgcc attgcactcc 1621 agcctgggca acaaaagcga aactccatct caaaataaat aaataaaata aaatgcgtca 1681 gggagggtcg ggccttgtgg ctaatgcctg taatccaggt actttgggag gctgaggtgg 1741 gcggatcatt tcaggtcaga ggttcgagat cagcctgggc aacgtggtga aatccccgtc 1801 tctactaaaa acacaaaaaa atttgctggg cgtggtggtg cgtgcacctg tagtcccaac 1861 tactaaggag gctgagacag gaggatcgct tgaactcgag aggcagaggc agcagtgagc 1921 cgagatcact ccactgcact ccagcctggg tgatagagca agactctgtc taaaataaaa 1981 taaaataaaa taaaataaaa ttgggtcagg gagtgggtga tttctactgc tagactgttt 2041 aggccctgta ataaatggat aagggaagat aactgagagg cggggggcag gtcccttctt 2101 aatattcact gaatcataca cacagacaat accttcttgg gagacaggcc tcagaggctg 2161 ggaaaagact gggggaggag ttcagaccag atgccaggca ctgtgcctgc attttctcaa 2221 tgaaccctct ttcacagtca ccccgtaaag tattatttcc tcattttaca gcaaggacac 2281 tgaagcacaa aggtgaagtg acttggccca aggtcactca gggacagaaa tcttggagga 2341 cctagatcag gccctagagg aggagagggg agatggaata tcctctccca gttcagaaac 2401 tttctcggca gtggaggatg atagtggagg gactctgtcc ttcaccccat tgatccccag 2461 aggggtgata gctgagtctt gtgactgggc ccctgggcag gggtcaaggg tcagtgcccc 2521 tgtttccttt accccctcct ccccgggcaa cctttaaccc tccaccgccc acacgcaagg 2581 ctgcctgcct ctacacattc tcccaagagt tgtctgagcc gccgagtgga cagtggctga 2641 ttatggagag cagaggccca ctggctacct cgcgcctgct gctgttgctg ctgttgctac 2701 tactgcgtca cacccgccag ggatgggccc tgagacctgt tctccccacc caggtgcagg 2761 agcgggacag ggcactcagc tcatgcagtc ttcccttctc tcctctggcc ctgtagcagg 2821 gcctctccct ctgtctgtct ctgacatgtc cctactcagc tttgtttgtt ttctctttct 2881 gatagagtgc ccacgaccct ccggctgtcc acctcagcaa tggcccagga caagagccta 2941 tcgctgtcat gacctttgac ctcaccaaga tcacaaagta tggggttggc ctagcccttg 3001 acccagtccc ctggttctgc cctctctcca tcagctcttc tcttttccct gtcttccttt 3061 ccttatctgt gaacaccatc tcccccaaac ccacactggt tctcaaagga cacatgacat 3121 acacaatctt tccttctgtg tccttccaga acctcctcct cctttgaggt tcgaacctgg 3181 gacccagagg gagtgatttt ttatggggat accaacccta aggatgactg gtttatgctg 3241 ggacttcgag acggcaggcc tgagatccaa ctgcacaatc actgggccca gcttacggtg 3301 ggtgctggac cacggctgga tgatgggaga tggcaccagg taagctagct ctggtcctca 3361 ggggagggat gtctggagct ggtctgagga aagggaacaa aaccaagtta ttgggcatcc 3421 ctttaccact gtcatctcgt ttaatccaca cgaaccccca caaagtagct attcttggcc 3481 ccatcttttc tgatgggaat tctaaggctc agtcagtata taagtgacaa gagctgagtg 3541 acccaaggcc aaggatgcta gctgcttctt taaggcatgt tctttccact atagtactag 3601 gctgcctcac aggaaggtgg cagaaacaga tcccaggggc ctctgatttt gcttcccacc 3661 ttcctgcagg tggaagtcaa gatggagggg gactctgtgc tgctggaggt ggatggggag 3721 gaggtgctgc gcctgagaca ggtctctggg cccctgacca gcaaacgcca tcccatcatg 3781 aggattgcgc ttggggggct gctcttcccc gcttccaacc ttcggttgcc ggtaactaca 3841 ccccaggggt ggaaccctag ccaagacttg gtaaagcact gctgggtggc tggccgtggg 3901 aatctaagtc cacactttta gggagaaggg aagggttgag agctgcaagg gggaggccaa 3961 atgctcagag gggagtcaac tgagggcagg gaggtcggga ctgcgcctcc gatgccctga 4021 tttctacatc cccgtatctt atctctgtca cactccagct ggttcctgcc ctggatggct 4081 gcctgcgccg ggattcctgg ctggacaaac aggccgagat ctcagcatct gcccccacta 4141 gcctcagaag ctgtgatgta gaatcaaatc ccgggatatt tctccctcca gggactcagg 4201 cagaattcaa tctccgaggt agatttcctc ggagtctatt tttcccaccc tggccagctc 4261 agcctgcctc tgtccccctc taccactggc ccctttcctc cttgagaccc cagctttgag 4321 gcctcaggat aatcatttct ccccacagac attccccagc ctcatgcaga gccctgggcc 4381 ttctctttgg acctgggact caagcaggca gcaggctcag gccacctcct tgctcttggg 4441 acaccagaga acccatcttg gctcagtctc cacctccaag atcaagtaag ggacagtggg 4501 cattgcctgt attcagtgga gcctggagca atgaggaaga gggagtccaa catgtcaata 4561 ttaggaaggt ttccagccca gggaacataa caagactggc tccacagaat tgtttttcat 4621 taataattag ccaggcatgg tggtggtgct tgcctgtaat cccaggtgct ggaggccaag 4681 accagaggat cacttgaggc caggagtttg acaccagcct gggcaacata gcagagacct 4741 ctgtctaaaa aaaaaaaaaa attagccagg catggtagca catgtctgct gccctagcta 4801 tttaggagcc tgaggcagga ggttcacttg agcccaggag tttgaagctg cagtgagcta 4861 tgatgtgcca ctgcactctg acctgggcca cagtgagacc ctgtctcaaa aaataaaaat 4921 aaaaataagg cttatggatg gcactcaggt gggtggtagg ggcgagggac atatcttgaa 4981 gctccccaca gcaagcaaac agttttgact tagactgcat atttacttgg ggcaggtgtg 5041 gtttcaaaaa gggtcaagcc aaaaaaaatt ggggcaggat ttaagtggtg agaatggcca 5101 gtaggtggag gcatagcgaa gaggcagaat taaggcagct aggggtgagg ccacaggcag 5161 taggcccggc tcattcttcc ctctctctct accgtccctt tcccacacac tctgcagaag 5221 gtggtgttgt cttctgggtc ggggccaggg ctggatctgc ccctggtctt gggactccct 5281 cttcagctga agctgagtat gtccagggtg gtcttgagcc aagggtcgaa gatgaaggcc 5341 cttgccctgc ctcccttagg cctgctcccc tcccttaacc tctgggccaa gcctcaaggg 5401 cgtctcttcc tgggggcttt accaggtaag agagaatgat gttcaagttc atgagcacaa 5461 cattggaaac agctcaaggg aggcggcaca ttttgagggg aaggaaacct ctgggaggga 5521 agaagaatag gccacaagaa gaagatatgg gggcagtgga aggtagtgct tttgcaaact 5581 caggttggag gagtggaaaa gtggggagaa gattctggat ccgagccacc ttaatgctct 5641 aatgccacct ttgcactacc tccctctagg agaagactct tccacctctt tttgcctgaa 5701 tggcctttgg gcacaaggtc agaggctgga tgtggaccag gccctgaaca gaagccatga 5761 gatctggact cacagctgcc cccagagccc aggcaatggc actgacgctt cccattaaag 5821 ctccacctaa gaaccccctt tgaaagttac tgattattca tttattcaac aaatattcac 5881 tgtgcactag caatgtacca ggcactgtgc caagtattga gttgtcttaa tgagcaaaaa 5941 cactctggtt cctaccctct tggtgcccac agtcccatag ggaagcagac atccatcaaa 6001 ggctaactaa taagtggata gttggaagca ctgataaaga agaattggag agttgtgaaa 6061 acatggagac tggcgggcgt gtggctc //
GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS OWEPH18 1548 bp ds-DNA INV 23-FEB-1990 DEFINITION O.fusiformis CCPurine repeat sequence. ACCESSION M32217 KEYWORDS . SOURCE O.fusiformis DNA, clone PH18. ORGANISM Owenia fusiformis Eukaryota; Animalia; Metazoa; Annelida; Polychaeta; Oweniida; Oweniidae. REFERENCE 1 (bases 1 to 1548) AUTHORS Bakalara,N., Collet,J., Planells,R., Thouveny,Y. and Fontes,M. TITLE Presence in invertebrate genomes of sequences characterized by the repetition of the triplet CCpurine JOURNAL Biochem. Biophys. Res. Commun. 166, 66-73 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 515 663 CCPu repeat sequence BASE COUNT 424 a 395 c 230 g 499 t ORIGIN 1 aaagcatttt tccaatcttc acgctcgagt gattttggat gatttatgga gtttgagact 61 ttttttcacc aaacggtcac tttattctta ctcctgagtc aatttagaga ctttttccaa 121 tcttcacgct cgagtgattt tggatgattt ctggagtttg agacattctt tcaccaaacg 181 atcacttgat tcttactccc aagtcattta agagtatttt ttcagtcgtc atggtttagt 241 gatttcggtt acgcttgatc aatttctgct aatttttggg agtttaatgc gttttcacaa 301 aatagacacc cacccactat acaaaataac atgttttaat aggcatctgt taatttaaca 361 taccactgat aaaaatataa aaccacacaa gataaaaaaa atgctaacat taacagtttt 421 acaagcgaaa cattttcgac caactaaagt gtatacttaa atgagcagct gaaccatttc 481 aatcaatcag attagattat tgcactcttt gaccccgccg ccaccaccac cgccgccgcc 541 accaccgcca ccaccaccgc cgccaccacc accaccgcca ccaccgccgc caccaccacc 601 accgccacca ccgccgccac caccaccacc gccaccacca ccaccgccgc caccaccgcc 661 gcctcggtgc cacaactatc gacaacaaca accccaacaa caataaaaat gtattttagt 721 gctatagtag tttattaata cacatatttt catttataca aattatttca tagtcccttt 781 tggatgaggg attcactgtc tgtgttgacg ttttccagga gggccagaat acaccataac 841 atccctcttt ttcttaggtt ttttaaaaag tcgtattcat cacatttcca ttggaggtgt 901 ggtcaaagga atcattgtca ctcatttgtc tgtaaacgtc tcttggtggc ctatcccgtt 961 cgccattttc tttctgctgc ttgtcaattt ctgccttggc tctctataaa cagcttcagt 1021 gggactgaga tgttgaacaa tcggtttcat ggactgatca cactgtgatc cacttgagtt 1081 ttcctcgttt accatatcca gcatgttcga tgactctgcc tttcacttgg gctcgtataa 1141 acttcttcca cacttctggg ttgtccacat aaggcactat cttaggcatt gttgcttctt 1201 ccttttctat taattatctg atgctaatgt ccaggtttcc ctttatatac tcatcaaaca 1261 ttcattcatg aaatctaact acgttcgtct tcttcttcca tctcacatag tgtatctgca 1321 tcttctaaat cacttccgtc aatagactct tcatcgttaa acatttcatc agcgatataa 1381 gaatatttgg ccaaagtttt tctgatggct ctcgacagag atatgtcttg atcaactaag 1441 aattcaattt cttctacaat ttctcgatgt aaatcagcat ctttgacttc aatgaataac 1501 ctcagaattt gaccataatc atttaaaaac tcagattgct gggataag //
GenBank-Updates@genbank.bio.net (04/09/90)
LOCUS MUSIGHRH 345 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65D6-5, partial cds. ACCESSION M32034 KEYWORDS N-region; diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 345) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 345 Ig H-chain V-D-J region (AA at 1) recomb 279 280 V-region DNA end/N-region DNA start recomb 282 283 N-region DNA end/D-region DNA start recomb 300 301 D-region DNA end/N-region DNA start recomb 303 304 N-region DNA end/J-region DNA start BASE COUNT 93 a 78 c 90 g 84 t ORIGIN Chromosome 12. 1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg 61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg 121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac 181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagccctg 241 acatctgagg actctgcagt ctatttctgt gcaagatctc attactatgg tggtagctac 301 gactttgact actggggcca aggcaccact ctcacagtct cctca // LOCUS MUSIGHRI 363 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65D6-7, partial cds. ACCESSION M32035 KEYWORDS N-region; diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 363 Ig H-chain V-D-J region (AA at 1) recomb 297 298 V-region DNA end/N-region DNA start recomb 300 301 N-region DNA end/D-region DNA start recomb 318 319 D-region DNA end/N-region DNA start recomb 321 322 N-region DNA end/J-region DNA start BASE COUNT 99 a 83 c 94 g 87 t ORIGIN Chromosome 12. 1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg 61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg 121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac 181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac 241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatctcat 301 tactatggtg gtagctacga ctttgactac tggggccaag gcaccactct cacagtctcc 361 tca // LOCUS MUSIGHRJ 363 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65D6-3, partial cds. ACCESSION M32036 KEYWORDS N-region; diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 363 Ig H-chain V-D-J region (AA at 1) recomb 297 298 V-region DNA end/N-region DNA start recomb 300 301 N-region DNA end/D-region DNA start recomb 318 319 D-region DNA end/N-region DNA start recomb 321 322 N-region DNA end/J-region DNA start BASE COUNT 99 a 84 c 93 g 87 t ORIGIN Chromosome 12. 1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg 61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg 121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac 181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac 241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcccat 301 tactatggtg gtagctacta ctttgactac tggggccaag gcaccactct cacagtctcc 361 tca // LOCUS MUSIGHRK 363 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65I4-2, partial cds. ACCESSION M32037 KEYWORDS N-region; diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65I4-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 363 Ig H-chain V-D-J region (AA at 1) recomb 297 298 V-region DNA end/N-region DNA start recomb 300 301 N-region DNA end/D-region DNA start recomb 318 319 D-region DNA end/N-region DNA start recomb 321 322 N-region DNA end/J-region DNA start BASE COUNT 100 a 82 c 93 g 88 t ORIGIN Chromosome 12. 1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg 61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg 121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac 181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac 241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatctaat 301 tactatggtg gtagctacta ctttgactac tggggccaag gcaccactct cacagtctcc 361 tca // LOCUS MUSIGHRL 363 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J region, from hybridoma P65J4-1, partial cds. ACCESSION M32038 KEYWORDS N-region; diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65J4-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 363 Ig H-chain V-D-J region (AA at 1) recomb 297 298 V-region DNA end/N-region DNA start recomb 300 301 N-region DNA end/D-region DNA start recomb 318 319 D-region DNA end/N-region DNA start recomb 321 322 N-region DNA end/J-region DNA start BASE COUNT 100 a 82 c 93 g 88 t ORIGIN Chromosome 12. 1 gaggttcagc ttcagcagtc tggagctgag ctggtgaggg ctgggtcctc agtgaagatg 61 tcctgcaagg cttctggata tacattcaca agctacggta taaactgggt gaaacagagg 121 cctggacagg gcctggaatg gattggatat attaatcctg gaaatggtta tactaagtac 181 aatgagaagt tcaagggcaa gaccacactg actgtagaca aatcctccag cacagcctac 241 atgcagctca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatctaat 301 tactatggtg gtagctacta ctttgactac tggggccaag gcaccactct cacagtctcc 361 tca // LOCUS MUSIGKCRP 324 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65D6-5, partial cds. ACCESSION M32039 KEYWORDS N-region; immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 324 Ig kappa-chain V-J region (AA at 1) recomb 285 286 V-region DNA end/N-region DNA start recomb 288 289 N-region DNA end/J-region DNA start BASE COUNT 96 a 80 c 72 g 76 t ORIGIN Chromosome 6. 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa 241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga 301 ggcaccaagc tggaaatcaa acgg // LOCUS MUSIGKCRQ 324 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65D6-7, partial cds. ACCESSION M32040 KEYWORDS N-region; immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 324 Ig kappa-chain V-J region (AA at 1) recomb 285 286 V-region DNA end/N-region DNA start recomb 288 289 N-region DNA end/J-region DNA start BASE COUNT 96 a 80 c 71 g 76 t 1 others ORIGIN Chromosome 6. 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa 241 gaagatattn ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga 301 ggcaccaagc tggaaatcaa acgg // LOCUS MUSIGKCRR 324 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65D6-3, partial cds. ACCESSION M32041 KEYWORDS N-region; immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65D6-3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 324 Ig kappa-chain V-J region (AA at 1) recomb 285 286 V-region DNA end/N-region DNA start recomb 288 289 N-region DNA end/J-region DNA start BASE COUNT 96 a 80 c 72 g 76 t ORIGIN Chromosome 6. 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa 241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga 301 ggcaccaagc tggaaatcaa acgg // LOCUS MUSIGKCRS 324 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65I4-2, partial cds. ACCESSION M32042 KEYWORDS N-region; immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65I4-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 324 Ig kappa-chain V-J region (AA at 1) recomb 285 286 V-region DNA end/N-region DNA start recomb 288 289 N-region DNA end/J-region DNA start BASE COUNT 96 a 80 c 72 g 76 t ORIGIN Chromosome 6. 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa 241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga 301 ggcaccaagc tggaaatcaa acgg // LOCUS MUSIGKCRT 324 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, from hybridoma P65J4-1, partial cds. ACCESSION M32043 KEYWORDS N-region; immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain A/J), cDNA to mRNA, hybridoma P65J4-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Parhami-Seren,B., Wysocki,L. and Margolies,M.N. TITLE The amino acid residues at the V-H-D-J-H junctions affect the affinity of anti-p-azophenylarsonate antibodies JOURNAL J. Immunol. 143, 4090-4097 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 324 Ig kappa-chain V-J region (AA at 1) recomb 285 286 V-region DNA end/N-region DNA start recomb 288 289 N-region DNA end/J-region DNA start BASE COUNT 95 a 80 c 72 g 76 t 1 others ORIGIN Chromosome 6. 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa 241 gaagatattg ccacttactt ttgccnacag ggtaatacgc ttcctcggac gttcggtgga 301 ggcaccaagc tggaaatcaa acgg //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS MUSIGHRM 374 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma BrM4, partial cds. ACCESSION M31953 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 374) AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B. TITLE Diversity in the available repertoire of murine antibodies reactive with bromelain-treated isologous erythrocytes JOURNAL J. Immunol. 143, 4044-4052 (1989) STANDARD full staff_review FEATURES from to/span description pept < 1 > 374 Ig mu-chain precursor V-D-J1-region (AA at 3) sigp < 1 20 Ig mu-chain signal peptide sigp 21 > 374 Ig mu-chain recomb 314 315 V-region DNA end/D-region DNA start recomb 323 324 D-region DNA end/J1-region DNA start BASE COUNT 92 a 80 c 109 g 91 t 2 others ORIGIN Chromosome 12. 1 ttttaaaaga tgtccagtgt gaagtgcagc tgttggagac tggaggaggc ttggtgcaac 61 ctggggggtc acggggactc tcttgtgaag gctcagggtt cacttttagt ggcttctgga 121 tgngctgggt tcgacagaca cctgggaaga ccctggagtg gattggagac attnattctg 181 atggcagtgc aataaactac gcaccatcca taaaggatcg attcactatc ttcagagaca 241 atgacaagag caccctgtac ctgcagatga gcaatgtgcg atcggaggac acagccacgt 301 atttctgtat gagatatggt aactactggt acttcgatgt ctggggcaca gggaccacgg 361 tcaccgtctc ctca // LOCUS MUSIGHRN 287 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma S12M1, partial cds. ACCESSION M31954 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma S12M1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 287) AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B. TITLE Diversity in the available repertoire of murine antibodies reactive with bromelain-treated isologous erythrocytes JOURNAL J. Immunol. 143, 4044-4052 (1989) STANDARD full staff_review FEATURES from to/span description pept < 1 > 287 Ig mu-chain V-D-J1-region (AA at 3) recomb 227 228 V-region DNA end/D-region DNA start recomb 236 237 D-region DNA end/J1-region DNA start BASE COUNT 77 a 64 c 75 g 64 t 7 others ORIGIN Chromosome 12. 1 aaggcnnagg gnncactttt agtggcttct ggatgagctg ggntcgacag acacctggga 61 agaccctgga gtggattgga gacattaatt ctgatggcag tgcaataaac tacgcaccat 121 ccataaagga tcgattcact atcttcagag acaatgacaa gagcaccctg tacctgcaga 181 tgagcaatgt gcgatcggag gacacagcca cgtatttctg tatgagatat ggtaactact 241 ngtacttcga tgtctggggc acagggacca cggtcaccgt ctcctna // LOCUS MUSIGHRO 366 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig active mu-chain mRNA V-D-J2 region, from hybridoma CH12, partial cds. ACCESSION M31956 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma CH12, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 366) AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B. TITLE Diversity in the available repertoire of murine antibodies reactive with bromelain-treated isologous erythrocytes JOURNAL J. Immunol. 143, 4044-4052 (1989) STANDARD full staff_review FEATURES from to/span description pept < 1 > 366 Ig mu-chain precursor V-D-J2-region (AA at 1) matp 1 > 366 Ig mu-chain recomb 294 295 V-region DNA end/D-region DNA start recomb 321 322 D-region DNA end/J2-region DNA start BASE COUNT 95 a 93 c 97 g 81 t ORIGIN Chromosome 12. 1 caggtccaac tgcagcagcc tgggactgaa ctggtgaagc ctggggcttc agtgaagctg 61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg 121 cctggacaag gccttgagtg gattggaaat attaatccta gcaatggtgg tactaactac 181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aatcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagagattac 301 tacggtagta gctgggggta ctactttgac tactggggcc aaggcaccac tctcacagtc 361 tcctca // LOCUS MUSIGHRP 366 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig active mu-chain mRNA V-D-J2 region, from hybridoma BrM1, partial cds. ACCESSION M31957 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM1., cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 366) AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B. TITLE Diversity in the available repertoire of murine antibodies reactive with bromelain-treated isologous erythrocytes JOURNAL J. Immunol. 143, 4044-4052 (1989) STANDARD full staff_review FEATURES from to/span description pept < 1 > 366 Ig mu-chain precursor V-D-J2 region (AA at 1) matp 1 > 366 Ig mu-chain recomb 294 295 V-region DNA end/D-region DNA start recomb 321 322 D-region DNA end/J2-region DNA start BASE COUNT 93 a 90 c 98 g 80 t 5 others ORIGIN Chromosome 12. 1 caggtccaac tgcngcagcc tgggactgaa ctggtgaagc cttgggcttc agtgaagctg 61 tcctgcaagg cttctggcca caccttcaca agctantgga taagctgggt gaagnagaat 121 cctggacagg gccttgagtg gattggatnt attnatccga gaagtggtga tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca aatcctccag cacagcgtac 241 atggacgtcc gcagcctgac atctgaggac tctgcggtct attattgtgc aagaagctat 301 tactacggta gtagctacgg ggactttgac tactggggcc aaggcaccac tctcacagtc 361 tcctca // LOCUS MUSIGHRQ 384 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig active mu-chain mRNA V-D-J3 region, from hybridoma BrM8, partial cds. ACCESSION M31958 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM8, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 384) AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B. TITLE Diversity in the available repertoire of murine antibodies reactive with bromelain-treated isologous erythrocytes JOURNAL J. Immunol. 143, 4044-4052 (1989) STANDARD full staff_review FEATURES from to/span description pept < 1 > 384 Ig mu-chain precursor V-D-J3-region (AA at 1) matp < 1 > 384 Ig mu-chain recomb 297 298 V-region DNA end/D-region DNA start recomb 339 340 D-region DNA end/J3-region DNA start BASE COUNT 96 a 92 c 99 g 95 t 2 others ORIGIN Chromosome 12. 1 gaggtccagc tgcaacaatc tggacctgag ctggtgaagc ctggggcttc agtgaagata 61 tcctgtaagg cttctggata cacgttcact gactactana tgaactgggt gaagcagagc 121 catggaaaga gccttgantg gattggagat attaatccta acaatggtgg tactagctac 181 aaccagaagt tcaagggcaa ggccacattg actgtagaca agtcctcctc cagcacagcc 241 tacatggtac gccgcagcct gacatctgag gactttgcag tctattactg tgcaagatcc 301 ctctactact tggggagctc tctctactat gattacggct ggtttgctta ctggggccaa 361 gggactctgg tcactgtctc tgca // LOCUS MUSIGHRR 331 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma BrM11, partial cds. ACCESSION M31959 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B. TITLE Diversity in the available repertoire of murine antibodies reactive with bromelain-treated isologous erythrocytes JOURNAL J. Immunol. 143, 4044-4052 (1989) STANDARD full staff_review FEATURES from to/span description pept < 1 > 331 Ig mu-chain V-D-J1-region (AA at 2) recomb 262 263 V-region DNA end/D-region DNA start recomb 283 284 D-region DNA end/J1-region DNA start BASE COUNT 79 a 84 c 87 g 77 t 4 others ORIGIN Chromosome 12. 1 aatggagcct ggagggtccc ggaaactctc ctgtgcagcc tctggattca ctttcagtag 61 cattggaatg cactgggttc gtcaggctcc agagaagggg ctggagttgg tngcatacat 121 tagtagtggc agtagtacca tctactatgc agacacagng aagggccgat tcaccatctc 181 cagagacaat gccaagaaca ccctnttcct gcaaatgacc agtctaaggt ctgaggacac 241 ggccatgtat tactgtgcaa ggtgccctta tgattacgac tcttngtact tcgatgtctg 301 gggcacaggg accacggtca ccgtctcctc a // LOCUS MUSIGHRS 195 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig active mu-chain mRNA V-D-J1 region, from hybridoma BrMs19, partial cds. ACCESSION M31960 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrMs19, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 195) AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B. TITLE Diversity in the available repertoire of murine antibodies reactive with bromelain-treated isologous erythrocytes JOURNAL J. Immunol. 143, 4044-4052 (1989) STANDARD full staff_review FEATURES from to/span description pept < 1 > 195 Ig mu-chain V-D-J1-region (AA at 1) recomb 135 136 V-region DNA end/D-region DNA start recomb 144 145 D-region DNA end/J1-region DNA start BASE COUNT 50 a 56 c 41 g 40 t 8 others ORIGIN Chromosome 12. 1 agtgngnnaa cnntctacaa cccatctctc cagagcccca tctccattac tagagaaacg 61 tcaaagaacc agtncttcct ccaattgaac tctgtgacca cagaggacac agccatgtat 121 tactgtgcag gagacagtgc tggctactgg tacttcgatg tctggggcac agggaccacg 181 gtcaccgtct ccnna // LOCUS MUSIGKCRU 321 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J4 region, from hybridoma BrM4, partial cds. ACCESSION M31955 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma BrM4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B. TITLE Diversity in the available repertoire of murine antibodies reactive with bromelain-treated isologous erythrocytes JOURNAL J. Immunol. 143, 4044-4052 (1989) STANDARD full staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain precursor VJ4-region (AA at 1) matp 1 > 321 Ig kappa-chain recomb 285 286 V-region end/J4-region start BASE COUNT 94 a 74 c 76 g 72 t 5 others ORIGIN Chromosome 6. 1 gacatcaaga tgaccnagtc nccatcctcc atgtatgcat cgctgggaga gagagtcact 61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca 121 tggaaatctc ctaagaccct gatctatnat gcaacaagct tggcagatgg ggtcccatca 181 agattcagtg gcagtggatc tgggcaagat tnttctctaa ccatcagcag cctggagtct 241 gacgatacag caacttntta ctgtctacag catggtgaga gcccattcac gttcggctcg 301 gggacaaagt tggaaataaa a // LOCUS MUSIGKCRV 261 bp ss-mRNA ROD 20-FEB-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J2 region, from hybridoma S12M1, partial cds. ACCESSION M31961 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.A) anti BrMRBC hybridoma S12M1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 261) AUTHORS Conger,J.D., Sage,H.J. and Corley,R.B. TITLE Diversity in the available repertoire of murine antibodies reactive with bromelain-treated isologous erythrocytes JOURNAL J. Immunol. 143, 4044-4052 (1989) STANDARD full staff_review FEATURES from to/span description pept < 1 > 261 Ig kappa-chain VJ2-region (AA at 1) recomb 225 226 V-region end/J2-region start BASE COUNT 79 a 58 c 65 g 55 t 4 others ORIGIN Chromosome 6. 1 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca 61 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca 121 aganncagtg gcagtggatc tgggcaagat tnttctctaa ccatcagcag cctggagtct 181 gacgatacag caacttatna ctgtctacag catggtgaga gcccgtacac gttcggaggg 241 gggaccaagc tggaaataaa a //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS ECOCMA 1007 bp ds-DNA BCT 15-MAR-1989 DEFINITION Plasmid pColBM-Cl139 from E.coli cma gene encoding colicin M activity peptide, complete cds. ACCESSION M16754 KEYWORDS colicin M activity. SOURCE Plasmid pColBM-Cl139 DNA, clone pTO4. ORGANISM Plasmid Colicin BM-Cl139 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1007) AUTHORS Koeck,J., Oelschlaeger,T., Kamp,R.M. and Braun,V. TITLE Primary structure of colicin M, an inhibitor of murein biosynthesis JOURNAL J. Bacteriol. 169, 3358-3361 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 60 875 colicin M activity peptide mRNA < 1 901 cma mRNA BASE COUNT 283 a 227 c 198 g 299 t ORIGIN NaeI site. 1 ggcggaggcg ttagccagcg gcgaactctg ttatcttgtt aacttataag gagttatgta 61 tggaaacctt aactgttcat gcaccatcac catcaactaa cttaccaagt tatggcaatg 121 gtgcattttc tctttcagca ccacatgtgc ctggtgctgg ccctctttta gtccaggttg 181 tttatagttt tttccagagt ccaaacatgt gtcttcaggc tttaactcaa cttgaggatt 241 acatcaaaaa acatggggcc agcaaccctc tcacattgca gatcatatcg acaaatattg 301 gttacttctg taacgccgac cgaaatctgg ttcttcaccc tggaataagc gtttatgacg 361 cttaccactt cgcaaaacca gcgccaagtc aatatgacta tcgctcaatg aatatgaaac 421 aaatgagcgg taatgtcact acaccaattg tggcgcttgc tcactattta tggggtaatg 481 gcgctgaaag gagcgttaat atcgccaaca ttggtcttaa aatttcccct atgaaaatta 541 atcagataaa agacattata aaatctggtg tagtaggcac attccctgtt tctacaaagt 601 tcacacatgc cactggtgat tataatgtta ttaccggtgc atatcttggt aatatcacac 661 tgaaaacaga aggtacttta actatctctg ccaatggctc ctggacttac aatggcgttg 721 ttcgttcata tgatgataaa tacgatttta acgccagcac tcaccgtggc attatcggag 781 agtcgctcac aaggctcggg gcgatgtttt ctggtaaaga gtaccagata ctgcttcctg 841 gtgaaattca cattaaagaa agtggtaagc gataatataa gggccggggt tccggctttt 901 tatgcagcct ttataacagt gtattagtca ttcgcatatt caacatatcc ctcctcacaa 961 tggtacatgc cataatacat tctcttaccc tccgggcctt cagcttt // LOCUS ECOCOLM 810 bp ds-DNA BCT 15-SEP-1989 DEFINITION Plasmid pColBM-Cl139 colicin M resistance (cmi) locus, complete cds. ACCESSION M17810 KEYWORDS cmi protein; colicin M resistance protein. SOURCE Plasmid pColBM-Cl139 DNA, isolated from E.coli strain JM101. ORGANISM Plasmid Colicin BM-Cl139 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 810) AUTHORS Oelschlaeger,T. and Braun,V. TITLE Sequence, expression, and localization of the immunity protein for colicin M JOURNAL J. Bacteriol. 169, 4765-4769 (1987) STANDARD simple staff_review FEATURES from to/span description pept 322 675 colicin M resistance protein pept 337 675 colicin M resistance protein (alt.) BASE COUNT 252 a 146 c 177 g 235 t ORIGIN 7 bp upstream of PstI site. 1 ggctgcagag cctgatgggg cataagtcca tcagctcaaa cgaggtctac acgaaggtgt 61 ttgcactgga tgtggctgta cggcaccggg tgcagttttc gatgcctgag tccgatgcgg 121 tcacaatgct gaaaaacaga catgcataat aagtcacaat tatgaattgt gatttcttct 181 ataaaaaaga gaccactgca atatgtgatc tcttgtatta tttcataatt gttaaagcca 241 cttcacagta tgctcacatt gtactgatat attcgtaatg tttttttata tcgaatgaac 301 gacagaagtt gtggagattt tatgaaagta attagcatga aatttatttt tattttaacg 361 attattgctc ttgctgctgt ttttttctgg tctgaagata aaggtccggc atgctatcag 421 gtcagcgatg aacaggccag aacgtttgta aaaaatgatt acctgcaaag aatgaaacgc 481 tgggacaacg atgtacaact tcttggtaca gaaatcccga aaattacatg ggaaaagatt 541 gagagaagtt taacagatgt tgaagatgaa aaaacacttc ttgtcccatt taaagctgaa 601 ggcccggacg gtaagagaat gtattatggc atgtaccatt gtgaggaggg atatgttgaa 661 tatgcgaatg actaatacac tgttataaag gctgcataaa aagccggaac cccggccctt 721 atattatcgc ttaccacttt ctttaatgtg aatttcacca ggaagcagta tctggtactc 781 tttaccagaa aacatcgccc cgacggatcc // LOCUS ECOORID 1890 bp ds-DNA BCT 15-DEC-1989 DEFINITION E.coli plasmid pColBM-Cl139 D protein gene, complete cds, oriV region, and ORF. ACCESSION J03312 KEYWORDS D protein; ORF; oriV region. SOURCE E.coli (strain K12 Cl139) plasmid pColBM-Cl139 DNA. ORGANISM Plasmid Colicin BM-Cl139 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1890) AUTHORS Thumm,G., Oelschlaeger,T. and Braun,V. TITLE Plasmid pCoIBM-CI139 does not encode a colicin lysis protein but contains sequences highly homologous to the D protein (resolvase) and the oriV region of the miniF plasmid JOURNAL Plasmid (1988) In press STANDARD simple staff_review COMMENT Draft entry and sequence for [1] kindly submitted by G.Thumm, 18-AUG-1988. Sequence [1] complemented. FEATURES from to/span description pept 1108 1890 D protein ORF 654 1106 ORF8 site 257 576 oriV region BASE COUNT 434 a 476 c 534 g 446 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatct tttggccgtt tacgtctgtt ccgttatcct gatgatacga tgttctgcac 61 gttctgccgg gaagatgcag atgattcgct taaaagtatt atgacccatc tctgggagct 121 ggatgcagag atgacagatc ctgtcatagc tatgtttaat cacgtctgag tgccgtgagt 181 gatttctgtc ttttatgcaa cagtgccaag atattgtaat caaaaaaaag cattaatgca 241 ttttggacag taatctattt taattgatga catagaggca ttaatctttc tttttcttca 301 ggaagatccg aaaactcctg gtcacggatc ttcctctccc ccacacaacg ccacctcctg 361 taagcacaac atgtggtgcc ggattcagct gctgatgaca ctatatgttg tgtcatctcc 421 ctgacctgtg atgcgtcgcg caggggcgga aaacagcgat atgatgattt cctcggcgtg 481 gtacacttcc ggaaagttgt gatattccgg aaagtcggat ctgacggaaa cggctctccg 541 gtaatttaac ggcgtggtta tatggatgct tgttatcatg gtgatgatga taacggcatg 601 atgttatcag acggcgtgac ggtaagggca gtgatgatgg atgacgttat cgcatgaccg 661 tccctgcccg gaaaagaaaa aaggagtcac ccatgttttt tattgagaat gaaggtcagg 721 ctgtcgccgg aacggattac tggcagtctg tacaggcgca ggccggatat gtctacctca 781 gctggaatgc cggcgcagcc aggctgcttg tcccggatgc ggcaaaacat ttactcaggg 841 agatgcgggg ggctgagtac gtcatcatca gtaagggagc actgcatggc cgcgatgcgc 901 tggaactggt atttgaagac ggcagcgatg cgccgtttgt gatccacatg ctgagtgagc 961 agtgcgatcg cctgctcccc gaaaacaacc agggaggggg ttttgttgtc accgtctgga 1021 cgcgtggcgg taaccagctc cgttatccgg gaaagtaccg ggttgtggaa aacctgcccg 1081 acgtttcccc gtggagtgaa cactgatatg cagcacctgc cggcaccgat ccaccatgcc 1141 cgggatgctg ttcagcttcc tgttgccatc gattatccgg cagcgctggc actccgccag 1201 atgtcgatgg ttcatgatga actgcccaaa tacctgctgg cccctgaagt gagcgccctg 1261 ctccattacg tcccggatct gcgccgcaag atgctgctgg ccacactgtg gaacaccggt 1321 gcgcgcatta atgaagcact ggcgctgacg cggggggatt tttcgctcac gcctccgtat 1381 ccgtttgtgc agctggccac tctgaagcag cggacagaaa aagccgccag gacggcagga 1441 agaatgcccg ccggtcagca gactcaccgg ctggttccgc tctccgactc ctggtacgtc 1501 agccagctgc agacgatggt agccacactg aaaatcccca tggaacggcg taataaacga 1561 acaggcagga cagagaaagc gcggatctgg gaagtgacgg acagaacggt caggacctgg 1621 attggggagg cggttgccgc cgctgccgct gatggtgtga cgttctctgt cccggtcacg 1681 ccacatacgt tccgccattc ctatgcgatg cacatgctgt atgccggtat accgcttaag 1741 gttctgcaga gtctgatggg gcataagtcc atcagctcaa cagaggtcta cacgaaggtg 1801 tttgcactgg atgtggctgc acggcaccgg gtgcagtttt cgatgcctga gtccgatgcg 1861 gtcacaatgc tgaaaaacag acatgcataa //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS CEC38P 1455 bp ds-DNA BCT 31-AUG-1987 DEFINITION Plasmid ColE3-CA38 colicinogenic region containing colicin E3 (colE3), immunity (immE3), and putative hic and immE8 genes. ACCESSION J01574 J01575 M14038 KEYWORDS colicin; colicin release protein; immune response gene; lysis protein; unidentified reading frame. SOURCE Plasmid ColE3-CA38 (from E.coli) DNA. ORGANISM Plasmid Colicin E3-CA38 Prokaryota; Bacteria. REFERENCE 1 (bases 52 to 651) AUTHORS Masaki,H. and Ohta,T. TITLE A plasmid region encoding the active fragment and the inhibitor protein of colicin E3-CA38 JOURNAL FEBS Lett. 149, 129-132 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 651) AUTHORS Mock,M., Miyada,C.G. and Gunsalus,R.P. TITLE Nucleotide sequence for the catalytic domain of colicin E3 and its immunity protein. Evidence for a third gene overlapping colicin JOURNAL Nucleic Acids Res. 11, 3547-3557 (1983) STANDARD full staff_review REFERENCE 3 (bases 640 to 1455) AUTHORS Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P. TITLE Characterization and nucleotide sequence of a colicin-release gene in the hic region of plasmid ColE3-CA38 JOURNAL Gene 29, 175-184 (1984) STANDARD full staff_review REFERENCE 4 (bases 640 to 1455) AUTHORS Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P. TITLE Corrigenda: Characterization and nucleotide sequence of a colicin-release gene in the hic region of plasmid ColE3-CA38 JOURNAL Gene 42, 351-355 (1986) STANDARD full staff_review COMMENT There are three ORFs distal to the immunity gene (immE3) in plasmid ColE3-CA38. ORF1 (bases 803 to 1066) is homologous to the E2-immunity gene in plasmid ColE2-P9. [3] has tentatively assigned ORF1 as the immE8 gene, but points out that a gene product has not yet been identified. Through deletion mutation studies this region was shown to be non-essential for colicin release. There are two overlapping reading frames further downstream of ORF1 (bases 1070 to 1246 and 1128 to 1421), which are homologous to the H' and H genes in plasmid CloDF13. In CloDF13 the H gene has been shown to be the lysis gene. Because of gene homology and in vitro studies which show that the ORF3 region is necessary for cell lysis, [3] has identified ORF3 as the hic gene. However, the in vitro studies do not exclude the possibility that ORF2 functions in colicin release or that it contributes to the Hic phenotype. There is also the possibility that the hic gene initiates at bp 1095 rather than 1128. The hic gene product also has not been identified. The immE8 and hic genes have been experimentally localized to the regions annotated in the Features Table. Their coding regions were deduced by finding the open reading frames and comparing them with sequences of genes in plasmids with like phenotypes [3]. There are six inverted repeats in the hic region. IR-2, IR-3, and IR-5 (positions 762-792, 964-984, 1131-1157) are "a" + "t" rich and show resemblance to SOS boxes. IR-1 an IR-4 (673-708 and 1072-1107) show attenuator-like structure. They may attenuate transcription of the hic gene after SOS induction to a level more optimal for colicin release. IR-6 (1287-1332) has a terminator-like structure. A Shine-Delgarno sequence is present at positions 376-379 between the colE3 and immE3 genes. The putative ribosome binding site for the hic gene can be found at 1117-1122. [1] refers to colicin E3 as protein A and the immunity protein as protein B. [2] refers to the colE3 gene as the ceaC gene and the immE3 gene as the ceaC gene. Draft entry and clean copy sequence kindly provide by R.J.Watson, May 1985 [3]. FEATURES from to/span description pept < 1 375 colicin E3 (AA at 1) pept 385 642 immunity protein-E3 pept 803 1066 immE8 protein (putative; gtg start codon) pept 1128 1271 lysis protein (putative) revision 755 756 gc in [4]; cg in [3] revision 1265 1267 gca in [4]; ga in [3] revision 1348 1349 tt in [4]; ttt in [3] BASE COUNT 473 a 214 c 352 g 416 t ORIGIN 150 bp upstream of Sau3A site. 1 gctatggaaa gcaggaagaa gaaagaagat aagaaaagga gtgctgaaaa taatttaaac 61 gatgaaaaga ataagcccag aaaaggtttt aaagattacg ggcatgatta tcatccagct 121 ccgaaaactg agaatattaa agggcttggt gatcttaagc ctgggatacc aaaaacacca 181 aagcagaatg gtggtggaaa acgcaagcgc tggactggag ataaagggcg taagatttat 241 gagtgggatt ctcagcatgg tgagcttgag gggtatcgtg ccagtgatgg tcagcatctt 301 ggctcatttg accctaaaac aggcaatcag ttgaaaggtc cagatccgaa acgaaatatc 361 aagaaatatc tttgagagga agttatggga cttaaattgg atttaacttg gtttgataaa 421 agtacagaag attttaaggg tgaggagtat tcaaaagatt ttggagatga cggttcagtt 481 atggaaagtc taggtgtgcc ttttaaggat aatgttaata acggttgctt tgatgttata 541 gctgaatggg tacctttgct acaaccatac tttaatcatc aaattgatat ttccgataat 601 gagtattttg tttcgtttga ttatcgtgat ggtgattggt gatcaaatat tatcagggat 661 gagttgatat acgggcttct agtgttcatg gatgaacgct ggagcctcca aatgtagaaa 721 tgttatattt tttattgagt tcttggttat aattgctccg caatgattta aataagcatt 781 atttaaaaca ttctcaggag aggtgaaggt ggagctaaaa aaaagtattg gtgattacac 841 tgaaaccgaa ttcaaaaaat ttattgaaga catcatcaat tgtgaaggtg atgaaaaaaa 901 acaggatgat aacctcgagt attttataaa tgttactgag catcctagtg gttctgatct 961 gatttattac ccagaaggta ataatgatgg tagccctgaa ggtgttatta aagagattaa 1021 agaatggcga gccgctaacg gtaagtcagg atttaaacag ggctgaaata tgaatgccgg 1081 ttgtttatgg atgaatggct ggcattcttt cacaacaagg agtcgttatg aaaaaaataa 1141 cagggattat tttattgctt cttgcagtca ttattctgtc tgcatgtcag gcaaactata 1201 tccgggatgt tcagggcggg accgtatctc cgtcatcaac agctgaagtg accggattag 1261 caacgcagta acccgaaatc ctctttgaca aaaacaaagc gtgtcaggct gattctgatg 1321 cgcttttttt ttgaaatgtc acaaaaattc catgtgggag atgggatcta aaatcctcgt 1381 gcagaacttt ccatccaggg ggagaaaact tgtcgttttg agccgttcgg tgttcagaac 1441 gcacgaaacc gatcg //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS BLCNNS 961 bp ss-RNA VRL 15-MAR-1990 DEFINITION Bunyamwera virus small RNA segment, N and NSs protein. ACCESSION D00353 KEYWORDS N protein; NSs protein; SRNA; nonstructural protein; nucleocapsid protein; small RNA. SOURCE Bunyamwera virus, cDNA to viral RNA, clones pBU[NS14,N3/59,N308, N309,N93]. ORGANISM Bunyamwera virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Bunyavirus. REFERENCE 1 (bases 1 to 961) AUTHORS Elliott,R.M. TITLE Nucleotide sequence analysis of the small(S) RNA segment of Bunyamwera virus, the prototype of the family Bunyaviridae JOURNAL J. Gen. Virol. 70, 1281-1285 (1989) STANDARD full staff_entry COMMENT Submitted in computer readable form by R.M. Elliott on 18-Jan-1989. The virus contains the negative sense strand; the positive strand is shown below. FEATURES from to/span description pept 86 787 N protein pept 105 410 NSs protein BASE COUNT 298 a 187 c 215 g 261 t ORIGIN 1 agtagtgtac tccacactac aaacttgcta ttgttgaaaa tcgctgtgct attaaatcca 61 acagaaggtc attaaaggct ctttaatgat tgagttggaa tttcatgatg tcgctgctaa 121 caccagcagt acttttgacc cagaggtcgc atacgctaac tttaagcgtg tccacaccac 181 tgggcttagt tatgaccaca tacgaatctt ctacattaaa ggacgcgaga ttaaaactag 241 tctcgcaaaa agaagtgaat gggaagttac acttaacctt gggggctgga agattactgt 301 atataatacg aattttcctg gcaaccggaa caacccagtt cctgacgatg gtcttaccct 361 ccaccgcctc agtggattcc ttgccaggta cctacttgag aagatgctga aagtcagtga 421 accagagaaa ttgattatta aatcaaaaat aatcaaccct ttggctgaaa agaatgggat 481 cacttggaat gatggagagg aagtttatct ctctttcttc ccaggatcag agatgttctt 541 aggaactttc agattctacc ccttagcaat cgggatctac aaagttcagc gcaaggaaat 601 ggaaccaaaa taccttgaga aaacaatgcg gcagaggtac atgggactag aagcagcaac 661 ttggactgtt agtaaattga cagaagttca gtctgcactg acagttgtct ctagcttagg 721 ttggaagaaa accaatgtta gtgcagctgc cagggacttc cttgctaaat tcggaatcaa 781 catgtaagca gggatgcatt tttaatcggg ctaaagtcat ctgttttaat ttggctaaaa 841 gggttgtttc aacccacaaa ataacagctg cttgggtggg tggttgggga cagaaagaca 901 gcgggctaaa tcaacattat attgttaatg gtattttaag ttttaggtgg agcacactac 961 t // LOCUS DEN2NGC 2357 bp ss-RNA VRL 15-MAR-1990 DEFINITION Dengue virus type 2 (New Guinea C strain), cDNA to genomic RNA. ACCESSION D00346 KEYWORDS E protein; M protein; prM protein; structural protein. SOURCE Dengue virus type 2 (New Guinea C strain), cDNA to genomic RNA. ORGANISM Dengue virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Flaviviridae; Flavivirus (arbovirus group B). REFERENCE 1 (bases 1 to 2357) AUTHORS Gruenberg,A., Woo,W.S., Biedrzycka,A. and Wright,P.J. TITLE Partial Nucleotide Sequence and Deduced Amino Acid Sequence of the Structural Proteins of Dengue Virus Type 2, New Guinea C and PUO-218 Strains; JOURNAL J. Gen. Virol. 69, 1391-1398 (1988) STANDARD simple staff_review COMMENT Nucleotide 1 in the NGC sequence corresponds to nucleotide 77 counting from the 5'end of the DEN-2(JAM) sequence. FEATURES from to/span description pept 21 > 2357 viral polyprotein matp 21 362 C protein matp 363 860 prM protein matp 636 860 M protein matp 861 2345 E protein matp 2346 2357 NS 1 protein (amino end) BASE COUNT 782 a 471 c 595 g 509 t ORIGIN 20bp upstream from the C protein amino terminal end 1 aattagagag cagatctctg atgaataacc aacgaaaaaa ggcgagaaat acgcctttca 61 atatgctgaa acgcgagaga aaccgcgtgt cgactgtaca acagctgaca aagagattct 121 cacttggaat gctgcaggga cgaggaccat taaaactgtt catggccctg gtggcgttcc 181 ttcgtttcct aacaatccca ccaacagcag ggatactgaa gagatgggga acaattaaaa 241 aatcaaaagc cattaatgtt ttgagagggt tcaggaaaga gattggaagg atgctgaaca 301 tcttgaacag gagacgcaga actgcaggca tgatcattat gctgattcca acagtgatgg 361 cgttccattt aaccacacgt aacggagaac cacacatgat cgtcagtaga caagagaaag 421 ggaaaagtct tctgtttaaa acagaggatg gtgtgaacat gtgtaccctc atggccatgg 481 accttggtga attgtgtgaa gatacaatca cgtacaagtg tccttttctc aggcagaatg 541 aaccagaaga catagattgt tggtgcaact ctacgtccac atgggtaact tatgggacgt 601 gtaccaccac aggagaacac agaagagaaa aaagatcagt ggcactcgtt ccacatgtgg 661 gaatgggact ggagacacga actgaaacat ggatgtcatc agaaggggcc tggaaacatg 721 cccagagaat tgaaacttgg atcttgagac atccaggctt taccataatg gcagcaatcc 781 tggcatacac cataggaacg acacatttcc aaagagccct gattttcatc ttactgacag 841 ctgtcgctcc ttcaatgaca atgcgttgca taggaatatc aaatagagac tttgtagaag 901 gggtttcagg aggaagctgg gttgacatag tcttagaaca tggaagctgt gtgacgacga 961 tggcaaaaaa caaaccaaca ttggattttg aactgataaa aacagaagcc aaacaacctg 1021 ccactctaag gaagtactgt atagaggcaa agctgaccaa cacaacaaca gattctcgct 1081 gcccaacaca aggagaaccc agcctaaatg aagagcagga caaaaggttc gtctgcaaac 1141 actccatggt ggacagagga tggggaaatg gatgtggatt atttggaaaa ggaggcattg 1201 tgacctgtgc tatgttcaca tgcaaaaaga acatgaaagg aaaagtcgtg caaccagaaa 1261 acttggaata caccattgtg ataacacctc actcagggga agagcatgca gtcggaaatg 1321 acacaggaaa acatggcaag gaaatcaaaa taacaccaca gagttccatc acagaagcag 1381 agttgacagg ctatggcact gtcacgatgg agtgctctcc gagaacgggc ctcgacttca 1441 atgagatggt gttgctgcaa atggaaaata aagcttggct ggtgcacagg caatggttcc 1501 tagacctgcc gttgccatgg ctgcccggag cggacacaca aggatcaaat tggatacaga 1561 aagagacatt ggtgactttc aaaaatcccc atgcgaagaa acaggatgtt gttgttttgg 1621 gatcccaaga aggggccatg cacacagcac tcacaggggc cacagaaatc cagatgtcat 1681 caggaaactt actgttcaca ggacatctca agtgcaggct gaggatggac aaactacagc 1741 tcaaaggaat gtcatactct atgtgcacag gaaagtttaa agttgtgaag gaaatagcag 1801 aaacacaaca tggaacaata gttatcagag tacaatatga aggggacggt tctccatgta 1861 agatcccttt tgagataatg gatttggaaa aaagacatgt tttaggtcgc ctgattacag 1921 tcaacccaat cgtaacagaa aaagatagcc cagtcaacat agaagcagaa cctccattcg 1981 gagacagcta catcatcata ggagtagagc cgggacaatt gaagctcaac tggtttaaga 2041 aaggaagttc tatcggccaa atgattgaga caacaatgag gggagcgaag agaatggcca 2101 ttttaggtga cacagcttgg gattttggat ccctgggagg agtgtttaca tctataggaa 2161 aggctctcca ccaagttttc ggagcaatct atggggctgc cttcagtggg gtctcatgga 2221 ctatgaaaat cctcatagga gtcattatca catggatagg aatgaattca cgcagcacct 2281 cactttctgt gtcactagta ttggtgggag tcgtgacgct gtatttggga gttatggtgc 2341 aggccgatag tggttgc // LOCUS HS1IRLULR 13052 bp ds-DNA VRL 15-MAR-1990 DEFINITION Herpes simplex virus type 1 (HSV-1) genome, rightmost part of the long unique region (UL) and all of the internal long repeat region (IRL). ACCESSION D00374 KEYWORDS IE110; IE63; UL54; UL55; UL56; immediate-early protein; internal long repeat region; long repeat region; long unique region; transcriptional activator; transcriptional modulating protein. SOURCE HSV-1 (strain 17) DNA, clones BamHI b, XhoI c, BamHI k, HpaI s plus v. ORGANISM Herpes simplex virus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 13052) AUTHORS Perry,L.J. and McGeoch,D.J. TITLE The DNA sequences of the long repeat region and adjoining parts of the long unique region in the genome of herpes simplex virus type 1 JOURNAL J. Gen. Virol. 69, 2831-2846 (1988) STANDARD full staff_entry COMMENT There were two small divergences within the two versions of the UL proximal part of RL (discussed in [1]). FEATURES from to/span description pept 413 1951 immediate-early transcriptional modulating protein IE63 (gene UL54) ORF 2175 2735 ORF of gene UL55 ORF 3602 3009 (c) ORF of gene UL56 pept 10787 10731 (c) IE110 exon 1 9965 9299 (c) IE110 exon 2 9162 7559 (c) IE110 exon 3 mRNA 275 1974 IE63 mRNA pre-msg 10935 7350 (c) IE110 mRNA and introns IVS 10730 9967 (c) IE110 intron 1 IVS 9298 9163 (c) IE110 intron 2 rpt 3837 4017 reiteration set 1 rpt 4224 4244 reiteration set 2 rpt 4465 4496 reiteration set 3 rpt 7170 7317 reiteration set 4 rpt 10422 10583 reiteration set 5 rpt 12007 12060 reiteration set 6 rpt 12730 12952 reiteration set 7 refnumbr 1 1 numbered 113322 in [1] signal 247 251 TATA box signal 3792 3786 (c) TATA box signal 10962 10958 (c) TATA box signal 1956 1961 polyadenylation signal signal 2777 2782 polyadenylation signal signal 2880 2875 (c) polyadenylation signal signal 2884 2879 (c) polyadenylation signal signal 7372 7367 (c) polyadenylation signal signal 7412 7407 (c) polyadenylation signal variant 1055 1062 eight c residues in HpaI s plus v clone; seven c residues in BamHI b clone site 3836 3836 end of UL BASE COUNT 1933 a 4879 c 4243 g 1997 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccaac gaccccgccc atgggtccca attggccgtc ccgttaccaa gaccaaccca 61 gccagcgtat ccacccccgc ccgggtcccc gcggaagcgg aacggggtat gtgatatgct 121 aattaaatac atgccacgta cttatggtgt ctgattggtc cttgtctgtg ccggaggtgg 181 ggcgggggcc ccgcccgggg ggcggaacga ggaggggttt gggagagccg gccccggcac 241 cacgggtata aggacatcca ccacccggcc ggtggtggtg tgcagccgtg ttccaaccac 301 ggtcacgctt cggtgcctct ccccgattcg ggcccggtcg ctcgctaccg gtgcgccacc 361 accagaggcc atatccgaca ccccagcccc gacggcagcc gacagcccgg tcatggcgac 421 tgacattgat atgctaattg acctcggcct ggacctctcc gacagcgatc tggacgagga 481 cccccccgag ccggcggaga gccgccgcga cgacctggaa tcggacagca gcggggagtg 541 ttcctcgtcg gacgaggaca tggaagaccc ccacggagag gacggaccgg agccgatact 601 cgacgccgct cgcccggcgg tccgcccgtc tcgtccagaa gaccccggcg tacccagcac 661 ccagacgcct cgtccgacgg agcggcaggg ccccaacgat cctcaaccag cgccccacag 721 tgtgtggtcg cgcctcgggg cccggcgacc gtcttgctcc cccgagcagc acgggggcaa 781 ggtggcccgc ctccaacccc caccgaccaa agcccagcct gcccgcggcg gacgccgtgg 841 gcgtcgcagg ggtcggggtc gcggtggtcc cggggctgcc gatggtttgt cggacccccg 901 ccggcgtgcc cccagaacca atcgcaaccc tgggggaccc cgccccgggg cggggtggac 961 ggacggcccc ggcgcccccc atggcgaggc gtggcgcggc agtgagcagc ccgacccacc 1021 cggaggccag cggacacggg gcgtgcgcca agcacccccc ccgctaatga cgctggcgat 1081 tgcccccccg cccgcggacc cccgcgcccc ggccccggag cgaaaggcgc ccgccgccga 1141 caccatcgac gccaccacgc ggttggtcct gcgctccatc tccgagcgcg cggcggtcga 1201 ccgcatcagc gagagctttg gccgcagcgc acaggtcatg cacgacccct ttggggggca 1261 gccgtttccc gccgcgaata gcccctgggc cccggtgctg gcgggccaag gagggccctt 1321 tgacgccgag accagacggg tctcctggga aaccttggtc gcccacggcc cgagcctcta 1381 tcgcactttt gccggcaatc ctcgggccgc atcgaccgcc aaggccatgc gcgactgcgt 1441 gctgcgccaa gaaaatttca tcgaggcgct ggcctccgcc gacgagacgc tggcgtggtg 1501 caagatgtgc atccaccaca acctgccgct gcgcccccag gaccccatta tcgggacgac 1561 cgcggctgtg ctggataacc tcgccacgcg cctgcggccc tttctccagt gctacctgaa 1621 ggcgcgaggc ctgtgcggcc tggacgaact gtgttcgcgg cggcgtctgg cggacattaa 1681 ggacattgca tccttcgtgt ttgtcattct ggccaggctc gccaaccgcg tcgagcgtgg 1741 cgtcgcggag atcgactacg cgacccttgg tgtcggggtc ggagagaaga tgcatttcta 1801 cctccccggg gcctgcatgg cgggcctgat cgaaatccta gacacgcacc gccaggagtg 1861 ttcgagtcgt gtctgcgagt tgacggccag tcacatcgtc gcccccccgt acgtgcacgg 1921 caaatatttt tattgcaact ccctgtttta ggtacaataa aaacaaaaca tttcaaacaa 1981 atcgcccctc gtgttgtcct tctttgctca tggccggcgg ggcgtgggtc acggcagatg 2041 gcgggggtgg gcccggcgta cggcctgggt gggcggaggg aactaaccca acgtataaat 2101 ccgtccccgt tccaaggccg gtgtcatagt gcccttagga gcttcccgcc cgggcgcatc 2161 cccccttttg cactatgaca gcgacccccc tcaccaacct gttcttacgg gccccggaca 2221 taacccacgt ggccccccct tactgcctca acgccacctg gcaggccgaa acggccatgc 2281 acaccagcaa aacggactcc gcttgcgtgg ccgtgcggag ttacctggtc cgcgcctcct 2341 gtgagaccag cggcacaatc cactgctttt tctttgcggt atacaaggac acccaccaca 2401 cccctccgct gattaccgag ctccgcaact ttgcggacct ggttaaccac ccgccggtcc 2461 tacgcgaact ggaggataag cgcggggtgc ggctgcggtg tgcgcggccg tttagcgtcg 2521 ggacgattaa ggacgtctct gggtccggcg cgtcctcggc gggagagtac acgataaacg 2581 ggatcgtgta ccactgccac tgtcggtatc cgttctcaaa aacatgctgg atgggggcct 2641 ccgcggccct acagcacctg cgctccatca gctccagcgg catggccgcc cgcgcggcag 2701 agcatcgacg cgtcaagatt aaaattaagg cgtgatctcc aaccccccca tgaatgtgtg 2761 taaccccccc caaaaaaata aagagccgta acccaaccaa accaggcgtg gtgtgagttt 2821 gtggacccaa agccctcaga gacaacgcga caggccagta tggaccgtga tacttttatt 2881 tattaactca caggggcgct taccgccaca ggaataccag aataatgacc accacaatcg 2941 cgaccacccc aaatacagca tggcgccaca ccacgccaca acagccctgt cgccggtatg 3001 gggcatgatc agacgagccg cgccgcgcgt tgggccctgt acagctcgcg cgaattgacc 3061 ctaggaggcc gccacgcgcc cgagttttgc gttcgtcgct ggtcgtcggg cgccaaagcc 3121 ccggacggct gttcggtcga acgaacggcc acgacagtgg cataggttgg ggggtggtcc 3181 gacatagcct cggcgtacgt cgggaggccc gacaagaggt cccttgtgat gtcgggtggg 3241 gccacaagcc tggtttccgg aagaaacagg ggggttgcca ataacccgcc agggccaaaa 3301 ctccggcgct gcgcacgtcg ttcggcgcgg cgccgggcgc gccgagcggc tcgctgggcg 3361 gcttggcgtg agcggccccg ctccgacgcc tcgccctctc cggaggaggt tggcggaatt 3421 ggcacggaca acaggggccc agcagagtac ggtggaggtg ggtccgtggg ggtgtccaga 3481 tcaataacga caaacggccc ctcgttccta ccagacaagc tatcgtaggg gggcggggga 3541 tcagcaaacg cgttccccgc gctccataaa cccgcgtcgg gttgcgccgc ctccgaagcc 3601 atggatgcgc cccaaagcca cgactcccgc gcgctaggtc cttggggtaa tggaaaaggc 3661 cctactcccc atccaagcca gccaagttaa cgggctacgc cttcgggaat gggactggca 3721 ccccggcgga ttttgttggg ctggcatgcg tcgcccaacc gagggccgcg tccacgggac 3781 gcgcctttta taaccccggg ggtcattccc aacgatcaca tgcaatctaa ctggctcccc 3841 tctccccccc tctcccctct ccccccctct cccctctccc cccctctccc ctctcccccc 3901 ctctcccctc tccccccctc tcccctctcc ccccctctcc cctctccccc cctctcccct 3961 ctccccccct ctcccctctc cccccctctc ccctctcccc ccctctcccc tctcccctct 4021 gctctttccc cgtgacaccc gacgctgggg gcgtggctgc cgggaggggc cgcggatggg 4081 cgggcctact tggtttcccg cccccccccc ccccccccga accgccccgc cggctttgcc 4141 cccctttgat cccctgctac ccccaacccg tgctggtggt gcgggttggg gggggatgtg 4201 ggcgggggtg cgcgggaggt gtcggtggtg gtggtggtgg tggtagtagg aatggtggtg 4261 aggggggggg ggcgctggtt ggtcaaaaaa gggagggacg ggggccggca gaccgacggc 4321 gacaacgctc cccggcggcc gggtcgcggc tcttacgagc ggcccggccc gcgctcccac 4381 cccccgggcc gtgtccttgc tttccccccg tctccccccc ccccgccttc tcctcctcct 4441 cctcgttttt ccaaaccccg cccacccggc ccggcccggc ccggcccggc ccggccaccg 4501 ccgcccaccc acccacctcg ggatacccag ccccggtccc ccgttccccg ggggccgtta 4561 tctccagcgc cccgtccggc gcgccgcccc ccgccgctaa accccatccc gcccccggga 4621 ccccacatat aagcccccag ccacacgcaa gaacagacac gcagaacggc tgtgtttatt 4681 taaataaacc aatgtcggaa taaacaaaca caaacacccg cgacgggggg acggagggga 4741 cggagggagg gggtgacggg ggacgggaac agacacaaaa acaaccacaa aaaacaacca 4801 cccaccgaca cccccacccc agtctcctcg ccttctccca cccaccccac gcccccactg 4861 agcccggtcg atcgacgagc acccccgccc acgcccccgc ccctgccccg gcgacccccg 4921 gcccgcacga tcccgacaac aataacaacc ccaacggaaa gcggcggggt gttgggggag 4981 gcgaggaaca accgagggga acgggggatg gaaggacggg aagtggaagt cctgataccc 5041 atcctacacc cccctgcctt ccaccctccg gccccccgcg agtccacccg ccggccggct 5101 accgagaccg aacacggcgg ccgccgcagc cgccgcagcc gccgccgaca ccgcagagcc 5161 ggcgcgcgca ctcacaagcg gcagaggcag aaaggcccag agtcattgtt tatgtggccg 5221 cgggccagca gacggcccgc gacacccccc ccccgcccgt gtgggtatcc ggccccccgc 5281 cccgcgccgg tccattaagg gcgcgcgtgc ccgcgagata tcaatccgtt aagtgctctg 5341 cagacagggg caccgcgccc ggaaatccat taggccgcag acgaggaaaa taaaattaca 5401 tcacctaccc acgtggtgct gtggcctgtt tttgctgcgt catctcagcc tttataaaag 5461 cgggggcgcg gccgtgccga tcgcgggtgg tgcgaaagac tttccgggcg cgtccgggtg 5521 ccgcggctct ccgggccccc ctgcagccgg ggcggccaag gggcgtcggc gacatcctcc 5581 ccctaagcgc cggccggccg ctggtctgtt ttttcgtttt ccccgtttcg ggggtggtgg 5641 gggttgcggt ttctgtttct ttaacccgtc tggggtgttt ttcgttccgt cgccggaatg 5701 tttcgttcgt ctgtcccctc acggggcgaa ggccgcgtac ggcccgggac gaggggcccc 5761 cgaccgcggc ggtccgggcc ccgtccggac ccgctcgccg gcacgcgacg cgaaaaaggc 5821 cccccggagg cttttccggg ttcccggccc ggggcctgag atgaacactc ggggttaccg 5881 ccaacggccg gcccccgtgg cggcccggcc cggggccccg gcggacccaa ggggccccgg 5941 cccggggccc cacaacggcc cggcgcatgc gctgtggttt ttttttcctc ggtgttctgc 6001 cgggctccat cgcctttcct gttctcgctt ctcccccccc ccttcttcac ccccagtacc 6061 ctcctccctc ccttcctccc ccgttatccc actcgtcgag ggcgccccgg tgtcgttcaa 6121 caaagacgcc gcgtttccag gtaggttaga cacctgcttc tccccaatag agggggggga 6181 cccaaacgac agggggcgcc ccagaggcta aggtcggcca cgccactcgc gggtgggctc 6241 gtgttacagc acaccagccc gttcttttcc ccccctccca cccttagtca gactctgtta 6301 cttacccgtc cgaccaccaa ctgccccctt atctaagggc cggctggaag accgccaggg 6361 ggtcggccgg tgtcgctgta accccccacg ccaatgaccc acgtactcca agaaggcatg 6421 tgtcccaccc cgcctgtgtt tttgtgcctg gctctctatg cttgggtctt actgcctggg 6481 gggggggagt gcgggggagg gggggtgtgg aaggaaatgc acggcgcgtg tgtacccccc 6541 ctaaagttgt tcctaaagcg aggatacgga ggagtggcgg gtgccggggg accggggtga 6601 tctctggcac gcgggggtgg gaagggtcgg gggagggggg gatggagtac cggcccacct 6661 ggccgcgcgg gtgcgcgtgc ctttgcacac caaccccacg tcccccggcg gtctctaaga 6721 agcaccgccc cccctccttc ataccaccga gcatgcctgg gtgtgggttg gtaaccaaca 6781 cgcccatccc ctcgtctcct gtgattctct ggctgcaccg cattcttgtt ttctaactat 6841 gttcctgttt ctgtctcccc cccccccacc cctccgcccc accccccaac acccacgtct 6901 gtggtgtggc cgaccccctt ttgggcgccc cgtcccgccc cgccacccct cccatccttt 6961 gttgccctat agtgtagtta accccccccg ccctttgtgg cggccagagg ccaggtcagt 7021 ccgggcgggc aggcgctcgc ggaaacttaa cacccacacc caacccactg tggttctggc 7081 tccatgccag tggcaggatg ctttcgggga tcggtggtca ggcagcccgg gccgcggctc 7141 tgtggttaac accagagcct gcccaacatg gcacccccac tcccacgcac ccccactccc 7201 acgcaccccc actcccacgc acccccactc ccacgcaccc ccactcccac gcacccccac 7261 tcccacgcac ccccactccc acgcaccccc actcccacgc acccccactc ccacgcatcc 7321 ccgcgataca tccaacacag acagggaaaa gatacaaaag taaaccttta tttcccaaca 7381 gacagcaaaa atcccctgag ttttttttta ttagggccaa cacaaaagac ccgctggtgt 7441 gtggtgcccg tgtctttcac ttttcccctc cccgacacgg attggctggt gtagtgggcg 7501 cggccagaga ccacccagcg cccgaccccc ccctccccac aaacacgggg ggcgtccctt 7561 attgttttcc ctcgtcccgg gtcgacgccc cctgctcccc ggaccacggg tgccgagacc 7621 gcaggctgcg gaagtccagg gcgcccacta gggtgccctg gtcgaacagc atgttcccca 7681 cgggggtcat ccagaggctg ttccactccg acgcgggggc cgtcgggtac tcggggggca 7741 tcacgtggtt acccgcggtc tcggggagca gggtgcggcg gctccagccg gggaccgcgg 7801 cccgcagccg ggtcgccatg tttcccgtct ggtccaccag gaccacgtac gccccgatgt 7861 tccccgtctc catgtccagg atgggcaggc agtcccccgt gatagtcttg ttcacgtaag 7921 gcgacagggc gaccacgcta gagacccccg agatgggcag gtagcgcgtg aggccgcccg 7981 cggggacggc cccggaagtc tccgcgtggc gcgtcttccg ggcacacttc ctcggccccc 8041 gcggcccaga agcagcgcgg gggccgaggg aggtttcctc ttgtctccct cccagggcac 8101 cgacggcccc gcccgaggag gcggaagcgg aggaggacgc ggccccggcg gcggaagagg 8161 cggcccccgc gggggtcggg gccgaggagg aagaggcaga ggaggaagag gcggaggccg 8221 ccgaggacgt caggggggtc ccgggcccac cctggccgcg cccccccggc cctgagtcgg 8281 agggggggtg cgtcgccgcc ctcttggccc ctgccggcgc gaggggggga cgcgtggact 8341 ggggggaggg gttttcctgg cccgacccgc gcctcttcct cggacgcacc gccgcctcct 8401 gctcgacaga ggcggcggag gggagcgggg cggcgccgga gggggcggcg ccgcgggagg 8461 gcccgtgccc accctccacg cccggccccc ccgagccgcg cgccaccgtc gcacgcgccc 8521 ggcacagact ctgttcttgg ttcgcggcct gagccaggga cgagtgcgac tggggcacac 8581 ggcgcgcgtc cgcggggcgg gcggccggct ccgccccggg ggccggggcg cgggggccgg 8641 gccccggagg cggcgctcgc acgcacgggg ccacggccgc gcgggggcgc gcgggtcccg 8701 acgcggccgc ggacgcgggg ggcccggggc ggggggcgga gcctggcatg ggcgccgcgg 8761 ggggcctgtg gggagaggcc gggggggagt cgctgatcac tatggggtct ctgttgtttg 8821 caaggggggc gggtctgttg acaagggggc ccgtccggcc cctcggccgc cccgcctccg 8881 cttcaacaac cccaacccca accccaaccc ccccggaggg gccagacgcc ccccgcggcg 8941 ccgcggctcg cgactggcgg gagccgccgc cgccgctgct gttggtggtg gtgttggtgt 9001 tactgctgcc gtgtggcccg atgggcgccg aggggggcgc tgtccgagcc gcggccggct 9061 ggggggctgc gtgagacgcc ccgcccgtca cggggggcgc ggcggcgcct ctgcgtgggg 9121 gggcgcgggg cgtccggcgg ggggcgggcg gtacgtagtc tgctgcaaga gacaacgggg 9181 ggcgcgatca ggttacgccc cctccccggc ccgccctttc ctcgcccgcc cgcctattcc 9241 tccctccccc cccctcctcc tcctcctccc ccagggtcct tgccgccccc cgcctcaccg 9301 tcgtccaggt cgtcgtcatc ctcgtccgtg gtgggctccg ggtgggtggg cgacagggcc 9361 ctcaccgtgt gcccccccag ggtcaggtac cgcggggcga accgctgatt gcccgtccag 9421 ataaagtcca cggccgtgcc cgccctgacg gcctcctcgg cctccatgcg ggtctggggg 9481 tcgttcacga tcgggatggt gctgaacgac ccgctgggcg tcacgcccac tatcaggtac 9541 accagcttgg cgttgcacag cgggcaggtg ttgcgcaatt gcatccaggt tttcatgcac 9601 gggatgcaga agcggtgcat gcacgggaag gtgtcgcagc gcaggtgggg cgcgatctca 9661 tccgtgcaca cggcgcacac gtcgccctcg tcgctccccc cgtcctctcg agggggggcg 9721 cccccgcaac tgccggggtc ttcctcgcgg ggggggctcc cccccgagac cgccccccca 9781 tccacgccct gcggccccag cagccccgtc tcgaacagtt ccgtgtccgt gctgtccgcc 9841 tcggaggcgg agtcgtcgtc atggtggtcg gcgtcccccc gcccccccac ttcggtctcc 9901 gcctcagagt cgctgctgtc cggcaggtct cggtcgcagg gaaacaccca gacatccggg 9961 gcgggctaag gggaaaaaag gggggcgggt aagaatgggg ggggatttcc cgcgtcaatc 10021 agcacccacg agttccccct ctcccccccc cgcctcacaa agtcctgccc ccctgctggc 10081 ctcggaagag gggggagaaa ggggtctgca accaaaggtg gtctgggtcc gtcctttgga 10141 tcccgacccc tcttcttccc tcttctcccg ccctccagac gcaccggagt cgggggtccc 10201 acggcgtccc ccaaatatgg cgggcggctc ctccccaccc ccctagatgc gtgtgagtaa 10261 ggggggcctg cgtatgagtc agtggggacc acgcccccaa cacggcgacc ccggtccttg 10321 tgtgtttgtt gtgggggcgt gtctctgtgt atgagtcagg gggtcccacg gcgaccccgg 10381 gccctgcgtc tgagtcaaag gggccatgtg tatgtgttgg gggtctgtat atataaagtc 10441 agggggtcac atggcgaccc ccaacagggc gaccccggtc cctgtatata tagggtcagg 10501 gggttccgca ccccctaaca tggcgccccc ggtccctgta tatatagtgt cacggggttc 10561 cacgccccct aacatggcgc cccaacatgg cgcccggctc ccgtgtatga gtgggggtcc 10621 cccaacatgg cggccggttc cagtgtaagg gtcgggggtc ccccaacatg gcgcccccca 10681 atatggcgcc ccccaatatg gcgccccaga catggcgccc ggcccctcac ctcgcgctgg 10741 gggcggccct caggccggcg ggtactcgct ccggggcggg gctccatggg ggtcgtatgc 10801 ggctggaggg tcgcggacgg agggtccctg ggggtcgcaa cgtaggcggg gcttctgtgg 10861 tgatgcggag agggggcggc ccgagtctgc ctggctgctg cgtctcgctc cgagtgccga 10921 ggtgcaaatg cgaccagact gtcgggccag ggctaactta taccccacgc ctttcccctc 10981 cccaaagggg cggcagtgac gattccccca atggccgcgc gtcccagggg aggcaggccc 11041 accgcggggc ggccccgtcc ccggggacca acccggcgcc cccaaagaat atcattagca 11101 tgcacggccc ggcccccgat ttgggggccc aacccggtgt cccccaaaga accccattag 11161 catgcccctc ccgccgacgc aacaggggct tggcctgcgt cggtgccccg gggcttcccg 11221 ccttcccgaa gaaactcatt accatacccg gaaccccagg ggaccaatgc gggttcattg 11281 agcgacccgc gggccaatgc gcgaggggcc gtgtgttccg ccaaaaaagc aattagcata 11341 acccggaacc ccaggggagt ggttacgcgc ggcgcgggag gcggggaata ccggggttgc 11401 ccattaaggg ccgcgggaat tgccggaagc gggaagggcg gccggggccg cccattaatg 11461 agtttctaat taccataccg ggaagcggaa caaggcctct tgcaagtttt taattaccat 11521 accgggaagt gggcggcccg gcccattggg cggtaactcc cgcccaatgg gccgggcccc 11581 gaagactcgg cggacgctgg ttggccgggc cccgccgcgc tggcggccgc cgattggcca 11641 gtcccgcccc cgaggcggcc cgccctgtga gggcgggctg gctccaagcg tatatatgcg 11701 cggctcctgc catcgtctct ccggagagcg gcttggtgcg gagctcccgg gagctccgcg 11761 gaagacccag gccgcctcgg gtgtaacgtt agaccgagtt cgccgggccg gctccgcggg 11821 ccagggcccg ggcacgggcc tcgggcccca ggcacggccc gatgaccgcc tcggcctccg 11881 ccacccggcg ccggaaccga gcccggtcgg cccgctcgcg ggcccacgag ccgcggcgcg 11941 ccaggcgggc ggccgaggcc cagaccacca ggtggcgcac ccggacgtgg ggcgagaagc 12001 gcacccgcgc gggggtcgcg ggggtcgcgg gggtcgcggg ggtcgcgggg gtcgcggggg 12061 gctccggcgc cccctccccg cccgcgcgtc gcaggcgcag gcgcgccagg tgctccgcgg 12121 tgacgcgcag gcggagggcg aggcgcggcg gaaggcggaa ggggcgcgag ggggggtggg 12181 aggggtcagc cccgcccccc gggcccacgc cgggcggtgg gggcccgggg ggcggggggc 12241 ggcggcggtg ggccgggcct ctggcgccga ctcgggcggg gggctgtccg gccagtcgtc 12301 gtcatcgtcg tcgtcggacg cggactcggg aacgtggagc cactggcgca gcagcagcga 12361 acaagaaggc gggggcccac cggcgggggg cggcggcggg gcggccgcgg gcgcgctcct 12421 gaccgcgggt tccgagttgg gcgtggaggt tacctgggac tgtgcggttg ggacggcgcc 12481 cgtgggcccg ggcggccggg ggcggcgggg gccgcgatgg cggcggcggc gggccatgga 12541 gacagagagc gtgccggggt ggtagagttt gacaggcaag catgtgcgtg cagaggcgag 12601 tagtgcttgc ctgtctaact cgctagtctc ggccgcgggg ggcccgggct gcccgccgcc 12661 accgctttaa agggccgcgc gcgacccccg gggggtgtgt tttggggggg gcccgttttc 12721 ggcgtctggc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg 12781 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg 12841 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cctccccccg 12901 ctcctccccc cgctcctccc cccgctcctc cccccgctcc tccccccgct cccgcggccc 12961 cgccccccac gcccgccgcg cgcgcgcacg ccgcccggac cgccgcccgc cttttttgcg 13021 cgcgcgcgcg cccgcggggg gcccgggctg cc // LOCUS HS5IE5KB1 2520 bp ds-DNA VRL 15-DEC-1989 DEFINITION Human cytomegalovirus genome, BamHI-HindIII fragment (5'-terminal part of the 5 kb transcript from the immediate-early region). ACCESSION D00328 KEYWORDS immediate-early gene; transforming region. SOURCE Human cytomegalovirus (strain AD169) genomic DNA, clone pAT153 provided by J. D. Oram and R. G. Downing. ORGANISM Human cytomegalovirus Viridae; ds-DNA enveloped viruses; Herpesviridae; Betaherpesvirinae. REFERENCE 1 (bases 1 to 2520) AUTHORS Kouzarides,T., Bankier,A.T. and Barrell,B.G. TITLE Nucleotide sequence of the transforming region of human cytomegalovirus JOURNAL Mol. Biol. Med. 1, 47-58 (1983) STANDARD full staff_entry REFERENCE 2 (sites; 5 kb RNA start site) AUTHORS Plachter,B., Traupe,B., Albrecht,J. and Jahn,G. TITLE Abundant 5 kb RNA of human cytomegalovirus without a major translational reading frame JOURNAL J. Gen. Virol. 69, 2251-2266 (1988) STANDARD full staff_entry COMMENT In [2], the 5' end of the 5 kb RNA was determined by primer extension. In [1], the BamHI-HindIII fragment was described as the sequence containing the region capable of transforming NIH3T3 cells. FEATURES from to/span description site 1664 1664 5 kb RNA start site BASE COUNT 647 a 713 c 589 g 571 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccgca gcagtccgtt ggcggagtcc gaggagtgct gaccgccgct cccgccgccg 61 ccaccgccac caccagcgcc gccgcctcca ccaccaccgg cagacgagga cgactttttg 121 cgccgttctt cgtgacgctg ttcctgcctt cgccgttgct gttcctccgc agaagggccg 181 tcgcgagtcc cgccgctgcc acccagcgga ggacacgcag acggcggaag cggtagacgc 241 ggcgccgcaa ccaccgcctc cgctggagga ttcgccgtgg tttttcaggt aatgccgcac 301 gtaagtcact tgcaaattac cgttctcgga aatcatggtg agcagcgcgc tctcattggg 361 tccgctggag cccaccaccg aggagacgga tttgttgaag acgataccgc cgcgtacaaa 421 gaggtgctcc tgcagctccc cgtcgcccgt aatgtcaata gacatgaagc cctgctgcgt 481 cttggcgccg gccgaagcct cgccgtgctg cataatggta gcgcagagcc agcccttgtt 541 gaggtgcagc accttgccat cgccgtccac gcagttgacc agacgcgcgg tatcgaagac 601 gaactggcgc acgtcgaaag tctgattgac gctttgatgc aggatgcgat taggattcgc 661 aaaagtccag tattttcgca cgacggtagt agggagatcc atgacgcggc ggcgcaaagc 721 gcgagcgcaa cgctcgtcgg aggccgtgga gcgagtgccg ccgcagccgg cagagcgccg 781 aaccccgtcg cagactctat ttatacatca tctttccagc ccgcctagca acacccacaa 841 acaacgtcac gacgcaacgt ggttaaacag tacgtttatt aaagtaactg ggtgaacgac 901 accggagcgg actgcaaatc gcaacgctac tttctcgagt gcagatactc ttcgagacgg 961 ctaaacaacg tgtccaactc gctgagacct ttccgcgtgc cgttatccga tttcctttcc 1021 gcctcctgag acagccgact aatcacggac ttatctccgc aacctaacag aggctgggag 1081 cccgacaaaa gtaaaacagc gtgctatgaa cacgttgtta cctctgtgcg gacagcgccg 1141 ccacagagac acttacacat tgccgcatgt ctttgtagat ggattctagc gtcgagcgca 1201 tactatgcaa ttccgtcttg agtccgggat agacgtggtc gcctgcggga aacacgatct 1261 ccagataccg cctcaacaac cagtccatga cgctgcatcc ccaacagcct ttgaccaccg 1321 taccgtcgag ccacacggag tagtcgtcct cacgttgcta caagaggaaa actacgtcac 1381 ccgacacgcg gaaaagaaag accgtcgcaa taaaccgtac ctacgtgacc taccaacgta 1441 ggttttactc gatgaaaggt gacgcggaga tcttgcaatc tggtcgcgta atcctctgga 1501 cgacactgcg gctttgtatt ctttatcgtc gtcgtcgccg gcttcgcctc ctcggaagcg 1561 cctagaaaaa agacgatcag gaccagagag gaagagacca tcaccgacag catcgccgca 1621 gcatgccgtc ccagtccgcc gcccaactgc gcgtcccagg taggtggtcc tttatgttat 1681 gatgtttttg tcaatttttt ttttcaattt ctttcttccg cggttagaat agtttctgta 1741 ggaaccaatt atcaatctga cgggttatcg tcaccacttg atggcaaaac gaaatttttt 1801 ttttcattgc cttgaagtct ctcccgccac caccaccacc gccgttgtct ccggctggag 1861 atcaagacga aattcctcct ctctaaaaaa aaaggtggtg ggcttaattg atcatggcaa 1921 gaagaaaaac tatactgaat aaactgtgtg caaaactact agtaacaaca aaaatagcga 1981 ctagatacac cacggacaat ctcagcagat actctctcaa aagaaaaaaa agacgccgta 2041 acgtcggaga atctggtatc tactgcctga cgaatttttt tttcgtccat gtatgtgatt 2101 acgagtagta gtggtatgta gaacaagaag aaaaatcgta gtccccaaaa ggataataaa 2161 aataacactc atagagaatc acagattttc tctagacaac tctctatcca aataacgaat 2221 gtgaagcgta caaagtaaga tattcaaaga atagcacctt catagattca tttcagcttt 2281 ctactccttg taatttaaag ttgcactaaa caaagctctt aaagaaggtt cgagccgctc 2341 tcgatcactc atcgatcacg cgagtcttat tattccacca caacgtaaca ttcttcactt 2401 tgtagagaca ctttatcgta gagtaaccct cgatttccta gctgttgttt tttgattatt 2461 ttgttcgctc taagagagat actcgaaatc ctacttacac caaggaccct acatcatcgc // LOCUS HS5IE5KB2 1291 bp ds-DNA VRL 15-DEC-1989 DEFINITION Human cytomegalovirus genome, 3'-terminal part of the 5 kb transcript from the immediate-early region. ACCESSION D00327 KEYWORDS immediate-early gene. SOURCE Human cytomegalovirus (strain AD169) genomic DNA, clones pGJ0.1, pGJ0.2, pGJ0.3, pGJ0.5, pGJ0.6, and pGJ0.7. ORGANISM Human cytomegalovirus Viridae; ds-DNA enveloped viruses; Herpesviridae; Betaherpesvirinae. REFERENCE 1 (bases 1 to 1291) AUTHORS Plachter,B., Traupe,B., Albrecht,J. and Jahn,G. TITLE Abundant 5 kb RNA of human cytomegalovirus without a major translational reading frame JOURNAL J. Gen. Virol. 69, 2251-2266 (1988) STANDARD full staff_entry COMMENT One of the predominant transcripts from the immediate early region is a 5 kb RNA. This sequence analysis revealed multiple stop codons throughout the AT-rich potential coding region. FEATURES from to/span description RNA < 1 1267 5 kb RNA (3'-terminal part) (alt.) RNA < 1 1280 5 kb RNA (3'-terminal part) (alt.) BASE COUNT 384 a 333 c 275 g 299 t ORIGIN 15 bp upstream of EcoRI site. 1 ctttttattt tttcgaattc atgttcgaaa acacaagctt ccataacaag aacccgtacc 61 gaagaaaagt tccatcgact aaaaagaaaa aagaaaacga agcaagacct cgacgacaac 121 aacacatcaa agaaagacga ccagctgatt atgttcttag aattccacac acccgcgagc 181 cgatccgcaa acgtcgtgcg aggcgcgctt tctctggctc gacacaatga tcacaccgca 241 cgctatagac acgtcgtcgt ggacgacgat gacctcaggc cacgaatgac aaccaacatg 301 ggcaaagtcc aattagccaa aaagacgacg attctaagaa ttgatgaatc ctcgatatac 361 gcctatcgat aggtttcaat tgtgtcatat acatcaaatg aaaaacagga cgcacgataa 421 aagcttcctt acagcataac tgtaacatac gatcatggaa catctcctca catacctttc 481 tcctctcaca taggaaaaca aaactctttt ttttctttcc tgtcaaggaa aaaatcaatg 541 taccaccaca tcactttctc ctcggtcccg gcgacggatg ggcgcgcacg cggacaaaga 601 cccaccggcc acttccactt attttttgtt gttaatcgtc ttctcccccg cacgcggacg 661 accaccaacg ctagctgctc attccgtcaa ccagtcacac cgcgcacgga gaaggggccg 721 gggtccgcgg gcacccgcgg cggaggcgcg gttccctctc tctaattccc tggaaaacaa 781 gtaatgacaa acaaaaagac gacaaaaggt ctctattctg ctacatgaga gaaattatag 841 ctgttggcaa tttttcaaaa tacatgttat aaggcatcct ctctgccaca cgcgcagtca 901 cggataggat cagtgcgtat tcattataaa aaaaaacaca aacaacccat atatgtgaag 961 cagaatgatg accgaccgca cggagcgacg ccgtcgactg tcagcctcgc gaggagacac 1021 cgcggaccgg ggaaacggat aagtttacga acagaaatct caaaagacgc tgacccgata 1081 agtaccgtca cggagacacg gtggtttttt attgaatttc cagtgtatcg agccaccgtg 1141 atgcaggtac ggtggtttta tgtaaagtgc cgctatctat aggcgatgtg ttcctgacgg 1201 tgtgtgtttt tttggggata gacaacgtgg ttcttgtacg tggtttttac cctgctcaat 1261 aaagtcacgt tttccttaca ggtgttgtgt c // LOCUS HSE1GB 4283 bp ds-DNA VRL 15-DEC-1989 DEFINITION Equine herpesvirus 1(EHV1) glycoprotein B (gB) gene and 3' end of an overlapping upstream gene with homology to the HSV1 ICP18.5 gene. ACCESSION D00401 KEYWORDS gB gene; glycoprotein; glycoprotein B. SOURCE Equine herpesvirus 1 (isolate HVS 25A) genomic DNA, clones pMAC[209, 221]. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 4283) AUTHORS Whalley,J.M., Robertson,G.R., Scott,N.A., Hudson,G.C., Bell,C.W. and Woodworth,L.M. TITLE Identification and nucleotide sequence of a gene in equine herpesvirus 1 analogous to the herpes simplex virus gene encoding the major envelope glycoprotein gB JOURNAL J. Gen. Virol. 70, 383-394 (1989) STANDARD full staff_entry COMMENT Submitted in computer readable form by Whalley,J.M. on 19-Nov-1988. The EHV1 gB ORF appears to be overlapped at its5' end by 135 nt of the 3' end of an upstream ORF the potential translation product of which has approximately 50% identity with HSV gene ICP 18.5 and VZV gene 30 product. FEATURES from to/span description ORF 951 3893 equivalent to the gB glycoprotein gene of HSV. ORF < 1 1089 analogous ORF to HSV1 ICP18.5. sigp 951 1205 signal peptide. signal 719 723 putative CAT box. signal 802 806 putative TATA box. signal 3902 3907 putative polyA signal. site 823 831 similar sequence to putative HSV1 mRNA start site. BASE COUNT 1090 a 1168 c 1118 g 907 t ORIGIN map position aprox. 0.41-0.44 unit. 1 ctgcagaggc tcacggaccc agacaccagc aacagagagg ccctcaagca gctgctgggt 61 cgcatagggg tggataccga cgacggggcc ggcgagttgg gggacgcctt agacgtggat 121 ttggataatc taggtggggc ccctcctgtc aacagcaccc cctgtggtga ggacgccctc 181 tgtcgaaccg tttccgagga acgcccgtgg gacaaacttt tagagcgggc gactgcggat 241 gcttcgcagc gcaggcgcat gtacgcggag cgtctgtcaa agcgttccat cgccagtttg 301 gggcgctgcg tgcgcgaaca gcgaagagaa ctagaaaaaa ccctgagagt taacgtgtat 361 ggcgaagtgc tgctacatac gtacgtatcg tcctacaacg ggttttgcgc caggcgcggg 421 ttttgcgcgg cggtgagtcg agcgggtacc atcatagata accgctctag cacgtccgcg 481 ttcgactcgc atcagttcat gaaggcggcg ctgcttcgcc accccattga ccagtcgctc 541 atgccgtcca taacacacaa gtttttcgag ctgatcaacg ggcccgtgtt tgacaacgct 601 ggccacaact ttgcgcagcc gccaaacacg gcattatatt acagcgttga aaacgttggg 661 ttgttaccgc atctcaagga ggaactagct cggtttatga ttactgcggc taaaggtgat 721 tggtcaatta gcgagtttca aaggttttat tgctttgagg gagtgacagg tgtgacggcc 781 acgcagcggc tggcgtggaa atatatcggg gagctcatcc tagccgccgc agtattctcc 841 tcggttttcc actgtggaga ggtgcgcctc ctgcgcgcag atcgtaccta cccggactcc 901 agcggcgcac agcgctgcgt gagcggcatt tacataacct acgaggcgtc atgtcctctg 961 gttgccgttc tgtcggcggc tccacatggg gcaattggcg cggagacggt ggtgatttac 1021 gacagcgacg tgttctctct cctgtatgca gtgctccagc agctggctcc tggatcggga 1081 gccaactagg caatgttgga aacttactcg ccacccccca cccgctggga aagccggcat 1141 catcgagggt gggcacaata gttctagcct gtttgttgct ttttggaagc tgtgttgtta 1201 gagccgtacc caccacgcca agccccccaa ctagtactcc cacttccatg tcaacgcact 1261 cccatgggac agtagaccct acgctgctcc ccacagaaac gcccgaccca ctcagactgg 1321 ctgtgcgcga gtccggtata ctcgctgagg atggagactt ttacacctgc ccaccgccta 1381 ccggatccac cgtcgtacgc atcgaaccac ctagaacttg ccccaagttt gaccttggga 1441 gaaacttcac ggaggggatt gctgttattt ttaaggaaaa catcgctccc tacaaattca 1501 gggcaaacgt atactacaag gacatcgttg taacacgtgt gtggaaagga tacagccata 1561 cgtccctgtc cgacagatac aatgacaggg ttccggtttc ggtggaggag atcttcggtc 1621 tcatcgacag taagggaaaa tgttcgtcaa aggccgagta cctcagagat aacatcatgc 1681 accacgcgta ccacgacgac gaggacgagg tggagcttga tttgtgccgt ccaagtttgc 1741 aactccgggg ggccagagcc tggcagacca ccaacgatac tacgtcttac gtggggtgga 1801 tgccatggag gcactacacg tcaacgtctg tcaactgcat cgtcgaggag gtggaggcgc 1861 ggtccgtcta cccctacgac tccttcgccc tgtccaccgg tgatattgtg tacgcgtctc 1921 cgttttacgg cctgagggct gccgctcgca tagagcacaa tagctacgcg caggagcgtt 1981 tcaggcaagt tgaagggtac aggccccgcg acttagacag taaactacaa gccgaagagc 2041 cggttaccaa aaattttatc actaccccgc atgtcaccgt cagctggaac tggaccgaga 2101 agaaagtcga ggcgtgtacg ctgaccaaat ggaaagaggt cgacgaactc gtcagggacg 2161 agttccgcgg gtcctacaga tttactattc gatccatctc gtctacgttt atcagtaaca 2221 ctactcaatt taagttggaa agtgcccccc ttactgaatg tgtatccaaa gaagcaaagg 2281 aagccataga ctcgatatac aaaaagcagt acgagtctac gcacgtcttt agcggtgatg 2341 tggaatatta cctggcacgc ggggggttct taattgcatt cagacctatg ctctccaacg 2401 aactcgccag gctgtacctg aacgagcttg tgagatctaa ccgcacctac gacctaaaaa 2461 atctattgaa ccccaatgca aacaataaca ataacaccac gcgaagacgc aggtctctcc 2521 tgtcagtacc agaacctcag ccaacccaag atggtgtgca tagagaacaa attctacatc 2581 gcttgcacaa acgagcagtg gaggcaacgg caggtaccga ttcttccaac gtcaccgcca 2641 aacagctgga gctcatcaaa accacgtcgt ctatcgagtt tgccatgcta cagtttgcat 2701 acgatcacat ccaatcccac gtcaatgaaa tgctaagtag aatagcaact gcgtggtgta 2761 ccctccaaaa caaagagcgg accctatgga acgaaatggt gaagattaac ccgagcgcca 2821 tagtctccgc aacccttgac gagcgagttg cagcgagggt cctgggggac gtgatagcta 2881 taacgcactg cgccaaaata gagggcaacg tgtacttgca aaactccatg cgctcgatgg 2941 acagtaacac gtgctactcc cgcccccccg taacatttac aattactaag aatgcaaaca 3001 acagagggtc gatagaaggc cagctgggag aggagaacga gattttcacg gagcgcaagc 3061 tgatcgagcc gtgcgccctc aatcagaagc gctactttaa gtttggcaaa gagtacgttt 3121 actacgagaa ctacacgttc gtccgcaaag tgccccccac ggaaatcgag gttatcagca 3181 cgtacgttga actaaacttg acccttttgg aagaccgcga gtttctgccc ctggaggtgt 3241 acacgcgggc tgagctggag gacaccggcc tgctagacta cagcgaaata cagcgccgca 3301 accagctcca cgctctcagg ttttacgaca tcgacagcgt ggtcaacgtg gacaataccg 3361 cagtgattat gcaggggatc gccagctttt tcaagggcct gggtaaagtg ggggaggccg 3421 tgggaacgct cgttctcggc gccgccggcg ctgttgtttc aaccgtatct ggaatagctt 3481 cgtttttaaa caacccattt ggggggctag ccatcggcct gctggtaatc gccggcctgg 3541 tagctgcgtt ttttgcttac agatatgtaa tgcagatccg cagtaacccc atgaaagctc 3601 tataccccat aacaacaaag gccttgaaaa acaaagccaa aacttcctac ggccagaacg 3661 aggaggacga tgggagcgac tttgatgagg ccaagcttga agaggctcgc gaaatgatca 3721 aatacatgtc tatggtttcg gccctggaaa agcaggaaaa gaaagctata aagaaaaaca 3781 gtggggttgg cctgatcgcc agtaacgtct caaagctggc cctgcgaagg cgcggtccca 3841 aatatacccg actccaacag aacgatacca tggaaaatga aaaaatggtt taaacatgtt 3901 taataaatat tatgacacgt actcaaagtg tgacctcata tttgcataac cactttctag 3961 ttccggcccc aaggatattt aagcctagta tctccgccga ggtttcatcc tcattcacca 4021 actcacactt agagttgacg cttcctcttg cgcctttgct ctcgccgctc ctgtgttagc 4081 gtatactgcc caagaaatgg attctccacg cggtatctcc acagctaccg gtgatgccca 4141 cgccgaggcc gcggtttccc cagccgcgaa atccagataa aaacgaagcc cccgatgtag 4201 acggaccaga agccactact gagtgtttag accacaccta cacccaacag acaagcgggg 4261 gtgatggcct agatgctatc gat // LOCUS HSEIEP 8174 bp ds-DNA VRL 15-DEC-1989 DEFINITION Equine herpesvirus type 1 immediate-early protein gene, complete cds. ACCESSION J04366 KEYWORDS immediate-early protein; nonstructural protein; regulatory gene. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 8174) AUTHORS Grundy,F.J., Baumann,R.P. and O'Callaghan,D.J. TITLE DNA sequence and comparative analyses of the equine herpesvirus type 1 immediate early gene JOURNAL Virology 172, 223-236 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by D.J.O'Callaghan, 13-JUN-1989. FEATURES from to/span description pept 988 5451 immediate-early protein signal 334 339 TATA box signal 5735 5740 polyA signal BASE COUNT 1171 a 2937 c 2790 g 1276 t ORIGIN 1 cccggggagg agacgcatgc agatgagatg tgcatcgagg tgtcatggcg tccaggggcg 61 ttcaccttta tgcatatgag aggcgctatt cggcatcccg ttggcgcgac gcgcttccct 121 gggaggagac atacgcaaat tagaaacgac acacgggttc taattggttg gagcgggggg 181 gaggcgaaaa gcgcatgcaa atgcaaagcg cgggaccggg ccccataggc tagagccgct 241 acacgcccac cgcccatcat caacggccaa tcacaatcga tagtgtgggc tggccactcc 301 cactaggggg aaggcaaaac tccctcgtag tagtataaag cacctgttgc ttacccatcg 361 tagcatcgcg gactagagag cctttcagct cactggacca gccagccttc gaggactatc 421 gatcgcatct tggaaagctt acccgctctt ggcactcctt cttcggcttg cggaggtaag 481 agctccccgg ggacacgacc ggcttcgatc tgcttcttct cccggggaga gcgttagaga 541 acggggcgag tgccaaaaag gccatggaac ccctccaaca acgatgtccc gagggggtgg 601 ctccgaggcc cgcttcgacc tagcggtcga agcgcggtgg ggatacttac ctcgaagccg 661 gcgaaggcta taccttcccc gggcagaccc gggcggcttc tgcctcggcg gagctcggcg 721 cggaagcctg gatatctgac ggggcgtggt taccacccaa gcgggggaga ggcccgggcc 781 gcccgcgttc ccttttacca ttcggctccg ctccaactca acatcttttc cgcctctgct 841 tttccagggt agagaagcgg cgcccgtcgt ccgagcgccc gccgcggaac cccgccaccg 901 gccacccgcc aaccttccct tctcggtctt ccgagcgagc cttctcgtgc ggttggttct 961 cgaccccgaa gccggagcta gcacgccatg gccagccagc gcagcgactt cgccccggac 1021 ctctacgact tcatcgagag caacgacttc ggcgaggacc ccctcatccg cgcagccagc 1081 gcggccgaag aggggttcac ccagcccgcc gcgcccgacc tgctgtacgg cagccagaac 1141 atgttcgggg tggacgacgc tccgctctcc accccggtgg tggtcatccc tccgccgtct 1201 ccggctcccg agccccgcgg agggaaggcg aagcggtcgc cctcggccgc cggcagcggc 1261 ggtcctccta ccccggcggc tgccgcccag ccggcgtccc cggcacccag cccggctccg 1321 gggctcgccg cgatgctgaa gatggtccac tcctccgtgg ccccggggaa cggtcgccgg 1381 gccacgggct cctcatcacc cggcggtggg gacgcggccg acccggtcgc cctcgacagc 1441 gataccgaga cctgcccggg gtccccgcag cccgagtttc catcctcggc ctccccgggc 1501 ggagggtccc cggcaccccg ggtccggagc atctccatct catcgtcgtc ctcgtcctcg 1561 tcctcgatgg acgaggacga ccaggcggat ggtgccgggg cgagtagctc ctcttcgtcg 1621 tcctccgacg acagcgacag cgacgaaggc ggcgaggagg agacccctcg cccgcggcac 1681 tcgcagaacg ccgcgaagac cccgtcggcc gccggctctc ccgggccgtc ctccggaggg 1741 gatcgcccgg ccgctggggc cgccaccccg aagagctgcc gctccggcgc cgcttccccc 1801 ggcgcacccg ctccggctcc agcttcggcg cccgctccca gccgcccggg aggaggcctc 1861 ctccctccgg gggctcgcat tttagagtac ctggagggcg tccgcgaggc caatctggcc 1921 aagacgctgg agaggcccga accgcccgcg gggatggctt ctccgccggg ccggagccct 1981 caccggctcc ccaaggacca gcgtccgaaa tcggctctgg cgggagcgtc gaagcgcaag 2041 cgggccaacc ccagacccag accccagacc cagacccagg caccggccga ggaggccccg 2101 cagacggccg tgtgggactt gctggacatg aactcatccc aggctaccgg ggcggcggca 2161 gcagcagcat cggccccggc ggcggcttcg tgcgccccgg gcgtctacca gcgcgagccg 2221 cttctcaccc cgtccgggga cccctggccc gggtcggatc caccaccgat ggggagggtg 2281 cgatacgggg ggaccgggga ctcgcgggac gggctgtggg acgaccccga gatagtcctg 2341 gccgcctcgc gctacgccga ggcgcaggcc ccagtaccgg tcttcgtgcc ggagatgggg 2401 gactccacca agcagtacaa cgctctggtc cgcatggtgt tcgagagccg cgaagccatg 2461 tcctggctgc agaactctaa gctcagcggg caagaccaga acctggcgca gttctgccag 2521 aagttcatcc acgctccgcg cggacacggg tccttcatca ccgggagcgt ggccaacccc 2581 ctgccccaca tcggggacgc catggcggcc gggaacgcgc tctgggccct gccacacgcg 2641 gccgcctcgg tggccatgag ccgccgctac gatcgcactc agaagagctt catcctccag 2701 agcctccggc gcgcctacgc ggacatggcc tacccgagag acgaggcggg gaggccggac 2761 tcactcgccg ccgtggccgg ctgcccggcc caggccgccg ctgccgcggc cagccagcaa 2821 cagcccgagg ccccggcgcc ctcggtccgc gtccgcgaag cgtacacccg ggtctgcgcg 2881 gccctcgggc cccgacgcaa ggctgccgcg gccgcggccg ctccggggac cagggcgccc 2941 aggccgtccg ccttcagact cagggagctc ggggacgcct gcgtgctggc ctgccaggcc 3001 gtcttcgagg ccctcctgcg cctccgcggc ggggcgtccg ccgtccccgg actggacccc 3061 agcgagatcc cctctcccgc ctgccctccc gaggcgctgt gctccaaccc ggccgggctg 3121 gagacggcgg ccctctccct ctacgaactc agggacctgg tcgagcgggc caggctcctc 3181 ggggactctg accctaccca ccgcctgggc tccgacgagc tgcgcctcgc ggtgcgcgcc 3241 gttctggtgg tggcccggac cgtggcgccg ctggtgcgct acaacgccga gggggcccgg 3301 gcccgggcct cggcctggac cgtcacccag gccgtgttca gcatacccag cctggtcggg 3361 gggatgttgg gggaggccgt gtccctgctg gccccaccga ctcggtccca gcagccctca 3421 tcgtcctcgc ccggcggcga gcccttctcc ggctccgcgg ccgcggaggg gagccttcag 3481 accctgccgc ccctgtggcc caccgtcccc gggaagcagt ccgcgacggt cccctcgtcc 3541 cactcccagt ccccccagca ctcccagagc ggcggaggcg ccggggctac gaccgccacc 3601 tgctgccggg ccacccagac aaacgcccgc tcccgggggc agcagcacca gccgcagaag 3661 gcccgctccc ctcaggcggc cgcctccccg gcccacctca gccaggaggc gatgcccggc 3721 tcctcctcgg acgaccgtgc catccacggg cgccccaggg gcaagagcgg caagcggcgc 3781 tccgagcccc tggagccggc ggcccaggcc ggagcctcgg cctccttctc ctcgtccgcc 3841 cgggggtacg atccctcggg gccggtcgac agccctccgg cccccaagcg cagggtggcc 3901 accccgggcc accaggctcc ccgggccctg ggacccatgc cagccgaggg ccccgaccgt 3961 cggggcggat tcaggcgcgt tccccgcgga gactgccaca ctccgcggcc cagcgacgcg 4021 gcttgcgcgg cctactgtcc ccccgagctg gtggcggagc tcatcgacaa ccagctgttc 4081 cccgaggcct ggcgcccggc gctcaccttc gatccccagg ccctggccac catcgcggcc 4141 cgctgcagcg gccccccggc ccgggacggc gcgcgcttag gggagctggc ggccagcggc 4201 ccgctgagac ggagggccgc ctggatgcac cagatccccg accccgagga cgtgaaggtg 4261 gtggtcctct actccccgct ccaggacgag gacctgctgg gcggactccc ggcctcccgc 4321 cccggcggct ctcggcgcga gcccctctgg tccgacctca aggggggact ctcggcgctg 4381 ctggcggccc tggggaaccg catcctcacc aagcggtccc acgcctgggc cggcaactgg 4441 accggggccc cggacgtctc ggccctcaac gcccaggggg tcctgctgct gtcgaccggg 4501 gacctggcct tcaccggctg cgtcgagtac ctctgcctgc gcctgggctc cgccaggcgc 4561 aagctcctgg tgctggacgc ggtctccacc gaggattggc cccaggacgg tcccgcgatc 4621 agccagtacc acatctacat gcgggccgcc ctgactccgc gggtcgcctg cgccgtgcgc 4681 tggcccgggg agcgccacct cagccgcgcg gtcctcacct ccagcaccct cttcgggccc 4741 ggactgttcg cgagggccga ggccgcgttc gcgcgcctgt acccggactc tgcgcccctg 4801 aggctgtgcc gctcctccaa cgtggcctac acggtggaca ctcgcgccgg cgagcgcacc 4861 cgcgttcccc tggctccgag ggagtaccgc cagcgcgtcc tgcccgacta cgacggctgc 4921 aaggacatgc gggcccaggc cgagggcctc gggttccacg acccggactt tgaggagggc 4981 gccgcgcaga gccaccgcgc ggccaaccga tggggactcg gggcctggct gcgccccgtg 5041 tacctcgcct gcggccggcg cggcgctggg gccgtggagc cctcggagct tctgatcccc 5101 gagctgctga gcgagttctg ccgggtggcg ctgctggagc ccgacgccga ggccgagccc 5161 ctggtgctgc ccatcaccga ggctccccgc cgccgagccc cgcgggtcga ctgggagccc 5221 gggttcggct ctcgctccac ctcggtcctg cacatggggg ccacggagct gtgcctgccg 5281 gagcccgacg acgagctcga gatcgacggg gccggcgatg tggagctggt ggttgagcac 5341 cccggcccga gccccggcgt ggcccaggcc ctccgccgcg ctcccatcaa gatcgaggtg 5401 gtgtcggacg acgaggacgg aggagactgg tgcaatccgt acctctcctg aacacgatgg 5461 agcgcctccc tgcggccgaa aacaagaaaa atcagtacat ccacaactat gtgtccgccc 5521 agcacaacgc agactccgcc tagactcccg cctccatccg ctgacgctga accccgcccc 5581 gccctctgct gacgcgaaga caaggccctc cccggacgac atgtgaggaa cgaagggggc 5641 gttgtatcta gcagcccacg ttccttattg ctcacatgtc tgcccaatcg gtgggcactt 5701 ccaggctttc ccctatcgct gagtggttgt ttttaataaa gtttttttta aattttgatt 5761 gaccgcgtgg tctttgttta ctgggcgggt tgatgggcgg gttgatgggc gggttgatgg 5821 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg 5881 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg 5941 gcgggttgat gggcgggttg atgggcgggt tgatgggcgg gttgatgggc gggttgatgg 6001 gcgggttgat gggcgggttg atggttcctg ctcctcccct tcctgctcct ccccttcctg 6061 ctcctcccct tcctgctcct ccccttcctg ctcctcccct tcctgctcct ccccttcctg 6121 ctcctcccct tcctgctcct ccccttcctg ctcctcccct tcctgctcct ccccttcctg 6181 ctcctcccct tcctgctcct ccccttccgc tacgtcacta ccgcctacgt cactaccgga 6241 ctcctcccct tccgcttccg gccacgcccc ttccggtgag ccccagcata gcagtgagcc 6301 ccagcatagc agtgacgtca ctttgacccc cccccttaga ccacgccccc ctattcaaat 6361 gcggggggga gacgcgggct gggggggcca ggctctctct cgggcgcggg cccgtgaccc 6421 ttgaccagat atggcccggg gccaggctct ctctcgggcg cgggcccgtg acccttgacc 6481 agatatggcc cggggccagg ctctctctcg ggcgcgggcc cgtgaccctt gaccagatat 6541 ggcccggggc caggctctct ctcgggcgcg ggcccgtgac ccttgaccag atatggcccg 6601 gggccaggct ctctctcggg cgcgggcccg tgacccttga ccagatatgg cccggggcca 6661 ggctctctct cgggcgcggg cccgtgaccc ttgaccagat atggcccggg gccaggctct 6721 ctctcgggcg cgggcccgtg acccttgacc agatatggcc cggggccagg ctctctctcg 6781 ggcgcgggcc cgtgaccctt gaccagatat ggcccggggc caggctctct ctcgggcgcg 6841 ggcccgtgac ccttgaccag atatggcccg gggccaggct ctctctcggg cgcgggcccg 6901 tgacccttga ccagatatgg cccggggcca ggctctctct cgggcgcggg cccgtgaccc 6961 ttgaccagat atggcccggg gccaggctct ctctcgggcg cgggcccgtg acccttgacc 7021 agatatggcc cggggccagg ctctctctcg ggcgcgggcc cgtgaccctt gaccagatat 7081 ggcccggggc caggctctct ctcgggcgcg ggcccgtgac ccttgaccag atatggcccg 7141 gggccaggct ctctctcggg cgcgggcccg tgacccttga ccagatatgg cccggggcca 7201 ggctctctct cgggcgcggg cccgtgaccc ttgaccagat atggcccggg gccaggctct 7261 ctctcgggcg cgggcccgtg acccttgacc agatatggcc cggggccagg ctctctctcg 7321 ggcgcgggcc cgtgaccctt gaccagatat ggcccggggc caggctctct ctcgggcgcg 7381 ggcccgtgac ccttgaccag atatggcccg gggccaggct ctctctcggg cgcgggcccg 7441 tgacccttga ccagatatgg cccgggtaga gagagactgg gttcagaaga gccagagtgg 7501 gtctgtaaag acaagggagt gggacgcggg tggtgggaag tggctcaaca ccgtggccgg 7561 agatggttgg ggagggggaa aatgggggaa atatagtaaa ctagtttact actggtacta 7621 ttccacggtt atagcatttc taagctggtc cgaggaggag agtagaaagg actcaatgtg 7681 tccatttgtg tgatatatag tctgtgaccc ctagtaacac tactgccatt agtttctccc 7741 cactatatgc tcagcttgtc tataccgcgc tcacactcag gaggttaggt gtgctaatag 7801 gccaatcggg gggggggggg ggtgtggtgg taaatagcgg catcccccct agagcagata 7861 aactggagtt taatagggct agggcagggg gctagggcag ggggctaggg cagggggcta 7921 gggcaggggg ctagggcagg gggctagggc agggggctag ggcagggggc tagggcaggg 7981 ggctagggca gggggctagg gcagggggct agggcagggg gctagggcag ggggctaggg 8041 cagggggcta gggcaggggg ctagggcagg gggctagggc agggggctag ggcagggggc 8101 tagggcaggg ggctagggca gggggctagg gcagggggct agggcagggg gctagggcag 8161 ggggctaggg cagg // LOCUS MLVCASBRE 3335 bp ds-DNA VRL 15-MAR-1989 DEFINITION Murine leukemia virus (Cas-Br-E MuLV), 3' end of proviral genome, encoding pol polyprotein, partial cds, and env polyprotein, complete cds. ACCESSION M14702 KEYWORDS env gene; glycoprotein; pol gene; provirus. SOURCE Murine leukemia virus (isolate pBR-NE-8) proviral DNA, from mouse brain. ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 3335) AUTHORS Rassart,E., Nelbach,L. and Jolicoeur,P. TITLE Cas-Br-E murine leukemia virus: Sequencing of the paralytogenic region of its genome and derivation of specific probes to study its origin and the structure of its recombinant genomes in leukemic tissues JOURNAL J. Virol. 60, 910-919 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 850 pol polyprotein pept 790 2775 env polyprotein matp 2185 2772 Prp15E glycoprotein LTR 2817 3335 3' long terminal repeat rpt 3192 3259 3' LTR R region rpt 2817 2828 inverted terminal repeat copy A rpt 3324 3335 inverted terminal repeat copy B BASE COUNT 851 a 928 c 798 g 758 t ORIGIN 84 bp upstream of HindIII site. 1 gatcgatttc accgaggtaa aacctagatt gtatggctat aagtatcttt tagtttttgt 61 agatactttc tctggctgga tagaagcttt cccaaccaag aaagaaaccg ccaaggtcgt 121 gactaagaaa ctgctagaag agatcttccc taggttcggc atgccgcagg tattgggaac 181 tgacaatggg cctgccttcg tctccaaggt gagtcagaca gtggccgatc tgttggggat 241 tgattggaaa ttacattgtg catacagacc ccaaagctca ggtcaggtag aaagaatgaa 301 taggaccatc aaggagactt taactaaatt aacgcttgca actggctcta gagactgggt 361 cctcctactc cccttagccc tgtaccgagc ccgcaacacg ccgggccccc atggcctcac 421 cccatatgag atcttatatg gggcaccccc gccccttgta aacttccctg accctgacat 481 gaccagagtt actaacagcc cctctctcca agctcactta caggctctct acttagtcca 541 gcacgaagtt tggagaccac tggcggcagc ttaccaagaa caactggacc ggccggtggt 601 gcctcaccct taccgggtcg gcgacaccgt gtgggtccgc cgacatcaaa ccaagaacct 661 agaacctcgc tggaaaggac cttacacagt cctgctgacc acccccaccg ctctcaaagt 721 ggacggcatc tctgcgtggg tacacgccgc tcacgtaaag gcagcaacga cttctccggc 781 cagaacagca tggaaggtcc agcgttctca aaatccccta aagataagac tatcgagaga 841 gccttcctag gggttttggg gatcttattc gtgacaggag ggttagcgag cagagacaac 901 ccccaccagg tatataatat aacttgggaa gtaacaaatg gagaacaaga cactgtgtgg 961 gcagtaaccg gcaaccaccc cttgtggact tggtggccag acctcacacc agacctttgt 1021 atgctggccc tacatggccc aactcattgg ggcctagaca accaccctcc atattcctct 1081 cccccggggc ccccttgttg ttcaggagat gcaggggctg tgtcaggctg tgctagagac 1141 tgtgatgagc ccttgacctc ttactccccc cggtgcaata cagcctggaa tagactgaaa 1201 ctggcccggg taacacatgc acctaaagag ggattttata tctgccctgg gtcacatcgc 1261 cccaggtggg ctcggtcgtg cgggggtcta gacgcctatt attgtgcctc ctgggggtgc 1321 gaaactacag gccgagcagc ctggaaccca acttcatctt gggactatat cacagtaagc 1381 aataatttaa cttcctcaca ggccaccaaa gcctgcaaaa ataatggctg gtgcaacccc 1441 cttgtcatac gattcacggg tccaggaaaa agggccacct cctggactac aggtcatttc 1501 tggggactgc gcctgtacat ctctggacat gacccagggc tcacttttgg gattcggcta 1561 aaagtgacag atctgggacc tagagttcca atagggccaa atcctgtctt gtcagatcag 1621 cgaccgccct cccggcctgt acctgccaga cctcccccac cttcagcctc accttccact 1681 cccaccatac ctccacagca ggggaccggg gacaggttac ttaatctggt ccagggagcc 1741 tacctcacac tcaatatgac tgatcccacc agaacccagg agtgttggtt atgcctagtc 1801 tccgagcctc cgtattatga aggggtggcc gtgttgagag agtacactag tcatgagacg 1861 gcacctgcta actgctcctc cggatcccaa cataagctga ccttatctga ggtaactgga 1921 cagggaagat gtctaggaac ggttcccaaa actcaccagg ctctatgcaa ccgcaccgag 1981 cccaccgtaa gtggttccaa ttacttggtg gctcccgaag gtaccctctg ggcatgcagc 2041 accgggctca ctccctgtct gtctactact gtgctcaact taaccactga ttactgtgtc 2101 ctagttgaac tctggccaaa ggtgacctac cactcccctg actatgtcta tactcagttt 2161 gaaccagggg ccagattccg aagagagccg gtgtcgctga ccctcgccct gctaccagaa 2221 ggtctcacca tgggtggaat tgccgcagga gtagggacag ggacaactgc cctggtcgcc 2281 acccaacagt ttcaacaact tcaggctgct atgcacaacg acctcaagga agttgaaaaa 2341 tcaattacta atctagaaaa gtctctgacc tcgctgtcag aagtggtttt gcagaaccgc 2401 agaggcctag atctactatt tctaaaagag ggaggccttt gcgcggctct aaaagaagag 2461 tgctgctttt atgcagacca cacaggatta gtgagagata gcatggccaa acttagagaa 2521 agactaaacc agagacaaaa attgtttgaa tcaggacaag ggtggtttga aggactgttt 2581 aataggtccc catggttcac aaccctgata tccactatta tgggccctct gatagtactt 2641 ttattaatcc tacttttcgg accctgcatt ctcaatcgat tggtccaatt tgttaaagac 2701 aggatctcag tggtccaggc tctggttttg actcagcaat atcaccagct aaaacccata 2761 gagtacgagc cgtgaataaa ataaaagatt ttatttagtt tccagaaaaa ggggggaatg 2821 aaagacccac catcaggttt agcaagctag cttaagtaac gccatttatt ttgcaaggcc 2881 tggaaaaata ccgaactgag aatagggaag ttcggatcaa ggtcaggaac agaaaaacag 2941 ctgaagttgg gccaaacagg atatctgtgg taagcagttt cggccccggc ccgaggccag 3001 aacagatggt ccccagatat ggcccaatcc tcagcagttt ctagggaccc atcagatgtt 3061 ttcaggctgc cccaaagacc tgaagtgacc ctgtgcctta tttgaactaa ccaatcagct 3121 cgcttctcgc ttcggtttgc gcgcttctgc tccccgagct ctataaaaga gcacacaacc 3181 cctcactcgg cgcgccagtc ctccgataga ctgagtcgcc cgggtacccg tgtatccaat 3241 aaaccctctt gctgttgcat ccgactggtg gtctcgctgt tccttgggag ggtctcctca 3301 gagtgattga ctacccgcct cgggggtctt tcatt // LOCUS PPMCG 5089 bp ds-DNA VRL 15-MAR-1990 DEFINITION Monkey B-lymphotropic papovavirus complete genome. ACCESSION M30540 KEYWORDS complete genome; large T-antigen; small T-antigen; viral protein. SOURCE Monkey C-lymphotropic papovavirus DNA, clone pL02, passed in human B-lymphoblastoid cell line BJA-B. ORGANISM Monkey B-lymphotropic papovavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 5089) AUTHORS Furuno,A., Kanda,T. and Yoshiike,K. TITLE Monkey B-lymphotropic papovavirus genome: The entire DNA sequence and variable regions JOURNAL Jpn J Med Sci Biol 39, 151-161 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 152 721 small T-antigen pept 152 388 large T antigen, exon 1 744 2600 large T antigen, exon 2 pept 3770 2664 (c) VP-1 pept 4362 3649 (c) VP-3 pept 4719 3649 (c) VP-2 rpt 4724 4783 repeat copy A rpt 4784 4843 repeat copy B BASE COUNT 1516 a 1015 c 1054 g 1504 t ORIGIN 695 bp upstream of HindIII site. 1 cccctagcct cctcctcttc tttcaacaaa gagagaggct ttggaggctt ttccaaaaac 61 tcattaggta agctgccctg agatattttc ccatataatt aagtattaag gccacctagg 121 taattaaatt tattccattt tattcacagc catggaccaa acgctgtcta aggaggagag 181 aaatgagctt atggatttat tgcaaataac tagagctgca tggggaaatc tttctatgat 241 gaaaaaagcc tataaaaatg tctccaagct ctaccatcct gataaaggag gagattcagc 301 taaaatgcag cggctcaatg aattatttca aagggtccag gttaccttga tggagataag 361 gagtcaatgt ggatcctctt cttcccaggt agcttggttt ttttgggatg agaattttag 421 aaccctagga gcttttctag gagaaaaatt taatgaaaaa attattggac tctaccctac 481 ttgcactaaa tttgtaagag ctaattgtaa ttgtatagta tgtctgctaa aaaagcagca 541 tgcaggtaca aaaaaaaatt taaaaaagcc atgtttagtc tggggagaat gttggtgcta 601 caaatgttat ttagtatggt ttggctttcc tgaggatttc acctcttttc gctactggac 661 ccttcttatg gcaaatatgg atttatctat gctcaagctt tggacggaac tgggattcta 721 atgtaagtat ttttattttc tagggttact tcagtgagga cttctacttt gggcctacca 781 cctttcaata tagccctatg gatcgagatg cagttcggga ggatcttcca aatccagggg 841 aagggtcttg ggggaaatgg tggagagagt ttgttaatag gcaatgttgt gatgatttgt 901 tttgctcaga aacaatgagt agttcaagtg atgaagacac ccccccagcg gcgcaacctc 961 ctcctcctcc tgccccttcc ccagaagaag aggatgaaat agaatttgta gaagagaccc 1021 caagttcctg tgatggatct tcttctcaaa gctcctacac ctgcaccccc cctaaaagga 1081 agaaaactga agaaaagaag ccagatgatt ttcctgtatg tttatattcc tttttaagtc 1141 atgcaattta tagtaataag actatgaata gttttttaat atatactact ttggagaaag 1201 ccaggcaact gtataaaact gtggaaaaat ctaaaattgt agttgatttt aaggctagtt 1261 tttcttatca ggatgaggaa ggggaggggt gtttgctgtt tttaattact ttaggaaaac 1321 atagagtgtc tgctgttaag catttttgtg tatcccaatg tacttttagt tttattcatt 1381 gtaaagctgt tgttaaacct ctagagttat ataagacctt aagtaaacca ccttttaagt 1441 tgttggaaga gaacaaaccg ggtgtatcca tgtttgagtt ccaagaggag aaggaacagt 1501 ctgttaattg gcaagaaata tgtaactttg caaatgaggc caacatttct gatgtcttat 1561 tgttgcttgg catctacata gattttgcag tggaacctgg caaatgtggc aagtgtgaaa 1621 aaaagcagca caaattccac tataattatc acaaagcaca tcatgccaat gcttgcctct 1681 tcttggagag tagagcccaa aaaaacattt gccaacaagc agttgaccag gtcctagcag 1741 ctaaaaggtt aaaattagta gaatgcagta gaattgaatt attagaagag agatttttgc 1801 agctttttga tgaaatggat gacttcctgc atggtgagat agaaattcta agatggatgg 1861 cgggtgtggc ctggtacacc attttactag ataattcttg ggatgttttt caaaatatcc 1921 tacaattaat aactaccagc caacccaaaa aaaggaatgt cctgataaag ggaccaatta 1981 acagtggtaa aactactttg gcttctgctt tcatgcattt ttttgatggc aaagctctaa 2041 atataaattg tcctgcagat aaactgtcct ttgaacttgg ctgtgctatt gatcaattct 2101 gtgttttgtt agatgatgtg aagggccaaa taaccttaaa taagcacttg caaccaggtc 2161 aaggggtaaa taatcttgat aacctgagag atcatcttga tggaacaatt aaagttaatt 2221 tagaaaagaa acatgtaaac aaaaggagtc aaatttttcc cccggttatt atgactatga 2281 atgagtactt gttgcctcct accataggag ttagatttgc tcttcatctg catttaaaac 2341 ctaaggctta tcttaaacaa agcctggaaa aaagtgacct ggtagccaaa agaatattaa 2401 attcaggata tactattttg ctccttttgt tatggtacaa tcctgtggat tcttttactc 2461 caaaagtgca agaaaaagtg gtgcaatgga aagaaaccct tgaaaaatat gtgtcaatta 2521 ctcagtttgg taatattcag caaaatatca ttgatggaaa agaccccttg catggaattg 2581 taattgaaga acaaatgtaa ataatgtaat catcattttc tgttttattt ctggtacaat 2641 aaagtcttac aatgcattca gcctcacata tcatttgaga cagggagaac agtctggttc 2701 tgacaaaatt tatcaacata tctattaagg tcagggtccc ctgggagtcc ttctgttccc 2761 tcaaatattc tgacttcttc cacttgtcct gagacccctt ccattggttg tccctgaatt 2821 tggggcataa gaccagagaa gaagctattt agaagagagc tgacaggata aggattttta 2881 acaatccttt tcctgagggt cacattgaaa tatctgggaa gccccctcca actttgggtt 2941 tcagaatagt tggtatgaac tccagcaata tcagcacaag acagaaacag tttgtcccct 3001 ttacaaagag gcccaactcc attttcatcc agcagcacag ttgtgacaga attagtgaac 3061 tgcataactg gtggggtggt ggctccccct gtaaaactcc cataatatct agtattttca 3121 tttttagagg ggtcagggca ccacacctcc actgggtact ttccatcttt atccagcaag 3181 gctttggcct ttggatctag gccttggttt cctggtttca tatttttaat agcaactaca 3241 tcatcaggat aggtagctgt agagctagca actaggcctt ggagttccag gggctctcct 3301 ccaacagcaa acatgtgata ggtagtgccc tgcacgggga cacaccctga ggatgaccca 3361 tagatgtact ttcctccctg gtgcaaatta actagtgagg aaattccaac aacttcagtc 3421 tttacagaca ctgcttccca catcaaaatg gtgtcacagg tcatgtcttc atttaggagg 3481 gggagtttaa taacagctac tgaataacaa ggaagggtgc ctttgttggg ggtgtcagag 3541 gccttactga aagcagtatt tatagaatta ctatatccat acaagtcctc agaaggaata 3601 ttatttccca ttctaggatt aagataggcc tcaatttggg taatagcatc aggccctgtt 3661 cttacttcta gcacctctac tcctcctttt actaggagcc tggggacggg agcgggaata 3721 gggcatgttt ttttgcatgc tccgtcttgc ctttttcttt gaggggccat cttcttcttt 3781 ctccaattta ttaagctcca cttcccaggt gggagttata tcaccatata aacctagaat 3841 tagaggaagc atccagtctt gagttactct ttggtgtgcc cctcctggag cagtataatg 3901 ttctacatac tgaccagacc taggttcatc ataacccagt tctcttcttg ccctttgacc 3961 ttccctattt tcttgatatt caaaatcagc tctacttgga ggaggctctc ctctgtttct 4021 atattcttgt cttagctgga tgggatttct agcaggtaga tacctgtaat aatcttggac 4081 actggaataa atatggacag gcccactggt caaagcccac ctagcatttt cagcaatttg 4141 agccaaggtg tgactcagtt cattggtact tctgacagcc acagcccttg tagcttgacc 4201 aatttgcaaa gtggcctgtc tcatcaaatg cctccatact tctctgccca cagcatgaaa 4261 caatgattca ccccagtcaa gtacagcatt caggtagtag ctaaaagagg taaatcccgg 4321 gaacaaataa tcaacttgag gaaaccaagg cacaagagcc atattaacaa ctggtacttc 4381 tttggagtat ccaaaagttg tcactcctgc agcaaccaca gcactggcac ctgaaacagt 4441 ttgaaaaaaa actcctattc ctatggcatt gttgagagct gttgggatag cacttaggag 4501 ggaaaactgc tctgttgtaa gtccagtaag agacaaggcc tctagagtac taagtccagc 4561 aagatccact gcttctattt caatgagcca ggctgcctca gtacttacag cagcaaaagc 4621 ctccccagta aggatagcat caactgtaaa tccagtactt aagcttaatt cagcagcaat 4681 ttcagaaata ttaaacaaaa gagataatac accccccatt tcttaccaaa tggcgggcta 4741 atttaaaaaa ggcgggcttc ttggcggcgc tgatgtaaat gagtaccaaa tggcgggcta 4801 atttaaaaaa ggcgggcttc ttggcggcgc tgatgtaaat gagtaacttc ctctacttga 4861 ggttgctaag taggttgcta agcgccacct agcaactaga ccgcagaaca gttgtttgtc 4921 acttatcagg aaatgtcaca aaaagtcccc gggcggtgcg gtgagcgagt ctaaccacag 4981 cttcctctat cagttgattc tgcaaaaaca acctgttatt gaagtctgca agtctgcaaa 5041 atcactatgg caaccctagt tttttttacc tggtataaga ggccagggg // LOCUS PVYAAA 9704 bp ss-RNA VRL 15-MAR-1990 DEFINITION Potato virus Y (N strain) genomic RNA, complete. ACCESSION D00441 KEYWORDS 38K protein; HC protein; NIa protein; NIb protein; Vpg protein; capsid protein; genome-linked protein; helper component protein; inclusion protein; polymerase; polyprotein; protease. SOURCE Potato virus Y (N strain), 5'end of genomic RNA and cDNA to genomic RNA. ORGANISM Potato virus Y Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Potyvirus. REFERENCE 1 (bases 1 to 9704) AUTHORS Robaglia,C., Durand-Tardif,M., Tronchet,M., Boudazin,G., Astier-Manifacier,S. and Casse-Delbart,F. TITLE Nucleotide sequence of potato virus Y (N strain) Genomic RNA JOURNAL J. Gen. Virol. 70, 935-947 (1989) STANDARD full staff_entry COMMENT Most of the sequence was obtained from a shotgun cloning procedure. The 150 nucleotides at the 5'end were directly sequenced on the virul RNA. FEATURES from to/span description virion 1 9704 genomic RNA pept 185 9376 polyproteins matp 185 1009 putative extreme 5'protein matp 1010 2656 putative helper component protein, HC matp 2657 3655 putative 38K protein matp 3656 5557 cytoplasmic inclusion protein matp 5558 5713 putative genome linked protein, Vpg matp 5714 7009 putative nuclear inclusion protein NIa matp 7010 8572 putative nuclear inclusion protein NIb matp 8573 9376 putative capsid protein BASE COUNT 3004 a 1818 c 2273 g 2609 t ORIGIN putative 5'end of RNA genome. 1 aattaaaaca actcaataca acataagaaa aacaacgcaa aaacactcat aaacgctcat 61 tctcactcaa gcaacttgct aagtttcagt ttaaatcatt tccttgcaat tctctagaac 121 aatattggaa accatttcaa ctcaacaagc aatttcatca cttccaacca atttcagatc 181 ctcaatggca acttacatgt caacaatctg ttttggttcg tttgaatgca agctaccata 241 ctcaccagcc tcttgcgagc atattgtgaa ggaacgagaa gtgccggctt ccgttgatcc 301 tttcgcagat ctggaaacac aacttagtgc acgattgctc aagcaaaaat atgctactgt 361 tcgtgtgctc aaaaacggta cttttacgta ccgatacaag actgatgccc agataatgcg 421 cattcagaag aaactggaga ggaaggatag ggaagaatat cacttccaaa tggccgctcc 481 tagtattgtg tcaaaaatta ctatagctgg cggagatcct ccatcaaagt ctgagccaca 541 agcaccaaga gggatcattc atacaactcc aaggatgcgt aaagtcaaga cacgccccat 601 aataaagttg acagaaggcc agatgaatca cctcattaag cagataaaac agattatgtc 661 ggagaaaaga gggtctgtcc acttaattag taagaaaacc actcatgttc aatataagaa 721 gatacttggt gcatactccg cagcggttcg aactgcacat atgatgggtt tgcgacggag 781 agtggactcc gatgtgatat gtggacagtt ggacttttgc aacgtctcgc tcggacggac 841 aaatggttcc aatcaagtcc gcactatcaa catacgaagg ggtgatagtg gagtcatctt 901 gaacacaaaa agcctcaaag gccactttgg tagaagttca ggaggcttgt tcatagtgcg 961 tggatcacac gaagggaaat tgtatgatgc acgttctaga gttactcaga gtattttaaa 1021 ctcaatgatc cagttttcga atgccgacaa tttttggaag ggtctggacg gtaattgggc 1081 acgaatgaga tatccttcgg atcacacatg tgtagctggt ttacctgtcg aagattgtgg 1141 tagggtagct gcattgatgg cacacagtat ccttccgtgc tataagataa cttgccccac 1201 ctgtgctcaa cagtatgcca gcttgccagt tagcgatctg tttaagctat tgcataaaca 1261 tgcaagagat ggtttgaatc gattgggagc ggataaagac cggtttatac atgttaataa 1321 gttcttgata gcgttagagc atctaactga accggtggac ctgaatctcg agcttttcaa 1381 tgagatattt aaatccatag gggagaaaca gcaagcaccg ttcaagaatt taaatgtctt 1441 aaataatttc ttcctgaaag gaaaagaaaa tacagctcat gaatggcagg tagctcaatt 1501 gagtttgctc gaattagcaa ggttccagaa gaacagaact gataacatca agaaaggtga 1561 tatatctttc ttcagaaata aattatctgc caaggcaaac tggaatctgt atttgtcgtg 1621 cgacaaccag ctggataaaa atgcaaactt cctctgggga caaagggagt atcatgctaa 1681 gcggtttttc tcaaacttct ttgaggaaat tgatccagca aagggatact cagcatatga 1741 aatccgcaag catccaagtg gaacaaggaa gctctcaatt ggtaacttag ttgtcccact 1801 tgatttagct gagtttaggc agaagatgaa aggtgactat aggaaacaac caggggtcag 1861 caaaaagtgc acgagttcga aagatggtaa ttatgtgtat ccctgttgtt gcacaacact 1921 tgatgatggt tcagccattg aatcaacatt ctatccacca actaaaaagc accttgtaat 1981 tggcaatagt ggtgaccaaa aatttgttga tttaccaaaa ggggattcgg agatgttata 2041 cattgccaag cagggttatt gttatattaa cgtgtttctt gcaatgctga ttaacattag 2101 cgaggaggat gcaaaggatt tcacaaagaa agttcgcgac atgtgtgtgc caaagcttgg 2161 aacctggcca actatgatgg atttggcgac cacttgtgct caaatgagaa tattctatcc 2221 tgacgtacat gatgcagaat tgcccagaat attggttgac catgacactc aaacgtgtca 2281 tgtggttgac tcatttggct cgcagacaac tggatatcat attctaaaag catccagcgt 2341 gtctcaactt atcttgtttg caaatgatga attagaatct gatataaaac attatagagt 2401 tggtggtgtt cctaatgcta gccctgaact tgggtccaca atatcacctt tcagagaagg 2461 aggagttata atgtctgagt cggcagcgct gaaactgctt ttgaagggaa tttttagacc 2521 taaggtgatg agacagttgc tgttagatga gccttacctg ttgattctat caatactatc 2581 ccctggcata ctgatggcta tgtataataa tgggattttt gaacttgcgg tgaggttgtg 2641 gattaatgag aaacaatcca tagctatgat agcatcgcta ctatcagctt tagccctacg 2701 agtgtcagcg gcagaaacac tcgtcgcaca gaggattata attgatgctg cagctacaga 2761 cctccttgat gctacgtgtg atgggttcaa cctacatcta acgtacccca ctgcattgat 2821 ggtgttgcaa gttgttaaga atagaaatga atgtgatgat accctattca aggcgggttt 2881 tccaagttac aacacgagcg tcgtacagat tatggaaaaa aattatctaa atctcttgaa 2941 cgatgcttgg aaagatttaa cttggcgaga aaattatccg caacatggta ctcatacaga 3001 gcaaaacgct ctatccactc ggtacataaa acccacagaa aaggcagatt tgaaagggtt 3061 atacaacata tcaccacaag cgttcttggg ccgaagcgcc caggtggtca aaggcactgc 3121 ctcaggattg agcgagcgat ttaataatta tttcaatact aagtgtgtaa atatttcatc 3181 ctttttcatt cgtagaatct ttaggcgttt gccaaccttt gtcacttttg ttaactcatt 3241 attagttatt agtatgttaa ccagcgtagt ggcagtgtgt caggcaataa ttttagatca 3301 gaggaagtat aggagagaaa tcgagttgat gcagatagag aagaatgaga ttgtctgcat 3361 ggagctatat gcaagtttac agcgcaaact tgaacgcgat ttcacatggg atgagtacat 3421 tgagtatttg aagtcagtaa accctcagat agttcagttt gctcaagcgc agatggaaga 3481 atatgatgtg cgacaccagc gttccacacc agttgttaaa aatttggaac aagtggtagc 3541 atttatggct ttagtcatca tggtgtttga tgctgaaagg agtgattgcg tgttcaaaac 3601 tctcaataaa tttaagggtg tcctttcctc actggattat gaagttagac atcagtcctt 3661 agacgatgtg atcaagaatt ttgatgagag gaatgagatt attgattttg aattgagtga 3721 ggacacaatt cgaacttcat cagtgctaga tacaaagttt agtgattggt gggatcgaca 3781 aatccagatg ggacatacac ttccacatta cagaactgag gggcacttca tggaatttac 3841 aagagcaact gctgttcaag tggctaatga cattgcccat agcgaacacc tagacttttt 3901 agtacgggga gctgttgggt ctggaaagtc aactgggttg cctgttcatc ttagtgtggc 3961 cggatctgtg cttttaattg aaccaacgcg accactagcg gagaacgttt tcaaacagct 4021 atctagtgaa ccattcttca agaagccaac actgcgtatg cgtggaaata gtatatttgg 4081 ctcttctcca atctccgtca tgactagcgg atttgcgcta cactacttcg ccaataatcg 4141 ctctcaatta gctcagttca actttgtaat atttgatgag tgtcatgttc tggatccttc 4201 cgcgatggcg ttccgcagtc tgctgagtgt ttatcatcaa gcatgcaaag tattaaaagt 4261 gtcagctact ccagtgggaa gagaggttga attcacaaca cagcagccag tcaagttaat 4321 agtggaggac acagtgtctt tccaatcatt tgttgatgca caaggttcta aaactaatgc 4381 tgatgttgtt cagtttggtt caaacgtact tgtgtacgtg tcgagctaca atgaagttga 4441 caccttggcc aagctcctaa cagacaagaa tatgatggtc acaaaggttg atggcagaac 4501 aatgaagcac ggttgcctag aaattgtcac aaaaggaacc agtgcgagac cacattttgt 4561 tgtagcaacc aacataattg agaatggagt gactttggac atagacgtgg ttgtggactt 4621 tgggttgaaa gtctcaccgt tcttggacat tgacaatagg agcattgctt acaataaggt 4681 gagtgttagc tatggtgaga gaattcaaag gctgggtcgt gttggacgct tcaagaaagg 4741 agtagcattg cgcattggac acactgaaaa gggaattatt gaaattccaa gcatggtcgc 4801 tactgaggcg gctcttgctt gctttgcata taacttgcca gtgatgacag gaggcgtttc 4861 aactagtctg attggcaatt gtactgtgcg ccaggttaaa acaatgcagc aatttgaatt 4921 gagtcccttc tttatccaga atttcgttgc tcatgatgga tcaatgcatc ctgtcataca 4981 tgacattctt aaaaagtata aacttcgaga ttgtatgacg cctttgtgcg atcagtctat 5041 accatacagg gcatcgagta cttggttatc ggttagtgaa tatgagcgac ttggagtggc 5101 cttagaaatt ccaaagcaag tcaaaattgc attccatatc aaagagatcc ctcctaagct 5161 ccacgaaatg ctttgggaaa cggttgtcaa gtacaaagac gtttgcttat ttccaagcat 5221 tcgagcatcg tccatcagca aaatcgcata cacattgcgt acagatctct tcgccatccc 5281 aagaactcta atattggtgg agagattgct tgaagaggag cgagtgaagc agagccaatt 5341 cagaagtctc atcgatgaag ggtgctcaag catgttttca attgttaact taaccaacac 5401 tctcagagct agatatgcaa aagattacac cgcagagaac atacaaaaac ttgagaaggt 5461 gagaagtcaa ctaaaagaat tctcaaattt ggatggttct gcatgtgagg agaatttaat 5521 aaagaggtat gagtcgttgc agttcgttca tcaccaagct gcgacgtcac ttgcaaagga 5581 tctcaagttg aaggggattt ggaacaagtc attagtggct aaagacttga tcatagcagg 5641 cgctgttgca attggtggaa taggactcat atatagttgg ttcacacaat cagttgagac 5701 tgtgtctcat caagggaaaa ataaatccaa aagaatccaa gccttgaagt ttcgccatgc 5761 tcgtgacaaa agggctggct ttgaaattga caacaatgat gacacaatag aggaattctt 5821 cggatctgca tacaggaaaa agggaaaagg taaaggtacc acagttggta tgggtaagtc 5881 aagcaggagg ttcatcaaca tgtatgggtt tgatccaaca gagtactcat tcatccaatt 5941 cgttgatcca ctcactgggc ggcaaataga agaaaatgtc tatgctgaca ttagagatat 6001 tcaagagaga tttagtgaag tgcgaaagaa aatggttgag aatgatgaca ttgaaatgca 6061 agccttgggt agtaacacga ccatacatgc atacttcagg aaagattggt gtgataaagc 6121 tttgaagatt gatttaatgc cacataaccc actcaaagtt tgtgacaaaa caaatggcat 6181 tgccaaattt cctgagagag agctcgaact aaggcagact gggccagctg tagaagtcga 6241 tgtgaaggac ataccagcac aggaggtgga gcatgaagct aaatcgctca tgagaggctt 6301 gagagacttc aacccaattg cccaaacagt ttgtaggctg aaagtatctg ttgaatatgg 6361 ggcatcagag atgtacggtt ttggatttgg agcatacata gtagcgaacc accatttatt 6421 taggagttac aatggttcca tggaggtgca atccatgcac ggtacattca gggtgaagaa 6481 tctacacagt ttgagcgttc tgccaattaa aggtagggac atcatcctca tcaaaatgcc 6541 gaaagatttc cctgtctttc cacagaaatt gcatttccga gctcctacac agaatgaaag 6601 aatttgttta gttggaacca acttccaaga gaagtatgct tcgtcgatca tcacagaaac 6661 aagcactact tacaatatac caggcagcac attctggaag cattggattg aaacagataa 6721 tggacattgt ggactaccag tggtgagcac cgccgatgga tgtatagtcg gaattcacag 6781 tctggcaaac aatgcacaca ccacgaacta ctactcagcc ttcgatgaag attttgaaag 6841 caagtacctc cgaaccaatg agcacaatga atgggtcaag tcttgggttt ataatccaga 6901 cacagtgttg tggggcccgt tgaaacttaa agacagcact cccaaagggt tattcaaaac 6961 aacaaagctt gtgcaagatc taatcgatca tgatgtagtg gtggagcaag ctaagcattc 7021 tgcatggatg tttgaagcct tgacaggaaa tttgcaagct gtcgcaacaa tgaagagcca 7081 attagtaacc aagcatgtag ttaaaggaga gtgtcgacac ttcacagaat ttctgactgt 7141 ggatgcagag gcagaggcag aggcattctt caggcctttg atggatgcgt atgggaaaag 7201 cttgctaaat agagatgcgt acatcaagga cataatgaag tattcaaaac ctatagatgt 7261 tggtgtcgtg gatcggatgc atttgaggaa gccatcaata gggttatcat ctacctgcaa 7321 tgtgcacggc ttcaagaagt gtgcatatgt cactgatgag caagaaattt tcaaagcgct 7381 caacatgaaa gctgcagtcg gagccagtta tgggtgcaaa aagaaagact attttgagca 7441 tttcactgat gcagataagg aagaaatagt catgcaaagc tgtctgcgat tgtataaagg 7501 tttgcttggc atttggaacg gatcattgaa ggcagagctc cggtgtaagg agaagatact 7561 tgcaaataag acgaggacgt tcactgctgc acctctagac actttgctgg gtggtaaagt 7621 gtgtgttgat gacttcaata atcaatttta ttcaaagaat attgaatgct gttggacagt 7681 tgggatgact aagttttatg gtggttggga taaactgctt cggcgtttac ctgagaattg 7741 ggtatactgt gatgctgatg gctcacagtt tgatagttca ctaactccat acctaatcaa 7801 tgctgttctc accatcagaa gcacatacat ggaagactgg gatgtggggt tgcagatgct 7861 gcgcaattta tacactgaga ttgtttacac accaatttca actccagatg gaacaattgt 7921 caagaagttt agaggtaata atagtggtca accttctacc gttgtggata attctctcat 7981 ggttgtcctt gctatgcatt acgctctcat taaggagtgc gttgagtttg aagaaatcga 8041 cagcacgtgt gtattctttg ttaatggtga tgacttattg attgctgtga atccggagaa 8101 agagagcatt ctcgatagaa tgtcacaaca tttctcagat cttggtttga actatgattt 8161 ttcgtcgaga acaagaagga aggaggaatt gtggttcatg tcccatagag gcctgctaat 8221 cgagggtatg tacgtgccaa agcttgaaga agagagaatt gtatccattc tgcaatggga 8281 tagagctgat ctgccagagc acagattaga agcgatttgc gcagctatga tagagtcctg 8341 gggttattct gaactaacac accaaatcag gagattctac tcatggttat tgcaacagca 8401 accttttgca acaatagcgc aggaagggaa ggctccttat atagcaagca tggcactaag 8461 gaaactgtat atggataggg ctgtggatga ggaagagcta agagccttca ctgaaatgat 8521 ggtcgcatta gatgatgagt ttgagcttga ctcttatgaa gtacaccatc aagcaaatga 8581 cacaattgat gcaggaggaa gcaacaagaa agatgcaaaa ccagagcagg gcagcatcca 8641 gccaaacccg aacaaaggaa aggataagga tgttaatgca ggcacatctg ggacacatac 8701 tgtgccgaga atcaaggcta tcacgtccaa aatgagaatg cccacaagca agggagcaac 8761 cgtgccaaac ttagaacatt tgcttgagta tgctccacaa caaattgata tttcaaatac 8821 tcgggcaact caatcacagt ttgatacgtg gtatgaggca gtgcggatgg catacgacat 8881 aggagaaact gagatgccaa ctgtgatgaa tgggcttatg gtttggtgca ttgaaaatgg 8941 aacctcgcca aatgtcaacg gagtttgggt tatgatggat gggaatgaac aagttgagta 9001 cccgttgaaa ccaatcgttg agaatgcaaa accaaccctt aggcaaatca tggcacattt 9061 ctcagatgtt gcagaagcgt atatagaaat gcgcaacaaa aaggaaccat atatgccacg 9121 atatggttta attcgaaatc tgcgggatat gggtttagcg cgttatgcct ttgactttta 9181 tgaggtcaca tcacgaacac cagtgagggc tagggaagcg cacattcaaa tgaaggccgc 9241 agcattgaaa tcagcccaac ctcgactttt cgggttggac ggtggcatca gtacacaaga 9301 ggagaacaca gagaggcaca ccaccgagga tgtctctcca agtatgcata ctctacttgg 9361 agtcaagaac atgtgatgta gtgtctctcc ggacgatata taagtattta catatgcagt 9421 aagtattttg gcttttcctg tactactttt atcataatta ataatcgttt gaatattact 9481 ggcagatagg ggtggtatag cgattccgtc gttgttagtg accttagctg tcggttctgt 9541 attattaagt cttagataaa aagtgccggg ttgttgttgt gtgactgatc tatcgattag 9601 gtgatgctgt gattctgtca tagcagtgac tatgtctgga tttagttact tgggtgatgc 9661 tgtgattctg tcatagcagt gactgtaaac ttcaatcagg agac // LOCUS ROBTRFVP2 2687 bp ss-RNA VRL 15-MAR-1990 DEFINITION Bovine rotavirus mRNA for RNA binding protein VP2. ACCESSION X14057 X14507 KEYWORDS RNA binding protein. SOURCE Bovine rotavirus. ORGANISM Bovine rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 2687; enum. 1 to 2687) AUTHORS Cohen,J. TITLE ; JOURNAL Unpublished (1989) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 2687; enum. 1 to 2687) AUTHORS Kumar,A., Charpilienne,A. and Cohen,J. TITLE Nucleotide sequence of the gene encoding for the RNA binding protein (VP2) of RF bovine rotavirus JOURNAL Nucleic Acids Res. 17, 2126-2126 (1989) STANDARD simple automatic COMMENT *source: strain=RF; Data kindly reviewed (21-APR-1989) by Cohen J. [1] Author address Cohen J., INRA, Station de Virologie et d'Immunologie Moleculaires , INRA, CRJ, Domaine de Vilvert, 78350 Jouy en Josas, France. Submitted (16-JAN-1989) on tape to the EMBL data library FEATURES from to/span description pept 17 2659 VP2 protein (AA 1-880) site 1622 1637 leucine zipper (AA 536-557) site 2009 2074 leucine zipper (AA 665-686) BASE COUNT 997 a 425 c 520 g 745 t ORIGIN 1 ggctattaaa ggttcaatgg cgtacaggaa acgtggagcg cgccgtgagg cgaatataaa 61 taataatgac cgaatgcaag agaaagatga cgagaaacaa gatcaaaaca atagaatgca 121 gttgtctgat aaagtacttt caaagaaaga ggaagtcgta accgacagtc aagaagaaat 181 taaaattgct gatgaagtga agaaatcgac gaaagaagaa tctaaacaat tgcttgaagt 241 tttgaaaaca aaagaagagc accaaaaaga gatacaatat gaaattttgc aaaaaacgat 301 accaacattt gaaccaaaag agtcaatatt gaaaaaattg gaggatatca aaccggaaca 361 agcgaagaag cagactaagc tatttagaat atttgaaccg agacagctac caatttatag 421 agcgaatggt gaaaaagagt tgcgtaacag atggtattgg aagctgaaga aagatacttt 481 accagatgga gattatgatg ttagagaata ctttctaaat ttgtatgatc aggttcttac 541 tgaaatgcca gattatttac tattaaaaga tatggcagtt gaaaataaaa attcgagaga 601 tgccggtaaa gttgttgatt ctgaaacagc aagtatctgt gatgctatat ttcaagatga 661 ggaaacagaa ggtgcagtga gacgattcat tgcggagatg agacagcgcg tacaagctga 721 cagaaacgtt gtcaattacc catcaatatt gcatccaata gattacgctt ttaatgagta 781 ttttttgcaa caccaattag ttgaaccatt gaataatgat ataatattca attacattcc 841 tgaaaggata aggaatgacg ttaactatat acttaatatg gacagaaatc tgccatcaac 901 agctagatat ataagaccta atttactaca agacagactg aatttgcatg acaattttga 961 atccttgtgg gatacaataa caacttcaaa ctatattctg gcaagatcgg tagtaccaga 1021 tttaaaggaa ttagtttcaa ccgaagcgca aattcaaaaa atgtcacaag acttgcaact 1081 agaagcatta acaatacagt cagaaacgca gtttttaaca ggtataaact cacaagcagc 1141 aaatgactgt ttcaaaactc tgattgcagc aatgttaagt caacgaacca tgtcgcttga 1201 tttcgtgact acaaattata tgtcattaat ttcaggcatg tggttactaa ctgtagtgcc 1261 aaatgacatg ttcataaggg aatcattggt tgcatgtcaa ctggctatag tgaatacaat 1321 aatatatcca gcgttcggaa tgcaacgaat gcattataga aacggagacc cacaaagacc 1381 atttcagata gcagaacaac aaatacaaaa ttttcaagta gcgaattggc tgcattttgt 1441 caataacaat caatttagac aagtagttat tgatggtgta ttgaatcagg tgctgaatga 1501 caatattaga aatggacatg tcattaatca attgatggaa gctttaatgc aactatcacg 1561 acaacagttt ccaacaatgc ctgttgatta taagaggtca atccagcgtg gaatattatt 1621 gctatcaaat aggcttggtc aattagttga tttaactagg ttattagctt acaactacga 1681 aacactaatg gcatgtgtta cgatgaatat gcaacatgtt cagactttga caacagaaaa 1741 attacagtta acttcagtca catcgttgtg tatgcttatt ggaaatgcaa ccgttatacc 1801 cagcccgcag acattgtttc actattataa tgttaatgtt aattttcatt caaattataa 1861 tgaaagaatt aatgatgcag tggccataat aactggagct aatagactaa atttatatca 1921 gaaaaagatg aaggcaatag ttgaagattt tttaaaaaga ttacatattt tcgatgtagc 1981 tagagttcca gatgatcaaa tgtatagatt aagggataga ctacgactat tgccagtaga 2041 agtaagacga ttggatattt ttaatttgat actgatgaac atggatcaga tagaacgcgc 2101 atcagataaa attgcgcaag gtgttattat tgcgtaccgc gatatgcaat tggaaagaga 2161 cgaaatgtat ggctacgtga atatagctag aaatttagat gggttccagc aaataaacct 2221 agaagaattg atgagaacag gcgattatgc acaaataact aacatgctct tgaataatca 2281 accagtagcg ctagttggag ctcttccatt tgttacagac tcgtcagtca tatcgttgat 2341 agcgaacgtt gacgctacag tttttgccca aatagttaaa ttacggaaag ttgatacctt 2401 gaaaccaata ttgtataaaa taaattcaga ttcgaatgac ttttacctag ttgccaacta 2461 tgattgggtg cctacttcaa ccacaaaagt atataagcaa gttccacagc aatttgattt 2521 cagaaattcg atgcatatgt taacatcaaa tcttactttc actgtttact ctgatctgct 2581 tgcattcgta tcggccgata cagtagaacc tataaatgca gttgcatttg ataatatgcg 2641 catcatgaac gagttgtaaa cgccaacccc actgtggaga tatgacc // LOCUS VACH3K 4536 bp ds-DNA VRL 15-DEC-1989 DEFINITION Vaccinia virus, HindIII K fragment. ACCESSION D00382 KEYWORDS nonessential gene; serine protease inhibitors. SOURCE Vaccinia virus HindIII K fragment originally from vaccinia virus strain WR, cloned in pBR322, was a gift from R. Wittek, transformed into Escherichia coli strain TG1. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus. REFERENCE 1 (bases 1 to 4536) AUTHORS Boursnell,M.E.G., Foulds,I.J., Campbell,J.I. and Binns,M.M. TITLE Non-essential genes in the vaccinia virus HindIII K fragment: a gene related to serine protease inhibitors and a gene related to the 37K vaccinia virus major envelope antigen JOURNAL J. Gen. Virol. 69, 2995-3003 (1988) STANDARD full staff_entry COMMENT One gene, predicted to encode a 42.2K protein, is highly related to the family of serine protease inhibitors. It shows approximately 25% identity to human antithrombin III and 19% identity to the cowpox virus 38K protein gene which is also related to serine protease inhibitors. The product of another gene shows a similar high level of identity to the 37K vaccinia virus major envelope antigen. The existance of viable deletion mutants and recombinants containing foreign DNA inserted into both these genes indicates that they are non-essential. FEATURES from to/span description ORF 50 < 1 (c) ORF KO, amino end. ORF 1381 272 (c) ORF K1 ORF 1697 1431 (c) ORF K2 ORF 3023 1749 (c) ORF K3 ORF 3185 3051 (c) ORF K4 ORF 3604 3200 (c) ORF K5 ORF 3818 3573 (c) ORF K6 ORF 3957 4406 ORF K7 ORF 4235 4041 (c) ORF8 BASE COUNT 1454 a 789 c 712 g 1581 t ORIGIN 1 bp upstream of HindIII site 1 aagcttttca gctgcttaga cttccaagta ttaattcgtg acagatccat gtctgaaacg 61 agacgctaat tagtgtatat tttttcattt tttataattt tgtcatattg caccagaatt 121 aataatatct ctaatagatc tgattagtag atacatggct atcgcaaaac aacatataca 181 catttaataa aaataatatt tattaagaaa attcagattt cacgtaccca tcaatataaa 241 taaaataatg attccttaca ccgtacccat attaaggaga ttccacctta cccataaaca 301 atataaatcc agtaatatca tgtctgatga tgaacacaaa tggtgtatta aattccagtt 361 tttcaggaga tgatctcgcc gtagctacca taatagtaga tgcctctgct acagttcctt 421 gttcgtcgac atctatcttt gcattctgaa acattttata aatatataat gggtccctag 481 tcatatgttt aaacgacgca ttatctggat taaacatact aggagccatc atttcggcta 541 tcgacttaat atccctctta ttttcgatag aaaatttagg gagtttaaga ttgtacactt 601 tattccctaa ttgaaacgac caatagtcta attttgcagc cgtaatagaa tctgtgaaat 661 gggtcatatt atcacctatt gccaggtaca tactaatatt agcatcctta tacggaaggc 721 gtaccatatc atattcttcg tcatcgattg tgattgtatt tccttgcaat ttagtaacta 781 cgttcatcat gggaaccgtt ttcgtaccgt acttattagt aaaactagca ttgcgtgttt 841 tagtgatatc aaacggatat tgccatatac ctttaaaata tatagtatta atgattgccc 901 atagagtatt attgtcgagc atattagaat ctactacatt agacataccg gatctacgtt 961 ctactataga attaatttta ttaaccgcat ctcgtctaaa gtttaatcta tataggccga 1021 atctatgata ttgttgataa tacgacggtt taatgcacac agtattatct acgaaacttt 1081 gataagttag atcagtgtac gtatatttag atgttttcag cttagctaat cctgatatta 1141 attctgtaaa tgctggaccc agatctcttt ttctcaaatc catagtcttc aataattcta 1201 ttctagtatt acctgatgca ggcaatagcg acataaacat agaaaacgaa taaccaaacg 1261 gtgagaagac aatattatca tcttgaatat ttttatacgc tactataccg gcattggtaa 1321 atccttgcag acgataggta gacactgaac acgttaacga tagtatcaat aacgcaatca 1381 tgattttatg gtattaataa ttaaccttat ttttatgttc ggtataaaaa ttattgatgt 1441 ctacacatcc ttttgtaatt gacatctata tatccttttg tataatcaac tctaatcact 1501 ttaactttta cagttttccc taccagttta tccctatatt caacatatct atccatatgc 1561 atcttaacac tctctgccaa gatagcttca aagtgaggat agtcaaaaag ataaatatat 1621 agagcataat ccttctcgta tactctgccc tttattacat cacccgcatt gggcaacgaa 1681 taacaaaatg caagcatctt gttaacgggc tcgtaaattg ggataaaaat tatgttttta 1741 tatctatttt attcaagaga atattcagga atttcttttt ccggttgtat ctcatcgcag 1801 tatatatcat ttgtacattg tttcatattt tttaatagtc tacacctttt agtaggacta 1861 gtatcgtaca attcatagct gtattttgaa ttccaatcac gcataaaaat atcttccaat 1921 tgttgacgaa gacctaatcc atcatccggt gtaatattaa tagatgctcc acatgtatcc 1981 gtaaagtaat ttcctgtcca atttgaggta cctatatacg ccgttttatc ggttaccata 2041 tatttggcat ggtttaccct agaatacgga atgggaggat cagcatctgg tacaataaat 2101 agctttactt ctatatttat gtttttagat tttagcatag cgatagatct taaaaagttt 2161 ctcatgataa acgaagatcg ttgccagcaa ctaatcaata gcttaactga cacttgtctg 2221 tctatagcgg ctcttcttaa ttcatcttct atataaggcc aaaacaaaat attgcctgcc 2281 ttcgaataaa taatagggat aaagttcata acagatacat aaacgaattt actcgcattt 2341 ctgatacatg acaataaagc ggttaaatca ttggttcttt ccatagtaca tagttgttgc 2401 ggtgcagaag caataaatac agagtgtgga acgccgctta cgttaatact aagaggatga 2461 tctgtattat aatacgacgg ataaaagttt ttccaattat atggtagatt gttaactcca 2521 agataccagt atacctcaaa aatttgagtg agatccgctg ccaagttcct attattgaag 2581 atcgcaatac ccaattcttt gacctgagtt agtgatctcc aatccatgtt agcgcttcct 2641 aaataaatat gtgtattatc agatatccaa aattttgtat gaagaactcc tcctaggata 2701 tttgtaatat ctatgtatcg tacttcaact ccggccattt gtagtctttc aacatccttt 2761 aatggtttgt tagatttatt gacggctact ctaactcgta ctcctctttt gggtaattgt 2821 acaatcttgt ttaatattat cgtgccgaaa ttcgtaccca cttcatccga taaactccaa 2881 taaaaagatg atatatctag tgtttttgtg gtattggata gaatttccct ccacatgtta 2941 aatgtagaca aatatacttt atcaaattgc atacctatag gaatagtctc tgtaatcact 3001 gcgattgtat tatccggatt cattttattt gttaaaagaa taatcctata tcacttcact 3061 ctattaaaaa tccaagtttc tatttctttc atgactgatt ttttaacttc atccgtttcc 3121 ttatgaagat gatgtttggc accttcataa atttttattt ctctattaca atttgcatgt 3181 tgcatgaaat aatatgcacc taaaacatcg ctaatcttat tgtttgttcc ctggagtatg 3241 agagtcgggg ggtgttaatc ttggaaatta tttttctaac cttgttggta gccttcaaga 3301 cctgactagc aaatccagcc ttaatttttt catgattgat taatgggtcg tattggtatt 3361 tataaacttt atccatatct ctagatactg attctggaca tagctttccg actggcgcat 3421 ttagtgtgat ggttcccata agtttggcag ctagcagatt cagttttgaa acagcatctg 3481 cattaactag aggagacatt agaatcattg ctgtaaacaa gtttggatta tcgtaagagg 3541 ctagctccca tggaatgacc caataagtag atttaatagt taccacgtgc tgtaccaaag 3601 tcatcaatca tcattttttc accattactt cttccatgtc caatatgatc atgtgagaat 3661 actaaaattc ctaacgatga tatgttttca gctagttcgt cataacgtcc agaatgttta 3721 ccagctccat gacttatgaa tactaatgcc ttaggatatg taataggttt ccaatatatg 3781 taatcattgt ccagattgaa catacagttt gcactcatga ttcacgttat ataactatca 3841 atattaacag ttcgtttgat gatcatatta tttttatgtt ttattgataa ttgtaaaaac 3901 atacaattaa atcaatatag aggaaggaga cggctactgt cttttgtgag atagtcatgg 3961 cgactaaatt agattatgag gatgctgttt tttactttgt ggatgatgat aaaatatgta 4021 gtcgcgactc catcatcgat ctaatagatg aatatattac gtggagaaat catgttatag 4081 tgtttaacaa agatattacc agttgtggaa gactgtacaa ggaattgatg aagttcgatg 4141 atgtcgctat acggtactat ggtattgata aaattaatga gattgtcgaa gctatgagcg 4201 aaggagacca ctacatcaat tttacaaaag tccatgatca ggaaagttta ttcgctacca 4261 taggaatatg tgctaaaatc actgaacatt ggggatacaa aaagatttca gaatctagat 4321 tccaatcatt gggaaacatt acagatctga tgaccgacga taatataaac atcttgatac 4381 tttttctaga aaaaaaattg aattgatgat ataggggtct tcataacgca taattattac 4441 gttagcattc tatatccgtg ttaaaaaaaa ttatcctatc atgtatttga gagttttata 4501 tgtagcaaac atgatagctg tgatgccaat aagctt //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS FPLTRAC 3227 bp ds-DNA BCT 05-FEB-1990 DEFINITION Plasmid F TraC protein (traC) gene, complete cds and TrbI protein (traI) gene, partial cds. ACCESSION M30936 KEYWORDS TraC protein; TrbI protein. SOURCE plasmid F DNA, clone PKAS2. ORGANISM Plasmid F Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 3227) AUTHORS Schandel,K.A., Maneewannakul,S., Vonder Haar,R.A., Ippen-Ihler,K. and Webster,R.E. TITLE Nucleotide sequence of the F plasmid gene traC and identification of its product JOURNAL Unpublished (1990) STANDARD full staff_review FEATURES from to/span description pept 268 2895 TraC protein (gtg start codon) pept 2892 > 3227 TrbI protein binding 256 261 Shine-Delgarno sequence BASE COUNT 785 a 827 c 900 g 715 t ORIGIN 1 gcatgcggaa atcctattcc ggaagcccgg cggaaaatat ttcccggtgt gacgttgtgc 61 gttgaatgtc aggcgtatca ggaaagacag agaaaacatt atgcataagt cagtcgcaga 121 acatagtgat ttaattccgg atgaacatga gtggatattc agaaaacaga aatctttgta 181 tatgcgccgg gaaatggcgc gttaattaca ggtattccct tcatggctac cgcatttctc 241 gctttatttt tcaactaagg aattcatgtg aataacccac ttgaggccgt cactcaggcg 301 gttaactccc tcgtcacagc actgaaactg cctgacgaat ccgcaaaggc caatgaagtt 361 ctgggcgaaa tgagcttccc gcagttcagc cgtctgctgc cgtaccgtga ttacaaccag 421 gaatccggtc tgttcatgaa tgacaccacg atgggcttta tgctggaagc cattcccatc 481 aatggggcga atgagtccat tgtggaggcc ctcgatcata tgctgcgcac caaactgccg 541 cgcggtattc cgttgtgtat ccatctgatg tccagtcagc tggttggtga caggattgaa 601 tacgggctgc gtgagttctc ctggtctggc gaacaggccg aacggtttaa cgcgattacc 661 cgggcctatt atatgaaagc ggcagcgaca cagtttccgc tgccggaggg aatgaatctg 721 cccctgaccc tgcgccatta ccgggtgttt atctcgtact gttctccctc gaagaaaaaa 781 agccgggccg acattctgga aatggaaaac ctggtgaaaa tcatccgggc gtcgttacag 841 ggggccagta tcaccacaca gacggtggat gcacaggcct ttatcgatat tgtcggggag 901 atgattaacc ataacccgga ttccctgtac ccgaaaagac gtcagctgga cccgtattct 961 gatctgaatt atcagtgtgt ggaggacagt tttgatctga aagttcgggc tgattacctg 1021 acactgggcc tgcgtgagaa cggcaggaac agcacggccc gcatcctgaa tttccatctg 1081 gcccgtaacc cggaaatcgc cttcctgtgg aacatggccg acaactacag caacctgctg 1141 aacccggaac tgtccatctc ctgtccgttc atcctgacgc tgactctggt ggtggaagac 1201 caggtgaaaa cccacagcga agccaacctg aagtacatgg acctggagaa aaagtcgaag 1261 acctcctatg ccaaatggtt tccgtccgtg gagaaagagg cgaaggagtg gggggaactg 1321 cgtcagcggc tgggctccgg tcagtcctct gtcgtgtcct acttcctcaa catcacagcc 1381 ttctgcaagg acaataatga aacggcactg gaagtggagc aggacatcct gaacagcttc 1441 cgtaaaaacg gttttgagct gatttcaccg cgctttaacc acatgcgcaa tttcctgacc 1501 tgtctgccct ttatggccgg gaaagggttg tttaaacagc tgaaagaggc cggagtggta 1561 cagcgcgcag agagctttaa tgtggccaac ctgatgccgt tagtggcgga taaccccctg 1621 acaccggcag gtctgctggc acccacctac cgtaaccagc tggcgtttat cgatattttc 1681 ttccggggga tgaataacac caactataac atggcggtct gtggcacctc cggggccggt 1741 aaaaccgggc tgatacagcc acttatccgc agcgtgctgg actccggagg ctttgccgtg 1801 gtgttcgaca tgggggatgg atacaagtcc ctgtgtgaga acatgggcgg ggtgtatctg 1861 gacggtgaaa ccctgcgttt taacccgttt gcgaacatca ccgatattga ccagtcagcg 1921 gagcgtgtcc gtgaccagtt gtcagtgatg gccagcccca acggtaacct ggatgaagtg 1981 catgaaggtc tgctgttgca ggcggtcaga gcctcctggc tggccaaaga gaacagagca 2041 cgtattgatg acgtggtgga tttcctgaaa aacgccagtg acagcgagca gtatgccgag 2101 tcaccgacta tccgcagccg tctggacgaa atgattgtgc tgcttgacca gtacactgcc 2161 aacggcactt acggccagta ttttaactct gatgagccgt ccctgcggga tgacgcaaaa 2221 atggtggtgc tggagctggg cggactggaa gaccgtccgt cactgctggt tgcggtgatg 2281 ttctccctga ttatctacat cgagaacagg atgtaccgca cgccgcgtaa cctcaagaaa 2341 ctgaacgtta ttgatgaagg ctggcgtctg ctggacttca aaaatcacaa ggtcggtgaa 2401 tttattgaga aaggctaccg tacggcccgc cgtcataccg gtgcctatat caccatcaca 2461 cagaacatcg tcgactttga ctctgataag gcctccagtg ctgcccgcgc ggcatggggt 2521 aactcctcct acaaaattat cctcaaacag agtgcgaagg agttcgcgaa atacaaccaa 2581 ctgtatccgg accagttcct gccactgcag cgcgacatga ttggtaagtt tggtgcggcc 2641 aaagaccagt ggttcagttc cttcctgctg caggtggaaa accattcctc ctggcaccgt 2701 ctgtttgtgg acccgttaag ccgcgccatg tacagctctg acggcccgga ttttgagttt 2761 gtgcagcaga agcgtaaaga ggggctgagt attcatgagg cagtgtggca gctggcgtgg 2821 aagaagtcag ggccggaaat ggcttcgctg gaagcctggc tggaagaaca tgagaaatac 2881 aggagtgtgg catgagttca acgcagaaac ccgccgacgt cacggcagaa cgacgcagcc 2941 actggtggtg gacggtgccg ggatgcctgg caatggtgtt actgaacgca gccgtcagtt 3001 acggcattgt cagactgaac gcaccggtga ctgtcgcctt caacatgaaa cagaccgtgg 3061 atgcgttttt tgacagcgcc agtcagaaac aactgtcaga agcgcagtca aaagcccttt 3121 cagcgcgttt taacacggca ctggaagcca gcctgcaggc gtggcagcag aaacaccacg 3181 cggtcattct ggtgtcgcct gctgtggtac agggggcacc ggatatc //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS DROLARM 7606 bp ss-mRNA INV 19-OCT-1989 DEFINITION D.melanogaster protein tyrosine phosphatase (DLAR) mRNA, complete cds. ACCESSION M27700 KEYWORDS protein tyrosine phosphatase. SOURCE Drosophila melanogaster (strain Oregon R) 4-12 hour embryo, cDNA to mRNA, clones DLAR[23,55]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (sites) AUTHORS Streuli,M., Krueger,N.X., Tsai,A.Y.M. and Saito,H. TITLE A family of receptor-linked protein tyrosine phosphatases in humans and Drosophila JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8698-8702 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 7606; for [1]) AUTHORS Saito,H. JOURNAL Unpublished (1989) Dana-Farber Cancer Inst,44 Binney St.,Boston MA STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Saito, 07-SEP-1989. FEATURES from to/span description pept 118 6207 protein tyrosine phosphatase (DLAR) precursor sigp 118 213 protein tyrosine phosphatase (DLAR) signal peptide matp 214 6204 protein tyrosine phosphatase (DLAR) BASE COUNT 1999 a 1973 c 1926 g 1708 t ORIGIN 1 aaagagaata aaaagagcat ttaaagttat tagtaaccat cgttgttgtt gagttgttgt 61 tgttgttacc aactgtcgac atacccttgc aactgccggc gaaaacatag cgaaataatg 121 ggtctgcaga tgacagcagc ccgtccaatc gcagctctca gcctactagt gttgtccttg 181 ctcacctgga ctcaccccac aatcgtggat gcagcccatc cgccggagat catcaggaag 241 ccgcagaatc agggagtccg agtgggcggc gttgctagct tctattgtgc ggcccgcggt 301 gatccgcctc catcgatagt gtggcgcaaa aatggcaaaa aagtttcggg aacccagtcg 361 cgttacacgg tgctggagca gcccggcggg atttccatac tccggattga gcccgtgcgg 421 gcgggacgcg atgatgcacc atacgagtgt gtggcggaga acggggtggg cgatgccgtt 481 tccgcagatg caactttaac catatatgaa ggcgataaaa cacccgcagg ctttccggtt 541 ataacccagg gtccgggaac tcgcgtcatt gaagtgggtc acacggtcct catgacatgc 601 aaagccatcg gcaatccgac gccaaacatt tactggatta agaatcagac aaaggttgat 661 atgagcaatc cgcgctactc tctcaaggat ggcttcctgc aaatcgaaaa cagtcgcgag 721 gaggatcagg gcaaatacga gtgtgtggct gagaactcaa tgggcacgga gcactcgaag 781 gccaccaact tgtatgtgaa agtccgtcgt gttccgccca ccttttcccg cccaccagag 841 accatcagcg aggtgatgtt gggatctaat ctgaatctat cctgcattgc cgtcggctca 901 cccatgccgc atgtcaagtg gatgaagggc tctgaagatc ttacacccga gaatgagatg 961 ccaatcggac gaaatgtcct gcagctgatc aatatccagg agagcgccaa ctacacttgc 1021 atagcggcct ccactttggg ccaaatcgat tccgtttcgg tggttaaagt gcaatctctg 1081 cccaccgcac ccaccgatgt gcaaatctcc gaggtgaccg ccacttcggt gcgtctggag 1141 tggtcgtaca agggtcccga ggacttgcaa tattacgtga tccagtacaa gccgaagaac 1201 gccaaccagg ccttcagcga gataagcggc atcatcacca tgtactatgt ggtccgtgcc 1261 ctgagtccct acacggagta cgagttctac gtgatagccg tgaacaatat tggacgcgga 1321 ccgccctcgg caccagcgac atgtaccacc ggtgagacaa aaatggaaag tgcaccacgt 1381 aatgtccaag tgcgcacgct gagctcgtcc acgatggtta ttacttggga accaccagag 1441 acgcccaatg gacaagtgac cggctacaag gtgtactaca cgaccaattc gaatcagccg 1501 gaggcgtcgt ggaactccca gatggtcgac aatagcgaac tgaccacagt ctcggacgtg 1561 acgccccacg ccatctacac ggtccgggtt caggcctaca catcgatggg agccggtcca 1621 atgtccacgc cggtccaggt gaaggcccag caaggtgtgc catcgcaacc gagcaatttc 1681 cgggcaaccg atatcggcga gaccgcagtc acactgcaat ggaccaagcc gacgcattcc 1741 agcgagaata tcgtgcacta cgagctctac tggaatgaca catacgccaa tcaggcccat 1801 cacaagcgca tttccaactc ggaggcgtat accctggacg gactctaccc cgataccctc 1861 tactacatct ggctggctgc aaggtcgcaa cgtggcgagg gggccaccac cccgcccatt 1921 ccggtgcgca ccaagcaata tgtaccaggt gctccgcctc gaaatatcac cgccatagcc 1981 accagctcga cgaccatatc cctcagctgg ctgcctccgc ccgtcgagcg atcgaacggc 2041 cggatcatat actataaggt gttcttcgtg gaggtgggtc gcgaagacga cgaggccacc 2101 accatgaccc tcaatatgac cagcattgtg ctggacgagc tgaagcgctg gacagagtac 2161 aagatctggg tgctggccgg cacctccgtc ggggatgggc cgcggtcgca tcccatcatt 2221 ttgcgcaccc aagaggatgt gcccggcgat ccgcaagatg tgaaggccac gcctttgaac 2281 tccacttcga tccatgtcag ctggaagccg cctctcgaaa aggatcgcaa tggcatcatc 2341 cgtgggtatc atatacacgc ccaggagctg cgagatgagg gcaagggctt tttgaacgaa 2401 cccttcaagt ttgatgtggt agacacgctg gagttcaatg tgactggctt gcagccggat 2461 acaaagtact ccattcaggt ggcggcacta actcgtaaag gagatggtga ccggagtgct 2521 gcgattgtgg tgaaaactcc tggcggagta ccagtccgac caacggtgag tctgaagatc 2581 atggagcggg aaccgatcgt gtccatcgaa ctcgaatggg agcgaccggc gcagacctat 2641 ggcgaattgc gtggctatcg acttcgatgg ggcgtcaagg accaagcact gaaggaggag 2701 atgctgtcag gaccgcagat gaccaagaaa cggtttgata acttggaacg cggagttgag 2761 tacgaatttc gtgtggcggg cagcaatcat attggtatcg gacaagagac ggtgaaaata 2821 tttcagacac ccgagggaac acccggtgga ccgccttcta acattaccat tcgcttccaa 2881 actccggatg tactgtgcgt gacctgggat ccaccaacta gggagcaccg gaatggcata 2941 atcacccgct atgatgttca gtttcacaag aaaatcgatc atggcctggg atccgagcga 3001 aatatgactc tccggaaggc ggtgttcaca aatctggagg agaacaccga gtatatcttc 3061 cgggtgaggg cttatacgaa gcagggagct ggtcccttca gcgacaagtt aatcgtggag 3121 acagaacgtg acatgggtcg agcacctatg tccctgcagg cagaggcaac atcggagcaa 3181 actgcggaga tctggtggga accggtaaca agtcgtggca agttgctggg ctacaagatc 3241 ttttacacca tgacagctgt cgaggatctg gacgattggc aaacgaaaac cgttggactt 3301 acggaatccg ctgatcttgt taatctcgag aagtttgccc aatatgccgt ggccattgcg 3361 gcgaggttca agaacggatt gggacgtctt agtgaaaagg ttacagtacg catcaagccg 3421 gaggatgtgc ccttaaatct tcgcgctcac gatgtcagca cccattcgat gaccttgagt 3481 tggtcgccac ccattcgcct aaccccggtc aactacaaga tcagcttcga tgccatgaag 3541 gtgtttgtgg actcgcaggg attctcccag acccagatcg ttcccaagcg agagattatc 3601 cttaagcact atgtgaagac ccacactatc aacgaactca gtccgtttac cacgtacaat 3661 gtgaatgtga gtgccattcc ctcggattat tcctaccggc cgcccacaaa gattacggtc 3721 acaacgcaaa tggctgcacc tcagccaatg gtgaagccgg atttctacgg cgttgttaat 3781 ggcgaggaaa ttctggtgat actgcctcag gcttcggagg aatatggacc catatcgcac 3841 tattatttgg tggtggtccc ggaggacaag tccaatctgc acaagatacc cgatcagttc 3901 cttaccgatg atctcttgcc gggcaggaac aagccagagc gtccgaatgc accgtacatt 3961 gcagccaagt tcccgcagcg ttccattccg ttcacattcc acctgggatc tggcgatgat 4021 tatcataact ttacaaatcg caaattggag cgagagaagc gctaccgcat ctttgtgcga 4081 gcggtggtgg atacgccaca gaagcacctc tacacctcca gtcccttctc tgagttccta 4141 tcgctggaca tgagggaagc tccgccaggt gagcggcccc accgacccga tcccaattgg 4201 cccgcggagc cggaagtgtc ggtgaaccgc aacaaggacg aaccggagat tctgtgggtg 4261 gtgctgcccc tgatggtgtc cacattcatt gtgtccaccg ccctgatcgt tctctgtgtg 4321 gtaaagcgtc gtcgccagcc gtgcaagact ccggatcagg cagctgtcac aaggccactg 4381 atggccgccg acctgggagc cggacctacg cccagcgatc cagtggacat gaggcgcttg 4441 aacttccaga cacccggcat gatctcccat ccgcccatac cgatatccga gtttgccaac 4501 cacatcgaac gactcaagtc caatgacaat cagaagtttt cgcaggaata cgaaagcatt 4561 gagccgggcc aacagttcac ctgggacaac tccaatctgg agcataacaa gtctaagaat 4621 cgctatgcaa atgttaccgc ctacgatcat tcacgcgtcc agttgccagc ggtggagggt 4681 gtggttggat cagattacat caatgccaat tactgtgacg gctatcggaa gcacaatgcc 4741 tacgtggcga cccaaggtcc gttgcaggag acctttgtgg acttctggcg catgtgttgg 4801 gaactgaaga cggccaccat tgtgatgatg acgcgattgg aggaacgaac gcgcataaag 4861 tgcgatcagt attggcccac tcgcggaacg gaaacctatg gtcagatctt tgtgaccatc 4921 acggagacac aggaactggc cacctacagc atccgcacct tccagttgtg ccggcagggc 4981 tttaacgatc ggcgtgagat caagcagctg cagttcacag cctggccaga tcatggagtg 5041 cccgatcatc cggctccctt ccttcagttc ttgcgccggt gtcgcgccct cacgccaccg 5101 gaatccggac ccgtgattgt tcactgctct gcgggagttg gtcgcactgg ctgttatatc 5161 gtaatcgatt caatgttgga acgaatgaaa cacgagaaga ttatcgacat ctatgggcat 5221 gttacttgtt tacgggcgca acggaactac atggtgcaga cggaggatca gtacatcttc 5281 attcatgacg ccatcctgga ggccatcatc tgtggggtga cggaggtgcc ggctcgcaat 5341 ctacacaccc acctacagaa actattgatc acggagcccg gcgagaccat ctcgggcatg 5401 gaggtggagt tcaagaagct gtctaacgtc aagatggact cgtccaagtt cgtaacggcc 5461 aatctgccgt gcaacaagca caagaatcgc ctggtccaca ttctgccgta cgagtcaagt 5521 cgcgtctacc tgacccccat ccatggaatc gagggaagcg actatgtcaa cgccagcttc 5581 atcgacggct atcgttaccg ttccgcatac atcgccgcac agggtcctgt ccaggatgcc 5641 gctgaggact tttggcgcat gctctgggag cacaactcca ccattgtggt catgctgacc 5701 aagctcaagg aaatgggaag ggaaaagtgc ttccagtact ggccccatga gcgatccgta 5761 cgctatcagt attatgtcgt ggatcccatt gctgagtaca acatgccgca gtataagctg 5821 cgtgaattta aggtcacgga tgcccgagat ggctcatcgc gcaccgtccg ccagttccag 5881 ttcatcgatt ggccggagca gggtgtgccc aagtcgggcg agggcttcat cgacttcatc 5941 ggacaggtgc acaagaccaa ggagcagttt ggccaggatg gacccattac cgtgcactgt 6001 tcggcgggcg tgggacgttc gggtgtcttt atcactctga gcatcgttct ggaacgaatg 6061 cagtacgagg gagtactgga cgtcttccag acagtgcgca tactgcgatc ccagcgtccg 6121 gctatggtac aaaccgagga tcaataccac ttctgctatc gcgctgcact ggagtacttg 6181 ggctcattcg acaattatac aaactgagtg catttctcaa tgggattgcc aggtccacag 6241 agctaaaatc tcatcgaccc tgcgttcgtt gcatacttat ttcggacttt acgcataata 6301 cgcatttacg atatacacgg atatcataac tctaaagtat tatagccagc tactgcccac 6361 acttacttaa tacacctata cttatatacg tggatatgtt tagttgataa gcgcaccccc 6421 gcccgcaggt tatgttacca ttacgattct ttgggggact cgatcttata tattgcgttc 6481 tattttattg ataatgtaaa ctaaagaaga ctgttacacg tttaattatc aaaaccttat 6541 atatcaaccc acatgcactt ctatgtacga gtatgtgcat caagatttgc tcgctaatgg 6601 agaaggatgt caatcaactt gttgttgttc gtggtactgc cgccggagtg accatgaatg 6661 aggggcgcca ctggaaaccg atacatattt gaactaagga cttggtagct attagtctta 6721 ccgcctaaat cgaatctcga acggaagcaa agggtcgaaa gtagaagcca ttttaagtaa 6781 atcgaaaact tctggcgatt tggcgcaaac tccagacgct attataatat ataatacaat 6841 caatcaaaca ctgctatgtt aatcagtttt catgccttgg ttaatggaaa ccagtataaa 6901 cggtccgtgc aattagtcaa aagtcttgca tagcgcatat tttataaggc agctagattc 6961 taaaattagt tttcaaacac aacttacgtt ttgatctccc cgcaaaagat gacgaacctt 7021 atggactgat cgatgtcttg gtagctaaca gacagttcaa atttcaaaga ccattatgat 7081 agatcacata cgcaacttcc agccaagcag ctattttaac ttcaacctca ttccctcgac 7141 ttctattcag ttccctagta cattcctgta tataccacag tcgatacccg agctagccac 7201 ttatgtatgt gtaatttaga ttgaaaagct atgaaaagtt attgtaccta aggtgtatga 7261 agacatagag aataaccgag cgaacagatt agaatgataa ctgtagatat gtactgtagc 7321 atataatttt taccataaaa tagagcgaca cacgcggtta taaataaaca taaacataaa 7381 atcgagacat tggcaatagt tcttaaggta cataataatg tatataactg ataagctgtt 7441 ttgcaaaacg tttgaattcg gcctcgtaac tatacataaa tatataaaga gttagcaagc 7501 aaatgcttaa aagcaaacac taaattatta gctgaaacac atgttaaaca accatagtca 7561 tatgtaaact aaatatttat acgaaataaa ggtatttgtt ataact // LOCUS DROPTPM 4477 bp ss-mRNA INV 19-OCT-1989 DEFINITION D.melanogaster protein tyrosine phosphatase (DPTP) mRNA, complete cds. ACCESSION M27699 KEYWORDS protein tyrosine phosphatase. SOURCE Drosophila melanogaster (strain Oregon R) 4-12 hour embryo, cDNA to mRNA, clone DPTP12. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (sites) AUTHORS Streuli,M., Krueger,N.X., Tsai,A.Y.M. and Saito,H. TITLE A family of receptor-linked protein tyrosine phosphatases in humans and Drosophila JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8698-8702 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 4477; for [1]) AUTHORS Saito,H. JOURNAL Unpublished (1989) Dana-Farber Cancer Inst,44 Binney St.,Boston MA STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Saito, 07-SEP-1989. FEATURES from to/span description pept 55 4443 protein tyrosine phosphatase (DPTP) precursor sigp 55 123 protein tyrosine phosphatase (DPTP) signal peptide matp 124 4440 protein tyrosine phosphatase (DPTP) BASE COUNT 1456 a 996 c 970 g 1054 t 1 others ORIGIN 1 atataatttt tgacatatgt atttatccgg cncggagtga gcaattagca agccatggcg 61 ttactctacc gacgcatgtc gatgctatta aatataattc ttgcatatat tttcctttgt 121 gcgatatgtg tgcaaggatc tgtgaaacaa gagtgggcag aaataggtaa aaacgtatct 181 ctagagtgcg catctgaaaa cgaagcagta gcatggaaac tgggcaacca aactattaat 241 aaaaaccata caagatataa aataagaact gagccattga aatccaacga cgatggcagt 301 gaaaacaacg atagtcaaga cttcatgaag tataagaatg tcctgacgct tctcgatgtc 361 aatataaatg actcggggaa ctacacctgc acagcccaaa cggggcaaaa ccattcgact 421 gaattccaag ttaagccata ccttccatcc aaagtcctgc aaagtacccc cgacaggatc 481 aagagaaaaa tcaagcagga tgtcatgttg tactgtttga tcgagatgta cccacaaaat 541 gagacaacta atagaaacct caagtggctt aaggacggca gccagtttga attcctggac 601 accttctcgt ccatttcgaa gctaaacgat acgcacttaa acttcaccct ggaattcaca 661 gaagtttaca agaaagagaa tggcacctac aagtgcaccg tcttcgatga caccggactc 721 gaaattacct ccaaagagat tactcttttc gtaatggaag tgccacaagt tagcattgat 781 ttcgccaagg cagttggtgc taataaaata tacctaaatt ggaccgttaa cgacggcaac 841 gatccaattc agaagttctt catcactctg caggaggctg gaacaccgac ttttacttat 901 cataaggact ttattaacgg cagccataca tcgtatattc tggaccattt caagccgaac 961 acaacctatt ttttaagaat cgtgggaaag aactcgattg gcaatggcca acccacccag 1021 tatccccagg gaatcaccac gcttagttat gatcccatat ttataccaaa agtcgagacc 1081 accggcagca cggcgtccac gataacgatt ggctggaatc caccgccgcc ggatctcatt 1141 gattatatac aatactacga actgattgtc tccgaatcgg gcgaggtacc caaagtgatc 1201 gaagaggcca tttaccagca gaattctcga aacttgccat acatgtttga taagcttaag 1261 accgccacag actacgaatt tagggtaagg gcatgtagtg atctaaccaa gacttgtgga 1321 ccatggtccg aaaacgtgaa cggcacaacg atggacggcg tggcaacaaa acccaccaac 1381 ttgagcatac agtgtcatca tgacaacgtc acgagaggca actccatcgc cattaactgg 1441 gacgttccca agacgccaaa cggcaaggtt gtgtcatatt taattcactt gctgggcaat 1501 cccatgagca cagtggatag agagatgtgg ggaccaaaga ttcgaagaat cgatgaaccc 1561 catcacaaga ccctctacga aagtgttagc ccgaacacaa actacacggt gacggtgtct 1621 gccataacgc ggcacaagaa gaacggcgaa ccggccaccg gaagttgtct aatgcccgtc 1681 tccacgccgg atgccattgg ccgtaccatg tggtcaaagg tgaacctgga ctccaagtac 1741 gttctcaagt tgtatctacc gaagatcagc gaacgcaatg ggcccatatg ctgctataga 1801 ttgtatctag ttagaattaa caatgacaac aaggaattgc cggacccgga gaagctaaac 1861 atcgccacat atcaggaggt tcacagcgat aacgtgacta gaagtagtgc atatatagcg 1921 gaaatgataa gtagcaagta ctttaggccg gaaatattct tgggcgctga aaagagattt 1981 agtgagaaca acgatataat ccgcggcaac gacgaaattt gtcggaaatg cttggagggt 2041 actccatttc tgaggaaacc cgagatcatt cacatacccc cacaaggttc actatcaaat 2101 tccgattctg aactgcccat tttgtcggag aaggacaact tgatcaaagg agcaaactta 2161 acagagcatg ctctgaaaat cttagaaagt aagttaaggg ataaaagaaa cgcggtgacc 2221 agcgatgaga atccaattct aagcgccgtc aacccaaatg tgccactcca cgattctagt 2281 cgagatgtct tcgatggtga gatagatatt aactccaact acaccggatt cctagagata 2341 atagttcggg atcgaaacaa tgccctgatg gcttatagca aatactttga tataattact 2401 ccggcgactg aagctgaacc catccaatcc ttgaacaata tggactacta cctaagcatt 2461 ggggtcaagg ctggagccgt attacttggt gtcatacttg tatttattgt gctgtgggtt 2521 ttccatcaca agaaaaccaa gaatgaactg cagggcgaag acactttaac actaagagat 2581 tctttgagca gggcattgtt cggtcgacga aatcacaacc acagtcattt tataacttcc 2641 ggaaatcaca aaggattcga cgctggcccc attcacagat tagatttaga aaacgcctac 2701 aagaaccgac ataaggacac tgactacgga tttctgcggg aatatgagat gctgccaaat 2761 cgctttagcg atcggacaac taaaaacagt gatttaaagg agaacgcctg caagaacagg 2821 tatcccgata taaaggctta cgatcagacg cgcgtaaagt tggctgtcat caatggcctg 2881 caaactacgg actacatcaa tgctaacttc gtaattggct acaaggagag aaagaagttt 2941 atctgtgcac agggtccaat ggaaagtacc atcgatgatt tttggcgtat gatttgggaa 3001 caacatctag aaataattgt gatacttaca aatttagagg aatataacaa ggccaagtgc 3061 gcgaaatatt ggccagaaaa agtatttgat acgaaacagt tcggagatat tttagtgaaa 3121 tttgcacaag aacgtaagac tggtgattat attgaacgta ccctgaacgt ttccaagaac 3181 aaagccaatg tcggcgagga ggaggaccgt agacaaatca cccaatacca ctacctaacg 3241 tggaaggact tcatggcacc agagcatcca catggcatca tcaaattcat acgtcaaatc 3301 aattccgtct actccctgca aaggggtcca attttagtgc attgcagtgc tggtgtgggt 3361 agaaccggaa ccctggtggc tttggattcc ctaatccaac agctggagga agaagactcg 3421 gtgtccattt acaacacagt gtgtgatcta cgacaccaac gaaatttttt agtccaatct 3481 ctgaaacaat acatctttct ttaccgggct ctattagata ctggaacctt tggaaacacg 3541 gatatttgca ttgatacaat ggcttctgca attgaatctc ttaagcggaa acccaacgag 3601 ggtaaatgca aattggaaat ggaattcgag aaactactgg ccacagcaga tgagataagt 3661 aaatcatgta gtgtgggcga aaacgaggag aacaatatga agaacagaag tcaagagatt 3721 ataccctacg accgcaacag agttatactg acaccacttc cgatgcggga aaactcaaca 3781 tacattaacg catcattcat agagggctat gataatagcg aaaccttcat cattgcccag 3841 gatccatttg aaaacactat aggagacttc tggcgaatga tctcggaaca gagtgttacc 3901 accctcgtca tgatatccga aatcggagat ggtcccagga aatgcccgcg atactgggca 3961 gatgatgagg ttcaatacga ccacatactc gtgaaatacg tgcacagcga aagttgtcca 4021 tactatactc gccgcgaatt ttatgttacg aattgcaaaa tagacgatac gctgaaagtc 4081 acacaatttc aatacaatgg ttggcccacc gtggacggag aagttcctga agtctgccgt 4141 ggcattatag aacttgtaga tcaagcatac aaccattata agaacaacaa gaattctggt 4201 tgtcgatctc cactcacagt tcattgcagt ctgggaaccg atcgaagttc cattttcgtt 4261 gccatgtgta ttttggtcca gcatctcagg ctggagaagt gtgtcgacat ctgtgccaca 4321 acaaggaaat tacgatctca gcgaacggga cttatcaact catacgcaca atacgagttc 4381 ctacatcgcg caataattaa ttattcagac ttacatcaca tagccgagtc aacattggat 4441 taagttagat atgtatatac ttatacgagt accgaat // LOCUS MUSPPC1 643 bp ds-DNA ROD 29-AUG-1989 DEFINITION Mouse secretory granule proteoglycan peptide core gene, exon 1. ACCESSION M27391 J05044 KEYWORDS proteoglycan peptide core. SEGMENT 1 of 3 SOURCE M.musculus (strain BALB/c) liver DNA, clone lambda-MG-PG1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 643) AUTHORS Avraham,S., Austen,K.F., Nicodemus,C.F., Gartner,M.C. and Stevens,R.L. TITLE Cloning and characterization of the mouse gene that encodes the peptide core of secretory granule proteoglycans, and expression of this gene in transfected rat-1 fibroblasts JOURNAL J. Biol. Chem. 264, 16719-16726 (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by R.L. Stevens, 02-AUG-1989. FEATURES from to/span description pept 546 + 618 secretory granule proteoglycan peptide core pre-msg 505 > 643 peptide core mRNA and introns IVS 619 > 643 peptide core intron A BASE COUNT 141 a 134 c 180 g 188 t ORIGIN 1 aattctagca gactctggac gttaacggag accgctcatc ctgggggctg agaacccagc 61 tcggctcgga atgttccctg cttgtgcctg actctgtgcg cgcccagctt ctctttgatg 121 tgcgctgtgg atgagccgag ctcagttctg gaacagctga gtcctcctgt ctgtttagat 181 tgttacctga aggaagggag ggggaagaaa gtgctgattc gactttttga tggggaaaac 241 ttttttttta aacatgcaaa tgacagatgg cagagctttt tggaaaaaga aaaaataata 301 accacacagc aaacgcctag ggggagtccg gtggagtttc atcatgggta tgaacagttg 361 ttgttttttt caactttctt cttctttctg ggtgttgatg tggatctctt tctatttgtt 421 caggaaactg tgacgtgtgt tcttgggcag ggtctgaggt tttggaacct ctttctaaaa 481 gggacagaaa gagcaccctg ctacatttgc taatccagag gctgagtgga gccgagctgg 541 tcaggatgca ggttcccgtc ggcagcaggc ttgtcctggc tctcgccttc gtcctggttt 601 ggggatcttc agtgcaaggt aagagaccca ggatctttaa ttc // LOCUS MUSPPC2 183 bp ds-DNA ROD 29-AUG-1989 DEFINITION Mouse secretory granule proteoglycan peptide core gene, exon 2. ACCESSION M27392 J05044 KEYWORDS proteoglycan peptide core. SEGMENT 2 of 3 SOURCE M.musculus (strain BALB/c) liver DNA, clone lambda-MG-PG1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 183) AUTHORS Avraham,S., Austen,K.F., Nicodemus,C.F., Gartner,M.C. and Stevens,R.L. TITLE Cloning and characterization of the mouse gene that encodes the peptide core of secretory granule proteoglycans, and expression of this gene in transfected rat-1 fibroblasts JOURNAL J. Biol. Chem. 264, 16719-16726 (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by R.L. Stevens, 02-AUG-1989. FEATURES from to/span description pept + 19 + 163 secretory granule proteoglycan peptide core pre-msg < 1 > 183 peptide core mRNA and introns IVS < 1 18 peptide core intron A IVS 164 > 183 peptide core intron B BASE COUNT 46 a 44 c 44 g 49 t ORIGIN 1 ggttccttgt tcgcacaggt tatcctgctc ggagagccag gtaccagtgg gtccgctgca 61 aaccgaatgg cttttttgcg aactgcatcg aggagaaggg accacagttt gacctaatag 121 atgaatccaa taacatcggc cctcccatga ataatcctgt tttgtaagta gactttcatc 181 gat // LOCUS MUSPPC3 681 bp ds-DNA ROD 29-AUG-1989 DEFINITION Mouse secretory granule proteoglycan peptide core gene, exon 3. ACCESSION M27393 J05044 KEYWORDS proteoglycan peptide core. SEGMENT 3 of 3 SOURCE M.musculus (strain BALB/c) liver DNA, clone lambda-MG-PG1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 681) AUTHORS Avraham,S., Austen,K.F., Nicodemus,C.F., Gartner,M.C. and Stevens,R.L. TITLE Cloning and characterization of the mouse gene that encodes the peptide core of secretory granule proteoglycans, and expression of this gene in transfected rat-1 fibroblasts JOURNAL J. Biol. Chem. 264, 16719-16726 (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by R.L. Stevens, 02-AUG-1989. FEATURES from to/span description pept + 20 260 secretory granule proteoglycan peptide core pre-msg < 1 > 678 peptide core mRNA and introns IVS < 1 19 peptide core intron B BASE COUNT 208 a 135 c 123 g 215 t ORIGIN 1 ttttttcttt gtattttagg atggaaggac cctcaaaaga tttcatctcc aattatgatg 61 actatgggtc aggttcgggc tccggctctg gctccggctc tggctcgggt tccggctccg 121 gaagtggctt cctaggtgac atggaatggg aataccagcc aacagatgaa agcaatattg 181 tctatttcaa ctataagcct tttgacagga ttctcactga gcaaaaccaa gaccaaccag 241 aagacgattt tattatatga atgtgacggt ctctgtctcc ccacctccat gtggaacaat 301 gtattcagta tacttagtgt accacgttta aatgaccagt ctcaggataa agagttttac 361 agaaaattta aaatgcctgg aaaagactct tgaatcctgt tacccctttc ctcattaact 421 cgtaaggaat tatgctttaa tgctgttacc tatcttgttg ttctggaaaa tgcctgcatt 481 tatgtgtatt gaatcaacat ttaagaaatt aacacacacc cccattatta tacaataact 541 ttcaaagcca tactggtttt gaaaatttta atttgatagc aagttgatga acaatctttc 601 atacctaaag tgttcaggaa cccaactcgc attgtgaatt acaaatatat tcctttatgt 661 gattaaaaag aaaataaagt g //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS CE3LYS 392 bp ds-DNA BCT 15-JUN-1989 DEFINITION Plasmid ColE3 lysis protein gene, complete cds. ACCESSION X03631 KEYWORDS inverted repeat; lysis protein. SOURCE Plasmid Colicin E3 (from E.coli) DNA. ORGANISM Plasmid Colicin E3 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 392; enum. 1 to 392) AUTHORS Toba,M., Masaki,H. and Ohta,T. TITLE Primary structures of the CoIE2-P9 and CoIE3-CA38 lysis genes JOURNAL J. Biochem. 99, 591-596 (1986) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description INVREP 233 247 imp. inverted repeat A INVREP 249 263 imp. inverted repeat A' SITE 264 269 T-stretch OPR 233 269 pot. transcription terminator FEATURES from to/span description pept 65 208 lysis protein (aa 1-47) BASE COUNT 110 a 78 c 93 g 111 t ORIGIN 1 tgaaatatga atgccggttg tttatggatg aatggctggc attctttcac aacaaggagt 61 cgttatgaaa aaaataacag ggattatttt attgcttctt gcagtcatta ttctgtctgc 121 atgtcaggca aactatatcc gggatgttca gggcgggacc gtatctccgt catcaacagc 181 tgaagtgacc ggattagcaa cgcagtaacc cgaaatcctc tttgacaaaa acaaagcgtg 241 tcaggctgat tctgatgcgc tttttttttg aaatgtcaca aaaattccat gtgggagatg 301 ggatctaaaa tcctcgtgca gaactttcca tccaggggga gaaaacttgt cgttttgagc 361 cgttcggtgt tcagaacgca cgaaaccgat cg //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS CECCOLE3IM 915 bp ds-DNA BCT 16-JUN-1986 DEFINITION Plasmid ColE3-CA38 immunity protein gene, complete cds, and colicin E3 gene, 3' end. ACCESSION X01162 KEYWORDS colicin; immune response gene; nuclease; toxin. SOURCE Plasmid ColE3-CA38 DNA, from E.coli. ORGANISM Plasmid Colicin E3-CA38 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 915) AUTHORS Lau,P.C.K., Rowsome,R.W., Zuker,M. and Visentin,L.P. TITLE Comparative nucleotide sequences encoding the immunity proteins and the carboxyl-terminal peptides of colicins E2 and E3 JOURNAL Nucleic Acids Res. 12, 8733-8745 (1984) STANDARD simple staff_review COMMENT EMBL features not translated to GenBank features: key from to description SITE 22 255 pot. receptor-binding domain SITE 256 546 pot. RNase domain FEATURES from to/span description pept < 1 549 colicin E3 (AA at 1) pept 559 816 immunity protein (aa 1-85) BASE COUNT 297 a 129 c 236 g 253 t ORIGIN 21 bp upstream of TaqI site. 1 gctgaaataa aacaatttaa tcgatttgcc catgacccaa tggctggcgg tcacagaatg 61 tggcaaatgg ccgggcttaa agcccagcgg gcgcagacgg atgtaaataa taagcaggct 121 gcatttgatg ctgctgcaaa agagaagtca gatgctgatg ctgcattgag ttctgctatg 181 gaaagcagga agaagaaaga agataagaaa aggagtgctg aaaataattt aaacgatgaa 241 aagaataagc ccagaaaagg ttttaaagat tacgggcatg attatcatcc agctccgaaa 301 actgagaata ttaaagggct tggtgatctt aagcctggga taccaaaaac accaaagcag 361 aatggtggtg gaaaacgcaa gcgctggact ggagataaag ggcgtaagat ttatgagtgg 421 gattctcagc atggtgagct tgaggggtat cgtgccagtg atggtcagca tcttggctca 481 tttgacccta aaacaggcaa tcagttgaaa ggtccagatc cgaaacgaaa tatcaagaaa 541 tatctttgag aggaagttat gggacttaaa ttggatttaa cttggtttga taaaagtaca 601 gaagatttta agggtgagga gtattcaaaa gattttggag atgacggttc agttatggaa 661 agtctaggtg tgccttttaa ggataatgtt aataacggtt gctttgatgt tatagctgaa 721 tgggtacctt tgctacaacc atactttaat catcaaattg atatttccga taatgagtat 781 tttgtttcgt ttgattatcg tgatggtgat tggtgatcaa atattatcag ggatgagttg 841 atatacgggc ttctagtgtt catggatgaa cgctggagcc tccaaatgta gaaatgttat 901 attttttatt gagtt //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS FSOFARS1A 132 bp ds-DNA PLN 15-MAR-1990 DEFINITION F.oxysporum autonomous replicating sequence (ARS). ACCESSION M29451 KEYWORDS autonomous replicating sequence. SOURCE F.oxysporum lycopersici (strain 73) ARS DNA, clone pFOLT4R4. ORGANISM Fusarium oxysporum Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Hypocreales; Hypocreaceae. REFERENCE 1 (bases 1 to 132) AUTHORS Powell,W.A. and Kistler,H.C. TITLE In vivo rearrangement of foreign DNA by Fusarium oxysporum produces linear self-replicating plasmids JOURNAL Unpublished (1989) College Env. Sci., S.U.N.Y., Sayracuse, NY 13210 STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.A.Powell 27-OCT-1989. FEATURES from to/span description site 1 9 DNA homologous to yeast core sequence rpt 7 12 telomere repeat unit site 111 117 ligation point of telomeres -> ApaI site BASE COUNT 26 a 8 c 56 g 42 t ORIGIN 116 bp upstream of ApaI site. 1 tttggtttag ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg 61 ttagggttag ggttagggtt agggttaggg ttagggttag ggttagggtt agggccctaa 121 ccctaaacca aa // LOCUS MZEGST3AA 913 bp ss-mRNA PLN 15-SEP-1989 DEFINITION Maize GSH glutathione S-transferase III mRNA, complete cds. ACCESSION X06755 KEYWORDS glutathione S-transferase; tandem repeat; transferase. SOURCE Maize, cDNA to mRNA, clone pGTC27. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 913) AUTHORS Grove,G., Zarlengo,R.P., Timmermann,K.P., Li,N., Tam,M.F. and Tu,C.-P.D. TITLE Characterization and heterospecific expression of cDNA clones of genes in the maize GSH S-transferase multigene family JOURNAL Nucleic Acids Res. 16, 425-438 (1988) STANDARD simple staff_entry COMMENT Data kindly reviewed (16 JUNE 1988) by D.Tu. FEATURES from to/span description pept 75 743 glutathione S transferase III (EC 2.5.1.18) mRNA < 1 873 GST mRNA (alt.) mRNA < 1 884 GST mRNA (alt.) mRNA < 1 898 GST mRNA (alt.) mRNA < 1 913 GST mRNA (alt.) rpt 54 71 tandemly repeated trinucleotide BASE COUNT 145 a 332 c 270 g 166 t ORIGIN 1 cccactttac tcctatccac tgcggcctgg acgcgtgcga gaggcttgac caagcagcag 61 cagcagcagc agcgatggcg cctctgaagc tgtacgggat gccgctgtcc cccaacgtgg 121 tgcgcgtggc caccgtgctc aacgagaagg gcctcgactt cgagatcgtc cccgtcgacc 181 tcaccaccgg cgcccacaag cagcccgact tcctcgccct caaccctttc ggccagatcc 241 cggctctcgt cgacggagac gaagtcctct tcgagtcccg tgcgatcaac cggtacatcg 301 ccagcaagta cgcgtcggag ggcacggacc tgctccccgc gacggcgtcg gcggcgaagc 361 tggaggtgtg gctggaggtg gagtcgcacc acttccaccc gaacgcgtcg ccgctggtgt 421 tccagctgct cgtgaggccg ctcctgggcg gcgcccccga cgcggcggtg gtggagaagc 481 acgcggagca gctcgccaag gtgctcgacg tgtacgaggc gcacctggcc cgcaacaagt 541 acctcgccgg ggacgagttc acgctcgccg acgccaacca cgcgctccta cctgctctca 601 cctcagcaag acccccaagg ccgggctgcg tcgccgcccg cccccacgtc aaggcctggt 661 gggaggccat cgccgcccgc cccgcgttcc agaagaccgt cgccgccatc cccttgcccc 721 cgccgccctc ctcctcggct tgacctcgcc ttgcgttgcc tgggtcgcgg atgcgtcgga 781 gccctgagtc gaataaaaga ggccgcatcc tgtcttgcat ttgctcgtgc gccatgtgct 841 ataacagcct gtgtaataaa cactgttgcc tttcgtgtgt gttcattgcc tttttgttgg 901 ttggtctttg caa // LOCUS MZEGSTI 929 bp ss-mRNA PLN 15-SEP-1989 DEFINITION Maize GSH glutathione S-transferase I mRNA, complete cds. ACCESSION X06754 KEYWORDS glutathione S-transferase. SOURCE Maize, cDNA to mRNA, clone pGTC2. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 929) AUTHORS Grove,G., Zarlengo,R.P., Timmermann,K.P., Li,N., Tam,M.F. and Tu,C.-P.D. TITLE Characterization and heterospecific expression of cDNA clones of genes in the maize GSH S-transferase multigene family JOURNAL Nucleic Acids Res. 16, 425-438 (1988) STANDARD simple staff_entry COMMENT Data kindly reviewed (16 June 1988) by D. Tu FEATURES from to/span description pept 82 726 GST glutathione S transferase I (EC 2.5.1.18) mRNA < 1 915 GST mRNA (alt.) mRNA < 1 918 GST mRNA (alt.) mRNA < 1 924 GST mRNA (alt.) mRNA < 1 929 GST mRNA (alt.) rpt 35 65 tandemly repeated pentanucleotide BASE COUNT 211 a 230 c 277 g 211 t ORIGIN 1 gggcgcagag aatccccaag caaacaaaca gggtagaggg agaggagagg agaggagagg 61 agaggttggg tctgggacac catggctccg atgaagctgt acggggcggt gatgtcgtgg 121 aacttgacga ggtgcgcaac ggcgctggag gaggctggct ccgactacga gatcgtgccc 181 atcaacttcg ccaccgccga gcacaagagc cccgagcacc tcgtccgcaa cccgtttggt 241 caggttccag ctctgcagga tggtgacttg tacctcttcg aatcacgagc aatctgcaag 301 tacgctgctc gcaaaaacaa gccagagctg ttgagggaag gaaacctcga ggaggcagca 361 atggtggatg tttggatcga ggtggaggct aaccagtaca ccgctgcact gaatcccatc 421 ctcttccagg tcctcatcag tcctatgctt gggggaacca ccgaccagaa agttgtggat 481 gagaaccttg agaagctgaa gaaggtgcta gaggtgtacg aggcacgcct gaccaagtgc 541 aagtaccttg ctggagactt cctcagcctc gccgacctga accatgtgtc tgtcactctc 601 tgcctgttcg ctacgcccta cgcatctgtg ctcgacgcct acccgcatgt gaaggcctgg 661 tggtctggtc tgatggagag gccgtctgtc cagaaggtcg ctgccctgat gaagccatct 721 gcttgaaacg gttgccctgt caatcggtga atggaataag ccaagtctgt tctgggtgtt 781 tgcttgttca gtgatgtgtc ttctatgacg aaagttgagt gccttatgtg tgagcatgct 841 tgcttgtaac gagaccttga attgttcgga aatgagcatt cctgtgctct tcttgaaaat 901 aaactttgcg ttaggttcct caggatctt // LOCUS MZEPCSSU 830 bp ss-mRNA PLN 15-DEC-1988 DEFINITION Maize ribulose 1,5-bisphosphate carboxylase small subunit mRNA, complete cds. ACCESSION D00170 KEYWORDS RuBPC; ribulose 1,5-bisphosphate carboxylase; small subunit. SOURCE Maize leaf, cDNA to mRNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 830) AUTHORS Matsuoka,M., Kano-Murakami,Y., Tanaka,Y., Ozeki,Y. and Yamamoto,N. TITLE Nucleotide sequence of cDNA encoding the small subunit of ribulose- 1,5-bisphosphate carboxylase from maize JOURNAL J. Biochem. 102, 673-676 (1987) STANDARD simple automatic COMMENT Comparison with the RuBPC small subunit genes from other plants revealed that the maize small subunit is similar to the wheat one, there being 73% homology between the transit peptides and 64% between the mature proteins. This indicates that there is no noteworthy difference between the C3 and C4 small subunit structures. corresponds to DDBJ (rel3.0) entry <RUBPCSSU>(D00170) dated 29-MAR-1988 FEATURES from to/span description pept 66 578 ribulose 1,5-bisphosphate carboxylase small subunit (SSu) BASE COUNT 147 a 281 c 220 g 182 t ORIGIN 65 bp upstream of the SSu initiation codon. 1 caggacgacc caagcaagca agcaagcagc gagtacatac atacatacta ggcagccagg 61 cagccatggc gcccaccgtg atgatggcct cgtcggccac cgccgtcgct ccgttccagg 121 ggctcaagtc caccgccagc ctccccgtcg cccgccgctc ctccagaagc ctcggcaacg 181 tcagcaacgg cggaaggatc cggtgcatgc aggtgtggcc ggcctacggc aacaagaagt 241 tcgagacgct gtcgtacctg ccgccgctgt cgacggacga cctgctgaag caggtggact 301 acctgctgcg caacggctgg ataccctgcc tcgagttcag caaggtcggc ttcgtgtacc 361 gcgagaactc cacctccccg tgctactacg acggccgcta ctggaccatg tggaagctgc 421 ccatgttcgg ctgcaacgac gccacccagg tgtacaagga gctgcaggag gccatcaaat 481 cctacccgga cgccttccac cgcgtcatcg gcttcgacaa catcaagcag acgcagtgcg 541 tcagcttcat cgcctacaag cccccgggca gcgactagac cgcgcccgcc ggccgccccc 601 cgccggctag ctagctagct agctcctgcg tgagctagta gctagtgcca tgcgtcgtct 661 ctgtcgttcg gttttgcttc gggtcaccgt accctttgct tgcttggttt cttctttcct 721 tttttccttt tttttttctt cttttccccg gccatggttc ctttgctttc agcagttctc 781 tgtgatgtga tgtatccatt gttgcaagca tgcatggcct tgcattggct // LOCUS PETRBCA 318 bp ss-mRNA PLN 15-MAR-1990 DEFINITION Petunia ribulose 1,5-bisphosphate carboxylase small subunit mRNA (clone pSSU 117), partial cds. ACCESSION M29641 KEYWORDS ribulose 1,5-bisphosphate carboxylase. SOURCE Petunia (strain Mitchell) leaf cDNA to mRNA, clone pSSU 117. ORGANISM Petunia hybrida Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 318) AUTHORS Dunsmuir,P., Smith,S. and Bedbrook,J. TITLE A number of different nuclear genes for the small subunit of RuBPCase are transcribed in petunia JOURNAL Nucleic Acids Res. 11, 4177-4183 (1983) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 318 ribulose 1,5-bisphosphate carboxylase (AA at 1) BASE COUNT 89 a 67 c 76 g 86 t ORIGIN 1 tacgagacac tctcatactt tcccgatttg accgacgaac aattgttcaa agaagttgag 61 taccttttga ataagggatg ggttccttgt ttggaattcg agctgaaaca caaatttatc 121 taccgtgaat atcacgcatc accaggatac tatgatggaa ggtactggac aatgtggaag 181 ttgcccatgt ttggttgcac tgatgctacc caagtcttgg gtgagctcca agaggccaag 241 aaggcttacc caaatgcatg gatcaggatc atcggattcg agaacgttcg tcaagtgcag 301 tgcatcagtt tcattgcc // LOCUS PETRBCB 212 bp ss-mRNA PLN 15-MAR-1990 DEFINITION Petunia ribulose 1,5-bisphosphate carboxylase small subunit mRNA (clone pSSU 51), partial cds. ACCESSION M29642 KEYWORDS ribulose 1,5-bisphosphate carboxylase. SOURCE Petunia (strain Mitchell) leaf cDNA to mRNA, clone pSSU 51. ORGANISM Petunia hybrida Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 212) AUTHORS Dunsmuir,P., Smith,S. and Bedbrook,J. TITLE A number of different nuclear genes for the small subunit of RuBPCase are transcribed in petunia JOURNAL Nucleic Acids Res. 11, 4177-4183 (1983) STANDARD simple staff_review FEATURES from to/span description pept < 1 207 ribulose 1,5-bisphosphate carboxylase (AA at 1) BASE COUNT 53 a 57 c 53 g 49 t ORIGIN 1 catgcatctc cagggtacta tgatggcagg tactggacca tgtggaagct gcccatgttc 61 gggtgcactg atgccaccca agtcttgggt gagctccaag aggccaagaa ggcttacccc 121 aatgcctgga tcagtatcat tggcttcgac aacgtacgtc aagtgcaatg catcagtttc 181 attgcctaca agccccaagg ctactagatt ac // LOCUS PETRBCC 251 bp ss-mRNA PLN 15-MAR-1990 DEFINITION Petunia ribulose 1,5-bisphosphate carboxylase small subunit mRNA (clone pSSU 41), partial cds. ACCESSION M29643 KEYWORDS ribulose 1,5-bisphosphate carboxylase. SOURCE Petunia (strain Mitchell) leaf cDNA to mRNA, clone pSSU 41. ORGANISM Petunia hybrida Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 251) AUTHORS Dunsmuir,P., Smith,S. and Bedbrook,J. TITLE A number of different nuclear genes for the small subunit of RuBPCase are transcribed in petunia JOURNAL Nucleic Acids Res. 11, 4177-4183 (1983) STANDARD simple staff_review FEATURES from to/span description pept < 1 141 ribulose 1,5-bisphosphate carboxylase (AA at 1) BASE COUNT 77 a 67 c 61 g 46 t ORIGIN 1 accgatgcaa cccaggtgtt gggtgagctc caagaggcca agaaggctta ccctaatgca 61 tggatcagaa tcatcggatt cgacaacgtc cgtcaagtgc aatgcatcag tttcattgcc 121 tacaagccac caggctacta agttatatta ggacagcttc ccatgtgtat ttaggggcag 181 tcccggcgga ttgggagcca aggaccccgg ccggccaaaa atagccagga acccaccaaa 241 aaaccaaaaa a // LOCUS SPIPSI2 863 bp ss-mRNA PLN 15-MAR-1990 DEFINITION Spinach psAD mRNA for subunit II precursor of photosystem I reaction center. ACCESSION X14017 KEYWORDS photosynthesis; photosystem I; photosystem I reaction center. SOURCE spinach (Spinacia oleracea). ORGANISM Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae. REFERENCE 1 (bases 1 to 863) AUTHORS Muench,S., Ljungberg,U., Steppuhn,J., Schneiderbauer,A., Nechushtai,R., Beyreuther,K. and Herrmann,R.G. TITLE Nucleotide sequences of cDNAs encoding the entire precursor polypeptides for subunits II and III of the photosystem I reaction center from spinach JOURNAL Curr. Genet. 14, 511-518 (1988) STANDARD simple automatic COMMENT *source: tissue=seedling; library=lambda gt11; clone=p6SocPS12-4. seq. differs from that reported by Lagoutte (1988), see Y00759; the psaD gene for subunit II probably exists only in a single copy per spinach genome EMBL features not translated to GenBank features: key from to description MSG <1 863 psaD mRNA SITE 842 846 put. polyA signal POLYA 863 863 polyA site FEATURES from to/span description pept 155 793 subunit II preprotein (AA -50 to 162) sigp 155 307 transit peptide (AA -50 to -1) matp 305 790 mature subunit II (AA 1 - 162) BASE COUNT 279 a 214 c 158 g 212 t ORIGIN 1 ctaaggaaaa cagaaaaatt gccacaaatt gggctacagg aagctttaat tcaaactaca 61 gtctaatggt acacattcaa caagtcctat aaagtacaat tagttgacaa aacaaaaaac 121 caatccaaag ttccaaaaca acaacaatca aattatggcc atggcaactc aagccaccct 181 cttctcccca tcttctctct cctccgcaaa acccattgac acccgcctca ccacctcttt 241 caaacaaccc tccgccctct ctttcgcctc caagcccgcc tcccgccacc actctatccg 301 ggcagccgcc gcagctgaag gcaaagctgc caccccaact gaaaccaagg aggcgcctaa 361 aggttttact cctccggagt tggaccctaa cactccctcc cccatctttg ccggcagcac 421 aggtggacta ttgaggaagg cacaagtaga ggagttttac gtgataacat gggaatcacc 481 aaaagagcaa atattcgaga tgccaacagg aggagcagca ataatgaggg aaggaccaaa 541 cttgctaaaa ttagcacgta aagagcaatg tttagcactt ggaacaagac ttaggtctaa 601 atacaagatt aagtaccaat tttacagggt tttcccaagt ggtgaagttc aatatttaca 661 ccctaaagat ggtgtttacc cagaaaaggt taaccctggt agacaaggtg ttgggttaaa 721 catgaggtct attggtaaga atgttagccc tattgaggtt aaatttactg gcaagcaacc 781 ttatgatttg taataacttt ttatcttatt ttcatcatgt tcatgctact atgttttgga 841 caaatacaat gtgattattt tag // LOCUS SPIPSI3 653 bp ss-mRNA PLN 15-MAR-1990 DEFINITION Spinach psaE mRNA for subunit III precursor of photosystem I reaction center. ACCESSION X14018 KEYWORDS photosynthesis; photosystem I; photosystem I reaction center; psaE gene. SOURCE spinach (Spinacia oleracea). ORGANISM Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae. REFERENCE 1 (bases 1 to 653) AUTHORS Muench,S., Ljungberg,U., Steppuhn,J., Schneiderbauer,A., Nechushtai,R., Beyreuther,K. and Herrmann,R.G. TITLE Nucleotide sequences of cDNAs encoding the entire precursor polypeptides for subunits II and III of the photosystem I reaction center from spinach JOURNAL Curr. Genet. 14, 511-518 (1988) STANDARD simple automatic COMMENT *source: tissue=seedling; library=lambda gt11; clone=p6SocPS12-4. at least two genes for subunit III are present in spinach genome designated psaE-1 and psaE-2 by [1] EMBL features not translated to GenBank features: key from to description MSG <1 653 psaE mRNA MSG <1 608 altern. psAE mRNA POLYA 608 608 altern. polyA site POLYA 653 653 polyA site FEATURES from to/span description pept 106 483 subunit III preprotein (AA -34 to 91) sigp 106 207 transit peptide (AA -34 to -1) matp 208 480 subunit III (AA 1 - 91) BASE COUNT 198 a 140 c 139 g 176 t ORIGIN 1 aaaacacaaa gctactagtt caaagggact ccctcctttt cactacttgc aaaatttatt 61 tcttgtaaga agaaaaaaca gagagagaga gggagaaaat taataatggc tagcattgca 121 tcatcagtag ctgttaggct tgggctaaca caagtgttgc ccaacaaaaa cttctcatct 181 ccaaggagca ctaggcttgt tgtccgggct gctgaagagg ctgctgctgc acccgctgcc 241 gcatcgcccg aaggtgaagc acccaaagct gctgctaagc caccaccaat tggccccaag 301 aggggttcta aggtgagaat catgagaaag gaatcatact ggtacaaggg tgttggctca 361 gtggttgctg ttgatcagga tccaaaaaca agatatccag ttgtggtgag attcaacaag 421 gtgaattacg cgaatgtatc aaccaacaac tacgcactgg atgagattca agaagttgca 481 tgatcaactc tgttataagt ttgcgaaatt tccaaacatt tagatgaatt aggagctatt 541 tctgctgtgt gtatccaatg taaatctact tcttttgagt atcaacttct ttgcccatga 601 aacttgtcag atatgctcct catcatccta agtattcacc atattctttc ctt // LOCUS SPIPSI4 659 bp ss-mRNA PLN 15-MAR-1990 DEFINITION Spinach mRNA for photosystem I subunit V. ACCESSION X13134 Y00967 KEYWORDS photosystem I; photosystem I reaction center. SOURCE spinach (Spinacia oleracea). ORGANISM Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae. REFERENCE 1 (bases 1 to 659) AUTHORS Herrmann,R.G. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 659) AUTHORS Steppuhn,J., Hermans,J., Nechushtai,R., Ljungberg,U., Thuemmler,F., Lottspeich,F. and Herrmann,R.G. TITLE Nucleotide sequence of cDNA clones encoding the entire precursor polypeptides for subunits IV and V of the photosystem I reaction center JOURNAL FEBS Lett. 237, 218-224 (1988) STANDARD simple automatic COMMENT *source: tissue=green tissue; clone=p6SocPI5-7. EMBL features not translated to GenBank features: key from to description MSG <1 659 PSI subunit V mRNA SITE 620 624 put. polyA signal POLYA 659 659 polyA site [1] Author address Herrmann R.G., Botanisches Institut der Ludwig Maximilians Universiteit, Menzigerstr. 67, 8000 Muenchen 19, FRG. Submitted (06-OCT-1988) on tape to the EMBL data library FEATURES from to/span description pept 16 519 PSI subunit V preprotein (AA -69 to 98) sigp 16 222 transit peptide (AA -69 to -1) matp 223 516 mature PSI subunit V (AA 1 - 98) BASE COUNT 185 a 128 c 131 g 215 t ORIGIN 1 taaatccgta aaaaaatggc agcagcaaca gcatctttga gcagcacttt actagcaccc 61 tgcagctcaa agcagccgca gccgcaacag caacatcagc atcagcagct caaatgcaag 121 tccttttccg gcctccgccc actgaagcta aacatcagca gcaacaacag cagcagcagt 181 ctatcgatgt cttctgcaag aagatcaatg acttgccgtg cagagttgag cccatcattg 241 gtgataagtc tgagtacagg gctatccctt tttttaggaa ggtttgtgtt cttcaatttc 301 cagagggaga atatggccaa acaggtgcct gaacaaaatg gtatgtctca ttttgaggcg 361 ggcgatactc gtgctaagga gtatgtgagc cttcttaagt ctaatgatcc tgttgggttt 421 aacattgttg atgttcttgc ttggggttct attggtcata ttgttgctta ttatatcttg 481 gctactgcta gcaatggtta tgaccctagc ttcttttaaa ttatcatatt tcggatttta 541 gtttcttttt taattgttat attattatta ttattattat tattatgttg agttgttgtg 601 taattaacag attctgctca aatattattg cattcacttc tgtaaaaaaa aaaaaaaaa // LOCUS SPIPSI5 923 bp ss-mRNA PLN 15-MAR-1990 DEFINITION Spinach mRNA for photosystem I subunit IV. ACCESSION X13133 Y00967 KEYWORDS photosystem I; photosystem I reaction center. SOURCE spinach (Spinacia oleracea). ORGANISM Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae. REFERENCE 1 (bases 1 to 923) AUTHORS Herrmann,R.G. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 923) AUTHORS Steppuhn,J., Hermans,J., Nechushtai,R., Ljungberg,U., Thuemmler,F., Lottspeich,F. and Herrmann,R.G. TITLE Nucleotide sequence of cDNA clones encoding the entire precursor polypeptides for subunits IV and V of the photosystem I reaction center JOURNAL FEBS Lett. 237, 218-224 (1988) STANDARD simple automatic COMMENT *source: tissue=green tissue; clone=p6SocPI4-7; EMBL features not translated to GenBank features: key from to description MSG <1 923 PSI subunit IV mRNA MSG 923 923 polyA site [1] Author address Herrmann R.G., Botanisches Institut der Ludwig Maximilians Universiteit, Menzigerstr. 67, 8000 Muenchen 19, FRG. Submitted (06-OCT-1988) on tape to the EMBL data library FEATURES from to/span description pept 30 725 PSI subunit IV preprotein (AA -77 to 154) sigp 30 260 signal peptide (AA -77 to -1) matp 261 722 PSI subunit IV (AA 1 - 154) BASE COUNT 249 a 183 c 202 g 289 t ORIGIN 1 cctcacttcc ttccatccaa aacacaacca tgtctttcac aatccccact aacctctaca 61 aacctcttgc taccaaacct aagcatctat cttcttcctc ctttgcacca agatcaaaga 121 ttgtttgcca acaagaaaat gatcaacaac aacctaagaa gctagagcta gctaaggtag 181 gtgccaacgc tgccgctgcg ttggctcttt cctcagtgct actttcctca tggagtgtag 241 cacccgatgc tgccatggct gacattgcag ggctaacacc atgcaaggag tccaagcagt 301 ttgctaagag ggaaaagcag gccttgaaga agttgcaagc ctcgttgaag ctgtacgcag 361 atgatagcgc accagcttta gcgatcaagg ctacaatgga gaagaccaag aagaggttcg 421 ataactatgg aaagtatggt ttgctttgtg ggtctgatgg acttcctcat ttgatagtga 481 gtggtgatca gagacactgg ggtgagttta tcacaccagg gattttgttc ttgtatattg 541 ctgggtggat tgggtgggtt ggaaggagtt atttgattgc tattagggat gagaagaaac 601 ctactcagaa ggagattatt attgatgttc ctttggcttc aagtcttctt tttagaggct 661 tcagttggcc tgttgctgct tatagagagt tgctcaatgg tgagcttgtt gacaacaact 721 tttaatttta tcatcctttt atcaaatcaa ttaatcctgt gttatattgt ctactatgta 781 tatcttaaat actccttttt ggaaacttga atgtgtgttt cgattgtttg attgattcat 841 ccggatgtaa tcatcatccg attagaattg tgtaagtgtt tgtttaatga tgctaattgg 901 aaccgacgat cttttacaaa aaa //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS ATPCPRBCLA 1790 bp ds-DNA ORG 20-FEB-1990 DEFINITION A.rosea chloroplast ribulose bisphosphate carboxylase/oxygenase large subunit (rbcL) gene, complete cds. ACCESSION X15924 KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase. SOURCE A.rosea chloroplast DNA. ORGANISM Chloroplast Atriplex rosea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae; Atriplex rosea. REFERENCE 1 (bases 1 to 1790) AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R., Andrews,T.J. and Whitfield,P.R. TITLE Comparisons of rbcL genes for the large subunit of ribulose-bisphosphate carboxylase from closely related C3 and C4 plant species JOURNAL J. Biol. Chem. 265, 808-814 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by G.S. Fudson, 20-JUL-1989. FEATURES from to/span description pept 259 1686 ribulose bisphosphate carboxylase/oxygenase. large subunit (EC 4.1.1.39) BASE COUNT 521 a 314 c 417 g 538 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccgaaa ttagaaactt gaaaaacaac ggttacgatt gggttgcgcc atatatatga 61 aagagtatac aataatgatg tatttgacga atcaaataca tggtctatta gcgaaccgtt 121 tgattagttg ataatattaa ttgagaattt attttatgaa agattcctat aaaaggtttc 181 attaggggct aatttatgtc gagtagacct tgttgttttg ttgtaaaaat ttaaaattga 241 agttgtaggg agggacttat gtcaccacaa acagagacta aagcaaatgt tggatttaaa 301 gctggtgtta aagattacaa attgacttat tatactcctg agtatgaaac cctagatact 361 gatatcttgg cagcattccg agtaagtcct caaccgggag ttccacccga agaagcaggg 421 gctgcagtag ctgctgaatc ttctactggt acatggacaa ctgtatggac cgacggactt 481 accagtcttg atcgttacaa aggacgatgc taccacatcg agcctgttgc tggagaagaa 541 aatcaatata tttgttatgt agcatatccc ttagaccttt ttgaagaagg ttctgttact 601 aatatgttta cttccattgt gggtaacgta tttgggttca aagccctgcg tgctctacgt 661 ttggaggatt tgcgaatccc tgttgcttat gtaaaaactt tccaaggccc gcctcacggt 721 atccaagttg agagagataa attgaataag tatggccgcc ccctattggg atgcactatt 781 aaaccaaaat tggggttatc cgctaaaaac tatggtcgag cagtttatga agtccttcgt 841 ggtggacttg attttaccaa agatgatgaa aacgtgaact cccagccgtt tatgcgttgg 901 agagaccgtt tcctattttg tgccgaagct ctttataaag cacaagccga aacaggcgaa 961 atcaagggtc attacttgaa tgctactgcg ggtacatgcg aagacatgat gaaaagggct 1021 gtatttgcca gagaattggg agttcctatc gtaatgcatg actacttaac agggggattc 1081 actgcaaata ctactttgtc tcattattgc cgagataatg gtctacttct gcacatccac 1141 cgtgcaatgc acgcagttat tgatagacag aagaatcatg gtatacactt ccgtgtacta 1201 gctaaagcgt tacgtctgtc tggtggagat catattcatt ctggtaccgt agtaggtaag 1261 cttgaaggag aaagagatat tactttaggt tttgttgatt tactacgtga tgattatact 1321 gaaaaagacc gaagtcgtgg tatttatttt agtcaatctt gggtttccac accaggtgtt 1381 ctgcccgttg cttcgggagg tattcacgtt tggcatatgc ctgctctaac cgagatcttt 1441 ggggatgatt ctgtactaca gtttggtgga ggaactttag gacacccttg ggggaatgca 1501 ccgggtgctg tagcgaatcg agtagctcta gaagcatgtg tacaagctcg taatgaggga 1561 cgtgatcttg ctcgtgaggg taatacaatt attcgcgagg ctagcaaatg gagtcctgaa 1621 ctagctgctg cttgtgagat atggaaggaa atcaaatttg aattcccagc aatggataca 1681 gtttaggcta agtaattaat gttcggtctc ttagttgaat tgtaattaaa caaactcggc 1741 ccaatctttt actaaaagga ttgagccgaa tacaattatt gtagatatat // LOCUS ATPCPRBCLB 1786 bp ds-DNA ORG 20-FEB-1990 DEFINITION A.patula chloroplast ribulose bisphosphate carboxylase/oxygenase large subunit (rbcL) gene, complete cds. ACCESSION X15925 KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase. SOURCE A.patula chloroplast DNA. ORGANISM Chloroplast Atriplex patula Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae; Atriplex patula. REFERENCE 1 (bases 1 to 1786) AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R., Andrews,T.J. and Whitfield,P.R. TITLE Comparisons of rbcL genes for the large subunit of ribulose-bisphosphate carboxylase from closely related C3 and C4 plant species JOURNAL J. Biol. Chem. 265, 808-814 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by G.S. Fudson, 20-JUL-1989. FEATURES from to/span description pept 255 1682 ribulose bisphosphate carboxylase/oxygenase. large subunit (EC 4.1.1.39) BASE COUNT 517 a 316 c 419 g 534 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccgaaa ttagaaactt gaaaaacaac ggttacgatt gggttgcgcc atatatatga 61 aagagtatac aataatgatg tatttgacga atcaaataca tggtctatta gcgaaccgtt 121 tgattagttg ataatattaa ttgagaattt tatgaaagat tcctataaaa ggtttcatta 181 ggggctaatt tatgtcgagt agaccttgtt gttttgttgt aaaaatttaa aattgaagtt 241 gtagggaggg acttatgtca ccacaaacag agactaaagc aagtgttgga tttaaagctg 301 gtgttaaaga ttacaaattg acttattata ctcctgagta tgaaacccta gatactgata 361 tcttggcagc attccgagta agtcctcaac cgggagttcc acccgaagaa gcaggggctg 421 cagtagctgc tgaatcttct actggtacat ggacaactgt atggaccgac gggcttacca 481 gtcttgatcg ttacaaagga cgatgctacc acatcgagcc tgttgctgga gaagaaaatc 541 aatatatttg ttatgtagca tatcccttag acctttttga agaaggttct gttactaata 601 tgtttacttc cattgtgggt aacgtatttg ggttcaaagc cctgcgtgct ctacgtttgg 661 aggatttgcg aatccctgtt gcttatgtaa aaactttcca aggcccgcct cacggtatcc 721 aagttgagag agataaattg aacaagtatg gccgccccct attgggatgc actattaaac 781 caaaattggg gttatccgct aaaaactatg gtcgagcagt ttatgaagtc cttcgtggtg 841 gacttgattt taccaaagat gatgaaaacg tgaactccca gccgtttatg cgttggagag 901 accgtttcct attttgtgcc gaagctcttt ataaagcaca agccgaaaca ggcgaaatca 961 agggtcatta cttgaatgct actgcgggta catgcgaaga catgatgaaa agggctgtat 1021 ttgccagaga attgggagtt cctatcgtaa tgcatgacta cttaacaggg ggattcactg 1081 caaatactac tttgtctcat tattgccgag ataatggtct acttctgcac atccaccgtg 1141 caatgcacgc agttattgat agacagaaga atcatggtat gcacttccgt gtactagcta 1201 aagcgttacg tctgtctggt ggagatcata ttcattctgg taccgtagta ggtaagcttg 1261 aaggagaaag agatattact ttaggttttg ttgatttact acgtgatgat tatactgaaa 1321 aagaccgaag tcgtggtatt tattttactc aatcttgggt ttccacacca ggtgttctgc 1381 ccgttgcttc gggaggtatt cacgtttggc atatgcctgc tctaaccgag atctttgggg 1441 atgattctgt actacagttt ggtggaggaa ctttaggaca cccttggggg aatgcaccgg 1501 gtgctgtagc gaatcgagta gctctagaag catgtgtaca agctcgtaat gagggacgtg 1561 atcttgctcg tgagggtaat acaattattc gcgaggctag caaatggagt cctgaactag 1621 ctgctgcttg tgagatatgg aaggaaatca aatttgaatt cccagcaatg gatacagttt 1681 aggctaagta attaatgttc ggtctcttag ttgaattgta attaaacaaa ctcggcccaa 1741 tcttttacta aaaggattga gccgaataca attattgtag atatat // LOCUS BSPMANN 1939 bp ds-DNA BCT 06-FEB-1990 DEFINITION Bacillus sp. beta-mannanase gene, complete cds. ACCESSION M31797 KEYWORDS 1,4-beta-D-mannan mannanohydrolase; mannan endo-1,4-beta-mannosidase; mannanase. SOURCE Bacillus sp. (strain AM-001) DNA, clone pMAH5. ORGANISM Bacillus sp. Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1939) AUTHORS Akino,T., Kato,C. and Horikoshi,K. TITLE Two bacillus beta-mannanases having different COOH termini are produced in Escherichia coli carrying pMAH5 JOURNAL Appl. Environ. Microbiol. 55, 3178-3183 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 220 1761 beta-mannanase precursor (EC 3.2.1.78) sigp 220 297 beta-mannanase signal peptide matp 298 1758 beta-mannanase signal 1826 1840 pot. transcription termination signal signal 1861 1889 pot. transcription termination signal signal 1905 1916 pot. transcription termination signal BASE COUNT 637 a 343 c 420 g 539 t ORIGIN 1 bp upstream of XbaI site. 1 tctagactcc aaaggttact atcaacctgt ctatttattt aactgtacag tagatggggt 61 agaatcaaac catcatcatc cctgccatct aaattcatta tatgaactcc tcaatagaga 121 acaacaaatc ataatccaac catatttttc taatcaatca ctatgttaag ataaaaaatg 181 taatcgctta caattaaaag gatagaggag gattatgtaa tgaaggtgta caagaaggtg 241 gcttttgtta tggcttttat tatgtttttt tcggtcctgc cgacgatctc aatgtcgtca 301 gaagcaaacg gtgctgcatt atcgaatcct aatgcgaacc aaacgacaaa aaacgtgtat 361 agttggttag ccaatctacc aaacaagagt aataaacgtg tggtgtcggg acacttcgga 421 gggtacagtg attctacctt agcctggatc aaacaatgcg caagggagct gacaggaaaa 481 atgccaggaa tattatcttg tgattataag aattggcaga cgcgattgta tgtagccgat 541 caaattagct atggctgcaa tcaagaatta ataaactttt ggaaccaagg aggtttggtc 601 acgatcagtg tacacatgcc aaatccaggg tttcattcgg gggaaaacta caaaacaatt 661 ttgcctactt cacagttcca aaatctaacc aatcacagga caacagaggg tagaaggtgg 721 aaggatatgc tggataagat ggcagatggg ttggacgagc tacagaacaa tggagtgacg 781 gttcttttcc gtcctttaca tgaaatgaat ggagaatggt tctggtgggg agcagaaggt 841 tacaatcaat ttgatcaaac acgtgccaat gcctatatca gcgcatggag agatatgtat 901 caatatttta ctcatgagcg taagctgaat aaccttattt gggtttactc acctgatgtt 961 tacagagatc atgtaacaag ttactaccca ggagcaaatt atgtagatat tgtggctctt 1021 gattcctacc atcctgatcc acatagcctt actgaccaat ataatcgaat gatcgcttta 1081 gataaacctt ttgcttttgc tgaaatcggt cctcctgaaa gcatggctgg ttcctttgat 1141 tattcaaatt atattcaagc aattaaacaa aaatatccac gtactgtcta tttcctagct 1201 tggaatgata aatggagtcc acataacaac agaggagcat gggatctatt taatgattca 1261 tgggttgtaa ataggggaga gattgattat ggtcaatcaa atccagccac tgttctctat 1321 gattttgaaa acaatacgct atcgtggtcc gggtgtgaat ttacggacgg aggaccatgg 1381 acttcgaatg aatggtcggc aaatggtact caatcgttga aagcagatgt cgttctgggc 1441 aataatagct accatttgca aaaaacagtg aatcgaaatc ttagttcatt caaaaaccta 1501 gaaattaaag tgagccattc ttcgtgggga aatgtaggaa gtggcatgac agcaagagtt 1561 ttcgtcaaaa cagggagtgc ttggagatgg aatgcaggtg aattttgtca gtttgcaggc 1621 aaacgaacaa ccgcactatc tattgatttg acgaaagtaa gtaatctgca tgatgttcga 1681 gagataggtg tagagtataa agcaccagca aatagcaacg ggaagacggc gatttactta 1741 gatcatgtga ccgtaagata atacaaaaaa aagtggttga aagcggtaac atatctagca 1801 tatgatgata gggactagat aataatagac tgtcagacta ggaggtaagt cataatgaaa 1861 aaaagtctga tcctcttgct cggactttta ttagctttct ccatgctatt aatagcctat 1921 ctatcattca cccctgcag // LOCUS FTRCPRBCLA 1839 bp ds-DNA ORG 20-FEB-1990 DEFINITION F.bidentis chloroplast ribulose bisphosphate carboxylase/oxygenase large subunit (rbcL) gene, complete cds. ACCESSION X15923 KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase. SOURCE F.bidentis chloroplast DNA. ORGANISM Chloroplast Flaveria bidentis Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Asterales; Asteraceae; Flaveria bidentis. REFERENCE 1 (bases 1 to 1839) AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R., Andrews,T.J. and Whitfield,P.R. TITLE Comparisons of rbcL genes for the large subunit of ribulose-bisphosphate carboxylase from closely related C3 and C4 plant species JOURNAL J. Biol. Chem. 265, 808-814 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by G.S. Fudson, 20-JUL-1989. FEATURES from to/span description pept 280 1737 ribulose bisphosphate carboxylase/oxygenase. large subunit (EC 4.1.1.39) BASE COUNT 523 a 325 c 437 g 554 t ORIGIN 1 aggggggaat ttcttagttt ttgggtgatt tttaggtatt tggataaaaa aaaaagcatt 61 gggttgcgcc atatatatga aagagtatac aataatgatg tatttgccga atcaaatacc 121 atggtctaat aatcaagcat tctgattagt tgataatttt actattagtt gggaattttg 181 tgaaaggttc ctgtaaaaag tttcattaac gcctaattca tgtcgagtag accttgttgt 241 tgtgagaatt cttaattcat gagttgtagg gagggattta tgtcaccaca aacagagact 301 aaagcaagtg ttggattcaa agctggtgtt aaagattata aattgactta ttatactcct 361 gaatatgaaa ccaaggatac tgatatcttg gcagcatttc gagtaactcc tcaacctgga 421 gttccgcctg aagaagcagg ggccgcagta gctgccgaat cttctactgg tacatggaca 481 actgtatgga ccgatggact tacgagcctt gatcgttaca aaggccgatg ctatggaatc 541 gagcctgttc ctggagaaga caatcaatat attgcttatg tagcttaccc attagacctt 601 tttgaagaag gttctgttac taacatgttt acttccattg taggtaatgt atttgggttc 661 aaagccctgc gtgctctacg tctggaagat ttgcgaatcc ctactgcgta tgttaaaact 721 ttcgcgggtc cgcctcacgg tatccaagtt gagagagata aattgaacaa gtatggtcgt 781 cccctgttgg gctgtactat taaacctaaa ttggggttat ccgctaaaaa ctacggtaga 841 gcttgttatg aatgtcttcg tggtggcctt gattttacta aagatgatga gaacgtgaac 901 tcccaaccat ttatgcgttg gagagaccgt ttcttatttt gtgccgaagc tatttataaa 961 gcacaagctg aaacaggtga aatcaaaggg cattacttga atgctactgc gggtacatgc 1021 gaagaaatgc ttaaaagggc tgtatttgcc agagaattgg gagttcctat cataatgcat 1081 gactacctaa cagggggatt cactgcaaat actagcttgt ctcattattg ccgagataac 1141 ggcctacttc ttcacatcca ccgcgcaatg catgcagtta ttgatagaca gaagaatcat 1201 ggtattcact tccgtgtact agctaaagcg ttacgtatgt ctggtggaga tcacattcat 1261 tccggtaccg tagtaggtaa acttgaaggg gaaagagaaa tcactttggg ctttgttgat 1321 ttattgcgtg atgattttat tgaaaaagat agaagtcgcg gtatttattt cacccaagat 1381 tgggtctctc taccaggtgt tctgcctgta gcttcggggg gtattcacgt ttggcatatg 1441 cctgctctga ccgagatctt tggggatgat tccgtactac agttcggtgg aggaacttta 1501 gggcaccctt gggggaatgc acctggtgcc gtagctaatc gagtagctct agaagcatgt 1561 gtacaagctc gtaatgaggg acgtgatctt gctactgagg gtaatgagat tatccgtgag 1621 gctaccaaat ggagccctga actagctgct gcttgtgaag tatggaagga gatcaaattt 1681 gagttccagg caatggatac tttggatacg gataaggata aggataagaa gagataacgt 1741 tcgttctctt aattgaattt caatgaaact cggcccaatc ttttactaaa aggattgagc 1801 cgaatccaac atgcatatat tttagataga tacatactt // LOCUS FTRCPRBCLB 1842 bp ds-DNA ORG 20-FEB-1990 DEFINITION F.pringlei chloroplast ribulose bisphosphate carboxylase/oxygenase large subunit (rbcL) gene, complete cds. ACCESSION X15922 KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase. SOURCE F.pringlei chloroplast DNA. ORGANISM Chloroplast Flaveria pringlei Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Asterales; Asteraceae; Flaveria pringlei. REFERENCE 1 (bases 1 to 1842) AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R., Andrews,T.J. and Whitfield,P.R. TITLE Comparisons of rbcL genes for the large subunit of ribulose-bisphosphate carboxylase from closely related C3 and C4 plant species JOURNAL J. Biol. Chem. 265, 808-814 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by G.S. Fudson, 20-JUL-1989. FEATURES from to/span description pept 283 1740 ribulose bisphosphate carboxylase/oxygenase. large subunit (EC 4.1.1.39) BASE COUNT 529 a 324 c 437 g 552 t ORIGIN 1 aggggggaat ttcttagttt ttgggtgatt tttaggtatt tcgataaaaa aaaaaaaaga 61 attgggttgc gccatatata tgaaagagta tacaataatg atgtatttgc cgaatcaaat 121 accatggtct aataatcaag cattctgatt agttgataat tttactatta gttgggaatt 181 ttgtgaaagg ttcctgtaaa aagtttcatt aacgcctaat tcatgtcgag tagaccttgt 241 tgttgtgaga attcttaatt catgagttgt agggagggat ttatgtcacc acaaacagag 301 actaaagcaa gtgttggatt caaagctggt gttaaagatt ataaattgac ttattatact 361 cctgaatatg aaaccaagga tactgatatc ttggcagcat ttcgagtaac tcctcaacct 421 ggagttccgc ctgaagaagc aggggccgca gtagctgccg aatcttctac tggtacatgg 481 acaactgtat ggaccgatgg acttacgagc cttgatcgtt acaaaggccg atgctatgga 541 atcgagcctg ttcctggaga agacaatcaa tatattgctt atgtagctta cccattagac 601 ctttttgaag aaggttctgt tactaacatg tttacttcca ttgtaggtaa tgtatttggg 661 ttcaaagccc tgcgtgctct acgtctggaa gatttgcgaa tccctactgc gtatgttaaa 721 actttcgagg gtccgcctca cggtatccaa gttgagagag ataaattgaa caagtatggt 781 cgtcccctgt tgggatgtac tattaaacct aaattggggt tatccgctaa aaactacggt 841 agagcttgtt atgaatgtct tcgtggtggc cttgatttta ctaaagatga tgagaacgtg 901 aactcccaac catttatgcg ttggagagac cgtttcttat tttgtgccga agctatttat 961 aaagcacaag ctgaaacagg tgaaatcaaa gggcattact tgaatgctac tgcgggtaca 1021 tgcgaagaaa tgcttaaaag ggctgtattt gccagagaat tgggagttcc tatcgtaatg 1081 catgactacc taacaggggg attcactgca aatactagct tgtctcatta ttgccgagat 1141 aacggcctac ttcttcacat ccaccgcgca atgcatgcag ttattgatag acagaagaat 1201 catggtatgc acttccgtgt actagctaaa gcgttacgta tgtctggtgg agatcacatt 1261 cattccggta ccgtagtagg taaacttgaa ggggaaagag aaatcacttt gggctttgtt 1321 gatttactgc gtgatgattt tattgaaaaa gatagaagtc gcggtattta tttcacccaa 1381 gattgggtct ctctaccagg tgttctgcct gtagcttcgg ggggtattca cgtttggcat 1441 atgcctgctc tgaccgagat ctttggggat gattccgtac tacagttcgg tggaggaact 1501 ttagggcacc cttgggggaa tgcacctggt gccgtagcta atcgagtagc tctagaagca 1561 tgtgtacaag ctcgtaatga gggacgcgat cttgctactg agggtaatga aattatccgt 1621 gaggctacca aatggagtcc tgaactagct gctgcttgtg aagtatggaa ggagatcaaa 1681 tttgagttcc aggcaatgga tactttggat acggataagg ataaggataa gaagagataa 1741 cgttcgttct cttaattgaa tttcaatgaa actcggccca atcttttact aaaaggattg 1801 agccgaatcc aacatgcata tattttagat agatacatac tt // LOCUS HUMHPRT1 1331 bp ss-mRNA PRI 15-DEC-1989 DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) mRNA, complete cds. ACCESSION M31642 J00205 KEYWORDS hypoxanthine phosphoribosyltransferase; transferase. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1331) AUTHORS Jolly,D.J., Okayama,H., Berg,P., Esty,A.C., Filpula,D., Bohlen,P., Johnson,G.G., Shively,J.E., Hunkapillar,T. and Friedmann,T. TITLE Isolation and characterization of a full-length expressible cDNA for human hypoxanthine phosphoribosyltransferase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 477-481 (1983) STANDARD full staff_review FEATURES from to/span description pept 86 742 hypoxanthine phosphoribosyltransferase /nomgen="HPRT" /map="Xq26" /hgml_locus_uid="LE0031K" mRNA < 1 1331 hprt mRNA BASE COUNT 385 a 237 c 281 g 428 t ORIGIN Chromosome Xq26. 1 tcttgctgcg cctccgcctc ctcctctgct ccgccaccgg cttcctcctc ctgagcagtc 61 agcccgcgcg ccggccggct ccgttatggc gacccgcagc cctggcgtcg tgattagtga 121 tgatgaacca ggttatgacc ttgatttatt ttgcatacct aatcattatg ctgaggattt 181 ggaaagggtg tttattcctc atggactaat tatggacagg actgaacgtc ttgctcgaga 241 tgtgatgaag gagatgggag gccatcacat tgtagccctc tgtgtgctca aggggggcta 301 taaattcttt gctgacctgc tggattacat caaagcactg aatagaaata gtgatagatc 361 cattcctatg actgtagatt ttatcagact gaagagctat tgtaatgacc agtcaacagg 421 ggacataaaa gtaattggtg gagatgatct ctcaacttta actggaaaga atgtcttgat 481 tgtggaagat ataattgaca ctggcaaaac aatgcagact ttgctttcct tggtcaggca 541 gtataatcca aagatggtca aggtcgcaag cttgctggtg aaaaggaccc cacgaagtgt 601 tggatataag ccagactttg ttggatttga aattccagac aagtttgttg taggatatgc 661 ccttgactat aatgaatact tcagggattt gaatcatgtt tgtgtcatta gtgaaactgg 721 aaaagcaaaa tacaaagcct aagatgagag ttcaagttga gtttggaaac atctggagtc 781 ctattgacat cgccagtaaa attatcaatg ttctagttct gtggccatct gcttagtaga 841 gctttttgca tgtatcttct aagaatttta tctgttttgt actttagaaa tgtcagttgc 901 tgcattccta aactgtttat ttgcactatg agcctataga ctatcagttc cctttgggcg 961 gattgttgtt taacttgtaa atgaaaaaat tctcttaaac cacagcacta ttgagtgaaa 1021 cattgaactc atatctgtaa gaaataaaga gaagatatat tagtttttta attggtattt 1081 taatttttat atatgcagga aagaatagaa gtgattgaat attgttaatt ataccaccgt 1141 gtgttagaaa agtaagaagc agtcaatttt cacatcaaag acagcatcta agaagttttg 1201 ttctgtcctg gaattatttt agtagtgttt cagtaatgtt gactgtattt tccaacttgt 1261 tcaaattatt accagtgaat ctttgtcagc agttcccttt taaatgcaaa tcaataaatt 1321 cccaaaaatt t // LOCUS NRACPRBCLA 2010 bp ds-DNA ORG 20-FEB-1990 DEFINITION N.tenuifolia chloroplast ribulose bisphosphate carboxylase/oxygenase (rbcL) gene, complete cds. ACCESSION X15920 KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase. SOURCE N.tenuifolia chloroplast DNA. ORGANISM Chloroplast Neurachne tenuifolia Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Graminneae; Neurachne tenuifolia. REFERENCE 1 (bases 1 to 2010) AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R., Andrews,T.J. and Whitfield,P.R. TITLE Comparisons of rbcL genes for the large subunit of ribulose-bisphosphate carboxylase from closely related C3 and C4 plant species JOURNAL J. Biol. Chem. 265, 808-814 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by G.S. Fudson, 20-JUL-1989. FEATURES from to/span description pept 438 1874 ribulose bisphosphate carboxylase/oxygenase. large subunit (EC 4.1.1.39) BASE COUNT 614 a 345 c 453 g 598 t ORIGIN 1 agtagcccga tataaaataa ataaaaagaa aggggcctat taagaacttt caaaattgta 61 aaataaggat tagggattgg tttgggttgc gctatatcta tcaaagagta tacaataatg 121 atggatttgg tgaatcaaat ccatggttta ataacaaatc gtgttaactt accataacaa 181 caactcaatt cctatcgaat tcctatagta gaattcctac aggatagaac gtacacaggg 241 tgtacgcatt atatatgaat gaaacatatt cattaactta agcatactcc tttttttatt 301 taatgagttg atattaattg aatatctttg tttttttttt tttagatttt tgcaaaggtt 361 tcatttacgc ctaatccata tcgagtagac cctgtcgttg tgagaattat taattaatga 421 gttgtaggga gggacttatg tcaccacaaa cagaaactaa agcaagtgtt ggatttaaag 481 ctggtgttaa ggattataaa ttgacttact acaccccgga gtacgaaacc aaggatactg 541 atatcttggc agcatttcga gtaactcctc agcccggggt tccgcctgaa gaagcagggg 601 ctgcagtagc tgcggaatct tctactggta catggacaac tgtttggact gatggactta 661 ccagtcttga tcgttacaaa ggacgatgct atcacatcga gcccgttcct ggggaggaag 721 atcaatatat ctgttatgta gcttatccat tagacctatt tgaagagggt tctgttacta 781 acatgtttac ttccattgtg ggtaacgtat ttggtttcaa agccctacgc gctctacgtt 841 tggaggatct acgaattccc cctacttatt caaaaacttt ccaaggtccg cctcacggta 901 tccaagttga aagggataag ttgaacaagt atgggcgtcc tttattggga tgtactatta 961 aaccaaaatt gggattatct gcaaaaaatt acggtagagc gtgttatgag tgtctacgcg 1021 gtggacttga ttttaccaaa gatgatgaaa acgtaaactc acaaccattt atgcgctgga 1081 gagaccgttt tgtcttttgt gcggaagcaa tttataaagc acaagccgaa accggtgaaa 1141 tcaagggaca ttacttgaat gcgactgcag gtacatgcga agaaatgatg aagagagcta 1201 tatttgcaag agaattaggg gttcctattg taatgcatga ctacttaact ggaggattca 1261 ccgcaaatac tagtttggct atgtattgcc gcgacaacgg cctacttctt cacattcacc 1321 gagcaatgca tgcagttatt gatagacaga aaaatcatgg tatgcatttc cgtgtattag 1381 ctaaagcatt gcgtatgtcg gggggagatc atgtccacgc cggtacagta gtaggtaagt 1441 tagaagggga acgtgaaata actttaggtt ttgttgattt attgcgcgat gattttattg 1501 aaaaagatcg atctcgcggt gtctttttca ctcaggactg ggtatccatg ccaggtgtta 1561 taccggtggc ttcagggggt attcatgttt ggcatatgcc agctctgacc gaaatctttg 1621 gagacgattc cgtattacaa tttggtggag gaactttagg acatccttgg ggaaatgcac 1681 ctggtgcagc agctaatcgt gtggctttag aagcctgtgt acaagctcgt aacgaagggc 1741 gtgatcttgc tcgtgaaggt aatgaaatta tcaaagcagc ttgcaaatgg agtcctgaac 1801 tagccgcagc ttgtgaagta tggaaggcga tcaaatttga gttcgcgccg gtggataccg 1861 tagataaggt ctaaataaaa taaaaaagaa gtaaaataga aagagaaaaa atcagttacg 1921 aaatgcagta attcttcttt attcttctaa ttgattgcaa ttaaattcgg ctcaatcttt 1981 taaaagattg agccgaattt aaatagatct // LOCUS NRACPRBCLB 1990 bp ds-DNA ORG 20-FEB-1990 DEFINITION N.munroi chloroplast ribulose bisphosphate carboxylase/oxygenase large subunit (rbcL) gene, complete cds. ACCESSION X15921 KEYWORDS ribulose bisphosphate carboxylase; ribulose bisphosphate oxygenase. SOURCE N.munroi chloroplast DNA. ORGANISM Chloroplast Neurachne munroi Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Gramineae; Neurachne munroi. REFERENCE 1 (bases 1 to 1990) AUTHORS Hudson,G.S., Mahon,J.D., Anderson,P.A., Gibbs,M.J., Badger,M.R., Andrews,T.J. and Whitfield,P.R. TITLE Comparisons of rbcL genes for the large subunit of ribulose-bisphosphate carboxylase from closely related C3 and C4 plant species JOURNAL J. Biol. Chem. 265, 808-814 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by G.S. Fudson, 20-JUL-1989. FEATURES from to/span description pept 418 1854 ribulose bisphosphate carboxylase/oxygenase. large subunit (EC 4.1.1.39) BASE COUNT 613 a 340 c 445 g 592 t ORIGIN 1 agtagcccga tataaaataa ataaaaagaa aggggcctat taagaacttt caaaatccta 61 aaataaggat tagggattgg tttgggttgc gctatatcta tcaaagagta tacaataatg 121 atggaatcaa atccatggtt taataacgaa tcgtgttaac ttaccataac aacaactcaa 181 ttcctataga attcctatag gatagaacgt acacagggtg tacgcattat atatgaatga 241 aacatattca ttaacttaag catactcctt tttttattta atgagttgat attaattgaa 301 tatttttgaa tatttttttt tttagatttt tgcaaaggtt tcatttacgc ctaatccata 361 tcgagtagac cttgtcgttg tgagaattat taattaatga gttgtaggga gggacttatg 421 tcaccacaaa cagaaactaa agcaagtgtt ggatttaaag ctggtgttaa ggattataaa 481 ttgacttact acaccccgga gtacgaaacc aaggatactg atatcttggc agcatttcga 541 gtaactcctc agcccggggt tccgcctgaa gaagcagggg ctgcagtagc tgcggaatct 601 tctactggta catggacaac tgtttggact gatggactta ccagtcttga tcgttacaaa 661 ggacgatgct atcacatcga gcccgttcct ggggaagaag atcaatatat ctgttatgta 721 gcttatccat tagacctatt tgaagagggt tctgttacta acatgtttac ttccattgtg 781 ggtaacgtat ttggtttcaa agccctacgc gctctacgtt tggaggatct acgaattccc 841 cctacttatt caaaaacttt ccaaggtccg cctcacggta tccaagttga aagggataag 901 ttgaacaagt atgggcgtcc tttattggga tgtactatta aaccaaaatt gggattatct 961 gcaaaaaatt acggtagagc gtgttatgag tgtctacgcg gtggacttga ttttaccaaa 1021 gatgatgaaa acgtaaactc acaaccattt atgcgctgga gagaccgttt tgtcttttgt 1081 gcggaagcaa tttataaagc acaagccgaa accggtgaaa tcaagggaca ttacttgaat 1141 gcgactgcag gtacatgcga agaaatgatg aagagagctg catttgcaag agaattaggg 1201 gttcctattg taatgcatga ctacttaact ggaggattca ccgcaaatac tagtttgtct 1261 atgtattgcc gcgacaacgg cctacttctt cacattcacc gagcaatgca tgcagttatt 1321 gatagacaga aaaatcatgg tatccatttc cgtgtattag ctaaagcatt gcgcatgtcg 1381 gggggagatc atgtccactc cggtacagta gtaggtaagt tagaagggga acgtgaaata 1441 actttaggtt ttgttgattt attgcgcgat gattttattg aaaaagatcg atctcgcggt 1501 gtctttttca ctcaggactg ggtatccatg ccaggtgtta taccggtggc ttcagggggt 1561 attcatgttt ggcatatgcc agctctgacc gaaatctttg gagacgattc tgtattacaa 1621 tttggtggag gaactttagg acatccttgg ggaaatgcac ctggtgcagt agctaatcgt 1681 gtggctttag aagcctgtgt acaagctcgt aacgaagggc gtgatcttgc tcgtgaaggt 1741 aatgaaatta tcaaagcagc ttgcaaatgg agtcctgaac tagccgcagc ttgtgaagta 1801 tggaaggcga tcaaatttga gttcgagccg gtggataccg tagataaggt ctaaataaaa 1861 taaaaaagaa gtaaaataga aagagaaaaa ataagttacg aaatgcagta attcttcttt 1921 attcttctaa ttgattgcaa ttaaattcgg ctcaatcttt taaaagattg agccgaattt 1981 aaatagatct // LOCUS RATPGKXL 1675 bp ss-mRNA ROD 07-FEB-1990 DEFINITION Rat X-chromosome linked phosphoglycerate kinase mRNA, complete cds. ACCESSION M31788 KEYWORDS phosphoglycerate kinase. SOURCE Rat liver, cDNA to mRNA, clones 6, 15, and 16. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1675) AUTHORS Ciccarese,S., Tommasi,S. and Vonghia,G. TITLE Cloning and cDNA sequence of the rat X-chromosome linked phosphoglycerate kinase JOURNAL Biochem. Biophys. Res. Commun. 165, 1337-1344 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 41 1294 phosphoglycerate kinase BASE COUNT 426 a 359 c 459 g 431 t ORIGIN 289 bp upstream of PstI site. 1 ggcctctctc cgggcctctc gacctcacgg tgttgccaaa atgtcgcttt ctaacaagct 61 gactttggac aagctggacg tgaaggggaa gcgggtcgtg atgagggtgg acttcaatgt 121 tcctatgaag aacaaccaga taacgaataa ccaaaggatc aaggctgctg tcccaagcat 181 caaattctgc ttggacaatg gagccaactc ggttgtgctt atgagccacc tgggccgtcc 241 tgatggtgtg cccatgcccg acaagtactc cttagagcca gttgctgcag aactcaaatc 301 tctgctgggc aaggatgttc tgttcttgaa ggattgtgtg ggctcagaag tagagaatgc 361 ctgtgccaac ccagcggctg ggactgtcat cctcctggag aacctccgct ttcatgtaga 421 ggaagaaggg aagggaaaag atgcttctgg gaacaaggtt aaagctgagc cagctaaaat 481 tgatgctttc cgagcctccc tgtccaaact tggagatgtc tatgtcaatg atgcttttgg 541 gactgcacac agagcccaca gttccatggt gggtgtgaat ctgccacaga aggctggtgg 601 atttttgatg aagaaggagc tgaactactt tgccaaggct ttggagagtc cagagcgacc 661 cttcctggct atcttgggag gagctaaagt tgcagacaag atccagctga tcaataatat 721 gctagacaaa gtcaatgaga tgatcatcgg tgggggaatg gcttttacct tccttaaggt 781 gctcaacaac atggagattg gcacatctct gtatgatgaa gagggagcca agattgtcaa 841 agatctcatg accaaagctg agaaaaatgg tgtgaagatt accttgcctg ttgactttgt 901 cactgctgac aaatttgatg agaatgcaaa gactggccaa gctactgtgg cctctggtat 961 acctgctggc tggatgggct tggactgtgg tactgagagc agtaagaaat atgctgaggc 1021 tgtggctcga gctaagcaga ttgtttggaa cggccctgtt ggggtatttg aatgggaagc 1081 atttgccagg ggaaccaagt ccctcatgga tgaggtggtg aaagccacgt ctaggggctg 1141 catcactatc ataggtggtg gagacaccgc cacttgctgt gccaaatgga acacagagga 1201 taaagtcagc catgtgagca ctgggggcgg cgccagtttg gagctcctgg aaggtaaagt 1261 ccttcctggg gtggatgctc tcagcaatgt ttagtatttt cctgcctttt ggttcctgtg 1321 cacagcccct aagtcgactt agtgttttcc gcatctccat ttggtgttag tgcagctagt 1381 ggccaagacg agcaccagga acctaagcag cagctgcaca gcatctcagc tcgtctttac 1441 tgcatgggat tcattatagt taagtccatt taatttctta gtgactaaac cttgcattaa 1501 ggcatatatt tatactctgc ctgtgaaagg aagtgagctg taaaagctta gctctcttcg 1561 ctgtatgtag cctctggtta gcctgtcact gttcatgaca gcatggaaat aacgatgagg 1621 attccagctg taggtttggg aggttgatga tattgaacca ataaaagatc ccact // LOCUS RATQRE1 504 bp ds-DNA ROD 20-FEB-1990 DEFINITION Rat liver NAD(P)H:quinone reductase gene, exon 1. ACCESSION M31801 KEYWORDS NAD(P)H:quinone reductase. SEGMENT 1 of 5 SOURCE Rat cell line H5-6 DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 504) AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B. TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone reductase gene expression by planar aromatic compounds and determination of the exon structure of the quinone reductase structural gene JOURNAL J. Biol. Chem. 264, 21793-21797 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 325 + 331 NAD(P)H:quinone reductase, exon 1 pre-msg 206 > 504 NAD(P)H:quinone reductase mRNA and introns IVS 332 > 504 NAD(P)H:quinone reductase intron A BASE COUNT 112 a 150 c 127 g 115 t ORIGIN 1 actctcaagc tcccagaaaa cccttttgac ttccatatac agaagtccta ggccagccct 61 aaactgcttc tccctgccaa taacttggta tcctcccccc agcgcctctg ggctcggcaa 121 tccagccccg ccctcgctgg ctgccctgca cagtgggctg ggccggaaaa gcaagatata 181 aagcctgaaa gtgctcagta cagctcgcac tagcctaggc tgtggcacgc aggatctttc 241 cgaagcattt cagggtcgtc ctggcaacca gctgctcagc caatcagcgc ttgacactac 301 gatccgcccc caacttctgg agccatggcg ggtgagtatg gctccaactc cagcctaatt 361 catcctgagg aggatgtagg ggcttgctat ggggtttgtt ccttgcctcg aagttgaaaa 421 gtgtagagat taggatcctg gatgagcctc ggtgagtccc cggaaggaga gcttcttctc 481 agaaccatag gtgcagatta ttct // LOCUS RATQRE2 634 bp ds-DNA ROD 20-FEB-1990 DEFINITION Rat liver NAD(P)H:quinone reductase gene, exons 2 and 3. ACCESSION M31802 KEYWORDS NAD(P)H:quinone reductase. SEGMENT 2 of 5 SOURCE Rat liver cell line H5-6 DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 634) AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B. TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone reductase gene expression by planar aromatic compounds and determination of the exon structure of the quinone reductase structural gene JOURNAL J. Biol. Chem. 264, 21793-21797 (1989) STANDARD simple staff_entry FEATURES from to/span description pept + 167 331 NAD(P)H:quinone reductase, exon 2 402 + 532 NAD(P)H:quinone reductase, exon 3 pre-msg < 1 > 634 NAD(P)H:quinone reductase mRNA and introns IVS < 1 166 NAD(P)H:quinone reductase intron A IVS 332 > 401 NAD(P)H:quinone reductase intron B IVS 533 > 634 NAD(P)H:quinone reductase intron C BASE COUNT 156 a 158 c 161 g 159 t ORIGIN Undtermined number of bp after segment 1. 1 ctttagattt tcagcttaaa gacgtttagt tgataatgca gccactgttt cccgctctga 61 gaaactaagg tggggaacgt gtctggtccc aagcactttt agattaggga ctcacccgtc 121 ctgtttggat tttctttcct cacctcctca cgtacgcctt aaacagtgag aagagccctg 181 attgtattgg cccacgcaga gaggacatca ttcaactatg ccatgaagga ggctgctgtg 241 gaggctctga agaagaaagg atgggaggtg gtcgaatctg acctctatgc tatgaacttt 301 aaccccctca tttccagaaa cgacatcaca ggtaagaatc gtctccctcc actgacagtg 361 gaccacgtga cccagcctca gcccctcttg cctcccaaca ggggagccga aggactcgga 421 gaactttcag taccctgttg agtcatctct ggcgtataag gaaggccgcc tgagcccgga 481 tattgtagct gaacagaaaa agctggaagc tgcagacctg gtgatatttc aggtacctca 541 taggacggaa gtcatgagag cagtgtctag ggtgtctgtc cttattgtcc ctagagccct 601 ctgcattcta agtggtgtca gaaagtgtgc ggct // LOCUS RATQRE3 248 bp ds-DNA ROD 20-FEB-1990 DEFINITION Rat liver NAD(P)H:quinone reductase gene, exon 4. ACCESSION M31803 KEYWORDS NAD(P)H:quinone reductase. SEGMENT 3 of 5 SOURCE Rat liver cell line H5-6 DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 248) AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B. TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone reductase gene expression by planar aromatic compounds and determination of the exon structure of the quinone reductase structural gene JOURNAL J. Biol. Chem. 264, 21793-21797 (1989) STANDARD simple staff_entry FEATURES from to/span description pept + 27 + 140 NAD(P)H:quinone reductase, exon 4 pre-msg < 1 > 248 NAD(P)H:quinone reductase mRNA and introns IVS < 1 26 NAD(P)H:quinone reductase intron C IVS 141 > 248 NAD(P)H:quinone reductase intron D BASE COUNT 48 a 52 c 74 g 74 t ORIGIN Undtermined number of bp after segment 2. 1 tgacctcttg gtcttctgtc ccacagttcc cattgtattg gtttggggtg cccgccattc 61 tgaaaggctg gtttgagaga gtgcttgtag caggattcgc ctacacgtat gccaccatgt 121 atgacaaggg tcctttccag gtaggtgaac tgttgggtat ttcctgagta gattctgccc 181 tgacatcaca gatggactct taagaccggg ttggggtggg gttggggggt gcactgtaag 241 cttataaa // LOCUS RATQRE4 199 bp ds-DNA ROD 20-FEB-1990 DEFINITION Rat liver NAD(P)H:quinone reductase gene, exon 5. ACCESSION M31804 KEYWORDS NAD(P)H:quinone reductase. SEGMENT 4 of 5 SOURCE Rat liver cell line H5-6 DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 199) AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B. TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone reductase gene expression by planar aromatic compounds and determination of the exon structure of the quinone reductase structural gene JOURNAL J. Biol. Chem. 264, 21793-21797 (1989) STANDARD simple staff_entry FEATURES from to/span description pept + 63 + 164 NAD(P)H:quinone reductase, exon 5 pre-msg < 1 > 199 NAD(P)H:quinone reductase mRNA and introns IVS < 1 62 NAD(P)H:quinone reductase intron D IVS 165 199 NAD(P)H:quinone reductase intron E BASE COUNT 41 a 55 c 53 g 50 t ORIGIN Undetermined number of bp after segment 3. 1 cccccttcat taactcacgg ggaccctgga gaggagcggg tgatacttgt tttctctttc 61 agaataagaa gaccttgctt tccatcacca ccgggggcag cggctccatg tactctctgc 121 agggtgtcca cggggacatg aacgtcattc tctggccaat tcaggtagct cctcacgggg 181 aacttccaga ggagtttgt // LOCUS RATQRE5 964 bp ds-DNA ROD 20-FEB-1990 DEFINITION Rat liver NAD(P)H:quinone reductase gene, exon 6. ACCESSION M31805 KEYWORDS NAD(P)H:quinone reductase. SEGMENT 5 of 5 SOURCE Rat liver cell line H5-6 DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 964) AUTHORS Bayney,R.M., Morton,M.R., Favreau,L.V. and Pickett,C.B. TITLE Rat liver NAD(P)H:quinone reductase: Regulation of quinone reductase gene expression by planar aromatic compounds and determination of the exon structure of the quinone reductase structural gene JOURNAL J. Biol. Chem. 264, 21793-21797 (1989) STANDARD simple staff_entry FEATURES from to/span description pept + 57 362 NAD(P)H:quinone reductase, exon 6 pre-msg < 1 > 964 NAD(P)H:quinone reductase mRNA and introns IVS < 1 56 NAD(P)H:quinone reductase intron E site 727 757 region of dyad symmetry BASE COUNT 222 a 229 c 240 g 273 t ORIGIN Undetermined number of bp after segment 4. 1 tgcttacagt gcagactcac tcccgtgcct tacctctgtg ttctgtgtct ctccagagtg 61 gcattctgcg cttctgtggc ttccaggtct tagaacctca actggtgtac agcattggcc 121 acaccccacc cgatgcccgc gtgcaggtcc tggaagggtg gaagaagcgt ctggagactg 181 tctgggagga gtcaccactc tactttgctc caagcagctt gtttgaccta aacttccagg 241 caggattctt actgaaaaaa gaggttcaag aggagcagaa aaagaacaag tttggccttt 301 ctgtgggcca tcatttgggc aagtccattc cagccgacaa ccagatcaaa gctagaaaat 361 aaggttttcc ataccatgta gttagaccca ggtttctttt tccccagttt gcctggcttg 421 ctttcagttt tcgcctttgt tccacaagga taggaaaagg aggaggctcg cctcatgcgt 481 ttttggatag ttctgccacg gcgtgtgaca gcaaaatgaa cgaggtcaga ttaggggcct 541 caggtggcct gggatatgaa tcagggagag gtgtagccgc gagcggggaa aatactcttc 601 taggtctttt gtacactata agcttttttc ttcgggctag cctggctaaa tggcatccaa 661 tcctccaccc acttgttgct attagttacc tctctgtggt ttagggcagg agggaattgc 721 tcaaacaatg gctgagggac taacttgttt agcagttagc taaagcctgt ttatgatcca 781 tcctggtttc aattactgtg cagtgactga caagcctcgg gggattgctc tccagctctt 841 ctctgccttg tacatagcac acccaggtcc tgggaaatga atacaaaaac aggtctccgc 901 ctcattcttg ctattctttc tttgtgtgtg tgtgtgtgga aataaatgga tatttcacac 961 gtca //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS FPLTRAC 3227 bp ds-DNA BCT 05-FEB-1990 DEFINITION Plasmid F TraC protein (traC) gene, complete cds and TrbI protein (traI) gene, partial cds. ACCESSION M30936 KEYWORDS TraC protein; TrbI protein. SOURCE plasmid F DNA, clone PKAS2. ORGANISM Plasmid F Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 3227) AUTHORS Schandel,K.A., Maneewannakul,S., Vonder Haar,R.A., Ippen-Ihler,K. and Webster,R.E. TITLE Nucleotide sequence of the F plasmid gene traC and identification of its product JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Printed copy of sequence [1] kindly submitted by R.E.Webster, 20-DEC-1989 FEATURES from to/span description pept 268 2895 TraC protein (gtg start codon) pept 2892 > 3227 TrbI protein binding 256 261 Shine-Delgarno sequence BASE COUNT 785 a 827 c 900 g 715 t ORIGIN 1 gcatgcggaa atcctattcc ggaagcccgg cggaaaatat ttcccggtgt gacgttgtgc 61 gttgaatgtc aggcgtatca ggaaagacag agaaaacatt atgcataagt cagtcgcaga 121 acatagtgat ttaattccgg atgaacatga gtggatattc agaaaacaga aatctttgta 181 tatgcgccgg gaaatggcgc gttaattaca ggtattccct tcatggctac cgcatttctc 241 gctttatttt tcaactaagg aattcatgtg aataacccac ttgaggccgt cactcaggcg 301 gttaactccc tcgtcacagc actgaaactg cctgacgaat ccgcaaaggc caatgaagtt 361 ctgggcgaaa tgagcttccc gcagttcagc cgtctgctgc cgtaccgtga ttacaaccag 421 gaatccggtc tgttcatgaa tgacaccacg atgggcttta tgctggaagc cattcccatc 481 aatggggcga atgagtccat tgtggaggcc ctcgatcata tgctgcgcac caaactgccg 541 cgcggtattc cgttgtgtat ccatctgatg tccagtcagc tggttggtga caggattgaa 601 tacgggctgc gtgagttctc ctggtctggc gaacaggccg aacggtttaa cgcgattacc 661 cgggcctatt atatgaaagc ggcagcgaca cagtttccgc tgccggaggg aatgaatctg 721 cccctgaccc tgcgccatta ccgggtgttt atctcgtact gttctccctc gaagaaaaaa 781 agccgggccg acattctgga aatggaaaac ctggtgaaaa tcatccgggc gtcgttacag 841 ggggccagta tcaccacaca gacggtggat gcacaggcct ttatcgatat tgtcggggag 901 atgattaacc ataacccgga ttccctgtac ccgaaaagac gtcagctgga cccgtattct 961 gatctgaatt atcagtgtgt ggaggacagt tttgatctga aagttcgggc tgattacctg 1021 acactgggcc tgcgtgagaa cggcaggaac agcacggccc gcatcctgaa tttccatctg 1081 gcccgtaacc cggaaatcgc cttcctgtgg aacatggccg acaactacag caacctgctg 1141 aacccggaac tgtccatctc ctgtccgttc atcctgacgc tgactctggt ggtggaagac 1201 caggtgaaaa cccacagcga agccaacctg aagtacatgg acctggagaa aaagtcgaag 1261 acctcctatg ccaaatggtt tccgtccgtg gagaaagagg cgaaggagtg gggggaactg 1321 cgtcagcggc tgggctccgg tcagtcctct gtcgtgtcct acttcctcaa catcacagcc 1381 ttctgcaagg acaataatga aacggcactg gaagtggagc aggacatcct gaacagcttc 1441 cgtaaaaacg gttttgagct gatttcaccg cgctttaacc acatgcgcaa tttcctgacc 1501 tgtctgccct ttatggccgg gaaagggttg tttaaacagc tgaaagaggc cggagtggta 1561 cagcgcgcag agagctttaa tgtggccaac ctgatgccgt tagtggcgga taaccccctg 1621 acaccggcag gtctgctggc acccacctac cgtaaccagc tggcgtttat cgatattttc 1681 ttccggggga tgaataacac caactataac atggcggtct gtggcacctc cggggccggt 1741 aaaaccgggc tgatacagcc acttatccgc agcgtgctgg actccggagg ctttgccgtg 1801 gtgttcgaca tgggggatgg atacaagtcc ctgtgtgaga acatgggcgg ggtgtatctg 1861 gacggtgaaa ccctgcgttt taacccgttt gcgaacatca ccgatattga ccagtcagcg 1921 gagcgtgtcc gtgaccagtt gtcagtgatg gccagcccca acggtaacct ggatgaagtg 1981 catgaaggtc tgctgttgca ggcggtcaga gcctcctggc tggccaaaga gaacagagca 2041 cgtattgatg acgtggtgga tttcctgaaa aacgccagtg acagcgagca gtatgccgag 2101 tcaccgacta tccgcagccg tctggacgaa atgattgtgc tgcttgacca gtacactgcc 2161 aacggcactt acggccagta ttttaactct gatgagccgt ccctgcggga tgacgcaaaa 2221 atggtggtgc tggagctggg cggactggaa gaccgtccgt cactgctggt tgcggtgatg 2281 ttctccctga ttatctacat cgagaacagg atgtaccgca cgccgcgtaa cctcaagaaa 2341 ctgaacgtta ttgatgaagg ctggcgtctg ctggacttca aaaatcacaa ggtcggtgaa 2401 tttattgaga aaggctaccg tacggcccgc cgtcataccg gtgcctatat caccatcaca 2461 cagaacatcg tcgactttga ctctgataag gcctccagtg ctgcccgcgc ggcatggggt 2521 aactcctcct acaaaattat cctcaaacag agtgcgaagg agttcgcgaa atacaaccaa 2581 ctgtatccgg accagttcct gccactgcag cgcgacatga ttggtaagtt tggtgcggcc 2641 aaagaccagt ggttcagttc cttcctgctg caggtggaaa accattcctc ctggcaccgt 2701 ctgtttgtgg acccgttaag ccgcgccatg tacagctctg acggcccgga ttttgagttt 2761 gtgcagcaga agcgtaaaga ggggctgagt attcatgagg cagtgtggca gctggcgtgg 2821 aagaagtcag ggccggaaat ggcttcgctg gaagcctggc tggaagaaca tgagaaatac 2881 aggagtgtgg catgagttca acgcagaaac ccgccgacgt cacggcagaa cgacgcagcc 2941 actggtggtg gacggtgccg ggatgcctgg caatggtgtt actgaacgca gccgtcagtt 3001 acggcattgt cagactgaac gcaccggtga ctgtcgcctt caacatgaaa cagaccgtgg 3061 atgcgttttt tgacagcgcc agtcagaaac aactgtcaga agcgcagtca aaagcccttt 3121 cagcgcgttt taacacggca ctggaagcca gcctgcaggc gtggcagcag aaacaccacg 3181 cggtcattct ggtgtcgcct gctgtggtac agggggcacc ggatatc //
GenBank-Updates@genbank.bio.net (04/10/90)
LOCUS DDIGP24 3000 bp ds-DNA INV 12-SEP-1989 DEFINITION D.discoideum glycoprotein 24 A and B (GP24A and GP24B) genes, complete cds. ACCESSION M27588 KEYWORDS adhesion protein; glycoprotein. SOURCE D.discoideum DNA, clones DG[2,3]. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 3000) AUTHORS Loomis,W.F. and Fuller,D.L. TITLE A pair of tandemly repeated genes code for gp24, a putative adhesion protein of Dictyostelium discoideum JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 886-890 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.F.Loomis, 08-SEP-1989, for release after publication. FEATURES from to/span description pept 1072 1132 glycoprotein 24A (GP24A), exon 1 1234 1481 glycoprotein 24A (GP24A), exon 2 pept 2585 2645 glycoprotein 24B (GP24B), exon 1 2747 2997 glycoprotein 24B (GP24B), exon 2 pre-msg 1023 > 2997 GP24A mRNA and intron pre-msg 2533 > 2997 GP24B mRNA and intron IVS 1133 1233 GP24A intron A IVS 2646 2746 GP24B intron A signal 812 817 GP24A G box signal 822 826 GP24A G box signal 994 999 GP24A TATA box signal 2334 2339 GP24B G box signal 2342 2346 GP24B G box signal 2504 2509 GP24B TATA box BASE COUNT 1123 a 285 c 323 g 1269 t ORIGIN 1 actccttttg atttggttgg cgttaatgaa tttgaatttt taattaattc tgattttaat 61 tttaatgatt cctgagaaat aaaatattag tattataatt attataatta ttattattac 121 tataatatat aaatataaat aatattaata aaaatatttt acaatattct ctatctcttc 181 taaaatttgt ttaattggtt tctttaataa attcgttaaa ttatatattt gattttgata 241 aagaggttga atattctcac tattttgaga ttttactact ctagtaattc tgtgacctga 301 aatgatgtaa tatttatacg cttttgtgat cctaatggta ttgttgatgt tattgttgtt 361 gttgttgttg ttgttgttgt tgttgttgtt gttgtcgata atgatggtgg attcgatttc 421 ttttcaaatg agatcgaatt tgttttgaat cttttattgc tattatcatt actattatta 481 tctacattct catcattaat tatatcattt gtatcactta agtttctttt tgtacctaac 541 atgtttttat tatttgttgt atctattaca acatttgaaa aaaataaagt tgaataaaaa 601 aaaaaaaaaa aaaaaaaaaa gactttattt tttaatttgg gcgccaattc aaaaaaaaaa 661 aaaaaaaaaa aattaatttg ggaaatacag tttttttttt ttatgcccac catttgacat 721 agataacctt ttataaataa taataaaacc tctaatttaa actgtttata aaaaaatggt 781 ggaatgataa aaaaaaaaaa actgttttga ttgtggggga ggggtgaaga tgattgtcat 841 tttattttgt tttatttatt attttttttt ttaatgggga tacaagtgag atataaatta 901 caataattta ttattaaaat aaaacaatta attttttttt tctttttttt ttatttttta 961 tttttttctt tttatttttt ttttttaatt atttaataaa attttttttc ctcattcaat 1021 ttaaatataa ttactcttaa aaataaataa ataataataa taaattttaa aatggttgat 1081 ttgaaaatta cattagttaa tgaagatggg gaatcaacaa taagcggtaa aggtaattgg 1141 aataattaat ttctaaatat ttaattttat ttatatatat ttaaaaatct aacatatcat 1201 ttcatttcat tctggctctt tttaaaaata taggtcatcc attaccagct cccttaatat 1261 ttcctccaat atatattttc cgttttaccc aataccaaac tgaaggtaaa ctttgggaca 1321 aaaatgaatt tcaaataaaa tcaggtaaaa ttgaattcga cggagaagaa tatgatattc 1381 ctgaatctaa aggcacatgg agtaaagatg atgaggaaaa cgcaatcgat gttaatttac 1441 atctatttag accaccagaa aaatttttcc caaaaaatta aagatcttaa aacagatctg 1501 atacaatgaa aacaaagaaa taaaaaattt cattaatctt attttttcat catatataat 1561 ttctatgttt ttaaaattta attatctgac atttaaataa ataagaaaca tatatgatgt 1621 aaatgtatta ggtggttttg gttttcgttt gttcaattga ggaaacataa gataaattta 1681 tctgacattt aaataaataa gaaacatata tgatgcaaat gtatttggag ttttgaaatt 1741 aagaaattca acatcatatt aggtttaata atttacattt ctctgaaatg ggagatatta 1801 ttcatcatat agttatacca aacattcagt tattgcaatt acattctttc cacaaaaagt 1861 tttatttata atttgtttta tttgatttta attttgcaga tttgttgtag tgagttttca 1921 cactaaattt tttttttttt tttttttaaa ttaaaaacaa ttaatttaac caaaacaata 1981 aaaaaaacta tttataaaca gttatttgat tgttttaaga ttaaataaca atttacttta 2041 aaatttgcgc caaatttttt ttttttaatt attttttttt ttttctatta tttttttttt 2101 tttttttgct agtcagattt tagcaaattt tagaaatcaa attcaatgat tggtaaaaat 2161 caagattttc aatcttttaa taatatatta cagcatcaac aacactcacc aaattaccat 2221 atcacaaatt attgttatca gtttaataat tgacatagaa taaacttttt ttaaataata 2281 ataaaacctc taatttaaac tgtttataaa aaaaaaaaaa aaactgtttt gattgtggga 2341 ggggtgaaga tgattgtcat tttattttat tttatttatt attttttttt aatggggata 2401 caagtgagat ataaattaca ataatttatt attaaaaaaa acaatttttt tatttttttt 2461 tttttatttt tttttttctt ttttttattt ttttttaatt atttaataaa attttttttc 2521 aacattcaat ttaaatataa ttactcttaa aaataaataa ataataataa taataaattt 2581 taaaatgact gatttgaaaa ttacattagt taatgaagat ggggaatcaa caataagtgg 2641 taaaggtaat tggaataatt aatttctaaa tatttaattt tatttatata tatttaaaaa 2701 tctaacatat catttcattt tattctggat tttttataaa atataggtca tccattacca 2761 gctcccctaa tatttcctcc aatatattgt ttctgtttta tccaatataa aactgaaggt 2821 aaactttggg acaaaaatga ttttcaaata aaatcaggta aaattgaatt cggcggagaa 2881 gaatatgata ttactgaatc taaaggcaca tggagtaaag atgatgaaga aaaccacatc 2941 aaagttagtt tacatctaat tgtaccacca aaaaaaattt tccaaaaaaa tttttagatc //
GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS BSUHEMAC 3795 bp ds-DNA BCT 15-FEB-1990 DEFINITION B.subtilis delta-aminolevulinate synthase (hemA) uroporphyrinogen I synthase (hemC) genes, complete cds. ACCESSION M32130 KEYWORDS delta-aminolevulinate synthase; uroporphyrinogen I synthase. SOURCE B.subtilis DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3795) AUTHORS Petricek,M., Rutberg,L., Schroeder,I. and Hederstedt, TITLE Cloning and characterization of the hemA region of the Bacillus subtilis chromosome JOURNAL Unpublished (1990) Univ. of Lund, Sweden STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by I.C.Schroeder, 16-FEB-1990. FEATURES from to/span description pept 232 1599 delta-aminolevulinate synthase (hemA, EC 2.3.1.37) pept 1607 2437 ORF2 pept 2470 3414 uroporphyrinogen I synthase (hemC, EC 4.3.1.8) BASE COUNT 1068 a 689 c 953 g 1085 t ORIGIN 1 atgcatatca ccttcttgtt ttttagagct gatgtgtagt aaatttctgc tgtttttggt 61 attgtcaata ggaatgcttc ttttccctga agctttttct aatatagcat aagaatttta 121 aaatctgttc acattttgtg aaagaaacta tgttataatt attataaata atgagttcta 181 tgttagaatg attataaatt aagattgggt gttgggggtg taattagagc gatgcatata 241 cttgttgtgg gagtagatta taaatccgcc cctattgaga tacgtgaaaa agtaagtttt 301 cagccgaatg agctggcaga agcaatggtg cagctgaaag aagagaaaag cattcttgaa 361 aacatcattg tctcaacctg caaccgcact gaaatttatg cggtagtcga ccagcttcat 421 accggccgtt attatataaa aaagttttta gctgattggt ttcaattaag caaagaagag 481 ctgtcaccgt tcttaacgtt ttatgagagc gatgccgctg ttgagcattt attccgtgta 541 gcctgcggac ttgattctat ggtgattggc gaaacgcaga ttctcggaca ggtacgcgac 601 agctttaaaa cagctcagca agaaaaaacg atcgggacta tttttaatga gctgtttaag 661 caggcagtta cagtgggcaa acggactcac gccgaaacag acattggctc aaatgcggtg 721 tcagtaagct atgctgcagt tgaacttgcc aaaaaaatct tcggaaatct ttcaagcaag 781 cacatattga ttctcggtgc gggaaaaatg ggcgagcttg ctgcggaaaa cctgcacgga 841 cagggaatcg gcaaggtcac tgtcattaac cgaacatact tgaaagcgaa ggagcttgca 901 gaccgttttt caggtgaagc gagaagcttg aatcagcttg aaagcgcgct tgcggaggct 961 gatattttaa tcagttcaac cggtgcaagt gaatttgtcg tgtccaaaga gatgatggaa 1021 aacgcgaata agcttcgcaa gggacgtccg ctgtttatgg tcgacattgc cgtgcctaga 1081 gatcttgatc cggcgctgaa tgatcttgaa ggtgtttttc tttatgatat cgacgatctg 1141 gaaggcattg tagaagcgaa catgaaagag cggagagaaa cagctgaaaa agttgaactg 1201 ttaattgaag aaaccattgt ggaatttaaa caatggatga atacacttgg tgttgtgcct 1261 gttatttctg cattgcgcga aaaggcgctt gccatccagt cagaaacgat ggacagcatt 1321 gagcgtaagc tgcctcactt aagcacaaga gagaaaaaac tgttgaacaa acacaccaaa 1381 agtattatta accaaatgct tcgtgatccg attttaaagg tgaaagagct tgcggcagat 1441 gctgattctg aagaaaagct cgcgttgttt atgcagattt ttgatattga agaagctgcg 1501 ggccgtcaaa tgatgaaaac cgttgaaagc agccagaagg tccactcttt taagaaggct 1561 gaatcaaaag cgggctttag cccacttgta agtgagtgaa agctgaatga ttgatactgc 1621 aatggcaaga cttaatgagg ggacaatcgt catttacgcg ttaagtgtac tcttttattt 1681 tatagatttt cttcaacaca accggaaggc tggaaaaatg gccttctggt tgctttctat 1741 tgtctggact ctgcaaaccg tgtatttggc ctattttatg tgggtgacgg ggcggtttcc 1801 ggtattaaat gtgacagagg cactttattt ttatgcctgg gtgcttgtca cgctgtcact 1861 tgtactgaca aagcttttac gtgttgactt tatcgtgttt tttacaaatg ttataggatt 1921 ttctatgatc gccattcaca cattttcacc gacagagcag cagtcagctg ctttttccgg 1981 gcagcttgta tccgagcttt tggtgattca tattacaatg gcgattcttt catacggcgc 2041 tttttccctt tctttcgttt tttctgtgct atatatgttt caatatcatg tgctgaaaaa 2101 gaaaaagtgg ggaaaatggc tgttgagaat agaagattta tctaagcttg attatatggc 2161 gtatgtttta aatgtcattg gggttccgat gctgctgctg agtttgattc tcggcgtcat 2221 ttgggcgtat gtctcactag aaacgctgta ttggtttgac gccaaagtgc ttggttcgtt 2281 tgtcgtcctg ctgctgtaca gctattatct ttatatcagg ctgattaagg agctgcaagg 2341 aaaggtcgct gcactgtgga atacggcttg ttttctggtg ctgatgatca attatttcct 2401 gcttggaagc ctgtcgcaat tccattggtt cagttaaacg atgtcccaag cagattcggg 2461 aggaaagaaa tgatgagaac gattaaagta ggttccagac ggagcaaact cgctatgact 2521 caaacaaaat gggttattca aaaactgaag gaaatcaatc cttcgtttgc ttttgaaatt 2581 aaagagatcg tgacaaaggg cgaccggatt gtcgatgtta cactctcaaa agtgggtgga 2641 aaagggcttt ttgtcaaaga aattgaacag gcgcttttaa acgaagagat tgatatggca 2701 gtgcacagca tgaaggacat gcctgctgtt ttgcctgaag gccttgtgat cggctgtatt 2761 cctgaacggg aggacccgcg tgatgccctt atttcaaaga atcgcgtaaa gctttcagaa 2821 atgaagaaag gtgctgtcat tggcacaagc agtttaagaa gaagcgcgca gcttttgatt 2881 gagcgccctg accttacaat taaatggatt agaggtaata ttgatacaag acttcaaaag 2941 ctggaaacag aggattatga cgcaattatt ttagcggctg ccggcctttc cagaatgggt 3001 tggaagcaag atgtcgtaac cgaattcctt gagcctgagc gctgtttgcc tgctgtgggg 3061 cagggagccc tggcgattga gtgccgagaa tcggatgaag agctgttggc gttgttttct 3121 cagtttacag atgaatatac aaaacggact gtcttagcgg aacgtgcttt tttaaacgcg 3181 atggagggcg gctgccaggt tccgatcgcg ggctactccg tgttaaatgg acaggatgaa 3241 attgaaatga caggtcttgt cgcttcacct gacggcaaaa tcatttttaa agaaaccgtc 3301 accggaaacg atccggagga agtaggaaag cgctgtgccg ctcttatggc tgacaaagga 3361 gcaaaagatt taattgatcg tgtaaaacgg gagcttgacg aggatggaaa atgattttcc 3421 gttgaaagga aaaacagtgc ttgtcacccg gaataaggca caggcagcat catttcagca 3481 aaaagtggag gcgcttggcg gtaaagcggt tttaacctct ttgattacgt ttcgccgcgc 3541 tttgccgaat gatgttgcgg aacaggtaag agaggatctt gccgcgccag gctggcttgt 3601 ttttacaagt gtgaacgggg cagacttctt tttttcttat ctgaaggaaa atcagcttat 3661 tctccctgcg cataaaaaaa ttgcagccgt cggtgaaaaa accgcgcgcc gtttaaaaat 3721 gcataacgta tcggttgatg tgatgccaca ggagtatatt gctgaacaat tgcgtgacgc 3781 tcttaagcag catgc // LOCUS ECOAFR1 1476 bp ds-DNA BCT 15-FEB-1990 DEFINITION E.coli AF/R1 major pili subunit (afrA) gene, complete cds. ACCESSION M32083 KEYWORDS afrA gene; major pili subunit; surface antigen. SOURCE E.coli (strain RDEC-1) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1127) AUTHORS Wolf,M.K. and Boedeker,E.C. TITLE Cloning of the genes for AF/R1 pili from rabbit enteroadherent Escherichia coli RDEC-1 and DNA sequence of the major structural subunit JOURNAL Infect. Immun. (1990) In press STANDARD full staff_entry REFERENCE 2 (bases 1 to 1476) AUTHORS Wolf,M.K. and Boedeker,E.C. JOURNAL Unpublished (1990) Walter reed Army Inst. of Res. Washington DC STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.K. Wolf, 13-FEB-1990. FEATURES from to/span description pept 627 1115 AF/R1 major pili subunit (afrA) site 349 390 region of dyad symmetry signal 493 498 pot. -35 region signal 515 520 pot. -10 region binding 612 616 pot. ribosome binding site BASE COUNT 447 a 278 c 287 g 462 t 2 others ORIGIN 1 bp upstream of EcoRI site. 1 gaattcccta gtgaatgtct gctgggaatc ataaaacaat ctttctgata tatccacaat 61 ttttaggttg gtaaatctta aaagaatagc cgctcgcgtt atcctgctta attgaatgta 121 tttacctaaa gtaacaccta tgttttcttt aaacagtaat tgcagatacc gtctgctgta 181 tccggagtaa tcaacgaggg catttatatc tatagatata ctctctaaat tatcatcaat 241 gtactgtgtt atcgcgttta tcgtaagtgt tttcagcatg tacgtagctc ctatatgtat 301 gtttacgtgt taccccacat catgttaata aaaccccttc tgttttttta gctgattgtg 361 cattgtacac ataccgtgca caattagcta acaacgcaga ccaatatttt ttaaaatacc 421 ccgcgttttc acatgacttg tatctattct cttagagaaa ttaatgcatc tctatcacat 481 catgtgtagt actggacaaa tagtcatggg agcctattac cgaacagcga agatggcata 541 tgttttctta ttaagaaaga ggaaagaata tggcgcactc gttttatctc aattttggta 601 aaaaaaatat atggagaatg tcagaaatga aaaaaacatt tattgcgtct gtaattgtaa 661 taaccataaa tacgggatca gcaattgctg ctcaaggcga tgttcagttc tttggtaccg 721 ttactgcgaa gacctgtgat cttgtcgttg aacacgaggg ggctgtggtc aatatgattc 781 agttgggttc tgtaactaat ggtggaacta atgctggcac cgatatcgga gcaaacaaat 841 cgtttaccct gaagccagca tcaggggtga catgcaatac catcactact gctaaaatgg 901 catggtcttc tcctgcaatg accgttaatg gtattggtaa tctatcaggt aaggctattg 961 atgcccatgt gaagttagtg gcgattaaca gcacgggtaa agttcaaact gataccaacg 1021 cagataagga aattaaagcg ggtcaaaata cagttgatta ctcaattact ggttctggcc 1081 tactgatgaa ggctttaaat ttaaagctca gttaattggc ggtaccattc caggtgactt 1141 cgatagtgct gctgcatatt ccgttgcata caactaatat ttgaatgtaa atccgggaag 1201 cccctccctt cccggattta atatttagaa cagcatattt aactggtgcc cttaactttg 1261 cttaggtgtg aagaggttag cttatgaaat taaaaacatt tcctaaaata tctctactgg 1321 ccctgagtat atggtattct cactccagct tggctgatga acttaatctg gattttatac 1381 agaacgtcag cgttattcca tcaattctga aaagtgacgc aatttacccg gaaggacaat 1441 atatcgttga cgtaaccgta aataaagaac gtatdd // LOCUS ECOCYS 5755 bp ds-DNA BCT 14-FEB-1990 DEFINITION E.coli thiosulfate binding protein (cysP), sulfate permease (cysT, cysW, cysA) and o-acetylserine (thiol)-lyase-B (cysM) genes, complete cds. ACCESSION M32101 KEYWORDS cysA gene; cysM gene; cysP gene; cysT gene; cysW gene; o-acetylserine (thiol)-lyase-B; sulfate permease; thiosulfate binding protein. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1292 to 5755) AUTHORS Sirko,A., Hryniewicz,M., Hulamicka,D. and Boeck,A. TITLE Sulfate and thiosulfate transport in E.coli K12: Nucleotide sequence and expression of the cysTWAM gene cluster JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (bases 1 to 1291) AUTHORS Hryniewicz,M., Sirko,A., Palucha,A., Boeck,A. and Hulamicka,D. TITLE Sulfate and thiosulfate transport in E.coli K12: Identification of a gene encoding a novel protein involved in thiosulfate binding JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.Halanicka, 14-FEB-1990. FEATURES from to/span description pept 559 1575 thiosulfate binding protein (cysP) pept 1575 2408 sulfate permease (cysT) pept 2408 3283 sulfate permease (cysW) pept 3273 4370 sulfate permease (cysA) pept 4505 5416 o-acetylserine (thiol)-lyase-B (cysM; gtg start codon; EC 4.2.99.8) signal 516 521 -10 region site 5737 5755 right end of mu BASE COUNT 1259 a 1477 c 1682 g 1337 t ORIGIN 52 min on K12 map. 1 gttaacgcca tttgcccggg atacgtgcgc acaccaatgg cggaaagcat tgcccgccag 61 tcgaacccgg aagatccaga gtcggtgctg actgaaatgg cgaaagcaat cccgatgcgt 121 cgcctcgccg atccgctgga agtcggcgaa ctggcggcct tcctcgcatc ggatgaatcc 181 agctatttaa ccggtacaca gaatgtgatt gatggcggca gcacactgcc ggagacggtt 241 agcgtcggta tctgattcac ctctgtttcc tccctgcatt tgtggggagg atttcgtctt 301 gaactaagtt caccaggcta ttttatttgt cattttggcc ccgggcagtg ctcgaaatcc 361 tcacgtacta tgtgtacgct ccggtttctc cgcgctgttc gtgtccaaac tgactgcaac 421 aattacgcct gttgaaccaa gttcttattc ccttttcaac ttccaaatca ccaaacggta 481 tataaaaccg ttactccttt cacgtccgtt ataaatatga tggctattag aaagtcatta 541 aatttataag ggtgcgcaat ggccgttaac ttactgaaaa agaactcact cgcgctggtc 601 gcttctctgc tgctggcggg ccatgtacag gcaacggaac tgctgaacag ttcttatgac 661 gtctcccgcg agctgtttgc cgccctgaat ccgccgtttg agcaacaatg ggcaaaagat 721 aacggcggcg acaaactgac gataaaacaa tctcatgccg ggtcatcaaa acaggcgctg 781 gcgattttac agggcttaaa agccgacgtt gtcacttata accaggtgac cgacgtacaa 841 atcctgcacg ataaaggcaa gctgatcccg gccgactggc agtcgcgcct gccgaataat 901 agctcgccgt tctactccac catgggcttc ctggtgcgta agggtaaccc gaagaatatc 961 cacgattgga acgacctggt gcgctccgac gtgaagctga ttttcccgaa cccgaaaacg 1021 tcgggtaacg cgcgttatac ctatctggcg gcatggggcg cagcggataa agctgacggt 1081 ggtgacaaag gcaaaaccga acagtttatg acccagttcc tgaaaaacgt tgaagtgttc 1141 gatactggcg gtcgtggcgc gaccaccact tttgccgagc gcggcctggg cgatgtgctg 1201 attagcttcg aatcggaagt gaacaacatc cgtaaacagt atgaagcgca gggctttgaa 1261 gtggtgattc cgaaaaccaa cattctggcg gaattcccgg tggcgtgggt tgataaaaac 1321 gtgcaggcca acggtacgga aaaagccgcc aaagcctatc tgaactggct ctatagcccg 1381 caggcgcaaa ccatcatcac cgactattac taccgcgtga ataacccgga ggtgatggac 1441 aaactgaaag acaaattccc gcagaccgag ctgttccgcg tggaagacaa atttggctcc 1501 tggccggaag tgatgaaaac ccacttcacc agcggcggcg agttagacaa gctgttagcg 1561 gcggggcgta actgatgttt gctgtctcct ccagacgcgt gctgccgggc tttaccttaa 1621 gcctcggcac cagtctgctg tttgtgtgcc tgattttgct gctgccgctc tccgcgctgg 1681 tgatgcaact ggcccagatg agctgggcgc agtactggga ggtgatcacc aacccgcagg 1741 tggtcgcggc ctacaaagta acgctgctgt cggcgtttgt ggcatcgatt tttaacggcg 1801 ttttcggtct gctgatggcg tggatcctaa cccgctatcg cttcccaggc cgcacgctgc 1861 ttgatgcgct gatggattta ccctttgcgc tgccaacggc tgtcgccggt ttaacgctgg 1921 cctcgctctt ttccgtaaac ggtttttacg gtgaatggct ggcgaagttt gatatcaaag 1981 tcacctatac atggctgggg attgcggtgg ctatggcctt taccagcatt ccgtttgtgg 2041 tgcgtaccgt gcagccggtg ctggaagagt taggcccgga atatgaagaa gcggcggaaa 2101 cgcttggtgc aacgcgctgg cagagtttct gcaaagtggt gctgccggag ctttctccgg 2161 cgctggtggc gggcgtggcg ctgtcgttta cccgtagtct tggtgaattt ggcgcggtga 2221 tttttatcgc cggaaatatc gcgtggaaga cggaagtgac gtcgctgatg atttttgtgc 2281 gcttacagga gtttgattac ccggcagcga gcgcgattgc ttcggtgatc ctcgcggcat 2341 ctctgctgct gctgttctca attaacactc tgcaaagtcg ctttggtcgg cgtgtggtag 2401 gtcattaatg gcggaagtta cccaattgaa gcgttatgac gcgcgcccga ttaactgggg 2461 caaatggttt ctgattggca tcgggatgct ggtttcggcg ttcatcctgc tggtgccgat 2521 gatttacatc ttcgtgcagg cattcagcaa ggggctgatg ccggttttac agaatctggc 2581 cgatccggac atgctgcacg ccatctggct gacggtgatg atcgcgctga ttgccgtacc 2641 ggtaaacctg gtgttcggca ttctgctggc ctggctggtg acgcgcttta acttccctgg 2701 acgccagtta ctgctgacgc tactggacat tccgtttgcc gtatcgccgg tggttgccgg 2761 tctggtgtat ttgctgttct acggctctaa cggcccgctc ggcggttggc tcgacgagca 2821 taacctgcaa attatgttct cctggccggg aatggtgctg gtcaccatct tcgtgacgtg 2881 tccgtttgtg gtgcgcgaac tggtgccggt gatgttaagc cagggcagcc aggaagacga 2941 agcggcgatt ttgcttggcg cgtccggctg gcagatgttc cgtcgcgtca cattaccgaa 3001 catccgctgg gcgctgcttt atggcgtggt gttgaccaac gcccgcgcaa ttggcgagtt 3061 tggcgcggtg tcggtggttt ccggctcgat tcgcggcgaa accctgtcgc tgccgttaca 3121 gattgaattg ctggagcagg actacaacac cgtcggctcc tttaccgctg cggcgctgtt 3181 aacgctgatg gcgattatca ccctgttttt aaaaagtatg ttgcagtggc gcctggagaa 3241 tcaggaaaaa cgcgcacagc aggaggaaca tcatgagcat tgagattgcc aatattaaga 3301 agtcgtttgg tcgcacccag gtgctgaacg atatctcact ggatattcct tcaggtcaga 3361 tggtcgcgtt gctggggccg tccggttccg ggaaaaccac gctgctgcgc attatcgccg 3421 ggctggagca tcaaaccagc gggcatattc gcttccacgg caccgacgtg agccgcctgc 3481 acgcacgtga tcgtaaagtc ggtttcgtgt tccagcatta cgcgctgttc cgccatatga 3541 cggtgttcga caatatcgct tttggcctga cggtgctgcc gcgtcgcgag cgcccgaatg 3601 ccgcagccat caaagcgaaa gtgacaaaat tgctggaaat ggtccagctt gcccatctgg 3661 cggatcgtta tccggcgcac gtttccggcg gccagaaaca gcgcgtggcg ctggcgcgcg 3721 cgctggctgt ggaaccgcaa attctgctgc ttgatgaacc gtttggcgcg ctggatgcgc 3781 aggtgcgtaa agagctgcgt cgctggctgc gtcaactcca tgaagaacta aaattcacca 3841 gcgtttttgt gacccacgat caggaagaag cgaccgaagt agctgatcgt gtagttgtga 3901 tgagccaggg caatattgaa caggctgacg cgccggatca ggtatggcgc gaaccggcga 3961 cccgttttgt gctcgaattt atgggcgaag tgaaccgcct gcagggaacc attcgcggcg 4021 ggcagttcca tgttggcgcg catcgctggc cgctgggcta cacacctgcg tatcaggggc 4081 cggtggatct cttcctgcgc ccttgggaag tggatatcag ccgccgtacc agcctcgatt 4141 cgccgctgcc ggtacaggta ctggaagcca gcccgaaagg tcactacacc caattagtgg 4201 tgcagccgct ggggtggtac aacgaaccgc tgacggtcgt gatgcatggc gacgatgccc 4261 cgcagcgtgg cgagcgttta ttcgttggtc tgcaacatgc gcggctgtat aacggcgacg 4321 agcgtatcga aacccgcgat gaggaacttg ctctcgcaca aagcgcctga taggttgagt 4381 gaatgttaaa cgcccggagg cgcttcccgc gagtccgggc tttttaatgg caaggtttgt 4441 aacctgtaga cctgataaga cgcgcaagcg tcgcatcagg caacaccacg tatggataga 4501 gatcgtgagt acattagaac aaacaatagg caatacgcct ctggtgaagt tgcagcgaat 4561 ggggccggat aacggcagtg aagtgtggtt aaaactggaa ggcaataacc cggcaggttc 4621 ggtgaaagat cgtgcggcac tttcgatgat cgtcgaggcg gaaaagcgcg gggaaattaa 4681 accgggtgat gtcttaatcg aagccaccag tggtaacacc ggcattgcgc tggcaatgat 4741 tgccgcgctg aaaggctatc gcatgaaatt gctgatgccc gacaacatga gccaggaacg 4801 ccgtgcggcg atgcgtgctt atggtgcgga actgattctt gtcaccaaag agcagggcat 4861 ggaaggtgcg cgcgatctgg cgctggagat ggcgaatcgt ggcgaaggaa agctgctcga 4921 tcagttcaat aatcccgata acccttatgc gcattacacc accactgggc cggaaatctg 4981 gcagcaaacc ggcgggcgca tcactcattt tgtctccagc atggggacga ccggcactat 5041 caccggcgtc tcacgcttta tgcgcgaaca atccaaaccg gtgaccattg tcggcctgca 5101 accggaagag ggcagcagca ttcccggcat tcgccgctgg cctacggaat atctgccggg 5161 gattttcaac gcttctctgg tggatgaggt gctggatatt catcagcgcg atgcggaaaa 5221 caccatgcgc gaactggcgg tgcgggaagg aatattctgt ggcgtcagct ccggcggcgc 5281 ggttgccgga gcactgcggg tggcaaaagc taaccctgac gcggtggtgg tggcgatcat 5341 ctgcgatcgt ggcgatcgct acctttctac cggggtgttt ggggaagagc attttagcca 5401 gggggcgggg atttaaggat taatagcatc ggagactgat gacaaacgca aaactgcctg 5461 atgcgctacg cttatcaggc ctacaaggtt tctgcaatat attgaattag cacgattttg 5521 taggccggat aaggcgttta cgccgcatcc ggcataaaca aagcgcactt ttttaacagt 5581 tgttgctgcc gacaaatgca gtatttaatt ttcgtgagga aacgccgtaa ggtcattgaa 5641 gcggcgcacg aaaaaccgaa agcgtttcac gataaatgcg aaaactttac gtttcgcgct 5701 tcaaatgaaa cagatgtatt aattactact ttttattcat tacatgggga tccag // LOCUS HUMCOLA2I 1994 bp ds-DNA PRI 15-FEB-1990 DEFINITION Human collagen type I alpha-2 (COL1A2) gene, exon 1 (partial). ACCESSION M31886 KEYWORDS collagen. SOURCE Human DNA, clone pCOL-alpha-2-bGH. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1994) AUTHORS Sherwood,A.L., Bottenus,R.E., Martzen,M.R. and Bornstein,P. TITLE Structural and functional analysis of the first intron of the human alpha-2(I) collagen-encoding gene JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Bornstein, 02-FEB-1990. FEATURES from to/span description pre-msg < 1 > 1994 collagen alpha-2 type I mRNA and intron /nomgen="COL1A2" /map="7q21.3-q22.1" /hgml_locus_uid="LP0002V" IVS 156 > 1994 COL1A2 intron A binding 141 154 NF1 binding site binding 1034 1040 AP1 binding site binding 1061 1067 AP1 binding site site 1370 1409 gt-rich region BASE COUNT 580 a 413 c 456 g 545 t ORIGIN 1 gcatgcccgc gcccgccagg tgatacctcc gccggtgacc caggggctct gcgacacaag 61 gagtctgcat gtctaagtgc tagacatgct cagctttgtg gatacgcgga ctttgttgct 121 gcttgcagta accttatgcc tagcaacatg ccaatgtaag tgccttcagc ttgtttgggg 181 gagactgggt agagaggtta gatgggaggg caccctgccc tgaaaaggaa aacctgtaac 241 ctgaattcca ggtacacttg gagggcagac tctcaggcat gtgggaaaac gccggaattg 301 ataagaaaca tggaaattac tttaaaaaat gaaaacataa aagccttgcc aaaagttagg 361 gaacttttcc tctaagttca gagtgagaca gttaactcgg tctggctcct cagcttagta 421 acccccaaag ggagcggaag gtctttttcc ctaaggatga gatattaacg accaatgtgg 481 tggaggaagt caagggcctg caccccacag gccccataac cgcactgatg tccaccttgt 541 aaaacttgag gcctgcgtta gaaagccctt caactgagta atgtaaaact cacctcctaa 601 gagcttttat cttctgggca ttgtaaggct tgtccggagg aggaggatga cgatgctgat 661 atgatgatgg ttataaggcg ccctctggag gaaggaaaat gaaagtacag gggacagggc 721 cttaagcaga tggaatccca attaaagctt ctacggattt atacagatta atgatcagca 781 tttctggttg gagcctttcc cagtggctag tcagtgaacc ctggaaagaa gaatggatgc 841 tacttggagt gggtacattc tgaaaagtaa tataagtgtc tcaattcact ttctagtcat 901 ggaaatggta acatttttta actcaaatct gctctaaatt ttgtttgagc ctgagaatta 961 cccctttgac atgttcccag tgataagcaa acattatgaa cgcagcaagt tgagaaatat 1021 caacattgag atgagactca agagaccggg gtttttccca tgagtctgac accaatttgc 1081 tgcgtgactt tgggcaagtc aaacggcctt ttctaaaatg tgagacagag attaaaggga 1141 ccccaaggcc actttccagc tctaggttcc atggccagac tttcatgtca acagagaatg 1201 aagaagatca gtccgttttc atcttgaaaa tggctgccaa agtgctagac aaagatattg 1261 actagatggg ggatggtatt gtctgaccac acccagtact ccaaaaagtt gttccaccca 1321 cacagcacgg tgtctaccac tgcataattt ctaatgcatt tgtgtgcttg tgtgtgtgtg 1381 tgtgtgtgtg tgtctgtgtg tctgtgtgtc tcttccccct tcattcactt ttagtataca 1441 tactgtggat actaaggagt aattgcagtg aacaaattca cattaccgag ttcatatttt 1501 taatgagatc ttgagagtgg gaggaaagag tcggctccta gagaataaaa tgaaggcaga 1561 cttagggaaa tttgaaggta caaaggcaac ttaccttctg atcaacagcc aaccacagtc 1621 tggaataaat gttatcaaac acacattctt caaaatggtc cgtgtctgag taattaaaag 1681 gcaaatttcc aaaatcataa ggacttccgt taatcaagtc aggcataatt attcttccta 1741 ctgatgacac aatgaagtaa acatatcatt cttgtaattt aacagtaatt ctcgtaaatt 1801 gcccttaaat gtcagtgctg gatgtggtcc accctcctaa attgtgactg ttgcaacaga 1861 tgttctcact tcaaataacg cacttcttgg ccacctaatt aaagcaattt ttggggtgat 1921 tcatcctact gcaagcttgg ccacacttgt atcctgtatt aacctataat ttttgtaccg 1981 taggagaaga attc // LOCUS HUMP120PC 2612 bp ss-mRNA PRI 14-FEB-1990 DEFINITION Human proliferating-cell nucleolar protein P120 mRNA, complete cds. ACCESSION M32110 KEYWORDS proliferating cell nuclear protein. SOURCE Human fetal liver cell line CML and testis, cDNA to mRNA, and lymph node DNA (bases 1 to 30). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2612) AUTHORS Fonagy,A., Henning,D., Jhiang,S., Haidar,M., Busch,R.K., Larson,R., Valdez,B. and Busch,H. TITLE Cloning of the cDNA and sequence of the human proliferating cell nuclear protein P120 JOURNAL Cancer Communications 1, 243-251 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.Busch, 14-FEB-1990. FEATURES from to/span description pept < 1 2612 proliferating cell nuclear protein P120 (AA at 3) BASE COUNT 674 a 701 c 725 g 512 t ORIGIN 346 bp upstream of AvaI site. 1 cacgcgcgac gccaccttct cccatttctg cctgccacag taccatgggg cgcaagttgg 61 accctacgaa ggagaagcgg gggccaggcc gaaaggcccg gaagcagaag ggtgccgaga 121 cagaactcgt cagattcttg cctgcagtaa gtgacgaaaa ttccaagagg ctgtctagtc 181 gtgctcgaaa gagggcagcc aagaggagat tgggctctgt tgaagcccct aagacaaata 241 agtctcctga ggccaaacca tcgcctggaa agctaccaaa agggatctct gcaggagctg 301 tccagacagc tggtaagaag ggaccccagt ccctatttaa tgctcctcga ggcaagaagc 361 gcccagcacc tggcagtgat gaggaagagg aggaggaaga ctctgaagaa gatggtatgg 421 tgaaccacgg ggacctctgg ggctccgagg acgatgctga tacggtagat gactatggag 481 ctgactccaa ctctgaggat gaggaggaag gtgaagcgtt gctgcccatt gaaagagctg 541 ctcggaagca gaaggcccgg gaagctgctg ctgggatcca gtggagtgaa gaggagaccg 601 aggacgagga ggaagagaaa gaagtgaccc ctgagtcagg ccccccaaag gtggaagagg 661 cagatggggg cctgcagatc aatgtggatg aggaaccatt tgtgctgccc cctgctgggg 721 agatggagca ggatgcccag gctccagacc tgcaacgagt tcacaagcgg atccaggata 781 ttgtgggaat tctgcgtgat tttggggctc agcgggagga agggcggtct cgttctgaat 841 acctgaaccg gctcaagaag gatctggcca tttactactc ctatggagac ttcctgcttg 901 gcaagctcat ggacctcttc cctctgtctg agctggtgga gttcttagaa gctaatgagg 961 tgcctcggcc cgtcaccctc cggaccaata ccttgaaaac ccgacgccga gaccttgcac 1021 aggctctaat caatcgtggg gttaacctgg atcccctggg caagtggtca aagactggac 1081 tagtggtgta tgattcttct gtgcccattg gtgctacccc cgagtacctg gctgggcact 1141 acatgctgca gggagcctcc agcatgttgc ccgtcatggc cttggcaccc caggaacatg 1201 agcggatcct ggacatgtgt tgtgcccctg gaggaaagac cagctacatg gcccagctga 1261 tgaagaacac gggtgtgatc cttgccaatg acgccaatgc tgagcggctc aagagtgttg 1321 tgggcaactt gcatcggctg ggagtcacca acaccattat cagccactat gatgggcgcc 1381 agttccccaa ggtggtgggg ggctttgacc gagtactgct ggatgctccc tgcagtggca 1441 ctggggtcat ctccaaggat ccagccgtga agactaacaa ggatgagaag gacatcctgc 1501 gctgtgctca cctccagaag gagttgctcc tgagtgctat tgactctgtc aatgcgacct 1561 ccaagacagg aggctacctg gtttactgca cctgttctat cacagtagaa gagaatgagt 1621 gggtggtaga ctatgctctg aaaaagagga atgtgcgact ggtgcccacg ggcctagact 1681 ttggccagga aggttttacc cgctttcgag aaaggcgctt ccaccccagt ctgcgttcta 1741 cccgacgctt ctaccctcat acccacaata tggatgggtt cttcattgcc aagttcaaga 1801 aattttccaa ttctatccct cagtcccaga caggaaattc tgaaacagcc acacctacaa 1861 atgtagactt gcctcaggtc atccccaagt ctgagaacag cagccagcca gccaagaaag 1921 ccaagggggc tggaaagaca aagcagcagc tgcagaaaca gcaacatccc aagaaggcct 1981 ccttccagaa gctgaatggc atctccaaag gggcagactc agaattgtcc actgtacctt 2041 ctgtcacaaa gacccaagct tcctccagct tccaggatag cagtcagcca gctggaaaag 2101 ccgaagggat cagggagcca aaggtgactg ggaagctaaa gcaacgatca cctaaattac 2161 agtcctccaa gaaagttgct ttcctcaggc agaatgcccc tcccaagggc acagacacac 2221 aaacaccggc tgtgttatcc ccatccaaga ctcaggccac cctgaaacct aaggaccatc 2281 atcagcccct tggaagggcc aagggggttg agaagcagca gttcgcagag cagccttttg 2341 agaaagctgc cttccagaaa cagaatgata cccccaaggg cctcagcctc ccactgtgtc 2401 tcccatccgt tccagccgcc ccccaccagc aaagaggaag aaatctcagt ccaggggcaa 2461 cagccagctg ctgctatctt agatggttga aaactagacg ggtggctcac tgccattgtc 2521 accaggttgg aactcttgcc tctgtgagga tgccttctct actgtgcata cccatgaaat 2581 ttaatacaca ttttaaaacc tctggccact ga // LOCUS MUSH2RIIBP 2204 bp ss-mRNA ROD 10-APR-1990 DEFINITION Mouse MHC class I regulatory element binding protein (H-2RIIBP) mRNA, 3' end. ACCESSION M26804 KEYWORDS MHC class I regulatory element binding protein. SOURCE Mouse liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2204) AUTHORS Hamada,K., Gleason,S.L., Levi,B.-Z., Hirschfeld,S., Appella,E. and Ozato,K. TITLE H-2RIIBP, a member of the nuclear hormone receptor superfamily that binds to both the regulatory element of major histocompatibility class I genes and the estrogen response element JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8289-8293 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by author, 10-AUG-1989. FEATURES from to/span description pept < 1 1341 MHC class I regulatory element binding protein (H-2RIIBP) mRNA < 1 2204 H-2RIIBP mRNA BASE COUNT 436 a 645 c 648 g 475 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccccc gaagcccaga cagctcctcc ccaaatcccc tttctcaggg gatccgtccg 61 tcttctcctc ctggcccacc tcttacccct tcagcacctc cacctccaat gccacccccg 121 ccactgggct cccccttccc agtcatcagt tcttccatgg ggtcccctgg tctgccccct 181 ccggctcccc caggattctc cgggcctgtc agcagccctc agatcaactc cacagtgtcg 241 ctccctgggg gtgggtctgg cccccctgaa gatgtgaagc caccggtctt aggggtccgg 301 ggcctgcact gtccaccccc tccaggtggt cctggggctg gcaaacggct ctgtgcaatc 361 tgcggggacc gaagctcagg caagcactat ggggtttaca gctgcgaggg ctgcaagggt 421 ttcttcaagc gcaccattcg gaaggacctg acctactcgt gtcgtgataa caaagactgt 481 acagtggaca agcgccagcg gaatcgctgt cagtactgtc gctatcagaa gtgcctggcc 541 actggcatga aaagggaggc ggttcaggag gagcgtcaac gggggaagga caaagacggg 601 gatggagatg gggctggggg agcccctgag gagatgcctg tggacaggat cctggaggca 661 gagcttgctg tggagcagaa gagtgaccaa ggcgttgagg gtcctggggc caccgggggt 721 ggtggcagca gcccaaatga cccagtgact aacatctgcc aggcagctga caaacagctg 781 ttcacactcg ttgagtgggc aaagaggatc ccgcacttct cctccctacc tctggacgat 841 caggtcatac tgctgcgggc aggctggaac gagctcctca ttgcgtcctt ctcccatcgg 901 tccattgatg tccgagatgg catcctcctg gccacgggtc ttcatgtgca cagaaactca 961 gcccattccg caggcgtggg agccatcttt gatcgggtgc tgacagagct agtgtccaaa 1021 atgcgtgaca tgaggatgga caagacagag cttggctgcc tgcgggcaat catactgttt 1081 aatccagacg ccaagggcct ctccaaccct ggagaggtgg agatccttcg ggagaaggtg 1141 tacgcctcac tggagaccta ttgcaagcag aagtaccctg agcagcaggg ccggtttgcc 1201 aagctgctgt tacgtcttcc tgccctccgc tccatcggcc tcaagtgtct ggagcacctg 1261 ttcttcttca agctcattgg cgacaccccc attgacacct tcctcatgga gatgcttgag 1321 gctccccacc agctagcctg agcccagatg cacaccgagt gtcactgagg aggacttgag 1381 cctgggcagg gggcagagcc atgggacagg tgcagagcag gaggggactt gcccagcctg 1441 ccagggatct ggcaacactt agcagggttc gcttggtctc caagtcgaag gggaccccag 1501 atccctgtga ggactttatg tctaccttca gtggccttga gtctctgaat ttgtcggggt 1561 ctcccatggt gcaggtgatt cttcatcctg gctccccagc acaaagcact gccctgcttc 1621 cttctcattt ggcctcactc ccttctgaag agtggaacag agctccccca gaaaggggtg 1681 ttgtggggca ggccccccaa gctgatgatc atgggagcag ggctctgaca gcctttatcc 1741 tctcagactt gacagatggg ggcagaggag ggacctgcct ctgtctcctg tcagccccat 1801 ttccacagtc cctcctgcag tcagactgaa gaataaaggg gtagtgaagg ggctgctgga 1861 ggtggaggaa cccattgctc ttttaatttc ctgtgaggag agactgggag ttagactcaa 1921 agaagtactg tacatcccca ggttgactta aatgtcaggg ctggagatgg catgtgggca 1981 aggaggcccc tcaggtgggc tgtcccaaag ctccctgggc tctgcctcgg gtggccctac 2041 agctcttccc tagtcttaag cacagctagg ctgggagcaa gtggggacat tgatgggggt 2101 ggccagcctg cagagttggg tgctgggctg catggttttt gccctggacc tcttttgggg 2161 gttccctccc atctttcact tgcacataaa gttgctttcc agtt // LOCUS MUSID 927 bp ss-mRNA ROD 15-FEB-1990 DEFINITION Mouse helix-loop-helix DNA binding protein regulator (Id) mRNA, 3' end. ACCESSION M31885 KEYWORDS helix-loop-helix DNA binding protein regulator; helix-loop-helix protein; regulatory protein. SOURCE Mouse (strain DBA2) erythroleukemia cell line MEL, cDNA to mRNA, clone pMH18. REFERENCE 1 (bases 1 to 927) AUTHORS Benezra,R., Davis,R.L., Lockshon,D., Turner,D.L. and Weintraub,H. TITLE The protein Id: A negative regulator of helix-loop DNA binding proteins JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by R.Benezra, 02-FEB-1990 FEATURES from to/span description pept < 1 533 helix-loop-helix protein (Id) (AA at 3) site 261 305 helix 1 site 306 335 loop site 336 383 helix 2 BASE COUNT 192 a 256 c 259 g 220 t ORIGIN 1 attgtacaac ctttctccaa cttcttgttc tcttcccaca ctctgttctc agcctcctcc 61 gctcccctcc gcctgttctc aggatcatga aggtcgccag tggcagtgcc gcagccgctg 121 caggccctag ctgttcgctg aaggcgggca ggacagcggg cgaggtggta cttggtctgt 181 cggagcaaag cgtggccatc tcgcgctgcg ctgggacgcg cctgcccgcc ttgctggacg 241 agcagcaggt gaacgtcctg ctctacgaca tgaacggctg ctactcacgc ctcaaggagc 301 tggtgcccac cctgccccag aaccgcaaag tgagcaaggt ggagatcctg cagcatgtaa 361 tcgactacat cagggacctg cagctggagc tgaactcgga gtctgaagtc gggaccaccg 421 gaggccgggg actgcctgtc cgcgccccgc tcagcaccct gaacggcgag atcagtgcct 481 tggcggccga ggcggcatgt gttccagccg acgatcgcat cttgtgtcgc tgaggcggcg 541 cactgaggga ccagatggac tccagccctt caggaggcaa gaggaaaaaa gtgctctcgg 601 ttccccaggg gatctctggg aaagacacta ccgcagccac cggactcttg gcggatcggt 661 ccagtgggta gagggtttga tcaacagagc ctcaccctct ccacctttca gcctccagag 721 actttgggga gggggttaat caaccccgcg tgtttctgtt ttattgaaaa agcagacatt 781 ttttttaaat ggtcacattt cgtgcttctc ggatttctga ggaaatattt tatattgtat 841 attacaatga tcactggctg aaaatattgt tttacaatag ttctatgggg gtgggttttt 901 tgttgttatt aaacaaacac tttagat // LOCUS MZEPPDK 644 bp ds-DNA PLN 16-FEB-1990 DEFINITION Z.mays pyruvate orthophosphate dikinase (PPDK) gene, 3' end. ACCESSION M32081 KEYWORDS pyruvate orthophosphate dikinase. SOURCE Z.mays mays (strain B73) DNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 644) AUTHORS Glackin,C.A. and Grula,J.W. TITLE Organ-specific transcripts of different size and abundance JOURNAL Unpublished (1990) In Press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.A.Glacken, 13-FEB-1990. FEATURES from to/span description pept < 1 23 pyruvate orthophosphate dikinase (PPDK) (AA at 3) mRNA < 1 322 PPDK mRNA BASE COUNT 150 a 148 c 148 g 198 t ORIGIN 1 ctgcagctca ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc ctgcagctca 61 ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc tgttggtgca tgctggtgat 121 taataatact actatgacag agccatatgc tgttggtgca tgctggtgat taataatact 181 actatgacag agccatatgc tctgtgaaga gtattagtag cagcgctcat aaaagctaca 241 gttccatcta tctgtgaaga gtattagtag cagcgctcat aaaagctaca gttccatcta 301 ttttctcagc tatgtaaaac ttccaaactg ttcatgctta aaactgaggg ttttctcagc 361 tatgtaaaac ttccaaactg ttcatgctta aaactgaggg ttttcgtggt gtgagatgtg 421 catgtcgttg ttgaggccat tgctgcacat ttttcgtggt gtgagatgtg catgtcgttg 481 ttgaggccat tgctgcacat tccacctatt gaggccctcc tcaaattaag cctcgaacaa 541 gctgatcatc tccacctatt gaggccctcc tcaaattaag cctcgaacaa gctgatcatc 601 ttttctgaga actctagact cgttttctga gaactctaga ctcg // LOCUS RATCLATP 4350 bp ss-mRNA ROD 15-FEB-1990 DEFINITION Rat ATP citrate-lyase mRNA, complete cds. ACCESSION J05210 KEYWORDS ATP citrate-lyase. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4350) AUTHORS Elshourbagy,N.A., Near,J.C., Metz,P.J., Sathe,G.M., Southan,C., Stickler,J.E., Gross,M., Young,J.F., Wells,T.N.C. and Groot,P.H.E. TITLE Rat ATP citrate-lyase: Molecular cloning and sequencing analysis of a full length cDNA and mRNA abundance as a function of diet, organ, and age JOURNAL J. Biol. Chem. 265, 1430-1435 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by N.A.Elshourbagy, 13-FEB-1990. FEATURES from to/span description pept 73 3375 ATP citrate-lyase mRNA < 1 4350 ATP cytrate-lyase mRNA BASE COUNT 1084 a 1114 c 1136 g 1016 t ORIGIN 1 taagctggtg cttacggaca gagagccaca ctcgggcttt ctcgaagagg taaaccaggt 61 ccctctgcag ccatgtcagc caaggcaatt tcagagcaga ccggcaaaga actcctttac 121 aagtacatct gtaccacctc agccatccag aaccggttca agtatgcccg ggttactccc 181 gacacagact gggcccatct cctgcaggac cacccctggc tgcttagcca gagcttggta 241 gtcaagccgg accagctgat caaacgtcga ggaaagcttg gtctagtcgg ggtcaacctc 301 tctctggatg gagtcaaatc ctggctgaaa cctcgactgg gacatgaggc caccgtcggc 361 aaggccaaag gcttcctcaa gaactttctg attgagccct tcgtccccca cagtcaggcg 421 gaggagttct acgtgtgcat ctatgctacc cgggaaggag actacgtcct gttccaccat 481 gaagggggtg tggatgtggg cgatgtggac accaaagccc agaagctgct tgtgggtgtg 541 gacgagaaac tgaacgctga agacattaag agacacctgt tggtccacgc ccccgaagac 601 aagaaagaaa tcctggccag cttcatctcc ggcctattca atttctacga agatctttac 661 ttcacctacc ttgagatcaa cccccttgtg gtgaccaaag atggtgtcta catccttgac 721 ctggcggcca aggtggacgc cactgctgac tacatctgca aagtcaagtg gggtgatata 781 gagttccctc ccccctttgg gcgtgaggca tacccagagg aagcctacat tgcagacctg 841 gatgccaaaa gtggggcgag cttgaagctg accttgctga accccaaggg gcggatctgg 901 accatggttg ccgggggtgg cgcctctgtc gtgtacagtg ataccatctg tgatcttgga 961 ggtgtcaacg aactggcgaa ttacggggag tactctggtg cccccagtga acaacagacc 1021 tatgactacg ccaagaccat cctctcactt atgactcgag agaagcaccc ggatggcaag 1081 atcctcatca ttggaggcag cattgcaaac ttcaccaacg tggccgccac cttcaagggc 1141 attgtgagag caattcgaga ttaccagggt tccctgaagg agcacgaggt caccatcttt 1201 gttcgaagag gtggcccgaa ctatcaagag ggattacgag tgatgggaga agttgggaag 1261 accactggaa tccccatcca tgtctttggc acagaaactc acatgacggc cattgtgggc 1321 atggcctggg caccggccat tcccaaccag ccacccacag cggctcacac tgccaacttc 1381 ctccttaatg ccagtgggag cacatcgaca ccagcaccca gcaggacagc gtctttttcc 1441 gagtccagag ctgacgaggt ggcccctgca aagaaagcca agccagccat gccccaagat 1501 tcagtcccaa gtccaagatc cctgcaagga aagagtgcca ccctcttcag ccgacatacc 1561 aaggctatcg tatggggcat gcagacccgg gctgtgcaag gcatgctgga ctttgactac 1621 gtgtgctccc gagatgagcc ttcagtggct gctatggtct acccgttcac gggggatcat 1681 aagcagaagt tttactgggg acacaaggaa atcctgatcc ctgtcttcaa gaacatggct 1741 gacgccatga aaaagcatcc ggaggtagac gtgctgatca actttgcatc tctgcgatcg 1801 gcttatgaca gcaccatgga gaccatgaac tatgcacaga tccggaccat agccatcata 1861 gcagaaggca tccctgaggc tctcacacgg aagctcatca agaaggcaga ccagaagggc 1921 gtgaccatca ttgggccagc cacggttggg ggcatcaagc ctggatgctt taagattggg 1981 aatactggtg ggatgctgga caacatcctg gcctccaaac tgtatcgccc aggcagtgtg 2041 gcctacgtct cgcgttcagg aggcatgtct aacgaactca ataatatcat ctctcggacc 2101 acagatggtg tctacgaggg tgttgccatc ggcggggaca ggtaccctgg gtccacattc 2161 atggatcacg tgctgcgtta ccaagacact ccaggagtca agatgattgt agttcttggg 2221 gagatagggg gtacagaaga atataagatc tgccggggca tcaaggaggg ccgcctcacc 2281 aagccagtgg tctgctggtg catcgggacc tgtgccacca tgttctcttc tgaggtccag 2341 tttggccacg ctggggcttg tgccaaccag gcttctgaaa cggcagtagc caagaaccag 2401 gccttgaagg aagcgggagt gtttgtgccc cgaagctttg atgagctcgg agaaatcatt 2461 cagtccgtgt atgaagatct tgtggccaaa ggcgccattg tacctgctca ggaagtgcca 2521 cctccaacag tacccatgga ctactcttgg gccagggagc tgggtttaat ccgaaaacct 2581 gcctcattca tgaccagcat ctgtgacgag cgggggcagg aactcattta tgcgggcatg 2641 cccatcaccg aggtcttcaa ggaagagatg ggcattggtg gtgtcctggg cctcctctgg 2701 ttccagagaa ggttgcccaa gtattcctgc cagttcattg agatgtgtct catggtcacc 2761 gctgatcacg ggccagctgt ctccggggcc cataacacta tcatctgtgc tcgggctggg 2821 aaggacctgg tctccagcct cacctcaggg ctgctcacca ttggggaccg gtttgggggt 2881 gccttggacg cagcagcgaa gatgttcagt aaagcctttg acagcggcat tattcccatg 2941 gagtttgtga acaagatgaa gaaggagggg aaactgatca tgggcatcgg ccatcgagtc 3001 aaatcgataa acaacccaga catgcgagtg cagatcctca aagactttgt caaacagcac 3061 ttccccgcca ccccgctgct cgactatgca ctggaagtgg agaaaatcac cacctcaaag 3121 aagccaaatc ttatcctgaa cgtggatggt ttcatcggcg ttgcgtttgt ggacatgctt 3181 aggaactgtg gctccttcac ccgggaggaa gctgacgagt atgttgacat tggagccctc 3241 aatggcgtct ttgtgctggg aaggagtatg ggcttcatcg ggcactatct tgaccagaag 3301 aggctgaagc aagggctgta tcgtcacccc tgggacgaca tttcctatgt tctcccggaa 3361 cacatgagca tgtaaccgag ccagcagccc taccgtagaa aaaggaagac aaaaactccc 3421 tcctcgacaa tatagcggac agacagctgg aaacagagcc cgttatgggc tgggcctgga 3481 atggaaatag ccattgatgt gcaggcatgg aaagccaaca ccacaggccc attcagtcca 3541 cacagagaag cttagtattt ttttttatat atatatctat atatatataa gcatagaaat 3601 ttaaaaccaa gccaatactt gtgacgtttg cgctgctacc tgctgtatct attacatgga 3661 agactgtaag caagcgctgt cagaataatg ttcttctagg gccttatgat gttgctttct 3721 ttttttaatt agttgaaaat ttatttttcc tctagaacta gtggatccga cttttaagac 3781 ttcaggatac tatctgtttg taggaccact gtctggtatc ccacctccca ctcatcttca 3841 caccacatga agaacactgt attaatctga ttttttagga tctttttttt tttttttgtg 3901 ttatgtgtta agggtttatt tagtatccca ctgaaacgtt ctgtgtttcg gaccaatgtc 3961 tacttatgtc aaggggagga gggttggggc cattgtaccc ttagccatcg tcacacatgt 4021 ggagtagtaa cttaaatgta aagttgtaac atacaagtgt ttaaaatgga aaccgcaaag 4081 caaaaagctg tgaaacgtct cgtgtcttgt gttctctgtg ttcatgcagc tgacttgtct 4141 gttactgaag tgtgggtcca aagactcaca tctgttccgc atctgtaacc cacagagatt 4201 ctggcagctg ccacctcagt ctcttctctg tattatcatg tttggtttaa ataaactaga 4261 tagtaaaaag aattcctgca gcccggggga tccactagtt ctagagcggc gcaccgcggt 4321 ggagctccag cttttgttcc ctttagtgag // LOCUS RATLOX 5351 bp ss-mRNA ROD 15-FEB-1990 DEFINITION Rat aorta lysyl oxidase mRNA, complete cds. ACCESSION J02903 KEYWORDS lysyl oxidase. SOURCE Rat neonatal aorta, cDNA to mRNA, (library of Clontech), clones 7, 13, 8-1 and IIB. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5351) AUTHORS Trackman,P.C., Pratt,A.M., Wolanski,A., Tang,S.-S., Offner,G.D., Troxler,R.F. and Kagan,H.M. TITLE Cloning of rat aorta lysyl oxidase cDNA: Complete codons and predicted amino acid sequence JOURNAL Biochemistry (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.C.Trackman, 07-FEB-1990. FEATURES from to/span description pept 287 1516 lysyl oxidase precursor sigp 287 349 pot. lysyl oxidase signal peptide matp 350 1513 lysyl oxidase site 2252 2274 region of dyad symmetry BASE COUNT 1302 a 1528 c 1245 g 1276 t ORIGIN 192 bp upstream of TaqI site. 1 cttctacttc agacactgtg cgctctcccg gaccgtcgtg cgccgctccc cgtcgccttc 61 caggactggg aaaggggaga ggaggacggt gccacgtccg acggcctcct gggctggggg 121 cagggtctgc cgctcgccct tggcaccagt ccctgcgacc catccccgag cctcgccctc 181 ctcctccctg ctcgaagagg tctccctcct tcgcgggatc tgagtccctg tcttcatttt 241 tctcctagcc acgtccctcc ccgagaaggg acgagccggg agcatcatgc gtttcgcctg 301 gaccgtgctc tttctgggac agctgcagtt ctgtcccctt ctccgctgcg ccccgcaggc 361 cccgcgcgag cctcccgccg cccccggtgc ctggcgccag acaatccaat gggagaacaa 421 cgggcaggtg ttcagtctgt tgagcctggg ggcgcagtac cagcctcagc gacgccgcga 481 ctccagcgcc actgccccga gagccgacgc aacgctgcag cacagccacg cacgcccatt 541 ctgctgctgc gtgacaaccg cactgcctct gcccgtgcga ggactccaag cccatctggg 601 gtcgccgcgg gtcgtccccg gcccgcagcc cgccactggt tccaagttgg tttctcgccg 661 tcgggggccg gcgatggagc ctcaaggcgc gcagaaccgg actgcgtcgc cacagcctcc 721 gcagctcagt aatctgaggc cacccagcca cgtagatcgc atggtggcga cgacccctac 781 aatccctaca agtactccga cgacaacccc tattataact actatgacac ttatgagaga 841 ccggtccggg agcaggcacc gacctggata tggcaccggt tacttccagt acggtctccc 901 ggacctggta cccgatccct actacatcca ggcatccacg tacgtacaaa agatgtctat 961 gtacaacctg agatgcgctg cggaagaaaa ctgcctggcc agttcagcat atagggcgga 1021 tgtcagagac tatgaccaca gggtactgct acgatttcct cagagagtga aaaaccaagg 1081 gacgtctgac ttcttaccaa gccgcccccg ctactcctgg gagtggcaca gctgccacca 1141 acattaccac agcatggatg aattcagcca ctacgacctg ctggatgcca gcacacagag 1201 gagagtggcc gagggccaca aagcaagctt ctgtctggag gacacttcct gtgattatgg 1261 gtaccacaga cgatttgcct gtactgcaca cacacagggg ttgagtcccg gatgttatga 1321 tacttatgca gcagacatag actgccagtg gattgatatt acagatgtac aacccggaaa 1381 ttacattcta aaggtcagtg taaaccccag ctacctggtg cctgaatcag actacagtaa 1441 caatgtcgta cgctgtgaaa ttcgctacac aggacatcac gcctatgcct caggctgcac 1501 catttcaccg tattagaaag aagctcacct tcccaaagga tgaagcagta cctggtgttt 1561 ggacctatga aaaccgtaga ttagcttaag taggaagact tagatatttt aaaaggcaaa 1621 cggaaaaaca acaaagaagg ttttgtttgg actctttcac aacaaatcac ataactggat 1681 tttgagtgtt taaatcagca ttagattggc acattttaaa tacttattca tgttgcttta 1741 tgaagtaatg gtgtttcaat tctgtgggtg catagtgggc tctttcaaag aattctgaat 1801 ttcttacctt cttttgaaat tatagtgcaa aaagaagagg atattttaat gaatgagcca 1861 caatttgaac tgattacttt ctaaattgcc agacccatga gacaatgatg atgggtttgt 1921 atttgcctca acatagattc gctttttaaa aagggtgttc ctattgtata ggcaaaaatg 1981 gatacacttg gtgctgagga agggtcaaat actaactatt gttgtcacga aatataggtc 2041 tacagcagag agatggtgag tatatattca gatagttaca tccctatata aactatgttt 2101 acattttaga tgcttttctt tctgttaatt gcttaatctc actctgactt gaggtacaac 2161 ttctgttttg gaatgaatta gataattcca gattctggtt tgataattgt tgacattccc 2221 ccatgctact ttttctgagg gcagaaacgt ctaatgtgac gactcttcac attaccatta 2281 cgaggataca cagcacagcg aaatcattcc gatgacaggt gtgatagatg gagagctaac 2341 atgcaactgc cgagtgtttc actgttagcc agaactaagt cacttgcccc acacagcaat 2401 tacaccatga atctctaaca tcacaacctt ctttcaaata cccacggact catccatcct 2461 tccatccgtc atccatccat ccgtccgtcc gtccgtcctg actgcctagt gccactgtct 2521 ggctaggcac acccactatc aacctggttc acctgtcatg gcagcctgta cccacccccg 2581 ccacacaccc cgacgctggc ctatagtgca aaggttgtgc gggctggtcc ttcccacaat 2641 gcagtactgt aatccccgtc cctcctggag cccgaattcc ttctacttca gacactgtgc 2701 gctctcccgg accgtcgtgc gccgctcccc gtcgccttcc aggactggga aaggggagag 2761 gaggacggtg ccacgtccga cggcctcctg ggctgggggc agggtctgcc gctcgccctt 2821 ggcaccagtc cctgcgaccc atccccgagc ctcgccctcc tcctccctgc tcgaagaggt 2881 ctccctcctt cgcgggatct gagtccctgt cttcattttt ctcctagcca cgtccctccc 2941 cgagaaggga cgagccggga gcatcatgcg tttcgcctgg accgtgctct ttctgggaca 3001 gctgcagttc tgtccccttc tccgctgcgc cccgcaggcc ccgcgcgagc ctcccgccgc 3061 ccccggtgcc tggcgccaga caatccaatg ggagaacaac gggcaggtgt tcagtctgtt 3121 gagcctgggg gcgcagtacc agcctcagcg acgccgcgac tccagcgcca ctgccccgag 3181 agccgacgca acgctgcagc acagccacgc acgcccattc tgctgctgcg tgacaaccgc 3241 actgcctctg cccgtgcgag gactccaagc ccatctgggg tcgccgcggg tcgtccccgg 3301 cccgcagccc gccactggtt ccaagttggt ttctcgccgt cgggggccgg cgatggagcc 3361 tcaaggcgcg cagaaccgga ctgcgtcgcc acagcctccg cagctcagta atctgaggcc 3421 acccagccac gtagatcgca tggtggcgac gacccctaca atccctacaa gtactccgac 3481 gacaacccct attataacta ctatgacact tatgagagac cggtccggga gcaggcaccg 3541 acctggatat ggcaccggtt acttccagta cggtctcccg gacctggtac ccgatcccta 3601 ctacatccag gcatccacgt acgtacaaaa gatgtctatg tacaacctga gatgcgctgc 3661 ggaagaaaac tgcctggcca gttcagcata tagggcggat gtcagagact atgaccacag 3721 ggtactgcta cgatttcctc agagagtgaa aaaccaaggg acgtctgact tcttaccaag 3781 ccgcccccgc tactcctggg agtggcacag ctgccaccaa cattaccaca gcatggatga 3841 attcagccac tacgacctgc tggatgccag cacacagagg agagtggccg agggccacaa 3901 agcaagcttc tgtctggagg acacttcctg tgattatggg taccacagac gatttgcctg 3961 tactgcacac acacaggggt tgagtcccgg atgttatgat acttatgcag cagacataga 4021 ctgccagtgg attgatatta cagatgtaca acccggaaat tacattctaa aggtcagtgt 4081 aaaccccagc tacctggtgc ctgaatcaga ctacagtaac aatgtcgtac gctgtgaaat 4141 tcgctacaca ggacatcacg cctatgcctc aggctgcacc atttcaccgt attagaaaga 4201 agctcacctt cccaaaggat gaagcagtac ctggtgtttg gacctatgaa aaccgtagat 4261 tagcttaagt aggaagactt agatatttta aaaggcaaac ggaaaaacaa caaagaaggt 4321 tttgtttgga ctctttcaca acaaatcaca taactggatt ttgagtgttt aaatcagcat 4381 tagattggca cattttaaat acttattcat gttgctttat gaagtaatgg tgtttcaatt 4441 ctgtgggtgc atagtgggct ctttcaaaga attctgaatt tcttaccttc ttttgaaatt 4501 atagtgcaaa aagaagagga tattttaatg aatgagccac aatttgaact gattactttc 4561 taaattgcca gacccatgag acaatgatga tgggtttgta tttgcctcaa catagattcg 4621 ctttttaaaa agggtgttcc tattgtatag gcaaaaatgg atacacttgg tgctgaggaa 4681 gggtcaaata ctaactattg ttgtcacgaa atataggtct acagcagaga gatggtgagt 4741 atatattcag atagttacat ccctatataa actatgttta cattttagat gcttttcttt 4801 ctgttaattg cttaatctca ctctgacttg aggtacaact tctgttttgg aatgaattag 4861 ataattccag attctggttt gataattgtt gacattcccc catgctactt tttctgaggg 4921 cagaaacgtc taatgtgacg actcttcaca ttaccattac gaggatacac agcacagcga 4981 aatcattccg atgacaggtg tgatagatgg agagctaaca tgcaactgcc gagtgtttca 5041 ctgttagcca gaactaagtc acttgcccca cacagcaatt acaccatgaa tctctaacat 5101 cacaaccttc tttcaaatac ccacggactc atccatcctt ccatccgtca tccatccatc 5161 cgtccgtccg tccgtcctga ctgcctagtg ccactgtctg gctaggcaca cccactatca 5221 acctggttca cctgtcatgg cagcctgtac ccacccccgc cacacacccc gacgctggcc 5281 tatagtgcaa aggttgtgcg ggctggtcct tcccacaatg cagtactgta atccccgtcc 5341 ctcctggagc c // LOCUS SHFIPAH 2900 bp ds-DNA BCT 16-FEB-1990 DEFINITION S.flexner invasion plasmid antigen (ipaH) gene, complete cds. ACCESSION M32063 KEYWORDS invasion plasmid antigen. SOURCE S.flexner (strain M90T-W), serotype S) DNA, clone pWR390. ORGANISM Shigella flexneri Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2900) AUTHORS Hartman,A.B., Venkatesan,M.M., Oaks,E.V. and Buysse,J.M. TITLE Sequence and molecular characterization of a multicopy invasion plasmid antigen gene, ipaH, of Shigella flexner JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.B.Hartman, 12-FEB-1990. FEATURES from to/span description pept 251 1849 invasion plasmid antigen pept 2277 > 2900 ORF3 signal 139 144 -35 region signal 162 167 -10 region signal 2111 2116 -35 region signal 2132 2137 -10 region BASE COUNT 868 a 657 c 578 g 797 t ORIGIN 1 catagaaaac ctccataaat aaattacaac taacttctgt tatgtgtaaa atggaaacta 61 ttaaaactta atatcggaaa tggtaagtga aatttgtata aatatacaat tttaaatatt 121 tattctcaca aatataaggt tgacctagca ttatgttctc tgtaaataat acacactcat 181 cagtttcttg ctccccctct attaactcaa actcaaccag taatgaacat tatctgagaa 241 tcctgactga atgggaaaag aactcttctc ccgggaagag cgaggcattg cttttaacag 301 actctcccag tgctttcaga atcaagaagc agtattaaat ttatcagacc taaatttgac 361 gtctcttccc gaattaccaa agcatatttc tgctttgatt gtagaaaata ataaattaac 421 atcattgcca aagctgcctg catttcttaa agaacttaat gctgataata acaggctttc 481 tgtgatacca gaacttcctg agtcattaac aactttaagt gttcgttcta atcaactgga 541 aaaccttcct gttttgccaa accatttaac atcattattt gttgaaaata acaggctata 601 taacttaccg gctcttcccg aaaaattgaa atttttacat gtttattata acaggctgac 661 aacattaccc gacttaccgg ataaactgga aattctctgt gctcagcgca ataatctggt 721 tacttttcct caattttctg atagaaacaa tatcagacaa aaggaatatt attttcattt 781 taatcagata accactcttc cggagagttt ttcacaatta gattcaagtt acaggattaa 841 tatttcaggg aatccattgt cgactcgcgt tctgcaatcc ctgcaaagat taacctcttc 901 gccggactac cacggcccgc agatttactt ctccatgagt gacggacaac agaatacact 961 ccatcgcccc ctggctgatg ccgtgacagc atggttcccg gaaaacaaac aatctgatgt 1021 atcacagata tggcatgctt ttgaacatga agagcatgcc aacacctttt ccgcgttcct 1081 tgaccgcctt tccgataccg tctctgcacg caatacctcc ggattccgtg aacaggtcgc 1141 tgcatggctg gaaaaactca gtgcctctgc ggagcttcga cagcagtctt tcgctgttgc 1201 tgctgatgcc actgagagct gtgaggaccg tgtcgcgctc acatggaaca atctccggaa 1261 aaccctcctg gtccatcagg catcagaagg ccttttcgat aatgataccg gcgctctgct 1321 ctccctgggc agggaaatgt tccgcctcga aattctggag gacattgccc gggataaagt 1381 cagaactctc cattttgtgg atgagataga agtctacctg gccttccaga ccatgctcgc 1441 agagaaactt cagctctcca ctgccgtgaa ggaaatgcgt ttctatggcg tgtcgggagt 1501 gacagcaaat gacctccgca ctgccgaagc tatggtcaga agccgtgaag agaatgaatt 1561 tacggactgg ttctccctct ggggaccatg gcatgctgta ctgaagcgta cggaagctga 1621 ccgctgggcg caggcagaag agcagaagta tgagatgctg gagaatgagt actctcagag 1681 ggtggctgac cggctgaaag catcaggtct gagcggtgat gcggatgcgc agagggaagc 1741 cggtgcacag gtgatgcgtg agactgaaca gcagatttac cgtcagctga ctgacgaggt 1801 actggccctg cgattgtctg aaaacggctc acgactgcac cattcataat cacgtcgcat 1861 aagcataaac cgcagaccgg attgactccg gaaaaactgt gacccgatta cggaccttaa 1921 caacaacccg taaatcctcg ctcaataccg gcagggattt acggcgtgca actgactttt 1981 ttgaggggat aaccaaccag atcgtttgct atgggaatat cgagacagta atgagttaaa 2041 tgataaaaat tgtttgaaaa tataggggat aaagatcaat ccaaactgga tgaaagtaga 2101 actggtcaca ttaacatggg tagactgata taacaatcga cggttactgg aaagacagga 2161 acatattcct ccagccggaa tgaaaacgcc gataaagctc taggattgtt tttttaaaga 2221 ctttctcgtt ttatttgcat taatagacca agatatgaat agtgaggggt taataaatga 2281 aaccgatcaa caatcattct ttttttcgtt ccctttgtgg cttatcatgt atatctcgtt 2341 tatcggtaga agaacagtgt accagagatt accaccgcat ctgggatgac tgggctaggg 2401 aaggaacaac aacagaaaat cgcatccagg cggttcgatt attgaaaata tgtctggata 2461 cccgggagcc tgttctcaat ttaagcttac tgaaactacg ttctttacca ccactccctt 2521 tgcatatacg tgaacttaat atttccaaca atgagttaat ctccctacct gaaaattctc 2581 cgcttttgac agaacttcat gtaaatggta acaacttgaa tatactcccg acacttccat 2641 ctcaactgat taagcttaat atttcattca atcgaaattt gtcatgtctg ccatcattac 2701 caccatattt acaatcactc tcggcacgtt ttaatagtct ggagacgtta ccagagcttc 2761 catcaacgct aacaatatta cgtattgaag gtaatcgcct tactgtcttg cctgaattgc 2821 ctcatagact acaagaactc tttgtttccg gcaacagact acaggaacta ccagaatttc 2881 ctcagagctt aaaatatttg // LOCUS VACCSBP 1020 bp ds-DNA VRL 06-DEC-1989 DEFINITION Vaccinia virus cell surface-binding protein gene, complete cds. ACCESSION J05190 KEYWORDS antigen; carbonic anhydrase-related transmembrane protein; cell surface-binding protein; envelope protein. SOURCE Vaccinia virus (wild type WR) DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus. REFERENCE 1 (bases 1 to 1020) AUTHORS Maa,J.-S., Rodriguez,J.F. and Esteban,M. TITLE Structural and functional characterization of a cell surface binding protein of vaccinia virus JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Esteban 30-NOV-1989. FEATURES from to/span description pept 61 975 cell surface-binding protein site 822 972 attachment site site 9 19 alpha helix site 103 114 alpha helix site 270 280 alpha helix site 286 292 alpha helix BASE COUNT 354 a 182 c 167 g 317 t ORIGIN Map position HindIII-D. 1 catccattgt aattcccata ctaagagcta tttttaaaca gttatcattt catttttact 61 atgccgcaac aactatctcc tattaatata gaaactaaaa aagcaatttc taacgcgcga 121 ttgaagccgt tagacataca ttataatgag tcgaaaccaa ccactatcca gaacactgga 181 aaactagtaa ggattaattt taaaggagga tatataagtg gagggtttct ccccaatgaa 241 tatgtgttat catcactaca tatatattgg ggaaaggaag acgattatgg atccaatcac 301 ttgatagatg tgtacaaata ctctggagag attaatcttg ttcattggaa taagaaaaaa 361 tatagttctt atgaagaggc aaaaaaacac gatgatggac ttatcattat ttctatattc 421 ttacaagtat tggatcataa aaatgtatat tttcaaaaga tagttaatca attgcattcc 481 attagatccg ccaatacgtc tgcaccgttt gattcagtat tttatctaga caatttgctg 541 cctagtaagt tggattattt tacatatcta ggaacaacta tcaaccactc tgcagacgct 601 gtatggataa tttttccaac gccaataaac attcattctg atcaactatc taaattcaga 661 acactattgt cgtcgtctaa tcatgatgga aaaccgcatt atataacaga gaactataga 721 aatccgtata aattgaacga cgacacgcaa gtatattatt ctggggagat tatacgagca 781 gcaactacct ctccagcgcg cgagaactat tttatgagat ggttgtccga tttgagagag 841 acatgttttt catattatca aaaatatatc gaagagaata aaacattcgc aattattgcc 901 atagtattcg tgtttatact taccgctatt ctctttttta tgagtcgacg atattcgcga 961 gaaaaacaaa actagattcg ataccttgtt gagcctccat tagaacggca gtgacttcgc //
GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS CATL11AA 424 bp ds-DNA MAM 28-FEB-1990 DEFINITION Cat LINE-1 DNA sequence region 1. ACCESSION M32251 KEYWORDS L1 repetitive sequence; repetitive sequence. SOURCE Cat DNA. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 424) AUTHORS Fanning,T. and Singer,S. TITLE The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins JOURNAL Nucleic Acids Res. 15, 2251-2260 (1987) STANDARD simple staff_entry FEATURES from to/span description rpt 1 424 L1 repetitive element BASE COUNT 182 a 71 c 61 g 110 t ORIGIN 1 caaggatttt tcaatacaaa aagcaatatc agtgtaagat accatattaa cagaatgaag 61 ggaaaaaacc cacatcatta tctcaactga tacagaaaaa gcatttgaca acattcaata 121 ccctttcatg ataaaaacat tgtgaaaact agggatagaa gaaattacct tcattgaata 181 aaagccatat atgaaaaacc cacaagtaat atcatattca atggtaaaag actaaaatct 241 tttcctctaa atcaggaaaa ggcaaggatg cctactttca ccacttctat tcaatgtgta 301 ctggaaattc tagctagagc aattaggcaa gaaaaagaaa ggcatccaaa ttgaaaagga 361 tgaagtaaaa ttattcctat ttgcagttga tatgatctta tacatagaaa accctaaaga 421 ttac // LOCUS CATL12AA 377 bp ds-DNA MAM 28-FEB-1990 DEFINITION Cat LINE-1 DNA sequence region 2. ACCESSION M32252 KEYWORDS L1 repetitive sequence; repetitive sequence. SOURCE Cat DNA. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 377) AUTHORS Fanning,T. and Singer,S. TITLE The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins JOURNAL Nucleic Acids Res. 15, 2251-2260 (1987) STANDARD simple staff_entry FEATURES from to/span description rpt 1 377 L1 repetitive element BASE COUNT 146 a 81 c 85 g 65 t ORIGIN 1 gcaatgacac atcggacaaa gggctagtat ccaaaacctg taaagaactc accaaactcc 61 acacccaaaa aacaaataat ccagtgaaga aatgggcaga agacatgaat agacactttc 121 ctaaacaaga ccagatggtc aacaggcaca tgaaaagatg ctcaacgtca ctccccatga 181 gggagataca aatcaaaacc acaatgagat atcacctcac accagtcaga gtggctaaaa 241 tgagaaaatc aggagatgct gatgctggag aggatgtgga gaaacgggaa ccctcttgca 301 ctgttggtgg gaatgcaagc aaactggtgc agctgctctg gaaaacagtg tggaggttcc 361 tcagaaaatt aaaaata // LOCUS HUMCOL4A6 1056 bp ss-mRNA PRI 09-MAR-1990 DEFINITION Human collagen type IV mRNA. ACCESSION M33653 KEYWORDS alternative splicing; collagen type IV. SOURCE Human tumor cell line HT-1080, cDNA to mRNA, clones HT-[125,133]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1056) AUTHORS Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and Prockop,D.J. TITLE Partial characterization of a low molecular weight human collagen that undergoes alternative splicing JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 740 collagen type IV /hgml_locus_uid="LJ0118J" /map="13q34" /nomgen="COL4A2" BASE COUNT 277 a 228 c 333 g 218 t ORIGIN 1 agatccggac gctggccttg atggggcctc ttggtcttct tgggcaaatt ggcccacttg 61 gagctctagg gatgctaggc cagaaggggg agattggact gctaggccct ctaggacacg 121 atgggaaagg acctcgcggt aaactaggag acatgggccc tcttggtccc caaggccccc 181 taggaaagga tggacctcca ggagtgaagg gagaaaacgg gcacccaggg agcccaggag 241 agaaggggga aaaaggggag acaggacaag caggctcacc ggttcctggg ctgctagggc 301 cagaggggcc tcccggacct ccggggctcc aaggtgttcc tggaccaaag ggggaagcag 361 gactagacgg agcaaaagga gagaaaggct tccagggaga aaaaggagac cgtggtcccc 421 tgggactacc cggagcttca ggtttggacg gcaggcctgg gccaccgggt actccaggac 481 caattggagt tccaggccca gcgggaccaa agggcgagag gggcagcaaa ggagaccctg 541 ggatgacagg accaacggga gcagctgggc ttcctggttt acatggacca cccggggaca 601 agggaaaccg gggggagagg gggaagaaag gctctagagg gcctaaaggg gacaagggag 661 accaaggagc gcctggatta gatgccccct gcccattggg gcaagatggc ttaccagtcc 721 aaggctgctg gaacaagtga tgcctctaac cttggattgg cctgtgtgtg tgtttgtaca 781 tagaatattt atttttatac agttttcact ttttgaaaat gccagaagta tgatgcatct 841 tacagattat taaaaaagaa agaaaaaccg ttgcatattt tgtacagaaa atatcaacct 901 cttccctttt gtttacaaga tgttttgtat aagcctatgt ctctaataca ttttttgttt 961 ggtcgtaatg tctgcatgat atttgtgcac atttattaag tatcgaaggt taataaattg 1021 ttgtgtcctg gtgccaaggg gggccagcca gaactg // LOCUS HUMCOL4A7 629 bp ss-mRNA PRI 09-MAR-1990 DEFINITION Human collagen type IV mRNA. ACCESSION M15524 KEYWORDS alternative splicing; collagen type IV. SOURCE Human tumor cell line HT-1080, cDNA to mRNA, clone HT-127. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 629) AUTHORS Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and Prockop,D.J. TITLE Partial characterization of a low molecular weight human collagen that undergoes alternative splicing JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 403 collagen type IV (AA at 2) /hgml_locus_uid="LJ0118J" /map="13q34" /nomgen="COL4A2" BASE COUNT 175 a 131 c 185 g 138 t ORIGIN 1 agaggggcct cccggacctc cggggctcca aggtgttcct ggaccaaagg gggaagcagg 61 actagacgga gcaaaaggag agaaaggctt ccagggagaa aaaggagacc gtggtcccct 121 gggactaccc ggtactccag gaccaattgg agttccaggc ccagcgggac caaagggcga 181 gaggggcagc aaaggagacc ctgggatgac aggaccaacg ggagcagctg ggcttcctgg 241 tttacatgga ccacccgggg acaagggaaa ccggggggag agggggaaga aaggctctag 301 agggcctaaa ggggacaagg gagaccaagg agcgcctgga ttagatgccc cctgcccatt 361 ggggcaagat ggcttaccag tccaaggctg ctggaacaag tgatgcctct aaccttggat 421 tggcctgtgt gtgtgtttgt acatagaata tttattttta tacagttttc actttttgaa 481 aatgccagaa gtatgatgca tcttacagat tattaaaaaa gaaagaaaaa ccgttgcata 541 ttttgtacag aaaatatcaa cctcttccct tttgtttaca agatgttttg tataagccta 601 tgtctctaat acattttttg tttggtcgt // LOCUS HUMRB1RA 4600 bp ss-mRNA PRI 09-APR-1990 DEFINITION Human retinoblastoma associated (RB1) mRNA. ACCESSION M33647 J02994 KEYWORDS retinoblastoma protein. SOURCE Human retinoblast cDNA to mRNA, clone p4.7R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4600) AUTHORS Friend,S.H., Horowitz,J.M., Gerber,M.R., Wang,X.-F., Bogenmann,E., Li,F.P. and Weinberg,R.A. TITLE Deletions of a DNA sequence in retinoblastomas and mesenchymal tumors: Organization of the sequence and its encoded protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9059-9063 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence for [1] kindly provided by S.H.Friend, 10-FEB-1988. FEATURES from to/span description pept 4 2790 retinoblastoma-associated protein /hgml_locus_uid="LE0159J" /map="13q14.2" /nomgen="RB1" BASE COUNT 1489 a 842 c 815 g 1454 t ORIGIN 1 gtcatgccgc ccaaaacccc ccgaaaaacg gccgccaccg ccgccgctgc cgccgcggaa 61 cccccggcac cgccgccgcc gccccctcct gaggaggacc cagagcagga cagcggcccg 121 gaggacctgc ctctcgtcag gcttgagttt gaagaaacag aagaacctga ttttactgca 181 ttatgtcaga aattaaagat accagatcat gtcagagaga gagcttggtt aacttgggag 241 aaagtttcat ctgtggatgg agtattggga ggttatattc aaaagaaaaa ggaactgtgg 301 ggaatctgta tctttattgc agcagttgac ctagatgaga tgtcgttcac ttttactgag 361 ctacagaaaa acatagaaat cagtgtccat aaattcttta acttactaaa agaaattgat 421 accagtacca aagttgataa tgctatgtca agactgttga agaagtatga tgtattgttt 481 gcactcttca gcaaattgga aaggacatgt gaacttatat atttgacaca acccagcagt 541 tcgatatcta ctgaaataaa ttctgcattg gtgctaaaag tttcttggat cacattttta 601 ttagctaaag gggaagtatt acaaatggaa gatgatctgg tgatttcatt tcagttaatg 661 ctatgtgtcc ttgactattt tattaaactc tcacctccca tgttgctcaa agaaccatat 721 aaaacagctg ttatacccat taatggttca cctcgaacac ccaggcgagg tcagaacagg 781 agtgcacgga tagcaaaaca actagaaaat gatacaagaa ttattgaagt tctctgtaaa 841 gaacatgaat gtaatataga tgaggtgaaa aatgtttatt tcaaaaattt tatacctttt 901 atgaattctc ttggacttgt aacatctaat ggacttccag aggttgaaaa tctttctaaa 961 cgatacgaag aaatttatct taaaaataaa gatctagatg caagattatt tttggatcat 1021 gataaaactc ttcagactga ttctatagac agttttgaaa cacagagaac accacgaaaa 1081 agtaaccttg atgaagaggt gaatgtaatt cctccacaca ctccagttag gactgttatg 1141 aacactatcc aacaattaat gatgatttta aattcagcaa gtgatcaacc ttcagaaaat 1201 ctgatttcct attttaacaa ctgcacagtg aatccaaaag aaagtatact gaaaagagtg 1261 aaggatatag gatacatctt taaagagaaa tttgctaaag ctgtgggaca gggttgtgtc 1321 gaaattggat cacagcgata caaacttgga gttcgcttgt attaccgagt aatggaatcc 1381 atgcttaaat cagaagaaga acgattatcc attcaaaatt ttagcaaact tctgaatgac 1441 aacatttttc atatgtcttt attggcgtgc gctcttgagg ttgtaatggc cacatatagc 1501 agaagtacat ctcagaatct tgattctgga acagatttgt ctttcccatg gattctgaat 1561 gtgcttaatt taaaagcctt tgatttttac aaagtgatcg aaagttttat caaagcagaa 1621 ggcaacttga caagagaaat gataaaacat ttagaacgat gtgaacatcg aatcatggaa 1681 tcccttgcat ggctctcaga ttcaccttta tttgatctta ttaaacaatc aaaggaccga 1741 gaaggaccaa ctgatcacct tgaatctgct tgtcctctta atcttcctct ccagaataat 1801 cacactgcag cagatatgta tctttctcct gtaagatctc caaagaaaaa aggttcaact 1861 acgcgtgtaa attctactgc aaatgcagag acacaagcaa cctcagcctt ccagacccag 1921 aagccattga aatctacctc tctttcactg ttttataaaa aagtgtatcg gctagcctat 1981 ctccggctaa atacactttg tgaacgcctt ctgtctgagc acccagaatt agaacatatc 2041 atctggaccc ttttccagca caccctgcag aatgagtatg aactcatgag agacaggcat 2101 ttggaccaaa ttatgatgtg ttccatgtat ggcatatgca aagtgaagaa tatagacctt 2161 aaattcaaaa tcattgtaac agcatacaag gatcttcctc atgctgttca ggagacattc 2221 aaacgtgttt tgatcaaaga agaggagtat gattctatta tagtattcta taactcggtc 2281 ttcatgcaga gactgaaaac aaatattttg cagtatgctt ccaccaggcc ccctaccttg 2341 tcaccaatac ctcacattcc tcgaagccct tacaagtttc ctagttcacc cttacggatt 2401 cctggaggga acatctatat ttcacccctg aagagtccat ataaaatttc agaaggtctg 2461 ccaacaccaa caaaaatgac tccaagatca agaatcttag tatcaattgg tgaatcattc 2521 gggacttctg agaagttcca gaaaataaat cagatggtat gtaacagcga ccgtgtgctc 2581 aaaagaagtg ctgaaggaag caaccctcct aaaccactga aaaaactacg ctttgatatt 2641 gaaggatcag atgaagcaga tggaagtaaa catctcccag gagagtccaa atttcagcag 2701 aaactggcag aaatgacttc tactcgaaca cgaatgcaaa agcagaaaat gaatgatagc 2761 atggatacct caaacaagga agagaaatga ggatctcagg accttggtgg acactgtgta 2821 cacctctgga ttcattgtct ctcacagatg tgactgtata actttcccag gttctgttta 2881 tggccacatt taatatcttc agctcttttt gtggatataa aatgtgcaga tgcaattgtt 2941 tgggtgattc ctaagccact tgaaatgtta gtcattgtta tttatacaag attgaaaatc 3001 ttgtgtaaat cctgccattt aaaaagttgt agcagattgt ttcctcttcc aaagtaaaat 3061 tgctgtgctt tatggatagt aagaatggcc ctagagtggg agtcctgata acccaggcct 3121 gtctgactac tttgccttct tttgtagcat ataggtgatg tttgctcttg tttttattaa 3181 tttatatgta tattttttta atttaacatg aacaccctta gaaaatgtgt cctatctatc 3241 ttccaaatgc aatttgattg actgcccatt caccaaaatt atcctgaact cttctgcaaa 3301 aatggatatt attagaaatt agaaaaaaat tactaatttt acacattaga ttttatttta 3361 ctattggaat ctgatatact gtgtgcttgt tttataaaat tttgctttta attaaataaa 3421 agctggaagc aaagtataac catatgatac tatcatacta ctgaaacaga tttcatacct 3481 cagaatgtaa aagaacttac tgattatttt cttcatccaa cttatgtttt taaatgagga 3541 ttattgatag tactcttggt ttttatacca ttcagatcac tgaatttata aagtacccat 3601 ctagtacttg aaaaagtaaa gtgttctgcc agatcttagg tatagaggac cctaacacag 3661 tatatcccaa gtgcactttc taatgtttct gggtcctgaa gaattaagat acaaattaat 3721 tttactccat aaacagactg ttaattatag gagccttaat ttttttttca tagagatttg 3781 tctaattgca tctcaaaatt attctgccct ccttaatttg ggaaggtttg tgttttctct 3841 ggaatggtac atgtcttcca tgtatctttt gaactggcaa ttgtctattt atcttttatt 3901 tttttaagtc agtatggtct aacactggca tgttcaaagc cacattattt ctagtccaaa 3961 attacaagta atcaagggtc attatgggtt aggcattaat gtttctatct gattttgtgc 4021 aaaagcttca aattaaaaca gctgcattag aaaaagaggc gcttctcccc tcccctacac 4081 ctaaaggtgt atttaaacta tcttgtgtga ttaacttatt tagagatgct gtaacttaaa 4141 ataggggata tttaaggtag cttcagctag cttttaggaa aatcactttg tctaactcag 4201 aattattttt aaaaagaaat ctggtcttgt tagaaaacaa aattttattt tgtgctcatt 4261 taagtttcaa acttactatt ttgacagtta ttttgataac aatgacacta gaaaacttga 4321 ctccatttca tcattgtttc tgcatgaata tcatacaaat cagttagttt ttaggtcaag 4381 ggcttactat ttctgggtct tttgctacta agttcacatt agaattagtg ccagaatttt 4441 aggaacttca gagatcgtgt attgagattt cttaaataat gcttcagata ttattgcttt 4501 attgcttttt tgtattggtt aaaactgtac atttaaaatt gctatgttac tattttctac 4561 aattaatagt ttgtctattt taaaataaat tagttgttag // LOCUS MV1MOBPA 2436 bp ds-DNA BCT 15-DEC-1989 DEFINITION Plasmid pMV158 mobilization protein gene. ACCESSION M28538 KEYWORDS mobilization. SOURCE Plasmid pMV158 from Streptococcus agalactiae, DNA. ORGANISM Plasmid pMV158 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 2436) AUTHORS Priebe,S.D. and Lacks,S.A. TITLE Region of the streptococcal plasmid pMV158 required for conjugative mobilization JOURNAL J. Bacteriol. 171, 4778-4784 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence for [1] kindly provided by S.A.Lacks, 18-JAN-1990. FEATURES from to/span description pept 628 2112 mobilization peptide BASE COUNT 824 a 393 c 565 g 654 t ORIGIN 1 agggatttct aaatcgttaa gggatcaact ttgggagaga gttcaaaatt gatccttttt 61 ttataacagg aattcaaatc tttttgttcc attaaagggc gcgattgctg aataaaagat 121 acgagagacc tctcttgtat cttttttatt ttgagtggtt ttgtccgtta cactagaaaa 181 ccgaaagaca ataaaaattt tattcttgct gagtctggct ttcggtaagc tagacaaaac 241 ggacaaaata aaaatcagca agggacaggt agtatttttt gagtgatctt ctcaaaaaat 301 actacctgtc ccttgctgat ttttaaacga gcacgagagc aaaacccccc tttgctgagg 361 tggcagaggg caggtttttt tgtttctttt ttctcgtaaa aaaaagaaag gtcttaaagg 421 ttttatggtt ttggtcggca ctgccgacag cctcgcagag cacacacttt atgaatataa 481 agtatagtgt gttatacttt acatggaagt tataccgaaa ttgtgctaaa ctaccagtta 541 agatgttgct gattaagacg agcaatagac tgtgtcggct cgtctgtctt tcggtaatcg 601 aaagacaagt cttaaggagg gaaatctatg agttacatgg ttgcaagaat gcagaagatg 661 aaagctggca atttgggagg agcttttaag cataatgaac gtgtttttga gacgcactca 721 aataaagata taaacccgag caggtcacac ctcaactacg agttgacgga tcgtgaccgc 781 tcggtgtcgt atgaaaagca gattaaagac tatgtgaacg aaaataaggt ttctaatcga 841 gcaatccgaa aagatgcagt tctatgtgat gagtggatta tcacatctga taaagatttt 901 tttgaaaagt tggatgagga acagacgaga acgttttttg aaacggctaa aaattatttt 961 gcagagaact atggcgagtc aaatattgct tatgcgagtg ttcacttgga tgaaagcacc 1021 cctcacatgc acatgggagt agtacctttt gagaacggta aattgtcatc aaaagcaatg 1081 tttgaccgtg aggagctaaa acatatccaa gaggacttgc ctagatacat gagcgaccac 1141 ggttttgagt tggaacgtgg caagctgaat agtgaagcta agcataagac tgtagctgaa 1201 ttcaagaggg caatggctga tatggaactc aaagaggaac ttcttgaaaa atatcatgca 1261 ccgctttttg ttgatgagag aacaggcgag ttgaacaatg acacggaagc tttttggcat 1321 gaaaaagagt ttgctgatat gtttgaagtt caatctccga tacgtgaaac aactaaccaa 1381 gaaaaaatgg actggttaag aaaacagtac caagaagagc tgaaaaaact agaatcgtct 1441 aaaaagcccc tagaagacga tttaagccat ttagaagagt tgcttgataa aaagaccaag 1501 gaatatatta aaatcgattc tgaggcctct gagagggcct cagagctatc taaagccgag 1561 ggatatataa ataccctaga aaatcattcg aagagcttag aagcgaaaat agagtgttta 1621 gagagtgata atctacaatt ggaaaaacaa aaggcgacaa aactcgaagc gaaagcgttg 1681 aacgagagtg agttgcgaga actaaagcct aagaagaatt ttctaggaaa agagcattat 1741 gagttaagtc ctgaacaatt tgaagggttg aaggcagaag tttatcgtag tagaactcta 1801 ttgcaccaca aagatattga actggagcaa gcaaaacgtc aagtatctct gagagcctct 1861 aaaaactatt ttacagctag tttagagcga gctaaggaaa aagctaaagg tgagagtata 1921 gaccgtctta aaagcgaaat aaagcgacta aaaaacgaaa attcaatttt acgtcagcaa 1981 aatgacaaga tgctagggaa attaagagag ttaatgcctg ataaagcctt taagaatttg 2041 ttatcagaac ttaaggcgat taagccaatc gtgaatataa ttaaaaaggc tattgaaaag 2101 agcttgttct gagcgattta tgccgtgaaa gctatttgac aataagcagt gacagagtac 2161 gctaggacgt gccgagccga aaggctttag cgtttcggac ggacacggac aaaggacggc 2221 agtcactggt tacttgttgt caaatagacc atggaataaa aagcgtcaaa agtcttgagt 2281 ggatgatacc ctatggtact ctattcgcct tttgactttt ttgctataat ttaagtgtcg 2341 ccagttcttc cgtcaggtaa tgcgaactta gactggaggt gagcgttgtg aagacattcc 2401 tcgagcttgt ctttgtccct tttgtggttg gcgttg // LOCUS RATIGFI1 958 bp ss-mRNA ROD 15-JUN-1988 DEFINITION Rat mRNA (clone IGF1AB1) for insulin-like growth factor I. ACCESSION X06107 Y00429 M32260 KEYWORDS insulin-like growth factor I. SOURCE rat (Rattus norvegicus). ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 958; enum. 1 to 958) AUTHORS Rotwein,P. JOURNAL Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA. STANDARD simple automatic REFERENCE 2 (bases ??) AUTHORS Shimatsu,A. and Rotwein,P. TITLE Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing Within the 5' Untranslated Region JOURNAL Nucleic Acids Res. 15, 7196-7196 (1987) STANDARD simple automatic COMMENT Another IGF-I mRNA of rat liver differing in the 5' UT-region is described in $<X06118>$ *source: tissue=liver; strain=Sprague-Dawley; EMBL features not translated to GenBank features: key from to description SITE 1 258 5' UT-region SITE 736 958 3' UT-region FEATURES from to/span description pept 259 738 IGF-I (AA 1-159) BASE COUNT 260 a 246 c 209 g 243 t ORIGIN 1 cgaatgttcc cccagctgtt tcctgtctac agtgtctgtg ttttgtagat aaatgtgagg 61 attttctcta aatccctctt ctgcttgcta aatctcactg tcgctgctaa attcagacca 121 gatagagcct gcgcaatcga aataaagtcc tcaaaattga aatgtgactt tgctctaaca 181 tctcccatct ctctggattc tttttgcctc attattcctg cccaccaatt catttccaga 241 ctttgtactt cagaagcgat ggggaaaatc agcagtcttc caactcaatt atttaagatc 301 tgcctctgtg acttcttgaa gataaagata cacatcatgt cgtcttcaca tctcttctac 361 ctggcactct gcttgctcac ctttaccagc tcggccacag ccggaccaga gaccctttgc 421 ggggctgagc tggtggacgc tcttcagttc gtgtgtggac caaggggctt ttacttcaac 481 aagcccacag gctatggctc cagcattcgg agggcaccac agacgggcat tgtggatgag 541 tgttgcttcc ggagctgtga tctgaggagg ctggagatgt actgtgctcc gctgaagcct 601 acaaagtcag ctcgttccat ccgggcccag cgccacactg acatgcccaa gactcagaag 661 tcccagcccc tatcgacaca caagaaaagg aagctgcaaa ggagaaggaa aggaagtaca 721 cttgaagaac acaagtagag gaagtgcagg aaacaagacc tacagaatgt aggaggagcc 781 tcccgaggaa cagaaaatgc cacgtcaccg caagatcctt tgctgcttga gcaacctgca 841 aaacatcgga acacctgcca aatatcaata atgagttcaa taccatttca gagatgggca 901 tttccctcaa tgaaatacac aagtaaacat tccgacattg tctttaggag tgtttgtt // LOCUS RATIGFI2 798 bp ss-mRNA ROD 15-JUN-1988 DEFINITION Rat mRNA (clone IGF1AB2) for insulin-like growth factor I. ACCESSION X06108 Y00429 M32339 KEYWORDS insulin-like growth factor I. SOURCE rat (Rattus norvegicus). ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 798) AUTHORS Rotwein,P. JOURNAL Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA. STANDARD simple automatic REFERENCE 2 (bases ??) AUTHORS Shimatsu,A. and Rotwein,P. TITLE Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing Within the 5' Untranslated Region JOURNAL Nucleic Acids Res. 15, 7196-7196 (1987) STANDARD simple automatic COMMENT Another IGF-I mRNA of rat liver differing in the 5' UT-region is described in $<XX6107>$.. *source: tissue=liver; strain=Sprague-Dawley; EMBL features not translated to GenBank features: key from to description SITE 1 72 5' UT-region SITE 553 798 3' UT-region FEATURES from to/span description pept 73 552 IGF-I (AA 1-159) BASE COUNT 238 a 196 c 183 g 181 t ORIGIN 1 gaatgttccc ccagctgttt cctgtctaca gtgtctgtgt tttgtagata aatactttgt 61 acttcagaag cgatggggaa aatcagcagt cttccaactc aattatttaa gatctgcctc 121 tgtgacttct tgaagataaa gatacacatc atgtcgtctt cacatctctt ctacctggca 181 ctctgcttgc tcacctttac cagctcggcc acagccggac cagagaccct ttgcggggct 241 gagctggtgg acgctcttca gttcgtgtgt ggaccaaggg gcttttactt caacaagccc 301 acaggctatg gctccagcat tcggagggca ccacagacgg gcattgtgga tgagtgttgc 361 ttccggagct gtgatctgag gaggctggag atgtactgtg ctccgctgaa gcctacaaag 421 tcagctcgtt ccatccgggc ccagcgccac actgacatgc ccaagactca gaagtcccag 481 cccctatcga cacacaagaa aaggaagctg caaaggagaa ggaaaggaag tacacttgaa 541 gaacacaagt agaggaagtg caggaaacaa gacctacaga atgtaggagg agcctcccga 601 ggaacagaaa atgccacgtc accgcaagat cctttgctgc ttgagcaacc tgcaaaacat 661 cggaacacct gccaaatatc aataatgagt tcaataccat ttcagagatg ggcatttccc 721 tcaatgaaat acacaagtaa acattccgac attgtcttta ggagtgtttg ttaaaaaaaa 781 aaaaaaaaaa acaaaaac // LOCUS VSVNJLA 590 bp ss-RNA VRL 14-MAR-1990 DEFINITION Vesicular stomatitis virus L gene. ACCESSION M17436 KEYWORDS L gene. SOURCE Vesicular stomatitis virus (New Jersey serotype; Ogden strain) cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 590) AUTHORS Luk,D., Masters,P.S., Gill,D.S. and Banerjee,A.K. TITLE Intergenic sequences of the vesicular stomatitis virus genome (New Jersey serotype): Evidence for two transcription initiation sites within the L gene JOURNAL Virology 160, 88-94 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 66 > 590 L protein BASE COUNT 189 a 110 c 117 g 174 t ORIGIN 1 atggctcatt tccgttaata tatgaaaaaa actggaatag aagcataact ccaacaggaa 61 tcaaaatgga tttcgatcta atagaagatt ctgctaactg ggaagatgat gagtcggact 121 tttttttgcg agacattcta tctcaagaag atcagatgtc ttacttaaac accgcagact 181 ataatttgaa ttctccactc atttccgatg acatggttta tataattaaa agaatgaatc 241 acgaggaggt tccccccatt tggagatcta aggagtggga ttctcctctt gatatgctta 301 gaggttgtca agctcaaccc atgtctcacc aagagatgca caattggttt ggaacttgga 361 ttcaaaatgt tcagcatgat tccgcccaac ccttcacatt cttaaaagag gtggacaaag 421 agtcagaaat gacctatgac ttggtctcca cttttctgaa ggggtgggtg gggaaagatt 481 acccattcaa gtcaaagaac aaagaaattg atagtatggc attagtcggt cctctttgtc 541 aaaaattcct tgatttgcat aaaatcactc tgatattaaa tgcagtttct //
GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS CATL11AA 424 bp ds-DNA MAM 28-FEB-1990 DEFINITION Cat LINE-1 DNA sequence region 1. ACCESSION M32251 KEYWORDS L1 repetitive sequence; repetitive sequence. SOURCE Cat DNA. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 424) AUTHORS Fanning,T. and Singer,S. TITLE The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins JOURNAL Nucleic Acids Res. 15, 2251-2260 (1987) STANDARD simple staff_entry FEATURES from to/span description rpt 1 424 L1 repetitive element BASE COUNT 182 a 71 c 61 g 110 t ORIGIN 1 caaggatttt tcaatacaaa aagcaatatc agtgtaagat accatattaa cagaatgaag 61 ggaaaaaacc cacatcatta tctcaactga tacagaaaaa gcatttgaca acattcaata 121 ccctttcatg ataaaaacat tgtgaaaact agggatagaa gaaattacct tcattgaata 181 aaagccatat atgaaaaacc cacaagtaat atcatattca atggtaaaag actaaaatct 241 tttcctctaa atcaggaaaa ggcaaggatg cctactttca ccacttctat tcaatgtgta 301 ctggaaattc tagctagagc aattaggcaa gaaaaagaaa ggcatccaaa ttgaaaagga 361 tgaagtaaaa ttattcctat ttgcagttga tatgatctta tacatagaaa accctaaaga 421 ttac // LOCUS CATL12AA 377 bp ds-DNA MAM 28-FEB-1990 DEFINITION Cat LINE-1 DNA sequence region 2. ACCESSION M32252 KEYWORDS L1 repetitive sequence; repetitive sequence. SOURCE Cat DNA. ORGANISM Felis catus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Feliforma; Felidae. REFERENCE 1 (bases 1 to 377) AUTHORS Fanning,T. and Singer,S. TITLE The LINE-1 DNA sequences in four mammalian orders predict proteins that conserve homologies to retrovirus proteins JOURNAL Nucleic Acids Res. 15, 2251-2260 (1987) STANDARD simple staff_entry FEATURES from to/span description rpt 1 377 L1 repetitive element BASE COUNT 146 a 81 c 85 g 65 t ORIGIN 1 gcaatgacac atcggacaaa gggctagtat ccaaaacctg taaagaactc accaaactcc 61 acacccaaaa aacaaataat ccagtgaaga aatgggcaga agacatgaat agacactttc 121 ctaaacaaga ccagatggtc aacaggcaca tgaaaagatg ctcaacgtca ctccccatga 181 gggagataca aatcaaaacc acaatgagat atcacctcac accagtcaga gtggctaaaa 241 tgagaaaatc aggagatgct gatgctggag aggatgtgga gaaacgggaa ccctcttgca 301 ctgttggtgg gaatgcaagc aaactggtgc agctgctctg gaaaacagtg tggaggttcc 361 tcagaaaatt aaaaata // LOCUS HUMCOL4A6 1056 bp ss-mRNA PRI 09-MAR-1990 DEFINITION Human collagen type IV mRNA. ACCESSION M33653 KEYWORDS alternative splicing; collagen type IV. SOURCE Human tumor cell line HT-1080, cDNA to mRNA, clones HT-[125,133]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1056) AUTHORS Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and Prockop,D.J. TITLE Partial characterization of a low molecular weight human collagen that undergoes alternative splicing JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 740 collagen type IV /hgml_locus_uid="LJ0118J" /map="13q34" /nomgen="COL4A2" BASE COUNT 277 a 228 c 333 g 218 t ORIGIN 1 agatccggac gctggccttg atggggcctc ttggtcttct tgggcaaatt ggcccacttg 61 gagctctagg gatgctaggc cagaaggggg agattggact gctaggccct ctaggacacg 121 atgggaaagg acctcgcggt aaactaggag acatgggccc tcttggtccc caaggccccc 181 taggaaagga tggacctcca ggagtgaagg gagaaaacgg gcacccaggg agcccaggag 241 agaaggggga aaaaggggag acaggacaag caggctcacc ggttcctggg ctgctagggc 301 cagaggggcc tcccggacct ccggggctcc aaggtgttcc tggaccaaag ggggaagcag 361 gactagacgg agcaaaagga gagaaaggct tccagggaga aaaaggagac cgtggtcccc 421 tgggactacc cggagcttca ggtttggacg gcaggcctgg gccaccgggt actccaggac 481 caattggagt tccaggccca gcgggaccaa agggcgagag gggcagcaaa ggagaccctg 541 ggatgacagg accaacggga gcagctgggc ttcctggttt acatggacca cccggggaca 601 agggaaaccg gggggagagg gggaagaaag gctctagagg gcctaaaggg gacaagggag 661 accaaggagc gcctggatta gatgccccct gcccattggg gcaagatggc ttaccagtcc 721 aaggctgctg gaacaagtga tgcctctaac cttggattgg cctgtgtgtg tgtttgtaca 781 tagaatattt atttttatac agttttcact ttttgaaaat gccagaagta tgatgcatct 841 tacagattat taaaaaagaa agaaaaaccg ttgcatattt tgtacagaaa atatcaacct 901 cttccctttt gtttacaaga tgttttgtat aagcctatgt ctctaataca ttttttgttt 961 ggtcgtaatg tctgcatgat atttgtgcac atttattaag tatcgaaggt taataaattg 1021 ttgtgtcctg gtgccaaggg gggccagcca gaactg // LOCUS HUMCOL4A7 629 bp ss-mRNA PRI 09-MAR-1990 DEFINITION Human collagen type IV mRNA. ACCESSION M15524 KEYWORDS alternative splicing; collagen type IV. SOURCE Human tumor cell line HT-1080, cDNA to mRNA, clone HT-127. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 629) AUTHORS Pihlajaniemi,T., Myllylae,R., Seyer,J., Kurkinen,M. and Prockop,D.J. TITLE Partial characterization of a low molecular weight human collagen that undergoes alternative splicing JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 940-944 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 403 collagen type IV (AA at 2) /hgml_locus_uid="LJ0118J" /map="13q34" /nomgen="COL4A2" BASE COUNT 175 a 131 c 185 g 138 t ORIGIN 1 agaggggcct cccggacctc cggggctcca aggtgttcct ggaccaaagg gggaagcagg 61 actagacgga gcaaaaggag agaaaggctt ccagggagaa aaaggagacc gtggtcccct 121 gggactaccc ggtactccag gaccaattgg agttccaggc ccagcgggac caaagggcga 181 gaggggcagc aaaggagacc ctgggatgac aggaccaacg ggagcagctg ggcttcctgg 241 tttacatgga ccacccgggg acaagggaaa ccggggggag agggggaaga aaggctctag 301 agggcctaaa ggggacaagg gagaccaagg agcgcctgga ttagatgccc cctgcccatt 361 ggggcaagat ggcttaccag tccaaggctg ctggaacaag tgatgcctct aaccttggat 421 tggcctgtgt gtgtgtttgt acatagaata tttattttta tacagttttc actttttgaa 481 aatgccagaa gtatgatgca tcttacagat tattaaaaaa gaaagaaaaa ccgttgcata 541 ttttgtacag aaaatatcaa cctcttccct tttgtttaca agatgttttg tataagccta 601 tgtctctaat acattttttg tttggtcgt // LOCUS HUMRB1RA 4600 bp ss-mRNA PRI 09-APR-1990 DEFINITION Human retinoblastoma associated (RB1) mRNA. ACCESSION M33647 J02994 KEYWORDS retinoblastoma protein. SOURCE Human retinoblast cDNA to mRNA, clone p4.7R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4600) AUTHORS Friend,S.H., Horowitz,J.M., Gerber,M.R., Wang,X.-F., Bogenmann,E., Li,F.P. and Weinberg,R.A. TITLE Deletions of a DNA sequence in retinoblastomas and mesenchymal tumors: Organization of the sequence and its encoded protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9059-9063 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence for [1] kindly provided by S.H.Friend, 10-FEB-1988. FEATURES from to/span description pept 4 2790 retinoblastoma-associated protein /hgml_locus_uid="LE0159J" /map="13q14.2" /nomgen="RB1" BASE COUNT 1489 a 842 c 815 g 1454 t ORIGIN 1 gtcatgccgc ccaaaacccc ccgaaaaacg gccgccaccg ccgccgctgc cgccgcggaa 61 cccccggcac cgccgccgcc gccccctcct gaggaggacc cagagcagga cagcggcccg 121 gaggacctgc ctctcgtcag gcttgagttt gaagaaacag aagaacctga ttttactgca 181 ttatgtcaga aattaaagat accagatcat gtcagagaga gagcttggtt aacttgggag 241 aaagtttcat ctgtggatgg agtattggga ggttatattc aaaagaaaaa ggaactgtgg 301 ggaatctgta tctttattgc agcagttgac ctagatgaga tgtcgttcac ttttactgag 361 ctacagaaaa acatagaaat cagtgtccat aaattcttta acttactaaa agaaattgat 421 accagtacca aagttgataa tgctatgtca agactgttga agaagtatga tgtattgttt 481 gcactcttca gcaaattgga aaggacatgt gaacttatat atttgacaca acccagcagt 541 tcgatatcta ctgaaataaa ttctgcattg gtgctaaaag tttcttggat cacattttta 601 ttagctaaag gggaagtatt acaaatggaa gatgatctgg tgatttcatt tcagttaatg 661 ctatgtgtcc ttgactattt tattaaactc tcacctccca tgttgctcaa agaaccatat 721 aaaacagctg ttatacccat taatggttca cctcgaacac ccaggcgagg tcagaacagg 781 agtgcacgga tagcaaaaca actagaaaat gatacaagaa ttattgaagt tctctgtaaa 841 gaacatgaat gtaatataga tgaggtgaaa aatgtttatt tcaaaaattt tatacctttt 901 atgaattctc ttggacttgt aacatctaat ggacttccag aggttgaaaa tctttctaaa 961 cgatacgaag aaatttatct taaaaataaa gatctagatg caagattatt tttggatcat 1021 gataaaactc ttcagactga ttctatagac agttttgaaa cacagagaac accacgaaaa 1081 agtaaccttg atgaagaggt gaatgtaatt cctccacaca ctccagttag gactgttatg 1141 aacactatcc aacaattaat gatgatttta aattcagcaa gtgatcaacc ttcagaaaat 1201 ctgatttcct attttaacaa ctgcacagtg aatccaaaag aaagtatact gaaaagagtg 1261 aaggatatag gatacatctt taaagagaaa tttgctaaag ctgtgggaca gggttgtgtc 1321 gaaattggat cacagcgata caaacttgga gttcgcttgt attaccgagt aatggaatcc 1381 atgcttaaat cagaagaaga acgattatcc attcaaaatt ttagcaaact tctgaatgac 1441 aacatttttc atatgtcttt attggcgtgc gctcttgagg ttgtaatggc cacatatagc 1501 agaagtacat ctcagaatct tgattctgga acagatttgt ctttcccatg gattctgaat 1561 gtgcttaatt taaaagcctt tgatttttac aaagtgatcg aaagttttat caaagcagaa 1621 ggcaacttga caagagaaat gataaaacat ttagaacgat gtgaacatcg aatcatggaa 1681 tcccttgcat ggctctcaga ttcaccttta tttgatctta ttaaacaatc aaaggaccga 1741 gaaggaccaa ctgatcacct tgaatctgct tgtcctctta atcttcctct ccagaataat 1801 cacactgcag cagatatgta tctttctcct gtaagatctc caaagaaaaa aggttcaact 1861 acgcgtgtaa attctactgc aaatgcagag acacaagcaa cctcagcctt ccagacccag 1921 aagccattga aatctacctc tctttcactg ttttataaaa aagtgtatcg gctagcctat 1981 ctccggctaa atacactttg tgaacgcctt ctgtctgagc acccagaatt agaacatatc 2041 atctggaccc ttttccagca caccctgcag aatgagtatg aactcatgag agacaggcat 2101 ttggaccaaa ttatgatgtg ttccatgtat ggcatatgca aagtgaagaa tatagacctt 2161 aaattcaaaa tcattgtaac agcatacaag gatcttcctc atgctgttca ggagacattc 2221 aaacgtgttt tgatcaaaga agaggagtat gattctatta tagtattcta taactcggtc 2281 ttcatgcaga gactgaaaac aaatattttg cagtatgctt ccaccaggcc ccctaccttg 2341 tcaccaatac ctcacattcc tcgaagccct tacaagtttc ctagttcacc cttacggatt 2401 cctggaggga acatctatat ttcacccctg aagagtccat ataaaatttc agaaggtctg 2461 ccaacaccaa caaaaatgac tccaagatca agaatcttag tatcaattgg tgaatcattc 2521 gggacttctg agaagttcca gaaaataaat cagatggtat gtaacagcga ccgtgtgctc 2581 aaaagaagtg ctgaaggaag caaccctcct aaaccactga aaaaactacg ctttgatatt 2641 gaaggatcag atgaagcaga tggaagtaaa catctcccag gagagtccaa atttcagcag 2701 aaactggcag aaatgacttc tactcgaaca cgaatgcaaa agcagaaaat gaatgatagc 2761 atggatacct caaacaagga agagaaatga ggatctcagg accttggtgg acactgtgta 2821 cacctctgga ttcattgtct ctcacagatg tgactgtata actttcccag gttctgttta 2881 tggccacatt taatatcttc agctcttttt gtggatataa aatgtgcaga tgcaattgtt 2941 tgggtgattc ctaagccact tgaaatgtta gtcattgtta tttatacaag attgaaaatc 3001 ttgtgtaaat cctgccattt aaaaagttgt agcagattgt ttcctcttcc aaagtaaaat 3061 tgctgtgctt tatggatagt aagaatggcc ctagagtggg agtcctgata acccaggcct 3121 gtctgactac tttgccttct tttgtagcat ataggtgatg tttgctcttg tttttattaa 3181 tttatatgta tattttttta atttaacatg aacaccctta gaaaatgtgt cctatctatc 3241 ttccaaatgc aatttgattg actgcccatt caccaaaatt atcctgaact cttctgcaaa 3301 aatggatatt attagaaatt agaaaaaaat tactaatttt acacattaga ttttatttta 3361 ctattggaat ctgatatact gtgtgcttgt tttataaaat tttgctttta attaaataaa 3421 agctggaagc aaagtataac catatgatac tatcatacta ctgaaacaga tttcatacct 3481 cagaatgtaa aagaacttac tgattatttt cttcatccaa cttatgtttt taaatgagga 3541 ttattgatag tactcttggt ttttatacca ttcagatcac tgaatttata aagtacccat 3601 ctagtacttg aaaaagtaaa gtgttctgcc agatcttagg tatagaggac cctaacacag 3661 tatatcccaa gtgcactttc taatgtttct gggtcctgaa gaattaagat acaaattaat 3721 tttactccat aaacagactg ttaattatag gagccttaat ttttttttca tagagatttg 3781 tctaattgca tctcaaaatt attctgccct ccttaatttg ggaaggtttg tgttttctct 3841 ggaatggtac atgtcttcca tgtatctttt gaactggcaa ttgtctattt atcttttatt 3901 tttttaagtc agtatggtct aacactggca tgttcaaagc cacattattt ctagtccaaa 3961 attacaagta atcaagggtc attatgggtt aggcattaat gtttctatct gattttgtgc 4021 aaaagcttca aattaaaaca gctgcattag aaaaagaggc gcttctcccc tcccctacac 4081 ctaaaggtgt atttaaacta tcttgtgtga ttaacttatt tagagatgct gtaacttaaa 4141 ataggggata tttaaggtag cttcagctag cttttaggaa aatcactttg tctaactcag 4201 aattattttt aaaaagaaat ctggtcttgt tagaaaacaa aattttattt tgtgctcatt 4261 taagtttcaa acttactatt ttgacagtta ttttgataac aatgacacta gaaaacttga 4321 ctccatttca tcattgtttc tgcatgaata tcatacaaat cagttagttt ttaggtcaag 4381 ggcttactat ttctgggtct tttgctacta agttcacatt agaattagtg ccagaatttt 4441 aggaacttca gagatcgtgt attgagattt cttaaataat gcttcagata ttattgcttt 4501 attgcttttt tgtattggtt aaaactgtac atttaaaatt gctatgttac tattttctac 4561 aattaatagt ttgtctattt taaaataaat tagttgttag // LOCUS MV1MOBPA 2436 bp ds-DNA BCT 15-DEC-1989 DEFINITION Plasmid pMV158 mobilization protein gene. ACCESSION M28538 KEYWORDS mobilization. SOURCE Plasmid pMV158 from Streptococcus agalactiae, DNA. ORGANISM Plasmid pMV158 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 2436) AUTHORS Priebe,S.D. and Lacks,S.A. TITLE Region of the streptococcal plasmid pMV158 required for conjugative mobilization JOURNAL J. Bacteriol. 171, 4778-4784 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence for [1] kindly provided by S.A.Lacks, 18-JAN-1990. FEATURES from to/span description pept 628 2112 mobilization peptide BASE COUNT 824 a 393 c 565 g 654 t ORIGIN 1 agggatttct aaatcgttaa gggatcaact ttgggagaga gttcaaaatt gatccttttt 61 ttataacagg aattcaaatc tttttgttcc attaaagggc gcgattgctg aataaaagat 121 acgagagacc tctcttgtat cttttttatt ttgagtggtt ttgtccgtta cactagaaaa 181 ccgaaagaca ataaaaattt tattcttgct gagtctggct ttcggtaagc tagacaaaac 241 ggacaaaata aaaatcagca agggacaggt agtatttttt gagtgatctt ctcaaaaaat 301 actacctgtc ccttgctgat ttttaaacga gcacgagagc aaaacccccc tttgctgagg 361 tggcagaggg caggtttttt tgtttctttt ttctcgtaaa aaaaagaaag gtcttaaagg 421 ttttatggtt ttggtcggca ctgccgacag cctcgcagag cacacacttt atgaatataa 481 agtatagtgt gttatacttt acatggaagt tataccgaaa ttgtgctaaa ctaccagtta 541 agatgttgct gattaagacg agcaatagac tgtgtcggct cgtctgtctt tcggtaatcg 601 aaagacaagt cttaaggagg gaaatctatg agttacatgg ttgcaagaat gcagaagatg 661 aaagctggca atttgggagg agcttttaag cataatgaac gtgtttttga gacgcactca 721 aataaagata taaacccgag caggtcacac ctcaactacg agttgacgga tcgtgaccgc 781 tcggtgtcgt atgaaaagca gattaaagac tatgtgaacg aaaataaggt ttctaatcga 841 gcaatccgaa aagatgcagt tctatgtgat gagtggatta tcacatctga taaagatttt 901 tttgaaaagt tggatgagga acagacgaga acgttttttg aaacggctaa aaattatttt 961 gcagagaact atggcgagtc aaatattgct tatgcgagtg ttcacttgga tgaaagcacc 1021 cctcacatgc acatgggagt agtacctttt gagaacggta aattgtcatc aaaagcaatg 1081 tttgaccgtg aggagctaaa acatatccaa gaggacttgc ctagatacat gagcgaccac 1141 ggttttgagt tggaacgtgg caagctgaat agtgaagcta agcataagac tgtagctgaa 1201 ttcaagaggg caatggctga tatggaactc aaagaggaac ttcttgaaaa atatcatgca 1261 ccgctttttg ttgatgagag aacaggcgag ttgaacaatg acacggaagc tttttggcat 1321 gaaaaagagt ttgctgatat gtttgaagtt caatctccga tacgtgaaac aactaaccaa 1381 gaaaaaatgg actggttaag aaaacagtac caagaagagc tgaaaaaact agaatcgtct 1441 aaaaagcccc tagaagacga tttaagccat ttagaagagt tgcttgataa aaagaccaag 1501 gaatatatta aaatcgattc tgaggcctct gagagggcct cagagctatc taaagccgag 1561 ggatatataa ataccctaga aaatcattcg aagagcttag aagcgaaaat agagtgttta 1621 gagagtgata atctacaatt ggaaaaacaa aaggcgacaa aactcgaagc gaaagcgttg 1681 aacgagagtg agttgcgaga actaaagcct aagaagaatt ttctaggaaa agagcattat 1741 gagttaagtc ctgaacaatt tgaagggttg aaggcagaag tttatcgtag tagaactcta 1801 ttgcaccaca aagatattga actggagcaa gcaaaacgtc aagtatctct gagagcctct 1861 aaaaactatt ttacagctag tttagagcga gctaaggaaa aagctaaagg tgagagtata 1921 gaccgtctta aaagcgaaat aaagcgacta aaaaacgaaa attcaatttt acgtcagcaa 1981 aatgacaaga tgctagggaa attaagagag ttaatgcctg ataaagcctt taagaatttg 2041 ttatcagaac ttaaggcgat taagccaatc gtgaatataa ttaaaaaggc tattgaaaag 2101 agcttgttct gagcgattta tgccgtgaaa gctatttgac aataagcagt gacagagtac 2161 gctaggacgt gccgagccga aaggctttag cgtttcggac ggacacggac aaaggacggc 2221 agtcactggt tacttgttgt caaatagacc atggaataaa aagcgtcaaa agtcttgagt 2281 ggatgatacc ctatggtact ctattcgcct tttgactttt ttgctataat ttaagtgtcg 2341 ccagttcttc cgtcaggtaa tgcgaactta gactggaggt gagcgttgtg aagacattcc 2401 tcgagcttgt ctttgtccct tttgtggttg gcgttg // LOCUS RATIGFI1 958 bp ss-mRNA ROD 15-JUN-1988 DEFINITION Rat mRNA (clone IGF1AB1) for insulin-like growth factor I. ACCESSION X06107 Y00429 M32260 KEYWORDS insulin-like growth factor I. SOURCE rat (Rattus norvegicus). ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 958; enum. 1 to 958) AUTHORS Rotwein,P. JOURNAL Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA. STANDARD simple automatic REFERENCE 2 (bases 1 to 958) AUTHORS Shimatsu,A. and Rotwein,P. TITLE Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing Within the 5' Untranslated Region JOURNAL Nucleic Acids Res. 15, 7196-7196 (1987) STANDARD simple automatic COMMENT Another IGF-I mRNA of rat liver differing in the 5' UT-region is described in $<X06118>$ *source: tissue=liver; strain=Sprague-Dawley; EMBL features not translated to GenBank features: key from to description SITE 1 258 5' UT-region SITE 736 958 3' UT-region FEATURES from to/span description pept 259 738 IGF-I (AA 1-159) BASE COUNT 260 a 246 c 209 g 243 t ORIGIN 1 cgaatgttcc cccagctgtt tcctgtctac agtgtctgtg ttttgtagat aaatgtgagg 61 attttctcta aatccctctt ctgcttgcta aatctcactg tcgctgctaa attcagacca 121 gatagagcct gcgcaatcga aataaagtcc tcaaaattga aatgtgactt tgctctaaca 181 tctcccatct ctctggattc tttttgcctc attattcctg cccaccaatt catttccaga 241 ctttgtactt cagaagcgat ggggaaaatc agcagtcttc caactcaatt atttaagatc 301 tgcctctgtg acttcttgaa gataaagata cacatcatgt cgtcttcaca tctcttctac 361 ctggcactct gcttgctcac ctttaccagc tcggccacag ccggaccaga gaccctttgc 421 ggggctgagc tggtggacgc tcttcagttc gtgtgtggac caaggggctt ttacttcaac 481 aagcccacag gctatggctc cagcattcgg agggcaccac agacgggcat tgtggatgag 541 tgttgcttcc ggagctgtga tctgaggagg ctggagatgt actgtgctcc gctgaagcct 601 acaaagtcag ctcgttccat ccgggcccag cgccacactg acatgcccaa gactcagaag 661 tcccagcccc tatcgacaca caagaaaagg aagctgcaaa ggagaaggaa aggaagtaca 721 cttgaagaac acaagtagag gaagtgcagg aaacaagacc tacagaatgt aggaggagcc 781 tcccgaggaa cagaaaatgc cacgtcaccg caagatcctt tgctgcttga gcaacctgca 841 aaacatcgga acacctgcca aatatcaata atgagttcaa taccatttca gagatgggca 901 tttccctcaa tgaaatacac aagtaaacat tccgacattg tctttaggag tgtttgtt // LOCUS RATIGFI2 798 bp ss-mRNA ROD 15-JUN-1988 DEFINITION Rat mRNA (clone IGF1AB2) for insulin-like growth factor I. ACCESSION X06108 Y00429 M32339 KEYWORDS insulin-like growth factor I. SOURCE rat (Rattus norvegicus). ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 798) AUTHORS Rotwein,P. JOURNAL Unpublished (1987) Washington U, Sch of Med, St. Louis, MO, USA. STANDARD simple automatic REFERENCE 2 (bases 1 to 798) AUTHORS Shimatsu,A. and Rotwein,P. TITLE Sequence of Two Rat Insulin-like Growth Factor I mRNAs Differing Within the 5' Untranslated Region JOURNAL Nucleic Acids Res. 15, 7196-7196 (1987) STANDARD simple automatic COMMENT Another IGF-I mRNA of rat liver differing in the 5' UT-region is described in $<XX6107>$.. *source: tissue=liver; strain=Sprague-Dawley; EMBL features not translated to GenBank features: key from to description SITE 1 72 5' UT-region SITE 553 798 3' UT-region FEATURES from to/span description pept 73 552 IGF-I (AA 1-159) BASE COUNT 238 a 196 c 183 g 181 t ORIGIN 1 gaatgttccc ccagctgttt cctgtctaca gtgtctgtgt tttgtagata aatactttgt 61 acttcagaag cgatggggaa aatcagcagt cttccaactc aattatttaa gatctgcctc 121 tgtgacttct tgaagataaa gatacacatc atgtcgtctt cacatctctt ctacctggca 181 ctctgcttgc tcacctttac cagctcggcc acagccggac cagagaccct ttgcggggct 241 gagctggtgg acgctcttca gttcgtgtgt ggaccaaggg gcttttactt caacaagccc 301 acaggctatg gctccagcat tcggagggca ccacagacgg gcattgtgga tgagtgttgc 361 ttccggagct gtgatctgag gaggctggag atgtactgtg ctccgctgaa gcctacaaag 421 tcagctcgtt ccatccgggc ccagcgccac actgacatgc ccaagactca gaagtcccag 481 cccctatcga cacacaagaa aaggaagctg caaaggagaa ggaaaggaag tacacttgaa 541 gaacacaagt agaggaagtg caggaaacaa gacctacaga atgtaggagg agcctcccga 601 ggaacagaaa atgccacgtc accgcaagat cctttgctgc ttgagcaacc tgcaaaacat 661 cggaacacct gccaaatatc aataatgagt tcaataccat ttcagagatg ggcatttccc 721 tcaatgaaat acacaagtaa acattccgac attgtcttta ggagtgtttg ttaaaaaaaa 781 aaaaaaaaaa acaaaaac // LOCUS VSVNJLA 590 bp ss-RNA VRL 14-MAR-1990 DEFINITION Vesicular stomatitis virus L gene. ACCESSION M17436 KEYWORDS L gene. SOURCE Vesicular stomatitis virus (New Jersey serotype; Ogden strain) cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 590) AUTHORS Luk,D., Masters,P.S., Gill,D.S. and Banerjee,A.K. TITLE Intergenic sequences of the vesicular stomatitis virus genome (New Jersey serotype): Evidence for two transcription initiation sites within the L gene JOURNAL Virology 160, 88-94 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 66 > 590 L protein BASE COUNT 189 a 110 c 117 g 174 t ORIGIN 1 atggctcatt tccgttaata tatgaaaaaa actggaatag aagcataact ccaacaggaa 61 tcaaaatgga tttcgatcta atagaagatt ctgctaactg ggaagatgat gagtcggact 121 tttttttgcg agacattcta tctcaagaag atcagatgtc ttacttaaac accgcagact 181 ataatttgaa ttctccactc atttccgatg acatggttta tataattaaa agaatgaatc 241 acgaggaggt tccccccatt tggagatcta aggagtggga ttctcctctt gatatgctta 301 gaggttgtca agctcaaccc atgtctcacc aagagatgca caattggttt ggaacttgga 361 ttcaaaatgt tcagcatgat tccgcccaac ccttcacatt cttaaaagag gtggacaaag 421 agtcagaaat gacctatgac ttggtctcca cttttctgaa ggggtgggtg gggaaagatt 481 acccattcaa gtcaaagaac aaagaaattg atagtatggc attagtcggt cctctttgtc 541 aaaaattcct tgatttgcat aaaatcactc tgatattaaa tgcagtttct //
GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS LEIKPCAA 234 bp ss-mRNA ORG 28-MAR-1990 DEFINITION L.tarentolae kinetoplast ATPase 6 edited mRNA, partial cds. ACCESSION M33229 KEYWORDS ATPase 6. SOURCE L.tarentolae kinetoplast (strain University of California) cDNA to mRNA. ORGANISM Kinetoplast Leishmania tarentolae Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Leishmania tarentolae. REFERENCE 1 (bases 1 to 234) AUTHORS Bhat,J.G., Koslowsky,D.J., Feagin,J.E., Smiley,B.L. and Kenneth,S. TITLE An extensively edited mitochondrial transcript in kinetoplastids encodes a protein homologous to ATPase subunit 6 JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by J.G.Bhat 23-MAR-1990. Each uridine of the mRNA within the ranges specified ("t" in GenBank conventions) are the result of post-transcriptional editing. The "u addition"s and "u deletions" are alterations of the maxicircle DNA. FEATURES from to/span description pept 32 > 234 ATPase 6 site 33 48 post-transcriptional editing (u addition) site 52 64 post-transcriptional editing (u addition) site 70 77 post-transcriptional editing (u addition) site 28 29 post-transcriptional editing (u deletion) site 80 98 post-transcriptional editing (u addition) site 31 32 post-transcriptional editing (u deletion) site 108 127 post-transcriptional editing (u addition) site 130 136 post-transcriptional editing (u addition) site 139 212 post-transcriptional editing (u addition) site 66 67 post-transcriptional editing (u deletion) site 129 130 post-transcriptional editing (u deletion) BASE COUNT 58 a 7 c 35 g 134 t ORIGIN 1 tatataaaaa attatatcag attaagaaaa aatgtttgtt ttttttgttt gtgatttagt 61 aattatgcgt attttattat gtttttgtta tagtgtttga tccagaatta tttttgtgtt 121 attttataat gttttttata tttgtaccga gttaatgttt tgtatttttg atgtttattt 181 atttgttggt ttatgtatgt ttatttgttt atgattcgtg ttatttaatt ttta // LOCUS TRBKPEATA 828 bp ss-mRNA ORG 28-MAR-1990 DEFINITION T.brucei kinetoplast ATPase 6 edited mRNA, complete cds. ACCESSION M33228 KEYWORDS ATPase 6. SOURCE T.brucei kinetoplast (strain EATRO 164) procyclic Istar1 serodeme cell line [isolated from Bos Taurus blood], cDNA to mRNA. ORGANISM Kinetoplast Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma brucei. REFERENCE 1 (bases 1 to 828) AUTHORS Bhat,J.G., Koslowsky,D.J., Feagin,J.E., Smiley,B.L. and Kenneth,S. TITLE An extensively edited mitochondrial transcript in kinetoplastids encodes a protein homologous to ATPase subunit 6 JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by J.G.Bhat 23-MAR-1990. Each uridine of the mRNA within the ranges specified ("t" in GenBank conventions) are the result of post-transcriptional editing. The "u addition"s and "u deletions" are alterations of the maxicircle DNA. FEATURES from to/span description pept 34 678 ATPase 6 site 33 59 post-transcriptional editing (u addition) site 67 132 post-transcriptional editing (u addition) site 135 170 post-transcriptional editing (u addition) site 172 173 post-transcriptional editing (u deletion) site 177 199 post-transcriptional editing (u addition) site 201 202 post-transcriptional editing (u deletion) site 205 245 post-transcriptional editing (u addition) site 253 256 post-transcriptional editing (u addition) site 258 259 post-transcriptional editing (u deletion) site 261 267 post-transcriptional editing (u addition) site 274 275 post-transcriptional editing (u deletion) site 276 278 post-transcriptional editing (u addition) site 280 281 post-transcriptional editing (u deletion) site 282 283 post-transcriptional editing (u addition) site 286 370 post-transcriptional editing (u addition) site 374 380 post-transcriptional editing (u addition) site 383 384 post-transcriptional editing (u deletion) site 384 402 post-transcriptional editing (u addition) site 407 408 post-transcriptional editing (u deletion) site 408 473 post-transcriptional editing (u addition) site 480 498 post-transcriptional editing (u addition) site 505 528 post-transcriptional editing (u addition) site 531 617 post-transcriptional editing (u addition) site 621 631 post-transcriptional editing (u addition) site 635 642 post-transcriptional editing (u addition) site 646 647 post-transcriptional editing (u addition) site 650 651 post-transcriptional editing (u deletion) site 654 657 post-transcriptional editing (u addition) site 660 661 post-transcriptional editing (u deletion) site 662 699 post-transcriptional editing (u addition) site 702 703 post-transcriptional editing (u deletion) site 703 775 post-transcriptional editing (u addition) site 776 777 post-transcriptional editing (u deletion) site 778 779 post-transcriptional editing (u addition) site 782 783 post-transcriptional editing (u addition) site 785 786 post-transcriptional editing (u deletion) BASE COUNT 156 a 21 c 137 g 514 t ORIGIN 1 aaaaataagt attttgatat tattaaagta aatatgtttt tatttttttt ttgtgattta 61 ttttggttgc gtttgttatt atgtatgtat tattgtgtat gatctaggtt atgttttatt 121 gtgtatttta attgtttaat gttgattttt gattttttat tattttgttt gtttgatttg 181 tatttgtttg ttggtttgtg tttgttttta ttgttgtggt ttatgttgtt taatttatat 241 agtttaattt tgtattattg tattacttat ttgaatttgt atttgttgtt ttgtattgtt 301 tttttattgt atattgcatt tttatttttg ttttgttttt tatgtgattt ttttttgttt 361 aataatttgt tagttggtga tagttttatg gatgtttttt ttattcgttt tttgttgtgt 421 tttttagagt gtttttcttt gttgtgtcgt tgtttgtcga cgtttttgcg tttgttttgt 481 aatttattat catcccattt tttattgttg atgttttttg atttttttta ttttattttt 541 gttttttttt ttttatggtg ttttttgtta ttgatttatt ttatttattt ttgtgttttg 601 tttttgttta ttattttatg tgtttttata tttgttggat ttatttgccg ccatattaca 661 gttatttatt ttttgtaata tgattttgca gttgataatg gattttttgt tgtttttgtt 721 gtttgtttag ttttgtattt gatttttgat agttattata ttgttgttga aatttggttt 781 gttattggag ttatagaata agatcaaata agttaataat aaaaaaaa //
GenBank-Updates@genbank.bio.net (04/11/90)
LOCUS RATCA2 446 bp ds-DNA ROD 13-NOV-1989 DEFINITION Rat calcium/calmodulin-dependent protein kinase II alpha-subunit gene, 5' end. ACCESSION M29699 KEYWORDS calcium/calmodulin-dependent protein kinase. SOURCE Rat DNA, clone c4. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 446) AUTHORS Sunyer,T. and Sahyoun,N. TITLE Sequence analysis and DNA-protein interactions within the 5' flanking region of the Ca-2+/calmodulin-dependent protein kinase II alpha-subunit gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 278-282 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Sunyer, 03-NOV--1989, for release after publication. FEATURES from to/span description pept 429 > 446 calcium/calmodulin-dependent protein kinase II mRNA 280 > 446 calcium/calmodulin-dependent protein kinase II mRNA (alt.) mRNA 282 > 446 calcium/calmodulin-dependent protein kinase II mRNA (alt.) signal 114 120 tata box binding 201 205 Sp1 binding site BASE COUNT 96 a 126 c 134 g 90 t ORIGIN 1 gatcacttgt ggactaagtt tgttcacatc ccgttctcca accccctcag tacatcaccc 61 tgggagaaca aggtccactt gcttctgggc ccacacagtc ctgcagtatt gtgtatataa 121 ggccagggca acggaggagc aggttttgaa gtgaaaggca ggcaggtgtt ggggaggcag 181 ttaccggggc aacgggaaca gggcgtttcg gaggtggttg ccatggggac ctggatgctg 241 acgaaggctc gcgagctgtg agcagccaca gtgccctgct cagaagcccc aagctcgtca 301 atcaagctgg ttctccattt gcactcagga gcacgggcag gcgagtggcc cctagttctg 361 ggagcagagt atcagcatcc cagtcctagt cccgagccta aagcctcgcc tgcctgccca 421 gtgccaggat ggctaccatc acctgc //
GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS ECOTRWA 76 bp ss-tRNA RNA 15-SEP-1989 DEFINITION E.coli Trp-tRNA. ACCESSION M24301 KEYWORDS transfer RNA-Trp. SOURCE E.coli tRNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Eisenberg,S.P., Yarus,M. and Soll,L. TITLE The effect of an Escherichia coli regulatory mutation on transfer RNA structure JOURNAL J. Mol. Biol. 135, 111-126 (1979) STANDARD simple staff_entry COMMENT In the TrpX mutation the base at position 37 ("a") is not modified. In the "uga" suppressor mutation base 24 "g" is changed to "a". FEATURES from to/span description tRNA 1 76 Phe-tRNA modified 16 16 d modified 17 17 d modified 20 20 d modified 32 32 cm anticdn 34 36 Phe-tRNA anticodon cca modified 37 37 ms2i6a modified 46 46 gm7 modified 55 55 psi BASE COUNT 14 a 21 c 24 g 17 t ORIGIN 5' end of mature tRNA. 1 aggggcgtag ttcaattggt agagcaccgg tctccaaaac cgggtgttgg gagttcgagt 61 ctctccgccc ctgcca // LOCUS ECOTRWSUP 76 bp ss-tRNA RNA 15-SEP-1989 DEFINITION E.coli Trp-tRNA suppressor mutant. ACCESSION M25066 KEYWORDS transfer RNA-Trp suppressor. SOURCE E.coli tRNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Eisenberg,S.P., Yarus,M. and Soll,L. TITLE The effect of an Escherichia coli regulatory mutation on transfer RNA structure JOURNAL J. Mol. Biol. 135, 111-126 (1979) STANDARD simple staff_entry COMMENT In the TrpX mutation the base at position 37 ("a") is not modified. In Trp-tRNA base 24 "a" is changed to "g". FEATURES from to/span description tRNA 1 76 Phe-tRNA modified 16 16 d modified 17 17 d modified 20 20 d modified 32 32 cm anticdn 34 36 Phe-tRNA anticodon cca modified 37 37 ms2i6a modified 46 46 gm7 modified 55 55 psi BASE COUNT 15 a 21 c 23 g 17 t ORIGIN 5' end of mature tRNA. 1 aggggcgtag ttcaattggt agaacaccgg tctccaaaac cgggtgttgg gagttcgagt 61 ctctccgccc ctgcca // LOCUS ECOTRWX 76 bp ss-tRNA RNA 15-SEP-1989 DEFINITION E.coli Trp-tRNA TrpX mutation. ACCESSION M25067 KEYWORDS transfer RNA-TrpX. SOURCE E.coli tRNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Eisenberg,S.P., Yarus,M. and Soll,L. TITLE The effect of an Escherichia coli regulatory mutation on transfer RNA structure JOURNAL J. Mol. Biol. 135, 111-126 (1979) STANDARD simple staff_entry COMMENT In the Trp-tRNA the base at position 37 ("a") is modified, whereas it is not modified in the TrpX mutation. In the suppressor Trp-tRNA base 24 "g" is changed to "a". FEATURES from to/span description tRNA 1 76 Phe-tRNA modified 16 16 D modified 17 17 D modified 20 20 D modified 32 32 Cm anticdn 34 36 Phe-tRNA anticodon cca modified 46 46 Gm7 modified 55 55 psi BASE COUNT 14 a 21 c 24 g 17 t ORIGIN 5' end of mature tRNA. 1 aggggcgtag ttcaattggt agagcaccgg tctccaaaac cgggtgttgg gagttcgagt 61 ctctccgccc ctgcca // LOCUS HCYSA 313 bp ds-DNA SYN 15-MAR-1990 DEFINITION Human cystatin A gene. ACCESSION D00404 KEYWORDS cystatin A; cysteine-proteinase inhibitor. SOURCE Artificial gene. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 313) AUTHORS Kaji,H., Kumagai,I., Takeda,A., Miura,K. and Samejima,T. TITLE Studies on chemical synthesis of human cystatin A gene and its expression in Escherichia coli JOURNAL J. Biochem. 105, 143-147 (1989) STANDARD simple staff_entry COMMENT Submitted in computer readable form by H. Kaji on 14-Apr-1989. The expression plasmid for cystatin A was constructed by introducing the synthetic gene downstream of the tac promoter of an E. coli plasmid which is a derivative of pKK223-3 and the gene was expressed in E. coli JM109 without IPTG-induction. FEATURES from to/span description pept 6 302 cystatin A refnumbr 6 8 numbered codon 1 in [1] BASE COUNT 96 a 74 c 83 g 60 t ORIGIN 1 aattcatgat cccgggaggc ttaagcgaag ccaagccagc tacaccggag atccaggaga 61 tcgtcgacaa ggttaagccg cagctcgagg agaagaccaa cgagacatat ggtaagcttg 121 aggccgttca gtataagacc caggttgttg ccggcaccaa ctactacatc aaagttcgtg 181 ctggcgacaa caaatacatg catctgaagg tgttcaaaag cttacctggc cagaacgaag 241 atctggtact gactgggtac caggtcgaca agaacaagga cgacgagtta actggcttct 301 gatgaattct gca // LOCUS HUMPRG 1182 bp ss-mRNA PRI 15-MAR-1990 DEFINITION Human PRG gene (alt. hpg) encoding hemopoietic proteoglycan core protein, complete cds. ACCESSION X17042 M25538 KEYWORDS hemopoietic PG core protein. SOURCE Human adult hematopoietic myeloid cell, cDNA to mRNA, clone D-D2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1182) AUTHORS Stellrecht,C.M. and Saunders,G.F. TITLE Nucleotide sequence of a cDNA encoding a hemopoietic proteoglycan core protein JOURNAL Nucleic Acids Res. 17, 7523-7523 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly provided by C.M.Stellrecht, 27-JUL-1989. [1] Author Address: C.M.Stellrecht, University of Texas, M.D.Anderson Cancer Center, Department of Biochemistry and Molecular Biology, Box 117, 1515 Holcomb Blvd., Houston, TX, 77054. FEATURES from to/span description pept 25 501 proteoglycan core protein precursor /nomgen="PRG" /map="10" /hgml_locus_uid="LU0037W" sigp 25 105 proteoglycan core protein signal peptide matp 106 498 proteoglycan core protein signal 279 315 CS substitution signal rpt 304 357 Ser-Gly repeat BASE COUNT 359 a 223 c 224 g 376 t ORIGIN Chromosome 10q22.1. 1 gaattccgct agactaagtt ggtcatgatg cagaagctac tcaaatgcag tcggcttgtc 61 ctggctcttg ccctcatcct ggttctggaa tcctcagttc aaggttatcc tacgcagaga 121 gccaggtacc aatgggtgcg ctgcaatcca gacagtaatt ctgcaaactg ccttgaagaa 181 aaaggaccaa tgttcgaact acttccaggt gaatccaaca agatcccccg tctgaggact 241 gacctttttc caaagacgag aatccaggac ttgaatcgta tcttcccact ttctgaggac 301 tactctggat caggcttcgg ctccggctcc ggctctggat caggatctgg gagtggcttc 361 ctaacggaaa tggaacagga ttaccaacta gtagacgaaa gtgatgcttt ccatgacaac 421 cttaggtctc ttgacaggaa tctgccctca gacagccagg acttgggtca acatggatta 481 gaagaggatt ttatgttata aaagaggatt ttcccacctt gacaccaggc aatgtagtta 541 gcatatttta tgtaccatgg ttatatgatt aatcttggga caaagaattt tatagaaatt 601 tttaaacatc tgaaaaagaa gcttaagttt tatcatcctt ttttttctca tgaattctta 661 aaggattatg ctttaatgct gttatctatc ttattgttct tgaaaatacc tgcatttttt 721 ggtatcatgt tcaaccaaca tcattatgaa attaattaga ttcccatggc cataaaatgg 781 ctttaaagaa tatatatata tttttaaagt agcttgagaa gcaaattggc aggtaatatt 841 tcatacctaa attaagactc tgacttggat tgtgaattat aatgatatgc cccttttctt 901 ataaaaacaa aaaaaaaata atgaaacaca gtgaatttgt agagtggggg tatttgacat 961 attttacagg gtggagtgta ctatatacta ttacctttga atgtgtttgc agagctagtg 1021 gatgtgtttg tctacaagta tgattgctgt tacataacac cccaaattaa ctcccaaatt 1081 aaaacacagt tgtgctgtca atacctcata ctgctttacc tttttttcct ggatatctgt 1141 gtattttcaa atgttactat atattaaagc agaaatataa cc // LOCUS SUSMTA 4191 bp ds-DNA INV 15-MAR-1990 DEFINITION S.purpuratus metallothionein-A (SpMTA) gene, complete cds. ACCESSION M30606 KEYWORDS metal binding protein; metallothionein; metallothionein-A. SOURCE S.purpuratus sperm DNA, clone lambda-MT206. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 4191) AUTHORS Harlow,P., Watkins,E., Thornton,R.D. and Nemer,M. TITLE Structure of an ectodermally expressed sea urchin metallothionein gene and characterization of its metal-responsive region JOURNAL Mol. Cell. Biol. 9, 5445-5455 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Nemer, 11-DEC-1989. FEATURES from to/span description pept 757 787 metallothionein-A, exon 1 1908 1994 metallothionein-A, exon 2 3080 3156 metallothionein-A, exon 3 pre-msg 679 4154 MTA mRNA and introns IVS 788 1907 MTA, intron A IVS 1995 3079 MTA, intron B IVS 3189 3738 MTA, intron C signal 652 656 TATA box BASE COUNT 1194 a 758 c 821 g 1391 t 27 others ORIGIN 1 gtctacgtaa tttagtatgg aaaaataacc attttatcat acaggttgat catcaatata 61 tcattatgca gtgaacacca tgaatatttc gaaatggagg gatagttgca tttctggcgg 121 agatatccgg gtcgggaatc atctactttt attccccctc cccatttttt tcttcttctc 181 cttgttcttc ttctttttcg tttcttaatc atggtattaa tcatttgcaa atcaattaca 241 tgcctttata tatatataat aatcatggtg ctacatgatt ttgaaatact ttatggaaac 301 cttacgtttt accagacctt gacacaaagt tgacagtgtt acctttatca ttcacttccg 361 tactgtttca gcacatgagc aacactttgt tctttgaggg caaacgtttt gggggagggg 421 agggcatccc acggcgcttt aaaaacaaaa aggatttttg cttcccttcc atatcccgta 481 taatttatcc gacacgagcc tagtcatcac cattcctgat agaagcgtgt gcaaaactag 541 aaaggcccac ctagatcgag ctcctcaggc accgctaaca ggttgcctag ctcgtgccta 601 aacgtggtgt gtttacctat ttgtttgcac acggatattg taatgaaaac atataaaacc 661 ccgaatttca ccagtttttg tcacaagctg aagcgtttga gtgtgattta cattaccttc 721 tgaagaaaca tacaactcga gttcaatttc atcaccatgc ctgatgtcaa gtgtgtctgc 781 tgcaagggta agcaaaaaat cggaatcaat gttctcatta aaataccgag ttacagaata 841 ccgtgtacgt acggtgtagc gtttctagaa acaaaccgct cgatgttacg tgcatacgat 901 gatacgatac agccatggcc atgaccatga gaggcccccg atcacgacac acgcagctag 961 ctagaccgaa ttttccgtct ctttcgttcg ttaagcctct tatttgggta gggtcggctc 1021 tctctttaga gacggaagcc gactagacct agggcgcgtt catgcttgat tttgaaggca 1081 gaatgagcga tttgtaacgc tcatcgtaaa cgctcatcgt aatcgcagtt gcacggtgca 1141 ctgttcatgc tcacttctcc aggcaattct ccttgggtgc cttcagctgg attcgcaaac 1201 ctgtgtgcgc atgctacgag agcccataga ggtcagtgcg ttgaccccat ttaaacgcgt 1261 ttacaaattg taaacgcatc tttttgtgag tttacgatcg gcgatcatga atcggctttt 1321 tcggtgattc tacaaaatca agcatgaacg cgcccctata ctcattccct tgctaacacc 1381 aaggtgcatt tatttttann nntatnccna naannncccc aaatgtcatt ttacctcgtc 1441 gcatgttata ttgtaaatta ggcctaggct ataataaatc ggttccactt accaacttga 1501 ctgaaataat ttaatggtcc ctttttagga aattatgggt ctgaaatcga aatccatgtt 1561 gaaaatcttg caaaacgtat tttaactctc gctgcgcacg acgtaatcta aatctcatcc 1621 cttctagtct acaccgaagg caaaatgatt atgaaggaaa aactgtgcgc gagcttcacg 1681 ttcgttatca cagtttttct tcataatcat tttcatgttt tcttgcccgt ttcttcatac 1741 ttgttacatc aacagttatg tcaataacta agtttttacc aacaagcact gctggatttg 1801 tggtttcaaa ttgcaagatt gttattactt ttttaaaatt ttatatcatt atatagacta 1861 tatatatata tatatatata tatttttttt tttaattctg aacacagagg gcaaggagtg 1921 tgcctgcttt ggacaagatt gctgtaaaac tggtgaatgc tgcaaggatg gaacctgctg 1981 tggaatatgc acaagtaagc ctttcatcat tttcagtgaa ccctttaaaa ttaaagcagg 2041 cgcgtacgag gattttttaa gggggggggg ggggggttta acatttataa atcgggccga 2101 aaattctgca acgactcagc cgaccgctga ataagtgnng gggggggggg gggtagtaag 2161 aggacacttt ttattttttt ttggtctcga aattagaaaa tttgacattt tgctccgttg 2221 gggggggggg ggggaaggtc agctttttta ggtcagccat gggaattttt ttttttaata 2281 aaaaaaaaaa acatccagan nnnggggggg ggtttttaag gaaatttagg gggggggttt 2341 acacacaaaa aattaccaaa gggtgggggt ttaaccnnnn nnnncccctg tgtaagcncc 2401 tgcatcccag taattaaggc ttttttatac taaactgttt acacaaggct tttaaaagta 2461 tttatattca tagaaaaaaa ttgtcccaat tgctcatgca tttttttaat ttctttttac 2521 tctcaccttt aaatgggtgc aggcttaaat gtcaggctaa ctggcaatga ttatttgtgc 2581 tcattatcaa tattctagtg tatccatggt tttgttatgg tcgggggggg gggatctgac 2641 aaaaaatatt gacagccaag aatataggtc tatatgattt gtcagggaga tatgaatttc 2701 acatctccta ggttttattg ttgttgtcac caattaaatt attttaaaaa tcactgacaa 2761 gcttggaaaa ggattttgaa aataatttaa aacatgttca tctcattttc ctggtgaatt 2821 ttgggtctga aaagcgctat ataagaacta gctattatta tatatatata tatatatata 2881 taattagatt ttttattttt ttggaggggg gggggggggn ctttttccat gtatgcctaa 2941 atgaggggcg cagaccaatt gtgcccttta tatatcctgc tgtgtagtct gcatgaaaga 3001 gaaaatgcag ttataccaac aatagaatag agacatttca atttgatgtt attacttggt 3061 taattttatt tgtccacaga cgctgcgtgc aagtgtgcta atggctgcaa gtgtggaagc 3121 ggctgctcat gcactgaggg gaactgtgca tgctagacga gcttggccct ttgctatagg 3181 gatggcaggt gagattttat ttattttcat ggctacatag taaaaattgt ttccaggcaa 3241 aaacgttagc tagttatggg atatgacagg gtctgaaatg taagactgta gatttgcact 3301 gcataaagct gtgccatcat ttttttttaa atagaatttt ccgagagagc atcaaaatta 3361 atgtccaaat gtcgaaagcc aactgaatat gttaaatatt ggctgagatg tgaccatctg 3421 aataaatagg ttccctatag gaactcagtg ttattttgct gtggttcata acatcacgna 3481 ccagagcaaa ataacactga gttcctacac aaaatctatt tattacatta ctgcattgcg 3541 atatacaaaa aaggtaaaat tagaaaatct ttgcaagtta tggcaatatg tatagacctt 3601 ccgtgtcttt catgactttt gtatattttt ttcaacaacc ggaacaaatt cattaaccat 3661 ttctgtgttt tcatactttt ttgtaaaaat attttcaagc attttgtttt tctctccgtg 3721 tctgtttctc ttctacagat tgagcataca gctgatgaag tcactcagcg catctgccaa 3781 catgctccaa gcagacctga ctttgtatat actttcgacc cgatgtatta tattttttgt 3841 tgtaagatag ttgtatatta agttaattta atctttattt ttttctgaaa atatgagact 3901 aattggcatt aataggacta cttaaacaac aattatttat taaggctgac agtaatgcct 3961 gatagaattt cacttcattt tttgctgccc tgttttgtaa attcttcagt ctgtttctaa 4021 cttctatcaa gttgcaaatt tggaatgaaa ataataataa taatacatgt ggagagtact 4081 tgtagtgcgc aaactccaca tgactaatgt gttaaagact cgctcatgat taaatttaaa 4141 taatgctaca aagtatattc agtatgcgtt gcttagtttt ttctcatgat t // LOCUS SYNCYSTA 318 bp ds-DNA SYN 15-MAR-1990 DEFINITION Artificial cystatin alpha gene. ACCESSION X13226 KEYWORDS artificial gene; cystatin; cysteine-proteinase inhibitor. SOURCE Artificial gene. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 318) AUTHORS Katunuma,N., Yamato,M., Kominami,E. and Ike,Y. TITLE Total synthesis of the cystatin alpha gene and its expression in E. coli JOURNAL FEBS Lett. 238, 116-118 (1988) STANDARD simple automatic COMMENT *source: library=pBR322; clone=pTPI-010; Data kindly reviewed (23/5/89) by Katunuma N. FEATURES from to/span description pept 6 317 cystatin alpha (AA 1 - 103) BASE COUNT 99 a 63 c 76 g 80 t ORIGIN 1 aattcatgga cccgggtact actggtatcg ttggtggtgt ttctgaagct aaaccggcta 61 ctccggaaat ccaggaagtt gctgacaaag ttaaacgtca gctggaagaa aaaactaacg 121 aaaaatacga aaaattcaaa gttgttgaat acaaatctca ggttgttgct ggtcagatcc 181 tgttcatgaa agttgacgtt ggtaacggtc gtttcctgca catgaaagtt ctgcgtggtc 241 tgtctggtga cgacgacctg aaactgctgg actaccagac taacaaaact aaaaacgacg 301 aactgactga cttctagg //
GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS YSCPDC1UP 1193 bp ds-DNA PLN 15-MAR-1990 DEFINITION Yeast PDC1 gene for pyruvate decarboxylase (EC 4.1.1.1.). ACCESSION X14016 KEYWORDS PDC1 gene; pyruvate decarboxylase. SOURCE yeast (Saccharomyces cerevisiae). ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1193; enum. 1 to 1193) AUTHORS Butler,G. and McConnell,D.J. TITLE Identification of an upstream activation site in the pyruvate decarboxylase structural gene (PDC1) of Saccharomyces cerevisiae JOURNAL Curr. Genet. 14, 405-412 (1988) STANDARD simple automatic COMMENT *source: clone=YIpDC1-2; The sequence overlaps with that reported by Kellermann et. al. in Nucl. Acids Res. 14:8963-8977(1986) X04675. EMBL features not translated to GenBank features: key from to description SITE 555 569 RPG box (upstream activation site) FEATURES from to/span description pept 1191 > 1193 pyruvate decarboxylase BASE COUNT 362 a 251 c 206 g 374 t ORIGIN 1 gatcaagcga gtccatcggc gacctgccca ccgctaaact ctcttaacat acggccgtac 61 agaaaaaacg gcactggagt gagcagtgtc tccgacgatt tggatatcga tttcaacgat 121 agttttgcct cagaagaaag ttataaacgt gccgctgcag tgacatctac cctaaaggcc 181 agaattgaaa aaatgaaggc caaatcaagg cgggaaggga caaccaggac gtaaagggta 241 gcctccccat aacataaact caataaaata tatagtcttc aacttgaaaa aggaacaagc 301 tcatgcaaag aggtggtacc cgcacgccga aatgcatgca agtaacctat tcaaagtaat 361 atctcataca tgtttcatga gggtaacaac atgcgactgg gtgagcatat gttccgctga 421 tgtgatgtgc aagataaaca agcaagacag aaactaactt cttcttcatg taataaacac 481 accccgcgtt tatttaccta tctttaaact tcaacacctt atatcataac taatatttct 541 tgagataagc acactgcacc cataccttcc ttaaaaacgt agcttccagt ttttggtggt 601 tctggcttcc ttcccgattc cgcccgctaa acgcataatt ttgttgcctg gtggcatttg 661 caaaatgcat aacctatgca tttaaaagat tatgtatgct cttctgactt ttcgtgtgat 721 gaggctcgtg gaaaaaatga ataatttatg aatttgagaa caattttgtg ttgttacggt 781 attttactat ggaataatca atcaattgag gattttatgc aaatatcgtt tgaatatttt 841 tccgaccctt tgagtacttt tcttcataat tgcataatat tgtccgctgc ccgtttttct 901 gttagacggt gtcttgatct acttgctatc gttcaacacc accttatttt ctaactattt 961 tttttttagc tcatttgaat cagcttatgg tgatggcaca tttttgcata aacctagctg 1021 tcctcgttga acataggaaa aaaaaatata taaacaaggc tctttcactc tccttggaat 1081 cagatttggg tttgttccct ttattttcat atttcttgtc atattctttt ctcaattatt 1141 atcttctact cataacctca cgcaaaataa cacagtgcaa atcaatcaaa atg //
GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS APH5A10A 721 bp ss-RNA VRL 21-FEB-1990 DEFINITION Foot and mouth disease virus 5' non-coding region, typeA-12. ACCESSION M32257 KEYWORDS . SOURCE Foot and mouth disease virus (serotype A-12) cDNA to viral RNA. ORGANISM Foot and mouth disease virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae; Aphthovirus. REFERENCE 1 (bases 1 to 721) AUTHORS Clarke,B.E., Brown,A.L., Currey,K.M., Newton,S.E., Rowlands,D.J. and Carroll,A.R. TITLE potential secondary and tertiary structure in the genomic rna of foot and mouth disease virus JOURNAL Nucleic Acids Res. 15, 7067-7079 (1987) STANDARD simple staff_entry BASE COUNT 178 a 202 c 181 g 160 t ORIGIN 1 ccccctaagt tctaccgtcg ttcccgacgc aaaagggagg taaccacaag cttgacgccg 61 tctttcccga cgttaaaggg tagtaaccgc aagcttctac cgcccatccc ggcgttaaaa 121 gggtgtaacc acaagatgca ccttcgtccc ggaagtaaaa cggcaatttc acacagtttt 181 gcccgttttc acgagaaacg ggacgtctgc gcacgaaacg cctgtcgctt gaggaggact 241 tgtacaaaca cgatctaagc aggtttcccc aactgacaca aaccgtgcaa tttggaactc 301 cgcctggtct ttccaggtct agaggggtga cactttgtac tgtgtttggc tccacgctcg 361 gtccactggc gagtgttagt aacagcactg ttgcttcgta gcggagcatg atggccgtgg 421 gaactcctcc ttggtaacaa ggacccacgg ggccgaaagc cacgtccaat cggacccatc 481 atgtgtgcaa ccccagcaca gcaacttttc tgcgaaactc acttcaaggt gacactgata 541 ctggtactca aacactggtg acaggctaag gatgcccttc aggtaccccg aggtaacacg 601 cgtcactcgg gatctgagaa ggggactggg gcttctataa aagcgtccag gttaaaaagc 661 ttctatgcct gaataggtga ccggaggccg gcaccttttc tttacagcca ctgactttat 721 g // LOCUS APH5A10B 685 bp ss-RNA VRL 21-FEB-1990 DEFINITION Foot and mouth disease virus 5' non-coding region, typeA-10. ACCESSION M32258 KEYWORDS . SOURCE Foot and mouth disease virus (serotype A-10) cDNA to viral RNA. ORGANISM Foot and mouth disease virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae; Aphthovirus. REFERENCE 1 (bases 1 to 685) AUTHORS Clarke,B.E., Brown,A.L., Currey,K.M., Newton,S.E., Rowlands,D.J. and Carroll,A.R. TITLE potential secondary and tertiary structure in the genomic rna of foot and mouth disease virus JOURNAL Nucleic Acids Res. 15, 7067-7079 (1987) STANDARD simple staff_entry BASE COUNT 169 a 194 c 174 g 148 t ORIGIN 1 ccccctaagt tttgccgtcg tttccgacgt aaaaggaagt aatcacaagc ttgacaccgc 61 ctgtcccggc ggtaaaggga agtaaccgca agcttactac cgcctacccc ggcgttaacg 121 gggaacaacc acaaacacac ttttgcccat ttttgcgaga aatgggacgt ctgcgcacga 181 aacgcgccgt cgcttgagga ggacttgtac aaacacgatc tacgcaggtt tccccaactg 241 acacttaccg tgcaacttga aatcccgcct ggtctttcca ggtctagagg ggtgacaatt 301 tgtactgtgg ttgactccac gctcggtcca ctggcgagtg ctagtagcag cactgttgtc 361 tcgtagcgga gcatgatggc cgtgggaatc cctccttggt aacaaggacc cacggggcca 421 aaagccacgt cctaacggac ccatcatgtg tgcaacccca gcacggcgac tttcctgtga 481 aacccacttt aaggtgacac tgatactggt actcaaacac tggtggcagg ctaaggatgc 541 ccttcaggta ccccgaggta acacgcgaca cttgggatct gagaagggga ctgaggcttc 601 tttaaaagcg ctcggtttaa aaagcttcta cgcctgaaca ggtgaccgga ggccggcacc 661 tttctctaca accactgata ttatg // LOCUS CHKHMG17 1327 bp ss-mRNA VRT 15-MAR-1988 DEFINITION Chicken mRNA for high mobility group protein HMG-17. ACCESSION Y00416 M32256 KEYWORDS DNA binding protein; high mobility group protein. SOURCE chicken (Gallus gallus). ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1327; enum. 1 to 1327) AUTHORS Landsman,D. TITLE ; JOURNAL Unpublished (1987) NIH/NCI, Bethesda, MD 20892, USA. STANDARD simple automatic REFERENCE 2 (bases 1 to 1327; enum. 1 to 1327) AUTHORS Landsman,D. and Bustin,M. TITLE Chicken Non-Histone Chromosomal Protein HMG-17 cDNA Sequence JOURNAL Nucleic Acids Res. 15, 6750-6750 (1987) STANDARD simple automatic COMMENT *source: developmental stage=embryo. FEATURES from to/span description pept 175 447 high mobility group protein HMG-17 (AA 1-90) BASE COUNT 355 a 320 c 309 g 343 t ORIGIN 1 gaattccgcc agcgcagcga gccggccgcc agccccgccg cgccgccccg ctctccccct 61 cggccctccc ccgcttctcg ccgccaccga gcgagcccgg ctgcccgccc ccgcccgccc 121 cctccgctcg ctctctccct cctcgcacaa cacacgcacg cgccgcccgg agctatgccg 181 aagagaaagg ctgaaggaga taccaagggc gataaggcca aagttaagga tgagccacaa 241 cggagatcgg caaggttatc tgctaaacct gcccctccga agccagagcc taaacctaaa 301 aaggcagctc caaagaagag tgagaaggtg cccaagggaa agaaggggaa agctgatgct 361 ggcaaggagg gaaacaaccc tgcagaaaat ggagatgcca aaacagacca ggcacagaaa 421 gccgaaggtg ctggtgatgc caagtaaaat gtgtgaattt ttgataactg tgtacttctg 481 gtgactgtac agtttgaaat actatttttt atcaagtttt ataacaatgc agaattttgt 541 tttacttttt tttaagctat gttgttagca cacagaccgc tttgttgttg tgttttgagg 601 ggggggcagt gggacaaacg tcacttaatc tgtttcttgg aacctaaatt ttaaaagttt 661 accccttccc agttttttag aaggactctt cctaaatgga gcaggaaggg attccttcgt 721 gctgcacacc tcttccgttt tgtggaccgc atcagagtga acggaagctc ccgagatgcc 781 tgttgccaac ttcagaactg cagtttgcag tgccctctgc gtttcctttc atgccctccc 841 tttttgccta gagcctatca ctccgaaata cagcagacat ggcatgttgg gactcaccac 901 tctaaatgca ttgtcaggtg atctggactt ctggtgtcta atttgggata taatagctct 961 aaaaggagct gcatttcctc tttcatattg tagatctaca gattaaggaa tctgcagttt 1021 ttaatttttc ctcgcaaagt cagggtagat ttgtgaagag ttgttaaaca acatgctaaa 1081 tgtgaaagtg tccgccctca ctctaaacat ttccctctac aagtatacaa aaatgaagat 1141 ttgtcggttt tatagcaacc tttatgtttg ggtagtccat gaagggaggg gagtttgaca 1201 gttgttgtaa aatgttgcag attgtagccc atgtcctgcc taaattacca tgattgttta 1261 tgaaaagtac ctttaataaa gctggatacg gtttggcttg gaaaaaaaaa aaaaaaaaac 1321 ggaattc // LOCUS HUMSAU3E 50 bp ds-DNA PRI 28-FEB-1990 DEFINITION Human Sau3A related sequence, clone pUCXba30. ACCESSION M32265 KEYWORDS alphoid-like repetitive sequence; repetitive sequence. SOURCE Human DNA, clone pUCXba30. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 50) AUTHORS Okumura,K., Kiyama,R. and Oishi,M. TITLE Sequence analysis of extrachromosomal Sau3A and related family DNA: Analysis of recombination in the excision event JOURNAL Nucleic Acids Res. 15, 7477-7489 (1987) STANDARD simple staff_entry BASE COUNT 16 a 11 c 7 g 16 t ORIGIN 1 tgaaactctc tttctttgga ttctgcatct aaaaaccaaa cggaagcatt // LOCUS HUMSAU3F 88 bp ds-DNA PRI 28-FEB-1990 DEFINITION Human Sau3A related sequence, clone pUCXba25. ACCESSION M32266 KEYWORDS alphoid-like repetitive sequence; repetitive sequence. SOURCE Human DNA, clone pUCXba25. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 88) AUTHORS Okumura,K., Kiyama,R. and Oishi,M. TITLE Sequence analysis of extrachromosomal Sau3A and related family DNA: Analysis of recombination in the excision event JOURNAL Nucleic Acids Res. 15, 7477-7489 (1987) STANDARD simple staff_entry BASE COUNT 22 a 18 c 21 g 27 t ORIGIN 1 cagcagtttc caatcactct ttctgtggaa tctgcaagtg atatttgggc ctctctgagg 61 atttcgttgg aaacgggata aaatgcac // LOCUS HUMSAU3G 1192 bp ds-DNA PRI 28-FEB-1990 DEFINITION Human Sau3A related sequence, clone pUCXba34. ACCESSION M32267 KEYWORDS alphoid-like repetitive sequence; repetitive sequence. SOURCE Human DNA, clone pUCXba34. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1192) AUTHORS Okumura,K., Kiyama,R. and Oishi,M. TITLE Sequence analysis of extrachromosomal Sau3A and related family DNA: Analysis of recombination in the excision event JOURNAL Nucleic Acids Res. 15, 7477-7489 (1987) STANDARD simple staff_entry BASE COUNT 328 a 239 c 273 g 352 t ORIGIN 1 cctgtggtgg aaaaggaact atctactcat aaaagctaga tggaagcatt ctcagaaact 61 actttgtgat gattgctttc aggtcacaga gttgaacatt ccctttgata gagccgtttg 121 gagacacaat tttggttcaa tcggcaaggg gagatttgga ccgctttgag gcctatgcga 181 gtagaggaaa tcactgccca taaaaactag acagcagcat tctcaggaaa cactttgtga 241 cgattgagtt gaactcacag agcggaacat tcctttggat ggagcagttt tgaaacacac 301 tttttgtaga atctgcaagt ggacagttgg acctccctga ggatttcttt ggaaacggga 361 taacgtcacc taactaaaca gaagctttcg cagaaacttc tttgtgacgt ttgcattcaa 421 agtccagagt tgaaccttcc cttgatagtt cacgtttgaa acactctttt tgtaggatct 481 gcaagtggat attgggagca ctttgaggcc ttcgttcgaa atgggtatat cttcacataa 541 aatccagaca gaagccttct cagaaacttc tctgtgatga ttgcatgcaa ctcacagagt 601 tgaacattcc tttgcataga gcagtttcga aactctcttt tttctgaatc tgcacatgga 661 taggtgcaac tctgtgaaga tttccttgga aacgggaata tcttcactta aagagtaaac 721 ggatgccttc tcagaaactt ctttgtgagg catgtgttca actcccagac tttaaccttg 781 cttttcatag agcagttttg aaacattctt ttcgtagagt ctccaagtgg acgtttggag 841 cgctttcagg cctgtggtgg aaaaggaaat atcttcacct caaaactaga gagaagcatt 901 gtcagaaacg tctttgtgat gatggcattc aactcacgga gttgaaggtt ccttttgata 961 cagcagtttg gaaacactct ttcagtggga cctgcaagcg gatatttggg gctctttgga 1021 gatttcgatg gaaaaggcat aatcttccca taaaagctaa acggaagcgt gctcagagcc 1081 ttcttggtga tgtttgcatt caactcacag agttgtactt tcctttcgac agagcagctt 1141 tgaaaccctc tctttctaga atctgcaagt ggacatttgg agggcttcga gg // LOCUS HUMSAU3H 91 bp ds-DNA PRI 28-FEB-1990 DEFINITION Human Sau3A related sequence, clone pUCXba34. ACCESSION M32268 KEYWORDS alphoid-like repetitive sequence; repetitive sequence. SOURCE Human DNA, clone pUCXba34. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 91) AUTHORS Okumura,K., Kiyama,R. and Oishi,M. TITLE Sequence analysis of extrachromosomal Sau3A and related family DNA: Analysis of recombination in the excision event JOURNAL Nucleic Acids Res. 15, 7477-7489 (1987) STANDARD simple staff_entry BASE COUNT 20 a 15 c 26 g 30 t ORIGIN 1 gttgaaggtt ccttttgata caacagtttg gaaacactct ttcagtggga cctgcaagcg 61 gatatttggg gctctttgga gatttcgatg g // LOCUS NEUMTINT1 1377 bp ss-mRNA ORG 27-FEB-1990 DEFINITION N.crassa mitochondrial RNA precursor intergenic region, segment 1. ACCESSION M32263 KEYWORDS . SEGMENT 1 of 3 SOURCE N.crassa mitochondrial cDNA to precursor RNA. ORGANISM Mitochondrion Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa. REFERENCE 1 (bases 1 to 1377) AUTHORS Agsteribbe,E. and Hartog,M. TITLE Processing of precursor RNAs from mitochondria of Neurospora crassa JOURNAL Nucleic Acids Res. 15, 7249-7263 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 ATPase 6 (AA at 1) pept 1369 > 1377 MAL protein BASE COUNT 456 a 213 c 277 g 431 t ORIGIN 1 ctacattaga acagtcctgt gcagtccttt aacaaaatta cgctgcttta ggtgacacct 61 gaaagcggtc caccctcctc ttacgtatcc gaaggatact aaaggtcccc aatatcaagt 121 taaagtcaca tatgcaatca aatagttaaa cgtctcgttg taacttgata agattactct 181 attcgaggtc tcttatagca aatagagaga ggaatgccca ctcgcaaacg ggttcggtac 241 ttaaaatgaa cagagatacg caagttcgca tctgaaaaag tggacggaga aggaacttat 301 ggccactcta ccgagtggtt gtagcaaatg cagaatcgac ccgctatagg tttaacttct 361 cctaggatgt agtttgggag gcacgcccaa actataaaag tgctaagggg taggaaggtt 421 cgaatcctga gttcccacca ccatgggtgg cgtgcctcga gaagggggcg tgcctctgga 481 tagcttgcgc ttattttttt ttttagccgt ccccccgccc tgcagtactg cagggggggg 541 aagctaaaaa aaattaaaga ggtggccaga ggctaaaaaa aaaaatgcta gctttagcta 601 aaaaaaaaat aaccgataaa ccaaattttt aatagcatat atagacgact aatttaatta 661 tattagtctg agcggagcta agctgtgcgg attaggatta ggattaatcc taatcgtact 721 taatgattat ctttaataat atgaaataaa tatgaatatt tatttcacca tgtaacatta 781 ataatgttaa tgtgtgattt aatttggtac gtatcacgta aggggaatga tggatgatac 841 attatgctat gtaatgatgt aatgtaatgt aatgtaatgt tatatccact atgtactgca 901 aatgtaatca tctgatagat aatataaaat gtaacttaat gtaacttaac ttaatgtaat 961 gtaatgtaac ttaatgtaac ttaatgttat gtaactgtaa tgtaatattt tgtaatgtaa 1021 ctgtaatgta acttaatata atgtaactgt aatgtaatat aatgtacagt aatgtaactg 1081 taatgtaatg ttacattaca tgtaatgtaa tgtaatgtat tgtaatgtaa tctttatatg 1141 taatgtacta aagattacga tagctagcca gttatatact ggctagctat cgtatgatta 1201 ttgtagccat tagggatagc ctaatggcta cttgattatc attagatata ctgcaacagc 1261 agtaaataca ctgttcggga gcctcagtgg tatttgctgc tttgcggtgg tggaattgaa 1321 gaggttacta gattctagtt aatttttgat tccaccattg tatatataat gatacaa // LOCUS NEUMTINT2 712 bp ss-mRNA ORG 28-FEB-1990 DEFINITION N.crassa mitochondrial RNA precursor intergenic region, segment 2. ACCESSION M32264 KEYWORDS . SEGMENT 2 of 3 SOURCE N.crassa mitochondrial cDNA to precursor RNA. ORGANISM Mitochondrion Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa. REFERENCE 1 (bases 1 to 712) AUTHORS Agsteribbe,E. and Hartog,M. TITLE Processing of precursor RNAs from mitochondria of Neurospora crassa JOURNAL Nucleic Acids Res. 15, 7249-7263 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 MAL protein (AA at 1) pept 704 > 712 COII protein BASE COUNT 199 a 149 c 175 g 189 t ORIGIN 225 bp after segment 1. 1 gttgcatagt aatgatagag attaggaata aaatgatgta tggctgctta aataaaaaat 61 atggctgctc ttcctgctta gctagctagc tagcttcgag ggaccccacg aatacggccc 121 acacacgaag gtgtgtgggg tattcgtggg gggccccctc ctacttcctg cctacgaagt 181 ctaggcaggc aggaagcagc cttaataatt tagctagcca taatatcgcg accaccctcc 241 tccactcaac cccccccttg gcggccccca tctttccccc gccctgcaat actgcagggg 301 ggcataatta aatgtgatgg ggaccttgag caaaggaggg ggtacgcgag tggaggaggg 361 tggtactttt ttgggaggga agctgagcga agctgagcga agctgagcga agtgagcgaa 421 gtgtgcgaag ctgacggaag gtaattcatt tttttgggcc cccgccccga agggcggtcc 481 cccgaaaaaa caaaggacgc tagcgtagcc taaccttatg agcagcaggc tataaaaacc 541 cattaggcac agggtatact tcgtttagtg gataagaatt agcctagcct taatatacaa 601 tacagtttaa tgttatattt aaattctata ttttatgaaa agattataat atatttatta 661 ttatggttta atattcaata tagaatattc tatttctttt tttatgggat ta // LOCUS NEUMTINT3 691 bp ss-mRNA ORG 28-FEB-1990 DEFINITION N.crassa mitochondrial RNA precursor intergenic region, segment 3. ACCESSION M33670 KEYWORDS . SEGMENT 3 of 3 SOURCE N.crassa mitochondrial cDNA to precursor RNA. ORGANISM Mitochondrion Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa. REFERENCE 1 (bases 1 to 691) AUTHORS Agsteribbe,E. and Hartog,M. TITLE Processing of precursor RNAs from mitochondria of Neurospora crassa JOURNAL Nucleic Acids Res. 15, 7249-7263 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 COII protein (AA at 1) BASE COUNT 181 a 154 c 198 g 158 t ORIGIN 754 bp after of segment 2. 1 gaacagtaag acgataaaat agtttttggc taaataaagg aactcggggg accccacgaa 61 tacggcccac acacgaaggt gtgtggggta ttcgtggggc cctatgtatt tgagtatgaa 121 gccatctagt ggttatctct cccccccctc cccgcaggtt tgcggggggg agcgaggggg 181 tagcgataaa aagtttctaa tatgaagagg atgacttatg gaattaactc cacccccccc 241 ccccttaagg gggggagtgg agtggaacgg aatgtggaga cgccgcgtgc cccccgcaaa 301 tttgcggggg gggcgggtgg atggaacatc cgggaggtgg tgggttatat tataaacaat 361 attcggggga accccgcccc ttcgggggcg gaggccatta actggaaagc acccccccct 421 taaggggggt ggcttttaat tattattatt tttataggtt aattaggttt aaaaggctac 481 ccgccccccc ccctgcagta ctgcagggcg gggaagggtt aaaataacaa gcaagttcct 541 gaagtctggc gtagctagcg gaaggaacct tgccttaaat ccaaaaacct tttaaagagt 601 taccaagact gaaccggggg ctagcccccg attaggtaat taataagttt atgtatatct 661 agtaagggta taactctaat cccctgcagt a // LOCUS PMUMUC1 965 bp ds-DNA PHG 15-MAR-1990 DEFINITION Bacteriophage mu C operon 5' region involved in late gene transcription. ACCESSION Y00419 M32261 KEYWORDS C operon; activator protein; late gene transcription; regulatory protein; trans-activator. SOURCE Bacteriophage mu DNA, clone pWM6. ORGANISM Bacteriophage mu Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 965) AUTHORS Stoddard,S.F. and Howe,M.M. TITLE DNA sequence within the mu C operon JOURNAL Nucleic Acids Res. 15, 7198-7198 (1987) STANDARD simple automatic REFERENCE 2 (bases 161 to 360) AUTHORS Stoddard,S.F. and Howe,M.M. TITLE Characterization of the C operon transcript of bacteriophage mu JOURNAL J. Bacteriol. 172, 361-371 (1990) STANDARD simple staff_entry COMMENT The mu C gene product is a transactivator of mu late transcription. see also X03992 for Mu C gene Data kindly reviewed (27-OCT-1987) by Howe M.M. EMBL features not translated to GenBank features: key from to description SITE 53 43 (C) pot. MuA binding site SITE 123 111 (C) pot.IHF binding site SITE 185 173 (C) pot. Mu repressor binding site SITE 193 180 (C) pot. Mu repressor binding site SITE 183 193 pot. MuA binding site INVREP 190 197 inverted repeat A INVREP 203 210 inverted repeat A' TERM 190 210 transcription terminator of an early mRNA PRM 225 230 pot. -10 region SITE 237 240 region of 5' end of Mu C mRNA (by S1 mapping) RBS 251 253 pot. ribosome binding site SITE 414 426 pot. IHF binding site SITE 454 467 pot. Mu repressor binding site RBS 462 465 pot. ribosome binding site INVREP 782 800 inverted repeat B SITE 820 811 (C) pot. MuA binding site SITE 846 858 pot. Mu repressor binding site SITE 903 890 (C) pot. IHF binding site SITE 893 902 pot. MuA binding site FEATURES from to/span description ORF 262 480 open reading frame 1 ORF 278 406 open reading frame 2 ORF 390 97 (c) open reading frame 3 ORF 473 835 open reading frame 4 ORF 845 964 open reading frame 5 ORF 933 739 (c) open reading frame 6 BASE COUNT 183 a 230 c 269 g 283 t ORIGIN 1 ctacattcca cgcggtcagg ctctggactc gctgattcgg gatttgcgta tctggaatga 61 ttttaacggt cgcaacgtca gcgaactgac cacccgttat ggtgtcactt tcaatacggt 121 gtacaaagcc attcgccgga tgcgacggct gaaataccgc cagtaccagc cctcacttct 181 gtaaacagta aagccggtta atccggcttt ttttacgtcc tcaatatcct gtgatgaata 241 accgtaccgg ggaaaatcat catgggtaaa ggctggaatg catcctttca tcttggcaga 301 cgtgagcggc tgcgtcagga ggttttgcac cgtgtcgccg gtgggccacg tcctgcgccc 361 cgcgactata ccggtcatga tggcacccac ggcagttatt acatgaaagg ctggcagtca 421 gttgatatgc cggaaattct tcatcattgc ctgctttaca gggagaaaca ttatgtctga 481 gcgttctgct cgtcagtggc ctgatttttt atcggtggtg ttactggcgc tgttattgtg 541 gataagtctg ttttgtggct ggcgcgcgct gatgttttgc tgtgcctcgg ttttcagtgt 601 tgctttatgt gttgccgctg attgtctgga tgcgctgata atgagctgcc gggtgcctga 661 acattttgcc cggttcgtct ggcctctgac ctggctgggg agtctgtccg ggctgggatt 721 agccgtgatg gcaacgtctc aactgaaaac gggtcctgag catgtcatct gggcgctggc 781 cggattgctg acgttctggc tctcatttcg ttttcgcgct cgtctgttcg ggtaaagggg 841 caccatgtac agaaaattca gtgatgaatg tttcgggccg tccacgctga ttaatgcgat 901 aaaagtgatt gcccttgtgg ttctgataac catcagtgcc gtggtgtatc tttctgtctg 961 ttaac // LOCUS PNSTETFA1 97 bp ds-DNA BCT 28-FEB-1990 DEFINITION Plasmid pNS1981 flanking region of tet resistance gene, 5' end. ACCESSION M32262 KEYWORDS . SEGMENT 1 of 2 SOURCE Plasmid pNS1981 from B.subtilis. ORGANISM Plasmid pNS1981 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 97) AUTHORS Sakaguchi,R. and Shishido,K. TITLE A unique DNA structure of the junction of homologous and nonhomologous regions between tetracycline-resistance plasmid pNS1981 and kanamycin-resistance plasmid pUB110 JOURNAL Nucleic Acids Res. 15, 7202-7202 (1987) STANDARD simple staff_entry BASE COUNT 38 a 8 c 18 g 33 t ORIGIN 1 tgggaacgga aaaattattt tattaaagag tagttcaaca aacgggccat attgttgtat 61 aagtgatgaa atactgaatt taaaacttag tttatat // LOCUS PNSTETFA2 94 bp ds-DNA BCT 28-FEB-1990 DEFINITION Plasmid pNS1981 flanking region of tet resistance gene, 3' end. ACCESSION M33669 KEYWORDS . SEGMENT 2 of 2 SOURCE Plasmid pNS1981 from B.subtilis. ORGANISM Plasmid pNS1981 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 94) AUTHORS Sakaguchi,R. and Shishido,K. TITLE A unique DNA structure of the junction of homologous and nonhomologous regions between tetracycline-resistance plasmid pNS1981 and kanamycin-resistance plasmid pUB110 JOURNAL Nucleic Acids Res. 15, 7202-7202 (1987) STANDARD simple staff_entry BASE COUNT 27 a 14 c 16 g 37 t ORIGIN 1 tttataacag gaattcaaat ctttttgttc cattaaaggg cgcgattgct gaataaaaga 61 tacgagagac ctctcttgta tcttttttat tttg // LOCUS STMTGFMA 147 bp ds-DNA BCT 28-FEB-1990 DEFINITION S.rimosus initiator tRNA gene. ACCESSION M32254 KEYWORDS transfer RNA-fMet. SOURCE Streptomyces rimosus DNA. ORGANISM Streptomyces rimosus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 147) AUTHORS Gamulin,V. and Soell,D. TITLE The initiator tRNA genes from Streptomyces rimosus JOURNAL Nucleic Acids Res. 15, 6747-6747 (1987) STANDARD simple staff_entry FEATURES from to/span description tRNA 51 125 tRNA-fMet BASE COUNT 28 a 39 c 51 g 29 t ORIGIN 1 gtgatttgcg ctggtcgacg gcggtggtgc gtagactggt cacacaacgg cgcggggtgg 61 agcagctcgg tagctcgctg ggctcataac tcagaggtcg caggttcaaa tcctgtcccc 121 gctactgatg aagaaaggcc cggatcc // LOCUS STMTGFMB 147 bp ds-DNA BCT 28-FEB-1990 DEFINITION S.rimosus initiator tRNA gene. ACCESSION M32255 KEYWORDS transfer RNA-fMet. SOURCE Streptomyces rimosus DNA. ORGANISM Streptomyces rimosus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 147) AUTHORS Gamulin,V. and Soell,D. TITLE The initiator tRNA genes from Streptomyces rimosus JOURNAL Nucleic Acids Res. 15, 6747-6747 (1987) STANDARD simple staff_entry FEATURES from to/span description tRNA 51 125 tRNA-fMet BASE COUNT 28 a 48 c 49 g 22 t ORIGIN 1 gcgtgggccg ggcaccacct ccgtgccgtg gtaccgtgga cacacaacgg cgcggggtgg 61 agcagctcgg tagctcgctg ggctcataac ccagaggtcg caggttcaaa tcctgtcccc 121 gctactggaa acgaaaggcc cggatcc //
GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS MUSIGR 2003 bp ss-mRNA ROD 18-JUL-1989 DEFINITION Mouse interferon-gamma receptor mRNA, complete cds. ACCESSION M25764 KEYWORDS interferon-gamma receptor. SOURCE Mouse (strain BALB/c) plasmacytoid lymphosarcoma cell line ABPL4, cDNA to mRNA, clone 45. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2003) AUTHORS Kumar,C.S., Muthukumaran,G., Frost,L.J., Noe,M., Ahn,Y.H., Mariano,T.M. and Pestka,S. TITLE Molecular characterization of the murine interferon gamma receptor cDNA JOURNAL J. Biol. Chem. 264, 17939-17946 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence for [1] kindly submitted by S.Pestka, 30-JUN-1989, for release after publication. FEATURES from to/span description pept 31 1464 interferon-gamma receptor precursor sigp 31 111 interferon-gamma receptor signal peptide matp 112 1461 interferon-gamma receptor BASE COUNT 534 a 462 c 489 g 518 t ORIGIN Chromosome 10. 1 ccccctgtca gaggtgtccc tcgcgcagga atgggcccgc aggcggcagc tggcaggatg 61 attctgctgg tggtcctgat gctgtctgcg aaggtcggga gtggagcttt gacgagcacc 121 gaggatcctg agcctccctc ggtgcctgta ccgacgaatg ttctaattaa gtcttataac 181 ttgaaccctg tcgtatgctg ggaataccag aacatgtcac agactcctat ttttactgta 241 caggtaaagg tgtattcggg ttcctggact gattcctgca ccaacatttc tgatcattgt 301 tgtaatatct atggacaaat tatgtatcct gatgtatctg cctgggccag agttaaagct 361 aaggttggac aaaaagaatc tgactatgca cggtcaaaag agttccttat gtgcctaaag 421 ggaaaggtcg ggccccctgg cctggagatc aggaggaaga aggaagaaca gctctccgtc 481 ctcgtatttc accctgaagt cgttgtgaat ggagagagcc agggaaccat gtttggtgac 541 gggagcacct gttacacatt cgactatact gtgtatgtgg agcataaccg gagtggggag 601 atcctacata cgaaacatac ggtcgaaaaa gaagagtgta atgagactct gtgtgagtta 661 aacatctcag tatccacact ggattccaga tattgtattt cagtagacgg aatctcatct 721 ttctggcaag ttagaacaga aaaatcgaaa gacgtctgta tccctccttt ccatgatgac 781 agaaaggatt caatttggat tctggtggtt gctcctctta ccgtctttac agtagttatc 841 ctggtatttg cgtattggta tactaagaag aattcattca agagaaaaag cataatgtta 901 cctaagtcct tgctctctgt ggtaaaaagt gccacgttag agacaaaacc tgaatcgaag 961 tattcacttg tcacaccgca ccagccagct gtcctagaga gtgagacggt gatctgtgaa 1021 gagcccctgt ccacagtgac agctccagac agccccgaag cagcagaaca ggaagaactt 1081 tcaaaagaaa caaaggctct ggaggctgga ggaagcacgt ctgccatgac cccagacagc 1141 cctccaactc cgacacaaag acgcagcttt tccctgttaa gtagtaacca gtcaggccct 1201 tgtagcctca ccgcctatca ctcccgaaac ggctctgaca gtggcctcgt gggatcgggc 1261 agctccatat cggacttgga atctctccca aacaacaact cagaaacaaa gatggcagag 1321 cacgaccctc cacccgtgag aaaggccccc atggcctccg gttatgacaa accgcacatg 1381 ttggtggacg tgcttgtgga tgttgggggg aaggagtctc tcatggggta tagactcaca 1441 ggagaggccc aggagctgtc ctaaggtctc ccgaggcctg ctggtggtaa agaaactgac 1501 cttttaggca gtttttctgc attgatttca tgaaagaagc tatacattag ctaatactaa 1561 ccacatagaa tatcagactt agatacgtga ataaggatcc tgtgggcact gctgggtcca 1621 ctctgcaaat gccaagacta tcaaaggaac gtattgtcgc ttctggctcc ttcccaggtg 1681 ggctagcatc tgtgagtttg cctcggctag ccttgcttcc tacagccgcc actgctcctc 1741 caccctgatc atctcacagg acagggtgga ccgggttttt tttttttttt tcacacacct 1801 ttgtatatgt aagttcatgt atataatatg tttacatgtt tcactttgaa ctgaaagcta 1861 ctcaaagcca gccgtaagtc tatggtagaa tgtgatggaa catgttggtg gaagcttgta 1921 caatagaaca cattggtggg agcttgtaca tactttttta tggagcatta cttacgattt 1981 tttaagtaaa atgttttgaa acc //
GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS DROELAVK 2505 bp ss-mRNA INV 10-OCT-1988 DEFINITION D.melanogaster elav protein, complete cds. ACCESSION M21152 KEYWORDS RNA-binding protein; elav protein; neural protein. SOURCE D.melanogaster (Canton-S) Kauvar 3-12 hour embryo library cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2505) AUTHORS Robinow,S., Campos,A.R., Yao,K.-M. and White,K. TITLE The elav gene of Drosophila, required in neurons, has three RNP consensus motifs JOURNAL Science 242, 1570-1572 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence kindly submitted by K.White, 26-OCT-1988. FEATURES from to/span description pept 492 1943 elav protein BASE COUNT 772 a 669 c 629 g 435 t ORIGIN Chromosome X; 1B5-9 on the cytogenetic map. 1 tttccacgcc aaccatcgtg cgccgcggtc tgaacgctcc tgccacagaa aaagaaaata 61 aaagcaagga aaattctaat cgaataccaa atatcgtgct tgtgtgctct ttccgcaatt 121 gatttttttt aagtagtgca tgacaataac cgttgagttg actccaaccg aagtaaccat 181 aactggaagc tgacaacagc cattggccgg aatctcatct cacaccgagc gaaatacggc 241 gacgccagcg tttatttatt aaaaacgatt acattacaac aaacaccaaa gtcctggaca 301 accgaagaga gtgtgggaga aaatcgaagc agagagcaag gaggagcgcc gtgagagcgc 361 gcgcgcaagg atttgagatt gcaaagagaa cttagcaatt agttcgcttt gtttgtccag 421 ccaaacaaca acaacaacca accagctccc caaaaccaac tacacacata ctcgctccct 481 aaactaaaac aatggacttt attatggcaa ataccggagc tggcggcgga gtagacacac 541 aggcacagct aatgcagagt gccgctgcag ccgcagcagt ggcggcaaca aacgcggccg 601 ccgctcccgt acagaatgca gccgccgtgg cggccgccgc ccagctgcag cagcaacagg 661 tgcaacaggc aatcctgcag gtgcagcagc agcagacaca gcaagcggtg gccgcggccg 721 ctgccgcagt gacccagcag ctccaacagc aacagcaggc tgtcgtggcc caacaggctg 781 tagtgcagca gcaacaacag caggcggcgg cagtggtgca acaggcggcg gtccaacagg 841 ctgtggtgcc ccagccgcag caggcgcagc ccaatacgaa tggcaatgca ggatcgggat 901 cgcaaaatgg cagcaacggc agcacggaga cgcgcacaaa ccttattgtc aactacttgc 961 cgcaaacaat gaccgaagac gagatccgtt cgctcttctc cagcgtcggc gagattgagt 1021 cggtgaagct gatacgcgac aagtcgcagg tctacatcga tcctctcaat ccgcaggcgc 1081 ccagcaaggg ccaaagtctg ggctacggct ttgttaacta tgtccggccg caagatgccg 1141 agcaggctgt taatgttcta aacggcctgc gactgcagaa caaaaccata aaggtgtcgt 1201 ttgcccgccc gtcgtccgat gccattaaag gcgccaacct ttatgtgtcg gggctgccaa 1261 agacgatgac ccagcaggaa ctggaggcca tcttcgcacc attcggagca ataatcacat 1321 cgcgcattct gcagaacgct ggcaacgata cgcagacgaa aggagtaggc tttattcgat 1381 tcgataagcg ggaggaggcc acaagggcta tcattgcctt gaacggtact acaccatcca 1441 gctgcacaga ccccatagtg gtaaaattct ccaatacgcc cggcagcacc agtaagatca 1501 tccagccgca gctgcccgct ttcctcaatc cccaactggt gcgacgcatc gggggcgcca 1561 tgcacacgcc agttaacaag ggtctggccc ggttttcacc aatggcgggc gacatgctgg 1621 acgtaatgct gcccaatgga ctaggagcgg cggcggcggc ggccacaacg ctggccagtg 1681 ggcctggcgg cgcgtatccc attttcatct acaacttggc gccggaaacg gaggaggctg 1741 ccctgtggca gctgtttggt ccatttggag ctgtgcaatc ggtaaagatt gtcaaggatc 1801 ccacaacgaa tcagtgcaag ggctacggct tcgtttcgat gaccaactac gatgaggcgg 1861 ccatggccat tcgggcgctc aacggctaca ccatgggcaa tcgggtgctg caggtcagct 1921 tcaagaccaa caaagccaag tagagcggcc caaatggaag tggacaagga cgcagcggga 1981 gcaccagcaa ccacaaccca ttatcttgaa tctaaacctg accacacaac acacatatat 2041 aaatatgcat ggtataacgg taactaagcg caacaaaaca gtatgtgtac aacacacaag 2101 aaaaaaacaa gaaagtagca ggcaggagaa aatactcttt ccactaaacg acaacgaaac 2161 ggcggaaatc cccctaaaaa cacaaatctc ctccccaaaa ccgccggcca aaagcgttgc 2221 aatagaaaaa ttcttcttgt ttagcattta agtaaaaaac cttaactaaa cgaagcgagc 2281 agaagaagtt ataagatcgg gagaagcgaa taggtagggg gttgagaaag aacaatgaga 2341 ttttaagaaa taatattgca aaatgaaaac acactgtggc agtggcagct gtgaaaggtc 2401 aaaggttggc acagtcgatc tagtcacaaa gcaaaacaaa gaaatcaacc aaacaaacac 2461 aaaacacaca aaaacgaaaa caaatttaac tttattagca aaaaa // LOCUS DROELAVM 9285 bp ds-DNA INV 26-OCT-1988 DEFINITION D.melanogaster elav protein, complete cds. ACCESSION M21153 KEYWORDS RNA-binding protein; elav protein; neural protein. SOURCE D.melanogaster (Canton-S) Maniatis library DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 9285) AUTHORS Robinow,S., Campos,A.R., Yao,K.-M. and White,K. TITLE The elav gene of Drosophila, required in neurons, has three RNP consensus motifs JOURNAL Science 242, 1570-1572 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence kindly submitted by K.White, 26-OCT-1988. BASE COUNT 2786 a 2147 c 2058 g 2294 t ORIGIN Chromosome X; 1B5-9 on the cytogenetic map. 1 ggatccagtc gagggatcac tgaactaaaa agtaaagaga ggagaaacag gagaagccct 61 taaagtgcga gttcgctgga gtgcctatcg atccttgagg gcagcgctcg atccttatta 121 ctattggcag gatcatttta taccccttac ataaaaatat cagtatcagt ttgttccctt 181 atcatgcgtg ctctttgaca tatccgcggg aatttatttc tatttttgct tagtcaaact 241 gaagaccatt ttgtcccaac ataaaatatg taaaatattt ttttttatca ttttaaattc 301 tcagggatta agtcaatgtt tgaaaacttc tagctgactt gcgctccact accactcggc 361 atgcgtgccc ctgattagta ggccaggtgg tgggcctttg gcacgatcgc aaccctaagg 421 actgcaaccg aatagcattg agaaggagac caggatttct gcactcaaag gtgaaattgt 481 ctgagcacgg aacacggaaa aacaacaaca aaaacatggc agacggaaaa caatgccgcc 541 agcggcaagg agggagggtg gaaacacatg ccacgattca gtgtcgaatg tcctgtgatc 601 ccacattcat atattttgtt attgtgtgcc tgcatccctc gcctgctttt aattcctgca 661 tcgatccttc tttgctcctg gtctacgaac ccgatgtctt atggtaattt tatgattatt 721 acaattagat tcgggggaaa ctgcatctgc cgccgtctgc agatccgatt catttggatt 781 ccgtctgctc aatccagatc catgatggga tcatggccat ttgggcattg gtacatggaa 841 agttaaaagc gccctaggcg ttctaatcca actcgtctca agaaaaacgg gtgaatgcat 901 ggaacgctgg atctggttct ggaagatccg gctaagtcaa caagaaatat gtaacctcgc 961 cattatagct ttactttcga agcagaaaat gcaatcgaga atatcgaatg cacgctccag 1021 gttcctatgg ctcctagctt tcggtgcagc acaacaaata tttgggttgg ttcgtttttc 1081 cgccacgatt ttaaaacgct aagggtggct ttttcggagt ccctctcatt tgagccggga 1141 aaaagaccag accctccctt tccctcccca gccactttcc agctttgttg taacctcgtg 1201 ttgtgcctgc tttttaatcc cgctttgttg taacgcaggc catttgctgt ttttgttttg 1261 tttctggcgt ggctaaaacg gattatgtga cccaaccctt tcgatttcga gtctcattgt 1321 attacatcct tttcccaaaa atgactaaaa aagcgtattg cctatttctg gggtggacag 1381 aatgaaacgg ggtcagcacg tggccgcaaa cttgaaatac cattatggat ccaataggac 1441 tgtgaagagt tattagtgaa acatttcgaa caaactattc ctttaattga aatttgccaa 1501 ttccccactt aagcatcgca tttgcatatt tgtatgctaa ccaaactctc tgcgacgtcg 1561 atctgtctga atctgaatcc gcatatcttt cgagggcagc aatagcgcca ccccacccca 1621 tttccgttgc cactagcagc gcaccaccct ttgaaaggac atcaaaaacc accccctgcc 1681 caaagttgcg ttacaattcc cgaagtgcag tgcagcgtac ataaaacata caatgaaatc 1741 cgataaaatt ttctatgtcc tttatatcct cataacatgc ccaactataa accgagccaa 1801 attcttctcg tgaaaaagtt gcacacaacg cgatttcact gtaccttaca ttgttctgta 1861 aactggaaca gagcgtccga ctggaagagg cgaaggaaaa aatcagcaca gggaacagga 1921 agttcacaag gattgggggt gtccgagggt ggctacatat atatggcgat atatggcata 1981 cagtatgtgc atacatatac atacatatac acatatatgt atgtatgtac catactatgt 2041 ggcgtgtcta gtggcgcgaa taacgaatac aggagcaatg aagcacacga acgaagcgaa 2101 acgaaacgaa gggaaatatt gaaaaagcca gaattgcatt acttcaaaac tataccggct 2161 cttggatatg ggtcctgggt ctcggttcct tgcttctggc tttcgcttcg tagggggtgt 2221 acgagcaata ctgccaggag aaaccaccct agcctttggc gttggtggtt gatagtgcaa 2281 aaataataat aacggaaacg gaaccataaa cccaaaacag cccttacaac gcgactgatt 2341 tgtaaggata atggtcgcat gcatcataaa acggttgacg acctcgtcgg aaagagccaa 2401 cattttgcaa tgatttccaa ttgctcctgc tatctaggcc caatgggccg ctaagtacat 2461 atgaagcata catagctaaa tacatatgaa gcatacatac atcgggttaa ctataaaacc 2521 aatgtgttct agatgagttt gttaatttta caaagtaagt tagggctgct ccagtggttt 2581 taagttgctt aactggacaa gagaacttgg ctctctcccc catctatctc ttgtcttcgg 2641 gaaaggtaac ctccattttg atcggttgaa tttgtatcaa aaagcatgaa aaactttgaa 2701 tacagcataa accactgaga gagcagccga aagccagggg tgtatcaaaa ggggttctta 2761 gaaatattcc actttgcaac gattatagac tcgtttctta tcagacaaaa ctttctccac 2821 ttctacgtgg tatttacaga aacaaaactt aaggtaaaat gttaggtggc tatatattaa 2881 agctattaaa atatattaaa ggtttccgga ataaccccct atctcggcgc gcagatacgc 2941 ccatgtcgcc actctttcgc tctctacctt ccactctcgc tccgaacacc attgcaaccc 3001 cgagagagct gctcgagagg caactatgag atatgagaat gagaggttgc tctatgtgaa 3061 aatggaaacc gaaaccgtca atgcctcaat ttcgcatttc attatttcat ttccacgcca 3121 accatcgtgc gccgcggtct gaacgctcct gccacagaaa aagaaaataa aagcaaggaa 3181 aattctaatc gaataccaaa tatcgtgctt gtgtgctctt tccgcaattg atttttttta 3241 agtagtgcat gacaataacc gttgagttga ctccaaccga agtaaccata actggaagct 3301 gacaacagcc attggccgga atctcatctc acaccgagcg aaatgtaaga gtatccccag 3361 gatatccaca atttgtgcaa ctgaatgaaa ggcaatgata acaatatcaa tcaacaactt 3421 tgaaacagta cttttttttt gctaatgatc gcgtcgcgct gcataattgt gcaatttact 3481 cacttataat ccgcgtgtct gcatatatcc atatgtacct atatctattt ttttgctagt 3541 gggtgtgtga tgtgtgtaag ttgtatacat tcaaacgaat attaataggc aaaacaacaa 3601 tatgcaattc gatggaggaa aaagcgaacc tcccaaagag aatggcggag agatgggcgg 3661 cggttagtca gagtgagagg gggcggccag tgcggagata gtagatgtgt tcagatctac 3721 gcctctgtga gctgtgtatc tgtaaatact tgctgctcaa atttaaatct gtgtgtgtat 3781 gtgtgtgtgg gcttgcgtgt gcgggagaga agagcgctga gtgttagagc gagagccagt 3841 tccgaagagg gcgcagctag catatcgatt ttatgatttc atgtgcaagg agcattgtac 3901 gcaacaacaa acggaggaac acgaccaact aaccgttaga agtagccatt ttttgagcgc 3961 agcggagagg gtgggcgagt gtgtgtgggt ggcggagtac ttgcagtcaa caaaacgccg 4021 gttgccataa aagcaaatgc acgcaggcat gaatcagacc catgtcgtag tagcaatatt 4081 aaacatagag gattcgcgat tcacagcaaa ggagtttttt ctcaaaaaac gacaacaatt 4141 agttgttttt tttcccagtg gtcgtctgtg tttatgcttg tgtggaagag agctcgaaat 4201 aggtagatat atacacgcgt atttatttat tatgtgtaca tatgtgcttg taaaaacaat 4261 atcagccata caaagtttga gtcccctccg cccacctcct aacaacaaca tcgatatcta 4321 aatcggtgca ggtaaagagc ggttgctaca ggcccaaata acctctcttc agtagaattt 4381 taccaacatc tcaaagtatt ctcgtgcgcc acatgtatgc gtatctgcac gcagatatat 4441 tttttgtatc gcctgtgaaa tatgaattca atgctttgga tggggtaaaa tcgataatat 4501 cgattggcga taacaatcgc acgcacacac acaggcccgc aaacacgaac aactaaaact 4561 acaacaacaa ccaccaaacc catcgaattt accgcttata gttttgtttt atttttgttt 4621 ttgtccacat cagacggcga cgccagcgtt tatttattaa aaacgattac attacaacaa 4681 acaccaaagt cctggacaac cgaagagagt gtgggagaaa atcgaagcag agagcaagga 4741 ggagcgccgt gagagcgcgc gcgcaaggat ttgagattgc aaagagaact tagcaattag 4801 ttcgctttgt ttgtccagcc aaacaacaac aacaaccaac cagctcccca aaaccaacta 4861 cacacatact cgctccctaa actaaaacaa gtgagtattg aaagacgacg ggccgagctg 4921 acgagatcgc aggacgaaag agagaaaact ttcgaagggt gcggtgtcgt gtcaaggcaa 4981 cacaatcaaa ccactaatgc ggaatattaa aaaaaaaaat taatttaaat cataagaggt 5041 tagctatatt tgatatatat atgtgtgtat ccatgtacat atgtctatgt tagacaagta 5101 gctctttctt gccgtattgt agttcgggtt atcgttaccc gcctctaatt ggcttgccat 5161 gacgaaagtg ggtggtacag gacgacgacg tgcgtgcgcg tgaccttttg actcttccat 5221 aagcgcctca ataaaaggtg cgagtcggta catacatatg tatgagaata gaggatatta 5281 aaaacgcgcc aaactctaag aaaagcatcc accaaagtaa aaacagtagc aactaaaaac 5341 ttttcagtct cagctaacca gtccaaacca gctatgtagt ttttagcacg actttgggag 5401 tgtcgaatgc gttaactaaa gctcaaatat atatcgagac ccgctgtagt ccctatatgc 5461 aaatcaaatt gctaccattt ccgcagcgat cgcttttcgc gacagccccc tctctctgtc 5521 gcacttagtt cagtgaacca acacattcgc tggtttcgtt ccgagcagca tcctctgtct 5581 ctcttgctca gcctcttcgc cgccccctct gcctaagcct ctgtttctgc tgtccgcacc 5641 ccttaccaag gtgtcatgta aatggcggag ccggcttagg gggtggcaat gccgtgcttt 5701 taacccttat cggtggcaaa tggcgctcgt acatttggag aaaatgccgt tctacgtttc 5761 gttttgcttc gtttcgccaa gcgaagtatt acaacttgtt tgactaaggg tgggtctacc 5821 gtctattgct catttatggt gttttgatgc tttctcaagg ggtgcgggct ggtttctcgc 5881 atgtcctgac cctctgggtc atccctctgt tcaccgcatt acataactgg gcggggtgca 5941 cagcgcgaca cactcttatg gagagagcgc tgtaatttgc ggaaaggaga cgagggggag 6001 aagggacgca agagcagcta cagcgcgcag cagtgccaga aagtcctgcg ctcgccccag 6061 aagcatcctg ttcctgtgac gcagcggatc tggtctcgcg ttcgctccac tacctccact 6121 tcctacacct acagtacaca tatcacatga acttaaaact agtgaacgga ttaattaaaa 6181 aggaatcgca aaacaaaagc tcgctaatcg cgcagacaca gcagaagctt aaattcagaa 6241 aagtttagtt cttcagtgcc gaaggtgtgt ttgcagagct ttgatttcta gtgcgcgggt 6301 gtgggaagag aggactaaat agagacagag agagcggccg gatgagaaag tggggcgctg 6361 ggcggcgcct gttagcaaag cttcgttgac caaccgttat ttcctgaaca aaggaataac 6421 atacattcat cacttaatca ccaactgttt ttaaatccac agaattataa tttagcgtat 6481 gattcaagat ttgggaattg tgcgcaccca ggataaccgc gtttgtgttt aatttacaaa 6541 catgccccta atgtctatta aataaaagtt cagcgtcgga aaattaaaga gtctggtaag 6601 gattacagta tatttatgct gtaatatcct ttaattgtcc cgggtcacat ttttccgtat 6661 ccccacgccc taaagcattt tcgtatccat taattaaaca agtaaaggtg taggcttata 6721 ttatgttata cagtccgctc tcacaagcat tataagaaat atattacttg tctgttcacc 6781 aaataggtaa aaaatataca tatcatatca aacgatttaa ataattgcgc cctgggcaga 6841 tgtaaataaa gaaatataaa ttaaaaaata tcgttagaat aaaccaacaa attttctatt 6901 caaaccctag ctgattgata atcaattagc ccgttataac tactaaaacc cataataata 6961 caaaaaacta ctaacacatc atgcaaatgt tttaattttg acatttaact attctttcca 7021 tgcgcctacc cctgcgttgg tcgtgacaac caacaataaa cttctcgact ctgcccgatc 7081 ctctcaccta tcactcagtg gactttatta tggcaaatac cggagctggc ggcggagtag 7141 acacacaggc acagctaatg cagagtgccg ctgcagccgc agcagtggcg gcaacaaacg 7201 cggccgccgc tcccgtacag aatgcagccg ccgtggcggc cgccgcccag ctgcagcagc 7261 aacaggtgca acaggcaatc ctgcaggtgc agcagcagca gacacagcaa gcggtggccg 7321 cggccgctgc cgcagtgacc cagcagctcc aacagcaaca gcaggctgtc gtggcccaac 7381 aggctgtagt gcagcagcaa caacagcagg cggcggcagt ggtgcaacag gcggcggtcc 7441 aacaggctgt ggtgccccag ccgcagcagg cgcagcccaa tacgaatggc aatgcaggat 7501 cgggatcgca aaatggcagc aacggcagca cggagacgcg cacaaacctt attgtcaact 7561 acttgccgca aacaatgacc gaagacgaga tccgttcgct cttctccagc gtcggcgaga 7621 ttgagtcggt gaagctgata cgcgacaagt cgcaggtcta catcgatcct ctcaatccgc 7681 aggcgcccag caagggccaa agtctgggct acggctttgt taactatgtc cggccgcaag 7741 atgccgagca ggctgttaat gttctaaacg gcctgcgact gcagaacaaa accataaagg 7801 tgtcgtttgc ccgcccgtcg tccgatgcca ttaaaggcgc caacctttat gtgtcggggc 7861 tgccaaagac gatgacccag caggaactgg aggccatctt cgcaccattc ggagcaataa 7921 tcacatcgcg cattctgcag aacgctggca acgatacgca gacgaaagga gtaggcttta 7981 ttcgattcga taagcgggag gaggccacaa gggctatcat tgccttgaac ggtactacac 8041 catccagctg cacagacccc atagtggtaa aattctccaa tacgcccggc agcaccagta 8101 agatcatcca gccgcagctg cccgctttcc tcaatcccca actggtgcga cgcatcgggg 8161 gcgccatgca cacgccagtt aacaagggtc tggcccggtt ttcaccaatg gcgggcgaca 8221 tgctggacgt aatgctgccc aatggactag gagcggcggc ggcggcggcc acaacgctgg 8281 ccagtgggcc tggcggcgcg tatcccattt tcatctacaa cttggcgccg gaaacggagg 8341 aggctgccct gtggcagctg tttggtccat ttggagctgt gcaatcggta aagattgtca 8401 aggatcccac aacgaatcag tgcaagggct acggcttcgt ttcgatgacc aactacgatg 8461 aggcggccat ggccattcgg gcgctcaacg gctacaccat gggcaatcgg gtgctgcagg 8521 tcagcttcaa gaccaacaaa gccaagtaga gcggcccaaa tggaagtgga caaggacgca 8581 gcgggagcac cagcaaccac aacccattat cttgaatcta aacctgacca cacaacacac 8641 atatataaat atgcatggta taacggtaac taagcgcaac aaaacagtat gtgtacaaca 8701 cacaagaaaa aaacaagaaa gtagcaggca ggagaaaata ctctttccac taaacgacaa 8761 cgaaacggcg gaaatccccc taaaaacaca aatctcctcc ccaaaaccgc cggccaaaag 8821 cgttgcaata gaaaaattct tcttgtttag catttaagta aaaaacctta actaaacgaa 8881 gcgagcagaa gaagttataa gatcgggaga agcgaatagg tagggggttg agaaagaaca 8941 atgagatttt aagaaataat attgcaaaat gaaaacacac tgtggcagtg gcagctgtga 9001 aaggtcaaag gttggcacag tcgatctagt cacaaagcaa aacaaagaaa tcaaccaaac 9061 aaacacaaaa cacacaaaaa cgaaaacaaa tttaacttta ttagcaaaaa agaaaatgag 9121 aaattttaaa aatttattaa ctttaagtaa aacgcagcaa tgaaatcttc caaacaaaag 9181 aaagcaaaac agcaaatctg tttgttacac ttcaactttt tctaccataa aaactttgaa 9241 tcttgatact aaacctaaag cagaattaag aaaatagagg aattc // LOCUS HUMDBTEFH 2335 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human beta-globin intergenic region spanning the deletion which causes delta-beta-thalassemia. ACCESSION M22800 X14550 KEYWORDS . SOURCE Human patient s.h. leukocyte cell DNA, togi clone d. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 643) AUTHORS Mishima,N. and Gilman,J.G. JOURNAL Unpublished (1989) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Mishima,N. and Gilman,J.G. TITLE DNA deletion in indian delta-beta-thalassemia with elevated fetal hemoglobin extends from 1 kb 3' of a-GAMMA globin gene into downstream KpnI (L1) repeated sequence JOURNAL FASEB J. 2, 546-546 (1988) STANDARD simple staff_review REFERENCE 3 (bases 1 to 2335) AUTHORS Mishima,N. and Gilman,J.G. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by J.G.Gilman 28-FEB-1989; [3], 11-APR-1990. This sequence is across the DNA deletion in the beta-globin gene cluster. BASE COUNT 931 a 462 c 393 g 530 t 19 others ORIGIN Chromosome 11. 1 aagcttgggt tttgagtttt cattatttac ccaaaattca ttcaggagca ggttgtttaa 61 tttccatgta attgcatagt tttcatctat tcattagtct tgaaatctat ttttattcct 121 ctgtggtctg agagtgattg atatgatttt ttttcatttt ctgaggattg ttttaggtcc 181 tcccaaggct gaaccaggaa gaaatcaatt tcctgaagag atccatgatg agctctgaaa 241 atgattcagt aataaatagc ctatcaaaca aaaaaagccc tggacaatat gattcacaac 301 caaattctat cagatgcaca aagaagagtt ggaaaaattc ctactgaagc tactccaaaa 361 aattgagggg gagggattca acctcagctc attttatgag gccagcatca tcctgatacc 421 aaaacctgat agacacacaa caacaaaaaa agaaaacttc aggccaatat ccttgaacat 481 tgatgcaaaa atactagcaa gccaaatcca gcaacacatc acaaagctaa tccacctcaa 541 tcaagtaggc tttatccctg agaagcaaga ttggttcagc atatacaaat tgataaatgt 601 gattcatcat atatacagca ctaaaaacag aaaccacatg atcatttcaa tatgtgcaaa 661 aaaagatttt gataaaattc aacttcactt catattaaaa cctctcaaca aactagctat 721 ttaaggacat gacctcaaaa taatatgagc catctatgac aaacccacag ccaacatcat 781 acagaatggg caaaacctgg aagcattctt cttgaaaact ggcacaagac aaggatgccc 841 tctctcacca ctgctattca gtatagtact ggcagtcctg gccagagcaa tcaggcaaga 901 gaagggaata aagggcatcc aaataggaag agagaaagtc aaagtatccc tgttttctgt 961 atctagaaaa caccatagtc aaggcccaaa agctccttca gctgataact tcagcaaagt 1021 ttcaggatat naaatcaatg tacaaaaatc actagcattc ccatacatca acaatagcca 1081 agcctagagc caaatcagga agtcaatccc atttaccgtt gccacaaaaa gaataaaata 1141 cctaaggata tatctaacca aggagttgaa agatctctac aatgagattt acaaaacact 1201 gctcaaagaa atcagagctg atagaaacaa atggaaaaac tttccatgct catggataaa 1261 aagaatcaat attgttaaaa tgtccatact gcctagagca atttacagtt tcaaagctat 1321 tcctatcaaa ctaccaatga cattcttcac agaactagaa aaatacactt taaagttcat 1381 atggaacaac aacaaaaaaa gtccaaatag cctaggaaat cctaatactc acaggtggga 1441 attgaaaaat gagaacactt ggacacaggg tggggaacat cacacaccgg ggcctgtcgt 1501 ggggtggtag gagagnngag ggataanatt aggagatata tctaatgtaa atgacgagtt 1561 aacgggtgca gcacaccaat atggcacacg tatcactatg taacaaactg cacaggtacc 1621 gtagaactta gagtacaaaa aaaaattgag aaaaagaaaa aagctggaga taacacattg 1681 cctaacttca aaccatacca cagggctaca gtaaccaaaa cagtttggta ctgttacaga 1741 aaagacacac agacaaatgg aacagaatag agagcccaga aataagacca catgcctaca 1801 accatctgat ctttgacaaa gctgataana acaagcaatg aggaaaagac accctatnca 1861 acaagtcgtg ttgggataac tgactaancg tatntagaag attgaaacca gaccccgtnc 1921 tatatcatat ncncaaataa ncncaagatg gattagagac ttaaatgcaa agcccaaact 1981 atnanaatgc ttgaaagcaa cctagacaat accattccgg acatgagcag gggcaaagat 2041 ttcatgacaa agatgccaag agtcattgca gcaaaagcaa aaattgacaa ataggatcta 2101 attaaaataa agtttctgca cagcaaaaga aactatcaac tgagtaaaca acctatagaa 2161 tnggagaaaa tnttnncaaa ctatgcatcc aacaaaggcc taatattcag aatctataag 2221 gaacttaaat gaatctacaa gaacaaacaa acaatgccac aaaaaatggg taaaggacat 2281 gaacagacag tattcaaaag aagacatcac atatggccac caagcatata aaaaa //
GenBank-Updates@genbank.bio.net (04/12/90)
LOCUS DROELAVK 2505 bp ss-mRNA INV 10-OCT-1988 DEFINITION D.melanogaster elav protein, complete cds. ACCESSION M21152 KEYWORDS RNA binding protein; elav protein; neural protein. SOURCE D.melanogaster (Canton-S) Kauvar 3-12 hour embryo library cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2505) AUTHORS Robinow,S., Campos,A.R., Yao,K.-M. and White,K. TITLE The elav gene of Drosophila, required in neurons, has three RNP consensus motifs JOURNAL Science 242, 1570-1572 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence kindly submitted by K.White, 26-OCT-1988. FEATURES from to/span description pept 492 1943 elav protein BASE COUNT 772 a 669 c 629 g 435 t ORIGIN Chromosome X; 1B5-9 on the cytogenetic map. 1 tttccacgcc aaccatcgtg cgccgcggtc tgaacgctcc tgccacagaa aaagaaaata 61 aaagcaagga aaattctaat cgaataccaa atatcgtgct tgtgtgctct ttccgcaatt 121 gatttttttt aagtagtgca tgacaataac cgttgagttg actccaaccg aagtaaccat 181 aactggaagc tgacaacagc cattggccgg aatctcatct cacaccgagc gaaatacggc 241 gacgccagcg tttatttatt aaaaacgatt acattacaac aaacaccaaa gtcctggaca 301 accgaagaga gtgtgggaga aaatcgaagc agagagcaag gaggagcgcc gtgagagcgc 361 gcgcgcaagg atttgagatt gcaaagagaa cttagcaatt agttcgcttt gtttgtccag 421 ccaaacaaca acaacaacca accagctccc caaaaccaac tacacacata ctcgctccct 481 aaactaaaac aatggacttt attatggcaa ataccggagc tggcggcgga gtagacacac 541 aggcacagct aatgcagagt gccgctgcag ccgcagcagt ggcggcaaca aacgcggccg 601 ccgctcccgt acagaatgca gccgccgtgg cggccgccgc ccagctgcag cagcaacagg 661 tgcaacaggc aatcctgcag gtgcagcagc agcagacaca gcaagcggtg gccgcggccg 721 ctgccgcagt gacccagcag ctccaacagc aacagcaggc tgtcgtggcc caacaggctg 781 tagtgcagca gcaacaacag caggcggcgg cagtggtgca acaggcggcg gtccaacagg 841 ctgtggtgcc ccagccgcag caggcgcagc ccaatacgaa tggcaatgca ggatcgggat 901 cgcaaaatgg cagcaacggc agcacggaga cgcgcacaaa ccttattgtc aactacttgc 961 cgcaaacaat gaccgaagac gagatccgtt cgctcttctc cagcgtcggc gagattgagt 1021 cggtgaagct gatacgcgac aagtcgcagg tctacatcga tcctctcaat ccgcaggcgc 1081 ccagcaaggg ccaaagtctg ggctacggct ttgttaacta tgtccggccg caagatgccg 1141 agcaggctgt taatgttcta aacggcctgc gactgcagaa caaaaccata aaggtgtcgt 1201 ttgcccgccc gtcgtccgat gccattaaag gcgccaacct ttatgtgtcg gggctgccaa 1261 agacgatgac ccagcaggaa ctggaggcca tcttcgcacc attcggagca ataatcacat 1321 cgcgcattct gcagaacgct ggcaacgata cgcagacgaa aggagtaggc tttattcgat 1381 tcgataagcg ggaggaggcc acaagggcta tcattgcctt gaacggtact acaccatcca 1441 gctgcacaga ccccatagtg gtaaaattct ccaatacgcc cggcagcacc agtaagatca 1501 tccagccgca gctgcccgct ttcctcaatc cccaactggt gcgacgcatc gggggcgcca 1561 tgcacacgcc agttaacaag ggtctggccc ggttttcacc aatggcgggc gacatgctgg 1621 acgtaatgct gcccaatgga ctaggagcgg cggcggcggc ggccacaacg ctggccagtg 1681 ggcctggcgg cgcgtatccc attttcatct acaacttggc gccggaaacg gaggaggctg 1741 ccctgtggca gctgtttggt ccatttggag ctgtgcaatc ggtaaagatt gtcaaggatc 1801 ccacaacgaa tcagtgcaag ggctacggct tcgtttcgat gaccaactac gatgaggcgg 1861 ccatggccat tcgggcgctc aacggctaca ccatgggcaa tcgggtgctg caggtcagct 1921 tcaagaccaa caaagccaag tagagcggcc caaatggaag tggacaagga cgcagcggga 1981 gcaccagcaa ccacaaccca ttatcttgaa tctaaacctg accacacaac acacatatat 2041 aaatatgcat ggtataacgg taactaagcg caacaaaaca gtatgtgtac aacacacaag 2101 aaaaaaacaa gaaagtagca ggcaggagaa aatactcttt ccactaaacg acaacgaaac 2161 ggcggaaatc cccctaaaaa cacaaatctc ctccccaaaa ccgccggcca aaagcgttgc 2221 aatagaaaaa ttcttcttgt ttagcattta agtaaaaaac cttaactaaa cgaagcgagc 2281 agaagaagtt ataagatcgg gagaagcgaa taggtagggg gttgagaaag aacaatgaga 2341 ttttaagaaa taatattgca aaatgaaaac acactgtggc agtggcagct gtgaaaggtc 2401 aaaggttggc acagtcgatc tagtcacaaa gcaaaacaaa gaaatcaacc aaacaaacac 2461 aaaacacaca aaaacgaaaa caaatttaac tttattagca aaaaa // LOCUS DROELAVM 9285 bp ds-DNA INV 26-OCT-1988 DEFINITION D.melanogaster elav protein, complete cds. ACCESSION M21153 KEYWORDS RNA binding protein; elav protein; neural protein. SOURCE D.melanogaster (Canton-S) Maniatis library DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 9285) AUTHORS Robinow,S., Campos,A.R., Yao,K.-M. and White,K. TITLE The elav gene of Drosophila, required in neurons, has three RNP consensus motifs JOURNAL Science 242, 1570-1572 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence kindly submitted by K.White, 26-OCT-1988. BASE COUNT 2786 a 2147 c 2058 g 2294 t ORIGIN Chromosome X; 1B5-9 on the cytogenetic map. 1 ggatccagtc gagggatcac tgaactaaaa agtaaagaga ggagaaacag gagaagccct 61 taaagtgcga gttcgctgga gtgcctatcg atccttgagg gcagcgctcg atccttatta 121 ctattggcag gatcatttta taccccttac ataaaaatat cagtatcagt ttgttccctt 181 atcatgcgtg ctctttgaca tatccgcggg aatttatttc tatttttgct tagtcaaact 241 gaagaccatt ttgtcccaac ataaaatatg taaaatattt ttttttatca ttttaaattc 301 tcagggatta agtcaatgtt tgaaaacttc tagctgactt gcgctccact accactcggc 361 atgcgtgccc ctgattagta ggccaggtgg tgggcctttg gcacgatcgc aaccctaagg 421 actgcaaccg aatagcattg agaaggagac caggatttct gcactcaaag gtgaaattgt 481 ctgagcacgg aacacggaaa aacaacaaca aaaacatggc agacggaaaa caatgccgcc 541 agcggcaagg agggagggtg gaaacacatg ccacgattca gtgtcgaatg tcctgtgatc 601 ccacattcat atattttgtt attgtgtgcc tgcatccctc gcctgctttt aattcctgca 661 tcgatccttc tttgctcctg gtctacgaac ccgatgtctt atggtaattt tatgattatt 721 acaattagat tcgggggaaa ctgcatctgc cgccgtctgc agatccgatt catttggatt 781 ccgtctgctc aatccagatc catgatggga tcatggccat ttgggcattg gtacatggaa 841 agttaaaagc gccctaggcg ttctaatcca actcgtctca agaaaaacgg gtgaatgcat 901 ggaacgctgg atctggttct ggaagatccg gctaagtcaa caagaaatat gtaacctcgc 961 cattatagct ttactttcga agcagaaaat gcaatcgaga atatcgaatg cacgctccag 1021 gttcctatgg ctcctagctt tcggtgcagc acaacaaata tttgggttgg ttcgtttttc 1081 cgccacgatt ttaaaacgct aagggtggct ttttcggagt ccctctcatt tgagccggga 1141 aaaagaccag accctccctt tccctcccca gccactttcc agctttgttg taacctcgtg 1201 ttgtgcctgc tttttaatcc cgctttgttg taacgcaggc catttgctgt ttttgttttg 1261 tttctggcgt ggctaaaacg gattatgtga cccaaccctt tcgatttcga gtctcattgt 1321 attacatcct tttcccaaaa atgactaaaa aagcgtattg cctatttctg gggtggacag 1381 aatgaaacgg ggtcagcacg tggccgcaaa cttgaaatac cattatggat ccaataggac 1441 tgtgaagagt tattagtgaa acatttcgaa caaactattc ctttaattga aatttgccaa 1501 ttccccactt aagcatcgca tttgcatatt tgtatgctaa ccaaactctc tgcgacgtcg 1561 atctgtctga atctgaatcc gcatatcttt cgagggcagc aatagcgcca ccccacccca 1621 tttccgttgc cactagcagc gcaccaccct ttgaaaggac atcaaaaacc accccctgcc 1681 caaagttgcg ttacaattcc cgaagtgcag tgcagcgtac ataaaacata caatgaaatc 1741 cgataaaatt ttctatgtcc tttatatcct cataacatgc ccaactataa accgagccaa 1801 attcttctcg tgaaaaagtt gcacacaacg cgatttcact gtaccttaca ttgttctgta 1861 aactggaaca gagcgtccga ctggaagagg cgaaggaaaa aatcagcaca gggaacagga 1921 agttcacaag gattgggggt gtccgagggt ggctacatat atatggcgat atatggcata 1981 cagtatgtgc atacatatac atacatatac acatatatgt atgtatgtac catactatgt 2041 ggcgtgtcta gtggcgcgaa taacgaatac aggagcaatg aagcacacga acgaagcgaa 2101 acgaaacgaa gggaaatatt gaaaaagcca gaattgcatt acttcaaaac tataccggct 2161 cttggatatg ggtcctgggt ctcggttcct tgcttctggc tttcgcttcg tagggggtgt 2221 acgagcaata ctgccaggag aaaccaccct agcctttggc gttggtggtt gatagtgcaa 2281 aaataataat aacggaaacg gaaccataaa cccaaaacag cccttacaac gcgactgatt 2341 tgtaaggata atggtcgcat gcatcataaa acggttgacg acctcgtcgg aaagagccaa 2401 cattttgcaa tgatttccaa ttgctcctgc tatctaggcc caatgggccg ctaagtacat 2461 atgaagcata catagctaaa tacatatgaa gcatacatac atcgggttaa ctataaaacc 2521 aatgtgttct agatgagttt gttaatttta caaagtaagt tagggctgct ccagtggttt 2581 taagttgctt aactggacaa gagaacttgg ctctctcccc catctatctc ttgtcttcgg 2641 gaaaggtaac ctccattttg atcggttgaa tttgtatcaa aaagcatgaa aaactttgaa 2701 tacagcataa accactgaga gagcagccga aagccagggg tgtatcaaaa ggggttctta 2761 gaaatattcc actttgcaac gattatagac tcgtttctta tcagacaaaa ctttctccac 2821 ttctacgtgg tatttacaga aacaaaactt aaggtaaaat gttaggtggc tatatattaa 2881 agctattaaa atatattaaa ggtttccgga ataaccccct atctcggcgc gcagatacgc 2941 ccatgtcgcc actctttcgc tctctacctt ccactctcgc tccgaacacc attgcaaccc 3001 cgagagagct gctcgagagg caactatgag atatgagaat gagaggttgc tctatgtgaa 3061 aatggaaacc gaaaccgtca atgcctcaat ttcgcatttc attatttcat ttccacgcca 3121 accatcgtgc gccgcggtct gaacgctcct gccacagaaa aagaaaataa aagcaaggaa 3181 aattctaatc gaataccaaa tatcgtgctt gtgtgctctt tccgcaattg atttttttta 3241 agtagtgcat gacaataacc gttgagttga ctccaaccga agtaaccata actggaagct 3301 gacaacagcc attggccgga atctcatctc acaccgagcg aaatgtaaga gtatccccag 3361 gatatccaca atttgtgcaa ctgaatgaaa ggcaatgata acaatatcaa tcaacaactt 3421 tgaaacagta cttttttttt gctaatgatc gcgtcgcgct gcataattgt gcaatttact 3481 cacttataat ccgcgtgtct gcatatatcc atatgtacct atatctattt ttttgctagt 3541 gggtgtgtga tgtgtgtaag ttgtatacat tcaaacgaat attaataggc aaaacaacaa 3601 tatgcaattc gatggaggaa aaagcgaacc tcccaaagag aatggcggag agatgggcgg 3661 cggttagtca gagtgagagg gggcggccag tgcggagata gtagatgtgt tcagatctac 3721 gcctctgtga gctgtgtatc tgtaaatact tgctgctcaa atttaaatct gtgtgtgtat 3781 gtgtgtgtgg gcttgcgtgt gcgggagaga agagcgctga gtgttagagc gagagccagt 3841 tccgaagagg gcgcagctag catatcgatt ttatgatttc atgtgcaagg agcattgtac 3901 gcaacaacaa acggaggaac acgaccaact aaccgttaga agtagccatt ttttgagcgc 3961 agcggagagg gtgggcgagt gtgtgtgggt ggcggagtac ttgcagtcaa caaaacgccg 4021 gttgccataa aagcaaatgc acgcaggcat gaatcagacc catgtcgtag tagcaatatt 4081 aaacatagag gattcgcgat tcacagcaaa ggagtttttt ctcaaaaaac gacaacaatt 4141 agttgttttt tttcccagtg gtcgtctgtg tttatgcttg tgtggaagag agctcgaaat 4201 aggtagatat atacacgcgt atttatttat tatgtgtaca tatgtgcttg taaaaacaat 4261 atcagccata caaagtttga gtcccctccg cccacctcct aacaacaaca tcgatatcta 4321 aatcggtgca ggtaaagagc ggttgctaca ggcccaaata acctctcttc agtagaattt 4381 taccaacatc tcaaagtatt ctcgtgcgcc acatgtatgc gtatctgcac gcagatatat 4441 tttttgtatc gcctgtgaaa tatgaattca atgctttgga tggggtaaaa tcgataatat 4501 cgattggcga taacaatcgc acgcacacac acaggcccgc aaacacgaac aactaaaact 4561 acaacaacaa ccaccaaacc catcgaattt accgcttata gttttgtttt atttttgttt 4621 ttgtccacat cagacggcga cgccagcgtt tatttattaa aaacgattac attacaacaa 4681 acaccaaagt cctggacaac cgaagagagt gtgggagaaa atcgaagcag agagcaagga 4741 ggagcgccgt gagagcgcgc gcgcaaggat ttgagattgc aaagagaact tagcaattag 4801 ttcgctttgt ttgtccagcc aaacaacaac aacaaccaac cagctcccca aaaccaacta 4861 cacacatact cgctccctaa actaaaacaa gtgagtattg aaagacgacg ggccgagctg 4921 acgagatcgc aggacgaaag agagaaaact ttcgaagggt gcggtgtcgt gtcaaggcaa 4981 cacaatcaaa ccactaatgc ggaatattaa aaaaaaaaat taatttaaat cataagaggt 5041 tagctatatt tgatatatat atgtgtgtat ccatgtacat atgtctatgt tagacaagta 5101 gctctttctt gccgtattgt agttcgggtt atcgttaccc gcctctaatt ggcttgccat 5161 gacgaaagtg ggtggtacag gacgacgacg tgcgtgcgcg tgaccttttg actcttccat 5221 aagcgcctca ataaaaggtg cgagtcggta catacatatg tatgagaata gaggatatta 5281 aaaacgcgcc aaactctaag aaaagcatcc accaaagtaa aaacagtagc aactaaaaac 5341 ttttcagtct cagctaacca gtccaaacca gctatgtagt ttttagcacg actttgggag 5401 tgtcgaatgc gttaactaaa gctcaaatat atatcgagac ccgctgtagt ccctatatgc 5461 aaatcaaatt gctaccattt ccgcagcgat cgcttttcgc gacagccccc tctctctgtc 5521 gcacttagtt cagtgaacca acacattcgc tggtttcgtt ccgagcagca tcctctgtct 5581 ctcttgctca gcctcttcgc cgccccctct gcctaagcct ctgtttctgc tgtccgcacc 5641 ccttaccaag gtgtcatgta aatggcggag ccggcttagg gggtggcaat gccgtgcttt 5701 taacccttat cggtggcaaa tggcgctcgt acatttggag aaaatgccgt tctacgtttc 5761 gttttgcttc gtttcgccaa gcgaagtatt acaacttgtt tgactaaggg tgggtctacc 5821 gtctattgct catttatggt gttttgatgc tttctcaagg ggtgcgggct ggtttctcgc 5881 atgtcctgac cctctgggtc atccctctgt tcaccgcatt acataactgg gcggggtgca 5941 cagcgcgaca cactcttatg gagagagcgc tgtaatttgc ggaaaggaga cgagggggag 6001 aagggacgca agagcagcta cagcgcgcag cagtgccaga aagtcctgcg ctcgccccag 6061 aagcatcctg ttcctgtgac gcagcggatc tggtctcgcg ttcgctccac tacctccact 6121 tcctacacct acagtacaca tatcacatga acttaaaact agtgaacgga ttaattaaaa 6181 aggaatcgca aaacaaaagc tcgctaatcg cgcagacaca gcagaagctt aaattcagaa 6241 aagtttagtt cttcagtgcc gaaggtgtgt ttgcagagct ttgatttcta gtgcgcgggt 6301 gtgggaagag aggactaaat agagacagag agagcggccg gatgagaaag tggggcgctg 6361 ggcggcgcct gttagcaaag cttcgttgac caaccgttat ttcctgaaca aaggaataac 6421 atacattcat cacttaatca ccaactgttt ttaaatccac agaattataa tttagcgtat 6481 gattcaagat ttgggaattg tgcgcaccca ggataaccgc gtttgtgttt aatttacaaa 6541 catgccccta atgtctatta aataaaagtt cagcgtcgga aaattaaaga gtctggtaag 6601 gattacagta tatttatgct gtaatatcct ttaattgtcc cgggtcacat ttttccgtat 6661 ccccacgccc taaagcattt tcgtatccat taattaaaca agtaaaggtg taggcttata 6721 ttatgttata cagtccgctc tcacaagcat tataagaaat atattacttg tctgttcacc 6781 aaataggtaa aaaatataca tatcatatca aacgatttaa ataattgcgc cctgggcaga 6841 tgtaaataaa gaaatataaa ttaaaaaata tcgttagaat aaaccaacaa attttctatt 6901 caaaccctag ctgattgata atcaattagc ccgttataac tactaaaacc cataataata 6961 caaaaaacta ctaacacatc atgcaaatgt tttaattttg acatttaact attctttcca 7021 tgcgcctacc cctgcgttgg tcgtgacaac caacaataaa cttctcgact ctgcccgatc 7081 ctctcaccta tcactcagtg gactttatta tggcaaatac cggagctggc ggcggagtag 7141 acacacaggc acagctaatg cagagtgccg ctgcagccgc agcagtggcg gcaacaaacg 7201 cggccgccgc tcccgtacag aatgcagccg ccgtggcggc cgccgcccag ctgcagcagc 7261 aacaggtgca acaggcaatc ctgcaggtgc agcagcagca gacacagcaa gcggtggccg 7321 cggccgctgc cgcagtgacc cagcagctcc aacagcaaca gcaggctgtc gtggcccaac 7381 aggctgtagt gcagcagcaa caacagcagg cggcggcagt ggtgcaacag gcggcggtcc 7441 aacaggctgt ggtgccccag ccgcagcagg cgcagcccaa tacgaatggc aatgcaggat 7501 cgggatcgca aaatggcagc aacggcagca cggagacgcg cacaaacctt attgtcaact 7561 acttgccgca aacaatgacc gaagacgaga tccgttcgct cttctccagc gtcggcgaga 7621 ttgagtcggt gaagctgata cgcgacaagt cgcaggtcta catcgatcct ctcaatccgc 7681 aggcgcccag caagggccaa agtctgggct acggctttgt taactatgtc cggccgcaag 7741 atgccgagca ggctgttaat gttctaaacg gcctgcgact gcagaacaaa accataaagg 7801 tgtcgtttgc ccgcccgtcg tccgatgcca ttaaaggcgc caacctttat gtgtcggggc 7861 tgccaaagac gatgacccag caggaactgg aggccatctt cgcaccattc ggagcaataa 7921 tcacatcgcg cattctgcag aacgctggca acgatacgca gacgaaagga gtaggcttta 7981 ttcgattcga taagcgggag gaggccacaa gggctatcat tgccttgaac ggtactacac 8041 catccagctg cacagacccc atagtggtaa aattctccaa tacgcccggc agcaccagta 8101 agatcatcca gccgcagctg cccgctttcc tcaatcccca actggtgcga cgcatcgggg 8161 gcgccatgca cacgccagtt aacaagggtc tggcccggtt ttcaccaatg gcgggcgaca 8221 tgctggacgt aatgctgccc aatggactag gagcggcggc ggcggcggcc acaacgctgg 8281 ccagtgggcc tggcggcgcg tatcccattt tcatctacaa cttggcgccg gaaacggagg 8341 aggctgccct gtggcagctg tttggtccat ttggagctgt gcaatcggta aagattgtca 8401 aggatcccac aacgaatcag tgcaagggct acggcttcgt ttcgatgacc aactacgatg 8461 aggcggccat ggccattcgg gcgctcaacg gctacaccat gggcaatcgg gtgctgcagg 8521 tcagcttcaa gaccaacaaa gccaagtaga gcggcccaaa tggaagtgga caaggacgca 8581 gcgggagcac cagcaaccac aacccattat cttgaatcta aacctgacca cacaacacac 8641 atatataaat atgcatggta taacggtaac taagcgcaac aaaacagtat gtgtacaaca 8701 cacaagaaaa aaacaagaaa gtagcaggca ggagaaaata ctctttccac taaacgacaa 8761 cgaaacggcg gaaatccccc taaaaacaca aatctcctcc ccaaaaccgc cggccaaaag 8821 cgttgcaata gaaaaattct tcttgtttag catttaagta aaaaacctta actaaacgaa 8881 gcgagcagaa gaagttataa gatcgggaga agcgaatagg tagggggttg agaaagaaca 8941 atgagatttt aagaaataat attgcaaaat gaaaacacac tgtggcagtg gcagctgtga 9001 aaggtcaaag gttggcacag tcgatctagt cacaaagcaa aacaaagaaa tcaaccaaac 9061 aaacacaaaa cacacaaaaa cgaaaacaaa tttaacttta ttagcaaaaa agaaaatgag 9121 aaattttaaa aatttattaa ctttaagtaa aacgcagcaa tgaaatcttc caaacaaaag 9181 aaagcaaaac agcaaatctg tttgttacac ttcaactttt tctaccataa aaactttgaa 9241 tcttgatact aaacctaaag cagaattaag aaaatagagg aattc // LOCUS HUMDBTEFH 2335 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human beta-globin intergenic region spanning the deletion which causes delta-beta-thalassemia. ACCESSION M22800 X14550 KEYWORDS . SOURCE Human patient s.h. leukocyte cell DNA, togi clone d. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 643) AUTHORS Mishima,N. and Gilman,J.G. JOURNAL Unpublished (1989) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Mishima,N. and Gilman,J.G. TITLE DNA deletion in indian delta-beta-thalassemia with elevated fetal hemoglobin extends from 1 kb 3' of a-GAMMA globin gene into downstream KpnI (L1) repeated sequence JOURNAL FASEB J. 2, 546-546 (1988) STANDARD simple staff_review REFERENCE 3 (bases 1 to 2335) AUTHORS Mishima,N. and Gilman,J.G. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by J.G.Gilman 28-FEB-1989; [3], 11-APR-1990. This sequence is across the DNA deletion in the beta-globin gene cluster. BASE COUNT 931 a 462 c 393 g 530 t 19 others ORIGIN Chromosome 11. 1 aagcttgggt tttgagtttt cattatttac ccaaaattca ttcaggagca ggttgtttaa 61 tttccatgta attgcatagt tttcatctat tcattagtct tgaaatctat ttttattcct 121 ctgtggtctg agagtgattg atatgatttt ttttcatttt ctgaggattg ttttaggtcc 181 tcccaaggct gaaccaggaa gaaatcaatt tcctgaagag atccatgatg agctctgaaa 241 atgattcagt aataaatagc ctatcaaaca aaaaaagccc tggacaatat gattcacaac 301 caaattctat cagatgcaca aagaagagtt ggaaaaattc ctactgaagc tactccaaaa 361 aattgagggg gagggattca acctcagctc attttatgag gccagcatca tcctgatacc 421 aaaacctgat agacacacaa caacaaaaaa agaaaacttc aggccaatat ccttgaacat 481 tgatgcaaaa atactagcaa gccaaatcca gcaacacatc acaaagctaa tccacctcaa 541 tcaagtaggc tttatccctg agaagcaaga ttggttcagc atatacaaat tgataaatgt 601 gattcatcat atatacagca ctaaaaacag aaaccacatg atcatttcaa tatgtgcaaa 661 aaaagatttt gataaaattc aacttcactt catattaaaa cctctcaaca aactagctat 721 ttaaggacat gacctcaaaa taatatgagc catctatgac aaacccacag ccaacatcat 781 acagaatggg caaaacctgg aagcattctt cttgaaaact ggcacaagac aaggatgccc 841 tctctcacca ctgctattca gtatagtact ggcagtcctg gccagagcaa tcaggcaaga 901 gaagggaata aagggcatcc aaataggaag agagaaagtc aaagtatccc tgttttctgt 961 atctagaaaa caccatagtc aaggcccaaa agctccttca gctgataact tcagcaaagt 1021 ttcaggatat naaatcaatg tacaaaaatc actagcattc ccatacatca acaatagcca 1081 agcctagagc caaatcagga agtcaatccc atttaccgtt gccacaaaaa gaataaaata 1141 cctaaggata tatctaacca aggagttgaa agatctctac aatgagattt acaaaacact 1201 gctcaaagaa atcagagctg atagaaacaa atggaaaaac tttccatgct catggataaa 1261 aagaatcaat attgttaaaa tgtccatact gcctagagca atttacagtt tcaaagctat 1321 tcctatcaaa ctaccaatga cattcttcac agaactagaa aaatacactt taaagttcat 1381 atggaacaac aacaaaaaaa gtccaaatag cctaggaaat cctaatactc acaggtggga 1441 attgaaaaat gagaacactt ggacacaggg tggggaacat cacacaccgg ggcctgtcgt 1501 ggggtggtag gagagnngag ggataanatt aggagatata tctaatgtaa atgacgagtt 1561 aacgggtgca gcacaccaat atggcacacg tatcactatg taacaaactg cacaggtacc 1621 gtagaactta gagtacaaaa aaaaattgag aaaaagaaaa aagctggaga taacacattg 1681 cctaacttca aaccatacca cagggctaca gtaaccaaaa cagtttggta ctgttacaga 1741 aaagacacac agacaaatgg aacagaatag agagcccaga aataagacca catgcctaca 1801 accatctgat ctttgacaaa gctgataana acaagcaatg aggaaaagac accctatnca 1861 acaagtcgtg ttgggataac tgactaancg tatntagaag attgaaacca gaccccgtnc 1921 tatatcatat ncncaaataa ncncaagatg gattagagac ttaaatgcaa agcccaaact 1981 atnanaatgc ttgaaagcaa cctagacaat accattccgg acatgagcag gggcaaagat 2041 ttcatgacaa agatgccaag agtcattgca gcaaaagcaa aaattgacaa ataggatcta 2101 attaaaataa agtttctgca cagcaaaaga aactatcaac tgagtaaaca acctatagaa 2161 tnggagaaaa tnttnncaaa ctatgcatcc aacaaaggcc taatattcag aatctataag 2221 gaacttaaat gaatctacaa gaacaaacaa acaatgccac aaaaaatggg taaaggacat 2281 gaacagacag tattcaaaag aagacatcac atatggccac caagcatata aaaaa //
GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS CULESTB1 3105 bp ds-DNA INV 27-FEB-1990 DEFINITION C.pipiens esterase B1 gene, complete cds. ACCESSION M32328 KEYWORDS esterase. SOURCE Mosquito (C.pipiens quinquefasciatus, strain Tem-R) 4th instar larva DNA. ORGANISM Culex pipiens Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae. REFERENCE 1 (bases 1 to 3105) AUTHORS Mouches,C., Pauplin,Y., Agarwal,M., Lemieux,L., Herzog,M., Abadon,M., Beyssat-Arnaouty,V., Hyrien,O., Robert de Saint Vincent,B., Georghiou,G.P. and Pasteur,N. TITLE Characterization of amplification core and esterase B1 gene responsible for insecticide resistnace in Culex JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by N.Pasteur, 23-FEB-1990. FEATURES from to/span description pept 406 546 esterase B1, exon 1 1322 1459 esterase B1, exon 2 1512 2393 esterase B1, exon 3 2455 2916 esterase B1, exon 4 pre-msg 268 3016 esterase B1 mRNA and introns IVS 547 1321 esterase B1 intron A IVS 1460 1511 esterase B1 intron B IVS 2394 2454 esterase B1 intron c rpt 155 171 direct repeat copy 1 rpt 176 192 direct repeat copy 2 signal 246 250 pot. TATA box signal 2998 3003 poly-A signal BASE COUNT 904 a 671 c 700 g 830 t ORIGIN Chromosome 2. 1 tacataccgc aataatcaaa atataacaaa aataatgttt tttttcctct caaatatgtt 61 gaacaatccc attttacccc gcatctgcat tcattttaag agagaattgt ctgtgtatag 121 gtagagtagt ctcgcaggta gggtacaaac acgagcgatg aacacatttg catttgcgat 181 gaacacattt gcttgatgcg tgttttaaaa cttagcagtg caaagcagaa taccaccaac 241 atcgcaattt tcagctccac aaatcatcag tacagagtgg gcagccgcac cgagctgttg 301 gtgcaagtca attcagctga gcaaaccgaa aaaaaaaaac ttcgaagagt cacacccagc 361 tgatagcgaa aatttaagca acaaaaaaac tccaatctac gtaggatgag tttggaaagc 421 ttaaccgttc agaccaaata cggcccggtc cggggcaaac ggaacgtatc gttgctggga 481 caggagtacg tcagctttca gggaattccg tacgcccggg caccggaagg ggagctgcgg 541 tttaaggtga gagtggtaaa ttgtttcaag tgctgttcaa attttatgga tgtgcaagtg 601 catttttgtt caaataaaga gcaacgagtg ctgctgatta gcgctgtatc taagagtgtg 661 acctccgcgg cttgatactt caccgcaatg acatgacata tttgttgata aaaataaaat 721 gtaataaata ttttgtactt ggacacgtac aaaccagtca tggtctaggc tggaaataaa 781 ttgaagagtg acaaagtcaa acataaattt ggcagtgact tccattttga taaaagataa 841 gaccataaac ttaatccaaa attgtttgtt ggacaattga taatgaagtc tgtgcaagta 901 gaagttattg aaacattaga ctgtttaaaa tttaatttaa tctatgattt aatctttatt 961 ttgttcaata atctaaattt ttgtaaaaca aaaaaatctt cagaaaacct atcaactgca 1021 tgctataaca ttttcaaatg acttcgaaat attaccgaaa taacattaaa tttgtcaatc 1081 aaatttagta gttttgacat tttgaaaatg ctaaaaaaat gatcaaactt gcaaaaccat 1141 tttttttttt cataaatcct cagcaatttt gcgaacaaat tattaataaa aagaaacgtt 1201 caaatatcac acctcactgt gtaaaatatt ttaaccagca ctgtatttcc cggtgacttg 1261 aacgcaaaac aagcaacgaa caaaactgga aatttaaaaa cataacctta aaacatttca 1321 ggcaccagtt ccaccgcaaa agtggaccga aacgttggac tgcacgcagc aatgcgagcc 1381 ctgctatcac ttcgaccggc gcctccagaa gatcgtcggc tgcgaggaca gtctgaagat 1441 caacgtgttt gcgaaggagg tgagttggtt tcaaagaatc aatttcaact ctgaattcac 1501 gatttctcca gatcaaccct tcaacccctc ttccggtgat gctgtacatc tacggcgggg 1561 gcttcacgga aggaaccagc ggaaccgaac tgtacgggcc ggatttcctg gttcagaagg 1621 atatcgtgtt ggtgtcgttc aattaccgta ttggggcgtt aggttttctg tgttgtcaat 1681 cggagcagga tggcgtaccc ggtaatgccg gactcaaaga tcagaacttg gccattcgtt 1741 gggttctgga gaacattgcc gcctttggag gagacccgaa gcgcgtgacc ctggccggcc 1801 atagcgcagg tgccgcttcg gttcagtatc atctgatttc ggatgcgtcc aaggacttgt 1861 ttcagcggcg tatcgtaatg tctgggagta cgtattccag ttggtctttg accaggcaac 1921 gcaactgggt tgagaagttg gcgaaggcca tcggttggga tggacagggt ggtgagtccg 1981 gagcgttgag attcttgaga cgtgccaaac cggaggacat tgttgctcac caggagaagc 2041 ttctgacgga ccaggacatg caggatgata tctttactcc gtttggacct accgttgaac 2101 cgtacctgac ggaacagtgc ataataccga aggcaccgtt cgagatggct cgaacagctt 2161 ggggtgacaa gattgatatc atgatcggtg gtacttctga agaaggactg ctactgctgc 2221 aaaagatcaa gttgcatccg gaactactgt cccatcctca tctattcctg ggaaatgttc 2281 ctccaaattt gaagatcagc atggaaaaac gaatcgagtt tgctgccaag ctgaaacaac 2341 gttactaccc cgacagcatt ccttcaatgg agaacaacct gggatacgtt catgtaagtc 2401 caaacctaac ctcaatcacc acatcaactt atcaaccctc cacaaaaatt ccagatgatg 2461 tccgaccggg tcttctggca cggcctgcac cgcaccatcc ttgcccgcgc cgctcgatcg 2521 cgcgcccgca ccttcgtgta ccggatctgt ctggattcgg agttttacaa ccactaccgc 2581 atcatgatga tcgacccgaa gctgcgcggc acggcccatg ccgacgagct gtcctatctg 2641 ttttccaact ttacccagca ggtccccggc aaggaaacgt tcgagtaccg cggtctgcaa 2701 acgctggtcg atgtgttcag cgcgttcgtc atcaacgggg atccaaactg tggcatgacg 2761 gcgaagggtg gtgtggtctt tgagccgaac gcgcagacga agcccacgtt caagtgtctg 2821 aacattgcca acgacggggt ggcgttcgtt gactatccgg atgcggaccg gttggacatg 2881 tgggacgcaa tgtacgtgaa tgatgagctg ttttgaggag aaagttttta aaatatccta 2941 ttgagatttt gcaatgcctt atttaatctg ttgttttatt ttaattatgt attgttgaat 3001 aaatttatac tatataacaa aagttttatt cttggagtca ttccgctaac ttttttgttt 3061 gtgttcaaca tattgaagat ctgacaaccc tatcaaaggt tatag // LOCUS DROFASI 15288 bp ds-DNA INV 26-FEB-1990 DEFINITION D.melanogaster fasciclin I (FasI) gene, complete cds. ACCESSION M32311 SOURCE D.melanogaster DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 15288) AUTHORS McAllistar,L., Zinn,K., Rehm,J. and Goodman,?. TITLE no title given JOURNAL Unpublished (1990)Howard Hughes Med. Inst. Berkeley, CA 94720 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by L.B. McAllister, 22-FEB-1990 FEATURES from to/span description pept 1593 1697 fasciclin I, exon 2 (first expressed exon) 7090 7222 fasciclin I, exon 3 7304 7470 fasciclin I, exon 4 7545 7775 fasciclin I, exon 4 7843 8011 fasciclin I, exon 5 8089 8249 fasciclin I, exon 6 9814 9822 fasciclin I, exon 7 10989 11143 fasciclin I, exon 8 11210 11300 fasciclin I, exon 9 11364 11584 fasciclin I, exon 10 11644 11806 fasciclin I, exon 11 11871 12018 fasciclin I, exon 12 12428 12633 fasciclin I, exon 13 IVS 824 1449 FasI intron A IVS 1698 7089 FasI intron B IVS 7223 7303 FasI intron C IVS 7471 7544 FasI intron D IVS 7776 7842 FasI intron E IVS 8012 8088 FasI intron F IVS 8250 9813 FasI intron G IVS 9823 10988 FasI intron H IVS 11144 11209 FasI intron I IVS 11301 11363 FasI intron J IVS 11585 11643 FasI intron K IVS 11807 11870 FasI intron L IVS 12019 12427 FasI intron M ORF 1 164 ORF (AA at 3) BASE COUNT 4691 a 3381 c 2990 g 4225 t 1 others ORIGIN Chromosome 3, map position 89e-f. 1 ctcgaggaga ccccgaaggt ccagctggag accccgatgc tggcgttgca gcctctgcag 61 ctcggcagtt gccaactgat ccagctcgct ttggtcagtc atcgttattg gctttctgaa 121 ttcggtcttt gcaatggtat tgtcaaaact gataatgata ttaatttgat ttcaatagca 181 aacaagcgtt gctggtggca agttgcttag taacggcgtg gtttatcgaa gacttttacc 241 atttacttaa agtaagaaga accaaatagg tacactgctt gttaaaattc aatgttatct 301 ccttagaaaa gtaaaaaata agtgcttgaa aattataacc ggttcaattt ggaataaaat 361 ctttcaaaca gtagagactc tgcttagcgg agttatccac cgcattctgt cgttgccaca 421 ttttgcgaaa cggttaaaat gattatttat gcttttaaac gaaaaatgag aaaatttaaa 481 gcagtcttct tgtaataata ataagattgg ataaacagac attattataa atctgacaca 541 gaagagcaac gcagcgctcg atttgtacta atattttacc tctaacaaga ttcggaaaag 601 cataaaaagc acataaataa gagtcaccgc caaagacatg cacaaataaa tataaatgtt 661 aagcgattta ttaacagtgc acagtgttgc gatttgaaac atcgacagct atcgataaca 721 ctcgttatcg tcaccacacg ctcggcttaa acctgttctc cgcttttcgt tttaattcga 781 aaaaaacgtt cgagcgtggc gcacgttgcg attcctccga ctggtacttg ctccccttat 841 attgaaactc tgtggaaaat ctatactgtg catagggcat aactctaaaa tagctgcacg 901 gcccatttcg ctgcccttgc tattgattta ccaccgcgct gcgttcgttt gtgtgtgcta 961 cttaaagaat ttgtgtgttt tccaagtttg gtccgtggac attgcataat ttctgcggcg 1021 gccgttaatg ttaattcctg cagcccgagt ttccgagaat tacgcagaat aaagaccaga 1081 gagaaaacta taaaatcgaa aacagaaaaa agtgccgcag cagcgaaatg caaaggcgca 1141 taataattaa acacacagcg acggaatgaa gaaaaaataa tacacaataa gcgcagcttt 1201 gtttctagtt aaattgcgtt tgtgttgtgc ttgccgattc ctcggtggtc cgtttttcgc 1261 ttgttgttct atgtgacata acggaactct gggcaaaagc gaacaggaag cagcgataac 1321 cttgcaaaaa caaagaaaat accaaggagg acaaaaagca tgccaagcat atatctgtga 1381 aataatattt tcttttccga ggaaatgctg tttgtcgtcg gctaactgtt gtttgccttt 1441 gaattgcaga tcttaatcgt agagcagcac tcacaccagc acacgccccc cgcaaaacag 1501 cacacacagc cacactcaca aacgacgtac actgccagcg cataaacaag tgctgttgtg 1561 gttgttgttg ctgctgctgc tcgtccgcta aaatgctgaa cgctgcagcg ctgcttttgg 1621 cgctgctctg cgccgcgaac gcagccgccg ccgccgattt ggcggacaaa ttgcgcgatg 1681 attcggaact ctctcaggtg agtcagaaga agaagaagaa gcgaataacg agaacgtgga 1741 ggatgaggag gtgaaaggag gtggtagagt aagagcacga agaaaagcag tagcagcaga 1801 aacagttggt gcaaataacc gaaacacttg tccgtagtca attaagcccc cagcaccacc 1861 caccaaacgt cgccccccct ccccccgctc acccttgggg tacagtaggg attgcctgcc 1921 atgaacatta gtaatccttt tctatctttg caaccgatta aattaatgcg aagagttaca 1981 cttcgaatat ttgtcaaaag tatgtttgtt taggtatccg tacgcgcaac atgtaaaatt 2041 tgaacacatt caggggcttt tattagtcta atatatcatt tcgttggaaa agtatttctt 2101 aactgcctac attttataca gtaatcagtc gcattttttc tgtagccagg gaaactgcca 2161 aaattaaaat aacttttgtg tgagcaaaaa acaaactatt tttagcagac atttatgcgt 2221 ttaaataatt ttttcagcct tttgaatatg tttcatatat ttctgtgatg ccttaatttc 2281 ttctcatggt cggatacctt attgcaatga taaaatcaat tttttattat gcctgacttg 2341 gctttggcga agtcaataaa tttaccttgc cacctaatag tggcaactta tcaaagttcg 2401 gctgtaattg tgccaccacc cacaagacgc cgccccaatt cgttggcctt gaaaggaaaa 2461 gccttggtgg agtgtgcgag gaggcggggc agatcgggcg ggaaaaagga gcctgcgcaa 2521 gagggacgca cccgcatttg ggcacaggta gcgtagcctt cgttccggca gcatcgtcct 2581 tgacctttag ccatctcact tgttgttgga ctgcggaaaa tcgatgatgc tgcctaaaat 2641 ccctggcggc agttcaggtc cagccgaccg ctaagctcca cttggccaca tttcgtatgc 2701 atggcggagt tcgttatgta aaaatgaaaa gccacataaa ttaagcccag gtctgctgaa 2761 ttggctctcc ccagcaaaag gcaaatggta gaaaaaacgc cggccgttga cgtccactgc 2821 gcataacagt tttatattta gttttgcctt tttacacctt cacaggccaa ctcacataca 2881 aagtacatag aaaacaaaaa cacattaatg tgtaaaaaac tcagaagcca acaaaggtgc 2941 gcggctgaaa taacaagaac aacatgtgta atgaattcaa aggcgggggt tcagtcacca 3001 gaaggggtat tagcattaca ccaggagaaa tatatatata tttttttaat atgttaaggt 3061 ttatggtagc ccattaagcc acataactta tactcaaaaa atagaattga tttgagtgga 3121 aaatagtaac atagttcgtc tattgtaatt gcttttatct tatatagtag gtgaagtagt 3181 agttgaatac gcgtttttaa gtaactgact tgagtccctg cttaattttc tagtctgcct 3241 taacactgca tttaacaccc gaaatacacg ctgcttcaaa agtaaattag acttccaagt 3301 aaaaagaaag cgtgtgacag actgtaattg ttgcagcaag tgtctcaggg aagtagttgc 3361 atccacacag aaaaggcaac cttttcgagt tacaataggc gaaacagagc gtacaactaa 3421 aaaaaattaa aaacaagaag ggggactcgg cttgctcact tgcacccttg ctttagggca 3481 ttgtaacttt gcccagaagt ttgccacttt cggagcaaag catttccggc ctcatatctg 3541 gagtactgct atacatgagg gtgggaccat ttggtcgcca taaaaaagta agggcggaaa 3601 gtaatcttct ggcatttatg caaattaagt gccgtaaaac taatggatca aaaagtcagg 3661 cttgggaaat cccttttaaa gttacttctt aagactatga aataatcatt ttattaataa 3721 tatgtcgtat tgcaggtagc tgctttcaat ccaatcccct ttttaaattt tgcgatttat 3781 atattttctc agttgtattt aaaatgtaga caagactggt tcaaaagggt atctccatct 3841 tctaaagtcg aagtcgggct tctttactct ggtgcctgca cttctcatta catgctgatg 3901 gcttagataa atgtaggttg cctgttttgg tttgcggctt acggcttacg gcttgtggtt 3961 ttgcatggca aaccaccgag tttaggagag agtttcgaaa ttagcagata attgagccac 4021 gccgatctga gttttatatt caacttcttg acagccgaaa tgaaagtaca ctccgaggaa 4081 cgagctgacc tcctgcgttt gacccccact tcctgtcgtg cgtcagccaa cgtcttgcgc 4141 gttttattaa ttttaatttt taacctttga ctgatgcaat tgcagttggc cgagtggggg 4201 tgtggacagc ttgggttgac aactgcgggg ctggctgaac tctgtctagg aacactcact 4261 cgtatggctc acggcacaga acagtccaca ctcgcagcat catcgaattc aattaaattt 4321 aatttaagat ttcaatgcga tgccaaggcg cgttatttga tatgcgacga gctccattac 4381 aggcgaggcg atgaatgagc cacgactgac cttaatggat ccagatgctc atagacggat 4441 ctctgtccga cggagcactg aaggtgcaca gttttacgat cctaggcgtg ggggcggaat 4501 ggcctcggag tgatgccagt gtagaacccc ccatttcgat ccctttccac tgctgaaccg 4561 ccaataaaag gagaatgctg ttgccccaat ctctcgattc cgaaccggaa tccgaagtcg 4621 ggggttgaga tcgctttcgc ttttgcttct gttttctgtt tctgctgccg ctgcaattat 4681 cgaaaagaaa atatactgaa caaccgacat atgcatacat aatttgtacc aattaaattt 4741 atttatacaa catgcactcg cccagaagga aacaacctaa ccctcctgct ccgatttcag 4801 tccctagtcc aaaatacgaa gcttggctcg ctctcttggc taattagttt taatggctcc 4861 attagatgat ggaaattgaa tttcctccaa acaggcttac cctctcatga ttgttgttgt 4921 ttttggtata ggaattagta caatttttgg ctacaaatcg aaagaaaaca gaacaaattt 4981 gaactgccag cggttctctc tctttctctt tctttctttc tctctctcta tctctttcag 5041 agcactgcca aatccttgag gcgccttatg ccagttaggg taagcagcaa ttccaaaaaa 5101 ccaaacagaa acacacgaac acaattagct attgtaaatt tggaacaaac tcaaaagaca 5161 ttgtaaatgt acaactcaca tgtttttggc ttaatttagt taaccaaccc gtactcatcg 5221 ggccattggc aaattattgt ttgctccgcc gtctgactct cacccctgat gttcgttgtt 5281 cgtctgtggc cccagggtca ttggcttggc actcgggaaa atcaaaatgc gcttttgaat 5341 aaacccaccc tcttgggagc acttttggcg gttgggcggc gagtggtgtt tgcatagtaa 5401 atgagacgcc ccagaccgaa agggaaactg tttatcgatg acagcataat actctaacaa 5461 ttgtatcaaa ttttcgggct tccaaaaaag gactccccgg ggaagagctg gccttatcgc 5521 gatgatgtgc caaaagagcc ggaaactcat taagatctgt tcccttccgg cccggccttt 5581 catcaaatgc aaaacattcg ataacggaaa tttatgaaag taattgcatc taattgtcaa 5641 aggcttccag ccagaaccca agggggttga atttagtaag gatgctgaca gatggccatt 5701 ggaaaatggc ggccaagaga aagtgtcgca aacaatatca ttgcctattc aataaaactt 5761 ctctgtactc tattggccat tatacctgaa cgtatcccat atcaacctgt gacactgtga 5821 catatgccac agccattccc caggacccca cttaacagca gggcactctc ttggccagat 5881 aagcgaaaac atttcgagag cgatatagac accagcaagc agttctgacc aagctggcca 5941 agaagacggt gtaagaacaa agcgcgctgg cgactcgaat gtccacgacc aggacaacaa 6001 caacgctaac ggccccgaca actgacggtc tcaaattacc aaaaagaaaa tgaatgcaga 6061 gcgaggataa atgtgtattt acactcagga aaaggacgaa aattcctcaa aaaaagagta 6121 tttcaatttt aaggtgttaa gttcgttcgt tgatgtttga aaggaattca aaggcatatc 6181 aatatcagta aaagaaatta caagtaagga tcccactggc aatttaaatt taagcatata 6241 tttccaatta tgaattgaat ggattgaatg gacgtgaagc acctatttct aggcctagcc 6301 actttttttt cggtgtacat gcgtgtgtat tggcgatgcc agagcataga tatgtacagc 6361 aggcagggga aatgggaatg gaaaagggaa aggacggact tttaaaccca aattgccatg 6421 actcatgccg ggcagcacac gaaaacacga gagtcacacg accgttttaa aggagggcac 6481 ggaaaaaggt gttgtcctcg gcaggacacg cacccacaca ggcacaccaa gacacagata 6541 tgtccgcagg acctcataaa gccgactggg aatgtgcagc ggggcgtggc ggcagtggtt 6601 ctccacgccg ggaagccgca gagtgtgcgg ttgcatgtgg cctcatatgg caggggcagg 6661 tggctggccc gcccgagcaa atgactctgg gcccggttta gggtcagtgg ctggcgtttt 6721 ggggtggctg gaaaatgcat tttcacatcc cgctctcaat tgaatgacac acaatccact 6781 tgggcggcgt aacccaccca gaggcttacg taattggttc aataaacctt ttttttctag 6841 tacgccgccc tggcatcacc accaggacat gcataaagtt gttcctcatt ttattagttt 6901 gttagacgtt gccatggtgc tggccggttg ccgttttgcc aaaaccgccc agcaaccgtc 6961 tgcgaaccac cccttaaggc acccactcca tcctagatcc accacccact actccatccc 7021 actcggttcg ataagatatt ggcaatagag cgacgcgaat gagtgtaacg aacccgttta 7081 tctttgcagt tctacagcct gctggagagc aatcaaattg ccaactcaac gctttcgctg 7141 cgcagctgca cgatctttgt gcccaccaat gaagccttcc agcgctacaa gagcaaaacc 7201 gcccatgtgc tctatcacat tagtgagttc agtttcaatc aggctgatta aaaataataa 7261 aaatcaaaat aacatgactc taccctctga ttactctttg cagccactga ggcgtacacc 7321 cagaaacgac tgccgaatac cgtgtcatcg gacatggccg gcaatccacc gctgtacatc 7381 acaaagaact cgaatggcga catctttgtg aacaatgccc ggatcatacc ctcgctcagt 7441 gtggagacaa acagcgatgg caagcggcag gtcagtggcc ccacatatgc atatgttgca 7501 tttcatagag tgattgaaca actctcttca ttacttaccg acagatcatg cacatcatcg 7561 acgaggtact ggagccgctc accgtcaagg ctggccattc ggataccccc aacaatccga 7621 atgctctcaa gttcctgaag aacgccgagg agttcaacgt ggacaacatc ggtgtgcgca 7681 cgtaccgcag ccaggtgacg atggccaaga aagagtcggt ctatgatgcc gccggacagc 7741 acacgttcct ggttcccgtc gatgaaggct tcaaggtgct tttctagttt aattgcatta 7801 tttgttacgc attgtaatct cactgttgct ttatgactcc agctctcggc tcgcagcagc 7861 ctcgtggacg gcaaggtcat cgatggccat gtgataccaa acactgtcat cttcactgcc 7921 gctgcccagc atgacgatcc caaggcttcc gccgcttttg aggacttact caaggtcacc 7981 gtcagtttct tcaagcagaa gaacggcaaa agtaagattg cactaaagac ataccatttt 8041 actaaaggtt tacgaatgat tgtatggttg atataatttt acttgcagtg tacgtcaagt 8101 caaacaccat tgtgggtgat gccaaacacc gcgtgggcgt ggttctggcc gagatcgtga 8161 aggcgaacat cccagtgagc aacggagtag tccatctgat ccaccgcccg ctgatgatca 8221 tcgatacgac ggtcacccaa ttcctgcagg tatgtagggt gttgaagact agcatccagt 8281 gcaattgtag acacacagac acgacacaga cagtccaaga aacaaacaca tagaaacaat 8341 gatagtgcct gtggcttgtg ataaggcttg gcattttgat tggctcggca cgcactgaaa 8401 gatagcgctg gcggcagacc gatggactgg gaacataaat cacttgcctg ctgccccagc 8461 gaatgctaca aacacatgag ctaaacacat tgaaacatac acactgataa gcggcttttc 8521 tttgccctaa gtgagcagct cctccaagtg gcgccaaagg gataatattt ccaaggttga 8581 agtttcgtcg gggctcaact gagggcaaag gccaaaacca aataatcttg cacccagaaa 8641 ccactgccta aacacataca gtataaataa ataagatcaa tgcagacact agttttcagt 8701 atcccctaga tatatctaac taatacaaat gaaaacccaa aagactgatt ttatttggaa 8761 ctgaattcgc tcaaaagtct gtattccgat tcggttctat gtaaaatcaa aagcaaaatt 8821 ttattttaaa ataattttta aaatataaat tttattttaa aaaaattttg ggtgtaatgt 8881 aagtattgcc aaaaagctga aaaacaaaaa aaaaaaaaac aaatcttaaa gcaatgtatt 8941 tcatgtaaat ttttctatga atcagatttt cgtatcaata tgcatttcac aagcagttgt 9001 ttgcattgca attgcaaaaa actgactcag tagggcaacg cgctcagaat tggccctcga 9061 aattcaatac gtgatttagt caccaattgg caaacactaa aaaaanttgc ttagcggctg 9121 cgctttgctt gatctgaccg aaatcgacca aaacttcatt cacaaaaact tccattgttc 9181 tatattgaaa tcgaccaaaa cttcattcac aaaaacttcc atttttctat attgaatgta 9241 ttggtacttt aatggcgtca atgaactaag caagtgtata aattattccc atcattgatt 9301 ggctaaagtt cttcgatccc tcgaccgccg tgcatatcca aagatcagtc atctgtacgc 9361 tcgcctacag ccggctagct ttgatcatca ccgtaaccgt agatgtatat gtatacacca 9421 agcataaact gataacaaac cacccacccg ctcgataaat agttaccctc ctcataggca 9481 tatacattag aaacagtgcg tttcgaaact gtgaggcaaa gtgtcgccga acagttttga 9541 gtccgcactg tagaacggct atatcattcg accgctcgaa tcgtaagcgc cttcatatcg 9601 tgctatcgtt ttcgaaattt tcaaactagt tttgttgtaa ttacgtgcga tactctcctt 9661 ttgctaattt actgtatcca tacctctatc ctttgtttcc gtcacatgcc gaataattgt 9721 acttaaaact acaaaacaca atccaaccca accacatatc gctcgaaatt gaatcggtgc 9781 acccaaatat tgaaacaaaa ccaaaacgtt cagtcgttca aggtaaacaa cagactcaaa 9841 atccgatttt aattttaagt tgttcggttt cttttgactt gcattttgtt ttgcatcccc 9901 tgctatttta gctatgcatg agtttattac aaaaaatgtg tagacctttt ttgtaaaatt 9961 agtccattgg ttgttaaagt cgcgatgcct tagggtctaa gaatatttct tctgattttt 10021 attcacaaat agtctttata ggatgtacag cactagttta aatttgcaaa tgaagcaagt 10081 gcacatgata ctttaaagtt aaagagtatt tatcgtatga tgagtttaca gaaattttac 10141 ctaattactt aagataaact tctatttttt gtcggagctt ttatatcctt gttttatttt 10201 tttaaatcat agtaatggaa attattatga ttcgctgtaa attgttgaaa aatgcacgta 10261 actcagtaac attgtacata atggagccaa accgtttgcc tgttctttga aaaatgtcta 10321 gaacaatgaa agattaaata taattccctt gtgcttcatt tcaaacgttc gtttgtggcg 10381 gaacaactcg ggcaatctgc gtggaaccca aatcccatat taaacaatat ttaaattcaa 10441 ttttattcga aacttgtctt caccaaatca cttcgaatga atattgaact atttctgcta 10501 acactccgtt ttaaaactga tttagtttat gaacgtaagt aaacatcaca ccagcagaca 10561 aagaacaaaa accacaacag tcatcagtgc aacgctttgc tatcaaattg ccatccaacc 10621 caaccaccgt gcaaacaatt caatgattcg gtagattttc cgtcattaaa tcgcataaat 10681 tcagctgtgt taagtgagtc attaagcgag cattcccatt gattcacaac ttaattggcc 10741 agccatccat tcccccaccc atatgtgtat aactggctta cattataatc ggatatattt 10801 ccaattgttg tatttttttc ctaattcttt gtactactcc caaaaaaaaa aaaacaattt 10861 ataagcttat gcatacccct cttccattca ccgtttattt atttatatgc attttgtttt 10921 aaggtacttc aaatagcatt atgatatgtt cattggtatt aattttgcaa tctatcttga 10981 aggagaagga gaatgctgag aacggagctc tgcgcaagtt ctacgaagtt ataatggaca 11041 atggtggagc agttctggac gacatcaata gcctgacaga agtgaccatt ttggctccca 11101 gcaatgaggc ttggaactcc tcgaacatca acaatgtttt gcggtaatgt tgctcaaatc 11161 gggaagataa tatttaatta tcataataat aaataaatac atcttacaga gatcggaata 11221 agatgaggca gatcctgaac atgcatatca tcaaggaccg cttaaatgtg gacaagatca 11281 ggcagaaaaa tgcaaatttg gtgagcttta catgagctta taatcaaatg gaaatttttt 11341 acaaacacca ttcttttcaa aagattgccc aggtgcccac tgtcaacaac aacactttcc 11401 tgtacttcaa cgttcgcggt gagggatcgg ataccgtgat aacagttgag ggaggcggcg 11461 tgaatgccac cgttatccag gctgatgtgg cccagactaa tggttatgtt cacatcatcg 11521 accatgtgct gggcgtgcct tacactacag ttcttggcaa acttgaatcc gatcccatga 11581 tgaggtatgt tcaattttaa agctttatta gtacgctgtt taaattttat cgtattcctt 11641 aagtgacacc tataagatgg gaaaattctc gcactttaat gaccagctga acaacacaca 11701 acgccgcttc acctactttg tgcccaggga caagggctgg cagaagaccg agctggatta 11761 cccatcggct cacaagaagc tttttatggc cgacttttcc tatcatgtag gtttttcacc 11821 ctaagcctat taggcttttc tattatactc aaaatgtttt gtaaccgcag tccaagtcca 11881 ttctggagcg tcatttggct atttcggata aggagtacac catgaaggat ctggttaagt 11941 tttcgcaaga atcgggcagc gtaatcctac ccacgttccg cgactctttg agtatccgcg 12001 tggaggagga agctggacgt aagtataaca cattggatac caagggctca gttgcttgtc 12061 ctttcaattc aagtcgattc gtttcaccta atttgattga ttctattggt cttcccatac 12121 cgtgctgcat tactaaccga acctgttttt ttacaactct taaactaaat ttttcgcttt 12181 ctgcttaact aaaactaatc gttgtccaat caaaagatct ccatgatgag tatgctagtc 12241 acgaatggac tggtgagtgc tgccgttttt caaacccaca tcgcaccgcc cacaacgaaa 12301 tccttgacct ggccatgtcc acccatagct cgctcattgg gcaatccact agcattaacc 12361 acccatcgat tgcataccaa ttttctattg tctagtggct attaacaacg attacttcca 12421 tttgcaggct atgtgatcat ttggaactac aagaagatca acgtataccg gcccgatgtt 12481 gagtgcacca acggaattat ccacgtcatc gactacccac tcctggagga aaaggatgtg 12541 gtcgtggccg gaggtagcta tttgccagaa tcaagcattt gcatcatctt ggccaacctc 12601 ataatgataa cagtagcaaa gttcttgaac taaatgcatc cgatatgtaa aaacaaatcc 12661 aatccaaagc aaatgcaaat caaacacaac aacaacagtc gtctacagaa caagaaccaa 12721 caacactcag tatcagacta acttaacatc cacatggatc taaataatca gcaccagttt 12781 gttgataccg atcgaaaacc acaagcaacc caaacagtat ctgtaatata tacgtcacaa 12841 ggaacgatca tcattccaac ccaacatgcc cacgtccacg ccatctacat acccacatac 12901 acacactcag aaaaaaacga tgcagaaaac tgcctgctaa atttgctttt gcatcgacag 12961 cttctcccaa aatagctggc aagagagact aagataaagt cgaagaaacc cctatgttta 13021 agtattaaac tcgattttcc ctcaaacggc cttatgtata atattgaata tgaaactttc 13081 tgtaaaacat tttaatttta tgttcgccct atactggtag cttacgttca tcagctcgct 13141 atgttttaag tttagaccca cactgacatt ggtatcgttc catgtaacaa ctgattgtgt 13201 ttatgatttt aagtgtacat tttttgtgaa ttgcttgttt tagttaatgg tctaatttat 13261 gatttatgaa atgagttatc ttaagtagaa tgcgaatctc attgtacatc tcgaaaacga 13321 agaagctaag cataagcaat aagcgcatca cacactctca tactcgagcg cccacacatt 13381 ccccctattc ctatccataa atacatacaa catgtaacga aacaaagttc attgtaagat 13441 ttgaaagtgt gaaatttgat aataaactga aagtttttca tttgaacctt actattaaga 13501 gcggcagcaa tcttgtctcg aaccaatcgc gtgacccctg taggcctaga actatatcag 13561 tgtcggacaa gaaatgtgca accaacaaac ccaattatat gatatgccat cctgcactta 13621 gttactagcc ccgctatgca tgtgaacccc cagaaaaacc cattcgccca cacacaccag 13681 ccacttggag cgagagtagc cactgcaact aggttagtcc aatgtataag ttccgaaact 13741 gccgacaact acgaaaacta tacagcatac ataaactccc tgtaatctaa atcactcacg 13801 gcacacaact acaactacaa ctacatgtca tcattgaatg gattttgata ccgattttaa 13861 cttgcatata aacaaaaaca agaactagac aacgtgaagg agattttaaa caaaattctc 13921 cctcggtcga gcagttgcat ttcaaacttt gtacgtagtt taaaactagt tttttagtcc 13981 gacgtagaac aacccaattg ctaactatat accaactttc tttctatttc tctctgtctc 14041 tccccctaat gctatgtact tataggttag acattgtaac tattgtaatc aacccagtgc 14101 gttaaacccg agtgttaagt cgaacagtaa cacagaattg tactatcccc aaatgaataa 14161 ctatcagcct accagtacac tgtctcaact ctcaccacca ccaccaccca cttaggaact 14221 cagtcgaact tgaactcgaa atcaaagatc cagttgtggc agtcgcttca cgtagttgct 14281 aattcccaat tcgaaccgat cctttccgaa agtcttatct ttagtatagg tggtttagtt 14341 tcatttggag ccgtgcagtg ccgtagcagc taagtaaaaa tgtatgaaat gaagatgaac 14401 acgagatcga aatcgtacgg aatgatcaga aatcagaaat aaataatgaa tacgctaatg 14461 aattgtacaa gtaagcttta aagaattgct ggaggagcgc ggatcggaga acttagagga 14521 gggagaaccg cattgcaatc gcattgcaat ttgtgtcgta gtcagtagtt acacgttaag 14581 cggcgtctta acgtgtaact agtgccttac taaagataaa cgcattacct taacctttat 14641 acaaatttac tcaaaacata cttgtacccc aagcatacgt tccgcttcga atgataccca 14701 gatatatata cggagttaca ccccaagaat acaagtataa ctacaaatga tattgcgcca 14761 cacgctattt acaccaaata caccaaacaa atcgagaaat gcatattttt catatattta 14821 attgtcagaa taatataacg tatatgtaat atgtagttta tttactgtaa aacgcaagaa 14881 cctaacaagt ggaatttgaa tcacatacaa ttgatgtata ttagctattg agtttctaag 14941 caagcgttag acactgaaat atatgttcaa atatatgatt caaatatata caatatgcga 15001 aaccaagcaa actatggaaa actggagtgc caaaagaata ttattccatt ttattttacg 15061 acaagcgctt tttacaaata aaccgaatcc atttaaatta ctcgtaaaga acagacagat 15121 tatatttagc attagttaaa ctaattatta catgtactag aaaaccgaat gtcaaccgag 15181 aatcttcagc aagcttgagc gaataataaa actttaaaac taactataaa taaatcgacc 15241 gtcctttttt gcacttaatc atgggttatg gttgtggaac ctgaattc // LOCUS DROIAB 2131 bp ds-DNA INV 14-FEB-1990 DEFINITION D.melanogaster abdominal-A (iab) 1.7 and 2.0 kb RNAs, complete cds. ACCESSION M32109 KEYWORDS . SOURCE D.melanogaster (strain Canton S) embryo DNA, and cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2131) AUTHORS Cumberledge,S., Zaratzian,A. and Sakonju,S. TITLE Characterization of two RNAs transcribed from the cis-regulatory region of the abd-A domain within the Drosophila bithorax complex JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.Sakonju, 14-FEB-1990. FEATURES from to/span description RNA 141 1802 1.7 kb RNA RNA 141 2101 2.0 kb RNA BASE COUNT 592 a 468 c 478 g 593 t ORIGIN Chromosome III, bithorax complex. 1 ttagttcgaa gcacgtgcgt gtgagttccg catgaacggt acaccaacac tttcgaacag 61 aaacccctca cagatactcc tcacacagat actctactgt actgggctac gatatctgtg 121 agatacgcac agataccggc aggccgccgg cgtcaaagtc gacgtgctgc cggcgtttga 181 gagaccgagc ggcgtgcaag aaatcgtcgg gcaacggctt caaatgaacg caaaacgttg 241 gaaagcaaac aaccggcaaa agccatcagc agctaaaaat aaaactgcaa ctatcaaagt 301 gaatctgtag agaaagagtg atagctcagt gtgcccagtg tttatttaat tacttattta 361 aatttgattt tcaaaataag ttgtgatttt ggttcgtttg tttgtttccc cattaaatcg 421 catcgcggaa ctggtttgca ttttgtatct tttatcgctg cagttcgttt ctgtgtggtt 481 tcatgtattc cccatctatt ccatctcttt ttgcagatac aagtgaatga aaccctaacg 541 tttaaacgcc accataaaca ccaaataaca aattttctgg caaacatggc atgtcacggt 601 cccatttaac gtggaaaatc ggcccagaaa ttcaacacga cgacaacttt ttcgactgtc 661 ccagaaagca aatatgtaaa gtgtatttgt gtgcgtgcca ttgacattga cttatggtca 721 tgaaggggtt agggggacgg gtctaagggt gggggttgat ggctaaaggg gggtctcgcc 781 gctacgacaa aaacttggat tctctcgttt cttttttgtc tttcgtcttc aacgcacatt 841 tatggtttgt tttgctcaca cacacactca aaaagcctac gcagcggcag agagggcaca 901 agcagcagag aaagagacga aacatgtctc ggatgatgat aaacgcctct cgtctatttt 961 ggcgggtcca tgaatgaaac aattgcctcg aatgactgac tgccagacag atagctagaa 1021 actgccctta cggcgggcaa attggggtat tttgggtttc ggtgactttg agggaccggg 1081 acctgagggc ccggaaagcc gacagcaata aacactgtgc tcagacaaaa gtctgatgag 1141 acagaccatc atgatgctca tcatcatcgg cgataataat gattttcatt tgcacattcc 1201 acccgtttcg ctgttgccgg cttccagcat tcagattcag tttgcattca gtggggcagg 1261 ttatttatgg cacttttcct atcgccactc actggggtaa tcgattttta ttggattaat 1321 attgctttcc gcatgatttg gccttctttt cgttatcact tagcaggttt cttgacgctg 1381 aggagtggac aatgactaaa actcagtcat ctggaaaagc tggaccaagg acatccatat 1441 acaagtgctg gctaaacgtc tgtggttatc gcaatcattt atggcggaat agaaggacac 1501 gtcttatgtg tcgattttcg ggccccatcc ctgttgaaaa atcagccgga cagcagcaac 1561 caccgaacac aagacacagg acgaccggaa gtgaggaaaa cgcttgtaaa atcggtgcac 1621 acgagtgcga gcgagacacg agcgaacggg ccgaaaggaa acgacaatta cgaggacacg 1681 ccggcttgct tttctgcttt tcctgtctgc tcctttcagt tgcggccgtg caatcgtcgg 1741 gtcaacaacg ccgcgtgtcc gatttcaatt tacaaattag aatatacatc tttacatctg 1801 gatttggttt catctttatc tgcatttgga tccgcgacga cgcataattt tcagcacttt 1861 tgaataatgg cgattctgtg cgacctcgta aacgtatact gaatgtatcc tgagtgtatc 1921 ctatccggta taccttcagt atacgtaaca cgagagagca ctagcaacgt gggatgtgaa 1981 acacttttta agacacctta atacacgtag cactttgtta tatggtctta cactttttaa 2041 atatctcgac ttataaatat ataatttaaa tataaatact ttaataatgc gatataagtt 2101 aaatacttcg atttactggc tggaagttca t // LOCUS FVBOPD 1693 bp ds-DNA BCT 05-MAR-1990 DEFINITION Flavobacterium sp. parathion hydrolase gene, complete cds. ACCESSION M29593 KEYWORDS parathion hydrolase. SOURCE Flavobacterium sp. (strain ATCC 27551) DNA, clone pPDL2. ORGANISM Flavobacterium sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Acidiphilium cryptum; Neisseriaceae. REFERENCE 1 (bases 1 to 1693) AUTHORS Mulbry,W.W. and Karns,J.S. TITLE Parathion hydrolase specified by the Flavobacterium opd gene: Relationship between the gene and protein JOURNAL J. Bacteriol. 171, 6740-6746 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 419 1516 parathion hydrolase signal 312 317 -35 region signal 334 339 -10 region binding 408 411 ribosme binding site BASE COUNT 372 a 497 c 477 g 347 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccgagc ggcttaccgg cgccctgctc gaccggttga cccaccatgt ccatatcctc 61 gagatgaacg ggggcagcta tcggcttgcc agcagccgga aacggcaaaa gcacagccac 121 cacgcacagc aagaggaggt gcccccccca tgacctgatc agaaaacccc tcatctgctg 181 tgctgaacgg ccttcgctac gctccagacc gttcagcaca gcagatgaaa gcaccgcctc 241 gacaagaggc tttttgttca atccaactgg tacactctta caccggaatc ttgcacaatt 301 ttaccccggc attgacatct gacgcgtcaa cagtaaaaga aacaaccggt tcagatctgc 361 agcctgactc ggcaccagtc gctgcaagca gagtcgtaag caatcgcaag ggggcagcat 421 gcaaacgaga agggttgtgc tcaagtctgc ggccgccgca ggaactctgc tcggcggcct 481 ggctgggtgc gcgagcgtgg ctggatcgat cggcacaggc gatcggatca ataccgtgcg 541 cggtcctatc acaatctctg aagcgggttt cacactgact cacgagcaca tctgcggcag 601 ctcggcagga ttcttgcgtg cttggccaga gttcttcggt agccgcaaag ctctagcgga 661 aaaggctgtg agaggattgc gccgcgccag agcggctggc gtgcgaacga ttgtcgatgt 721 gtcgactttc gatatcggtc gcgacgtcag tttattggcc gaggtttcgc gggctgccga 781 cgttcatatc gtggcggcga ccggcttgtg gttcgacccg ccactttcga tgcgattgag 841 gagtgtagag gaactcacac agttcttcct gcgtgagatt caatatggca tcgaagacac 901 cggaattagg gcgggcatta tcaaggtcgc gaccacaggc aaggcgaccc cctttcagga 961 gttagtgtta aaggcggccg cccgggccag cttggccacc ggtgttccgg taaccactca 1021 cacggcagca agtcagcgcg atggtgagca gcaggccgcc atttttgagt ccgaaggctt 1081 gagcccctca cgggtttgta ttggtcacag cgatgatact gacgatttga gctatctcac 1141 cgccctcgct gcgcgcggat acctcatcgg tctagaccac atcccgcaca gtgcgattgg 1201 tctagaagat aatgcgagtg catcagccct cctgggcatc cgttcgtggc aaacacgggc 1261 tctcttgatc aaggcgctca tcgaccaagg ctacatgaaa caaatcctcg tttcgaatga 1321 ctggctgttc gggttttcga gctatgtcac caacatcatg gacgtgatgg atcgcgtgaa 1381 ccccgacggg atggccttca ttccactgag agtgatccca ttcctacgag agaagggcgt 1441 cccacaggaa acgctggcag gcatcactgt gactaacccg gcgcggttct tgtcaccgac 1501 cttgcgggcg tcatgacgcc atctggatcc ttccagccag cggccactat tccccgtcaa 1561 gataccgaac gatgaagtcg cgcatcgatc gataggcatc ttcaatttga tcagggctgc 1621 cacctccaaa gccgtggcca cccctgtcga tagtcttgag gacgtagggc acaccgtgct 1681 tttcgaactg cag // LOCUS HUMCSAE 1372 bp ss-mRNA PRI 26-FEB-1990 DEFINITION Human epithelial glycoprotein (EGP) mRNA, complete cds. ACCESSION M32306 KEYWORDS cell surface antigen; cell surface glycoprotein; epithelial glycoprotein; transmembrane protein; tumor-associated antigen. SOURCE Human colon carcinoma cell line HT-29, cDNA to mRNA, (library HT-29/cdm8). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1372) AUTHORS Simon,B., Podolsky,D.K., Moldenhauer,G., Isselbacher,K.J., Gattoni-Celli,S. and Brand,S.J. TITLE Epithelial glycoprotein is a member of a family of epithelial cell surface antigens homologous to nidogen, a matrix adhesion protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.Simon, 23-FEB-1990. FEATURES from to/span description pept 22 966 epithelial glycoprotein (EGP) precursor sigp 22 84 epithelial glycoprotein signal peptide matp 85 963 epithelial glycoprotein mRNA < 1 1372 EGP mRNA BASE COUNT 436 a 233 c 317 g 386 t ORIGIN 1 gctctaaagg cgcgcgcagg catggcgccc ccgcaggtcc tcgcgttcgg gcttctgctt 61 gccgcggcga cggcgacttt tgccgcagct caggaagaat gtgtctgtga aaactacaag 121 ctggccgtaa actgctttgt gaataataat cgtcaatgcc agtgtacttc agttggtgca 181 caaaatactg tcatttgctc aaagctggct gccaaatgtt tggtgatgaa ggcagaaatg 241 aatggctcaa aacttgggag aagagcaaaa cctgaagggg ccctccagaa caatgatggg 301 ctttatgatc ctgactgcga tgagagcggg ctctttaagg ccaagcagtg caacggcacc 361 tccatgtgct ggtgtgtgaa cactgctggg gtcagaagaa cagacaagga cactgaaata 421 acctgctctg agcgagtgag aacctactgg atcatcattg aactaaaaca caaagcaaga 481 gaaaaacctt atgatagtaa aagtttgcgg actgcacttc agaaggagat cacaacgcgt 541 tatcaactgg atccaaaatt tatcacgagt attttgtatg agaataatgt tatcactatt 601 gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc tgatgtggct 661 tattattttg aaaaagatgt taaaggtgaa tccttgtttc attctaagaa aatggacctg 721 acagtaaatg gggaacaact ggatctggat cctggtcaaa ctttaattta ttatgttgat 781 gaaaaagcac ctgaattctc aatgcagggt ctaaaagctg gtgttattgc tgttattgtg 841 gttgtggtga tagcagttgt tgctggaatt gttgtgctgg ttatttccag aaagaagaga 901 atggcaaagt atgagaaggc tgagataaag gagatgggtg agatgcatag ggaactcaat 961 gcataactat ataatttgaa gattatagaa gaagggaaat agcaaatgga cacaaattac 1021 aaatgtgtgt gcgtgggacg aagacatctt tgaaggtcat gagtttgtta gtttaacatc 1081 atatatttgt aatagtaaag cctgtactca aaatataagc agcttgaaac tggctttacc 1141 aatcttgaaa tttgaccaca agtgtcttat atatgcagat ctaatgtaaa atccagaact 1201 tggactccat cgttaaaatt atttatgtgt aacattcaaa tgtgtgcatt aaatatgctt 1261 ccacagtaaa atctgaaaaa ctgatttgtg attgaaagct gcctttctat ttacttgagt 1321 cttgtacata catacttttt tatgagctat gaaataaaac cattttaaac tg // LOCUS HUMNOXF 2206 bp ss-mRNA PRI 15-FEB-1990 DEFINITION Human neutrophil oxidase factor (p67-phox) mRNA, complete cds. ACCESSION M32011 KEYWORDS neutrophil oxidase factor. SOURCE Human promyelocytic leukemia myeloid cell line HL60, cDNA to mRNA, clone 10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2206) AUTHORS Leto,T.L., Lomax,K.J., Volpp,B.D., Nunoi,H., Sechler,J.M.G., Nauseef,W.M., Clark,R.A., Gallin,J.I. and Malech,H.L. TITLE Cloning of a 67K neutrophil oxidase factor and its similarity to a noncatalytic region of P60c-src JOURNAL Science (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.L.Malech, 08-FEB-1990. FEATURES from to/span description pept 68 1648 neutrophil oxidase factor (p67-phox) mRNA < 1 2206 p67-phox signal 2030 2035 poly-A signal signal 2181 2186 poly-A signal BASE COUNT 624 a 499 c 573 g 510 t ORIGIN 1 ctagtctttc agccttcagg ctgtttttgg cttgaagctc tcttggcctc ctagtttcta 61 cctaatcatg tccctggtgg aggccatcag cctctggaat gaaggggtgc tggcagcgga 121 caagaaggac tggaagggag ccctggatgc cttcagtgcc gtccaggacc cccactcccg 181 gatttgcttc aacattggct gcatgtacac tatcctgaag aacatgactg aagcagagaa 241 ggcctttacc agaagcatta accgagacaa gcacttggca gtggcttact tccaacgagg 301 gatgctctac taccagacag agaaatatga tttggctatc aaagacctta aagaagcctt 361 gattcagctt cgagggaacc agctgataga ctataagatc ctggggctcc agttcaagct 421 gtttgcctgt gaggtgttat ataacattgc tttcatgtat gccaagaagg aggaatggaa 481 aaaagctgaa gaacagttag cattggccac gagcatgaag tctgagccca gacattccaa 541 aatcgacaag gcgatggagt gtgtctggaa gcagaagcta tatgagccag tggtgatccc 601 tgtgggcaag ctgtttcgac caaatgagag acaagtggct cagctggcca agaaggatta 661 cctaggcaag gcgacggtcg tggcatctgt ggtggatcaa gacagtttct ctgggtttgc 721 ccctctgcaa ccacaggcag ctgagcctcc acccagaccg aaaaccccag agatcttcag 781 ggctctggaa ggggaggctc accgtgtgct atttgggttt gtgcctgaga caaaagaaga 841 gctccaggtc atgccaggga acattgtctt tgtcttgaag aagggcaatg ataactgggc 901 cacggtcatg ttcaacgggc agaaggggct tgttccctgc aactaccttg aaccagttga 961 gttgcggatc caccctcagc agcagcccca ggaggaaagc tctccgcagt ccgacatccc 1021 agctcctcct agttccaaag cccctggaaa accccagctg tcaccaggcc agaaacaaaa 1081 agaagagcct aaggaagtga agctcagtgt tcccatgccc tacacactca aggtgcacta 1141 caagtacacg gtagtcatga agactcagcc cgggctcccc tacagccagg tccgggacat 1201 ggtgtctaag aaactggagc tccggctgga acacactaag ctgagctatc ggcctcggga 1261 cagcaatgag ctggtgcccc tttcagaaga cagcatgaag gatgcctggg gccaggtgaa 1321 aaactactgc ctgactctgt ggtgtgagaa cacagtgggt gaccaaggct ttccagatga 1381 acccaaggaa agtgaaaaag ctgatgctaa taaccagaca acagaacctc agcttaagaa 1441 aggcagccaa gtggaggcac tcttcagtta tgaggctacc caaccagagg acctggagtt 1501 tcaggaaggg gatataatcc tggtgttatc aaaggtgaat gaagaatggc tggaagggga 1561 gtgcaaaggg aaggtgggca ttttccccaa agtttttgtt gaagactgcg caactacaga 1621 tttggaaagc actcggagag aagtctagga tgtttcacaa actacaaagc tgaagaaaat 1681 gaagccctat tacttgtttg taagatttag cacccttctg ctgtatactg tactgagaca 1741 ttacagtttg gaagtgttaa ctatttattc cctgttaaaa tttaacctac tagacaatga 1801 tgtgagtacc caggatgatt tcctggggca cagtgggtga ggagatgggg acaggtgaat 1861 ggaggagtta ggggagagga aaagtggatg gaagtgtctg gaaagggcac gagagagtct 1921 tccaggtact gatcctgttt cttgctctga gtgctagcta gccagctgtg ttcacactgt 1981 aaacattcat caagctgtac atttggtgca cttttctgtg tcataccaca ataaaaaaaa 2041 acctatcatc atcttacaaa aacaagacac ccaagtccag gcccaaggag taagtacaaa 2101 tattcctgtt tctgaaccat tactgtaatt ggctcttaag gcttgaagta accttatagg 2161 ttactcataa ggcatataca aataaacttg tttgttttct tttttc // LOCUS HUMUDPGTA 1855 bp ss-mRNA PRI 26-FEB-1990 DEFINITION Human 3,4-catechol estrogen UDP-glucuronosyltransferase mRNA, complete cds. ACCESSION J05428 KEYWORDS 3,4-catechol estrogen UDP-glucuronosyltransferase. SOURCE Human liver, cDNA to mRNA, clone 63-11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1855) AUTHORS Ritter,J.K., Sheen,Y.Y. and Owens,I.S. TITLE Cloning and expression of human liver UDP-glucuronosyltransferase in COS-1 cells: 3,4-catechol estrogens and estriol as primary substrates JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.S.Owens, 22-FEB-1990. FEATURES from to/span description pept 15 1604 UDP-glucuronosyltransferase (EC 2.4.1.17) BASE COUNT 571 a 337 c 394 g 553 t ORIGIN 1 tgcattgcac caggatgtct gtgaaatgga cttcagtaat tttgctaata caactgagct 61 tttgctttag ctctgggaat tgtggaaagg tgctggtgtg ggcagcagaa tacagccatt 121 ggatgaatat aaagacaatc ctggatgagc ttattcagag aggtcatgag gtgactgtac 181 tggcatcttc agcttccatt ctttttgatc ccaacaactc atccgctctt aaaattgaaa 241 tttatcccac atctttaact aaaactgagt tggagaattt catcatgcaa cagattaaga 301 gatggtcaga ccttccaaaa gatacatttt ggttatattt ttcacaagta caggaaatca 361 tgtcaatatt tggtgacata actagaaagt tctgtaaaga tgtagtttca aataagaaat 421 ttatgaaaaa agtacaagag tcaagatttg acgtcatttt tgcagatgct atttttccct 481 gtagtgagct gctggctgag ctatttaaca taccctttgt gtacagtctc agcttctctc 541 ctggctacac ttttgaaaag catagtggag gatttatttt ccctccttcc tacgtacctg 601 ttgttatgtc agaattaact gatcaaatga ctttcatgga gagggtaaaa aatatgatct 661 atgtgcttta ctttgacttt tggttcgaaa tatttgacat gaagaagtgg gatcagtttt 721 atagtgaagt tctaggaaga cccactacgt tatctgagac aatggggaaa gctgacgtat 781 ggcttattcg aaactcctgg aattttcagt ttcctcatcc actcttacca aatgttgatt 841 ttgttggagg actccactgc aaacctgcca aacccctgcc taaggaaatg gaagactttg 901 tacagagctc tggagaaaat ggtgttgtgg tgttttctct ggggtcaatg gtcagtaaca 961 tgacagaaga aagggccaac gtaattgcat cagccctggc ccagatccca caaaaggttc 1021 tgtggagatt tgatgggaat aaaccagata ccttaggtct caatactcgg ctgtataagt 1081 ggatacccca gaatgacctt ctaggtcatc caaagaccag agcttttata actcatggtg 1141 gagccaatgg catctacgag gcaatctacc atgggatccc tatggtgggg attccattgt 1201 ttgccgatca acctgataac attgctcaca tgaaggccag gggagcagct gttagagtgg 1261 acttcaacac aatgtcgagt acagacttgc tgaatgcatt gaagagagta attaatgatc 1321 cttcatataa agagaatgtt atgaaattat caagaattca acatgatcaa ccagtgaagc 1381 ccctggatcg agcagtcttc tggattgaat ttgtcatgcg ccacaaagga gctaaacacc 1441 ttcgggttgc agcccacgac ctcacctggt tccagtacca ctctttggat gtgattgggt 1501 tcctgctggt ctgtgtggca actgtgatat ttatcgtcac aaaatgttgt ctgttttgtt 1561 tctggaagtt tgctagaaaa gcaaagaagg gaaaaaatga ttagttatat ctgagatttg 1621 aagctggaaa acctgatagg tgagactact tcagtttatt ccagcaagaa agattgtgat 1681 gcaagatttc tttcttcctg agacaaaaaa aaaaaaaaga aaaaaaaatc ttttcaaaat 1741 ttactttgtc aaataaaaat ttgtttttca gagatttacc acccagttca tggttagaaa 1801 tattttgtgg caatgaagaa aacactacgg aaaataaaaa ataagataaa gcctt // LOCUS PSERCBC 1060 bp ds-DNA BCT 08-MAR-1990 DEFINITION P.aeruginosa carbenicillinase gene, complete cds. ACCESSION J05162 KEYWORDS carbenicillinase. SOURCE P.aeruginosa (strain Dalgleish) DNA, clone pMON709. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Acidiphilium cryptum; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1060) AUTHORS Boissinot,M. and Levesque,R.C. TITLE Nucleotide sequence of the PSE-4 carbenicillinase gene and correlations with the Staphylococcus aureus PC1 beta-lactamase crystal structure JOURNAL J. Biol. Chem. 265, 1225-1230 (1990) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence kindly provided by authors, 25-JAN-1990. FEATURES from to/span description pept 122 1036 carbenicillinase BASE COUNT 316 a 195 c 246 g 303 t ORIGIN 357 bp upstream of HincII site. 1 tcgggcatcc aagcagcaag cgcgttacgc cgtgggtcga tgtttgatgt tatggagcag 61 caacgatgtt acgcagcagg gcagtcgccc taaaacaaag ttagccatat tatggagcct 121 catgctttta tataaaatgt gtgacaatca aaattatggg gttacttaca tgaagttttt 181 attggcattt tcgcttttaa taccatccgt ggtttttgca agtagttcaa agtttcagca 241 agttgaacaa gacgttaagg caattgaagt ttctctttct gctcgtatag gtgtttccgt 301 tcttgatact caaaatggag aatattggga ttacaatggc aatcagcgct tcccgttaac 361 aagtactttt aaaacaatag cttgcgctaa attactatat gatgctgagc aaggaaaagt 421 taatcccaat agtacagtcg agattaagaa agcagatctt gtgacctatt cccctgtaat 481 agaaaagcaa gtagggcagg caatcacact cgatgatgcg tgcttcgcaa ctatgactac 541 aagtgataat actgcggcaa atatcatcct aagtgctgta ggtggcccca aaggcgttac 601 tgatttttta agacaaattg gggacaaaga gactcgtcta gaccgtattg agcctgattt 661 aaatgaaggt aagctcggtg atttgaggga tacgacaact cctaaggcaa tagccagtac 721 tttgaataaa tttttatttg gttccgcgct atctgaaatg aaccagaaaa aattagagtc 781 ttggatggtg aacaatcaag tcactggtaa tttactacgt tcagtattgc cggcgggatg 841 gaacattgcg gatcgctcag gtgctggcgg atttggtgct cggagtatta cagcagttgt 901 gtggagtgag catcaagccc caattattgt gagcatctat ctagctcaaa cacaggcttc 961 aatggaagag cgaaatgatg cgattgttaa aattggtcat tcaatttttg acgtttatac 1021 atcacagtcg cgctgataag gctaacaagg ccatcaagtt // LOCUS RAT5MDAA 2030 bp ss-mRNA ROD 15-DEC-1988 DEFINITION Rat iodothyronine 5' monodeiodinase (5'MD) mRNA, 3' end. ACCESSION M21018 KEYWORDS iodothyronine 5' monodeiodinase. SOURCE Rat liver, cDNA to mRNA, clone 2302. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2030) AUTHORS Boado,R.J., Campbell,D.A. and Chopra,I.J. TITLE Nucleotide sequence of rat liver iodothyronine 5'-monodeiodinase (5' MD): Its identity with the protein disulfide isomerase JOURNAL Biochem. Biophys. Res. Commun. 155, 1297-1304 (1988) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Campbell, 14-OCT-1988. FEATURES from to/span description pept < 1 1449 iodothyronine 5' monodeiodinase (AA at 1) BASE COUNT 527 a 493 c 532 g 478 t ORIGIN 1 gtcctggtgc tgaagaagag caacttcgca gaggcgctgg cggcgcacaa ctacctgctg 61 gtggagttct atgccccatg gtgtggccac tgcaaagcac tggccccaga gtatgccaaa 121 gctgctgcaa aactgaaggc agaaggctct gagatccgac tagcaaaggt ggacgccaca 181 gaagagtctg acctggccca gcagtatggt gtccgtggct accccacaat caagttcttc 241 aagaatggag acacagcctc cccaaaggaa tatacagctg gcagggaagc tgacgacatt 301 gtgaactggc tgaagaaacg cacaggccca gcagccacaa ccctgtctga cactgcagct 361 gcagagtcct tggtggactc aagcgaagtg acggtcatcg gcttcttcaa ggacgcaggg 421 tcagactccg ccaagcagtt cttgctggca gcagaggctg ttgatgacat accttttgga 481 atcacttcca atagcgatgt gttttccaag taccagctgg acaaggatgg ggtggtcctc 541 tttaagaagt ttgatgaagg ccgcaacaat tttgaaggtg agatcaccaa ggagaagctg 601 ttagacttca tcaagcacaa ccagctgcct ttggtcatcg agttcactga acagacagct 661 ccaaagattt tcggaggtga aatcaaaaca catattctgc tgttcctgcc caagagtgtg 721 tctgactacg atggcaaatt gagcaacttt aagaaagcgg ccgagggctt taagggcaag 781 atcctgttca tcttcatcga tagtgaccac actgacaacc agcgcatact tgagttcttt 841 ggcctgaaga aggaggaatg tccagctgtg cggcttatta ccctggagga agagatgacc 901 aagtacaaac cggagtcaga cgagctgaca gctgagaaga tcacacaatt ttgccaccac 961 ttcctggagg gcaagatcaa gccccacctg atgagccagg aactgcctga agactgggac 1021 aagcagccag tgaaagtgct agttgggaaa aactttgagg aggttgcttt tgatgagaaa 1081 aagaacgtgt ttgttgaatt ctatgctccc tggtgtggtc actgcaagca gctagccccg 1141 atttgggata aactgggaga gacatacaaa gaccatgaga atatcgtcat cgctaagatg 1201 gactcaacag ccaatgaggt ggaagctgtg aaggtgcaca gctttcccac actcaagttc 1261 ttcccagcaa gtgcagacag aacggtcatt gattacaacg gtgagcggac actagatggt 1321 tttaagaaat tcttggagag cggtggccag gatggagcgg gggacaatga cgacctcgac 1381 ctagaagaag ctttagagcc agatatggaa gaagacgacg atcagaaagc cgtgaaggat 1441 gaactgtagt gcagaagcca gatctgggcg cctgaaccca aaacctcggt ggccatgtcc 1501 cagcagccca catctccgga gcctgagcct caccccagga ggagcgccat cagaacccag 1561 ggaatctttc tgaagccaca ctcatctgac acacgtacac ttaaacctgt ccttcttttt 1621 ttgcttttca attttggaaa gggatctctg accaggccag cccatcttga agggctacgt 1681 tttgttttaa ttggtggtgt actttttgta cgtggatttt gtcccaagtg cttgctacca 1741 tatttgggat ttcacactgg taatgtcttt cctgttagag aggtttatgc tatcacttca 1801 gatttcgtct gtgagatgtt tcatcttcct gacatgtctt catgtcgagg tacttgttcc 1861 accacgcaga cctccctgag accccttctg ccctgcgcag gaggcgatgg ttctgggtcg 1921 tatgctctct ctctctccac cttgtactag tgttcgcatg acagcatggc ttttgtagtt 1981 tgcatttaac ctggggattt ctgcatcctg tcagagggtg ggtccccacc // LOCUS RIRANT17KA 539 bp ds-DNA BCT 23-FEB-1990 DEFINITION R.conorii 17K genus-common antigen gene, complete cds. ACCESSION M28480 KEYWORDS antigen. SOURCE R.conorii DNA. ORGANISM Rickettsia conorii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae. REFERENCE 1 (bases 1 to 539) AUTHORS Anderson,B.E. and Tzianabos,T. TITLE Comparative sequence analysis of a genus-common rickettsial antigen gene JOURNAL J. Bacteriol. 171, 5199-5201 (1989) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by B.Anderson, 27-OCT-1989. FEATURES from to/span description pept 60 539 17 kD antigen precursor sigp 60 119 17 kD antigen signal peptide matp 120 536 17 kD antigen mRNA 38 > 539 17 kD mRNA signal 1 6 -35 region signal 21 27 -10 region BASE COUNT 179 a 93 c 125 g 142 t ORIGIN 1 tttacaaaat tctaaaaacc atatacttat taattatata ttaatttaga gagaattata 61 tgaaactatt atctaaaatt atgattatag ctcttgcaac ttctatgtta caagcctgta 121 acggtccggg cggcatgaat aaacaaggta caggaacact tcttggcggt gctggcggcg 181 cattacttgg ttctcaattc ggtaagggca aaggacagct tgttggagta ggtgtaggtg 241 cattacttgg agcagttctt ggtggacaaa tcggtgcagg tatggatgaa caggatagaa 301 gacttgcaga gcttacctca cagagagctt tagaaacagc tcctagtggt agtaacgtag 361 aatggcgtaa tccggataac ggcaattacg gttacgtaac acctaataaa acttatagaa 421 atagcactgg tcaatattgc cgtgagtaca ctcaaacagt tgtaataggc ggaaaacaac 481 aaaaagcata cggtaatgca tgccgccaac ctgacggaca atggcaagtt gtgaattga // LOCUS RIRANT17KB 535 bp ds-DNA BCT 23-FEB-1990 DEFINITION R.typhi 17K genus-common antigen gene, complete cds. ACCESSION M28481 KEYWORDS antigen. SOURCE R.typhi DNA. ORGANISM Rickettsia typhi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae. REFERENCE 1 (bases 1 to 535) AUTHORS Anderson,B.E. and Tzianabos,T. TITLE Comparative sequence analysis of a genus-common rickettsial antigen gene JOURNAL J. Bacteriol. 171, 5199-5201 (1989) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by B.Anderson, 27-OCT-1989. FEATURES from to/span description pept 56 535 17 kD antigen precursor sigp 56 115 17 kD antigen signal peptide matp 116 535 17 kD antigen mRNA 37 > 535 17 kD mRNA signal 1 6 -35 region signal 21 27 -10 region BASE COUNT 182 a 93 c 122 g 138 t ORIGIN 1 tttacaaaat tctaaaaacc atatacttat taaataatag attaaagagg attatatgaa 61 actattatct aaagtaatga ttttagctct tgcagcttct atgttacaag catgtaatgg 121 tccgggtggt atgaacaaac aagggactgg aacacttctt ggtggcgccg gcggtgcatt 181 acttggttct caattcggtc acggtaaagg acaacttgtc ggagtaggtg taggcgcatt 241 acttggggca gttcttggtg gacaaatcgg tgcaagtctg gatgagcagg atagaaaact 301 tctagaacta acatcacaaa gagctttaga atctgctcct agcggtagta acatagaatg 361 gcgcaatcca gataatggca atcatggtta cgtaacacct aataaaactt ataggaacag 421 tacaggtcaa tattgccgtg aatacactca aacagttgta atagggggaa aacaacaaac 481 aacatatgga aatgcatgcc gccaacctga cggacaatgg caagttgtga attga // LOCUS RIRANT17KC 532 bp ds-DNA BCT 23-FEB-1990 DEFINITION R.typhi 17K genus-common antigen gene, complete cds. ACCESSION M28482 KEYWORDS antigen. SOURCE R.typhi DNA. ORGANISM Rickettsia typhi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae. REFERENCE 1 (bases 1 to 532) AUTHORS Anderson,B.E. and Tzianabos,T. TITLE Comparative sequence analysis of a genus-common rickettsial antigen gene JOURNAL J. Bacteriol. 171, 5199-5201 (1989) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by B.Anderson, 27-OCT-1989. FEATURES from to/span description pept 56 532 17 kD antigen precursor sigp 56 115 17 kD antigen signal peptide matp 116 529 17 kD antigen mRNA 37 > 532 17 kD mRNA signal 1 6 -35 region signal 21 27 -10 region BASE COUNT 183 a 88 c 121 g 140 t ORIGIN 1 tttacaaaat tctaaaaacc atatacttat ttaaataata attagagagg attatatgaa 61 actattatct aaaataatga ttatagctct tgcagcttct atgttacaag cttgtaatgg 121 tcagagtggt atgaacaaac aaggtaccgg tacacttctt ggtggcgcag gaggtgcatt 181 acttggttct caattcggtc aaggtaaagg acaacttgtc ggagtaggtg taggcgcatt 241 acttggggca gttcttggtg gacaaatcgg cgcaagtatg gatgagcagg atagaagact 301 tctagaacta acatcacaaa gagctttaga atctgcacct agcggtagta acatagaatg 361 gcggaatcca gataacggca atcatggtta cgtcacacct aataaaactt ataggaacag 421 tgcaggtcaa tattgccgtg aatacactca aacagttata ataggcggaa aacaaaaaac 481 atatggtaat gcatgccgtc aacctgatgg gcaatggcaa gttgtgaatt ga // LOCUS RIRANT17KD 539 bp ds-DNA BCT 23-FEB-1990 DEFINITION R.rickettsia 17K genus-common antigen gene, complete cds. ACCESSION M28479 KEYWORDS antigen. SOURCE R.rickettsia DNA. ORGANISM Rickettsia rickettsia Unclassified. REFERENCE 1 (bases 1 to 539) AUTHORS Anderson,B.E. and Tzianabos,T. TITLE Comparative sequence analysis of a genus-common rickettsial antigen gene JOURNAL J. Bacteriol. 171, 5199-5201 (1989) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by B.Anderson, 27-OCT-1989. FEATURES from to/span description pept 60 539 17 kD antigen precursor sigp 60 119 17 kD antigen signal peptide matp 120 536 17 kD antigen mRNA 37 > 539 17 kD mRNA signal 1 6 -35 region signal 21 27 -10 region BASE COUNT 179 a 92 c 125 g 143 t ORIGIN 1 tttacaaaat tctaaaaacc atatacttat taattatata ttaatttaga gagaattata 61 tgaaactatt atctaaaatt atgattatag ctcttgcaac ttctatgtta caagcctgta 121 acggtccggg cggtatgaat aaacaaggta caggaacact tcttggcggt gctggcggcg 181 cattacttgg ttctcaattc ggtaagggca aaggacagct tgttggagta ggtgtaggtg 241 cattacttgg agcagttctt ggtggacaaa tcggtgcagg tatggatgaa caggatagaa 301 gacttgcaga gcttacctca cagagagctt tagaaacagc tcctagtggt agtaacgtag 361 aatggcgtaa tccggataac ggcaattacg gttacgtaac acctaataaa acttatagaa 421 atagcactgg tcaatattgc cgtgagtaca ctcaaacagt tgtaataggc ggaaaacaac 481 aaaaagcata cggtaatgca tgccgccaac ctgacggaca atggcaagtt gtgaattga // LOCUS ROTNS53 1611 bp ds-RNA VRL 02-MAR-1990 DEFINITION Simian rotavirus SA11 segment 5 nonstructural protein (NS53) gene, complete cds. ACCESSION X14914 KEYWORDS nonstructural protein. SOURCE Simian rotavirus, cDNA to viral RNAi, clone 35. ORGANISM Simian (SA11) rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1611) AUTHORS Mitchell,D.B. and Both,G.W. TITLE Conservation of a potential metal binding motif despite extensive sequence diversity in the rotavirus nonstructural protein NS53 JOURNAL Virology 174, 618-621 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.B.Mitchell, 03-APR-1989. FEATURES from to/span description pept 31 1518 nonstructural protein BASE COUNT 571 a 214 c 288 g 538 t ORIGIN 1 ggcttttttt tgaaaagtct tgtgttagcc atggctactt ttaaagatgc atgctttcat 61 tatcgtagat taactgcttt aaatcggaga ttatgcaaca ttggtgcaaa ttctatttgc 121 atgccagttc ctgatgagaa aattaagggg tggtgtttag aatgttgtca aatagctgat 181 ttaacccatt gttatggttg ctcattgccg catgtttgca aatggtgtgt tcagaacaga 241 agatgcttcc ttgacaatga acctcatttg cttaagctta gaactgtgaa acatccaatt 301 accaaagaca aattacagtg tatcatagac ttgtacaata taatatttcc aattaatgat 361 aaagtaatta gaaaatttga aagaatgata aagcaaagaa aatgtaggaa tcaatataaa 421 attgaatggt ataatcattt gctgctccca attacattaa atgctgctgc atttaagttt 481 gatgaaaata atctttatta tgtttttggg ttatatgaga aatcagtcag tgatatatat 541 gctccatata gaattgttaa ctttataaat gaatttgata aattattgct tgatgatatt 601 aactttacaa gaatgtccaa tctaccaata gagttgagaa ccattatgca agaatacttc 661 caattatcaa gactgccatc atcaaaacta aagcaaattt acttttcaga ttttactaaa 721 gaaactgtga tttttaatac ttatacaaaa acgccaggaa gatcaatata cagaaatgta 781 actgaattta attggagaga tgaattggag ctttattctg atttaaaaaa tgataagaat 841 aaattaattg ctgcaatgat gacgagtaag tatactcggt tctatgctca tgataataat 901 tttggaaggt tgaaaatgac aatatttgag ttgggacatc attgtcagcc taactacgtg 961 gcatctaatc acccaggcaa tgcttccgat atccagtact gtaaatggtg taatataaaa 1021 tattttctta gtaaaattga ttggcggatt cgtgatatgt ataatttatt gatggaattt 1081 attaaggatt gttataaaag taatgttaac gttggacatt gtagttctgt tgaaaacata 1141 tatcctttaa ttaaaagatt aatttggagt ttgtttacta atcacatgga tcaaacaatt 1201 gaagaagtgt ttaatcacat gtcgccagtg tcagttgaag gtacgaatgt catcatgttg 1261 attcttggat tgaatattag tttgtataat gaaattaagc gcactttgaa tgtagatagc 1321 ataccaatgg tacttaattt aaatgaattc agtagtatag ttaaatcaat tagcagtaaa 1381 tggtataatg ttgatgaatt ggataaattg ccaatgtcaa taaaatcaac ggaggaactg 1441 attgaaatga agaattctgg aactttaact gaagaatttg agctactgat ctccaactca 1501 gaagatgaca atgagtgaaa ttactaatgt cactatctaa ttatacagta tttagccatc 1561 acaagaccgt ccagactaga gtagcgccta gctggcaaaa tactgtgaac c // LOCUS SPINR 3284 bp ss-mRNA PLN 06-MAR-1990 DEFINITION Spinach NADH nitrate reductase mRNA, complete cds. ACCESSION M32600 KEYWORDS NADH nitrate reductase; nitrate reductase. SOURCE Spinach, cDNA to mRNA, clone pIMP4. ORGANISM Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae. REFERENCE 1 (bases 1 to 3284) AUTHORS Prosser,I.M. and Lazarus,C.M. TITLE Complete nucleotide sequence of spinach nitrate reductase cDNA JOURNAL Unpublished (1990) Long Ashton Res. Stn, Long Ashton, Bristol, UK STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.M.Prosser, 05-MAR-1990. FEATURES from to/span description pept 116 2896 NADH nitrate reductase BASE COUNT 894 a 642 c 849 g 899 t ORIGIN 1 aaaaaccata acacctctgg aaaaatattg aggaagaacc gtagttcttc tttccttttt 61 ctttttttct ttttaattac atttcaaaaa accctacatt tgatcgattt gagctatggc 121 ggcgtcagtc gatcgtcaat atcatccggc gccgatgagc ggtgtcgttc gtaccccatt 181 tagcaaccac caccgttccg actctccggt tcggaacggt tatactttta gtaacccgcc 241 ttcttcaaat ggcgttgtta aacccggtga aaaaatcaag ctcgtagata ataatagtaa 301 tagtaataat ggtagcaata ataataataa tcgctatgat agcgatagtg aagaagatga 361 tgacgaaaat gagatgaatg tctggaatga gatgatcaaa aaaggtaaca gcgaattgga 421 gccgtcttcc gtcgatagcc gtgatgaagg aacggctgat caatggattg agaggaatcc 481 ttcaatgatc cgtctcacag ggaagcaccc gttcaattcg gagcctccat tgacccgatt 541 gatgcaccac gggtttctca ccccggttcc gcttcattac gttcggaacc acggtccagt 601 ccccaacgct aagtgggagg attggaccgt tgaggtgacc gggttagtca aaagaccaat 661 ccggttcact atggaccaat tggtcaatga cttccaaagc cgtgagtttc cggtgacgct 721 tgtctgcgcc ggtaaccgga ggaaggaaca gaatatgacg aaacaatcga tagggtttaa 781 ctggggttcc gccgccgtgt caacttccgt gtggcgcggt gttccacttc gcgacgtgct 841 taagcggtgt ggggtcatga gttcgttaaa aggggctctc aacgtttgtt ttgagggggc 901 tgaggattta ccaggaggtg gcgggtcgaa atatggtaca agtgttaaaa gggagtttgc 961 aatggatccg gcccgtgaca ttattctagc gtacatgcaa aatggggaga agttgagccc 1021 ggatcatggg tatccggttc ggatgatcat acccgggttt attggtgggc ggatggttaa 1081 gtggttgaag aggattattg tgactacaac agagtctgat aattattacc attacaagga 1141 taatagggtc cttccctctc atgttgatgc tgaacttgcc aattctgaag cttggtggta 1201 caaacaagag tacatcataa atgagctgaa cgttaactcg gtgataacat cgccgtgcca 1261 cgaagaaatc ttgcctatca acgcctggac tacccaacgt ccttacacaa tgaggggtta 1321 cgcttattct gggggaggaa gaaaagtgac ccgagtggaa gtgacaatgg atggtggaga 1381 cacatgggac atatgtgaat tggaccacca agagagggga agcaaatatg gtaaattctg 1441 gtgttggtgt ttctggtctt tagaggtcga agtactagac ttactcggag ccaaagagat 1501 tggggtccgt gcttgggatg aaagcctcaa cacccaacct gaaaagctca tttggaatgt 1561 catgggtatg atgaacaact gctggtttcg tgttaagacc aacgtgtgca agcctcacaa 1621 gggagagatc ggtatcgtgt tcgaacaccc gacccaaccc ggtaacaagt cgggcgggtg 1681 gatggcccgt gaacgccacc tcgagatctc cgattctggc ccaacattga agaggacagc 1741 atccacccca ttcatgaaca ccacctcaaa gatgtactca atgtcagagg tgaagaaaca 1801 caacactgct gactcagctt ggattgtagt ccatggaaat gtctacaacg cgacccggtt 1861 cctaaaggat cacccgggtg ggtcagacag tatcctcatc aatgcaggaa ccgactgcac 1921 cgaagagttc gatgcaattc actcagacaa ggccaagagg cttctagaag atttcaggat 1981 aggtgagttg atctcaacgg gctacacctc tgactcgtcc tcgcccggta actcagtcca 2041 tggcgggtcg gtttactccg ggttagccgg gttagcccca attacagaag cggttccatt 2101 acgtaatgtt gctttaaacc cacgggttaa gatcccatgc aaacttattg agaaagtttc 2161 actttcccat gacgtaagaa ggttccgatt cgggttaccc tctgaagacc aggtgttggg 2221 tttacctgta ggaaaacaca tcttcctttg tgctaatgtt gatgataagc tttgtatgag 2281 agcttatact ccatcaagta ccattgatgt ggttgggtac tttgatcttg tggttaaggt 2341 ttacttcaag gatgttcacc ctaggttccc taacggtggg gtgatgtcac aacatctaga 2401 ctcactttct ttgggctcga ttgttgacgt gaaaggccca ttgggccata ttgagtattt 2461 aggaaagggg aatttcacag tccatggaaa gcccaagttt gctaaaaagc ttgctatgat 2521 atcaggtggg accgggatta ccccgattta tcaggtgatg caggcaatac tgaaagatcc 2581 tgaggataag actgagatgc atgttgtgta tgcaaatcgg acggaggagg atatcttgct 2641 tagagaagag cttgataaat gggctgatga gtttagagat cgggttaaag tatggtacgt 2701 tgttgaaaag gcggaggaag ggtggaaata tgataccggg tttatatctg agaagatact 2761 tagagatcac gtgccggcgg ttggggatga tgtgttggca ctcacgtgcg gtcctcctcc 2821 tatgattcaa tttgcggttc aacctaactt ggataagatg ggttttgata ttaaggaaca 2881 gttgttgatt ttttgaaagt actgttacgt tgggaagtga atttactaca ctacccctga 2941 agatatttgt gttacgtgca cattgcaata tcgacgtgtg taacacatgc gttgtgcttt 3001 cgaatgtata gtagaagttg ggtaaaatgg agtatatgaa cactactata tgtattctat 3061 gtatgtaatg taatttgtgg agctttttct tctttcttta cccggtgaaa ttatgtgaat 3121 ataagtgctc cactagattg tttgttgata agctttttgt tgtccttatg aaaattggtt 3181 gatggaggaa gaagtatgta gaaggaagaa attcaagtag ctaaggttta tgtattgtaa 3241 gagggagatt actatttggt agtctccatt taccatatta ttat // LOCUS SRGMDH 4638 bp ds-DNA PLN 06-MAR-1990 DEFINITION S.vulgare NADP malate dehydrogenase gene, compete cds. ACCESSION M31965 KEYWORDS NADP malate dehydrogenase. SOURCE S.vulgare (haplotype 2n) 5 day old leaf mesophyll cell DNA. ORGANISM Sorghum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 4638) AUTHORS Luchetta,P., Cretin,C. and Gadal,P. TITLE Structure and characterization of the sorghum vulgare gene encoding NADP-malate dehydrogenase JOURNAL Gene (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Luchetta, 02-MAR-1990. FEATURES from to/span description mRNA 763 783 NADP-malate dehydrogenase, exon 1 873 963 NADP-malate dehydrogenase, exon 2 1461 1547 NADP-malate dehydrogenase, exon 3 1643 1727 NADP-malate dehydrogenase, exon 4 1984 2165 NADP-malate dehydrogenase, exon 5 2420 2490 NADP-malate dehydrogenase, exon 6 2570 2660 NADP-malate dehydrogenase, exon 7 2749 2823 NADP-malate dehydrogenase, exon 8 2912 3013 NADP-malate dehydrogenase, exon 9 3100 3222 NADP-malate dehydrogenase, exon 10 3303 3371 NADP-malate dehydrogenase, exon 11 3456 3524 NADP-malate dehydrogenase, exon 12 3619 3672 NADP-malate dehydrogenase, exon 13 3765 4638 NADP-malate dehydrogenase, exon 14 pre-msg 603 4638 NADP-malate dehydrogenase mRNA and introns IVS 784 872 NADP-malate dehydrogenase intron A IVS 964 1460 NADP-malate dehydrogenase intron A IVS 1548 1642 NADP-malate dehydrogenase intron A IVS 1728 1983 NADP-malate dehydrogenase intron A IVS 2166 2419 NADP-malate dehydrogenase intron A IVS 2491 2569 NADP-malate dehydrogenase intron A IVS 2661 2748 NADP-malate dehydrogenase intron A IVS 2824 2911 NADP-malate dehydrogenase intron A IVS 3014 3099 NADP-malate dehydrogenase intron A IVS 3223 3302 NADP-malate dehydrogenase intron A IVS 3372 3455 NADP-malate dehydrogenase intron A IVS 3525 3618 NADP-malate dehydrogenase intron A IVS 3673 3764 NADP-malate dehydrogenase intron A BASE COUNT 1255 a 1044 c 993 g 1346 t ORIGIN 1 aggcctagtc aaaaagtaaa tgtgattatt tacaaacatt gtctctacat tttgcatggc 61 ttcaaaacca aagatcgcac acacttttgc caaaattttg ccaccgttgc taaccaatcg 121 atgtgtaacg atgagaagtg ccatcacact ctctgaaaag actcaggccc tggcactggc 181 aggaccgcag gaaaggcact gagcaaagat gatggtcgtg ggagaataaa caaccttttt 241 ttttttttgc ctttgtaata aatgtccccg ccgtctctca cgtagtacgt agtttcggtg 301 tggctttgtt ttaaaaggga accaaatcta ccaatgagtc tttttaaaaa gaagaaaaat 361 actggtcagc aaaactgaga ttctccagca acgaaaatga aaaagacttg gttgtcagtg 421 ttaaccacca ccaggatagc aggcgtgtgg acgtacaaga tgccgaacag tcgctgtcgc 481 gcgacgtgtc ccgcgccggg cggtccgctc gccttgcctg ccgctaatcc aattcctacg 541 cgcgctcctg ctcctgcgta caaccagaga ggctataatc tcctccactc cctcccctgt 601 agctgcagcc cgttccactc ctatccatcc gtgctccagc tccaccatgg gcctctcaac 661 ggcttactcc ccggtcggat cgcacctcgc gccggcgcct ctcgggcacc gtcgctccgc 721 ccagctccac cgcccgcgcc gggccctgct cgccaccgtc cgatgctccg tcgacgccgc 781 caagtgagta gtagtatact acccggacgt ggacatgaat gcgtgccatt tatcaactac 841 taggctgact agccccctat acgttgatgc aggcaggtgc aggatggcgt ggcgacggcg 901 gaggcgccgg cgacgcggaa ggactgcttc ggggtcttct gcaccaccta cgacctcaag 961 gcggtgagtc cctcccttcg cctcaggcgg tggtggtcct ccccggttcc cggtccggcc 1021 tccccgtgtg tgttcgctga aatgctccat cgacttttgc cttttgcttc aaactcaaag 1081 tagaatccct tttccaccta aagctagcaa cgttgctttt ctccaaaaga gaaaaaaaaa 1141 gaaaaagcta gcacgtcgtt ggtgtcatgg tgtggaccgt tggaggtgtg gtgtggatat 1201 atgctacgat actggatagc ttgcactcat atcaacagct ggcgttatgc ctctataaat 1261 aaaaataaat atcatacaga ccaccaaaac caaagcgcag ggtggagtgc cacgcatgag 1321 acgctgcact gcacgccact gctgcttgac ttaatttggt ttctaattaa cttgctgctg 1381 tatatatcgt catatagtag tacagtacta acaatctgtc atcctatgcc attgaatgga 1441 ttgggagttg ggacacacag gaagacaaga ccaagtcgtg gaagaagcta gtgaacattg 1501 ctgtgtcagg cgcggctggg atgatatcga accacctgct gttcaaagta actaaatcgt 1561 ttactgactc tacttgtttc ctatcttgag ttgcagtatt gtaatgctga ccattcattt 1621 tattaacaca aaaaaaattc agcttgcctc tggtgaggtt ttcggacaag accaaccaat 1681 agcactgaag ttacttggtt cagagagatc gtttcaagct ctcgaaggtg aatgactgac 1741 tgcctcttct tctttctgcc ttgctatctg agaagaagaa tcaattcata ttttccactc 1801 actctgttcc aaaagctaaa atgccttata tataatttgg aacaggggga gtacatacgt 1861 atagctttag tttcaagctg cccttacagc ttccagtgtt gtactctctt atatcaatgg 1921 attactgtga cagtgaatat ttatcacttc ttcctcatgt gctctcaaaa actgcaactg 1981 taggtgttgc tatggaactg gaggactcgc tatatccatt gctgagggaa gtcagcattg 2041 gtatagatcc ttatgaggtc tttgaagatg tagattgggc ccttcttatt ggtgctaagc 2101 cccgaggtcc tggcatggag cgagctgcgt tactggatat caatggtcaa atctttgctg 2161 atcaggtttc ttttgcactc tgttcctcct ttcttctcaa gtactatgtc accaaattgt 2221 aagaagtcct ctcaggaaag gtgccaagaa gcattttctg aaacttcaac ctcaaatttt 2281 aaggattcct aagcattaga acctgcaaat tgccaaaacc agaaaccaag tacaaatttg 2341 agtcaaagcc cacctaaata aacgaccaga gtaacctgtt attcttgtta ttatctgaat 2401 tttttttaaa ttgtgtcagg ggaaagcact taatgccgtg gcctcgaaga acgtgaaagt 2461 cttagttgtt ggaaatccct gtaacactaa gtacggacat tgattgttca ccaagattaa 2521 cgattttatg tcatcaccat gtcttagttg tggcttgccc ttgatgcagt gcgttaattt 2581 gcttgaaaaa tgctccagac ataccagcaa aaaattttca tgcactgacg aggttggatg 2641 aaaatagagc aaagtgccag gtgactgacg ttgaacctta ttatgtctcg tccttccatt 2701 ccaaacagga aaacagatta agaaatcaca cacaatctcc ttttgtagct agcactcaaa 2761 gcaggtgtat tttatgacaa agtatcaaac gtgactattt gggggaacca ttcgacaact 2821 caggtgaagt tttgcattat aaattaaatg agcagactat tttgcagaca catttcttcg 2881 ctgaatggtt ggcactaact tctctgtgta ggttcctgat ttcttgaatg ccaaaattga 2941 tgggagacca gtgaaagaag tcattaagga taccaagtgg ttagaagaag agttcaccat 3001 tacggttcaa aaggtatgac aatccatttg gagtgtctgt tactcggata aataaacaag 3061 gttgtgacca taattcgttt cattcttata tctaaacagc gtggaggtgc gctcatccaa 3121 aaatggggca gatcttcagc tgcatcaacc gctgtatcaa tagcggatgc tattaaatcc 3181 ctggtaactc ctaccccaga aggcgactgg ttctccacag gggtaagttt tgatgccttt 3241 gaattgatca cagctttatt tcggcattcc tctaatggaa aaaagtaata aaaaaaacac 3301 aggtttatac gactggaaat ccttatggca tagcagagga tatcgtgttc agcatgccat 3361 gcagatcgaa ggtatgatct actaatcaaa ttgtccttat tatttttgtc ctaagataag 3421 gtaccactcc aatgatcgta aaattcttcc aacagggtga tggtgattac gaactagcta 3481 ctgatgtgtc aatggacgat tttctctggg aacggattaa aaaggtaagg gctataggaa 3541 ctgccatcac tgatatgctg agaatataag cagcactaga gattttaccc taacagttgg 3601 ccctgtacat attttcagag tgaagctgaa ttgcttgctg agaagaaatg cgttgcccat 3661 cttactggag aggtatgtca cagtgacgta ttttatttag cttaaaaaga gagaaaccgt 3721 gccggaaagg attggcttag ctttttctgt ttgtgcaaat gcaggggaat gcatattgtg 3781 atgttccgga ggataccatg ctaccgggag aagtgtagtt tagttacaga gaagcaatag 3841 caggggaatg catttgtcta tactatagct gagactcgag agaaacagct gcttgcaaga 3901 tgcagcaact acttttactt atatatatat attgtttata ttctgtaccc gctcagagcg 3961 ttgtgtgcac acgctgtgtg attccgggac cgtctttacg atcgttttgc cttttttggg 4021 ccataccgct tcgctcccgt gttgcgcttc cattattgag ttattaccac tctgttttat 4081 aatatttatt actgccgaga ttttgtgtag atattacaat cttgtgtttg gttgtgtctc 4141 tgctcatacg caagtgctgc tgcaactctt gtttttattc acatttagtt acaactgtgt 4201 tattttgtag taacgacgtg ttatatgtgc tgttacaatc gtattgtaac tctattcttt 4261 tcctcatttc tccttttact gtacttattt aacagaatta gtcgattttc agtcacagaa 4321 taatattttc tctcacaaca aatcaacatc aatcagtgtc gtcgaatctc tacctcgtgg 4381 agccatggac cgttgtctcg tgggcaatcc caagtgagcc ctatatgagg cagttctttc 4441 gctaatcatt tttaaaacat ataccacaag acccatacac cgattatgtt tgaaactttc 4501 aatgctagca acttcatcga catacagggc tcacacgtta gtaaaaaaat gtgaattaag 4561 cactgtctct gcattttgca tggcttcaaa caaaagatct cgcactttga ctacatttcg 4621 ccactgctgc taaggctt // LOCUS TRBHSRP 2378 bp ds-DNA INV 15-FEB-1990 DEFINITION T.brucei heat shock-related protein gene, complete cds. ACCESSION M32139 KEYWORDS heat shock-related protein. SOURCE T.brucei (strain 427-60) DNA, clone H8. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 2378) AUTHORS Lee,M.G.-S., Polvere,R.I. and Van der Ploeg,L.H.T. TITLE Evidence for segmental gene conversion between a cognate hsp 70 gene and the temperature sensitively transcribed hsp 70 genes of Trypanosoma brucei JOURNAL Mol. Biochem. Parasitol. (1900) In press STANDARD full staff_entry COMMENT The conserved domains are 100 % homologous to the corresponding regions of Hsp70 genes 2-6 of T.brucei. Draft entry and computer-readable sequence [1] kindly submitted by L.T.H.Van der Ploeg, 16-FEB-1990. FEATURES from to/span description pept 331 2361 heat shock-related protein site 301 463 conserved domain site 532 613 conserved domain site 732 920 conserved domain site 985 1028 conserved domain BASE COUNT 575 a 535 c 717 g 551 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttcctt tattttcttt ttccttttga gctaactcta cctccgtagc cgccagcgtt 61 ctcaatgcag ttttgcgtgt aagtcacttg cctcttggcc aagtagactc ggtatgcaac 121 accaggagcg ccgcacgcat aacaatttgc gtttacgcgt ctacgttagt gtgcagatgt 181 tatgatactt ttgttttaac tttgctccgc gtccttggcg ttggtattgt tgcaatcttt 241 catcggccag tgcctgtata tttcaatatc ctcttgcttt tcatccttgc tactgcctct 301 gaaaacaagg aatatctgcc tctttgaagg atgacatacg aaggcgccat tggtatcgac 361 ctcggtacga cgtactcgtg cgttggtgtg tggcagaacg aacgtgtgga aatcattgca 421 aatgaccagg gtaaccgtac gacgccgtcc tacgttgctt tcgtcaataa tgaggttctt 481 gttggcgatg cggcgaagag ccacgccgct cgtggctcga atggtgttat attcgacgcg 541 aaacggctga tcggacggaa gttctccgac tctgtcgtgc agtccgacat gaagcactgg 601 cccttcaagg tcgaggaggg cgagaaaggt ggtgctgtga tgagggtaga gcatcttgga 661 gaggggatgt tactgcagcc cgagcaaata tctgcccgtg tccttgcgta cctgaagtct 721 tgtgcggagt cgtatcttgg gaagcaggtg gctaaggctg tcgtaacagt gcctgcatac 781 ttcaacgact cgcagcgcca ggcaacgaag gacgcgggga cgattgctgg tctggaggtg 841 ctgcgcatca tcaacgagcc aacggctgct gcaattgcct atggcctgga caaggctgat 901 gaggggaagg aacgcaacgt tttggtgttt gactttggag gtggaacgtt tgacgtctca 961 attatttcag taagcggcgg tgttttcgag gtgaaagcca caaatggtga cacccacctt 1021 ggtggagagg atgtggacgc cgcactgctg gagcacgcac ttgctgatat aaggaaccgt 1081 tatggaatag aacagggctc tctgtcccaa aagatgctat ccaagctgcg ttcacggtgt 1141 gaggaagtga agcgagtact gtcacactcg acagtggggg aaattgcact cgacgggctg 1201 cttcccgacg gcgaagaata cgtgctcaag ctaacgcgag cgaggttgga agagctatgc 1261 acgaaaattt ttgctcgctg tttgagtgtg gtacagaggg cgttgaagga cgcttcaatg 1321 aaggttgagg atattgagga tgttgtgctt gtcgggggaa gttctcggat tcctgctgtg 1381 caggctcagt taagggagct attcagaggt aagcaactct gcagcagtgt gcaccctgac 1441 gaggcagtgg cgtatggggc tgcatggcag gcgcacgtac tttccggtgg atatggggag 1501 agctctagga cagcaggtat agtgcttctg gatgttgtgc ccctttcaat cggcgttgaa 1561 gtggacgacg gtaagtttga tgtgataatt cgtcggaaca caaccattcc ataccttgcg 1621 accaaggagt acagcactgt ggatgacaat cagagtgaag tggagataca ggttttcgaa 1681 ggtgagcgcc ccctcacgcg tcacaaccac aggctaggct cctttgttct tgacggtatt 1741 accccggcaa agcacggtga accgactatc accgttactt ttagtgttga cgccgacggg 1801 atcctgaccg ttacagccgc agaagaactc gggagcgtaa caaaaacact tgttgtggag 1861 aactcagagc gacttaccag tgaggaagta cagaagatga ttgaagttgc acaaaagttt 1921 gccttaacgg acgccaccgc tctggcgagg atggaggcca ccgagaggct cacgcagtgg 1981 tttgaccgac tcgaagctgt aatggagacc gtcccgcaac catactcaga gaagcttcaa 2041 aagcgtattg cctttcttcc gcacgggaaa gagtgggtcg gcacacagct tcatacgtac 2101 actgacgcgg cgtcgataga ggcgaaagta gcgaagattg aacggcttgc aaagagggcc 2161 ctgaaatcgg cgcgtcgtga aggaaaggat ggatgggcac cgggcaatga ggacaatgga 2221 agcggtgatg acaacgacgg tgacgataac agtgatgagg aggacgagct ccaaaggggg 2281 aggggcgtga cagagggatc ggggaggtcc ccgataagga aacgcgaccg catagaggct 2341 atcaacgcaa acacagagta agtcgtagag gtgtctaa // LOCUS YSCALG1 2011 bp ds-DNA PLN 06-MAR-1990 DEFINITION S.cerevisiae beta-mannosyltransferase (alg1) gene, complete cds. ACCESSION J05416 KEYWORDS beta-mannosyltransferase. SOURCE S.cerevisiae DNA. REFERENCE 1 (bases 1 to 2011) AUTHORS Albright,C.F. and Robbins,P.W. TITLE The sequence and transcript heterogeneity of the yeast gene ALG1, an essential mannosyltransferase involved in N-glycosylation JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.F.Albright 06-MAR-1990. FEATURES from to/span description pept 115 339 upstream ORF pept 379 1728 beta-mannosyltransferase (alg1) mRNA 102 1826 alg1 mRNA (alt., both ends approx.) mRNA 203 1826 alg1 mRNA (alt., both ends approx.) BASE COUNT 639 a 341 c 390 g 641 t ORIGIN Chromosome II; 2.1 map units from Lys2. 1 gggcaataaa aaattctaag aatagaatca tatcacagta cacgaaaata taaatacatc 61 ttgaaaatgt ctcaaattgg gtggaaaaga gggtgcttga attcattagg acatatggtt 121 gatgttctca ggttctactt atgtttgctt tgtcgatttt tgcatgcttt gactgtgact 181 ttcctgtctg atatctttgt atggcttgtt gctaaaacaa ggtcaatcca agcagtgatc 241 atcttgcacg ttgcttcaat agaacgcgca tatagcaatc atcaagttaa ttggtcatat 301 atctttcaat cagcaattag caaagccatt agagggtaat tgtaacggca ttttaggggc 361 tatagtcaat tggtgaaaat gtttttggaa attcctcggt ggttacttgc cttaataata 421 ttataccttt ccataccgtt agtggtttat tatgttatac cctacttgtt ttatggcaac 481 aagtcgacca aaaaaaggat catcatattt gtgctgggtg atgtaggaca ctctccaagg 541 atatgctatc acgctataag tttcagtaag ttaggttggc aagtcgagct atgcggttat 601 gtggaggaca ctctacccaa aattatttcc agtgatccaa atatcaccgt ccatcatatg 661 tcaaacttga aaagaaaggg aggcggaaca tcagttatat ttatggtaaa gaaggtgctt 721 tttcaagttt taagtatttt caaattactt tgggaattga gaggaagcga ttacatacta 781 gttcaaaatc caccgagcat acccattctt ccgattgctg tgctatacaa gttgaccggt 841 tgtaaactaa ttattgattg gcacaatcta gcatattcga tattgcaact aaaatttaaa 901 ggaaactttt accatccttt agtgttgata tcttacatgg tagagatgat attcagcaaa 961 tttgctgatt ataacttgac tgttactgaa gcaatgagga aatatttaat tcaaagcttt 1021 cacttgaatc caaagagatg tgctgttctc tacgaccgcc cggcttccca atttcaacct 1081 ttggcaggtg acatttctcg tcaaaaagcc ctaactacca aagcctttat aaagaattat 1141 attcgcgatg attttgatac agaaaaaggc gataaaatta ttgtgacttc aacatcattc 1201 acccctgatg aagatattgg tattttatta ggtgccctaa agatttacga aaactcttat 1261 gtcaaatttg attcaagttt gcctaagatc ttgtgtttta taacgggtaa aggaccacta 1321 aaggagaaat atatgaagca agtagaagaa tatgactgga agcgctgtca aatcgaattt 1381 gtgtggttgt cagcagagga ttacccaaag ttattacaat tatgcgatta cggagtttcc 1441 ctgcatactt caagttcagg gttggacctg ccaatgaaaa ttttagatat gtttggctca 1501 ggtcttcctg ttattgcaat gaactatcca gtgcttgacg aattagtaca acacaatgta 1561 aatgggttaa aatttgttga tagaagggag cttcatgaat ctctgatttt tgctatgaaa 1621 gatgctgatt tataccaaaa attgaagaaa aatgtaacgc aggaagctga gaacagatgg 1681 caatcaaatt gggaacgaac aatgagagat ttgaagctaa ttcattgagt caatggtaac 1741 tcagcctttc ttttttgaaa attactattt tcgactcttt tttatacagt tacatagtac 1801 tacctctaat acacattcat gattaacaat gtttcaaaca atataaagtc ccgataacga 1861 ccttttgaag tggtgacgtt accgctcttc gttgacaaga ttcaagaggg ctgtcagaat 1921 aacagctatc atggtggaaa gtaggtgacc ctttgaacaa ggcagcatat atccatcgac 1981 tatcatgttc ctaaaatgtt atcatagccg g // LOCUS YSCHEM15 2916 bp ds-DNA PLN 07-MAR-1990 DEFINITION S.cerevisiae ferrochelatase (HEM15) gene, complete cds, and Val-tRNA gene. ACCESSION J05395 KEYWORDS delta repetitive element; ferrochelatase; protoheme ferro-lyase; transfer RNA-Val. SOURCE S.cerevisiae (strain FL200) DNA, clones pHEM15-[1,2]. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2916) AUTHORS Labbe-Bois,R. TITLE The ferrochelatase from Saccharomyces cerevisiae: Sequence, disruption and expression of its structural gene HEM15 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Lobbe-Bois, 06-MAR-1990. FEATURES from to/span description pept 841 2022 ferrochelatase precursor (EC 4.99.1.1) sigp 841 933 ferrochelatase signal peptide matp 934 2019 ferrochelatase tRNA 2253 2325 Val-tRNA rpt 2435 2765 variant solo delta element rpt 2776 2888 truncated delta-element BASE COUNT 907 a 563 c 527 g 918 t 1 others ORIGIN 1 aaaatagcat tcagtgggaa cgatcccaat aagcagatag catatcgcag tgtaaaacta 61 tcaatcccat agttggtaat tatctttgtt aaaacagcgt ccacaggatt gtacatgttg 121 gaagcgatca gcagcagaat taaatcacca cggtttgtct tggcgtatng gtttgtcttc 181 ttgcttgacc tgtgcctgta ttttgtcctt ggctttaaat ctgtttaata agaacccttt 241 atttagatga ccattgtttt cgacggtttg tggtttcgac gcgcaatgtg aaagatcttc 301 aatgttttgt tgaacagtaa agcaaaacag ctcaacagca ggacaaaaaa agctaaaaaa 361 gaactcttcg cacttttaac aagcagtttt actgaacaat tttagaagta aatttcagga 421 aggttcaaca gctttacatg aaagaggatg ctggccattt ttttttttta aaaaagtttt 481 ttatatacac aaactcagta aaataattac aaatatgtag catgtgtagg atgccttgaa 541 acatctccgc tggcaatagc gttagacatg cccaatcttc aatggtcacc gtaagcctta 601 tatgacgaag gagttgctgc ccttcgtttt ctcatgctgc tttttttcgt aggggatatg 661 ggattgaaga aggaggacgc gaaggaatac ttttggtcgg tcaaggtcac tcttggcttg 721 acaccaaaca agcgtaaatt ctcgtgttcc acttgagtcg gtttacaaga agtacagaaa 781 tatttaacac atacctgcta tttggacaaa tttagcagta aaaaaaagaa cgtttaaaga 841 atgctttcca gaacaatccg tacacaaggt tccttcctaa gaagatcaca actgaccatt 901 acaagatcat tttcggttac attcaacatg cagaatgcac aaaagagatc acccacagga 961 attgttttga tgaacatggg tggcccctct aaagttgagg aaacatatga ttttttgtat 1021 caattatttg ccgataatga cctaattccc attagtgcta agtatcagaa gacaattgct 1081 aaatatattg ctaagtttcg tacccccaag atagagaagc aatataggga aattggtggg 1141 ggctccccaa tccggaaatg gtctgagtat caagccactg aggtctgtaa aatcttagat 1201 aaaacctgtc cagaaacggc gcctcataag ccttacgtgg cgtttcgtta cgcaaagccg 1261 ctaaccgcag aaacttataa acaaatgcta aaagatggcg tgaagaaggc agtggccttt 1321 tctcaatatc ctcatttctc ttattccact accgggtcat ccatcaacga attgtggaga 1381 cagattaagg cattggactc cgagagatct atatcttggt cggttattga tcgttggcct 1441 acaaatgaag gtctaatcaa ggccttctcc gaaaatatca ccaaaaaact acaagagttt 1501 ccgcaacctg tcagagacaa ggttgtttta ttgttttccg cacattctct acccatggat 1561 gttgttaaca ccggtgatgc ctacccagct gaggtagctg cgacggttta caacatcatg 1621 caaaaattaa agtttaaaaa cccttatagg ttggtttggc aatcccaagt tggaccaaaa 1681 ccatggttgg gagcgcagac agctgaaatt gcggaatttt taggccccaa agttgatggc 1741 ctaatgttta ttcctatcgc ctttacctct gatcatattg aaacattgca tgaaattgac 1801 ttaggcgtca ttggggaatc ggaatataag gataaattta agagatgcga atctttaaat 1861 ggcaaccaga cctttattga aggcatggca gatctcgtca aaagccactt acagagtaac 1921 caactctatt ctaatcaact acctcttgat tttgcacttg gcaagtccaa tgatcctgta 1981 aaggaccttt cattggtatt tggcaatcac gaatctactt gataagggcc attcatccca 2041 caatctcaat atcagtatat tctcttattt attaattgag gttgcgtatt tatgttttct 2101 ttgtggtata taatatatca ttatttattg cattttaaaa cttttttttt cattattttt 2161 ctctgttgaa tgcagtcaac atctccctta atattcgata ctcatcttta atactatcag 2221 caaaaagttt aagataaaat attcttaaaa acgatttcgc ccaggatcga actggggacg 2281 ttctgcgtgt taagcagatg ccataaccga ctagaccacg aaaccactat ttgttgaaat 2341 ctttaacaag cgaggaaact ttagtgaaat attaaaagga ctaaaccgga ccggttcttc 2401 tatttctaca tatctaatca cccaatgtag tttttgaata tttgtgaata gttggataat 2461 tgttgggatc ccattgtgca caaaggccat aatattatgt ctacagaata tactagatgt 2521 cctccctata ggatatagta atcctctaaa tggaaccgat atttctacat aataatatta 2581 cgattattcc tccttccgtt ttatatgttt cattatccta gcacactatc aatctttgca 2641 tttcagcttc cattagattt gatgactatt tctcaatctt tatgttatct ccttacgccg 2701 catgtgataa tatactgcta gtatgactac tagttgatag aagatagttg atttttactc 2761 caacaaaagt aacaatatta tttagaacta tagattccat tttgtgcatt cccatattct 2821 cgaggaaaac ttttagtata ttctgtagac ataatattat cgcctttgtg aacaatagaa 2881 tcccaacaat tgtcgcaaat ttaccaattt tctaga // LOCUS YSCOSCP 1451 bp ds-DNA PLN 05-MAR-1990 DEFINITION S.cerevisiae oligomycin sensitivity conferring protein (OSCP) gene, complete cds. ACCESSION M32487 KEYWORDS oligomycin sensitivity conferring protein. SOURCE S.cerevisiae (strain D-273-10B, ATCC 25657) DNA, clone 62-17. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1451) AUTHORS Misook,U., Jones,D. and Mueller,D.M. JOURNAL Unpublished (1990)Univ. of Health Sci., North Chicago, IL 60064 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by U.Misook, 02-MAR-1990. FEATURES from to/span description pept 498 1136 oligomycin sensitivity conferring protein (OSCP) mRNA 429 > 1136 OSCP mRNA (alt.) mRNA 430 > 1136 OSCP mRNA (alt.) BASE COUNT 484 a 245 c 273 g 449 t ORIGIN 1 agatctggct gcaaacctgg tctacataaa tgtagtacat tataagggac taccaagaaa 61 tacgaagcgc tcagatatct cttaaagttc atttttttgc cggaataatc ttggtttagt 121 aacttataaa actacttacg tccaataaat cgttatattt tatgccgtcg tcataaagtg 181 gacactttcg agagacggtg ctggccaata agatgctgat atttgaaata tcatttaccc 241 gggcccgggt gggaaaaaca aggaattaaa gataatgaag tgatcttttg gcgagatagt 301 aaaactttca attatctagg aacaatttca aacgccaatt gttaactttg tgtacagtat 361 ttctagtttt tatctcattg aaaacaaaat tcagtcaact tggaaattta agcaggctac 421 tgtggcattg agatattttt ccgcagagta agaaccgatt agcaagttac cgcattattt 481 ttaaaccgtg gacaatcatg tttaatagag tctttaccag gtcatttgca tcaagcttaa 541 gagctgctgc ttccaaagct gctgctcccc caccggtgag attgttcggt gttgagggca 601 cttacgcaac agctttatat caggccgctg caaagaactc ctctattgat gctgcctttc 661 aatctttgca aaaagttgaa agcacagtca aaaagaatcc aaaattaggt cacttattac 721 tgaatcctgc attgtcattg aaggacagaa attctgtcat tgatgccatc gtggagactc 781 acaagaattt ggatggttat gtcgttaacc tattgaaggt tctttccgaa aataacagac 841 tgggatgttt tgaaaaaatt gcgtctgact ttggcgtttt aaacgatgct cacaacgggt 901 tacttaaagg tacagtgact agtgctgaac cattggatcc taaatctttc aagaggattg 961 aaaaagcttt gagcgcctcc aaattggtgg ggcaaggtaa atctttgaag ctggaaaacg 1021 tggttaagcc agagattaag ggtggtttga ttgtagaact tggtgacaag actgttgact 1081 taagcatttc tacaaagatt caaaaactga ataaggtctt agaggacagc atttaacgtt 1141 aataatatag ttggatatgc tcaactcgtt tgatgttact ttctttgcct ccctcctaat 1201 ttttgattct gttcaaatat atattctttt ccgtgaattt ttcctaaaca tcaaaattac 1261 gtacatatac gaataaataa gaaacccctc taaaaagtga atttttttgt tcgagacaac 1321 cattttaatc atatatatgt tcgtgtatcc aggcaaactt atattaatcg caaaaaaaaa 1381 aattataagt gtatatatgg aataaacata aaaaataata aaagaagaaa aaaaaaaata 1441 atgaagaatt c //
GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS CHKMIM1 5818 bp ds-DNA VRT 25-OCT-1989 DEFINITION Chicken myeloid protein-1 (mim-1) gene, complete cds. ACCESSION M29448 KEYWORDS myeloid protein-1. SOURCE Chicken (strain SPAFAS) hatchling myeloblast DNA, clone 35B-J612. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 445 to 704) AUTHORS Ness,S.A., Marknell,A. and Graf,T. TITLE The v-myb oncogene product binds to and activates the promyelocyte- specific mim-1 gene JOURNAL Cell 59, 1115-1125 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 444; 705 to 5818) AUTHORS Ness,S.A. JOURNAL Unpublished (1989) EMBL pf 10.2209, 6900 Heidelberg 1, Germany STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.A.Ness, 25-OCT-1989, for release after publication. FEATURES from to/span description pept 713 758 myeloid protein-1, exon 1 1313 1445 myeloid protein-1, exon 2 2411 2562 myeloid protein-1, exon 3 3069 3245 myeloid protein-1, exon 4 3992 4109 myeloid protein-1, exon 5 4628 4782 myeloid protein-1, exon 6 5316 5515 myeloid protein-1, exon 7 pre-msg 687 5546 mim-1 mRNA and introns IVS 759 1312 mim-1 intron A IVS 1446 2410 mim-1 intron B IVS 2563 3068 mim-1 intron C IVS 3246 3991 mim-1 intron D IVS 4110 4627 mim-1 intron E IVS 4783 5315 mim-1 intron F signal 445 704 promoter sequence signal 625 621 (c) CAAT box signal 659 664 TATA box signal 5526 5531 poly-A signal BASE COUNT 1785 a 1235 c 1132 g 1666 t ORIGIN 1 aactgtggct tggcaccatc catttcagaa tctgagaggt tttaacaaga tttctctgtg 61 aaaaaaacag ggttccagag aaaagactcc atggattgat gtccagacat ggtgaaaata 121 tcagacgaac agaaacagtc tatgtcaaga gcacctgcac tcttagccta agtaatatgt 181 gtactaaaca aacattacac tttcagtgtc ttctgtcgtg tcgtattgaa gccataatgg 241 gcacatacag ctggggagga ggatctggaa ccagcatgag aaagacaact ggtgatgccc 301 acaacagaca acacatggag ttttgcccat cttttcccac cagctgaatc ctctggatcc 361 aattcttgtt tcctatcatt tttggcctaa gcgtgtcctt gcttggggaa ggagaagaaa 421 aagcgttaag aaatggttca accaaagctt cctgaaaagt gggattgttc atttccatct 481 gttaatgtga gctaagacac ccgttacttt acctgtcttt cccaaccagc tctaaaaaac 541 cgttataatg tacagatatc tttagtgcct aatatggcaa ggctgtctgc tgcggcgctg 601 taacacttgg ctcaggactg attggccaac acaacagtcc cacactttgg aaaacacata 661 aatactacag agagctactg taggacgtct ttgactcgct caaaggcacg agatgccagc 721 cctcagcctg atcgccctgc tcagcctggt gtccactggt gagttgggac tgctttggtg 781 tggtttgtct cgagtctgta cagacagaca ctaattaata actgaaatta atttcagcat 841 aaagctaagc tctttaaatg tgtttttatg atgaacatat gtaatgtgcg ctagtatata 901 actattaaca aaaatacaaa gggtgtgttt tacagataga gctatcctga agtgctttgg 961 tttcatggaa aaagtagttg gcattatagc agaaactaat attagaaatc tgtgtctaca 1021 gctttacttc cttttactgt atgtaccttt atcaaggcac aaggcttcca ggaaagcagt 1081 caaatatgca taacaggttg ttcagtgttc ttacagctgc agtttggggc attatgaagg 1141 cccacaaacc atcatttacc tgtaattaag gaggaaacct ctcctgtgag gcagaagtca 1201 cgacacgtgt cttccatctt aaacattgca ttttgcagct taccactttc agttgcacaa 1261 cactaacgca ttcctatttc tattttcctg ttttcctctt ctgtatgggc agcttttgcc 1321 aggcagtggg aggtgcaccc tccgcagcag cagggcaggc actgggcgca gatatgcagc 1381 gggaatcctt tcaatagaat ccggggctgt gacagatacg gctgcggcaa ttacggagcc 1441 agcaggtaac agggcaggca gcaggggcct ggagcaggaa tgcatcatgg ctttcctaca 1501 ccaaatgttc acaaagcaag aatgagatat aaccaacttt taaaaacgaa aggcttgcta 1561 atattttgcc ttctttctcc aagatttaag agcaatacac aaaatcacca gtcgtgcctt 1621 tcatataaca ttcattgata tttctgctgt actttaaatg aaagagagct tctctatgtt 1681 ttctcttact cactttgaga cattactaag caatctcaca gaaaaaaata tcagtgtaat 1741 ttagactaac agatgtatgg gtaaagagta aagaagagag accactacat aaactagcag 1801 gctgagtagg aaccgcaatg gaaagcattc caaggactag tgtaattcat ccttggttct 1861 tatagacaag acacttaact agcaatcatc aatagagctt ataggaagct ctacaaatta 1921 ctttacacaa gctaaagatc tgcagacacc actgcctctg cataggcacc ttataacaga 1981 gaaggttatg ctgtcctaac aaatgctccc aggcccttcc agctcaggat attctctgat 2041 tctatgatct ctactacaga acttaagaat tgctaagtga ggaaaatgcc aaatatacat 2101 ttctatttga catggcacag atgcattctg gggtgggaac aggggtactg tgatggtctg 2161 gcgtcctatg aatttgggag aattaaaaac attcattaaa gtttgaaatt ctgcccccat 2221 ttaaagtggg tcatactaat acaaacacgt tgcaacaaaa atacgactga aactagcttc 2281 tccccactag cttctcccaa tttcactgca gatgtaaaac cttacgtttt aagagaagct 2341 tcatatgtag tgctacgtga tctatctatt aggcctgaca gattgcataa ttacttaatt 2401 atgtcttcag acagggtaaa ggagaaaagc acaagggcgt ggatgtcatc tgcactgatg 2461 gatcaatagt gtacgctccc ttcagcggcc agctctccgg acccattcga ttctttcata 2521 atggaaatgc cattgatgat ggagtccaaa tctctgggtc aggtaagtaa taacagcaca 2581 caggtcagaa ggcaggcgct acagatctta tttacctgag taagatattt ctgtgcctct 2641 tgaagtcaag cagattgctg atttagtctc tctagttctt atactgaagc acaactacac 2701 tgctgctcag aaataccaaa agctccagca ggatacagcc tttttcttag agagaacaga 2761 agaagggagt cgaaaaacac atagcagaaa aacgactgca agtcaaacaa aggaaacagt 2821 aaaaacaaag catccgaatc ctcagttttg atgagtggat attttcctgc ctctgttttc 2881 caacttcctt tttttccttt ttcctatttt ctatagggat gtgatactcg caagattgtt 2941 ggtaccttga ctccatttca gcttcaccac tgtagccccc agtaacagcc cagtgaggaa 3001 gatttcttgc actctgcctc aacagatgca gctggtgttt cctgaccttc ctctcccatc 3061 ttttccaggt tactgtgtaa aactggtctg cattcatccc atcagatacc acggccaaat 3121 ccagaaaggg caacaacttg gaagaatgct gccaatgcaa aaagtgtttc ctggcattgt 3181 gtctcacatt cacgtcgaga actgcgacca gtctgatcct actcatctcc tcagacctat 3241 acctggtaaa aaaaacatgc aacaaacatt tgagttttat catgaatgaa aatatgttta 3301 attgcaaata acaataatat taaacaatag gagtctcctc tgtcataggg tctcctaaag 3361 agcaacctcc aaaagagaga agacatgcat tatagcataa caggaccaca gaaaggaagg 3421 aacatgtatt agcttactga gctttatttc tcacccctca ctgccctccc atggcacagc 3481 agcatcaagt tccagcaaca gagtcagatg tctcttttga tccttctaac taccagcaat 3541 gtttccaata ttacatctta actagatcaa actttaaatt tgcacttgaa aacaatcatc 3601 ctaaaggttc tggtgatggt ttaggactta aggactttga ttccattagt gtaggcacag 3661 aataagaaaa atccaaatcc taccccctca gtttacactc tggtttataa ctgtgataat 3721 cctgtttttg tgtttttttt cctctttgaa gcgtgttctt tatatcagct caccgtaatc 3781 acaaagcatt aagatgtagt ttgaggcact tgttccactt atgtttccaa aggggtgaca 3841 aattttatgg agaatcaaac accaactgaa catgcaagat gtaattccag catgtttctt 3901 cctaactaaa atgaaaatgt aatagttgca aatatgaaat acattaaaaa aaaccaacac 3961 taccttcatt tcctgtttct gtatttcata gatatttcac caccattccc acaacaagat 4021 gctcactggg ctgtagtatg tgctgggaat cctacaaatg agataagagg ctgtgataaa 4081 tatggctgtg gatactttgg agctccaagg tactatactt gaacacatgg gggttttatt 4141 attgctctgt tggaatagac ttgaactctg tttaagctct gaaagtttag gttgaacata 4201 agggaacact tactacagag gtaatgcagc ataaagaaca ggctgccaat gttgttcatg 4261 aaaccgtcat acttggaggg tgttgggtta gacaaagtca tgcctgacct ggtgttggga 4321 acaatacaag ctgaaggtag ggctaaacgc cctccacagg taccttccaa ccagtacttc 4381 tccagctcta gagttctatt gcaaagctag ggagaaatac atatccaccc acacaattcc 4441 caaaagctgt aaatctagca gactatataa cattacctta cctaccaaaa cagaatttct 4501 acacattggt caatcatggc agcaccatct tagtactact aacctgtttt tagaacttgc 4561 taagcgacct ctctctgcat aacgtaaggt ctggaagaga cttctctcat tacatatttg 4621 tttccaggcg caatggtaaa ggagagaagc acaagggcgt ggatgtcatc tgtgctgatg 4681 gtgcaacagt gtatgctccc ttttctggtg agctgtctgg accagttaaa ttctttcata 4741 atggaaacgc cattgatgat ggagtccaaa tcaggggatc aggtataaag ctattttcat 4801 ttatttcatt taatagttca ttttgactca agttcatgta ggatacgttc cctcacaggt 4861 agttgagtat atcagcaaag cacacatttc cagctgctta gctccacatt cagaaatttc 4921 tgctcgtttc ttaggatgaa aatataaaca ctcaatctta aagacaacga gggaagatct 4981 gccttacatg aactattatt tagttttctc tctacaagga aaaaaaaaat tggttgggct 5041 gcaaattaca tcagaaaaaa ttaaaaacgg ggagcttgag ttctgtacaa tgtcttcaag 5101 ctcttctgct aaagcaagaa ttctcacctg tcccttcatt ttcctttccc taacaagcca 5161 gcaactccaa ccgctctctc tactttttac agtatttttg atagcattgc atttccaaaa 5221 ctgctatctc agtggaaagg acaagagttg ttcaaataga atgcaaaaga acaaatgtac 5281 atcatctaat tttttttctt cctcttctta accaggcttc tgtgtcaaac tgctgtgcat 5341 ccatcccatc agatacaatg gtaggatttc taagggacaa gtccttggca gaatgttgcc 5401 gatgcaaaga gtatttcccg ggtacatatc tcatattcat gttgagaact gcgatcgctc 5461 agatcctact agcaatcttg aaagggggaa aggagaaagt gaaatggaag tgtaataagt 5521 tctgaataaa tacatctcag cattcaaatg ctgtttttgt tctcacatgg atggtctatc 5581 tcaatgaaag aaggaaaatg tactttacaa atagaataaa gtgtacacag caggcacaac 5641 aatagattac tgatgcaata tttttgggcc ctacttaata ctgtaaggtc ctaacatcaa 5701 catctctcca ggtaccagtg aggtactgat ttttaagtat tggtagttgt gaatagctac 5761 tgttgcagct tcgccagttt acaaactttc tgcattttgg aatacaaagt tatcgaaa // LOCUS CHKMMIM1X 1054 bp ss-mRNA VRT 25-OCT-1989 DEFINITION Chicken myeloid protein (mim-1) mRNA, complete cds. ACCESSION M29449 KEYWORDS myeloid protein-1. SOURCE Chicken (strain SPAFAS) hatchling myeloblast, cDNA to mRNA, clone J612. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (sites; for [2]) AUTHORS Ness,S.A., Marknell,A. and Graf,T. TITLE The v-myb oncogene binds to and activates the promyelocyte- specific mim-1 gene JOURNAL Cell 59, 1115-1125 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1054) AUTHORS Ness,S.A. JOURNAL Unpublished (1989) EMBL pf 10.2209, 6900 Heidelberg 1, Germany STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.A.Ness, 25-OCT-1989, for release after publication. FEATURES from to/span description pept 27 1007 myeloid protein-1 BASE COUNT 297 a 241 c 265 g 251 t ORIGIN 1 gtctttgact cgctcaaagg cacgagatgc cagccctcag cctgatcgcc ctgctcagcc 61 tggtgtccac tgcttttgcc aggcagtggg aggtgcaccc tccgcagcag cagggcaggc 121 actgggcgca gatatgcagc gggaatcctt tcaatagaat ccggggctgt gacagatacg 181 gctgcggcaa ttacggagcc agcagacagg gtaaaggaga aaagcacaag ggcgtggatg 241 tcatctgcac tgatggatca atagtgtacg ctcccttcag cggccagctc tccggaccca 301 ttcgattctt tcataatgga aatgccattg atgatggagt ccaaatctct gggtcaggtt 361 actgtgtaaa actggtctgc attcatccca tcagatacca cggccaaatc cagaaagggc 421 aacaacttgg aagaatgctg ccaatgcaaa aagtgtttcc tggcattgtg tctcacattc 481 acgtcgagaa ctgcgaccag tctgatccta ctcatctcct cagacctata cctgatattt 541 caccaccatt cccacaacaa gatgctcact gggctgtagt atgtgctggg aatcctacaa 601 atgagataag aggctgtgat aaatatggct gtggatactt tggagctcca aggcgcaatg 661 gtaaaggaga gaagcacaag ggcgtggatg tcatctgtgc tgatggtgca acagtgtatg 721 ctcccttttc tggtgagctg tctggaccag ttaaattctt tcataatgga aacgccattg 781 atgatggagt ccaaatcagg ggatcaggct tctgtgtcaa actgctgtgc atccatccca 841 tcagatacaa tggtaggatt tctaagggac aagtccttgg cagaatgttg ccgatgcaaa 901 gagtatttcc cgggtacata tctcatattc atgttgagaa ctgcgatcgc tcagatccta 961 ctagcaatct tgaaaggggg aaaggagaaa gtgaaatgga agtgtaataa gttctgaata 1021 aatacatctc agcattcaaa aaaaaaaaaa aaaa //
GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS SYCTREA 76 bp ss-tRNA RNA 15-JUN-1989 DEFINITION Synechocystis sp. Glu-tRNA. ACCESSION M19535 KEYWORDS transfer RNA; transfer RNA-Glu. SOURCE Synechocystis PCC 6803 tRNA. ORGANISM Synechocystis sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 76) AUTHORS O'Neill,G., Peterson,D., Schon,A., Chen,M.-W. and Soll,D. TITLE Formation of the chlorophyll-precursor delta-aminolevulinic acid in cyanobacteria requires aminoacylation of a tRNA-Glu JOURNAL J. Bacteriol. (1988) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.Soll, 30-JUN-1988 FEATURES from to/span description tRNA 1 76 Glu-tRNA modified 20 20 d modified 54 54 t modified 55 55 p site 34 34 modified bsae (presumably uracil) BASE COUNT 15 a 25 c 21 g 14 t 1 others ORIGIN 1 gcccccatcg tctagaggcc taggacacct ccctntcacg gaggcgacag ggattcgaat 61 tcccttgggg gtacca // LOCUS SYCTREAB 1000 bp ds-DNA BCT 13-FEB-1990 DEFINITION Synechocystis sp. Glu-tRNA gene. ACCESSION M32099 KEYWORDS transfer RNA; transfer RNA-Glu. SOURCE Synechocystis PCC 6803 DNA, clone pS6TE. ORGANISM Synechocystis sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 1000) AUTHORS O'Neill,G.P. and Soll,D. TITLE Expression of the Synechocystis strain 6803 tRNA-Glu provides a functional excess of tRNA for protein and chlorophyll biosynthesis JOURNAL Unpublished (1990) Yale Univ. New Haven, CT 06511 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.Soll, 14-FEB-1990. FEATURES from to/span description tRNA 201 273 Glu-tRNA signal 97 136 inverted repeat/stem loop signal 130 135 -35 transcriptional promoter element signal 154 159 -10 transcriptional promoter element signal 293 320 inverted repeat/rho-dependent terminator BASE COUNT 261 a 223 c 254 g 262 t ORIGIN 1 cccgatggca ctaaactggt gacggtccac gaacccattc gctagtaatg agctggccgg 61 tcagggcgga ctgtttttta ccgttaaacc ccaggcctgt tgaccgtaga aaaaattcat 121 tgtctagggt tgacagtgat caatctttat cgctatattt aaaaacgtct gaaataacga 181 actgttatcg agactgccta gcccccatcg tctagaggcc taggacacct ccctttcacg 241 gaggcgacag ggattcgaat tcccttgggg gtattaaaaa aaaagaaaaa acgaaaagaa 301 gttcttcggg gcttcttttt tgttgctcac ccaaacaatt ccccaagaag tttaaaaccc 361 tctttcggtt tatggaacag aagaagaagc ggggagcttg tcgagggaaa cagcgcagaa 421 atggcaagac ccaaactcca tctatggcag tataggggcg tagttctttg tccttgaggc 481 tctgtctggc aaatgttaat cctgcgtcgc ctaatactag ttgtttccct tagttttgcc 541 acgatcgcca ttgacattac tacgagacca gggctggaac cagtgcaagg ccagggggtg 601 gagacagtag actcgcaaac actaaaagtg ggggtggtgg gcaatcctcc ctttgtcttt 661 tatggagaag gaaaaaacgc agcatttacc ggtattagcc tagatgtgtg gagagctgtg 721 gccgagtccc agaagtgaat tcagaatatg taaggcaaaa ttccatctcc gcaggcatca 781 cagcggtggc ggaaggggaa ttggatattt taattggtcc cattagtgtt accccagagc 841 gggccgccat agagggcatt accttcaccc agccctattt cagtagtggc attggcctgt 901 attccaggca aaccggtgag tttatgggaa agattttcgc ctttttttgg catagcgctt 961 tgtcttcagc gggggtactt actctacttt tatttttagg //
GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS RATADHX1 546 bp ds-DNA ROD 27-OCT-1989 DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 1. ACCESSION M29516 J04747 KEYWORDS alcohol dehydrogenase. SEGMENT 1 of 8 SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 124; 125 to 239) AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R., Qulali,M., Zhang,K. and Edenberg,H.J. TITLE Structure and expression of the rat class I alcohol dehydrogenase gene JOURNAL Genomics 5, 906-914 (1989) STANDARD full staff_entry REFERENCE 2 (bases 239 to 546) AUTHORS Crabb,D.W. JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Crabb, 13-OCT-1989, for release after publication. FEATURES from to/span description pept 161 + 178 alcohol dehydrogenase, exon 1 (EC 1.1.1.1) pre-msg < 1 > 422 ADH mRNA and introns IVS 179 > 546 ADH intron A BASE COUNT 162 a 101 c 127 g 156 t ORIGIN 1 gaataaactg atgggtgtgt cgtcgtgaag ggctagatca catgtgggat cagctgacac 61 cccagcaaat aaaatgctgt ggcaactttg gcccagttca ctgcaggagg gaagaagtct 121 acgaggatcc atccatttcc tgcccacagc gaaggacagc atgagcacag ctggaaaagt 181 aaggaaaaat acctcaccct tggaaatata ttctcattga tatgtgaaga gatgcagaat 241 cctaacttag aaatgacagg catcgctggg gtctggtgca gcttgactgt actaagtatc 301 tttaatgaga gggtatagcg tctaggtttg tgaatgcctt catttcctca ttaactctgt 361 aggacttttt ccaagccaat ggtgactgct ttgcaaatag aaagttttta ggtcttttat 421 cgaaagtctg tatttatctt aagataccaa caaatgtgct gtccttcgta atcatatttt 481 attatgaaca caggaacgtt aaatcggaca gagtttcata tgaggaattt gcttcaggga 541 atttag // LOCUS RATADHX2 237 bp ds-DNA ROD 27-OCT-1989 DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 2. ACCESSION M29517 J04747 KEYWORDS alcohol dehydrogenase. SEGMENT 2 of 8 SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 233) AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R., Qulali,M., Zhang,K. and Edenberg,H.J. TITLE Structure and expression of the rat class I alcohol dehydrogenase gene JOURNAL Genomics 5, 906-914 (1989) STANDARD full staff_entry REFERENCE 2 (bases 234 to 237) AUTHORS Crabb,D.W. JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Crabb, 13-OCT-1989, for release after publication. FEATURES from to/span description pept + 112 + 213 alcohol dehydrogenase, exon 2 (EC 1.1.1.1) pre-msg < 1 > 237 ADH mRNA and introns IVS < 1 111 ADH intron A IVS 214 > 237 ADH intron B BASE COUNT 54 a 49 c 67 g 67 t ORIGIN About 2.0 Kb after segment 1. 1 aattatgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gagtgataga 61 aggatacaat tgcgtgatct cgttggagct gaagtaaccc tttgcttcca ggtaatcaaa 121 tgcaaagcgg ccgtgctatg ggagcctcac aagcccttca ccatcgagga catagaagtc 181 gcacccccca aggcccatga agttcgcatt aaggtgaaca ccccttttca agtctgt // LOCUS RATADHX3 186 bp ds-DNA ROD 27-OCT-1989 DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 3. ACCESSION M29518 J04747 KEYWORDS alcohol dehydrogenase. SEGMENT 3 of 8 SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 11 to 186) AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R., Qulali,M., Zhang,K. and Edenberg,H.J. TITLE Structure and expression of the rat class I alcohol dehydrogenase gene JOURNAL Genomics 5, 906-914 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 10) AUTHORS Crabb,D.W. JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Crabb, 13-OCT-1989, for release after publication. FEATURES from to/span description pept + 30 + 169 alcohol dehydrogenase, exon 3 (EC 1.1.1.1) pre-msg < 1 > 186 ADH mRNA and introns IVS < 1 29 ADH intron B IVS 170 > 185 ADH intron C BASE COUNT 44 a 44 c 56 g 42 t ORIGIN 1 gacaactgaa cttgatcttt aacccacaga tggtggccac cggagtctgc cgctcagacg 61 atcacgcggt tagtggatcc ctgttcacgc ctcttcctgc agttctaggc cacgagggag 121 ctggcattgt tgagagcatt ggagaagggg tgacttgtgt gaaaccaggg tacagaatag 181 agctga // LOCUS RATADHX4 586 bp ds-DNA ROD 27-OCT-1989 DEFINITION Rat alcohol dehydrogenase (ADH) gene, exons 4 and 5. ACCESSION M29519 J04747 KEYWORDS alcohol dehydrogenase. SEGMENT 4 of 8 SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 119 to 582) AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R., Qulali,M., Zhang,K. and Edenberg,H.J. TITLE Structure and expression of the rat class I alcohol dehydrogenase gene JOURNAL Genomics 5, 906-914 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 118; 583 to 586) AUTHORS Crabb,D.W. JOURNAL Unpublished 1900 (1989) Med Res. build. rm 424 Indiana U. Ind., IN STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Crabb, 13-OCT-1989, for release after publication. FEATURES from to/span description pept + 154 240 alcohol dehydrogenase, exon 4 (EC 1.1.1.1) 332 + 554 alcohol dehydrogenase, exon 5 pre-msg < 1 > 586 ADH mRNA and introns IVS < 1 153 ADH intron C IVS 241 331 ADH intron D IVS 555 > 586 ADH intron E BASE COUNT 140 a 157 c 136 g 153 t ORIGIN About .9 Kb after segment 3. 1 ttttagagca tattcataat agagacacac atgcacctga aagccaaaac tgactatgag 61 gaaatgatta tggcttggga acatgcatgc tcttctggag gccctaactg ggtcattcac 121 aaccacttgc cattctgttt tctcttcatc cagtgataaa gtcatcccgc tgttctctcc 181 ccagtgtgga aaatgcagga tctgcaagca cccggaaagc aacctctgtt gccaaactaa 241 gtgatgtgtt tcttattgtc ttcttgcaca attgggcggc actctggttt cattctgtct 301 cacatcgttt gtattcctgt gtttcaccaa ggaatctgac acagcctaag ggagctttgc 361 tggacggcac cagcaggttc tcctgcaggg gaaagcccat tcaccacttc atcagcacca 421 gcaccttctc ccagtacact gtggtagatg acatagcggt ggccaaaatc gatgcggctg 481 caccgctgga caaagtctgc ctcatcggct gtggcttctc gactggctat ggctctgccg 541 tccaagtcgc caaggtagga tggacaatgg cccctggaac aaggct // LOCUS RATADHX5 409 bp ds-DNA ROD 27-OCT-1989 DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 6. ACCESSION M29520 J04747 KEYWORDS alcohol dehydrogenase. SEGMENT 5 of 8 SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 110 to 404) AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R., Qulali,M., Zhang,K. and Edenberg,H.J. TITLE Structure and expression of the rat class I alcohol dehydrogenase gene JOURNAL Genomics 5, 906-914 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 109; 405 to 409) AUTHORS Crabb,D.W. JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Crabb, 13-OCT-1989, for release after publication. FEATURES from to/span description pept + 122 + 382 alcohol dehydrogenase, exon 6 (EC 1.1.1.1) pre-msg < 1 > 409 ADH mRNA and introns IVS < 1 121 ADH intron E IVS 383 > 409 ADH intron F BASE COUNT 108 a 94 c 106 g 101 t ORIGIN About 2.3 Kb after segment 4. 1 gggtttccaa ctgatggaat gaagtagcca cgataaagga catagcaagg cagtttgtgc 61 agtgtggaac acttccgaac agtcaccatt cagtccactc ttgcgttttc tgaaaataca 121 ggtgacccca ggctccacct gtgccgtgtt tggcctggga ggtgttggtc tgtctgtcgt 181 cattggctgt aaaacagcag gagcagccaa gatcattgcc gtggacatca acaaagacaa 241 gtttgcgaag gccaaagagt taggtgccac tgactgtatc aaccctcaag actacaccaa 301 acccatccag gaagttctcc aggagatgac tgatggaggg gtggactttt catttgaagt 361 cattggccgt cttgatacca tggtatgtac tgtggcatac cttaagatc // LOCUS RATADHX6 262 bp ds-DNA ROD 27-OCT-1989 DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 7. ACCESSION M29521 J04747 KEYWORDS alcohol dehydrogenase. SEGMENT 6 of 8 SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 49 to 216) AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R., Qulali,M., Zhang,K. and Edenberg,H.J. TITLE Structure and expression of the rat class I alcohol dehydrogenase gene JOURNAL Genomics 5, 906-914 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 48; 217 to 262) AUTHORS Crabb,D.W. JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Crabb, 13-OCT-1989, for release after publication. FEATURES from to/span description pept + 63 + 198 alcohol dehydrogenase, exon 7 (EC 1.1.1.1) pre-msg < 1 > 262 ADH mRNA and introns IVS < 1 62 ADH intron F IVS 199 > 262 ADH intron G BASE COUNT 62 a 68 c 65 g 67 t ORIGIN About 1.8 Kb after segment 5. 1 attgcctaaa gctgcacagg accacaaaaa gaatctactt taactattaa tgtttattcc 61 agacttctgc cctgttaagc tgccattcag catgcggtgt aagcgtcatt gtcggggtgc 121 ctcccagtgc ccaaagcctc tccgttaacc ccatgtcgct gctgctggga cgcacctgga 181 aaggagcaat attcggcggt atgtatttac agcgaggaga tcaacgtccg catctgttca 241 tgcatgtcaa ggtgggcgtg cg // LOCUS RATADHX7 186 bp ds-DNA ROD 27-OCT-1989 DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 8. ACCESSION M29522 J04747 KEYWORDS alcohol dehydrogenase. SEGMENT 7 of 8 SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 21 to 186) AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R., Qulali,M., Zhang,K. and Edenberg,H.J. TITLE Structure and expression of the rat class I alcohol dehydrogenase gene JOURNAL Genomics 5, 906-914 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 20) AUTHORS Crabb,D.W. JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Crabb, 13-OCT-1989, for release after publication. FEATURES from to/span description pept + 31 + 169 alcohol dehydrogenase, exon 8 (EC 1.1.1.1) pre-msg < 1 > 186 ADH mRNA and introns IVS < 1 30 ADH intron G IVS 170 > 186 ADH intron H BASE COUNT 47 a 37 c 45 g 57 t ORIGIN About .9 Kb after segment 6. 1 gatctcatgg agatgctgct ttcccttcag ggtttaagag taaagatgcc gtccccaaac 61 ttgtcgctga cttcatggct aagaagtttc cgttggagcc gctgattact catgttttac 121 cttttgaaaa gataaatgaa gcatttgacc tgctccgtgc tggaaagagg taagctttga 181 gagtat // LOCUS RATADHX8 545 bp ds-DNA ROD 27-OCT-1989 DEFINITION Rat alcohol dehydrogenase (ADH) gene, exon 9. ACCESSION M29523 J04747 KEYWORDS alcohol dehydrogenase. SEGMENT 8 of 8 SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rADH18. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 65 to 360) AUTHORS Crabb,D.W., Stein,P.M., Dipple,K.M., Hittle,J.B., Sidhu,R., Qulali,M., Zhang,K. and Edenberg,H.J. TITLE Structure and expression of the rat class I alcohol dehydrogenase gene JOURNAL Genomics 5, 906-914 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 64; 361 to 545) AUTHORS Crabb,D.W. JOURNAL Unpublished 1900 (1989) Indiana U. 975 Walnut St. Indianapolis IN STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Crabb, 13-OCT-1989, for release after publication. FEATURES from to/span description pept + 94 118 alcohol dehydrogenase, exon 9 (EC 1.1.1.1) pre-msg < 1 > 360 ADH mRNA and introns IVS < 1 93 ADH intron H BASE COUNT 130 a 150 c 109 g 156 t ORIGIN About .7 Kb after segment 7. 1 gttgtctcag agatcgtaac acatgtttta ctcgtgcata tggggagtgt aggtgtggag 61 tgtgaaaacg gtatcttcta ttttgacttt cagtatccgt accgtcctga cgttctgaga 121 acacatgaat gccttcccgg tttctgactc ctccgcctac cgaatcaagc gccagccaca 181 tcacaacctt aaccttgctc ttcagagaca cacagccaat aaagcacctg tgcaagcttt 241 ccgtgagaaa ctgagttttc gatggaacct gttccaaccg catgtttaca atcctatctt 301 attgtttcgc acctcagctg cctgttaaac accgtgatca aaacacccag cccattgctc 361 ttcgcccctc cccggaacac tgtactattt ctggaggtgt tgggtcagtc attatttcag 421 gtttttattg aaggaaacaa ctccatattc attgtccacc aaagggcact tagaacgcac 481 gagcgctcat gtgtggtgct gccttttagt tcttacaaca gagcttctcc acgcttccag 541 ccagc //
GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS CHTHYPAB 2659 bp ds-DNA BCT 12-FEB-1990 DEFINITION C.trachomatis HypA and HypB protein genes, complete cds. ACCESSION M31739 KEYWORDS . SOURCE C.trachomatis (serovar A, isolate HAR-13) elementary body DNA, clone pTA571. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 2659) AUTHORS Morrison,R.P., Lyng,K., Su,H., Yuan,Y. and Zhang,Y.-X. JOURNAL Unpublished (1990) Rocky Mountain Laboratory, Hamilton, MT 59840 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.P.Morrison, 31-JAN-1990. FEATURES from to/span description pept 441 779 HypA protein pept 817 2451 HypB protein BASE COUNT 830 a 527 c 613 g 689 t ORIGIN 1 gtcgattttt gtgatgctcg tcagggggcg gagcctatgg aaaaacgcca gcaacgcggc 61 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 121 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 181 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 241 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 301 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 361 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 421 atttcacaca ggaaacagct atgaccatga ttacgaattc gagctcggta cccggggatc 481 aagcaacgac cctcaagatt aaacctttgg gagatagaat tttagttaaa agagaagaag 541 aagcttccac tgcaagaggc ggaatcattc ttcctgacac tgccaagaaa aagcaagata 601 gagctgaagt tttagctcta ggaacaggca aaaaagatga taaagggcag caacttcctt 661 ttgaagttca ggttggtgac atcgttttaa ttgataaata ttctggccaa gaacttactg 721 tcgaaggtga agagtacgtc atcgttcaaa tgagcgaagt tatcgcagtt ctgcaataaa 781 aactaagaga gtgaagaaga tttaaggagc gcatcaatgg tcgctaaaaa cattaaatac 841 aacgaagaag ccagaaagaa aattcaaaaa ggagttaaga ctttagctga agctgtaaaa 901 gtcactctag ggcctaaagg acgacatgtt gtcatagata aaagcttcgg atcccctcaa 961 gtaactaaag atggtgttac cgttgcgaaa gaagttgagc ttgccgacaa acatgaaaat 1021 atgggcgctc aaatggtcaa agaagtcgcc agcaaaactg ctgacaaagc tggagacgga 1081 actacaacag ctactgttct tgctgaagct atctatacag aaggattacg caatgtaaca 1141 gctggagcaa atccaatgga cctcaaacga ggtattgata aagctgttaa ggttgttgtt 1201 gatcaaatca aaaaaatcag caagcctgtt cagcatcata aagaaattgc tcaagttgca 1261 acaatttctg ctaataatga tgcagaaatc gggaatctga ttgctgaagc aatggagaaa 1321 gttggtaaaa acggctctat cactgttgaa gaagcaaaag gatttgaaac cgttttggat 1381 gttgttgaag gaatgaattt caatagaggt tacctctcta gctacttcgc aacaaatcca 1441 gaaactcaag aatgtgtatt agaagacgct ttggttctaa tctacgataa gaaaatttct 1501 gggatcaaag atttccttcc tattttacaa caagttgctg aatccggccg tcctcttctt 1561 attatagcag aagacattga aggcgaagct ttagctactt tggtcgtgaa cagaattcgt 1621 ggaggattcc gggtttgcgc agttaaagct ccaggctttg gagatagaag aaaagctatg 1681 ttggaagaca tcgctatctt aactggcggt caactcatta gcgaagagtt gggcatgaaa 1741 ttagaaaacg ctaacttagc tatgttaggt aaagctaaaa aagttatcgt ttctaaagaa 1801 gacacgacca tcgtcgaagg aatgggtgaa aaagaagctt tagaagctcg ttgcgaaagc 1861 atcaaaaaac aaattgaaga cagctcttct gattacgata aagaaaaact ccaagagcgt 1921 cttgctaagc tctctggtgg agtagcagtc attcgcgttg gagctgcaac agagattgag 1981 atgaaagaga aaaaagatcg tgtagacgat gctcaacatg ctacaatcgc tgctgttgaa 2041 gaaggaattc ttcctggtgg aggaacagca ttaatccgtt gtatccctac tcttgaagcc 2101 ttcttgccaa tgttgactaa tgaagatgag caaattggag ctcgcattgt tttgaaagct 2161 ctttccgctc ctttgaaaca aattgctgca aacgcaggaa aagaaggtgc tatcatcttc 2221 caacaagtta tgtcccgttc tgcgaacgaa ggatatgatg cattgcgtga tgcatacaca 2281 gatatgcttg aagctggtat tttagatcct gctaaagtaa cccgttctgc tttagaaagc 2341 gcggcttccg tagctggatt acttttgaca acagaagctc tcattgcaga gattccagaa 2401 gaaaaacctg ctgcagctcc agcaatgcct ggcgcaggaa tggactatta attcctctaa 2461 tgggaacaaa tagattcttc gagcctcgtt tccaaaagga acgaggcttt tttttagatt 2521 cctaatattt ctctattcct ctatcgtaaa catctagtgc ttacgaccat ccttttctat 2581 gtttaaacta atcaagagcg catttctcat agcctgttgt attgtagggt acttctggat 2641 aaaaaaagaa agtattgtt // LOCUS HUMCA1XIA 6158 bp ss-mRNA PRI 15-MAR-1989 DEFINITION Human alpha-1 type XI collagen (COL11A1) mRNA, complete cds. ACCESSION J04177 J05407 KEYWORDS alpha-1 (XI) collagen; collagen; type XI collagen. SOURCE Human placenta fibroblast, cDNA to mRNA, clone OK4 [1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1773 to 6158) AUTHORS Bernard,M., Yoshioka,H., Rodriguez,E., van der Rest,M., Kimura,T., Ninomiya,Y., Olsen,B.R. and Ramirez,F. TITLE Cloning and sequencing of pro-alpha-1(XI) collagen cDNA demonstrates that type XI belongs to the fibrillar class of collagens and reveals that the expression of the gene is not restricted to cartilagenous tissue JOURNAL J. Biol. Chem. 263, 17159-17166 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 1835) AUTHORS Yoshioka,H. and Ramirez,F. TITLE Pro alpha-1(XI) collagen, structure of the amino-terminal propeptide and expression of the gene in tumor cell lines JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Ramirez, 02-FEB-1990; for [2] by M.Bernard, 21-SEP-1988. FEATURES from to/span description pept 162 5582 alpha-1 (type XI) collagen precursor /nomgen="COL11A1" /map="1p21" /hgml_locus_uid="LH0170J" sigp 162 269 alpha-1 (type XI) collagen signal peptide matp 270 1745 alpha-1 (type XI) collagen matp 5013 5579 alpha-1 (type XI) collagen BASE COUNT 1693 a 1355 c 1746 g 1361 t 3 others ORIGIN 1 aaccatcaaa tttagaagaa aaagcccttt gactttttcc ccctctccct ccccaatggc 61 tgtgtagcaa acatccctgg cgataccttg gaaaggacga agttggtctg cagtcgcaat 121 ttcgtgggtt gagttcacag ttgtgagtgc ggggctcgga gatggagccg tggtcctcta 181 ggtggaaaac gaaacggtgg ctctgggatt tcaccgtaac aaccctcgca ttgaccttcc 241 tcttccaagc tagagaggtc agaggagctg ctccagttga tgtactaaaa gcactagatt 301 ttcacaattc tccagaggga atatcaaaaa caacgggatt ttgcacaaac agaaagaatt 361 ctaaaggctc agatactgct tacagagttt caaagcaagc acaactcagt gccccaacaa 421 aacagttatt tccaggtgga actttcccag aagacttttc aatactattt acagtaaaac 481 caaaaaaagg aattcagtct ttccttttat ctatatataa tgagcatggt attcagcaaa 541 ttggtgttga ggttgggaga tcacctgttt ttctgtttga agaccacact ggaaaacctg 601 ccccagaaga ctatcccctc ttcagaactg ttaacatcgc tgacgggaag tggcatcggg 661 tagcaatcag cgtggagaag aaaactgtga caatgattgt tgattgtaag aagaaaacca 721 cgaaaccact tgatagaagt gagagagcaa ttgttgatac caatggaatc acggtttttg 781 gaacaaggat tttggatgaa gaagtttttg agggggacat tcagcagttt ttgatcacag 841 gtgatcccaa ggcagcatat gactactgtg agcattatag tccagactgt gactcttcag 901 cacccaaggc tgctcaagct caggaacctc agatagatga gtatgcacca gaggatataa 961 tcgaatatga ctatgagtat ggggaagcag agtataaaga ggctgaaagt gtaacagagg 1021 gacccactgt aactgaggag acaatagcac agacggaggc aaacatcgtt gatgattttc 1081 aagaatacaa ctatggaaca atggaaagtt accagacaga agctcctagg catgtttctg 1141 ggacaaatga gccaaatcca gttgaagaaa tatttactga agaatatcta acgggagagg 1201 attatgattc ccagaggaaa aattctgagg atacactata tgaaaacaaa gaaatagacg 1261 gcagggattc tgatcttctg gtagatggag atttaggcga atatgatttt tatgaatata 1321 aagaatatga agataaacca acaagccccc ctaatgaaga atttggtcca ggtgtaccag 1381 cagaaactga tattacagaa acaagcataa atggccatgg tgcatatgga gagaaaggac 1441 agaaaggaga accagcagtg gttgagcctg gtatgcttgt cgaaggacca ccaggaccag 1501 caggacctgc aggtattatg ggtcctccag gtctacaagg ccccactgga ccccctggtg 1561 accctggcga taggggcccc ccaggacgtc ctggcttacc aggggctgat ggtctacctg 1621 gtcctcctgg tactatgttg atgttaccgt tccgttatgg tggtgatggt tccaaaggac 1681 caaccatctc tgctcaggaa gctcaggctc aagctattct tcagcaggct cggattgctc 1741 tgagaggccc acctggccca atgggtctaa ctggaagacc aggtcctgtg ggggggcctg 1801 gttcatctgg ggccaaaggt gagagtggtg atccaggtcc tcagggccct cgaggcgtcc 1861 agggtccccc tggtccaacg ggaaaacctg gaaaaagggg tcgtccaggt gcagatggag 1921 gaagaggaat gccaggagaa cctggggcaa agggagatcg agggtttgat ggacttccgg 1981 gtctgccagg tgacaaaggt cacaggggtg aacgaggtcc tcaaggtcct ccaggtcctc 2041 ctggtgatga tggaatgagg ggagaagatg gagaaattgg accaagaggt cttccaggtg 2101 aagctggccc acgaggtttg ctgggtccaa ggggaactcc aggagctcca gggcagcctg 2161 gtatggcagg tgtagatggc cccccaggac caaaagggaa catgggtccc caaggggagc 2221 ctgggcctcc aggtcaacaa gggaatccag gacctcaggg tcttcctggt ccacaaggtc 2281 caattggtcc tcctggtgaa aaaggaccac aaggaaaacc aggacttgct ggacttcctg 2341 gtgctgatgg gcctcctggt catcctggga aagaaggcca gtctggagaa aagggggctc 2401 tgggtccccc tggtccacaa ggtcctattg gatnnccggg cccccgggga gtaaagggag 2461 cagatggtgt cagaggtctc aagggatcta aaggtgaaaa gggtgaagat ggttttccag 2521 gattcaaagg tgacatgggt ctaaaaggtg acagaggaga agttggtcaa attggcccaa 2581 gagggnaaga tggccctgaa ggacccaaag gtcgagcagg cccaactgga gacccaggtc 2641 cttcaggtca agcaggagaa aagggaaaac ttggagttcc aggattacca ggatatccag 2701 gaagacaagg tccaaagggt tccactggat tccctgggtt tccaggtgcc aatggagaga 2761 aaggtgcacg gggagtagct ggcaaaccag gccctcgggg tcagcgtggt ccaacgggtc 2821 ctcgaggttc aagaggtgca agaggtccca ctgggaaacc tgggccaaag ggcacttcag 2881 gtggcgatgg ccctcctggc cctccaggtg aaagaggtcc tcaaggacct cagggtccag 2941 ttggattccc tggaccaaaa ggccctcctg gaccaccagg aaggatgggc tgcccaggac 3001 accctgggca acgtggggag actggatttc aaggcaagac cggccctcct gggccagggg 3061 gagtggttgg accacaggga ccaaccggtg agactggtcc aataggggaa cgtgggtatc 3121 ctggtcctcc tggccctcct ggtgagcaag gtcttcctgg tgctgcagga aaagaaggtg 3181 caaagggtga tccaggtcct caaggtatct cagggaaaga tggaccagca ggattacgtg 3241 gtttcccagg ggaaagaggt cttcctggag ctcagggtgc acctggactg aaaggagggg 3301 aaggtcccca gggcccacca ggtccagttg gctcaccagg agaacgtggg tcagcaggta 3361 cagctggccc aattggttta cgagggcgcc cgggacctca gggtcctcct ggtccagctg 3421 gagagaaagg tgctcctgga gaaaaaggtc cccaagggcc tgcagggaga gatggagttc 3481 aaggtcctgt tggtctccca gggccagctg gtcctgccgg ctcccctggg gaagacggag 3541 acaagggtga aattggtgag ccgggacaaa aaggcagcaa gggtggcaag ggagaaaatg 3601 gccctcccgg tcccccaggt cttcaaggac cagttggtgc ccctggaatt gctggaggtg 3661 atggtgaacc aggtcctaga ggacagcagg ggatgtttgg gcaaaaaggt gatgagggtg 3721 ccagaggctt ccctggacct cctggtccaa taggtcttca gggtctgcca ggcccacctg 3781 gtgaaaaagg tgaaaatggg gatgttggtc catgggggcc acctggtcct ccaggcccaa 3841 gaggccctca aggtcccaat ggagctgatg gaccacaagg acccccaggt tctgttggtt 3901 cagttggtgg tgttggagaa aagggtgaac ctggagaagc aggaaaccca gggcctcctg 3961 gggaagcagg tgtaggcggt cccaaaggag aaagaggaga gaaaggggaa gctggtccac 4021 ctggagctgc tggacctcca ggtgccaagg ggccgccagg tgatgatggc cctaagggta 4081 acccgggtcc tgttggtttt cctggagatc ctggtcctcc tggggaactt ggccctgcag 4141 gtcaagatgg tgttggtggt gacaagggtg aagatggaga tcctggtcaa ccgggtcctc 4201 ctggcccatc tggtgaggct ggcccaccag gtcctcctgg aaaacgaggt cctcctggag 4261 ctgcaggtgc agagggaaga caaggtgaaa aaggtgctaa gggggaagca ggtgcagaag 4321 gtcctcctgg aaaaaccggc ccagtcggtc ctcagggacc tgcaggaaag cctggtccag 4381 aaggtcttcg gggcatccct ggtcctgtgg gagaacaagg tctccctgga gctgcaggcc 4441 aagatggacc acctggtcct atgggacctc ctggcttacc tggtctcaaa ggtgaccctg 4501 gctccaaggg tgaaaaggga catcctggtt taattggcct gattggtcct ccaggagaac 4561 aaggggaaaa aggtgaccga gggctccctg gaactcaagg atctccagga gcaaaagggg 4621 atgggggaat tcctggtcct gctggtccct taggtccacc tggtcctcca ggcttaccag 4681 gtcctcaagg cccaaagggt aacaaaggct ctactggacc cgctggccag aaaggtgaca 4741 gtggtcttcc agggcctcct gggcctccag gtccacctgg tgaagtcatt cagcctttac 4801 caatcttgtc ctccaaaaaa acgagaagac atactgaagg catgcaagca gatgcagatg 4861 ataatattct tgattactcg gatggaatgg aagaaatatt tggttccctc aattccctga 4921 aacaagacat cgagcatatg aaatttccaa tgggtactca gaccaatcca gcccgaactt 4981 gtaaagacct gcaactcagc catcctgact tcccagatgg tgaatattgg attgatccta 5041 accaaggttg ctcaggagat tccttcaaag tttactgtaa tttcacatct ggtggtgaga 5101 cttgcattta tccagacaaa aaatctgagg gagtaagaat ttcatcatgg ccaaaggaga 5161 aaccaggaag ttggtttagt gaatttaaga ggggaaaact gctttcatac ttagatgttg 5221 aaggaaattc catcaatatg gtgcaaatga cattcctgaa acttctgact gcctctgctc 5281 ggcaaaattt cacctaccac tgtcatcagt cagcagcctg gtatgatgtg tcatcaggaa 5341 gttatgacaa agcacttcgc ttcctgggat caaatgatga ggagatgtcc tatgacaata 5401 atccttttat caaaacactg tatgatggtt gtacgtccag aaaaggctat gaaaaaactg 5461 tcattgaaat caatacacca aaaattgatc aagtacctat tgttgatgtc atgatcagtg 5521 actttggtga tcagaatcag aagttcggat ttgaagttgg tcctgtttgt tttcttggct 5581 aagattaaga caaagaacat atcaaatcaa cagaaaatgt accttggtgc caccaaccca 5641 ttttgtgcca catgcaagtt ttgaataagg atgtatggaa aacaacgctg catatacagg 5701 taccatttag gaaataccga tgcctttgtg ggggcagaat cacagacaaa agctttgaaa 5761 atcataaaga tataagttgg tgtggctaag atggaaacag ggctgattct tgattcccaa 5821 ttctcaactc tccttttcct atttgaattt ctttggtgct gtagaaaaca aaaaaagaaa 5881 aatatatatt cataaaaaat atggtgctca ttctcatcca tccaggatgt actaaaacag 5941 tgtgtttaat aaattgtaat tattttgtgt acagttctat actgttatct gtgtccattt 6001 ccaaaacttg cacgtgtccc tgaattccgc tgactctaat ttatgaggat gccgaactct 6061 gatggcaata atatatgtat tatgaaaatg aagttatgat ttccgatgac cctaagtccc 6121 tttctttggt taatgatgaa attcctttgt gtgtgttt // LOCUS STYENVM 1656 bp ds-DNA BCT 14-FEB-1990 DEFINITION S.typhimurium envM protein gene, complete cds. ACCESSION M31806 KEYWORDS . SOURCE S.typhimurium (strain AG701i50) DNA, clone KF501. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1656) AUTHORS Turnowsky,F., Fuchs,K., Jeschek,C. and Hoegenauer,G. TITLE envM genes of Salmonella typhimurium and Escherichia coli JOURNAL J. Bacteriol. 171, 6555-6565 (1989) STANDARD simple staff_review FEATURES from to/span description pept 259 558 ORF-1 pept 676 1464 envM protein BASE COUNT 413 a 395 c 394 g 454 t ORIGIN 1 cgatgatatc tcttaaatac ctttctgata tctggcaact atgatttccg atctgaaatg 61 taaacggcat atgtctcatc ctctgaaaat ttattataat acacatgcaa acatataaaa 121 acctgatttg taactactgt aaccacctga tttccaaaat ttttcatatt cagtattact 181 ctgcatcaat aaatagagaa aaccttcagc acctaaaaaa gaaattattt ttgttaagca 241 tttgctggtt aaagttacat gtccataaca tcgctaatag gtgaggtaaa ccatttggat 301 gaatattcag gcttcctaca gaacctttat ttgtctggtg gaaagcgcat cgctatcgta 361 cttgctatcc gtcccggtgt ctctggttgg atgaggcgaa ccttacacca gctggtatac 421 tgcaccgtag ggcctgtccc attagactct ttgatttgcc agtttggccc tggggagtgc 481 tcaaaatcct cagggacttc gttaccgtgt ggttttagaa cgtcacctgc cggagatagc 541 ggcggaaaaa tagcgtgaca cacggggcca ggagtgggat ataatttcag actaatttcg 601 aataatgttg aaaagttgct gttaaccatt gttcgtaaaa gcaaaaaata agcaatgact 661 ataaggatta aagctatggg ttttctttcc ggtaagcgca ttctggtcac tggcgtggcc 721 agtaaactgt ccatcgccta cggtatcgca caggcgatgc accgcgaagg agctgaactg 781 gcgttcacct accagaacga caaactgaaa ggccgcgtag aagaatttgc tgcccagctg 841 ggttccagca tcgttttacc gtgtgatgtc gctgaagatg ccagcattga cgctatgttt 901 gcggagctgg gtaacgtttg gccgaaattt gacggttttg tacactcgat cggtttcgcg 961 cctggcgatc agctggacgg cgattatgtt aatgcggtta cccgcgaagg ctttaaagtc 1021 gctcacgata ttagctctta cagctttgtt gccatggcga aagcctgccg caccatgctg 1081 aatccgggtt ctgcgctgtt gaccttgtct tatctgggag cggagcgcgc catcccgaac 1141 tacaatgtca tgggtctggc gaaagcctct ctggaagcca acgtgcgtta tatggcaaac 1201 gctatgggac cggaaggggt tcgcgtcaac gccatctctg cgggtccaat ccgcacgctg 1261 gcggcgtccg gtattaaaga cttccgtaaa atgctggcgc attgtgaagc cgttacgcca 1321 attcgtcgca ccgtgaccat tgaagatgtg ggtaactcag cggcgtttct gtgttccgat 1381 ctctcggcag gcatttctgg cgaagtcgtc catgttgacg gcgggttcag cattgcggca 1441 atgaatgaac tggaactgaa gtaagcgcat taagtttatt agcgggcggc acggctgtgc 1501 tgccccattc ccctcactca ccactcatct cgttatatag cgttctgata tttgttatca 1561 cctaacaatc tttcatccct ttcccgcata cgccaggata atccagcata aacgatccga 1621 cgccgaatca ttatcgatcg ccatttcatc ctcggg //
GenBank-Updates@genbank.bio.net (04/13/90)
LOCUS DROEMCA1 934 bp ds-DNA INV 07-MAR-1990 DEFINITION D.melanogaster extramacrochaetae protein gene, exon 1. ACCESSION M32636 KEYWORDS extramacrochaetae protein; helix-loop-helix protein; regulatory protein. SEGMENT 1 of 2 SOURCE D.melanogaster 8-12 hour embryo, cDNA to mRNA, clones pNB5B, and DNA, clones pP7, lambda-B[11,16]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 934) AUTHORS Ellis,H.M., Spann,D.R. and Posakony,J.W. TITLE Extramacrochaetae, a negative regulator of sensory organ development in Drosophila, defines a new class of helix-loop-helix proteins JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.W.Posakony, 06-MAR-1990. FEATURES from to/span description pept 347 + 910 extramacrochaetae protein, exon 1 pre-msg 85 > 934 emc mRNA and introns (minor alt.) IVS 911 > 934 emc intron BASE COUNT 270 a 273 c 244 g 147 t ORIGIN Chromosome 3, left arm 61D1,2. 1 cgaacccgag agcgaaagac tccgctatcg gattcggaaa tgccgttcgg ggggtataaa 61 agcgggcgct ctgagcgaga caccactcat tcgttgtctg accgctcgca gcaaaagata 121 tagaaatcga gaacgtaaac gcagaaaaca cgctggatta aagtgtttcg tttcgaacat 181 cgacagagtg agaaaacaaa gtgaagtgag cgcacaaagc cgaaatcaaa tcaaattgaa 241 aagccagtca agagcaatct ttaactgaac ctcgcaagta atccggtacg accgtgtaac 301 aaccgccaag attctactca aggagcgtaa agcaatcatc tccagaatga agtccctgac 361 ggccgtctgc cagacaggtg cctccggaat gccggccttg aatgccagcg ggcgcatcca 421 gcgccatccc acgcaccgcg gcgacgggga gaacgccgag atgaagatgt atctgtccaa 481 actgaaggac ctcgttccgt tcatgcccaa gaacaggaag ctcaccaagc tggagatcat 541 ccagcacgtc atcgactaca tctgcgacct gcagaccgag ctggagacgc accccgagat 601 gggcaacttc gatgcggcag ccgctctgac ggcggtgaac ggactccacg aggacgagga 661 cagcgacatg gaggatgcgg atgccgaggc agaagcggaa gtcgatccag atatcctcgc 721 ccagcgcctg aatgccgagc agccggcgaa agtctctagt cccgccgccc gtctcccgct 781 taccgatcgc caaacgccca acactcttgt ggcgcccgcc catccgcagc agcatcagca 841 gcagcagcaa ctgcaactgc agcagcaaca actgcaatca cagcagcaac tgtccaacag 901 tttagcaacg gtgagttaac aaacaaattg ctag // LOCUS DROEMCA2 1307 bp ds-DNA INV 07-MAR-1990 DEFINITION D.melanogaster extramacrochaetae protein gene, exon 2. ACCESSION M32637 KEYWORDS extramacrochaetae protein; helix-loop-helix protein; regulatory protein. SEGMENT 2 of 2 SOURCE D.melanogaster 8-12 hour embryo, cDNA to mRNA, clones pNB5B, and DNA, clones pP7, lambda-B[11,16]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1307) AUTHORS Ellis,H.M., Spann,D.R. and Posakony,J.W. TITLE Extramacrochaetae, a negative regulator of sensory organ development in Drosophila, defines a new class of helix-loop-helix proteins JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.W.Posakony, 06-MAR-1990. FEATURES from to/span description pept + 31 66 extramacrochaetae protein (emc), exon 1 pre-msg < 1 1140 emc mRNA and introns IVS < 1 30 emc intron BASE COUNT 499 a 244 c 193 g 371 t ORIGIN Chromosome 3, left arm 61D1,2. 1 taatttgaat cttccgcttc tctcttttag ccacagaatg cggagaaaga cagcagacag 61 tcgtaagcgt ggaaacaccc agccgggaaa acccacggaa cctataagcc tattaactaa 121 atgcataatg aaagtacagt caaaactcca ctcgcaacct tcgtgtgaag atctccgcat 181 aagataacgt gtgaaactct caagaaaacc acagcccagg cgcatctctt caacgctcct 241 tttcccggga aagggaaaac ccgggaaaac gagacctcgc ttttggctcc aacagagttt 301 agtttaaaca tttaataaca agaatacaac aacacaaaac ctagcattag agcagcgtaa 361 gcaagtacac attatgatta ttaattaaac attacaacta ggatgatggc gcttaagaat 421 gaacggaaga aagaaaactt ggcaaagctt taaacaattt aattttcctt taacgcttga 481 aattacactg aagcatatat ttttgttttc ctaattttag ttttgttttg gttaaaacaa 541 cgcaagaaaa aatgttacta atgcaaagca aacagagatt agcaataaaa aaaaattata 601 aaaatgcaca ggcaccaaac tggaattacg ttgaatcacc gtttcctgca ttacgattat 661 atgatttcaa tccaaatgat tttcctatta attttaatca atgaagagtt ttcctattat 721 tgattagcca cagccaataa acaacaagaa caaccaactc agcagcacac acacattcaa 781 attcattaat ttatatcgag agagtacgtt tagagtttaa gttcgagaaa gttcatttcc 841 aggcactaat tttcactcac accactcaca tagaaaaaca caaacacctc attttgtagt 901 tttccgagtc ccttggatcg ttttcctctt ttttaatcga aataacgcat aatgcataat 961 aattattacc agatatacac taaaacctat aaatatatat taatatatgt tctgtgatct 1021 gtagcatact ttgtacaaag tgtttttttt agcataatta tgcgatttta attatgatcc 1081 gcaatcgaaa ttagagattt ttgttctcat ttcgatggta aaaacactct atcatgatgt 1141 aaaaaaaaaa aacaaaagaa aacatttgca tgatgttgcg aaaaaatata ttatctaaaa 1201 cataaagaaa aacaaacact agcctcttaa taaattgcaa aaatttaaaa aacattaaaa 1261 atcgaatatg tcaatggatt tccgttggga gcaggggatt aagaaac // LOCUS DROFMRFA1 1328 bp ds-DNA INV 08-MAR-1990 DEFINITION D.melanogaster FMRFamide neuropeptide gene, 5' flank, exon 1. ACCESSION M32640 J05415 KEYWORDS FMRFamide neuropeptide. SEGMENT 1 of 2 SOURCE D.melanogaster (strain Canton S) DNA, (library of T.Maniatis). ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1328) AUTHORS Schneider,L.E. and Taghert,P.H. TITLE Organization and expression of the Drosophila FMRFamide neuropeptide gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.E.Schneider, 07-MAR-1990. FEATURES from to/span description pre-msg 1139 > 1328 FMRFamide neuropeptide mRNA and intron IVS 1248 > 1328 FMRFamide neuropeptide intron signal 1066 1071 CAAT box signal 1109 1114 TATA box site 40 51 conserved sequence 1 site 130 140 conserved sequence 2 site 309 326 conserved sequence 3 site 374 388 conserved sequence 4 site 394 414 conserved sequence 5 site 507 545 conserved sequence 6 site 574 602 conserved sequence 7 site 637 649 conserved sequence 8 site 671 721 conserved sequence 9 site 746 766 conserved sequence 10 site 860 872 conserved sequence 11 site 894 909 conserved sequence 12 site 1015 1027 conserved sequence 13 site 1028 1041 conserved sequence 14 site 1053 1061 conserved sequence 15 site 1063 1076 conserved sequence 16 site 1086 1095 conserved sequence 17 site 1108 1175 conserved sequence 18 BASE COUNT 340 a 331 c 319 g 338 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttctgg ggtgagtcca tagattacag agcagcgcag ttttcttgac cagatgatgg 61 tctagctgaa ctatttttaa ctgggcgatt ggcacacgac ttggcctggt catcaaactc 121 aagccaagat ttgaacttga actagtctcg ccttatcaca ttgccttcta attcgttttc 181 attttcgatt ttcgcatcta cttatgctaa tgaagccatc tgcagacgtg gttttcgaac 241 gtatttatat tgattatggg tgatcgtcaa caagagcagt ggacacccaa taaacctgtc 301 caaaacccga cacatttctg cccagtcatg cgtggtggac aatagccaaa tgccattgat 361 gagactcgtc tccaaaactt tggccttttg ccgggccgta attacagact tccgtctttt 421 gaacagtttt ttcagcccca cccaagagtc gagtcttgaa aagctggctg ggatggggtg 481 gtttcgggtg ctggacgaga tgccagaggc gccacaatgt atcctgttac aggttacagg 541 gccataaagc gccataaacg ccgcgacggc aatggcaaat tataacgcat acggacacgt 601 agtcgatcca ctggctagaa ggctaattgg acgtgcccgg ccaggatgtc cctgctcatc 661 ccacagtcga cttaaaagtg ggcgtaacat aacaattgtg ggcgatttgc atacaatttg 721 gtgctcgagt ggccacctaa ctggcgtcga aatgttgcta cgtcattgtt gctgcccctc 781 gtgcttcttg gctaataagt cagcatgaat gcggtctgtg ttttggcctt ttgtcccgct 841 gacaaccgct gacaagcaaa acaaacatat cagcggctga ccagcgacca tttgccgaca 901 ttgtcaaggt ttttttttca gccccgaata cacctgctaa tctgtggtgg tgactagcgc 961 gctcggaagg cagaaaaaac tgcagcgaaa atcagttcca ttgaatgtcc tctggcaaca 1021 tgttgccgca gcacttgttg cgcgcgtcca ctcgcagagg cacctcaaca tcagcctcct 1081 tgggcgccag cacaaaggac gcaacgcata taaaacagga tgtttcgcca aacatcctca 1141 cagaactgtt ccaccttcga gcgggcaaca agtgtgtgtg cggcccaaaa ggatccccag 1201 accttcgaat tcactctagt ttcctagtaa ggggacaggt ttcagaggtg agtgccatgc 1261 gaaaagtgtg aaaacagtgc tgagccttaa gcagtcgagt tattctttgg aaaagccaat 1321 agccaatt // LOCUS DROFMRFA2 1603 bp ds-DNA INV 08-MAR-1990 DEFINITION D.melanogaster FMRFamide neuropeptide gene, exon 2. ACCESSION M32641 J05415 KEYWORDS FMRFamide neuropeptide. SEGMENT 2 of 2 SOURCE D.melanogaster (strain Canton S) DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1603) AUTHORS Schneider,L.E. and Taghert,P.H. TITLE Organization and expression of the Drosophila FMRFamide neuropeptide gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.E.Schneider, 07-MAR-1990. FEATURES from to/span description pept 220 1263 FMRFamide neuropeptide precursor sigp 220 291 FMRFamide neuropeptide signal peptide matp 527 570 FMRFamide neuropeptide A matp 571 663 FMRFamide neuropeptide B matp 664 696 FMRFamide neuropeptide C matp 697 729 FMRFamide neuropeptide D matp 730 762 FMRFamide neuropeptide E matp 763 795 FMRFamide neuropeptide F matp 796 828 FMRFamide neuropeptide G matp 829 861 FMRFamide neuropeptide H matp 862 894 FMRFamide neuropeptide I matp 895 921 FMRFamide neuropeptide J matp 922 954 FMRFamide neuropeptide K matp 976 1002 FMRFamide neuropeptide L matp 1003 1035 FMRFamide neuropeptide M matp 1036 1065 FMRFamide neuropeptide N pre-msg < 1 1538 FMRFamide neuropeptide mRNA and introns IVS < 1 219 FMRFamide neuropeptide intron signal 1404 1408 poly-A signal signal 1482 1486 poly-A signal signal 1507 1511 poly-A signal BASE COUNT 409 a 398 c 431 g 365 t ORIGIN Undetermined number of bp after segment 1. 1 gcggctggtt aacttttaac gcgccggccc aacagatctg gccataaata tcatgatggt 61 taattagcag gacactttgt ctgcgcgtcc tggctgccgg tggctactgt taccctaagt 121 aaaacgaacg gtcttaatgt gattaaaagt tgcaatagat aacgcgagat gccgccgcct 181 cataattatc ctgccgtcac tccattctta tccttgcaga tgggcattgc cttgatgttc 241 ctgctggccc tgtaccagat gcagtcggcc atccacagcg agatcatcga tacgcccaac 301 tatgcgggca actcgttgca ggacgctgac tccgaggtga gtccatcgca ggacaatgac 361 ctggtagatg cactgctcgg caacgatcag accgagaggg cggagctgga gttccggcac 421 cccatctctg tgattggcat cgactactcg aagaacgccg tggtgctgca cttccagaaa 481 cacggccgga aaccgcgcta caagtacgat cccgagctgg aggccaagcg aaggtccgtg 541 caggacaact tcatgcactt cggcaagagg caggcggagc agctgccacc ggagggcagc 601 tatgctgaat ccgatgaact ggagggcatg gccaagcgag cagctatgga tcggtatggc 661 agagatccca agcaggactt catgcggttt ggtcgggatc cgaaacagga cttcatgagg 721 tttggcaggg atccaaagca ggacttcatg agattcggtc gggatcccaa gcaggatttc 781 atgagattcg gtcgagatcc caagcaggat ttcatgaggt ttggacgcac tccggctgag 841 gatttcatga ggttcggacg cactccggcg gaggacttca tgaggttcgg acgctccgac 901 aatttcatgc gcttcggacg cagtccccac gaggagcttc gcagtcccaa acaggatttc 961 atgcgattcg gtcgcccgga caacttcatg cgcttcgggc gttccgctcc gcaggatttt 1021 gtgcgctccg ggaagatgga ctcaaacttc attcgattcg gtaagagctt gaagccggcg 1081 gctcccgagt ccaagccagt caagtccaat caaggcaacc caggcgaacg cagtccagtg 1141 gacaaggcca tgacggagct gttcaagaaa caggagctgc aggatcagca ggtgaagaac 1201 ggcgcacagg cgaccaccac gcaggatggg agtgtggaac aggaccagtt cttcggccag 1261 tgaggtagtc ctgcgggacg cctccttgta aatagatatg gacaaatgta cgcaaggatc 1321 taaattgata tacgtatata acccactcct cacacgaact cctgacttat gcctgaacta 1381 tgaattttta atgaatgggc tggattaaaa attcaccgtg ctttgaagtt cttatctata 1441 aatatatcta gtgtaatatt gaagaaattg aaattggcgt gaataaaatc ctgtggcaac 1501 attttaaata aagattgctt tactgtaaat tatggcgcac tcgaatagcc ttatttcatt 1561 cggtcgaggt acatattgta tagccattat tagtttctcc act // LOCUS DRVFMRFRN1 3189 bp ds-DNA INV 08-MAR-1990 DEFINITION D.virilis FMRFamide-related neuropeptide gene, 5' flank, exon 1. ACCESSION M32642 KEYWORDS FMRFamide-related neuropeptide; neuropeptide; secretory granule protein. SEGMENT 1 of 2 SOURCE D.virilis DNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 3189) AUTHORS Taghert,P.H. and Schneider,L.E. TITLE Inter-specific comparison of a Drosophila gene encoding FMRF amide-related neuropeptides JOURNAL J. Neurosci. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.H.Taghert 07-MAR-1990. FEATURES from to/span description pre-msg 2890 > 3189 FMRFamide-related neuropeptide mRNA and introns IVS 3059 > 3189 FMRFamide-related neuropeptide intron A site 1211 1222 conserved sequence 1 site 1225 1236 conserved sequence 2 site 1300 1416 conserved sequence 3 site 1491 1504 conserved sequence 4 site 1513 1533 conserved sequence 5 site 1636 1668 conserved sequence 6 site 1695 1724 conserved sequence 7 site 1426 1438 conserved sequence 8 site 1814 1864 conserved sequence 9 site 1888 1908 conserved sequence 10 site 2116 2128 conserved sequence 11 site 2144 2159 conserved sequence 12 site 2635 2646 conserved sequence 13 site 2674 2688 conserved sequence 14 site 2754 2853 conserved sequence 15 site 2691 2704 conserved sequence 16 site 2041 2050 conserved sequence 17 site 2856 2933 conserved sequence 18 BASE COUNT 1034 a 630 c 656 g 869 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttctat cagtaaacaa caacaactac cgttattgct caatacctgt agttgttgtc 61 gctgctgctg atttcccatg cagcttagaa catgacgcaa ctgaacagca gcagcagcag 121 cagccgccat cgcacaaata tgttcgttta ccgttttgaa tttatgttta tagcagcagc 181 agcacatgtt gttacgtctt gcaattgttt atggcataaa tctttatagc atacacacac 241 acatacatgc atgtgtacat gcataaattc cgtgtgtgcc cattaattag agagaattgg 301 ccatggataa gctggtcggt tacttattag gttaccacct ggtggactcg aaaacaaata 361 aaactcggca cataaacaaa aaaactaaaa attaagacga aaattcaaca atggcattga 421 catcaaacac attgcctgcg ctttcaaacg gcccatgaca acaagagcta actaactaaa 481 tcgtaaaaaa agagctagta aaatccaatt tttttaacca tctaaaatcg taaagcctat 541 caaaacacgg cttttggcat ttggagcaaa cggttgtcaa gcggagcccc attgagctaa 601 tcaggttctg ctgttgctca ttgacggaca ccaattttat ttatctattt ttattgactc 661 ttctagatgt attccatatc cattgttgag ctgtgatatg cttaaaaaag taacgattaa 721 ctggaaaata ctctttagtg aaatcaacgt aataaaatac ttattggtat ttataataga 781 acaaatgcaa atacaattat atgatcggaa ataaaatttt tttaagtaat gcatttgaaa 841 agaaaagaaa aataaatata tttacaatta ggtttctttt attttggtaa cggatgcata 901 gcctagaaga ggtatatatc gaaaaatatg tctatattta attgtctaat ctgtcaaaaa 961 tgcaaactga aagattatct aaatattagt tgagcagtaa tggattctag tttatttaac 1021 tgatttccaa ttacctaaaa gtaaagtaag aaagaagcag ttgatatatt taaaagattg 1081 cagggcatcg aaaattcgtt gaagtgaact taaagcacgc ccaatcttag tttccgatat 1141 atataattta agttatttat ttcgtatgtg ttactcactg atggaacagc gcgacggagt 1201 ccacgctctc ggtcaagaaa acgctttgaa cttgagcgtt gttagcgtta ggtatttatg 1261 caaatggatt cagcatcatg catatttatc gatgcaaatg tctggctatt gattttcgac 1321 gatcgttaga gaacaaaaaa aaaaaaaaga agctaaaatg aaataagaat tgaagcgaac 1381 tgccagacac acaatggttg acacatttct tgcaagtcat ggtccacatc ttggccggac 1441 gctgggcggg ttcttcaaat tcagacacgc aaaaaacggg ttgaaaacaa aaaacttggc 1501 ctttcaaagt gtggccgtaa ttacagactt ccgcttacga aaccgaaagt gtccgcccag 1561 ccaaatggcc aagaagtatt ggccgaagat ggggttgggg ctgggcgcag tcgggcagac 1621 gttttgggct gctaaaggcg ccacaatgta tcccgttaca ggggccataa aactccataa 1681 gcgactggcg acaaggccaa attataccgc atacgcacac gtagccatgg tctaagttgt 1741 ggacccggct aattggacgc gcaccagggg tatggtcagg gggtggtggg gttgtttttg 1801 gattctctat aaacttaaaa gtgggcggca cataacaatt gtgggcgatt tgcatacaat 1861 ttggcgcttg gattaccagc aaagcgcgtc aaaatgttgc tgcgtcatcg cgggccaaaa 1921 accagatgac tttgccatgt gctaaatata aacacatgca catatatata tatatatatg 1981 tgtatatata atatatatag tatatatgca catgcaagcg gctaataaag atattgtatg 2041 gccagcacaa tgcagctgtc tgcgggccat tgtttgtctg acagtccgtc tgacagtttt 2101 tgtctgtcca tcacaaacaa acatatcaca gccaaaatgg caggccgaca ttgtcaaggc 2161 tggcttctgt tcaaagcttg cagtttaacg ataaatgaat tgcacaaaga caccaaaaaa 2221 aaaccaacta tatatgctcc agttgcaagt gcttgactag cagataccct gtagccagcg 2281 acaagctgcc ttgatttgag ttctttgcaa agaaatagtt gatggtcctg ataaataatg 2341 gggctccaat tggattactt taagcctcct tggagcctgg acctgtccaa agccttctcc 2401 gttgtgagag tcgaaaaacc caagagcaag caagtttcta gctcacgaaa cttgcgataa 2461 agaccctctg gtcgggcctc gtagcagcag gagtatttac cagacaggac tcggttgttg 2521 atgctgatca ggaatatata tatatatata tacttaatgg gcagagcttg ctgacacttt 2581 gtatcctctt tcacccgttt acaatgggct tagggtataa agagagaagt cgctgcaact 2641 gttgccagct aacatgttgc ataaacatgc cgcagcaaca cttgttgctg cctcaacaac 2701 acccacacac acacacacac atacacacac acacacacag ctgcacagca acacgcagag 2761 gctttggact gccgcctgcc tattatcctg aacaaaggac accgtcggct gtcggccgtc 2821 ggccggttgt acgttgtcgg ccgccgtcta acggtatata aaaccggatg tttcgccaaa 2881 catttgcaca gaagtgtttc acattgccac ggtcaacaag tgtcctgacc aaaatcaggt 2941 agaagaagaa ccacacgagt ggccagtgaa aagtgtttta aaatttccca tttcccaacg 3001 gaatgtcgat ttgattgcat tgcaatattc gtataatttt aaatatttga aaaaaaaggt 3061 gagtttttca aaattttgaa gccctgagca aagtgatttg agtgaaatac taaagcttag 3121 ttaatgggtg gaagaaagct taaatagtct taaaggcttt aaggcttaaa gaaaaagttt 3181 gaatctaga // LOCUS DRVFMRFRN2 1803 bp ds-DNA INV 08-MAR-1990 DEFINITION D.virilis FMRFamide-related neuropeptide gene, exon 2. ACCESSION M32643 KEYWORDS FMRFamide-related neuropeptide; neuropeptide; secretory granule protein. SOURCE D.virilis DNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1803) AUTHORS Taghert,P.H. and Schneider,L.E. TITLE Inter-specific comparison of a Drosophila gene encoding FMRFamide-related neuropeptides JOURNAL J. Neurosci. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.H.Taghert 07-MAR-1990. FEATURES from to/span description pept 551 1570 FMRFamide-related neuropeptide precursor pre-msg < 1 > 1570 FMRFamide-related neuropeptide mRNA and intron IVS < 1 539 FMRFamide-related neuropeptide intron A BASE COUNT 471 a 438 c 459 g 435 t ORIGIN 1 tgacggagtg cgaacgttca tatacatcag ctccggcgcc cagcccctgc tcctggcgta 61 gaccatcagc ttagcgcctt taacaagttg tattaaggtt gttgatggcc gtgtgtgtgt 121 gtgtgtgtgt gtgacgggga ggggggcaac tcatgttgct gttgttgctg caggtcgatt 181 gttgttgctg ttgctgttgt tgctgccttt aatgcgcttg gttaactttt tacgcgcaca 241 ttcatatttc gctcttggcc aggcggatta tcgcataaat attgttatgg ttaattagcg 301 ggacactttg tctggccggg caaacaaaag gcgttaagac gctgccaccg gagtagcctg 361 ctgctgcctg gtttttgctg ctgttgccag ttaagtaaaa gtgaatgcgt tcattgcttt 421 atccatgggg gcgttaatgt gattaaaact tgttatagat aatgcggcaa gtgatttagt 481 caacacctca taattatgtt caatttctcc ctctctccct ccctttcact ccatctcagc 541 accttcaatc atgggcatag cactgatgtt cctgctggcc ctctaccaga tgcagtcggc 601 catccatagc gagattattg agacgccatc cagctataat gataactcgc tgctggaggc 661 agctgcggag gagcccaatt cgagagccac cgcgtctgag tcggatctgc tggatggact 721 gatgagcacc gataatccca atccggagca gcaaaccgaa ctggagtttc gctatccgat 781 ctcggccatt ggcattggtt atgccaagaa ttcggtggtg ttgcgtttcc agaagcacgc 841 gcgcaaacag aacttcaagt acgatccgga ctacgagatg aagcgcaagt cgctacagga 901 caatttcatg cactttggca agcggcaggc ggagcagctg cctcaggcaa ctggacccgg 961 ctactatgag tgcatcaagc gctccgccat ggatcgctac ggacgcgatc ccaagcagga 1021 ctttatgcgt ttcgggcgtg ctccgccttc ggactttatg cgcttcggac gtgcaccctc 1081 ggactttatg cgtttcggac gcgatcccag ccaggatttc atgcgcttcg ggcgctccga 1141 taactttatg cgctttggca ggaacctgaa ctttcacgag gagctgcgca gtcccaaaca 1201 ggatttcatg cgtttcggtc gcccggacaa tttcatgcgc ttcggccgat ccgcgcccac 1261 agaattcgag cgtaatggca aaatggactc gaacttcatg cgctttggca agagatcagg 1321 tgtaatggcc aaattgacca aatcccagct gcagcaaaac aaattgacaa ctgccgatgg 1381 caagcagcag ccagccgagg agggcaatcc cacggacaag gccatctcga tgctattcaa 1441 taagcaccag cagcagcagc agcagcagca ggggcagagg ctgcagcagg aagaccgaca 1501 gcagatgaag agcagcgcag agcagaacaa cttggaggaa gccagcgtgg agcagttcta 1561 tgagccatag attctaccgc gcacagcaca ctgtaaatat aaattggcca aaaaaaccga 1621 acactatgat aattgtactt aacactatgt atgtatgtac ttaaacgcat tatagacatt 1681 gcagaagaaa tttaaaaaag gaaaaaaaaa aacgtaattt gaattttaat ggggctggat 1741 taaaaattca ccacgctttg atctgatcat aagaaaacaa aagaaaaaaa agtagaaaac 1801 tct // LOCUS ECACELS 1224 bp ds-DNA BCT 06-MAR-1990 DEFINITION E.carotovora beta(1,4)-glucan glucanohydrolase (celS) gene, complete cds. ACCESSION M32399 KEYWORDS beta(1,4)-glucan glucanohydrolase; endoglucanase. SOURCE E.carotovora carotovora (strain SCC3193) DNA, clone pHSK25. ORGANISM Erwinia carotovora Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1224) AUTHORS Saarilahti,H.T., Henrissat,B. and Palva,E.T. TITLE CelS: A novel endoglucanase identified from Erwinia carotovora subsp carotovora JOURNAL Gene (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.T.Saarilahti, 02-MAR-1990. FEATURES from to/span description pept 208 1002 endoglucanase (celS) precursor sigp 208 303 endoglucanase signal peptide matp 304 999 endoglucanase binding 196 200 ribosome binding site BASE COUNT 353 a 278 c 253 g 340 t ORIGIN 1 taaaattaac accctgattt tatttcacat tattcatatt tattaagact ttaatcattt 61 ccccatgaat tggattattc cgacagagtg gaaataacat aaccagttaa actaaccaca 121 tctcagactt gatcatagtc acgtgataac gatgtgttaa ctttcagatt aaaacacctc 181 aagaatatat ttaaaaagag atttattatg cagactgtga atacccagcc acaccgtatt 241 ttccgcgtat tattgcccgc agtattttct tccttgctac tctcatcact aacagtgtct 301 gctgccagtt catcaaacga tgcggataaa ctctatttcg gtaataataa atattaccta 361 ttcaataacg tatggggaaa agatgaaata aaaggatggc agcaaacgat tttctataat 421 agcccaatta gcatggggtg gaactggcac tggccaagca gtacccacag tgttaaagcc 481 tatccttcac tggtgagcgg ttggcactgg acggcgggct atacggaaaa tagcggattg 541 ccgatacaat tatccagtaa taagagtatc accagtaacg ttacttattc tatcaaggct 601 acgggtactt acaacgccgc ttacgacatt tggttccaca ctaccgataa agccaactgg 661 gactcgtcgc ctaccgacga attgatgatt tggctaaatg ataccaatgc aggccctgcg 721 ggtgattata ttgaaaccgt tttcctcggc gacagcagtt ggaacgtatt caaaggctgg 781 atcaacgcgg ataatggtgg aggttggaat gtcttttcct ttgtccatac ttccggcacc 841 aacagtgcat cactcaatat tcgtcacttc accgactatt tggtgcagac aaagcaatgg 901 atgagcgatg aaaaatatat cagcagcgtt gagtttggca ccgagatctt tggcggcgat 961 gggcagatcg acatcaccga atggcgcgta gacgtaaaat agcgcgtctg gcgttgctat 1021 tcctccccgt tcagccgcaa taaacacgtt aacgtagagc tgaaacggat tggtctcatc 1081 gacgttctct tttcattttc agttcagaca gacgttgccc gttctttagg cgggttcgct 1141 tccccccaca acaaaattcg tgagctgtgg cacgtaaatt tcattagcgt tatgataatc 1201 ccctcggcta atcaggaatg tccc // LOCUS ECODCM 2472 bp ds-DNA BCT 26-FEB-1990 DEFINITION E.coli DNA cytosine methylase (dcm) and patch repair protein genes, complete cds. ACCESSION M32307 KEYWORDS DNA cytosine methylase; patch repair protein. SOURCE E.coli (K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2472) AUTHORS Sohail,A., Lieb,M., Dar,M. and Bhagwat,A.S. TITLE Gene required for very short patch repair in Escherichia coli partly overlaps the DNA cytosine methylase gene JOURNAL Unpublished (1990)Wayne St. Univ. Detroit, MI 48202 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.S.Bhagwat, 23-FEB-1990. FEATURES from to/span description pept 250 1668 DNA cytosine methylase (dcm) pept 1649 2119 pot. patch repair protein BASE COUNT 606 a 620 c 695 g 550 t 1 others ORIGIN 43 min on K12 map. 1 gatcatttcc agactaagtt gctgaacgta ccgcaaacca tgcaaaccgc aagaggcaag 61 cacgtggcgc agcacaacgc gcatttttta gtcgagttta tggcgaagct cagtgccgaa 121 ctggcggggg agaatgaagg tgtcgatcac aaggtgatag atgcgttttc atccgctggc 181 tgagcgcgtg gccctaaatg gctgtaatta tgttaacctg tcggccatct cagatggccg 241 gtgaaatcta tgcaggaaaa tatatcagta accgattcat acagcaccgg gaatgccgca 301 caggcaatgc tggagaaact gctgcaaatt tatgatgtta aaacgttggt ggcgcagctt 361 aatggtgtag gtgagaatca ctggagcgcg gcaattttaa aacgtgcgct ggcgaatgac 421 tcggcatggc accgtttaag tgagaaagag ttcgcccatc tgcaaacgtt attacccaaa 481 ccaccggcac atcatccgca ttatgcgttt cgctttatcg atctattcgc cggaattggc 541 ggcatccgtc gcggttttga atcgattggc ggacagtgcg tgtttaccag cgaatggaac 601 aaacatgcgg tacgcactta taaagccaac cattattgcg atccggcgac gcatcatttt 661 aatgaagata tccgcgacat caccctcagc cataaagaag gcgtgagtga tgaggcggcg 721 gcggaacata ttcgtcaaca cattcctgaa cacgatgttt tactggccgg tttcccttgt 781 cagccatttt cgctggctgg cgtatcgaaa aagaactcgc tcgggcgggc gcacggtttt 841 gcctgcgata cccagggcac gctgtttttt gatgtggtac gcattatcga cgcgcgtcgt 901 ccggcgatgt ttgtgctcga aaacgtcaaa aacctgaaaa gtcacgacca gggtaaaacg 961 ttccgcatca tcatgcagac gctggacgaa ctgggctatg acgtggctga tgcagaagat 1021 aatgggccag acgatccgaa aatcatcgac ggcaaacatt ttctgccgca gcaccgtgaa 1081 cgcatcgtgc tggtgggttt tcgtcgcgat ctgaatctga aagccgattt taccctgcgt 1141 gatatcagcg aatgtttccc tgcgcagcga gtgacgctgg cgcagctgtt ggacccgatg 1201 gtcgaggcga aatatatcct gacgccggtg ctgtggaagt acctctatcg atatgcgaaa 1261 aaacatcagg cgcgcggtaa cggcttcggt tatggaatgg tttatccgaa caatccgcaa 1321 agcgtcacgc gtacgctgtc tgcgcgttat tacaaagatg gcgcggaaat tttaatcgat 1381 cgcggctggg atatggccac gggtgagaaa gactttgacg atccgctgaa tcagcaacat 1441 cgtccacgtc ggttaacgcc tcgggaatgc gcgcgcttaa tgggttttga agcgccggga 1501 gaagcgaaat tccgtattcc ggtttcggac actcaggcct atcgccagtt cggtaactcg 1561 gtggtcgtgc cggtctttgc cgcggtggca aaactgcttg agccaaaaat caaacaggcg 1621 gtggcgttgc gtcagcaaga ggcacaacat ggccgacgtt cacgataagg ccactcgcag 1681 caaaaatatg cgcgcgattg ccacgcgtga tacggcgata gagaagcgcc tcgccagtct 1741 gttaaccggg cagggcctgg catttcgcgt tcaggacgcc agtctgcccg gacgtccgga 1801 ttttgtcgtt gatgaatatc gctgcgtgat atttacccat ggctgcttct ggcatcatca 1861 tcactgctat ctgtttaaag tgcctgcgac tcgaaccgag ttctggctgg agaagatagg 1921 taaaaatgtt gagcgcgatc gccgcgatat cagtcgcttg caggaactcg gctggcgcgt 1981 attgattgtc tgggagtgcg cgttacgtgg gcgcgagaag ctgacggatg aagcgcttac 2041 cgagcgtctg gaagagtgga tctgcggcga aggtgccagc gcgcagatcg acacgcaggg 2101 gattcattta ctcgcttgat gcgtcctgaa taactggcgc aactacgggt tttgccggga 2161 agagatattt tcccaacgtg accagtacca ccgcgaagac aattacgccg agcgccagcc 2221 attcaatctt cgacagtgtt tctccaccca gtcccgtacc cagcaagacc gcgaccaccg 2281 ggttaacgta agcgtagctg gtggcgagag ccggactgac attacggatt aaatacatat 2341 aagcgttgat ggcgataatc gaaccaaaca gcgccagata gccgaccgca aggaagcctg 2401 aaagggaagg gagcgccgtc agtttttnac ccgcaatcat cgacgcgatc attaacacca 2461 cgcctgccgc cg // LOCUS FCLPOLCYS 3865 bp ss-RNA VRL 06-MAR-1990 DEFINITION Feline calicivirus RNA-dependent RNA polymerase, cystein protease, and 2C protein,. ACCESSION M32296 KEYWORDS RNA polymerase; cysteine protease; nonstructural protein. SOURCE Feline calicivirus (isolate CFI/68 FIV), cDNA to viral RNA, clones pCV[2,8]. ORGANISM Feline calicivirus Viridae; ss-RNA nonenveloped viruses; Caliciviridae. REFERENCE 1 (bases 1 to 3865) AUTHORS Neill,J.D. TITLE Nucleotide sequence of a region of the feline calicivirus genome which encodes picornavirus-like RNA-dependent RNA polymerase, cysteine protease and 2C polypeptides JOURNAL Unpublished (1990)Pioneer Hi-Bred International, Johnston, IA 50131 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.D.Neill, 23-FEB-1990. FEATURES from to/span description pept < 1 3865 polyprotein (AA at 2) matp < 1 451 2C-like protein matp 1802 2242 cysteine protease-like protein matp 2521 3762 RNA-dependent RNA polymerase-like protein BASE COUNT 1057 a 865 c 894 g 1049 t ORIGIN 1 ggttcctgta tgttacattc ttactggacc acctggatgc gggaaaacaa ctgcagcgca 61 ggcattagcc aagaagttat ctgaccaaga accatcggtc atcaaccttg atgtagacca 121 ccatgatacg tacactggca atgaggtgtg cataattgat gaatttgatt cctctgataa 181 ggttgattat gcaaattttg ttattggaat ggttaactcg gcaccaatgg tactaaattg 241 tgatatgctg gaaaataagg gcaagttgtt cacctcaaag tacataataa tgacttccaa 301 ttctgaaaca cctgtcaaac catcctccaa acgcgctggc gcattttatc ggagagttac 361 ctatcatgat gttgctacct tggttgaatc gcacaagcgt gcacggcctg gtactgctgt 421 cccccgtagt tgctacaaga aaaacttttc acatctatcg cttgctaaac gaggcgctga 481 gtgctggtgc aaagagtatg tccttgatcc taagggattg caacaccaga gcacaaaggc 541 tcctccccca accttcctta atattgattc tttggcccaa acaatgaagc aagactttgc 601 tttaaagaac atggcatttg aagctgaggt aggatgtagt gaacatcggt atggatttgt 661 atgtcagcag agtgaagttg aaacagttcg ccgactcctt aatgccatta gaatgaggct 721 taacgctacg tttaccgttt gcgttggcct tgaagcttct aattccgttg ggtgtactgc 781 gcacgtttta acacctgacg aaccattcaa tggaaaaagg ttcgttgtgt cacgctgcaa 841 tgaggcatcg ctatccgcac tagaaggaaa ctgcgttcaa actgcattag gagtgtgcat 901 gtccaataag gacctcactc acctgtgtca cttcattaag gggaaaattg tcaatgatag 961 tgtcaggtta gatgaactac ccgccaatca acatgtggta accgttaatt cggtgtttga 1021 tttggcctgg gctcttcgtc gtcactctac actaactggt cagttccaag caatcagagc 1081 cgcatatgat gtgcttcatg tccctgacaa ggtccccgcc atgttgcggc actggatgga 1141 tgaaacctcc ttctctgatg aacacgtggt aacgcagttc attacacctg gtggagttgt 1201 aattcttgag tcgtgtggcg gtgcgcgcat ctgggcactt ggtaacaatg ttatcagggc 1261 tggaggcgtc actgccatac ccactggggg atgtgttagg ttaatgggac tctcagctca 1321 aacaatgcca tggtctgaaa ttctttcgga actcttctct ctgctaggga aaatctggtc 1381 tagtgttaaa gtctctaccc ttattctcac cgctctcagc atgtatgcat ctaggttcag 1441 gcctaaaaca gaagcaaagg gaaaaacaaa atccaaaatt ggtccatata gaggccgagg 1501 tgtggcactt accgatgatg agtacgacga atggaaagaa cacaatgccg ctaggaagtt 1561 ggacctctca gttgaagatt ttctaatgtt gcgtcaccgt gccgcactag gggcggatga 1621 cactgatgcc gtcaaattta ggtcatggtg gaattcaagg tctagattgg ctgatgactt 1681 tgaagacgtc accgtgattg gtaagggtgg cgtaaagcat gagaaaatta gaacaaatac 1741 cctcagagcc gttgaccgtg gctacgatgt cagctttgcc gaggagtcag ggcccggggc 1801 aaaattccac aaaaacgcaa ttgggtcagt cactgacgtc tgtggagaac acaagggcta 1861 ttgtgttcac atgggtcacg gggtctacgc ctcagtagca cacgtggtta agggtgactc 1921 tttctttctg ggtgaaagaa tctttgactt aaaaactaat ggtgagttct gttgcttccg 1981 tagcactaag atcttgccta gtgctgctcc tttcttttct ggaagaccaa cgcgtgaccc 2041 ttggggatct cctgtagcaa ctgactggaa accaaaacct tactcaacca catcggggaa 2101 aattgtaggg tgtttcgcaa caacatcaac tgaaacccac cctggcgact gtggcctccc 2161 gtacattgac gacaatggta gagtcactgg gttgcacaca ggctctgggg gacccaagac 2221 cccaagtgca aaattggttg ttccctacgt acatattgac atgaaaacta aatctgtcac 2281 tgctcaaaag tatgatgtca ctaaaccaga tattagttat aaaggtttag tttgtaaaca 2341 attggatgaa attagaatta tcccaaaagg taccaggcta cacgtatctc ctgcacattt 2401 agaagatttt gaggaatgct ctcaccagcc tgcctctcta ggtagtggtg atcctcggtg 2461 tcccaaatcg ctcactgcaa ttgttgttga ctctcttaag ccctactgtg tggtagttaa 2521 tgggcccccc catgatatct tacatagggt acaaaagatg ctaattgatc atctttctgg 2581 ttttgtcccc atgaacatat catctgatac ctccatgctt tctgctttcc acaaactcaa 2641 ccatgataca tcttgtgggc catatctagg tggtagaaag aaagatcaca tggtcaatgg 2701 agaacctgac aaggcgctat tagatttgtt gtcttcaaaa tggaagttgg caacacaggg 2761 aattgcactc ccacatgagt acacaattgg tttgaaagat gaactaaggc ccattgaaaa 2821 agtgcaagaa gggaaaagaa ggatgatctg gggctgtgat gtcggcgttg ccactgtatg 2881 tgcggctgca tttaaaggtg ttagtgacgc cattacggca aaccatcaat atggccctat 2941 acaggttggc attaacatgg acagtcccag tgttgaggca cttttccaaa ggatcaaaag 3001 cgcacgaaag gtgtttgctg ttgattactc aaagtgggat tcaacccaat ccccccgcgt 3061 tagtgccgct tcaattgaca tactgcggta cttttctgat cggactccaa ttgttgattc 3121 ggccacaaac acgcttaaaa gcccccccat tgctgtattt aatggagtag ctgtcaaggt 3181 gtcctctggt ctcccatctg gcatgcccct cacatctgta atcaactcgc tcaatcactg 3241 tttatatgtt ggctgtgcaa tcttgcaatc tttggaagct cgtaacgtgc ctgtcacttg 3301 gaatcttttc tcaactttcg acatgatgac ttatggtgat gatggtgttt acatgttccc 3361 cacaatgtat gcaagtatta gtgaccaaat ttttgccaat ctttctgcat atggtcttaa 3421 gcccacaaga gttgacaaat cagttggttc aattgaacca attgatccta attcagttgt 3481 cttcctaaaa agaacaatta cacgaacccc ccaaggaata cgaggcttgc ttgatcgcag 3541 ctcaattctc aggcaattct actacatcaa gggtgagaac acagataatt ggaaagagcc 3601 cccaaaaact atagatccaa tgtcaagagg gcaacaattg tggaatgcat gtctgtatgc 3661 aagtcagcat ggtattgact tttataataa ggtatacaaa ttggccgaaa aagcagtcga 3721 gtacgaagga ttgcatcttg agcccccttc ttacagcaca gccttggagc attacaacag 3781 ccaatttaat ggtgtggagg cgcggactga ccagatcgat acgagtggca tggccgccct 3841 tcactgtgat gtgttcgaag tttga // LOCUS HUMGALOPS 794 bp ds-DNA PRI 06-MAR-1990 DEFINITION Human 1,3-galactosyltransferase pseudogene, 3' end. ACCESSION J05421 KEYWORDS 1,3-galactosyltransferase; pseudogene. SOURCE Human peripheral blood leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 794) AUTHORS Larsen,R.D., Rivera-Marrero,C.A., Ernst,L.K., Cummings,R.D. and Lowe,J.B. TITLE Frameshift and nonsense mutation in a human genomic sequence homologous to a murine UDP-gal-beta-D-gal alpha-1,3 galactosyltransferase cDNA JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.B. Lowe. 02-MAR-1990. FEATURES from to/span description pept.ps / 63 755 pseudo-1,3-galactosyltransferase site 394 394 frameshift mutation site 475 475 frameshift mutation site 427 429 in-frame stop codon site 708 710 in-frame stop codon BASE COUNT 227 a 165 c 181 g 221 t ORIGIN 1 cagcttgtgg tttctttcag gaatcccaga ggataaatgt tttgcttttc ttctttgttt 61 cagatataat gatcattact tggaggagtt cataacatct gctaataggt acttcatggt 121 tggccacaaa gtcatatttt acatcatggt ggatgatgtc tccaagctgc cgtttataga 181 gctgggtcct ctgcattcct tcaaaatgtt tgaggtcaag ccagagaaga ggtggcaaga 241 catcagcatg atgcgtatga agatcactgg ggagcacatc ttggcccaca tccaacacga 301 ggtcgacttc ctcttctgca tggatgtgga ccaggtcttc caagaccatt ttggggtgga 361 gaccctaggc cagtcagtgg ctcagctaca ggctggcggt acaaggcaga tccctatgac 421 tttacctagg agaggtggaa agagtcagca ggatacattc catttggcca ggggattttt 481 attaccatgc agccatttct ggaggaacac ccattcaggt tctcaacatc acccaggagt 541 gctttaaggg aatcctcctg gacaagaaaa atgacataga agccaagtgg catgatgaaa 601 gccacctaaa caagtatttc cttctcaata aaccctctaa aatcttatcc ctaaaatact 661 gctgggatta tcatataggc ctgccttcag atattaaaac tgtcaagtga tcgtggcaga 721 caaaagagta taatttggtt agaaataatg tctgacttca aattgtgcca gtagatttct 781 gaatttaaga gaga // LOCUS MEFTGSRNA 3367 bp ds-DNA BCT 06-MAR-1990 DEFINITION M.fervidus 7S RNA, Ser-tRNA, 16S rRNA and Ala-tRNA genes. ACCESSION M32222 KEYWORDS 16SrRNA; 7S RNA; transfer RNA-Ala; transfer RNA-Ser. SOURCE Methanothermus fervidus. ORGANISM Methanothermus fervidus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (bases 1 to 3367) AUTHORS Haas,E.S., Brown,J.W., Daniels,C.J. and Reeve,J.N. TITLE Genes encoding the 7S RNA and a Ser-tRNA are linked to one of the two rRNA operons in the genome of the extremely thermophilic archaebacterium Methanothermus fervidus JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.J.Daniels, 02-FEB-1990. FEATURES from to/span description pept 132 887 ORF 260 RNA 944 1258 7S RNA tRNA 1273 1359 Ser-tRNA rRNA 1566 3046 16S rRNA tRNA 3105 3178 Ala-tRNA BASE COUNT 898 a 751 c 998 g 720 t ORIGIN 1 ggagacagtg gttcaccagt catacaagat ggtaaggttg tgggtaatat tgtcttcaat 61 atattacaca aatgctacag ctatagcatc ttccctagaa aaatttttag aaaggaagta 121 atgtggtggc catgaggatt ggatttattg gatttggaga agtaagttct actttatcac 181 aattttttaa ggataaggtg gaagtacaaa catgtgttaa aggtaggagt gaaaaaacta 241 aaaaaattgc aaaaaaactt ggagtaaaga tatacaaaga ttataaagac cttgtaaaaa 301 attctgacat tgttatttca gctgttactc cattttcagc tttagatgtg gctaaaaaat 361 atggaaaata tgtaaaaggt atatatgtag atgtgaataa tgtatcacct ctaacaaaac 421 ataaaatttt gaaatatatt gacgaggaaa aatttgttga ttgtgcaata attggaagaa 481 taaaacgtaa atttaaaatg atttgttcag gtaaaaatgc aaataaactt aaaatattag 541 aaaaatttgg agtacctatt gaagttatag gttcaaaggt tggggaagcc tcaacattaa 601 aaatgctcag aagtttatat actaaaagtc ttgctgcaat attattagaa gttttttcgg 661 ttgctaataa attaggactt atcgatgaat tacttgaaat attagaggaa acagaaggga 721 agaaatttgt agatttatgt aagtcaagag tggtaggatc atttattcac tcaaggagaa 781 gatatgaaga aatatgtgaa atagaaaaat ttatattatc tcataatttg aaaccaataa 841 tgatcaaatg tacaaaaaat atgtttaaac atatagagga atgttgacaa agattatatc 901 aaaaaattta aataagattg aaaaataaaa tataaaatgg cagcgaggct aggccggggg 961 gttaggggtc ccctgtaagc gcaaatcccc tatatggcgc ggccgaagcc caggaggcgg 1021 caagaccgcc agacatcggc ctgagggtta aacaatgaag cctcgtccca cagggccacc 1081 ggtggcgagg gtccagctgg agggctggac ctaatcgcct ttgctgcggg aacgggtcag 1141 gcccggaagg gagcagccct accgcagacg gatggtgctt gtgggtcaac ggggtggagt 1201 ctataaccct cagatcaccg gtgtctggtg gtcttgtcca ctcctgggcg tgcctcgctg 1261 ccaaatgtct tcagtagccg gggtggccca gcctggtagg gcgtcggcct gctaagccga 1321 tgatccgtta aggatcgcgc gggttcaaat cccgtccccg gcgtgcaata ttttctaatt 1381 tttttgtatt gatagtgttg ctaggtatgt aatgccatgg taccggcgtg gattcggagt 1441 gcctatttag gctcgatgaa gaatcgcact ccagccatgg caacaaaatt ggtgagagcc 1501 tgagtgaata actcaggctt gatacatagg tcccacgggt gctggtaaag tcccaaagga 1561 atcccccgcc aactccgttt gatcctggcg gaggccactg ctatgggggt ccgactaagc 1621 catgcaagtc gaacgggcct tgtgcccgtg gcgaacggct cagtaacacg tggacaacct 1681 accctggggt ccgggataac cccgggaaac tggggctaat cccggatagg cgaggtctcc 1741 tggaatgggg cctcgccgaa aggtcttttg accgccccag gatgggtctg cggccgatta 1801 ggtagttggt agggtaacgg cctaccaagc ctacgatcgg tacgggttgt gagagcaaga 1861 gcccggagac ggggcctgag acaaggcccc gggccctacg gggcgcagca ggcgcgaaaa 1921 ctccgcaatg cgcgaaagcg cgacgggggg acccccagtg ccactccgta aggagtggct 1981 tttccggagt gtaaaaagct ccgggaataa gggctgggca agaccggtgc cagccgccgc 2041 ggtaacaccg gcagcccgag tggtggccgc gtttattggg cctaaagcgt ccgtagccgg 2101 tccggtaagt ctccggtgaa agcccgcagc tcaactgcgg gagtagccgg agatactgcc 2161 ggacttgggg ccgggagagg ccggaggtac ccccggggta ggggtgaaat cctgtaatcc 2221 cggggggacc acctgtggcg aaggcgtccg gctggaacgg gcccgacggt gagggacgaa 2281 agccagggga gcgaaccgga ttagataccc gggtagtcct ggccgtaaac gatgcggact 2341 tggtgttggg gcaacctcga gttgccccag tgccgaaggg aagccgttaa gtccgccgcc 2401 tggggagtac ggccgcaagg ctgaaactta aaggaattgg cgggggagca ccacaacgcg 2461 tggagcctgc ggtttaattg gattcaacgc cggacacctc accgggggcg acggcaggat 2521 gatggccagg ttgatgacct tgcctgacga gccgagagga ggtgcatggc cgccgtcagc 2581 tcgtaccgtg aggcgtcctg ttaagtcagg caacgagcga gacccgcgcc cctagttgcc 2641 agcgggtccc gtaagggacg ccgggcacac tagggggacc gccagcgata agctggagga 2701 aggtgcgggc gacggtaggt ccgtatgccc cgaaaccccc gggctacacg cgggctacaa 2761 tggccgggac aatgggtacc gaccccgaaa gggggaggta atcccataaa cccggccgta 2821 gttcggatcg agggctgcaa ctcgccctcg tgaagctgga atgcgtagta atcgcgggtc 2881 actatcccgc ggtgaatacg tccctgctcc ttgcacacac cgcccgtcac gccacccaaa 2941 cggggttcgg atgaggccat ggcctctggc catggtcgaa tccgggcccc gtgaggaggg 3001 cgaagtcgta acaaggtagc cgtaggggaa cctgcggctg gatctaacct ccttaaaata 3061 ataatattcc tttgggacac caaaagcacc cgtgggacct tttttatttt tgggcccata 3121 gctcagcctg ggagagcgcc gcccttgcaa ggcggaggcc ccgggttcaa atcccggtgg 3181 gtccatgcaa ctagccatct tatgataggc tagtgaagct agatgatcaa agatgattct 3241 aggtgcatag gtagttcgtg gacaccaact gtagggaact ggtccaaggt ttaagccgcc 3301 cggtggatgg ctcggcttgg gcgccgacga aggccgtggc aagctgcgat aagcccgggt 3361 gaggcgc // LOCUS STASAU3AIM 3360 bp ds-DNA BCT 06-MAR-1990 DEFINITION S.aureus Sau3AI-restriction-enzyme and Sau3AI-modification-enzyme genes, complete cds. ACCESSION M32470 KEYWORDS Sau3AI-modification-enzyme; Sau3AI-restriction-enzyme. SOURCE S.aureus (strain 3AI) DNA. ORGANISM Staphylococcus aureus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 3360) AUTHORS Seeber,S., Kessler,C. and Goetz,F. TITLE Cloning, expression and characterization of the Sau3AI restriction and modification genes in Staphylococcus carnosus TM300 JOURNAL Unpublished (1990) Univ. of Tuebingen, D-7400 Tuebingen 1 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.t Seeber, 02-MAR-1990. FEATURES from to/span description pept 448 1917 Sau3AIR protein (ttg start codon) pept 2017 3255 Sau3AIM protein binding 437 441 ribosome binding site binding 2004 2010 ribosome binding site BASE COUNT 1295 a 393 c 638 g 1034 t ORIGIN 1 aagtcgacct tcaccaagac cgaattttcc gcgtataccc gcagtaccaa acgttaattt 61 actttcaaaa ccttctcgct gttcaatgtc agattgctgc tcataaaaat gtttaactaa 121 actatcatta gctctttcta tccataattc tttatccatt gttgctaaac atcctttcaa 181 aatctcagtt agacttaata aaacatgaaa actaaagccc ttacatttat gtaatgaatt 241 ataaagaaat acgccccaaa agtaaaaaaa cacagcccca agacaatact tttcacaagt 301 attatataat agatgtgtat gaaaatgcat ggagtagatg taagagtgat attcaaaatg 361 tgtaaaaaat atggataatt ctatataatt atattattga aattttaaat agcggaaata 421 attatttaat gttaagaggg gataattttg gaaagttatt tgacaaaaca agccgtacat 481 aatcgcgcaa aagaagctgt tggtaaaagt gtattagaat taaatggtgg tgaatcgatt 541 aaacaaagta agagttcagt tggtgatgca tttgaaaatt ggtttggtaa gaaaaaagac 601 agtgatagta aaccagatat ggcagaagct ggggtggaac ttaaggcaac gccatttaaa 661 aagttgaaaa acggaaagta tagctccaaa gaaagattag tattaaatat tataaactat 721 gagaaagtgg caaatgaaaa ttttgaaact agtagttttt tatctaagaa taatactata 781 gaattagctt tctatgaata tatcaaggga acacctagtg ataattggat tattaaagaa 841 gcggtgcttt atgaaatgca taaaaacccg attgattatg aaataattaa acaagattgg 901 gaaataataa atcaatatat taatgaagga aaggcacatg aattgagtga aggtttgaca 961 agttatttag cgccatgtac aaagggtgcg aatgctagtt ctttaagaaa tcagccttat 1021 tcagacataa aagcaaagca aagagcattt tctttgaagt ctgggtatat gacatctatt 1081 ttacgcaaat atgttctagg tgatgaaaaa atagattcaa ttgtcaaaga cccatttgaa 1141 ataaaagaaa aatcaataga ggacatagtc tttgaaaaat ttcagccata tataaactgg 1201 tcaatcgata aattatgcga acatttttct atcaataaag gtgagaaagg tttaaattat 1261 agaatagcct ctgccatttt aaatctaaaa ggtaaaacta ctaaaagtaa accattcccg 1321 gaagttgaag agtttgaaaa atcatctata gtagtcaaaa cagttcattt taataaaaag 1381 aatgtgaata aagaaagtat gtcatttgga gcttttaaat ttgaagaact agctaatgag 1441 gaatgggaag atagtgaagg atatcctagt gcacaatggc gaaacttttt gttagaaaca 1501 aggtttttat tttttgttgt taaagaagat gaagatggtg tagacatatt caaaggaata 1561 aaatttttta gtatgcctga agaagacata aacggacctg tcaaaagaat gtgggatgat 1621 acagtgaaga aattaaaaga gggtgtcaca ttagaagctg taccggacaa aagtacaaag 1681 gatggttgga gaataaaaaa taattttgta gataaaagtg atgatttaat ttgccatgtt 1741 agaccacaca ctaataacag agactatcgt ggaggaagta atgcagataa gcttcctaaa 1801 aagattaact ggattaatag acctgactca gatgattatt cggatgagtg gatgactaaa 1861 caaagttttt ggataaataa tgactacata aaaaagcaag ttgaagattt attgtagtta 1921 aagtatgtta aaatataaga tattctttta aaatatcgaa cgatcgttcg tattttgtgt 1981 tataataagg ttgaattaag tataggaggt cgcctaatga ataaaattaa agtagtagaa 2041 ttgtttgcgg gtgtaggcgg gtttcgttta ggtttagaaa atacgaaaaa tggtatattt 2101 gacataactt gggcaaatca atgggagccc tcacgaaaaa tccaacatgc atttgattgt 2161 tatagtaaaa gatttaagaa cggcatccat agtaataagg atattgccca ggtatctgat 2221 gaagaaatgg caaatactga agctgatatg attgtaggag gatttccttg ccaagattat 2281 tcagttgcaa ggagtttaaa tggagaatta ggaatacaag gaaaaaaggg cgttttattc 2341 tggcaaatta ttagatatat tcaaaataca tttcctaaat acttgttgct tgaaaatgtt 2401 gatagattat tgaaatcacc ttcgagtcag agagggagag actttgctgt aatgttatca 2461 accttaaatg agttaggcta taatgttgaa tggcgcgtga ttaatgctgc tgattatggc 2521 aatgctcaaa gacgtagaag ggtatttata tttggatata agcaagattt aaactatagc 2581 aaagctatgg aagaaagtcc gttggataaa attatatatc acaatggttt gtttgctgaa 2641 gcttttccga ttgaagatta tgccaataaa aatagagtaa ataggactca tattactcat 2701 gatatagtcg atatttcaga taatttcagt tttcaatttt ataatagtgg aatcatgaaa 2761 aatggagaaa ttttaactat tgacacaata ccaaaatatg aaaaatcagt aaccttagga 2821 gaaattattg aaagtaatgt agatgatggt ttttcattaa atcaagatca aattgataaa 2881 tttaaatatt taagaggacc caaaaagatt aaacgaacta ctaaagatgg tcatgaatat 2941 tatttttcag aaggtggtat gtctgaaaca gattcattag agttacctgc aagaacaatg 3001 cttacaagtg aatcatctat taatagaagt actcattttt taaacgtaga tggtgtttat 3061 agaactttga cacctattga agcagaaagg ttgaatgggt ttccagataa ttggacagaa 3121 ggtatgccaa ttaagatgag atacttttgt atgggcaatg ctcttgttgt gcctttgatt 3181 actagaatag gtaatcaaat tgaaaaaatt gatagtatta caaatgatga attcagtcag 3241 ctacgtttat tttaaataaa tcaaatgtag agtgcgtttg atttaacaag tttgtatact 3301 aaagattcat agattattgt atattgacca gtatttatca gcgtattatt ttaatatata // LOCUS TRPTYF1 1321 bp ds-DNA BCT 06-MAR-1990 DEFINITION T.pallidum pallidum antigen TyF1 gene, complete cds. ACCESSION M32401 KEYWORDS antigen. SOURCE T.pallidum pallidum (strain Nichols) DNA. ORGANISM Treponema pallidum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; Spirochaetales; Spirochaetaceae. REFERENCE 1 (bases 1 to 1321) AUTHORS Noordhoek,G.T., Hermans,P.W.M., Paul,A.N., Schouls,L.M., van der Sluis,J.J. and van Embdon,J.D.A. TITLE Treponema pallidum subspecies pallidum (Nichols) and Treponema pallidum subspecies pertenue (CDC 2575) differ in at least one nucleotide: Comparison of two homologous antigens JOURNAL Microb. Pathog. 6, 29-42 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by G.T. Noordhoek, 02-MAR-1990. FEATURES from to/span description pept 615 1148 antigen TyF1 variant 736 736 g in T.pallidum pallidum; a in T.pallidum pertenue BASE COUNT 349 a 297 c 353 g 322 t ORIGIN 1 aacagcacgc gggcccccgg actacatcgc tcttgtgaac ctcaacggtc gcctttccat 61 actctaagga ataggagcac gagtacgtat cccccggtag tttccccgcc ttccgaacta 121 aaataagagg tattcccatg cgatctgcaa aaggcgcggc aaaaataaag ccacgtgatt 181 cgattgctgc gaccgcggta acgtgctcat cgcggtagaa ttccaccatt tgatcaagac 241 agtaacgaaa tacagccgcg ttcatcagca cgccagtaat gtcgtagtag agaattcctt 301 ttttagggaa atcaatccgc ttacgaattg cgcggtccag cgccgcgtgt ccgtccatag 361 gggcatggta acgtccaata ccacgcacgt caatgatctt accggtttgt tgggaggctt 421 ggtggattga gaattacgtc tcctggaaaa aagatttcgc tgaaacttca cgaaatctcg 481 gtgaaaataa atgattattt taccaatcgg tgaaaaaaag ccgggaaaag tccaaaaaga 541 cagtggttat gctccatttc tttcgatttt ttgttggcat ggtttttgct ttaaagtttg 601 gaggagaaag aacgatgaac atgtgtacag atggaaaaaa ataccacagc accgccacga 661 gcgctgcagt tggagccagc gcccccggtg taccggacgc tcgtgccatt gctgctatct 721 gcgagcaatt gcgccggcat gtagcggatc tgggagtact gtatatcaag ctacataact 781 atcactggca catctacggc attgagttta aacaggtgca tgagctcctt gaagagtatt 841 atgtatcagt tactgaagcc tttgatacga ttgccgagcg gttgttacag ctgggcgcgc 901 aggctcctgc gtctatggct gaataccttg cgttgagtgg aattgcagaa gagacggaga 961 aagagatcac tatcgtctct gcgcttgcgc gcgtaaagcg ggattttgaa tacctaagta 1021 cgcgattcag ccaaacgcaa gtacttgcag ctgaaagtgg ggatgcagtg actgacggca 1081 ttatcacaga catactgagg acgttgggaa aggccatttg gatgcttggt gctaccctga 1141 aagcctaggt agagcaggct gtacgtacaa cacacgtacg gccatgcgct ggaagtcctg 1201 tattttgcac ataaggcctc tctcccgtta cagcatgagg ggagggaggt gttggttgaa 1261 gtgcttgggg aagtgtgcat aatcgtccta cggaaggggg cgttttgtgg aaaaaattgt 1321 t // LOCUS YS5MTII 1257 bp ds-DNA PLN 07-MAR-1990 DEFINITION C.glabrata metallothionein (MTII) gene, complete cds. ACCESSION J05398 KEYWORDS metallothionein. SOURCE C.glabrata (ATCC 2001) DNA. ORGANISM Candida glabrata Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 1257) AUTHORS Mehra,R.K., Garey,J.R. and Winge,D.R. TITLE Selective and tandem amplification of a member of the metallothionein gene family in Candida glabrata JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.R.Garey 07-MAR-1990. FEATURES from to/span description pept 631 789 metallothionein (MTII) rpt 1246 > 1257 amplifiable repeat BASE COUNT 420 a 255 c 199 g 383 t ORIGIN 1 ccccgggatc gtttagaccc caagaatcca agatatgttt tagctgacag ggtgcatggc 61 tactccgtga tttaccatat gtccatcgca ttcgtcatgg cctgatgcca atgaccatac 121 tctttagact cggatccgag aagtcatccc tatccatacg atatcttctt ccgagtccat 181 tagataaggc aacgctagat ttagctgatt gattgccctg agaattaaag ggaatcagcg 241 aaaatagaaa aaaaggtttt ggatggatat ttttgttctt tttttgagac attggtgtca 301 ctgccacagc ttaataacat ggcaatcagc agtgtctaga ttcagcggaa tttagtggaa 361 gcgaaatgaa ttcggctgac ttagtgccta tcagtaatat acatatttgc tttttgcttt 421 ggcattgaat ttatcaataa ctgcataccc aaaattaaaa ataaaataga aaaaatagta 481 atatataaag cacccgtaac ttcccatttc tgggaaactt ggaattcatt tctcccattt 541 atcctttctt ctatatatcg aatcaacaca tcaacaatac caacaaactt caacttatac 601 acaacatcta atatttatta tagcttcgaa atgcctgaac aagtcaactg ccaatacgat 661 tgccactgct ccaactgtgc ttgtgaaaat acttgcaact gctgtgccaa gccagcatgt 721 gcttgcacaa actctgcttc caatgaatgc tcctgccaaa cttgcaagtg tcaaacatgc 781 aagtgctaaa cagcattcaa agaagaataa tttctagtat ttttgctatt gttgctttat 841 ttttcatgat tattgataag gtactgggta tcttctgcta atgacgctga actagctcta 901 tatgctagct atatacatta cgtataaaaa aatataaaaa tataaaaata taaaaaaaaa 961 ataaaaaaat cacttcaaaa actataacaa ctaccaagtt tctattttct ctttgactaa 1021 atcaaatcga tctagtaaac attgcagaac aatttgtgta attatcgtct gatctaaggg 1081 ccggcgtaag caattctaac cgaccacact acgttactta acaaccacac ttgctcttga 1141 atacgaaaca gcaagctcta cctatatgaa atggtctaca aaagaagttt tctctagtac 1201 tctgagaagg ttatattata catcattgaa atgcttaaag ctgatttgca tcccggg //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATA2UGLB 506 bp ds-DNA ROD 13-FEB-1990 DEFINITION Rat alpha-2u globulin gene, exon 1. ACCESSION M33690 M32014 KEYWORDS alpha-2u globulin. SOURCE Rat (strain Wistar-Lewis) adult liver DNA, clone 1H. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 138 to 506) AUTHORS Shaw,P.H., Walter-Sierra,R., Tamone,F. and Schibler,U. TITLE Rapid identification of DNA fragments containing promoters for RNA polymerase II JOURNAL Gene 84, 371-381 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 137) AUTHORS Tamone,F. JOURNAL Unpublished (1990) Dept. Mol. Biol. U. of Geneva, Geneva, SWITZ. STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Tamone, 07-FEB-1990. FEATURES from to/span description pept 436 > 506 alpha-2u globulin, exon 1 mRNA 377 > 506 alpha-2u globulin mRNA signal 1 376 promoter BASE COUNT 146 a 120 c 117 g 123 t ORIGIN Chromosome 5. 1 cttcttccac actccactca agtcttgaag caaagattct ctccatgctt gaccttccag 61 ttcagtaccc acccactcca taactggttt tctgattcta agtcagatcc aatgttgcat 121 gtggttccag atggccctct agaacaccca ctgtttttct tgggaatatg ttttgcgaaa 181 tgtatgagtg atagaatcaa tccataggag atgacatcgc caagtttcaa aagggcagga 241 acaatccttg gcttcacatc agtacatgag gaaaacattc cacaaagcct gaaggatgga 301 ggcccatatg agaaggaaaa aaaaacactc aaacccagag agagtataaa gacgagcaaa 361 gtgctggagg tggagtgtgg gcaccatcag cagagggatt gtcccgacag agaggcaatt 421 ctattcccta ccaacatgaa gctgttgctg ctgctgctgt gtctgggcct gaccctggtc 481 tgcggccatg cagaagaagc tagttt // LOCUS RATA2UGLBA 371 bp ds-DNA ROD 13-FEB-1990 DEFINITION Rat alpha 2u globulin pseudogene, exon 1. ACCESSION M33652 M32014 KEYWORDS alpha-2u globulin. SOURCE Rat (strain Wistar-Lewis) adult liver DNA, clone 1F. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 371) AUTHORS Shaw,P.H., Walter-Sierra,R., Tamone,F. and Schibler,U. TITLE Rapid identification of DNA fragments containing promoters for RNA polymerase II JOURNAL Gene 84, 371-381 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F. Tamone, 07-FEB-1990. FEATURES from to/span description pept.ps 301 > 371 pseudo-alpha-2u globulin, exon 1 mRNA 243 > 371 alpha-2u globulin mRNA BASE COUNT 119 a 71 c 96 g 85 t ORIGIN Chromosome 5. 1 tctagaacat ccattgtttt tcttgggaat atgttttgtg aaatgtatga gtgataaaat 61 catccatagg agatgagatt gcccaagttt gaaaagggca ggaacaatcc ttggcttcac 121 atcagtacat gagaaaacat tccacaaagc ctgagggaag gaggcccata tgagaaggaa 181 aaattaaaaa acactcgacc cagagagagt ataaagagga gcaaagtgct tgaggtggag 241 tgtgggcacc atcaacagag agagtgtccc gacagagagg caattctatt ccctaccaac 301 atgaagctgt agctgctgct gctgtgtctg ggcctgaccc tggtctgtgg ccatgcagaa 361 gaagctaatt c // LOCUS RIRSTPSTA 2906 bp ds-DNA BCT 12-FEB-1990 DEFINITION R.tsutsugamushi heat shock protein 11 (stp11) and major antigen 58 (sta58) genes, complete cds. ACCESSION M31887 KEYWORDS antigen; heat shock protein; heat shock protein 11; major antigen 58; outer membrane protein. SOURCE R.tsutsugamushi (strain KARP) DNA, clone pRTS58H2.9. ORGANISM Rickettsia tsutsugamushi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae. REFERENCE 1 (bases 1 to 2906) AUTHORS Stover,C.K., Marana,D.P., Dasch,G.A. and Oaks,E.V. TITLE Molecular cloning and sequence analysis of the sta58 major antigen of Rickettsia tsutsugamushi: Sequence homology and antigenic comparison to the 60 kilodalton family of stress proteins JOURNAL Unpublished (1990) Walter Reed Hospital, Washington,D.C. 20307 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.V.Oaks, 02-FEB-1990. FEATURES from to/span description pept 780 1064 heat shock protein 11 pept 1137 2804 major antigen 58 binding 770 775 ribosome binding site signal 675 680 -35 promoter signal 700 705 -10 promoter BASE COUNT 1083 a 354 c 599 g 870 t ORIGIN 1 aagctttgca acattaattg ccaattcttt tgacagtaat gtaattgcgg ttagaaaaag 61 tgactgaaac ctcagaaaaa agaattgtag gagttaattg cttaatacaa tatttacatc 121 taatagtaaa acatttgact ttcttagttt taattttagg ctagtgaaaa ctgatcaatc 181 taaaattaaa attcagaaaa atgtctagaa cgagagcaca aacttctctg ctaaaaatac 241 taaaagaaat attctgcaag tttatattgt aaccaattga tagagtaata tatcttatca 301 accagatatt acatggcttg ataaattact ttaggattgg taactctagc agttgtttta 361 gatatgtaat ggacttgttt gataagatag taagacatca tctaatgcaa tcaagtggat 421 tgaaaggctt ttgctggaaa aggttgtgta gaagaaagtt atatgaaaag gtggaattat 481 ataatgatta ccaaattcga tagatctgtg gtaagcaata acttctgata ggctataagc 541 aatgatatag ctaaaccagt ataaaacagt tatgttatat actttttcaa attaattaag 601 aaaaacaatt agtgcgatat taatggttta aactcagcgt ataaaagaaa aatataaaaa 661 ttttttattc acacttgaaa taagtagtaa cagaagatat atgattaggg catagtaaaa 721 gtatagatta actaattatg ctaatttaat tatatagagt taaatttttg gaggattata 781 tgaaatacca accactgtat gatcgtgtgc tagttgagcc aatacaaaat gatgaagcac 841 atggaaaaat tcttattcca gataccgcaa aggagaaacc aacagaagga atagtagtta 901 tggttggcgg tggctataga aatgataaag gtgatattac accgttaaaa gtaaagaaag 961 gagatactat agtctataca aaatgggctg gtactgaaat aaaattagaa agtaaagact 1021 atgtagttat taaagaatca gatatattat tagttaaaag ctaaaatgct taatgtatta 1081 ataattaaag gatgctagac taatcaaatt ataaaatatt ttaaggagtt ttcagtatga 1141 gtaaacaaat tgtacatggc gatcaatgtc gtaaaaaaat aatagaaggc ataaatgttg 1201 tagctaatgc agtaggtata acattaggac ccaaaggtag atgtgtagca atagaacaat 1261 cttatggtcc tcctaaaatt actaaagatg gagttagtgt tgctaaagca attcaactta 1321 aggataaatc tcttaatgtt ggagcccaat ttgttatatc agttgctagt aaaacagctg 1381 atgtggctgg tgatggtaca actactgcta cagttatagc tgatgctgca gttagggagc 1441 taaataaagc tgaagttgca ggcattgata tacaggaagt aagaaaaggc gctgagaaag 1501 ctgttgaagc agttattgct gatgtacgta aaaacagcag ccctgttaaa aatgaggaag 1561 aaattgctca agtagctact gtttcatcta atggagaccg cgaaattggt gaaaagattg 1621 ctaatgcaat gaaacaagtt ggccaagaag gagtgataac tgttgaagat tctaaaaatt 1681 ttaactttga agttgaagtt gttaaaggaa tgagatttga ccgcggatat atatcacagt 1741 actttgcaac gaatcgtgaa aagatgatta ccgaatttga aaatccttat attttgctat 1801 tggatcagaa ggtatctaca gtgcagccac tggttcctgt gcttgaagct gttgctcaca 1861 ctggcaagcc attagtattg attgctgatg atgtagacgg agaagctctt actgcattga 1921 tattaaacaa tttaaaaggt tctattaaag tagtagctgt taaggcacct ggttttggtg 1981 atcgaaaaaa agaaatgtta gaagatattg ctattttaac aaatggagaa gtaattactg 2041 aacaactagg aatcaaacta gaaaaggtta atgatacatc taagttagga actgcaaata 2101 gagttattgt tacaaaagat catactacta tagttcatga taagaataat agtgatatag 2161 agaagaaagt taattctcgt tgtgaacaga tccgtgaagc aattaaagat acaacttcag 2221 attatgaaaa agaaaaatta caagagcgct tagcaaaact tagaaacggt gtagctgtac 2281 taaaagttgg tggtgctact gaagtagaac aaaaagaacg taaggataga gtagaagatg 2341 ctttacatgc tactagagca gctgttgaag aaggtatagt gcctggtggt ggtgttgcgc 2401 tattttatgc ttctcgtgtt ttagatagtt tgaagtttga taatgaagat caaagggtag 2461 gtattaacat tatcaaaaaa gtattagaag ctccggtaag acaaatagtt aaaaatgctg 2521 gaggcaaaga ggatgttgtt gttaatgaac taagcaagag tacagataaa aaccgtggat 2581 tcgatgctag aacaatgcaa tacgttgata tgataaaagc aggtattgtg gatccaacta 2641 aagttgtacg tactgcgtta caagatgctt tttctgtggc ttcattagta attgctacct 2701 cagctatgat tactgatcat gaggaagata ataatactgg taaccgcagt ggtggtggag 2761 taggtggtgg acatcatggt ggtatgggcg gaatggattt ctaatccgcc cccatgtcta 2821 ccttcaaaaa aaatagagta taacttttgt ataaggttgc tagttatgta tttttttaca 2881 aaattgcagt ataatatttc aagctt // LOCUS YSCTUP1A 2891 bp ds-DNA PLN 12-FEB-1990 DEFINITION S.cerevisiae TUP1 protein gene, complete cds. ACCESSION M31733 KEYWORDS regulatory protein. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2891) AUTHORS Williams,F.E. and Trumbly,R.J. TITLE Characterization of TUP1: A mediator of glucose repression in Saccharomyces cerevisiae JOURNAL Unpublished (1990) Dept. Biochem. Ohio Med. Coll., Toledo, OH 43699 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by R.J.Trumbly, 30-JAN-1990. FEATURES from to/span description pept 509 2650 TUP1 protein mRNA 234 2780 TUP1 mRNA signal 108 112 TATA box BASE COUNT 899 a 700 c 530 g 762 t ORIGIN Chromosome III. 1 tgtacatccc aaagacataa acatatgtat ataagaacaa cataaacaaa taaataaacc 61 aggaaagccc taatttaccc ttctttttgg ctgagcaatc taggagttaa taattttttg 121 gaactttccc cctaacattt ttagagacag aaccctctta cattatcgct accgacggac 181 ctttaatttt tcgttacttc tcaaaaaaca ctgaaaaact gcgaaaaaat taaaaagctc 241 tcccgtcaaa gcaacaaaca acaaaacaaa ataacaagta cataagtaaa caaatagaga 301 gaaattcccc catctacccc caattgataa acatcagcga agcaagaaca actggctgaa 361 cacgtgcccc tctatctgtc cttctgcaat tgtttaattt tctttcacgc caagttacct 421 ttcgcacacc actactcttt ttctattgtt tttttgtctt ttttgataag caggggaaga 481 aagaaatcag ctttccatcc aaaccaatat gactgccagc gtttcgaata cgcagaataa 541 gctgaatgag cttctcgatg ccatcagaca ggagtttctc caagtctcac aagaggcaaa 601 tacctaccgt cttcaaaacc aaaaggatta cgatttcaaa atgaaccagc agctggctga 661 gatgcagcag ataagaaaca ccgtctacga actggagcta actcacagga aaatgaagga 721 cgcgtacgaa gcagagatca agcacttgaa actagggctg gagcaaagag accatcaaat 781 tgcatctttg accgtccagc aacagcagca acagcaacag cagcaacagg tccagcagca 841 tttacaacag caacaacagc agctagccgc tgcatctgca tctgttccag ttgcgcaaca 901 accaccggct actacttcgg ccaccgccac tccagcagca aacacaacta ctggttcgcc 961 atcggccttc ccagtacaag ctagccgtcc taatctggtt ggctcacagt tgcctaccac 1021 cactttgcct gtggtgtcct caaacgccca acaacaacta ccacaacagc aactgcaaca 1081 gcagcaactt caacaacagc aaccacctcc ccaggtttcc gtggcaccat tgagtaacac 1141 agccatcaac ggatctccta cttctaaaga gaccactact ttaccctctg tcaaggcacc 1201 tgaatctacg ttgaaagaaa ctgaaccgga aaataataat acctcgaaga taaatgacac 1261 cggatccgcc accacggcca ccactaccac cgcaactgaa actgaaatca aacctaagga 1321 ggaagacgcc accccggcta gtttgcacca ggatcactac ttagtccctt ataatcaaag 1381 agcaaaccac tctaaaccta tcccaccttt ccttttggat ctagattccc agtctgttcc 1441 cgatgctctg aagaagcaaa caaatgatta ttatatttta tacaacccgg cactaccaag 1501 agaaattgac gttgagttac acaaatcttt ggatcatact tcagttgttt gttgcgtgaa 1561 gttcagtaac gatggtgaat acttagccac aggctgcaac aaaactactc aagtgtatcg 1621 cgtttcagac ggttctctgg tggcccgttt atctgacgat tctgctgcca ataaccatcg 1681 aaattcgatc actgaaaata acaccaccac gtccacggat aacaatacaa tgacaaccac 1741 tactaccacc acaattacta ccacagcgat gacttcggca gcagaattgg caaaagatgt 1801 ggaaaacctg aacacttcgt cttccccatc atccgacttg tatatccgtt cagtgtgttt 1861 ttctccagat gggaaatttt tggcaacagg tgctgaagac agactgatta gaatttggga 1921 tattgaaaat agaaagattg ttatgattct tcaaggccac gaacaagata tttattcatt 1981 ggactacttt ccctcaggtg acaaattagt ctccggttct ggtgaccgta ccgttcgtat 2041 ttgggactta cgtacaggcc agtgttcatt gactttatcc attgaagatg gtgttaccac 2101 cgtcgctgta tcaccaggtg atggtaaata catcgctgct ggttctctag atcgtgctgt 2161 gagagtttgg gattccgaga ccggattctt ggtggaaaga ctagattcgg aaaacgaatc 2221 cggtacaggc cacaaggact ctgtttatag cgttgtcttc acgagagatg gacaaagcgt 2281 tgtatccggc tcattagata gatctgttaa gctctggaat ttgcagaatg caaacaacaa 2341 gagcgattcg aaaactccaa attccggcac ttgtgaagtt acgtatatcg ggcataaaga 2401 ctttgtattg tccgtggcca ccacacaaaa tgatgagtac atcttgtccg gttccaaaga 2461 tcgtggtgtc ctgttttggg ataagaaatc cggcaatccg ttattgatgt tgcaaggtca 2521 taggaattca gttatatctg tggctgtggc aaacgggtct tcgctgggtc cagaatataa 2581 cgtttttgct actggtagcg gtgattgtaa agcaaggatt tggaagtata aaaaaatagc 2641 gccaaattaa gaacagaaca caaaaggaac actttacaaa tgtaactaac taaactattc 2701 ttcaattcat tgtcattata tctttctctc tattcttttt tatctactta tttgttatgt 2761 tatgatccag gtacgcttta aaacaaacag ttactttcct caccattcat cccttcccct 2821 ccttttcctg tgtctgaata ttgcggttca ttattctttg atatgaattt atttgtgtat 2881 tgtataagct t //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS PFAMSAA 433 bp ds-DNA INV 13-FEB-1990 DEFINITION P.falciparum (strain B358) major surface antigen p190 gene, partial cds. ACCESSION M32111 KEYWORDS major surface antigen p190. SOURCE P.falciparum (strain B358) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 433) AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A. TITLE Genetic diversity in the major merozoite surface antigen of Plasmodium falciparum: High prevalence of a third polymorphic form detected in strains derived from malaria patients JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A. Scherf, 14-FEB-1990. FEATURES from to/span description pept < 1 > 433 major surface antigen p190 (AA at 1) BASE COUNT 148 a 75 c 83 g 127 t ORIGIN 1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atggtttatt tcataaggaa 61 aaaatgatct taaatgaaga agaaattact acaaaaggtg caagtgctca aggttcaagt 121 ggtccaagtg gtacaccaag tggtacaagt ggtccaagtg gtacaagtgg tccaagtggt 181 acaagtggtc caagtggtac aagtccatca tctcgttcaa acactttacc tcgttcaaat 241 acttcatctg gtgcaagccc tccagctgat gcaagcgatt cagatgctaa atcttacgct 301 gatttaaaac acagagtacg aaattacttg ttcactatta aagaactcaa atatcccgaa 361 ctctttgatt taaccaatca tatgttaact ttgtgtgata atattcatgg tttcaaatat 421 ttaattgatg gat // LOCUS PFAMSAB 388 bp ds-DNA INV 13-FEB-1990 DEFINITION P.falciparum (strain B342) major surface antigen p190 gene, partial cds. ACCESSION M32112 KEYWORDS major surface antigen p190. SOURCE P.falciparum (strain B342) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 388) AUTHORS Kimura,E., Mattei,D., Mana di Santi,S.A. and Scherf, TITLE Genetic diversity in the major merozoite surface antigen of Plasmodium falciparum: High prevalence of a third polymorphic form detected in strains derived from malaria patients JOURNAL Unpublished (1990) Inst. pasteur, Paris Cedex 15 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A. Scherf, 14-FEB-1990. FEATURES from to/span description pept < 1 > 338 major surface antigen p190 (AA at 1) BASE COUNT 138 a 63 c 70 g 117 t ORIGIN 1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa 61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagcctgc aggtgctgta 121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact 181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat 241 gctaaatctt acgctgattt aaaacataga gctcaaaatt acttgttcac tattaaagaa 301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttacc taaaaatgtt 361 gatggtttca aatatttaat tgatggat // LOCUS PFAMSAC 379 bp ds-DNA INV 14-FEB-1990 DEFINITION P.falciparum (strain B608) major surface antigen p190 gene, partial cds. ACCESSION M32113 KEYWORDS major surface antigen p190. SOURCE P.falciparum (strain B608) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 379) AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A. TITLE Genetic diversity in the major merozoite surface antigen of Plasmodium falciparum: High prevalence of a third polymorphic form detected in strains derived from malaria patients JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A. Scherf, 14-FEB-1990. FEATURES from to/span description pept < 1 > 379 major surface antigen p190 (AA at 1) BASE COUNT 135 a 64 c 62 g 118 t ORIGIN 1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atggtttatt tcataaggaa 61 aaaatgctct taaatgaaga agaaattact acaaaaggtg caagtgctca aagtggtaca 121 agttgtacaa gtggtccaag tggtacaagt ccatcatctc gttcaaacac tttacctcgt 181 tcaaatactt catctggtgc aagccctcca gctgatgcaa gcgattcaga tgctaaatct 241 tactctgatt taaaacacag agtacgaaat tacttgttaa ctatcaaaga actcaaatat 301 cctcaactct tttatttaac taatcatatg ttgactttat ctaaaaatgt tgatggtttc 361 aaatatttaa ttgatggat // LOCUS PFAMSAD 388 bp ds-DNA INV 13-FEB-1990 DEFINITION P.falciparum (strain B440) major surface antigen p190 gene, partial cds. ACCESSION M32114 KEYWORDS major surface antigen p190. SOURCE P.falciparum (strain B440) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 388) AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A. TITLE Genetic diversity in the major merozoite surface antigen of Plasmodium falciparum: High prevalence of a third polymorphic form detected in strains derived from malaria patients JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A. Scherf, 14-FEB-1990. FEATURES from to/span description pept < 1 > 388 major surface antigen p190 (AA at 1) BASE COUNT 138 a 62 c 69 g 119 t ORIGIN 1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa 61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagccagt acctgctgta 121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact 181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat 241 gctaaatctt acgctgattt aaaacataga gttcaaaatt acttgttcgc tattaaagaa 301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttatc taaaaatgtt 361 gatggtttca aatatttaat tgatggat // LOCUS PFAMSAE 388 bp ds-DNA INV 13-FEB-1990 DEFINITION P.falciparum (strain B411) major surface antigen p190 gene, partial cds. ACCESSION M32115 KEYWORDS major surface antigen p190. SOURCE P.falciparum (strain B411) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 388) AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A. TITLE Genetic diversity in the major merozoite surface antigen of Plasmodium falciparum: High prevalence of a third polymorphic form detected in strains derived from malaria patients JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A. Scherf, 14-FEB-1990. FEATURES from to/span description pept < 1 > 388 major surface antigen p190 (AA at 1) BASE COUNT 138 a 61 c 70 g 119 t ORIGIN 1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa 61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagcctgc aggtgctgta 121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact 181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat 241 gctaaatctt acgctgattt aaaacataga gttcaaaatt acttgttcac tattaaagaa 301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttatc taaaaatgtt 361 gatggtttca aatatttaat tgatggat // LOCUS PFAMSAF 388 bp ds-DNA INV 14-FEB-1990 DEFINITION P.falciparum (strain B439) major surface antigen p190 gene, partial cds. ACCESSION M32116 KEYWORDS major surface antigen p190. SOURCE P.falciparum (strain B439) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 388) AUTHORS Kimura,E., Mattei,D., Mana di Santi,S. and Scherf,A. TITLE Genetic diversity in the major merozoite surface antigen of Plasmodium falciparum: High prevalence of a third polymorphic form detected in strains derived from malaria patients JOURNAL Unpublished (1990) Inst. Pasteur, Paris Cedex 15 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A. Scherf, 14-FEB-1990. FEATURES from to/span description pept < 1 > 388 major surface antigen p190 (AA at 1) BASE COUNT 138 a 61 c 70 g 119 t ORIGIN 1 aaaaaactag aagctttaga agatgcagta ttgacaggtt atagtttatt tcaaaaggaa 61 aaaatggtat taaaggatgg agcaaatact caagttgttg caaagcctgc aggtgctgta 121 agtactcaaa gtgctaaaaa tcctccaggt gctacagtac cttcaggtac tgcaagtact 181 aaaggtgcta taagatctcc aggtgctgca aatccttcag atgattcaag tgattcagat 241 gctaaatctt acgctgattt aaaacataga gttcaaaatt acttgttcac tattaaagaa 301 ctcaaatatc ccgaactctt tgatttaacc aatcatatgt tgactttatc taaaaatgtt 361 gatggtttca aatatttaat tgatggat //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS HPEPP 2570 bp ss-RNA VRL 13-APR-1990 DEFINITION Hepatitis E virus polyprotein (encoding NTP-binding protein and RNA-dependent RNA polymerase) mRNA, 3' end. ACCESSION M32400 KEYWORDS NTP-binding protein; NTP-binding protein; polyprotein. SOURCE Hepatitis E virus (strain Burma; enterically transmitted non-A, non-B), cDNA to viral RNA, (host human liver and bile). ORGANISM Hepatitis E virus Viridae; ss-RNA nonenveloped viruses. REFERENCE 1 (bases 1 to 2570) AUTHORS Reyes,G.R., Purdy,M.A., Kim,J.P., Luk K,-C., Young,L.M., Fry,K.E. and Bradley,D. TITLE Isolation of a cDNA from the virus responsible for enterically transmitted non-A, non-B hepatitis JOURNAL Science 247, 1335-1339 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2570) AUTHORS Tam,A.W., Smith,M.M., Kim J,P., Young,L.M., Piatak,M., Feldman,R.A., Purdy,M.A., Bradley,D.W., Reyes,G.R. and Fry,K.E. TITLE Hepatitis E virus: A positive strand RNA virus JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Authorin entry and computer-readable sequence for [1] kindly submitted by K.E.Fry, 02-MAR-1990. FEATURES from to/span description pept < 1 2184 polyprotein (NTP-binding protein/RNA-dependent polymerase region) (AA at 1) BASE COUNT 429 a 800 c 711 g 630 t ORIGIN 1 gttgttcagt accagtttac tgcaggtgtg cctggatccg gcaagtcccg ctctatcacc 61 caagccgatg tggacgttgt cgtggtcccg acgcgtgagt tgcgtaatgc ctggcgccgt 121 cgcggctttg ctgcttttac cccgcatact gccgccagag tcacccaggg gcgccgggtt 181 gtcattgatg aggctccatc cctcccccct cacctgctgc tgctccacat gcagcgggcc 241 gccaccgtcc accttcttgg cgacccgaac cagatcccag ccatcgactt tgagcacgct 301 gggctcgtcc ccgccatcag gcccgactta ggccccacct cctggtggca tgttacccat 361 cgctggcctg cggatgtatg cgagctcatc cgtggtgcat accccatgat ccagaccact 421 agccgggttc tccgttcgtt gttctggggt gagcctgccg tcgggcagaa actagtgttc 481 acccaggcgg ccaagcccgc caaccccggc tcagtgacgg tccacgaggc gcagggcgct 541 acctacacgg agaccactat tattgccaca gcagatgccc ggggccttat tcagtcgtct 601 cgggctcatg ccattgttgc tctgacgcgc cacactgaga agtgcgtcat cattgacgca 661 ccaggcctgc ttcgcgaggt gggcatctcc gatgcaatcg ttaataactt tttcctcgct 721 ggtggcgaaa ttggtcacca gcgcccatca gttattcccc gtggcaaccc tgacgccaat 781 gttgacaccc tggctgcctt cccgccgtct tgccagatta gtgccttcca tcagttggct 841 gaggagcttg gccacagacc tgtccctgtt gcagctgttc taccaccctg ccccgagctc 901 gaacagggcc ttctctacct gccccaggag ctcaccacct gtgatagtgt cgtaacattt 961 gaattaacag acattgtgca ctgccgcatg gccgccccga gccagcgcaa ggccgtgctg 1021 tccacactcg tgggccgcta cggcggtcgc acaaagctct acaatgcttc ccactctgat 1081 gttcgcgact ctctcgcccg ttttatcccg gccattggcc ccgtacaggt tacaacttgt 1141 gaattgtacg agctagtgga ggccatggtc gagaagggcc aggatggctc cgccgtcctt 1201 gagcttgatc tttgcaaccg tgacgtgtcc aggatcacct tcttccagaa agattgtaac 1261 aagttcacca caggtgagac cattgcccat ggtaaagtgg gccagggcat ctcggcctgg 1321 agcaagacct tctgcgccct ctttggccct tggttccgcg ctattgagaa ggctattctg 1381 gccctgctcc ctcagggtgt gttttacggt gatgcctttg atgacaccgt cttctcggcg 1441 gctgtggccg cagcaaaggc atccatggtg tttgagaatg acttttctga gtttgactcc 1501 acccagaata acttttctct gggtctagag tgtgctatta tggaggagtg tgggatgccg 1561 cagtggctca tccgcctgta tcaccttata aggtctgcgt ggatcttgca ggccccgaag 1621 gagtctctgc gagggttttg gaagaaacac tccggtgagc ccggcactct tctatggaat 1681 actgtctgga atatggccgt tattacccac tgttatgact tccgcgattt tcaggtggct 1741 gcctttaaag gtgatgattc gatagtgctt tgcagtgagt atcgtcagag tccaggagct 1801 gctgtcctga tcgccggctg tggcttgaag ttgaaggtag atttccgccc gatcggtttg 1861 tatgcaggtg ttgtggtggc ccccggcctt ggcgcgctcc ctgatgttgt gcgcttcgcc 1921 ggccggctta ccgagaagaa ttggggccct ggccctgagc gggcggagca gctccgcctc 1981 gctgttagtg atttcctccg caagctcacg aatgtagctc agatgtgtgt ggatgttgtt 2041 tcccgtgttt atggggtttc ccctggactc gttcataacc tgattggcat gctacaggct 2101 gttgctgatg gcaaggcaca tttcactgag tcagtaaaac cagtgctcga cttgacaaat 2161 tcaatcttgt gtcgggtgga atgaataaca tgtcttttgc tgcgcccatg ggttcgcgac 2221 catgcgccct cggcctattt tgttgctgct cctcatgttt ttgcctatgc tgcccgcgcc 2281 accgcccggt cagccgtctg gccgccgtcg tgggcggcgc agcggcggtt ccggcggtgg 2341 tttctggggt gaccgggttg attctcagcc cttcgcaatc ccctatattc atccaaccaa 2401 ccccttcgcc cccgatgtca ccgctgcggc cggggctgga cctcgtgttc gccaacccgc 2461 ccgaccactc ggctccgctt ggcgtgacca ggcccagcgc cccgccgttg cctcacgtcg 2521 tagacctacc acagctgggg ccgcgccgct aaccgcggtc gctccggccc //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATTNT 17249 bp ds-DNA ROD 20-MAY-1987 DEFINITION Rat fast skeletal TnT gene encoding troponin T isoforms, complete cds. ACCESSION M15202 M14514 M14515 M14516 M14517 M14518 M14519 M14520 M14521 M14522 M14523 M14524 M14525 M14527 M14528 M14529 M14530 M14531 M10059 M14532 M14533 M14534 K02429 J00794 KEYWORDS alternate splicing; troponin. SOURCE Rat adult skeletal muscle DNA [2],[4], clone pTnT-1 [2]; myogenic cell line L6E9, cDNA to mRNA, clone pTnT-15 [1],[2],[3]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 12798 to 16893; exons 12, 13, 14, 15, 17, 18) AUTHORS Garfinkel,L.I., Periasamy,M. and Nadal-Ginard,B. TITLE Cloning and characterization of cDNA sequences corresponding to myosin light chains 1, 2, and 3, troponin-C, troponin-T, alpha-tropomyosin, and alpha-actin JOURNAL J. Biol. Chem. 257, 11078-11086 (1982) STANDARD full staff_review REFERENCE 2 (bases 13478 to 16906) AUTHORS Medford,R.M., Nguyen,H.T., Destree,A.T., Summers,E. and Nadal-Ginard,B. TITLE A novel mechanism of alternative RNA splicing for the developmentally regulated generation of troponin T isoforms from a single gene JOURNAL Cell 38, 409-421 (1984) STANDARD full staff_review REFERENCE 3 (bases 313 to 16893; exons and intron/exon boundaries) AUTHORS Breitbart,R.E., Nguyen,H.T., Medford,R.M., Destree,A.T., Mahdavi,V. and Nadal-Ginard,B. TITLE Intricate combinatorial patterns of exon splicing generate multiple regulated troponin T isoforms from a single gene JOURNAL Cell 41, 67-82 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 17249) AUTHORS Breitbart,R.E. and Nadal-Ginard,B. TITLE Complete nucleotide sequence of the fast skeletal troponin T gene: Alternatively spliced exons exhibit unusual interspecies divergence JOURNAL J. Mol. Biol. 188, 313-324 (1986) STANDARD full staff_review REFERENCE 5 (bases 14620 to 14621; sequence correction) AUTHORS Breitbart,R.E. JOURNAL Unpublished (1990) Children's Hosp., Boston MA 02115 STANDARD full staff_review COMMENT Other exon combinations, besides the ones given in the Features table, are possible in theory, but no mRNAs corresponding to the right lengths were found [3]. There still is a question if there might be an alternate exon 9, which would again double the possible exon combinations and proteins. Even though no proteins have been identified for each and every possible protein described below, mRNAs of the appropriate lengths have been identified. The variation in mRNAs encoding tropomyosin T may be due to different half-lives at particular developmental stages [3]. Distinct similarities are found among exons 3, 4, 5, 6, and 8 [4]. These exons may have originated by gene duplication [4]. The variable region of troponin T is represented by exons 4-8 and the constant regions by exons 2 and 3, 9-15 and 18 [3],[4]. Exons 16 and 17 are mutually exclusive [3],[4]. When exon 16 is translated the alpha forms of the proteins are found, and when exon 17 is expressed the beta forms are produced [3],[[4]. Class Ia mRNAs are expressed only after the first postnatal week; class IVb mRNAs are found in L6E9 myotubes, but not in the fetus and are then increasingly expressed postnatally. Class IIa mRNAs increase continually through 14 days. FEATURES from to/span description pept 3146 3162 tropomyosin T class Ia alpha-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ia alpha-1, exon 3 4313 4330 tropomyosin T class Ia alpha-1, exon 4 5538 5555 tropomyosin T class Ia alpha-1, exon 5 5737 5751 tropomyosin T class Ia alpha-1, exon 6 6722 6733 tropomyosin T class Ia alpha-1, exon 7 6935 6949 tropomyosin T class Ia alpha-1, exon 8 9703 9721 tropomyosin T class Ia alpha-1, exon 9 11980 12025 tropomyosin T class Ia alpha-1, exon 10 12591 12707 tropomyosin T class Ia alpha-1, exon 11 12795 12872 tropomyosin T class Ia alpha-1, exon 12 13082 13195 tropomyosin T class Ia alpha-1, exon 13 13295 13404 tropomyosin T class Ia alpha-1, exon 14 13515 13605 tropomyosin T class Ia alpha-1, exon 15 14613 14653 tropomyosin T class Ia alpha-1, exon 16 16666 16720 tropomyosin T class Ia alpha-1, exon 17 pept 3146 3162 tropomyosin T class Ia alpha-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ia alpha-2, exon 3 5538 5555 tropomyosin T class Ia alpha-2, exon 4 5737 5751 tropomyosin T class Ia alpha-2, exon 5 6722 6733 tropomyosin T class Ia alpha-2, exon 6 6935 6949 tropomyosin T class Ia alpha-2, exon 7 9703 9721 tropomyosin T class Ia alpha-2, exon 8 11980 12025 tropomyosin T class Ia alpha-2, exon 9 12591 12707 tropomyosin T class Ia alpha-2, exon 10 12795 12872 tropomyosin T class Ia alpha-2, exon 11 13082 13195 tropomyosin T class Ia alpha-2, exon 12 13295 13404 tropomyosin T class Ia alpha-2, exon 13 13515 13605 tropomyosin T class Ia alpha-2, exon 14 14613 14653 tropomyosin T class Ia alpha-2, exon 15 16666 16720 tropomyosin T class Ia alpha-2, exon 16 pept 3146 3162 tropomyosin T class Ib alpha-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ib alpha-1, exon 3 4313 4330 tropomyosin T class Ib alpha-1, exon 4 5737 5751 tropomyosin T class Ib alpha-1, exon 5 6722 6733 tropomyosin T class Ib alpha-1, exon 6 6935 6949 tropomyosin T class Ib alpha-1, exon 7 9703 9721 tropomyosin T class Ib alpha-1, exon 8 11980 12025 tropomyosin T class Ib alpha-1, exon 9 12591 12707 tropomyosin T class Ib alpha-1, exon 10 12795 12872 tropomyosin T class Ib alpha-1, exon 11 13082 13195 tropomyosin T class Ib alpha-1, exon 12 13295 13404 tropomyosin T class Ib alpha-1, exon 13 13515 13605 tropomyosin T class Ib alpha-1, exon 14 14613 14653 tropomyosin T class Ib alpha-1, exon 15 16666 16720 tropomyosin T class Ib alpha-1, exon 16 pept 3146 3162 tropomyosin T class Ib alpha-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ib alpha-2, exon 3 5737 5751 tropomyosin T class Ib alpha-2, exon 4 6722 6733 tropomyosin T class Ib alpha-2, exon 5 6935 6949 tropomyosin T class Ib alpha-2, exon 6 9703 9721 tropomyosin T class Ib alpha-2, exon 7 11980 12025 tropomyosin T class Ib alpha-2, exon 8 12591 12707 tropomyosin T class Ib alpha-2, exon 9 12795 12872 tropomyosin T class Ib alpha-2, exon 10 13082 13195 tropomyosin T class Ib alpha-2, exon 11 13295 13404 tropomyosin T class Ib alpha-2, exon 12 13515 13605 tropomyosin T class Ib alpha-2, exon 13 14613 14653 tropomyosin T class Ib alpha-2, exon 14 16666 16720 tropomyosin T class Ib alpha-2, exon 15 pept 3146 3162 tropomyosin T class Ic alpha-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ic alpha-1, exon 3 4313 4330 tropomyosin T class Ic alpha-1, exon 4 5538 5555 tropomyosin T class Ic alpha-1, exon 5 6722 6733 tropomyosin T class Ic alpha-1, exon 6 6935 6949 tropomyosin T class Ic alpha-1, exon 7 9703 9721 tropomyosin T class Ic alpha-1, exon 8 11980 12025 tropomyosin T class Ic alpha-1, exon 9 12591 12707 tropomyosin T class Ic alpha-1, exon 10 12795 12872 tropomyosin T class Ic alpha-1, exon 11 13082 13195 tropomyosin T class Ic alpha-1, exon 12 13295 13404 tropomyosin T class Ic alpha-1, exon 13 13515 13605 tropomyosin T class Ic alpha-1, exon 14 14613 14653 tropomyosin T class Ic alpha-1, exon 15 16666 16720 tropomyosin T class Ic alpha-1, exon 16 pept 3146 3162 tropomyosin T class Ic alpha-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ic alpha-2, exon 3 4313 4330 tropomyosin T class Ic alpha-2, exon 4 6722 6733 tropomyosin T class Ic alpha-2, exon 5 6935 6949 tropomyosin T class Ic alpha-2, exon 6 9703 9721 tropomyosin T class Ic alpha-2, exon 7 11980 12025 tropomyosin T class Ic alpha-2, exon 8 12591 12707 tropomyosin T class Ic alpha-2, exon 9 12795 12872 tropomyosin T class Ic alpha-2, exon 10 13082 13195 tropomyosin T class Ic alpha-2, exon 11 13295 13404 tropomyosin T class Ic alpha-2, exon 12 13515 13605 tropomyosin T class Ic alpha-2, exon 13 14613 14653 tropomyosin T class Ic alpha-2, exon 14 16666 16720 tropomyosin T class Ic alpha-2, exon 15 pept 3146 3162 tropomyosin T class Ic alpha-3, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ic alpha-3, exon 3 5538 5555 tropomyosin T class Ic alpha-3, exon 4 6722 6733 tropomyosin T class Ic alpha-3, exon 5 6935 6949 tropomyosin T class Ic alpha-3, exon 6 9703 9721 tropomyosin T class Ic alpha-3, exon 7 11980 12025 tropomyosin T class Ic alpha-3, exon 8 12591 12707 tropomyosin T class Ic alpha-3, exon 9 12795 12872 tropomyosin T class Ic alpha-3, exon 10 13082 13195 tropomyosin T class Ic alpha-3, exon 11 13295 13404 tropomyosin T class Ic alpha-3, exon 12 13515 13605 tropomyosin T class Ic alpha-3, exon 13 14613 14653 tropomyosin T class Ic alpha-3, exon 14 16666 16720 tropomyosin T class Ic alpha-3, exon 15 pept 3146 3162 tropomyosin T class Ic alpha-4, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ic alpha-4, exon 3 6722 6733 tropomyosin T class Ic alpha-4, exon 4 6935 6949 tropomyosin T class Ic alpha-4, exon 5 9703 9721 tropomyosin T class Ic alpha-4, exon 6 11980 12025 tropomyosin T class Ic alpha-4, exon 7 12591 12707 tropomyosin T class Ic alpha-4, exon 8 12795 12872 tropomyosin T class Ic alpha-4, exon 9 13082 13195 tropomyosin T class Ic alpha-4, exon 10 13295 13404 tropomyosin T class Ic alpha-4, exon 11 13515 13605 tropomyosin T class Ic alpha-4, exon 12 14613 14653 tropomyosin T class Ic alpha-4, exon 13 16666 16720 tropomyosin T class Ic alpha-4, exon 14 pept 3146 3162 tropomyosin T class Ia beta-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ia beta-1, exon 3 4313 4330 tropomyosin T class Ia beta-1, exon 4 5538 5555 tropomyosin T class Ia beta-1, exon 5 5737 5751 tropomyosin T class Ia beta-1, exon 6 6722 6733 tropomyosin T class Ia beta-1, exon 7 6935 6949 tropomyosin T class Ia beta-1, exon 8 9703 9721 tropomyosin T class Ia beta-1, exon 9 11980 12025 tropomyosin T class Ia beta-1, exon 10 12591 12707 tropomyosin T class Ia beta-1, exon 11 12795 12872 tropomyosin T class Ia beta-1, exon 12 13082 13195 tropomyosin T class Ia beta-1, exon 13 13295 13404 tropomyosin T class Ia beta-1, exon 14 13515 13605 tropomyosin T class Ia beta-1, exon 15 15251 15291 tropomyosin T class Ia beta-1, exon 16 16666 16720 tropomyosin T class Ia beta-1, exon 17 pept 3146 3162 tropomyosin T class Ia beta-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ia beta-2, exon 3 5538 5555 tropomyosin T class Ia beta-2, exon 4 5737 5751 tropomyosin T class Ia beta-2, exon 5 6722 6733 tropomyosin T class Ia beta-2, exon 6 6935 6949 tropomyosin T class Ia beta-2, exon 7 9703 9721 tropomyosin T class Ia beta-2, exon 8 11980 12025 tropomyosin T class Ia beta-2, exon 9 12591 12707 tropomyosin T class Ia beta-2, exon 10 12795 12872 tropomyosin T class Ia beta-2, exon 11 13082 13195 tropomyosin T class Ia beta-2, exon 12 13295 13404 tropomyosin T class Ia beta-2, exon 13 13515 13605 tropomyosin T class Ia beta-2, exon 14 15251 15291 tropomyosin T class Ia beta-2, exon 15 16666 16720 tropomyosin T class Ia beta-2, exon 16 pept 3146 3162 tropomyosin T class Ib beta-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ib beta-1, exon 3 4313 4330 tropomyosin T class Ib beta-1, exon 4 5737 5751 tropomyosin T class Ib beta-1, exon 5 6722 6733 tropomyosin T class Ib beta-1, exon 6 6935 6949 tropomyosin T class Ib beta-1, exon 7 9703 9721 tropomyosin T class Ib beta-1, exon 8 11980 12025 tropomyosin T class Ib beta-1, exon 9 12591 12707 tropomyosin T class Ib beta-1, exon 10 12795 12872 tropomyosin T class Ib beta-1, exon 11 13082 13195 tropomyosin T class Ib beta-1, exon 12 13295 13404 tropomyosin T class Ib beta-1, exon 13 13515 13605 tropomyosin T class Ib beta-1, exon 14 15251 15291 tropomyosin T class Ib beta-1, exon 15 16666 16720 tropomyosin T class Ib beta-1, exon 16 pept 3146 3162 tropomyosin T class Ib beta-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ib beta-2, exon 3 5737 5751 tropomyosin T class Ib beta-2, exon 4 6722 6733 tropomyosin T class Ib beta-2, exon 5 6935 6949 tropomyosin T class Ib beta-2, exon 6 9703 9721 tropomyosin T class Ib beta-2, exon 7 11980 12025 tropomyosin T class Ib beta-2, exon 8 12591 12707 tropomyosin T class Ib beta-2, exon 9 12795 12872 tropomyosin T class Ib beta-2, exon 10 13082 13195 tropomyosin T class Ib beta-2, exon 11 13295 13404 tropomyosin T class Ib beta-2, exon 12 13515 13605 tropomyosin T class Ib beta-2, exon 13 15251 15291 tropomyosin T class Ib beta-2, exon 14 16666 16720 tropomyosin T class Ib beta-2, exon 15 pept 3146 3162 tropomyosin T class Ic beta-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ic beta-1, exon 3 4313 4330 tropomyosin T class Ic beta-1, exon 4 5538 5555 tropomyosin T class Ic beta-1, exon 5 6722 6733 tropomyosin T class Ic beta-1, exon 6 6935 6949 tropomyosin T class Ic beta-1, exon 7 9703 9721 tropomyosin T class Ic beta-1, exon 8 11980 12025 tropomyosin T class Ic beta-1, exon 9 12591 12707 tropomyosin T class Ic beta-1, exon 10 12795 12872 tropomyosin T class Ic beta-1, exon 11 13082 13195 tropomyosin T class Ic beta-1, exon 12 13295 13404 tropomyosin T class Ic beta-1, exon 13 13515 13605 tropomyosin T class Ic beta-1, exon 14 15251 15291 tropomyosin T class Ic beta-1, exon 15 16666 16720 tropomyosin T class Ic beta-1, exon 16 pept 3146 3162 tropomyosin T class Ic beta-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ic beta-2, exon 3 4313 4330 tropomyosin T class Ic beta-2, exon 4 6722 6733 tropomyosin T class Ic beta-2, exon 5 6935 6949 tropomyosin T class Ic beta-2, exon 6 9703 9721 tropomyosin T class Ic beta-2, exon 7 11980 12025 tropomyosin T class Ic beta-2, exon 8 12591 12707 tropomyosin T class Ic beta-2, exon 9 12795 12872 tropomyosin T class Ic beta-2, exon 10 13082 13195 tropomyosin T class Ic beta-2, exon 11 13295 13404 tropomyosin T class Ic beta-2, exon 12 13515 13605 tropomyosin T class Ic beta-2, exon 13 15251 15291 tropomyosin T class Ic beta-2, exon 14 16666 16720 tropomyosin T class Ic beta-2, exon 15 pept 3146 3162 tropomyosin T class Ic beta-3, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ic beta-3, exon 3 5538 5555 tropomyosin T class Ic beta-3, exon 4 6722 6733 tropomyosin T class Ic beta-3, exon 5 6935 6949 tropomyosin T class Ic beta-3, exon 6 9703 9721 tropomyosin T class Ic beta-3, exon 7 11980 12025 tropomyosin T class Ic beta-3, exon 8 12591 12707 tropomyosin T class Ic beta-3, exon 9 12795 12872 tropomyosin T class Ic beta-3, exon 10 13082 13195 tropomyosin T class Ic beta-3, exon 11 13295 13404 tropomyosin T class Ic beta-3, exon 12 13515 13605 tropomyosin T class Ic beta-3, exon 13 15251 15291 tropomyosin T class Ic beta-3, exon 14 16666 16720 tropomyosin T class Ic beta-3, exon 15 pept 3146 3162 tropomyosin T class Ic beta-4, exon 2 (first expressed exon) 3344 3357 tropomyosin T class Ic beta-4, exon 3 6722 6733 tropomyosin T class Ic beta-4, exon 4 6935 6949 tropomyosin T class Ic beta-4, exon 5 9703 9721 tropomyosin T class Ic beta-4, exon 6 11980 12025 tropomyosin T class Ic beta-4, exon 7 12591 12707 tropomyosin T class Ic beta-4, exon 8 12795 12872 tropomyosin T class Ic beta-4, exon 9 13082 13195 tropomyosin T class Ic beta-4, exon 10 13295 13404 tropomyosin T class Ic beta-4, exon 11 13515 13605 tropomyosin T class Ic beta-4, exon 12 15251 15291 tropomyosin T class Ic beta-4, exon 13 16666 16720 tropomyosin T class Ic beta-4, exon 14 pept 3146 3162 tropomyosin T class IIa alpha-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIa alpha-1, exon 3 4313 4330 tropomyosin T class IIa alpha-1, exon 4 5538 5555 tropomyosin T class IIa alpha-1, exon 5 5737 5751 tropomyosin T class IIa alpha-1, exon 6 6935 6949 tropomyosin T class IIa alpha-1, exon 7 9703 9721 tropomyosin T class IIa alpha-1, exon 8 11980 12025 tropomyosin T class IIa alpha-1, exon 9 12591 12707 tropomyosin T class IIa alpha-1, exon 10 12795 12872 tropomyosin T class IIa alpha-1, exon 11 13082 13195 tropomyosin T class IIa alpha-1, exon 12 13295 13404 tropomyosin T class IIa alpha-1, exon 13 13515 13605 tropomyosin T class IIa alpha-1, exon 14 14613 14653 tropomyosin T class IIa alpha-1, exon 15 16666 16720 tropomyosin T class IIa alpha-1, exon 16 pept 3146 3162 tropomyosin T class IIa alpha-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIa alpha-2, exon 3 4313 4330 tropomyosin T class IIa alpha-2, exon 4 5737 5751 tropomyosin T class IIa alpha-2, exon 5 6935 6949 tropomyosin T class IIa alpha-2, exon 6 9703 9721 tropomyosin T class IIa alpha-2, exon 7 11980 12025 tropomyosin T class IIa alpha-2, exon 8 12591 12707 tropomyosin T class IIa alpha-2, exon 9 12795 12872 tropomyosin T class IIa alpha-2, exon 10 13082 13195 tropomyosin T class IIa alpha-2, exon 11 13295 13404 tropomyosin T class IIa alpha-2, exon 12 13515 13605 tropomyosin T class IIa alpha-2, exon 13 14613 14653 tropomyosin T class IIa alpha-2, exon 14 16666 16720 tropomyosin T class IIa alpha-2, exon 15 pept 3146 3162 tropomyosin T class IIa alpha-3, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIa alpha-3, exon 3 5538 5555 tropomyosin T class IIa alpha-3, exon 4 5737 5751 tropomyosin T class IIa alpha-3, exon 5 6935 6949 tropomyosin T class IIa alpha-3, exon 6 9703 9721 tropomyosin T class IIa alpha-3, exon 7 11980 12025 tropomyosin T class IIa alpha-3, exon 8 12591 12707 tropomyosin T class IIa alpha-3, exon 9 12795 12872 tropomyosin T class IIa alpha-3, exon 10 13082 13195 tropomyosin T class IIa alpha-3, exon 11 13295 13404 tropomyosin T class IIa alpha-3, exon 12 13515 13605 tropomyosin T class IIa alpha-3, exon 13 14613 14653 tropomyosin T class IIa alpha-3, exon 14 16666 16720 tropomyosin T class IIa alpha-3, exon 15 pept 3146 3162 tropomyosin T class IIa alpha-4, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIa alpha-4, exon 3 5737 5751 tropomyosin T class IIa alpha-4, exon 4 6935 6949 tropomyosin T class IIa alpha-4, exon 5 9703 9721 tropomyosin T class IIa alpha-4, exon 6 11980 12025 tropomyosin T class IIa alpha-4, exon 7 12591 12707 tropomyosin T class IIa alpha-4, exon 8 12795 12872 tropomyosin T class IIa alpha-4, exon 9 13082 13195 tropomyosin T class IIa alpha-4, exon 10 13295 13404 tropomyosin T class IIa alpha-4, exon 11 13515 13605 tropomyosin T class IIa alpha-4, exon 12 14613 14653 tropomyosin T class IIa alpha-4, exon 13 16666 16720 tropomyosin T class IIa alpha-4, exon 14 pept 3146 3162 tropomyosin T class IIa beta-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIa beta-1, exon 3 4313 4330 tropomyosin T class IIa beta-1, exon 4 5538 5555 tropomyosin T class IIa beta-1, exon 5 5737 5751 tropomyosin T class IIa beta-1, exon 6 6935 6949 tropomyosin T class IIa beta-1, exon 7 9703 9721 tropomyosin T class IIa beta-1, exon 8 11980 12025 tropomyosin T class IIa beta-1, exon 9 12591 12707 tropomyosin T class IIa beta-1, exon 10 12795 12872 tropomyosin T class IIa beta-1, exon 11 13082 13195 tropomyosin T class IIa beta-1, exon 12 13295 13404 tropomyosin T class IIa beta-1, exon 13 13515 13605 tropomyosin T class IIa beta-1, exon 14 15251 15291 tropomyosin T class IIa beta-1, exon 15 16666 16720 tropomyosin T class IIa beta-1, exon 16 pept 3146 3162 tropomyosin T class IIa beta-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIa beta-2, exon 3 4313 4330 tropomyosin T class IIa beta-2, exon 4 5737 5751 tropomyosin T class IIa beta-2, exon 5 6935 6949 tropomyosin T class IIa beta-2, exon 6 9703 9721 tropomyosin T class IIa beta-2, exon 7 11980 12025 tropomyosin T class IIa beta-2, exon 8 12591 12707 tropomyosin T class IIa beta-2, exon 9 12795 12872 tropomyosin T class IIa beta-2, exon 10 13082 13195 tropomyosin T class IIa beta-2, exon 11 13295 13404 tropomyosin T class IIa beta-2, exon 12 13515 13605 tropomyosin T class IIa beta-2, exon 13 15251 15291 tropomyosin T class IIa beta-2, exon 14 16666 16720 tropomyosin T class IIa beta-2, exon 15 pept 3146 3162 tropomyosin T class IIa beta-3, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIa beta-3, exon 3 5538 5555 tropomyosin T class IIa beta-3, exon 4 5737 5751 tropomyosin T class IIa beta-3, exon 5 6935 6949 tropomyosin T class IIa beta-3, exon 6 9703 9721 tropomyosin T class IIa beta-3, exon 7 11980 12025 tropomyosin T class IIa beta-3, exon 8 12591 12707 tropomyosin T class IIa beta-3, exon 9 12795 12872 tropomyosin T class IIa beta-3, exon 10 13082 13195 tropomyosin T class IIa beta-3, exon 11 13295 13404 tropomyosin T class IIa beta-3, exon 12 13515 13605 tropomyosin T class IIa beta-3, exon 13 15251 15291 tropomyosin T class IIa beta-3, exon 14 16666 16720 tropomyosin T class IIa beta-3, exon 15 pept 3146 3162 tropomyosin T class IIa beta-4, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIa beta-4, exon 3 5737 5751 tropomyosin T class IIa beta-4, exon 4 6935 6949 tropomyosin T class IIa beta-4, exon 5 9703 9721 tropomyosin T class IIa beta-4, exon 6 11980 12025 tropomyosin T class IIa beta-4, exon 7 12591 12707 tropomyosin T class IIa beta-4, exon 8 12795 12872 tropomyosin T class IIa beta-4, exon 9 13082 13195 tropomyosin T class IIa beta-4, exon 10 13295 13404 tropomyosin T class IIa beta-4, exon 11 13515 13605 tropomyosin T class IIa beta-4, exon 12 15251 15291 tropomyosin T class IIa beta-4, exon 13 16666 16720 tropomyosin T class IIa beta-4, exon 14 pept 3146 3162 tropomyosin T class IIIa alpha, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIIa alpha, exon 3 4313 4330 tropomyosin T class IIIa alpha, exon 4 5538 5555 tropomyosin T class IIIa alpha, exon 5 9703 9721 tropomyosin T class IIIa alpha, exon 6 11980 12025 tropomyosin T class IIIa alpha, exon 7 12591 12707 tropomyosin T class IIIa alpha, exon 8 12795 12872 tropomyosin T class IIIa alpha, exon 9 13082 13195 tropomyosin T class IIIa alpha, exon 10 13295 13404 tropomyosin T class IIIa alpha, exon 11 13515 13605 tropomyosin T class IIIa alpha, exon 12 14613 14653 tropomyosin T class IIIa alpha, exon 13 16666 16720 tropomyosin T class IIIa alpha, exon 14 pept 3146 3162 tropomyosin T class IIIb alpha, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIIb alpha, exon 3 5538 5555 tropomyosin T class IIIb alpha, exon 4 9703 9721 tropomyosin T class IIIb alpha, exon 5 11980 12025 tropomyosin T class IIIb alpha, exon 6 12591 12707 tropomyosin T class IIIb alpha, exon 7 12795 12872 tropomyosin T class IIIb alpha, exon 8 13082 13195 tropomyosin T class IIIb alpha, exon 9 13295 13404 tropomyosin T class IIIb alpha, exon 10 13515 13605 tropomyosin T class IIIb alpha, exon 11 14613 14653 tropomyosin T class IIIb alpha, exon 12 16666 16720 tropomyosin T class IIIb alpha, exon 13 pept 3146 3162 tropomyosin T class IIIa beta, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIIa beta, exon 3 4313 4330 tropomyosin T class IIIa beta, exon 4 5538 5555 tropomyosin T class IIIa beta, exon 5 9703 9721 tropomyosin T class IIIa beta, exon 6 11980 12025 tropomyosin T class IIIa beta, exon 7 12591 12707 tropomyosin T class IIIa beta, exon 8 12795 12872 tropomyosin T class IIIa beta, exon 9 13082 13195 tropomyosin T class IIIa beta, exon 10 13295 13404 tropomyosin T class IIIa beta, exon 11 13515 13605 tropomyosin T class IIIa beta, exon 12 15251 15291 tropomyosin T class IIIa beta, exon 13 16666 16720 tropomyosin T class IIIa beta, exon 14 pept 3146 3162 tropomyosin T class IIIb beta, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IIIb beta, exon 3 5538 5555 tropomyosin T class IIIb beta, exon 4 9703 9721 tropomyosin T class IIIb beta, exon 5 11980 12025 tropomyosin T class IIIb beta, exon 6 12591 12707 tropomyosin T class IIIb beta, exon 7 12795 12872 tropomyosin T class IIIb beta, exon 8 13082 13195 tropomyosin T class IIIb beta, exon 9 13295 13404 tropomyosin T class IIIb beta, exon 10 13515 13605 tropomyosin T class IIIb beta, exon 11 15251 15291 tropomyosin T class IIIb beta, exon 12 16666 16720 tropomyosin T class IIIb beta, exon 13 pept 3146 3162 tropomyosin T class IVa alpha, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IVa alpha, exon 3 5538 5555 tropomyosin T class IVa alpha, exon 4 5737 5751 tropomyosin T class IVa alpha, exon 5 6722 6733 tropomyosin T class IVa alpha, exon 6 9703 9721 tropomyosin T class IVa alpha, exon 7 11980 12025 tropomyosin T class IVa alpha, exon 8 12591 12707 tropomyosin T class IVa alpha, exon 9 12795 12872 tropomyosin T class IVa alpha, exon 10 13082 13195 tropomyosin T class IVa alpha, exon 11 13295 13404 tropomyosin T class IVa alpha, exon 12 13515 13605 tropomyosin T class IVa alpha, exon 13 14613 14653 tropomyosin T class IVa alpha, exon 14 16666 16720 tropomyosin T class IVa alpha, exon 15 pept 3146 3162 tropomyosin T class IVb alpha, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IVb alpha, exon 3 4313 4330 tropomyosin T class IVb alpha, exon 4 5538 5555 tropomyosin T class IVb alpha, exon 5 5737 5751 tropomyosin T class IVb alpha, exon 6 6722 6733 tropomyosin T class IVb alpha, exon 7 9703 9721 tropomyosin T class IVb alpha, exon 8 11980 12025 tropomyosin T class IVb alpha, exon 9 12591 12707 tropomyosin T class IVb alpha, exon 10 12795 12872 tropomyosin T class IVb alpha, exon 11 13082 13195 tropomyosin T class IVb alpha, exon 12 13295 13404 tropomyosin T class IVb alpha, exon 13 13515 13605 tropomyosin T class IVb alpha, exon 14 14613 14653 tropomyosin T class IVb alpha, exon 15 16666 16720 tropomyosin T class IVb alpha, exon 16 pept 3146 3162 tropomyosin T class IVc alpha-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IVc alpha-1, exon 3 4313 4330 tropomyosin T class IVc alpha-1, exon 4 5737 5751 tropomyosin T class IVc alpha-1, exon 5 6722 6733 tropomyosin T class IVc alpha-1, exon 6 9703 9721 tropomyosin T class IVc alpha-1, exon 7 11980 12025 tropomyosin T class IVc alpha-1, exon 8 12591 12707 tropomyosin T class IVc alpha-1, exon 9 12795 12872 tropomyosin T class IVc alpha-1, exon 10 13082 13195 tropomyosin T class IVc alpha-1, exon 11 13295 13404 tropomyosin T class IVc alpha-1, exon 12 13515 13605 tropomyosin T class IVc alpha-1, exon 13 14613 14653 tropomyosin T class IVc alpha-1, exon 14 16666 16720 tropomyosin T class IVc alpha-1, exon 15 pept 3146 3162 tropomyosin T class IVc alpha-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IVc alpha-2, exon 3 5737 5751 tropomyosin T class IVc alpha-2, exon 4 6722 6733 tropomyosin T class IVc alpha-2, exon 5 9703 9721 tropomyosin T class IVc alpha-2, exon 6 11980 12025 tropomyosin T class IVc alpha-2, exon 7 12591 12707 tropomyosin T class IVc alpha-2, exon 8 12795 12872 tropomyosin T class IVc alpha-2, exon 9 13082 13195 tropomyosin T class IVc alpha-2, exon 10 13295 13404 tropomyosin T class IVc alpha-2, exon 11 13515 13605 tropomyosin T class IVc alpha-2, exon 12 14613 14653 tropomyosin T class IVc alpha-2, exon 13 16666 16720 tropomyosin T class IVc alpha-2, exon 14 pept 3146 3162 tropomyosin T class IVd alpha-1, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IVd alpha-1, exon 3 4313 4330 tropomyosin T class IVd alpha-1, exon 4 5538 5555 tropomyosin T class IVd alpha-1, exon 5 6722 6733 tropomyosin T class IVd alpha-1, exon 6 9703 9721 tropomyosin T class IVd alpha-1, exon 7 11980 12025 tropomyosin T class IVd alpha-1, exon 8 12591 12707 tropomyosin T class IVd alpha-1, exon 9 12795 12872 tropomyosin T class IVd alpha-1, exon 10 13082 13195 tropomyosin T class IVd alpha-1, exon 11 13295 13404 tropomyosin T class IVd alpha-1, exon 12 13515 13605 tropomyosin T class IVd alpha-1, exon 13 14613 14653 tropomyosin T class IVd alpha-1, exon 14 16666 16720 tropomyosin T class IVd alpha-1, exon 15 pept 3146 3162 tropomyosin T class IVd alpha-2, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IVd alpha-2, exon 3 4313 4330 tropomyosin T class IVd alpha-2, exon 4 6722 6733 tropomyosin T class IVd alpha-2, exon 5 9703 9721 tropomyosin T class IVd alpha-2, exon 6 11980 12025 tropomyosin T class IVd alpha-2, exon 7 12591 12707 tropomyosin T class IVd alpha-2, exon 8 12795 12872 tropomyosin T class IVd alpha-2, exon 9 13082 13195 tropomyosin T class IVd alpha-2, exon 10 13295 13404 tropomyosin T class IVd alpha-2, exon 11 13515 13605 tropomyosin T class IVd alpha-2, exon 12 14613 14653 tropomyosin T class IVd alpha-2, exon 13 16666 16720 tropomyosin T class IVd alpha-2, exon 14 pept 3146 3162 tropomyosin T class IVd alpha-3, exon 2 (first expressed exon) 3344 3357 tropomyosin T class IVd alpha-3, exon 3 5538 5555 tropomyosin T class IVd alpha-3, exon 4
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS TRSSTOBAA 360 bp ss-RNA VRL 14-MAR-1990 DEFINITION Tobacco ringspot virus (TobRV) mRNA. ACCESSION M17439 KEYWORDS . SOURCE Tobacco ringspot virus (isolate 62L), cDNA to viral RNA. ORGANISM Tobacco ringspot virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Nepovirus. REFERENCE 1 (bases 1 to 360) AUTHORS Buzayan,J.M., McNinch,J.S., Schneider,I.R. and Bruening,G. TITLE A nucleotide sequence rearrangement distinguishes two isolates of satellite tobacco ringspot virus RNA JOURNAL Virology 160, 95-99 (1987) STANDARD simple staff_review BASE COUNT 70 a 90 c 110 g 90 t ORIGIN 1 accggatgtg ctttccggtc tgatgagtcc gtgaggacga aacaggactg tcaggtggcc 61 gaaagccacc acgtaaacta atgaaccgtg ctgcgtagcg ttgtgcctcg tggaggtggg 121 atgccacctc gtggagcagc cttcgtgtgg gcgcggcggt gtagttagtc aaggcgtacc 181 aggtaatata ccacaacgtg tgtttctctg gttgacttct ctgtttgttg tgtcattagg 241 tcccgggtct cgcattagcg gcgacggggt attctcttac gacatggaag tttgagagac 301 cgcgcctcta cactatgcgc ggccggggcg aatccaaata ttctagcccg ataccctgtc //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATMYHOA 260 bp ss-mRNA ROD 14-MAR-1990 DEFINITION Rat cardiac beta-myosin heavy chain mRNA, 3' end, clone pCMHC-21. ACCESSION M32697 KEYWORDS beta-myosin; myosin; myosin heavy chain. SOURCE Rat (strain Wistar) three months-old ventricle, cDNA to mRNA, clone pCMHC-21. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 260) AUTHORS Mahdavi,V., Lompre,A.M., Chambers,A.P. and Nadal-Ginard,B. TITLE Cardiac myosin heavy chain isozymic transitions during development and under pathological conditions are regulated at the level of mRNA availability JOURNAL Eur. Heart J. 5, 181-191 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 204 beta-myosin heavy chain (AA at 1) BASE COUNT 77 a 69 c 79 g 35 t ORIGIN 1 ctgcagttga aggtgaaggc ctacaagcgc caggctgaag aggcggagga acaggccaac 61 accaacctgt ccaagttccg caaggtgcag cacgagctgg atgaggcaga ggagagggcg 121 gacattgccg agtcccaggt caacaagctg cgggccaaga gccgtgacat tggcgccaag 181 cagaaaatcc acgatgagga ataacctgtc cagcagaaag agcctcgccg ttgccatccc 241 acaataaata cgaatgttcg // LOCUS RATMYHOB 303 bp ss-mRNA ROD 14-MAR-1990 DEFINITION Rat cardiac beta-myosin heavy chain mRNA, 3' end, clone pCMHC-5. ACCESSION M32698 KEYWORDS beta-myosin; myosin; myosin heavy chain. SOURCE Rat (strain Wistar) three months-old ventricle, cDNA to mRNA, clone pCMHC-5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Mahdavi,V., Lompre,A.M., Chambers,A.P. and Nadal-Ginard,B. TITLE Cardiac myosin heavy chain isozymic transitions during development and under pathological conditions are regulated at the level of mRNA availability JOURNAL Eur. Heart J. 5, 181-191 (1984) STANDARD simle staff_review FEATURES from to/span description pept < 1 198 beta-myosin heavy chain (AA at 1) BASE COUNT 87 a 80 c 89 g 47 t ORIGIN 1 ctgcagttga aggtgaaggc ctacaagcgc caggctgaag aggcggagga acaggccaac 61 accaacctgt ccaagttccg caaggtgcag cacgagctgg atgaggcaga ggagagggcg 121 gacattgccg agtcccaggt caacaagctg cgggccaaga gccgtgacat tggcgccaag 181 ggcctgaatg aagagtagat cttgctctac ccaaccctaa ggatgcctgt gaagccctga 241 gacctggagc ctttgaaaag caccttcagg cagaaacaca ataaagcaat tttccttcaa 301 gcc //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS HUMP42LA 2465 bp ss-mRNA PRI 11-JAN-1990 DEFINITION Human erythrocyte membrane protein 4.2 (HUMP4.2L) mRNA, complete cds. ACCESSION M30647 KEYWORDS . SOURCE Human sickle cell patient reticulocyte, cDNA to mRNA, clones 7 and 12. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2465) AUTHORS Sung,L.A., Chien,S., Chang,L.-S., Lambert,K., Bliss,S.A., Bouhassira,E.E., Nagel,R.L., Schwartz,R.S. and Rybicki,A.C. TITLE Molecular cloning of human protein 4.2: A major component of the red erythrocyte membrane JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 955-959 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.Sung, 12-DEC-1989, for release after publication. FEATURES from to/span description pept 228 2393 protein 4.2 (HUMP4.2L) (put.) mRNA < 1 2465 protein 4.2 (HUMP4.2L) mRNA BASE COUNT 603 a 661 c 697 g 504 t ORIGIN 1 gaactggtcc aggagatacc agcagagagg gagtaggaga gaagaaacat gtcagggtgc 61 tcacaggagt agtgggggga ggttttgcta tttccagatt cttaagccaa caaaagtgcc 121 ttcatatttt ctgtctggaa gacagaaagc ccagaaggag cccagaagca acagtttgag 181 agaggcgctt tctgcggcca agtggataag aggagcggcc tgcaaccatg ggacagggtg 241 agccaagcca gcgctcgaca gggcttgctg gactgtatgc agcccccgca gcatcacctg 301 ttttcattaa aggaagtggg atggatgccc tgggtatcaa gagctgtgac tttcaggcag 361 caagaaacaa tgaggagcac cacaccaagg ccctcagctc ccggcgcctc tttgtgagga 421 gggggcagcc cttcaccatc atcctgtact tccgcgctcc agtccgtgca tttctgcctg 481 ccctgaagaa ggtggccctc actgcacaaa ctggagagca gccttccaag atcaacagga 541 cccaagccac attcccaatt tccagtctgg gggaccgaaa gtggtggagt gcagtggtgg 601 aggagagaga tgcccagtcc tggaccatct ctgtgaccac acctgcagac gctgtcattg 661 gccactactc gcttctgctg caggtctcag gcaggaagca actcctcttg ggtcagttca 721 cactgctttt taacccctgg aatagagagg atgctgtgtt cctgaagaat gaggctcagc 781 gcatggagta cttgttgaac cagaatggtc tcatctacct gggtacagct gactgcatcc 841 aggcagagtc ctgggacttt ggccagttcg agggggatgt cattgacctc agcctgcgct 901 tgctgagcaa ggacaagcag gtagagaagt ggagccagcc ggtgcacgtg gcccgtgtgt 961 tgggtgcctt gctgcatttt ctcaaggagc agagggtcct gcccaccccg cagacccagg 1021 ccacccagga aggggccttg ctgaacaagc gccggggcag cgtgcccatc ctgcggcagt 1081 ggctcaccgg ccgaggccga cctgtgtatg atggccaggc ctgggtgttg gctgctgttg 1141 cttgcacagt gctgcgatgc ctgggaatcc ctgcccgcgt ggtgaccacg tttgcctcag 1201 cacagggcac cggtgggcgt cttctcatag atgaatacta taatgaggag ggacttcaga 1261 acggagaagg ccagagaggc agaatctgga tcttccagac ttccacagag tgctggatga 1321 agcgcggcct gccttgccag ggttatgatg gatggcagat tctgcaccca agtgctccta 1381 atggaggtgg agtcctgggg tcctgtgatc tggtgccggt cagagcagtc aaggagggga 1441 cgctggggct gaccccagca gtgtcagacc tttttgctgc cataaatgcc tcatgtgtgg 1501 tctggaagtg ctgtgaggat gggacactgg agttgactga ctccaacaca aagtatgttg 1561 gcaacaacat cagcaccaag ggtgtgggca gtgaccgctg cgaggacatc actcagaact 1621 acaagtatcc tgaagggtct cttcaggaaa aagaggtgct ggagagagtc gagaaagaga 1681 aaatggaacg tgagaaagac aacggcatcc gtcctcccag tctcgagact gccagtcctc 1741 tgtacctgct cttgaaagca cccagctccc tacccctgag aggggatgcc cagatctcag 1801 tgacgctggt taatcacagt gagcaggaga aggcagtgca gctggcaatt ggggtccagg 1861 ctgtacacta caacggtgtc cttgctgcca agctctggag gaagaagctg cacctcacgc 1921 tcagtgccaa cctggaaaag ataataacca tcggcctgtt cttctccaat tttgagcgaa 1981 acccacccga gaacaccttc cttagactca ccgccatggc aacacactct gaatccaacc 2041 ttagctgctt tgctcaggaa gacattgcca tttgtagacc acaccttgcc atcaagatgc 2101 cagagaaagc agagcagtat caacccctca cagcctcagt cagcctccag aactccctag 2161 atgcccccat ggaggactgt gtgatctcca tcctgggaag ggggctcatt cacagagaga 2221 ggagctacag attccgttca gtgtggcctg aaaacaccat gtgtgccaag ttccagttca 2281 cgccaacaca tgtggggctc cagagactca ctgtggaagt ggactgcaac atgttccaga 2341 acctaaccaa ctataaaagc gtcaccgtgg tagcccctga actatcagct taaacttcca 2401 gctctatcac cactctcctg ccaacccttg ttctacaatc taaaccaaac atgtgctagg 2461 aagag // LOCUS HUMP42SA 2375 bp ss-mRNA PRI 11-JAN-1990 DEFINITION Human erythrocyte membrane protein 4.2 (HUMP4.2S) mRNA, complete cds. ACCESSION M30646 KEYWORDS . SOURCE Human sickle cell patient reticulocyte, cDNA to mRNA, clones 7 and 16. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2375) AUTHORS Sung,L.A., Chien,S., Chang,L.-S., Lambert,K., Bliss,S.A., Bouhassira,E.E., Nagel,R.L., Schwartz,R.S. and Rybicki,A.C. TITLE Molecular cloning of human protein 4.2: A major component of the red erythrocyte membrane JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 955-959 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.Sung, 12-DEC-1989, for release after publication. FEATURES from to/span description pept 228 2303 protein 4.2 (HUMP4.2S) (put.) mRNA < 1 2375 protein 4.2 mRNA BASE COUNT 583 a 638 c 669 g 485 t ORIGIN 1 gaactggtcc aggagatacc agcagagagg gagtaggaga gaagaaacat gtcagggtgc 61 tcacaggagt agtgggggga ggttttgcta tttccagatt cttaagccaa caaaagtgcc 121 ttcatatttt ctgtctggaa gacagaaagc ccagaaggag cccagaagca acagtttgag 181 agaggcgctt tctgcggcca agtggataag aggagcggcc tgcaaccatg ggacaggccc 241 tgggtatcaa gagctgtgac tttcaggcag caagaaacaa tgaggagcac cacaccaagg 301 ccctcagctc ccggcgcctc tttgtgagga gggggcagcc cttcaccatc atcctgtact 361 tccgcgctcc agtccgtgca tttctgcctg ccctgaagaa ggtggccctc actgcacaaa 421 ctggagagca gccttccaag atcaacagga cccaagccac attcccaatt tccagtctgg 481 gggaccgaaa gtggtggagt gcagtggtgg aggagagaga tgcccagtcc tggaccatct 541 ctgtgaccac acctgcagac gctgtcattg gccactactc gcttctgctg caggtctcag 601 gcaggaagca actcctcttg ggtcagttca cactgctttt taacccctgg aatagagagg 661 atgctgtgtt cctgaagaat gaggctcagc gcatggagta cttgttgaac cagaatggtc 721 tcatctacct gggtacagct gactgcatcc aggcagagtc ctgggacttt ggccagttcg 781 agggggatgt cattgacctc agcctgcgct tgctgagcaa ggacaagcag gtagagaagt 841 ggagccagcc ggtgcacgtg gcccgtgtgt tgggtgcctt gctgcatttt ctcaaggagc 901 agagggtcct gcccaccccg cagacccagg ccacccagga aggggccttg ctgaacaagc 961 gccggggcag cgtgcccatc ctgcggcagt ggctcaccgg ccgaggccga cctgtgtatg 1021 atggccaggc ctgggtgttg gctgctgttg cttgcacagt gctgcgatgc ctgggaatcc 1081 ctgcccgcgt ggtgaccacg tttgcctcag cacagggcac cggtgggcgt cttctcatag 1141 atgaatacta taatgaggag ggacttcaga acggagaagg ccagagaggc agaatctgga 1201 tcttccagac ttccacagag tgctggatga agcgcggcct gccttgccag ggttatgatg 1261 gatggcagat tctgcaccca agtgctccta atggaggtgg agtcctgggg tcctgtgatc 1321 tggtgccggt cagagcagtc aaggagggga cgctggggct gaccccagca gtgtcagacc 1381 tttttgctgc cataaatgcc tcatgtgtgg tctggaagtg ctgtgaggat gggacactgg 1441 agttgactga ctccaacaca aagtatgttg gcaacaacat cagcaccaag ggtgtgggca 1501 gtgaccgctg cgaggacatc actcagaact acaagtatcc tgaagggtct cttcaggaaa 1561 aagaggtgct ggagagagtc gagaaagaga aaatggaacg tgagaaagac aacggcatcc 1621 gtcctcccag tctcgagact gccagtcctc tgtacctgct cttgaaagca cccagctccc 1681 tacccctgag aggggatgcc cagatctcag tgacgctggt taatcacagt gagcaggaga 1741 aggcagtgca gctggcaatt ggggtccagg ctgtacacta caacggtgtc cttgctgcca 1801 agctctggag gaagaagctg cacctcacgc tcagtgccaa cctggaaaag ataataacca 1861 tcggcctgtt cttctccaat tttgagcgaa acccacccga gaacaccttc cttagactca 1921 ccgccatggc aacacactct gaatccaacc ttagctgctt tgctcaggaa gacattgcca 1981 tttgtagacc acaccttgcc atcaagatgc cagagaaagc agagcagtat caacccctca 2041 cagcctcagt cagcctccag aactccctag atgcccccat ggaggactgt gtgatctcca 2101 tcctgggaag ggggctcatt cacagagaga ggagctacag attccgttca gtgtggcctg 2161 aaaacaccat gtgtgccaag ttccagttca cgccaacaca tgtggggctc cagagactca 2221 ctgtggaagt ggactgcaac atgttccaga acctaaccaa ctataaaagc gtcaccgtgg 2281 tagcccctga actatcagct taaacttcca gctctatcac cactctcctg ccaacccttg 2341 ttctacaatc taaaccaaac atgtgctagg aagag //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATFERA1 563 bp ds-DNA ROD 14-MAR-1990 DEFINITION Rat ferritin heavy subunit gene, partial exon X. ACCESSION M18051 KEYWORDS ferritin. SEGMENT 1 of 3 SOURCE Rat (strain Fischer) liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 563) AUTHORS Murray,M.T., White,K. and Munro,H.N. TITLE Conservation of ferritin heavy subunit gene structure: Implications for the regulation of ferritin gene expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7438-7442 (1987) STANDARD simple staff_review FEATURES from to/span description pept / 349 + 459 ferritin heavy subunit, exon X pre-msg < 1 > 563 ferritin H mRNA and intron IVS 460 > 563 ferritin H intron A BASE COUNT 87 a 204 c 173 g 99 t ORIGIN 1 agctcagaga cccaagagcc gcctcacaat cacacaggct cctccccgcc cacgcactgc 61 tggcttgggc aacacgccta caggaagagg cggggctggg cggcccaccg cgctgattgg 121 ccggagcgcg cctgacgcag gatcccgcta taaagtgcgg cccgctggtc cctacgccag 181 acgttctcgc ccagagtcgc cgcggtttcc tgcttcaaca gtgcttgaac ggaacccggt 241 gctcgacccc tccgaccccc gtccggccgc tttgagcctg agccctttgc aacttcgtcg 301 ctccgccgct ccagcgtcgc ctccgcgcct cgtccagccg ccatcatgac caccgcgtct 361 ccctcgcaag tgcgccagaa ctaccaccag gactcggagg ctgccatcaa ccgccagatc 421 aacctggagt tgtatgcctc ctacgtctat ctgtccatgg tgagtgcggc ctggcctttg 481 cgggggcgga aagagggtgc ggcctggcct cccttgggcc acttggtgag ctggcggagg 541 gtgggttggg gcgtggctgc ggg // LOCUS RATFERA2 232 bp ds-DNA ROD 14-MAR-1990 DEFINITION Rat ferritin heavy subunit gene, exon X+1. ACCESSION M18052 KEYWORDS ferritin. SEGMENT 2 of 3 SOURCE Rat (strain Fischer) three months-old liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 232) AUTHORS Murray,M.T., White,K. and Munro,H.N. TITLE Conservation of ferritin heavy subunit gene structure: Implications for the regulation of ferritin gene expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7438-7442 (1987) STANDARD simple staff_review FEATURES from to/span description pept + 51 + 197 ferritin heavy subunit, exon X+1 pre-msg < 1 > 232 ferritin H mRNA and introns IVS < 1 50 ferritin H intron A IVS 198 > 232 ferritin H intron B BASE COUNT 66 a 48 c 57 g 61 t ORIGIN About 1 kb after segment 1. 1 gcatctgcct tgctgtgggg atcaataaca aatacccttt ccactttcag tcttgttatt 61 ttgaccggga tgatgtggcc ctgaagaact ttgccaaata ctttctccat caatctcatg 121 aagagaggga acatgctgag aaactgatga agctgcagaa ccagcgaggt ggacgaatct 181 tcctgcagga tatcaaggta agtagactat gggactgcgt taaatgagca gt // LOCUS RATFERA3 764 bp ds-DNA ROD 14-MAR-1990 DEFINITION Rat ferritin heavy subunit gene, last two exons. ACCESSION M18053 KEYWORDS ferritin. SEGMENT 3 of 3 SOURCE Rat (strain Fischer) three months-old liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 764) AUTHORS Murray,M.T., White,K. and Munro,H.N. TITLE Conservation of ferritin heavy subunit gene structure: Implications for the regulation of ferritin gene expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7438-7442 (1987) STANDARD simple staff_review FEATURES from to/span description pept + 34 159 ferritin heavy subunit, exon X+2 252 413 ferritin heavy subunit, exon X+3 pre-msg < 1 566 ferritin H mRNA and introns IVS < 1 33 ferritin H intron B IVS 160 251 ferritin H intron C BASE COUNT 209 a 166 c 188 g 201 t ORIGIN About 150 bp after segment 2. 1 ctgcagatga attgacatgt ttctttgatt cagaaacctg accgtgatga ctgggagagc 61 gggctgaatg caatgaggtg tgcactgcac ttggaaaaga gtgtgaatca gtcactactg 121 gaacttcaca aactggctac tgacaagaat gatccccacg tgagtatcag aaacacgggg 181 tgagtggaga tgatttgcca cagggcttgg gagagctgac cagtaaccct gtcccatgtt 241 ctctttccta gttatgtgac ttcattgaga cgcattacct gaatgagcag gtgaaatcca 301 ttaaagaact gggtgaccac gtgaccaact tacgcaagat gggagcccct gaatctggca 361 tggcagaata tctctttgac aagcacaccc tgggacacgg tgatgagagc taagctgacg 421 tccccaaggc catgtgactt tactggctca ctgaggcagt gcatgcatgt caggctgcct 481 ttatcttttc tataagttgc accaaaacat ctgcttaaaa gttctttaat ttgtaccatt 541 tcttcaaata aagaattttg gtacccagct cttgttgtga ttgaggatga gcgcaccagc 601 ttcccttgcg tcggctatat aaccacactg caacgcctga aagaatattt attaaactcg 661 tagttgggga aagatagtga aagacaggtg tgttcagaca ggactaagca gtcctggttc 721 tgagttacct gccagactgc catgggaaca tattcttgag tgtc //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS HUMHPRTB 56736 bp ds-DNA PRI 15-MAR-1990 DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete cds. ACCESSION M26434 KEYWORDS Alu repetitive element; LINE repetitive element; hypoxanthine phosphoribosyltransferase. SOURCE Human cell line 4X,Y (GM1202), fetal liver, and beta thalasemia patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 56736) AUTHORS Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J., Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W. TITLE Automated DNA sequencing of the human HPRT locus JOURNAL Genomics 6, 593-608 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Edwards, 26-JUL-1989. FEATURES from to/span description pept 1677 1703 hypoxanthine phosphoribosyltransferase, exon 1 /nomgen="HPRT" /map="Xq26" /hgml_locus_uid="LE0031K" 14780 14886 hypoxanthine phosphoribosyltransferase, exon 2 16603 16786 hypoxanthine phosphoribosyltransferase, exon 3 27891 27956 hypoxanthine phosphoribosyltransferase, exon 4 31617 31634 hypoxanthine phosphoribosyltransferase, exon 5 34937 35019 hypoxanthine phosphoribosyltransferase, exon 6 39815 39861 hypoxanthine phosphoribosyltransferase, exon 7 40033 40109 hypoxanthine phosphoribosyltransferase, exon 8 41454 41501 hypoxanthine phosphoribosyltransferase, exon 9 IVS 1704 14779 HPRT intron A IVS 14887 16602 HPRT intron B IVS 16787 27890 HPRT intron C IVS 27957 31616 HPRT intron D IVS 31635 34936 HPRT intron E IVS 35020 39814 HPRT intron F IVS 39862 40032 HPRT intron G IVS 40110 41453 HPRT intron H rpt 258 41 (c) Alu repeat copy A rpt 1012 694 (c) Alu repeat copy B rpt 3532 3342 (c) Alu repeat copy C rpt 3982 3343 (c) Alu repeat copy D rpt 6232 5937 (c) Alu repeat copy E rpt 6524 6288 (c) Alu repeat copy F rpt 7504 7807 Alu repeat copy G rpt 8713 8408 (c) Alu repeat copy H rpt 9477 9169 (c) Alu repeat copy I rpt 9699 10142 Alu repeat copy J rpt 10407 10293 (c) Alu repeat copy K rpt 12154 12310 Alu repeat copy L rpt 13302 12998 (c) Alu repeat copy M rpt 13638 13314 (c) Alu repeat copy N rpt 14238 13923 (c) Alu repeat copy O rpt 14610 14239 (c) Alu repeat copy P rpt 16090 15843 (c) Alu repeat copy Q rpt 16518 16293 (c) Alu repeat copy R rpt 18783 18439 (c) Alu repeat copy S rpt 18962 19084 Alu repeat copy T rpt 19397 19104 (c) Alu repeat copy U rpt 20064 19778 (c) Alu repeat copy V rpt 20486 20636 Alu repeat copy W rpt 20990 20696 (c) Alu repeat copy X rpt 21293 20996 (c) Alu repeat copy Y rpt 22171 21878 (c) Alu repeat copy Z rpt 23772 23460 (c) Alu repeat copy AA rpt 25310 25426 Alu repeat copy AB rpt 26388 26279 (c) Alu repeat copy AC rpt 29863 29560 (c) Alu repeat copy AD rpt 33618 33148 (c) Alu repeat copy AE rpt 36575 36645 Alu repeat copy AF rpt 38764 39083 Alu repeat copy AG rpt 39091 39388 Alu repeat copy AH rpt 40622 40925 Alu repeat copy AI rpt 40952 41262 Alu repeat copy AJ rpt 42640 42936 Alu repeat copy AK rpt 44776 44486 (c) Alu repeat copy AL rpt 46627 46305 (c) Alu repeat copy AM rpt 47520 47763 Alu repeat copy AN rpt 49257 49551 Alu repeat copy AO rpt 49705 49993 Alu repeat copy AP rpt 50002 50312 Alu repeat copy AQ rpt 51066 50774 (c) Alu repeat copy AR rpt 52276 51978 (c) Alu repeat copy AS rpt 52407 52674 Alu repeat copy AT rpt 54653 54551 (c) Alu repeat copy AU rpt 55465 55164 (c) Alu repeat copy AV rpt 56523 56689 Alu repeat copy AW rpt 11312 11254 (c) LINE repeat copy A (partial) rpt 18284 18242 (c) LINE repeat copy B (partial) rpt 27091 26843 (c) LINE repeat copy C (partial) rpt 37558 37300 (c) LINE repeat copy D (partial) rpt 39451 39427 (c) LINE repeat copy E (partial) rpt 51317 51100 (c) LINE repeat copy F (partial) rpt 261 321 short interspersed repeat copy A rpt 1013 1054 short interspersed repeat copy B rpt 2902 2869 (c) short interspersed repeat copy C rpt 3078 3107 short interspersed repeat copy D rpt 4228 4254 short interspersed repeat copy E rpt 4463 4494 short interspersed repeat copy F rpt 21582 21635 short interspersed repeat copy G rpt 28430 28394 (c) short interspersed repeat copy H rpt 28923 28981 short interspersed repeat copy I rpt 35562 35604 short interspersed repeat copy J rpt 44042 44108 short interspersed repeat copy K rpt 46628 46688 short interspersed repeat copy L rpt 51715 51687 (c) short interspersed repeat copy M unsure 17479 17479 polymorphism or cloning error unsure 24903 24903 polymorphism or cloning error BASE COUNT 15689 a 11281 c 11599 g 18167 t ORIGIN 1 bp upstream of EcoRI site; chromosome Xq26. 1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt 61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc 121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc 181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt 241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga 301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc 361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag 421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc 481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt 541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct 601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga 661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt 721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat 781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga 841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag 901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct 961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc 1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag 1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac 1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc 1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg 1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt 1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg 1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg 1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg 1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc 1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc 1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg 1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg 1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg 1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc 1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga 1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca 1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc 2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat 2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg 2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag 2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg 2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct 2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt 2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg 2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca 2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg 2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt 2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg 2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa 2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt 2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg 2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc 2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc 3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt 3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt 3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct 3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat 3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt 3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag 3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc 3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc 3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta 3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac 3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc 3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt 3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc 3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc 3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc 3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag 3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc 4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca 4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc 4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc 4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa 4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca 4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga 4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac 4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg 4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt 4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac 4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg 4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag 4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg 4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca 4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg 4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat 4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg 5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc 5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta 5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc 5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact 5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct 5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg 5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg 5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc 5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa 5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat 5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa 5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg 5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc 5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa 5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt 5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt 6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg 6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga 6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg 6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc 6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt 6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc 6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg 6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt 6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga 6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac 6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct 6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag 6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc 6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt 6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg 6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc 6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca 7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc 7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga 7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt 7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc 7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg 7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag 7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag 7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca 7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt 7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac 7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga 7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt 7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa 7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct 7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact 7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt 7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc 8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac 8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct 8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg 8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat 8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag 8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat 8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg 8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg 8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc 8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac 8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg 8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc 8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat 8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga 8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta 8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg 9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt 9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag 9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt 9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc 9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag 9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt 9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca 9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc 9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc 9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat 9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg 9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc 9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga 9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca 9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat 9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta 9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc 10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc 10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa 10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa 10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat 10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt 10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct 10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt 10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga 10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca 10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct 10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg 10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg 10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa 10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta 10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa 10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca 10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct 11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa 11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc 11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat 11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc 11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag 11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag 11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga 11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg 11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg 11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt 11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac 11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa 11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa 11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa 11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt 11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt 12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag 12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat 12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat 12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc 12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt 12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt 12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct 12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat 12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa 12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag 12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt 12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac 12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca 12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct 12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt 12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc 12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta 13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc 13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg 13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc 13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct 13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt 13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg 13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat 13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca 13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga 13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt 13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac 13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta 13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg 13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc 13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat 13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg 13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc 14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt 14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga 14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt 14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc 14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg 14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc 14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc 14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca 14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg 14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata 14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg 14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac 14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg 14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat 14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac 14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa 15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt 15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg 15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg 15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg 15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt 15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat 15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt 15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata 15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat 15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag 15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg 15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt 15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga 15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc 15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca 15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag 15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt 16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag 16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt 16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa 16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa 16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt 16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact 16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac 16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac 16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa 16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg 16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg 16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag 16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa 16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa 16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta 16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta 16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg 17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt 17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt 17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt 17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc 17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg 17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc 17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt 17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa 17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc 17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat 17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc 17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat 17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact 17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa 17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt 17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa 18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat 18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat 18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt 18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat 18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc 18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg 18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt 18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt 18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca 18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc 18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt 18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg 18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg 18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat 18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa 18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat 18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg 19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa 19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag 19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga 19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc 19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa 19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg 19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc 19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa 19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg 19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa 19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa 19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg 19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag 19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga 19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag 19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt 19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct 20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt 20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac 20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct 20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga 20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat 20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg 20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat 20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt 20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc 20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag 20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt 20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca 20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta 20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg 20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc 20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt 21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc 21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg 21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc 21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc 21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa 21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat 21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa 21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat 21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat 21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt 21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt 21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta 21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt 21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt 21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct 21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct 21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat 22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc 22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg 22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca 22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag 22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc 22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat 22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg 22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct 22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca 22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt 22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc 22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta 22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct 22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta 22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct 22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa 22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa 23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag 23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc 23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa 23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc 23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac 23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga 23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt 23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc 23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc 23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat 23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc 23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc 23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc 23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga 23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga 23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa 24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg 24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat 24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat 24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct 24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt 24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga 24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg 24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt 24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc 24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt 24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc 24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg 24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat 24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa 24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt 24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg 24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc 25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag 25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc 25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact 25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta 25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg 25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc 25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag 25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc 25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca 25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag 25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag 25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt 25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag 25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat 25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa 25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata 25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg 26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt 26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat 26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt 26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag 26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc 26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact 26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg 26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat 26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag 26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt 26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt 26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata 26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta 26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat 26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg 26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata 27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata 27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg 27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc 27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat 27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag 27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga 27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt 27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac 27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa 27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg 27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt 27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat 27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct 27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata 27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt 27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat 27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct 28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg 28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca 28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat 28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa 28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca 28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc 28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg 28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa 28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt 28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt 28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt 28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat 28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa 28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac 28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac 28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca 28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct 29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac 29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa 29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc 29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga 29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt 29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca 29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca 29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt 29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt 29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca 29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg 29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt 29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct 29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg 29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag 29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct 30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga 30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact 30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt 30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa 30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt 30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg 30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc 30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc 30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag 30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga 30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta 30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata 30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct 30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata 30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa 30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca 30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga 31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg 31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac 31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat 31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga 31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg 31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt 31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt 31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa 31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat 31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg 31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac 31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat 31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg 31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt 31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt 31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac 31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta 32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa 32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag 32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta 32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt 32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt 32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact 32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc 32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat 32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa 32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta 32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca 32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg 32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc 32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt 32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt 32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt 33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc 33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac 33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta 33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac 33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta 33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta 33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata 33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata 33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc 33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat 33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa 33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct 33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc 33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt 33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga 33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct 33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag 34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc 34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat 34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac 34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt 34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga 34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt 34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc 34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt 34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca 34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct 34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac 34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa 34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc 34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc 34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg 34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt 34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac 35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt 35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat 35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta 35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag 35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga 35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc 35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc 35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt 35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag 35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat 35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta 35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat 35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta 35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg 35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt 35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa 36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt 36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag 36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat 36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga 36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa 36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt 36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga 36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct 36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg 36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg 36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga 36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag 36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga 36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta 36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt 36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag 36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta 37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa 37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc 37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc 37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg 37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc 37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg 37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac 37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag 37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg 37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc 37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt 37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc 37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt 37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc 37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc 37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag 37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa 38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat 38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc 38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat 38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga 38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata 38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa 38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag 38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt 38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg 38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc 38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg 38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa 38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg 38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa 38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt 38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg 39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa 39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt 39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg 39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc 39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga 39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga 39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc 39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca 39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac 39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa 39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc 39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg 39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct 39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa 39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct 39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga 39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat 40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg 40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt 40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa 40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct 40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag 40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag 40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc 40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact 40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg 40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg 40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag 40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac 40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag 40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg 40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca 40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca 40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct 41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg 41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc 41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac 41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg 41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct 41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc 41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt 41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag 41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg 41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta 41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac 41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc 41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt 41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata 41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga 41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg 42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt 42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc 42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt 42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt 42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta 42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg 42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct 42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt 42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta 42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag 42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta 42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca 42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa 42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg 42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag 42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac 42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa 43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt 43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc 43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct 43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat 43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat 43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc 43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt 43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc 43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt 43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt 43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc 43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca 43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag 43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg 43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa 43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg 43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt 44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca 44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc 44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta 44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact 44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc 44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca 44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg 44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg 44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc 44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac 44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg 44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta 44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca 44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc 44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact 44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt 45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga 45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt 45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg 45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc 45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta 45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg 45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc 45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac 45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct 45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca 45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa 45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa 45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa 45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat 45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg 45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt 45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt 46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca 46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc 46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca 46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg 46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt 46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag 46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct 46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt 46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc 46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc 46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc 46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa 46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc 46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat 46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg 46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc 46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac 47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac 47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca 47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg 47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca 47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc 47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga 47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc 47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa 47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact 47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag 47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg 47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc 47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa 47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga 47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg 47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga 48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc 48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct 48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc 48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg 48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg 48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt 48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg 48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa 48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat 48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc 48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg 48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga 48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat 48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg 48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc 48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt 48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt 49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg 49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag 49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc 49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct 49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat 49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa 49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc 49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc 49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc 49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc 49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta 49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg 49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa 49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc 49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag 49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct 49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt 50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat 50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc 50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt 50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct 50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga 50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca 50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt 50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca 50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt 50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg 50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca 50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa 50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga 50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct 50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt 50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac 51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca 51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca 51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct 51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca 51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac 51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc 51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac 51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa 51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc 51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct 51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc 51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc 51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga 51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt 51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac 51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg 51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa 52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga 52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag 52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa 52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg 52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat 52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg 52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca 52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa 52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag 52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca 52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt 52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta 52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa 52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa 52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac 52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc 52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca 53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata 53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt 53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag 53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc 53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact 53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc 53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat 53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag 53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca 53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag 53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc 53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc 53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc 53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt 53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta 53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct 54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg 54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac 54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt 54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg 54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc 54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc 54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg 54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga 54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag 54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc 54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta 54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta 54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg 54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta 54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga 54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa 54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca 55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc 55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca 55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc 55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc 55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc 55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta 55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg 55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg 55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt 55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt 55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg 55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat 55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac 55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt 55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa 55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa 55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta 56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg 56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag 56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca 56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt 56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca 56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt 56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct 56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg 56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg 56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa 56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc 56701 gccgcccctg gtgatggacc agcggggctt cgacga //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS THARGA 635 bp ds-DNA BCT 06-MAR-1990 DEFINITION T.acidophilum 5S rRNA gene. ACCESSION M32297 KEYWORDS 5S ribosomal RNA. SOURCE T.acidophilum (strain 122-1B2) DNA, clone pT5PH-23. ORGANISM Thermoplasma acidophilum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Thermoplasmales. REFERENCE 1 (bases 1 to 635) AUTHORS Ree,H.K. and Zimmerman,R.A. TITLE The organization and expression of the three rRNA genes from Thermoplasma acidophilum JOURNAL Unpublished (1990) Univ. of Massachusetts, Dept. of Biochemistry, Amherst, MA 01003 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.K.Ree 23-FEB-1990. FEATURES from to/span description rRNA 213 335 5S rRNA RNA 211 346 5S RNA ( 3' end +/- 4 bp) site 184 191 put. promoter site 333 354 put. terminator BASE COUNT 175 a 140 c 140 g 180 t ORIGIN 1 ctgcagcctt atcatgggaa aatgggcatc tgctggagac gactatgtac ttcatgcatc 61 catattatct tatgatatta aaatgctaaa gcgcatgatc gcttcggaag gaatatcgac 121 agggcgagat tcgctgtctc ataatcgccc ggatctgcgc tgaagccgta aaatcacgaa 181 aatcttatat agatgtgttc tatatagtgt tcggcaacgg tcatagcagc agggaaacac 241 cagatcccat tccgaactcg acggttaagc ctgctgcgta ttgcgttgta ctgtatgccg 301 cgagggtacg ggaagcgcaa tatgctgtta ccacttttga aatgaaagtt ttttacatct 361 atctttaatt cagtggcaat gattaatcct tggtcatcat ctgatttctt cgactatgag 421 aggctaaaga gaggaattcg gcataggcga tatgaacgtt cccttcgatc attttctctt 481 caggacttat acttggccag cgatccatag actatatcga ctacgcttta aagcatcaca 541 tgaaattcaa cgttatgact ggcctcatgc cttcaggaga gatgcatctt ggcaacaaga 601 gcgccataga ccaggtcatc ttcttccaga agctt //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS MUSFILA 765 bp ds-DNA ROD 06-MAR-1990 DEFINITION Mouse filaggrin gene, partial cds (type A repeat subunit). ACCESSION M32300 J05198 KEYWORDS epidermal matrix protein; filaggrin. SOURCE Mouse cell line 3T3 DNA, clone 6.1A2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 765) AUTHORS Rothnagel,J.A. and Steinert,P.M. TITLE The structure of the gene for mouse filaggrin and a comparison of the repeating units JOURNAL J. Biol. Chem. 265, 1862-1865 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Rothnagel, 23-FEB-1990. FEATURES from to/span description pept < 1 > 765 filaggrin (AA at 1) rpt 1 765 type A repeat BASE COUNT 178 a 257 c 250 g 80 t ORIGIN 1 ggtggactgg cagctgacgc ctccaggcgc tctggggcgc gtcaaggcca ggcatctgcc 61 cagggtcgcg cagggtcaca aggccaagca cagggccgcg tcggctcgtc agctgacagg 121 caagggcgtc gaggggtcag cgagagtcag gccagcgaca gcgagggcca ctccgacttc 181 tcagaaggcc aggcagtagg agctcacagg cagtctggag ccggccagag gcacgagcag 241 cggtcctccc gaggccagca cggctccgga ttctacccag tatactacta ctatgaacaa 301 gaacacagcg aggaggagtc cgattcccag catcagcacg gccaccagca cgaacagcaa 361 cgcgggcacc agcaccagca ccagcaccag catgagcacg aacagcccga gtccggccac 421 aggcagcagc agtccagtgg gagaggacac cagggagccc accaggaaca gggaagggac 481 agcgccaggt cccggggttc taaccaaggg cacagctcct ctcgccacca ggccgactct 541 cccagggtca gcgcaagatc aggctcagga ggaagaggac agtcccccga cgcctctggc 601 cgcagctcaa acaggaggga cagaccccgg cagcccagcc ccagccagtc cagtgacagc 661 caagtccatt ctggagtcca ggtcgaaggc cggcgcgggc agtcctcatc tgccaacagg 721 agggccggat ccagctccgg ctcaggggtc cagggggcct ctgca // LOCUS MUSFILB 750 bp ds-DNA ROD 06-MAR-1990 DEFINITION Mouse filaggrin gene, partial cds (type B repeat subunit). ACCESSION M32301 J05198 KEYWORDS epidermal matrix protein; filaggrin. SOURCE Mouse cell line 3T3 DNA, clone 6.1A2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 750) AUTHORS Rothnagel,J.A. and Steinert,P.M. TITLE The structure of the gene for mouse filaggrin and a comparison of the repeating units JOURNAL J. Biol. Chem. 265, 1862-1865 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Rothnagel 23-FEB-1990. FEATURES from to/span description pept < 1 > 750 filaggrin (AA at 1) rpt 1 750 type B repeat BASE COUNT 174 a 252 c 248 g 76 t ORIGIN 1 ggtggactgg cagctgacgc ctccaggcgc tctggggcgc ttcaaggcca ggcatctgcc 61 cagggtcgcg cagggtcaca aggccaagca cagggccgcg tcggctcgtc agctgacagg 121 caagggcgtc gaggggtcag cgagagtcag gccagcgaca gcgagggcca ctctgacttc 181 tcagaaggcc aggcagtagg agctcacagg cagtctggag ccggccagag gcacgagcag 241 cggtcctccc gaggccagca cggctccgga tactactatg agcaagaaca cagcgaggag 301 gagtccgatt cccagcacca gcacggccac cagcacgaac agcaacgcgg gcaccagcac 361 cagcaccagc accagcatga gcacgaacag cccgagtccg gccacaggca gcagcagtcc 421 agtgggagag gacaccaggg agcccaccag gaacagggaa gggacagcgc caggtcccgg 481 ggttctaacc aagggcacag ctcctctcgc caccaggccg actctcccag ggtcagcgca 541 agatcaggct caggaggaag aggacagtcc cccgacgcct ctggccgcag ctcaaacagg 601 agggacagac cccggcagcc cagccccagc cagtccagtg acagccaagt ccattctgga 661 gtccaggtcg aaggccggcg cgggcagtcc tcatctgcca acaggagggc cggatccagc 721 tccagctcag gggtccaggg ggcctctgca //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS MUSJUNDR 1596 bp ss-mRNA ROD 17-JAN-1990 DEFINITION Mouse junD proto-oncogene mRNA, complete cds. ACCESSION J05205 KEYWORDS JUN-D protein, junD proto-oncogene. SOURCE Mouse myocyte cell line BC-3H1, cDNA to genomic RNA, clone jun16. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1596) AUTHORS Li,L., Hu,J.-S. and Olson,E.N. TITLE Different members of the jun proto-oncogene family exhibit distinct patterns of expression in response to type beta transforming growth factor JOURNAL J. Biol. Chem. 265, 1556-1562 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Li, 12-JAN-1990, for release after publication. FEATURES from to/span description pept 43 1068 JUN-D protein BASE COUNT 264 a 559 c 492 g 280 t 1 others ORIGIN 1 gagaggagat gacagagctt catgagtgat tcagtcccac ggatggaaac gcccttgtat 61 ggcgaggagg cgctgagcgg cctggctgcg ggtgcgtcga gcgtcgctgg tgctactggg 121 gcccccggcg gtggtggctt cgcgcccccg ggccgcgctt tccccggggc gcccccgacg 181 agcagcatgc tgaagaaaga cgcgctgacg ctcagcctgg cggacgaggg agcggcggga 241 ttgaaaccag ggtcggccac tgcaccttct gcgctgcgcc ccgacggcgc ccccgacggg 301 ctgctggctt cgccggatct tgggctgctc aaactcgcgt cgccggagct ggagaggctg 361 atcatccagt ccaacgggct ggtgaccact accccgacca gtacgcagtt cctctacccg 421 aaggtggcag ccagcgagga gcaggagttc gccgaaggct tcgtcaaggc gctggaggac 481 ctgcacaagc aaagccagct gggtgcggcc accgcggcca cctcaggggc tcccgcgcct 541 cccgcgcccg ccgacctggc cgccaccccc ggggccacgg agaccccggt ctacgccaac 601 ctgagcagtt tcgcgggtgg cgccgggccc cctgggggcg cggccaccgt ggctttcgcc 661 gcggagccag tgcccttccc gccgcccccg ggcgcgctgg ggccgccgcc acctccgcat 721 ccaccgcgcc tggccgcgct caaggacgag ccgcagaccg tgccggacgt gccgagcttc 781 ggcgacagcc ctccgctgtc gcccatcgac atggacacgc aagaacgcat caaggcggag 841 cgcaagaggc tgcgcaaccg catcgccgcc tccaaatgcc gcaagcgcaa gctggagcgt 901 atctcgcgcc tggaggagaa agtcaagacc ctcaaaagcc agaacaccga gctggcgtcc 961 accgccagcc tgctgcgcga gcaggtggcg cacgtcaaac agaaagtcct cagccacgtc 1021 aacagcggct gccagctgct gccccagcac caggtcccgg cgtactgagc ccgagcncgg 1081 ggcgcatgcg cggactagct gcggtggggg ggcgccccgg actctttcga gactcggtgc 1141 ccccggactc gacaagccgg acccccctta actctggatg gggaacccga gcgcacgacc 1201 cccgccctcg cgccgcctct ctacccccag tcctgcgcgt ggctgcccct ttgatcctcc 1261 gagtaggggc tctaaggggc gcgcagcctc acgctctgcc tttccttttt tttttgcctt 1321 ttggaagaga gaacgggagt gtccgattcc gccctattta tgtttctact cgggaacaaa 1381 cgttggttgc gtgttgagtg tgtgtttcct tgtgttggtt ttttaaagaa acgggaagaa 1441 gaaaaaaaaa accctccccc ctcccccccc cgctctctct cctcccgaca ccatcccctc 1501 tctttttgtt tggttttgtt ttgctacgag tccacattcc tgttccgtaa tcctggttcg 1561 ccggttttgt gttttcagta aagtcttcgt tacgcc //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS CECCOLE3A 3066 bp ds-DNA BCT 31-AUG-1987 DEFINITION Plasmid ColE3-CA38 colicin E3, immunity proteins E3 and E8, and lysis genes, complete cds. ACCESSION J01574 J01575 M14038 X02397 KEYWORDS colicin; colicin release protein; immune response gene; immunity protein; lysis protein. SOURCE Plasmid ColE3-CA38 (from E.coli) DNA. ORGANISM Plasmid Colicin E3-CA38 Prokaryota; Bacteria. REFERENCE 1 (bases 1663 to 2662) AUTHORS Masaki,H. and Ohta,T. TITLE A plasmid region encoding the active fragment and the inhibitor protein of colicin E3-CA38 JOURNAL FEBS Lett. 149, 129-132 (1982) STANDARD full staff_review REFERENCE 2 (bases 1612 to 2662) AUTHORS Mock,M., Miyada,C.G. and Gunsalus,R.P. TITLE Nucleotide sequence for the catalytic domain of colicin E3 and its immunity protein. Evidence for a third gene overlapping colicin JOURNAL Nucleic Acids Res. 11, 3547-3557 (1983) STANDARD full staff_review REFERENCE 3 (bases 2251 to 3066) AUTHORS Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P. TITLE Characterization and nucleotide sequence of a colicin-release gene in the hic region of plasmid ColE3-CA38 JOURNAL Gene 29, 175-184 (1984) STANDARD full staff_review REFERENCE 4 (bases 2251 to 3066) AUTHORS Watson,R.J., Lau,P.C.K., Vernet,T. and Visentin,L.P. TITLE Corrigenda: Characterization and nucleotide sequence of a colicin-release gene in the hic region of plasmid ColE3-CA38 JOURNAL Gene 42, 351-355 (1986) STANDARD full staff_review REFERENCE 5 (bases 1 to 2854) AUTHORS Masaki,H. and Ohta,T. TITLE Colicin E3 and its immunity genes JOURNAL J. Mol. Biol. 182, 217-227 (1985) STANDARD simple automatic REFERENCE 6 (bases 1 to 3066; revises [5]) AUTHORS Akutsu,A., Masaki,H. and Ohta,T. TITLE Molecular structure and immunity specificity of colicin E6, an evolutionary intermediate between E-group colicins and cloacin DF13 JOURNAL J. Bacteriol. 171, 6430-6436 (1989) STANDARD simple staff_review COMMENT [1] refers to colicin E3 as protein A and the immunity protein E3 as protein B. [2] refers to the colE3 gene as the ceaC gene and the immE3 gene as the ceaC gene. Draft entry and printed copy of sequence for [3] kindly provide by R.J.Watson, May 1985. FEATURES from to/span description pept 331 1986 colicin E3 pept 1996 2253 immunity protein E3 pept 2420 2677 immunity protein E8 (gtg start codon) pept 2739 2882 lysis protein binding 317 322 ribosome binding site (pot.) binding 1988 1993 ribosome binding site (pot.) revision 132 134 atc in [6]; attc in [5] revision 755 756 gc in [4]; cg in [3] revision 1265 1267 gca in [4]; ga in [3] revision 1348 1349 tt in [4]; ttt in [3] BASE COUNT 897 a 546 c 796 g 827 t ORIGIN 1 aactcggttt taatcagacc tggcatgagt ggaagcggga cgaacagcac aggcaacaac 61 aacgccgccc cgggcacttc cggggcatga gtatgtgata tccggggctg caccccggac 121 cccgccaaca catcacgggc cacaaaattt tttgtggccc gctctgcgtt ttctaagtgt 181 tatccctcct gatttctaaa aaattttcca cctgaacttg acagaaaaaa cgatgacgag 241 tactttttga tctgtacata aacccagtgg ttttatgtac agtattaatc gtgtaatcaa 301 ttgttttaac gcttaaaaga gggaattttt atgagcggtg gcgatggacg cggccataac 361 acgggcgcgc atagcacaag tggtaacatt aatggtggcc cgaccgggct tggtgtaggt 421 ggtggtgctt ctgatggctc cggatggagt tcggaaaata acccgtgggg tggtggttcc 481 ggtagcggca ttcactgggg tggtggttcc ggtcatggta atggcggggg gaatggtaat 541 tccggtggtg gttcgggaac aggcggtaat ctgtcagcag tagctgcgcc agtggcattt 601 ggttttccgg cactttccac tccaggagct ggcggtctgg cggtcagtat ttcagcggga 661 gcattatcgg cagctattgc tgatattatg gctgccctga aaggaccgtt taaatttggt 721 ctttgggggg tggctttata tggtgtattg ccatcacaaa tagcgaaaga tgaccccaat 781 atgatgtcaa agattgtgac gtcattaccc gcagatgata ttactgaatc acctgtcagt 841 tcattacctc tcgataaggc aacagtaaac gtaaatgttc gtgttgttga tgatgtaaaa 901 gacgagcgac agaatatttc ggttgtttca ggtgttccga tgagtgttcc ggtggttgat 961 gcaaaaccta ccgaacgtcc gggtgttttt acggcatcaa ttccaggtgc acctgttctg 1021 aatatttcag ttaataacag tacgccagca gtacagacat taagcccagg tgttacaaat 1081 aatactgata aggatgttcg cccggcagga tttactcagg gtggtaatac cagggatgca 1141 gttattcgat tcccgaagga cagcggtcat aatgccgtat atgtttcagt gagtgatgtt 1201 cttagccctg accaggtaaa acaacgtcaa gatgaagaaa atcgccgtca gcaggaatgg 1261 gatgctacgc atccggttga agcggctgag cgaaattatg aacgcgcgcg tgcagagctg 1321 aatcaggcaa atgaagatgt tgccagaaat caggagcgac aggctaaagc tgttcaggtt 1381 tataattcgc gtaaaagcga acttgatgca gcgaataaaa ctcttgctga tgcaatagct 1441 gaaataaaac aatttaatcg atttgcccat gacccaatgg ctggcggtca cagaatgtgg 1501 caaatggccg ggcttaaagc ccagcgggcg cagacggatg taaataataa gcaggctgca 1561 tttgatgctg ctgcaaaaga gaagtcagat gctgatgctg cattgagttc tgctatggaa 1621 agcaggaaga agaaagaaga taagaaaagg agtgctgaaa ataatttaaa cgatgaaaag 1681 aataagccca gaaaaggttt taaagattac gggcatgatt atcatccagc tccgaaaact 1741 gagaatatta aagggcttgg tgatcttaag cctgggatac caaaaacacc aaagcagaat 1801 ggtggtggaa aacgcaagcg ctggactgga gataaagggc gtaagattta tgagtgggat 1861 tctcagcatg gtgagcttga ggggtatcgt gccagtgatg gtcagcatct tggctcattt 1921 gaccctaaaa caggcaatca gttgaaaggt ccagatccga aacgaaatat caagaaatat 1981 ctttgagagg aagttatggg acttaaattg gatttaactt ggtttgataa aagtacagaa 2041 gattttaagg gtgaggagta ttcaaaagat tttggagatg acggttcagt tatggaaagt 2101 ctaggtgtgc cttttaagga taatgttaat aacggttgct ttgatgttat agctgaatgg 2161 gtacctttgc tacaaccata ctttaatcat caaattgata tttccgataa tgagtatttt 2221 gtttcgtttg attatcgtga tggtgattgg tgatcaaata ttatcaggga tgagttgata 2281 tacgggcttc tagtgttcat ggatgaacgc tggagcctcc aaatgtagaa atgttatatt 2341 ttttattgag ttcttggtta taattgctcc gcaatgattt aaataagcat tatttaaaac 2401 attctcagga gaggtgaagg tggagctaaa aaaaagtatt ggtgattaca ctgaaaccga 2461 attcaaaaaa tttattgaag acatcatcaa ttgtgaaggt gatgaaaaaa aacaggatga 2521 taacctcgag tattttataa atgttactga gcatcctagt ggttctgatc tgatttatta 2581 cccagaaggt aataatgatg gtagccctga aggtgttatt aaagagatta aagaatggcg 2641 agccgctaac ggtaagtcag gatttaaaca gggctgaaat atgaatgccg gttgtttatg 2701 gatgaatggc tggcattctt tcacaacaag gagtcgttat gaaaaaaata acagggatta 2761 ttttattgct tcttgcagtc attattctgt ctgcatgtca ggcaaactat atccgggatg 2821 ttcagggcgg gaccgtatct ccgtcatcaa cagctgaagt gaccggatta gcaacgcagt 2881 aacccgaaat cctctttgac aaaaacaaag cgtgtcaggc tgattctgat gcgctttttt 2941 tttgaaatgt cacaaaaatt ccatgtggga gatgggatct aaaatcctcg tgcagaactt 3001 tccatccagg gggagaaaac ttgtcgtttt gagccgttcg gtgttcagaa cgcacgaaac 3061 cgatcg // LOCUS CECCOLE6A 3065 bp ds-DNA BCT 16-FEB-1990 DEFINITION Plasmid ColE6-CT14 colicin E6, immunity proteins E6 and E8, and lysis protein genes, complete cds. ACCESSION M31808 KEYWORDS colicin E6; immunity protein; lysis protein. SOURCE Plasmid ColE6-CT14 (from E.coli) DNA. ORGANISM Plasmid Colicin E6-CT14 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 3065) AUTHORS Akutsu,A., Masaki,H. and Ohta,T. TITLE Molecular structure and immunity specificity of colicin E6, an evolutionary intermediate between E-group colicins and cloacin DF13 JOURNAL J. Bacteriol. 171, 6430-6436 (1989) STANDARD simple staff_review FEATURES from to/span description pept 331 1986 colicin E6 pept 1996 2253 immunity protein E6 pept 2420 2677 immunity protein E8 (gtg start codon) pept 2739 2882 lysis protein BASE COUNT 903 a 545 c 799 g 818 t ORIGIN 1 aactcggttt taatcagacc tggcatgagt ggaagcggga cgaacagcac aggcaacaac 61 aacgccgccc cgggcacttc cggggcatga gtatgtgata tccggggctg caccccggac 121 cccgccaaca catcacgggc cacaaaattt tttgtggccc gctctgcgtt ttctaagtgt 181 tatccctcct gatttctaaa aaattttcca cctgaacttg acagaaaaaa agatgacgag 241 tactttttga tctgtacata aaaccagtgg ttttatgtac agtattaatc atgtaattaa 301 ttgttttaac gcttaaaaga gggaattttt atgagcggtg gcgatggacg cggccataac 361 acgggcgcgc atagcacaag tggtaacatt aatggtggcc cgaccgggct tggtgtaggt 421 ggtggtgctt ctgatggctc cggatggagt tcggaaaata acccgtgggg tggtggttcc 481 ggtagcggca ttcactgggg tggtggttcc ggtcatggta atggcggggg gaatggtaat 541 tccggtggtg gctcgggaac aggcggtaat ctgtcagcag tagctgcgcc agtggcattt 601 ggttttccgg cactttccac tccaggagct ggcggtctgg cggtcagtat ttcagcggga 661 gcattatcgg cagctattgc tgatattatg gctgccctga aaggaccgtt taaatttggt 721 ctttgggggg tggctttata tggtgtattg ccatcacaaa tagcgaaaga tgaccccaat 781 atgatgtcaa agattgtgac gtcattaccc gcagatgata ttactgaatc acctgtcagt 841 tcattacctc tcgataaggc aacagtaaac gtaaatgttc gtgttgttga tgatgtaaaa 901 gacgaacgac agaatatttc ggttgtttca ggtgttccga tgagtgttcc ggtggttgat 961 gcaaaaccta ccgaacgtcc aggtgttttt acggcatcaa ttccaggtgc acctgttctg 1021 aatatttcag ttaataacag tacgccagca gtacagacat taagcccagg tgttacaaat 1081 aatactgata aggatgttcg cccggcagga tttactcagg ggggtaatac cagggatgca 1141 gttattcgat tcccgaagga cagcggtcat aatgccgtat atgtttcagt gagtgatgtt 1201 cttagccctg accaggtaaa acaacgtcag gatgaagaaa atcgccgtca gcaggaatgg 1261 gatgctacgc atccggttga agcggctgag cgaaattatg aacgcgcgcg tgcagagctg 1321 aatcaggcaa atgaagatgt tgccagaaat caggagcgac aggctaaagc tgttcaggtt 1381 tataattcgc gtaaaagcga acttgatgca gcgaataaaa ctcttgctga tgcaatagct 1441 gaaataaaac aatttaatcg atttgcccat gacccaatgg ctggcggtca cagaatgtgg 1501 caaatggccg ggcttaaagc ccagcgggcg cagacggatg taaataataa gcaggctgca 1561 tttgatgctg ctgcaaaaga gaagtcagat gctgatgctg cattgagttc tgctatggaa 1621 agcaggaaga agaaagaaga taagaaaagg agcgctgaaa ataaattaaa cgaggaaaaa 1681 aacaagcctc gcaagggagt taaagattac ggtcatgatt atcatccaga tcctaaaact 1741 gaagatataa aagggctggg tgagttaaaa gagggtaaac caaaaactcc aaagcaaggt 1801 ggtggcggta aacgtgctag atggtatgga gataaagggc gtaagattta tgagtgggac 1861 tctcagcatg gtgagcttga ggggtatcgt gccagtgatg gtcagcatct tggctcattc 1921 gagcctaaga ctggtaatca gttgaaagga cctgatccaa aacgaaatat caaaaagtat 1981 ctttgagagg atgttatggg gcttaaatta catattaatt ggtttgataa gacgaccgag 2041 gaatttaaag gtggtgagta ttcaaaagat tttggagatg atggctcggt cattgaacgt 2101 cttggaatgc ctttaaaaga taatatcaat aatggttggt ttgatgttat agctgaatgg 2161 gtacctttgc tacaaccata ctttaatcat caaattgata tttccgataa tgagtatttt 2221 gtttcgtttg attatcgtga tggtgattgg tgatcaaata ttatcaggga tgagttgatg 2281 tacgggcttc tagtgttcat ggatgaacgc tggagcctcc aaatgtagaa gtgttatatt 2341 ttttattgag ttcttggtta taattgctcc gcaataattt aaataggcat tatttaaaac 2401 attctcagga gaggtgaagg tggagctaaa gaaaagtatt ggtgattaca ctgaaaccga 2461 attcaaaaaa attattgaaa acatcatcaa ttgtgaaggt gatgaaaaaa aacaggatga 2521 taacctcgag cattttataa gtgttactga gcatcctagt ggttctgatc tgatttatta 2581 cccagaaggt aataatgatg gtagccctga agctgttatt aaagagatta aagaatggcg 2641 agctgctaac ggtaagtcag gatttaaaca gggctgaaat atgaatgccg gttgtttaag 2701 gatgaatgac tggcattctt tcacaacaag gagtcgttat gaaaaaaata acagggatta 2761 ttttattgct tcttgcagtc attattctgg ctgcatgtca ggcaaactat atccgtgatg 2821 ttcagggcgg gactgtatca ccgtcgtcaa ctgctgaact gaccggagtg gaaacgcagt 2881 aacccgaaat cctctttgac aaaaacaaag cgtgtcaggc tgattctgat gcgctttttt 2941 ttgaaatgtc acaaaaattc catgtgggag atgggatcta aaatccccgt gcagaacttt 3001 ccatccagag ggagaaaact tgtcgttttg agccgttcgg tgttcagaac gcacgaaacc 3061 gatcg //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons 1 and 2. ACCESSION M32476 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 1 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2238) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene and analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 704 767 carcinoembryonic antigen-related protein (CGM1), exon 1 1770 + 2129 carcinoembryonic antigen-related protein, exon 2 pre-msg 512 > 2238 CGM1 mRNA and introns IVS 768 1769 CGM1 intron A IVS 2130 > 2238 CGM1 intron B BASE COUNT 615 a 532 c 539 g 552 t ORIGIN 1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata 61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg 121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca 181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc 241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg 301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt 361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct 421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca 481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg 541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg 601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga 661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt 721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta 781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt 841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg 901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt 961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg 1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg 1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt 1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa 1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc 1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt 1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc 1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa 1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta 1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag 1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag 1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg 1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg 1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac 1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct 1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac 1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc 1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac 2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc 2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg 2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc 2221 tgcattgatc tacatgtt // LOCUS RATCGM1AC2 539 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32477 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 2 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 539) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 539 carcinoembryonic antigen-related protein intron B BASE COUNT 126 a 127 c 138 g 148 t ORIGIN 1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact 61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt 121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg 181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct 241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac 301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt 361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc 421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg 481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc // LOCUS RATCGM1AC3 828 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32478 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 3 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 828) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 828 carcinoembryonic antigen-related protein intron B BASE COUNT 193 a 255 c 190 g 190 t ORIGIN 1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct 61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc 121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca 181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa 241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata 301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt 361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt 421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg 481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag 541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc 601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga 661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac 721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac 781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc // LOCUS RATCGM1AC4 642 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 3. ACCESSION M32479 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 4 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 642) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 61 + 420 carcinoembryonic antigen-related protein (CGM1), exon 3 pre-msg < 1 > 642 CGM1 mRNA and introns IVS < 1 60 CGM1 intron B IVS 421 > 642 CGM1 intron C BASE COUNT 151 a 160 c 154 g 177 t ORIGIN 1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag 61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc 121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc 181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat 241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt 301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct 361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca 421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt 481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg 541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt 601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg // LOCUS RATCGM1AC5 616 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 4. ACCESSION M32480 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 5 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 616) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 90 + 449 carcinoembryonic antigen-related protein (CGM1), exon 4 pre-msg < 1 > 616 CGM1 mRNA and introns IVS < 1 89 CGM1 intron C IVS 450 616 CGM1 intron D BASE COUNT 152 a 153 c 147 g 164 t ORIGIN 1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa 61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac 121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct 181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat 241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg 301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt 361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa 421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg 481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag 541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca 601 ttgtctgatt cccttc // LOCUS RATCGM1AC6 654 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron 4. ACCESSION M32481 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 6 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 654) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 654 carcinoembryonic antigen-related protein intron D BASE COUNT 185 a 146 c 155 g 168 t ORIGIN 1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg 61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat 121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc 181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac 241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg 301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga 361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat 421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt 481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt 541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca 601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa // LOCUS RATCGM1AC7 492 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 5. ACCESSION M32482 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 7 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 492) AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 118 + 471 carcinoembryonic antigen-related protein (CGM1), exon 5 pre-msg < 1 > 492 CGM1 mRNA and introns IVS < 1 117 CGM1 intron D IVS 472 492 CGM1 intron E BASE COUNT 134 a 134 c 105 g 119 t ORIGIN 1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc 61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct 121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg 181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt 241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag 301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg 361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa 421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac 481 tctcaaggtc tc // LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 6. ACCESSION M32483 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 8 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1341) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 640 / 987 carcinoembryonic antigen-related protein (CGM1), exon 6 pre-msg < 1 > 1341 CGM1 mRNA and introns IVS < 1 639 CGM1 intron E IVS 988 1341 CGM1 intron F BASE COUNT 357 a 329 c 321 g 334 t ORIGIN 1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt 61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat 121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga 181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca 241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa 301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat 361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag 421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga 481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg 541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt 601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca 661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc 721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca 781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc 841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca 901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag 961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt 1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc 1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa 1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct 1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg 1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag 1321 atccgttgcc tgacggtcga c //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons 1 and 2. ACCESSION M32476 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 1 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2238) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene and analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 704 767 carcinoembryonic antigen-related protein (CGM1), exon 1 1770 + 2129 carcinoembryonic antigen-related protein, exon 2 pre-msg 512 > 2238 CGM1 mRNA and introns IVS 768 1769 CGM1 intron A IVS 2130 > 2238 CGM1 intron B BASE COUNT 615 a 532 c 539 g 552 t ORIGIN 1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata 61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg 121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca 181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc 241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg 301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt 361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct 421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca 481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg 541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg 601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga 661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt 721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta 781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt 841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg 901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt 961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg 1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg 1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt 1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa 1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc 1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt 1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc 1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa 1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta 1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag 1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag 1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg 1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg 1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac 1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct 1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac 1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc 1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac 2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc 2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg 2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc 2221 tgcattgatc tacatgtt // LOCUS RATCGM1AC2 539 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32477 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 2 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 539) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 539 carcinoembryonic antigen-related protein intron B BASE COUNT 126 a 127 c 138 g 148 t ORIGIN 1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact 61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt 121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg 181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct 241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac 301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt 361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc 421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg 481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc // LOCUS RATCGM1AC3 828 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32478 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 3 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 828) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 828 carcinoembryonic antigen-related protein intron B BASE COUNT 193 a 255 c 190 g 190 t ORIGIN 1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct 61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc 121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca 181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa 241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata 301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt 361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt 421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg 481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag 541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc 601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga 661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac 721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac 781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc // LOCUS RATCGM1AC4 642 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 3. ACCESSION M32479 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 4 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 642) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 61 + 420 carcinoembryonic antigen-related protein (CGM1), exon 3 pre-msg < 1 > 642 CGM1 mRNA and introns IVS < 1 60 CGM1 intron B IVS 421 > 642 CGM1 intron C BASE COUNT 151 a 160 c 154 g 177 t ORIGIN 1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag 61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc 121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc 181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat 241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt 301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct 361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca 421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt 481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg 541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt 601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg // LOCUS RATCGM1AC5 616 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 4. ACCESSION M32480 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 5 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 616) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 90 + 449 carcinoembryonic antigen-related protein (CGM1), exon 4 pre-msg < 1 > 616 CGM1 mRNA and introns IVS < 1 89 CGM1 intron C IVS 450 616 CGM1 intron D BASE COUNT 152 a 153 c 147 g 164 t ORIGIN 1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa 61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac 121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct 181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat 241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg 301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt 361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa 421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg 481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag 541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca 601 ttgtctgatt cccttc // LOCUS RATCGM1AC6 654 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron 4. ACCESSION M32481 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 6 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 654) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 654 carcinoembryonic antigen-related protein intron D BASE COUNT 185 a 146 c 155 g 168 t ORIGIN 1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg 61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat 121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc 181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac 241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg 301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga 361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat 421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt 481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt 541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca 601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa // LOCUS RATCGM1AC7 492 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 5. ACCESSION M32482 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 7 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 492) AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 118 + 471 carcinoembryonic antigen-related protein (CGM1), exon 5 pre-msg < 1 > 492 CGM1 mRNA and introns IVS < 1 117 CGM1 intron D IVS 472 492 CGM1 intron E BASE COUNT 134 a 134 c 105 g 119 t ORIGIN 1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc 61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct 121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg 181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt 241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag 301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg 361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa 421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac 481 tctcaaggtc tc // LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 6. ACCESSION M32483 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 8 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1341) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 640 / 987 carcinoembryonic antigen-related protein (CGM1), exon 6 pre-msg < 1 > 1341 CGM1 mRNA and introns IVS < 1 639 CGM1 intron E IVS 988 1341 CGM1 intron F BASE COUNT 357 a 329 c 321 g 334 t ORIGIN 1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt 61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat 121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga 181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca 241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa 301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat 361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag 421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga 481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg 541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt 601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca 661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc 721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca 781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc 841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca 901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag 961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt 1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc 1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa 1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct 1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg 1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag 1321 atccgttgcc tgacggtcga c //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AA 3190 bp ss-mRNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (rnCGM1) mRNA, complete cds. ACCESSION M32474 J05417 KEYWORDS carcinoembryonic antigen-related protein. SOURCE R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3190) AUTHORS Rebstock,S., Lucas,K., Thompson,F.A. and Zimmermann,W. TITLE cDNA and gene analysis imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1900) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 122 2251 carcinoembryonic antigen-related protein precursor (rnCGM1) sigp 122 221 carcinoembryonic antigen-related protein signal peptide matp 222 2248 rnCGM1 protein BASE COUNT 871 a 798 c 693 g 828 t ORIGIN 1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta 61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca 121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct 181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc 241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga 301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact 361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt 421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac 481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt 541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc 601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga 661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc 721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac 781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta 841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt 901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc 961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga 1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc 1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac 1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta 1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt 1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc 1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct 1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta 1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta 1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact 1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac 1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt 1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc 1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag 1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc 1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa 1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca 1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg 2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct 2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag 2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct 2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga 2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg 2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca 2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac 2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc 2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct 2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt 2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag 2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa 2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat 2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct 2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag 2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct 3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt 3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag 3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt 3181 ttctccaaat //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons 1 and 2. ACCESSION M32476 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 1 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2238) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene and analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 704 767 carcinoembryonic antigen-related protein (CGM1), exon 1 1770 + 2129 carcinoembryonic antigen-related protein, exon 2 pre-msg 512 > 2238 CGM1 mRNA and introns IVS 768 1769 CGM1 intron A IVS 2130 > 2238 CGM1 intron B BASE COUNT 615 a 532 c 539 g 552 t ORIGIN 1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata 61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg 121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca 181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc 241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg 301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt 361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct 421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca 481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg 541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg 601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga 661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt 721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta 781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt 841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg 901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt 961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg 1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg 1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt 1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa 1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc 1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt 1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc 1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa 1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta 1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag 1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag 1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg 1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg 1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac 1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct 1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac 1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc 1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac 2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc 2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg 2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc 2221 tgcattgatc tacatgtt // LOCUS RATCGM1AC2 539 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32477 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 2 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 539) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 539 carcinoembryonic antigen-related protein intron B BASE COUNT 126 a 127 c 138 g 148 t ORIGIN 1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact 61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt 121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg 181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct 241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac 301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt 361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc 421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg 481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc // LOCUS RATCGM1AC3 828 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32478 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 3 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 828) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 828 carcinoembryonic antigen-related protein intron B BASE COUNT 193 a 255 c 190 g 190 t ORIGIN 1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct 61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc 121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca 181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa 241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata 301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt 361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt 421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg 481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag 541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc 601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga 661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac 721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac 781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc // LOCUS RATCGM1AC4 642 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 3. ACCESSION M32479 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 4 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 642) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 61 + 420 carcinoembryonic antigen-related protein (CGM1), exon 3 pre-msg < 1 > 642 CGM1 mRNA and introns IVS < 1 60 CGM1 intron B IVS 421 > 642 CGM1 intron C BASE COUNT 151 a 160 c 154 g 177 t ORIGIN 1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag 61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc 121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc 181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat 241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt 301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct 361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca 421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt 481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg 541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt 601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg // LOCUS RATCGM1AC5 616 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 4. ACCESSION M32480 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 5 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 616) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 90 + 449 carcinoembryonic antigen-related protein (CGM1), exon 4 pre-msg < 1 > 616 CGM1 mRNA and introns IVS < 1 89 CGM1 intron C IVS 450 616 CGM1 intron D BASE COUNT 152 a 153 c 147 g 164 t ORIGIN 1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa 61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac 121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct 181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat 241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg 301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt 361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa 421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg 481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag 541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca 601 ttgtctgatt cccttc // LOCUS RATCGM1AC6 654 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron 4. ACCESSION M32481 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 6 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 654) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 654 carcinoembryonic antigen-related protein intron D BASE COUNT 185 a 146 c 155 g 168 t ORIGIN 1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg 61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat 121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc 181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac 241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg 301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga 361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat 421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt 481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt 541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca 601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa // LOCUS RATCGM1AC7 492 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 5. ACCESSION M32482 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 7 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 492) AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 118 + 471 carcinoembryonic antigen-related protein (CGM1), exon 5 pre-msg < 1 > 492 CGM1 mRNA and introns IVS < 1 117 CGM1 intron D IVS 472 492 CGM1 intron E BASE COUNT 134 a 134 c 105 g 119 t ORIGIN 1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc 61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct 121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg 181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt 241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag 301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg 361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa 421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac 481 tctcaaggtc tc // LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 6. ACCESSION M32483 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 8 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1341) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 640 / 987 carcinoembryonic antigen-related protein (CGM1), exon 6 pre-msg < 1 > 1341 CGM1 mRNA and introns IVS < 1 639 CGM1 intron E IVS 988 1341 CGM1 intron F BASE COUNT 357 a 329 c 321 g 334 t ORIGIN 1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt 61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat 121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga 181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca 241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa 301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat 361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag 421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga 481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg 541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt 601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca 661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc 721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca 781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc 841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca 901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag 961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt 1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc 1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa 1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct 1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg 1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag 1321 atccgttgcc tgacggtcga c //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons 1 and 2. ACCESSION M32476 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 1 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2238) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene and analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 704 767 carcinoembryonic antigen-related protein (CGM1) precursor, exon 1 1770 + 2129 carcinoembryonic antigen-related protein, exon 2 sigp 704 767 carcinoembryonic antigen-related protein (CGM1) signal peptide 1770 1804 carcinoembryonic antigen-related protein signal peptide matp 1805 + 2129 carcinoembryonic antigen-related protein pre-msg 512 > 2238 CGM1 mRNA and introns IVS 768 1769 CGM1 intron A IVS 2130 > 2238 CGM1 intron B BASE COUNT 615 a 532 c 539 g 552 t ORIGIN 1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata 61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg 121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca 181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc 241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg 301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt 361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct 421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca 481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg 541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg 601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga 661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt 721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta 781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt 841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg 901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt 961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg 1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg 1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt 1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa 1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc 1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt 1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc 1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa 1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta 1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag 1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag 1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg 1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg 1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac 1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct 1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac 1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc 1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac 2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc 2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg 2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc 2221 tgcattgatc tacatgtt // LOCUS RATCGM1AC2 539 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32477 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 2 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 539) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 539 carcinoembryonic antigen-related protein intron B BASE COUNT 126 a 127 c 138 g 148 t ORIGIN 1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact 61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt 121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg 181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct 241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac 301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt 361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc 421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg 481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc // LOCUS RATCGM1AC3 828 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32478 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 3 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 828) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 828 carcinoembryonic antigen-related protein intron B BASE COUNT 193 a 255 c 190 g 190 t ORIGIN 1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct 61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc 121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca 181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa 241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata 301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt 361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt 421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg 481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag 541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc 601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga 661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac 721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac 781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc // LOCUS RATCGM1AC4 642 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 3. ACCESSION M32479 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 4 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 642) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 61 + 420 carcinoembryonic antigen-related protein (CGM1), exon 3 matp + 61 + 420 carcinoembryonic antigen-related protein pre-msg < 1 > 642 CGM1 mRNA and introns IVS < 1 60 CGM1 intron B IVS 421 > 642 CGM1 intron C BASE COUNT 151 a 160 c 154 g 177 t ORIGIN 1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag 61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc 121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc 181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat 241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt 301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct 361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca 421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt 481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg 541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt 601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg // LOCUS RATCGM1AC5 616 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 4. ACCESSION M32480 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 5 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 616) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 90 + 449 carcinoembryonic antigen-related protein (CGM1), exon 4 matp + 90 + 449 carcinoembryonic antigen-related protein pre-msg < 1 > 616 CGM1 mRNA and introns IVS < 1 89 CGM1 intron C IVS 450 616 CGM1 intron D BASE COUNT 152 a 153 c 147 g 164 t ORIGIN 1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa 61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac 121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct 181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat 241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg 301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt 361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa 421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg 481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag 541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca 601 ttgtctgatt cccttc // LOCUS RATCGM1AC6 654 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron 4. ACCESSION M32481 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 6 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 654) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 654 carcinoembryonic antigen-related protein intron D BASE COUNT 185 a 146 c 155 g 168 t ORIGIN 1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg 61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat 121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc 181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac 241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg 301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga 361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat 421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt 481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt 541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca 601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa // LOCUS RATCGM1AC7 492 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 5. ACCESSION M32482 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 7 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 492) AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 118 + 471 carcinoembryonic antigen-related protein (CGM1), exon 5 matp + 118 + 471 carcinoembryonic antigen-related protein pre-msg < 1 > 492 CGM1 mRNA and introns IVS < 1 117 CGM1 intron D IVS 472 492 CGM1 intron E BASE COUNT 134 a 134 c 105 g 119 t ORIGIN 1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc 61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct 121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg 181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt 241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag 301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg 361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa 421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac 481 tctcaaggtc tc // LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 6. ACCESSION M32483 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 8 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1341) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 640 / 987 carcinoembryonic antigen-related protein (CGM1), exon 6 matp + 640 / 987 carcinoembryonic antigen-related protein pre-msg < 1 > 1341 CGM1 mRNA and introns IVS < 1 639 CGM1 intron E IVS 988 1341 CGM1 intron F BASE COUNT 357 a 329 c 321 g 334 t ORIGIN 1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt 61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat 121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga 181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca 241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa 301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat 361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag 421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga 481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg 541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt 601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca 661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc 721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca 781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc 841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca 901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag 961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt 1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc 1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa 1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct 1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg 1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag 1321 atccgttgcc tgacggtcga c //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM1AA 3190 bp ss-mRNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) mRNA, complete cds. ACCESSION M32474 J05417 KEYWORDS carcinoembryonic antigen-related protein. SOURCE R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3190) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analysis imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 122 2251 carcinoembryonic antigen-related protein precursor (CGM1) sigp 122 220 carcinoembryonic antigen-related protein signal peptide matp 221 2248 CGM1 protein BASE COUNT 871 a 798 c 693 g 828 t ORIGIN 1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta 61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca 121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct 181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc 241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga 301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact 361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt 421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac 481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt 541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc 601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga 661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc 721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac 781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta 841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt 901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc 961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga 1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc 1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac 1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta 1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt 1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc 1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct 1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta 1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta 1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact 1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac 1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt 1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc 1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag 1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc 1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa 1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca 1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg 2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct 2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag 2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct 2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga 2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg 2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca 2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac 2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc 2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct 2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt 2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag 2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa 2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat 2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct 2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag 2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct 3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt 3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag 3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt 3181 ttctccaaat //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS RATCGM4AA 4627 bp ds-DNA ROD 05-MAR-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM4) gene, exons 2 and 3. ACCESSION M32475 J05417 KEYWORDS carcinoembryonic antigen-related protein. SOURCE Rat (SPrarue-Dawley) liver DNA, clone lambda-rnCGM415-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4627) AUTHORS Zimmermann,W. JOURNAL Unpublished (1990) Univ. of Freiburg, D7800 Freiburg FRG STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept / 145 489 carcinoembryonic antigen-related protein (CGM4), exon 2 (AA at 147) 3379 / 3738 carcinoembryonic antigen-related protein exon 3 pre-msg < 1 > 4627 CGM4 mRNA and introns IVS < 1 144 CGM4 intron B IVS 490 3378 CGM4 intron C IVS 3739 > 4627 CGM4 intron D BASE COUNT 1213 a 1165 c 1162 g 1087 t ORIGIN 1 agatctgggt cgacctgcag gtcaacggat ctgggcctta gcaggagtgt gggcagagct 61 ctgggaaggc agaagtgtga ttttttaaaa aaccaacaga tttcacctgc tcaatatcga 121 tggttgctct gtcttccctt ttagcctccc ttctaacctg ttggctcctg actactgccc 181 aggtcaacat tgaatcggtg ccattcaatg tggttgaagg ggaaaacgtc cttcttcttg 241 tccacaatct gccagagaat ctcatagcct ttgcctggta tagagggctg aggaaaattg 301 gagtatacat actgaacact gaagtaagtg tgacggggcc aatgtacagc ggtagagaga 361 cagtgtacag caatggttcc ctgtgtatcc gcaatgtcac ccagaaggac acaggattct 421 acactctacg aacagtcaac acacgtggag aaactgtatc aacaacatcc ttgtacctct 481 atgtgtacag taagtgatac tttgtgaact ctgggtgttg tgtggggttc attccgtaga 541 cacacacaga agaggcaggc ctacctaccc tttgcattgt gtctccttat tgaggtgtga 601 acatttaact caggctaagg agagtaatgc caattgaata gaatccttct tttgacttta 661 ccttgtagtc agctggatgt gtggttaact cagtgaagga catcagccct tgtctagact 721 tctggggttc ttagcagtaa tgtgtccttg ggaaagacct tgagggaagg agattgggtt 781 tgaatgagat agccatagga tcctcatgga agtgagaacc agaaagccct ggctccagac 841 ctctgtcctg actcatctcc tgatggcccc gagaagcatt ttacaaaggc tggattctga 901 catctgttgg cagggaacag tgcttttgag gagcaaatcc ttgtgccaca tacaatcacc 961 tggtgcacgg ccatgagagc cacagttagg cgaggtctcc tggatctctc cagtgactca 1021 tcagggagag aatagaaaga cagatgtccc ggccactaag ttaactgtta tgatggcctt 1081 atgagacttc caggaaggtc atggttgcca ggaagaggga caaaggacac agatccccct 1141 gacagttgct tgtcctttgg ggtccagctc atagaagtct gtccgcaggc aaatgacacc 1201 aggctctgct gatgtggata gctccccaga tctgagctgc agttctccca gcgatcacga 1261 gggccgcctc agggaaacac aattaacacc cagaagagta tttgtctaaa ccaggaactt 1321 acctcctcct ctggctagct cccctgttcc tacagacatg ggggtcacac agccttctca 1381 gacctaccag ctgcctcctt ttctgctgcc ttgctaggga attatgtgta gtggctgctt 1441 tgtgtatttt ctttggaaaa gatagagtat cctaagggaa tcacccagac agaggttcaa 1501 ggcatctctg aaaggccagg cagcacatgg cagagccacc tcacagctca ggacccagag 1561 gaagtgtgcc caccatcttg aatccatgca tgggacgatg gagcccagag ctacgttcca 1621 ggactcaggt cacctcccac acactcaaga agtgaggctc ctgacacagc tgctcctggg 1681 ccccttttct ccctgagaat cctgactggt ggctgcagtg agaacacatc tgtcccctcc 1741 cccactcgtc acacagctgg cccttgggat cctcacacac atctctgtct ccttcctcct 1801 gagagcaaac tacctctttg acgggcactg agaacacagg gcagactggg tgcccagctg 1861 gttctgggtc acccagggag tgcagaggct cactcactgg tgctgactga gccaggaaga 1921 ggccagaaca gagggatgcc ccccgggtga gctgctgtct tcttagggca cagagatgct 1981 cagaggtttg tttgtcactg tgagctctgt ggcatgagac agaaagagcc cagaggagag 2041 gttaggtgtg taggactgag tgtgcacagg gcagagaaca gagttaccca cagcccacgg 2101 gactctggga tatgatcctg tctggcggag gctgagctca gaggatcaga gaacttggga 2161 gctgtattgg agcagatgtg ctacagactg aggacagatc tggccacaga gaccagggcg 2221 gtgctctgta ccatctgcaa acaatgcccc acctgttggt gctcctgctc acagatgagg 2281 agaccacatt ttacagtgtg tgagaggaga ggactcacct actgtctaaa gtctcttcaa 2341 ggggacaggg actggagaag agtttcaggt ttgtagggct gaaaacacta aagtataggg 2401 gctcatcatc atcatcatca ccaccgccat caccaccacc accaccacca ccaccaccac 2461 caccaccacc accaccacca ccatcatcat catcatcatg aggctcttgg taaataagaa 2521 gaagcagggg gaggaggaga ttattgtcaa cccacagttc accatcaatg agcccagtgt 2581 tctgaagact gaggttctca gctgtgatgc cccaaataag aaaccaagct ggtgttgatc 2641 agtgacatgg ctcagtggat ctgggtgttt gcttcatgtc tgacaacctg agaaccagtg 2701 aacacaagtt gtccctgacc tccacctagg gacggcgttt tgcacccaac acagacacac 2761 tgaggcatgc ccttgcacat gaactcatac accaatataa taagcaaatg cataaaaatt 2821 atagcaaatg gaagcagtca acactgtatt cccaaacata ctaatttgtt aaataaatcc 2881 atggccatgt attcattcat tcattcattc actcattcat ttactctcca agatatttga 2941 gttttctttt gcagtctttt ttttttaaaa gataatataa gacaaatccc agttctcatt 3001 attccctagc cctagactgg aagacgacca gtgaagaaag ctagaaggcg aatcagtcac 3061 taaaggacaa gaaacaaaag agtcagagtg tgacggtcgg gaggcttcac cccaacaccc 3121 atcgactgac actgagggtg agcagggatc tgaggacggt gaggcagggc catgttgaca 3181 cctgaggaga gagcagcata gagaggaaat gacaagtgag gggcgcggag tgcatggagg 3241 taatgcactg acctccacta gctagggcag ggagactccc acacctcagc tgaccactgg 3301 acacagctgc tcggactcag gcaccatctt agccaaatac taaagtcctg atgttgacgg 3361 atctctcttc ccttctagcc tctcttttca tctgtgggcg tccttttaac cctgccaagc 3421 tcactattga atcagtgccg cccagtgttg ctgaaggggg aagcgttctt ctcctcgttc 3481 acaatctcca ggacgagctt cgagggtttt tctggtacaa aggggcgtct atgtctagca 3541 accatgagat agcccgatac agaacagcaa agaattcaag tgtgccaggc cctgcccaca 3601 gtggtagaga gacggtgtac agcaatggat ccctcctgct ccagaatgtc acccggaatg 3661 acactgggtt ctacacccta cgcactctga aaagacatca gaaaatggaa ttggcacacg 3721 tgcaacttca ggtggacagt aagtgatttt ccgtgatcgt tcagtgctgg gtgggtcttt 3781 gacacacagg actgtcaccc ctggcatgtg gctacctcct ctctgccttt ttatccccat 3841 gttgtggtta accactatgt gcaggacaca tgtgatggaa agaaatgccc atgggtcaga 3901 cttatcatct gactctcccc tgtatcaagg acagtaactc aaccctaggt gctagactct 3961 gcccagtcat ctggggcatc ttgccatgca acgtgaggaa accatggatc ctcacagcgt 4021 ggtgagcacc aggaagctct gatctcagtc gtttgtccca gacttgactg caaatgtctc 4081 taggagcatt ttgtcaggag tgctgcttac tgcctctctc ctcacagcct gccatcctga 4141 tcttatagta acccaggaca ctgagcccag gggtgaaaat gctcccagtt gggctgggct 4201 ctcccacatc aatcaccaat taaaaatgta ctacaggtta gcccacaggt tattttggtg 4261 gtggcatttt aaattgaggc ccttgtttca aaaaattcta gcttgtgtta agttgacata 4321 aagccagcag cacgattcct gagccctccc caatacctat atctgccaag aagaccagac 4381 tgttcccacc catcatccgg ccttagtcct gggtgctata ggctgggacg tgagaacatg 4441 tggaatgtga agtctgagga tgaccgcagg tacaaaggag atgagaaagt cagagagtgt 4501 gtatccaggg tgtgtagaga ccaaaggtca ggggaggcat catcccaaag cacagtgtgc 4561 atgagtatgt gcaatgtctg aatgagggca gtgagggaca gccacggaga caccaaggac 4621 agagctc //
GenBank-Updates@genbank.bio.net (04/14/90)
LOCUS YSCSEN2X 2813 bp ds-DNA PLN 26-FEB-1990 DEFINITION S.cerevisiae tRNA splicing endonuclease beta-subunit (SEN2) gene, complete cds. ACCESSION M32336 KEYWORDS tRNA splicing endonuclease beta-subunit. SOURCE S.cerevisiae (strain S288C) YCp50 library of Rose DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2813) AUTHORS Ho,C.K., Rauhut,R. and Abelson,J. TITLE Saccharomyces cerevisiae SEN2 gene encodes the middle subunit of tRNA splicing endonuclease JOURNAL Unpublished (1990) Cal. Tech., Div. of Bio., Pasadena, CA 91125 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.K.Ho, 23-FEB-1990. Dr. Ho relayed updated title information by phone on 09-APR-1990. FEATURES from to/span description pept 520 1653 tRNA splicing endonuclease beta-subunit BASE COUNT 927 a 522 c 539 g 825 t ORIGIN 1 aaaattttct tcaagatgca agtttcccat tcagttctat aggaagttaa aacaaaagag 61 ctgtttgaga gctcgtaggc tttggttgtt aagagaacgg actaaataac accaggaggt 121 gtgtctagac ttagttgaac gagttttttc tcgtgctatt atatctgttc ttaatacttc 181 cactttattt taatgatgat agcatgattt atgttgctcg aagttaatcg atgtgagcac 241 aatgatttct caagactaca acggtatctg aattttattt cttgtcaaaa atgttttata 301 gaagccactc ttgctataga tttatattgc cacacataac tttttcacgt aaatttgaaa 361 gagctatatt agctggtgct ttcactaaca tctctttgct ctagtatttc atttcggtgc 421 ataaaacatt tgaacttcca tcgaaattca atatgctaaa acgtttcatt gtaacacgac 481 gctgaaaaga caataagtac gaaggactca agaaggagca tgtctaaagg gagggtcaat 541 cagaagcgtt acaagtatcc tcttccaatt caccccgtag atgaccttcc agagttaatc 601 cttcataatc cgttatcttg gctatattgg gcataccgct attacaagag tacgaacgca 661 ctgaacgata aagtacatgt agattttata ggagatacca ctcttcacat cacggttcaa 721 gatgacaagc aaatgctata tctttggaat aatggatttt ttggcactgg tcaatttagc 781 aggagtgagc ctacatggaa agctagaaca gaggccagac tgggtctcaa tgatactccc 841 ctccacaatc ggggaggaac aaagagtaat actgaaacgg agatgacttt agagaaggtc 901 acacaacaga ggagattaca aagattagag ttcaagaaag aacgtgcgaa gttagaaaga 961 gaattactag aattaagaaa aaaaggtggt catatcgatg aggaaaacat cctccttgag 1021 aaacaacgag aatcattaag aaaatttaaa ctaaaacaaa cagaagacgt gggcatcgtt 1081 gcacaacagc aagacatttc cgaatcaaat ttaagagatg aagacaacaa tcttcttgat 1141 gagaacggtg atcttttgcc actagagtca ctagaactaa tgccagtgga agctatgttt 1201 ttaacttttg cacttcctgt tcttgacata tctcccgcgt gcctggcagg gaaactgttt 1261 caattcgatg ccaaatataa agatattcac tcctttgtca gatcatacgt tatataccat 1321 cactacagat cacacggttg gtgcgtaaga tctggtataa aattcggctg cgattattta 1381 ttatataaga gagggccacc atttcaacac gctgaatttt gtgttatggg tcttgaccac 1441 gacgtctcta aagattatac gtggtattct agcatagccc gtgttgtggg cggcgcaaag 1501 aagacgtttg tgttatgcta cgtggagaga ctaatctcgg aacaggaggc gatagcacta 1561 tggaaatcaa acaactttac taaattgttc aatagttttc aagttggtga agtattgtat 1621 aagagatggg ttcccggaag aaatagagac tagaatatca atgagaagat aaatccttcg 1681 tatgtatata tgtatgtgta tgtatttttt atacagattt caatggtttt tgtaagcatc 1741 tgatacccat ccacaatccc taaacccctt gttctttgct ttggggccct tgacagggtc 1801 atcatatatt gctctaaact tttcattgtg tgtggctggt ttggatatgc aaacaaattt 1861 attatcagga aggacacatt tcaactgaga tttagggaag ggacaaggac agtctttctg 1921 ctgtgcaacg cacgttaggg tgtccggaca caaataccca tcacaagggt tatttaaaac 1981 ttgatcttca tatgattgtt gctgttgctg ttgctgttgc tgatgatgac cgaaattgaa 2041 gaaggcattc actccagtgc tgccaaaaat aaacaatgca aagatgagaa cgatactcca 2101 gcgactttga ctcatactac agtaaaattt ggtcgttttt tgcctggcaa accatgccta 2161 tttttcgtca ttaaaggtct tctgagcttt catcaaaatt atctctataa ttataaagga 2221 cagctcagag ccacacccaa acatctgcaa aggtaaaggc catctttcat ctcacattgc 2281 ctaacaagaa agtcacattg caattgtttc gagaagaaaa cattaattca attacgtggt 2341 tcccatgcgt ctgtgttaag cgtcaattta tcatttgacg aactcgttgc aaccacaaac 2401 aaagcaaggg ccgcatttat caaaggaaaa ccaaccaaaa gcataaataa acgagatttt 2461 tattatgaaa aagccatgcg aatcctacta gaataaattc aaagtgaaga catttagtca 2521 attcgatctt ctctctcttg ctttaactag tcgtaaaacc taataagttt cattcacgac 2581 gcgtattttt ttttcaacat atcatctaac gcgttttaag aaatttcagt aaataatgaa 2641 gctgtagtaa tgccaacaat ccattgtggc tactttatac aacgcactat agagcaataa 2701 aagtgtagca ataaacacta gagagaaggc acataataac aagaatatac tcacgcacat 2761 gtattatgga atcagccagt ttagcgaacc gtacaacaaa atcttaagga att //
GenBank-Updates@genbank.bio.net (05/25/90)
LOCUS FCLCAPS 2493 bp ss-RNA VRL 25-MAY-1990 DEFINITION Feline calicivirus capsid protein gene (put.), complete cds. ACCESSION M32819 KEYWORDS capsid protein. SOURCE Feline calicivirus (CFI/68 FIV), cDNA to viral RNA, clone pCV[2,7]. ORGANISM Feline calicivirus Viridae; ss-RNA nonenveloped viruses; Caliciviridae. REFERENCE 1 (bases 1 to 2493) AUTHORS Neill,J.D. TITLE Nucleotide sequence and expression of the capsid protein gene of Feline calicivirus JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Neill, 14-MAR-1990. Pioneer Hi-Bred International, 7300 NW 62nd Ave., Johnston, IA 50131 FEATURES from to/span description pept 127 2133 capsid protein ORF (put.) pept 2130 2450 capsid protein ORF (put.) BASE COUNT 666 a 593 c 531 g 703 t ORIGIN 1 gcccccttct tacagcacag ccttggagca ttacaacagc caatttaatg gtgtggaggc 61 gcggactgac cagatcgata cgagtggcat ggccgccctt cactgtgatg tgttcgaagt 121 ttgagcatgt gctcaacctg cgctaacgtg cttaaatact atgattggga tcctcacatc 181 aaattggtaa tcaaccccaa caaatttcta catgttggct tctgcgataa ccctttaatg 241 tgttgttatc ctgaattact acctgaattt ggcaccatgt gggattgtga tcaatcgcca 301 ctccaagtct accttgagtc aatcctgggt gatgatgaat ggtcctccac tcatgaagca 361 attgacccag ttgtgccacc aatgcattgg gatgaagccg gaaaaatctt ccaaccacac 421 cctggcgtcc ttatgcatca cctcatctgt aaggttgcag aaggatggga cccaaacctg 481 ccacttttcc gcttggaagc ggacgatggt tccatcacga cacctgaaca gggaacaatg 541 gttggtggag tcattgctga gcccaacgcc caaatgtcaa ccgcagctga catggccact 601 gggaaaagtg tggactctga gtgggaagcc ttcttctcct ttcacactag tgtgaactgg 661 agcacatctg aaactcaggg gaagatactc tttaaacaat ccttaggacc attgctcaac 721 ccctacctta cccatcttgc aaagctgtat gttgcttggt ctggttctgt tgatgttagg 781 ttttctattt ctggatctgg tgtctttgga gggaaattag ctgctattgt tgtgccgcca 841 ggaattgatc ctgttcaaag tacttcaatg ctgcaatatc ctcatgtcct ctttgatgct 901 cgtcaagttg aacctgttat cttttccatt cccgatctaa gaagcacctt atatcacctt 961 atgtctgaca ctgataccac atcgttggta atcatggtgt acaatgatct tattaacccc 1021 tatgctaatg actcaaactc ttcgggctgc attgtcactg tggaaactaa accggggcca 1081 gatttcaagt ttcacctctt aaaacctcct gggtctatgt taactcacgg atctatccca 1141 tctgatctaa tcccaaaatc atcttcgctt tggattggaa atcggttttg gtctgacata 1201 accgattttg taattcggcc ttttgtgttc caggcaaatc gacactttga tttcaaccaa 1261 gagacagcag gttggagcac cccaaggttt cgcccaatta ctatcactat cagtgttaag 1321 gagtcagcaa agcttggtat tggagtggcc accgactaca ttgttcccgg cataccagat 1381 ggatggcccg acacaacaat cccaggtgag ttggtacctg ttggtgacta tgccatcact 1441 aatggcacca acaatgatat caccacagct gcgcagtacg atgcagccac tgagattaga 1501 aacaacacca atttcagagg catgtacatt tgtggttctc ttcaaagagc ttggggggat 1561 aagaagattt caaatactgc ttttatcaca accggcacgg ttgatggagc caaattgata 1621 cccagtaata ccattgacca aacaaaaatt gccgtattcc aagacacaca tgcgaataag 1681 catgtccaga cctcggacga cacattggcc ctgcttggtt atactggtat tggtgaggaa 1741 gcaattggtg ctgaccgcga tagagttgtg cgaattagcg tcctcccgga acgtggcgca 1801 cgtggtggca atcacccaat cttccacaaa aactctatca agcttggtta tgtaattagg 1861 tccattgatg tgttcaattc tcaaattctg catacctcta ggcaactttc cctcaatcat 1921 tacttattgt cgcctgactc ctttgctgtc tataggatta ttgactctaa tggatcctgg 1981 tttgacatag gcattgataa tgatggattt tcttttgttg gtgtatcaag tattggtaaa 2041 ttagagtttc ctttaactgc ctcctacatg ggaattcaat tggcaaaaat tcgacttgcc 2101 tctaacatta ggagtgtgat gacaaaatta tgaattcaat tttgggctta attgacactg 2161 tcacgaacac aattggcaaa gctcaacaaa tcgaattgga taaggctgca cttggtcagc 2221 aacgcgagct ggcactccaa cgtatgaact tggatcgcca ggctctaaat aatcaagtgg 2281 agcaatttaa caaactgctt gagcagaggg tacaaggccc aatccaatct gtgcgcctgg 2341 cacgcgcagc tggtttcagg gtcgaccctt actcatacac aaatcaaaac ttttatgacg 2401 atcaattaaa tgcaatcaga ctatcatata gaaatttgtt caagaattga tcacttaacc 2461 ctttgggtgc cgcacttgcg cctaacccca ggg // LOCUS RIRSTA56 2280 bp ds-DNA BCT 25-MAY-1990 DEFINITION R.tsutsugamushi scrub typhus antigen 56 (sta 56) gene, complete cds. ACCESSION M33004 KEYWORDS antigen; outer membrane protein; scrub typhus antigen 56. SOURCE R.tsutsugamushi (strain Karp) DNA, clone pRTS56H2.3. ORGANISM Rickettsia tsutsugamushi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Rickettsiales; Rickettsiaceae. REFERENCE 1 (bases 1 to 2280) AUTHORS Stover,C.K., Marana,D.P., Carter,J.M., Roe,B.A., Mardis,E. and Oaks,E.V. TITLE The 56 kilodalton major protein antigen of Rickettsia tsutsugamushi: Molecular cloning and sequence analysis of sta 56 gene, and precise identification of a strain specific epitope JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.V.Oaks, 16-MAR-1990. Walter Reed Institute, Dept. of Enteric Infections, Walter Reed Army Institute of Research, Washington, D.C. 20307 FEATURES from to/span description pept 556 2154 scrub typhus antigen 56 precursor (sta 56) sigp 556 621 scrub typhus antigen 56 signal peptide matp 622 2151 scrub typhus antigen 56 signal 77 82 -35 region signal 99 103 -10 region binding 545 550 ribosome binding site BASE COUNT 764 a 319 c 489 g 708 t ORIGIN 1 aagcttgttc attttttatg tgggctaatt ttagataatg caatgttagt ataattatgt 61 ggttaattaa tgtatcttga tttaagattt tatataaata taataagatt tatgtagggc 121 ttaattatta gcttaaaaaa ctgttgctat tttagctaaa aataaaagtt tgggcaagaa 181 aaattattaa taattgaagg tagttgttgc gtaaaaagct gtgttatgct atctaaggtt 241 aaatgtagca aggtgctaat agataattaa tgtattttcg aacgtgtctt taagctatat 301 ataagagcag tattctattg aatattgttt ctaagtatat aaaaaataaa aatacatttt 361 acaattgata aaacgctttg agcacatttt taacacagtg ttttatagat tgtttaaatt 421 attttacaag tactattaaa tattagtata ctaaataata gttttttgat ataaaactaa 481 agttagtgtg gctaaataat tagtttagaa tggttaccac taaaaaataa atttaattct 541 tttaaggaga ttagaatgaa aaaaattatg ttaattgcta gtgcaatgtc tgcgttgtcg 601 ttgccatttt cagctagtgc aatagaattg ggggaagaag gattagagtg tggtccttat 661 gctaaagttg gagttgttgg aggaatgatt actggcgtag aatctgctcg cttggatcca 721 gctgatgctg aaggcaaaaa acacttgtca ttaacaaatg ggctgccatt tggtggaacg 781 ttggctgcag gtatgacaat cgctccagga tttagagcag agataggtgt tatgtacctt 841 acaaatataa ctgctcaggt tgaagaaggt aaagttaagg cagattctgt aggtgagaca 901 aaggcagatt ctgtaggtgg gaaagatgct cctatacgta agcggtttaa acttacacct 961 cctcagccta ctataatgcc tataagtata gctgtacgtg actttgggat tgatattcct 1021 aaccagacct cagcagcaag cacaagccgc agcctcaggc ttaatgatga gcaacgtgct 1081 gcagctagga tcgcttggtt aaagaattgt gctggtattg actatagggt aaaaaaccct 1141 aatgatccta atgggcctat ggttataaat ccgatattgt taaatattcc acagggtaac 1201 cctaatcctg ttggaaatcc accgcagcga gcaaatccgc ctgcaggttt tgcgatacat 1261 aaccatgagc aatggaggca tttggtagtt gggcttgctg cattatcaaa tgctaataaa 1321 cctagcgctt ctcctgtcaa agtattaagt gataaaatta ctcagatata tagtgatata 1381 aagcatttgg ctgatatagc tggtattgat gttcctgata ctagtttgcc taatagtgca 1441 tctgtcgaac agatacagaa taaaatgcaa gaattaaacg atctattgga agagctcaga 1501 gaatcttttg atgggtatct tggtggtaat gcttttgcta atcagataca gttgaatttt 1561 gtcatgccgc agcaagcaca gcagcagggg caagggcagc aacagcaagc tcaagctaca 1621 gcgcaagaag cagtagcagc agcagctgtt aggcttttaa atggcaatga tcagattgcg 1681 cagttatata aagatcttgt taaattgcag cgtcatgcag gaattaagaa agcgatggaa 1741 aaattagctg cccaacaaga agaagatgca aagaatcaag gtgaaggtga ctgcaagcag 1801 caacaaggaa catctgaaaa atctaaaaaa ggaaaagaca aagaggcaga gtttgatctg 1861 agtatgattg tcggccaagt taaactctat gctgacgtaa tgataactga atcagtctca 1921 atatatgctg gtgttggtgc agggttagct tatacttctg gaaaaataga taataaggat 1981 attaaagggc atacaggcat ggttgcatca ggagcacttg gtgtagcaat taatgctgct 2041 gaaggtgtgt atgtggacat agaaggtagt tatatgtact cattcagtaa aatagaagag 2101 aagtattcaa taaatcctct tatggcaagt gtaagtgtac gctataactt ctagtttttt 2161 ctttaaaatt ataaaaaaag cagctaaaag ttctttacag ggtttttagc tgctttttca 2221 gagttttttt ataataataa aaataacttt attctttgct atttaattag cttgaagctt // LOCUS HUMTROPI 816 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human slow-twitch skeletal troponin I (TNN1) mRNA, complete cds. ACCESSION J04760 KEYWORDS slow-twitch skeletal troponin I; troponin I. SOURCE Human adult slow-twitch skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 816) AUTHORS Wade,R., Eddy,R., Shows,T.B. and Kedes,L. TITLE cDNA sequence, tissue-specific expression and chromosomal mapping of the human slow-twitch skeletal muscle isoform of troponin I JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Wade, 15-MAR-1990. FEATURES from to/span description pept 77 640 slow-twitch skeletal troponin I (TNN1) BASE COUNT 180 a 247 c 244 g 145 t ORIGIN Chromosome 1q12-qter.. 1 tagtctgcag tctacggcga ggcacaggcc agcccagctc cacgaggact gaacaaggtg 61 ctgtctcact gccaccatgc cggaagtcga gagaaaaccc aagatcactg cctcccgcaa 121 actcttgctg aagagcctga tgctggccaa ggccaaggaa tgctgggagc aggagcacga 181 ggagcgcgag gctgagaagg tgcgctacct ggcagagcgc atccccacgc tgcagacccg 241 tggcctgtcc ctcagtgccc tgcaggacct gtgccgggag ctgcacgcca aggtggaggt 301 ggtggatgag gagcgatacg acattgaggc caaatgcctc cacaacacca gggagattaa 361 ggacctgaag ctgaaggtga tggacctccg tgggaagttc aagcgcccgc ccctgcgtcg 421 agtccgtgtc tcggctgacg ccatgctccg ggccctgctg ggctccaagc acaaggtgtc 481 catggatctg cgggccaacc tcaagtctgt gaagaaggaa gacacagaga aggagcggcc 541 tgtggaggtg ggtgactgga ggaagaacgt ggaggccatg tctggcatgg aaggccggaa 601 gaagatgttt gatgccgcca atgctccgac ctcacaatag aggccagctt gctgtgctgc 661 gctctgagct cctgcttcat gcttcttctc caacccagct cactcacctc tctgcctgtg 721 tctggagcat cccttcccac ctctccccca cttcttccct ccagcctgca atgccctcct 781 ctggaactgg gattaaacag atacccaaga ggcagg // LOCUS ASOTAAG1 2935 bp ds-DNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G1) gene, complete cds. ACCESSION M33218 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 2935) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 607 772 Taka-amylase A (Taa-G1) precursor, exon 1 828 868 Taka-amylase A precursor, exon 2 955 1070 Taka-amylase A precursor, exon 3 1140 1248 Taka-amylase A precursor, exon 4 1317 1545 Taka-amylase A precursor, exon 5 1603 1765 Taka-amylase A precursor, exon 6 1830 1976 Taka-amylase A precursor, exon 7 2041 2281 Taka-amylase A precursor, exon 8 2360 2647 Taka-amylase A precursor, exon 9 sigp 607 669 Taka-amylase A signal peptide matp 670 772 Taka-amylase A 828 868 Taka-amylase A 955 1070 Taka-amylase A 1140 1248 Taka-amylase A 1317 1545 Taka-amylase A 1603 1765 Taka-amylase A 1830 1976 Taka-amylase A 2041 2281 Taka-amylase A 2360 2644 Taka-amylase A pre-msg 543 > 2789 Taa-G1 mRNA and introns IVS 773 827 Taa-G1 intron A (no splice consensus) IVS 869 954 Taa-G1 intron B IVS 1071 1139 Taa-G1 intron C IVS 1249 1316 Taa-G1 intron D IVS 1546 1602 Taa-G1 intron E IVS 1766 1829 Taa-G1 intron F IVS 1977 2040 Taa-G1 intron G IVS 2282 2359 Taa-G1 intron H signal 2784 2789 poly-A signal BASE COUNT 818 a 752 c 657 g 708 t ORIGIN 1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact 61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca 121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca 181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag 241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat 301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat 361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt 421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc 481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg 541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag 601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct 661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat 721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt 781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg 841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg 901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac 961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc 1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat 1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga 1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt 1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct 1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct 1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact 1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct 1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga 1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc 1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg 1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt 1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca 1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt 1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca 1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca 1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa 1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag 2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg 2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa 2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc 2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa 2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga 2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg 2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga 2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt 2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg 2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag 2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag 2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga 2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca 2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc 2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg // LOCUS ASOTAAG2A1 197 bp ds-DNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 5' end. ACCESSION M33220 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 197) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 195 > 197 Taka-amylase A (Taa-G2) precursor pre-msg 131 > 197 Taa-G2 mRNA and introns signal 2 11 CAAT box signal 95 100 TATA box BASE COUNT 59 a 52 c 36 g 50 t ORIGIN 1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa 61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg 121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc 181 cacagaaggc atttatg // LOCUS ASOTAAG2A2 198 bp ds-DNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 3' end. ACCESSION M33222 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 198) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 Taka-amylase A (Taa-G2) (AA at 1) pre-msg < 1 151 Taa-G2 mRNA and introns (alt.) pre-msg < 1 156 Taa-G2 mRNA and introns (alt.) BASE COUNT 45 a 48 c 53 g 52 t ORIGIN About 2.1 kb after segment 1. 1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg 61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac 121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc 181 gtcctccaga ggcgagga // LOCUS ASOTTAM1 191 bp ss-mRNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 3' end. ACCESSION M33219 KEYWORDS Taka-amylase A. SEGMENT 1 of 2 SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4]. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 191) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 189 > 191 Taka-amylase A (Taa) mRNA 125 > 191 Taa mRNA BASE COUNT 56 a 52 c 32 g 51 t ORIGIN 1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca 61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt 121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga 181 aggcatttat g // LOCUS ASOTTAM2 156 bp ss-mRNA PLN 25-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 5' end. ACCESSION M33221 KEYWORDS Taka-amylase A. SEGMENT 2 of 2 SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4]. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 156) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 Taka-amylase A (Taa) (AA at 1) mRNA < 1 151 Taa mRNA (alt.) mRNA < 1 156 Taa mRNA (alt.) BASE COUNT 40 a 35 c 42 g 39 t ORIGIN About 1.5 kb after segment 1. 1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg 61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac 121 ggcaccgggc cccgactgag agcccgtctc gaatcc // LOCUS CIPNADGAPD 1354 bp ss-mRNA PLN 25-MAY-1990 DEFINITION M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH) mRNA, complete cds. ACCESSION J05223 KEYWORDS D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase; glyceraldehyde-3-phosphate dehydrogenase. SOURCE M.crystallinum, cDNA to mRNA. ORGANISM Mesembryanthemum crystallinum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Aizoaceae. REFERENCE 1 (bases 1 to 1354) AUTHORS Ostrem,J.A., Vernon,D.M. and Bohnert,H.J. TITLE Increased expression of a gene coding for NAD:glyceraldehyde-3- phosphate dehydrogenase during the transition from C-3 photosynthesis to crassulacean acid metabolism in Mesembryanthemum crystallinum JOURNAL J. Biol. Chem. 265, 3497-3502 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 55 1068 glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.1.12) BASE COUNT 315 a 308 c 345 g 386 t ORIGIN 1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct 61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc 121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac 181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag 241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg 301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga 361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt 421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac 481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg 541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc 601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg 661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt 721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg 781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct 841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat 901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga 961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac 1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga 1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc 1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt 1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta 1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc 1321 agcatttatg aaacattgct gtgagctttt gagt // LOCUS ECOPRIA 2658 bp ds-DNA BCT 25-MAY-1990 DEFINITION E.coli primosomal protein n' (priA) gene, complete cds, and cytR gene, 5' end. ACCESSION M33293 KEYWORDS cytR gene; priA gene; primosomal protein. SOURCE E.coli (strain W3110) DNA, clone pEL042. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2658) AUTHORS Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A. TITLE The priA gene encoding the primosomal, replicative n' protein of E.coli JOURNAL Unpublished (1990) Stanford U., Beckman Cntr., Stanford, CA 94305 STANDARD full automatic COMMENT Authorin sequence for [1] kindly submitted by G.C.Allen Jr., 26-MAR-1990. FEATURES from to/span description pept 64 2262 primosomal protein n' pept 2418 > 2658 cytR gene product signal 27 32 -10 region signal 5 10 -35 region binding 53 59 ribosome binding site BASE COUNT 578 a 738 c 756 g 586 t ORIGIN 1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat 61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg 121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag 181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag 241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga 301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc 361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt 421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa 481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa 541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa 601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg 661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc 721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa 781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg 841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc 901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt 961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt 1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac 1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc 1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt 1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg 1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac 1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt 1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac 1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc 1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg 1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg 1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca 1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg 1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc 1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac 1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa 1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg 1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg 2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg 2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg 2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt 2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg 2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg 2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg 2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg 2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata 2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac 2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc 2641 cggatatctg cgatcccc // LOCUS EWCTELRNA 657 bp ds-DNA INV 25-MAY-1990 DEFINITION E.crassus telomerase RNA component gene, complete cds. ACCESSION M33461 KEYWORDS telomerase RNA. SOURCE E.crassus DNA. ORGANISM Euplotes crassus Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; Spirotricha; Heterotrichida; Clevelandellina. REFERENCE 1 (bases 1 to 657) AUTHORS Shippen-Lentz,D. and Blackburn,E.H. TITLE Functional evidence for an RNA template in telomerase JOURNAL Science 247, 546-552 (1990) STANDARD simple staff_entry FEATURES from to/span description RNA 152 342 telomerase RNA component site 186 197 functional telomeric template BASE COUNT 202 a 122 c 111 g 222 t ORIGIN 1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga 61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt 121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg 181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg 241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt 301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg 361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt 421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat 481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga 541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac 601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt // LOCUS HS4DWXJ 160 bp ds-DNA VRL 25-MAY-1990 DEFINITION Epstein-Barr virus defective WZhet junction. ACCESSION M33474 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, clinical sample 9) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 160) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 87 90 WZhet junction BASE COUNT 41 a 45 c 41 g 33 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc 61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact 121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac // LOCUS HS4WXJ 181 bp ds-DNA VRL 25-MAY-1990 DEFINITION Epstein-Barr virus WZhet junction, HR-1 clone 5. ACCESSION M33473 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 181) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 108 111 WZhet junction BASE COUNT 50 a 42 c 46 g 43 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt 61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca 121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga 181 c // LOCUS HUMREGA 4251 bp ds-DNA PRI 25-MAY-1990 DEFINITION Human regenerating protein (reg) gene, complete cds. ACCESSION J05412 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4251) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. FEATURES from to/span description pept 1571 1634 regenerating protein (reg), exon 2 (first expressed exon) 2270 2388 regenerating protein, exon 3 2696 2833 regenerating protein, exon 4 3549 3660 regenerating protein, exon 5 3856 3923 regenerating protein, exon 6 pre-msg 1196 4116 reg mRNA and introns IVS 1224 1524 reg intron A IVS 1635 2269 reg intron B IVS 2389 2695 reg intron C IVS 2834 3548 reg intron D IVS 3661 3855 reg intron E signal 1169 1174 TATA box BASE COUNT 1161 a 927 c 869 g 1294 t ORIGIN 1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc 61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat 121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt 181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc 241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt 301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc 361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc 421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat 481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac 541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg 601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg 661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata 721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag 781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc 841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga 901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt 961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag 1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc 1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa 1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa 1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat 1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta 1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc 1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag 1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac 1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt 1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct 1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact 1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt 1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat 1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg 1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt 1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa 1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac 2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga 2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat 2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa 2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc 2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc 2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag 2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca 2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc 2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac 2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca 2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta 2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt 2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca 2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta 2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag 2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata 3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt 3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca 3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct 3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa 3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga 3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt 3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa 3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc 3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc 3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg 3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag 3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg 3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt 3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt 3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct 3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc 3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca 4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt 4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc 4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct 4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c // LOCUS HUMREGRELA 1524 bp ds-DNA PRI 25-MAY-1990 DEFINITION Human reg-related sequence, complete cds. ACCESSION J05413 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human esophageal mucosa DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1524) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. BASE COUNT 382 a 368 c 345 g 429 t ORIGIN 1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt 61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt 121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg 181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat 241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg 301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga 361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata 421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat 481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag 541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa 601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc 661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac 721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta 781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca 841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt 901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat 961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct 1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc 1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga 1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc 1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct 1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct 1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc 1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg 1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc 1501 tataaatcat ataatgtaaa ggat // LOCUS MDPCGA 4801 bp ss-RNA VRL 25-MAY-1990 DEFINITION Aleutian mink disease parvovirus complete genome. ACCESSION M20036 KEYWORDS complete genome. SOURCE Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (bases 1 to 4592) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 4593 to 4801) AUTHORS Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W., Perryman,S. and Wolfinbarger,J.B. TITLE Nucleotide sequence of the 5' terminal palindrome of Aleutian mink disease parvovirus (ADV) and construction of an infectious molecular clone JOURNAL J. Virol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by M.E.Bloom, 18-MAR-1990. Sequence reported below is (+) strand. FEATURES from to/span description ORF 116 1978 Left ORF ORF 1983 2207 Middle ORF2 ORF 1993 2211 Middle ORF1 ORF 2241 4399 Right ORF BASE COUNT 1740 a 912 c 943 g 1206 t ORIGIN 1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt 61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt 121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag 181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact 241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt 301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact 361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc 421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac 481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt 541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta 601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag 661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa 721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc 781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga 841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta 901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt 961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga 1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg 1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa 1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt 1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga 1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt 1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga 1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa 1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg 1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta 1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat 1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac 1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg 1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa 1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa 1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca 1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt 1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca 2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa 2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg 2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata 2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta 2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc 2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag 2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg 2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg 2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt 2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact 2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag 2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata 2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta 2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa 2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt 2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg 3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc 3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct 3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta 3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt 3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg 3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa 3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg 3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt 3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac 3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga 3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat 3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac 3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt 3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc 3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg 3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg 3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac 4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg 4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa 4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag 4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag 4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg 4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact 4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc 4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta 4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac 4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt 4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt 4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct 4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg 4801 a // LOCUS MDPUPS 3454 bp ss-RNA VRL 25-MAY-1990 DEFINITION Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial sequence. ACCESSION M32981 KEYWORDS . SOURCE Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (sites) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3454, for [1]) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. JOURNAL Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840 STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by M.E.Bloom 18-MAR-1990. Sequence reported below is (+) strand. BASE COUNT 1268 a 673 c 698 g 815 t ORIGIN 1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc 61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac 121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac 181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta 241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa 301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga 361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat 421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt 481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa 541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta 601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat 661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag 721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga 781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat 841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc 901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact 961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt 1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac 1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac 1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa 1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta 1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag 1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac 1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg 1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa 1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac 1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca 1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga 1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc 1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg 1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta 1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt 1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga 1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag 2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag 2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac 2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt 2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga 2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc 2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg 2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat 2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta 2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg 2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa 2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg 2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta 2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc 2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa 2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt 2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca 3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg 3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct 3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga 3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt 3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc 3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt 3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa 3421 aacctatggt tactttactt ggagaggcaa gctt // LOCUS MUSAA2DEL 300 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion fragment. ACCESSION M33468 KEYWORDS deletion mutant. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 130 133 deletion breakpoint BASE COUNT 82 a 45 c 76 g 97 t ORIGIN 1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact 61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt 121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt 181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata 241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa // LOCUS MUSSL20JA 300 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion fragment. ACCESSION M33467 KEYWORDS deletion mutant. SOURCE Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description pept.ps / 42 122 dilute gene, exon 3 179 > 300 dilute gene, exon 5 recomb 129 130 deletion breakpoint BASE COUNT 86 a 51 c 61 g 102 t ORIGIN 1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac 61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac 121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta 181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta 241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc // LOCUS PRVVP4 2359 bp ds-RNA VRL 25-MAY-1990 DEFINITION Porcine rotavirus capsid protein VP4 gene, complete cds. ACCESSION M33516 KEYWORDS capsid protein VP4. SOURCE Porcine rotavirus (Gottfried strain; serotype 4) DNA. ORGANISM Porcine rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 2359) AUTHORS Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K. TITLE Similarity of the outer capsid protein VP4 of the Gottfried strain of porcine rotavirus to that of asymptomatic human rotavirus strains JOURNAL J. Virol. 64, 414-418 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 10 2337 capsid protein VP4 BASE COUNT 812 a 406 c 442 g 699 t ORIGIN 1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa 61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg 121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat 181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca 241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac 301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga 361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg 421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta 481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat 541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa 601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag 661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta 721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag 781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt 841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca 901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt 961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc 1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac 1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg 1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt 1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc 1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca 1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga 1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac 1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg 1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata 1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt 1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct 1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc 1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata 1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga 1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct 1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat 1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat 2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc 2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata 2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc 2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa 2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag 2341 ctctatagag gatgtgacc // LOCUS RATSTAA 1000 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds. ACCESSION M33329 KEYWORDS hydroxysteroid sulfotransferase a. SOURCE Rat (strain Sprague-Dawley) female liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1000) AUTHORS Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K., Yamazoe,Y. and Kato,R. TITLE cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a strong homology in amino acid sequence with the senescence marker protein SMP-2 in rat livers JOURNAL Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 38 892 hydroxysteroid sulfotransferase a (STa) mRNA < 1 1000 STa mRNA signal 977 982 poly-A signal BASE COUNT 299 a 187 c 211 g 303 t ORIGIN 1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga 61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa 121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg 181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt 241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa 301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct 361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc 421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta 481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc 541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa 601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga 661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc 721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg 781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt 841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa 901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa 961 gtgaatgtgg tcattgaata aattctattc tggcattgtg // LOCUS SMFPOLENV 3534 bp ss-RNA VRL 25-MAY-1990 DEFINITION Simian foamy virus type 1 polymerase (pol) gene, 3' end; and envelope (env) gene, complete cds. ACCESSION M33561 KEYWORDS envelope protein; polymerase. SOURCE Simian foamy virus type 1, cDNA to viral RNA. ORGANISM Simian foamy virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Spumavirinae. REFERENCE 1 (bases 1 to 3534) AUTHORS Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A. TITLE Relationship of the env genes and the endonuclease domain of the pol genes of simian foamy virus type 1 and human foamy virus JOURNAL J. Virol. 64, 406-410 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 582 polymerase (AA at 1) pept 524 3481 envelope protein BASE COUNT 1162 a 645 c 670 g 1046 t 11 others ORIGIN 1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac 61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta 121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat 181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta 241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc 301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg 361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc 421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt 481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg 541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac 601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag 661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg 721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat 781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt 841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt 901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt 961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt 1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt 1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt 1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca 1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac 1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca 1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa 1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa 1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga 1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg 1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc 1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt 1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca 1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa 1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg 1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt 1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa 1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc 2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac 2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa 2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag 2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg 2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt 2341 atatttactn nnnnnnnnnn tggtaaccct gatggaagct gcccttcatg atgtttcgat 2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat 2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt 2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta 2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga 2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt 2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa 2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt 2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg 2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg 2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt 3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc 3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt 3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa 3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg 3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc 3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg 3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact 3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg 3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat // LOCUS ENHCYPA 495 bp ds-DNA INV 25-MAY-1990 DEFINITION Entamoeba histolytica cysteine protease gene, partial cds. ACCESSION M27307 KEYWORDS cysteine protease. SOURCE Entamoeba histolytica (strain HM-1) trophozoites DNA, clone E452. ORGANISM Entamoeba histolytica Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Schizopyrenida; Tubulina; Entamoebidae. REFERENCE 1 (sites; for [2]) AUTHORS Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H. TITLE Amplification and sequencing of genomic DNA fragments encoding cysteine proteases from protozoan parasites JOURNAL Mol. Biochem. Parasitol. 39, 1-8 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 495) AUTHORS Eakin,A.E. JOURNAL Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by A.E.Eakin, 24-AUG-1989. FEATURES from to/span description pept < 1 > 495 cysteine protease (AA at 1) BASE COUNT 161 a 75 c 107 g 152 t ORIGIN 1 caagggcaat gcgggtcgtg ctggactttc tgtacaactg cagttcttga aggaagagtt 61 aacaaagatc ttggaaaact ttactcattc tctgaacaac aattagttga ttgtgatgct 121 tctgataatg gatgtgaacg aggacatcca tctaactcac ttaaattcat ccaagaaaat 181 aatggattag gattagaaag cgattatcca tataaagctg ttgctggtac ttgcaagaaa 241 gttaaaaacg ttgctactgt tactggttct agaagagtta ctgatggaag tgaaactgga 301 cttcaaacta ttattgctga aaacggacct gttgctgttg gtatggatgc tagcagacca 361 tcattccaat tatataagaa aggaactatc tattctgata ctaaatgtag atcaagaatg 421 atgaatcact gtgttactgc tgttggttat ggttcaaata gtaatggtaa atattggatt 481 gtcaagaatt cctgg // LOCUS TRBCYPA 495 bp ds-DNA INV 25-MAY-1990 DEFINITION Trypanosoma brucei cysteine protease gene, partial cds. ACCESSION M27306 KEYWORDS cysteine protease. SOURCE Trypanosoma brucei gambiense trypomastigote DNA, clone B502. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (sites; for [2]) AUTHORS Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H. TITLE Amplification and sequencing of genomic DNA fragments encoding cysteine proteases from protozoan parasites JOURNAL Mol. Biochem. Parasitol. 39, 1-8 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 495) AUTHORS Eakin,A.E. JOURNAL Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by A.E.Eakin, 24-AUG-1989. FEATURES from to/span description pept < 1 > 495 cysteine protease (AA at 1) BASE COUNT 116 a 115 c 140 g 124 t ORIGIN 1 caagggcagt gtgggtcgtg ctgggccttt tcaactatcg gcaacatcga agggcagtgg 61 caggtggcag gaaatcctct cgtatccctc tcggagcaga ttctagtgta ctgtgatcca 121 ttgattggct gtggtggtgg gctgatggac aatgccttca actggatagt aaattcaaac 181 ggtggaaacg tattcacgga ggcgagctat ccctatgttt ctgggaatgg tgagcagcca 241 cagtgccaga tgaatggtca cgagatcggt gctgcgataa cagaccatgt tgacttaccg 301 caggatgagg acgccatcgc cgcgtatttg gcagaaaacc gtccccttgc tattgccgtt 361 gaagcaccgc agttttatgg acataacggt gggattctga cttcatgcac ctccgagcaa 421 ctggatcatg gtgtgctcct cgttggttac aatgataata gcaatccacc ctattggatc 481 gtcaagaact cctgg // LOCUS TRCCYPA 495 bp ds-DNA INV 25-MAY-1990 DEFINITION Trypanosoma cruzi cysteine protease gene, partial cds. ACCESSION M27305 KEYWORDS cysteine protease. SOURCE Trypanosoma cruzi (strain RA; Argentine) epimastigote DNA, clone R501. ORGANISM Trypanosoma cruzi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (sites; for [2]) AUTHORS Eakin,A.E., Bouvier,J., Sakanari,J.A., Craik,C.S. and McKerrow,J.H. TITLE Amplification and sequencing of genomic DNA fragments encoding cysteine proteases from protozoan parasites JOURNAL Mol. Biochem. Parasitol. 39, 1-8 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 495) AUTHORS Eakin,A.E. JOURNAL Unpublished (1989) Dept. Pharm. Chem. UCSF, San Francisco, CA 94143 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by A.E.Eakin, 24-AUG-1989. FEATURES from to/span description pept < 1 > 495 cysteine protease (AA at 1) BASE COUNT 94 a 143 c 158 g 100 t ORIGIN 1 cagggccaat gcggctcgtg ctgggccttc tccgccattg gcaacgttag tggccagtgg 61 tttcttgccg gtcacccgct gactaacctg tcggagcaga tgctcgtgtc gtgcgacaaa 121 acggactctg gctgcagtgg tggcctgatg aacaacgcct ttgagtggat tgtgcaggag 181 aataacggcg gcgtgtacac ggaggacagc tacccttatg cgtcgggcga ggggatatcg 241 ccgccgtgca cgacgtcagg ccacacggtg ggtgccacga ttaccggtca cgttgaatta 301 cctcaggacg aggcccaaat agccgcatgg cttgccgtca atggcccggt tgccgttgcg 361 cacgccagca gctggatgac ctacacgggc ggcgttatga cgagctgcgt ctccgagcag 421 ctggatcacg gccttcttct cgtcggctac aatgacagcg ccgcagtgcc gtactggatc 481 atcaagaact cgtgg // LOCUS STRDEXB 1800 bp ds-DNA BCT 25-MAY-1990 DEFINITION S.mutans dextran glucosidase (dexB) gene, complete cds. ACCESSION M30944 KEYWORDS dextran glucosidase. SOURCE S.mutans (strain INGBRITT) DNA. ORGANISM Streptococcus mutans Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 1800) AUTHORS Russell,R.R.B. and Ferretti,J.J. TITLE Nucleotide sequence of the dextran glucosidase (dexB) gene from Streptococcus mutans JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma City, OK 73190 FEATURES from to/span description pept 91 1701 dextran glucosidase (dexB) (EC 3.2.1.11) binding 78 83 ribosome binding site BASE COUNT 585 a 290 c 376 g 549 t ORIGIN 1 aaaacggctt aaattgtttt agattttctc agtctcttta ttagaactaa agaaactaac 61 atgtcataat aaattaaagg tggtgataag atgcaaaaac attggtggca caaggcaact 121 gtttatcaaa tttatccaaa atcttttatg gatacaaatg gtgatggaat tggtgatctc 181 aaaggtatta cgagtaaatt ggattatttg caaaagttag gggttatggc tatttggcta 241 tctccagttt atgatagccc catggatgac aatggctatg acattgcgaa ctatgaagca 301 attgcggata tttttggcaa tatggctgat atggataatt tgctgacgca ggcaaaaatg 361 cgcgacataa aaatcattat ggatctagtg gttaatcata cctcagatga acatacttgg 421 tttattgaag cacgtgagca tccagacagt tctgaacgcg attattatat ttggtgtgac 481 cagccaaatg atttggaatc tattttcggt ggttctgctt ggcagtatga tgataagtcc 541 gatcaatatt atttgcattt ttttagtaag aagcagccag atctaaactg ggaaaacgca 601 aacttacgtc agaagattta tgatatgatg aatttctgga ttgataaagg tattggcggc 661 tttcggatgg acgtcattga tatgattggg aaaattcctg ctcagcatat tgtcagtaac 721 ggaccaaaat tgcatgctta tcttaaggag atgaatgccg ctagttttgg tcaacatgat 781 ctgctgactg tgggggaaac ttggggagca acgcctgaga ttgcgaagca atattcaaat 841 ccagtcaatc acgaactctc tatgattttt caatttgaac atattggtct tcagcataaa 901 ccagaagctc ctaaatggga ttatgtgaag gaacttaatg ttcctgcttt aaaaacaatc 961 tttaataaat ggcagactga gttggaatta ggacaggggt ggaattcgtt attctggaat 1021 aaccatgacc tgcctcgtgt tttatcaatc tggggaaata cgggcaaata tcgtgagaag 1081 tctgctaaag cactggctat tcttcttcac cttatgcgtg ggacacctta tatttatcaa 1141 ggtgaagaga ttgggatgac caattatcct tttaaagatt taaatgaact tgatgatatt 1201 gaatcactta attatgctaa ggaagctttt acaaatggta agtctatgga aactatcatg 1261 gacagtattc gtatgattgg ccgtgataat gccagaacac ctatgcaatg ggatgcttct 1321 caaaatgccg gattttcaac agcggataaa acatggctgc cagttaatcc aaactataaa 1381 gacatcaatg ttcaagcagc tctgaaaaat tccaattcta tcttttacac ctatcaacaa 1441 ctcattcagc ttcgaaaaga aaatgattgg ctagtagatg ccgattttga attgctccct 1501 acagcggaca aagtatttgc ctatttacga aaggtaagag aagaaaggta tcttatagtg 1561 gtcaatgttt cagatcagga agaagttcta gagattgatg ttgacaaaca agaaactctc 1621 attagcaata caaatgaaag cgctgctctt gccaatcaca aactccagcc ttgggatgct 1681 ttttgtatta agataaacta ggatagaata agaaaacaac agacgatacc acttaaaaag 1741 tgacggcctg ttgttttttt aatgtttgta ttaagtaata aagcgattct gattgactat // LOCUS STRGBP 1968 bp ds-DNA BCT 25-MAY-1990 DEFINITION S.mutans glucan-binding protein (gbp) gene, complete cds. ACCESSION M30945 KEYWORDS glucan binding protein. SOURCE S.mutans (strain INGBRITT) DNA. ORGANISM Streptococcus mutans Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 1968) AUTHORS Banas,J.A., Russell,R.R.B. and Ferretti,J.J. TITLE Sequence analysis of the gene for the glucan-binding protein of Streptococcus mutans INGBRITT JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma City, OK 73190. FEATURES from to/span description pept 124 1815 glucan-binding protein (gbp) signal 12 17 -35 region signal 37 42 -10 region binding 112 116 ribosome binding site BASE COUNT 648 a 341 c 445 g 534 t ORIGIN 1 ccggctataa gttgaaatat tgtaggtatt aaaaactatc tttagtttag tatttacatt 61 aattttaaaa atgttatagt ggaagtgtca tgttgattac tattttttta aggaggtaaa 121 atgatgaaag aaaagacacg ttttaaactg cacaaggtta aaaagcagtg ggtggcgatt 181 gccgtgacta gtctagctct agctgcgata ttgtcaggag ctcacttgac tcaggctgag 241 gaacaatccg gcggtactga cagtaagcca agactgacag cgactgtaca ggaaagctca 301 gaacaaccaa ttacaaaagc tccagcagct gattcatctg tagaaaataa cagtgctaac 361 gctgttaaaa gttctgaaac agcagaggca gctgaagtat ccgatggagg cagagccagc 421 caaactgaag cagtaacaaa ccaaacaaac tctgaagagc accatccagc agaaaaagcg 481 acagccgttt ctggagaagc tcagtcagtg caaaatgctc catcagaaaa tgctgcccag 541 caggaaacgg ctaaaaccga gccagcgact gctgcagaaa ataatgacgc tgctccaacc 601 aatagcttct ttaaaaaaga tggtaaatgg tactacaaaa aggccgatgg acagctggca 661 accggttggc agataattga tggaaagcag ctctatttca accaagatgg tagtcaggtc 721 aaaggagaaa ttcatgtgga gacaggggat caaatcattt atcatcctgt tttcataagt 781 gattcacctt cagttttgga agtcaataag atttattact ttgatcctga tagtggtgaa 841 ctctggaagg atcgttttgt ctattctagt tatgcagatc ccctccatta tgaaaatatt 901 aaacatgaag gctggttcta tcttggagaa gatggaaagg ctgctatcgg ctggagaact 961 attggcggta aaaaatacta ttttgacact aatggtgttc aagtcaaagg aaagctaatt 1021 agtacagatg gcaattataa tctaattagc cagaagtatg gcaagaaatc tttcctagat 1081 cctgacaccg gtgaagcttg gactaatcgt tttgtcaatg caaagtatta tttctacaac 1141 tttgcaggat acgtctctac gacagactgg ttctatatgg gagccgatgg tatcggcgtg 1201 accgattggc aaaagatcga tggtatggat tactatttcg aaccttccag tggtattcag 1261 gttaaaggcg acattgctga gcgtgatggc aaggtctatt atttagatga agacagtgga 1321 caagttgtta agaatcgttt tggcacaaca cctgccgagc gtatcagtac agttgaggct 1381 cgtttcccta aaacttatta ttttggagcg gacggtagcc gcaaagatct aactggttgg 1441 cagattattg atggtaaaac ttattacttt aaggatgatc acagcataaa agcaaagtca 1501 gagtatagtc aaattggtgg ttctgtgcct gatgacggtt ttgcagagat tgatggtgat 1561 ggttactttt ttgatactca aggtcaattc gtaacgaata gatttgtcag aaaatacgac 1621 tacagtaata tttggtatta ttatggaagc gatggcaaac gtgtatcagg ctggcaaact 1681 atcgacggta agcgctacta ctttagccaa gatgaaaaga caaagggccg tcaaattaaa 1741 ggacaaacca tcactatcga tggtaaagaa tatacttttg acaaagacag cggtgaagtt 1801 atcaatagta actagttggt aaatcccatg gcacacaaaa acgagcagat ttcatactct 1861 gttcgttttt tcgccttaaa acttatatat ttataaatcg tcaataaagt gtttacttga 1921 aaacggtaaa tatgccaaga gtttgactgt tatcaattaa tgggaaag // LOCUS STRGTFS 4376 bp ds-DNA BCT 25-MAY-1990 DEFINITION S.downei glucosyltransferase S (gtfs) gene, complete cds. ACCESSION M30943 KEYWORDS glucosyltransferase S. SOURCE S.downei (strain MFe28) DNA. ORGANISM Streptococcus downei Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 4376) AUTHORS Gilmore,K.S., Russell,R.R.B. and Ferretti,J.J. TITLE Sequence analysis of a Streptococcus downei gtfs gene which specifies a glucosyltransferase synthesizing soluble glucans JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.J.Ferretti, 20-DEC-1989. U of Oklahoma, Box 26901, Oklahoma City, OK 73190 FEATURES from to/span description pept 247 4344 glucosyltransferase S (gtfs) signal 127 132 -35 region signal 150 154 -10 region binding 233 237 ribosome binding site BASE COUNT 1282 a 944 c 1000 g 1150 t ORIGIN 1 tggcctatgg agtagcggtc attctcacca tcttgaacat caagttaatt attgatttgt 61 tccaataaat atgaatctct ttaaagaata aatttaaaga gatttttatg ttgttttaca 121 atctaatgac aaatttagga tttctgagat taaacttaaa taacatgtta aaataaattg 181 ataattaaaa ttatttggtt attaaagttt taattgtcga atttttttat aaggagaatg 241 atattcatgg aaaaaaattt gcgctacaaa ttgcataagg ttaaaaagca atgggtagcc 301 attggtgtga cgaccgtaac gcttagcttt ctagctggag gccaagtcgt tgcagccgac 361 acaaacaata atgacgggac aagtgttcag gtcaacaaaa tggtgccaag tgatcccaag 421 tttgacgctc aagcccaaaa cggtcaactt gctcaagcta tgtttaaggc tgctaatcaa 481 gcagatcaaa ctgctacaag tcaagttagc ccagcaacag atggcagggt tgataaccag 541 gtgacaccag cagctaatca acctgcggct aatgtggcca atcaagatgt ggccaatcca 601 gcgactgatg ccggagcact taatcgccaa tcggccgctg atacgtcaac agatggtaag 661 gctgtacctc agactagtga ccaaccaggt catctggaaa cagttgatgg caaaacttac 721 tatgtggatg ccaatggtca acggttgaaa aactattcaa tggtcattga tggcaagacc 781 tattactttg acggtcagac aggcgaagcg caaaccgact tgcctaagac cggtcaagct 841 aatcaagaca atgtgcccga tagctatcag gccaataatc aagcctatag taatgaggct 901 tctagctttg agacagttga taattacctg acagcggatt cctggtatcg ccctcgcaag 961 attttgaaaa atggccagag ctggcaggct agttcagaag gtgacctgcg tcctattctc 1021 atgacctggt ggccagatgc ggccaccaag gcagcctatg ccaacttctg ggctaaagaa 1081 ggcctgattt caggttctta caggcaaaat tctgccaatc ttgatgcggc tacccagaat 1141 attcaatctg ccattgaaaa gaagattgcc agtgaaggca ataccaactg gttgcgcgac 1201 aagatgtccc aattcgtcaa gtcacaaaac cagtggagta ttgcttctga aaatgagact 1261 gtctatccta atcaggatca tatgcaaggg ggggccctgc tcttttcaaa cagtaaggat 1321 accgagcatg ctaactcaga ctggcgcctg ctcaaccgca atccaacctt ccaaactggt 1381 aaacaaaagt attttacaac caactatgcg ggttatgaat tgctcttagc caatgatgtt 1441 gataattcta acccggtcgt ccaagccgaa cagctcaatc acctccacta ccttatgaac 1501 tggggtgaca ttgttatggg ggataaggat gctaactttg atggtgtccg agttgatgcg 1561 gttgataatg tcaatgccga cctgctacaa attcagaggg attattataa ggccaaatat 1621 ggcaccgatc aaaatgaaaa aaatgccatt gaccacctct ctatcctaga agcctggtct 1681 ggcaacgaca atgactacgt caaggaccaa aacaactttt ctctctctat tgacaatgat 1741 caacgcagtg gtatgttgaa agctttcggt tatgcttctg cctatcgtgg taatttgagt 1801 aatctcgcaa ctgctggctt gaaaaaccgt agtgcaaatc ctgatagtga cccagttcca 1861 aactatgtct ttattcgtgc ccatgactct gaggtacaga cacggattgc taagattatt 1921 cgtgagaaat tgggtaagac caatgctgat ggtttgacca atctgacctt agatgattta 1981 aacaaggcct ttgacatcta caaccaggat atgaatgcca ccgataaggt ttattatcct 2041 aataatttac cgatggccta tgcttggatg ctgcaaaata aggatacggt cactcgggtc 2101 tactatggtg atatgtacac cgataatggc cagtatatgg cgaccaagac cccattttat 2161 aatgctattg agaccctgct taagggccgg attaagtatg ttgctggtgg tcaagcggtc 2221 tcctacaagc aagattggtc tagtggaatt ttgacctcgg ttcgctatgg taagggagct 2281 aattcggcta gtgatgctgg gaacacggaa acccgcaatt ctggtatggc cctcctgatt 2341 aacaatcgac ctaacttcag ggcttatcgg aacttgacct tgaacatggg ggcagcccat 2401 aagagccaag cctaccgtcc tctcttgctg tctaccaagg atggcattgc cacctacctt 2461 aatgatagcg atgtggattc tcgccaatac aagtacaccg atagtcaggg gaacttaagc 2521 tttagtgcct ctgaactgca aagtgttgct aacgcccaag tctcaggtat gattcaggtt 2581 tgggtacccg tcggtgcggc agataaccaa gatgtccgta cttctccaag tacacaagcg 2641 accaaggatg gcaatatcta tcatcaaagt gacgccctag attcccaagt catctatgaa 2701 ggtttctcta atttccaagc cttcgcccaa agtcctgacc aatataccaa tgctgtcatt 2761 gctaaaaatg gcgacctctt taagtcttgg ggcattaccc aatttgaaat ggcaccgcag 2821 tacgtgtcat cagaagatgg aactttcctt gattccgtta ttttaaatgg ttatgccttc 2881 tcagaccgct atgacctggc tatgagtaag aataataaat atggctccaa gcaggatttg 2941 gccaatgcca ttaaaggact tcagtctgca ggtatcaagg tcttgtcaga cttggtaccg 3001 aaccaacttt ataatctgcc tggaaaagaa gtggtaacgg caacccgggt caatcaatat 3061 ggtcaagcca aatcaggggc gaccatcaat aagactcctt atgtggccaa tacccgttcc 3121 tatggtgact accaagaaca gtacggtgga aaattcttgg atgacttgca aaagctctac 3181 ccaagactct ttagcaccaa gcaaatttca acgggcaaac caattgatcc atctgttaag 3241 attaccaatt ggtcggctaa gtatttcaat ggctccaata ttttggggcg tggtgccaag 3301 tatgttctga gtgagggcaa taagtatctc aacttggctg atggcaagct cttcttgcca 3361 acggtcctca ataatactta tggtcagccg caagtatcgg ctaatggttt tatttctaag 3421 aatggcggta ttcattatct tgataaaaat ggtcaggaag tcaagaatcg gttcaaggaa 3481 atttctggca gctggtatta cttcgattct gacggtaaga tggctactgg aaaaacgaag 3541 attggcaatg atacctacct ctttatgcct aatggtaaac aacttaagga aggtgtctgg 3601 tatgatggta agaaggccta ctattatgat gataatggta ggacgtggac caataagggc 3661 ttcgttgaat ttagagttga cggtcaagac aagtggcgtt acttcaatgg tgatggcact 3721 atcgccattg gactagttag tctagataat cgcaccctct actttgatgc ctatggctat 3781 caagtcaagg gacaaaccgt gacgattaat ggtaagtcct atacctttga tgccgatcag 3841 ggtgacttgg tccaaacaga caatgccaat ccagctcctc agggtcaagc aggctggaaa 3901 ctcctaggag ataaccagtg gggctaccgc aaggacggtc aactcttgac gggtgagcaa 3961 actattgatg gtcaaaaggt cttcttccaa gataatggcg tccaagtcaa aggtggaact 4021 gcgacagatg cttcaggtgt cttgcgtttc tacgaccgtg accagggcca ccaagttggc 4081 aagggctggt actcaacctc cgacgataat tgggtctatg tcaatgaatc cggtcaagtt 4141 ctgacaggct tacaaaccat tgatggtcag acggtctatt tcgatgacaa gggtatccag 4201 gccaagggca aggctgtctg ggatgaaaat ggcaacctgc gttactttga tgccgattca 4261 ggtaatatgc ttcgggaccg ttggaagaac gttgacggaa actggtatta cttcaaccgc 4321 aatggcctag ccacgaggtg gtaagcctaa tactcgctaa atattaaaag agctcg // LOCUS VCHTCPA 1048 bp ds-DNA BCT 25-MAY-1990 DEFINITION V.cholerae major pilin subunit (tcpA) gene, complete cds. ACCESSION M33514 KEYWORDS antigen; pilin. SOURCE V.cholerae (strain Z17561 (Classical, Inaba)) DNA, clone pPM2114. ORGANISM Vibrio cholerae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1048) AUTHORS Faast,R., Ogierman,M.A., Stroeher,U.H. and Manning,P.A. TITLE Nucleotide sequence of the structural gene, tcpA, for a major pilin subunit of Vibrio cholerae JOURNAL Gene 85, 227-231 (1989) STANDARD simple staff_review FEATURES from to/span description pept 144 818 major pilin subunit (tcpA) precursor sigp 144 218 major pilin subunit (tcpA) signal peptide matp 219 815 major pilin subunit (tcpA) ORF 865 > 1048 ORF BASE COUNT 327 a 182 c 240 g 299 t ORIGIN 1 aaaaaataaa aaacacagca aaaatgacat ctgtcaattg taggtgactt tgtgtggtta 61 aatgtgcgtg ttgcttacgt tatctaaaaa agaccaagcg acgcatttct ttaaagacag 121 taaaatggtg gagttacata aatatgcaat tattaaaaca gctttttaag aagaaatttg 181 taaaagaaga acacgataag aaaaccggtc aagagggtat gacattactc gaagtgatca 241 tcgttctagg cattatgggg gtggtttcgg cgggggttgt tactctggcg cagcgtgcga 301 ttgattcgca gattatgacc aaggccgcgc aaagtctcaa tagtatccaa gttgcactga 361 cacagacata ccgtggtcta ggtaattatc cagcaacagc tgatgcgaca gctgctagta 421 agctaacttc aggcttggtt agtttaggta aaatatcatc cgatgaggca aaaaacccat 481 tcaatggtac aaatatgaat attttttcat ttccgcgtaa tgcagcagct aataaagcat 541 ttgcaatttc agtggatggt ctgacacagg ctcaatgcaa gacacttatt accagtgtcg 601 gtgatatgtt cccatatatt gcaatcaaag ctggtggcgc agtagcactt gcagatctag 661 gtgattttga gaattctgca gcagcggctg agacaggcgt tggtgtgatc aaatctatcg 721 ctcccgctag taagaattta gatctaacga acatcactca cgttgagaaa ttatgtaaag 781 gtactgctcc attcggcgtt gcatttggta acagctaatt caaataagtt tgtttaactt 841 aatcttaacg ttgcccatta aataatgggc aacttattaa attcaatgtg gtatcaatat 901 gagaaaatac caacaaggtg tcggattatt ggaggcgatt ctggcttctg cgtattaggg 961 atggcattgg tcgctgctgg gagctattac aagcgggaag ctgaactcat gattaaatcc 1021 agtaacgcat ttgatgttat tgagttgt // LOCUS HPCPOLYP 7310 bp ss-RNA VRL 25-MAY-1990 DEFINITION Hepatitis C virus polyprotein gene, partial cds. ACCESSION M32084 KEYWORDS polyprotein. SOURCE Hepatitis C virus, cDNA to viral RNA, clones K9-1 through 15e, isolated from chimpanzee (individual 910) blood plasma. ORGANISM Hepatitis C virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae incertae sedis. REFERENCE 1 (bases 1 to 7310) AUTHORS Choo,Q.-L., Richman,K. and Han,J. TITLE The nucleotide sequence of the Hepatitis C viral genome JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.Houghton, 22-FEB-1990. Chiron Corporation, 4560 Horton Street, Emeryville CA 94608. FEATURES from to/span description pept < 1 > 7310 polyprotein (AA at 3) BASE COUNT 1495 a 2218 c 2058 g 1539 t ORIGIN 1 caggctgtcc tgagaggcta gccagctgcc gaccccttac cgattttgac cagggctggg 61 gccctatcag ttatgccaac ggaagcggcc ccgaccagcg cccctactgc tggcactacc 121 ccccaaaacc ttgcggtatt gtgcccgcga agagtgtgtg tggtccggta tattgcttca 181 ctcccagccc cgtggtggtg ggaacgaccg acaggtcggg cgcgcccacc tacagctggg 241 gtgaaaatga tacggacgtc ttcgtcctta acaataccag gccaccgctg ggcaattggt 301 tcggttgtac ctggatgaac tcaactggat tcaccaaagt gtgcggagcg cctccttgtg 361 tcatcggagg ggcgggcaac aacaccctgc actgccccac tgattgcttc cgcaagcatc 421 cggacgccac atactctcgg tgcggctccg gtccctggat cacacccagg tgcctggtcg 481 actacccgta taggctttgg cattatcctt gtaccatcaa ctacaccata tttaaaatca 541 ggatgtacgt gggaggggtc gaacacaggc tggaagctgc ctgcaactgg acgcggggcg 601 aacgttgcga tctggaagac agggacaggt ccgagctcag cccgttactg ctgaccacta 661 cacagtggca ggtcctcccg tgttccttca caaccctacc agccttgtcc accggcctca 721 tccacctcca ccagaacatt gtggacgtgc agtacttgta cggggtgggg tcaagcatcg 781 cgtcctgggc cattaagtgg gagtacgtcg ttctcctgtt ccttctgctt gcagacgcgc 841 gcgtctgctc ctgcttgtgg atgatgctac tcatatccca agcggaggcg gctttggaga 901 acctcgtaat acttaatgca gcatccctgg ccgggacgca cggtcttgta tccttcctcg 961 tgttcttctg ctttgcatgg tatttgaagg gtaagtgggt gcccggagcg gtctacacct 1021 tctacgggat gtggcctctc ctcctgctcc tgttggcgtt gccccagcgg gcgtacgcgc 1081 tggacacgga ggtggccgcg tcgtgtggcg gtgttgttct cgtcgggttg atggcgctga 1141 ctctgtcacc atattacaag cgctatatca gctggtgctt gtggtggctt cagtattttc 1201 tgaccagagt ggaagcgcaa ctgcacgtgt ggattccccc cctcaacgtc cgaggggggc 1261 gcgacgccgt catcttactc atgtgtgctg tacacccgac tctggtattt gacatcacca 1321 aattgctgct ggccgtcttc ggaccccttt ggattcttca agccagtttg cttaaagtac 1381 cctactttgt gcgcgtccaa ggccttctcc ggttctgcgc gttagcgcgg aagatgatcg 1441 gaggccatta cgtgcaaatg gtcatcatta agttaggggc gcttactggc acctatgttt 1501 ataaccatct cactcctctt cgggactggg cgcacaacgg cttgcgagat ctggccgtgg 1561 ctgtagagcc agtcgtcttc tcccaaatgg agaccaagct catcacgtgg ggggcagata 1621 ccgccgcgtg cggtgacatc atcaacggct tgcctgtttc cgcccgcagg ggccgggaga 1681 tactgctcgg gccagccgat ggaatggtct ccaaggggtg gaggttgctg gcgcccatca 1741 cggcgtacgc ccagcagaca aggggcctcc tagggtgcat aatcaccagc ctaactggcc 1801 gggacaaaaa ccaagtggag ggtgaggtcc agattgtgtc aactgctgcc caaaccttcc 1861 tggcaacgtg catcaatggg gtgtgctgga ctgtctacca cggggccgga acgaggacca 1921 tcgcgtcacc caagggtcct gtcatccaga tgtataccaa tgtagaccaa gaccttgtgg 1981 gctggcccgc tccgcaaggt agccgctcat tgacaccctg cacttgcggc tcctcggacc 2041 tttacctggt cacgaggcac gccgatgtca ttcccgtgcg ccggcggggt gatagcaggg 2101 gcagcctgct gtcgccccgg cccatttcct acttgaaagg ctcctcgggg ggtccgctgt 2161 tgtgccccgc ggggcacgcc gtgggcatat ttagggccgc ggtgtgcacc cgtggagtgg 2221 ctaaggcggt ggactttatc cctgtggaga acctagagac aaccatgagg tccccggtgt 2281 tcacggataa ctcctctcca ccagtagtgc cccagagctt ccaggtggct cacctccatg 2341 ctcccacagg cagcggcaaa agcaccaagg tcccggctgc atatgcagct cagggctata 2401 aggtgctagt actcaacccc tctgttgctg caacactggg ctttggtgct tacatgtcca 2461 aggctcatgg gatcgatcct aacatcagga ccggggtgag aacaattacc actggcagcc 2521 ccatcacgta ctccacctac ggcaagttcc ttgccgacgg cgggtgctcg gggggcgctt 2581 atgacataat aatttgtgac gagtgccact ccacggatgc cacatccatc ttgggcatcg 2641 gcactgtcct tgaccaagca gagactgcgg gggcgagact ggttgtgctc gccaccgcca 2701 cccctccggg ctccgtcact gtgccccatc ccaacatcga ggaggttgct ctgtccacca 2761 ccggagagat ccctttttac ggcaaggcta tccccctcga agtaatcaag ggggggagac 2821 atctcatctt ctgtcattca aagaagaagt gcgacgaact cgccgcaaag ctggtcgcat 2881 tgggcatcaa tgccgtggcc tactaccgcg gtcttgacgt gtccgtcatc ccgaccagcg 2941 gcgatgttgt cgtcgtggca accgatgccc tcatgaccgg ctataccggc gacttcgact 3001 cggtgataga ctgcaatacg tgtgtcaccc agacagtcga tttcagcctt gaccctacct 3061 tcaccattga gacaatcacg ctcccccagg atgctgtctc ccgcactcaa cgtcggggca 3121 ggactggcag ggggaagcca ggcatctaca gatttgtggc accgggggag cgcccctccg 3181 gcatgttcga ctcgtccgtc ctctgtgagt gctatgacgc aggctgtgct tggtatgagc 3241 tcacgcccgc cgagactaca gttaggctac gagcgtacat gaacaccccg gggcttcccg 3301 tgtgccagga ccatcttgaa ttttgggagg gcgtctttac aggcctcact catatagatg 3361 cccactttct atcccagaca aagcagagtg gggagaacct tccttacctg gtagcgtacc 3421 aagccaccgt gtgcgctagg gctcaagccc ctcccccatc gtgggaccag atgtggaagt 3481 gtttgattcg cctcaagccc accctccatg ggccaacacc cctgctatac agactgggcg 3541 ctgttcagaa tgaaatcacc ctgacgcacc cagtcaccaa atacatcatg acatgcatgt 3601 cggccgacct ggaggtcgtc acgagcacct gggtgctcgt tggcggcgtc ctggctgctt 3661 tggccgcgta ttgcctgtca acaggctgcg tggtcatagt gggcagggtc gtcttgtccg 3721 ggaagccggc aatcatacct gacagggaag tcctctaccg agagttcgat gagatggaag 3781 agtgctctca gcacttaccg tacatcgagc aagggatgat gctcgccgag cagttcaagc 3841 agaaggccct cggcctcctg cagaccgcgt cccgtcaggc agaggttatc gcccctgctg 3901 tccagaccaa ctggcaaaaa ctcgagacct tctgggcgaa gcatatgtgg aacttcatca 3961 gtgggataca atacttggcg ggcttgtcaa cgctgcctgg taaccccgcc attgcttcat 4021 tgatggcttt tacagctgct gtcaccagcc cactaaccac tagccaaacc ctcctcttca 4081 acatattggg ggggtgggtg gctgcccagc tcgccgcccc cggtgccgct actgcctttg 4141 tgggcgctgg cttagctggc gccgccatcg gcagtgttgg actggggaag gtcctcatag 4201 acatccttgc agggtatggc gcgggcgtgg cgggagctct tgtggcattc aagatcatga 4261 gcggtgaggt cccctccacg gaggacctgg tcaatctact gcccgccatc ctctcgcccg 4321 gagccctcgt agtcggcgtg gtctgtgcag caatactgcg ccggcacgtt ggcccgggcg 4381 agggggcagt gcagtggatg aaccggctga tagccttcgc ctcccggggg aaccatgttt 4441 cccccacgca ctacgtgccg gagagcgatg cagctgcccg cgtcactgcc atactcagca 4501 gcctcactgt aacccagctc ctgaggcgac tgcaccagtg gataagctcg gagtgtacca 4561 ctccatgctc cggttcctgg ctaagggaca tctgggactg gatatgcgag gtgttgagcg 4621 actttaagac ctggctaaaa gctaagctca tgccacagct gcctgggatc ccctttgtgt 4681 cctgccagcg cgggtataag ggggtctggc gagtggacgg catcatgcac actcgctgcc 4741 actgtggagc tgagatcact ggacatgtca aaaacgggac gatgaggatc gtcggtccta 4801 ggacctgcag gaacatgtgg agtgggacct tccccattaa tgcctacacc acgggcccct 4861 gtacccccct tcctgcgccg aactacacgt tcgcgctatg gagggtgtct gcagaggaat 4921 atgtggagat aaggcaggtg ggggacttcc actacgtgac gggtatgact actgacaatc 4981 tcaaatgccc gtgccaggtc ccatcgcccg aatttttcac agaattggac ggggtgcgcc 5041 tacataggtt tgcgcccccc tgcaagccct tgctgcggga ggaggtatca ttcagagtag 5101 gactccacga atacccggta gggtcgcaat taccttgcga gcccgaaccg gacgtggccg 5161 tgttgacgtc catgctcact gatccctccc atataacagc agaggcggcc gggcgaaggt 5221 tggcgagggg atcacccccc tctgtggcca gctcctcggc tagccagcta tccgctccat 5281 ctctcaaggc aacttgcacc gctaaccatg actcccctga tgctgagctc atagaggcca 5341 acctcctatg gaggcaggag atgggcggca acatcaccag ggttgagtca gaaaacaaag 5401 tggtgattct ggactccttc gatccgcttg tggcggagga ggacgagcgg gagatctccg 5461 tacccgcaga aatcctgcgg aagtctcgga gattcgccca ggccctgccc gtttgggcgc 5521 ggccggacta taaccccccg ctagtggaga cgtggaaaaa gcccgactac gaaccacctg 5581 tggtccatgg ctgtccgctt ccacctccaa agtcccctcc tgtgcctccg cctcggaaga 5641 agcggacggt ggtcctcact gaatcaaccc tatctactgc cttggccgag ctcgccacca 5701 gaagctttgg cagctcctca acttccggca ttacgggcga caatacgaca acatcctctg 5761 agcccgcccc ttctggctgc ccccccgact ccgacgctga gtcctattcc tccatgcccc 5821 ccctggaggg ggagcctggg gatccggatc ttagcgacgg gtcatggtca acggtcagta 5881 gtgaggccaa cgcggaggat gtcgtgtgct gctcaatgtc ttactcttgg acaggcgcac 5941 tcgtcacccc gtgcgccgcg gaagaacaga aactgcccat caatgcacta agcaactcgt 6001 tgctacgtca ccacaatttg gtgtattcca ccacctcacg cagtgcttgc caaaggcaga 6061 agaaagtcac atttgacaga ctgcaagttc tggacagcca ttaccaggac gtactcaagg 6121 aggttaaagc agcggcgtca aaagtgaagg ctaacttgct atccgtagag gaagcttgca 6181 gcctgacgcc cccacactca gccaaatcca agtttggtta tggggcaaaa gacgtccgtt 6241 gccatgccag aaaggccgta acccacatca actccgtgtg gaaagacctt ctggaagaca 6301 atgtaacacc aatagacact accatcatgg ctaagaacga ggttttctgc gttcagcctg 6361 agaagggggg tcgtaagcca gctcgtctca tcgtgttccc cgatctgggc gtgcgcgtgt 6421 gcgaaaagat ggctttgtac gacgtggtta caaagctccc cttggccgtg atgggaagct 6481 cctacggatt ccaatactca ccaggacagc gggttgaatt cctcgtgcaa gcgtggaagt 6541 ccaagaaaac cccaatgggg ttctcgtatg atacccgctg ctttgactcc acagtcactg 6601 agagcgacat ccgtacggag gaggcaatct accaatgttg tgacctcgac ccccaagccc 6661 gcgtggccat caagtccctc accgagaggc tttatgttgg gggccctctt accaattcaa 6721 ggggggagaa ctgcggctat cgcaggtgcc gcgcgagcgg cgtactgaca actagctgtg 6781 gtaacaccct cacttgctac atcaaggccc gggcagcctg tcgagccgca gggctccagg 6841 actgcaccat gctcgtgtgt ggcgacgact tagtcgttat ctgtgaaagc gcgggggtcc 6901 aggaggacgc ggcgagcctg agagccttca cggaggctat gaccaggtac tccgcccccc 6961 ctggggaccc cccacaacca gaatacgact tggagctcat aacatcatgc tcctccaacg 7021 tgtcagtcgc ccacgacggc gctggaaaga gggtctacta cctcacccgt gaccctacaa 7081 cccccctcgc gagagctgcg tgggagacag caagacacac tccagtcaat tcctggctag 7141 gcaacataat catgtttgcc cccacactgt gggcgaggat gatactgatg acccatttct 7201 ttagcgtcct tatagccagg gaccagcttg aacaggccct cgattgcgag atctacgggg 7261 cctgctactc catagaacca cttgatctac ctccaatcat tcaaagactc // LOCUS YSCREP1A 1498 bp ds-DNA PLN 25-MAY-1990 DEFINITION S.cerevisiae rep1 protein gene, complete cds. ACCESSION M31942 KEYWORDS rep1 protein. SOURCE S.cerevisiae (strain SKQ-2n) plasmid Scp3 DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1498) AUTHORS Neuville,P., Bonneu,M. and Aigle,M. TITLE 2 micron plasmid heterogeneity in the yeast Saccharomyces cerevisiae: A new sequence for the rep1 gene JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Neuville, 13-FEB-1990. FEATURES from to/span description pept 295 1437 rep1 protein BASE COUNT 513 a 245 c 315 g 425 t ORIGIN 1 aagctttgac aaaaagcatt gagtagctaa ctcttctatg caatctatag ctgttttata 61 aggcatttaa tgacaaattg attgtttaga aacatactag tgaaagtagc cttaatcgcc 121 tctcgaaatt aataatgcag tatggtgtaa aaaatgcaac tcgcgttgct ctactttttt 181 ccgaatttcc gaatacgcag ctggggtgat tgatcgattt cgtatcaaaa gttttgttta 241 taaaaaccgc gaaatccttc ttcaacagat ggatttttac agtgctgaca tacaatgaca 301 acaactataa tagaaaataa ccgaaatatg aatggagaga gactacttgt ttgtattaag 361 caatgtatta tgcagcactt ccaacctatg atatacgatg aaagtagatg tgtaattgaa 421 accacaaggg gaactttttc agttcctgac aattataaga aatataaaac cttagcattt 481 gcttttgttg gacacgtact gaatacagat gacacaccgg taattgaaaa agaactggat 541 tggcctgatc ctgcactagt gtacaataca attgtcgatc gaatcagaaa tcacccagaa 601 ttatcacagt ttatatcggt tgcattcatt agtcaattaa aggcggccat aggggaggtt 661 agatatcaat gtaagaggca cgttaaaccg cagggaaagg gtataagaag gccgaacgta 721 ttttttagat acatggaatc cccgtttgtc aataccaagg ttactgactt cttttcttat 781 cttcgagact ataataaaat tgcctcagag tatcacaata atactaaatt cattctcacg 841 ttttcatgta aagcatattg ggcatctggc caaaactttt ccgccttgaa gaatgttatt 901 aggtgctcca tagttcatga atacatttct aggtttgtgg aaagagaaca gggtcttata 961 ggagatcagg agcaaccacg tgaagaaaac tcttcccctg aactaagtaa tgtgcaacat 1021 gaagtcaata atttaatcga acaagatgcc gcggcggatg aagggttgtg gagtgagata 1081 gattcattat gtgaaaaatg gcagtctgaa gcggaagacc aagctgaagc ggagataata 1141 gccgacagaa taattggaaa tagtcaaagg atggcgaacc ttaaaattcg tcgtacaaag 1201 ttcaaaagtg tcttgtatca tatactaaag gaactaattc aatctcaggg aaccgtaaag 1261 gtttatcgcg gtagtagctt ttcacacgat tctataaaga taagcttaca ttatgaagag 1321 cagcatatta cagccgtatg ggtctaccta acagtgaaat ttgaagagca ttggaaacca 1381 gttgatgtag aagtcgagtt taggtgcaag ttcaaggagc gaaaggtcaa tgagtaggtt 1441 atatagggat atagaagata tagaacatag caaagagctg cttttgagca atgtttgt // LOCUS MUSREN1C 758 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse renin (Ren-1-c) gene, 5' end. ACCESSION M34190 KEYWORDS renin. SOURCE M.musculus (strain C57BL/10) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 758) AUTHORS Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J., Pioli,D. and Brammar,W.J. TITLE The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d, and its upstream region JOURNAL Gene 84, 91-104 (1989) STANDARD simple staff_review FEATURES from to/span description pept 729 > 758 renin (Ren-1-c), exon 1 pre-msg 701 > 758 Ren-1-c mRNA and introns BASE COUNT 208 a 156 c 245 g 149 t ORIGIN 1 tctagagtca ttgggctcag ccacccttcc cacaccccca tgcctgccac cactctgctc 61 tgtgaccagg tcctgcttat cctataccta cctagcttgg tctcacaggc taagatttat 121 cagagctgcc ctgccatggg ccttgttggg cacctggcgt ggggaaataa aggaggtggg 181 cgagactcac cagagctcag agagatgggc gggcagacag caggggaagg cactgggtgg 241 tctggcagct ggaaatgctg ggaggccttc ttgggggaga ttagataaag ctcttcaggg 301 gaaggcctat tccatgactc cagcatggtg agtctagatg aaaggaggta gtctatggtt 361 ttagagcttt attgtagaag agagagagag aaggtagaga agtagaagcc agtcattgcc 421 atgaagaagg aagggggaga aggagagcga aaggtaagag taagaaaaca agagctttag 481 gagagagaca agagagagag gaggaggcaa gcagccaggt aactctgggg gtggagtctg 541 gacagcctac atgactgatg gccacagaat tatggagctg ggtccttggc cagaaaacag 601 gctgcctttc atggtcccac aggccctggg gtaataaatc aaagcagagc ctgtgataca 661 tggtgtgtat aaaagaaggc tcagggggtc tgggctacac agctcttaga aagccttggc 721 tgaaccagat ggacagaagg aggatgcctc tctgggca // LOCUS MUSREN1D 11231 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse renin (Ren-1-d) gene, complete cds. ACCESSION M32352 KEYWORDS renin. SOURCE M.musculus (strain DBA/2J) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 11231) AUTHORS Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J., Pioli,D. and Brammar,W.J. TITLE The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d, and its upstream region JOURNAL Gene 84, 91-104 (1989) STANDARD simple staff_review FEATURES from to/span description pept 1286 1377 renin (Ren-1-d), exon 1 4697 4847 renin, exon 2 5364 5487 renin, exon 3 6056 6174 renin, exon 4 6955 7151 renin, exon 5 8940 9059 renin, exon 6 9487 9631 renin, exon 7 9884 9982 renin, exon 8 10547 10708 renin, exon 9 pre-msg 1258 10887 Ren-1-d mRNA and introns IVS 1378 4696 Ren-1-d intron A IVS 4848 5363 Ren-1-d intron B IVS 5488 6055 Ren-1-d intron C IVS 6175 6954 Ren-1-d intron D IVS 7152 8939 Ren-1-d intron E IVS 9060 9486 Ren-1-d intron F IVS 9632 9883 Ren-1-d intron G IVS 9983 10546 Ren-1-d intron H BASE COUNT 2809 a 2725 c 3051 g 2646 t ORIGIN 1 ggtacttagg gagagtaata ctgccagcca gggtcaggtc acttcgacag ctctgccagg 61 cttctgagag gtacagggaa gaaggacagg gggcaaaagg ataggtctaa gggagtgaag 121 ggccagtagg agggggtcag ctggaagcca gctgcactta agacctcctt tcttttagag 181 ctacccttct gtccaccatc actaacccag gagggacacc caccctctcc ctgatgccag 241 aggttagatg acaggttact tttccaacta ggcagcagcc aaaaggaatc actcatgatc 301 aagatggaag actagggcag agcctaggag gagccagggg tgatttgaag gttaatatag 361 attaatagat aggttaatat ataatattag gttaatatgc aggtctcggg ggcatgagtt 421 ggtagggtct gagttagagg tttgaagcca gggtctatgc tgtctgtacc ctgacctcat 481 gctgaacagc acagctcctc attagcctct gtctaagcta gaggttgcgg ggccaggcca 541 aacagggact ctagagtcat tgggctcagc cacccttccc acacccccat gcctgccacc 601 actctgctct gcgaccaggt tctgcttatc ctatacctac ctaacttggt ctcacaggct 661 agaatttatc aggactgccc tgccatgggc cttgttgggc acctggcatg gggaaataaa 721 ggaggtgggc gagacctgtg tctcaccaga gctcagagtg actggatggg cagacagcag 781 gggaaggcac tgggtggtct ggcagctgga aacgctggga ggccttcttg ggggaaatta 841 gatgaagctc ttcaggggaa ggcctattcc atgactccag catggtgagt ctagatgaaa 901 ggaggtagtc tatggtttta gagctttatt gtagaagaga gagagagaag gtagagaagt 961 agatgccagc cattgccacg aagaaggaag ggggagaagg agagcaaaag gtaagagtaa 1021 gaaaacaaga gctttaggag agagacaaga gagagaggag gaggcaagca gccaggtaac 1081 tctgggggtg gagtctggac agcctacatg actgatggcc acagaattat ggagctgggt 1141 ccttggccag aaaacaggct gcctttcatg gtcccacagg ccctggggta ataaatcaaa 1201 gcagatcctg tgatacatgg tgtgtataaa agaaggctca gggggtctgg gctacacagc 1261 tcttagaaag ccttggctga accagatgga caggaggagg atgcctctct gggcactctt 1321 gttgctctgg agtccttgca ccttcagtct cccaacacgc accgctacct ttgaacggta 1381 acttgggcag agaaggggtg gcagggtaca ggaactggca tcttactaac gccctcagct 1441 gtctatgcgt tgggtcatcc agtccttttt ggccagccag tcagggatgt atgatcctgc 1501 tctgacaatc ctagggaata gcccaaagcc ctagtggacg gccataatta aggggtacca 1561 caacttcccc cttctactac ttttttaaaa ggcgaggggt cactggagtg actgtgtcat 1621 ttgtggtgaa aattttgggt ggagtggaag tatcagaaga aacatgtagg ccaaaggagt 1681 aacatgacta tcagttaatt aatctagata gacattaagt tttccttgtg atcatgtaac 1741 tccccccaag ttcagcagcc gtttggtttg gcgagagagt tcatctgtag ttggaagacc 1801 tcaagtctgt cctctgtggg gttcagctcg tgtctctgga tctcatgtgt gtcttgggag 1861 tcttctgtca gatgaggtgt ctggaggaag gcagctttgt ggtctgtagg tctcaggaga 1921 cctgtctatg ctggatctgc agctgcaggc actgcccttt acttttgtgg ggataaacct 1981 gtggggtaat cacagcagag gagcagaacc ctagcagcag gatgtctgca tagggtggat 2041 ggggaacgtt aagcctgcaa tggtttgctt ggaagcacat cctggcctgt tgttaggatc 2101 ctagaaaagc aagggtgttt gaggaatgta gtcctggaag gcatccttta agtctgcttt 2161 gggtggctaa ggaggaaaat atcatgttaa ttgtgtatct gggataggca gtaacagatt 2221 gtctgcaaaa ggaagctggt tgtcagttaa ggattggatg acggtaatca gcatggttaa 2281 ttcaacccag ggaagcacat tagatggctt ttggccctta aaggaaatca tgaatgctat 2341 ggagaaaaat aaacattaaa ataaagaagc aagaaaattt tggattgaaa agcttactag 2401 cagcagatga gggtaagcag caagctaatc tcaccaatcc tcagtccgcc attcaaaaac 2461 aggatgaaag cagagcctgc ttcaccggct gctttgaaga tttgatgaga ggataagcat 2521 agcacttccg atgccaggcg gggaccggtg taagaagcca tggttacagc atggattacc 2581 acagagtgta ggaaagggaa ggctcctggc agccccacga gggaggatga aacagccatt 2641 tagtgcagtt ccagggctgt aggaacaagg cttcttgagg caggagcagg gcagaggcaa 2701 gagaaggctc acccagcaat ggtctgggga ctatgatgag gaagtcctga aagagtcaag 2761 gcaatgagaa ctgtgaatct gcttggtgct taggagctgg tgggttgctt tcatcctaaa 2821 ttgagtctcc agacagacat cagcacccat gttgcagatg aggatgtcct ccagaaaggt 2881 ctgaggactc actggtcttt gtttccaaag ccctgccatc ttgctcccag catgcagcag 2941 agagcgtggg attaactaga gggcgtgggg ttctgtgctc cctgctctgt gtaaagtatt 3001 gagcaagtca ccttgagggg ctgagcaatg ggctttcttc tcaaggccat tacatgcctt 3061 gcatacacac aatgtccagc ctccacgctt agcctgagct gtgcctgggc gaaaaaacca 3121 tcaaggtgcc agaggacgaa gaggttagtg aatgctcaga accgtgcagc atgccctcag 3181 catcaggtgg ctctatcccc gttgtccctg taggctaggc ttgtggggag cactcagggg 3241 aaatggaccc attggctggc atccttggtg agagcaccta ggtatgggct caggtcttga 3301 aacattaagg atctaggaga ggaatagaag gggttttatg tttattttct tttgggacag 3361 agcctcatta tggagcctgg aactcactat atagaccaga ctggcctcaa acccacagag 3421 atcagccttc ctctgcctcc tgaggactga aattaatgtc atgccctacc acacaacaga 3481 gaggttttct aaaagaagga agggtcataa aagaaacagc aagcattggg tctttttcag 3541 gaaattggtg gagaactaga cctgtcttcc tcccttgaag agtggggaaa taaaacacac 3601 ccatctggtg agatggctgg ggtcaactaa cacaagagtg gaaatacttg caagccgtgc 3661 taggactaag tcatttctgc tggagattac tcggataggt gtgaaatatg ggcaaagctg 3721 ggaggtgagg cacggagtgc tgggaggaag gttagggctc acactggaga agaaagaggg 3781 ctctctggat gtctgggaaa gccctggctc aggactggtg gcatcttcct ccaactgcaa 3841 aacagtctgt gtggtcccgg cagtcctggg taggaaagac agacagacag agcttggact 3901 gtctctgttt ggcatttctg agtcatcttc atccttgtga accttcaatg tctccgccac 3961 caccgccgga aaccagggag ctagatgtca tgtggtttgt atgagggtgc gtgccaggca 4021 gctggcctgc ggccaggagt agctcaggga gctgctgttc taaatgaaat ctcttgaatt 4081 ttgccagtca ataatgattg gaggtgagga ggggcccagg attcaggccc aagacacatc 4141 agtccccagg tgaggcacac cggcaagcta tagacactgg cctatccaat ccaagtcaag 4201 ctaggttcat tgggcacata tcacctggct ctgtgatacc ccagggagca acaagtcctt 4261 gcagcctgct gtgggcatct gctgccactt acaggccatc cgtggtactg ccttgcccgt 4321 caatccaagg agatggcaag aggctgtata gaccttttat gttcctctcc agaacagtga 4381 ggaagagtag gtgttcaaca cacgatgatg ataatggtag atagatgaac agacggactg 4441 actgatgaac agacgggtgg atggacagac ggatggacgg atgaacaaac tgctttatct 4501 aactaaaata agatctggga attacttagt accagaataa gtgactccaa gaaaccccag 4561 ggaactaaaa gtgattcgta attgcaatgg gagagggtta gtgttagacg taccaaaaag 4621 caaggcaaca tctgggctct gtcaacgttg gagcactgca cgactcttct gtttttcttt 4681 ctctaccctg gagcagaatc ccactcaaga aaatgccctc tgtccgggaa atcctggagg 4741 agcggggagt ggacatgacc aggctcagtg ctgaaagggg cgtattcaca aagaggcctt 4801 ccttgatcaa tcttacctcc cccgtggtcc tcaccaacta cctgaatgtg agtcctggct 4861 catctgctat ctccttttct ctctgggact agcaggctcc cactccttcc acaccatgtt 4921 tgactgagag ccaaagagca ccccatagaa ctcctattta ccagccgaga agtctgatgg 4981 ggaggggctg ttgtgattgt tttgttgttt ggtttggttt gtttttgaga cagggtctca 5041 ctgtgtagcc tggcaagcct ggaactcact ttgtagacca gactgacctc gaactcacag 5101 agttctacct acctctgcct cccaggtgct gggattaaag catgtgacat catgcccagc 5161 aattttaatt gctttgatta agcttgcccc agcaagcctc ctgtatccca ggccagcaaa 5221 agacaatgct gagtcctcac atctcagcaa aaagaaagcc aagagccgag aaggggtcca 5281 gacaaccccc caaaccagag gccccaagcc ctgaagtcac ccccatctct ctgggcccca 5341 ttttcctttt gcccccgcca cagacccagt actacggcga gattggcatc ggtaccccac 5401 cccagacctt caaagtcatc tttgacacgg gttcggccaa cctctgggtg ccctccacca 5461 agtgcagccg cctctacctt gcttgtggta agagtcaaga gtcagcctcg tgcctctccc 5521 cagctctgcc tgctgcacct atgacctgct tctcctcagc tccacaccac tgactgagga 5581 agagagagga agtgtctctc ttctttttga aatggcagcc aagttaataa gccagctttg 5641 ggaatgcctg ggcacctgga aggccttttg ccatgtccca aggattcaga caggaacaaa 5701 gtgtaaaccc actccaggtc ttgaggcttt ttcgatgctc ccatgctgag tacttactct 5761 gtgcccagaa tcctctttag cactatatca gtgttagcaa tggctttaga actctgtgaa 5821 aaaaatgtct cagaaagcac gagcacataa gacgaaaaca aaaaaaacca ctatccttgc 5881 ctggtttaca tttttccaag ctggagactc aggaaactaa acacacacac acacacacac 5941 acacacacac acacacacct aagtcttcaa ccatgtttat catttgtcct gcctctgagc 6001 cttctatgta aatcaatctc tcttgtgctg actggggcct gtaaccctct gctagggatt 6061 cacagcctct atgagtcctc tgactcctcc agctacatgg agaacgggtc cgacttcacc 6121 atccactacg gatcaggaag agtcaaaggc ttcctcagcc aggacgtggt gactgtgagt 6181 aggatcgcct ctatttacca cagggcccat gtggttccag acaggtatac tccagccctc 6241 ctccctaccc ctcccactcc tctgcaggag cctgggaagt tgaagtgcag cctccttggc 6301 agcttcttct agctgaagct ccgaagccaa ttgcaggaga gccaaacctg ctgacactga 6361 tgcacaaggc tttgcatcta acttgcatgt ggatgtgtgc ctaaacccca cagtttcctt 6421 cctagtaacc agtctgactt tgctaattga gaccaggttt ggagtactga gattaggaat 6481 caggggtttg tgtccgggca gttttgattt ctttccttca gtgtctaagc ctgttctggc 6541 atatctcagc cccgctgtgt cagaggaaga gaaagacaca tacagaggtc agaggaaatc 6601 agtgcgagcc cttccctcgc actgcccccg cggaatctgg ggagggcaga aggagccctg 6661 gtaatgacga ggcatcctca gcctcggctg gtgggctagc acgaaagcgg gcctctagga 6721 gagttccggt cacctgtctg gcaggacctc tttctcagcc tttatagcat gagtcacccc 6781 catctctctg ggccccaagc atagaggtgt tgtcagtgtg tgtgcagact tgacggaata 6841 gtcttaactg ctgtacagag cgtgtgtatc tctgctgccc tggtatgaaa gccagagcag 6901 atcccagcca aggttaggca gtaacctatg tcccttctgg ttcttctctc acaggtgggt 6961 ggaatcactg tgacacagac ctttggagag gtcaccgagc tgcccctgat ccctttcatg 7021 ctggccaagt ttgacggtgt cctaggcatg ggctttcctg ctcaggccgt tggcggggtt 7081 acccctgtct ttgaccacat tctctcccag ggggtgctga aggaggaagt gttctctgtc 7141 tactacaaca ggtgggcctt tgacaccaag gccagggcag ggaggggtgg agagcaggga 7201 gagcagccac cgtccagttc aggatttata tgcttctaat taggtggttc tcagtcccca 7261 tctagaacca cagtagccaa ttagaacaaa tggaagtcaa gcattcggtc cccagggact 7321 ggccacacag acacagccag cttcatgcct ttgtttatct gttctcattg atccggttag 7381 tgctctgtta ccatgtgcta tggtaggctc caggggtgga cagtgaacag acgacccctg 7441 ccctccatcc tcacagagct cactgcccag caggggagat gagaggtaca acacaatgag 7501 tgccattcta aagctgcaga tggttttcag ttctaggcca ggattctggg cagttcagcc 7561 aggcctgggg aaactgtggt cagtgcccca gcaggggtgg gatctgctgc cagtggctgg 7621 gtttagaaat atggtctaag ctggaggtaa gatctttacg ggtaagaata gaggccaggg 7681 gtggtgagag tggaagggag tggtgtcctg gaacaatctt agaggaagca ggagtctctg 7741 gatcatgact ctggcagact tctgtgcaaa gaggggcatt gagccaatga caccactcgg 7801 aggctcctgc agtcaatggt cttgggactg tgtttcaaag agaccagaaa aacaaagtgt 7861 ctaagagtgg aagtagatgt cagaaagggg gaggggggac tccgcaaaat gtgggcagtt 7921 ctggaaattc aacagggaag gagatgtggg gcagagaagc gggagagggg aggtgggtgc 7981 tctaaacgct agaggatgct tgagcaatgc tgtgggtgga agaaggagag ctgtggaaaa 8041 cacatgcatg aggctgagaa gcaggagccc tggaagggat gaggacccag caacaggtag 8101 caagcagtcc tgggaagcag agacctgctt cctttccctt gggctgggaa gaaaggaaga 8161 tctagataag tcactgactc ttgcacagaa cctgtgtaca cctaacccat gtacatatgt 8221 gcacatacaa atgtgtgttg tgaattctaa gaaactccat catatacaca tgagtatctt 8281 aaaaccaccc aaggcaatcc tgagcttagg tagcatcatg cccgtggaga gagaagtgag 8341 ggtagtcagc tctggctggg aaggagagcc tggctccagc catttggtga ctcagttgcc 8401 tggggggggg gggtgtgaaa attaacctct accaaacaag ggttgctgat tctttcctac 8461 gaagtctccc ctctgtgtcc ctctttgcct cagcagaaat ctcaaggagg gaggagcccc 8521 atatggcccc aaagctccaa tacttcccag acccttaccc tggaatgata cccaccaccc 8581 acacactcca ttcatccagg cccaactcca ctaaatgtct gccttctccc tagtgtccac 8641 atgcccagga aagacgtgat ttgccacttg tctgcatcta cccaatgtcc ttgagctaga 8701 cagttttctg ggtgcaggtg aacagtccta gaaatacttc tgttacattt agggaggatc 8761 cagcaaggga agatgcaatg gggtctgtgt gcctcctgat gacagcttac agccttgcca 8821 tctacccact aaacccttcc tctatcccaa agtcctctgc cctaccccac accagactct 8881 actctccttc atccctgtat ctcgggtatg tctctaactt ccaatgttct ttcctttagg 8941 ggttcccacc tgctgggggg cgaggtggtg ctaggaggca gtgacccgca gcattaccaa 9001 ggcaattttc actatgtgag catcagcaag actgactcct ggcagatcac gatgaagggg 9061 tgggtcagca agcgtaactc tcccaaccat gcaaactgat gtcactgtgt gattggggtg 9121 ggctgggaat gggatgaggg ggacacttct tagggcacag ccaccatctg cctctctgct 9181 tgctctgtcc taggatgggg gcatgaaaat ttgaggatgg gaaggaaggg actcagggaa 9241 aggaggggga gtccggaggc tacatgctga tgcaaggaac agttagtacc tttccatagt 9301 gtctcggccc tcttccagtt catcatcaga ccatgtatgg cctcctatca cactacccta 9361 aattgggaaa ggcatgcagg cggcgggaaa agggtgcccg acagcttcta cctgcctgcc 9421 ttgtctgtct gtctgactgt ctgcctgcct gcctgcctgc ctgccagtct aactatagct 9481 ctgcagggtg tctgtggggt cttccaccct gctatgtgaa gaaggctgtg cggtagtggt 9541 ggacactggt tcatccttta tctcggctcc tacgagctcc ctgaagttga tcatgcaagc 9601 cctgggagcc aaggagaaga gaatagatga agtaagagat ccgtggggag ggctgggtgg 9661 tggtgagggg tggcagcagc actgtgcggc tccagcctgc aaatcgtcca gacagcactg 9721 gatcggaaac aaagctttcc cctcctcccc tctcccaccc cctgaatgta tggcatgtct 9781 tccatagaag gtgaccagag actggcacag gctctctggc acagggtgct ctagtctgct 9841 tcgaagcttt ccaccctgac aacccctctc tttattcccc cagtatgttg tgaactgtag 9901 ccaggtgccc accctccccg acatttcctt tgacctggga ggcagggcct acacactcag 9961 cagtacggac tacgtgctac aggtgaggct gggacaaggg tgggcagggg cactgaaggg 10021 ggacaagtgt aatgtacaga tcccccagta gactcgtggt tgatgaatac ggctcatgtc 10081 atcatctgga gaatttttag cagtcccagt acccaggcca catacaagtc tcgcctagca 10141 gatcggagct tccacaacag gcatctggga atcagagttt taaaacgact tccccagtag 10201 gttccattat gaagctctgc tgtgactcag tgagccccct ctctccatta ttgagtagcc 10261 ctctttctag tcctccatcc ttttcatgct cccaggctta gagaagttca atgacttgtt 10321 ttgggtcaga agctaagtga gcccgtgagc aaggggtaga cacaccattt gcctgatacc 10381 tactacagtg cctgcctacc ccaggaaggc tgggatttag gataggtaaa acagttctgg 10441 gaggatgtag gagagaggag gcatatactg aagagctgtt gcttcactct ggaggtggga 10501 atgcgttgtg tggcgggcag tagagcaagc ctctgcttct tgccagtatc cctacaggag 10561 agacaagctg tgcacactgg ctctccatgc catggacatc ccaccaccca ctgggcctgt 10621 ctgggtcctg ggtgccacct tcatccgcaa gttctataca gagtttgatc ggcataacaa 10681 tcgcattgga ttcgccttgg cccgctaagg ccctctgcca cccagtaacc ctaggccaag 10741 ccaagctggc actcctgggg gccattttgt ctggctttgt ccccaacata gggacactgg 10801 acacagagac cctaacgagt gtttgcccct tcacctgcac tcacccttcc ctgctttaag 10861 gaaaaaccga ataaagattt catgtttaaa gcctgtttcg gatgggttct ttggagtttg 10921 gaggaggtac acctgcatgg tcacacgtgg agaagactat gacacgtgag gccagagatg 10981 gaggtttgtt agtaaagtgc ttgcttttca agtgtgagaa cctgagtttg attccaggaa 11041 cccacactaa aaccaacaaa caaacaaaca gggctggaga gatggctcag tagttaagtg 11101 cactgagtgt tcttcctgat gtccttgcgt tcaattccca gcaacacacg tggtggctca 11161 caaccatgtg tatggggtct gatgccctct tctggcatgt ctaagagagt gataatgtac 11221 tcctaattaa t // LOCUS MUSREN2D 1116 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse renin (Ren-2-d) gene, 5' end. ACCESSION M34191 KEYWORDS renin. SOURCE M.musculus (strain DBA/2J) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1116) AUTHORS Burt,D.W., Mullins,L.J., George,H., Smith,G., Brooks,J., Pioli,D. and Brammar,W.J. TITLE The nucleotide sequence of a mouse renin-encoding gene, Ren-1-d, and its upstream region JOURNAL Gene 84, 91-104 (1989) STANDARD simple staff_review FEATURES from to/span description pept 1087 > 1116 renin (Ren-2-d), exon 1 pre-msg 1059 > 1116 Ren-2-d mRNA and introns BASE COUNT 309 a 248 c 303 g 256 t ORIGIN 1 tctagagtca ttgggctcag cacccttccc acacccccat gcctgccacc actctgctct 61 gcgaccagtg tcttgcttat cctatacctc acctaagcta catgccataa atcaatgctt 121 gttattcaag gcatcatctc ctttgggggg cacttctcaa aaagtcctct cctgtcccta 181 gatatctgag actcctcagt ggccttccat ttagaagggt gttgtctctg aagtaaaaac 241 ttcagttcct ctcacaggct aagatttatc agagctgcct gccatgggcc ttgttggcac 301 tggcgtggga ataagggggt ggcgagactg tgtcccgcag agctcagaga ctgggcaggc 361 agacagcagg agaagacact gggtggtctg gcagctggaa acgctgggag gccttcttgc 421 gggagattag atgaggctat tcaggggaag gcctattcca tttctccagc atggtgagtc 481 tagatgaaaa gaggtagtga aatagaggcc agccattgcc acaaagaagg taagagtaag 541 agaacaagag cttaagaaag agccaagatg ggctggagag atggctcagt gggtaagagc 601 acccgactgc tcttccaaag gtccgaagtt caaatcccag caaccacatg gtggctcaca 661 accatctcat aatgagatct gactccctct tcttggagtg tctgaagaca gctacagtat 721 atttacatat aataaaaata aatttaaaaa ataaaaaaat aaaaataaat ttaagagaga 781 gagagaggag gagccaagca gccccattta tagtgggctg ggctaccttg ctgttgtagg 841 gtaactgtgg ggaggagcaa acctggctat tgccaggtaa ctctgggggg tggagttacc 901 ttgactactt gactgatggc cacagaatga tggaggtgga tccttggcca gaaaacaggc 961 tgcgtttcat ggtcccacag gccctggggt aataaatcag agcagagcct gtgatacatg 1021 gtgtgtataa aagaaggctc agggggtctg ggctacacag ctcttagaaa gccttggctg 1081 aaccagatgg acaggaggag gatgcctctc tgggca // LOCUS HUMASPX 7787 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human nonerythroid alpha-spectrin (SPTAN1) mRNA, complete cds. ACCESSION J05243 KEYWORDS alpha-fodrin; nonerythroid alpha-spectrin. SOURCE Human lung fibroblast cell line WI38, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 7787) AUTHORS Moon,R.T. and McMahon,A.P. TITLE Generation of diversity of nonerythroid spectrins: Multiple polypeptides are predicted by sequence analysis of cDNAs encompassing the coding region of human nonerythroid alpha- spectrin JOURNAL J. Biol. Chem. 265, 4427-4433 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.T.Moon, 12-DEC-1989. FEATURES from to/span description pept 103 7521 nonerythroid alpha-spectrin /hgml_locus_uid="LK0022P" /nomgen="SPTAN1" /map="9q33-q34" BASE COUNT 2147 a 1926 c 2239 g 1475 t ORIGIN 1 gaattcgggg aacggtgtgg agcggaggcc gcggaggctc ctcggtcctt cagcacccct 61 cggcccgacg cacccacgcc cctcaccccc cgagagccga aaatggaccc aagtggggtc 121 aaagtgctgg aaacagcaga ggacatccag gagaggcggc agcaggtcct agaccgatac 181 caccgcttca aggaactctc aacccttagg cgtcagaagc tggaagattc ctatcgattc 241 cagttctttc aaagagatgc tgaagagctg gagaaatgga tacaggaaaa acttcagatt 301 gcatctgatg agaattataa agacccaacc aacttgcagg gaaagcttca gaagcatcaa 361 gcatttgaag ctgaagtgca ggccaactca ggagccattg ttaagctgga tgaaactgga 421 aacctgatga tctcagaagg gcattttgca tctgaaacca tacggacccg tttgatggag 481 ctgcaccgcc agtgggaatt acttttggag aagatgcgag aaaaaggaat caaattgctg 541 caggcccaga acttggtgca gtacttacga gaatgtgagg acgtgatgga ctggatcaat 601 gacaaggaag caattgttac ttctgaagag ctgggccagg atctggagca tgtagaggtt 661 ttacagaaga aatttgaaga gtttcaaaca gatatggctg ctcatgaaga aagagttaat 721 gaagtgaacc agtttgctgc caaactcata caggagcagc accctgagga ggaactgatc 781 aagactaagc aggatgaagt caatgcagcc tggcagcggc tgaagggcct ggctctgcag 841 aggcagggga agctctttgg ggcagcagaa gttcagcgct ttaacaggga tgtggatgag 901 actatcagtt ggattaagga aaaggagcag ttaatggcct ctgatgattt tggccgagac 961 ctggcaagtg ttcaggctct gcttcggaag cacgagggtc tggagagaga tcttgctgct 1021 ctagaagaca aggtcaaagc cctgtgtgct gaggctgacc gcctgcaaca gtcccaccct 1081 ctgagtgcaa cacagattca agtgaagcga gaggaactga ttacaaactg ggagcagatc 1141 cgcaccttgg cggcagagag acatgcacgg ctcaatgatt catacaggct tcaacgcttc 1201 cttgctgact tccgtgacct caccagctgg gtgactgaga tgaaagccct catcaatgca 1261 gatgagcttg ccagtgatgt ggctggggct gaagccctgc tagatagaca ccaagagcac 1321 aagggtgaaa ttgatgccca tgaagacagc ttcaaatctg cagatgaatc tggacaggca 1381 ctgcttgctg ctggtcacta tgcctcagat gaagtgaggg agaagctgac cgtcctttcc 1441 gaggagagag cggcgctgct ggagctgtgg gagctgcgca ggcagcagta cgagcagtgc 1501 atggacctgc agctcttcta ccgggacact gagcaggtgg acaactggat gagcaagcag 1561 gaggcgttcc tgttgaatga agacttggga gatttcttgg atagtgtgga agcgcttctt 1621 aagaagcacg aagactttga gaaatccctt agtgcccagg aggaaaagat tacagcatta 1681 gatgaatttg caaccaagct aattcagaac aaccactatg caatggaaga tgtggccact 1741 cgccgagacg ctctgttgag ccgccgcaat gcccttcacg agagagccat gcgtcgccgg 1801 gcccagctag ccgattcttt ccatctgcag cagtttttcc gtgattctga tgagctcaag 1861 agttgggtga atgagaagat gaaaactgcc acagatgaag cttataaaga tccatccaac 1921 ctacaaggaa aagtacagaa gcatcaggct tttgaggctg agctctcagc aaaccagagc 1981 cgaattgatg ccttggagaa agctggccaa aagctgattg atgtcaacca ctatgccaag 2041 gatgaagtgg cagctcgtat gaatgaggtg atcagtttgt ggaagaaact gctagaggcc 2101 actgaactga aaggaataaa gcttcgtgaa gccaaccagc aacagcaatt taatcgcaat 2161 gttgaggata ttgaattgtg gctatatgaa gtagaaggtc acttggcttc ggatgattac 2221 ggcaaagatc ttaccaatgt gcagaacctc cagaagaaac atgccctgct agaggcagat 2281 gtggctgctc accaggaccg aattgatggc gtcaccattc aggcccgcca gttccaagat 2341 gctggccatt ttgatgcaga aaacatcaag aagaaacagg aagccctcgt ggctcgctat 2401 gaggcactca aggagcccat ggttgcccgg aagcagaagc tggccgattc tctgcggttg 2461 cagcagctct tccgggatgt tgaggatgag gagacgtgga ttcgagagaa agagcccatt 2521 gccgcatcta ccaacagagg taaggattta attggggtcc agaatctgct aaagaaacat 2581 caagccttac aagcagaaat tgctggacat gaaccacgca tcaaagcagt tacacagaag 2641 gggaatgcca tggtggagga aggccatttt gctgcagagg atgtgaaggc caagcttcac 2701 gagctgaacc aaaagtggga ggcactgaaa gcaaaagctt cccagcgtcg gcaggacctg 2761 gaggactctc tgcaggccca gcagtacttt gctgatgcta acgaggctga atcctggatg 2821 cgggagaagg aacccattgt gggcagcact gactatggca aggacgaaga ctctgctgag 2881 gctctactga agaaacacga agctttgatg tcagatctca gtgcctacgg cagcagcatc 2941 caggctttgc gagaacaagc acagtcctgc cggcaacaag tggcccccac ggatgatgag 3001 actgggaagg agctggtctt ggctctctac gactatcagg agaagagtcc ccgagaggtc 3061 accatgaaga agggagatat ccttacctta ctcaacagca ccaacaagga ttggtggaaa 3121 gtggaagtga acgatcgtca gggttttgtg ccggctgcgt acgtgaagaa attggacccc 3181 gcccagtcag cctcccggga gaatctcctg gaggagcaag gcagcatagc actgcggcag 3241 gagcagattg acaatcagac acgcataact aaggaggccg gcagtgtatc tctgcgtatg 3301 aagcaggtgg aagaactata tcattctctg ctggaactgg gtgagaagcg taaaggcatg 3361 ttggagaaga gttgcaagaa gtttatgttg ttccgtgaag cgaatgaact acagcaatgg 3421 atcaatgaga aggaagccgc tctgacaagt gaggaggtcg gagcagactt ggagcaggtt 3481 gaggtgctcc agaagaagtt tgatgacttc cagaaggacc tgaaggccaa tgagtcacgg 3541 ttgaaggaca ttaacaaggt agctgaagac ctggagtctg aaggtcttat ggcagaggag 3601 gtgcaggctg tgcaacaaca ggaagtgtat ggcatgatgc ccagggatga aactgattcc 3661 aagacagcct ccccgtggaa gtctgctcgt ctgatggttc acaccgtggc cacctttaat 3721 tccatcaagg agctgaatga gcgctggcgg tccctacagc agctggccga ggaacggagc 3781 cagctcttgg gcagcgccca tgaagtacag aggttccaca gagatgctga tgaaaccaaa 3841 gaatggattg aagagaagaa tcaagctcta aacacagaca attatggaca tgatctcgcc 3901 agtgtccagg ccctgcaacg caagcatgag ggcttcgaga gggaccttgc ggctctcggt 3961 gacaaggtaa actcccttgg tgaaacagca gagcgcctga cccagtccca tcccgagtca 4021 gcagaagacc tgcaggaaaa gtgcacagag ttaaaccagg cctggagcag cctggggaaa 4081 cgtgcagatc agcgcaaggc aaagttgggt gactcccacg acctgcagcg cttccttagc 4141 gatttccggg acctcatgtc ttggatcaat ggaatacggg ggttggtgtc ctcagatgag 4201 ctagccaagg atgtcaccgg agctgaggca ttgctggagc gacaccagga acaccggaca 4261 gaaatcgatg ccagggctgg cactttccag gcatttgagc agtttggaca gcagctgttg 4321 gctcacggac actatgccag ccctgagatc aagcagaaac ttgatattct tgaccaggag 4381 cgtgcagacc tggagaaggc ctgggttcag cgcaggatga tgctggatca gtgccttgaa 4441 ctgcagctgt tccatcggga ctgtgagcaa gctgagaact ggatggctgc ccgggaggcc 4501 ttcttgaata ccgaagacaa aggagactca ctggacagcg tagaggctct gatcaaaaaa 4561 catgaagact ttgacaaagc gattaacgtc caggaagaga agattgctgc tctgcaggcc 4621 tttgccgacc agctcatcgc tgccggccat tatgccaagg gagacatttc tagccggcgc 4681 aatgaggtct tggacaggtg gcgacgtctg aaagcccaga tgattgagaa aaggtcaaag 4741 ctaggagaat ctcaaaccct ccaacagttc agccgggatg tggatgagat tgaggcttgg 4801 atcagtgaaa aattgcaaac agcgagtgat gagtcgtaca aggatcccac caacatccag 4861 agcaagcacc agaagcacca ggcttttgaa gcagagctgc atgccaacgc tgaccggatc 4921 cgtggggtta tcgacatggg caactccctc attgaacgtg gagcctgtgc cggcaatgag 4981 gatgctgtca aggcccgcct ggctgcctta gctgaccagt ggcaattctt ggtgcaaaag 5041 tcagcggaaa agagccagaa actgaaagaa gccaacaagc agcagaactt caacacaggg 5101 atcaaggaca ttgcattctg gctgtctgag gtggaggccc tgctggcatc cgaagattat 5161 ggcaaagacc tggcttctgt gaacaacctg ctgaaaaagc atcaactgct ggaagcagat 5221 atatctgccc atgaggatcg cctgaaggac ctgaacagcc aggcagacag cctgatgacc 5281 agcagtgcct tcgacacctc ccaagtaaag gacaagaggg acaccatcaa cgggcgcttc 5341 cagaagatca agagcatggc ggcctcccgg cgagccaagc tgaatgaatc ccatcgcctg 5401 caccagttct tccgggacat ggatgacgag gagtcctgga tcaaggagaa gaagctgctg 5461 gtgggctcag aggactacgg ccgggaccta actggcgtgc agaacctgag gaagaagcac 5521 aagcggctgg aagcagaact ggctgcgcat gagccggcta ttcagggtgt cctggacact 5581 ggcaagaagc tgtccgatga caacaccatc gggaaagagg agatccagca gcggctggcg 5641 cagtttgtgg agcactggaa agagctgaag cagctggcag ctgcccgggg tcagcggctg 5701 gaagagtcct tggaatatca gcagtttgta gccaatgtgg aagaggaaga agcctggatc 5761 aatgagaaaa tgaccctggt ggccagcgaa gattatggcg acactcttgc cgccatccag 5821 ggcttactga agaaacatga agcttttgag acagccttca ccgtccacaa ggatcgcgtg 5881 aatgatgtct gcaccaatgg acaagacctc attaagaaga acaatcacca tgaggagaac 5941 atctcttcaa agatgaaggg cctgaacggg aaagtgtcag acctggagaa agctgcagcc 6001 cagagaaagg cgaacgtgga tgagaactcg gccttccttc agttcaactg gaaggcggac 6061 gtggtggagt cctggatcgg tgaaaaggag aacagcttga agacagatga ttatggccga 6121 gacctgtctt ctgtgcagac gctcctcacc aaacaggaaa cttttgacgc tgggctgcag 6181 gccttccagc aggaaggcat tgccaacatc actgccctca aagatcagct tctcgccgcc 6241 aaacacgttc agtccaaggc catcgaggcc cggcacgcct ccctcatgaa gaggtggagc 6301 cagcttctgg ccaactcagc cgcccgcaag aagaagcttc tggaggctca gagtcacttc 6361 cgcaaggtgg aggacctctt cctgaccttc gccaaaaagg cttctgcctt caacagctgg 6421 tttgaaaatg cagaggagga cttaacagac cccgtgcgct gcaactcctt ggaagaaatc 6481 aaagctttgc gcgaggccca cgacgccttc cgctcctccc tcagctctgc ccaggctgac 6541 ttcaaccagc tggccgagct ggaccgccag atcaagagct tccgcgtagc ctccaacccc 6601 tacacctggt ttaccatgga ggccctggag gagacctgga ggaacctaca gaaaatcatc 6661 aaggagaggg agctggagct gcagaaggaa cagcggcggc aggaggagaa cgacaagctg 6721 cgccaggagt ttgcccagca cgccaacgcc ttccaccagt ggatccaaga gaccaggaca 6781 tacctcctcg atgggtcctg tatggtggaa gagtcgggga ccctcgaatc ccagcttgaa 6841 gctaccaaac gcaagcacca ggaaatccga gccatgagaa gtcagctcaa aaagatcgag 6901 gacctggggg ccgccatgga ggaggccctc atcctggaca acaagtacac ggagcacagc 6961 accgtgggcc tcgcccagca gtgggaccag ctggaccagc tgggcatgcg catgcagcac 7021 aacctggagc agcagatcca ggccaggaac acaacaggtg tgactgagga ggccctcaaa 7081 gaattcagca tgatgtttaa acactttgac aaggacaagt ctggcaggct gaaccatcag 7141 gatggcaaat cttgcctgcg ctccctgggc tatgacctgc ccatggtgga ggaaggggaa 7201 cctgaccctg agttcgaggc aatcctggac acggtggatc cgaacagaga tggccatgtc 7261 tccttgcaag aatacatggc tttcatgatc agccgcgaaa ctgagaacgt caagtccagc 7321 gaggagattg agagcgcctt ccgggccctc agctcagagg gaaagcctta cgtgaccaag 7381 gaggagctct accagaacct gacccgggaa caagccgact actgcgtctc ccacatgaag 7441 cccatcgtgg acggcaaggg ccgcgagctc cccaccgcgt tcgactacgt ggagttcacc 7501 cgctcgcttt tcgtgaactg agccactccc tgggtcaccc acccctcgct gcttgccctg 7561 cgtcgccttg ctgcatgtcc gctcctctgt gtgctctcac tttccactgt aaccttaagc 7621 ctgcttagct tggaataaga cttaggagaa aatggtgctt cactaacccg cttccggtcc 7681 agtcacaatc atcatgtcac tgtgggaccc agatctgtgt cttgaagcag ctgccctcat 7741 tccgacttca gaaaatcgaa gcagctggcg cctccccttc ggaattc // LOCUS CIPFNRA 1419 bp ss-mRNA PLN 25-MAY-1990 DEFINITION M.crystallinum ferredoxin-NADP+ reductase (fnrA) mRNA, complete cds. ACCESSION M25528 KEYWORDS ferredoxin-NADP+ reductase. SOURCE M.crystallinum, cDNA to mRNA, clone lambdaMC-fnr1. ORGANISM Mesembryanthemum crystallinum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Aizoaceae. REFERENCE 1 (bases 1 to 1419) AUTHORS Michalowski,C.B., Schmitt,J.M. and Bohnert,H.J. TITLE Expression during salt stress and nucleotide sequence of cDNA for ferredoxin-NADP+ reductase from Mesembryanthemum crystallinum JOURNAL Plant Physiol. 89, 817-822 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.J.Bohnert, 13-JUN-1989. FEATURES from to/span description pept 75 1172 ferredoxin-NADP+ reductase precursor (fnrA; EC 1.6.7.1) sigp 75 230 ferredoxin-NADP+ reductase signal peptide matp 231 1169 ferredoxin-NADP+ reductase mRNA 45 1395 fnrA mRNA (alt.) mRNA < 1 1419 fnrA mRNA (alt.) BASE COUNT 416 a 283 c 322 g 398 t ORIGIN 1 tttttttttt ttctggttcc tcttcttctt cttcttcttc ttctgcaaag atatcttctt 61 caatcagctc agcaatggct gccgcagtaa ctgccgcagt ctctttccct tctactaaat 121 ccactcccct ttctaccaga acatcttccg tcatcaccca tgagaaaatt aacttcaaca 181 aggttcctct atactacagg aatgtatcag ttggtgggaa agtgggaacc attagggcag 241 tagcctcaga tgttgaggca cctgttgcaa aggtggagaa gcattcaaag aaaatggagg 301 aaggagtgat agtcaacaag tacaagccca agaaccctta cactggcaga tgcctcctca 361 acactaagat tactggagat gatgctcctg gagagacctg gcacatggtt ttctcccatg 421 aaggagagat tccctacaga gaggggcagt ctgttggagt gattcccgaa ggtattgaca 481 agaatggaaa gccccacaag ttgagattgt actcaattgc cagcaggccc ttgggcgact 541 ttggtgactc taaaactgtt tccttgtgtg taaaacgact tatctacacc aatgacaacg 601 gagaaatagt caagggtgtt tgctcaaact ttttgtgtga tttgaaacct ggtagtgaag 661 tagtgctaac tggaccagtt gggaaggaaa tgctgatgcc aaaagaccct aatgccacaa 721 ttatcatgct tgcaactgga actggtattg ctcctttccg ttcgttcttg tggaaaatgt 781 tctttgagaa gcacgatgac tacaagttca atggtttggc atggctcttc ttgggtgtac 841 ccactagcag ttctttgtta tacaaggagg aatttgagaa gatgaaggag aaagctccag 901 aaaacttcag gttggatttc gctgtgagca gagaacaaac taatgagaaa ggggagaaaa 961 tgtacatcca aacccgtatg gcacaatatg atagagagct atgggaattg ctcaagaaag 1021 acaacactta cgtctacatg tgtgggctga aaggcatgga gaaaggaatc gatgacatta 1081 tggtttcatt ggccgcagaa gatggtattg actggttcga ctacaagaag cagttgaaga 1141 aggcagaaca gtggaacgtg gaagtctact gataaacttg tacaaaactc tcttctttct 1201 ttttgtgaag cagaatatat ctatttgtca tctcccactg caccttgtag ataggtcagt 1261 ttcttttcga tattgttatg aatatcagct aattgcattg tagaagctat agccaaaaaa 1321 gctctgtttt ggagtactca tgtatcgcgt aattatactg gaaaccagtg tcaacacttc 1381 tacactgaga aattcatgag gatgaacaca ttatgctca // LOCUS HUMCAMPPK 3036 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human cAMP-dependent protein kinase type I-alpha subunit, complete cds. ACCESSION M33336 KEYWORDS cAMP-dependent protein kinase; cAMP-dependent protein kinase type I; protein kinase. SOURCE Human testis, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3036) AUTHORS Sandberg,M., Skalhegg,B. and Jahnsen,T. TITLE The two mRNA forms for the type I-alpha regulatory subunit of cAMP-dependent protein kinase from human testis are due to the use of different polyadenylation site signals JOURNAL Biochem. Biophys. Res. Commun. 167, 323-330 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 88 1233 protein kinase type I-alpha subunit /nomgen="PRKAR1" /map="7p13-qter" /hgml_locus_uid="LW0133F" BASE COUNT 821 a 552 c 682 g 981 t ORIGIN 1 gctgggagca aagcgctgag ggagctcggt acgccgccgc ctcgcacccg cagcctcgcg 61 cccgccgccg cccgtcccca gagaaccatg gagtctggca gtaccgccgc cagtgaggag 121 gcacgcagcc ttcgagaatg tgagctctac gtccagaagc ataacattca agcactgctc 181 aaagattcta ttgtgcagtt gtgcactgct cgacctgaga gacccatggc attcctcagg 241 gaatactttg agaggttgga gaaggaggag gcaaaacaga ttcagaatct gcagaaagca 301 ggcactcgta cagactcaag ggaggatgag atttctcctc ctccacccaa cccagtggtt 361 aaaggtagga ggcgacgagg tgctatcagc gctgaggtct acacggagga agatgcggca 421 tcctatgtta gaaaggttat accaaaagat tacaagacaa tggccgcttt agccaaagcc 481 attgaaaaga atgtgctgtt ttcacatctt gatgataatg agagaagtga tatttttgat 541 gccatgtttt cggtctcctt tatcgcagga gagactgtga ttcagcaagg tgatgaaggg 601 gataacttct atgtgattga tcaaggagag acggatgtct atgttaacaa tgaatgggca 661 accagtgttg gggaaggagg gagctttgga gaacttgctt tgatttatgg aacaccgaga 721 gcagccactg tcaaagcaaa gacaaatgtg aaattgtggg gcatcgaccg agacagctat 781 agaagaatcc tcatgggaag cacactgaga aagcggaaga tgtatgagga attccttagt 841 aaagtctcta ttttagagtc tctggacaag tgggaacgtc ttacggtagc tgatgcattg 901 gaaccagtgc agtttgaaga tgggcagaag attgtggtgc agggagaacc aggggatgag 961 ttcttcatta ttttagaggg gtcagctgct gtgctacaac gtcggtcaga aaatgaagag 1021 tttgttgaag tgggaagatt ggggccttct gattattttg gtgaaattgc actactgatg 1081 aatcgtcctc gtgctgccac agttgttgct cgtggcccct tgaagtgcgt taagctggac 1141 cgacctagat ttgaacgtgt tcttggccca tgctcagaca tcctcaaacg aaacatccag 1201 cagtacaaca gttttgtgtc actgtctgtc tgaaatctgc ctcctgtgcc tcccttttct 1261 cctctcccca atccatgctt cactcatgca aactgcttta ttttccctac ttgcagcgcc 1321 aagtggccac tggcatcgca gcttcctgtc tgtttatata ttgaaagttg cttttattgc 1381 accattttca atttggagca ttaactaaat gctcatacac agttaaataa atagaaagag 1441 ttctatggag actttgctgt tactgcttct ctttgtgcag tgttagtatt caccctgggc 1501 agtgagtgcc atgctttttg gtgagggcag atccagcacc tattgaatta ccatagagta 1561 atgatgtaac agtgcaagat tttttttttt aagtgacata attgtccagt tataagcgta 1621 tttagactgt ggccatatat gctgtatttc tttgtagaat aaatggtttc tcattaaact 1681 ctaaagatta gggaaatgga tatagaaaat cttagtatag tagaaagaca tctgcctgta 1741 attaaactag tttaagggtg gaaaaatgaa aatttttgct aattatcaat gggatatgat 1801 tggttcagtt ttttttttcc agagttgttg tttgccaagc taatctgcct ggtttattta 1861 tatcttgtta ttaatgtttc ttctccaatt ctgaaatact tttgagtatg gctatctata 1921 cctgcctttt aagtttgaaa ctaactcata gatgcaaata ttggttagta tttaactaca 1981 tctgcctcgg ctcacaaatt ccgattagac ctttatccag ctagtgccaa ataattgatc 2041 agatgctgaa ttgagaataa gaatttgagg tctacattct tggttgttaa tttagagcgt 2101 ttggttaaag tatgtccttc agctgactcc agtataatct cctctgctca ttaaactgat 2161 tccaggagat tggatttgct gtgactagat acagatggag caaatgtcct aacagagaaa 2221 tagaggtgat gctgctaaag ggagaaatgc caggcggaca aagttcagtg tcgggaattt 2281 tccccgtgac attcactggg gcatgagatt ttggaagaag ttttttactt tggtttagtc 2341 tttttttcct cctttttatt cagctagaat ttctggtggg ttgatggtag ggtataatgt 2401 gtctgtgttg cttcaaattg gtctgaaagg ctatcctgct gaaagtcctg ctttcctatc 2461 tagcatttat tcctctggca aacttttctt tcttttcttt tttaaagtaa acttgtgtat 2521 tgagtcttaa ctgtatttca gtattttcca gccttatgtg ttacattatt ccaatgatac 2581 ccaacagttt atttttatta tttttttaaa caaaatttca cagttctgta atgtaggcac 2641 ttttattttc attgtgattt atatataagg taatgtaggg ttatatttgg gagtgactgc 2701 aagcattttt ccatctgtgt gcaactaact gactctgtta ttgatccctt ctcctgccct 2761 ttcccaggta atttaaattg gtcatggtag atttttttca tagatttgaa aaacttttag 2821 gttgttacca agtatgaagt ataaatctgg ggaagaggtt ttatttacat tttagggtgg 2881 gtaagaaagc caccttgtta caaatttttt aatttccaaa ataatctata ttaaatgagg 2941 gtttctgatc tgtactttgt gtttagctac ctttttatat ttaaaaaatt aaaaatgaaa 3001 attatgttct tacaagctta aagcttgatt tgatct // LOCUS HUMIGHCU 426 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human Ig rearranged H-chain mRNA V-J4-region, hybridoma AE6-5, 5' end. ACCESSION M29468 KEYWORDS immunoglobulin heavy chain; joining exon; processed gene; variable region; variable region subgroup VH-I. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuUCVH, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 426) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept 1 > 426 Ig H-chain V-J4-region precursor /nomgen="IGHV" /map="14q32.33" /hgml_locus_uid="LK0080M" sigp 1 57 Ig H-chain signal peptide matp 58 > 426 Ig H-chain recomb 384 385 V-region DNA end/J4-region DNA start BASE COUNT 93 a 114 c 129 g 90 t ORIGIN 1 atggactgga cctggaggat cctcttcttg gtggcagcag ccacaggagc ccactcccag 61 gtgcagctgg tgcaatctgg ggctgaggtg aagaagcctg gggcctcagt gaaggtctcc 121 tgcgaggctt ctggatacac cttcaccggc cactatatgc actgggtcgg acaggcccct 181 ggacaagggc ttgagtggat ggggtggatc aaccctaaca gtggtggcac aaactatgca 241 cagaagtttc agggcagggt caccatcacc agggacacgt ccatcaacac agcctacatg 301 gagctgagca ggctgagatc tgacgacacg gccgtgtatt actgtgcgag agcctcatat 361 tgtggttatg actgctatta cttctttgac tactggggcc agggaaccct ggtcaccgtc 421 tcctcg // LOCUS HUMIGHCV 489 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human Ig rearranged anti-myelin H-chain mRNA V-J4-region, hybridoma AE6-5, 5' end. ACCESSION M29470 KEYWORDS autoantibody; immunoglobulin heavy chain; joining exon; processed gene; variable region; variable region subgroup VH-III. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuPopVH, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 489) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept 64 > 489 Ig H-chain V-J4-region precursor /nomgen="IGHV" /map="14q32.33" /hgml_locus_uid="LK0080M" sigp 64 120 Ig H-chain signal peptide matp 121 > 489 Ig H-chain recomb 450 451 V-region DNA end/J4-region DNA start BASE COUNT 101 a 121 c 144 g 123 t ORIGIN 1 cccagccctg ggattttcag gtgttttcat ttggtgatca ggactgaaca gagagaactc 61 accatggagt ttgggctgag ctggcttttt cttgtggcta ttttaaaagg tgtccagtgt 121 gaggtgcagc tgttggagtc tgggggaggc ttggtacagc ctgaggggtc cctgagactc 181 tcctgtgcag tctccggatt cacttttagc agctttgcca tgacctgggt ccgccaggct 241 ccagggaagg ggctggagtg ggtctcagct attagtacta gtggtggtag cacatactac 301 gcagagtccg tgaagggccg cttcaccatc tccagagaca attccatgca cacgctgtat 361 ctgcaaatga acagcctgag agccgaggac acggccgtct attactgtgc gaaaggtcct 421 acatattgta gtagaatcag ctgccctccg gactactggg gccagggaac cctggtcacc 481 gtctcctca // LOCUS HUMIGHVW 778 bp ds-DNA PRI 25-MAY-1990 DEFINITION Human Ig germline H-chain pseudogene V-J6-region, hybridoma AE6-5, 5' end. ACCESSION M29466 KEYWORDS germline; immunoglobulin heavy chain; joining exon; variable region; variable region subgroup VH-I. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone II4a, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 257 to 778) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 256) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept.ps 257 302 pseudo-Ig H-chain V-J6-region precursor, exon 1 /nomgen="IGHV" /map="14q32.33" /hgml_locus_uid="LK0080M" 384 > 778 pseudo-Ig H-chain V-J6-region precursor, exon 2 sigp.ps 257 302 pseudo-Ig H-chain signal peptide 384 394 pseudo-Ig H-chain signal peptide matp.ps 395 > 778 pseudo-Ig H-chain recomb 733 734 V-region DNA end/J6-region DNA start signal 157 164 octamer recognition sequence BASE COUNT 193 a 197 c 216 g 172 t ORIGIN 1 aacacagagt tcagggatgg gtaggggatg cgtggcctct aacaggatta cggcttgaac 61 cctcagcttc tacaattgtg tcgtccatgt gtcatgtatt tgctctttct catcctgggt 121 caggaattgg gctattaaat agcatccttc atgaatatgc aaataactga ggtgaatata 181 gatatctgtg tgccctgaga gcatcaccca aaaaccacac ccctccttgg gagaatccct 241 agatcacagc tctaccatgg actggacctg gagcatcctt ttcttggtgg cggcagcaac 301 actaacggac tccccagtcc cggggctgag agagaaacca ggccagtcat gtgagacttc 361 acccatcctg tgtctctcca caggtgccca ctcccaggtt cgactggtgc agtctggagc 421 tgacgtgaag aagcctgggg cctcagtgaa ggtctcctgc aaggcttctg gttacacctt 481 taccagctat gatatcagct gggtgcgaca ggcccctgga caagggcttg agtggatggg 541 atggatcagc gcttttaatg gtaacacaga ctatgcacag aagttccagg gcagggtcac 601 catgaccaca gacacatcca cgagcacagc ctacacggag ctgaggagcc tgagatctga 661 cgacacggcc gtgtattact gtgcgagaga cgggggcttg ggagtggtgg tgatagttga 721 aacctaaagc aagtactaca acgatatgga cgtctggggc caagggacca cggtcacg // LOCUS HUMIGKAW 405 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human Ig rearranged kappa-chain mRNA V-J1-region, hybridoma AE6-5, 5' end. ACCESSION M29467 KEYWORDS joining exon; kappa-immunoglobulin; processed gene. variable region subgroup VK-IV; variable region. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuUCVK, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 405) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept 1 > 405 Ig kappa-chain V-J1-region precursor /nomgen="IGKV" /map="2p12" /hgml_locus_uid="LF0081F" sigp 1 60 Ig kappa-chain signal peptide matp 61 > 405 Ig kappa-chain recomb 369 370 V-region DNA end/J1-region DNA start BASE COUNT 95 a 110 c 101 g 99 t ORIGIN 1 atggtgttgc agacccaggt cttcatttct ctgttgctct ggatctctgg tgcctacggg 61 gacatcgtga tgacccagtc tccagactcc ctggctgtgt ctctgggcga gagggccacc 121 atcaactgca agtccagcca gagtctttta tacagctcca acaataagaa ctacttagct 181 tggtaccagc agaaaccagg acagcctcct aagttgctca tttactgggc atctacccgg 241 gaatccgggg tccctgaccg attcagtggc agcgggtctg ggacagattt cactctcacc 301 atcagcagcc tgcaggctga agatgtggca gtttattact gtcagcaata ttatagtact 361 cctccgatgt tcggccaagg gaccaaggtg gaaatcaaac gaact // LOCUS HUMIGKAX 407 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human Ig rearranged anti-myelin kappa-chain mRNA V-J4-region, hybridoma AE6-5, 5' end. ACCESSION M29469 KEYWORDS autoantibody; joining exon; kappa-immunoglobulin; processed gene; variable region; variable region subgroup VK-IIIa. SOURCE Human (patient POP) hybridoma AE6-5 DNA, clone pHuPopVK, derived from B-CLL cell line UC729-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 407) AUTHORS Spatz,L.A., Wong,K.K., Williams,M., Desai,R., Golier,J., Berman,J.E., Alt,F.W. and Latov,N. TITLE Cloning and sequence analysis of the variable heavy (VH) and light (VL) chain regions of an anti-myelin/DNA antibody from a patient with peripheral neuropathy and chronic lymphocytic leukemia JOURNAL J. Immunol. 144, 2821-2828 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by L.A.Spatz, 26-OCT-1989, for release after publication. Columbia University, Department of Neurology BB-322, 630 W. 168th street, New York, NY 10032. FEATURES from to/span description pept 24 > 407 Ig kappa-chain V-J4-region precursor /nomgen="IGKV" /map="2p12" /hgml_locus_uid="LF0081F" sigp 24 83 Ig kappa-chain signal peptide matp 84 > 407 Ig kappa-chain recomb 368 369 V-region DNA end/J-region DNA start BASE COUNT 95 a 123 c 102 g 87 t ORIGIN 1 ctcagttagg acccagagga accatggaag ccccagctca gcttctcttc ctcctgctac 61 tctggctccc agataccacc ggagaaattg tgttgacaca gtctccagcc accctgtctt 121 tgtctccagg ggaaagagcc accctctcct gcagggccag tcagagtgtt agcagctact 181 tagcctggta ccaacagaaa cctggccagg ctcccaggct cctcatctat gatgcatcca 241 acaaggccac tggcgtccca gccaggttca gtggcagtgg gtctgggaca gacttcactc 301 tcaccatcag cagcctagag cctgaagatt ttgcagttta ttactgtcag cagagtagca 361 agtggcctct cactttcggc ggagggacca aggtggagat caaagga // LOCUS HUMLSP1A 1631 bp ss-mRNA PRI 25-MAY-1990 DEFINITION Human lymphocyte-specific protein 1 (LSP1) mRNA, complete cds. ACCESSION M33552 KEYWORDS lymphocyte-specific protein; phosphoprotein. SOURCE Human CTL AH2 cell line, cDNA to mRNA, clones hlsp1.[3,5-1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1631) AUTHORS Jongstra-Bilen,J., Young,A.J., Chong,R. and Jongstra,J. TITLE Human and mouse LSP1 genes code for highly conserved phosphoproteins JOURNAL J. Immunol. 144, 1104-1110 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 109 1128 lymphocyte-specific protein 1 (LSP1) mRNA < 1 1631 LSP1 mRNA BASE COUNT 373 a 496 c 500 g 262 t ORIGIN 1 tttaacaccc atagtaggcc taaaagcagc caccaattaa gaaagccaca gcacgtacac 61 ccactccagg gatctgccag caccctgtgg ggcccagact acaggctgat ggcggaggct 121 tcgagtgacc cgggtgctga ggagcgggaa gagttgctgg ggcccactgc tcagtggagc 181 gtggaggacg aggaggaggc cgtccacgag caatgccagc atgagagaga caggcagctt 241 caggcccagg acgaggaggg aggcggccat gtccccgagc ggccgaagca ggagatgctc 301 ctcagcctga agccctcgga ggcccctgaa ctggatgagg acgagggctt tggcgactgg 361 tcccagaggc cagagcagcg gcagcagcac gagggggcgc agggcgcctt ggacagcgga 421 gagccccccc agtgcaggag tcctgagggg gagcaagagg acaggcccgg cctgcatgcc 481 tacgaaaagg aggacagtga tgaagtccac ctggaggagt tgagtctgag caaggagggg 541 ccaggcccag aggacactgt ccaggacaac ctgggggccg caggggctga ggaggaacag 601 gaggagcacc agaaatgtca gcagcccagg acacccagcc ccttggtctt ggaggggacc 661 atcgaacaga gctcgcctcc cctgagccct accaccaaac tcatcgacag gaccgagtcc 721 ctaaaccgct ccatagagaa gagtaacagt gtgaagaaat cccagccaga cttgcccatc 781 tccaagattg atcagtggct ggaacaatac acccaggcca tcgagaccgc tggccggacc 841 cccaagctag cccgccaggc ctccatagag ctgcccagca tggctgtggc cagtaccaag 901 agtcggtggg agacgggtga ggtacaggct cagtctgcgg ccaagactcc gtcctgcaag 961 gatattgtgg ctggagacat gagcaagaaa agcctctggg agcagaaggg aggctccaag 1021 acctcatcaa caattaagag caccccatct gggaagaggt ataagtttgt ggccaccggg 1081 catgggaagt atgagaaggt gcttgtggaa gggggcccgg ctccctaggc gtcccatctc 1141 gcttcctggg tctgcaggtc cagccggctg gcaccctcca tgtacccagg ggagattcca 1201 gccagacacc cgccccccgg ccctggctaa gaagttgctt cctgttgcca gcatgaccta 1261 ccctcgcctc tttgatgcca tccgctgcca cctccttttg ctcctggacc ctttagcctc 1321 tctgcccttc cactctctga ccaccgcccc cgccctcccc acccagctcc gcttcttgtt 1381 acttggggga ggaaagaaac tcctgatcat tggccaaagg gacttacccc tggagaggcc 1441 aagtgccttc taggaagtta ggaggttgag gcacagcctg tgcagagagg gtgggtcacc 1501 cccccagatc caaggggaaa ctgcaggtca agggctgata acggccatgc aggatgcttg 1561 atgctgcgtc ccccgctgct tgccgccccc caccccgcca ttttgtataa taaagctccc 1621 tgtgtattct c // LOCUS MUSVG1A 1985 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse Vgr-1 protein mRNA, complete cds. ACCESSION J04566 KEYWORDS Vgr-1 protein. SOURCE Mouse (strain ICR) 8.5 day embryo (George), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1985) AUTHORS Lyons,K., Graycar,J.L., Lee,A., Hashmi,S., Lindquist,P.B., Chen,E.Y., Hogan,B.L.M. and Derynck,R. TITLE Vgr-1, a mammalian gene related to Xenopus Vg-1, is a member of the transforming growth factor beta gene superfamily JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 4554-4558 (1989) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by A.Lee. 22-MAY-1989, for release after publication. FEATURES from to/span description pept 16 1332 Vgr-1 protein BASE COUNT 448 a 566 c 551 g 420 t ORIGIN 1 ccttgaccgt aagacatgag gaagatgcaa aaggagatcc tgtcggtgct ggggcccccg 61 cacaggccgc ggcccctgca cggtctccag cagcctcagc ccccggtgct cccgccacag 121 cagcagcagc agcagcagca gcagcagacg gcccgcgagg agccccctcc agggcggctg 181 aagtccgctc cactcttcat gctggatctc tacaacgccc tgtccaatga cgacgaagag 241 gatggggcat cggagggtgt ggggcaagag cctgggtccc acggaggggc cagctcgtcc 301 cagctcaggc agccgtctcc cggcgctgca cactccttga accgcaagag tctcctggcc 361 ccgggacccg gtggcggtgc gtccccactg actagcgcgc aggacagcgc tttcctcaac 421 gacgcggaca tggtcatgag ctttgtgaac ctggtggagt acgacaagga gttctcccca 481 catcaacgac accacaaaga gttcaagttc aacctatccc agattcctga gggtgaggcg 541 gtgacggctg ctgagttccg cgtctacaag gactgtgtgg tggggagttt taaaaaccaa 601 acctttctta tcagcattta ccaagtcttg caggagcatc agcacagaga ctctgaccta 661 tttttgttgg acacccgggt ggtgtgggcc tcagaagaag gttggctgga atttgacatc 721 acagcaacta gcaatctgtg ggtggtgaca ccgcagcaca acatggggct ccagctgagt 781 gtggtgactc gggatggact ccacgtcaac ccccgtgcgg cgggcctggt gggcagagac 841 ggcccttacg acaagcagcc cttcatggtg gccttcttca aggtgagcga ggtccacgtg 901 cgcaccacca ggtcagcctc cagtcggcgg cggcagcaga gtcgcaaccg gtccacccag 961 tcgcaggacg tgtcccgggg ctccggttct tcagactaca acggcagtga gttaaaaaca 1021 gcttgcaaga agcatgagct ctatgtgagc ttccaggacc tgggatggca ggactggatc 1081 attgcaccca aaggctacgc tgccaactac tgtgatggag agtgttcctt cccactcaac 1141 gcacacatga atgccaccaa ccacgccatt gtacagacct tggtccacct tatgaatccc 1201 gagtacgtcc ccaaaccatg ctgcgcacca accaaactga atgccatctc ggttctttac 1261 ttcgatgata actccaatgt catcttgaaa aagtacagga atatggtcgt gagagcttgt 1321 ggttgccatt aagttgaagc tggtgtgtgt gtgtgggtgg gggcatggtt ctgccttgga 1381 ttcctaacaa caacatctgc cttaaaccac gaacaacagc acagcgaagc gggatggtga 1441 cacacagagg gatcgtgaca cgcagacaca tctcccgctg gtgccttacc cacggaggct 1501 tttatgagga ccttgtcaag ggctttccca gttcctaact gagcagttgc tggtctgcag 1561 gaagctggaa ggcttgtagt acaggcctgg aaactgcagt tacctaatgt tcgcctcccc 1621 caaccccgcc cggagtagtt ttagctttta gatctagctg cttgtggtgt aagtaaactt 1681 gaaggaatat taaatatccc tgggttgaaa gacccggtgg tggctctaca gcacccatcc 1741 cagggagatt tttgcagaca tccgaatgga ggggagaagg gcactctttc aggttccatt 1801 cccagcaagg gcagctcaca caggacctgc agcctggcca tcagcaggct ctgtggaggt 1861 gccttctgtc tactgttgta gttacgtgtt ttgtgttgac tctcggtggt gtgagaatgt 1921 actaatctct gtcaagacaa actgtagcat ttccacccca tcctcctccc tccctcacag 1981 aattc // LOCUS NPAIFP1A 1730 bp ds-DNA VRL 25-MAY-1990 DEFINITION AcNPV mutant with an S.frugiperda insertion element IFP1.6. ACCESSION M32775 KEYWORDS insertion element; insertion element IFP1.6. SOURCE AcNPV (mutant strain AcFP875.2) DNA. ORGANISM Autographa californica nuclear polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 1730) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 118 119 AcNPV DNA end/S.frugiperda DNA start recomb 1683 1684 S.frugiperda DNA end/AcNPV DNA start site 115 118 viral target sequence site 1684 1687 viral target sequence BASE COUNT 610 a 279 c 252 g 589 t ORIGIN 1 tcgtcgattg cgttttgatc aacgacttga gcagagacac gttaatcaac tgttcaaatt 61 gatccatatt aactatatca acccgatgcg tatatggtgc gtaaaatata ttttttaacc 121 ttttgaccgc cagagcataa acgcttcgct gtgccctcca cgccaagcca caaattcaac 181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat 241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta 301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc 361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat 421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta 481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg 541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat 601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa 661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa 721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc 781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc 841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa 901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa 961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta 1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc 1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg 1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat 1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa 1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt 1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca 1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat 1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt 1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa 1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga 1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa 1681 aggttaaccc tcttatactt tgcactctgc gttaatacgc gttcgtgtac // LOCUS NPAIFP2A 2225 bp ds-DNA VRL 25-MAY-1990 DEFINITION AcNPV with an S.frugiperda insertion element IFP2.2. ACCESSION M32779 KEYWORDS insertion element; insertion element IFP2.2. SOURCE AcNPV DNA. ORGANISM Autographa californica nuclear polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 2225) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 26 27 AcNPV DNA end/S.frugiperda DNA start recomb 2191 2192 S.frugiperda DNA end/AcNPV DNA start BASE COUNT 760 a 407 c 349 g 709 t ORIGIN 1 ccgtctcacc ataacaccgt ttttaccaca gaacatataa agagagtaga aatgaaatag 61 ttcctgattt gtatcagaac gcgtgtcgcc gtttctccca ccctttaatt tcactacaag 121 caagagacca atgtctgagc ggattacccc cacaaataaa taacacacat ttaaatctcg 181 aatttgctat tggttgtcac acaaacgcaa tgactgcatt tgacgtacac tctaacacgc 241 ataaaaactc atgtatgcta gggcgagaca aaggtcgatt gttgatatcg ataaaattta 301 agtaatcttg tttttgtttt acaagtttgt tgaatattgt gaaattgtat tcatgcgaaa 361 tagaaaccaa gtaataaaag taaaataaat aaaaaataca taactgaatt ttaaaaccaa 421 ttttattact caataagtta aaaatcagta aaaaataaat aaatcatagg cacttttgaa 481 cgtcaaagca aatataataa tattaataac gtctgtctgt cggtcagccc tctagtgaag 541 ctatttgctc gttccaaagt gtagattcct atggagaaga tcgaagatct gttcgttcga 601 tctttttcaa ttagattcac aatacaattc ttgataacat tgcttttctt tgcatcgatt 661 gcttgaacta tgtgagaaca atgtaaaact aatattcagt caaagcgata gaaaaaaaaa 721 ccttaagaac aacaaaatta atacagtctg gagctgacca gtcccagttg acagcgctcg 781 ttcaccaaca tgacacgtac accagcaccg tccaactcaa ccatgttgca gggaatcgaa 841 cgatccaatg cccgtgccac cgccagtgag acctttgagt gagcatgaca actctaagct 901 gacacaaatg cattctacta gtctaattta gctaattaca gggctctcac atttacaaaa 961 atctgtataa agtacagaac acattgaaat aacatggtta tattatttca ttttttaaat 1021 ttacgaaaga tgatatttta aatatcgcca tttttagaca aaaaaactga tttaatgaga 1081 ctaatttaaa attttctaaa ataaaatcag tttacttact gaattacaca ttgcgatatt 1141 gtgcggataa acgcctaatt ttaagtttct tttccctgac tttgttatta tgtaattatc 1201 aacatcgaaa tgttgcgaac gaaatatgct acatttagta ggaatccagg agcccatagt 1261 acaggttttc ctagtttggg ttcctgaatc ccaccacttg tcatttcaat tgtcttaaac 1321 ctggtcgtgt aacttgctac ataatttatt atactattaa ttattatata agcttagttt 1381 ttaagaccaa tgtctgtata aatatggtgg agataaaaat atacgcagtt tctaatctgt 1441 gtggtggaac ttgaaaaaaa aaaaataatt ctaccagttg tgtcaatcca caattttgtt 1501 acacctggat ctgttggaaa cctaaaacat aatattaaaa taaatcatat gtcacattct 1561 ataaaaaaat aaaaacataa aactcactca aaatctatat tactaattag tcgtaataaa 1621 taatacttat tatttaaatt ataaagtgtg ctattgatta taatagactg tatcatagta 1681 acaatcgaca ttgattacaa atttgcccaa cactatagat gtcatagatt ttgtggaaat 1741 gagcttttgt ttcggtactc gggactctcg caatggacgt catacattgc gcgattgctc 1801 acgtagtgca agattgtact catctatcgc aagcaatgta ttactattag tctgcttgcg 1861 attatatcgt ttttgtctat aaatttcata atactaatgt tataattaca aatatcacac 1921 gtatttttat gtttttggac ttaccggtga aaagaaattc cttctcgtgc tatgctacac 1981 acctggtttc ttttccgaca cgttataata gcgcaagacg gcataatata atatttttgt 2041 ggagatacgt acgctgtcaa aacatgacag cgattgctag ccggtaagta tgaacatttg 2101 agcgttatga cagcggtatg cacgttcgtt cgcaccctcg gcacgcgtgg cccgcccaca 2161 aggccaatcc agtctctttc ttttatccat ggtttttaca aatggaaatg tatttgtaaa 2221 acggc // LOCUS RATPFKFBP1 2917 bp ds-DNA ROD 25-MAY-1990 DEFINITION Rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase gene, 5' flank and exon 1. ACCESSION M26215 KEYWORDS 6-phosphofructo-2-kinase; alternative splicing; fructose-2,6-bisphosphatase. SEGMENT 1 of 2 SOURCE Rat liver DNA, clone lambda-20. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2917) AUTHORS Darville,M.I., Crepin,K.M., Hue,L. and Rousseau,G.G. TITLE 5' flanking sequence and structure of a gene encoding rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6543-6547 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.I.Darville, 08-AUG-1989. FEATURES from to/span description pept 1374 / 1401 M-type 6-phosphofructo-2-kinase (EC 2.7.1.105)/fructose-2,6-bisphosphatase (EC 3.1.3.46; PFK-2/FBPase-2), exon 1 pre-msg 1255 > 2917 PFK-2/FBPase-2 mRNA and introns (alt.) IVS 1402 > 2917 PFK-2/FBPase-2 intron A BASE COUNT 779 a 627 c 605 g 906 t ORIGIN 1 ggatccagac agaactccgg cttaagtgag cggcccctgg cggtggctgt gaggggcttc 61 tgcctggagt cctttgggga actttctcag ttgaaagatg gagcagttaa gaatttactt 121 agctagcctc cgttttaatg gagtctggag gaaggagact acctcaaatc ctccctgtag 181 agccatattt gtcttgactt tcaaagcgtg agtcagactt gcatttggct ccatttctca 241 gatgagcaaa atcgaggcct agaacattta agctccggag cagtatcttt tccctatcct 301 gtctacgctt tgtcctttca ttcttaggat aaaccaggaa gaggcactga agctaagtag 361 aagttgaata aattcgtctc agcctcacag aagttataga gccaagattt aagcccttct 421 taatcaggta aacgattgat tttctgtcca cacaactctg tgtcttttcc cgtgggcacc 481 actgcacttt acgatccatt ctagtgcaac atttaccaca gtgaacagct cccattgatt 541 tccattgtca ctagaccgct aacagcctga atgcagtggc catcttggac ccagcttata 601 tgataacgga ataaatattt agcagatgga tgaagcagca ggccagtagc gatgagttct 661 tgacttattt tttttaacag gaaatgccca atttccagga aaactataac tttgcatcgt 721 aaggtttttt cactgaaaat tgctacagag ttattcccac tcgaatacac acaccctcag 781 aatctcacag cctttaagcc atgatcttca gtgggttgct cctctaccct cagaggactt 841 aaaggtcact agctattgac attagaaaaa catctatgta attaaaagtc tgtgccttga 901 gcttttccat gcattgggag ttactgacta aagatcgagt cccaggagtg ggttaattag 961 gacaagggaa atgaacgttt ttcttcatgt ttccagattg ctttcaagca agggattttg 1021 ccaatgctgt actaaaggca gacagaaacc tttgtcccaa ggaaatccag cttttgctca 1081 cctcaatagc tggttttaat ttctccattc tccagtggct tgttaatatt agtactgacc 1141 tttggggcaa ggtgaacaca tggttggact gaagagaaaa cgctctgggt ggctcaggaa 1201 cttctttggc aagtacagca gctaatattt caacactgca cacacccctt aacaagggcg 1261 cttcctcagc cttcccagga accaacggat gtctccgagt tttctatttg atgccaccca 1321 ccggacctgc tttgggggtc tgtaaatgca agagagccat tggaaaatta gcgatggaag 1381 aaaaagcctc taagagaaca ggtaggtgag cctaaaggaa aaacagacag gtctttttac 1441 aggtcaaaaa tccttctttc tggcttcttg gaagattttt tctgcatagg attctgattt 1501 agctttatta aaggggatgg gagggttggg gaatgataat atgaatacct aattcaactg 1561 ccaatatttt atttttttaa ttattttatt aatattccaa atgttgcccc ctccctattg 1621 tcagtatttt aaaattcatt tgtacttgac tttttaggac cctctctaaa ggtcagaccc 1681 ctactatctt tgaaattcca atttttgttt tatttagttt tggaaaatga atttaactgt 1741 gcttcccatg ggagcctgaa actttctatg tagcctggag tggccatgat tgtgtaaccc 1801 tcccagccta gcttccacct atgtagtgta tgtaattttt taaattaaat gcacaataga 1861 gagtgcatct ctgacctctc ctctagtgta taaagttgag ttgatttgct gtgaaacttt 1921 gacacattct ttagtttctc tgggtgtcaa tggatacctt agtaaaacag aaattaaagc 1981 ccctagcctc cttcccacag aggactggcc tgatgatcca gaatctctaa taggcagaat 2041 ccagagttct tttgcttcag ggctttcatg ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt 2101 gtgtgtgtgt gtgtgtgtgt gtgtgtgaca ttcatgttct tggcttttat gtggcgtcca 2161 caggtagtac cacccacagc cctctctcta tgtgctctcc ttcctaagac tcgtcctaga 2221 taagaatgca gtgagctttg tggagcagac aggagtctga aaatagtggc ataatggggc 2281 cagagcagtc actgagaaca aagggattct aggctgctgt cccagaagtc ctgatatata 2341 ttcatacatg aatctagcct gtaatcatgt gattatccgt cagccttcta gcctataaca 2401 aatatattca caaccacaca tttcaagact taccaataag gagaccccat ttatgacttt 2461 attgcccatc ctatcttgat aaaatgagct tcctcaccac attgtacatg gaggtgggag 2521 caggctctat ggccgaacat catactgcat agtttcttcc ctagagatag ttctacttta 2581 tggttcagcc tttagccctg agtatgaagc tgagtaaacc ctgtgtctat aaccaataca 2641 ttaaaacgtg ttagaatttt atcattttat tctgtgccat cctaaagttt taaggttttt 2701 tttttttttt ggatgttgct ctttacctca gaaaaaccag ctctcccctg tccatttccc 2761 tgaaagctgt cataatatct accatggtga actgggattt cttgtcacca tgtttatctc 2821 ttctgcagca gagagatgta gattgtgaac atgatcactg aacaagtgtc ctccatatca 2881 gagcccaaga tgaacagtca gcaggtcaca gggatcc // LOCUS RATPFKFBP2 3720 bp ds-DNA ROD 25-MAY-1990 DEFINITION Rat 6-phosphofructo-2-kinase/fructose-2,6-bisphosphatase gene, exon 1'. ACCESSION M26216 KEYWORDS 6-phosphofructo-2-kinase; alternative splicing; fructose-2,6-bisphosphatase. SEGMENT 2 of 2 SOURCE Rat liver DNA, clone lambda-20. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3720) AUTHORS Darville,M.I., Crepin,K.M., Hue,L. and Rousseau,G.G. TITLE 5' flanking sequence and structure of a gene encoding rat 6- phosphofructo-2-kinase/fructose-2,6-bisphosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6543-6547 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.I.Darville, 08-AUG-1989. FEATURES from to/span description pept 2913 / 3009 L-type 6-phosphofructo-2-kinase (EC 2.7.1.105)/fructose-2,6-bisphosphatase (EC 3.1.3.46; PFK-2/FBPase-2), exon 1' pre-msg 2664 > 3720 PFK-2/FBPase-2 mRNA and introns (alt.) IVS 3010 > 3720 PFK-2/FBPase-2 intron A' BASE COUNT 1020 a 752 c 859 g 1089 t ORIGIN About o.1 kb after segment 1. 1 aagcttctct caaggctata atactgactt tcagagaaga tgaacactaa aactggtctt 61 aactaagaca gagttcagtg agaataaaag atacttcagg ctgaggaaga gcctaaaagc 121 aggaaaacgc tgtgttcgag ggtcctgtaa tcagccagtg tggttaaaca tgggggagtc 181 ctggggggta gaagactgat aggtggctta ggacttgatt tagaagaagt ttgaatgcta 241 gtctgagtca tttctctaag gcttgttttt tggtgtttct gagttcagaa ttgctaacgg 301 tagtgaacaa tttcatagtt agttctaggc agggttaaat tagggtgtca aacctggacc 361 ctgagtttac ccattcagac cttgtcagag gccttctagc tacctagctc agatttcttg 421 ctactaaaat tatcttcttg attcttggca gtgttgtaag gacaaaattg taaacataaa 481 agacatgttc atgttgttgg ctagattacc ttagacaagc catcccctcc tcaggcccat 541 ttccctgtgc tattactggg gaagtttcaa agaaacaaat aagataaaga aaatgttact 601 ttgtgcgcac tggaaaacat acaaagttac ggtaagatca ttaaaacaga cttccactaa 661 ttatgtgact tactaggatt aagagactca tgtcttgcaa agtctctttc tgagactcta 721 ggagttatct gctaatctgg aaaaaactca ttatattgta ttttttactt taaattgata 781 gataattgtt tatacttgtg tggtacagtg catgtatata atgtacaatg gttaatagct 841 ttatgtatca cctcaaatat ctatcatttc tcctttccac cagctatatt gaagtatcca 901 attatctgtt gtgaaccata gttatccaga tgtgctcttg aacatcagaa gttattcctc 961 ctgtttccca gaatccatcc tgtttgttct ctcttttctc cctctccttt gcccactagt 1021 ctatactcag tttctcacca atctctttta cctttgtcca gtaggtgaga gcatgtagta 1081 tttgatgttt tttgccagac ttaagttaac agaaacctag gaaggagttt acctttctaa 1141 ttgacactca agaagtcttt acagtataaa tgttaatccc gtgtcacaat aacgtgctag 1201 accttgcaga ttacttggag ggaaaaagat gaatcaatta ctgcccaaag gaaacaacag 1261 tttgaatcat atgtattcag cttccccagg tttcttcatg tgtatcaata tattgactct 1321 tcgctaacat tgagggaatg gaccctacta tccttatatc tagcaatggg cagaagtgag 1381 agtccaccca gacagtatgg ttcctaatcc cattgttatt gttcttaact gctctcatcc 1441 attgcctcag taggctcctt gagataggta ggccttcacg ttgaaagtct tatatgctca 1501 gcaactgcag tgaatctaac ccttctatag actaaggttc tgagaagata ttttaggaag 1561 tgtcttaaaa ggcaaagtaa agctgggagc atttgagggc agggactgga gatttgtttt 1621 cccgagagat tgcccttgca tactccagct cagtggaaga aagttgagcc tcctaatacc 1681 caatcactgg agctttgctg ggggtgggaa ggttacaaag aatgaagtaa aagggcactt 1741 ggacatcttc caggcaagcc atgcagttgc atgaattcta cagataaaag tctgaaaata 1801 actgggggaa tggcaataag agtggggaca agtgaaaggg agacatctgg atgagatatc 1861 tctgcctgtg gatttgtgct atttaaatag atgccattac ctcttattcc ccatagtctc 1921 ctcagtgtgg tttccacctc tcctaagatc ccatcccaac actttggctt gaagagtttt 1981 agcacgggtt tgaattagat tctgatgcct ttagtgctgg gtcatttcag cattggcagc 2041 catgtgagcg acccttactt atcggctctc tgcccagcaa tggctgccct cactctcatt 2101 cctaaggtgt cagatttact ctgatttatt taaccttggc tggagagagt ataaaggtga 2161 tcagatagtg ccaatctctt tgaagtgccc aggacagctc attttaagag ggagaggtcc 2221 ggtagagagg cagggaacaa actctaggga aagacagttt tacagtgcct tgttgaatag 2281 ggtgagtggt gaggtagtga aagtctactt ccagtttggg cctttaattg ctagctcaaa 2341 acattgcatt ttgattagga acattatgtt tgtggttttg ttgtgggggg gggggcttgt 2401 aatgaacctc tgtttctgag ttggagctta atgttctaat taggcctgct ttgaaatcaa 2461 tttcaaagct tagacctatc tccatcacct ttgtcatgca cacaacacag gatgtaagca 2521 catgatcttt ggtcttttat ttgcatactc tactagtcct gccttcattg gcgagaagta 2581 actggccaga cctgggctgt cttgctggct gccttccttg ctgggctcca acttatcttc 2641 tatgtacata gcccttggag ttcagaggcc tctgctgact tctgctcagc ctctggctct 2701 cagcacccac atccccactt gtacatctgg gagcagaaga aaaagaggga gctggagata 2761 atctgtgaga ggctccgaag cccagtacat ccccctgact cagccctgcc cacctgctgc 2821 tgtgactaca gcaacagaca agacagctag gaagatagga agtgaggtcc aataccttgt 2881 gggcagtggt gtcattagct gcaacgccta agatgtctcg agagatggga gaactcactc 2941 aaaccaggtt acagaagatc tggattccac acagcagcag tagcagcgtg ctgcaacggc 3001 gaaggggctg taagtgaggg ttgtcatttg tgggtggtgg tgatgggtgg gtctagaaga 3061 acaagtggtg ttccccatcc actcctaccc tcacaaagaa gttggaatag aagatcattt 3121 ggatgaaatt gctagagtgt aacaagaggc taatatgaga agggtaagga agtaagagaa 3181 aggcaggtag aaaagacaac gatgaggctt agataaaaag aacccaaact acagggaccc 3241 ctgacctaag ggctgtctcc ctttatttga gaaacggtcg gaatgcttct tgtctggaat 3301 gcgagcgttc tagtgtagtg ttacccgaga gtattttatt ttagcttctc ttatgaaagc 3361 tctggaaaca ggtcttgttt aatttatgcc agggtgaaag aaggtgatag ttcctggcgt 3421 ggtttctcgg aggctaggtg cttagtccac accagtaccg tgtcactgct cagcattgcc 3481 cttcctcttc caccatcctc ttaaaaataa ctgtacctgc ccagctgggg acaactgcgt 3541 acttggcctt gacattcaga gacagtcttg ctgttctgag gagtcgaggg gacagacagt 3601 atcacaaaga ctgatctggt gttcagttgg agcttaacat cctaattaga cctgatgtct 3661 ttattttaaa actcaggcct ttgaaatttt aaggaggagc agaggcaaag taagcaaagc // LOCUS SPFIFP1A 1801 bp ds-DNA INV 25-MAY-1990 DEFINITION S.frugiperda insertion element IFP1.6 DNA, clone lambda 879. ACCESSION M32776 KEYWORDS insertion element; insertion element IFP1.6. SOURCE S.frugiperda DNA, clone lambda 879. ORGANISM Spodoptera frugiperda Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Noctuoidea; Noctuidae. REFERENCE 1 (bases 1 to 1801) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description site 115 118 5' insertion target site site 1755 1758 3' insertion target site BASE COUNT 629 a 274 c 273 g 625 t ORIGIN 1 gagaatcaat gttcactgga aatatatccg ttttagtttg taacgaagaa aaagatgcga 61 tgaagtgttt tgtgacgtgg attcttagaa ggaagaattc aatgttctat tttattaacc 121 ttttgaccgc cagagcataa acgcttcgcc gtgccctcca cgccgagaca caaattcaac 181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat 241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta 301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc 361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat 421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta 481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg 541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat 601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa 661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa 721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc 781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc 841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa 901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa 961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta 1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc 1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg 1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat 1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa 1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt 1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca 1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat 1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt 1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa 1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga 1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa 1681 aggtaatgtg tttattttgt ttttctctgt gatctgtttc ttggcatcta tagatgtcgg 1741 tggcgcacag ggcattaata tctatcacct taccttttat tattacttta atgtataatg 1801 g // LOCUS SPFIFP1B 1820 bp ds-DNA INV 25-MAY-1990 DEFINITION S.frugiperda insertion element IFP1.6 DNA, clone lambda 883. ACCESSION M32777 KEYWORDS insertion element; insertion element IFP1.6. SOURCE S.frugiperda DNA, clone lambda 883. ORGANISM Spodoptera frugiperda Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Noctuoidea; Noctuidae. REFERENCE 1 (bases 1 to 1820) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description site 115 118 5' insertion target site site 1774 1777 3' insertion target site BASE COUNT 636 a 281 c 270 g 633 t ORIGIN 1 agagttaagt ggctttgatg atttgtgcat atagtgccat tgtgcataca aaaccaaaat 61 gttttatatt caaacttaat tttatcgttg tcatctaaaa cgaaagccat ttctttaacc 121 ttttgaccgc cagagcataa acgcttcgct gtgccctcca cgccaagcca caaattcaac 181 gaaataacct tgaaaatatt agggattcca aaatgttatc gataaaccaa agtgagtaat 241 aattttcgtc tgattttttt ttgaataaac tgcttataac ataaaaagaa tatttattta 301 acctaatctc ccctgtaata aacttaagac cacaataata gcaactatca ataaaatatc 361 agccttttga gacacaaaaa tacatataaa attgcagcaa gttacagcac tattatctat 421 cagttacgcg ccattggcat gtatacatgc caatggcgcg gcaactatca atttttttta 481 aatctctcgt ttatgattag ggatgtagct gaataatgta acttatcgag ttcatttatg 541 tattaggtaa tcaaatttga aaactgctaa ataggtacag attttagtga tattatttat 601 acactttcat attcttaaaa tagttgcctt gtatttgtaa aatgtcacat gaaacttaaa 661 aatataaaaa atcaacaatc gtatttattt attatgctca ttaacagcct tgtaattgaa 721 ccgtcacgtg aaacttaaaa atatataaaa aatcaacaat tgtgtataat tattatgctc 781 aatacagcct tgtatttgta aaatgtcaca tgaaacttaa aaatataaaa aatcacaatc 841 gtatttattt attatgctca ttaacagcct tgtaattgaa aaaggttacg tgaaacttaa 901 aaatatataa aaatcaacaa ttgtacctat aaaacattat gctcactaac agtcttggaa 961 ttgtaaaatt taaaactaaa ttgtatatta tctataatta ttatcaataa caaaaattta 1021 ataaataaat agcaattata atgtaggtac tcgataaaat attttttgca acttccccgc 1081 atttaaaaac acggcacgcg cttacgacgt cacgagtccg acatctatac atgtcaatgg 1141 cgcagaaggg attgcatatc ggtagtatgc taggatgcgc atgattggcg aaaagggaat 1201 gtcgataata ttgaaaatat agtaaatatg ctattatacc caagaatttt tcagtttaaa 1261 atatagtata tttacctttt ttctcagaga ctaaggttat atatatactt tttaatagtt 1321 attacaaaaa ataagcgagt catttagctg aagttttaat attacttgaa ttacactaca 1381 tcactagtag gcatgtatac atgccattgg cgcgagagat acagttcttt tttagccgat 1441 atttaagttg tttttatcaa tcaaatatta gttattctat attgcaattt tgttcactgt 1501 gtacaattat gtaataactg ggtattcaga tcttaagata catcaataaa acacgtgaaa 1561 gtaatgattt aatgtgttta ttttgttttt ctctgtgatc tgtttcttgg catctataga 1621 tgtcggtggc gcacagggca cgcttgcgca tgacatctat acatgccatt ggcgttcaaa 1681 aggacacgtg aaagtaatga tttaatgtgt ttattttgtt tttctctgtg atctgtttct 1741 tggcatctat agatgtcggt ggcgcacagg gcattaatcc attttctatc ataagttatt 1801 tgatacttag atctgtaagt // LOCUS SPFIFP1C 1741 bp ds-DNA INV 25-MAY-1990 DEFINITION S.frugiperda insertion element IFP1.6 DNA, clone lambda 889. ACCESSION M32778 KEYWORDS insertion element; insertion element IFP1.6. SOURCE S.frugiperda DNA, clone lambda 889. ORGANISM Spodoptera frugiperda Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Noctuoidea; Noctuidae. REFERENCE 1 (bases 1 to 1741) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry FEATURES from to/span description site 60 63 5' insertion target site site 1696 1699 3' insertion target site BASE COUNT 609 a 275 c 263 g 594 t ORIGIN 1 ggtaggcagt gacaatggaa cgccaattgc tacgaatctt acatacttca ttcgcttcat 61 taaccttttg accgccagag cataaacgct tcgctgtgcc ctccacgcca agccacaaat 121 tcaacgaaat aaccttgaaa atattaggga ttccaaaatg ttatcgataa accaaagtga 181 gtaataattt tcgtctgatt tttttttgaa taaactgctt ataacataaa aagaatattt 241 atttaaccta atctcccctg taataaactt aagaccacaa taatagcaac tatcaataaa 301 atatcagcct tttgagacac aaaaatacat ataaaattgc agcaagttac agcactatta 361 tctatcagtt acgcgccatt ggcatgtata catgccaatg gcgcggcaac tatcaatttt 421 ttttaaatct ctcgtttatg attagggatg tagctgaata atgtaactta tcgagttcat 481 ttatgtatta ggtaatcaaa tttgaaaact gctaaatagg tacagatttt agtgatatta 541 tttatacact ttcatattct taaaatagtt gccttgtatt tgtaaaatgt cacatgaaac 601 ttaaaaatat aaaaaatcaa caatcgtatt tatttattat gctcattaac agccttgtaa 661 ttgaaccgtc acgtgaaact taaaaatata taaaaaatca acaattgtgt ataattatta 721 tgctcaatac agccttgtat ttgtaaaatg tcacatgaaa cttaaaaata taaaaaatca 781 caatcgtatt tatttattat gctcattaac agccttgtaa ttgaaaaagg ttacgtgaaa 841 cttaaaaata tataaaaatc aacaattgta cctataaaac attatgctca ctaacagtct 901 tggaattgta aaatttaaaa ctaaattgta tattatctat aattattatc aataacaaaa 961 atttaataaa taaatagcaa ttataatgta ggtactcgat aaaatatttt ttgcaacttc 1021 cccgcattta aaaacacggc acgcgcttac gacgtcacga gtccgacatc tatacatgtc 1081 aatggcgcag aagggattgc atatcggtag tatgctagga tgcgcatgat tggcgaaaag 1141 ggaatgtcga taatattgaa aatatagtaa atatgctatt atacccaaga atttttcagt 1201 ttaaaatata gtatatttac cttttttctc agagactaag gttatatata tactttttaa 1261 tagttattac aaaaaataag cgagtcattt agctgaagtt ttaatattac ttgaattaca 1321 ctacatcact agtaggcatg tatacatgcc attggcgcga gagatacagt tcttttttag 1381 ccgatattta agttgttttt atcaatcaaa tattagttat tctatattgc aattttgttc 1441 actgtgtaca attatgtaat aactgggtat tcagatctta agatacatca ataaaacacg 1501 tgaaagtaat gatttaatgt gtttattttg tttttctctg tgatctgttt cttggcatct 1561 atagatgtcg gtggcgcaca gggcacgctt gcgcatgaca tctatacatg ccattggcgt 1621 tcaaaaggtg tttattttgt ttttctctgt gatctgtttc ttggcatcta tagatgtcgg 1681 tggcgcacag ggcattaaca taacttggta taaattgttg cataattgca ttgagagcag 1741 a // LOCUS SPFIFP6A 1792 bp ds-DNA INV 25-MAY-1990 DEFINITION S.frugiperda insertion element IFP1.6 DNA. ACCESSION M32780 KEYWORDS insertion element; insertion element IFP2.2. SOURCE S.frugiperda DNA, clone lambda 858. ORGANISM Spodoptera frugiperda Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Noctuoidea; Noctuidae. REFERENCE 1 (bases 1 to 1792) AUTHORS Beames,B. and Summers,M.D. TITLE Sequence comparison of cellular and viral copies of host cell DNA insertions found in Autographa californica nuclear polyhedrosis virus JOURNAL Virology 174, 354-363 (1990) STANDARD simple staff_entry BASE COUNT 586 a 326 c 287 g 593 t ORIGIN 1 atgatgtttg tacataatgt tggtacgttt ctcccaccct ttaatttcac tacaagcaag 61 agaccaatgt ctgagcggat tacccccaca aataaataac acacatttaa atctcgaatt 121 tgctattggt tgtcacacaa acgcaatgac tgcatttgac gtacactcta acacgcataa 181 aaactcatgt atgctagggc gagacaaagg tcgattgttg atatcgataa aatttaagta 241 atcttgtttt tgttttacaa gtttgttgaa tattgtgaaa ttgtattcat gcgattcaaa 301 ttgtatcacc aacatgacac gtacaccagc accgtccaac tcaaccatgt tgcagggaat 361 cgaacgatcc aatgcccgtg ccaccgccag tgagaccttt gagtgagcat gacaactcta 421 agctgacaca aatgctgaaa tgcattctac tagtctaatt tagctaatta cagggctctc 481 acatttacaa taatctgtat aaagtacaga acacattgaa ataacatggt tatattattt 541 cattttttaa atttacgaaa gatgatattt taaatatcgc catttttaga caaaaaaact 601 gatttaatga gactaattta aaattttcta aaataaaatc agtttactta ctgaattaca 661 cattgcgata ttgtgcggat aaacgcctaa ttttaagttt cttttccctg actttgttat 721 tatgtaatta tcaacatcga aatgttgcga acgaaatatg ctacatttag taggaatcca 781 ggagcccata gtacaggttt tcctagtttg ggttcctgaa tcccaccact tgtcatttca 841 attgtcttaa acctggtcgt gtaacttgct acataattta ttatactatt aattattata 901 taagcttagt ttttaagacc aatgtctgta taaatatggt ggagataaaa atatacgcag 961 tttctaatct gtgtggtgga acttgaaaaa aaaaaaatta attctaccag ttgtgtcaat 1021 ccacaatttt gttacacctg gatctgttgg aaacctaaaa cataatatta aaataaatca 1081 tatgtcacat tctataaaaa aataaaaaca taaaactcac tcaaaatcta tattactaat 1141 tagtcgtaat aaataatact tattatttaa attataaagt gtgctattga ttataataga 1201 ctgtatcata gtaacaatcg acattgatta caaatttgcc caacactata gatgtcatag 1261 attttgtgga aatgagcttt tgtttcggta ctcgggactc tcgcaatgga cgtcatacat 1321 tgcgcgattg ctcacgtagt gcaagattgt actcatctat cgcaagcaat gtattactat 1381 tagtctgctt gcgattatat cgtttttgtc tataaatttc ataatactaa tgttataatt 1441 acaaatatca cacgtatttt tatgtttttg gacttaccgg tgaaaagaaa ttccttctcg 1501 tgctatgcta cacacctggt ttcttttccg acacgttata atagcgcaag acggcataat 1561 ataatatttt tgcggagata cgtacgctgt caaaacatga cagcgattgc tagccggtaa 1621 gtatgaacat ttgagcgtta tgacagcggt atgcacgttc gttcgcaccc tcggcacgcg 1681 tggcccgccc acaaggccaa tccagtctct ttcttttatc catggttttt acaaatggaa 1741 atgtatttgt aaaacggcgt tggtacatct gtacctaaaa atgatatagt at // LOCUS TACLSEG 7102 bp ss-RNA VRL 25-MAY-1990 DEFINITION Tacaribe virus L protein and P-11 protein genes, complete cds. ACCESSION J04340 M33513 KEYWORDS L protein; P-11 protein. SOURCE Tacaribe virus (strain T.RVL.II 573), cDNA to viral RNA. ORGANISM Tacaribe virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Arenaviridae. REFERENCE 1 (bases 1 to 449) AUTHORS Iapalucci,S., Lopez,N., Rey,O., Zakin,M.M., Cohen,G.N. and Franze-Fernandez,M.T. TITLE The 5' region of Tacaribe virus L RNA encodes a protein with a potential metal binding domain JOURNAL Virology 173, 357-361 (1989) STANDARD simple staff_entry REFERENCE 2 (bases 420 to 7102) AUTHORS Iapalucci,S., Lopez,R., Rey,O., Lopez,N., Franze-Fernandez,M.T., Cohen,G.N., Lucero,M., Ochoa,A. and Zakin,M.M. TITLE Tacaribe virus L gene encodes a protein of 2210 amino acid residues JOURNAL Virology 170, 40-47 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by S.Iapalucci, 08-AUG-1989. FEATURES from to/span description pept 70 357 P-11 protein pept 7072 440 (c) L protein BASE COUNT 2071 a 1653 c 1240 g 2138 t ORIGIN L segment. 1 cgcaccgggg atcctaggcg ttacgtgcac tcttctttat tgggctggat tacacaaaac 61 ttttcaagca tgggcaactg caatcgaact cagaaaccca gttcgtcttc taacaacctc 121 gagaagccac cacaagctgc agaatttaga agaacggcag agcccagtct atatgggagg 181 tacaattgca aatgttgttg gtttgctgac aaaaacttaa tcacctgtag tgaccattat 241 ctttgtctaa gatgtcacca aatcatgctg agaaactcag agctgtgcaa catctgctgg 301 aagcccctgc caacatccat aagagtcccg ctagaggcaa gtgctccgga tctgtaacca 361 acgacaaaca aggtcgctga ccccccgggg ggcccccaag ggggcccccg tgggggtggc 421 ccccggggga gtttgggttc tactcaatgt cctccactgt gtttctccta gctagaggtc 481 tgcacataat gcctctcaac ctcaaggacc cctggttgtc agatacaaca aggtctttca 541 gggacttact gtatgcaacc ttgaagttga caaattcaac ccagtccccg atcaactcca 601 gggacctgat caagccagat ctatgtaaat catagacaag gacaaagtcc tcttgttgga 661 atagcatcac ctgtgcagtt gattttaaat agggccagag ttggttcaac aaagcattca 721 catcttcagg gtcgtccagt gcgccaattg ccttcactat tatatcacca gtcaactcca 781 ctctgtaagt agaaactttc tcagttccca taaaaaagtg gccgtctctt aacaccaatg 841 gctccccttg atcgactggc ccattgagtg tataggcata cttgtcactc aattcagata 901 aaaatccacc aaagctgaat cttacatccg tcagaattag gttgtcattt gtctccacaa 961 gatctgcatc catcatcagg gaagccatat taaaagtgcc ttgacctgtg aatatcggaa 1021 gtcctgcaat catggaaact ccatccaaca agaggtgact tccccccgac cctgacactg 1081 actgcgtcac atccaccacc tgtaactcta cgcttgcttc tttaagatca atgactttag 1141 aaacacatgt aacactaaat gagtaagtca ttttcctctt aaacctgtac ttagttgatg 1201 ttctagaatg ctccacctgg atcaaaaacc tcaagtcaca atcctcatgg tcatcaaagt 1261 tcccaagaag atccagcaag tcattgatgt gtctccctgg agccagtggt tgctctccaa 1321 ttgtgatttt gtcaatgtac aatctaacat cttcaagatt tgaaatcaca taacccattg 1381 ctatgtcttc aataaacatc gctctactga tcttctgatc cctaactaag aaggtgaaat 1441 ctgttaactc acccaattca atctgaccat catcttccag ctccaatact tcattaaacc 1501 aaaaaaacga ttttaggcaa gaagttgaca tcactaaagg gttcaccatc gatgatagga 1561 tgatttgaac catgaagttg taacaaacct gttgaggact cagtccaact tccttatcta 1621 cgtaatcaga ctgcctaacg tgtgccatgg cacagtcgct ctgtttgact atgtagacac 1681 tattgtctct ggctttcagt aggtggctga cgattcctag attctcattc atcaaatcca 1741 atgctatgca aacatctaag aacttcagct tcctcccatc cagtgttctg ttcacttgca 1801 aatcgctcat gaagggagac aagtgctcat caaataattt cgggtaattt ttccgaatgg 1861 aatacaatac atgattcata cccagttgat cttctaattg acgaacaatt ctaggcttaa 1921 cagcccagaa caaattcctg ttacttaaca ccactggaat ctgtggctct ttcacagatg 1981 agaataccca ctcccctatt tcgcaggcat tggtcaaaac taaagaaaag taatcccaaa 2041 gaataggttt ggaaaatttt tcaatttttt ccagtgaggg tctgcaataa atcacacctc 2101 ttttcatgtt gcacacctca cacagacagg gcttcacctt tgtataaata tctttaatgt 2161 aaaggaaacc gccctcacca tctctgacac atttgctgcc tatactttta caaaaaccta 2221 taaaaccaga tgccactgaa gactggaatg cagatttgtt tatggactcc gctaagatct 2281 ttttaacacc ctttgtgaag ttccttgaca acctggactg caatgtctta atcagtgttg 2341 gcacctcttc tctctcaaga acccttctgc tcgtcattaa cttggtcctc aatactagtc 2401 ttaggtcacc aaaacttgac aaattcaccc atctaaagcc taaggcacgc ataatatctt 2461 gctctgagca acttgaataa cgcaagaacc ccacaagggc gtcatcacca tctctaccaa 2521 taagactgat caggttttcc tcaaaaactt tgcctctttt tatatcatta aaaacccttt 2581 tagcacagct tctaatgaag cctgtcatct catcatcaga aaatattgat tcaatcttcc 2641 tctgcaatcg ataacccctc gtcccatcca cccaatcttt tacatcttgt ttctcaacag 2701 caccaaaagg ggtttccttg aagccggagt atttgatcag ctggttcacc ctttgagaga 2761 tctttgagac aatttggaca ctaacaccat ttgcgacaca ctggtcacag attgtgtcta 2821 ttgtttctga caactgtgtt ggtgttttgc atttcacatt gtgcaacgct gcagcaacaa 2881 attgagttaa tagcggggtt tcttccccca tcacaaaaaa tcttgatttg aattcagcta 2941 caaaggtgcc cgctacactc ttggggctaa caaatttgtt caattttgag gagaggtact 3001 catgaaaaca caacatttcc aaccactcat cattaacttg attatcatct gaagcacaag 3061 gcaatttgat aagtgtgacc tggtcatcgc tagaagtgta ggaagttgga atgacatcat 3121 atagtaaatc tagcacataa tttaagaact gttctgttat caatccatac aaatcagaca 3181 tgttatgcag gatgccttgg cccatatcta gaacagacat tatgtgactt ggtatctctc 3241 cttggactgg catgaattgg tggaagaatt cctctcccac gcttgagcta ccacaaccca 3301 tcaaccctaa gcttctcttt agattgccag tacagtaagc ctcagcaaca tttactggaa 3361 cttctataac tttatggagg tgccatctca gcacatttaa gactggttca agattaataa 3421 tgttcctttc ttttggggtc ctaagttcca aaaattggag gaatgtcaag aaaagcgcag 3481 gactcatggt tggtccccat ttagagtgat caagagaaca tgataggtca ccattgttca 3541 ctgctaactt catgtcacag atggctctat caaattcttt ttctgaactg agacatgtat 3601 atctcatgga actgccaact gcctctgcaa agtcttcaac taaccttgtt atcagcttag 3661 tgttaagatc cccaacgtaa agttccctat tggaacccac ttgctccttg taacttaagc 3721 caaactttaa gttaccatta tttgaactga tactggtaaa ctcagtagga gaatcatcag 3781 agtaaaagca caggttcctt agtgcagcat tggtaaaata actcctatcc aacttatctg 3841 ctatagcttc agaattgctc tctctcatgc taagtcttac ttcatctttc actctcagtg 3901 cttcattagt gaaaccaagc cttgatctac tcctatgttc ataacttcct agtttttgat 3961 cataatgacc ttgtatcaat aagtatttga agcactcaaa atactcttca tcaaaatatg 4021 cagaagtagt gagatttttc aaaagcaatc ctagaggaca aacttctaaa ggtctctcca 4081 agaaatactc tggtttgctt gaggagtaat agaccacatc acatagttct ttatacaagt 4141 ctgaagcgag tacatcagga tcaaagtctt tcacctcatg aagtgagacc tctgtcctaa 4201 tctctctcat aatcccaaaa ggaccccaaa gttcctcaag tgtacctgtg ctgcttttaa 4261 ccctgccttc cttttgattc ggacttagat tgacatctcc agatttggag tccttcattt 4321 tgaccattgc taattgcacg gaatttttta tatcattgaa agcttctgct tgctcttctg 4381 tcaacccttc aaacatcatt gacagctctt cacaatctct ctgggtttta gaaccaatac 4441 taacaagccc ggttaagttt tttagtatca agtaatctaa tgtacttgga tcaatgttgt 4501 atcttccctt tgttttgaac ctttccgcca tggaagaaac acatgaagaa atcatcttct 4561 ctctatcata taccgaaaga acattcccaa gttcatctag ttttggtact actacgctct 4621 tattgctaga gaggtctaaa gcagtgctag taatgtttgg cttgaatggg tcgttcttaa 4681 gctcacctgt gacttttagc tttcctttat taaataatga gatgcaataa ctcaacagtt 4741 cattgctgac ccctggtaat ttggtgtcag aacagtcttt gagggtcttt gaataaaatt 4801 tatccagccc gtccaacatc attccttctt gagcttctgt cagttgacca tttaaagatg 4861 ggttgactat aacacagtta aaatcaattt tgggttcaat gaatttctca aaacatttga 4921 tttggtctgt taatctgtca ggtgtttcct ttgttattaa atgacaaagg taacttacat 4981 ttaagagata cttgaacctc ctagcaaaat aggggtccga atcaacacct tcaccaagaa 5041 ggcattggaa aaggtccact gccagcctct gaactatgac ttcagagcca gacatacact 5101 ccacttttaa tttggacagt agatccacgt ggtggaattg atttgagtat gccatcaaaa 5161 aataccttag cccttgcaga aatacctgat ttcttttgga cggggcgcat aaaatgctta 5221 acaataaaag tcttagtttc tctttggata tcaccataag atcactgttg aagtccagcc 5281 aagttgtcat ctcatcacac atctcaacca acacctcttg tgaaaagatt ggcagaaaaa 5341 atctcttcgg gtcacagtaa aaagagccct tctcgacaag aacaccatct tcacatgtgt 5401 aaattgagta gcatcctgaa cgctcacctg tcttttgata aaaaagaata acaatcagtc 5461 catcagttaa aaagaatctc tgttggtagc actcccttag tctcacattg ccaaaatacc 5521 ttgagtattc tttttcatta ataagcaacc ttgaagaaaa cgaggttttc atagagttta 5581 ttaaggaaag agacactgag gagagtattt tgaaagcctc aacctgttca gcatctgaca 5641 tcatgtgaaa tgtcttatgt ttacattcag tgttaaagtt ccccccagca gagtatctca 5701 taatgggtct atcagaagat attgtctgag ctatagatat tagcttggat aaatcaggat 5761 atcttgccca caccgatatt tgtgaccaga tatcttgttg agacctcctc aagattttag 5821 agcaattctc cacccactta acattcgtct ctattttctt tttaattaat ttcaccgcct 5881 ccaaaaaatc caaggccctg tcattgacta gtgtgaccat ttgacccagg atgagaccag 5941 accttctcaa agagtccagt gcttcgggtt tggctgtaaa taggtcatct aaaaagttta 6001 ccattatgag atctagaaag agcaacgtat ttcttcgggt gttaaaaact tttaatccct 6061 taattttatt gcaaatcgac aacacgccta aataggcctt ctcatggggg cagacaacat 6121 actgatctaa ttcagagttt gatttctccc tgtcctgcct ttttagtgct gtgaaagtca 6181 aactatagat tatgtccatg aatttgttga cacaaaaggg acaatctgag gtggtgttat 6241 cctcatgttt ccctaaccta attaaattgt tcagaagctc tgatctatta gttatcctaa 6301 acttccccaa gtttttgtca aaaacttcct ttttgtacca gagtcttaat ttcattaatt 6361 cttccctaat cacagattct cgaacatcgc ctttgactcc aacaaataat gatttggcca 6421 gtctatcatc atactccaaa agatttcttt tgagacactc tatattctca aatcctaaac 6481 tttgattacg aacgtttgaa agactgtgta tcagacgttg atactcttga tcctcaatat 6541 caatctctgc ggacgatcgg gatttccgca aaatttcaat taacaaccat ctgatcctct 6601 ccaccaccca ttcaggagtg aacctattag aataattagt tctaccgtca acaacaggaa 6661 caagtgtgat gcctgacctc agaaggtctt gtttgagatt tagtagtttt gtcatgtcac 6721 tcttccattt tctctcaaat tcttctggat tgaccctcac aaatgtttcc aatagtatta 6781 atgtttttcc agtgagtttg aatccatcag gtatcacttt agggagagtg ggacatggga 6841 ttccatgatc actgaggatc acatcaacat atttttggca ggtgttcaag tggcatgcat 6901 gagattcaca ggactctaac tccacaaggt tggagagaag tttaaaacct tcttgaagca 6961 gtgaccctga atgacagtgt gacaggaagg catctttctg gtgggcaaac tcatgcctat 7021 ttggaatgtg ttttctaacc aagtctttga gttcagacac agtttcatcc atggtcaagt 7081 gccgcctagg atccactgtg cg // LOCUS DRHBCDA 922 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.heteroneura bicoid (bcd) mRNA, 3' end. ACCESSION M32125 KEYWORDS bicoid protein. SOURCE D.heteroneura, cDNA to mRNA. ORGANISM Drosophila heteroneura Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 922) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 36 bicoid protein (bcd) (AA at 1) BASE COUNT 309 a 157 c 140 g 316 t ORIGIN 1 ggtggcaaca cgcagttcgc atactgtttc aattaggctg ctacattaat aataattgta 61 aagaaaaaaa aaaaaaaaaa acaatttcaa tgtttgtgga gtttatagct ttaaccgaca 121 cgttgtttcc tgattgtaca aagatctaat aattgtagat atccatatgc atagtattta 181 agccaattta acattgttta aaccaagtat gtaacgcata gttgtagctg ccgagttttt 241 taatgcattt tctactgaca taaactgaag tgccattatt tggaaacttt atttgtaaag 301 cgttctttga tctcaacgct gtctggctgg acatttgcca aagcccatga atgcccaacc 361 agacactgtt gagacgaata tgggctttaa ttgaacgctt tacagaagaa gtttataaat 421 acacataaaa tgcatttctt tcttataact caggccaact ggcatagcaa gtaaaactgt 481 actgatctaa atgaaaaatg tttctcttcc tgataatcgc acgtccttct atggacatta 541 tatgccatta tcagtcgaag atcgaacgtt taaccaataa tgttcttcag aatattttta 601 agctttgtta tgccacagtt tgcatgttgc aattaaacat tcttcacgtg agaattgttt 661 acaaattgtc tctaaacaca tgtgaaccac aggcctcact tatctaaacg aaatctgtac 721 ttactttcag aatgtaagat agttttatca attgaaaatg cattccaaat atgtatctca 781 gttacccttg aatggtgcac catttaaata ttattctaaa cgtaacttat ctatatatct 841 gtatgaaata gtatatttgg ggatatattt aaaatattca gagtcctgtt ttgtatacta 901 ttttaataaa tcttacaaac ta // LOCUS DRQBCDA 987 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.sechellia bicoid (bcd) mRNA, 3' end. ACCESSION M32124 KEYWORDS bicoid protein. SOURCE D.sechellia, cDNA to mRNA. ORGANISM Drosophila sechellia Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 987) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 90 bicoid protein (bcd) (AA at 1) BASE COUNT 301 a 204 c 169 g 313 t ORIGIN 1 tatcaatgca cgatggatac gataatgcaa gcgtataatc cccatcggaa cgccgcgggc 61 aactcgcagt ttgcctactg cttcaattag cctggacgag aggcgtgtta gagagtttca 121 ttagctttag gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat 181 ctacgcgtag aaagttaggt ctagtcctaa gatccgtgta aatggttccc agggaagttt 241 tatgtactag cctagtcagc aggcgacacg gattccagtg catatcttag tgatactcca 301 gttaactcca tactttccct gcaatacgct attcgcctca gatgtatctg ggtggttgct 361 ccattaaagc ccaggaatat ccagccagtt acatttgagg ccatttgggc ttaagcgtat 421 tccatggaaa gttttcgccc cacatttcgg aaattaaatt ccgagccatc aagaaaatct 481 tctctgttac aatttgacat agctaaaaac tgtactaatc aaaatgaaaa atgtttctct 541 tgggcgtaat ctcatacaat gattaccctt aaagatcgaa catttaaaca ataatatttg 601 atatgatatt ttcaatttct atgctatgcc aaagtgtctg acataatcaa acatttgcac 661 attctttgac caagaatagt cagcaaattg tattttcaat caatgcagac catatgttcc 721 agtttcggag attttttgct gccaaacgga atacttataa aaacccaaat tctatttaca 781 tcactaagaa gagcattgca atctgtttag cctcaagttt aattttaaaa tggtgcacct 841 ttgacgctgt ctctttaagc tttgtatttt taattacgaa aatatataag aactactcca 901 ctcgggtaaa ttgtgactaa ctacacaact taacaaattt cggaattata cttagatact 961 tatcccatat ttccgtccct ttctaga // LOCUS DRSBCDA 978 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.simulans bicoid (bcd) mRNA, 3' end. ACCESSION M32123 KEYWORDS bicoid protein. SOURCE D.simulans, cDNA to mRNA. ORGANISM Drosophila simulans Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 978) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 90 bicoid protein (bcd) (AA at 1) BASE COUNT 296 a 203 c 171 g 308 t ORIGIN 1 tatcaatgca cgatggatac gataatgcaa gcgtataatc cccatcggaa ccgcgcgggc 61 aactcgcagt ttgcctactg cttcaattag cctggacgag aggcgtgtta gagagtttca 121 ttagctttag gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat 181 ctacgcgtag aaagttaggt ctagtcctaa gatctgtgta aatggttccc agggaagttt 241 tatgtactag cctagtcagc aggcgacacg gattccagtg catatcttag tgatactcca 301 gttaactcca tactttccct gcaatacgct attcgcctca gatgtatttg ggtggttgct 361 ccactaaagc ccaggaatat ccagccagtt acatttgagg ccatttgggc ttaagcgtat 421 tccatggaaa gttttctccc cacatttcgg aaattaaatt ccgagccagc aagaaaatct 481 tctctgttac aatttgacat ggctaaaaac tgtactaatc aaaatgaaaa atgtttctct 541 tgggcgtaat ctcatacaat gattaccctt aaagatcgaa catttaaaca ataatatttg 601 atatgatatt ttcaatttct atgctatgcc aaagtgtctg acataatcaa acatttgcac 661 attctttgac caagaatagt cagcaaattg tattttcaat caatgcagac catatgttcc 721 agtttcggag attttttgct gccaaacgga atacttataa aaacccacat tctatttaca 781 tcactaagaa gagcattgca atctgtttag cctcaagttt aattttaaaa tggtgcacct 841 ttgacgttgt ctctttaagc tttgtatttt taattacgaa aatatataag aactactcca 901 ctcgggtaaa ttgtgactaa ctacacaact taacaaattc cgaaggatac ttagcccata 961 tttccgtccc tttctaga // LOCUS DRTBCDA 963 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.teissieri bicoid (bcd) mRNA, 3' end. ACCESSION M32121 KEYWORDS bicoid protein. SOURCE D.teissieri, cDNA to mRNA. ORGANISM Drosophila teissieri Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 963) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 90 bicoid protein (bcd) (AA at 1) BASE COUNT 281 a 195 c 168 g 319 t ORIGIN 1 tatcaatgta cgatggatac gataatgcaa gcgtataatc cccatcgtaa cgccgcgggc 61 aactcgcagt ttgcctactg cttcaattag cctggacgaa cggcgtatta gagagtttta 121 ttagccttat gttaaccact gttgttcctg attgtacaaa taccaagtga ttgtagatat 181 ctacgcgtag aaagttaggt ctagtcctaa tggttcccag ggaagtttta tgtagtagac 241 tagtcagcag gctacacgga ttccagtgca cgtcataggg ataccctagt ttactcgata 301 ctttccctgt aatacgcttt tcgcctcaat tgtatctggg tggctgctcc ataaagcccg 361 ggaatatcca accagttaca tttgaagcca tttgggcttt agcgtattcc acagaaagtt 421 atcgtcccac aacttatatt ccgagccagc aagaaaatct tctctgtaac tatttgacat 481 agctaaaaac tgtactaatc aaaatgaaaa atgtttctct tgggcgtaat cttatacaat 541 gattaccctc aaagatcgaa catttaagca ataatatttg atatgatatt ttcaatttct 601 atgctatgcc aaagcgtctg acataatcaa acatttgcac attcttcgac caagaatagt 661 cagcaaattg tattttcaat caatgcagat catgtgttcc agtttcggag attttttgct 721 gtcaacggaa taactatcct acccacactg tctatttcta tcacttagaa gagcattgaa 781 atctgttttg tttagcctca actttaattt taaaatggtg cacctttgat gttgtctttt 841 taagatttgt atttttaatt atgaaaatat ataagaacta ctccactcgg gtaatttgtg 901 actaacttag tattttaaca aatttcaagg gacgcgtagg cattctttcc ctctctttct 961 aga // LOCUS DRVBCDA 1023 bp ss-mRNA INV 25-MAY-1990 DEFINITION D.virilis bicoid (bcd) mRNA, 3' end. ACCESSION M32122 KEYWORDS bicoid protein. SOURCE D.virilis, cDNA to mRNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1023) AUTHORS Macdonald,P.M. TITLE Bicoid mRNA localization signal: Phylogenetic conservation of functional and RNA secondary structure JOURNAL Unpublished (1990) Dept. Biological Sci., Stanford U., Stanford,CA STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by P.M.Macdonald, 14-FEB-1990. FEATURES from to/span description pept < 1 90 bicoid protein (bcd) (AA at 1) BASE COUNT 326 a 176 c 160 g 361 t ORIGIN 1 tatcaatgta cgatggatac gataatgcaa gcgtataatc cgcatcgtaa tgccggtggc 61 aacacgcagt tcgcctactg ttttaattag atcgggacac caggcgcccc atataatata 121 ataattattt taagtacaca aattttaatg tttgtggtgt ttctttagct ttaagtgaca 181 cgttagttcc tgattgtaca aagatttagt gattgtagat atctatgcgt agtattttaa 241 gcaaattatc aatattttaa atgaattttg taaagcttaa ttaagtcata gctgtaattt 301 tccaagcatt ttcaatgtat ttccataaga aacttccttg gtaaaacctt cattggtctc 361 aactctgtct ggctgggcat ttgccaaagc ccatgagtgc ccatccagac atacttgaga 421 caataatggg cttaagctga aggttttacc gaagaagttt tcgtatgaaa atctcatgta 481 agattcattt ctataatttt ggcaccttgg catagcttac aaaaaaactg cactgatcta 541 aatggaaaat gtttctcttc ctgataatcg cacgcctttc tttttggcct gccattatca 601 gccgaagatc gaacatttaa acaataatac tcgtcagaat atttttaatt cttgctatgc 661 cccggttttc atgttacaat tgaacattct tcacgtgaga attgttcaaa tattgtacca 721 caatacatga ataccacatg cctaagttat cgaaacgaat actatatatt actggaggat 781 tttcgttttt ttcttataat tgaaaatggg ttgaaaattt accagttacc ttacgaatcg 841 tgcacctttc aactgatgtt ttaaagctaa gttatcttta atatagtgca tttgggaata 901 tatattaaat gttcaataca atatttccat aaatttatat attaataaat cttacaaact 961 aaccaatttt aagaagtgtt ttgcttttaa cgttttacga taaacaaagt aaacgctttc 1021 gaa // LOCUS M33420 71 bp ds-DNA BAD 25-MAY-1990 DEFINITION Figure 3. Nucleotide sequence of junctions in the multicloning site and promoter probe plasmids. ACCESSION M33420 KEYWORDS . SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 71) AUTHORS Joseph-Liauzun,E., Fellay,R. and Chandler,M. TITLE Transposable elements for efficient manipulation of a wide range of gram-negative bacteria: Promoter probes and vectors for foreign genes JOURNAL Gene 85, 83-89 (1989) STANDARD simple staff_entry BASE COUNT 20 a 15 c 13 g 23 t ORIGIN 1 aattcggtaa tgactccaac ttattgatag agtactcata tgtctagact cgaggatccc 61 gtcgttttac a // LOCUS MUSTCM1AA 805 bp ds-DNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor germline alpha-chain V-region, 5' end. ACCESSION M33586 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; germline; variable region. SOURCE Mouse (strain C.B20) lymphocyte DNA, clone p2111. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 805) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 188 > 526 T-cell receptor alpha-chain V-region sigp 188 244 T-cell receptor alpha-chain V-region signal peptide matp 245 526 T-cell receptor alpha-chain V-region BASE COUNT 169 a 218 c 203 g 215 t ORIGIN 1 caagaatcag atattcaaaa ccttccaggc tcgtgtttgc agcgtcagtt tttatgtgaa 61 cgggagactt ctccagcaga ctcttcccta cacctgcttc aagctcttgc tttcctcagc 121 ctctttcctg cctgtactgt tccagagttc ctccacgaca gagctgcagc cttcccaagg 181 ctcagccatg ctcctggcgc tcctctcagt cctggggata cactttctcc tgagagatgc 241 ccaagctcag tcagtgacac agcccgatgc tcgcgtcact gtctctcaag gagcctctct 301 gcagctgaga tgcaagtatt cctactctgg gacaccttat ctgttctggt atgtccagta 361 cccgcggcag gggctgctgc tgctcctcaa gtactattcc ggagacccag tggttcaagg 421 agtgaacggc tttgaggctg agttcagcaa gagtaactct tccttccacc cgcggaaagc 481 ctccgtgcac tggagcgact cggctgtgta cttctgtgct gtgagcgcac agtgtgtggg 541 gctgcagggg gagctgaaca caaacaccga tggctgtggt gctcagactc aagatctctc 601 ctgtggtctc tgtggcatct ctttggactt ggagagcaaa gaaggaaagt ttcgcagtct 661 cagctgtctt tctgcaactc atttccaact aaaacatcca ctatatgcaa tgctcaattg 721 tagttctgtg gaatgaggag actgtgatgt tatcctgagt gtatggctga gtgaacattg 781 cttgggcagt gactagacta gggta // LOCUS MUSTCM1AB 265 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33587 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 265) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 265 T-cell receptor alpha-chain V-region (AA at 2) BASE COUNT 54 a 78 c 67 g 64 t 2 others ORIGIN 1 cctctcctgg tacatccagc accatggccg tggcctccag tttctcctca agtactattc 61 gggaaaccca gtggttcaag gagtgaacgg cttcgaggct gagttcagca agagcgactc 121 ttccttccac cttcggaaag cctccgtgca ctggagcgac tcggctgtgt acttctgtgc 181 tgcgaccngn ttatctggta gcttcaataa gttgaccttt ggagcaggga ccagactggc 241 tgtgtgccca tacatccaga accca // LOCUS MUSTCM1AC 163 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33588 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 163) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 163 T-cell receptor alpha-chain V-region (AA at 3) BASE COUNT 42 a 42 c 44 g 35 t ORIGIN 1 tccacttgca gaaagcctca gtgcaagagt cagactcggc tgtgtactac tgtgctctga 61 ggccggggac acaggttgtg gggcagctca ctttcgggag cggtacaaga ctccaagttt 121 atgcaaacat ccagaaccca gaacctgctg tgtaccagtt aaa // LOCUS MUSTCM1AD 287 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33589 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 287) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 287 T-cell receptor alpha-chain V-region (AA at 3) BASE COUNT 69 a 88 c 67 g 63 t ORIGIN 1 ctattgcctt tttctggtat gtgcaatatc tcaacgaagc ccctcaggta ctcctgcgga 61 gctccacaga caacaagagg accgagcacc aagggttcca cgccactctc cataagagca 121 gcagctcctt ccatctgcag aagtcctcag cgcagctgtc agactctgcc ctgtactact 181 gtgctctgat ccggggacac aggttgtggg gcagctcact ttcgggagcg gtacaagact 241 ccaagtttat gcaaacatcc agaacccaga acctgctgtg taccagt // LOCUS MUSTCM1AE 283 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33590 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 283) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 283 T-cell receptor alpha-chain V-region (AA at 1) BASE COUNT 73 a 88 c 62 g 60 t ORIGIN 1 attgcctttt tctggtatgt gcaatatctc aacgaagccc ctcaggtact cctgcggagc 61 tccacagaca acaagaggac cgagcaccaa gggttccacg ccactctcca taagagcagc 121 agctccttcc atctgcagaa gtcctcagcg cagctgtcag actctgccct gtactactgt 181 gctctgagaa cctcaggagg aaactacaaa cctacgtttg ggaaagggac cagcctcgtg 241 gttcatccat acatccagaa cccagaacct gctgtgtacc agt // LOCUS MUSTCM1AF 246 bp ss-mRNA ROD 25-MAY-1990 DEFINITION Mouse T-cell receptor rearranged mRNA alpha-chain V-region, partial cds. ACCESSION M33591 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; constant region; joining exon; variable region. SOURCE Mouse (strain BALB/c) lymphocyte, cDNA to mRNA, clone M14T-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 246) AUTHORS Fondell,J.D., Marolleau,J.-P., Primi,D. and Marcu,K.B. TITLE On the mechanism of non-allelically excluded V-alpha-J-alpha T cell receptor secondary rearrangements in a murine T cell lymphoma JOURNAL J. Immunol. 144, 1094-1103 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 246 T-cell receptor alpha-chain V-region (AA at 3) BASE COUNT 69 a 80 c 53 g 44 t ORIGIN 1 aatatctcaa cgaagcccct caggtactcc tgcggagctc cacagacaac aagaggaccg 61 agcaccaagg gttccacgcc actctccata agagcagcag ctccttccat ctgcagaagt 121 cctcagcgca gctgtcagac tctgccctgt actactgtgc tctgagaacc tcaggaggaa 181 actacaaacc tacgtttggg aaagggacca gcctcgtggt tcatccatac atccagaacc 241 cagaac // LOCUS SYNCATAA 66 bp ds-DNA SYN 25-MAY-1990 DEFINITION Synthetic chloramphenicol acetyl-transferase (cat) gene, partial cds. ACCESSION M33419 KEYWORDS chloramphenicol acetyl-transferase. SOURCE Synthetic DNA, clones pEJL4[30,49]. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Joseph-Liauzun,E., Fellay,R. and Chandler,M. TITLE Transposable elements for efficient manipulation of a wide range of gram-negative bacteria: Promoter probes and vectors for foreign genes JOURNAL Gene 85, 83-89 (1989) STANDARD simple staff_entry BASE COUNT 16 a 12 c 18 g 20 t ORIGIN 1 aattcggtga tgctgccaac ttactgattt agtggatccg gtgattgatt gagcaagctt 61 cgacga // LOCUS SYNIS1AA 65 bp ss-mRNA SYN 25-MAY-1990 DEFINITION Synthetic insertion element IS1. ACCESSION M33418 KEYWORDS . SOURCE Synthetic DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 65) AUTHORS Joseph-Liauzun,E., Fellay,R. and Chandler,M. TITLE Transposable elements for efficient manipulation of a wide range of gram-negative bacteria: Promoter probes and vectors for foreign genes JOURNAL Gene 85, 83-89 (1989) STANDARD simple staff_entry BASE COUNT 21 a 12 c 11 g 21 t ORIGIN 1 aattcggtaa tgactccaac ttattgatag agtactcata tgtctagact cgaggatcca 61 aattt //
GenBank-Updates@genbank.bio.net (05/26/90)
LOCUS ECOSPEA 3236 bp ds-DNA BCT 26-MAY-1990 DEFINITION E.coli arginine decarboxylase (speA) gene, complete cds, agmatinase (speB) and methionine adenosyltransferase (metK) genes, 5' end. ACCESSION M31770 KEYWORDS agmatinase; arginine decarboxylase; metK gene; methionine adenosyltransferase; speA gene; speB gene. SOURCE E.coli (strain K12) DNA, clones pLC2-5 and lambda-[1H10,23G45]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3236) AUTHORS Moore,R.C. and Boyle,S.M. TITLE Nucleotide sequence and analysis of the speA gene encoding arginine decarboxylase of Escherichia coli JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by R.C.Moore, 02-FEB-1990. FEATURES from to/span description pept 192 < 1 (c) methionine adenosyltransferase (metK) pept 987 2963 arginine decarboxylase (speA) (EC 4.1.1.19) pept 3101 > 3236 agmatinase (speB) signal 3030 3067 rho-independent transcription terminator signal 811 816 -35 region signal 839 844 -10 region binding 977 980 ribosome binding site site 1878 1886 pyridoxal phosphate binding site (put.) BASE COUNT 743 a 784 c 860 g 849 t ORIGIN 62.9 min on K12 map. 1 tacccaaggt cgctggtggt gatttcgccg ccaactaaaa ccaatgccgg tttttacgta 61 ggtttcgcaa gcaacgcgtg ctttcggatc ctgttcgagg atcgcgtcta aaacggcatc 121 agaaatttgg tcagcaattt tgtcaggatg cccttcagag acggactcgg acgtaaaaag 181 gtgttttgcc atatttaata tcacctaaag agaatttggt tagctcaaac tgttgtgtgg 241 attttctgtg gtagcggatc ctaccacgac tctgcaggtt aaaaacactg gcagtctgag 301 tgttaatcgg tatggatgga ttaacatctg gatggctatt ttaggtcaat tcttcaccct 361 atttccactt ttttttgaat cgtgtctcat tctgttaaaa acgtggctgg aaatttttcc 421 tgacaatgcc ggcattctgc gtatttatct tttgcaattt tctgccattg tggggtataa 481 aacgcggcgc gcggcttaaa taaaaagcac acgacgtttc tttcgtgttg ccacttccag 541 ccgggttcaa atcagagttt tggcttgtgg gttcgtctta acaggcggcc gtggaggtga 601 tacgaaataa tgaaccgttg tctgctgctt aacctgtctc accgttctgg tgaagattcg 661 ttccccgcac tctgcatctc tgctttgcat acctgccgat gttataccca tctcggcgct 721 tctcaggatt caagagctgg ttacagttac tgaggactga acaagggcgc tcttgtaaaa 781 acaagagttt tctcgtggtt tcgccgaact ttcacactta cgttcggtta tgtgcttaat 841 aatgttatga aaaagaaacc ggttgcgcag ttggagcgtc agcattcact gctggaaaat 901 ccatgtgctt atgggttgtt atcgcagttc caggctgcga tagtcgttaa ctgttttaca 961 cttaataaaa taatttgagg ttcgctatgt ctgacgacat gtctatgggt ttgccttcgt 1021 cagcgggcga acacggtgta ctacgctcca tgcaggaggt tgcaatgagc tcccaggaag 1081 ccagcaagat gctgcgtact tacaatattg cctggtgggg caataactac tatgacgtta 1141 acgagctggg ccacattagc gtgtgcccgg acccggacgt cccggaagct cgcgtcgatc 1201 tcgcgcagtt agtgaaaact cgtgaagcac agggccagcg tctgcctgca ctgttctgtt 1261 tcccacagat cctgcagcac cgtttgcgtt ccattaacgc cgcgttcaaa cgtgcgaggg 1321 aatcctacgg ctataacggc gattacttcc ttgtttatcc gatcaaagtt aaccagcacc 1381 gccgcgtgat tgagtccctg attcattcgg gcgaaccgct gggtctggaa gccggttcca 1441 aagccgagtt gatggcagta ctggcacatg ctggcatgac ccgtagcgtc atcgtctgca 1501 acggttataa agaccgcgaa tatatccgcc tggcattaat tggcgagaag atggggcaca 1561 aggtctatct ggtcattgag aagatgtcag aaatcgccat tgtgctggat gaagcagaac 1621 gtctgaatgt cgttcctcgt ctgggcgtgc gtgcacgtct gcgttcgcag ggttcgggta 1681 aatggcagtc ctccggcggg gaaaaatcga agttcggcct ggctgcgact caggtactgc 1741 aactggttga aaccctgcgt gaagccgggc gtctcgacag cctgcaacta ctgcacttcc 1801 acctcggttc gcagatggcg aatattcgcg atatcgcgac aggcgttcgt gaatccgcgc 1861 gtttctatgt ggaactgcac aagctgggcg tcaatattca gtgcttcgac gtcggcggcg 1921 gtctgggcgt ggattatgaa ggtactcgtt cgcagtccga ctgttcggtg aactacggcc 1981 tcaatgaata cgccaacaac attatctggg cgattggcga tgcgtgtgaa gaaaacggtc 2041 tgccgcatcc gacggtaatc accgaatcgg gtcgtgcggt gactgcgcat cacaccgtgc 2101 tggtgtctaa tatcatcggc gtggaacgta acgaatacac ggtgccgacc gcgcctgcag 2161 aagatgcgcc gcgcgcgctg caaagcatgt gggaaacctg gcaggagatg cacgaaccgg 2221 gaactcgccg ttctctgcgt gaatggttac acgacagtca gatggatctg cacgacattc 2281 atatcggcta ctcttccggc atctttagcc tgcaagaacg tgcatgggct gagcagcttt 2341 atttgagcat gtgccatgaa gtgcaaaagc agctggatcc gcaaaaccgt gctcatcgtc 2401 cgattatcga cgagctgcag gaacgtatgg cggacaaaat gtacgtcaac ttctcgctgt 2461 tccagtcgat gccggacgca tgggggatcg accagttgtt cccggttctg ccgctggaag 2521 ggctggatca agtgccggaa cgtcgcgctg tgctgctgga tattacctgt gactctgacg 2581 gtgctatcga ccactatatt gatggtgacg gtattgccac gacaatgcca atgccggagt 2641 acgatccaga gaatccgccg atgctcggtt tctttatggt cggcgcatat caggagatcc 2701 tcggcaacat gcacaacctg ttcggtgata ccgaagcggt tgacgtgttc gtcttccctg 2761 acggtagcgt agaagtagaa ctgtctgacg aaggcgatac cgtggcggac atgctgcaat 2821 atgtacagct cgatccgaaa acgctgttaa cccagttccg cgatcaagtg aagaaaaccg 2881 atcttgatgc tgaactgcaa caacagttcc ttgaagagtt cgaggcaggt ttgtacggtt 2941 atacttatct tgaagatgag taagtcctgt gttacttgaa tccgcttaat ttagcggtga 3001 taatccgcca caatttattg tgacaaatcc aacccttcct cgtcgggcct aacgacgcgg 3061 aagggttttt ttatatcgac tttgtaatag gagtccatcc atgagcacct taggtcatca 3121 atacgataac tcactggttt ccaatgcctt tggtttttta cgcctgccga tgaacttcca 3181 gccgtatgac agcgatgcag actgggtgat tactggcgtg ccgttcgata tggcca // LOCUS FIBEGASE 2310 bp ds-DNA BCT 26-MAY-1990 DEFINITION F.succinogenes endoglucanase 3 (cel3) gene, complete cds. ACCESSION M29047 M29681 KEYWORDS cellobiosidase; endoglucanase. SOURCE F.succinogenes (strain S85, ATCC 19169) DNA. ORGANISM Fibrobacter succinogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 2310) AUTHORS McGavin,M.J., Forsberg,C.W., Crosby,B., Bell,A.W., Dignard,D. and Thomas,D.Y. TITLE Structure of the cel-3 gene from Fibrobacter succinogenes S85 and characteristics of the encoded gene product, endoglucanase 3 JOURNAL J. Bacteriol. 171, 5587-5595 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Dignard, 14-OCT-1989. FEATURES from to/span description pept 177 2153 endoglucanase 3 precursor sigp 177 245 endoglucanase 3 signal peptide A (alt.) sigp 177 251 endoglucanase 3 signal peptide A' (alt.) matp 246 2150 endoglucanase 3 A (alt.) matp 252 2150 endoglucanase 3 A' (alt.) site 167 172 ribosome binding site site 2172 2213 region of dyad symmetry BASE COUNT 649 a 653 c 529 g 479 t ORIGIN 1 ggatccgggt gcgtcagtta aataaaatat tttttaacgt ttttcgtaca gaaagtggac 61 ttttagacca aaacacttat tacacttttt attccgatat atcattttac atagcataaa 121 accgaccccc aaatatatct ttggtaaaaa agaaaaaatc accttaagag ggttttatgc 181 aactcaagaa tttctatccc aaaatgagcg ttctcggtat cgcaaccgtg atggcactta 241 ccgcctgtgg cgatgaaaat acccaggcac tgttcgccaa caatccggtt ccgggtgccg 301 aaaatcaggt tccggtttct agcagcgaca tgagcccgac ctctagcgac gctgtcattg 361 acccgacctc cagctctgcc gcagtggtcg acccgtctac gctccctgca gaaggtccta 421 ttaccatgcc ggaaggtctc ggcactttgg tcgatgactt tgaagatggc gataacttga 481 gcaaaatcgg tgattactgg tacacctaca acgataacga caacggtggt gcatccatca 541 tcacgactcc gctaaacgaa gaagaaaaca tcatcccggg ccgcgtcaac aacggttcca 601 actacgcctt gcaagtcaac tacacgcttg atagaggcga ttacgaatac gatccgtacg 661 taggctgggg cgtgcaggtc gcaccggacg aagccaacgg acatttcggc ggccttacct 721 actggtacaa gggcggcgca cacgaagtac atatcgaaat caccgacgtc gaagactacg 781 acgtgcatct cgccaagttc ccggcatccc gcacatggaa gcaggctgtc gtccgcttca 841 aggacctcgt tcaaggtggc tggggcaagg aaattccgtt cgacgccaag cacatcatgg 901 caatcagctt ccaggccaag ggaaacaaga gcaagctcgt gaccgactcc ctcttcatcg 961 acaacatcta cctgcaggat tcttccgaag ttgaaaagga ccagccggat atggaaatca 1021 aggacccggt cattccggtc gttgaattta ccgaagctga aatcactgtg acgaacccgt 1081 tgcaggaaaa ggccatgaag tacctcaaca agggtgtcaa ctttaccaac tggctcgaaa 1141 acgcagatgg caagttcaag tcctttgaat tgggcgaaag cgacgtcaag attcttgccg 1201 acaacggatt caagagcctc cgcttgccga ttgaccttga cctctatgcc acaaaccgtg 1261 acgcattcat cgcaggcacc gacacagaac tcaagttcga tgacgacacc ttgttcctgg 1321 ttctcgactc cttcgtagaa tggaccgcca agtacaacat gtctttcgtg attgactacc 1381 atgaatatga caacagctac aacaccacca gcgctaagga ccccaactac atcaagatga 1441 tggcagaaac gtggaagcat gttgcagccc actacgccga aagcccccgc gaagacttgt 1501 tcttcgaact cttgaacgaa ccggacatga gcgatggtaa ggtcactgca gcaacatgga 1561 ccaccgcagc ccaggccatg attgacgcca tccgcacggt tgataccaag cacaccatcc 1621 tcttcggtga tgcccagtgg tactccatca cgctcctcgc caagcgcact ccgttcaccg 1681 atgacaacat catctacgtg atccacacct acgaaccgtt cgccttcacg catcagggcg 1741 gttcctggac ggactacgcc accatccacg atattccgtt cccctacgat ccggcaaagt 1801 ggtctacggt ttctggcgac ttcggtgtca acaagagcac aaagtcctac gtgaaaacca 1861 acatcaagaa ctactacaag accggcagca aggaagccat cttggaacag attctcaagg 1921 ccaagaagtg ggccgccacc aacaacgtac cggtgatcat caacgaattc ggcgcattga 1981 acctccgctc taccgctgaa tcccgcctca actacctcac ggccatgcgc gaaatctgcg 2041 ataccctcca gattccttgg acgcactggg gctacaccgg caacttctcc gtgatcgaaa 2101 acggcaagtt gattgaaggc ctcgacaagg cactcggcgt cggtagcaaa taagtctctc 2161 cttaaaaccc cctcaaaaaa aggtcacgca gaaatgcgtg gcttttttag taggaagtag 2221 acggtaggaa gttggaagtt agaagtagga agtaacagga atggcgcaat ggatacagtt 2281 gacacagata cattacaaaa ccccggatcc // LOCUS SFSSA 1747 bp ss-RNA VRL 26-MAY-1990 DEFINITION Sandfly fever sicilian virus S RNA encoding N protein, complete cds, and NS-s protein, complete cds. ACCESSION J04418 KEYWORDS N protein; NS-s protein. SOURCE Sandfly fever Sicilian virus, cDNA to viral RNA. ORGANISM Sandfly fever Sicilian virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Uukuvirus. REFERENCE 1 (bases 1 to 1747) AUTHORS Marriott,A.C., Ward,V.K. and Nuttall,P.A. TITLE The S RNA segment of Sandfly fever sicilian virus: Evidence for an ambisense genome JOURNAL Virology 169, 341-345 (1989) STANDARD full staff_review REFERENCE 2 (bases 693 to 695) AUTHORS Marriott,A.C., Ward,V.K. and Nuttall,P.A. JOURNAL Unpublished (1990) Oxford, UK STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.C.Marriott 12-JAN-1989. FEATURES from to/span description pept 42 782 N protein pept 1727 924 (c) NS-s protien revision 693 695 ttc in [2]; tc in [1] BASE COUNT 483 a 358 c 459 g 447 t ORIGIN Unreported. 1 acacaaaggt ccctagttaa tctgagtgag ctaagtttga aatggacgag taccagaaaa 61 ttgctgttga gtttggagag caggctattg atgagactgt gatccaggat tggctacaag 121 catttgcgta tcaaggattt gatgccagaa caattataca caaccttgtg cagcttggag 181 ggaagagttg ggaagaggat gccaagaaga tgatcatcct atccctaact cgtggcaaca 241 agcccaagaa gatggttgag agaatgtctc cagagggagc aagagaagtt aagagcctgg 301 ttgcaaagta taagatagta gagggcagac caggcaggaa tggaattacc ctgtcaaggg 361 tgctgcagcc ctggctgggt ggacagtcca agctgtggaa gtggttgaaa acttcttacc 421 agtcccaggg agcacaatgg accgcattgt gtggacaaac ataccccagg cagatgatgc 481 atccaagctt tgccggtctc attgacccaa gcctcgacca ggaggatttt aatgcagtat 541 tggatgctca caaacttttc ttgttcatgt tttccaaaac aatcaatgtc agcctccgcg 601 gtgcgcagaa gagagacatt gaggaatcat tctctcaacc aatgcttgct gctatcaata 661 gctcattcat tgacaacact cagaggaggg cattcttgac taagtttggg atcctaactt 721 ctggagcaag agctacagca gttgtaaaga agattgcaga agtttacagg aaactagagt 781 aagctgctgc tagtgtgggg tgggatgggg attctgggtt gggggttctg gggtggaggg 841 tggctaggtg gggggtggca agggtggatt cggtttgggt tggggtcatg gggaggggtg 901 ggtctggggc tgggcagcgg agatcaaaag tcagagtcag acgagctctc atcattttca 961 tccacatgac tgtgtattgg ggtccaaaga gaattgccat actcggtgag gccagtagaa 1021 gggtcacttg ctctatagga tctaatcact gttcttacat caagtgcctc cccagaggag 1081 gcagtgtcaa aaggctctgc attgataagt ctgagacaaa ccagagatcc tatctctcta 1141 aatagatcgt atccattgta atgctcatca ctaagaccca acctcctagc ttcttgtagt 1201 atctttttgt gtgcctgaac tatgcactca tccaagctat gtgaatcccc cattctcaga 1261 atgtaagaca ttagctgatc ccttgtttgt agccctctca caaatctatc actgcatatg 1321 ctaaagatct cacaatcagg gatacctagt ggccagctaa gagccttcag gacatttggc 1381 agcccctttc tagagaaact tgtgaggtca aacctggaga ggtcacttgc cataccttgg 1441 aaggtataca tcataggctt gacagaacta aaatagcatg ctgggcccca agaagctggc 1501 aactctccaa gggaataaaa gtcagccagt gagtttctgc gtccaaaccc aagtcttaac 1561 ttctctagtg gtatttcaca atgctcataa gttgaaacgt catgagtgtg aaatttattg 1621 taggcaacat aagacacact ggagaggagt ctatgacacc tcacatcaat gttaattgcc 1681 gggtagtcaa acatgtactg gctgttcatc atgttgttgt tgatcattga ctagggggtc 1741 tttgtgt // LOCUS RSSB800AB 437 bp ds-DNA BCT 26-MAY-1990 DEFINITION R.sphaeroides B800-850 alpha and beta subunits of major light-harvesting complex. ACCESSION X05200 KEYWORDS light-harvesting complex. SOURCE Rhodobacter sphaeroides. ORGANISM Rhodobacter sphaeroides Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 437; no enum.) AUTHORS Ashby,M.K., Coomber,S.A. and Hunter,C.N. TITLE Cloning,nucleotide sequence and transfer of genes for the B800-850 light harvesting complex of Rhodobacter sphaeroides JOURNAL FEBS Lett. 213, 245-248 (1987) STANDARD simple automatic FEATURES from to/span description pept 40 195 B800-850 beta subunit (AA 1-51) pept 210 374 B800-850 alpha subunit (AA 1-54) BASE COUNT 78 a 156 c 124 g 79 t ORIGIN 1 gccctagcgc acaccgtcga tttaccattg gagacgcaca tgactgacga tctcaacaaa 61 gtctggccga gcggcctcac cgttgccgaa gccgaagaag ttcataagca actcatcctc 121 ggcacccgcg tcttcggtgg catggctctg ctcgcgcact tcctcgccgc cgctgcgacc 181 ccctggctcg gctgatatga gagactgaca tgaccaacgg caaaatctgg ctcgtggtga 241 aaccgaccgt cggcgttccg ctgttcctca gcgctgccgt catcgcctcc gtcgttatcc 301 acgctgctgt gctgacgacc accacctggc tgcccgccta ctaccaaggc tcggctgcgg 361 tcgcggccga gtaatgctgc gcaagcgcgg gcctgcgggc ccacgccagc cagtccgtga 421 gtccgagcag gccggga // LOCUS RSSPETA 316 bp ds-DNA BCT 26-MAY-1990 DEFINITION R.sphaeroides Rieske Fe-S protein cytochrome b (petA) gene, 5' end. ACCESSION M18577 KEYWORDS cytochrome b. SOURCE R.sphaeroides (strain GA) DNA. ORGANISM Rhodobacter sphaeroides Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 316) AUTHORS Davidson,E. and Daldal,F. TITLE fbc operon, encoding the Rieske Fe-S protein cytochrome b, and cytochrome c1 apoproteins previously described from Rhodopseudomonas sphaeroides, is from Rhodopseudomonas capsulata JOURNAL J. Mol. Biol. 195, 25-29 (1987) STANDARD full staff_entry FEATURES from to/span description pept 32 > 316 Rieske Fe-S protein cytochrome b (gtg start codon) BASE COUNT 54 a 111 c 99 g 52 t ORIGIN Unreported. 1 ctgcagcggc ccgaggaagg gagaagttct cgtgtccaac gcagaagatc acgcaggcac 61 tcgcagggat ttcctgtatt acgccacggc cggagccggg gcggtggcca ccggggccgc 121 cgtctggccg ctgatcaacc aaatgaatcc gtcggccgac gtgcaggccc tcgcctccat 181 cttcgtcgat gtgagctcgg tcgagccggg tgtccagctg accgtcaagt tcctcggcaa 241 accgatcttc atccgccgcc gcaccgaggc cgacatcgag ctcggccgct ccgtccagct 301 cggccagctg gtcgac // LOCUS HUMERCC3A 2751 bp ss-mRNA PRI 26-MAY-1990 DEFINITION Human DNA repair helicase (ERCC3) mRNA, complete cds. ACCESSION M31899 KEYWORDS Cockayne's syndrome; DNA repair; excision repair; helicase. SOURCE Human lymphoid cell line K562 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2751) AUTHORS Weeda,G., Van Ham,R.C.A., Vermeulen,W., Bootsma,D., Van der Eb,A.J. and Hoeijmakers,J.H.J. TITLE Identification of the molecular defect involving the human repair disorders xeroderma pigmentosum and Cockayne's syndrome in the ERCC-3-encoding, a presumed DNA repair helicase JOURNAL Mol. Cell. Biol. 10, 2570-2581 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G. Weeda, 07-FEB-1990, for release after publication. FEATURES from to/span description pept 96 2444 DNA repair helicase /hgml_locus_uid="LF0034Q" /map="2q21" /nomgen="ERCC3" BASE COUNT 727 a 668 c 726 g 630 t ORIGIN 1 gggagcttcc ggattgagcc ggaagtcccc ccagagcgga tgccgcggcg ggcctgtggg 61 agcggggtca tcttctctct gctgctgtag ctgccatggg caaaagagac cgagcggacc 121 gcgacaagaa gaaatccagg aagcggcact atgaggatga agaggatgat gaagaggacg 181 ccccggggaa cgaccctcag gaagcggttc cctcggcggc ggggaagcag gtggatgagt 241 caggcaccaa agtggatgaa tatggagcca aggactacag gctgcaaatg ccgctgaagg 301 acgaccacac ctccaggccc ctctgggtgg ctcccgatgg ccatatcttc ttggaagcct 361 tctctccagt ttacaaatat gcccaagact tcttggtggc tattgcagag ccagtgtgcc 421 gaccaaccca tgtgcatgag tacaaactaa ctgcctactc cttgtatgca gctgtcagcg 481 ttgggctgca aaccagtgac atcaccgagt acctcaggaa gctcagcaag actggagtcc 541 ctgatggaat tatgcagttt attaagttgt gtactgtcag ctatggaaaa gtcaagctgg 601 tcttgaagca caacagatac ttcgttgaaa gttgccaccc tgatgtaatc cagcatcttc 661 tccaggaccc cgtgatccga gaatgccgct taagaaactc tgaaggggag gccactgagc 721 tcatcacaga gactttcaca agcaaatctg ccatttctaa gactgctgaa agcagtggtg 781 ggccctccac ttcccgagtg acagatccac agggtaaatc tgacatcccc atggacctgt 841 ttgacttcta tgagcaaatg gacaaggatg aagaagaaga agaagagaca cagacagtgt 901 cttttgaagt caagcaggaa atgattgagg aactccagaa acgttgcatc cacctggagt 961 accctctgtt ggcagaatat gacttccgga atgattctgt caaccctgat atcaacattg 1021 acctaaagcc cacagctgtc ctcagaccct atcaggagaa gagcttgcga aagatgtttg 1081 gaaacgggcg tgcacgttcg ggggtcattg ttcttccctg cggtgctgga aagtccctgg 1141 ttggtgtgac tgctgcatgc actgtcagaa aacgctgtct ggtgctgggc aactcagctg 1201 tttctgtgga gcagtggaaa gcccagttca agatgtggtc caccattgac gacagccaga 1261 tctgccggtt cacctccgat gccaaggaca agcccatcgg ctgctccgtt gccattagca 1321 cctactccat gctgggccac accaccaaaa ggtcctggga ggccgagcga gtcatggagt 1381 ggctcaagac ccaggagtgg ggcctcatga tcctggatga agtgcacacc ataccagcca 1441 agatgttccg aagggtgctc accatcgtgc aggcccactg taagctgggt ttgactgcga 1501 ccctcgtccg cgaagatgac aaaattgtgg atttaaattt tctgattggg cctaagctct 1561 acgaagccaa ctggatggag ctgcagaata atggctacat cgccaaagtc cagtgtgctg 1621 aggtctggtg ccctatgtct cctgaatttt accgggaata tgtggcaatc aaaaccaaga 1681 aacgaatctt gctgtacacc atgaacccca acaaatttag agcttgccag tttctgatca 1741 agtttcatga aaggaggaat gacaagatta ttgtctttgc tgacaatgtg tttgccctaa 1801 aggaatatgc cattcgactg aacaaaccct atatctacgg acctacgtct cagggggaaa 1861 ggatgcaaat tctccagaat ttcaagcaca accccaaaat taacaccatc ttcatatcca 1921 aggtaggtga cacttcgttt gatctgccgg aagcaaatgt cctcattcag atctcatccc 1981 atggtggctc caggcgtcag gaagcccaaa ggctagggcg ggtgcttcga gctaaaaaag 2041 ggatggttgc agaagagtac aatgcctttt tctactcact ggtatcccag gacacacagg 2101 aaatggctta ctcaaccaag cggcagagat tcttggtaga tcaaggttat agcttcaagg 2161 tgatcacgaa actcgctggc atggaggagg aagacttggc gttttcgaca aaagaagagc 2221 aacagcagct cttacagaaa gtcctggcag ccactgacct ggatgccgag gaggaggtgg 2281 tggctgggga atttggctcc agatccagcc aggcatctcg gcgctttggc accatgagtt 2341 ctatgtctgg ggccgacgac actgtgtaca tggagtacca ctcatcgcgg agcaaggcgc 2401 ccagcaaaca tgtacacccg ctcttcaagc gctttaggaa atgatgctta ggcagggtac 2461 ttcgttcaag accggcgctt ggcacccttg ttggaaaggg attttcagca taacattttc 2521 cttccacctc tttgaccttc cctccagcgt tggccaaatt gtgctgagga agatgcatca 2581 agggcttggc tgtgccttca taggtcatct agggttttat aaaggaggag gagacaatat 2641 tttttcaaac tttttgggga gtggggtcat ttctgtatat aaaaaatgtt aatatttaag 2701 gtgtatttat gttaccgttc tgaataaaca gaatggacca ttgaaccagt a // LOCUS BOLREPA 182 bp ds-DNA PLN 26-MAY-1990 DEFINITION B.campestris tandemly repeated DNA. ACCESSION M30962 KEYWORDS repetitive DNA. SOURCE B.campestris (strain Var B-85) seedling DNA. ORGANISM Brassica campestris Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 182) AUTHORS Das Gupta,J. and Mandal,R.K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.K.Mandal, 21-DEC-1989. Bose Institute, Dept. Biochemistry, Centenary Building, P 1/12 CIT Scheme VIIM, Calcutta 700 054 INDIA. FEATURES from to/span description BASE COUNT 54 a 39 c 29 g 60 t ORIGIN 1 aagcttctta catcgtgatt catcctggtt tgattagaat gacaaagaag ctgtccaatt 61 cccaaacagg aaaactggga tcacctgatt tgaaagtggg ttagcttctt catcctaact 121 cctatgagat ttcttcaact tcctagtgat tctccattac tttaagtatc aaaatcaagc 181 tt // LOCUS BOLREPB 182 bp ds-DNA PLN 26-MAY-1990 DEFINITION B.juncea tandemly repeated DNA. ACCESSION M30963 KEYWORDS repetitive DNA. SOURCE B.juncea (strain Var B-9) DNA. ORGANISM Brassica juncea Unclassified. REFERENCE 1 (bases 1 to 182) AUTHORS Das Gupta,J. and Mandal,R.K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.K.Mandal, 21-DEC-1989. Bose Institute, Dept. Biochemistry, Centenary Building, P 1/12 CIT Scheme VIIM, Calcutta 700 054 INDIA. FEATURES from to/span description BASE COUNT 53 a 42 c 30 g 57 t ORIGIN 1 aagcttctta cagagtcatt tatcctggtt tgattggaac accgaagaag ctgtcctatt 61 cccaaactgg gaaactggaa tcacctgatt agaaagtggg ataacttctt catcccaact 121 cctatgagat ttattcaact tcctggtgat tctccaacac tttatgtatc caaatcaagc 181 tt // LOCUS HUMHPV16A1 336 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H022. ACCESSION M33610 KEYWORDS insertion site. SEGMENT 1 of 2 SOURCE Human cervical cancer DNA, clone H022. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 336) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 325 326 Human DNA end/HPV-16 DNA start BASE COUNT 114 a 61 c 56 g 105 t ORIGIN 1 aggtatataa atggccaagg tagaagatat caaaatgagg tggatttgat ttctcatgtg 61 agactcatag ctaatttaaa tgaaaattta aataagattt atttgacatg attgggaaca 121 attcaattca actttacaaa cactgattaa atgtctacca tctggatggc accgtgctaa 181 gtgagtctcc aaacctgaac tgtgattata aagggcattt ataaactttc cctcaaagat 241 aggacatttg cccatgtaat catgccatct ttaaaagcat cactctaaat tatttaggtg 301 acttctaact ttgcccagta ctctgtccca cagcta // LOCUS HUMHPV16A2 1002 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 3' flank, clone H022. ACCESSION M33611 KEYWORDS insertion site. SEGMENT 2 of 2 SOURCE Human cervical cancer DNA, clone H022. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1002) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 9 10 HPV-16 DNA end/Human DNA start BASE COUNT 284 a 190 c 206 g 322 t ORIGIN About 3187 bp after segment 1. 1 gaagtggaat aaagtgaaag cctcactctt ctctagccta agttttagag tccagtgaag 61 cattgcaagc ataggctttg tagtcagaaa accctgagat caaatcctgg ttctaccact 121 tgctatagcg atcttgggca aggggtcaga tctctctaag cctgtttcct catctgtaag 181 gaagggtatt atatcacata aggttactgt gaggactaaa ttagactaag tatgcaatag 241 gaatacaggg tccagttttc tttggatgta atgggcctgg aaaattcctt aaaatccttt 301 tcacctacaa aatcttatga agttctgcct attttctgct taaaaacttt aaaaaattaa 361 tagaaataaa agagaattct actagagaga taggttgacg ttacttcttc cttgcttttt 421 ccttaaagtg gaatgttaaa aactaggata tgcctggaaa gtgttctatc tacaaaaaag 481 gaagttagca gccgctgaaa agtaactaca gatggctatt cactttactc tgaaagcatt 541 tgctgttgat ataatcacac cacaggaaaa catcataatg ttggctgaaa gaaatctgaa 601 atgacacagc aataatgctt catcatgtag aagttggttt caagtttttt tttttttttc 661 ggtctggata gtgtgattgc aagaagggag gctatgctag cttggttata agcagggaag 721 ttggctgtga ggagataaac agagatctca caggaattct ggggtagaaa tcactggacc 781 ggaactgaag ggctatctcc cagcttctgt ttctgccttt tcattcagtc attccctcgt 841 ttactcaaca gttccctctg ctttggtggc agtttctgct ccttctcaag gctgacttgc 901 acatggctct gacttgctgt ggcctcctct ccatcattct ctgcatcagg tgctttcaac 961 cttgatttta ttgtttatat atacttatga acttttctgc ag // LOCUS HUMHPV16B 871 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H404. ACCESSION M33612 KEYWORDS insertion site. SOURCE Human cervical cancer DNA, clone H404. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 871) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 861 862 Human DNA end/HPV-16 DNA start BASE COUNT 291 a 147 c 164 g 269 t ORIGIN 1 atactctgag taaacaagta aaacatttgg taaaataact ggaaggatat ataccatagt 61 aaatgattct ttttcaaatt ttctattata tagctatata aggtatgaat ctagtagtta 121 ccctcaaatt agggtaaaca atttcctcag cagtttgagc agctcatctc ataatacttt 181 gcaaagatag ccacacaagg gaatgggctg cttgatttga acacaggtgg ggatggatta 241 atagaactgg ggatcaggga acattgggca ggactaataa gaattaggca gtcagaaaaa 301 gatttacaaa aaagactgta taacgagtct aaagataaat tctacctatt taacatttct 361 gcctgagttt ggagaaggca agaaaacatt cttctcttcc tcttacgtac acagacaatt 421 agggaagcca caatgagata atttatgcta tgttagtgag taacacataa ttttccttca 481 cagctgatat aacttgatta ctggagtggc agtggaaggg catggagacc caggccatgg 541 tcacttttct aggtgctcct acgactcaat ttctctcttc tgtcttgatt cctttgggag 601 attcctggat tttagaaaat cagatgagta agttgttatc atctgaaaaa tgccctctta 661 ccacacaatt atctattaga ggaaagttta ggaacagttg gtttaactga gagaaataaa 721 gataatctct atctcccttg cctgctctta ggataagggt tctgagatcc tatataatct 781 tatatcattt aacataaaca caatttctta ctttgcttga aaagttgtat taaagattcc 841 agggtgcagt taaatacact tcacaatata c // LOCUS HUMHPV16C1 1130 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H705. ACCESSION M33613 KEYWORDS insertion site. SEGMENT 1 of 2 SOURCE Human cervical cancer DNA, clone H705. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1130) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 1120 1121 Human DNA end/HPV-16 DNA start BASE COUNT 321 a 222 c 244 g 343 t ORIGIN 1 tgccatcatt aatgcagctg gcacgacagg ttcccgactg aaacggcagt gagcgcaacg 61 caattaatgt gagttagctc actcattagg caccccaggc tttacacttt atgcttccgg 121 ctcgtatgtt gtgtggaatt gtgagcggat aacaatttca cacaggaaac agctatgaca 181 tgattacaat tcgagctcgg taccagcaca atgaggaatg catgctagca caagtgaaac 241 tcatagatgt ccattgtgct atgcattttt tccttgggcc tgatccattt atccatttac 301 tggtttcctg tctgtaaatt tagaaaagat acaggctctc tgaaaagtaa tttctgtctc 361 ttacaagtga agggttaatc aaccaatcca cataattttc tccagtactg agagatcatt 421 tgttttaata aatgcaaata aggtttctta tagttaaagg taattggctt ttcattgtaa 481 ttcttgatgc tggtcatttt gtgtctgagt tgttcctaat tgctttggtt cagagtctga 541 gaaatgaaat agccccttga ctataactgt aactacaatt ataacaattt atttatttaa 601 atcagcaatc cctgcaaagt catttacagt ttgtttattt cagtatgttt tacaaggtgc 661 aacaaaagca gcctcatcac atagcaaatc tttcttacag gattaaaagt taatgggtaa 721 ggtaagtctg gcataggcat taaagtggaa gcattgtttc ttcttgactg gtcaacttta 781 gagacaactt ttcccattcg aagttatcta tcctctaaaa tatacagaga ttgaggccag 841 gtgggatggc tcacccctgt aatcccagaa ctttgggagg ccaagatgag tggattgctt 901 gagctcagaa gtttgagacc ttggtaacat ggcgggatgc cggtggtgcc atgcctgtgg 961 tcccagcttc ttgggggctg aggtgggagg accttctgag cctggtggca aagttgcagt 1021 aagctgtgtt ggtatcactg cactccagcc tgcactcctg cacaaagcaa gaccctgtct 1081 caaaacaata aataaattaa aatatagaga gactttgcat tgcaaaggca // LOCUS HUMHPV16C2 148 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 3' flank, clone H705. ACCESSION M33614 KEYWORDS insertion site. SEGMENT 2 of 2 SOURCE Human cervical cancer DNA, clone H705. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 148) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 28 29 HPV-16 DNA end/Human DNA start BASE COUNT 45 a 24 c 28 g 51 t ORIGIN About 489 bases after segment 1. 1 attatcacag atggtacaat gggcctactg atgcagtgat aatagtactg agatgtacta 61 ttatcccaca tttagttaag ttaggattga tcctagattc acatgttgtc agtgtgatgc 121 cttaaatatc aagtttccaa ttaagctt // LOCUS HUMHPV16D2 510 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 3 ' flank, clone H901. ACCESSION M33616 KEYWORDS insertion site. SEGMENT 2 of 2 SOURCE Human cervical cancer DNA, clone H022. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 510) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 10 11 HPV-16 DNA end/Human DNA start BASE COUNT 140 a 133 c 94 g 143 t ORIGIN About 3994 bp after segment 1. 1 acattattat ggaaacagat ctgtgagtac caagaaaaga ggataaagat tcatcccatc 61 caccagtcat tcccatgcac ctctacccgc catcccctgt atccaggaca acccccttct 121 gacaccaaaa tgcatttcac cattggctgc tgtcggtaga taatacctgc tcagcatttg 181 ggacaagttc cagacataac ttcctcttag tgaatgatcc tgacaggaga aagaattgag 241 cttaatttat gccatctaat aacctcagtg cagctacttg ggaagttagc cctccagagt 301 ttcccccaaa gttttctcca gtgaattaca gtgccatata ttctcattgc taccagcgct 361 gctcccaaaa tctatctgct gtttaatagt ttttaccttt caaaaatgca agctggctgg 421 gcgtggattt ttgaaagcat tcctcctgcc ttggcctctc aaagtgctgg attagagggt 481 gccttctaat cccagcaatc agcattggaa // LOCUS HUMHPV1D1 510 bp ds-DNA PRI 26-MAY-1990 DEFINITION Human DNA/HPV-16 insertion site, 5' flank, clone H901. ACCESSION M33615 KEYWORDS insertion site. SEGMENT 1 of 2 SOURCE Human cervical cancer DNA, clone H022. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 510) AUTHORS Wagatsuma,M., Hashimoto,K. and Matsukura,T. TITLE Analysis of integrated human papillomavirus type 16 DNA in cervical cancers: Amplification of viral sequences together with cellular flanking sequences JOURNAL J. Virol. 64, 813-821 (1990) STANDARD simple staff_entry FEATURES from to/span description site 500 501 Human DNA end/HPV-16 DNA start BASE COUNT 159 a 82 c 101 g 168 t ORIGIN 1 attcgagctc ggtacccaac atctcaaaat tttgttcttc agtctgtaaa atgggatgat 61 aaatctctca ggtttggtgt aagaaaaaaa taatatgctc acctaataga ccttcaatta 121 ctggtagttt ccatcatctt aatgaggatt atatctttat agtgagcacc cattagatgg 181 tgttgataaa tacatcaatg agtattttag gcagaaagca gagtaaagca gaagtactgg 241 cattctttgc tgtactcagt tttattaact gattttatat tgatcacgtt ctttgttaca 301 tgtcagtatt atagtggcag ttgaaggtgg taatattttt agtctccgtt agtgaaatga 361 caggcattga gctctcagtc atacctttgt aggccttcgt tgaggtgaat acctacctct 421 taactagaaa aagatggaga atttcttgct tggaaggaaa ttaatgcaat gtccaggtca 481 tctcctaaaa agcctgaagg aaacaaagta // LOCUS HUMMHDQBH 1104 bp ss-mRNA PRI 26-MAY-1990 DEFINITION Human MHC HLA-DQ beta mRNA, complete cds. ACCESSION M32577 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (DR4-Dw14), cDNA to mRNA, LS40 homozygous cell line. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1104) AUTHORS Hilden,J.M., Curtsinger,J.M., Cairns,J.S. and Bach,F.H. TITLE DQ beta sequences in HLA-DR4 haplotypes JOURNAL Hum. Immunol. 18, 261-264 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 754 MHC HLA-DQ beta precursor (AA at 2) /nomgen="LS0098W" /map="6p21.3" /hgml_locus_uid="HLA-DQB1" sigp < 1 64 MHC HLA-DQ beta signal peptide (AA at 2) matp 65 751 MHC HLA-DQ beta BASE COUNT 231 a 324 c 303 g 246 t ORIGIN 1 aggccttcgg gtagcaactg tgaccttgat gctggcgatg ctgagcaccc cggtggctga 61 gggcagagac tctcccgagg atttcgtgta ccagtttaag ggcatgtgct acttcaccaa 121 cgggacggag cgcgtgcgtc ttgtgaccag atacatctat aaccgagagg agtacgcacg 181 cttcgacagc gacgtggggg tgtatcgggc ggtgacgccg ctggggccgc ctgccgccga 241 gtactggaac agccagaagg aagtcctgga gaggacccgg gcggagttgg acacggtgtg 301 cagacacaac taccagttgg agctccgcac gaccttgcag cggcgagtgg agcccacagt 361 gaccatctcc ccatccagga cagaggccct caaccaccac aacctgctgg tctgctcagt 421 gacagatttc tatccagccc agatcaaagt ccggtggttt cggaatgacc aggaggagac 481 aactggcgtt gtgtccaccc cccttattag gaacggtgac tggaccttcc agatcctggt 541 gatgctggaa atgactcccc agcgtggaga cgtctacacc tgccacgtgg agcaccccag 601 cctccagaac cccatcatcg tggagtggcg ggctcagtct gaatctgccc agagcaagat 661 gctgagtggc attggaggct tcgtgctggg gctgatcttc ctcgggctgg gccttattat 721 ccatcacagg agtcagaaag ggctcctgca ctgactcctg agactatttt aactgggatt 781 ggttatcact tttctgtaac gcctgcttgt ccctgcccag aattcccagc tgcctgtgtc 841 agcctgtccc cctgagatca gagtcctaca gtggctgtca cgcagccacc aggtcatctc 901 ctttcatccc cacctcgagg ctgatggctg tgaccctgct tcctgcactt acccagagcc 961 tctgcctgtg cacggccagc tgcgtctact gaggccccaa ggggtttctg tttctattct 1021 ctcctcagac tgctcaagag aagcacatga aaaccattac ctgactttag agctttttta 1081 cataattaaa catgatcctg agtt // LOCUS HUMMHDR1C 1191 bp ss-mRNA PRI 26-MAY-1990 DEFINITION Human class II HLA-DRB1-BON mRNA, complete cds. ACCESSION M33600 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (haplotype DRB1-BON) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1191) AUTHORS Coppin,H.L., Avoustin,P., Fabron,J., Huchenq,A., Garnier,J.M., Thomsen,M. and De Preval,C. TITLE Evolution of the HLA-DR1 gene family: Structural and functional analysis of the new allele "DR-BON" JOURNAL J. Immunol. 144, 984-989 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 71 871 MHC HLA-DR1-BON precursor sigp 71 157 MHC HLA-DR1-BON signal peptide matp 158 868 MHC HLA-DR1-BON BASE COUNT 258 a 312 c 344 g 277 t ORIGIN 1 gcccaagtat caagagggag agtgagactt gcctgcttct ctggcccctg gtcctgtcct 61 gttctccagc atggtgtgtc tgaagctccc tggaggctcc tgcatgacag cgctgacagt 121 gacactgatg gtgctgagct ccccactggc tttggctggg gacacccgac cacgtttctt 181 gtggcagctt aagtttgaat gtcatttctt caatgggacg gagcgggtgc ggttgctgga 241 aagatgcatc tataaccaag aggagtccgt gcgcttcgac agcgacgtgg gggagtaccg 301 ggcggtgacg gagctggggc ggcctgatgc cgagtactgg aacagccaga aggacatcct 361 ggaagacgag cgggccgcgg tggacaccta ctgcagacac aactacgggg ttggtgagag 421 cttcacagtg cagcggcgag ttgagcctaa ggtgactgtg tatccttcaa agacccagcc 481 cctgcagcac cacaacctcc tggtctgctc tgtgagtggt ttctatccag gcagcattga 541 agtcaggtgg ttccggaacg gccaggaaga gaaggctggg gtggtgtcca caggcctgat 601 ccagaatgga gattggacct tccagaccct ggtgatgctg gaaacagttc ctcggagtgg 661 agaggtttac acctgccaag tggagcaccc aagtgtgacg agccctctca cagtggaatg 721 gagagcacgg tctgaatctg cacagagcaa gatgctgagt ggagtcgggg gcttcgtgct 781 gggcctgctc ttccttgggg ccgggctgtt catctacttc aggaatcaga aaggacactc 841 tggacttcag ccaacaggat tcctgagctg aaatgcagat gaccacattc aaggaagaac 901 cttctgtccc agctttgcag aatgaaaagc tttcctgctt ggcagttatt cttccacaag 961 agagggcttt ctcaggacct ggttgctact ggttcggcaa ctgcagaaaa tgtcctccct 1021 tgtggcttcc tcagctcctg cccttggcct gaagtcccag cattgatgac agcgcctcat 1081 cttcaacttt tgtgctcccc tttgcctaaa ccgtatggcc tcccgtgcat ctgtacctca 1141 ccctgtacga caaacacatt acattattaa atgtttctca aagatggagt t // LOCUS HUMMHDRBBB 1216 bp ss-mRNA PRI 26-MAY-1990 DEFINITION Human MHC class II HLA-DR beta-1 mRNA (DR2.3), 5'end. ACCESSION M32578 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human type I diabetic (Dw4/LD MN2), cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1216) AUTHORS Freeman,S.M., Saunders,T.L., Madden,M., Segall,M., Bach,F.H. and Wu,S. TITLE Comparison of DR beta-1 alleles from diabetic and normal individuals JOURNAL Hum. Immunol. 19, 1-6 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 62 862 MHC HLA-DR beta-1 precursor /nomgen="LV0063D" /map="6p21.3" /hgml_locus_uid="HLA-DRB1" sigp 62 148 MHC HLA-DR beta-1 signal peptide matp 149 859 MHC HLA-DR beta-1 BASE COUNT 265 a 331 c 341 g 279 t ORIGIN 1 agttctccct gagtgagact tgcctgctcc tctggcccct ggtcctgtcc tgttctccag 61 catggtgtgt ctgaagctcc ctggaggttc ctacatggca gtgctgacag tgacactgat 121 ggtgctgagc tccccactgg ctttggctgg ggacacccga ccatgtttct tgcagcagga 181 taagtatgag tgtcatttct tcaacgggac ggagcgggtg cggttcctgc acagaggcat 241 ctataaccaa caggagaacg tgcgcttcga cagcgacgtg ggggagtacc gggcggtgac 301 ggagctgggg cggcctgacg ctgagtactg gaacagccag aaggacatcc tggagcaggc 361 gcgggccgcg gtggacacct actgcagaca caactacggg gctgtggaga gcttcacagt 421 gcagcggcga gttgagccta aggtgactgt gtatcctgca aggacccaga ccctgcagca 481 ccacaacctc ctggtctgct ctgtgaatgg tttctatcca ggcagcattg aagtcaggtg 541 gttccggaac ggccaggaag agaaggctgg ggtggtgtcc acaggcctga ttcagaatgg 601 agactggacc ttccagattc tggtgatgct ggaaacagtt cctcggagtg gagaggttta 661 cacctgccaa gtggagcacc caagcgtgac gagccctctc acagtggaat ggagagcaca 721 gtctgaatct gcacagagca agatgctgag tggaatcggg ggctttgtgc tgggcctgct 781 cttccttggg gccgggctat tcatctactt caagaatcag aaagggcact ctggacttca 841 cccaacagga ctcgtgagct gaagtgcaga tgaccacatt caagggggaa ccttctgccc 901 cagctttgca tgatgaaaag ctttcctgct tggctcttat tcttccacaa gagaggactt 961 tctcaggccc tggttgctac cggttcagca actctgcaga aaatgtccat ccttgtggct 1021 tcctcagctc ctgcccttgg cctgaagtcc cagcattgat ggcagtgcct catcttcaac 1081 tttagtgctc ccctttacct aaccctacgg cctcccatgc atctgtactc cccctgtgcc 1141 acaaatggac tacgttatta aatttttctg aagcccagag ttaaaaatca tctgtccacc 1201 tggcaccaaa gacaaa // LOCUS J05239 162 bp ds-DNA BAD 26-MAY-1990 DEFINITION Figure 1. Sequence of the 166-bp restriction fragment. ACCESSION J05239 REFERENCE 1 (bases 1 to 162) AUTHORS Jones,B.K. and Yeung,A.T. TITLE dna base composition determines the specificity of uvrabc endonuclease incision of a psoralen cross-link JOURNAL J. Biol. Chem. 265, 3489-3496 (1990) STANDARD unannotated staff_entry COMMENT Bad entry: secondary reference to PNASU 75, 5314-5318 (1978): lac promoter sequence. FEATURES from to/span description BASE COUNT 40 a 43 c 40 g 39 t ORIGIN 1 cctccgttga gccatctgga tcggcagcgt tgtcttcatc aaccggaacg agcatgccgg 61 agagcagctc actcattagg caccccaggc tttacacttt atgcttccgg ctcgtataat 121 gtgtggaatt gtgagcggat aacaatttca cacaggaaac ag // LOCUS MLVENVB 2002 bp ss-RNA VRL 26-MAY-1990 DEFINITION Murine leukemia virus 10A1 derivative env gene, complete cds. ACCESSION M33470 KEYWORDS envelope protein. SOURCE Murine leukemia virus 10A1 derivative viral RNA, clone 10A1. ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 2002) AUTHORS Ott,D., Friedrich,R. and Rein,A. TITLE Sequence analysis of amphotropic and 10A1 murine leukemia viruses: Close relationship to mink cell focus-inducing viruses JOURNAL J. Virol. 64, 757-766 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 65 2002 env protein BASE COUNT 528 a 553 c 482 g 439 t ORIGIN 1 ggatccacgc cgctcacgta aaggcggcga caacccctcc ggccggaaca gcatcaggac 61 cgacatggaa ggtccagcgt tctcaaaacc ccttaaagat aagattaacc cgtggaagtc 121 cttaatggtc atgggggtct atttaagagt agggatggca gagagccccc atcaggtctt 181 taatgtaacc tggagagtca ccaacctgat gactgggcgt accgccaatg ccacctccct 241 tttaggaact gtacaagatg ccttcccaag attatatttt gatctatgtg atctggtcgg 301 agaagagtgg gacccttcag accaggaacc atatgtcggg tatggctgca aataccccgg 361 agggagaaag cggacccgga cttttgactt ttacgtgtgc cctgggcata ccgtaaaatc 421 ggggtgtggg gggccaagag agggctactg tggtgaatgg ggttgtgaaa ccaccggaca 481 ggcttactgg aagcccacat catcatggga cctaatctcc cttaagcgcg gtaacacccc 541 ctgggacacg ggatgctcca aaatggcttg tggcccctgc tacgacctct ccaaagtatc 601 caattccttc caaggggcta ctcgaggggg cagatgcaac cctctagtcc tagaattcac 661 tgatgcagga aaaaaggcta attgggacgg gcccaaatcg tggggactga gactgtaccg 721 gacaggaaca gatcctatta ccatgttctc cctgacccgc caggtcctca atatagggcc 781 ccgcatcccc attgggccta atcccgtgat cactggtcaa ctacccccct cccgacccgt 841 gcagatcagg ctccccaggc ctcctcagcc tcctcctaca ggcgcagcct ctatagtccc 901 tgagactgcc ccaccttctc aacaacctgg gacgggagac aggctgctaa acctggtaga 961 aggagcctat caggcgctta acctcaccaa tcccgacaag acccaagaat gttggctgtg 1021 cttagtgtcg ggacctcctt attacgaagg agtagcggtc gtgggcactt ataccaatca 1081 ttctaccgcc ccggccagct gtacggccac ttcccaacat aagcttaccc tatctgaagt 1141 gacaggacag ggcctatgca tgggagcact acctaaaact caccaggcct tatgtaacac 1201 cacccaaagt gccggctcag gatcctacta ccttgcagca cccgctggaa caatgtgggc 1261 ttgtagcact ggattgactc cctgcttgtc caccacgatg ctcaatctaa ccacagacta 1321 ttgtgtatta gttgagctct ggcccagaat aatttaccac tcccccgatt atatgtatgg 1381 tcagcttgaa cagcgtacca aatataagag ggagccagta tcgttgaccc tggcccttct 1441 gctaggagga ttaaccatgg gagggattgc agctggaata gggacgggga ccactgccct 1501 aatcaaaacc cagcagtttg agcagcttca cgccgctatc cagacagacc tcaacgaagt 1561 cgaaaaatca attaccaacc tagaaaagtc actgacctcg ttgtctgaag tagtcctaca 1621 gaaccgaaga ggcctagatt tgctcttcct aaaagaggga ggtctctgcg cagccctaaa 1681 agaagaatgt tgtttttatg cagaccacac gggactagtg agagacagca tggccaaact 1741 aagggaaagg cttaatcaga gacaaaaact atttgagtca ggccaaggtt ggttcgaagg 1801 gcagtttaat agatccccct ggtttaccac cttaatctcc accatcatgg gacctctaat 1861 agtactctta ctgatcttac tctttggacc ctgcattctc aatcgattgg tccaatttgt 1921 taaagacagg atctcagtgg tccaggctct ggttttgact caacaatatc accagctaaa 1981 acctatagag tacgagccat ga // LOCUS MLVENVC 2001 bp ss-RNA VRL 26-MAY-1990 DEFINITION Murine leukemia virus env gene, complete cds. genes. ACCESSION M33469 KEYWORDS envelope protein. SOURCE Murine leukemia virus viral RNA, clone 4070A. ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 2001) AUTHORS Ott,D., Friedrich,R. and Rein,A. TITLE Sequence analysis of amphotropic and 10A1 murine leukemia viruses: Close relationship to mink cell focus-inducing viruses JOURNAL J. Virol. 64, 757-766 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 37 2001 env protein BASE COUNT 532 a 560 c 472 g 437 t ORIGIN 1 ggccgacacc cagagtggac catcctctgg acggacatgg cgcgttcaac gctctcaaaa 61 ccccctcaag ataagattaa cccgtggaag cccttaatag tcatgggagt cctgttagga 121 gtagggatgg cagagagccc ccatcaggtc tttaatgtaa cctggagagt caccaacctg 181 atgactgggc gtaccgccaa tgccacctcc ctcctgggaa ctgtacaaga tgccttccca 241 aaattatatt ttgatctatg tgatctggtc ggagaggagt gggacccttc agaccaggaa 301 ccgtatgtcg ggtatggctg caagtacccc gcagggagac agcggacccg gacttttgac 361 ttttacgtgt gccctgggca taccgtaaag tcggggtgtg ggggaccagg agagggctac 421 tgtggtaaat gggggtgtga aaccaccgga caggcttact ggaagcccac atcatcgtgg 481 gacctaatct cccttaagcg cggtaacacc ccctgggaca cgggatgctc taaagttgcc 541 tgtggcccct gctacgacct ctccaaagta tccaattcct tccaaggggc tactcgaggg 601 ggcagatgca accctctagt cctagaattc actgatgcag gaaaaaaggc taactgggac 661 gggcccaaat cgtggggact gagactgtac cggacaggaa cagatcctat taccatgttc 721 tccctgaccc ggcaggtcct taatgtggga ccccgagtcc ccatagggcc caacccagta 781 ttacccgacc aaagactccc ttcctcacca atagagattg taccggctcc acagccacct 841 agccccctca ataccagtta ccccccttcc actaccagta caccctcaac ctcccctaca 901 agtccaagtg tcccacagcc acccccagga actggagata gactactagc tctagtcaaa 961 ggagcctatc aggcgcttaa cctcaccaat cccgacaaga cccaagaatg ttggctgtgc 1021 ttagtgtcgg gacctcctta ttacgaagga gtagcggtcg tgggcactta taccaatcat 1081 tccaccgctc cggccaactg tacggccact tcccaacata agcttaccct atctgaagtg 1141 acaggacagg gcctatgcat gggggcagta cctaaaactc accaggcctt atgtaacacc 1201 acccaaagcg ccggctcagg atcctactac cttgcagcac ccgccggaac aatgtgggct 1261 tgcagcactg gattgactcc ctgcttgtcc accacggtgc tcaatctaac cacagattat 1321 tgtgtattag ttgaactctg gcccagagta atttaccact cccccgatta tatgtatggt 1381 cagcttgaac agcgtaccaa atataaaaga gagccagtat cattgaccct ggcccttcta 1441 ctaggaggat taaccatggg agggattgca gctggaatag ggacggggac cactgcctta 1501 attaaaaccc agcagtttga gcagcttcat gccgctatcc agacagacct caacgaagtc 1561 gaaaagtcaa ttaccaacct agaaaagtca ctgacctcgt tgtctgaagt agtcctacag 1621 aaccgcagag gcctagattt gctattccta aaggagggag gtctctgcgc agccctaaaa 1681 gaagaatgtt gtttttatgc agaccacacg gggctagtga gagacagcat ggccaaatta 1741 agagaaaggc ttaatcagag acaaaaacta tttgagacag gccaaggatg gttcgaaggg 1801 ctgtttaata gatccccctg gtttaccacc ttaatctcca ccatcatggg acctctaata 1861 gtactcttac tgatcttact ctttggacct tgcattctca atcgattggt ccaatttgtt 1921 aaagacagga tctcagtggt ccaggctctg gttttgactc agcaatatca ccagctaaaa 1981 cccatagagt acgagccatg a // LOCUS MTYRPVP 6331 bp ss-RNA VRL 26-MAY-1990 DEFINITION Eggplant mosaic virus genome. ACCESSION J04374 KEYWORDS replicase protein; virion protein. SOURCE Eggplant mosaic tymovirus viral RNA. ORGANISM Eggplant mosaic virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Tymovirus. REFERENCE 1 (bases 1 to 6331) AUTHORS Osorio-Keese,M.E., Keese,P. and Gibbs,A. TITLE Nucleotide sequence of the genome of eggplant mosaic tymovirus JOURNAL Virology 172, 547-554 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 102 2051 overlapping out-of-phase protein pept 109 5628 replicase protein (putative) pept 5633 6199 virion protein BASE COUNT 1337 a 2441 c 987 g 1566 t ORIGIN 1 gtaatcagaa ccagaactaa ccctgttatc agccttagtt cttttacttt cctgtccaaa 61 tttctgaacc gactagtgcc ttcctagaac ccactacgtc aatgcctcat ggcctttcag 121 tctgctctcg aagctctcaa ctcaactact cacagagatg cttctacaaa tccaattctg 181 aactccgtcg tggaacctct ccgcgactct ctatccctat atccctggct ccttcccaaa 241 gaagccgttc cccaccttct atcctggggc atcccgaact ccggcctcgg agtcactccc 301 cacccccacc caatccacaa aacagtcgag acttttctcc tgttcaatca ctggcatgct 361 ctcgctcgcc tgccttcaac tgtgatgttc atgaaaccgt ccaagtttca aaaacttgcg 421 gctctaaacc caaaattcca agagttgatc aactttcgac tcactgccgc cgacaccact 481 cgctacccct ccacctcact cacttttcca agcaattcaa tttgcttcat gcacgatgct 541 ctgatgtact tttctccagc tcagatcgtc gatctcttca ctcagtctcc cgcactcgag 601 accctgtact gcagtctcat agtgcctcca gagtctcatt tcacagatct ctctctcttc 661 cccgagatct acacttacaa gatctcaggt cagactctcc actacatccc ggagaatcac 721 cactccggct cgtacaatca gcccctccaa gccccatctt ggctgaagat ttcctccatc 781 ctctcgcctt ccctcgcttt gtctgtgacc aagctggaat cttggggccc agtccactcc 841 atattgatcc agcgaggcct accaccaaag ccctctctct ctgcacgccc ccccgtcctg 901 ccaaatcaac ctccccgtgc aacaactccc aactcccaaa accaactgct gcatcagaca 961 agccagctat tcttccaact gcagcagcct caactcagcc tggtctcctt ccgaattcca 1021 gactgcgtag aactgccaca agccaccttt ctgcgccaac ctctccgcca ccggctagtg 1081 ccaacaagcg tttacaacgc tctcttcacc tacactcgcg cagtccgcac tcttcgcact 1141 tccgacccag ccggatttgt gcgaactcaa agcaacaaac ccgagcacgc ttgggtcact 1201 ccaaacgcgt gggacaatct gcagaccttg tctgtcaatg ccccccaccg cccccaagta 1261 tgctaccact tcttctcctc ccccgtggca aggttaaagc tccacttcgc ccaacactgg 1321 cgagcctatc ttttggctct caccccattc cttaccacgt cacctcttct cctcccctta 1381 ttcaatttca acaccccttt ccccctccct cggctacttt ctctgtttcg ccgctcggtg 1441 tcctcaccac ggcttttgca ctcaatccta cccagtcagc tgagaggagc tgcgatcccg 1501 aatcgcccac tcccactctg ggtcacaaaa ctacatcact ttctcgactc ccactccctc 1561 ctccccactc cccccattcg gcccaggata gagcttcagc gcttgccact gatgtctcta 1621 attccgaaac caaaaattgt ccttccccta ctgtccctcc tcctttcctc cccaaccatc 1681 tacatccact tcttccaggc acagaccccc caacaactcc acgacaatta tcaccttcac 1741 cttcatccct ctcgcttcga actttcctgg actctgcagt catatcatgt gactcaagcc 1801 cagtccttcc tccctctcct tctcccagct cccactcaag ctcaagcttc caatcctgca 1861 cctcgccccc ccgctttcca tgctatcccc ctcccccctc agccctcgac ctcctcttct 1921 cctccactcc aggaaccgac cctttccccc cacctgatac acccccccct cacaagagaa 1981 ccatcgccct tgaacggctg cgcctgcgac agtgcgctac tcccttccac agctgcgatg 2041 acgtctgctg aacatcccac tccactcaac ccccccacac ctagcccaac accagacgtc 2101 cctcctcccg actcacccgg taacccatca cttttgaagc aagtccctcc cgaagcgaac 2161 ttgcatccta tccacaaccc agacctcccc tcttccacca ctcttccttc tggggccctg 2221 acactggtcc cagccaaaac tccttccatc tacgccaatc ccaccccccc cagttcccat 2281 ccgttcaccc cactggctga tgaccccact gctgtgggtc cttgcctacc gttccacgtt 2341 ctccacccgg ctgactactt tcctctttca gccgagtttc tcacacggac ccggcatgtc 2401 cccccctctt ctctctcaca tccaaaactc aattgcctac tcacctgctt ttctgaactt 2461 tcaggacact ctgagtcaga tctttggttg tccctgcaat caatacttcc tgactcccaa 2521 ctccaaaatc ctgaagtctc gacacttggc ctgtccactg acattctcac agctctctgc 2581 ttcatctacc attcatctgt gactctccat gccccctcag gagtttatca ctacggcata 2641 gcctcctctt ctaccgtcta tgtcatccac tatcaaccag gccctcctcc tcatttttct 2701 ctctccccta gacttgccgc ttctgctcct cgctgcaacc ccaccaacag cagattggtc 2761 agacaagctc tgcggtttaa attgaacggc gagtttctcc ccttcaccca ggcttacgcg 2821 catgaatctt ccatcaccca tgccaaaaac ctcatctcca acatgaagaa tggttttgat 2881 ggaatcatgt cttctctcac tgactcctct aagggtccct ccccccgtga aaaactgacc 2941 actctcgact ctctcataga tgtcgctgcc cctcgcgaag tttctctcat ccacatcgcc 3001 ggcttcgcag gctgcggcaa gacccacccc atccaaaaac tcctccaaac ttcccctttt 3061 cacgacttcc gaatctcatg ccccactaat gaactccgat ccgaatggaa gcgtgatatg 3121 caaccaacag ctgaaaatgt ttggaggttc tccacatggg aatccagcct gctcaaacat 3181 tccgagatcc tcgtaatcga cgagatttac aagctccctc gtggctacct agatctctcc 3241 atccttgctg atccaactct ctccttggtc atcatccttg gtgaccctct ccaaggagag 3301 tatcactcga cctctcctca cagctccaat cactttcttc caagtgaggt ccaccgcttc 3361 aagtcttaca tcgactgcta ctgtttttgg tcccaccgca ttccaaagca gatagcatcc 3421 ttgttcggcg tagtatgcca caacacgaac gaaggtttcg tgagagccct cacatctcat 3481 ccccccaatt ccaaaaacct caccaatgcg accaacactg ctctcagtct ccaacagatg 3541 ggccaccacg ctatcaccat cagcgccaga agggtcacct tcaccgaggc ccatacaatt 3601 ctgcttgatc gtcataccaa ccttctctcc cccaacaact gtcttgttgc cctcacccgc 3661 agccgcactg gcgtctactt cgtcggcaat ctgcacctgg catcaaacag ctttggcaca 3721 aactacatgt tctctcaagc tctctgccaa ggcacaatcg acctaaacaa cgtgttcccc 3781 cacatcatgc ctcacctccc gaaaatgtat gaacccatcc gctcccggtc caaccgtttt 3841 gtgtctgggt ccctcaattt tcgaccaacc accaattccc gcctcctttc cagtctcact 3901 aagccaaccc acctcccccc tcacatccct accaaccact ccctggatgt cctagtttcc 3961 aaccctgtgc tccttggtga gaccctcgac cctcgattgg aggtcctcca cctcccccca 4021 actcgcctcc cattgcatct ggacctcctg cccacagtac cttcctcttc cagcttctcc 4081 tcagtcgacc atcttttccc aacccccatc tcccccgcta tctgcggcta caccttcgaa 4141 aatttggccg cattcttcct cccagctcat gacccggacc taaaggaggt gctcatcaat 4201 gaccaaaaga gcaaccagtt cccatacttg gacgcccctt ttgagctttc gtgccaaccc 4261 tcctcactgt tggcaccaat tcacaagccg gcctcggatc caacccttct ccctggctcc 4321 atcaagaaac gcctcagatt ccgcgcttct tcctccccat attccatcac tccatctgat 4381 caacttcttg gtcaacacct cttctcttct ttgtgcctgg cttatgggcg caaccccaat 4441 tctgtcctcc ccttccaacc tgagctcttc agtgagtgca tatgcattaa tgattacgct 4501 caactctcct ccaagactca agccaccatc gtggccaatc atcaaaggtc tgatcctgac 4561 tggcgcctaa ctgctgtccg catctttgcc aaggctcaac acaaagtaaa cgacgcttcc 4621 atcttttccg ggtggaaggc ttgccaaact ctagccctga tgcacggtta catcattctc 4681 gtactcggcc cagtcaagaa ataccaacgc atttttgatt ccaaggacag acctccccac 4741 atctactacc actgcggtaa aactccctcc cagctctccc aatggtgcca aactcacctt 4801 tctggctctt cctacatcgc caacgactac actgcctttg atcagtccca acacggcgag 4861 gctgtggtcc tggaatgttt gaagatgcgc cgcctctcca tcccggactc tctcattcag 4921 ctccactccc acctcaagtg ttccgtcgac acccagttcg gccccctcac ctgcatgcgc 4981 ctcactggcg agccgggcac ttatgatgac aactctgact acaacctagc tgtcatctac 5041 tcccaatact ccctcaatgg ccaccccatt ctgatctcag gcgatgactc cgtcctttgc 5101 ggcacaccgc ccccttctcc actttggccc actctcaaga aaatgcttca tctccgtttc 5161 aagatcgaac ggacctccca ccccctcttc tgcgggtatt acgtctcccc tcatggcgct 5221 gcccgcaacc cgtatgctct cttcgccaag ctcatgatct gcgttgatga caagagcctc 5281 catgacaaga agttgtccta tctctctgaa ttctccactg gccatctggc tggcgacctg 5341 gtcacctcca ttctcccttc ccacctactt ccctatcagt ccgccgtgca cgacttcttc 5401 tgccggaatt gcacgcccgc ggaaaaaatt ctcctgtctc tggacccaat ccctgagtcc 5461 aaaatcctcc agctcattct caaagttcgc tgggcttctc aagctttctt ttcctacctg 5521 cctcaaaaag ctcgcgaact ccttgtggca cgctcttctc tcccgtccct ctattccaat 5581 cccaaagtct ctcaactgga gtctgaattg cttcccttct ctcaatagat caatggaaga 5641 cacagcaatc atcagaagcc ctcagccctc cataaacgca ccaggcttcc atctgccacc 5701 caccgactca caacaatcct ctgctattga actccccttc cagtttcagg ccaccacttt 5761 tggcgcgact gaaacagctg ctcaaatcag tctggcctcc gccaacgcta ttaccaagct 5821 cgcgtctctc taccgccatg tgcggctcac gcagtgcgct gccaccatca ctccgacagc 5881 ggccgccatt gccaatcctc tcactgtcaa catcgtctgg gtgtctgaca attccactgc 5941 caagcccacc gagattctca atgtctttgg tggatcttcc tacacgtttg gcggcgccct 6001 caatgccacc aagcccctta ccatccctct ccccatgaac tcggtcaact gtatgctcaa 6061 ggactctgtt ctttacacag attgcccaaa gctcctggcc tactcagctg ctcccagctc 6121 tccctccaaa accccaaccg ccactatcca aatccatggc aagctccgct tgtcctcccc 6181 cctcctccaa gccaattaac tctctctccc tcagccacca cctcgctcct cccccatctc 6241 ctatggtaat tgcggacagt tccgctccct ctagcacaca gaggtccatt tgggtgcgac 6301 tcccccccct cccgtgggtc aacgggaacc a // LOCUS RATRGHA 542 bp ds-DNA ROD 26-MAY-1990 DEFINITION Rat growth hormone (rGH) gene, intron B repetitive DNA. ACCESSION M32696 KEYWORDS repetitive DNA. SOURCE Rat (strain Sprague-Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 542) AUTHORS Guitierrez-Hartmann,A., Lieberburg,I., Gardner,D., Baxter,J.D. and Cathala,G.G. TITLE Transcription of two classes of rat growth hormone gene-associated repetitive DNA: Differences in activity and effects of tandem repeat structure JOURNAL Nucleic Acids Res. 12, 7153-7173 (1984) STANDARD simple staff_entry BASE COUNT 199 a 114 c 118 g 111 t ORIGIN 1 aacagtaatg acagagaggg ctggagagat ggctcagtgg ttaagagcac ccgactgctc 61 ttccaaaggt cctgagttca attccagcaa ccacatggtg gctcacaacc atctgtaaag 121 agatccgatg ccctcttctg gtgtgtctga agacagctac agtgtactta tataataaac 181 aaataaatct ttaaaaaaaa aaacaaaaac ggggctggag agatggctca gcggttaaga 241 gcgcccgact gctcttccag aggtcatgag ttcaattcca gcaaccacat ggtggctcac 301 aaccatctgt aaagagatct gatgccctct tctggtgtat ctgaagacag ctacagtgta 361 cttatatata ataaataaat aaatctttaa aaaaaaaaca aaacaggggc tggggattta 421 gctcagtggt agagcgctta cctaggaagc gcaaggccct gggttcggtc cccagctccg 481 aaaaaaagaa ccaaaaaaaa aaaaaaaaac caaaacaaaa acaaaacagt aatgacagag 541 ag // LOCUS ALRVSRC 1801 bp ss-RNA VRL 26-MAY-1990 DEFINITION Rous sarcoma virus (Schmidt-Ruppin D strain) v-src gene, complete cds. ACCESSION M33292 KEYWORDS oncogene; pp60v-src; src gene; tyrosine kinase. SOURCE Rous sarcoma virus (strain Schmidt-Ruppin D) RNA, clone psrc1. ORGANISM Rous sarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian sarcoma viruses. REFERENCE 1 (bases 1 to 1801) AUTHORS Reddy,S., Mazzu,D., Mahan,D. and Shalloway,D. TITLE Sequence and functional differences between Schmidt-Ruppin D and Schmidt-Ruppin A strains of pp60v-src JOURNAL Unpublished (1990) 406 S. Frear Bldg, University Park, PA 16802 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.I.Shalloway, 26-MAR-1990. FEATURES from to/span description pept 65 1645 pp60v-src protein BASE COUNT 392 a 537 c 551 g 321 t ORIGIN 1 actctgctgg tggcctcgcg taccactgtg gccaagcggt agctggaacg tgcagccgac 61 caccatgggg agtagcaaga gcaagcctaa ggaccccagc cagcgccggc gcagcctgga 121 gccacccgac agcacccacc acgggggatt cccagcctcg cagaccccca acaagacagc 181 agcccccgac acgcaccgca cccccagccg ctccttcggg accgtggcca ccgagcccaa 241 gctcttcgag gacttcaaca cttctgacac cgttacgtcg ccgcagcgtg ccggggcact 301 ggctggcggc gtcaccactt tcgtggctct ctacgactac gagtcctgga ttgaaacgga 361 cttgtccttc aagaaaggag aacgcctgca gattgtcaac aacacggaag gtaactggtg 421 gctggctcat tccgtgacta caggacagac gggctacatc cccagtaact atgtcgcgcc 481 ctcagactcc atccaggctg aagagtggta ctttgggaag atcactcgtc gggagtccga 541 gcggctgctg ctcaaccccg aaaacccccg gggaaccttc ttggtccggg agagcgagac 601 gacaaaaggt gcctattgcc tctccgtttc tgactttgac aacgccaagg ggctcaatgt 661 gaagcactac aagatccgca agctggacag cggcggcttc tacatcacct cacgcacaca 721 gttcagcagc ctgcagcagc tggtggccta ctactccaaa catgctgatg gcttgtgcca 781 ccgcctgacc aacgtctgcc ccacgtccaa gccccagacc cagggactcg ccaaggacgc 841 gtgggaaatc ccccgggagt cgctgcggct ggaggtgaag ctggggcagg gctgctttgg 901 agaggtctgg atggggacct ggaacggcac caccagagtg gccataaaga ctctgaagcc 961 cggcaccatg tccccggagg ccttcctgca ggaagcccaa gtgatgaaga agctccagca 1021 tgagaagctg gttcaactgt acgcagtcgt gtcggaagag cccatctaca tcgtcattga 1081 gtacatgagc aaggggagcc tcctggattt cctgaaggga gagatgggca agtacctgcg 1141 gctgccacag ctcgttgata tggctgatca gattgcatcc ggcatggcct atgtggagag 1201 gatgaactac gtgcaccgag acctgcgggc ggccaacatc ctggtggggg agaacctggt 1261 gtgcaaggtg gctgactttg ggctggcacg cctcatcgag gacaacgagt acacagcacg 1321 gcaaggtgcc aagttcccca tcaagtggac agcccccgag gcagccctct atggccggtt 1381 caccatcaag tcggatgtct ggtccttcgg catcctgctg actgagctga ccaccaaggg 1441 ccggatgcca tacccaggga tgggcaacgg ggaggtgctg gaccgggtgg agaggggcta 1501 ccgcatgccc tgcccgcccg agtgccccga gtcgctgcat gaccttatgt gccagtgctg 1561 gcggagggac cctgaggagc ggcccacttt tgagtacctg caggcccagc tgctccctgc 1621 ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt taagctacaa caaggcaagg 1681 cttggccgac aattgcatga agaatctgct tagggttagg cgttttgcgc tgcttcgcga 1741 tgtacgggcc agatatacgc gtatctgagg ggactagggt gtgtttaggc gaaaagcggg 1801 g // LOCUS AVIH2AA 3800 bp ds-DNA BCT 26-MAY-1990 DEFINITION A.vinelandii H2 uptake hydrogenase (hoxK), complete cds, and H2 uptake hydrogenase (hoxG), complete cds. ACCESSION M33152 KEYWORDS H2 uptake hydrogenase. SOURCE A.vinelandii (strain OP) DNA, clone pALM21. ORGANISM Azotobacter vinelandii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 3800) AUTHORS Menon,A.L., Stultz,L.W., Robson,R.L. and Mortenson,L.E. TITLE Cloning, nucleotide sequence and characterization of the (NiFe) hydrogenase structural genes and hoxG from Azotobacter vinelandii JOURNAL Unpublished (1990) U of Georgia, Dep Biochemistry, Athens, GA 30602 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.L.Robson, 22-MAR-1990. FEATURES from to/span description pept 149 1225 H2 uptake hydrogenase (hoxK) precursor sigp 149 283 H2 uptake hydrogenase signal peptide (put.) matp 284 1222 H2 uptake hydrogenase pept 1222 3030 H2 uptake hydrogenase (hoxG) ORF 3047 3769 ORF3 BASE COUNT 686 a 1318 c 1213 g 583 t ORIGIN 1 tgtatcaagc catgacaaaa acatggcatt ggcgcattat tcgtgcggtt ttcattcagc 61 aaccgtgggc catacaaccg gcgcgccgtc atagccgaag gacggtgcgc aggggcgccg 121 ataacgacct ggccacaagg gtaacggcat gtctcgactc gaaactttct atgacgtgat 181 gcggcgtcag ggcatcacgc gccgcagctt tctcaaatat tgcagcctga ccgccgcggc 241 cctgggcctc ggcccggcct tcgccccgcg gatcgcccac gcgatggaaa ccaagccgcg 301 cactccggtg ctctggctgc acggcctgga gtgcacctgc tgctccgagt cgttcatccg 361 ttcggcccac ccgctggtca aggacgtggt gctgtcgatg atctcgctgg actacgacga 421 caccctgatg gccgccgccg gccaccaggc cgaggccgcc ctcgaagaga ccatgcgcaa 481 gtacaagggc gagtacatcc tcgccgtgga gggcaacccg ccgctcaacg aggacggcat 541 gttctgcatc gtcggcggca agccgttcat cgagcagctc aggcatgtgg cgaaggacgc 601 caaggcggtg atcgcctggg gcagttgcgc cagttggggc tgcgtgcagg cggcccggcc 661 caacccgacc caggcggtgc cgatccacaa ggtcatcacc gacaagccga tcgtcaaggt 721 gcccggctgc ccgccgatcg ccgaggtgat gaccggggtg atcacctaca tgctgacctt 781 cggcaagctg cccgagctgg accgccaggg gcggccgaag atgttctacg gccagcgcat 841 ccacgacaag tgctaccgcc gcccgcactt cgacgccggc cagttcgtcg agcactggga 901 cgacgagggc gcgcgcaagg gctactgcct gtacaaggtc ggctgcaagg gcccgaccag 961 ctacaacgcc tgctcgacgg tgcgctggaa cgagggcact tccttcccga tccaggccgg 1021 ccacggctgc atcggctgct cggaggacgg tttctgggac aagggctcgt tctatgaacg 1081 cctgaccacc attccgcagt tcggcatcga gaagaacgcc gacgaaatcg gcgccgccgt 1141 cgccggcggg gtcggcgcgg ccatcgccgc gcatgccgcg gtcaccgcca tcaagcgcct 1201 gcagaacaag ggggatcgcc catgagcagc ctgccgaacg ccagccaact ggacaagtcc 1261 ggcaggcgca tcgtcgtcga cccggtgacc cgcatcgagg gccacatgcg ctgcgaggtc 1321 aacgtcgacg ccagcaacgt gatcaccaac gccgtctcca ccggcaccat gtggcgcggc 1381 ctggaggtca tcctcaaggg ccgcgacccg cgcgacgcct gggccttcgt cgagcgcatc 1441 tgcggcgtct gcaccggcac ccatgcgctg acctcggtgc gcgcggtgga ggatgccctg 1501 gacatccgca tcccctacaa cgcccacctg atccgcaacc tgatggacaa gacgctgcag 1561 gtgcacgacc acatcgtgca cttctaccac ctgcacgcgc tggactgggt caacccggtc 1621 aacgccctga aggccgatcc caaggctacc tccgccctgc agcaggcggt ttcgccggcc 1681 catgccaagt ccagccccgg ctacttccgc gacgtgcaga cgcgcctgaa gaagttcgtc 1741 gagagcggcc agctcggcct gttctccaac ggctactggg acaatccggc ctacaagctg 1801 ccgcccgagg cggacctgat ggccgtggcc cactacctgg aggcgctgga cctgcagaag 1861 gacatcgtca agatccatac catcttcggc ggcaagaacc cgcatccgaa ctacatggtc 1921 ggcggcgtgg cctgcgccat caacctggac gacgtcggcg ccgccggcgc gccggtcaac 1981 atgaccagcc tgaacttcgt cctcgaacgc atccacgagg cccgcgagtt caccaggaac 2041 gtctacctgc cggacgtgct ggcggtcgcc gggatctaca aggactggct gtacggcggc 2101 ggtctggccg cgcacaacct gctgtcctac ggcaccttca ccaaggtgcc ctacgacaag 2161 tccagcgacc tgttgccggc cggcgccatc gtcggcggca attgggacga ggtgctgccg 2221 gtcgacgtgc gcgatcccga ggagatccag gagttcgtca gccactcctg gtacagctac 2281 gccgacgaaa ccaaggggct gcatccctgg gacggcgtca ccgagccgaa attcgagctc 2341 ggcccgaaca ccaagggcag ccgcacccac atccaggaaa tcgacgaggc gcacaagtac 2401 agctggatca aggcgccgcg ctggcgcggc cacgctatgg aggtcggccc gctggcacgt 2461 tacatcatcg cctacgcttc gggccgcgaa tacgtgaagg aacaggtcga ccgctcgctg 2521 gccgccttca accagagcac cggcctgaac ctcggcctca agcagttcct gccctcgacc 2581 ctcggccgca ccctggcgcg cgccctggag tgcgagctgg cggtggacag catgctcgac 2641 gactggcagg ccctggtcgg caacatcaag gccggcgacc gcgccaccgc caacgtcgag 2701 aagtgggacc cgagcacctg gccgaaggag gccaagggcg tgggcatcaa cgaggcgccg 2761 cgcggcgccc tgggccactg gatcaggatc aaggacggca agatcgagaa ctaccaggcg 2821 atcgtgccga ccacctggaa cggcaccccg cgcgaccatc tgggcaacat cggcgcctac 2881 gaggccgcgc tgctcaacac caggatggag cgcccggacg agccggtgga gatcctgcgc 2941 accctgcaca gcttcgaccc ctgcctggcc tgttcgaccc acgtgatgtc gccggacggc 3001 caggagctga cccgggtgaa ggtccgctga accggaggat tgcgcgatgg cactggaaaa 3061 atccctggaa accggcgacg gccaggagaa ggtccgcaag cagaccgcgg tgtacgtcta 3121 cgaggcgccg ctgcgcctct ggcactgggt cacggcgctg tccatcgtcg tgctcggcgt 3181 gaccggctac ttcatcggcg cgccgctgcc gacgatgccc ggcgaggcga tggacaacta 3241 cctgatgggc tacatccgct tcgcccactt cgccgccggc tacgtgctgg cgatcggctt 3301 cctcggccgg gtctactggg ccttcgtcgg caaccaccac gcccgcgagc tgttcctcgt 3361 gccggtgcac cgcaaggcct ggtggaagga gctgtggcac gaggtgcgct ggtacctgtt 3421 cctggaaaag accccgaaga agtacatcgg ccacaacccc ctgggccagt tggcgatgtt 3481 ctgcttcttc gtggtcggcg cggtgttcat gagcgtcacc ggcttcgccc tctacgccga 3541 ggggctgggg cgggacagct gggccgaccg gctgttcggc tgggtgatcc cgctgttcgg 3601 ccagagccag gacgtgcaca cctggcacca cctgggcatg tggtacctcg tcgtcttcgt 3661 catggtgcat gtctacctgg ccgtgcgcga agacatcgtt tcccggcagt cgctgatctc 3721 caccatggtc ggcggctggc ggatgttcaa ggacgaccgg ccggattgag ccccgtgtcg 3781 tcccttccgt ccgggccggt // LOCUS RABIGHAS 402 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-1. ACCESSION M29412 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 402) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept 1 > 402 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 402 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 355 356 D-region end/J-region start BASE COUNT 86 a 108 c 118 g 90 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagcctctg gattctccct cagtagttac tacatgcaat gggtccgcca ggctccaggg 181 aaggggctgg aatggatcgg aatcattggt agtagtggta gcacatacta cgcgagctgg 241 gtgaagggcc gattcaccat ctccaaaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacctatttc tgtgccagag catatattag taatactgat 361 ggttctggct ttaacttgtg gggccaaggc accctggtca cc // LOCUS RABIGHAT 399 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-3-1. ACCESSION M29413 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-3-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 399) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept 1 > 399 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 399 Ig mu-chain recomb 342 343 V-region end/D-region start recomb 361 362 D-region end/J-region start BASE COUNT 88 a 114 c 111 g 86 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccggcgg tcgcctggta aagcctgacg aatccctgac actcacctgc 121 acagcctctg gattctccct cagtacctac aacatgatct gggtccgcca ggctccagga 181 aaggggctgg aatacatcgg ccacattagt tttggtggta gcacatacta cgcgagctgg 241 gcgaaaggtc gatgcaccat atccaaaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacctatttc tgtgccaggg gatggactcc taaaagtctt 361 tcagccttta acttgtgggg cccaggcacc ctggtcacc // LOCUS RABIGHAU 390 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 1-5. ACCESSION M29414 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 1-5. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 390) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept 1 > 390 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 390 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 368 369 D-region end/J-region start BASE COUNT 86 a 100 c 119 g 85 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccgggga tcgcctggtc acgcctggga cacccctgac actcacatgc 121 acagtctctg gattctccct caatagttat gtagtgggct gggtccgcca ggctccagag 181 aagggactgg aatacatcgg aaccatttgg gtcgatggta agacatacta cgcgagctgg 241 acgaagggcc gattcaccat ctctaaaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacatatttc tgtgccagat atggtagtag tggtgattta 361 ggcgtgtggg gccaagggac cctggtcacc // LOCUS RABIGHAV 351 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 2-1. ACCESSION M29415 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 2-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 351) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept < 1 > 351 Ig mu-chain V-D-J precursor (AA at 1) sigp < 1 21 Ig mu-chain signal peptide matp 22 > 351 Ig mu-chain recomb 306 307 V-region end/D-region start recomb 339 340 D-region end/J-region start BASE COUNT 77 a 101 c 100 g 73 t ORIGIN 1 gtgctcaaag gtgtccagtg tcagtcgctg gaggagtccg ggggtcgcct ggtcacgcct 61 gggacacccc tgacactcac ctgcacagcc tctggattct ccctcagtag ctactggatg 121 acctgggtcc gccaggctcc agggaagggg ctggaatgga tcggaatcat tgttcatggt 181 gatagcgcat actacgcgag ctgggcgaaa ggccgattca ccatctccag aacctcgacc 241 acggtggatc tgaaaatcac cagtccgaca accgaggaca cggccaccta tttctgtgcc 301 agagaatatt atggtactat taacttgtgg ggcccaggca ccctggttac c // LOCUS RABIGHAW 408 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-2. ACCESSION M29416 KEYWORDS diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-2. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 408) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept 1 > 408 Ig gamma-chain V-D-J precursor sigp 1 57 Ig gamma-chain signal peptide matp 58 > 408 Ig gamma-chain recomb 342 343 V-region end/D-region start recomb 369 370 D-region end/J-region start BASE COUNT 77 a 112 c 127 g 92 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcgctggagg agtccggggg tcgcctggtc acgcctggga catccctgac actcacctgc 121 acagtctctg gattctccct cagtactagt gcaatggcct gggtccgcca ggctccaggg 181 aaggggctgg aatatgtcgg agtcattagt ggaagtggtg gcacatacta cgcgagctgg 241 gcgagcggcc ggttcaccat ttccaaagcc tcgtcgacca cggtggatct gaaaatgacc 301 agtctgacaa ccgaggacac ggccacctat ttctgtgcca gagtcaggga tagtcatggt 361 tatattggtg atgcttttga tccctggggc ccaggcaccc tggtcacc // LOCUS RABIGHAX 390 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-3-1. ACCESSION M29417 KEYWORDS diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-3-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 390) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept 1 > 390 Ig gamma-chain V-D-J precursor sigp 1 57 Ig gamma-chain signal peptide matp 58 > 390 Ig gamma-chain recomb 339 340 V-region end/D-region start recomb 356 357 D-region end/J-region start BASE COUNT 77 a 110 c 120 g 83 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcgctggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagtctctg gattctccct cagtagtcgc tggatgagct gggtccgcca ggctccaggg 181 gaggggctgg aatccatcgg agccattgat actggtggta gcgcatacta cgcgaactgg 241 gtgaaaggcc gactcaccat ctccaaaacc tcgtcgacca cggtggattt gaaaatgacc 301 agtctgacaa ccgaggacac ggccacctat ttctgtgcca gagattatag tggtggactt 361 gacttgtggg gcacaggcac cctggtcacc // LOCUS RABIGHAY 399 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged gamma-chain mRNA V-D-J region, clone 3-4. ACCESSION M29418 KEYWORDS diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 3-4. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 399) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept 1 > 399 Ig gamma-chain V-D-J precursor sigp 1 57 Ig gamma-chain signal peptide matp 58 > 399 Ig gamma-chain recomb 339 340 V-region end/D-region start recomb 364 365 D-region end/J-region start BASE COUNT 83 a 116 c 116 g 84 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcgctggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagcctctg gattcaccat cagtagctac cacatgatct gggtccgcca ggctccaggg 181 gaggggctgg aatacatcgg atggattagt actggtggta gcgcatacta cgcgaactgg 241 gcaaaaggcc gattcaccat ctccagaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacctatttc tgttgcagaa ctcctgctgt tagtaaatgg 361 gacttgtggg gcccgggcac cctagtcacc gtctcctca // LOCUS RABIGHAZ 384 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 4-1. ACCESSION M29419 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 4-1. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 384) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept 1 > 384 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 384 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 365 366 D-region end/J-region start BASE COUNT 82 a 114 c 113 g 75 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagtctctg gaatcgacct cagtggctac cacatgagct gggtccgcca ggctccaggg 181 gaggggctgg aatggatcgg aaccatgagt actactgata acacatatta cgcgagctgg 241 gcaaaaggcc gattcaccat ctccaaaacc tcgaccacgg tggatctgaa aatgaccagt 301 ctgacagccg cggacacggc cacctatttc tgtgccagag gacaggcaac ttttattccc 361 tggggcccag gcaccctggt cacc // LOCUS RABIGHBA 393 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 5-2. ACCESSION M29420 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 5-2. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 393) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept 1 > 393 Ig mu-chain V-D-J precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 393 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 364 365 D-region end/J-region start BASE COUNT 80 a 103 c 118 g 92 t ORIGIN 1 atggagactg ggctgcgctg gcttctcctg gtcgctgtgc tcaaaggtgt ccagtgtcag 61 tcggtggagg agtccggggg tcgcctggtc acgcctggga cacccctgac actcacctgc 121 acagtctctg gaatcgacct cagtagcttt gcaatggcct gggttcgcca ggctccaggg 181 aaggggctgg agtggatcgg aatcattaat ggttatggta ctacatacta cgcgagctgg 241 gtgaatggcc gattcaccat ctccaaaacc tcgacctcgg tggatctgaa aatgaccagt 301 ctgacaaccg aggacacggc cacctatttc tgtgtcagat atcttagtga tggttggtat 361 ctagacttgt ggggccaagg caccctggtc acc // LOCUS RABIGHBB 375 bp ss-mRNA MAM 26-MAY-1990 DEFINITION Rabbit Ig rearranged mu-chain mRNA V-D-J region, clone 7-2. ACCESSION M29421 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region. SOURCE Rabbit (haplotype b) adult spleen, cDNA to mRNA, clone 7-2. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 375) AUTHORS DiPietro,L.A. and Knight,K.L. TITLE Restricted utilization of germ-line VH gene and diversity of D regions in rabbit splenic Ig mRNA JOURNAL J. Immunol. 144, 1969-1973 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.DiPietro, 25-OCT-1989. FEATURES from to/span description pept < 1 > 375 Ig mu-chain V-D-J precursor (AA at 1) sigp 1 57 Ig mu-chain signal peptide matp 58 > 375 Ig mu-chain recomb 339 340 V-region end/D-region start recomb 361 362 D-region end/J-region start BASE COUNT 82 a 98 c 112 g 83 t ORIGIN 1 gtgctcaaag gtgtccagtg tcagtcggtg gaggagtccg ggggtcgcct ggtcacgcct 61 gggacacccc tgacactcac ctgcacagtc tctggattct ccctcaataa ttatgcaatg 121 ggctgggtcc gccaggctcc agggaagggg ctagaatgga tcggaaccat tggtactggt 181 ggtagcgtat actacgcgaa ctgggcaaaa ggccgattca ccatctccag aacctcgacc 241 acggtggatc tgaaaatgac cagtctgaca accgaagaag gacacgccac ctatttctgt 301 gccagagtgg ctggtggtac tgtttttggc tatgtggggt actttaactt gtggggccaa 361 ggcaccctgg tcacc // LOCUS PHVARCA 902 bp ss-mRNA PLN 26-MAY-1990 DEFINITION P.vulgaris arcelin 2 mRNA, complete cds. ACCESSION M28470 KEYWORDS arcelin. SOURCE P.vulgaris, cDNA to mRNA, clone pARC2-11 and pARC2-191. ORGANISM Phaseolus vulgaris Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 902) AUTHORS John,M.E. and Long,C.M. TITLE Sequence analysis of arcelin 2: A lectin-like plant protein JOURNAL Gene 86, 171-176 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by M.E.John, 29-SEP-1989. FEATURES from to/span description pept 1 798 arcelin 2 BASE COUNT 249 a 282 c 164 g 207 t ORIGIN 1 atggcttcct ccaacttact caccctagcc ctcttccttg tgcttctcac ccacgcaaac 61 tcaagcaacg acgcctcctt caacgtcgag acgttcaaca aaaccaacct catcctccaa 121 ggcgatgcca ccgtctcatc cgaaggccac ttactactaa ccaatgttaa aggcaacgaa 181 gaggactcta tgggccgcgc cttctactcc gcccccatcc aaatcaatga cagaaccatc 241 gacaacctcg ccagcttctc caccaacttc acattccgta tcaacgctaa gaacaatgaa 301 aattccgcct atggccttgc ctttgctctc gtccccgtcg gctctcggcc caaacttaaa 361 ggccgttatc taggtctttt caacacagcc aactacgacc gcgacgccca tactgtggct 421 gtggtgttcg acaccgtcag caaccgtatt gaaatcgacg tgaactccat ccggcctatc 481 gcaacggagt cttgcaattt cggccacaac aacggagaaa aggccgaggt tcggatcacc 541 tattactccc ccaagaacga cttgagggtt tctctgcttt acccttcttc ggaagaaaag 601 tgccacgtct ctgccacagt gccgctggag aaagaagttg aggactgggt gagcgttggg 661 ttctctgcca cctcagggtc gaaaaaagag accactgaaa cgcacaacgt cctctcttgg 721 tctttttctt ccaacttcat caattttgag ggcaaaaaat ctgaacgttc caacatcctc 781 ctcaacaaga tcctctagac tcccaaagcc agcttcactg tgacagtaaa accttcctta 841 tacgctaata atgttcatct gtcacacaaa ctacaataaa taaaatggga gcaataaata 901 aa // LOCUS DROGOALA 2204 bp ss-mRNA INV 26-MAY-1990 DEFINITION Drosophila melanogaster G-o-alpha-like protein, clone lambda-DGo59. ACCESSION M29731 J05089 KEYWORDS G protein; guanine nucleotide-binding protein. SOURCE D.melanogaster adult head cDNA to mRNA, clone lambda-DGo59. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2204) AUTHORS Thambi,N.C., Quan,F., Wolfgang,W.J., Spiegel,A. and Forte,M. TITLE Immunological and molecular characterization of G-o-alpha-like proteins in the Drosophila central nervous system JOURNAL J. Biol. Chem. 264, 18552-18560 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Thambi 04-OCT-1989. FEATURES from to/span description pept 166 1230 G-o-alpha-like protein BASE COUNT 757 a 493 c 459 g 495 t ORIGIN 1 gaattccgtg ctcggcaagt gcaacgttga aatcgttaaa ctgtacataa gcaaataaga 61 cataaagaaa aaagtccagg aaaattggaa aacaaaagcc cgaaaaccga aaagccccgt 121 gtaaatccga atccgaatcc aaatcagtat ccaaacccaa ccacaatggg ctgcaccaca 181 tccgccgaag aacgcgccgc catccagcga tccaaacaga tcgagaagaa tctaaaggag 241 gatggaatcc aggcggccaa ggacatcaag ctcctgctgc tgggtgccgg tgagtcgggc 301 aagagcacaa tagtcaaaca gatgaaaatc attcacgaga gcggcttcac tgcggaggac 361 tttaaacaat atcgaccggt tgtctacagc aacacaatac aatcattagt tgcaatattg 421 cgcgcgatgc caaccctaag tattcagtac agcaataacg agcgggagag cgatgccaag 481 atggtgttcg acgtatgcca acgcatgcac gacaccgagc ccttctcgga ggagctgctg 541 gccgccatga aacgcctctg gcaggacgcc ggtgtccagg agtgcttctc gcgcagcaac 601 gaataccaac taaacgattc cgcaaaatat ttcctggacg atttggatcg gttaggcgcc 661 aaggattacc agccaactga acaagatatc ttgcgcactc gcgtcaagac cactggcatc 721 gttgaggtac acttctcctt caaaaacctc aactttaaat tgtttgacgt gggcggtcag 781 cgctcggaac gtaagaaatg gatacactgc ttcgaagatg tcacggcgat cattttctgc 841 gtggccatgt ccgagtacga tcaagtcttg catgaggatg aaaccacgaa ccgcatgcaa 901 gagtcgctga aactgtttga ctcgatctgt aacaacaaat ggttcacgga cacctcgatt 961 attctatttc tgaacaagaa ggatttgttc gaggagaaga ttcgcaagag tcccctgacg 1021 atttgcttcc ccgaatacac aggtggacag gagtacggcg aggcggctgc ttacattcag 1081 gctcaatttg aagcgaaaaa caaatcaacc tcaaaagaaa tctactgcca catgacgtgt 1141 gccacagata ccaataacat tcagtttgta ttcgatgctg tcaccgatgt catcatagca 1201 aacaacctgc gcggctgtgg actgtactaa gatggattcc aggccggatc ccgacgatgt 1261 cgacgtccga gtcgatattg atgacgatga cgattatgtg gagcagaatg ggggcgttac 1321 gagggaacac cgtaacggta ttaaagagca gcgcggagca caacaaccca ccagcattga 1381 tcaaaaaacc aaacaattta ggagcagatg atagaaccaa ccaacaaacc aaccgcaaac 1441 cacacagaaa acataggaca ctgaacaagc aaagcccaaa gaacttttat ttgtttaaca 1501 aaaaaacggc ggacggacgg aaatcccgaa tggatgttat agggaaaatg agcgacaagt 1561 acattacata atatcgataa tattgaagca gatgcagatg caaatacaca caatgctaat 1621 gatgatcagg gcgactatga ctaaatgagg cagcaggcaa ctgacactgg gacacgcgat 1681 taaagtcaca tctgaaaaaa ggcagttgat tgaaaggcat ttctatatac aaacatatac 1741 aaacacatac atatgcatta tgcaaagcca catgtacgac atgacactaa cacactcaca 1801 cgacaaacac aagcgccaac attgcataca gttgttgttt ggtctgaata atttttatag 1861 aatttcataa tttatgtgta gtttagtttc ctcatgtatt tattaaaaca aaaaccaaac 1921 gagcgtatat ctacatatac cgcatatata tatatacata cacttctata catatatata 1981 tatatatata catatatata aatattatat attaaatgtt tcctgttgca atctctcttt 2041 aaaattattc atgccatcaa cgctctgcat ttgtcatgct tgtttagact taagttcgaa 2101 agtttcaaca aaatccagcg tcaaaggaaa tatcaatatt catttgattg agtgtcagcg 2161 tgtggtctaa agtaaatata taaaataaca aaccaaaaaa aaaa // LOCUS DROGOALB 2558 bp ss-mRNA INV 26-MAY-1990 DEFINITION Drosophila melanogaster G-o-alpha-like protein, clone lambda-DGo21. ACCESSION M29732 J05089 KEYWORDS G protein; guanine nucleotide-binding protein. SOURCE D.melanogaster adult head cDNA to mRNA, clone lambda-DGo21. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2558) AUTHORS Thambi,N.C., Quan,F., Wolfgang,W.J., Spiegel,A. and Forte,M. TITLE Immunological and molecular characterization of G-o-alpha-like proteins in the Drosophila central nervous system JOURNAL J. Biol. Chem. 264, 18552-18560 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Thambi 04-OCT-1989. FEATURES from to/span description pept 520 1584 G-o-alpha-like protein BASE COUNT 867 a 557 c 530 g 604 t ORIGIN 1 gaattccggt tgcctatttc tctcgcttac ctatttattt agcatacatt ttccaagcat 61 cctgtgaaaa aaccatcaca agttttcctt cgaacggaat gccaagtgca ttctggaagg 121 aaatcgttgt acatctacat aatgccaata aagaaaatgt aactaaagta aaaaaaaaaa 181 aaaagagcta aaccgttaaa ttaaagtttt aaagttaaaa aaacgctgaa taagtgttaa 241 atatatataa caaaaatatt gttgaattga agaaaaccaa agttcaaaaa cctgaaaaaa 301 ccataaagaa gtgattgaaa aatcagttga agtgccgtac tgaaaattaa agtccagtga 361 cacgatcgaa tccctcggat agcggagtta gtttagcccc ccgaattcga gtccccgcac 421 gttgtacacc tggtttttct cgctggcaac gtagtcggcc attgagttgg ccgataccaa 481 acgaccttca aaacgttttg cgtcgaggca atacgcacca tgggctgcgc acagtctgcc 541 gaggagcgag ccgcagccgc caggagtcgc ctcatcgagc gcaatctaaa ggaggatgga 601 atccaggcgg ccaaggacat caagctcctg ctgctgggtg ccggtgagtc gggcaagagc 661 acaatagtca aacagatgaa aatcattcac gagagcggct tcactgcgga ggactttaaa 721 caatatcgac cggttgtcta cagcaacaca atacaatcat tagttgcaat attgcgcgcg 781 atgccaaccc taagtattca gtacagcaat aacgagcggg agagcgatgc caagatggtg 841 ttcgacgtat gccaacgcat gcacgacacc gagcccttct cggaggagct gctggccgcc 901 atgaaacgcc tctggcagga cgccggtgtc caggagtgct tctcgcgcag caacgaatac 961 caactaaacg attccgcaaa atatttcctg gacgatttgg atcggttagg cgccaaggat 1021 taccagccaa ctgaacaaga tatcttgcgc actcgcgtca agaccactgg catcgttgag 1081 gtacacttct ccttcaaaaa cctcaacttt aaattgtttg acgtgggcgg tcagcgctcg 1141 gaacgtaaga aatggataca ctgcttcgaa gatgtcacgg cgatcatttt ctgcgtggcc 1201 atgtccgagt acgatcaagt cttgcatgag gatgaaacca cgaaccgcat gcaagagtcg 1261 ctgaaactgt ttgactcgat ctgtaacaac aaatggttca cggacacctc gattattcta 1321 tttctgaaca agaaggattt gttcgaggag aagattcgca agagtcccct gacgatttgc 1381 ttccccgaat acacaggtgg acaggagtac ggcgaggcgg ctgcttacat tcaggctcaa 1441 tttgaagcga aaaacaaatc aacctcaaaa gaaatctact gccacatgac gtgtgccaca 1501 gataccaata acattcagtt tgtattcgat gctgtcaccg atgtcatcat agcaaacaac 1561 ctgcgcggct gtggactgta ctaagatgga ttccaggccg gatcccgacg atgtcgacgt 1621 ccgagtcgat attgatgacg atgacgatta tgtggagcag aatgggggcg ttacgaggga 1681 acaccgtaac ggtattaaag agcagcgcgg agcacaacaa cccaccagca ttgatcaaaa 1741 aaccaaacaa tttaggagca gatgatagaa ccaaccaaca aaccaaccgc aaaccacaca 1801 gaaaacatag gacactgaac aagcaaagcc caaagaactt ttatttgttt aacaaaaaaa 1861 cggcggacgg acggaaatcc cgaatggatg ttatagggaa aatgagcgac aagtacatta 1921 cataatatcg ataatattga agcagatgca gatgcaaata cacacaatgc taatgatgat 1981 cagggcgact atgactaaat gaggcagcag gcaactgaca ctgggacacg cgattaaagt 2041 cacatctgaa aaaaggcagt tgattgaaag gcatttctat atacaaacat atacaaacac 2101 atacatatgc attatgcaaa gccacatgta cgacatgaca ctaacacact cacacgacaa 2161 acacaagcgc caacattgca tacagttgtt gtttggtctg aataattttt atagaatttc 2221 ataatttatg tgtagtttag tttcctcatg tatttattaa aacaaaaacc aaacgagcgt 2281 atatctacat ataccgcata tatatatata catacacttc tatacatata tatatatata 2341 tatacatata tataaatatt atatattaaa tgtttcctgt tgcaatctct ctttaaaatt 2401 attcatgcca tcaacgctct gcatttgtca tgcttgttta gacttaagtt cgaaagtttc 2461 aacaaaatcc agcgtcaaag gaaatatcaa tattcatttg attgagtgtc agcgtgtggt 2521 ctaaagtaaa tatataaaat aacaaaccaa aaaaaaaa // LOCUS MTYCLCGA 6319 bp ss-RNA VRL 26-MAY-1990 DEFINITION Turnip yellow mosaic virus Club Lake isolate, complete genome. ACCESSION J04373 KEYWORDS complete genome; nucleotide binding protein; replicase; virion protein. SOURCE Turnip yellow mosaic virus Club Lake isolate cDNA to viral RNA. ORGANISM Turnip yellow mosaic virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Tymovirus. REFERENCE 1 (bases 1 to 6319) AUTHORS Keese,P., Mackenzie,A. and Gibbs,A. TITLE Nucleotide sequence of the genome of an Australian isolate of turnip yellow mosaic tymovirus JOURNAL Virology 172, 536-546 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Gibbs, 04-AUG-1989. FEATURES from to/span description pept 96 5630 replicase polyprotein pept 89 1975 Unknown protein pept 5645 6214 virion protein BASE COUNT 1461 a 2426 c 1061 g 1371 t ORIGIN 1 gtaatcaact accaattcca gctctctttt gacaactggt cttataccaa ctttccgtac 61 acttgcaacc ctcgtaagac aattgcaaat gagtaatggc cttccaatta gcattggacg 121 cccttgcacc cacgactcac agagatccct ctctgcatcc gattctcgaa tccacagtag 181 attcgattcg ctcctcgata cagacctacc catggtccat tccgaaggaa cttctgcccc 241 tactcaactc ctacggcatc ccaacatctg gtttgggaac atcccaccac ccccacgccg 301 cccacaagac aatcgagact tttctccttt gcacccactg gtctttccag gccaccactc 361 ccagctccgt catgttcatg aaacccagca agttcaacaa acttgcccag gtgaactcaa 421 actttcggga attgaagaac taccgcctgc accccaacga cagcactcgt taccccttca 481 catcaccaga ccttcccgtt ttccccacca ttttcatgca cgacgccctg atgtactacc 541 atccctccca gatcatggac ctgttcttgc agaaaccaaa cctcgaacgt ctgtacgcca 601 gcctcgtagt accacccgag gcccatcttt ccgaccaatc cttcttcccg aagttgtaca 661 cgtacacgac gacccgccac actcttcact acgtcccgga aggtcacgaa gccggcagct 721 acaaccaacc atccgacgcc cactcttggc tccgaatcaa ttcaattcgc ctcggcaacc 781 accacctctc agtgacgatc ctggaatcct ggggccctgt ccactcgctc ctaattcaac 841 gagggacccc cccccccgac ccatcactcc aggccccttc aacacccatg gcgtccgacc 901 tctttcggtc ttaccaagag ccccgcctcg acgtggtctc cttccgaatc ccagacgcca 961 tcgaacttcc acaggccaca ttccttcaac aaccgcttcg agaccgactg gtcccccgag 1021 ccgtctacaa cgccctgttc acctacacca gagcggtccg cacactccgg acttcagacc 1081 cagcggcatt cgtaaggatg cattcctcca aaccggacca cgattgggtc acctcgaacg 1141 cctgggacaa tctgcagacc ttcgcacttc tgaacgtacc ccttcgacca aacgtcgtct 1201 accacgtcct tcagagccca attgcctccc tagctcttta cctgaggcaa cattggcgcc 1261 gtcttaccgc caccgccgtt cccatcctct ccttcctaac cctcctgcag cgcttcctcc 1321 cattgcctat acctctggca gaggtaaaat ccatcacagc cttccgaagg gagctctacc 1381 gaaagaaggc cccccaccac cccctcgacg tcttccatct ccagcaacac ctccgcaatc 1441 accactccgc gatctcggcc gtacgcccag cttccccacc ccaccaaaga cttccacacg 1501 cgctccagaa agctgcattg ctgctcctcc gaccgatatc gcccctcttg acagcgaccc 1561 cgttctttcg gtccgaacag aagtccatgc tcccgaacgc cgaactttca tggaccctga 1621 agcgcttcgc gctgccttgg caggcctccc tagtcctcct ctctctgtcg gaatcatccg 1681 tactgcttca caaactgttc tccccaccaa ctctccaagc ccaacacgac acctaccacc 1741 gacatcttca ccctggatcc tacagtctcc agtgggagag gacgccattg tcgattccga 1801 ggacgacagc atttcttcct ttcactccca cgacttcaac agcccctccg gaccactccg 1861 aagccagtct ccctcccgct ttcgcctcca cctccgttcc ccgtccacct ccagtggcat 1921 cgagccttgg agcccagcct cctacgacta cggcagcgcc cccgacaccg attgaaccca 1981 cccagcgcgc tcatcaaaat tctgacctca cgcttgaaag ttcaacccca attgaacccc 2041 ccccaccccc catccaatcc tccgacatcc cgccttccgc ccccgttctt ttcccagaaa 2101 tcaactcacc gcatcgtttt tcccccaaac ttcccaccac acccgatttc gaacccaccc 2161 gcacttcacc ccctccttcc acttcgcatc aagattcgac tgaccccgcg gaccccctga 2221 tgggctccca ccttctgcac cattcactac ctgcacctcc cacccacccg cttcaatctt 2281 cacagctctt gcccgcacct ttgacaaacg accccaccgc gatcggcccg gtactcccct 2341 ttgaagaact ccacccacgc aggtaccccg aaaacaccgc cactttcctc acgaggctcc 2401 gttcacttcc ttcaaaccat ctaccacaac ccaccctgaa ttgtctcctc tctgctgtct 2461 ccgaccaaac caaggtttcc gaggatcacc tctgggagtc cctacagaca attctcccag 2521 acagccaact caggaacgaa gagatcaact ctctcgggct ttcaactgaa cacctcactg 2581 cgttggccca tctttacaac ttccaggcaa ccatctactc cgatcgtggt cccatcctct 2641 tcggcccatc cgacaccatt aagagaatcg acatcaccca caccaccgga ccgccatccc 2701 acttttcacc cggcaaaaga cttttaggca gccaaccctc agctaagggc catccctccg 2761 actcactcat cagagccatg aagtctttca aagtatccgg caactacctt cccttctctg 2821 aggcccacaa ccatcccacc tccatctcac atgccaagaa cttggtttca aacatgaaga 2881 atggattcga cggcatcctc tcccttctcg acgtctccac aggccaacga accggaccca 2941 cccccaaaga cgcgatcatt cagatagacc actacctcga caccaacccc ggcaaaacca 3001 cccctgtggt gcattttgct ggtttcgctg gctgtggaaa gacatatccg atccaacagc 3061 tccttaaaac taaactgttc aaagactttc gggtctcctg ccccaccaca gaactcagaa 3121 ccgaatggaa gactgcgatg gaacttcatg gctcccagtc atggcgcttt aacacttggg 3181 agtcttccat tctcaagtca tccagaattc tggtcatcga tgaaatctac aaaatgccaa 3241 gagggtacct cgacctttcc attctcgctg accccgccct cgaactcgtc ataattctcg 3301 gtgatcctct ccagggcgag taccactctc aatccaaaga ctcatccaat caccgccttc 3361 cctccgaaac tctcaggctg ctaccataca ttgacatgta ctgctggtgg agttatcgca 3421 ttccccaatg tatcgcccga ctcttccaaa ttcacagctt caatgcctgg cagggaatca 3481 tcggctccgt ttcaactccc caggatcaat cccccgttct caccaacagt catgcctcat 3541 ctctcacctt caacagcctg ggatatcgct cctgcacgat cagctctagc caaggcctca 3601 cattctgcga ccctgccatc atcgtcctgg acaactacac caagtggctc tcctcggcca 3661 acggcctcgt cgccctcacc cgatccagat caggtgtcca attcatgggc ccctcttcct 3721 atgtcggggg aaccaacggc tcttctgcca tgttttctga cgccttcaac aacagcctca 3781 tcatcatgga tcgctacttc ccatccctgt tcccacaact caagctcatc acctcccccc 3841 tcacaactcg cagccccaaa ctcaacgggg ccacccccag cgcatctccc acccatcgct 3901 cgccaaactt ccacctcccc ccacacattc ccctctctta tgatcgtgat ttcgtcacgg 3961 tcaacccaac tctccctgat cagggacccg aaacaagact cgacacccac ttcctcccac 4021 cttctcggct cccgcttcat ttcgatctcc caccagctat cacccccccc ccgatttcca 4081 caagcgtcga cccgccacaa gctaaagcta gccccgtcta tccaggcgag ttcttcgatt 4141 ctctggcggc gttcttctta ccagcacacg acccatcaac aagggaagta ctccacaaag 4201 atcaatctag caaccagttc ccttggttcg accgaccctt cagcttgtcc tgccagccct 4261 caagtttaat ttctgccaag catgcaccca accacgatcc gacccttctg cctgcctcca 4321 tcaataaacg cttgcgattc agacccagtg aagcaccgca ccaaatcacc gcagacgacg 4381 tggtcctagg cctgcaactc ttccactctc tctgccgcgc ctactcacgt caacccaaca 4441 tcaccgttcc attcaaccct gaacttttcg cagaatgtat ctctctgaat gaatacgcgc 4501 agctcagttc caaaacccaa tccaccatag tggccaacgc ttcacgctcc gacccagact 4561 ggcgacacac caccgtcaag atttttgcga aagctcaaca caaagtcaac gacggctcca 4621 tcttcggttc atggaaggcc tgccaaactc tcgcactcat gcatgattac gtaattctgg 4681 ttcttggacc cgtcaagaaa tatcaaagaa tcttcgacaa cgttgatcgg ccatctcaca 4741 tctactcaca ctgcggcaag acacccaacc aacttcgaga ttggtgccag gaacatctca 4801 ctcattccac cccaaaaatc gcaaacgact acaccgcctt cgaccaatcc cagcatggag 4861 aatccgtggt tcttgaagcc ctcaaaatga agagactgaa cattccgagc catttgattc 4921 agctccatgt ccacctcaag accaacgtct ccacccagtt cggccccctc acatgcatgc 4981 gcctgaccgg ggaacccgga acctacgacg acaacactga ctacaacctc gcagtcatct 5041 actctcagta tgacgttggt tcctgcccca tcatggtctc tggcgacgac tcactcatag 5101 accaccctct tcccactcgc cacgactggc cctctgttct caaacgcctc cacctccgct 5161 ttaaacttga actcacttct catcccctct tttgtggcta ctacgtcggt ccagcaggct 5221 gcatccgcaa ccccttggcc cttttctgca agctcatgat cgcagtggac gatgacgccc 5281 tcgacgaccg acgactcagc tacctcaccg agttcaccac cggacacctc cttggcgaat 5341 cactatggca cctcctccct gaaacccacg tccagtatca gtcagcttgc tttgacttct 5401 tctgcagacg ttgcccaaaa cacgagaaga tgctcctcga tgattccaca cccacactca 5461 gcctcctcga acgaatcact tcttcaccga ggtggctcac caagaacgcc atgtacctcc 5521 tccccgccaa gctcagactg gctatcacct ctctgtctca aacgcaatct ttcccagaat 5581 ccattgaggt ttcccacgct gagtctgaat tgcttcacta tgtccaatag caatcagccc 5641 cgacatggaa atcgacaaag aactcgcccc ccaagaccgc accgtcaccg tcgccaccgt 5701 tttaccgact gtccccggcc cctcaccttt caccatcaaa caaccgttcc agtctgaagt 5761 tctgtttgct gggaccaaag atgccgaggc ctctctcacc atcgccaaca tcgacagcgt 5821 ttccaccctc accaccttct atcgtcatgc ctctctggaa tcactctggg tcaccatcca 5881 tcctaccttg caagccccag ctttcccgac cacggttggc gtttgctggg tacccgccaa 5941 ctccccagtc actcccaccc aaatcaccaa gacctacggc ggccagatct tctgcattgg 6001 aggcgccatc aacactctct cacccctcat tgtcaagtgc ccacttgaaa tgatgaaccc 6061 ccgggtcaaa gattcaattc aataccttga ctcgcccaaa ctcctcatct ccatcaccgc 6121 tcaacccacc gctccccccg catcgacctg cataataact gtatcaggaa ctctctcgat 6181 gcattctccg ctcatcacgg acacttccac ctaagttctc gatctttaaa atcgttagct 6241 cgccagttag cgaggtctgt ccccacacga cagataatcg ggtgcaactc ccgccccttt 6301 tccgagggtc atcggaacc // LOCUS RATTH2BAA 181 bp ds-DNA ROD 26-MAY-1990 DEFINITION Rat TH2B gene promoter region. ACCESSION M33578 KEYWORDS H2B histone; histone; transcription regulatory element. SOURCE Rat DNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 181) AUTHORS Hwang,I., Lim,K. and Chae,C.-B. TITLE Characterization of the S-phase-specific transcription regulatory elements in a DNA replication-independent testis-specific H2B (TH2B) histone gene JOURNAL Mol. Cell. Biol. 10, 585-592 (1990) STANDARD simpl staff_entry FEATURES from to/span description mRNA 161 > 181 H2B histone mRNA signal 61 68 octamer signal signal 110 115 hexamer signal BASE COUNT 45 a 47 c 32 g 57 t ORIGIN 1 acctgattgg ctgattggtg atgaattaac caatcagaaa gcaccacttg aattcccctt 61 atttgcatac aaggaacatt tattgtccaa tcatctttcg cgtgctcata cgtcatccaa 121 ggcccacgcc tataaatacc tctcttcttg gccttcaagc ggtgtgtttt ctcagcagtt 181 g // LOCUS TCVDIGAA 347 bp ss-RNA VRL 26-MAY-1990 DEFINITION Turnip crinkle virus defective interfering RNA. ACCESSION M29290 KEYWORDS defective interfering RNA. SOURCE Turnip crinkle virus cDNA to RNA. ORGANISM Turnip crinkle virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Tombusvirus. REFERENCE 1 (bases 1 to 347) AUTHORS Li,X.H., Heaton,L.A., Morris,T.J. and Simon,A.E. TITLE turnip crinkle virus defective interfering rnas intensify viral symptoms and are generated de novo JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9173-9177 (1989) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by A.E.Simon, 20-OCT-1989. FEATURES from to/span description RNA 1 347 defective interfering RNA BASE COUNT 90 a 98 c 86 g 73 t ORIGIN 1 gggataaaaa aggaggctta ccaaccttct ctctattcac gatgcctctt ctacacacac 61 tcaaaacagc gctcgcagtg ggactccttg gagccaggta ctaccccgaa ggttcaaaac 121 caagaccccc aagtcgcttt actttgagat gtgttagaaa gccccaaggt cattttactt 181 tgacctgtgt tagagaccca aaacggtggc agcactgtct agctgcgggc attagactgg 241 aaaactagtg ctctctgggt aaccactaaa atcccgaaag ggtgggctag tggcgaccct 301 ccgaactaaa agacagcctc cctcctcgcg gggggggggg cctgccc //
GenBank-Updates@genbank.bio.net (05/30/90)
LOCUS FSBISO 676 bp ss-mRNA VRT 30-MAY-1990 DEFINITION C.commersoni isotocin/neurophysin mRNA, complete cds. ACCESSION X16621 M25145 KEYWORDS isotocin; neurophysin. SOURCE C.commersoni hypothalamic region, cDNA to mRNA, clones pCIT-1, M132.1 and M135.1. ORGANISM Catostomus commersoni Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Cypriniformes; Cyprinoidei; Catostomidae. REFERENCE 1 (bases 1 to 676) AUTHORS Figueroa,J., Morley,S.D., Heierhorst,J., Krentler,C., Lederis,K. and Richter,D. TITLE Two isotocin genes are present in the white sucker Catostomus commersoni both lacking introns in their protein coding regions JOURNAL EMBO J. 8, 2873-2877 (1989) STANDARD full automatic REFERENCE 2 (bases 1 to 676) AUTHORS Heierhorst,J., Morley,S.D., Figueroa,J., Krentler,C., Lederis,K. and Richter,D. TITLE Vasotocin and isotocin precursors from the white sucker, Catostomus commersoni: Cloning and sequence analysis of the cDNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 5242-5246 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.D.Morely, 31-MAY-1989. FEATURES from to/span description pept 43 507 isotocin-1 precursor sigp 43 102 isotocin-1 signal peptide matp 103 129 isotocin-1 matp 139 504 neurophysin (put.) mRNA < 1 676 isotocin mRNA signal 573 578 poly-A signal signal 657 662 poly-A signal BASE COUNT 147 a 169 c 176 g 184 t ORIGIN 1 ctgaatcgtt tcactacctg tcaatttcag cctttgcaaa atatgtctgg aagcatgttc 61 tcagtgtttt ctctgctgta tctgctgtct gtctgctcag cctgttacat ctccaactgc 121 cccatcggtg gtaaacgagc catacaggat tcaccctctc gacagtgtat gtcttgcggt 181 ccgggggatc gcggccggtg ttttggtccc agtatctgct gtggtgaggg tctcggctgt 241 ctgctgggct ctccggagac tcagcgatgt ctggaggagg actttctgcc ctccccgtgt 301 gaggccggtg ggaaagtgtg cggatatgag ggccgctgtg ctgctccagg tgtctgctgt 361 gactcagagg gctgcagcgt tgatcagtcg tgtgttgatg gtgacggtga cgctacagct 421 gtcagtcaac ctgccagcag ccaagattta ctgctgaaac tactgcacct gtcaaacccc 481 gcccacccat acagactcca ccaatgacat cagagtgaca tccgctcagg agaccaccag 541 ccaatcagaa ttattcttga agaccttttg taaataaagt ttgtatagcc gatagacatg 601 tacatggctc atacagatat tagtagaact gtattgatag tgtttactgt gcagctatta 661 aagtcagaga gaacgt // LOCUS HUMTCBYAL 416 bp ss-mRNA PRI 30-MAY-1990 DEFINITION Human T-cell receptor beta-chain mRNA V9-D-J2.3-C2 region, partial cds. ACCESSION M33240 KEYWORDS T-cell receptor beta-chain; constant region; diversity exon; joining exon; pseudogene; variable region. SOURCE Human (individual VW) T-cell line VW114, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 416) AUTHORS Hansen,T. and Lundin,K. JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Hansen, 22-MAR-1990. Institute of Transplantation Immunology, The National Hospital, N-0027 Oslo 1 Norway FEATURES from to/span description pept.ps < 1 > 416 T-cell receptor beta-chain V-D-J-C region precursor /nomgen="TCRB" /map="7q35" /hgml_locus_uid="LP0151Q" (AA at 1) sigp.ps < 1 35 T-cell receptor beta-chain signal peptide matp.ps 36 > 416 T-cell receptor beta-chain recomb 317 318 V-region DNA end/D-region DNA start recomb 330 331 D-region DNA end/J-region DNA start recomb 375 376 J-region DNA end/C-region DNA start BASE COUNT 121 a 105 c 87 g 103 t ORIGIN 1 atgtggccct ctgcctcctg caagcagatc cactggacac agccgtttcc cagactccaa 61 aatacctggt cacacagacg ggaaaaaagg agtctcttaa atgagaacaa aatctgggcc 121 ataatgctat gtattggtat aaacaggact ctaagaaatt tctgaagaca atgtttatct 181 acagtaacaa ggagccaatt ttaaatgaaa cagttccaaa tcgcttctca cctgactctc 241 cagacaaagt tcatttaaat cttcacatca attccctgga gcttggtgac tctgctgtgt 301 atttctgtgc cagcagccaa gggcaattgc acagatacgc agtattttgg cccaggcacc 361 cggctgacag tgctcgagga cctgaaaaac gtgttcccac ccgaggtcgc tgtgtt // LOCUS RATODCAB 2153 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Rat ornithine decarboxylase (ODC) mRNA, complete cds. ACCESSION J04791 KEYWORDS ornithine decarboxylase. SOURCE Rat (strain Sprague-Dawley) testis, cDNA to mRNA, clone pODC2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2153) AUTHORS Wen,L., Huang,J.-K. and Blackshear,P.J. TITLE Rat ornithine decarboxylase gene: Nucleotide sequence, potential regulatory elements, and comparison to the mouse gene JOURNAL J. Biol. Chem. 264, 9016-9021 (1989) STANDARD full staff_review COMMENT Draft entry for [1] kindly submitted by L.Wen, 28-MAY-1989, for release after publication. FEATURES from to/span description pept 160 1545 ornithine decarboxylase (EC 4.1.1.17) ORF 39 71 ORF signal 1855 1860 poly-A signal BASE COUNT 541 a 460 c 565 g 587 t ORIGIN 1 cggccgggtg cgccttgggg tttagtggcg gcttctccat gggtccagcc agccgcttcc 61 ctgtgctgtg aggagacagc attcagagtt gaccttgtga gagctggcca taatttaatt 121 ccatctctag gttttctgta ggcacatcga gaaccaacca tgggcagctt tactaaggaa 181 gagtttgact gccatatcct cgatgaaggt ttcactgcta aggacattct ggaccaaaaa 241 atcaatgaag tttcttcctc tgatgataag gatgctttct atgttgcgga cctcggagac 301 gttctaaaga agcatctgag gtggctgaaa gctcttcccc gtgttactcc cttctatgct 361 gtcaagtgta atgacagcag agccatagtg agcaccctgg ctgccattgg gacaggattt 421 gattgtgcaa gcaagactga aatacagttg gtgcaggggc ttggggtgcc tccagagagg 481 attatctatg caaatccttg taagcaagtg tctcagatca agtatgctgc cagtaatgga 541 gtccagatga tgacttttga cagtgaaatt gagttgatga aagttgccag agcacatcca 601 aaggcaaagt tggttttgcg gattgccact gatgattcca aagcagtttg tcggctcagt 661 gttaagtttg gtgccacact gaaaaccagc aggcttctct tggaacgggc aaaagagcta 721 aatattgatg tcattggtgt cagcttccat gtgggcagtg ggtgtactga ccctgagacc 781 ttcgtgcagg cagtgtcaga tgcccgctgt gtctttgaca tgggaacaga agttggtttc 841 agcatgtatc tgcttgacat tggtggtggc tttcctgggt ctgaagacac gaagcttaaa 901 tttgaggaga tcaccagtgt aatcaaccca gctctggaca agtacttccc atcggactct 961 ggagtgagaa tcatagctga gccaggcaga tactacgtcg catcagcttt cacacttgca 1021 gtgaatatca ttgccaaaaa aaccgtgtgg aaggagcaga ccggctcgga cgatgaagat 1081 gagtcaaacg agcaaacttt gatgtattac gtgaatgatg gagtgtatgg gtcatttaac 1141 tgcattcttt atgaccatgc acatgtgaag gccctgctgc agaagagacc caagccagat 1201 gagaagtatt actcatccag catctgggga ccaacatgtg atggccttga tcggatcgtc 1261 gagcgctgta gcctgcctga aatgcatgtg ggtgattgga tgctgtttga gaacatgggt 1321 gcatacactg ttgctgctgc ttctactttc aatgggttcc agaggccaaa catctactac 1381 gtaatgtcac ggtcaatgtg gcaactcatg aagcaaatcc agagccatgg cttcccgcca 1441 gaagtggagg agcaggatgt tggcactctg cccatgtctt gtgcccagga gagcgggatg 1501 gaccgtcacc ctgcagcctg tgcttctgct agtatcaatg tatagatgcc attcttgtag 1561 ctcttacctg caagtttagc ttgagttcac ggcatttggg gggaccattt aacttaatta 1621 ctgctagttt ggaatgtctt tgtaagagta gggttggcac caatgcagta tggaaagact 1681 aggagatggg ggtcacactt actgtgttcc tatggaaact ttgaatattt tatatggatt 1741 tttattcact tttcagacct gatactaatg agtgcccctc ggctgctgag caagcatttg 1801 tagcttgtac attggcagaa tgggctaaaa gcttatgttg tgacccattt tgaaaataaa 1861 gtatcttgaa atgattggac attggagaat gtgtgcaagt atcccttaca gaaggcacaa 1921 acttctgcac aggctgtgtg ttacagcagt gagtctagcc cagcagagat gtggatgata 1981 caaagctgtg ccccctctgt acagcatcaa tgtgcttagc ccatctcaag tgtttactgt 2041 gaacttggtg cccaaagtct cttaagagtg tcatctgcct agtggcctct tgacttggcc 2101 acttcctaag gagagggcat ctgaggctct ttgaaccttg cctgcagaaa ccc // LOCUS BPOORF2 3135 bp ds-DNA BCT 30-MAY-1990 DEFINITION B.polymyxa beta-1,4-endoglucanase gene, complete cds, and ORF2, complete cds. ACCESSION M33791 M33840 KEYWORDS beta-1,4-endoglucanase. SOURCE B.polymyxa DNA. ORGANISM Bacillus polymyxa Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3135) AUTHORS Baird,S.D., Johnson,D.A. and Seligy,V.L. TITLE Molecular cloning, expression, and characterization of endo-beta- 1,4-glucanase genes from Bacillus polymyxa and Bacillus circulans JOURNAL J. Bacteriol. 172, 1576-1586 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1447 2640 beta-1,4-endoglucanase ORF 1072 509 (c) ORF2 BASE COUNT 848 a 689 c 676 g 922 t ORIGIN 1 gaattcataa aaaacgtttt caaaaagatg aaaaggactt attcaaacat tcagttcagc 61 tttaggaccg attctctgtg gttgatgacc ggattcatac ggatgatttc tgacagctcc 121 tcatgatcct gttctttttg atcaattttc tccagcagta gcctggcact tcttttcccc 181 atttcaaaac tcggctggga aacggttgtg atcggcggat tataaaaaga cgcaaaggat 241 acatcatcta tgccaatcaa ggaaaagtca ttaggtatgg tcagatgatt attttttgca 301 taaatcagca cctgctccaa aatcatgtca ttcgtagcaa taattgcagt gggcggattc 361 tccatgtgaa atagttgtcc aatcagatta ggcatttcat ccaacctgcc gcttttgata 421 tagttttcat tcacgggtat gtcatgttta cccagcgtat ttcggtaacc tgacagtcgc 481 tcacttcgtg tggtgatcgc ttttttgcct aaggggaacg tgataatccc aatgttgtta 541 tgaccatgct caatgagtgc ttcaacgcaa gctcgctggc cttttcatta ttcagtagca 601 ccgtatctac gctgagttca tcgatttttc ggtccacaaa cacgaaggga tagccatttt 661 tgacaaggga aacatataat ttcttgttct cttccgtcgg gaaaatgata agtccatcca 721 cttgccgagc aatcatcgac tggacatact ttttctcttt gagcgaatca tcatcggtat 781 tgcagacaat gacctggaca ttttccagtt ggcattcatc ttcaatggca cgcacgactt 841 ctgttgtaaa tctcgataaa atggttgaag aaataacccc cacaatgaag gtcttcttct 901 gcttgagact acgtgccacc tcattgggta tataactgag atcctcaatc gcttgctcga 961 tcttcttctt ggtgttttca ctcatatagt tgtatctctt ttgcagaaac tgcgaaactg 1021 tactcttcga caccccggca cgttctgcca cttccttcat cgttgttacc ataattaccc 1081 tcctttcttt tttgcatgtc atgcgtgcta attttcaaat atatattctg caacctgtat 1141 gtaaagtgaa tttcacttta ttatatacct tcttttacag caggaaaata taataaataa 1201 agagttagag cgtgcctcta ctcatataca agcgcggcca aaaatatggg gtggaaactc 1261 cctcacgaat cttttacaag gggaactttc ctacttagct tgataatccg ttgttataaa 1321 tctattgatt tcgaacgaaa tcttagaaat atacgaattt tatgaacttg tttgatggaa 1381 gatttgaaca taaaatgaaa atgtaatcgg ttacaaattc ttttaaataa aggaggaagc 1441 agaattatga agaaaaaagg attaaaaaaa acatttttcg tcattgcctc cctcgtaatg 1501 ggcttcacac tgtatggcta tacacccgtt tctgcagatg cagccagtgt gaaaggatat 1561 tatcacaccc aaggaaacaa gattgtagac gaatccggga aagaagcggc atttaacggc 1621 ctgaactggt tcggtctgga aactcctaat tacaccttgc atggactgtg gagccgctca 1681 atggacgaca tgctggatca ggtgaagaaa gaaggctaca atctgattcg tctgccttac 1741 agcaatcagt tgttcgattc cagttcccgt ccagacagta ttgattatca caaaaaccct 1801 gatctggtcg gattaaaccc gattcaaatt atggacaagc tgatcgaaaa agctggacaa 1861 cgcggtattc agattatcct tgaccgtcac cgtccaggct caggtgggca atccgagctg 1921 tggtacacat cccagtaccc tgagtctcgc tggattagtg actggaaaat gttggctgat 1981 cgttataaaa ataaccccac cgtcattggt gcggatttgc acaacgagcc acacggtcaa 2041 gcaagctggg gtacaggcaa tgcctccaca gactggcgtc tggcggcaca acgtgcaggg 2101 aatgcgattc tgtccgtgaa tccgaattgg ctgattctcg tagaaggtgt agaccacaat 2161 gtacaaggca acaatagcca atactggtgg ggtggcaacc tgacaggtgt agccaactat 2221 cctgtcgttc tggacgtacc gaaccgtgtc gtatattctc cacacgatta cggccccggt 2281 gtgtcttcgc agccatggtt caacgacccg gccttcccgt ccaacctgcc agcgatctgg 2341 gatcaaacct ggggctacat cagcaaacaa aacatagctc cggtgctggt tggtgaattc 2401 ggcggccgta atgttgattt gtcctgccct gaggggaaat ggcaaaatgc gcttgttcac 2461 tatattggtg ccaacaacct gtactttacg tactggtccc tgaatccgaa tagcggcgac 2521 acaggcggtc tgctgctgga tgactggact acctggaatc gtccgaagca agatatgctg 2581 ggtcgaatta tgaagcctgt tgtttccgta gcccagcaag cggaagcagc agccgaatag 2641 gcacaggcct tcaacttcac ttaattaaaa aagccgattt tcctccttca ggccagatta 2701 ctgctgggct atgaagatgt ggagaaacgg ctttttttgt tcttccagtt ttacacattg 2761 taaattcccc attgaatggg tcattttgta aactaaatct ctctttcaac cgataaatac 2821 atatgttgga ttttaattta gaaaggaagt attcatgtga ttatttggaa gggttttggt 2881 attttaaata ttattattcc agggatttta tttgttattg tcggtagttt agtatccgct 2941 ctagggcttg actccataga ttcccggctg ccaatggctt ttgtgttcat tgtgtcggga 3001 gttattatct ggtatttggg gaaagcactc aactccgact caaagtgcta gttgatatgg 3061 aaacgggaca acgctatcga atgggaaccc agcacagtct gttttttatc cctatgcatt 3121 attggggccc tgcag // LOCUS SHPMHDQB 3123 bp ds-DNA MAM 30-MAY-1990 DEFINITION Ovine MHC class II OLA-DQB beta-chain gene, partial cds. ACCESSION M33307 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Ovine (strain Merino) DNA, clone B10-1, library Ch.28. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 3123) AUTHORS Scott,P.C. and Brandon,M.R. TITLE Nucleotide sequence of ovine MHC class II beta genes: DQB and DRB JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.C.Scott 28-MAR-1990. Univ of Melbourne, Veterinary Preclinical Centre, Parkville, Victoria, AUSTRALIA 3052 FEATURES from to/span description pept / 204 473 MHC OLA-DQB beta-chain, exon 2 (AA at 3) 876 1157 MHC OLA-DQB beta-chain, exon 3 1656 1766 MHC OLA-DQB beta-chain, exon 4 2232 2255 MHC OLA-DQB beta-chain, exon 5 2697 2710 MHC OLA-DQB beta-chain, exon 6 IVS < 1 203 MHC OLA-DQB intron A IVS 474 875 MHC OLA-DQB intron B IVS 1158 1655 MHC OLA-DQB intron C IVS 1767 2231 MHC OLA-DQB intron D IVS 2256 2696 MHC OLA-DQB intron E signal 3014 3020 poly-A signal BASE COUNT 635 a 773 c 851 g 707 t 157 others ORIGIN 1 cgggaggccg gggacgagcc gctggaggac ctgggttcgc atgcacgaag gcgcagggcc 61 gggctggaac cgaacagggg ccgagcggcg gcgggttgcc ggtttcggga cccgctggcg 121 cgcggccggc acctccccct ctgccgagcg cgccccgtgg ggctgtgtgg gctgagcctg 181 accgagcggc tgtctccccg cagaggattt cgtgtaccag tttataggcc actgttactt 241 caccaacggg acggagcggg tgcggcatgt gaccagatac atctacaacc aggaggagtt 301 cgtgcgcttc gacagcggac tgggacgagc accggcggtg accctgctgg ggcggccgga 361 cgccgagtac tggaacagcc agaaggacat catggagcgg acgcgggccg aggtggacac 421 ggtgtgcaga cacaactacc aggtggaagc cgccttcacc tggcagcggc gaggtgagtg 481 ccggcgcccg ctccgcgggg ccgccctcca cccgccagga ctccgcaccg aaggactgag 541 tctccggggt cggtccccag accacggatg ggacagagag gcgctgaggg acaggggccc 601 aggggcacag cgtgagggcg gggcatcgaa tttggcaccc tggagccttc ctgcagnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tcccacctgg aatgtgctgt gtggtctcac 841 gtctcactgt cttttcctgc ctgttcttcc ctcagtggaa cctacagtga ccgtctcccc 901 atccaggact gaggctctaa accaccacaa cctgctggtc tgctcggtga cagatttcta 961 tccaggccag atcaaggttc ggtggttccg gaatgaccgg gaggagacag ccggcgttgt 1021 gtccacccct cttattagga acggggactg gaccttccag atcctcgtga tgctggaaat 1081 gaccccccag cgaggagatg tgtacacctg ccgcgtggag caccccagcc tccagagccc 1141 catctcggtg gaatggcgta agggcacttg gtttcctttc actgtgggcc caacaggtca 1201 gggcagacag agcttcccca gtccatgccc tctcatccct tgtccccacc atcactactg 1261 aactggaaat cacaggacac aagagtgctc atgcctccta gcacaggcat cagaagagcc 1321 aaatcacatt gtcttttcac atacagggaa ctcactctac acatcatggc cccagagccc 1381 agcctggtag ctctgtagaa ttgactggtg accacagtct taaggtctaa agttatagaa 1441 gtgtccctga gagcagggat ccactttcac cttctctcac ctgcccactg tgtccaaagc 1501 tctgttggca ggtcccttcc atgggggtgg tcagaatgga gagccaggtt cccctgacac 1561 ctccacctcc tgtaccttag actagacctc aagcttccta aagggacacc atgagatgtg 1621 gggacaaacg ctgatactca ggctctgctc cccaggggcg cagtctgaat ctgcccagag 1681 taagatgctg agtggtgttg ggggcttcgt gctggggctg atcttcctca gtctgggcct 1741 catcatccat cacaggagcc agaagggtaa ggaactctgg ggaaatggga aaatgggctg 1801 tgattgagac cctctgttca gggcagtgtc tgctctgggt gtagttcttt ccacctgatc 1861 ctgaaacagg gaggcggggc tggggggtgg gaggaaatgg acaacttagg gagacattgg 1921 agtctgactt tactggtttg agagggtagc cctgccatgg agatggcagt ggaatttatt 1981 ctaagactac cttaaagttc atcattgtct catgggctcc tttctgaaaa cttcctgcat 2041 taagaggatc agagcactgg tttcctttcc ttctagtgac agcttcattc atttggggga 2101 attttagctt agggcagtta aggcctgtgg tcaatggata aaaaagaaat acattttccc 2161 ttgagtagct tatctcattt ccctttggat gagtaaggca gtggctgttt gtgtcacgag 2221 agcttttgta ggacattgtg ggcccctacc agcaggtgct ctgtttctgt tctgtgtcag 2281 tggggttgtg ggataaagga gggaagggtg tgaggtgagt gtacctgggt gcagtgtctc 2341 gttcatgacc tgttccctac tatggaatca ggagttaggg aagaagtttc tgtaggaggt 2401 tctgtaggaa gccataactt tgtggcatct ttctgtgaaa cttggagcca gagctctggt 2461 ttgaaagata gataccagga tgttgcctac tttgtgccac atgttggtgg ctactgcctg 2521 tgggcattta taagtgactg aatgtggtag aaagaaggtg agctatcatt aaatttacta 2581 aaaaattgaa atcttcacat cccccagaag gacaatagcc gctcccctgc ttcccatgcc 2641 tccttgccag gttgaagtgt gggtcctgtg tgctgatcac tctctctctt ctacagggct 2701 cacgcgctga ctcctgagga tattttggga ttggtgtttg ctcttcttta atcatgtgcc 2761 tgttccgccc ggaattcnnn cttctgagat cagagttggt caccaggtca tttcccgtgg 2821 ccatccacca accaaggctc tggctgtgat gctgcttcct ccactgaccc tggagcctct 2881 gactgtgcat tttcagctgc atctactcag accccaagag cttcctctgc tcccattttt 2941 acccaacaga gtgtgcaaaa gaagcacatt gaaataattt accaggctgt agagtttttt 3001 tccctttaat cataattaaa catgattatg aggttatctg tactctgaac ttccttaaat 3061 gagtaggagg gtagggaacc actgcagaat gaagggaaac tattttgggg tgaaccagcc 3121 aag // LOCUS SHPMHDRB 2277 bp ds-DNA MAM 30-MAY-1990 DEFINITION Ovine MHC class II OLA-DRB beta-chain pseudogene, partial cds. ACCESSION M33306 KEYWORDS cell surface glycoprotein; class II gene; major histocompatibility complex; pseudogene. pseudogene. integral membrane protein. SOURCE Ovine (strain Merino) DNA, clone C5-2, library CH.28. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 2277) AUTHORS Scott,P.C. and Brandon,M.R. TITLE Nucleotide sequence of ovine MHC class II beta genes: DQB and DRB JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.C.Scott, 28-MAR-1990. Univ of Melbourne, Veterinary Preclinical Centre, Parkville, Victoria, AUSTRALIA 3052 FEATURES from to/span description pept.ps / 891 1169 MHC OLA-DRB beta-chain, exon 3 1555 1665 MHC OLA-DRB beta-chain, exon 4 1914 1937 MHC OLA-DRB beta-chain, exon 5 2144 2155 MHC OLA-DRB beta-chain, exon 6 (alt.) 2232 / 2243 MHC OLA-DRB beta-chain, exon 6' (alt.) IVS < 1 890 MHC OLA-DRB intron B (no splice consensus) IVS 1170 1554 MHC OLA-DRB intron C (no splice consensus) IVS 1666 1913 MHC OLA-DRB intron D (no splice consensus) IVS 1938 2143 MHC OLA-DRB intron E (alt.; no splice consensus) IVS 1938 2231 MHC OLA-DRB intron E' (alt.; no splice consensus) IVS 2156 > 2277 MHC OLA-DRB intron F (alt.; no splice consensus) IVS 2244 > 2277 MHC OLA-DRB intron F' (alt.; no splice consensus) BASE COUNT 485 a 437 c 447 g 513 t 395 others ORIGIN 1 ccaacgtcgc taaaaaagag agactgtata aaatgatcag ttcagtaaaa agactgctta 61 ttttgcaaag ggggcctcct tggtggctga tacagcaaag aatgtgcctg caatgtgggg 121 gacccaggtt caaactctag gttgggaagt tcccctggag aagggaatgg ctaccctctc 181 cagtattctt gtctggagaa ttccatggac agagctgctg tccagtgggt agcaaagttg 241 gacacaactg agtgactaac actttcactt tcacttcatt ttgcaaagga gagaaattga 301 tcagctgagt gtaattgcac gtgttctttt tttaaaaaat aaatgactat ttgctccttt 361 ccttctcctt cagagctcnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 481 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngtcaaccct 601 aaataccttg caaagattgg taaacacatc aggtcataat gaataattct tttgggctat 661 gaaaaaacaa aacaaaatta atttcctcca catctctgaa atctaacagt cccgtattgt 721 aagaattaaa gtctaagctc atggttctaa ctcctgttgg gtcacatggc cagttttcag 781 agaggtctgc caagactgga gcatgggact agagactagg ggcagaaagg cagccaggga 841 gatttactct gccttcctca ctgatttcct ccactttttc tctcccagtg gagcctacag 901 tgactgtgta tcctgaaaag acccagcccc tgcagcacca cgacctcctg gtctgctctg 961 tggatggttt ctatctgggc catgctgaag tcaggtggtt ctggaaaggc caggaagagg 1021 aggctggggt gatctccaca ggcctgatcc agaacggaga ctagaacctc cagaccatgg 1081 tgatgcttga aatggttcct cagagtggag aggtctacac ctgccaagtg gagcacccca 1141 gccggacgag ccctctcaca gtggaacgag tgagaagctt ctgatcttgt gagttcctac 1201 caccgtggag ggggcttgtt ttccctgagt gtcaggtgcc tctctccaca ccatgtgttc 1261 atttgctcat gtttctcnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1381 nnnnnnnnnn nntgatcgac ttagcgtgat ttttcacact tctttctatg cagtggtcag 1441 ttattttagt tcttccggct gccctccctc attcctgaag ctctcagtct cagagtctca 1501 atcagcaaag ttgtatttca gataaaaatc acaaaaccat ggtttctttt ctcaggggca 1561 cagtctgact ctgcttagag taagatgatg agtggaatag gggactttgt tctgggtttg 1621 ctctttcttg tggtggggct gttcacccac ttcaggaatc agaaacgtga gagcctgttt 1681 acagggggat cctcatannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1741 nnnnnnnnnn nnnnnnnnnn nnnntcgggt cgctctcttt ctgccctgga tcatctggtg 1801 tggttgcagt gtcgagaaat ttcagatggg gctttgggtc tgaggacatt gaatctggta 1861 acagaattgc cttcatatct ggtaaatctg tatatcttcc cttcagtttc ccaggacact 1921 ctggacttca gccaacaggt catgcccttt aatcctcttt tagagataga tttggtctct 1981 ggctcagtgg taaagaacta tcctgccaat gcagagacac aagagacatg ggtttcatcc 2041 ctgagtcggg aatatctctt ggaataggaa atagcaacac actccagtat tcttgcctgg 2101 aaaatttcat ggaccagagg agcctgggac ctacagtccc ctagggttga aaagagtcag 2161 acatgattgt gtgagtgcat gggcacaagc gcaccctgta cacacacaca cacacacaca 2221 cacacacaca cggtctccta gaattgatcg tagagatgac aagacaggag acagaca // LOCUS ASGALDAA 3497 bp ds-DNA PLN 30-MAY-1990 DEFINITION A.niger aldehyde dehydrogenase (aldA) gene, complete cds. ACCESSION M32351 KEYWORDS aldehyde dehydrogenase. SOURCE A.niger DNA. ORGANISM Aspergillus niger Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 3497) AUTHORS O'Connell,M.J. and Kelly,J.M. TITLE Physical characterization of the aldehyde-dehydrogenase-encoding gene of Aspergillus niger JOURNAL Gene 84, 173-180 (1989) STANDARD simple staff_review FEATURES from to/span description pept 1187 1251 aldehyde dehydrogenase (aldA), exon 1 1321 2522 aldehyde dehydrogenase, exon 2 2580 2649 aldehyde dehydrogenase, exon 3 2703 2859 aldehyde dehydrogenase, exon 4 pre-msg 1158 2969 aldA mRNA and introns (alt., + 1 bp) pre-msg 1158 3016 aldA mRNA and introns (alt., +/- 1 bp) IVS 1252 1320 aldehyde dehydrogenase intron A IVS 2523 2579 aldehyde dehydrogenase intron B IVS 2650 2702 aldehyde dehydrogenase intron C BASE COUNT 833 a 995 c 806 g 863 t ORIGIN 1 agatctcact agtcagtggc taattggact atacagggat gtttgcaaag tgcgtgaact 61 agcatctact aagtactaat ccccatcatg aaaggaacaa ggggactggc tgactaaaat 121 gttacagctt gcaggtcaac aaactagctc aagagactag attccacttg gtccccgcca 181 cttacaccga ggtggagggg ttccggaaat acgaaaagag cttgctccaa tcgctgtgag 241 taatattccc tagtgagtta caaaggagat gactgataac tcactactaa gaggttgcct 301 agctactagc ctcgtcttct cccggccggc gaaggtcctc cattcttctt cggatgatgt 361 aatgaagaaa ccagatgtaa ctgactagta accatatcaa gtcaagacac gataaccatc 421 cccttccact accactacca gtaccagtac cagtaccaca gggactcttc gatggatgaa 481 tactagttag ttacacggta aaatggctga cacaggggga atctgggtgg tggtggtggt 541 ggtggtggtg gccaggaaaa agagccacgg atagcgaaac cccgcacgat cgctgccgat 601 tgttggatag agagtgggga acgggctggc atgtgggaat tcacctactt ttcaggttcc 661 cttcatgccc tggtcactta cgctttaccg cgatacaata taatatgatg gttacttata 721 ggggtctcgg gagtacgtag gttaactcgc ggtgggtttc gggctcaagg agcttggtct 781 ctccaatcgg agcatctaga tgagggggtg gttatccgcg gagggattag aatgaggtaa 841 atggcaatgg atgagacccg gaatttctcc catggatccg actagaccag tccaattgaa 901 ctaaactaaa ctaaacccag tggacgagga tgatctccgc tggcgtttcc ccctcgcgcc 961 tcgcaagctt cccctccctt caaccatcaa ccatcaaacc aacttctctt tcctctttct 1021 ctctcttccc ccccccactt ccaccccacc actagtacaa agtaactaaa cttcctgtct 1081 gctttattaa ttgatagctc tccccctctc cgctcttctt tcttcttctc ttccctctct 1141 ctctttcttt ctcttctaac caacacctcc tctcccatat ctcatcatgt ctgacctctt 1201 cgccaccatc acgaccccca acggcgtcaa atacgaacag cctctgggct tgtatgttat 1261 ccccgcgtcc cgtcctctcg atatcgcgct tttggaaact aaccactgtt tactttacag 1321 gttcattgac ggtgaattcg tcaagggcgc ggaaggcaag accttcgaga ctatcaaccc 1381 cagcaacgag aagcccattg tcgccgttca cgaagccacg gagaaggatg tggatactgc 1441 cgttgccgct gctcgcaagg ccttcgaggg ctcctggcgc caggtcactc cttccacccg 1501 tggccgcatg ctgaccaagc tggccgacct tttcgagcgc gatgccgaga tcctggcttc 1561 catcgaagct ctggacaatg gcaagtctat caccatggcc cacggtgata tcgctggcgc 1621 cgccggttgc ctgcgctact atggtggctg ggccgacaag atccatggtc agaccatcga 1681 caccaactcc gagaccttga actacactcg ccacgaaccc atcggtgtct gcggacagat 1741 cattccctgg aacttcccct tgctgatgtg ggcctggaag attggtcctg ccatcgctac 1801 cggtaacact gttgttatca agactgctga gcagactcct ctgtccggtc tctatgccgc 1861 gaacgtgatc aaggaggctg gcatccccgc tggtgtggtt aacgtcatct ccggtttcgg 1921 tcgtgtggcc ggttccgcca tctctcacca catggacatc gacaaggtcg ccttcactgg 1981 ctccaccctt gttggccgta ctatcctcca ggccgccgcc aagagcaacc tcaagaaggt 2041 gaccctcgag ctgggtggca agtcccccaa cattgtcttc aacgatgccg acattgacaa 2101 cgctatctcc tgggccaact tcggtatctt ctacaaccac ggtcagtgct gctgtgctgg 2161 ttctcgtatc ctggttcagg aaggcatcta cgacaagttc attgcccgtc tcaaggagcg 2221 tgctctccag aacaaggtcg gtgacccctt cgccaaggac accttccaag gtccccaggt 2281 ttcgcagctc cagttcgacc gcatcatgga atacatccag cacggtaagg acgccggtgc 2341 taccgtggct gttggtggtg agcgtcacgg cacagagggt tacttcatcc agcctaccgt 2401 cttcaccgac gtcacctccg acatgaagat caaccaggag gagatcttcg gccccgtcgt 2461 cactgtccag aagttcaagg acgtcgagga cgctatcaag atcggcaaca gcacttctta 2521 tggtgagttc tttttcgcca ctaaattgca gtgtgtcaag tgctaatctt tcgtcctagg 2581 tctcgctgcc ggtatccaca ccaaggatgt caccaccgcc atccgtgtct ccaacgccct 2641 ccgtgctggg taagctatac ttaacactac attcctgtat tcattcacta acatgcacat 2701 agaaccgtct gggtcaacag ctacaacctg atccaatacc aggttccctt cggaggattc 2761 aaggagtccg gtatcggtcg cgagctcggc tcgtacgccc tcgagaacta cacccagatc 2821 aaggccgtgc actaccgtct gggcgacgct ctcttctaat ttaatatttg aaaccatgag 2881 aaaatgggaa gagttcgctt cagcgacgag ctcgtggata ttgggttacc tttttcgcag 2941 atagccatga ctggttcatg caaaacaatc aggatacgaa tttacgatga ttacaaattg 3001 ttcttacatt gcagtaactg tagacctttg taaatataat ctgtctcaag cttcaacctc 3061 cacataatct tggtactggc tataatcact ccctagtggt taactaagga cgaatcacca 3121 ccccttccac tcgttcaatt gcaatcgacc tccatcaggc cattgcatat cgtacccatt 3181 gttacacacc caaattcccg ggacaaccac cggcgagttt gagtagcctg ggtctagtct 3241 cgggagaatt actacgccga gaagagagac cccctcgggt cgaagatggc ccgaagcgag 3301 ggcattcatc agcgaataca caaggaccga atgattggtt atgtcaccga cgttgatgat 3361 gatcagccat tcaaaaagac agctaatttc aatcttcagt acttcatagg ttgattgatt 3421 acttgctggt gccgaaaata ggtattcggg aactataatc tcctatcttg tcagttctgg 3481 tactagaatg catcgat // LOCUS POABETA1 2293 bp ds-DNA PLN 30-MAY-1990 DEFINITION P.agilis beta-1 tubulin (beta-1-tub) gene, complete cds. ACCESSION M33372 KEYWORDS beta-1 tubulin. SOURCE P.agilis (strain Aragoa) DNA. ORGANISM Polytomella agilis Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Dunaliellaceae. REFERENCE 1 (bases 1 to 2293) AUTHORS Conner,T.W., Thompson,M.D. and Silflow,C.D. TITLE Structure of the three beta-tubulin-encoding genes of the unicellular alga, Polytomella agilis JOURNAL Gene 84, 345-358 (1989) STANDARD simple staff_review FEATURES from to/span description pept 233 250 beta-1 tubulin (beta-1-tub), exon 1 327 353 beta-1 tubulin, exon 2 432 555 beta-1 tubulin, exon 3 624 848 beta-1 tubulin, exon 4 974 1911 beta-1 tubulin, exon 5 IVS 251 326 beta-1 tubulin intron A IVS 354 431 beta-1 tubulin intron B IVS 556 623 beta-1 tubulin intron C IVS 849 973 beta-1 tubulin intron D BASE COUNT 524 a 578 c 464 g 727 t ORIGIN 1 tttttttgta aggtttaaga ggtcatttgc ttcctaaatg gagtttaaaa aaattaaatg 61 tgtaggtgct ggttgattat atgcttccgg ctcgtatgtt gtgtggaatg tgagcggata 121 acaatttcac acaggaaaca gctatgacat gattacgaat tatatatatg tgccgtcctc 181 ccaaaaaatt taaattgaac tacgcctgtt ttgtcaattt aaccggtagc aaatgcgtga 241 aatcgttcat gtgagtaata tttatttgat atctgccact tactctcctt attctaatag 301 catcttctcc ttttcctcac ctacagattc aaggtggcca gtgtggtaac caggtgtgtt 361 tttcgccctc ttgcctgcta aatgttcgtt ttgactctat caacacctct aacatgcttc 421 tctccctgta gattggtgcc aagttctggg aagtcgtttc tgacgagcat ggcattgacc 481 ccaccggtac tgctcttggt gacagcgacc tccagcttga gcgcatcaac gtctacttca 541 acgaggccac tggaagtaag taatttgagc ccctcctcaa tatttttttc agctccaatt 601 aaaatctttt tttctcactc caggccgcta tgtcccccgt gccatcctca tggatcttga 661 gcccggtacc atggattccg tccgctctgg tccctatggc cagattttcc gccctgacaa 721 ctttgtcttc ggtcagactg gtgccggtaa caactgggct aagggccact atactgaggg 781 tgctgagctc attgactccg tccttgatgt tgtccgcaag gaggctgagt cctgcgattg 841 ccttcagggt aagaatttga tatcttttat ttctatgatt cttagattat tttttcaacc 901 taacatagtt aatatttcaa attgtatctc actgttcatt tttataaaat taatttttat 961 tatgtcttta aaggcttcca ggtctgccac tccctcggtg gtggtaccgg ctccggtatg 1021 ggaaccctcc ttatttccaa gatccgtgag gagtaccctg accgcatgat gctcaccttc 1081 tccgtcgtcc cctcccccaa ggtgtctgat accgtcgttg agccctacaa cgccaccctc 1141 tccgttcatc agctcgttga gaacgccgat gagtgcatgg tcctcgataa cgaggctctc 1201 tacgatatct gcttccgcac ccttaagctc accaccccca ccttcggtga ccttaaccat 1261 cttatctccg ccgtcatgtc cggcatcacc tgctgcctcc gcttccccgg tcagctcaac 1321 gctgatctcc gtaagcttgc cgtcaacctt atccccttcc cccgtctcca tttcttcatg 1381 gttggtttca cccccctcac ctcccgtggc agccagcagt accgtgccct caccgtcccc 1441 gagcttaccc agcagatgtg ggatgccaag aacatgatgt gcgccgctga tccccgccac 1501 ggtcgttacc tcaccgcctc cgcccttttc cgtggtcgta tgtccaccaa ggaggtcgat 1561 gagcagatgc ttaacgtcca gaacaagaac tcttcttact tcgttgagtg gatccccaac 1621 aatgtcaagt cctccgtttg cgatatccct cctaagggtc ttaagatgtc tgctaccttc 1681 attggtaaca gcactgctat ccaggagatg ttcaagcgtg tctctgagca gttcactgcc 1741 atgttcaggc gcaaggcttt cttgcattgg tatactggtg agggtatgga tgagatggag 1801 ttcactgagg ccgagtccaa catgaacgat ctcgtctccg agtaccagca gtaccaggat 1861 gcctctgctg aagaggaggg cgagttcggt gaggaagagg aggagaacta aactatcaca 1921 aactatttgt gaaactagcc gctgatttta aggtttatgt agaaggggga tacccacaat 1981 ttttgcttta atgattcagt ttatgtttaa ctgtatgtat gtttttttat aatgaatagc 2041 agtcactgtc cactctataa tgtaatttta ttttgttttc ttcactaata gcgtaatgcc 2101 gagagcttgt aatgataaag gacgttaatt acgaagagta atatagatat aaatattttt 2161 tacatgatgt ttataattat aataacaata aaatactttt tgactcttct aacgttaatg 2221 ttattattct ctatcctcgg tcacgtttac gtttttttta ccctttgttt tatttcctgc 2281 cgcatcttta taa // LOCUS POABETA2 2121 bp ds-DNA PLN 30-MAY-1990 DEFINITION P.agilis beta-2 tubulin (beta-2-tub) gene, complete cds. ACCESSION M33373 KEYWORDS beta-2 tubulin. SOURCE P.agilis (strain Aragoa) DNA. ORGANISM Polytomella agilis Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Dunaliellaceae. REFERENCE 1 (bases 1 to 2121) AUTHORS Conner,T.W., Thompson,M.D. and Silflow,C.D. TITLE Structure of the three beta-tubulin-encoding genes of the unicellular alga, Polytomella agilis JOURNAL Gene 84, 345-358 (1989) STANDARD simple staff_review FEATURES from to/span description pept 233 277 beta-2 tubulin (beta-2-tub), exon 1 377 500 beta-2 tubulin, exon 2 565 789 beta-2 tubulin, exon 3 910 1847 beta-2 tubulin, exon 4 IVS 278 376 beta-2 tubulin intron A IVS 501 564 beta-1 tubulin intron B IVS 790 909 beta-1 tubulin intron C BASE COUNT 479 a 530 c 439 g 673 t ORIGIN 1 tttttttgta aggtttaaga ggtcatttgc aagatcagag ctagcgtatc cttatgaatc 61 gctcctacac atcttatagt gtggttacga aaatcaaatt tggagtagtt tggatcccat 121 ggatatttac tataaactag gtgtccaaac caaatatttc aaaagctatt ttttgaagct 181 ccgattgcag agacttcatt ttccttcata atatattagt agaaaaacaa ccatgcgtga 241 aattgttcat attcagggtg gccagtgcgg taaccaggta aattattaat cgttttttat 301 tctttgcact tattgcacag tcctcttaag ccctacgtct aattcttttc cttttttact 361 attcaacctc ctgtagattg gtgccaagtt ttgggaggtc gtctctgacg agcacggcat 421 tgaccccacc ggtactgcaa ttggtgacag cgatcttcag ctcgagcgta tcaacgttta 481 ctttaacgaa gccaccggca gtatgtatac ccgtattctt aagaatccaa tattttttcc 541 tgatgtttta cttgatatat acaggccgtt atgtcccccg tgccatcctc atggatcttg 601 agcctggtac tatggactcc gttcgttctg gtccctatgg tcagatcttc cgtcctgaca 661 acttcgtctt tggccagact ggtgccggta acaactgggc taagggtcat tataccgagg 721 gtgccgagct catcgactcc gtccttgatg ttgtccgcaa ggaggccgag tcttgcgact 781 gtctccaagg tagatttctt tttgcatttt tctttttcaa atttgcgtat taattcttca 841 acctattaaa ctttgacttt tccgatttgc tgaaatattt tttctcaatg ttcctttttt 901 ttgtttcagg tttccaggtt tgtcactctc tcggtggtgg tactggctct ggtatgggaa 961 ctctccttat ttccaagatc cgtgaggagt accctgaccg tatgatgctc accttctccg 1021 tcgtcccctc ccccaaggtg tctgataccg tcgttgagcc ctacaacgcc accctctccg 1081 ttcatcagct cgtcgagaac gccgatgagt gcatggtcct cgataacgag gctctctacg 1141 atatctgctt ccgcaccctt aagctcacca cccccacctt cggtgacctt aaccacctta 1201 tctccgccgt catgtccggc atcacctgct gcctccgctt ccccggtcag ctcaacgctg 1261 atctccgtaa gcttgctgtc aaccttatcc ccttcccccg tctccacttc ttcatggttg 1321 gtttcacccc cctcacctcc cgtggcagcc agcagtaccg tgctctcacc gtccccgagc 1381 ttacccagca gatgtgggat gccaagaaca tgatgtgcgc cgctgatccc cgccacggtc 1441 gttacctcac tgcctccgcc cttttccgtg gtcgtatgtc caccaaggag gtcgatgagc 1501 agatgcttaa cgtccagaac aagaactctt cttacttcgt tgagtggatc cccaacaacg 1561 tcaagtcctc cgtttgcgat atccctccta agggtcttaa gatgtctgct accttcattg 1621 gtaacagcac tgccatccag gagatgttca agcgtgtctc tgagcagttc actgccatgt 1681 tcaggcgcaa ggctttcttg cactggtaca ctggtgaggg tatggatgaa atggagttca 1741 ctgaggccga gtccaacatg aacgatctcg tctccgagta ccagcagtac caggatgcct 1801 ctgctgagga agagggcgag ttcggtgagg aagaagagga gaactaagcg ctcttttttt 1861 gtgagaatta attgaaacgc tcgatatatt tttacaaaga attagaggag tgataggtta 1921 aagcctgatc ttaatttttc tttgctgaat atgtattgat gtgttgtagt attagtcatg 1981 tgagtatgtt tgtaatatag taaaccttta tgcatctttt cgtttttata tgtagcagga 2041 gttaaaaact aaacttttga attttccaaa taacttttca tccttaataa attgtaatat 2101 tggtatatgc tttatagcca c // LOCUS POABETA3 2202 bp ds-DNA PLN 30-MAY-1990 DEFINITION P.agilis beta-3 tubulin (beta-3-tub) gene, complete cds. ACCESSION M33371 KEYWORDS beta-3 tubulin protein. SOURCE P.agilis (strain Aragoa) DNA. ORGANISM Polytomella agilis Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Dunaliellaceae. REFERENCE 1 (bases 1 to 2202) AUTHORS Conner,T.W., Thompson,M.D. and Silflow,C.D. TITLE Structure of the three beta-tubulin-encoding genes of the unicellular alga, Polytomella agilis JOURNAL Gene 84, 345-358 (1989) STANDARD simple staff_review FEATURES from to/span description pept 233 250 beta-3 tubulin (beta-3-tub), exon 1 325 351 beta-3 tubulin, exon 2 429 552 beta-3 tubulin, exon 3 620 844 beta-3 tubulin, exon 4 984 1921 beta-3 tubulin, exon 5 IVS 251 324 beta-3 tubulin intron A IVS 352 428 beta-3 tubulin intron B IVS 553 619 beta-3 tubulin intron C IVS 845 983 beta-3 tubulin intron D BASE COUNT 494 a 552 c 464 g 692 t ORIGIN 1 tttttttgta aggtttaaga ggtcatttgc ttcctaaatg gagtttaaaa aaattaaatg 61 tgtaggtgct ggttgattat tacttttatg gatcaatact caaaactcga aagtgtatca 121 aattcataac gacactaaaa attcgagaat tgtcgtacaa cctatggcct tatataaatc 181 atgctcccct tttaaaaatc aacttgttta gtttttagcc aaataaataa ccatgcgtga 241 aatagttcat gtaagttgca attgttgttg attttgccgt tttgttcata ttcttattct 301 ttttgctgtt ttctgcctcc acagattcaa ggtggccagt gcggtaacca ggtggcttct 361 gccagttttg gctccctttc ctcagctttt gaatacatga gcctttctaa cttgtcttct 421 tactgcagat cggtgccaag ttctgggagg tcgtctctga tgagcacggc atcgacccca 481 ccggtaccgc ccttggtgac agcgacctcc agcttgagcg catcaacgtc tacttcaatg 541 aggccactgg aagtaagtaa tcagcgtgtt gccaatattt tttggggctt ggatttacta 601 tttttttact tctcttcagg ccgttatgtc ccccgtgcta tcctcatgga tcttgagccc 661 ggtaccatgg attccgtccg ctctggtccc tatggtcaga ttttccgccc tgacaacttt 721 gtcttcggcc agactggtgc cggtaacaac tgggctaagg gtcactacac tgagggtgct 781 gagctcattg actctgtcct tgatgttgtc cgcaaggagg ctgagtcctg cgattgcctt 841 cagggtaaga ttatttttat tttccacttt tatgttcctt ataatagggt ggggcattaa 901 ttttgtatta ttcagcacat ctctcgtata tttaaatata taaatatatt tcttatttaa 961 aatttttaaa taatatgtta caggcttcca ggtctgccac tctctcggtg gtggtaccgg 1021 ttccggtatg ggaaccctcc ttatttccaa gatccgtgag gagtaccctg accgcatgat 1081 gctcaccttc tccgtcgtcc cctcccccaa ggtgtctgat accgtcgttg agccctacaa 1141 cgccaccctc tccgttcatc agctcgttga gaacgccgat gagtgcatgg tcctcgataa 1201 cgaggctctc tacgatatct gcttccgcac ccttaagctc accaccccca ccttcggtga 1261 ccttaaccat cttatctccg ccgtcatgtc cggcatcacc tgctgcctcc gcttccccgg 1321 tcagctcaac gctgatctcc gtaagcttgc cgtcaacctt atccccttcc cccgtctcca 1381 tttcttcatg gttggtttca cccccctcac ctcccgtggc agccagcagt accgtgccct 1441 caccgtcccc gagcttaccc agcagatgtg ggatgccaag aacatgatgt gcgctgctga 1501 tccccgtcac ggtcgttacc tcactgcctc cgcccttttc cgtggtcgta tgtccaccaa 1561 ggaggtcgat gagcagatgc ttaacgtcca gaacaagaac tcttcttact tcgttgagtg 1621 gatccccaac aatgtcaagt cctccgtttg cgatatccct cctaagggtc ttaagatgtc 1681 tgctaccttc attggtaaca gcactgccat ccaggagatg ttcaagcgtg tctctgagca 1741 gttcactgcc atgttcaggc gcaaggcttt cttgcattgg tacactggtg agggtatgga 1801 tgagatggag ttcactgagg ccgagtccaa catgaacgat ctcgtctccg agtaccagca 1861 gtaccaggat gcctctgctg aagaggaagg cgagttcggt gaggaggagg aggagaacta 1921 aagaactcaa ttgcatttca tgttttaaat tcaggaatga tctagagata ctttgctttt 1981 ttttttcttg cttgtagcac ctctcgtcgc gattttgaaa gcattatcaa tacatcctag 2041 tcagctatgt tcggtggatg atataatgaa gcttttgcac gacaatattt gtgtgctgaa 2101 ccgcttatga ctgaattaag gttatagata atttcagtgt aaattattag tgctccgcta 2161 ataccaaatc cattttttat tttatgtcca tgaagttata ta // LOCUS CELDAF1 5899 bp ds-DNA INV 30-MAY-1990 DEFINITION C.elegans serine/threonine protein kinase (daf-1) gene, complete cds. ACCESSION M32877 KEYWORDS serine protein kinase; threonine protein kinase; transmembrane protein. SOURCE C.elegans (strain N2) DNA, clones DR#[168,169], and cDNA to mRNA, clones DR#[113,114,118]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 5899) AUTHORS Georgi,L.L., Albert,P.S. and Riddle,D.L. TITLE daf-1, a Caenorhabditis elegans gene controlling dauer larva development, encodes a novel receptor protein kinase JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Authorin submission for [1] kindly submitted by L.L.Georgi, 16-MAR-1990. FEATURES from to/span description pept 268 313 serine/threonine protein kinase precursor 1 1693 1779 serine/threonine protein kinase precursor 2 2112 2239 serine/threonine protein kinase precursor 3 2972 3190 serine/threonine protein kinase precursor 4 3243 3999 serine/threonine protein kinase precursor 5 4182 4345 serine/threonine protein kinase precursor 6 4473 4739 serine/threonine protein kinase precursor 7 4785 5058 serine/threonine protein kinase precursor 8 5371 5438 serine/threonine protein kinase precursor 9 sigp 268 312 serine/threonine protein kinase signal peptide matp 313 313 serine/threonine protein kinase 1693 1779 serine/threonine protein kinase 2112 2239 serine/threonine protein kinase 2972 3190 serine/threonine protein kinase 3443 3999 serine/threonine protein kinase 4182 4345 serine/threonine protein kinase 4473 4739 serine/threonine protein kinase 4785 5058 serine/threonine protein kinase 5371 5435 serine/threonine protein kinase pre-msg 233 5899 daf-1 mRNA and introns IVS 314 1692 daf-1 intron A IVS 1780 2111 daf-1 intron B IVS 2240 2971 daf-1 intron C IVS 3191 3242 daf-1 intron D IVS 4000 4181 daf-1 intron E IVS 4346 4472 daf-1 intron F IVS 4740 4784 daf-1 intron G IVS 5059 5370 daf-1 intron H rpt 2302 2832 direct repeat rpt 2520 2634 direct repeat BASE COUNT 1759 a 1194 c 1121 g 1825 t ORIGIN Map position 29.2-29.4. 1 cttcttctct attcaattcc attcatagtg gactaggtga gccgctcgtc agtatctgct 61 cactgtctgc gtctctgctt ggctcttctc tttctctcca tgtgctccgg ctccccctcc 121 tcacactcaa ccacacacac acacacacgt atacatatct tcttatagtt ttgttgcatg 181 cttcatctct tttctcatct tttgccttct atgtactttg ctttttttcc agatctcctg 241 atgaactaaa ctcttgggaa ataaattatg aggatacggc acgtggtgtt ctgcctgctc 301 gcactggttt atggtgagta ctttgaggtt ttggtgtttg atagcaaaag tcatcaatct 361 atttttcggg ttaattgggt taatataggt attattatgc agctagatga tgatttggtt 421 ctatctaaac aaaaaagtgc aattaaaatc gaattctcac tggtttacat aaatttttag 481 ttataatcac aaatttaata gtttacgctc tatttttttt tttgacaaac tagatatagt 541 taaacatatt tgaaatttga tccaatatcc aatataagtg aaatcaagtt aaggttgtta 601 agtttgatat atgtagtagt aaatatatag aaatgttttt catatcaggt tcagccaatc 661 atattcgagt tgcgtagtgc ttatgcagtt gataggtttt ttggtataat ctagttatgt 721 tgtaactcta aaaaaatttt ctaattatca tcacgaatat gtatatcgtt taagaagtta 781 gaaaaaaaca ctgattattt ttttattagc tcaaaaagta tgtattagtc ccatttagct 841 ggcttttaga aattcgtgtt taattttgat tagtttgact atggaaattg tttgagccca 901 aagttaagca atttatgtta ttgcagatta gctgcagact ttttctatta gaatagaatc 961 ttattacata gttattattg cctaatttca gctatacaaa atgcgactta tcaacttatc 1021 attttagttg tagttgatcg cagttgaaga aaattgaaat gttattatga tcttaaaaaa 1081 atacatatca ctttaattat gagtgacttg aaattgctat tttttattgc aattcatagt 1141 tgatatatgt tcaaaaaagt caacaagtta actgatttgc tgatagtgaa tattaaattt 1201 agcgcaatga gcaattgatg caattaaatg ttagttgcag ttttattctg ttagtgggta 1261 tttagaaaat agctaggtgc aatttaaact aaattttcaa caatttagga atttaaactt 1321 tactttttag tcgcatcact tatctaggca ccccactttt taactttaaa agcgcctaac 1381 aataagtatc taagttgtcc tgtattttat tttaaaattg ccaatttgtt aattttttct 1441 ctatataatt aattatctat tgatatttac aaaatgtgcc gagtcctgcc cttccaaagg 1501 aaactagtgg ggtgcaatac caatagaaga gtacaaaatt tacagaagtc caaaataata 1561 tgcatcacgg ttgagagctt tttaaatttg ttatcagtat ttaggccttg gctcaaacac 1621 atcataaatt ggtacctgtg caagaaaatt aaccaaatac agctttctac gacccttttc 1681 aattttttcc aggcgccgaa acttccgacg atgatctcga cgaacggaca aatatcttca 1741 tccgtgataa actgattccg gcgttgaagc ttgcagaagg tagtttgttt ttttggactc 1801 aacagaaaat aatcgccggc actcaaatta gatggcattt atcgtcagaa gaagctttgg 1861 gcaaaatccc ccggtagcaa agcaaaataa tttcgttcga tcaaacatta aaagagattt 1921 tcaactatat atataatcag tgataagcct ataatcttta gtgtttcccc cgaaagcact 1981 aatctatggg aaatttcctt ttcgggggac gtgagtttat gcagaaaaaa aaattgaagt 2041 ttagtttatt tttggaagga tttttaatct tgtgataagg gatttaacaa gattttctga 2101 ttttcttgca gtgaccaagg tgaatttcac acgattacac ttatgccatt gctcaaggga 2161 agttggatgc aacgcgagga ctaccggatg ggtgcctgga atcgagttcc ttaatgagac 2221 ggatcgcagt ttttatgagg ttaggcttta accatgtggt cgagtttttc cttgcccgca 2281 atagatcctt attgaccaaa actacagtaa tcctacagta cctcaaccat cttcctctat 2341 tagacctgtc accaaagtat ttgacaaagc aaaaaaacaa gaataattta attactcgat 2401 aactccaaaa aacaacaaaa aattgtattt tggtttttta aatttcttta tctgattgtg 2461 aatattttcg ttttccccgc gatttcttta aaagatcaaa agactaagac taagataatc 2521 ttacagtacc ccgaccctat ccctccacta accttaatct cttcaaaaaa caatgagcat 2581 gttttctcct aagctacagt aatcctacag tacccctaca gtaccccagt actactacag 2641 taccccgacc ctatccctct actaatctta atctcttcaa aaaacaatga acatgttttc 2701 ttctaagcta cagtactcct acagtaccca tacagtaccc ttacaacact actacagtac 2761 ccgaccctat ccctctacta accttaatct cttcaaaaaa caaaaaaaac atgtttctcc 2821 tatactacag tactcttaca gtaccccaga ggtgtcctta taagtttgac tagtttttgt 2881 catttggcag taaaactacg gttcttctaa aatttcggtt acttggttac taggtctcaa 2941 ttctcacact agaccaatcc tctctttcca gaacacgtgc tacacggacg gaagctgcta 3001 ccagagcgct cggccgtcac ccgaaatctc acatttcgga tgcatggacg agaaatctgt 3061 gaccgacgag accgagtttc acgatacggc agcgaaagtg tgcacgaaca acacgaaaga 3121 cccgcacgct accgtctgga tttgctgtga taagggaaat ttttgtgcga acgagacgat 3181 catccacttg gtaagtgtga tctctttttt gacatccgga ttttgaattt tcaaattttc 3241 aggcgccagg cccgcagcag tcatccacgt ggctgattct tactatcctc gccctgctca 3301 cgttcatcgt gctcctgggc attgcaattt tcctcaccag aaaatcatgg gaggcgaaat 3361 tcgactggta tataagattc aagccaaaac ctggagatcc tctacgcgag acggagaaca 3421 atgtaccgat ggttacaatg ggcgacggtg ctggaagttc tgtgccagaa gttgctccaa 3481 tcgagcagca gggaagcaca atgagcacat ctgcagggaa ctcgttccct cctggaatca 3541 tgcctaataa catgaaggat atgcttgatg tactcgagga gacaagtgga tctggtatgg 3601 gtccaactac acttcacaaa ctcacgattg gtggacagat caggttaact ggtagagttg 3661 gatccggaag atttggtaac gtgtcacgtg gagactaccg aggtgaagcg gtcgctgtga 3721 aggttttcaa tgcgctcgac gagccggcgt tccacaagga gacggagatc tttgagacgc 3781 ggatgttgag gcatccgaat gtgctgagat atattggaag tgatcgggtt gatactggat 3841 ttgtaacaga actttggctg gtgacagagt atcatcccag cgggtccttg cacgattttc 3901 tgctcgagaa tacggttaac attgagacgt attataattt gatgaggtcg acggcaagtg 3961 gacttgcgtt cctgcacaat cagattggag ggagcaaagg taagtgcaat atacatgtgt 4021 ctcagaaatc taggagccta gactaggccc ctgagaagaa aaatttggac caacggtgtc 4081 tgaaactttt ctattcagag agtagtctag gacgccaact gagtccgctg gccagacgta 4141 ctctgtcgtt gaacatatat taatcaaaat acaaattcca gagtccaaca agccagcgat 4201 ggctcaccgg gatatcaaga gcaagaacat catggtcaag aacgatctga cgtgtgccat 4261 cggagatctc ggtctctcgt tatccaagcc agaagacgcg gcatcagata ttattgccaa 4321 tgagaactac aagtgcggta ctgttgtgag ttttgggatc taggccattt tagtcagaga 4381 gctcaaggct cgagaccctg ccaagacgga ggttttggcg taaagtctag ctataatctt 4441 ttttcaaaat tttcacaaat taaatttttt agagatacct tgcaccggaa atcctgaaca 4501 gtacgatgca gttcaccgtc ttcgagagct accaatgtgc cgatgtctac tcattttctc 4561 tagtcatgtg ggaaacccta tgcagatgtg aagatggaga tgttctgccg agagaagcgg 4621 ccacagtgat cccatatatt gagtggaccg atcgggatcc acaagacgct cagatgtttg 4681 atgtggtttg cacgcggagg ctcagaccta cagagaatcc tctgtggaag gatcatccgg 4741 taagataata aactttatgt aaagtacttc taactacttt tcaggaaatg aaacacatta 4801 tggagattat caaaacgtgc tggaatggca atccgagcgc ccgtttcacc tcctatattt 4861 gccgcaaacg aatggatgaa cggcagcaac tgttgctgga caagaaggca aaagctgtgg 4921 cccagacggc aggggtgaca gttcaggatc ggaagatttt gggtccccag aaaccgaaag 4981 acgagtcacc ggcaaatggt gctccgcgga tagttcagaa ggagattgat cgagaagatg 5041 agcaggaaaa ttggcggggt aagaattttg atcaggctcc tattgtggac cgagctacta 5101 agcccggacc aaaaaaaaac caacattttg ggtatcatgt ctatctggtg ccttaaatac 5161 cctacgaatt tagaaacttt tctcaaaagc ttccaactaa acaaattatc tttaaccttg 5221 aaggtcagat ctacctacta cctaatacta tagtagcgga cacttaataa gtccaggtcc 5281 ctaacctagc atagtttata tgcccgccac cgtagaattt tgcgccttta aagtgaagaa 5341 cacttttcaa cataaattaa tttttttcag aaactgcaaa aactcccaat ggacacattt 5401 ccagtaatga cgactcgtca cgcccacttc ttggttagag cccattttat cttcgttctt 5461 aggcgccaat atatcaaaat tattataatt attatgttct atatactaca cttatcacct 5521 ctcgtgactt cctcccaccc ccatatcgat gatctcattt ctcttcctca accactaact 5581 gtctgctacc tatttcttgt ttattagcgg gtgcattatg cagctaattg tccggcgccc 5641 catcattcta aaagagactc ctcaccgtgt acctcagaaa aacataaaaa tcttgtgata 5701 ggtatgtact gcataagctt tattaccctc ccccgaaact ttgtagtttg tagtttaccc 5761 tgtttttttt ctgtaccatg atttcaagtg tttttgaaat tttattttgt ttttattgtt 5821 attattatta ctattttcaa aaaaggagaa aacacatttc aatgcaaaaa ctgaaaatga 5881 aaataaaaaa tttgatttt // LOCUS MUSIGHAAI 318 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 3-14-9. ACCESSION M32760 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 3-14-9, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 318 Ig H-chain V-D-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start BASE COUNT 92 a 69 c 86 g 71 t ORIGIN 1 gaggtgaagc ttctcgagtc tggaggtggc ctggtgcagc ctggaggatc cctgaatctc 61 tcctgtgcag cctcaggatt cgattttagt agatactgga tgagttgggc tcggcaggct 121 ctagggaaag ggcaggaatg gatcggagaa atcaatctag gaagcagtac gataaactat 181 acgccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac 241 ctgcaaatga gcaaagtgag atctgaggac acagcccttt attactgtgc aagactactg 301 gccaaggcta tggactac // LOCUS MUSIGHAAJ 318 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 1-5-1. ACCESSION M32761 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 1-5-1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 318 Ig H-chain V-D-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start BASE COUNT 93 a 67 c 82 g 76 t ORIGIN 1 gaggtgaagc ttctccagtc tggaggtggc ctggtgcagc ctggaggatc cctgaaactc 61 tcctgtgcag cctcaggaat cgattttagt agatactgga tgagttgggt tcggcgggct 121 ccagggaaag gactagaatg gattggagaa attaatccag atagcagtac atttaactat 181 gcaccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac 241 ctgcaaatga gcaaagtgag atctgaggac acagcccttt attactgtgc aaaaaactgg 301 gactggtact tcgatgtc // LOCUS MUSIGHAAK 258 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 2-28-9. ACCESSION M32762 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 2-28-9, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 258) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 258 Ig H-chain V-D-region (AA at 1) recomb 234 235 V-region DNA end/D-region DNA start BASE COUNT 84 a 52 c 61 g 61 t ORIGIN 1 tcctgtgcag cctcaggaat cgattttagt agatactgga tgagttgggt tcggcgggct 61 ccagggaaag gactagaatg gattggagaa attaatccag atagcagtac aataaactat 121 gcaccatctc taaaggataa attcatcatc tccagagaca acgccaaaaa tacgctgtac 181 ctgcaaatga gcaaagtgag atctgaggac acagcccttt attactgtgc aaaaaactgg 241 gactggtact tcgatgtc // LOCUS MUSIGHAAO 243 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 3-27-6. ACCESSION M32759 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 3-27-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 243) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 243 Ig H-chain V-D-region (AA at 1) recomb 219 220 V-region DNA end/D-region DNA start BASE COUNT 82 a 46 c 58 g 57 t ORIGIN 1 ggaatcgatt ttagtagata ctggatgagt tgggttcggc gggctccagg gaaaggacta 61 gaatggattg gagaaattaa tccagatagc agtacaataa actatgcacc atctctaaag 121 gataaattca tcatctccag agacaacgcc aaaaatacgc tgtacctgca aatgagcaaa 181 gtgagatctg aggacacagc cctttattac tgtgcaaaaa actgggactg gtacttcgat 241 gtc // LOCUS MUSIGHAAP 246 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 2-1-3. ACCESSION M32763 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 2-1-3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 246) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 246 Ig H-chain V-D-region (AA at 1) recomb 222 223 V-region DNA end/D-region DNA start BASE COUNT 78 a 50 c 57 g 61 t ORIGIN 1 tcaggaatcg attttagcag atactggatg agctgggtcc ggcgggctcc agggaaaggg 61 ctagaatgga ttggagaaat taatccaaat agcagtacaa taaactatgc accatctcta 121 aaggataaat tcatcatctc cagagacaac gccaaaaatg cgctgtacct gcaagtgagc 181 aaagtgagat ctgttaacac atccctttat tactgtgcaa gatcttacgg tagctggttt 241 gcttac // LOCUS MUSIGHAAQ 261 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-region, hybridoma 10L16-5. ACCESSION M32764 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 10L16-5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 261) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 261 Ig H-chain V-D-region (AA at 1) recomb 219 220 V-region DNA end/D-region DNA start BASE COUNT 83 a 49 c 63 g 66 t ORIGIN 1 ggattcgatt ttagtagata ctggatgatt tgggtccggc aggctccagg gaaagggcta 61 gaatggattg gagaaattaa tccagatagc agtacgataa actatacgcc atctctaaag 121 gataaattca tcatctccag agacaacgcc aaaaatacgc tgtacctgca aatgagcaaa 181 gtgagatctg aggacacagc cctttattac tgtgcaagaa aggggtattt ttactacggt 241 agtagctcgt acttcgatgt c // LOCUS MUSIGHAAY 291 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 3-27-6. ACCESSION M32766 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 3-27-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1) recomb 285 286 V-region DNA end/J-region DNA start BASE COUNT 85 a 73 c 61 g 72 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa 241 gaagatattg ccacttactt ttgccaacag ggttatacgc ttcctcggac g // LOCUS MUSIGHABA 291 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 1-5-1. ACCESSION M32768 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 1-5-1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1) recomb 285 286 V-region DNA end/J-region DNA start BASE COUNT 84 a 72 c 60 g 75 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttcctcggac g // LOCUS MUSIGKAAX 288 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-region. ACCESSION M32765 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 288) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 288 Ig kappa-chain V-region (AA at 1) BASE COUNT 83 a 70 c 60 g 75 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatgtattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccagttacta ttgtcagcag tatagtaagc ttcctcca // LOCUS MUSIGKAAZ 291 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 3-14-9. ACCESSION M32767 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 3-14-9, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1) recomb 285 286 V-region DNA end/J-region DNA start BASE COUNT 86 a 73 c 61 g 71 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca ggacattagc aatttattaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaagat tacactacgg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa 241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac g // LOCUS MUSIGKABB 291 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 2-28-9. ACCESSION M32769 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 2-28-9, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1) recomb 285 286 V-region DNA end/J-region DNA start BASE COUNT 84 a 73 c 61 g 73 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca gggcattagc aattatttaa actggtcaca gcagaaacca 121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttcctcggac g // LOCUS MUSIGKABC 291 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 2-1-3. ACCESSION M32770 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 2-1-3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig kappa-chain V-J-region (AA at 1) recomb 285 286 V-region DNA end/J-region DNA start BASE COUNT 83 a 72 c 61 g 75 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttcctcggac g // LOCUS MUSIGLKABD 246 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region, hybridoma 10L16-5. ACCESSION M32771 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) hybridoma 10L16-5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 246) AUTHORS Victor-Kobrin,C., Barak,Z.T., Bonilla,F.A., Kobrin,B., Sanz,I., French,D., Rothe,J. and Bona,C. TITLE A molecular and structural analysis of the V-H and V-K regions of monoclonal antibodies bearing the A48 regulatory idiotype JOURNAL J. Immunol. 144, 614-624 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 246 Ig kappa-chain V-J-region (AA at 1) recomb 240 241 V-region DNA end/J-region DNA start BASE COUNT 74 a 54 c 52 g 66 t ORIGIN 1 ggagacagag tcaccatcag ttgcagtgta agtcagggca ttagcaatta tttaaactgg 61 tatcagcaga aaccagatgg aactgttaaa ctcctgatct ataacacatc aactttatac 121 tcaggagtcc catcaaggtt cagtggcagt gggtctggga cagattattc tctcaccatc 181 agtaacctgg aacctgaaga tattgccact tattattgtc agcagtatag taagcttcct 241 ccgacg // LOCUS HUMACTAF 1123 bp ds-DNA PRI 30-MAY-1990 DEFINITION Human alpha-actin (ACTA) gene, non-coding exon 1. ACCESSION J05193 KEYWORDS alpha-actin. SOURCE Human placental DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1123) AUTHORS Reddy,S., Ozgur,K., Lu,M., Chang,W., Mohan,S.R., Kumar,C.C. and Ruley,H.E. TITLE Structure of the human smooth muscle alpha-actin gene: Analysis of a cDNA and 5' upstream region JOURNAL J. Biol. Chem. 265, 1683-1687 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Reddy, 20-DEC-1989. FEATURES from to/span description pre-msg 892 > 1123 alpha-actin mRNA and introns /hgml_locus_uid="LW0057W" /nomgen="ACTA" /map="1p21-qter" IVS 935 > 1123 ACTA intron A signal 863 869 TATA box BASE COUNT 286 a 258 c 258 g 321 t ORIGIN 1 gaattcgaga cgagatttgg gtggggacgt agaaccaaac catatcacct ggtctctcta 61 cttcctgtca aggaggttag tgggcagaga ggagggctag aggcttcctt tgaacaatct 121 cctttctttt ccaaactact tctttgacag gctgctgggt agactctctg gtcaaaggat 181 ggtccctact tatgctgcta aattgctcgg tgacaaatta gtagacaaag ctaatgcacc 241 aaaaaaatga atgtagttat agtaatgcta acatccaaat tcctctttgt aagacatagg 301 cctgtcaacc ttgtctccat acttcaattc ctatttccac tcacctccct caagaacttg 361 atttataaac agtgtgccta ccataaaatc atcactccct ctatgtattt atagacgact 421 gaaggaatat ctttcttctt tgacatgcta ccgtggtaga agggttttaa aagtccgtgc 481 taggcagagg cagccctttc tgcccctttc tgttctcagt ttattaggaa atggcctgaa 541 attccagcat gatagcaagc tggcatcctc tgtggaatgt gcaaaccatg cctgcatctg 601 cccattaccc tagctcagtg tctctgggca tttctgcagt tgttctgaag gcttggcgtg 661 tttatctccc acaggcggct gaaccgcctc ccgtttcatg agcagaccag tggaatgcag 721 tggaagagac ccaggcctcc ggccacccag attagagagt tttgtgctga ggtccctata 781 tggttgtgtt agactgaaca gcaggctcaa gtctgtcttt gctccttgtt tgggaagcaa 841 gtgggaggag agcagaaggg gctatataac ccttcagctt tcagcttccc tgaacaccac 901 ccagtgtgga gcagcccagc caagcactgt cagggtaagt ggcgccaggc caaggatgtg 961 acttatagat tccagtggct cttttaatta cccggtataa taagacatca tctgcaggga 1021 tttggctggg ttcatgcact gatatttctg aatgaagatt gtactactaa aatgattgta 1081 gcttttggct ttaatgatct aacgttaaag acagggctaa tat // LOCUS HUMACTAM 1329 bp ss-mRNA PRI 30-MAY-1990 DEFINITION Human alpha-actin (ACTA) mRNA, complete cds. ACCESSION J05192 KEYWORDS alpha-actin. SOURCE Human umbilical artery, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1329) AUTHORS Reddy,S., Ozgur,K., Lu,M., Chang,W., Mohan,S.R., Kumar,C.C. and Ruley,H.E. TITLE Structure of the human smooth muscle alpha-actin gene: Analysis of a cDNA and 5' upstream region JOURNAL J. Biol. Chem. 265, 1683-1687 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Reddy, 20-DEC-1989. FEATURES from to/span description pept 49 1182 alpha-actin /hgml_locus_uid="LW0057W" /nomgen="ACTA" /map="1p21-qter" mRNA < 1 1329 ACTA mRNA signal 1305 1310 poly-A signal BASE COUNT 326 a 361 c 327 g 315 t ORIGIN 1 gcagcccagc caagcactgt acaggaatcc tgtgaagcag ctccagctat gtgtgaagaa 61 gaggacagca ctgccttggt gtgtgacaat ggctctgggc tctgtaaggc cggctttgct 121 ggggacgatg ctcccagggc tgttttccca tccattgtgg gacgtcccag acatcagggg 181 gtgatggtgg gaatgggaca aaaagacagc tacgtgggtg acgaagcaca gagcaaaaga 241 ggaatcctga ccctgaagta cccgatagaa catggcatca tcaccaactg ggacgacatg 301 gaaaagatct ggcaccactc tttctacaat gagcttcgtg ttgcccctga agagcatccc 361 accctgctca cggaggcacc cctgaacccc aaggccaacc gggagaaaat gactcaaatt 421 atgtttgaga ctttcaatgt cccagccatg tatgtggcta tccaggcggt gctgtctctc 481 tatgcctctg gacgcacaac tggcatcgtg ctggactctg gagatggtgt cacccacaat 541 gtccccatct atgagggcta tgccttgccc catgccatca tgcgtctgga tctggctggc 601 cgagatctca ctgactacct catgaagatc ctgactgagc gtggctattc cttcgttact 661 actgctgagc gtgagattgt ccgggacatc aaggagaaac tgtgttatgt agctctggac 721 tttgaaaatg agatggccac tgccgcatgg tcatcctccc ttgagaagag ttacgagttg 781 cctgatgggc aagtgatcac catcggaaat gaacgtttcc gctgcccaga gaccctgttc 841 cagccatcct tcatcgggat ggagtctgct ggcatccatg aaaccaccta caacagcatc 901 atgaagtgtg atattgacat caggaaggac ctctatgcta acaatgtcct atcagggggc 961 accactatgt accctggcat tgccgaccga atgcagaagg agatcacggc cctagcaccc 1021 agcaccatga agatcaagat cattgcccct ccggagcgca aatactctgt ctggatcggt 1081 ggctccatcc tggcctctct gtccaccttc cagcagatgt ggatcagcaa acaggaatac 1141 gatgaagccg ggccttccat tgtccaccgc aaatgcttct aaaacacttt cctgctcctc 1201 tctgtctcat gcacacaact gtgaatgtcc tgtggaatta tgccttcagt tcttttccaa 1261 atcattccta gccaaagctc tgactcgtta cctatgtgtt ttttaataaa tctgaaatag 1321 gctactggt // LOCUS ASOTAAG1 2935 bp ds-DNA PLN 30-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G1) gene, complete cds. ACCESSION M33218 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 2935) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 607 772 Taka-amylase A (Taa-G1) precursor, exon 1 828 868 Taka-amylase A precursor, exon 2 955 1070 Taka-amylase A precursor, exon 3 1140 1248 Taka-amylase A precursor, exon 4 1317 1545 Taka-amylase A precursor, exon 5 1603 1765 Taka-amylase A precursor, exon 6 1830 1976 Taka-amylase A precursor, exon 7 2041 2281 Taka-amylase A precursor, exon 8 2360 2647 Taka-amylase A precursor, exon 9 sigp 607 669 Taka-amylase A signal peptide matp 670 772 Taka-amylase A 828 868 Taka-amylase A 955 1070 Taka-amylase A 1140 1248 Taka-amylase A 1317 1545 Taka-amylase A 1603 1765 Taka-amylase A 1830 1976 Taka-amylase A 2041 2281 Taka-amylase A 2360 2644 Taka-amylase A pre-msg 543 > 2789 Taa-G1 mRNA and introns IVS 773 827 Taa-G1 intron A (no splice consensus) IVS 869 954 Taa-G1 intron B IVS 1071 1139 Taa-G1 intron C IVS 1249 1316 Taa-G1 intron D IVS 1546 1602 Taa-G1 intron E IVS 1766 1829 Taa-G1 intron F IVS 1977 2040 Taa-G1 intron G IVS 2282 2359 Taa-G1 intron H signal 2784 2789 poly-A signal BASE COUNT 818 a 752 c 657 g 708 t ORIGIN 1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact 61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca 121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca 181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag 241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat 301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat 361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt 421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc 481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg 541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag 601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct 661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat 721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt 781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg 841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg 901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac 961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc 1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat 1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga 1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt 1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct 1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct 1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact 1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct 1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga 1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc 1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg 1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt 1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca 1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt 1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca 1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca 1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa 1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag 2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg 2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa 2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc 2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa 2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga 2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg 2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga 2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt 2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg 2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag 2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag 2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga 2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca 2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc 2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg // LOCUS ASOTAAG2A1 197 bp ds-DNA PLN 30-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 5' end. ACCESSION M33220 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 197) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 195 > 197 Taka-amylase A (Taa-G2) precursor pre-msg 131 > 197 Taa-G2 mRNA and introns signal 2 11 CAAT box signal 95 100 TATA box BASE COUNT 59 a 52 c 36 g 50 t ORIGIN 1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa 61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg 121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc 181 cacagaaggc atttatg // LOCUS ASOTAAG2A2 198 bp ds-DNA PLN 30-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 3' end. ACCESSION M33222 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 198) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 Taka-amylase A (Taa-G2) (AA at 1) pre-msg < 1 151 Taa-G2 mRNA and introns (alt.) pre-msg < 1 156 Taa-G2 mRNA and introns (alt.) BASE COUNT 45 a 48 c 53 g 52 t ORIGIN About 2.1 kb after segment 1. 1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg 61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac 121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc 181 gtcctccaga ggcgagga // LOCUS ASOTTAM1 191 bp ss-mRNA PLN 30-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 3' end. ACCESSION M33219 KEYWORDS Taka-amylase A. SEGMENT 1 of 2 SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4]. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 191) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 189 > 191 Taka-amylase A (Taa) mRNA 125 > 191 Taa mRNA BASE COUNT 56 a 52 c 32 g 51 t ORIGIN 1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca 61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt 121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga 181 aggcatttat g // LOCUS ASOTTAM2 156 bp ss-mRNA PLN 30-MAY-1990 DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 5' end. ACCESSION M33221 KEYWORDS Taka-amylase A. SEGMENT 2 of 2 SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4]. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 156) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 Taka-amylase A (Taa) (AA at 1) mRNA < 1 151 Taa mRNA (alt.) mRNA < 1 156 Taa mRNA (alt.) BASE COUNT 40 a 35 c 42 g 39 t ORIGIN About 1.5 kb after segment 1. 1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg 61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac 121 ggcaccgggc cccgactgag agcccgtctc gaatcc // LOCUS CIPNADGAPD 1354 bp ss-mRNA PLN 30-MAY-1990 DEFINITION M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH) mRNA, complete cds. ACCESSION J05223 KEYWORDS D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase; glyceraldehyde-3-phosphate dehydrogenase. SOURCE M.crystallinum, cDNA to mRNA. ORGANISM Mesembryanthemum crystallinum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Aizoaceae. REFERENCE 1 (bases 1 to 1354) AUTHORS Ostrem,J.A., Vernon,D.M. and Bohnert,H.J. TITLE Increased expression of a gene coding for NAD:glyceraldehyde-3- phosphate dehydrogenase during the transition from C-3 photosynthesis to crassulacean acid metabolism in Mesembryanthemum crystallinum JOURNAL J. Biol. Chem. 265, 3497-3502 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 55 1068 glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.1.12) BASE COUNT 315 a 308 c 345 g 386 t ORIGIN 1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct 61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc 121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac 181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag 241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg 301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga 361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt 421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac 481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg 541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc 601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg 661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt 721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg 781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct 841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat 901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga 961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac 1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga 1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc 1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt 1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta 1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc 1321 agcatttatg aaacattgct gtgagctttt gagt // LOCUS ECOPRIA 2658 bp ds-DNA BCT 30-MAY-1990 DEFINITION E.coli primosomal protein n' (priA) gene, complete cds, and cytR gene, 5' end. ACCESSION M33293 KEYWORDS cytR gene; priA gene; primosomal protein. SOURCE E.coli (strain W3110) DNA, clone pEL042. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2658) AUTHORS Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A. TITLE The priA gene encoding the primosomal, replicative n' protein of E.coli JOURNAL Unpublished (1990) Stanford U., Beckman Cntr., Stanford, CA 94305 STANDARD full automatic COMMENT Authorin sequence for [1] kindly submitted by G.C.Allen Jr., 26-MAR-1990. FEATURES from to/span description pept 64 2262 primosomal protein n' pept 2418 > 2658 cytR gene product signal 27 32 -10 region signal 5 10 -35 region binding 53 59 ribosome binding site BASE COUNT 578 a 738 c 756 g 586 t ORIGIN 1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat 61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg 121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag 181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag 241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga 301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc 361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt 421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa 481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa 541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa 601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg 661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc 721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa 781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg 841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc 901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt 961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt 1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac 1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc 1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt 1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg 1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac 1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt 1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac 1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc 1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg 1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg 1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca 1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg 1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc 1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac 1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa 1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg 1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg 2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg 2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg 2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt 2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg 2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg 2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg 2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg 2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata 2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac 2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc 2641 cggatatctg cgatcccc // LOCUS EWCTELRNA 657 bp ds-DNA INV 30-MAY-1990 DEFINITION E.crassus telomerase RNA component gene, complete cds. ACCESSION M33461 KEYWORDS telomerase RNA. SOURCE E.crassus DNA. ORGANISM Euplotes crassus Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; Spirotricha; Heterotrichida; Clevelandellina. REFERENCE 1 (bases 1 to 657) AUTHORS Shippen-Lentz,D. and Blackburn,E.H. TITLE Functional evidence for an RNA template in telomerase JOURNAL Science 247, 546-552 (1990) STANDARD simple staff_entry FEATURES from to/span description RNA 152 342 telomerase RNA component site 186 197 functional telomeric template BASE COUNT 202 a 122 c 111 g 222 t ORIGIN 1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga 61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt 121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg 181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg 241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt 301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg 361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt 421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat 481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga 541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac 601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt // LOCUS HS4DWXJ 160 bp ds-DNA VRL 30-MAY-1990 DEFINITION Epstein-Barr virus defective WZhet junction. ACCESSION M33474 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, clinical sample 9) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 160) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 87 90 WZhet junction BASE COUNT 41 a 45 c 41 g 33 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc 61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact 121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac // LOCUS HS4WXJ 181 bp ds-DNA VRL 30-MAY-1990 DEFINITION Epstein-Barr virus WZhet junction, HR-1 clone 5. ACCESSION M33473 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 181) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 108 111 WZhet junction BASE COUNT 50 a 42 c 46 g 43 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt 61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca 121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga 181 c // LOCUS HUMREGA 4251 bp ds-DNA PRI 30-MAY-1990 DEFINITION Human regenerating protein (reg) gene, complete cds. ACCESSION J05412 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4251) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues: The reg protein, pancreatic stone protein, and pancreatic thread protein are one and the same product of the gene JOURNAL J. Biol. Chem. 265, 7432-7439 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. FEATURES from to/span description pept 1571 1634 regenerating protein (reg), exon 2 (first expressed exon) 2270 2388 regenerating protein, exon 3 2696 2833 regenerating protein, exon 4 3549 3660 regenerating protein, exon 5 3856 3923 regenerating protein, exon 6 pre-msg 1196 4116 reg mRNA and introns IVS 1224 1524 reg intron A IVS 1635 2269 reg intron B IVS 2389 2695 reg intron C IVS 2834 3548 reg intron D IVS 3661 3855 reg intron E signal 1169 1174 TATA box BASE COUNT 1161 a 927 c 869 g 1294 t ORIGIN 1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc 61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat 121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt 181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc 241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt 301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc 361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc 421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat 481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac 541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg 601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg 661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata 721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag 781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc 841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga 901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt 961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag 1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc 1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa 1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa 1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat 1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta 1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc 1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag 1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac 1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt 1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct 1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact 1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt 1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat 1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg 1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt 1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa 1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac 2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga 2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat 2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa 2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc 2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc 2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag 2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca 2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc 2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac 2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca 2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta 2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt 2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca 2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta 2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag 2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata 3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt 3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca 3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct 3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa 3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga 3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt 3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa 3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc 3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc 3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg 3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag 3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg 3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt 3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt 3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct 3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc 3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca 4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt 4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc 4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct 4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c // LOCUS HUMREGRELA 1524 bp ds-DNA PRI 30-MAY-1990 DEFINITION Human reg-related sequence, complete cds. ACCESSION J05413 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human esophageal mucosa DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1524) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues: The reg protein, pancreatic stone protein, and pancreatic thread protein are one and the same product of the gene JOURNAL J. Biol. Chem. 265, 7432-7439 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. BASE COUNT 382 a 368 c 345 g 429 t ORIGIN 1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt 61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt 121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg 181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat 241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg 301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga 361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata 421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat 481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag 541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa 601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc 661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac 721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta 781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca 841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt 901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat 961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct 1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc 1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga 1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc 1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct 1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct 1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc 1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg 1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc 1501 tataaatcat ataatgtaaa ggat // LOCUS MDPCGA 4801 bp ss-RNA VRL 30-MAY-1990 DEFINITION Aleutian mink disease parvovirus complete genome. ACCESSION M20036 KEYWORDS complete genome. SOURCE Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (bases 1 to 4592) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 4593 to 4801) AUTHORS Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W., Perryman,S. and Wolfinbarger,J.B. TITLE Nucleotide sequence of the 5' terminal palindrome of Aleutian mink disease parvovirus (ADV) and construction of an infectious molecular clone JOURNAL J. Virol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by M.E.Bloom, 18-MAR-1990. Sequence reported below is (+) strand. FEATURES from to/span description ORF 116 1978 Left ORF ORF 1983 2207 Middle ORF2 ORF 1993 2211 Middle ORF1 ORF 2241 4399 Right ORF BASE COUNT 1740 a 912 c 943 g 1206 t ORIGIN 1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt 61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt 121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag 181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact 241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt 301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact 361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc 421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac 481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt 541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta 601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag 661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa 721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc 781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga 841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta 901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt 961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga 1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg 1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa 1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt 1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga 1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt 1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga 1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa 1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg 1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta 1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat 1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac 1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg 1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa 1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa 1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca 1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt 1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca 2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa 2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg 2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata 2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta 2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc 2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag 2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg 2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg 2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt 2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact 2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag 2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata 2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta 2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa 2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt 2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg 3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc 3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct 3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta 3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt 3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg 3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa 3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg 3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt 3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac 3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga 3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat 3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac 3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt 3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc 3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg 3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg 3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac 4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg 4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa 4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag 4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag 4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg 4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact 4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc 4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta 4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac 4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt 4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt 4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct 4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg 4801 a // LOCUS MDPUPS 3454 bp ss-RNA VRL 30-MAY-1990 DEFINITION Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial sequence. ACCESSION M32981 KEYWORDS . SOURCE Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (sites) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3454, for [1]) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. JOURNAL Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840 STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by M.E.Bloom 18-MAR-1990. Sequence reported below is (+) strand. BASE COUNT 1268 a 673 c 698 g 815 t ORIGIN 1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc 61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac 121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac 181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta 241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa 301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga 361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat 421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt 481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa 541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta 601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat 661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag 721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga 781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat 841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc 901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact 961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt 1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac 1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac 1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa 1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta 1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag 1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac 1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg 1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa 1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac 1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca 1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga 1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc 1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg 1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta 1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt 1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga 1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag 2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag 2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac 2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt 2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga 2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc 2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg 2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat 2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta 2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg 2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa 2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg 2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta 2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc 2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa 2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt 2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca 3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg 3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct 3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga 3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt 3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc 3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt 3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa 3421 aacctatggt tactttactt ggagaggcaa gctt // LOCUS MUSAA2DEL 300 bp ds-DNA ROD 30-MAY-1990 DEFINITION Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion fragment. ACCESSION M33468 KEYWORDS deletion mutant. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 130 133 deletion breakpoint BASE COUNT 82 a 45 c 76 g 97 t ORIGIN 1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact 61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt 121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt 181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata 241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa // LOCUS MUSSL20JA 300 bp ds-DNA ROD 30-MAY-1990 DEFINITION Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion fragment. ACCESSION M33467 KEYWORDS deletion mutant. SOURCE Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description pept.ps / 42 122 dilute gene, exon 3 179 > 300 dilute gene, exon 5 recomb 129 130 deletion breakpoint BASE COUNT 86 a 51 c 61 g 102 t ORIGIN 1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac 61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac 121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta 181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta 241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc // LOCUS PRVVP4 2359 bp ds-RNA VRL 30-MAY-1990 DEFINITION Porcine rotavirus capsid protein VP4 gene, complete cds. ACCESSION M33516 KEYWORDS capsid protein VP4. SOURCE Porcine rotavirus (Gottfried strain; serotype 4) DNA. ORGANISM Porcine rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 2359) AUTHORS Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K. TITLE Similarity of the outer capsid protein VP4 of the Gottfried strain of porcine rotavirus to that of asymptomatic human rotavirus strains JOURNAL J. Virol. 64, 414-418 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 10 2337 capsid protein VP4 BASE COUNT 812 a 406 c 442 g 699 t ORIGIN 1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa 61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg 121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat 181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca 241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac 301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga 361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg 421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta 481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat 541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa 601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag 661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta 721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag 781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt 841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca 901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt 961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc 1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac 1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg 1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt 1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc 1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca 1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga 1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac 1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg 1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata 1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt 1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct 1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc 1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata 1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga 1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct 1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat 1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat 2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc 2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata 2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc 2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa 2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag 2341 ctctatagag gatgtgacc // LOCUS RATSTAA 1000 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds. ACCESSION M33329 KEYWORDS hydroxysteroid sulfotransferase a. SOURCE Rat (strain Sprague-Dawley) female liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1000) AUTHORS Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K., Yamazoe,Y. and Kato,R. TITLE cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a strong homology in amino acid sequence with the senescence marker protein SMP-2 in rat livers JOURNAL Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 38 892 hydroxysteroid sulfotransferase a (STa) mRNA < 1 1000 STa mRNA signal 977 982 poly-A signal BASE COUNT 299 a 187 c 211 g 303 t ORIGIN 1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga 61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa 121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg 181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt 241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa 301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct 361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc 421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta 481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc 541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa 601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga 661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc 721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg 781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt 841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa 901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa 961 gtgaatgtgg tcattgaata aattctattc tggcattgtg // LOCUS SMFPOLENV 3534 bp ss-RNA VRL 30-MAY-1990 DEFINITION Simian foamy virus type 1 polymerase (pol) gene, 3' end; and envelope (env) gene, complete cds. ACCESSION M33561 KEYWORDS envelope protein; polymerase. SOURCE Simian foamy virus type 1, cDNA to viral RNA. ORGANISM Simian foamy virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Spumavirinae. REFERENCE 1 (bases 1 to 3534) AUTHORS Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A. TITLE Relationship of the env genes and the endonuclease domain of the pol genes of simian foamy virus type 1 and human foamy virus JOURNAL J. Virol. 64, 406-410 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 582 polymerase (AA at 1) pept 524 3481 envelope protein BASE COUNT 1162 a 645 c 670 g 1046 t 11 others ORIGIN 1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac 61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta 121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat 181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta 241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc 301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg 361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc 421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt 481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg 541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac 601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag 661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg 721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat 781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt 841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt 901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt 961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt 1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt 1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt 1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca 1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac 1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca 1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa 1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa 1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga 1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg 1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc 1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt 1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca 1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa 1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg 1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt 1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa 1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc 2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac 2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa 2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag 2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg 2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt 2341 atatttactn nnnnnnnnnn tggtaaccct gatggaagct gcccttcatg atgtttcgat 2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat 2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt 2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta 2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga 2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt 2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa 2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt 2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg 2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg 2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt 3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc 3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt 3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa 3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg 3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc 3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg 3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact 3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg 3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat // LOCUS IPBFATPD 1035 bp ss-mRNA PLN 30-MAY-1990 DEFINITION Sweet potato F-1-ATPase (mitochondrial type) delta subunit mRNA, complete cds. ACCESSION J05397 KEYWORDS F-1-ATPase delta subunit. SOURCE I.batatas (strain Kokei No.4) tuberous root, cDNA to mRNA. ORGANISM Ipomoea batatas Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Convolvulaceae. REFERENCE 1 (bases 1 to 1035) AUTHORS Kimura,T., Takeda,S., Asahi,T. and Nakamura,K. TITLE Primary structure of a precursor for the delta-subunit of sweet potato mitochondrial F-1-ATPase deduced from full-length cDNA JOURNAL J. Biol. Chem. 265, 6079-6085 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Nakamura, 23-FEB-1990. FEATURES from to/span description pept 115 849 F-1-ATPase delta subunit precursor (EC 3.6.1.3) sigp 115 249 F-1-ATPase delta subunit signal peptide matp 250 846 F-1-ATPase delta subunit signal 964 969 poly-A signal BASE COUNT 283 a 225 c 238 g 289 t ORIGIN 1 catagggtaa actgcagcgt tctttgggct tcgttctcaa acccttccct tctgtctccg 61 aattctctcg cgccggcgaa cgattcagct tagagcaaag aagagctctc tgatatggca 121 atgacaggac gtgctagatc gatgggcttc tctattctcc agaaggccct ctcttcagcc 181 cagagatcta acgctcatcg atccattctc tgccctaccc tctcaaactc tgagctgctg 241 aggaactatg caactgcctc tgcttcaaaa gaacaaaaaa ttaaggtgcc tctgacaatg 301 tatggggttt ctggaaacta tgcatctgct ttgtaccttg cagcagtaaa gtctaacaca 361 ttagagaagg ttgagtcaga gctctatgac cttgttgagg cttcaaagaa gagtcctaca 421 ttttctcagt tcatgaggga tccatctgta cctgttgata ctagagtgaa tgccataaag 481 gaaatttgtg ctcaagctaa atttggggat accacccaga acttcttgct tattttggct 541 gaaaatggga ggctgaaaca cattgatcgc attgtgaagc gattcaaaga gctgaccatg 601 gcacacaggg gagaagtcaa agccactgtg acaacagtaa ttccccttcc cgctgatgag 661 gagaaagaat tgaaagccac attgcaggaa atggttggac agggaaagag tgttcaaatt 721 gaacagaaga ttgatccaac cattcttggt ggccttgttg tggaattcgg gcagaaggtc 781 tttgacatgt ctataaggac acgggcacgc cagatggaga gattcttgcg tgaaccactc 841 aacttttgat tgcctccaat cccgttagaa ggcgtttgtt tccatcttga ggaaagcacg 901 atttcggtct gggctccttg agctgcctgt tttccgataa acattgttct gctctttttt 961 gcaaataaag aaaacttgct ttttgtaaga atttgatggg aaaaaaagaa aacctaatgt 1021 tgtttttact tgctg // LOCUS YSCSAC7P 1500 bp ds-DNA PLN 30-MAY-1990 DEFINITION S.cerevisiae SAC7 protein gene, complete cds. ACCESSION M32335 KEYWORDS SAC7 protein. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 301 to 1500) AUTHORS Dunn,T.M. and Shortle,D. TITLE Null alleles of SAC7 suppress temperature-sensitive actin mutations in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 2308-2314 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 300) AUTHORS Dunn,T.M. JOURNAL Unpublished (1990) Univ. of Health Sciences, Bethesda MD 20814-4799 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1], [2] kindly submitted by T.M.Dunn, 23-FEB-1990. FEATURES from to/span description pept 606 1430 SAC7 protein BASE COUNT 526 a 313 c 288 g 373 t ORIGIN Chromosome 4R, between ADE8 and TRP4. 1 ggatcaccaa cttctctttc aaggaatgct caaccaacta ctttaaaacg gcctccatta 61 tcttcaagac catattcata taatacccca accaaagata ggaaatcctt ttccaaatcc 121 gcgaaacaga acaataacaa taacaacgct aattccggaa catcgcctca cgcagagttc 181 aaaaattaca gagatatgtt tttatctaat agaaatggtt tcacaggtag ggttttcggt 241 gttactttag cagaatcgtt gagcgtcgcc agtgcagagg tcattgttca aagtgagttg 301 gtgagttttg gtcggatacc catcgtggta gccaagtgcg gcgcatactt aaaagcaaat 361 ggattggaga cctcgggtat atttcgtata gcgggcaatg gtaaaagagt aaaagccctt 421 caatacatat tctcgtcgcc acctgattat ggtaccaaat tcaacgattg gaaacatata 481 cagtgcacga tgttgcatcg ctcctgagga gataccttaa taatttggcc gaaccactaa 541 tacctttatc cctatatgaa caattcagaa acccgctacg atctagacca agaatcctaa 601 ggcatatgtt gacccacgaa gtttctcatc cgaatgcgaa taaaacaaat aatgtaacag 661 taaaatcaag tagacagaac tataatgatg atggtgctaa tgatggtgac atcgaaaagg 721 aggacgccaa agatgatgaa gaaaaaagac gaagaaaaat acgtcataag agaaggctga 781 cccgcgatat aagggcagct atcaaggaat atgaagagct ctttgttacc ttatcaaatg 841 acacgaaaca gctaactata tacctactcg atttactgag tctttttgca aggcaatcac 901 agtttaacct aatgtctggt agaaacttgg cggccatctt ccaaccttca atattatcgc 961 atcctcaaca tgatatggac cccaaagaat acgagttatc ccgactagta gtggaatttt 1021 tgattgaata ctcgtacaag ctattacccc atcttttgaa gttggctaaa agggaacaac 1081 aggaacggtt gtcaaccgaa aataagaaaa ataatggaga taaacagaaa actgatccta 1141 tagaaatacc aaagatcacc tcatcggatt cgccaccaat agtttcttcc aataaaaacc 1201 caccggcgat tgacaataac aataaactag accatacgac gttatcgcca atatctactt 1261 cgataccgga gaactcatca gacctacaga cttcaaaaat gttaaagcct ccgaagcaaa 1321 gaagaccaca ctcaaaatcg tttggttcta ctccggttct ccggatgtta ttgctagtaa 1381 taaaagaaga acgagtttat tcccgtggtt acataaaccg ggaattttga gtgacacagg 1441 agacaatggc gacttaactg ctactgaagc tgaaggtgat gattatgaag aagaaaatgt // LOCUS BOVINSMN 960 bp ss-mRNA MAM 30-MAY-1990 DEFINITION Bovine inositol monophosphatase mRNA, complete cds. ACCESSION J05394 KEYWORDS inositol 1-phosphatase; inositol monophosphatase. SOURCE Bovine calf brain, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 960) AUTHORS Diehl,R.E., Whiting,P., Potter,J., Gee,N., Ragan,C.I., Linemeyer,D., Schoepfer,R., Bennett,C. and Dixon,R.A.F. TITLE Cloning and expression of bovine brain inositol monophosphatase JOURNAL J. Biol. Chem. 265, 5946-5949 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by P.Whiting, 07-FEB-1990. FEATURES from to/span description pept 61 894 inositol monophosphatase BASE COUNT 266 a 197 c 248 g 249 t ORIGIN 1 cggaacggcc gcgcgtctga caggctggtg cgcccgcggc cgcgcgcccg ccgccctgag 61 atggctgatc cttggcagga atgcatggat tatgcagtaa ccctggcagg acaagctgga 121 gaggtggttc gtgaagcgct caagaatgaa atgaatatta tggttaaaag ttctccggct 181 gatttggtaa ctgctactga ccaaaaagtt gaaaaaatgc ttatcacatc cataaaggaa 241 aagtatccat ctcacagttt cattggtgag gaatctgtgg cagctgggga aaaaagtatc 301 ttaaccgaca accctacatg gatcattgac cctattgatg gaacaactaa ctttgtacat 361 ggatttcctt ttgtagctgt ttcaattggc tttgtggtaa ataaaaagat ggagtttgga 421 attgtgtaca gttgcctgga ggataagatg tacactggcc ggaaaggaaa aggtgccttt 481 tgtaatggtc aaaaactaca ggtttcacac caagaagata ttaccaaatc tctcttggtg 541 acagagttgg gctcttccag aacaccagag actgtgagaa ttattctttc taatatagaa 601 aggcttcttt gccttcccat ccatgggatc cggggtgttg gaacagcagc tctgaacatg 661 tgcctggtgg ccgctggggc tgcggacgcg tattatgaaa tggggatcca ctgctgggat 721 gttgcaggcg ccggcatcat tgtgactgaa gctggcggag tgctcctgga tgtcacaggt 781 ggaccatttg atttgatgtc acgaagagta attgcttcaa gcaataaaac attagcagaa 841 aggatagcca aagaaattca gataatacct cttcaaagag atgatgaaga ttaactccga 901 cggcttcaca cattcaatca cagttgcgtc tctccaggtt ttctaatgca ctcatggtgt // LOCUS FVBPNGF 1350 bp ds-DNA BCT 30-MAY-1990 DEFINITION F.meningosepticum endoglycosidase F (PNGase F) gene, complete cds. ACCESSION J05411 KEYWORDS N-glycanase; endoglycosidase F; peptide-N-4-(N-acetyl-beta-D-glucosaminyl) asparagine amidase F. SOURCE F.meningosepticum (ATCC 33958) DNA, clone pGB29, subclone J. ORGANISM Flavobacterium meningosepticum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1350) AUTHORS Barsomian,G.D., Johnson,T.L., Borowski,M., Denman,J., Ollington,J.F., Hirani,S., McNeilly,D.S. and Rasmussen,J.R. TITLE Cloning and expression of peptide-N-4-(N-acetyl-beta-D- glucosaminyl) asparagine amidase F in Escherichia coli JOURNAL J. Biol. Chem. 265, 6967-6972 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Barsomian, 29-MAR-1990, for release after publication. FEATURES from to/span description pept 139 1203 endoglycosidase F precursor (EC 3.5.1.52) matp 259 1200 endoglycosidase F BASE COUNT 470 a 219 c 276 g 385 t ORIGIN 1 ggatctgtac tgagtattgt aaaggtttta taatttgatt aaaaatgaat ctttgttaat 61 aaatcagggt ttaatttgac ttttaaaacg atttcttaaa taaatacaat atattcgtat 121 aacaaatcgc aataaattat gagaaaacta ctaattttta gtatatctgc ttacttgatg 181 gcaggtatcg tttcgtgtaa aggtgtagac agtgcaacac ctgtaacaga agatcgctta 241 gctctgaatg cggtaaatgc tccggcagat aataccgtaa atattaaaac attcgacaaa 301 gtaaaaaatg cctttggtga cggattgtcc caaagtgcag aaggaacctt tacatttccg 361 gccgatgtaa caactgtaaa aacgattaag atgttcatca aaaatgaatg tcctaataaa 421 acctgtgatg aatgggatcg ttatgccaat gtttatgtaa aaaataaaac aacaggagaa 481 tggtatgaaa taggacgctt tattactcca tattgggtgg gtacggaaaa attacctcgt 541 ggactggaaa ttgatgttac cgatttcaaa tctttactgt ccggaaatac agaacttaaa 601 atttatacgg agacttgctt ggccaaagga agagaataca gtgtagactt tgatattgta 661 tatggtacac cggattataa atattcggca gtagtacctg taatccaata taacaaatca 721 tccattgatg gtgttcctta tggtaaagca catacactgg gattaaaaaa gaatattcag 781 ttaccaacaa acacggaaaa agcttatctt agaactacta tttccggatg gggacatgcc 841 aagccatatg atgcgggaag caggggctgt gcagaatggt gcttcagaac acatactata 901 gcaataaata atgcgaatac tttccaacac cagctgggtg ctttaggatg ttcagcaaac 961 cctattaata atcagagtcc gggaaattgg gctcctgaca gagcagggtg gtgtccggga 1021 atggcagtgc caacacgtat agatgtgttg aataactctt taacgggtag tacttttagt 1081 tatgaatata agttccagag ttggacaaac aacggaacca atggagatgc tttttatgca 1141 atttccagtt ttgtgattgc aaaaagtaat acacctatta gtgctccggt agttacaaac 1201 taaaaaacag catatattca tttatagata aaaccgcctt ggcaaggaat caattacttt 1261 tcgggtattt gcaaaaacac cccgcagatg aaaatctgcg gggtgtatga aaaaggtaag 1321 aagttagaga tatctttcct cagccgaaat // LOCUS POTPPIIIK 945 bp ds-DNA PLN 30-MAY-1990 DEFINITION Potato proteinase inhibitor IIK gene , 5' flank. ACCESSION M29965 KEYWORDS proteinase inhibitor IIK. SOURCE Potato (strain Russet Burbank) DNA. ORGANISM Solanum tuberosum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 945) AUTHORS Palm,C.J., Costa,M.A., An,g. and Ryan,C.A. TITLE Wound-inducible nuclear protein binds DNA fragments that regulate a proteinase inhibitor II gene from potato JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 603-607 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.An, 18-NOV-1989. FEATURES from to/span description mRNA 893 > 945 proteinase inhibitor IIK BASE COUNT 374 a 129 c 138 g 304 t ORIGIN 1 tcgacccaat tcaaagaact tgttcattaa ttattatttt tccagcaaaa tgaaaatttt 61 tagttaggat gataaaattt atccaaattt ctaccaccac ccaaaactaa atcttccaat 121 tcagaaaaaa tcattcatta caactcttgg atttgtttgt atgatgtgtt agaagcagaa 181 atgggtttaa aaggtgaatt aaatggtatt tttaactaaa gaaatgaatg cgggatatat 241 ttaagttttt tcagataatt catgaatatt tttatcctct ttccgtgtgc aataatagag 301 ttccaactta attatcacgt ggacttataa gaaaccgatg ctggatgata attatttaaa 361 aaaacaagca agtcggggtc agtacagctt gctgaatagg agaaattaaa gatagcaacc 421 aggaaaagtt aagagctttt ggctcctccg tccaattata accgtccact atatatattt 481 gagactcgta ttgagaaaca ataaataata agaatgatat tactatatta ccttttgaat 541 atattaaatt taatgctttg aaaaatatat ttgataatga ataatatcta atagcagcaa 601 gaataaaata gacacaaaca agtaaaatta ctcattgatc tttcaaatta gacaagtatt 661 atcggacatc tacttttagt atagtaaaca agtaaagatc ggataaagag agtaataaag 721 aagaagcaag cgtaagtacc ttgccaaaat aattaactaa caagcacatc tttttttttt 781 tatcaaatat taataaaaat aatttatatt aatatgaaga aaaaaaaggt tttagtttgc 841 tatctttttg atcactcgtt tgctataaat aggtggagga ggacagacac tcttcacccc 901 aaaattaaaa gaaaaagagg cagtactaat taattatcca tcatg // LOCUS CADPOLYP 1631 bp ss-RNA VRL 30-MAY-1990 DEFINITION Canine distemper virus polymerase-associated (P) protein and C protein mRNA, complete cds. ACCESSION M32418 KEYWORDS C protein; P protein; polymerase-associated protein. SOURCE Canine distemper virus, cDNA to viral RNA, clone pCDV 30. ORGANISM Canine distemper virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Paramyxoviridae; Morbillivirus. REFERENCE 1 (bases 1 to 1631) AUTHORS Barrett,T., Shrimpton,S.B. and Russell,S.E.H. TITLE Nucleotide sequence of the entire protein coding region of Canine distemper virus polymerase-associated (P) protein mRNA JOURNAL Virus Res. 3, 367-372 (1985) STANDARD simple staff_review FEATURES from to/span description pept 41 1564 polymerase-associated (P) protein pept 63 587 C protein BASE COUNT 526 a 367 c 396 g 342 t ORIGIN 1 tacccgatca tcattcaccg accacccgtt ctatccctaa atggcagagg aacaggccta 61 ccatgtcagc aaagggctgg aatgcctcaa agccctcaga gagaatcctc ctgacattga 121 ggagattcaa gaggtcagca gcctcagaga ccaaacctgc aacccaggcc aagagaatgg 181 aaccacaggc atgcaggaag aggaggactc tcagaatctc gatgaatcac acgagccaac 241 aaaaggatca aactatgtcg gccatgtacc tcaaaataat ccgggatgtg gagaacgcaa 301 tactgcgctt gtggaggcgg agcggccccc tagagaggac atccaaccag gacctggaat 361 acgatgtgat catgtttatg atcacagcgg tgaagaggtt aagggaatcg aagatgctga 421 cagtctcgtg gtacctgcag gcactgtcgg taatcgagga ttcgagagag gagaaggaag 481 ccttgatgat agcactgagg attctggcga agattattcc gaaggaaatg cttcatctaa 541 ctggggatat tctttcggcc ttaaaccgga cagagcagct gatgtgagca tgctgatgga 601 agaggaatta agtgctctac tcaggacaag cagaaatgta gggattcaga aaagggatgg 661 gaagactctg cagttcccac ataatcccga agtaagacaa gggatccgga gtgtggatcc 721 attaaaaagg ggcacagaag agaggtcagt ctcacatgga atggggatag ttgctggatc 781 gacaagtggt gcaacccaat ctgcactcaa gtcaactggg ggatcatcag agccaagtgt 841 ttctgcgggg aatgtccgcc aacctgcaat gaatgcaaag atgacccaga aatgcaaact 901 cgagtctggc acgcaactcc ctcccaggac ctcaaatgag gctgagtctg acagtgagta 961 cgatgatgag cttttctctg agatacaaga aattcgatct gccattacta aactaactga 1021 agataatcaa gcaatactta ctaaactgga taccttatta ctgcttaaag gagagactga 1081 ttcaattaag aaacaaatca gcaaacaaaa tattgctatt tccacgattg aggggcatct 1141 atcaagcatt atgatagcta tacctggttt tggaaaggac acgggagatc ctacggcaaa 1201 tgtcgacatt aacccagagc tccgccctat catagggaga gattcaggaa gaccactagc 1261 agaagttctc aagcagcccg catcatcccg cggtaatcgg aaggacagtg gtattactct 1321 gggctcaaaa ggtcaactat tgagagacct ccagctgaaa cccattgaca aagagtctag 1381 ctcggcaatc ggatacaaac cgaaggatac cgcaccttcc aaagctgtac ttgcatcatt 1441 gattagatca agcagagttg atcaaagtca caaacataac atgctggctc tgctcaaaaa 1501 tatcaagggg gatgacaacc taaacgagtt ctaccaaatg gtcaaaagta ttactcatgc 1561 ttaatctgta gcgttgacta atctactaac cggcgcaaaa ctgctttcac tatcgcttaa 1621 aagcaattat a // LOCUS ROHSEG4A 2359 bp ds-RNA VRL 30-MAY-1990 DEFINITION Human rotavirus segment 4 gene, complete cds. ACCESSION M32559 KEYWORDS . SOURCE Human rotavirus (strain RV-5, serotype 2) RNA. ORGANISM Human rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 2359) AUTHORS Kantharidis,P., Dyall-Smith,M.L. and Holmes,I.H. TITLE Marked sequence variation between segment 4 genes of human RV-5 and simian SA 11 rotaviruses JOURNAL Arch. Virol. 93, 111-121 (1987) STANDARD simple staff_review FEATURES from to/span description pept 10 2337 segment 4 protein BASE COUNT 871 a 343 c 413 g 732 t ORIGIN 1 ggctataaaa tggcttcgct catttataga caacttctca ctaattcata ttcggtagac 61 ttgcatgatg aaatagaaca gattggatcg gagaaaactc aaagtgtaac agtaaatcca 121 ggtccatttg cacaaaccag atacgctcca gttaattggg gacatgggga gattaatgat 181 tcaactacag tggaaccagt tttagatggt ccttatcaac ccactacatt caaaccaccc 241 aatgattatt ggttccttat tagttcaaat acagatggag tagtctacga aagtacaaat 301 aataatgatt tttggacagc agttatcgca gttgaaccac atgttagtca aacaaatagg 361 caatatattt tatttggtga aaataaacag tttaacgtag aaaacaattc agataaatgg 421 aaatttttcg aaatgtttaa aggtagtagt cagggtgatt tttctaatag acggactcta 481 acctctaaca atagacttgt agggatgcta aaatatggtg gaagagtatg gacatttcat 541 ggtgaaacac caagagctac tactgatagt tcaaatactg cggatttaaa taatatatca 601 attatgattc attcagagtt ttatattatt ccaagatccc aagaatctaa atgtaacgag 661 tatatcaaga atggtttacc accaattcag aatactagaa acgtagttcc attatcttta 721 tcatccagat ctattcagta taggagagca caagttaatg aagatattac aatttcaaaa 781 acttcattat ggaaggaaat gcaatataat agagatatta taataagatt taaatttggt 841 aatagtgtta taaaactagg aggattggga tataaatggt ctgaaatatc atataaagca 901 gcgaattatc agtatagtta ttcacgtgat ggtgaacaag ttactgcaca taccacttgt 961 tcagtaaatg gagtaaataa ttttagctat aatggaggtt cactacctac tgatttcagt 1021 atttcgagat atgaagttag taaagaaaat tcttatgtgt atatagacta ctgggatgat 1081 tcaaaagcat ttagaaatat ggtgtatgtt agatcgctag cagcaaattt aaattcagtg 1141 aaatgtacag gtgggagtta tgattttaga ttgcccgtag gtggatggcc tattatgaat 1201 ggtggtgctg tatcattaca ttttgctgga gttacattat ctacacagtt cactgatttt 1261 gtatcattaa attcactgcg atttagattc agtttaacag tagatgaacc atctttctca 1321 ataatacgaa cacgtacaat aaacttgtac ggattaccag cagctaatcc gaacaatgga 1381 aatgagtact atgaaatgtc aggaaggttt tcacttattt ctttagttct aaccaatgat 1441 gattatcaaa ctccaattat gaattcagta acagtaaggc aggatttaga acgccagctt 1501 aatgatttgc gagaagaatt taattcattg tcacaagaaa tagctatgtc acaattaatt 1561 gatttagcat tactaccttt agatatgttc tctatgtttt cgggaataaa aagtacaatt 1621 gatctaacca aatcaatggc aactagtgta atgaaaaaat ttagaaaatc gaaattagct 1681 acatcaattt cagaaatgac taattcattg tcagatgcgg cttcatcagc atcaagaagt 1741 gcttctatta gatcaaattt atctacgatt tcaaattgga ctaatacttc aaaaagtgtg 1801 tcaaatgtaa ctgactcagt aaatgatgtt tcaacacaaa catctacaat tagtaagaaa 1861 cttagattaa gagagatgat tactcaaact gaaggaatga gttttgatga tatttcagca 1921 gctgtactga aaacgaaaat agatatgtcc acacaaattg gaaaaaacac tttacctgat 1981 atagttactg aagcgtctga aaaatttatt ccaaaacgat catatcgagt attaaaagat 2041 aatgaagtaa tggaaattaa cactgaagga aagttttttg catataaagt ggatacactc 2101 aatgagatac catttgatat aaataaattc gccgaacttg tcacggattc tccagttata 2161 tcagcgataa tagactttaa gacgttaaaa aatttaaacg acaattatgg aattactcgt 2221 attgaagcgt ttaatttaat taaatcgaat ccaaatgtgc tacgtaattt cattaatcaa 2281 aataatccaa ttataagaaa tagaattgag cagttaattc tacaatgtaa gttgtgagaa 2341 tgtcatccag gatgtgacc // LOCUS MUSCAIA 1224 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse carbonic anhydrase I (CAI) mRNA, complete cds. ACCESSION M32452 KEYWORDS carbonic anhydrase; carbonic anhydrase I; zinc metalloenzyme. SOURCE Mouse anemic spleen, cDNA to mRNA, clone pMCAI. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1224) AUTHORS Fraser,P.J. and Curtis,P.J. TITLE Molecular evolution of the carbonic anhydrase genes: Calculation of divergence time for mouse carbonic anhydrase I and II JOURNAL J. Mol. Evol. 23, 294-299 (1986) STANDARD simple staff_review FEATURES from to/span description pept 46 831 carbonic anhydrase I BASE COUNT 358 a 299 c 234 g 333 t ORIGIN 1 gccagcatct tctctgaagc tgaatcacaa cctaaataag agaaaatggc aagtgcagac 61 tggggatatg gaagcgaaaa tggtcctgac caatggagca agctgtatcc cattgccaat 121 ggtaacaacc agtctcctat tgatattaaa accagtgaag ccaatcatga ctcctctctg 181 aaaccactca gcatctccta taatcctgca actgccaaag aaattgttaa cgtgggacat 241 tctttccatg taatttttga tgacagtagc aaccaatctg ttctgaaagg tggccctctt 301 gctgatagct atcggctcac tcagttccat tttcactggg gcaactcaaa cgaccatggc 361 tctgagcaca ccgtggatgg aactagatat tctggagagc ttcacttagt tcactggaat 421 tctgcaaagt actccagtgc ttctgaagcc atctccaagg ctgatggcct ggcaatcctt 481 ggcgttttga tgaaggttgg tccagccaac ccaagcctgc agaaagtact tgatgctcta 541 aactcagtta aaactaaggg aaaacgagcc ccattcacaa attttgaccc atccagtctg 601 cttccttcat ctctggatta ctggacctac tttggctctc tgactcaccc tcctcttcat 661 gaaagtgtga cctgggtgat ctgcaaggat agcatcagtc taagcccaga gcagctggcc 721 cagctccgtg gtcttctgtc aagtgcagag ggagagtctg cagttccagt tctgagcaac 781 caccgtccac cccaacccct gaagggcaga acagtcagag cctcattttg agtcccagca 841 aggaatgagt cctcaattat gacctggccc cctctacagc agcatccaat aaagggatca 901 cgattaagaa acagacttat gtcagtgcta gacagtatac ctgcaaattt aatccacaga 961 actaaaatgc tttcatttta attcctgata cttaatgcaa atactctata agtttgccta 1021 ctgtaatttt tatgcattcg tttttgctta aatacacctt tttgtggaag tcgttcataa 1081 ttacaactaa gtgattcatt ctctttctgc tctcttcact ttaaataaac caataaataa 1141 atcttttaaa aaaaaaaaaa aaaaaagctg agaaataagc gctgatagca gccttccagt 1201 tctgttcaag tatatggctg cctg // LOCUS RATGLOBBET 287 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Rat hemoglobin beta-chain mRNA, 3' end. ACCESSION M32509 KEYWORDS hemoglobin; hemoglobin beta-chain. SOURCE Rat reticulocyte, cDNA to mRNA, clone pBRrg X. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 287) AUTHORS Crkvenjakov,R., Bucan,M., Konstantinovic,M., Fogel,M., Savic,A. and Glisin,V. TITLE Characterization of two rat globin cDNA clones JOURNAL Hemoglobin 8, 597-611 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 158 hemoglobin beta-chain (AA at 3) BASE COUNT 67 a 72 c 68 g 80 t ORIGIN 1 agctgcatgt ggatcctgag aacttcaggc tcctgggcaa catgattgtg attatgatgg 61 gccaccacct gggcaaggaa ttcaccccga gtgcacaggc tgccttccag aaggtggtgg 121 ctggagtggc cactgccttg gctcacaagt accactaaac ctcttttcct gctctggtct 181 ttgtgcaatg gtcaattgtt ccctagagag catctgtcag ttgttgtcaa aatgacatcg 241 acctttgaaa atctgtccta ctaataaaag gcatttactt tcactgc // LOCUS RATHEMALPH 194 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Rat hemoglobin alpha chain mRNA, partial cds. ACCESSION M32510 KEYWORDS hemoglobin; hemoglobin alpha-chain. SOURCE Rat reticulocyte, cDNA to mRNA, clone pBRrg 5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 194) AUTHORS Crkvenjakov,R., Bucan,M., Konstantinovic,M., Fogel,M., Savic,A. and Glisin,V. TITLE Characterization of two rat globin cDNA clones JOURNAL Hemoglobin 8, 597-611 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 194 hemoglobin alpha-chain (AA at 3) BASE COUNT 35 a 73 c 39 g 47 t ORIGIN 1 tcgacctgcc tggtgccctg tccactctga gcgacctgca tgcccacaaa cttgatcctg 61 tcaacttcaa gttcctgagc cactgcctgc tggtgacctt ggcttgccac caccctggag 121 atttcacacc cgccatgcac gcctctctgg acaaattcct tgcctccgtg agcaccgtgc 181 ttacctccaa gtac // LOCUS HUMIGKVAE 1241 bp ds-DNA PRI 30-MAY-1990 DEFINITION Human germline kappa-chain gene (A2) V-region, exons 1 and 2. ACCESSION M31952 KEYWORDS germline; immunoglobulin light chain; kappa-immunoglobulin; variable region; variable region subgroup VK-II. SOURCE Human DNA, isolated from immune serum of Hib-PS vaccine recipients. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1241) AUTHORS Scott,M.G., Crimmins,D.L., McCourt,D.W., Zocher,I., Thiebe,R., Zachau,H.G. and Nahm,M.H. TITLE Clonal characterizaton of the human IgG antibody repertoire to Haemophilus influenzae type B polysaccharide: III. A single V-kappa-II gene and one of several JK genes are joined by an invariant arginine to form the most common L chain V region JOURNAL J. Immunol. 143, 4110-4116 (1989) STANDARD simple staff_review FEATURES from to/span description pept 325 373 Ig kappa-chain V-region, exon 1 742 / 1052 Ig kappa-chain V-region, exon 2 IVS 374 741 IgK intron A iDNA 1053 > 1241 intervening DNA (5' end put.) BASE COUNT 318 a 306 c 283 g 334 t ORIGIN 1 ccgacaagaa tttggaagcc ctgacatcct ataaaacgtt acttgcccaa gattgaaact 61 ttcaaattca ggtccttctt cctcctctat gatgaattag gttttattag tttcctccaa 121 gggacacttt atatcacatt gctcacagag aagacatatc tacccccttc accccccacc 181 caatggctct ttccacacca ctgcacccac caggtgattt gcatattgtc ccctagggag 241 gaccttccct tgtgaggcct gagataaaag ctcagctcta accttgcctt gactgatcag 301 gacttctcag ttcatcttct caccatgagg ctccctgctc agctcctggg gctgctaatg 361 ctctggatac ctggtaagga tggaaggaga tgagggagga ggagggggtg ggaagctgag 421 ctctggcggc cccactgatt cccgtgttta ttctaaccat gtgttaaagg aatatggcct 481 atgctccagg gagaggaatt catattttgc cctgatgatg atttgaaaac tcctaaaagc 541 agtgctctga ataatatctt gagaaatgaa agaactcttg tgcctattta ataaagggtt 601 catttaaaga gtttgttttt atgatatgaa tacaaatttg taaaaataaa agattagcca 661 taaatcaata ccataaggca aatctcaaaa gttgttcatt atgctttcac ataaccttgc 721 acttctctct cataatttca ggatccagtg cagatattgt gatgacccag actccactct 781 ctctgtccgt cacccctgga cagccggcct ccatctcctg caagtctagt cagagcctcc 841 tgcatagtga tggaaagacc tatttgtatt ggtacctgca gaagccaggc cagcctccac 901 agctcctgat ctatgaagtt tccaaccggt tctctggagt gccagatagg ttcagtggca 961 gcgggtcagg gacagatttc acactgaaaa tcagccgggt ggaggctgag gatgttgggg 1021 tttattactg catgcaaagt atacagcttc ctcccacagt ggtacagacc aatacagaaa 1081 cctccctgct ggggtgtccc agctgctcac ttgcactgct tgtctgggga gtagctcagc 1141 agggactctg agtctgcaga agaggaggct gttggagaac tcagtggcag ggcttgctgt 1201 tgaggactct ggcccatgag gtctcagcag cacctcagtc c // LOCUS RATS14 350 bp ds-DNA ROD 30-MAY-1990 DEFINITION Rat S14 protein gene, 5' flank. ACCESSION M33553 KEYWORDS S14 protein. SOURCE Rat (strain Sprague-Dawley) male liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 350) AUTHORS Jump,D.B., Bell,A. and Santiago,V. TITLE Thyroid hormone and dietary carbohydrate interact to regulate rat liver S14 gene transcription and chromatin structure JOURNAL J. Biol. Chem. 265, 3474-3478 (1990) STANDARD simple staff_review BASE COUNT 71 a 77 c 96 g 106 t ORIGIN 1 ctagacacta agtgacatat gttcatatag atagtgtttt taaactggaa agactgtccc 61 taccttggtt tgtagccacc tgggctgagg gccagctgag gttacttggg gcctggcagc 121 ctgggtacca cttgctcatc ttacccttgg atcacattca catgatgttc tggccaggcc 181 cttgacccca gttcctctgg ggccaattca atttgtccag cctagggcat tctgggtgag 241 gttattccaa aagatgtgtc agccaaccct ccgtttctcc acagggttgt agtcatggaa 301 agactagaga aagtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt // LOCUS MUSPTKA 211 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD15. ACCESSION M33421 KEYWORDS protein tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 211) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 211 protein tyrosine kinase (AA at 3) BASE COUNT 47 a 55 c 64 g 45 t ORIGIN 1 ggatccacag ggacctggct gctcggaact gcctggtgac agagaagaat gtcctgaaga 61 tcagcgactt tgggatgtcc cgcgaagaag ctgatgggat ctatgccgcc tgcagcggcc 121 tcagacaagt ccctgttaag tggactgccc ctgaggccct taactacgga cgctactcct 181 cagagagtga tgtgtggagc tttggaattc c // LOCUS MUSPTKB 211 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD16. ACCESSION M33422 KEYWORDS protein tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD16. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 211) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 211 protein tyrosine kinase (AA at 3) BASE COUNT 50 a 50 c 59 g 52 t ORIGIN 1 ggatccacag agaccttgct gctaggaact gcatggatgc cgaagatttc acagtaaaaa 61 ttggagattt cggtatgaca cgagacatct acgagacgga ctactaccgg aaaggcggga 121 aggggttgct gcctgtgcgc tggatgtctc tcgagtccct caaggatggt gtcttcacta 181 ctcattctga cgtctggtcc ttcggaattc c // LOCUS MUSPTKC 214 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD17. ACCESSION M33423 KEYWORDS protein tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD17. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 214) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 214 protein tyrosine kinase (AA at 3) BASE COUNT 69 a 42 c 58 g 45 t ORIGIN 1 ggatccacag ggacctggca acaaggaaca tattggtgga aaatgagaac agggttaaaa 61 taggagactt cggattaacc aaagtcttgc cgcaggacaa agaatactac aaagtaaagg 121 agccagggga aagaccgata ttctggtacg cacctgaatc cttgacggag agcaagtttt 181 ctgtggcctc agatgtctgg tcctttggaa ttcc // LOCUS MUSPTKD 217 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD19. ACCESSION M33424 KEYWORDS protein tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD19. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 217) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 217 protein tyrosine kinase (AA at 3) BASE COUNT 64 a 44 c 57 g 52 t ORIGIN 1 ggatccacag agacttagct gcaagaaact gcatgttgga tgaaaaattc actgtcaagg 61 ttgctgattt cggtcttgcc agagacatgt acgataaaga gtactatagt gtccacaaca 121 agacgggtgc caagctacca gtgaagtgga tggctttaga gagtctgcaa aggcagaagt 181 tcaccaccac gtcagatgtg tggtcctttg gaattcc // LOCUS MUSPTKE 214 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD22. ACCESSION M33425 KEYWORDS protein tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD22. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 214) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 214 protein tyrosine kinase (AA at 3) BASE COUNT 59 a 46 c 55 g 54 t ORIGIN 1 ggatccaccg ggacttagca gcaagaaatg tccttgttga gagtgagcat caagtgaaga 61 tcggagactt tggtttaacc aaagcaattg aaaccgataa ggagtactac acagtcaagg 121 acgaccggga cagcccagtg ttctggtacg ctccggagtg tttaatccag tgtaaatttt 181 atatcgcctc tgacgtctgg tcctttggaa ttcc // LOCUS MUSPTKF 208 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds, clone FD175. ACCESSION M33426 KEYWORDS protein tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD175. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 208) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 208 protein tyrosine kinase (AA at 3) BASE COUNT 49 a 53 c 55 g 51 t ORIGIN 1 ggatccaccg tgatctgcga gctgctaacg tcctggtctc tgagtcactc atgtgcaaga 61 ttgcagactt tggcctcgcg agagtcatcg aagataacga gtacacagca agggaaggtg 121 cgaagttccc tatcaagtgg acagctccag aggcgttcaa cttcggctgc ttcactatca 181 aatctgacgt gtggtccttt ggaattcc // LOCUS MUSPTKG 208 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse protein tyrosine kinase (PTK) mRNA, partial cds. ACCESSION M33427 KEYWORDS protein tyrosine kinase. SOURCE Mouse haemopoietic cell line WEH1-3B D+, cDNA to mRNA, clone W3.13. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 208) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 208 protein tyrosine kinase (AA at 3) BASE COUNT 53 a 44 c 56 g 55 t ORIGIN 1 ggatccacag agacctggct gccagaaatt gtctagtgaa tgaagcagga gttgtcaaag 61 tatctgattt tggaatggcc aggtacgttc tggatgatca gtacacaagt tcttctggcg 121 ccaagttccc tgtgaagtgg tgtcccccag aagagtttaa ttacagccgc tttagcagca 181 agtcagacgt gtggtcctat ggaattcc // LOCUS DIRPARAM 2545 bp ss-mRNA INV 30-MAY-1990 DEFINITION D.immitis paramyosin mRNA, partial cds. ACCESSION M29733 KEYWORDS paramyosin. SOURCE D.immitis, cDNA to mRNA. ORGANISM Dirofilaria immitis Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 2545) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 2545 paramyosin (AA at 1) BASE COUNT 949 a 420 c 544 g 632 t ORIGIN 1 gcattcggta gtatgtcggt tgccgatttg ggaagtttaa ctcgcttgga ggacaagatt 61 cgtctactgc aagaagatct cgaatccgaa cgtgaacttc gaaatagaat cgaacgagaa 121 agagctgatc ttagtgtaca actgattgca ttaactgata gacttgaaga tgctgagggt 181 actactgata gtcagattga atcaaatcgt aaacgtgaag cagaattgca aaaattacgt 241 aaattattgg aagaatcaca attagaaaat gaagatgcaa tgaatgtttt acgtaaaaag 301 catcaagatg catgtctcga ttacgctgaa caaattgaac aattacaaaa gaaaaattca 361 aagattgatc gtgaacgtca acgtctgcaa catgaagtaa ttgagcttac tgcgacaatt 421 gatcaacttc aaaaggataa gcatttggcg gaaaaagcag cggaacgttt tgaagcgcaa 481 actatcgaat tgagtaataa agttgaagat ttaaatcgac atgttaatga tttagctcaa 541 caacgtcaac gtttacaagc tgaaaataac gatcttctca aagagattca tgatcaaaaa 601 gtacaattgg ataatttgca acacgtgaaa tatcaacttg cgcaacaact tgaagaagca 661 cgtcgacgtt tagaagatgc tgaacgcgaa cgttctcaac ttcaagcaca acttcatcaa 721 gtacaattag aattggattc tgtacgtacc gcattagatg aagaatcagc agctcgagca 781 gaagctgaac ataaacttgc tttagcaaat accgagatta cccaatggaa gagtaaattt 841 gatgctgaag ttgctcttca tcatgaagag gttgaagatt tacgaaagaa aatgctccaa 901 aaacaggctg aatatgagga acaaattgaa ataatgttgc aaaaaatttc acaacttgaa 961 aaagctaaat cacgcttaca aagtgaagtt gaagtattaa ttgttgattt ggaaaaagca 1021 caaaatacaa tagcaattct tgaacgagca aaagaacaac tcgaaaaaac tgttaatgaa 1081 ttaaaggtac gtattgatga acttactgtt gaacttgaag cagcacaacg tgaagctcgt 1141 gctgcattag cagaattaca aaagatgaaa aatttatacg aaaaagcaat tgagcaaaaa 1201 gaagccttag ctcgagaaaa taaaaaatta caagatgatt tgcatgaagc aaaagaagcg 1261 cttgctgatg caaatcgtaa attgcatgag cttgatcttg aaaatgcacg attagctggt 1321 gaaatacgag aattgcaaac agctcttaag gaatctgaag ctgcacgacg tgatgctgag 1381 aatcgtgcac aacgtgcact tgctgaatta caacaacttc gtattgaaat ggaacgtcgt 1441 ttgcaagaga aagaggaaga aatggaagca cttcgtaaga atatgcaatt tgaaattgat 1501 cgacttacag ctgcattagc cgatgctgaa gcacgtatga aggcggaaat ttcacgtcta 1561 aagaagaagt atcaagcgga aattgccgaa cttgaaatga ccgttgataa tttgaatcgg 1621 gcaaatattg aagcgcaaaa gacgattaaa aaacaaagtg aacaacttaa aatattgcaa 1681 gcatcacttg aggatactca acgtcaacta cagcaaactc ttgatcagta tgcattggcg 1741 caacgtaaag tttccgcact tagcgctgaa ttagaagaat gcaaagttgc acttgataat 1801 gcaattcgtg ctcgtaaaca agctgaaatt gatcttgaag aagcgaatgc tcgtattact 1861 gatttggtat caattaacaa taatttaaca gcaattaaaa ataaattgga aactgaatta 1921 tcgaccgcac aagctgattt ggatgaagca acgaaagaat tgcatgccgc ggatgagcgg 1981 gctaatcgtg ctttagcaga tgcagctcga gctgttgaac aacttcatga ggagcaggaa 2041 cattccatga aaattgatgc tctcaggaaa tcattagaag agcaggtgaa gcaattacaa 2101 gtacaaattc aagaagctga agctgcagca ttattgggtg gtaaacgtgt aattgctaaa 2161 cttgaaacac gtatacgtga tttggaaact gctttggatg aagaaacacg tcgacataag 2221 gaaacgcaag gtgcgcttag aaagaaagat cgacgcatca aagaagttca aatgcaagtt 2281 gatgaagagc ataagatgtt tgtgatggcg caagatactg ctgataggct tcttgaaaaa 2341 cttaacattc aaaagtggca acttggagaa gctgaatctt taacgatggc taatcttcaa 2401 cgagtacgtc gatatcaacg tgaattggaa gatgctgaag gtcgtgctga tcaagctgaa 2461 agctcattac atcttattcg tgctaaacat cgttcgtcag ttgttaccgg caaaaatgct 2521 tcagcatcta aaatttacgt tctcg // LOCUS ONGPARAM1 134 bp ss-mRNA INV 30-MAY-1990 DEFINITION O.volvulus paramyosin mRNA partial cds, clone OV-1. ACCESSION M29734 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 134) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 134 paramyosin (AA at 1) BASE COUNT 53 a 23 c 28 g 30 t ORIGIN 1 ttacaaaaga aaaattcaaa aattgaccgt gaacgtcaac gtctgcaaca tgaagtaatt 61 gagcttacag cggcaattga tcaacttcaa aaggataagc atttggctga gaaagcagca 121 gaacgttttg aagc // LOCUS ONGPARAM2 135 bp ss-mRNA INV 30-MAY-1990 DEFINITION O.volvulus paramyosin mRNA partial cds, clone OV-2. ACCESSION M29735 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 135) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 135 paramyosin (AA at 2) BASE COUNT 43 a 31 c 26 g 35 t ORIGIN 1 tttgcaacat gtcaaatatc aacttgcgca acaacttgaa gaagcacgtc gtcgtttgga 61 agatgctgaa cgtgaacgtt cacaacttca agcacaactt catcaagttc agctagaatt 121 ggattccgta cgtac // LOCUS ONGPARAM3 176 bp ss-mRNA INV 30-MAY-1990 DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-3. ACCESSION M29736 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 176) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 176 paramyosin (AA at 1) BASE COUNT 74 a 20 c 36 g 46 t ORIGIN 1 aaagttgctc ttcatcatga agaggttgaa gatttacgaa agaaaatgtt gcaaaagcaa 61 gctgaatatg aagagcaaat tgaaataatg ctgcaaaaaa tttcacaact tgaaaaggct 121 aaatcacgtt tgcaaagtga agttgaagta ttgattgttg atttagaaaa agcaca // LOCUS ONGPARAM4 414 bp ss-mRNA INV 30-MAY-1990 DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-4. ACCESSION M29737 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 414) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 414 paramyosin (AA at 3) BASE COUNT 143 a 70 c 101 g 100 t ORIGIN 1 ccttagctcg agaaaataaa aagttgcaag atgatttgca tgaagctaaa gaagcgcttg 61 ctgatgcaaa tcgtaaattg catgaacttg atcttgagaa tgcacgatta gctggtgaaa 121 ttcgagaatt gcaaacagca ctcaaagaat ctgaagcagc tcgacgtgat gctgagaatc 181 gtgcacaacg tgcacttgct gaattgcaac aacttcgcat cgaaatggaa cgtcgtttgc 241 aagagaagga agaagaaatg gaagcacttc gtaagaatat gcaatttgaa attgatcgtc 301 ttacagctgc attagctgat gctgaagcac gtatgaaggc ggagattgca cgtttgaaga 361 aaaagtatca ggcggaaatt gccgaacttg aaatgaccgt tgataatttg aatc // LOCUS ONGPARAM5 145 bp ss-mRNA INV 30-MAY-1990 DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-5. ACCESSION M29738 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 145) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 145 paramyosin (AA at 3) BASE COUNT 48 a 28 c 28 g 41 t ORIGIN 1 tattgcaagc atcacttgaa gatactcaac gtcaattaca acaaactcta gatcagtatg 61 ctttggcaca acgtaaagta tctgcgctta gtgctgaatt ggaagaatgc aaagttgcac 121 ttgataatgc aattcgtgct cgtaa // LOCUS ONGPARAM6 129 bp ss-mRNA INV 30-MAY-1990 DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-6. ACCESSION M29739 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 129) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 129 paramyosin (AA at 3) BASE COUNT 48 a 20 c 31 g 30 t ORIGIN 1 aagtacaaat tcaagaagct gaagcagcag cattattggg tggtaaacgt gtaattgcta 61 aacttgaaac acgtatacgt gatttggaaa ctgctttgga tgaagaaaca cgtcgacata 121 aggaaacgc // LOCUS MUSA1AT 1350 bp ss-mRNA ROD 30-MAY-1990 DEFINITION Mouse alpha-1-antitrypsin (AT) mRNA, complete cds. ACCESSION M33567 KEYWORDS alpha-1-antitrypsin. SOURCE Mouse liver, cDNA to mRNA. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1350) AUTHORS Latimer,J.J., Berger,F.G. and Baumann,H. TITLE Highly conserved upstream regions of the alpha-1-antitrypsin gene in two mouse species govern liver-specific expression by different mechanisms JOURNAL Mol. Cell. Biol. 10, 760-769 (1990) STANDARD simple staff_review FEATURES from to/span description pept 49 1287 alpha-1-antitrypsin (AT) mRNA 1 > 1350 alpha-1-antitrypsin mRNA BASE COUNT 355 a 391 c 306 g 298 t ORIGIN 1 gtctcctcag cctccgtcac cacccagctc tgggacagca agctgaaaat gactccctcc 61 atttcatggg gtctactgct tctggcaggc ctgttttgcc tggtccccag ctttctagct 121 gaggatgttc aggagacaga cacctccaga agggattcag tcccagcctc ccatgatact 181 ccctacaacc tcgagctttc catcagccta taccgggagc ttggacacaa gtccaccact 241 tccaacatct tcttctccca ggtgagcatt gccacagcct ttgctatgct ctccctaggg 301 gagaagggtg acactcacac gcagatccta gagggcctgc agttcaacct cacacaaaca 361 tcggaggctg acatccacaa ggccttccaa cacctcctcc aaaccctcaa caggccagac 421 agtgagctgc agctgagcac aggcaatggc tctttgctca acaatgacct gaagctggtg 481 gagaagtttc tggaagaggc caagaacaat tatcactcag aagtcttctc cgtcaacttt 541 gcagagtcag aggaggccaa gaaagtgatc aatgattttg tggagaaggg aacccaagga 601 aagatagctg aggcggtgaa agaccctgac gaagacacag ttttcgccct ggcgaattac 661 attctcttta aaggcaaatg gaagaagcca ttcgatccta agcacactga ggaagctgaa 721 ttccacgtgg acacagtcac cacggtgaag gtgcccatga tgaccctcac gggcatgctt 781 gacgtgcacc attgcagcac gctgtccagc tgggtgctgc tgatggatta cctgggcaat 841 cgcactgctg tcttcctcct gcccgatgat ggcaagatgc agcatctgga gcaaactctc 901 aacaaggagc tcatctctaa gttcctgcta aacaggcaca gaaggttagc ccaggtccac 961 ttacccagat tgtccctctc tggaaactat actttgaaca cactcatgag tcacctgggc 1021 atcacccgga tcttcaacaa tggggctgac ctctccggaa tcacagagga gaatgctccc 1081 ctgaagctca gcaaggctgc agataaggcc gtgctgacca tggatgagac aggaacagaa 1141 gctgcagcag ctacagtctt acaagccgtt cctatgtcta tgccccctat cctgaacttc 1201 aacaaacctt tcattttcat aatagttgaa gaacacactc agagccccct ctttgtggga 1261 aaagtggtag atcccacacg taaatgacca ccctaagaaa tcatccttcc tttctgaatt 1321 gggtcccttc cttaaactct gacggctgct // LOCUS XEBALPA 586 bp ss-mRNA VRT 30-MAY-1990 DEFINITION X.borealis alpha-I globin mRNA, complete cds. ACCESSION M32453 KEYWORDS alpha globin. SOURCE X.borealis adult immature red blood cell, cDNA to mRNA. ORGANISM Xenopus borealis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 586) AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W. TITLE Globin evolution in the genus Xenopus: Comparative analysis of cDNAs coding for adult globin polypeptides of Xenopus borealis and Xenopus tropicalis JOURNAL J. Mol. Evol. 23, 211-223 (1986) STANDARD simple staff_review FEATURES from to/span description pept 42 470 alpha-I globin protein BASE COUNT 175 a 140 c 111 g 160 t ORIGIN 1 atctggtgta gactgcttat tgcacaacac aaacaacaac catgcttctt tcagccgatg 61 acaagaaaca catcaaggca attatgcctt ctattgctgc tcatggcgac aaatttggtg 121 gagaagcttt gtacaggatg ttcttggtca accccaagac caaaacctac ttccctactt 181 ttgacttcca ccacaattca aaacagatca gtgctcatgg caagaaagtt gtggatgctc 241 tgaatgaagc ctccaaccat ttggataaca ttgctggaag cctgagcaag ctgagtgacc 301 tccatgccta tgacctgaga gtggatccgg gcaacttccc attgctggcc cataatattt 361 tggtggttgt tgctatgaac ttccctaagc agtttgatcc tgcaacccat aaggccctgg 421 ataagttcct ggctaccgtt tcttctgttc tgacatccaa atatcgttaa ggctcaacag 481 caacagtagc agaagtctca acatcagaca tcagttaatt atatgcaatt aaactgacaa 541 agcatgttaa agaatgttct gaaataaaca tttgaaccat tatcct // LOCUS XEBALPB 525 bp ss-mRNA VRT 30-MAY-1990 DEFINITION X.borealis alpha-II globin mRNA, complete cds. ACCESSION M32455 KEYWORDS alpha globin. SOURCE X.borealis adult immature red blood cell, cDNA to mRNA. ORGANISM Xenopus borealis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 525) AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W. TITLE Globin evolution in the genus Xenopus: Comparative analysis of cDNAs coding for adult globin polypeptides of Xenopus borealis and Xenopus tropicalis JOURNAL J. Mol. Evol. 23, 211-223 (1986) STANDARD simple staff_review FEATURES from to/span description pept 43 471 alpha-II globin protein BASE COUNT 159 a 123 c 98 g 145 t ORIGIN 1 atctggtgtg tactgctaat tgcaacaaaa caaacagcaa caatgcttct tacagctgat 61 gacaagaaac acatcaaggc aattttgcct tccatagctg ctcatggcga caaatttggt 121 ggagaagctt tgtacagaat gttcttgatt aatcctaaaa ccaaaaccta ctttcctaat 181 tttgacttcc accacaattc aaaacaaatc agtgctcatg gcaagaaagt tgtggatgct 241 ctgaatgaag ctgccaacca cttggataac attgctggaa gcatgagcaa gctgagtgac 301 ctccatgcct atgacttgag agtggatccg ggcaacttcc cattgctggc ccataatatt 361 ctggtgactg ttgctatgta cttccctcag cagtttgatc ctcataccca taaggctctg 421 gataagttcc tggcttccgt atcttctgtt ctgacatcca aatatcgtta aggctcagca 481 gcaaaagcaa cagcacaagt ttcaatatta atcatcagta aatca // LOCUS XEBBETA 605 bp ss-mRNA VRT 30-MAY-1990 DEFINITION X.borealis beta-I globin mRNA, complete cds. ACCESSION M32456 KEYWORDS beta globin. SOURCE X.borealis adult immature red blood cell, cDNA to mRNA. ORGANISM Xenopus borealis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 605) AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W. TITLE Globin evolution in the genus Xenopus: Comparative analysis of cDNAs coding for adult globin polypeptides of Xenopus borealis and Xenopus tropicalis JOURNAL J. Mol. Evol. 23, 211-223 (1986) STANDARD simple staff_review FEATURES from to/span description pept 47 487 beta-I globin protein BASE COUNT 177 a 146 c 126 g 156 t ORIGIN 1 acgggttatt tttgcagaag ctcagaataa actctaaact ttgaccatgg gtttgacagc 61 acatgatcgt cagctgatca acagcacctg gggcaaagta tgtgccaaga ctataggaaa 121 agaggccctt ggacgtctgc tgtggactta tccctggacc caaagatact ttagttcttt 181 tgggaacctc aacagtgctg acgccgtctt ccacaatgaa gcagtggctg ctcatggtga 241 aaaggtggtg acatctattg gagaggccat caagcacatg gatgacatta agggatatta 301 tgctcagctg agcaaatacc actcagaaac cctacatgtg gatccatgca acttcaagcg 361 cttcggtggc tgcttgtcca tttccctggc tcgccaattc catgaagaat atacacctga 421 gctacatgct gcctatgaac atctgtttga tgcaattgct gatgcccttg ggaagggtta 481 ccactaaacc atcctcaaga acacccaaat ggagtctcta agctacataa taccaaccta 541 caaaatgttt tcctctaaaa cgtagccatt catatctgct cctaataaaa agaaagtttc 601 ttcac // LOCUS XEBBETB 619 bp ss-mRNA VRT 30-MAY-1990 DEFINITION X.borealis beta-II globin mRNA, complete cds. ACCESSION M32458 KEYWORDS beta globin. SOURCE X.borealis adult immature red blood cell, cDNA to mRNA. ORGANISM Xenopus borealis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 619) AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W. TITLE Globin evolution in the genus Xenopus: Comparative analysis of cDNAs coding for adult globin polypeptides of Xenopus borealis and Xenopus tropicalis JOURNAL J. Mol. Evol. 23, 211-223 (1986) STANDARD simple staff_review FEATURES from to/span description pept 48 488 beta-II globin protein BASE COUNT 185 a 152 c 130 g 152 t ORIGIN 1 cacgagttct ttttgcagaa actcagaata ctctctcaat tttaaccatg ggtttgacag 61 cacatgaaaa acagctcatc accggcagct ggggcaaaat aaatgccaaa gccataggaa 121 aagaggccct cggcaggctg ctgaatactt ttccttggac ccaaagatac tttagttctt 181 ttgggaacct cggcagtgct gaagccatct tccacaatga agctgtagct gctcatgggg 241 aaaaggtagt gacatctgtt ggagaggcca tcaaacacat ggatgacata aagggctatt 301 atgctgagct gagcaagtac cactctgaaa ccctgcatgt ggatccaaac aacttcaagc 361 gcttcggtgg ctgcttgtcc attaccctgg gtcaccactt tggtgaagaa tatacacccg 421 agttacatgc tgcctatgaa catctgtttg atgcaattgc cgacgccctt ggcaagggtt 481 accactgaac cagcctcaag aacacctgaa cggagtcgca aagctacata gaaacaacct 541 acagtacaca ttgctaaaat gttttttccc cccaaaatgt agccattcat atgagctcct 601 aataaaataa agtttcttc // LOCUS XETALPA 581 bp ss-mRNA VRT 30-MAY-1990 DEFINITION X.tropicalis alpha globin mRNA, complete cds. ACCESSION M32454 KEYWORDS alpha globin. SOURCE X.tropicalis adult immature red blood cell, cDNA to mRNA. ORGANISM Xenopus tropicalis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 581) AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W. TITLE Globin evolution in the genus Xenopus: Comparative analysis of cDNAs coding for adult globin polypeptides of Xenopus borealis and Xenopus tropicalis JOURNAL J. Mol. Evol. 23, 211-223 (1986) STANDARD simple staff_review FEATURES from to/span description pept 42 470 alpha globin protein BASE COUNT 170 a 138 c 119 g 154 t ORIGIN 1 agcatctggt gtggactgct tattgcacaa cacaaacaac aatgcatctt acagctgatg 61 acaagaaaca catcaaggcc atttggcctt ctgtagctgc tcatggtgac aaatatggcg 121 gagaagcttt gcacaggatg ttcatgtgtg ctcccaagac caaaacctac tttcctgatt 181 ttgacttcag cgaacattca aaacacatct tggctcatgg caagaaagtt tcggatgctc 241 tgaatgaggc ttgcaaccat ctggacaaca ttgccggatg cctgtccaag ctgagtgacc 301 tccatgccta tgacctgaga gtggatccag gcaacttccc attgctggcc catcaaattc 361 tggtggttgt tgctatccat ttccctaagc agtttgaccc tgcaacccat aaggccctgg 421 acaagttcct ggtttccgta tctaatgttc tgacatccaa atatcgttaa ggctcagcag 481 taacagtagc agaagtttgg acatcagaca tcagttaatg acaaacaatc aaactgacac 541 agcttgtgaa agaatgttct gaaataaaca tttttaaaat t // LOCUS XETBETA 563 bp ss-mRNA VRT 30-MAY-1990 DEFINITION X.tropicalis beta globin mRNA, complete cds. ACCESSION M32457 KEYWORDS beta globin. SOURCE X.tropicalis adult immature red blood cell, cDNA to mRNA. ORGANISM Xenopus tropicalis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 563) AUTHORS Knoechel,W., Korge,E., Basner,A. and Meyerhof,W. TITLE Globin evolution in the genus Xenopus: Comparative analysis of cDNAs coding for adult globin polypeptides of Xenopus borealis and Xenopus tropicalis JOURNAL J. Mol. Evol. 23, 211-223 (1986) STANDARD simple staff_review FEATURES from to/span description pept 52 495 beta globin protein BASE COUNT 170 a 133 c 115 g 145 t ORIGIN 1 aacacaggct ctttttttgc agaagcttaa aataaacgct cagctttgac catggttaat 61 ctgacagcaa aggaacgcca gctcatcact ggcacctgga gcaaaatatg tgcaaagacc 121 ttgggaaaac aggcccttgg cagtatgctg tatacttatc cttggaccca aagatacttt 181 agtagttttg ggaacctcag cagtatcgaa gccatcttcc acaatgcagc tgtggctact 241 catggtgaaa aggtgttgac atctattgga gaggccatca aacacatgga tgacataaag 301 ggatattatg ctcagctgag caagtaccac tcagaaaccc tacatgtgga tccatacaat 361 ttcaagcgct tctgtagctg cacgattatt tccatggctc aaaccctcca agaagacttt 421 acacctgagc tacaggctgc tttcgaaaag ctgtttgctg caattgctga tgcccttggc 481 aagggttacc actaaaccaa cctcaagaac atgtgactgg agtctcttag ctacacagaa 541 acaaaatctc gttttttttc aaa // LOCUS ROB11AA 667 bp ds-RNA VRL 30-MAY-1990 DEFINITION Bovine rotavirus gene 11, complete cds. ACCESSION M33606 KEYWORDS . SOURCE Bovine rotavirus (strain VMRI) viral RNA. ORGANISM Bovine rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 667) AUTHORS Matsui,S.M., Mackow,E.R., Matsuno,S., Paul,P.S. and Greenberg,H.B. TITLE Sequence analysis of gene 11 equivalents from "short" and "super short" strains of rotavirus JOURNAL J. Virol. 64, 120-124 (1990) STANDARD simple staff_review FEATURES from to/span description pept 22 618 gene 11 protein BASE COUNT 219 a 123 c 139 g 186 t ORIGIN 1 ggcttttaaa gcgctacagt gatgtctctc agtattgacg tgacgagtct tccttctttt 61 tcttctagca tttataagaa tgaatcatct gcaacagcgt caactctttc tggaaaatct 121 attggtagga gtgtacaata tgtttcacca gatgcagaag cattcagtaa atatatgctg 181 tcaaagtctc cagaagatat tggaccatct gattctgctt caaacgatcc actcaccagc 241 ttttcgatta gatcgaatgc agttaagaca aatgcagacg ctggggtgtc tatggattca 301 tcagtacaat cacgaccatc aataaatgtt gggtgcgatc aagtggattt ctcctttaat 361 aaaggaatca aagtaaatgc taacctggat tcatcaatat cagtgtcaac aaattcaaga 421 aaggagaaat ccaaaggtga tcgtaaaagt aggaaacact accctaaaat tgaagcagaa 481 tctgattcgg atgagtacgt acttgacgat tcagacagtg atgatggcaa atgtaggaat 541 tgtaaatata aaaggaagta tttcgcacta agaatgagaa tgaaacaagt agctatgcaa 601 ttaatcgaag atttgtaggt ctgacctgag aggtcactag ggagctcccc actcccgttt 661 tgtgacc // LOCUS ROH11AA 658 bp ds-RNA VRL 30-MAY-1990 DEFINITION Human rotavirus gene 11, complete cds. ACCESSION M33607 KEYWORDS . SOURCE Human rotavirus (strain 69M) viral RNA. ORGANISM Human rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 658) AUTHORS Matsui,S.M., Mackow,E.R., Matsuno,S., Paul,P.S. and Greenberg,H.B. TITLE Sequence analysis of gene 11 equivalents from "short" and "super short" strains of rotavirus JOURNAL J. Virol. 64, 120-124 (1990) STANDARD simple staff_review FEATURES from to/span description pept 22 609 gene 11 protein BASE COUNT 221 a 118 c 127 g 192 t ORIGIN 1 ggcttttaaa gcgctacagt gatgtctctc agtattgacg tgacaagtct tccctcaatt 61 tcttctagta tttataaaca tgaatcgtct tcaacaacgt caactatttc tggaaaatct 121 attggtagga gtgaacagta catttcacca gatgcagaag ctttcaataa atacatgttg 181 tcgaagtctc cagaggatat tggaccatct gattctgcat caaacgatcc actcaccagc 241 ttttcgatta gatcgaatgc agttaagaca aatgcagatg ctggcgtgtc tatggattca 301 tcaacacaat cacgaccatc aagtgatatt ggatgcgatc aagtggattt ctccttaaac 361 agaggtatta agattgatgc tatggtagat tcttcaatat caatatctac taaatctaag 421 aaagagaaat ccaaacatga aactaagaac agaaaatgtt atccaaagat tgaagcagag 481 tcagattctg atgactacgt tttagatagt gatgatggta aatgtaagaa ctgtaaatat 541 aaaaagaagt attttgcact tcggttaaga atgaaacaag ttgcaatgca attaattgaa 601 aatttgtgag tctaacctgg gaggtcacta gggagctccc cactcccgtt ttgtgacc // LOCUS ROH11AB 667 bp ds-RNA VRL 30-MAY-1990 DEFINITION Human rotavirus gene 11, complete cds. ACCESSION M33608 KEYWORDS . SOURCE Human rotavirus (strain DS-1) viral RNA. ORGANISM Human rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 667) AUTHORS Matsui,S.M., Mackow,E.R., Matsuno,S., Paul,P.S. and Greenberg,H.B. TITLE Sequence analysis of gene 11 equivalents from "short" and "super short" strains of rotavirus JOURNAL J. Virol. 64, 120-124 (1990) STANDARD simple staff_review FEATURES from to/span description pept 22 618 gene 11 protein BASE COUNT 231 a 121 c 121 g 194 t ORIGIN 1 ggcttttgaa gcgctacagt tatgtctctc agtattgatg tgactggtct tccttcaatt 61 tcttcaagtg tttataaaaa tgaatcgtct tcaacaacgt caactatttc tggaaaatct 121 attggtagga gtgaacagta catttcacca gatgcagaag ctttcagaaa atacatgttg 181 tcaaaatctc cagaagatat tggaccttct gattctgcat cgaacgatcc actcaccagc 241 ttttcgatta gatcgaatgc agttaagaca aatgcagatg ctggcgtgtc tatggattca 301 tcagcgcaat cacgaccatc tagcgacatt ggatacgatc aaatggattt ctccttgaat 361 aaaggtatta aatttgatgc tacagtggac tcttcaatat caatatctac cacatcaaag 421 aaggagaaat ctaaaaacaa aaataagtat aaaaaatgtt acccaaaaat cgaagcagag 481 tctgattccg atgattacat attagatgat tcagatagcg atgatggaaa atgtaaaaat 541 tgcaaatata aaaagaaata ttttgcactt cgtttaagaa tgaagcaagt tgcaatgcaa 601 ttaattaaag atttatgagt ctaacctgag agatcactag ggagctcccc actcccgttt 661 tgtgacc //
GenBank-Updates@genbank.bio.net (05/31/90)
LOCUS SHPMHDQAA 2387 bp ds-DNA MAM 31-MAY-1990 DEFINITION Sheep MHC class II OLA-DQA alpha-chain gene, 3' end. ACCESSION M33304 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep DNA, clone OLAIIC8-1, library CH.28. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 2387) AUTHORS Scott,P.C. and Brandon,M.R. TITLE Nucleotide sequence, polymorphism and expression of ovine MHC class II alpha genes DQA JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.C.Scott 28-MAR-1990. University of Melbourne, Veterinary Preclinical Centre, Parkville, Victoria, Australia 3052 FEATURES from to/span description pept / 359 607 MHC OLA-DQA alpha-chain, exon 2 (AA at 361) 1085 1366 MHC OLA-DQA alpha-chain, exon 3 1604 1758 MHC OLA-DQA alpha-chain, exon 4 pre-msg 358 > 2228 MHC OLA-DQA alpha-chain mRNA IVS < 1 358 MHC OLA-DQA intron A IVS 608 1084 MHC OLA-DQA intron B IVS 1367 1603 MHC OLA-DQA intron C IVS 608 1084 MHC OLA-DQA intron A IVS 1367 1603 MHC OLA-DQA intron B signal 2223 2228 poly-A signal BASE COUNT 608 a 610 c 489 g 680 t ORIGIN Chromosome 20. 1 ggatccagct gaggctaaaa ctgtaaaagt gaattagttc caattgacac agtagagtga 61 tactcttcag catcagttgg tagcccagga caagtaccag atgaaaagga tattacatgt 121 atttctgcct ggaaaaggag tcaagatgca ttgcaaagat aaggaggctt tgttgcaagg 181 ctttttaact aactggacaa ctaccaacac agagcgaaga aggagcaggt actgggaatt 241 ttgtctagaa actggccaca aatgaagccc acaatgtttg atagtcaatt tcttctttca 301 ctgcttaatg aagatctttt ctctattttt ccctttcttg ctcctcaccc tgactcagct 361 gaccacattg gcacctatgg cgtaaacatc taccaaacat atggtccctc tggctactat 421 acccatgaat ttgatggaga tgaagagttc tacgtggacc tggaaaagag ggagactgtc 481 tggcgtctgc ctgagtttag taaatttaca agttttgacc ctcagggtgc actgagaaac 541 atagctacgg tgaaacataa tttggagatc ttgattcaaa ggtccaactc tactgctgct 601 accaacagta tgtgttcacc attctgcctc tctttgttgt tcttcccctt cataccagct 661 tcactccctt tttccctagg gatagatacc cttcaccact ctataaaact ttctcctttc 721 caaggagtca ccacattttc tcatggtaat atctgaaccc tcatcttctg ccttctttcc 781 actcatatat ttccatatga tagaaagatc cttacttcca taatgcaagc tttaatattt 841 cataggagag ccccacagac atcccacttg acaggtcgtc aaaaacaggg ggacaaggat 901 aaagcagaag caacatatag tgcctcccaa cagaagggaa gcaagagctt ctcctctgtc 961 agagtgcaac tgttggtgga agggctcccc caggaggcag tgcagtaaag gctcccccag 1021 gaggcagtgc agaatcaggg cagagctact tccgtttcac atctgtgctg tttcctcacc 1081 acagaggttc ctgaggtgac tgtgttttcc aagtctcccg tgatgctggg tcaacccaac 1141 accctcatct gtcacgtgga caacatcttt ccccctgtga tcaacatcac atggctgagg 1201 aacgggcact cggtcacaga gggagtttct gagaccagtt ttctcatcaa gagtgattat 1261 tcttttctca agatcaatta tctcaccttc ctcccttctg atgatgatgt ttatgactgc 1321 aaagtggagc actggggtct ggatgagcca cttctgaaac actggggtat gtacgagttc 1381 cagccttttg gtaccttctt ttctctgtca ggtccacaat gtcatgtttg ttatccccta 1441 atcctcatgg ccttgaactt tgagggattc taatgataga cctcactctc ttctctcagc 1501 tccatgcccc aagtctgtta gaaccaaccc ctcttcaccc atcccacata cacacacaga 1561 aacactctgt attgtgactt ccacaaaact tcagctttca cagagcctga gattccagtc 1621 cctatgtcag agctgacaga gactgtggtc tgtgccctgg ggttgaccgt gggcctcgtg 1681 ggcatcgtgg tgggcaccgt cctcatcatc cgaggtctgc gctcaggtgg cccctccaga 1741 caccaggggc cgttgtgagt catactccag aaggaaggta agaattcaga tctgttagat 1801 ctgggaacat agtacagacg agggaaagtg ggaggaggct gtggacacaa atgtgctgaa 1861 agtttctaaa ttgggaaaca gcatggcagt ggcacaggag ccctgagccc actcatctca 1921 tgccttttct gttgcaggtg cgctgtccat ctttaagaac agaagaacgg acttgctaga 1981 tgacctagag ctattttctg cccaagtcca tcatgtacct tctctcctcc tcctaccctc 2041 ctcctcatac ttcttctctg gacttaaggt gctgtatcat ctcgagctca catacatttg 2101 ggattctccc ccgaccttct aatttttttt tttctcaatt gttacctatt atgggatcac 2161 tgagatatcc cacccagcta cctaatccac caatgaccct aaaataatat atccatggaa 2221 gaaataaatg cctttatgag gtctttactg actttttccc atctttcatc tcagggctga 2281 ttagttccat gtcctcttct gtttcaggcc cttttaacct catttcccag attatgtttc 2341 acggcccagt aacaccggag cagcttgtag agactagtaa tattttt // LOCUS SHPOLADQAB 3355 bp ds-DNA MAM 31-MAY-1990 DEFINITION Sheep MHC class II OLA-DQA alpha-chain gene, 3' end. ACCESSION M33305 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep (strain Merino) DNA, clone OLAIIC17-2, library CH.28. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 3355) AUTHORS Scott,P.C. and Brandon,M.R. TITLE Nucleotide sequence, polymorphism and expression of ovine MHC class II alpha genes DQA JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.C.Scott 28-MAR-1990. University of Melbourne, Veterinary Preclinical Centre, Parkville, Victoria, Australia 3052 FEATURES from to/span description pept / 1057 1305 MHC OLA-DQA alpha-chain, exon 2 (AA at 1059) 1751 2032 MHC OLA-DQA alpha-chain, exon 3 2281 2435 MHC OLA-DQA alpha-chain, exon 4 IVS < 1 1056 MHC OLA-DQA intron A IVS 1306 1750 MHC OLA-DQA intron B IVS 2033 2280 MHC OLA-DQA intron C signal 2913 2918 poly-A signal BASE COUNT 888 a 842 c 665 g 960 t ORIGIN 1 ctgcagtgca gagacctggg tttgatccct gggtcgggaa gatcccctgg agaaggaaat 61 ggctacccac catgagtatt cttgcctgga taatccatgg acagaggaac ccagcaggct 121 aaagtccatg ggatcgcaaa gcgttggaca cgactgagtg actttcacat taaagaaaga 181 aataagggga aggaagtgtc agaaatgcaa aataaagctt catttttttt tttattttca 241 gaaggaatca cggcattcac tttaagaact cactacggtt aacaaaatgg acaccttaga 301 agcattgttt attctttata tgctttgtta tgttgtcact gtgcttgcta tactgtttct 361 tataccttat aaaagaggaa tctctgttaa aatagtagga tcatgtgtgc ttttccttgg 421 ttaaaaaaaa aaaataaaga aggaatctct atgcagtaaa tgcttctgct atgcctaccc 481 ccgagggaaa agaactcgaa tgaatgtgtt attatctgat gaaaaacttt tgaagaagag 541 gaaacaatgc taataccagg aaaattcatt cattcattcc cttttttctt ctctctttcc 601 attataatct ttattacaga ctacaatgtg ccaggcactg agctgggcag gcaggaaata 661 gcagtaaacg aagcctgtga aatcctgccc ttgtgagcat acaagctcag tcgtgatgga 721 actctcaaac ccatggactg tagcctttcc cttctccagg ggacttccca acccagggat 781 cgaacccagg tctcccactg caggcggatt ctttaccagc caagccgcaa gggactctca 841 agaatcacat atgttacagt gcaaaagcag cctggattct aacaggacag ctaccaacac 901 ttagggaaaa ggaagcaggt gctggcactt tgcttagaga cattgtgcca aaggtgaagc 961 ccaccgtgtt tgaaagttag tttcttcagc tactttgttt aatatggcct tttctctctg 1021 ttttccacct tcctgctcct caccctcact tatcagctga ccactttggc tcctatggca 1081 cagagatcta ccaatctcat ggtccctctg gccagtacac ccaggaattt gatggagacg 1141 agctgtttta tgtggacctg gggaagaagg agactgtctg gaggctgcct atgtttagcc 1201 agtttgcagg ttttgatcca cagggtgcac tgagtgaaat agctacagca aaacacaact 1261 tggatatcct gactaaacgc tccaacttta cccctgctat caatggtaag tgtccaccat 1321 tctacttctc tttactgaat ctattctttc atatcaggct tcactccctt cttttctaag 1381 gagagatatc cttcaccatg ctatgaaact ttcccaagtg tccccagatt tcatagtaat 1441 tattgaacga ctcatcctct cccacctcaa agatcacata tttccatgta atataaggac 1501 ccttactccc ataacatatt ccttgaatcc ctcaaggagg agtcccacag acctcctcct 1561 taacaagcat gcccacagac agcacgggga taaagcatgg gcaacatata gcatctccca 1621 gcagaaggcg aacaagagct cctcctctgt cagactggga aacgttgtga gagggctccc 1681 ccaggaggca gtgcagaatc agggccaagc tttttcccca tttcacatct gtgctgtttt 1741 ctcaccatag aagttcctga ggtgactgtg ttttccaagt ctcccgtgat gctgggtcaa 1801 cccaacaccc tcatctgtca cgtggacaac atttttcccc ctgtgatcaa catcacatgg 1861 ctgaagaacg ggcatgcagt cacagagggt gtttctgaga ccagcttcct tcccaaggat 1921 gatcattcct tcctcaagat tggttatctc accttcctcc cttctgatga tgacgtttat 1981 gactgcaaag tggagcactg gggcctggat gagccacttc tgaaacactg gggtatgtag 2041 agttcaccct tttggtactc tctctttttc taccaagtac aaaacactga gttttggtcc 2101 tcatctcaca gtcaagcttg ttttccacac ttcaaggttt ctaaaattat agaacttcat 2161 tctcttccct tagcctggtg ccctgagttt ttgtagcatg aacacacact ctgcccaatc 2221 tcatacacag gcacatgaac caaccctctg tattctgact ttcataactt cattttccag 2281 agcctgagat tccagcccct atgtcagagc tgacagagac tgtggtctgt gccctggggt 2341 tgaccgtggg ccttgtgggc atcgtggtgg gcaccatctt catcatccaa ggcctgcgct 2401 caggtggggc ctccagacac cagggtcccc tgtgagtcgc accctagaaa ggaaggtaag 2461 gattcatatt tgtcagtgcc acagacacac ttcaggagaa agcaaagcgg gaagtaaatt 2521 gtagacacaa atgtggttga aagttgtggg aaacgtggga accagcatga ctacagcaca 2581 gaagctccct aggactcatc agtttcatgc ctttcctgtt gcaggtctct gcccatctag 2641 gattgcagaa gagaggacgt tctagacgac ctagaactag tttctggcca agttcatcat 2701 atactctctc ttcccctaca cactctgccc ctcctctctt ctctgggact taagatgctg 2761 ttacatctca gagctcacat ataccccaga gttctcccct gactttctga tatttttttc 2821 tgttctcagt agttgcctac catgagatca ctggggtatt ccacccagtt acctactccc 2881 tagtgacccc aaccccatat caccatggaa gcaataaact ccccttcaat agaaattttt 2941 cctgtctttc atcacggagc tgaccaagat catgttctct tatgctttag gccccttaat 3001 ttcatttctc agatcatgtt tcatgctgaa gaacataaga gcaactttta caaattcata 3061 atatttcagt atgttatgtt cactatggtt aatatttctc tcttcctact atttccaatc 3121 ttggctcttg gcacttccat ccacccctta attcaggcat cagtgaaagt aatatatacc 3181 ttttttcctt ggtttatatt tgttatagca gaaatacaaa aaccaagaga ggagtctgta 3241 ctcatctctt tttttttttt ggctgcactg tccagcctgt gagatcttag ttctccaacc 3301 tggcatcgaa ctcagaacca cagcaaagaa agctggtgtt cactactgaa tcagg // LOCUS BSUPOLCA 240 bp ds-DNA BCT 31-MAY-1990 DEFINITION B.subtilis DNA polymerase III (polC) gene, partial cds. ACCESSION M33543 KEYWORDS DNA polymerase III. SOURCE B.subtilis DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 240) AUTHORS Barnes,M.H., Hammond,R.A., Foster,K.A., Mitchener,J.A. and Brown,N.C. TITLE The cloned polC gene of Bacillus subtilis: Characterization of the azp12 mutation and controlled in vitro synthesis of active DNA polymerase III JOURNAL Gene 85, 177-186 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 240 DNA polymerase III (polC) (AA at 1) mut 76 76 t in wt; g in azp12 mutant BASE COUNT 59 a 52 c 56 g 73 t ORIGIN 1 gaattcggaa cccgatttgt ccggcagatg cttgaagata caaagccgac cactttttct 61 gagctcgttc agatttcagg cttgtctcac ggaactgatg tatggcttgg caatgcacag 121 gagctcatcc acaataatat ttgtgagctg agtgaggtta tcggctgccg tgatgacatt 181 atggtttatt taatctatca aggccttgag ccgtcccttg cctttaaaat catggaattc // LOCUS FIBGLUC 1426 bp ds-DNA BCT 31-MAY-1990 DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene, complete cds. ACCESSION M33676 KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase. SOURCE F.succinogenes (strain S85) DNA, clone PJI5. ORGANISM Fibrobacter succinogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1426) AUTHORS Teather,R.M. and Erfle,J.D. TITLE DNA sequence of a Fibrobacter succinogenes mixed linkage beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.M.Teather, 11-APR-1990. FEATURES from to/span description pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase precursor (EC 3.2.1.73) sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase signal peptide matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase binding 132 137 ribosome binding site signal 62 66 -35 region signal 85 90 -10 region BASE COUNT 371 a 346 c 335 g 374 t ORIGIN 1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt 61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt 121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc 181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt 241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg 301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc 361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc 421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct 481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac 541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac 601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt 661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat 721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat 781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt 841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc 901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa 961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct 1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac 1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt 1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg 1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt 1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa 1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt 1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa // LOCUS CEATAT 264 bp ss-RNA VRL 31-MAY-1990 DEFINITION Caprine arthritis encephalitis virus tat gene, complete cds. ACCESSION M33674 KEYWORDS tat protein. SOURCE Caprine arthritis encephalitis virus (strain Cork) proviral DNA, clone pCo1.9, isolated from goat synovial membrane. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) see COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 11-APR-1990. Washington State University, Dept. of Veterinary Microbiology and Pathology, Pullman, WA 99164-7040 FEATURES from to/span description pept 1 264 tat protein BASE COUNT 97 a 40 c 80 g 47 t ORIGIN Map position 5856. 1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta 61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg 121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg 181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta 241 tgtaacccgg ggtggggaac ataa // LOCUS CEATATX 264 bp ss-RNA VRL 31-MAY-1990 DEFINITION Caprine arthritis encephalitis virus tat protein mRNA, complete cds. ACCESSION M33675 KEYWORDS tat protein. SOURCE Caprine arthritis encephalitis virus (strain G63) cDNA to viral RNA, clone pC63-49, isolated from goat synovial membrane. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) see COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 11-APR-1990. Washington State University, Dept. of Veterinary Microbiology and Pathology, Pullman, WA 99164-7040 FEATURES from to/span description pept 1 264 tat protein BASE COUNT 99 a 43 c 75 g 47 t ORIGIN Map position 5856. 1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata 61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg 121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg 181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta 241 tgtaacccgg ggtgggaaac ataa // LOCUS MUSLRPA 3070 bp ss-mRNA ROD 31-MAY-1990 DEFINITION Mouse LCA-related phosphatase (LRP) mRNA, complete cds. ACCESSION M33671 KEYWORDS leukocyte common antigen-related phosphatase; tyrosine phosphatase. SOURCE Mouse (strain (C57BL/6 X DBA/2)F1) pre-B cell line 70Z/3, cDNA to mRNA, clones mLRP-[B20,C26]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3070) AUTHORS Matthews,R.J., Cahir,E.D. and Thomas,M.L. TITLE Identification of a novel member of the protein tyrosine phosphatase family: Evidence for alternative splicing in the tyrosine phosphatase domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.L.Thomas, 11-APR-1990. FEATURES from to/span description pept 26 2515 LCA-related phosphatase precursor (EC 3.13.48) sigp 26 82 LCA-related phosphatase signal peptide matp 83 2512 LCA-related phosphatase mRNA < 1 3070 LCA-related phosphatase mRNA site 828 936 variably spliced exon BASE COUNT 874 a 754 c 732 g 710 t ORIGIN 1 ccgcccagcg ccgggctcgg tcagcatgga ttcctggttc attcttgtcc tgtttggcag 61 tggtctaata catgttagtg ccaacaatgc tactacagtt tcaccttctt taggaacgac 121 aagattaatt aaaacatcaa caacagaatt ggctaaggaa gagaataaaa cctcaaattc 181 aacctcttca gtaatttctc tttctgtggc accaacattc agcccaaacc tgactctgga 241 gcccacctat gtgactactg ttaattcttc acactctgac aatgggacca ggagggcagc 301 cagcacggaa tctggaggca ctaccatttc cccgaacgga agctggctta ttgagaacca 361 gttcacggat gccataacag aaccctggga ggggaactcc agcactgcag caaccactcc 421 agaaaccttc cccccggcag atgagacacc aattattgcg gtgatggtgg ccctgtcctc 481 tctgctagta atcgtgttta ttatcatagt tctgtacatg ttaaggttta agaaatacaa 541 gcaagctggg agtcattcca actctttccg cctgtcaaat ggccgcacgg aggatgtgga 601 gccccaaagt gtaccacttc tggccaggtc cccaagcacc aacaggaagt acccaccact 661 gcctgtggac aagctggaag aggagattaa ccggagaatg gctgatgaca ataagatctt 721 cagagaagaa ttcaacgctc tccctgcttg tcctatccag gccacctgtg aggctgcctc 781 caaggaagaa aacaaggaaa aaaaccgcta tgtaaacatc ctgccctttc tctctttagc 841 tgtgagcaag gatgcagtga aagcactgaa caaaaccact ccattgttag aaagaaggtt 901 tattgggaaa tcaaactcca gaggctgtct ctcagatgac cactctagag tgcacctgac 961 acctgttgaa ggggtcccag attctgatta catcaacgct tcattcatta atggctacca 1021 ggaaaagaac aaattcatcg ctgcacaagg accaaaagaa gaaacagtga atgacttctg 1081 gagaatgata tgggaacaaa acacagctac tattgtcatg gtgaccaacc tgaaggagag 1141 aaaggagtgt aaatgtgccc aatactggcc agaccaaggc tgctggacct atgggaatgt 1201 ccgtgtgtct gtcgaggatg tgactgttct ggtggactac acagtacgga aattctgcat 1261 ccagcaggtg ggcgacgtga ccaacaggaa accacagcgc ctcatcactc agttccactt 1321 caccagctgg ccagactttg gggtgccttt caccccaatt ggcatgctca agttcctcaa 1381 gaaggtgaag gcctgtaacc ctcagtacgc aggggctatc gtggtccact gcagtgcagg 1441 tgtagggcgc actggcacct ttgttgtcat cgatgccatg ctggacatga tgcattcgga 1501 acgcaaagtg gatgtatacg ggtttgtgag ccggatccgg gcccagcgct gccagatggt 1561 acagacagac atgcagtacg tcttcatata ccaggccctt ctggagcatt atctgtatgg 1621 ggacacagaa ctggaagtga cttctctaga aacccaccta caaaaaattt ataacaagat 1681 cccagggacg agcaacaacg ggttagagga ggagtttaag aaattaactt caatcaaaat 1741 ccagaatgac aagatgcgca cgggaaacct tccagccaac atgaagaaga accgggtttt 1801 acagatcatt ccatatgaat ttaacagagt gatcattcca gtcaaacgag gcgaagagaa 1861 cacagactat gtgaacgcat ccttcattga tggataccgg cagaaagact cctacattgc 1921 cagccagggc cctcttctcc acacgattga ggacttctgg cgaatgatct gggagtggaa 1981 gtcctgttct atcgtaatgc tgacagaact ggaagagaga ggccaggaga agtgtgccca 2041 gtactggcca tctgatggcc tggtgtccta tggagacatc acagttgagc tgaagaagga 2101 ggaggaatgt gaaagctaca ctgtccgaga cctcctggtc accaacacca gggagaacaa 2161 gagtcggcaa atccggcagt tccacttcca cggctggcct gaggtgggca tccccagcga 2221 cggcaagggc atgatcaaca tcattgcagc agtgcagaag cagcagcagc agtcggggaa 2281 ccatcccatc actgtgcact gcagtgccgg ggcaggacgg acaggaacct tctgtgcctt 2341 gagcacagtc ctggaacgtg tgaaagcaga aggaatttta gatgtcttcc aaactgtcaa 2401 gagcctgcgg ctgcagaggc cacacatggt ccagacactg gaacagtatg aattctgcta 2461 caaggtggta caggagtaca ttgacgcctt ttcagattat gccaacttca agtgacaggt 2521 gacaaggccc acagacagga gaattgcctt taatattttg taatattctg tttttgttaa 2581 tatacccaaa attgtatata tcttataact gttttagaaa tggcacatag gcttctatta 2641 cctgttaggt ggagattttg tatgtaaatg tgttagcact gatagtcctt ttccagtgtt 2701 ttattgggaa attaaatagt gtgatatttg ggttgatata atgaaatcct cagcctggaa 2761 actgggccag attgttcctt gcttcaaata tcttttccta aagaagataa acctaagact 2821 cattccaggt agctcagtgc caactaaaac aaagcacaaa gttctcagag ctcttgagga 2881 aatggttgtc tccctgtccc caggcaggcc tcttcccctc cctgtcctgt aaatatccct 2941 cccctctcca gtccaccctc atctcccacc aagatcagcc acctcaggca tggggagtaa 3001 tgagaccaga gcgcctctct ggcaccacag cagggatcgt caggtaataa acactcttga 3061 ttccctgagg // LOCUS BOV33DPTP 982 bp ss-mRNA MAM 31-MAY-1990 DEFINITION Cow 33-kDa phototransducing protein mRNA, 3' end. ACCESSION M33529 KEYWORDS phototransducing protein. SOURCE Cow adult photoreceptor rod cell, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 982) AUTHORS Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S., Yamaki,K. and Shinohara,T. TITLE Analysis of the human, bovine and rat 33-kDa proteins and cDNA in retina and pineal gland JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Shinohara, 03-APR-1990. FEATURES from to/span description pept < 1 719 33-kDa phototransducing protein (AA at 3) BASE COUNT 334 a 164 c 215 g 269 t ORIGIN 1 aaagtttgga ggaagacttt gaaggacagg cctcacatac aggacccaaa ggagtaataa 61 atgactggag aaagtttaaa ttggagagtg aagatagtga ttcagttgca cccagcaaga 121 aggagattct cagacaaatg tcttctcctc agagtagaga tgacaaagac tcaaaagaaa 181 gattcagcag aaagatgagc gttcaagaat atgaactaat ccacaaagac aaagaagatg 241 aaaattgcct tcgtaaatac cgcagacagt gtatgcagga tatgcaccag aagctgagtt 301 ttgggcctag atatgggttt gtgtatgagc tggaatctgg ggagcaattc ctggaaacca 361 ttgaaaagga acagaaaatc accactatcg ttgttcatat ttatgaagat ggtattaagg 421 gctgtgatgc tctaaacagt agcttgatat gccttgcagc cgaataccct atggtcaagt 481 tttgtaaaat aaaggcttct aatacaggtg ccggagaccg cttttcctca gatgtactcc 541 ccacgctgct tgtctacaaa ggtggggaac tcctaagcaa tttcattagt gttactgaac 601 agctggctga agaatttttt actggggatg tggagtcttt cctaaatgaa tatgggttat 661 tacctgaaaa agagatgcat gtcctagagc agagcaaaat ggaagaggat atggaataaa 721 gattcactat gtcaatgttt catatttcgt atttctcctt taagcattga acactgattt 781 tggtagtatt cacattcttt tagggaatac caaacatagc cctggctttt ctaatttggg 841 gaagaaaaac tccagactga cactaaaatt atatgattag catgtcttaa tattagttac 901 tcaagctgat ataacacttt acctcaaaac attgtagtct tcagcaatat gttagtagac 961 aaagagaata tgaaaaatac tg // LOCUS HUM33DPTP 1182 bp ss-mRNA PRI 31-MAY-1990 DEFINITION Human 33-kDa phototransducing protein mRNA, complete cds. ACCESSION M33478 KEYWORDS phototransducing protein. SOURCE Human adult photoreceptor rod cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1182) AUTHORS Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S., Yamaki,K. and Shinohara,T. TITLE Analysis of the human, bovine and rat 33-kDa proteins and cDNA in retina and pineal gland JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Shinohara, 30-MAR-1990. FEATURES from to/span description pept 52 792 33-kDa phototransducing protein mRNA < 1 1181 33-kDa phototransducing protein mRNA BASE COUNT 405 a 187 c 242 g 348 t ORIGIN 1 aggacaccag gcacagagat ccaaactatt atatcaaatc caatccctaa aatggaagaa 61 gccaaaagcc aaagtttgga ggaagacttt gaaggacagg ccacacatac aggacccaaa 121 ggagtaataa atgattggag aaagtttaaa ttagagagtc aagacagtga ttcaattcca 181 cctagcaaga aggagattct caggcaaatg tcttctcctc agagtaggaa tggcaaagat 241 tcaaaggaac gagtcagcag aaagatgagc attcaagaat atgaactaat ccataaagag 301 aaagaggatg aaaactgcct tcgtaaatac cgtagacagt gtatgcagga tatgcaccag 361 aagctgagtt ttgggcctag atatgggttt gtgtatgagc tggaaactgg aaagcaattc 421 ctagaaacaa ttgaaaagga actgaagatc accacaattg ttgttcacat ttatgaagat 481 ggtattaagg gttgtgatgc tctaaacagt agtttaacat gccttgcagc agaataccct 541 atagttaagt tttgtaaaat aaaagcttcg aatacaggtg ctggggaccg cttttcctta 601 gatgtacttc ctacactgct catctataaa ggtggggaac tcataagcaa ttttattagt 661 gttgctgaac agtttgctga agaatttttt gctggggatg tggagtcttt cctaaatgaa 721 tatgggttac tacctgaaag agaggtacat gtcctagagc ataccaaaat agaagaagaa 781 gatgttgaat gaagattcac tatgtcaata tctcatgttt atcctttagg tattggatga 841 tggttttggt agtatctata ttgcttttgt gaacacagag tatgggcacg gctatgctaa 901 cttgacaaaa atgactgatg caacaatcga gttattagca tttcatagta ttagttactc 961 aaattgatac aatgcttgac tacaaaacaa agctgtcttc agcaacatta ttagtagaca 1021 aagaggatgt ggataatatt atgacatttt tcaaaaatcc ctttcaagtt atgttttgtc 1081 ttttttactc cattttccct catcactgtt attatttgga cttttcaaat tacattattc 1141 attataattt tctttgtgta ataaaaatga aatctcatga ag // LOCUS RAT33DPPT 1221 bp ss-mRNA ROD 31-MAY-1990 DEFINITION Rat 33-kDa phototransducing protein mRNA, complete cds. ACCESSION M33530 KEYWORDS phototransducing protein. SOURCE Rat (strain Sprague-Dawley) adult photoreceptor rod cell, cDNA to mRNA, pineal gland library. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1221) AUTHORS Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S., Yamaki,K. and Shinohara,T. TITLE Analysis of the human, bovine and rat 33-kDa proteins and cDNA in retina and pineal gland JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Shinohara, 03-APR-1990. FEATURES from to/span description pept 92 832 33-kDa phototransducing protein BASE COUNT 399 a 234 c 282 g 306 t ORIGIN 1 gttcagtgga cagagattct cacccactgg acataatcta ggactccagg aacagagacc 61 ccaaactact acaccgaaca caacccataa aatggaagaa gccgcaagcc aaagcttaga 121 ggaagatttt gaaggacagg ccacacacac aggacccaaa ggagtaataa atgactggag 181 aaagtttaaa ttagaaagtg aagatggtga ttcaattcca cccagcaaga aggagatcct 241 cagacaaatg tcctctcctc agagcagaga tgacaaagac tcgaaagaaa gaatgagcag 301 aaagatgagc attcaagaat atgaactaat tcatcaggac aaagaagatg aaggttgcct 361 tcgcaaatac cgcagacagt gcatgcagga tatgcatcag aagctgagct ttgggcctag 421 gtatgggttt gtgtatgagc tggaaacagg ggagcaattc ctggaaacca tcgaaaagga 481 gcagaaggtc accaccatcg tggttaacat ttacgaggat ggtgtcaggg gctgtgacgc 541 actcaacagc agtttagaat gccttgcagc agaataccca atggtcaagt tctgtaaaat 601 aagagcttcg aatactggag ctggggaccg cttttcctca gacgtactcc cgacattgct 661 cgtatacaaa ggtggggagc tcataagcaa ttttattagt gtcgctgaac aatttgccga 721 agattttttc gctgcggatg tggagtcttt cctaaatgaa tatggcttac taccagaaag 781 agagatacat gacctagggc agaccaacac ggaagatgaa gatatcgagt aagcacgcac 841 ggtgcagtat ctcgtgacta tcctttgcac agtgaacatc gatggttttt ttggtagtat 901 ctatattcct ttagcaaaca ctaaatacag tcaggccaac tcaaattggg gaggggacac 961 taaaattatg taaatagcat ttttagtact aattattcaa attgagataa tattttactg 1021 caaaaaatat tgtggttctg agcaaataac tactgaacag aggaggtggg tgatgtcaac 1081 attaggtcat tgtaaaaatc cctttccagt tccgtgttag ctttttactc ctccttttcc 1141 ccttggcatt tatttttggg cttttgaggc tgaatgctac ttataaattg gtttgtatga 1201 taagaataaa gtcctatgaa g // LOCUS RAT33DPTP 1272 bp ss-mRNA ROD 31-MAY-1990 DEFINITION Rat 33-kDa phototransducing protein mRNA, complete cds. ACCESSION M33528 KEYWORDS phototransducing protein. SOURCE Rat (strain Sprague-Dawley) adult photoreceptor rod cell, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1272) AUTHORS Abe,T., Nakabayashi,H., Tamada,H., Takagi,T., Sakuragi,S., Yamaki,K. and Shinohara,T. TITLE Analysis of the human, bovine and rat 33-kDa proteins and cDNA in retina and pineal gland JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Shinohara, 03-APR-1990. FEATURES from to/span description pept 82 822 33-kDa phototransducing protein mRNA < 1 1272 33-kDa phototransducing protein mRNA BASE COUNT 420 a 244 c 289 g 319 t ORIGIN 1 cagagattct cacccactgg acataatcta ggactccagg aacagagacc ccaaactact 61 acaccgaaca caacccataa aatggaagaa gccgcaagcc aaagcttaga ggaagatttt 121 gaaggacagg ccacacacac aggacccaaa ggagtaataa atgactggag aaagtttaaa 181 ttagaaagtg aagatggtga ttcaattcca cccagcaaga aggagatcct cagacaaatg 241 tcctctcctc agagcagaga tgacaaagac tcgaaagaaa gaatgagcag aaagatgagc 301 attcaagaat atgaactaat tcatcaggac aaagaagatg aaggttgcct tcgcaaatac 361 cgcagacagt gcatgcagga tatgcatcag aagctgagct ttgggcctag gtatgggttt 421 gtgtatgagc tggaaacagg ggagcaattc ctggaaacca tcgaaaagga gcagaaggtc 481 accaccatcg tggttaacat ttacgaggat ggtgtcaggg gctgtgacgc actcaacagc 541 agtttagaat gccttgcagc agagtaccca atggtcaagt tctgtaaaat aagagcttcg 601 aatactggag ctggggaccg cttttcctca gacgtactcc cgacattgct catatacaaa 661 ggtggggagc tcataagcaa ttttattagt gtcgctgaac aatttgccga agattttttc 721 gctgcggatg tggagtcttt cctaaatgaa tatggcttac taccagaaag agagatacat 781 gacctagggc agaccaacac ggaagatgaa gatatcgagt aagcacgcac ggtgcagtat 841 ctcgtgacta tcctttgcac agtgaacatc gatggttttt ttggtagtat ctatattcct 901 ttagcaaaca ctaaatacag tcaggccaac tcaaattggg gaggggacac taaaattatg 961 taaatagcat ttttagtact aattattcaa attgagataa tattttactg caaaaaatat 1021 tgtggttctg agcaaataac tactgaacag aggaggtggg tgatgtcaac gttaggtcat 1081 tgtaaaaatc cctttccagt tccgtgttag cttattactc ctccttttcc ccttggcatt 1141 tatttttggg cttttgaggc tgaatgctac ttataaattg gtttgtatga taagaataaa 1201 gtcctatgaa gaaacaaata gtctgtctaa agtctgccgt gagtcaagga cttaaaaact 1261 tcataaagct tt // LOCUS HUMCPIIA3A 1748 bp ss-mRNA PRI 31-MAY-1990 DEFINITION Human cytochrome P450IIA3 (CYP2A3) mRNA, complete cds. ACCESSION M33318 M33316 KEYWORDS coumarin 7-hydroxylase; cytochrome P450; cytochrome P450IIA3. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1748) AUTHORS Yamano,S., Tatsuno,J. and Gonzalez,F.J. TITLE The CYP2A3 gene product catalyzes coumarin 7-hydroxylation in human liver microsomes JOURNAL Biochemistry 29, 1322-1329 (1990) STANDARD simple staff_review FEATURES from to/span description pept 10 1494 cytochrome P450IIA3 /nomgen="CYP2A" /map="19q13.1-q13.2" /hgml_locus_uid="LN0074T" mRNA < 1 1748 CYPIIA3 mRNA allele 60 60 g in allele CYP2A3; a in allele CYP2A3v allele 488 488 t in allele CYP2A3; a in allele CYP2A3v allele 1645 1645 g in allele CYP2A3; c in allele CYP2A3v BASE COUNT 395 a 478 c 516 g 359 t ORIGIN 1 gctaccacca tgctggcctc agggatgctt ctggtggcct tgctggtctg cctgactgtg 61 atggtcttga tgtctgtttg gcagcagagg aagagcaagg ggaagctgcc tccgggaccc 121 accccattgc ccttcattgg aaactacctg cagctgaaca cagagcagat gtacaactcc 181 ctcatgaaga tcagtgagcg ctatggcccc gtgttcacca ttcacttggg gccccggcgg 241 gtcgtggtgc tgtgtggaca tgatgccgtc agggaggctc tggtggacca ggctgaggag 301 ttcagcgggc gaggcgagca agccaccttc gactgggtct tcaaaggcta tggcgtggta 361 ttcagcaacg gggagcgcgc caagcagctc cggcgcttct ccatcgccac cctgcgggac 421 ttcggggtgg gcaagcgagg catcgaggag cgcatccagg aggaggcggg cttcctcatc 481 gacgccctcc ggggcactgg cggcgccaat atcgatccca ccttcttcct gagccgcaca 541 gtctccaatg tcatcagctc cattgtcttt ggggaccgct ttgactataa ggacaaagag 601 ttcctgtcac tgttgcgcat gatgctagga atcttccagt tcacgtcaac ctccacgggg 661 cagctctatg agatgttctc ttcggtgatg aaacacctgc caggaccaca gcaacaggcc 721 tttcagttgc tgcaagggct ggaggacttc atagccaaga aggtggagca caaccagcgc 781 acgctggatc ccaattcccc acgggacttc attgactcct ttctcatccg catgcaggag 841 gaggagaaga accccaacac ggagttctac ttgaaaaacc tggtgatgac cacgttgaac 901 ctcttcattg ggggcaccga gaccgtcagc accaccctgc gctatggctt cttgctgctc 961 atgaagcacc cagaggtgga ggccaaggtc catgaggaga ttgacagagt gatcggcaag 1021 aaccggcagc ccaagtttga ggaccgggcc aagatgccct acatggaggc agtgatccac 1081 gagatccaaa gatttggaga cgtgatcccc atgagtttgg cccgcagagt caaaaaggac 1141 accaagtttc gggatttctt cctccctaag ggcaccgaag tgtaccctat gctgggctct 1201 gtgctgagag accccagttt cttctccaac ccccaggact tcaatcccca gcacttcctg 1261 aatgagaagg ggcagtttaa gaagagtgat gcttttgtgc ccttttccat cggaaagcgg 1321 aactgtttcg gagaaggcct ggccagaatg gagctctttc tcttcttcac caccgtcatg 1381 cagaacttcc gcctcaagtc ctcccagtca cctaaggaca ttgacgtgtc ccccaaacac 1441 gtgggctttg ccacgatccc acgaaactac accatgagct tcctgccccg ctgagcgagg 1501 gctgtgccgg tgcaggtctg gtgggcgggg ccagggaaag ggcagggcca agaccgggct 1561 tgggagaggg gcgcagctaa gactgggggc aggatggcgg aaaggaaggg gcgtggtggc 1621 tagagggaag agaagaaaca gaaggggctc agttcacctt gataaggtgc ttccgagctg 1681 ggatgagagg aaggaaaccc ttacattatg ctatgaagag tagtaataat agcagctctt 1741 atttcctg // LOCUS HUMCYIIA4A 1747 bp ss-mRNA PRI 31-MAY-1990 DEFINITION Human cytochrome P450IIA4 (CYP2A4) mRNA, complete cds. ACCESSION M33317 KEYWORDS cytochrome P450; cytochrome P450IIA4. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1747) AUTHORS Yamano,S., Tatsuno,J. and Gonzalez,F.J. TITLE The CYP2A3 gene product catalyzes coumarin 7-hydroxylation in human liver microsomes JOURNAL Biochemistry 29, 1322-1329 (1990) STANDARD simple staff_review FEATURES from to/span description pept 10 1494 cytochrome P450IIA4 /hgml_locus_uid="LN0074T" /nomgen="CYP2A" /map="19q13.1-q13.2" mRNA < 1 1747 CYP2A4 mRNA BASE COUNT 396 a 471 c 517 g 363 t ORIGIN 1 gctaccacca tgctggcctc agggctgctt ctggtggcct tgctggcctg cctgactgtg 61 atggtcttga tgtctgtctg gcagcagagg aagagcaggg ggaagctgcc tccgggaccc 121 accccactgc ccttcattgg aaactacctc cagctgaaca cagagcacat atgtgactcc 181 atcatgaagt tcagtgagtg ctatggcccc gtgttcacca ttcacttggg gccccggcgg 241 gtcgtggtgc tgtgtggaca tgatgccgtc agggaggctc tggtggacca ggctgaggag 301 ttcagcgggc gaggcgagca agccaccttc gactgggtct tcaaaggcta tggcgtggcg 361 ttcagcaacg gggagcgcgc caagcagctc ctgcgctttg ccatcgccac cctgagggac 421 ttcggggtgg gcaagcgagg catcgaggag cgcatccagg aggagtcggg cttcctcatc 481 gaggccatcc ggagcagcca cggcgccaat atcgatccca ccttcttcct gagccgcaca 541 gtctccaatg tcatcagctc cattgtcttt ggggaccgct ttgactatga ggacaaagag 601 ttcctgtcac tgctgagcat gatgctagga atcttccagt tcacgtcaac ctccacgggg 661 cagctctatg agatgttctc ttcggtgatg aaacacctgc caggaccaca gcaacaggcc 721 tttaagttgc tgcaagggct ggaggacttc atagccaaga aggtggagca caaccagcgc 781 acgctggatc ccaattcccc acaggacttc atcgactcct ttctcatcca catgcaggag 841 gaggagaaga accccaacac ggagttctac ttgaagaacc tgatgatgag cacgttgaac 901 ctcttcattg caggcacgga gaccgtcagc accaccctgc gctatggctt cttgctgctc 961 atgaagcacc cagaggtgga ggccaaggtc catgaggaga ttgacagagt gatcggcaag 1021 aaccggcagc ccaagtttga ggaccggacc aagatgccct acatggaggc agtgatccac 1081 gagatccaaa gatttggaga cgtgatcccc atgagtttgg cccgcagagt caaaaaggac 1141 accaagtttc gggatttttt cctccctaag ggcaccgaag tgttccctat gctgggctcc 1201 gtgctgagag accccagttt cttctccaac ccccaggact tcaatcccca gcacttcctg 1261 gatgacaagg ggcagtttaa gaagagtgat gcttttgtgc ccttttccat cggaaagcgg 1321 tactgtttcg gagaaggcct ggccagaatg gagctctttc tcttcttcac caccgtcatg 1381 cagaacttcc gcctcaagtc ctcccagtca cctaaggaca ttgacgtgtc ctccaaacac 1441 gtgggctttg ccacgatccc acgaaactac accatgagct tcctgccccg ctgagcgagg 1501 gctgtgccgg tgcaggtctg gtgggcgggg ccagggaaag gcggggtcag ggcggggttc 1561 gcggaagagg cgggtataag aatgggggga agatgcggga aaggaagggg cgtggtggct 1621 agagggaaga gaagaaacag aaggggctca gttcaccttg ataaggtgct tccgtggtgg 1681 gatgagagga agggaaacct tacattatgc tatgaagagt agtaataata gcagctctta 1741 tttcctg // LOCUS TRLKPRGXY 2568 bp ds-DNA ORG 31-MAY-1990 DEFINITION Leptomonas sp. 9S and 12S ribosomal RNA genes. ACCESSION J03814 KEYWORDS 12S ribosomal RNA; 9S ribosomal RNA; ribosomal RNA gene. SOURCE Leptomonas sp. kinetoplast DNA. ORGANISM Kinetoplast Leptomonas sp. Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Leptomonas sp.. REFERENCE 1 (bases 1 to 2568) AUTHORS Lake,J.A., de la Cruz,V.F., Ferreira,P.C.G., Morel,C. and Simpson,L. TITLE Evolution of parasitism: Kinetoplastid protozoan history reconstructed from mitochondrial rRNA gene sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 4779-4783 (1988) STANDARD simple staff_review COMMENT Draft entry and computer_readable sequence for [1] kindly provided by L.Simpson, 06-MAY-1988. FEATURES from to/span description rRNA 351 1500 12S rRNA (put.) rRNA 1651 2250 9S rRNA (put.) BASE COUNT 968 a 207 c 322 g 1071 t ORIGIN BamHI site. 1 gatcctaggt aatctcgtca gctgctgatc acaccaaaaa acatatatta caataaatta 61 ctaattatac taatattaaa taagaaagat agaaatgtgg ttaatatgca ttattaaggg 121 atgtaatgat gtgatgacat gtaattaaaa gacctcggtt atatgaatta tttataaaaa 181 cttgtagggt gtggaattaa aagagtatct aagcctaagc cctaagtccc ctctctctca 241 tcgtacttta catcaattat tactgtaatt aattttcgtg agcggagcac atggcgttta 301 atttgagaca taaattagta agaaaagggt aaaaataaat tgtagatttt attttgtttt 361 tatttatcaa tatagttaat aaaataatct agaattttat gttaaatata taattatatt 421 tttgattatt atattttgtt attttattta agttaattaa attgtattat atttaatttt 481 taaatttgtt gttttatatt tagttttatg tttataattt aatgcaatac tgcacatatt 541 ttaaaatttt aatgaactgt tatttatagt tgattatttt tagtttaatg tttaaatatt 601 taactaatgg aggcacagtt gttctatatg taccaataaa aaatagtaaa attaatttaa 661 ttaatttaat aaataattat ttgatcaaaa ttagtacaaa tattatgtaa aatttaaaaa 721 ttaaatattt tatctaatat taacttatat taaatatttg gtttacattg aatattaaat 781 ttacagattt aatttgtaat taatttaata ttgtgtgttt attaaaatag aggattaatg 841 taataaattt ttaattttaa tatttaatat ttgttaaatt aatagtaaca taacttaatt 901 aaaaagaatt atttatacgt ataaattttt aaaattataa ttatattgtt aataaaatta 961 tcaagtttca aaagcgttta ttaaatgcgt ctgtctaaga tttatattta agagtattct 1021 tgtatatagt tttaagtttt aataatccta tttaattata aagttaacct caaaaagcat 1081 ttataagtag catagtaatt tgttaactaa atattaaagt gttccataga aaattttaaa 1141 attataacaa tcatcgtaac taataaatta aaataaaaat tttaaaaaaa attaaaaaat 1201 taaaataggg caagtcctac tctcctttac aaagagaaca tttaatatgt agttgtatgt 1261 ttgattgggg caatactata tcttgttata tagaaaaaga actataatta ttgaaataat 1321 aaaaggttcg agcaggttaa caagcattaa tactaaatgt gtttcatcgt ctacttattg 1381 ctataaaaaa ttgattgttc atcaaaaatg caattcgtta gttgggttaa aatcgttgta 1441 aagcagattt gtttatatat ttaatatttt tattatttta aaaattaata ttagtacgca 1501 aggattcatt atttgttgct ttatttatat ttatatttat atttatattt atatttataa 1561 tatttattat attttaattg cttttttgcg tttgtatcgt ataattacat atttattata 1621 aatatataat tctatattta aatgtagtta ttttatatat gtatatgtat atgtagtcaa 1681 ttgttattat tcatattaat ttttttaatt gtttttaaat tgttttatat tagtttattg 1741 aaatacttat ttattttcaa tgtttaggaa tagttaataa taatttataa ttttgattag 1801 attactttgt tgatgctata agaaggggtg tggaaatagt gttaaatgta agatatataa 1861 ctataataaa taacatataa attattagtc agaaatggat gccagccgtt gcggtaatat 1921 ctatgctttt aaatattata catttattat attaatttgt tacttattaa ttttagtcaa 1981 taaaaagttt tatatatttt aatttgtttt tatacaccat caggtatatg caaatataaa 2041 atgacattaa ttattaatta tattatatta tatttattca tatctttata cacataaaat 2101 ttttagattt tgacaccatg aaaaggctat cgaatggaat tgtatatttt ataatcaaaa 2161 ttaattaatt atattaaatt attaaatttt agataaaaaa ataaattcaa aaggtattgt 2221 tgcccaccaa tttttataat aaaaataacg tgcagtaatt aatgaactta taaaagtaca 2281 ttttttcata ttaaactttc attatctttt ttgttattta aatatttatg cattaaatct 2341 taaaaattta aatatttgca cacatgtgta taatatatta aattgggaac cccgttgaag 2401 gagggacaga ccaagaggac agagaggtcg ggaatttcag cgatttgatt tttttttttt 2461 tgggggagcg gagcagtcga ggaaagccca gagttttcag agcgttggcg aagaagggtc 2521 gttttattcg gaaaataaag accgttctgg aaggggagtt ttttcagg // LOCUS MUSCALP1 83 bp ds-DNA ROD 31-MAY-1990 DEFINITION Mouse calpactin I heavy chain (p36) gene, exon 1. ACCESSION M33321 KEYWORDS calpactin I; calpactin I heavy chain; p36 protein. SEGMENT 1 of 2 SOURCE Mouse DNA, clones RW22 and B34. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 83) AUTHORS Amiguet,P., D'Eustachio,P., Kristensen,T., Wetsel,R.A., Saris,C.J.M., Hunter,T., Chaplin,D.D. and Tack,B.F. TITLE Structure and chromosome assignment of the murine p36 (calpactin I heavy chain) gene JOURNAL Biochemistry 29, 1226-1232 (1990) STANDARD simple staff_review FEATURES from to/span description pept 30 / 77 calpactin I heavy chain (p36), exon 1 pre-msg 23 > 83 calpactin I heavy chain mRNA and introns IVS 78 > 83 calpactin I heavy chain intron A BASE COUNT 17 a 16 c 16 g 34 t ORIGIN 1 tttttttttc agcttttttt ttcttcaaaa tgtctactgt ccacgaaatc ctgtgcaagc 61 tcagcctgga gggtgatgta agt // LOCUS MUSCALP2 573 bp ds-DNA ROD 31-MAY-1990 DEFINITION Mouse calpactin I heavy chain (p36) gene, exon 12. ACCESSION M33322 KEYWORDS calpactin I; calpactin I heavy chain; p36 protein. SEGMENT 2 of 2 SOURCE Mouse DNA, clones RW22 and B34. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 573) AUTHORS Amiguet,P., D'Eustachio,P., Kristensen,T., Wetsel,R.A., Saris,C.J.M., Hunter,T., Chaplin,D.D. and Tack,B.F. TITLE Structure and chromosome assignment of the murine p36 (calpactin I heavy chain) gene JOURNAL Biochemistry 29, 1226-1232 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 7 63 calpactin I heavy chain (p36), exon 12 pre-msg < 1 343 calpactin I heavy chain mRNA and introns IVS < 1 6 calpactin I heavy chain intron K BASE COUNT 137 a 145 c 132 g 159 t ORIGIN About 22 kb after segment 1. 1 ctgcaggaca ccaagggtga ctaccagaag gcactgctgt acctgtgtgg tggggatgac 61 tgaagggctc agcacagtgg atcacccaga agtggctcta cctgtgcccc aacctggcgt 121 tctagagact tcgctctcca ctaatggacc cctgagctcc tccctgtgag gatgatgaca 181 gggctgccga ccctttcccc atcttagctg cccttgcctg gctttctcct cattctctcc 241 tttatgccaa agaagtgaac attccaggga gtggggcgtc agtctgtgac atgagacact 301 tcctcttatg tactgtgtcg tgaataaacc gtttttactt tagaaacaag aatgccgttg 361 cctttgcttt caagccactg ttcagaaagc agacatactt gggatttgat tgttcgccca 421 gggagttagg cccctagagg atgaaagttc tagaacttat ctcttccctc tggtagccac 481 tatacagatg gcactttttt aaaaggatga gtgaatttaa aattatatac agttcaaatt 541 ttacccctta ggcaaagggc tacagctctt atc // LOCUS HUMLSERP1 1235 bp ds-DNA PRI 31-MAY-1990 DEFINITION Human leuserpin 2 (hLS2) gene, 5' flank. ACCESSION M33659 J03921 KEYWORDS leuserpin 2; serine protease inhibitor. SEGMENT 1 of 2 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1235) AUTHORS Ragg,H. and Preibisch,G. TITLE Structure and expression of the gene coding for the human serpin hLS2 JOURNAL J. Biol. Chem. 263, 12129-12134 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Ragg, 17-JAN-1988 FEATURES from to/span description IVS 1134 > 1235 leuserpin 2 (hLS2) intron A BASE COUNT 345 a 313 c 302 g 275 t ORIGIN 1 ctcgggaggt tgaggctgca gtgagccaag atcacgccac tgcacttcag cctgggtaac 61 agagtgagac cctgtctcaa aaaacacata gggcaggcgt ggtggctcac gcatgtaatc 121 ccagcacttt gggaggccga gacgggagga tcacttcact ccaggagttc aacaccagcc 181 tggccaacat agtgaaaccc cgtctctact aaaaatacaa aaaattagtt ggacatggtg 241 gtgtgcgcct gtaatctcag ccactcagga ggctgaggca ggagaatcgc ttgaacttgg 301 gagacagagg ttgcagtgag ctgagatcgc accactgcac tccagcatgg gcagcagcgc 361 gaaactctgt ctcaaaacaa acaaacaaac aaacaaacac ccataaacac aaaatgtatc 421 acagcctcag agatccccac gaatgcctaa gtggccctga atttgggagg cactgctcag 481 taatagtcct atctgtccca caacagacag gagtgctggg ctgcacctac tggcaacaaa 541 cacagcaacc cttgactgaa gaaaggtcca tgccacaatc cccttattct gtaagccact 601 aattttgtcc tctctcctcc acctttcact gaggaacgag ctcttggaag gacagggaca 661 cccgcctagt agctgagcca gccacatcag tcctggagag caggtggagg gcagatgctg 721 tgatcatccc agaagagagg acacagttgg aggcagatgc atggtctcta ctttcagcta 781 ccctcaatgc agcctggtcc ccagaggcct gaagagcgcc ttgtttatgt ggtgacctca 841 agaggggctg ctcctgcacc aaggctatgt gtgcatgcta acacagtaac cgtcatatac 901 tcaaaatgtc agctctaaga actggagatg aggagctgca agccactcta cagttatcaa 961 aggcacagct gagggggttt gtgctgacca agctggttgc ctggtgtttg gattgggact 1021 tatttacttt ggaaaatatg cagcaacagc ccagcaccaa agttcacatc aaaatcccac 1081 tgatgacctt ggctgctttc atctctgaag cgccacttct cagaaacaca gaggtaagtt 1141 gggtttctaa tgtttctgct gattataaat tatttttggt gtttacggat aggcaactgg 1201 ttcatttttc tagcaaacta agaattcaga agctt // LOCUS HUMLSERP2 408 bp ds-DNA PRI 31-MAY-1990 DEFINITION Human leuserpin 2 (hLS2) gene, exon 2. ACCESSION M33660 J03921 KEYWORDS leuserpin 2; serine protease inhibitor. SEGMENT 2 of 2 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 408) AUTHORS Ragg,H. and Preibisch,G. TITLE Structure and expression of the gene coding for the human serpin hLS2 JOURNAL J. Biol. Chem. 263, 12129-12134 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Ragg, 17-JAN-1988 FEATURES from to/span description pept 52 > 408 leuserpin 2 (hLS2) precursor, exon 2 (first expressed exon) sigp 52 108 leuserpin 2 (hLS2) signal peptide matp 109 > 408 leuserpin 2 (hLS2) IVS < 1 36 hLS2 intron A BASE COUNT 107 a 111 c 98 g 92 t ORIGIN About 5.2 kb after segment 1. 1 aggccgcctt tcactgtgtt ctgttttccc tcccagcttt agctccgcaa aatgaaacac 61 tcattaaacg cacttctcat tttcctcatc ataacatctg cgtggggtgg gagcaaaggc 121 ccgctggatc agctagagaa aggaggggaa actgctcagt ctgcagatcc ccagtgggag 181 cagttaaata acaaaaacct gagcatgcct cttctccctg ccgacttcca caaggaaaac 241 accgtcacca acgactggat tccagagggg gaggaggacg acgactatct ggacctggag 301 aagatattca gtgaagacga cgactacatc gacatcgtcg acagtctgtc agtttccccg 361 acagactctg atgtgagtgc tgggaacatc ctccagcttt ttcatggc // LOCUS MUSIGHAAN 375 bp ss-mRNA ROD 31-MAY-1990 DEFINITION Mouse Ig rearranged gamma-chain mRNA V-D-J4-region, hybridoma A6.1, partial cds. ACCESSION M33558 KEYWORDS N-region; diversity exon; gamma-immunoglobulin; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain BW) hybridoma A6.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 375) AUTHORS Tsao,B.P., Ebling,F.M., Roman,C., Panosian-Sahakian,N., Calame,K. and Hahn,B.H. TITLE Structural characteristics of the variable regions of immunoglobulin genes encoding a pathogenic autoantibody in murine lupus JOURNAL J. Clin. Invest. 85, 530-540 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 375 Ig gamma-chain V-D-J4-region (AA at 1) recomb 300 301 V-region DNA end (3' end put.)/D-region DNA start (5' end put.) recomb 324 325 D-region DNA end/N-region DNA start recomb 327 328 N-region DNA end/J-region DNA start BASE COUNT 93 a 90 c 97 g 95 t ORIGIN 1 gaggtccagc tgcagcagtc tggacctgag ctggtgaagc ctggggcttc agtgaagatg 61 tcctgcaagg cttctggtta ctcctttact ggctacttta tgaactgggt gaagcagagc 121 catggaaaga gccttgagtg gattggacgt attaatcctt acaatggtga tactttctac 181 aaccagaagt tcaagggcaa ggccacattg actgtagaca aatcctctag cacagcccac 241 atggagctcc ggagcctgac atctgaggac tctgcactct attattgtgc aagagattcc 301 ccttattact acggtagtag ctacgggtat gctatggact actggggtca aggaacctca 361 gtcaccgtct cctca // LOCUS MUSIGKABE 321 bp ss-mRNA ROD 31-MAY-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J2-region, hybridoma A6.1, partial cds. ACCESSION M33559 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BW) hybridoma A6.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Tsao,B.P., Ebling,F.M., Roman,C., Panosian-Sahakian,N., Calame,K. and Hahn,B.H. TITLE Structural characteristics of the variable regions of immunoglobulin genes encoding a pathogenic autoantibody in murine lupus JOURNAL J. Clin. Invest. 85, 530-540 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain V-J2-region (AA at 1) recomb 291 292 V-region DNA end/J-region DNA start BASE COUNT 82 a 74 c 76 g 89 t ORIGIN 1 gacatccaga tgattcagtc tccatcgtcc atgtttgcct ctctgggaga cagagtcagt 61 ctctcttgtc gggctagtca gggcattaga ggtaatttag actggtatca gcagaaacca 121 ggtggaacta ttaaactcct gatctactcc acatccaatt taaattctgg tgtcccatca 181 aggttcagtg gcagtgggtc tgggtcagat tattctctca ccatcagcag cctagagtct 241 gaagattttg cagactatta ctgtctacag cgtaatgcgt atcctctcac gttcggaggg 301 gggaccaagc tggaaataaa a // LOCUS RATA2UGLBB 852 bp ss-mRNA ROD 31-MAY-1990 DEFINITION Rat alpha-2u-globulin (L type) mRNA, complete cds., clone pA110. ACCESSION M26836 KEYWORDS alpha-2u-globulin; globulin. SOURCE Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA110. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 852) AUTHORS Ichiyoshi,Y., Endo,H. and Yamamoto,M. TITLE Length polymorphism in the 3' noncoding region of rat hepatic alpha-2u-globulin mRNAs JOURNAL Biochim. Biophys. Acta 910, 43-51 (1987) STANDARD simple staff_review FEATURES from to/span description pept 4 549 alpha-2u-globulin mRNA < 1 852 alpha-2u-globulin mRNA BASE COUNT 246 a 186 c 198 g 222 t ORIGIN 1 aacatgaagc tgttgctgct gctgctgtgt ctgggcctga cactggtctg tggccatgca 61 gaagaagcta gttccacaag agggaacctc gatgtggaca agctcaatgg ggattggttt 121 tctattgtcg tggcctctga caaaagagaa aagatagaag agaatgccag catgagagtt 181 tttatgcagc acatcgatgt cttggagaat tccttaggct tcaagttccg tattaaggaa 241 aatggagagt gcagggaact atatttggtt gcctacaaaa cgccagagga tggcgaatat 301 tttgttgagt atgacggagg gaatacattt actatactta agacagacta tgacagatat 361 gtcatgtttc atctcattaa tttcaagaac ggggaaacct tccaggcgat ggtgctctac 421 ggcagaacaa aggatctgag ttcagacatc aaggaaaagt ttgcaaaact atgtgaggcg 481 catggaatca ctagggacaa tatcattgat ctaaccaaga ctgatcactg tctccaggcc 541 cgaggatgaa gaaaggcctg agactccagg tgggcaatct ccagtgagag caagtgctga 601 gtggagactt ctcaccagga ctctagcatc accatttcct gtccatggag catcctgaga 661 caaattctgc gatctgattt ccatcctctg tcacagaaaa gtgcaatcct ggtctctcca 721 gcatcttccc tagttaccca ggacaacaca tcgagaatta aaagctttct taaatttctc 781 ttggccccac ccatgatcat tccgcacaaa tttcttgctc ttgcagttca ataaatgatt 841 acccttgcac tt // LOCUS RATA2UGLBC 838 bp ss-mRNA ROD 31-MAY-1990 DEFINITION Rat alpha-2u-globulin (S type) mRNA, complete cds., clone pA120. ACCESSION M26837 KEYWORDS alpha-2u-globulin; globulin. SOURCE Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA120. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 838) AUTHORS Ichiyoshi,Y., Endo,H. and Yamamoto,M. TITLE Length polymorphism in the 3' noncoding region of rat hepatic alpha-2u-globulin mRNAs JOURNAL Biochim. Biophys. Acta 910, 43-51 (1987) STANDARD simple staff_review FEATURES from to/span description pept 15 560 alpha-2u-globulin mRNA < 1 838 alpha-2u-globulin mRNA BASE COUNT 240 a 186 c 189 g 223 t ORIGIN 1 tattccctac caacatgaag ctgttgctgc tgctgctgtg tctgggcctg acactggtct 61 gtggccatgc agaagaagct agttccacaa gagggaacct cgatgtggct aagctcaatg 121 gggattggtt ttctattgtc gtggcctcta acaaaagaga aaagatagaa gagaatggca 181 gcatgagagt ttttatgcag cacatcgatg tcttggagaa ttccttaggc ttcaagttcc 241 gtattaagga aaatggagag tgcagggaac tatatttggt tgcctacaaa acgccagagg 301 atggcgaata ttttgttgag tatgacggag ggaatacatt tactatactt aagacagact 361 atgacagata tgtcatgttt catctcatta atttcaagaa cggggaaacc ttccagctga 421 tggtgctcta cggcagaaca aaggatctga gttcagacat caaggaaaag tttgcaaaac 481 tatgtgaggc gcatggaatc actagggaca atatcattga tctaaccaag actgatcgct 541 gtctccaggc ccgaggatga agaaaggcct gagcctccag tgctgagtgg agacttctca 601 ccaggactct agcatcacca tttcctgtcc atggagcatc ctgagacaaa ttctgcgatc 661 tgatttccat cctctgtcac agaaaagtgc aatcctggtc tctccagcat cttccctagt 721 tacccaggac aacacatcga gaattaaaag ctttcttaaa tttctcttgg ccccacccat 781 gatcattccg cacaaatatc ttgctcttgc agttcaataa atgattaccc ttgcactt // LOCUS RATA2UGLBD 835 bp ss-mRNA ROD 31-MAY-1990 DEFINITION Rat alpha-2u-globulin (S type) mRNA, complete cds., clone pA108. ACCESSION M26838 KEYWORDS alpha-2u-globulin; globulin. SOURCE Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA108. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 835) AUTHORS Ichiyoshi,Y., Endo,H. and Yamamoto,M. TITLE Length polymorphism in the 3' noncoding region of rat hepatic alpha-2u-globulin mRNAs JOURNAL Biochim. Biophys. Acta 910, 43-51 (1987) STANDARD simple staff_review FEATURES from to/span description pept 12 557 alpha-2u-globulin mRNA < 1 835 alpha-2u-globulin mRNA BASE COUNT 240 a 185 c 190 g 220 t ORIGIN 1 tccctaccaa catgaagctg ttgctgctgc tgctgtgtct gggcctgaca ctggtctgtg 61 gccatgcaga agaagctagt tccacaagag ggaacctcga tgtggacaag ctcaatgggg 121 attggttttc tattgtcgtg gcctctgaca aaagagaaaa gatagaagag aatggcagca 181 tgagagtttt tatgcagcac atcgatgtct tggagaattc cttaggcttc aagttccgta 241 ttaaggaaaa tggagagtgc agggaactat atttggttgc ctacaaaacg ccagaggatg 301 gcgaatattt tgttgagtat gacggaggga atacatttac tatacttaag acagactatg 361 acagatatgt catgtttcat ctcattaatt tcaagaacgg ggaaaccttc caggcgatgg 421 tgctctacgg cagaacaaag gatctgagtt cagacatcaa ggaaaagttt gcaaaactat 481 gtgaggcgca tggaatcact agggacaata tcattgatct aaccaagact gatcactgtc 541 tccaggcccg aggatgaaga aaggcctgag actccagtgc tgagtggaga cttctcacca 601 ggactctagc atcaccattt cctgtccatg gagcatcctg agacaaattc tgcgatctga 661 tttccatcct ctgtcacaga aaagtgcaat cctggtctct ccagcatctt ccctagttac 721 ccaggacaac acatcgagaa ttaaaagctt tcttaaattt ctcttggccc cacccatgat 781 cattccgcac aaatttcttg ctcttgcagt tcaataaatg attacccttg cactt // LOCUS RATA2UGLBE 867 bp ss-mRNA ROD 31-MAY-1990 DEFINITION Rat alpha-2u-globulin (L type) mRNA, complete cds., clone pA8. ACCESSION M26835 KEYWORDS alpha-2u-globulin; globulin. SOURCE Rat (strain Donryu) adult male liver, cDNA to mRNA, clone pA8. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 867) AUTHORS Ichiyoshi,Y., Endo,H. and Yamamoto,M. TITLE Length polymorphism in the 3' noncoding region of rat hepatic alpha-2u-globulin mRNAs JOURNAL Biochim. Biophys. Acta 910, 43-51 (1987) STANDARD simple staff_review FEATURES from to/span description pept 19 564 alpha-2u-globulin mRNA < 1 867 alpha-2u-globulin mRNA BASE COUNT 248 a 192 c 198 g 229 t ORIGIN 1 attctattcc ctaccaacat gaagctgttg ctgctgctgc tgtgtctggg cctgacactg 61 gtctgtggcc atgcagaaga agctagttcc acaagaggga acctcgatgt ggctaagctc 121 aatggggatt ggttttctat tgtcgtggcc tctaacaaaa gagaaaagat agaagagaat 181 ggcagcatga gagtttttat gcagcacatc gatgtcttgg agaattcctt aggcttcaag 241 ttccgtatta aggaaaatgg agagtgcagg gaactatatt tggttgccta caaaacgcca 301 gaggatggcg aatattttgt tgagtatgac ggagggaata catttactat acttaagaca 361 gactatgaca gatatgtcat gtttcatctc attaatttca agaacgggga aaccttccag 421 ctgatggtgc tctacggcag aacaaaggat ctgagttcag acatcaagga aaagtttgca 481 aaactatgtg aggcgcatgg aatcactagg gacaatatca ttgatctaac caagactgat 541 cgctgtctcc aggcccgagg atgaagaaag gcctgagcct ccaggtgggc aatctccagt 601 gagagcaagt gctgagtgga gacttctcac caggactcta gcatcaccat ttcctgtcca 661 tggagcatcc tgagacaaat tctgcgatct gatttccatc ctctgtcaca gaaaagtgca 721 atcctggtct ctccagcatc ttccctagtt acccaggaca acacatcgag aattaaaagc 781 tttcttaaat ttctcttggc cccacccatg atcattccgc acaaatatct tgctcttgca 841 gttcaataaa tgattaccct tgcactt // LOCUS MZENAR1 386 bp ss-mRNA PLN 31-MAY-1990 DEFINITION Maize glyceraldehyde-3-phosphate dehydrogenase mRNA, 5' end. ACCESSION M31481 M31482 KEYWORDS glyceraldehyde-3-phosphate dehydrogenase. SEGMENT 1 of 2 SOURCE Maize leaf (strain L. W64xW128E), cDNA to mRNA, clone Zm6. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 386) AUTHORS Gowri,G. and Campbell,W.H. TITLE cDNA clones for corn leaf NADH:nitrate reductase and chloroplast NAD(P)+: glyceraldehyde-3-phosphate dehydrogenase JOURNAL Plant Physiol. 90, 792-798 (1989) STANDARD simple staff_review FEATURES from to/span description pept 12 > 386 glyceraldehyde-3-phosphate dehydrogenase precursor sigp 12 209 glyceraldehyde-3-phosphate dehydrogenase signal peptide matp 210 > 386 glyceraldehyde-3-phosphate dehydrogenase mRNA < 1 > 386 glyceraldehyde-3-phosphate dehydrogenase mRNA BASE COUNT 57 a 143 c 125 g 61 t ORIGIN 1 ggtccgtggc catggcgtcg tccatgctct ccgctaccac cgtgccactc cagcaggggg 61 gcggcctgtc cgagttctcc gggctcagga gctccgcgtc gctgcccatg cgccggaatg 121 ccacctccga cgacttcatg tccgccgtct ccttcaggac ccacgcggtc ggtacgagcg 181 gcgggccgcg gcgggcgccg acggaggcaa agctgaaggt ggccatcaac gggttcggcc 241 gcatcgggcg caacttcctg cggtgctggc acgggcgcgg cgacgcgtcg cccctcgacg 301 tcatcgccat caacgacacc ggaggcgtca agcaggcgtc ccacctgctc aagtacgact 361 ccacgctcgg catcttcgat gccgat // LOCUS MZENAR2 456 bp ss-mRNA PLN 31-MAY-1990 DEFINITION Maize glyceraldehyde-3-phosphate dehydrogenase, 3' end. ACCESSION M31483 KEYWORDS glyceraldehyde-3-phosphate dehydrogenase. SEGMENT 2 of 2 SOURCE Maize leaf (strain L. W64xW1282E), cDNA to mRNA, clone Zm6. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 456) AUTHORS Gowri,G. and Campbell,W.H. TITLE cDNA clones for corn leaf NADH:nitrate reductase and chloroplast NAD(P)+: glyceraldehyde-3-phosphate dehydrogenase JOURNAL Plant Physiol. 90, 792-798 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 312 glyceraldehyde-3-phosphate dehydrogenase precursor (AA at 1) matp < 1 309 glyceraldehyde-3-phosphate dehydrogenase mRNA < 1 456 glyceraldehyde-3-phosphate dehydrogenase mRNA BASE COUNT 87 a 150 c 122 g 97 t ORIGIN About 525 bp after segment 1. 1 cgggtgccca ccccgaacgt ctccgtcgtc gacctcgtcg tgcaggtctc caagaagacc 61 ctcgccgagg aggtgaacca ggcgttccgc gacgccgcgg ccaacgagct cacgggcatc 121 ctcgaggtct gcgacgtgcc gctcgtgtcc gtcgacttca ggtgctccga cgtctcctcc 181 accatcgacg cctccctcac catggtcatg ggcgacgaca tggtcaaggt catctcctgg 241 tacgacaacg agtggggcta ctcgcagagg gtcgtcgacc tcgctgacat ctgcgccaac 301 cagtggaagt gaagcgttgc attgtaatag cactttgtaa ctcctcaccc tccacagggc 361 tttgggattc accccgaaat gactcacttt cggtgaaatt cacccgtttc ggcgaggtcg 421 tccgaatata ccggccaaat gtttccattg gatttc //
GenBank-Updates@genbank.bio.net (06/01/90)
LOCUS ECONANA 1243 bp ds-DNA BCT 01-JUN-1990 DEFINITION Escherichia coli nanA gene encoding N-acetylneuraminate lyase, complete cds. ACCESSION M20207 D00067 N00067 KEYWORDS N-acetylneuraminate lyase; NALase; nanA gene; npl gene. SOURCE E.coli JE1011 DNA, 1.2 kb HindIII-EcoRI fragment DNA containing nanA gene, which is cloned in pNL4, for [1]. E.coli C600 DNA, clone pMK6, for [2]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1243) AUTHORS Kawakami,B., Kudo,T., Narahashi,Y. and Horikoshi,K. TITLE Nucleotide sequence of the N-acetylneuraminate lyase gene of Escherichia coli JOURNAL Agric. Biol. Chem. 50, 2155-2158 (1986) STANDARD simple automatic FEATURES from to/span description pept 93 986 N-acetylneuraminate lyase BASE COUNT 300 a 309 c 322 g 312 t ORIGIN 1 bp upstream of HindIII site. 1 aagctttctg tatggggtgt tgcttaattg atctggtata acaggtataa aggtatatcg 61 tttatcagac aagcatcact tcagaggtat ttatggcaac gaatttacgt ggcgtaatgg 121 ctgcactcct gactcctttt gaccaacaac aagcactgga taaagcgagt ctgcgtcgcc 181 tggttcagtt caatattcag cagggcatcg acggtttata cgtgggtggt tcgaccggcg 241 aggcctttgt acaaagcctt tccgagcgtg aacaggtact ggaaatcgtc gccgaagagg 301 cgaaaggtaa gattaaactc atcgcccacg tcggttgcgt cagcaccgcc gaaagccaac 361 aacttgcggc atcggctaaa cgttatggct tcgatgccgt ctccgccgtc acgccgttct 421 actatccttt cagctttgaa gaacactgcg atcactatcg ggcaattatt gattcggcgg 481 atggtttgcc gatggtggtg tacaacattc cagccctgag tggggtaaaa ctgaccctgg 541 atcagatcaa cacacttgtt acattgcctg gcgtaggtgc gctgaaacag acctctggcg 601 atctctatca gatggagcag atccgtcgtg aacatcctga tcttgtgctc tataacggtt 661 acgacgaaat cttcgcctct ggtctgctgg cgggcgctga tggtggtatc ggcagtacct 721 acaacatcat gggctggcgc tatcagggga tcgttaaggc gctgaaagaa ggcgatatcc 781 agaccgcgca gaaactgcaa actgaatgca ataaagtcat tgatttactg atcaaaacgg 841 gcgtattccg cggcctgaaa actgtcctcc attatatgga tgtcgtttct gtgccgctgt 901 gccgcaaacc gtttggaccg gtagatgaaa aatatctgcc agaactgaag gcgctggccc 961 agcagttgat gcaagagcgc gggtgagttg tttcccctcg ctcgccccta ccggtgaggg 1021 gaaataaacg catctgtacc ctacaatttt cataccaaag cgtgtgggca tcgcccaccg 1081 cgggagactc acaatgagta ctacaaccca gaatatcccg tggtatcgcc atctcaaccg 1141 tgcacaatgg cgcgcatttt ccgctgcctg gttgggatat ctgcttgacg gttttgattt 1201 cgttttaatc gccctggtac tcaccgaagt acaaggtgaa ttc // LOCUS MACEPO 1462 bp ss-mRNA PRI 01-JUN-1990 DEFINITION Monkey (Cynomolgus) erythropoietin mRNA, complete cds. ACCESSION M18189 M15818 M15819 M18188 KEYWORDS erythropoietin. SOURCE Monkey cDNA to mRNA. ORGANISM Macaca fascicularis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Cercopithecidae; Cercopithecinae. REFERENCE 1 (bases 1 to 1462) AUTHORS Lin,F.-K., Lin,C.-H., Lai,P.-H., Browne,J.K., Egrie,J.C., Smalling,R., Fox,G.M., Chen,K.K., Castro,M. and Suggs,S. TITLE Monkey erythropoietin gene: Cloning, expression and comparison with the human erythropoietin gene JOURNAL Gene 44, 201-209 (1986) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.-K.Lin, 04-AUG-1987. FEATURES from to/span description pept 217 795 erythropoietin precursor sigp 217 288 erythropoietin signal peptide matp 289 792 erythropoietin BASE COUNT 297 a 455 c 433 g 277 t ORIGIN Unreported. 1 agagtccctg ggccaccccg gtcgctcgct gcgtgcgtcc cggagccgga ccggggccac 61 cgcgccggct ctgcaccgat cccgcgcccc ctggacagcc gccctctcct ccaggcccgt 121 ggggctggcc ctgccccgct gaacttcccg ggatgaggac tcccggtgtg gtcaccgcgc 181 gcctaggtcg ctgagggacc ccggccaggc gcggagatgg gggtgcacga atgtcctgcc 241 tggctgtggc ttctcctgtc tctcgtgtcg ctccctctgg gcctcccagt cccgggcgcc 301 ccaccacgcc tcatctgtga cagccgagtc ctggagaggt acctcttgga ggccaaggag 361 gccgagaatg tcacgatggg ctgttccgaa agctgcagct tgaatgagaa tatcaccgtc 421 ccagacacca aagttaactt ctatgcctgg aagaggatgg aggtcgggca gcaggctgta 481 gaagtctggc agggcctggc cctgctctca gaagctgtcc tgcggggcca ggccgtgttg 541 gccaactctt cccagccttt cgagcccctg cagctgcaca tggataaagc catcagtggc 601 cttcgcagca tcaccactct gcttcgggcg ctgggagccc aggaagccat ctccctccca 661 gatgcggcct cggctgctcc actccgaacc atcactgctg acactttctg caaactcttc 721 cgagtctact ccaatttcct ccggggaaag ctgaagctgt acacggggga ggcctgcagg 781 agaggggaca gatgaccagg tgcgtccagc tgggcacatc caccacctcc ctcaccaaca 841 ctgcctgtgc cacaccctcc ctcaccactc ccgaacccca tcgaggggct ctcagctaag 901 cgccagcctg tcccatggac actccagtgc cagcaatgac atctcagggg ccagaggaac 961 tgtccagagc acaactctga gatctaagga tgtcgcaggg ccaacttgag ggcccagagc 1021 aggaagcatt cagagagcag ctttaaactc aggagcagag acaatgctgg gaaaacacct 1081 gagctcactc ggccacctgc aaaatttgat gccaggacac gctttggagg caatttacct 1141 gtttttgcac ctaccatcag ggacaggatg acctggagaa cttaggtggc aagctgtgac 1201 ttctccaggc ctcacgggca ctcccttggt ggcaagagcc cccttgacac tgagagaata 1261 ttttgcaatc tgcagcagga aaaattacgg acaggttttg gaggttggag ggtacttgac 1321 aggtgtgtgg ggaagcaggg cggtaggggt ggagctggga tgcgagtgag aaccgtgaag 1381 acaggatggg ggctggcctc tggttctcgt ggggtccaag ctttgtgtat tcttcaatct 1441 cactggcaag aactgaaacc ac // LOCUS AFACFXP1 1751 bp ds-DNA BCT 01-JUN-1990 DEFINITION A.eutrophus phosphoribulokinase P (cfxP) gene, complete cds. ACCESSION M33562 KEYWORDS phosphoribulokinase. SOURCE A.eutrophus (strain H16) DNA. ORGANISM Alcaligenes eutrophus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 1751) AUTHORS Kossmann,J., Klintworth,R. and Bowien,B. TITLE Sequence analysis of the chromosomal and plasmid genes encoding phosphoribulokinase from Alcaligenes eutrophus JOURNAL Gene 85, 247-252 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 621 phosphoribulokinase F(AA at 1) pept 633 1511 phosphoribulokinase P BASE COUNT 310 a 614 c 542 g 285 t ORIGIN 1 ctgcagcccg gcagcgccca ggtggcggcc ggctacgcgc tctacggtcc caccaccatg 61 ctggtgctga ccgtgggcaa tggcgtcaac ggcttcacgc tcgatcccaa cctgggcgag 121 ttcttcctga cgcaccccca cctgcgggtg ccgcccgata cccaggaatt tgccatcaac 181 gcgtcgaaca gccgcttctg ggaagcgccg gtgcagcgct acatcggcga gtgcatggct 241 ggcaagagcg ggccgcgcgg caaggatttc aatatgcgct ggatcgcatc gatggtggcc 301 gaggcgcacc gcatcctgat gcgcggcggt gtcttcatgt acccgcgcga caccaaggat 361 cccgccaagc cgggccgcct tcgcctgctg tacgaggcca acccgatcgc cttcctgatg 421 gagcaggccg gcgggcgcgc cagcacgggc cggcagacgc tgatgtcggt ggcgccgggc 481 gcgctgcacc agcgtatcgg cgtgatcttc ggctcgcgca atgaagtgga acggatcgag 541 ggctaccaca ccaaccagac cgaccccgac cttccgagtc ccctgttcaa cgagcgcagc 601 ctgttccgcg cgtccgcctg aggtgcctgg ccatgtcaga acgttatccc attatcgcta 661 tcaccgggtc ctcgggcgcc ggtaccacgt cggtgacccg taccttcgag aacatcttct 721 gccgtgaagg cgtgaagtcg gtcgtgatcg aaggcgacag cttccaccgc tacgaccgtg 781 ccgagatgaa ggtcaagatg gccgaggccg aacgcaccgg caatatgaac ttcagccact 841 tcggcgcgga gaacaacctg ttcggcgacc tggagagcct gttccgctcc tacgcggaat 901 ccggcacggg catgcgccgg cgctatctgc acagcacgga ggaggccgcg ccgttcgggc 961 agcagcccgg caccttcacc gcgtgggagc cattgccggc cgataccgac ctgctgttct 1021 acgaaggcct gcacggcggc gtggtcaccg acgaggtcaa tgtcgcccag tatcccaacc 1081 tgctgatcgg cgtggtgccc gtcatcaacc tggagtggat tcagaagctc tggcgcgaca 1141 agaaacagcg cggctactcg accgaggccg tgaccgacac catcctgcgc cgcatgccgg 1201 actacgtgaa ctacatctgc ccgcagttct cgcgcacgca tgtgaacttc cagcgggtgc 1261 cgtgcgtgga cacgtccaac ccctttatct cgcgcgaaat ccccgcgccc gatgaaagca 1321 tggtggtgat ccgctttgcc aacccgaagg ggatcgactt ccagtacctg ctgagcatga 1381 tccacgactc cttcatgtcg cgcgccaaca ccatcgtggt gccgggcggc aagatggaac 1441 tggccatgca gctgatcttc acgcccttcg tgctgcgcat gatggagcgc cgcaagcgcg 1501 ccgcgctgta aggagccgag atgaacgcac ccgaacgcat cgaccccgca gcgcgttgcg 1561 ccaacgcgct gcgcttcctg gccgccgacg cggtggagtt ggcccgctcc ggccaccccg 1621 gcgcccccat gggcatggcc gagatggccg aggtcgtgtg gcggcgccac ctgcgccaca 1681 acccggccaa cccggcctgg cccgaccgtg accgcttcgt gctgtccaac ggccatgcct 1741 ccatgctgca g // LOCUS AFACFXP2 1641 bp ds-DNA BCT 01-JUN-1990 DEFINITION A.eutrophus phosphoribulokinase P (cfxF) gene, complete cds. ACCESSION M33563 KEYWORDS phosphoribulokinase. SOURCE A.eutrophus (strain H16) DNA. ORGANISM Alcaligenes eutrophus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 1641) AUTHORS Kossmann,J., Klintworth,R. and Bowien,B. TITLE Sequence analysis of the chromosomal and plasmid genes encoding phosphoribulokinase from Alcaligenes eutrophus JOURNAL Gene 85, 247-252 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 480 phosphoribulokinase F pept 492 1370 phosphoribulokinase P BASE COUNT 306 a 555 c 505 g 275 t ORIGIN 1 ctgcaggtgc cggccgatac ccaggaattt gccatcaatg cgtcgaacag ccgcttctgg 61 gaagcgccgg tgcagcgcta catcgccgag tgcatggccg gcaagagcgg gccgcgcggc 121 aaggatttca atatgcgctg gatcgcgtcg atggtggccg aggcgcaccg catcctgatg 181 cgtggcggcg tcttcatgta cccgcgcgac tccaaggatc ccgccaagcc gggccgcctg 241 cgcctgctgt acgaggccaa tccgatcgcc ttcctgatgg agcaggctgg cgggcgcgcc 301 agcacgggcc ggcagacgct gatgtcggtg gcgccgggtg cgctgcacca gcgcattggc 361 gtgatcttcg gctcgcgcaa tgaagtggaa cggatcgagg gctaccacac cgaccagacc 421 gatcccgacc ttccgagtcc cctgttcaac gagcgcagcc tgttccgcgc gtctgcctga 481 ggtgcctggc catgtcagaa cgttatccca ttatcgccat caccggctcc tcgggcgccg 541 gtaccacgtc ggtgacccgt accttcgaga acatcttccg ccgcgagggc gtgaagtcag 601 tggtgatcga aggcgacagc ttccaccgct atgaccgcgc cgagatgaag gtcaagatgg 661 ccgaggccga gcgcaccggc aatatgaact tcagccactt cggcgaggag aacaacctgt 721 tcggcgagct ggagaacctg ttccgctcct atgccgaaac cggcacgggc atgcaccggc 781 actacctgca tagcccggag gaagccgcgc cgttcgggca ggaacccggc accttcacgc 841 agtgggagcc gctgccggcc gataccgatc tgctgttcta cgagggcctg cacggcggcg 901 tggtcaccga ttcggtcaat gtcgcccagt atcccaacct gctgatcggc gtggtccctg 961 tcatcaacct ggaatggatc cagaagctct ggcgcgacaa gaagcagcgc ggctattcga 1021 ccgaggccgt gaccgacacc atcctgcgcc gcatgccgga ctacgtgaat tacatctgtc 1081 cgcagttctc gcgcacgcat gtgaacttcc agcgcgtgcc gtgcgtggat acgtccaacc 1141 cctttatctc gcgcgaaatc cccgcgcccg atgaaagcat ggtggtgatc cgctttgcca 1201 acccgaaggg gatcgacttc cagtacctgc tgagcatgat ccacgactcc ttcatgtcgc 1261 gcgccaacac catcgtggtg cccggcggca agatggagct ggccatgcag ctgatcttca 1321 cgcccttcgt gctgcgcatg atggagcgcc gcaagcgcgc cgcgcagtaa ggactcgacc 1381 gtgaaacgcc gactcgcgag gatctcctga atgaacgcac cagaacgcat cgattccgca 1441 gcgcgctgtg ccaacgcgct gcgcttcctg gccgccgacg cggtggagca ggccaaatcg 1501 ggccaccccg gcgcacccat gggcatggcc gaaatggccg aagtgctgtg gcggcgccac 1561 ctgcgacaca acccggccaa cccggcctgg cccgaccgtg accgcttcgt gctgtccaac 1621 ggccatgcgt ccatgctgca g // LOCUS BIFLLD 1767 bp ds-DNA BCT 01-JUN-1990 DEFINITION B.longum L-lactate dehydrogenase (ldh) gene, complete cds. ACCESSION M33585 KEYWORDS L-lactate dehydrogenase. SOURCE B.longum (strain aM101-2) DNA. ORGANISM Bifidobacterium longum Prokaryota; Bacteria; Irregular, nonsporing, gram-positive rods. REFERENCE 1 (bases 1 to 1767) AUTHORS Minowa,T., Iwata,S., Sakai,H., Masaki,H. and Ohta,T. TITLE Sequence and characteristics of the Bifidobacterium longum gene encoding L-lactate dehydrogenase and the primary structure of the enzyme: A new feature of the allosteric site JOURNAL Gene 85, 161-168 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 555 1517 L-lactate dehydrogenase (EC 1.1.1.27) mRNA 493 > 1517 L-lactate dehydrogenase mRNA BASE COUNT 373 a 555 c 488 g 351 t ORIGIN 1 gtcgacgcgg tcaatgacgt gttggcggac atcgaaggca cggcctcgat tccgcgtatt 61 ctcgtattca acaaggccga tcaggcggac gaggcgactc gtgaacgact cgccgcgctg 121 cagccagatg cgttcatcgt ctccgcctat accggtgagg gattggacga gctgcgtacc 181 gcggtcgaaa gtctgctgcc ggtcccgcat gtgcatgtca acgctctgct gccgtatacc 241 gctggctccc tgatctctcg tgtacgcgaa tacggcaagg tagacaaggt ggagtaccgc 301 gatgatggca tacagcttga agcggacgtt gatgcccatc ttgcgctcag gtggtcgaac 361 agtccattga ctaacgtgat aaacatcaca gtatattcgt gagcgctaac aaccgttgaa 421 aacattacca tacggttgtc aaacagggtg gtgtgccggt agcaaaacgt cttagcgggt 481 ttatagagtg aagacgttag ttacaaggcc tgccattcat cagcagaccg cctttgaaga 541 gaggttcatc catcatggcg gaaactaccg ttaagcccac gaagcttgct gttattggtg 601 ccggtgccgt tggctccacc ctcgccttcg ccgctgccca gcgtggcatc gctcgcgaga 661 tcgtgcttga agacatcgcc aaggagcgcg tggaagccga agtgctcgac atgcagcatg 721 gctccagctt ctacccgacc gtgtccatcg acggttccga cgatcctgag atctgccgcg 781 acgccgacat ggtcgtcatc accgctggtc cgcgtcagaa gccgggtcag tctcgtcttg 841 agctcgttgg cgctaccgtc aacatcctca aggccatcat gccgaacctg gtcaaggtgg 901 ctccgaacgc catctacatg ctcatcacca acccggtcga catcgctacc cacgtggctc 961 agaagctcac cggtctgccc gagaaccaga tcttcggttc cggcaccaac ctggactccg 1021 ctcgtctgcg cttcctgatt gcccagcaga ccggcgtcaa cgtcaagaac gtgcacgcct 1081 acatcgccgg cgagcacggc gactccgaag tcccgctgtg ggagtccgcc accatcggtg 1141 gcgtccccat gtgcgactgg accccgctgc ccggccacga tccgctcgac gccgacaagc 1201 gcgaggagat ccaccaggaa gtcaagaacg ccgcttacaa gatcatcaac ggtaagggtg 1261 ccaccaacta cgccatcggc atgtccggcg tcgacatcat cgaagccgtc ctgcacgaca 1321 ccaaccgcat tctgcccgtg agctccatgc tcaaggactt ccacggcatc tccgacatct 1381 gcatgtccgt gccgaccctc ctcaaccgtc agggcgtcaa caacaccatc aacaccccgg 1441 tctccgacaa ggagctcgcc gctctgaagc gctctgccga gacgctgaag gaaaccgccg 1501 cccagttcgg cttctgataa aaaatcgctg tacggagcgc ctttcgcacc gcagcaagac 1561 tcgacctacc tttgtaggcc ttcaccttgt gcggcacgaa aatcgcacac gtacagcgat 1621 tttttatacc ctgctgaatg ctcccgttgg gagcattttt tattcgtgga gttcgttgga 1681 ggagggggag ctatagccct cgggctcgag ctggaaggtg gtgtggggga ccgagaccgg 1741 gaagtgctcg cgcaggcaat cctgcag // LOCUS BOVP450SCC 1471 bp ds-DNA MAM 01-JUN-1990 DEFINITION Bovine P-450scc gene, exon 1. ACCESSION J05245 KEYWORDS P-450scc. SOURCE Bovine calf thymus DNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1471) AUTHORS Ahlgren,R., Simpson,E.R., Waterman,M.R. and Lund,J. TITLE Characterization of the promoter/regulatory region of the bovine CYP11A (P-450-scc) gene: Basal and cAMP-dependent expression JOURNAL J. Biol. Chem. 265, 3313-3319 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 948 / 1216 P-450scc, exon 1 pre-msg 897 > 1471 P-450scc mRNA and introns BASE COUNT 324 a 367 c 396 g 384 t ORIGIN 1 gcagatgtcc ctggtgattc ctgaaacagg ccctctgttt aaattcttca gcagttagag 61 ggaaggtcaa tttttcccaa ggcttttggg ctttgattgt tttcattttt aaattatctg 121 cattctaaag agatattttg ggtggcagat tttgctctcc tacaggactt tgtctaggag 181 acggctctca ggccagctcc gacgactgtt ccaaagaagt aagggaaagc tagggtttat 241 atcaatcttt ttttttgctg ggagaagggg gatgaacatg tagtcaaaca taaaaagatc 301 actgctaatc ccaaacaaca gacacctcaa gtgaatggtt ttagtgtttt tctatatatg 361 ttgtttagtc actaagtcct gtccgactct tttgcgactc catagactgt agcccaccaa 421 gctcctctgt ccatgggatt tttctaggca agaatactgg agtgggttgc catttccttc 481 tccctgggat cttcctaacc caaggactga acccttgtct cctgcattgc aggtggattt 541 tttaccgact gagccaccag ggaagttatg tgtgcaagaa tccggggtca tggaaatttt 601 cccttagata tacatcgtat ctagggacca gtacaatgca aatgcttcct gtttttcttc 661 atcctgaagt ctcctcaggg tgcattgagg gagggagtcc cctcaggtgg gtgaccacag 721 tggctgacgc ttgatgttgt agaactggaa tgatgggtta cattctttcg tttacagtac 781 tgagtctggg aggagctgtg tgggctggag tcagccggag gaggctgacc gccctgtcag 841 cttctcactt agccttgagc tggtgattat aagctgggtc ccagggtccc agggccagag 901 tcacctgctg cagtacgagc agagacagca gcagctgtgg gggcagcatg ctagcaaggg 961 ggcttcccct ccgttcagcc ctggtcaaag cctgcccacc catcctgagc tcagtggggg 1021 agggctgggg ccaccacagg gtgggcactg gagagggagc tggcatctcc acaaagaccc 1081 ctcgccccta cagtgagatc ccctcccctg gtgacaatgg ctggcttaac ctctaccatt 1141 tctggaggga gaagggctca cagagaatcc actttcgcca catcgagaac ttccagaagt 1201 atggccccat ttacaggtaa gcctggcagg aggattgggg ctggcgggat agggaagcct 1261 gtggtggccc cctccctgaa aggtctgccc tccccttcca ggctctggtt cacctctgac 1321 tttatttctt cctgcctggc ggtggcagga gtagagttaa tgcttcccag acagtgggtt 1381 cacttcccag ccctgaggcc tcaacagtcc ccgggctcta cacccttaga aactttgggg 1441 aggtggggag gcccaagaaa ataagccccg g // LOCUS BOVTGN 725 bp ds-DNA MAM 01-JUN-1990 DEFINITION Bovine minisatellite repeat DNA. ACCESSION J04756 KEYWORDS minisatellite sequence. SOURCE Bovine (strain Holstein-Friesian) blood DNA, clone lambda-gBJ32. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 725) AUTHORS Kashi,Y., Iraqi,F., Tikochinski,Y., Ruzinzki,B., Nave,A., Beckmann,J.S., Friedmann,A., Soller,M. and Gruenbaum,Y. TITLE (TG)n Uncovers a sex-specific hybridization pattern in cattle JOURNAL Genomics 7, 31-36 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Gruenbaum, 22-JAN-1990. FEATURES from to/span description rpt 311 703 minisatellite repeat BASE COUNT 134 a 131 c 226 g 234 t ORIGIN 1 aagcttcaca tcccgagaat tccctcccag cgctcgtggt cccacagagg gctctgctgg 61 acctgcctcg ggtcacatgg caggtctggg gaggacacac ctctccccgg cagagaaatg 121 gccagaagcc aggtctgctc cacacgtgcc ttctcccaat actctctaac tttaaaaaaa 181 ctgccaaaga aaaagcggta cgtaataaca agcgcacaga tacgtaattt ataatggctg 241 acacggttgg cagggaaatg tgttacgcag gaattatgtt tttatttatg tgtgtcctgt 301 tttggagaca gcataagtaa tcatgggtgt gtgtgtgtgt gtgtgtgtgt gttgcctgtc 361 tccagcgtaa gtaatcatgt gtgtgtgtgt gtgtgtgtgt tgcctgtctc cagcgtaagt 421 aatcgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gttgcctgtc tccagagtaa 481 gtaatcatgg gtgtgtgtgt gtgtgtgttg cctgtctcca gcataagtaa tcatgggtgt 541 gtgtgtgtgt gtgtgttgcc tgtctccagc ataagtaatc atgggtgtgt gtgtgtgtgt 601 gtgtgtgttg cctgtctcca gcataagtaa tcatgggggg gtgtgtgtgt gtgtgtgtgt 661 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tgcctgtctc cagggacttt tgtacagaga 721 agctt // LOCUS CELCDPK 2305 bp ss-mRNA INV 01-JUN-1990 DEFINITION C.elegans cAMP-dependent protein kinase subunit R mRNA, complete cds. ACCESSION J05220 KEYWORDS cAMP-dependent protein kinase. SOURCE C.elegans mRNA. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 2305) AUTHORS Lu,X., Gross,R.E., Bagchi,S. and Rubin,C.S. TITLE Cloning, structure, and expression of the gene for a novel regulatory subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 3293-3303 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 132 1262 cAMP-dependent protein kinase subunit R mRNA < 1 2305 cAMP-dependent protein kinase subunit R mRNA BASE COUNT 651 a 543 c 417 g 694 t ORIGIN 1 gcggcatccc ccgtcaaaag tctgttacct tccccgccaa gttttcatgt catttcaaac 61 gtgtgttttc tgctattttt ctcgcatttt tgtctctttt gtttgtttaa aacaagaaat 121 gactgaaaat tatgaacaat tattcaggtg acattgtttt tatgtcgggt ggaaacgaag 181 aggatcagtt ggcacaatgc caggcctatg tgcaacgaca caacattcag cagcttgtca 241 aagaagcaat tgttgttctg tgcatacaca aacccgacaa cccggttctc ttcttgaagg 301 atcattttga aaaactcaac gaacaacgag cacaggaagg aggaaatccg gacgctgctg 361 atgatgacga catcattgtg gagccaccaa agagatcagg tggacgcaga accggaatct 421 ctgccgaacc aatcaaggag gacgataccg agtacaagaa ggttgtgatt ccaaaggatg 481 atgctacaag aaggtcgctt gagtcagcca tgaggaagaa cttgctcttt gctcatttgg 541 aggaggatga gcaaaaaacc atgtacgatg ccatgttccc agttgagaag tctgctggag 601 agaccatcat cgaacaggga gaagaaggag ataacttcta tgttattgat aagggaaccg 661 ttgatgtcta cgtcaatcac gagtacgttc tcaccatcaa cgagggtgga tcatttggag 721 aacttgcttt gatctatgga actccacgtg ctgcaaccgt cattgccaag attgacgtga 781 aattgtgggc catcgaccgt ctcacctacc gtcgtatctt gatgggatca gtaacaaaga 841 agagaaagat gtatgacgag ttcctctcca aggttcaaat cttggccgat ctcgaccagt 901 gggagcgcgt taacgtggct gatgctcttg agagatgcga cttcgagcca ggaactcatg 961 tcgtcgagca aggacaaccc ggagacgaat tcttcatcat tctcgaggga gaagccaatg 1021 tgttgcagaa gagaagcgac gacgcaccat tcgacgttgt cggacatctt ggaatgtcgg 1081 actatttcgg agaaatcgct cttcttctcg accgcccacg tgccgccacc gttgttgcca 1141 aaacccatct caagtgcatc aagctggatc gtaaccgctt cgaacgtgtc atgggaccag 1201 ttcgtgagat cctgaagaga gacgtttcaa actacaactc atacgtcaaa ctgatgacct 1261 aatcggatta caatccgttt gccccgcatc ttcctcatca ccgtatcttc ttttccacct 1321 tttcgccaaa gtactcgtaa ctattttttg aaagtatccc aaaaattctt ccttagtttt 1381 tttttagtat tgtatttttt cctgtatttt tttggatttt ctattattct gcatactgtc 1441 tagacctgta accttcttct ctcaccaacc gtgacatctt caagggcccc ccccccaatc 1501 aattcgttcc ctatcttcat ttctagtcag tcgatgtcaa attaaattct aaatgtctaa 1561 atttctaact catagccaaa acaacattca ttgttttaac cagcccaaaa tcctagtcag 1621 aagtttgcag aaaactattt gaaactcctc gtgaaaaagc taaaaaaatg ttcaactaga 1681 atagttttag tttttattcc ctgtgtaaca aaaaaaacag ttttgcatgc ttttctatcc 1741 accctaaaat ttcaacattt tcagtagctt tacagctcaa tctttaccaa ccatttcctt 1801 caattcctat aggttaactc aaagaagaaa cgaacaccat tcttcattaa ttaacattaa 1861 gcagtgaatg ccaaatcttg tctttataat atgtaacata taattttatg agaaataaga 1921 aatgcgcatc cacaacaatc gtcccccccc cccctaccca gcacacattt ttttcttatt 1981 tgttccactt ctgaattctg aactccgtca cccttatcca gcattgcgaa gtctcttatc 2041 tacgccccct tccctcaaca tccaacttct tttttttttc atttttatat atttttcaca 2101 ttcattggaa atccgctgtc atgtcgccat ctcaatgttt tcttgtctct tttttttcat 2161 ttctccacca cattttccac tagatttctg ttatccaaaa tttttgtcgg tttcccccgc 2221 gcacacacac acaaaaaaca gtatgttata aactatctat aatgtaatct tagaggttta 2281 ttggactaat aaaatttgga agttg // LOCUS ECHOGL 1803 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.chrysanthemi oligogalacturonate lysase (ogl) gene, complete cds. ACCESSION M33583 KEYWORDS oligogalacturonate lysase. SOURCE E.chrysanthemi DNA. ORGANISM Erwinia chrysanthemi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1803) AUTHORS Reverchon,S., Huang,Y., Bourson,C. and Robert-Baudouy,J. TITLE Nucleotide sequences of the Erwinia chrysanthemi ogl and pelE genes negatively regulated by the kdgR gene product JOURNAL Gene 85, 125-134 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 271 1437 oligogalacturonate lysase (ogl) BASE COUNT 480 a 393 c 441 g 489 t ORIGIN 1 agtgaatcaa tacagccgga ggatatttat tctgacgttt ggtctggctg cttattgccg 61 aatctggatc aatatcaggc aaaataatct tgcgtcagcg caaacagtca gtgaaaaatg 121 aaagaatgtt ttataaaata aaaccacgat cacggaaaaa tgaaacgttg tttctacaat 181 accggtataa caaggcgatt ggccgggatg acgggtaatc tttcacctgg cggcgtctga 241 tttttcacga taagcgctaa ggatttacgg atggccaaag gtaaaaagct ttctttttcg 301 ttccatactt accaggattc agtcaccggc accgaagtgg tgcgtctcac tcctcccgat 361 gttatctgcc accgcaacta cttctatcag aagtgttttt ccaatgatgg cagcaagctg 421 ctttttggtg gcgcctttga cgggccgtgg aactactatt tgctggatct gaaaactcag 481 caggcgacgc aactgaccga aggtaccggc gacaatactt ttggtggttt tctgtcacca 541 gatgatgacg cgctttatta tgtaaagaac gttcgtaatt tgatgcgtgt tgacctgaat 601 acactggaag aaaccaatat ttatcaggtg ccggacgact gggtcgggta cggtacctgg 661 gttgccaact ccgactgcac caaaatggtc ggtatcgaga tcaagaaaga ggattggaaa 721 ccactgaccg actggaaaaa attccaggaa ttctacttta ccaatccatg ctgccgtttg 781 attcgtatcg atctgaaaac cggcgaagcc accaccattc tgaaggaaaa ccaatggctg 841 ggtcatccta tttaccgtcc gggtgacgat aatacggtgg ccttctgcca tgaaggtccg 901 catgacctgg ttgatgcgcg tatgtggttc atcaatgaag atggctccaa tatgcgtaag 961 gtaaaagagc atgcgccggg cgaaagctgc actcacgaat tctgggtgcc gaatggttct 1021 gcgctggcct acgtttccta tctgaaaggc agtactaacc gtttcatttg cagcgttgat 1081 ccggtaacgc tggaaaaccg tcagttgact gaaatgccgc cgtgttctca cctgatgagt 1141 aactacgatg gtacgttgat ggtgggagat gggtgtaatg cgccggtgga tgtgaaagat 1201 gacggtggct acaagactga aaacgatccg ttcctgtatg tgttcaatat gaagaccggg 1261 aaacatttcc aggtcgctca acacaacacc tcctgggaag tgctggaggg cgatcgtcag 1321 gtaacgcatc cacatccgtc ctttacgccg gatgacaagc acattctgtt tacgtctgat 1381 gtcgatggta agccggcgtt atatctggca aaagtgcctg attccgtctg gcaataactg 1441 tacctgcaac cgtgcccgcc agggcacgtt ttttcgcttt tacaataaga ttattaattc 1501 ggcagtgcta ttgtgatgga tgatattaat cggtctgtca cttatagtaa tgcccgattg 1561 ccatattcaa agcgttccct ttatactaaa acacgtgttc tatttttttt aaaacaaaaa 1621 atactgagta gggtaaccac aaaaatggct attgcagatt tagataaaca acccgattcc 1681 gtgtcatctg tcttaaaggt ctttggtatt ttgcaggcgt taggcgaaga gcgtgagatt 1741 ggcattactg aactttcaca gcgcgtgatg atgtctaaaa gtactgttta tcgtttcctg 1801 cag // LOCUS ECHPLE 1758 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.chrysanthemi pectate lysase e isoenzyme (pelE) gene, complete cds. ACCESSION M33584 KEYWORDS pectate lysase e isoenzyme. SOURCE E.chrysanthemi DNA. ORGANISM Erwinia chrysanthemi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1758) AUTHORS Reverchon,S., Huang,Y., Bourson,C. and Robert-Baudouy,J. TITLE Nucleotide sequences of the Erwinia chrysanthemi ogl and pelE genes negatively regulated by the kdgR gene product JOURNAL Gene 85, 125-134 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 251 1465 pectate lysase e (pelE) BASE COUNT 534 a 480 c 379 g 365 t ORIGIN 1 tttaaaactt catttttaat ttaatcataa aaaataaaaa accaagtaac acaaagttac 61 aaatacagtc aatagtttat tttttattaa taaaaaccat tgtcatcatc gtgacaaagt 121 tcacaaaata gacactcaac cgcataaaca ttcgccaaag caaaagatga aatggtattt 181 cgtttttaga cacacatgta acaaatggac accatttgga tcgctcactg agcacacaag 241 gaaaattgcc atgaacaact cacgtatgtc ttccgtttca acacagaaaa caacaggacg 301 ttctgccttg ggaaccaaaa gcgcgctggc cgcgattatc gccaccacca tgatggtctc 361 agtcgcttcc gccgccagtc tgcaaaccac caaagcgaca gaagcggcgt caaccggctg 421 ggcaacgcag agcggcggca ccaccggcgg cgccaaagcg tcatcatcca aaatctacgc 481 ggtgaaaagc atcagcgaat tcaaagcagc actgaacgga accgattcgt cgcccaagat 541 catccaggtc accggagcaa ttgatatcag cggcggtaaa gcctacaccg actttgacga 601 tcagaaagcc cgcagccaga tcagcattcc gtccaacacc accatcatcg gtattggcaa 661 caaaggcaaa ttcaccaacg gttcgctggt ggtgaaaggc gtcagcaacg ttatcctgcg 721 taacctgtac atcgaaacgc cggtggacgt ggcgccgcat tacgaagagg gagatggttg 781 gaacgccgag tgggacgcag tagtgattga cagcacagac cacgtctggg ttgaccatgt 841 caccatcagc gacggcagcc tcaccgacga caaatacacc accaaaaacg gcgaaaaata 901 tgtgcagcat gacggctcgc tggatatcaa gcgcggctcc gactacgtta ccgtttctaa 961 tagccgcttc gagctgcacg acaaaaccat cctgatcggc cacagcgaca acaacggctc 1021 gcaggacgcc ggcaaactgc gcgtcacctt ccacaacaac ctgtttgacc gggtcggcga 1081 acgtaccccg cgcgtacgct tcggcagcgt tcacgcttac aacaacgttt acgttggcga 1141 cgtcaaccac aaagcctatc gctatcagta cagcttcggt atcggcacca gcggtagcct 1201 gctgtctgag tccaacgcgt ttaccattga taacatgaaa aaaatcagcg gccgcgacaa 1261 agaatgcagc gtggtcaagg cgtttaacgg taagatcttc tctgataaag gctcgatcat 1321 caacggcgct tcgtacaatc tgaatggttg cggctttggc ttcagcgcat acagcgccaa 1381 gatcccgtac aaatactcag cccaaaccat caccaccagc ctggcaaaca gcatcagcag 1441 caacgcgggc tacggtaaac tgtaatttcc cgcccagcgt aaaggccact tgcaaaagtg 1501 gcttttattt tcccatattc ccaccactcg tattaccccg atgttctccg aagcgccact 1561 caatacggtt caggttaatc cggccttgcc acaggcgttc atcaagaaat cgaccggttt 1621 cccccatcaa atgcagggac tagcaggaat tagtaagaga cacagattaa taaaattcta 1681 tttaataaaa atcgaatcaa ttgttaactg tttgggttat ttcagataaa aacgcttata 1741 catatagctg aatttaaa // LOCUS ECOFIMA1 66 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.coli type 1 fimbriae (fimA) gene with viral insertion site, partial cds. ACCESSION M33412 KEYWORDS type 1 fimbriae. SOURCE E.coli (strain HB101) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 66) AUTHORS Hedegaard,L. and Klemm,P. TITLE Type 1 fimbriae of Escherichia coli as carriers of heterologous antigenic sequences JOURNAL Gene 85, 115-124 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 66 type 1 fimbriae (AA at 1) recomb 3 4 E.coli DNA end/ viral DNA start recomb 63 64 viral DNA end/ E.coli DNA start BASE COUNT 12 a 19 c 17 g 18 t ORIGIN 1 gttgatcccc ggacctgcat gatcatcgct caaggaacct ctatgtatcc ctcctgttgg 61 ggggat // LOCUS ECOFIMA2 66 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.coli type 1 fimbriae (fimA) gene and viral insertion site, partial cds. ACCESSION M33413 KEYWORDS type 1 fimbriae. SOURCE E.coli (strain HB101) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 66) AUTHORS Hedegaard,L. and Klemm,P. TITLE Type 1 fimbriae of Escherichia coli as carriers of heterologous antigenic sequences JOURNAL Gene 85, 115-124 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 66 type 1 fimbriae (AA at 1) recomb 3 4 E.coli DNA end/ viral DNA start recomb 62 63 viral DNA end/ E.coli DNA start BASE COUNT 14 a 17 c 17 g 18 t ORIGIN 1 gttgatccaa acctgcgtgg agatctccag gttctggctc agaaagttgc tcgtacctta 61 ccggat // LOCUS ECOSPDE 2792 bp ds-DNA BCT 01-JUN-1990 DEFINITION E.coli speED operon speE and speD genes encoding S-adenosyl- methionine decarboxylase and spermidine synthase, complete cds. ACCESSION J02804 J04247 KEYWORDS S-adenosylmethionine decarboxylase; speD gene; speE gene; spermidine synthase. SOURCE E.coli K12 DNA, clones pT7-5-1 and LC37-29. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 643 to 2524) AUTHORS Tabor,C.W. and Tabor,H. TITLE The speED operon of Escherichia coli: Formation and processing of a proenzyme form of S-adenosylmethionine decarboxylase JOURNAL J. Biol. Chem. 262, 16037-16040 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 642; 2525 to 2792) AUTHORS Xie,Q.-W., Tabor,C.W. and Tabor,H. TITLE Spermidine biosynthesis in Escherichia coli the promoter and the termination regions of the speED operon JOURNAL J. Bacteriol. 171, 4457-4465 (1989) STANDARD full staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly submitted by H.Tabor, 02-DEC-1987 and for [2] 13-MAR-1990. FEATURES from to/span description pept 314 661 unidentified protein pept 767 1633 spermidine synthase /nomgen="speE" pept 1649 2443 S-adenosylmethionine decarboxylase (ttg start codon) /nomgen="speD" signal 2448 2482 hairpin terminator BASE COUNT 714 a 738 c 637 g 703 t ORIGIN 1 aactgaatgc gattacgggc atcggtcgtg agcaaatcag ggatcggtaa cgttgggcgt 61 tctgccgcaa atactgcgcg gctccacagc ggcaaagcga agccacaccc agcgcgacgg 121 aatattttaa gaaatcacga cgttgcatag ttatttcctt attcttaagc aggcgaaaca 181 aaatcaaacg ttgagcatag tccttcccct tacgggaagg tcaagccgca gacacaattt 241 aatcgacaat aatggtcgtc gcctcgctcg cagtgtgcta acgtttatct tctttaagcc 301 cctggtagaa gcaatgaaga cgtttttcag aacagtgtta ttcggcagcc tgatggccgt 361 ctgcgcaaac agttacgcgc tcagcgagtc tgaagccgaa gatatggccg atttaacggc 421 agtttttgtc tttctgaaga acgattgtgg ttaccagaac ttacctaacg ggcaaattcg 481 tcgcgcactg gtctttttcg ctcagcaaaa ccagtgggac ctcagtaatt acgacacctt 541 cgacatgaaa gccctcggtg aagacagcta ccgcgatctc agcggcattg gcattcccgt 601 cgctaaaaaa tgcaaagccc tggcccgcga ttccttaagc ctgcttgcct acgtcaaata 661 atccctgata cctttttgta gaaatattga ccgtgcatcc gcggtcaatg ttagctatta 721 tgttgcgccc tttttttacg ggtgttaaca aaggaggtat caacccatgg ccgaaaaaaa 781 acagtggcat gaaacgctac acgaccagtt tgggcagtac tttgcggtag ataacgttct 841 gtatcatgaa aagaccgatc accaggatct gatcattttt gagaacgctg catttggtcg 901 cgtaatggcg ctggatggcg tagtacaaac caccgagcgc gacgagttta tctatcatga 961 gatgatgacc catgttccgc tactggccca tggtcacgcg aaacatgtgc tgattatcgg 1021 cggcggcgac ggtgccatgc tgcgtgaagt aacccgacat aaaaacgttg agtcaatcac 1081 gatggtggaa atcgatgcgg gtgtcgtatc gttctgccgt cagtatctac ccaaccataa 1141 cgccggtagc tacgacgatc cgcgctttaa gctggtgatc gacgatggcg tcaatttcgt 1201 taatcaaacc agccagacct ttgatgtcat tatctccgac tgcaccgatc ctatcggtcc 1261 cggcgaaagc cttttcactt cggcatttta tgaaggctgc aaacgttgcc tgaatcctgg 1321 cggtatcttc gtcgcacaaa acggcgtctg ctttttacag caggaagaag ccatcgacag 1381 ccatcgcaaa ctcagccatt acttcagcga cgttggcttt tatcaggcgg cgatcccgac 1441 ctattacggc ggtatcatga cttttgcatg ggcgacagat aacgacgcct tacgccatct 1501 ctcaaccgaa attattcagg cgcgttttct cgcctctggc ctgaaatgcc gttattacaa 1561 tccggcaatc catacggcag cttttgcctt acctcagtat ctgcaagacg cactggcttc 1621 acagccgtcc taaggagaag ataagaaatt gaaaaaactg aaactgcatg gctttaataa 1681 tctgaccaaa agtctgagtt tttgtattta cgatatctgc tacgccaaaa ctgccgaaga 1741 gcgcgacggt tatattgctt atatcgatga actctataat gccaaccgtc tgaccgaaat 1801 cctgtcagaa acctgttcca ttatcggggc taatattctt aacatcgccc gccaggatta 1861 cgaaccacag ggtgccagcg tcactattct ggtgagtgaa gaaccggttg acccgaaact 1921 catcgacaaa acagaacacc ccggcccact gccagaaacg gtcgttgccc atcttgataa 1981 aagtcatatt tgcgtacata cctacccgga aagtcatcct gaaggcggtt tatgtacctt 2041 ccgcgccgat attgaagtct ctacctgcgg cgtgatttct ccgctgaagg cgctgaatta 2101 cctgatccac cagcttgagt ccgatatcgt aaccattgat tatcgcgtgc gcggttttac 2161 ccgcgacatt aacggtatga agcactttat cgaccatgag attaattcga ttcagaactt 2221 tatgtctgac gatatgaagg cgctgtatga catggtggat gtgaacgtct atcaggaaaa 2281 tatcttccat accaagatgt tgcttaaaga gttcgacctt aagcactaca tgttccacac 2341 caaaccggaa gacttaaccg acagcgagcg ccaggaaatt accgctgcgc tgtggaaaga 2401 aatgcgcgag atttattacg ggcgcaatat gccagctgtt taacggctct ggcggagctc 2461 ccaggctccg ccagatttat ttacttctgc tgcacgaaat tgcggtaagc cgccacgacc 2521 tgcagaaaat cctcaacgcc gcatagcgac aggctttctt cgtcgtagta gttcatcccc 2581 tcttccattt catcgccagc gaattccagt tgattggcgc gaaccatcac ctcttcaccg 2641 tccatccaca gggtgtattc atgccctgcc cgttgccagg accgttcgct acctttcagt 2701 gcgtgcgctg cttgttccac ttcatcaagc aaggccaggt tttctttcac ctcttcatta 2761 aaccagtgcc cgaccacttc atgccccatg ga // LOCUS HUMBTRO 608 bp ds-DNA PRI 01-JUN-1990 DEFINITION Human beta-tropomyosin gene, exon 1. ACCESSION J05247 KEYWORDS beta-tropomyosin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 608) AUTHORS Libri,D., Mouly,V., Lemonnier,M. and Fiszman,M.Y. TITLE A nonmuscle tropomyosin is encoded by the smooth/skeletal beta- tropomyosin gene and its RNA is transcribed from an internal promoter JOURNAL J. Biol. Chem. 265, 3471-3473 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 441 / 572 beta-tropomyosin IVS 573 > 608 beta-tropomyosin intron A BASE COUNT 102 a 198 c 223 g 83 t 2 others ORIGIN 1 gtctctgcag cagatgtgtg gggactctgg agagcacccc tggatacact ccatgcagag 61 ctgctcacag cctctgcggc cccttttcct ctcaggaaaa taaacacccc gaccgccttc 121 ccctctgcct gctgggctcg ctgcggcatg cagccctcct ggnntctgtg ccctgctttg 181 gggctgtgcg cccatgctgg ggtcgacagc gtgcctccac ggggcagcca tgagccttat 241 cgggggacga cgtcgctcag cgcagggagc gcggcgggag cgggtcggtg tgcggcagga 301 ggaggaggag gaggaggagc aggaagggat ggaggcaggg cgggagcggc cgccgcctga 361 gcgcttcctg cccgagccag gcggatccca caaagggctc agcggagcgg ccttgctgcg 421 ctccccgccg gccgcaggcc atggccggca tcagctccat cgatgccgtc aagaagaaga 481 tccagagcct gcagcaggtg gctgacgagg cggaggaacg cgccgagcac ctgcagcggg 541 aggccgatgc cgagcggcag gcccgggagc gggtaaggct cctgggggca ccccgcgacc 601 cccagacc // LOCUS HUMGPIIB1 2238 bp ds-DNA PRI 01-JUN-1990 DEFINITION Human platelet glycoprotein IIb (GPIIb) gene, exon 1. ACCESSION M33319 KEYWORDS platelet glycoprotein IIb. SEGMENT 1 of 3 SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2238) AUTHORS Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S., Schwartz,E. and Poncz,M. TITLE Organization of the gene for platelet glycoprotein IIb JOURNAL Biochemistry 29, 1232-1244 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1286 + 1473 platelet glycoprotein IIb (GPIIb), exon 1 pre-msg 1254 > 2238 GPIIb mRNA and introns IVS 1474 > 2238 GPIIb intron A BASE COUNT 596 a 554 c 595 g 493 t ORIGIN 1 gtcaacggat cagaaaatag aaatcaaaag gaaaatgtgg ctatggttac ccctagcgga 61 cctcttaaat cttcctgaga acctgctttt ttgggaaggc atgagtgcca gtaagacttg 121 gcactcctcc tcttccgctt accgagagaa aatgactttg cctttctgct caaaactcat 181 cccttcactt tgtcacccta tgtttgcatc ttccatcctt agtgtgtgtt tccatccatc 241 cagtctttca gcaatacacg tactacacat tggactcttg ggtagtctct agggctgtag 301 caaggagcct tgctcccaag ggactcattt acacaatcct gtgaacggac caagagtaaa 361 cagtgtgctc aatgctgtgc ctacgtgtgt tagcccacgc ggccagcctg aggagtcagg 421 gaaggctccc ctaggcaaag cccccaacca gaatcaagtc ttaatggtta aagagctcca 481 tcacccaaaa aggattgagg gcctaccttc aactgaacag ctaatgcata atctcagaaa 541 ctgtgagtca aaattccctg gaataactcc actttatccc caatctcctt gccacctaga 601 ccaaggtcca ttcaccaccc tgtccccagc actgactgca ctgctgtggc cacactaaag 661 cttggctcaa gacggaggag gagtgaggaa gctgctgcac caatatggct ggttgaggcc 721 gcccaaggtc ctagaaggag gaagtgggta aatgccatat ccaaaaagat acagaagcct 781 caggttttat cgggggcagc agcttccttc tccttccccg acctgtggcc aagtcacaaa 841 gcaccacagc tgtacagcca gatgggggaa gggaggagat tagaactgta ggctagagta 901 gacaagtatg gaccagttca caatcacgct atcccaagca gaaagtgatg gtggcttgga 961 ctagcacggt ggtagtagag atggggtaaa gattcaagag acatcattga taggcagaac 1021 caataggaca tggtaataaa ctattctcag gaaaggggag gagtcatggc tttcagccat 1081 gagcatccac cctctgggtg gcctcaccca cttcctggca attctagcca ccatgagtcc 1141 aggggctata gccctttgct ctgcccgttg ctcagcaagt tacttggggt tccagtttga 1201 taagaaaaga cttcctgtgg aggaatctga agggaaggag gaggagctgg cccattcctg 1261 cctgggaggt tgtggaagaa ggaagatggc cagagctttg tgtccactgc aagccctctg 1321 gcttctggag tgggtgctgc tgctcttggg accttgtgct gcccctccag cctgggcctt 1381 gaacctggac ccagtgcagc tcaccttcta tgcaggcccc aatggcagcc agtttggatt 1441 ttcactggac ttccacaagg acagccatgg gaggtgagcc gtaagggaag ttggggtatt 1501 gggagagagc aggacccctc cccatcactg cttctggggg cttcgagttt cccatttgcg 1561 atagcagttg agcaaggtga cttgtggggc ctattcaggt tgatttcttg tcaagaatgt 1621 tggggtccag gggactggct caggtgaagg tataagggca gggcacatgt gggctgatgg 1681 gcactgaaaa ctacagcaag aacaaaggga agacaagagt tgatgcttta ttttttcccc 1741 aagggtcagt tgtatgaacc actccaccct caacaccttg aaatgcagag aggaggccgg 1801 gcgcggtggc tcatgcctgt aatcccagca ctttgggagg ccgaggcggg cagatcacct 1861 gaggtcgaga attcgagacc agcctgacca acatggagaa accccgtctc tactaaaaat 1921 acaaaaaaaa aaaaaaaaaa aagaggccag gcacagtggc tcacacctgc aatcccagca 1981 ctttgggagg cagaggtggg cagatcatga ggtcaggagt tcaagaccag cctggccaat 2041 atggtgaaac cctgtctcta ttaaaaatac aaaaattagc tgagcatggt ggcacactcc 2101 tgtagtccca gctactcggg aggctgaggt aggagaatca cttgaacccg ggaggtggag 2161 gttgcagtga gctgagacta tgccactgca ctcccagccc tggggttgac agagtgacac 2221 tccgtctcaa aaaaaaaa // LOCUS HUMGPIIB2 13204 bp ds-DNA PRI 01-JUN-1990 DEFINITION Human platelet Glycoprotein IIb (GPIIb) gene, exons 2-29. ACCESSION M33320 KEYWORDS platelet Glycoprotein IIb. SEGMENT 2 of 3 SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 13204) AUTHORS Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S., Schwartz,E. and Poncz,M. TITLE Organization of the gene for platelet glycoprotein IIb JOURNAL Biochemistry 29, 1232-1244 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 498 619 platelet Glycoprotein IIb (GPIIb), exon 2 709 806 platelet Glycoprotein IIb, exon 3 912 1077 platelet Glycoprotein IIb, exon 4 1293 1342 platelet Glycoprotein IIb, exon 5 1419 1464 platelet Glycoprotein IIb, exon 6 1552 1680 platelet Glycoprotein IIb, exon 7 2042 2089 platelet Glycoprotein IIb, exon 8 2245 2288 platelet Glycoprotein IIb, exon 9 2461 2514 platelet Glycoprotein IIb, exon 10 2653 2705 platelet Glycoprotein IIb, exon 11 2897 3108 platelet Glycoprotein IIb, exon 12 5536 5718 platelet Glycoprotein IIb, exon 13 5952 5997 platelet Glycoprotein IIb, exon 14 6106 6210 platelet Glycoprotein IIb, exon 15 6295 6350 platelet Glycoprotein IIb, exon 16 6443 6594 platelet Glycoprotein IIb, exon 17 6783 6908 platelet Glycoprotein IIb, exon 18 7886 7953 platelet Glycoprotein IIb, exon 19 8087 8234 platelet Glycoprotein IIb, exon 20 8803 8895 platelet Glycoprotein IIb, exon 21 9506 9585 platelet Glycoprotein IIb, exon 22 10202 10282 platelet Glycoprotein IIb, exon 23 10406 10505 platelet Glycoprotein IIb, exon 24 10605 10757 platelet Glycoprotein IIb, exon 25 10874 10999 platelet Glycoprotein IIb, exon 26 11478 11591 platelet Glycoprotein IIb, exon 27 11828 11929 platelet Glycoprotein IIb, exon 28 12117 + 12233 platelet Glycoprotein IIb, exon 29 pre-msg < 1 > 13204 GPIIb mRNA and introns IVS < 1 497 GPIIb intron A IVS 620 708 GPIIb intron B IVS 807 911 GPIIb intron C IVS 1078 1292 GPIIb intron D IVS 1343 1418 GPIIb intron E (no splice consensus) IVS 1465 1551 GPIIb intron F IVS 1681 2041 GPIIb intron G IVS 2090 2244 GPIIb intron H (no splice consensus) IVS 2289 2460 GPIIb intron I IVS 2515 2652 GPIIb intron J IVS 2706 2896 GPIIb intron K IVS 3109 5535 GPIIb intron L IVS 5719 5951 GPIIb intron M IVS 5998 6105 GPIIb intron N IVS 6211 6294 GPIIb intron O IVS 6351 6442 GPIIb intron P IVS 6595 6782 GPIIb intron Q IVS 6909 7885 GPIIb intron R IVS 7954 8086 GPIIb intron S IVS 8235 8802 GPIIb intron T IVS 8896 9505 GPIIb intron U IVS 9586 10201 GPIIb intron V IVS 10283 10405 GPIIb intron W IVS 10506 10604 GPIIb intron X IVS 10758 10873 GPIIb intron Y IVS 11000 11477 GPIIb intron Z IVS 11592 11827 GPIIb intron AA IVS 11930 12116 GPIIb intron BB IVS 12235 > 13204 GPIIb intron CC BASE COUNT 3046 a 3579 c 3857 g 2722 t ORIGIN About 2000 bp after segment 1. 1 ctgcaggtca acggatctgc tagggtcctc ctatcagcac acacactcca gccccacttt 61 agaggtaccc gctaccttcc ctcattaaaa ccagctctca agaggggatc tggtaacagt 121 ctaggcaggc attccaggga gcatgtgaac cgctggttct tgttgcgggt ggaggatgga 181 ggtgttgtac agagtttagg tctttttcag caaagatctc caaaccccgg gtgttcaaaa 241 tcaaaccaaa ggggattata gtcccagctc tactcacaac tcactggtta ctttagccac 301 gagattgccc tcgctgagag tcggtttcac tgtccataag atgaagaagt acatcacggt 361 ggtctgtgag gtgtcattga ggaaagatgg tccagtgccc ccatgccaca tggccttcgg 421 gcagtgctcc cagcgccggc gccagggcct gggatacgct ggaatctgcg cggcgctcac 481 ccagctttcc tatgcagagt ggccatcgtg gtgggcgccc cgcggaccct gggccccagc 541 caggaggaga cgggcggcgt gttcctgtgc ccctggaggg ccgagggcgg ccagtgcccc 601 tcgctgctct ttgacctccg tgagtcccag gcaaggagag caaggttggg gtcagaggga 661 cgtggactgc ccgggcttca gcgccccacc ccttcttgtg ccttccaggt gatgagaccc 721 gaaatgtagg ctcccaaact ttacaaacct tcaaggcccg ccaaggactg ggggcgtcgg 781 tcgtcagctg gagcgacgtc attgtggtgg gccccgcggt acagggcaca gggaacaatc 841 gggggcaggg acactggggc caggaggagc ccaagtctcg cgccccgtcc ccatctgtgg 901 ccctttctca ggcctgcgcc ccctggcagc actggaacgt cctagaaaag actgaggagg 961 ctgagaagac gcccgtaggt agctgctttt tggctcagcc agagagcggc cgccgcgccg 1021 agtactcccc ctgtcgcggg aacaccctga gccgcattta cgtggaaaat gattttagta 1081 agcgccagct acgacctggc cccgcccact cgcgacggct tggccccgcc ccccatcgga 1141 tcccgccccc agcgccgcag cccttgcttt ggatctggcc tcgccccagg gccccgccga 1201 ctcaaggccc cgcccctgtc ccccagccct cctccgggct cgcgcgcgcc tcccttcacc 1261 cctgggctga cccctcctcc ttgtctcctc aggctgggac aagcgttact gtgaagcggg 1321 cttcagctcc gtggtcactc aggcgagtag ggagcaaaag cgcagtgggg gcggctccca 1381 aacagggccc cctctcaccc tcaggacttc ccttccaggc cggagagctg gtgcttgggg 1441 ctcctggcgg ctattatttc ttaggtacgt gcccatccgt acacctccct cccttctcgc 1501 ggccgaagga gaccgctttg ggcttcacac ccgctgtccc tcccgcccta ggtctcctgg 1561 cccaggctcc agttgcggat attttctcga gttaccgccc aggcatcctt ttgtggcacg 1621 tgtcctccca gagcctctcc tttgactcca gcaacccaga gtacttcgac ggctactggg 1681 gtaacaccgc cattccagac ttccagcacc ccgagggtca ccgcccaccg cagacggtca 1741 ggtcctgccc ctgtgggagc ctccatggcc acccctgccg gccaacccac cgcctaagcc 1801 gctcccgccc tccgctcctg cgcttccccg cagaccgccc acctcccatg cgcccaccgc 1861 tcccttccac tgcggactcg tagcgcagcc tggggcaggg cttggcccct cgaaggcctc 1921 cgtttttcca tctgcacaat gcagggctgg ggctgagtgg ccttaatctc ctccttcttt 1981 gccctccgtc ccctctgtgc ttcctcccct ggaaaagact aatttgcgcc cttgtcctca 2041 gggtactcgg tggccgtggg cgagttcgac ggggatctca acactacagg caagaaatcc 2101 acttagggcg ggagttgggt agcccagccc ggggaggagc gccttcctga aatctcccct 2161 atgtagctgg gtgcagaacg gggagcggga agtgggtagg ttctaaggct ctcattccct 2221 gagcctggct ctccctatcg ccagaatatg tcgtcgtgcc ccccacttgg agctggaccc 2281 tgggagcggt aagtgccccc accactgggc ctcccgaagc cccttatccc agttctcagg 2341 ctgacaactc ctgagcgccc cccacccccg ccccgcctcc accaaaccac cctttctcac 2401 ctggagtggg aggttgcttt gggtacaaga atgatgctct cgcctgcgct gtccgtgcag 2461 gtggaaattt tggattccta ctaccagagg ctgcatcggc tgcgcggaga gcaggtgggg 2521 gccaggtccc agtgggcgtg gctgggtgga gggggaactg agacttcaga atatttcatg 2581 ggaggtgagg gcccatttct taaagaggat gcttgtccag cggcgtgaat gatggtgctc 2641 ctcatcttgc agatggcgtc gtattttggg cattcagtgg ctgtcactga cgtcaacggg 2701 gatgggtgag gagggacatg cccccacccc tacccagttg ggtcccaaat taccagagct 2761 gcccctctgt ctccctttcc tagccctagt ctcacgtatc cactggagga acaggagagc 2821 aagggtcgag gagatttggc cctagcccca atatacccct ggtccagtcc catgtaacca 2881 ctcatctggc ccacaggagg catgatctgc tggtgggcgc tccactgtat atggagagcc 2941 gggcagaccg aaaactggcc gaagtggggc gtgtgtattt gttcctgcag ccgcgaggcc 3001 cccacgcgct gggtgccccc agcctcctgc tgactggcac acagctctat gggcgattcg 3061 gctctgccat cgcacccctg ggcgacctcg accgggatgg ctacaatggt gagggaagag 3121 aggagcccta cttgctgcag aggggttaac agccactcaa aaagcatgga gttggcctga 3181 gggcagccag aaccaggatg ggttttaagc atataagtat gtggcttaga cacatggggt 3241 gctgagtgga gagcagatgg gagagttgaa gactaattag gaagtgtttg ccttaatcca 3301 agcaagagac aatgaccacc tggatgtgga ttttggcagt ggagttagag atgggagtga 3361 cttcacagat atttaggact cggattatta ggacttggtg ggagactgga tgtggggcca 3421 ggggagaggt tggagttggg tgcctgtgat ggcctccact gcctggaact caggccgtgc 3481 agcaggtgct ggggagaggc gggagatcag cagttcagct ctggacctgt tgagcttgaa 3541 gggcttgggt gctttaggcg gaaatatcca aagaacagtt gggagtggct ctccccgctt 3601 ccacaagaga gatctgaatg ggagacaggg gtttggggaa agtggatgag gtcccgggac 3661 ctgtgaaata agaggcccag gatagagccc tagggagcaa aagcatttag gtgactccta 3721 caggaggtaa gtctgagaag gagacagagg agtgtccaga gagggaggag ggaacccagg 3781 gggtctgatg gcccgggact caaggaagag catgcgttaa agagcatgca caggaggaag 3841 tgggcgctgc agctcctgct gctgctgcaa gatacaatta ggtggggctg gagaaatatt 3901 catgggcttt agcaagaaga gggtgccagg catggtggct catacctgta atcccagcta 3961 cttgggaaat tgaagcagga gaatctcttg aacccgggaa gtggaggttg cactgagctg 4021 agcttgcgcc actactgcac tccagcctgg gtgacagagc aagactccat ctcaacaaaa 4081 taaaaaaaaa aatagagaaa gaaaggaaga aagaaaaaag aaggggaggt tattggtgac 4141 agtgacataa attgattcag gccaagatag ggtcagaagc cagaatgcaa tggggtaagg 4201 tatgaatgga gatgaaaaat tggatgcagc taatgtagac agctctttca acaggtttgt 4261 ggtaaaaagg aatttgagga atagaaagga aaaaaaaaaa catgtttgac tataagagga 4321 aaaagagaaa aggtgatcac agaaaagaga tgagggtcaa gggaagatta tttcaatgtg 4381 gaagaacatg tagtaggttg aaaatgatgt tgtggggaaa tggggggatg agccagcaga 4441 gagtccctgt gatgcctcag ggggtgggag ggtgactggc ccagtgtcag ggtgaaggaa 4501 ggaaacctct tccagggtca aatggggaaa gggaaaaaga aagttggtgt gggattatag 4561 cataacagtg ggctgcctct cttcctgaag taagagatta cgtcacctgc tgaaggaagt 4621 gtggggggtc tgggagtttg atggaatgga gaaggctaga aatagatgct agatggccag 4681 gcacggtggc tcacacctgg aatcccagca ctttgggagg ccgaggcagg aggatcactg 4741 gagcctagga gtttgacacc agcctggcca acatagggag atctcgtctc cataaaaatt 4801 tttaaaaatt agctgggcat ggtggctata gtctcaactg cttgggaagc tgaggtggga 4861 ggattgcttt agtccagaag gttgaggctg cagtaagcca tggttgcacc actgcacttc 4921 agcctgaatg acaagtgcaa gactgtctta aaataaaaaa tttaaagggc ttgggcacgg 4981 tggctcacac ctgtaatcca gcactttggg agcccaaggt gggcagatca cttgaggtca 5041 ggagttcgag atcagcctgg ccaatgtggt gaaaccccgt ctctactgaa aatacaaaaa 5101 ttagccgggc atggtggtag gcgcctgtaa tcccagctac tgaagaggct gaggcacaag 5161 aatcacttta acgggggagg cagaggttgc agtgagccga gatcgcacca ctgcactcca 5221 gccaggacaa cagagcgaga ctccatctca aaaaaaaaaa aatttagaaa agggaataat 5281 gatgcttaat tttcaggata tattttcctc aatagacagt gagagttgtc actgttttta 5341 taacaatcct acttggcagg tccctctccc acctgattgt taactcctgg agggtagggc 5401 agtgcctcct tcacccacac tttgcacccc tttcctagtc tcctgggatg ttcccagaga 5461 agctcaggaa agttttacag tcatctaggg aggctgaata acaatcagcc acttcctttc 5521 tgttactcct tccagacatt gcagtggctg ccccctacgg gggtcccagt ggccggggcc 5581 aagtgctggt gttcctgggt cagagtgagg ggctgaggtc acgtccctcc caggtcctgg 5641 acagcccctt ccccacaggc tctgcctttg gcttctccct tcgaggtgcc gtagacatcg 5701 atgacaacgg atacccaggt gccctggact gcctccagct agaaatgccc aagaaaggcc 5761 cttggacatt cgctggaagt gccaagagac acggccaggg ctcatgcctg gcctggtgtc 5821 ccactatgga ctgccagagg ggctgggtga aacctccagt gggggaggtg gtgtggggaa 5881 cccctgggaa gatgagatga ggatccccat accctaatcg ccaattctga cccattcctc 5941 gatgtctata gacctgatcg tgggagctta cggggccaac caggtggctg tgtacaggtg 6001 agcactggct ccaggggcgg gatggggaag gtcctgtgcc atcaagagga ggccaggcca 6061 ggaggagcca caatggcaag cctccccatc accctatccc atcagagctc agccagtggt 6121 gaaggcctct gtccagctac tggtgcaaga ttcactgaat cctgctgtga agagctgtgt 6181 cctacctcag accaagacac ccgtgagctg gtgaggaggc agagggcatg ggccttaaag 6241 gatctgggac ctcagaaagg ctccaacccc tgagccccac ttacgtcttt gcagcttcaa 6301 catccagatg tgtgttggag ccactgggca caacattcct cagaagctat gtgagtggca 6361 tgaagggggc aggagggagg tgggcttgga ctcccccgga ggctggccag ggaggtcctg 6421 actcttctgc ttgccctgcc agccctaaat gccgagctgc agctggaccg gcagaagccc 6481 cgccagggcc ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctgaacctg 6541 gatctgggcg gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagtacgc 6601 ccaggcaggg gattggcagg gctgggagag tagaacttac ccactggact tgttcatcta 6661 gccctggggc actgagctgg gtgctgtgag tccgggggtg gtcaggacac aggtgcctac 6721 tggccaggag aaggtgggat gtgtatggta gcaagatggc ctgactcttg cccctgtcct 6781 aggatgaggc agacttccgg gacaagctga gccccattgt gctcagcctc aatgtgtccc 6841 taccgcccac ggaggctgga atggcccctg ctgtcgtgct gcatggagac acccatgtgc 6901 aggagcaggt agggacaggc agggacaggc cagggaggtg caggacccct gatagcaaat 6961 caggattagg gttagtgcca agtcacaatg taaccccaaa accttgatgt cattccaaac 7021 cctaatgaaa acctcaaaat ccagccagtc atggtggctc acacctgtaa tcccagcact 7081 ttgggagacc gaggcaggca gattgcctga ggtcaggagt tagagaccaa cctggccaac 7141 atggtgaaaa cccatctcta ctaaaaatac aaaaaaaatt agccgggtgt ggtgacgcat 7201 gcctgtaatt ccagctactc gggaggctga agcaggagaa tcacttgaac ccaggaggca 7261 gaggttgcag tgagccaaga gtgtgccaca gcactccagc ctgggtgaca gagcaagact 7321 ctgtctcaaa aaaaaaaaaa aaagccaggc gcagtggcct cacgcctgta atcccagcac 7381 tttgggaggc caaggcgggt ggatcacgag gtcaggagat caagaccatc ctggctaaca 7441 cagtgaaacc ccgtctacta aaaatacaaa aaaaaaaaaa aaattagctg ggcgtggtgg 7501 cgggtacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaaccccgg 7561 gggcggacgt tgcagtgagc cgagatagtg ccactgcact ccagcctgga cgacagagcg 7621 agactccgtc tccaaaaata aaaaaacacc tgaaaatccc agtatcccct aagctctgat 7681 gtaaattgac aaaccctgac attgtcccaa acctccaaat ataacccgag ccccgatacc 7741 atctacaaac tccttttcgt cctcagatct tcttactccc taagccccta tgtgaacccc 7801 aagcccactg ttttcctaac cctgatgtaa tccctaaacc tcacacatcc ccaacttacc 7861 cgcacacccc aatgtgcccc tctagacacg aatcgtcctg gactgtgggg aagatgacgt 7921 atgtgtgccc cagcttcagc tcactgccag cgtgtgagga ggcctcccat tctgcccgac 7981 cctggccctt tctgcctatc atacctgctc cacaccttag tcccctcttt tcccacatcc 8041 tgggcccaga cccaggctcc ctggcttcac tcctctttcc ccacaggacg ggctccccgc 8101 tcctagttgg ggcagataat gtcctggagc tgcagatgga cgcagccaac gagggcgagg 8161 gggcctatga agcagagctg gccgtgcacc tgccccaggg cgcccactac atgcgggccc 8221 taagcaatgt cgaggtatgg cccccaccct gggaacagta cccgggacct gggaggcact 8281 ggagccttgg ctctctcatc tccctccctg agagtccctc ttctcttctg ctttgctgtc 8341 aaagatgtaa tttttttttt aatttggagg aggatacttg ctaatggtca gtcagaattc 8401 caaaactcta ttacaaaaac cagaaaaaca aaaaaggttt aggaaccaaa tgttaacagg 8461 aacctctgtt aacatttggt ggatttcctt ccagtctttt tttcaatatt gactcacact 8521 cacataagta tatatttatt ttttatgttg ttaatatagt ttataataat gggggtcata 8581 ctctaatgtt ttgtgttttt tatttccaaa atgaaaatgc ctaaaaagta gtagtgctac 8641 agcaatacac acactagcat gtgacagtcc cttgagcgac cccaccccaa gaaacccccc 8701 cctccctacc ttggcacaca aatctttcca gaccttccaa gggagcttaa atatatatat 8761 atgatgctct gtaatttctt tcttggaact gccttcctga agggctttga gagactcatc 8821 tgtaatcaga agaaggagaa tgagaccagg gtggtgctgt gtgagctggg caaccccatg 8881 aagaagaacg cccaggtgag gctgctgggt cgtggtaccg ggtctccacc aggggctcat 8941 gaataaccag attttagggg tgaggtttta gagccacata gttctgggcc agaatcttgg 9001 tcctcacact ccctttgcca acattgtcct tgggtgagtg actttccctc tctgagcccc 9061 tttaccagtg ggcttccagg taaaatagaa ataataatgg tggcctggtg cggtcgtcac 9121 gcctgtaatc ccagcactct gggaggccag agcgggtgga tcacgaggtc aggagttcaa 9181 gaccagcctg gccaacatag caaaaccccg tctctactaa aaatacaaaa attacccggg 9241 catggtggcg cacgcctata gtcagagcta ctcgggaggt tgaggcagaa aaatcacttg 9301 aacctgggag gtggaggttg cagtgagccg agatcatgcc actgcactcc agcctgggtg 9361 acagagtgag actccgtctc ggaaaaaaaa aaaaagaaaa agaatagtgg tgatcttgga 9421 gggtgaagac tggaggccac attcagggca gggctgtcct aagtggggca cttgggcagt 9481 gaccttggcc ctcctcatct cccagatagg aatcgcgatg ttggtgagcg tggggaatct 9541 ggaagaggct ggggagtctg tgtccttcca gctgcagata cggaggtact gacctggcga 9601 gcgtgcctac ccaccaccct tcccccgtct gacccccgtg cagagcccct caggtccctt 9661 ccatacagaa gggtctttcg aggccaggcg cagtggctca cacctgtaat cccagcacgt 9721 tgcgaggcca aggcagaagg atcactggag gtcaggagtt ggagaccagc ctggccaaca 9781 tggtgaaacc ccatctctac taaaatataa aattagctgg gcatggtggt gcgcacctac 9841 aatcccagct actcgggagg ctgaggcagg agaatagctt gaaccgaacc tgggaggtgg 9901 aggttgcagt gagctgagat tgggccactg cactccagcc ttccagcctg ggcgacagtg 9961 cgagattcta tctcaaaaga aaaaaaaaaa aaggtcttga agaagcctgg ttccctttct 10021 tcctcagaga tttagcgagt cttggagccc tagaggaagt tctttcccag gtctaacttc 10081 agtgtggcat gctctttgta taattagctc tctctgaact ctctaaaatt ctggcctcac 10141 ccccagaaag tcactgggct ggtgtccctg gccctgtttc tcctcatccc ctcccctcta 10201 gcaagaacag ccagaatcca aacagcaaga ttgtgctgct ggacgtgccg gtccgggcag 10261 aggcccaagt ggagctgcga gggtgagagg ccaggggtgg agaagggaga tggcattcag 10321 ggctctaaac tccagggggc gctggggaaa cctcacaggc caatcagggc atcacactct 10381 ctctgggggt cttgggcacc tgcaggaact cctttccagc ctccctggtg gtggcagcag 10441 aagaaggtga gagggagcag aacagcttgg acagctgggg acccaaagtg gagcacacct 10501 atgaggtatt ggggagcctc gcgtccctgg ctggggtgag cgggtcctca gaactccggg 10561 tgaggcgcta agctccccac accctgccac caccacccct tcagctccac aacaatggcc 10621 ctgggactgt gaatggtctt cacctcagca tccaccttcc gggacagtcc cagccctccg 10681 acctgctcta catcctggat atacagcccc aggggggcct tcagtgcttc ccacagcctc 10741 ctgtcaaccc tctcaaggta agagctgggt ggaagaaaga cctgggaagg cggccccaga 10801 ccaaccaccg gggcacctct gtgggctggg gttcggggga gacctgggcc tgaccactcc 10861 tttgcccccc caggtggact gggggctgcc catccccagc ccctccccca ttcacccggc 10921 ccatcacaag cgggatcgca gacagatctt cctgccagag cccgagcagc cctcgaggct 10981 tcaggatcca gttctcgtag tgagcaggct ctctggtctc gggcccggcc tccccgggac 11041 ccacggggca gaggggatgg gaggagggag aggggtccgg gtgtgctgtg ggcctctgtg 11101 ggccacgctt ggtccctggg agcacttcaa gtgaacatgg aggagcatgc tggcttgtgt 11161 ctggggtgag ctgaaagaca cttgcacttt ttaaaagctt cccagtacgt taaggagcat 11221 aaaacaatgc caaagcaagg ttatcataga tctgagcatt gtgcgctggg ggatgaccct 11281 ccctgcatct ctgggactat gtgagcaagc ccgtggaaag acagcatccg aagcttggat 11341 ccaaggccct tcctgatggg aaggccaccg cttcctgaac ccccggcccc ttctgcgttg 11401 ggtcctgggg gtaagggggt gggggatgat ggggtgatgg gccgggacgg ctggggactg 11461 acgatgcttc ccctcagagc tgcgactcgg cgccctgtac tgtggtgcag tgtgacctgc 11521 aggagatggc gcgcgggcag cgggccatgg tcacggtgct ggccttcctg tggctgccca 11581 gcctctacca ggtggggtgg gccgtggtgg ggcggggccg ggccttctgg gccgggacca 11641 ctttgctctg ggaggggcgg ggtttggtgt gggagggcag gaagagaggg aaggcaaggt 11701 ttactttggg ggattgcagt gggattaggt cagaggcagg gcttccccgc cgggtgtggg 11761 acctggactc cgtgcaacca ataggcctct tgtgggtgta aacggctttc aaccccaacc 11821 tgtccagagg cctctggatc agtttgtgct gcagtcgcac gcatggttca acgtgtcctc 11881 cctcccctat gcggtggccc cgctcagcct gccccgaggg gaagctcagg tgagtgtggg 11941 gggatggagc agagaccagt cctgcaggac ccattgtccc ccagtcagtg cccagccaga 12001 aaagtctgag gggtggtacg ggtgggtggc atggctggag gtcaccagcc tgaggtttga 12061 gtctttgtga aaggcaggtg tcaaggtgac tgaggagaca cgtgggtttg ccccaggtgt 12121 ggacacagct gctccgggcc ttggaggaga gggccattcc aatctggtgg gtgctggtgg 12181 gtgtgctggg tggcctgctg ctgctcacca tcctggtcct ggccatgtgg aaggtgaggt 12241 gtgaaggacg gtggagtccc cagcggggca caggcttggc tctgccctgc ctcacaggga 12301 gtcaaggaga gatggtggcc cacccaagtg ggtaatccag ggaccagggg tctatgtctc 12361 cactattaga atgtcattct cgtccagggg ggtggctcac acctgtaatc ccagcacttt 12421 ggcaggcaaa gcgtttagat cacctgaggt caagagttcg agaccagcct ggccaacatg 12481 gtgaaacccc atctctacta aaaataccaa attagccggg cgtgttgaca catgcctgta 12541 atctcagcta ctcgggaggc tgaggcagta gaattgcatg aacccaggag gcggaggttg 12601 cagtgagccg agatcacacc actgcactcc agcttgggca acagagcgag cctccatctc 12661 aaaaaaaaaa caaaaaaata gaatgtcttt ctctagtaga gcaaaaggca aaacaaacac 12721 aaaaatgtca ttctcctggg aacccttcca gacacatacc actggaaagg atagcacctg 12781 aaattctgag gcctttagac acccctgcca ccaaaaagat tcagaggata tagagggtat 12841 agagggtgta agtcctgcct tcaggaattc ctggctggtc tcaaggacaa gatgcacttc 12901 ttcctagccc tgcccttccc cttgagtgag gaagaggcca aggattggtc tagaccctat 12961 tccatacctt cctatgtggc cctggagggt cactcgctcc tctgcacctg gaggagtctc 13021 aagcacactg aagggaagac atggtgcttt tagggaaaac cacgcactag acccacaata 13081 atcaaataca tatcatcata tgctcgagtc atgcagacac aaacttcagt ataagaaaaa 13141 ttccaggctg ggcgttggtg gctcacaccg gtaaaatccc agcactttgg gaggccgagg 13201 tggg // LOCUS HUMGPIIB3 364 bp ds-DNA PRI 01-JUN-1990 DEFINITION Human platelet Glycoprotein IIb (GPIIb) gene, exon 30. ACCESSION M34344 KEYWORDS platelet Glycoprotein IIb. SEGMENT 3 of 3 SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 364) AUTHORS Heidenreich,R., Eisman,R., Surrey,S., Delgrosso,K., Bennett,J.S., Schwartz,E. and Poncz,M. TITLE Organization of the gene for platelet glycoprotein IIb JOURNAL Biochemistry 29, 1232-1244 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 110 169 platelet Glycoprotein IIb (GPIIb), exon 30 pre-msg < 1 > 364 GPIIb mRNA and introns IVS < 1 109 GPIIb intron CC BASE COUNT 68 a 111 c 98 g 87 t ORIGIN About 600 bp after segment 2. 1 ttctgcgctg gtccagggag gtgctcatat gctagcatac ttcctcacat gtgctctggg 61 gccagcaaat catctgtata ccctgacctt ggcccccgtg tacccccagg tcggcttctt 121 caagcggaac cggccacccc tggaagaaga tgatgaagag ggggagtgat ggtgcagcct 181 acactattct agcaggaggg ttgggcgtgc tacctgcacc gccccttctc caacaagttg 241 cctccaagct ttgggttgga gctgttccat tgggtcctct tggtgtcgtt tccctcccaa 301 cagagctggg ctaccccccc tcctgctgcc taataaagag actgagccct gatgctgagc 361 atgc // LOCUS HUMIMPH 2858 bp ss-mRNA PRI 01-JUN-1990 DEFINITION Human IMP dehydrogenase type 1 mRNA complete cds. ACCESSION J05272 KEYWORDS IMP dehydrogenase. SOURCE Human spleen, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2858) AUTHORS Natsumeda,Y., Ohno,S., Kawasaki,H., Konno,Y., Weber,G. and Suzuki,K. TITLE Two distinct cDNAs for human IMP dehydrogenase JOURNAL J. Biol. Chem. 265, 5292-5295 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.Natsumeda, 15-FEB-1990. FEATURES from to/span description pept 601 2145 IMP dehydrogenase type 1 (EC 1.1.1.205) signal 2843 2848 poly-A signal BASE COUNT 545 a 923 c 860 g 530 t ORIGIN 1 tcggaagggg ccaggagaca ctggaaggtc cggacggcag ggaaggggac ggggttcttt 61 ccagtcccac ccgtgtaggg acacctctcc ccctcatccc ccgatgtacc ctcgctgaat 121 ctgggatggg agagacgaac cgagtctagg catctgcgta gcagcgccgg ggagagcggg 181 gagcccaggc ggagcccagt cgactcccgg attcccctgc cccgcccccg gcacgaggcc 241 ccgccccggc gccccgcccc tcctcgggac tcgaccgggc tgcgctcact gcccagccgg 301 ggccccggga gcctccaggc tcgcccgccc tgagctgcgg cctccgcatg gagggccact 361 cactccacca ccgctgcagg gaggcggacg gcgctgttcc ggagcccgga gcccggcaac 421 acccgggaca cgagacggcg gcgcagggct acagcgcccg actgctgcag gccggctacg 481 agcccgagag ccctagattg gacctcgcta cacacccgac gacaccccgt tcagaactat 541 cttcagtggt cttactggca ggtgttggtg tccagatgga tcgccttcgc agggctagcc 601 atggcggact acctgatcag cggcggcacc ggctacgtgc ccgaggatgg gctcaccgcg 661 cagcagctct tcgccagcgc cgacgacctc acctacaacg acttcctgat tctcccagga 721 ttcatagact tcatagctga tgaggtggac ctgacctcag ccctgacccg gaagatcacg 781 ctgaagacgc cactcatctc ctcccccatg gacactgtga cagaggctga catggccatt 841 gccatggctc tgatgggagg tattgggttc attcaccaca actgcacccc agagttccag 901 gccaatgaag tacgcaaggt caagaacttt gaacagggct tcatcacgga ccctgtggtg 961 ctgagcccct cgcacactgt gggcgatgtg ctggaggcca agatgcggca tggcttctct 1021 ggcatcccca tcactgagac gggcaccatg ggcagcaagc tggtgggcat cgtcacctcc 1081 cgagacatcg actttcttgc tgagaaggac cacaccaccc tcctcagtga ggtgatgacg 1141 ccaaggattg aactggtggt ggctccagca ggtgtgacgt tgaaagaggc aaatgagatc 1201 ctgcagcgta gcaagaaagg gaagctgcct atcgtcaatg attgcgatga gctggtggcc 1261 atcatcgccc gcaccgacct gaagaagaat cgagactacc ctctggcctc caaggattcc 1321 cagaagcagc tgctctgtgg ggcagctgtg ggcacccgtg aggatgacaa ataccgtctg 1381 gacctgctga cccaggcggg ggtcgacgtc atagtcttcc actcgtccca agggaattcg 1441 gtgtatcaga tcgccatggt gcattacatc aaacagaagt acccccacct ccaggtgatt 1501 ggggggaacg tggtgacagc agcccaggcc aagaacctga ttgatgctgg tgtggacggg 1561 ctgcgcgtgg gcatgggctg cggctccatc tgcatcaccc aggaagtgat ggcctgtggt 1621 cggccccagg gcactgctgt gtacaaggtg gctgagtatg cccggcgctt tggtgtgccc 1681 atcatagccg atggcggcat ccagaccgtg ggacacgtgg tcaaggccct ggcccttgga 1741 gcctccacag tgatgatggg ctccctgctg gccgccacta cggaggcccc tggcgagtac 1801 ttcttctcag acggggtgcg gctcaagaag taccggggca tgggctcact ggatcccatg 1861 gagaagagca gcagcagcca gaaacgatac ttcagcgagg gggataaagt gaagatcgca 1921 cagggtgtct cgggctccat ccaggacaaa ggatccattc agaagttcgt gccctacctc 1981 atagcaggca tccaacacgg ctgccaggat atcggggccc gcagcctgtc tgtccttcgg 2041 tccatgatgt actcaggaga gctcaagttt gagaagcgga ccatgtcgcc ccagattgag 2101 ggtggtgtcc atggcctgca ctcttacgaa aagcggctgt actgaggaca gcggtggagg 2161 ccgaggtggt ggaggggatg caccccagtg tccacttttg ggcacaggct ccctccataa 2221 ctgagtggtc cacagatttg cactacgggt tctccagctc ctttccaggc agagaggagg 2281 ggaggtcctg aggggactgc tgcccctcac tcggcatccc ctgcagagtc aggactgctc 2341 ccgggggcca ggctgccctg ggaggccccc tccgagacca gccagccagg ctctcaggac 2401 ctgcgctgcc ttaggatctt tcttgctgca gcctgctcca gcctggcccc caccccaggg 2461 gcaggcggcc cctcctggct tctcctgtag ggcacctccc tgcccctagc ctcccagcaa 2521 atggtgctct cctggccctg ctctggccct tcccgggccg tgcccctcag ccatgtggca 2581 cttctgagct cctgacctag gccaagggga ggtctctgcc cccttccccg gccctgggct 2641 acccttgggt cctgctcctc aggccgctcc cctgtccctg gccatgggta ggagactgcc 2701 ctggtcatgg ccgcctgcct gtcattcctg actcaccacc gtccccaggt gaaccattcc 2761 tcccttctcc tcagctgcag tcgaaggctt taactttgca cacttgggat cacagttgcg 2821 tcattgtgta ttaaatactt ggaataaatc aagcaggt // LOCUS MBIFTRX 1260 bp ds-DNA BCT 01-JUN-1990 DEFINITION M.thermoautotrophicum tetrahydromethanopterin formyltransferase (FTR) gene, complete cds. ACCESSION J05173 KEYWORDS tetrahydromethanopterin formyltransferase. SOURCE M.thermoautotrophicum. ORGANISM Methanobacterium thermoautotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 1260) AUTHORS DiMarco,A.A., Sment,K.A., Konisky,J. and Wolfe,R.S. TITLE The formylmethanofuran: Tetrahydromethanopterin formyltransferase from Methanobacterium thermoautotrophicum delta-H: Nucleotide sequence and functional expression of the cloned gene JOURNAL J. Biol. Chem. 265, 472-476 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 153 unknown protein (AA at 1) pept 211 1101 tetrahydromethanopterin formyltransferase BASE COUNT 371 a 260 c 305 g 324 t ORIGIN 1 ggatccttgg agggagatga gggagatgag ggtatagtaa atatttttgt tctctggtgc 61 tgtaataacc atatgtttat cacccggcgc ataaccctct tattattcct aggggccctc 121 agattcatgg ttttacagtt atttctaagc tgaattgaaa gtagaaaaat atattaaaca 181 ttctcccagt aaatcaacta ggtgatagtt atggagataa atggtgttga aatagaagac 241 acatttgcag aggcctttgg cataaaggtt tcaagggttc ttgtaaccgc aacaaaaaaa 301 cttgccaaaa tagcagcaac cgaggcaaca ggttacggta cctcagttat agggtgccct 361 gcagaggctg gcatcgactg ctatgttcca cctgaggaga cacctgacgg aagaccaggg 421 tacattataa tgatatgcaa cccatccaag aagagccttg accatgagct ccttgaaagg 481 atagggatgg gtatactgac agcaccaacc accgctgtct tcgatgcact ggatgatgag 541 gatgagaagc tcaacatagg attcaaacta aaattcttcg gtgacggcta cgagaaggaa 601 cttgaaattg atggaagaaa gatccactca atcccaataa tgtcagggga cttcctgatt 661 gaaagccagt tcggaataaa ggatggagtt gccggcggaa acttcttcat aatgggtgac 721 agtcaggcat cagcgctact ggctgcacag gcagctgttg atgccatagc tgcagttgaa 781 ggtactgtaa caccattccc tggtggagtg gttgcttcag gatcaaaggt cggctcaaac 841 aagtacaagt tcctcaacgc ctcaaccaat gagaagatgt gtgtgaccct gaaggacgaa 901 gttgaggaca cccagatacc tgagaacgtc aacggggtct acgagatagt catagatggt 961 gttgatgaag aggccgtaag ggaggccatg aaggaaggta taaaggctgc ctgcacggtt 1021 ccaggcataa taaagataag tgcaggaaac tatggcggta acctgggagc ctacaagata 1081 aaactccatg acctcttctg atttttttat ccatgccctg atcctatcag ggttgacctt 1141 tttttattct atgcagaatt tttccatcat ttcgctttca actatttccc atatctcgtc 1201 tcgcggaggt taacaccgag agtatacccc ttttcctgag gtcatcttca attttcatta // LOCUS PFAROP1 3061 bp ds-DNA INV 01-JUN-1990 DEFINITION P.falciparum rhoptry associated protein 1 (rop1) gene, complete cds. ACCESSION M32853 KEYWORDS rhoptry associated protein 1. SOURCE P.falciparum (strain K1) blood stage DNA, clones lambda rop[1-3]. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 3061) AUTHORS Ridley,R.G., Takacs,B., Lahm,H.-W., Delves,C.J., Goman,M., Certa,V., Matile,H., Woollett,G.R. and Scaife,J.G. TITLE Characterisation and sequence of a protective antigen from Plasmodium falciparum JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by R.G.Ridley, 15-MAR-1990. FEATURES from to/span description pept 628 2976 rhoptry associated protein 1 (rop1) precursor sigp 628 695 rhoptry associated protein 1 (rop1) signal peptide matp 696 2973 rhoptry associated protein 1 (rop1) BASE COUNT 1266 a 335 c 357 g 1103 t ORIGIN 1 gtatacacac atacaaatat gtaaaaatag acatttcaaa atttttaata taaaataaca 61 tatatatata taaataaata gtaaaattaa ttttttttat ttcatcattt tatatataca 121 tatatatata tatatatata tatatattta tttatttatt tatatacata tataagttat 181 ttatttattt atttcctttt tggttgtatt ttcgtttttt tgattctatt tatatattta 241 tagtaatatt tataattata attattttat ttaatttttt atctttttca ttttatatat 301 taataacttt cttttaaata ttatgatatt aatttttatt ttttagtatt tatataagat 361 tttaaaatag acaaaataaa tataaataaa ttatatatta tatatatata tatatatttt 421 ttttttttta tagttatgca catatatatg tattaatata tattatcttt aaactattag 481 aatatatact ttattattct aaatattatt aatttgtaat actcacatat atatatatat 541 atatatatat atttatttat ttatttattt atttatttat ttatatagta aaaaatcaat 601 taattttttt tttttttttt atatataatg agtttctatt tgggtagctt agtaataata 661 ttccatgtac tcttccgtaa tgtcgctgat ggtataaatg taaacggaga taataattat 721 gggaaaacaa taatcaataa tgatttcaat tttgatgatt acaattattg gacaccaata 781 aataaaaagg aatttttaaa ttcctatgaa gatgaatttt caagtgaatc ctttttagaa 841 aataaatcta gtgttgatga tggaaatata aatttaacag atacaagtac atcaaataaa 901 agttctaaaa aaggacatgg tagaagtaga gtaagatcag catcagctgc tgcaattctt 961 gaagaagatg attcaaaaga tgatatggaa tttaaagctt ctccttcagt tgttaaaaca 1021 tctactccat caggtacaca gacatctggt ttaaaatcat ctagtccatc tagtacaaag 1081 tcatcaagtc catcaaatgt aaaatcagct agtccacatg gtgaatctaa ttcttctgaa 1141 gaaagtacta ctaaatcctc aaagagaagt gcttcggttg caggtattgt aggtgccgac 1201 gaagaagcac ctcctgcacc aaaaaacacc ctcactccat tagaagaatt atatcctact 1261 aatgttaatt tatttaacta taaatattca ttaaacaata tggaagaaaa tatcaatata 1321 cttaaaaacg aaggagattt agttgcacaa aaagaagaat ttgaatatga tgaaaatatg 1381 gaaaaagcta aacaagacaa aaaaaaagca cttgagaaaa taggaaaaca atcagacgaa 1441 gaacctttta tgttttcaga aaataaattt cttgaaaatc aagtaaaaga aagaaatgtt 1501 gctggatcct tttctcgatt tttcagtaaa ttaaatcctt ttaagaaaga tgaagtaata 1561 gaaaaaactg aagtatcaaa gaaaacattt tcaggtatag gttttaatct tactgacaaa 1621 gaagctaaag tattaggtgt aggtgcaacc tatcaagaat atccagaaac catgttatat 1681 aactgtccaa acaattctaa tttgtttgat actatagaat cattacaagg aagaataatt 1741 gatattaaaa aaagagaaag catgatatca acaactttcg aacaacaaaa agaatgttta 1801 aaaaatatgg gtgtacttga tcttgaatta aacgatacac aatgtaaatt tggtacatgt 1861 ataggtagct ttggagaaca tcatcttaga ttatacgaat ttgagaatga cttatttaaa 1921 tttcatccaa atattgatta tttaacttta gctgatggat ataaattaca aaaaaatcat 1981 atatatgaat tatcccatgt aaacttttgc ttattaaatc ctaaaacatt agaagaattt 2041 ttaaaaaaaa aagaaatcaa ggatcttatg ggtggtgatg atcttataaa atataaagaa 2101 aattttgata actttatgag tatatctata acatgccata ttgaatcttt aatatatgat 2161 gatattgaag catctcaaga tattgctgct gtattaaaaa ttgctaaaag taaattacat 2221 gtaataacat caggtttatc atataaagca agaaaattag tatataaaat ttatagtgaa 2281 attcaaaaaa atccagatga actctatgaa aaattaacat ggatttatga taatatctat 2341 atgattaaaa gatattatac tgcatatgct ttagaaggtg tctgttcata tcttgaacat 2401 gataaaagtc aaatgtatac agaattacat atttataaca aaatagtcga ctctgttcgt 2461 tattatagtt catgctttaa aaacgttatt gtttataatg ctatcatttc tggtatacat 2521 gaaaaaataa aacatttctt aaaattagta ccaagacaca actttctttt ggattatcac 2581 tttaattcaa tttttgaaaa agaaattaaa ccagccaaaa aatatagtac ttcacatatt 2641 tattttgatc caactgttgc atcatatgct tattataatt tagatagaag aaccatggtt 2701 actattatta atgattattt cgaagcaaaa aaaaaagaat taaccgttat agtatctcgt 2761 atgaaaacag atatgctcag tcttcaaaat gaagaatcaa aaataccaaa tgacaaaagt 2821 gcaaattcaa aactagctac aagattaatg aaaaaattta aagctgaaat cagagatttc 2881 ttcaaagaaa tgcgtataca atatgctaaa ttaataaaca tacgttacag atctcactta 2941 aagaaaaact actttgcctt caagagatta gattaagaat ataaacttga aaaatatata 3001 taatgtaaaa atatactaaa tatattaata taatatatat atatatatat atatatatat 3061 a // LOCUS STYFLICD 350 bp ds-DNA BCT 01-JUN-1990 DEFINITION S.typhimurium phase-1 flagellin C and D (fliC, fliD) genes, partial cds. ACCESSION M33541 KEYWORDS phase-1 flagellin. SOURCE S.typhimurium DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 350) AUTHORS Inoue,Y.H., Kutsukake,K., Iino,T. and Yamaguchi,S. TITLE Sequence analysis of operator mutants of the phase-1 flagellin- encoding gene, fliC, in Salmonella typhimurium JOURNAL Gene 85, 221-226 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 313 > 350 phase-1 flagellin C pept 56 < 1 (c) phase-1 flagellin D BASE COUNT 120 a 63 c 78 g 89 t ORIGIN 1 aacaactggt ctaacggtaa gtttgatccc acacctaatg atgaaattga agccatgcct 61 tcttcctttt tgattgcaaa cagtagttaa gcgcgttatc ggcaatctgg aggcaaagtt 121 taatgataat tttgcaaaaa taatgcgcgg aataatgatg cataaagcgg ctatttcgcc 181 gcctaagaaa aagatcgggg gaagtgaaaa attttctaaa gttcgaaatt caggtgccga 241 tacaagggtt acggtgagaa accgtgggca acagcccaat aacatcaagt tgtaattgat 301 aaggaaaaga tcatggcaca agtcattaat acaaacagcc tgtcgctgtt // LOCUS SYNCRPBS1 88 bp ds-DNA SYN 01-JUN-1990 DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA. ACCESSION M33414 KEYWORDS Artificial promoter; cyclic AMP activated promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 88) AUTHORS Aiba,H., Hanamura,A. and Tobe,T. TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of Escherichia coli JOURNAL Gene 85, 91-97 (1989) STANDARD simple staff_entry BASE COUNT 24 a 22 c 18 g 24 t ORIGIN 1 aaatgtgatc tagatcacat ttcaagcttg actgactgag atcagcttgc aacctttccg 61 gtcgcggcga taaagacatc ttcaccgt // LOCUS SYNCRPBS2 88 bp ds-DNA SYN 01-JUN-1990 DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA. ACCESSION M33415 KEYWORDS Artificial promoter; cyclic AMP activated promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 88) AUTHORS Aiba,H., Hanamura,A. and Tobe,T. TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of Escherichia coli JOURNAL Gene 85, 91-97 (1989) STANDARD simple staff_entry BASE COUNT 25 a 24 c 18 g 21 t ORIGIN 1 aaatgtgatc tagatcacat ttggaattcc cggggatccg ggccctcttc aaacaggccc 61 agttcgccaa taaaatcacc tgattcag // LOCUS SYNCRPBS3 56 bp ds-DNA SYN 01-JUN-1990 DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA. ACCESSION M33416 KEYWORDS Artificial promoter; cyclic AMP activated promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 56) AUTHORS Aiba,H., Hanamura,A. and Tobe,T. TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of Escherichia coli JOURNAL Gene 85, 91-97 (1989) STANDARD simple staff_entry BASE COUNT 17 a 10 c 14 g 15 t ORIGIN 1 aaatgtgatc tagatcacat ttggagtgcc tgaatgggcg tacagttatg aaaccc // LOCUS SYNCRPBS4 56 bp ds-DNA SYN 01-JUN-1990 DEFINITION Synthetic E.coli cyclic AMP activated promoter DNA. ACCESSION M33417 KEYWORDS Artificial promoter; cyclic AMP activated promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 56) AUTHORS Aiba,H., Hanamura,A. and Tobe,T. TITLE Semisynthetic promoters activated by cyclic AMP receptor protein of Escherichia coli JOURNAL Gene 85, 91-97 (1989) STANDARD simple staff_entry BASE COUNT 20 a 10 c 9 g 17 t ORIGIN 1 aaatgtgatc tagatcacat ttggaattcc aaatgtgatc tacagttatg aaaccc // LOCUS TRFCRE1 3940 bp ds-DNA INV 01-JUN-1990 DEFINITION C.fasciculata retrotransposable element I (CRE 1) gene complete cds. ACCESSION M33009 KEYWORDS integrase; retrotransposable element I; reverse transcriptase. SOURCE C.fasciculata. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 3940) AUTHORS Gabriel,A., Yen,T.J., Schwartz,D.C., Smith,C.L., Boeke,J.D., Sollner-Webb,B. and Cleveland,D.W. TITLE Rapidly rearranging retrotransposon within the miniexon gene locus of Crithidia fasciculata JOURNAL Mol. Cell. Biol. 10, 615-624 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Gabriel, 21-MAR-1990. FEATURES from to/span description ORF < 416 3837 ORF (AA at 2) rpt 1 415 mini-exon gene repeat site 386 415 target site duplication site 415 415 mini-exon splice junction rpt 3899 3940 mini-exon gene repeat site 3899 3927 target site duplication BASE COUNT 890 a 1032 c 1330 g 688 t ORIGIN 1 aagcttccgg aaacaaccgg cacaaatttt gaggcggaag cgctgctttt ttttgtgtcc 61 gggggggtgc tccttggggt ccccctgtcc agccccagcc ggtcgcccac cacataggaa 121 tttgcgaagg acccccaaaa atcccggtcc ccggggcgag ttgtcccaac tttttcaaac 181 ctcatgaaga gctagttgcg tcattgaaaa gttcgtgtgc agaaaccccc tcccccacgt 241 ttgtacaatg gaagagttta cgatacaggt tttctcacgg ttttgaggtg ttttttcgaa 301 aaacaaaaaa tatagaggtg tatagcgctt atttttgaca cccccctcaa aacatgctgg 361 gggtataggt ccttccaact aacgctatat aagtatcagt ttctgtactt tattgaacgg 421 cattcggtct agtgggtcca ctcccaacct tctcctcctt ggtctcgggt tcgattccgg 481 tcgggcacga aactctcttc ctctctaaac tccacatata catccacaat aaccactctc 541 ataactgttc tggtgcgggg ccatttcgaa ccatattgac atttggcgca ccaaactcac 601 acatactaac caaaaatggc agagccaccc ggggcgacgg tccgagcgtg gcggagcacc 661 tcaagcgccg gcgggtcgaa ggatctgagc cggtcgtggt ggggtctcgc caggagggtc 721 tctccggtga gtcagcagtc gaagcgattg ttgtggaaag tggcagcgag gctgacgaag 781 aatccacggc gagggggctg ggagcctctc gacaagcgcc aggtgatgac tcgggcgggc 841 cggtgcccag ggcggaggag gacctcgccc cggttgggta cctgtatccc gagaacctgg 901 ccgtcccaca gggggcggcg tgtccggtgg tgggttgtgg gtaccgaccg aacacccggg 961 tgggaccgag actggtggaa cacctgaaca cggtgcaccg ggatatcctg ggcaacattc 1021 ctgtcgacgc ctggcgacgt cagggactcg tccgctgcct acggtgtggc tcggccttga 1081 ccgcgtccgg tcatggtcgg ggtgcacacg gtgggaagtg tggtccgtac agaagcagaa 1141 acgcggccat tagggctcgg acgcagagtt tctttggtgg aatttcgcag aattctgaca 1201 cacttacggc gggaaacggg gtctcagaag ctggtggtct agtggaagtg ggtagggtag 1261 aagacccaac cacagacccg tggtacaggg ttcgaacccc cttgaagcga caaatttacc 1321 gtacggactt ccgggtgtgg cagggcttgg cccggccagt actcctgggg tactctctgg 1381 cggatacagc agggaaggag ggacgccttc tcgccctgct caatctcccc agggaccatc 1441 tggaggtcca ggtaaatgct aagaggggaa tccaaccaca gccggctgaa atcgaggccc 1501 aggttcgaag gaaggtagtg gagttggcag ggataggagc ggtaggacga gcgatggcag 1561 tgatgactcg ggggcggctg gtcgaggtgc ccctggagcg ggtgatggag cagctcgagg 1621 agctccaccc gcaggaggac ccgcggggtt atccggcagc accagatacg agcgaggtgc 1681 tgcgggccaa ggagcagaag gtgcggcggg cgatcgcggc acggatgggg aggggtacgg 1741 cgcccggcct cgatggctgg acgcgggagc tcctcctccc cctcgcagaa gacccggccc 1801 tgctacacga gatcacgtcg gtggtctcgg atatcatgca ggggaaggtg gccgaggtgg 1861 tggcgcggag gctgcggagc agcgccgtca ccccgattcc gaaggacgag gcggggacga 1921 agatacgccc gatcgtgccg gagtcggcct ggctgaagct ggcctcgctg gtggcgatgg 1981 cggagatacc atccagcttc aaggagacct tcaaggggtg gcagtacggg gtctgggggg 2041 acgtcgccaa agcagtggcg aagatccgcc gggacagcga ggagcacgag tacctggtgg 2101 cactcgacgg ggtcaatgca tacaatacga tgagcagggc ccacatcctc caagccgtgt 2161 acgccgagca gcgcctgaag ccgatctggg gggtggtgaa ggtggcgctt ggggggccgg 2221 ggttcctggg agtatacagg gacggctgcc tcaagggcaa cctgtggtcc accaagggaa 2281 tccggcaggg catggtgctg ggccccctcc tgtacgcgac cgggatggca gcggccatcg 2341 ggccggtacg gcagcgtatc cccggggtcc ccgtgacggc ctacatcgac gacatcaccc 2401 tcgcggccag cggggcggag ggagccaggg cagccgaggc atacgcagac gccctcgaga 2461 cggtcggggt ggtcaccaac gccaggaagt cgatggtggt ggggccagaa ggcacccggg 2521 tggggatcgg gggtgtagac ctgccggtgg tggccgaggc ccggatcctg ggggcccact 2581 tccgagcaag ggggacacca gaggcccgta ccatcgagtg gctgcaggcg gccgtcgaga 2641 agtggcgacc catccaccag aagctgcggc aggacatcat cccgaagaac attgcgatga 2701 tgatgacccg catcagcctg gggtccaaga tgaccttcct cctccagacc cactcgccac 2761 aggaactgga gaccgcagcg aagacggcag acgacgaggt cgagcagacc ctccagcacc 2821 ttatggggca ggtagagatc acgccccgag caaggctgct ggcacaactc ccgatcagag 2881 agggggggct gggtctccgg cgaagcagcg agattgcgaa gttcgcacag gcagacgtgg 2941 ggcaaggcga ggcccaccag gcacacacaa aggcactaga tgaagggatc aagcaccagt 3001 tacaaccact cctctcggag tccgaggtgc agatcctgaa gtcgaacgct ggaatggggg 3061 ccgggcgggt gctgacagat agtagcctga ggatcccaga cgtggcagca acaatcgcgc 3121 tgagggagag actcctgctc agggtgctcc cggagggatg cagtgtgtgt gtggggggga 3181 cgcgacgaac taccatgtac acacgtgctc caacataccc accaagcccc ggacccgacg 3241 acacgacggg gtggtggatg agctggtggc cctggccagg aagatggggt acgagcccag 3301 caaggagccg agggcggacg ttgacgagta gggcgaggcc ggacctgtac atcacaggaa 3361 gcctgaagcc ggcggcgacg gatgtaacaa taacctaccc gggcaggcag gcgaggggag 3421 cacactcccg ttgcagcagg cctaccggaa taagatgggg gcctgggagg catgggggaa 3481 cctgcgaggg gtggacatgc agccggtggt cctcgggacg aacgcggaga tacacccgga 3541 gagtgcgaat ggatacgaag gttgacctcg gtcgaagaca aagacaaaat acataccagt 3601 tacaacgagg tgacgggacg aatcgtggag acggtgttgg ttgggaacgt ggagctgttc 3661 aacgcagtga cgaacctggc gctggtcagg gagttgatgt aggaaggtac gggtgggtag 3721 tagtagaata tcgaacggcg actaggaagt tggaccattg ttacttgtta cttgtgattc 3781 tgacgacgac gaaagattgt tccttgttac ttgttaccct gaaatttgat attacttgaa 3841 ttgatatgat atgattgata caaaaaaatt taaaaaaaaa aaaaaaaaaa aaaaaaaaat 3901 ataagtatca gtttctgtac tttattggta taagaagctt // LOCUS SQDKINEA 3693 bp ss-mRNA INV 01-JUN-1990 DEFINITION Squid kinesin heavy chain mRNA, complete cds. ACCESSION J05258 KEYWORDS kinesin; kinesin heavy chain. SOURCE Squid, cDNA to mRNA. ORGANISM Loligo pealii Eukaryota; Animalia; Eumetazoa; Mollusca; Cephalopoda; Coleoidea; Teuthoidea; Myopsida; Loliginidae. REFERENCE 1 (bases 1 to 3693) AUTHORS Kosik,K.S., Orecchio,L.D., Schnapp,B., Inouye,H. and Neve,R.L. TITLE The primary structure and analysis of the squid kinesin heavy chain JOURNAL J. Biol. Chem. 265, 3278-3283 (1990) STANDARD simple staff_review FEATURES from to/span description pept 421 3324 kinesin heavy chain BASE COUNT 1220 a 684 c 822 g 967 t ORIGIN 1 aaaagaaatc tttctaggcg gccataagtc gctgttctac aaataatacc gagatcgtca 61 taacaaaccg cctatttaac acttccgaac aacttttata gaatatttac gactgttttg 121 aatagtatat ttagctgtca ggctgcgatt tgcttccttt tatttattta tttatttatt 181 tatatttttt gtagtagaaa tttacatttt ttttttttgc gggggagacg tcaggatatt 241 attacctact tttgtttgtg cgcgtgtgtt gaagaaacga aagacgttca agctttattg 301 gattagagaa ttgataaaga tcggattaac tgtataataa cgctaaactt taatttattt 361 gggacctagg aaagtcccat ataaacccaa gaatttatat cgtcctcaaa caacgccgcc 421 atggacgtgg cttccgagtg taacattaaa gtcatctgcc gagttagacc ccttaacgaa 481 gccgaggaac gggccggcag caagttcatc cttaaattcc caactgatga ttccatctct 541 atcgctggga aagtgttcgt atttgataag gttttaaaac ctaatgtctc acaagaatat 601 gtgtacaatg ttggtgctaa gcccattgtt gctgatgtac ttagtggttg caatggaaca 661 atttttgctt atggacaaac atccagtggg aaaactcata ctatggaggg tgtgctggac 721 aaaccctcca tgcacggcat cattccacgc attgtacaag acatcttcaa ctacatttat 781 ggaatggacg aaaacttgga gtttcatatc aagatttcat attatgaaat ttacttggac 841 aaaatcaggg atcttttgga tgttaccaag accaacttgg ctgtccatga agacaaaaac 901 agagtcccgt ttgtcaaggg tgccactgaa cgatttgtat ccagtcctga agaagtaatg 961 gaagttattg atgagggtaa aaacaacaga catgtagctg tgacaaacat gaatgaacac 1021 agttcgagaa gtcatagtgt atttttaatc aatgtcaagc aagaaaatgt tgaaacccag 1081 aaaaaattaa gtggtaaact ttacctagta gatcttgctg gtagtgaaaa agtaagcaaa 1141 actggtgcag aaggtgcagt ccttgatgaa gctaaaaaca tcaataaatc tttgtcagct 1201 ctgggaaatg tcatttcagc tcttgctgat ggaaataaat ctcacgtccc atacagagac 1261 agtaaattga cgagaatctt gcaggaaagt ttgggtggta atgcccgaac cactatggtc 1321 atatgttgtt cacctgcctc ttacaatgaa tccgagacca agtctacttt gctttttggc 1381 caaagagcta agactattaa gaatgtggta tcggttaatg aggagcttac agctgatgaa 1441 tggaaacgaa gatatgagaa agaaaaagaa agggttacta agcttaaagc cactatggct 1501 aagcttgagg cagaactaca gagatggaga actgggcaag ctgtatctgt agaagaacaa 1561 gttgatctta aggaagatgt ccctgctgaa tctcctgcca ctagtacaac tagcctggct 1621 ggaggactca tcgcttcaat gaatgagggt gaccgcaccc aactggaaga agagcgtctc 1681 aagctttatc aacagttgga tgacaaggat gatgaaatta acaatcagtc tcagttaatt 1741 gaaaaattga aggagcagat gatggaacaa gaagatttaa ttgcccaatc tcgtcgtgat 1801 tatgaaaatc tacaacaaga tatgtctcga atccaagctg acaatgagtc tgccaaggat 1861 gaagtcaaag aggtcttgca agctcttgag gaactggcca tgaactatga tcagaagtct 1921 caggaggtgg aagacaaaaa taaggagaat gagaatctaa gtgaggaact taaccagaaa 1981 ttgagcacct tgaatagctt acagaatgaa ttggatcagc tgaaggattc atctatgcac 2041 cataggaaac gagttactga catgatgatt aacttgttga aagatcttgg tgatattggt 2101 actattgtag gtggaaatgc tgcagaaact aagccaactg ctggttctgg tgagaaaatt 2161 gaggaggaat tcactgtagc tcgtctgtat attagcaaaa tgaaatctga ggtgaagacg 2221 ttggtgagcc gaaacaacca actggaaaat actcagcagg acaattttaa gaaaattgaa 2281 acacatgaaa aggatttgtc caactgcaaa ttactcattc aacaacatga agccaaaatg 2341 gctagtttac aggaagccat taaagattct gagaacaaga agaggatgtt ggaagacaat 2401 gtggattctc taaatgagga atatgccaaa cttaaagctc aagaacaaat gcatttagct 2461 gctctgtccg aacgggagaa ggagaccagc caagcaagtg aaactcgaga agttcttgag 2521 aaacagatgg aaatgcatcg ggaacaacac caaaaacaac ttcaatcttt acgggatgaa 2581 atttcagaaa agcaggctac tgtggataat cttaaagatg ataaccagag actttctttg 2641 gccctggaga aacttcaggc tgattatgac aaactcaaac aagaggaagt tgaaaaggct 2701 gccaagttgg ctgacttatc attgcagatt gaccgtcggg agcaagccaa acaggatctc 2761 aaaggacttg aagagactgt ggcaaaggag ctgcaaacac tacacaatct tagaaaactt 2821 tttgtacaag atctacaaaa taaggtcaaa aagtcatgca gcaagaccga agaagaagat 2881 gaagatactg gtggtaatgc tgcccagaaa caaaagattt ctttcttaga aaataatctt 2941 gagcagctga ctaaggttca caagcagctt gttcgagaca atgctgactt gcgctgtgag 3001 ttgcccaagc tggagaaaag gttacgtgct actatggagc gagtcaagtc cttggaatct 3061 gctctcaagg atgccaagga aggcgctatg agagacagaa aaaggtacca gcatgaggtg 3121 gatagaatca aggaagctgt ccgccagaag aatctcgctc gccgaggaca tgctgcacaa 3181 atagccaaac caatccggcc tggacaacac cagtctgttt ctcctgcgca ggcagcagct 3241 attcgcggtg gtggtggttt gtcccagaat ggaccaatga ttacatctac gccaattcga 3301 atggcacctg aaagcaaggc ataaatatct agttggaaga ggaattttta ctcccattca 3361 tcaaagaaca acagaactac aaagctttct agtgcattca ttgataattg aaatggagga 3421 tatgtatctc ctcctcctct actattatat ctcctcctcc tacatgaagc tgcattccgt 3481 tttgttagcg tgtaccctgt gtctcgctgc tacagtggag tgtatgtggg ctcttgcata 3541 ccactgcata gtggcagtta ggatggctgg ttacttagct atttaactgc ctttctacct 3601 gcctgcttcc cacctgcctt agtgatggca agtgagtgac tgctctggtg gcaatccaac 3661 ttttgtacaa ttattaaaaa aaaaaaaaaa aaa //
GenBank-Updates@genbank.bio.net (06/02/90)
LOCUS RATMRF4AA 1271 bp ss-mRNA ROD 02-JUN-1990 DEFINITION Rat muscle regulatory factor mRNA MRF4, complete cds. ACCESSION M27151 KEYWORDS differentiation protein; muscle regulatory factor. SOURCE Rat cDNA to mRNA, clone lambda-cRS4-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1271) AUTHORS Rhodes,S.J. and Konieczny,S.F. TITLE Identification of MRF4: A new member of the muscle regulatory factor gene family JOURNAL Genes Dev. 3, 2050-2061 (1989) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by S.F.Konieczny 16-AUG-1989, for release after publication. FEATURES from to/span description pept 45 773 muscle regulatory factor MRF4 BASE COUNT 347 a 292 c 303 g 329 t ORIGIN 1 gtccatcacc cagttcagat cagtcagagg ccaaggagga gaacatgatg atggaccttt 61 ttgaaactgg ctcctatttc ttctacttag atggagaaaa tgtgactctt cagccattag 121 aagtggcaga gggctctcct ttgtacccag ggagtgatgg taccctatcc ccttgccagg 181 accaaatgcc ccaggaagcc gggagcgaca gcagtggaga ggaacacgtt ctggctcccc 241 caggccttca gccaccccac tgcccaggtc agtgtctgat ctgggcttgc aagacttgca 301 agagaaaatc tgcccccaca gatcgtcgga aagcagctac cctgcgcgaa aggaggaggc 361 ttaagaaaat caacgaagcc tttgaggcct tgaagcgtag aactgtggcc aaccccaacc 421 agaggctgcc caaggtggag attctgagaa gtgccatcaa ctacattgag cgtctgcagg 481 acctgctgca ccggctggat cagcaagaga aaatgcagga gctgggggtg gacccttaca 541 gctacaaacc caagcaagaa attcttgagg gtgcggattt cctgcgcacc tgcagcccgc 601 agtggccaag tgtttcggat cattccaggg gcctggtgat aactgctaag gaaggaggag 661 caagcgtcga tgcttcagcc tccagcagtc ttcagcgcct ttcttccatc gtggacagta 721 tttcctcaga ggaacgcaaa ctccccagcg tggaggaggt ggtggagaag taactcagtc 781 agcatttgga acattcttcg ctcagcagga agagcccctt tccgcctaat catttagatt 841 agggctcaca gaccccagaa tttatgaaag gcaagagact tagtgttaaa aaagaaacct 901 ctccccacct caagtgaaaa tccttcggct tggggctttt attataacta ttattgtatc 961 tgaaccgcta gtggcttagc tctagaaccc taattttgtt tttagtttgg ttggtttttt 1021 ataacatatt aacttttgct atgatcacgt gaccctttcc tgtcagttgc aaacgaagtt 1081 cattcctgtc taaatcaaag tgggaacgtt taatcctaag agtatttaat gtacttttgt 1141 aaatagtctt agtactttcg tttttatgta aacctaaagg acatatttta aatgtggaat 1201 taagcattgt atataaaatg tgtgagagtc tgatattgta atattaaaat attaatacgt 1261 ttctacacgt a //
GenBank-Updates@genbank.bio.net (06/05/90)
LOCUS HUMBREAKZ 680 bp ds-DNA PRI 05-JUN-1990 DEFINITION Human chromosome 10q24 breakpoint region. ACCESSION M33603 M31943 KEYWORDS Chromosomal translocation. SOURCE Human lung fibroblast DNA, clones G10-1 and G10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 680) AUTHORS Zutter,M., Hockett,R.D., Roberts,C.W.M., McGuire,E.A., Bloomstone,J., Morton,C.C., Deaven,L.L., Crist,W.M., Carroll,A.J. and Korsmeyer,S.J. TITLE The t(10;14)(q24;q11) of T-cell acute lymphoblastic leukemia juxtaposes the delta T-cell receptor with TCL3, a conserved and activated locus at 10q24 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3161-3165 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.M.Zutter, 07-FEB-1990. FEATURES from to/span description site 256 257 breakpoint BASE COUNT 119 a 215 c 196 g 150 t ORIGIN Chromosome 10q24. 1 ggatccacta gccgggtcag gcagagagaa ggaggccata gattcaggac caccttctgc 61 agctactctt ggcctgggaa tcttagagaa aggggtgcaa cgtgaacagt ttcttccgtc 121 ctggctgctg ttctaggacc caggaagcga gccccaggat cagacacaag aaaagaacag 181 gttcccccac ctccagtccc ctacacacat gcacttcgcg cccctagctc cggtgcccgt 241 ggtgctgcgg ggccagtggg ggctctgagc cccggtaaat cagcagaacc agtggccttt 301 tttttttttt ccggagagca ggcttcatct gggaggatgg cagcggggct tagccgccca 361 ggacagctag cttttcattt tcgttctctc ccggctgggt gtcctacccc gggcccagcc 421 ctgttatctt gggcacgaac aatgcaccgg taggctggtg atcggtggcg ggaagactac 481 tcccgggaca ttgggtacta agtggtggcc cgtgccgcct tctctgctgc cccgtcccgc 541 cttcctctgc tggccctggc tccctctgct ccgtttccac tctgggcact caactctccc 601 tctggcgtgg atcccccggg ctgcaggaat tcgatatcaa gcttatcgat accgtcgacc 661 tcgagggggg gcccggatcc // LOCUS HUMTCDXB 60 bp ds-DNA PRI 05-JUN-1990 DEFINITION Human T-cell receptor rearranged delta-chain gene V-region (V2-N-J1). ACCESSION M34439 M31943 KEYWORDS T-cell receptor; delta chain; rearranged gene; variable region. SOURCE Human (T-cell lymphoblastic leukemia patient 1143) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 60) AUTHORS Zutter,M., Hockett,R.D., Roberts,C.W.M., McGuire,E.A., Bloomstone,J., Morton,C.C., Deaven,L.L., Crist,W.M., Carroll,A.J. and Korsmeyer,S.J. TITLE The t(10;14)(q24;q11) of T-cell acute lymphoblastic leukemia juxtaposes the delta T-cell receptor with TCL3, a conserved and activated locus at 10q24 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3161-3165 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.M.Zutter, 07-FEB-1990. FEATURES from to/span description pept < 1 > 60 T-cell receptor delta-chain V2-N-J1-region (AA at 3) /hgml_locus_uid="LP0165Q" /nomgen="TCRD" /map="14q11.2" recomb 29 30 V2-region end/N-region start recomb 35 36 N-region end/J1-region start BASE COUNT 19 a 15 c 11 g 15 t ORIGIN 1 aagggtctta ctactgtgcc tgtgacacca aacgtacacc gataaactca tctttggaaa // LOCUS HUMTRANSX 50 bp ds-DNA PRI 05-JUN-1990 DEFINITION Human T-cell leukemia t(10:14)(q24:q11) chromosomal translocation. ACCESSION M33602 M31943 KEYWORDS T-cell receptor; chromosomal translocation; oncogene. SOURCE Human (T-cell lymphoblastic leukemia patient 1143) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 50) AUTHORS Zutter,M., Hockett,R.D., Roberts,C.W.M., McGuire,E.A., Bloomstone,J., Morton,C.C., Deaven,L.L., Crist,W.M., Carroll,A.J. and Korsmeyer,S.J. TITLE The t(10;14)(q24;q11) of T-cell acute lymphoblastic leukemia juxtaposes the delta T-cell receptor with TCL3, a conserved and activated locus at 10q24 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3161-3165 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.M.Zutter, 07-FEB-1990. FEATURES from to/span description ORF < 1 > 50 TCL3 oncogene (put.) recomb 20 21 10q24 DNA end/N DNA start recomb 27 28 N DNA end/14q11 DNA start BASE COUNT 12 a 10 c 19 g 9 t ORIGIN 1 cggtggtgct gcggggccag ggagggaacc gataaactca tctttggaaa // LOCUS MUSTCPB 2369 bp ds-DNA ROD 05-JUN-1990 DEFINITION Mouse t complex protein-10 (Tcp-10b) gene, complete cds. ACCESSION M22597 J03141 KEYWORDS t complex responder. SOURCE Mouse (haplotype tw5) testes germ cell DNA, clone T66B-a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 519) AUTHORS Schimenti,J., Cebra-Thomas,J.A., Decker,C.L., Islam,S.D., Pilder,S.H. and Silver,L.M. TITLE A candidate gene family for the mouse t complex responder (Tcr) locus responsible for haploid effects on sperm function JOURNAL Cell 55, 71-78 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 2369) AUTHORS Schimenti,J., Cebra-Thomas,J.A., Decker,C.L., Islam,S.D., Pilder,S.H. and Silver,L.M. JOURNAL Unpublished (1989) Lewis Thomas Lab, Princeton, NJ 08544-1014 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by L.M.Silver, 07-JUN-1989. The sequence is interrupted by introns that are not indicated in [1] or [2]. FEATURES from to/span description pept 421 1737 t complex protein-10 BASE COUNT 707 a 565 c 594 g 493 t 10 others ORIGIN Chromosome 17. 1 tgcagaaaac catggggccc gggccatatg aaggtgcagt gggttatgaa taagctctgc 61 tataggtgcc tttgactctc agggtgaaat accttgtctc cagtcagatc ccctaccgct 121 gtggctaaac ctctcattct tgtttctata cagcctaggt tatggagttg taggttgggg 181 acaaggggtt tgtcctgatg agaggcagcg aaaccttcca cagcaaaggt ctgacgaaga 241 actaagacct cccaggcgcc ctctcaaggc cctgttcacc tcaaggtaat gctggcagta 301 ctcatgtgat caggcaccat gctgactctc tgaaccaaag tgttttccac cctcctctgg 361 ggacacagcc cttaatgctt actggtcacc tcgaggctaa atagtcagca gagacaagcc 421 atgttggagg atctgagtca aggaaaaggg tccaaccatg agaagaggaa gatggagagc 481 acagcccaga tcactgagga agacagcaag cttgatgagg ttgtggggct gcagaagcag 541 atctgtgacc ttgggacaga gctcacaaga caatcatctt ggtggtgcgt agctcacaaa 601 gacctccaaa gccagatcga tgctctgata aaggagaacc aggagatccg tgcggagctg 661 aagaccttga agaagcagga tgcggaggcc accaaagcct gtataggctc gcccaccccg 721 gcaagagcaa gcaacactct gccagtgtac ataaagatag agggaattga ttccgagaag 781 acaacctcat gggatgaaag agatgagctt tctggaagtc ctccaaacag aagcacaatg 841 gccaccggag gaacagactc ccaggatgaa aggctgtctt ttacatctgt ggatgaaaag 901 gttatacaca tgtcttccaa atttctgcaa agaagcttcg gcagaatgtc accagaacca 961 ctgtctgaca gcacattcct ggacacagag tcactggctg acatctggtc ctcaaatcca 1021 gagacttcgg acagtgaact tctcctgcat gctcaagcaa gcagggtcat tccttgtttt 1081 tccccaaatg cactgtgggt gcagaatatt ccaacaaagt caagagctcc taaagaaata 1141 cagcaaacct cagacactac aaagactgat gagacaaagg aaaagcgaca cccaaacggc 1201 aaggtggagc ggatgctcag tgacgggcga accatcatca ccttccccaa tggaaccagg 1261 aaggagatca gtgctgacaa gaagaccacc ctcatcaggt tttttaacgg tgacatgaag 1321 aagatcaagt ccgatcagaa agtgatttat tattatgcgg acacacaaac aatgcacaca 1381 acctacccag atggtgttga agtggtgcag tttcctaaca agtggactga aaaattctac 1441 ccggatggct ccaaggaaac cgtgtttcct gatgggacag tgaaacagct taaggatgga 1501 tgtgaagaga cggtgttccc cgatgggaca tttgtgacag tgaagaggaa cggagacaaa 1561 accatcatgt tcagcaacgg agagaaagaa atccacacgg ccaggttcaa gnggatggaa 1621 ttcccagatg gnaccaccaa gactgtgtat tgcaatggct gccaggagac caagtatgcc 1681 tcagggaggg tcagggtcaa agatgagaag ggaactgtca tcctggactg gaagtagtgc 1741 atccaacaga ctcaaggcca ggcagtaggc tccaatctcc aaatatagtt ttgaccaaaa 1801 cagacaaagc gactcacgat tcccttgaca atcctggaga tcccctgcac atgaggagta 1861 aagagacatc caaaaattgt aaagaaagag accatgtagc caccaccatc ttgggagaga 1921 ggtcagagaa gcagtagcca gaaatgaggt cctggacatg gcttccgtgg ccttgtncnn 1981 accatgcctt ctgtcggagt ctggccctag agcagagggt acaatgggag caacgtgacc 2041 actgccagct agttcctggg agcagccgtt aacaacagct ggcncngtgt ttttgcttgg 2101 ccaacacctg cctgcatcta cttctatccc cttctagagt cctcatcaat gggatccagg 2161 gggatgtatc ttcactggtg cctgatgtat gaatagactt gatttaacca tcctgaagtg 2221 tctagacaga acctcatgtg gcattctatc ccataaattc acacaatttt gatntgtcaa 2281 tcaacaagga gttgagnaaa agcacagata aaaacaaaga cagaaacata gtgngccaca 2341 cctcctacct gttcctgtga gtctgccct //
GenBank-Updates@genbank.bio.net (06/06/90)
LOCUS HUMARC1 1753 bp ds-DNA PRI 06-JUN-1990 DEFINITION Human androgen receptor gene, exon 1. ACCESSION M27423 KEYWORDS androgen receptor; dihydrotestosterone receptor; testosterone receptor. SEGMENT 1 of 8 SOURCE Human (ATCC library XLAOXNLO1) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1753) AUTHORS Lubahn,D.B., Brown,T.R., Simental,J.A., Higgs,H.N., Migeon,C.J., Wilson,E.M. and French,F.S. TITLE Sequence of the intron/exon junctions of the coding region of the human androgen receptor gene and identification of a point mutation in a family with complete androgen insensitivity JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9534-9538 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1) AUTHORS Lubahn,D.B. JOURNAL Unpublished (1990) lubahnd%uncvx1.bitnet STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by D.B. Lubahn, 29-AUG-1989, [2] 04-JUN-1990. FEATURES from to/span description pept 79 + 1691 androgen receptor, exon 1 /nomgen="AR" /map="Xq12" /hgml_locus_uid="LL0058U" IVS 1692 > 1753 AR intron A revision 1 1 a in [2]; c in [1] BASE COUNT 344 a 546 c 573 g 290 t ORIGIN 1 agcctgttga actcttctga gcaagagaag gggaggcggg gtaagggaag taggtggaag 61 attcagccaa gctcaaggat ggaagtgcag ttagggctgg gaagggtcta ccctcggccg 121 ccgtccaaga cctaccgagg agctttccag aatctgttcc agagcgtgcg cgaagtgatc 181 cagaacccgg gccccaggca cccagaggcc gcgagcgcag cacctcccgg cgccagtttg 241 ctgctgctgc agcagcagca gcagcagcag cagcagcagc agcagcagca gcagcagcag 301 cagcagcagc aagagactag ccccaggcag cagcagcagc agcagggtga ggatggttct 361 ccccaagccc atcgtagagg ccccacaggc tacctggtcc tggatgagga acagcaacct 421 tcacagccgc agtcggccct ggagtgccac cccgagagag gttgcgtccc agagcctgga 481 gccgccgtgg ccgccagcaa ggggctgccg cagcagctgc cagcacctcc ggacgaggat 541 gactcagctg ccccatccac gttgtccctg ctgggcccca ctttccccgg cttaagcagc 601 tgctccgctg accttaaaga catcctgagc gaggccagca ccatgcaact ccttcagcaa 661 cagcagcagg aagcagtatc cgaaggcagc agcagcggga gagcgaggga ggcctcgggg 721 gctcccactt cctccaagga caattactta gggggcactt cgaccatttc tgacaacgcc 781 aaggagttgt gtaaggcagt gtcggtgtcc atgggcctgg gtgtggaggc gttggagcat 841 ctgagtccag gggaacagct tcggggggat tgcatgtacg ccccactttt gggagttcca 901 cccgctgtgc gtcccactcc ttgtgcccca ttggccgaat gcaaaggttc tctgctagac 961 gacagcgcag gcaagagcac tgaagatact gctgagtatt cccctttcaa gggaggttac 1021 accaaagggc tagaaggcga gagcctaggc tgctctggca gcgctgcagc agggagctcc 1081 gggacacttg aactgccgtc taccctgtct ctctacaagt ccggagcact ggacgaggca 1141 gctgcgtacc agagtcgcga ctactacaac tttccactgg ctctggccgg accgccgccc 1201 cctccgccgc ctccccatcc ccacgctcgc atcaagctgg agaacccgct ggactacggc 1261 agcgcctggg cggctgcggc ggcgcagtgc cgctatgggg acctggcgag cctgcatggc 1321 gcgggtgcag cgggacccgg ttctgggtca ccctcagccg ccgcttcctc atcctggcac 1381 actctcttca cagccgaaga aggccagttg tatggaccgt gtggtggtgg tgggggtggt 1441 ggcggcggcg gcggcggcgg cggcggcggc ggcggcggcg gcggcggcgg cggcgaggcg 1501 ggagctgtag ccccctacgg ctacactcgg ccccctcagg ggctggcggg ccaggaaagc 1561 gacttcaccg cacctgatgt gtggtaccct ggcggcatgg tgagcagagt gccctatccc 1621 agtcccactt gtgtcaaaag cgaaatgggc ccctggatgg atagctactc cggaccttac 1681 ggggacatgc ggtaagtttt tccttccaga aatgtcgcct ttcggcccag ggcagagtca 1741 ctctgtgttc tgg // LOCUS ONOMICACTA 2374 bp ds-DNA INV 06-JUN-1990 DEFINITION O.nova micronuclear MIC-ACT-1 gene, complete cds. ACCESSION M25531 KEYWORDS actin. SOURCE O.nova micronuclear DNA. ORGANISM Oxytricha nova Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; Spirotricha; Hypotrichida; Sporadotrichina; Oxytrichidae. REFERENCE 1 (bases 1 to 2374) AUTHORS Greslin,A.F., Prescott,D.M., Oka,Y., Loukin,S.H. and Chappell,J.C. TITLE Reordering of nine exons is necessary to form a functional actin gene in Oxytricha nova JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6264-6268 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by A.F.Greslin, 13-JUN-1989. FEATURES from to/span description mRNA 2028 2220 actin mRNA 1956 2016 actin mRNA 427 918 actin mRNA 947 1393 actin mRNA 1666 1703 actin mRNA 1405 1555 actin mRNA 1723 1836 actin mRNA 2245 2270 actin mRNA 1848 1939 actin mRNA site 427 918 actin, exon 1 site 947 1393 actin, exon 2 site 1405 1555 actin, exon 3 site 1666 1703 actin, exon 4 site 1723 1836 actin, exon 5 site 1848 1939 actin, exon 6 site 1956 2016 actin, exon 7 site 2028 2220 actin, exon 8 site 2245 2270 actin, exon 9 BASE COUNT 745 a 531 c 450 g 648 t ORIGIN 1 ttaagcagta aaaccataat agccagtact tactagtcac tgatgaaaat gatagtatgc 61 acaagaagcc agctaaaaag aagatatatg gattgaatat ccactatgca attggaagaa 121 tctcaccaca aggccagagg aatgcaaacc tttttggaag aacaatatcc gaagagtaag 181 agtccaagtc ctatttcaca gtaaaaattt aaatacctat gaaaagagtc ctcgaatttc 241 aaggatataa tctaatatga accacttata taccttaaaa attatataca aattatattt 301 ataaataata taaaaaaaaa ttacaatagt aagatgatgt taaaatttaa tgataaattg 361 ttctatcaag catatatttt ccatttcaaa aatgagcatt caatcttcct cttagctatt 421 aagacatcgg agtcgtcaag gctggtttct ccggcgaaga tgctcccaga gctgtgttcc 481 cttccatcat cggaagaccc aagaacgtca gcgctttgat cggagttgac tccgcttctg 541 agtacatcgg agacgaggcc caacaaaaga gaggagtcct caagatcttc taccccatcg 601 agcacggtat catcaaggat tgggaagaca tggagaagat ctggaaccac accttctacg 661 ttgaactcag agtccagccc gatgagcacc ccgtcctcct cactgaggct cccctcaacc 721 ccaagaccaa cagagaaaag atgacccaga tcatgttcga gaccttcaat gttcccgctc 781 tctacgtcgc catccaggcc gtcctctccc tctactccgc tggtagaacc accggtatcg 841 tttgcgatgc tggtgatggt gtcacccaca ccgtccccat ctatgagggt ttctccatcc 901 cccacgccgt ctcaagaatc tcattttatt actcatttaa tatggagtaa tccagcttgc 961 cggtagagac ttgaccactt tcatggccaa gctcctgact gagaagggct acgtgttcac 1021 ttcctccgct gagatggaaa tcgtcagaga catcaaggaa aagctttgct tcgtcgcttt 1081 ggactacgaa gctgccatga agcagtccta cgagagcacc accttcgaga agaactacga 1141 gcttcccgac ggtagagtca ttaccattgg taacgctaga ttcagatgcc ccgagtatct 1201 cttcaagccc ctcgagatga acggaaagga actcgactcc atccaatcac tcacctacaa 1261 ctccatccag gagtgcgatg tcgatgtcag aagagacctc taccaaaaca tcatcctctc 1321 cggaggtacc accatgtacg agggtattgg tgagagactc ctcaaggaaa ttgaggccag 1381 agctcccaag tccataatag aatatagcca agccccgaca gaagattcgc cgtctggaga 1441 ggaggttcca ccctcacttc tctgtccact ttcgccagca tgtggatcac caaggaagac 1501 tacgacgaga acggagcaag catcgtccac agaaagtgcc tctgagccca aaactctatc 1561 cctatattta tacagctgtt cagattacaa tataaagttt catcatttac tagctcattt 1621 tattgaaaca attaaatatc tctctcctat tggaaaataa atggttttct cccaagtcca 1681 tcaacgtgaa ggttattgcc agccccataa tagtatttgg gaagacaaaa ctctagaaag 1741 atattgttta atttatgaaa cgctaataca aggggtgcag gatgaggatg tccgtagcct 1801 ctgcttcctt ttaactctga tgactcgcct ttgggttgaa tattatagtt aggttgaatg 1861 agaacttagg atatgtgttc cctttgactg catctcacta atttattaag catttcccct 1921 attttatttt atttgactcc ttttaataaa attgttagct tgacgactcc ggaaccatta 1981 tcaacaacca cagtttgttt gtctgccatg tgtagtaagc ttataaaaca tattgtgcta 2041 attgattctg aggaaatata ataaatgaaa gatagaggat ttgcgggggc gaagtagtga 2101 gattgtgata ttaattatat tagagtgggt gggaaagaga agcaagttta atgctacttt 2161 aatagatata gcgccaaaga tacagccttt aataatatta aatctttcta ctacttacta 2221 cacattgaag ccataattcc cttttggttg aggccaggtt gaatgaatat ttttatcatc 2281 acaagcaaat tttaaattaa acttacacca aaatatataa attgataact taaagtctat 2341 aaatcagaat tgagcagtca tctgacttgt gatc // LOCUS ONOMICACTB 2301 bp ds-DNA INV 06-JUN-1990 DEFINITION O.nova micronuclear MIC-ACT-2 gene, complete cds. ACCESSION M25530 KEYWORDS actin. SOURCE O.nova micronuclear DNA. ORGANISM Oxytricha nova Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; Spirotricha; Hypotrichida; Sporadotrichina; Oxytrichidae. REFERENCE 1 (bases 1 to 2301) AUTHORS Greslin,A.F., Prescott,D.M., Oka,Y., Loukin,S.H. and Chappell,J.C. TITLE Reordering of nine exons is necessary to form a functional actin gene in Oxytricha nova JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6264-6268 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by A.F.Greslin, 13-JUN-1989. FEATURES from to/span description mRNA 2030 2225 actin mRNA 1959 2019 actin mRNA 429 920 actin mRNA 949 1395 actin mRNA 1669 1706 actin mRNA 1407 1558 actin mRNA 1726 1839 actin mRNA 2241 2266 actin mRNA 1851 1942 actin mRNA site 429 920 actin, exon 1 site 949 1395 actin, exon 2 site 1407 1558 actin, exon 3 site 1669 1706 actin, exon 4 site 1726 1839 actin, exon 5 site 1851 1942 actin, exon 6 site 1959 2019 actin, exon 7 site 2030 2225 actin, exon 8 site 2241 2266 actin, exon 9 BASE COUNT 704 a 525 c 445 g 627 t ORIGIN 1 ttaagcagta aaaccataat agccagtact tactagtcac tgatgaaaat gatagtatgc 61 acaagaagcc agctaaaaag aagatatctg gattgaatat ccactatgca attggaagaa 121 tctcaccaca aggccagagg aatgcaaacc tttttggaag aacaatatcc gaagagtaag 181 agtcccaagt cctatttcag agtaaaaatt taaataccta tgaaaagagt cctcgaattt 241 caaggatata atctaatatg aaccacttat ataccttaaa aattatatac aaattatatt 301 tataaataat atataataac tacattagta agatgatgtt aaaatttaat gataattgtt 361 ctatcaagca tatattttcc atttcaaaaa tgagcattca atcttcctct tagatatgaa 421 ggcataggag tcgtcaaggc tggtttctcc ggcgaagatg ctcccagagc tgtgttccct 481 tccatcatcg gaagacccaa gaacgtcagc gctttgatcg gagttgactc cgcttctgag 541 tacatcggag acgaggccca acaaaagaga ggagtcctca agatcttcta ccccatcgag 601 cacggtatca tcaaggattg ggaagacatg gagaagatct ggaaccacac cttctacgtt 661 gaactcagag tccagcccga tgagcacccc gtcctcctca ctgaggctcc cctcaacccc 721 aagaccaaca gagaaaagat gacccagatc atgttcgaga ccttcaatgt tcccgctctc 781 tacgtcgcca tccaggccgt cctctccctc tactccgctg gtagaaccac cggtatcgtt 841 tgcgatgctg gtgatggtgt cacccacacc gtccccatct atgagggttt ctccatcccc 901 cacgccgtct caagaatctc attttattac tcacttaata tggagtaatc cagcttgccg 961 gtagagactt gaccactttc atggccaagc tcctgactga gaagggctac gtgttcactt 1021 cctccgctga gatggaaatc gtcagagaca tcaaggaaaa gctttgcttc gtcgctttgg 1081 actacgaagc tgccatgaag cagtcctacg agagcaccac cttcgagaag aactacgagc 1141 ttcccgacgg tagagtcatc accattggta acgctagatt cagatgcccc gagtatctct 1201 tcaagcccct cgagatgaac ggaaaggaac tcgactccat ccaatcactc acctacaact 1261 ccatccagga gtgcgatgtc gatgtcagaa gagacctcta ccaaaacatc atcctctccg 1321 gaggtaccac catgtacgag ggtatcggtg agagactcct caaggaaatt gaggccagag 1381 ctcccaagtc cataatagaa tatagccagc cccgacagaa gattcgccgt ctggagagga 1441 ggttccaccc tcacttctct gtccactttc gccagcatgt ggatcaccaa ggaagactac 1501 gacgagaacg gagcaagcat cgtccacaga aagtgcctct gagcccaaaa ctctatccct 1561 atatttatac agctgttctt attacaatat aaagtttcat catttactag ctcattttat 1621 tgaaacaatt aaatatctct ctcctattgg aaaataaatg gttttctccc aagtccatca 1681 acgtgaaggt tattgccagc cccataatag tatttgggaa gacaaaactc tagaaagata 1741 ttgtttagtt tatgaaacgc taatacaagg ggtgcaggat gaggatgtcc gtagcctctg 1801 cttcctttta actctgatga ctcgcctttg ggttgaatat tatagttagg ttgaatgaga 1861 acttaggata tgtgttccct ttgactgcat ctcactaatt tattaagcat ttcccctatt 1921 ttattttatt tgactccttt taataaaatt gttagcttga cgactccgga accattatca 1981 acaaccacag tttgtttgtc tgccatgtgt agtaagctta taaacattat gtgcaaattg 2041 attctgagga aatagaataa tgaaagatag aggatttgcg ggggcgaagt agtgagattg 2101 tgatattaat tatattagag tgggtgggaa agagaagcaa gtttaatgct actttaatag 2161 atatagcgcc aaagatacaa cctttaataa tattaaatct ttctactact tactacacat 2221 tgaagccaat ttcatattca ttccctttgg gttgaggcca ggttgaatga ctattttttt 2281 catcactaag caaattttaa a // LOCUS BSUTRPS 1450 bp ds-DNA BCT 06-JUN-1990 DEFINITION B.subtilis trpS gene encoding tryptophanyl-tRNA synthetase, complete cds. ACCESSION M24068 KEYWORDS tryptophanyl-tRNA synthetase. SOURCE B.subtilis (strain QB928) DNA, clone pTSQ2. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1450) AUTHORS Chow,K.-C. and Wong,J.T.-F. TITLE Cloning and nucleotide sequence of the structural gene coding for Bacillus subtilis tryptophanyl-tRNA synthetase JOURNAL Gene 73, 537-543 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by K.-C.Chow 05-MAY-1989. FEATURES from to/span description pept 171 1163 tryptophanyl-tRNA synthetase (EC 6.1.1.2) BASE COUNT 433 a 306 c 331 g 380 t ORIGIN 1 ctgaccgaac tggaaaaaag taggctcagc gagcagtctc cgttacaaac gtcagagtga 61 ttccatttta atggaataat cagggtggta ccacggttca ttcgtccctt ttttacaggg 121 gaagaatgag ccttttttat tatgttttaa gaaatgaggt tgatgttttc atgaaacaaa 181 cgattttttc aggcattcag ccaagcggct cagtgacgct cggcaactat atcggtgcaa 241 tgaagcagtt tgtcgaactg cagcatgatt ataacagcta tttttgcatc gtcgatcagc 301 atgcgataac tgttcctcaa gaccggcttg agcttagaaa gaatatccgc aatctcgcgg 361 cgctttactt agctgtcgga cttgatccag aaaaagcaac attgtttatt cagtcagagg 421 tccccgcaca tgcgcaggcc ggatggatga tgcagtgtgt cgcctatatc ggcgagcttg 481 agcggatgac tcaatttaag gacaaatcca aaggcaatga agctgtcgtc tccggcctgt 541 taacatatcc gccgctgatg gccgctgata ttctgctgta cggaacggat cttgtacctg 601 tcggcgagga tcaaaagcag caccttgagc tgacgcggaa tcttgcagaa cgcttcaaca 661 aaaaatacaa cgacatcttt acgattccgg aagtgaaaat tccaaaagtc ggtgcacgta 721 tcatgtctct gaatgatccg ctgaagaaaa tgagcaaatc tgatccgaat cagaaagctt 781 atattacatt gctggatgag ccgaagcagc ttgaaaagaa aatcaaaagc gcagtaacgg 841 attctgaagg cattgtcaaa tttgataagg aaaacaaacc gggcgtttcc aaccttctta 901 caatttattc aatcctcggc aatacgacaa ttgaagagct tgaagcaaag tacgaaggaa 961 aaggctacgg cgagtttaaa ggtgatttgg cagaagtcgt agtgaacgca ttaaaaccga 1021 tccaggaccg ctattacgag ctgatagaat ctgaagaatt agaccggatt cttgatgaag 1081 gcgcggaacg agcgaatcgg acagcaaaca aaatgctgaa aaaaatggag aatgccatgg 1141 gtcttggaag aaaaagacgc taatcaaaaa accgctcttt gcaaagagcg gtttttttca 1201 gttgaccttt gattcgtttt ccatttccca aagcttttcg aaaaaaggct ggcctgttat 1261 taggttttcg aaaaagctag gtgtttgtcc gaccagcctt caagtgtgtc gttatacagc 1321 atgtcccaaa tctcttcaaa gctcatcctt ttgatttgtt catacatatg cggattccac 1381 tctgtgtacc agtagccaat ttccgctcgt ttttcagccc ttttaattgg tcaagcgaca 1441 taaacatcag // LOCUS ERGTUBB 2475 bp ds-DNA PLN 06-JUN-1990 DEFINITION E.graminis beta-tubulin gene, complete cds. ACCESSION X51326 M30927 KEYWORDS beta-tubulin. SOURCE E.graminis hordei (race CR3) DNA. ORGANISM Erysiphe graminis Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Erysiphales; Erysiphaceae. REFERENCE 1 (bases 1 to 2475) AUTHORS Sherwood,J.E. and Somerville,S.C. TITLE Sequence of the Erysiphe graminis f. sp. hordei gene encoding beta-tubulin JOURNAL Nucleic Acids Res. 18, 1052-1052 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.C.Somerville, 20-DEC-1989. FEATURES from to/span description pept 518 529 beta-tubulin, exon 1 581 604 beta-tubulin, exon 2 653 677 beta-tubulin, exon 3 729 771 beta-tubulin, exon 4 833 887 beta-tubulin, exon 5 934 1724 beta-tubulin, exon 6 1772 2162 beta-tubulin, exon 7 IVS 530 580 beta-tubulin intron A IVS 605 652 beta-tubulin intron B IVS 678 728 beta-tubulin intron C IVS 772 832 beta-tubulin intron D IVS 888 933 beta-tubulin intron E IVS 1725 1771 beta-tubulin intron F BASE COUNT 664 a 569 c 537 g 703 t 2 others ORIGIN 3 bp upstream of SmaI site. 1 cccggggcaa atcacactct gcctctctag cctcctnccc gaaggtcgtg ctgaaatttc 61 tggaaacagc gtaattgctg tatggtagct tagcccaact ttttttgtgc cgtccagggc 121 tctagggagt gctgttnagc tagtgagaat agaagtcttc cgagatattt gaaagcctac 181 catagctctg aaggcattgt ggcaggacct agaggatcgt aagagatatg actgacaagt 241 gttgagtcct tgcgtcctaa ttttgtgtat tattaccgtt gtgagacatc acggcgacgc 301 agaccgatct gcacttttca gtgccaagat ggtatgtaag cagtcccgtg atcggagcta 361 gcgcagcaac ggtttagtgt gagccaggtc caccgtcaac ccgcaattac tttctcgctg 421 cgccaaatct ccaatttcta catcccaact aacctccgga aacgcaccta cactataccc 481 cctatcatcc tctaccgcct atctttcatc agccaatatg cgtgaaattg ttagttaaat 541 tccatcacgg caactcttga aatcgctaat gataatttag gtccatttac agaccggtca 601 atgcgtacgt taatatttag tttgccttgt aactctacta atagagtttt agggaaacca 661 aatcggagcc gcattctgtt cgtagaatct caagcttcta gctcaactcg ctcacaccca 721 ctctacaggg caaacaattt ctggtgagca tggacttgat ggttcaggag tgtaagttcc 781 ctcgcataat taggtatccg cattcatcaa cgaattataa tgactccatt agctacaatg 841 gtacatcaga tttacaactt gagaggatga acgtatactt taacgaggtc tggtgaagct 901 ctcaaaagag aagttatcgc taaccctctt aaggcttctg gtaacaaata tgttcctcgt 961 gctgttctcg tcgacttgga gccaggtact atggatgctg tacgagctgg tccttttggc 1021 cagctattca gaccagacaa cttcgtcttc ggacaatctg gagccggaaa taattgggca 1081 aaaggtcact acactgaagg cgcggagctt gtcgaccaag tactagatgt agtgcgaaga 1141 gaagcagagg gatgtgactg tcttcagggg tttcaaataa cacattctct tgggggtggt 1201 acaggtgccg gtatgggtac gttattaatt tcaaaaatcc gggaagaatt ccctgatcga 1261 atgatggcaa ccttctcagt tgtgccgtcc cctaaggtgt ctgacactgt tgttgagcca 1321 tacaacgcaa ctctttcagt ccatcagttg gtcgaaaact ccgacgagac tttctgtatc 1381 gacaatgagg cactttacga catctgtatg aggacgttaa agctttctaa cccctcatat 1441 ggtgatctaa atcacttggt atccgctgta atgtcaggcg taactacttg ccttcgattc 1501 cctggccagc taaactctga tttgcgcaaa ctggcagtta acatggttcc tttcccacgt 1561 cttcactttt ttatggttgg atttgcaccg cttacaagcc gtggagcgca ctctttccgt 1621 gccgtaactg ttcctgagtt aactcagcag atgtttgatc cgaaaaatat gatggcagca 1681 tccgacttcc gaaatggtcg ttacttgaca tgctctgcca ttttgtaagt cagctcgtta 1741 tatacgcata ttctatacta acatattaca gccgcggtaa ggtatctatg aaggaagtag 1801 aagatcagat gcgaaatgtc caacaaaaga acgtatccta ctttgttgag tggattccaa 1861 ataatgttca aactgcccta tgttctatcc cgcctcgtgg cctaaaaatg tcttcgacat 1921 tcgtcggaaa ctcgacttcc atccaagagc tcttcaaacg tgtcggagat caattcacgg 1981 ccatgttccg gagaaaggca tttctgcatt ggtacactgg cgaaggaatg gacgaaatgg 2041 agtttactga ggccgagtct aatatgaacg atcttgtctc agagtatcaa caataccaag 2101 aagcttcaat ctcggagggt gaggaagagt atccagagga agtaagcaac gaagaagaat 2161 agatcaaaat tgtcctttta acacattgtt ctgtactgta tctgaagtag cggatacgat 2221 gaactgtacc ccattaatcg taccgttcct cttgtttgga aaattaaaag tcaccaacca 2281 aaatgcgata gaaagtgccg gtgtactggc ctagcacatc cttgtctggg tggatgtcga 2341 acactttatt tgctccatta gctagtctct ttccgcgcga caggatgaaa actttctaat 2401 catcaatcac atttcttttc tacgaaagtt tcaattctgt tactgtagag ttttttttgt 2461 aaaaccatct ctaga //
GenBank-Updates@genbank.bio.net (06/07/90)
LOCUS DROIAB 2131 bp ds-DNA INV 07-JUN-1990 DEFINITION D.melanogaster infra-abdominal (iab-4) 1.7 and 2.0 kb RNAs, complete cds. ACCESSION M32109 KEYWORDS . SOURCE D.melanogaster (strain Canton S) embryo DNA, and cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2131) AUTHORS Cumberledge,S., Zaratzian,A. and Sakonju,S. TITLE Characterization of two RNAs transcribed from the cis-regulatory region of the abd-A domain within the Drosophila bithorax complex JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3259-3263 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.Sakonju, 14-FEB-1990. FEATURES from to/span description RNA 141 1802 1.7 kb RNA RNA 141 2101 2.0 kb RNA BASE COUNT 592 a 468 c 478 g 593 t ORIGIN Chromosome III, bithorax complex. 1 ttagttcgaa gcacgtgcgt gtgagttccg catgaacggt acaccaacac tttcgaacag 61 aaacccctca cagatactcc tcacacagat actctactgt actgggctac gatatctgtg 121 agatacgcac agataccggc aggccgccgg cgtcaaagtc gacgtgctgc cggcgtttga 181 gagaccgagc ggcgtgcaag aaatcgtcgg gcaacggctt caaatgaacg caaaacgttg 241 gaaagcaaac aaccggcaaa agccatcagc agctaaaaat aaaactgcaa ctatcaaagt 301 gaatctgtag agaaagagtg atagctcagt gtgcccagtg tttatttaat tacttattta 361 aatttgattt tcaaaataag ttgtgatttt ggttcgtttg tttgtttccc cattaaatcg 421 catcgcggaa ctggtttgca ttttgtatct tttatcgctg cagttcgttt ctgtgtggtt 481 tcatgtattc cccatctatt ccatctcttt ttgcagatac aagtgaatga aaccctaacg 541 tttaaacgcc accataaaca ccaaataaca aattttctgg caaacatggc atgtcacggt 601 cccatttaac gtggaaaatc ggcccagaaa ttcaacacga cgacaacttt ttcgactgtc 661 ccagaaagca aatatgtaaa gtgtatttgt gtgcgtgcca ttgacattga cttatggtca 721 tgaaggggtt agggggacgg gtctaagggt gggggttgat ggctaaaggg gggtctcgcc 781 gctacgacaa aaacttggat tctctcgttt cttttttgtc tttcgtcttc aacgcacatt 841 tatggtttgt tttgctcaca cacacactca aaaagcctac gcagcggcag agagggcaca 901 agcagcagag aaagagacga aacatgtctc ggatgatgat aaacgcctct cgtctatttt 961 ggcgggtcca tgaatgaaac aattgcctcg aatgactgac tgccagacag atagctagaa 1021 actgccctta cggcgggcaa attggggtat tttgggtttc ggtgactttg agggaccggg 1081 acctgagggc ccggaaagcc gacagcaata aacactgtgc tcagacaaaa gtctgatgag 1141 acagaccatc atgatgctca tcatcatcgg cgataataat gattttcatt tgcacattcc 1201 acccgtttcg ctgttgccgg cttccagcat tcagattcag tttgcattca gtggggcagg 1261 ttatttatgg cacttttcct atcgccactc actggggtaa tcgattttta ttggattaat 1321 attgctttcc gcatgatttg gccttctttt cgttatcact tagcaggttt cttgacgctg 1381 aggagtggac aatgactaaa actcagtcat ctggaaaagc tggaccaagg acatccatat 1441 acaagtgctg gctaaacgtc tgtggttatc gcaatcattt atggcggaat agaaggacac 1501 gtcttatgtg tcgattttcg ggccccatcc ctgttgaaaa atcagccgga cagcagcaac 1561 caccgaacac aagacacagg acgaccggaa gtgaggaaaa cgcttgtaaa atcggtgcac 1621 acgagtgcga gcgagacacg agcgaacggg ccgaaaggaa acgacaatta cgaggacacg 1681 ccggcttgct tttctgcttt tcctgtctgc tcctttcagt tgcggccgtg caatcgtcgg 1741 gtcaacaacg ccgcgtgtcc gatttcaatt tacaaattag aatatacatc tttacatctg 1801 gatttggttt catctttatc tgcatttgga tccgcgacga cgcataattt tcagcacttt 1861 tgaataatgg cgattctgtg cgacctcgta aacgtatact gaatgtatcc tgagtgtatc 1921 ctatccggta taccttcagt atacgtaaca cgagagagca ctagcaacgt gggatgtgaa 1981 acacttttta agacacctta atacacgtag cactttgtta tatggtctta cactttttaa 2041 atatctcgac ttataaatat ataatttaaa tataaatact ttaataatgc gatataagtt 2101 aaatacttcg atttactggc tggaagttca t // LOCUS ECOMUT1X 1545 bp ds-DNA BCT 07-JUN-1990 DEFINITION E.coli mutT1 mutant DNA with an insertion sequence IS1. ACCESSION M29970 KEYWORDS insertion sequence IS1. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 300 to 315; 1084 to 1105) AUTHORS Bhatnagar,S.K., Bullions,L.C., Lew,G. and Bessman,M.J. TITLE Characterization of the defect in the Escherichia coli mutT1 mutator gene JOURNAL J. Bacteriol. 172, 2802-2803 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1545) AUTHORS Bhatnagar,S.K., Bullions,L.C., Lew,G. and Bessman,M.J. JOURNAL Unpublished (1989) Johns Hopkins, Baltimore, MD 21218 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.J.Bessman, 19-NOV-1989. FEATURES from to/span description trns 316 1083 IS1 site 308 315 5' insertion target sequence site 1084 1091 3' insertion target sequence BASE COUNT 379 a 366 c 423 g 377 t ORIGIN 1 gatccttgcc cgtgcggttc tggtaaaaaa tacaagcagt gccatggccg cctgcaataa 61 aagctaactg ttgaagtaaa aggcgcagga ttctgcgcct tttttatagg tttaagacaa 121 tgaaaaagct gcaaattgcg gtaggtatta ttcgcaacga gaacaatgaa atctttataa 181 cgcgtcgcgc agcagatgcg cacatggcga ataaactgga gtttcccggc ggtaaaattg 241 aaatgggtga aacgccggaa caggcggtgg tgcgtgaact tcaggaagaa gtcgggatta 301 ccccccaaca tttttggtga tgctgccaac ttactgattt agtgtatgat ggtgtttttg 361 aggtgctcca gtggcttctg tttctatcag ctgtccctcc tgttcagcta ctgacggggt 421 ggtgcgtaac ggcaaaagca ccgccggaca tcagcgctat ctctgctctc actgccgtaa 481 aacatggcaa ctgcagttca cttacaccgc ttctcaaccc ggtacgcacc agaaaatcat 541 tgatatggcc atgaatggcg ttggatgccg ggcaacagcc cgcattatgg gcgttggcct 601 caacacgatt ttacgtcact taaaaaactc aggccgcagt cggtaacctc gcgcatacag 661 ccgggcagtg acgtcatcgt ctgcgcggaa atggacgaac agtggggcta tgtcggggct 721 aaatcgcgcc agcgctggct gttttacgcg tatgacagtc tccggaagac ggttgttgcg 781 cacgtattcg gtgaacgcac tatggcgacg ctggggcgtc ttatgagcct gctgtcaccc 841 tttgacgtgg tgatatggat gacggatggc tggccgctgt atgaatcccg cctgaaggga 901 aagctgcacg taatcagcaa gcgatatacg cagcgaattg agcggcataa cctgaatctg 961 aggcagcacc tggcacggct gggacggaag tcgctgtcgt tctcaaaatc ggtggagctg 1021 catgacaaag tcatcgggca ttatctgaac ataaaacact atcaataagt tggagtcatt 1081 accacatttt tcgctatttg aaaaactgga atatgaattc ccggacaggc atataacact 1141 gtggttttgg ctggtcgaac gctgggaagg ggagccgtgg ggtaaagaag ggcaacccgg 1201 tgagtggatg tcgctggtcg gtcttaatgc cgatgatttt ccgccagcca atgaaccggt 1261 aattgcgaag cttaaacgtc tgtaggtcag ataaggcgtt ttcgccgcat ccgacattcg 1321 cacacgatgc ctgatgcgac gctggcgcgt cttatcaggc ctaaagggat ttctaactca 1381 ttgataaatt tgtttttgta ggtcggataa ggcgttcacg ccgcatccga catttgcaca 1441 agatgcctga tgcgacgctg tccgcgtctt atcaggccta cgtgcggcat cagacaaatg 1501 tcactgcttt ggttcttcgc tccagtcatc gctttcggaa agatc // LOCUS YSAEF1B 2411 bp ds-DNA PLN 07-JUN-1990 DEFINITION C.albicans elongation factor 1-alpha (TEF-1) gene, complete cds. ACCESSION M29934 KEYWORDS elongation factor. SOURCE C.albicans (strain SC5314) DNA. ORGANISM Candida albicans Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 2411) AUTHORS Sundstrom,P., Smith,D. and Sypherd,P.S. TITLE Sequence analysis and expression of the two genes for elongation factor 1-alpha from the dimorphic yeast Candida albicans JOURNAL J. Bacteriol. 172, 2036-2045 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Sundstrom, 17-NOV-1989. FEATURES from to/span description pept 624 2000 elongation factor 1-alpha variant 1487 1487 g in EcoRI 2.3 kb clone; a in HindIII 4.9 kb clone BASE COUNT 760 a 402 c 477 g 772 t ORIGIN 1 acggtaaatt agtgggaggc aaatctccca aatcaattaa acggaaatta ctcgaagctg 61 caaatctgtt tgctgatgga cagaatgatc atagtattaa tccagatgtt acaattgaag 121 aggatctgat gtctcaaata aggtatcaag acaacgaatc aatgatcaac caagaaagat 181 tattagagaa attggatgga tcagctgtgc ttgtggaaac actgatacga catgccagta 241 aacgaagtag aaaaaactaa ctgtatattc gctaacaaaa aaattgtatt atttttagac 301 atcgtaaatg gggctgtagc cctaattatt ttttatttct cgtgatctgc acgtgtactg 361 ttgatttttt tgtgcgattg aaattatatt gtgctgttat cttaataaaa tcagtcatac 421 cttttttttt ttgggttttt gttgtttaat ttgtgataaa catcccatga ggaacagcga 481 gaaagttttt tgctgttcac tttctcgttc aacttttgca aagtaaagta aagaaaaaaa 541 aattttccct cgtcatcttg atttttactt cttcttcttc ttcttctttt tcctttcttc 601 acacattata gtcataatca atcatgggta aagaaaaaac tcacgttaac gttgttgtta 661 ttggtcacgt cgattccggt aaatctacta ccaccggtca cttaatttac aagtgtggtg 721 gtatcgataa aagaaccatt gaaaaattcg aaaaagaagc tgctgaattg ggtaaaggtt 781 ctttcaaata cgcttgggtc ttggacaaat tgaaggctga aagagaaaga ggtatcacca 841 ttgatattgc tttgtggaaa ttcgaaactc caaaatacca cgttaccgtc attgatgctc 901 caggtcacag agatttcatc aagaatatga tcactggtac ttctcaagct gattgtgcta 961 ttttgattat tgctggtggt actggtgaat tcgaagccgg tatttctaag gatggtcaaa 1021 ccagagaaca cgctttgttg gcttacactt tgggtgtcaa acaattgatt gttgctgtca 1081 acaagatgga ctctgtcaaa tgggacaaaa acagatttga agaaatcatc aaggaaacct 1141 ccaacttcgt caagaaggtt ggttacaacc caaagactgt tccattcgtt ccaatctctg 1201 gttggaatgg tgacaacatg attgaaccat ccaccaactg tccatggtac aagggttggg 1261 aaaaggaaac caaatccggt aaagttactg gtaagacctt gttagaagct attgacgcta 1321 ttgaaccacc aaccagacca accgacaaac cattgagatt gccattgcaa gatgtttaca 1381 agattggtgg tattggtact gtgccagtcg gtagagttga aactggtatc atcaaagccg 1441 gtatggttgt tactttcgcc ccagctggtg ttaccactga agtcaagtcc gttgaaatgc 1501 atcacgaaca attggctgaa ggtgttccag gtgacaatgt tggtttcaac gttaagaacg 1561 tttccgttaa agaaattaga agaggtaacg tttgtggtga ctccaagaac gatccaccaa 1621 agggttgtga ctctttcaat gcccaagtca ttgttttgaa ccatccaggt caaatctctg 1681 ctggttactc tccagtcttg gattgtcaca ctgcccacat tgcttgtaaa ttcgacactt 1741 tggttgaaaa gattgacaga agaactggta agaaattgga agaaaatcca aaattcgtca 1801 aatccggtga tgctgctatc gtcaagatgg tcccaaccaa accaatgtgt gttgaagctt 1861 tcactgacta cccaccatta ggtagattcg ctgtcagaga tatgagacaa accgttgctg 1921 ttggtgtcat caaatctgtt gaaaaatccg acaaagctgg taaagttacc aaggctgctc 1981 aaaaagctgc taagaaataa gctagttgaa tattatgtaa gatctgttag agtttttatt 2041 ttgtattcat ttatttagtt tattttctta ttatatttga aatataatat atattttaaa 2101 aaaaaaattt acagtgtaga atttttggta gtccggtttg ttttgaaatc agtggtggta 2161 ttcaatattt gattaaattt tggtatgaat ttgtgttgaa aaataaaaaa taagcgagaa 2221 atttgcgtgg catattattt gtaatgttcg aatattctct cgtacaccaa aaggtttacg 2281 attattcatt gacatataat aaaatctaat tactaaagag ttttgctgga aatgcatgta 2341 tagtttatat catttctatt atctggacca tcatggaaga atagcaatta tttcttgtta 2401 tcggtagtaa a // LOCUS YSAEF2A 2369 bp ds-DNA PLN 07-JUN-1990 DEFINITION C.albicans elongation factor 1-alpha (TEF-2) gene, complete cds. ACCESSION M29935 KEYWORDS elongation factor. SOURCE C.albicans (strain SC5314) DNA. ORGANISM Candida albicans Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 2369) AUTHORS Sundstrom,P., Smith,D. and Sypherd,P.S. TITLE Sequence analysis and expression of the two genes for elongation factor 1-alpha from the dimorphic yeast Candida albicans JOURNAL J. Bacteriol. 172, 2036-2045 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Sundstrom, 17-NOV-1989. FEATURES from to/span description pept 656 2032 elongation factor 1-alpha variant 2188 2190 gta in one strand of clone 2.6 EcoRI; gtata in opposite strand. BASE COUNT 750 a 428 c 430 g 761 t ORIGIN 1 gtatcaagaa aaaattgtat agtgcttgct gttcgatatt gctagaggca aagaaaaagg 61 attacctcac gcctaaaatg gaatttgaaa attgaggaga tgagttgagc ttttgtcttt 121 tacactttcc ctattcaatt gtgccgcaaa aacttcaata tctaacacag tgtgaaatat 181 ttgttatgta taccgaaagc caaataccat ttaaactggt atttcaagat ttcataaatc 241 aagtttctgg ttaaattaat tgctgttaaa cagaacaaga gcttcacatt gatgttggaa 301 taactactat agaattataa aatacacaaa accaggtcgt gtacgcaata acttttcaat 361 gtatgaaaaa attttttttt tttttgcttc ttctcttcgc ttcttttttc tagttatcaa 421 tttggcaatg aattgatttg attgcccttt gtttagttta ctatgaaaaa aaattttttt 481 tttgtctctg tcagattttc tctcattttt ttttttgcct cataatgaac aactcaaaaa 541 agaatataaa tttctcatca aagacctttc caattgaaaa ttttttcttt ccttcttttc 601 cttttttttt ttctttcttt cttcttttct aagtatacat tttatataat caatcatggg 661 taaagaaaaa actcacgtta acgttgttgt tattggtcac gtcgattccg gtaaatccac 721 taccaccggt cacttaattt acaagtgtgg tggtattgat aaaagaacca ttgaaaagtt 781 cgaaaaagaa gctgctgaat tgggtaaagg ttctttcaaa tacgcttggg tcttggacaa 841 attgaaggct gaaagagaaa gaggtatcac cattgatatc gctttgtgga aattcgaaac 901 tccaaaatac cacgttaccg tcattgatgc tccaggtcac agagatttca tcaagaatat 961 gatcactggt acttctcaag ctgattgtgc tattttgatt attgctggtg gtactggtga 1021 attcgaagcc ggtatttcta aggatggtca aaccagagaa cacgctttgt tggcttacac 1081 tttgggtgtc aaacaattga ttgttgctgt caacaagatg gactctgtca aatgggacaa 1141 aaacagattt gaagaaatca tcaaggaaac ctccaacttc gtcaagaagg ttggttacaa 1201 cccaaagact gttccattcg ttccaatctc tggttggaat ggtgacaaca tgattgaacc 1261 atccaccaac tgtccatggt acaagggttg ggaaaaggaa accaaatccg gtaaagttac 1321 tggtaagacc ttgttagaag ctattgacgc tattgaacca ccaaccagac caaccgacaa 1381 accattgaga ttgccattgc aagatgttta caagatcggt ggtattggta ctgtgccagt 1441 cggtagagtt gaaactggta tcatcaaagc cggtatggtt gttactttcg ccccagctgg 1501 tgttaccact gaagtcaaat ccgttgaaat gcatcacgaa caattggctg aaggtgttcc 1561 aggtgacaat gttggtttca acgttaagaa cgtttccgtt aaagaaatta gaagaggtaa 1621 cgtttgtggt gactccaaga acgatccacc aaagggttgt gactctttca atgcccaagt 1681 cattgttttg aaccatccag gtcaaatctc tgctggttac tctccagtct tggattgtca 1741 cactgcccac attgcttgta aattcgacac tttggttgaa aagattgaca gaagaactgg 1801 taagaaattg gaagaaaatc caaaattcgt caaatccggt gatgctgcta tcgtcaagat 1861 ggtcccaacc aaaccaatgt gtgttgaagc tttcactgac tacccaccat taggtagatt 1921 cgctgtcaga gatatgagac aaaccgttgc tgttggtgtc atcaaatctg ttgaaaaatc 1981 cgacaaagct ggtaaagtta ccaaggctgc tcaaaaagct gctaagaaat aaactaatta 2041 aatacctttg tttaaatagt tgtgattata ttattttttt aaaatatata taaataagag 2101 aataatcata caacatgtag taaggagcga agagtcccag ctctgtaaat aaaaaaaaga 2161 agaggagtta tgtgcttctt tgtttatgta tatatatata tatatatata tatattctta 2221 ccagttagtt aatcttacac atatgttgat ggtgctatat tgtacataca aactctaaat 2281 catatcagaa ctgcttggaa acctttccca ctaacaacac ccatccaatc cccttggttc 2341 tgtctccaac acacgttatt gatttccaa //
GenBank-Updates@genbank.bio.net (06/08/90)
LOCUS BSUHEMAC 3795 bp ds-DNA BCT 08-JUN-1990 DEFINITION B.subtilis 5-aminolevulinate synthase (hemA) and porphobilinogen deaminase (hemC) genes, complete cds. ACCESSION M32130 KEYWORDS 5-aminolevulinate synthase; porphobilinogen deaminase. SOURCE B.subtilis (strain 168) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3795) AUTHORS Petricek,M., Rutberg,L., Schroeder,I. and Hederstedt,L. TITLE Cloning and characterization of the hemA region of the Bacillus subtilis chromosome JOURNAL J. Bacteriol. 172, 2250-2258 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by I.C.Schroeder, 16-FEB-1990. FEATURES from to/span description pept 232 1599 5-aminolevulinate synthase (hemA, EC 2.3.1.37) pept 1607 2437 unidentified protein (ORF2) pept 2470 3414 porphobilinogen deaminase (hemC, EC 4.3.1.8) pept 3404 > 3795 unidentified protein (ORF4) BASE COUNT 1068 a 689 c 953 g 1085 t ORIGIN Map position 245 degrees. 1 atgcatatca ccttcttgtt ttttagagct gatgtgtagt aaatttctgc tgtttttggt 61 attgtcaata ggaatgcttc ttttccctga agctttttct aatatagcat aagaatttta 121 aaatctgttc acattttgtg aaagaaacta tgttataatt attataaata atgagttcta 181 tgttagaatg attataaatt aagattgggt gttgggggtg taattagagc gatgcatata 241 cttgttgtgg gagtagatta taaatccgcc cctattgaga tacgtgaaaa agtaagtttt 301 cagccgaatg agctggcaga agcaatggtg cagctgaaag aagagaaaag cattcttgaa 361 aacatcattg tctcaacctg caaccgcact gaaatttatg cggtagtcga ccagcttcat 421 accggccgtt attatataaa aaagttttta gctgattggt ttcaattaag caaagaagag 481 ctgtcaccgt tcttaacgtt ttatgagagc gatgccgctg ttgagcattt attccgtgta 541 gcctgcggac ttgattctat ggtgattggc gaaacgcaga ttctcggaca ggtacgcgac 601 agctttaaaa cagctcagca agaaaaaacg atcgggacta tttttaatga gctgtttaag 661 caggcagtta cagtgggcaa acggactcac gccgaaacag acattggctc aaatgcggtg 721 tcagtaagct atgctgcagt tgaacttgcc aaaaaaatct tcggaaatct ttcaagcaag 781 cacatattga ttctcggtgc gggaaaaatg ggcgagcttg ctgcggaaaa cctgcacgga 841 cagggaatcg gcaaggtcac tgtcattaac cgaacatact tgaaagcgaa ggagcttgca 901 gaccgttttt caggtgaagc gagaagcttg aatcagcttg aaagcgcgct tgcggaggct 961 gatattttaa tcagttcaac cggtgcaagt gaatttgtcg tgtccaaaga gatgatggaa 1021 aacgcgaata agcttcgcaa gggacgtccg ctgtttatgg tcgacattgc cgtgcctaga 1081 gatcttgatc cggcgctgaa tgatcttgaa ggtgtttttc tttatgatat cgacgatctg 1141 gaaggcattg tagaagcgaa catgaaagag cggagagaaa cagctgaaaa agttgaactg 1201 ttaattgaag aaaccattgt ggaatttaaa caatggatga atacacttgg tgttgtgcct 1261 gttatttctg cattgcgcga aaaggcgctt gccatccagt cagaaacgat ggacagcatt 1321 gagcgtaagc tgcctcactt aagcacaaga gagaaaaaac tgttgaacaa acacaccaaa 1381 agtattatta accaaatgct tcgtgatccg attttaaagg tgaaagagct tgcggcagat 1441 gctgattctg aagaaaagct cgcgttgttt atgcagattt ttgatattga agaagctgcg 1501 ggccgtcaaa tgatgaaaac cgttgaaagc agccagaagg tccactcttt taagaaggct 1561 gaatcaaaag cgggctttag cccacttgta agtgagtgaa agctgaatga ttgatactgc 1621 aatggcaaga cttaatgagg ggacaatcgt catttacgcg ttaagtgtac tcttttattt 1681 tatagatttt cttcaacaca accggaaggc tggaaaaatg gccttctggt tgctttctat 1741 tgtctggact ctgcaaaccg tgtatttggc ctattttatg tgggtgacgg ggcggtttcc 1801 ggtattaaat gtgacagagg cactttattt ttatgcctgg gtgcttgtca cgctgtcact 1861 tgtactgaca aagcttttac gtgttgactt tatcgtgttt tttacaaatg ttataggatt 1921 ttctatgatc gccattcaca cattttcacc gacagagcag cagtcagctg ctttttccgg 1981 gcagcttgta tccgagcttt tggtgattca tattacaatg gcgattcttt catacggcgc 2041 tttttccctt tctttcgttt tttctgtgct atatatgttt caatatcatg tgctgaaaaa 2101 gaaaaagtgg ggaaaatggc tgttgagaat agaagattta tctaagcttg attatatggc 2161 gtatgtttta aatgtcattg gggttccgat gctgctgctg agtttgattc tcggcgtcat 2221 ttgggcgtat gtctcactag aaacgctgta ttggtttgac gccaaagtgc ttggttcgtt 2281 tgtcgtcctg ctgctgtaca gctattatct ttatatcagg ctgattaagg agctgcaagg 2341 aaaggtcgct gcactgtgga atacggcttg ttttctggtg ctgatgatca attatttcct 2401 gcttggaagc ctgtcgcaat tccattggtt cagttaaacg atgtcccaag cagattcggg 2461 aggaaagaaa tgatgagaac gattaaagta ggttccagac ggagcaaact cgctatgact 2521 caaacaaaat gggttattca aaaactgaag gaaatcaatc cttcgtttgc ttttgaaatt 2581 aaagagatcg tgacaaaggg cgaccggatt gtcgatgtta cactctcaaa agtgggtgga 2641 aaagggcttt ttgtcaaaga aattgaacag gcgcttttaa acgaagagat tgatatggca 2701 gtgcacagca tgaaggacat gcctgctgtt ttgcctgaag gccttgtgat cggctgtatt 2761 cctgaacggg aggacccgcg tgatgccctt atttcaaaga atcgcgtaaa gctttcagaa 2821 atgaagaaag gtgctgtcat tggcacaagc agtttaagaa gaagcgcgca gcttttgatt 2881 gagcgccctg accttacaat taaatggatt agaggtaata ttgatacaag acttcaaaag 2941 ctggaaacag aggattatga cgcaattatt ttagcggctg ccggcctttc cagaatgggt 3001 tggaagcaag atgtcgtaac cgaattcctt gagcctgagc gctgtttgcc tgctgtgggg 3061 cagggagccc tggcgattga gtgccgagaa tcggatgaag agctgttggc gttgttttct 3121 cagtttacag atgaatatac aaaacggact gtcttagcgg aacgtgcttt tttaaacgcg 3181 atggagggcg gctgccaggt tccgatcgcg ggctactccg tgttaaatgg acaggatgaa 3241 attgaaatga caggtcttgt cgcttcacct gacggcaaaa tcatttttaa agaaaccgtc 3301 accggaaacg atccggagga agtaggaaag cgctgtgccg ctcttatggc tgacaaagga 3361 gcaaaagatt taattgatcg tgtaaaacgg gagcttgacg aggatggaaa atgattttcc 3421 gttgaaagga aaaacagtgc ttgtcacccg gaataaggca caggcagcat catttcagca 3481 aaaagtggag gcgcttggcg gtaaagcggt tttaacctct ttgattacgt ttcgccgcgc 3541 tttgccgaat gatgttgcgg aacaggtaag agaggatctt gccgcgccag gctggcttgt 3601 ttttacaagt gtgaacgggg cagacttctt tttttcttat ctgaaggaaa atcagcttat 3661 tctccctgcg cataaaaaaa ttgcagccgt cggtgaaaaa accgcgcgcc gtttaaaaat 3721 gcataacgta tcggttgatg tgatgccaca ggagtatatt gctgaacaat tgcgtgacgc 3781 tcttaagcag catgc // LOCUS SHFIPAH 2900 bp ds-DNA BCT 08-JUN-1990 DEFINITION S.flexner invasion plasmid antigen H (ipaH) gene, complete cds. ACCESSION M32063 KEYWORDS invasion plasmid antigen. SOURCE S.flexner (strain M90T-W; serotype 5) DNA, clone pWR390. ORGANISM Shigella flexneri Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2900) AUTHORS Hartman,A.B., Venkatesan,M., Oaks,E.V. and Buysse,J.M. TITLE Sequence and molecular characterization of a multicopy invasion plasmid antigen gene, ipaH, of Shigella flexneri JOURNAL J. Bacteriol. 172, 1905-1915 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.B.Hartman, 12-FEB-1990. FEATURES from to/span description pept 251 1849 invasion plasmid antigen H pept 2277 > 2900 ORF3 signal 139 144 -35 region signal 162 167 -10 region signal 2111 2116 -35 region signal 2132 2137 -10 region BASE COUNT 868 a 657 c 578 g 797 t ORIGIN 1 catagaaaac ctccataaat aaattacaac taacttctgt tatgtgtaaa atggaaacta 61 ttaaaactta atatcggaaa tggtaagtga aatttgtata aatatacaat tttaaatatt 121 tattctcaca aatataaggt tgacctagca ttatgttctc tgtaaataat acacactcat 181 cagtttcttg ctccccctct attaactcaa actcaaccag taatgaacat tatctgagaa 241 tcctgactga atgggaaaag aactcttctc ccgggaagag cgaggcattg cttttaacag 301 actctcccag tgctttcaga atcaagaagc agtattaaat ttatcagacc taaatttgac 361 gtctcttccc gaattaccaa agcatatttc tgctttgatt gtagaaaata ataaattaac 421 atcattgcca aagctgcctg catttcttaa agaacttaat gctgataata acaggctttc 481 tgtgatacca gaacttcctg agtcattaac aactttaagt gttcgttcta atcaactgga 541 aaaccttcct gttttgccaa accatttaac atcattattt gttgaaaata acaggctata 601 taacttaccg gctcttcccg aaaaattgaa atttttacat gtttattata acaggctgac 661 aacattaccc gacttaccgg ataaactgga aattctctgt gctcagcgca ataatctggt 721 tacttttcct caattttctg atagaaacaa tatcagacaa aaggaatatt attttcattt 781 taatcagata accactcttc cggagagttt ttcacaatta gattcaagtt acaggattaa 841 tatttcaggg aatccattgt cgactcgcgt tctgcaatcc ctgcaaagat taacctcttc 901 gccggactac cacggcccgc agatttactt ctccatgagt gacggacaac agaatacact 961 ccatcgcccc ctggctgatg ccgtgacagc atggttcccg gaaaacaaac aatctgatgt 1021 atcacagata tggcatgctt ttgaacatga agagcatgcc aacacctttt ccgcgttcct 1081 tgaccgcctt tccgataccg tctctgcacg caatacctcc ggattccgtg aacaggtcgc 1141 tgcatggctg gaaaaactca gtgcctctgc ggagcttcga cagcagtctt tcgctgttgc 1201 tgctgatgcc actgagagct gtgaggaccg tgtcgcgctc acatggaaca atctccggaa 1261 aaccctcctg gtccatcagg catcagaagg ccttttcgat aatgataccg gcgctctgct 1321 ctccctgggc agggaaatgt tccgcctcga aattctggag gacattgccc gggataaagt 1381 cagaactctc cattttgtgg atgagataga agtctacctg gccttccaga ccatgctcgc 1441 agagaaactt cagctctcca ctgccgtgaa ggaaatgcgt ttctatggcg tgtcgggagt 1501 gacagcaaat gacctccgca ctgccgaagc tatggtcaga agccgtgaag agaatgaatt 1561 tacggactgg ttctccctct ggggaccatg gcatgctgta ctgaagcgta cggaagctga 1621 ccgctgggcg caggcagaag agcagaagta tgagatgctg gagaatgagt actctcagag 1681 ggtggctgac cggctgaaag catcaggtct gagcggtgat gcggatgcgc agagggaagc 1741 cggtgcacag gtgatgcgtg agactgaaca gcagatttac cgtcagctga ctgacgaggt 1801 actggccctg cgattgtctg aaaacggctc acgactgcac cattcataat cacgtcgcat 1861 aagcataaac cgcagaccgg attgactccg gaaaaactgt gacccgatta cggaccttaa 1921 caacaacccg taaatcctcg ctcaataccg gcagggattt acggcgtgca actgactttt 1981 ttgaggggat aaccaaccag atcgtttgct atgggaatat cgagacagta atgagttaaa 2041 tgataaaaat tgtttgaaaa tataggggat aaagatcaat ccaaactgga tgaaagtaga 2101 actggtcaca ttaacatggg tagactgata taacaatcga cggttactgg aaagacagga 2161 acatattcct ccagccggaa tgaaaacgcc gataaagctc taggattgtt tttttaaaga 2221 ctttctcgtt ttatttgcat taatagacca agatatgaat agtgaggggt taataaatga 2281 aaccgatcaa caatcattct ttttttcgtt ccctttgtgg cttatcatgt atatctcgtt 2341 tatcggtaga agaacagtgt accagagatt accaccgcat ctgggatgac tgggctaggg 2401 aaggaacaac aacagaaaat cgcatccagg cggttcgatt attgaaaata tgtctggata 2461 cccgggagcc tgttctcaat ttaagcttac tgaaactacg ttctttacca ccactccctt 2521 tgcatatacg tgaacttaat atttccaaca atgagttaat ctccctacct gaaaattctc 2581 cgcttttgac agaacttcat gtaaatggta acaacttgaa tatactcccg acacttccat 2641 ctcaactgat taagcttaat atttcattca atcgaaattt gtcatgtctg ccatcattac 2701 caccatattt acaatcactc tcggcacgtt ttaatagtct ggagacgtta ccagagcttc 2761 catcaacgct aacaatatta cgtattgaag gtaatcgcct tactgtcttg cctgaattgc 2821 ctcatagact acaagaactc tttgtttccg gcaacagact acaggaacta ccagaatttc 2881 ctcagagctt aaaatatttg // LOCUS BOVMTAK5 3852 bp ds-DNA MAM 08-JUN-1990 DEFINITION Bovine mitochondrial adenylate kinase gene, exon 6 and 7. ACCESSION D90069 KEYWORDS ATP:AMP phosphotransferase; adenylate kinase. SEGMENT 5 of 5 SOURCE Bovine mitochondria DNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 3852) AUTHORS Tanaka,H., Yamada,M., Kishi,F. and Nakazawa,A. TITLE Isolation and characterization of bovine mitochondrial adenylate kinase gene JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hiroshi Tanaka Department of Biochemistry Yamaguchi University School of medicine 1144 Kogushi Ube, Yamaguchi 755 Japan Phone: 0836(22)2215 Fax: 0836(22)2315 FEATURES from to/span description pept / 126 347 adenylate kinase 2A exon 6(a+b) (EC 2.7.4.3) pept / 126 + 321 adenylate kinase 2B exon 6a pept + 2572 2576 adenylate kinase 2B exon 7 pre-msg < 1 491 adenylate kinase 2A mRNA and introns pre-msg < 1 3448 adenylate kinase 2B mRNA and introns IVS 322 2571 adenylate kinase 2B intron BASE COUNT 942 a 915 c 933 g 1062 t ORIGIN 1 tctagattgt ctgtctctcc ctgtgatgga ctttgggatc agacagcatc tcagatctgc 61 atctggcctt ggattaggag accttgctga gtcccagcct ttcattgccc tcttcttctt 121 ctcagatcac tggggaaccg ttgatccgcc gatcagatga taataagaag gccttgaaaa 181 tccgcctgga agcctaccac actcaaacca ccccgctggt ggagtactac agtaaacggg 241 ggatccactc cgccatcgat gcgtcccaga cccctgacgt cgtgtttgca agcatcctag 301 cagcattctc caaagccaca tgtaaagact tggttatgtt tatttaatgt tgggtccaaa 361 aaggaacttc tttcttcttc ctttccctgt tgaacgagtg ggtgggaatg gcagagtagg 421 cagagggaag cttcctcagg ccagcaagaa tatcatttga tgtattgatt aaaaaagcac 481 ttgcttcatg tacctttggt gtgtgcgcaa gtctcatctc atctgtgtgt gtgtgtggtg 541 tgtatgtgcg tgcacgtgca cacacgtgtg taagtataag tatgtgtaca ctcttatact 601 tcttaaattg taggcgagac tgtttacttc tttagccgta ctctttattt tcgacttccc 661 tggtggcgca gacggtaaag catctgtcta caatgcagga gacccgggtt cgatccctgg 721 gttgggaaga tcccctggag aaggaaatgg caatccactc taggactgtt gcctggaaaa 781 tcccatggac agaggagcct ggtaggctac agtccatggg gttgcaaaga gtcagacaca 841 actgagcgac ttcacttcac ttcactttat tttcaaccaa aataaactaa ctcaagtatc 901 cgccaggcag acaagttcct ggtatttttc catctccagc agctaattag aagtcctggg 961 tcacatgcag tcaggcaggg ctacagttcc tgatggcagc ctgttggcca gaaattccac 1021 ttgttttctc acccataatg aaaagtcttt gagtcattgt ggaaagggat cattaatttt 1081 ttccccctaa acaggaatga aaaggcactt actttataga ttccagaaat tactgggaga 1141 gggtatcacc atagaaagag cctggccaag ttgaattatt tttgtgatct gccccatggt 1201 tctataaagt aggaatcatt cggcacctgt gttgaatact aaattgatgc caaagaatta 1261 aaccaggtga aaggtcctct tgaattcaga ctgtcttctg aacatccagg ctaggcatct 1321 gagagcagcc agtccacttc cccaaagaca gcaggataga tttgtcttta ttttcaatgt 1381 ttgctctgtt tgcaagtgtg aacaaaacgg tgatttgagg atcatttggg gatcatgttg 1441 attagcattt attcaagccc agtctctaca ggatatgaag tcacatatat cccttttgaa 1501 cgcagccctc tcccttccat atctagtcct tcacccctta tgtcatgacg gtggtggcca 1561 cagtctcccc aagaaaggac cacccccaaa ataagatcac ctatggccgt aaccagcttt 1621 tcttcctaac ctcggctccc ggctcttagg catctcttga gatccacact gtgccttttg 1681 gtctctggtt tgatctgtgg caaacggagg aattaataga ctgctatgag ggaccacaaa 1741 aacagcaggc tctggaaaaa ccattaaaaa gtcagtgaca ggtccagtaa ataatgccac 1801 ctgcccaggg tactctgctg actcagtgca gtggcctcgg aagtgcccag cccccacctg 1861 acctgagctc tcctgctgtc tgtgagacca ccagagggct tgtcctctgt ccgcctggaa 1921 actggctgcg gtacgtggcc acctgtgtcc ctctctgtct gcttcaagtg agtgtctgct 1981 ggctctgctc tgccttgttt ccctggttgg agactaactc cacccattct taacggaaac 2041 ccagcctggc atgggatgtt tctgggaagc acatgatttc tgagaatgga caaggaaagg 2101 gagtgaaacc aaaactgtct gctgtgtctg cagggatctg ggctgcttct ctgggtgagg 2161 tggcctcatg aatcttagag ttgactcccc tttgacgggg gacagttgca ggcacttggc 2221 ctcaaagttg aggtccttga atgattgctg atagcgtgtg catgatctgt gctcaaatat 2281 ttgtctgtgt gcattcacac gtgcacacat atacatgtga tatttgggct accaatggga 2341 aagggaaggt cgtcgggggt gcatatgtcc tcttgggggt gggtagaatt caatttttgc 2401 tttgcttaat tccagagcct taacctcagc aggcatgaga gctttcacct ggcagagaaa 2461 tgtccagtgt ttaatgcatt tggttactgg ggtgtgcatg ccaaccttct ctgatggtta 2521 cttggttttt aaaaatgcat tttgttttct cttgtctcct ctttatttca gcctagtaac 2581 agaaggccag gcgggactgc accactactc atcaccccac ggcatgatcc ctgctcttag 2641 gtgctgggca gagggagggg cggtcagggt aaggatgggg acagaggagc ggtgagggcc 2701 caagcggggg agaggagctg tgttgccatg aagcagtttc atttacgtgg agtagaactc 2761 ttaaaagcat aagcaaaggg gaaaattaat tttttaaaac acttgattgg agggtataaa 2821 tagaaacaag gagatgcaat attatttcta aggaatcatg ctttcattta ctttggactg 2881 gtgacaactg gacaggtctc agtgccctga gacctcagct ttgatcagaa cctcatgcca 2941 gcccaggaat gcgggaaatc acactgttgt cctgtctgac ccatagcttg gaacagggca 3001 ggagttgatt actaatcctg gttacccata ccatgaaatc agaacgtcat ctccccacgt 3061 gcaatagaca tggaatgtgt tcctgggggc ggcgtctcaa caaatccggc ttactggagt 3121 catggggcag gctggtgtcc ctcccttcca tactcaccac tgatttacca gcccacctgc 3181 cctcacgggt gagccctcgg cagccaccca gcatacgcca cagtcctgtg ctcttgcctt 3241 cctccatcca catcatgtga aaggactctt tttaatcaat gagcaagtgt cctaagcaac 3301 attatccaaa gactgtcctt tccatcctca aatcctatga ctgggatcag tctacaacac 3361 tgtgatgtat tattttcaat gaggtgcctt tcttaactgt ccaaatgctg ccttgtttgg 3421 cccttaaata aagtgtgtta aaagtttgta tcccctgttg tggcattttt taaggagctg 3481 tgggctagta aaatgacgct gaattctgga tctgacttag aagccctttt ctgggtcatt 3541 agaagtcata agaaggcaga aaggtccaag agtgtgttca taagaagagt ctgctccctc 3601 ttggccttca gctcatctgc aggattttct tgagcccaag gacaaggcag cttgggccaa 3661 tggagacagt gctgtgcttt gggttcaggg atgtacaaca aatcccagcc aatcatttgc 3721 tcctgcttcc ttgtgtgaga aagagactag tggtggtcca gtggttaaga ctcttacctg 3781 ccactgcaga ggacactggt tcgatcccgg gtctgggaag atcccacatg ccgtgggcac 3841 tggacccatg tc // LOCUS BBRHWP 4330 bp ds-DNA BCT 08-JUN-1990 DEFINITION B. brevis cell wall protein (HWP) gene. ACCESSION D90050 KEYWORDS HWP; cell wall protein. SOURCE Bacillus brevis (strains HPD31, HPD52 and HPO33) genomic DNA. ORGANISM Bacillus brevis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 4330) AUTHORS Ebisu,S., Tsuboi,A., Takagi,H., Naruse,Y., Yamagata,H., Tsukagoshi,N. and Udaka,S. TITLE Conserved structures of cell wall protein genes among protein-producing Bacillus brevis strains JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Norihiro Tsukagoshi Department of Food Science and Technology Faculty of Agriculture, Nagaya University Furo-cho, Chikusa-ku Nagoya 464 Japan Phone: 052-781-5111 x6869 Fax: 052-781-4447 FEATURES from to/span description pept 548 3898 HWP precursor (ttg start codon) sigp 548 706 HWP signal peptide sigp 638 706 HWP signal peptide matp 707 3895 HWP mature protein site 132 132 transcription initiation site site 255 255 transcription initiation site site 325 325 transcription initiation site site 373 373 transcription initiation site site 386 386 transcription initiation site BASE COUNT 1401 a 804 c 1000 g 1125 t ORIGIN 1 atcgattacc aaaaaaccgc gaaagattcc gcggctgaga gctatcgttt gaccaatttg 61 cgctttgaaa cggtctggca acgaccctcg aagtgattca gccgaggagg aattatccaa 121 ccgtgagaat cagtaccaaa aagcgatcct gcagtacaat ctggctgtgg taaactttga 181 aactgccctc ggaagctaag aagaatttgt aaggttttga atggaactgg aaaaaggttc 241 aatggtgaca gtccgtccgg cgtaccctat aatacgagtt gtggcggatg tcactgctta 301 catattacaa gtgaatacga cccatgaaaa tttctttaaa ctttttttct gaggcgccgc 361 aacttttgct cgactgaggc gtttaatagg gtgtcacacg aaaaacgggg aattgtgtaa 421 aaaagattca cggattctag catttgtgtt acactagtga ttgttacatt ttacacaata 481 gctgaatata ctagagattt ttaacacaaa aaagcgaagc tgtcctgcga aaggaggtga 541 cacgcgcttg caggattcgg gctttaaaaa gaaagataga tcaacaaata ttccccaaga 601 acaatttgtt tatactagag gaggagaaca caaggttatg aaaaaggtcg ttaacagtgt 661 attggctagt gcgctcgcca tcacagttgc tccaatggct ttcgcagcag aggacacaac 721 tacagctcct aaaatggatg ctgcaatgga gaaaaccgta aaacgtctgg aagctcttgg 781 cctggtagca ggttatggca acggcgattt cggtgcagac aaaaccatca ctcgtgcaga 841 gttcgctact ctaatcgttc gcgctcgcgg tctggagcaa ggtgcgaaac tggcacaatt 901 caacactact tacacagatg ttagatccac tgactggttc gctggttttg taaacgtagc 961 ttccggtgaa gaaatcgtaa aaggtttccc ggacaaatct tttaaaccac aaaaccaagt 1021 tacttatgct gaagcagtaa ccatgatcgt tcgtgctttg ggttatgagc catccgttcg 1081 aggtgtatgg ccgaacagca tgatctccaa aggttccgaa ctgaacattg caaaaggtat 1141 caacaaccct aacatgcagc agttcgcggc gacaatcttc aaaatgctgg acaacgctct 1201 tcgcgttaag ctgatggagc aaatcgaata cggtactgac atccgtttaa acgtaactga 1261 cgaaactctc ttgactaaat atttgaaagt taccgtacgt gatatggact gggctcacga 1321 aaagggtaac aattctgatg aattgccact tgtaacaaac gtacctgcta ttggtctggg 1381 tagtttgaaa gcaaatgaag ttactttgaa tggaaaagat gctgatctgg gtagcaacac 1441 tacttataaa gtagctgaag gcatcaatcc taacgcattt gatggtcaaa aagtacaagt 1501 gtggatcaaa gatgaccgag aaaatgtcat cgtttggatg gaaggttccg aagacgaaga 1561 tgtcgttatg gaccgtgtga gtgctctgta cctgaaaggt aaagccttca cagatgatat 1621 tgtaaaagat cttagcaagt ctgatttgga tgatgtaaaa atcgaaatgg atggtagcga 1681 gaaatcgtat cgcttgacgg aagatacgaa aatcacttat aacttcactc gtttcaacga 1741 tccagtagat gctctgagca aaatttacaa agacaacgac acgtttggtg ttaaagttgt 1801 tttgaatgat aacaatgaag ttgcatacct ccacatcatt gacgatcaaa caattgataa 1861 aagcgtaaaa ggcgttaaat acggttccaa agttattagc aaaatcgatg ctgataagaa 1921 gaaaatcacc aacttggata actccaaatt tagcgacttg gaagatcagg atgaaggaaa 1981 agacttcttg gtattccttg atggccaacc agctaaactg ggtgatctga aagaatccga 2041 tgtttactcg gtatactatg ctgatggtga taaagataag tacttggtgt ttgctaaccg 2101 taacgttgca gaaggaaaag tagaaaaagt ggtaagccgt aacaagacag acattcgtct 2161 gactgttggt ggtaaaactt acaaagtata tccagatgct tcttactccg aaaacgcaaa 2221 taaagatgtt aagaaggtta attccgactt ggatctgatt agcaaccttg atggagaaga 2281 agtaaaactt ctcttggatc catccggtcg agttcgtcac atcgaaacaa aagacgcgat 2341 cgatgatcgt aaaccattgg caatcatcac taagggtgct acctataact ccagcaaaga 2401 tacttatgac tttactgtta tgacccaaaa aggtaaaaca caaatcgtat ctttggatca 2461 aaaggacatc tacgatagat atggggttaa ctacgataaa tccaatgata agcgtcaagc 2521 tttcgaaaag gatttagttg agcttctgca acctaaggta gttaaagagg actcagctac 2581 tgatgccaat caaaccgttc ttttggaagt taattttgat tccaaaggcg aagttgataa 2641 agtcaaagtt cttgatagca agctgaaata ctccgagaaa agcacttggg ataaacttgc 2701 ggatgaagat gacgacgtcg ttggtgatta tgaagtaacc gataagactg ctgtcttcaa 2761 aatgactggt gaccttactc cagctactgg aactaaacgt ggtgaactga aaaacgctgg 2821 tactgctaaa tttaaggatg ttgctaagaa gagcgatctg aaggtttggt actcagtgga 2881 ggaagacaaa ggtgaagttc aagcgatctt cgtagtagat ggcagtggtt tgggtggcga 2941 ccatcagttt ggtatggtca aacaatatgg tactgctagt aaacaggaca ctatcactat 3001 agttacaaaa gatggtgatt ctgtcactga aaaagagtac aaattggatg gcgacgctga 3061 tgatttgaaa gttgatcaag acatcagacg tggagatgtc atttctttca cactgaattc 3121 tgatggagaa gttattgttg acgatgtagt cgaggttgta aataacaacc acattgataa 3181 cactgcttct aaatcagcta cgctcatgcc tgaagacgaa cgtcaaaaag caggaatcga 3241 caaattggtt gttgctcgcg ttgacgaagt tgatggtaac actatttcct tgaactatgc 3301 tgacggaaag acacaaaaat attacacaaa agcatccact gcgttcattg atgtgtatga 3361 cggtcttgaa ggaattgatg gagtagatga aggcgactac atcgtaatga tcgatagcgc 3421 cgatattgac ggaactcgct ttgactatgt actggtagtt tctagcgatg atgagatccg 3481 cacgcaacac atctccacta aagcagttac ggacttcctg aacaagccaa ccagactatg 3541 taccaaatcc tggcgatggg gaagaagtag tcacggcacc aaagttaata cagttaacga 3601 tgaagcagtt gtagatggta ttgtaactct tccagctgat gcatctgtta gaaacttcaa 3661 cattgcattt gatcaagaaa ttaacagcaa agatgcaacg gtaactgtta ctaatgaaga 3721 tacgcttggt aacgtaacgg tatctgaggt tgcgacagat gcaaaagtat tgagcttcaa 3781 gactgctaaa ctggacacta caaaaactta catcatcaca gttaaaggct tgaaagataa 3841 aaacggtaaa gcagtgaaag atgtaactct ttacgttgaa tttgttgctg gagtgtgact 3901 tgttgctcta taaacactaa tagtgtgaaa tgagtgaaga agggagaagg attaaaacct 3961 tctcccttct ttaactgttt tggcgaattg aagggtgtaa cccccttccg tggaaacaac 4021 cattaatcca cgtgactaat cctcctgcat gcgttgtaag ctaactctta cagggtgtga 4081 agccaggtga agacgtaacc cttagtggtg cgggtctccg taaaagctaa ctatggttag 4141 gaaacgaatc tacgtctgag gcgtgtcagg tttggtaatt agcccttcag ctgctaatta 4201 ccccacaggc tgagatgcct ggagggttct aacgaattaa ggttgttaac gttagttcat 4261 cttgatatgt atagtaggaa cctaaggttt agcaaaaaaa gggatagcca aaatggaaca 4321 agtgagatct // LOCUS BSPRSDA 2996 bp ds-DNA BCT 08-JUN-1990 DEFINITION Bacillus sp. raw-starch-digesting amylase gene. ACCESSION D90112 KEYWORDS alpha amylase; raw-starch-digesting amylase. SOURCE Bacillus sp.(strain B1018) DNA. ORGANISM Bacillus sp. Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2996) AUTHORS Itokor,P., Tsukagoshi,N. and Udaka,S. TITLE Nucleotide sequence of the raw-starch-digesting amylase gene from Bacillus sp. B1018 and its strong homology to the cyclodextrin glucanotransferase genes JOURNAL Biochem. Biophys. Res. Commun. 166, 630-636 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Shigezo Udaka Department of Food Science and Technology, Faculty of Agriculture Nagoya University Furo-cho, Chikusa-ku Nagoya 464 Japan Phone: 052-782-5111 x6356 Fax: 052-781-4447 FEATURES from to/span description pept 313 2454 raw-starch-digesting amylase precursor (EC 3.2.1.1) sigp 313 393 raw-starch-digesting amylase signal peptide matp 394 2454 raw-starch-digesting amylase mature peptide binding 302 306 ribosome binding site signal 101 106 -35 region signal 125 130 -10 region rpt 2567 2580 inverted repeat rpt 2585 2598 inverted repeat BASE COUNT 764 a 852 c 774 g 606 t ORIGIN 10 bp upstream of RsaI site. 1 ttatttgagt acattttatg tattcccaca ttgcgcccga tatctacgct tagaaaaaaa 61 tcgtcggaaa agcgccccaa aaaattttta ttgttattta ttgacagttg tattcgcttt 121 catctacaat gatggaggaa cgcaatactc gatataattt aagggccatg cattccgtga 181 ccgcacaccc ggtatggaac aaccccggta tctcgatgga gaagccgggg ttttttgtcg 241 ccctttttta ggaggtgatc cggcgacagc ggatcaagcc tggaattcaa ataattacat 301 aggaggtata acatgaagaa atttctgaaa atgacagccg cgttttccct gggattatcc 361 ctggcgttcg ggcttttcag ccccgcccag gccgcgccgg atacctcggt atccaacaag 421 caaaatttca gcaccgacgt catctatcaa attttcaccg acaggttttc ggacggcaat 481 cccgccaaca atccgaccgg cgcggcgttt gacggaacct gcacgaacct ccggctgtat 541 tgcggcggcg actggcaggg catcatcaac aaaatcaacg acggttacct gaccgggatg 601 ggcgttaccg ccatctggat ctcccagccg gtcgaaaaca tctacagcat catcaattat 661 tccggcgtca acaacacggc ctatcacggc tactgggccc gggacttcaa gaagacgaat 721 ccggcctacg gcacgattgc ggacttccag aacctgatcg ccgccgcgca tgccaaaaac 781 atcaaagtca ttatcgactt cgccccgaac catacgtcgc ccgcctcgtc cgaccagcct 841 tcctttgcgg aaaacggccg gctgtacgat aacggcacgc tgctcggggg atacacgaac 901 gatacgcaga acctgttcca ccataacggc ggcacggact tttccacgac cgaaaacggc 961 atctacaaaa acctgtacga tctcgccgac ctgaaccata acaacagcac gtcggacgtc 1021 tacttgaagg acgcgatcaa aatgtggctg gatctcggca tcgacggcat ccgcatggat 1081 gcggtgaagc atatgccgtt cggctggcag aagagcttta tggctgccgt caacaactat 1141 aagccggtct ttaccttcgg cgaatggttc ctgggcgtaa acgaagtagg cccggaaaac 1201 cataagtttg ccaacgaatc cggcatgagc ctgcttgatt tccgttttgc ccaaaaggtg 1261 cggcaggtgt tccgggacaa caccgacaat atgtacggcc tgaaggcgat gctggagggc 1321 tccgcagccg attacgccca ggtggatgac caggtgacgt tcatcgacaa ccatgacatg 1381 gagcgtttcc acgcaagcaa tgcaaaccgc cggaagctgg agcaagcgct ggcgttcacg 1441 ctgatcctcg cgcgcgtccc cgccatttat tacggcaccg agcagtacat gtcgggtggg 1501 accgatccgg acaaccgggc gcggatccct tccttctcca cgtcgacgac cgcctatcaa 1561 gtcattcaaa agctggcgcc gctgcgcaag tccaacccgg ccatcgccta cggatcgacg 1621 caggagcgct ggatcaacaa cgacgtgctc atttatgagc gcaaattcgg cagcaacgtt 1681 gccgtcgttg ccgtcaaccg caatttgaac gcgccggctt ccatttcggg acttgtcact 1741 tccctgccgc aaggcagcta caatgacgtc cttggcggcc ttctgaacgg caacacgtta 1801 acggtaggct ccggcggagc cgcctccaat ttcacgcttg cggccggcgg cacggcggtg 1861 tggcagtaca ccgcggcaac ggcgacgccg accatcgggc atgtcgggcc gatgatggcc 1921 aagccgggcg tgacgatcac gatcgacggc cgcggcttcg gctctagcaa aggcaccgtc 1981 tacttcggca cgacggcggt gagcggcgcc aacatcacgt cttgggaaga cacgcagatc 2041 aaagtgaaaa ttccggccgt cgcaggcggc atctacaaca ttaaagtcgc aaacgccgcc 2101 ggaacggcaa gcaacgtgta cgacaacttc gaggtattgt ccggagacca ggtcagcgtc 2161 cgcttcgtgg tcaacaacgc gacaacggcc cttgggcaaa atctctacct gacgggcaat 2221 gtcagcgagc tggggaactg ggacccggca aaagcgatcg ggccgatgta caaccaggtc 2281 gtttaccaat atccgaactg gtattatgac gtcagcgttc cggccggcaa aacgatcgag 2341 ttcaagtttt tgaaaaaaca aggctccacc gtcacgtggg aaggcggcag caaccacacc 2401 ttcaccgcgc cgtccagcgg caccgcgacc attaacgtga attggcagcc ataaggcgtg 2461 agggataggc ggctggcatt cattggaaaa ggcggactat atgacgtccg ttccgtgagc 2521 aacgctcatc gctccgttca aaccgccaca aggctgatct tcagccaaaa aaagagggga 2581 cctttcccct ctttttttat ttccgttgac taacggtatt cccaaaaatt acattggggg 2641 ataagctccc tcccctctaa tagcaataac aagagcgtaa acccaaccag gtgatccata 2701 gcgtgcggtc gcctttaatc ccggtatcaa aatgtatcct accttacaaa aatgatcgga 2761 tcatacaaaa tagtgcgtac tactcaacga aatagaacct acatacagaa cgatcgatcc 2821 agatttcaac gaacggcacg gtcgtttaaa aaaatggtgt gcggggtgcg agaatatgca 2881 agaatatcaa ctgactttga aagataagcg gatcgtatgg gggaaggcga tcgaccttga 2941 gcctctcatt ggcaaatatc ctggcgactc gattagacag ggcatgaacg aagctt // LOCUS HUMALPL 3101 bp ds-DNA PRI 08-JUN-1990 DEFINITION Human alkaline phosphatase (EC 3.1.3.1) gene. ACCESSION D90054 KEYWORDS alkaline phosphatase. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3101) AUTHORS Matsuura,S., Kishi,F. and Kajii,T. TITLE Characterization of a 5'-flanking region of the human liver/bone/ kidney alkaline phosphatase gene: Two kinds of mRNA from a single gene JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Fumio Kishi Department of Pediatrics Yamaguchi University School of Medicine Ube, Yamaguchi 755 Japan Phone: 0836-22-2258 Fax: 0836-22-2696 FEATURES from to/span description pre-msg 2130 3101 alkaline phosphatase mRNA and intron IVS 2341 3101 alkaline phosphatase intron rpt 375 664 Alu sequence rpt 2631 2926 Alu sequence BASE COUNT 775 a 752 c 692 g 882 t ORIGIN chromosome 1; map position p34-36.1. 1 aagctttctc cagcgagtat gatggtttct gcaggttctt ggcataaagc ctttatcaga 61 ttaaggaaat tcttttcaat acctggtttg ctgagggctt ctgtcacatc gttttctgtg 121 accccattcc ctctccctag gtgagcacgt caagtttgat cagggtgtta aactgccacc 181 cctgtgccta tgattcccaa atttatactc taacccagac ttctttttca aatgccagag 241 ccaaatattc agctgcctcc ttagtgtctc cacttctaaa agacatctcc aactcaacat 301 atccaaaaac aagttcctga ttgtctccac ctcatgcctc aaaagaccac cccaaacgcc 361 gaaaggctga atgctttttt ctttttcttt tttttttttt tctgagatgg agtctcactc 421 tgttgcccag gctggactgc agtgatgcga tctcagctca ctgcaaactc tgcttcctgg 481 gttcaagtga ttctcctacc tcagcctctc aggtagctgg gactacaggt gcacaccacc 541 atgcccagct aatttttgta gagagagttt caccatgttg gccaggctgg tctcaaacac 601 ctgaccttaa gggatccacc cgcctcagcc tctcaaagtg ctgggattac aggtgtgagc 661 catcgcactt ggctcggtag tatatggctc agaaacattg ccatttacaa tagttcccca 721 aaaagcaaaa ttcttaggta taaatctgga ttcagagtcc agaatgctaa ccattacacg 781 atggaacccg taggtataaa tctaagaaaa catatccaag atctacaggc tgaagactac 841 agagtgctga taaaaccgaa gaactctgac tgaatgagtg gagagacgtg gtgtcttcat 901 gactgggcaa ctccatgtgg tatagacgta aaccctccca cattgatctg tggatttaat 961 accataccta tcaaaaacac agtggtggag gacagatcag ggatcgccag gtttagggat 1021 ggggggattg tgtaactata aagaacgcaa gagagatttt tggggtggca gagctgttct 1081 gggtcctgac ggtggcggtg gtggttacat aaatctatcc atgtgtcaaa cgtcagaaca 1141 ctcattttac acttgggggc aacagaaatc cctccctctg gagggggtga ctgatggtaa 1201 cctgattgct aattctggaa tcaggagccc tgtggtcagg tttctgctct gcaacttcct 1261 gttggtaacc ttgggcaagt ctccgtccag agccttggtt ttctcatctg taaaaggaga 1321 tgataggtcc ttttctgtcc actgcatagc tgattagtga aacatcatgg tgaaattctt 1381 tatgaactat ggagtgcagc acatagactt gctttcattt tgtcagtatc ctttatagat 1441 tgttcatgta agctcccaaa gagtagtatt tattttattg aaataaaatg cacgtagaga 1501 aaaatgtgtg tatcatacat tgacagctga acccaccgtg taaccagcac ccacccaccc 1561 agatcaatca taaaccgaac cgcaccagca ccccagcagc ccgttcccgt ttccgtaccc 1621 tccacgtgga gcctccgttc tgtctcccaa cgccctgggt tagtttttat actttctgtc 1681 atcggaatca cactgtaagt gctcttgggt ttagcttcct ttgctcaagc ttaccttgtg 1741 cgattcattc atgttgttgt gaggagctgt ggatcatcca ttctccttgc tgtctgtggt 1801 ggtttctgtg ttgtgaacac acacaatgta ttatccagcc tgccgtagat ggaggcagtt 1861 ttgaagccat tataaacagg gctgatgtgc acattctgct ggagagaaac gggtcccagg 1921 gtacaggtag gatgatcagc ttcggtagat cctgccggtt ttcccatgcg ctgtgcctgt 1981 ctgcactcca ccaacggcga gcggaccttc cggtagttaa acatcttcac gaactcttgg 2041 actttcctgc acacacagag aagataattt tggatggctc ttcccttccc cccacaacct 2101 tccttagggc actggctttc aactgatgta aatatttact atgccaagca ctaggagggc 2161 agagacaaac aagacaaagt cctcacactt agaaactccc ggtgtggcag ctgagatggc 2221 ccaggaaaga actatattac cttcaaaaag agaggtacat gcgatgtttg aggtggcatg 2281 aagctcagtg gtgttatatt ggaatgagtg agtgaccatc ctggagcctt cctgaaagag 2341 gtgacttcat ttttaagtga ttttaaataa tagtttaatg aattagtatt tcgtattcag 2401 ttaataacat ttttctgatt ttaggatttg ctatagaaat atttggaaac cgtaaagtag 2461 aacaaaaaaa aaatgtagga atcatctgaa attccaaatt ctaccactca cagttaagtg 2521 ttgttagatg ttagatgtgg gatattgcct tttaatttcc actctgcgcc gctaccccca 2581 gcccctaccc cagagccgtc acttctggca ctggagcgca gcttgcgtgg tttttttttt 2641 tttttttttt tttttgagac agagtcctgc ctgtcgccca ggctggagtg cagtggcgcg 2701 atctcggctc actgcaactc cccctcccgg gttcacgcca ttctcctgcc tcagcctcct 2761 gagtagctgg gactacaggc gcccgctacc tctcccggct aattttttgt atttttagta 2821 gagacggggt ttcactgtgt tagccaggat ggtctcgatc tcctgacttc gtgatccgcc 2881 cgcctcggcc tcccaaagtg ggcagatcac ctgaggtaga gagttcgaga ccagacctga 2941 ccaacatgga ccccatctct actaaaaata caaaattggc cagggcatgg tggcgcaagc 3001 tgctaatccc agccactcag ggaggctgag gctggaaaat tgcttgaacc cgacctgcag 3061 gcatgcaagc ttggcgtaat catggtcata gctgttttcc t // LOCUS RATCNRAA 2337 bp ss-mRNA ROD 08-JUN-1990 DEFINITION Rat calcineurin A alpha mRNA, complete cds. ACCESSION D90035 KEYWORDS calcineurin; calcineurin A alpha; calmodulin binding protein; calmodulin-dependent protein phosphatase; isoform. SOURCE Rat brain, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2337) AUTHORS Ito,A., Hashimoto,T., Hirai,M., Takeda,T., Shuntoh,H., Kuno,T. and Tanaka,C. TITLE The Complete Primary Structure of Calcineurin A, a Calmodulin Binding Protein Homologous with Protein Phosphatases 1 and 2A JOURNAL Biochem. Biophys. Res. Commun. 163, 1492-1497 (1989) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Takayoshi Kuno Department of Pharmacology Kobe University School of Medicine 7-5-1 Kusunoki-cho, Chuo-ku Kobe 650 Japan Phone: 078-341-7451 x3273 Fax: 078-351-6531 Peptides, 78-329 and 391-414, seem to be putative catalytic domain and calmodulin binding domain, respectively. FEATURES from to/span description ORF 208 1773 calcineurin A alpha signal 1944 1950 polyadenylation signal BASE COUNT 649 a 523 c 596 g 569 t ORIGIN 1 cgggaggagg agtgaaggcg gcggcggcgg aggagggacg cgcggagccg gcagtaactt 61 tcgagccagc ccagagcccg gagctccagc cgagcggttt gcagcgcggc ggcgcggcgc 121 tgagtgtctg gcccgccggt gcggtcgggg tgtgcagtcg gacgggacca gcagcgcgtc 181 gctgtccccc cctcccggtg actggagatg tccgagccca aggcgattga tcccaagttg 241 tcgactacgg acagggtggt gaaagccgtt ccatttccgc caagtcaccg gctgacagca 301 aaggaagtgt ttgataacga tgggaagcct cgtgtggata tcttaaaagc acatctcatg 361 aaggaaggca ggctggaaga aagtgtcgcg ttgagaataa taacagaggg tgcttcgatt 421 ctccgacagg aaaaaaactt gctggatatt gatgccccag tcacagtttg cggggacatc 481 catggacaat tctttgactt gatgaagctc tttgaagtgg gaggatctcc tgccaacact 541 cgctacctct tcttagggga ctatgttgac agagggtact tcagtatcga atgtgtgctg 601 tatttgtggg ccttgaaaat tctttacccc aaaacactgt ttttacttcg tggaaaccat 661 gaatgtaggc acctaacaga gtatttcacg tttaaacaag aatgtaaaat aaagtattca 721 gaacgcgttt atgacgcctg tatggatgcc ttcgactgcc ttcccctggc tgcgctgatg 781 aaccaacaat tcctgtgtgt acacggtggt ttgtctccag agattaacac tctagatgac 841 atcagaaaat tagaccgatt caaagaacca cctgcttatg ggcctatgtg tgacatcttg 901 tggtcagacc ccctggagga ctttggaaat gagaagactc aggaacattt cactcacaac 961 acagtcaggg gttgttcgta cttctacagt tacccggctg tatgtgactt cctgcagcac 1021 aataatttgt tgtccatact ccgagcccac gaagcccagg acgcagggta ccgcatgtac 1081 aggaaaagcc aaacaactgg cttcccgtct ctaattacga tcttctcggc accaaattac 1141 ttagatgtgt acaataataa agctgcagtg ttgaagtacg agaacaacgt gatgaacatc 1201 aggcagttca actgctcccc ccatccgtac tggctcccaa atttcatgga tgttttcacc 1261 tggtcgctgc catttgttgg ggagaaagtg actgagatgc tggtaaacgt cctgaacatc 1321 tgctcagatg atgaactggg gtcagaagaa gatggatttg acggagccac ggctgcagcc 1381 cggaaggagg tcatcaggaa caagatccga gcaataggca aaatggccag agtattctca 1441 gttctcagag aagagagtga gagcgttcta actctgaagg gcctgacccc gactggcatg 1501 ctccccagcg gagtgctctc tggcgggaaa caaactctgc aaagcgctac tgttgaggcc 1561 attgaggctg atgaagccat caaaggattc tcaccacaac ataagattac cagcttcgag 1621 gaggccaagg gcttagaccg aattaacgag aggatgccgc ctcgcagaga cgccatgcct 1681 tccgacgcca accttaactc catcaacaag gctctcgcct cagagactaa cggcacagac 1741 agcaacggca gtaatagcag caatattcag tgaccacttc ctgttcactt tttttttttg 1801 agctgcaggg catgatgggt ttgctgcatc tcagcagttg gatgttcttg cctctgacgg 1861 tagcttgttt gctctggggg ggccaggaat tggattcagt ttacactatc atgaaaaaaa 1921 aaaagaggga gagagagaga gataataaaa ctatattttg gtgagggtgg tgattaaaca 1981 cctcttttgg gtatgccttt aaaaatgctt ctaggaaaaa aaaagtttta aaaagaaagc 2041 taatgctagt ctatacttca atgttagggg aatgaacacg ttttcctagc gcactgggga 2101 cttttagata ggttaatgaa aggcctttta ttctgttact ggacacgaaa actttgtcta 2161 atttcttata ctctattgta cgtttacagt cgcagcacta aaaatggatg acatcaaaca 2221 tttttaaaca gaaaaaaaag atgtacaaac taaataagga ctatttattg ataatgtttt 2281 gctactcttg tcagacaatg gctataaact gaattaggca gtcttaaaaa aaaaccg // LOCUS BOVPAS 3589 bp ss-mRNA MAM 08-JUN-1990 DEFINITION Bovine poly(ADP-ribose) synthetase mRNA. ACCESSION D90073 KEYWORDS DNA binding protein; helix-turn-helix; nuclear location signal; poly(ADP-ribose) synthetase. SOURCE Bovine thymus, cDNA to mRNA, clone pSO-7. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 3589) AUTHORS Saito,I., Hatakeyama,K., Kido,T., Ohkubo,H., Nakanishi,S. and Ueda,K. TITLE Cloning of a full-length cDNA encoding bovine thymus poly(ADP-ribose) synthetase: evolutionally conserved segments and their potential functions JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Isao Saito Department of Clinical Science and Laboratory Medicine Faculty of Medicine, Kyoto University Shogoinkawahara-cho 54, Sakyo-ku Kyoto 606 Japan Phone: 075-751-3467 Fax: 075-771-4792 Zinc-binding finger motifs are observed in seq. 21-51 and 128-165. The sequences, 200-220 and 250-270, showed helix-turn-helix structure. Nucleotide-binding fold was found in seq. 890-903. FEATURES from to/span description pept 77 3127 poly(ADP-ribose) synthetase (EC 2.4.2.30) signal 224 231 nuclear location signal BASE COUNT 975 a 879 c 1010 g 725 t ORIGIN 1 tggatgagac ggcactagca gcagcgggtc tccaggtcgc ggcgtcgggc ttccggagct 61 gtgagcggtt agaaggatgg cggagtcttc agacaagctc taccgggtcg agtacgccaa 121 gagcgggcgc gcgtcttgca agaaatgcaa agagagcatc cccaaggact cgatccggat 181 ggccttcatg gtggagtcgc ccatgttcga tgggaaaatc ccgcactggt accacctctc 241 ctgcttctgg aaggtcggct tctccatctg gcaccctgat gtcgaggtgg aggggttctc 301 tgagctccgc tgggatgacc agcagacgat caagaagatg gccgagactg gcggacggac 361 agatgtttca ggcaaaggcc aagatggagt tggcagcaag accgagaaga cgttgattga 421 cttcggggca gggtacgcca agtccaacag aagcacgtgc aagagctgca tggagaagat 481 agacaagggc caggtgcgcc tgtctaagaa ggtggtgtac cccgataagc cccagctggg 541 catggttgac tgctggtacc acccaaagtg ttttgttcag aaacgggagg agctgggctt 601 ccgtcccgag ttcagcgcaa cgcacctcat gggcttcagc gtcctcaccg cagaggacca 661 agaaaccctc aagaagcaac tcccggccat caagggtgaa agaaagagaa aaggtgatga 721 ggtggatgga atagatgaag tgaccaagaa gaagtctaaa aaagaaaaag acaaggagat 781 taaacttgaa aaggccctta aggcccagaa cgacctgatc tggaatgtca aggacgagct 841 aaagaaagcg tgctctacga acgacctgaa agagttgctc atcttcaaca agcaggaagt 901 gccttccggg gagtcggcga tcttggaccg cgtggccgac ggtatggtgt ttggcgccct 961 ccttccctgc gaggaatgct cgggccagct ggtcttcaag ggcgacgcct attactgtac 1021 cggggatgtg actgcctgga ccaagtgtat ggtcaagaca cagacgccca accggaagga 1081 gtgggtgacc ccaaaggaat tccgagaaat ctcttacttc aagaaactga agatcaaaaa 1141 gcaggaccgt atattccccc cagagagcag caccccagtg ggggcagcag ccccaccctc 1201 cgcagcttca gcgcctgccg ctgtgcactc tggcccccca gacaagccat tatccaacat 1261 gaagatcctg actctcggga aactctccca gaacaaggat gaagtgaagg ccacgattga 1321 gaaactcggg gggaaattga cagggacggc caacaaggcc tccctgtgta tcagcaccaa 1381 aaaggaggtg gacaagttga ataaaaagat ggaggaagta aaagaagcca acatccgtgt 1441 cgtgtctgag gacttcctcc aagacatctc cgcctccacc aagagccttc aggagttgct 1501 ctccacccac ctcttgtccc cctggggagc cgaggtgaag gtggagcctg ttgaagcagt 1561 gggcccaaag gggaagtcgg gggccgcgcc ctccaagaag agcaagggtc ccgtcaagga 1621 ggaaggtacc aacaaatctg aaaagaggat gaaattaact cttaaaggag gagcagctgt 1681 cgaccctgat tcaggtctgg aacacaatgc acacgtcctc gagaaaggcg ggaaggtctt 1741 cagcgccacc ctcgggctcg tggacatcgt caaagggacc aactcctatt acaagctgca 1801 gctcctggag gatgacaaag agagcaggta ctggatattc aggtcctggg gccgtgtggg 1861 cacggtgatt ggtagtaaca aactggagca gatgccatcc aaggaggatg ccattgagca 1921 ttttatgaaa ttatatgaag agaaaaccgg aaacgcctgg cactctaaaa actttacgaa 1981 gcatcccaaa aagttctacc ctctggagat tgactacggc caggatgaag aggcggtgaa 2041 gaagttgaca gtaaaccctg gcaccaagtc caagctcccc aagccagtgc agaacctcat 2101 taagatgatc tttgatgtag agagtatgaa gaaagccatg gtggagtatg agattgacct 2161 tcagaagatg cccttgggga agctgagcaa aaggcagatc caggctgcat actccatcct 2221 cagtgaggtc cagcaggcac tgtcccaggg cagcagtgac tctcacatcc tggatctctc 2281 caaccgcttc tacaccctga tcccccacga cttcgggatg aagaagcccc cgctgctgaa 2341 caacgcaaac agcgtgcagg ccaaggtgga aatgctagac aacctgctgg atattgaggt 2401 ggcctacagt ctacttaggg gtggttctga tgacagcagc aaggacccca ttgatgtcaa 2461 ctatgagaag ctcaaaactg acattaaggt tgtggacaaa gattccgaag aagccgagat 2521 cattaggaag tatgtgaaga acactcacgc gaccacacac aacgcgtacg acttagaagt 2581 cgtcgacatc ttcaagatag agcgcgaagg ggaaagccag cgttacaagc cgtttaagca 2641 gctgcataac cggaggctgc tgtggcacgg gtccaggacc accaacttcg cgggcatcct 2701 gtcccagggt ctccggatag ccccacctga agcacctgtg acgggctaca tgtttggtaa 2761 gggaatctat ttcgcggaca tggtctccaa gagtgccaac tactgccaca cgtcccaggg 2821 agacccaata ggcttgatcc tgttgggaga agctgccctt ggaaacatgt atgaattgaa 2881 acatgctcga catatcagca agttacccaa gggcaagcac agtgtcaaag gtttaggcaa 2941 aactacccct gacccatcgg ctagtattac tgtggatggt gtggaggtgc ctctcgggac 3001 agggatttca tctggtgtta atgacacctg tctgttgtat aacgagtaca tcgtctacga 3061 cattgctcag gtccacctga agtacctgct gaagctgaag ttcaacttta agacatccct 3121 gtggtgagct gtgaggtgcg caggctggtg actggcgtgc tttgcatgac tggccaggca 3181 gaaacctcag ctgagttgtc gaggcatagc tagcttcact cagtctcctt agaagggatt 3241 ttatatgaac acttggtcac atgtttcaga gcgattttcc cagcttttca aatcccttga 3301 ttgtgttggg cgggggagcg ggcggggggt ggtttcgggg tttttctcct gccagccata 3361 accctggtag gtgcaactga caatagagga gaagctgggg agggtcttac ctgtgtctag 3421 attagtccaa tgaaagagcc tcgcaggcca cattagaaca tctgccttac tggtttcccc 3481 agggaaggaa aaaaataatc ttctaccctt gtttttaagt attcatcttt agttttggaa 3541 aaatgttaag tgtttatttt gaggtcaaaa taaaaactaa tttcatact // LOCUS CHKNMYC 6360 bp ds-DNA VRT 08-JUN-1990 DEFINITION Chicken N-myc protein gene. ACCESSION D90071 KEYWORDS N-myc gene; nuclear protein; oncogene. SOURCE Chicken (strain white leghorn) embryo DNA, clone CNG1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 6360) AUTHORS Sawai,S., Kato,K., Wakamatsu,Y. and Kondoh,H. TITLE Organization and expression of the chicken N-myc gene JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hisato Kondoh Department of Molecular Biology School of Science, Nagoya University Furocho, Chikusaku Nagoya 464-01 Japan Phone: 052-781-5111 x6685 Fax: 052-782-8575 FEATURES from to/span description pept 2872 + 3598 N-myc protein exon 1 pept + 4682 5280 N-myc protein exon 2 pre-msg 2354 6208 N-myc mRNA and introns IVS 2496 2769 intron A IVS 3599 4681 intron B BASE COUNT 1318 a 1648 c 1735 g 1659 t ORIGIN 1 tgagagtaga cctaaaaagg agaacttgac gtagtattgg tggatgaaaa gctgttagtg 61 tgtgcctgaa acccagaagc caaccatatc ctggaccaca tgaaaagaag catggccagc 121 acggcaaggg tggtgattgt ccctctctac tccattctca aattcctgta cccaaagaga 181 aaacatagct atatacatat atgtatatgt gggggttttt ttgtttgttt ttttgctgtt 241 cccaggacta tgtttagcca atgcatcaaa acgcatacaa gcatttgcgg tcacttgagt 301 tggcactgca ctgtgcctcg tgctctggtt gccacccaga tgctgatcac acaccgtggg 361 tttggtggtt gctgtaccat ggatgcaccg ggtgtgaggc agggccttca cactgagatg 421 ggagccccac agctgccatc ttcctcctcc ggactctgtg ctgcagccag gtccaaccag 481 ctccatctgg ccagcccctc tgtatgcttt cagctctgca gcggacatcc aaacaacagt 541 gtgaacgctt aaggcgacct caccttcctg gaagggcctt gatacctcag tcttggccac 601 ccaagcacaa catggaggca agcgggcatc accccaggga acagcgtagc ccaggggctg 661 gagcgcaatg gggcaacgcg gtgaaggtgc acggtgagga actattaccg tctggctctg 721 ttctcagaaa caaactcctc ctcagcgact ctcacacaga ccctccctca gagctgccac 781 ttttgaagcc gccgggctca ccccgggcgg ctcgggagcg gcgccccgcc cggccgcaca 841 caaacggcgc cggcagcctt cctgcgagca acgcccacgt ccccgcctca acctgcagcc 901 tatgagcggg taggcgcctc cccaagcggg cgccggaccg ggatcgcaca gccgaggacg 961 gcgccgcgct gccgtaccgc ggtctccccg ccgctgtccg cgcccgcggg gaaggaggca 1021 gagcggtggg tgtccgccgc cggggggacg cccggccgcc gcctccggcc ctccgcaagg 1081 tgtcgccgcg cgcagcggaa cgccgggtcc ggccgggagc tgccgtgcgg tgcggcgggg 1141 ccggaggtgt gcgcggcgag cggggaggcg atcaccgcag gacgagcgct agttctttat 1201 aaagcccaga gcgagggcga ccccgcgttc cagacacttc cattattcct ctcctttatt 1261 agtatttttt tcacctctcc ctcggtcttc gcagataggg ccaccgctct cacgcagcgc 1321 cgcgtccccc cggggagggg cggccccgca cccccgggcg tgccgggcgc ctcgcaccgc 1381 gttggagaat aaagagcgaa tggggctttt ggctcgcaag cggaggcacc tccctcgtct 1441 atggaaatca gggggtgggg ttacgccgcg ctcccccccc cccttttttt ttttttttta 1501 atgctccccg ggtgtgtcag ctatctttat ttttttattt tttttttcct cccctccctt 1561 ccccccaccc cgccactcct ccccctcgca acacccaccg ccggcagcgg ggatttaaga 1621 ggaggaggaa gcggcagggt ctgggctgcc agcagcggga ggacaaaacc cccatgaaat 1681 aaatacaatc ccacatccag ctgggaggat gggacgggag cggcagcctc cgccggccct 1741 caatgggggt ttcattgagc ggatatttgg gctgtgctgc ggagcggtcg caccgagctc 1801 gccgcactct gtaggtgcct ctttgctctt cgcattttta acggtttcgt gtatttcggg 1861 agcagggaag cgcacaatgc tggaaaggag cagggggcgg cgcgggggcg agcagctgag 1921 ccgcgtttcg gtgcatttca tcagagcgct ttgcacgact gtgtgtgtgt gtgtgtgttt 1981 tttttttttt tttcctcccc cttttgctct ccgtcagctg ttcgcaaccc agcacacgtt 2041 ttcccttcgc tgggtcggga ctaagagctc atttcggttt caatgcccga tgcagaggag 2101 catggggctg cgggtgagcg gccgcgggta gcgcgggggc accgaggtaa gggcgctgct 2161 cccgggggcg aaccgtgtgg ctcttgtttt aacctgcaaa gagggagaaa aacttcctgt 2221 agcgagtcag gctttgttgg ggtgtgttca tcgtttggtg tgcgtcggtg tttgctgctg 2281 ctggattccc ctgatggagt tgcacggctc ggggagcccc ttctttgcat ccttctctcc 2341 ccactacact ggaaatgaca aaaggagtaa acttcctttg cttgcggtcg cgtataggag 2401 ggtttctgca gggaggcgtg catctgcccg cggctgtgct gctctgagct ctggaggcgg 2461 tcgggcaggg cgcggagctc cgtgcaggac cggaggtgtg tgtgtgtgtg tgtgtgtgtg 2521 tgtgtgtgtg caccctggag cggctcacgg ccgctgcctg cggcgtccga gaaggtgcgg 2581 tgcgttcctc cttaacttgt aaaggctcct ggtttgtttg ggtttttttt tccctatcca 2641 taatggaatt gtgttttttt atttttcctt tttttttttt tttccccact acataatagg 2701 gaattacagg cgtctttttt tttcgtgttt taattgtgtt cgtttcttta tttccatttt 2761 ctgctttagt ttcaggtcct ggacacacct ctcctctcct tggtggaact accgtaaaac 2821 ccgcctctaa cacgagcccc cagcccggga atcgacacgg agagccccgc catgccggga 2881 atgatcagca agaacccgga cctcgagttc gattccttac agccttgttt ctacccggac 2941 gaagatgatt tctatttatg cgggccggac tcggcccccc ccggggaaga catctggaag 3001 aagttcgagc tgctgcccac ccctccgctg tcccccagcc gggccgggct gcaggagcca 3061 ccccccgggg ggggctccat tgccgtgggg aggggcggac ctggggagtg ccgccccgtt 3121 gatcccctgg actgggcgtc cgagttgctc ctcctgcccc ccgaggccga gctgtggggc 3181 agcacggacg gcgcggattt cttcgagacg ggcctcgggg cgagcaacaa cctcaactcc 3241 atcatcatcc aggactgcat gtggagtgcg ttctcggccc gcgagaagct ggagcgggcg 3301 gtcagcgaga agctgcagag caagcctccc gccgccgccc cgccgcctcc gcccccggtt 3361 gtacccaccg ccgcttgtcg ccgccgcgaa cagccccagc gcggtcccgg ccgcgcggag 3421 ctgggcggct ccgtgcccga gtgcgtggac ccggccgtgg tcttcccctt ccccgtcaac 3481 aagcgggagg cggcggtgcc aagcggcggc gagaccccgc ggggcgggcg ccggccgggt 3541 cccgcggggg agagccgggc cagcagcagc tccggggacg acacgctcag cgactcgggt 3601 caagcgcgcg gctgacgggt gggcggtgcg gggtcgggcc gccgcagcgc acactcaaca 3661 gccgcggggc ccgtgcgcgc cccacggggc tgggttctcc ggggctgtgg gccaacagca 3721 aggacggtgt gcccgggctc gggctgtgcc gtgcggctcc tcgggcttcc catgtcctca 3781 tgtccatcgg ggtgtgtgtg cgccattctg ggggccgttt tggggagcgt gtggtcagaa 3841 gcgaggggca gtgccatgct gaaccacgtt cggtggggga tgtttacggc gttgggaggc 3901 tggaacagag cggcagcccg cagagggtcc ttgtctgtac ggtgacagag cctaaaatgc 3961 cctcagatag ttcccatcgg agaaatccgg ttcggcagtg tagttctggc tgtgtaagtg 4021 agcggtccgt atgatcgtca ccataaactt ggcgatccgt ctcggctcgc ggagcctact 4081 gtccctttaa tgtctggttt gacagctttg ggtgaggaag cacttccaac agctgtcttc 4141 ttggcactgc accaagcgcc ggttaaaggg tccccggcgg gagcagcttc accttctgca 4201 ctagaacaaa cccagcgatt gtttcgtttc ccgtctgctt ttccaccaac ccagggctgt 4261 gttgtggctc tgtacgtgca gtttgtgccc caggtctgca gctgtagtaa aaatgaatgc 4321 attttgtatg aaaacttttc gccactggga ttagttgggt ggtagcccca ctttcattta 4381 aaatgagaag ttgttaaaga aaaggagttg gagacaatcc ctatgggtcc cttcaaactc 4441 ggggtattct atgaaaaaca aacctcataa agttccccca aacttgcaga gatctatttt 4501 gtttagtttt gtgtatgtta tatacgtctc tagaactata tttgattatg gacagtgagc 4561 acaccctatt ttttttacca tcagttattt tattggtatt tctaatgtgc atatcttgtc 4621 aataaatatt attgcttttg agagaatatc aatttctttt tccattttct ttaaccgaaa 4681 gatgatgaag acgaggagga agaggatgat gaagaagaaa tagatgttgt gacagtggag 4741 aaaagacgct cctcctccaa caaggctgtt accaccctca ctattacagt gcgtcctaaa 4801 aataccactt ttccatcggt caggacacag cagaatgaac tgattttaaa gcgttgtgcg 4861 ccaattcacc agcagcataa ttatgccgct ccttctccgt atatggagag tgaagatgtg 4921 ccaccacaga aaaagttaaa agccgaggtg ccccgtccag taaaacccat gatccaacca 4981 aagtctaaga gttcaagtcc tcgaaactct gattcggagg acagcgaacg tcgacgtaac 5041 cataatatct tggagcgtca gaggcgtaat gatctgcggt ccagtttcct cacgttaagg 5101 gaccacgttc cagaacttgt taaaaatgag aaagctgcaa aagttgtgat tttgaaaaaa 5161 gccactgaat atgtccattc ccttcaggca gaggagcaga agttattgct agaaaaggag 5221 aaattgcaag ccaggcaaca acagttgcta aagaaaatag agtacaagcg gacttgctaa 5281 acttcttttt ctcgtttttt ttcttttctt tttggttttt tttttttttt ttggctgacc 5341 aggacagtca ttgccacttt gcacattttt gattctttaa aaaaaaaaaa aaaattgtgt 5401 tttttgacgt taagaatgtt ggttttactt tcaattcagt ccctgaagta attgacaaac 5461 tttattatcc gggtacgagc aaatgggtgt tcttgcaaga agtttattgc gagactacca 5521 aacacaatgg actgcctttg tttttcattt taagaactgt agatggtgga tttttttaat 5581 tttttttaaa ttgttgtgag catttggagc tgctgatgac atctagttga gttgaaaaca 5641 tccattccta atttttatgg tgcttatgtt ctaacagatg ttactttagg gggttggcat 5701 ttgtacccct ctggaatttt ctgtaaatac catctacaca cttgcctttt gtacatgtct 5761 tgggttatga gaggtggctt ttgctgccag tattagactg gaagttcata cctaagtact 5821 gtaatacctc aatgtttgag gagcatgttt ttgtatacaa atatattgtt aatctctgtt 5881 atgtactgta ctaattctta cattgcctgt atactttagt acgtcgctga tacataacta 5941 aatttgatac ttatattttc gtatgaaaat gagttgtgaa agttttgagt agatattact 6001 ttatcacttt tttgaactaa gaaacttttg taaagaaatt tactatatat gccttttcct 6061 agcctgtttc ttccagttaa tgtatttgtt aatgtttggt gcatagaact gggtaactgc 6121 aaagttctgt gtttaatttc ttccaatggt gtacatttag tgctgcgtct tatagcactt 6181 tgaaatacct catgtttatg aaaataaata gcaattacat gatgtgccat ttactatttt 6241 ttcttttaaa tggagttcct tgaaataaag tttttaactt tgcaatgttt atgaaataaa 6301 aacctagtcc catatgtaag gatttgaact ctcaagtata ggtgatgttt acggacaagc // LOCUS BMEGDH1 2834 bp ds-DNA BCT 08-JUN-1990 DEFINITION B. megaterium glucose dehydrogenase gene and ORFs. ACCESSION D90043 KEYWORDS glucose dehydrogenase. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2834) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan. FEATURES from to/span description pept 1964 2749 glucose dehydrogenase (EC 1.1.1.47) ORF 181 867 ORF1 ORF 1086 1946 ORF2 signal 89 94 put. -35 region for ORF1 signal 116 121 put. -10 region for ORF1 signal 167 172 ORF2 ribosome binding site signal 893 928 termination signal signal 972 977 put. -35 region for ORF2 signal 997 1003 put. -10 region for ORF2 signal 1070 1076 ORF2 ribosome binding site signal 1949 1954 glucose dehydrogenase ribosome binding site signal 2751 2783 termination signal for glucose dehydrogenase BASE COUNT 896 a 442 c 644 g 852 t ORIGIN 1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga 61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag 121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa 181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg 241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt 301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat 361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca 421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat 481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag 541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta 601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat 661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct 721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt 781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt 841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg 901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt 961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt 1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt 1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt 1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt 1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg 1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta 1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca 1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg 1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag 1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa 1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat 1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca 1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg 1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc 1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga 1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag 1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga 1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg 2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa 2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt 2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt 2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt 2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga 2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt 2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa 2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat 2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg 2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc 2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc 2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca 2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga 2821 ctggtgttga attc // LOCUS BMEGDH2 1202 bp ds-DNA BCT 08-JUN-1990 DEFINITION B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene. ACCESSION D90044 KEYWORDS glucose dehydrogenase; isozyme. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1202) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura Department of Fermentation Technology Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan Phone: 06-877-5111 x4373 Fax: 06-876-9036 FEATURES from to/span description pept 125 910 glucose dehydrogenase signal 27 32 put. -35 region signal 49 55 put. -10 region signal 111 116 SD sequence signal 928 959 termination signal BASE COUNT 427 a 190 c 268 g 317 t ORIGIN 1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa 61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt 121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt 181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg 241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc 301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc 361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt 421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg 481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa 541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc 601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa 661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa 721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg 781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt 841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg 901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt 961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg 1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa 1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg 1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg 1201 ag // LOCUS BMOPTTHP1 1023 bp ss-mRNA INV 08-JUN-1990 DEFINITION B.mori PTTH mRNA. ACCESSION D90082 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to mRNA, clones P1, P2, C2, C9 and C19. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 1023) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept 34 708 preproPTTH matp 379 708 PTTH subunit signal 768 773 polyadenylation signal signal 826 831 polyadenylation signal signal 997 1002 polyadenylation signal signal 1004 1009 polyadenylation signal BASE COUNT 358 a 179 c 186 g 300 t ORIGIN 1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc 61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg 121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac 181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg 241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta 301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa 361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct 421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc 481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc 541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct 601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg 661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt 721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata 781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc 841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt 901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca 961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt 1021 aaa // LOCUS BMOPTTHP4 944 bp ss-mRNA INV 08-JUN-1990 DEFINITION B.mori preproPTTH mRNA. ACCESSION D90083 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to mRNA, clone P4 and C21. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 944) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept < 1 631 preproPTTH matp 302 631 PTTH subunit signal 691 696 polyadenylation signal signal 749 754 polyadenylation signal signal 918 923 polyadenylation signal signal 925 930 polyadenylation signal BASE COUNT 337 a 163 c 177 g 267 t ORIGIN 1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt 61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag 121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga 181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa 241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag 301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa 361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg 421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag 481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga 541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag 601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt 661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa 721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag 781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt 841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata 901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa // LOCUS HUMMTSDHB 958 bp ss-mRNA ORG 08-JUN-1990 DEFINITION Human mitochondrial succinate-ubiquinone oxidoreductase (EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA. ACCESSION D90047 KEYWORDS Ip; complex II; iron sulfur subunit; sdh B; succinate-ubiquinone oxidoreductase. SOURCE Human liver mitochondrion, cDNA to mRNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 958) AUTHORS Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M. TITLE Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning of iron sulfur(Ip) subunit of liver mitochondria JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kiyoshi Kita Department of Parasitology Juntendo University 2-1-1,Hongo Bunkyo-ku, Tokyo 113 Japan Phone: 03-813-3111 x3542 Fax: 03-814-9300 FEATURES from to/span description pept < 1 789 succinate-ubiquinone oxidoreductase (sdh B) (AA at 1) site 74 95 iron-sulfur binding site I site 167 179 iron-sulfur binding site II site 224 236 iron-sulfur binding site III BASE COUNT 319 a 212 c 204 g 223 t ORIGIN 1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt 61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg 121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag 181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt 241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac 301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat 361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag 421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag 481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc 541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg 601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc 661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat 721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct 781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt 841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata 901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HUMNCAW 2287 bp ss-mRNA PRI 08-JUN-1990 DEFINITION Human nonspecific cross-reacting antigen (NCA-W272) mRNA. ACCESSION D90064 KEYWORDS CEA; CEA gene family; PI-anchored membrane protein. SOURCE Human white blood cells, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2287) AUTHORS Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and Matsuoka,Y. TITLE Characterization of a cDNA clone encoding a new species of the nonspecific cross-reacting antigen (NCA), a member of the CEA gene family JOURNAL Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Fumiko Arakawa First Department of Biochemistry School of Medicine Fukuoka University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone: 092-801-1011 x2892 Fax: 092-801-3600 FEATURES from to/span description ORF 87 1136 nonspecific cross-reacting antigen ORF BASE COUNT 618 a 593 c 453 g 623 t ORIGIN 1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag 61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc 121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca 181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc 241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa 301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag 361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg 421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg 481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca 541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc 601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc 661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac 721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga 781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag 841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt 901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa 961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca 1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag 1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc 1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa 1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact 1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc 1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa 1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt 1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc 1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga 1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga 1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc 1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt 1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt 1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa 1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca 1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat 1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga 2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt 2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac 2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc 2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg 2281 gctgatc // LOCUS PIGDESTN 1681 bp ss-mRNA MAM 08-JUN-1990 DEFINITION Porcine destrin mRNA. ACCESSION D90053 J05290 KEYWORDS actin-binding protein; cofilin; destrin. SOURCE Pig adult brain, cDNA to mRNA, clone PD2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1681) AUTHORS Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S., Iida,K. and Yahara,I. TITLE Destrin, a mammalian Actin-Depolymerizing Protein, Is Closely Related to Cofilin --Cloning and Expression of Porcine Brain Destrin cDNA JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Moriyama Department of Biophysics and Biochemistry Faculty of Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan Phone: 03-821-2111 x4408 FEATURES from to/span description pept 54 551 destrin signal 1645 1660 polyadenylation signal BASE COUNT 507 a 322 c 372 g 480 t ORIGIN 1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt 61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt 121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag 181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa 241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg 301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc 361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg 421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc 481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat 541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc 601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg 661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct 721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga 781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt 841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt 901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt 961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac 1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca 1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca 1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt 1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta 1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag 1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc 1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg 1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga 1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt 1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta 1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taacttaaaa aaaaaaaaaa 1681 a // LOCUS RATPMP70X 3324 bp ss-mRNA ROD 08-JUN-1990 DEFINITION Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA. ACCESSION D90038 J05256 KEYWORDS PMP70; peroxisomal membrane protein. SOURCE Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181, 189,201]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3324) AUTHORS Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T. TITLE The 70-kDa Peroxisomal Membrane Protein Is a Member of the Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Keiju Kamijo Department of Biochemistry Shinshu University School of Medicine Matsumoto 390 Japan Phone: 263-35-4600 x5182 Fax: 263-33-6458 FEATURES from to/span description pept 36 2015 peroxisomal membrane protein (PMP70) signal 3271 3276 polyadenylation signal (put.) BASE COUNT 921 a 658 c 773 g 972 t ORIGIN 1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga 61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc 121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg 181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac 241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta 301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac 361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact 421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg 481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag 541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc 601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta 661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg 721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa 781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta 841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa 901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct 961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg 1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca 1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc 1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg 1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta 1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg 1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa 1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg 1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat 1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc 1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa 1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg 1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg 1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac 1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca 1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc 1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa 1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt 2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta 2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca 2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat 2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg 2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga 2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct 2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt 2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat 2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt 2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt 2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg 2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac 2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg 2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg 2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca 2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc 3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg 3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa 3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt 3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat 3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag 3301 attaaaaaaa aaaaaaagga attc // LOCUS VACSANT 1525 bp ds-DNA VRL 08-JUN-1990 DEFINITION Vaccinia virus surface (S) antigen gene. ACCESSION D90076 KEYWORDS S gene; surface antigen. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxviridae; Orthopoxvirus. REFERENCE 1 (bases 1 to 1525) AUTHORS Ueda,Y., Morikawa,S. and Matsuura,Y. TITLE Identification and nucleotide sequence of the gene encoding a surface antigen induced by vaccinia virus JOURNAL Virology (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Yoshiaki Ueda National Institute of Health Gakuen, Musashimurayama Tokyo 190-12 Japan Phone: 0425-61-0771 Fax: 0425-65-3315 FEATURES from to/span description pept 382 1437 surface antigen S BASE COUNT 568 a 229 c 258 g 470 t ORIGIN 1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata 61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat 121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa 181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat 241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac 301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata 361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca 421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc 481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt 541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt 601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa 661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta 721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca 781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc 841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga 901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat 961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta 1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat 1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa 1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact 1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga 1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc 1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt 1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata 1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt 1501 gctatagaat gaatgaggat acgcg // LOCUS YSCA1 881 bp ds-DNA PLN 08-JUN-1990 DEFINITION S. cerevisiae acidic ribosomal protein A1 (YSCA1). ACCESSION D90072 X13682 KEYWORDS acidic ribosomal protein; ribosomal protein. SOURCE S. cerevisiae (strain IFO-40028) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 315; 631 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989) STANDARD full staff_entry REFERENCE 3 (bases 277 to 742) AUTHORS Tsurugi,K. and Mitsui,K. TITLE cDNA and deduced amino acid sequence of acidic ribosomal protein A1 from Saccharomyces cerevisiae JOURNAL Nucleic Acids Res. 16, 3574-3574 (1988) STANDARD simple automatic COMMENT These data kindly submitted in computer readable form by: Kazuhiro Mitsui Department of Biochemistry Yamanashi Medical college Tamaho, Nakakoma-gun Yamanashi 409-38 Japan Phone: 0552-73-1111 x2257 FEATURES from to/span description pept 313 633 acidic ribosomal protein A1 signal 125 135 UASrpg box1 signal 182 192 UASrpg box2 signal 716 721 poly(A) signal variant 303 303 a in [1]; g in [3] variant 684 685 tt in [1]; t in [3] variant 719 719 a in [1]; t in [3] BASE COUNT 270 a 165 c 157 g 289 t ORIGIN 1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt 61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca 121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa 181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca 241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa 301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac 361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca 421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac 481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct 541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc 601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg 661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa 721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg 781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc 841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c // LOCUS YSCCDC23X 3107 bp ds-DNA PLN 08-JUN-1990 DEFINITION S. cerevisiae CDC23 protein gene. ACCESSION D90081 KEYWORDS CDC23 protein. SOURCE Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line D22) DNA, clone YX34. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3107) AUTHORS Doi,A. and Doi,K. TITLE Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces cerevisiae JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Doi The Institute of Scientific and Industrial Research Osaka University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111 Fax: 06-877-4977 FEATURES from to/span description ORF 765 2645 ORF for CDC23 site 153 161 calcium-binding site BASE COUNT 1010 a 595 c 629 g 873 t ORIGIN 1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact 61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt 121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc 181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt 241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg 301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct 361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt 421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga 481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag 541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat 601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact 661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc 721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc 781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac 841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag 901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag 961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt 1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt 1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc 1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta 1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc 1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga 1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag 1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa 1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact 1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat 1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta 1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg 1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc 1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg 1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt 1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg 1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact 1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt 2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca 2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca 2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt 2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg 2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca 2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat 2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg 2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa 2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta 2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata 2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat 2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc 2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact 2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc 2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc 2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga 3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga 3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag // LOCUS RATLIPAA 3225 bp ss-mRNA ROD 08-JUN-1990 DEFINITION Rat mRNA for hormone sensitive lipase. ACCESSION X51415 J03087 KEYWORDS hormone sensitive lipase; lipase. SOURCE Rat adipose tissue cDNA to mRNA. ORGANISM Rattus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3225) AUTHORS Holm,C., Kirchgessner,T.G., Svenson,K.L., Lusis,A.J., Belfrage,P. and Schotz,M.C. TITLE Nucleotide sequence of rat adipose hormone sensitive lipase cDNA JOURNAL Nucleic Acids Res. 16, 9879-9879 (1988) STANDARD full staff_entry REFERENCE 2 (sites; for [1]) AUTHORS Holm,C., Kirchgessner,T.G., Svenson,K.L., Fredrikson,G., Nilsson,S., Miller,C.G., Shively,J.E., Heinzmann,C., Sparkes,R.S., Mohandas,T., Lusis,A.J., Belfrage,P. and Schotz,M.C. TITLE Hormone-sensitive libase: Sequence, expression, and chromosomal localization to 19 cent-q13.3 JOURNAL Science 241, 1503-1506 (1988) STANDARD full staff_entry FEATURES from to/span description pept 616 2889 hormone sensitive lipase signal 3209 3214 polyA signal (put.) signal 3225 3225 polyA signal BASE COUNT 701 a 952 c 916 g 656 t ORIGIN 1 gaagaaactc gtaatggggc ttgagtgtgt gggtaaggaa gggaactgct gagcgtccag 61 atttcccaga acaagaagga caagtccaaa gacaataaac aaagatagga gttcatcgtt 121 gaatacctgg aaggaagaag gaagagggtg ggcagcttct gggatagaga agcagattct 181 ttggagttca ggagattgga ctcctagatc caggaaagag ggcaaaagcc atggacagtg 241 ggtttctgat tctgaagagg agagataggc aggggctgct tagcctgggg cttcgactcc 301 cagccgatag gcagaaagag gatgaggccc cagacttcta gaagacaagg agataaattc 361 cgaggtgtga gagagaagat aatggaaagc tcctgcatct ccaggaggct tggaacagac 421 cggtgaacct gaggaaggag agggctacgg ctttggactc ctgaatctcc tgtatcttgg 481 cagggatggt cctcggtttg aatactgggt tgggcctagg gagagagagg taggagctgt 541 acccctgggc ctgacagagc aaagagtaat cccagacctt cccatcttcc cgcagcctcg 601 gcattctcac acagcatgga tttacgcaca atgacacagt cgctggtggc gctcgcagaa 661 gacaacatgg ccttcttctc aagccagggc ccaggagaga cagcacggcg gctgtccaac 721 gtctttgcag gtgttcggga acaggcactg gggctagaac caactctagg tcagctcttg 781 ggtgtggcac accatttcga cctggacact gagacaccag ccaacggata ccgtagcttg 841 gtgcacacag ctcgttgctg cctggcacac ctactacaca aatcccgcta tgtggcctct 901 aaccgcagaa gtatcttctt ccgtgccagc cacaacctag cagaactgga ggcctacctg 961 gctgccctca cccagctccg tgctctagcc tactacgccc agcgcctgct gaccatcaac 1021 cgaccaggag tgctcttctt tgagggcgat gagggactca gcgctgactt cctacaggac 1081 tatgtcacgc tacataaagg ctgcttttac ggtcgctgcc tgggcttcca gttcacacct 1141 gccatccggc cgttcctgca gactctctcc atcgggctgg tgtccttcgg ggaacactac 1201 aaacgcaacg agacgggcct cagtgtgact gcaagttccc tctttacggg tggccgattc 1261 gccatagacc cagagttgcg tggggctgaa tttgagcgca tcatacagaa cctggacgtg 1321 cacttctgga aagccttctg gaatatcacg gagatcgagg tgctatcgtc tctggccaac 1381 atggcatcaa ccaccgtgag ggtaagccgc ctgcttagct tgccgcctga ggcctttgag 1441 atgccactaa cctctgaccc caagctcaca gttaccatct cacctccctt ggcacacacg 1501 ggaccaggcc ctgtgctagc caggctcatc tcctatgacc tgcgggaagg gcaggacagc 1561 aagatgctca acagcctggc aaaatctgag ggcccacgcc tggagctgcg cccacggcct 1621 caacaagccc cccgctcacg agccctggtt gtccacattc acggtggtgg ctttgtggca 1681 cagacctcca aatcccatga gccctacctc aagaactggg cccaggagct aggcgtcccc 1741 atcatctcca tcgactactc cctggcccct gaggccccct tcccccgagc actggaggag 1801 tgtttttttg cctactgctg ggctgtcaag cactgtgaac tgcttggttc aaccggagag 1861 cggatatgcc ttgcggggga cagcgcaggt gggaacctct gcatcaccgt gtcccttcgg 1921 gcagcagcct atggggtgag ggtgccagat ggcatcatgg cagcctaccc agttaccacc 1981 ctgcagtcct ctgcttctcc ctctcgtctg ctgagcctca tggatcctct tctaccactg 2041 agcgtactct ccaagtgtgt gagcgcctat tcagggacag agacggagga ccattttgac 2101 tcagaccaga aggcattggg cgtgatgggg ctggtgcaga gagacacgtc cctgttcctc 2161 agagacctcc gcctgggcgc ctcctcatgg ctcaactcct tcctggagtt aagtgggcgc 2221 aagccccata agaccccagt tgcctgcaac agagacactg cgccccacgg attctgggcg 2281 cttacggagt ctatgcgcag gagtgtgtct gaggcagccc tggcccagcc tgagggcttg 2341 ctgggcacag attccttgaa gaagctaaca ataaaggact tgagctttaa gggcaactca 2401 gagccatcag acagccccga gatgtcacag tcaatggaga cacttggccc ctccacaccc 2461 tcggatgtca acttttttct gcgatccggg aattcccagg aagaggctga aaccagagat 2521 gatataagcc ccatggacgg aatcccccgc gtgcgcgctg ccttccctga tggtttccac 2581 ccacggcgct caagccaagg tgtcctccac atgcccctct actcgtcacc catagtcaag 2641 aaccccttca tgtctcctct gctggcccct gacgtcatgc tgaagaccct gccgcccgtg 2701 caccttgtgg cgtgcgctct ggaccccatg ctggatgact cggtcatgtt cgcgcggcga 2761 ctgaaggacc tgggccagcc cgtgacgctg aaagtggtag aggacctgcc gcatggtttc 2821 ctgagcttag cagccctgtg tcgtgagacc ggcaggccgc ggagttgtgc gtgcagcgca 2881 tccggctgat cctcaccccc cctgctgcac cgctgacctg agctgggaac ggcggggggc 2941 ggcactaaaa gacctcttgc acccttctgc gcgggcttcc gtgatgagtg cgctccgagt 3001 cgggctctag gccccctcgg tcgggctggg cgggaggggg cgggccgtgc cttaaccata 3061 agtggggcgg ggcaggggcc aaaagctgaa cctgggggag ggggacgcac acacacacac 3121 ctgtcaccga gacagctgga cctgcactct accactgcct tctactgctg ctgtgaccga 3181 ccaccgctag tcggttttgc tttttgtaaa taaaagttat ttaat //
GenBank-Updates@genbank.bio.net (06/27/90)
LOCUS KAEASTAB 3324 bp ds-DNA BCT 27-JUN-1990 DEFINITION K.aerogenes arylsulfatase (atsA) and atsB genes, complete cds. ACCESSION M31938 KEYWORDS arylsulfatase. SOURCE K.aerogenes (strain W70) DNA, clone pAS123. ORGANISM Klebsiella aerogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3324) AUTHORS Murooka,Y., Ishibashi,K., Yasumoto,M., Sasaki,M., Sugino,H., Azakami,H. and Yamashita,M. TITLE A sulfur- and tyramine-regulated Klebsiella aerogenes operon containing the arylsulfatase (atsA) gene and the atsB gene JOURNAL J. Bacteriol. 172, 2131-2140 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.Murooka, 02-FEB-1990, for release after publication. FEATURES from to/span description pept 452 1669 atsB protein pept 1685 3079 arylsulfatase (atsA) (EC 3.1.6.1) mRNA 420 3172 atsB and atsA mRNA (alt.) mRNA 421 3172 atsB and atsA mRNA (alt.) signal 383 388 atsB -35 region signal 408 413 atsB -10 region binding 441 444 atsB ribosome binding site binding 1672 1675 atsA ribosome binding site rpt 3118 3142 inverted terminal repeat, copy A rpt 3148 3172 inverted terminal repeat, copy B BASE COUNT 758 a 994 c 928 g 644 t ORIGIN 1 cggctggctt ttaacaaagc agacatgata gggcctcccc gtcgagatga tggtacatga 61 cacatggatg actactgata attgaagcat acatttcaca tttaagtaac acaatcatta 121 acatattaac aatgtgatag ccgttacaaa attttcagca atacattgtt atataaagca 181 tttaaaaaac accccagcca tgctttacga tagcggcgtt aaaacataaa caaataagat 241 taacacccgg cgtacctgcc cctggtgata aaaaaaccct attcccgacg cgtgattctc 301 atcctaaaga acgatggatt tttgtttttt ttcacttata cccttaacag gatgcattta 361 attttcgccg cggcttaatc atgtgattac gcctgacccg ctgttgccaa aataatatca 421 acagtaccgg tcattaaccg aggataagcc gatgctgaat atcgccgccc tgcgccagca 481 gcaaattccg ctggccgctg agccgcgctc gccggtgccg tttcatattc tgatgaagcc 541 gattggcccc gcctgcaatc tcgcctgccg ctattgctat tacccgcagg acgaaacgcc 601 ggtcaacaag atggatgacg cgcggctgga gcagtttatc cgccgctata ttgccgccca 661 gcccgccggc gcgcgggaaa tcaactttgt ctggcagggg ggcgagccgc tgctggccgg 721 cctgagcttc tacaaaaaag cgctcgccct gcaggcacgc tatgcccccg acggcgtgac 781 tatcagcaac agcctgcaga ccaacgggac gctgatcaac gacgcatggt gccgactgtt 841 ccgcgaacat ggctttatta tcgggttgag cctcgaaggc aacgaagcgc tgcaggacta 901 ccatcgtccg gataaacgcg gccggtcgac ctggtcggcg gcgctgcgcg gcattgacct 961 gctccatcag catcaggtgg actttaatct gctggtggtg gtgcataacg agatggcggc 1021 ccacgcggcg gcgatttatg accggctggt cagcctcggc gcgcgctatc tgcagtttca 1081 gccgctgatg agcgaaggcg cggccctgcg cgaaggatac cagctcagcg ccgataactg 1141 gggacgtttt atggtcggca tctggcgaca gtggcggaag cgctgcgata gagggcgggt 1201 gttcgttatc aatatcgaac aggcgtgggc gcagtatttc actcatacca gcggcagctg 1261 cgtgcacagc gcccgctgcg gcagcaacct ggtgatggag cccgacggac agctctacgc 1321 ctgcgatcat ctgatcaacg ccgaacatcg gctgggtcgc cttgacgagc agacgcttgc 1381 cgccgcagtt gacgcctcgg tgcaactgcc tttcggtcag cagaaaagtc tgcgccgcga 1441 atgccagact tgctcggtaa aaatggtctg ccagggcggc tgcccggcgc atctcaacgc 1501 cgcaggcaac aaccgcctct gcggaggcta ctaccgcttc tttagcgaca tcctcgcgcc 1561 attacgcccc ttttcccgcg accttaatgg cctgaaggcc tggcgggccg cgtttgttgg 1621 gactgccgca tactgcgtag caccttaccc tgatgatatc cccctctgaa caggagagtc 1681 agtcgtgaac aaaaaagcca tggccgcggc ggtcagcatg atcctcgccg gtggcgcgca 1741 cgccgcgcag caggagcgtc caaacgtcat cgtgattatc gccgatgata tgggctactc 1801 ggacatcagc ccctttggcg gcgagatccc cacccccaac ctgcaggcga tggccgagca 1861 gggaatgcgc atgagccagt attacacctc gccgatgtcg gccccggcgc gctcaatgct 1921 gctcaccggc aacagtaacc agcaggctgg gatgggcggc atgtggtggt acgacagcac 1981 cattggcaag gagggctacg agctgcggtt gaccgaccgc gtcaccacca tggccgagcg 2041 ctttaaagac gcggggtata acaccctgat ggccggtaaa tggcacctcg gttttgttcc 2101 cggcgccacg ccgaaagatc gcggctttaa tcacgccttc gcctttatgg gcggcggcac 2161 cagccacttt aacgacgcga taccgctggg gaccgttgaa gccttccaca cctactacac 2221 ccgcgacggc gagcgcgtct ccctaccgga tgatttttac tccagcgaag cctacgcccg 2281 ccagatgaac agctggatta aagcgacgcc gaaggaacag ccggtcttcg cctggctggc 2341 cttcaccgcc cctcatgatc ccctgcaggc gcctgacgag tggattaagc gcttcaaagg 2401 ccagtatgag cagggttacg ccgaggtcta tcgccagcgc atcgcccgcc tgaaagcgct 2461 gggaattatc catgacgata ccccgctgcc gcatctggag ctggataaag agtgggaagc 2521 gctaacgcca gagcagcaga aatatacggc gaaagtgatg caggtgtatg cggcgatgat 2581 cgccaatatg gacgcgcaaa tcggcaccct gatggagacg ctgaaacaaa ccgggcgcga 2641 taaaaatacc ctgctggtct ttttaaccga taacggcgcc aacccggcgc agggtttcta 2701 ctacgaatct accccggaat tctggaagca gttcgataac agctatgaca acgtcggccg 2761 caaaggatca tttgtctcct acggcccgca ctgggccaac gtcagcaacg ccccctacgc 2821 caattatcac aaaaccacca gcgcccaggg cggcatcaat accgacttta tgatctccgg 2881 tcccgggatc acccgccacg gtaaaatcga cgcctcgacg atggcggtgt atgacgtggc 2941 gccgacgcta tatgaattcg ccggcatcga tccgaacaag tcgctggcga aaaagccggt 3001 gttgccgatg atcggcgtca gtttaagcgc tatctcaccg gcgaagtaca ggagccgccg 3061 cgccgaacta cggggttgaa ctgcatcatc aggcggcctg ggtcgatggc gaatggaagc 3121 tgcgacggct ggtgccgcgc ggcctcaccg ccggcgacgc gccgtggcag ctatttaatc 3181 tgcacgacga cccgctggag acgcatgatg tcgcggccga acatccgatc gggtcaaagc 3241 catgagcgag gcctacgagg catttgctaa gcgcaccatg gttaccaaag cgcagggcaa 3301 aatgaatgct acgtcggtat cgat // LOCUS RATGAH 1003 bp ss-mRNA ROD 27-JUN-1990 DEFINITION Rat L-glutamine amidohydrolase mRNA, 3' end. ACCESSION J05499 KEYWORDS L-glutamine amidohydrolase. SOURCE Rat (strain Sprague-Dawley) liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1003) AUTHORS Smith,E.M. and Watford,M. TITLE Molecular cloning of a cDNA for rat hepatic glutaminase: Sequence similarity to kidney-type glutaminase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Waterford, 08-MAY-1990. FEATURES from to/span description pept < 1 660 L-glutamine amidohydrolase (EC 3.5.1.2) BASE COUNT 270 a 243 c 258 g 232 t ORIGIN 1 ttccggatgt gtctgtcacc cccgttagac aagctgggga acagccacag gggcatcagc 61 ttctgccaga agttggtgtc tctgtttaac ttccacaact acgacaacct gcggcactgt 121 gctcggaagt tagacccacg gagggaaggg ggggaagttc ggaacaagac cgtggtgaac 181 ctgttatttg ctgcatatag tggagatgtc tcagctcttc gaaggtttgc cttgtctgcc 241 gtggatatgg agcagaagga ctatgattcc cgcacagccc tacatgtggc ggcagcggaa 301 ggacacattg acgttgtcaa gtttctgatc gaggcttgca aagtgaatcc ttttgtcaag 361 gacaggtggg gcaacattcc cctggatgat gccctgcagt tcaatcacct ggaggtggtc 421 aaactgcttc aggattacca tgactcctac atgctgtctg agactcaagc tgaggtacag 481 ctgagactct gtcaaaagag aactgagaga gcatgtgtga gcacaggcca gggcagcccg 541 tgctcaagaa aaagcatgag cgggccacaa tttaacccaa ggccaccaaa aatactattg 601 caagctgctt cagtgggatc aacacagcca tctggtgaca caggccagtg ttttctgtga 661 gaatcaaaat gccccattcc ctcatcggac agcacagaga aaagcttcag tggacacctg 721 agcagagcta gccacggaga cctcaaggta tagcttaagt gacatcctcc accagaaagt 781 agcccaggct tttacccagg tccccatttc aacttccttg gagagcgtct agctacatgc 841 atatgtatct gtcacagagc aagagaggtg ggtgagagcc caatcacctg gctttagaaa 901 tctgcagaga tctgtccatc ttagccaaga catgctgcta ctgctgacag gagttttata 961 gacaaagtat tttgtgttca aataaacttt aattaccgga att // LOCUS PSEPROC 1167 bp ds-DNA BCT 27-JUN-1990 DEFINITION P.aeruginosa delta-1-pyrroline-5-carboxylate reductase gene, complete cds. ACCESSION M33557 KEYWORDS delta-1-pyrroline-5-carboxylate reductase. SOURCE P.aeruginosa (strain PAO1) DNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1167) AUTHORS Savoiz,A., Jeenes,D.J., Kocher,H.P. and Haas,D. TITLE Comparison of proC and other housekeeping genes of Pseudomonas aeruginosa with their counterparts in Escherichia coli JOURNAL Gene 86, 107-111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Savioz, 04-APR-1990. FEATURES from to/span description pept 102 923 delta-1-pyrroline-5-carboxylate reductase (EC 1.5.1.2) binding 90 94 ribosome binding site binding 921 927 ribosome binding site BASE COUNT 182 a 431 c 380 g 174 t ORIGIN Chromosome PAO1. 1 ctcgaggcag ccatcggcga aggtgcgacc tgggtccgca tcggtaccgc cctgttcggc 61 gcccgcgact acggcgcgcc ggcttcttga aggaatcccc tatgagcaca ccccgcatag 121 cgttcatcgg cgccggcaac atggccgcca gcctgatcgg cggcctgcgc gcccaaggcg 181 tgccggcggc gcagatccgc gccagcgacc cgggcgccga gcaacgcgcg aagatcgccg 241 gcgagttcgc catcgacgtg gtcgagtcca acgccgaggc cgtggccgac gccgacgtcg 301 tggtcctgtc ggtcaagccg caggccatga aagccgtgtg ccaggccttg gcgccggcgc 361 tgaagccgga gcaactgatc gtctccatcg ccgccggcat cccctgcgcc agcctcgaag 421 cctggctcgg ccagccgcgc ccggtggtcc gctgcatgcc caacaccccg gcgctgctgc 481 gccagggcgc cagcgggctg tacgccaacg cgcaggtcag cgccgcgcaa tgcgagcagg 541 ccgggcagtt gctgtccgcg gtgggcatcg ccctgtggct ggacgacgaa gcgcagatcg 601 acgcggtgac cgcagtgtcg ggcagcggcc cggcgtattt cttcctgctg atgcaggcca 661 tgaccgacgc cggcgagaag ctcggcctgt cccgcgagac agcctcgcgc ctgaccctgc 721 agaccgccct cggtgcggcg cagatggcgc tgtccagcga ggtcgagccg gccgaactgc 781 ggcgccgggt gacttcgccc aacggcacca ccgaggcggc gatcaaatcc ttccaggcca 841 acggcttcga agcactggtg gagcaggccc tgaacgccgc cagccagcgc tccgccgagc 901 tggccgaaca gcttggccaa taaggagtag tcgatgatcg gcctcaacac cgctgctatc 961 tatatcctcc agaccctcgg cagcctctac ctgctgatcg tgctgctgcg cttcatcctg 1021 cagttggtcc gcgcggactt ctacaacccc ctcagccagt tcatcgtgcg cgccaccaag 1081 ccgctgctga acccgctgcg gcggatcatc cccggcttcg gcgggatcga cctggcctcg 1141 ctggtgctgg cgatcctgat ccagctg // LOCUS ECOPEPD 2311 bp ds-DNA BCT 27-JUN-1990 DEFINITION E.coli, peptidase D (pepD) gene, complete cds. ACCESSION M34034 KEYWORDS peptidase D. SOURCE E.coli (strain K12; substrain CM17) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2311) AUTHORS Henrich,B., Monnerjahn,U. and Plapp,R. TITLE Peptidase D gene (pepD) of Escherichia coli K12: Nucleotide sequence, transcript mapping, and comparison with other peptidase genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Henrich, 04-MAY-1990. Author address: B.Henrich Universitaet Kaiserlautern, Fac Biologie, Postach 3049, D-6750 Kaiserslautern, FRG FEATURES from to/span description pept 187 1644 peptidase D mRNA 94 1693 peptidase D mRNA mRNA 127 1693 peptidase D mRNA (alt.) binding 173 179 ribosome binding site signal 83 88 TATA box signal 113 118 TATA box BASE COUNT 558 a 608 c 590 g 555 t ORIGIN 1 gatctggcgc actaaaaacc agtatttcac atgagtccgc gtctttttac gcactgcctc 61 tccctgacgc gggataaagt ggtattctca aacatatctc gcaagcctgt cttgtgttga 121 caacattttc tgctaaccct gtgacctgca atactgtttt gcgggtgatc gacaaggaga 181 cttaacgtgt ctgaactgtc tcaattatct ccacagccgc tgtgggatat ttttgccaaa 241 atctgttcta ttcctcaccc gtcctatcat gaagagcaac tcgctgaata cattgttggt 301 tgggcaaaag agaaaggttt ccatgtcgaa cgcgatcagg taggtaatat cctgattcgt 361 aaacctgcta ccgcaggtat ggaaaatcgt aaaccggtcg tcttacaggc ccacctcgat 421 atggtgccgc agaaaaataa cgacaccgtg catgacttca cgaaagatcc tatccagcct 481 tatattgatg gcgaatgggt taaagcgcgc ggcaccacgc tgggtgcgga taacggcatt 541 ggtatggcct ctgcgctggc ggttctggct gacgaaaacg tggttcacgg cccgctggaa 601 gtgctgctga ccatgaccga agaagccggt atggacggtg cgttcggctt acagggcaac 661 tggttgcagg ctgatattct gattaacacc gactccgaag aagaaggtga aatctacatg 721 ggttgtgcgg ggggtatcga cttcacctcc aacctgcatt tagatcgtga agcggttcca 781 gctggttttg aaaccttcaa gttaacctta aaaggtctga aaggcggtca ctccggcggg 841 gaaatccacg ttgggctggg taatgccaac aaactgctgg tgcgcttcct ggcgggtcat 901 gcggaagaac tggatctgcg ccttatcgat ttcaacggcg gcacactgcg taacgccatc 961 ccgcgtgaag cctttgcgac cattgctgtc gcagctgata aagtcgacgt cctgaaatct 1021 ctggtgaata cctatcagga gatcctgaaa aacgagctgg cagaaaaaga gaaaaatctg 1081 gccttgttgc tggactctgt agcgaacgat aaagctgccc tgattgcgaa atctcgcgat 1141 acctttattc gtctgctgaa cgccaccccg aacggtgtga ttcgtaactc cgatgtagcc 1201 aaaggtgtgg ttgaaacctc cctgaacgtc ggtgtggtga ccatgactga caataacgta 1261 gaaattcact gcctgatccg ttcactgatc gacagcggta aagactacgt ggtgagcatg 1321 ctggattcgc tgggtaaact ggctggcgcg aaaaccgaag cgaaaggcgc atatcctggc 1381 tggcagccgg acgctaattc tccggtgatg catctggtac gtgaaaccta tcagcgcctg 1441 ttcaacaaga cgccgaacat ccagattatc cacgcgggcc tggaatgtgg tctgttcaaa 1501 aaaccgtatc cggaaatgga catggtttct atcgggccaa ctatcaccgg tccacactct 1561 ccggatgagc aagttcacat cgaaagcgta ggtcattact ggacactgct gactgaactg 1621 ctgaaagaaa ttccggcgaa gtaattattt gatttgctgc cggatggcgt ttaatcgcct 1681 tccggcagtt tcatccttca ttatccttcg ataaaagcca tccctgtaaa tgtccgtcgc 1741 gggttgccac gttcaatctg gtgatggaac attcgccgct gcgatttcag cgccgcgcta 1801 ttttcctgtt gctgttgctc cagcttccag gcaatcagca atcgtgccag ccgcttgtta 1861 gcatgctgac tacgctctga ctgaaccttc acgctaatac cggatgccaa atgcgtggcg 1921 cgtaccgccg agtcggtttt attgacatgt tgaccgcccg gccccgacga acgcagcgtc 1981 tcataacgga ttgcatccga ttgttcctgc tcatcagcgg taaaacgccc aatgcccaga 2041 aaccagtttt tgcgcccatg atgaggccga tacggactcg gacaaatcca ctgaatagtg 2101 ccacaccact ttcgcttaat gcccatgcgt tatcgccatc gagagaaacc agcgccgaac 2161 gcagtgtgtc agagtagcgg cccgtttctg tttccagcac cgttaccgcg acgtcttgcc 2221 gggtaaaaac gaccaacaca gaagaagctg cgccaatcgg gtagccgcgt ccggggtgta 2281 aatctggcat ccccactacg cgctgcatgt t // LOCUS HUMTGFB1B 5089 bp ss-mRNA PRI 27-JUN-1990 DEFINITION Human transforming growth factor beta-1-binding protein mRNA, complete cds. ACCESSION M34057 KEYWORDS transforming growth factor beta-1-binding protein. SOURCE Human fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5089) AUTHORS Kanzaki,T., Olofsson,A., Moren,A., Wernstedt,C., Hellman,U., Miyazono,K., Claesson-Welsh,L. and Heldin,C.-H. TITLE TGF-beta-1 binding protein: A component of the large latent complex JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.-H.Heldin, 08-MAY-1990. FEATURES from to/span description pept 91 4275 transforming growth factor beta-1-binding protein precursor sigp 91 151 transforming growth factor beta-1-binding protein signal peptide matp 152 4272 transforming growth factor beta-1-binding protein BASE COUNT 1355 a 1127 c 1255 g 1352 t ORIGIN 1 tgaattcggc acgaggtggg ctctctcttt tcccctcttg ctcctttctt ttcttttttt 61 ctgttttttt aaaccttcca aggcaagttc atggatacta agctgatgtg tttgttgttc 121 tttttctccc tgcctccgct cctagtgagt aaccacactg gccgcatcaa ggtggtcttt 181 actccgagca tctgtaaagt gacctgcacc aagggcagct gtcagaacag ctgtgagaag 241 gggaacacca ccactctcat tagtgagaat ggtcatgctg ccgacaccct gacggccacg 301 aacttccgag tggtaatttg ccatcttcca tgtatgaatg gtggccagtg cagttcaagg 361 gacaaatgtc agtgccctcc aaatttcaca ggaaaacttt gtcagatccc agtccatggt 421 gccagcgtgc ctaaacttta tcagcattcc cagcagccag gcaaggcgtt ggggacgcat 481 gtcatccatt caacacatac cttgcctctg accgtgacta gccagcaagg agtcaaagtg 541 aaatttcctc ctaacatagt caatatccat gtgaaacatc ctcctgaagc ttccgtccag 601 atacatcagg tttcaagaat tgatggccca acaggccaga agacaaaaga agctcaacca 661 ggccaatccc aagtctcgta ccaagggctt cctgtccaga agacccagac catacattcc 721 acatactccc accagcaggt cattcctcac gtctaccccg tggctgctaa gacacagctt 781 ggccggtgct tccaggaaac cattgggtca cagtgtggca aagcgctccc tggcctttca 841 aagcaagagg actgctgtgg aactgtgggt acctcctggg gctttaacaa atgccagaaa 901 tgccccaaga aaccatctta tcatggatac aaccaaatga tggaatgcct accgggttat 961 aagcgggtta acaacacctt ttgccaagat attaatgaat gtcagctaca aggtgtatgc 1021 cctaatggtg agtgtttgaa taccatgggc agctatcgat gtacctgcaa aataggattt 1081 gggccggatc ctaccttttc aagttgtgtt cctgatcccc ctgtgatctc ggaagagaaa 1141 gggccctgtt accgacttgt cagttctgga agacagtgta tgtaccctct gtctgttcac 1201 ctcaccaagc agctctgctg ttgtagtgtg ggcaaggctg ggccacactg tgagaaatgt 1261 ccccttccag gcacagctgc ttttaaggaa atctgtcctg gtggaatggg ttatacggtt 1321 tctggcgttc atagacgcag gccaatccat caccatgtag gtaaaggacc tgtatttgtc 1381 aagccaaaga acactcaacc tgttgctaaa agtactcatc ctccacctct cccagccaag 1441 gaagagccag tggaggccct gaccttctcc cgggaacacg gggccaggag tgcggagcca 1501 gaagtggcaa ctgcaccccc tgaaaaggaa ataccttcat tggatcaaga gaaaaccaaa 1561 cttgagcctg gtcaacccca gctgtctcca ggcatttccg ctattcatct gcatccacag 1621 tttccagtag tgattgaaaa aacatcacct cctgtgcctg ttgaagtagc tcctgaagct 1681 tctacgtcta gtgccagcca agtgattgct cctactcaag tgacagaaat caatgaatgt 1741 actgtgaacc ctgatatctg tggagcagga cactgcatta acctaccagt gagatatacc 1801 tgtatatgct acgagggcta caggttcagt gaacaacaga ggaaatgtgt ggatattgat 1861 gagtgtactc aggtccaaca cctctgctcc cagggccgct gtgaaaacac cgagggaagt 1921 ttcttgtgca tttgcccagc aggatttatg gccagtgagg agggtactaa ctgcatagat 1981 gttgacgaat gcctgaggcc ggacgtctgt ggggaggggc actgtgtcaa tactgtgggg 2041 gccttccggt gtgaatactg tgacagcggg taccgcatga ctcagagagg ccgttgtgag 2101 gatattgatg aatgtttgaa tccaagcact tgtccagatg agcagtgtgt gaattctcct 2161 ggatcttacc agtgcgttcc ctgcacagaa ggattccgag gctggaatgg acagtgcctt 2221 gatgtggacg agtgcctgga accaaacgtc tgcgcaaatg gtgattgttc caaccttgaa 2281 ggctcctaca tgtgttcatg ccacaaaggc tatacccgga ctccggacca caagcactgt 2341 agagatattg atgaatgtca gcaagggaat ctatgtgtaa acgggcagtg caaaaatacc 2401 gagggctcct tcaggtgcac ctgtggacag gggtaccagc tgtcggcagc taaagaccag 2461 tgtgaagaca ttgatgaatg ccagcaccgt catctctgtg ctcatgggca gtgcaggaac 2521 actgagggct cttttcaatg tgtgtgtgac cagggttaca gagcatctgg gcttggagac 2581 cactgtgaag atatcaatga atgcttggag gacaagagtg tttgccagag aggagactgc 2641 attaatactg cagggtccta tgattgtact tgtccggatg gatttcagct agatgacaat 2701 aaaacatgtc aagatattaa tgaatgtgaa catccagggc tctgtggtcc gcaaggggag 2761 tgcctaaaca cagagggttc tttccattgt gtctgccagc agggtttctc aatctctgca 2821 gatggccgta cgtgtgaaga tattgatgaa tgtgtaaaca acactgtttg tgacagtcac 2881 gggttttgtg acaatacagc tggctccttc cgctgcctct gttatcaggg ctttcaagcc 2941 ccacaggatg ggcaagggtg tgtggatgtg aatgaatgtg aactgctcag tggggtgtgt 3001 ggtgaagcct tctgtgaaaa cgtggaaggg tccttcctgt gcgtgtgtgc tgatgaaaac 3061 caagagtaca gccccatgac tgggcagtgc cgctcccgga cctccacaga tttagatgta 3121 gatgtagatc aacccaaaga agaaaagaaa gaatgctact ataatctcaa tgacgccagt 3181 ctctgtgata atgtgttggc ccccaatgtc acgaaacaag aatgctgctg tacatcaggc 3241 gcgggatggg gagataactg cgaaatcttc ccctgcccgg tcttgggaac tgctgagttc 3301 actgaaatgt gtcccaaagg gaaaggtttt gtgcctgctg gagaatcatc ttctgaagct 3361 ggtggtgaga actataaaga tgcagatgaa tgcctacttt ttggacaaga aatctgcaaa 3421 aatggtttct gtttgaacac tcggcctggg tatgaatgct actgtaagca agggacgtac 3481 tatgatcctg tgaaactgca gtgctttgat atggatgaat gtcaagaccc cagtagttgt 3541 attgatggcc agtgtgttaa tacagagggc tcttacaact gcttctgtac tcaccccatg 3601 gtcctggatg cgtcagaaaa aagatgtata cgaccggctg agtcaaacga acaaatagaa 3661 gaaactgatg tctaccaaga tttgtgctgg gaacatctga gtgatgaata cgtgtgtagc 3721 cggcctcttg tgggcaagca gacaacgtac actgagtgct gctgtctgta tggagaggcc 3781 tggggcatgc agtgtgccct ctgccccctg aaggattcag atgactatgc tcagctgtgt 3841 aacatccccg tgacgggacg ccggcagcca tatggacggg acgccttggt tgacttcagt 3901 gaacagtata ctccagaagc cgatccctac ttcatccaag accgttttct aaatagcttt 3961 gaggagttac aggctgagga atgcggcatc ctcaatggat gtgaaaatgg tcgctgtgtg 4021 agggtccagg aaggttacac ctgcgattgc ttggatgggt atcacttgga tactgccaag 4081 atgacctgtt tcgatgtaaa tgaatgcgat gagttgaaca accggatgtc tctctgcaag 4141 aatgccaagt gcattaacac cgatggttcc tacaagtgtt tgtgtctgcc aggctacgtg 4201 ccttctgaca agccaaacta ctgcactccg ttgaataccg ccttgaattt agagaaagac 4261 agtgacctgg agtgaaacag aatctacata acctaagccc atatactctg cactgtgtaa 4321 aggaaaaggg agaaatgtat tatacttgag acattgcacc taccccggaa ggctggaaat 4381 acagaaacag catggagttg caagtcctct gaagacaatg agaggattta ggatgagccc 4441 gataggtgtg gcagaccaaa tggacatttc tctaaaaaac cagtatatat agtctgttca 4501 tatgtaaaat tcaatggaag agaggtggaa cagtgctgtt attttaaaca gaaggttgta 4561 ttattatgtt gttttgtttt ttttactatt gcttgattaa atttggcatt taaatagtgg 4621 tggaaatatt tttatataat tttcattttt tggttgtgca gttccttggc tactgttttt 4681 cttttacttc agttttttaa aaatctcaaa tgaaaaagtc ttcgatacaa tattgttaag 4741 ctgtattata agtattgtta cacagggtta tgcaattccc ggcctggagc atttttgaaa 4801 ttcaaattgt ctgtcctgtg gagcaggcag tgattttgtt ccaaaacttt gtatacacat 4861 ttggagaaaa gtactttata ttttcagtgt tttgtctgat tttaatgtcc gttcttagcc 4921 aagctgctag caggtgttaa ttggatccct ttccttcact gaaatggaag agtttataag 4981 cttacgttag tattgtaata tgtaaagtaa gcccaacaaa aatttttaaa aatttgatga 5041 tccccaatat atctaccatt gtatgttaaa taaatctcgt gccgaattc // LOCUS ALRVSRC 1801 bp ss-RNA VRL 27-JUN-1990 DEFINITION Rous sarcoma virus (Schmidt-Ruppin D strain) v-src gene, complete cds. ACCESSION M33292 KEYWORDS oncogene; pp60v-src; src gene; tyrosine kinase. SOURCE Rous sarcoma virus (strain Schmidt-Ruppin D) RNA, clone psrc1. ORGANISM Rous sarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian sarcoma viruses. REFERENCE 1 (bases 1 to 1801) AUTHORS Reddy,S., Mazzu,D., Mahan,D. and Shalloway,D. TITLE Sequence and functional differences between Schmidt-Ruppin D and Schmidt-Ruppin A strains of pp60v-src JOURNAL J. Virol. 64, 3545-3550 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.I.Shalloway, 26-MAR-1990. FEATURES from to/span description pept 65 1645 pp60v-src protein BASE COUNT 392 a 537 c 551 g 321 t ORIGIN 1 actctgctgg tggcctcgcg taccactgtg gccaagcggt agctggaacg tgcagccgac 61 caccatgggg agtagcaaga gcaagcctaa ggaccccagc cagcgccggc gcagcctgga 121 gccacccgac agcacccacc acgggggatt cccagcctcg cagaccccca acaagacagc 181 agcccccgac acgcaccgca cccccagccg ctccttcggg accgtggcca ccgagcccaa 241 gctcttcgag gacttcaaca cttctgacac cgttacgtcg ccgcagcgtg ccggggcact 301 ggctggcggc gtcaccactt tcgtggctct ctacgactac gagtcctgga ttgaaacgga 361 cttgtccttc aagaaaggag aacgcctgca gattgtcaac aacacggaag gtaactggtg 421 gctggctcat tccgtgacta caggacagac gggctacatc cccagtaact atgtcgcgcc 481 ctcagactcc atccaggctg aagagtggta ctttgggaag atcactcgtc gggagtccga 541 gcggctgctg ctcaaccccg aaaacccccg gggaaccttc ttggtccggg agagcgagac 601 gacaaaaggt gcctattgcc tctccgtttc tgactttgac aacgccaagg ggctcaatgt 661 gaagcactac aagatccgca agctggacag cggcggcttc tacatcacct cacgcacaca 721 gttcagcagc ctgcagcagc tggtggccta ctactccaaa catgctgatg gcttgtgcca 781 ccgcctgacc aacgtctgcc ccacgtccaa gccccagacc cagggactcg ccaaggacgc 841 gtgggaaatc ccccgggagt cgctgcggct ggaggtgaag ctggggcagg gctgctttgg 901 agaggtctgg atggggacct ggaacggcac caccagagtg gccataaaga ctctgaagcc 961 cggcaccatg tccccggagg ccttcctgca ggaagcccaa gtgatgaaga agctccagca 1021 tgagaagctg gttcaactgt acgcagtcgt gtcggaagag cccatctaca tcgtcattga 1081 gtacatgagc aaggggagcc tcctggattt cctgaaggga gagatgggca agtacctgcg 1141 gctgccacag ctcgttgata tggctgatca gattgcatcc ggcatggcct atgtggagag 1201 gatgaactac gtgcaccgag acctgcgggc ggccaacatc ctggtggggg agaacctggt 1261 gtgcaaggtg gctgactttg ggctggcacg cctcatcgag gacaacgagt acacagcacg 1321 gcaaggtgcc aagttcccca tcaagtggac agcccccgag gcagccctct atggccggtt 1381 caccatcaag tcggatgtct ggtccttcgg catcctgctg actgagctga ccaccaaggg 1441 ccggatgcca tacccaggga tgggcaacgg ggaggtgctg gaccgggtgg agaggggcta 1501 ccgcatgccc tgcccgcccg agtgccccga gtcgctgcat gaccttatgt gccagtgctg 1561 gcggagggac cctgaggagc ggcccacttt tgagtacctg caggcccagc tgctccctgc 1621 ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt taagctacaa caaggcaagg 1681 cttggccgac aattgcatga agaatctgct tagggttagg cgttttgcgc tgcttcgcga 1741 tgtacgggcc agatatacgc gtatctgagg ggactagggt gtgtttaggc gaaaagcggg 1801 g // LOCUS RATLACREC 2041 bp ss-mRNA ROD 27-JUN-1990 DEFINITION Rat lactogen receptor mRNA, complete cds. ACCESSION M34083 KEYWORDS lactogen receptor. SOURCE Rat (strain Sprague-Dawley) luteinized ovary, cDNA to mRNA, clone R1. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2041) AUTHORS Zhang,R., Buczko,E., Tsai-Morris,C.H., Hu,Z.-Z. and Dufau,M.L. TITLE Isolation and characterization of two novel rat ovarian lactogen receptor cDNA species JOURNAL Biochem. Biophys. Res. Commun. 168, 415-422 (1990) STANDARD full staff_review COMMENT Authorin submission for [1] kindly submitted by M.L.Dufau, 08-MAY-1990. FEATURES from to/span description pept 93 1925 lactogen receptor precursor sigp 93 149 lactogen receptor signal peptide matp 150 1922 lactogen receptor signal 2031 2036 poly-A signal BASE COUNT 615 a 497 c 460 g 469 t ORIGIN 1 gtgagcactg cagatgtttt gcacatgagc cctgaaggga gcctctgata cattgcctat 61 agcaagaaga aggggccaac ctgaagaaaa acatgccatc tgcacttgct ttcgtcctac 121 ttgttctcaa catcagcctc ctgaagggac agtcaccacc agggaaacct gagatccaca 181 aatgtcgctc tcctgacaag gaaacattca cctgctggtg gaatcctggg acagatggag 241 gacttcctac caattattca ctgacttaca gcaaagaagg agagaaaacc acctacgaat 301 gtccagacta caaaaccagt ggccccaact cctgcttctt tagcaagcag tacacttcca 361 tctggaaaat atatatcatc acagtaaatg ccacgaacca aatgggaagc agttcctcgg 421 atccacttta tgtggatgtg acttacatcg ttgagccaga gcctcctcgg aacctgacat 481 tagaagtaaa acagctaaaa gacaaaaaaa catatctgtg ggtaaaatgg tccccaccca 541 ccataactga tgtgaaaact ggttggttta caatggaata tgaaattcga ttaaagcctg 601 aagaagcaga agagtgggag atccatttta caggtcatca aacacagttt aaagtttttg 661 acctatatcc agggcaaaag tatcttgtcc agactcgctg caagccagac catggatact 721 ggagtagatg gagccaggag agttccgttg aaatgccaaa tgacttcacc ttgaaggaca 781 caaccgtgtg gatcattgcg gccattctct ctgctgtcat ctgtttgatt atggtctggg 841 cagtggcttt gaagggctat agcatgatga cctgcatctt tccaccagtt cctgggccaa 901 aaataaaagg atttgatacc catctgctgg agaagggcaa gtctgaagag ctgctgagtg 961 ccttggggtg ccaagacttt ccccctactt ctgactgtga ggacttgctg gtggagttct 1021 tagaagttga tgacaatgag gacgagcggc taatgccatc ccattccaaa gagtatccag 1081 gtcaaggtgt taagcccaca cacctagatc ccgacagtga ctctgttcac ggaagctatg 1141 acagccattc tcttttatct gaaaagtgtg aggaacccca ggcctacccc cctactttgc 1201 acatccctga gatcactgag aagccagaga atcctgaagc aaatattcct cccaccgtgg 1261 acccccaaag caccaacccc aattttcatg tagatgcacc caaatcttca acatggccat 1321 tactgcctgg ccaacacatg cccagatctc cttaccacag tgttgctgat gtgtgcaagc 1381 tagccggaag tcctgtgaat acactggact ctttcttgga caaagcagag gaaaatgttc 1441 taaagttgtc taaagccctt gagactggag aggaagaagt ggctgaggaa aaagggggaa 1501 aaagcttccc ttctgacaaa caaaacacac cttggccgct gctccaggag aaaagcccca 1561 ctgtctatgt taaaccccca gattatgtgg agattcacaa agtcaacaaa gatggagtgc 1621 tatcattatt ccccaagcag agagaaaaca accagacaga gaagcctggg gttcctgaaa 1681 ccagtaagga gtatgccaag gtgtctggca ttacggataa caatatcctc gtattagtgc 1741 cagactcacg agccaagaac acagcgttgc tcgaggaatc agccaagaag gctccaccat 1801 cgtttgaagc tgaccaatct gagaaagatc tggccagctt cactgcaacc tcaagcaacc 1861 gcagactcca actgggtagg ctggattacc tggatcctac gtgcttcatg cactcctttc 1921 actgatagct agacttatgg aacgattggc taaactgtga tttctcttca ggtaacacta 1981 cagagtcatg aagtaatgtg gtctgctagc aaatgttaca ggatgtgggt ataaaaaaaa 2041 a // LOCUS YSCCP1A 1403 bp ds-DNA PLN 27-JUN-1990 DEFINITION S.cerevisiae centromere-binding protein CP1 gene, complete cds. ACCESSION M34070 KEYWORDS CP1 protein; DNA-binding protein; centromere-binding protein. SOURCE S.cerevisiae (strain X2180) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1403) AUTHORS Baker,R.E. and Masison,D.C. TITLE Isolation of the gene encoding the Saccharomyces cerevisiae centromere-binding protein CP1 JOURNAL Mol. Cell. Biol. 10, 2458-2467 (1990) STANDARD full staff_review COMMENT Authorin submission for [1] kindly submitted by R.E.Baker, 25-APR-1990. FEATURES from to/span description pept 256 1311 centromere-binding protein CP1 BASE COUNT 508 a 284 c 310 g 301 t ORIGIN 1 ggatccttcg ggaccaccat gtctcaccca ccaaggtcaa aaaaaataaa atcggacgag 61 aaaagtattg ggcaaacaac aattatccac gtgcttatca cgttttttta tttaatgtag 121 aatgtaaacc attatacctt aaagataaac actttgttaa ttattcgttt tagttggcgg 181 tcattttcat ttctgctaca agtaccaaca tcaagtgctt aaaatataat acggttttct 241 acacttttat taacgatgaa ctctctggca aataataata agctttctac tgaggatgag 301 gaaatccatt cggcaagaaa aagaggctat aacgaggagc agaactacag cgaggccagg 361 aaaaagcaaa gagatcaagg cttgttgtca caagagagca atgacggaaa tattgactct 421 gctctgttga gcgaaggggc tacgctaaaa gggactcaaa gtcagtatga aagcggactg 481 acgtctaaca aagatgagaa aggaagtgat gacgaagatg cgtcagtggc tgaggctgct 541 gttgccgcta ccgtcaatta tacagactta atccagggcc aggaagatag ttctgatgcc 601 cacacatcta atcaaacgaa cgcgaatggt gagcacaagg attcccttaa tggggaaaga 661 gccataaccc catcaaatga aggagtcaag ccaaacacct ccctggaggg aatgacgtct 721 tcaccaatgg aatctacaca gcaatctaaa aacgacatgc tcataccatt ggcagaacac 781 gatcgtgggc cagaacacca acaggatgat gaagacaatg atgacgctga cattgatctc 841 aaaaaggata taagcatgca gccgggtcgt cgtggaagaa aacctactac tttggccaca 901 acagacgagt ggaaaaagca aagaaaagac tcccataaag aagtcgaaag gcgtcgccgc 961 gaaaacatca acactgcaat caacgtttta agcgacctcc tgcccgtgag agaatcaagt 1021 aaggcagcaa tattggcgcg tgccgctgaa tacattcaaa aattgaaaga aaccgatgaa 1081 gcaaacatcg aaaagtggac gttacaaaaa ttgcttagcg agcaaaacgc atcgcaatta 1141 gccagtgcaa atgagaaact gcaggaagaa ctgggaaatg cttacaaaga aatcgagtac 1201 atgaaacgcg ttttaaggaa ggagggaata gaatacgagg atatgcacac tcacaagaaa 1261 caagagaacg aaagaaaaag cactaggagc gataatccac atgaggcttg agagttaaaa 1321 atagatagct aaatgtattt cgagtctccc tatgtatctg cgtctctgcc ccatagcacc 1381 tctgttaagt ttttatcaat att // LOCUS MUSCD3A 1347 bp ss-mRNA ROD 27-JUN-1990 DEFINITION Mouse CD3-eta protein mRNA, complete cds. ACCESSION M33158 KEYWORDS CD3-eta protein. SOURCE Mouse (strain B6/CBA F1J) female thymus, cDNA to mRNA, clone pBS17. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1347) AUTHORS Jin,Y.-J., Clayton,L.K., Howard,F.D., Koyasu,S., Sieh,M., Steinbrich,R., Tarr,G.E. and Reinherz,E.L. TITLE Molecular cloning of the CD3-eta subunit identifies a CD3-zeta-related product in thymus-derived cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3319-3323 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.L.Reinherz, 22-MAR-1990. FEATURES from to/span description pept 139 759 CD3-eta protein precursor sigp 139 201 CD3-eta signal peptide matp 202 756 CD3-eta protein mRNA < 1 1347 CD3-eta mRNA signal 1329 1334 poly-A signal BASE COUNT 374 a 342 c 336 g 295 t ORIGIN 1 cgaggagagg cacccggtac atgttctcta ggagaacaaa gccagcagag actccatcag 61 cgcctccttt tctcctcatc ctcccaggca tagctgcctc tgcctctgcc tctgggtacc 121 atcccaggga agcagaagat gaagtggaaa gtgtctgttc tcgcctgcat cctccacgtg 181 cggttcccag gagcagaggc acagagcttt ggtctgctgg atcccaaact ctgctacttg 241 ctagatggaa tcctcttcat ctacggagtc atcatcacag ccctgtacct gagagcaaaa 301 ttcagcagga gtgcagagac tgctgccaac ctgcaggacc ccaaccagct ctacaatgag 361 ctcaatctag ggcgaagaga ggaatatgac gtcttggaga agaagcgggc tcgggatcca 421 gagatgggag gcaaacagca gaggaggagg aacccccagg aaggcgtata caatgcactg 481 cagaaagaca agatggcaga agcctacagt gagatcggca caaaaggcga gaggcggaga 541 ggcaaggggc acgatggcct ttaccaggac agccacttcc aagcagtgca gttcgggaac 601 agaagagaga gagaaggttc agaactcaca aggacccttg ggttaagagc ccgccccaaa 661 ggtgaaagca cccagcagag tagccaatcc tgtgccagcg tcttcagcat ccccactctg 721 tggagtccat ggccacccag tagcagctcc cagctctaag gccctgtgct cagctctggt 781 gatgaccctg gctgctgtca catgagttgt ggtgaggatg ggacttttga aaaatctgat 841 gttccaattt ctttcatgca tgctctactc agaagtgagc aagggccaaa actcctgggc 901 atgcatctga atgaatctct caacttagaa aagacttgcc tgcccctctg agctggccag 961 gtgtccccac ctaccctttg gcatgcctcc aagtgccagg acgccacaga ctgcattggg 1021 gccgaagact tcccttttct ttggtttttt tgtttgtttt gagtttatac aatcattaag 1081 aaatctttgg ttttggctgg aaatggaaaa acaaaacaaa tcaaagaaac caccctcccc 1141 tggcttatag cagcagtatt atgacctgac ctggctgagc tttccccccc tccaactttg 1201 ggggtggaaa ttgcaagtta agaactacat tcaaagaaaa cgttgaaagg gccggagaag 1261 cagcttccag aaagcccgtc ggatataaga ttgtcaaata ataataacta ttattataat 1321 atattaataa taaacttaaa tatttgg // LOCUS YSCPRP5 3017 bp ds-DNA PLN 27-JUN-1990 DEFINITION S.cerevisiae pre-RNA processing protein (PRP5) gene, complete cds. ACCESSION M33191 KEYWORDS PRP5 protein; helicase; pre-RNA processing protein. SOURCE S.cerevisiae (strain SPJ 5.41) vegetative DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3017) AUTHORS Dalbadie-McFarland,G. and Abelson,J. TITLE PRP5: A helicase-like protein required for mRNA splicing in yeast JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4236-4240 (1990) STANDARD full staff_review COMMENT Authorin submission for [1] kindly submitted by G.Dalbadie-McFarland, 26-MAR-1990, for release after publication. FEATURES from to/span description pept 284 2833 PRP5 BASE COUNT 1101 a 469 c 636 g 811 t ORIGIN Chromosome 2R. 1 gttgacattc tacttttcta agacggataa tggacgactc cacgttgtag tagttgtcac 61 attaatcgga aaagtgtctc acatgtaaag gattaaaata agaaatcttc taatttatgt 121 tgaaatatct tgatttacca ttacccggat tagatgaaac taattaaagt catttagagc 181 aatggaccaa actaaaggta ttagagttct ttctaatata atacagcgcc aaataattcg 241 agaagggaaa tcataacgct gttgccttgg gtttatttga ggtatggaaa ctattgattc 301 gaagcaaaat attaataggg agtctttatt ggaggaaagg aggaaaaaac tagcaaaatg 361 gaaacaaaaa aaagcacaat ttgatgctca gaaagagcat caaacttcac gcaatgacat 421 tgttactaac agtttagaag gtaagcagac tactgaaaag tttacggaaa gacaggaacg 481 tgtaaaagaa gagcttcgga agagaaaaaa tgaatttagg aaatctgacg aacctgtatc 541 ggttaagcct tcgaagaaaa agtcgaagag aagtaaggta aagaagaaaa tatccttcga 601 cttcagtgat gacgatgatt ctgaaatagg agtttctttt agatcgaagg agcacattca 661 aaaagctcca gagcatgata atgaaaaaga tccactggat gagtttatga cgtcattgaa 721 ggaggagaaa atgagtaata gtaaaggcat gtacgacaga ggcgacattc ttgatgtaga 781 ggatcaattg tttgaacttg gaggaaccga cgatgaggat gttgaagata atacggataa 841 ttctaatata gcgaaaattg ctaaactcaa agcaaaaaaa cgtgtaaagc aaatttatta 901 ctctccagaa gagcttgagc cgttccaaaa aaatttttat atagaatctg aaacagtttc 961 ctcgatgtca gagatggaag ttgaggagct tagacttagt ctggataata tcaaaataaa 1021 gggaacaggt tgccccaaac cagtcacaaa gtggtctcaa ctgggactat caacggacac 1081 catggtttta attacagaaa agttgcactt tggctcctta acgcctattc aatcccaggc 1141 tcttcccgct attatgtcag gtcgtgatgt tataggaata tcaaaaactg gctccggcaa 1201 gaccatctcc tatcttttac cattgctaag acaagtaaaa gctcaacggc cattatcaaa 1261 acatgaaaca gggcccatgg gtttaattct ggctccaact agagagctag ctttacaaat 1321 acatgaagaa gttactaaat tcacagaagc agatacatcc attagatcag tatgctgtac 1381 aggaggttct gaaatgaaaa agcagattac tgatcttaaa agaggcactg agattgttgt 1441 tgccacaccg ggacgattta ttgatatatt aacactaaat gatgggaaat tacttagtac 1501 taaaagaata acgttcgtag taatggatga ggcagacagg ctgttcgatt taggttttga 1561 acctcaaata acgcaaatca tgaaaactgt tcgaccggat aaacagtgtg ttctatttag 1621 tgcaactttt ccgaacaaac tacgcagttt tgccgtaaga gttttgcatt ccccaatatc 1681 tattacgatt aattcaaagg gaatggttaa tgaaaacgta aaacaaaagt ttagaatatg 1741 ccattcagaa gatgaaaaat ttgacaatct ggtgcagctt atccatgagc gcagtgaatt 1801 ttttgacgag gttcaaagtg aaaatgacgg acagtcaagc gatgttgaag aagttgatgc 1861 taaagccatt atatttgtat caagtcagaa tatttgcgac ttcatttcaa aaaagctgtt 1921 aaatgctgga attgtgacct gtgctattca tgcaggtaag ccatatcaag aaaggcttat 1981 gaacttagaa aagttcaaac gagaaaagaa cagtattctt ctctgtacag aggttctctc 2041 aagaggttta aatgttcccg aagtgtcgtt ggttattatc tataacgccg tcaaaacttt 2101 cgcgcaatat gttcatacta ctggaagaac tgccagaggt agtcgctccg gtactgctat 2161 tactcttcta ttgcatgatg aactgtcagg tgcttatatc ttgagcaagg caatgcgcga 2221 tgaagaaata aaagctttag accctctgca agcaaaagaa cttcaggaaa tgagtgccaa 2281 gtttgaatcc ggtatgaaaa agggtaagtt taggttgtcg aaagggttcg gcggcaaagg 2341 actggaaaat atcaaaagta aaagagagga agctcaaaat aaagatttgg agctcaaaaa 2401 gaatgataag aggagcgatg acttggaaaa gaaaatcagc aatccacgcg aagggcatga 2461 cagtgtgtct gagtcttctg cgcttattcc gagactcaat tatgaactat tcaaagagtc 2521 tacagacggg tcgatcatct tttacgccaa ggtttatatt aatgatttac cccagattgt 2581 aagatgggag gctacaaaga atacaacctt attatttatc aagcatgaaa ccggatgtag 2641 tattactaac aagggtaaat tttaccctga agggaaggaa ccaaagaatg aaaacgatga 2701 gcctaaacta tacttattga tcgaaggcca agatgagaaa gacatacaat taagtataga 2761 attgctggag caaaaagtta aagagggggt cgtaaaggct gcaagcttgt ctttgaagag 2821 tactaaatac tagtttttag aatttaactc actcgtggtg ctatatactt tcgtagttag 2881 ttctgaaatt tttagggtgg taatttaaag catattcaat cataataaaa aaaaaaaaag 2941 agaattatta ggaaaaataa aaactaaaat aatatataat atatgatgta gaaaaaaatc 3001 ttgaaatttt tatcgat // LOCUS STMACM 1155 bp ds-DNA BCT 27-JUN-1990 DEFINITION S.globisporus N-acetylmuramidase M1 (acm) gene, complete cds. ACCESSION M30645 KEYWORDS N-acetylmuramidase M1. SOURCE S.globisporus (strain ATCC 21553) DNA. ORGANISM Streptomyces globisporus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1155) AUTHORS Lichenstein,H.S., Hastings,A.E., Langley,K.E., Mendiaz,E.A., Rohde,M.F., Elmore,R. and Zukowski,M.M. TITLE Cloning and nucleotide sequence of the N-acetylmuramidase M1-encoding gene from Streptomyces globisporus JOURNAL Gene 88, 81-86 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.S.Lichenstein, 12-DEC-1989, for release after publication. FEATURES from to/span description pept 175 1059 N-acetylmuramidase M1 precursor matp 406 1056 N-acetylmuramidase M1 BASE COUNT 175 a 454 c 372 g 154 t ORIGIN 1 cgccattgct ctagaccggc ggagccgaca gatgggtctt tacgggggtc tacgcgcgtg 61 cattacttgt gtcgtgctca tggcgaccgg cctttccggg ccccttcgcc ggggcaagca 121 ccggtcattc gcgcgttcca cggctacatc ccccactcgt gcctggaggc agtcatgccc 181 gcgtacagct ctctcgcacg ccgcggccgc agacccgcgg tcgtcctcct cggcggtctc 241 gtcagcgcct ccctggcgct caccctggcg cccaccgccg ccgccgcgcc cctcgcgccc 301 ccgcccggca aggacgtcgg gcccggcgag gcgtacatgg gtgtcggcac ccgcatcgag 361 caggggctcg gcgccggccc cgacgagcgc accatcggcc cggccgacac cagcggtgtc 421 caggggatcg acgtgtcgca ctggcagggc tccatcaact ggagctcggt gaagtcggcc 481 gggatgtcct tcgcctacat caaggcgacc gagggcacca actacaagga cgaccggttc 541 agcgcgaact acaccaacgc gtacaacgcg gggatcatcc ggggcgccta ccacttcgcc 601 cgcccgaacg cctccagcgg cacggcgcag gccgactact tcgccagcaa cggcggcggc 661 tggtcccgcg acaaccggac cctgccgggc gtcctggaca tcgagcacaa cccctccggc 721 gccatgtgct acgggctctc caccacgcag atgcgcacct ggatcaacga cttccacgcc 781 cggtacaagg cgcgcaccac ccgcgacgtc gtcatctaca ccacggcgag ctggtggaac 841 acctgcaccg gcagctggaa cggcatggcg gccaagtccc cgttctgggt ggcccactgg 901 ggcgtgagcg ccccgacggt gccgagcggc ttcccgacct ggacgttctg gcagtactcg 961 gcgaccggcc gggtcggcgg cgtcagcggg gacgtcgacc gcaacaagtt caacggctcc 1021 gccgcccgtc tgctggccct ggccaacaac acggcgtgag acggccggag ggccgggggc 1081 acggcacgca cgccctgccc ccggccctcc cccgcgcccg gcgcggctac cgcatccgcc 1141 cgaggtgctc gcgca // LOCUS YSCKRE5 4383 bp ds-DNA PLN 27-JUN-1990 DEFINITION S.cerevisiae Kre5 protein (KRE5) gene, complete cds. ACCESSION M33556 KEYWORDS Kre5 protein. SOURCE S.cerevisiae (strain GRF88) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 4383) AUTHORS Meaden,P., Hill,K., Wagner,J., Slipetz,D., Sommer,S.S. and Bussey,H. TITLE The yeast KRE5 gene encodes a probable endoplasmic reticulum protein required for (1->6)-beta-D-glucan synthesis and normal cell growth JOURNAL Mol. Cell. Biol. 10, 3013-3019 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.Bussey, 04-APR-1990, for release after publication. FEATURES from to/span description pept 77 4174 KRE5 protein precursor sigp 77 128 KRE5 protein signal peptide matp 129 4171 KRE5 protein BASE COUNT 1448 a 758 c 826 g 1351 t ORIGIN Chromosome 15R. 1 tatataacgt ggcatattaa agattaattg tcctggtaga atatagacgt atcagtgtga 61 gtgcctctgt tgattaatga gactacttgc gttggtattg ttattgttgt gtgcgccgct 121 tcgtgcatgg acttatagct tacgatatgg catacccgaa tctgctcagg tctggtctat 181 tttagttcat ttactgggcg atgttgataa tcagctgtta actaatttat atcctttggt 241 taccggtttg gatgacgaga ttgatattca agaaaatctt gtggcgctaa cttccaatgt 301 attaagggag cgatacgata aagaggatgt ggctgattta ttggaactgt atgctagtct 361 ttaccctatg ggtatgatac agcacgatat cagttccaat gcagaacaag acgatgcaaa 421 tagtagctat ttcgttttga atggtaatag gtacgaaaag cccgacgacg tgttctactt 481 gaaatctaag gatttaacaa ttcaacagaa agtcccagat gttgatgtta tacaacctta 541 cgatgttgtc attggtacta actcagaagc gccgatattg atcttgtacg gttgtcctac 601 cgttattgac tccgacttcg aagaattcaa taggaattta tttatggaag caatgaatgg 661 agagggaaaa tttagattta tttggagatc cacatgttcc cttgatggga aaagcgtgga 721 gtatccctta actcatccgc ttgaaattac tttacaaaat ggttctagaa tgagctccat 781 acctcaatta aaaaaaatac tatatactgt acccaaagaa atattggttg gagcagacaa 841 cgatgatcag ctccatgatc tagaaccaga agaattacgt gaacttgatt tgagagtaac 901 atcgttaatc tcagaatttt accaatataa aaaggatatc acagccactc taaatttcac 961 caaaagtatt gttaacaact ttccactaat ctctaaacaa ctgattaagg tttcatctgt 1021 taacaaggat ataataacaa gtaatgaaga actcaatagt aaaggcttcg attacaacat 1081 gctaggtctc tatattaatg gacagaattg gaaaattacc tcactgactc cgtacaattt 1141 gcttactgct ttaaaaactg aataccaaag tttactgaaa attacgaacc ttttgcaaga 1201 actcgagcca tcgaaatgca tactagattc caagttttta ctcaataagt tttctcaatt 1261 ttcattgggg aagttgcaaa acttacaacc aatcaaaatg gatctccaca caattccagg 1321 gttctcagaa tcagtaatat acttcaatga tatcgaaagc gacccgcaat atgacgaatt 1381 agtaaatagt gttcaagcat tttttgataa atcgaaattc ggagagttgc ctgaaataaa 1441 gcaaaactgg tcagagatca tattcgttat agatttcgcc cgtttagaag atagtgaggt 1501 gaaggaggca ttgggtgggt tggttcgtgc cgttaatgtt gtctcccagg gatatccgca 1561 aagagtcgga ctattgccat ttagttcaga tagtgacaag tccgttgtta ataaaattta 1621 cgagctgaag aactcaactg acaatttaac agaattaaaa agttttttgg agacaatgct 1681 gcttgcagat ggcctttccg cgaatgcaaa acattcaaaa cacataccag ttccagatgt 1741 tttccatcta cttgatgaac ttcaaattga cgaaacatca attataatca atggagagat 1801 ttacccattt agaaaaaatt ggaattattt aattgcaaaa gttatcaaaa aggacactga 1861 atttattcgt aaagaattga gcaattcttc tccgaaaaac aaacaaatta gcgtaaggga 1921 cttattgcat tacaaatctg caaatctgag acataataaa tatacaccaa attattttgc 1981 tgattcggta tattcttcgg tcaacaatac tgcattggaa agcgtatgct cagaaagaat 2041 aggctactat actaaaaatg aagaatacaa tttattacac acaatcacat tagtggatga 2101 ttttggctct attcatgctt tgaaaagatt gagaaacttg ttgcatactt cctttgttgg 2161 tgttaggatc agaatcattc acgtaggtga tatttctgat atttggtatc aattgcgtgg 2221 aagtctttcc caaaaagatc caataggctc aataaataca tttattgatg ctttgaaact 2281 taaaaaggta aaaagtcaca cgtacaaaaa aagcggctta aaccagttag gccttcataa 2341 atggcttcct gacattccat tatttgaatt gcaaaagggt tcatttattg ctttgaacgg 2401 tagatttatc atcttgatca aaatgaagtg ccagaaacag aacatttcga aggccaaaat 2461 cataaagaga gaagctctaa gaacgatcga ttcagttttc gccctagatt tactttttcc 2521 aggtttctca caggaaataa tcaatcctga tttgatagaa atgatctcct ccattttaac 2581 taggttgttt taccaaggta cacatatata caataatggt attgattata ccactgaaag 2641 tagcttacca agaatggatt tgagcgagtt ttttagacct aataatttaa cgatgtttga 2701 agatggaaaa tcagcttcta ttgatttact actaatttta gatccacttg aagagagaac 2761 gcaaatgatt ctttctcttg ttgagcaatt caggcctttg aaatttgtta atattcaggt 2821 aattttaatg ccgacactgg aattaaacat tgtccctatt agaagaatat acgttgatga 2881 cgcagatatt gtcaaatcaa taacttctga ggatagcaga tcagatccag aagtagatat 2941 tgaaatggat gttcctaatt ctttcattgt agataataat tatcggataa aaaaattgct 3001 catagaatta cattccttct ctagcaaaac agtcctttca actggcaata ttgatggtat 3061 ggggggtgta tgcctagcac ttgtcgattc tgcagggaac attattgaca aaactacaac 3121 aatgaaaacc tttggctatg gacaatttca taccgacaaa tttttaaagg gttgctatat 3181 aaaaagttgt gattcaagat ataccgttca gtcattttct actgacgggc atcccgactt 3241 tataccatca gattccttgg atatactgtc gtacaatcca caaaaaatcg ctgtaaaaat 3301 ttcagaagag cctacacacg aggaagaata cgaggaaggt cgcaacaatg atacaataat 3361 caatattttt actattttag agtccgggcc agatgaggaa gagaggtaca tgcaaatgat 3421 tttatccatt ttgtcaaagt gtcccgaaac gcaaaaggtg aattttttca ttttagatca 3481 gccgtttatc tccgacactt taaggaaatc atgtgagtat ataaattcct ctgatgaaat 3541 gagaggcaat gtcatttttt tgaattatga atggcctcaa tggttaagac cgcaaagatt 3601 ttcttcaagg agaagggatg tctctagatt tctgttcttg gatgtccttt tacctcaaaa 3661 catctccaaa gtgttatata tgagtccaac tgaagtaccg ctggatcctt ttgacatttt 3721 tcaatttcaa ggcctcaaac gtgcacctct aggactattc cgaatgagtg gtgatggtta 3781 ttggaaagaa ggatactggg aaaaaatgtt aagggagaat aatttagaat tttattctac 3841 cgaaccggcc tttttagtaa acttagagag gtttcgggag ttagatgctg gtgataaata 3901 caggattcac tatcaacgta tttctacaga cgccatgtct cttgtcaata tcggccaaga 3961 tctagttaac aacctacaac tcgaggttcc gattaggttt ctcaagggat cgtataagaa 4021 gaaattagtt attaatgatg aatgtgtttc tgaatggaag aaaaaaataa ataagttcgc 4081 atcctctcct ggcgatgaag acgtacctgg agaaagtgtt agcagcaaat accaagattc 4141 cgacaatgcc gctcctctgc atgacgaatt ataactactc cagaagagtt cattacgcga 4201 ctgtccaaga gcgtgaagaa ttgcttctgc gcataagtct tcttccataa tagattttta 4261 tatcattttt agaacataaa atttctcgcc aagctagttt tttagctaaa agcagatatc 4321 cagtaacatg ggttccgctt tttgcagcga atactatgaa gagttttgcc cgactggctc 4381 ccc // LOCUS STMCURA 2192 bp ds-DNA BCT 27-JUN-1990 DEFINITION S.curacoi curC, CurA genes, complete cds, curD gene, 3' end and curB gene 5' end. ACCESSION M33704 KEYWORDS . SOURCE S.curacoi (strain ATCC 13385) DNA. ORGANISM Streptomyces curacoi Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 2192) AUTHORS Bergh,S. and Uhlen,M. TITLE Cloning, analysis and heterologous expression of the polyketides synthesis genes of Streptomyces curacoi JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Authorin submission for [1] kindly submitted by S.Bergh, 13-APR-1990. Author address: S.Bergh, Dept. of Biochemistry, Teknikringen 30, Stockholm, SWEDEN S-100 44. E-mail: staffan@physchem.kth.se FEATURES from to/span description pept < 1 32 curD protein (AA at 3) pept 77 541 curC protein pept 538 1806 curA protein pept 1803 > 2192 curB protein BASE COUNT 349 a 823 c 731 g 289 t ORIGIN 1 tgatcaccga ccgcagggcc gcgcagccct gacagcgcag ggacccccac cccaccgacc 61 gacctggaga cagaccatgg acaagatgcg cccgcgcgtc gtggacgtca aagagatcga 121 gcccaaccgc aagcgcggcg gtgacctgcg cacgctgctc acccctgtca cggtgggtgc 181 caccagcggg ttcatgggcc tggccatcat gcggccgggc gaacgcatca gcgagcacta 241 ccacccgtac tccgaggagt tcgtgtacgt cgtcgagggc aggctggagg tcgacctcga 301 tggcgaaacg tttcctctcc gtgccgacca gggcctcatg atccccatcg acatgcggca 361 ccgcttccgc aacgtcggtg acgaggaagc ccggatggtc ttccatctga gcccgctggc 421 accgaagccg agcctcggcc acgtcgacac ggaggccccg gcgatcagcg acgacgtcaa 481 ggcgtacccg ctcgtccagg aggagagcgg gcggccggaa cgacccgggg tcctgtcgtg 541 acccggcggg tggcggtcac cggcataggc gtggtcgctc cgggaggcat cggggtgccc 601 gcgttctggg acctcctctc gagcggccgt accgcgacac gtggcatcac cctgttcgac 661 cccgaggggc tgcggtcccg catcgccgcc gagtgcgact tcgacccgct cgcccacggc 721 ctcgaccccg agctggtcga acgcgccgac cggtacatac agttcgccct ggtcgccgcc 781 gacgaggccg tgaccgacag cggcatcgac ttcggcaccg agaacccctg gcgcgtggcc 841 gtgtccctgg gcagcgcggt gggcgggacg accaggctgg agcacgacta cgtcctcgtc 901 agcgaacggg gacagcgctg ggacgtcgac caccgcgccg ccgagccgga actgcaccgc 961 gccttctcgc ccagcacgct ggccgcggac gtcgcggagc ggttcggcgc ccagggaccg 1021 gtgcagaccg tgtccaccgg ctgcacctcc gggctcgacg cggtgggtta cgccttccac 1081 accatcgagg agggccgcgc cgacgtctgc atcgccgggg cgtcggactc gccgatctcc 1141 ccaatcacca tggcctgctt cgacgccatc aaggccacgt cgcccaacaa cgacgacccg 1201 gagcacgcct cccgcccctt cgacgcccac cgcgacggct tcgtcatggg cgagggcgcc 1261 gccgtcctcg tcctggagga actggaacac gcccgtgcgc gcggcgcaca tgtctactgc 1321 gagatcggcg gctacgccac cttcggcaac gcctaccaca tgaccggact gaccagtgag 1381 ggcctggaga tggcccgggc catcgacgtc gcgctcgacc acgcgcgcgt cgacccgacc 1441 gacatcgact acgtcaacgc ccacggctcg ggcacccggc agaacgaccg ccacgagacc 1501 gccgcggtca agaagtccct gggcgcccac gcctacgaca cccccatgag ctcgatcaag 1561 tccatggtgg gccactcgct cggcgcgatc ggcgcgatcg aggtcgtcgc ctgcgtactc 1621 gccctggccc ggcaggtggt cccgccgacg gcgaactacg agaccccgga ccccgagtgc 1681 gacctggact acgtaccgcg caccgcacgg ccccgccggc tcgaccacgt gctctccgtg 1741 ggcagcggct tcggcgggtt ccagtccgcg gtgctcctga cggggccggc tgggaggaaa 1801 cgatgaacgg cagtggtagt ggcatacgca cccggcggac cgccgtcacg gggatcggtg 1861 tggtcgcgcc caacggcctg cacgccgaca cctactggaa gtccgtcaag gagggcgcga 1921 gcgtactgga ccgcatcacc cgggagggct gcgagcacct tccgctccgc gtcgccggtg 1981 aggtccgcgg attcgacccg tcggcgctca tcgaggagac cttcctcgtc cagaccgacc 2041 tgttcaccca cttcgccctg gccgccgcgg acgccgccct ccaggacgcg gggctgacca 2101 aggccgccgc cgtggccgac tcgccgtact cggtcggtgt cgtcaccgcc gccgggtccg 2161 gtggcggcga gttcgggcag cgggaactgc ag // LOCUS MUSTRKB 2484 bp ss-mRNA ROD 27-JUN-1990 DEFINITION Mouse tyrosine protein kinase B (trkB) mRNA, complete cds. ACCESSION M33385 KEYWORDS tyrosine protein kinase. SOURCE Mouse adult brain, cDNA to mRNA, TK-negative clone pFRK42. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2484) AUTHORS Klein,R., Conway,D., Parada,L.F. and Barbacid,M. TITLE The trkB tyrosine protein kinase gene codes for a second neurogenic receptor that lacks the catalytic kinase domain JOURNAL Cell 61, 647-656 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Barbacid, 29-MAR-1990, for release after publication. FEATURES from to/span description pept 1 1431 tyrosine protein kinase B (trkB) precursor sigp 1 93 tyrosine protein kinase B (trkB) signal peptide matp 94 1428 tyrosine protein kinase B (trkB) BASE COUNT 638 a 614 c 549 g 683 t ORIGIN 1 atgtcgccct ggctgaagtg gcatggaccc gccatggcgc ggctctgggg cttatgcctg 61 ctggtcttgg gcttctggag ggcctctctc gcctgcccga cgtcctgcaa atgcagttcc 121 gctaggattt ggtgtactga gccttctcca ggcatcgtgg cattcccgag gttggaacct 181 aacagcgttg acccggagaa catcacggaa attctcattg caaaccagaa aaggctagaa 241 atcatcaatg aagatgacgt tgaagcttac gtggggctga gaaaccttac aattgtggat 301 tccggcttaa agtttgtggc ttacaaagcg tttctgaaaa acagcaacct gcggcacata 361 aatttcacac gaaacaagct gacgagtttg tccaggagac atttccgcca ccttgacttg 421 tctgacctga tcctgacggg taatccgttc acgtgctcct gcgacatcat gtggctcaag 481 actctccagg agactaaatc cagccccgac actcaggatt tgtactgcct caatgagagc 541 agcaagaaca tgcccctggc gaacctgcag atacccaatt gtggtctgcc atctgcacgt 601 ctggctgctc ctaacctcac cgtggaggaa ggaaagtctg tgaccctttc ctgcagtgtg 661 gggggtgacc cactccccac cttgtactgg gacgttggga atttggtttc caagcacatg 721 aatgaaacaa gccacacaca gggctcctta aggataacga acatttcatc tgatgacagt 781 ggaaagcaaa tctcttgtgt ggcagaaaac cttgtaggag aagatcaaga ttctgtgaac 841 ctcactgtgc attttgcgcc aactatcacg tttctcgagt ctccaacctc agatcaccac 901 tggtgcattc cattcactgt gagaggcaac cccaagcctg cgcttcagtg gttctacaat 961 ggggccatac tgaatgagtc caagtacatc tgtactaaga tccacgtcac caatcacacg 1021 gagtaccatg gctgcctcca gctggataac cccactcata tgaataacgg agactacacc 1081 ctgatggcca agaacgagta tgggaaggat gagagacaga tctccgctca cttcatgggc 1141 cggcctggag tcgactacga gacaaaccca aattaccctg aagtcctcta tgaagactgg 1201 accacgccaa ctgacattgg ggatactacg aacaaaagta atgaaatccc ctccacggat 1261 gttgctgacc aaagcaatcg ggagcatctc tcggtctatg ccgtggtggt gattgcatct 1321 gtggtgggat tctgcctgct ggtgatgttg ctcctgctca agttggcgag acattccaag 1381 tttggcatga aaggttttgt tttgtttcat aagatcccac tggatgggta gctgagataa 1441 aggaaagaca aaggctgggg ctgtggtgct tgttgcctga cgccctgtga gctgaactct 1501 gggactgctg ttgcctatcc caggaagtgc tgcttatttg agggtgtctg gtggaaatgg 1561 gtaatctccg aggatgtctg cagcctgctt gttgtgagct gtgactgggg aaccccaagg 1621 cagaggcagg ggtcaggcag ctgagaagca gcagaagaac acacttagat tcaccttctg 1681 ttcttacaat agttcaaata tagaatcgaa gtgaaatctc attggattat gcctctctaa 1741 tgaaaagcga gctgtttgac tatacggaaa atgtgctgac attaattgct tctgtttatt 1801 aaaggtgatt tgcaaattaa aaactctgca tctatcatct atccatctat ctgtttgtct 1861 atcatatcta tctgtctgtc tatctgtcta tcatctatct acctacctct ctatcatatc 1921 tatctgtctg tctatctatc tatctatcta tctatctatc tatctatcta tctatctatc 1981 tatctatcat ctatctacct atcatcgatc tacttatcta tcatctatct atctacctat 2041 catcgattta cttatctatc atctatctat ctatctatct atctatctat ctatctatct 2101 atctgtcatc tatctaaagt catagctagg tctaagtgca cactaaaagt ctaatccaca 2161 cataacacct atttcagcaa catcttctgt tctctaacct ttgctaactt ctgtgatttc 2221 cacctacaac cctgcgactg atagacttaa aggcacattg gtggtgtcat tagtaggttc 2281 tttgttttgc tggcagcaaa gacccaaact cttcgctaac gattgctttc aaagtccacc 2341 cggcaggtag aacggagcag caccagggac tgtgtggcca ggagtatgga cctgaattaa 2401 ccacagcctg agaataaata atggtagggt atatgcatat agggaattaa aatcttgtcc 2461 ctttccattg ccctctgcta accg // LOCUS STRPRH 1784 bp ds-DNA BCT 27-JUN-1990 DEFINITION Streptococcus protein H gene, complete cds. ACCESSION M29398 KEYWORDS IgG binding protein; protein H. SOURCE Streptococcus (group A, strain AP1) DNA. ORGANISM Streptococcus sp. Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 1784) AUTHORS Gomi,H., Hozumi,T., Hattori,S., Tagawa,C., Kishimoto,F. and Bjoerck,L. TITLE The gene sequence and some properties of protein H: A novel IgG binding protein JOURNAL J. Immunol. 144, 4046-4052 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Gomi, 25-OCT-1989, for release after publication. FEATURES from to/span description pept 328 1458 protein H precursor sigp 328 471 protein H signal peptide matp 472 1455 protein H binding 316 321 ribosome binding site signal 246 251 -35 region signal 270 274 -10 region site 1489 1514 terminator BASE COUNT 734 a 343 c 338 g 369 t ORIGIN 1 gaattccgtt accatcaaca ggtgaaacag ctaacccatt cttcacagcg gcagccctta 61 ctgttatggc aacagctgga gtagcagcag ttgtaaaacg caaagaagaa aactaagcta 121 tcactttgta atactgagtg aacatcaaga gagaaccagt cggttctctc ttttatgtat 181 agaagaatga ggttaaggag aggtcacaaa ctaaacaact cttaaaaagc tgacctttac 241 tccttttgat taactatata taataaaaat attaggaaaa taatagcact attaattttc 301 ttttttaata aaatcaagga gtagataatg actagacaac aaaccaagaa aaattattca 361 ctacggaaac taaaaaccgg tacggcttca gtagccgttg ctttgaccgt tttgggcgca 421 ggttttgcaa accaaacaac agttaaggcg gaaggggcta aaattgattg gcaagaagag 481 tataaaaagt tagacgaaga taatgctaaa cttgttgagg ttgttgaaac cacaagtttg 541 gaaaacgaaa aactcaagag tgagaatgag gagaataaga aaaatttaga caaacttagc 601 aaagaaaatc aaggaaagct cgaaaaattg gagcttgact atctcaaaaa attagatcac 661 gagcacaaag agcaccaaaa agaacaacaa gaacaagaag agcgacaaaa aaatcaagaa 721 caattagaac gtaaatacca acgagaagta gaaaaacgtt atcaagaaca actccaaaaa 781 caacaacaat tagaaacaga aaagcaaatc tcagaagcta gtcgtaagag cctaagccgt 841 gaccttgaag cgtctcgtgc agctaaaaaa gaccttgaag ctgagcacca aaaacttgaa 901 gctgagcacc aaaaacttaa agaagacaaa caaatctcag acgcaagtcg tcaaggccta 961 agccgtgacc ttgaagcgtc tcgtgcagct aaaaaagagc ttgaagcaaa tcaccaaaaa 1021 cttgaagctg agcaccaaaa acttaaagaa gacaaacaaa tctcagacgc aagtcgtcaa 1081 ggcctaagcc gtgaccttga agcgtctcgt gcagctaaaa aagagcttga agcaaatcac 1141 caaaaacttg aagcagaagc aaaagcactc aaagaacaat tagcgaaaca agctgaagaa 1201 cttgcaaaac taagagctgg aaaagcatca gactcacaaa cccctgatac aaaaccagga 1261 aacaaagctg ttccaggtaa aggtcaagca ccacaagcag gtacaaaacc taaccaaaac 1321 aaagcaccaa tgaaggaaac taagagacag ttaccatcaa caggtgaaac agctaaccca 1381 ttcttcacag cggcagccct tactgttatg gcaacagctg gagtagcagc agttgtaaaa 1441 cgcaaagaag aaaactaagc tatcactttg taatactgag tgaacatcaa gagagaacca 1501 gtcggttctc tcttttatgt atagaagaat gaggttaagg agaggtcaca aactaaacaa 1561 ctcttaaaaa gctgaccttt actaataatc gtctttgttt tataatgaaa acattaacga 1621 aataatttat taaggagaga atactaatga atattagaaa taagattgaa aatagtaaaa 1681 cactactatt tacatccctt gtagccgtgg ctctactagg agctacacaa ccagtttcag 1741 ccgaaacgta tacatcacgc aattttgact ggtctgggga attc // LOCUS BSUSACP 3518 bp ds-DNA BCT 27-JUN-1990 DEFINITION B.subtilis sacT gene, complete cds, enzyme II (sacP) gene (of the sucrose phosphotransferase system), complete cds, and sucrase gene, 5' end. ACCESSION J03006 M33761 KEYWORDS enzyme II; sucrase; sucrose transferase system. SOURCE B.subtilis (strain Marburg 168) DNA, clone pBSG8-10. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1987 to 3518) AUTHORS Fouet,A., Arnaud,M., Klier,A. and Rapoport,G. TITLE Bacillus subtilis sucrose-specific enzyme II of the phosphotransferase system: Expression in Escherichia coli and homology to enzymes II from enteric bacteria enzymes II from enteric bacteria JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 8773-8777 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 2070) AUTHORS Debarbouille,M., Arnaud,M., Fouet,A., Klier,A. and Rapoport,G. TITLE The sacT gene regulating the sacPA operon in B.subtilis shares strong homology with transcriptional antiterminators JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by A.Fouet, 16-SEP-1987. Draft entry and computer-readable sequence for [2] kindly submitted by M.Debarbouille 18-APR-1990. Author address: M.Debarbouille Institut Pasteur 25-Rue du Dr. Roux 75724 Paris Cedex 15 FRANCE Emr:DEDONDER%PASTEUR.BITNET@CUNYVM.CUNY.EDU FEATURES from to/span description pept 136 897 sacT protein pept 2008 3390 enzyme II pept 3387 > 3518 sucrase BASE COUNT 1013 a 712 c 867 g 926 t ORIGIN 1 ctttacatgg cggtcaaaag gaggtaaaat cccagtcaaa agcaaattgg cgggagagat 61 aacctcttga aaatctataa agtattaaac aacaatgcgg ctttaataaa agaggatgat 121 caggaaaaaa tcgtgatggg accgggaatc gcttttcaaa aaaagaaaaa tgatctcatc 181 cctatgaata aagtggaaaa gatctttgtc gtgcgcgacg agaatgaaaa gttcaaacaa 241 atcctgcaaa cactgccgga ggagcatata gaaatagctg aggatatcat cagctatgcg 301 gaaggagagc tcgcggcgcc gctgagcgat cacattcata tcgcgctttc cgaccatttg 361 tcctttgcga tcgaaaggat tcaaaatggg ctgcttgtgc agaataaatt gctgcatgag 421 ataaaggcgc tctataaaaa agaatatgag atcggcctgt gggcgatcgg acatgtaaaa 481 gagacattgg gcgtgtctct gcctgaagat gaagcgggtt atattgccct tcacatccat 541 acggcgaaga tggatgcgga gagcatgtat tcagcgctga agcatacgac catgatcaaa 601 gaaatgatag agaaaataaa acaatacttt aatcgaaagg tggatgaaaa cagcatttcc 661 tatcaacgcc ttgtcacgca tttgcgatac gcggtcagcc ggttggaatc aaatgaagcg 721 ctccatcgta tggacgagga gatgctttat ttcatccaaa aaaagtattc attcgcctat 781 caatgtgcgc tggagctggc ggagtttttg aaaaatgaat atcaattaca tttgccggaa 841 tccgaggccg gctatatcac gctgcatgtc cagcgtcttc aagatctctc ggaataaccg 901 ctttgacttg cagggagtga tctctggaag tttttttatt gatcaggatt cagtgtgtag 961 tacatccagc ccatcataac tattcccgcg gtcagattgc cgagcgtaac ggggattaag 1021 tttctgactg ctcccatcag tgtcactgta tcagggtgct cgatcaaaag cgagatggcg 1081 aatgtgcaca tattggcaat gctgtgttca aagccggaaa taaagaagca gaaaacgaaa 1141 agcatcatgg taaaaagctt tgctccttcc cctttgagag acattggaat gaaaaaggcg 1201 aggcacacaa gccaattgca cagcattcct ctgaaaaaca attcggaagc cggcggctcc 1261 attttgtgct ctgccaaatg aatcaaaaag gaatgaacag aaggctcctc aaaaagtccc 1321 gtcgcgctga tcaggatggc aaacagaatg gcgccgatta aattgccggc atagcttgac 1381 atccacaagt atagggtgtc gcgccagctg atttttttcc ggagcgcggt ataggtgaaa 1441 taaaaggtgt tgccggtaaa taaatctccg cctccgtaag caatcatcag aatagcggcc 1501 ccgaaagtga cagcggctgc cggaaaggca aacggagaat cagccataaa gaaatagctg 1561 cctgttttgc ttgcggctgt aatgccaaaa ccaataaata tactggccaa aatggaccga 1621 agcacataac ggattttgct tgaagcgaat atgttttgtt ttttcaaagc atactgttca 1681 accttttgta atgcttgagt ttccattgtc ttttcctgct ttctttcatg attgagcata 1741 gtatggcacg atattgtgaa gtatttcaca attaattttt acgaattttt gtctacatgt 1801 tatatcaaac tatttaacta aaaaaatagt tgacgaaaac gctatcatga tttatgatga 1861 aagcgtattc ttaatctgaa taagcgggat tgtgactggt aaagcaggca agacctaaaa 1921 tttgcgtaaa tgaaaaagga tcgctgtgtc ctttattcgt tggcgaattt taggtctttt 1981 ttgtttaaat aaaaggggga gaaatagatg gattacaaag agactgcaaa acgcctcatt 2041 gagcttctcg gagggaaaga aaatattatc agcgcggctc attgtgcaac aagactgcgt 2101 ttagtgatga aagatgaatc aaagatagat caagcacaag tagaagagct tgatggcgtt 2161 aaaggggctt tcagcagctc tggccagtac cagatcattt tcggaacagg ccttgtgaat 2221 aaagtattcg atgccttttc taaagaagct gatatcgagc gtgaagaaca cgtcaatcat 2281 caggatgcgg caaaagaaaa gctgaatccc gctgcgagat ttgcgaaaac gctttctaat 2341 atttttgttc caatcatccc agctattgta gccagcggcc tattaatggg attgctgggt 2401 atgatcaatg cgtttcattg gatgagcaag gattctgcgc ttttgcagct gcttgatatg 2461 ttttcaagtg cagcattcat tttcttgccg attctaatcg gggtcagcgc ttcaaaagag 2521 tttggcagca atccatactt gggagcggtc atcgggggaa tcatgattca tccgaatctt 2581 ttaaatccat ggggattggc ggaagaacaa ctgattacat gcatcttttc ggatttgata 2641 tcgctcttct cggctacagg gaactgttat ccctgtcctg cttgcggtgt atgtgatgag 2701 caaggtagag aaatggacga gaaaagtggt tccacatgcg gtgatttgct tgttacacca 2761 tttgttactg tgattgtcac cgggtttgtt gcttttatcg cgattggccc tctaggcaga 2821 gcgcttggct ccggcattac ggttgcttta acttatgtat atgaccatgc cggttttgtc 2881 gcaggtctga ttttcggggg cacgtattca ctcatcgtgc tgacaggcgt tcatcacagc 2941 ttccatgcga ttgaggcagg gctaattgca gatataggca aaaactactt gctgccgatc 3001 tggtcaatgg cgaatgtcgc acagggcggg gcaggtcttg ccgtcttctt tatggcgaaa 3061 aaggcaaaga caaaagaaat cgcgcttccg gcagcatttt ccgcttttct cggcattact 3121 gagcccgtca tattcggagt caatctgcgc taccgaaaac cgtttatcgc cgcgatgatc 3181 ggaggcgcat tgggtggggc atatgtcgtc tttacacatg tagctgcaaa cgcttacggg 3241 ttaacgggaa ttccgatgat cgccattgcg gctccgtttg gcttcagcaa cttgattcac 3301 tatctgattg gcatggcgat tgcggcagtg tcagcgttca ttgcagcatt tgtcatgaag 3361 attaatgagg atgaggagag gaaaaaatga cagcacatga ccaggagctt cgtcgccggg 3421 cttatgaaga agtggagaaa aaagagccca tcgctaacag cgatccgcac cgccagcatt 3481 ttcatatcat gccgccggtt gggctgctga atgacccg // LOCUS MUSFGFR 2983 bp ss-mRNA ROD 27-JUN-1990 DEFINITION Mouse FGF receptor mRNA, complete cds. ACCESSION M33760 KEYWORDS FGF receptor. SOURCE Mouse cell line NIH3T3, cDNA to mRNA, clone A-15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2983) AUTHORS Mansukhani,A., Moscatelli,D., Talarico,D., Levytska,V. and Basilico,C. TITLE A murine FGF receptor expressed in CHO cells is activated by both bFGF and K-FGF JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Basilico, 22-MAR-1990. FEATURES from to/span description pept 44 2245 FGF receptor precursor sigp 44 113 FGF receptor signal peptide matp 114 2242 FGF receptor BASE COUNT 699 a 834 c 799 g 651 t ORIGIN 1 gaattccgga gccttgttac caacctctaa ccgcagaact gggatgtggg gctggaagtg 61 cctcctcttc tgggctgtgc tggtcacagc cactctctgc actgccaggc cagccccaac 121 cttgcccgaa caagatgcac tcccatcctc ggaagatgat gacgacgacg atgactcctc 181 ctcggaggag aaagagacgg acaacaccaa accaaaccgt aggcctgtag ctccctactg 241 gacatcccca gagaaaatgg agaagaaact gcatgcggtg cccgctgcca agacggtgaa 301 gttcaagtgc ccgtcgagtg ggacacccaa ccccactctg cgctggttga aaaatggcaa 361 agagtttaag cctgaccacc gaattggagg ctacaaggtt cgctatgcca cctggagcat 421 cataatggat tctgtggtgc cttctgacaa gggcaactac tcctgcatcg tggagaatga 481 gtatgggagc atcaaccaca cctaccagct tgacgtcgtg gaacgatctc cgcaccgacc 541 catccttcag gcagggctgc ctgcaaacaa gacagtggcc ctggcgagca atgtggagtt 601 catgtgtaag gtgtacagcg atccgcagcc tcacattcag tggctgaagc acatcgaggt 661 gaacgggagt aagatcgggc cagacaactt gccgtatgtc cagatcctga agactgctgg 721 agttaatacc accgacaagg aaatggaggt gcttcaccta cggaatgtct cgtttgagga 781 tgcgggggag tatacgtgct tggcgggtaa ctctatcgga ctctcccatc actctgcatg 841 gttgaccgtt ctggaagccc tggaagagag accagctgtg atgacctcac cgctctacct 901 ggagatcatt atctactgca ccggggcctt cctgatctcc tgcatgttgg gctctgtcat 961 catctataag atgaagagcg gcaccaagaa gagcgacttc catagccaga tggctgtgca 1021 caagctggcc aagagcatcc ctctgcgcag acaggtaaca gtgtcagctg actccagtgc 1081 atccatgaac tctggggttc tcctggttcg gccctcacgg ctctcctcca gcgggacccc 1141 catgctggct ggagtctccg aatatgagct ccctgaggat ccccgctggg agctgccacg 1201 agacagactg gtcttaggca aaccacttgg cgagggctgc ttcgggcagg tggtgttggc 1261 tgaggccatc gggctggata aggacaaacc caaccgtgtg accaaagtgg ccgtgaagat 1321 gttgaagtcc gacgcaacgg agaaagacct gtcggatctg atctcggaga tggagatgat 1381 gaaaatgatt gggaagcaca agaatatgat caaccttctg ggagcgtgca cacaggatgg 1441 tcctctttat gtcattgtgg agtacgcctc caaaggcaat ctccgggagt atctacaggc 1501 ccggaggcct cctgggctgg agtactgcta taaccccagc cacaaccccg aggaacagct 1561 gtcttccaaa gatctggtat cctgtgccta tcaggtggct cggggcatgg agtatcttgc 1621 ctctaagaag tgtatacacc gagacctggc tgctaggaac gtcctggtga ccgaggataa 1681 cgtaatgaag atcgcagact ttggcttagc tcgagacatt catcatatcg actactacaa 1741 gaaaaccacc aacggccggc tgcctgtgaa gtggatggcc cctgaggcgt tgtttgaccg 1801 gatctacaca caccagagcg atgtgtggtc ttttggagtg ctcttgtggg agatcttcac 1861 tctgggtggc tccccatacc ccggtgtgcc tgtggaggaa cttttcaagc tgctgaagga 1921 gggtcatcga atggacaagc ccagtaactg taccaatgag ctgtacatga tgatgcggga 1981 ctgctggcat gcagtgccct ctcagagacc tacgttcaag cagttggtgg aagacctgga 2041 ccgcattgtg gccttgacct ccaaccagga ttatctggac ctgtccatac cgctggacca 2101 gtactcaccc agctttcccg acacacggag ctccacctgc tcctcagggg aggactctgt 2161 cttctctcat gagccgttac ctgaggagcc ctgtctgcct cgacacccca cccagcttgc 2221 caacagtgga ctcaaacggc gctgactacc aaccctgtcc cagttttctc ccattccgtc 2281 gtcacccgtg ccctcaccca caaatcccct gttggacaca ctgccttctc ctcctccttg 2341 ccgctggcaa gagccagtgc ctgactgaag gccttcctgt gttgtggcct tccccctcca 2401 tcacccccaa gacccctctt ctccctcttc ttagcctgct gtgtgagaga ggagccaaga 2461 ggcaggtgct tgccgacggc cgcatcctcc ttcccaggtg ttggaccaag acccgccccg 2521 ctgctggcac tgcttggagg tgtgcagagc ggaagcaagt ggagacgtat ccggggcatt 2581 cctgttgacc catcagcccc ttctgttctg gcggcagggc cttggggctc ctggaagccg 2641 tgaggtttct gtttaggcct taaccgaagg caacctctgc tccagatgga tggtaccagt 2701 agcttcttaa ttccaatact aatttgcttt gctgaccaaa tacctgcctg gtaccagaag 2761 acagggaggc agagactggg agccgtgatg tgcccttggc tgagcctaga cttggggctc 2821 tgtacatagc tatgaagaaa aacacaaagt gtataaatct tgagatatat ttacatgtct 2881 ttttaaaaag ggtcgttact agagatttac catgggggag acgcccaggg tagcatccgt 2941 tgctatatat taaaaacaca aacgaacaaa aaaaaaagaa ttc // LOCUS BOVTCRA01 402 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable region, and joining region. ACCESSION D90010 KEYWORDS T-cell receptor; alpha chain; joining region; leader sequence; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA52. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 402) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept 1 > 402 T-cell receptor alpha chain, leader sequence, variable region and joining region BASE COUNT 114 a 111 c 91 g 86 t ORIGIN 1 atgatgaaat cctcaagagt tttaccagtg atcctgtggc ttcagttgat ctcggtgagc 61 agccagcaga acacagtgga gcagagtcct gcgtctctgc ccgtcccaga gggagaggcc 121 gcctctctcg gctgcactta ttctgacagt gcttctcagt acttcatatg gtacagacag 181 taccctggga aaggccctga gtttctgctg caagtgtatg ccaacaaaga caaagaggaa 241 ggaaaattta cagcacagtc caataaaacc agcaagcatg tctccctgcg tatcagagac 301 tccgagccca gcgactcagc cacctacctc tgtgcagctt ctaactacca aggcagtcaa 361 ttaaactttg gaacaggaac tagactcacc atcacagcca aa // LOCUS BOVTCRA02 408 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable region, and joining region. ACCESSION D90011 KEYWORDS T cell receptor; alpha chain; joining region; leader sequence; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA29. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 408) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinary Public Health Obihiro Univ. of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept 1 > 408 T-cell receptor alpha chain, leader sequence, variable region and joining region BASE COUNT 115 a 109 c 95 g 89 t ORIGIN 1 atgatgaaat cctcaagagt tttattagtg atcctgttgg cttcagttga tcttgagcag 61 ccagcagaac acagtggagc agagcctgcg tctctgcccg tcccagaggg agccgctgca 121 tctctcggct gcacttattc tgacagcaat tctctgtact tcacatggta cagacagtac 181 cctgggaaag gccctgagtt tctgctgcaa gtgtatgcca acaacaacaa agaggaagga 241 aaatttacag cgcagtccaa taaaaccaac aagcatgtct ccctgcgtat cagagactcc 301 gagcccagcg actcagccac ctacctctgt gcagttgata cgatatccac cactgcaggg 361 accaagctaa cctttggaga aggaaccagg ctgatagtca agctgaaa // LOCUS BOVTCRA03 411 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable region, and joining region. ACCESSION D90012 KEYWORDS T cell receptor; alpha chain; joining region; leader sequence; variable region. SOURCE Bovine T-cell, cDNA to mRNA, clone BTA44. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 411) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept 1 > 411 T-cell receptor alpha chain, leader sequence, variable region and joining region BASE COUNT 104 a 114 c 101 g 92 t ORIGIN 1 atgatgaaat cctcaagagt tttattggtg atcctgtggc ttcagttgat cttggcgagc 61 acccagcaga acacagtgga gcagagtcct gcgtctctgc ccgtcccaga gggagccgcc 121 gcatctctcc gctgcactta ttctgacagc acttctcggt acttcacatg gtacagacag 181 taccctggga aaaggcctga gtttctcgtg caagtgtatg ccaacaacaa caaagaggaa 241 ggaaaattta cagcgcagtc caataaaacc agcaagcacg tctccctgcg catcagagac 301 tctgagccca gcgactcagc cacctacctc tgtgcagtga gcaatgatta tgggggtgct 361 gccaaccagg tcatctttgg gtctggcacg ctgctttctg tcaagccaaa a // LOCUS BOVTCRA04 393 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable region, joining region. ACCESSION D90013 KEYWORDS T cell receptor; alpha chain; joining region; leader sequence; variable region. SOURCE Bovine T-cell, cDNA to mRNA, clone BTA43. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 393) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinary Public Health Obihiro Univ. of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido Japan 080 Phone: 0155-48-5111 x281 FEATURES from to/span description pept 1 > 393 T-cell receptor alpha chain, leader sequence, variable and joining region BASE COUNT 111 a 88 c 95 g 99 t ORIGIN 1 atgaagacac aaagaagagt cttgctgagt ctcttgtgga tacagatttg ctgcctcaga 61 gtgcagatga aggtggagca gagtccgggg gttctgactc tccaagaggg gagaaattcc 121 tctctgatat gcaattattc tatttccatg accagtgtgc agtggttcca acaaaatcct 181 gatggacgcc tcatctcctt gttttatata gcttcaggaa tgcagcagaa aggaagactg 241 aaatccacca ttaatagtaa ggagcgttac agtcaactct acatcagaga ctcccagcct 301 ggggactcag ccacttactt ctgtgctgcg atcaggactg acaattccag aatcttcttt 361 ggaactggaa cccaggtggt ggtaaagccc aaa // LOCUS BOVTCRA05 393 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable region, and joining region. ACCESSION D90014 KEYWORDS T cell receptor; alpha chain; joining region; leader sequence; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA20. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 393) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept 1 > 393 T cell receptor alpha chain, leader sequence, variable and joining region BASE COUNT 96 a 97 c 93 g 107 t ORIGIN 1 atgtccttgg tgttcacctt gatgcttgag atgctcctgt ttctgagagc tggagcccag 61 tcagtgaccc agcctgatga ccacattgct gtctctgaag agccgtctgg agctgaagtg 121 caactactca tcttctgttt caccgatctc ttctggtaca tacagtaccc caaccaagga 181 ctccagcttc tcctgaagta cgtgtctgga gacaatcttg tttcaggcat caaaggtttt 241 gaggctgaat ttaggaacag tgagacgtct ttccacctga ggaaaatacc agctcattgg 301 aaagactcgg ccaagtactt ctgtgctctg agtaggactg acaattccag aatcttcttt 361 ggaactggaa cccaggtggt ggtaaagccc aaa // LOCUS BOVTCRA06 393 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable region, and joining rgion. ACCESSION D90015 KEYWORDS T cell receptor; alpha chain; joining region; leader sequence; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA25. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 393) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept 1 > 393 T cell receptor alpha chain, leader sequence, variable and joining region BASE COUNT 102 a 91 c 113 g 87 t ORIGIN 1 atgaagagaa aatggggagc cctgctgggg tttctgtggg tccagatttg cttggtgagg 61 ggagtaaagg tggagcagag tccttcagtc ttgagcctgc aggagggagc caactctact 121 ctgaggtgca atttttctga cacagtggac agtgtgcagt ggttccagca gaatcccgga 181 ggagccctca ccaccctgtt tttcatagct tcagggacga agaagaatga aaggatgagt 241 tccacagtga attctaagga gcggtatagc accctacaca tcacagcctc ccagctggaa 301 gacgcagcca cctacctctg tgcggtcgat ctcggttcag gaaacaggca acttgtcttt 361 ggaaaaggca caagacttgc tgtgactcca aaa // LOCUS BOVTCRA07 399 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, leader sequence, variable region, and joining region. ACCESSION D90016 KEYWORDS T cell receptor; alpha chain; joining region; leader sequence; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA15. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 399) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept 1 > 399 T-cell receptor alpha chain, leader sequence, variable and joining region. BASE COUNT 91 a 90 c 103 g 115 t ORIGIN 1 atggctttgc agagcactct ggaagcagtg tggctgggtg tcctcctcag ctctctctgg 61 aaggttgcag agagcaagga gcaagtattt cagtctccca ctgtggtctc tttggaggga 121 gctgtggcag aaatttcctg taatcactct atatccaatg tttatgactt cctctggtac 181 tttcacttcc caggatttgc accaagactc ctcattaagg gcttagggcc ttctcaacag 241 ggacgctaca acatgacata tgagaggttc tcttcatccc tgctcatcct ccaggtgcag 301 acggcagatg caggggttta ctattgtgct ctgaaggcga tttcaagtgg ccagaagttg 361 gtttttggaa gtgggaccat gttaaaggtg aatcttaaa // LOCUS BOVTCRA08 348 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90017 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA49. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 348) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 348 T-cell receptor alpha chain, variable and joining region. (AA at 1) BASE COUNT 99 a 90 c 75 g 84 t ORIGIN 1 ttgagtcttg ctaagacctc ccagcccatc ttcattgact catacgaagg acaagaagtg 61 aacatatctt gtaaccacac cacaatacgt acaatgagca atatcttgcg gtatgtgaag 121 gttccccatc cagggaccaa atttatcatt caaggatata agacaaacgt ggaaaatgaa 181 gtggcatctc tgttaattcc tcctgacagg aagttcagca ccctgagcct gccccaggct 241 tccctgagag acaccgctgt gtactactgc atcgtgagag atcaatatgg gggtgctgcc 301 aaccagctca tctttgggac tggcacgctg ctttctgtca agccaaaa // LOCUS BOVTCRA09 306 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90018 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA28. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 306) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 306 T-cell receptor alpha chain, variable and joining region. (AA at 1) BASE COUNT 85 a 76 c 78 g 67 t ORIGIN 1 ttgcttctgc aagaaggaga gaacttcacc acatactgca attcctcaag cattttaagc 61 agcttacagt ggtacaagca gagccctggg ggcagtcctg tcgtcttgat gatattagct 121 aagggtggag aagtgaagac agagcagaga ctgacaggtc agtttggaga gaccaaacag 181 cacagctccc tgcacctcac ggctgcccag ctctcagatg caggaaccta cttctgcgtc 241 agggatcgga caaacctctg gcaaaattgt ctttggaaga gggactcagc ttcatgttct 301 cccaaa // LOCUS BOVTCRA10 282 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90019 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA42. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 282) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 282 T-cell receptor alpha chain, variable and joining region (AA at 1) BASE COUNT 80 a 70 c 70 g 62 t ORIGIN 1 accagctact gcaattcctc aagcatttta agcagcttac agtggtataa gcagaggtct 61 gggagcagtc ctgtcctctt gatgatatta gctaagggtg gagaagtgaa gacagagcag 121 agactgacag gtcggcttgg agagaccaaa cagcacagct ccctgcacct cacggctgcc 181 cagctctcag atgcaggaac ctacttctgc acggggattc tgagcacagg ttatcagaaa 241 ttcgtatttg gaactggcac ccaacttttg atcaacccaa aa // LOCUS BOVTCRA11 258 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90020 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA13. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 258) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 258 T-cell receptor alpha chain, variable and joining region (AA at 1) BASE COUNT 81 a 70 c 54 g 53 t ORIGIN 1 agacacagct tcagactact tcccttggat aaacaaggac ctgggaaagg tccccaactc 61 cttatagcta ttcgttcaaa tatgggtgaa aaaaaagacc agagactgac tgttttattg 121 aataagacca ccaaacatct ctccctgcac atcgcaacca tccaacctgg agactcagct 181 gtctacttct gtgcagcaag aggtgcaggc cagaagctgg tattcgggcg aggaaccagg 241 ctgaccatca accccaaa // LOCUS BOVTCRA12 189 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90021 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA18. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 189) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 189 T-cell receptor alpha chain, variable and joining region. (AA at 1) BASE COUNT 59 a 52 c 33 g 45 t ORIGIN 1 tttaacattc cttcaggaac aaaacaagat ggaagattaa acgccacgac aatccctaaa 61 gaacgtcgca gctcactgca catttcctct tcacagacca cagactcagg cacttacttc 121 tgtgctggtg aacaaacctc tggcaaaatt gtctttggaa gagggactca gcttcatgtt 181 ctccccaaa // LOCUS BOVTCRA13 171 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90022 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA40. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 171) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 171 T-cell receptor alpha chain, variable and joining region. (AA at 1) BASE COUNT 39 a 36 c 50 g 46 t ORIGIN 1 gatctcgatg cggcaagggg atccagtgac tgtgaagtga ccctatacag tttcggggag 61 cccattatct tttctgggag tgactctacc ttgtacttct gtgctgtgag cggggattat 121 gggaacagac tcacttttgg aaatgggact cgagtgctgg tcacaccaaa a // LOCUS BOVTCRA14 156 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90023 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA15. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 156) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 156 T-cell receptor alpha chain, variable and joining region. (AA at 1) BASE COUNT 39 a 36 c 31 g 50 t ORIGIN 1 tttttttttt cattccttag acgctctgat gcacacagtt acctccttct gaaggaactc 61 cacatgaaag actttgcctc ttacctctgt gttgtgatgg atggcaacta tcagtggatc 121 tggggctctg ggaccaaact aattataaag ccagaa // LOCUS BOVTCRA15 156 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90024 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA21. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 156) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 156 T-cell receptor alpha chain, variable and joining region. (AA at 1) BASE COUNT 49 a 34 c 27 g 46 t ORIGIN 1 tttttttttc aaaagggaaa aaaaatcatc agcctcacca tctcttcctt gtacctggcc 61 gactcagcaa agtatttctg tgctcttcgg gatagtggaa ctagcagcta taaacagaca 121 tttggaaaag gcactctctt aattgtgaat ccaaaa // LOCUS BOVTCRA16 147 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90025 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA26. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 147) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Dep. of Veterinart Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 147 T-cell receptor alpha chain, variable and joining region (AA at 1) BASE COUNT 45 a 31 c 38 g 33 t ORIGIN 1 aagaaggatg aaaactactt ccacctgatg aaagcttcag cccattggag tgattctgcc 61 aagtacttct gtgctctgtg tgatccatgg ggaaacacag gaagactcac ctttgggcag 121 gggacagtgt tgcaagtaaa accagaa // LOCUS BOVTCRA17 135 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90026 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA27. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 135) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Department of Veterinary Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 FEATURES from to/span description pept < 1 > 135 T-cell receptor alpha chain, variable and joining region (AA at 1) BASE COUNT 36 a 40 c 34 g 25 t ORIGIN 1 aggagcggaa tgtccctaca catcacagcc tcccagctgg aagacgcagc cacctacctc 61 tgtgtggtgg acaggactgg tgtctacaat aagctgacct ttggaaccgg aaccaggctg 121 actgtacact cacaa // LOCUS BOVTCRA18 123 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90027 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA36. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 123) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Department of Veterinary Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 FEATURES from to/span description pept < 1 > 123 T-cell receptor alpha chain, variable and joining region (AA at 1) BASE COUNT 34 a 39 c 28 g 22 t ORIGIN 1 gagcggtata gcaccctaca catcacagcc tcccagctgg aagacgcagc cacctacctc 61 tgtgcggtgg aggaaaacac actgatcttt ggctcaggaa ccactttgac agtcaagcct 121 caa // LOCUS BOVTCRA19 123 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90028 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA41. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 123) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Department of Veterinary Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 FEATURES from to/span description pept < 1 > 123 T-cell receptor alpha chain, variable joining region. (AA at 1) BASE COUNT 23 a 34 c 32 g 34 t ORIGIN 1 tttttccttt ttgcggacca cagactcagg cacttacttc tgtgctgtgc agctggaagg 61 ggggtgctgc caaccagctc atctttggga ctggcacgct gctttctgtc aagccaagca 121 caa // LOCUS BOVTCRA20 51 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, variable and joining region. ACCESSION D90029 KEYWORDS T cell receptor; alpha chain; joining region; variable region. SOURCE Bovine T cell, cDNA to mRNA, clone BTA46. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 51) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Department of Veterinary Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 > 51 T-cell receptor alpha chain, variable and joining region (AA at 1) BASE COUNT 12 a 16 c 11 g 12 t ORIGIN 1 gccaaccagc tcatctttgg gactggcacg ctgctttctg tcaagccaaa a // LOCUS BOVTCRA21 955 bp ss-mRNA MAM 27-JUN-1990 DEFINITION Bovine T-cell receptor alpha chain, constant and 3' untranslated region. ACCESSION D90030 KEYWORDS T cell receptor; alpha chain; constant region. SOURCE Bovine T cell, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 955) AUTHORS Ishiguro,N., Tanaka,A. and Shinagawa,M. TITLE Sequence analysis of bovine T-cell receptor alpha chain JOURNAL Immunogenetics 31, 57-60 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Naotaka Ishiguro Department of Veterinary Public Health Obihiro University of Agriculture and Veterinary Medicine Inada-cho Obihiro, Hokkaido 080 Japan Phone: 0155-48-5111 x281 FEATURES from to/span description pept < 1 423 T-cell receptor alpha chain, constant region signal 935 940 polyadenylation signal BASE COUNT 223 a 288 c 211 g 233 t ORIGIN 1 gtcaaggacc ccaaccccac tgtgtaccag ctgagaagcc cccaatccag tgacacttct 61 gtctgcctat tcaccgattt tgattcaaat caagtcaaca tggaaaaaat catggggtcc 121 gaggggagca cggtgcacaa aacaaacagc accgtgctca acatggagat cctgggatcc 181 aagagcaacg ggatagtgac ttggggaaac accagcgatg ctggatgcgc atacaccttc 241 aacgagacca ttcccttcgc ctccagcttg gaaatctcct gtaatgccaa gctggtagag 301 aaaagctttg aaacagatat aaacctaaac tcccaaaacc tgtcagtgat agtgttccgc 361 atcctcctcc tgaaggtggt cgggtttaat ctgctcatga cgctgcggct ctggtccagt 421 tgaggtcacc gacaacctga gagccccgtg ctccctcgcc ccttgctcct cgtcgccctt 481 cctctccctc ttcaagcaga gaggcgcact ctctgccccc atggatgaga aggctccctc 541 ctcctctctg gcctggctgg ccataacacc aactggatcc tccacatact ggtgatcaag 601 atgctgacaa gctgcccagc acggctgcca cccactctct tcctcactgc tgcttgtcac 661 tgcctggcat tcttggcaaa gtcggggggc tccccacccc caccccccag agactgcctc 721 tgatgtccca ctggatggtg gatccccagt gggttctcct gggctctagc tcctggagaa 781 tgttgtgtgt tgtttatatt tttttacata gtgttcataa aaaaatatat atcacccttt 841 tccccaacat gtggggaaaa attatctttt cacgatctag gccctgctct tctgtgtatc 901 cgagccacat tgtatattct gctgccacgg cttcaataaa agcgatttgg aagag // LOCUS SUSFCK 3985 bp ss-mRNA INV 27-JUN-1990 DEFINITION S.purpuratus flagellar creatine kinase mRNA, complete cds. ACCESSION M33763 KEYWORDS creatine kinase; flagellar creatine kinase. SOURCE S.purpuratus sperm, cDNA to mRNA, clone WD2.2. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 3985) AUTHORS Wothe,D.D., Charbonneau,H. and Shapiro,B.M. TITLE The phosphocreatine shuttle of sperm: Flagellar creatine kinase results from a gene triplication JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.D.Wothe, 18-APR-1990. Author address: D.D.Wothe University of Washington Dept. of Biochemistry SJ-70 Seattle, WA 98195 E-mail:WOTHE@toby.acs.washington.edu FEATURES from to/span description pept 95 3619 flagellar creatine kinase (E.C. 2.7.3.2) signal 3961 3966 poly-A signal BASE COUNT 1184 a 923 c 910 g 968 t ORIGIN 1 ccctaaactt caatcacact tctaatcaag aaaggaacaa atttgtatta attatttcat 61 tttctacata tgcattgtaa tcagtgaccc agcaatggga tgcgcagcat cgtcacaaca 121 aacgacggcc acaggtggtc aaccagctgc tggtgaaaag gcaaacccgg cccccgctaa 181 taacaatcca aacgcagcaa acaaagcaga aacgacagga gcagctgaag aactcactaa 241 agaatctgaa ccatttgtgg aaccggatcc caattaccct gacctaagca agcacaacaa 301 ttacctggca gaatctttaa cgccttccat ctacaacaaa atctgtaacc ttcgcacact 361 ttctggttac tctgtggatg gttgcatgca gaccggcgtg gacaatccag gacatccttt 421 catcaagact gtcggattgg tggctgggga tgaagagtgt tacgatctct ttgctgacct 481 ctttgatcca actattgaca aaagacacaa cggatatccc cgcaatgcaa agcacacaac 541 tgatctcaac cctgatcatc tcaagggtgg tgatgacttt gatccaaagt atgtcctttc 601 atgcagagtt cgtacaggcc gctgcattag agggtacggc ctcccaccac attgcacgag 661 ggctgaacgt cgagacgttg agaaagtttg caaagacgca ctggcaactc tagatgggcc 721 actcaaaggc acgtattacc cactaacggg gatgacagag gaaatgcagg acaaactaat 781 tgcagatcat ttcctattcg ataagcccgt ttctcctttg cttatgtcag ctcgcatggc 841 ccgtgattgg cctgatggac gtgggatctg gcacaatgcc gataaaaact tcctggtttg 901 gataaatgag gaagatcaca ctcgggttat ttccatggaa acaagtggca acatgaagaa 961 tgtctttaaa cgattctgta atggtctcaa caaagtcgag aatgctctca aagcaaaagg 1021 gtatgaattc tcatggaatg agcatcttgg gtatgtgctc acttgtccat ccaaccttgg 1081 tacaggtgta cgtgcaggag tccacatcaa gattcctctt ttcagcaagc acgctggatt 1141 tgaatccatt ctgaagcatt atcgtctcca gaaacgtggc acaggaggcg tggatacggc 1201 atccactgac ggcacgttcg acatttcaaa cctagatcgt cttggcactt ctgaagtaca 1261 acaggttcaa tccgttgttg atggtgtcaa gaagctgatt gagttggaaa aagccttaga 1321 aaagggatca gatatcagcg gccagattcc ccgggaccca gcaatagttc gtgcagaaca 1381 agtgaaagaa ggatatccag acctatccaa acacaacaat catctggccc actgtctgac 1441 atacgacatt tggaaaagtc tcaaggacaa aaagacgcca agtggcttta ctctggatgg 1501 ctgcatccaa acgggagtca tgaaccctgg tcatcctcac atcatgaccg ttggaatggt 1561 tgcaggagat gaggaatcat atgacgtctt cgctgatatc tttgacccgg tcattgatgc 1621 acgacacgga ggttatccca aagatgccgt ccatgtcacc aacataaacc atgcagatct 1681 taaaggagga gataacctgg atcccaaata tgttttgtct tgccgtgttc gcacaggacg 1741 cagcatcatc gggtattctc tgcccccgca ttgcaccgtg gaagagagag ctgccgtaga 1801 gacgatcacg atcggagctc ttgataaatt tgatggagac ttgcaaggaa aatactaccc 1861 actggaagga atgtctgatg aaacacaaac acagctgata gatgaccatt tcctcttcga 1921 taagcctgtt tctcctcttc ttaccgccgc aagaatgcat cgtgattggc cacagggaag 1981 aggaatttgg cataacgaaa acaagaactt cttggtgtgg gtcaatgaag aagatcatat 2041 ccgtgtcatc tctatggaga aggatgggaa tatgcgagcg gtcttcaagc gattctgtga 2101 aggtttgcag aagttcgagc agatgatcaa aaaggatggc aaggagttca tgtggaacaa 2161 acatctgggt tacgtactta cttgtccgtc taacctggga accggcctac gcgctggcgt 2221 acatgtgaaa ctgcctctgc tgagcaagta tccgagattc gaccagatcc taagagccct 2281 ccgacttcag aagagaggta caggtggagt agatactgcc tcaactgatg gaaccttcga 2341 catctcgaac ctcgacaggc ttggatcatc agaagtacag caagttcagt ttgttgtcga 2401 tggcgttgaa cttctcgtcc aaatggagaa gaaacttgaa aagggcgagg acatttttga 2461 tattctacca cagcaatgcc gtcccaaacc accaataaag cctttcagtt atgactaccc 2521 cgatttctct ctgcacaaca actggatgtc taaatgtatg acagaagaaa tctacaacaa 2581 actttgcaac ttgaagacaa aaggtggagt caccctgaat gactgcattc aaactggcat 2641 tgacaatcct ggacatccat acatcatgac tgttggtttg gtagctggag atgaagagtg 2701 ctacgaggtt ttcgctcctc tcttcgaccc ggtgataagt gccaggcacg gcggctacgc 2761 tcttgatgcc aaacacccta ccaacctcaa tgcagcagaa ctcaaaggcg gagatgattt 2821 ggaccctgag ttcgtcctat cttgtcgtgt acgtactggt cgttgcatcc gcggcttggc 2881 tctcccgcca tgttgcacca gagcagaacg tgcggaagta gagaagatta caacagaagc 2941 attatctacc ctaagtggac ctctgaaggg aaagtactac ccactaaccg gtatgacaga 3001 cgaagagcaa gagaagctca tcgaagacca cttcctcttt gacaagcccg tatccccgct 3061 gttgctttgt gccaatatgg cacgtgactg gccccagggt cgtggcatat ggcacaatga 3121 tgagaagaat ttccttgtct gggtcaacga ggaagatcac actcgagtca tctccatgga 3181 gaagagcggg aacatgaagc gagtgtttga gaggttctgc gacggtctca aaaaggtcga 3241 agactccatc aagtccaagg gttaccagtt catgtggaat gagcatcttg gttatgtact 3301 gacatgtcca tccaacctcg ggacgggact gcgtgcaggc gtccatgtca aggttccttt 3361 gttaagtcaa cagaaaatct ttgactctat ccttgatcac atgcgtctgc aaaagcgagg 3421 aacaggaggt gtagacacag cctctactga cggcaccttt gatatctcca actcggatcg 3481 catcggtttc tccgaagtcc atcttgtcca acaactcgta gacggtgtca agctcctcgt 3541 caacctcgaa aaggctctca tgaagggaga agacatcaac agtctcctgc cagagaagct 3601 aagggaagat tcctcataaa gccaggatcc acagcttgca gaatagtggc cgttgtcgcg 3661 caagtgttat aaatgccacc tttagaagat gtagcctaca cattaaaaaa aaatacaatt 3721 agaattgaaa tctataagga cgatttgaat ataccataaa gatttgccca tgattttgaa 3781 tcaaaataac ttcatgctga acttgtaaat gaaaaagtaa tgttagaaaa ctctgtaaat 3841 tgggaagata taaaatttca tggccgtttt tctttctgat gaagtacact aattatatta 3901 atttctctta ttaatatgta tggaaacttt taattatgta aaaacaagga aacgaactga 3961 aatacaattc aaactacttt tcatt // LOCUS HAMPPTPCA 2126 bp ss-mRNA ROD 27-JUN-1990 DEFINITION Hamster dihydroorotate synthetase mRNA, partial cds. ACCESSION M33702 KEYWORDS CAD; aspartate transcarbamoylase; carbamoyl phosphate synthetase; dihydroorotase; dihydroorotate synthetase; pyrimidine pathway trifunctional protein complex. SOURCE Hamster SV40 transformed cell line BHK21, cDNA to mRNA, clone pCAD142. ORGANISM Cricetulus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2126) AUTHORS Williams,N., Simpson,R.J., Moritz,R.L., Peide,Y., Crofts,L., Minasian,E., Leach,S.J., Wake,R.G. and Christopherson,R.I. TITLE Location of dihydroorotase domain within trifunctional hamster dihydroorotate synthetase JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.I.Christopherson, 13-APR-1990. Univ. of Sydney, Dept. of Biochem., NSW, 2006 AUSTRALIA FEATURES from to/span description pept < 1 > 2126 dihydroorotase synthetase complex (AA at 3) (EC 3.5.2.3) BASE COUNT 415 a 676 c 609 g 426 t ORIGIN Short arm of chromosome B9. 1 gtcgacggct ttcctccttc gtcaccaagg gctaccgcac gcggcgcctg gctgctgact 61 tctctgtgcc tctcatcatc gacatcaagt gcaccaaact cttcgtggag gccctgggtc 121 agattggccc cgccccgcct ttgaaggttc atgtagactg catgacctcc cagaaactgg 181 tgcggctccc tgggttgatc gacgtccatg tgcaccttcg ggagccaggt gggacacaca 241 aagaagactt tgcctcgggc acagctgctg ccttggctgg gggtgtcacc atggtctgtg 301 ccatgcctaa tacccggccc cccatcattg atgcccctgc tctggccctg gcccagaagc 361 tggcagaggc cggtgcccgc tgtgactttg ccctgttcct tggggcctca tctgaaaacg 421 cggggactct gggtgctgta gccgggtctg cagcaggact aaagctctac ctcaacgaga 481 ccttctccga gctgcggctg gacagtgtgg cccaatggat ggagcatttt gaaacgtggc 541 cttcccacct ccccattgtg gcccatgcag agcggcagag tgttgccgca gtcctcatgg 601 tggctcagct gacccagcgc ccagtgcaca tatgtcacgt ggctcggaag gaagagatcc 661 tgctgattaa aactgcgaag gcacaggggc tgcctgtgac ctgtgaggtc gcaccccacc 721 acctcttcct gaatcgggaa gacttggagc gcctgggacc tgggagggga gaggtccggc 781 ctgagcttgg ttcccgagag gatatggagg ctctgtggga gaatatggct gtcatcgact 841 gctttgcctc agaccacgct ccccataccc tggaggagaa gtgtgggccc aagcctccac 901 ccggcttccc agggctggag accatgctgc cgctcctgct gacagctgtg agtgagggcc 961 ggctcagtct ggatgacctg ctacagcgcc tgcaccacaa cccgcgccgg atcttccacc 1021 tgccccttca ggaggacacc tatgtagagg tggatctgga gcatgagtgg acaatcccta 1081 gccacatgcc cttctccaag gcccgctgga ccccttttga ggggcagaag gtgaagggta 1141 ccatccgccg tgtggtcctg cgaggggagg ttgcttatat cgatggacag gtgttggtac 1201 ccccaggcta tggacaagat gtacggaagt ggcctcaggg ggctgttccc cagccccctc 1261 cttcagctcc tgccaccaca gaaataacca cgacacctga gagaccacgc cgagtcatcc 1321 caggccttcc tgatggccgc ttccacttgc caccccgaat ccacagagcc tctgacccag 1381 gtttgccagc tgaggaaccc aaagagaagc catccaggaa agtagtggag ccagagctca 1441 tggggacccc cgacggtccc tgctaccctg caccgccggt gcctagacag gcatcacctc 1501 agaacctggg ctcttctggc ctactgcacc cacagacttc acccctgctg cactccttag 1561 tgggccaaca catcctgtct gtcaagcagt tcactaagga tcagatgtct catctgttca 1621 acgtcgcgca cacactacgg atgatggtgc agaaagagcg gagccttgac atcctaaagg 1681 gcaaggtcat ggcctccatg ttctacgagg tgagcacccg caccagtagc tcctttgcag 1741 cagccatggc ccggctcggg ggcgctgtcc tcagcttttc agaagccacg tcctccgtcc 1801 agaaggggga atcccttgcc gactctgtgc agaccatgag ttgctacgct gatgtcgttg 1861 tgctccggca ccctcagcct ggagctgtgg agctggcagc caaacactgt cgcagaccag 1921 tgatcaatgc tggggatgga gtcggagagc accctactca ggccctgctg gacatcttca 1981 ctatccggga agagctgggg actgtcaatg gcatgacgat caccatggta ggtgacctga 2041 agcatgggcg cacagtgcac tccctggcct gcctgctcac ccagtaccgt gtgagcctac 2101 gctacgtggc acctcccagc ctgcgc // LOCUS TRBALDA 1119 bp ds-DNA INV 27-JUN-1990 DEFINITION T.brucei fructose-bisphosphate aldolase (ald) gene, complete cds. ACCESSION M19994 KEYWORDS fructose-bisphosphate aldolase. SOURCE T.brucei brucei (strain 427) DNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1119) AUTHORS Marchand,M., Poliszczak,A., Gibson,W.C., Wierenga,R.K., Opperdoes,F.R. and Michels,P.A.M. TITLE Characterization of the genes for fructose-bisphosphate aldolase in Trypanosoma brucei JOURNAL Mol. Biochem. Parasitol. 29, 65-76 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.A.M.Michels, 03-AUG-1988. FEATURES from to/span description pept 1 1119 fructose-bisphosphate aldolase (ald) (EC 4.1.2.13) BASE COUNT 240 a 334 c 305 g 240 t ORIGIN 1 atgtccaagc gtgttgaagt tctgcttacc caactccctg cgtacaaccg cctgaagacg 61 ccatatgaag cggagctgat tgaaaccgcc aaaaagatga ccgcccccgg taagggtctc 121 ctcgccgccg atgagtccac tggttcttgc tcgaagcgct ttgcgggcat cggtctcagc 181 aacactgcag agcaccgccg tcagtaccgc gctctcatgc tggaatgcga aggtttcgag 241 cagtacatca gcggtgtcat cctgcacgat gagaccgtgt atcagaaagc taagaccggc 301 gaaacattcc ctcagtacct ccgtcgtcgt ggtgtggtgc ctggcatcaa aaccgattgc 361 ggtctcgagc ccctcgtgga gggtgccaag ggcgagcaga tgactgctgg tctcgacggt 421 tacatcaaac gggccaagaa atactacgct atgggctgcc gcttctgcaa gtggcgcaac 481 gtgtacaaga tccagaacgg caccgtgtct gaagctgttg ttcgtttcaa cgctgaaaca 541 ctcgctcgct acgctatcct ctcccaactc tgcggtcttg tcccgattgt ggagcctgag 601 gtgatgatcg atggcactca cgacattgag acttgccaac gcgtttcgca gcatgtgtgg 661 tcggaggtgg tttctgcact ccaccgccac ggcgttgtat gggagggatg cctgctgaag 721 cccaacatgg tcgtccctgg cgctgaatcc ggcctcaagg ccactgcgga gcaggttgcc 781 gagtacaccg tcaaaaccct cgcccgtgtt attccccctg cgctccccgg tgtgacgttc 841 ctatcaggtg gcctcagtga ggttatggcc tccgagtacc tcaacgctat gaacaactgc 901 ccgttaccac gcccatggaa actgaccttt tcatacgccc gtgcacttca gtccagcgcc 961 ataaagcggt ggggtggaaa ggaatctggt gtcgaagctg gccgccgtgc gttcatgcac 1021 cgcgcaaaga tgaactcact tgcccaactc ggcaagtaca accgtgctga tgacgacaag 1081 gactcgcagt ctctgtacgt tgctggcaac acatactaa //
GenBank-Updates@genbank.bio.net (06/28/90)
LOCUS HUMSODB 8841 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human ornithine decarboxylase gene, complete cds. ACCESSION M33764 KEYWORDS ornithine decarboxylase. SOURCE Human sultan myeloma cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 8841) AUTHORS Hickok,N.J., Wahlfors,J., Crozat,A., Halmekyto,M., Alhonen,A., Janne,J. and Janne,O.A. TITLE Human ornithine decarboxylase-encoding loci: Nucleotide sequence of the expressed gene and characterization of a pseudogene JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.J.Hickok, 18-APR-1990. FEATURES from to/span description pept 3687 3788 ornithine decarboxylase, exon 3 (first expressed exon; E.C. 4.1.17) /nomgen="ODC1" /map="2p25" /hgml_locus_uid="LP0064V" 4072 4245 ornithine decarboxylase, exon 4 4452 4624 ornithine decarboxylase, exon 5 4883 5017 ornithine decarboxylase, exon 6 5148 5229 ornithine decarboxylase, exon 7 5406 5489 ornithine decarboxylase, exon 8 6546 6708 ornithine decarboxylase, exon 9 6791 6903 ornithine decarboxylase, exon 10 6997 7211 ornithine decarboxylase, exon 11 7851 7995 ornithine decarboxylase, exon 12 pre-msg 388 8338 ODC1 mRNA and introns IVS 602 3454 ODC1 intron A IVS 3565 3669 ODC1 intron B IVS 3789 4071 ODC1 intron C IVS 4246 4451 ODC1 intron D IVS 4625 4882 ODC1 intron E IVS 5018 5147 ODC1 intron F IVS 5230 5405 ODC1 intron G IVS 5490 6545 ODC1 intron H IVS 6709 6790 ODC1 intron I IVS 6904 6996 ODC1 intron J IVS 7212 7850 ODC1 intron K signal 359 363 TATA box rpt 2277 2586 Alu-repeat BASE COUNT 2261 a 1884 c 2279 g 2417 t ORIGIN 1 aggctctgcc gttcagctgc cgcgggcggg gccggggcct gcggcgtcgt gcgccgtgcg 61 ggaccagttc caggcgggcg agaccgcgca gggcggggcg gggcgaggcg gccgcagggc 121 ggggagggcg gggagaggcg gccgcagggc ggggagggcg gggcgcgaag ccgggggcgg 181 gggccacgcg tggggcaggc ggtgctcggc tcggctgacg tcggcccgcc ggcgccccac 241 cacgtccgcg cgggcccggg ttgccaccgc cggcccccgc ccctcccccg gcggtgtccc 301 ggccggaacc gatcgtggct ggtttgagct ggtgcgtctc catggcgacc cgccggtgct 361 ataagtaggg agcggcgtgc cgtggggctt tgtcagtccc tcctgtagcc gccgccgccg 421 ccgccgcccg ccgcccctct gccagcagct ccggcgccac ctcgggccgg cgtctccggc 481 gggcgggagc caggcgctga cgggcgcggc gggggcggcc gagcgctcct gcggctgcga 541 ctcaggctcc ggcgtctgcg cttccccatg gggctggcct gcggcgcctg ggcgctctga 601 ggtgagggac tccccggccg cggaggaagg gagggagcga gggcgggagc ggggcgggct 661 gcgggcccgg gcccgggcac gtgtgcggcg cgcctcgccg gcctgcggag acacgtggtc 721 gccgagcggg ccacgacctt gaggcgccgc ttcctcccgg cccggggttc tcccgcggct 781 ggataagggt gatccgggcg cctcgttctg cccccgtctt cacagctcgg ggctggaggg 841 gcctagggga gacccacccg gagaccctgc ggccccgcgc cggcctcttt cccaaccctt 901 cggcggccgc gcgctggccg gggagccgtt ggggaggccc tggcggccgc gcagcaggtg 961 caggggcgca gagcctgggc tcgccttggt acagacgagc gggccccggc cttggcgcct 1021 tcagtttcct tccagttttt attttcgctg tgtctacaga gcagatgaca ccaatttgga 1081 aacccgcgag agtgggtaga gctaagatag tcttgctgta gtagctgtga tattagatgc 1141 tcggccatga cttagaggtg tttatttaag gactgtgaat gactcggtga tttcggaaaa 1201 gcttggctta gatgaacgga catacacagg ggagacagcc ctaaggtttg cagaaaaggc 1261 tgattgtgct gtttgcgaag tcgaaataat tggtgaaagt gtagaaggca gaacctctca 1321 ggaatgtctg gggaggacaa agaatgtgtt ggctgacttt gtttaaacat aaaattgggc 1381 agactttaat tgatttgtga aatttttttc aaagtttgtt tgaattagcc cctatctctt 1441 ctaacattat cctcttgtgc taattgattg accattttaa ataacttagc tgttacagaa 1501 agaccgaaag gtgttcttca gtaaaatata ttcaagtaag ttacttaagt aacgccttaa 1561 aagatacaga aaagcaaaaa agtattggcg tattaaaaag aaatcaaaac tttccaagtt 1621 taggcctgaa cattgcctta aaaatattta ataaggcctc aaatgaccca gtccgagact 1681 gcatgagcct atttattatt aaattgtaaa tattcttcat ataaacaaaa atatataacc 1741 atgtctgtaa caaaaatggt tttgctagcg ttgttactct cttcccttct ccgaggggtg 1801 atttaggcaa cttcggaggt tgacaatgcc aagcagtcac aatagataga gctttaaagc 1861 aaattctatg catgggtttg gatttatgac aggcccgtca ccctgggcct gtcatagtac 1921 cccatgccag agcaaactgt gtccccgaac cattgcctgg cctctgtgcc cgtaggctgc 1981 tggcactgaa gtgggttgca cagtggaaaa gaagaaagct ctacctggca gaaattttta 2041 aaggttaaaa taaataattt taagaaagct ggttcacaag gtgccacatt tgatgaaagc 2101 aaaatacagt ggcttttatt gttactagag tgatgttctt gcttgttttt cttttttggt 2161 gaagttagcc ccaaattatt ctcatagcta agcaaatacg agagtgactg taaggacagt 2221 tggcattccc ggaattgcta aacttggtag gcaacgctgg tttaagaata ctgagttcta 2281 gccgggcgtg gtggctcacg cctgtaatcc caacactttg ggaggctgag gcaggcggat 2341 cacctgaggt cgggagttgg agaccagcct gactaacatg gagaaacgcc atctccacta 2401 aaaatataaa attagccagg ccccgggtgt ggtggcacat gccggtaatc ccagctactc 2461 gggagactga ggcaggagaa tcgcttgaac ccaggaggcg gaggttgagg tgagccgaga 2521 tcatgccatt gcactccagc ctgggcaaca agagtaaaac tctgtctcaa aaaaaaaaaa 2581 aaaaaatact gaattctgat caggtaacag caactgtaat acaatgtgat aagttgactt 2641 gaagattaca gtttttaaga agtatatacc cagctaatac atgaaaatta actcgtaaaa 2701 tctcaaatgc tccagacatt tccatgatgc ctgttggtca gtaaaaatca ttctaagact 2761 tagtggaagt aggaaatgtt tgtatggcaa cgtggtgaaa tcctgtctct actaaaaatg 2821 tgtataaagg ctataatgta atcccagcac tttggaagac cgaggcgggt ggatcacctg 2881 gggtcaggag tttgagaccc acctggacca caaaaattag ccgggcatgg tggcaggcgc 2941 ctgtaatccc agctgctggg gaggctgagg caggagaatc gcttgaaccc gggaggcaga 3001 ggttgcagtg agccaagatt gcaccgctgc actccagcct gggtgacagc gtgagactct 3061 gtctcaaaaa aaataaaaaa gtctataatg ctattttaag tttctaagga actgaaactg 3121 ctctgaaata aatcagacca ttataagact tttttccata tcagtgagct aagtgcagat 3181 aagcttctga aacttgcatg ctagattttt ttggtacaaa tatttgaaat gcttagtgtg 3241 ctgccttgga aaaacctggt attttttgtt gtgtccttat actgccaagg tttatggaat 3301 catgtacctt atgcctagta ataattagga tgaccaggcc agtgagtggt tcatatccgg 3361 ggcatgatta gctctgcgtg tgctcagcca gtgccccatc ttcaactcga tgtgttccta 3421 aggtagacag caaattccct attttatttc tcagattgtc actgctgttc caagggcaca 3481 cgcagaggga tttggaattc ctggagagtt gcctttgtga gaagctggaa atatttcttt 3541 caattccatc tcttagtttt ccatgtaagt attcagttta catttatgtt gcaggttaat 3601 cttaagaatt gtattgctaa ggcttctaag tgaatttctc cactctattt gcattttgtt 3661 gcatttcaga ggaacatcaa gaaatcatga acaactttgg taatgaagag tttgactgcc 3721 acttcctcga tgaaggtttt actgccaagg acattctgga ccagaaaatt aatgaagttt 3781 cttcttctgt aagtatatga ggcccatgct ggcagtgcag ctgagagtgc caggcaagtg 3841 gaaaactttg gcaaggtcta aggaagagca atgaggctta catgtcttgt tatggaatgt 3901 agaaattaat tcactggtgg taaattaata gtgataatgg tgatactcat atcagtggct 3961 agactcaaaa gagcaggatt cattgtgact gatgggaatg aaggtcgctg gctattggtg 4021 tggtgtgtgg tgaggctgct agtgagtcac ctgtgaccac tcttgtttca ggatgataag 4081 gatgccttct atgtggcaga cctgggagac attctaaaga aacatctgag gtggttaaaa 4141 gctctccctc gtgtcacccc cttttatgca gtcaaatgta atgatagcaa agccatcgtg 4201 aagacccttg ctgctaccgg gacaggattt gactgtgcta gcaaggtaag cgatagcagc 4261 aggcctcaaa agcgttgtat aaaatgggcc tggtattccc cacgaggcag atacaagttg 4321 tgttttttgg gcaataaatg ctcactaaag gcaaatgggg cgggggggta catgacaact 4381 tcccatgctt ttctgtttat tccacgtgtt aagccacata tggatagcat gacaccactc 4441 ttctttttca gactgaaata cagttggtgc agagtctggg ggtgcctcca gagaggatta 4501 tctatgcaaa tccttgtaaa caagtatctc aaattaagta tgctgctaat aatggagtcc 4561 agatgatgac ttttgatagt gaagttgagt tgatgaaagt tgccagagca catcccaaag 4621 caaagtgagt tattccccca tctgagggca agatcgggag cataagatat gtggattctt 4681 atcaaacaaa cttaaatttc tgattattat atttctatac tttagtagaa agtagttgaa 4741 acccccattg agtcatgaag cctgggactc aaactacaga atatatcagc gacagtattt 4801 agaacaggat tgtttttatt ttaattgtgg ctataagtga acatctatca tgagacattt 4861 gctgcacttt ccttgcttgt aggttggttt tgcggattgc cactgatgat tccaaagcag 4921 tctgtcgtct cagtgtgaaa ttcggtgcca cgctcagaac cagcaggctc cttttggaac 4981 gggcgaaaga gctaaatatc gatgttgttg gtgtcaggtg agattttggt gggatagcta 5041 gaggtcaaga cattgaacag tttgagtttt acaggctttc tcctagtgtt tgctattatt 5101 ttaagaaata ctaagacaca gtgtctcgtc tctttatttt accccagctt ccatgtagga 5161 agcggctgta ccgatcctga gaccttcgtg caggcaatct ctgatgcccg ctgtgttttt 5221 gacatggggg tgagtatacg tgaccctgtt agggaagggc gggacacaac tgacaataac 5281 tagtcttaat tctagagtta actttttatg gcagttggtt ctgtattaca tgggtttcag 5341 cctatctgct gcatacattt ttgttattag ctgtggatct ggctgactta ttttcttgat 5401 tctaggctga ggttggtttc agcatgtatc tgcttgatat tggcggtggc tttcctggat 5461 ctgaggatgt gaaacttaaa tttgaagagg taatttagaa caaaactgta atactcagta 5521 gccgttctaa taaattcctt tttggaatat ttcaaaattt aagtgtctta actaatacca 5581 caatgggctg aagtgtcttg gtgtgatatt tttgagtgat ttctttgtgc tgtctgacat 5641 tacacttgat accatttggt tttctaaagt gtgaatcagc tttcccagaa gtcttggata 5701 attggttaca ttggaaatca tggctcacac ctgtaatcca gcacttgggg aggccaaggt 5761 ggtaggatca cttgagccca ggagtttgag accagcctgg gcaacacagt gagaccccat 5821 ctctacaaaa aaaattttaa aattagcctg gtgtggtggc gggcacctgt aatcccagct 5881 acttggaagg ctgaggtggg aggatcactt gagcccagga ggttgaggct gcagtgagcc 5941 atgatcatgc cactgcactc agcctgggct acagagtgag accctgtctc aaaaaaaaaa 6001 aagaaaaagc atgttgctgt gggcttccta gagaatatgc tgactgtagc acatcatcac 6061 cccaaatgtg ctttgctaga cctatgcttc ctctccttaa aatacttgaa atgtttagtc 6121 acttaggaag ttaagccatt atattggtgc ttgaatttat aaaatatatc cacatggttt 6181 gttaaaatca tgacgtaggc agaataggat ttttatcctg ttggcatgta tttgttaaaa 6241 tgttttgaca tcttgatgcc ttcctaggta gtagttagtt gcgtactgtt ctttgataaa 6301 aatcataccc ataacatcct aaaggagata gggtgcctgg aggggaatga aaacgagcca 6361 cctgggatat gtagcctggt tttcagggag atgttgatgt ttttttgctt ttgttacttt 6421 aatgataaac ctgtctgttg atgcctggtc tcatgatgtc atgtcacaag gccctgtgat 6481 gttactcccc catgtgaatt tcccacaatg aaggctgctc tttcttttct gtttcactct 6541 cttagatcac cggcgtaatc aacccagcgt tggacaaata ctttccgtca gactctggag 6601 tgagaatcat agctgagccc ggcagatact atgttgcatc agctttcacg cttgcagtta 6661 atatcattgc caagaaaatt gtattaaagg aacagacggg ctctgatggt atgtataaag 6721 gacgaatcac ttcatgtata actgaaagct gatgcaaaaa gtcattaaga ttgttgatct 6781 gcctttctag acgaagatga gtcgagtgag cagaccttta tgtattatgt gaatgatggc 6841 gtctatggat catttaattg catactctat gaccacgcac atgtaaagcc ccttctgcaa 6901 aaggtaattt ctgagcatac tgtataaaac aattaagagg actggtcaca acacgtgtaa 6961 ttaagtagta cttcctctct ccgtctcttt atatagagac ctaaaccaga tgagaagtat 7021 tattcatcca gcatatgggg accaacatgt gatggcctcg atcggattgt tgagcgctgt 7081 gacctgcctg aaatgcatgt gggtgattgg atgctctttg aaaacatggg cgcttacact 7141 gttgctgctg cctctacgtt caatggcttc cagaggccga cgatctacta tgtgatgtca 7201 gggcctgcgt ggtaagtaag ccatgcatgt tgatggtgct gccaagaata ggcaccttct 7261 tggatgtgtg cttcttgtct agacgaataa gaaattgtct tgcctaagat taaatatata 7321 tggatatttt tcctaagaaa agttttagaa aagactgatg agtgtatttc tatgtaattg 7381 gaatatattt aagttcatgc catgtgtctt gtggtttcct tattaccaaa acggtgactg 7441 aagaaacgct tgctttagaa atacattgaa ttggccaggt gtgctggctc acacctgaaa 7501 tcacaacaca ttgggaggcc aaggcagaag gatcacttga gcccaggagt tcgagcctgg 7561 gcaacatagt gagaccctgt ctctacaaaa aattaaaaaa ttagttggcc atggtagtgg 7621 gcgcctgtag tcccagctgc ttggctaagg tgagaggttt gcttgagcct gggaggttga 7681 ggctgcggtg agctatgata gcaccattgt attccagcct gagtaacaga gaaagaccct 7741 gtctcagaaa aaaaaaaaat acattgaatt gtttcctgat ggaagtaaat actctcatgc 7801 ccagttagga gtgagtcagg gtttttaata tgccactttt tctttctcag gcaactcatg 7861 cagcaattcc agaaccccga cttcccaccc gaagtagagg aacaggatgc cagcaccctg 7921 cctgtgtctt gtgcctggga gagtgggatg aaacgccaca gagcagcctg tgcttcggct 7981 agtattaatg tgtagatagc actctggtag ctgttaactg caagtttagc ttgaattaag 8041 ggatttgggg ggaccatgta acttaattac tgctagtttt gaaatgtctt tgtaagagta 8101 gggtcgccat gatgcagcca tatggaagac taggatatgg gtcacactta tctgtgttcc 8161 tatggaaact atttgaatat ttgttttata tggattttta ttcactcttc agacacgcta 8221 ctcaagagtg cccctcagct gctgaacaag catttgtagc ttgtacaatg gcagaatggg 8281 ccaaaagctt agtgttgtga cctgttttta aaataaagta tcttgaaata attaggcatt 8341 gggacgtttt tatggtgtgt tcattccaga cagttcacga atcccgtata gctcgctctg 8401 attctcagag aacaatgagt gggtccaccc acacacaggt aggaggacag gtgagacgga 8461 agccccatcc tcccatgtgg acggtgcaca tctgctcagc ccaccccaca tgtccagagt 8521 tggctgcaaa ctccttgtcc agagcctctg gtggtgggac ctacttaagt ctgacggacc 8581 tgtcctgtcc aggccagtgc ccagggaagg tgtgggaggc cctttgagcc tggcctgcag 8641 agaccatccg tgtcccctcc caccttcatg cctgtgagaa gttaggaatg tatacggtac 8701 cacatttggc agtcagctta ttttaataaa ttcagcaaca gcaagtccct accatgttgt 8761 gtatcttcac catcttgtct gaccatgacc actggccttg tgtgttcttt tactcaacgt 8821 gtacccccgc tctcccccaa a // LOCUS XELFIMB1 1341 bp ss-mRNA VRT 28-JUN-1990 DEFINITION X.laevis integumentary mucin B.1 (FIM-B.1) mRNA, complete cds. ACCESSION J02910 KEYWORDS integumentary mucin; mucin. SOURCE X.laevis skin, cDNA to mRNA, clones pREP1218 and pXGL-3'-1. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1341) AUTHORS Probst,J.C., Gertzen,E.-M. and Hoffmann,W. TITLE An integumentary mucin (FIM-B.1) from Xenopus laevis homologous with Von Willebrand factor JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Hoffmann, 26-APR-1990. FEATURES from to/span description pept < 1 1197 integumentary mucin (FIM-B.1; AA at 1) mRNA < 1 1341 FIM-B.1 mRNA signal 1319 1324 poly-A signal rpt 1 279 repeat sequence BASE COUNT 472 a 290 c 268 g 311 t ORIGIN 1 cctgcacctt cagaaacaac aggagagtca acacctgcac cttcagaaac aacaggagag 61 tcaacacctg caccttcaga aacaacagga gagtcaacac ctgcaccttc agaaacaaca 121 ggagagtcaa cacctgcacc ttcagaaaca acaggagagt caacacctgc accttcagaa 181 acaacagtac catcaggaga gtcaacacct gcaccttcag aaacaacagt accatcagta 241 ccatcaggag agtcaacacc tgcaccttca gaaacaacag agctgagaat catcccacca 301 gaagtatcta cagttgcagt gccagtaaca actggacaga taactcctgc agtaacaact 361 gaacacagca cagaagaaat actgacactt cctccgccag tggtaggacc agttcttcca 421 gcaaagccaa cagtagatat ctccaaatat acaaacacta caactactaa atcgactgtc 481 cctacaacta ccattcctcc taaagctact tgctgtggat cgtctggaga gtcagtacaa 541 gcaggtcata tgtggcaaac tggttgtgat gtgtgcacat gcaatggaac ctctggaaaa 601 acacaatgtg caccacgcca gtgtgaaaag gagataatct gtaagagcga cgaaaggagg 661 gtactgagaa aacctgggaa atcatgctgt ggatattgcg agccgctgac atgtaaacat 721 aatggaacag aatataagct tggggctacc tttatagaca aatcgaatcc ttgtatcact 781 tatagatgtg acgcatcagg cttgacagtg aatgttaaat catgcccaaa tgaacaagtt 841 tgcagtaaga gtgaaagaac ctatgattca gatggctgtt gtttctcatg tgacacatca 901 tgtaaaccag tccctgcgac agtgggtatc caaggggagt acgattatca gaatgaaaaa 961 acaaactgct ctgctaatat cataatggca aaatgttctg gtcaatgcca acataaattg 1021 acctatgata caattgacaa taaagttgtt accaaatgtc ggtgttgtaa ggcagaccgt 1081 gttgaaccaa gaaaagctca tctggtgtgt gataatggaa agaaaaaaat atataaatac 1141 aaacatatca catcttgcaa atgcacatct tgtaccgctt acaatataag actgtgaaga 1201 ctttcatgca tttgctaact gtcagctttt gcatgtacaa taaaataaac atgggggata 1261 tattggaggc agagatatct gttatggtac ataaagtatt ttttgtctgt tattgaaaaa 1321 taaatggttt ttttcagctg t // LOCUS LACISTRAN 808 bp ds-DNA BCT 28-JUN-1990 DEFINITION L.lactis insertion sequence IS946 transposase gene, complete cds. ACCESSION M33868 KEYWORDS transposase. SOURCE L.lactis DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 808) AUTHORS Romero,D.A. and Klaenhammer,T.R. TITLE Characterization of gram-positive insertion sequence IS946, an Iso-ISS1 element, isolated from the conjugative Lactococcal plasmid pTR2030 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.A.Romero, 26-APR-1990. Author address: D.A.Romero North Carolina State University Dept. of Food Science Box 7624, Raleigh, NC 27695 FEATURES from to/span description pept 74 754 transposase (put.) rpt 1 18 inverted repeat rpt 791 808 inverted repeat signal 6 35 promoter region binding 62 65 ribosomal binding site BASE COUNT 281 a 137 c 158 g 232 t ORIGIN 1 ggttctgttg caaagttttc tgataagtct attttagtgt aaaatgaata aaaatgacag 61 cgaggatata tcaatgaact attttaaagg taaacaattt caaaaagatg tgattattgt 121 cgctgttggt tactacctgc gttacaatct aagctatcgt gaagttcagg aattgttata 181 tgatcgtgga ataaatgttt gtcatactac gatttatcgt tgggtgcaag aatacagtaa 241 agtcctctat catctctgga aaaagaaaaa tagacagtcc ttctattcgt ggaaaatgga 301 cgaaacctat atcaaaatca aaggtcgttg gcattatctc tatcgtgcaa ttgatgcgga 361 tggcttaacc ttagatatct ggttacgaaa gaaacgggat acgcaagcag cctatgcttt 421 cttaaaacga ctccataaac agtttggtca accaagagta attgtcacgg ataaagcgcc 481 ctctattggt tctgcattta gaaagttaca gagtaacggt ttatatacta agacagagca 541 tcgaaccgtg aagtatctca ataacctcat tgagcaagac catcgaccaa tcaaacgacg 601 caataaattt tatcgaagtc tacgaactgc ctcaaccacg attaagggca tggaaacaat 661 tcgaggaata tacaaaaaga accgaagaaa tggaacgctc ttcggatttt cggtatctac 721 tgagattaag gtcttaatgg gaatattagc ttaagaacaa gaaggattat aaaccttgta 781 tttgattttt aaactttgca acagaacc // LOCUS RSSTRXA 464 bp ds-DNA BCT 28-JUN-1990 DEFINITION R.sphaeroides Y thioredoxin (trxA) gene, complete cds. ACCESSION M33806 KEYWORDS thioredoxin Y. SOURCE R.spaeroides DNA, clone pUTC3. ORGANISM Rhodobacter sphaeroides Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 464) AUTHORS Pille,S., Chuat,J.-C., Breton,A.M., Clement-Metral,J.D. and Galibert,F. TITLE Cloning, nucleotide sequence, and expression of the Rhodobacter sphaeroides Y thioredoxin gene JOURNAL J. Bacteriol. 172, 1556-1561 (1990) STANDARD simple staff_review FEATURES from to/span description pept 57 377 thioredoxin (trxA) BASE COUNT 84 a 147 c 153 g 80 t ORIGIN 1 ccggcagggg gccatcctta ctttccaagc aagcctacgg atcaggagaa ttgaatatgt 61 ccaccgttcc cgtgacggac gccaccttcg acaccgaggt gcgcaagtcc gacgtgcccg 121 tcgtcgtcga tttctgggcc gaatggtgcg gcccctgccg gcagatcggc ccggcgctcg 181 aggagctctc gaaggaatat gccggcaagg tgaagatcgt gaaggtcaat gtcgacgaga 241 accccgagag cccggcgatg ctgggcgttc gcggcatccc ggcgctgttc ctgttcaaga 301 acggtcaggt cgtgtcgaac aaggtcggcg ctgcgccgaa ggccgcgctg gccacctgga 361 tcgcttcggc gctctgagat cctctgccat agcagagacc ggacggggcg ccttcgggcg 421 cccttttcct tgagcaagcg accgaccgcg acgggcgagg atcc // LOCUS PSECATRA 1070 bp ds-DNA BCT 28-JUN-1990 DEFINITION P.putida catR regulatory protein gene, complete cds, and catBC gene, 5' end.. ACCESSION M33817 KEYWORDS regulatory protein. SOURCE P.putida DNA. ORGANISM Pseudomonas putida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1070) AUTHORS Rothmel,R.K., Aldrich,T.L., Houghton,J.E., Coco,W.M., Ornston,L.N. and Chakrabarty,A.M. TITLE Nucleotide sequencing and characterization of Pseudomonas putida catR: a positive regulator of the catBC operon is a member of the lysR family JOURNAL J. Bacteriol. 172, 922-931 (1990) STANDARD simple staff_review FEATURES from to/span description pept 135 1004 catR regulatory protein pept 3 < 1 (c) cis,cis-muconate-lactonizing enzyme I (catB) (EC 5.3.3.4) mRNA 61 < 1 (c) catBC mRNA BASE COUNT 199 a 355 c 329 g 187 t ORIGIN 1 cattgttgcc aggtccgttg tttcgggctg ccagccgcgg gcccagtggt gatttacttg 61 ttgttcaagg attgcgcgag accctgatag ccgtccaata tcgaatgaat ctcccaccat 121 accctggagg tctgatggag ctgcgccact tgcgttactt caaggtcctg gccgaaaccc 181 tgaacttcac tcgcgccgcc gagctgctgc acattgccca gccgccgctg agccggcaga 241 tcagccaact cgaggaccag ctcggtacct tgctggtagt gcgcgagcgc ccgctgcggc 301 tgacagaggc gggtcgcttc ttctacgaac agagctgcac cgtgctgcag ctgcagaaca 361 tcagcgacaa cacccgtcgc attggccagg gccagcgcca gtggctgggg atcggcttcg 421 ccccgtcgac cctgtacaag gtgctgccgg agctgatccg cgagctgcgc caggacagcg 481 agctggaact gggcctcaac gagatgacca cgctgcagca ggtggaggcg ctgaaaagcg 541 ggcgcatcga catcgccttc gggcgcattc gcatcgatga cccggcgatt caccaacaag 601 tgctgtgcga ggacccgctg gtggcggtat tgcccaagga tcacccgctg gcgagcagcc 661 cgcttacact tgcccagctg gctggcgagg cgttcatcct ctacccggcc aacccgcggc 721 ccagctatgc cgaccatgtg ctggcactgt tcgcccacca cggcatgagc atccacgtca 781 gccaatgggc caacgaactg cagaccgcca tcggcctggt ggccgtcggc gtgggcgtga 841 ccctggtgcc ggcgtcggtg caacagcagc accgcaccga tatcgaatat gtaagcctgc 901 tcgacagcgg cgccgtcagc ccgatcatcc tcagccggcg caagggcgat gtgagcccga 961 tcgtgcagcg ctgcctgaca ttgattgcac aacaggccga atgagggcgc gctcctggaa 1021 tcaacggtct ggctaaacac ctgccccctg tgggagcggg ttcacccgcg // LOCUS HUMMHDQ1A 1515 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human MHC class II HLA-DQ-alpha gene, regulatory region. ACCESSION M33765 KEYWORDS cell surface antigen; cell surface glycoprotein; class II gene; major histocompatibility complex. SOURCE Human (DR4, DR4) DNA, clone H11A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1515) AUTHORS Ombra,M.N., Del Pozzo,G., Perfetto,C. and Maffe,A. TITLE Effect of the AIR-I locus on the activation of an enhancerless HLA-DQA1 JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Guardiola, 18-APR-1990. BASE COUNT 441 a 322 c 340 g 412 t ORIGIN Chromosome 6p21.3. 1 atataactga tgacctcatc tatccatgag gctgctcacg aaatgccctc ccctggccag 61 gcacggtggc tcatgcctgt aatccaagca ctttgggagg ctgaggcagg cagatcatga 121 ggtcaggagt tcaagaccag cgtggccaac atagtgaaac tctgtctcta ctaaaaatac 181 agaattagcc gggcatggtg gcagcgctgc agtcaccagc tactcgggag ggtgaggcag 241 gagaatcgct tgaacctggg aggcaaaggt tgcagtgagc cgagaccatg ccattgtacc 301 tcagcctggg tgacagagtg agactacgtc tcgaaaagaa aagaaaaaga aaagaaaaag 361 aaaaaaaaag aaaaatgccc atccctcttg cgaatggcag acatgcacac accagagaag 421 attccaattt agtgtcctcc ctctgttcat agaacaattc ctcaagtcca ctctgagtag 481 aggctgcatc acaacaaggg gattgccctg tctccttcca gggctcttaa tacaaactct 541 tcagctagta actgagatgt caccatgggg gatttttcta attggccaaa acctgacctg 601 gcagggtttg gtttgggtgt cttcagattt ccttgtcttg aggccctcac aattgctcta 661 cagctcagaa cagcaactgc tgaggctgcc ttgggaagag gatgatccta aacaaagctc 721 tggatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg tgaagacatt 781 gtgggtgagt gcatgagtga ggaaatgttc tctggagctg aaaaacagta aattgaagga 841 aaagagagaa agcgatttgc agagaaattg tagagatttc ctaagaccct ttcactatta 901 agagaaatta aaaattatag ctgttcctcc ttcaggaaac cagagaccca atccactctt 961 tttgttatgt atgcttttgt gttcactaag gatgctattc tgtttatatt atattcagtg 1021 acaacagcct ggaggtctct atgtcgttcc gtcatgattg cctcaaaaat tagtgaagtt 1081 tccatcagtg gaataatttt ttattattaa aaatgtatga agtgtcattc tcaaatttcc 1141 ctgaacaact tttgaagctt ttcgtatgtc tcctgtagta gatcttgggg tcgttccatc 1201 aattatatac tctatagata ttaaaaaagt tgcccgtttc tttctctcag acttactcac 1261 atttccacat gggaactggc acaggtgggg agtgggtaaa ggagtccagc aggctgaatg 1321 ccttcaacaa tcattttacc acatggtcct cacttactct cagctgcctc atatgtgtca 1381 cctcacaaaa taatcaaaat aaaaatgggc atgtagctaa gctttgtaaa tagtgaaaac 1441 atggaatgtc aattgttttt acatatttct attacaggta tagcttcaca tttttcttta 1501 gcaaaataag ggatc // LOCUS FSOSTI35X 1856 bp ds-DNA PLN 28-JUN-1990 DEFINITION F.solani STI35 protein gene, complete cds. ACCESSION M33642 KEYWORDS STI31; heat-shock gene. SOURCE F.solani phaseoli mycelial ATCC 60860 isolate DNA, clone cKAES018. ORGANISM Fusarium solani Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Hypocreales; Hypocreaceae. REFERENCE 1 (bases 1 to 1856) AUTHORS Choi,G.H., Marek,E.T., Schardl,C.L., Richey,M.G., Chang,S. and Smith,D.A. TITLE STI35, a stress-responsive gene in Fusarium spp JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.L.Schardl, 08-APR-1990. FEATURES from to/span description pept 564 1443 STI35 protein, exon 1 1492 1586 STI35 protein, exon 2 IVS 1444 1491 STI35 intron A BASE COUNT 401 a 585 c 445 g 425 t ORIGIN 1 acacactctc tgttgacgca tcctcagact cggaaataca tttcgttggc cgatatcgcg 61 ggttactcca tcatgatatc agccatccat tacatcgtta cccctcatct gcggtataaa 121 agccagccat tgtctcctgc tgcttgatcc attctctctt cttcttcatc aactcatcaa 181 caacttgaac caaaaccctt atcaagactc aaggtttgta ctcgaggtga gaggtgatga 241 actggtctga cacctgtgac atgctagata tcatcacttc cggtacggat cctccgtcga 301 ggaatctgca gccatggagt acctttttgc gggtagttgt ccgaacctga ggttggagcc 361 attcaggtaa agcatcatgc atgagccggt gcccgttctt cccttcttgg ccattgtctg 421 tactcctgtg tacatcaatg tcaagaagag gagagcgtgg ctgagattat acggcgaaac 481 ttgatctgga taataccagc gaaaggatca tgctctcctc cccccattga ccccaatccc 541 ctaacacaac cccaggccca acaatgtctc ctcccgctgc cgtctccccc cctgcccgct 601 ctgctgagct cgcctccgcc cccgccgtca agctccccgt cggcctgagc aagaacagcg 661 ctgctgccac caccgtcgag gagatggagg gcaagtggga cgacttcaag tttgccccca 721 tccgcgagag ccaggtctcc cgggccatga cccgccgcta cttccaggac ctcgacaact 781 acgccgagtc cgacattgtc attgtcggcg cgggatcctg cggtcttagc acccgctaca 841 tcctcggcaa gaagaggccc gacctcaaga tcgccatcat cgaggcttct gtgtctcctg 901 gtggtggtgc ctggctcggt ggccagctct tctcggccat ggtgatgcgc aagcccgccg 961 atgccttcct ccgcgaagtc ggcgtccctt acgaggacga gggcaactat gtcgtcgtca 1021 agcacgctgc cctcttcacc tcgaccatca tgtccaaggt gctccagctc cccaactgca 1081 agctcttcaa cgccacctgc gtcgaggacc tcatcacccg accttccaag gagggagtcc 1141 gcatctctgg tgtcgtcacc aactggaccc tcgtgtcgat gcaccacgac gaccagtcct 1201 gcatggaccc caacaccatc aacgcgcccc tcgtcatctc caccaccggc cacgacgccc 1261 ccatgggtgc cttttgcgtc aagcgtctcg tcagcatggg acgcatcgag aagctcggcg 1321 gcatgcgtgg tctcgacatg aacgtggccg aggatgccat cgtcaaggga acccgtgaga 1381 ttgtccctgg tctgattgtc ggaggaatgg agctttccga ggttgacgga gccaaccgaa 1441 tgggtgagtt tctcgacacg agtatcaata actggttgct aactcgacta ggccctactt 1501 tcggtgccat ggttctgagc ggtctcaagg ctgccgagga ggccctcaag gtcattgaca 1561 tccgccagaa gcagaactcg ttctaagcga gttgacgtac aagaagaaca tggactggga 1621 aacacgatga ttcaggaaac atgaattcag gagccaggac ctcgtctctc tttcgaggca 1681 tggatccttg tcgctcatta ggccctttca gagaatgaaa ttggatatca aaactgctga 1741 attgctcata ctttctcgat ataaaacttg acgtacaatg tgcaactcca agaactttac 1801 tcatcctcca tctccttccc ctttggcttc tcgggcggaa acttcatccc cccgtt // LOCUS FSOSTI35Y 1177 bp ss-mRNA PLN 28-JUN-1990 DEFINITION F.oxysporum STI35 protein mRNA, complete cds. ACCESSION M33643 KEYWORDS STI35 protein. SOURCE F.oxysporum cucumerinum (isolate B1-GK) mycelial culture, cDNA to mRNA, clone pKAE5030. ORGANISM Fusarium oxysporum Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Hypocreales; Hypocreaceae. REFERENCE 1 (bases 1 to 1177) AUTHORS Choi,G.H., Marek,E.T., Schardl,C.L., Richey,M.G., Chang,S. and Smith,D.A. TITLE sti35, a stress-responsive gene in Fusarium spp JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.L.Schardl, 08-APR-1990. FEATURES from to/span description pept 29 991 STI35 protein mRNA < 1 1145 STI35 mRNA (alt.) mRNA < 1 1177 STI35 mRNA (alt.) BASE COUNT 264 a 328 c 291 g 294 t ORIGIN 1 aagatatcat aacctgccca agatcaaaat ggcccctccc gctgctgtct cccctccctc 61 tcgctctgca gagcttgcta catctaccaa gctcccagtc atgagcaaga acatcaacac 121 caagaccgtt gaggagatgc tcggtcagtg ggatgatttc aagtttgctc ccatccgcga 181 gagccaggtc tcccgcgcca tgactcgccg ctacttccag gaccttgata actatgccga 241 gtctgacatt gttatcatcg gtgctggttc gtgcggtctg agcgccgcgt acattcttgg 301 caagaagcgt cctgatctca agatcgccat cattgaggct tctgtgtctc ctggtggtgg 361 tgcttggctt ggtggacagc tcttctccgc catgatcatg cgcaagcctg ctgatgcttt 421 cctccgcgag gtcggtgttc cttatgaaga tgagggtaac tacgtcgttg tcaagcacgc 481 cgctctcttc acctcgacca tcatgtccaa ggttcttcag atgcccaaca tcaagctctt 541 caatgccact tgtgttgagg atctcatcac ccgaccttcc gaggagggag tccgcattgc 601 cggtgttgtc accaactgga ctcttgtttc catgcaccac gatgaccagt cttgcatgga 661 ccccaacact attaacgctc ctttgatcat ctccaccacc ggccacgatg gcccgatggg 721 agctttctgt gtcaagcgcc ttgtcagcat gcagcgcatt gagaagctcg gtggtatgcg 781 tggtcttgac atgaacctcg ccgaggatgc cattgtcaag ggaacccgtg agattgttcc 841 tggtcttatt gttggtggaa tggaactttc cgaggttgac ggtgccaacc gcatgggtcc 901 taccttcggt gccatggctc tcagtggtct caaggctgcc gaggaagctc tcaagatctt 961 cgacactcgc aagaagcaga acgatctgta agcgagatca tcaaagaagg tcatagttgg 1021 gaaacggaac atgtgaattc aggatcctgg acctcgtctc tcattcgagg catggaacct 1081 tgtcattcat ttagtctgag cgactcaagt ttggaaaaca tgaacccaaa gtcggatgcc 1141 actcgaaaga gacgccaaat aaagcaatac taaattg // LOCUS STMGLUSYN 1208 bp ds-DNA BCT 28-JUN-1990 DEFINITION S.hygroscopicus glutamine synthetase (GLNB) gene, complete cds. ACCESSION M33783 KEYWORDS glutamine synthetase. SOURCE S.hygroscopicus (strain ATCC 21705) DNA. ORGANISM Streptomyces hygroscopicus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1208) AUTHORS Kumada,Y., Takano,E., Nagaoka,K. and Thompson,C.J. TITLE Streptomyces hygroscopicus has two glutamine synthetase genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.J.Thompson, 20-APR-1990. Author address: C.J.Thompson Institut Pasteur 28 Rue du Dr. Roux 75015 Paris FRANCE Tel: (33) 145688842 FEATURES from to/span description pept 22 1158 glutamine synthetase (GLNB) precursor sigp 22 84 glutamine synthetase signal peptide matp 85 1155 glutamine synthetase BASE COUNT 198 a 432 c 418 g 160 t ORIGIN 1 gaattccttg gaaacggctg gatgtccggc tcggtaacct gtggttcaca aacgggcaac 61 ggacgagaaa tggcgcgttg cgagagtgcg gaggtacgcc cgcaccgtac ccgcaccgac 121 cgtgaagagg accccgtgag catcaaggcc gagtacatct ggatcgacgg cacgcagccg 181 accgccaagc tccgctccaa gaccaagatc ctgtccgacg gcagccggct tccgcggtgg 241 ggcttcgacg gttccagcac caaccaggcc gaaggccacg cctcggacct cgtactggag 301 ccggtgttca gctgcccgga cccgatccgc ggcggcgacc acctgctggt gctgtgcgag 361 gtgctgcaca ccgacctcac cccgcacccc tccaacaccc gggcgctgct gcgcccggtc 421 gcggagcggt tcgccggcca ggagccgatc ttcggcatcg agcaggagta caccttcctc 481 aagggcgacc gcccgctcgg cttccccgag ggcggcggct acccggcccc gcaggccgac 541 tactactgcg gcgtgggcgc cgacgcgatc ttcggccggg agatcgtcga gaagcacctc 601 gacctgtgcc tggcggccgg tctgggcctg tccggcatca acgccgaggt catgcccggc 661 cagtgggagt tccaggtcgg cgcgctgccg ccgctggagg tctcggacca catgtgggtg 721 gcgcgctggc tgctgcaccg ggtggcggag gagttcggcg tcaccgcgtc gctggacgcc 781 aagccggcca agggcgactg gaacggcgcg ggcgcgcaca ccaacttctc cacccgcgcg 841 atgcgcgagg gctacgaccc gatcatcacc gcctgcgagg cgctgggcca ggacgacaag 901 ccgctggagc acgtccgcca gtacggcacc ggcatcgagg accggctgac cggcgcgcac 961 gagaccgccc cctgggacgc gtactcctac ggcgcctccg accgcggcgc ctcggtgcgc 1021 atcccctggc aggtcgaggt cgagaagaag ggctacatcg aggaccggcg cccgaacgcc 1081 aacgtcgacc cgtacgtggt cacccggctg atggtggaca cctgctgcac ggagctggcg 1141 cggcgcgagc agatctgacg cctgcggtca cgtggccgtg cggtcacgtg accgcacggt 1201 cgcgcgag // LOCUS BLLEBG 2323 bp ds-DNA BCT 28-JUN-1990 DEFINITION B.lautus endo-beta-1,4-glucanase (celB) gene, complete cds. ACCESSION M33762 KEYWORDS endo-beta-1,4-glucanase. SOURCE B.lautus (strain PL236) DNA. ORGANISM Bacillus lautus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2323) AUTHORS Joergensen,P.L. and Hansen,C.K. TITLE Multiple endo-beta-1,4-glucanases from Bacillus lautus PL236 and characterization of the celB gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.K.Hansen, 18-APR-1990. Author address: C.K.Hansen Institut Pasteur 24 rue du Dr. Roux 75724 Paris Cedex 15 FRANCE Email: DEDONDER%PASTEUR.BITNET@CUNYVM.CUNY.EDU FEATURES from to/span description pept 172 1872 endo-beta-1,4-glucanase precursor (celB) (E.C. 3.2.1.4) sigp 172 262 endo-beta-1,4-glucanase signal peptide matp 263 1869 endo-beta-1,4-glucanase signal 46 51 -35 region signal 70 75 -10 region binding 158 164 ribosome binding site signal 1956 1981 transcription terminator BASE COUNT 644 a 508 c 563 g 608 t ORIGIN 1 ggggttttgt tcatttcaag aaagagcacg aataaacatc gctaatttac acatgattta 61 ttctctaatt attatggtgc atgccaattg gaaaatgtag tagattagtc atcgtaacct 121 gctttcatgc caagatatgt aatttttaaa aaagagtaaa ggagaattca gatgaaaaaa 181 cgtagaagca gtaaagttat tctttcgttg gccatcgttg ttgcattatt ggcagccgtc 241 gaacctaatg ccgctttagc agcggctcca ccaagtgcca tgcagtccta tgttgaagcg 301 atgcagcctg gctggaacct tggcaattct ctggatgctg tcggtgcgga tgagacgctg 361 gcacggggca atccgcggat cacgaaagag ctcattcaga acatcgctgc gcaaggctat 421 aagagcatac ggattcctgt tacctgggat tcccatatcg gcgcggcccc aaattatcaa 481 attgaagctg cgtacctcaa tcgagtgcag gaggtcgtac agtgggcttt ggacgcgaac 541 ctctatgtga tgattaatgt ccatcatgat tcctggctat ggatcagcaa aatggagtcg 601 cagcacgatc aagtactggc ccgttataat gcgatttgga cgcaaattgc caacaagttc 661 aagaacagcc cgagcaagct gatgttcgag agcgtgaatg agcctcgctt tacggatggc 721 ggaactacgg atgaagccaa gcagcaaaaa atgctggacg agctgaacgt atcctttttc 781 aacatcgtca gaaattccgg cggccagaac gcgactcgcc cgctagttct ttctacgttg 841 gaggcctctc ccacccaaga gagaatgacg gcgctttata atacgatgac caaactgaac 901 gacaagaatc tgatcgcaac cgttcatttt tatggattct ggccgtttag cgtaaatatc 961 gcaggatata cgaaatttga tgcggagacg caaaatgata ttataacgac cttcgataac 1021 gtgtataaca catttgtagc aaagggaatc ccggtggtag tcggcgaata tggccttctt 1081 ggattcgata agaataccgg cgtcattgaa cagggtgaga aattgaaatt tttcgagttt 1141 tttgcccagt atgtgaagca aaaaagcatt tccactatgc tatgggataa cggacagcac 1201 ttcaaccgca cgagcttcaa gtggtctgac ccggatttat tcaatatgat caaggccagt 1261 tggaccggac gttcatccac ggcttccagc gacctgatcc atgtcaagca gggcacggcg 1321 gtaaaagata cttcggttca gctcaatctt aacgggaata cgctaacttc cctttccgta 1381 aatggaacga cactgaaatc aggcacagat tacactttaa acagcagcag attaactttt 1441 aaagcgagcc agttgaccaa gctgacctcc ttgggcaaat tgggggtcaa cgcgacgatc 1501 gtgactaaat tcaatagagg cgccgactgg aagttcaacg tagtcctgta caatacgcct 1561 aagcttagca gtacgacggg gactacttct tcctttgcga ttccaacggc tttcaacggg 1621 gatcagcttg ctacgatgga agcggtctat gtaaacggcg gcaatgccgg tccgcataac 1681 tggacttcct ttaaggaatt cgaaacgacg ttcagccccg cttatagcga ggggaaaatc 1741 aaactgcagc aggcgttctt taatgaagtg aatgatacca cagtcacgct caagttccaa 1801 ttctggagcg gggagatcgt caactacacg attaaaaaga gcggttcgac ggtgacgggt 1861 acggcttcat aagcgagttt ggcaaaaaag gaccgatata ctgcctaatt tggtattgcc 1921 ttagttgaaa gcaattgctc cgaataaaca gaatgaagcc ccggccagct ggccgggact 1981 tatgcgttta ggaagtataa acgaatcatc agcaatttat ttagctcgtc tcagttcagc 2041 aatatcggct tcatgtgaaa cggagcggat gaacaatctt tcgagcaatt tctcatgctc 2101 ctgctgggtt tggagaacgg tttgctgatt agttttaagt acagatatat cctcacggac 2161 ttgattgatt catgtggtcc gttagttctt ctacctttgt atttgtggca gcaacgatat 2221 gaattaattg ttgaatgtgc ccgccatgac tgtttagctg ctcattgtgg ctttgtaact 2281 gttctcggat ttctttgaat tcttggtcgt gctcattaag ctt // LOCUS HLFRRD1 394 bp ss-rRNA BCT 28-JUN-1990 DEFINITION Haloferax 16S ribosomal RNA, partial sequence. ACCESSION M33803 KEYWORDS 16S ribosomal RNA. SEGMENT 1 of 3 SOURCE Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA. ORGANISM Haloferax sp. Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 394) AUTHORS Holmes,M.L. and Dyall-Smith,M.L. TITLE A plasmid vector with a selectable marker for halophilic archaebacteria JOURNAL J. Bacteriol. 172, 756-761 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 394) AUTHORS Holmes,M.L. and Dyall-Smith,M.L. TITLE A plasmid vector with a selectable marker for halophilic archaebacteria JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by M.L.Dyall-Smith, 19-JUN-1990. FEATURES from to/span description rRNA < 1 > 394 16S rRNA revision 249 249 c in [1]; g in [2] BASE COUNT 102 a 101 c 126 g 65 t ORIGIN 1 gaaaagctca gtaacacgtg gccaaactac cctacagaga acgataacct cgggaaactg 61 aggctaatag ttcatacggg agtcatgctg gaatgccgac tccccgaaac gctcaggcgc 121 tgtaggatgt ggctgcggcc gattaggtag acggtggggt aacggcccac cgtgccgata 181 atcggtacgg gttgtgagag caagagcccg gagacggaat ctgagacaag attccgggcc 241 ctacggggcg cagcaggcgc gaaaccttta cactgcacgc aagtgcgata aggggacccc 301 aagtgcgagg gcatatagtc ctcgcttttc acgactgtaa ggcggtcgtg gaataagagc 361 tgggcaagac cggtgccagc cgccgcggta atac // LOCUS HLFRRD2 254 bp ss-rRNA BCT 28-JUN-1990 DEFINITION Haloferax 16S ribosomal RNA, partial sequence. ACCESSION M33804 KEYWORDS 16S ribosomal RNA. SEGMENT 2 of 3 SOURCE Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA. ORGANISM Haloferax sp. Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 254) AUTHORS Holmes,M.L. and Dyall-Smith,M.L. TITLE A plasmid vector with a selectable marker for halophilic archaebacteria JOURNAL J. Bacteriol. 172, 756-761 (1990) STANDARD full staff_review FEATURES from to/span description rRNA < 1 > 254 16S rRNA BASE COUNT 67 a 59 c 85 g 43 t ORIGIN About 140 bp after segment 1. 1 tacgtccggg gtaggagtga aatcccgtaa tcctggacgg accaccgatg gcgaaagcac 61 ctcgagaaga cggatccgac ggtgagggac gaaagctagg gtctcgaacc ggattagata 121 cccgggtagt cctagctgta aacgatgctc gctaggtgtg acacaggcta cgagcctgtg 181 ttgtgccgta gggaagccga gaagcgagcc gcctgggaag tacgtccgca aggatgaaac 241 ttaaaggaat tggc // LOCUS HLFRRD3 539 bp ss-rRNA BCT 28-JUN-1990 DEFINITION Haloferax 16S ribosomal RNA, partial sequence. ACCESSION M33805 KEYWORDS 16S ribosomal RNA. SEGMENT 3 of 3 SOURCE Haloferax sp. (sub-species PHENON K; strain Aa 2.2) ribosomal RNA. ORGANISM Haloferax sp. Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 539) AUTHORS Holmes,M.L. and Dyall-Smith,M.L. TITLE A plasmid vector with a selectable marker for halophilic archaebacteria JOURNAL J. Bacteriol. 172, 756-761 (1990) STANDARD full staff_review FEATURES from to/span description rRNA < 1 > 539 16S rRNA BASE COUNT 132 a 140 c 159 g 108 t ORIGIN About 45 bp after segment 2. 1 ctcaacgccg gacatctcac cagctccgac tacagtgatg acgatcaggt tgatgacctt 61 atcacgacgc tgtagagagg aggtgcatgg ccgccgtcag ctcgtaccgt gaggcgtcct 121 gttaagtcag gcaacgagcg agacccgcac ttctaattgc cagcagcagt ttcgactggc 181 tgggtacatt agaaggactg ccgctgctaa agcggaggaa ggaacgggca acggtaggtc 241 agtatgcccc gaatgagctg ggctacacgc gggctacaat ggtcgagaca atgggttgct 301 atctcgaaag agaacgctaa tctcctaaac tcgatcgtag ttcggattga gggctgaaac 361 tcgccctcat gaagctggat tcggtagtaa tcgcatttca atagagtgcg gtgaatacgt 421 ccctgctcct tgcacacacc gcccgtcaaa gcacccgagt gaggtccgga tgaggccacc 481 acacggtggt cgaatctggc ttcgcaaggg ggcttaagtc gtaacaaggt agccgtagg // LOCUS ZMOADHA 1365 bp ds-DNA BCT 28-JUN-1990 DEFINITION Z.mobilis alcohol dehydrogenase I (adhA) gene, complete cds. ACCESSION M32100 KEYWORDS alcohol dehydrogenase I. SOURCE Z.mobilis (strain CP4) DNA. ORGANISM Zymomonas mobilis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods. REFERENCE 1 (bases 1 to 1365) AUTHORS Keshav,K.F., Yomano,L.P., An,H. and Ingram,L.O. TITLE Cloning of the Zymomonas mobilis structural gene encoding alcohol dehydrogenase I (adhA): Sequence comparison and expression in Escherichia coli JOURNAL J. Bacteriol. 172, 2491-2497 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K. Keshav, 14-FEB-1990, for release after publication. FEATURES from to/span description pept 218 1231 alcohol dehydrogenase I (adhA) (EC 1.1.1.1) mRNA 108 > 1365 adhA mRNA (alt.) mRNA 137 > 1365 adhA mRNA (alt.) site 1261 1289 adhA transcription termination signal BASE COUNT 370 a 288 c 340 g 367 t ORIGIN 1 tttttgggcg tatttttatc aagaaaattt aaataatcac atttttatta ttttagattt 61 aagtattgat acaagtgata tctataaatg tttttataac tttctggatc gtaatcggct 121 ggcaatcgtt ttccctatat tcgcaagatg tatgtcagcc gcagattttt cgactgacct 181 ctatctctcc gagatatatc aacaaaaggt agtcaccatg aaagcagccg tcataactaa 241 agatcatacg atcgaagtga aagacaccaa attacgccct ctgaaatacg gggaagcgct 301 tttggaaatg gaatattgcg gggtatgtca taccgatctc catgtgaaaa acggggattt 361 cggcgatgaa accggcagaa ttaccggcca tgaaggtatc ggtatcgtca agcaggtcgg 421 ggaaggggtt acttctctga aagccggtga ccgcgccagt gttgcatggt tcttcaaagg 481 ctgcggccat tgcgaatatt gtgtcagcgg gaatgaaacg ctttgccgca acgttgaaaa 541 tgccggttat acggttgacg gcgctatggc agaagaatgc atcgtcgttg ccgattactc 601 ggtcaaggtg ccagatggtc ttgatcctgc ggttgccagc agcatcactt gcgcgggtgt 661 aaccacctat aaagcagtca aagtttctca gatacagccg ggacaatggc tggccatcta 721 tggcttgggc ggtttaggca atctagccct tcaatatgcc aagaatgttt tcaacgccaa 781 agtgatcgcg atcgatgtca atgatgaaca gctcgctttt gccaaagagt tgggcgcaga 841 tatggtcatc aatccgaaaa acgaagatgc tgccaaaatc attcaggaaa aagtcggcgg 901 cgcacatgcg acggtggtga cggctgttgc caaatccgcc tttaactcgg ctgttgaggc 961 tatccgcgcg ggtggccgtg ttgtcgccgt tggtctgcct cctgaaaaaa tggatttgag 1021 cattcctcgt ttggtgcttg acggtatcga agtcctaggt tccttggtcg gaacgcggga 1081 agatttgaaa gaggccttcc agtttgcagc cgaaggtaag gtcaagccga aagttaccaa 1141 gcgtaaagtc gaagaaatca accaaatctt tgacgaaatg gaacatggta aattcacagg 1201 ccgtatggtt gttgatttta cccatcacta ggttttcgtg aaggcggaag cataaacgga 1261 aaaagccttt ctcttaccag aaaggctttt tctttgtcgt ctgataaaaa ttttcataca 1321 gaattttaac acggcactcg ttgctataag ccgctatcca agctt // LOCUS MUSIGKABG 349 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V32 region, 5' end. ACCESSION M33992 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BXSB) adult spleen, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 349) AUTHORS D'Hoostelaere,L.A. and Klinman,D. TITLE Characterization of new mouse Vk groups JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.D'Hoostelaere, 04-MAY-1990. The frame shift between 178 and 184 could occur anywhere within those boundaries. Author address: L.A.D'Hoostelaere Grenzacherstrasse 487 Postfach 4005 Basel, Switzerland E-mail: IN%BASEL@UTOROCI.bitnet FEATURES from to/span description pept 1 178 Ig kappa-chain V32 region precursor 180 > 349 Ig kappa-chain V32 region precursor sigp 1 60 Ig kappa-chain V-region signal peptide matp 61 > 349 Ig kappa-chain V-region site 178 184 frame shift BASE COUNT 99 a 84 c 76 g 90 t ORIGIN 1 atgagggtcc ttgctgagct cctggggctg ctgctgttct gctttttagg tgtgagatgt 61 gacatccaga tgaaccagtc tccatccagt ctgtctgcat cccttggaga cacaataacc 121 atcacttgcc atgccagtca gaaaattaat gtttggttaa gctggtacca gcagaaaaaa 181 aaaaggaaat attcctaaac tattgatcta taggacttcc aacttgcaca caggcgtccc 241 atcaaggttc agtggcagtg gatcaggaac aggtttcaca ttaaccatca gcagcctgca 301 gcctgaagac attgccactt actactgtca acagggtcaa aattatccg // LOCUS MUSIGKABH 349 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V33 region, 5' end. ACCESSION M33993 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BXSB) adult spleen, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 349) AUTHORS D'Hoostelaere,L.A. and Klinman,D. TITLE Characterization of new mouse Vk groups JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.D'Hoostelaere 04-MAY-1990. The frame shift between 178 and 184 could occur anywhere within those boundaries. Author address: L.A.D'Hoostelaere Grenzacherstrasse 487 Postfach 4005 Basel, Switzerland E-mail: IN%BASEL@UTOROCI.bitnet FEATURES from to/span description pept 1 178 Ig kappa-chain V33-region precursor 180 > 348 Ig kappa-chain V33-region precursor sigp 1 60 Ig kappa-chain V-region signal peptide matp 61 > 348 Ig kappa-chain V-region BASE COUNT 99 a 84 c 76 g 90 t ORIGIN 1 atgagggtcc ttgctgagct cctggggctg ctgctgttct gctttttagg tgtgagatgt 61 gacatccaga tgaaccagtc tccatccagt ctgtctgcat cccttggaga cacaataacc 121 atcacttgcc atgccagtca gaaaattaat gtttggttaa gctggtacca gcagaaaaaa 181 aaaaggaaat attcctaaac tattgatcta taggacttcc aacttgcaca caggcgtccc 241 atcaaggttc agtggcagtg gatcaggaac aggtttcaca ttaaccatca gcagcctgca 301 gcctgaagac attgccactt actactgtca acagggtcaa aattatccg // LOCUS HUMCAIX 2785 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human carbonic anhydrase I (CAI) mRNA, complete cds. ACCESSION M33987 KEYWORDS carbonic anhydrase I. SOURCE Human EBV transformed SH B cell line DNA, and cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2785) AUTHORS Lowe,N., Brady,H.J.M., Barlow,J.H., Sowden,J.C., Edwards,M. and Butterworth,P.H.W. TITLE Structure and methylation patterns of the gene encoding human carbonic anhydrase I JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Since no intron sequences were provided this entry is treated as if originating from an mRNA. Draft entry and computer-readable sequence for [1] kindly submitted by N.Lowe, 09-MAY-1990. Author address: N.Lowe Department of Biochemistry University College London Gower Street, London WC1E 6BT, U.K. E-mail:UCBCMAR%EUCLID.UCL.AC.UK@CUNYVM.CUNY.EDU FEATURES from to/span description pept 1048 1833 carbonic anhydrase I (EC 4.2.1.1) /hgml_locus_uid="LS0141U" /nomgen="CA1" /map="8q13-q22" mRNA 902 1939 carbonic anhydrase I mRNA (alt.) mRNA 902 2165 carbonic anhydrase I mRNA (alt.) signal 874 879 TATA box BASE COUNT 840 a 603 c 492 g 850 t ORIGIN 1 ctttagccca acagtcaaaa ataattgatg ctaccctaca aatgtccaaa actctagtat 61 atcatatttc taagttacag caaatattag tcctgctaaa ccagggagct ttggcaaaaa 121 tgttttttga cagtaaattt gtccttgatt atatattaac tagtcaaaga ggtgtttgta 181 acattattag agcttcttgt tgtaggtggg ttaacaccac caatcaagag gtcattctaa 241 cagaaagcct ggatcagaaa accatcaccc taaaaaaaca tgccttacat atttaacaca 301 ctctgaaatc cagtcaaaat atgactaaag gcccttgcca tgactgatgt attctcctgg 361 ccaacgccaa acaaatggga gcctggttac gagtcagcct tcagggactt gtcacatttc 421 tacttggttt cttccttgtt attgtcataa taaaatgttt tctatgctgt ttagtgcaac 481 ttaggcccta ttctgtagaa gtctcctcta ctattcaggc cactcaaaca ccccaaataa 541 ttgagttcaa aatcgacatc aagatataaa ggaatcagtg actaaatata tttcatatat 601 ggtattttta ttgattattg tgctgtcttg acctagtatg gaggccttgg ctagaggctg 661 gtcagtttcc tctcttgagc agctgattaa atccacaccc caaccacttc ccttatcagg 721 ttctcacact ctggggccac tatgtaccca ctctaatcac cacagggcca gacatcagac 781 aattaaggac agcgcccatg ccccaaagcc cgccaaaatt atgcaaatta ttcaaaatta 841 ttcaacctag ctaaccccac cctttttgct gtacataagc tgcccattcc ccctccagcc 901 tgtggtaccc agtcctcagg tgcaaccccc tgcgtggtcc tctgtggcag ccttctctca 961 ttcagagctg ttttccacag aggtagtgaa aagaactgga ttttcaagtt cactttgcaa 1021 gagaaaaaga aaactcagta gaagataatg gcaagtccag actggggata tgatgacaaa 1081 aatggtcctg aacaatggag caagctgtat cccattgcca atggaaataa ccaatcccct 1141 gttgatatta aaaccagtga aaccaaacat gacacctctc tgaaacctat tagtgtctcc 1201 tacaacccag ccacagccaa agaaattatc aatgtggggc attctttcca tgtaaatttt 1261 gaggacaacg ataaccgatc agtgctgaaa ggtggtcctt tctctgacag ctacaggctc 1321 tttcagtttc attttcactg gggcagtaca aatgagcatg gttcagaaca tacagtggat 1381 ggagtcaaat attctgccga gcttcacgta gctcactgga attctgcaaa gtactccagc 1441 cttgctgaag ctgcctcaaa ggctgatggt ttggcagtta ttggtgtttt gatgaaggtt 1501 ggtgaggcca acccaaagct gcagaaagta cttgatgccc tccaagcaat taaaaccaag 1561 ggcaaacgag ccccattcac aaattttgac ccctctactc tccttccttc atccctggat 1621 ttctggacct accctggctc tctgactcat cctcctcttt atgagagtgt aacttggatc 1681 atctgtaagg agagcatcag tgtcagctca gagcagctgg cacaattccg cagccttcta 1741 tcaaatgttg aaggtgataa cgctgtcccc atgcagcaca acaaccgccc aacccaacct 1801 ctgaagggca gaacagtgag agcttcattt tgatgattct gagaagaaac ttgtccttcc 1861 tcaagaacac agccctgctt ctgacataat ccagttaaaa taataatttt taagaaataa 1921 atttatttca atattagcaa gacagcatgc cttcaaatca atctgtaaaa ctaagaaact 1981 taaattttag ttcttactgc ttaattcaaa taataattag taagctagca aatagtaatc 2041 tgtaagcata agcttatctt aaattcaagt ttagtttgag gaattcttta aaattacaac 2101 taagtgattt gtatgtctat ttttttcagt ttatttgaac caataaaata attttatctc 2161 tttctttctg ttgtgcattc agtttctaaa accattaagt ttctactcca tttacattca 2221 aaaatcttaa atactttact tgcaagagta ttttgcttca aatacaacaa cctaagagca 2281 gctggagatg aaatattggg aaattcattt gcttactcct gaagacaaaa atatagctga 2341 gatgaccact ggatttaata tcgttatgct ggcccaacat tgctaccatt tgtgttgtct 2401 gtgatcaaaa tgattatctt ttatatagga agatgacgct tctggatatt gctttcactt 2461 cttctcccca cgttagcaag gacaatgctt ctctgccatt attacaacta gttagtttgc 2521 atggagaatc tttactttaa aattggaaga aaagtcacaa gtgaatggtt tataaaaatg 2581 ctaaagaagt cattcttgct tagaatcata tagaaacatc atgcaatctt ttagtcagat 2641 gtgcgcttca ccttatgcta tttttatctt taattgacac acaataattg tacatgttta 2701 tggagtatag tgtggtgttt tctgtttgtt tgtttgtttt ttgagacaag gtctcactct 2761 gccagtcagg gtggagtgcg atggt // LOCUS HUMPDHBZ 1408 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human pyruvate dehydrogenase E1-beta subunit mRNA, 3' end. ACCESSION M34056 J05549 KEYWORDS pyruvate dehydrogenase E1-beta subunit. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1408) AUTHORS Huh,T.L., Casazza,J.P., Huh,J.W., Chi,Y.T. and Song,B.J. TITLE Characterization of two cDNA clones for pyruvate dehydrogenase E1~subunit and its differential regulation in cultured fibroblasts JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.J.Song, 07-MAY-1990. Author address: B.J.Song Laboratory of Metabolism and Molecular Biology National Institute on Alcohol Abuse and Alcoholism 12501 Washington Ave. Rockville, MD 20852 E-mail: landsman@thebard.nlm.nih.gov FEATURES from to/span description pept < 1 1008 pyruvate dehydrogenase E1-beta subunit precursor /hgml_locus_uid="LU0223C" /nomgen="PDHB" /map="unassigned" (AA at 1) sigp 1 18 pyruvate dehydrogenase E1-beta subunit signal peptide matp 19 1005 pyruvate dehydrogenase E1-beta subunit signal 1397 1402 poly-A signal BASE COUNT 404 a 268 c 339 g 397 t ORIGIN Chromosome 3p13-3q23. 1 tggaccgcgc cggctgccgt gcaggtgaca gttcgtgatg ctataaatca gggtatggat 61 gaggagctgg aaagagatga gaaggtattt ctgcttggag aagaagttgc ccagtatgat 121 ggggcataca aggttagtcg agggctgtgg aagaaatatg gagacaagag gattattgac 181 actcccatat cagagatggg ctttgctgga attgctgtag gtgcagctat ggctgggttg 241 cggcccattt gtgaatttat gaccttcaat ttctccatgc aagccattga ccaggttata 301 aactcagctg ccaagaccta ctacatgtct ggtggccttc agcctgtgcc tatagtcttc 361 aggggaccca atggtgcctc agcaggtgta gctgcccagc actcacagtg ctttgctgcc 421 tggtatgggc actgcccagg cttaaaggtg gtcagtccct ggaattcaga ggatgctaaa 481 ggacttatta aatcagccat tcgggataac aatccagtgg tggtgctaga gaatgaattg 541 atgtatgggg ttccttttga atttcctccg gaagctcagt caaaagattt tctgattcct 601 attggaaaag ccaaaataga aaggcaagga acacatataa ctgtggtttc ccattcaaga 661 cctgtgggcc actgcttaga agctgcagca gtgctatcta aagaaggagt tgaatgtgag 721 gtgataaata tgcgtaccat tagaccaatg gacatggaaa ccatagaagc cagtgtcatg 781 aagacaaatc atcttgtaac tgtggaagga ggctggccac agtttggagt aggagctgaa 841 atctgtgcca ggatcatgga aggtcctgcg ttcaatttcc tggatgctcc tgctgttcgt 901 gtcactggtg ctgatgtccc tatgccttat gcaaagattc tagaggacaa ctctatacct 961 caggtcaaag acatcatatt tgcaataaag aaaacattaa atatttagtt tggacttgaa 1021 tatcaagtcg ttgaaattta tttgaaatac ttgctggcac tgcacctgga tttgtactgc 1081 aagacctgac tattcataaa ggaaaacgat ttctaaagca acagcaggta tttttgtaca 1141 gggaagttta aatgtgtttg tgtatggaaa actctccact ctcctcccct agatgccatg 1201 cttccttttg tctgttacgg ttgccatgtt ctttgaataa caaattatat cacattttat 1261 cctctctcac cacaaggaca aagtatggat gtggcagagt cctgatgaaa gatgtatcca 1321 aacaagataa cttatatgta taaaattaaa gcatataata cacatttact gttagtttgt 1381 tttgataagg aataaaggaa tttctaac // LOCUS HUMPYRUV 1095 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human pyruvate dehydrogenase E1-beta subunit mRNA, complete cds. ACCESSION M34055 J05549 KEYWORDS pyruvate dehydrogenase E1-beta subunit. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1095) AUTHORS Huh,T.L., Casazza,J.P., Huh,J.W., Chi,Y.T. and Song,B.J. TITLE Characterization of two cDNA clones for pyruvate dehydrogenase E1~subunit and its differential regulation in cultured fibroblasts JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.J.Song, 07-MAY-1990. Author address: B.J.Song Laboratory of Metabolism and Molecular Biology National Institute on Alcohol Abuse and Alcoholism 12501 Washington Ave. Rockville, MD 20852 E-mail: landsman@thebard.nlm.nih.gov FEATURES from to/span description pept 6 1085 pyruvate dehydrogenase E1-beta subunit precursor /hgml_locus_uid="LU0223C" /nomgen="PDHB" /map="unassigned" sigp 6 95 pyruvate dehydrogenase E1-beta subunit signal peptide matp 96 1082 pyruvate dehydrogenase E1-beta subunit mRNA < 1 1095 pyruvate dehydrogenase E1-beta subunit mRNA signal 1061 1066 poly-A signal BASE COUNT 288 a 221 c 302 g 284 t ORIGIN Chromosome 3p13-3q23. 1 ccaagatggc ggcggtgtct ggcttggtgc ggagacccct tcgggaggtc tccgggctgc 61 tgaagaggcg ctttcactgg accgcgccgg ctgccgtgca ggtgacagtt cgtgatgcta 121 taaatcaggg tatggatgag gagctggaaa gagatgagaa ggtatttctg cttggagaag 181 aagttgccca gtatgatggg gcatacaagg ttagtcgagg gctgtggaag aaatatggag 241 acaagaggat tattgacact cccatatcag agatgggctt tgctggaatt gctgtaggtg 301 cagctatggc tgggttgcgg cccatttgtg aatttatgac cttcaatttc tccatgcaag 361 ccattgacca ggttataaac tcagctgcca agacctacta catgtctggt ggccttcagc 421 ctgtgcctat agtcttcagg ggacccaatg gtgcctcagc aggtgtagct gcccagcact 481 cacagtgctt tgctgcctgg tatgggcact gcccaggctt aaaggtggtc agtccctgga 541 attcagagga tgctaaagga cttattaaat cagccattcg ggataacaat ccagtggtgg 601 tgctagagaa tgaattgatg tatggggttc cttttgaatt tcctccggaa gctcagtcaa 661 aagattttct gattcctatt ggaaaagcca aaatagaaag gcaaggaaca catataactg 721 tggtttccca ttcaagacct gtgggccact gcttagaagc tgcagcagtg ctatctaaag 781 aaggagttga atgtgaggtg ataaatatgc gtaccattag accaatggac atggaaacca 841 tagaagccag tgtcatgaag acaaatcatc ttgtaactgt ggaaggaggc tggccacagt 901 ttggagtagg agctgaaatc tgtgccagga tcatggaagg tcctgcgttc aatttcctgg 961 atgctcctgc tgttcgtgtc actggtgctg atgtccctat gccttatgca aagattctag 1021 aggacaactc tatacctcag gtcaaagaca tcatatttgc aataaagaaa acattaaata 1081 tttagtttgg acttg // LOCUS ENTOMPX 664 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.cloacae outer membrane protein X (OmpX) gene, complete cds. ACCESSION M33878 KEYWORDS outer membrane protein X. SOURCE E.cloacae (strain 2249-1) DNA. ORGANISM Enterobacter cloacae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 664) AUTHORS Stoorvogel,J., van Bussel,M.J.A.W.M., Tommassen,J. and van de Klundert,J.A.M. TITLE Molecular characterization of an Enterobacter cloacae outer membrane protein OmpX JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.Stoorvogel, 26-APR-1990. Author address: J.Stoorvogel Dept of Medical Microbiol, Univ Hospital, P.O.Box 9600, 2300 RC Leiden, THE NETHERLANDS FEATURES from to/span description pept 83 601 outer membrane protein X precursor sigp 83 151 outer membrane protein X signal peptide matp 152 598 outer membrane protein X signal 18 25 -35 region signal 43 48 -10 region binding 74 77 ribosome binding site signal 620 649 transcription termination signal BASE COUNT 171 a 156 c 162 g 175 t ORIGIN 1 atcgtgaatg acaaagagtt tcacagattt ttttgatata tttaaaactt acggacttac 61 ttgaagcaca tttgaggtgg ttatgaaaaa aattgcatgt ctttcagcac tggcagctgt 121 tctggctgtt tccgcaggta ccgctgtagc ggcaacttct actgtaactg gtggttacgc 181 tcagagcgat atgcagggcg tgatgaacaa aaccaacggt ttcaacctga agtaccgtta 241 cgagcaagac aacaacccgc tgggtgtgat cggttctttc acttacaccg agaaagatcg 301 tactgaaaat ggctcttaca ataaaggtca gtactacggc atcaccgcgg gtcctgctta 361 ccgcctgaat gactgggcaa gcatctacgg tgttgtaggt gttggctacg gtaaattcca 421 gcagaccgaa aaccagggtc tgaaccgtac tgcaagcaac agcgactacg gtttctccta 481 tggcgcaggt atgcagttca acccaatcga aaacgttgct ctggacttct cctatgagca 541 gagccgtatc cgcaacgttg acgttggcac ctggatcgcg ggcgtaggtt accgcttcta 601 atcacttcgg tgagccagta aaaaatccgc ccaatgtggc ggattttttt tgtttgtacc 661 ctct // LOCUS ACARGA 1212 bp ds-DNA INV 28-JUN-1990 DEFINITION A.castellanii 5S RNA. ACCESSION M34003 KEYWORDS 5S RNA. SOURCE A.castellanii (strain Neff) trophozoite DNA, clone pAc5S.1. ORGANISM Acanthamoeba castellanii Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina; Acanthamoebidae. REFERENCE 1 (bases 1 to 1212) AUTHORS Zwick,M.G., Wiggs,M. and Paule,M.R. TITLE Cloning, sequencing and genomic organization of a 5S RNA gene from the eukaryotic protist Acanthamoeba castellanii JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.R.Paule, 04-MAY-1990. Author address: M.R.Paule Colorado State University, Dept. Biochemistry, Fort Collins, CO 80523 FEATURES from to/span description RNA 489 610 5S RNA precursor RNA 489 607 5S RNA BASE COUNT 195 a 374 c 329 g 314 t ORIGIN 1 ctgcaggaga ttcgtcgggc tcgtgccgcg atcgtcttga tgatcgagac cgacggcccg 61 ttcctcactc cgcgcacctt gcgccttgcc cgcgccgcaa cgagccgtct tacctcaccc 121 acgtcctaca gagtacttac atcctccaat gtgttgtgtg cgtgtcatgt gcccaatgta 181 tgtggtgtgt gcggtgtgtg cggtgtgtga ggtgtgtggt gtgcgcgtgt catgtctgtt 241 acacgttctg ttgaccaaca ttttttatgg tgtgtacagc ggtcgccgac tgcaccggac 301 gatcggtaga ggaggtggcc ggtgaaacca cgcagaccgc ggtgcgcttc ttcggtctgg 361 accaaatccg taaggagtga atgtagcaca cccctcccct ccatgtcatt gaagcccaag 421 ttagtgtgca atttaaatga tggtgtgacc aagagggatc acatgggcga agtccctggt 481 aggctgatgg atacggccat actgcgcaga aagcaccgct tcccatccga acagcgaagt 541 taagctgcgc caggcggtgt tagtactggg gtgggcgacc acccgggaat ccaccgtgcc 601 gtatcctcct ttttgcttct tcactctcgc cactattgct tcaggtcgcc tcatcgcctg 661 ccacagctga ctacaaactg ggtgagccaa acctgccttt tgcactggtc cagtgtacaa 721 ggctggaaca actttgtttc acacttgctg aggtcaatca atccagcctt gttggttgag 781 gccacaggag ggatctgcca cctttcacac catctcgtcg agaaggctgc ttgacgcttc 841 ttcgctgtcc ttcggggctg atgtctcgac tgtgtgtccg tcttcatcgc gattgtcact 901 tgcgttgttg ttgttgcggc tttcgtccct gtcctcatcg ccgtcgtcgt cttcatcgct 961 gttgtcgcgc gactcactcc gcaaaccgtc tggccgtgtg ctgctcctcc tcttcatcgt 1021 tgctgtcatc tcggtcgctg cggccacgcc tcgaagaaga cagcccgccc tcttcggccg 1081 cctcggtccc gcccctcgca aagcggatct cgcctcgtcg gctgccgctg gccgtcgctc 1141 cgcctggcgt cgttgtcttc cctcgcatcg tcctcaaact tgtcgtgcgt ccacttgtcg 1201 gcaaatgtcg ac // LOCUS HUMCDR34 2412 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene, complete cds. ACCESSION M31423 M34243 KEYWORDS cerebellar-degeneration-related antigen. SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2412) AUTHORS Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A., Chaganti,R.S.K., Posner,J.B. and Old,L.J. TITLE Cerebellar degeneration-related antigen: A highly conserved neuroectodermal marker mapped to chromosomes X in human and mouse JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.-T.Chen, 17-JAN-1990 FEATURES from to/span description pept 503 1174 cerebellar-degeneration-related antigen (CDR34) BASE COUNT 743 a 334 c 669 g 666 t ORIGIN 1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct 61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat 121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg 181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga 241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg 301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag 361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg 421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga 481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt 541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt 601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga 661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat 721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt 781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga 841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct 901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt 961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga 1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt 1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat 1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg 1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa 1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac 1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa 1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt 1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact 1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat 1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct 1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga 1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct 1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac 1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg 1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata 1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa 1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc 2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt 2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta 2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga 2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta 2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat 2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt 2401 gagatgctgc ag // LOCUS RATHGF 2485 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat hepatocyte growth factor mRNA, complete cds. ACCESSION M32987 KEYWORDS hepatocyte growth factor. SOURCE Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2485) AUTHORS Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M., Shimizu,S. and Nakamura,T. TITLE Deduced primary structure of rat hepatocyte growth factor and expression of the mRNA in rat tissues JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Nakamura, 26-MAR-1990. FEATURES from to/span description pept 143 2329 hepatocyte growth factor BASE COUNT 750 a 533 c 577 g 625 t ORIGIN 1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc 61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga 121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct 181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca 241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa 301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa 361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc 421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt 481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa 541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg 601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct 661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag 721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat 781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg 841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga 901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta 961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc 1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg 1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca 1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa 1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat 1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta 1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg 1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc 1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg 1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga 1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca 1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa 1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc 1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat 1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct 1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct 1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac 1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg 2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt 2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga 2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt 2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc 2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa 2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa 2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca 2461 ttaatatatg tggcgttttc tgttg // LOCUS DDIDPYK1A 1090 bp ss-mRNA INV 28-JUN-1990 DEFINITION D.discoideum protein-tyrosine kinase-1 (DPYK1) mRNA, complete cds. ACCESSION M33785 KEYWORDS protein-tyrosine kinase-1. SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1090) AUTHORS Tan,J.L. and Spudich,J.A. TITLE Developmentally regulated protein-tyrosine kinase genes in Dictyostelium JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Tan, 20-APR-1990. Author address: J.L.Tan Stanford University School of Medicine Dept. of Cell Biology Stanford, CA 94303 (415) 723-7408 FEATURES from to/span description pept < 1 1014 protein-tyrosine kinase-1 (DPYK1; AA at 1) BASE COUNT 371 a 198 c 186 g 335 t ORIGIN 1 cgcccatttg gtggttggga aactcaatca tcattatcac atccaccatc acgtccacca 61 ccacctccac caccaccacc acaactacca gttagatcag aatacgagat tgatttcaat 121 gaattagaat ttggtcaaac cattggtaaa ggtttctttg gtgaagtaaa gagaggttat 181 tggagagaga ctgatgttgc cataaaaatc atctatcgtg atcaattcaa aaccaaatca 241 tcattggtta tgtttcaaaa tgaagttgga atactaagta aattaagaca tccaaatgta 301 gttcaatttt tgggtgcatg tactgcagga ggtgaagatc atcattgtat agtaacagaa 361 tggatgggtg gaggtagttt aagacagttc ttgactgatc atttcaattt actcgaacaa 421 aatccacata ttcgtttgaa gttggctttg gatattgcaa aaggaatgaa ttatctacat 481 ggttggactc cacccattct tcatcgtgac ttatcctcaa gaaacatttt attggatcac 541 aacatcgatc caaagaatcc gttagtttcc tcaagacaag atattaaatg taagatctct 601 gattttggtc taagtagatt aaagaaggaa caagcctctc aaatgactca atcggttggt 661 tgtattccct acatggcacc agaggttttc aaaggcgata gtaatagtga aaagagtgat 721 gtttactcct atggcatggt tttgtttgaa ctattaacct ctgatgaacc tcaacaagat 781 atgaaaccaa tgaaaatggc tcacttggct gcttatgaat cttatcgtcc tccaattcca 841 ttaactacct cttccaagtg gaaagaaatt ctaactcaat gttgggattc taatcctgat 901 agtcgtccaa cctttaaaca aatcattgtt catctcaaag aaatggaaga tcaaggtgta 961 tcttcttttg catctgtacc tgttcaaact attgatactg gtgtttatgc ttaatttttt 1021 ttttataatt aaaaaaaaaa aaaacaaaac aaaaaaaaaa aataataata aatataatca 1081 cttcaactcg // LOCUS DDIDPYK2A 1292 bp ss-mRNA INV 28-JUN-1990 DEFINITION D.discoideum protein-tyrosine kinase-2 (DPYK2) mRNA, complete cds. ACCESSION M33784 KEYWORDS protein-tyrosine kinase-2. SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1292) AUTHORS Tan,J.L. and Spudich,J.A. TITLE Developmentally regulated protein-tyrosine kinase genes in Dictyostelium JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Tan, 20-APR-1990. Author address: J.L.Tan Stanford University School of Medicine Dept. of Cell Biology Stanford, CA 94303 (415) 723-7408 FEATURES from to/span description pept < 1 1234 protein-tyrosine kinase-2 (DPYK2; AA at 2) BASE COUNT 491 a 207 c 204 g 390 t ORIGIN 1 cgattctaca agtacaacaa actctactaa agatatcaca tttttagttt gtgataatcc 61 tgattcaact aaagaaaaga gtaacgtttc aaatacttca tcaataattt ccgcttcaaa 121 tttaaataga catataacac caaattctca tatgagacct agaggtagat caatttctga 181 atctttaatt atgtcaccaa ttaataaaga atctttaaat gatattcaaa gagcaattga 241 aagtgaaaaa ataaagaaaa ctaaatttga agaattaaaa tcaatattgg gcgaaagaga 301 atatataatt gatataaatg atattcaatt tatacaaaaa gttggagaag gtgcattcag 361 tgaagtttgg gaaggttggt ggaaaggtat tcatgttgcc ataaaaaagt taaagattat 421 aggagatgaa gaacaattca aagagagatt cattagagag gttcaaaatt tgaaaaaagg 481 aaatcatcaa aacattgtca tgtttattgg tgcatgttat aaaccagcat gtatcataac 541 agagtatatg gcaggtggta gtctttacaa tatacttcat aatccaaata gttccactcc 601 aaaagttaaa tattctttcc cattggtttt gaaaatggca accgacatgg cattgggctt 661 attacatctt cattccatca ccattgtgca tcgtgattta accagtcaaa acattctatt 721 ggatgaattg ggtaatataa agatctctga ttttggttta tctgctgaaa agagtagaga 781 aggttcaatg acaatgacaa atggtggcat ttgcaatcca agatggagac cacccgaatt 841 gacaaagaat ttaggtcact actcggaaaa ggttgatgtc tattgtttct ctctagtagt 901 ttgggaaatt ttaactggcg aaattccttt ctctgattta gatggatctc aacgatccgc 961 tcaagtagct tatgctggtt taagaccacc aataccagag tattgcgatc ctgaattaaa 1021 attactctta actcaatgtt gggaggctga tccaaatgat agacctccct ttacctatat 1081 agtaaacaaa ttaaaagaaa tctcttggaa taatccaatt ggtttcgtct ctgatcaatt 1141 ctatcaatat agcgaacctt caactccaag attagcatta tcaaatcaat cttcaaattc 1201 aagtagtatt tctttatcac caactaaatt ataaaaaaaa aaaaaaaaaa aaacaaattt 1261 caaacaccaa acaccaccac tcatcaaaat cg // LOCUS MUSH2A1X 929 bp ds-DNA ROD 28-JUN-1990 DEFINITION Mouse histone H2A.1 gene, complete cds. ACCESSION M33988 KEYWORDS histone H2A.1. SOURCE Mouse (BALB/c) DNA, clone 53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 929) AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and Sch|mperli,D. TITLE Structure of a mouse histone gene cluster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Sch|mperli, 02-MAY-1990. Author address: Dr.Schmperli Zoologisches Institut, Baltzerstrasse 4 CH-3012 Bern SWITZERLAND FEATURES from to/span description pept 164 556 histone H2A.1 mRNA 119 624 histone H2A.1 mRNA BASE COUNT 238 a 247 c 222 g 222 t ORIGIN 1 gagctcaaat tctggctttc tattgggtac gatatattaa ccaatgggag aaacacaaac 61 agaatacctc cagttagtat aaatgcttgc tgttcagttg cagaatttac tatatattct 121 tttcctttct ctgctttgcc tttactgata cttaaacgca tacatgtctg gacgcggaaa 181 gcaagggtgc aaggcccgcg ctaaggccaa gacccgctcc tcccgggccg gcctgcagtt 241 ccccgtgggc cgcgtgcacc ggctgctccg caagggcaac tactcggagc gcgtgggcgc 301 cggcgccccg gtgtacctgg cggctgtgct ggagtacctg acggccgaga tcctggagct 361 ggcgggcaat gcggcccgcg acaacaagaa gacgcgcatc atcccgcgcc acctgcagct 421 ggccatccgc aacgacgagg agctcaacaa gctgctgggc cgcgtgacca tcgcgcaggg 481 cggcgtcctg cccaacatcc aggccgtgct gctgcccaag aagaccgaga gccaccacaa 541 ggccaagggg aagtaatctg gcgattgtct gtactgccca gttgaaagtt aaccaaaaca 601 aaggctcttt tcagagccac ccacatcttt ccataaaatg agctgccacc tcgtgaaacg 661 ttcttccact acagttttta tactacatat gaaaaagtta cgaagtagct ttcaatctta 721 gtaaattgat tttaatactg ttagtccctg cgataaatct tacgaccttc cttagtttga 781 gtcaaaagtg tgtaagagat gaaaccttta gaacatacta taaattttta gtagaaattt 841 ggcacccagg tttgtcattc acgtcacgat tgtctagagc ataatggtag taagggctaa 901 gggccattaa atcccacttc catagtttc // LOCUS MUSH2A1XX 693 bp ds-DNA ROD 28-JUN-1990 DEFINITION Mouse histone H3.2 gene, complete cds. ACCESSION M33989 KEYWORDS histone H3.2. SOURCE Mouse (strain BALB/c) DNA, clone 53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 693) AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and Sch|mperli,D. TITLE Structure of a mouse histone gene cluster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Sch|mperil, 02-MAY-1990. Author address: D.Schlmperil Zoologisches Institut, Baltzerstrasse 4 CH-3012 Bern SWITZERLAND FEATURES from to/span description pept 151 561 histone H3.2 mRNA 116 611 histone H3.2 mRNA BASE COUNT 159 a 206 c 184 g 144 t ORIGIN 1 caggatttag aagcagaggc tgaccaatcc caacaaagcg cgggcccttt gaatgttctt 61 cggtccaata gcggatagtc tgattgtata aaaggtggac agcgccttgc agctcactat 121 agtgtcagtc tattttccct tgttaaagtc atggctcgta ctaagcagac cgctcgtaag 181 tccaccggtg gcaaggcccc gcgtaagcag ctggccacca aggccgcccg caagagcgcc 241 ccggccaccg gcggcgtgaa gaagcctcac cgctaccgtc ccggcaccgt ggcgctgcgc 301 gagatccggc gctaccagaa gtcgaccgag ctgctgatcc gcaagctgcc gttccagcgc 361 ctggtgcgcg agatcgcgca ggacttcaag accgacctgc gcttccagag ctcggctgtc 421 atggctctgc aggaggcgag cgaggcctac cttgtgggtc tgtttgagga caccaacctg 481 tgcgccatcc acgccaagcg tgtcaccatc atgcccaagg acatccagct cgcccgccgc 541 attcgtgggg agagggctta aacgaaacaa agtgcaaact ggaacccaaa ggctcttttc 601 agagccaccc acagtttctg taaaagcagc tgttactttt ctactgtctt ggaaactttg 661 tttttcaaag atttatctag ttacaaagta tac // LOCUS NEUPSAS 3681 bp ds-DNA PLN 28-JUN-1990 DEFINITION N.crassa arginine-specific carbamoyl phosphate synthetase (ARG-2) gene, complete cds. ACCESSION J05512 KEYWORDS carbamoyl phosphate synthetase. SOURCE N.crassa (strain 74-OR23-IVA)asexual phase DNA, clone pAR231, and cDNA to mRNA, clone pAB1. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 3681) AUTHORS Orbach,M.J., Sachs,M.S. and Yanofsky,C. TITLE The Neurospora crassa arg-2 locus: Structure and expression of the gene encoding the small subunit of arginine-specific carbamoyl phosphate synthetase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.S.Sachs, 07-MAY-1990. FEATURES from to/span description pept 1451 1630 arginine-specific carbamoyl phosphate synthetase, exon 1 (ARG-2) (EC 6.4.5.5) 1687 2868 arginine-specific carbamoyl phosphate synthetase, exon 2 pept 870 902 ORF, exon 1 1270 1282 ORF, exon 2 1359 1387 ORF, exon 3 pre-msg 752 3082 ARG-2 mRNA and introns pre-msg 809 3082 ARG-2 mRNA and introns (alt.) IVS 1631 1686 ARG-2 intron A IVS 903 1269 ORF intron A IVS 1283 1358 ORF intron B signal 161 166 TGACTC box signal 568 573 TGACTC box signal 1008 1013 TGACTC box signal 1307 1312 TGACTC box BASE COUNT 760 a 1093 c 863 g 965 t ORIGIN 1 ggatccagac gctatatttg attgttccct cttctgtcag cgtgcggcct ccgttgtccg 61 tgcagccgat tacgacggcc ggctctggcg caacctgggt ttgggagctt cagaagcttg 121 aaagcgaggc gaatcgcgac tgtacaagtc cccgcaactt gagtcaccga aagcgcgggg 181 tatccaccat ctttatctgc ggcggtaggt aggcggggac tttgcggggg ctttgcggga 241 cccgaccact ccaactccga gattcgttgg gggtgccccg ggcaggagat gccgagctgg 301 agtcgcgttg tttgtgaatg tcgtggctgg ggttgtgggc ataggttgcg atgtgtagtg 361 tacgtagtac atagtaggta tgttgaacac cggatgggtg attccagttg tgaggctccg 421 ggactagtag gggtggtaac cgtgatggtc aaagggcatg tggaacttgc agcgtcccaa 481 cgaagtccag gtccccagtc atcccaacgc cgcgatcaat cacccctcgg tccccgtaaa 541 tcggcctttg ggactgccta gcagcgggag tcaccgagtc ctggaccaat caggtcggcg 601 ctgaccctgt cactgcggca gaaaaaagac aaagagctca aaaaattcta ccagatccaa 661 tcaaatgtcg ccctttttct atttgcctca cccactttcg caccctcttt tatctccaag 721 tttctcccct cgacactaca actcactccg tcatcaactc ttgcccatct acaaaacaca 781 caataacccc aatcgcttcc cccgactttc ggactctttc gtccaacctt ctttcttcag 841 ctgaacttgt cttgtcgcaa tctgccctta tgaacgggcg cccgtcagtc ttcacctctc 901 aggtacgata acgcgatagc ccctccatcg tgctggtatc cagccaccat ccacccagtc 961 catcaaagaa gttcccccgg atgagcaatg aagcccccat accagcctga ctcgcgatgc 1021 tgtatcggag tagttgggac gacccgccag taacccacca acctgggtgc ttggcatcat 1081 gccgctctga gttggttttg ggagatgtgg ccagcggtga tccccaaagt tcagctcttc 1141 acggattccg attgcatatc tctcaagatc gcgcagacag ccatcctggg ggagacttct 1201 tctcctcgct tctgccgctt gccgtctgtt tgtagtgtct gatcaaaact aattctgttg 1261 cctttgtagg attacctctc aggtccgcat ccccgccgac tcccgttgac tcaccctttt 1321 ccggctggaa tactgacgat gaattttgtg tgttacagac catctgtgga gagcccttaa 1381 cgcataagag cctctcatca cccagcagcc gtaccaatca ccaccgcacc ccatcaccat 1441 tcaagtcaag atgttctctc gcttggccgc tcgtctcccc aaggcttctg ccctcaatgg 1501 cgtcgccgcc cgtcaggttc gcaacctgag ccagcccgcc attaccggca gcaagggcag 1561 gaacatgccc gcccgtgagc cgcgcactac tgccgctgcc accggcgccg aggccacctt 1621 cactatcagg gtgcgtacct acaactagtc ttctctcgtg gctaaaccta acattttctc 1681 gctcaggatg gccccgtttt ccagggtacc gccttcggtg ccaacaccaa catttctggt 1741 gaagccgttt tcaccacctc ccttgttggt taccccgagt cgatgaccga tccctcgtac 1801 cgcggccaga ttttggtctt cacccagccc ttgattggca actacggtgt cccttcgaat 1861 gagcgtgacg agttcaacct cctaagtact tcgagtctcc ccacatccag tgcgccggta 1921 tcgttgtctc cgatgtcgct acccagtaca gccactggac tgctgttcag agcttgggcg 1981 agtggtgcgc cagcgagggc attcccgcca ctctccggtg ttgacactcg tgcgattgtc 2041 acctacctcc gtgagcaggg ttcctctctc gccaggatct ccattggcga cgagtacgac 2101 gccgatgagg atgagggctt cattgaccct ggccagatca accttgtcaa gcgcgtgtcc 2161 accaaggctc cctttgtggt taccaacccc aacgccaagt tccacgtcgc tctgatcgac 2221 tgcggtgtca aggagaacat ccttcgtagc ttggtcagcc gtggcgcctc cgtcaccgtc 2281 ttcccctaca attaccccat ccacaaggtt gccgagaact tcgatggtgt cttcatttcc 2341 aacggccctg gtgatcccac ccactgccag gagactgttt acaacctcgc caagcttatg 2401 gagacttccc ctattcccat catgggtatc tgccttggtc accagcttct tgctctcgct 2461 gttggtgcca agaccatcaa gctaagtatg gtaatccgtg cccacaacat cccggctctt 2521 gacttgacca ctggtcagtg ccacatcacc tcgcagaacc acggttacgc cgtcgatatc 2581 agcaccctcc ctagcgactt caaggagtat ttcgtcaacc ttaacgacgg ctccaacgag 2641 ggtatgatgc acaagacccg ccccatcttc tctacccagt tccaccccga ggctaagggt 2701 ggtcccatgg actcttccta cctctttgac aagtacatgg agaatgtcga gcttttcaag 2761 agcaactccc aggtctaccg tgacaacagg cctactcagt tcatgattga cattctcagc 2821 aaggagcgtg tcggcgttga gcccactcct ctttctaacg ctgcttaacg gacgagaatg 2881 tctttgcatt ttcttagtga acggtgtttg tcattggtct tgggttattg ttttccagca 2941 caaaggatat gggttttgta aaagggactt tcttgggcga cgtcctctcg ttattaatgg 3001 tttagggtgt atatgatatc ttttgttact acttttgaat gcagaatcag ataagtttag 3061 tcggcattag gctgatatgg cgcagagagg acaatatcta gttgatacat acccatgtgc 3121 tatcacgtct aaacgtcttc cagtcgttgt attttaccca ttcctggtga ttgccgtatg 3181 tcttccagtc cagtacctcc cctaattcgg tgcccttcaa cattcagggt tatgttgaca 3241 gcgtgctgta taagcatggt tgccggggaa catggtgtca ttgactggta agcagggcag 3301 cctctggttc tgtgacacct ggttttccct ggtcttgatt tcaaactatt atcaaacaaa 3361 tcagcgtcgt cttagtagat ggcaggtatc ttagaacaga aagtagttat tcttcgtcgg 3421 atacgactcc aaaatcatat acgaacaagt ggatgcttta tactatgctt ctttttactg 3481 taatcttggt ctcgctcttc gcttagcctt cttcacctgg taaccggtat aggtaccttc 3541 gcaccctcaa ttggaaggat gcatcaaccc ctccatgcgc tgcttatcct gctatctatc 3601 catacatttc tatgttttta cagtgtagaa tatttcttgt gtatcagggt ccgtcccgcg 3661 atgctacccc ctgtcctgca g // LOCUS MUSIMPD 1707 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse IMP dehydrogenase mRNA, complete cds. ACCESSION M33934 KEYWORDS IMP dehydrogenase. SOURCE Mouse, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1707) AUTHORS Tiedemann,A.A. and Smith,J.M. TITLE Isolation and DNA sequence of a cDNA encoding mouse IMP dehydrogenase JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.M.Smith 27-APR-1990. Author address: Dr. J.M. Smith Seattle Biomedical Research Institute 4 Nickerson St. Seattle, WA 98109 FEATURES from to/span description pept 67 1611 IMP dehydrogenase (EC 1.2.1.14) BASE COUNT 457 a 408 c 451 g 391 t ORIGIN 1 cctctgcggc gcggtcctcg gagcggcgcg gttctcggag ccacgcgtct gtcttcctcc 61 gtggtcatgg cggactacct gattagcgga ggcacctctt acgtgccgga cgacgggctc 121 acagcgcagc agctcttcaa ctgcggggac ggcctcacct acaatgattt tctcattctt 181 cctgggtata tcgacttcac tgcagatcag gtggacttga cgtctgctct aactaagaag 241 attacactaa agaccccatt ggtttcctca cccatggaca ctgtcacaga ggctggaatg 301 gccatcgcga tggcgcttac aggaggtatt ggtttcatcc accacaactg cacacctgaa 361 ttccaggcca atgaagttcg gaaagtgaag aaatacgaac agggattcat cactgacccc 421 gtggtcctta gccccaagga tcgtgtacgc gatgtttttg aggccaaagc caggcatggc 481 ttctgtggta tccccatcac agatacaggc cggatgggga gtcgattggt gggcatcatc 541 tcctcaaggg acattgattt cctcaaggag gaagagcatg accggttctt ggaagagatc 601 atgactaaga gggaagattt ggtggtcgcc cctgccggcg tcactctgaa agaggcaaat 661 gagattctgc agcgaagtaa aaagggaaag ttgcccattg tgaatgaaaa tgatgagctg 721 gtagccatca ttgcccggac agacctaaag aagaatcgtg attaccccct ggcctccaaa 781 gatgccaaga agcaactgct gtgtggggca gccattggca ctcatgagga tgacaagtat 841 aggctggact tactggccct tgctggtgtg gatgtagtgg ttttggactc ttcccaggga 901 aactccatct tccaaatcaa tatgatcaaa tacatcaagg agaagtatcc cagtctacag 961 gtcattggag gcaatgtagt cactgctgcg caagccaaga acctcataga tgcaggtgta 1021 gatgctttgc gagtcggcat gggaagtggt tccatctgca tcacccagga agtgttggcc 1081 tgtgggcggc cccaagccac agcagtgtac aaggtctctg agtatgcccg tcgctttggt 1141 gttcctgtta ttgctgatgg aggaatccaa aatgtgggtc atattgccaa agctttggct 1201 cttggggctt ccacagtcat gatgggctcc ctcctggctg ccaccactga ggcccctggc 1261 gagtacttct tctcagatgg gatccggctg aagaaatacc gaggtatggg ttctcttgat 1321 gccatggaca aacatctcag cagccagaac cgatacttca gtgaagctga caaaatcaaa 1381 gtggcccaag gagtttcagg ggcagtgcag gacaaggggt ctatccacaa gttcgttcct 1441 tacctgattg ctggcatcca gcattcctgt caagacattg gtgccaagag tttaacccaa 1501 gtcagagcca tgacgtactc gggggagctt aaatttgaga agaggacatc ctctgctcag 1561 gtggaaggtg gcgtccacag cctccattcg tacgagaaac ggcttttctg aaaacagatc 1621 cagtatatgc cttgaatttt tcaataaaag tttgggaaaa aaaaagtgaa aaaaaaaaaa 1681 aaaaaaaaaa aaaaaaaaaa aaaaaaa // LOCUS HUMOSTRO 1425 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human osteopontin mRNA, complete cds. ACCESSION J04765 KEYWORDS osteopontin; phosphoprotein. SOURCE Human adult osteoblast, cDNA to mRNA, clone HOP-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1425) AUTHORS Young,M.F., Fisher,L.W., McBride,O.W. and Termine,J.D. TITLE cDNA cloning, chromosomal location and RFLP analysis of human osteopontin JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.F.Young, 04-MAY-1990. FEATURES from to/span description pept 68 1012 osteopontin precursor /hgml_locus_uid="LK0223T" /nomgen="OPN" /map="unassigned" sigp 68 115 osteopontin signal peptide matp 176 1012 osteopontin mRNA < 1 1425 osteopontin mRNA BASE COUNT 449 a 293 c 296 g 387 t ORIGIN Chromosome 4q13. 1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact 61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc 121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga 181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca 241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc 301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga 361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca 421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac 481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg 541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga 601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt 661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga 721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca 781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc 841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt 901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct 961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa 1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa 1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg 1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa 1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg 1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact 1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag 1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag // LOCUS RATALADPS 1612 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat delta-aminolevulinate dehydratase processed pseudogene, complete cds. ACCESSION J04764 KEYWORDS delta-aminolevulinate dehydratase; pseudogene. SOURCE Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1612) AUTHORS Bishop,T.R., Frelin,L.P. and Boyer,S.H. TITLE Isolation and characterization of a rat delta-aminolevulinate dehydratase processed pseudogene JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.R.Bishop, 20-APR-1990. FEATURES from to/span description pept.ps 271 1264 pseudo-delta-aminolevulinate dehydratase rpt 101 108 8 bp flanking direct repeat rpt 1385 1392 8 bp flanking direct repeat BASE COUNT 404 a 411 c 418 g 379 t ORIGIN 1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt 61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt 121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca 181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta 241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc 301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa 361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct 421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga 481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca 541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg 601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat 661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg 721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat 781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac 841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat 901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga 961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca 1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct 1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa 1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc 1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga 1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa 1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga 1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta 1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt 1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg 1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at // LOCUS MZEZEISP 215 bp ds-DNA PLN 28-JUN-1990 DEFINITION Z.mays zein storage protein gene, 5' flank. ACCESSION M33830 KEYWORDS storage protein; zein. SOURCE Z.mays endosperm DNA, clone pMS1. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 215) AUTHORS Grasser,K.D., Maier,U.-G., Haass,M.M. and Feix,G. TITLE Maize high mobility group proteins bind to CCAAT and TATA boxes of a zein gene promoter JOURNAL J. Biol. Chem. 265, 4185-4188 (1990) STANDARD simple staff_review BASE COUNT 84 a 42 c 26 g 63 t ORIGIN 1 gtataaatta taacccaaca aagtaatcac tcaatgtcaa aaccaactag ataccatgtc 61 atctctacct tatcttacta atatcttttt gcaaattcga aattaatctt gcacaagcac 121 aaggactgag atgtgtataa atatctctta gattagtagc taatatatcg cacatattat 181 tgagaccaac tagcaacata gaaagcacaa tagtg // LOCUS CHKARCASR 2392 bp ss-mRNA VRT 28-JUN-1990 DEFINITION Chicken 5-aminoimidazole ribonucleotide (AIR) carboxylase-5- aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR) synthetase, complete cds. ACCESSION M31764 KEYWORDS 5-aminoimidazole ribonucleotide carboxylase; 5-aminoimidazole-4-N-succinocarboxamide ribonucleotide synthetase. SOURCE Chicken liver, cDNA to mRNA, clone pZD1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2392) AUTHORS Chen,Z., Dixon,J.E. and Zalkin,H. TITLE Cloning of a chicken liver cDNA encoding 5-aminoimidazole ribonucleotide carboxylase and 5-aminoimidazole-4-n-succinocarboxamide ribonucleotide synthetase by functional complementation of Escherichia coli pur mutants JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3097-3101 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Zalkin, 01-FEB-1990. FEATURES from to/span description pept 59 1339 5-aminoimidazole ribonucleotide (AIR) carboxylase-5- aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR) synthetase matp 59 835 SAICAR synthetase domain matp 836 1336 AIR carboxylase domain site 2341 2347 poly-A signal BASE COUNT 696 a 466 c 527 g 702 t 1 others ORIGIN 1 ggcgctcagt ttctttcctg tgccttaggt gcctgtcgcc gccgccgctg tcgccgccat 61 ggcccccgcc gcctcagaac tgaaacttgg taaaaaagtt aatgagggta aaacgaaaga 121 agtgtatgag ctgccggata tcccgggatg cgttctgatg cagtccaagg accaaataac 181 agcggggaat gccgctagga aggaccggat ggaggggaag gctgccatct ccaacaccac 241 caccagctgc gtgttccagc tgctgcagga agcaggaatc aaaacggctt ttgtcaggaa 301 acagagtgac acagctttca tagcagctca ctgtgaaatg atcccaattg aatgggtctg 361 cagaagaatt gctactggct ctttcctcaa aagaaaccct ggtgtcaaag aaggctataa 421 gttttaccca cctaaaattg agatgtttta caaggatgat gccaataatg atccacagtg 481 gtctgaggag cagctaattg aagcaaaatt ctcttttgct ggacttacta ttggcaagac 541 tgaagtggat attatggctc gttctactca agctattttt gagatcctgg aaaaatcatg 601 gcagccccaa aactgcactc tggtggacct gaagattgaa tttggtgtta atattttgac 661 caaagaaatt gttcttgctg atgttattga taatgattca tggagactgt ggccatcggg 721 agacagaagc cagcagaagg acaaacagtc ctatcgagat ctgaaggaag tgactcctga 781 agcattgcag atggttaaga gaaactttga atgggttgca gaaagagtag agttgcttct 841 gaaaacaaag agccaaggta gagttgtggt attgatggga tctacttctg accttggcca 901 ctgtgagaaa ataaagaagg catgtgcaac ctttggaatt ccttgtgagt taagagtaac 961 ctccgctcac aaagggccag atgaaactct gaggatcaaa gcagaatatg aaggagatgg 1021 aatcccgact gtgtttgttg cagtagctgg cagaagcaat ggtttagggc cagtaatgtc 1081 tggtaacact gcttaccctg ttgtcaactg tcctcccctc tcatctgact ggggtgctca 1141 ggatgtgtgg tcctctctca gactgcccag tggtcttggc tgtcctacta ctctgtcacc 1201 tgaaggagct gctcagtttg ctgcccagat ttttggttta aacaaccact tggtatgggc 1261 caaactgcga tcaaacatgt taaatacatg gatctctttg aagcaggctg acaaaaaatt 1321 gcgggagtgc accttgtaag tccaaccaac aagtaactcc tcctcgctac acaaagatag 1381 tggcgtgcat atgcatttgt atcaggattt gcagttggat gatgtgagac atcttttctt 1441 acagaaggaa gcatcttcag agcagcagct gagtctgcta tgtcctcatt ccttttcttg 1501 tgtatgtttt tggttttttt agtatagcca cttagtaaaa gatgagaagg taatatttta 1561 gttatccttc tgttgatcca aaatagtgct gttctgctct ttagtgtata ctgttaacta 1621 ctgatgatgg ttatgtcaac acaatgattg atttggtgtg atttaaatag aaaccccctt 1681 ttttttcccc ctgtaacagt aacattcttc catagtaata atctatgtcc tgaagcttat 1741 taactcaatg ggactgctaa ctaaaactag tgctgatgta gtgctgtaat aatggtatgc 1801 tttcttttag caataaaatg gcaacatatg ctgtattaat aagaaaatag gctaagaatg 1861 agtaaaagcc tgaaagaaat aactgcctta tttaaaagaa aaacatatat tttttcatgt 1921 gtatgtgtgt atgtatatac accccatcag taactttacc acatcatttg agtgagttaa 1981 cagttctctt gtttgtgaga agctctaatt ttgaatggaa ccataactag gcttgttgcc 2041 tacttgctgt ataactcttg actggtccag tcttttgtcc ttcaaatagt agacagatct 2101 actgtgtatg tatgtgatgc tgtcaaatta tccttataca catcaatgta tctatattcc 2161 ctcactctgt ttctatggga attaaactga actgaaggtg ggaataaaca gtaaagcaac 2221 agaatacacg ggcaagccta agcatgactg tcttgntctt tgcacacaac attctttaaa 2281 gtaggtactg attccaagaa atatgcaatt actcaatgta tgtacaattt atgcttttat 2341 aattaaataa actatttaat gcaaggaatt cgatatcaag cttatcgata cc // LOCUS ECORAFRO 1289 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.coli raffinose repressor (rafR) gene, complete cds, and alpha-galactosidase (rafA) gene, 5' end. ACCESSION M29849 KEYWORDS alpha-galactosidase; raffinose repressor. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1289) AUTHORS Aslanidis,C. and Schmitt,R. TITLE Regulatory elements of the raffinose operon: Nucleotide sequences of operator and repressor genes JOURNAL J. Bacteriol. 172, 2178-2180 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Schmitt, 13-NOV-1989. FEATURES from to/span description pept 166 1176 raffinose repressor pept 1270 > 1289 alpha-galactosidase binding 156 161 ribosome binding site binding 1258 1263 ribosome binding site signal 120 126 -35 region signal 144 149 -10 region signal 1212 1216 -35 region signal 1235 1240 -10 region site 1196 1213 raf operator site 1217 1234 raf operator BASE COUNT 317 a 283 c 326 g 363 t ORIGIN 1 aagcttatta tgcttccatc ggaaacaatg atctggcagc ctgagttcac agataaaata 61 tctctccagg aaacccgggg ctgttcatca tgcaagtctg tcgattactg gctttggtga 121 cggaattttc tggatttccg gcttagaacc acagcaggag ataatatgtc acttaaagcg 181 attgccacga cactcggtat ttctgtcacc actgtcagtc gggctcttgg aggcttttca 241 gatgtggctg cttctacccg tgagcgcgtg gaagcggaag cacgtcgacg aggttaccgc 301 cctaatacac aggcaagaag actcaaaacc ggtaaaaccg atgctatcgg tctggtttat 361 cctgaaaatg atgtgccgtt taacagcggt gtttttatgg atatggtcag ttgcatcagc 421 agggaacttg cttatcatga tattgactta ctgctgatcg ctgatgatga gcatgcagac 481 tgccacagct atatgcggct tgttgaaagt cgcagaattg atgctcttat cattgcacat 541 actctggatg acgatccccg tatcacacat cttcataaag caggtattcc gtttctggct 601 cttggacggg taccgcaggg cttgccctgt gcgtggtttg actttgataa tcatgccgga 661 acctggcagg caacccagaa gctgattgct ttgggacata agagtattgc gctgttgagc 721 gagaacactt cacattctta tgttattgca agacgtcagg gatggcttga tgcactgcat 781 gagcatggac tgaaagatcc attgttgcgg ctggtttctc ccacgcgacg agcgggctat 841 ctggctgtga tggagttaat gtcattaccg gcgccaccaa cagctattat tactgacaat 901 gacctgagtg gagatggtgc ggctatggcg ctgcagttga gagggcgtct ttcagggaaa 961 gaagctgtat ctctggttgt atatgatggt ttgcctcagg acagcattat tgagctggat 1021 gtggctgctg ttattcagtc aacacgaagt ctcgttggtc gtcagatttc tgacatggtg 1081 tatcagataa tcaatggtgc atcaccagaa tcactgcaga taacctggac accgatattt 1141 taccctggta gcacggttca ttctccttcc ttctgatttt ttatccagat cacacaaccg 1201 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg 1261 agatgtctaa tgatttcaaa gtactgcag // LOCUS HSEDILSTD 351 bp ds-DNA VRL 28-JUN-1990 DEFINITION Equine herpesvirus type 1 subfragments from near the left terminus of STD DNA. ACCESSION M34145 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone Sma2. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 351) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_review BASE COUNT 55 a 117 c 127 g 52 t ORIGIN 1 cgccgctcac gtcacactcc tccccaaaga gggtattcgg ggggtcgctc ggcggctcca 61 ggaaccgctg ctcgtaccgg tcctcggagt tcagtgcccg cagcgtggcg cgcagcggga 121 agctctgctc gggcagcagc tgcaggaggg actgccagca ctcgagtgtg gggcgcgtgt 181 gcgtgtaccc catggcgtaa aaatccagca gcgccctgcg cagggtgcgc gagtcctcgg 241 tcacgtggat cgaggaggtg gatatccccc tgaacaagcg gttgacgtcc cccgcgaggc 301 ggttcacatc ggggcgccac ggcggggcgc agaacgcccc tggcccccgg g // LOCUS HSEDISA 633 bp ds-DNA VRL 28-JUN-1990 DEFINITION Equine herpesvirus type 1 defective interfering particle DNA. ACCESSION M34143 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone pHSII. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 633) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_entry BASE COUNT 124 a 184 c 202 g 123 t ORIGIN 1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac 61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa 121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt 181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa 241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc acgtcacact 301 cctccccaaa gagggtattc ggggggtcgc tcggcggctc caggaaccgc tgctcgtacc 361 ggtcctcgga gttcagtgcc cgcagcgtgg cgcgcagcgg gaagctctgc tcgggcagca 421 gctgcaggag ggactgccag cactcgagtg tggggcgcgt gtgcgtgtac cccatggcgt 481 aaaaatccag cagcgccctg cgcagggtgc gcgagtcctc ggtcacgtgg atcgaggagg 541 tggatatccc cctgaacaag cggttgacgt cccccgcgag gcggttcaca tcggggcgcc 601 acggcggggc gcagaacgcc cctggccccc ggg // LOCUS HSEDISTD 290 bp ds-DNA VRL 28-JUN-1990 DEFINITION Equine herpesvirus type 1 subfragments from the inverted repeat of STD DNA. ACCESSION M34144 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone xbaG.. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 290) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_review BASE COUNT 69 a 72 c 77 g 72 t ORIGIN 1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac 61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa 121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt 181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa 241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc // LOCUS HUMAR 1230 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) mRNA, complete cds. ACCESSION M30704 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SOURCE Human cell line MCF-7 breast carcinoma, cDNA to mRNA, clones lambda-AR1 and lambda-AR2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1230) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept 210 968 amphiregulin precursor /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" sigp 210 266 amphiregulin signal peptide matp 510 962 amphiregulin (alt.) matp 528 962 amphiregulin (alt.) mRNA 1 1230 amphiregulin mRNA BASE COUNT 375 a 273 c 285 g 297 t ORIGIN 1 agacgttcgc acacctgggt gccagcgccc cagaggtccc gggacagccc gaggcgccgc 61 gcccgccgcc ccgagctccc caagccttcg agagcggcgc acactcccgg tctccactcg 121 ctcttccaac acccgctcgt tttgcggcag ctcgtgtccc agagaccgag ttgccccaga 181 gaccgagacg ccgccgctgc gaaggaccaa tgagagcccc gctgctaccg ccggcgccgg 241 tggtgctgtc gctcttgata ctcggctcag gccattatgc tgctggattg gacctcaatg 301 acacctactc tgggaagcgt gaaccatttt ctggggacca cagtgctgat ggatttgagg 361 ttacctcaag aagtgagatg tcttcaggga gtgagatttc ccctgtgagt gaaatgcctt 421 ctagtagtga accgtcctcg ggagccgact atgactactc agaagagtat gataacgaac 481 cacaaatacc tggctatatt gtcgatgatt cagtcagagt tgaacaggta gttaagcccc 541 cccaaaacaa gacggaaagt gaaaatactt cagataaacc caaaagaaag aaaaagggag 601 gcaaaaatgg aaaaaataga agaaacagaa agaagaaaaa tccatgtaat gcagaatttc 661 aaaatttctg cattcacgga gaatgcaaat atatagagca cctggaagca gtaacatgca 721 aatgtcagca agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca 781 tgattgacag tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg 841 tgatcctcac agctgttgct gttattacag tccagcttag aagacaatac gtcaggaaat 901 atgaaggaga agctgaggaa cgaaagaaac ttcgacaaga gaatggaaat gtacatgcta 961 tagcataact gaagataaaa ttacaggata tcacattgga gtcactgcca agtcatagcc 1021 ataaatgatg agtcggtcct ctttccagtg gatcataaga caatggaccc tttttgttat 1081 gatggtttta aactttcaat tgtcactttt tatgctattt ctgtatataa aggtgcacga 1141 aggtaaaaag tattttttca agttgtaaat aatttattta atatttaatg gaagtgtatt 1201 tattttacag ctcattaaac ttttttaacc // LOCUS HUMAR1 968 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 1. ACCESSION M30698 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 1 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 968) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept 859 + 919 amphiregulin precursor, exon 1 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" sigp 859 915 amphiregulin signal peptide pre-msg 649 > 968 amphiregulin mRNA and introns IVS 920 > 968 amphiregulin intron A BASE COUNT 206 a 280 c 257 g 225 t ORIGIN 1 gaattcatat ccacctggct ttgaacatta tcggctgtga gatggtgtag gtaaaatttt 61 aagtgcataa tttggcaata ataaatcatc aataaatatt aatgttgatg aggcccctgg 121 gccacataaa gaaataggga gtgaggggat ttgaaattct ggccacttca cagaaatggg 181 tgggaagggg ctcttgattg agatagaagc ccatcctaca tgaagcaatt cctcattgag 241 ttctctcgtc ctttatcctt gttggaaaca tcaggcaaag tcactcttgg tcttaaagta 301 cttttacatc taaatacgga actcttctat ttaatccctg tctgttgtag atgttaagta 361 tacaaagagg ttgtcagagt ttgaaacatc tggacttctg tcaggtacta gctccggaac 421 tccagtcctg ctcgccctca aaaacggctt gcagctagag gtttaagttc cacttcctct 481 cagcgaatcc ttacgcacga gggaggcggg gcgtgtgtcc tccgcgcgtg gttttcgggt 541 agcaccttct ggggcgccgc ctgcctccac ccacggccgg gccttgacgt catgggctgc 601 ggccccctcc cggctgagcc tataaagcgg caggtgcgcg ccgccctaca gacgttcgca 661 cacctgggtg ccagcgcccc agaggtcccg ggacagcccg aggcgccgcg cccgccgccc 721 cgagctcccc aagccttcga gagcggcgca cactcccggt ctccactcgc tcttccaaca 781 cccgctcgtt ttgcggcagc tcgtgtccca gagaccgagt tgccccagag accgagacgc 841 cgccgctgcg aaggaccaat gagagccccg ctgctaccgc cggcgccggt ggtgctgtcg 901 ctcttgatac tcggctcagg tgaggattca acggcgctga actgctgggc tctcctccca 961 tggcaggt // LOCUS HUMAR2 348 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 2. ACCESSION M30699 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 2 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 348) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 51 + 299 amphiregulin precursor, exon 2 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp 290 + 299 amphiregulin pre-msg < 1 > 348 amphiregulin mRNA and introns IVS < 1 50 amphiregulin intron A IVS 300 > 348 amphiregulin intron B BASE COUNT 88 a 75 c 86 g 99 t ORIGIN 1 agcaccctac tttacctttt cgttttcttc ctttattccc tcccctgcag gccattatgc 61 tgctggattg gacctcaatg acacctactc tgggaagcgt gaaccatttt ctggggacca 121 cagtgctgat ggatttgagg ttacctcaag aagtgagatg tcttcaggga gtgagatttc 181 ccctgtgagt gaaatgcctt ctagtagtga accgtcctcg ggagccgact atgactactc 241 agaagagtat gataacgaac cacaaatacc tggctatatt gtcgatgatt cagtcagagg 301 tgagtagggg ataaagcaaa aatatggcct gtgagatgtg ggtttata // LOCUS HUMAR3 302 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 3. ACCESSION M30700 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 3 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 302) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 52 + 252 amphiregulin precursor, exon 3 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp + 52 + 252 amphiregulin matp 60 + 252 amphiregulin (alt.) pre-msg < 1 > 302 amphiregulin mRNA and introns IVS < 1 51 amphiregulin intron B IVS 253 > 302 amphiregulin intron C BASE COUNT 125 a 46 c 54 g 77 t ORIGIN 1 aattatattc aagtttgaga gactcttgtc aataaatctt ttctttttta gttgaacagg 61 tagttaagcc cccccaaaac aagacggaaa gtgaaaatac ttcagataaa cccaaaagaa 121 agaaaaaggg aggcaaaaat ggaaaaaata gaagaaacag aaagaagaaa aatccatgta 181 atgcagaatt tcaaaatttc tgcattcacg gagaatgcaa atatatagag cacctggaag 241 cagtaacatg cagtaagttt tcctaaagca tatagatttt tgtatttcta gcaccatgtc 301 tg // LOCUS HUMAR4 254 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 4. ACCESSION M30701 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 4 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 254) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 51 + 204 amphiregulin precursor, exon 4 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp + 51 91 amphiregulin (both alt.) pre-msg < 1 > 254 amphiregulin mRNA and introns IVS < 1 50 amphiregulin intron C IVS 205 > 254 amphiregulin intron D BASE COUNT 81 a 42 c 53 g 78 t ORIGIN 1 cacaccgcac gtgagtgtga ttataatttt taaatgtgaa ttgcttgcag aatgtcagca 61 agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca tgattgacag 121 tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg tgatcctcac 181 agctgttgct gttattacag tccagtaagt atgacataac ttacaaattc ttaataaaat 241 aatgggaggt taat // LOCUS HUMAR5 217 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, exon 5. ACCESSION M30702 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 5 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 217) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 50 143 amphiregulin precursor, exon 5 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" pre-msg < 1 > 217 amphiregulin mRNA and introns IVS < 1 49 amphiregulin intron D BASE COUNT 89 a 26 c 41 g 61 t ORIGIN 1 tatagatgaa tagaaccttg ataacattag aatgccttgt tctctgaagg cttagaagac 61 aatacgtcag gaaatatgaa ggagaagctg aggaacgaaa gaaacttcga caagagaatg 121 gaaatgtaca tgctatagca taactgaaga taaaattaca ggtttgagtt ttaaaatata 181 tctttagatc atatcctata attttgaaaa atttaac // LOCUS HUMAR6 354 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human amphiregulin (AR) gene, 3' flanking region. ACCESSION M30703 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 6 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 354) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pre-msg < 1 294 amphiregulon mRNA and intron BASE COUNT 104 a 48 c 52 g 150 t ORIGIN 1 gtaacatttt gttttatttt attattttat tttattttat tttctcacag gatatcacat 61 tggagtcact gccaagtcat agccataaat gatgagtcgg tcctctttcc agtggatcat 121 aagacaatgg accctttttg ttatgatggt tttaaacttt caattgtcac tttttatgct 181 atttctgtat ataaaggtgc acgaaggtaa aaagtatttt ttcaagttgt aaataattta 241 tttaatattt aatggaagtg tatttatttt acagctcatt aaactttttt aaccaaacaa 301 attgagagtt tgaatattag ttctgatatt gcaagactcc agtgtacttt tctc // LOCUS HUMROSSAA 1890 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human Ro ribonucleoprotein autoantigen (Ro/SS-A), complete cds. ACCESSION M32294 KEYWORDS Ro ribonucleoprotein autoantigen. SOURCE Human T-cell line Jurkat, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1890) AUTHORS McCauliffe,D.P., Lux,F.A., Lieu,T.-S., Sanz,I., Hanke,J., Newkirk,M.M., Bachinski,L.L., Itoh,Y., Siciliano,M.J., Reichlin,M., Sontheimer,R.D. and Capra,J.D. TITLE Molecular cloning, expression and Chromosome 19 localization of a human Ro/SS-A auto antigen JOURNAL J. Clin. Invest. 85, 1379-1391 (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.P.McCauliffe, 23-FEB-1990. FEATURES from to/span description pept 67 1320 Ro ribonucleoprotein autoantigen (Ro/SS-A) precursor sigp 67 117 Ro ribonucleoprotein autoantigen signal peptide matp 118 1317 Ro ribonucleoprotein autoantigen site 1881 1886 poly-A signal (put.) BASE COUNT 464 a 497 c 520 g 409 t ORIGIN Chromosome 19. 1 ccgtactgca gagccgctgc cggagggtcg ttttaaaggg ccgcgttgcc gccccctcgg 61 cccgccatgc tgctatccgt gccgctgctg ctcggcctcc tcggcctggc cgtcgccgag 121 cccgccgtct acttcaagga gcagtttctg gacggagacg ggtggacttc ccgctggatc 181 gaatccaaac acaagtcaga ttttggcaaa ttcgttctca gttccggcaa gttctacggt 241 gacgaggaga aagataaagg tttgcagaca agccaggatg cacgctttta tgctctgtcg 301 gccagtttcg agcctttcag caacaaaggc cagacgctgg tggtgcagtt cacggtgaaa 361 catgagcaga acatcgactg tgggggcggc tatgtgaagc tgtttcctaa tagtttggac 421 cagacagaca tgcacggaga ctcagaatac aacatcatgt ttggtcccga catctgtggc 481 cctggcacca agaaggttca tgtcatcttc aactacaagg gcaagaacgt gctgatcaac 541 aaggacatcc gttgcaagga tgatgagttt acacacctgt acacactgat tgtgcggcca 601 gacaacacct atgaggtgaa gattgacaac agccaggtgg agtccggctc cttggaagac 661 gattgggact tcctgccacc caagaagata aaggatcctg atgcttcaaa accggaagac 721 tgggatgagc gggccaagat cgatgatccc acagactcca agcctgagga ctgggacaag 781 cccgagcata tccctgaccc tgatgctaag aagcccgagg actgggatga agagatggac 841 ggagagtggg aacccccagt gattcagaac cctgagtaca agggtgagtg gaagccccgg 901 cagatcgaca acccagatta caagggcact tggatccacc cagaaattga caaccccgag 961 tattctcccg atcccagtat ctatgcctat gataactttg gcgtgctggg cctggacctc 1021 tggcaggtca agtctggcac catctttgac aacttcctca tcaccaacga tgaggcatac 1081 gctgaggagt ttggcaacga gacgtggggc gtaacaaagg cagcagagaa acaaatgaag 1141 gacaaacagg acgaggagca gaggcttaag gaggaggaag aagacaagaa acgcaaagag 1201 gaggaggagg cagaggacaa ggaggatgat gaggacaaag atgaggatga ggaggatgag 1261 gaggacaagg aggaagatga ggaggaagat gtccccggcc aggccaagga cgagctgtag 1321 agaggcctgc ctccagggct ggactgaggc ctgagcgctc ctgccgcaga gcttgccgcg 1381 ccaaataatg tctctgtgag actcgagaac tttcattttt ttccaggctg gttcggattt 1441 ggggtggatt ttggttttgt tcccctcctc cactctcccc caccccctcc ccgccctttt 1501 tttttttttt tttaaactgg tattttatcc tttgattctc cttcagccct cacccctggt 1561 tctcatcttt cttgatcaac atcttttctt gcctctgtgc cccttctctc atctcttagc 1621 tcccctccaa cctggggggc agtggtgtgg agaagccaca ggcctgagat ttcatctgct 1681 ctccttcctg gagcccagag gagggcagca gaagggggtg gtgtctccaa ccccccagca 1741 ctgaggaaga acggggctct tctcatttca cccctccctt tctcccctgc ccccaggact 1801 gggccacttc tgggtggggc agtgggtccc agattggctc acactgagaa tgtaagaact 1861 acaaacaaaa tttctattaa attaaatttt // LOCUS PRD1XII 1701 bp ds-DNA PHG 28-JUN-1990 DEFINITION Bacteriophage PRD1 early protein genes XII and XIX, complete cds. ACCESSION M30146 KEYWORDS early protein XII; early protein XIX. SOURCE Bacteriophage PRD1 (Viradea strain) DNA. ORGANISM Bacteriophage PRD1 Viridae; ds-DNA nonenveloped viruses; Tectiviridae. REFERENCE 1 (bases 1 to 1701) AUTHORS Gerendasy,D. and Ito,J. TITLE Nucleotide sequence and transcription of the right early region of bacteriophage PRD1 JOURNAL J. Bacteriol. 172, 1889-1898 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Gerendasy, 27-NOV-1989. FEATURES from to/span description pept 242 724 early protein XII pept 797 955 early protein XIX (put.) mRNA 179 732 early protein XII mRNA mRNA 181 1100 early protein XIX mRNA binding 230 233 ribosome binding site binding 785 788 ribosome binding site rpt 1 111 inverted terminal repeat site 145 172 right early promoter BASE COUNT 419 a 447 c 390 g 445 t ORIGIN 1 gggggatacg tgcccctccc cacctacccg cgcccctaac atttttattt ccgtctgtca 61 ataccccctg catccgatag gcccgaacta tcacaaacgg aaaagcgata ggaataaaat 121 tttgatttat tttcaccata actattgcac gtcctaaaaa tgtgcgtata atttcaatca 181 tcgaaagggc aaacacaaaa cccccgccga taacttccac ttaactttaa aggtaactat 241 catggaaatc gtaagcaagc tgactctgaa aaccattggc gctcaaccca agccacatag 301 cgtaaaagaa aataccgcgc tggcttccat ctatggccgc gttcgcggta agaaagttgg 361 tcaatccacc tttggcgact tcatcaagtt tgaaggtgaa tttgaaggcg tgaatatcgc 421 cactggtgaa gtgttccgtt ccggtgcgct gattctgccg aaggtactgg aaagcctgct 481 tgccggtgcc gtggatggtg aaaacacggt tgattttgcg gttgaaattt gggccaagcc 541 ttccgaaaag ggcaacactg gttatgaata cggtgtcaag ccgctgattg aacccgccgc 601 atcggatgaa ctggccgcgc ttcgcaatca ggttaaggcc gcgctgcctg cccctgccgc 661 tgccggtgaa gccgctgccg aagccaagcc cgccgccaag gccaaggcca aggccgaagc 721 ctaaacagcg caccacggcc ccggctgata ccagcgcggg gccattcttt acccttcaaa 781 ttcaaaggta acaaacatgg aaaagcaaac cgaaaatacc cgccctgaat gcccgaaggc 841 attttatttt gtatccattc ccggcgactt tgggcaaacc ccttttgctt catcgcttat 901 gtatggtagc accgcgcttg ccgctgttat caagtcaagg gcgctatccg cgtagtaagc 961 gaacattttg atttgcggtt tgctgataac ggttttacgc ctgctggcgt tacgcaagct 1021 gaatggcttg gcaagctgat tactgaaact ttcggctttc gcttggaact gtttctttaa 1081 accagttttt tatttcttga atttgaaggc cgcatgattc ccttgcggcc ttttcttttg 1141 cgtatgcttc caccaaatcc cgccacgtta taaacggata aacaggcgcg gcgcattcag 1201 ttagcaggct tgacggcgga accaaataaa ccgtttcaat tttcggggct gttgtcgaac 1261 aagccgcgaa catcggcagg cataggacta tcaagaattt gtttagtttc atcgcttacc 1321 cttcccaatc cgctaattgt tcggctaagt ttgccgctga taattgtctg gccttcctga 1381 attgccgcga ttgctttcgc atcgcttgcg cgtactgatt gaagggtttc caccgccttg 1441 gcgtttgcgg tcgccttttc ttcaatggct ttaatttcat tttgaagccc cgttattttt 1501 ttagccgcta aaattgaata tccaaagtta gcaagcatta aaagcgcaac cgcaatagcg 1561 gccacgcttt tcaggttaga caaaacccat ttaatcattg aaggtacgcc ccggcttgtc 1621 attgtcggaa tctatcggcc ccggctgatt gataggggca atcttgcttt gcacaaattg 1681 cgagaaaacc gccttgacaa a // LOCUS PSEALALPQ 3141 bp ds-DNA BCT 28-JUN-1990 DEFINITION P.aeruginosa exopolysaccharide alginate regulatory protein (algP and algQ) genes, complete cds. ACCESSION M32077 KEYWORDS alginate regulatory protein P; alginate regulatory protein Q. SOURCE P.aeruginosa (strain PAO, isolate PA02003) DNA, from patient PA02003. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 3141) AUTHORS Konyecsni,W.M. and Deretic,V. TITLE DNA sequence and expression analysis of algP and algQ, components of the multigene system transcriptionally regulating mucoidy in Pseudomonas aeruginosa: algP contains multiple direct repeats JOURNAL J. Bacteriol. 172, 2511-2520 (1990) STANDARD full staff_review COMMENT draft entry and computer-readable sequence [1] kindly submitted by V.Deretic, 16-FEB-1990. FEATURES from to/span description pept 781 1263 alginate regulatory protein Q pept 2079 3137 alginate regulatory protein P (put.) mRNA 661 1310 algQ mRNA mRNA 2054 3141 algP mRNA BASE COUNT 607 a 1026 c 1009 g 499 t ORIGIN 1 bp upstream of BamHI site; map position 9 min. 1 ggatccgcca tgcccctggc cagcccccgt cagcttttcc ttctcgcgtt cctggcctgc 61 gtcgccatca tgggcggggc gctgtacctg gaacatgtgg ttggcctgga ggcctgcccg 121 ctgtgcgtcg tgcagcggat cttcttcatc ctgatcggcc tgacctgcct tgctggcgcg 181 atccaggggc ccggcctgcg tgggcggcgt atctactccg tgctggtgtt cctgctcgct 241 ctcggcggcg gggccacggc cgcccgccag gtatggttgc agaccgttcc gctggaccaa 301 ctgccggcct gcctgcccag cctcgactac atgatgcagg cgcttccctt ccaggaagtg 361 atccgcctgg tcctgcatgg caccgcggat tgtgcccagg tgagctggac gctattcacc 421 ctgagcattc ccgaatggag cctgctggcg ttcgttgcct atctcggctt ctccatcgtg 481 cagttcctcc gacgtgcctg atgcgggctg tcgcgtggtc tttcgtcagc tctcgagcgt 541 cggccgtatc gatttggcga aaaccagagc ggatgcggga taccgatcaa acatccgtat 601 gaatttttct gtacgggcgt cttgatgggg tcatcccggg ggcataatct ggctttccat 661 agcgccggag tccgcccctt gcacaggcgt tctcccacag agcgcagaga gtgcgtcagc 721 aatgcgtctc ggagcggacg atgattctcc gggacgtccg aagagggaag cgaggacaac 781 atgctcgaaa gctgccgtaa tgcccaagag cggtgggggg gtgtgcacca actgatcgat 841 cgctggctgc atgagcgcca gcaactggtg caggccttcg atgccctgag cggtattcag 901 gcaccggcgc ccaacgccga agaactccag catttctgcc aattgctgct ggactatgta 961 tctgccggac acttcgaggt ctacgagcaa ctgacggcgg aaggcaaggc cttcggcgat 1021 cagcgcggcc tggagctggc caagcagatc ttcccccggc tggaagccat caccgaatcc 1081 gcgctgaact tcaacgaccg ctgcgacaac ggcgattgcc gtgaaggagc ctgcctcatc 1141 gcggagctga aggtcctgcg gcaacagttg cacgaacgct tcgagctgga ggactgcctc 1201 atcgaagtcc tgcacaacgc ccacagccag agcggcgcag agggttcggc agtaccggtc 1261 tgattcaccg gcgagtcgtt acgaaacggc gcgtctccat gcgggagagc gccgtttttg 1321 catttcagcg gaagcccagc aggtcgatct cgaagaccag cggagcgtcg ggcgggatca 1381 ggtcgccagc gccttcgtga ccataggctt gcgccgaagg aatcaccact cgccagcgcg 1441 cgccgacagg catcgctcgc aacgctgtcc gccatccctc tatcacactg tccagggcga 1501 accactcggc gctttcgctc tggtcgaaca cttgaccgtc ggcaagcagg ccgcggtaac 1561 ggacgtgtac ctgggttgcc gcgccgatgc cgttaccttg gccacgacgt aattcgctga 1621 cgaggacgcc gccggtcaat tcgcggacgc cgaagcgggc tttttcgttg gccaggaagc 1681 gggcctcggc gggtgtggtc cgcggtgttt ccgtcgtggc gttctcgtgt tgcagcagaa 1741 gttgttcgat gcgttcgggg gggatttcaa gtgcctcgcc gcgataggcc tggcgcagtc 1801 cgagaagtag ctcggatagc tctaggccgg gcatttcctg ctgtaggcgc atacccaggc 1861 gggcgcctac ggcataggcc agttcgtcct tgggaggcgc ggcgaaaagc ggcagcgcag 1921 tagaaaaaga ccgatgaaaa ggcggcgcgg catgcggttt ctccaaacga atcggcgggc 1981 aatcctggcg cacgtcgctt gtagcacatc gaacccgttg gcgagagggg gtgtgcgggt 2041 ctagtatggg cgcaaccacg tccgcctgga ggcacgtcat gtcggccaac aagaagcccg 2101 tcaccacccc cttgcacctg ttgcagcaac tttcccacag ccttgtcgag cacctggaag 2161 atgcgtgcaa acaagcgctg gtcgattcgg aaaagctcct ggccaaactt gaaaagcaac 2221 gtggcaaagc ccaggaaaag ctgcacaagg ctcgcaccaa gctgcaggat gctgccaagg 2281 ccggcaagac caaggcacag gccaaggcgc gcgagaccat cagcgacctg gaagaggcgt 2341 tggataccct gaaggcccgg caggcggaca cccgtaccta catcgtcggc ctcaagcgtg 2401 acgtacagga aagcctcaag ctggcgcagg gtgtcggcaa ggtgaaggaa gctgctggca 2461 aggctctgga gagccgcaag gcgaaacccg cgaccaaacc tgctgcgaag gcggcagcca 2521 agcctgcggt gaaaaccgta gcggcgaagc ctgcggccaa gccggctgcg aagcctgctg 2581 cgaaaccggc ggccaagcct gcgacgaaaa ccgcggcagg caagcccgca gccaagccga 2641 cggcgaagcc tgctgcgaaa ccggcggcca agcccgcggc gaaaaccgca gccgcgaagc 2701 ccgcagccaa gccggcggcg aagcctgtgg cgaaaccggc ggccaagcct gcggcgaaaa 2761 ccgcagccgc gaagcccgcc gccaagccgg cagcgaagcc tgtcgcgaaa ccgacggcca 2821 agcccgcggc gaaaaccgca gccgcgaagc ccgcagccaa gccagctgcg aagcctgcgg 2881 cgaaaccggc ggccaagcct gtggcgaaat ccgcggccgc gaagcctgca gccaagccgg 2941 ctgcgaagcc tgcggcgaaa ccggcggcca agcctgcggc gaaacccgta gccgcgaaac 3001 ctgccgcaac caagcccgcc accgctcctg ctgcgaagcc tgcggcgact cccagcgccc 3061 cggcagccgc ctccagcgct gcttcggcaa cgcctgccgc gggcagcaac ggcgccgccc 3121 cgaccagcgc ctcctaaggc g // LOCUS STYAROF 1235 bp ds-DNA BCT 28-JUN-1990 DEFINITION S.typhimurium tyrosine sensitive 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase (aroF) gene, complete cds. ACCESSION M31302 KEYWORDS 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase. SOURCE S.typhimurium (strain LT2) DNA, clone pGM59. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1235) AUTHORS Muday,G.K. and Herrmann,K.M. JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 165 to 1235) AUTHORS Muday,G.K. and Herrmann,K.M. TITLE Regulation of the Salmonella typhimurium aroF gene in Escherichia coli JOURNAL J. Bacteriol. 172, 2259-2266 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.M.Herrmann, 13-JAN-1990. Author address: K.M.Herrmann Purdue University Biochemistry department West Lafayette IN, 47907 (317) 494-1653 FEATURES from to/span description pept 165 1235 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase (E.C. 4.1.2.15) site 1 20 operator aroFo2 site 52 71 operator aroFo1 site 69 113 promoter site 75 94 TyrR box-1 BASE COUNT 309 a 286 c 343 g 297 t ORIGIN 1 gtgtaaagtt tttgatacga aatgagggag attaatcgac gattgcgcat ggtgtaaagt 61 ttattttacg aaatatggat tgaaatcttt actttttgtg gtatggttta aacatcctcg 121 tggaggatca actatcgcaa acaagaaaag acaggatcgc catcatgcaa aaagacgcgc 181 tgaataacgt acgtatcacc gatgaacagg tattaatgac gccggagcag cttaaagcgg 241 cctttccctt gagcctggcg caggaagccc agatagcgca gtcccgggga atcatttctg 301 acattattgc cgggcgcgat ccgcgtctct tggtggtatg cggtccttgt tctattcacg 361 atcctgaaac cgctctggaa tatgcccgtc gatttaaagc ccttgccgca gaggtcagcg 421 atagcctcta tctggtaatg cgcgtctatt ttgaaaagcc gcgaactacc gtcggctgga 481 aagggctgat taacgatcct cacatggatg gctcatttga tgtggaagcc gggttgaaaa 541 tagcgcgtca gctactggtg gaactggtga atatggggtt gccattggcg accgaagcgt 601 tggatccgaa cagcccgcaa tacctgggcg atctgtttag ctggtcggcg ataggggcgc 661 gcacaaccga atcgcaaacc caccgcgaaa tggcgtctgg tctttctatg ccggtcgcgt 721 ttaaaaacgg cacggatggc agcctggcga cagcgattaa cgccatgcgc gccgctgcgc 781 aacctcatcg ttttgttggc attaaccagg ccggtcaggt tgcgttattg caaacccagg 841 gaaatccgca tggccatgtg attctgcgtg gcggcaaagc gccgaactat agcccggcag 901 atgtcgctca gtgtgaaaaa gagatggaac aggcgggact acgtccttcg ctgatggtag 961 attgcagtca tggtaactcc aataaagatt atcgccgcca gccagccgtt gccgaatctg 1021 tggttgcgca gattaaagat ggcaatcgtt caatcattgg cttaatgatt gaaagtaata 1081 ttcatgaggg taaccagtct tccgaacagc cgcgcagcga aatgaagtat ggcgtttccg 1141 tcaccgatgc ttgtattagc tgggagatga ccgatgccct gttacgtgaa attcataaag 1201 atttgagcgg ccagctggcg gtgcgcgtcg cataa // LOCUS TRBANTAT 18598 bp ds-DNA INV 28-JUN-1990 DEFINITION T.brucei expression site variant specific antigen 4 - 7 genes, complete cds. ACCESSION M33720 KEYWORDS expression site variant specific antigen. SOURCE T.brucei (strain EATRO 1125), cDNA to mRNA, clone AnTat 1.3A. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 18598) AUTHORS Pays,E., Tebabi,P., Pays,A., Coquelet,H., Revelard,P., Salmon,D. and Steinert,M. TITLE The genes and transcripts of an antigen gene expression site from T.brucei JOURNAL Cell 57, 835-845 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Pays, 20-JAN-1989. FEATURES from to/span description pept 1174 2196 expression site variant specific antigen 7 pept 3528 4733 expression site variant specific antigen 6 pept 8519 9772 expression site variant specific antigen 5 pept 14450 18157 expression site variant specific antigen 4 mRNA 986 2676 ESAG-7 mRNA mRNA 3483 5056 ESAG-6 mRNA mRNA 8260 11304 ESAG-5 mRNA trns 5305 5808 rime retroposon BASE COUNT 5298 a 3468 c 4707 g 5125 t ORIGIN 1 gcatgcgaag agcaggggtg caacggaaga gtgaaaggat tagaggaaat agtggaatta 61 gcatgattgg agattgtgag ggttaggaaa aacgtgacga gggaaatggt aactggacat 121 cgtgcgcgag gatggcgaag gcttgaagcg tgtagagagc gtgtgtaaat ggcccaaacg 181 gtgtgggtgg ggtaataacg cagagtgctg aggtgtttga caagttcccg cgaggtgctg 241 gggaggagga agcaagacag taaaatcgtc gtgagacgtg gctgtgctcg ggaatccgta 301 caaatatact gaaaattatt gatggttgat atgcgtggga acgacgcagg atgatgctgc 361 atcgccaaga tgcttcggga gaacaggaaa ataacgacga atcaacactg gtgagttgat 421 actataaaaa tcgcgtgggc tttgctgctt cgctttcata aatgccacct gcaatgaatt 481 tggagggtgg aggtgagctg ccctgtgtat gggtgggtat tgcagatgca gcggcgcttt 541 tgggtgattt ttggtttaca gtttgctgtt gttgggaagg ttggtggtgg caacaaaatt 601 ggagaaaatg atttcatctg gaaacctact tcaaaacccc ataaacaatc ggacctcgaa 661 gtgcaagtgt ctcccagaac gtaaacaagg ccttgaagac gaccagcaga tgaagcacag 721 caacagcgac agaggtggac tgagacgaaa gtataccata tgactatgcc acaagaattg 781 atgggtgatt ctgccaacca cggacgcagt caaaaacata gaaggagctg tagtgaatat 841 aaaaccactc atattttcta caaatattag agctttatta cttaagaacc tgttttagca 901 ccaaggttgc gtgtgtattg ccgttgctgt tacttattgt tgagttggta tttgaggaaa 961 atgttttttc aggtgcgtcg ttgttatata cggctgtgag ttggtgatgt ttatttgaat 1021 gatagcttga taggataaga tgaaataata ctaagcatgt attcttctgg cgtgtcgccg 1081 aaatggtaat ctatgttacc catagcacga aaaaattttt ttactcagta acgaagcgga 1141 aagttttaaa aagcataatt taaaaaagtg ataatgatga agttttggtt tgtgctgttg 1201 gctcttttgg gaaaagaaac acatgcatat tatgaaaata aaaggaatgc attaaatgca 1261 accgccgcta ataaagtgtg tgggctatcg acctatctta aaggaatagc gcacagagta 1321 aacagcgaga gtgccgtggt tacggaaaaa ctatcagatt tgaaaatgag aagcatccag 1381 ttgcagttat cagtaatgcg gaacagagtg ccttctggcg agcaggattg taaagacatc 1441 aggacactct tgaaaacagt attgaggaat gagtttacat tccagcagga gttggaggaa 1501 atgaggaacg catccgcgtt agcagcagct gcagctggga tagcagctgg aagactggaa 1561 gagtggattt ttgtatttgc acaggcagca ggcgggtcat cacagttttg cataagtgtg 1621 gggaagcata tcccggctga gcacggcaac ttgcaagaat gttttgatgg aataattgga 1681 cctgaaaccc tttacaaaat tgaggattca cgtgtaaaag agtcggcgca gaaaagcttg 1741 caactccatg aagtgttatc atccatttcc ttcagtagct tgggtgcgga aagtattgtt 1801 gagcaaggag agaacagagg atgtaaccta atgcggacgg ctgacggagg tttgctgaaa 1861 gatatttgtt tgaattgcaa cttcacatgg ggtggtggag tgatgaattt cggatcctgt 1921 gtggcgggga acctaaaaat aaaaggggga gaatacggtg atgtcagttc ccacgatgtg 1981 gtacggtgga ccgaagatcc cagtaaagtg agcatattta aggatgtcat ccgcctgttt 2041 gcgcgcttcc aagaagcaaa aaatgcagtg atgaataaaa taaaaactac tgtggatgaa 2101 ttggcgaaat gcatagggca gaaagaggtt gagctcacta atgatcagct ttacgaggaa 2161 ttcgaggcaa tacaaaagta cctagggtct ctgtgaaatg gagtaaaagc gaattcaact 2221 atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg gcagcagtac 2281 atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga gtgctgtgaa 2341 ggaaggatgc gacagaagct gcgctgctta gcgtgaaaga ttatggtaat ggagggttgt 2401 gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg ggcaaaagta 2461 acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa ggaaagatgt 2521 gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg agatgtaagg 2581 ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa tttagttaca 2641 acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt ctactgttta 2701 atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc gaatttcact 2761 tctccagaat ctttatttta gtgcgctata gatgatagtc caaagagaag gaattacaat 2821 tattataacc catccaaagc tatgtcagag actcctcttc tccgagtgat tcttttctat 2881 gtcacttgtg gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg aaatggggcg 2941 tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt atgatgcata 3001 gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta ggtggaagcg 3061 gctaatgatt tgtcggcacc accaacagag gaagttaatt gttgacgtag ccgtttgtga 3121 atgagataga aggttattgg gtggaacatt atcttggaaa gagtttttct tccactaaaa 3181 atgtgtagag tggtggtttg gagaaaatgg tttagatatt gaggaggaaa caaggatttg 3241 aggtgcaagt tgatgcgctg taagatttgt attgccgttg ctgttactta ttgttgagtt 3301 ggtatttgag gaaaatgttt tttcaggtgt gtcgttgtta tatacggctg tgagttggtg 3361 atgtttattt gaatgatagc ttgataggat aagatgaaat aatactaagc atgtattctt 3421 ctggcgtgtc gccgaaatgg taatctatgt tacccatagc acgaaaaaat ttttttactc 3481 agtaacgaag cggaaagttt taaaaagcat aatttaaaaa agtgataatg atgaagtttt 3541 ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa aataaaagga 3601 atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgacctat cttaaaggaa 3661 tagcgcacag agtaaacagc gagagtgccg tggttacgga aaaactatca gatttgaaaa 3721 tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct ggcgagcagg 3781 attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt acattccagc 3841 aggagttgga ggaaatgagg aacgcatccg cgttagcagc agctgcagct gggatagcag 3901 ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg tcatcacagt 3961 tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa gaatgttttg 4021 atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta aaagagtcgg 4081 cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat agcttgggtg 4141 cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga acagatactg 4201 acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt ggtggagtga 4261 tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa tacggtgatg 4321 ctacagagta cggtgcagta cggtggaccg aagatcccag taaagtgagc atatttaagg 4381 atgtcatccg cctgtttgcg cgcttccaag aagcaaaaaa tgaagtgatg aataaaataa 4441 aaactactgt ggatgaattg gcgaaatgca tagggcagaa agaggttgag ctcactgatg 4501 atcagcttta cgaggaattt atctgggaga ccatacacag attggagctg tcaaagagag 4561 tgagtgaaca actctcttta ggtgaagaag aggagacgat cctgaaatct aactatactg 4621 cagaaccggt ccgtggacct ttcacggtag cggggtccaa cgcggcagca gtacatttga 4681 gtgtttctac cgctgcactt tgtttttcag ttttattgtt gggagtgctg tgaaggaagg 4741 atgcgacaga agctgcgctg cttagcgtga aagattatgg taatggaggg ttgtgaaaga 4801 ttgggggaac aaaaacctat ttcttttatt tgggggaaca aatgggcaaa agtaacgtaa 4861 gtttccagtg ggagtggtat gtgtgtgtgt atggggctgg ctaaggaaag atgtgagttc 4921 ggcatgtggt atgtacaagc tacgaaaacg tgtgaaacaa aacgagatgt aaggggaaaa 4981 tgtaacaacc aactatgtta aatttcagga gactattttt caaatttagt tacaacaaag 5041 taaatgtcaa ataatgccaa ctataaagga aaactagact aattctactg tttaatgtga 5101 cgagaattgt ggcgtgaaac ctagtaactg cttatcatgg cgtcgaattt cacttctcca 5161 gaatctttat tttagtgcgc tatagatgat agtccaaaga gaaggaatta caattattat 5221 aacccatcca aagctatgtc agagactcct cttctccgag tgattctttt ctatgtctct 5281 tgtggaaggt tgatattttt gctaccccct ggcgatgccg gccacctcaa cgtggtgcca 5341 gggtccagtg ccccgtatca ttgggggaag ccaagagcca gcagcgttcc tttcatgggg 5401 aacactgctg tgctccggct acggcatcat acagcacagg gatcagcagc gtcttgctgg 5461 gacaccgttt ttcatttgtc ggtccctggg cacgtgccag cgtgccatca gcagtatcat 5521 ccgcgctaag atgctgctgt ccggtgatgt ggacctccaa aaaaggattg ccatttggca 5581 tcttttggag agtccagggt gggaggcttc tcgccccatc tgctgtattc cgttcaactg 5641 cggagctaca acaaaaaatt atagagggtg tgttaggatg aatgaaaaag ggagactctg 5701 ccacagtcgc cagaccgata gcatctcagg gctctacggt gttggctgat ggccgcgcca 5761 gaggggggga aactctcacg aaggcacgaa gaaaattcaa aaaaaaattt ttgctaccac 5821 aggctgtaag gataacgctg gagatggagc cgactgtggt ttttcttgtg aatgcgtatc 5881 ttcgtaaccg gttgttaaga agggatcgga cgaaggaatg tgtctgagta ggaggtaatg 5941 tcgatgtttt ctttccattg gatgtagtaa taatctccaa aatttgttgg aaacaatata 6001 atattattgc aataaatact tgaaaagtta aattattgat ggattgactt gagtatcatg 6061 attccaatga acactctcat gatgcttaga atagcaaacg tatgaacaat caacgtagcg 6121 tcatgtagac atggacattt tttgtaatga tacgactgct attgcacatc tgaaaatttc 6181 tatgacggac cagtgccgtt aatcaccgta caattgaatt caggatgaaa aatgataaat 6241 cagtgctcta atgtatgatg cagctgtcaa ggcaatgttt ctgcacaaaa agagtttaac 6301 acactgccat ttgcatgcga ataaaacaat gagtatatga agcggaggat cgcatggcaa 6361 tgataaaatc gaacactgtg gctgcgagac gcagggccac ccggaagcac cgggtaacaa 6421 agcgagttca ggagtcgtcg aaagtggcaa aagtcagcac caaaccaagt tgctttttta 6481 aaaagatcag caaacgtatt tctcactgac ttatacagca ttcaaaggaa atgaacttag 6541 cagcaggaca agggtgcaaa ctaaggcatc gattctgtat ggctgcaaaa gtgaagacat 6601 caagcaaaca aaggtaactt atttaccctt ataggagcag tggaaaaagc agcggactcg 6661 tatggagata acctgacgaa aatactacca gttttagtca caggaagcga tatcctgtca 6721 aaatgccggc agccactcaa tggcggtcaa aacttgatta aacagttgaa aaaaacagcg 6781 gaatcaaaag gaaccttgcc cgtgctcacg gtggtagata aagcaacgaa aaatgtaaag 6841 gtagaattta acggtaacaa ttcgaacaaa accgtactat cgtgcaaaaa tatttcagca 6901 caaaatcccg acaaactttg cggcgacaac acagcagaca acatggccaa gtgcacgaaa 6961 gcttccaaag cagcttttgg cgctgatggt gtatatttca aaagaaaaaa gggcaaacct 7021 tgactcagca gctataataa agtcagctag aaccggcgaa acagcggaag cgctgagcaa 7081 acttaaaaca ctggacatta tcggtgattt tatattgcgt ttatttgtcg aaaagctcat 7141 ggacatgacc gtggatgagt acacgactca aagagagact aacgacaaaa aacgctgcag 7201 tggcaaaagt agcgtgcctg tacaggggac ggagaacttg ataataatgg taatattaat 7261 cttagaagaa tgttgtgagt atgtgtcact tttacagtga tagttgtgaa atattcggaa 7321 tgacagttgt aatggaggtg aaagaagctg ttaggttgaa tttggggaac catctatgta 7381 tacaaagaag cacaagggtt gtgatagctt atttatatga tggagaagcc atgcatggta 7441 aaaaaaaaca acaagtgata tgtcggtgtt tgtggctaat gtagcataaa gactaacagc 7501 tacagcagtg catcaaccga tggagagacg ctttcagaat tttcaaatgt aagaaagtca 7561 gtaaattaca aagaagtggt tgagttgata ctcagaaaag tgggactaac aaactgttga 7621 tgaatgaagg tgatgccata cgtaccaggt acacaggtca gcaatatgat gctcactgaa 7681 aaagagagat ggtgtgctac tggcgggatg tcgctggaaa agtcttagca agtgtagttg 7741 cagcaccgag cacacacact catgggagtt atcaaagtat gaaaacgcaa aggaagctga 7801 tgtgtgtgtg tgtgtttttg tgtgtgtttg caggaaacaa atgtatcagc ttcgacatct 7861 ttccgcaaga agtagaagtt tcagttgtgg tcactgacgc attacgtgga ttgttgggtt 7921 gaaacaacac aaatgtggca accatttagg atgttttggc gctttggcga cggttgtact 7981 gcagtttgta tgccccttca agaaacaagg aatggtgtga aagcgttgtt tttttttaag 8041 tcaagtatgt gcattcaaga aatcatatca cagtgttacg cagtgctttg tcgtcatggg 8101 aggaccttcc tgcattgaga gtgcatgtaa ggaaacggca acaggatgct atggctccga 8161 atttttttta agtatctgct tttcagacca cttgcgttga tcgacttcag ttatgttaca 8221 ctccgcacat cttttttgct atgttttttt ctactttagt tgcacttagt gtagccaaca 8281 ttaacttaaa gtacaaactc tatcctaagc aggattcggt gttatttcta ttcgacgatg 8341 tcattggtat cgagcgcacc cgttttggtg ttcatttccg ttgtaacact tctgtgttgt 8401 atcatcggct ctgggcaacg gtctagtagt ccctttacgg aaccggcact caaaccaggg 8461 ggcattaagg ttgctattca agaggcggct gttacgccat tgttaccagt acttgcgtat 8521 gagttcgaaa ggtttatgga gaatatcacc attcccgagc aaaagggtta accgtgtgag 8581 tgtagaagag acttattttc gtaacgtgac ggtcggaagc gcgacagtta aattcgaaga 8641 accaaataaa atcattttga agttttggaa cgtgagcgct actgtgccgt ttacgcggtt 8701 tgtataccac tcgttttggt gttacctgta tccttgcagt gggtcggcac aggcggaaat 8761 acgcaacggc agtgtggcac tgtggctcga cgtgtccgca ggaagggggg gactactaga 8821 catacatgta gggagttcgg aaattggcat gcgtgaccca ctgataactc ttatcggtga 8881 agggaaatca aagatgccta agtggcttgg cgggagagtt aaagatatgt atgataagga 8941 tgtgcttccc aagcttgggc atcatataat tactgcggtg aatcaaattt tagccaacaa 9001 agcaaaggag atatttcata tgtttccaat tgtctttgtc aacagttcta aaattgaata 9061 tggacaaatg agacttgaac tggtcgtgct tccagatgcc gcagacaaat taatgcttac 9121 ggagaatgtt tttttcccgc ggcacgcttt tccaaacttt cctgttgctg ttgtttcttc 9181 gtttacggct ttgaataata tgcttcgttt gatgattaaa agtggccact tggtggtgcg 9241 tgtgcccttt ccgctgaggt atgttatctc cagcaacgcg ctcagcgtca gttggaccgt 9301 cttttgctct ggatgtgcct ctgaggctac attcgaacta aaaacagccc cgtggttgaa 9361 gtccctaaat aagaaactgt ttacatttaa ctatcgggat gtcgaagttg ccgtggacct 9421 gctgccaagg ggtggtgccc ccatttcctt gtttagtatg ttaatgaatg tgtctgccca 9481 ggcggcgcat atcgctcttg ttgacggtgc cacccatgcg aatttggatt ctgttgatac 9541 taatgtgagt gtcacatcct cacgtatcga tggcctggac tcaagtacaa tgaacacaaa 9601 aattcgtgat ttgatcaact tgtcttggat caacctgaat gttacatata catttcctgc 9661 tcccttcgac ttgtgtacga aacacgtgaa tattacaagt gaatgctatg tagcgggatt 9721 taaccttgtg cgagctcttg gttcactcag tatactccct cacctccgat gaaggattgc 9781 gacattcctg gtcactttgt aaaacacact tccatgcgct tatgtggaag ggggtaacac 9841 tcaaacctat cttgctttcc cctgtttacg cacaacttca catacgcact ttctctttta 9901 cgagatgttt gcatgccgca acaaccaccc tgcgcaatcg acgagaagct atccgtgttt 9961 acgggtgatc cttcgaatta tggcggagga gtaaaaacct catgatatgt gaagtgctgt 10021 ggctgaagaa ggaaataaga taaaaagggt atttttcatt gacccgacca ttactggttt 10081 tgactttttg cccgaggcga gaaggcaaat ccattgttga ttggctcatg aatgtttccg 10141 tgactatacg cgccaagcct tggatatgtc gtgggctttc ggcatatgtt gataatataa 10201 cgactgccag ttggctgagt gacggctcaa agtgttagag gcgagtgtca atggaagata 10261 tatctattga tttggtaaat aatattggat tcgagattaa ctgttcaagt gtgctacaac 10321 accagatagc atgttcgtag tgtaggggtg gcaatagaag tgatggattt atcacgacta 10381 attgttctat tggcaacgct tgtgtctcgt gactgctatt ggttgattgc tgatgtttcg 10441 atttgagttg tttcgtggcg gcacgaaaag tatcatcatg tgaagtaggt taaggggaaa 10501 agaaacagat atataaagag cctttcgtgt tgctcccaag gaattgtata aatctctgag 10561 cgagatttgt gctcggtaat gggttcaaga gtgtgtatat acaaatacta tattaagagt 10621 ggtgattgag accaagtgga gggtcgaaag agatccactt aaagccatgg aaagtgtgcg 10681 gttgtagagg gtagaaaact aagaaaaggc aaatgataaa tagtattata ggaaatgcag 10741 agtgatagca agagagaggg agatggagta accaatggtt taaaaacaag catgacgatg 10801 ggcagtatta tacccataag gatcaaagag ggtcaaaaaa atacaaaatg ggttcagtgc 10861 acaagagatg tgaagatgct acaactgata cggagaagcc tagaatgtga aaattcagaa 10921 aataatacta tatgaaaata gtgagagtga caactcacaa tacaaaacaa actgatacgg 10981 gagaagatta tgctaaacat aaaagtgaaa aaagctagaa tataagctga taaaacacca 11041 taaactacat tgatttgagt ttcatgatat aaaacgtgga tgctctattc agcagaggat 11101 tctagtaaat tttttaatat atcttaaagt aatagtgaac gtagaataaa tttctgttcc 11161 aatacacaga cgagaaaaat aaagaattga acgggtagca aatttaagta acttgcgaca 11221 atatttttgc tgaagctaac ttctaaaaca aaaatgcaaa tacgtgttca aaaatatagt 11281 gaggaggata atcttttaat tacaaacgca tacatcatga taatttcaag gcattgtttg 11341 catatgtaaa cattgcttta ttaactgagt gcagttttct ggtaacctat ttcttttcat 11401 agttttcttt cataaccctc tctaatattt taaggtatat taggaaatga aatttgttac 11461 cttgtgaact ttactattac caattatatc acatttagtg tgtttaagtg caatatacct 11521 tcaagatgaa gaagtaagtg acatgacaaa tatgacacat aatggtacta agaaacccag 11581 agactttgtg gttgtcgttc aaactcatgc aacacaagga tggtgtaggc tcatattgag 11641 tgcgaaagta agccatgtgg aaattattac aattgaaatg ggaggtttgt acagtcatgt 11701 caaaaggcct aaatgggtta tggagttttt agagaaacag aatgcaagtg atgaggatat 11761 tttaatagct aaggatggat ctgatgttat agttggtgat ggaaataagt atgagagttc 11821 tgtggaatat tttgtgtaaa aaagcaccct acagtaagga caaattttat gaagaggata 11881 ttcagaaaaa ggatataggg taatgttatt gatttttgag atggccacaa aagtggagga 11941 aaagaaaagt ctaccaactc tactaaaatg gcctttaact caaaaagcat acttaaatgg 12001 aggtggctta attgccagag tttgggcatt tagaaggttt gaatatgcat ttggagagtt 12061 actaaaggga agtgaggagt ggtggtcaga gcggagtatt tacaaacctt tgttaatatg 12121 gagtgcaatt caggaagaag ctgttggaca gagatttgtt ttgaagagag gaatgattgg 12181 attggattac gaagagagat ttttttctat agaaagtagt ggggtaattg gagaagcccc 12241 attcattcat tttcctggac agcctattgc ttgggaggag aaagcgagac tgattgtgaa 12301 aaatcttagt tgatataaaa ggttgaaagg aagtgaattt aattcgaagg atatggggaa 12361 aagtagaaac ttatgtagta gatggagaga attgggaatt tcgatatgaa agaatttgtg 12421 gggacgcggt caaggagaat gacttgttta aggcaaagaa actgtaaggg atacaaaact 12481 aaagattttc ttgcggctga gtcagtagtc taagcaaatg tgggaccaac acgcatggcc 12541 aagatggccg cacaagtcta tgttgacatg aaaatcctgg ggcagcaaca aagagaagct 12601 ggaagacaaa cctcaggggt accctaacta ttcggctgaa atggtatgtc gtgacggcaa 12661 tatcgatatt ggtttcgggg gtggtaacac gatggacgca aaatttagaa gcttctatta 12721 cctctccata aaccatgatc tcaaagctag agaagcacaa aacaataaca caactagtga 12781 cactatgcgt tttttatgca cctgcacaga agaagtaacc gacgcttgga ccgaagatgc 12841 gaatacatat tcggcatctg cagtataaaa ggtaattcag ctgctagaag gaaaaaattc 12901 ctgaagaccc cgctgtgcca gagatatgta aacaactgtt tgaaatacat gcactcaata 12961 tgtcactatc agacaagacc tggcaggaca agttcataaa ggctcaacga agagccactg 13021 cagtggggtg acgaagccaa caaacataag accactaata cggagtggca cgaagtgtgg 13081 cagcgctggg tctaagccag gcaaagacta aaaaataacg aatactgagg gcgataaagt 13141 aaggaaaaat gccgcctcta aatggcaaag ttcaaaagcg gcacaggccc gagtcaagct 13201 aaactctttg ctgatccatg cggaagcctt gaacgctgag ctggaaaaaa agggcaaata 13261 atggctaacg gcaaaaaaca atagaagtgg acgttcttgc agccctctac ggaacgccca 13321 cggcaaaagc agcgtttggg agcagtggct tattgtcatc atcatttttt tgttgttttc 13381 gccactagtg gttttctttg tttgcgtatt tgatatctcc ttcgtcacta ttcttatctt 13441 tatcgtctgg aatatgggtg agatatcatt cccggtgtgt tttgcagtca tatggcgtaa 13501 catccacctg tgttgatatt ctttacagca tatcttacat gcttctcttt ccaacagtgc 13561 ttcctgtagt ggtgtgcttg gctccccttt actcttctcc ggtatgcact tatatctggt 13621 gagccaggtt ttgcttttta tagacctctt ggaagcagag acatttgtat gcttctttta 13681 ctttctcatg ttcaacttcg ctgcttattt tcctgcattt ccgtttgttg tgtcacttca 13741 gtttgctttt ctacgatgtt taaggcgtat atgttcctag tcaactagcc atacgtggca 13801 tactgctttg ggcaacattg ccacttgtag acttttttac cgaattatct tcttctcgct 13861 tccttcctct tttgacgctc tgtccttact actggagttt taggaggctg ttgtggcatg 13921 tgccttgtgc ttcttttatg accaacaagt gttattttta cagtcgtttt tgcactggtc 13981 gtgtgtggcg aaaaatacgc ccccgctcct ccaaccaatt agtggcaaag gggcggctcc 14041 gttgaggcag cactgcttgt cctcttaatg gtacattgtt gttttaccta tcgctggttc 14101 actccaccgc acgcggtgca taccttggga gttctgtact gttttggggg tggttttgtg 14161 cattgtaaca gccgcctctg acacaacaag cacagtttca cgcctgcgtt actggcattt 14221 tatgcctgtg gtggaacaac gtgtgtgttg acgtccgtaa atcttaatat ggcttgttgc 14281 gtgctaccgt ttatattttt cccccttcga tatgggcatc ccgttcttgt cgttacgtgg 14341 ttcccagttt cacctaacaa cccttgccct ctcttaccca gcagccgctc cctttcgttc 14401 gttccagtgt aacaaccact ttcacgcgta gactatcaaa atcactgcga tgaatatgct 14461 tcacttgagc gaccgcaatg cctcactcgc accgagtggc ggggaacatt cccttccgac 14521 gggaggagct gtgtgccgtg acgccatgga catacttccc gtgatactga gagctccggt 14581 cgcgttgtta cttctgctcg tcgtcctgcc gcaattgtca gtgggcgccg aggctaatgt 14641 gacagtcaag gttctctccg ccacgtggaa ctggtacatg ccgcggaagt atgtaaccgc 14701 gattaatgct ggcttcaacg cctcactgaa gtcacgaaac tggacagtcg ctggaagcgt 14761 caacgttcag gtagtttacc ccagcaacct tgacctgatg cccgaagatt tcattaaaaa 14821 gcagttggaa ttggaaactg accagaataa gattgttata gtgtatggac ccctcggtga 14881 caaaagtgta atgcactcga ttccacattt gatgaatcac agagtagtgg cctttgggtt 14941 gataactggg tccactttta tccgacaatg gaatccgtac ctttactttc tgcgtgccga 15001 ccccgctgct gaaacattgg ttctcattcg gtattcgcta tgccagttgc gcgtgctgcg 15061 cttgggcttc atgtaccttc agggtgttca ttatggcgac gaggagtatg cattgacagt 15121 gaacgtgatg tctcggatgg gttatgaact gcatggtgtg ttcaccgtga tgagtccaga 15181 cggtaaaccc gctcctgatg ctgagtttaa agaggtgttt gagcgcttcg ccactgccct 15241 tcctcaggcc attattgtct ttggtgcgcc agtggatgat accgcgaagt tcttgatgat 15301 gatggcagtg gatgagcgaa tcgctcggtc gtatattctt agtccttcat ctgtccagtt 15361 gtcactcata gaaatgtggc aactcgctct ggaagctgca ggggcctcat tcgcgcccgg 15421 acagctactt ttcacaggca ccaatccgct tgcgaaggac agccagtaca cagcaatcaa 15481 gcgttttcaa gaagttatga gtgaatacct aaaggctcac gtcggcgaaa ccaacatcac 15541 tgaagcagat tatttcctga cccacgatct tgagggagag ttgatggtgt atgggtggat 15601 ttctggggag gtgttatctc aggcacttag caaccttgaa tggctcaagg accgcgctac 15661 atttgtcagg tcgctatacg accaacggcg ctacgtcatc aatgacattg tgatcggtga 15721 ctatggcggt acgtgtgagg gggacgcggc caagcatggc gccacctgtg agtgtaacca 15781 aggcagcaag gcggtgtatg tgaaagagat gttggaaaat gggcagaaga cgtcggtgcg 15841 aagtggcttc acagttctga aagcttcgct gtgctatacg gattcttcgg agcttcacgg 15901 gccactggat ggactcgtcg ttttcatgaa agacgacgac atagcgtcga aagcagctgc 15961 gctgtggcag aaggggacct cccatcttgt cggcaagggt gatttgggat actcggacag 16021 gttctttctg cacgcattta atactaccat agctgaggca gcgaacgatc ttcggcgtga 16081 ccaaggtgaa agaatcgtaa ccgctgtgtt tggccccgtg accgaggcga tgttggacac 16141 gccgaatata acctttatcg atcctctgga acttaaacca cggctgaata agttcagaag 16201 gaatgtgatt catctctcgc ccaccctgga acaacaactt tacgtccttt catcgtacct 16261 cgcgggcgcc ggggtcggta atgttgatgc cgtgatctgc agcaacgaag cggatggaat 16321 tgcagatttt ctgcggagtt cactaactga attcgctgtc tcgctacgtt ccgctgtcat 16381 acgcgaagat ggggaagatg ttggtaaata ccttccgatg agtgggaccg tgtttgtcat 16441 tggcctttcc gttccagatg tgaaggaaat cgcgaggaaa cttgaagaac gcaacgattt 16501 gcgtgtgatt gttctcttcg gggaattttc attcctgtat gatctgtttg ccacggccct 16561 caataacacc gcaggcgccg cacgtcttgt gtttgccacc agcttgccac attggggtga 16621 tactgaaacc tcatcgaaga ctgctcaact gttccatgat gttgaaaaag actcacggtt 16681 atggacaccc ctatcagttc tggcattcgc aaccggtcga ttgatgcggg taatacttct 16741 tcatgtagaa gaaatgagtc cagaaacgtt ggtaaatttc ttctataccg attcttccat 16801 cgtctccgat gacatgcgct acggtgtatt cgacgacaca aaatgcgtcg acaccgcaaa 16861 taaactttcg aagaatggtt gtgcctcaaa ctatggtgca acgcagatat cggtgtggtc 16921 tatggcccgt gctctgaacg cttctattcc tccgcttaca aatcccatga caccgtcaat 16981 gactttcaga aactctaacg cgggaaggat ctccggggcg tcgctggtgg gcatcatcat 17041 cggtggtgcc cttgctttgt ttcttgtggt ggctctgggc gtggttccct actttttcct 17101 gcgcaacacc gtgataacaa tctgcaccaa agatgacaga cccgtaacgc taatatttac 17161 tgacattgag agcagtacgg cgttgtgggc tgcacaccct gaggtgatgc ctgatgccgt 17221 tgcgacacat caccgcttaa ttcgcacatt gatttccaag tatgaatgct acgaagtcaa 17281 gactgttgga gattctttta tgatagcgag taagagtcct ttcgctgccg tccaactcgc 17341 acaggaacta cagctgtgtt tcttgcacca tgactgggga acaaatgcga ttgatgaatc 17401 ctaccagcag tttgagcagc agcgtgcgga agatgacagt gattacacac cgccaactgc 17461 tcggttggac cctaaagtgt acagtcgttt gtggaatggc cttcgagtgc gtgttggaat 17521 ccacaccggg ctgtgcgaca tccgacgcga tgaagtaacg aagggatatg actattatgg 17581 acgtacatcc aatatggcag caaggacgga gagtgtagca aatggtggtc aggtgctgat 17641 gacgcatgcg gcgtacatgt cgctgtcagc tgaggagcgt cagcaaattg atgtcactgc 17701 acttggtgat gtgccgctac gtggtgtgcc aaagcctgtg gaaatgtacc ggttgaatgc 17761 cgtgcctggt cgtacctttt ccgttctccg tcttgagctt gagcttctta acgatgatga 17821 ggatcagact acaacttcct gtagtgacca cagttcttca cggacagatc tgagtgtagc 17881 tgcacaaact attgctgctt cccttcagtc gctcttgggt accttcaccc ctgctcagcg 17941 ccagaaggca ttaataccct tttgcgaacg gtggcgtgtt ccgcttcctc aaaaggtcgg 18001 gaacgtgtgg gacgatgacg gttgtcagga agttgtacgg cgtgttgctg caaaggtggg 18061 ccgtgttatg gattttggaa cgaggaagcc ctccagcagt gtgacatcgt ggaaaggggt 18121 ggaagtgtct tctcaggtgg aggagcggct gctataatgc ttgcttcatc ctcaagttcc 18181 tcctgcgttg atgggcattg tgttaccgta caattgattg acttggaaaa tgacagcgct 18241 accacctaat attaccctca tatcattgct gtatttgcgc tctgctattg aagtgaatga 18301 atgagttatt ttatatgatg tgtgcaatct tttctttata ttcaattcgt ataaagtgga 18361 cattataact ttccctcccc attcttgtgc ctgaaatgat cttctcaaac ggtaaatgct 18421 ttaaatacat ttgtacatga agatggtttt tgtttctgac atggctactc tccttttgag 18481 aactgtgttt gaagattgta cattttcttg tgtttgatgt gcctttgaat gtgagtatcc 18541 tcgccacttt tgataatctc cctcgtttaa ataatatatt tattgactga ataagctt // LOCUS BABUGU2A 680 bp ds-DNA PRI 28-JUN-1990 DEFINITION Baboon U2 small nuclear RNA gene, complete cds. ACCESSION M33777 KEYWORDS U2 small nuclear RNA. SOURCE Baboon (P.hamadryas) DNA, clone lambda-Pu2. ORGANISM Papio hamadryas Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Cercopithecidae; Cercopithecinae. REFERENCE 1 (bases 1 to 680) AUTHORS Matera,A.G., Weiner,A.M. and Schmid,C.W. TITLE Structure and evolution of the U2 snRNA multigene family in primates: Gene amplification under natural selection JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.G.Matera, 20-APR-1990. Author address: A.G.Matera Dept. of Chemistry University of California Davis, CA 95616 FEATURES from to/span description uRNA 191 378 U2 small nuclear RNA signal 99 114 upstream control element I signal 136 157 upstream control element II signal 379 420 3' processing box BASE COUNT 148 a 159 c 224 g 149 t ORIGIN 1 ggatccaagc acgggtgtaa gatttccctt ttcaaaggtg gagaataaga aaggagcccg 61 agtgtgtaac ggcgtccata gtggggtgga cgagacagag gggatggggc aaggagcgag 121 ggtggggctc tcaccgcgac ttgaatgtgg atgagagtgg gacggtgacg gcgggcgcga 181 aggcgagcgg atcgcttctc ggccttttgg ctaagatcaa gtgtagtatc tgttcttatc 241 agtttaatat ctgatacgtc ctctatccga ggacaatata ttaaatggat ttttggagca 301 gggagatgga ataggagctt gctccgtcca ctccacgcat cgacctggta ttgcagtacc 361 tccaggaacg gtgcaccccc tccggggata caaccgtttg ctaaaagtag agggaggtga 421 gagacgacaa caccggcggg gcggctcgca cgccaagcgt ctgtgactac gtcttgccgg 481 ctcaactgct tcgttggagt accacggggt tcctgagctg cgggcggcag acggtaggct 541 tcgggcacgc ttccatttac gctgtgactg cagcgctttg ggaaggctac gaccccccgc 601 agccgaccga ctcctttgga ggtcgttagg atctcagctt ggcagtcgag tggtggtgac 661 cttttaaagg aatgggatcc // LOCUS BOVLMCB 326 bp ds-DNA SYN 28-JUN-1990 DEFINITION Synthetic bovine liver microsomal lipase-solubilized cytochrome b-5 gene, complete cds. ACCESSION J02905 KEYWORDS cytochrome b-5. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 326) AUTHORS Funk,W.D., Lo,T.P., Mauk,M.R., Brayer,G.D., MacGillivray,R.T.A. and Mauk,A.G. TITLE Mutagenic, electrochemical, and crystallographic investigation of the cytochrome b-5 oxidation-reduction equilibrium: Involvement of asparagine-57, serine-64, and heme propionate-7 JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by G.Mauk, 16-MAY-1990. FEATURES from to/span description pept 27 311 microsomal cytochrome b-5 binding 16 19 ribosome binding site BASE COUNT 106 a 82 c 77 g 61 t ORIGIN 1 gtacggtacc gtaaaggaga acaacaatga gcaaagctgt caaatactac actctggaac 61 aaatccaaaa acacaacaac tcgaaatcga cgtggctgat cctcgactac aaagtatacg 121 acctgactaa attcctggaa gagcacccgg gaggcgaaga agtcctgcgc gaacaggccg 181 gcggtgacgc gactgaaaac tttgaggatg ttggccatag taccgacgct cgtgaactgt 241 cgaaaacgtt catcatcggt gagctgcacc cggacgatcg ttctaaaatc accaaaccgt 301 ccgaatctta atgagaattc gtctga // LOCUS DROEST6P 5424 bp ds-DNA INV 28-JUN-1990 DEFINITION D.melanogaster carboxylesterase 6 and P (Est-6 and Est-P) genes, complete cds. ACCESSION M33780 M33781 KEYWORDS carboxylesterase. SOURCE D.melanogaster (strain Dm145) adult DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 929 to 4027) AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and Richmond,R.C. TITLE Molecular analysis of duplicated esterase genes in Drosophila melanogaster JOURNAL Mol. Biol. Evol. 7, 9-28 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 5424) AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and Richmond,R.C. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.C.Richmond, 19-APR-1990. FEATURES from to/span description pept 1173 2559 carboxylesterase 6 (Est-6) precursor (EC 3.1.1.1), exon 1 2611 2858 carboxylesterase 6 precursor, exon 2 sigp 1173 1235 carboxylesterase 6 signal peptide matp 1236 2559 carboxylesterase 6 2611 2855 carboxylesterase 6 pept 3052 4438 carboxylesterase P (Est-P) precursor (EC 3.1.1.1), exon 1 4495 4742 carboxylesterase P precursor, exon 2 sigp 3052 3111 carboxylesterase P signal peptide matp 3112 4438 carboxylesterase P 4495 4739 carboxylesterase P pre-msg 1132 2867 Est-6 mRNA and intron (alt.) pre-msg 1132 2998 Est-6 mRNA and intron (alt.) IVS 2560 2610 Est-6 intron A IVS 4439 4494 Est-P intron A signal 1104 1110 TATA box BASE COUNT 1536 a 1117 c 1235 g 1536 t ORIGIN Chromosome 3L 69A1-5. 1 aagcttgcta tatatctatc tgtatatacc tatttgctta gaccgaactt aatgattttc 61 cctttttcca aaacttacac aaatatatac ctaatacctt acactgaaaa aactgttcaa 121 ttcactatgt acttattagt agttcttata aaccttgtta ttactaatta cgtacgaaaa 181 cagaacggga aataaatatc taaaaattta aatagtaaac tgagctgatg tttctacaca 241 aggatgaaca caagtatcaa agtgctcata ggtacagcat ttctagaaca caattacacc 301 agtcttttgc tagcaataaa ttttcactca tctcccactt ttctctagaa ctatcctgaa 361 tttatttatt tgattctatt ttgtcgcaag cacactcaat tcttgatctt tttaataata 421 atcgtagcgg aaagctgtat ctgtacaggc aagaccaccg cccctcaaaa tgccaacgaa 481 ctacagtcgg attcaaaagc gtgcacctgc aagtgcaagt tgtatgatat cttgcaccat 541 tttatttcga tacacgtttt ggtacttcga tttaatatgg ctaattgtta ttcagaaaat 601 aaacacaaaa acttaggaca catgttcaaa aatagaaata tgtacatatt tgttgcaaac 661 cttaaatctt attaattatc tttatctata atcaaaaagc ctactcattt gttaaatggt 721 ttcgttttaa agtgcattat caaattcatc tggataaaca ttcaaacaaa aaattgcatg 781 catgtaaagt aattaatcaa gttgactcga agcgattgtg caatataagt ctcacctgaa 841 gtgggattgg caactgcttg ggatgcatgt cgaaaatcta tattaagccc agtcaaatat 901 tttaagcgta aagtaacaaa cctaatttgt atacggctat cgttttaatt cgcacacgcc 961 tatcaactgg atgatgttca cactagagtt actcccattt gaaagccggg cattggaaaa 1021 ctaatctcat gcgtgcccag atctcaattg agactggttg actggatgtt caggtggccg 1081 ggtggcgata agccgatcga tggaataaaa ggggccgcaa ttgccgcatc tcaagatagt 1141 tgcggtctga attcgccgga gtgaggagca acatgaacta cgtgggactg ggacttatca 1201 ttgtgctgag ctgcctttgg ctcggttcga acgcgagtga tacagatgac cctctgttgg 1261 tgcagctgcc ccagggcaag ctacgtggtc gcgataatgg aagctactac agctacgaat 1321 cgattcccta cgccgaaccg cccactggcg atctacgatt cgaggctcca gagccgtaca 1381 aacaaaagtg gtcggatata ttcgatgcca ccaaaacccc ggtggcgtgc ctgcagtggg 1441 atcagttcac gcctggggcc aacaaattgg taggagagga ggattgccta accgtcagcg 1501 tctacaagcc gaagaatagc aagaggaata gctttccggt ggtggcccac attcacggag 1561 gtgcctttat gttcggtgca gcatggcaaa atggacacga gaacgtgatg cgtgagggca 1621 aattcattct ggtgaagata agctatcgcc tggggccatt gggtttcgtg agcaccggcg 1681 atagggatct tcccggaaac tatggactga aagatcaacg gctggctctc aaatggatta 1741 agcagaatat agccagtttt ggtggagaac cgcagaacgt actgttggtt ggtcactccg 1801 ctggaggagc ttcggtccat ctgcagatgc ttcgtgaaga tttcggccag ctggccaggg 1861 cggcattctc gtttagtgga aatgctctag atccatgggt tatacagaag ggagcaagag 1921 gacgagcctt tgaactggga cgcaacgtgg gatgtgaatc ggctgaagac tcgaccagcc 1981 tgaagaaatg cctaaagtca aagccagcca gtgaattagt caccgccgtc cgtaaattcc 2041 ttatattttc ctatgtgccc tttgctccat ttagtcctgt attggagcca tcggatgctc 2101 cagacgccat tatcacccag gatcccaggg atgtcattaa gagcggaaag ttcggacagg 2161 ttccgtgggc tgtttcctat gtcacagagg atggtggcta caatgccgcc ttgcttttga 2221 aggaacggaa atctggaata gttatcgatg atctaaacga gcgttggctt gagttggcac 2281 catatttact attctaccgg gacacgaaga ccaaaaagga tatggacgac tactcgcgga 2341 aaattaagca ggagtatata ggcaatcaga gatttgacat cgaaagctat tcagaattgc 2401 agcggctatt cacggatatt ctcttcaaga atagcacgca ggagtcattg gatcttcatc 2461 gcaaatatgg aaagagtcct gcctacgctt atgtctatga caatccagcc gaaaaaggaa 2521 tcgcacaggt cctggccaat cgaaccgatt atgattttgg taaggaaatc gtacttttaa 2581 atggacttag ttaaatcatt acttttatag gaactgtaca cggtgacgac tactttttga 2641 tattcgaaaa tttcgtacga gatgtggaaa tgcgtccgga tgagcagata atttcgagaa 2701 attttatcaa tatgctggca gattttgctt cgagtgataa tggctctcta aaatatggtg 2761 aatgcgattt caaagataat gtaggtagtg agaaattcca attattagct atttatattg 2821 atggctgcca gaataggcag catgtggaat ttccgtaagt tacatgaata aaatcaaaaa 2881 tttttcgttc tgtgtaattt ttaattattt aattattctc aactggcttt aaatatcatt 2941 tgtacaaaac gtgtttgtcg ctttatattt tggtttttgt tgtttcttta taaagaataa 3001 taaacgtttg tttacccgca gtttgattag tactttgtgg agttccagat catgagtata 3061 ttcaaacggc tgttgtgcct gactttgctg tggatagcag ctttagaatc tgaagctgat 3121 cccttgattg ttgagataac aaatggaaaa atccgtggca aagataatgg gttgtactac 3181 agctacgaat cgattcccta tgccgagcat ccaactggtg ccctccgttt tgaagcacct 3241 cagccgtata gtcatcattg gactgatgtt ttcaatgcca cgcagtctcc agttgagtgc 3301 atgcagtgga atcagtttat aaacgaaaac aataagctga tgggtgatga ggattgctta 3361 acggtaagca tctataagcc aaagaaaccc aatcggagca gctttcctgt cgtagtactc 3421 ctgcatggag gtgctttcat gttcggtagt ggatccatat atggacacga ctccattatg 3481 cgtgagggaa ctttgcttgt ggtaaaaata agctatcgtc ttggaccatt gggttttgca 3541 agtaccggcg atagacactt gccgggaaac tatggtctaa aggatcaacg tctggcccta 3601 caatggatca agaagaacat tgctcacttt ggtggaatgc cagataatat tgtgctcatt 3661 ggtcactctg caggcggtgc ttcggctcat ttgcagctgt tgcacgagga tttcaaacat 3721 ttggccaaag gagcgatttc ggtgagcggc aatgcattgg atccttgggt catacagcag 3781 ggtggacgac gacgtgcatt tgaactgggt cgtattgtcg gttgtggaca cacaaatgtc 3841 tccgcagaac tcaaggactg cttgaagtct aagccggcta gcgatatagt ctctgctgtc 3901 cgaagcttcc ttgtgttttc ctatgtaccc ttcagtgctt ttggacctgt tgtggagccg 3961 tcagatgcac cagacgcctt tctaaccgag gacccaagag cagtgattaa gagcgggaag 4021 tttgcccaag tcccttgggc tgtgacgtac accactgagg acgggggata caacgctgct 4081 cagctgttgg aaagaaacaa attaactggc gagagttgga ttgacctact caatgatcga 4141 tggtttgatt gggcaccata cttgctcttc tatcgggacg ccaagaaaac catcaaagat 4201 atggatgatc tttcatttga tctcaggcag cagtatctag cagatcggcg attcagtgtg 4261 gaaagttatt ggaacgtgca gcgaatgttt actgatgttc ttttcaagaa tagcgtgcca 4321 agtgcaatag atcttcaccg aaagtatggc aaaagtccgg tttattcttt tgtctacgat 4381 aatcctaccg attccggagt gggtcaattg ctttccaatc gaacagatgt acattttggt 4441 gggtacatac ttgcgtttca aatacttaaa gctctagctt tcttattatt aaaggtactg 4501 tccacggaga tgactttttc ttgattttca atacagctgc ataccgtatc ggcattcgtc 4561 cggatgaaga agttatttca aaaaagttta taggtatgct ggaggatttc gcactcaacg 4621 ataagggaac attaacattt ggagaatgta atttccaaaa taatgtgaac agcaaggaat 4681 atcaagtgct gcgtatttca cgaaacgctt gtaaaaacga ggaatatgct cggtttccct 4741 aaggaataaa tgcatttaaa aaatgtaaac agtataataa tgtaattaaa gaacaaattt 4801 attacacctt tatgccgcct caactgtcag aggtttttca ataagttcgt cttcttcttc 4861 tgggtcgttt tcctcgccct ccacattccc atacatctca atcttgctgt cgaggaagcg 4921 aaaatccttg aggatatgca acttgttgtg aaaccagcaa tggaaatgaa tttaatggcg 4981 atgtttgcca aacagcagag tagcaggcag tgtgctccac gggaatattg gcttgcagta 5041 ctccatttag agtgtaggat tgcacatgat tgtgctcacc accgattaag acacaatcgt 5101 ccacaaagtc acatagatgc acgcgtccgg ggaagctaaa gacacatgtg ctttccatgg 5161 aacgcaggtg gaaaattgaa gctttgggtc caccaccgca gagcagccaa tcctcgttga 5221 cagccactgc tcctatccat ttgccccaat ccggacgtaa cagatttggg ttcttgtaag 5281 gctccagcat tgaagtgtgc tgctgctgct tggtgctcca cactcgcaca gtgccatcct 5341 cggcgccaga gaagatctgt ccgttggcat tgcccaccac actgtgcacg tagtctgtgt 5401 gtccgcggta ctcgcgctga attc // LOCUS ECOLASNII 1530 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds. ACCESSION M34277 KEYWORDS L-asparaginase II. SOURCE E.coli (strain K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1530) AUTHORS Jennings,M.P. and Beacham,I.R. TITLE Analysis of the Escherichia coli gene encoding L-asparaginase II, ansB, and its regulation by cyclic AMP receptor and FNR proteins JOURNAL J. Bacteriol. 172, 1491-1498 (1990) STANDARD simple staff_review FEATURES from to/span description pept 385 1431 L-asparaginase II (ansB) precursor sigp 385 450 L-asparaginase II signal peptide matp 451 1428 L-asparaginase II mRNA 323 > 1431 ansB mRNA BASE COUNT 409 a 381 c 382 g 358 t ORIGIN 1 aaatgggcgc gaaagcggtg ctgaaaagcg gcggtaaccc attacagaat gtgctgggaa 61 gcctgggaag cctggggggg ctgcaatcct caatccaaac cgagtggaaa aagcaggaaa 121 aagatttcca gcagtttggc aaagatgttt gtagccgcgt tgtgactctg gaagatagcc 181 gcaaagccct ggtcgggaat ttaaaataat cctctatttt aagacggcat aatacttttt 241 tatgccgttt aattcttcgt tttgttacct gcctctaact ttgtagatct ccaaaatata 301 ttcacgttgt aaattgttta acgtcaaatt tcccatacag agctaaggga taatgcgtag 361 cgttcacgta actggaggaa tgaaatggag tttttcaaaa agacggcact tgccgcactg 421 gttatgggtt ttagtggtgc agcattggca ttacccaata tcaccatttt agcaaccggc 481 gggaccattg ccggtggtgg tgactccgca accaaatcta actacacagt gggtaaagtt 541 ggcgtagaaa atctggttaa tgcggtgccg caactaaaag acattgcgaa cgttaaaggc 601 gagcaggtag tgaatatcgg ctcccaggac atgaacgata atgtctggct gacactggcg 661 aaaaaaatta acaccgactg cgataagacc gacggcttcg tcattaccca cggtaccgac 721 acgatggaag aaactgctta cttcctcgac ctgacggtga aatgcgacaa accggtggtg 781 atggtcggcg caatgcgtcc gtccacgtct atgagcgcag acggtccatt caacctgtat 841 aacgcggtag tgaccgcagc tgataaagcc tccgccaacc gtggcgtgct ggtagtgatg 901 aatgacaccg tgcttgatgg ccgtgacgtc accaaaacca acaccaccga cgtagcgacc 961 ttcaagtctg ttaactacgg tcctctgggt tacattcaca acggtaagat tgactaccag 1021 cgtaccccgg cacgtaagca taccagcgac acgccattcg atgtctctaa gctgaatgaa 1081 ctgccgaaag tcggcattgt ttataactac gctaacgcat ccgatcttcc ggctaaagca 1141 ctggtagatg cgggctatga tggcatcgtt agcgctggtg tgggtaacgg caacctgtat 1201 aaatctgtgt tcgacacgct ggcgaccgcc gcgaaaaccg gtactgcagt cgtgcgttct 1261 tcccgcgtac cgacgggcgc taccactcag gatgccgaag tggatgatgc gaaatacggc 1321 ttcgtcgcct ctggcacgct gaacccgcaa aaagcgcgcg ttctgctgca actggctctg 1381 acgcaaacca aagatccgca gcagatccag cagatcttca atcagtacta atcgcctcgc 1441 cccggtatcg tgccggggct ttttcacttc agactcacgt ccattgccaa ttttaattac 1501 cctaatgata atcaccggaa taaattattt // LOCUS HUMPADPRP 2085 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human poly(ADP-ribose) polymerase (PADPRP) gene, exon 1. ACCESSION M34337 KEYWORDS poly(ADP-ribose) polymerase. SOURCE Human peripheral blood leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2085) AUTHORS Ogura,T., Nyunoya,H., Takahashi-Masutani,M., Miwa,M., Sugimura,T. and Esumi,H. TITLE Characterization of a putative promoter region of the human poly (ADP-ribose) polymerase gene: Structural similarity to that of the DNA polymerase beta gene JOURNAL Biochem. Biophys. Res. Commun. 167, 701-710 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1943 / 2062 poly(ADP-ribose) polymerase /hgml_locus_uid="LZ0154E" /nomgen="PPOL" /map="1q41-q42" pre-msg 1777 > 2085 PPOL mRNA and introns IVS 2063 > 2085 PPOL intron A BASE COUNT 495 a 535 c 550 g 505 t ORIGIN 1 tttagggatg atatagttgt caacccagag atggcatgat catgcctttt gacttggtca 61 ttctctaagt aaaactttta tttgttccat catattttcc acttattctg tttaccttca 121 aaatatcttt tttttttttt tttgagacag ggtcacactg tcacccaggc tagagtccag 181 tggcactatc atggctcacc acagcctcaa ccttcagggc tcaggtgatc ctcccacttc 241 agcctcccga gtagatggga ctacaggcac ctgccaccac ccccagctaa tttttgtaga 301 gacaaggttt tgccatgttg tccaggctgg tcttgaactc ctgggctcaa gggatccggc 361 cacctcagcc tcccaaagtg ctaggattat aggcatgagc cactgtgccc agcctacctt 421 caacgtatct aactggttac taacttttag gattcggcct atgtctcaca accttcttgc 481 ttactcaaca tccttgtctc ttaagccact agcttcttct ctatggttaa cactttttat 541 gagttttatt catctgctta tttttcttat cctctatacc agaattgaat attttcaaat 601 aaagcacact catgttacaa tctttgaaat ggaaaaaaaa aatgcatagg attagaaaag 661 aaaccaattt taataaacta tattttgaag tatagttcta tattaaacaa caagatctag 721 gccaggtgca gtggctcatg cctgtaatcc cagcaatttg ggaagtcgag gtgggaggat 781 tgcttgaggc caggggttca agaccagcct gggcaacatg gagagattcc ccatctcttt 841 ctttacacac acacacacac acacacacaa aatatctgat agcaacaggt gcagtcatta 901 ccacaatttc gagtagtgat gagcttaata atatttcgag ttatcaccaa caactgtaaa 961 ctaacatgaa aacgtctgtg atgactattg cccacaaagt cacaggtact gctaatactc 1021 ctggtatttg tagtaaattc ataataaagg aaatgctagg tttcagttgg tattttgtcc 1081 cgacggtctg tggacggcag gttagaacgc ccgtccaagc caggagggtg gacctagcac 1141 tgcagggtcc acctcgggcc aatcaactat attcccgagg cgggggcctg cgcttcccgg 1201 acccagctgc cctcagggga gagaggacac acttaagagt ttggggccgg cgtggtagct 1261 catgcccctg atcccagcac ttcgggaggc tgaggcgtga agatcacttg tagcaggagt 1321 ttgagaccag tctagccaac ttggcgagac cctgtcccta aaaaaaattt ttttttaatt 1381 agccagttgt ggtgagcgcc tgtagtccca gctactcggg aggctgaggt gggaggatcg 1441 ctgggctcag gagttccaga ctgcagtgag ccatgatggc ggcactgcac tccagcgcgg 1501 tgagactcag tctcaaaaat aaaaggggga ggggttgggg gtaaaattag ttgtgaaatc 1561 aagtaagact tcctgggaca gaacaatcaa aggggtggcg ccgggtcctc caaagagcta 1621 ctagctcagc ccaagccccg cctcggcccc cagggcagcg gccgcagagc tccacccggc 1681 aggcgcccgg gaaactccgc cccccggccg gcagggcgcg cgcccgccgg ccccgccccg 1741 tggacgcggg ttccgtggcg ttcccgcggc caggcatcag caatctatca gggaacggcg 1801 gtggccggtg cggcgtgttc ggtgcgctct ggccgctcag ccgtggcggc tgggtgagcg 1861 cacgcgaggc ggcgaggcgg caagcgtgtg tttctaggtc gtggcgtcgg gcttccggag 1921 ctttggcggc agctagggga ggatggcgga gtcttcggat aagctctatc gagtcgagta 1981 cgccaagagc gggcgcgcct cttgcaagaa atgcagcgag agcatcccca aggactcgct 2041 ccggatggcc atcatggtgc aggtgcgggc cgctgtgcgg cgggg // LOCUS HUMREVERB1 295 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human Rev-ErbA-alpha mRNA, 5' end. ACCESSION M34339 KEYWORDS . SEGMENT 1 of 2 SOURCE Human fetal skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 295) AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W. TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription from the noncoding DNA strand of a thyroid hormone receptor gene results in a related protein that does not bind thyroid hormone JOURNAL DNA 9, 77-83 (1990) STANDARD simple staff_review FEATURES from to/span description pept 204 > 295 Rev-ErbA-alpha protein (put.) BASE COUNT 58 a 107 c 74 g 56 t ORIGIN 1 ccgttgcctc aacgtccaac ccttcctgca gggctgcagt ccggccaccc caagcccttg 61 ctgcagggtg cttcggatcc tgatcgtgag tcgcggggtc cactccccgc ccttagccag 121 tgcccagggg gcaacagcgg cgatcgcaac ctctagtttg agtcaaggtc cagtttgaat 181 gaccgctctc agctggtgaa gacatgacca ccctggactc caacaacaac acaggtggcg 241 tcatcaccta cattggctcc agtggctcct ccccaagccg caccagccct gaatc // LOCUS HUMREVERB2 285 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human Rev-ErbA-alpha mRNA, partial cds. ACCESSION M34340 KEYWORDS . SEGMENT 2 of 2 SOURCE Human fetal skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 285) AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W. TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription from the noncoding DNA strand of a thyroid hormone receptor gene results in a related protein that does not bind thyroid hormone JOURNAL DNA 9, 77-83 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 285 Rev-ErbA-alpha protein (put.) (AA at 1) BASE COUNT 62 a 79 c 99 g 45 t ORIGIN About 1.7 kb after segment 1. 1 ctcgtaaagg agagagaagt gcagagttcg attctgtaca agggggcagc ggcagaaggc 61 cggccgggcg ggtcactggg cgtccacccg gaaggacagc agcttctcgg aatgcatgtt 121 gttcagggtc cgcaggtccg gcagcttgag cagcagcttg gtgaagcggg aagtctccaa 181 gggccggttc ttcagcacca gagcccgaag agcccgcagc agcgtctcct ggagctgctc 241 caccgaagcg gaattctcca tgcccgagcg gtctgcagag acaag // LOCUS HUMTFEB 1526 bp ss-mRNA PRI 28-JUN-1990 DEFINITION Human TFEB protein mRNA, partial cds. ACCESSION M33782 KEYWORDS . SOURCE Human BJAB B cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1526) AUTHORS Carr,C.S. and Sharp,P.A. TITLE A helix-loop-helix protein related to the immunoglobulin E box binding proteins JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.S.Carr, 19-APR-1990. FEATURES from to/span description pept < 1 > 1526 TFEB protein (AA at 1) BASE COUNT 324 a 495 c 493 g 214 t ORIGIN 1 ggtggagtga cagccggagc ccgggcgccg gcgcggggct cggtgacagc ggaggcggcg 61 gcccgggcgg gacgagggag cggccgcggc gccgacggcg ggcgggaagg gcggacgggg 121 cgggcctccc tggtggcacg gggccggagc aggccgggag caggtggcgg gcacagcttg 181 cgggccaggc acccgaactt gcgacaagtt gccggagccg gggcgcgggc ggcggacaga 241 ttgaccttca gagcgaggga gccagcgccg gcagccacca tggcgtcacg catagggttg 301 cgcatgcagc tcatgcggga gcaggcgcag caggaggagc agcgggagcg catgcagcaa 361 caggctgtca tgcattacat gcagcagcag cagcagcagc aacagcagca gctcggaggg 421 ccgcccaccc cggccatcaa tacccccgtc cacttccagt cgccaccacc tgtgcctggg 481 gaggtgttga aggtgcagtc ctacctggag aatcccacat cctaccatct gcagcagtcg 541 cagcatcaga aggtgcggga gtacctgtcc gagacctatg ggaacaagtt tgctgctgcc 601 cacatcagcc cagcccaggc tctccgaaac ccccaccagc cgcctcccca ggggtgcgag 661 ctggacacgt gctgtcctcc tccgctggca acagtgctcc caataagccc catggccatg 721 ctgcacattg gctccaaccc tgagagggag ttggatgatg tcattgacaa cattatgcgt 781 ctgacgatgt ctcttggcta catcaatcct gaaatgcaga tgcccaacac gctacccctg 841 tccagcagcc acctgaatgt gtacagcagc gacccccagg tcacagcctc cctggtgggc 901 gtcaccagca gctcctgccc tgcggacctg acccagaagc gagagctcac agatgctgag 961 agcagggccc tggccaagga gcggcagaag aaagacaatc acaacttaat tgaaaggaga 1021 cgaaggttca acatcaatga ccgcatcaag gagttgggaa tgctgatccc caaggccaat 1081 gacctggacg tgcgctggaa caagggcacc atcctcaagg cctctgtgga ttacatccgg 1141 aggatgcaga aggacctgca aaagtccagg gagctggaga accactctcg ccgcctggag 1201 atgaccaaca agcagctctg gctccgtatc caggagctgg agatgcaggc tcgagtgcac 1261 ggcctcccta ccacctcccc gtccggcatg aacatggctg agctggccca gcaggtggtg 1321 aagcaggagc tgcctagcga agagggccca ggggaggccc tgatgctggg ggctgaggtc 1381 cctgaccctg agccactgcc agctctgccc ccgcaagccc cgctgcccct gcccacccag 1441 ccaccgtccc cattccatca cctggacttc agccacagcc tgagctttgg gggcagggag 1501 gacgagggtc ccccgggcta ccccga // LOCUS MUSFCERII 2237 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse low affinity IgE receptor (Fc-epsilon-RII) mRNA, complete cds. ACCESSION M34163 KEYWORDS IgE receptor. SOURCE Mouse (strain Balb/c) splenic B cell, cDNA to mRNA, clone p23.18. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2237) AUTHORS Gollnick,S.O., Trounstine,M.L., Yamashita,L.C., Kehry,M.R. and Moore,K.W. TITLE Isolation, characterization, and expression of cDNA clones encoding the mouse Fc receptor for IgE (Fc-epsilon-RII) JOURNAL J. Immunol. 144, 1974-1982 (1990) STANDARD simple staff_review FEATURES from to/span description pept 175 1170 low affinity IgE receptor (Fc-epsilon-RII) mRNA < 1 2237 Fc-epsilon-RII mRNA BASE COUNT 592 a 622 c 533 g 490 t ORIGIN 1 ggagaagact actgtcttca acacactagc ctgagctacc ttatccaagt gctccacata 61 ttccagaagg agaaggacag acttcaagtt caaatcactt ccagagctgt gagtgacaag 121 tgccttggca ggtagtgcac gcctcatcac tgaaaggatc caaacaagac tgccatggaa 181 gaaaatgaat actcaggata ctgggaacct cctagaaagc gttgctgctg tgcaagacgt 241 gggacacagc tcatgttggt ggggctgctg agcacagcaa tgtgggctgg cctgctggcc 301 ctgcttcttc tgtggcactg ggaaacggag aagaatctaa aacagctggg agacactgca 361 attcagaatg tctctcatgt taccaaggac ttacaaaaat tccagagtaa tcaattggcc 421 cagaagtccc aggttgttca gatgtcacaa aacttgcaag aactccaagc tgaacagaag 481 caaatgaaag ctcaggactc tcggctctcc cagaacctga ccggactcca ggaggatcta 541 aggaacgccc aatcccagaa ctcaaaactc tcccagaacc tgaacagact ccaagacgat 601 ctagtcaaca tcaaatccct gggcttgaat gagaagcgca cagcctccga ttctctagag 661 aaactccagg aagaggtggc aaagctgtgg atagagatac tgatttcaaa gggaactgca 721 tgcaacatat gtcccaagaa ctggctccat ttccaacaga agtgctacta ttttggcaag 781 ggctccaagc agtggatcca ggccaggttc gcctgcagtg acctgcaagg gcgactagtc 841 agcatccaca gccaaaagga acaggacttc ctgatgcaac acatcaacaa gaaggattcc 901 tggattggcc tccaggatct caatatggag ggagagtttg tatggtcgga cgggagccct 961 gtgggttata gcaactggaa tccaggggag cccaataacg ggggccaggg tgaggactgt 1021 gtgatgatgc ggggatccgg ccagtggaac gacgccttct gccgcagcta cttggatgca 1081 tgggtgtgtg agcagctggc aacatgtgag atatctgccc ccttagcctc tgtgactcca 1141 acaaggccca ccccaaaaag tgaaccctga caaacttctg ctcacactct tctggatttc 1201 tcctctacct ttatcgtgga aacagctggg ccctgaggat acccctatca gggcccaggg 1261 ctctctgtga ccgaaggctt tgattatgtt cccacccata ctgaagcagc tggtggatgc 1321 cagctcctgc cagctaccca gaaaccctct ccagctctcc agctaagctg gccatcccat 1381 tccatctgcc ttcctcaaac ctgggcccca gccttgctag ctccctgact acgggcatgc 1441 atgtgggcag ctgagccaac cagggagctg ctgagaacaa agatttcgaa ggcttctttt 1501 gcagtcccca cctcctatca agttccccac tttctccccc tcggcatcag agaacagggg 1561 ttccctttcc ccaggatctg ggatgagtcc tcccatcaag tttgcatcag tggtcccagg 1621 actccgaccc tccttggagg ctaccaggtg tgctcctggt gcgggaggta ttgaaggaac 1681 tctaaacagc tccagcaagg cgagcctggc tctgtctggt aggcctggcc cttctctccc 1741 attccttcta ccttactaaa agctgttaga gaacagtcct aaagctagcc cccaaggtct 1801 attcccttat ttggccactt cctcctcctg aggctgacta caaggtccag ctatccaagt 1861 actgaagtct aacatcaaaa gccccctttg tctcacctaa gtagcaatgc ccaatcaaaa 1921 tacaccatca catcatagcc cagtctaaca gaccgccctt tttctcttca taaaattaca 1981 cctgcaacca ggcgtagtgg tgcaggcctt tagtcccagc acttgggagg cagagacaag 2041 cgaatttctg agttcgaggc cagcctggtc tacaaagtga gttccaggac agccagggct 2101 acacagagaa accctgtctc gaagaaagaa aaaaaaaaaa aattacacct gcgaggtcac 2161 ttgggctgct gtttttctgc ctgagtcaga gggcagccac ttaacttttc ttccctgctt 2221 aataaaggat ctctgtg // LOCUS RATATM1 1585 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 1b. ACCESSION M34137 KEYWORDS alpha-tropomyosin. SEGMENT 1 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1585) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review COMMENT [1] reports isolation of exon 1b which is the first exon coding for the alternatively expressed low-M-r alpha-tropomyosins. FEATURES from to/span description pept 1275 / 1406 low-M-r alpha-tropomyosin BASE COUNT 360 a 408 c 449 g 368 t ORIGIN 1 agaggtgttc cagctttgga agataagtga ggcaggctgt tttcttttct acctttgttt 61 ccattctcca cctcccacat ttaagcaagt tcaggactga ctctttgaca gaggtgtagg 121 attaaaaagc tcggagactt gacaggcttt aaaaaaaaca tagcggaaga aggaatgaca 181 gaggcgggtt cttacagact gtgtccatgt ggctttccag ctggtaacca gaggttctgt 241 cacaggtgtt tgtggtcttg cagacccggg ttcctcagat tccagatatc aggctgaatt 301 gcctgaagtt gcacagttcg ttatggggta cagacaaaat cttatggtag cgtttaatta 361 catagtttga gttcttaaat aagtgaggca tatgtaaatg ttttatatca tctgataact 421 tcaacagcct tgtgaagtgt cctgtactat tatactcgtt tttaagagag gaaatccctg 481 agaagtcatt tgtccaaagg cacacagcca gcagagactt aagagccaga ttgaaacctt 541 aaaaagatgt atttagaaca ggagatttaa ccactacgct ttgccagcta gtagtttaca 601 cttggtgaga tagtttgggt gttgggattc ttgtagtgag gatcttgtgt tctgtgttct 661 tcacaaggat ctttaggata ttcctatttc aggctccgag ttccttgttc agactctggg 721 aaatcgggag ttgccaatgc ctttagtatt cggaatgatg gcgttcttta caactcaaag 781 agtttaggga acgttcaagg tctaggttcc tacccttact cagcttcagg ggaatctcga 841 tttcagagcg ctccaaatgg ccagcaggag tcgccatagg ctttagcatc cagttcctct 901 acttggagtc acccaggctc cgcggaatgg atgcaaacgc cctcctgcgg tagcctcgct 961 ctaggcaaag cagcccagag ccagccattc ccgcgcgcac gcgcagtggt tccaagggta 1021 gccagttcct cgggtccacc ggccccgccc gcctgcccgc cgcgcccatc ccctgcgttc 1081 gcgcacccgc ccgcggctct ctagtgcggc cctggggccg tgacttccgg gctgcttctt 1141 agccgcaggg ggcgccggcg ttgcaaagcg ggcctgggcg ggacgaaccg gacctgggct 1201 ccgtaactgc aggagcccag cagtagcagc tgcagagcgc ggaggcagag cacagcctca 1261 gccgccgcct cgccatggcg ggtagcagct cgctggaggc ggtgcggaga aagatccgga 1321 gcctgcagga acaggcagat gccgcggagg agcgtgcggg cagcctgcag cgagagctgg 1381 accaggagcg gaagctgcgg gaaaccgtaa gggatccacc catcacccga ggcccccacg 1441 ccgcacctgg cgcacctagg ccaactggca cccggctact cagcctgccg gtctatctcc 1501 tcggtctaaa gtaaaacctc caagggggag caagagtgct tgctgttgag aaggttctgg 1561 gggcgagcat gaaatgggtc ccggg // LOCUS RATATM2 462 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 9c. ACCESSION M34138 KEYWORDS alpha-tropomyosin. SEGMENT 2 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 462) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review COMMENT [1] reports isolation of exon 9c which is the final exon coding for the alternatively expressed brain alpha-tropomyosins TMBr-1 and TMBr-3. FEATURES from to/span description pept / 375 448 alpha-tropomyosin BASE COUNT 97 a 141 c 100 g 124 t ORIGIN Unknown number of bp after segment 1. 1 ctcgcctaga atctctgctg agctcccaca aagcatgagt gtctggggct gccctccagc 61 ctggccttag tccccaaatc tgttgtccta aactacacat aagtgagtca ctgggtggct 121 ttaatgaggt gcagaaggag cattggttct gcacctcttc gcctctgctc tacaccaagg 181 aaagattctg gcccctcctt tattctgcca gacctccatg cgcctcaagg gcacagggca 241 gagggcctgg ggtcagggta ccttgccgct gtcctttgta ttgtgttctc agtctgttaa 301 ccgtttctca ctgactcttg ctaacctctt ccttctgcct cttcttttct gctaaccctt 361 gctgacctgc ccagatcaac tctaccatca actcgagcaa aaccgccgtc taactaatga 421 actaaagctg gccctgaatg aggattaaaa ccctgggcca ag // LOCUS RATPLPA1 1658 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exons 1 and 2. ACCESSION M34108 KEYWORDS parathyroid hormone-like peptide. SEGMENT 1 of 5 SOURCE Rat liver DNA, clone lambda-rPLPg10-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1658) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1477 + 1577 parathyroid hormone-like peptide (PLP) precursor, exon 2 (first expressed exon) sigp 1477 + 1577 parathyroid hormone-like signal peptide pre-msg 951 > 1658 PLP mRNA and introns IVS 1259 1452 PLP intron A IVS 1578 > 1658 PLP intron B BASE COUNT 325 a 507 c 466 g 360 t ORIGIN 1 gggaaccagt tacacacaca cacacacaca cacacacaca cacacacaca cacacacact 61 ggtaagaacc agtagcccac aacattctcc ccctctcccc acccacccac ccagcagcgc 121 tgagaagctg aaacgactgg cataaggtta ctcagtgaat ctatagcagg acatagagta 181 ggttatcaga ggaagagtgg ctaagttgga ttctgtgtgc tccaggttct ctgagatcac 241 aaagtgggtc ttcgatcctt gttcacacac cctttctagg atctgctcca gatgtgatgt 301 ccccagctgt tctcaaacag ttaagaccaa aaatcgcctt tgctctcctg ggtgctaccc 361 ttagtcccct ccccagggga aggccatgcc aggcaggaag gctggcttgc aacagtgcga 421 gcttgcaaac gagcttggga ctcttgggat tcgcaggcca cccggagttc ctccttgccg 481 ccggtgggga ccagggctcc tggacctggg ctcctggact caggcggcag caggctggtt 541 caggaggagg agagcgtcag atcggcccac gcggcgactg gggtggggct ccgtggctcc 601 cagctcccct gtgctccccg cccccgccgg cccgctcacc ccagtctgct ccgccggcgc 661 gcgttcctcg ggcgccacct ctttgcgact cgctcacttc tcagcaggtt ggccacagag 721 cgtgtgaacc ttccaaggct gggctcacag ctcctctcca acctgcgccg cccagctggg 781 cgtccgcccc gccgccggaa ccgcctcgcg ccacctggcc ctcgcatcca cgacacgcgc 841 gcctgcaact tgttcaaggg cgttgtggaa tcaactttcc ggaagcaacc agcccaccgg 901 aggaggtaga ccgacagcta tgtatatata cgctctctgc cgcaagtggc tttgcaccca 961 gaggcctggt gcgcaggaca gctgactcct gcgggcacac tcgcgtttga agaggggttc 1021 gccctgcccc gcgacccaga gcgctgccgc caagactaat tagacattgc tatgggagcc 1081 acagcaacgc gtcacgcatc cccgacgcct atgtaaaacg gttcggtctt cgctctttca 1141 gaggaaactc tctgattgct ttttcccttc tcgggtccct tttttttttt tttttttttt 1201 ttttttttgc ctgtgcggtt tgagagaggc acagttggag gcgctgattc ctacaaaagt 1261 aagtgctcag gggctcggaa acactttttt gctttctttt tcttgttttc ttttttcctt 1321 ataacttgtg agatgccctc gacattggag tggcttgttg ggaagccacg gggttccaaa 1381 agagggagag gggcagtgag aggaagagat agatggagag ccaggactga ccctcgggtt 1441 ccactcttgc aggtcccaga gccagcgagc ggcacgatgc tgcggaggct ggttcagcag 1501 tggagcgtcc tggtgttcct gctcagctac tccgtgccct cccgcgggcg ctcggtggag 1561 gggcttggtc gcaggctgta agtaccggat cttcctcccg ggtgagggat ggaggagacc 1621 agcaccagag gtgccgaact gaggcgggca atgcggaa // LOCUS RATPLPA2 144 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B (partial). ACCESSION M34109 KEYWORDS parathyroid hormone-like peptide. SEGMENT 2 of 5 SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 144) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description IVS < 1 > 144 parathyroid hormone-like peptide intron B BASE COUNT 42 a 26 c 21 g 55 t ORIGIN 3 kb after segment 1. 1 gcagaacact atgttttgac tgaagtccaa gttgcctctg ataaactttt gcccaatttt 61 tcttttaatt tttttacagt aaccaaagac aaggataaag tataaatcca cttatttttc 121 catgctggca ctttggtttg taat // LOCUS RATPLPA3 145 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B (partial). ACCESSION M34110 KEYWORDS parathyroid hormone-like peptide. SEGMENT 3 of 5 SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 145) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description IVS < 1 > 145 parathyroid hormone-like peptide intron B BASE COUNT 22 a 34 c 34 g 55 t ORIGIN 640 bp after segment 2. 1 agtctccata gggtcaatcc actgtcatgt gcttcttatg ctgtgctgtt tcttgtcccc 61 ttgctgtgac actcacttct gcctttcttg gacttgagaa cacgttgggg acttttgaga 121 atcggagtcc tgttttgtgg acttt // LOCUS RATPLPA4 920 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 3. ACCESSION M34111 KEYWORDS parathyroid hormone-like peptide. SEGMENT 4 of 5 SOURCE Rat liver DNA, clone lambda-rPLPg9-0. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 920) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 94 + 516 parathyroid hormone-like peptide (PLP) precursor, exon 3 sigp + 94 100 parathyroid hormone-like signal peptide matp 101 + 516 parathyroid hormone-like peptide IVS < 1 93 PLP intron B IVS 517 > 920 PLP intron C BASE COUNT 198 a 353 c 166 g 203 t ORIGIN 1.3 kb after segment 3. 1 tccctactct agaggtcaac tgttcaccca tctcattcct tcaagtccaa cacgggcagg 61 tcctcagctg acaactgtct tctcccctga cagaaaacgc gcggtgtctg agcaccagct 121 actgcatgac aagggcaagt ccatccaaga cttgcgccgc cgtttcttcc tccaccatct 181 gattgcggag atccacacag ctgaaatcag agctacctcg gaggtgtccc ctaactccaa 241 acctgctccc aacaccaaaa accaccctgt gcggtttggg tcagacgacg agggcagata 301 cctaactcag gaaaccaaca aggtggagac gtacaaagag cagccactca agacgcccgg 361 gaagaagaag aaaggcaagc cggggaaacg cagagaacag gagaaaaaaa agcgaaggac 421 tcggtctgcc tggccaggca caactgggag tggcctgctt gaggaccccc agccccacac 481 ctccccgacc tcgacctccc tggagcccag ctcaaggtat tgtctgtccg ctgtgctcag 541 tagggctccc ctccctccac cggcctgggt cccagagatg cagaatcctc tgtatcttga 601 tctgagacta catcagttag gccacaatct cagtccattc ctggttagcc tgctagacaa 661 tttaatttcc ctacttccct cccaccagcc caatcaacca tccttcctcc ctccctccct 721 ccttccctcc ctcactccct ccctccctcc atccctccct ccctaccatc ctccctccct 781 tccttctccc tttcctttcc atcccctccc ctcttctccc tttcctttcc atcccctccc 841 ctcttctcct cccctctctc tgattcctgt cttcctcact cctccccaca catagggtct 901 tcctcaagcc aaggctgact // LOCUS RATPLPA5 855 bp ds-DNA ROD 28-JUN-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 4. ACCESSION M34112 KEYWORDS parathyroid hormone-like peptide. SEGMENT 5 of 5 SOURCE Rat liver DNA, clone lambda-rPLPg9-0. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 855) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 21 30 parathyroid hormone-like peptide (PLP) precursor, exon 4 matp + 21 27 parathyroid hormone-like peptide pre-msg < 1 449 PLP mRNA and introns IVS < 1 20 PLP intron C BASE COUNT 280 a 139 c 157 g 279 t ORIGIN 3.6 kb after segment 4. 1 ccccttcctt gtcttcacag gacgcattga aactttcacc gaagatcttc caagacacgt 61 tacaggattc tgtaatagta aacatatgga aagtattaga catatttatt gcctgtacat 121 actgtaaatg cattgggatc aaactgtctc cccaggaaac tgcacatggg tcatgttacc 181 ataatttatt ttgtcaactg atgtatttat ttgtaaatgt atcttggtgc tgctgactct 241 gtttttttgt aacataatgc actttaggta tacatatcaa gtatgtcgat gaatttaaca 301 cgaaaaagtg tctctatttt gtggttcatt ttgatgagtt ctgaaatata attatctaga 361 ctgatctccc tctgcgcatg taaaaatggc actattttaa atttgtaaag aatgtctaat 421 aaaatataat ctaattacac cgtgacttac aatgttggtt ttaattcttt aaggtttcta 481 atcagagaaa cacagtacaa ttttttttta aattctactt gaataaaatc ctaagcttga 541 tgtgtttgtg tggggatcta ctccaatgtg tccgcattag cttttagtta aaaaccactt 601 aaaatgtatt ttgccaatat ttcagctcaa gtcaatagaa tggctgaaca gcctgaactt 661 caagttgaaa agaaagacaa gaaagggcaa aaaataagca gagaagtcag gagaaagggt 721 caaagagggt tgtccagagt taaaaccaga gaagtgggtc cagacttgaa gactctgaat 781 aaattttgcc aatgtaaagt ttagtttgca agggcttgct ctcgattcat aacaataata 841 aaagatctgc tcggg // LOCUS RATTMBR1A 1050 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat brain alpha-tropomyosin (TMBr-1) mRNA, complete cds. ACCESSION M34135 KEYWORDS alpha-tropomyosin. SOURCE Rat brain, cDNA to mRNA, clone pOk4. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1050) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review FEATURES from to/span description pept 41 886 brain alpha-tropomyosin (TMBr-1) mRNA < 1 1050 TMBr-1 BASE COUNT 336 a 220 c 306 g 188 t ORIGIN 1 ctctggctcc tctgcctaag ggcccctcgc caccgccacc atggacgcca tcaagaagaa 61 gatgcagatg ctgaagctcg acaaagagaa cgccttggat cgagcagagc aggcggaggc 121 tgacaagaag gctgcggaag accggagcaa gcagctggaa gatgagctgg tgtcactgca 181 aaagaaactc aaggccactg aagatgaact ggacaaatac tccgaggctc tcaaagatgc 241 ccaggagaaa ctggagctgg cggagaaaaa ggccacagat gctgaagctg acgtagcatc 301 tctgaacaga cgcatccagc tggttgagga ggagttggat cgcgctcagg agcgtctggc 361 cacagctcta cagaagctgg aggaggctga gaaggctgca gatgagagtg agagaggcat 421 gaaagtcatt gaaagccgag cccaaaaaga tgaagaaaag atggagattc aggagatcca 481 gctgaaagag gccaagcaca ttgctgaaga tgctgaccga aagtatgaag aggtggcccg 541 taagctggtc atcatcgaga gcgatctgga gcgtgcggag gagagggctg agctctcgga 601 aggcaaatgt gccgagcttg aagaagagtt gaaaacggtg acgaacaact tgaagtcact 661 ggaggctcag gctgagaagt actctcagaa agaagacaag tatgaagagg agatcaaggt 721 tctctctgac aagctgaagg aggctgagac ccgggctgag tttgcagaga gatcagtaac 781 caaattggag aaaagcattg atgacttaga agatcaactc taccatcaac tcgagcaaaa 841 ccgccgtcta actaatgaac taaagctggc cctgaatgag gattaaaacc ctgggccaag 901 ttctaggaac ggagtccatg ttcagaaaat ctatgactct cggggttaaa aaatgataac 961 ttctagagaa atgggaagga aatgcattgg ctggaaggcc agccttgcct atgttttcct 1021 ctgtatatct taattaaatc ttcaataacc // LOCUS RATTMBR2A 1004 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat brain alpha-tropomyosin (TMBr-2) mRNA, complete cds. ACCESSION M34134 KEYWORDS alpha-tropomyosin. SOURCE Rat brain, cDNA to mRNA, clone pOk15. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1004) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review FEATURES from to/span description pept 137 892 brain alpha-tropomyosin (TMBr-2) mRNA < 1 1004 TMBr-2 BASE COUNT 265 a 242 c 321 g 176 t ORIGIN 1 cttagccgca gggggcgccg gcgttgcaaa gcgggcctgg gcgggacgaa ccggacctgg 61 ctccgtaact gcaggagccc agcagtagca gctgcagagc gcggaggcag agcacagcct 121 cagccgccgc ctcgccatgg cgggtagcag ctcgctggag gcggtgcgga gaaagatccg 181 gagcctgcag gaacaggcag atgccgcgga ggagcgtgcg ggcagcctgc agcgagagct 241 ggaccaggag cggaagctgc gggaaaccgc tgaagctgac gtagcatctc tgaacagacg 301 catccagctg gttgaggagg agttggatcg cgctcaggag cgtctggcca cagctctaca 361 gaagctggag gaggctgaga aggctgcaga tgagagtgag agaggcatga aagtcattga 421 aagccgagcc caaaaagatg aagaaaagat ggagattcag gagatccagc tgaaagaggc 481 caagcacatt gctgaagatg ctgaccgaaa gtatgaagag gtggcccgta agctggtcat 541 catcgagagc gatctggagc gtgcggagga gagggctgag ctctcggaag gcaaatgtgc 601 cgagcttgaa gaagagttga aaacggtgac gaacaacttg aagtcactgg aggctcaggc 661 tgagaagtac tctcagaaag aagacaagta tgaagaggag atcaaggttc tctctgacaa 721 gctgaaggag gctgagaccc gggctgagtt tgcagagaga tcagtaacca aattggagaa 781 aagcattgat gacttagaag ataagtttct ttgcttctct cctcccaaga ctccttcatc 841 aagccggatg tcccacctct ctgagctctg catctgtctg ctctccagct gacccaggtt 901 tctttctagt gcccacccac cctagggcca ggcacagacc gtgctttcta ttgtacagag 961 gtgatcctcc cagtgtaaaa taaacactgt acgctagtcc tgtt // LOCUS RATTMBR3A 2488 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat brain alpha-tropomyosin (TMBr-3) mRNA, 3' end. ACCESSION M34136 KEYWORDS alpha-tropomyosin. SOURCE Rat brain, cDNA to mRNA, clone pOk10. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2488) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 604 brain alpha-tropomyosin (TMBr-3) mRNA < 1 2488 TMBr-3 BASE COUNT 713 a 531 c 579 g 665 t ORIGIN 1 tgaagctgac gtagcatctc tgaacagacg catccagctg gttgaggagg agttggatcg 61 cgctcaggag cgtctggcca cagctctaca gaagctggag gaggctgaga aggctgcaga 121 tgagagtgag agaggcatga aagtcattga aagccgagcc caaaaagatg aagaaaagat 181 ggagattcag gagatccagc tgaaagaggc caagcacatt gctgaagatg ctgaccgaaa 241 gtatgaagag gtggcccgta agctggtcat catcgagagc gatctggagc gtgcggagga 301 gagggctgag ctctcggaag gcaaatgtgc cgagcttgaa gaagagttga aaacggtgac 361 gaacaacttg aagtcactgg aggctcaggc tgagaagtac tctcagaaag aagacaagta 421 tgaagaggag atcaaggttc tctctgacaa gctgaaggag gctgagaccc gggctgagtt 481 tgcagagaga tcagtaacca aattggagaa aagcattgat gacttagaag atcaactcta 541 ccatcaactc gagcaaaacc gccgtctaac taatgaacta aagctggccc tgaatgagga 601 ttaaaaccct gggccaagtt ctaggaacgg agtccatgtt cagaaaatct atgactctcg 661 gggttaaaaa atgataactt ctagagaaat gggaaggaaa tgcattggct ggaaggccag 721 ccttgcctat gttttcctct gtatatctta attaaatctt caataaccaa gaagttttta 781 tgatacatta tgtatatggg aaaatattta taccaatcaa ttcagctctt cctgcatcag 841 caggccctct ttgaagtgat gataatgagg actcccccac caccaccatc cccccaaaaa 901 aaggaaaaaa ttttctattt cctctaaaca gctgtgagtc ctgctttaaa tatctcttct 961 acatttgtat tcctaacatc ctcctaacat gtatagtgtt tgaaatgtga tccaacgtgt 1021 acctatgctt gggcaaaata gcttttgaaa acaggagctc gtaccacaag tccctggttg 1081 aaaggtatgc tttgttaagt ttaacggtgt tggaatctga ttatcgtgtc atgctgttaa 1141 gcgggatact ataaaaagac taagagaatg tccctaatga agtgtgcatg agatgtgttc 1201 atataaccat ttgcgagccg tagaaataaa tctatttaaa aagtggtcag aaccttcctt 1261 acgtcatctt cgttttctgt gaggtcgtgt tctgacccac ctacccgatc tttaactgcc 1321 tgtaactcag aatcctgtgc tggtattaac gtcttcttcg gtctgtgtaa caatgctgta 1381 cctgtatttg gtttttgacc ctcccttgtt ccccgtagca cgcagtgggt gagccttcac 1441 caaacacacc ctgcacgtta cgtcagggtg ggttgcaggt gtaggtgcac ccggggcctt 1501 gtgtagatgt gagtgacttc caaagcatgt gtagtgccac tcatctgtgt gtgactggtg 1561 tttgcttact aaatgtacaa accactaccg agtgtgaagc tcagccattc atgactcaat 1621 tttctgatcc catcacagag aaagtggcac atgccaaaga agaaaacctt agtatgcacc 1681 agatgctgga ccagacttta ctggagctaa acaacatgtg aaaccctcct tagctgcggc 1741 cacattctct cgtttccccc tgttgtttta ttttgtcttt aaacacatgc ttaccatgaa 1801 accccttcaa tgcttttttt ttttatatat atatatacct ttaccactgt cactgcaaca 1861 tctgcagaga gccagctagg gcagggggtg gggaaagaca cacagaaagc aggtgtactt 1921 gttatttaaa tgtgccattt cccaggttga cattgccaca cttcacagag tttaggcatg 1981 cagtttgctt agccagtgta ggaagcctca cgaaaacaaa aggattaaca gcctaagtgc 2041 caatggtggg gtcatcagga agactgatgt tggagacaat caggtgcaga ttggtgctaa 2101 tttaaacaaa agaacccctg gtgtcttttg ttcagtctaa tacaatttag acttctgtcc 2161 aacactaatt tatttttgtc ttgagtagca ctgcgagagg atatgggttc catgtgcccg 2221 tgttcctcac agccaagggt ttgaaagcca cgctggtctt gtaagattcc cattccttcc 2281 cagctgacat gcttgtggtc ccttccaact cagtggggag agggctcagt gctgttcgcc 2341 cccccccccc ccaattctcc atgtcagggt aagagatcta ggcactacat atgtaattgg 2401 taaagaaaag gcattttctc gagagttata actatgtaaa cattgtataa tgatatgaaa 2461 taaaatgcac attgtaggac attttcta // LOCUS RSPNIFHD 1375 bp ds-DNA BCT 28-JUN-1990 DEFINITION R.rubrum dinitrogenase reductase (nifH) gene, complete cds; and dinitrogenase alpha subunit (nifD') gene, 5' end. ACCESSION M33774 KEYWORDS dinitrogenase alpha subunit; dinitrogenase reductase. SOURCE R.rubrum DNA. ORGANISM Rhodospirillum rubrum Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1375) AUTHORS Lehman,L.J., Fitzmaurice,W.P. and Roberts,G.P. TITLE The cloning and functional characterization of the nifH gene of Rhodospirillum rubrum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.J.Lehman, 18-APR-1990. FEATURES from to/span description pept 367 1254 dinitrogenase reductase (nifH) pept 1370 > 1375 dinitrogenase alpha subunit (nifD) binding 355 358 ribosome binding site binding 1357 1361 ribosome binding site signal 145 160 upstream activating sequence signal 175 190 upstream activating sequence signal 280 296 ntr promoter BASE COUNT 287 a 404 c 431 g 253 t ORIGIN 1 ggggatccca aggagggacc ctccgtcctg accaagaact gtgcgttcgt gttttcgccc 61 attcctttat caccgatcgg gcggcgacgg gaagggcgtt ctgggcagta gcgcaccatc 121 gcgcatagcg tcgcgccgcc ccgctgttgg gtccgcgaca gggttttggg gcgttgtcgg 181 gtttgcgaca cggttgaaaa taaccctaca ttccttgtta gatagattta ttcctttgat 241 taaaaacgat aattttcaaa aaaatcgggc aaatcaggtt tggcacgctg ggtgcattaa 301 cggggtgagc gtggcggtgg agcttccccg ccttaggacc ggatcacgaa gaaaaggatt 361 gattccatga gcgcactgcg tcagatcgcg ttctacggca agggcggtat cggcaagtcg 421 acgacgtccc agaacacgct ggccgctctg gtcgaaatgg gccagcggat cctgattgtc 481 ggctgcgatc ccaaggccga ttccacccgc ctgatcctga acaccaagct gcaggacacc 541 gttcttcatc tggccgccga ggccggctcg gtcgaagacc tcgacgtcgc cgatgtggtg 601 aagatcggct acaagggcat caagtgcacc gaatccggtg gtccggagcc cggcgtcggc 661 tgtgccggcc gtggcgtgat caccgccatc aacttccttg aggaaaacgg cgcctacgac 721 gacctcgatt acgtctctta cgacgtgctg ggcgacgtgg tttgcggtgg cttcgccatg 781 ccgatccgcg agaacaaggc tcaggaaatc tacatcgtca tgtcgggcga gatgatggcg 841 ctgtatgccg ccaacaacat cgccaagggc attttgaaat acgcccacac cggtggggtt 901 cgtctgggcg gtctgatctg caacgagcgt cagactgata aggaagtcga actggccgag 961 gccctggccg gccgtttggg ctgccgcctc atccacttcg tgccgcgcga caacggcgtt 1021 cagcacgccg agctgcgccg ccaaacggtg atccaatacg cccccgacag caagcaggcc 1081 ggtgagtatc gcacgctggc gaccaagatc cacaacaact ccggccaggg cgtggtgccg 1141 accccgatca ccatggaaga cctggaagag atgctgatgg aattcggcat catgaagtcc 1201 gacgaagagg cgctggccga gctcgaagcc aaggaatccg ccgccgccaa ttaaggccgt 1261 cgccgtgcga cgttcccccc ggccggccgg tcaggcgcag gggcgacgga agcgggcgac 1321 ctccaagaag gcctcttcgc gaggcgatag tcgatcagag gagagggata tgagc // LOCUS YSCMTFDEL 70 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial novel junction sequence. ACCESSION M34150 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review FEATURES from to/span description site 33 44 novel junction site BASE COUNT 11 a 25 c 19 g 15 t ORIGIN 1 cttcggagtt cggccccccc ataagggggg gacctcactc cttccccact gcactggatg 61 cgggacttat // LOCUS YSCMTGAP12 457 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial gap 12 sequence. ACCESSION M34149 X15185 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 457) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review COMMENT The gap 12 sequence is a segment of yeast mitochondrial DNA that had not been determined previously. BASE COUNT 161 a 57 c 57 g 182 t ORIGIN 1 tacaatttat aatttaataa agaaggaaat aaataataat aactcctttt ggggttccgg 61 tggggttcac acctttataa ataataaata aagatgttta ctcctcttcg gggttcggtc 121 ccctttttgg gttccggaac taattaatat tttatataat aataataata tattaatata 181 atttcattat taataaatat ctcctgcggg gttcggtccc ccccgtaagg ggggggtcct 241 cactccttcg gagcgtacta ttattataaa taattatata ttataatata attaaaaagt 301 attataattg aaacgaaaat tgtaatttta aatggaataa taattattat atatttaata 361 tatttaataa agttataata tctctttcta ccggactatt ttattttatt ttattttatt 421 tttataaaga aaaatagtat aatattatct tctcctc // LOCUS YSCMTORI1A 70 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial ori1 gene, 5' flank. ACCESSION M34153 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review BASE COUNT 19 a 11 c 24 g 16 t ORIGIN 1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag 61 ggggtgggtg // LOCUS YSCMTORI2A 70 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial ori2 gene, 5' flank. ACCESSION M34152 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 32 > 70 ori2 mRNA (+/- 4 bp) BASE COUNT 23 a 0 c 18 g 29 t ORIGIN 1 tttttttata agataatttt tgtaaatata taagtaataa attaagtttt atagggggag 61 ggggtgggtg // LOCUS YSCMTORI7A 70 bp ds-DNA ORG 28-JUN-1990 DEFINITION S.cerevisiae mitochondrial ori7 gene, 5' flank. ACCESSION M34154 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review BASE COUNT 19 a 11 c 24 g 16 t ORIGIN 1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag 61 ggggtgggtg // LOCUS CFXCSMA 859 bp ds-DNA BCT 28-JUN-1990 DEFINITION C.aurantiacus 5.7 kd chlorosomal protein (cmsA) gene, complete cds. ACCESSION M33964 KEYWORDS chlorosomal protein. SOURCE C.aurantiacus (strain J-10-fl) DNA. ORGANISM Chloroflexus aurantiacus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Multicellular filamentous green bacteria; Chloroflexaceae. REFERENCE 1 (bases 107 to 554) AUTHORS Theroux,S.J., Redlinger,T.E., Fuller,R.C. and Robinson,S.J. TITLE Gene encoding a 5.7 kd chlorosomal protein in Chloroflexus aurantiacus: Regulated message levels and a predicted carboxy terminal protein extension JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 859) AUTHORS Theroux,S.J., Redlinger,T.E., Fuller,R.C. and Robinson,S.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.J.Robinson, 01-MAY-1990. Author address: S.J.Robinson University of Massachusetts 108 Morrill Science Center Amherst, MA 01003 FEATURES from to/span description pept 210 452 5.7 kd chlorosomal protein (cmsA) mRNA 135 > 452 cmsA mRNA binding 196 204 ribosome binding site rpt 464 495 inverted repeat BASE COUNT 158 a 232 c 283 g 186 t ORIGIN 1 gatcggcgac ttccggcggg tcattacgct gccgaccagt ctggccggcc tggaagccgg 61 tcaggccgag atggagggtg aatggttgat tgtccccttc atggcgccgc aaatggcgca 121 acgctgaggc agcctgaact gtggggtttc tcggaggctt gcctccggtg acctgaataa 181 tggagtccgt ttgataagga ggtgtgtgca tggcgacgag aggctggttc tcggagtcgt 241 cggcgcaggt ggcgcaaatc ggcgacatca tgttccaggg ccactggcaa tgggtctcga 301 atgcgctaca ggccaccgcg gcagcggttg acaacatcaa ccgcaatgct tacccgggcg 361 tgtcccggag cggctcgggc gagggagcgt tcagcagcag cccgagcaac ggcttccgtc 421 cgaagcgcat tcgctcgcgc ttcaaccgct aggctggcgg gcaggcgtga ggcgtgcgtt 481 ctgtcgcctc acgccgatcc ctcgggagcg ttatgattaa cctgctctac tacagtcata 541 ccaccgtacc ggctcatctg gaagagatta cgcgactgga tgtgctcgat cctgacgagc 601 aagtgctggt cgcgatagat ggcgttttgg tcgatcagca gggacagcgg ctgagcggcc 661 caacgctgca cgattattgt ctgatcaccg acttgcgggt gatcctgtgg gcacgagatt 721 atggccggca tttgtgttat gcctttccat tgcatgagct gacgttgatt gatgggcgcg 781 ggattgatcc gatccacggc gcagtgacga tgggctttag cgcgccagat accgatgatc 841 agatttttac gctgactct // LOCUS VVCACTA 3385 bp ds-DNA PLN 28-JUN-1990 DEFINITION Volvox carteri actin gene, complete cds. ACCESSION M33963 KEYWORDS actin. SOURCE Volvox carteri (female strain HK10) DNA, clone lambda-VAc21. ORGANISM Volvox carteri Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Volvocaceae. REFERENCE 1 (bases 1 to 3385) AUTHORS Cresnar,B., Mages,W., Mueller,K., Salbaum,J.M. and Schmitt,R. TITLE Structure and expression of the single Volvox carteri actin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Mages, 01-MAY-1990. Author address: W.Mages Lehrstuhl fuer Genetik Universitaet Regensburg Universitaetsstr.31, D-8400 Regensburg, FRG FEATURES from to/span description pept 450 509 actin, exon 2 (first expressed exon) 613 708 actin, exon 3 780 926 actin, exon 4 1009 1074 actin, exon 5 1150 1197 actin, exon 6 1396 1567 actin, exon 7 1667 1821 actin, exon 8 1959 2142 actin, exon 9 2425 2630 actin, exon 10 pre-msg 195 > 2630 actin mRNA and introns IVS 258 439 actin intron A IVS 510 612 actin intron B IVS 709 779 actin intron C IVS 927 1008 actin intron D IVS 1075 1149 actin intron E IVS 1198 1395 actin intron F IVS 1568 1666 actin intron G IVS 1822 1958 actin intron H IVS 2143 2424 actin intron I signal 120 124 CAAT box signal 164 169 TATA box BASE COUNT 775 a 830 c 928 g 852 t ORIGIN 1 aagcttgaag agaggtctta tggaccctag tcggtcacga tccagacacc aaacgtaacg 61 aaaaaaagga ataattagcc gggcttgttg atgcatattc gcggctttac attatcccgc 121 caatacgttc acgtacggca caggacatcc ttttcactcg agctataaac tatcatcatc 181 cagattgctt gttttacctt gcaccagtta ctatcagcac caaccggccc ttgacttcgc 241 acagccttct aaaaaaggta cgcgcgacgt gtttagcgtt cgggcagccg cagctaggag 301 gtttggtggc tgcgaggccc ttctgcccca ggataagaag gggattaact tccgttgcca 361 taacatgaag tccggggcct tgtccatgtt cgatcgccaa cgatgcaacc tatgctgtct 421 taccgctggg cttatgcagg gcagcaacaa tggctgagga gggcgaggtt tctgctctgg 481 tgtgcgacaa cggctcgggc atggtcaagg tacgttttct ggctgctgct cggttgggcc 541 cggccaggcc caccccgcag ccttgctgag cttttgatgg atcgcatcta acacgtggcc 601 cctgatttgt aggctggctt tgctggtgat gatgccccac gagctgtgtt tccgagcatt 661 gttggtcggc cccgccatac gggtgtgatg gtcggcatgg ggcagaaggt gaatagaatg 721 acggcgatgc aggcacggac tcctcgacat ttttataatg atctttttgt tttcgccagg 781 attcctacgt gggcgacgag gcacagtcta agcgtggtat cttgacgttg cgctacccta 841 ttgagcacgg tatcgttacc aactgggatg acatggagaa gatctggcac cacactttct 901 ttaatgagct tcgtgtggca ccggaggtgc gtctgcgtac tagcttcctt ggctttcagc 961 tatggaccag ttcgacattg acaactgctt tccttcccaa atctgtagga gcacccagtc 1021 cttctcacgg aggcgcctct taatcccaag gcgaaccgtg agaagatgac gcaggtacgg 1081 atattgtttt atgcacatac cgttgcgttc gttcccaagc ttgcctcccg ataatctccg 1141 tttgcgcaga tcatgttcga gaccttcaac gtccctgcca tgtatgtggc gatccaggtg 1201 acgattgggg gaggggactt agtaatacct cgattgctgt tgttcgattc acccctctca 1261 tttcgatttg cgcacttggg gtggcatcat gtttcgtgcg tcggggtagt tgttaaccct 1321 gcaaccctag ctactgccac taatgacagt aggaaagtaa tatttaattg ctgtccttca 1381 tgctgacaac tgcaggccgt gctgtcgctg tatgctagcg gtcgtacgac gggtatcgtt 1441 ctagattccg gtgacggtgt aacccacacg gtgcccatct atgaaggcta cgcgctgccg 1501 cacgccatcc tgcgtctgga ccttgccggc cgtgacctga cggactacct gatgaagatt 1561 ctaatgggtg aggcattcgg atagggggac caaaataccg taccgcgaga aatgggttat 1621 gagtcgtgtg ttttgacttg tgttgaacgg cttcccttcg ttacagagcg tggttactca 1681 ttcacgacca ctgctgagcg tgaaatcgtg cgcgacatca aggagaagct ctgctatgtg 1741 gcgctggatt tcgagcagga aatggcaaca gcagcctcca gctcggcgct tgagaagacg 1801 tacgagctgc ctgatggcca ggtgcggccg tagctagacg ccgacgaata gcctaaaact 1861 ttcggtgatg atcagcttag atacatttga ttgtgtcggc attgcacgaa aaaagtcgtt 1921 gtccataatg ttaaaattgc cctcttttct ccctgcagcc aatcacaatt ggcaacgagc 1981 gcttccgttg ccctgaggtg ctgtacaacc ctagcctaat cggcatggag gcggtcggta 2041 tccacgacac tacctttaac agcatcatga agtgcgatgt cgatatccgc aaggatctgt 2101 acaacaacat cgtgctctct ggtggaacga ccatgttccc tggtaagcaa tgcatactac 2161 tgttagactt caggacatgt tgtgatcgca ttagcgatta gaaccgggtt agcgcgggac 2221 agacaggagg tacggtgttt ggggctcggg gaactcttat gaggatgaat aagcggttct 2281 tagcgtctat gtcacggacg gatctatgga attccgtcag tatctggagg cttgcatgtt 2341 gcatcgctcc ccctgaactc catcgtcctc cgtgaaggtc acagtgcctt gacaaatgcg 2401 atctgtgctg ttgaacaatt gcaggcatcg ctgatcgcat gaccaaggag atcacggccc 2461 ttgcgccgtc tgccatgaag atcaaggtcg tcgccccgcc ggagcgcaag tactctgtgt 2521 ggattggtgg ttcaatcctg gcgtccttgt cgaccttcca gcaaatgtgg atcgcgaaga 2581 gcgagtacga cgagagtggt ccgtcgattg tgcaccggaa gtgcttctaa gcataatagc 2641 ttgcggactg ttattcggcc tctttcggtt ataactgacc ggaagcgggt tactcttcag 2701 tcaaggcgga tgcctatgat tttgcaaaaa ttcgagcgga gatgtcatct tggtttcaag 2761 ctgcgtgggt gagaaatcaa ctgtgggatc catgtcgaca gcttaacaac agtctctacc 2821 atttcatggt atgcggctgt agccgtagcg agtacgtcgt catgagcagc ggtagtttga 2881 tcgtgcgaag tagtctgata ggggcagggc gcgtggggat ggctaataga acgaagagga 2941 tgtgtgcatt caagcgagtg gatgtcggcg attggtctag gccaagcaaa gtgcgtgtaa 3001 gggaaagctg gatgacacga ccattgctcg tgtgctttcg gaacttccac tcttgaactc 3061 aggcctatgc gctgccttga gtatcctgcc ggataaaagg cgggaggcgg taacgttagt 3121 agacccttcc aggggtataa atggtcaatt acaacaaaac tgtaacataa cgaagcgctg 3181 cacgtagtac gaaatccgtt gcaaacgtga aacccagcta cttgccaata cttaaacgcg 3241 catcagcatt gtgtacaaga acgtcaatga ggcattgcgt aaattacaca tgtgagcact 3301 aagaaaacag ctgtatgttg tcacgtttat tcactagcgc acagcccgca agtaagcggt 3361 ggcaaagccc agctttggta caaag // LOCUS HAMCD44 1089 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Hamster hyaluronate receptor (CD44), complete cds. ACCESSION M33827 KEYWORDS CD44 hyaluronate receptor. SOURCE Hamster, cDNA to mRNA. ORGANISM Cricetulus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1089) AUTHORS Aruffo,A., Stamenkovic,I., Melnick,M., Underhill,C.B. and Seed,B. TITLE CD44 is the principal cell surface receptor for hyaluronate JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Seed, 23-APR-1990. FEATURES from to/span description pept 1 1089 hyaluronate receptor (CD44) BASE COUNT 299 a 282 c 277 g 231 t ORIGIN 1 atggacaagt tttggtggca cgcagcttgg ggactctgcc tcttgccgct gagcctggcg 61 cacgagcaga tcgatttgaa cataacctgc cgctatgcag gtgtattcca cgtggagaaa 121 aatggccgct acagcatctc acggactgag gcagctgacc tctgccaagc tttcaacagc 181 actctgccca ccatggacca gatggtgatg gccctgagca agggctttga aacatgcagg 241 tatgggttca tagaaggcca cgtggtgatc ccgaggatcc agcccaatgc catctgtgca 301 gccaaccaca ctggggtgta tatcctcaca tccaacacat ctcactacga tacatattgc 361 ttcaatgcct cagcacccct tgaagaagac tgtacatctg tcacagacct gcccaattcc 421 ttcgaaggac cagttaccat aactattgtc aaccgtgatg gtacccgcta cagcaagaag 481 ggcgagtata gaacacacca agaagacatt gatgcctcaa ataccacaga tgatgatgtc 541 agcagcggat cctccagtga gaagagcacc tcagggggct atgttttcca cacctacctt 601 cccactatac actcaactgc agaccaggat gatccctact tcatcgggag caccatggcc 661 accagagacc aagactcatc catggatccc agggggaatt ccctcactgt gactgatgga 721 tccaaattaa ctgaacactc aagtgggaat caagacagtg ggcttaactc aacttctcgt 781 cctggaggaa aacctcgagt tccagaatgg ctcatcgtct tggcatctct cctggcgctg 841 gctctgattc ttgctgtttg cattgctgtc aacagtagga gaaggtgtgg acagaagaaa 901 aagctggtga tcaacagtgg caatggaaag gtggaggaca ggaagccaag tgagctcaac 961 ggggaggcca gcaagtctca ggaaatggtg catttggtga acaaggaacc atcagagact 1021 cctgaccagt ttatgacagc tgatgagacc cggaatctgc agaatgtgga catgaagatt 1081 ggggtgtag // LOCUS DROGNBPSA1 441 bp ds-DNA INV 28-JUN-1990 DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha subunit gene, exon 1. ACCESSION M33996 KEYWORDS guanine nucleotide-binding protein. SEGMENT 1 of 3 SOURCE D.melanogaster (strain Canton S) embryo DNA, clones lambda-DGs-[4,5,7,10,11,12]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 441) AUTHORS Quan,F. and Forte,M.A. TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by alternate splicing involving an unusual splice site JOURNAL Mol. Cell. Biol. 10, 910-917 (1990) STANDARD simple staff_entry FEATURES from to/span description pre-msg 362 > 441 guanine nucleotide-binding protein (Gs-alpha) mRNA and introns (alt.) pre-msg 174 > 441 Gs-alpha mRNA and introns (alt.) IVS 426 > 441 intron A signal 152 157 promoter region (put.) variant 365 365 c in DNA, g in cDNA BASE COUNT 140 a 89 c 85 g 127 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgcta ttctacatag ctcgggatac tttattttat tattaatatt cgctaaaata 61 gcacatcgtc atttggcata gatgtttttg gcccagtaac atatctatct acatcaccgc 121 ccgcactatc cgataggcag actgactatc gataaaagta tctacttcgt tgcatcccta 181 ggttgataca taaaataaaa aatatataaa aagtgagtta gtactggtat taaaagtaat 241 ttttaagcta caaattgatg aattgcgagc acacagagac taagaacatt gggaatccgg 301 aaaacagtgg ctggcgcgtc gggtgcgata gaattgcctg tttttcagac cccaaaagtc 361 gctcccaacc gagtcgtgca cctatagtct tggcccgaag ttgctcccat gaataaaaat 421 gttcggtaat ttcacggtaa a // LOCUS DROGNBPSA2 897 bp ds-DNA INV 28-JUN-1990 DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha subunit gene, exons 2,3 and 4. ACCESSION M33997 KEYWORDS guanine nucleotide-binding protein. SEGMENT 2 of 3 SOURCE D.melanogaster (strain Canton S) embryo DNA, clones lambda-DGs-[4,5,7,10,11,12]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 897) AUTHORS Quan,F. and Forte,M.A. TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by alternate splicing involving an unusual splice site JOURNAL Mol. Cell. Biol. 10, 910-917 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 265 540 guanine nucleotide-binding protein (Gs-alpha-S), exon 2 607 703 guanine nucleotide-binding protein (Gs-alpha-S), exon 3 760 + 880 guanine nucleotide-binding protein (Gs-alpha-S), exon 4 pep$ 265 540 guanine nucleotide-binding protein (Gs-alpha-L), exon 2 607 703 guanine nucleotide-binding protein (Gs-alpha-L), exon 3 760 + 880 guanine nucleotide-binding protein (Gs-alpha-L), exon 4 pre-msg < 1 > 897 Gs-alpha mRNA and introns IVS < 1 29 intron A IVS 541 606 intron B IVS 704 759 intron C IVS 881 > 897 intron D BASE COUNT 266 a 225 c 226 g 180 t ORIGIN About 1.4 kbp downstream of segment 1. 1 ccatctacta taccattgta cgaatgcagc ctgctgcaga gatccactga agcgctgagc 61 caacgccaac ggtggtgcca ctgagatcgg agatcggaga tcgcagaccg gagaccgacg 121 agcacacaag ggattgtggt ttgatccgag gaacccaagc actcagctat actggagcag 181 aacggagaaa gcacaagggg tgccgagtcg cccgacgcga accaggaaaa ccgagcgacg 241 tgttggtgga ccccgtgggc tgcgatgggt tgctttgggt cgcccacctc caagcagtcg 301 gacgtgaact cggaggactc gaagagccag aagcgccgga gcgatgcaat atctagacag 361 ttgcagaagg acaaacagct ctacagggcc acacacaggc tgctcctcct gggggcgggc 421 gagtccggca aatcaaccat agtcaagcaa atgcgaatat tgcatgtcga cggattttct 481 gactcggaaa agaaacagaa aattgatgat attaaaaaga atattcgaga cgctatcttg 541 gtgagttcca tcccgccggc catctaagcc caaacacttg cttaactcct tgccaatccc 601 ttacagacta ttacaggagc catgagcaca cttaatccac ctgtagcttt agaaaagaag 661 gaaaatgaac ccagagtgga gtacattcag gattatgcat ctagtaagag cctctgccgc 721 cccaacttaa gttgttatca accattatct gccccgcagg tccggacttt aattatcctc 781 ctgaatttta tgaacataca gaagaactat ggaaagacaa gggcgttctt caaacctatg 841 agaggtcgaa tgagtatcaa ttaatcgatt gtgcgaaata gtaagtaatg cccaccc // LOCUS DROGNBPSA3 1380 bp ds-DNA INV 28-JUN-1990 DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha subunit gene, exons 5,6,7,8 and 9. ACCESSION M33998 KEYWORDS guanine nucleotide-binding protein. SEGMENT 3 of 3 SOURCE D.melanogaster (strain Canton S) embryo DNA, clones lambda-DGs-[4,5,7,10,11,12]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1380) AUTHORS Quan,F. and Forte,M.A. TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by alternate splicing involving an unusual splice site JOURNAL Mol. Cell. Biol. 10, 910-917 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 4 132 guanine nucleotide-binding protein (Gs-alpha-S), exon 5 206 385 guanine nucleotide-binding protein (Gs-alpha-S), exon 6 445 569 guanine nucleotide-binding protein (Gs-alpha-S), exon 7 746 828 guanine nucleotide-binding protein (Gs-alpha-S), exon 8 909 1055 guanine nucleotide-binding protein (Gs-alpha-S), exon 9 pep$ + 4 132 guanine nucleotide-binding protein (Gs-alpha-L), exon 5 206 385 guanine nucleotide-binding protein (Gs-alpha-L), exon 6 445 569 guanine nucleotide-binding protein (Gs-alpha-L), exon 7 755 828 guanine nucleotide-binding protein (Gs-alpha-L), exon 8 909 1055 guanine nucleotide-binding protein (Gs-alpha-L), exon 9 pre-msg < 1 1115 Gs-alpha mRNA and introns (alt.) pre-msg < 1 1176 Gs-alpha mRNA and introns (alt.) pre-msg < 1 1243 Gs-alpha mRNA and introns (alt.) IVS < 1 3 intron D IVS 133 205 intron E IVS 386 444 intron F IVS 570 745 intron G (for Gs-alpha-S) (no splice consensus) IVS 570 754 intron G' (for Gs-alpha-L) IVS 829 908 intron H variant 780 780 t in DNA, c in cDNA signal 1083 1088 polyA signal signal 1154 1159 polyA signal signal 1221 1226 polyA signal BASE COUNT 441 a 265 c 254 g 420 t ORIGIN About 500 bp downstream of segment 2. 1 aagcttcctg gaccgagtga gcacaatcaa gaatccaaac tacaccccta atgagcagga 61 tattcttcgg tgccgtgttt tgacttctgg aatatttgaa acaagatttc aagtggacaa 121 agtaaacttt cagtaagtgt aatcccaagt gatatcgcaa tattttagtt tctattccat 181 tctaatgata tgcattcgtg tgcagcatgt tcgatgtcgg tggccagcgg gacgagcgta 241 ggaaatggat tcagtgtttc aatgatgtaa ctgctatcat attcgtaact gcgtgctcaa 301 gttataacat ggttttgcgg gaagatccca cccagaaccg acttcgagaa tctttggatt 361 tgttcaagag tatttggaac aacaggtgaa ccgagcaatt tggcgccatc tcgacaaatg 421 gaaactaacg agttgttctt tcagatggct tcgcacgatt tctattatac tatttttaaa 481 taagcaagat ttgttagcag agaaaattaa ggctggaaaa agtaaattgt cggaatattt 541 ctccgagttt aacaaatacc aaacgccaag taagtaaaat agtataaccc atagtagagg 601 ccaattgagg gcgagggagt catcagatca accgactcag atctggccaa gaaattcgca 661 attccttttc aaatttgcaa accgtgccca agttaactta agaaatttaa tggataatgt 721 attttgtcta atttcaaatg tgctgtcgac acaggtgacg caataatgga atccaatgat 781 gacccagaag taatacgagc aaaatatttc atacgagacg agtttctggt gggttatatc 841 acatctccgt ttcttcctcc acctatctaa ctagccttaa tccttaatgg gatctctttc 901 ttatatagcg tatatctacc gctagcggag acggaaaaca ctactgctat ccacatttca 961 catgcgccgt tgacacagaa aacattaaac gtgtgtttaa tgattgcaga gacattattc 1021 aaaggatgca ccttcgtcaa tatgaattgt tataggttat ccccatcgcc gtaatgcaag 1081 taaataaaaa tattaatgac ttactgtttt aaatataatt atcaaatgta atttgtattt 1141 aaatagtccc ttgattaaaa caaaaaattc gctgcattta gatctggtat gacctttcac 1201 aaaactcaac attaaagaat aataaataaa catgttaata acagacacgc ttgtttcgat 1261 tattttggct attctaccag gatgcacctg ctttattttc tatagtccct tgacatgaag 1321 cattttagag tatttccgct atcagagtcc gtatccatat ccatgggcat ttacattcac // LOCUS SYCSBEFA 850 bp ds-DNA BCT 28-JUN-1990 DEFINITION Synechocystis sp. cytochrome B559 alpha subunit (psbE) and beta subunit (psbF) genes, complete cds. ACCESSION M33897 X06988 KEYWORDS cytochrome B559 alpha subunit; cytochrome B559 beta subunit; photosystem II protein. SOURCE Synechocystis sp. (strain PCC6803) DNA, clone pKW1261. ORGANISM Synechocystis sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 553) AUTHORS Pakrasi,B.P., Williams,J. and Arntzen,C. TITLE Targeted mutagenesis of the psbE and psbF genes blocks photosynthetic electron transport: Evidence for a functional role of cytochrome B559 in photosystem II JOURNAL EMBO J. 7, 325-332 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 850) AUTHORS Pakrasi,H.B., Nyhus,K.J. and Granok,H. TITLE Targeted deletion mutagenesis of the beta subunit of Cytochrome B559 protein destabilizes the reaction center of photosystem II JOURNAL Z. Naturforsch., C, Biosci. 45, 423-429 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Himadri, 01-MAY-1990. Washington University Dept. of Biology Campus Box 1137 St. Louis, MO 63130 FEATURES from to/span description pept 121 366 cytochrome B559 alpha-subunit (psbE) pept 403 537 cytochrome B559 beta subunit (psbF) binding 109 112 ribosomal binding site (put.) binding 392 396 ribosomal binding site (put.) BASE COUNT 197 a 196 c 197 g 260 t ORIGIN 1 gctaccccag caccgaggtg gcaatggttc cctaggcggc tcacaaaata gtagactaga 61 ctctacttgc tttgcatttg tcagtcaatg ttgttttgaa aaattgaagg agaacacaaa 121 atgtcaggga ctaccggcga gcgtccattt tccgatattg tcaccagcat tcgctactgg 181 gtgatccaca gcatcaccat cccgatgttg tttattgctg gttggttgtt tgtcagcacg 241 ggcttagcct acgatgcttt tggcactccc cgccccgatg aatatttcac ccagacccgt 301 caagagttgc ccattctcca ggaacgctac gacattaatc aggaaattca agagtttaat 361 caataaaaca tttaattgtt cttttttagt tggtaattaa caatggcaac ccaaaatcct 421 aatcaaccgg ttacttatcc catttttacg gtgcgctggc tggcggttca caccctggcg 481 gtgccctctg tcttctttgt cggggcgatc gccgcgatgc aatttattca acgctaggag 541 tttttcatgg acagaaattc aaacccaaac cgccaaccgg tggaattgaa ccgcacttct 601 ttatacctgg gtctattgtt ggtggctgtg ttggggattt tgttctccag ctatttcttt 661 aactaaactt ttttaatacg caatttagga ggcatggtat gttcgcagaa ggcagaatcc 721 ctttgtgggt ggtgggtgta gtggccggta ttggcgccat tggtgttcta ggattatttt 781 tctacggagc ctatgctggt ttaggttctt ccatgtaatc gagggctagc cgccacacaa 841 tatcatggtt // LOCUS ECOPURCA 2060 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.coli 5-phosphoribosyl 5-aminoimidazole 4-N-succinocarboxamide synthetase (SAICAR) and dapX gene, complete cds. ACCESSION M33928 KEYWORDS 5'-P-ribosyl 5-aminoimidazole 4-N-succinocarboxamide synthetase; SAICAR synthetase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2060) AUTHORS Tiedemann,A.A., DeMarini,D.J., Parker,J. and Smith,J.M. TITLE DNA sequence of the purC gene encoding SAICAR synthetase and organization of the dapA-purC region of Escherichia coli K12 JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (sites only) AUTHORS He,B., Shiau,A., Choi,K.Y., Zalkin,H. and Smith,J.M. TITLE Genes of the Escherichia coli pur regulon are negatively controlled by a repressor-operator interaction JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Authorin sequence for [1] kindly submitted by J.M.Smith, 27-APR-1990. Seattle Biomedical Research Institute 4 Nickerson Street Seattle, WA 98109-1651 e-mail: jms0234@max.acs.washington.edu FEATURES from to/span description pept 1281 1994 5'-phosphoribosyl-5-aminoimidazole-4- N-succinocarboxamide synthetase (EC 6.3.2.6) pept 637 1068 dapX protein signal 1106 1111 -10 region binding 1112 1127 purine repressor protein binding site (PUR box) site 1180 1216 repetitive extragenic palindrome BASE COUNT 501 a 504 c 587 g 468 t ORIGIN 1 gcatgccggt ttgctgtaaa gtttagggag atttgatggc ttactctgtt caaaagtcgc 61 gcctggcaaa ggttgcgggt gtttcgcttg ttttattact cgctgcctgt agttctgact 121 cacgctataa gcgtcaggtc agtggtgatg aagcctacct ggaagcggca ccgcttgcgg 181 agccttcatg ccccggctgg aatgattttg ccggtgacct ccggtgttat gcaatcccgg 241 tgaccaacgg tagtggtgct gtcggtaagg cgctgacatt cgtccaccag cccagccgct 301 ggcactggtt tctggcgcgc gtacccagtt cacgggcgat accgcttcat tgctggtgga 361 aaatggtcgt ggcaatactc tgtggccgca ggtggttagc gtgctgcagg cgaaaaacta 421 caccatcacc caagtgatga tgctggtcag acactgacca ccgattgggt acaatggaac 481 cgtctggacg aagacgagca gtatcgtggt cgttatcaaa tctctgttaa gccgcagggt 541 tatcagcagg cggttacggt taaactgctg aacctggaac aggcgggcaa accggttgca 601 gacgcggctt ccatgcagcg ttacagcacg gagatgatga acgttatttc cgccggtctg 661 gataaatctg ccactgacgc cgcgaaccgt gcgcaaaatc gtgcctccac cactatggac 721 gtacaaagtg cagctgatga caccggttta ccaatgctgg tcgtacgcgg gccgttcaat 781 gtggtttggc aacgtctgcc agcggcgctg gaaaaagtgg gcatgaaagt gaccgacagc 841 acccgttcgc agggcaacat ggccgtaact tataagccgc tgtctgacag cgactggcag 901 gaactgggcg cgagcgatcc aggcctggca tccggtgact ataaactgca ggttggcgat 961 ttagataacc gcagcagcct acagttcatc gatccgaaag gtcatactct gactcagagt 1021 cagaacgacg cgctggtagc tgtcttccag gctgcgttta gcaagtaaaa atacagggct 1081 ggaatcatcc ggcccttttt tctgatatga tacgcaaacg tgtgcgtctg caggaaaacg 1141 cgattttagc ggtaattcgc acgaaatttg tttgtcggac gtagttcgga taaggcgttc 1201 acgccgcatc cgacaaaaca tccggcacac cagacagcaa aagattttaa aacgttaatt 1261 cacacccagg agtgataaag atgcaaaagc aagctgagtt gtatcgtggt aaagcgaaaa 1321 ccgtatacag cacggaaaac ccggacctgt tggtgctcga attccgcaat gatacgtcag 1381 caggggatgg cgcgcgcatt gagcagtttg atcgcaaagg tatggtgaac aacaagttca 1441 actacttcat tatgagcaaa ctggctgaag cgggtatccc gactcaaatg gagcgtctgc 1501 tctccgatac cgaatgtctg gtgaaaaagc tggatatggt gccggttgag tgtgtcgtgc 1561 gtaaccgtgc tgctggctct ctggtgaaac gtcttggaat cgaagaaggt attgagctga 1621 acccgccgct gttcgatctg ttcctgaaaa acgacgccat gcacgatccg atggtcaacg 1681 aatcttactg cgaaaccttt ggctgggtga gcaaagagaa cctggcgcgt atgaaagagc 1741 tgacctacaa agcgaacgac gtgctgaaaa aactgttcga tgatgctggt ctgattctgg 1801 tcgacttcaa gctggaattt ggtctgtaca aaggcgaagt ggtactgggt gatgagttct 1861 ccccggacgg tagccgcctg tgggacaaag aaacgctgga gaaaatggac aaagaccgtt 1921 tccgccagag cctcggtggc ctgatcgaag cctatgaagc cgtcgcccgc cgcctgggtg 1981 tacagctgga ctgatttttc tgttcatcat cttgccgtgc agctggcacg gcaagacaac 2041 cgctctcgga atatatgcat // LOCUS PIGSGLT1A 3649 bp ss-mRNA MAM 28-JUN-1990 DEFINITION Pig Na+/glucose cotransporter protein (SGLT1) mRNA, 3' end. ACCESSION M34044 KEYWORDS Na+/glucose cotransporter. SOURCE Pig kidney cell line LLC-PK1, cDNA to mRNA, clone pPSGT-[B1,C2]. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 3649) AUTHORS Ohta,T., Isselbacher,K.J. and Rhoads,D.B. TITLE Regulation of glucose transporters in LLC-PK1 cells: Effects of glucose and monosaccharides JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Ohta 03-MAY-1990. Mass General Hospital Building 49 13th Street Charlestown, MA 02129 FEATURES from to/span description pept < 1 1818 Na+/glucose cotransporter (SGLT1) signal 1935 1940 polyA signal (put.) signal 2086 2091 polyA signal (put.) signal 2238 2243 polyA signal (put.) signal 3205 3210 polyA signal (put.) signal 3628 3633 polyA signal (put.) BASE COUNT 905 a 790 c 871 g 1083 t ORIGIN 1 ttcttcctgg ctggacgaag tatggtgtgg tggccggttg gagcttctct gtttgccagt 61 tatatcggaa gcgggcattt tgttgggctg gcaggaactg gagcagctgc aggcattgcc 121 actggtggct ttgaatggaa tgctctgatt tgggtggttg ttctgggctg gctgtttgtc 181 cccatttaca ttaaggcagg ggtggtgacg atgccagagt atctgcggaa gcgatttgga 241 ggcaagcgga tccaggtcta cctctctatc ctgtccctga tgctctacat tttcaccaag 301 atctcggcag acatcttctc tggggccata ttcatcacct tggccttggg cctggatctg 361 tacctggcca tctttctctt actggcaatc actggccttt acacaatcac agggggcctg 421 gctgctgtga tttatacgga taccttgcag acagcaatca tgctggtggg gtcttttatc 481 ctaactgggt ttgcttttca tgaagtggga ggttatgatg ccttcatcga aaaatacatg 541 aatgccattc cgactgtgat ttctgatgga aatatcacca tcaagaaaga atgttatgcc 601 ccgagggctg actccttcca tattttccga gatcctctca agggagacct gccatggcct 661 gggctcacct ttgggctgtc catcctcgcc ctgtggtact ggtgcacaga tcaggtcatt 721 gtgcagcgct gtctctcggc caaaaacatg tcacatgtga aggctggctg cgtcatgtgt 781 gggtacttta agctgctgcc catgtttgtc atagtgatgc cagggatgat cagccgcgtc 841 ctgtacacag aaaaaattgc ctgcaccgtc ccctcggaat gtgagaaata ttgcggcacc 901 aaggttggct gttccaacat tgcctacccg accttggtgg tggaactcat gcccaatgga 961 ttgcgaggcc tgatgctgtc ggtcattttg gcctctctca tgagctccct gacctccatc 1021 ttcaacagcg ccacgactct cttcaccatg gatgtctacg ccaagatccg gaagagagca 1081 tctgagaaag agctcatgat tgcgggaagg ctgttcatcc tggtgctgat tggcatcagc 1141 attgcctggg tgcccattgt gcagtcagca caaagtgggc agctcttcga ttacatccag 1201 tctgtcacca gttacttggg accacccatc gcagctgtct tcctacttgc tattttctgc 1261 aagagagtca atgaggaggg agccttttgg ggactggtca taggatgtat gattgggctg 1321 gcccgtatga ttaccgagtt tgcctatgga accgggagct gcgtggagcc cagcaactgt 1381 cccacaatta tctgtggtgt gcactacttg tactttgcca tcatcctctt tgtcatttcc 1441 atcatcatcg tcctggtcgt ctccctcttc accaagccca ttccagatgt gcatctctac 1501 cgcctgtgtt ggagcctgcg caacagcaaa gaggagcgta ttgacctgga tgcagaggag 1561 gaggacattc aagaagcccc agaagagacc attgaaatag aagttcctga ggagaagaaa 1621 ggatgcttca ggaggaccta tgacctgttt tgtggcctgg accagcagaa gggccccaaa 1681 atgaccaagg aagaggaggc agccatgaag ctgaagatga cagacacctc agagaagcct 1741 ttgtggcgga cagtagtgaa catcaatggc attatcctgc tgaccgtggc cgtattctgc 1801 catgcatatt ttgcctgaaa tcctgccttc tgccacaggc tgtgccctga aggctggact 1861 ctttactctg tttcctttta gtcccattct gtgctgtcaa agggacacca gccagttgta 1921 aattttgcct agtcaataaa tgtgtacatt tgtaattagc cagaaaaaaa aatcagtgat 1981 ctgttattaa cttatgtatt tgaggccagt gtcatacagt catctgtaca tatcagacct 2041 gcagaaggaa gtccactcag tcggatgtct aggaaaaggc agactaagaa acagaagtcc 2101 cgtgatgtct gatgcaagtt tgtctcaggt agattcagca tgtcagtgtt gtttgtgatc 2161 cttgaatgtt attttaagag ttgtggtcgc cctggttcgg ccacttccta taaacctcac 2221 ctctcataat ttttcaaaag aaaaaatccc ccccctcccg atatcatgtc aagtttttcc 2281 ggcacaagag taacaatttg ataagttgat gaacacattg aagctgagga tggaagtttg 2341 atggagtccc atcctgggtc tttgctctcc aaaggagagg tctgggatgg taatacttcc 2401 cccaaagaga gggacccagg tgccaatctc aggccactag ggaaatcagg taacttgagt 2461 caaggcgatt ttctgcattt cttagaagtg tccattgtga gttccttggg ttgagatttc 2521 attcattttg gcatatcttt tgcctgtact gtcttagtca ctcagcatgt ctgtggacca 2581 aatgaatgat ataacatttt gttttataat gtttgcttta tttttggtta actttttctt 2641 atggctgcac aagtacaacc tctcgtgact ataatgaatt gggtgcattt ccataggcct 2701 gtcttcctca tggtgccaat gctggcatac aggtctagaa gctgactctc tcacataata 2761 gagcaaatgc acctcttgga ggcagccgtc atctcaaagt aaccacactc tcctgaaaca 2821 ctgaaacttt ttggcctgag accctccttc tatgcgctca catgaccata gctcatattt 2881 ttttgaggtg tcttgattga tgatcatcca ctgaccaagg ttagagtggg tggtctctgt 2941 tccaggtggt agcacttgac actcttattt cttgtttcct agggagaatt tttaaaaatt 3001 aaatttaatt aatttaattc ttttaattcc accagtgaaa tattaattat aatataatcc 3061 tgtggaagtc atggtgttgg tattgactgg tttcaagcta gcttggaaaa gagaagtgat 3121 ctgagattct gtggagctaa gagcggttcc ttgggtataa aagtggctta acaggtatac 3181 aatggcttgt tcttgtataa ggaaaagaaa attttgatat gccctatatt ccccttaaag 3241 ctcaggcaga gaagatgcca aaagtggaga atttcacatt attcagagta cttgtgggag 3301 tttgtgatgt ttttgagctt tgagaggata aagagggatt atagaaagtt ggggaatgga 3361 acaggcataa gtatctagat tgtcctagat tctgtttccg actttgccaa tgattcactg 3421 tgtatattgc agtaaattat tttgtctcaa ctgtgtattt tcctctttga aaaatggaga 3481 ccttggtgtt atagatctac ctacctcctc tgggtgtgac aaggtcattt tgaaaaggtt 3541 agatataata catgaaagat gccaagcaaa tttatattaa gcttcattaa tgtagtccca 3601 tacattaaat tttactactt tggtaaaaat aaaagacatt aaaaatgat // LOCUS RATSHIIA 3064 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Rat K+ channel protein (KShIIA) mRNA, complete cds. ACCESSION M34052 KEYWORDS K+ channel protein. SOURCE Rat brain, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3064) AUTHORS McCormack,T., Vega-Saenz de Miera,E.V. and Bernardo,R. TITLE Molecular cloning of a member of a novel class of K+ channel genes in mammals JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by R.Bernardo, 07-MAY-1990. FEATURES from to/span description pept 80 1771 K+ channel protein (KShIIa) BASE COUNT 777 a 718 c 729 g 840 t ORIGIN 1 gccacggtga acatctggcc cacgagagct ttaacttggt gctgtgttcg ccttccctag 61 tcatgtctga gccacagaga tgggcaagat cgagaacaac gagagggtga tcctcaatgt 121 cggaggcacc aggcacgaaa cctaccgcag cactctcaag acccttcctg gaactcgcct 181 ggcccttctc gcctcctctg aacctcaggg cgactgcctg actgctgcgg gtgacaagct 241 gcagccgctg ccccctccgc tgtctccacc gccgcgaccg cctcccttgt cccctgtccc 301 cagcggctgc ttcgagggcg gcgcaggcaa ctgcagttcg cacggtggca atggcagcga 361 ccaccctggg ggaggccgcg aattcttctt cgatcgccac ccaggagtct tcgcctatgt 421 gctcaactac taccgcacgg gcaagctgca ctgccccgcc gacgtgtgtg gaccgctctt 481 cgaggaagag ctggcattct ggggcatcga tgagaccgac gtggagccct gctgctggat 541 gacctacagg cagcaccggg acgcggagga ggccctggat atcttcgaga cacccgacct 601 catcggaggc gaccctggtg atgatgagga cctagggggc aagagactgg gcattgagga 661 tgctgcgggg ctgggaggac ccgatggcaa gtctggccgc tggaggaagc tgcagcctcg 721 catgtgggct ctctttgagg acccctattc atccagagcc gctaggttta ttgcttttgc 781 ttctctgttc ttcattttgg tttccatcac aaccttttgc ctggagacac acgaagcttt 841 caatattgtt aaaaacaaga cagagccagt catcaacggc accagcgctg ttctccagta 901 tgaaatcgaa acggatcctg ccttgacata tgtggaagga gtgtgtgtgg tgtggtttac 961 ttttgaattt ttagtccgta ttgttttctc gcccaataaa cttgagttca tcaaaaatct 1021 attgaacatc attgactttg tggccatcct ccccttctac ttagaggtgg gactcagcgg 1081 gctgtcttcc aaagcggcta aagatgtgct cggctttctc agggtggtta ggtttgtgag 1141 gatcctgaga atcttcaagc ttacccgcca tttcgtaggt ctgagagtgc tcggacacac 1201 tcttcgtgcg agcaccaatg aatttttgtt gctgatcatc tttctggctc tgggagtttt 1261 gatattcgct acgatgatct actacgctga gcgagtaggg gctcaaccta atgatccctc 1321 agcgagtgag cacacacagt tcaaaaacat ccccattggt ttctggtggg ctgtggtgac 1381 catgactacc ttaggctatg gggatatgta cccccaaaca tggtcaggga tgttggtggg 1441 ggccttgtgt gctctggctg gagtgctgac catagctatg cctgtgcccg tcattgtcaa 1501 caattttggg atgtactact ccttggcaat ggcgaagcag aaacttccaa gaaaaagaaa 1561 gaagcacatt cctcctgccc ctctggcaag ctcacctaca ttttgcaaga cagaattaaa 1621 catggcttgt aacagtaccc agagtgacac atgtctgggc aaagaaaacc ggcttctgga 1681 acataacaga tcagtgttat caggtgacga cagtacagga agtgagccgc cattatcacc 1741 ttccggaaag gctccccatc agacgctcta gtaccagaga caaaaacaga agaggggaaa 1801 catgtttcct gttgacgaca ggtgattaca cgtgcgcttc tgatggagga atcaggaaag 1861 ataactgcaa agatgttgtc attactggtt acacgcaagc cgaggccaga tctcttactt 1921 aatgacttgg gaaaggcaca aaacatgaaa gaaagtgttg tacagaattt atcatggatt 1981 tttgcctgct gaaaatggga cattgagatt tagccattca aggattgtac tggaaacttc 2041 tgctactgaa tgtgaccgga tgtgaccggt ttgtgttcag aagagttccc gcattctctg 2101 aggcatttaa agcttgtaaa gaaatggtgg ctggaactga aatgggcttt gccaaggaaa 2161 tattctgctt gtcgacagac cagagtccac aaaacaactg taaataccaa cgtgtgcatg 2221 ggtccaccct tctcacagtc tcatagcaca gggagccaca ttcattgtca gcatctgtga 2281 agcttcaaga aatgcccaca aggctctgaa tggctcctct cgggctcatg gtggttttgc 2341 tgcaaattac agagggcagt gaatccaatg tagtataggt ctgttttcat ttcttttatt 2401 tgtacattac gtgttgtatc tacaaacacc gattgcctca ttttcttgca aaccacttat 2461 atatccgctt gtgaactatg ttgtaaactt ttagtgtcta catagaaatc taaccatttc 2521 ttagtcatgt attcatttag ttctggaacg ctgttactat ctagaagaca gtcatcacca 2581 gtattcatcg tataagtctt ttgtcattgt tacctcaatt ataaatatta aaaatataaa 2641 ttctggcaat gagaatattt ttttattaaa tgatcaagga aaatgtcagt atatagtaga 2701 atattatcaa attatatcct aaaatgtcta ttttgcataa aagagatatt cttcaatcga 2761 ttcctttttt tcgtgagctt tgtggctaat gaagcgtgtg tttgtcttca tcactgctgt 2821 agttgaaact gtgtaagagt ttttcatctt gcttaatcaa tatttccaga atctcttagt 2881 tcccctggga ttctgaatat actctatagc ctaattataa acccctgtat cgtgtacctt 2941 ttgtgaccat ctcaaggtgc atgcctaatc gttgtgataa gccaatggac atgttactaa 3001 ctgaaatgaa gaataaaagt caaacgatct ggggataaac ttgaatccta tctgattaaa 3061 tgat // LOCUS ECOHIP 726 bp ds-DNA BCT 28-JUN-1990 DEFINITION E-coli hip gene for integration host factor beta-subunit (IHF beta). ACCESSION X04864 KEYWORDS 30S ribosomal subunit protein S1; integration host factor; integration host factor beta-subunit; rpsA gene. SOURCE Escherichia coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 597) AUTHORS Weisberg,R.A. JOURNAL Unpublished (1987) NICHD/NIH, Bldg.6 Rm.306, Bethesda MD 20892. STANDARD simple automatic REFERENCE 2 (bases 1 to 597) AUTHORS Flamm,E. and Weisberg,R.A. TITLE Primary structure of the hip gene of Escherichia coli and of its product,the beta-subunit of integration host factor JOURNAL J. Mol. Biol. 183, 117-128 (1985) STANDARD simple automatic REFERENCE 3 (bases 1 to 726; revises [1],[2]) AUTHORS Mengeritsky,G., Giladi,H., Teff,D., Goitein,D., Weisberg,R.A. and Oppenheim,A.B. TITLE Revised DNA sequence of the hip gene of E.coli JOURNAL Unpublished (1990) STANDARD simple automatic COMMENT **map:20'; EMBL features not translated to GenBank features: key from to description INVREP 495 508 inverted repeat B INVREP 510 523 inverted repeat B' TERM 495 523 put. transcription terminator [3] Author address G.Mengeritsky Dept of Mol Genetics, Hebrew University-Hadassah Medical School Jerusalem, Israel GALINA%HUJIMD.BITNET@CUNYVM.CUNY.EDU FEATURES from to/span description pept < 1 29 rpsA pept 190 474 IHF revision 455 459 gcgcc in [3]; gc in [1],[2] rpt 40 48 inverted repeat A rpt 53 61 inverted repeat A' signal 40 61 put. transcription terminator signal 76 81 -35 region signal 99 104 -10 region binding 103 116 IHF binding site site 475 483 Lambda-E.coli erossover point mut 335 335 g in wt; a in hip157 (gly is asp) BASE COUNT 204 a 167 c 162 g 193 t ORIGIN 1 aagctttcaa agcagctaaa ggcgagtaat tctctgactc ttcgggattt ttattccgaa 61 gtttgttgag tttacttgac agattgcagg tttcgtcctg taatcaagca ctaagggcgg 121 ctacgggccg cccttaatca atgcagcaac agcagccgct taatttgcct ttaaggaacc 181 ggaggaatca tgaccaagtc agaattgata gaaagacttg ccacccagca atcgcacatt 241 cccgccaaga cggttgaaga tgcagtaaaa gagatgctgg agcatatggc ctcgactctt 301 gcgcagggcg agcgtattga aatccgcggt ttcggcagtt tctctttgca ctaccgcgca 361 ccacgtaccg gacgtaatcc gaagactggc gataaagtag aactggaagg aaaatacgtt 421 cctcacttta aacctggtaa agaactgcgc gatcgcgcca atatttacgg ttaagttttt 481 atactaagtt ggcattataa aaaagcattg cttatcaatt tgttgcaacg aacaggtcac 541 tatcagtcaa aataaaatca ttatttgatt tcaattttgt cccactccct gcctctgtca 601 tcacgatact gtgatgccat ggtgtccgac ttatgcccga gaagatgttg agcaaactta 661 tcgcttatct gcttctcata gagtcttgca gacaaactgc gcaactcgtg aaaggtaggc 721 ggatcc // LOCUS GPIGPC3A 5077 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Guinea pig complement C3 protein (GPC3) mRNA, complete cds. ACCESSION M34054 KEYWORDS complement C3. SOURCE Guinea pig (strain Hartley) liver, cDNA to mRNA. ORGANISM Cavia porcellus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Hystricomorpha; Caviidae; Caviinae. REFERENCE 1 (bases 1 to 5077) AUTHORS Auerbach,H.S., Burger,R., Dodds,A. and Colten,H.R. TITLE Molecular basis of complement C3 deficiency in guinea pigs JOURNAL J. Clin. Invest. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by H.S.Auerbach 07-MAY-1990. FEATURES from to/span description pept 28 5028 complement C3 protein (GPC3) precursor sigp 28 93 complement C3 protein signal peptide matp 94 2040 complement C3 beta chain matp 2053 5025 complement C3 alpha chain signal 5055 5060 polyA signal (put.) BASE COUNT 1174 a 1537 c 1504 g 862 t ORIGIN 1 cctgggtcgc tcggctctcc ctgcaccatg ggacccgccg caggtcccag cctgctgctg 61 ctgctgctgg ccagcgtctc cctggccctg ggggacccca tgtactcgat catcaccccc 121 aacatcctgc ggctggagaa cgaggagacc gtggtgctgg aggcccacga agtgcaaggg 181 gacatccctg tcacagtcac tgtccacgac tttccggcca agaagaatgt gctgtccagt 241 gagaagaccg tgctgaccag tgccacggga tatctaggca ctgtcaccat caagatcccc 301 gccagcaagg agttcaagtc cgataagggg cgcaagttgg tggtcgtgca ggccgccttt 361 gggggcacgc agctggagaa ggtggtgctg gtgagcctgc agagcgggta cctcttcatc 421 cagacggaca agaccatcta cacgccgggc tccaccgtgc tctaccggat cttcaccgtg 481 gacagcgacc tgctgcccgt gggccggacc atcattgtca ccatcgagac acccgacggc 541 atccccatca agcgggacac cctgtcttcc aacaaccagc atggcatctt gcccttgtcc 601 tggaacatcc ccgagttggt caacatgggg cagtggaaaa tccaggcctt ctacgagaac 661 tcgcccaagc aagtcttctc ggctgagttc gaggtgaaag aatacgtgct gcccagcttc 721 gaggtcctgg tggagcccac ggagaagttc tactacatcg acgatccgaa aggcctggag 781 gtcaacatca tagccaggtt cctctacggg aagaacgtgg acggaacggc ctttgtcatc 841 ttcggggtcc aggacggtga ccagaggatc tccctggccc agtccctcac ccgagtcgtg 901 atcgaggacg gcagcggaga ggtcgtgctg agcaggcagg tgctgctgga cggggtgcag 961 ccctccaggc ccgaggccct ggtggggaag tccctgtacg tgtctgtcac ggtcatcctg 1021 cactcaggca gcgacatggt ggaggcagag cgcagtggga tccccatcgt cacctccccc 1081 taccagatcc acttcaccaa gacgcccaag tacttcaagc cggctatgcc cttcgaaatc 1141 atggtgctgg tgacgaaccc cgacggctct ccggcccccc acgtccccgt ggtgacccag 1201 ggctccaacg tgcagtctct cacccaagct gacggcgtgg ccaggctgag catcaacacc 1261 cccaacaccc ggcagcctct gagcgtcacg gttcaaacaa agaaaggggg catcccggac 1321 gcccggcagg ccatcaacac catgcaggcc ttgccctaca ccaccatgta caactccaac 1381 aactatttac acctctccat gccgcgcacc gagctcaagc ctggcgagac catcaacgtc 1441 aacttccacc tgcgaagcga cccgaaccag gaagctaaga tccgctatta cacctacctg 1501 atcatgaaca aggggaagct gctgaaggtg ggtcggcagc ctcgagagcc gggccaggcc 1561 ctggtggtgc tgcccatgcc catcaccaag gagctgattc cctccttccg cctggtggcc 1621 tactacactc tcatcggtgc cagcgcgcag agggaggtgg tggccgactc tgtgtgggca 1681 gacgtcagag actcctgtgt gggcacgctg gtcgtcaaag gcggctccgg gaaagatggc 1741 caggacaagc ggcagcagca tctaccccgg cagcagatga ccctcaggat agagggcaac 1801 cagggggccc gcgtgggcct ggtggccgtg gacaagggtg tgtttgtgct gaacaagaaa 1861 cacaaactga ctcagagtaa gatctgggac gtggtggaga aagcggacat cggctgcacc 1921 ccgggcagcg ggaaggacta cgcgggcgtc ttcacggacg cggggctgtc cttcaagtcc 1981 agcaaggcgg ggctgcagac ggcccagcgg gaaggtctgg actgccccaa gcccgccgcg 2041 cgccgccgcc gctcggtgca gctcatggag aggaggatgg acaaagccgg gaagtacaag 2101 agcaaggagc tgcgcaggtg ctgcgaggac ggcatgcgcg aaaaccccat gcagttttcc 2161 tgccagcgcc gcgcgcgcta cgtgagcctg ggcgaggcgt gcgtcaaggc cttcctggac 2221 tgctgcacct acatggccca gctgcgccag cagcaccggc gcgagcagaa cctgggcctg 2281 gcccggagtg acatggatga agacattata ccagaagagg acatcatttc cagaagccag 2341 tttccagaga gctggctgtg gaccatagag gaactgaagg agccagagag aaacggcatc 2401 tccaccaaga ccatgaacat cttcctcaaa gacagcatca ctacttggga gattctggcc 2461 gtgagcttgt cagataagaa aggcatctgc gtggcggacc ccttcgaggt cacggtgatg 2521 caggatttct tcatcgacct gcggctaccc tactccgtgg tacgcaacga gcaggtggag 2581 atccgcgcgg tcctctacaa ctaccgcgag gcccagagtc tcaaggtgag ggtggagctg 2641 ctgcacaacc ccgccttctg cagcctggcc accgccaaga agcgccacac ccagacggtg 2701 accatcggac ccaagtcctc tgtggcggtg ccctacgtcc tggtgcccct gaagatcggc 2761 ctgcaggagg tagaggtcaa agctgccgtc tacaactact tcatcagcga cggcgtgaag 2821 aagaccctca aggtcgtgcc cgaagggatg agagtcaaca aaacggtggc cattcgcact 2881 ctgaacccag agcagctggg ccaaggcggc gttcagaggg aggaaatccc agctgcagac 2941 ttaagcgacc aagtcccaga caccgattct gagaccaaga ttctcctcca agggaccccg 3001 gtggcccaga tggccgagga cgctgtggac gcggagcggc tgaagcacct gatcatcacg 3061 ccctcgggct gcggagagca gaacatgatc ggcatgacgc ccaccgtgat cgcggtgcac 3121 tacctggacc agacggagca gtgggagaag ttcggcctgg agaagcggca ggaggcgctg 3181 aacctcatca acagggggta cacacaacag ctggctttca aacagcccaa ctgggcctac 3241 gcggctttca agaaccgggc gtccagcact tggctgaccg cctacgtggt gaaggtcttc 3301 tcgctggcgg ccaatctcat tggcatcgac tcggaggtcc tctgcggggc ggtgaaatgg 3361 ctgatcctgg agaagcagaa gcccgacgga gtcttccagg aggacgggcc cgtgatacac 3421 caagagatga ttggtggtgt ccggaccgcc caggaggctg atgtgtccct cacggccttc 3481 gtgctcattg cattgcaaga ggctaaagat atctgcagag cacaggtcaa caacctggaa 3541 gcgaacatca acaaagctgg cgattacatc gagtcaagat acgcagacgt gaggagaccg 3601 tacacgctgg ccatcgctgg ctacgccttg gccctgctgg agaggctgaa cggcgcaacc 3661 ctccaaaaat ttctaaacgc agctacagag aagaaccgct gggaggaggc ccgtcagaag 3721 ctgtacagcg tggaagccac gtcctacgcc ctcctggccc tgctcctgct gaaggacttc 3781 gacgcggtgc ctcccgtcgt gcgctggctc aacgaacaga gatactacgg caggggctat 3841 ggctccaccc aggccacctt catggtgttc caagccttgg cccagtacca gacagatgtc 3901 cccgatcaca aggacctgaa catggaggtg gccctccagc tgcccagccg cagctccccg 3961 tccaaatttc gcctcgtctg ggaagccggt agcctcctgc ggtcagaagc gaccaagcaa 4021 aacgagggct tcaaactgac agctaaaggg aagggccagg gcacgttgtc ggtggtggct 4081 gtgtactatg ccaagacgaa acgtaaagtt gtctgcaaga acttcgacct cagggtcact 4141 ctgaagccgg cccccgatac agtcaagaag cctcaggagg ccaagagcac catgatcctt 4201 ggcatctgta ccaggtacct gggagaccag gatgccacta tgtccatcct ggacatatcc 4261 atgatgaccg gtttcattcc tgacacagat gacctcaagc tgctggccac cggcgtggac 4321 agatacatct ctaagtacga gatgaacaaa gacttcagca agaacaccct catcatctac 4381 ctggacaagg tctcacactc cgaggaagaa tgtctgtcct tcaaaatcca ccagttcttc 4441 aatgtggggc tcatccagcc tggctccgtc aaggtctact cctactataa cctagatgaa 4501 acatgtaccc agttctacca cccggagaag gaggacggga tgctgaacaa actctgccac 4561 aaggacttgt gccgttgcgc agaggaaaac tgcttcatac agctgccgga gaagatcacc 4621 ctggatgagc ggctggagaa ggcctgcgag cccggggtgg actacgtgta caagaccaag 4681 ctgctcaaga tggagctgtc ggatgacttc gatgagtaca tcatgacgat cgagcaggtc 4741 atcaagtcag gctccgatga ggtgcaggcc gggaaggagc gcagattcat cagccacatc 4801 aagtgcagag acgccctgca cctgaaggag gggaagcact acctcatgtg gggcctctcc 4861 tcggacctgt ggggggagag acccaacatg agttacatca tcgggaagga cacctgggtg 4921 gaagcgtggc ccgaggcgga agaatgccaa gacgaggaga accagcagca atgccaggac 4981 ctgggcactt tcacagagaa catggtggtt tttggctgcc ccaactgacc acgcccgtca 5041 gtcatcctcc ctccaataaa gcttcgggtc tatggca // LOCUS HUMPP14B 8076 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human placental protein 14 (PP14) gene, complete cds. ACCESSION M34046 KEYWORDS placental protein 14. SOURCE Human (cell line GM1416) DNA, clone PP14G. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 8076) AUTHORS Vaisse,C., Atger,M., Potier,B. and Milgrom,E. TITLE Human placental protein 14 gene: Sequence and characterization of a short duplication JOURNAL DNA (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Vaisse, 04-MAY-1990. FEATURES from to/span description pept 3019 3114 placental protein 14 precursor (PP14), exon 1 3510 3649 placental protein 14 precursor (PP14), exon 2 4036 4109 placental protein 14 precursor (PP14), exon 3 5482 5592 placental protein 14 precursor (PP14), exon 4 6644 6748 placental protein 14 precursor (PP14), exon 5 7019 7035 placental protein 14 precursor (PP14), exon 6 sigp 3019 3072 placental protein 14 signal peptide matp 3073 3114 placental protein 14, exon 1 3510 3649 placental protein 14, exon 2 4036 4109 placental protein 14, exon 3 5482 5592 placental protein 14, exon 4 6644 6748 placental protein 14, exon 5 7019 7032 placental protein 14, exon 6 pre-msg 2971 8010 PP14 mRNA and introns IVS 3115 3509 PP14 intron A IVS 3650 4035 PP14 intron B IVS 4110 5481 PP14 intron C IVS 5593 6643 PP14 intron D IVS 6749 7018 PP14 intron E IVS 7036 7786 PP14 intron F signal 2891 2899 CAAT box signal 2942 2948 TATA box signal 7992 7997 polyA signal BASE COUNT 1636 a 2226 c 2203 g 2011 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcttta cctagcccta cgtcctgaag attttctctc acgctttctt ctaaaagttg 61 tatagtttta tgttttactt ttaaactatg agttaatgca tatgtcaggt gtgagtttta 121 gatggaggtt cttcgtttgc atgggatcga ttgcacgtga tgaattgctc cagcaccatt 181 tgttgcaaag actgtccttc ttttagaggg actcccgctt gccaggcctc tggtttaatg 241 aaacatgacc agagtgactc catcttaacg tgaataacta gacactcaca aggcacctat 301 aaggttatat aacgaggcta tgctgctcga tactgactac gacaatttcc tgtttctctg 361 cctacaggac accaccaccc ccatccagag catgatgtgc cagtacctgg gtgggtctca 421 cagcacatga gctcaacgtg ggtgagaggc agcagctact tccatggctg ggaaccctgg 481 ggagctgaca actggcttcc tgtccacctc agtgcctgtg ggctggtttt ttctttcttg 541 gttttttttt tatttgtttg tttgctttgt tttttttgag acagggtctc attctgtcac 601 tcaggctgga gtgcagtggc gtgatctcgg ctcactacaa cctccacctc ccagattcaa 661 gcgattctcg tgcctcagct tcccgagtag ctgggattac aggcgcgagc ttatggtctg 721 aaaatacccg catactaagc tgaccaccaa ttataactgc agaacattta tgcccatacg 781 aggcatctcc caccaagcct ggagaatgta ccgatgacct gggagtgcag ggggttatct 841 ttgctcacaa ataacgtcaa cgagtaggct gaggctgaag ggcaaatggt cattgatcac 901 actaggagcc cctatcttta gcgagtgcat ctgcatgatc caggtttcac tgtagctcat 961 tacagcttct tacaaacaga ggcactcaca gaggacgggc gttcctcctg ctcgctgagg 1021 ttgcccggct ctggcacaga gtcatttccc ataaacttgc tttcactgtg ctctgtgagt 1081 caccttgaat tctttcccgt gtgagatcta agaacccatt cttggggtct ggactgggac 1141 cctcttttcc gacaacactt cccccacgca ctgttcttgc agctttgtta aaggtctgtt 1201 gttgttgtgt gtgagtccat gtttggactc tctatttccc attgatctat gtgtccattt 1261 ctctgccaat accacaatct cttggtgact attgataata gtgcttgtaa acttgggtag 1321 actaactctt tatactttat tcttcttcaa aatggcttta gcaattgtag ctcctttgcc 1381 ttgacatgta aattttagaa aaaaaaattt ctctatatat ctacatatgg tggggtgtgg 1441 tggctcacac ctataattcc agcactttgg gaggccgagg tgagcagatc acttgaggtc 1501 agcagttcga gaccagcttg gccaaaatag tgaaaacccc tctctactaa aaagacaaac 1561 attagccagg tgtggtggtg ggcgcctata atcccagcta ctctggaggc tgaagcagca 1621 ggattacttg aacctgggat gcagaggttg cagtgagtca agatcgtacc actgcactcc 1681 accctgggca acagagcaag acttggcctc aaaaaaatat atatatgtat atatgtgtgt 1741 gtgtgtgtgt atacacacac acacacacac atgtctatag ctacctacaa aatatcttgc 1801 tgagaatttg ataggaattg cattaaatct ctaggttgag ttgaggagag ctaacacctt 1861 tgctgtgttg aagcttccag cccatgaaca cattgtgcct gtcctttacc ttaggtcttt 1921 gatttccctt ccttgggttc agagtttaag tcctgtccat atttggtgag atttttgcct 1981 acatgctgtg tgtgtgatta caaatcctga atttccagtt tcgggttctg tctactcctt 2041 gacactgtgt gggaacacgg tggatttttg tgcgattgac cttgctgacc tcactcggtt 2101 ctaggagggg ttttgtagag ttcttgcaat tttgtacctg ggcaatgatg ttatatgcga 2161 atagagacag gtatacctcc ccctctccaa tctgcctgcc tttagttacc atttcttgcc 2221 tagctgctct ggccggaatg acctgctctg atggatgggg gagtgggagt ggacacccgt 2281 gtcttgcctc aggactcaag tgctctccag ggagtgagat gcgggctcga ttgtaagtta 2341 taaagtgatg cttccctgtg ggaaaaagta catccaatat agactgtagg acaaagtctg 2401 aaagtccact gcctcccatt tccacccagt cttgcctgtg agtcagagag aaacagtgct 2461 aacaggcagt gtgcacctgt ccagaggctg gctgtttgag ggacacaggt gtgcacacct 2521 ggggatgtct gagtggaagg tacaggtcag gattatgacc gtgcagtcag tcacccactg 2581 gcatgatgtc tgtggcatcc tggggcagcc atggggctct caggccctct gcctgcccca 2641 caggccatac ccctgccctg gacacagctg tcctcagtgc tggcctctga cccaacattg 2701 tccaggagcc ccaacccaga aggtgctccc gccgctgcca gcctggaccc gacccaggcc 2761 cctcccgcct gaggccctgc caagaactgc ccagcccgga cacagaggag gttccgcgtg 2821 gacgcaggga agagcctccc attgccccag tggaggaagc tgcccagggg ccaaggatga 2881 gtcacaggtt cgaggaatca catggcgagg ctgtgggcgg ggatcttgtc tgccctcctc 2941 ctacataagg ccccctgagc ccacactgcc tcagcatccc tctggctcca gagctcagag 3001 ccacccacag ccgcagccat gctgtgcctc ctgctcaccc tgggcgtggc cctggtctgt 3061 ggtgtcccgg ccatggacat cccccagacc aagcaggacc tggagctccc aaaggtttga 3121 ggctggggga gcgggcactt tactgtggga ggcctggggc gggtgggagc tgcgggcagg 3181 cgggaagcca ggatctcaga aacctacagg aagcacagaa tggacgccat gacgtcagga 3241 agccctcagc cctgctctcc atctttaggg tggcctctct ggtttcccag catcctaggt 3301 gactcattat ttggactttg gaacactcct gagttagcac acactggtca ttttaagtac 3361 aggaaatttc atagcccagg atctggtaga tagcagacaa ccatccaatg ctcactgtac 3421 ccatcccagt tagactcagc cccgtctgca ccgggtgcaa cgagagccat ggtggggtgg 3481 gaccgccgtg cagcccaagg ccccctcagt tggcagggac ctggcactcc atggccatgg 3541 cgaccaacaa catctccctc atggcgacac tgaaggcccc tctgagggtc cacatcacct 3601 cactgttgcc cacccccgag gacaacctgg agatcgttct gcacagatgg tgggtttctc 3661 atcattgaga cgggctgggc gggggctcag tctcccccct caggggtcca ggactgggtg 3721 ggttgggcgg agctggactt agccccaggc attttctgac agccaggggc ttcactgtgg 3781 cccttccatg agggtggggt ggaaaaccag ggctccagac gttccctgtc cccttggatc 3841 ccctgcccca ggctctgggc caacagccaa ccacacagtg cagccccagg tcagactgag 3901 gagaaggtct gggcggctgc gggctgcggt gctccttgga cccggggaag ttcccgtggt 3961 gacctgattt taggagtgac agtgaaggca actccaattc aagtggccac tcatcctatt 4021 gtcaccacct ttcagggaga acaacagctg tgttgagaag aaggtccttg gagagaagac 4081 tgagaatcca aagaagttca agatcaactg tgagtgtccc caggccccaa gggctggctc 4141 agtgctggca tgctagccac gctctcccag aggcggctct gctggggcat gagggagtgg 4201 ggcctggcct gtccccactc tctctgcttc agggagtcag agtgtttact ccggtcaacc 4261 tgatgctgac cccagaggca tcttttacct ggagggcagg ggaagcacta attcttggca 4321 tgacatgact ggatgtgggt ctgcactgtg cccaggccaa ggggacaggt gctttgttgc 4381 actgttcact ctggcctcac aaaaggccag ggaggctgca ggcgagcagg tgggcaggtg 4441 ggcaggtggg taggtgggta ggtggatatg tatacaggtg gcaggagggt aggtgaacag 4501 gtgggtaggt gggcaggtgg ctaggtgagt aagtggttag gtgaacaggt gggcaggtga 4561 gcaggtggtt aggtgaacag gtgggcaggt ggataggtga acaggtgggc aggtgggtag 4621 gtgggtaggt atacaggtgg acaggtgggt aggtggacag gtgggcaggt gagtaggcga 4681 acaggtgggt gggtgaacag gtggccaggt gaacaagttg gtaggtgggc aggtgggtag 4741 gtgggtaggt gaacaggtgg gcaggtgggc aggtgggcag gtgggcaagt ggctgctgtt 4801 cccgtgggcc tggctgcctc ctgcgcactc tggggctgca gctctggtct taggctgagc 4861 tcccaggcct ctctggggga agagagaggg gcttacagca tgtccttggt ccactgaatt 4921 cttcctaaca atttgcaaca ttttgttcta ttttgttaat tattattttt ttaaaaagac 4981 agaggtggtc agggtctggg gcctcttatc ccctcatggg cacattttcc cagcaaatac 5041 agtttgcttc tcatgcttgg gacttgcctc aggcctttct gaccctgctt gccctcccca 5101 gaatcgagcc actctccaag gtccatttct tctccctcct cccgcccctg tgccctgttc 5161 ctgtgccatc tcccgccatc ctcacccgta cgtgacttct cagttggagt ctctccaggt 5221 cacagcctcc ctgcctgccg tgtctgcctc tccacggcac acctggcctc tcgccctcag 5281 ccggggctcc atggccctcc acattgcctc tcctcccctt tcctccctgg cttccctgat 5341 catggtccac agcaggggcc acgtcccatg gtgtcagtga tgaggaagcc acttagtgtg 5401 gtgggatgtc cacacacctg cacaggactc tgctgagacg gaggcttcat cttccttttg 5461 gttcttctct tctttcccca gatacggtgg cgaacgaggc cacgctgctc gatactgact 5521 acgacaattt cctgtttctc tgcctacagg acaccaccac ccccatccag agcatgatgt 5581 gccagtacct gggtgggtct cacagcacat gagctcaacg tgggtgagag gcagcagcta 5641 cgtccatggc tgggaaccct ggggagctga caactggctt cctgtccacc tcggtgcctg 5701 tgggctgact ttttctttct tggttttttt tatttgtttg ttgtttgctt tgtttttttg 5761 agaaagggtc tcattctgtc actcaggctg gagtgtagtg acgtgatctc ggctcactgc 5821 aacttctgcc tcccagattc aagcgattct cgtgcctcag cctccagagt agctgggatt 5881 acaggcgcac gccaccatgc ccagctaatt tttgtatttt ttggtagaga cggggtttca 5941 ccatgttggc caggctggtc tccaactcct ggatcaagtg atccacccgc cttggcctcc 6001 caaaggctgg gattacaggt catccaccac gcccggccag gctgagtttt tctccagcgg 6061 ttcatcgagt cctctgacaa agcaaggagc tgatataggg ccagtgggac ggtcgccagt 6121 caaggggctg ggcttggtgg atagattaat actcactggg cgtccagtca aaacgccctg 6181 aaacctatga tgctgtcaac caaacgaagg ccaggaatac caaaatagcc acataggcac 6241 agcccttccc catgtttctg agcacagtgt ttcctctggg gtcacacagg tgtcttcttg 6301 atcagcctca gccatgcttg gtgagagccg ggcactggga gagccaggca ctgtgctctc 6361 ctgtgacgct gtagacacca tcctaagctg tgcagacccc agcgctgccc agagcggagc 6421 agagggggcc gggcaaggag tgggagctgg ggtcagggaa cctggaggtg cagtggacag 6481 agccccggag accgccctag ggacctactc cagaccaaac tctgccagac ctcggagcac 6541 tggggcctcc ttctctgccc tccctcctca ggcaaggcct ctggagctcc ccagctctca 6601 tggaagcccc aggggcccag gactgaccca gcctcttcca cagccagagt cctggtggag 6661 gacgatgaga tcatgcaggg attcatcagg gctttcaggc ccctgcccag gcacctatgg 6721 tacttgctgg acttgaaaca gatggaaggt gagctctgcc taggacacgc ccagcctcag 6781 ctggaggaga agctgcctct ttcttagccc gagccccctg ctggctctgc aggactcagg 6841 tcactccttt ttggcccctc ccctgttctc ccctggcctt ctggggtgca gagccaccct 6901 gaggtggggt cctgccctct cccaccatcc tttcatccct tctctagccc tggggctgct 6961 gtgtccccag ctgtctcttc tctcgctgac acctccactg tcccatctcc tcccacagag 7021 ccgtgccgtt tctaggtgag ctcctgcctg gtcctgcctc ctgggtaatg tatcagcctc 7081 gcccactgtc tgcggctgcc tctctgggcc cctgggacag accctactgt gtccagttca 7141 gggctgaccc tacaggaatg aactggggtc tggtcttgtg attccagaaa gccaggctgc 7201 tgacgtcccc attcacgagc ccagcctgtg tcttgcagcc attgtattag tcacgggctt 7261 gtgccctata gtcagacctc atgctttctt ttggggttag gggtgttggt tggaaatggt 7321 gggggctata ggaggaggaa ggaggatggt tacatggaag ggcatgagaa gctggggacc 7381 tgcaggtctc ggtcccacgt tctttttttt ttttcttttt ttaagatgga gtctcgctct 7441 gtcaccaggc tggagtgcag tggcacaatc tcagctcact gcaacctcga cctcctgggt 7501 tcaagcgatt ctcctgcctc aaccccccga gttgctggaa ccacaggcgt gtgccaccat 7561 gcccagctaa tttttgtatt tttaatagaa acggggtttc accatgttgg ccacgatggt 7621 ctcaatctct tgacctcatg atccccccgc tttagcctcc caaagtgctg ggatttcagt 7681 gccacattct taagggggtg tgctcaagcc caccacatcc ttccagggct cccccgaaac 7741 accctgctct tcctccctct acttaagtga cctgtaaacc caacagctca cctccgcctc 7801 caggaagacc agactcccac ccttccacac ctccagagca gtgggacttc ctcctgccct 7861 ttcaaagaat aaccacagct cagaagacga tgacgtggtc atctgtgtcg ccatcccctt 7921 cctgctgcac acctgcacca cggccatggg gaggctgctc cctgggggca gagtctctgg 7981 cagaggttat taataaaccc ttggagcatg tcctgtctgg atgcgcagcc actgctgggt 8041 gtgggattca gggacgaggg cctggggtcg gggcag // LOCUS PHAMP1A 1313 bp ss-mRNA PLN 28-JUN-1990 DEFINITION P.chrysosporium Mn-dependent peroxidase (MP-1) mRNA, complete cds. ACCESSION J04980 KEYWORDS Mn-dependent peroxidase. SOURCE P.chrysosporium (strain BKM-F-1767), cDNA to mRNA, clone lambda- MP-1. ORGANISM Phanerochaete chrysosporium Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes; Agaricales; Corticiaceae. REFERENCE 1 (bases 1 to 1313) AUTHORS Pease,E.A., Andrawis,A. and Tien,M. TITLE Manganese-dependent peroxidase form Phanerochaete chrysosporium: Primary structue deduced from cDNA sequence JOURNAL J. Biol. Chem. 264, 13531-13535 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.A.Pease, 21-JUN-1989, for release after publication. FEATURES from to/span description pept 36 1184 Mn-dependent peroxidase precursor (MP-1) sigp 36 72 Mn-dependent perocidase signal peptide matp 73 1181 Mn-dependent perocidase BASE COUNT 251 a 462 c 334 g 266 t ORIGIN 1 tcagctctca aggacatccg cactcgaata tcgcaatggc cttcggttct ctcctcgcct 61 tcgtggctct cgccgccata actcgcgccg ccccgactgc ggagtctgca gtctgtccag 121 acggtacccg cgtcaccaac gcggcgtgct gcgctttcat tccgctcgca caggatttgc 181 aagagactct gttccagggt gactgtggcg aagatgccca cgaagtcatc cgtctgacct 241 tccacgacgc tattgcaatc tcccagagcc taggtcctca ggctggcggc ggtgctgacg 301 gctccatgct gcacttcccg acaatcgagc ccaacttctc cgccaacagc ggcatcgatg 361 actccgtcaa caacttgctt cccttcatgc agaaacacga caccatcagt gccgccgatc 421 ttgtacagtt cgccggtgcg gtcgcgctga gcaactgccc aggtgctcct cgcctcgagt 481 tcatggctgg acgtccgaac actaccatcc ccgcagttga gggcctcatt cctgagcctc 541 aagacagcgt caccaaaatc ctgcagcgct tcgaggacgc cggcaacttc tcgccgttcg 601 aggtcgtctc gctcctggct tcacacaccg ttgctcgtgc ggacaaggtc gacgagacca 661 tcgatgctgc gcccttcgac tcgacaccct tcaccttcga cacccaggtg ttcctcgagg 721 tcctgctcaa gggcacaggc ttcccgggct cgaacaacaa caccggcgag gtgatgtcgc 781 cgctcccact cggcagcggc agcgacacgg gcgagatgcg cctgcagtcc gactttgcgc 841 tcgcgcgcga cgagcgcacg gcgtgcttct ggcagtcgtt cgtcaacgag caggagttca 901 tggcggcgag cttcaaggcc gcgatggcga agcttgcgat cctcggccac agccgcagca 961 gcctcattga ctgcagcgac gtcgtccccg tcccgaagcc cgccgtcaac aagcccgcga 1021 cgttccccgc gacgaagggc cccaaggacc tcgacacgct cacgtgcaag gccctcaagt 1081 tcccgacgct gacctctgac cccggtgcta ccgagaccct catcccccac tgctccaacg 1141 gcggcatgtc ctgccctggt gttcagttcg atggccctgc ctaaaccact cacctccggc 1201 aatgcacctt tagtagatgt cgattctatt tggttgatta gaagtactgg gttctcgtgt 1261 catataccac caagttttca gaatgcaagt ctcctttctt tccaaaaaaa aaa // LOCUS PEAHSP177A 772 bp ss-mRNA PLN 28-JUN-1990 DEFINITION Pisum sativum 17.7 kDa heat shock protein (hsp17.7) mRNA, complete cds. ACCESSION M33901 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 772) AUTHORS Vierling,E. and Helm,K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 65 523 17.7 kDa heat shock protein (hsp17.7) BASE COUNT 254 a 127 c 170 g 221 t ORIGIN 1 caaaaatcaa aacgtgcgac aaacacaaaa tcatcccaca aagaaagcaa tggatttcag 61 gctaatggat ttggattctc cactcttcaa cactctccat catataatgg acctcaccga 121 cgacacaacc gagaagaact taaacgctcc aactcgaaca tatgtccgtg acgcaaaggc 181 aatggctgca actccagcgg acgtgaaaga gcatccaaat tcatacgtgt ttatggtgga 241 catgcctggg gtgaaatctg gtgacataaa ggttcaggtg gaagatgaga atgtgctatt 301 gataagtggc gagaggaaga gagaagaaga gaaagaaggt gttaaatatt tgaagatgga 361 aagaaggatt ggtaagttga tgaggaaatt tgtgttacct gagaatgcga atattgaagc 421 tatctctgct atttctcaag atggtgttct tacggttaca gttaataaat tgcctccacc 481 tgaacctaag aaaccaaaaa ctattcaagt taaggttgct tgatcggtgt acgatttcat 541 gtcaacaaat cagaaggaat gtttgtcttt ttagttggtt tgtgtagcaa tggttttgtg 601 tgttttcgcc tagttggccc tatatatgat gatcatcatg cgatgtaatt tgtaacaata 661 tgacatgaat gaattttaat tacttggttt ttctgcttgt aacattgttg cgttgccccc 721 atgataaaat tgagaaactg aagtattaaa gaaaagaaaa tgtttcattt ac // LOCUS PEAHSP179A 700 bp ss-mRNA PLN 28-JUN-1990 DEFINITION Pisum sativum 17.9 kDa heat shock protein (hsp17.9) mRNA, complete cds. ACCESSION M33900 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 700) AUTHORS Vierling,E. and Helm,K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept < 1 469 17.9 kDa heat shock protein (hsp17.9) BASE COUNT 209 a 123 c 170 g 198 t ORIGIN 1 gataattcca agagtcttcg gtactggacg aagaaccaat gcattcgatc cattctcatt 61 agatttatgg gacccattcc agaacttcca actcgcaaga tccgccaccg gaaccaccaa 121 cgagacggca gcttttgcca acgctcacat tgactggaag gaaacaccgg aggctcacgt 181 gttcaaggct gatcttcccg gagtgaagaa ggaagaagtg aaagttgaaa tagaagaaga 241 tcgtgtgctc aagataagcg gagagaggaa aactgaaaag gaagacaaga acgacacctg 301 gcaccgtgtt gagcgtagtc aggggagttt cctccgccgt ttcaggttgc cggaaaatgc 361 taaagttgat caggtgaagg ctgctatgga aaacggtgtt cttaccgtta ctgttcctaa 421 agaggaggtt aagaagcctg aagctaagcc cattcagatt acaggatgag ctcttattct 481 tcctatattt tgatgtttgt gtctcttaat aaaatgttaa aataaaacaa ataataattg 541 tgtgtagtcg agttccagct ttaagagatt gagacatgta tggacttggc tattacttaa 601 gtgtagtagt ttgtgagtat tttgttgggt tatgttagtg tgtatgcaaa taactttttt 661 gagtatgtga aagtttcttt tgattaagct gtatttatcc // LOCUS PEAHSP181A 862 bp ss-mRNA PLN 28-JUN-1990 DEFINITION Pisum sativum 18.1 kDa heat shock protein (hsp18.1) mRNA, complete cds. ACCESSION M33899 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 862) AUTHORS Vierling,E. and DeRocher,A. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 12 488 18.1 kDa heat shock protein (hsp18.1) BASE COUNT 278 a 128 c 207 g 249 t ORIGIN 1 ctatatcaaa catgtctctg attccaagtt tctttagtgg ccgaaggagc aatgttttcg 61 atcctttctc cctggacgtc tgggatcctt tgaaggactt tccattttca aattcttcac 121 cttccgcttc attccctcgt gagaatcctg cttttgtgag cacacgagtt gactggaagg 181 aaacaccgga agcgcatgtt ttcaaggctg atcttcctgg gctgaaaaag gaggaagtga 241 aagttgaagt tgaagatgat agggttctac agataagcgg agagagaagc gttgagaaag 301 aagataagaa tgatgaatgg catcgcgtgg aacgtagcag tggaaagttc ttaagaaggt 361 tcagattgcc tgagaatgct aaaatggata aagtgaaagc ttccatggag aacggcgttc 421 tgacagtgac cgttccaaaa gaagagataa agaaggctga ggttaagtct attgagattt 481 ctggttaaac ttagaatgag ctatgttact ctgttgcttt tcttggttat aatgttttcc 541 tttttgtggc gtgtgcaaga aataaatggt catgtaattc tgaaatgtta atgtataaat 601 aaataagtaa acagttgttg ttggttattc agaggtgtta tagtattcat attgtaatgt 661 atcagaatga atcttgagaa aagagctgct ataaatagag cttgaagttt taaataaaaa 721 aaaaggttcc agaaaggaat aaaaaactgg taacagctag cagagagaaa aagctcaaac 781 cactgtgtta aggtgaacag cggaagaaaa tgaagagatg ttcatagccc ttcttcttga 841 gtctctccaa gatggagaat tc // LOCUS PEAHSP227A 795 bp ss-mRNA PLN 28-JUN-1990 DEFINITION Pisum sativum 22.7 kDa heat shock protein (hsp22.7) mRNA, complete cds. ACCESSION M33898 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 795) AUTHORS Vierling,E. and Helm,K. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 44 637 22.7 kDa heat shock protein (hsp22.7) BASE COUNT 261 a 122 c 170 g 242 t ORIGIN 1 ccaagttcca aacctcaaga acaaaaaaca cacatttcta agtatgagtc tgaaacctct 61 aaacatgtta ctcgttccat ttcttctgct tattctcgcg gctgattttc ctttgaaagc 121 aaaagcatca ctactaccat tcatagattc tcccaacact ctcttatcgg atctctggtc 181 tgatcgtttc ccagatccgt ttcgcgtctt agaacaaatt ccctatggag ttgagaaaca 241 cgaaccatcc ataacattgt cacatgctag agtagactgg aaggaaactc cagagggaca 301 tgtgataatg gtggacgtgc ctgggttgaa aaaagatgat ataaagatag aagtggaaga 361 gaatagggtg ctaagagtga gtggtgagag gaagaaagaa gaagataaaa aaggagatca 421 ttggcacaga gttgaaagat cttatggaaa gttctggagg cagtttaaat tacctcaaaa 481 tgttgatttg gattctgtca aagctaaaat ggaaaacggt gttcttactt taactcttca 541 taagttgtcg catgataaga ttaaaggtcc tagaatggtt agtattgtgg aagaggatga 601 caaaccatct aagatcgtca atgatgagtt gaaataatta tgtgatttgt actcataaaa 661 atgaaaaatg ttttttcatt gtgttatttg tgaataaagg aatgttacct atgatattgg 721 ttgtttgttg tatgtcaact aaagagtgct gtaaaggctt gttaatttca tagtgaataa 781 cttgttggct tttgt // LOCUS LACNISC 1222 bp ds-DNA BCT 28-JUN-1990 DEFINITION L.lactis insertion sequence IS981. ACCESSION M33933 KEYWORDS insertion sequence. SOURCE L.lactis (strain LM0230) DNA, clone pKM40. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1222) AUTHORS Polzin,K.M. and McKay,L.L. TITLE Identification, DNA sequence and distribution of IS981, a new, high copy number insertion sequence in the lactococci related to the IS2/IS3 family of IS-elements JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by K.M.Polzin 27-APR-1990. University of Minnesota Dept. of Food Science 1334 Eckles Avenue, St. Paul, MN 55108 e-mail: kayla@molbio.cbs.umn.edu FEATURES from to/span description pept 58 318 ORF1 pept 330 1169 ORF2 binding 44 50 ribosomal binding site (put.) rpt 1 40 5' inverted repeat rpt 1183 1222 3' inverted repeat signal 33 38 -10 region signal 269 274 -35 region signal 294 299 -10 region BASE COUNT 442 a 233 c 206 g 341 t ORIGIN 132 bp upstream of PvuII site. 1 gatgttatcc ttaaatctta gagtcactat tgtataattt agacaaagga caaaaacatg 61 caaaaacgct actcaaaaga atttaaagaa acccttatcg ccttctatca ttctggtcaa 121 tccgtcaccc agctgtctaa agaatatgac gtggcccctg caacaattta taaatggata 181 gacctctact ctaaatctaa tgaaagctcc gtctctaaag ctgattttct agaattaaaa 241 agacaactgg ctaaagttaa ggaagaacga gacatcttaa aaaaagtatt gaccatattc 301 gccgagaaaa agaagtgagt gctgcggata tggctcaaac catacaaact ttagcactca 361 atgtcagact aagctgtcaa ctccttgatg ttcctgaatc aagttattat gaacggatta 421 accgacaccc atctaaaact caattaagga gacaatacct gtcactcaaa atttctcaac 481 tcttcaatgc taaccgagga atctatggtg ctcctaaaat tcatcatctt ctacttaaac 541 aaggggaaaa agtcgggtta aaactggtac agaagctaat gaagcaactt caactcaagt 601 ctgtagtcat taagaaattt aagcctggat actcactaag tgatcacatc aatcgaaaaa 661 atctcataca gactgaacct acaaagaaaa ataaggtttg gtcaaccgac attacttata 721 ttcctactca acaaggatgg gcttatctct caaccattat ggatcgttat actaaaaaag 781 tcattgcttg ggatttgggc aagcgaatga ctgtagaatt agtgcaaaga actttaaata 841 aggccattaa atcacaagac tatccagaag ctgttattct tcattctgac caaggaagcc 901 agtatacgag tctagagtat gaagagttgc ttaagtatta tgggatgact cactctttca 961 gtcgaagggg atacccttat cataatgcca gtcttgaatc ttggcatgga catttaaaaa 1021 gagagtgggt gtaccaattt aaatataaga actttgaaga agcctatcag agtattttct 1081 ggtacatcga agccttttat aattcaaaac gaatccatca aagtttaggg tatcttacac 1141 ctaatcaatt tgaaaaggta agtgcttaaa ataaatagat taaaattcta cgtttgttac 1201 tctaaaaact tgacttaacg tc // LOCUS DRONCDA 2294 bp ss-mRNA INV 28-JUN-1990 DEFINITION D.melanogaster non-claret disjunctional protein (ncd) mRNA, complete cds. ACCESSION M33932 KEYWORDS non-claret disjunctional protein. SOURCE D.melanogaster (strain dp cl cn bw) 0-4 hr embryo, cDNA to mRNA, clone pNB40. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2294) AUTHORS Mcdonald,H.B. and Goldstein,L.S.B. TITLE Identification and characterization of a gene encoding a kinesin-like protein in Drosophila JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.B.Mcdonald, 30-APR-1990. FEATURES from to/span description pept 111 2168 non-claret disjunctional protein site 180 319 alpha helical domain BASE COUNT 622 a 661 c 603 g 408 t ORIGIN 1 bp upstream of EcoRI site; chromosome 3 map position 99BC. 1 gaattgataa aatcggttgc aaggaggcag acgtatcttc taagttaggc acaacacagt 61 tggcgatgga atcccggcta ccgaaaccgt cgggcctgaa gaaaccccaa atgccgatta 121 aaaccgtgct gcccacagat cgaattcgcg caggattggg aggtggagcc gctggagcag 181 gcgccttcaa tgtcaatgcc aaccagacat actgcggcaa cttattgccg cccctctcaa 241 gggacctcaa caatctgccc caggtgctgg agcgtcgcgg aggaggagca cgtgccgcct 301 ccccagagcc catgaagttg ggccaccggg ccaagctgag acgtagccgt agcgcttgcg 361 acatcaacga actgcgtggt aacaagcgca ctgcggctgc tccttcattg cccagcattc 421 ccagcaaagt atcccgcctg ggcggtgcac tcactgtttc cagccagcga ctagtgcgtc 481 ctgcggcgcc ttcgtcaata acagcaacag ctgtcaaaag accaccagta acgcgtcctg 541 ctccacgggc tgcaggagga gcagccgcca agaaaccagc aggaacagga gcagcagctt 601 cgtcaggagc cgcggctgct gctcccaagc gcatcgctcc ctacgacttc aaggcccgct 661 tccacgatct gctagagaag cacaaggtgc ttaagacaaa gtacgaaaag caaacagagg 721 acatgggcga gctggagtcc atgcctcagc aactggagga gacgcagaac aagcttatcg 781 agacggagtc ctcgctgaag aacacccaga gcgacaacga gtgtcttcag aggcaggtga 841 agcagcatac cgccaaaatt gaaacaatca catcgacgct gggcaggacc aaagaggagc 901 tatccgagct gcaagcaata catgagaaag taaaaacgga gcatgctgct ctaagcacag 961 aagtggtgca tctgcgccag cgcaccgagg aactcctgcg ctgcaatgag cagcaggccg 1021 ccgagctgga gacctgcaaa gagcagctct tccagtcgaa catggagcgc aaagagctgc 1081 acaacacggt catggacctg cgcggcaaca tccgggtctt ctgtcgaata cgaccgccgc 1141 tggagtccga ggagaaccgt atgtgttgca cctggaccta tcacgacgag tccaccgtgg 1201 agctgcagag cattgacgca caggccaaaa gcaagatggg ccagcagatc ttctcattcg 1261 accaggtctt ccacccgctc tcctcgcagt cggacatctt cgagatggtc tcgccgctca 1321 tccagtcggc cctggatggc tacaatatct gcatctttgc ctacggacag acgggcagtg 1381 gcaagaccta cacaatggac ggagtgccgg agagtgtggg cgtcataccg cgcacggtgg 1441 atctgctctt cgactccatc cggggatatc gcaacttggg ctgggagtac gagatcaagg 1501 ccacctttct ggagatctac aacgaggtgc tctacgatct gctgagcaac gagcagaagg 1561 acatggagat tcgaatggcc aagaacaaca agaacgacat ctacgtgtcc aacataacgg 1621 aggagacggt tctggatcca aatcacctgc gccacctcat gcacacggcc aagatgaacc 1681 gtgccaccgc ctcgacagct ggcaacgagc gctcctctcg ttcccacgcg gttaccaagc 1741 ttgagctcat cggacgccat gccgaaaagc aagagatctc cgtgggttcc ataaacctgg 1801 tggatttggc cggctctgag tctcccaaga cgagcacccg gatgaccgag acaaagaaca 1861 tcaatcgctc gctatcggag ctcaccaacg taatcctggc gctgctgcag aagcaggacc 1921 acatcccgta caggaactcc aagctgacgc acctgctgat gccctcgctg ggcggcaact 1981 cgaaaacgct tatgttcatc aacgtctcgc cgttccaaga ctgtttccaa gagtccgtca 2041 agtcgctgcg cttcgcggcc tccgtaaact cctgcaaaat gaccaaggcc aagcggaatc 2101 gctacctgaa caactcggtg gccaacagca gcacacagag caacaacagc ggcagtttcg 2161 ataaataaag aatgcattct gagcccagtt ttaacaattt tcaaatttct aacctgttat 2221 tgcttaattt atgtgtgttt acttttagtg caaataaact aataaagtgc tggaaaaaaa 2281 aaaaaaaaaa aaaa // LOCUS ECOPHOAJ 3133 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.fergusonii alkaline phosphatase (phoA) gene, complete cds. ACCESSION M33966 KEYWORDS alkaline phosphatase. SOURCE E.fergusonii (ATCC 35469) DNA. ORGANISM Escherichia fergusonii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3133) AUTHORS DuBose,R.F. and Hartl,D.L. TITLE The molecular evolution of bacterial alkaline phosphatase: Correlating variation among enteric bacteria with experimental manipulations of the protein JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.F.DuBose, 01-MAY-1990. FEATURES from to/span description pept 544 1962 alkaline phosphatase (phoA) (EC 3.1.3.1) site 466 486 pho box regulatory sequence BASE COUNT 874 a 711 c 774 g 774 t ORIGIN 1 aactattaat taagcaattt tttaattcgt ggaattataa tgcgcgagaa acgtttcgcg 61 ctaatttgtg ttaagaatat ttgccaaaag tattcgttat gtcataaaaa tactactttc 121 agactctagg aagttttgag tctaaatgtt aacgtcacca tactttggat aaggaaatgc 181 agttatgaaa aatcttattt ctgagttgtt gattaaactt gcacaaaaag aagaagagtc 241 aaaggagttg gttgcccagg tagaagccct ggagattatc gtcacggcta tgttgcgtaa 301 tatggggcaa cctgaacaaa aacgcgtaat cgaccagata gaaggggctt tggccggggt 361 aaagcctgat gccagcgttc ctgacggcga tacggaaatg ctgcgcgcat atgtaaagaa 421 gctcttaagg catccccggc aataagacac agaaccgaca cgacactgtc ataaagttgt 481 catcctcgct acctatagtc gctttgtttt tattttactg tatttataca tggagataat 541 aaagtgaaac aaagcgctat tgcattggcc ctgttatctt gtctgattac cccggtaagc 601 caggctcaaa cttctcaaaa tatcaatatc ctggaaaatc gtgccgctca gggtgatatt 661 acaatgccag gtggcgctcg ccgtttaagt ggcgatcaaa ctgaggcatt acgtgcatct 721 cttaatgata agccagctaa aaatattatt ttattaattg gcgatggcat gggggattct 781 gaaattactg ctgcacgaaa ttatgcagaa ggtgcaggtg gttattttaa agggatcgat 841 gcgttgccat taaccgggca gtacactcac tatgcattgg acaaaaaaac cggcaaacct 901 gattacgtga cagattctgc tgcatccgca accgcatgga cgacaggcgt caaaacatac 961 aatggtgcgc tgggtgtcga tatccatgaa aatcctcata ccactattct tgagatggca 1021 aaagcagccg gtctggcaac aggcaacgtg tctacggctg agttgcagga tgcgacacct 1081 gctgcgttgg tttctcatgt cacctcacgc aaatgctatg gcccttctgt taccagtgaa 1141 aaatgcccgg gcaatgcact ggaaaagggg gggaaaggtt caatcactga acaactgttg 1201 aatgcccgtg cggatgtgac gttgggtggt ggtgcaaaaa cgtttgctga aacggcaacc 1261 gctggagagt ggcagggtaa aacattgcgt gagcaagctc tggctcgcgg ttatcagatt 1321 gtcagcgatg ccgcatccct cgcagcagtg acgcaagcag ggcaggacaa acctctgctc 1381 gggttatttg ctgaaggaaa tatgcctgta cgttggcatg ggccaaaggc gagctatcac 1441 gggaatctcg ataagcccgc agtaacctgc acgccaaatc ctcagcgtaa cgaaactgta 1501 ccgacactgg cacagatgac agataaagct attgaattac ttagcaaaaa tgagagaggt 1561 ttcttcctgc aagtagaagg ggcatctata gataagcaag atcacgcggc taatccatgt 1621 ggacagattg gtgagacggt tgatctcgac gaagctgttc aacgtgctct ggaatttgcc 1681 aaaaaagatg gcaatacact ggtgatcgta actgcagatc atgcgcattc cagccagatt 1741 gtggcaccag ataccaaagc gccaggttta actcaggcgc taaatacgaa agatggtgcg 1801 gtgatggcga taagctacgg taactctgaa gaagactctc aggagcatac cggtagccag 1861 ctgcgcatcg cagcgtatgg gccgaatgcc gctaatgtgg tcggactgac cgatcaaacg 1921 gatcttttct acaccatgaa agccgctctg ggcttgcaat aatccatacc gacggccatt 1981 aattgcgccg tcgggtggtt ttttcaccat cagcagccag acttataaac actatccttc 2041 gggcgtattt tccggcccga aggaaactgg tgaagttgca acctcacaaa aggatggtgc 2101 gatgaaaatg acattactgg ttaccctgct atttgggctg gtttttttaa ccactgtcgg 2161 tgctgccgag agaacattaa cgccgcaaca acagcgtatg accacctgta atcagcaagc 2221 tactgcacag gcgttaaaag gtgatgcccg aaaaacgtat atgagcgatt gcctgaaaaa 2281 cagcaaatct cccccgaatg aaaaaagcct gacaccacaa caacaaaaga tgcgtgaatg 2341 taatacccag gcgacggaac aatcactgaa aggtgacgac cgtaataaat ttatgagtgc 2401 gtgtctgaag aaagctgcat agatccccct tagtgcttac gggtgagaca cgcaaacgcc 2461 tcacccgaaa tatcatactt ctccctcacc ccccctccta taatttggga aatttgtttc 2521 taaatattcc caaaaaaata atgaatgatg aaaacttcct taaaaaagcg gtcgcaccgg 2581 gtgaagaacc accttcttca cctcaaaacg aacatcaacg ttccgggtta cgctttgccc 2641 gtcgtgtgag gctaccaaga gttgtcggtt ttggcggtat gttctcccca tcgcctcggc 2701 gctggtggag caaccgatcg ctggatggtg gtggttgatc cttgttggct ggtcattcgt 2761 ttggccacat ctggcctggc aaatgtccag tcgtgcaatg gaccccttaa accgtgaaat 2821 ttacaaccta aaaacggatg cgattactgc cgggatgtgg gttggcataa tgggtgttaa 2881 cgcactacct tcaacgacga tgctgatgat tatgtgcctt aacctgatgg gcgccggagg 2941 acttaggttg ttcgtcgccg ggctggtatt gatggtagtc gcatgtctgg tgacgctaca 3001 gttgaccggc atccgcgtcg cgtttaccag ttcgcccctt gagtggtggc tttcactccc 3061 tgtcattgtg ttgtatccct taattttcgc ctgggtcagc tatcagacag caactaaatt 3121 ggcagagcat aaa // LOCUS SMAPHOAA 2672 bp ds-DNA BCT 28-JUN-1990 DEFINITION S.marcescens alkaline phosphatase (phoA) gene, complete cds. ACCESSION M33965 KEYWORDS alkaline phosphatase. SOURCE S.marcescens (ATCC 13880) DNA. ORGANISM Serratia marcescens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2672) AUTHORS DuBose,R.F. and Hartl,D.L. TITLE The molecular evolution of bacterial alkaline phosphatase: Correlating variation among enteric bacteria with experimental manipulations of the protein JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.F.DuBose, 01-MAY-1990. FEATURES from to/span description pept 667 2094 alkaline phosphatase (phoA) (EC 3.1.3.1) site 575 595 pho box regulatory sequence BASE COUNT 588 a 845 c 749 g 490 t ORIGIN 1 ggccagtgaa ttcctggcgc tgaacccgaa cggcaagatc ccggcgatca tcgatcccga 61 tggccccggc ggccggccgc tgccgctgtt cgaatccggc gccattctgc tctacctggc 121 ggaaaagagc ggccgtttcc tgccgcagga cccggcgcag cgttacgaaa ccattcaatg 181 ggtgttcttc cagatggccg ccgtcggccc gatgttcggc cagctcggtt tcttccataa 241 attcgccggg cgcgaatatg aagacaagcg gccgctggag cgctacaaaa acgaatcgaa 301 acgcctgctc ggcgtgctgg aaacgcgact tgaaggccgc gactggatca tgggcgcgga 361 gtacaccatc gccgatatct cgctgctggg ttgggtgcgc aacctgatcg gcttctatga 421 ggcgcgcgaa ctggtggagt tcgacagttt ccctcgcgtc gcgcagtggc tggagcgcgg 481 tctggcgcgc ccggcggtgc agcgtgggct cgctatcccg gcgcgcgcgg cctgagcgtc 541 aaacgggatg gccgccgggc cgtcccgcct tccgctgtca taaaattgtc atccctctgc 601 tctactctcc tttcgtaaca atttgatgtt atttacattg ttgtattacg acaaggagct 661 taccccatgc aacctgctgt atccctgatc gccggcgccg tactgtccgc cctgctctgc 721 tcctccgcca tcgcggcgga aacctcggcc aatgccgatg gcctgaccga tcgcgccgcg 781 cgcggcaacc tcgtcgaacc gggcggcgcc cgccgtctgg ccggcgatca gaccaccgcg 841 ctgaaggcat cgctgtcgga caaaacggcg aaaaacgtga tcctgctgat cggcgacggc 901 atgggggatt ccgagatcac cgccgcgcgc aactacgccg aaggcgcagg cggttacttc 961 aagggcatcg acgccctgcc gctgaccgga caatacaccc attactcgct ggacaggaaa 1021 acccacaagc cggactacgt gaccgactcc gccgcctccg ccaccgcctg ggccaccggg 1081 gtgaaaacct ataacggcgc gctcggcgtg gacgtgaacg gcaaggatca accgacgctg 1141 ctggagatcg ccaaggcggc cggcaaggcc accggcaacg tctccaccgc cgagttgcaa 1201 gacgccacgc cggcggcgct ggtctcgcac gtcatctcac gcaagtgcta tggcccggag 1261 gaaaccagcg agaaatgcgc cgctaacgcg ctggaaaatg gcggccgtgg ctccatcacc 1321 gaacaactgc tgaaaacccg cgccgacgtc acgctgggcg gcggcgcgaa atcgttcaac 1381 cagctggcaa aaagcggcga atggcaagga aaatcgctga aggaccaggc ggcggcgcag 1441 ggttatcagt gggtgagcaa cgccgacgaa ctgcaggccg ttaccctcgc caaccagcaa 1501 aaaccgctgc tggggctgtt cgccgacggc aatatgccgg tgcgttggct ggggccgaaa 1561 gccagctacc atggcaacct cgacaaaccg gcagtgacct gcgaaaacaa cccggcgcgc 1621 accgccgcca cgccaacgct ggcggcgatg accgaaaaag cgatcgcgct gttgaaagac 1681 aacccgaacg gcttcttcct gcaggtagag ggcgcgtcga tcgataaaca ggatcacgcc 1741 gccaacccgt gcgggcagat tggcgaaacc gtcgatctgg acgaagcggt acaaaaagcg 1801 ctggccttcg cccgcgccga cggcaacacg ctggtgatcg tcaccgccga ccacgcgcac 1861 gccagccaga tcgtctccgc cgacgccaaa gcgccgggcc tgacccaaaa gctgaccacc 1921 aaggacggcg cgccgatgac cctcagctac ggcaactctg aagaggaatc tcagggacac 1981 accggcaccc agctgcgcgt cgcggcctac ggcccgcacg ccgccaacgt agtgggcctg 2041 accgatcaga ccgatctgtt cttcaccatg cgcgatgcga tgggtatcaa gtaaccgacc 2101 cctgaacgat ggccgcccgc cacggcggcc atttattatt ggttaccgct aggtatttgc 2161 aatttaaccg caattattgt gtgataacga aataacgtct aattacgcgg caattaaccc 2221 atgaaaataa cgcggttata cggcctttct tccctctctg tttattcctg cgacagcaac 2281 cggccgtttt ttgcttatat tcattttttt tagttacaaa agtagttttt aacattctcg 2341 caaacaataa taccccgcca gtaacgtgaa ccgatcaatt tgcatccgga cggcctgcaa 2401 tagttcactt ttcgttaaat accctattaa caaaaaggca ttattgctgc ctatcggctt 2461 tcgccgggcg gtcggcttca ttatttccag ggtaactccc gctaaattct ttcacctgca 2521 atgagtagaa tttacacaca agagatttaa ttcaatgaga aaattactcg tcctgatttt 2581 cagtctcagc ctgtttggcg ttacgcaaca ggcggcggct gaagagaaga ataaaaccgt 2641 cgacctgcag gcatgcaagc ttggcactgg cc // LOCUS STYMALGA 1232 bp ds-DNA BCT 28-JUN-1990 DEFINITION S.typhimurium inner membrane protein genes (malG and malF) genes, complete cds, and 3' end. ACCESSION M33921 KEYWORDS . SOURCE S.typhimurium (strain LT-2) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1232) AUTHORS Francoz,E., Schneider,E. and Dassa,E. TITLE Sequence of malG in Salmonella typhimurium JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT malG protein is proposed to form with proteins malK, malF, and malE a hetero-oligomeric protein complex that performs the ATP-dependent translocation of maltose and of maltodextrins through the inner membrane of enterobacteriaceae. Draft entry and computer-readable sequence for [1] kindly submitted by E.Dassa, 30-MAR-1990. Institut Pasteur Unite de Programation Moleculaire et Toxicologie Genetique Inserm U169, CNRS URA 271 e-mail: hofnung%pasteur.bitnet@cunyvm.cuny.edu FEATURES from to/span description pept < 1 64 inner membrane protein (malF) pept 79 969 inner membrane protein (malG) binding 66 70 ribosomal binding site (put.) signal 1050 1073 rho-independent transcription termination signal (put.) BASE COUNT 241 a 336 c 332 g 323 t ORIGIN 91 minutes on the Styphimurium map. 1 cttcctgctg gtaggcgcgc tggcaatagt gaacctgaaa gccacgcgta tgaagtttga 61 ttaagggaga taataacaat ggctatggtc caacccaaat ctcagaaact gcgtctcctc 121 atcacgcacc tggggctgct gattttcatc gcagcgatta tgttcccgct gctgatggtc 181 atcgctatct cgctacgtga agggaacttc gccaccggaa gcctgatccc ggataaaatc 241 tcctgggaac actggcggct ggcgctgggt ttcagcgtgg aacatgcgga tggtcgggta 301 acgccgcctc ccttcccggt cctgctgtgg ctgtggaact cggtgaaaat cgccggtatt 361 accgccatcg gcatcgtggc gctctccacc acctgcgcct acgctttcgc acgaatgcgt 421 tttccaggta aagcaacact gctcaaaggg atgctgattt tccaaatgtt tccggcggtt 481 ttgtctctgg tagcgctgta tgcgttattt gatcgcctgg gacaatacat tccatttatc 541 gggctgaata cgcatggcgg cgtgatcttc gcctatcttg gcggcatcgc gctgcatgtt 601 tggacgatta aaggctactt cgaaactatc gacagttcgc tggaagaggc cgccgcgctg 661 gatggcgcaa ccccgtggca ggcgtttcgc ctggtgctgc tgccgctttc cgtaccgatt 721 ctggcggtgg tgtttattct gtcgttcatc gcagcgatta ccgaagttcc ggtcgcctcg 781 ttgttactgc gcgatgtgga cagctatacg ctggccgtgg gtatgcagca atatctcaac 841 ccgcaaaact acctgtgggg cgactttgcc gccgccgccg tcctttcagc tattcctatc 901 accctggtgt tcctgttggc gcaacgttgg ctggtcaacg gcctgacggc agggggtgtg 961 aaaggttaag ttttatcgaa gtacctgccc ggcaacgggc aatgccactg ccctcaagtt 1021 atgttatccc cacttatggt cgtattttgg cgctcttcgg agcgcttttt ttatgcgtat 1081 agacagcggc taatttattg ccacgccata cgtccgttac ggtgtggcga cgcctattct 1141 cgccgtagtc ggttcgaatt acataaccac aagtgataac caacaggagg atcgccgccg 1201 aatagagtaa tacatccatt gctttatgat cc // LOCUS ECOCLPPA 1236 bp ds-DNA BCT 28-JUN-1990 DEFINITION Escherichia coli ATP-dependent clp protease proteolytic component (clpP) gene, complete cds. ACCESSION J05534 KEYWORDS ATP-dependent protease. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1236) AUTHORS Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J., Bowers,B. and Gottesman,S. TITLE Sequence and structure of ClpP, the proteolytic component of the ATP-dependent Clp protease of E.coli JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.Gottesman, 23-APR-1990. FEATURES from to/span description pept 378 1001 ATP-dependent protease (clpP) mRNA 279 > 1236 clpP mRNA pept < 1 133 ORF signal 244 249 -35 region signal 267 272 -10 region signal 165 193 transcription termination signal binding 367 372 ribosome binding site (put.) BASE COUNT 329 a 291 c 318 g 298 t ORIGIN 1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga 61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa 121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc 181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg 241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac 301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt 361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg 421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt 481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta 541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc 601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc 661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt 721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga 781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg 841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat 901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg 961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg 1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt 1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc 1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg 1201 cgcaagctga ttgccggtcc atccgtgtat atctgc // LOCUS PLYORIA 375 bp ds-DNA VRL 28-JUN-1990 DEFINITION Human polyomavirus BK (strain WWT) origin of replication. ACCESSION M34048 KEYWORDS origin of replication. SOURCE Polyomavirus BK (strain WWT) DNA, from human urine. ORGANISM Polyomavirus BK Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 375) AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two type of control regions can be found among naturally occurring BK virus strains JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 375 regulatory region site 143 210 P block site 211 249 Q block site 250 302 R block BASE COUNT 123 a 86 c 95 g 71 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt 301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta 361 aactggacaa aggcc // LOCUS PLYORIB 424 bp ds-DNA VRL 28-JUN-1990 DEFINITION Human polyomavirus BK (strain UT) origin of replication. ACCESSION M34049 KEYWORDS origin of replication. SOURCE Human polyomavirus BK (strain UT) DNA, from human urine. ORGANISM Polyomavirus BK Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 424) AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two type of control regions can be found among naturally occurring BK virus strains JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 424 regulatory region site 143 210 P block site 211 249 Q block site 250 261 R block site 263 315 P block site 316 349 Q block site 350 361 R block BASE COUNT 134 a 103 c 105 g 82 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa 301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc 361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa 421 ggcc // LOCUS ECOUNCGA 85 bp ds-DNA BCT 28-JUN-1990 DEFINITION E.coli H+-ATPase gamma subunit (uncG) gene, 3' end. ACCESSION M34095 KEYWORDS H+-ATPase. SOURCE E.coli (strain KF12) DNA, clone pBWG1. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 85) AUTHORS Iwamoto,A., Miki,J., Maeda,M. and Futai,M. TITLE H+-ATPase gamma-subunit of Escherichia coli: Role of the conserved carboxyl-terminal region JOURNAL J. Biol. Chem. 265, 5043-5048 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 84 H+-ATPase gamma subunit (uncG) (AA at 1) BASE COUNT 21 a 24 c 23 g 17 t ORIGIN 1 ctgcagttgg tatacaacaa agctcgtcag gccagcatta ctcaggaact caccgagatc 61 gtctcggggg ccgccgcggt ttaaa // LOCUS HUMPIM1B 1737 bp ds-DNA PRI 28-JUN-1990 DEFINITION Human protein kinase (Pim-1) promoter DNA. ACCESSION M34228 KEYWORDS protein kinase; proto-oncogene. SOURCE Human cell line K562 DNA, clone pLH/P. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1737) AUTHORS Meeker,T.C., Loeb,J., Ayres,M. and Sellers,W. TITLE The human Pim-1 gene is selectively transcribed in different hemato-lymphoid cell lines in spite of a G+C-rich housekeeping promoter JOURNAL Mol. Cell. Biol. 10, 1680-1688 (1990) STANDARD simple staff_entry FEATURES from to/span description mRNA 1699 > 1737 Pim-1 mRNA (alt.) mRNA 1700 > 1737 Pim-1 mRNA (alt.) mRNA 1703 > 1737 Pim-1 mRNA (alt.) mRNA 1705 > 1737 Pim-1 mRNA (alt.) mRNA 1706 > 1737 Pim-1 mRNA (alt.) mRNA 1732 > 1737 Pim-1 mRNA (alt.) mRNA 1733 > 1737 Pim-1 mRNA (alt.) mRNA 1735 > 1737 Pim-1 mRNA (alt.) binding 891 900 Sp1 binding site binding 1422 1431 Sp1 binding site binding 1433 1442 Sp1 binding site binding 1496 1505 Sp1 binding site binding 1222 1248 AP2 binding site binding 1290 1308 AP2 binding site binding 1662 1685 AP2 binding site binding 1340 1366 PPF-348 binding site BASE COUNT 360 a 546 c 426 g 405 t ORIGIN 1 ctttgaatac tcaggaggtg aatttgggtc atttcatctc ctctcttagc tgctgtatga 61 ccctgtgcca tcctatttat gtttcatttt cctatctgca aaaagggagt aattcttgtt 121 cccttgggtt ttctcctagt cataaaaaat gaatgagttc ggccacagta cacaaacaaa 181 agaaaatatg taatattttg taggataaat tcataaaagt tgtggaggat ctgggcactt 241 ttataaagca agcactgtag aaaactttca aaggaaacat ttaaattcac taatgacagc 301 tatagggatc ctgatttttc ctttaaatgg caggcacttc aaaaattaaa atagaatgta 361 gttagctttc aattacttag gccactgaca attcaattta tggattatat agtattttaa 421 tttactcatt tcacacgtcg ttatcaactc tacatgaagt ttaaaaggac aaaggatgca 481 agtggaattg gtcagtgttc cgggttattt ccaagggagg cagagggtgg ggggtttcct 541 ttgagacaag acttggggtt ggccaataat tgctggtatt gcctgcctgg taataacagg 601 ctgatgaaaa ggtgactaca aaacgtgaaa actggttaaa tcaagcgcac cctcccaccc 661 tagttttaga tgaggaattt tccgccctca cagaaggggc tgaggcagca tctggcatca 721 caacactaac atttgcttcg tgatttcctc tttacccggc cctttgacac acatcccttc 781 ccagaaatca ggattcgctg gtgcttttgc atttctaaaa tgggaatccc gtggctgagc 841 ttttagccgg ccggaacgac tgagggctgc atccctttcg caggagcggc gtcccgcctc 901 cagttctgtt ccaggcctga ctcctccact ccctccgtga ctcatgtctg cggatccttc 961 gcccccgacg cgccccccaa cacacaaacc cccagaatcc gcccccagcc tacagcgcga 1021 cgtcagcccg ccccagccga cttggaggtc tcgggtctga gtcacacaga aagaccaccc 1081 tcgtcggcat ccccacacac agtccgacac ccggcgcgcc ggcctccccg cctgacacac 1141 taacgcccgt cgtctccgcg caacttgtta tgctccggct cgagcccttg acccaaaaac 1201 ctcagcgaaa cggagagccg cagagccggc ctcgggcggc ctttgatggc tttgttattg 1261 tttgggtttg aatcgatacg cccctcccca tccttcctcc ctcgcggccc tacacccagc 1321 tcccgcctcc cctcacgccc cgcgcccctc cccctccatt ttggcgcctt ttccttcccg 1381 ccacgtcgtg gcggcgtaga gaccattctg accgcgagag ctgggcgggg cgggggcggg 1441 gcgcgccgag ttatgcagat caatcggcct ctggttggct ggagtagcgc tggcaggggc 1501 ggggccgggg cgcggccaca gagcgcgcgg ggcgggggcc gaggggagtc gcccagtccc 1561 gccgcttccc caccccctct cctccctcgg ccggcccggc agccctgctc cccgccttgg 1621 cctcccggag aggccccgcc ccgtcccccc ccgccgcgcc ctccccgcgc gccctccccg 1681 ccggcgcgct cctccccttt actcctggct gcggggcgag ccgggcgtct gctgcag // LOCUS ONNCG 11835 bp ss-RNA VRL 28-JUN-1990 DEFINITION O'Nyong-nyong virus (Gulu strain) complete genome. ACCESSION M20303 M33999 KEYWORDS nonstructural protein; polyprotein. SOURCE O'Nyong-nyong virus (strain Gulu, isolated from Anopheles gambiae), cDNA to viral RNA, clones 2,4,7,8,9,11,13,14,17,17s,18,22,23,27, passed in BHK-21 cells. ORGANISM O'Nyong-nyong virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae; Alphaviridae. REFERENCE 1 (bases 1 to 11835) AUTHORS Levinsion,R.S., Strauss,J.H. and Strauss,E.G. TITLE Complete sequence of the genomic RNA of O'Nyong-nyong virus and its use in the construction of alphavirus phylogenetic trees JOURNAL Virology 175, 110-123 (1990) STANDARD simple staff_entry REFERENCE 2 (bases 4079 to 7621) AUTHORS Strauss,E.G., Levinson,R., Rice,C.M., Dalrymple,J. and Strauss,J.H. TITLE Nonstructural proteins nsP3 and nsP4 of Ross River and O'Nyong-nyong viruses: Sequence and comparison with those of other alphaviruses JOURNAL Virology 164, 265-274 (1988) STANDARD full staff_entry FEATURES from to/span description pept 80 7624 O'Nyong-nyong polyprotein A matp 80 1684 nonstructural protein P1 matp 1685 4078 nonstructural protein P2 matp 4079 5788 nonstructural protein P3 matp 5789 7621 nonstructural protein P4 pept 7670 11413 O'Nyong-nyong polyprotein B matp 7670 8449 C protein matp 8450 8641 E3 protein matp 8642 9910 E2 protein matp 9911 10093 6K protein matp 10094 11410 E1 protein mRNA 1 11835 O'Nyong-nyong mRNA RNA 7621 7669 26S RNA signal 11817 11835 polyA signal BASE COUNT 3676 a 2860 c 2859 g 2440 t ORIGIN 1 atagctgcgt gatacacaca cgcagcttac gggtttcata ctgctctact ctgcattgca 61 agagattaaa gtacccatca tggattcagt gtatgtagac atagatgctg acagcgcgtt 121 tctgaaggcg ttgcagcaag cataccccat gtttgaggtg gaaccaaagc aggtcacgcc 181 aaatgaccat gcaaacgcta gagcattttc gcatctagca ataaaactga tagagcagga 241 aattgatcca gactcaacca ttctagacat tggtagcgca ccagctagga ggatgatgtc 301 tgatagaaaa taccactgcg tctgcccgat gcgcagcgca gaagaccctg agaggctcgc 361 gaattacgcg agaaaacttg cgtcagccgc tggaaaggtg acagataaaa acatctccgg 421 aaaaattaat gatctacaag ctgtgatggc cgtaccgaat atggaaacat ccacattctg 481 cctacacact gatgctacat gcaaacaaag aggagacgtc gccatttatc aagacgtcta 541 cgccgtccat gcacctacct cgctgtacca tcaggcgatt aaaggagtcc gcgtggcata 601 ctggataggg ttcgatacga cacctttcat gtacaatgca atggctggcg catacccatc 661 atattcaaca aactgggctg atgagcaggt actgaaagct aagaacatag ggctgtgttc 721 aacagaccta tctgagggta gacgaggcaa actatccatc atgagaggca aaaaattgaa 781 gccatgcgac cgagtgctat tctcggtcgg ctcaacactc taccctgaaa gtcgtaaact 841 tctacaaagc tggcatttac catcggtatt tcatctgaag ggtaaactca gcttcacctg 901 ccgctgtgac acgatcgtct catgcgaagg atacgttgtc aagagagtga ccatgagtcc 961 aggcatctac ggaaagacat cggggtatgc tgtaactcat catgccggcg gcttcctgat 1021 gtgcaagacg acagatacag tagacggcga aagggtatca ttctccgtgt gtacttacgt 1081 accagctact atctgcgacc agatgactgg aatccttgcc actgaggtaa ccccagaaga 1141 cgcacagaaa ctactggttg ggctaaacca acggatagtg gtcaatggca ggacgcaacg 1201 taatacaaac accatgaaaa actacctgct cccaatagtt gctcaggcct tcagcaagtg 1261 ggccaaagaa tgtcgaaagg acatggagga cgaaaaactc ttgggtgtcc gagagaggac 1321 cttaacgtgc tgttgcctat gggcatttag aaagcacaag acgcatacgg tgtacaaaag 1381 accggatacc cagtcaatcc aaaaggtccc tgccgaattt gacagctttg tgataccaag 1441 tctgtggtcg tcaggtttat caattccgct gagaaccaga atcaagtggc tcttgagcaa 1501 agctccaaaa tacgagcaac taccgcacag cggaaacgcc gaggaagcag cccaggctga 1561 aacagatgcg gtagaagaac aggaggcaga gctaacccga gaagctatgc caccattgca 1621 ggcgacacag gatgacattc aggtagaaat tgatgtagag caacttgaag accgagcagg 1681 agcgggcata gtcgaaacac caagaggagc aatcaaagtc acagcccaac cgtcagacct 1741 tgttgtcgga gagtacttag tactgacacc gcaggcggtc ctgcgcagcc aaaaactcag 1801 tctgattcac gcgcttgcag agcaggtaaa aacgtgcaca catagtgggc gagcaggcag 1861 gtacgcggtt gaagcatacg atgggcgtgt tctagtgccc tcgggctacg cgatacccca 1921 ggaagacttc cagagcttaa gcgaaagtgc caccatggta tttaacgagc gagagtttgt 1981 gaaccggaag ttacaccaca tcgccatgca cggcccagcg ctgaacactg atgaagagtc 2041 atatgaactg gtaagggtag agaaaacaga acacgagtac gtctatgacg ttgatcagaa 2101 gaaatgttgt aagagggagg aagcaacagg actagtgcta gtaggcgact taactagccc 2161 accataccat gagttcgcct acgaaggact aaaaatccgc ccagcatgtc catacaaaac 2221 ggcagttata ggtgtcttcg gagtaccggg ttctggcaag tcggctataa tcaaaaacct 2281 ggtaaccagg caagacttgg tgactagtgg aaaaaaagaa aactgccaag aaatctccaa 2341 tgacgtaatg cggcaaagga aattggagat atctgcacgt acagtcgact cactactcct 2401 gaatggatgt aacaagccag tggaagtact gtacgtggac gaggcattcg cttgtcattc 2461 gggaaccctg ttggcactga tagccatggt tagaccgcgt cagaaggtcg tactttgtgg 2521 cgacccaaag cagtgcggat tcttcaatat gatgcaaatg aaggtcaact ataatcacaa 2581 catctgcaca caggtgtacc ataaaagcat atcaaggcgg tgtacactgc ctgtaacagc 2641 catcgtgtcc tcgttgcatt acgagagcaa gatgcgcact acaaatgagt acaaccagcc 2701 aatcgtagtg gatactacgg gcataacaaa accagaaccc ggggacttag tgttaacgtg 2761 tttccgggga tgggttaagc agctgcaaat agactaccgt ggaaacgaag tcatgacagc 2821 agctgcttct caggggctga ccagaaaagg tgtttatgca gtaaggcaga aagtcaacga 2881 aaaccctctg tatgcaccaa catcagaaca cgttaacgtg ctattgacac gcacagaggg 2941 caagttgaca tggaagacac tctcaggcga cccatggata aagatactgc agaacccccc 3001 aaaaggggac tttaaggcaa caatcaagga gtgggaagca gaacacgcct ccatcatggc 3061 aggaatatgc aatcaccaga tggcttttga cacatttcag aacaaagcta atgtatgctg 3121 ggctaaatgc ctggtcccta ttcttgacac tgctggaatc aaattaagtg acaggcagtg 3181 gtctcagata gtgcaagctt ttaaagaaga tagggcctac tctccagaag ttgcactgaa 3241 tgaaatatgc actcgcatat atggggtaga cctggacagc ggactattct caaagccact 3301 gatatccgtc tactatgcag acaaccactg ggacaataga ccaggaggaa aaatgttcgg 3361 gttcaaccct gaggtggcac ttatgcttga aaagaaatat ccctttacaa aaggtaagtg 3421 gaacattaac aagcagatat gtataactac cagaaaggtt gacgaattta accccgaaac 3481 caacataata ccggccaacc gcagactgcc gcactcactc gtggctgaac accactcagt 3541 gagaggggaa agaatggaat ggctggtaaa caaaatcagc ggtcaccaca tgttgttggt 3601 tagcggtcat aatcttatat taccaacaaa aagagtcacc tgggtagcac cgttaggcac 3661 ccgaggtgca gactacacat ataacctgga acttggtcta ccagccacac taggcagata 3721 tgacctggta gttatcaata tccatactcc attccgcata catcattacc agcagtgtgt 3781 agatcacgca atgaagctcc agatgctagg gggggactct ctacggctgt taaagccggg 3841 aggttcactt ctgattagag cttacgggta cgccgaccga accagtgaaa gggtcattag 3901 cgtattggga cgcaagttca gatcgtccag ggctctgaaa cctcagtgca tcacgagcaa 3961 tacagaaatg ttcttcctat ttagccgatt cgacaatgga agaaggaact tcaccacaca 4021 tgttatgaac aaccagctga acgcagtgta tgcaggactg gccactagag cgggctgtgc 4081 cccgtcatac cgagtgaaac ggatggacat cgcaaagaac actgaggaat gcgtggtaaa 4141 cgccgccaat ccgcgcggag taccaggcga tggagtatgt aaagccgtgt atagaaaatg 4201 gccagaatca ttcagaaaca gtgcaacacc agtggggact gcaaagacaa tcatgtgcgg 4261 tcaatacccc gtcatccacg cagtaggccc taacttctca aactattctg aggctgaagg 4321 ggatagggaa ttggcttcag tgtatagaga agtggcgaaa gaagtgtcta ggctaggagt 4381 gagcagtgta gccatccctt tgctctcaac cggtgtgtac tcaggaggca aagacagact 4441 gctgcaatca ctaaaccatc ttttcgcagc gatggattcg acagatgcag acgttgtcat 4501 ctactgcagg gacaaggaat gggagaagaa gatcactgaa gccatatcac taagatccca 4561 ggtagaatta ctagatgatc acatctcagt ggattgcgac attgtacgcg ttcatccaga 4621 cagcagcttg gcaggccgaa aggggtacag cacagtagag ggagcactct actcgtacct 4681 agagggaaca agattccacc aaactgcagt agatatggca gagatatata ccatgtggcc 4741 gaaacaaact gaagccaacg aacaggtctg cctatatgct ctgggggaga gtatagagtc 4801 cgtcaggcaa aaatgtcccg tagacgacgc cgacgcctca ttccctccga aaacagtccc 4861 gtgcctatgc cgttatgcta tgacgcctga acgagttgca cgtctacgca tgaatcatac 4921 caccagcatc atagtgtgct cgtcttttcc gctgccgaaa tacaaaatcg agggcgtgca 4981 aaaagtaaaa tgttcgaaag cactcttgtt tgatcacaac gtaccgtctc gagtgagccc 5041 gagaacgtac aggcctgcgg acgaaatcat acagacacct caaacaccaa ctgaagcgtg 5101 tcaggacgca caactcgtgc agtcaataaa tgatgaagca gtgccagttc cctcagactt 5161 agaggcttgt gacgcaacta tggactggcc ctctatcggc accgtatcaa caagacaaag 5221 acacgactca tctgacagcg agtatagtgg ctccagaagt aacatacaac tagtgacggc 5281 ggacgtgcat gcaccaatgt acgcacattc gctggcgtcc agcggaggtt caatgctgtc 5341 gctgtccagt gaaccagctc agaacggcac aatgatacta cttgactcag aagacacaga 5401 cagtataagc agagtaagca caccgatcgc cccgcccaga agacgtttgg gaaggaccat 5461 aaatgtgacc tgcgacgagc gggaagggaa aatactccct atggccagcg acaggttctt 5521 cactgctaag ccatacactg tcgcactgag cgtatcaaca gcagatatga ctgtgtatcc 5581 catccaggca ccgctaggat tgataccacc acctaccctc gaaccgatca ctttcggaga 5641 cttcgccgaa ggtgaaatag acaacctcct gacaggggca ttgacatttg gggacttcga 5701 gccaggtgaa gtggaagagc tgacggatag cgagtggtca acgtgctcgg acacagacga 5761 agagttacga ctagacagag cagggggtta catattctcc tctgacactg gtcaaggtca 5821 tctacagcaa aaatcagtac gtcaaacgac gctaccggta aacattgttg aagaggtcca 5881 cgaagagaaa tgctatccac ctaaattgga tgagatcaaa gagcaactac tacttaagag 5941 acttcaggag agtgcttcca cggctaaccg gagtaggtac caatctagaa aagtggaaaa 6001 catgaaagcc acgattatcc acagactgaa agagggttgc agactctatt tggcgtcaga 6061 aacaccgagg gtcccatctt accgagtcac atacccggcg cccatctact cgccttcaat 6121 caatatcaaa ctgactaacc cagagactgc agtagcagtg tgtaacgagt ttttggccag 6181 aaactatcca actgtggcat cctaccaagt cactgacgag tacgacgcgt acttggatat 6241 ggtagacggg tccgaaagtt gcctagacag agctacattc aacccgtcta aactcaggag 6301 ttacccaaaa caacactctt accacgcacc caccatcaga agtgcagtgc catcaccatt 6361 ccaaaatacg ttgcagaatg tcttggcagc ggccacaaaa agaaactgca acgtaacgca 6421 gatgagggaa ctacccacta tggactccgc agtgtttaac gtggagtgtt ttaagaagta 6481 cgcttgcaac caagagtact ggagagagtt cgcctcaagc cctataaggg taacgacaga 6541 gaatctgaca atgtatgtga cgaaactaaa ggggcctaaa gcggcggcac tcttcgcaaa 6601 aacacacaac ttgctgccgc tacaagaggt accgatggac aggttcacaa tggacatgaa 6661 acgtgatgtg aaagtgacac caggtacaaa gcacaccgag gaaaggccga aagtacaggt 6721 catacaggcg gcagaaccac tggcaacagc atacctgtgt ggcatacaca gagagttggt 6781 gagaagacta aatgcagttc tgctaccgaa tgtccacaca ctgttcgata tgtcagccga 6841 agacttcgat gcaattatag ccacccattt caaaccgggc gatgctgtac tagaaactga 6901 catagcctca tttgataaga gtcaagacga ctcgcttgcg tcgaccgcca tgatgttgct 6961 agaagacctt ggggtagatc aacctatcct ggatctgata gaagcagcat tcggcgaaat 7021 atccagttgt catctaccga cgggtacgcg gttcaagttc ggcgcaatga tgaaatcagg 7081 catgtttcta accctgtttg tcaataccct cctgaacatc accattgcca gtcgggtgtt 7141 agaggagcga ttgactactt cagcctgtgc agctttcatt ggggacgaca acataataca 7201 tggggttgtc tctgacgcac taatggctgc acgttgtgct acgtggatga acatggaagt 7261 gaaaatcatc gatgcagtag tgtcagagaa ggcgccatac ttctgtgggg gatttatttt 7321 acacgacacg gtgacaggca cgtcgtgcag agtagcagac cctttaaaga gactgttcaa 7381 gctaggcaaa cctctggcag ctggagacga acaggatgag gacagaagac gtgctctggc 7441 agatgaggtt actagatggc aaagaaccgg cttagtcaca gaattggaaa aagcagtata 7501 ttcaaggtat gaagtacaag gaataacagc cgtaataaca tcaatggcta cctttgcgaa 7561 tagcaaagaa aactttaaga aactaagagg gcccgtcgta accttgtacg gcggacctaa 7621 ataggtacgc actacagcta cctacttaga gcatacatcg ctaatagcca tggagttcat 7681 accagcacaa acttactaca atagaagata ccagcctaga ccctggactc aacgccctac 7741 tatccaggtg atcaggccaa aaccacgccg aagaaggcct gcaggacaac tcgcacaact 7801 gatatccgca gtcagcagac tagcactgcg tacagttccc cagaaaccac gccggacccg 7861 aaaaattaag aagcaaaagc aagtaaagca agaacaacag agtactacga accagaagaa 7921 aaaggcgccg aaacaaaagc agacccaaaa gaaaaagaga ccaggacgaa gggaaaggat 7981 gtgcatgaag attgaaaatg actgcatctt cgaagtcaga catgaaggaa aagtaacggg 8041 gtatgcatgc ctagtaggtg ataaggtaat gaaaccagca cacgtgaaag gaactattga 8101 caacgcagac ctagcgaagt tggcgttcaa aagatcatcc aaatatgatc tagagtgcgc 8161 acagatacca gtgcacatga aatcggacgc ctcaaagttc acccatgaaa aaccagaagg 8221 ctattacaac tggcatcacg gagcagtaca gtattctgga gggaggttca cgatccctac 8281 aggcgcagga aagcctgggg acagcggaag accaatcttt gacaacaagg ggcgtgtcgt 8341 ggctattgtt ctaggcggag caaacgaagg aaccaggaca gcactatctg tagtgacttg 8401 gaataaagac atagtcacaa aaatcacacc agaggggtca gttgaatgga gccttgccct 8461 ccctgtcatg tgcctgttgg caaatacaac cttcccatgt tcccaaccgc cttgcgcgcc 8521 gtgctgctac gaaaagaaac cggaagaaac cttgagaatg ctggaggaca acgtcatgca 8581 accaggatat taccagttac tcgattcagc attggcctgc tcacaacgtc gtcaaaaacg 8641 taatgcaaga gaaaacttca atgtctacaa agtcactagg ccgtacttag cccactgtcc 8701 tgactgcggg gagggacact catgccacag cccaatagca ttagaacgga tcagaagtga 8761 ggcaacagat ggtaccttga aaatccaggt atctctgcaa atcggaataa agacagacga 8821 cagccacgat tggacgaagc tacggtatat ggatagccat acacctgtgg atgcagaccg 8881 atccgggttg tttgtcagaa cgtcagcacc gtgcaccatc acgggaacga tgggacattt 8941 catactagca cgctgtccga aaggagagac gctgacggta ggatttgtag acagtagaag 9001 gatcagtcac acgtgcatgc acccgttccg ccacgagcca ccgctgatag ggagagagaa 9061 gtttcactcc cgcccgcagc atggcaaaga actaccttgc agtacatacg tccataccac 9121 agcggcaact gctgaggaaa tagaagtgca tatgccgcca gatacccctg actacacgct 9181 gatgacacag caagcgggaa acgttaagat cacagttgac ggccagacgg tacgatacaa 9241 gtgcaaatgt gacggctcca atgaaggatt aataaccgct gacaaagtca taaataactg 9301 caaagtagac caatgccaca cagcggttac aaaccacaag aaatggcaat acaattcacc 9361 gctgaccccg cggaactccg aacaaggaga tagaaaaggt aagatccata tcccatttcc 9421 actggtgaac acaacctgca gggtaccaaa agcaagaaat ccgactgtca catacggtaa 9481 aaacagagtc actctgctgt tacatccaga ccacccaaca ctcctttcgt accgcgccat 9541 gggaaggatc ccggattacc atgaagagtg gataacaaac aagaaggaaa taagtatcac 9601 agtaccagca gaaggcttag aggttacgtg gggtaataat gacccataca aatattggcc 9661 ccaactgtct acaaatggta ctgcgcacgg gcacccacat gaaataatcc tctattacta 9721 tgagctgtac ccaactacca caattgctgt actagctgct gcttctatcg taataacatc 9781 tttggtaggt ctatcattag gcatgtgcat atgcgcgaga cgcaggtgca tcacgccata 9841 tgagctgact ccaggagcta ccatcccatt cctcctaggt gtactatgct gtgccaggac 9901 tgcaaaagca gcatcgtact acgaagctgc aacatacctc tggaatgagc aacaaccatt 9961 attttggtta cagcttctaa tccctctgtc agctgcaatt gttgtgtgta attgcctaaa 10021 acttttacca tgctgctgca aaacattgac ttttttagcc gtcatgagca tcggtgcccg 10081 cactgtgacc gcgtacgagc acgcaacagt gatcccgaac acggtgggag taccgtgtaa 10141 gactcttgtt agcagaccag ggtacagccc tatggtctta gaaatggagc tacagtcggt 10201 cactctggaa ccagcattat ccttggatta cattacgtgt gagtataaaa caatcacacc 10261 gtccccgtac gtaaaatgct gtggtacagc tgaatgtaag gccaagaacc tgccagatta 10321 taactgcaaa gtattcacag gcgtctaccc atttatgtgg ggaggagcat actgcttctg 10381 tgacgcagag aacacacagc tcagcgaggc acacgttgag aaatcagaat catgcaaaac 10441 tgagtttgca tcagcctaca gagcccacac agcttcagta tcagctaaac tacgtgtctt 10501 ttaccaaggg aataatatca ccgtgtctgc atacgccaat ggtgatcatg cagttacggt 10561 ggaagacgcg aagtttgtca tcggtccact atcgtccgcc tggtcaccat ttgataataa 10621 gatcgtggtg tacaaaggcg aagtctacaa tatggactat ccacctttcg gcgcagggag 10681 gccaggacag ttcggtgaca tccagagccg cacgccagac agcaaggacg tctatgcgaa 10741 tacgcagtta atactgcaaa gaccagcggc aggagcaata cacgtgcctt actcccaggc 10801 accttcgggc tttaagtact ggctcaagga aaaaggggca tcattgcagc atactgcacc 10861 atttggctgt cagatagcaa caaacccggt aagagcagtg aactgtgcag tgggcaacat 10921 accagtctcc attgacatcc cagatgcagc tttcaccagg gtcactgacg ctccttccat 10981 cacagacatg tcctgcgaag tagcttcgtg tacccattca tctgattttg gaggtgccgc 11041 agtcataaag tacacagcta gtaaaaaagg aaaatgcgcc gtgcactctg taacaaatgc 11101 ggtcactatc cgcgaaccta acgtagatgt caagggaaca gcacaattgc aaattgcctt 11161 ctcgaccgca ctagctagtg cggaattcaa ggtgcagatc tgctccacac tggtacactg 11221 ctcagcgacg tgccatcctc ctaaagacca tatagtcaat tacccgtcac ctcacaccac 11281 actaggagtg caggacattt caacgacagc tatgtcttgg gtccagaaga ttacaggagg 11341 agtgggactc gtggttgcta tagctgcttt gatcttaatt atagttctct gcgtatcatt 11401 tagcagacac taaagtacac atgccccaaa gtaatatact gtacaggtat acgtgctccc 11461 tgagcagcac gatatatgta ttatccataa aagaaaaaca aaacaaaaat ataaaaatta 11521 taaaatacaa aagtataaaa acaggtattg gtaccccctt agaggtacat tatttaacca 11581 gtgaaaatag gtattggtgc ccccttagag gcacattata gaaccaggta taggtgcccc 11641 cttagaggta caccaacaat aggtataagt gcccccttag tggcacacta accaccacaa 11701 tcggtaatat gaagagacgt aggtatgaag cttcgtaaaa gctgccgaac ttactttaag 11761 atgtaggcgt accgaactct tcgacaattc tccgacgcag ggacgtagga gaagttattt 11821 tgtttttaat atttc // LOCUS PNGE155A 155 bp ds-DNA INV 28-JUN-1990 DEFINITION P.redivivus E155 gene. ACCESSION M33925 KEYWORDS . SOURCE P.redivivus (strain SC) adult DNA, clone E155. ORGANISM Panagrellus redivivus Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae. REFERENCE 1 (bases 1 to 155) AUTHORS de Chastonay,Y., Muller,F. and Tobler,H. TITLE Two highly reiterated DNA sequences in the low C-value genome of Panagrellus redivivus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.De Chastonay, 26-APR-1990. Institue of Zoology Perolles, Fribourg CH-1700 SWITZERLAND e-mail: stocker%cfruni51.bitnet@cunyvm.cuny.edu BASE COUNT 43 a 33 c 30 g 49 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccaag gtggcccccg gggccatacg gtacttaaca aatttgcata aatgtgtgat 61 ttttgacaca aaacgagctt attaacatct ttggaattgc ccatgaatct tcatttgaat 121 gtggattata gatcttatcg ccccccaagc ttttg // LOCUS PNGE167A 167 bp ds-DNA INV 28-JUN-1990 DEFINITION P.redivivus E167 gene. ACCESSION M33926 KEYWORDS . SOURCE P.redivivus (strain SC) adult DNA, clone E167. ORGANISM Panagrellus redivivus Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae. REFERENCE 1 (bases 1 to 167) AUTHORS de Chastonay,Y., Muller,F. and Tobler,H. TITLE Two highly reiterated DNA sequences in the low C-value genome of Panagrellus redivivus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.De Chastonay, 26-APR-1990. Institue of Zoology Perolles, Fribourg CH-1700 SWITZERLAND e-mail: stocker%cfruni51.bitnet@cunyvm.cuny.edu BASE COUNT 54 a 31 c 27 g 55 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcccga tgttcagatt tgaactttgt atgaatttga gatggcccat ttgatcattg 61 cggattcttc agaatccgaa tctaatggct cagacccaca gaataagctt taaataggct 121 attctaacta tattgattca actaatctaa agtatgcctt aaaacat // LOCUS SNDHNGP 2600 bp ss-RNA VRL 28-JUN-1990 DEFINITION Sendai virus hemagglutinin-neuraminidase (HN) gene, complete cds. ACCESSION M34033 KEYWORDS glycoprotein; hemagglutinin; neuraminidase. SOURCE Parainfluenza virus type 4A (strain Toshiba) from primary monkey kidney cells, cDNA to viral RNA, clone pG4HF1. ORGANISM Parainfluenza virus type 1 Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Paramyxoviridae; Paramyxovirus. REFERENCE 1 (bases 1 to 2600) AUTHORS Bando,H., Kondo,K., Kawano,M., Komada,H., Tsurudome,M., Nishio,M. and Ito,Y. TITLE Molecular cloning and sequence analysis of human parainfluenza type 4A virus HN gene: Its irregularities on structure and activities JOURNAL Virology 175, 307-312 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 343 2064 hemagglutinin-neuraminidase (HN) mRNA 53 2600 HN mRNA mRNA < 1 52 fusion protein (F) mRNA signal 39 52 consensus polyA signal signal 2587 2600 consensus polyA signal BASE COUNT 955 a 561 c 410 g 674 t ORIGIN 1 acaaaacgac acccaatcaa acaatctcca catttcaatt ttaagaaaaa aactattata 61 ggggggaaca cacttctcag ccctgattgc tcaaggccct tgcatgtgca accgagacac 121 cccccacaag caccggaata agacctgaca acaaagtagc agccaccacg acccaaaaac 181 aaaattaaaa ggatccggta acagcccatc aaccagcaat catagaatcc aacaatccag 241 agagacgtca catcaactca tccacgaatc ttcgaaggga acatcccaga caaaatcaca 301 gcccattccc tgatcacgga taaactgaga aagatcacaa gaatgcaaga ctcacatggt 361 aatacacaaa tactcaacca ggcaaattca atggtgaaaa gaacatggag attactattt 421 cgaattgcaa ccttaatatt acttgtttca atatttgtgt tatcgctcat aattgtatta 481 cagtcaacac cggggaattt gcaaaacgat atcaatataa ttagaaagga gctcaatgaa 541 cttatggaga attttgaaac tacatctaag tcactgttaa gtgtatcaaa tcaaatcact 601 tacgatgtat cagtacttac tcctataaga caagaagcta ttgaaacaaa catcatttca 661 aaaataaaag atcattgcaa agatagagta attaaagaag gaagcacttg cacattgaat 721 cgcagccctt tgcatgatgt ctctttttta aatgggttca ataaattcta tttcacatat 781 aaagataata tgcaaattaa gtttaaatca ttattagatt accccaattt tattccaact 841 gctacaactc cccacggatg cattcgaatt ccatcattct ccttaggtca aacccattgg 901 tgttataccc ataatataaa cctactagga tgtgcagacc ctgcatctag caatcaatat 961 gtatcactag gaaccttaca agtcttaaaa atgggtgacc cttattttaa agtcgagcat 1021 agtcattatt taaatgacgg gaggaatcga aagagttgtt cagtggttgc tgtccccgac 1081 ggatgcctgc ggaattgtgt gaccatgaca aaaaatgaga cagagaattt caaagacctc 1141 aattggcaac acaattactt acatacatat catataatgg taccattaaa gactcgtata 1201 ataaatccac caggatcatc cagagattgg gttcatatcg caccaggggt aggctcgggc 1261 cttttgtatg ccaaattact tatatttcct ttgtatgggg gtctcacgga aaaatcagtg 1321 atacataata atcaatcagg gaaatatttt ttccctaatt caactaaatt gcaatgccgt 1381 aacagcacta tggaaaaaat aaaaggagca aaagattcat acacaataac ttacttctca 1441 gggagactta tacagagtgc atttctggtt tgtgatctaa gacaatttct ttctgaagat 1501 tgtgaaatct taattcctag taatgattac atgatggtcg gtgcagaggg tcgattatat 1561 aacattgaga acaacatatt ttattatcag agaggatcca gctggtggcc ttatccgagc 1621 ctctatagaa tcaggttaaa ccttagtaag aaatatccta gaataactga aattaaattt 1681 acaaaaattg aaatcgcccc aagaccaggc aacaaagatt gtccaggaaa taaggcttgc 1741 ccaaaagaat gtataacggg agtctaccaa gatatattgc cactaagtta tcccaatact 1801 gcatttccac acttaaaaca agcgtattat acaggttttt atcttaataa ctcgctcgag 1861 agacgcaatc caacatttta tactgctgac aatctagatt accatcaaca ggaaagatta 1921 ggtaaattca atcttactgc tggatactct actacaactt gttttaaaca gaccactact 1981 gcgaggttat actgtctcta cataattgaa gtgggtgact cagtcattgg ggactttcag 2041 atcacccttt ttttagcagc ttaatagacc agactgttaa ttaatcaaca aagttattct 2101 gtaatataaa ctgatcttat aagtgaaaag atgcctatcc aaggaggttg atagacaaat 2161 agtaaaagta gcaattgtaa caaaactcta aggaaaaagt aattcgagaa atattataga 2221 ctgacttcag agcaaacaca acatcgatcc ataatagtca atataatcaa taatactcta 2281 tgagacctta cctatcaaca gcaaaaaaca cagtccatca agcggaaccc aactcgctcc 2341 atccttaatc atccactgaa agaaaaaata tacgaaggac catcggccac cgggtccaaa 2401 caatctagca caaaaattca aacaaccgcc aaactctgtt cggcctcaac aaacaatccg 2461 ccaagccatc tgtcattcct ataccaacac acaaccatcc cattcctcaa aagcaattca 2521 atccgcgacc caaagaagac tctccacata tccagctaat ccgtcgatcc gacacatcat 2581 cgtatctttt aagaaaaaaa // LOCUS YSCRP026A 2133 bp ds-DNA PLN 28-JUN-1990 DEFINITION S.cerevisiae RNA polymerase II sixth subunit (RP026) gene, complete cds. ACCESSION M33924 KEYWORDS RNA polymerase II; oncogene; transmembrane protein; tyrosine kinase. SOURCE S.cerevisiae single-copy plasmid DNA, clone 2. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2133) AUTHORS Archambault,J., Schappert,K.T. and Friesen,J.D. TITLE A suppressor of an RNA polymerase II mutant of Saccharomyces cerevisiae encodes a subunit common to RNA polymerase I, II, and III JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Friesen, 01-MAY-1990. The Hospital for Sick Children 555 University Avenue Toronto, Ontario, CANADA M5G 1X8 e-mail: jim@sickkids.toronto.edu FEATURES from to/span description pept 1026 1045 RNA polymerase II sixth subunit (RP026), exon 1 1122 1569 RNA polymerase II sixth subunit (RP026), exon 2 IVS 1046 1121 RP026 intron A BASE COUNT 644 a 374 c 408 g 707 t ORIGIN Chromosome XVI. 1 tctagaaaaa gacatatcag gacatatatg acacttccaa tttttggtta ctagtgagtc 61 gtcatgaata atcatgtgca tttgtaaacc attttccccc acacatggtt tgctacaaat 121 aggacatttt aatttaggat gatcattttt tatatgcgat tgcaattgtg accatatccg 181 aaactctttg caacaaccag caaaagtaca ttggtaagga ttttctacct caggatcgtg 241 atgtttagaa atgtgatttc ttagcctata tggcctctga aagcttttat tgcagtgtgg 301 acaggttagt ttatgtaaat gaacagataa aatatgtgcc cttaattgtg gatgcttgta 361 gaatcggagg ttgcatcctt cctctggaca aatgaaagat ttggtatgcg ttacttcgtg 421 tcgcttcagt tgctggcgag tcgtcactcc ttttccacaa taagaacatt ggaatggttt 481 cgtatcagaa tgcgtataca agtgtctctc taagtgactc tttttaacga aggattttgc 541 acactatcac actgaaatgc tcttaaaccc tgatgtacgc ttaattggtg ttcagtcaaa 601 attgaaggtc ttgtaaatgc ctatcacagc atcatagtca cagaaatatg tctttggcct 661 attgctcgat gagcttctag ttgatgttaa actgttcaat gattctgaac tctctgatcg 721 tgatatggga atggtctcct gtttaagttc ggccagtggc attccttcat tatttagaac 781 ctctcctccc atttgacagc gatttcaaca gttactactg tgcttaatgc ccttatttga 841 tagttagttc ttcttataat aaataatcat tgtatattga ttattcgtat agtatgatat 901 ttaattgaaa aaaaaatttt ttttttttta agaatatcat tcaaaggatc aatcacaacc 961 ttgaagaaag gctaaaaaga cacattttgc aggtaacagt gtaaagatta aggctacaag 1021 tgatcatgtc agactacgag gaggcgtatg tgcattcata ttggagaaat tgcgtgcttt 1081 agttttaatt atatttacta acttctatca ttgtgataca ggtttaacga cggtaatgag 1141 aacttcgaag attttgatgt agagcatttt tctgatgagg agacttatga ggaaaaacct 1201 caattcaagg atggtgaaac aaccgatgcc aacggtaaga ccatcgttac tggtggtaat 1261 ggcccagaag attttcaaca gcatgagcaa ataagaagaa agacacttaa ggaaaaggcc 1321 atcccaaaag accaaagagc aactactcca tacatgacca aatatgaaag ggcaagaatt 1381 ttaggtacaa gagccctaca aatttccatg aatgcgccag tttttgtaga tttagaaggt 1441 gaaactgatc cattgcgtat tgccatgaag gagttggctg aaaagaaaat tcctttggtt 1501 attagaagat atttaccaga tggttccttt gaggactgga gtgtggagga actcattgtg 1561 gatttgtgat tacacctgca ttttccttta tgtgtatttg ctttgtttgt atattattat 1621 aactataaag aaccgattta ttttgtaatg caaataactg gacgagagaa ccacgctata 1681 acctatttgg taataaaaat gtttgtatga ataaaaaagc tgctgactgc tttctttgat 1741 atgtgtatat agaatattta tcggtgattg atctaagaaa agcttacctc tgaaagttct 1801 ctcttcgtcg cttgctgatt tgtagtaaac aatttgaaaa gtttagcaaa ttcttcctca 1861 ggattttcaa aaccggcttc ctttaaagat caatgacctc attcaaggga acgtttaaat 1921 cgtgtcctct tccaatggct tttaaacttt cttccaaatt cttcacgctc ggggaaactg 1981 cgataagttc ttccccatga tagataagaa tatgggaaag cttacacctt cttctccaac 2041 ttccgcggta ctggtgtcat tgtcaggtac cattttggac cattcctcat ctgtcaatgt 2101 cttccccaat gttgcatata tcttggtaag atc // LOCUS MUS25OAS 1412 bp ss-mRNA ROD 28-JUN-1990 DEFINITION Mouse 2'-5' oligo A synthetase mRNA, complete cds. ACCESSION M33863 KEYWORDS 2'-5' oligo A synthetase. SOURCE Mouse cell line L-929, cDNA to mRNA, clone L3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1412) AUTHORS Coccia,E.M., Nissim,A. and Chebat,J. TITLE A full-length murine 2-5A synthetase cDNA transfected in NIH-3T3 cells impairs EMCV but not VSV replication JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.M.Coccia, 26-APR-1990. FEATURES from to/span description pept 36 1139 2'-5' oligo A synthetase signal 1389 1394 poly-A signal BASE COUNT 340 a 371 c 390 g 311 t ORIGIN 1 ccaggctggg agacccagga agctccagac ttagcatgga gcacggactc aggagcatcc 61 cagcctggac gctggacaag ttcatagagg attacctcct tcccgacacc acctttggtg 121 ctgatgtcaa atcagccgtc aatgtcgtgt gtgatttcct gaaggagaga tgcttccaag 181 gtgctgccca cccagtgagg gtctccaagg tggtgaaggg tggctcctca ggcaaaggca 241 ccacactcaa gggcaggtca gacgctgacc tggtggtgtt ccttaacaat ctcaccagct 301 ttgaggatca gttaaaccga cggggagagt tcatcaagga aattaagaaa cagctgtacg 361 aggttcagca tgagagacgt tttagagtca agtttgaggt ccagagttca tggtggccca 421 acgcccggtc tctgagcttc aagctgagcg ccccccatct gcatcaggag gtggagtttg 481 atgtgctgcc agcctttgat gtcctgggtc atgttaatac ttccagcaag cctgatccca 541 gaatctatgc catcctcatc gaggaatgta cctccctggg gaaggatggc gagttctcta 601 cctgcttcac ggagctccag cggaacttcc tgaagcagcg cccaaccaag ctgaagagtc 661 tcatccgcct ggtcaagcac tggtaccaac tgtgtaagga gaagctgggg aagccattgc 721 ctccacagta cgccctagag ttgctcactg tctttgcctg ggaacaaggg aatggatgtt 781 atgagttcaa cacagcccag ggcttccgga ccgtcttgga actggtcatc aattatcagc 841 atcttcgaat ctactggaca aagtattatg actttcaaca ccaggaggtc tccaaatacc 901 tgcacagaca gctcagaaaa gccaggcctg tgatcctgga cccagctgac ccaacaggga 961 atgtggccgg tgggaaccca gagggctgga ggcggttggc tgaagaggct gatgtgtggc 1021 tatggtaccc atgttttatt aaaaaggatg gttcccgagt gagctcctgg gatgtgccga 1081 cggtggttcc tgtacctttt gagcaggtag aagagaactg gacatgtatc ctgctgtgag 1141 cacagcagca cctgcccagg agactgctgg tcaggggcat ttgctgctct gctgcaggcc 1201 catgacccag tgagggaggg ccccacctgg catcagactc cgtgcttctg atgcctgcca 1261 gccatgtttg actcctgtcc aatcacagcc agccttcctc aacagattca gaaggagagg 1321 aaagaacaca cgcttggtgt ccatctgtcc acctgttgga aggttctgtc tgacaaagtc 1381 tgatcaacaa taaaccacag caggtgccgt ca // LOCUS PSEHUTR 1604 bp ds-DNA BCT 28-JUN-1990 DEFINITION P.putida histidine utilization genes repressor protein (hut) gene, complete cds. ACCESSION M33922 KEYWORDS histidine utilization genes repressor protein. SOURCE P.putida (ATCC 12633) DNA. ORGANISM Pseudomonas putida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1604) AUTHORS Allison,S.L. and Phillips,A.T. TITLE Nucleotide sequence of the gene encoding the repressor for the histidine utilization genes of Pseudomonas putida JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.T.Phillips, 26-APR-1990. FEATURES from to/span description pept 167 913 histidine utilization genes repressor protein (hut) pept 910 1482 protein of unknown function binding 46 65 hut binding site (put.) binding 894 898 ribosomal binding site (put.) site 82 95 operator site signal 94 122 promoter (put.) signal 1516 1543 transcription termination signal BASE COUNT 295 a 484 c 513 g 312 t ORIGIN 1 ggacatggct ggcccagccc gtaggcaaca gagcgcgttc ggcgaagtag gcggacatcg 61 gtcaaatcct gttattgtta acttgtatat acatatacag gcgtttgcct gccgggtaaa 121 ctgcggcaag ctaccgttca ttccctatgc acaaggatcc aacgccgtgc cgacacctcc 181 tgtctccgcg ctggttgccc agatgggcga gggcccggcg ccgctgtatg cccgggtcaa 241 acagatgatc atccagcaga tcgacaacgg cagctggccg ccgcatcacc gggtcccctc 301 ggagagtgaa ctggtcaacg agctaggctt cagccgcatg accatcaacc gtgccctgcg 361 cgaactcacg gccgacggcc tgctggtgcg catgcagggg gtcggcacgt tcgtagccga 421 gccaaagggc cgttcggcgt tgttcgaagt caacaacatt gccgatgaaa ttgccgcgcg 481 cggccatcag catagctgcc aggtgatcac gctcaccgag gaagcagccg gttccgaacg 541 ggccctggcc ctggacatgc gtgaaggcca gcgggtgttc cactcgctga tcgtgcattt 601 cgagaacggc gtgccggtgc agatcgagga ccgctacgtc aacgccgcga tcgcacccga 661 ctacctcaag caggatttca cccggcagac gccatatgcc tacctgtccc aggtagcgcc 721 gctgaccgag ggtgagcacg tggtcgaagc catcctggcc gagccggaag aatgccgcct 781 gctgcagatc gagcggggcg aaccttgcct gctgatccgc cgtcgtactt ggtccggccg 841 ccagccggta accgcggcgc ggctgatcca ccccggttcc cgtcatcgcc tggaaggacg 901 tttcagcaaa tgagccagct gcagttgttg cgcgcacagg attacccgcg catgccgtgg 961 aagaacggtg gcggtttcac cgaagagatc acccgcgaca gtggagaggg cctggacggc 1021 tttggctggc gcctgtcgat tgccgatatc gaagagtctg gcggcttttc caccttcgcc 1081 ggttaccagc ggatcatcac cgtgctgcag ggcgatggca tgcgcctgtt ggtcgatggc 1141 cagcccagcc ggccgttgct gccgttcgat gcctttgcct tcagcggcga aagccaggtc 1201 agctgcaagc tgctgggtgg ggcgatccgc gatttcaacc tgatctatgc accgcaacgg 1261 taccgggcga ggttgcagtg gtttgatggc acgagccgtt tgtacagctc ggcgtcgaca 1321 gtgctgttgt ttgctgccag cagtcacgtg gaagtgtcca tggcggggcg tgaggtgcag 1381 cggttggggt tgtatgactg cctgcggctg gagggcaacg atgagttgct tgggctggaa 1441 gttcaggggc ggttttgctt gattgagctc atttctcgct gatgggcttg gcgatacatt 1501 ttcatcgcct gtgagatcga gcgccgcgcg ggcggcgctc gatttgcgcg ccgccgcaaa 1561 actcaagccg gaccgacgct cgcttcaccc ccccaaaaaa aatc // LOCUS MUSCR2 427 bp ss-mRNA ROD 28-JUN-1990 DEFINITION M.musculus complement receptor type 2 (CR2) mRNA, 3' end. ACCESSION M35685 J04153 KEYWORDS complement receptor type 2. SOURCE M.musculus (strains BALB/c, DBA/2, CBA/J and C57B10.D2/OSN C5-deficient) and Swiss nude (strain nu/nu), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 427) AUTHORS Fingeroth,J.D., Benedict,M.A., Levy,D.N. and Strominger,J.L. TITLE Identification of murine complement receptor type 2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 242-246 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Fingeroth, 13-JAN-1989. FEATURES from to/span description pept < 1 285 complement receptor type 2 BASE COUNT 148 a 95 c 76 g 108 t ORIGIN Chromosome 1. 1 tgtttacaac cgaagatcaa aagagggcaa atattatcta ttttgaaaga tagttattca 61 tataatgaca ctgtggcatt ttcttgtgaa cctggcttca ccttgaaggg caacaggagc 121 attcgatgca atgctcatgg cacatcggag ccaccggtac cagtgtgtga aaaaggaaaa 181 cacagagaaa gcaattatta tacaaagaca agacccaaag aaggagctct tcatttagaa 241 acacgagaag tatattctat tgatccatat aacccagcaa gctgatgaca tgacaaatca 301 agatgtagaa ctctcagcta cctcttcagc accatatctg cttacatgcc accaagctac 361 cctccacgac aataatggac taaacctctg atttgtaagc cagccccaat taaatgtttt 421 tctctat // LOCUS RABCYP2C16 2006 bp ss-mRNA MAM 28-JUN-1990 DEFINITION Rabbit cytochrome P450IIC16 (CYP2C16) mRNA, complete cds. ACCESSION M29968 KEYWORDS cytochrome P450; monooxygenase. SOURCE Rabbit (strain New Zealand White) adult liver, cDNA to mRNA. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 2006) AUTHORS Hassett,C. and Omiecinski,C.J. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Hassett, 20-NOV-1989. Author Address [1]: C.Hasset University of Washington Department of Environmental Health SC-34 Seattle, WA 98195 FEATURES from to/span description pept 66 1529 cytochrome P450IIC16 (CYP2C16) mRNA < 1 2006 CYP2C16 mRNA signal 1985 1989 poly-A signal BASE COUNT 524 a 474 c 472 g 536 t ORIGIN 1 cggcatcggt accaaataag atagacagtg ctactcagaa atccaagaaa atggtggaag 61 aaataatgga tccagttgtg gtcctggtgt tgggtctctg ctgtttgctt ctcctttcac 121 actggaagca aaattccggg agggggaagc tccctcccgg ccccactcct ttccccatta 181 ttggaaatat tctccagata gatgctaagg acatcagcaa atccctaact aagttctcag 241 aacgctatgg ccccgtgttc actgtgtatc tgggcatgaa gcccgctgta gtgctgcatg 301 gataccaggc agtgaaggag gccctggttg atcttggaga ggagtttgct ggaagaggca 361 gttttcctat gcttgataaa gttagtaagg gactcggaat cgttttcacc aatggaaaga 421 gatggaaaga gatccggcgc ttctcgctca tgaccctgcg gaatttcggg atggggaaga 481 ggagcattga ggaccgagtt caagaggagg cccgctgcct ggtggaggag ctgagaaaaa 541 ccaacgcctc accctgtgat cccaccttta tcctgggctg tgctccctgc aatgtgatct 601 gctccattat tttccataat cgctttgatt ataaagatga ggagtttctt aaactattgg 661 aaaaattcaa tgaaaatgtt aggattctga gttctccatg gttgcaggtc tgcaataatt 721 tccctgctct tattgattac ttaccaggaa gtcataagac cttactaaag aattctgatt 781 atgtgaaaaa ttttattatg gagaaagtga aggaacacca aaaattcctg gatgttaaca 841 atcctcggga ctttatagat tgtttcttga tcaaaatgga gcaggaaaac catttggagt 901 tcactcttga aagcttggta accactgtgt ttgatttgtt tggagctggg actgagacaa 961 cgagcacaac gctgagatac tccctcctgc tcctgctgaa gcaccccgag gtcgcagata 1021 aagtgcagga ggagattgag cgtgtgattg gcaggcaccg gagcccctgc atgcaggaca 1081 ggagccgcat gccttacaca gatgccgtaa tacatgagat ccagagattc attgacctgg 1141 tccccaataa tctgccccac acagtgaccc gtgacattaa attcagaaac tactttatcc 1201 ccaagggtac ggacatcatg acatcactga catccgtgct acatgatgaa aaagcatttc 1261 ctaacccaaa ggtatttgac cctggacact ttctggatga gagtggcaac ttcaagaaga 1321 gtgactactt catgcctttc tcagcaggaa aacggatctg tgtgggagag gccctggccc 1381 gcatggagct gtttttgttc ctgacctcca ttttgcagaa ctttaaactg caatctctgg 1441 ttgagccaaa ggacctggac atcactgcag ttctcaatgg atttgtttct gtgccacctt 1501 cgttccagct ctgcttcatt cctgtttgaa aaggagcaga ctggcttcta ctgtgccatc 1561 atttcaaagg cattgcccat caccttactg catttgagac acttctttaa cttttctcac 1621 atcttactat tcccttaaga tctagtgaaa acctaacttc tgtgggtgat cccctgagac 1681 tgcctgccct gaccatgcaa gaggtagaga gggcatggca agccatgctc ctgggaggga 1741 ccccacagcc tggctgctgg caggtggcgg gacccaggca catttctctc cattcctgcc 1801 tgtcaggtaa actgctccta gctgtgtcca aagcccatca agaaagctac cgtaggctat 1861 gtgaccttca agatgattgt aggagcatat cagtaccaat attgcctcta tcctatagaa 1921 ttagtactgc cctgaattag ttacaccctt tctgcctgcc ctttagaaag tgtgcatgct 1981 cattaataaa gtggatgcat tcactg //
GenBank-Updates@genbank.bio.net (06/29/90)
LOCUS MUSTKBLK 2094 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Mouse B lymphocyte kinase (blk) mRNA, complete cds. ACCESSION M30903 KEYWORDS B lymphocyte kinase; tyrosine kinase. SOURCE Mouse (strain C57B410, haplotype b) B-lymphoid hybridoma cell line LB27.4 + 71.67, cDNA to mRNA, clones 102, 103, 201, 205, 215. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites) AUTHORS Dymecki,S.M., Niederhuber,J.E. and Desiderio,S.V. TITLE Specific expression of a tyrosine kinase gene, blk, in B lymphoid cells JOURNAL Science 247, 332-336 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2094) AUTHORS Dymecki,S.M., Niederhuber,J.E. and Desiderio,S.V. JOURNAL Unpublished (1990) Howard Hughes Med. Inst., Baltimore MD 21205 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by S.V.Desiderio, 21-DEC-1989. FEATURES from to/span description pept 350 1849 B lymphocyte kinase BASE COUNT 485 a 542 c 607 g 460 t ORIGIN 1 cagcctggct gaaggtcagg tgcccatgaa acttctcaga aacacctccg gggctgagga 61 gccagaagtc tccagaagac ttgactggag ggcagagctc ccactgagct gcttcttagg 121 gctgttcatg gcggtctgat cacaacaggg ggcacacaca cctctgtgtg ctgttggcag 181 gaagccacaa gccactgaag ctgactgaga tgagaagaat ccatctgggg atgggattgg 241 cttttcttaa ctgggtccag agcttctggt tgccgctaga aagttcttca accacacaga 301 ccctgggatc tgcacagcta tgttgagaag ctactgaagc cctgcaagga tggggctgct 361 gagcagcaag aggcaggtca gtgagaaggg caagggctgg agccccgtga agatccgcac 421 ccaggacaag gctcccccac ccctgccacc cctggttgtc ttcaaccacc ttgccccacc 481 atctcctaac caggacccag atgaagagga gcgttttgtg gtggctctgt ttgactatgc 541 cgctgtgaat gacagggacc ttcaggtgct gaagggtgag aagctccagg tcttgaggag 601 cactggagac tggtggttgg ccaggtcact cgtcacagga agagaaggtt atgtgcccag 661 caactttgtg gccccagtag agactctgga agtagaaaaa tggttcttca ggaccatcag 721 ccggaaggat gctgagaggc agttgctggc tccgatgaac aaggccggct cctttctcat 781 cagagagagt gagagcaata aaggtgcctt ttccctgtcc gtgaaagata tcaccaccca 841 gggggaggtg gtcaagcact ataagatccg atcactggac aatggaggct attacatctc 901 cccccggatc acctttccca ccctccaggc cctggtgcag cactattcaa agaaagggga 961 tggtttgtgt cagaagttga ctctgccctg tgtgaacctg gccccgaaga acctttgggc 1021 ccaagatgaa tgggaaatcc ccaggcagtc tctcaagttg gtccggaaac ttgggtctgg 1081 gcagtttggc gaagtctgga tgggttatta caaaaataac atgaaggtgg ccatcaagac 1141 cctgaaggag ggaaccatgt caccggaagc tttcctgggc gaggccaacg tgatgaaaac 1201 cctgcagcat gagaggctgg ttcgtctcta cgctgtggtc accagagagc ccatttacat 1261 cgtcactgaa tacatggcca gaggatgctt gctggatttt ctgaagaccg atgaaggtag 1321 caggttgtcc cttccaaggc tgattgacat gtcagcccag gttgcagagg ggatggctta 1381 catagagcgc atgaattcca tccaccgtga cctgcgggca gccaacatcc tggtgtctga 1441 gacgttgtgc tgcaaaatcg ctgacttcgg cttggccagg atcattgaca gtgaatacac 1501 tgcccaagag ggggccaagt tccccatcaa gtggaccgcc ccggaggcca tccacttcgg 1561 ggtgtttacc atcaaggctg atgtgtggtc cttcggagtc ttgctgatgg tgattgtcac 1621 ctatgggcgc gttccctacc caggaatgag caaccctgag gtcatccgta gcctggagca 1681 cggctaccga atgccatgcc cggagacatg tccaccggag ttgtacaatg atatcatcac 1741 tgagtgctgg cggggccggc cagaggagcg gcctaccttt gagttcctgc agtcggtgtt 1801 ggaggacttc tacacagcca cggagggcca atatgagctg cagccctagc aaccaatggt 1861 ctcttcaggg ctcccagcct gagcctcctt ctggactctg actttgggtt aatatgttca 1921 cacttagctc ttcttccagg taggttgcag aatcagggaa gaatggtgtg cctgcttctc 1981 aagtgaggga gccaggagac aaagatggct tcatggtgag gcatgagctc tcaaggcaca 2041 acctcggtcc acctttcact ggctgtgtgc ctttggccaa gttactttgc tgcg // LOCUS CHKSUH2AC 75 bp ds-DNA SYN 29-JUN-1990 DEFINITION Chicken and sea urchin chimeric TATA-H2A-c gene. ACCESSION J02525 KEYWORDS conalbumin; histone. SOURCE Chicken and sea urchin DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 75) AUTHORS Grosschedl,R., Wasylyk,B., Chambon,P. and Birnstiel,M.L. TITLE Point mutation in the TATA box curtails expression of sea urchin H2A histone gene in vivo JOURNAL Nature 294, 178-180 (1981) STANDARD full staff_review COMMENT The TATA-H2A-c gene was formed by replacing the "TATA box" of sea urchin H2A with that of chicken conalbumin. A single base change from 't' to 'g' in the third position of the "TATA box" reduces the rate of transcription by 80%. FEATURES from to/span description mRNA 68 > 75 H2A-c mRNA recomb 23 24 sea urchin DNA/BamHI linker recomb 25 26 BamHI linker/chicken DNA mut 41 41 t in wild type; g in "TAGA" mutation recomb 60 61 chicken DNA/sea urchin DNA BASE COUNT 20 a 21 c 21 g 13 t ORIGIN 68 bp upstream of H2A-c mRNA start 1 tcgctgaccg gtctctccga tccggagggc tgctcctcta taaaagggga agaaagagga 61 accattcaag ccagc // LOCUS CPVW8 70 bp ds-DNA VRL 29-JUN-1990 DEFINITION CPV-BR white pock mutant recombination junction, variant W8. ACCESSION M11471 KEYWORDS recombination joint. SOURCE Cow pox virus (strain Brighton red; CPV-BR) DNA, passed in chorioallantoic membrane (CAM) of 11-day old chicken embryo. ORGANISM Cowpox virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 70) AUTHORS Pickup,D.J., Ink,B.S., Parsons,B.L., Hu,W. and Joklik,W.K. TITLE Spontaneous deletions and duplications of sequences in the genome of cowpox virus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 6817-6821 (1984) STANDARD full staff_review COMMENT In the white pock variants of cowpox virus, 32 to 38 kb of the 3' end (region 2) were replaced by inverted copies, 21 to 50 kb long, of the 5' end (region 1) of the genome. These variants possess inverted terminal repeats. The limits of the inner ends of the deletions appear to be defined by the location of the nearest essential gene on one end and the location of the gene that encodes pock-redness on the other. The sequences adjacent to the recombination/transfer sites show no homology to each other or any unusual structural feature. At the recombination site of variant W8 one base pair "t" is shared by the two combining sequences. FEATURES from to/span description recomb 36 37 region 1 end/region 2 start (crossover event) BASE COUNT 19 a 11 c 14 g 26 t ORIGIN Undetermined number of bp downstream of EcoRI site. 1 ctttttattg agtggtggta gttacggata tctaatttat ccatccagta tgggtataca 61 acacgaattc // LOCUS DRORGM108 1059 bp ds-DNA INV 29-JUN-1990 DEFINITION D.melanogaster 28S rRNA gene with 1.006 kb type I insertion. ACCESSION K03141 KEYWORDS insertion sequence. SOURCE D.melanogaster ribosomal DNA, clone MB27. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1059) AUTHORS Roiha,H. and Glover,D.M. TITLE Duplicated rDNA sequences of variable lengths flanking the short type I insertions in the rDNA of Drosophila melanogaster JOURNAL Nucleic Acids Res. 9, 5521-5532 (1981) STANDARD simple staff_review COMMENT The rRNA genes of D.melanogaster can contain two types of non-homologous insertion sequences, which are unlike intervening sequences found in other genes. These insertion sequences appear to have a repressive effect on the transcription of rRNA genes. FEATURES from to/span description rpt 24 30 rRNA duplication rpt 1037 1043 28S rRNA duplication recomb 30 31 28S rRNA end/insertion sequence start recomb 1036 1037 insertion sequence end/28S rRNA start BASE COUNT 237 a 244 c 311 g 265 t 2 others ORIGIN 164 bp upstream of BstI site. 1 gaatggatta acgagattcc tactgtccct gtcttagctg ggagcagagg aagactcgcc 61 tagaggagtg tttaatccag agttggcaga acagatggga cgatgacagc gaaccaggac 121 gggtgacgca taggtttatc ccatacgtta gtcttgccta tcgggatcca agttttggat 181 tctcgatgag gacgtctttc ctgcttacag ggcacgggtc gttcaatgca tttttgcacg 241 ggagagccct cagcgatacc actgcttgcg catgtggcga tccatatgag gactggatgc 301 atatcttgtg cgcttgcccc ctatatgcag atctgcggga cctagatgga cttggagtgc 361 agcgccttgg cgaaactgga tcttcgaggg aatcctcgat gatcaagaga agactcaacg 421 gctggcaatg tttgcggaag aagtgttcct gaggaggagg ggcgtttagc tcagtcatct 481 ctcgcgtgtg gtttagcggg cgagaatact accacagttc cgctgttgct tgtcgtaaga 541 gacgactaat acagcgatag gattcctcta accctgcttg tcggagcaaa agggggaggc 601 ccaccgagcc tcttttcggt accacgggtt gagcagctat ccaagactgc tcattgaggt 661 aggccccctg gtgggagtat cgtggtggct gtggttggta cccatatcgc gggtagagcc 721 ttcatgctcg acgtttgagt tacggtgcta gttgcgcaaa actcgggtgc tgtgacccag 781 agatcagtag agattttagg tagatctcgc tcctcagcaa gggggagtgc ttgcccggca 841 agcangntac tcgaattgct accggggtgg tcgctatgta catagctata gcttctagtc 901 cgggacgctt gtctggcgta tccagacaca tgcaccatat gctcacttgt gggtgtatag 961 ggtgccgtgg ttgtaatccc ttcagtgtgg aacacgccac gtaaaataag ttcggaggga 1021 tccgaaaagc atacattgtc cctatctact atctagcaa // LOCUS ECOLACIS 113 bp ds-DNA BCT 29-JUN-1990 DEFINITION E.coli lacZ and IS1T insertion junction. ACCESSION K03504 KEYWORDS insertion element IS1T; lacZ/IS1 fusion protein. SOURCE E.coli K12 DNA, clone pCSH11. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 113) AUTHORS Malamy,M.H., Rahaim,P.T., Hoffman,C.S., Baghdoyan,D., O'Connor,M.B. and Miller,J. TITLE A frameshift mutation at the junction of an IS1 insertion within lacZ restores beta-galactosidase activity via formation of an active lacZ-IS1 fusion protein JOURNAL J. Mol. Biol. 181, 551-555 (1985) STANDARD full staff_review COMMENT The site of integration of IS1 into lacZ is found 18 nucleotides from the end of the sequence determining the C-terminus of beta-galactosidase. Reversion to lac+, promoted by ICR191, results from the loss of a "g" from the "ggg" sequence located at the junction of lacZ and IS1. As a result an active but temperature-sensitive lacZ/IS1 fusion protein is formed containing six amino acids derived from IS1 which replace six amino acids encoded by lacZ FEATURES from to/span description pept < 1 57 lacZ/IS1 fusion protein (AA at 1) recomb 37 38 E.coli DNA end/IS1 DNA start BASE COUNT 24 a 25 c 28 g 36 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccagc tgagcgccgg tcgctaccat taccaggtaa tgctgccaac ttactgattt 61 agtgtatgat ggtgatttta aggtgcttgc gtggcttcca tttccatcag atg // LOCUS ECOOMPASA 70 bp ds-DNA SYN 29-JUN-1990 DEFINITION E.coli outer membrane protein ompA gene-plasmid pBR322 junction (synthetic), clone pOMPA21. ACCESSION J02534 KEYWORDS artificial gene; membrane protein; outer membrane protein. SOURCE E.coli and plasmid pBR322 DNA, clone pOMPA21. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 70) AUTHORS Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E., Kurz,C. and Schaller,H. TITLE Export of a protein into the outer membrane of Escherichia coli K12: Stable incorporation of the OmpA protein requires less than 193 amino-terminal amino-acid residues JOURNAL Eur. J. Biochem. 122, 223-231 (1982) STANDARD full staff_review COMMENT [1] explored to what extent ompA gene can be shortened without impairing outer membrane incorporation of resulting protein fragments, and found that more than 133 NH2-terminal residues of the mature OmpA protein suffice but are unstable, while 193 such residues are stable. FEATURES from to/span description pept < 1 70 OmpA' protein (modified OmpA; aa at 2) recomb 13 14 E.coli end/pBR322 start BASE COUNT 15 a 17 c 24 g 14 t ORIGIN 13 bp upstream of BamHI site 1 gctgagcaac ctggatccac aggacgggtg tggtcgccat gatcgcgtag tcgatagtgg 61 ctccaagtag // LOCUS ECOOMPASB 66 bp ds-DNA SYN 29-JUN-1990 DEFINITION E.coli outer membrane protein ompA gene / AD3 insert / plasmid pBR322 synthetic construct, clone pOMPA21-11. ACCESSION J02532 KEYWORDS artificial gene; membrane protein; outer membrane protein. SOURCE E.coli, AD3 and plasmid pBR322 DNA, clone pOMPA21-11. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E., Kurz,C. and Schaller,H. TITLE Export of a protein into the outer membrane of Escherichia coli K12: Stable incorporation of the OmpA protein requires less than 193 amino-terminal amino-acid residues JOURNAL Eur. J. Biochem. 122, 223-231 (1982) STANDARD full staff_review COMMENT See comments for first entry in this series. AD3 is a derivative of a synthetic lac operator. FEATURES from to/span description pept < 1 42 OmpA' (modified OmpA protein) recomb 13 14 E.coli end/AD3 insert start recomb 60 61 AD3 insert end/pBR322 start BASE COUNT 19 a 14 c 17 g 16 t ORIGIN 13 bp upstream of BamHI site 1 gcgatcactc ctggatccgg aattcaaatt gtgagcggat aacaatttga attccggatc 61 gcgaag // LOCUS ECOOMPASC 66 bp ds-DNA SYN 29-JUN-1990 DEFINITION E.coli outer membrane protein ompA gene / AD3 insert / plasmid pBR322 synthetic construct, clone pOMPA21-16. ACCESSION J02533 KEYWORDS artificial gene; membrane protein; outer membrane protein. SOURCE E. coli, AD3 and plasmid pBR322 DNA, clone pOMPA21-16. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Bremer,E., Cole,S.T., Hindennach,I., Henning,U., Beck,E., Kurz,C. and Schaller,H. TITLE Export of a protein into the outer membrane of Escherichia coli K12: Stable incorporation of the OmpA protein requires less than 193 amino-terminal amino-acid residues JOURNAL Eur. J. Biochem. 122, 223-231 (1982) STANDARD full staff_review COMMENT See comments for first entry in this series. AD3 is a derivative of a synthetic lac operator. FEATURES from to/span description pept < 1 34 OmpA' (modified OmpA; aa at 2) recomb 13 14 E.coli end/AD3 insert start recomb 60 61 AD3 insert end/pBR322 start BASE COUNT 22 a 14 c 15 g 15 t ORIGIN 20 bp upstream of EcoRI site 1 acagaccaag cacgatccgg aattcaaatt gtgagcggat aacaatttga attccggatc 61 tgatcg // LOCUS HUMIGLAM2 872 bp ss-mRNA PRI 29-JUN-1990 DEFINITION Human Ig rearranged lambda-chain mRNA VJC-region subgroup lambda-IV from heterohybridoma H6-3C4. ACCESSION M18645 KEYWORDS immunoglobulin; immunoglobulin light chain; lambda-immunoglobulin; variable region; variable region subgroup lambda-IV. SOURCE Human hybridoma H6-3C4 (cell fusion of mouse myeloma NS1 cells and human peripheral blood lymphocytes), cDNA to mRNA, clone p3C4-lambda-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 872) AUTHORS Yamasaki,N., Komori,S. and Watanabe,T. TITLE Complementary DNA for a human subgroup IV immunoglobulin lambda-chain JOURNAL Mol. Immunol. 24, 981-985 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 31 732 Ig VJC-region recomb 90 91 V-region start recomb 375 376 V-region end/J-region start recomb 412 413 J-region end/C-region start recomb 729 730 C-region end BASE COUNT 211 a 269 c 220 g 172 t ORIGIN 254 bp upstream of BamHI site. 1 tcagaagcag agttctgggg tgtctccacc atggcctgga cccctctctg gctcactctc 61 ttcactcttt gcataggttc tgtggtttct tctgaactga ctcaggaccc tgttgtgtct 121 gtggccttgg gacagacagt caggatcaca tgccaaggag acagcctcag aacctatcat 181 gcaagctggt accagcagaa gccaagacag gcccctgtac ttgtcatcta tgatgaaaac 241 aaccggccct cagggatccc agaccgattc tctggctcca cctcaggaaa cacagcttcc 301 ttgaccatca ctggggctca ggcggaagat gaggctgact attactgtaa ctcccgggac 361 agcagtggta accgtctggt attcggcgga gggaccaagc tgaccgtcct aggtcagccc 421 aaggctgccc cctcggtcac tctgttcccg ccctcctctg aggagcttca agccaacaag 481 gccacactgg tgtgtctcat aagtgacttc tacccgggag ccgtgacagt ggcctggaag 541 ggagatagca gccccgtcaa ggcgggagtg gagaccacca caccctccaa acaaagcaac 601 aacaagtacg cggccagcag ctatctgagc ctgacgcctg agcagtggaa gtcccacaga 661 agctacagct gccaggtcac gcatgaaggg agcaccgtgg agaagacagt ggcccctaca 721 gaatgttcat aggttctcaa ccctcacccc caccacggga gactagagct gcaggatccc 781 aggggagggg tctctcctcc caccccaagg catcaagccc ttctccctgc actcaataaa 841 ccctcaataa atattctcat tgtcaaggaa tt // LOCUS HUMTCAXU 401 bp ss-mRNA PRI 29-JUN-1990 DEFINITION Human T-cell receptor active alpha-chain mRNA, clone HAP36 (V-alpha-9.1 family; J-alpha-H family). ACCESSION M13737 KEYWORDS T-cell receptor; T-cell receptor alpha-chain. SOURCE Human adult PHA-stimulated peripheral blood T lymphocyte, cDNA to mRNA, clone HAP36. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 401) AUTHORS Yoshikai,Y., Kimura,N., Toyonaga,B. and Mak,T.W. TITLE Sequences and repertoire of human T-cell receptor alpha chain variable region genes in mature T lymphocytes JOURNAL J. Exp. Med. 164, 90-103 (1986) STANDARD simple staff_review FEATURES from to/span description ORF < 1 > 401 T-cell receptor active alpha-chain precursor recomb 321 322 V region end/D-J region start recomb 387 388 D-J region end/C region start BASE COUNT 115 a 102 c 87 g 97 t ORIGIN 1 atgaagccca ccctcatctc agtgcttgtg ataatattta tactcagagg aacaagagcc 61 cagagagtga ctcagcccga gaagctcctc tctgtcttta aaggggcccc agtggagctg 121 aagtgcaact attcctattc tgggagtcct gaactcttct ggtatgtcca gtactccaga 181 caacgcctcc agttactctt gagacacatc tctagagaga gcatcaaagg cttcactgct 241 gaccttaaca aaggcgagac atctttccac ctgaagaaac catttgctca agaggaagac 301 tcagccatgt attactgtgc tctaagtgtt tataaccagg gaggaaagct tatcttcgga 361 cagggaacgg agttatctgt gaaacccaat atccagaacc c // LOCUS HUMTCAYR 406 bp ss-mRNA PRI 29-JUN-1990 DEFINITION Human T-cell receptor active alpha-chain V-region (V-J-C) mRNA, partial cds, clone AA25. ACCESSION M17657 J02992 KEYWORDS T-cell receptor alpha-chain; antigen receptor; constant region; joining region; processed gene; variable region. SOURCE Human peripheral blood lymphocyte, cDNA to mRNA, clone AA25. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 406) AUTHORS Klein,M.H., Concannon,P., Everett,M., Kim,L.D.H., Hunkapiller,T. and Hood,L. TITLE Diversity and structure of human T-cell receptor alpha-chain variable region genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6884-6888 (1987) STANDARD simple staff_review FEATURES from to/span description pept 1 > 406 T-cell receptor alpha-chain V-region (V-J-C) precursor /nomgen="TCRA" /map="14q11.2" /hgml_locus_uid="LX0123X" sigp 1 69 T-cell receptor alpha-chain signal peptide matp 70 > 403 T-cell receptor alpha-chain recomb 340 341 TCR V-region DNA end/TCR J-region DNA start recomb 400 401 TCR J-region DNA end/TCR C-region DNA start BASE COUNT 104 a 95 c 98 g 109 t ORIGIN Chromosome 14q11.2. 1 atgatgatat ccttgagagt tttactggtg atcctgtggc ttcagttaag ctgggtttgg 61 agccaacgga aggaggtgga gcaggatcct ggacccttca atgttccaga gggagccact 121 gtcgctttca actgtactta cagcaacagt gcttctcagt ctttcttctg gtacagacag 181 gattgcagga aagaacctaa gttgctgatg tccgtatact ccagtggtaa tgaagatgga 241 aggtttacag cacacgtcaa tagagccagc cagtatattt ccctgctcat cagagactcc 301 aagctcagtg attcagccac ctacctctgt gtggtgaaca ttcgcccagg aaacacacct 361 cttgtctttg gaaagggcac aagactttct gtgattccaa atatcc // LOCUS LAMCG 48502 bp ds-DNA circular PHG 29-JUN-1990 DEFINITION Bacteriophage lambda, complete genome. ACCESSION J02459 M17233 KEYWORDS DNA binding protein; circular; coat protein; complete genome; origin of replication; repressor; unidentified reading frame. SOURCE Lambda wild-type and lambda strain cI857s7. ORGANISM Bacteriophage lambda Viridae; ds-DNA nonenveloped viruses; Siphoviridae. REFERENCE 1 (bases 37224 to 37940) AUTHORS Sauer,R.T. TITLE DNA sequence of the bacteriophage lambda cI gene JOURNAL Nature 276, 301-302 (1978) STANDARD full staff_review REFERENCE 2 (bases 37990 to 38982) AUTHORS Schwarz,E., Scherer,G., Hobom,G. and Kossel,H. TITLE Nucleotide sequence of cro, cII and part of the O gene in phage lambda DNA JOURNAL Nature 272, 410-414 (1978) STANDARD full staff_review REFERENCE 3 (bases 34957 to 35615; r-strand) AUTHORS Franklin,N.C. and Bennett,G.N. TITLE The N protein of bacteriophage lambda, defined by its DNA sequence, is highly basic JOURNAL Gene 8, 107-119 (1979) STANDARD full staff_review REFERENCE 4 (bases 37768 to 40293) AUTHORS Schwarz,E., Scherer,G., Hobom,G. and Kossel,H. TITLE The primary structure of the phage lambda P gene completes the nucleotide sequence of the plasmid lambda-dvh93 JOURNAL Biochem. Int. 1, 386-394 (1980) STANDARD full staff_review REFERENCE 5 (bases 27724 to 29275) AUTHORS Hoess,R.H., Foeller,C., Bidwell,K. and Landy,A. TITLE Site-specific recombination functions of bacteriophage lambda: DNA sequence of regulatory regions and overlapping structural genes for int and xis JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 2482-2486 (1980) STANDARD full staff_review REFERENCE 6 (bases 43682 to 45218) AUTHORS Daniels,D.L. and Blattner,F.R. TITLE Nucleotide sequence of the Q gene and the Q to S intergenic region of bacteriophage lambda JOURNAL Virology 117, 81-92 (1982) STANDARD full staff_review REFERENCE 7 (bases 1 to 48502; fragments) AUTHORS Sanger,F., Coulson,A.R., Hong,G.F., Hill,D.F. and Petersen,G.B. TITLE Nucleotide sequence of bacteriophage lambda DNA JOURNAL J. Mol. Biol. 162, 729-773 (1982) STANDARD full staff_review REFERENCE 8 (review; complete genome) AUTHORS Daniels,D.L., Schroeder,J.L., Szybalski,W., Sanger,F. and Blattner,F.R. TITLE Appendix I: A molecular map of coliphage lambda JOURNAL (in) Hendrix,R.W., Roberts,J.W., Stahl,F.W. and Weisberg,R.A. (Eds.); Lambda II: 469-517; Cold Spring Harbor Laboratory, New York (1983) STANDARD full staff_review REFERENCE 9 (review; complete genome with annotation) AUTHORS Daniels,D.L., Schroeder,J.L., Szybalski,W., Sanger,F., Coulson,A.R., Hong,G.F., Hill,D.F., Petersen,G.B. and Blattner,F.R. TITLE Appendix II: Complete annotated lambda sequence JOURNAL (in) Hendrix,R.W., Roberts,J.W., Stahl,F.W. and Weisberg,R.A. (Eds.); Lambda II: 519-674; Cold Spring Harbor Laboratory, New York (1983) STANDARD full staff_review REFERENCE 10 (bases 45493 to 45963) AUTHORS Imada,M. and Tsugita,A. TITLE Amino acid sequence of lambda phage endolysin JOURNAL Nature New Biol. 233, 230-231 (1971) STANDARD full staff_review REFERENCE 11 (bases 44588 to 44780) AUTHORS Lebowitz,P., Weissman,S.M. and Radding,C.M. TITLE Nucleotide sequence of a ribonucleic acid transcribed in vitro from lambda phage deoxyribonucleic acid JOURNAL J. Biol. Chem. 246, 5120-5139 (1971) STANDARD full staff_review REFERENCE 12 (bases 1 to 12; both strands) AUTHORS Wu,R. and Taylor,E. TITLE Nucleotide sequence analysis of DNA. II. Complete nucleotide sequence of the cohesive ends of bacteriophage lambda DNA JOURNAL J. Mol. Biol. 57, 491-511 (1971) STANDARD full staff_review REFERENCE 13 (bases 38597 to 38672) AUTHORS Dahlberg,J.E. and Blattner,F.R. TITLE In vitro transcription products of lambda DNA: Nucleotide sequences and regulatory sites JOURNAL (in) Fox,C.F. and Robinson,W.S. (Eds.); Virus Research. Proceedings of 1973 ICN-UCLA Symposium: 533-544; Academic Press, New York (1973) STANDARD full staff_review REFERENCE 14 (sites; fragments at the 3'-terminus) AUTHORS Weigel,P.H., Englund,P.T., Murray,K. and Old,R.W. TITLE The 3'-terminal nucleotide sequences of bacteriophage lambda DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 70, 1151-1155 (1973) STANDARD full staff_review REFERENCE 15 (bases 35434 to 35618) AUTHORS Dahlberg,J.E. and Blattner,F.R. TITLE Sequence of the promoter-operator proximal region of the major leftward of bacteriophage lambda JOURNAL Nucleic Acids Res. 2, 1441-1458 (1975) STANDARD full staff_review REFERENCE 16 (bases 35583 to 35600) AUTHORS Kleid,D.G., Agarwal,K.L. and Khorana,H.G. TITLE The nucleotide sequence in the promoter region of the gene N in bacteriophage lambda JOURNAL J. Biol. Chem. 250, 5574-5582 (1975) STANDARD full staff_review REFERENCE 17 (bases 37945 to 38018) AUTHORS Maniatis,T., Jeffrey,A. and Kleid,D.G. TITLE Nucleotide sequence of the rightward operator of phage lambda JOURNAL Proc. Natl. Acad. Sci. U.S.A. 72, 1184-1188 (1975) STANDARD full staff_review REFERENCE 18 (bases 44588 to 44773) AUTHORS Sklar,J., Yot,P. and Weissman,S.M. TITLE Determination of genes, restriction sites, and DNA sequences surrounding the 6s template of bacteriophage lambda JOURNAL Proc. Natl. Acad. Sci. U.S.A. 72, 1817-1821 (1975) STANDARD full staff_review REFERENCE 19 (bases 37945 to 38027) AUTHORS Maniatis,T., Ptashne,M., Backman,K., Kleid,D., Flashman,S., Jeffrey,A. and Maurer,R. TITLE Recognition sequences of repressor and polymerase in the operators of bacteriophage lambda JOURNAL Cell 5, 109-113 (1975) STANDARD full staff_review REFERENCE 20 (bases 37946 to 38039) AUTHORS Smith,G.R., Eisen,H., Reichardt,L. and Hedgpeth,J. TITLE Deletions of lambda phage locating a p-rm mutation within the rightward operator JOURNAL Proc. Natl. Acad. Sci. U.S.A. 73, 712-716 (1976) STANDARD full staff_review REFERENCE 21 (bases 35578 to 35667; 37903 to 38027) AUTHORS Ptashne,M., Bachman,K., Humayun,M.Z., Jeffrey,A., Maurer,R., Meyer,B. and Sauer,R.T. TITLE Autoregulation and function of a repressor in bacteriophage lambda JOURNAL Science 194, 156-161 (1976) STANDARD full staff_review REFERENCE 22 (bases 37905 to 37989) AUTHORS Walz,A., Pirrotta,V. and Ineichen,K. TITLE Lambda repressor regulates the switch between p-r and p-rm promoters JOURNAL Nature 262, 665-669 (1976) STANDARD full staff_review REFERENCE 23 (bases 27617 to 27934) AUTHORS Landy,A. and Ross,W. TITLE Viral integration and excision: structure of the lambda att sites JOURNAL Science 197, 1147-1160 (1977) STANDARD full staff_review REFERENCE 24 (bases 39062 to 39170) AUTHORS Denniston-Thompson,K., Moore,D.D., Kruger,K.E., Furth,M.E. and Blattner,F.R. TITLE Physical structure of the replication origin of bacteriophage lambda JOURNAL Science 198, 1051-1056 (1977) STANDARD full staff_review REFERENCE 25 (bases 38610 to 38732) AUTHORS Scherer,G., Hobom,G. and Koessel,H. TITLE DNA base sequence of the p-o promoter region of phage lambda JOURNAL Nature 265, 117-121 (1977) STANDARD full staff_review REFERENCE 26 (bases 38041 to 38241) AUTHORS Roberts,T.M., Shimatake,H., Brady,C. and Rosenberg,M. TITLE Sequence of cro gene of bacteriophage lambda JOURNAL Nature 270, 274-275 (1977) STANDARD full staff_review REFERENCE 27 (bases 27616 to 28935) AUTHORS Davies,R.W., Schreier,P.H. and Buechel,D.E. TITLE Nucleotide sequence of the attachment site of coliphage lambda JOURNAL Nature 270, 757-760 (1977) STANDARD full staff_review REFERENCE 28 (bases 35578 to 35667) AUTHORS Humayun,Z., Jeffrey,A. and Ptashne,M. TITLE Completed DNA sequences and organization of repressor-binding sites in the operators of phage lambda JOURNAL J. Mol. Biol. 112, 265-277 (1977) STANDARD full staff_review REFERENCE 29 (bases 37206 to 37263; 37914 to 37970) AUTHORS Humayun,Z. TITLE DNA sequence at the end of the cI gene in bacteriophage lambda JOURNAL Nucleic Acids Res. 4, 2137-2143 (1977) STANDARD full staff_review REFERENCE 30 (bases 44467 to 44807) AUTHORS Sklar,J.L. TITLE Structure and function of two regions of DNA controlling the synthesis of prokaryotic RNAs JOURNAL Thesis (1977) Yale University, New Haven, CT, USA STANDARD full staff_review REFERENCE 31 (bases 38470 to 39189) AUTHORS Hobom,G., Grosschedl,R., Lusky,M., Scherer,G., Schwarz,E. and Koessel,H. TITLE Functional analysis of the replicator structure of lambdoid bacteriophage DNAs JOURNAL Cold Spring Harb. Symp. Quant. Biol. 43, 165-178 (1979) STANDARD full staff_review REFERENCE 32 (bases 37938 to 38016; 35589 to 35666; comp strand) AUTHORS Flashman,S.M. TITLE Mutational analysis of the operators of bacteriophage lambda JOURNAL Mol. Gen. Genet. 166, 61-73 (1978) STANDARD full staff_review REFERENCE 33 (bases 38212 to 38362) AUTHORS Rosenberg,M., Court,D., Shimatake,H., Brady,C. and Wulff,D.L. TITLE The relationship between function and DNA sequence in an intercistronic regulatory region in phage lambda JOURNAL Nature 272, 414-423 (1978) STANDARD full staff_review REFERENCE 34 (bases 29711 to 29811; 31043 to 31058) AUTHORS Davies,R.W., Schreier,P.H. and Buechel,D.E. TITLE Determination of the endpoints of partial deletion mutants of the attachment site of bacteriophage lambda by DNA sequencing JOURNAL Nucleic Acids Res. 5, 3209-3218 (1978) STANDARD full staff_review REFERENCE 35 (bases 21661 to 31129; fragments) AUTHORS Hoess,R.H. and Landy,A. TITLE Structure of the lambda att sites generated by int-dependent deletions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 5437-5441 (1978) STANDARD full staff_review REFERENCE 36 (bases 38453 to 38500) AUTHORS Sprague,K.U., Faulds,D.H. and Smith,G.R. TITLE A single base-pair change creates a chi recombinational hotspot in bacteriophage lambda JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 6182-6186 (1978) STANDARD full staff_review REFERENCE 37 (bases 13 to 72; 48391 to 48502) AUTHORS Nichols,B.P. and Donelson,J.E. TITLE 178-Nucleotide sequence surrounding the cos site of bacteriophage lambda DNA JOURNAL J. Virol. 26, 429-434 (1978) STANDARD full staff_review REFERENCE 38 (bases 38597 to 39688) AUTHORS Scherer,G. TITLE Nucleotide sequence of the O gene and of the origin of replication in bacteriophage lambda DNA JOURNAL Nucleic Acids Res. 5, 3141-3156 (1978) STANDARD full staff_review REFERENCE 39 (bases 38453 to 38500) AUTHORS Smith,G.R., Faulds,D.H. and Sprague,K.U. TITLE Nucleotide-sequence analysis of a chi site JOURNAL Cold Spring Harb. Symp. Quant. Biol. 43, 1067-1068 (1979) STANDARD full staff_review REFERENCE 40 (sites; transcription termination sites) AUTHORS Adhya,S. and Gottesman,M. TITLE Control of transcription termination JOURNAL Annu. Rev. Biochem. 47, 967-996 (1978) STANDARD full staff_review REFERENCE 41 (bases 38008 to 39328) AUTHORS Moore,D.D., Denniston-Thompson,K., Kruger,K.E., Furth,M.E., Williams,B.G., Daniels,D.L. and Blattner,F.R. TITLE Dissection and comparative anatomy of the origins of replication of lambdoid phages JOURNAL Cold Spring Harb. Symp. Quant. Biol. 43, 155-163 (1979) STANDARD full staff_review REFERENCE 42 (bases 27711 to 27826) AUTHORS Ross,W., Landy,A., Kikuchi,Y. and Nash,H. TITLE Interaction of int protein with specific sites on lambda att DNA JOURNAL Cell 18, 297-307 (1979) STANDARD full staff_review REFERENCE 43 (bases 37305 to 37352) AUTHORS Lieb,M. TITLE Is5 increases recombination in adjacent regions as shown for the repressor gene of coliphage lambda JOURNAL Gene 12, 277-280 (1980) STANDARD full staff_review REFERENCE 44 (bases 29055 to 29131) AUTHORS Miller,H.I., Abraham,J., Benedik,M., Campbell,A., Court,D., Echols,H., Fischer,R., Galindo,J.M., Guarneros,G., Hernandez,T., Mascarenhas,D., Montanez,C., Schindler,D., Schmeissner,U. and Sosa,L. TITLE Regulation of the integration-excision reaction by bacteriophage lambda JOURNAL Cold Spring Harb. Symp. Quant. Biol. 45, 439-445 (1981) STANDARD full staff_review REFERENCE 45 (bases 36245 to 36343) AUTHORS Pirrotta,V., Ineichen,K. and Walz,A. TITLE An unusual polymerase binding site in the immunity region of phage lambda JOURNAL Mol. Gen. Genet. 180, 369-376 (1980) STANDARD full staff_review REFERENCE 46 (bases 37940 to 38016) AUTHORS Rosen,E.D., Hartley,J.L., Matz,K., Nichols,B.P., Young,K.M., Donelson,J.E. and Gussin,G.N. TITLE DNA sequence analysis of prm- mutations of coliphage lambda JOURNAL Gene 11, 197-205 (1980) STANDARD full staff_review REFERENCE 47 (bases 38212 to 38467) AUTHORS Wulff,D.L., Beher,M., Izumi,S., Beck,J., Mahoney,M., Shimatake,H., Brady,C., Court,D. and Rosenberg,M. TITLE Structure and function of the cy control region of bacteriophage lambda JOURNAL J. Mol. Biol. 138, 209-230 (1980) STANDARD full staff_review REFERENCE 48 (bases 38237 to 38334) AUTHORS Court,D., Brady,C., Rosenberg,M., Wulff,D.L., Behr,M., Mahoney,M. and Izumi,S. TITLE Control of transcription termination: A rho-dependent termination site in bacteriophage lambda JOURNAL J. Mol. Biol. 138, 231-254 (1980) STANDARD full staff_review REFERENCE 49 (bases 38102 to 38166) AUTHORS Calva,E. and Burgess,R.R. TITLE Characterization of a rho-dependent termination site within the cro gene of bacteriophage lambda JOURNAL J. Biol. Chem. 255, 11017-11022 (1980) STANDARD full staff_review REFERENCE 50 (bases 28929 to 29198) AUTHORS Abraham,J., Mascarenhas,D., Fischer,R., Benedik,M., Campbell,A. and Echols,H. TITLE DNA sequence of regulatory region for integration gene of bacteriophage lambda JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 2477-2481 (1980) STANDARD full staff_review REFERENCE 51 (bases 27724 to 29525) AUTHORS Davies,R.W. TITLE DNA sequence of the int-xis p-i region of the bacteriophage lambda; overlap of the int and xis genes JOURNAL Nucleic Acids Res. 8, 1765-1782 (1980) STANDARD full staff_review REFERENCE 52 (bases 27501 to 27615) AUTHORS Mizuuchi,M. and Mizuuchi,K. TITLE Integrative recombination of bacteriophage lambda: extent of the DNA sequence involved in attachment site function JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 3220-3224 (1980) STANDARD full staff_review REFERENCE 53 (bases 27479 to 27633) AUTHORS Hsu,P.-L., Ross,W. and Landy,A. TITLE The lambda phage att site: functional limits and interaction with int protein JOURNAL Nature 285, 85-91 (1980) STANDARD full staff_review REFERENCE 54 (bases 23131 to 23248) AUTHORS Rosenvold,E.C., Calva,E., Burgess,R.R. and Szybalski,W. TITLE In vitro transcription from the b2 region of bacteriophage lambda JOURNAL Virology 107, 476-487 (1980) STANDARD full staff_review REFERENCE 55 (bases 30493 to 30569) AUTHORS Smith,G.R., Schultz,D.W. and Crasemann,J.M. TITLE Generalized recombination: Nucleotide sequence homology between chi recombinational hotspots JOURNAL Cell 19, 785-793 (1980) STANDARD full staff_review REFERENCE 56 (bases 37940 to 38023) AUTHORS Meyer,B.J., Maurer,R. and Ptashne,M. TITLE Gene regulation at the right operator (o-r) of bacteriophage lambda. II. o-r-1, o-r-2, and o-r-3: their roles in mediating the effects of repressor and cro JOURNAL J. Mol. Biol. 139, 163-194 (1980) STANDARD full staff_review REFERENCE 57 (bases 29055 to 29124) AUTHORS Abraham,J. and Echols,H. TITLE Regulation of int gene transcription by bacteriophage lambda: location of the start generated by an int constitutive mutation JOURNAL J. Mol. Biol. 146, 157-165 (1981) STANDARD full staff_review REFERENCE 58 (bases 43681 to 45634) AUTHORS Daniels,D.L. TITLE Control of late transcription in bacteriophage lambda JOURNAL Thesis (1981) University of Wisconsin, Madison, WI, USA STANDARD full staff_review REFERENCE 59 (bases 35468 to 35819) AUTHORS Horn,G.T. and Wells,R.D. TITLE The leftward promoter of bacteriophage lambda JOURNAL J. Biol. Chem. 256, 1998-2002 (1981) STANDARD full staff_review REFERENCE 60 (bases 32503 to 35905) AUTHORS Ineichen,K., Shepherd,J.C.W. and Bickle,T.A. TITLE The DNA sequence of the phage lambda genome between p-l and the gene bet JOURNAL Nucleic Acids Res. 9, 4639-4653 (1981) STANDARD full staff_review REFERENCE 61 (bases 43860 to 45001) AUTHORS Petrov,N.A., Karginov,V.A., Mikryukov,N.N., Serpinski,O.I. and Kravchenko,V.V. TITLE Complete nucleotide sequence of the bacteriophage lambda DNA region containing gene Q and promoter p-r JOURNAL FEBS Lett. 133, 316-320 (1981) STANDARD full staff_review REFERENCE 62 (bases 44972 to 45057) AUTHORS Smith,G.R., Comb,M., Schultz,D.W., Daniels,D.L. and Blattner,F.R. TITLE Nucleotide sequence of the chi recombinational hotspot chi+d in bacteriophage lambda JOURNAL J. Virol. 37, 336-342 (1981) STANDARD full staff_review REFERENCE 63 (bases 38686 to 39224) AUTHORS Moore,D.D., Denniston,K.J. and Blattner,F.R. TITLE Sequence organization of the origins of DNA replication in lambdoid coliphages JOURNAL Gene 14, 91-101 (1981) STANDARD full staff_review REFERENCE 64 (bases 35468 to 35711) AUTHORS Remaut,E., Stanssens,P. and Fiers,W. TITLE Plasmid vectors for high-efficiency expression controlled by the pl promoter of coliphage lambda JOURNAL Gene 15, 81-93 (1981) STANDARD full staff_review REFERENCE 65 (bases 35468 to 35541) AUTHORS Drahos,D. and Szybalski,W. TITLE Antitermination and termination functions of the cloned Nutl, N and tl1 modules of coliphage lambda JOURNAL Gene 16, 261-274 (1981) STANDARD full staff_review REFERENCE 66 (bases 27650 to 27741) AUTHORS Kravchenko,V.V. and Mikryukov,N.N. TITLE Localization of the promoter p-att of the binding site of Escherichia coli polymerase on phage lambda DNA near the integration site JOURNAL Dokl. Biochem. 264, 148-151 (1982) STANDARD full staff_review REFERENCE 67 (bases 39219 to 39338) AUTHORS Moore,D.D. and Blattner,F.R. TITLE Appendix: Sequence of lambda ri c 5b JOURNAL J. Mol. Biol. 154, 81-83 (1982) STANDARD full staff_review REFERENCE 68 (bases 37938 to 38018) AUTHORS Hawley,D.K. and McClure,W.R. TITLE Mechanism of activation of transcription initiation from the lambda p-rm promoter JOURNAL J. Mol. Biol. 157, 493-525 (1982) STANDARD full staff_review REFERENCE 69 (bases 25157 to 27484) AUTHORS Hong,G.F. TITLE A systematic DNA sequencing strategy JOURNAL J. Mol. Biol. 158, 539-549 (1982) STANDARD full staff_review REFERENCE 70 (bases 35577 to 35647) AUTHORS Hyman,H.C. and Honigman,A. TITLE The use of the plasmid pha10 in the isolation of lambda pl promoter mutations JOURNAL Mol. Gen. Genet. 185, 515-517 (1982) STANDARD full staff_review REFERENCE 71 (bases 38262 to 38386) AUTHORS Lau,L.F., Roberts,J.W. and Wu,R. TITLE Transcription terminates at lambda tr1 in three clusters JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 6171-6175 (1982) STANDARD full staff_review REFERENCE 72 (bases 31299 to 31408) AUTHORS Luk,K.-C. and Szybalski,W. TITLE Transcription termination: Sequence and function of the rho-independent t-l3 terminator in the major leftward operon of bacteriophage lambda JOURNAL Gene 17, 247-258 (1982) STANDARD full staff_review REFERENCE 73 (bases 35437 to 37348) AUTHORS Landsmann,J., Kroeger,M. and Hobom,G. TITLE The rex region of bacteriophage lambda: Two genes under three-way control JOURNAL Gene 20, 11-24 (1982) STANDARD full staff_review REFERENCE 74 (bases 40218 to 43972) AUTHORS Kroeger,M. and Hobom,G. TITLE A chain of interlinked genes in the NinR region of bacteriophage lambda JOURNAL Gene 20, 25-38 (1982) STANDARD full staff_review REFERENCE 75 (bases 31299 to 31408) AUTHORS Luk,K.-C. and Szybalski,W. TITLE Characterization of the cloned terminators t-r1, t-l3 and t-i, and the Nutr antitermination site of coliphage lambda JOURNAL Gene 20, 127-134 (1982) STANDARD full staff_review REFERENCE 76 (bases 48424 to 48500) AUTHORS Miwa,T. and Matsubara,K. TITLE Identification of sequences necessary for packaging DNA into lambda phage heads JOURNAL Gene 20, 267-279 (1982) STANDARD full staff_review REFERENCE 77 (bases 18414 to 18746) AUTHORS Luk,K.-C. and Szybalski,W. TITLE A cluster of leftward, rho-dependent t'j terminators in the J gene of coliphage lambda JOURNAL Gene 21, 175-191 (1983) STANDARD full staff_review REFERENCE 78 (bases 48469 to 48498) AUTHORS Miwa,T. and Matsubara,K. TITLE Lambda phage DNA sequences affecting the packaging process JOURNAL Gene 24, 199-206 (1983) STANDARD full staff_review REFERENCE 79 (bases 33287 to 33486) AUTHORS Knight,D.M. and Echols,H. TITLE The cIII gene and protein of bacteriophage lambda JOURNAL J. Mol. Biol. 163, 505-510 (1983) STANDARD full staff_review REFERENCE 80 (bases 33000 to 33244; 33420 to 33543; 33629 to 34080) AUTHORS Luk,K.-C. and Szybalski,W. TITLE The tl2 cluster of transcription termination sites between genes bet and ral of coliphage lambda JOURNAL Virology 125, 403-418 (1983) STANDARD full staff_review REFERENCE 81 (bases 37938 to 38019) AUTHORS Shih,M.-C. and Gussin,G.N. TITLE Mutations affecting two different steps in transcription initiation at the phage lambda p-rm promoter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 496-500 (1983) STANDARD full staff_review REFERENCE 82 (bases 1 to 56; 48474 to 48502) AUTHORS Feiss,M., Kobayashi,I. and Widner,W. TITLE Separate sites for binding and nicking of bacteriophage lambda DNA by terminase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 955-959 (1983) STANDARD full staff_review REFERENCE 83 (bases 29063 to 29140) AUTHORS Benedik,M., Mascarenhas,D. and Campbell,A. TITLE The integrase promoter and t1' terminator in bacteriophages lambda and 434 JOURNAL Virology 126, 658-668 (1983) STANDARD full staff_review REFERENCE 84 (bases 45901 to 46443) AUTHORS Taylor,A., Benedik,M. and Campbell,A. TITLE Location of the R-z gene in bacteriophage lambda JOURNAL Gene 26, 159-163 (1983) STANDARD full staff_review REFERENCE 85 (sites; cohesive ends) AUTHORS Hohn,B. TITLE DNA sequences necessary for packaging of bacteriophage lambda DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 7456-7460 (1983) STANDARD full staff_review REFERENCE 86 (sites; Pre-promoter mutations) AUTHORS Wulff,D.L., Mahoney,M., Shatzman,A. and Rosenberg,M. TITLE Mutational analysis of a regulatory region in bacteriophage lambda that has overlapping signals for the initiation of transcription and translation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 555-559 (1984) STANDARD full staff_review REFERENCE 87 (sites; attP recombination site) AUTHORS Craig,N.L. and Nash,H.A. TITLE E. coli integration host factor binds to specific sites in DNA JOURNAL Cell 39, 707-716 (1984) STANDARD full staff_review REFERENCE 88 (sites; major leftward transcription unit) AUTHORS Edlind,T.D., Cooley,T.E., Richards,S.H. and Ihler,G.M. TITLE Long range base-pairing in the leftward transcription unit of bacteriophage lambda: Characterization by electron microscopy and computer-aided sequence analysis JOURNAL J. Mol. Biol. 179, 351-365 (1984) STANDARD full staff_review REFERENCE 89 (sites; prohead binding) AUTHORS Frackman,S., Siegele,D.A. and Feiss,M. TITLE A functional domain of bacteriophage lambda terminase for prohead binding JOURNAL J. Mol. Biol. 180, 283-300 (1984) STANDARD full staff_review REFERENCE 90 (sites; cII binding site mutations) AUTHORS Place,N., Fien,K., Mahoney,M.E., Wulff,D.L., Ho,Y.-S., Debouck,C., Rosenberg,M., Shih,M.-C. and Gussin,G.N. TITLE Mutations that alter the DNA binding site for the bacteriophage lambda cII protein and affect the translation efficiency of the cII gene JOURNAL J. Mol. Biol. 180, 865-880 (1984) STANDARD full staff_review REFERENCE 91 (sites; nutR mutations) AUTHORS Warren,F. and Das,A. TITLE Formation of termination-resistant transcription complex at phage lambda nut locus: Effects of altered translation and a ribosomal mutation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 3612-3616 (1984) STANDARD full staff_review REFERENCE 92 (sites; light chain oligonucleotides) AUTHORS Coleclough,C. and Erlitz,F.L. TITLE Use of primer-restriction-end adapters in a novel cDNA cloning strategy JOURNAL Gene 34, 305-314 (1985) STANDARD full staff_review REFERENCE 93 (sites; nutL antiterminator) AUTHORS Peltz,S.W., Brown,A.L., Hasan,N., Podhajska,A.J. and Szybalski,W. TITLE Thermosensitivity of a DNA recognition Site: Activity of a truncated nutL Antiterminator of coliphage lambda JOURNAL Science 228, 91-93 (1985) STANDARD full staff_review REFERENCE 94 (sites; rho utilization sites A and B) AUTHORS Chen,C.-Y.A. and Richardson,J.P. TITLE Sequence elements essential for rho-dependent transcription termination at lambda-tR1 JOURNAL J. Biol. Chem. 262, 11292-11299 (1987) STANDARD simple staff_review REFERENCE 95 (bases 2521 to 3300) AUTHORS Hong,G.F. TITLE Sequencing of large double-stranded DNA using the dideoxy sequencing technique JOURNAL Biosci. Rep. 2, 907-912 (1982) STANDARD simple staff_entry COMMENT Contributed on tape by F.Sanger via D.L.Daniels. Most of references [10] through [80] are either annotated by [8] and [9], which are the immediate sources for the annotation below, or they are cited in Table 3 of [8]. Only references [1] through [9] are represented in the features table herein. This is the best representation to date of the wild-type lambda l-strand, though much of the sequence was determined for the cI857s7 strain and changed to wild-type [8]. All reported variations leading to the strains cI857s7, imm21, imm434, lac5, Nin5 and b2 are included in the annotation. The first twelve bases are the sticky ends. A significant fraction of the known mutations affecting replication and transcription have been annotated below; a large number of point mutations, deletions and substitutions have not. For a complete account of lambda mutations in relation to the sequence, see [8]. Each coding sequence belongs to a reading frame (orf) whose number, given in parentheses, should indicate the number of amino acids coded. The starting points for translation are known with varying degrees of certainty; for example, the start site for the N protein, given here as 35438, may turn out to be downstream (on the complementary strand) at 35360. When direct empirical evidence such as mutation or amino acid sequence is lacking, the start point is said to be putative. For a summary of the evidence bearing upon the coding sequences, see [7],[8]. Intergenic spaces in lambda are typically short and overlapping: the multiple reading frames (mult) range between a span of 1 and a span of 103. In most cases, a start codon precedes a termination codon, exceptions being the m-l boundary (13429) and the 314-194 boundary (21973) which show the E.coli trp operon pattern of "translational coupling" (see <ecotrp>). Transcription in the central region, bases 22686 to 37940, is leftward off the l-strand. In our annotation, this is indicated by the letter "c" and the descriptive term "comp strand". Signals and recognition sites in this region, without judgement made about their polarity, are treated accordingly, hence their span should be read toward the left rather than toward the right. Furthermore some leftward transcription is located outside the central region, and that is also indicated by "c" and "comp strand". In general, the estimates for the extent or span of signals (e.g. operators), binding sites (e.g. Nutr, int-binding sites, etc.) and of the attachment site (att) vary in the literature. This annotation follows [8]. No attempt is made to annotate promoters as signals because of the indefiniteness of their span, however known promoter mutants are given. The cII protein is known to bind in the -35 regions of p-i (29091) and pre(38369). Transcript termination sites must be understood to be conditional on the N and Q proteins and less than 100% efficient. There remain terminators to be found and some of those annotated may have significance only in vitro. FEATURES from to/span description pept 191 736 nu1 (DNA packaging;181) pept 711 2636 A (DNA packaging;641) pept 2633 2839 W (head-tail joining;68) pept 2836 4437 B (capsid component;533) sigp 2839 2901 leader peptide matp 2902 4434 processed B pept 4418 5737 C (capsid component;439) pept 5132 5737 nu3 (capsid assembly;201) pept 5747 6079 D (head-DNA stabilization;110) pept 6135 7160 E (capsid component;341) pept 7202 7600 Fi (DNA packaging;117) pept 7612 7965 Fii (head-tail joining;117) pept 7977 8555 Z (tail component;192) pept 8552 8947 U (tail component;131) pept 8955 9695 V (tail component;256) pept 9711 10133 G (tail component;140) pept 10115 10549 T (tail component;144) pept 10542 13103 H (tail component;853) pept 13100 13429 M (tail component;109) pept 13429 14127 L (tail component;232) pept 14276 14875 K (tail component;199) pept 14773 15444 I (tail component;223) pept 15505 18903 J (tail:host specificity;1132) pept 18965 19585 lom (outer host membrane;206a) pept 19650 20855 orf-401 pept 21029 21973 orf-314 pept 21973 22557 orf-194 pept 23918 22686 (c) ea47 pept 25399 24509 (c) ea31 (296) pept 26973 25396 (c) ea59 (525) pept 28882 27812 (c) int (integration;356) pept 29078 28860 (c) xis (excision;72) pept 29655 29374 (c) ea8.5 (93) pept 30395 29847 (c) ea22 (182) pept 32028 31348 (c) exo (exonuclease;226) pept 32810 32025 (c) bet (recombination;261) pept 33232 32816 (c) gam (recombination;138) pept 33330 33187 (c) kil(host-killing;54) pept 33463 33299 (c) cIII (antitermination;89) pept 33904 33536 (c) ea10 (ssb;122) pept 34287 34087 (c) ral(restriction alleviation;66) pept 35438 35037 (c) N (early gene regulator;133) pept 36259 35825 (c) rexb (exclusion;144) pept 37114 36275 (c) rexa (exclusion;279) pept 37940 37227 (c) cI (repressor;237) pept 38041 38241 cro (antirepressor; also tof;66) pept 38360 38653 cII (antitermination;119) pept 38686 39585 O (DNA replication;299) pept 39582 40283 P (DNA replication;233) pept 40280 40570 ren(exclusion;96) pept 40644 41084 Nin 146 (pept unknown;146) pept 41081 41953 Nin 290 (pept unknown;290) pept 41950 42123 Nin 57 (pept unknown;57) pept 42090 42272 Nin 60 (pept unknown;60) pept 42269 42439 Nin 56 (pept unknown;56) pept 42429 43043 Nin 204 (pept unknown;204) pept 43040 43246 Nin 68 (pept unknown;68) pept 43224 43889 Nin 221 (pept unknown;221) pept 43886 44509 Q (late gene regulator;207) pept 44621 44815 orf-64 pept 45186 45509 S (cell lysis;107) pept 45493 45969 R (cell lysis;158) pept 45966 46427 Rz (cell lysis;153) mRNA 35582 18482 (c) mRNA-pl (alt.; via t'j4 terminator) mRNA 35582 18597 (c) mRNA-pl (alt.; via t'j3 terminator) mRNA 35582 18637 (c) mRNA-pl (alt.; via t'j2 terminator) mRNA 35582 18671 (c) mRNA-pl (alt.; via t'j1 terminator) mRNA 35582 27538 (c) mRNA-pl (alt.; via ti terminator) mRNA 35582 31262 (c) mRNA-pl (alt.; via tl3 terminator) mRNA 35582 33100 (c) mRNA-pl (alt.; via tl2d terminator) mRNA 35582 33141 (c) mRNA-pl (alt.; via tl2c terminator) mRNA 35582 33494 (c) mRNA-pl (alt.; via tl2b terminator) mRNA 35582 33930 (c) mRNA-pl (alt.; via tl2a terminator) mRNA 35582 34560 (c) mRNA-pl (alt.; via tl1 terminator) mRNA 29065 27538 (c) mRNA int (integration; 356; via ti terminator) mRNA 23231 < 23231 (c) mRNA-pbl mRNA 37940 35798 (c) mRNA-prm (via timm terminator) mRNA 38343 35798 (c) mRNA-pre (via timm terminator) mRNA 36256 35798 (c) mRNA-plit (via timm terminator) mRNA 38023 38135 mRNA-pr (alt.; via tr0 terminator) mRNA 38023 38315 mRNA-pr (alt.; via tr1a terminator) mRNA 38023 38337 mRNA-pr (alt.; via tr1b terminator) mRNA 38023 38370 mRNA-pr (alt.; via tr1c terminator) mRNA 38023 40624 mRNA-pr (alt.; via tr2 terminator) mRNA 38675 38599 (c) mRNA-oop transcription mRNA mRNA 44587 44780 mRNA-pr' transcription (late genes) mRNA binding 27583 27602 int binding site 1 binding 27615 27634 int binding site 2 binding 27714 27747 int binding site 3 binding 27814 27854 int binding site 4 binding 35534 35518 (c) Nutl(N-utilization leftward)putative binding site binding 38265 38281 Nutr(N-utilization rightward)putative binding site binding 39034 39051 ori iteron 1(O binding site) binding 39054 39071 ori iteron 2(O binding site) binding 39078 39095 ori iteron 3(O binding site) binding 39101 39118 ori iteron 4(O binding site) mut 19368 23278 lac5 substitution mut 21737 > 21737 b2 substitution terminating at the att site recomb 24389 24390 lambda::lambdoid hybridization site mut 27537 27537 t in sib3 , c in wild-type mut 27547 27547 a in hef13 , g in wild-type mut 27568 27568 a in sib2 , c in wild-type mut 27573 27573 t in sib1 , g in wild-type recomb 27723 27738 attachment core(att)for host chromosome insertion mut 29063 29063 a in xis am6 , g in wild-type recomb 31266 31267 lambda::lambdoid hybridization site mut 34378 38617 imm21 region mut 35528 35528 a in Nutl63,g in Nutl96,t in Nutl18,c in wild-type mut 35530 35530 g in wild-type deleted in Nutl3 mut 35583 38245 imm434 region mut 35596 35596 a in vir2, t in v003, c in wild-type mut 35606 35606 c in vir101 , t in wild-type signal 35607 35591 (c) operator-l1 (first base on comp strand) mut 35621 35621 t in v305 , c in wild-type mut 35622 35622 t in v305 , g in wild-type signal 35631 35615 (c) operator-l2 (first base on comp strand) signal 35651 35635 (c) operator-l3 (first base on comp strand) mut 35940 35940 a in rex209 , g in wild-type mut 35947 35947 a in rex111 , g in wild-type mut 37287 37287 a in cIam14, c in wild-type mut 37308 37308 c in cIam504, g in wild-type mut 37313 37313 a in cIam505, g in wild-type variant 37589 37589 t in strain cI857s7([2]); c in wild type mut 37589 37589 t in ind1 , c in wild-type mut 37629 37629 c in cIam499, g in wild-type mut 37635 37635 c in cIam212, a in wild-type mut 37680 37680 a in cIam34, c in wild-type variant 37742 37742 t in strain ci857s7([2]); c in wild-type mut 37742 37742 t in ci857 , c in wild-type mut 37808 37808 a in cIam282, g in wild-type mut 37872 37872 c in cIam302, a in wild-type signal 37951 37967 operator-r3 mut 37954 37954 t in prm-e37 , c in wild-type mut 37955 37955 g in vc3 , a in wild-type mut 37957 37957 t in or3-r1 , c in wild-type mut 37958 37958 t in or3-r2, a in or3-r3 mutants, g in wild-type mut 37965 37965 g in or3-c12 , a in wild-type mut 37966 37966 c in or3-c10 , t in wild-type mut 37971 37971 g inp-rmup-1 , a in wild-type mut 37973 37973 t in prm-m104, 116, u31 mutants, c in wild-type signal 37974 37990 operator-r2 mut 37978 37978 t in prm-e104, g in vc3, a in wild-type mut 37979 37979 a in virl, t in prm-e93, c in wild-type mut 37985 37985 t in vn , g in wild-type mut 37989 37989 t deleted in mah4 mutant mut 37990 37990 g deleted in mch9 mutant mut 37991 37991 g in pr-x3 , a in wild-type signal 37998 38014 operator-r1 mut 38003 38003 a in vs326 , c in wild-type mut 38007 38007 t in prm-uv8, a in vir3, c in wild-type mut 38008 38008 a in prm-uv93, m36 mutants, g in wild-type mut 38009 38009 c in vs387, t in vc1, g in wild-type site 38249 38266 rho utilization site A (rutA) site 38282 38301 rho utilization site B (rutB) mut 38302 38302 a in cin-1 , g in wild-type mut 38306 38306 c in cnc1 , t in wild-type mut 38307 38307 g in cnc8 , a in wild-type mut 38350 38350 g in cy3048, a in wild-type mut 38354 38354 c in cy2001, t in wild-type mut 38357 38357 t in cy3019, c in wild-type mut 38364 38364 g in can1 , t in wild-type mut 38370 38370 t in cy3003 , c in wild-type mut 38371 38371 t in cy42 , a in wild-type mut 38376 38376 g in cy844 , a in wild-type mut 38379 38379 a in cy3008 , g in wild-type mut 38380 38380 t in cy3001 , c in wild-type mut 38430 38430 c in cII2002 , t in wild-type signal 38543 38557 ice(inceptor signal for DNA replication) mut 39122 39122 a in ti-12 , c in wild-type recomb 39157 39158 lambda::lambdoid hybridization site recomb 39165 39166 lambda::lambdoid hybridization site mut 39268 39268 t in ric5b , c in wild-type mut 39292 39292 a in ric5b , g in wild-type mut 40501 43307 Nin5 substitution variant 43082 43082 a in strain cI857s7 ([2]); g in wild-type unsure 43082 43082 g or a, cited in [8] recomb 43884 43885 lambda::lambdoid hybridization site variant 45352 45352 a in strain cI857s7 ([2]); g in wild-type mut 45352 45352 a in sam7 , g in wild-type BASE COUNT 12334 a 11362 c 12820 g 11986 t ORIGIN 5' end of the l-strand of the lambda chromosome (sticky end). 1 gggcggcgac ctcgcgggtt ttcgctattt atgaaaattt tccggtttaa ggcgtttccg 61 ttcttcttcg tcataactta atgtttttat ttaaaatacc ctctgaaaag aaaggaaacg 121 acaggtgctg aaagcgaggc tttttggcct ctgtcgtttc ctttctctgt ttttgtccgt 181 ggaatgaaca atggaagtca acaaaaagca gctggctgac attttcggtg cgagtatccg 241 taccattcag aactggcagg aacagggaat gcccgttctg cgaggcggtg gcaagggtaa 301 tgaggtgctt tatgactctg ccgccgtcat aaaatggtat gccgaaaggg atgctgaaat 361 tgagaacgaa aagctgcgcc gggaggttga agaactgcgg caggccagcg aggcagatct 421 ccagccagga actattgagt acgaacgcca tcgacttacg cgtgcgcagg ccgacgcaca 481 ggaactgaag aatgccagag actccgctga agtggtggaa accgcattct gtactttcgt 541 gctgtcgcgg atcgcaggtg aaattgccag tattctcgac gggctccccc tgtcggtgca 601 gcggcgtttt ccggaactgg aaaaccgaca tgttgatttc ctgaaacggg atatcatcaa 661 agccatgaac aaagcagccg cgctggatga actgataccg gggttgctga gtgaatatat 721 cgaacagtca ggttaacagg ctgcggcatt ttgtccgcgc cgggcttcgc tcactgttca 781 ggccggagcc acagaccgcc gttgaatggg cggatgctaa ttactatctc ccgaaagaat 841 ccgcatacca ggaagggcgc tgggaaacac tgccctttca gcgggccatc atgaatgcga 901 tgggcagcga ctacatccgt gaggtgaatg tggtgaagtc tgcccgtgtc ggttattcca 961 aaatgctgct gggtgtttat gcctacttta tagagcataa gcagcgcaac acccttatct 1021 ggttgccgac ggatggtgat gccgagaact ttatgaaaac ccacgttgag ccgactattc 1081 gtgatattcc gtcgctgctg gcgctggccc cgtggtatgg caaaaagcac cgggataaca 1141 cgctcaccat gaagcgtttc actaatgggc gtggcttctg gtgcctgggc ggtaaagcgg 1201 caaaaaacta ccgtgaaaag tcggtggatg tggcgggtta tgatgaactt gctgcttttg 1261 atgatgatat tgaacaggaa ggctctccga cgttcctggg tgacaagcgt attgaaggct 1321 cggtctggcc aaagtccatc cgtggctcca cgccaaaagt gagaggcacc tgtcagattg 1381 agcgtgcagc cagtgaatcc ccgcatttta tgcgttttca tgttgcctgc ccgcattgcg 1441 gggaggagca gtatcttaaa tttggcgaca aagagacgcc gtttggcctc aaatggacgc 1501 cggatgaccc ctccagcgtg ttttatctct gcgagcataa tgcctgcgtc atccgccagc 1561 aggagctgga ctttactgat gcccgttata tctgcgaaaa gaccgggatc tggacccgtg 1621 atggcattct ctggttttcg tcatccggtg aagagattga gccacctgac agtgtgacct 1681 ttcacatctg gacagcgtac agcccgttca ccacctgggt gcagattgtc aaagactgga 1741 tgaaaacgaa aggggatacg ggaaaacgta aaaccttcgt aaacaccacg ctcggtgaga 1801 cgtgggaggc gaaaattggc gaacgtccgg atgctgaagt gatggcagag cggaaagagc 1861 attattcagc gcccgttcct gaccgtgtgg cttacctgac cgccggtatc gactcccagc 1921 tggaccgcta cgaaatgcgc gtatggggat gggggccggg tgaggaaagc tggctgattg 1981 accggcagat tattatgggc cgccacgacg atgaacagac gctgctgcgt gtggatgagg 2041 ccatcaataa aacctatacc cgccggaatg gtgcagaaat gtcgatatcc cgtatctgct 2101 gggatactgg cgggattgac ccgaccattg tgtatgaacg ctcgaaaaaa catgggctgt 2161 tccgggtgat ccccattaaa ggggcatccg tctacggaaa gccggtggcc agcatgccac 2221 gtaagcgaaa caaaaacggg gtttacctta ccgaaatcgg tacggatacc gcgaaagagc 2281 agatttataa ccgcttcaca ctgacgccgg aaggggatga accgcttccc ggtgccgttc 2341 acttcccgaa taacccggat atttttgatc tgaccgaagc gcagcagctg actgctgaag 2401 agcaggtcga aaaatgggtg gatggcagga aaaaaatact gtgggacagc aaaaagcgac 2461 gcaatgaggc actcgactgc ttcgtttatg cgctggcggc gctgcgcatc agtatttccc 2521 gctggcagct ggatctcagt gcgctgctgg cgagcctgca ggaagaggat ggtgcagcaa 2581 ccaacaagaa aacactggca gattacgccc gtgccttatc cggagaggat gaatgacgcg 2641 acaggaagaa cttgccgctg cccgtgcggc actgcatgac ctgatgacag gtaaacgggt 2701 ggcaacagta cagaaagacg gacgaagggt ggagtttacg gccacttccg tgtctgacct 2761 gaaaaaatat attgcagagc tggaagtgca gaccggcatg acacagcgac gcaggggacc 2821 tgcaggattt tatgtatgaa aacgcccacc attcccaccc ttctggggcc ggacggcatg 2881 acatcgctgc gcgaatatgc cggttatcac ggcggtggca gcggatttgg agggcagttg 2941 cggtcgtgga acccaccgag tgaaagtgtg gatgcagccc tgttgcccaa ctttacccgt 3001 ggcaatgccc gcgcagacga tctggtacgc aataacggct atgccgccaa cgccatccag 3061 ctgcatcagg atcatatcgt cgggtctttt ttccggctca gtcatcgccc aagctggcgc 3121 tatctgggca tcggggagga agaagcccgt gccttttccc gcgaggttga agcggcatgg 3181 aaagagtttg ccgaggatga ctgctgctgc attgacgttg agcgaaaacg cacgtttacc 3241 atgatgattc gggaaggtgt ggccatgcac gcctttaacg gtgaactgtt cgttcaggcc 3301 acctgggata ccagttcgtc gcggcttttc cggacacagt tccggatggt cagcccgaag 3361 cgcatcagca acccgaacaa taccggcgac agccggaact gccgtgccgg tgtgcagatt 3421 aatgacagcg gtgcggcgct gggatattac gtcagcgagg acgggtatcc tggctggatg 3481 ccgcagaaat ggacatggat accccgtgag ttacccggcg ggcgcgcctc gttcattcac 3541 gtttttgaac ccgtggagga cgggcagact cgcggtgcaa atgtgtttta cagcgtgatg 3601 gagcagatga agatgctcga cacgctgcag aacacgcagc tgcagagcgc cattgtgaag 3661 gcgatgtatg ccgccaccat tgagagtgag ctggatacgc agtcagcgat ggattttatt 3721 ctgggcgcga acagtcagga gcagcgggaa aggctgaccg gctggattgg tgaaattgcc 3781 gcgtattacg ccgcagcgcc ggtccggctg ggaggcgcaa aagtaccgca cctgatgccg 3841 ggtgactcac tgaacctgca gacggctcag gatacggata acggctactc cgtgtttgag 3901 cagtcactgc tgcggtatat cgctgccggg ctgggtgtct cgtatgagca gctttcccgg 3961 aattacgccc agatgagcta ctccacggca cgggccagtg cgaacgagtc gtgggcgtac 4021 tttatggggc ggcgaaaatt cgtcgcatcc cgtcaggcga gccagatgtt tctgtgctgg 4081 ctggaagagg ccatcgttcg ccgcgtggtg acgttacctt caaaagcgcg cttcagtttt 4141 caggaagccc gcagtgcctg ggggaactgc gactggatag gctccggtcg tatggccatc 4201 gatggtctga aagaagttca ggaagcggtg atgctgatag aagccggact gagtacctac 4261 gagaaagagt gcgcaaaacg cggtgacgac tatcaggaaa tttttgccca gcaggtccgt 4321 gaaacgatgg agcgccgtgc agccggtctt aaaccgcccg cctgggcggc tgcagcattt 4381 gaatccgggc tgcgacaatc aacagaggag gagaagagtg acagcagagc tgcgtaatct 4441 cccgcatatt gccagcatgg cctttaatga gccgctgatg cttgaacccg cctatgcgcg 4501 ggttttcttt tgtgcgcttg caggccagct tgggatcagc agcctgacgg atgcggtgtc 4561 cggcgacagc ctgactgccc aggaggcact cgcgacgctg gcattatccg gtgatgatga 4621 cggaccacga caggcccgca gttatcaggt catgaacggc atcgccgtgc tgccggtgtc 4681 cggcacgctg gtcagccgga cgcgggcgct gcagccgtac tcggggatga ccggttacaa 4741 cggcattatc gcccgtctgc aacaggctgc cagcgatccg atggtggacg gcattctgct 4801 cgatatggac acgcccggcg ggatggtggc gggggcattt gactgcgctg acatcatcgc 4861 ccgtgtgcgt gacataaaac cggtatgggc gcttgccaac gacatgaact gcagtgcagg 4921 tcagttgctt gccagtgccg cctcccggcg tctggtcacg cagaccgccc ggacaggctc 4981 catcggcgtc atgatggctc acagtaatta cggtgctgcg ctggagaaac agggtgtgga 5041 aatcacgctg atttacagcg gcagccataa ggtggatggc aacccctaca gccatcttcc 5101 ggatgacgtc cgggagacac tgcagtcccg gatggacgca acccgccaga tgtttgcgca 5161 gaaggtgtcg gcatataccg gcctgtccgt gcaggttgtg ctggataccg aggctgcagt 5221 gtacagcggt caggaggcca ttgatgccgg actggctgat gaacttgtta acagcaccga 5281 tgcgatcacc gtcatgcgtg atgcactgga tgcacgtaaa tcccgtctct caggagggcg 5341 aatgaccaaa gagactcaat caacaactgt ttcagccact gcttcgcagg ctgacgttac 5401 tgacgtggtg ccagcgacgg agggcgagaa cgccagcgcg gcgcagccgg acgtgaacgc 5461 gcagatcacc gcagcggttg cggcagaaaa cagccgcatt atggggatcc tcaactgtga 5521 ggaggctcac ggacgcgaag aacaggcacg cgtgctggca gaaacccccg gtatgaccgt 5581 gaaaacggcc cgccgcattc tggccgcagc accacagagt gcacaggcgc gcagtgacac 5641 tgcgctggat cgtctgatgc agggggcacc ggcaccgctg gctgcaggta acccggcatc 5701 tgatgccgtt aacgatttgc tgaacacacc agtgtaaggg atgtttatga cgagcaaaga 5761 aacctttacc cattaccagc cgcagggcaa cagtgacccg gctcataccg caaccgcgcc 5821 cggcggattg agtgcgaaag cgcctgcaat gaccccgctg atgctggaca cctccagccg 5881 taagctggtt gcgtgggatg gcaccaccga cggtgctgcc gttggcattc ttgcggttgc 5941 tgctgaccag accagcacca cgctgacgtt ctacaagtcc ggcacgttcc gttatgagga 6001 tgtgctctgg ccggaggctg ccagcgacga gacgaaaaaa cggaccgcgt ttgccggaac 6061 ggcaatcagc atcgtttaac tttacccttc atcactaaag gccgcctgtg cggctttttt 6121 tacgggattt ttttatgtcg atgtacacaa ccgcccaact gctggcggca aatgagcaga 6181 aatttaagtt tgatccgctg tttctgcgtc tctttttccg tgagagctat cccttcacca 6241 cggagaaagt ctatctctca caaattccgg gactggtaaa catggcgctg tacgtttcgc 6301 cgattgtttc cggtgaggtt atccgttccc gtggcggctc cacctctgaa tttacgccgg 6361 gatatgtcaa gccgaagcat gaagtgaatc cgcagatgac cctgcgtcgc ctgccggatg 6421 aagatccgca gaatctggcg gacccggctt accgccgccg tcgcatcatc atgcagaaca 6481 tgcgtgacga agagctggcc attgctcagg tcgaagagat gcaggcagtt tctgccgtgc 6541 ttaagggcaa atacaccatg accggtgaag ccttcgatcc ggttgaggtg gatatgggcc 6601 gcagtgagga gaataacatc acgcagtccg gcggcacgga gtggagcaag cgtgacaagt 6661 ccacgtatga cccgaccgac gatatcgaag cctacgcgct gaacgccagc ggtgtggtga 6721 atatcatcgt gttcgatccg aaaggctggg cgctgttccg ttccttcaaa gccgtcaagg 6781 agaagctgga tacccgtcgt ggctctaatt ccgagctgga gacagcggtg aaagacctgg 6841 gcaaagcggt gtcctataag gggatgtatg gcgatgtggc catcgtcgtg tattccggac 6901 agtacgtgga aaacggcgtc aaaaagaact tcctgccgga caacacgatg gtgctgggga 6961 acactcaggc acgcggtctg cgcacctatg gctgcattca ggatgcggac gcacagcgcg 7021 aaggcattaa cgcctctgcc cgttacccga aaaactgggt gaccaccggc gatccggcgc 7081 gtgagttcac catgattcag tcagcaccgc tgatgctgct ggctgaccct gatgagttcg 7141 tgtccgtaca actggcgtaa tcatggccct tcggggccat tgtttctctg tggaggagtc 7201 catgacgaaa gatgaactga ttgcccgtct ccgctcgctg ggtgaacaac tgaaccgtga 7261 tgtcagcctg acggggacga aagaagaact ggcgctccgt gtggcagagc tgaaagagga 7321 gcttgatgac acggatgaaa ctgccggtca ggacacccct ctcagccggg aaaatgtgct 7381 gaccggacat gaaaatgagg tgggatcagc gcagccggat accgtgattc tggatacgtc 7441 tgaactggtc acggtcgtgg cactggtgaa gctgcatact gatgcacttc acgccacgcg 7501 ggatgaacct gtggcatttg tgctgccggg aacggcgttt cgtgtctctg ccggtgtggc 7561 agccgaaatg acagagcgcg gcctggccag aatgcaataa cgggaggcgc tgtggctgat 7621 ttcgataacc tgttcgatgc tgccattgcc cgcgccgatg aaacgatacg cgggtacatg 7681 ggaacgtcag ccaccattac atccggtgag cagtcaggtg cggtgatacg tggtgttttt 7741 gatgaccctg aaaatatcag ctatgccgga cagggcgtgc gcgttgaagg ctccagcccg 7801 tccctgtttg tccggactga tgaggtgcgg cagctgcggc gtggagacac gctgaccatc 7861 ggtgaggaaa atttctgggt agatcgggtt tcgccggatg atggcggaag ttgtcatctc 7921 tggcttggac ggggcgtacc gcctgccgtt aaccgtcgcc gctgaaaggg ggatgtatgg 7981 ccataaaagg tcttgagcag gccgttgaaa acctcagccg tatcagcaaa acggcggtgc 8041 ctggtgccgc cgcaatggcc attaaccgcg ttgcttcatc cgcgatatcg cagtcggcgt 8101 cacaggttgc ccgtgagaca aaggtacgcc ggaaactggt aaaggaaagg gccaggctga 8161 aaagggccac ggtcaaaaat ccgcaggcca gaatcaaagt taaccggggg gatttgcccg 8221 taatcaagct gggtaatgcg cgggttgtcc tttcgcgccg caggcgtcgt aaaaaggggc 8281 agcgttcatc cctgaaaggt ggcggcagcg tgcttgtggt gggtaaccgt cgtattcccg 8341 gcgcgtttat tcagcaactg aaaaatggcc ggtggcatgt catgcagcgt gtggctggga 8401 aaaaccgtta ccccattgat gtggtgaaaa tcccgatggc ggtgccgctg accacggcgt 8461 ttaaacaaaa tattgagcgg atacggcgtg aacgtcttcc gaaagagctg ggctatgcgc 8521 tgcagcatca actgaggatg gtaataaagc gatgaaacat actgaactcc gtgcagccgt 8581 actggatgca ctggagaagc atgacaccgg ggcgacgttt tttgatggtc gccccgctgt 8641 ttttgatgag gcggattttc cggcagttgc cgtttatctc accggcgctg aatacacggg 8701 cgaagagctg gacagcgata cctggcaggc ggagctgcat atcgaagttt tcctgcctgc 8761 tcaggtgccg gattcagagc tggatgcgtg gatggagtcc cggatttatc cggtgatgag 8821 cgatatcccg gcactgtcag atttgatcac cagtatggtg gccagcggct atgactaccg 8881 gcgcgacgat gatgcgggct tgtggagttc agccgatctg acttatgtca ttacctatga 8941 aatgtgagga cgctatgcct gtaccaaatc ctacaatgcc ggtgaaaggt gccgggacca 9001 ccctgtgggt ttataagggg agcggtgacc cttacgcgaa tccgctttca gacgttgact 9061 ggtcgcgtct ggcaaaagtt aaagacctga cgcccggcga actgaccgct gagtcctatg 9121 acgacagcta tctcgatgat gaagatgcag actggactgc gaccgggcag gggcagaaat 9181 ctgccggaga taccagcttc acgctggcgt ggatgcccgg agagcagggg cagcaggcgc 9241 tgctggcgtg gtttaatgaa ggcgataccc gtgcctataa aatccgcttc ccgaacggca 9301 cggtcgatgt gttccgtggc tgggtcagca gtatcggtaa ggcggtgacg gcgaaggaag 9361 tgatcacccg cacggtgaaa gtcaccaatg tgggacgtcc gtcgatggca gaagatcgca 9421 gcacggtaac agcggcaacc ggcatgaccg tgacgcctgc cagcacctcg gtggtgaaag 9481 ggcagagcac cacgctgacc gtggccttcc agccggaggg cgtaaccgac aagagctttc 9541 gtgcggtgtc tgcggataaa acaaaagcca ccgtgtcggt cagtggtatg accatcaccg 9601 tgaacggcgt tgctgcaggc aaggtcaaca ttccggttgt atccggtaat ggtgagtttg 9661 ctgcggttgc agaaattacc gtcaccgcca gttaatccgg agagtcagcg atgttcctga 9721 aaaccgaatc atttgaacat aacggtgtga ccgtcacgct ttctgaactg tcagccctgc 9781 agcgcattga gcatctcgcc ctgatgaaac ggcaggcaga acaggcggag tcagacagca 9841 accggaagtt tactgtggaa gacgccatca gaaccggcgc gtttctggtg gcgatgtccc 9901 tgtggcataa ccatccgcag aagacgcaga tgccgtccat gaatgaagcc gttaaacaga 9961 ttgagcagga agtgcttacc acctggccca cggaggcaat ttctcatgct gaaaacgtgg 10021 tgtaccggct gtctggtatg tatgagtttg tggtgaataa tgcccctgaa cagacagagg 10081 acgccgggcc cgcagagcct gtttctgcgg gaaagtgttc gacggtgagc tgagttttgc 10141 cctgaaactg gcgcgtgaga tggggcgacc cgactggcgt gccatgcttg ccgggatgtc 10201 atccacggag tatgccgact ggcaccgctt ttacagtacc cattattttc atgatgttct 10261 gctggatatg cacttttccg ggctgacgta caccgtgctc agcctgtttt tcagcgatcc 10321 ggatatgcat ccgctggatt tcagtctgct gaaccggcgc gaggctgacg aagagcctga 10381 agatgatgtg ctgatgcaga aagcggcagg gcttgccgga ggtgtccgct ttggcccgga 10441 cgggaatgaa gttatccccg cttccccgga tgtggcggac atgacggagg atgacgtaat 10501 gctgatgaca gtatcagaag ggatcgcagg aggagtccgg tatggctgaa ccggtaggcg 10561 atctggtcgt tgatttgagt ctggatgcgg ccagatttga cgagcagatg gccagagtca 10621 ggcgtcattt ttctggtacg gaaagtgatg cgaaaaaaac agcggcagtc gttgaacagt 10681 cgctgagccg acaggcgctg gctgcacaga aagcggggat ttccgtcggg cagtataaag 10741 ccgccatgcg tatgctgcct gcacagttca ccgacgtggc cacgcagctt gcaggcgggc 10801 aaagtccgtg gctgatcctg ctgcaacagg gggggcaggt gaaggactcc ttcggcggga 10861 tgatccccat gttcaggggg cttgccggtg cgatcaccct gccgatggtg ggggccacct 10921 cgctggcggt ggcgaccggt gcgctggcgt atgcctggta tcagggcaac tcaaccctgt 10981 ccgatttcaa caaaacgctg gtcctttccg gcaatcaggc gggactgacg gcagatcgta 11041 tgctggtcct gtccagagcc gggcaggcgg cagggctgac gtttaaccag accagcgagt 11101 cactcagcgc actggttaag gcgggggtaa gcggtgaggc tcagattgcg tccatcagcc 11161 agagtgtggc gcgtttctcc tctgcatccg gcgtggaggt ggacaaggtc gctgaagcct 11221 tcgggaagct gaccacagac ccgacgtcgg ggctgacggc gatggctcgc cagttccata 11281 acgtgtcggc ggagcagatt gcgtatgttg ctcagttgca gcgttccggc gatgaagccg 11341 gggcattgca ggcggcgaac gaggccgcaa cgaaagggtt tgatgaccag acccgccgcc 11401 tgaaagagaa catgggcacg ctggagacct gggcagacag gactgcgcgg gcattcaaat 11461 ccatgtggga tgcggtgctg gatattggtc gtcctgatac cgcgcaggag atgctgatta 11521 aggcagaggc tgcgtataag aaagcagacg acatctggaa tctgcgcaag gatgattatt 11581 ttgttaacga tgaagcgcgg gcgcgttact gggatgatcg tgaaaaggcc cgtcttgcgc 11641 ttgaagccgc ccgaaagaag gctgagcagc agactcaaca ggacaaaaat gcgcagcagc 11701 agagcgatac cgaagcgtca cggctgaaat ataccgaaga ggcgcagaag gcttacgaac 11761 ggctgcagac gccgctggag aaatataccg cccgtcagga agaactgaac aaggcactga 11821 aagacgggaa aatcctgcag gcggattaca acacgctgat ggcggcggcg aaaaaggatt 11881 atgaagcgac gctgaaaaag ccgaaacagt ccagcgtgaa ggtgtctgcg ggcgatcgtc 11941 aggaagacag tgctcatgct gccctgctga cgcttcaggc agaactccgg acgctggaga 12001 agcatgccgg agcaaatgag aaaatcagcc agcagcgccg ggatttgtgg aaggcggaga 12061 gtcagttcgc ggtactggag gaggcggcgc aacgtcgcca gctgtctgca caggagaaat 12121 ccctgctggc gcataaagat gagacgctgg agtacaaacg ccagctggct gcacttggcg 12181 acaaggttac gtatcaggag cgcctgaacg cgctggcgca gcaggcggat aaattcgcac 12241 agcagcaacg ggcaaaacgg gccgccattg atgcgaaaag ccgggggctg actgaccggc 12301 aggcagaacg ggaagccacg gaacagcgcc tgaaggaaca gtatggcgat aatccgctgg 12361 cgctgaataa cgtcatgtca gagcagaaaa agacctgggc ggctgaagac cagcttcgcg 12421 ggaactggat ggcaggcctg aagtccggct ggagtgagtg ggaagagagc gccacggaca 12481 gtatgtcgca ggtaaaaagt gcagccacgc agacctttga tggtattgca cagaatatgg 12541 cggcgatgct gaccggcagt gagcagaact ggcgcagctt cacccgttcc gtgctgtcca 12601 tgatgacaga aattctgctt aagcaggcaa tggtggggat tgtcgggagt atcggcagcg 12661 ccattggcgg ggctgttggt ggcggcgcat ccgcgtcagg cggtacagcc attcaggccg 12721 ctgcggcgaa attccatttt gcaaccggag gatttacggg aaccggcggc aaatatgagc 12781 cagcggggat tgttcaccgt ggtgagtttg tcttcacgaa ggaggcaacc agccggattg 12841 gcgtggggaa tctttaccgg ctgatgcgcg gctatgccac cggcggttat gtcggtacac 12901 cgggcagcat ggcagacagc cggtcgcagg cgtccgggac gtttgagcag aataaccatg 12961 tggtgattaa caacgacggc acgaacgggc agataggtcc ggctgctctg aaggcggtgt 13021 atgacatggc ccgcaagggt gcccgtgatg aaattcagac acagatgcgt gatggtggcc 13081 tgttctccgg aggtggacga tgaagacctt ccgctggaaa gtgaaacccg gtatggatgt 13141 ggcttcggtc ccttctgtaa gaaaggtgcg ctttggtgat ggctattctc agcgagcgcc 13201 tgccgggctg aatgccaacc tgaaaacgta cagcgtgacg ctttctgtcc cccgtgagga 13261 ggccacggta ctggagtcgt ttctggaaga gcacgggggc tggaaatcct ttctgtggac 13321 gccgccttat gagtggcggc agataaaggt gacctgcgca aaatggtcgt cgcgggtcag 13381 tatgctgcgt gttgagttca gcgcagagtt tgaacaggtg gtgaactgat gcaggatatc 13441 cggcaggaaa cactgaatga atgcacccgt gcggagcagt cggccagcgt ggtgctctgg 13501 gaaatcgacc tgacagaggt cggtggagaa cgttattttt tctgtaatga gcagaacgaa 13561 aaaggtgagc cggtcacctg gcaggggcga cagtatcagc cgtatcccat tcaggggagc 13621 ggttttgaac tgaatggcaa aggcaccagt acgcgcccca cgctgacggt ttctaacctg 13681 tacggtatgg tcaccgggat ggcggaagat atgcagagtc tggtcggcgg aacggtggtc 13741 cggcgtaagg tttacgcccg ttttctggat gcggtgaact tcgtcaacgg aaacagttac 13801 gccgatccgg agcaggaggt gatcagccgc tggcgcattg agcagtgcag cgaactgagc 13861 gcggtgagtg cctcctttgt actgtccacg ccgacggaaa cggatggcgc tgtttttccg 13921 ggacgtatca tgctggccaa cacctgcacc tggacctatc gcggtgacga gtgcggttat 13981 agcggtccgg ctgtcgcgga tgaatatgac cagccaacgt ccgatatcac gaaggataaa 14041 tgcagcaaat gcctgagcgg ttgtaagttc cgcaataacg tcggcaactt tggcggcttc 14101 ctttccatta acaaactttc gcagtaaatc ccatgacaca gacagaatca gcgattctgg 14161 cgcacgcccg gcgatgtgcg ccagcggagt cgtgcggctt cgtggtaagc acgccggagg 14221 gggaaagata tttcccctgc gtgaatatct ccggtgagcc ggaggctatt tccgtatgtc 14281 gccggaagac tggctgcagg cagaaatgca gggtgagatt gtggcgctgg tccacagcca 14341 ccccggtggt ctgccctggc tgagtgaggc cgaccggcgg ctgcaggtgc agagtgattt 14401 gccgtggtgg ctggtctgcc gggggacgat tcataagttc cgctgtgtgc cgcatctcac 14461 cgggcggcgc tttgagcacg gtgtgacgga ctgttacaca ctgttccggg atgcttatca 14521 tctggcgggg attgagatgc cggactttca tcgtgaggat gactggtggc gtaacggcca 14581 gaatctctat ctggataatc tggaggcgac ggggctgtat caggtgccgt tgtcagcggc 14641 acagccgggc gatgtgctgc tgtgctgttt tggttcatca gtgccgaatc acgccgcaat 14701 ttactgcggc gacggcgagc tgctgcacca tattcctgaa caactgagca aacgagagag 14761 gtacaccgac aaatggcagc gacgcacaca ctccctctgg cgtcaccggg catggcgcgc 14821 atctgccttt acggggattt acaacgattt ggtcgccgca tcgaccttcg tgtgaaaacg 14881 ggggctgaag ccatccgggc actggccaca cagctcccgg cgtttcgtca gaaactgagc 14941 gacggctggt atcaggtacg gattgccggg cgggacgtca gcacgtccgg gttaacggcg 15001 cagttacatg agactctgcc tgatggcgct gtaattcata ttgttcccag agtcgccggg 15061 gccaagtcag gtggcgtatt ccagattgtc ctgggggctg ccgccattgc cggatcattc 15121 tttaccgccg gagccaccct tgcagcatgg ggggcagcca ttggggccgg tggtatgacc 15181 ggcatcctgt tttctctcgg tgccagtatg gtgctcggtg gtgtggcgca gatgctggca 15241 ccgaaagcca gaactccccg tatacagaca acggataacg gtaagcagaa cacctatttc 15301 tcctcactgg ataacatggt tgcccagggc aatgttctgc ctgttctgta cggggaaatg 15361 cgcgtggggt cacgcgtggt ttctcaggag atcagcacgg cagacgaagg ggacggtggt 15421 caggttgtgg tgattggtcg ctgatgcaaa atgttttatg tgaaaccgcc tgcgggcggt 15481 tttgtcattt atggagcgtg aggaatgggt aaaggaagca gtaaggggca taccccgcgc 15541 gaagcgaagg acaacctgaa gtccacgcag ttgctgagtg tgatcgatgc catcagcgaa 15601 gggccgattg aaggtccggt ggatggctta aaaagcgtgc tgctgaacag tacgccggtg 15661 ctggacactg aggggaatac caacatatcc ggtgtcacgg tggtgttccg ggctggtgag 15721 caggagcaga ctccgccgga gggatttgaa tcctccggct ccgagacggt gctgggtacg 15781 gaagtgaaat atgacacgcc gatcacccgc accattacgt ctgcaaacat cgaccgtctg 15841 cgctttacct tcggtgtaca ggcactggtg gaaaccacct caaagggtga caggaatccg 15901 tcggaagtcc gcctgctggt tcagatacaa cgtaacggtg gctgggtgac ggaaaaagac 15961 atcaccatta agggcaaaac cacctcgcag tatctggcct cggtggtgat gggtaacctg 16021 ccgccgcgcc cgtttaatat ccggatgcgc aggatgacgc cggacagcac cacagaccag 16081 ctgcagaaca aaacgctctg gtcgtcatac actgaaatca tcgatgtgaa acagtgctac 16141 ccgaacacgg cactggtcgg cgtgcaggtg gactcggagc agttcggcag ccagcaggtg 16201 agccgtaatt atcatctgcg cgggcgtatt ctgcaggtgc cgtcgaacta taacccgcag 16261 acgcggcaat acagcggtat ctgggacgga acgtttaaac cggcatacag caacaacatg 16321 gcctggtgtc tgtgggatat gctgacccat ccgcgctacg gcatggggaa acgtcttggt 16381 gcggcggatg tggataaatg ggcgctgtat gtcatcggcc agtactgcga ccagtcagtg 16441 ccggacggct ttggcggcac ggagccgcgc atcacctgta atgcgtacct gaccacacag 16501 cgtaaggcgt gggatgtgct cagcgatttc tgctcggcga tgcgctgtat gccggtatgg 16561 aacgggcaga cgctgacgtt cgtgcaggac cgaccgtcgg ataagacgtg gacctataac 16621 cgcagtaatg tggtgatgcc ggatgatggc gcgccgttcc gctacagctt cagcgccctg 16681 aaggaccgcc ataatgccgt tgaggtgaac tggattgacc cgaacaacgg ctgggagacg 16741 gcgacagagc ttgttgaaga tacgcaggcc attgcccgtt acggtcgtaa tgttacgaag 16801 atggatgcct ttggctgtac cagccggggg caggcacacc gcgccgggct gtggctgatt 16861 aaaacagaac tgctggaaac gcagaccgtg gatttcagcg tcggcgcaga agggcttcgc 16921 catgtaccgg gcgatgttat tgaaatctgc gatgatgact atgccggtat cagcaccggt 16981 ggtcgtgtgc tggcggtgaa cagccagacc cggacgctga cgctcgaccg tgaaatcacg 17041 ctgccatcct ccggtaccgc gctgataagc ctggttgacg gaagtggcaa tccggtcagc 17101 gtggaggttc agtccgtcac cgacggcgtg aaggtaaaag tgagccgtgt tcctgacggt 17161 gttgctgaat acagcgtatg ggagctgaag ctgccgacgc tgcgccagcg actgttccgc 17221 tgcgtgagta tccgtgagaa cgacgacggc acgtatgcca tcaccgccgt gcagcatgtg 17281 ccggaaaaag aggccatcgt ggataacggg gcgcactttg acggcgaaca gagtggcacg 17341 gtgaatggtg tcacgccgcc agcggtgcag cacctgaccg cagaagtcac tgcagacagc 17401 ggggaatatc aggtgctggc gcgatgggac acaccgaagg tggtgaaggg cgtgagtttc 17461 ctgctccgtc tgaccgtaac agcggacgac ggcagtgagc ggctggtcag cacggcccgg 17521 acgacggaaa ccacataccg cttcacgcaa ctggcgctgg ggaactacag gctgacagtc 17581 cgggcggtaa atgcgtgggg gcagcagggc gatccggcgt cggtatcgtt ccggattgcc 17641 gcaccggcag caccgtcgag gattgagctg acgccgggct attttcagat aaccgccacg 17701 ccgcatcttg ccgtttatga cccgacggta cagtttgagt tctggttctc ggaaaagcag 17761 attgcggata tcagacaggt tgaaaccagc acgcgttatc ttggtacggc gctgtactgg 17821 atagccgcca gtatcaatat caaaccgggc catgattatt acttttatat ccgcagtgtg 17881 aacaccgttg gcaaatcggc attcgtggag gccgtcggtc gggcgagcga tgatgcggaa 17941 ggttacctgg attttttcaa aggcaagata accgaatccc atctcggcaa ggagctgctg 18001 gaaaaagtcg agctgacgga ggataacgcc agcagactgg aggagttttc gaaagagtgg 18061 aaggatgcca gtgataagtg gaatgccatg tgggctgtca aaattgagca gaccaaagac 18121 ggcaaacatt atgtcgcggg tattggcctc agcatggagg acacggagga aggcaaactg 18181 agccagtttc tggttgccgc caatcgtatc gcatttattg acccggcaaa cgggaatgaa 18241 acgccgatgt ttgtggcgca gggcaaccag atattcatga acgacgtgtt cctgaagcgc 18301 ctgacggccc ccaccattac cagcggcggc aatcctccgg ccttttccct gacaccggac 18361 ggaaagctga ccgctaaaaa tgcggatatc agtggcagtg tgaatgcgaa ctccgggacg 18421 ctcagtaatg tgacgatagc tgaaaactgt acgataaacg gtacgctgag ggcggaaaaa 18481 atcgtcgggg acattgtaaa ggcggcgagc gcggcttttc cgcgccagcg tgaaagcagt 18541 gtggactggc cgtcaggtac ccgtactgtc accgtgaccg atgaccatcc ttttgatcgc 18601 cagatagtgg tgcttccgct gacgtttcgc ggaagtaagc gtactgtcag cggcaggaca 18661 acgtattcga tgtgttatct gaaagtactg atgaacggtg cggtgattta tgatggcgcg 18721 gcgaacgagg cggtacaggt gttctcccgt attgttgaca tgccagcggg tcggggaaac 18781 gtgatcctga cgttcacgct tacgtccaca cggcattcgg cagatattcc gccgtatacg 18841 tttgccagcg atgtgcaggt tatggtgatt aagaaacagg cgctgggcat cagcgtggtc 18901 tgagtgtgtt acagaggttc gtccgggaac gggcgtttta ttataaaaca gtgagaggtg 18961 aacgatgcgt aatgtgtgta ttgccgttgc tgtctttgcc gcacttgcgg tgacagtcac 19021 tccggcccgt gcggaaggtg gacatggtac gtttacggtg ggctattttc aagtgaaacc 19081 gggtacattg ccgtcgttgt cgggcgggga taccggtgtg agtcatctga aagggattaa 19141 cgtgaagtac cgttatgagc tgacggacag tgtgggggtg atggcttccc tggggttcgc 19201 cgcgtcgaaa aagagcagca cagtgatgac cggggaggat acgtttcact atgagagcct 19261 gcgtggacgt tatgtgagcg tgatggccgg accggtttta caaatcagta agcaggtcag 19321 tgcgtacgcc atggccggag tggctcacag tcggtggtcc ggcagtacaa tggattaccg 19381 taagacggaa atcactcccg ggtatatgaa agagacgacc actgccaggg acgaaagtgc 19441 aatgcggcat acctcagtgg cgtggagtgc aggtatacag attaatccgg cagcgtccgt 19501 cgttgttgat attgcttatg aaggctccgg cagtggcgac tggcgtactg acggattcat 19561 cgttggggtc ggttataaat tctgattagc caggtaacac agtgttatga cagcccgccg 19621 gaaccggtgg gcttttttgt ggggtgaata tggcagtaaa gatttcagga gtcctgaaag 19681 acggcacagg aaaaccggta cagaactgca ccattcagct gaaagccaga cgtaacagca 19741 ccacggtggt ggtgaacacg gtgggctcag agaatccgga tgaagccggg cgttacagca 19801 tggatgtgga gtacggtcag tacagtgtca tcctgcaggt tgacggtttt ccaccatcgc 19861 acgccgggac catcaccgtg tatgaagatt cacaaccggg gacgctgaat gattttctct 19921 gtgccatgac ggaggatgat gcccggccgg aggtgctgcg tcgtcttgaa ctgatggtgg 19981 aagaggtggc gcgtaacgcg tccgtggtgg cacagagtac ggcagacgcg aagaaatcag 20041 ccggcgatgc cagtgcatca gctgctcagg tcgcggccct tgtgactgat gcaactgact 20101 cagcacgcgc cgccagcacg tccgccggac aggctgcatc gtcagctcag gaagcgtcct 20161 ccggcgcaga agcggcatca gcaaaggcca ctgaagcgga aaaaagtgcc gcagccgcag 20221 agtcctcaaa aaacgcggcg gccaccagtg ccggtgcggc gaaaacgtca gaaacgaatg 20281 ctgcagcgtc acaacaatca gccgccacgt ctgcctccac cgcggccacg aaagcgtcag 20341 aggccgccac ttcagcacga gatgcggtgg cctcaaaaga ggcagcaaaa tcatcagaaa 20401 cgaacgcatc atcaagtgcc ggtcgtgcag cttcctcggc aacggcggca gaaaattctg 20461 ccagggcggc aaaaacgtcc gagacgaatg ccaggtcatc tgaaacagca gcggaacgga 20521 gcgcctctgc cgcggcagac gcaaaaacag cggcggcggg gagtgcgtca acggcatcca 20581 cgaaggcgac agaggctgcg ggaagtgcgg tatcagcatc gcagagcaaa agtgcggcag 20641 aagcggcggc aatacgtgca aaaaattcgg caaaacgtgc agaagatata gcttcagctg 20701 tcgcgcttga ggatgcggac acaacgagaa aggggatagt gcagctcagc agtgcaacca 20761 acagcacgtc tgaaacgctt gctgcaacgc caaaggcggt taaggtggta atggatgaaa 20821 cgaacagaaa agcccactgg acagtccggc actgaccgga acgccaacag caccaaccgc 20881 gctcagggga acaaacaata cccagattgc gaacaccgct tttgtactgg ccgcgattgc 20941 agatgttatc gacgcgtcac ctgacgcact gaatacgctg aatgaactgg ccgcagcgct 21001 cgggaatgat ccagattttg ctaccaccat gactaacgcg cttgcgggta aacaaccgaa 21061 gaatgcgaca ctgacggcgc tggcagggct ttccacggcg aaaaataaat taccgtattt 21121 tgcggaaaat gatgccgcca gcctgactga actgactcag gttggcaggg atattctggc 21181 aaaaaattcc gttgcagatg ttcttgaata ccttggggcc ggtgagaatt cggcctttcc 21241 ggcaggtgcg ccgatcccgt ggccatcaga tatcgttccg tctggctacg tcctgatgca 21301 ggggcaggcg tttgacaaat cagcctaccc aaaacttgct gtcgcgtatc catcgggtgt 21361 gcttcctgat atgcgaggct ggacaatcaa ggggaaaccc gccagcggtc gtgctgtatt 21421 gtctcaggaa caggatggaa ttaagtcgca cacccacagt gccagtgcat ccggtacgga 21481 tttggggacg aaaaccacat cgtcgtttga ttacgggacg aaaacaacag gcagtttcga 21541 ttacggcacc aaatcgacga ataacacggg ggctcatgct cacagtctga gcggttcaac 21601 aggggccgcg ggtgctcatg cccacacaag tggtttaagg atgaacagtt ctggctggag 21661 tcagtatgga acagcaacca ttacaggaag tttatccaca gttaaaggaa ccagcacaca 21721 gggtattgct tatttatcga aaacggacag tcagggcagc cacagtcact cattgtccgg 21781 tacagccgtg agtgccggtg cacatgcgca tacagttggt attggtgcgc accagcatcc 21841 ggttgttatc ggtgctcatg cccattcttt cagtattggt tcacacggac acaccatcac 21901 cgttaacgct gcgggtaacg cggaaaacac cgtcaaaaac attgcattta actatattgt 21961 gaggcttgca taatggcatt cagaatgagt gaacaaccac ggaccataaa aatttataat 22021 ctgctggccg gaactaatga atttattggt gaaggtgacg catatattcc gcctcatacc 22081 ggtctgcctg caaacagtac cgatattgca ccgccagata ttccggctgg ctttgtggct 22141 gttttcaaca gtgatgaggc atcgtggcat ctcgttgaag accatcgggg taaaaccgtc 22201 tatgacgtgg cttccggcga cgcgttattt atttctgaac tcggtccgtt accggaaaat 22261 tttacctggt tatcgccggg aggggaatat cagaagtgga acggcacagc ctgggtgaag 22321 gatacggaag cagaaaaact gttccggatc cgggaggcgg aagaaacaaa aaaaagcctg 22381 atgcaggtag ccagtgagca tattgcgccg cttcaggatg ctgcagatct ggaaattgca 22441 acgaaggaag aaacctcgtt gctggaagcc tggaagaagt atcgggtgtt gctgaaccgt 22501 gttgatacat caactgcacc tgatattgag tggcctgctg tccctgttat ggagtaatcg 22561 ttttgtgata tgccgcagaa acgttgtatg aaataacgtt ctgcggttag ttagtatatt 22621 gtaaagctga gtattggttt atttggcgat tattatcttc aggagaataa tggaagttct 22681 atgactcaat tgttcatagt gtttacatca ccgccaattg cttttaagac tgaacgcatg 22741 aaatatggtt tttcgtcatg ttttgagtct gctgttgata tttctaaagt cggttttttt 22801 tcttcgtttt ctctaactat tttccatgaa atacattttt gattattatt tgaatcaatt 22861 ccaattacct gaagtctttc atctataatt ggcattgtat gtattggttt attggagtag 22921 atgcttgctt ttctgagcca tagctctgat atccaaatga agccataggc atttgttatt 22981 ttggctctgt cagctgcata acgccaaaaa atatatttat ctgcttgatc ttcaaatgtt 23041 gtattgatta aatcaattgg atggaattgt ttatcataaa aaattaatgt ttgaatgtga 23101 taaccgtcct ttaaaaaagt cgtttctgca agcttggctg tatagtcaac taactcttct 23161 gtcgaagtga tatttttagg cttatctacc agttttagac gctctttaat atcttcagga 23221 attattttat tgtcatattg tatcatgcta aatgacaatt tgcttatgga gtaatctttt 23281 aattttaaat aagttattct cctggcttca tcaaataaag agtcgaatga tgttggcgaa 23341 atcacatcgt cacccattgg attgtttatt tgtatgccaa gagagttaca gcagttatac 23401 attctgccat agattatagc taaggcatgt aataattcgt aatcttttag cgtattagcg 23461 acccatcgtc tttctgattt aataatagat gattcagtta aatatgaagg taatttcttt 23521 tgtgcaagtc tgactaactt ttttatacca atgtttaaca tactttcatt tgtaataaac 23581 tcaatgtcat tttcttcaat gtaagatgaa ataagagtag cctttgcctc gctatacatt 23641 tctaaatcgc cttgtttttc tatcgtattg cgagaatttt tagcccaagc cattaatgga 23701 tcatttttcc atttttcaat aacattattg ttataccaaa tgtcatatcc tataatctgg 23761 tttttgtttt tttgaataat aaatgttact gttcttgcgg tttggaggaa ttgattcaaa 23821 ttcaagcgaa ataattcagg gtcaaaatat gtatcaatgc agcatttgag caagtgcgat 23881 aaatctttaa gtcttctttc ccatggtttt ttagtcataa aactctccat tttgataggt 23941 tgcatgctag atgctgatat attttagagg tgataaaatt aactgcttaa ctgtcaatgt 24001 aatacaagtt gtttgatctt tgcaatgatt cttatcagaa accatatagt aaattagtta 24061 cacaggaaat ttttaatatt attattatca ttcattatgt attaaaatta gagttgtggc 24121 ttggctctgc taacacgttg ctcataggag atatggtaga gccgcagaca cgtcgtatgc 24181 aggaacgtgc tgcggctggc tggtgaactt ccgatagtgc gggtgttgaa tgatttccag 24241 ttgctaccga ttttacatat tttttgcatg agagaatttg taccacctcc caccgaccat 24301 ctatgactgt acgccactgt ccctaggact gctatgtgcc ggagcggaca ttacaaacgt 24361 ccttctcggt gcatgccact gttgccaatg acctgcctag gaattggtta gcaagttact 24421 accggatttt gtaaaaacag ccctcctcat ataaaaagta ttcgttcact tccgataagc 24481 gtcgtaattt tctatctttc atcatattct agatccctct gaaaaaatct tccgagtttg 24541 ctaggcactg atacataact cttttccaat aattggggaa gtcattcaaa tctataatag 24601 gtttcagatt tgcttcaata aattctgact gtagctgctg aaacgttgcg gttgaactat 24661 atttccttat aacttttacg aaagagtttc tttgagtaat cacttcactc aagtgcttcc 24721 ctgcctccaa acgatacctg ttagcaatat ttaatagctt gaaatgatga agagctctgt 24781 gtttgtcttc ctgcctccag ttcgccgggc attcaacata aaaactgata gcacccggag 24841 ttccggaaac gaaatttgca tatacccatt gctcacgaaa aaaaatgtcc ttgtcgatat 24901 agggatgaat cgcttggtgt acctcatcta ctgcgaaaac ttgacctttc tctcccatat 24961 tgcagtcgcg gcacgatgga actaaattaa taggcatcac cgaaaattca ggataatgtg 25021 caataggaag aaaatgatct atattttttg tctgtcctat atcaccacaa aatggacatt 25081 tttcacctga tgaaacaagc atgtcatcgt aatatgttct agcgggtttg tttttatctc 25141 ggagattatt ttcataaagc ttttctaatt taacctttgt caggttacca actactaagg 25201 ttgtaggctc aagagggtgt gtcctgtcgt aggtaaataa ctgacctgtc gagcttaata 25261 ttctatattg ttgttctttc tgcaaaaaag tggggaagtg agtaatgaaa ttatttctaa 25321 catttatctg catcatacct tccgagcatt tattaagcat ttcgctataa gttctcgctg 25381 gaagaggtag ttttttcatt gtactttacc ttcatctctg ttcattatca tcgcttttaa 25441 aacggttcga ccttctaatc ctatctgacc attataattt tttagaatgg tttcataaga 25501 aagctctgaa tcaacggact gcgataataa gtggtggtat ccagaatttg tcacttcaag 25561 taaaaacacc tcacgagtta aaacacctaa gttctcaccg aatgtctcaa tatccggacg 25621 gataatattt attgcttctc ttgaccgtag gactttccac atgcaggatt ttggaacctc 25681 ttgcagtact actggggaat gagttgcaat tattgctaca ccattgcgtg catcgagtaa 25741 gtcgcttaat gttcgtaaaa aagcagagag caaaggtgga tgcagatgaa cctctggttc 25801 atcgaataaa actaatgact tttcgccaac gacatctact aatcttgtga tagtaaataa 25861 aacaattgca tgtccagagc tcattcgaag cagatatttc tggatattgt cataaaacaa 25921 tttagtgaat ttatcatcgt ccacttgaat ctgtggttca ttacgtctta actcttcata 25981 tttagaaatg aggctgatga gttccatatt tgaaaagttt tcatcactac ttagtttttt 26041 gatagcttca agccagagtt gtctttttct atctactctc atacaaccaa taaatgctga 26101 aatgaattct aagcggagat cgcctagtga ttttaaacta ttgctggcag cattcttgag 26161 tccaatataa aagtattgtg taccttttgc tgggtcaggt tgttctttag gaggagtaaa 26221 aggatcaaat gcactaaacg aaactgaaac aagcgatcga aaatatccct ttgggattct 26281 tgactcgata agtctattat tttcagagaa aaaatattca ttgttttctg ggttggtgat 26341 tgcaccaatc attccattca aaattgttgt tttaccacac ccattccgcc cgataaaagc 26401 atgaatgttc gtgctgggca tagaattaac cgtcacctca aaaggtatag ttaaatcact 26461 gaatccggga gcactttttc tattaaatga aaagtggaaa tctgacaatt ctggcaaacc 26521 atttaacaca cgtgcgaact gtccatgaat ttctgaaaga gttacccctc taagtaatga 26581 ggtgttaagg acgctttcat tttcaatgtc ggctaatcga tttggccata ctactaaatc 26641 ctgaatagct ttaagaaggt tatgtttaaa accatcgctt aatttgctga gattaacata 26701 gtagtcaatg ctttcaccta aggaaaaaaa catttcaggg agttgactga attttttatc 26761 tattaatgaa taagtgctta cttcttcttt ttgacctaca aaaccaattt taacatttcc 26821 gatatcgcat ttttcaccat gctcatcaaa gacagtaaga taaaacattg taacaaagga 26881 atagtcattc caaccatctg ctcgtaggaa tgccttattt ttttctactg caggaatata 26941 cccgcctctt tcaataacac taaactccaa catatagtaa cccttaattt tattaaaata 27001 accgcaattt atttggcggc aacacaggat ctctctttta agttactctc tattacatac 27061 gttttccatc taaaaattag tagtattgaa cttaacgggg catcgtattg tagttttcca 27121 tatttagctt tctgcttcct tttggataac ccactgttat tcatgttgca tggtgcactg 27181 tttataccaa cgatatagtc tattaatgca tatatagtat cgccgaacga ttagctcttc 27241 aggcttctga agaagcgttt caagtactaa taagccgata gatagccacg gacttcgtag 27301 ccatttttca taagtgttaa cttccgctcc tcgctcataa cagacattca ctacagttat 27361 ggcggaaagg tatgcatgct gggtgtgggg aagtcgtgaa agaaaagaag tcagctgcgt 27421 cgtttgacat cactgctatc ttcttactgg ttatgcaggt cgtagtgggt ggcacacaaa 27481 gctttgcact ggattgcgag gctttgtgct tctctggagt gcgacaggtt tgatgacaaa 27541 aaattagcgc aagaagacaa aaatcacctt gcgctaatgc tctgttacag gtcactaata 27601 ccatctaagt agttgattca tagtgactgc atatgttgtg ttttacagta ttatgtagtc 27661 tgttttttat gcaaaatcta atttaatata ttgatattta tatcatttta cgtttctcgt 27721 tcagcttttt tatactaagt tggcattata aaaaagcatt gcttatcaat ttgttgcaac 27781 gaacaggtca ctatcagtca aaataaaatc attatttgat ttcaattttg tcccactccc 27841 tgcctctgtc atcacgatac tgtgatgcca tggtgtccga cttatgcccg agaagatgtt 27901 gagcaaactt atcgcttatc tgcttctcat agagtcttgc agacaaactg cgcaactcgt 27961 gaaaggtagg cggatcccct tcgaaggaaa gacctgatgc ttttcgtgcg cgcataaaat 28021 accttgatac tgtgccggat gaaagcggtt cgcgacgagt agatgcaatt atggtttctc 28081 cgccaagaat ctctttgcat ttatcaagtg tttccttcat tgatattccg agagcatcaa 28141 tatgcaatgc tgttgggatg gcaattttta cgcctgtttt gctttgctcg acataaagat 28201 atccatctac gatatcagac cacttcattt cgcataaatc accaactcgt tgcccggtaa 28261 caacagccag ttccattgca agtctgagcc aacatggtga tgattctgct gcttgataaa 28321 ttttcaggta ttcgtcagcc gtaagtcttg atctccttac ctctgatttt gctgcgcgag 28381 tggcagcgac atggtttgtt gttatatggc cttcagctat tgcctctcgg aatgcatcgc 28441 tcagtgttga tctgattaac ttggctgacg ccgccttgcc ctcgtctatg tatccattga 28501 gcattgccgc aatttctttt gtggtgatgt cttcaagtgg agcatcaggc agacccctcc 28561 ttattgcttt aattttgctc atgtaattta tgagtgtctt ctgcttgatt cctctgctgg 28621 ccaggatttt ttcgtagcga tcaagccatg aatgtaacgt aacggaatta tcactgttga 28681 ttctcgctgt cagaggcttg tgtttgtgtc ctgaaaataa ctcaatgttg gcctgtatag 28741 cttcagtgat tgcgattcgc ctgtctctgc ctaatccaaa ctctttaccc gtccttgggt 28801 ccctgtagca gtaatatcca ttgtttctta tataaaggtt agggggtaaa tcccggcgct 28861 catgacttcg ccttcttccc atttctgatc ctcttcaaaa ggccacctgt tactggtcga 28921 tttaagtcaa cctttaccgc tgattcgtgg aacagatact ctcttccatc cttaaccgga 28981 ggtgggaata tcctgcattc ccgaacccat cgacgaactg tttcaaggct tcttggacgt 29041 cgctggcgtg cgttccactc ctgaagtgtc aagtacatcg caaagtctcc gcaattacac 29101 gcaagaaaaa accgccatca ggcggcttgg tgttctttca gttcttcaat tcgaatattg 29161 gttacgtctg catgtgctat ctgcgcccat atcatccagt ggtcgtagca gtcgttgatg 29221 ttctccgctt cgataactct gttgaatggc tctccattcc attctcctgt gactcggaag 29281 tgcatttatc atctccataa aacaaaaccc gccgtagcga gttcagataa aataaatccc 29341 cgcgagtgcg aggattgtta tgtaatattg ggtttaatca tctatatgtt ttgtacagag 29401 agggcaagta tcgtttccac cgtactcgtg ataataattt tgcacggtat cagtcatttc 29461 tcgcacattg cagaatgggg atttgtcttc attagactta taaaccttca tggaatattt 29521 gtatgccgac tctatatcta taccttcatc tacataaaca ccttcgtgat gtctgcatgg 29581 agacaagaca ccggatctgc acaacattga taacgcccaa tctttttgct cagactctaa 29641 ctcattgata ctcatttata aactccttgc aatgtatgtc gtttcagcta aacggtatca 29701 gcaatgttta tgtaaagaaa cagtaagata atactcaacc cgatgtttga gtacggtcat 29761 catctgacac tacagactct ggcatcgctg tgaagacgac gcgaaattca gcattttcac 29821 aagcgttatc ttttacaaaa ccgatctcac tctcctttga tgcgaatgcc agcgtcagac 29881 atcatatgca gatactcacc tgcatcctga acccattgac ctccaacccc gtaatagcga 29941 tgcgtaatga tgtcgatagt tactaacggg tcttgttcga ttaactgccg cagaaactct 30001 tccaggtcac cagtgcagtg cttgataaca ggagtcttcc caggatggcg aacaacaaga 30061 aactggtttc cgtcttcacg gacttcgttg ctttccagtt tagcaatacg cttactccca 30121 tccgagataa caccttcgta atactcacgc tgctcgttga gttttgattt tgctgtttca 30181 agctcaacac gcagtttccc tactgttagc gcaatatcct cgttctcctg gtcgcggcgt 30241 ttgatgtatt gctggtttct ttcccgttca tccagcagtt ccagcacaat cgatggtgtt 30301 accaattcat ggaaaaggtc tgcgtcaaat ccccagtcgt catgcattgc ctgctctgcc 30361 gcttcacgca gtgcctgaga gttaatttcg ctcacttcga acctctctgt ttactgataa 30421 gttccagatc ctcctggcaa cttgcacaag tccgacaacc ctgaacgacc aggcgtcttc 30481 gttcatctat cggatcgcca cactcacaac aatgagtggc agatatagcc tggtggttca 30541 ggcggcgcat ttttattgct gtgttgcgct gtaattcttc tatttctgat gctgaatcaa 30601 tgatgtctgc catctttcat taatccctga actgttggtt aatacgcttg agggtgaatg 30661 cgaataataa aaaaggagcc tgtagctccc tgatgatttt gcttttcatg ttcatcgttc 30721 cttaaagacg ccgtttaaca tgccgattgc caggcttaaa tgagtcggtg tgaatcccat 30781 cagcgttacc gtttcgcggt gcttcttcag tacgctacgg caaatgtcat cgacgttttt 30841 atccggaaac tgctgtctgg ctttttttga tttcagaatt agcctgacgg gcaatgctgc 30901 gaagggcgtt ttcctgctga ggtgtcattg aacaagtccc atgtcggcaa gcataagcac 30961 acagaatatg aagcccgctg ccagaaaaat gcattccgtg gttgtcatac ctggtttctc 31021 tcatctgctt ctgctttcgc caccatcatt tccagctttt gtgaaaggga tgcggctaac 31081 gtatgaaatt cttcgtctgt ttctactggt attggcacaa acctgattcc aatttgagca 31141 aggctatgtg ccatctcgat actcgttctt aactcaacag aagatgcttt gtgcatacag 31201 cccctcgttt attatttatc tcctcagcca gccgctgtgc tttcagtgga tttcggataa 31261 cagaaaggcc gggaaatacc cagcctcgct ttgtaacgga gtagacgaaa gtgattgcgc 31321 ctacccggat attatcgtga ggatgcgtca tcgccattgc tccccaaata caaaaccaat 31381 ttcagccagt gcctcgtcca ttttttcgat gaactccggc acgatctcgt caaaactcgc 31441 catgtacttt tcatcccgct caatcacgac ataatgcagg ccttcacgct tcatacgcgg 31501 gtcatagttg gcaaagtacc aggcattttt tcgcgtcacc cacatgctgt actgcacctg 31561 ggccatgtaa gctgacttta tggcctcgaa accaccgagc cggaacttca tgaaatcccg 31621 ggaggtaaac gggcatttca gttcaaggcc gttgccgtca ctgcataaac catcgggaga 31681 gcaggcggta cgcatacttt cgtcgcgata gatgatcggg gattcagtaa cattcacgcc 31741 ggaagtgaat tcaaacaggg ttctggcgtc gttctcgtac tgttttcccc aggccagtgc 31801 tttagcgtta acttccggag ccacaccggt gcaaacctca gcaagcaggg tgtggaagta 31861 ggacattttc atgtcaggcc acttctttcc ggagcggggt tttgctatca cgttgtgaac 31921 ttctgaagcg gtgatgacgc cgagccgtaa tttgtgccac gcatcatccc cctgttcgac 31981 agctctcaca tcgatcccgg tacgctgcag gataatgtcc ggtgtcatgc tgccaccttc 32041 tgctctgcgg ctttctgttt caggaatcca agagctttta ctgcttcggc ctgtgtcagt 32101 tctgacgatg cacgaatgtc gcggcgaaat atctgggaac agagcggcaa taagtcgtca 32161 tcccatgttt tatccagggc gatcagcaga gtgttaatct cctgcatggt ttcatcgtta 32221 accggagtga tgtcgcgttc cggctgacgt tctgcagtgt atgcagtatt ttcgacaatg 32281 cgctcggctt catccttgtc atagatacca gcaaatccga aggccagacg ggcacactga 32341 atcatggctt tatgacgtaa catccgtttg ggatgcgact gccacggccc cgtgatttct 32401 ctgccttcgc gagttttgaa tggttcgcgg cggcattcat ccatccattc ggtaacgcag 32461 atcggatgat tacggtcctt gcggtaaatc cggcatgtac aggattcatt gtcctgctca 32521 aagtccatgc catcaaactg ctggttttca ttgatgatgc gggaccagcc atcaacgccc 32581 accaccggaa cgatgccatt ctgcttatca ggaaaggcgt aaatttcttt cgtccacgga 32641 ttaaggccgt actggttggc aacgatcagt aatgcgatga actgcgcatc gctggcatca 32701 cctttaaatg ccgtctggcg aagagtggtg atcagttcct gtgggtcgac agaatccatg 32761 ccgacacgtt cagccagctt cccagccagc gttgcgagtg cagtactcat tcgttttata 32821 cctctgaatc aatatcaacc tggtggtgag caatggtttc aaccatgtac cggatgtgtt 32881 ctgccatgcg ctcctgaaac tcaacatcgt catcaaacgc acgggtaatg gattttttgc 32941 tggccccgtg gcgttgcaaa tgatcgatgc atagcgattc aaacaggtgc tggggcaggc 33001 ctttttccat gtcgtctgcc agttctgcct ctttctcttc acgggcgagc tgctggtagt 33061 gacgcgccca gctctgagcc tcaagacgat cctgaatgta ataagcgttc atggctgaac 33121 tcctgaaata gctgtgaaaa tatcgcccgc gaaatgccgg gctgattagg aaaacaggaa 33181 agggggttag tgaatgcttt tgcttgatct cagtttcagt attaatatcc attttttata 33241 agcgtcgacg gcttcacgaa acatcttttc atcgccaata aaagtggcga tagtgaattt 33301 agtctggata gccataagtg tttgatccat tctttgggac tcctggctga ttaagtatgt 33361 cgataaggcg tttccatccg tcacgtaatt tacgggtgat tcgttcaagt aaagattcgg 33421 aagggcagcc agcaacaggc caccctgcaa tggcatattg catggtgtgc tccttattta 33481 tacataacga aaaacgcctc gagtgaagcg ttattggtat gcggtaaaac cgcactcagg 33541 cggccttgat agtcatatca tctgaatcaa atattcctga tgtatcgata tcggtaattc 33601 ttattccttc gctaccatcc attggaggcc atccttcctg accatttcca tcattccagt 33661 cgaactcaca cacaacacca tatgcattta agtcgcttga aattgctata agcagagcat 33721 gttgcgccag catgattaat acagcattta atacagagcc gtgtttattg agtcggtatt 33781 cagagtctga ccagaaatta ttaatctggt gaagtttttc ctctgtcatt acgtcatggt 33841 cgatttcaat ttctattgat gctttccagt cgtaatcaat gatgtatttt ttgatgtttg 33901 acatctgttc atatcctcac agataaaaaa tcgccctcac actggagggc aaagaagatt 33961 tccaataatc agaacaagtc ggctcctgtt tagttacgag cgacattgct ccgtgtattc 34021 actcgttgga atgaatacac agtgcagtgt ttattctgtt atttatgcca aaaataaagg 34081 ccactatcag gcagctttgt tgttctgttt accaagttct ctggcaatca ttgccgtcgt 34141 tcgtattgcc catttatcga catatttccc atcttccatt acaggaaaca tttcttcagg 34201 cttaaccatg cattccgatt gcagcttgca tccattgcat cgcttgaatt gtccacacca 34261 ttgattttta tcaatagtcg tagtcatacg gatagtcctg gtattgttcc atcacatcct 34321 gaggatgctc ttcgaactct tcaaattctt cttccatata tcaccttaaa tagtggattg 34381 cggtagtaaa gattgtgcct gtcttttaac cacatcaggc tcggtggttc tcgtgtaccc 34441 ctacagcgag aaatcggata aactattaca acccctacag tttgatgagt atagaaatgg 34501 atccactcgt tattctcgga cgagtgttca gtaatgaacc tctggagaga accatgtata 34561 tgatcgttat ctgggttgga cttctgcttt taagcccaga taactggcct gaatatgtta 34621 atgagagaat cggtattcct catgtgtggc atgttttcgt ctttgctctt gcattttcgc 34681 tagcaattaa tgtgcatcga ttatcagcta ttgccagcgc cagatataag cgatttaagc 34741 taagaaaacg cattaagatg caaaacgata aagtgcgatc agtaattcaa aaccttacag 34801 aagagcaatc tatggttttg tgcgcagccc ttaatgaagg caggaagtat gtggttacat 34861 caaaacaatt cccatacatt agtgagttga ttgagcttgg tgtgttgaac aaaacttttt 34921 cccgatggaa tggaaagcat atattattcc ctattgagga tatttactgg actgaattag 34981 ttgccagcta tgatccatat aatattgaga taaagccaag gccaatatct aagtaactag 35041 ataagaggaa tcgattttcc cttaattttc tggcgtccac tgcatgttat gccgcgttcg 35101 ccaggcttgc tgtaccatgt gcgctgattc ttgcgctcaa tacgttgcag gttgctttca 35161 atctgtttgt ggtattcagc cagcactgta aggtctatcg gatttagtgc gctttctact 35221 cgtgatttcg gtttgcgatt cagcgagaga atagggcggt taactggttt tgcgcttacc 35281 ccaaccaaca ggggatttgc tgctttccat tgagcctgtt tctctgcgcg acgttcgcgg 35341 cggcgtgttt gtgcatccat ctggattctc ctgtcagtta gctttggtgg tgtgtggcag 35401 ttgtagtcct gaacgaaaac cccccgcgat tggcacattg gcagctaatc cggaatcgca 35461 cttacggcca atgcttcgtt tcgtatcaca caccccaaag ccttctgctt tgaatgctgc 35521 ccttcttcag ggcttaattt ttaagagcgt caccttcatg gtggtcagtg cgtcctgctg 35581 atgtgctcag tatcaccgcc agtggtattt atgtcaacac cgccagagat aatttatcac 35641 cgcagatggt tatctgtatg ttttttatat gaatttattt tttgcagggg ggcattgttt 35701 ggtaggtgag agatctgaat tgctatgttt agtgagttgt atctatttat ttttcaataa 35761 atacaattgg ttatgtgttt tgggggcgat cgtgaggcaa agaaaacccg gcgctgaggc 35821 cgggttattc ttgttctctg gtcaaattat atagttggaa aacaaggatg catatatgaa 35881 tgaacgatgc agaggcaatg ccgatggcga tagtgggtat catgtagccg cttatgctgg 35941 aaagaagcaa taacccgcag aaaaacaaag ctccaagctc aacaaaacta agggcataga 36001 caataactac cgatgtcata tacccatact ctctaatctt ggccagtcgg cgcgttctgc 36061 ttccgattag aaacgtcaag gcagcaatca ggattgcaat catggttcct gcatatgatg 36121 acaatgtcgc cccaagacca tctctatgag ctgaaaaaga aacaccagga atgtagtggc 36181 ggaaaaggag atagcaaatg cttacgataa cgtaaggaat tattactatg taaacaccag 36241 gcatgattct gttccgcata attactcctg ataattaatc cttaactttg cccacctgcc 36301 ttttaaaaca ttccagtata tcacttttca ttcttgcgta gcaatatgcc atctcttcag 36361 ctatctcagc attggtgacc ttgttcagag gcgctgagag atggcctttt tctgatagat 36421 aatgttctgt taaaatatct ccggcctcat cttttgcccg caggctaatg tctgaaaatt 36481 gaggtgacgg gttaaaaata atatccttgg caaccttttt tatatccctt ttaaattttg 36541 gcttaatgac tatatccaat gagtcaaaaa gctccccttc aatatctgtt gcccctaaga 36601 cctttaatat atcgccaaat acaggtagct tggcttctac cttcaccgtt gttcggccga 36661 tgaaatgcat atgcataaca tcgtctttgg tggttcccct catcagtggc tctatctgaa 36721 cgcgctctcc actgcttaat gacattcctt tcccgattaa aaaatctgtc agatcggatg 36781 tggtcggccc gaaaacagtt ctggcaaaac caatggtgtc gccttcaaca aacaaaaaag 36841 atgggaatcc caatgattcg tcatctgcga ggctgttctt aatatcttca actgaagctt 36901 tagagcgatt tatcttctga accagactct tgtcatttgt tttggtaaag agaaaagttt 36961 ttccatcgat tttatgaata tacaaataat tggagccaac ctgcaggtga tgattatcag 37021 ccagcagaga attaaggaaa acagacaggt ttattgagcg cttatctttc cctttatttt 37081 tgctgcggta agtcgcataa aaaccattct tcataattca atccatttac tatgttatgt 37141 tctgagggga gtgaaaattc ccctaattcg atgaagattc ttgctcaatt gttatcagct 37201 atgcgccgac cagaacacct tgccgatcag ccaaacgtct cttcaggcca ctgactagcg 37261 ataactttcc ccacaacgga acaactctca ttgcatggga tcattgggta ctgtgggttt 37321 agtggttgta aaaacacctg accgctatcc ctgatcagtt tcttgaaggt aaactcatca 37381 cccccaagtc tggctatgca gaaatcacct ggctcaacag cctgctcagg gtcaacgaga 37441 attaacattc cgtcaggaaa gcttggcttg gagcctgttg gtgcggtcat ggaattacct 37501 tcaacctcaa gccagaatgc agaatcactg gcttttttgg ttgtgcttac ccatctctcc 37561 gcatcacctt tggtaaaggt tctaagctca ggtgagaaca tccctgcctg aacatgagaa 37621 aaaacagggt actcatactc acttctaagt gacggctgca tactaaccgc ttcatacatc 37681 tcgtagattt ctctggcgat tgaagggcta aattcttcaa cgctaacttt gagaattttt 37741 gcaagcaatg cggcgttata agcatttaat gcattgatgc cattaaataa agcaccaacg 37801 cctgactgcc ccatccccat cttgtctgcg acagattcct gggataagcc aagttcattt 37861 ttcttttttt cataaattgc tttaaggcga cgtgcgtcct caagctgctc ttgtgttaat 37921 ggtttctttt ttgtgctcat acgttaaatc tatcaccgca agggataaat atctaacacc 37981 gtgcgtgttg actattttac ctctggcggt gataatggtt gcatgtacta aggaggttgt 38041 atggaacaac gcataaccct gaaagattat gcaatgcgct ttgggcaaac caagacagct 38101 aaagatctcg gcgtatatca aagcgcgatc aacaaggcca ttcatgcagg ccgaaagatt 38161 tttttaacta taaacgctga tggaagcgtt tatgcggaag aggtaaagcc cttcccgagt 38221 aacaaaaaaa caacagcata aataaccccg ctcttacaca ttccagccct gaaaaagggc 38281 atcaaattaa accacaccta tggtgtatgc atttatttgc atacattcaa tcaattgtta 38341 tctaaggaaa tacttacata tggttcgtgc aaacaaacgc aacgaggctc tacgaatcga 38401 gagtgcgttg cttaacaaaa tcgcaatgct tggaactgag aagacagcgg aagctgtggg 38461 cgttgataag tcgcagatca gcaggtggaa gagggactgg attccaaagt tctcaatgct 38521 gcttgctgtt cttgaatggg gggtcgttga cgacgacatg gctcgattgg cgcgacaagt 38581 tgctgcgatt ctcaccaata aaaaacgccc ggcggcaacc gagcgttctg aacaaatcca 38641 gatggagttc tgaggtcatt actggatcta tcaacaggag tcattatgac aaatacagca 38701 aaaatactca acttcggcag aggtaacttt gccggacagg agcgtaatgt ggcagatctc 38761 gatgatggtt acgccagact atcaaatatg ctgcttgagg cttattcggg cgcagatctg 38821 accaagcgac agtttaaagt gctgcttgcc attctgcgta aaacctatgg gtggaataaa 38881 ccaatggaca gaatcaccga ttctcaactt agcgagatta caaagttacc tgtcaaacgg 38941 tgcaatgaag ccaagttaga actcgtcaga atgaatatta tcaagcagca aggcggcatg 39001 tttggaccaa ataaaaacat ctcagaatgg tgcatccctc aaaacgaggg aaaatcccct 39061 aaaacgaggg ataaaacatc cctcaaattg ggggattgct atccctcaaa acagggggac 39121 acaaaagaca ctattacaaa agaaaaaaga aaagattatt cgtcagagaa ttctggcgaa 39181 tcctctgacc agccagaaaa cgacctttct gtggtgaaac cggatgctgc aattcagagc 39241 ggcagcaagt gggggacagc agaagacctg accgccgcag agtggatgtt tgacatggtg 39301 aagactatcg caccatcagc cagaaaaccg aattttgctg ggtgggctaa cgatatccgc 39361 ctgatgcgtg aacgtgacgg acgtaaccac cgcgacatgt gtgtgctgtt ccgctgggca 39421 tgccaggaca acttctggtc cggtaacgtg ctgagcccgg ccaaactccg cgataagtgg 39481 acccaactcg aaatcaaccg taacaagcaa caggcaggcg tgacagccag caaaccaaaa 39541 ctcgacctga caaacacaga ctggatttac ggggtggatc tatgaaaaac atcgccgcac 39601 agatggttaa ctttgaccgt gagcagatgc gtcggatcgc caacaacatg ccggaacagt 39661 acgacgaaaa gccgcaggta cagcaggtag cgcagatcat caacggtgtg ttcagccagt 39721 tactggcaac tttcccggcg agcctggcta accgtgacca gaacgaagtg aacgaaatcc 39781 gtcgccagtg ggttctggct tttcgggaaa acgggatcac cacgatggaa caggttaacg 39841 caggaatgcg cgtagcccgt cggcagaatc gaccatttct gccatcaccc gggcagtttg 39901 ttgcatggtg ccgggaagaa gcatccgtta ccgccggact gccaaacgtc agcgagctgg 39961 ttgatatggt ttacgagtat tgccggaagc gaggcctgta tccggatgcg gagtcttatc 40021 cgtggaaatc aaacgcgcac tactggctgg ttaccaacct gtatcagaac atgcgggcca 40081 atgcgcttac tgatgcggaa ttacgccgta aggccgcaga tgagcttgtc catatgactg 40141 cgagaattaa ccgtggtgag gcgatccctg aaccagtaaa acaacttcct gtcatgggcg 40201 gtagacctct aaatcgtgca caggctctgg cgaagatcgc agaaatcaaa gctaagttcg 40261 gactgaaagg agcaagtgta tgacgggcaa agaggcaatt attcattacc tggggacgca 40321 taatagcttc tgtgcgccgg acgttgccgc gctaacaggc gcaacagtaa ccagcataaa 40381 tcaggccgcg gctaaaatgg cacgggcagg tcttctggtt atcgaaggta aggtctggcg 40441 aacggtgtat taccggtttg ctaccaggga agaacgggaa ggaaagatga gcacgaacct 40501 ggtttttaag gagtgtcgcc agagtgccgc gatgaaacgg gtattggcgg tatatggagt 40561 taaaagatga ccatctacat tactgagcta ataacaggcc tgctggtaat cgcaggcctt 40621 tttatttggg ggagagggaa gtcatgaaaa aactaacctt tgaaattcga tctccagcac 40681 atcagcaaaa cgctattcac gcagtacagc aaatccttcc agacccaacc aaaccaatcg 40741 tagtaaccat tcaggaacgc aaccgcagct tagaccaaaa caggaagcta tgggcctgct 40801 taggtgacgt ctctcgtcag gttgaatggc atggtcgctg gctggatgca gaaagctgga 40861 agtgtgtgtt taccgcagca ttaaagcagc aggatgttgt tcctaacctt gccgggaatg 40921 gctttgtggt aataggccag tcaaccagca ggatgcgtgt aggcgaattt gcggagctat 40981 tagagcttat acaggcattc ggtacagagc gtggcgttaa gtggtcagac gaagcgagac 41041 tggctctgga gtggaaagcg agatggggag acagggctgc atgataaatg tcgttagttt 41101 ctccggtggc aggacgtcag catatttgct ctggctaatg gagcaaaagc gacgggcagg 41161 taaagacgtg cattacgttt tcatggatac aggttgtgaa catccaatga catatcggtt 41221 tgtcagggaa gttgtgaagt tctgggatat accgctcacc gtattgcagg ttgatatcaa 41281 cccggagctt ggacagccaa atggttatac ggtatgggaa ccaaaggata ttcagacgcg 41341 aatgcctgtt ctgaagccat ttatcgatat ggtaaagaaa tatggcactc catacgtcgg 41401 cggcgcgttc tgcactgaca gattaaaact cgttcccttc accaaatact gtgatgacca 41461 tttcgggcga gggaattaca ccacgtggat tggcatcaga gctgatgaac cgaagcggct 41521 aaagccaaag cctggaatca gatatcttgc tgaactgtca gactttgaga aggaagatat 41581 cctcgcatgg tggaagcaac aaccattcga tttgcaaata ccggaacatc tcggtaactg 41641 catattctgc attaaaaaat caacgcaaaa aatcggactt gcctgcaaag atgaggaggg 41701 attgcagcgt gtttttaatg aggtcatcac gggatcccat gtgcgtgacg gacatcggga 41761 aacgccaaag gagattatgt accgaggaag aatgtcgctg gacggtatcg cgaaaatgta 41821 ttcagaaaat gattatcaag ccctgtatca ggacatggta cgagctaaaa gattcgatac 41881 cggctcttgt tctgagtcat gcgaaatatt tggagggcag cttgatttcg acttcgggag 41941 ggaagctgca tgatgcgatg ttatcggtgc ggtgaatgca aagaagataa ccgcttccga 42001 ccaaatcaac cttactggaa tcgatggtgt ctccggtgtg aaagaacacc aacaggggtg 42061 ttaccactac cgcaggaaaa ggaggacgtg tggcgagaca gcgacgaagt atcaccgaca 42121 taatctgcga aaactgcaaa taccttccaa cgaaacgcac cagaaataaa cccaagccaa 42181 tcccaaaaga atctgacgta aaaaccttca actacacggc tcacctgtgg gatatccggt 42241 ggctaagacg tcgtgcgagg aaaacaaggt gattgaccaa aatcgaagtt acgaacaaga 42301 aagcgtcgag cgagctttaa cgtgcgctaa ctgcggtcag aagctgcatg tgctggaagt 42361 tcacgtgtgt gagcactgct gcgcagaact gatgagcgat ccgaatagct cgatgcacga 42421 ggaagaagat gatggctaaa ccagcgcgaa gacgatgtaa aaacgatgaa tgccgggaat 42481 ggtttcaccc tgcattcgct aatcagtggt ggtgctctcc agagtgtgga accaagatag 42541 cactcgaacg acgaagtaaa gaacgcgaaa aagcggaaaa agcagcagag aagaaacgac 42601 gacgagagga gcagaaacag aaagataaac ttaagattcg aaaactcgcc ttaaagcccc 42661 gcagttactg gattaaacaa gcccaacaag ccgtaaacgc cttcatcaga gaaagagacc 42721 gcgacttacc atgtatctcg tgcggaacgc tcacgtctgc tcagtgggat gccggacatt 42781 accggacaac tgctgcggca cctcaactcc gatttaatga acgcaatatt cacaagcaat 42841 gcgtggtgtg caaccagcac aaaagcggaa atctcgttcc gtatcgcgtc gaactgatta 42901 gccgcatcgg gcaggaagca gtagacgaaa tcgaatcaaa ccataaccgc catcgctgga 42961 ctatcgaaga gtgcaaggcg atcaaggcag agtaccaaca gaaactcaaa gacctgcgaa 43021 atagcagaag tgaggccgca tgacgttctc agtaaaaacc attccagaca tgctcgttga 43081 agcatacgga aatcagacag aagtagcacg cagactgaaa tgtagtcgcg gtacggtcag 43141 aaaatacgtt gatgataaag acgggaaaat gcacgccatc gtcaacgacg ttctcatggt 43201 tcatcgcgga tggagtgaaa gagatgcgct attacgaaaa aattgatggc agcaaatacc 43261 gaaatatttg ggtagttggc gatctgcacg gatgctacac gaacctgatg aacaaactgg 43321 atacgattgg attcgacaac aaaaaagacc tgcttatctc ggtgggcgat ttggttgatc 43381 gtggtgcaga gaacgttgaa tgcctggaat taatcacatt cccctggttc agagctgtac 43441 gtggaaacca tgagcaaatg atgattgatg gcttatcaga gcgtggaaac gttaatcact 43501 ggctgcttaa tggcggtggc tggttcttta atctcgatta cgacaaagaa attctggcta 43561 aagctcttgc ccataaagca gatgaacttc cgttaatcat cgaactggtg agcaaagata 43621 aaaaatatgt tatctgccac gccgattatc cctttgacga atacgagttt ggaaagccag 43681 ttgatcatca gcaggtaatc tggaaccgcg aacgaatcag caactcacaa aacgggatcg 43741 tgaaagaaat caaaggcgcg gacacgttca tctttggtca tacgccagca gtgaaaccac 43801 tcaagtttgc caaccaaatg tatatcgata ccggcgcagt gttctgcgga aacctaacat 43861 tgattcaggt acagggagaa ggcgcatgag actcgaaagc gtagctaaat ttcattcgcc 43921 aaaaagcccg atgatgagcg actcaccacg ggccacggct tctgactctc tttccggtac 43981 tgatgtgatg gctgctatgg ggatggcgca atcacaagcc ggattcggta tggctgcatt 44041 ctgcggtaag cacgaactca gccagaacga caaacaaaag gctatcaact atctgatgca 44101 atttgcacac aaggtatcgg ggaaataccg tggtgtggca aagcttgaag gaaatactaa 44161 ggcaaaggta ctgcaagtgc tcgcaacatt cgcttatgcg gattattgcc gtagtgccgc 44221 gacgccgggg gcaagatgca gagattgcca tggtacaggc cgtgcggttg atattgccaa 44281 aacagagctg tgggggagag ttgtcgagaa agagtgcgga agatgcaaag gcgtcggcta 44341 ttcaaggatg ccagcaagcg cagcatatcg cgctgtgacg atgctaatcc caaaccttac 44401 ccaacccacc tggtcacgca ctgttaagcc gctgtatgac gctctggtgg tgcaatgcca 44461 caaagaagag tcaatcgcag acaacatttt gaatgcggtc acacgttagc agcatgattg 44521 ccacggatgg caacatatta acggcatgat attgacttat tgaataaaat tgggtaaatt 44581 tgactcaacg atgggttaat tcgctcgttg tggtagtgag atgaaaagag gcggcgctta 44641 ctaccgattc cgcctagttg gtcacttcga cgtatcgtct ggaactccaa ccatcgcagg 44701 cagagaggtc tgcaaaatgc aatcccgaaa cagttcgcag gtaatagtta gagcctgcat 44761 aacggtttcg ggatttttta tatctgcaca acaggtaaga gcattgagtc gataatcgtg 44821 aagagtcggc gagcctggtt agccagtgct ctttccgttg tgctgaatta agcgaatacc 44881 ggaagcagaa ccggatcacc aaatgcgtac aggcgtcatc gccgcccagc aacagcacaa 44941 cccaaactga gccgtagcca ctgtctgtcc tgaattcatt agtaatagtt acgctgcggc 45001 cttttacaca tgaccttcgt gaaagcgggt ggcaggaggt cgcgctaaca acctcctgcc 45061 gttttgcccg tgcatatcgg tcacgaacaa atctgattac taaacacagt agcctggatt 45121 tgttctatca gtaatcgacc ttattcctaa ttaaatagag caaatcccct tattgggggt 45181 aagacatgaa gatgccagaa aaacatgacc tgttggccgc cattctcgcg gcaaaggaac 45241 aaggcatcgg ggcaatcctt gcgtttgcaa tggcgtacct tcgcggcaga tataatggcg 45301 gtgcgtttac aaaaacagta atcgacgcaa cgatgtgcgc cattatcgcc tggttcattc 45361 gtgaccttct cgacttcgcc ggactaagta gcaatctcgc ttatataacg agcgtgttta 45421 tcggctacat cggtactgac tcgattggtt cgcttatcaa acgcttcgct gctaaaaaag 45481 ccggagtaga agatggtaga aatcaataat caacgtaagg cgttcctcga tatgctggcg 45541 tggtcggagg gaactgataa cggacgtcag aaaaccagaa atcatggtta tgacgtcatt 45601 gtaggcggag agctatttac tgattactcc gatcaccctc gcaaacttgt cacgctaaac 45661 ccaaaactca aatcaacagg cgccggacgc taccagcttc tttcccgttg gtgggatgcc 45721 taccgcaagc agcttggcct gaaagacttc tctccgaaaa gtcaggacgc tgtggcattg 45781 cagcagatta aggagcgtgg cgctttacct atgattgatc gtggtgatat ccgtcaggca 45841 atcgaccgtt gcagcaatat ctgggcttca ctgccgggcg ctggttatgg tcagttcgag 45901 cataaggctg acagcctgat tgcaaaattc aaagaagcgg gcggaacggt cagagagatt 45961 gatgtatgag cagagtcacc gcgattatct ccgctctggt tatctgcatc atcgtctgcc 46021 tgtcatgggc tgttaatcat taccgtgata acgccattac ctacaaagcc cagcgcgaca 46081 aaaatgccag agaactgaag ctggcgaacg cggcaattac tgacatgcag atgcgtcagc 46141 gtgatgttgc tgcgctcgat gcaaaataca cgaaggagtt agctgatgct aaagctgaaa 46201 atgatgctct gcgtgatgat gttgccgctg gtcgtcgtcg gttgcacatc aaagcagtct 46261 gtcagtcagt gcgtgaagcc accaccgcct ccggcgtgga taatgcagcc tccccccgac 46321 tggcagacac cgctgaacgg gattatttca ccctcagaga gaggctgatc actatgcaaa 46381 aacaactgga aggaacccag aagtatatta atgagcagtg cagatagagt tgcccatatc 46441 gatgggcaac tcatgcaatt attgtgagca atacacacgc gcttccagcg gagtataaat 46501 gcctaaagta ataaaaccga gcaatccatt tacgaatgtt tgctgggttt ctgttttaac 46561 aacattttct gcgccgccac aaattttggc tgcatcgaca gttttcttct gcccaattcc 46621 agaaacgaag aaatgatggg tgatggtttc ctttggtgct actgctgccg gtttgttttg 46681 aacagtaaac gtctgttgag cacatcctgt aataagcagg gccagcgcag tagcgagtag 46741 catttttttc atggtgttat tcccgatgct ttttgaagtt cgcagaatcg tatgtgtaga 46801 aaattaaaca aaccctaaac aatgagttga aatttcatat tgttaatatt tattaatgta 46861 tgtcaggtgc gatgaatcgt cattgtattc ccggattaac tatgtccaca gccctgacgg 46921 ggaacttctc tgcgggagtg tccgggaata attaaaacga tgcacacagg gtttagcgcg 46981 tacacgtatt gcattatgcc aacgccccgg tgctgacacg gaagaaaccg gacgttatga 47041 tttagcgtgg aaagatttgt gtagtgttct gaatgctctc agtaaatagt aatgaattat 47101 caaaggtata gtaatatctt ttatgttcat ggatatttgt aacccatcgg aaaactcctg 47161 ctttagcaag attttccctg tattgctgaa atgtgatttc tcttgatttc aacctatcat 47221 aggacgtttc tataagatgc gtgtttcttg agaatttaac atttacaacc tttttaagtc 47281 cttttattaa cacggtgtta tcgttttcta acacgatgtg aatattatct gtggctagat 47341 agtaaatata atgtgagacg ttgtgacgtt ttagttcaga ataaaacaat tcacagtcta 47401 aatcttttcg cacttgatcg aatatttctt taaaaatggc aacctgagcc attggtaaaa 47461 ccttccatgt gatacgaggg cgcgtagttt gcattatcgt ttttatcgtt tcaatctggt 47521 ctgacctcct tgtgttttgt tgatgattta tgtcaaatat taggaatgtt ttcacttaat 47581 agtattggtt gcgtaacaaa gtgcggtcct gctggcattc tggagggaaa tacaaccgac 47641 agatgtatgt aaggccaacg tgctcaaatc ttcatacaga aagatttgaa gtaatatttt 47701 aaccgctaga tgaagagcaa gcgcatggag cgacaaaatg aataaagaac aatctgctga 47761 tgatccctcc gtggatctga ttcgtgtaaa aaatatgctt aatagcacca tttctatgag 47821 ttaccctgat gttgtaattg catgtataga acataaggtg tctctggaag cattcagagc 47881 aattgaggca gcgttggtga agcacgataa taatatgaag gattattccc tggtggttga 47941 ctgatcacca taactgctaa tcattcaaac tatttagtct gtgacagagc caacacgcag 48001 tctgtcactg tcaggaaagt ggtaaaactg caactcaatt actgcaatgc cctcgtaatt 48061 aagtgaattt acaatatcgt cctgttcgga gggaagaacg cgggatgttc attcttcatc 48121 acttttaatt gatgtatatg ctctcttttc tgacgttagt ctccgacggc aggcttcaat 48181 gacccaggct gagaaattcc cggacccttt ttgctcaaga gcgatgttaa tttgttcaat 48241 catttggtta ggaaagcgga tgttgcgggt tgttgttctg cgggttctgt tcttcgttga 48301 catgaggttg ccccgtattc agtgtcgctg atttgtattg tctgaagttg tttttacgtt 48361 aagttgatgc agatcaatta atacgatacc tgcgtcataa ttgattattt gacgtggttt 48421 gatggcctcc acgcacgttg tgatatgtag atgataatca ttatcacttt acgggtcctt 48481 tccggtgatc cgacaggtta cg // LOCUS MUSIGKCLO 422 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Mouse IgMk rearranged kappa light-chain mRNA variable region (V-J-kappa) anti-DNA autoantibody. ACCESSION M20834 KEYWORDS autoantibody; immunoglobulin light chain; kappa-immunoglobulin; kappa-immunoglobulin light chain; variable region. SOURCE Mouse (strain MRL/Mp -lpr/lpr) spleen hybridoma cell line MRL-DNA22, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 422) AUTHORS Kofler,R., Strohal,R., Balderas,R.S., Johnson,M.E., Noonan,D.J., Duchosal,M.A., Dixon,F.J. and Theofilopoulos,A.N. TITLE Immunoglobulin kappa light chain variable region gene complex organization and immunoglobulin genes encoding anti-DNA autoantibodies in lupus mice JOURNAL J. Clin. Invest. 82, 852-860 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by R. Kofler 28-JUL-1988 FEATURES from to/span description pept 33 > 422 IgMk light chain precursor sigp 33 98 Ig light chain signal peptide matp 99 > 422 Ig light chain mRNA 1 > 422 IgMk mRNA recomb 386 387 V-region end/J-region start BASE COUNT 112 a 109 c 95 g 106 t ORIGIN Chromosome 6. 1 taattagcta gggaccaaaa ttcaaagaga aaatggattt tcaggtgcag attttcagct 61 tcctgctaat cagtgcctca gtcataatgt ccagaggaca aattgttctc acccagtctc 121 cagcaatcat gtctgcatct cctggggaac gggtcaccat gacctgcagt gccagctcaa 181 gtgtaagttc cagctacttg tactggtacc agcagaagcc aggatcctcc cccaaactat 241 ggatttatag cacatccaac ctggcttctg gagtccctgc tcgcttcagt ggcagtgggt 301 ctgggacctc ttattctctc acaatcagca gcatggaggc tgaagatgct gccacttatt 361 actgccagca gtacagtggt tacccattca cgttcggcac ggggacaaaa ttggaaataa 421 aa // LOCUS MUSIGKCLY 312 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Mouse Ig unproductively rearranged kappa-chain VJ2C mRNA from plasmacytoma BFPC 61B, partial cds. ACCESSION M12189 KEYWORDS immunoglobulin; immunoglobulin light chain; kappa-immunoglobulin; variable region. SOURCE Mouse plasmacytomas, cDNA to mRNA, clone BFPC 61B. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Bernard,O., Gough,N.M. and Adams,J.M. TITLE Plasmacytomas with more than one immunoglobulin kappa mRNA: Implications for allelic exclusion JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 5812-5816 (1981) STANDARD simple staff_entry FEATURES from to/span description pept.ps < 1 > 312 Ig kappa-chain VJC-regions (AA at 3) recomb 245 246 V-region end/J2-region start recomb 285 286 J2-region end/C-region start BASE COUNT 75 a 88 c 82 g 67 t ORIGIN 1 ctgcaccagg ggagaaggtc acaatgactt gcagggccag ctcaagtgta agttacatgg 61 actggtacca gcagaagcca ggatcctccc ccaaaccctg gatttatgcc acatccaccc 121 tagtttctgg agttcctgct cgcttcagtg gcagtgggtc tgggacctct tactctctca 181 caatcagcag agtggaggct gaagatgctg ccacttatta gtgcccgcag tggagtagta 241 acccacccga cgttcggtgg aggcaccaag ctggaaatca aacgggctga tgctgcacca 301 actgtatcca tc // LOCUS MUSTCBXZA 396 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Mouse T-cell receptor active beta-chain V-region V11DJ mRNA. ACCESSION M13676 KEYWORDS T-cell antigen receptor; T-cell receptor; T-cell receptor beta-chain; antigen receptor; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (C57BL/6) T-helper hybridoma, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 390) AUTHORS Behlke,M.A., Spinella,D.G., Chou,H.S., Sha,W., Hartl,D.L. and Loh,D.Y. TITLE T-cell receptor beta-chain expression: Dependence on relatively few variable region genes JOURNAL Science 229, 566-570 (1985) STANDARD full staff_review REFERENCE 2 (bases 391 to 396) AUTHORS Behlke,M.A. JOURNAL Unpublished (1985) Howard Hughes Med Inst, St.Louis, MO 63110. STANDARD full staff_review COMMENT Draft entry and computer-readable copy of the sequence in [2] were kindly provided by M.A.Behlke, 25-OCT-1985. FEATURES from to/span description pept 1 > 396 T-cell receptor beta-chain precursor (V11DJ) sigp 1 57 T-cell receptor beta-chain signal peptide matp 58 > 396 T-cell receptor beta-chain recomb 342 343 V11-region end/D-region start recomb 345 346 D-region end/N-region start recomb 348 349 N-region end/J-region start recomb 390 391 J-region end/C-region start BASE COUNT 113 a 96 c 81 g 106 t ORIGIN Chromosome 6. 1 atgagctgca ggcttctcct ctatgtttcc ctatgtcttg tggaaacagc actcatgaac 61 actaaaatta ctcagtcacc aagatatcta atcctgggaa gaacaaataa gtctttggaa 121 tgtgagcaac atctgggaca taatgctatg tactggtata aacagagcgc tgagaagccg 181 ccagagctca tgtttctcta caatcttaaa cagttgattc gaaatgagac ggtgcccagt 241 cgttttatac ctgaatgccc agacagctcc aagctacttt tacatatatc tgccgtggat 301 ccagaagact cagctgtcta tttttgtgcc agcagccaga taactagtaa ccaagacacc 361 cagtactttg ggccaggcac tcggctcctc gtgtta // LOCUS P18XIS911 115 bp ds-DNA SYN 29-JUN-1990 DEFINITION Plasmid pACYC184/insertion element 91 (IS91), left junction. ACCESSION K03543 KEYWORDS insertion sequence. SEGMENT 1 of 2 SOURCE Plasmid pACYC184 and insertion element 91 DNA, clone pSU234. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 115) AUTHORS Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F. TITLE Transposition of IS91 does not generate a target duplication JOURNAL J. Bacteriol. 169, 442-443 (1987) STANDARD full staff_review COMMENT No duplication of an insertion target sequence seems to take place upon the insertion of IS91. FEATURES from to/span description recomb 16 17 pBR322 DNA end/IS91 DNA start rpt 16 24 IS91 inverted terminal repeat BASE COUNT 26 a 26 c 29 g 34 t ORIGIN 16 bp upstream of TaqI site. 1 ttaacgaccc tgccctcgag taggcagcct ggcggctgcg gcttgtcatg gtctggaatt 61 accgttataa aaaaagataa tgtcattgtc tttcaggtag ttatatggcc cgttc // LOCUS P18XIS912 115 bp ds-DNA SYN 29-JUN-1990 DEFINITION Plasmid pACYC184/insertion element 91 (IS91), right junction. ACCESSION M15256 KEYWORDS insertion sequence. SEGMENT 2 of 2 SOURCE Plasmid pACYC184 and insertion element 91 DNA, clone pSU234. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 115) AUTHORS Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F. TITLE Transposition of IS91 does not generate a target duplication JOURNAL J. Bacteriol. 169, 442-443 (1987) STANDARD full staff_review COMMENT No duplication of an insertion target sequence seems to take place upon the insertion of IS91. FEATURES from to/span description rpt 93 100 IS91 inverted terminal repeat recomb 101 102 IS91 DNA end/pACYC184 DNA start BASE COUNT 25 a 20 c 29 g 41 t ORIGIN About 1.85 kb after segment 1; 97 bp upstream of TaqI site. 1 tttaatagtc gtttccgctc atatggtgca caaggggtgt tgaagaaaca tccgttttgt 61 ggtgcttttt tagtcttttg gggatttaaa ttcctatcga tgaaccgacg accgg // LOCUS PBR322IS2 115 bp ds-DNA SYN 29-JUN-1990 DEFINITION Plasmid pBR322/insertion element 91 (IS91), right junction. ACCESSION M15257 KEYWORDS insertion sequence. SEGMENT 2 of 2 SOURCE Plasmid pBR322 and insertion element 91 DNA, clone pSU240. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 115) AUTHORS Diaz-Aroca,E., Mendiola,M.V., Zabala,J.C. and de la Cruz,F. TITLE Transposition of IS91 does not generate a target duplication JOURNAL J. Bacteriol. 169, 442-443 (1987) STANDARD full staff_review COMMENT No duplication of an insertion target sequence seems to take place upon the insertion of IS91. FEATURES from to/span description rpt 93 100 IS91 inverted terminal repeat recomb 101 102 IS91 DNA end/pBR322 DNA start BASE COUNT 25 a 19 c 27 g 44 t ORIGIN About 1.85 kb after segment 1; 97 bp upstream of TaqI site. 1 tttaatagtc gtttccgctc atatggtgca caaggggtgt tgaagaaaca tccgttttgt 61 ggtgcttttt tagtcttttg gggatttaaa ttcctatcga tcaaggatct taccg // LOCUS RATMYCSE 903 bp ds-DNA ROD 29-JUN-1990 DEFINITION Rat c-myc/Ig epsilon translocation DNA. ACCESSION M23417 KEYWORDS epsilon immunoglobulin; switch region; translocation. SOURCE Rat (strain LOU) immunocytoma cell line IR162 DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 903) AUTHORS Tian,S.-S. and Faust,C. TITLE A simple and precise aberrant translocation of the rat c-myc gene into the epsilon-heavy chain switch region of the IgE-producing immunocytoma, IR162 JOURNAL J. Biol. Chem. 264, 1846-1853 (1989) STANDARD simple staff_entry FEATURES from to/span description recomb 298 299 Ig epsilon switch region DNA end/c-myc DNA start signal 617 621 TATA box signal 777 781 TATA box BASE COUNT 212 a 250 c 238 g 203 t ORIGIN 1 gggcccagtt cagctctgtg ccaggccccc agaaagcata ggatatttct gataaacctg 61 attgtctcta cctaccctga tatagtctgg ctactatatc aggacaatct tcttggagaa 121 cagttaaaaa catccataaa aaatttcgag caatctattt ttttttaaat taatttattt 181 tttggatttt ttttaaactt acatttcaaa tgttatcact tttcccccac ccaaccaccc 241 acccctttcc acctctctgt cctgacattc ccctacactg gtgtgtgtgt ggggtccaga 301 ctaatacacg caggacaaga acacagttca gccgagcgct gcgcccgaac aactgtacag 361 aaagggaaag gaccgacgcg cgagcaggag aaaatggtcg ggcgcgcagt taattcatgc 421 tgcgctatta ctgtttacac cccgagtcgg agtactgggc tgcgggactg aggctcctcc 481 tctttccccg gctccccact agtgcccctc ccgagttccc aaagcagagg gcgggggagc 541 gagaggagga aaaaaataga ggtcgggagg ggagggagaa agagattctc tggctaatcc 601 ccgcccaccc gccctttata atccgggggt ctgcgcggct aaggaccccc gggctgcgct 661 gctctccgct gccgcctccg ccgcgcccac tccgctcgcc tcctgcctcc aaaagggcag 721 ggcttcgccg aggcttggcg ggaaaaagaa gcgaggggag ggatccggag tcgcagtata 781 aaagaagctt ttcgggcgtt ttttttctga ctcgctgtag taattccagc gagagacaga 841 gggagtgagc gggcgggttg gaagagccca gtgtgcagag ccccactccg ggcttcctag 901 gaa // LOCUS RATRAFB 2719 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Rat activated c-raf oncogene mRNA encoding a fused protein, complete cds. ACCESSION M15428 KEYWORDS fusion protein; oncogene; protein kinase; raf oncogene. SOURCE Rat primary NIH 3T3 cell transformant IQ7-2, cDNA to mRNA, clone pA5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2719) AUTHORS Ishikawa,F., Takaku,F., Nagao,M. and Sugimura,T. TITLE Rat c-raf oncogene activation by a rearrangement that produces a fused protein JOURNAL Mol. Cell. Biol. 7, 1226-1232 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.Ishikawa, 29-MAY-1987. The activated rat c-raf gene (see below) is transcribed to produce a fused mRNA, in which the 5' half of the sequence is replaced by an unknown rat sequence. The normal and activated c-raf were each connected to the long terminal repeat of Rous sarcoma virus and transfected into NIH 3T3 cells. Only the activated form had transforming activity. FEATURES from to/span description pept 206 2014 raf fusion protein mRNA < 1 > 2525 raf mRNA recomb 898 899 rat DNA of unknown origin end/raf DNA start site 1032 1032 homology to v-raf start BASE COUNT 762 a 644 c 698 g 615 t ORIGIN 123 bp upstream of BglI site. 1 gccgtgcgca tagaggccgg tgcgcggccc ttgctcgttt aacgcgggac tatatttccc 61 agggtccgtc gcgggagtct ccggcgggca ggcgcgcggg agactgcgag cgaggcgccg 121 acggggcggc tcaggcgtct gggtccgcgc atctccttgc tccttcgctt ctccttcagc 181 cgctgctgcc acgaccccgg ccgacatggc ggcggtgttg cagcaagtgc tggagcgccc 241 ggagctgaac aagctgccta agtcgaccca gaacaaactt gagaagttcc tggctgaaca 301 gcagtccgaa atcgactgcc tgaaggggcg gcacgagaaa tttaaggtgg agagtgagca 361 acaatacttt gagatagaga agagactatc ccagagtcag gagaggcttg ttaatgaaac 421 ccgggagtgt cagaacttga ggctggagct tgagaagcta aataaccaag taaaagtatt 481 aactgagaaa aacaaagaac ttgaaactgc tcaagaccgc aatctaggca ttcagagcca 541 gtttacaaga gcaaaggaag agttagaagc tgaaaaaaga gatttaatca gaaccaatga 601 gaggttatct caggaagttg aatatttaac agaggatgtt aaacgtctaa acgaaaaact 661 taaagaaagc aatacaacga agggtgaact tcagttaaag ctggatgaac ttcaagcttc 721 tgatgtcact gtgaagtacc gagaaaaacg cttagaacaa gaaaaggaat tgctacacaa 781 tcaaaattca tggctaaaca cagagttgaa aaccaaaact gatgagctat tggctctagg 841 aagagaaaag ggaaatgaaa ttctggaact taagtgtact cttgaaaaca aaaaggaaga 901 ggatgcaatt cgaagtcaca gtgaatcagc ctcaccttca gccctgtcca gcagccccaa 961 caacctgagc ccaacaggct ggtcacagcc caaaacccct gtgccagcac aaagagagag 1021 ggcgccagga tctgggaccc aggaaaaaaa caaaattagg cctcgtgggc agagagattc 1081 aagttattac tgggaaatag aagccagtga ggtgatgctg tctactcgga ttggctcggg 1141 ctcctttggc actgtgtaca agggcaagtg gcatggagat gttgcagtaa agatcctaaa 1201 ggtggttgac ccaactccag agcaacttca ggccttcagg aacgaggtgg ctgttttgcg 1261 caaaacacgg catgttaata tcctgctgtt catggggtac atgacaaagg acaacctggc 1321 gattgtgacc cagtggtgtg aaggcagcag tctctacaaa cacctgcatg tccaggagac 1381 caaattccag atgttccagc taattgacat tgcccggcag acagctcagg gaatggacta 1441 tttacatgca aagaacatca tccacagaga catgaaatcc aacaatatat ttctccatga 1501 aggcctcacg gtgaaaatcg gagattttgg tttggcaaca gtgaagtcgc gctggagtgg 1561 ttctcagcag gttgaacagc ccactggctc tgtgctgtgg atggccccag aagtaatccg 1621 aatgcaggat aacaacccgt tcagcttcca gtccgatgtc tactcctatg gcattgtgct 1681 gtatgagctg atgactgggg agcttcccta ctcccacatc aacaaccgag accagatcat 1741 cttcatggtg ggccgtgggt acgcctcccc agatcttagc aggctctaca agaactgccc 1801 caaggcaatg aagaggttgg tggctgactg tgtgaagaaa gtcaaagaag aaaggccttt 1861 gtttcctcag atcctgtctt ccattgagct gcttcagcac tctctgccga aaatcaacag 1921 gagcgcctct gagccttccc tgcatcgggc agctcacact gaggacatca atgcttgtac 1981 gctgaccaca tccccaaggc taccagtctt ctagctgacg ttatagctgt tcttaggcca 2041 ccaggggacg aagaagagtc agcaggcacc actttctgtt tccttggggg cagaatgcat 2101 gtttccggaa aagctgctgc taaggaccta gactactcac agggccttaa cttcatattg 2161 ccttcttttc tacccttcct gccctggaaa tggaagctgt ccgccaagcc agcctgctcc 2221 agaggtatac aagtcagcga gtatttttag ggcaaatggc cttggagaga gaaggcaggg 2281 cactccggct actgcaggga catgcagttg ggaacttggc tcattgagct gtacagacag 2341 tggtgcagtg ccagttttgc acatggagtc ctggccacct gggggagcct gctttggtac 2401 tacagaactt cactttgtgg acacaccttc ctcttactga gtctaagatg tcctgtgcag 2461 aggatgcttt ccaagcacgg tgctccacct tctggcagcc tcccacacgc tgaatctgtc 2521 ttccaggagc tgccctatgg ggtgctgcag cccagcccta tctctatagt cacatccttg 2581 tctgtaagaa agccaggaat acaggttttc ttaatgattt tgggttttaa ttttgttttt 2641 attgagcctg ataaaataca gttatctgat ggttcctcaa ttatgttatt ttaataaaat 2701 aaattaaatt taaaaaaaa // LOCUS YSCINV 68 bp ss-mRNA SYN 29-JUN-1990 DEFINITION Yeast (S.cerevisiae) synthetic invertase signal peptide region fused to human interferon gene. ACCESSION M13346 KEYWORDS gene fusion; invertase signal peptide; signal peptide. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 68) AUTHORS Chang,C.N., Matteucci,M., Perry,L.J., Wulf,J.J., Chen,C.Y. and Hitzeman,R.A. TITLE Saccharomyces cerevisiae secretes and correctly processes human interferon hybrid proteins containing yeast invertase signal peptides JOURNAL Mol. Cell. Biol. 6, 1812-1819 (1986) STANDARD full staff_review FEATURES from to/span description pept 12 > 68 yeast invertase signal/human interferon fusion peptide sigp 12 68 invertase signal peptide recomb 8 9 EcoRI sequence end/yeast invertase signal start BASE COUNT 13 a 16 c 12 g 27 t ORIGIN 3 bp upstream of EcoRI site. 1 ccgaattcat gatgcttttg caagctttcc ttttcctttt ggctggtttt gcagccaaaa 61 tatctgca // LOCUS ADGLS 1477 bp ds-DNA VRL 29-JUN-1990 DEFINITION Adenovirus type 7 (simian) left end of genome: 0.0-4.2 map units. ACCESSION K03325 KEYWORDS alternate splicing; terminal repeat. SOURCE Adenovirus type 7 (simian) DNA. ORGANISM Mastadenovirus s7 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 1477) AUTHORS Kimelman,D., Miller,J.S., Porter,D. and Roberts,B.E. TITLE E1a regions of the human adenoviruses and of the highly oncogenic simian adenovirus 7 are closely related JOURNAL J. Virol. 53, 399-409 (1985) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Kimelman (15-AUG-1985). The mRNA transcripts (mRNAs I and II) were determined by mRNA-DNA hybridization and S1 nuclease digestion. The peptide coding regions were deduced by analogy to other adenoviruses and following the most likely mode of translation. The intron of the mRNA III transcribed from the E1a of other adenoviruses has not yet been determined by S1 mapping. Enhancer sequences are located at positions 165-173 and 265-270. A TATA box is located at position 408-413. A polyadenylation signal is located at posityion 1447-1452. FEATURES from to/span description pept 486 837 E1a 28 K protein from mRNA I, exon 1 1178 1407 E1a 28 K protein from mRNA I, exon 2 pept 486 1056 E1a 24 K protein from mRNA II, exon 1 1178 1407 E1a 24 K protein from mRNA II, exon 2 pre-msg 439 1477 E1a mRNA IVS 838 1177 E1a 24K pept intron A IVS 1057 1177 E1a 28K pept intron A BASE COUNT 331 a 334 c 442 g 370 t ORIGIN 5' end of the l strand of the genome. 1 tctatataat ataccttatt tgggaacggt gccaatatgc taatgaggtg ggcggagttt 61 ggtgacgtat gcggaaatgg gcggagttag gggcggggtt tggcggtagg cgtggctggg 121 ggagtgtccg ggcgtgggaa cggaagtgac gtagggggcg cgccggaggt gacgtcgtgt 181 ggggagtttt aaaccggaag caaggtattt taaacgcttg caagcgcaat tttgtcggtt 241 ttggcgcgaa aactgataaa aagcggaagt tcggttaatc attaattttt acgataggga 301 ggaatattta ccgagggccg gtgaactttg agcggtgacg cggtggtttc gttacgtggc 361 accaccacgc gactgctcaa agtccccgtt tattgtctag gtgagggtat ttaaaccggc 421 tcagaacgtc aagaggccac tcttgagtgc ccgcgagaag agctttctcc tctttcgctg 481 cgaaaatgag acacttggcg ttggaaatga tttctgaact gctggattta ggactggata 541 ccattgatgg ctggctgcac accgaatttc ggccggtacc ggcgggggtg agtcataaca 601 tgtcgctgca cgaaatgtac gacctggacg ttaccggcca ggaggatgag aacgaagagg 661 cggtagatgg tgttttttcc gatgcgatgc tcctggccgc ggaggaggga atagaaatgc 721 ctaatcttta ttctccggga cctctggttg ggggaggtga aatgcctgaa cttcagcctg 781 aggaggaaga tcttttctgc tacgaagatg gcttccctcc cagtgactct gaggaaggtg 841 agcattcgca ggtggagaca gaacgtaaaa tggcggaggc ggcggcagca ggtgcggcgg 901 cggccgcgcg gcgggagcaa gatgactttc gcttagactg tcctagcgta cctggccatg 961 gctgtagctc ctgtgactac catcgcaaaa ctagcggctg tcctgaaatt ctgtgctcgc 1021 tgtgctatct gagggctaac agcatgttta tttatagtaa gtaaattttt tctactaact 1081 ttctcgttgt gtgtttgctc gctcgctcgc tcgcaactgc tggggtgctt gctgttggga 1141 ctgagcttac aggtattttc tctgtaattt tccataggtc cagtttctga ctctgagcca 1201 gacgagcccg actccacaac agctgattca aatcatggca gcccgccaac ccttcgctgc 1261 accccaccca gggacttgcc gcgacctgtg ccagtgaagg cctctcctgg caagcgccca 1321 gcggtgaaca gcttgcatga cctcatagag gaggttgaac aaacagtacc tttggacctg 1381 tccctaaagc gctctaggag caattagggt tataaaaccc ctccccttcc ccttaagtta 1441 tcaggaaata aaaagattaa ctggattctt tgtgcct // LOCUS ADGRS 116 bp ds-DNA VRL 29-JUN-1990 DEFINITION adenovirus type 7 (simian) right end of genome. ACCESSION V00035 J01991 KEYWORDS terminal repeat. SOURCE adenovirus type 7 (simian) dna. ORGANISM Mastadenovirus s7 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 116) AUTHORS Broker,T.R. TITLE terminal nucleotide sequences and restriction endonuclease cleavage maps of simian and murine adenoviruses JOURNAL (in) Tooze,J. (Ed.); DNA Tumor Viruses, Second Edition Revised: 1037-1046; Cold Spring Harbor Laboratory, Cold Spring Harbor (1981). STANDARD simple staff_review REFERENCE 2 (bases 1 to 116) AUTHORS Tolun,A., Alestroem,P. and Pettersson,U. TITLE sequence of inverted terminal repetitions from different adenoviruses: demonstration of conserved sequences and homology between sa7 termini and sv40 dna JOURNAL Cell 17, 705-713 (1979) STANDARD simple staff_review COMMENT L-strand is given, from 5' to 3'. BASE COUNT 25 a 15 c 46 g 30 t ORIGIN 1 atcaataata taccttattt gggaacggtg ccaatatgct aatgaggtgg gcggagtttg 61 gtgacgtatg cggaaatggg cggagttagg ggcggggttt ggcggtaggc gtggct // LOCUS ADGSA7PE1 2338 bp ds-DNA VRL 29-JUN-1990 DEFINITION Simian adenovirus SA7P, leftmost 2338 bp. ACCESSION X01027 KEYWORDS E1A protein; E1b protein; inverted terminal repeat; overlapping genes; transforming gene. SOURCE Simian adenovirus SA 7P. ORGANISM Mastadenovirus s7 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 2338; enum. 1 to 2338) AUTHORS Dekker,B.M.M., Konings,D.A.M., Denisova,T.S., Gibadulin,R.A. and Van Ormondt,H. TITLE The nucleotide sequence of the leftmost XhoI fragment (6%) of simian adenovirus SA7P JOURNAL J. Gen. Virol. 65, 1699-1708 (1984) STANDARD simple automatic COMMENT **map: leftmost XhoI fragment (6%); EMBL features not translated to GenBank features: key from to description SITE 1 183 inverted terminal repeat (ITR) of SA7P PRM 409 416 TATA E1A region CAP 440 440 cap site E1A RNAs MSG 440 838 "12S" E1A RNA MSG 440 838 1st exon "12S" E1A RNA IVS 839 1175 intron "12S" E1A RNA MSG 1176 >1445 2nd exon "12S" E1A RNA MSG 440 1057 1st exon "13S" E1A RNA IVS 1058 1175 intron "13S" E1A RNA MSG 1176 >1445 2nd exon "13S" E1A RNA SITE 1445 1450 AATAAA E1A PRM 1508 1514 TATA E1B CAP 1538 1538 cap E1B RNA's This virus, an independent isolate from Moscow, USSR, is highly related to simian adenovirus 7 (SA7). The E1A sequences are 95% homologous to those of SA7 see $<K03322>$ J. VVrol. 5:399-409(1985) FEATURES from to/span description pept 487 838 E1A 21K protein (part 1) 1176 / 1401 E1A 21K protein (part 2) pept 487 1057 E1A 29K protein (part 1) 1176 / 1401 E1A 29K protein (part 2) pept 1569 2111 pot. 21K E1B protein pept 1874 > 2338 pot. 55K E1B protein BASE COUNT 514 a 535 c 725 g 564 t ORIGIN 1 catcatcaat aatatacctt atttgggaac ggtgccaata tgctaatgag gtgggcggag 61 tttggtgacg tatgcggaag tgggcggagc aaggggcggg gcgagaggcg gggcttttgg 121 taggcgtggc cgggcgtggg aacggaagtg acgtcggggg cgcgccggac gtgacgtgtt 181 ttgaggcgtt ttaaaccgga agcaaggtat tttaaacgct tgcagcgcaa ttttgccggt 241 tttggcgcga aaactgataa aaagcggaag ttcggttaat cattaatttt tacgataggg 301 aggaatattt accgagggcc ggtgaacttt gagcgatgac gcggtggttt cgttacgtgg 361 caccaccacg cgactgctca aagtccccgt ttattgtcta ggtgagggta tttaaaccgg 421 ctcagaccgt caagaggcca ctcttgagtg cccgcgagaa gagctttctc ctctttcgct 481 gcgaaaatga gacacttggc gttggagatg atgtctgaac tgctggattt aggactggat 541 accatcgata gctggctgca caccgaattc gcgccggtac cgacgggggt gagtcataac 601 atgtcgctgc acgaaatgta cgacctggac gttaccggcc aggaggatga gaacgaagag 661 gcggtagacg gtgttttttc cgatgcgatg ctcctggccg ctgaagaggg agtagaaatg 721 cctagtcttt attctccggg acctctggtt gggggaggtg agatgcctga gctacagcct 781 gaggaggtag atcttttctg ctacgaagat ggcttccctc ccagtgactc tgaggaaggt 841 gagcattcgc aggtggagac ggaacgtaaa atggcggagg tggcggcggc aggtgcggcg 901 gcggccgtcc gcggggagca agatgacttt cgcttagact gtcctagcgt acctggccat 961 ggctgtagct cctgtgacta ccatcgcaaa aatagcggct gtcctgaaat tctgtgctcg 1021 ctgtgctatc tgagggctaa cagcatgttt atttatagta agtgaatttt tctactaact 1081 ttctcgctgt gtgtttgctc gctcgttcgc taactgctgg ggtgcttggt gttgggactg 1141 agcttacagg tatttttctc tgtaattttc cacaggtcca gtttctgact ctgagccaga 1201 cgagcctgac tccacaacag ctgattcaaa tcatggcagc ccgccaaccc ttcgctgcac 1261 cccacccagg gacttgccgc gacctgtgcc agtgaaggcc tctcatggca agcgcccagc 1321 ggtgaacagc ttgcatgacc tcatagagga ggttgaacaa acagtacctt tggacctgtc 1381 cctaaagcgc tctaggagca attagggtca taaaacccct ccccttcccc ttaagttata 1441 aggaaataaa aagattaact ggattctttg tgcctgcttt gtttgttcgt gggcggtcct 1501 tggggattat aaaaggggtg agtcaggtta taagggtact taggcaacgc tcaggtaaca 1561 tcgcctccat ggatctccga acggcgcttc agacttttga gagcacccgc cgcttgctgg 1621 agctctgttc caatagaacc tcttttttgt ggaggtggtt atttggaact ccgctcagcc 1681 ggctggttag gcaggtgaaa ttagaatacg agaaggattt tgaaagaatt ttagatcagt 1741 gtcccggggt gtttgagtcc ctggagctgg gctatcataa ggtttttgag gagaagattg 1801 taaaggagtt ggatttttct tctcccggtc gggcggtcgc ggctgtggcc tttgcttcct 1861 acctgctgga tagatggaac acccggaccc acctgtcccc ggggtaccag atggattaca 1921 tcagcctgaa cctgtggaag ttttggttgc gccggcgggt ttacaattac tcgcgggggc 1981 tgcctcagct agggccggcg gcgccgctgg cgaggcaggg gtcgcagcag gaggagcagc 2041 agcagcggca ggaggaggag caggtgcagg aggagatgag gtccggcctg gaccctccaa 2101 cggagaactg atggcgcagg tggccggcgc ggcggatgcc agcgggccgc cttccaagcg 2161 ccctaagaat gaggcagagc agatgcagga gggtttaact cagttgaccc tgagtctcat 2221 taacaagtat cggcctgaga ccgtgtactg gaatgaactg gagcaggaat ttcagcatgg 2281 ggagatgcac ttgcagttcc actttagctt tgagcaatta aaaactcact ggctcgag // LOCUS ADGVARNA 850 bp ds-DNA VRL 29-JUN-1990 DEFINITION Simian adenovirus type 7 (SA7) virus-associated (VA) RNA gene. ACCESSION M14082 KEYWORDS virus-associated RNA. SOURCE Simian adenovirus type 7 DNA. ORGANISM Mastadenovirus s7 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 850) AUTHORS Larsson,S., Svensson,C. and Akusjaervi,G. TITLE Characterization of a low-molecular-weight virus-associated (VA) RNA encoded by simian adenovirus type 7 which functionally can substitute for adenovirus type 5 VA RNA-1 JOURNAL J. Virol. 60, 635-644 (1986) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Larsson, 15-JAN-1987. Simian adenovirus type 7 VA RNA stimulates protein synthesis in a transient expression assay and can functionally substitute for VA RNA-I during lytic growth of human Ad5. The SA7 genome encodes only one VA RNA species. FEATURES from to/span description RNA 181 352 virus-associated RNA (5' and 3' ends +/-2bp) ORF 105 < 1 (c) ORF A cds ORF 366 > 850 ORF B cds BASE COUNT 140 a 260 c 338 g 112 t ORIGIN 150 bp upstream of XbaI site; 30 mu on viral map. 1 gtagatgtag cgcgacatcc aggtgaggcc ggcggcggtg gtggcggcgc gggtgaactc 61 gcggacgcgg ttccagaggt tgcgcagggg ggcgaagcgt tgcatggtgg tcacgctctg 121 gccggtgaga cgggcgcagt cctgcacgct ctagacggaa acagagagag gcgtcaccga 181 ctcctctccg tagcttgggg gttaggtcgc aagggtgcgg tggcggggaa ccccggttcg 241 aaaccggccg gatccgccac tcccgacgcg gcggccccgc gtccacgacc ccaccagagg 301 ccgagaccca gccgcgacgc gcgcacccca aatacggagg ggagtctttt ggtgcttttt 361 tgtagatgca tcccgtgctg cggcagatgc gccccacgcg gggggcctcc tcgcgggaag 421 aagagctgga ggcgcaggag gaggcgcgcg gtggcgccgc gggccccgcc gagctggagc 481 tggaggaggg cgaaggcttg gcgcggctgg gcgccggagt gccggagcgc cacccgcggg 541 tgcaactggc tcgcgaccag cgccaggcct acgtgccgcc gcagaacctg tttagggacc 601 ggagcgggga ggaggcggag gaaatgcgtg actgccgttt tcgggccggg cgggagctgc 661 gggcgggctt ggagcgcgag cggctgctgc ggcccgagga ctttgagccc gaggagcgca 721 acggtatcag tccggcgcgc gcgcacgtct cggccgccaa cctggtgacg gcctacgagc 781 agacggtgaa cgaggagcgc aactttcaaa agagcttcaa caaccacgtg cgcaccctcg 841 tggccccggg // LOCUS ADZITRL 150 bp ds-DNA VRL 29-JUN-1990 DEFINITION adenovirus 31 inverted terminal repeat & repl ori left end. ACCESSION J01958 KEYWORDS origin of replication; terminal repeat. SOURCE ad31 (adenovirus 31). ORGANISM Mastadenovirus 31 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A. TITLE conserved sequences at the origin of adenovirus dna replication JOURNAL J. Virol. 44, 530-537 (1982) STANDARD simple staff_review BASE COUNT 38 a 29 c 48 g 35 t ORIGIN 1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag 61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt 121 gtgggagccg ggcgcgccgg atgtgacgcg // LOCUS ADZITRR 150 bp ds-DNA VRL 29-JUN-1990 DEFINITION adenovirus 31 inverted terminal repeat & repl ori right end. ACCESSION J01959 KEYWORDS origin of replication; terminal repeat. SOURCE ad31 (adenovirus 31). ORGANISM Mastadenovirus 31 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A. TITLE conserved sequences at the origin of adenovirus dna replication JOURNAL J. Virol. 44, 530-537 (1982) STANDARD simple staff_review BASE COUNT 40 a 28 c 47 g 35 t ORIGIN 1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag 61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt 121 gtgggagccg ggcgcgccgg atgtgacgaa // LOCUS HSKGAORA 3726 bp ds-DNA VRL 29-JUN-1990 DEFINITION Herpesvirus of turkey glycoprotein A (gA) gene and ORF of unknown function. ACCESSION M33303 KEYWORDS glycoprotein A. SOURCE Herpesvirus of turkey DNA. ORGANISM Gallid herpesvirus type 2 Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 3726) AUTHORS Kato,A., Sato,I., Ihara,T., Ueda,S., Ishihama,A. and Hirai,K. TITLE Homologies between herpesvirus of turkey and Marek's disease virus type-1 DNAs within two co-linearly arranged open reading frames, one encoding glycoprotein A JOURNAL Gene 84, 399-405 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 207 1676 glycoprotein A pept 2118 2756 ORF2 BASE COUNT 1016 a 884 c 815 g 1011 t ORIGIN 1 tcgtgcgcat cgcccatggc tggcggaacg ctctttcgaa ccgtgaataa aactttgtat 61 ctactaaaca ataactttgt gttttattga gcggtcgaaa acaatgagga gctgcaattt 121 aaagctaacc gcatacgccg ggcgggtaaa gaccatttta taccatatta cgcatctatc 181 gaaacttgtt cgagaaccgc aagtatatgg tttccaacat gcgcgttcta cgcgtactgc 241 gcctgacggg atgggtgggc atatttctag ttctgtcttt acagcaaacc tcttgtgccg 301 gattgcccca taacgtcgat acccatcata tcctaacttt caacccttct cccatttcgg 361 ccgatggcgt tcctttgtca gaggtgccca attcgcctac gaccgaatta tctacaactg 421 tcgccaccaa gacagctgta ccgacgactg aaagcactag ttcctccgaa gcgcaccgca 481 actcttctca caaaatacct gatataatct gcgaccgaga agaagtattc gtattcctta 541 acaatacagg aagaattttg tgtgacctta tagtcgaccc cccttcagac gatgaatggt 601 ccaacttcgc tcttgacgtc acgttcaatc caatcgaata ccacgccaac gaaaagaatg 661 tagaggttgc ccgagtggcc ggtctatacg gagtaccggg gtcggattat gcatacccta 721 ggaaatcgga attaatatcc tccattcgac gggatcccca gggttctttc tggactagtc 781 ctacaccccg tggaaataaa tatttcatat ggattaataa aacaatgcac accatgggcg 841 tggaagttag aaatgtcgac tacaaagaca acggctactt tcaagtgata ctgcgtgata 901 gatttaatcg cccattggta gaaaaacata tttacatgcg tgtgtgccaa cgacccgcat 961 ccgtggatgt attggcccct ccagttctca gcggagaaaa ctacaaagca tcttgcatcg 1021 ttagacattt ttatcccccg ggatctgtct acgtatcttg gagacgtaac ggaaacattg 1081 ccacaccccg caaggaccgt gacgggagtt tttggtggtt cgaatctggc cgcggggcca 1141 cactagtatc cacaataacc ctcggaaact ctggactcga atctcctcca aaggtttcct 1201 gcttggtagc gtggaggcaa ggcgatatga taagcacatc gaatgctaca gctgtaccga 1261 cggtatatta tcacccccgt atctctctgg catttaaaga tgggtatgca atatgtacta 1321 tagaatgtgt tccctctggg attactgtga ggtggttagt tcatgatgaa ccccagccta 1381 acacaactta tgatactgtg gttacaggtc tctgcaggac catcgatcgt tatagaaatc 1441 tcgccagtcg gattccagtc caggacaact gggcgaaaac gaagtatacg tgcagactaa 1501 ttggatatcc gttcgacgtg gatagatttc aaaattccga atattatgat gcaacgccgt 1561 cggcaagagg aatgccgatg attgtaacaa ttacggccgt tctaggactg gccttgtttt 1621 taggtattgg tatcattatc acagccctat gcttttacct accggggcgg aattaagatt 1681 aaccatcgta tgtgatataa aaattattaa gtgttataac cgatcgcatt cttctgtttc 1741 gattcacaat aaataaaatg gtattgtaat cagcaccatc gcattgtttc gtagatgact 1801 catgttcagt ccgcgtgatg tcaaaaatac gtatttttgg tatcacgcag cggccaaaat 1861 gcccattatg ttatttttac tccaaacgcg gtatttaaaa catcgggacg tacatcatgt 1921 ggcgcacgtt aatcgtatac ggtgccgcta cattaaaaat cgcaagtctc cgaatatcaa 1981 gctcacggcc aaaacgtcgg taataatctt acgcatcgaa tgtgatacgg ataccgtaca 2041 atcgctgagt agatttccta tatagttact cagtagtgat acacaatcac aaaatcgctg 2101 gggtatatca tataagaatg atgtcgccca cccctgaaga tgatcgcgat ctcgttgtgg 2161 ttcgtggacg tctccgaatg atggatagcg gcacggaaac agatagagag caacgacatc 2221 cacgtacgac ttggcgatcg atctgttgtg ggtgtacgat aggaatggta tttaccatat 2281 tcgttctcgt agcggcagta ttgttgggat cactattcac tgtttcatac atggccatgg 2341 aatcgggaac atgtcccgat gaatggattg gtttgggtta tagttgcatg cgcgtggccg 2401 ggaaaaatgc aactgatctt gaggcgttgg atacatgtgc tcggcataac agcaaactta 2461 ttgacttcgc aaacgccaaa gttctggttg aagctatcgc cccattcggt gtgccaaatg 2521 cagcatatgg ggaagtcttc cggttaaggg acagcaaaac cacgtgtata cgacctacca 2581 tgggaggacc cgtgtcggca gactgtcctg taacatgtac cgttatatgt cagcgaccca 2641 ggcctctaag taccatgtct tccatcatta gagatgcccg cgtgtatctt catttagaac 2701 gacgcgatta ttatgaagtc tacgcctctg tcctctctaa tgcgatgagt aaataaaaac 2761 gcacctctaa cggttactgt gttttattta tccaatcaca ccatagacat tataaacata 2821 atatggatct ttatttcata taatgtatta tacagaccgc gcatcgcgtg taaaatatta 2881 ttcgctggac actttcagcc tacgtaaaat atgttctacc ttggacactg gtatatctcc 2941 acgcattgtt ttttcatgaa agtatttggt tgtcggagta gacaccctgc cagattccga 3001 aactggaggg tcattgctcg ccaaagaagg ccccccagac ccatcggtct tacgaattgc 3061 accagtcaga ccgtttccga tgttcgcaca ttgagaactc gaccccgccc ccgcaaattc 3121 agtcaaacca ggaaccgtat cgccacagcc acattcgaca cgatcgccta attgatgtgt 3181 gggcgtactt cccgacttgt tcgagttttt atcttgcttc cgaatagatt tcataagccg 3241 tgcattaaca tcgcttattc gctttggcag taattttatc cccagtagag ccagggcgtt 3301 cttagacagt ttaacatcat cggttgcagc gcggccgaac caagtattta tgctgcactc 3361 gagagccccc catgacacgg gcaacatgct cggaaacaaa ttcccaagct gatgttcaaa 3421 ttggtcttgc tgcaacaggg cctcttttaa tatagtattt tccgtattac cgtccaacca 3481 tatcacatat cccctctgag ttaaattaat taggtactga caatgatggt tgagaagtga 3541 aaccaacgaa accactgcta gtatggcacg tgagttccga tcaccatcaa agtccaccag 3601 gcgggtttgc gaccataata cgctggtttg tagtaagggg gcgatgacat tgcctataat 3661 tcgacaaagc aatgatgtcg aactccattt gtctgatggt aaggaggaaa tttcttccat 3721 gaattc // LOCUS HSKTK 2039 bp ds-DNA VRL 29-JUN-1990 DEFINITION Herpesvirus of turkey thymidine kinase gene, complete cds. ACCESSION M26659 KEYWORDS thymidine kinase. SOURCE Herpesvirus of turkey (strain FC126) DNA, passed in chicken embryo fibroblasts. ORGANISM Gallid herpesvirus type 2 Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 2039) AUTHORS Martin,S.L., Aparisio,D.I. and Bandyopadhyay,P.K. TITLE Genetic and biochemical characterization of the thymidine kinase gene from herpesvirus of turkeys JOURNAL J. Virol. 63, 2847-2852 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.L.Martin, 03-AUG-1989. FEATURES from to/span description pept 608 1540 thymidine kinase (EC 2.7.1.21) BASE COUNT 570 a 458 c 437 g 574 t ORIGIN 649 bp upstream of XbaI site. 1 atcgatgaaa gtctgtacat atagccttct tttttaatgg tgtcgaacgt ataactacaa 61 gcccgcgtct cgcgacaaac actaaaacgg ggcaaatgat agtatgatca gatcccctgg 121 gggccatgcg ctcgatcaac gttttagact ccagcaactg cttagtccct gttgcgcgtt 181 gctcattctt gctggccgtc ttgaggttgc ggtaaaatct gcaagttttc agttcgatta 241 tgacacagac attgttacag ccttccacca ataacccgcc agataaggtg aacatgcaaa 301 tgcaatctgg ccttctgcgc cccaggtcta cttcaaaggc caatcgaaac gccgacgccc 361 ctttaagtag ttcagaaggc aggatcttgc ttaatatctt ctgtaatgtt ctaccaactg 421 tattccccgc cgacgactgg tcaaagtttt cagcagcaag ctttttgtaa aaacgattat 481 gaccacggac acccgctttt agcaatcctg ccataaggtg gtttcgccgc gtgcttgcct 541 cgaagacaat tgccagctaa tccagcatta ccatatttcc ttggcttgca tttggatctg 601 cgcgtcgatg gcattgccga gaagaccgcc cacgttaacg cgagtttatc tagacggacc 661 gtttggtata ggcaaaacgt ctatactaaa cgctatgccc gaccacacgc ccgatggggc 721 tcctatattg aaagtgtacg aaccaatgaa atattggaga tgccagtcta ccgatttggt 781 ggtagctgcc aacgaaacgc cagaacgtag gcgtggtgga gctttatcac gattccaatc 841 tgacatgatc atggcatcta tacaagccag atttgccgat ccatatttgc tttttcacga 901 acggttatca tctaaatgta gaggaaaaat agaaatatgc gatactccag caattatatt 961 aatgctggat aggcaccctg tggcggcgat attatgtttc ccaatcactc gctatttact 1021 tggagaatat tctttggaaa tgttgattag ctctataata agacttccgt tggaatcccc 1081 cggatgcaac ctgacagtca caatccttcc cgacgaaaag gaacacgtta ataggatttg 1141 ttcaagagat agaccgggtg aaacggcaga tagaaatatg ctcagaacac tcaatgccgt 1201 atacgcatct ttggtggaca cggttaaata cgcaaatcta acatgccctt acgagaaaga 1261 aagctgggaa atggaatggt tgggacttcc ctggtttgaa gagtcattac ttgaagaatt 1321 catctcgcgt ccccgccctg ttatttgttc gagaactcga atgccgctgg accgaactct 1381 cctggccatt tttaaacgga aagagctgtg tagcgaaaat ggggagctgt taactcagta 1441 ttcttggata ttgtggggat tactgactaa actacacacc attaatgtcg aattatttga 1501 cattagcggt atgtcacgtc gagaatgcgc cacgctataa tgcatactat gccggagaga 1561 ttgtctactc tcgctagctg gaatgattta tgcgagcttg aagatgatgt aatttcctat 1621 aataagggaa tgtgtaacga ggttggagcg tctcgataat tcttcttaat ctgctggtat 1681 tggttactgc cataacttaa tattggtcca tgctagaata gtcatacgct acgatctgtt 1741 gctatatatg actatcgcca aactgttaaa ccgcgaagaa tatatttcat ataaacctaa 1801 gggcccctca gtctgatttt ttgtgaaaac gtgtatacat gaagttttac tgcataatcc 1861 gtttcatgat catagcgaat ctttattcat cttaccaaat atcgcttcca ggcacatatc 1921 catcgcaaat attgcttgac atgaagaact cgccgctcgt acgctttaat atatcgacgc 1981 gtgattataa agacgagaca ctctggatac ggaaaaattc gacatttgtt tatatcgat // LOCUS LACHSP 3095 bp ds-DNA BCT 29-JUN-1990 DEFINITION Plasmid pTR2030 (from L.lactis) aborts phage infection (hsp) gene, complete cds. ACCESSION M30192 KEYWORDS . SOURCE Plasmid pTR2030 (from Lactococcus lactis lactis (strain ME2)) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 3095) AUTHORS Hill,C., Miller,L.A. and Klaenhammer,T.R. TITLE Sequence of HSP gene from pTR2030 JOURNAL Unpublished (1989) North Carolina State Univ., Raleigh, NC 27695 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Hill, 29-NOV-1989. FEATURES from to/span description pept 1105 2991 hsp gene (aborts phage infection) BASE COUNT 1129 a 439 c 518 g 1009 t ORIGIN 1 ataagagtca gaatgtgacc gtcataatga ttacgaggat agctgcaacg cttggtttcg 61 agttaattgt cgtgcaacga ttgacggaga attgaaaaat ttcaaagttc acgatgttga 121 gacttacgat aaaaagaaaa atagatttca tcggagattg tctgatgcct tggtcccaat 181 aatatcttca gaagatgttg aatttgaagc cgagcaattt ttgaaattgt atttcccagt 241 agctatggaa attcctcagc gtattgaccc gcttttgata gctgaaaaaa tgggacttac 301 ggttgagtac catgaaattt cagaagatgg gaacattttt ggtcaaattt attttcatga 361 tgctcttctt gacgggaaag aaatcaaggc aaaaactatc ctgattgatc ctcgtgtcat 421 cgaaagccgt gggattggtg gacttaataa cacaataatg cacgagtgtg tccattggca 481 taaacatagg cttgcatttg aattggttag gcttttccaa ccagagcttt cgaatattac 541 cacaactaaa gaagagttcg atgggctcat tgagaaaaat atgactccaa cagattggct 601 ggagattcaa gctagaaaga ttacgcctaa aatcttgatg cctaagaaga tgtttaaaca 661 agaagttgaa acgtttatgc gtcctgatgg tggaagtgga atagtggacc agctcctcat 721 aatcgaggga actatttcag agcttgctag ttttttcacc gtatcaaaat tatctgcaaa 781 aattcggatg gttgagctgg gatatgagat tgcgataggt gcttcaaact atgttgatgg 841 gcatcctgta cccccacact cttggaagca gggagctgta tctgctaatc agacttattc 901 aattggtttt gttgatgcaa ctatagagac attaaagaac ccaagattac tcgtcgcaat 961 aaaaaaagaa cttaacctca tatttcatag ggattgaagt tcttgttttt tatttttggt 1021 agctaaaatg gtatccaatt aataattagc tattttattt tggtaaaata gaataaatag 1081 atttttagaa cagggagtag gtaaatgata actttgcaac accaagattg ggaacgtgca 1141 gtcaatatga ttaaaaacat tccaccaagt gccaaaaata aatactttca aacttttcct 1201 ttctttttgt taagtgaaac cagctgggaa gagttacttt ctgaaaattt tttctactca 1261 tatataaagt caggagagtt tctgacttat caagaaaacc tttcattcta cgataggaca 1321 attcaaaaga gccatggagc ataccgacaa actcgaattg tatctccaat catctatata 1381 tttttaatag caattgcttc tcaagtggaa agaatatatg ttgaaaagcg tactaatgat 1441 atgtcagtat atttttcagg ttcttttgaa aaagaaaaaa acactgcaca ttataaacaa 1501 agttataaca cttatatgac tgaacttaat gcctgtcaag aagagtttga ttattatttt 1561 caaacagatt tttcaacatt ttttcatcta gtagatacag acaacttatt caataaaata 1621 gaccgattag atccgaaaag tgcattagtc tactcttccc taataaaaat gattggacaa 1681 ggaagaatgc ccatagttga tggaaattca ggtctttctt ttttaaatac tgtagtttat 1741 ctagatgact ttgataaaga aattattgat tccttaaaaa cgatagttga aatcgaaagt 1801 tttaaactag ttcgatatgt agatgactta catattttca taaaatgtgc aaataaagat 1861 ttagattttt taaattataa agtatataat ttgttatgcg aaaaagcaac gaaacatcat 1921 cttgagataa attcttcaaa aacaaaatca tttactccca caagtgaact aagcactaaa 1981 atgaatacag atttgtataa tttctttgtg tataacgaag atgttgattt tgagcaatat 2041 ttttcaaaaa ataccttaat agaatttcta gataaattaa ataatatgtc tgttaatgct 2101 gatttttcag agtacgaaaa agaagtcctg tatacattgg aaaaccctga aatagtatct 2161 gacggaagtt atatattaaa cgctattgtt tacaataaaa gtacgtggtc acaggattat 2221 gatataaaaa ataaaattag tttattagta aatagtaatt atagaaaact aagatattct 2281 gcaaaagcac tcataactct tgtcttgaat acaagagatg gtgatatcat aaaaggacta 2341 cttaataatt tatttactac attcaaaaat ggtacaaatg atattattga tgaaattata 2401 ttaattgaat acttggtaca aagaaagttt aatcataaag atttaatgac tatcttgaaa 2461 gctgacgacc atggtattaa ggaatatatt aaggcatatc aaacttctga tttcataaaa 2521 agtctagaaa aaaataaagt tattttttat actaatcaga aagaagttta tcctttaata 2581 agtaaggata aaatactcaa tttcatatat tttagagcta agtattttga atccttagac 2641 ctagttcttg aatcctttgc atattataaa aactattttg ataggtttgt tgctcacgct 2701 atgttctgta caggaattga ctctggaaga aaacctaatt ataagctata ttatacagaa 2761 ggaaagttaa tagatgggtt aaaacaacta aattttttat cttctgatga gattacaaaa 2821 ataattaatg aagcacataa aattagaaat agtaaccccg taagtcactc tagtgctggc 2881 ttacttcaaa acgaagattt tagtagatat agagttaagt caagtcttaa tgacttgaaa 2941 attatcatag aacaactttc aacattgcta caaaataaaa acagattata aaatattatt 3001 atctgtttta tttttttcaa aagaatcaaa gtttattaga tttttcaact gtcctgttat 3061 ctcctcatcc gcatttggat aaagatgact ataaa // LOCUS PH15RER 2016 bp ds-DNA PHG 29-JUN-1990 DEFINITION Bacteriophage phi-15 right early region containing genes 17, 16.5-16.9, complete cds. ACCESSION M28830 KEYWORDS DNA replication protein. SOURCE Bacteriophage phi-15 DNA. ORGANISM Bacteriophage phi-15 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 2016) AUTHORS Benes,V., Arnold,L., Smrt,J. and Paces,V. TITLE Nucleotide sequence of the right early region of Bacillus phage phi-15 and comparison with related phages: Reorganization of gene 17 during evolution JOURNAL Gene 75, 341-347 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 203 664 gene 17 peptide pept 666 992 gene 16.9 peptide (put.) pept 989 1306 gene 16.8 peptide (put.) pept 1308 1700 gene 16.7 peptide (put.) pept 1697 1861 gene 16.6 peptide (put.) pept 1858 1971 gene 16.5 peptide (put.) BASE COUNT 744 a 374 c 382 g 516 t ORIGIN 1 aaagtagggt acagcgacaa cataccccat ttccccattg accgactatc ttcgacaaca 61 atctaacaac taaaacacaa ctatatacct atactaaata acaacatcaa tttgtcgaaa 121 agggtagaca aactactgtt taacatggta tactataata gaagtaagga agtaagacaa 181 ccaatcatag gaggaattac acatgaacaa ctaccaatta actattaacg aggtaatcga 241 gatcatcaac cgcaatacag acattaataa gttagtcgct aaaaaggaca acttatatcc 301 aacagatttg tatgacttag acaaacaaca attaatcgct atcattctca acagcgattt 361 cgcattatct agcatcaaaa gagcattatt agaggtaact gtcgaagagc taagagaaca 421 ggacaatgat gacgatttag atgaaataga taacgagtta tacgagggcg cagaggcggc 481 tgatgttcca cgtgaaacaa tagtgaaagt attcgaggca gacaaatcaa tagtaacgtt 541 caacggtgag aaactaaaac attatgttaa tgttgatgac aatagctcat ctgttgatga 601 agtaaagaag atcgccaagg aaatatcaga acatgacttt aacgaaaaca aggagaacaa 661 gtaacatgag cgtacaactt aatgcattta cattcatctt agaacgccgt ggatggcgca 721 tggtatgcta cgaacaatta acaacgaacg ggacacgcat attacatttc tatctcaaag 781 ataacccaac gttctttgct acatactcat cacaattctt atcagacacc aaaatgataa 841 gacgctttgc atcatggagc gggcaattac ttgaaggttc aaactctgtg ttctggacta 901 acataacacc atttgagcca attgatgaag aaacagcaga agacatcaag aatctcgaca 961 aagtcgttga aggaatgaat tttacgttat gattgacatt attgtaaaag aggacaagcg 1021 cttaataact gttcaaacac cagagggaga cgaagtattt tacactttgt ctttctcaga 1081 cgagcataag ttactaaaac gatcaagtgc cagactaaga aacaacatat atgccattgg 1141 tgtagcaaat atcagatggg tgttagttga catggataac atgatactta gtgagtatat 1201 gcatcacgtt gacatcctaa aagacataga caggaaaatg agacaattag ggtatatagt 1261 catttcagaa tggcaacacg caaataaaaa agggacgagg agataatatg gaagctattt 1321 tgatgatcgg tgtaatcaca ttgtgcgtta tattcctttt atcaggacgc aataataaaa 1381 agaaacaaga aataagggag ctagaagatt accttgaaga cctcaatcaa agaatcgtgc 1441 aacgaacaca gatacttagt gagcttaacg aagtcatcac aaacagaagt gttgataagt 1501 cagtcaatat gtcagcttgt gaaatcgccg tccttgacct ctatgagcag tcaaatatcc 1561 gcattcctag tgatatcatc gaagatatgg ttaatcaacg tttacaatca gaacaagatg 1621 tgttaaacta tatagagaca cagcggacat actggaaact ggagaatcag aaaaaactat 1681 atcgggggtc attgaaatga agttgctcac tcacacgtgt cattactgta gcttctcttt 1741 cttcactcga aagtttgatg tatttggtgc aataaccaag aaagatactc ctgttgtttt 1801 ctgccccgca tgtgggaatc aatccctttc agtatcacac attgaggagg agatcatatg 1861 aatcagaaag aattccaagc cgttttagac tggatgttgt cacacactat tatacaattc 1921 cacgaataca actatatgct acaaaagagc ctaccgttcc tcagacgata ggcttttctt 1981 ttgattgttt cacgtggaac atcactgtat acgcat // LOCUS STLENV 4122 bp ss-RNA VRL 29-JUN-1990 DEFINITION Simian T-cell leukemia virus, pol-env-pX-3' LTR region. ACCESSION M11373 M11374 KEYWORDS env gene; pol gene. SOURCE Simian T-cell leukaemia virus genomic RNA. ORGANISM Simian T-cell lymphotropic virus type I Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 4122) AUTHORS Watanabe,T., Seiki,M., Tsujimoto,H., Miyoshi,I., Hayami,M. and Yoshida,M. TITLE Sequence homology of the simian retrovirus genome with human T-cell leukemia virus type I JOURNAL Virology 144, 59-65 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 311 1777 env protein BASE COUNT 905 a 1456 c 741 g 1020 t ORIGIN 1 catttaaacg tattaaccca ttgccacaaa acccgatggc aacttcacca ctccctccgt 61 ctgcaaccta tcccagagac acactccccc atcagtaaac aaacccattg gtattatttc 121 aagcttcctg gtcttaatag ccgccagtgg aaaggaccac aggaagctct ccaagaagcc 181 gccggcgctg ctctcattcc ggtaagcgct aattctgccc agtggatccc gtggaggctc 241 ctcaagcaag ctgcatgccc aagacccgtc gggggccccg ccgatcccaa agaaaaagac 301 caccaacacc atgggtaagt ttcttaccac cttactttta ttcctccaac tccgcccccc 361 cattgtctgt gatcacagtc ccagctgctg cactctcaca gtcggagtct cctcatatca 421 ttctaaaccc tgcaatcccg cccaaccagt ctgctcatgg accctcgacc tgctagctct 481 ttcagcagac caagccctac aacccccctg ccctaatctg gtgggttact ccaactacca 541 tgctacctat tccctatatc tattccctca ttggattaaa aagccaaacc gaaatggcgg 601 aggctattat tcggcctctt attcagaccc ttgttcccta aagtgccctt acctaggatg 661 ccaatcatgg acctgtccct acacaggagc cgtctccagc ccctattgga aatatcagca 721 agatgtcaat ttcactcaag aagtctcacg cctcagcctt aatctccact tttcaaaatg 781 cggtttctcc ttctcccttc tggtcgatgc actaggatac gaccccatct ggttccttaa 841 caccgaacct agccaactgc cccccactgc ccttcctcta ctcccccatt ccaacttgga 901 ccacatcctc gagccttcca taccatggaa gtcaaaactc ctaactcttg tccaattaac 961 tctacaaagc actaactaca cttgtattgt ctgcatagat cgtgccagct tatctacttg 1021 gcacgtctta tactctccca acatctctgt tccatcctcc ccctctactc ccctccttta 1081 cccatcgtta gcgcttccag ccccccatct gacattacca ttcaactgga ctcactactt 1141 tgaccctcag attcaagcca tagtctcctc cccctgtcac aactccctca tcttgccccc 1201 cttttctctg tcacctgttc ccgccccagg gtcccgttcc cgccgggcag taccggtggc 1261 ggtctggctc gtctctgccc tggccatggg aaccggagtt gctggcggga ttaccggctc 1321 catgtccctc gcctcaggaa aaagtctcct acatgaagta gacaaagata tttcccaatt 1381 gacccaagca attgtcaaaa accacaaaaa tctactcaag attgcacaat atgctgccca 1441 aaacagacga ggccttgatc tcctgttttg ggagcaagga ggactatgta aagcactgca 1501 agaacaatgt tgtttcctaa atattactaa ttcccatgtc tcaatacttc aagaaagacc 1561 cccccttgaa aatcgagttc taactggttg gggcctcaac tgggaccttg gcctctcgca 1621 atgggcccga gaggccttac aaactggaat cacccttgtc gcattactcc ttctggtcat 1681 ccttgtaggg ccatgcatcc ttcgccaact acgacaactc ccctcgcgtg tcagataccc 1741 ccactactct cttataaacc ccgagtcctc cctataaacc tagcagttac tgcgaccaca 1801 ttgcttgcag cctcccctgc caatagttaa cctcccccat caaatcctcc ttttcctgca 1861 acaacctctt ccgttcggcc tccaaggagt ctacctcact ctccagctgt ctagcataac 1921 catcaactcc caactccttt attttttctt tcctagcatc acgctgtctc gccctcttag 1981 ccccttgttc tgccagtctc cgcttgcgct caagccgctt ctcctcctcc cgcctcctgc 2041 gccgagcctt cttctcttcc ttccttttca ggaactcggc aatctgttcc tcctcctctt 2101 tctcccgctc tctttttcgc ttcctcttct cctcagcccg tcgctgccga tcacgatgcg 2161 tccccccgcg aggtggcgtc tcctccctta gagggacccg ttgctgctgg ccattacatt 2221 cctcttcgag ggatagtaaa ccgtcaagca cagactcctc ctccgtcctt tgtttgttta 2281 agtcctcttc tagggataat agtccgtcca ccaagtcttc caccagcagg tcctccgggc 2341 agggaacagg taaacatcga aacggcccta cacatacaaa attaatcata cttattatca 2401 gcccacttcc caggttttgg acagagcctt ctttatggat acccagtcta cgtatttgga 2461 aactgtgtgc aaggcgactg gtgccccatc tctgggggac tatgctcggc ccgcctgcac 2521 cgtcacgccc tactggccac ctgtccagag catcagatca cctgggaccc caccgatgaa 2581 cgcgttatcg gctcagctct gcagttcctt atccctcgac tcccctccct ccccacccaa 2641 agaacctcta agaccctcaa ggtccttacc ccgccggcta ctcatacaac ccccaacatt 2701 ccaccctcct tcttccaggc cgtacgaaaa tactccccct tccgaaacgg atacatggag 2761 cccacccttg ggcaacaact cccaaccctg tcctttcccg accccggcct ccggccccaa 2821 aacttgtata ccctctgggg aaactccgtt gtctgcatgt acctctacca actttccccc 2881 cccatcacct ggcccctcct gccccatgta attttctgcc acccaggcca acttggggcc 2941 tttctcacca acgttcccta caaacgaatg gaagaacttc tctataaaat tttccttaat 3001 acaggggcca caatcattct acccgaggac tgcttgccca ccaccctatt ccgacccact 3061 agggcacccg cctcgctgac agccaggcaa aatggcctcc ttccgttcca gtcaaccctt 3121 actacgccag gccttatttg gacatttacc gatggcacgc ccatggtttc cgggccctgc 3181 cctagggacg gtcagccatc tttggtgttg cagtcctcct cctttatatt tcacaaattt 3241 caaaccaagg cctaccatcc ctctttccta ctctcacatg gcctcataca gtactcgtcc 3301 tttcataact tacatctcct ctttgaagag tacaccaata ttcccatttc tttacttttt 3361 aacaaagaag aggcaaatga tactgaccat gagccccaaa tgctccccgg gggcctaaaa 3421 cctcccaatg gaaaacattt ccgcgaaaca gatgtctgaa aaaggtcagg gcccagacta 3481 gggctctgac gtctcccccc ggagggcagc tcaacacaag gccctgacgt gtccccctgg 3541 ggacaaatca tgagcccaga cctccgggaa gccaccgaga accacccatt tcctccccat 3601 gtttgtcaag ccgtccccag gcgttgacga caacccctcg cctcaaaaac ttttcatggc 3661 acgcatacag ctcaataaaa caacaggagt ctataaaagc gtggagacag ttcaggaggg 3721 ggctagcttc tgctctacac gcgcccgccg ccctacctga ggccgccatc cacaccggtt 3781 gagttgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgaccg ccgtctgggt 3841 aagttcggag ctcgggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag 3901 attcggccgg ctctccacgc tttgcctgac cctgcttgct taactccaca tctttgtttc 3961 gttttctgtt ccgcgccgct gcagatcgaa agttccaccc ctttcccttt cattcacgtc 4021 tgactgccgg cttggcccac ggccaagcac cggcatcctc actggctcgg agccagtagc 4081 agcccatttt acaactctct ccagagagaa gtttagtaca ca // LOCUS FIVCG 9474 bp ss-RNA VRL 29-JUN-1990 DEFINITION Feline immunodeficiency virus, Petaluma strain, complete genome. ACCESSION M25381 M25729 KEYWORDS complete genome; env gene; gag gene; pol gene. SOURCE Feline immunodeficiency virus, Petaluma strain, clones FIV-14 [1], [3], and 34TF10 [2]. REFERENCE 1 (bases 1 to 355) AUTHORS Olmstead,R.A., Barnes,A.K., Yamamoto,J.K., Hirsch,V.M., Purcell,R.H. and Johnson,P.R. TITLE Molecular cloning of feline immunodeficiency virus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2448-2452 (1989) STANDARD full automatic REFERENCE 2 (bases 1 to 9472) AUTHORS Talbott,R.L., Sparger,E.E., Lovelace,K.M., Fitch,W.M., Pedersen,N.C., Luciw,P.A. and Elder,J.H. TITLE Nucleotide sequence and genomic organization of feline immuno- deficiency virus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 5743-5747 (1989) STANDARD full automatic REFERENCE 3 (bases 1 to 9474) AUTHORS Olmstead,R.A., Hirsch,V.M., Purcell,R.H. and Johnson,P.R. TITLE Nucleotide sequence analysis of feline immunodeficiency virus: genome organization and relationship to other lentiviruses JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8088-8096 (1989) STANDARD full automatic REFERENCE 4 (bases 7175 to 7182; revision) AUTHORS Elder,J.H. and Scripps Institute LaJolla,CA. JOURNAL Unpublished (1990) STANDARD full automatic COMMENT [2] sequence kindly provided in computer-readable form by John Elder, Scripps Institute, La Jolla, CA. [3] sequence kindly provided in computer-readable form by Robert Olmstead, NIAID, NIH, Bethesda, MD. [1] contains the LTR sequence from the complete genome in [3]. The sequence from [3] is shown (with differences from [2]); the substantive difference from [2] is at position 6123 where the latter possesses an early stop codon in orf2, which is thought to encode the transactivator protein. Clone 34TF10 is, nevertheless, biologically active as described in [2]. A difference earlier reported at 7175-7182 turns out to have been a typo [2,4]. Orf1 is weakly homologous (PLSEARCH) with the Q orf of visna and has no detectable similarity to the vif protein of HIV and SIV. This is consistent with the clustering of FIV with the ungulate lentiviruses [2],[3]. Several "urfs" are identified by [2] and [3] of which only orf3, possibly analogous to the L orf of visna, is annotated below; a small, possibly associated, orf is found at 8955-9170. For a fuller discussion of the reading frames, see [2] and [3]. FEATURES from to/span description pept 628 1980 gag polyprotein pept / 1869 5243 pol polyprotein (NH2 terminus uncertain) (AA at 1869) pept 5236 5991 orf1 pept 5992 6228 orf2 pept 6266 8836 env polyprotein pept 6712 6918 orf3 LTR 1 355 5' LTR LTR 9120 9474 3' LTR rpt 217 281 R repeat 5' copy rpt 9336 9400 R repeat 3' copy binding 357 377 primer binding site variant 12 12 t in [3]; c in [2] variant 122 122 c in [3]; t in [2] variant 146 146 g in [3]; t in [2] variant 373 374 ga in [3]; a in [2] variant 498 501 ggac in [3]; agca in [2] variant 1575 1575 g in [3]; a in [2] variant 1840 1840 g in [3]; c in [2] variant 1977 1977 a in [3]; g in [2] variant 2028 2028 g in [3]; a in [2] variant 2033 2033 c in [3]; t in [2] variant 2288 2288 a in [3]; g in [2] variant 2464 2464 g in [3]; a in [2] variant 2490 2490 c in [3]; t in [2] variant 2580 2580 c in [3]; t in [2] variant 2631 2631 t in [3]; c in [2] variant 2735 2735 a in [3]; g in [2] variant 3733 3733 a in [3]; g in [2] variant 3743 3743 a in [3]; t in [2] variant 3812 3812 g in [3]; a in [2] variant 4202 4202 g in [3]; a in [2] variant 4211 4211 c in [3]; t in [2] variant 4271 4271 c in [3]; t in [2] variant 4493 4493 a in [3]; g in [2] variant 4540 4540 g in [3]; t in [2] variant 4706 4706 c in [3]; t in [2] variant 4891 4891 g in [3]; a in [2] variant 5332 5332 g in [3]; a in [2] variant 5497 5497 t in [3]; c in [2] variant 5559 5559 t in [3]; c in [2] variant 5986 5986 t in [3]; c in [2] variant 6000 6000 c in [3]; a in [2] variant 6123 6123 g in [3]; a in [2] variant 6149 6149 t in [3]; c in [2] variant 6179 6179 a in [3]; g in [2] variant 6238 6238 g in [3]; t in [2] variant 6245 6248 attt in [3]; ttt in [2] variant 6306 6306 t in [3]; c in [2] variant 6430 6430 c in [3]; t in [2] variant 6548 6548 a in [3]; c in [2] variant 6739 6739 c in [3]; t in [2] variant 6753 6753 t in [3]; c in [2] variant 6758 6758 a in [3]; g in [2] variant 6764 6764 a in [3]; g in [2] variant 6779 6779 a in [3]; g in [2] variant 6782 6783 aa in [3]; gg in [2] variant 7175 7182 acaggagg in [3]; caggagga in [2] variant 7381 7381 a in [3]; g in [2] variant 7484 7484 g in [3]; a in [2] variant 7512 7513 ag in [3]; ca in [2] variant 7618 7618 a in [3]; c in [2] variant 7625 7625 g in [3]; a in [2] variant 7670 7670 a in [3]; g in [2] variant 7704 7708 aaaaa in [3]; caaat in [2] variant 7750 7750 t in [3]; c in [2] variant 7808 7810 atg in [3]; gta in [2] variant 7824 7828 aaaag in [3]; caaaa in [2] variant 7996 7996 g in [3]; a in [2] variant 8054 8054 g in [3]; c in [2] variant 8129 8129 g in [3]; a in [2] variant 8222 8222 g in [3]; a in [2] variant 8238 8238 a in [3]; g in [2] variant 8329 8329 g in [3]; a in [2] variant 8377 8377 t in [3]; c in [2] variant 8394 8394 c in [3]; t in [2] variant 8548 8550 gaa in [3]; aac in [2] variant 8588 8588 g in [3]; a in [2] variant 9131 9131 t in [3]; c in [2] variant 9241 9241 c in [3]; t in [2] variant 9265 9265 g in [3]; t in [2] BASE COUNT 3571 a 1370 c 2119 g 2414 t ORIGIN 5' terminus of 5' LTR 1 tgggatgagt attggaaccc tgaagaaata gaaagaatgc ttatggacta gggactgttt 61 acgaacaaat gataaaagga aatagctgag catgactcat agttaaagcg ctagcagctg 121 cctaaccgca aaaccacatc ctatggaaag cttgctaatg acgtataagt tgttccattg 181 taagagtata taaccagtgc tttgtgaaac ttcgaggagt ctctttgttg aggacttttg 241 agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 301 tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg 361 cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc 421 tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg 481 agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata 541 aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 601 acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 661 gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 721 gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 781 gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga 841 gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 901 gtagcaggac ttttaaatat gacggtgtct actgctgctg cagctgaaaa tatgtattct 961 caaatgggat tagacactag gccatctatg aaagaagcag gtggaaaaga ggaaggccct 1021 ccacaggcat atcctattca aacagtaaat ggagtaccac aatatgtagc acttgaccca 1081 aaaatggtgt ccatttttat ggaaaaggca agagaaggac taggaggtga ggaagttcaa 1141 ctatggttta ctgccttctc tgcaaattta acacctactg acatggccac attaataatg 1201 gccgcaccag ggtgcgctgc agataaagaa atattggatg aaagcttaaa gcaactgaca 1261 gcagaatatg atcgcacaca tccccctgat gctcccagac cattacccta ttttactgca 1321 gcagaaatta tgggtatagg attaactcaa gaacaacaag cagaagcaag atttgcacca 1381 gctaggatgc agtgtagagc atggtatctc gaggcattag gaaaattggc tgccataaaa 1441 gctaagtctc ctcgagctgt gcagttaaga caaggagcta aggaagatta ttcatccttt 1501 atagacagat tgtttgccca aatagatcaa gaacaaaata cagctgaagt taagttatat 1561 ttaaaacagt cattgagcat agctaatgct aatgcagact gtaaaaaggc aatgagccac 1621 cttaagccag aaagtaccct agaagaaaag ttgagagctt gtcaagaaat aggctcacca 1681 ggatataaaa tgcaactctt ggcagaagct cttacaaaag ttcaagtagt gcaatcaaaa 1741 ggatcaggac cagtgtgttt taattgtaaa aaaccaggac atctagcaag acaatgtaga 1801 gaagtgaaaa aatgtaataa atgtggaaaa cctggtcatg tagctgccaa atgttggcaa 1861 ggaaatagaa agaattcggg aaactggaag gcggggcgag ctgcagcccc agtgaatcaa 1921 atgcagcaag cagtaatgcc atctgcacct ccaatggagg agaaactatt ggatttataa 1981 attataataa agtaggtact actacaacat tagaaaagag gccagaaata ctcatatttg 2041 taaatggata tcctataaaa tttttattag acacaggagc agatataaca attttaaata 2101 ggagagattt tcaagtaaaa aattctatag aaaatggaag gcaaaatatg attggagtag 2161 gaggaggaaa gagaggaaca aattatatta atgtacattt agagattaga gatgaaaatt 2221 ataagacaca atgtatattt ggtaatgttt gtgtcttaga agataactca ttaatacaac 2281 cattattagg gagagataat atgattaaat tcaatattag gttagtaatg gctcaaattt 2341 ctgataagat tccagtagta aaagtaaaaa tgaaggatcc taataaagga cctcaaataa 2401 aacaatggcc attaacaaat gaaaaaattg aagccttaac agaaatagta gaaagactag 2461 aaagagaagg gaaagtaaaa agagcagatc caaataatcc atggaataca ccagtatttg 2521 ctataaaaaa gaaaagtgga aaatggagaa tgctcataga ttttagagaa ttaaacaaac 2581 taactgagaa aggagcagag gtccagttgg gactacctca tcctgctggt ttacaaataa 2641 aaaaacaagt aacagtatta gatatagggg atgcatattt caccattcct cttgatccag 2701 attatgctcc ttatacagca tttactttac ctagaaaaaa taatgcggga ccaggaagga 2761 gatttgtgtg gtgtagtcta ccacaaggct ggattttaag tccattgata tatcaaagta 2821 cattagataa tataatacaa ccttttatta gacaaaatcc tcaattagat atttaccaat 2881 atatggatga catttatata ggatcaaatt taagtaaaaa ggagcataaa gaaaaggtag 2941 aagaattaag aaaattacta ttatggtggg gatttgaaac tccagaagat aaattacagg 3001 aagaaccccc atatacatgg atgggttatg aattacatcc attaacatgg acaatacaac 3061 agaaacagtt agacattcca gaacagccca ctctaaatga gttgcaaaaa ttagcaggaa 3121 aaattaattg ggctagccaa gctattccag acttgagtat aaaagcatta actaacatga 3181 tgagaggaaa tcaaaaccta aattcaacaa gacaatggac taaagaagct cgactggaag 3241 tacaaaaggc aaaaaaggct atagaagaac aagtacaact aggatactat gaccccagta 3301 aggagttata tgctaaatta agtttggtgg gaccacatca aataagttat caagtatatc 3361 agaaggatcc agaaaagata ctatggtatg gaaaaatgag tagacaaaag aaaaaggcag 3421 aaaatacatg tgatatagcc ttaagagcat gctataagat aagagaagag tctattataa 3481 gaataggaaa agaaccaaga tatgaaatac ctacttctag agaagcctgg gaatcaaatt 3541 taattaattc accatatctt aaggccccac ctcctgaggt agaatatatc catgctgctt 3601 tgaatataaa gagagcgtta agtatgataa aagatgctcc aataccagga gcagaaacat 3661 ggtatataga tggaggtaga aagctaggaa aagcagcaaa agcagcctat tggacagata 3721 caggaaagtg gcaagtgatg gaattagaag gcagtaatca gaaggcagaa atacaagcat 3781 tattattggc attaaaagca ggatcagagg agatgaatat tataacagat tcacaatatg 3841 ttataaatat tattcttcaa caaccagata tgatggaggg aatctggcaa gaagttttag 3901 aagaattgga gaagaaaaca gcaatattta tagattgggt cccaggacat aaaggtattc 3961 caggaaatga ggaagtagat aagctttgtc aaacaatgat gataatagaa ggggatggga 4021 tattagataa aaggtcagaa gatgcaggat atgatttatt agctgcaaaa gaaatacatt 4081 tattgccagg agaggtaaaa gtaataccaa caggggtaaa gctaatgttg cctaaaggat 4141 attggggatt aataatagga aaaagctcga tagggagtaa aggattggat gtattaggag 4201 gggtaataga cgaaggatat cgaggtgaaa ttggagtaat aatgattaat gtatcaagaa 4261 aatcaatcac cttaatggaa cgacaaaaga tagcacaatt aataatattg ccttgtaaac 4321 atgaagtatt agaacaagga aaagtagtaa tggattcaga gagaggagac aatggttatg 4381 ggtcaacagg agtattctcc tcttgggttg acagaattga ggaagcagaa ataaatcatg 4441 aaaaatttca ctcagatcca cagtacttaa ggactgaatt taatttacct aaaatggtag 4501 cagaagagat aagacgaaaa tgcccagtat gcagaatcag aggagaacaa gtgggaggac 4561 aattgaaaat agggcctggt atctggcaaa tggattgcac acactttgat ggcaaaataa 4621 ttcttgtggg tatacatgtg gaatcaggat atatatgggc acaaataatt tctcaagaaa 4681 ctgctgactg tacagttaaa gctgtcttac aattgttgag tgctcataat gttactgaat 4741 tacaaacaga taatggacca aattttaaaa atcaaaagat ggaaggagta ctcaattaca 4801 tgggtgtgaa acataagttt ggtatcccag ggaacccaca gtcacaagca ttagttgaaa 4861 atgtaaatca tacattaaaa gtttggattc ggaaattttt gcctgaaaca acctccttgg 4921 ataatgcctt atctctcgct gtacatagtc tcaattttaa aagaagaggt aggataggag 4981 ggatggcccc ttatgaatta ttagcacaac aagaatcctt aagaatacaa gattattttt 5041 ctgcaatacc acaaaaattg caagcacagt ggatttatta taaagatcaa aaagataaga 5101 aatggaaagg accaatgaga gtagaatact ggggacaggg atcagtatta ttaaaggatg 5161 aagagaaggg atattttctt atacctagga gacacataag gagagttcca gaaccctgcg 5221 ctcttcctga aggggatgag tgaagaagat tggcaggtaa gtagaagact ctttgcagtg 5281 ctccaaggag gagtaaatag cgctatgcta tacatatcta ggctacctcc ggatgaaaga 5341 gaaaagtata aaaaagactt caagaaaaga ctttttgaca cagaaacagg atttataaag 5401 agactacgga aagctgaagg aataaaatgg agctttcata ctagagatta ttacatagga 5461 tatgtcagag aaatggtggc aggatccact acatcattaa gtctaaggat gtatatatat 5521 ataagtaacc cactatggca ttctcagtat cgtccaggtt tgaaaaattt caataaggaa 5581 tggccttttg taaatatgtg gataaaaaca ggatttatgt gggatgatat tgaaaaacaa 5641 aatatttgta taggaggaga agtttcacca ggatggggac cagggatggt aggtatagca 5701 ataaaagctt ttagttgtgg cgaaagaaag attgaggcta ctcctgtaat gattataaga 5761 ggagaaatag atccaaaaaa atggtgcgga gattgttgga atttaatgtg tcttagaaac 5821 tcacctccaa agactttaca aagactcgct atgttggcgt gtggcgtgcc ggctaagaag 5881 tggcgaggat gctgtaatca acgctttgtt tctccttaca gaacgcctgc tgatttagag 5941 gtcattcaat ccaagcccag ctggaacctg ttatggtcgg gagaattatg aatggaagac 6001 ataatagtat tattcaatag ggtcactgag aaactagaaa aagaattagc tatcagaata 6061 tttgtattag cacatcaatt agaaagggac aaagctatta gattactaca aggattattt 6121 tggagatata gatttaagaa accccgagta gattattgtt tatgttggtg gtgttgcaaa 6181 ttctattatt ggcagttgca atctacatta tcaataacta ctgcttagaa atatttagat 6241 taatatttca tttgcaacaa taagaatggc agaaggattt gcagccaata gacaatggat 6301 aggactagaa gaagctgaag agttattaga ttttgatata gcaacacaaa tgagtgaaga 6361 aggaccacta aatccaggag taaacccatt tagggtacct ggaataacag aaaaagaaaa 6421 gcaaaactac tgtaacatat tacaacctaa gttacaagat ctaaggaacg aaattcaaga 6481 ggtaaaactg gaagaaggaa atgcaggtaa gtttagaaga gcaagatttt taaggtattc 6541 tgatgaaagt gtattgtccc tggttcatgc gttcatagga tattgtatat atttaggtaa 6601 tcgaaataag ttaggatctt taagacatga cattgatata gaagcacccc aagaagagtg 6661 ttataataat agagagaagg gtacaactga caatataaaa tatggtagac gatgttgcct 6721 aggaacggtg actttgtacc tgattttatt tataggaata ataatatatt cacagacaac 6781 caacgctcag gtagtatgga gacttccacc attagtagtc ccagtagaag aatcagaaat 6841 aattttttgg gattgttggg caccagaaga acccgcctgt caggactttc ttggggcaat 6901 gatacatcta aaagctaaga caaatataag tatacgagag ggacctacct tggggaattg 6961 ggctagagaa atatgggcaa cattattcaa aaaggctact agacaatgta gaagaggcag 7021 aatatggaaa agatggaatg agactataac aggaccatca ggatgtgcta ataacacatg 7081 ttataatgtt tcagtaatag tacctgatta tcagtgttat ttagatagag tagatacttg 7141 gttacaaggg aaaataaata tatcattatg tctaacagga ggaaaaatgt tgtacaataa 7201 agttacaaaa caattaagct attgtacaga cccattacaa atcccactga tcaattatac 7261 atttggacct aatcaaacat gtatgtggaa tacttcacaa attcaggacc ctgaaatacc 7321 aaaatgtgga tggtggaatc aaatggccta ttataacagt tgtaaatggg aagaggcaaa 7381 agtaaagttt cattgtcaaa gaacacagag tcagcctgga tcatggttta gagcaatctc 7441 gtcatggaaa caaagaaata gatgggagtg gagaccagat tttgaaagta aaaaggtgaa 7501 aatatctcta cagtgcaata gcacaaaaaa cctaaccttt gcaatgagaa gttcaggaga 7561 ttatggagaa gtaacgggag cttggataga gtttggatgt catagaaata aatcaaaact 7621 tcatgctgaa gcaaggttta gaattagatg tagatggaat gtagggagta atacctcgct 7681 cattgataca tgtggaaaca ctcaaaaagt ttcaggtgcg aatcctgtag attgtaccat 7741 gtattcaaat aaaatgtaca attgttcttt acaaaacggg tttactatga aggtagatga 7801 ccttattatg catttcaata tgaaaaaggc tgtagaaatg tataatattg ctggaaattg 7861 gtcttgtaca tctgacttgc catcgtcatg ggggtatatg aattgtaatt gtacaaatag 7921 tagtagtagt tatagtggta ctaaaatggc atgtcctagc aatcgaggca tcttaaggaa 7981 ttggtataac ccagtggcag gattacgaca atccttagaa cagtatcaag ttgtaaaaca 8041 accagattac ttagtggtcc cagaggaagt catggaatat aaacctagaa ggaaaagggc 8101 agctattcat gttatgttgg ctcttgcagc agtattatct attgccggtg cagggacggg 8161 ggctactgct atagggatgg taacacaata ccaccaagtt ctggcaaccc atcaagaagc 8221 tgtagaaaag gtgactgaag ccttaaagat aaacaactta agattagtta cattagagca 8281 tcaagtacta gtaataggat taaaagtaga agctatggaa aaatttttgt atacagcttt 8341 cgctatgcaa gaattaggat gtaatcaaaa tcaatttttc tgcaaaatcc ctcctgagtt 8401 gtggacaagg tataatatga ctataaatca aacaatatgg aatcatggaa atataacttt 8461 gggggaatgg tataaccaaa caaaagattt acaacaaaag ttttatgaaa taataatgga 8521 catagaacaa aataatgtac aagggaagaa agggatacaa caattacaaa agtgggaaga 8581 ttgggtagga tggataggaa atattccaca atatttaaag ggactattgg gaggtatctt 8641 gggaatagga ttaggagtgt tattattgat tttatgttta cctacattgg ttgattgtat 8701 aagaaattgt atccacaaga tactaggata cacagtaatt gcaatgcctg aagtagaagg 8761 agaagaaata caaccacaaa tggaattgag gagaaatggt aggcaatgtg gcatgtctga 8821 aaaagaggag gaatgatgaa gtatctcaga cttattttat aagggagata ctgtgctgag 8881 ttcttccctt tgaggaaggt atgtcatatg aatccatttc gaatcaaatc aaactaataa 8941 agtatgtatt gtaaggtaaa aggaaaagac aaagaagaag aagaaagaag aaagccttca 9001 agaggatgat gacagagtta gaagatcgct tcaggaagct atttggcacg acttctacaa 9061 cgggagacag cacagtagat tctgaagatg aacctcctaa aaaagaaaaa agggtggact 9121 gggatgagta ttggaaccct gaagaaatag aaagaatgct tatggactag ggactgttta 9181 cgaacaaatg ataaaaggaa atagctgagc atgactcata gttaaagcgc tagcagctgc 9241 ctaaccgcaa aaccacatcc tatggaaagc ttgctaatga cgtataagtt gttccattgt 9301 aagagtatat aaccagtgct ttgtgaaact tcgaggagtc tctttgttga ggacttttga 9361 gttctccctt gaggctccca cagatacaat aaatatttga gattgaaccc tgtcgagtat 9421 ctgtgtaatc ttttttacct gtgaggtctc ggaatccggg ccgagaactt cgca // LOCUS ASTSSP12S 3341 bp ds-DNA PLN 29-JUN-1990 DEFINITION A.sativa 12S globulin seed storage protein gene, complete cds. ACCESSION J05485 KEYWORDS 12S globulin; storage protein. SOURCE A.sativa (genomic library lambda-Charon-34), clone lambda-OG1-E1. ORGANISM Avena sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 3341) AUTHORS Shotwell,M.A., Boyer,S.K., Chesnut,R.S. and Larkins,B.A. TITLE Analysis of seed storage protein genes of oats JOURNAL J. Biol. Chem. 265, 9652-9658 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Shotwell, 17-APR-1990. FEATURES from to/span description pept 983 1310 12S globulin seed storage protein, exon 1 1428 1702 12S globulin seed storage protein, exon 2 1829 2344 12S globulin seed storage protein, exon 3 2445 2882 12S globulin seed storage protein, exon 4 pre-msg 945 > 3341 12S globulin seed storage protein mRNA and introns IVS 1311 1427 12S globulin seed storage protein intron A IVS 1703 1828 12S globulin seed storage protein intron B IVS 2345 2444 12S globulin seed storage protein intron C signal 885 890 CAT box signal 917 921 TATA box signal 2915 2920 polyA addition signal signal 2965 2970 polyA addition signal BASE COUNT 1044 a 677 c 606 g 1014 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattctttt tggaaagtca ttttgcctcc tgaactccag tgtttcctat ttattaaaaa 61 aaactaaaaa ctatatttat aagtttgaaa aaatcatgaa acaaaattgt aaaaattgct 121 agtgatatat cccacaaacg tgcaaaatct caatttgaag tgctttgtat ttcgagctac 181 acaaaaatga caaagtgtga ctttttatgt gatttgaaat cactatacta cagatctaca 241 attttgttct ttttgtgaag ctataaatac acattatttt gagttgagat ttttcatgtt 301 tgtgctatga atcataggct acatcctgat ttatttttag aattttttgg aacctaaaat 361 atgttctaga ttatttttta aaaagtggga tcattgatgc ccatacacac gaaatctcca 421 ctcaattctt ttatacatta tctttctata tctactaacg tggattatac atcatagtaa 481 gtttcttact acatgtgctt tcttgttaca atgtggacat gactcttcca cttttgggct 541 ttatgttgta tggatatact catgacatgg aattttgtcc acacacgtag aaccatccat 601 atatattgtt gtgcatagaa caaaacacaa gcaagccatg aaaaaggagt cacaggtgcc 661 acaaactgtt gtaggaagta taactagtat gaggcctttt atttgacgtc ggacaatggc 721 caagagctac atataaaaga tggtgctaga tttgtgagtc agcaccagct gtaggcagaa 781 aacaacacat atcttttgag acaaagttat atctattcac atttaaaacc atgatctgtt 841 gagtcaccat gaatatcttc ttatctatgt taataattac atgtcatcat gtttatcctg 901 gactactttt tatggctata aaatcaaact tacaataagg aaactagcac caatccacct 961 tctacaatct cttcaaacaa tcatggcaac tactaggttt ccatcattgt tgttttactc 1021 ctatattttt ctcttgtgca atgggtcaat ggctcagcta ttcgggcaga gctttacccc 1081 atggcaaagc tctcgacaag gaggtttaag ggggtgcaga tttgataggc tacaagcatt 1141 tgaaccactt cgacaagtgg ggtcacaagc gggtatcatt gagtactttg atgagcagaa 1201 tgagcaattt cgttgtgcag gtgtatccgt catccgtcgt gttattgagc ctcaaggcct 1261 cttgttacct caataccaca acgctcctgg cttggtgtac atccttcaag gttagtgtct 1321 aattgattat aaaaattgcc tttgttatac ttcacttagg atttagatgt gccaaatgtt 1381 acaccgttca tattttttaa caatgaaaca aatgtgtttt cttttaggta ggggattcac 1441 agggttgact ttccctggat gcccggcgac cttccaacaa cagttccaac catttgatca 1501 agcccagttt gctgaaggtc aaagccaaag ccaaaatctt aaggatgaac accaaagagt 1561 tcaccacatc aaacaaggag atgttgttgc tctaccggct ggcatagtac actggtgcta 1621 caacgatggt gatgcaccga ttgtagctgt ctatgtcttc gacgtaaaca acaacgctaa 1681 tcagcttgaa ccaaggcaaa aggtaactat acaacttaat gcacacaaaa tatatacaac 1741 tatttacaag tgatccaact atgatttgaa gtatacatta gtgggatatt aatgaactct 1801 gtttaacttc atcgatataa aattttagga gttcttgttg gctggtaaca acaagagaga 1861 gcaacagttt ggacaaaaca tattcagtgg attcagtgtc caacttctta gtgaggccct 1921 tggtataagt cagcaagtag cacaaaagat tcagagtcaa aatgaccaaa gaggtgagat 1981 aattcgtgtg agtcaaggcc ttcaattctt gaagcctttt gtttcccaac aaggaccagt 2041 agagcatcaa gcctaccaac caattcaaag tcaagaagaa caatcaaccc aataccaggt 2101 agggcaatca ccacaatatc aagaaggaca atcaactcaa taccagccag gacagtcatg 2161 ggaccaaagt ttcaatggtt tggaggagaa tttctgttca ttggaggcaa ggcaaaacat 2221 cgaaaacccg aaacgtgccg acacgtacaa cccacgtgct ggcaggataa cacatctcaa 2281 tagcaagaat tttcccaccc ttaacctggt gcaaatgagt gctacaagag taaatttata 2341 ccaggtattt atgatactac attcaacaca ctatcttatt tttagatatt ctaagcttca 2401 tacaaccgat taataatatg gcatacaaat aattgctatt gcagaatgct attctttcac 2461 catactggaa cattaatgct cacagtgtca tgcacatgat ccaaggacgt gctcgagttc 2521 aagttgtcaa taaccatggt cagaccgtat tcaatgacat tcttcgtcgc ggacaactgc 2581 taatcatacc acaacactat gttgttctca agaaggcaga gcgtgaagga tgccagtata 2641 tttcattcaa gaccaacccc aattccatgg ttagccagat cgcaggaaag acctccatcc 2701 tacgtgcatt gcccgttgat gtcctcgcca atgcataccg catttctaga caggaagccc 2761 aaaacctcaa aaataaccgt ggagaagagt ttgatgcatt cacccctaag tttacacaaa 2821 cgggctccca gagttaccag gacgaggggg agtcatcttc gactgagaag gcatccgagt 2881 gaataagtga gtgtaatgga aactagtata gtgaaataaa ggcatcgcat gtttgcagcc 2941 tagtggtata taaccgctta tctcaataaa aaagtttctc cgtgttatat tgtttgcttg 3001 tttcttgtac tcttcttaat tttatctttt atgatcaacc aacttacctc tcttcttcaa 3061 ttttctgcca ctcttcacat atgcatatca aggagacgta tagtccactt gtactagatt 3121 tattttttgg tactatcttt tgtggtttga gtatttctta ggtccgataa aaacattcaa 3181 tttactgatg ctaaattttt tcagcaaacc tagccaccta aaattgtgaa tcacatacct 3241 tcaaatttct atgccaaagt gtgcgatttt tatgttctat agatccaagc ttgggctgca 3301 ggtcgactct agaggatcga tccccgggcg agctcgaatt c // LOCUS ASTSSPA 1548 bp ds-DNA PLN 29-JUN-1990 DEFINITION A.sativa seed storage protein gene, complete cds. ACCESSION J05486 KEYWORDS avenin; prolamine; storage protein. SOURCE A.sativa (genomic library lambda-Charon-34), clone lambda-AV45-X1. ORGANISM Avena sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1548) AUTHORS Shotwell,M.A., Boyer,S.K., Chesnut,R.S. and Larkins,B.A. TITLE Analysis of seed storage protein genes of oats JOURNAL J. Biol. Chem. 265, 9652-9658 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Shotwell, 17-APR-1990. FEATURES from to/span description pept 484 1113 seed storage protein mRNA 411 > 1548 seed storage protein mRNA signal 343 347 CAT box signal 380 384 TATA box signal 1189 1194 polyA addition signal signal 1252 1258 polyA addition signal BASE COUNT 484 a 375 c 323 g 366 t ORIGIN 1 tctagaacaa tccttcctgt tactagttta atctacctta tacacagaca agtttacaaa 61 ctttgcaagc catgacaata tggtttatgc ctaacagcct gtagaaaaac acaaacttag 121 tttcagaaaa tgatgcaata tagatcagtg attgacatgt aaagcgaaaa agatgagaca 181 tattgccaac tatcaagctt tcttgatagt cgaaaggtga tacatgtaat ggcaaagcaa 241 cttttatgat cgatccaaaa ataacatttg taggtagtgc caccaaacac aacataccaa 301 ataatcagtt cgagaagcat acaagcattt tttaaggaaa agcaaatgca aagatgaaaa 361 gaaactatgc catggcagat ataaatagac ctgcaccatg gagatcctcc tccatcattc 421 atccttcaca caccgagagc acaaacatta aaacctagca agcagtatca acacaaatct 481 accatgaaga acttcctcat ccttgccctc ctttccatgg cggcgaccat ggccactgcg 541 cagtttgacc ctagcgaaca ataccagcca tatcctgagc aacaacagcc aattctacaa 601 caacagttgc tgttgcagca acaacaacag atgttgttgc agcaacaacc attgttgcag 661 gttttgcagc aacagttgaa cccatgcagg cagttcctcg tgcaacagtg cagcccggtg 721 gcagcggtgt cattcctccg gtcgcaaatc ctgcaacaga gcagctgcca ggtgatgagg 781 caacaatgct gccgacggct ggagcagatc cccgagcagc tccggtgccc agccatccat 841 agcgtcgtcc aggccatcat tatgcagcaa caacaacaac agttcttcca gcctcagatg 901 caacaacagt tcttccagcc tcagatgcaa caggtgacac agggcatctt ccagcctcag 961 atgcaacagg tgacacaggg catcttccag actcagatgc aaggtcagat cgaggggatg 1021 agggcgtttg cgctgcaggc cctgccggcg atgtgcgatg tatacgtccc gccgcactgc 1081 cccgtcgcca ccaccccgct cagtggcttc taagaacact acaagagcta tagtactaca 1141 taaataccat cagcgtttag ccgatggacc gatcttgtag cggtgacaaa taaaataaaa 1201 gggtcatgca ctaacatgtg tgacagccga cccgtactag tccaaacttg gaaataaaag 1261 aatgcaaagt tcttgtctgc ataacattga ttgtcgtttt tccattcatg tttataccta 1321 accataagct catgcattaa ctagctgctt atgtagcctt cattgatgtg aacataataa 1381 cagagtgggt gtaaaagatg gatttgaatt gaggcactat taaagtgagg agtaagaacc 1441 aggtgcagat gatagctctt taatggtgag gcttgtgagt tgtgacctta ttggctgaca 1501 tgcattggat tctaaagagt acatgttgat gctttcaaca atgtcgac // LOCUS DOGSPC18A 847 bp ss-mRNA MAM 29-JUN-1990 DEFINITION Dog microsomal signal peptidase complex (SPC 18) mRNA, complete cds. ACCESSION J05466 KEYWORDS ER membrane protein; signal peptidase complex. SOURCE Dog liver, cDNA to mRNA, lambda-18-[50,51]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 847) AUTHORS Shelness,G.S. and Blobel,G. TITLE Two subunits of the canine signal peptidase complex are homologous to yeast sec-11 protein JOURNAL J. Biol. Chem. 265, 9512-9519 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.S.Shelness, 04-APR-1990. FEATURES from to/span description pept 39 578 microsomal signal peptidase complex BASE COUNT 212 a 162 c 216 g 257 t ORIGIN 1 cctctccgct gcccgtgtcc tccttcaccg gccgcgccat gctgtctcta gattttttgg 61 acgatgtgcg gcgaatgaac aagcggcagc tgtattatca agtcctaaat tttggaatga 121 ttgtctcctc ggcactaatg atctggaagg gactgatggt cataactgga agtgaaagtc 181 caattgtagt ggtgctcagt ggcagcatgg agccagcatt tcatagagga gatcttctct 241 ttttaacaaa tcgagttgaa gatcccatcc gagtgggaga aattgttgtt tttaggatag 301 aaggaagaga gattcctata gttcaccgag tcttgaagat tcatgaaaag caaaatggac 361 atatcaagtt tttgaccaaa ggagataata atgccgttga tgaccgaggc ctctataaac 421 aaggacaaca ctggctagag aagaaggatg tcgtggggag agcaagggga tttgttcctt 481 atattggaat tgtgacgatc ctcatgaatg actatccgaa atttaagtat gccgtcctct 541 tcttgctggg tttattcgtg ctggtccatc gtgagtaaga agtctacctt gctgttcctg 601 gaagatgccg tacttttcgt tcctgactgt ttggagtaga tattggtcta tgattggtgg 661 agtgcagaac gtacatgttg gcgcttcttg gtagcaatgg tttgcattag tttctgtttc 721 cacgccaggg tctgtgtggg cgggtgcacc aagatgcgca ccgaggggac tctcaatcac 781 aggatttcat atgttgtcat tgtcacactt tcacattttt gtacatcagt gaatttttta 841 tattaaa // LOCUS DROCRPA 7226 bp ss-mRNA INV 29-JUN-1990 DEFINITION D.melanogaster crumbs protein mRNA, complete cds. ACCESSION M33753 KEYWORDS crumbs protein. SOURCE D.melanogaster (strain Oregon R) 3-12 hr embryo, cDNA to mRNA, clone lambda-gt10. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 7226) AUTHORS Tepass,U., Theres,C. and Knust,E. TITLE Crumbs encodes an EGF-like protein expressed on apical membranes of Drosophila epithelial cells and required for organization of epithelia JOURNAL Cell 61, 787-799 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by U.Tepab 17-APR-1990. FEATURES from to/span description pept 214 6633 crumb protein BASE COUNT 1880 a 1860 c 1879 g 1607 t ORIGIN Chromosome 95F9-13. 1 aaaaaaaata aaaataacct gttctaagcc aagtgccgat tgtgtgtgtg tgtgtgtgtg 61 cgtatgaagt gtttgcgtgg agcagagaga atgggagaat gcaattacca acaacaacaa 121 gctcaatgag tatcgtttga atagcaataa tatctagcca aaagcaaccc tttccaatac 181 gaaaactcga cttgagccaa caaatcagcg atcatggcta aaatcgccaa tgcgtcactg 241 tcgcaacaac aaaaacagcg gcaagcagaa acagcaacga ccacaacaac aacagtggcg 301 gcatcaggta gaaacagcaa caacaacagc aagaagtcga gaccgcacga aaagcgccac 361 gtcaaattac gctcgcattt gctcaagcgc cgcccatttc agtttattcg tcgccctcaa 421 tggataccgc tcttcatact tatttactta gcaacagatg tcgcctcagt ggcggtgccg 481 acgaaggagg cgtactttaa tggctccact tacctccgcc tcaccacgcc gatgcccatt 541 tgggatcact cggcgattag tttccgctcg tgccgcggcg gcgagatcct cgcccagcag 601 tacaacaaga actccattgt aatctcagtg ctcaatgact ttctgcaaat ctcactggct 661 ggacccgccg tccatgggcc caacaaccgg ctggatgtca agctgcccta ccaactgctg 721 gacaaccgct ggcatacgct gcagttcaag tacgagtacg gaaatctcta cctgcatgtg 781 gatcgcgcgg caagcatatt tgccaactcc acgtacaaca gtcagttcct gactaaccag 841 gacattggct acaaggacgc catcttgata ctgggaaact ccttctccgg ttgccttctg 901 gatggaccag gtctgcaatt tgtgaacaac tcgacggtgc agaatgtggt ctttggtcac 961 tgtcccttga ccccgggtcc ctgcagcgat cacgatcttt tcactcgact gccagacaac 1021 ttttgtctga atgatccctg tatgggccat ggaacctgct cgtccagtcc agagggatac 1081 gaatgtcggt gcacggcacg ttactcggga aaaaattgcc agaaggacaa tggctcgccg 1141 tgtgccaaaa atccatgtga gaacggtggt tcctgcctgg agaattccga ggggaattac 1201 cagtgcttct gtgatcccaa ccacagtggt cagcactgcg agacggaggt gaacatccac 1261 ccactctgcc agacgaaccc ctgcctgaac aatggagcat gtgtggtgat tggcggcagt 1321 ggagcactca cctgcgagtg tcccaaagga tacgctggtg ccaggtgcga ggtggacacg 1381 gatgagtgcg cctcacagcc gtgtcagaac aacggaagct gcatagaccg tatcaatgga 1441 ttcagctgcg attgcagcgg caccggttac acgggtgcgt tctgccagac gaatgtggat 1501 gagtgcgaca agaatccgtg cctgaacggc ggcagatgct tgcataccta cggatggtac 1561 acctgccagt gtctggatgg ttggggcggc gagatttgtg atcgacccat gacatgccag 1621 acgcagcagt gtttcaacgg tggcacctgc ctggacaagc ccatcggctt ccagtgcctc 1681 tgtccgccgg agtacactgg tgaactgtgc caaatcgcac ccagttgcgc acagcagtgc 1741 cccatcgatt cggagtgcgt aggcggcaaa tgcgtttgta agccaggctc atcgggttac 1801 aactgtcaaa cgagcacggg cgatggagct tctgctttgg ccctgacacc catcaactgc 1861 aatgccacca atggaaagtg tctcaatgga ggaacttgct ccatgaacgg aacccactgc 1921 tattgcgccg tcggctattc gggagatcgt tgcgagaagg ccgagaactg ttcgccactg 1981 aactgccagg aaccgatggt atgtgtccag aatcagtgcc tctgcccgga gaacaaggtg 2041 tgcaaccagt gcgccacaca accgtgccag aatggcggtg aatgtgtgga cctaccgaat 2101 ggagactacg agtgcaagtg cacccgtgga tggactggac gcacctgtgg caacgacgtg 2161 gacgagtgca ccctgcatcc gaagatctgc ggcaatggta tctgcaagaa cgagaaggga 2221 tcgtacaaat gctactgtac gcctggattt accggagtcc actgcgattc cgatgtggac 2281 gaatgcctta gctttccctg cctcaacgga gccacgtgcc acaacaagat aaatgcctac 2341 gagtgcgttt gccaaccagg atatgaggga gagaactgcg aagtagatat cgatgagtgc 2401 ggcagtaatc cctgctcgaa cggatccacc tgcatcgaca ggatcaacaa cttcacctgc 2461 aactgcattc cgggaatgag gggccgcata tgtgacatcg acattgacga ctgtgtggga 2521 gatccctgct tgaatggtgg tcagtgcatc gaccaattgg gtggcttccg ttgtgactgc 2581 agtggcaccg gttatgaggg cgagaattgt gagctgaaca tagacgaatg cctctcgaat 2641 ccgtgcacga atggtgccaa gtgcctggac agggttaagg actacttctg cgattgccac 2701 aacggctaca agggaaagaa ctgtgagcag gacatcaacg agtgcgagag taatccctgc 2761 cagtacaatg gcaactgcct agagcgttcg aacataacgc tttaccaaat gagtaggatc 2821 acggatctgc ccaaggtgtt tagccagccc tttagctttg agaatgccag cggctacgag 2881 tgcgtctgtg tgcccggcat cattggaaag aactgcgaga tcaacattaa tgagtgcgac 2941 agtaatccat gcagcaagca cggaaattgc aatgacggga ttggaaccta tacctgcgaa 3001 tgcgaacctg gcttcgaggg cacccactgt gagatcaaca tagatgaatg tgatcgctat 3061 aatccctgcc agaggggcac ttgctatgac cagatagacg actacgactg cgactgcgat 3121 gcgaactatg gaggcaaaaa ctgttccgtt cttctcaaag gctgtgacca gaatccttgt 3181 ttaaatggcg gtgcctgctt gccgtatttg atcaacgagg taactcatct gtacacctgc 3241 acctgcgaga acggcttcca gggcgataaa tgcgaaaaga ccactactct gtccatggtg 3301 gccactagcc tgatttcggt taccacggaa cgcgaggaag gctacgacat taatcttcaa 3361 tttaggacca cactgcccaa cggagttttg gctttcggaa ccaccggcga aaagaatgaa 3421 ccagttagct atattttgga gctgattaat ggacgactga atctacattc ctcgctccta 3481 aataagtggg agggcgtgtt tattggatcg aagctgaacg atagcaactg gcacaaggtg 3541 tttgtggcca taaacacttc gcatctagtc ctttcggcca acgacgagca ggctattttt 3601 ccggttggct cctatgaaac ggccaataat agccagcctt cattcccgcg cacatacctc 3661 ggaggcacca ttcccaatct gaagtcctat ctgcgccacc tcacccacca accatcagct 3721 tttgtgggct gcatgcaaga tataatggtc aatggtaaat ggatctttcc cgatgagcag 3781 gatgcgaata tcagttatac caaactggaa aatgtccaga gcggttgtcc gcgcaccgaa 3841 caatgtaaac cgaatccttg ccattccaac gttgaatgca cggacctttg gcacaccttc 3901 gcttgtcact gtcccagacc attctttggg cacacatgtc agcacaatat gactgctgcc 3961 acttttggcc acgagaacac aacccattcg gccgtgattg tggagaccac agatgtggct 4021 aggcgagcta ttagatccat cctagacatt tccatgttta ttcgtacacg cgagccaact 4081 ggtcaagtct tttacttggg cactgatcca cgcaaagccc ccactaaaaa tattggcgac 4141 tcatatgtgg cggccaaatt acacggcggc gagctgctgg taaagatgca gttcagcggt 4201 actccggagg cctacaccgt tggtggccag aaattggaca acggctataa ccacctgatc 4261 gaggtggtgc gcaatcaaac gctcgtgcag gtcaagctca atggcaccga gtacttccgc 4321 aagacgctgt ccacgacggg tctgctggac gcacagttgc tttacttggg cggacctgca 4381 cccacacgcg agtcacttct gggcgcgacc acagaaccag gaataattcc agttccggga 4441 gcaggaatac ccattgaaga cacaacagtg cccaaggagg cggacgacag cagggactac 4501 tttaagggca taatccagga cgtgaaggtt agcaatggct cgctcaacct gattgtggag 4561 atgtattccc taaacgtgac agatgtccaa gtgaatgcca aacctctggg cgctgtaacc 4621 atcgatcgtg cctccgtgtt acctggtgag gtgtccgatg atctgtgccg aaagaatgcc 4681 tgcctccata atgccgagtg caggaacaca tggaatgact acacttgcaa atgcccaaat 4741 ggttacaagg gaaagaagtg tgccaggaga tcagaatttt gccaacacgt tacatgtccg 4801 ggacagagct tgtgccaaaa tctggacgat ggctatgagt gtgttacgaa caccacattt 4861 actgggcagg aacgcagtcc tctcgccttc ttctacttcc aggagcagca atctgacgat 4921 attgtcagtg aagcgtctcc caagcaaact cttaaaccgg taattgatat tgccttccga 4981 ctcgtgctgg aggtactctg cctgtacata gacaatgtgg acggattctt tgaaattgga 5041 gtaaacggag gacgagtgac catcacctgg aagctaagtg cactgcattt tggcgagtct 5101 gcacgctttg agaaggagaa cactgacgga gaatggagtc gcatttacct cagggcacac 5161 aacagcaaac tggagggtgg atggaaggga tgggaatcga tggtggatcc gacgccagct 5221 ttttccacgg acatcgacca agcggccttc cagtccctga tcgctacaag tacccaggtt 5281 tacttgggtg gcatgccaga gtcgcgacaa gcacgaggat ccactttgtc tgcccagcag 5341 ggctctcagt tcaagggctg tgtgggagag gcaaggctcg gcgatctttt actgccctac 5401 ttctccatgg cggaactgta ttcgcgcacc aatgtttcag tacagcaaaa ggctcaattc 5461 cgtctaaatg ccacacgacc tgaggagggc tgcatcctgt gcttccagtc agactgcaaa 5521 aatgacggct tctgtcaatc tccttcagat gagtacgcct gcacctgtca gcctggattc 5581 gagggcgatg attgcggcac ggacatcgac gagtgtctta acacggaatg cttgaacaac 5641 ggcacctgca tcaaccaggt tgcagctttc ttctgccagt gtcagccagg attcgagggt 5701 cagcactgtg agcaaaacat cgacgagtgt gcggatcagc cgtgccacaa cggtggcaac 5761 tgcacggatc ttatcgcatc gtacgtgtgc gactgccctg aggactatat gggcccgcag 5821 tgcgacgtgc tgaagcaaat gacctgcgag aacgagccat gtcggaatgg atcaacctgc 5881 cagaatggat tcaatgcttc tactggcaat aactttacat gtacatgcgt gcccggcttc 5941 gagggtccac tgtgtgacat acccttctgt gaaataacgc cttgcgataa cggtggcctc 6001 tgcctgacca ctggagcggt accgatgtgc aaatgtagtc tgggatacac tggtcgcctg 6061 tgcgagcagg acattaacga gtgcgaatcg aatccatgcc agaacggtgg tcagtgtaag 6121 gacctcgtcg gcaggtacga gtgcgattgc agggcacgga ttcgagggat tcgctgtgaa 6181 aatgacatcg acgagtgcaa catggaggga gattactgcg gcggattggg ccggtgtttc 6241 aacaagcccg gatccttcca gtgcatctgc cagaaaccct attgcggagc ctactgcaac 6301 tttacggatc cctgcaacgc tacggacctc tgtctaaacg gcggtcgctg cgtagagtcc 6361 tgcggcgcca aaccggacta ctactgcgag tgtccggaag gtttcgcggg aaagaattgc 6421 acagcaccga ttacggccaa ggaggacggg ccttcgacca cagacattgc catcattgta 6481 atacccgtag tggtggtgct gctgctgatc gcgggagccc tcctgggcac cttcctggtg 6541 atggccagga acaagcgagc aaccaggggc acctatagcc cgagcgtcgt caagagtact 6601 gcaacccacg gctggaaatg gacaacgtac tgaagccacc gccggaagag cgactaattt 6661 agttttgagt tttgagcatg aacgacgatt agcaaagcaa acaaaagata tttttaaatc 6721 cgcccatata cacctagctg taggagtaac tcaatgtttt gtactaagtt ggcccctagt 6781 tatggtttac atcttaaggt gctcaaagca atagcagcag cagcagcagc caggcttcca 6841 cctcgactcg tcccttatcc cctgcctacg ctttagttag ttaataatgc cgttgtctat 6901 ttattctagt agttagatga catacgtacc gccctatagt cgttatgtag ttatgttccc 6961 gatagtttag attccgtatt cgatttctcg tatatgtaat cctaaagctg cgaagaaact 7021 tgagctccga actgtcgatt tcccctttgc attccacaag gaagcccatc aaatgtattg 7081 tatgagtatt tttgtaaata agattgtttg ccgactctta aagtaaatta gctgagctaa 7141 aatcgaaatt aaaaacagaa aatcctatat gcctatatat cgcgagtata tatcagtaaa 7201 tgtggcctta taacaaaaaa aaaaaa // LOCUS ECOINCRNA 263 bp ss-RNA BCT 29-JUN-1990 DEFINITION E.coli antisense Inc RNA. ACCESSION M33752 KEYWORDS antisense RNA. SOURCE E.coli clinical isolate RNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 263) AUTHORS Lopez,X.X. and Rodriguez,J.C. TITLE Isolation and nucleotide sequence of an incompatibility gene from the incFv1 plasmid pSU401: comparison to other IncF genes JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Rodriguez, 17-APR-1990. Univ de Cantabria, Depto Biologia Molecular, c/Cardenal Herrera Oria s/n, 39011 Santander, SPAIN FEATURES from to/span description signal 179 185 -10 promoter signal 205 211 -35 promoter BASE COUNT 70 a 54 c 71 g 68 t ORIGIN 1 gatcgtttaa ggaattttgt ggctggccac gccgtaaggt ggcaaggaac tggttctgat 61 gtggatgtac aggagccaga aaagcgaaaa ccccgataat cttctctaac tttggcgagt 121 gcagaaagat taccggggcc atctaaaaac cgtatagcca acaattcagc tatgcgggga 181 gtatagttat atgcccggaa aagttcaaga cttctttctg tgctcgctcc ttctgcgcat 241 tgtaagtgca ggatggtgtg act // LOCUS FLASL287HA 1095 bp ss-RNA VRL 29-JUN-1990 DEFINITION Influenza A/SL/2/87, A(H1N1) virus hemagglutinin (seg 4) RNA, 5' end. ACCESSION M33748 KEYWORDS hemagglutinin. SOURCE Influenza type A/SL/2/87 passed in rhesus monkey kidney cells. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1095) AUTHORS Rajakumar,A., Swierkosz,E.M. and Schulze,I.T. JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review REFERENCE 2 (bases 33 to 1095) AUTHORS Rajakumar,A., Swierkosz,E.M. and Schulze,I.T. TITLE Sequence of an influenza virus hemagglutinin determined directly from a clinical sample JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.T.Schulze 17-APR-1990. St. Louis School of Med, Dept of Microbiology, 1402 S Grand Blvd St. Louis Missouri 63104 FEATURES from to/span description pept 33 > 1095 hemagglutinin precursor protein sigp 33 83 signal peptide matp 84 1061 hemagglutinin 1 (HA1) matp 1066 > 1095 hemagglutinin 2 (HA2) BASE COUNT 381 a 233 c 237 g 244 t ORIGIN 1 agcaaaagca ggggaaaata aaaacaacca aaatgaaagc aaaactacta gtcctgttat 61 gtgcatttac agctacagat gcagacacaa tatgtatagg ctaccatgcg aacaactcaa 121 ccgacactgt tgacacagta cttgaaaaga acgtgacagt gacacactct gtcaacctac 181 ttgaggacag tcacaacgga aaactatgtc gactaaaagg aatagcccca ctacaattgg 241 gtaattgcag tattgccgga tggattttag gaaacccaga atgcgaatca ctgttttcta 301 agaaatcatg gtcctacatt gcagaaacac caaactccga gaatggaaca tgttacccag 361 ggtatttcgc tgactatgag gaactgaggg agcaattgag ttcagtatca tcattcgaga 421 gattcgaaat attccccaaa gaaagctcat ggcccaacca caccgtaacc aaaggagtaa 481 cggcatcatg ctcccataag gggaaaagca gtttttacag aaatttgcta tggctgacgg 541 agaagaatgg ctcgtaccca aatctgagca agtcctatgt gaacaacaaa gagaaagaag 601 tccttgtact atggggtgtt catcacccgt ctaacatagg ggaccaaagg gccatctatc 661 atacagaaaa tgcttatgtc tctgtagtgt cttcacatta taacaggaga ttcaccccag 721 aaatagcaaa aagacccaaa gtaagagatc aagaaggaag aattaactac tactggactc 781 tgctggaacc cgaggacaca ataatatttg aggcaaatgg agatctaata gcgccatggt 841 atgcttttgc actgagtaga ggctttgggt caggaatcat cacctcaaac gcatcaatgg 901 atgaatgtga cgcgaagtgt caaacacccc agggagctat aaacagtagt cttcctttcc 961 agaatgtaca cccagtcaca ataggagagt gcccaaagta tgtcaggagt acaaaattaa 1021 gaatggttac aggactaagg aacatcccat ccattcaatc cagaggtttg tttggagcca 1081 ttgccggttt cattg // LOCUS TREKPMDA 999 bp ds-DNA ORG 29-JUN-1990 DEFINITION T.evansi kinetoplast DNA minicircle. ACCESSION M33751 KEYWORDS . SOURCE T.evansi bloodstream forms DNA, clone AnTat 3/3. ORGANISM Kinetoplast Trypanosoma evansi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma evansi. REFERENCE 1 (bases 1 to 999) AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S., Steinert,M. and Hamers,R. TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.Bajyana Songa 17-APR-1990. FEATURES from to/span description site 367 516 kinetoplast minicircle common region (ORI) BASE COUNT 391 a 78 c 203 g 327 t ORIGIN 1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt 61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt 121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg 181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt 241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt 301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat 361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg 421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt 481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag 541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag 601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca 661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat 721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac 781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa 841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg 901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat 961 aaaataatgg ataaaagagt ttataggtga agttgaagt // LOCUS RATCASPAT 1535 bp ss-mRNA ROD 29-JUN-1990 DEFINITION Rat cytosolic aspartate aminotransferase mRNA, complete cds. ACCESSION D00252 KEYWORDS cytosolic aspartate aminotransferase. SOURCE Rat (strain Wistar), cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1535) AUTHORS Horio,Y., Tanaka,T., Taketoshi,M., Nagashima,F., Tanase,S., Morino,Y. and Wada,H. TITLE Rat cytosolic aspartate aminotransferase: Molecular cloning of cDNA and expression in Escherichia coli JOURNAL J. Biochem. 103, 797-804 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Wada, 07-SEP-1989. FEATURES from to/span description pept 43 1284 cytosolic aspartate aminotransferase (EC 2.6.1.1) mRNA 1 1535 cytosolic aspartate aminotransferase mRNA BASE COUNT 360 a 409 c 410 g 356 t ORIGIN 1 agctcccagc tcgtgctcga ggtcccctca gattccatcg cgatggcccc tccatcattc 61 tttgcccagg ttccacaggc cccgccggtt ctggtcttta agctcattgc ggacttccgg 121 gatgatcccg atccccgcaa ggttaacctc ggcgtgggag cgtaccgcac agatgactct 181 cagccctggg ttttgccagt agtgaggaag gtcgaacaga agattgctaa cgaccacagt 241 ctcaaccacg agtacttgcc catcctgggc ctggcggagt tccggagctg tgcttctcag 301 ctagtacttg gggacaacag cccagctctc agggagaatc gggttggggg tgtgcagtct 361 ttgggaggga ccggtgcact tcgaattgga gctgacttct tagggcgatg gtacattggc 421 acagacaaca agaacacgcc cgtctacgta tcatcgccga cctgggagaa ccataatggc 481 gtgttttctg ccgctggttt taaagacatt cggtcctatc gctactggga tgcagagaag 541 agaggacttg atctccaggg tttcctgaat gatctggaga atgctcctga gttctccatc 601 tttgtcctcc acgcctgtgc acacaaccca acggggaccg acccaactga agaggagtgg 661 aagcagatcg ccgccgtcat gaagcgccgt tttctgttcc ccttctttga ctcagcctat 721 cagggctttg catctggaga cctagagaaa gatgcctggg ctattcgcta ttttgtgtct 781 gaaggcttcg agctcttctg tgcccagtcc ttctccaaga acttcgggct ctacaatgag 841 agagtgggga atctgaccgt ggtcggaaaa gagcatgaca gcgtcctgcg ggtcctttcc 901 cagatggaga agattgtacg aatcacctgg tccaatcccc ctgcccaggg agctcggatc 961 gtggccacca ccctctccaa ccctgagctc tttaaggagt ggaaaggaaa cgtgaagaca 1021 atggctgacc ggattctgac catgagatcc gaactcaggg cgcgactaga agctctcaag 1081 actcccggga cttggtctca catcactgag cagattggaa tgttcagctt tactgggttc 1141 aaccccaagc aggtcgagta tttggtcaac gagaagcaca tctatctgat gccgagcggt 1201 cggatcaaca tgtgcggctt gaccaccaag aacctagatt atgtggctac ctccatcaat 1261 gaagctgtca ccaaattcca gtgaagaaac accgagtagt tcataccacc aaagcagttc 1321 ctgtcacagc tttcctgcct gcgcaaacct agccgtacat gttgtttatt agagatgacc 1381 accatgggga ggcagccgct gtttagctgg ccccacaaga gaagacattt cttgaacatg 1441 aacctgggtc gggtgggggg atgactgggg ttagggcctt ttggaaacca gagcagatta 1501 aagttattta agaataaaaa aacccgttga tatgc // LOCUS RATGL5 599 bp ds-DNA ROD 29-JUN-1990 DEFINITION Rat NICER element (GL5-14) 3' long terminal repeat. ACCESSION M33534 KEYWORDS long terminal repeat. SOURCE Rat (strain NEDH) liver NGS-TC12 induced pheochromocytoma cell line DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 599) AUTHORS Cho,K.-O., Minsk,B. and Wagner,J.A. TITLE NICER elements: A family of nerve growth factor inducible cAMP-extinguishable retrovirus-like elements JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3778-3782 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Wagner, 03-APR-1990. FEATURES from to/span description rpt 21 575 3' viral long terminal repeat recomb 575 576 viral DNA end/rat DNA start signal 345 349 CAAT box signal 402 406 TATA box signal 481 486 Poly-A signal BASE COUNT 174 a 154 c 139 g 132 t ORIGIN 1 tggtagaaga agaggggaat ggaaaggaaa attatacaga tttaaggttt aaaaatatga 61 agttaaaaga gtatgtttca actcaggact aaacactgtg aaaagcaagt ccagacagcc 121 ccgccctgcc gctagaacta acagaccata aaaggaaagg aatgcagaac agaccaggag 181 taccggacct gactcacagg ccacctggca ggaagagata agcccccagc ccccgacatc 241 caggacgccc caaagctgcc aatgtgtgta gctataactt attacctcat catgtgaaat 301 agccaatcat atgtgaacat gtctttgtgc ctcgtttgaa tccaccaatc cccgtaacta 361 tgcatctgct tctgtacgcc cacttctgct tccccaatcc ctataaaatc cccatgctgg 421 agctgctggg tgcgcaagtc ctccgaagag tctgtgtgcc cgcaggtacc tgtgttttcc 481 aataaaccct cttgctgatg gcatcgcagt ggactcggct cggtcattgg ggcttggact 541 cctcctgagg gaaggtcctc tccgggggtc tttcagaaga agagaattaa cagaccaaa // LOCUS RATGL5A 599 bp ds-DNA ROD 29-JUN-1990 DEFINITION Rat NICER element (GL5-14) 5' long terminal repeat. ACCESSION M33535 KEYWORDS long terminal repeat. SOURCE Rat (strain NEDH, isolate GL4-14) liver NGS-TC12 induced pheochromocytoma cell line DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 599) AUTHORS Cho,K.-O., Minsk,B. and Wagner,J.A. TITLE NICER elements: A family of nerve growth factor inducible cAMP-extinguishable retrovirus-like elements JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3778-3782 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Wagner, 03-APR-1990. FEATURES from to/span description recomb 18 19 rat DNA end/viral DNA start rpt 19 584 5' viral long terminal repeat signal 344 348 CAAT box signal 401 405 TATA box mRNA 439 > 599 mRNA signal 479 485 Poly-A signal binding 579 586 primer binding site BASE COUNT 160 a 158 c 138 g 143 t ORIGIN 1 ttattatgtg tccttgaagg gaaaggaaaa ttatacagat ttaaggttta aaaatatgaa 61 gttaaaagag tatgtttcaa ctcaggacta aacactgtga aaagcaagtc cagacagccc 121 cgccctgccg ctagaactaa cagaccataa aaggaaagga atgcagaaca gaccaggagt 181 accggacctg actcacaggc cacctggcag gaagagataa gcccccagcc tccgacatcc 241 aggacgcccc aaagctgcca atgtgtgtag ctatacctta ttacctcatc atgtgaaata 301 gccaatcata tgtgaacatg tctttgtgcc tcgtttgaat ccaccaatcc ccgtaactat 361 gcatctgctt ctgtacgccc acttctgctt ccccaatccc tataaaagcc ccatgctgga 421 gctgctgggt gcgcaagtcc tccgaagaga ctgtgtgccc gaaggtacct gtgttttcca 481 ataaaccctc ttgctgattg catcgagtgg actcggctcg gtcattgggc gcttggactc 541 ctcctgaggg aaaggtcctc tccgggggtc tttcattatg gggctcgccc cgggatttg // LOCUS XELHMGCOA 2970 bp ss-mRNA VRT 29-JUN-1990 DEFINITION X.laevis 3-hydroxy-3-methylglutaryl-coenzyme A reductase mRNA, complete cds. ACCESSION M29258 KEYWORDS 3-hydroxy-3-methylglutaryl-coenzyme A reductase. SOURCE X.laevis liver, cDNA to mRNA, clone 2. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2970) AUTHORS Chen,H. and Shapiro,D.J. TITLE Nucleotide sequence and estrogen induction of Xenopus laevis 3-hydroxy-3-methylglutaryl-coenzyme A reductase JOURNAL J. Biol. Chem. 265, 4622-4629 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Chen, 20-OCT-1989. FEATURES from to/span description pept 115 2766 3-hydroxy-3-methylglutaryl-coenzyme A reductase mRNA < 1 2970 3-hydroxy-3-methylglutaryl-coenzyme A reductase BASE COUNT 821 a 676 c 714 g 759 t ORIGIN 1 ggcgggaggt agaacatctg tgtgtgtcat ctccgtatcc ttgtggtatt tctctgccgc 61 cgccgggtca gtcaccagcc agcatcggcc aaagatatcc ttaatagagc caacatgctg 121 tccagactat ttcgcatgca tgggcagttt gtggcctctc atccctggga ggtcatcgtt 181 gggactgtga ctctcacaat ctgcatgatg tcaatgaaca tgttcactgg gaatgacaaa 241 atctgtggct ggaattatgc ttgccccaag tttgaagaag atgttctgag cagtgacatc 301 atcatcctga caatcacaag atgcatcgcc atcctttaca tttatttcca gttccaaaac 361 ctcagacagc tcggctcaaa atacatttta gggattgctg ggcttttcac catcttctca 421 agctttgttt ttagcacagt ggttattcac ttcctggata aagaactgac aggcctaaac 481 gaagctcttc catttttcct gcttttgatc gatctttcca aggctagtgc tttagccaaa 541 tttgctctaa gttccaactc acaggatgaa gtccgagaca atattgcccg tggcatggca 601 attctaggac ctacctttac ccttgaggcg cttgtagaat gtcttgtgat tggagtcggt 661 acaatgtcag gtgtgcggca gttagaaatc atgtgttgct tcggctgcat gtctgtcctt 721 gccaattatt ttgccttcat gaccttcttt ccagcttgtg tttcattggt gttagagctt 781 tctagagaga gccgagaggg ccgtccaatc tggcagctca gtcagtttgc tagtgtacta 841 gaagaagagg aggataataa acccaatcct gttactcaga gagtgaaaat gattatgtct 901 ttgggtttgg tccttgttca tgctcatagt cggtggataa gtgaaccctc ttcccaaaac 961 agcacctcaa tcagtgacca tgaagttacc accatgctgg atgatatgat gccaaagaga 1021 gttgagccta gcatgcctct ctggcagttc tatctttcta gaatggtgac catggatgta 1081 gaacagatca ttactcttgg tcttgctctc ctccttgccg tgaagtacat attctttgag 1141 caaacagaga cagaatcaac cttctctatg aagaacccga taatctcccc cgtagcggtg 1201 caaaagaagc aaatagagag ttgctgcaga cgggaacctg aacaggaaaa aactgtccat 1261 gtttccacaa cagaagaagc ttcatcaaag gaggaaactg aggcagttat aaagccatta 1321 ccccttgaga catcgcccaa agccaagttt atagtgggag attcttctcc actggaattg 1381 tccccagagg ataaaaatac catgtttgat ctaccagagg aaccccgtcc acttgatgaa 1441 tgtgttcgta tactcaagaa tccagataaa ggcgcacagt atctgactga tgcagaagtc 1501 attagcctgg tgaatgccaa acacatccca gcctacaaac tggaaaccat gatggaatct 1561 ccccgagagg gcgtggccat tcgccggcaa atgttgtctg acaagctccc ccagcgatca 1621 gcactgcaga gcttgcctta taagaactac aactactctt tagtaatggg tgcttgttgt 1681 gagaatgtca tcgggtacat gcccatccct gtaggtgtag ctggacctct cttgctgaac 1741 aacaaagagt atcaggtgcc catggcaaca actgaaggat gtcttgtggc cagcacaaac 1801 agaggttgca gagccataat gctgggagga ggtgccaaaa gtcgtgttct ggctgatggg 1861 atgactcgtg gaccggttgt taggttacca actgcctgtg atgcagcaga agtaaaggcc 1921 tggctagaca gcgcagaggg attcaaagtc ataaaagatg catttgatag cacaagcaga 1981 tttgcacgac tcgggagact acaaaactgc gttgctggca ggaacctgta catccgtttc 2041 cagtcaaaaa caggagatgc catgggaatg aacatgattt cgaaggtcac tgaacaagct 2101 cttgctagac tacaagaaga gttccctgat ctccatgttt tggctgtgag cggcaactac 2161 tgtacggata agaagccagc tgctattaac tggatagagg gaagaggaaa gtcagtggtt 2221 tgtgaggcca tcattccagc caaagttgta cgggaggttc taaagagctc aacagaagcg 2281 ttagttgaag tgaatatcaa taagaatttt attggttctg caatggcggg aagcatcgga 2341 gggtacaatg cacatgctgc aaacattgtg actgccattt acattgcttg cggacaggat 2401 gctgctcaga atgttgggag ctccaattgc atcacaataa tggaggctac cggtcccacg 2461 tatgaagatt tgtacatcag ctgcacaatg ccctctatag aaattggaac tgtaggaggt 2521 ggaactaact tggctcccca gcaagcttgc ttacagatgc tgggagtgca aggagcaagc 2581 acagagacac cagggaaaaa cgcctgccag cttgctcaga tagtgtgtag cacagttatg 2641 gccggggaac tgtcactaat ggcggcttta gcagcgggtc acctcgtcaa gagtcacatg 2701 gtccacaaca ggtcaaaaat taaccttcag gacctgccag gcacgtgcac aaagaaagct 2761 gcctgattgg ccgtaggaag agccgaatct taaaatgaac ttctctgaga aacggtgctg 2821 tatctcatca caacacaagg tggcgccaca tctgcctgag aaatggaatt gcctcatcta 2881 catatggatt cccaggcccg agcttcacaa tcaccatgtt aatccagtct tttattgatg 2941 atggcatctt aaaaagaaac ttacataatt // LOCUS HUMPFKMIX 1139 bp ss-mRNA PRI 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM) alternatively spliced mRNA, partial cds. ACCESSION J05533 KEYWORDS phosphofructokinase. SOURCE Human (34 week old) placenta, cDNA to mRNA, clone HPFKM1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 62; 1110 to 1139) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1139) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. JOURNAL Unpublished (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 1139 muscle phosphofructokinase (AA at 3) (EC 2.7.1.11) /nomgen="PFKM" /map="1cen-q32" /hgml_locus_uid="LL0081L" BASE COUNT 285 a 262 c 337 g 255 t ORIGIN 1 gaaaaccaat cacctcagaa gacatcaaga atggcagcag gatgggtgtg gaagcagtga 61 tggcactttt ggaggggacc ccagataccc cagcctgtgt agtgagcctc tctggtaacc 121 aggctgtgcg cctgcccctc atggaatgtg tccaggtgac caaagatgtg accaaggcca 181 tggatgagaa gaaatttgac gaagccctga agctgagagg ccggagcttc atgaacaact 241 gggaggtgta caagcttcta gctcatgtca gacccccggt atctaagagt ggttcgcaca 301 cagtggctgt gatgaacgtg ggggctccgg ctgcaggcat gaatgctgct gttcgctcca 361 ctgtgaggat tggccttatc cagggcaacc gagtgctcgt tgtccatgat ggtttcgagg 421 gcctggccaa ggggcagata gaggaagctg gctggagcta tgttgggggc tggactggcc 481 aaggtggctc taaacttggg actaaaagga ctctacccaa gaagagcttt gaacagatca 541 gtgccaatat aactaagttt aacattcagg gccttgtcat cattgggggc tttgaggctt 601 acacaggggg cctggaactg atggagggca ggaagcagtt tgatgagctc tgcatcccat 661 ttgtggtcat tcctgctaca gtctccaaca atgtccctgg ctcagacttc agcgttgggg 721 ctgacacagc actcaatact atctgcacaa cctgtgaccg catcaagcag tcagcagctg 781 gcaccaagcg tcgggtgttt atcattgaga ctatgggtgg ctactgtggc tacctggcta 841 ccatggctgg actggcagct ggggccgatg ctgcctacat ttttgaggag cccttcacca 901 ttcgagacct gcaggcaaat gttgaacatc tggtgcaaaa gatgaaaaca actgtgaaaa 961 ggggcttggt gttaaggaat gaaaagtgca atgagaacta taccactgac ttcattttca 1021 acctgtactc tgaggagggg aagggcatct tcgacagcag gaagaatgtg cttggtcaca 1081 tgcagcaggg tgggagccca accccatttg ataggaattt tgccactaag atgggcgcc // LOCUS J05525 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05525 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 4 a 8 c 3 g 5 t ORIGIN 1 ctcatctcag atacctggcc // LOCUS J05526 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05526 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 5 a 6 c 5 g 4 t ORIGIN 1 actcagcgag gtacttgcac // LOCUS J05527 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05527 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 7 a 8 c 3 g 2 t ORIGIN 1 tctcaaccag acaaggaccc // LOCUS J05528 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05528 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 7 a 3 c 4 g 6 t ORIGIN 1 catcaagaat gttcgtatga // LOCUS J05529 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05529 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 2 a 5 c 6 g 7 t ORIGIN 1 gtcccttcag ctggtggtta // LOCUS J05530 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05530 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 6 a 4 c 5 g 5 t ORIGIN 1 cagaattctg gtaagtcacg // LOCUS J05531 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05531 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 4 a 6 c 8 g 2 t ORIGIN 1 gctcctgcag ggcagcagga // LOCUS J05532 20 bp ds-DNA BAD 29-JUN-1990 DEFINITION Human muscle phosphofructokinase (PFKM). ACCESSION J05532 REFERENCE 1 (bases 1 to 20) AUTHORS Sharma,P.M., Reddy,G.R., Babior,B.M. and McLachlan,A. TITLE Alternative splicing of the transcript encoding the human muscle isoenzyme of phosphofructokinase JOURNAL J. Biol. Chem. 265, 9006-9010 (1990) STANDARD simple staff_entry COMMENT Intron/exon boundary sequence. McLachlan promises to send us full exon sequence before it is published in August 1990 using the same accession numbers. BASE COUNT 4 a 3 c 7 g 6 t ORIGIN 1 tgtccaggtg gtaagtactg //
GenBank-Updates@genbank.bio.net (06/30/90)
LOCUS DROGNBPSA1 441 bp ds-DNA INV 30-JUN-1990 DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha subunit gene, exon 1. ACCESSION M33996 KEYWORDS guanine nucleotide-binding protein. SEGMENT 1 of 3 SOURCE D.melanogaster (strain Canton S) embryo DNA, clones lambda-DGs-[4,5,7,10,11,12]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 441) AUTHORS Quan,F. and Forte,M.A. TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by alternate splicing involving an unusual splice site JOURNAL Mol. Cell. Biol. 10, 910-917 (1990) STANDARD simple staff_entry FEATURES from to/span description pre-msg 362 > 441 guanine nucleotide-binding protein (Gs-alpha) mRNA and introns (alt.) pre-msg 174 > 441 Gs-alpha mRNA and introns (alt.) IVS 426 > 441 intron A signal 152 157 promoter region (put.) variant 365 365 c in DNA, g in cDNA BASE COUNT 140 a 89 c 85 g 127 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgcta ttctacatag ctcgggatac tttattttat tattaatatt cgctaaaata 61 gcacatcgtc atttggcata gatgtttttg gcccagtaac atatctatct acatcaccgc 121 ccgcactatc cgataggcag actgactatc gataaaagta tctacttcgt tgcatcccta 181 ggttgataca taaaataaaa aatatataaa aagtgagtta gtactggtat taaaagtaat 241 ttttaagcta caaattgatg aattgcgagc acacagagac taagaacatt gggaatccgg 301 aaaacagtgg ctggcgcgtc gggtgcgata gaattgcctg tttttcagac cccaaaagtc 361 gctcccaacc gagtcgtgca cctatagtct tggcccgaag ttgctcccat gaataaaaat 421 gttcggtaat ttcacggtaa a // LOCUS DROGNBPSA2 897 bp ds-DNA INV 30-JUN-1990 DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha subunit gene, exons 2,3 and 4. ACCESSION M33997 KEYWORDS guanine nucleotide-binding protein. SEGMENT 2 of 3 SOURCE D.melanogaster (strain Canton S) embryo DNA, clones lambda-DGs-[4,5,7,10,11,12]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 897) AUTHORS Quan,F. and Forte,M.A. TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by alternate splicing involving an unusual splice site JOURNAL Mol. Cell. Biol. 10, 910-917 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 265 540 guanine nucleotide-binding protein (Gs-alpha-S), exon 2 607 703 guanine nucleotide-binding protein (Gs-alpha-S), exon 3 760 + 880 guanine nucleotide-binding protein (Gs-alpha-S), exon 4 pep$ 265 540 guanine nucleotide-binding protein (Gs-alpha-L), exon 2 607 703 guanine nucleotide-binding protein (Gs-alpha-L), exon 3 760 + 880 guanine nucleotide-binding protein (Gs-alpha-L), exon 4 pre-msg < 1 > 897 Gs-alpha mRNA and introns IVS < 1 29 intron A IVS 541 606 intron B IVS 704 759 intron C IVS 881 > 897 intron D BASE COUNT 266 a 225 c 226 g 180 t ORIGIN About 1.4 kbp downstream of segment 1. 1 ccatctacta taccattgta cgaatgcagc ctgctgcaga gatccactga agcgctgagc 61 caacgccaac ggtggtgcca ctgagatcgg agatcggaga tcgcagaccg gagaccgacg 121 agcacacaag ggattgtggt ttgatccgag gaacccaagc actcagctat actggagcag 181 aacggagaaa gcacaagggg tgccgagtcg cccgacgcga accaggaaaa ccgagcgacg 241 tgttggtgga ccccgtgggc tgcgatgggt tgctttgggt cgcccacctc caagcagtcg 301 gacgtgaact cggaggactc gaagagccag aagcgccgga gcgatgcaat atctagacag 361 ttgcagaagg acaaacagct ctacagggcc acacacaggc tgctcctcct gggggcgggc 421 gagtccggca aatcaaccat agtcaagcaa atgcgaatat tgcatgtcga cggattttct 481 gactcggaaa agaaacagaa aattgatgat attaaaaaga atattcgaga cgctatcttg 541 gtgagttcca tcccgccggc catctaagcc caaacacttg cttaactcct tgccaatccc 601 ttacagacta ttacaggagc catgagcaca cttaatccac ctgtagcttt agaaaagaag 661 gaaaatgaac ccagagtgga gtacattcag gattatgcat ctagtaagag cctctgccgc 721 cccaacttaa gttgttatca accattatct gccccgcagg tccggacttt aattatcctc 781 ctgaatttta tgaacataca gaagaactat ggaaagacaa gggcgttctt caaacctatg 841 agaggtcgaa tgagtatcaa ttaatcgatt gtgcgaaata gtaagtaatg cccaccc // LOCUS DROGNBPSA3 1380 bp ds-DNA INV 30-JUN-1990 DEFINITION D.melanogaster guanine nucleotide-binding protein (Gs) alpha subunit gene, exons 5,6,7,8 and 9. ACCESSION M33998 KEYWORDS guanine nucleotide-binding protein. SEGMENT 3 of 3 SOURCE D.melanogaster (strain Canton S) embryo DNA, clones lambda-DGs-[4,5,7,10,11,12]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1380) AUTHORS Quan,F. and Forte,M.A. TITLE Two forms of Drosophila melanogaster Gs-alpha are produced by alternate splicing involving an unusual splice site JOURNAL Mol. Cell. Biol. 10, 910-917 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 4 132 guanine nucleotide-binding protein (Gs-alpha-S), exon 5 206 385 guanine nucleotide-binding protein (Gs-alpha-S), exon 6 445 569 guanine nucleotide-binding protein (Gs-alpha-S), exon 7 746 828 guanine nucleotide-binding protein (Gs-alpha-S), exon 8 909 1055 guanine nucleotide-binding protein (Gs-alpha-S), exon 9 pep$ + 4 132 guanine nucleotide-binding protein (Gs-alpha-L), exon 5 206 385 guanine nucleotide-binding protein (Gs-alpha-L), exon 6 445 569 guanine nucleotide-binding protein (Gs-alpha-L), exon 7 755 828 guanine nucleotide-binding protein (Gs-alpha-L), exon 8 909 1055 guanine nucleotide-binding protein (Gs-alpha-L), exon 9 pre-msg < 1 1115 Gs-alpha mRNA and introns (alt.) pre-msg < 1 1176 Gs-alpha mRNA and introns (alt.) pre-msg < 1 1243 Gs-alpha mRNA and introns (alt.) IVS < 1 3 intron D IVS 133 205 intron E IVS 386 444 intron F IVS 570 745 intron G (for Gs-alpha-S) (no splice consensus) IVS 570 754 intron G' (for Gs-alpha-L) IVS 829 908 intron H variant 780 780 t in DNA, c in cDNA signal 1083 1088 polyA signal signal 1154 1159 polyA signal signal 1221 1226 polyA signal BASE COUNT 441 a 265 c 254 g 420 t ORIGIN About 500 bp downstream of segment 2. 1 aagcttcctg gaccgagtga gcacaatcaa gaatccaaac tacaccccta atgagcagga 61 tattcttcgg tgccgtgttt tgacttctgg aatatttgaa acaagatttc aagtggacaa 121 agtaaacttt cagtaagtgt aatcccaagt gatatcgcaa tattttagtt tctattccat 181 tctaatgata tgcattcgtg tgcagcatgt tcgatgtcgg tggccagcgg gacgagcgta 241 ggaaatggat tcagtgtttc aatgatgtaa ctgctatcat attcgtaact gcgtgctcaa 301 gttataacat ggttttgcgg gaagatccca cccagaaccg acttcgagaa tctttggatt 361 tgttcaagag tatttggaac aacaggtgaa ccgagcaatt tggcgccatc tcgacaaatg 421 gaaactaacg agttgttctt tcagatggct tcgcacgatt tctattatac tatttttaaa 481 taagcaagat ttgttagcag agaaaattaa ggctggaaaa agtaaattgt cggaatattt 541 ctccgagttt aacaaatacc aaacgccaag taagtaaaat agtataaccc atagtagagg 601 ccaattgagg gcgagggagt catcagatca accgactcag atctggccaa gaaattcgca 661 attccttttc aaatttgcaa accgtgccca agttaactta agaaatttaa tggataatgt 721 attttgtcta atttcaaatg tgctgtcgac acaggtgacg caataatgga atccaatgat 781 gacccagaag taatacgagc aaaatatttc atacgagacg agtttctggt gggttatatc 841 acatctccgt ttcttcctcc acctatctaa ctagccttaa tccttaatgg gatctctttc 901 ttatatagcg tatatctacc gctagcggag acggaaaaca ctactgctat ccacatttca 961 catgcgccgt tgacacagaa aacattaaac gtgtgtttaa tgattgcaga gacattattc 1021 aaaggatgca ccttcgtcaa tatgaattgt tataggttat ccccatcgcc gtaatgcaag 1081 taaataaaaa tattaatgac ttactgtttt aaatataatt atcaaatgta atttgtattt 1141 aaatagtccc ttgattaaaa caaaaaattc gctgcattta gatctggtat gacctttcac 1201 aaaactcaac attaaagaat aataaataaa catgttaata acagacacgc ttgtttcgat 1261 tattttggct attctaccag gatgcacctg ctttattttc tatagtccct tgacatgaag 1321 cattttagag tatttccgct atcagagtcc gtatccatat ccatgggcat ttacattcac // LOCUS SYCSBEFA 850 bp ds-DNA BCT 30-JUN-1990 DEFINITION Synechocystis sp. cytochrome B559 alpha subunit (psbE) and beta subunit (psbF), and photosystem II L protein genes, complete cds. ACCESSION M33897 X06988 KEYWORDS cytochrome B559 alpha subunit; cytochrome B559 beta subunit; photosystem II protein L. SOURCE Synechocystis sp. (strain PCC6803) DNA, clone pKW1261. ORGANISM Synechocystis sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 553) AUTHORS Pakrasi,H.B., Williams,J.G.K. and Arntzen,C.J. TITLE Targeted mutagenesis of the psbE and psbF genes blocks photosynthetic electron transport: Evidence for a functional role of cytochrome B559 in photosystem II JOURNAL EMBO J. 7, 325-332 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 850) AUTHORS Pakrasi,H.B., Nyhus,K.J. and Granok,H. TITLE Targeted deletion mutagenesis of the beta subunit of Cytochrome B559 protein destabilizes the reaction center of photosystem II JOURNAL Z. Naturforsch., C, Biosci. 45, 423-429 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.B.Pakrasi, 01-MAY-1990. Washington University Dept. of Biology Campus Box 1137 St. Louis, MO 63130 FEATURES from to/span description pept 121 366 cytochrome B559 alpha-subunit (psbE) pept 403 537 cytochrome B559 beta subunit (psbF) pept 547 666 L protein of photosystem II (PS2) binding 109 112 ribosomal binding site (put.) binding 392 396 ribosomal binding site (put.) BASE COUNT 197 a 196 c 197 g 260 t ORIGIN 1 gctaccccag caccgaggtg gcaatggttc cctaggcggc tcacaaaata gtagactaga 61 ctctacttgc tttgcatttg tcagtcaatg ttgttttgaa aaattgaagg agaacacaaa 121 atgtcaggga ctaccggcga gcgtccattt tccgatattg tcaccagcat tcgctactgg 181 gtgatccaca gcatcaccat cccgatgttg tttattgctg gttggttgtt tgtcagcacg 241 ggcttagcct acgatgcttt tggcactccc cgccccgatg aatatttcac ccagacccgt 301 caagagttgc ccattctcca ggaacgctac gacattaatc aggaaattca agagtttaat 361 caataaaaca tttaattgtt cttttttagt tggtaattaa caatggcaac ccaaaatcct 421 aatcaaccgg ttacttatcc catttttacg gtgcgctggc tggcggttca caccctggcg 481 gtgccctctg tcttctttgt cggggcgatc gccgcgatgc aatttattca acgctaggag 541 tttttcatgg acagaaattc aaacccaaac cgccaaccgg tggaattgaa ccgcacttct 601 ttatacctgg gtctattgtt ggtggctgtg ttggggattt tgttctccag ctatttcttt 661 aactaaactt ttttaatacg caatttagga ggcatggtat gttcgcagaa ggcagaatcc 721 ctttgtgggt ggtgggtgta gtggccggta ttggcgccat tggtgttcta ggattatttt 781 tctacggagc ctatgctggt ttaggttctt ccatgtaatc gagggctagc cgccacacaa 841 tatcatggtt // LOCUS MUSBFGFR 2526 bp ss-mRNA ROD 30-JUN-1990 DEFINITION Mouse basic fibroblast growth factor receptor (bFGF-R) mRNA, complete cds. ACCESSION M28998 KEYWORDS fibroblast growth factor receptor; transmembrane protein; tyrosine kinase. SOURCE Mouse (strain CBA/CaH), cDNA to mRNA, clone NE2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2526) AUTHORS Reid,H.H., Wilks,A.F. and Bernard,O. TITLE Two forms of basic fibroblast growth factor receptor-like mRNA are expressed in the developing mouse brain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 1596-1600 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.Bernard, 11-OCT-1989. FEATURES from to/span description pept 58 2526 fibroblast growth factor-receptor precursor sigp 58 117 fibroblast growth factor-receptor signal peptide (put.) matp 118 2523 fibroblast growth factor-receptor BASE COUNT 579 a 710 c 708 g 529 t ORIGIN 1 gtggaatatc catggaggta cggagccttg ttaccaacct ctaaccgcag aactgggatg 61 tggggctgga agtgcctcct cttctgggct gtgctggtca cagccactct ctgcactgcc 121 aggccagccc caaccttgcc cgaacaagct cagccctggg gagtccctgt ggaagtggag 181 tctctcctgg tccaccctgg cgacctgcta cagcttcgct gtcggcttcg cgatgatgtg 241 cagagcatca actggctgcg ggatggggtg cagctggtgg agagcaaccg tacccgcatc 301 acaggggagg aggtggaggt gcgggactcc atccccgctg actctggcct ctacgcttgc 361 gtgaccagca gcccctctgg cagcgatacc acctacttct ccgtcaatgt ctcagatgca 421 ctcccatcct cggaagatga tgacgacgac gatgactcct cctcggagga gaaagagacg 481 gacaacacca aaccaaaccg taggcctgta gctccctact ggacatcccc agagaaaatg 541 gagaagaaac tgcatgcggt gcccgctgcc aagacggtga agttcaagtg cccgtcgagt 601 gggacaccca accccactct gcgctggttg aaaaatggca aagagtttaa gcctgaccac 661 cgaattggag gctacaaggt tcgctatgcc acctggagca tcataatgga ttctgtggtg 721 ccttctgaca agggcaacta cacctgcatc gtggagaatg agtatgggag catcaaccac 781 acctaccagc ttgacgtcgt ggaacgatct ccgcaccgac cccatccttc agcagggctg 841 cctgccaaca agacagtggc cctgggcagc aatgtggagt tcatgtgtaa ggtgtacagc 901 gatccgcagc ctcacattca gtggctgaag cacatcgagg tgaacgggag taagatcggg 961 ccagacaact tgccgtatgt ccagatcctg aagactgctg gagttaatac caccgacaag 1021 gaaatggagg tgcttcatct acggaatgtc tcctttgagg atgcggggga gtatacgtgc 1081 ttggcgggta actctatcgg actctcccat cactctgcat ggttgaccgt tctggaagcc 1141 ctggaagaga gaccagctgt gatgacctca ccgctctacc tggagatcat tatctactgc 1201 accggggcct tcctgatctc ctgcatgttg ggctctgtca tcatctataa gatgaagagc 1261 ggcaccaaga agagcgactt ccatagccag atggctgtgc acaagctggc caagagcatc 1321 cctctgcgca gacaggtaac agtgtcagct gactccagtg catccatgaa ctctggggtt 1381 ctcctggttc ggccctcacg gctctcctcc agcgggaccc ccatgctggc tggagtctcc 1441 gaatatgagc tccctgagga tccccgctgg gagctgccac gagacagact ggtcttaggc 1501 aaaccacttg gcgagggctg cttcgggcag gtggtgttgg ctgaggccat cgggctggat 1561 aaggacaaac ccaaccgtgt gaccaaagtg gccgtgaaga tgttgaagtc cgacgcaacg 1621 gagaaggacc tgtcggatct gatctcggag atggagatga tgaaaatgat tgggaagcac 1681 aagaatatca tcaaccttct gggagcgtgc acacaggatg gtcctcttta tgtcattgtg 1741 gagtacgcct ccaaaggcaa tctccgggag tatctacagg cccggaggcc tcctgggctg 1801 gagtactgct ataaccccag ccacaacccc gaggaacagc tgtcttccaa agatctggta 1861 tcctgtgcct atcaggtggc tcggggcatg gagtatcttg cctctaagaa gtgtatacac 1921 cgagacctgg ctgctaggaa cgtcctggtg accgaggata acgtaatgaa gatcgcagac 1981 tttggcttag ctcgagacat tcatcatatc gactactaca agaaaaccac caacggccgg 2041 ctgcctgtga agtggatggc ccctgaggcg ttgtttgacc ggatctacac acaccagagc 2101 gatgtgtggt cttttggagt gctcttgtgg gagatcttca ctctgggtgg ctccccatac 2161 cccggtgtgc ctgtggagga acttttcaag ctgctgaagg agggtcatcg aatggacaag 2221 cccagtaact gtaccaatga gctgtacatg atgatgcggg actgctggca tgcagtgccc 2281 tctcagagac ctacgttcaa gcagttggtg gaagacctgg accacattgt ggccttgacc 2341 tccaaccagg agtatctgga cctgtccata ccgctggacc agtactcacc cagctttccc 2401 gacacacgga gctccacctg ctcctcaggg gaggactctg tcttctctca tgagccgtta 2461 cctgaggagc cctgtctgcc tcgacacccc acccagcttg ccaacagtgg actcaaacgg 2521 cgctga // LOCUS BSSBIODAYB 4382 bp ds-DNA BCT 30-JUN-1990 DEFINITION B.sphaericus bioDAYB operon genes, complete cds. ACCESSION M29292 KEYWORDS adenosylmethionine-8-amino-7-oxononanoate aminotransferase; bioB; biotin synthase; dethiobiotin synthase. SOURCE B.sphaericus (strain IF03525) DNA. ORGANISM Bacillus sphaericus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 456 to 511) AUTHORS Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S., Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U. TITLE Cloning and characterization of the Bacillus sphaericus genes controlling the bioconversion of pimelate into dethiobiotin JOURNAL Gene 87, 63-70 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 4382) AUTHORS Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S., Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Goleckler, 20-OCT-1989, for release after publication. FEATURES from to/span description pept 509 1213 dethiobiotin synthase (bioD) (ttg start codon) pept 1194 2561 adenosylmethionine-8-amino-7-oxononanoate aminotransferase (bioA) (gtg start codon) pept 2566 3156 biotin synthase (bioY) pept 3156 4154 bioB protein (gtg start codon) mRNA < 456 4364 bioDAYB mRNA binding 491 503 ribosome binding site binding 1179 1186 ribosome binding site binding 2545 2556 ribosome binding site binding 4317 4364 ribosome binding site BASE COUNT 1376 a 746 c 952 g 1308 t ORIGIN 1 aagctttgca cacttctgtt tcgtatcctc atattgaact tgatgaaacc ttcctatggc 61 cgtatgcatt gagatttttt ctcgatgttc tgcttgcaat gttcgatatt cttcttgccg 121 aatagctaca cgataccaaa attcataacg caacggtaaa tctcttattt cgtaagtaag 181 caaagtattt aaaatactgc tcatttgttc atatgtatct agctttttat ctgtctcctt 241 aaatagtcca aacattttgc caccccctgt tttgattaat actacaacct atgataaaaa 301 ccctttaata tttcttggga aataatccaa cgttgataaa acggggtgaa tatccgatca 361 atcgagtgaa atttaggata gaataccctc ggaaaaagca ttatctgaat catttatgta 421 aaaatgcaaa aaaaggcatt tacaaaagga aaaagaatgt gttaacttaa aaactatagt 481 tggttaacta aaagaggggg aggtacagtt gcaacacttt tgggttgttg gaacagatac 541 agatgttgga aaaacatttg tcaccacatt attaatgcgt aatttgcaaa aacagggcgt 601 acgtgtaacg ccttataaac cagtccaaac tggtgaagtg tatgatggtg aacaagccta 661 ttacttcgac acagcgatgt atgaaaaata ttccttgcaa ttgctagaca gagagaattt 721 aaatggctat tcatttaaag aggctgcatc gccacatttt gcggctcaac tggaggggca 781 gcaaattgac acacagcagt tattaaagca aatgcaactt ttacagcaaa catgggatgt 841 tgttatttgt gaaggagcgg gtgggctctt tgtgccatta gatgcatgtg gcgaaacgac 901 attgttggat gtcattgttg aaagtaaact acccgttgtc gtggttacac gaacagcact 961 aggaacaatt aaccatacgc tcttaacgtt agaggcattg actacacgga aaattgaagt 1021 gcttggtctt gtatttaacg gtgatatggg gagcaggatg gagcaagaca atatccaaac 1081 gattttacag tattatacat tgccctatat gacgatacca aagctggaag agctgtcgga 1141 cattaatgag tatgcaatta cgggcacatc attgtttgaa aggctgatta gacgtgaaac 1201 aagtattaac tgagctacaa gaaaaagatt tacaacatgt ctggcatcct tgctcacaaa 1261 tgaaagatta tgaggctttt ccaccaatcg ttataaaaaa aggcgaaggt gtatggctgt 1321 atgatgaaca gaatcaacgc tatcttgatg cggtatcttc atggtgggtc aatttatttg 1381 gacatgccaa tccacgtatt agccaagcat taagtgaaca agcatttacg ttggagcata 1441 caatttttgc gaatttttca catgagccag cgattaaact cgcacaaaaa ttagtagctt 1501 taacaccaca aagtttacaa aaagtatttt ttgcagataa tggttcatct gctatagaag 1561 tcgctttaaa aatgagtttt caatatcata tgcaaacggg gaaaacgcaa aaaaaacgct 1621 ttttggcatt aacggatgcc taccatggtg aaacattagg tgctttatcc gtcggtggcg 1681 tagatcttta taacgaagtg tatcaaccac tgttattgga tacggtacga gcacaaggcc 1741 cagattgttt ccgttgccca ttcaagcatc atccggatag ttgccatgcc caatgtatta 1801 gttttgtaga ggatcagttg cgcatgcatc ataaggaaat tacggcggtt attattgagc 1861 cactcattca agcggcagca gggatgaaaa tgtatccagc tatttatttg cgacgtttac 1921 gtgaactatg tacgcaatat gatgtgcatc taattgcaga cgaaattgct gtaggttttg 1981 ggcgcacagg tacacttttt gcctgtgagc aggctaatat ctctccggat tttatgtgtt 2041 tatcaaaagg tttaacaggt gggtatttac cactgtctgt cgtaatgacg acgaatgatg 2101 tatatcaggc attttatgat gattatgcca cgatgaaggc gtttttacat tcacatagtt 2161 acacagggaa tacacttgcc tgccgtgttg ctctagaggt attggcgata tttgaagaag 2221 aacagtatat agacgttgtg caagacaaag gtgaacgcat gcgaaagcta gccttggagg 2281 cttttagtga tttacctttt gttggtgaat atcggcaagt tgggtttgtc ggggcgattg 2341 aacttgtggc gaatcgcgat accaaagagc cattaccgag tgaggagcgc atcggctatc 2401 aaatatacaa aagagcttta gcaaaagggt tactgattcg tccacttggg aatgttttgt 2461 atttcatgcc accatacatt ataacggacg atgaaatgca atttatgatt caaacaacaa 2521 aagatacaat tgttcaattt tttgaagagc gggagggatg agggcatgtt gaaacaacag 2581 tcaacgttat cacttgtgat gattgcgatg tttgctgcat taacagcagt tggtgccttc 2641 attaaaattc cattaccgct cgtgccgttt acattacaaa ttgtctttgt ctttttagcg 2701 ggttgcttac tcggtggtcg caatggattt caaagtcagc tagtttacat aggaataggt 2761 ttagttggct tgccagtttt tacacaaggt ggaggcatta catatgtatt gcagccgact 2821 tttggttact taataggatt tgctcttgct gcattagtaa tcggctatat gattgatcga 2881 gtagaatcac caacgaaaaa gcatttcatt gttgccaata ttatagggct tatcattatt 2941 tatgcagtcg cagtacctta tttatatgta gcattaaatg tatggttaaa catgaaatca 3001 agttggtctc atgtattttt agtaggcttt gtcaatagta ttgttgcaga cttttgctta 3061 gcaattgctt ctgccctttt agctgaacgt ctatacaaag tattccgttc cgctagagct 3121 ataaaacttg tgcaaattga aaaggagaat gtttagtgaa ttggttacaa ttagcagatg 3181 aagtgattgc aggcaaggta attagcgatg atgaggcact tgccatttta aatagtgatg 3241 atgatgatat tttaaagcta atggacggcg catttgccat tcgtaagcac tattacggta 3301 aaaaagtaaa gttaaatatg attatgaatg ctaaaagtgg ctattgccca gaggattgtg 3361 gctattgctc gcagtcatct aaatcgaccg ctcctattga gaaatatccg ttcattacaa 3421 aagaagaaat attagcgggg gcaaagcgtg cgtttgaaaa taaaattggt acgtattgca 3481 tcgtcgcaag cggacgtggg ccgactcgta aagatgtcaa tgtagtgagt gaagccgttg 3541 aagaaattaa agcaaaatat ggcttaaaag tttgcgcttg cttaggttta ctaaaagaag 3601 aacaagcaca acaattaaaa gaagcgggtg ttgatcgcta caatcataac ttaaatacat 3661 cagagcgtca ccattcctat attacgacga cgcacacata tgaggatcgt gttaataccg 3721 ttgaggttgt aaagaaacat ggtatttccc catgttctgg agccattatt gggatgaaag 3781 aaacgaaaat ggatgtcgtg gaaattgcac gcgcattgca tcagttggac gcggattcaa 3841 ttccagttaa cttcttacat gcaattgatg gaacgaaact tgaaggaaca caggacttaa 3901 atcctcgcta ttgcttaaaa gtattagcgt tattccgcta catgaatcct tcgaaggaaa 3961 ttagaatttc cggtggtcgc gaagtcaatt taggattcct tcagccattt ggactgtatg 4021 cagcaaatag tatttttgtt ggggattact taactactga aggacaagaa gccaatagcg 4081 attatcgtat gcttgaagat ttgggctttg aaatcgagct gacacaaaag caagaagaag 4141 cattttgttc ttaattcaac caatcattat gaaataaaat ctactactac acaatatgat 4201 tacctcaaaa ccgtgtgagc gtcgtggaaa aggcgcacag acggtttttt ggtcgataaa 4261 agagaaggag aaaggtaaat aaatggttcc gataatatac ctataaaatg atggttttca 4321 caaaatgttc aatgaaagcg ttttgaaatt gaacagtttg tgaagggctt cacataaagc 4381 tt // LOCUS BSSBIOXWF 4530 bp ds-DNA BCT 30-JUN-1990 DEFINITION B.sphaericus bioXWF operon genes, complete cds. ACCESSION M29291 KEYWORDS 7-keto-8-aminopelargonic acid synthetase; bioF; bioW; bioX. SOURCE B.sphaericus (strain IF03525) DNA. ORGANISM Bacillus sphaericus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1081 to 1151) AUTHORS Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S., Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U. TITLE Cloning and characterization of the Bacillus sphaericus genes controlling the bioconversion of pimelate into dethiobiotin JOURNAL Gene 87, 63-70 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 4530) AUTHORS Gloeckler,R., Ohsawa,I., Speck,D., Ledoux,C., Bernard,S., Zinsius,M., Villeval,D., Kisou,T., Komogawa,K. and Lemoine,U. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Goleckler, 20-OCT-1989, for release after publication. FEATURES from to/span description pept 1149 1649 bioX protein pept 1630 2367 bioW protein pept 2364 3533 7-keto-8-aminopelargonic acid synthetase (bioF) mRNA < 1081 3560 bioXWF mRNA binding 1132 1140 ribosome binding site binding 1612 1621 ribosome binding site binding 2350 2355 ribosome binding site BASE COUNT 1426 a 768 c 958 g 1378 t ORIGIN 1 aagctttact acaaccagga ggatgtagtt taggcgctgt tttaccatct aaaacaagtg 61 atggtaaaac ttatgtcctt agaaattacg atttgtcacc agctatatct gatatgagac 121 tttgtacaac taaggtaaaa gggaaatata gtcatactgg attctctgtt tcgtactttg 181 gtcgaagtga aggcttaaat gaagagggat tctgtgtagc ttttgcctca tgcggtattc 241 cagttggaaa acatccagga atgaaaaagc ctatcctaaa aggacttcag tttatggtaa 301 tagtaagagc tttacttgaa aattgtaaag atgttgagga aggaattact tatttagaga 361 atatgcctat tggaacaaat atgaatttgc tactgtcaga tgccaaaggg aatgtagctt 421 tagtagaaac gtatgatgga gaaaagttcg tagaaagggg caatcaaaaa tcaggatttt 481 tgattgcaac taatcacgct gtaatgccca aaataatgaa gttagaagac agaaaattag 541 aacaatctga aatacgttat aattttttga aaaataactt agagagtgat gattttttta 601 caaagaataa attacaacaa ttaatgttta atgaataccc taatggtgta acagtgcaca 661 atttcgaaga aaatttctga actgttcatt caattctatt taatcttaac gacaaacaat 721 tagatatttc tttcggttcc cctataaata ataaaatcta taaattaaaa gtaggagaaa 781 ccttgccttt taatgaatta gaagttttca tagagaactc aaactatgga ccaaacttct 841 ggaagctaat caaataataa tatggccaaa gaaacacttt cagctgcatg atgagccgag 901 agtgtttttt tatcattcct ctattcaata ataaatcgct tatttcacct tcttggaggt 961 ttgaaatcct cttgtttcat aggtttctgg cgttttcctt tttgacgaag cactgttaaa 1021 gctctatcct tttttgaaaa aaatatagaa aaacaatttt aatctacctt cctatctata 1081 aatgtgttaa cttaattatt attaaggtta actcaaattg aagaagttag aatgggagga 1141 ataggaggat gcgaaagttt tctacatatg atcttgctca gatttcatta ctagcttgtc 1201 ttattatcgt tacaggcatg tttaagattc caacaggtat tcctggatct gagtttcaat 1261 tatcagcacc gattgccgtt gcgattgcag cagtatttgg atttaagcga tattttcttg 1321 cgggaatcat tgcaagtcta atcttatttt tactaggtat acactccatc ttaaatgttg 1381 aaatttcaat aattttccga ttgactgttg gtctaatcat tgttttatta ggaacttcaa 1441 ttccggtact agttgtggca ggaccgattg gaacaatggt tgctagactt ggattggctt 1501 ttacgttagg gaccccgttt ttgccactat tcgttttggc gattccaggg atggtcatta 1561 cggctgtcag tgtttatcca ataacgaaaa tgttatatgc aattaataag aaagtagcag 1621 gtgatcatca tgttagaaac gtgttatagc attcgaatgc gtgcagctga aaaaaatctc 1681 gaaggaggag aaaagcatat atctggtggg gaacggatag ggagtgaatt tcaaatagag 1741 ccaattgtaa aacagttatt gaacaaagca aggaatcatt cgcgcggaga tgctgacttt 1801 attcaaatta ccgttgaaaa acttacaggt gatcagatac tgtatatgcc accgttagaa 1861 ataacgacaa ttgatgagag ttcaattgaa agggcacata aagaagctag gagtatatta 1921 acctcagtag gtgtttccaa gcaggcacaa aatgttgctt ttcatctact tgctagtaat 1981 caaaatcttc gtggggctat cctccttcat agtcaaactg gcttacgact tgacaatcgc 2041 ggactgaaag gcgttcgtgt atcacgaatc gattggcaag acgctgatgt aggttacaat 2101 gagcgtgttc gtgaagcgct agctctggca acgaaagtgg caaattctcc gtataccatc 2161 gcagaattat gttggtcaga tgatccagaa tacgttactg gctatgtaag caatcatgag 2221 attggttatg tcagaattac gcctttaaaa agggaaggct gtgaaagtgg cggacgtatt 2281 ttttttgtgt cagatgaagt tgagctagaa tcatatatac actatttaga aagagaacct 2341 attctcatta gggggcattt aaaatgaatg atcgctttcg aagggaactg caagtaatag 2401 aagagcaagg attgacaagg aagttacgtt tgttttcaac tggaaatgaa agtgaggtag 2461 tgatgaatgg taagaaattt ttgctatttt catcgaataa ctacttaggc cttgcaacag 2521 atagtcgttt gaaaaagaaa gcaactgaag gcattagtaa atacggtaca ggggctggcg 2581 gttctcgact tacaactgga aacttcgaca ttcatgaaca gctagaatct gaaattgcag 2641 attttaaaaa gactgaagcg gccattgtat tcagcagtgg gtatttagcg aacgtaggtg 2701 tgatttcgag cgtgatgaag gcaggagata ctatcttttc tgatgcttgg aatcacgcga 2761 gtattataga tggttgtcga ttaagtaaag ccaaaacgat tgtttatgaa catgcggata 2821 tggtggattt agagcggaaa ttaaggcaat cacatgggga tggattgaag ttcatcgtaa 2881 cggatggcgt ttttagtatg gatggtgata ttgcgccact tccaaaaata gtagagttag 2941 ccaaggaata caaagcgtac ataatgattg atgatgcgca tgcaacaggt gttcttggca 3001 atgatggttg tggtaccgct gattattttg gtttgaaaga tgagattgat tttacagtag 3061 gcacgttgag taaagcgatt ggtgcagagg gtggatttgt atcgacatca tccattgcta 3121 agaactattt gttaaataac gcccgatctt ttattttcca aacagcttta tcgccaagtg 3181 cgattgaagc agcgcgagaa ggcatttcca tcatacagaa tgagcccgag cggagaaagc 3241 aattgctgaa aaatgcgcag tacttacgat tgaaattaga ggaatctggt tttgtaatga 3301 aagaagggga aacacctatt atttctctta tcattggtgg ttctcatgaa gccatgcagt 3361 tttctgcgaa actactggat gaaggtgtct ttattccagc gattcgacca ccaacagtgc 3421 cgaaagggtc aagtcggttg cgtataacgg taatggctac acatacaata gagcagctcg 3481 atatggtcat tagtaaaatt aagaaaatag gaaaagaaat ggggattgta taattgtttg 3541 agtgcctggc actcaaacaa ttttttgatt gctttttctt attaattaca tagcactaac 3601 cttcaagttc tgcaattgcc tctaattcaa ttaagcagcc atggcttaat tctttaattg 3661 ggataatggc gcgggcaggc ttatgactac cgaaaaattg tgcataagtt tcatttacgt 3721 ccgcccataa atcagcagat gttatataga cgcgacacat aacaacagcc tccttcgtaa 3781 gaccgcttgc ttgaagcacg acttccatct tttgaagcgc catgaatgtt tcagcgtgaa 3841 tgccaccagt tggaggtttg cctgtagctg gatcgactga agtttgcccc gaaatatata 3901 aattcccgtt actaatcatg cctggtgtat aatgcccgat tggctcttgt tctatcaaaa 3961 ctttcttcat aataagttcc tccttataaa cgttgtaatt gatttagccc aatattgcca 4021 cctgaaataa ggaagcaaac atggtcgttt ttcgtaatcg gtaaattgcc tcgaagagct 4081 gctcctatcc cgatggcaga ggagggttca gctaatattt tgccttctag tagcaatgtt 4141 ttcatcccat taagtatgga agcctcgtct acacgcacaa tgtcatcaac taatttttca 4201 acaatcggga agtttcgctc cctaggttgt aaagttaata gagcatccgc taacgattgt 4261 tgttcagcta atagtaatct tttaccaact tgaatacttt ctgaatagcg agggacgacg 4321 gctggttcaa caccaaaaat tttaatatgt ggtgctaaag attttaccgc agttgcgaca 4381 ccacttagca gaccaccacc accaatagga acaacaattg cacttacctg agagaactgt 4441 tctaaaattt ccaatcctac tgtcccttgc cccgccataa tatcgtaatc atcataagga 4501 tgcacaatgg cataaccatg ttctaagctt // LOCUS HUMATPAS01 1458 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exons 2-5. ACCESSION M28284 KEYWORDS adenosine triphosphatase. SEGMENT 1 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 1458) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept / 35 148 adenosine triphosphate alpha subunit, exon 2 (EC 3.6.1.3) (AA at 35) 308 442 adenosine triphosphate alpha subunit, exon 3 876 993 adenosine triphosphate alpha subunit, exon 4 1109 + 1377 adenosine triphosphate alpha subunit, exon 5 IVS < 1 34 adenosine triphosphate alpha subunit intron A IVS 149 307 adenosine triphosphate alpha subunit intron B IVS 443 875 adenosine triphosphate alpha subunit intron C IVS 994 1108 adenosine triphosphate alpha subunit intron D IVS 1378 > 1458 adenosine triphosphate alpha subunit intron E BASE COUNT 267 a 493 c 429 g 269 t ORIGIN 1 gtcgacggat caacatcccc acatctcccc acagctgtac ctgggcatcg tgctggcggc 61 cgtggtgatc atcactggct gcttctccta ctaccaggag gccaagagct ccaagatcat 121 ggagtccttc aagaacatgg tgccccaggt gaagggtgcc cagcaagggc cagagggggt 181 gttagtgtat gggctggggc cgggcccagc aagggccaga gggggtgtta gtgtatgggc 241 tggggccggg ccagtgaccc ccaggcagag ggagtctggg aggtgacatt actccatccc 301 acctcagcaa gccctggtga tccgggaagg tgagaagatg caggtgaacg ctgaggaggt 361 ggtggtcggg gacctggtgg agatcaaggg tggagaccga gtgccagctg acctgcggat 421 catctcagcc cacggctgca aggtgggcct gtagggcccg gctttaccct ccctcgggct 481 gcccaggagc tcaggcccag ccctcctccc tcacactcag gagtccaggt cccaacccct 541 cctccctcag acccaggagt ccaggccccc aggccctcct ccctcagacc caggagttta 601 ggccctgggc acctcctcac ccaacagtca ggagtccagg cccagccctc acctttcgga 661 gatccttagg gaccctagac cttggccagc agcactgtgc cttccccacc ttcaagatcc 721 cagcctctgg cctcccacac accactcacc cactgggcac ccaggcttct agctgtgatc 781 tccaggcaca caggcttcag cccaagccct gtacacaaat accctcctgt cccaggccct 841 ggactgaacc ctctctctgc tgcacctacc cccaggtgga caactcctcc ctgactggcg 901 aatccgagcc ccagactcgc tctcccgact gcacgcacga caaccccttg gagactcgga 961 acatcacctt cttttccacc aactgtgtgg aaggtgaggc gggtgcagag aagacacaca 1021 gctgggcaga ccagggatgt gtcccagggg tcaggcctca gaacctccct gagccacccc 1081 acctcagcct aacccctctg ggctgcaggc acggctcggg gcgtggtggt ggccacgggc 1141 gaccgcactg tcatgggccg tatcgccacc ctggcatcag ggctggaggt gggcaagacg 1201 cccatcgcca tcgagattga gcacttcatc cagctcatca ccggcgtggc tgtcttcctg 1261 ggtgtctcct tcttcatcct ctccctcatt ctcggataca cctggcttga ggctgtcatc 1321 ttcctcatcg gcatcatcgt ggccaatgtc ccagagggtc tgctggccac tgtcactgta 1381 aggccaggct cctgggtctg ggtggggagg gcctgggggt ctgggctcct gggtctgagg 1441 gaggaggggc tgggatcc // LOCUS HUMATPAS02 200 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exon 6. ACCESSION M28285 KEYWORDS adenosine triphosphatase. SEGMENT 2 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 200) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 33 + 200 adenosine triphosphate alpha subunit, exon 6 (EC 3.6.1.3) IVS < 1 32 adenosine triphosphate alpha subunit intron E BASE COUNT 44 a 65 c 52 g 39 t ORIGIN 1 ccctgtaatt gcctgccttg ctcgtcctcc aggtgtgtct gaccgtgacc gccaagcgca 61 tggcccggaa gaactgcctg gtgaagaacc tggaggctgt agagaccctg ggctccacgt 121 ccaccatctg ctcagataag acagggaccc tcactcagaa ccgcatgaca gtcgccacca 181 tgtggtttga caaccagatc // LOCUS HUMATPAS03 200 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exon 7 (5' end). ACCESSION M28286 KEYWORDS adenosine triphosphatase. SEGMENT 3 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 200) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 67 / 200 adenosine triphosphate alpha subunit, exon 7 (EC 3.6.1.3) IVS < 1 66 adenosine triphosphate alpha subunit intron F (no splice consensus) BASE COUNT 43 a 68 c 55 g 34 t ORIGIN 1 ggccaccctc tgatcggtcc ccagctctcc atccatgaga ccgaggaccc caacgacaac 61 cgatacctgc tggtgatgaa gggtgccccc gagcgcatcc tggaccgctg ctccaccatc 121 ctgctacagg gcaaggagca gcctctggac gaggaaatga aggaggcctt tcagaatgcc 181 taccttgagc tcggtggcct // LOCUS HUMATPAS04 559 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exons 8 and 9. ACCESSION M28287 KEYWORDS adenosine triphosphatase. SEGMENT 4 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 559) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept / 155 330 adenosine triphosphate alpha subunit, exon 8 (EC 3.6.1.3) (AA at 157) 401 + 537 adenosine triphosphate alpha subunit, exon 9 IVS < 1 154 adenosine triphosphate alpha subunit intron G IVS 331 400 adenosine triphosphate alpha subunit intron H IVS 538 > 559 adenosine triphosphate alpha subunit intron I BASE COUNT 125 a 165 c 157 g 112 t ORIGIN 1 acttaatata tatatagtag tatgaaggaa gtatcctaga atgaatgctg actgccgtct 61 gctgatggga gatgaatgcg gcgatcagac atctaggcat gggtgcaggt tccgaggctg 121 gaaccctcac accccaaccc ttccctgcca ctaggtttct gccattatta cctgcccgag 181 gagcagtatc cccaaggctt tgccttcgac tgtgatgacg tgaacttcac cacggacaac 241 ctctgctttg tgggcctcat gtccatgatc ggcccacccc gggcagccgt ccctgacgcg 301 gtgggcaagt gtcgcagcgc aggcatcaag gtgtggcttg ggtgctgggg aggcaagcag 361 gcgtggcgga gaggccatcc ctaaaaaaca atgcctgcag gtcatcatgg tcaccggcga 421 tcaccccatc acggccaagg ccattgccaa gggtgtgggc atcatctctg agggcaacga 481 gactgtggag gacatcgccg cccggctcaa cattcccgtc agccaggtta accccaggtg 541 agccacccat tcccagcca // LOCUS HUMATPAS05 323 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exon 10. ACCESSION M35821 KEYWORDS adenosine triphosphatase. SEGMENT 5 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 323) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 62 + 212 adenosine triphosphate alpha subunit, exon 10 (EC 3.6.1.3) IVS < 1 61 adenosine triphosphate alpha subunit intron I IVS 213 > 323 adenosine triphosphate alpha subunit intron J BASE COUNT 59 a 106 c 97 g 61 t ORIGIN 1 gatcactttg ccactcctca cacaccctga cctcagccat cgctctctct gctcttccca 61 gggatgccaa ggcctgcgtg atccacggca ccgacctcaa ggacttcacc tccgagcaaa 121 tcgacgagat cctgcagaat cacaccgaga tcgtcttcgc ccgcacatcc ccccagcaga 181 agctcatcat tgtggagggc tgtcagagac aggtgggctg cgctcccgca gaggagggga 241 cggggccttg actcctgggt cctcactgag gccggggctg gtttcctggg tctgagggag 301 gaggggctgg gggtctgacc tgg // LOCUS HUMATPAS06 428 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exon 11. ACCESSION M35822 KEYWORDS adenosine triphosphatase. SEGMENT 6 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 428) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 147 + 315 adenosine triphosphate alpha subunit, exon 11 (EC 3.6.1.3) IVS < 1 146 adenosine triphosphate alpha subunit intron J IVS 316 > 427 adenosine triphosphate alpha subunit intron K BASE COUNT 83 a 110 c 147 g 88 t ORIGIN 1 tctagagagt ctgagcctaa ctcctagtct aggaagagag ttggacctaa cttctagtct 61 aagagagggc tgggtcctgg ctttctggat ctgaggaggg actctgggga ctggcctctg 121 gtgtcatcct taccctctct ccccagggtg caattgtggc tgtgaccggg gatggtgtga 181 acgactcccc cgctctgaag aaggccgaca ttggggtggc catgggcatc gctggctctg 241 acgtctccaa gcaggcagct gacatgatcc tgctggacga caactttgcc tccatcgtca 301 caggggtgga ggagggtgag ttggccaggg gtggcctgga gaccaggtca ctaccggagc 361 ctgagaccag caaggggaac tggccagggc tgcaggggga tgtgtggcag agaccaaggc 421 ccctgcct // LOCUS HUMATPAS07 939 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, intron J. ACCESSION M28288 KEYWORDS adenosine triphosphatase. SEGMENT 7 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 939) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description IVS < 1 > 939 adenosine triphosphate alpha subunit intron K BASE COUNT 287 a 197 c 250 g 205 t ORIGIN 1 ggatcccgtc ctgagtgaca cgagccagac acaaaagcat tcttagcatc caactccaac 61 tggagcaggc tcagcacccg tctctgggct aaaagtcagg ataatggtta tgtctaggag 121 gagggatata gactggaaga gagcagacaa aacctaaatg gggggctcgg aatggttttc 181 tatcttgact ggggtctggt ggttgcatgg ctgtgtacat gtgtaaaaat tgtatacacc 241 ctctctggtt gaatatttta atgagatata atatgcccag ctcagtgtat gcatgccata 301 tctcaataaa tattcaacca gtacaggttg tctaaagtaa acaaaataaa gattaatcca 361 ggccaggcgg tggggctaca cctgtaatcc caagactttg gagaccgaag caggaggatt 421 gcctgagatc aagagttcaa gaccagcctg gccaacatga tgaaacccca tctctactaa 481 aaatacaaaa attagctggg cttggtggtg ggcacctgta accccagcta cttgggaggc 541 tgaggcagga gaatcatttg aacctcagag gcagacgttg cagtgagccg caattgtgcc 601 attgcatcca gcctgggcaa cagagcaaga ctccatctca aaaaaataaa ataaaataaa 661 ataaacatta gtcaggcacg gtggctcgag cctgtggtcg cagctacttg ggaggctgag 721 gtgggaggat cacttgagga ggtcgaggtt gcagtgactg atgattgcag cactgggcta 781 ccagcctggg caacgaggtg gagaccttct ctctaaataa ataaataagt aaaaagatta 841 gcccatgaac aatgaggtca atcattaaga aagaacaggc gggcacggtg gcaaacacct 901 gtaatcccag cactttggga gcgctgaggc gggtggatc // LOCUS HUMATPAS08 1619 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exons 12 and 13. ACCESSION M28289 KEYWORDS adenosine triphosphatase. SEGMENT 8 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 1619) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 1069 1223 adenosine triphosphate alpha subunit, exon 12 (EC 3.6.1.3) 1301 + 1424 adenosine triphosphate alpha subunit, exon 13 IVS < 1 1068 adenosine triphosphate alpha subunit intron K IVS 1224 1300 adenosine triphosphate alpha subunit intron L IVS 1425 > 1619 adenosine triphosphate alpha subunit intron M BASE COUNT 372 a 419 c 489 g 339 t ORIGIN 1 ggatccaagg tccagctgtg gagttgtgtc tgtagctcgc cccttcttgc tgtgtagtat 61 ttgactgcat ggatgtaccg tggtttattc atccctcacc tgttgatgga catggggttg 121 actctggctt ttggctctga tgagtggtgc tatggtgaac attctcgtac ttgcttttcg 181 ctgggcataa acagtcctcc tctcttttgg gtataaaacc taggagtcgg attactcggt 241 cggtggggtg ggggatgttt gatagcaact ccgaaaccat cttccaaagt ggttatttca 301 agaatgtttc cacaaaggcg atctggctgg atgtttatgt ggaaaggtag gtgggtgagg 361 ggccaggtgt ggagacttat gcctgtaatc ccagcacttc gggaggctga gatggaatca 421 cagcccagaa attagacaag cctaggaaaa gtggtgagac tctgtctcta ctaaaaatac 481 aaaaattagc tggacgtggt ggcgcgtgac tgtagtctca gctacatggg aggctgaggc 541 aggaggatca cttgagcctg ggagtcgagg ttgcagtgag ccgtgatcat gccactgcac 601 tgcagcctgg gcaacagagt gagaccctgt cccaaaaaga aaataatggt gggtgggctg 661 gggggtagaa aaggacgcag gccgggcgcg gtcgtcacgc ctgtaatccc aggcacttca 721 ggaggccgag gagggcagat cacgaggtca ggagattgag accatgcagg ctaacacagt 781 gaaaccccgt ctctaataaa aatacaaaaa aattagccgg gcgtggtggc aggtgcctgt 841 agtgccagct actcgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 901 gcagtgagcc gagatcgcac cactgcactc cagcctgggc aacacagcga gactctgtct 961 caaaaaaaaa aaaaaaaaaa aaaaggacgt tggatgaggg cagaggaggg gcagagggag 1021 tggggctccc tggcatgggc gcctgctctg agcctgcctg tgccacaggc cgcctgatct 1081 tcgacaacct aaagaagtcc attgcctaca ccctgaccag caatatcccg gagatcacgc 1141 ccttcctgct gttcatcatg gccaacatcc cgctgcccct gggcaccatc accatcctct 1201 gcatcgatct gggcactgac atggtgagcc ctggcagcca cccttggggc caggagggtg 1261 gagtcctccc ctctccggct caccggcctc ctccgcctag gtccctgcca tctcactggc 1321 gtacgaggct gccgaaagcg acatcatgaa gagacagccc aggaacccgc ggacggacaa 1381 attggtcaat gagagactca tcagcatggc ctacgggcag attggtgagg caccggggac 1441 tccatctcct taccaccatg ccggcctaga gcatgcctgg ccaccgtgcg tgcttgggac 1501 cctggcattg actcagggga gcagacgtgg gcaggaccaa ccagtgagct atctgagggg 1561 ggggtctgca ccccatcctt ctccacctcc tcctctctgc tgttgatgtg tgcggatcc // LOCUS HUMATPAS09 276 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exon 14. ACCESSION M28290 KEYWORDS adenosine triphosphatase. SEGMENT 9 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 276) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 86 + 231 adenosine triphosphate alpha subunit, exon 14 (EC 3.6.1.3) IVS < 1 85 adenosine triphosphate alpha subunit intron M IVS 232 > 276 adenosine triphosphate alpha subunit intron N BASE COUNT 46 a 83 c 81 g 66 t ORIGIN 1 ccgggccctc tgaggtgccc tgggttaggc tgactggccc caatctgagc ctctgcctga 61 ggacttacct ctctcctcct tccaggaatg atccaggctc tcggtggctt cttctcttac 121 tttgtgatcc tggcagaaaa tggcttcttg cccggcaacc tggtgggcat ccggctgaac 181 tgggatgacc gcaccgtcaa tgacctggaa gacagttacg ggcagcagtg ggtgagtagg 241 gcagggtgct gtgtctctgc ccaccgtaag atcccc // LOCUS HUMATPAS10 567 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exon 15. ACCESSION M28291 KEYWORDS adenosine triphosphatase. SEGMENT 10 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 567) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 366 + 496 adenosine triphosphate alpha subunit, exon 15 (EC 3.6.1.3) IVS < 1 365 adenosine triphosphate alpha subunit intron N IVS 497 > 567 adenosine triphosphate alpha subunit intron O BASE COUNT 95 a 179 c 161 g 132 t ORIGIN 1 ctgatgccct tgccagccca tctgagcctt tgtcccactg ctgagctcgt ctgccctgcc 61 agccttcttc cccacctctg tctgtccctt caaagcccag ttgctgtctg tctccttggg 121 aggccacagg gtcttatcct cctccctggc ccctggtgga tgtgagatca ccattaactt 181 tcttcttcct tgtacgtctc ctgactcctc cctctgggac tatgagcccg cagaaggaag 241 acacacctga ggccctgagg accccatgca ggatgggcgg ggcaaagagc acggaacgtc 301 ggggtggcgg ctagggctgc agtgccacta actgacgtcc cgtgccctgg tcaccgctcc 361 tgcagacata cgagcagagg aaggtggtgg agttcacctg ccacacggcc ttctttgtga 421 gcatcgttgt cgcccagtgg gccgatctga tcatctgcaa gacccggagg aactcggtct 481 tccagcaggg catgaagtga gggccggggg acatggtgac tggacagcca tctgtctgtc 541 cgagtcgtct gtctgtcgtg tgtactc // LOCUS HUMATPAS11 212 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exon 16. ACCESSION M28292 KEYWORDS adenosine triphosphatase. SEGMENT 11 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 212) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 28 + 129 adenosine triphosphate alpha subunit, exon 16 (EC 3.6.1.3) IVS < 1 27 adenosine triphosphate alpha subunit intron O IVS 130 > 212 adenosine triphosphate alpha subunit intron P BASE COUNT 33 a 81 c 53 g 45 t ORIGIN 1 cccagccctg cccttctgtg cctccaggaa caagatcatg atcttcgggc tgtttgagga 61 gacggccctg gctgccttcc tgtcctactg ccccggaatg gacgtggccc tgcgcatgta 121 ccctctcaag tgagtgcccc gctgccccca gccctgccca caccagcgcc tgccatggag 181 cctcttcctt agactaggtc tgaactaggc cc // LOCUS HUMATPAS12 1047 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -activated adenosine triphosphatase alpha subunit gene, exon 17 and 18. ACCESSION M28293 KEYWORDS adenosine triphosphatase. SEGMENT 12 of 12 SOURCE Human placenta DNA, clones lambda-NK-alpha[3-2,10-3]. REFERENCE 1 (bases 1 to 1047) AUTHORS Ovchinnikov,Y.A., Monastyrskaya,G.S., Broude,N.E., Allikmets,R.L., Ushkaryov,Y.A., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulubova,I.E., Petrukhin,K.E., Gryshin,A.V., Sverdlov,V.E., Kiyatkin,N.I., Kostina,M.B., Modyanov,N.N. and Sverdlov,E.D. TITLE The family of human Na+, K+ -ATPase genes: A partial nucleotide sequence related to the alpha-subunit JOURNAL FEBS Lett. 213, 73-80 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 101 192 adenosine triphosphate alpha subunit, exon 17 (EC 3.6.1.3) 474 502 adenosine triphosphate alpha subunit, exon 18 IVS < 1 100 adenosine triphosphate alpha subunit intron P BASE COUNT 145 a 391 c 236 g 275 t ORIGIN 1 ggatcctggg agactgcccc tctgcgtccc ctccagtccc ctgaaactct gcctctcgtt 61 agggccccgc actcaagccc tcctgctctc ccctctgcag gcccagctgg tggttctgtg 121 ccttccccta cagtttcctc atcttcgtct acgacgaaat ccgcaaactc atcctgcgca 181 ggaacccagg gggtgaggga gctcggcaag gcagccgagg ggggcggggg gcagcagggt 241 ctcaggcagc tggtcccagg ctcccctcgc cctgctggat gctctgccac ctggttccca 301 ctcttctctc tcttcccatc tctccggcac ccactctgtc ttctcacggg tctctgtctg 361 tgtggtttcc ttgtctctct ccctctctgt ccctctctct gctgggcggc tcgccttgcc 421 tgtctctctc catctcttac tctgtctctt tctttccttc tttgtctctc caggttgggt 481 ggagaaggaa acctactact gacctcagcc ccaccacatc gcccatctct tccccgtccg 541 caggccagga ccgcccctgt cagtcccccc aattttgtat tctgggggga ggagccctct 601 cttcctgtgg ccccaccttg gcccccaccc cctccactat ctcctgccgc ccccactctg 661 gctggcttct ctcccctgcc ccaaacctct ctcctctctc ttttctgtgt cagtttctct 721 ccctctcctc acccctctat ccattcctcc cgccccagcc acctccctgg gctctttttt 781 actccccttc agccccccgg tgatgccatc tctggttctg gacaattatc aaatatatca 841 gtggggagag agaagcggtg tgtgtgttgt gcctgctttc cagacgggga ctgcggctgg 901 gacagcatcc cctctgttgg ggcgcgctgc caggcgactc tcgaaactgc aggaacttag 961 cctggcactg gcttggaagt cacggaatct cagagccatc taatcacgga gttttcaact 1021 ccacgaaagt cagagcagct tgaattc // LOCUS HUMATPASE1 309 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 2, clone lambda-NK-alpha-RD-16. ACCESSION M27571 KEYWORDS Na+, K+ -ATPase alpha-subunit. SEGMENT 1 of 3 SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 309) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept / 21 + 289 Na+, K+ -ATPase alpha subunit, exon 2 (AA at 23) IVS < 1 20 Na+, K+ -ATPase alpha subunit intron A IVS 290 > 309 Na+, K+ -ATPase alpha subunit intron B BASE COUNT 54 a 88 c 96 g 71 t ORIGIN 1 gatgccccac catgttgcag gcactgccag gggcattgtg attgccacag gagaccggac 61 ggtgatgggc cgcatagcta ctctcgcctc aggcctggag gttgggcgga cacccatagc 121 aatggagatt gaacacttca tccagctgat cacaggggtc gctgtattcc tgggggtctc 181 cttcttcgtg ctctccctca tcctgggcta cagctggctg gaggcagtca tcttcctcat 241 cggcatcata gtggccaacg tgcctgaggg gcttctggcc actgtcactg tgagtgggtc 301 aggctgagg // LOCUS HUMATPASE2 239 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 3, clone lambda-NK-alpha-RD-16. ACCESSION M27576 KEYWORDS Na+, K+ -ATPase alpha-subunit. SEGMENT 2 of 3 SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 239) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 219 Na+, K+ -ATPase alpha subunit, exon 3 IVS < 1 20 Na+, K+ -ATPase alpha subunit intron B IVS 220 > 239 Na+, K+ -ATPase alpha subunit intron C BASE COUNT 53 a 76 c 66 g 44 t ORIGIN 1 ttctcctctt tctctaccag gtgtgcctga ccctgacagc caagcgcatg gcacggaaga 61 actgcctggt gaagaacctg gaggcggtgg agacgctggg ctccacgtcc accatctgct 121 cggacaagac gggcaccctc acccagaacc gcatgaccgt cgcccacatg tggtttgaca 181 accaaatcca tgaggctgac accaccgaag atcagtctgg tgattgggtg ctccagcgg // LOCUS HUMATPASE3 150 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 4, clone lambda-NK-alpha-RD-16. ACCESSION M27578 KEYWORDS Na+, K+ -ATPase alpha-subunit. SEGMENT 3 of 3 SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 150) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 / 130 Na+, K+ -ATPase alpha subunit, exon 4 IVS < 1 20 Na+, K+ -ATPase alpha subunit intron C IVS 131 > 150 Na+, K+ -ATPase alpha subunit intron D BASE COUNT 31 a 49 c 38 g 32 t ORIGIN 1 tccccctcat ttcctcccag gggccacttt tgacaaacga tcccctacgt ggacggccct 61 gtctcgaatt gctggtctct gcaaccgcgc cgtcttcaag gcaggacagg agaacatctc 121 cgtgtctaag gtagggggtc aggacacaca // LOCUS HUMATPASA1 308 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 2, clone lambda-NK-alpha-TW-4. ACCESSION M27572 KEYWORDS Na+, K+ -ATPase alpha-subunit. SEGMENT 1 of 2 SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-TW-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 308) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept / 21 / 288 Na+, K+ -ATPase alpha subunit, exon 2 (AA at 22) IVS < 1 20 Na+, K+ -ATPase alpha subunit intron A IVS 289 > 308 Na+, K+ -ATPase alpha subunit intron B BASE COUNT 56 a 79 c 80 g 93 t ORIGIN 1 tccctcccct tctttttaag gaccgcacgt ggtattgttg tctacactgg ggatcgcact 61 gtgatgggaa gaattgccac acttgcttct gggctggaag gaggccagac ccccattgct 121 gcagaaattg aacattttat ccacatcatc acgggtgtgg ctgtgttcct gggtgtgtct 181 ttcttcatcc tttctctcat ccttgagtac acctggcttg aggctgtcat cttcctcatc 241 ggtatcatcg tagccaatgt gccggaaggt ttgctggcca ctgtcacggt aagaggcagg 301 tgatggtc // LOCUS HUMATPASA2 95 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 4, 3' end, clone lambda-NK-alpha-TW-4. ACCESSION M27579 KEYWORDS Na+, K+ -ATPase alpha-subunit. SEGMENT 2 of 2 SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-TW-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 95) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 / 75 Na+, K+ -ATPase alpha subunit, exon 4 (AA at 1) IVS 76 > 95 Na+, K+ -ATPase alpha subunit intron D BASE COUNT 27 a 20 c 21 g 27 t ORIGIN 1 gctctgtcca gaattgcagg tctttgtaac agggcagtgt ttcaggctaa ccaggaaaac 61 ctacctattc ttaaggtatg ctcaagagtt aacta // LOCUS HUMATPASEA 188 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 3 (5' end). clone lambda-NK-alpha-SW3.2. ACCESSION M27574 KEYWORDS Na+, K+ -ATPase alpha-subunit. SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-SW3.2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 188) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept / 21 > 188 Na+, K+ -ATPase alpha subunit, exon 3 (AA at 21) IVS < 1 20 Na+, K+ -ATPase alpha subunit intron B BASE COUNT 45 a 57 c 50 g 36 t ORIGIN 1 gtccttccct ctcctgctag gtgaccctgt cgctgacagc aaaacggatg gccaagaaga 61 actgcctggt gaagaacctg gaggctgtgg agacccctgg ctccacctcc atcatctgct 121 cggacaagac tgggacactg acccagaaca ggatgacagt ggcccatctg tggttcgaca 181 atcagatc // LOCUS HUMATPASEB 141 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 3 (partial), clone lambda-NK-alpha-R15-1. ACCESSION M27575 KEYWORDS Na+, K+ -ATPase alpha-subunit. SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R15-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 141) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 141 Na+, K+ -ATPase alpha subunit, exon 3 (AA at 1) BASE COUNT 32 a 41 c 41 g 27 t ORIGIN 1 ctgtccctga cagccaagcg cctggccagt aagaactgcg tggtcaagaa cctggaggcg 61 gtggagacat tgggctccac ttcggtgatc tgctcggaca agacagggac tctcactcag 121 aaccgcatga ctgtgtccca t // LOCUS HUMATPAB1 309 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 2, clone lambda-NK-alpha-R3-2. ACCESSION M27570 KEYWORDS Na+, K+ -ATPase alpha-subunit. SEGMENT 1 of 3 SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 309) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept / 21 + 289 Na+, K+ -ATPase alpha subunit, exon 2 (AA at 23) IVS < 1 20 Na+, K+ -ATPase alpha subunit intron A IVS 290 > 309 Na+, K+ -ATPase alpha subunit intron B BASE COUNT 45 a 103 c 89 g 72 t ORIGIN 1 ctaacccctc tggcctgcag gcacggctcg gggcgtggtg gtggccacgg gcgaccgcac 61 tgtcatgggc cgtatcgcca ccctggcatc agggctggag gtgggcaaga cgcccatcgc 121 catcgagatt gagcacttca tccagctcat caccggcgtg gctgtcttcc tgggtgtctc 181 cttcttcatc ctctccctca ttctcggata cacctggctt gaggctgtca tcttcctcat 241 cggcatcatc gtggccaatg tcccagaggg tctgctggcc actgtcactg taaggccagg 301 ctcctgggt // LOCUS HUMATPAB2 239 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 3, clone lambda-NK-alpha-R3-2. ACCESSION M27573 KEYWORDS Na+, K+ -ATPase alpha-subunit. SEGMENT 2 of 3 SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 239) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 219 Na+, K+ -ATPase alpha subunit, exon 3 (AA at 22) IVS < 1 20 Na+, K+ -ATPase alpha subunit intron B IVS 220 > 239 Na+, K+ -ATPase alpha subunit intron C BASE COUNT 54 a 77 c 68 g 40 t ORIGIN 1 ctgccttgct cgtcctccag gtgtgtctga ccgtgaccgc caagcgcatg gcccggaaga 61 actgcctggt gaagaacctg gaggctgtag agaccctggg ctccacgtcc accatctgct 121 cagataagac agggaccctc actcagaacc gcatgacagt cgcccacatg tggtttgaca 181 accagatcca cgaggctgac accactgagg accagtcagg tgagcgcagg ccccgggta // LOCUS HUMATPAB3 150 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human Na+, K+ -ATPase alpha-subunit gene, exon 4, clone lambda-NK-alpha-R3-2. ACCESSION M27577 KEYWORDS Na+, K+ -ATPase alpha-subunit. SEGMENT 3 of 3 SOURCE Human placenta and adult brain DNA, clone lambda-NK-alpha-R3-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 150) AUTHORS Sverdlov,E.D., Monastyrskaya,G.S., Broude,N.E., Ushkaryov,Y.A., Allikmets,R.L., Melkov,A.M., Smirnov,Y.V., Malyshev,I.V., Dulobova,I.E., Petrukhin,K.E., Grishin,A.V., Kijatkin,N.I., Kostina,M.B., Sverdlov,V.E., Modyanov,N.N. and Ovchnikov,Y.A. TITLE The family of human Na+ K+ -ATPase genes: No less than five genes and/or pseudogenes related to the alpha-subunit JOURNAL FEBS Lett. 217, 275-278 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 / 130 Na+, K+ -ATPase alpha subunit, exon 4 (AA at 22) IVS < 1 20 Na+, K+ -ATPase alpha subunit intron C IVS 131 > 150 Na+, K+ -ATPase alpha subunit intron D BASE COUNT 27 a 50 c 38 g 35 t ORIGIN 1 ctcacacatg cctcccccag ggacctcatt tgacaagagt tcgcacacct gggtggccct 61 gtctcacatc gctgggctct gcaatcgccc tgtcttcaag ggtggtcagg acaacatccc 121 tgtgctcaag gtgggttagc tactggcctc // LOCUS NGORRDA 1486 bp ss-rRNA RNA 30-JUN-1990 DEFINITION N.gonorrhoeae 16S ribosomal RNA. ACCESSION M34130 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE N.gonorrhoeae (strain 2013) ribosomal RNA. ORGANISM Neisseria gonorrhoeae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1486) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. University of Illinois Urbana, IL 61801 FEATURES from to/span description rRNA 1 1486 16S ribosomal RNA BASE COUNT 365 a 334 c 460 g 294 t 33 others ORIGIN 1 nnaacataag agtttgatcc tggctcagat tgaacgctgg cggcatgctt tacacatgca 61 agtcggacgg cagcacaggg aagcttgctt ctcgggtggc gagtggcgaa cgggtgagta 121 acatatcgga acgtaccggg tagcggggga taactgatcg aaagatcagc taataccgca 181 tacgtcttga gagggaaagc aggggacctn cgggccntgc gctatcngag cggccgatat 241 ctgattagct ngtnggcggg gtaaaggccc accaaggcga cgatcagtag cgggtctgag 301 aggatgatcc gcnacactgg gactgagaca cggccnagac tcctacggga gncagcagtg 361 ggnnattttg gacaatgggc gcnagcctga tccagccatg cngcgtgtct gaagaaggcc 421 ttcgggttgt aaaggacttt tgtcagggaa gaaaaggccg ttgccaatat cggcggccga 481 tgacggtacc tgaagaataa gcaccggcta actacgtgcn agcagccncg nnnatacgta 541 gggtgcgagc gttaatcgga attactgggc gtaaagcggg cgcagacggt tacttaagca 601 ggatgtgaaa tcncngggct caacccggna actgcgttct gaactgggtg actcgagtgt 661 gtcagaggga ggtggaattc cacgtgtagc agtgaaatgc gtagagatgt ggaggaatac 721 cnatggcgaa ggcagcctcc tgggataaca ctgacgttca tgtccgaaag cgtgggtagc 781 aaacaggatt agataccctg gtagtccacg ccctaaacga tgtcaattag ctgttgggca 841 actnnattgc ttggtagcgt agctaacgcg tgaaattgac cgcctgggga gtacggtcgc 901 aagattaaaa ctcaaaggaa ttgacggggn accgcacaag cggtggatga tgtggattaa 961 ttcgatnnaa cgcgaagaac cttacctggt tttgacatgt gcggaatcct ccggagacgg 1021 agnagtgcct tcgggagccg taacacaggt gctgcatggc tgtcgtcagc tcgtgtcgtg 1081 agatgttggg ttaagtcccg caacgagcgc aacccttgtc attagttgcc atcattcggt 1141 tgggcactct aatgagactg ccggtgacaa gccggaggaa ggtggggatg acgtcaagtc 1201 ctcatggccc ttatgaccag ggcttcacac gtcatacaat ggtcggtaca gagggtagcc 1261 aagcgcgagc ggagccaatc tcacaaaacc gatcgtagtc cggattgcac tctgcaactc 1321 gagtgcatga agtcggaatc gctagtaatc gcaggtcagc atactgcggt gaatacgttc 1381 ccngnncttg tacacaccgc ccgtcacacc atgggagtgg ggnataccag aagtaggtag 1441 ggtaaccgca aggagtccgc ttaccacggt atgcttcatg actggg // LOCUS SPLRRDA 1492 bp ss-rRNA RNA 30-JUN-1990 DEFINITION S.volutans 16S ribosomal RNA. ACCESSION M34131 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE S.volutans (strain ATCC 19554) ribosomal RNA. ORGANISM Spirillum volutans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1492) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. University of Illinois Urbana, IL 61801 FEATURES from to/span description rRNA 1 1492 16S ribosomal RNA BASE COUNT 389 a 327 c 446 g 299 t 31 others ORIGIN 1 tcaaatgaag agtttgatcc tggctcagat tgaacgctgn cngcatgcct tacacatgca 61 agtcgaacgg taacagggac ttcggtccgc tgacgagtgg cgaacgggtg agtaaagcat 121 aggaatacat ccattagtgg gggataacgc atcgaaaggt gtgctaatac cgcatacgct 181 ctaaggagga aagcanggga tccnagacct tgcgctaatg ggctagccta tgtccgatta 241 gctagttggt agagtaaaag cctaccaagg cgacgatcgg taactggtct gagaggatga 301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaatt 361 ttggacaatg ggggaaaccc tgatccagcc atgccgcgtg ggtgaagaag gccttcgggt 421 tgtaaagccc tttcgggagg aaagaaaagc attgctctaa catagcaatg tgatgacggt 481 acctccataa gaagcaccgg ctaactacgt gccagcngcc gcggtaatac gtagggtgcn 541 agcgttaatc ggaattactg ggcgtaaagc gtgcgcaggc ggctagataa gccagatgtg 601 aaatccccga gctcaacttg ggtatcgcgc ttggaactgt ctagcnagag tttggcngag 661 gggggtagaa ttccacgtgt agcagtgaaa tgcgtagaga tgtggaggaa taccgatggc 721 gaaggnngcc ccctgggcta aaactgacgc tcatgcncga aagcgtgggg agcaaacagg 781 attagatacc ctggtagtcc acgccctaaa cgatgtcaac tggttgttgg gagggtaaac 841 ctcttagtaa cgaagctaac gcgagaagtt gactgcctgg ggagtacggt cgcaagatta 901 aaactcaaag gaattgacgg gganccgcac aagcggtgga ttatgtggat taattcgatg 961 caacgcgaaa aaccttacct acctttgaca tgtcgagaat cctgcagaga tgcgggngtg 1021 ccttcgggaa ctcgaacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt 1081 gggttaagtc ccgcaacgag cgcaaccctt atcattagtn gctacgcaag ggcactctaa 1141 tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtcaagtcct catggccctt 1201 atgggtaggg cttcacacgt aatacaatgg tcggtacaga gggttgccaa cccgtgaggg 1261 ggagctaatc tcagaaagcc gatcgtagtc cggattgcag tctgcaactc gactgcatga 1321 agtcggaatc gctagtaatc gcggatcagc atgcngcggt gaatacgttc ccgggtcttg 1381 tacacaccgc ccgtcacacc atgggagtgg gtttcaccag aagtagttag cctaaccgcn 1441 aggagggcga ttaccacggt gngattcatg nnnnnnnnnn nnnngtaaca ag // LOCUS LOAREPDNA 839 bp ds-DNA INV 30-JUN-1990 DEFINITION Loa loa repetitive DNA sequence. ACCESSION M34259 KEYWORDS repetitive DNA. SOURCE Loa loa DNA. ORGANISM Loa loa Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Orthorrhapha; Tabanoidea; Tabanidae. REFERENCE 1 (bases 1 to 839) AUTHORS Klion,A.D., Raghavan,N., Brindley,P.J. and Nutman,T.B. TITLE Cloning and characterization of a species-specific repetitive DNA sequence from Loa loa JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.D.Klion, 11-MAY-1990. Author address: A.D.Klion NIAID/NIH, Lab Parasitic Diseases Bldg 4, Rm 126 9000 Rockville Pike Bethesda, MD 20892 BASE COUNT 207 a 130 c 199 g 303 t ORIGIN 1 gatcaaatac ctgatacgct cgtataaccg tgtcatccgg atatttatta tattaattat 61 taaagtgtta atcaaacatg agcaaaactt tttcgacgac gcaaattaca ttcctcgttt 121 tagtgcttta acccgaatgt aatataacat aacaaccatt tgcttaggct taggtttttc 181 gggtttatgc ttaggcttag gtttttcggg tttatgctta ggcttaggct taggttcttc 241 gggtttatgc ttaggcttag gcttaggttt ctcgggttta tgcttaggtt catgcttagg 301 cttaggctta ggcttgggct taggcttagg cttaggctta ggtttttcgg gtttatgctt 361 aggcttaggc ttaggcttgg gcttaggctt aggcttaggc ttaggttttt cgggtttatg 421 cttaggctta ggcttaggct taggcttggg cttaggctta ggcttaggct taggcttagg 481 cttaggtttt tcgggtttat gcttaggttt atgcttaggg tttatgctta gttttttttt 541 gaacactgtt cgataaccat ataagtatca taaatgtaaa catgtaaaat ttcagcattt 601 ataacaagaa gcaccaaaaa acaccgatgg atgaagcaaa agcggacgat gagcatgcca 661 cagtgaggag ctgttaagcg acttcgtgct gctacaaatt tgcattatgc aatggattac 721 agtaatccat tgttggatac agtgttgttt tctgatttgt agtaatccat cacagtaatt 781 cagcaaagtg aagtgtactc tgttgaaagc gtaagtattg cattggtgat attatgatc // LOCUS CFXRRDA 1486 bp ss-rRNA BCT 30-JUN-1990 DEFINITION C.aurantiacus 16S ribosomal RNA. ACCESSION M34116 KEYWORDS 16S ribosomal RNA. SOURCE C.aurantiacus (strain J10FL) ribosomal RNA. ORGANISM Chloroflexus aurantiacus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Multicellular filamentous green bacteria; Chloroflexaceae. REFERENCE 1 (bases 1 to 1486) AUTHORS Oyaizu,H., Debrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and Woese,C.R. TITLE The green non-sulfur bacteria: A deep branching in the eubacterial line of descent JOURNAL Syst. Appl. Microbiol. 9, 47-53 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 15-MAY-1990. FEATURES from to/span description rRNA 1 1486 16S ribosomal RNA BASE COUNT 301 a 394 c 495 g 224 t 72 others ORIGIN 1 acaatgnaga gtttgatcct ggctcaggac gaacgctggc ggcgtgccta atgcatgcaa 61 gtcgaacgca gcaggcgtgc ctggctgcgt ggcgaacggc tgaggaacac gtgggtgacc 121 tgccccggag tgggggatac cccgtcgaaa gacgggacaa tcccgcatac gctcgacgga 181 ggaaagccga aggcgctctg nnaggggccn gcggcccatc aggtagttgg tgtggtaacg 241 gcgcaccaag ccaatgacgg gtacccggtc tgagaggacg accgggcaga ctgggactga 301 gacacggccc agactcctac gggaggcagc agcaaggaat tttccccaat gggcgcaagc 361 ctgagggagc aacgccgcgt ggaggacgac ggccttcggg ttgtaaactc ctttcgggtg 421 ggacgatgct gacggtacca ccagaagcag ccccggctaa ctctgtgcca gcagccgcgg 481 taagacagag ggggcnagcg ttgtccggag ttactgggcg taaagggcgc gcaggcggtg 541 ggctgcgtcg gcgctgaaag cgccccgctt aacggggcga ggcgcgccga tacgagtcca 601 ctcgaggcaa gcagagggtg gcggaattcc gggtggagcg gtgaaatgcg tagagatccg 661 gaggaacgcc agtggggaag ccggccacct gggcttgacc tgacgctgag gcgcgacagc 721 gtggggagca aaccggatta gatacccggg tagtccacgc cgtaaacgat gccggctcgg 781 cgtctggcgc acgttggtgt gctgggtgcc ttagctcacg cggtaagccg gccgcctggg 841 gannnnnnnn nnnngcttaa aactcaaagg aattgnnnnn nnnnngcaca agnnnnnnnn 901 nnnnnngttt aattcgacgc aacacgaaga accttacccg gacttgacat ggcgctgcat 961 cccctggaaa cagggncgcc tgncgaggcg caccacaggt gctgcatggc tgtcgtcagc 1021 tcgtgtcgtg agatgttggg ttcagtcccg caacgagcgc aacccgtgtc ggtagttaca 1081 ggtgtctacc gagactgccg ntgaccggcg gaggaaggcg cggatgacgt caagtcagca 1141 tggcccttac gtccggggcg acacacacgc tacaatggcc acgacaatgc gttgccaagc 1201 cgcaaggtgg agctaatcgc ctaaacgtgg tctcagtgca gantcggggc tgcaactcgc 1261 ccccgtgaag gcggagttgc tagtaaccgc gtatcagcca tggcgcggtg aatacgttcc 1321 cgggccttgt acacaccgcc cgtcacgtca tgggagtggc caatgcttga agtccgtgtg 1381 ctaaccccag tcggggaggc agcggccgag ggcaggggcc gcgacnnnnn nnnnnnngta 1441 acaagnnnnn ngtaccggaa gnnnnnnnnn gatcacctcc tttcta // LOCUS DVURRDA 1551 bp ds-DNA BCT 30-JUN-1990 DEFINITION D.desulfuricans 16S ribosomal RNA gene. ACCESSION M34113 KEYWORDS 16S ribosomal RNA. SOURCE D.desulfuricans (ATCC 27774) DNA. ORGANISM Desulfovibrio desulfuricans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1551) AUTHORS Oyaizu,H. and Woese,C.R. TITLE Phylogenetic relationships among the sulfate respiring bacteria, myxobacteria and purple bacteria JOURNAL Syst. Appl. Microbiol. 6, 257-263 (1985) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. FEATURES from to/span description rRNA 1 1551 16S ribosomal RNA BASE COUNT 387 a 364 c 488 g 311 t 1 others ORIGIN 1 tgaactggag agtntgattc tggctcagat tgaacgctgg cggcgtgctt aacacatgca 61 agtcgaacgc gaaagggact tcggtcctga gtaaagtggc gcacgggtga gtaacgcgtg 121 gataatctgc ccttatgatc gggataacag ttggaaacgg ctgctaatac cggatacgct 181 caaaatgaac tttttgagga aagatggcct ctgcttgcat gctatcacgt aaggatgagt 241 ccgcgtccca ttagcttgtt ggcggggtaa cggcccacca aggcatcgat gggtagccga 301 tttgagagga tgatcggcca cactggaact gaaacacggt ccagactcct acgggaggca 361 gcagtgggga atattgcgca atgggcgaaa gcctgacgca gcgacgccgc gtgagggatg 421 aaggttttcg gatcgtaaac ctctgtcaga agggaagaaa ctacgttgtg ctaatcagca 481 gcgtactgac ggtaccttca aaggaagcac cggctaactc cgtgccagca gccgcggtaa 541 tacggagggt gcaagcgtta atcggaatta ctgggcgtaa agcgcacgta ggctgtagtg 601 taagtcaggg gtgaaatccc acggctcaac cgtggaactg cctttgatac tgcacaactt 661 gaatccggga gagggtggcg gaattccagg tgtaggagtg aaatccgtag atatctggag 721 gaacatcagt ggcgaaggcg gccacctgga ccggtattga cgctgaggtg cgaaagcgtg 781 gggagcaaac aggattagat accctggtag tccacgctgt aaacgatgga tgctagatgt 841 cggggagtat tcttcggtgt cgtagttaac gcgttaagca tcccgcctgg ggagtacggt 901 cgcaaggctg aaactcaaag aaattgacgg gggcccgcac aagcggtgga gtatgtggtt 961 taattcgatg caacgcgaag aaccttacct aggtttgaca tccacggaac cctcccgaaa 1021 aggaggggtg cccttcgggg agccgtgaga caggtgctgc atggctgtcg tcagctcgtg 1081 tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ctatggatag ttgccagcaa 1141 gtaatgttgg gcactctatt cagactgccc gggttaaccg ggaggaaggt ggggacgacg 1201 tcaagtcatc atggccctta cgcctagggc tacacacgta ctacaatggc gcgcacaaag 1261 gggagcgaga ccgcgaggtg gagccaatcc caaaaaacgc gtcccagtcc ggattgcagt 1321 ctgcaactcg actgcatgaa gttggaatcg ctagtaattc gagatcagca tgctcgggtg 1381 aatgcgttcc cgggccttgt acacaccgcc cgtcacacca cgaaagtcgg ttttacccga 1441 agccggtgag ccaaccagca atggaggcag ccgtctacgg tagggccgat gattggggtg 1501 aagtcgtaac aaggtagccg taggggaacc tgcggctgga tcacctcctt t // LOCUS HPTRRDA 1484 bp ss-rRNA BCT 30-JUN-1990 DEFINITION H.aurantiacus 16S ribosomal RNA. ACCESSION M34117 KEYWORDS 16S ribosomal RNA. SOURCE H.aurantiacus (ATCC 23779) ribosomal RNA. ORGANISM Herpetosiphon aurantiacus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Cytophagaceae. REFERENCE 1 (bases 1 to 1484) AUTHORS Oyaizu,H., Debrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and Woese,C.R. TITLE The green non-sulfur bacteria: A deep branching in the eubacterial line of descent JOURNAL Syst. Appl. Microbiol. 9, 47-53 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 15-MAY-1990. FEATURES from to/span description rRNA 1 1484 16S ribosomal RNA BASE COUNT 331 a 359 c 465 g 265 t 64 others ORIGIN 1 nttgtcggcg agtttgatcc tggctcagga cgaacgctgg cggcgtgcct aatgcatgca 61 agtcgaacgc attcttcgga aggagtggcg cacggctgag gaacacgtga ctaacctacc 121 ccggtgtggg ggataacggg tcgaaagact cgctaatccc gccatacgat ngcctcggcg 181 gaggaaagcc gtaaggcgca ctggnngggg gtcgcgtccc attagatagt tggtgtggta 241 atggcgcacc aagtcgatga tgggtctctg gtctgagagg ncgaccagac agattgggac 301 tgagacacgg cccaaactcc tacggggngc agcagcaagg aattttcggc aatgggcgca 361 agcctgaccg agcaacgccg cgtggaggat gacggctctt gggttgtaaa ctccttttgg 421 gggggacgat aatgacggta ccctccgaat caggcccggc taactacgtg ccagcagccg 481 cggtaatacg taggggccaa gcgttgtccg gaattactgg gcgtaaagcg tgnntaggcn 541 gtcgatgatg tgccgcgtga aagcgccgga gtaatgccgg ccaggtcgcg gcagacacgt 601 tgactagagg ctcgcagagg aacgtggaat tcccggtgta gtggtgaaat gcgtagatat 661 cgggaggaac accagtggcg caagcggcgt tctgggcgag acctgacact gagccacgac 721 ggcgtgggga gcaaacagga ttagataccc tggtagtcca cgcagtaaac gatgcatacc 781 aggtgtggga tggcgttcgc gtcgttccgt gccgcagctt acgcgatgag tatgccgcct 841 ggggactacg agcgcaagct taaaactcaa aggaattgnn nnnnnnnngc acaagnnnnn 901 nnnnnnnnng tttaattcga cgcaacgcga agaaccttac ctagtcttga catagcactg 961 caagcttcgg aaatgaagtt cgccttcgag ggtgtgctac aggtgctgca tggctgtcgt 1021 cagctcgtgt cgtgagatgt tgggttaagt cccgcaacga gcgcaacccc tgtgaggtgt 1081 tacaagtgtc acctcagact gccgttgtca acaacggagg aaggcgggga tgacgtcaag 1141 tccgcatggc ccttacgact agggcgacac acacgctaca atggctggga gaatgcgccg 1201 cgacctggca acaggctgcg aatcgagaac accagtcaca gttcagattg ggggctgcaa 1261 ctcgccccca tgaaggcgga gttgctagta atcgccggtc agcatacggc ggtgaatcag 1321 tacccgggcc ttgtacacac cgcccgtcac gtcatggaag tgggaaacac ctgaagtccg 1381 tggctaaccg caaggaggca gcggccgagg gtgggtctcg taactgnnnn nnnnnngtaa 1441 caagnnnnnn nnnnnngaag nnnnnnnnng atcacctcct ttct // LOCUS MXARRDB 1540 bp ds-DNA BCT 30-JUN-1990 DEFINITION M.xanthus 16S ribosomal RNA gene. ACCESSION M34114 KEYWORDS 16S ribosomal RNA. SOURCE M.xanthus (strain MD207) DNA. ORGANISM Myxococcus xanthus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; Myxococcaceae. REFERENCE 1 (bases 1 to 1540) AUTHORS Oyaizu,H. and Woese,C.R. TITLE Phylogenetic relationships among the sulfate respiring bacteria, myxobacteria and purple bacteria JOURNAL Syst. Appl. Microbiol. 6, 257-263 (1985) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. FEATURES from to/span description rRNA 1 1540 16S ribosomal RNA BASE COUNT 384 a 372 c 488 g 295 t 1 others ORIGIN 1 caattggaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa 61 gtcgagcgcg aataggggca acccttagta gagcggcgca cgggtgcgta acacgtggat 121 aatctgcctg agtgctcggg ataaccagtc gaaagattgg ctaataccgg ataagcccac 181 ggtttcttcg gagactgagg gaaaaggtgg cctctgtata caagctatca cattcagatg 241 agtccgcggc ccatcagcta gttggcgggg taatggccca ccaaggcaac gacgggtagc 301 tggtctgaga ggacgatcag ccacactgga actgagacac ggtccagact cctacgggag 361 gcagcagtgg ggaattttgc gcaatgggcg aaagcctgac gcagcaacgc cgcgtgtgtg 421 atgaaggtct ttggattgta aagcactttc gaccgggaag aaaacccgtt ggctaacatc 481 caacggcttg acggtaccgg gagaagaagc accggctaac tctgtgccag cagccgcggt 541 aatacagagg gtgcaagcgt tgttcggaat tattgggcgt aaagcgcgtg taggcggcgt 601 gacaagtcgg gtgtgaaagc cctcagctca actgaggaag tgcgcccgaa actgttgtgc 661 ttgagtgccg gagagggtgg cggaattccc caagtagagg tgaaattcgt agatatgggg 721 aggaacaccg gtggcgaagg cggccacctg gacggtaact gacgctgaga cgcgaaagcg 781 tggggagcaa acaggattag ataccctggt agtccacgcc gtaaacgatg agaactaggt 841 gtcgtgggag ttgacccccg cggtgccgaa gctaacgcat taagttctcc gcctgggaag 901 tacggtcgca agactaaaac tcaaaggaat tgacgggggn ccgcacaagc ggtggagcat 961 gtggtttaat tcgacgcaac gcgcagaacc ttacctggtc ttgacatcct cagaatcctt 1021 cagagatgag ggagtgcccg caagggaact gagagacagg tgctgcatgg ctgtcgtcag 1081 ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caaccctcgc ctttagttgc 1141 cacgcaagtg gatctctaga gggactgccg gtgttaaacc ggaggaaggt ggggatgacg 1201 tcaagtcctc atggccttta tgaccagggc tacacacgtg ctacaatggc cggtacagag 1261 cgttgccaac ccgcgagggg gagctaatcg cataaaaccg gtctcagttc agattggagt 1321 ctgcaactcg actccatgaa ggaggaatcg ctagtaatcg cagatcagca cgctgcggtg 1381 aatacgttcc cgggccttgt acacaccgcc cgtcacacca tgggagtcga ttgctccaga 1441 aatcatctca ccaagaggtg ctcaaggagt ggtcggtaac tggggtgaag tcgtaacaag 1501 gtagccgtag gggaacctgc ggctggatca cctcctttct // LOCUS RBLRRDA 1491 bp ss-rRNA BCT 30-JUN-1990 DEFINITION R.acidophila 16S ribosomal RNA. ACCESSION M34128 KEYWORDS 16S ribosomal RNA. SOURCE R.acidophila (strain 7050) ribosomal RNA. ORGANISM Rhodopseudomonas acidophila Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1491) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese Microbiology Department, 131 Burrill Hall University of Illinois 407 S. Goodwill Ave. Urbana, IL 61801 E-mail: ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1491 16S ribosomal RNA BASE COUNT 364 a 341 c 449 g 300 t 37 others ORIGIN 1 tacaacttga gagtttgatc ctggctcaga acgaacgctg gcggcaggct taacacatgc 61 aagtcgaacg ggcatagcaa tatgtcagtg gcagacgggt gaggaacacg tgggaacatg 121 ccctttggtt cggaacaact cagggaaact tgagctaata ccggataagc ctttcgaggg 181 aaagatttat cgccgaagga gtggcccgcg tccgattagc tagttggtga ggtaatggct 241 caccaaggcg acgatcggna gctngtctga gaggatgatc agccacattg ggactgagac 301 acggcccaaa ctcctacggg aggcagcagt ggggaatatt ggacaatggg cgcaagccng 361 atccagccat gccgcgtgag tgatgacggc cctagggttg taaagctctt tcacccacga 421 cgataatgac ggtagtgggc gaagaagccc cggctaactt cgtgccagca gccgcggtaa 481 tacgaagggg gcnagcgttg ttcggattta ctgggcgtaa agcgcacgta ggcngatctt 541 taagtcaggg gtgaaatgcc ggagctcaac ttcggaactg cctttgatac tggagatctt 601 gagttcgaga gaggtgagtg gaactgcgag tgtagaggtg aaattcgtag atattcgcaa 661 gaacaccagt ggcgaaggcg gctcactggc tcgatactga cgctgaggtg cnnaagcgtg 721 gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgga tgctagccgt 781 taggcagctt gctgcttagt ggcgcagcta acgctttaag catcccgcct ggggagtacg 841 gtcgcaagat taaaactcaa aggaattgac gggggcncgc acaagcggtg gagcatgtgg 901 tttaattcga agcaacgcgc agaaccttac cagcttttga catgtcttgg acggatagca 961 gagatgttat cttctcttcg gagccgagaa cacaggtgct gcatggctgt cgtcngctcg 1021 tgtcgtgaga tgttgggtta agtcccgcaa cgagcgcaac ccacgtcctt agttgccatc 1081 attaagttgg gcactctagg gagactgccg gtgataagcc gagaggaagg tgtggatgac 1141 gtcaagtcct catggccctt acaggctggg ctacacacgt gctacaatgg cggtgacaat 1201 gggaagcgaa agggcgacct ctagcaaatc tcaaaaaacc gtctcagttc ggattgtact 1261 ctgcaactcg agtgcatgaa ggtggaatcg ctagtaatcg cagatcagaa cgctgcggtg 1321 aatacgttcc cgggccttgt acacaccgcc cgtcacacca tgggagttgg ctttacccga 1381 aggcgtttcg ctaaccgcaa ggaggcaggc gaccacggta gggtcagcgn ctnggnnnnn 1441 nnngtaacaa gnnnnnnnnn nnngaacctg nnnnnngatc acctcctttc t // LOCUS RCARRDA 1470 bp ss-rRNA BCT 30-JUN-1990 DEFINITION R.capsulatus 16S ribosomal RNA. ACCESSION M34129 KEYWORDS 16S ribosomal RNA. SOURCE R.capsulatus (strain B10, ATCC 33303) ribosomal RNA. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1470) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese Microbiology Department, 131 Burrill Hall University of Illinois 407 S. Goodwill Ave. Urbana, IL 61801 E-mail: ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1470 16S ribosomal RNA BASE COUNT 343 a 330 c 423 g 275 t 99 others ORIGIN 1 ttcaacttga gagtttgatc ctggctcaga acgaacgctg gcggcaggcc taacacatgc 61 aagtcgagcg agaccttcgg gtctagcggc ggacgggtga gtaacgcgtg ggaacgtgcc 121 ctttgctacg gaatagcccc gggaaactgg gagtaatacc gtatgtgccc ttcggggnaa 181 agatttatcg gcaaaggatc ggcccgcgtt ggattaggta gttggtgggg taatggccta 241 ccaagccgac gatccatagc tggtttgaga ggatgatcag ccacactggg actgagacac 301 ggcccagact cctacgggag gcagcagtgg ggaatcttag acaatggggg aaaccctgat 361 ctagccatgc cgcgtgagcg atgaaggcct tagggttgta aagctctttc aggtgggaag 421 ataatgacgg taccaccaga agaagccccg gctaactccg nnnnnnnngc cgcggtaata 481 cgnnnnnnnn nnnnnnnnnn nngaattact gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 541 nnnnngnggt gaaatcccag ggctcaacct tggaactgcc tttgaaactc ctggtcttga 601 ggncgagaga ggtgagtgga attccgagtg tagaggtgaa attcgtagat attcggagga 661 acaccagtgg cgaaggcggc tcactggctc gatactgacg ctgaggtgcg aaagcgtggg 721 gagcaaacag gattagatac cctggtagtc cacgccgtaa acgatgaatg ccagtcgtcg 781 gcaggcatgc ctgtcggtga cacactaacg gattaagcat tccgcctggg gagaacggtc 841 gcaagattaa aactcaaagg aattgacggg ggcccgcaca agcggtggag catgtggttt 901 aattcgaagc aacgcgcaga accttaccaa cccttgacat cgagatcgcg gttaccagag 961 atggtttcct tcagttcggc tggatcttag acaggtgctg catggctgtc gtcagctcgt 1021 gtcgtgagat gttgggttaa gtccggcaac gagcgcaacc cacactttca gttgccatca 1081 ttcagttggg cactctggaa gaactgccga tgataagtcg gaggaaggtg tggaggacgt 1141 caagtcctca tggcccttac gggttgggct acacacgtgc tacaatggtg gtgacaatgg 1201 gccaatccca aaaagccatc tcagttcgga ttggggtctg caactcgacc ccatgaagtn 1261 ggaatcgcta gtaatcgcgt aacagcaaga cgcggtgaat acgttcccgg gccttgtaca 1321 caccgcccgt cacaccatgg gaattgggtc taccctaaga aggtgcgcca accagcaang 1381 gaggcagcna gccacggtag gcacaggnng gnnnnnnnnn nngtaacaag nnnnnnnnnn 1441 nngaacctgn nnnnngatca cctcctttct // LOCUS RMVRRDA 1484 bp ss-rRNA BCT 30-JUN-1990 DEFINITION R.vannielii 16S ribosomal RNA. ACCESSION M34127 KEYWORDS 16S ribosomal RNA. SOURCE R.vannielii (strain EY33) ribosomal RNA. ORGANISM Rhodomicrobium vannielii Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1484) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese Microbiology Department, 131 Burrill Hall University of Illinois 407 S. Goodwill Ave. Urbana, IL 61801 E-mail: ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1484 16S ribosomal RNA BASE COUNT 343 a 343 c 433 g 292 t 73 others ORIGIN 1 caaaatctga gagtttgatc ctggctcaga acgaacgctg gcggcaggct taacacatgc 61 aagtcgaacg caccgcaagg tgagtggcag acgggtgagt aacgcgtggg aaccttccct 121 ttggtacgga ataacttcgg gaaaccgaag ctaataccgt atatctcctc cgggagaaag 181 atttatcgcc aaaggatggg cccgcgttgg attagctagt tggtgtggta acggcgcacc 241 aaggcgacga tccatagctg gtctgagagg atgatcagcc acactgggac tgagacacgg 301 cccagactcc tacgggaggc ngcngtgggg aatcttggac aatgggggca accctgatcc 361 agccatgccg cgtgagtgaa gaaggcctta gggttgtaaa gctcttttgg cggggacgat 421 aatgacggta cccgcagaat aagccccggc taacttcgtg ccagcagccg cggtaatacg 481 aagggggcna gcgttgttcg gaatcactgg gcgtaaagcg tacgcaggcg gattgataag 541 tcaggggtga aatcccgggg ctcaacctcg gaattgcctt tgatactgtc tgtctcgagt 601 tcgggagagg ttggcggaat tcctagtgta gaggtgaaat tcgtagatat taggaagaac 661 accagtggcg aaggnngcca actggcccga tactgacgct catgtacgaa agcgtgggga 721 gcaaacagga ttagataccc tggtagtcca cgctgtaaac tatggatgct agccgttggg 781 gagcttgctc ttcagtggcg cagctaacgt cttaagcatc ccgcctgggg agtacggtcg 841 caagattaaa actcaaagga attgacgggg gcccgcacaa gcggtggagc atgtggttta 901 attcgaggca acgcgaagaa ccttaccagc tcttgacatg tcgtgctacg tggagagatt 961 cacggttccc ttcggggacg cgaacacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt 1021 gagatgttgg gttaagtccc gcaacgagcg caaccctcgc ccttagttgc taccatttag 1081 ttgagcactc taaggnnacc gccggtgata agccggagga aggtggggat gacgtcaagt 1141 catcatggcc cttacgggct gggctacaca cgtgctacaa tggcggtgac agtgggcagc 1201 gacacagcga tgtgatgcta atcccaaaaa gccgtctcag ttcagattgc actctgcaac 1261 tcgagtgcat gaagtcggaa tcgctagtaa tcgcggatca gcatgccgcg gtgaatacgt 1321 tcccgggcct tgtacacacc gcccgtcaca ccatgggagt tggttttacc cgaaggcgtn 1381 angctaaccg nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngtaa 1441 caagnnnnnn nnnnnngaac ctgnnnnnng atcacctcct ttct // LOCUS THRRRDA 1528 bp ds-DNA BCT 30-JUN-1990 DEFINITION T.roseum 16S ribosomal RNA gene. ACCESSION M34115 KEYWORDS 16S ribosomal RNA. SOURCE T.roseum (strain ATCC 27502) DNA. ORGANISM Thermomicrobium roseum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1528) AUTHORS Oyaizu,H., Debrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and Woese,C.R. TITLE The green non-sulfur bacteria: A deep branching in the eubacterial line of descent JOURNAL Syst. Appl. Microbiol. 9, 47-53 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 15-MAY-1990. FEATURES from to/span description rRNA 1 1528 16S ribosomal RNA BASE COUNT 295 a 416 c 585 g 226 t 6 others ORIGIN 1 ggggatggag agtttgatcc tggctcaggg ggaacgctgg cggcgtgcct aatgcatgca 61 agtcggacgg gangcacgcn ctctggcgtg ccgaccgtgg cggacgggtg cgtaacacgt 121 ggggaaccct cccgggtgcg ggggataacc cggggaaact cgggctaata ccccatacgc 181 ttggctggcg gtgggccggc caaggaaagg cgcaggcgac tgtgctgtgc tcggagggcc 241 ctgcggccta tcagctagac ggtagggtaa cggcctaccg tggcgatgac gggtagctgg 301 tctgagagga tggccagcca cacgggcact gagacacggg cccgactcct acgggaggca 361 gcagcaggga atcttccgca atgggggcaa ccctgacgga gcgacgccgc gtgcgggagg 421 aagcccttcg gggtgtaaac cgctgttcgg ggggacgatc gagcgacggt accctcggag 481 caagtcccgg ctaactacgt gccagcagcc gcggtaagac gtagggggcg agcgttaccc 541 ggagtcactg ggcgtaaagg gcgtgtaggc ggctgggtac gccgcgtgtg aaagtccccg 601 gctcaaccgg ggagggtcgc gcgggacggc ctggctcgag ggcgggagag gcgggtggaa 661 ttcccggtgt agcggtgaaa tgcgtagaga tcgggaggaa cgccggtggc gaaggcggcc 721 cgctggcccg tacctgacgc tgaggcgcga aggcgtgggg agcgaaccgg attagatacc 781 cgggtagtcc acgcagtaaa cgatgcgggc gaggtgtggg tggtgttgac cccatccgtg 841 ccggcgccaa cgcagtaagc ccgccgcctg gggagtacgg ccgcaaggct aaaactcaaa 901 ggaattgacg ggggcccgca caagcagcgg agcgtgtggt ttaattcgac gcaacgcgaa 961 gaaccttacc agggcttgac atgccaccga acctggctga aaggctgggg tgcccggtga 1021 gggagcggtg gcacaggtgc tgcatggctg tcgtcagctc gtgccgtgag gtgttgggtt 1081 aagtcccgca acgagcgcaa ccctcggggt cagttacgcg ggtgtctgac ccgactgccg 1141 gggaaagccc ggaggaagga ggggatgacg tcaagtcagc atggccctga cgccctgggc 1201 gacacacacg ctacagtgac cgggacagtg ggcagcgaag gggcgacctg gagccaatcc 1261 cgcaaacccg gtcgtggtgg ggatcgcagg ctgcaacccg cctgcgtgaa cgcggagttg 1321 ctagtaaccg ccggtcagcc atacggcggt gaatacgttc ccgggccttg tacacaccgc 1381 ccgtcacgtc acgaaagctg gcttcacctg aagctggtgg gccaaccggc gcacgctggg 1441 ggcagccgtc gagggtgggg ctggtgattg ggacgaagtc gtaacaaggt agccgtaccg 1501 gaaggnnnng gtggatcacc tcctttct // LOCUS RATTATFL 11973 bp ds-DNA ROD 30-JUN-1990 DEFINITION Rat tyrosine aminotransferase (TAT) gene, 5' flank. ACCESSION M34257 KEYWORDS tyrosine aminotransferase. SOURCE Rat (strain Wistar) adult male liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 8314 to 8413) AUTHORS Boshart,M., Weih,F., Schmidt,A., Fournier,R.E.K. and Schuetz,G. TITLE A cyclic AMP response element mediates repression of tyrosine aminotransferase gene transcription by the tissue-specific extinguisher locus Tse-1 JOURNAL Cell 61, 905-916 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 11973) AUTHORS Labeit,S., Strahle,U. and Weih,F. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by M.Boshart, 11-MAY-1990. BASE COUNT 3162 a 2618 c 2675 g 3518 t ORIGIN 1 gttcatgcat gtacaagtac atgtgaatgt gtgtgaacaa cttcagatgt cattacttgg 61 gccataatta ctgttgttga tgttgatagg gcctatcttc gcctgggggt ggccagtgag 121 gctggctaac tgctcagtca atccagacat ttgctcatct ctccgttccc aatgctggca 181 ttactacccg tgcctcccac accaccatgc ctggtatttt ttttttttcc tttttgtctt 241 tttaaatctg ggctgtaagg ttgaactcag gtccttgtgc atgcaagtca agtgttttac 301 agactgagct atctctccta gtcctgtgct ggaaattccc tcagacactt ctacatcaga 361 ttcaaatcct ttctctaggt gagagatgcc attcatttgc caacaaggcg gaatgggtga 421 gcttgtccaa aataccacgt ttaatcaggt tgtctgtttt ggttggctta tttgacactt 481 acaagtcaac attgcaacat tattatatat ttgtcttctg attttaaaaa tgatgggtga 541 tgcgcttgtg ttttaacctt ttctaggaga taaacgtcta cctgttgaat taacagactt 601 actttttaaa catcctgtcg tacatgacac ttttttttaa acatttattc atttattata 661 tataagtaca ctgtagctgc tcttcagaca caccagaaga gggcatcgga tctctttaca 721 gatggttgtg agccaccacg tggttgctgg gaattgaact caggacctct ggaagagtag 781 tcgggtgctc ttaaccgctg agccatttct ccagcccgac actttttaat actatataaa 841 acaagttaaa taaatacgat cgttgccctt agaaatgcac aaacacgaaa tgggacagac 901 ttttcacttt ctttcatgcg tagcacttcc acgaaaccac aagttaggaa caatgatcaa 961 atatttacag gacccctaag catcacagtt aataattttc atgtggggac gcacctatat 1021 gtgattggca gttgaatcaa caaagcagca aaaatcaaca actttaaaat aagatgtgta 1081 ggtcattttg gtcaactaaa atttgagaaa tcgctgattt gtgtatgcac gtgtgttcct 1141 aaaaccgtta ctttatgtag ttctttttca aagaaaggag atgggcaccc ccaggaaagc 1201 tggttatacc cattgcttgg agtttctgat gactgctttg acatcagggc agcattctgt 1261 catcatccgt gtgtactctg ggatgacagg gacctggacc tgtgtagaag agtcccctaa 1321 ctcccttctc actccccagg gcttaaggct tgcctgggtc acctgatgct cgtttgcagt 1381 gggcgaggta gggcaggtga ggagagtggg tccattgccg tgagacacgc aatcagtggc 1441 aatagtgaag tcactgagcc ggacgtttct caatatttgc tctggcagag ggaggggggg 1501 gggagggagg gagggaggga ggtagagctg tttttttgtt ctaagggact gaaaactctt 1561 ctggaaatct acggggaatg catggagagc cagcactggg tttggggctc agggggtctc 1621 gtggacagca tgtccagcct ttcctagtgc taatctactt ggcaaatgga gattcctcct 1681 tcgggtgtca gctcaggggt agatgaagca ccctctgcct tttccgggct catcctgaac 1741 ttcctataga acactgatct ataagcctct cgcctgtcct cctggagcac ctgccattca 1801 caggattcat cagggtggtg ttcagggaag acacccattg gggtcgtcaa cacaatagag 1861 ccaactgaag aagggaggat ctgcctccca ttttatattt attttattat tttatttgag 1921 acagagtatg tctatgtagc cttggctagg ctggaacttt ccttttttga gcaggttggc 1981 ctcaaactca ccaagattat ctggctatac atctcctgag tgctgggatt aaaggagtgt 2041 accactgtac ctggcttctc tctctctctg tctctgtctc tgtctgtctc tctgtctctc 2101 tgtctctctg tctctctctc tctctctctc tctttctctc tctctctctt ccttccttcc 2161 tcttcctttc cttctttcct tcttttcttc tctctctctt tctttcttcc tttctttctt 2221 tctttctttc tttctttctt tctttctttc tttctttctt tctttctttc tttctttctt 2281 ctttctcttc tagtttttgt gtgtgtgtct gtgtgttttg gatacagtct tgctatgtga 2341 ccctacgctc ttctcaaact tgtatttatt ttgccacagt tttcccagag ctgagagttg 2401 ctttccttaa aaaaaacaaa aaatcacagg ggttggggat ttagctcagt ggtagcgcta 2461 tggcctagca agcgcgaagg ccctgggttc ggtcctcagc tccgaaaaaa aagggaaaaa 2521 aaattacata ttacttttag agaataacag gtttctgttg tgaaatttcc gtaagtattt 2581 acaacaaatg cagtactgac tattaaacca agagccatgt gtatgctagg caaatgctct 2641 acccctgacc tgcgtaatgc actttgatga cactcttaac ccgttgcatc ctggtcacac 2701 tcttctcctt cctcttccca aatagataat cttctgcctt tatgcctttt tcttttctct 2761 acatctagat tccgcatgtg agattcttcc ttagctggtc ttctcttact gaacaccgtg 2821 ctctgtccag ttgtctgcaa atggcatcat tccgtccttt agagctgaac agaactccac 2881 cgtgtatgcg tactgcatct tttttatcca catctctgtg gacggaccct tagctgattg 2941 cgaagtctgc ctattacgaa tacagtgacg gcagatgcgc aggcatctac ttccgctcct 3001 ccaaggtgcg caggcgtctc ggcagtacac cgacttccgc tccacccggg acatacctgc 3061 gagtggtata gctaggtcac agggtgattc tatttttcga atctatacta atttccatgc 3121 ggctatatta atttacattc ccacaacaat atatggaggt tccattccgc ctgtacatta 3181 tatcccccct ctctttctct gcctcaagta tgctaggcaa gcacttttac cagtcagcca 3241 acattcccag cccacttatg gttttcttgg tgatgccatt atgattggga tgaggtgaac 3301 atctccatgt agttaagatt ggccttttcc tgattctatt tccctcctgt tcatcagaca 3361 gaccactgaa tagcctaatc atatggggcc ctgaggatct ccagttataa aagttaatga 3421 ttctggttgc ttaattgact taggtaacag ttaataaaac acagggcctc atgatttact 3481 gcctatgagg cagaatagct taactaccct gctgttagag gaggccacct tggcccgagg 3541 tttggttctg ttctttagta gctgtgtggc ctttggcaag ttattcaact ctcctatgtt 3601 tgggtttctg cagttttcaa atgcgtttat aatagtacct accacatggt ggaatcagta 3661 aagtaacaca gacttagaat tgccaagtgc caagaacaca agagatgctt gatcaatatt 3721 tacgggtgac taaaattatt atttctacat ttttagtcat tcttaaaagg caatgctcct 3781 tccccccttt ttcttctgga cagaatagaa aaaacatcgt tttcaattca taataaattt 3841 ccattctcat aaaatatcca gtaccattta aggagttaat acaagctcct tgaccctcat 3901 cacaggtatt aataaagctt tatgaattac attatcagtc acttgttcat tcagccatcc 3961 tactgagcac tggctgtatg atagacacca tctcatgaaa caacgataaa taaatattct 4021 gctctcatgg aattaatatt ccagcagggg gaaataggca ataattaaaa tatgtaaaat 4081 gtatactata ttttactgat tctaaaacat ccattttccc agatgttaaa acccagaatc 4141 taggtgtggt gcctcccatt tgtaatccta gtattcagga ggctgagcag aaggctgcaa 4201 gttcaaggct tgcctcagtt aaaaaaataa attctcaggg ctggggattg tattacttgg 4261 ttgttaggat gcttacctaa gcacacatta gaccctaaat ttgaccctca atactgcata 4321 aaactgggtc tggtgacatc caggtggtgg aggcaggagg atcagaagtt taaagttatc 4381 tggctacata gtgagttcca agacaggtca gccacaacct tgtctaaaaa aaaaaaaaaa 4441 accacacaat taatattcat ttcacactga ctgaccagca tttttctttt gaacaaaaaa 4501 caaaacaaaa caacatcaga ccaagacaac aaatgcgtca acagcctctt agttttgatg 4561 ctggacacag ttgcggttag atatagtctc ctcctcctgc ccgcttgctg gaaggtaaaa 4621 ccaatatcct ttgggtagca aaatcccggc accttctctc ggactttgtt ttgctcatag 4681 ttctgtggca ggaaacttga tgtcctagac ttgggtaatt cactgtgatt tggccctcgc 4741 tgatggtcct ctcccctctg taggtcccag cagcgagtct ggataaagat tgctgacttg 4801 cataatctct ggcatttgtt accaggcggg gtagggaacg agtgatcctg agacttcact 4861 gagggaggtc agcagcaaag ctctggaaca ttccatgttc cttggccaag tgcatgctcc 4921 acaacttaca gatcatggag actgaactgg gactttctgc agcaggaagt caggctcagc 4981 ccctacctgt ccgaatggca gaccttgtat tacccattaa gccacgggac tgggatgctc 5041 aggattgctg tggtctaagc aggtagtgag agatgagagg ctggctctga ctggtttggg 5101 ttattatttg tgagtgctca aaaggaaaca gatggtgcgg tcaacctagg cagtttacga 5161 aaagggactt tcacttgtga ctgtgttcta gcaggatttg tctgtggcct cttgtgcttc 5221 tgagcggaga ctgggaacga ttacatgggt tcacatgtgg tcacagtcat cctgggccaa 5281 caacaggctc agatgggcta gccatccaaa atggcttctt catcggttca gtgcgccgat 5341 agttttgcat cccctctttg acaaaacaca cgggatttct tttagtttta ttttaccttt 5401 atattttttc atcttattaa atctttccct ccactttatt tgtattgcca tataaccttt 5461 tccccacata ctatttcctg attttttgat ccccttccct ctgctcccag gtttccctgc 5521 ctttcctcct atccagattc actctctttc tgtctctcac tagaaaacaa ccaggcttct 5581 aaaggataat aataaaatat aatataacac aagaaaccaa catctaatac actgggataa 5641 agacaaaata aaaagaagaa aagaacacaa taagagacag acacagagac ccactagttc 5701 atacacttag gaatcccaca aaaacactaa actggaagcc ataatataga cgcaaaggac 5761 ctgtagggta aaaggagaga aaaatatgaa ataattaaat acaaaaaaaa aaaatagccc 5821 tggcgtggca ctgtgaaaca aggcatctcc aatgttgagt tggtttccca tggaccatct 5881 cctgctgggc atggggcccg cccctaagag tagtttgttt ccccagcgag actcccttgg 5941 agaaaagtaa aattttattt gccagtgttt ttaagtggtg ataacttctg gattagggat 6001 ggggtacgtg tccaactctc ctttcaactc taggacccat ctggtgcagc cctgtgacag 6061 ccctgcacgt gctgccacag tctctgtgag ttcctgtgtg gatcagtcct gttgaccaga 6121 gggcttgttt tctcggcgtc ttccatgcct ctggctctcc tgcctcctct tctgcagggt 6181 tcccagagac acccatttaa gccggggtat gccaaggttt ctcacgctct gcatgctgtc 6241 tggctacggc tttctatttg tcccatctgc cgaagaagga agcctctctg atgatgactg 6301 agcagggcac tgatctactt ccctgactga agcacacact ggatttcttt cacgaaagct 6361 cagggtacta agcgagattt ggtagacgaa ctggcctgtg ttcagaacgc tcagaatgac 6421 tcctctggct ttcatcggtc aacccagtca tggcaaactt atttcctgat aaaatgctca 6481 ggatgaggac tggattgcag atgcggccac aaataaaaga acacgaagtt caccctggag 6541 acaaacggct gaggtaagag tcatggttgc cacagttatg caaaacacaa aacaaataag 6601 gtggtgctat ttgctaaact ccgtgagtgg ggcacccaga aggcacctca gtctgcaaac 6661 agatgaagat tttatttagc ttttgaagag cagagtgctg aagtagagag aaaagccaca 6721 acctttaaag ggtgagcagc cctttgaaaa cccactatcc ctttatacca tgaacaaaac 6781 agcagatcta atgctggctc ttcctcccct cattaccttc tccctttttt atgtgctatt 6841 tactgccaac tggtgagcac gtccagaccg aggcttatag gaagcaccac gtaagacaag 6901 agggcataag tggagaacaa ggaattgatt gtatgtgcag tacccaagtg atactgtcca 6961 ttattctgga tggaaaggaa cgacttagct ggtgaggaga tggaaaaggc aggcggatgt 7021 ttcctgcatc agctgaagag ttagttgact attttttttc ttaatttgtt tttgtttgtg 7081 tatgagtgct ttgcgcgcac ctagtctgtg taccacatgt gtgcaatgct caaggaggcc 7141 agaagagggc gccgtatctc ctggagctgg cgttgcagat ggtcccgaac ggccacgggg 7201 gtgctatgaa ttgaacaaaa agtaggccag caggaagttc atcgtctcag gaaaaagtct 7261 cattaaaata tcctaaaata agacccatag agtaccaagc tgagagagta aggatataga 7321 atgtacgtct ggaacaatag acacatttca acatgttaat catgggcaga aagaattgat 7381 attgcttgtg tccaggtggt agaattactg aagaactttt aaaggtttat ttgtttttat 7441 tttatgtgtg ggagtgtttt gcctgcagga tttgagtgtg ctacgcgcac gtctgttctt 7501 cttggagggt gtcagatggc ctgaactgaa gttacagttg gttgtgagcc aactgtgtag 7561 atgctggaaa ccaaatcccc atcaggtcaa acaaatgctc tgaactgctg agccatctct 7621 ccagccgcct ggcggtagaa ttattttaaa aatgcttttg tatgccaggc atggtggctc 7681 acagctgtaa agccagaact tgggaggtag aggcagaagg atcagcagtt tgggccatca 7741 caggctggct acattgtgag tatgagggtc tcaaaaaaca aaacaaaaat tatcccaaag 7801 ataaagaaca acgacaactc caaaacaaac aattaaccca cttctggcaa aagcgacagc 7861 gtacacttag aaaagctcac tagaagtcat ttggttttta ttttgatatt agcttttact 7921 taggtatcaa ttcgtttgtg tgtttagttt tgagataccc cagactatag ttgtttgttt 7981 ttcaagggaa ggacttcact gtgcctgttg gtgtaacctt tagtgagtgt cagagcttgc 8041 tgggccttga ctcgaggtct agtttatctg gtattgcaga aacttcctat acagctctgg 8101 gtcctgtcac acactctccc accctgcctg gtctgctttc tgctgaggga aaacattagc 8161 ttccagaggc aggtagcacc agaacttgtt tgctctgcct gtttcccctc tgacctcagt 8221 tctgatgttg gctcccacac ccagatgaag gtgtgttcgt ggcattgacc tgatgatgca 8281 gctgccctgg ctcccttcgc atgcccctct gccctgcagc ttctgcgtca gcgccagtat 8341 ttgctggacc acttgccaac ccctgtgcag cctgctgctc tttgatctgt atacctgaag 8401 ccaagccagc ccggcctgta agcagctgtt ctagagcttt gcaacccgca tcctaagcca 8461 aggtctcatc taaggcaaaa ataagggggt tattttctac agggacaggt gcgggacact 8521 cacagatgct tcaagaacac agtgggccga gaaaatgaac aatgtccgtc tttgattaaa 8581 tggggcaaag gcaacatctt taatattatc attccgtcaa tcactttgag aaaagcttat 8641 ttaagaagac aattcccaat ggttttcaaa ccaccggtta ggaaggtaat tagcctttct 8701 gaaaatacca tcaaggagtc tgaagaactg taacgtcggg gtgtggtggc ccacaccttg 8761 tcctcccagc acctgggagg ccgagacaaa gaagatgtag agttgggggc cagcttctgg 8821 tactctcgat gttcaaaacc aaaacagaac ggtttaaaaa ttaagactct ggcatagaat 8881 attctagaat caaacgtacc acaaacagat tcaggaatac tcggaattca gtataaacta 8941 aagcaacttt ttaaaattag tagggagaac caggtgtggt ggtacacacc tttaattcca 9001 gcacatggag gcaggggcag gcagatcttt gtgagttcaa gggcagcctg gtctacatgg 9061 caagttccag gctagccagg gcagcatagt gaagctctgc ttacaaaagc aaaacaaaaa 9121 accgggaaaa ccttcagttt ttgaataaga cacatatgtg agttcctaag atgcatgaac 9181 gccacagaga ccactcaaga gagcaccaag gctcctacgc aaaggcaagt tagctccaga 9241 gccctaggag agctgctcat gggacagggc agcgacagcc accaagctga gagtcagctt 9301 gtgtgcccag gagggaggcg ttgggtcaga gcctctggag gacccctgaa gtctcttctc 9361 agtgttctct atcacaggga gagctgtcag cccctggaat gtggttctat gtctagaaaa 9421 ctatcccata aataacagga agcccaaggt ttaccaatct ctgctgtaca ggatgttcta 9481 gctactttat ttgcaataga aaatctgaaa gtttccccat gtccaacaag actagaacaa 9541 acaagtcctg cgtagtcgcc tgtcggtttc tgggtgtggt ggtatagccc tgtaatccca 9601 gcatttggga agctgaggtg ggaggatcgg gagttcaagg tcagcttggg ctacttagaa 9661 agaccttgtc tcaaaagaag tggagggggg gtggtggtgg tggtggtggt ggtgtaaaat 9721 tgatctcttt gtatgataat gtccatacaa tatattaata ttgaaaacag tcatgccata 9781 gaacaatttt gttcagtaaa tgtacatcac ataagtaact tcaaatttta gaagggataa 9841 ttttaaagtc aggcaggcag cattaaagaa gaatgaggaa atcaatttaa aattcttttt 9901 taaagactta cttactggat ttttgcctat gggtaatttg catgtatgtt ataacaccct 9961 gtatgtttct ggttcctatg gggatcagaa gagggtatta ggtcctctgg gactggaggt 10021 atagaccgta gtttgtgagt caccgtgtgg gttctaagaa ctgaaccggg tcttctggaa 10081 gggcagccag tgctcttaac ctctgagcca tcttgccatc cccaataaaa ttagttaaga 10141 aattattatt attgttgttg tcgttttcgt ccttattagt gtgtgtgttc agcagtcagt 10201 tctgtccttc cacagtggca tctaagagtg gaacctaggt tggtggtcag gcttgtgcag 10261 gaagtgattt ttgccggctg agcaatctca tcaggcttcg gacatttttt aactgttttt 10321 atatttttat tttatagttt ttattgtttt aattcaaagt gtcagagaat ctttcaaaat 10381 gttgcaatcc atataaaaat gaacgaggtg gttggcattt ttaacactaa gtcttcgcaa 10441 tgcaacgtct cttttatatc aacaatgcat ctcactttag accagccaca tatcaaggat 10501 caaaagtcaa ttatggctgt agccattata ggaggccgtg caactctaaa gtatgtacag 10561 ccggcggtac agcgtttgct ttggttcaat ttcctatatc aaaaaggagt gtgtgggagg 10621 gagttgtaga aagtgactga ggaagacacc cagtgtccta atgtcaacag gaggatccac 10681 agcaggactc tggcaacacc cgtgtttatg aaacatctct gcgtgctgta gcttttattg 10741 gggggtttca gtgatccgtg tgtttgggca ctagaactct gtcggaccca gggggatgtg 10801 agccttgtgc atctaaatcg cagtccgcct tttcaggccc atcgcaggaa gaagccattg 10861 tagagatgca gcgtggtggt gggagcactg cacatgcgca gagacgctac tatgcaaata 10921 atagtctagc gcctcttgtg gacggtgttg tagctgcggt ctgtgcctgt tggagcaagg 10981 agcctgtgtg gccctggaat cagattacct gctttagggt aggagagaga tcctaccact 11041 gttgttggtt cctttgttaa cgttgtgatc agttacaagc tgctctggga attttccact 11101 gtttaaagaa cgacttcaga tcccttaact cgctgcactc agttttgtac tctgatgaaa 11161 atgaagctgc agagatcttt ttggctgcct tgtttcttga ttgattatta atttgtttga 11221 actgcccagt ttgttctaac tggagtccaa aactctgtgg tgtgccaaga aaaggaaatt 11281 aaggcatagg gcaaagaaat aataataacg actacggtga cgacgatgat gatgacgatg 11341 taaatgccat tatcgaacga gagcatgatt ctatctgtgc tttgaaaact tattaaattt 11401 tgtacgcatt tattctgcat ttctatttag ctatccgcat acttatctat tttggtgaat 11461 cagtgaaatt ttttatttgt ttttcccttg gaatgcggtt gaatttttgt ggatattacc 11521 gttgtccgta gcaaatccca catatggtag atgggtattc aaccaaactc ttatcattgc 11581 ttatatccag gagtgagact agagtgcaca gagaaagggg taccagaagg accctcgttt 11641 aaatgtattt ctgtactgtt aacacaattt tcataaccaa aaaagaccaa taaagttaat 11701 cactgtcacc ccaactgccc tccaccattt tctcaaacat ctcaccaaat gactagaaag 11761 agttaacagg attccagata cttgatgtaa ggacaaatcc cagattggaa ggtggcccag 11821 ggttggggtg agaaacagca gagtgggggg tggggtatgg gggtaggtcc gggggaggga 11881 cttagttctc actctcaacc aatagcacga aggcttcggg cccaacgccc attggctgaa 11941 actatttcaa gggtcaggac tgcacctgag ctc // LOCUS DRRADHA 1321 bp ds-DNA INV 30-JUN-1990 DEFINITION D.orena alcohol dehydrogenase (Adh) gene, 5' flank. ACCESSION M33488 Z00032 KEYWORDS alcohol dehydrogenase. SOURCE D.orena DNA. ORGANISM Drosophila orena Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1321) AUTHORS Moses,K., Heberlein,U. and Ashburner,M. TITLE The Adh gene promoters of Drosophila melanogaster and Drosophila orena are functionally conserved and share features of sequence structure and nuclease-protected sites JOURNAL Mol. Cell. Biol. 10, 539-548 (1990) STANDARD simple staff_review BASE COUNT 409 a 295 c 260 g 357 t ORIGIN 1 tctagattga ttctactacg ctgcctccag cagacacgcc cccccagcac cccatcccct 61 cccgctggca ccagtttcag tattacacgt atggaaatta agccgaagtt caattgcgag 121 cgcaggaaca acacgagctt gcatctttct acacttcttg gcgcgctatg cttgacattc 181 acaaggtcga agctcttagc gttctgactc ggggtgctac actgcacaaa attacattat 241 gcattcttca aatatcccta atataccaaa tatgtattaa aaagtgatca tttaccgatc 301 gttcgaaaga cggtataggc ttacaaaaaa atttgccaaa gtaatttaca aattcataat 361 ggttttttca agtatataat atgtacatca atattttctt acatgtattt tatgggatga 421 ttatgtttta aactacactc aattttttct cagtgcactt cctggtgttc cattttctat 481 tgggtccgta cccggcgttt gtttgcagat cacttgcttg cctattatta tagcatttta 541 cacattacaa aattctggac gtcgctgctg cagccgctgt cgacgtcaac tgcactcgcc 601 cccacgacaa aacggtattt aaggcgctgt gcaagtccca gtcgccgatt atgtctcagt 661 gcagttgtca gttgcagttc agcagaccgg ctagcgagta cttgcatctc ttcaaattta 721 cctaattgat caagtaagtg ggaaagggac ctcattatgc aatgtcgaat agtaagagat 781 cactatcact aatggtggag cataataaaa tcaattgcat gcaatcgaaa tgaatgcaaa 841 ccggcacaag cagtagcaaa cctactaaac aaattaaaat ttggaggctg tgccgtggcg 901 aatatttgac ttggaaaatc acctgttgtt taaccgctaa aaataggaat tttaacatta 961 agcatccctg ttaatcggcg ccgtgccttc gttagctatc tcaaaagcgc gcgcgtgcag 1021 acgagcagta attttccaag catcaggcat agttgggcat aaattataaa catagaaact 1081 gaatactaat atagaaaaag ctttgccgcc acaaaatccc aaacaaaaac aaaatatgtg 1141 ccgaaaaata aaacaaaaat aaaccgtaaa ccgggcagcg ttgccgtcgt tgcgggctgt 1201 gaagcttacg tgaatagccg agagatcgcg taatgataga taaagaaagc tctacgtaag 1261 cgaagcttct gggggataga tcttcctata aatacgggac cgacgcgaac tggaaacgaa 1321 c // LOCUS M33489 1380 bp ss-mRNA BAD 30-JUN-1990 DEFINITION D.melanogaster alcohol dehydrogenase (Adh) mRNA, 5' flank. ACCESSION M33489 Z00032 KEYWORDS alcohol dehydrogenase. SOURCE D.melanogaster, cDNA to mRNA. REFERENCE 1 (bases 1 to 1380) AUTHORS Moses,K., Heberlein,U. and Ashburner,M. TITLE The Adh gene promoters of Drosophila melanogaster and Drosophila orena are functionally conserved and share features of sequence structure and nuclease-protected sites JOURNAL Mol. Cell. Biol. 10, 539-548 (1990) STANDARD simple staff_review COMMENT Secondary reference. BASE COUNT 426 a 290 c 241 g 423 t ORIGIN 1 tctagattga ttctacgctg cctccagcag ccacccctcc catccccatc cccatcacca 61 tccagtcccg ttggctccca gtcacagtat tacacgtatg caaattaagc cgaagttcaa 121 ttgcgaccgc aggaacaaca cgatctttct acacttctcc ttgctatgct tgacattcac 181 aaggtcaaag ctcttaatat tctggcttgt ggccctacac tgtaagaaat tactatagaa 241 ataagggtac acggaataag atattttttt tagtccatat gcttttaaca aatgtgtttt 301 gagtttatgt tatattattg ttagaaaaca ggtgtttttt tttaaatcgg ttaaaaaatt 361 actacgagag aaaaatacaa attttgtaaa taagattgac tctttttcga ttttggaata 421 ttttcattca ttttatgttt ttacgttttc acttatttgt ttctcagtgc actttctggt 481 gttccatttt ctattgggct ctttaccccg catttgtttg cagatcactt gcttgcgcat 541 ttttattgca ttttacatat tacacattat ttgaacgccg ctgctgctgc atccgtcgac 601 gtcgactgca ctcgccccca cgagagaaca gtatttaagg agctgcgaag gtccaagtca 661 ccgattattg tctcagtgca gttgtcagtt gcagttcagc agacgggcta acgagtactt 721 gcatctcttc aaatttactt aattgatcaa gtaagtagca aaagggcaca caattgaagg 781 aaattcttgt ttaattgaat ttattatgca agtgcggaaa taaaatgaca gtattaaata 841 gtaaatattt tgtaaaatca tatataatca aatttattca atcagaacta attcaagctg 901 tcacaagtag tgcgaactca attaattggc atcgaattaa aatttggagg cctgttccgc 961 atattcctct tggaaaatca cctgttagtt aacttctaaa aataggaatt ttaacataac 1021 tcgtccctgt taatcggcgc cgtgccttcg ttagctatct caaaagcgag cgcgtgcaga 1081 cgagcagtaa ttttccaagc atcaggcata taatatacta atactaatac taatactaat 1141 ataagaatac taatatagaa aaaaagcttt gccggtacaa aatcccaaac aaaaacaaac 1201 cgtgtgtgcc gaaaaataaa aataaaccat aaactaggca gcgctgccgt cgccggctga 1261 gcagcctgcg tacatagccg agatcgcgta acggtagata atgaaaagct ctacgtaacc 1321 gaagcttctg ctgtacggat cttcctataa atacggggcc gacacgaact ggaaaccaac // LOCUS RATRIBOI 1456 bp ds-DNA ROD 30-JUN-1990 DEFINITION Rat ribophorin I (Rpn-I) gene, 5'end. ACCESSION M33508 KEYWORDS ribophorin I. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1456) AUTHORS Behal,A., Prakash,K., D'Eustachio,P., Adesnik,M., Sabatini,D.D. and Kreibich,G. TITLE Structure and chromosomal location of the rat ribophorin I gene JOURNAL J. Biol. Chem. 265, 8252-8258 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Kreibich, 03-APR-1990. FEATURES from to/span description pept 1362 > 1456 ribophorin I (Rpn-I) mRNA 1338 > 1456 Rpn-I mRNA (alt.) mRNA 1341 > 1456 Rpn-I mRNA (alt.) mRNA 1347 > 1456 Rpn-I mRNA (alt.) mRNA 1352 > 1456 Rpn-I mRNA (alt.) mRNA 1353 > 1456 Rpn-I mRNA (alt.) mRNA 1359 > 1456 Rpn-I mRNA (alt.) binding 1314 1324 SP1 binding site binding 1325 1333 SP1 binding site binding 1247 1256 SP1 binding site signal 1297 1300 CAAT box BASE COUNT 342 a 325 c 336 g 453 t ORIGIN 1 ctgcaggtgc taatttcctg acttcttctc tgtagggagg agcatgtcct tccctccgct 61 ctctcattag ccagtgtcag tgtttgaata tttccatctc cttccaaaat ttttttttct 121 tttttttttt ttcggagctg gggaccgaac ccagggcctt agcgcttgct aggccaagct 181 accactgagc taaatcccca acccccaaaa tttaccttta aattacatcc ttcaagcaac 241 aatattggga catgttcatt agatcagaat tgataagagg caacacggaa atgaaagagg 301 tgacagatct ctaccaaggc cttattgccc ttagtcagag cgaatgaagg agcagcagtc 361 tccttttgtg agtggagact gtacaccaca atcgtgcagg gggcttcgag attttatagg 421 gttgaaacgg gactttaggg gagcaaagtt ttatcagtag atttttcttt atcattatta 481 tcatcattgt tgttgttgtt tttaaatggt ttcaaggccc aataatgcca gtagatgttc 541 tctttgctac attatttttc ttatcaggtg ttatcagtat accgactaga gtttttccac 601 tagcttgaag ttcatttgtt ttaggtcatt aattcttcag tgtgactttg ggagagacag 661 cactagagga atccatctgc aagaagcctt tgtaaaaggt cttgccgcag agagaccctt 721 tcctgccttt ctgtcatatg acgatatgcg attccttctt tagacaatgc aatgccaaag 781 ttccatcttg gtagtaagga gcagccctca cctgacagct tagtcctgcc agtaacttca 841 ccttggactt ctacaattgt gctaagtgcc aacagatact tgatgttaat atcatcagcc 901 ttggttagtt gcttaaggtt tctgccatcg ctcatattaa aaattgaact aaatattagg 961 ggagagactt gaatctttgg aaatttcccg tttttcttta aacttttgcc cacttagtat 1021 tcatatcaat gtagagaagg caaacttatt tccttagact aagctgtttc tgagcggaga 1081 ggccattttt tttttttttt ttttttttgc ataagcatga aggaaacagg agataaggaa 1141 ggacgactac cacagaaatc actggtgagt ctagctaata agcgctcaaa gacgtgtgcc 1201 tgcgctttaa gatttttcaa tcaggcgccc catcgcggtt gccttggaga cggggtcctt 1261 cctgacccgg cccagccatc ttgtggtgct gttggccaat caacggttgc gccgtggcgg 1321 gggggggcgg ggcgcgtgtc ggtggcgctc agctcgcggt catggaggcg ccgatcgtct 1381 tgctgctgct gctatggctc gccttgcccg acgcctggca gcgcctcttc ggaggctccg 1441 ccgctggtca acgagg // LOCUS DROCYCB 2612 bp ss-mRNA INV 30-JUN-1990 DEFINITION D.melanogaster cyclin B mRNA, complete cds. ACCESSION M33192 KEYWORDS cyclin B. SOURCE D.melanogaster 3-12 hour embryo, cDNA to mRNA, clone Nr 4/4. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2612) AUTHORS Lehner,C.F. and O'Farrell,P.H. TITLE The roles of Drosophila cyclins A and B in mitotic control JOURNAL Cell 61, 535-547 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.F.Lehner, 26-MAR-1990, for release after publication. FEATURES from to/span description pept 250 1842 cyclin B mRNA < 1 2612 cyclin B mRNA BASE COUNT 710 a 670 c 635 g 597 t ORIGIN Chromosome 2; map postion 59A 1 aattcaaacg gcagcttggc ctcgcttcgt gtgcaaaact cgatcaggtt ttcggataat 61 tgacaatccc gcactcgact tgtaaattat aaactattag taatcatata ttcggacaac 121 aaattcaaca gctaaacaag cgttcggtca cagaaacgcg atcaaaagtc aaccaaacca 181 agtgatagcc aagcgtctgc ctatcttcgt gttaattgtg tttgtacaga tagaaaagaa 241 gcaatcaaaa tggtgggcac aacactgaaa atgcgtggcg atgagaacgc ttcggagaac 301 ttcaagcaag tgcaattgaa gaaattgacg gttccttcca tggaggcaac aacaaaacgc 361 gcggccttgg gcgatttgca gaatcgcggc ataagtcgtc ccatcgcagc gaaggatgcg 421 gcacagaaag actccaagga tctcaagctc acagacgccc tgcgcaatgc caaagctcgg 481 gtggacagcc actggaagaa acagccactg ggcagcacca atggcaatgg caatggcgcc 541 gttccgccca aggtcaacga ggggggcgtg tcggcgtttt tgcgttcgaa ttcggtgcgc 601 aatcgcgttc cgaccaagac cactgtagaa cccactaaag ttacagtcaa gtccagttct 661 tccgagaacg tgaacgagcc caccttgaag cgcgaggaca gcaatctgtc gaagaagtcg 721 ctgaccaaac tcggtgccgc tttggccaaa cccgtgatgg gagtttcagg aattcgacgg 781 gaaccagtag ctgtttcccg caaagaggca gagaccaaga aggaactgcc agaaaccaag 841 aaggactcac tggaagtgaa aaaggatgcg accaggatgc cccttattag gggcaacagt 901 gcagtcacta cgaccacatc cacgatgccc accaccatgt ccctttccag caagcgcttg 961 gctgggatcg aggacattga tgccaatgac aaggagaacc tggtactggt ctccgaatat 1021 gtaaacgaca tctacgacta cttgtatcag gtggagctgg agcagcccat tcacaaggat 1081 cacctggccg gacagaagga ggtgtcccac aagatgcgag ccgtgctgat cgattggatc 1141 aacgaagtcc acctgcagtt ccatctggct gcagagacct tccagctggc ggtggctatc 1201 attgatcgct acctgcaggt ggtcaaggac accaaacgca cgtacttgca attggtggga 1261 gtgacagcac tcttcatagc caccaagtac gaggagctgt tcccgccggc aatcggagat 1321 ttcgtcttca tcacggacga cacctacact gcccggcaga tccgacagat ggagctgcaa 1381 atcttcaagg ccatcgactg taatctgtcg cgtccgctgc cgattcactt ccttcgacgc 1441 tactcgaagg ctgctggcgc cgaggacgag caccatacga tgtccaagta cttcatcgag 1501 ttagcttccg tggactacga aatggccact tacaggccat cggagattgc agctgcctca 1561 ctgttcctgt cgctgcactt gctcaatgga aaccaccggg ccggtacagg attcaacgac 1621 cgtcactgga cgcccactct gaccttctac tcgcgatact cggccgcgca cttgcgtccg 1681 attacccggc tgatcgcgaa actggcccgg gacgctcctc aggccaagct gaaggccatc 1741 tacaacaagt accagggcag caagttccag aagatcgcgc tgcgaacgga gctgaccggt 1801 gcgctgatgg actcgattgt gggccagagc cagaggaaat agtgcggtcc aaggcggact 1861 ggaagaccct gacttaccta gtttagttta atttgttttc atttttaaat ttgtagcgta 1921 tttcattttc tgttcgtttc gtgttcgtta aaaaatgcgt atagttaccg tagtcgcatt 1981 gccaactatc tttacctgca tcacccatcc ctaagatatc gtaatctgct ggagtccctt 2041 gagcagtttt cggctactgc caagagctgg ctcccggcat ctttgcccac ggagcacaag 2101 ttgctcgcga gccggccgct gggagtgagt tcctccggta agactaggaa ccgaactaaa 2161 ctggagcccg tcagctcctt tcgctgggtc accatctcag tgggagcgag cggcggagcg 2221 gtggacaccg agagagagag tgggcaacga gttcatttgc tggccgaaca catcggcgtt 2281 gtctctccaa gaattatatt ttataattgt acccgtgaaa tcatttcatc ccctacctac 2341 ctatcactaa ttgtatttgt cgttagtttt tagtttcgat gggaatgaat aaatccaaaa 2401 aattgtgcaa aattattccc ataaatcgca taaaaaagag acgtagacta tttgtaattt 2461 atatcatgta tttcgcacat tcatacaaaa acaacttatg ctagttatat tgtttaccac 2521 taagcaacga ttaaaacacg ttttaactaa ctgatgtttc ccaaaaatgt ctctgaacaa 2581 tacatagaaa ataaagtttt gtataatttt at // LOCUS RATSV4I1 37 bp ds-DNA ROD 30-JUN-1990 DEFINITION Rat/SV40 5' recombination junction. ACCESSION K01218 KEYWORDS . SEGMENT 1 of 3 SOURCE Rat (strain Fisher) SV40-transformed isogenic cell line SV14B. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 37) AUTHORS Sambrook,J., Greene,R., Stringer,J., Mitchison,T., Hu,S.-L. and Botchan,M. TITLE Analysis of the sites of integration of viral DNA sequences in rat cells transformed by adenovirus 2 or SV40 JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 569-584 (1980) STANDARD simple staff_review COMMENT The three segments of this entry represent an integration of the SV40 genome in which two SV40 fragments are integrated in opposite orientations with a 40 bp insert of undetermined origin between them. The authors report that SV40 and adenovirus inserts into host cell DNA appears to be chaotic, with inversions, deletions, and recombination with segments of uncertain origin. FEATURES from to/span description recomb 25 26 rat DNA end/SV40 DNA start BASE COUNT 12 a 8 c 5 g 12 t ORIGIN About 57 bp 3' to EcorI site. 1 acttcacatt ctaggatgtc tggctactaa aaaatct // LOCUS RATSV4I2 58 bp ds-DNA ROD 30-JUN-1990 DEFINITION Rat/SV40 recombination junction DNA and an insert of undetermined origin. ACCESSION K01219 KEYWORDS . SEGMENT 2 of 3 SOURCE Rat (strain Fisher) SV40-transformed isogenic cell line SV14B. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 58) AUTHORS Sambrook,J., Greene,R., Stringer,J., Mitchison,T., Hu,S.-L. and Botchan,M. TITLE Analysis of the sites of integration of viral DNA sequences in rat cells transformed by adenovirus 2 or SV40 JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 569-584 (1980) STANDARD simple staff_review COMMENT The three segments of this entry represent an integration of the SV40 genome in which two SV40 fragments are integrated in opposite orientations with a 40 bp insert of undetermined origin between them. The authors report that SV40 and adenovirus inserts into host cell DNA appears to be chaotic, with inversions, deletions, and recombination with segments of uncertain origin. FEATURES from to/span description recomb 9 10 SV40 DNA end/insert of unkown origin DNA start recomb 49 50 insert of unknown origin DNA end/SV40 DNA start BASE COUNT 11 a 17 c 11 g 19 t ORIGIN About 3850 bp after segment 1. 1 tttgtgatct ttgcaatccc acagattgca gcataccagg cttccctgtc cttgtgca // LOCUS RATSV4I3 40 bp ds-DNA ROD 30-JUN-1990 DEFINITION Rat/SV40 3' recombination junction. ACCESSION K01220 KEYWORDS . SEGMENT 3 of 3 SOURCE Rat (strain Fisher) SV40-transformed isogenic cell line SV14B. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 40) AUTHORS Sambrook,J., Greene,R., Stringer,J., Mitchison,T., Hu,S.-L. and Botchan,M. TITLE Analysis of the sites of integration of viral DNA sequences in rat cells transformed by adenovirus 2 or SV40 JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 569-584 (1980) STANDARD simple staff_review COMMENT The three segments of this entry represent an integration of the SV40 genome in which two SV40 fragments are integrated in opposite orientations with a 40 bp insert of undetermined origin between them. The authors report that SV40 and adenovirus inserts into host cell DNA appears to be chaotic, with inversions, deletions, and recombination with segments of uncertain origin. FEATURES from to/span description recomb 16 17 SV40 DNA end/Rat cell DNA start BASE COUNT 9 a 8 c 7 g 16 t ORIGIN About 770 bp after segment 2. 1 gctttttaaa tcctcacttc aggggtaaga gtttcattct // LOCUS RATSV4J1 42 bp ds-DNA ROD 30-JUN-1990 DEFINITION Rat/SV40 DNA recombination junction, clone pEM2. ACCESSION K02743 KEYWORDS . SEGMENT 1 of 2 SOURCE Rat (strain Fisher) SV40 transformed cell line tsA30-N2 DNA, clone pEM2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 42) AUTHORS Hasson,J.-F., Mougneau,E., Cuzin,F. and Yaniv,M. TITLE Simian virus 40 illegitimate recombination occurs near short direct repeats JOURNAL J. Mol. Biol. 177, 53-68 (1984) STANDARD full staff_review COMMENT [1] reports that of two junctions between rat and SV40 DNA studied, one was a very abrupt transition, whereas the other (shown here) had an area of homology (bp 16-26) between the two sequences at the junction. The more likely recombination site at base 26 is given in the sites table, although base 16 might be another candidate. FEATURES from to/span description recomb 25 26 rat DNA end/SV40 DNA (base 439) start BASE COUNT 6 a 6 c 10 g 20 t ORIGIN 1 atggctgtat gcttttttta gagcctttgc tgcaattttg tg // LOCUS RATSV4J2 1357 bp ds-DNA Rod 30-JUN-1990 DEFINITION Rat/SV40 recombination junction, clone pEM5. ACCESSION K02744 KEYWORDS . SEGMENT 2 of 2 SOURCE Rat (strain Fisher) SV40 transformed cell line tsA30-N2 DNA, clone pEM5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1357) AUTHORS Hasson,J.-F., Mougneau,E., Cuzin,F. and Yaniv,M. TITLE Simian virus 40 illegitimate recombination occurs near short direct repeats JOURNAL J. Mol. Biol. 177, 53-68 (1984) STANDARD full staff_review COMMENT [1] reports that of two junctions between rat and SV40 DNA studied, one, shown here, was a very abrupt transition, whereas the other had an area of homology between the two sequences at the junction. A member of an interspersed middle-repetitive family bordered by 15 bp direct repeats is present at positions 595 to 790. Short direct repeats (ttg)6 and (tatc)12 are present at positions 53-70 and 310-357, respectively. [1] speculates that the presence of these short repeats provides a good environment for the viral integration process. FEATURES from to/span description recomb 24 25 SV40 DNA (base 705) end/rat DNA start unsure 306 307 tt could be ttt BASE COUNT 336 a 273 c 228 g 520 t ORIGIN About 5 kb after segment 1; 823 bp upstream of HindII site. 1 caacttgcat ctgttgctac tgttttttac tctgatgttg ctactgagaa ttttgttgtt 61 gttgttgttg gtggttttta aagacagtct cactatgtag tccaggctgg cctggaactt 121 gctctgtagg gtaagctggc catgaactca gtttgtatcc agtagctaga gtgagtcctt 181 gaacttggtc ctcctgcctt tgcctctcaa ctgctggcag ctactgagaa acttaactta 241 atctcattgc ttttttctct gacttttttt agaattggtt tttaatctta atactctatg 301 gtgcattcct atctatctat ctatctatct atctatctat ctatctatct atctatcacc 361 tatctctctt atatatctgt accctgtact ttttcagctt tttattatgg ggctcacgca 421 catgcacata atacgtctgc ataccatggc acacatgtga aggccacaca tgtagactaa 481 tcagttttct ccttctacta tgtgcatcct gggaatgaac ttaggtcatc gggcttggtg 541 gcaggtattt ttgtttgcta agctacctca ccagcccctc tggttctttt taaattcttt 601 tgagtttttt tttaatattt atttatttat tatacataag tacactgtag ctatcttcag 661 acacaccaga agagggcatt ggttgtgtga gccaccatgt agttgctggg atttgaactc 721 aggacctctg gaagagcagt cagtgctctt aaccgctgag ccatctctcc aacccttctt 781 ttgagttttt atgtgtcttt tctttttata aatttctaaa gtcaacgtaa aaagcattgg 841 atttgattac agcatgtttt tcttttatta aaaatattga ggcttttaac tctttagaga 901 caataaaccc aataagcatt agcataaata ccaattctta gaacaataca ttttcaaaca 961 cagaaacatt tactgaaaaa aggaactttt ttgcatcttt ataaatctgc tctattctag 1021 cttataaggt aaatgcttta ggctcatgtc caatttatat tcaatgtttg ttgatgtctg 1081 caatctatga ggaaaatcta acctcacaaa ggtgcatggt ttagaaacag aaagtttgac 1141 tttggttttg ttgttgtttg gatagtggtc tgtttggttg tgagtgtcac atttccttac 1201 atatgtgtgt gtcattatac ttttacatat tacttttaca tatgtgtcat tataactttg 1261 cctaatttga ccctctcctc ccctgctacc cgactctatt gctggttcaa ttcttcaccc 1321 caaatagtct ctcccagtat tcaggtcaca tgaattc // LOCUS RATSV4M1 37 bp ds-DNA ROD 30-JUN-1990 DEFINITION Rat/SV40 5' recombination junction. ACCESSION K01093 KEYWORDS . SEGMENT 1 of 3 SOURCE Rat cell line 14B DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 37) AUTHORS Botchan,M., Stringer,J., Mitchison,T. and Sambrook,J. TITLE Integration and excision of SV40 DNA from the chromosome of a transformed cell JOURNAL Cell 20, 143-152 (1980) STANDARD full staff_review COMMENT The authors report that parts of the SV40 genome are integrated in opposite orientations and that a segment of SV40 has been deleted in rat cell line 14B. There is a 40 bp insertion of unknown origin. [1] analyzes 3 segments with rat proviral boundaries. FEATURES from to/span description recomb 25 26 rat DNA end/SV40 DNA start BASE COUNT 12 a 8 c 5 g 12 t ORIGIN 1 acttcacatt ctaggatgtc tggctactaa aaaatct // LOCUS RATSV4M2 58 bp ds-DNA ROD 30-JUN-1990 DEFINITION Rat SV40 proviral DNA with insert of undetermined origin. ACCESSION K01094 KEYWORDS recombinant DNA. SEGMENT 2 of 2 SOURCE Rat cell line 14B DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 58) AUTHORS Botchan,M., Stringer,J., Mitchison,T. and Sambrook,J. TITLE Integration and excision of SV40 DNA from the chromosome of a transformed cell JOURNAL Cell 20, 143-152 (1980) STANDARD full staff_review COMMENT The authors report that parts of the SV40 genome are integrated in opposite orientations and that a segment of SV40 has been deleted in rat cell line 14B. There is a 40 bp insertion of unknown origin. [1] analyzes 3 segments with rat proviral boundaries. FEATURES from to/span description recomb 9 10 SV40 DNA end/DNA of unknown origin start recomb 49 50 DNA of unknown origin end/SV40 DNA start BASE COUNT 19 a 11 c 17 g 11 t ORIGIN About 3085 bp after segment 1. 1 tgcacaagga cagggaagcc tggtatgctg caatctgtgg gattgcaaag atcacaaa // LOCUS RATSV4M3 40 bp ds-DNA ROD 30-JUN-1990 DEFINITION Rat/SV40 3' recombination junction. ACCESSION K01095 KEYWORDS . SEGMENT 3 of 3 SOURCE Rat cell line 14B DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 40) AUTHORS Botchan,M., Stringer,J., Mitchison,T. and Sambrook,J. TITLE Integration and excision of SV40 DNA from the chromosome of a transformed cell JOURNAL Cell 20, 143-152 (1980) STANDARD full staff_review COMMENT The authors report that parts of the SV40 genome are integrated in opposite orientations and that a segment of SV40 has been deleted in rat cell line 14B. There is a 40 bp insertion of unknown origin. [1] analyzes 3 segments with rat proviral boundaries. FEATURES from to/span description prov < 1 16 SV40 DNA BASE COUNT 9 a 8 c 7 g 16 t ORIGIN About 818 bp after segment 2. 1 gctttttaaa tcctcacttc aggggtaaga gtttcattct // LOCUS XELXOTCH 9163 bp ss-mRNA VRT 30-JUN-1990 DEFINITION X.laevis Xotch protein mRNA, complete cds. ACCESSION M33874 KEYWORDS Xotch. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 9163) AUTHORS Coffman,C., Harris,W. and Kintner,C. TITLE Xotch, the Xenopus homolog of the Drosophila notch gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.R.Coffman, 26-APR-1990. Author address: C.R.Coffman University of California Dept. of Biology-B022, La Jolla, CA 92093 FEATURES from to/span description pept 346 7917 Xotch protein mRNA < 1 9163 Xotch mRNA signal 9146 9151 poly-A signal rpt 409 4632 EGF-like repeats rpt 4684 5025 lin-12/Notch repeats rpt 5953 6591 cdc-10/SWI-6 repeats BASE COUNT 2499 a 2192 c 2188 g 2284 t ORIGIN 1 gaattccgga cgcctggtgg aactaaacag actggaatgc actggaacac ggaagcctat 61 gcggggatac attgtagcgc tgagcccttc cgctgtaaca ttgtagcgct gagccgctca 121 tacactgtag cgctgtaact tcaggctgtg cgcaaactga gtgcgctatt ctctctccaa 181 tctggaaact ctgtcgcttt tggaccaggg attgcactga cttggattat cttcaccttt 241 ctctaaggga tttatagatt tttgccttgg gacctttatc ctccccttgt cctgggacag 301 ggaccttcat caggatctat gttcctatag tgagactgca ggaaaatgga tcggataggg 361 ctggctgtgc tgctctgctc cctgcctgtc ctcacacaag gtcttcgatg cactcagact 421 gcagaaatgt gcctgaacgg tgggaggtgc gaaatgaccc ctggtggaac tggggtgtgc 481 ctatgcggta atttgtattt cggtgaaagg tgccagttcc ccaatccctg caccataaag 541 aatcagtgta tgaactttgg aacctgcgag cctgtgctac aggggaacgc catagacttc 601 atctgccact gccccgtggg tttcaccgac aaagtgtgcc tgactcccgt ggataacgcc 661 tgcgtcaaca acccgtgccg gaatggagga acctgtgaac ttttgaatag cgtgactgaa 721 tacaaatgca ggtgtccccc aggatggaca ggggattcgt gccagcaggc agatccatgt 781 gcctctaatc catgtgccaa tggtgggaag tgtttgccct ttgagatcca gtatatctgc 841 aagtgtcccc ctggattcca tggagctacc tgcaagcagg acattaatga gtgcagccaa 901 aatccctgca agaatggcgg ccagtgcatc aatgagtttg gatcataccg ctgcacctgt 961 cagaacaggt ttactggaag aaactgtgat gagccatatg tcccctgcaa cccatcgccg 1021 tgcttgaatg gagggacctg ccgccaaact gatgacacct cctatgattg cacttgtctt 1081 ccaggcttta gtggacaaaa ttgtgaagag aacattgacg actgcccgag caacaactgc 1141 cgcaatggtg ggacctgtgt ggatggagtc aacacctata actgccagtg tccacctgac 1201 tggacaggcc aatactgcac ggaggatgtg gacgaatgcc aactgatgcc aaacgcttgc 1261 caaaatgggg gaacctgcca caacacctac ggtggataca actgtgtttg tgtcaacggc 1321 tggaccggag aggactgcag cgagaacatt gacgattgtg ccaacgccgc ctgccatagt 1381 ggtgccacct gtcatgatcg cgtggcatct ttctactgcg agtgcccaca cggccggaca 1441 ggtctccttt gccacctgga taatgcctgt atcagcaatc cctgcaacga aggctccaac 1501 tgtgacacca acccagtcaa cgggaaagcc atttgcacct gcccacctgg gtacactggt 1561 cctgcttgca acaatgatgt ggatgagtgc tctctgggtg ccaatccatg tgagcatggt 1621 gggagatgta ccaacaccct gggttccttc cagtgcaact gcccccaggg atatgcagga 1681 ccccgctgcg agattgatgt caatgaatgt ctttccaacc cgtgccaaaa tgattccaca 1741 tgcctggacc agattggaga gtttcagtgt atatgtatgc caggttacga aggcctttac 1801 tgtgaaacca acatagatga atgtgccagc aacccctgcc tacataatgg caagtgcatt 1861 gataaaatca acgagttccg ctgcgactgt cccacaggct tcagtggaaa cctctgccag 1921 catgactttg atgagtgtac cagcactccg tgcaaaaatg gagccaaatg cttggatggt 1981 ccaaattctt acacctgcca atgtacagaa ggtttcaccg ggcgccactg cgaacaagac 2041 atcaatgaat gcattcctga tccgtgccac tacggcacct gcaaagacgg cattgctacc 2101 ttcacatgcc tgtgccggcc tgggtacacg gggcgtctgt gtgataatga catcaatgag 2161 tgcctgagca agccgtgcct aaacgggggg cagtgtacag accgagagaa tggatacata 2221 tgcacttgtc cgaaaggaac cacaggggtg aattgcgaaa ccaaaataga tgactgtgcc 2281 agtaacctgt gcgataatgg aaagtgcatt gacaagattg atggctacga gtgcacctgc 2341 gagcctggtt acacaggcaa gttgtgcaat attaacatta acgaatgtga cagcaatcca 2401 tgccgcaatg gagggacttg caaagaccag atcaatggct tcacatgtgt gtgcccagat 2461 ggctaccatg accacatgtg cctgtctgaa gtaaatgaat gcaacagcaa tccatgtatc 2521 cacggggcgt gccatgatgg agtcaatggg tataaatgtg actgtgaagc aggatggagc 2581 ggtagcaatt gtgacatcaa taacaatgag tgtgaatcca acccgtgtat gaatggtggt 2641 acctgcaaag acatgaccgg agcatacatc tgtacttgca aagcaggatt tagtgggcca 2701 aactgccaaa ctaacatcaa cgaatgttcc tctaacccct gcttgaacca tggaacgtgt 2761 attgatgatg ttgctggtta caaatgcaac tgcatgttac cttatacagg tgccatttgt 2821 gaggcagttt tggctccgtg tgcaggcagt ccttgtaaaa atggtggcag atgtaaagaa 2881 tcagaagact ttgaaacctt ctcttgtgaa tgtccacccg ggtggcaagg gcaaacatgt 2941 gagattgaca tgaatgaatg cgtaaatcgc ccttgccgta acggtgcaac gtgtcaaaac 3001 accaatggca gctacaagtg caactgcaaa cctggttaca ctgggcggaa ctgcgagatg 3061 gacattgatg actgccaacc aaatccttgt cacaatggag gatcctgttc tgacggcatt 3121 aatatgtttt tctgtaactg cccggccggg ttccgggggc ctaagtgcga agaggacatt 3181 aacgagtgtg ccagtaaccc gtgcaaaaat ggagccaact gcactgattg tgtaaacagc 3241 tacacatgca cctgccagcc tggatttagt ggcatccact gtgaaagcaa cacccctgac 3301 tgcaccgaga gctcgtgctt taatggtggc acctgcattg atgggatcaa tacgtttaca 3361 tgccagtgcc cacctggatt taccggaagc tactgccaac atgatatcaa tgaatgtgac 3421 tccaagcctt gtttgaacgg aggcacctgc caggacagtt acgggacata caagtgcacc 3481 tgtccgcaag gatacactgg gctgaactgc cagaacctgg tgcgttggtg tgactcttct 3541 ccctgtaaaa atggaggaaa gtgctggcag acaaacaact tttatcgttg tgagtgcaag 3601 agtggctgga ctggtgttta ctgtgatgtc cccagtgtct cctgcgaggt tgctgctaag 3661 cagcaaggtg tggatatcgt tcatctgtgt cgcaattctg gcatgtgtgt ggacactgga 3721 aacacacatt tctgccgctg tcaggctggc tacacaggca gttactgcga ggaacaggtg 3781 gacgaatgct cgccgaatcc ttgtcagaac ggtgccacat gcacagacta cttaggtggc 3841 tactcttgcg agtgtgttgc cggctaccat ggagtaaact gctccgagga gatcaatgag 3901 tgtttgtccc acccgtgcca gaatggagga acctgtattg atctcattaa cacctacaaa 3961 tgctcctgcc ccagaggaac tcaaggtgtg cactgtgaaa ttaatgtgga tgattgcacg 4021 cccttctacg actccttcac cttggaaccc aaatgcttca acaatggcaa atgtatagac 4081 cgcgtgggtg gctataactg catctgccct ccaggctttg taggggaacg ctgtgaaggc 4141 gatgtgaacg agtgcctgtc taatccctgt gactcccgtg gcacccagaa ctgcattcaa 4201 ctggtgaatg attaccggtg tgagtgccgg cagggattca caggaaggcg ctgtgaatct 4261 gttgtggatg gttgtaaggg aatgccctgc cgaaacggtg gaacatgtgc tgtcgccagc 4321 aatacagaac gtggatttat ttgcaaatgc cctcctgggt ttgacggagc cacttgcgag 4381 tatgattcac gaacctgcag caaccttcgc tgccagaatg gcggcacttg catttcggtg 4441 ctgacgagtt ccaaatgcgt gtgctcggaa ggctatactg gcgccacatg tcagtaccct 4501 gttatcagtc cttgtgcttc ccacccttgt tacaatggag gaacctgcca gttctttgct 4561 gaggaacctt tcttccagtg cttctgtcca aaaaacttta acgggctctt ctgccatatc 4621 ttggattatg agtttcccgg gggactaggc aagaatatta ctcctcctga taatgacgat 4681 atctgcgaaa acgagcagtg ctctgagctg gctgacaaca aagtctgcaa cgccaactgc 4741 aacaaccatg cttgcggatg ggatggagga gactgctctc tcaacttcaa tgacccatgg 4801 aagaactgca cccagtcgct gcagtgctgg aaatatttca atgatggcaa atgcgattct 4861 cagtgcaaca acactgggtg tctgtacgac gggtttgact gccagaaagt tgaggttcag 4921 tgcaaccctt tgtatgacca gtactgcaag gatcactttc aagatggcca ctgcgaccaa 4981 ggctgtaaca atgcagagtg cgaatgggat ggcctggact gcgccaacat gccagagaac 5041 ctggctgaag gcaccctggt gttggtcgtc ctcatgcccc ctgaaaggct taagaacaac 5101 tctgtcaatt tcctccgtga gctgagccga gtgctccaca caaatgtggt gttcaagaaa 5161 gacagcaaag gcgaatacaa gatctaccct tactatggca atgaggaaga acttaaaaaa 5221 catcacatca agaggtcaac tgattactgg tcggatgctc cttctgccat cttcagcaca 5281 atgaaagaat ccattttact aggaaggcat cgtagggagt tggatgaaat ggaagttagg 5341 gggtcaatcg tatacttgga aatagacaac cgccagtgtt acaaatcctc ctctcagtgc 5401 ttcaacagtg ctacagatgt tgccgcattc cttggggctt tggcctccct tgggagtctg 5461 gacacccttt cctataagat tgaggcagtg aaaagtgaga acatggagac ccccaagccc 5521 tctactctct atcccatgtt atccatgttg gtcattccgt tattaatcat cttcgtcttc 5581 atgatggtca tcgttaataa gaagcgtcgc cgtgaacacg acagctttgg ttccccgacg 5641 gctttattcc aaaagaaccc agcaaaaaga aacggcgaga cgccttggga ggattctgtt 5701 ggcttaaagc ccattaagaa tatgacagat ggctctttta tggacgataa tcagaatgaa 5761 tggggagatg aggagaccct ggaaaacaag aggttcaggt ttgaagagca agtgattctc 5821 ccagaacttg ttgacgacaa aactgacccc cgacagtgga cacggcagca ccttgacgcc 5881 gctgatctgc gcatttcatc catggcccca acgccgccgc agggagagat tgaggccgac 5941 tgtatggatg tcaatgtccg tggccctgat ggcttcaccc cacttatgat tgcctcctgt 6001 agcggaggtg gattagagac tggaaacagc gaagaggaag aggatgcttc agctaatatg 6061 atttccgact tcattgggca gggtgcccaa ctgcataacc aaaccgaccg cactggcgag 6121 acagcacttc atctggccgc aagatacgca cgtgctgatg cagccaagcg cctgttggaa 6181 tcgagtgcgg acgccaacgt ccaggataac atgggcagga ctcctctcca tgcagcagtg 6241 gcggctgatg ctcagggcgt attccagatt ctcattcgga accgagcgac agacttagac 6301 gcccgcatgt ttgatggcac tacccctctg atcctggccg ctcggctggc cgtggaaggg 6361 atggtggagg agcttatcaa tgctcatgca gatgtcaacg ctgttgatga atttggaaaa 6421 tctgctttgc attgggcagc ggctgtgaat aacgttgatg ctgcagctgt gcttctcaag 6481 aatagtgcaa ataaggacat gcaaaacaac aaggaagaga catccctgtt cttggccgca 6541 agagaaggca gctacgaaac tgccaaagtc cttttggatc actacgccaa ccgtgacatc 6601 acagaccaca tggatcggct gcctcgtgac atcgcccaag aacgcatgca ccacgacatt 6661 gttcacctgc tggatgaata taaccttgtg aagagcccaa cgctgcacaa tggtccgttg 6721 ggagcaacga cattatcacc tcccatctgc tcccctaatg gttacatggg gaacatgaag 6781 ccttctgttc agagcaagaa agcccgcaag cccagtatca aaggtaatgg ctgcaaagag 6841 gccaaagagc tgaaagccag aaggaaaaaa tctcaagatg ggaaaacaac tctcttggat 6901 tctggcagtt ctggagtgtt gtccccagtg gactccctgg agtcaacaca tggatactta 6961 tcagatgtgt cttctcctcc gctgatgacc tctccgtttc agcagtctcc atccatgcct 7021 ctgaaccact tgacaagcat gccagagtcc cagcttggca tgaatcacat aaacatggcc 7081 accaagcagg aaatggcagc aggttccaac agaatggctt ttgatgccat ggtgccacgt 7141 ctgacccatc tcaatgcctc aagccctaat accatcatga gcaatggatc catgcatttc 7201 actgtgggag gagctccgac tatgaacagc caatgtgact ggttagctag gctgcagaat 7261 gggatggtcc agaatcagta tgacccaatc agaaatggca tccaacaagg caatgctcaa 7321 caagctcaag ctcttcagca tggccttatg acctcgctcc ataatggtct gccagcaaca 7381 actctctccc aaatgatgac ctatcaggcc atgcccaaca caaggctagc caatcagcca 7441 catctaatgc aagcccagca aatgcaacag cagcaaaact tgcagttgca ccagagcatg 7501 cagcaacaac atcacaattc cagcacgacc tctactcaca tcaactcacc attctgcagc 7561 agtgacataa gccagacgga cctgcagcaa atgtcaagca acaacattca ttcagtaatg 7621 ccccaggaca ctcagatatt tgctgcatct ctgccttcca atcttacgca gtctatgaca 7681 actgcacaat ttttaacccc cccttcccag catagctact cctccccaat ggacaataca 7741 ccaagccatc aactacaagt accagaccac ccgttcctga cgccttctcc tgagtcacct 7801 gaccagtggt caagctcctc ccctcattcc aatatgtctg actggtcaga aggaatatca 7861 agtcctccca cgagtatgca gcctcagcgc acccacatac ctgaagcttt caagtaaaaa 7921 aaaaaaaagt ttaaaaaaat gtaaaatatt tttaaagaca ctgagagaga ctttaagaga 7981 ctgaaggaaa tttttatatg gtttttatac ttaaaataac agaacatttg aattttctag 8041 tatttattta tatatacgtt tgacctaaaa cactgccctt ttatttataa gctttttttc 8101 atttttagga caaaatattt taacttcttg ccttgaaagt ttttcagttc taaatcttat 8161 gaaattggtt cctgcctggt attgaaaacg gcaatgtatt tattttttat ttacctgaat 8221 agtatacagg aacaaaccac tggggtgggg gggttatcgg gatgtgtatt tagcagaaaa 8281 aagattttct ataaaatgaa atctttcagg ttttcattta tagcactaaa aagattccag 8341 tattaatttt aaattaaatc atgaagaaga tgctccgaat acctctcgct gtcagggagg 8401 tgcccaatag gttttttttt tttgtttttt ttttggaaaa cctgaaactc ttggtaagta 8461 cagaaagaaa gcataaaata caggagggca cgcctaggtc tgatgcattc tcaatgggaa 8521 taaaatcctg aaaagtgact cattttaaaa aattatattt gttagaaaaa acaaactgcc 8581 attttgaatc ccttgtgtcg catggggtat tggtcagaaa atgcatgcaa tttttggtac 8641 aagcccgtgg catattttct acagtttttt tttttttttt tttttttttg taaatatcta 8701 tggtggattt tgtttcaaaa atctatttgt atatcctgat ttatattgct tatccgagta 8761 ggagtgcagg ctggatggaa ccatgccgaa gacaaatcgc agattggaca ttgtttttta 8821 tttttttgcc tacagagact atttgcacat gaacaacaac aaaaaaaaga agatatattt 8881 aaatataaaa ttttatacaa acgaaaaaac aaaaaacaaa aactgtcgat ttgcctctag 8941 ttccacttct tctgtgatac gatgcagttg ggcagttcat tccttatgcc gtactgtaca 9001 aagatctgta gaatttagtt gtttttcctg aacaaataaa tatgtaatgt atcgagcaga 9061 aatgtagttt acaaaaaaaa ctatttgttg ttgttgctgt taacacttct gtaaacaaat 9121 tttttttggt ttgtttattt gttgtaataa aattgtacaa aat // LOCUS CCRFTRCX 2202 bp ds-DNA BCT 30-JUN-1990 DEFINITION C.crescentus transcriptional protein (ftrC) gene, complete cds. ACCESSION M30946 KEYWORDS transcriptional protein. SOURCE C.crescentus (strain CB15) DNA. ORGANISM Caulobacter crescentus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Prosthecate bacteria. REFERENCE 1 (bases 1 to 2202) AUTHORS Van Way,S.M. and Mullin,D.A. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.A.Mullin, 20-DEC-1989. Author address: D.A.Mullin Tulane University Dept. of Biology 2000 Perciual Stern Hall New Orleans, LA 70118 FEATURES from to/span description pept 340 1707 transcriptional protein (ftrC) BASE COUNT 383 a 741 c 750 g 328 t ORIGIN 1 gaattcggag gggccatgct ggcctccgag gctccgatcg aactcagcga caagaccgct 61 gcggacctgg cgccggtctt cgacgttccg gtcaacatct cggccgtgct gggccgcgcc 121 aacatgtccg tggcgcagct gctgcagctg ggccagggct cgatcctgga gctggaccgc 181 aaggtcggcg aggcgatcga catctacgtc aataaccgcc tggtcgcccg gggcgaggtc 241 gtcgtcgtcg acgagcgcct gggcgtgacc atgacggaaa tcatcaagga cggcgaccag 301 ggctgacgcc cggtggtcgt tcgagaggga gagtaagaga tgcggcttct ggtcgttgga 361 aaactgaacg ggcagctctc ggtcgccgtg aagatggcga tgaacgcggg cgcgaaggtc 421 tcgcacgtcg aaacgacgga gcaggcgacc aacgcgctgc gggcggggca gggcgctgac 481 cttctgatgg tcgactatgt gctcgacatc gccggtctga tcgccgccaa cgaagccgag 541 cggatgcggg tgccggtggt ggcctgcggc gtcgacgccg atccgatgcg cgcggccaac 601 gccatcaagg ccggggccaa ggagttcatc ccgctgccgc cggacgccga gctgatcgcc 661 gccgtcctgg ccgccgtcac cgacgacgaa aagccgatgg tcgtccgcga cccggccatg 721 gagcaggtca tcaagctggc cgaccaggtc gccccctccg aagcctcgat cctgatcacc 781 ggggagagcg gctcgggtaa ggaggtcatg gcccgctacg tccacggcaa gtcgcgccgg 841 gccaaggcgc cgttcatcag cgtcaactgc gccgccatcc ccgagaacct gctggaaagc 901 gagctgttcg gccacgagaa gggcgccttc accggggcca tggcccgccg catcggcaag 961 ttcgaggagg ccgacggcgg caccctgctg ctggacgaaa tcagcgaaat ggacgtgcgc 1021 ctgcaagcca agctgctgcg cgccatccag gagcgcgaga tcgaccgcgt gggcggctcc 1081 aagccggtca aggtcaatat ccgcatcctg gccaccagca accgcgacct ggcccaggcg 1141 gtgaaggacg ggacgttccg ggaagacctg ctctaccgtc tgaacgtcgt gaacctgcgc 1201 ctgccgccgc tgcgcgagcg tccggccgac gtgatcagcc tgtgcgagtt cttcgtgaag 1261 aagtactcgg ccgccaacgg catcgaggaa aagccgatct cggccgaggc caagcgccgc 1321 ctgatcgctc accgctggcc gggcaacgtc cgcgagctgg aaaacgccat gcaccgggcg 1381 gtgctgctgt cggcgggccc ggagatcgag gagttcgcca tccgtctgcc cgacggccag 1441 ccgatggccc cggcgccgga cgtcgcggtt gcccgcggcg cccagatggc cgccgacgcc 1501 gcctcgcgcg ccttcgtcgg ctcgaccgtc gccgaggtcg agcagcagct gatcatcgac 1561 accctggagc actgcctggg caaccgcacc catgcggcca acatcctggg catctcgatc 1621 cgcaccctgc gcaacaagct gaaggaatat tccgacgccg gcgtgcaggt gccgccgccc 1681 cagggcgggg tcggcgcggc cgcttaagac cagggcgtcg aggtggcggg accgaaaacc 1741 tcgcccttcg acaagctcag ggtgaggttt tctatgcctg gcccggcgct tcattcgtcc 1801 tcaccctgag cctgtcgaag ggcgaggacg acgcaaagct ccgggccgat ctggccgccg 1861 ccgcgaccat cggtccggcg ccgcgccgcg aaaaatttgg ttaacaagat ggcggcgggc 1921 cgcaaatcgc tccaagcgaa agcctagctt cccatccccg gttcaaggtt ccgaatggcc 1981 gacgccgccg ccccgaacgc cagctcgatg cccagcgcca agtcgctgct cgacgggttg 2041 atgcgcggcg agatgggcct ggccctgggc gtggtcggca tcatcgtgct gctgatcatc 2101 ccggttccgg cgccgctgct ggacgtgctg ctggccattt cgctgacggg atcggtgctg 2161 atcctgatga cggcgattct gatcaagaag ccgctggaat tc // LOCUS TRCTC26 638 bp ss-mRNA INV 30-JUN-1990 DEFINITION T.cruzi AMP-inducible protein (TC26) mRNA, partial cds. ACCESSION M34062 KEYWORDS AMP-inducible protein. SOURCE T.cruzi (strain X10/4) trypomastigote, cDNA to mRNA, clone G26. ORGANISM Trypanosoma cruzi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 638) AUTHORS Heath,S., Hieny,S. and Sher,A. TITLE A cyclic AMP inducible gene expressed during the development of infective stages of Trypanosoma cruzi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.Heath, 08-MAY-1990. FEATURES from to/span description pept < 1 > 638 AMP-inducible protein (TC26) (AA at 1) BASE COUNT 154 a 174 c 168 g 142 t ORIGIN 1 accccaccag accaactggt aatggtagcg accggcgctc agcttggaat tccgtgcgct 61 gcttctagcc ctagagaagc tgatgatccc ccgtatccga cacaggcgta caaccctgct 121 tgtggttacg gacagtcagt ctcttctagc ggctctaaac aagggcccgc tcaatcagac 181 agactggacg gaggatcaga tctggcggcg tctcttgaca ctgacgtgtg ctggctggtc 241 gtgcacctgc agttttgtta cggacattgt ggagtacatg ctaacgagct tgcagatcag 301 tatgcgagga actatggaaa gtggacaata cacggagcaa ggaatcgcac ctttatggca 361 tacggatctg ctgacgtgtt ttactaccca gctcaccaac aagtggcgta ctaccattcg 421 tcaagacact catcgctacc tgctttgcgg cacaaggcca tcagatctcg cggtaaggac 481 ctgatcactc aggaagttct acaccgtcag aactggttca cctcgcaagg gcaaggtgcg 541 gggaatctga gctctggggc cgactatctg ggccgtgaga gattgcacga accaatgccg 601 tttctgcaac atctcaccgg aacagtctgc atatatat // LOCUS HUMSATM29A 700 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human minisatellite MS29 locus 1 polymorphism. ACCESSION M28875 KEYWORDS minisatellite. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 700) AUTHORS Wong,Z., Royle,N.J. and Jeffreys,A.J. TITLE A novel human DNA polymorphism resulting from transfer of DNA from chromosome 6 to chromosome 16 JOURNAL Genomics 7, 222-234 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.J.Jeffreys, 04-OCT-1989. BASE COUNT 156 a 188 c 212 g 144 t ORIGIN Chromosome 6p25-pter. 1 ggacagccag agtgcccaca agctcggcag gaaggaggct cacatcgccg cggaagtctc 61 gtgatggttc tggtagaatc gtcctcttga aaccttctgc tgccattatt gggtgctgtc 121 caatatgcct tacgcaaaac actgacaata gcatacacct ttccagaatg ccggcacgcg 181 ctgtgcaatg cccaaactgt ccctgtgagt tgaatgcccc tggcaggatg cgtgtccact 241 cccgcagggc cgtgggtgcc tcccttgtct cactcctctg cttcaaccct tcccagaact 301 ctgaaggttt gagatgaaat gactgagcca cacctagtgc aggaatgagc accagggcct 361 ctctgcaacg gctgcttaag ggccacagga taagcatagt gtggcgcccc cagttattcc 421 ctgtttcact cattggacca ccctgatagg cttcagggag gcaggaacct cctctaaggc 481 agcaggagag gggtggatgg ggcagccaac aggtgcatct catgcctccg tgttctccat 541 agggagggag gtgggctaag aggcgagtgg ggcagaggcc tgggccagcg tgaagtgtca 601 ggagtaaggg gggagaagac tggagggggc aaatccagga ggcagctgag gctcgactgt 661 gcaaatacaa gcccggttac gtctgtgtag cttagcttcc // LOCUS HUMSATM29B 700 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human minisatellite MS29 locus 2 polymorphism. ACCESSION M28876 KEYWORDS minisatellite. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 700) AUTHORS Wong,Z., Royle,N.J. and Jeffreys,A.J. TITLE A novel human DNA polymorphism resulting from transfer of DNA from chromosome 6 to chromosome 16 JOURNAL Genomics 7, 222-234 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.J.Jeffreys, 04-OCT-1989. BASE COUNT 182 a 189 c 146 g 183 t ORIGIN Chromosome 16p11. 1 ggacagccag agtgcccaca agctcggcag gaaggaggct cacatcgccg cggaagtctc 61 gtgatggttc tggtagaatc gtcctcttga aaccttctgc tgccattatt gggtgctgtc 121 caatatgcct tacgcaaaac actgacaata gcatacacct ttccagaatg ccggcacgcg 181 ctgtgcaatg cccaaactgt ccctgtgagt tgaatgcccc tggcaggatg cgtgtccact 241 cccgcagggc cgtgggtgcc tcccttgtct cactcctctg cttcaaccct tcccagaact 301 ctgaaggttt gagatgaaat gactgagcca gtgagtgaaa agaccagcca agaatgtgaa 361 aaacctcatc tggacacaag gcaaaataat tattttctcc cttggttacc ttttccattc 421 cctttcttca agggaaagaa ccatgtctta ttttacttgg catcagcagc tctcagaaca 481 gtggcttgac caataccaag cattcaataa atgtttattg aaggaatgaa tcaatggcca 541 cccacaacct atacctctta cagcccattt aatgtccttt agccccttcc ataggtattt 601 gcctggaatg tttgtaagat ttttgaaaat ctcagaaaca acaccagccc tccacccatg 661 ggaagatgtg tctcaggcta acacctgtgc tcttgttgat // LOCUS HUMSATOD 1900 bp ds-DNA PRI 30-JUN-1990 DEFINITION Human O family dispersed repeat element. ACCESSION M28877 KEYWORDS repetitive element. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1900) AUTHORS Wong,Z., Royle,N.J. and Jeffreys,A.J. TITLE A novel human DNA polymorphism resulting from transfer of DNA from chromosome 6 to chromosome 16 JOURNAL Genomics 7, 222-234 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.J.Jeffreys, 04-OCT-1989. FEATURES from to/span description rpt 410 785 O family dispersed repeat BASE COUNT 484 a 501 c 344 g 567 t 4 others ORIGIN 1 gccacctctg tctaatgttc tcatatagct ccaactctaa cacctcctta cattctacca 61 tctgtgtctt agactgcagc aagccctgta actggctctc ggtctccagg cttctcccga 121 cctagacttg cttacattgc tgtacaagtg aggtttttaa cattctcagt cttgtcactc 181 actgcttaaa ctccttcaag aattttcact cccttcaggc caagggctat atgaagccct 241 tcagatgtgg cccttgcttg tgtctccagc cccatctctc agcagtgctc ttaacactcc 301 caaccatcat gagctaatta ctatccttcc aacatgccat gctccttcac acccctcttg 361 ctttgaacac gctgactctt ctgcctgaaa tgtctttccc agttcccttc tgatatggtt 421 tggctgtgtc cccacccaaa tctcatcttg aattgtagtt cccataatcc ccacatgttg 481 tgggagggac ctggtaggag gtaattgagt catgggggtg attaccctca tgctgccatt 541 ctcatggtag tgaacgtgag ttctcacaag atcttatggt tttataaggg gcttctctcc 601 ctttgctcag cacttcacct tgctgccacc atgtgaagga cacgttttct tccccttcca 661 ccatgattgt aagtttcctg aggcctcctc agccatgctg aactgtgagt caattaactt 721 ctttccttta taaattaccc agtctcgggt atgtatttat tagcagtgtg agaatggact 781 aatacacccc cttcactcac ttttcagtct tcaaagtaca cagccatcct ttactgcccc 841 atgcagcacc ttcaggtctt gagctctatg cagggaggta agccacggaa gtcacaagca 901 catcttttat actagtgcac acaactgaaa actcacagac tggtaaaagc acatctttta 961 tactagtgca cacggctgaa aactcacaga ctggtaccaa accagcttgg ttctgttctg 1021 cttccactgc ctgtcaagac tgtgtgctct gggcaaggta cttcaaccac tgggtcttgg 1081 cttcctcatc tggaaaatga aaataaaact gcaactttaa aaagcttttc tgaaatcaaa 1141 tgcagcaatg tacataaaag gcaaataagc tcaccctatg ttttacctaa ttttcttttt 1201 atcctgctct gtgtatttct taagccaacc atcttacttc ctttttagaa tgcatcagga 1261 tagacacaaa taaatggaaa cactgttaca aagcccctgc tcacacttca gaattcagat 1321 tgcattacct cctccaggaa gtcttcctta atcactctct gctcgagtct catttagatg 1381 atctcctgtg tgcttgcata tcgtctgtgc atacctctgt gttgggagga gtggtaagag 1441 atggggctgg agactcaaga aaaggccaca tcggaagggc ctcgatcctt tcccctgaag 1501 ggagtaaagg ctctccaaag agtttaagca gtgagtgaaa agaccagcca agaatgtgaa 1561 aaacctcatc tggacacaag gcaaaataat tattttctcc cttggttacc ttttccattc 1621 cctttcttca agggaaagaa ccatgtctta ttttacttgg catcagcagc tctcagaaca 1681 gtggcttgnn nnataccaag cattcaataa atgtttattg aaggaatgaa tcaatggcca 1741 cccacaacct atacctctta cagcccattt aatgtccttt agccccttcc ataggtattt 1801 gcctggaatg tttgtaagat ttttgaaaat ctcagaaaca acaccagccc tccacccatg 1861 ggaagatgtg tctcaggcta acacctgtgc tcttgttgat // LOCUS BSURFOLA 4880 bp ds-DNA BCT 30-JUN-1990 DEFINITION B.subtilis folic acid biosynthetic operon encoding para-aminobenzoic acid synthase, component I (pab), glutamine amidotransferase (trpG), pot. p-aminobenzoate synthesis enzyme (pabC), and pot. dihydropteroate synthase (sul) genes, complete cds. ACCESSION M34053 KEYWORDS H2-pteridine-CH2OH pyrophosphokinase; dihydropteroate synthase; glutamine amidotransferase; para-aminobenzoic acid synthase. SOURCE B.subtilis (strain VB157) vegetative cell DNA, clones pJS[1,2]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 4880) AUTHORS Slock,J., Stahly,D.P., Han,C.-y. and Crawford,I.P. TITLE An apparent folic acid biosynthetic operon of Bacillus subtilis containing pab, an amphibolic trpG, a third gene required for synthesis of para-aminobenzoic acid, and the dihydropteroate synthase gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.P.Stahly, 07-MAY-1990. Author address: D.P.Stahly University of Iowa Dept. of Microbiology Iowa City, IA 52242 e-mail: cmdsix@vaxa.weeg.uiowa.edu FEATURES from to/span description pept 313 1725 para-aminobenzoic acid synthase, component I (pab) pept 1739 2323 glutamine amidotransferase (trpG) pept 2323 3204 pot. p-aminobenzoate synthesis enzyme (pabC) pept 3186 4043 pot. dihydropteroate synthase (sul) (EC 2.5.1.15) pept 4036 4398 ORF2 binding 300 303 ribosome binding site binding 1724 1730 ribosome binding site binding 2306 2313 ribosome binding site binding 3175 3179 ribosome binding site binding 4023 4027 ribosome binding site binding 4381 4385 ribosome binding site binding 1720 1728 methyltryptophan resistance protein binding site BASE COUNT 1512 a 939 c 1205 g 1224 t ORIGIN 1 gaattctcgg aggcatctca tcaggtgcgg caatttatgc tgctcttcaa gtagcaaaaa 61 aactcggaaa aggcaaaaaa gtattggcca tcattccaag taacggcgaa cgttatctaa 121 gcacgccgct gtaccaattc gattaaaaaa agccaaaact cccggttcgc cgggagtttt 181 tttatatttc gtgcatcaaa tatgccatgt ctgcgacgaa tgttattctt ggaagagaaa 241 ttcacttttt tcactaacaa cattgcttta caattaaaaa caagtaaaaa agaaaataga 301 ggaatgatac aaatggcaca acgcagaccg gcaggcaaaa aaataccttt tcaaaaagac 361 tcattcttac aacaatttga gaaacttgcg caatcccgga aacatcatgt acttctcgaa 421 agtgcaagag gcggcagata tagtatagcc ggtcttgatc caattgcgac tgtgaaagga 481 aaagacggaa taactacaat taagcatggt gatgagatgc tgtttaaaga aggtgatcca 541 ttacgggcct tccacagctg gtttaaaaca ctggaaacag aaacgaatca tgagttccct 601 gactttcaag gcggggcaat cgggtttctc agctatgatt acgcacggta cattgaaaat 661 tttaaaatgc tctcattaga tgatttagaa acaccagata tttattttct tgtttttgat 721 gatatagcag tttatgacca tcaagaagag tctctatggc tgattactca tgttaatggt 781 tctgatcagg aaacagcgga tgtgaagcta tctgagttag agcagatgtg gttgactgag 841 cttcccgctg tcacttcgcg agagatgaag cctgaaacag ctggttcttt cgcggcgcca 901 tttaccgagg atgggttctc acaagctgta gagaaaatca aacaatacat tgccagcgga 961 gatgtgtttc aagtcaatct atcaataagg cagtcacagt cactgtctgt ccacccatat 1021 caaatttaca aaaccttgag agaagtaaat ccttctcctt atatggcgta tttagaaaca 1081 cctgatttcc aaatcatttg cggatcgcct gaactgcttg tcagcaaaaa gggcaagcta 1141 ttagagacga gaccgattgc gggcacccgt tccagaggga aaacaaatga agaagacgag 1201 gcgcttgcaa acgaattgat acacaatgaa aaagaacgcg cggaacatgt catgctggtt 1261 gatcttgagc gaaatgatct gggaagagta tcacgttacg ggtctgtgcg cgtaaatgaa 1321 ttcatggcaa ttgaaaaata ctcgcatgtg atgcacattg tgtctaatgt ccaaggtgaa 1381 ctgcaggatg ggtatgatgc tgtagatatt attcatgctg tgtttcccgg aggaaccatt 1441 actggtgcac cgaaagtaag aacgatggaa attatagaag aacttgagcc gacacgccga 1501 gggctttata ctggatctat aggatggttt ggatataatc acgatctgca gtttaatatc 1561 gtcattcgaa ccatttatgc aaccggaggg caggcattta tgcagtccgg tgcaggagtt 1621 gtgattgatt ctgttccgaa gcacgaatac aaggaatcat tcaaaaaagc ttttgcgatg 1681 caaagagcat tagagctgag cgaagaagag acaaaaatta gatgaggtga gcggagaaat 1741 gattttaatg attgataact acgattcatt cacgtacaac ttggtacagt atttgggcga 1801 gcttggggaa gagctggttg tgaaacgcaa tgacagcatc acaatcgatg aaattgaaga 1861 actgtctccg gactttctga tgatatctcc cggaccgtgc agccctgatg aggcgggaat 1921 cagcctcgaa gcaattaaac atttcgcagg gaaaattcct attttcggtg tatgtctcgg 1981 acatcagtcc atcgcacaag tgttcggtgg tgatgttgtt agggcagaac ggcttatgca 2041 cgggaaaacc tcggatatcg agcatgacgg caaaaccatt tttgaagggt tgaaaaatcc 2101 ccttgttgcg acgcgatacc actcgctgat cgtaaaacct gagacgctgc caagctgttt 2161 tacagtaaca gcacaaacga aagaaggaga aatcatggct attcgccaca atgacctccc 2221 gatagagggt gtgcaatttc acccagagtc tattatgacc tcctttggga aagaaatgct 2281 cagaaatttt attgagacat atcgcaagga agttattgcg tgatgatata tgtgaacggc 2341 cggtatatgg aggagaaaga tgcagttctt tctccttttg accatgggtt tttatatggg 2401 atcggtgttt tcgaaacctt caggctctac gaaggctgcc cgtttttgct tgattggcat 2461 atagagaggc ttgaacgcgc gcttaaggat ctgcaaatcg aatatactgt ttcaaagcat 2521 gagatccttg agatgctgga caagctgctt aagctaaatg atatcaagga tggcaatgcc 2581 cgcgtcagac tgaacatatc agcgggaatc agcgataaag gctttgtcgc ccagacgtac 2641 gataagccca ccgttttatg ctttgtgaat cagctgaaac cagaaagtct tcctttgcaa 2701 aaagaaggaa aggttttatc gatacggaga aacactccag agggatcgtt ccggttaaaa 2761 tctcatcatt atctaaataa tatgtatgcg aaacgggaaa ttggaaatga tccgcgtgtc 2821 gagggaattt tcttaacgga ggatggagct gttgcagagg gtatcatatc taatgtcttc 2881 tggagaaaag gtcgctgtat ctatacgcca tcgcttgata ccggtattct ggacggcgtc 2941 acgagacgtt ttatcataga aaatgcaaag gatatcggcc tggagctcaa aacgggccga 3001 tatgagctag aagctctctt gacagctgac gaagcgtgga tgacgaactc ggttcttgaa 3061 atcatcccgt ttactaaaat agaagaagta aactatggaa gccaaagcgg agaagcaacc 3121 tccgctcttc aattgttata caaaaaagaa ataaagaaca tgattcatga aaagggagga 3181 agagcatggc gcagcacaca atagatcaaa cacaagtaat ccacactaag cccagcgctt 3241 tatcatataa agagaagacg ctggtgatgg gaattttaaa cgtaacgcct gactctttct 3301 cggacggcgg aaaatatgac agcttggaca aggcgctgct gcacgcgaaa gagatgatcg 3361 atgatggtgc ccatatcatt gatattggag gggaatcgac aaggcctggc gctgagtgcg 3421 tatctgagga tgaggagatg tccagagtca ttccggtgat tgagcggatt acgaaagagc 3481 ttggtgttcc tatttctgta gacacgtaca aggcttctgt cgcagatgaa gcagtgaaag 3541 ccggtgcatc cattatcaat gatatttggg gagccaaaca tgatccgaag atggcttccg 3601 ttgcagctga acataatgtt ccaattgtac tcatgcataa ccgccctgaa agaaactaca 3661 atgacttatt gccggatatg ctgtcggact taatggagag tgtaaaaatt gctgttgagg 3721 ccggagtaga cgagaagaac attattcttg atcctggtat cggtttcccg aaaacctatc 3781 acgataactt ggcagtgatg aacaaactag agattttcag cggattggga tatccggttc 3841 ttctggcaac ctcccgaaaa agattcatcg gacgtgttct ggatcttccg cctgaggagc 3901 gggctgaggg cacaggcgcg actgtgtgtc tcggcattca aaaaggctgt gacattgtca 3961 gggtccatga tgtaaagcaa attgccagaa tggcgaaaat gatggacgcg atgctgaata 4021 agggaggggt gcaccatgga taaagtttat gtagaaggta tggagtttta cggatatcac 4081 ggtgtgttca cagaagaaaa caaacttggc cagcggttta aagtcgattt aaccgctgag 4141 ctggatttaa gcaaagctgg acagacagac gaccttgagc aaacgatcaa ctatgctgag 4201 ctctatcacg tatgtaaaga tatcgtggaa ggggagcctg tgaaattggt ggaaacgctg 4261 gcggaacgta ttgctggcac tgttctcgga aaatttcagc ctgttcagca atgtacggtg 4321 aaagtgatta agccagaccc gccaattccc ggacactata aatcagtagc aattgaaatt 4381 acgagaaaaa agtcatgaac aacatagctt atattgcact tggatctaat attggagata 4441 gagaaacgta tttaaggcaa gcagtggctt tactgcatca gcatgctgcg gtgacagtca 4501 ctaaagtgtc gtctatttac gaaactgacc cggtcggata cgaagatcaa gctcaatttt 4561 tgaatatggc tgttgaaatc aagacatcat tgaacccttt tgaactcctt gaactgacgc 4621 agcagataga aaatgaatta ggcagaacaa gggaagtaag atggggaccg cggacggcag 4681 accttgacat tttgttattt aatcgtgaaa atattgaaac agagcaacta attgttccgc 4741 atccgagaat gtatgagcgt ttgtttgtcc ttgcgccgct tgcggaaatt tgccagcagg 4801 ttgaaaaaga ggctacaagc gccgaaacag accaagaagg tgtaagagta tggaagcaga 4861 aatctggggt agacgaattc //
GenBank-Updates@genbank.bio.net (07/03/90)
LOCUS DROANNIX 1104 bp ss-mRNA INV 03-JUL-1990 DEFINITION D.melanogaster annexin IX mRNA, 3' end. ACCESSION M34068 J05501 KEYWORDS annexin IX. SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-6. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1104) AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and Suedhof,T.C. TITLE Two novel annexins from Drosophila melanogaster: Cloning, characterization and differential expression in development JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.C.Suedhof, 04-MAY-1990. FEATURES from to/span description pept < 1 891 annexin IX (AA at 1) signal 1091 1096 poly-A signal BASE COUNT 281 a 300 c 306 g 217 t ORIGIN 1 attctgcgca aggcgatgaa gggcttcggc accgacgaga aggccatcat cgagatcctg 61 gccaggcgtg gcatcgtcca gcgtttggag atcgctgagg cgttcaagac ctcgtacggc 121 aaggatctga tctcggacct caagtccgag ctgggcggca agttcgagga tgttatcctg 181 gctctgatga cgccgctgcc ccagttctat gcccaggagc tgcacgacgc catctcggga 241 ctgggaaccg acgaggaggc catcatcgag atcctctgca cgctgtccaa ctacggcatt 301 aagaccattg cccagttcta cgagcagagc ttcggcaagt ccctagagtc cgacctaaag 361 ggcgacacca gtggccactt caagcggctg tgcgtctcgc tcgtccaggg caaccgggat 421 gagaaccagg gcgtggacga ggccgcggcc atcgccgatg cccaggctct gcacgacgcc 481 ggtgagggac agtggggcac agatgagtcc accttcaact cgatcctgat cacccgctcc 541 taccagcagc tgcgccagat cttcctcgaa tacgagaatc tgtcgggcaa cgacatcgag 601 aaggccatca agcgggagtt tagcggctcc gtggagaagg gtttcctggc catcgtcaag 661 tgctgcaagt ccaagatcga ctacttttcg gagcgcctgc acgactccat ggccggcttg 721 ggcaccaagg acaagacgct gatccgcatc atcgtcagcc ggtcggagat cgatctgggt 781 gacatcaagg aggcattcca gaacaagtac ggcaagagct tggagtcctg gatcaaggag 841 gatgccgaga ccgatattgg atacgtcctg gtcactctta cggcttggta gacggaagca 901 gccggaatat ccgaatatct atgagcaata ccccactgtt caagtagaaa atgccaaaaa 961 aaaaaacgtt gcatttcccc aaaaaaaagt ataacaaaag cgaagaacaa atggagttgg 1021 tctatataca gtagttgtga tgtgttctaa aaatccaatc tacaaaacgc ttagtatttt 1081 ccctctgtgc aataatcgga attc // LOCUS DROANNX 1192 bp ss-mRNA INV 03-JUL-1990 DEFINITION D.melanogaster annexin X mRNA, complete cds. ACCESSION M34069 J05501 KEYWORDS annexin X. SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-16. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1192) AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and Suedhof,T.C. TITLE Two novel annexins from Drosophila melanogaster: Cloning, characterization and differential expression in development JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.C.Suedhof , 04-MAY-1990. FEATURES from to/span description pept 91 1056 annexin X signal 1175 1184 poly-A signal BASE COUNT 271 a 348 c 363 g 210 t ORIGIN Chromosome 93B or 19A-4,7. 1 gaattccaaa agtcccagga gaaagactga ttcgtgtgaa gtcgtctact gaagagccac 61 aaggaaccca aggaatcttc cagctgcata atggaataca aacccgtgcc cacggttaag 121 gacgcagctc ccttcgacgc ctcccaggac gcccaggtgc tgcgggcggc gatgaaggga 181 ttcggcaccg acgagcagga aatcatcgac gtgctcgtcg gcaggagcaa ccagcagagg 241 cagacgatca aggcggttta cgaagcggag ttcgagcgcg acctggtgga cgatcttaag 301 gacgagctgg gaggcaagtt cgaggacgtg atcgtgggtc taatgatgcc accagtggag 361 tacctgtgca agcaactgca cgccgccatg gcgggcatcg gaaccgagga ggccacgctc 421 gtcgagatcc tgtgcaccaa gaccaacgag gagatggccc agatcgtggc cgtctacgag 481 gagcgctacc agcgcccgct ggccgagcag atgtgcagcg agacctccgg ctttttccgc 541 cgcctgctca cgctgatcgt gaccggagta cgtgacggac tggacacgcc cgtcgacgtc 601 ggtcaggcca aggagcaggc cgcccagctc tactcggccg gcgaggccaa gctgggaacg 661 gacgaggagg tcttcaaccg gatcatgtcg cacgccagct tcccgcagct gcgacttgtc 721 ttcgaggagt acaaggtgct ctccgggcag accatcgagc aggccatcaa gcacgagatg 781 tccgacgagc tgcacgaggc catgatggcc atagttgagt gcgtccagtc accggcggcc 841 ttcttcgcca accgcctcta caaggccatg aatggcgccg gcaccgatga cgccacgctc 901 atccgcatca tcgtcagccg ctcggagatc gacctggaga ccattaagca ggagttcgag 961 cggatctaca accgtacgct gcacagcgcc gtggtggacg cggagacctc tggtgactac 1021 aagcgggccc tgacagccct acttggatcc gcctaggccc gaggatgtgg cagctggtcc 1081 gcccaatatt ttattcgtgt taatagcttt gatcgtagtg tgccttttag gaaaatcgct 1141 tttaatgtcg tctgcgcatg cgcacactgt tggcaataaa taaacggaat tc // LOCUS NEUMPPX 2038 bp ss-mRNA PLN 03-JUL-1990 DEFINITION N.crassa matrix processing peptidase (MPP) mRNA, complete cds. ACCESSION J05484 KEYWORDS matrix processing peptidase. SOURCE N.crassa, cDNA to mRNA. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 2038) AUTHORS Schneider,H., Arretz,M., Wachter,E. and Neupert,W. TITLE Matrix processing peptidase of mitochondria: Structure-function relationships JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by W.Neupert, 17-APR-1990. FEATURES from to/span description pept 41 1774 matrix processing peptidase BASE COUNT 439 a 626 c 552 g 421 t ORIGIN 1 cccacattac gctgccgcat cacaattcct tgttgcagcc atgctgaatc gcttccggcc 61 agcgcggcta gtagcccaat cctccagatg cttgcccttg acgagggcgc gggcaggtcc 121 cttgcccgtt aacaatgcca ggactttggc tacgagagcc gctgctgtca acaccaagga 181 accgaccgaa cgcgacaaca tcaccactct ctccaatggt gtccgtgtcg cttccgagga 241 ccttcccgat gccttctccg gtgtaggtgt ctacatcgac gcggggtccc gatatgagaa 301 cgactatgtc cggggtgcca gtcacatcat ggaccggcta gccttcaagt ctacaagtgc 361 gaggactgcg gacgaaatgc tcgaaactgt tgagaagctc ggtggtaaca ttcagtgcgc 421 ttcttcgcgc gagtctatga tgtaccaggc ggccaccttc aacaaggcta ttcccaccgc 481 tgttgagctc atggccgaga ccatccgcga tcccaagctt acggacgagg agctggaggg 541 acagatcatg acggcgcaat atgaggtcaa cgagatctgg tccaaggccg aactgatcct 601 gcccgagttg gtgcacatgg ctgccttcaa ggacaacact cttggcaacc cgttgctttg 661 tcccaaggag aggttggatt acatcaaccg ggatgtcatc caaacatacc gcgacgcttt 721 ctacaggccc gagcgccttg ttgttgcctt tgctggtgtg cctcatgaga gggccgtcaa 781 gctcgcagag aagtactttg gtgatatgaa ggcctccgat gctcccggtc tctcgaggac 841 aggttccgaa acctccgtcg actcgctagt gtccgagtcc agcgaggcct cgagtgaatc 901 ttcatcatcc tcctcggact cttccgagtc gagtggcggg ctgctctcca agcttttctc 961 tcccaaggcc aagaaagcca cccccaaccc cttcctcacc cgggtaccta ttagcaccga 1021 agacttgact cggcctgctc actacacagg cggtttcctc accctcccat cacagccccc 1081 accgctcaac cccaaccttc ccacatttac tcacatacag ctcgccttcg agggcctcgc 1141 catctcggac gacgacatct acgccctcgc caccctgcag accctcctcg gcggcggcgg 1201 ctccttctct gccggcggtc ccggcaaggg catgtactcg cgtctctaca ctaacgttct 1261 caaccagcac ggctgggttg agtcctgcgt ggccttcaac cactcataca cggactcggg 1321 tctcttcggc atcgccgcct cgtgctaccc gggtcgcacc ctgcccatgc tccaggtcat 1381 gtgccgcgag ctgcacgccc tcaccaccga ccatggctac tcggccctgg gcgagctcga 1441 ggtttcgcgc gccaagaacc agctccgcag cagcctcctg atgaacctcg agagccgcat 1501 ggtcgagctc gaggatctgg gccgccaagt tcaggttcac ggtcgcaaga tcccggtccg 1561 cgagatgacg cgccgtatca acgagctgac ggtcaaggac ctccgaaggg tcgctaagcg 1621 cgtggttggt ggcatggcga ataacgccgg ccagggaagc ggtgcgccga cggtggtgct 1681 gcaggaggcg acggtgcaag gactcaagac tacggagctg gggtgggatc agatccagga 1741 tacaattgct cagtggaagc tcggtagacg gtaaacgttt gtcaagggga aaaaaagagt 1801 agggcgtgga gaagttatgt aagaggagcg ctgtattgaa cttggcgaca cgcacacacc 1861 ggaacgataa aggcgtttta ggttccccac gagcataggg aagaggctag atggttgctc 1921 tgtacaatcg caacttttct tggtgagtta tacaagatgt gtccaggtac atctttgcct 1981 taccatactg tacgatagca atgaagattt tctgatatat caaaagtcaa aagtcaaa // LOCUS HUMCYP2DG 5503 bp ds-DNA PRI 03-JUL-1990 DEFINITION Human debrisoquine 4-hydroxylase mutant allele (CYP2D6-MA1) gene, complete cds. ACCESSION M33189 KEYWORDS debrisoquine 4-hydroxylase. SOURCE Human individual MAGA DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5503) AUTHORS Gonzalez,F.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Gonzalez, 23-MAR-1990, for release after publication. Author address: F.Gonzalez National Cancer Institute Bldg. 37 Rm. 3E-24 National Institute of Health Bethesda, Md 20892 FEATURES from to/span description pept 814 993 debrisoquine 4-hydroxylase, exon 1 1696 1877 debrisoquine 4-hydroxylase, exon 2 2419 2571 debrisoquine 4-hydroxylase, exon 3 2661 2820 debrisoquine 4-hydroxylase, exon 4 3254 3430 debrisoquine 4-hydroxylase, exon 5 3621 3762 debrisoquine 4-hydroxylase, exon 6 3970 4157 debrisoquine 4-hydroxylase, exon 7 4612 4753 debrisoquine 4-hydroxylase, exon 8 4852 5030 debrisoquine 4-hydroxylase, exon 9 pre-msg 726 5103 debrisoquine 4-hydroxylase mRNA and introns IVS 994 1695 debrisoquine 4-hydroxylase intron A IVS 1878 2418 debrisoquine 4-hydroxylase intron B IVS 2572 2660 debrisoquine 4-hydroxylase intron C IVS 2821 3253 debrisoquine 4-hydroxylase intron D IVS 3431 3620 debrisoquine 4-hydroxylase intron E IVS 3763 3969 debrisoquine 4-hydroxylase intron F IVS 4158 4611 debrisoquine 4-hydroxylase intron G IVS 4754 4851 debrisoquine 4-hydroxylase intron H signal 689 702 TATA box BASE COUNT 1066 a 1537 c 1851 g 1049 t ORIGIN Chromosome 22. 1 ggctgggaag tggggtactt ggtgccgggt ctgtatgtgt gtgtgactgg tgtgtgtgag 61 agagaatgtg tgccctaagt gtcagtgtga gtctgtgtat gtgtgaatat tgtctttgtg 121 tgggtgattt tctgcgtgtg taatcgtgtc cctgcaagtg tgaacaagtg gacaagtgtc 181 tgggagtgga caagagatct gtgcaccatc aggtgtgtgc atagcgtctg tgcatgtcaa 241 gagtgcaagg tgaagtgaag ggaccaggcc catgatgcca ctcatcatca ggagctctaa 301 ggccccaggt aagtgccagt gacagataag ggtgctgaag gtcactctgg agtgggcagg 361 tgggggtagg gaaagggcaa ggccatgttc tggaggaggg gttgtgacta cattagggtg 421 tatgagccta gctgggaggt ggatggccgg gtccactgaa accctggtta tcccagaagg 481 ctttgcaggc ttcaggagct tggagtgggg agagggggtg acttctccga ccaggcccct 541 ccaccggcct accctgggta agggcctgga gcaggaagca ggggcaagaa cctctggagc 601 agcccatacc cgccctggcc tgactctgcc actggcagca cagtcaacac agcaggttca 661 ctcacagcag agggcaaagg ccatcatcag ctccctttat aagggaaggg tcacgcgctc 721 ggtgtgctga gagtgtcctg cctggtcctc tgtgcctggt ggggtggggg tgccaggtgt 781 gtccagagga gcccatttgg tagtgaggca ggtatggggc tagaagcact ggtgcccctg 841 gccgtgatag tggccatctt cctgctcctg gtggacctga tgcaccggcg ccaacgctgg 901 gctgcacgct actcaccagg ccccctgcca ctgcccgggc tgggcaacct gctgcatgtg 961 gacttccaga acacaccata ctgcttcgac caggtgaggg aggaggtcct ggagggcggc 1021 agaggtgctg aggctcccct accagaagca aacatggatg gtgggtgaaa ccacaggctg 1081 gaccagaagc caggctgaga aggggaagca ggtttggggg acttcctgga gaagggcatt 1141 tatacatggc atgaaggact ggattttcca aaggccaagg aagagtaggg caagggcctg 1201 gaggtggagc tggacttggc agtgggcatg caagcccatt gggcaacata tgttatggag 1261 tacaaagtcc cttctgctga caccagaagg aaaggccttg ggaatggaag atgagttagt 1321 cctgagtgcc gtttaaatca cgaaatcgag gatgaagggg gtgcagtgac ccggttcaaa 1381 ccttttgcac tgtgggtcct cgggcctcac tgctcaccgg catggaccat catctgggaa 1441 tgggatgcta actggggcct ctcggcaatt ttggtgactc ttgcaaggtc atacctgggt 1501 gacgcatcca aactgagttc ctccatcaca gaaggtgtga cccccacccc cgccccagga 1561 tcaggaggct gggtctcctc cttccacctg ctcactcctg gtagccccgg gggtcgtcca 1621 aggttcaaat aggactagga cctgtagtct ggggggatcc tggcttgaca agaggccctg 1681 accctccctc tgcagttgcg gcgccgcttc ggggacgtgt tcagcctgca gctggcctgg 1741 acgccggtgg tcgtgctcaa tgggctggcg gccgtgcgcg aggcgatggt gacccgcggc 1801 gaggacacgg ccgaccgccc gcctgtgccc atcacccaga tcctgggttt cgggccgcgt 1861 tcccaaggca agcagcggtg gggacagaga cagatttccg tgggacccgg gtgggtgatg 1921 accgtagtcc gagctgggca gagagggcgc ggggtcgtgg acatgaaaca ggccagcgag 1981 tggggacagc gggccaagaa accacctgca ctagggaggt gtgagcatgg ggacgagggc 2041 ggggcttgtg acgagtgggc ggggccactg ccgagacctg gcaggagccc aatgggtgag 2101 cgtggcgcat ttcccagctg gaatccggtg tcgaagtggg gggcggggac cgcacctgtg 2161 ctgtaagctc agtgtgggtg gcgcggggcc cgcggggtct tccctgagtg caaaggcggt 2221 cagggtgggc agagacgagg tgggcaaagc cctgccccag ccaagggagc aaggtggatg 2281 cacaaagagt gggccctgtg accagctgga cagagccagg gactgcggga gaccaggggg 2341 agcatagggt tggagtgggt ggtggatggt ggggctaatg ccttcatggc cacgcgcacg 2401 tgcccgtccc acccccaggg gtgttcctgg cgcgctatgg gcccgcgtgg cgcgagcaga 2461 ggcgcttctc cgtctccacc ttgcgcaact tgggcctggg caagaagtcg ctggagcagt 2521 gggtgaccga ggaggccgcc tgcctttgtg ccgccttcgc caaccactcc ggtgggtgat 2581 gggcagaagg gcacaaagcg ggaactggga aggcggggga cggggaaggc gaccccttac 2641 ccgcatctcc cacccccaag acgccccttt cgccccaacg gtctcttgga caaagccgtg 2701 agcaacgtga tcgcctccct cacctgcggg cgccgcttcg agtacgacga ccctcgcttc 2761 ctcaggctgc tggacctagc tcaggaggga ctgaaggagg agtcgggctt tctgcgcgag 2821 gtgcggagcg agagaccgag gagtctctgc agggcgagct cccgagaggt gccggggctg 2881 gactggggcc tcggaagagc aggatttgcg tagatgggtt tgggaaagga cattccagga 2941 gaccccactg taagaagggc ctggaggagg aggggacatc tcagacatgg tcgtgggaga 3001 ggtgtgcccg ggtcaggggg caccaggaga ggccaaggac tctgtacctc ctatccacgt 3061 cagagatttc gattttaggt ttctcctctg ggcaaggaga gagggtggag gctggcactt 3121 ggggagggac ttggtgaggt cagtggtaag gacaggcagg ccctgggtct acctggagat 3181 ggctggggcc tgagacttgt ccaggtgaac gcagagcaca ggagggattg agaccccgtt 3241 ctgtctggtg taggtgctga atgctgtccc cgtcctcctg catatcccag cgctggctgg 3301 caaggtccta cgcttccaaa aggctttcct gacccagctg gatgagctgc taactgagca 3361 caggatgacc tgggacccag cccagccccc ccgagacctg actgaggcct tcctggcaga 3421 gatggagaag gtgagagtgg ctgccacggt ggggggcaag ggtggtgggt tgagcgtccc 3481 aggaggaatg aggggaggct gggcaaaagg ttggaccagt gcatcacccg gcgagccgca 3541 tctgggctga caggtgcaga attggaggtc atttgggggc taccccgttc tgtcccgagt 3601 atgctctcgg ccctgctcag gccaagggga accctgagag cagcttcaat gatgagaacc 3661 tgcgcatagt ggtggctgac ctgttctctg ccgggatggt gaccacctcg accacgctgg 3721 cctggggcct cctgctcatg atcctacatc cggatgtgca gcgtgagccc atctgggaaa 3781 cagtgcaggg gccgagggag gaagggtaca ggcgggggcc catgaacttt gctgggacac 3841 ccggggctcc aagcacaggc ttgaccagga tcctgtaagc ctgacctcct ccaacatagg 3901 aggcaagaag gagtgtcagg gccggacccc ctgggtgctg acccattgtg gggacgcatg 3961 tctgtccagg ccgtgtccaa caggagatcg acgacgtgat agggcaggtg cggcgaccag 4021 agatgggtga ccaggctcac atgccctaca ccactgccgt gattcatgag gtgcagcgct 4081 ttggggacat cgtccccctg ggtgtgaccc atatgacatc ccgtgacatc gaagtacagg 4141 gcttccgcat ccctaaggta ggcctggcgc cctcctcacc ccagctcagc accagcccct 4201 ggtgatagcc ccagcatggc tactgccagg tgggcccact ctaggaaccc tggccaccta 4261 gtcctcaatg ccaccacact gactgtcccc acttgggtgg ggggtccaga gtataggcag 4321 ggctggcctg tccatccaga gcccccgtct agtggggaga caaaccagga cctgccagaa 4381 tgttggagga cccagcgcct gcagggagag ggggcagtgt gggtgcctct gagaggtgtg 4441 actgcgccct gctgtggggt cggagagggt actgtggagc ttctcgggcg caggactagt 4501 tgacagagtc cagctgtgtg ccaggcagtg tgtgtccccc gtgtgtttgg tggcaggggt 4561 cccagcatcc tagagtccag tccccactct caccctgcat ctcctgccca gggaacgaca 4621 ctcatcacca acctgtcatc ggtgctgaag gatgaggccg tctgggagaa gcccttccgc 4681 ttccaccccg aacacttcct ggatgcccag ggccactttg tgaagccgga ggccttcctg 4741 cctttctcag caggtgcctg tggggagccc ggctccctgt ccccttccgt ggagtcttgc 4801 aggggtatca cccaggagcc aggctcactg acgcccctcc cctccccaca ggccgccgtg 4861 catgcctcgg ggagcccctg gcccgcatgg agctcttcct cttcttcacc tccctgctgc 4921 agcacttcag cttctcggtg cccactggac agccccggcc cagccaccat ggtgtctttg 4981 ctttcctggt gaccccatcc ccctatgagc tttgtgctgt gccccgctag aatggggtac 5041 ctagtcccca gcctgctccc tagccagagg ctctaatgta caataaagca atgtggtagt 5101 tccaactcgg gtcccctgct cacgccctcg ttgggatcat cctcctcagg gcaaccccac 5161 ccctgcctca ttcctgctta ccccaccgcc tggccgcatt tgagacaggg gtatgttgag 5221 gctgagcaga tgtcagttac ccttgcccat aatcccatgt cccccactga cccaactctg 5281 actgcccaga ttggtgacaa ggactacatt gtcctggcat gtggggaagg ggccagaatg 5341 ggctgactag aggtgtcagt cagccctgga tgtggtggag agggcaggac tcagcctgga 5401 ggcccatatt tcaggcctaa ctcagcccac cccacatcag ggacagcagt cctgccagca 5461 ccatcacaac agtcacctcc cttcatatat gacaccccaa aac // LOCUS CHKCOLCARB 1394 bp ss-mRNA VRT 03-JUL-1990 DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan mRNA, 5' end, clone 7 and 13. ACCESSION M28659 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. SOURCE Chicken 17 day old embryo cartilage, cDNA to mRNA, clones 7 and 13. ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1394) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. FEATURES from to/span description pept 147 > 1394 alpha(IX) collagen-proteoglycan BASE COUNT 372 a 341 c 358 g 323 t ORIGIN 1 tccctccccg ctgactgcgt ggggcaggag gagcattctg cacccattca tactctcgtt 61 aacaggactt atgacaggga accagagagt gtgaatatat acaccaaata ttcacatgtg 121 agacgtgaag aaaaccagca gagaagatga aaagcaactg gaaaattaca gctttcttgt 181 atatgtgtag ttttctgggg tctttcatct cagctaccta ccagcaacaa tcaagattgc 241 cagtcattct gggtgctcgt caaagaactg atctctgccc aacaatcagg attggcgaag 301 atgacttgcc aggctttgac ctgatttctc agttccagat agaaaaagct gcttctcaag 361 gaattgtcca gagagtagtg ggttctactg ctctacaagt ggcttataaa ttgggaccca 421 atgtagactt caggattcca accagtgcaa tatattccaa tggattgcct gatgaatact 481 cctttcttac tacttttcgg atgactggag ccacacttca gaaatactgg actatttggc 541 agattcagga ttcttcagga aaagaacaag ttggagtgaa tctcaatggt ccaatgaaaa 601 gcgttgagtt ttcttataaa ggagtggatg gaagtctcca gactgcatca tttttacatt 661 tgcctttctt gtttgattcc caatggcaca agcttatgat aagtgtggaa acaaccagcg 721 ttacactttt tattgactgt ataaaggtag aaaccctaaa cataaaacca aaggggaaaa 781 tcagtgttga tggcttctca gtgcttggaa gactcaaaaa taatcctcaa atttcagttc 841 cgtttgaagt ccagtggatg ccgattcact gcgatcccct gcggccccag agagaaggtt 901 gtggtgagct cccagcccgg ataagccaga cagtgattga gagaggtctt cctggtccac 961 caggcccccc aggtccacca gggccaccag gagttcctgg cattgatggc atcgatggag 1021 agagaggacc taacggcccc cccggtccac cgggtccgga cggcgacgca ggcaaagcgg 1081 gatccccggg cctgcctgga gagccaggag ctgatgggtt aacaggccct gatggatcac 1141 caggtgccac aggaccgaaa ggacagaagg gtgagccagg acctccaggt gctcgtggac 1201 ttccgggcaa gggtcttctt ggaccacccg gtccagctgg tgctgcagga cttcccggtg 1261 aagtaggccg tgctggccca cctggtgatc caggaaaaag gggaccacca ggaccaccag 1321 gaccaccagg ccctcgagga acaattggtc tgcaagacgg tgacccattg tgtcccaatg 1381 cttgtccacc tggc // LOCUS CHKCOLCARC 776 bp ss-mRNA VRT 03-JUL-1990 DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan mRNA, 5' end, clone YM43. ACCESSION M28660 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. SOURCE Chicken 17 day old embryo cartilage, cDNA to mRNA, clone YM43. ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 776) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. FEATURES from to/span description pept 255 > 776 alpha(IX) collagen-proteoglycan BASE COUNT 119 a 319 c 235 g 103 t ORIGIN 1 gaattcccga cacccccacc tgcatcaccc cccccccatc tcgcagtccc tcgcccccat 61 caaagcccct ttgtgccacc tccgtcgcca cccggcccca gaatagcagc acgctcacct 121 gcaggggggg tcggagccag cgcctgccct cgtcccccgc tgctccatat taatcagccc 181 cttcctcctc ctcctcctcc tcctcctcct gccggtccct ccgcagtccg acacttacag 241 ccccgctccc ggccatggcc caccgcagcc ccgcgctctg cctgctgctc ctgcacgctg 301 cctgcctctg cctggcccag ctccgggggc caccaggaga gcccggccca cgagggcccc 361 caggtccgcc aggagtgccg ggagcggatg gcattgatgg tgacaaaggc tctcccggag 421 cccccggctc cccaggtgcc aaaggggagc ccggagcccc gggtccggat gggcctccag 481 ggaagccagg cttagacggt cttacgggag ccaaagggag ccggggccca tggggggggc 541 aaggactgaa gggtcagcct ggactgccgg ggccgccggg gctccccggt ccctcgctgc 601 caggaccacc cgggctgcca ggccaggtcg gactgcccgg ggagatcgga gtgccaggac 661 ccaagggcga tcctggaccc gatggcccac ggggcccccc gggtccccca gggaaacccg 721 gccccccagg acacatccaa ggagtggagg gaagcgcaga tttcttgtgc ccgacc // LOCUS CHKCOLCOR 602 bp ss-mRNA VRT 03-JUL-1990 DEFINITION Chicken cornea alpha-1(IX) collagen-proteoglycan mRNA, 5' end. ACCESSION M28658 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. SOURCE Chicken 8 day old embryo cornea, cDNA to mRNA, clone IN212. ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 602) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. FEATURES from to/span description pept 89 > 602 alpha(IX) collagen-proteoglycan BASE COUNT 121 a 175 c 186 g 120 t ORIGIN 1 tgcagctgaa aggtgaactg ggctgtaagg cacattttgg atttctgtgt attgtagcac 61 ctaggtggct gcaaaatctg tccccactat ggcctgggct gcatggggcc ctctgcttct 121 cgggcttttc ttgcagattt tttgcctctg ccttgctcaa agaggtcttc ctggtccacc 181 aggcccccca ggtccaccag ggccaccagg agttcctggc attgatggca ttgatggaga 241 gagaggacct aacggccccc ccggtccacc gggtccggac ggcgacgcag gcaaagcggg 301 atccccgggc ctgcctggag agccaggagc tgatgggtta acaggccctg atggatcacc 361 aggtgccaca ggaccgaaag gacagaaggg tgagccagga cctccaggtg ctcgtggacc 421 tccgggcaag ggtcttcttg gaccacctgg tccagctggt gctgcaggac ttcccggtga 481 agtaggccct gctggcccac ctggtgatcc aggaaaaagg ggaccaccag gaccaccagg 541 accaccaggc cctcgaggaa caattggtct gcaagatggt gacccattgt gtcccaatgc 601 tt // LOCUS CHKCOLG1 840 bp ds-DNA VRT 03-JUL-1990 DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan gene, exon 1, and cornea alpha-1(IX) collagen-proteoglycan gene, 5' flank. ACCESSION M28662 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. SEGMENT 1 of 2 SOURCE Chicken DNA. ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 840) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. FEATURES from to/span description pept 807 / 820 cartilage alpha(IX) collagen-proteoglycan, exon 1 pre-msg 661 > 840 cartilage alpha(IX) collagen-proteoglycan IVS 821 > 840 cartilage alpha(IX) collagen-proteoglycan intron A BASE COUNT 249 a 202 c 157 g 232 t ORIGIN 1 ccacccgtga gaattcctca agtgaaaatg caaatgaaca gaaattataa attgttcaga 61 aactgagtat atgttctcca aatttctctg aacgaggccc ctctctttgg aaagtataat 121 gtgtgtgtga ataacaactg aacaacagga gtcctcttag taatgcctat gtgcattcct 181 tgaaaaggtt caagtttaag cagtaaaagt ccttttaaat aattggtttt attcagaaga 241 atcaactagg acactaccag ataggcttct ccagagacct tctgatggat aaatcaacaa 301 gaactgaaaa tatcttcttt ataggactga tgttcttttc ttgtgaaagt ttttagcttt 361 aacaccacag tgaagccacc agtttccaca aaatcccttg gtacatgtta ttattctttt 421 atctgcctca ctgaacagtg cccctgccat ttggtgactg gcatcgctta actcatatag 481 tgttaatctt tctaccctga tgtcggcata agcagcaccc ctttcttcac tctcttggct 541 tctttatatt cagctggctc cagagatccg ccctcagacc ccaccaggat acagacgtct 601 gtccagcccc cacctccttc cctttgcaag attaaaacca acccagcagc ctgcacctcc 661 ctccccgctg agtcctgcgt ggggcaggag gagcattctg cacccattca tactctcgtt 721 aacaggactt atgacaggga accagagagt gtgaatatat acaccaaata ttcacatgtg 781 agacgtgaag aaaaccagca gagaagatga aaagcaactg gtaagagaac aagtgggatt // LOCUS CHKCOLG2 840 bp ds-DNA VRT 03-JUL-1990 DEFINITION Chicken cartilage alpha-1(IX) collagen-proteoglycan gene, exons 6 and 7, and cornea alpha-1(IX) collagen-proteoglycan gene, exon 1. ACCESSION M28661 J05129 KEYWORDS IX collagen-proteoglycan; extracellular matrix protein. SEGMENT 2 of 2 SOURCE Chicken DNA, clones 13 and 26. ORGANISM Gallus domesticus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 840) AUTHORS Nishimura,I., Muragaki,Y. and Olsen,B.R. TITLE Tissue-specific forms of type IX collagen-proteoglycan arise from the use of two widely separated promoters JOURNAL J. Biol. Chem. 264, 20033-20041 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.R.Olsen, 12-SEP-1989, for release after publication. The first amino acid for the open reading frame for exon 6 which is indicated in the features as starting at nucleotide 9 could start at nucleotide 11. FEATURES from to/span description pept 636 / 707 cornea alpha(IX) collagen-proteoglycan, exon 1 pept / 9 86 cartilage alpha(IX) collagen-proteoglycan, exon 6 (AA at 9) 750 / 770 cartilage alpha(IX) collagen-proteoglycan, exon 7 pre-msg 541 > 840 cornea alpha(IX) collagen-proteoglycan mRNA and introns pre-msg < 1 > 840 cartilage alpha(IX) collagen-proteoglycan mRNA and introns IVS < 1 8 cartilage alpha(IX) collagen-proteoglycan intron E IVS 87 749 cartilage alpha(IX) collagen-proteoglycan intron F IVS 771 > 840 cartilage alpha(IX) collagen-proteoglycan intron G IVS 708 > 840 cornea alpha(IX) collagen-proteoglycan intron A BASE COUNT 181 a 214 c 222 g 223 t ORIGIN 1 cctaccagtt tgaagtccag tggatgctga ttcactgcga tcccctgcgg ccccagagag 61 aaggttgtgg tgagctccca gcccgggtga cccgcgttcc cagcctgaca gtgctgaact 121 gggctgccac taaatctatg aagttcacag gagcttcatt tttccccgtc tatgtccaga 181 gaagtctatt tcaccatacc tgactgaaat ttggtgcctt tagcaatcca gccccctgga 241 gtagcagcct tactttaact cttccatgcc ttcctatctt ttccttctca gccagtgcta 301 gggtcagagg cttttgaaag atatccctga cagcgaagag agactgctgt ctccttgcag 361 actcctgggc aacctgaggg agggaaaccc ttgcctggga ggtgagggag ggtgccaaaa 421 caacagcgag cagggcaaag ggttaaaggt actgctgtca ttcaatcctc ttcctcccag 481 ccttcagctc tcctccaatc ccacgaccct ctcccaggca gttaataagg aactgtgagg 541 ggtgccttgc agctgaaagg tgaactgggc tgtaaggcac attttggatt tctgtgtatt 601 gtagcaccta ggtggctgca aaatctgtcc ccactatggc ctgggctgca tggggccctc 661 tgcttctcgg gcttttcttg cagatttttt gcctctgcct tgctcaagta agtttattct 721 gactttatac ctgtttttct cccttacaga taagccagac agtgattgag gtaagtgtga 781 gggaagggat ggtgctgcat cgtaagggaa agggtttgga tgaagagggg ctgaaggctg // LOCUS RATIRF1A 2078 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Rat interferon regulatory factor 1 (IRF-1) mRNA, complete cds. ACCESSION M34253 KEYWORDS interferon regulatory factor 1; transcription factor. SOURCE Rat cell line Nb2-11c T-cell, cDNA to mRNA, clones 25,4b. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2078) AUTHORS Yu-Lee,L.-Y., Hrachovy,J.A., Stevens,A.M. and Schwarz,L.A. TITLE Interferon regulatory factor 1 is an immediate-early gene under transcriptional regulation by prolactin in Nb2 T cells JOURNAL Mol. Cell. Biol. 3087, 3094 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.-Y.Yu-Lee, 11-MAY-1990. FEATURES from to/span description pept 198 1184 interferon regulatory factor 1 (IRF-1) BASE COUNT 542 a 544 c 540 g 452 t ORIGIN 1 ctcgacgaag gagtaggacg agctctcact gtctgagcca aaccgaaccg ggccgagctg 61 agccgaggtc agcggtggcc agaggaaccc agcatctcgg gcatcattcg ctccgtgcac 121 gcatcgtgta cctacaccgc aactccgtgc ctcattcccg ggtaccctct gtgactcgct 181 cctgcagcaa agccaccatg cctatcactc ggatgcgaat gagaccctgg ctagagatgc 241 agattaattc caaccaaatt ccagggctga gctggatcaa taaagaagag atgatcttcc 301 agatcccatg gaagcatgct gccttgcacg gttgggatat caacaaggat gcctgtctgt 361 tccggagctg ggccattcac acaggccgat acaaagctgg ggaaaaagag ccagatccca 421 agacttggaa ggcaaacttc cggtgtgcca tgaactccct accagacatc gaggaagtga 481 aggaccagag caggaacaag ggcagctctg ctgtacgcgt gtaccggatg ctgccacccc 541 tcaccaagaa ccagaggaaa gagagaaagt ccaagtccag ccgtgacact aagagcaaaa 601 ccaagaggaa gctgtgcgga gattctagcc ctgacacctt atctgacgga ctgagcagct 661 ctactctgcc tgatgaccac agcagttaca cagctcaggg atacctgggt caggacttgg 721 acatggacag ggacattacc ccagctctgt caccgtgcgt cgtcagcagc agtctctctg 781 agtggcatat gcagatggac atcatgccag acagcaccac tgatctgtac aacttgcagg 841 tgtcgcccat gccctccacc tctgaagctg caacagatga ggatgaggaa gggaagttac 901 ctgaggacat catgaagctc tttgaacagt ctgagtggca gccgacgcac gtggatggca 961 agggatactt gctcaatgaa ccaggagccc aactctctac tgtctatgga gacttcagct 1021 gcaaggagga accagagatc gacagccctg gaggggacat cgagataggc atacagcgtg 1081 tcttcacaga gatgaagaat atggaccccg tcatgtggat ggacaccctg ctgggcaact 1141 ctaccaggcc gccctccatt caggctattc cttgtgcacc ataatttggg tccctgaccc 1201 gttcttgccc tcctgagtga gctaggtcca gcatcatggt ggctgtgata caacataaag 1261 ctaaacttcc gtggacccct tgatgtggca aaacataatc ccattgccaa gcagggaagg 1321 gaccaaacca tcctccttgg gtcagtggac tgactcttca gagcttagga ggcagggtct 1381 aagtttttca agctggtcct gactcctagg aagatggatt ggcgttctga ggttagtgtg 1441 aggcagagga cctggacgga agttaccttc tagctctttg aaagcttcat tgcttagaga 1501 gggtctcacc actgggctgg cctgggggat agaccagcgc ccacagaaga gcattgcact 1561 ggccttaggg ctggctccac actgggagac aattgcacta agtcctattc ccaaagaact 1621 gctgcccttc ccaaccgagc cctgggatgg ttctagagcc agtgaaatgt gaaggaaaaa 1681 atggggtcct gtgagggttg tctcccttag cctcagaggg attctgcctc actccctgct 1741 ccagctgtgg ggctcaggaa aaaaaaatgg cactttctct gtggactttg ccacatttct 1801 gatcagaagt gtacactaac atttctccca agtcttggcc tttgcattta tttatatagt 1861 gccttgccct gtgcctgctg tctctcctca ggcctcagca gtcctcagca ggcccaggga 1921 gggggttgtg agcgccttgg cgtgactctg aacattggaa acgccaccta actactaagt 1981 tgtgtctgat ctcgtgtgga tctgtgtaaa tatgtatatt catcttttta taaaaaccta 2041 agttgtttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HUMINTB1A 1146 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human integrin beta-1 subunit mRNA, 3' end (cytoplasmic domain). ACCESSION M34189 KEYWORDS integrin; integrin beta-1 subunit cytoplasmic domain. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1146) AUTHORS Altruda,F., Cervella,P., Tarone,G., Botta,C., Balzac,F., Stefanuto,G. and Silengo,L. TITLE A human integrin beta-1 subunit with a unique cytoplasmic domain JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by F.Altruda, 10-MAY-1990. FEATURES from to/span description pept < 1 117 integrin beta-1 subunit (AA at 1) (cytoplasmic domain) BASE COUNT 373 a 184 c 222 g 367 t ORIGIN 1 aagcttttaa tgataattca tgacagaagg gagtttgcta aatttgaaaa ggagaaaatg 61 aatgccaaat gggacacggt aagttacaaa acatccaaaa agcaaagtgg cttataaagt 121 aaatgtaata ctcctaagac ttatgtatta gctgtcaggc tgattattaa agtcctttct 181 aagtatttta ttcccccaaa agtttcttac tcaaggaatt tgcatttagt gaaaaacaga 241 aagcatccta aatatatccc attgaaacaa aacattgatt ataagcatgt atattctggt 301 tcatgtggcc gatattttta tttctttaat gattttgatc ctaaatctgc cttttcatct 361 aatgtgaagt agaatcctaa ataatgttat ctgtgtagca agctattcaa tgggaaagct 421 gcttctttct ttaaaacaaa caaacaaaaa aaaccttcag tggaaagcca aattccaaaa 481 ggttatatac caagcttgtc caactcgcag ctcgtcggcc aggacatgca gcccagaata 541 gctttgaatg tggccccaac acaaatttgt aaactttctt agaaattgta attattatta 601 ttattttttt ttggtaactt tttttaaagc tcatcagcta tcgttagtgt attttatgtg 661 tggcccaaga cagttcttct tcttgccagt gtggcccagg gaagccaaaa gattggacac 721 ccctgctata tactatatga ttccatttag aggacattct ggaaaagcaa aactgtaggg 781 gcaaaaatca gtggttgcta ggggctggaa tgggggaaag tgttgaccac agaggggcgt 841 aagggatctt ccttgggatg acttgattgt gggtggattt atgtatttga aaactcacag 901 aactatgtac tttaaaaaga tgtatgttcc tctatgaaaa ttatatctca gtaaactttg 961 gcttataaaa atcttaaaag ccctaagtga ccgaaaggtt atgttagcat tgagtgcttt 1021 gaaatatgga gtcagagggt ggggtaacca aatgttggcc tttgtgtatt catcttttga 1081 tacaagaaag caatgccaat cttcagtatt tttaaattgt aaatgaattt tgtagttccc 1141 gaattc // LOCUS NEUAMTR 5928 bp ds-DNA PLN 03-JUL-1990 DEFINITION N.crassa mating type protein gene, complete cds. ACCESSION M33876 KEYWORDS mating type protein. SOURCE N.crassa (strain 74-ORS-A) DNA. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 5928) AUTHORS Grotelueschen,J., Metzenberg,R.L. and Glass,N.L. TITLE The Neurospora crassa A mating type region JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by N.L.Glass, 26-APR-1990. FEATURES from to/span description pept 4121 4283 mating type protein, exon 1 4343 5046 mating type protein, exon 2 IVS 4284 4342 mating type protein intron A signal 3858 3906 GC signal signal 3945 3949 CAAT box signal 5319 5326 poly-A signal BASE COUNT 1470 a 1459 c 1526 g 1473 t ORIGIN 1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg 61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga 121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct 181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct 241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag 301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa 361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga 421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa 481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg 541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg 601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg 661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc 721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg 781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc 841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg 901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc 961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat 1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa 1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc 1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg 1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt 1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa 1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc 1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca 1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat 1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac 1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca 1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta 1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga 1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt 1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa 1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac 1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc 1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc 2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg 2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga 2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg 2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca 2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca 2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc 2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca 2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat 2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag 2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc 2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca 2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg 2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag 2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg 2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata 2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg 3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca 3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca 3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt 3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt 3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca 3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag 3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac 3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta 3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca 3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac 3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg 3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca 3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt 3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt 3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc 3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa 3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc 4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt 4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat 4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc 4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt 4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat 4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag 4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat 4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg 4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg 4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg 4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag 4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat 4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt 4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct 4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc 4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat 4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg 5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca 5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc 5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg 5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg 5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg 5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg 5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca 5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa 5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct 5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg 5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag 5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg 5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc 5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc 5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac // LOCUS HUMNCADH 3451 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human N-cadherin mRNA, complete cds. ACCESSION M34064 KEYWORDS N-cadherin; cell adhesion molecule; transmembrane protein. SOURCE Human muscle, cDNA to mRNA, clones lambda-[4-10,1-5,13,14]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3451) AUTHORS Walsh,F.S., Barton,C.H., Putt,W., Moore,S.E., Kesell,D., Spurr,N. and Goodfellow,P.N. TITLE The N-cadherin gene maps to human Chromosome 18 and is not linked to the E-cadherin gene JOURNAL J. Neurochem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.H.Barton, 08-MAY-1990. FEATURES from to/span description pept < 1 2247 N-cadherin (AA at 1) BASE COUNT 1041 a 703 c 724 g 983 t ORIGIN Chromosome 18. 1 gactgggtca tccctccaat caacttgcca gaaaactcca ggggaccttt tcctcaagag 61 cttgtcagga tcaggtctga tagagataaa aacctttcac tgcggatacg tgtaactggg 121 ccaggagctg accagcctcc aactggtatc ttcattctca accccatctc gggtcagctg 181 tcggtgacaa agcccctgga tcgccagcag aatgcccggt ttcatttagg ggcacatgca 241 gtagatatta atggaaatca agtggagacc cccattgaca ttgtcatcaa tgttattgac 301 atgaatgaca acagacctga gttcttacac caggtttgga atgggacagt tcctgaggga 361 tcaaagcctg gaacatatgt gatgaccgta acagcaattg atgctgacga tcccaatgcc 421 ctcaatggga tgttgaggta cagaatcgtg tctcaggctc caagcacccc ttcacccaac 481 atgtttacaa tcaacaatga gactggtgac atcatcacag tggcagctgg acttgatcga 541 gaaaaagtgc aacagtatac gttaataatt caagctacag acatggaagg caatcccaca 601 tatggccttt caaacacagc cacggccgtc atcacagtga cagatgtcaa tgacaatcct 661 ccagagttta ctgccatgac gttttatggt gaagttcctg agaacagggt agacatcata 721 gtagctaatc taactgtgac cgataaggat caaccccata caccagcctg gaacgcagtg 781 tacagaatca gtggcggaga tcctactgga cggttcgcca tccagaccga cccaaacagc 841 aacgacgggt tagtcaccgt ggtcaaacca atcgactttg aaacaaatag gatgtttgtc 901 cttactgttg ctgcagaaaa tcaagtgcca ttagccaagg gaattcagca cccgcctcag 961 tcaactgcaa ccgtgtctgt tacagttatt gacgtaaatg aaaaccctta ttttgccccc 1021 aatcctaaga tcattcgcca agaagaaggg cttcatgccg gtaccatgtt gacaacattc 1081 actgctcagg acccagatcg atatatgcag caaaaatatt taagatacac taaattatct 1141 gatcctgcca attggctaaa aatagatcct gtgaatggac aaataactac aattgctgtt 1201 ttggaccgag aatcaccaaa tgtgaaaaac aatatatata atgctacttt ccttgcttct 1261 gacaatggaa ttcctcctat gagtggaaca ggaacgctgc agatctattt acttgatatt 1321 aatgacaatg cccctcaagt gttacctcaa gaggcagaga cttgcgaaac tccagacccc 1381 aattcaatta atattacagc acttgattat gacattgatc caaatgctgg accatttgct 1441 tttgatcttc ctttatctcc agtgactatt aagagaaatt ggaccatcac tcggcttaat 1501 ggtgattttg ctcagcttaa tttaaagata aaatttcttg aagctggtat ctatgaagtt 1561 cccatcataa tcacagattc gggtaatcct cccaaatcaa atatttccat cctgcgcgtg 1621 aaggtttgcc agtgtgactc caacggggac tgcacagatg tggacaggat tgtgggtgcg 1681 gggcttggca ccggtgccat cattgccatc ctgctctgca tcatcatcct gcttatcctt 1741 gtgctgatgt ttgtggtatg gatgaaacgc cgggataaag aacgccaggc caaacaactt 1801 ttaattgatc cagaagatga tgtaagagat aacattttaa aatatgatga agaaggtgga 1861 ggagaagaag accaggacta tgacttgagc cagctgcagc agcctgacac tgtggagcct 1921 gatgccatca agcctgtggg aatccgacga atggatgaaa gacccatcca cgccgagccc 1981 cagtatccgg tccgatctgc agccccacac cctggagaca ttggggactt cattaatgag 2041 ggccttaaag cggctgacaa tgaccccaca gctccaccat atgactccct gttagtgttt 2101 gactatgaag gcagtggctc cactgctggg tccttgagct cccttaattc ctcaagtagt 2161 ggtggtgagc aggactatga ttacctgaac gactgggggc cacggttcaa gaaacttgct 2221 gacatgtatg gtggaggtga tgactgaact tcagggtgaa cttggttttt ggacaagtac 2281 aaacaatttc aactgatatt cccaaaaagc attcagaagc taggctttaa ctttgtagtc 2341 tactagcaca gtgcctgctg gaggctttgg cataggctgc aaaccaattt gggctcagag 2401 ggaatatcag tgatccatac tgtttggaaa aacactgagc tcagttacac ttgaatttta 2461 cagtacagaa gcactgggat tttatgtgcc tttttgtacc tttttcagat tggaattagt 2521 tttctgttta aggctttaat ggtactgatt tctgaaacga taagtaaaag acaaaatatt 2581 ttgtggtggg agcagtaagt taaaccatga tatgcttcaa cacgcttttg ttacattgca 2641 tttgctttta ttaaaataca aaattaaaca aacaaaaaaa ctcatggagc gattttatta 2701 tcttggggga tgagaccatg agattggaaa atgtacatta cttctagttt tagactttag 2761 tttgtttttt ttttttttca ctaaaatctt aaaacttact cagctggttg caaataaagg 2821 gagttttcat atcaccaatt tgtagcaaaa ttgaattttt tcataaacta gaatgttaga 2881 cacattttgg tcttaatcca tgtacacctt tttatttctg tatttttcca cttcactgta 2941 aaaatagtat gtgtacataa tgttttattg gcatacgtct atggagaagt gcagaaactt 3001 cagaacatgt gtatgtatta tttggactat ggattcaggt tttttgcatg tttatatctt 3061 tcgttatgga taaagtattt acaaaacagt gacatttgat tcaattgttg agctgtagtt 3121 agaatactca atttttaatt tttttaattt ttttattttt tattttcttt ttggtttggg 3181 gagggagaaa agttcttagc acaaatgttt tacataattt gtaccaaaaa aaaaaaaaaa 3241 ggaaaggaaa gaaaggggtg gcctgacact ggtggcacta ctaagtgtgt gtttttttaa 3301 aaaaaaaatg gaaaaaaaaa agcctttaaa ctggagagac ttctgacaac agctttgcct 3361 ctgtattgtg taccagaata taaatgatac acctctgacc ccagcgttct gaataaaatg 3421 ctaattttgg ataacaaaaa aaggggaatt c // LOCUS MHVNSGII 870 bp ss-RNA VRL 03-JUL-1990 DEFINITION Murine hepatitis virus non-structural protein gene-2 (NS2). ACCESSION M34035 KEYWORDS non structural protein. SOURCE Murine hepatitis virus (strain MHV-JHM), cDNA to viral RNA. ORGANISM Murine hepatitis virus A59 Unclassified. REFERENCE 1 (bases 1 to 870) AUTHORS Schwarz,B., Routledge,E. and Siddell,S.G. TITLE The coronavirus MHV 30 kDa non-structural protein NS2 is not essential for virus replication in transformed murine cells JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.G.Siddell, 04-MAY-1990. Author address: S.G.Siddell Inst of Virology Univ of Wuerzburg Versbacherstrasse 7 8700 Wuerzburg FEATURES from to/span description pept 40 837 non-structural protein-2 (NS2) BASE COUNT 269 a 147 c 187 g 267 t ORIGIN 1 gcgatagcct agtaaatgtt aaataaatct atacttgtca tggctgcgag aatggccttt 61 gctgacaagc ctaatcattt tataaacttt cctctagccc aatttagtgg ctttatgggt 121 aagtatttaa agcttcagtc tcaacttgtg gaaatgggtt tggactgtaa attacaaaag 181 gtaccacatg ttagtattac cctgcttgac attaaagcag accaatacaa acaggtggaa 241 tttgcaatac aagaaataat agatgatctg gcggcatatg agggagatat tgtctttgac 301 aaccctcata tgcttggcag atgtcttgtt cttgatgtta aaggatttga agagttgcat 361 gaagatattg ttgaaattct ccgcagaagg ggttgcactg cagatcaatc cagacaatgg 421 attccgcact gcactgtggc ccaatttgat gaagaaaaag aaataaaaga aatgcaattc 481 tattttaaat tgcccttcta tctcaagcat aacaacctac ttacggatgc taggcttgag 541 cttgtgaaga taggttcttc caaagtaggt gggttttatt gtagtgaact aagtatttgg 601 tgtggtgaga gactttgtta caagccccca acccccaaat tcagtgatat atttggctat 661 tgctgcatag ataaaatacg tggtgattta gaaataggag acctaccgcc agatgatgag 721 gaagcgtggg ccgagctaag ttaccactat caaagaaaca cctacttctt cagacatgtg 781 cacgataata gtatctattt tcgtaccgta tgtagaatga agggttgtat gtgttgattt 841 gtttttacac tattagtgta ataaacttat // LOCUS MCAMV6 1904 bp ds-DNA VRL 03-JUL-1990 DEFINITION Cauliflower mosaic virus (CaMV) gene six protein gene, complete cds. ACCESSION M23620 KEYWORDS gene six protein. SOURCE Cauliflower mosaic virus (strain D4) DNA. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (sites) AUTHORS Daubert,S. and Routh,J. TITLE Determinants of symptomatology in the DNA sequence CaMV JOURNAL mol plant microb interact (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1904; for [1]) AUTHORS Daubert,S. and Routh,J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1], [2] kindly submitte by S.Daubert, 08-AUG-1989, for release after publication. FEATURES from to/span description pept 1 1563 gene six protein signal 1629 1635 TATA box signal 1547 1550 CCAT enhancer 1 BASE COUNT 645 a 450 c 379 g 430 t ORIGIN bps 5774 to 7678 of genome. 1 atggagaaca tagaaaaact cctcatgcaa gagaaaatac taatgctaga gctcgatcta 61 gtaaaagcaa aaataagctt agcaagagct aacggctctt cgcaacaagg agaactctct 121 ctccaccgtg aaacaccgga aaaagaagaa gcagttcatt ctgcactggc cacttttacg 181 ccaacccaag taaaagctat tccagagcaa acggctcctg gtaaagaatc aacaaatccg 241 ttgatggcta gtatcttgcc aaaagatatg aattcagttc agactgaaat taggctcaaa 301 aggccatcgg acttcttacg tccttatcag ggaatttcaa tcccacaaaa atctgagctt 361 aacagcacag ttactcttca cggagtagaa tcgggtattc aacaccctca tatcaactac 421 tacgttgtgt ataacggtcc acacgccggt atatacgatg actggggttg tacaaaggcg 481 gcaacaaacg gcgttcccgg agttgcacaa aagaagtttg ccactattac agaggcaaga 541 gcagcagctg acgcatacac aacaagtcag caaacagaca ggttgaactt catccccaaa 601 ggagaagctc aactcaagcc caagagcttt gcgaaggcct taaccagccc atcaaagcaa 661 aaagcccact ggctcacgct aggaaccaaa aggcccagca gtgatccagc cccaaaagag 721 atctcctttg ccccggagat caccatggac gactttctct atctctacga tctaggaaga 781 aagttcgacg gagaaggtga cgataccatg ttcaccactg ataatgagaa gattagcctc 841 ttcaatttca gaaagaatgc tgacccacag atggttagag aggcctacgc agcaggtctc 901 atcaagacga tctacccgag caataatctc caggagatca aataccttcc caagaaggtt 961 aaagatgcag tcaaaagatt caggactaac tgcatcaaga acacagagaa agatatattt 1021 ctcaagatca gaagtactat tccagtatgg acgattcaag gcttgcttca taaaccaagg 1081 caagtaatag aaattggagt ctctaagaaa gtagttccta ctgaatcaaa ggccatggag 1141 tcaaaaattc agatcgagga tctaacagaa ctcgccgtga agactggcga acagttcata 1201 cagagtcttt tacgactcaa tgacaagaag aaaatcttcg tcaacatggt ggagcacgac 1261 actctcgtct actccaagaa tatcaaagat acagtctcag aagaccaaag ggctattgag 1321 acttttcaac aaagggtaat atcgggaaac ctcctcggat tccattgccc agctatctgt 1381 cacttcatcg aaaggacagt agaaaaggaa ggtggcacct acaaatgcca tcattgcgat 1441 aaaggaaagg ctatcattca agatgcctct accgacagtg gtcccaaaga tggaccccca 1501 cccacgagga gcatcgtgga aaaagaagac gttccaacca cgtcttcaaa gcaagtggat 1561 tgatgtgaca tctccactga cgtaagggat gacgcacaat cccactaccc ttcgcaagac 1621 ccttcctcta tataaggaag ttcatttcat ttggagagga cacgctgaaa tcaccagtct 1681 ctctctacaa gactatctct ctctattttc tccagaataa tgtgtgagta gtttcccgat 1741 aagggaatta gggttcttat agggtttcgc tcatgtgttg agcatataag aaacccttag 1801 tatgtatttg tatttgtaaa atacttctat caataaaatt tctaattcct aaaaccaaaa 1861 tccagtacta aaatccagat ctcctaaagt ccctatagat cttt // LOCUS CREAPCYN 577 bp ss-mRNA PLN 03-JUL-1990 DEFINITION C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds. ACCESSION J05524 KEYWORDS apoplastocyanin. SOURCE C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone PC6-2. ORGANISM Chlamydomonas reinhardtii Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae. REFERENCE 1 (bases 1 to 577) AUTHORS Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and Bogorad,L. TITLE Isolation and characterization of a complementary DNA clone for an algal pre-apoplastocyanin JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by K.Hill, 11-MAY-1990. FEATURES from to/span description pept 22 459 apoplastocyanin (PC6-2) precursor sigp 22 162 apoplastocyanin signal peptide matp 163 456 apoplastocyanin BASE COUNT 91 a 189 c 171 g 126 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc 61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt 121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg 181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag 241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc 301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc 361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac 421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat 481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac 541 gcagagagcg tgtagctctg taactttttc ggaattc // LOCUS RATSVPIIA 4161 bp ds-DNA ROD 03-JUL-1990 DEFINITION Rat seminal vesicle secretion II protein (SVS II) gene, complete cds. ACCESSION J05443 KEYWORDS seminal vesicle secretion II protein. SOURCE Rat (strain CHARLES RIVER) male seminal vesicle epithelial cell DNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4161) AUTHORS Harris,S.E., Harris,M.A., Johnson,C.M., Bean,M.F., Dodd,J.G., Matusik,R.J., Carr,S.A. and Crabb,J.W. TITLE Structural characterization of the rat seminal vesicle secretion II protein and gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.E.Harris, 25-APR-1990. FEATURES from to/span description pept 2065 2137 seminal vesicle secretion II protein (SVS II) precursor, exon 1 2377 3548 seminal vesicle secretion II protein precursor, exon 2 sigp 2065 2130 seminal vesicle secretion II protein signal peptide matp 2131 3545 seminal vesicle secretion II protein signal 1934 1946 CAAT box signal 2013 2020 TATA box BASE COUNT 1243 a 902 c 822 g 1194 t ORIGIN 1 tttcgatcca atgtgtggat tactcaccaa gtgtctgtct ttctttcttt ctctctttct 61 ctctttcttc ctcccttcct tccttccttc cttccttcct tccttccttc cttccttcct 121 ttctgttcaa ttgctcgttt ctcccttcat ctctcgccag tataccgcac actcaaactt 181 aaattttcat ttcaatgcgt tctcttctgg cacgtgcagc ataattacac tcatgattgt 241 caactccgtg atctgtttgc acaccttacc ccccccccca aggttttatc tgcatttaaa 301 aaaaagggat tatcaagaaa tttattctta attcagaaat gtgatcaaag ttgtcagatc 361 cgttctttac tgcctcctgt tggaaaaaaa aaatatccag ttcctggatt tttctaaaac 421 acagaaaaga gacctgggac aggggtatag gattgagcag gcatggtgag caattttata 481 ctgaatagat tcattgtgac ttaccggtct cctgagggaa ataatcactt ttcccaggta 541 gagagcagcc tagcaagaga tcagagtgca agcataaaac ccatgtgctt tataagtgta 601 tttattttat gcattttctg tttataagga catgagtgga ctttttattt gtcccttcca 661 tacaggacta cctagactat tgggatggga tgactgaaaa tatgttttca agtagacttc 721 cttccggaac taccttcata tggttctgaa ggcaaagtgg aacactgcac gggtgtcctc 781 ttctcccaag aacttggcca tggcgtcgtc gttttgagtc tatgtctgag ccacgaatgc 841 cataacagcc cttcctgtta ctctcacagt ggcacagagc tgtttctaaa caagaaggaa 901 gtcttccatc ttgtgtcagg atgctaatga cgtcaccaat ggcagtaagt gttcaccaca 961 gcccgttgct aaggcaatta tgttatccct cctgtcagag tttcctgtat taaaatatac 1021 tgagtttaat tttatgtcgg attccatgac atacattcag caaggaaacc aacagtatct 1081 tttgttcttt caacagtgat ttcctgtcac catttaactg ttgtctcgcc cccattcttt 1141 aaaatgtctc tgcacctcac cttgcctccc agatacactc ccaaactcat ttccctggac 1201 acacttgaaa tgttgctgct agcaagccac agctaccacg tcttctctgt cagggttcta 1261 gacaactcat ctaaagcagc accaggtctc tcttagaaat cagacatcgg atgtcatggt 1321 catagtatac ctcacagcta ctttggacat tcatgggccc agtattattt tccagggctg 1381 aggtttaact caagagcctc atgctcacat ggctggtggt ctggccacac agctatgact 1441 cgtctccatt tattcttcaa acttttattc ggagctccgt tgtgttctgt tgtctcctgt 1501 gcctttctat atgtgtgact gctcctttgc ctgtaaatga gaagctatgt caaattcaac 1561 gtaaaaaagg caacttcatg ggcttctgtg agatagcatg ctaaacagtg tcagctccac 1621 tacactgtga ccaggaaaat ttgatcaggc cctggttact ctcggagcat aaaagaagaa 1681 aaaaaaatct cttccccgct ctactctgga ttttgtttga aaataaaagg tccaatctgt 1741 ccttataaaa catgcataga ataaatatac tagaaaacac actttgtttg caaagggtat 1801 gtgataaagt cagagggttg ataaagattt gctgaggctt atgacataga aaaggtccct 1861 gacattgcat ccctgtgcaa agtacctggg aacattacca atgtccccaa ctgtgcagag 1921 gggaggaagt tgacatttag agataatttt tttaaaaaag caggcagtgc ttttgtagtg 1981 tcagttatat ctgtaataca tccagctaga gatatataaa tgtgaaagtc agctcagctc 2041 tcagtgaagg tccttcttga caagatgaag tcctctgtct tcattctatc tctgttcctc 2101 cttctggaaa gacaggcagc tgtggttgga cagtatggtg agtagggaga tggtgactag 2161 agggaaagtc actcagggag aatgttttta agggtgctct gggagtagca gatcctttca 2221 taggggaatt tttttttaaa tgagacctaa ttcttctcta ctgaaaacca aaacccttgt 2281 gggaacatca atggttttat gaggaaattt tggaaatgag acttggaagg actgtgcaga 2341 tcatgtaact taaaccttcc tcctctcaat taccaggtgg gacaaaaggt cacttccaga 2401 gcagctcatc agggtttatg cttggtcaga aaggccacct caattttggg ctcaaaggag 2461 gaagtgagga agcagctgaa gaaagcattt tcatgcaatc acaacaccag atgttcggcc 2521 aggatggtgg tgacatggcg cagacaagtg tttcacaaga gcatacaggt gtaaaggggg 2581 ccgcgatttg tcgtaaagga caagtatccc aattgaaatc ccaagaatcc caaataaaat 2641 cctttagaca agtaaaatcc agtggacagc tgaaatctgg aggatcccaa ttaaaatcct 2701 ttggacaagt gaaatccagt gagtcccaat taaaatcctt tggccaagtg aaagccagtg 2761 ggtcccaatt aaaatccttc ggacaagtga aagccagtgg gtcccaatta aaatcctatg 2821 gacaaatgaa atccagtggg tcccaagtga aatcctttgg acaaatgaaa tccagtgggt 2881 cccaagtaaa atcctttgga caaatgaaag ccagtgagtc ccaaataaaa tcctttggac 2941 aaagaaaatc ccaaggtggt caactacaat cctatggcca aatgaaatcc tatgggcaga 3001 cgaaatccct agaatcccag gccaaatcct tcggacaagt aaagtcccaa agtggccaaa 3061 tgaaatcctc ctatggtcag agaaaatcct atggtgaaga gactcaactg aagtctttcg 3121 accaagatgc ccaactaaaa tcctatggtc aacaaaaatc ccaaaaacaa tcctccttta 3181 gccaagtaaa atctcaaagt gcccaactaa agtcctttgg ccaacaaaaa tccctcaaag 3241 ggttttctca acaaactcaa cagaaaggat ttgccatgga tgaagatttg tcacaagtgc 3301 ggaaacaatt tgacgatgat gacctctctg tacaacagaa gtctacccaa cagatgaaaa 3361 cagaggaaga cttatcccaa tttggacaac aacgacaatt tggacaagaa cgctcccaat 3421 cctataaagg atatcttgca caatacagaa agaaattaca ggaacaacaa caacagaaaa 3481 attttaatca ggataacttt tttacaaagg gaggggcagg cctatatcag gctcaactta 3541 agggataaca tattcactga gcaactgaag accaagatca atgtcaaggt atgttccacc 3601 aagtaggaag atattatcca aatttacttg tggtatatag gaatcctgga tccattatgg 3661 attgataccc atttgttact atcagtagaa gtattgttac acacttttag aaggatgaag 3721 aacagaccct ggtaaaatga gtccttgtag agtaaaggca gagtaagcaa gctaagttaa 3781 caattggtcc tgaattacta cattcaggga gcacttttca gtgcttctct gagcacagac 3841 agtttatatt attaaatgtg taccacctat gcaatcatat ttaacatttc atgatggaat 3901 cttatttgtt cttacacttt gacttgataa aaaaaaaaga ttggtttctt gcttatattg 3961 gtataaggtg gtattgcagc tgagctcttt ctctacacca gtgcgtgttc ttgagtcccc 4021 tgggacctct gctttccatc acaatccatg gggttaagga ttagctgctt ttccatcaga 4081 tggaagattt ggttacaaag atctctgcct ggagcagaca ctatattcag ttgtatgtcc 4141 aatggtgacc ctgttgaatt c // LOCUS CODCPRRKA 94 bp ss-RNA RNA 03-JUL-1990 DEFINITION Codium fragile chloroplast 4.5S RNA, complete cds. ACCESSION M35276 M15192 KEYWORDS 4.5S RNA. SOURCE C.fragile chloroplast RNA. ORGANISM Chloroplast Codium fragile Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Brypsidales; Codiaceae; Codium fragile. REFERENCE 1 (bases 1 to 94) AUTHORS Francis,M.A., Balint,R.F. and Dudock,B.S. TITLE A novel variety of 4.5 S RNA from Codium fragile chloroplasts JOURNAL J. Biol. Chem. 262, 1848-1854 (1987) STANDARD simple staff_review FEATURES from to/span description RNA 1 94 4.5 S RNA BASE COUNT 35 a 11 c 14 g 34 t ORIGIN 1 aagtcctagt tgctataaat tcttaaatca aattatgtca gatttttaat aaaaagcagc 61 atttgtattt gaaaattgtt taggaactag gcac // LOCUS HAMAPBRBD 2339 bp ds-DNA ROD 03-JUL-1990 DEFINITION Hamster apolipoprotein (apoB) gene, partial cds (LDL receptor-binding domain). ACCESSION M35187 KEYWORDS apolipoprotein B. SOURCE Hamster DNA. ORGANISM Mesocricetus auratus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2339) AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N. TITLE Sequence of the putative low-density lipoprotein receptor-binding regions of apolipoprotein B in mouse and hamster JOURNAL Gene 87, 309-310 (1990) STANDARD simple staff_review COMMENT Phone call to T.J.Smith on 26-JUN-1990 made sequence clarifications on line 4 and line 17 of printed sequence. The hamster sequence should be 1 bp to the left on these lines. FEATURES from to/span description pept < 1 > 2339 apolipoprotein (apoB) (LDL receptor-binding domain) (AA at 1) BASE COUNT 725 a 519 c 450 g 645 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccagc ttcctcgcct ctcacacaca attgagatac ctgcttttgg cagacttcat 61 ggaatcctga aaatccagtc tcccctcttt atattagatg caaatgccaa catacagaat 121 gtaactactt tagagaacaa agcagagatt gtggcctcca tcgctgctac aggagagtcc 181 gaaattgaag ctctcaattt tgattttcaa gcacaagctc aattcttgga gctaaaccct 241 aatcctctga tcctgaagga atccatgaac ttctccagca agcatgcgag aatggagcat 301 gagggtgaga tactattttc tggaaagttc attgagggaa aattggacac ggtcgcaagt 361 ttacagacag agaaaaatat ggtggagttt aataatggta tgattgtcaa gataaacaat 421 ccaatcatcc ttgacagtca cacaaagtat tttcacaagt tgagtatccc caggctggac 481 ttctccagta aggcttcctt taacaatgaa atcaagatgc tattagaagc tggacatgta 541 gcatggactt cttcagggac tgggtcatgg aattgggcct gtcccaactt ctcagatgag 601 ggcacacatt cgtccaaaat tagcttcact gtagaaggac ccattgcttt ttttggcttg 661 tctaataaca tcaacggcaa acacctgagg gttatccaga aattggctta tgaatctggc 721 ttcctcaact attccatgtt ggaagttgag tcaaaagttg aatctcagca tgtgggttcc 781 agcattctaa ctggcaaggg aacggtactg ctcagggagg caaaggcaga aatgactggc 841 gagcacaatg ctgacttgaa tggaaaagtt attgggactt tgaaaaactc tctttccttt 901 tcagcacaac catttatgat tactgcatcc acaaataatg atgggaattt gaaagttagt 961 tttccactaa agttgactgg gaaaatagac ttcctgaata actatgcact atttttgagt 1021 cctcatgccc agcaagcaag ctggcaagtg agtgctaggt tcaatcagta caaatataat 1081 caaaattttt ctgctataaa caatgaacat aacatagaag cccatgtagg aatgaatgga 1141 gatgccaacc tggatttctt aaccatacct ctaacaattc ctgaagtgaa actaccttac 1201 atagggctca cgactccctt gctgaaggat ttctccatat gggaagaaac aggcttgaaa 1261 gaatttttga agacaacaaa gcaatcgttt gatttaagtg taaaagctca atataaaaag 1321 aacagagaca ggcattccat tgcgattcct ctgaatgggt tttatgagtt tattctcaac 1381 aatgtcgact ccgggatagg gaagattggg aaagtcagag acagcgcatt agactatctt 1441 atttcatcct ataatgaagc aaaaaacaag tttgaaaatt cccttattca gccctccagg 1501 acctttcaaa agcgtggata cactatccca tttgtcaaca ttgaagtgac tccattcact 1561 gtagagacac tggcctccag ccatgtgatc ccaaaagcaa taaatacccc cagtgttcac 1621 attctgggcc ctaatgtcat tgtgccttca tacaggttag tgctgccctc cctggagctg 1681 ccagtccttc gtgtccccag gaatctactc aagttttccc tcccagattt caaggaattg 1741 agaacaattg acaatattta tattccagct cttggcaatt ttacctatga tttttccttt 1801 aaatcaagtg tcatcacgct gaataccaac gttggacttt ataaccggtc agacatcgtt 1861 gctcatttcc tttcttcctc ttcatttgtc acggatgccc tgcagtacaa attagagggt 1921 acttcacgtc tgactcggaa aagaggattg aagctagcca cagccgactc tctcactaac 1981 aaatttgtaa agggcaatca tgatagcacc tttagcttaa ccaagaaaaa catggaagca 2041 tcagtgaaaa caactgcaaa cctccatgct cccattttaa caatgaactt caagcaggaa 2101 cttaatggaa atgccaagtc aaagcccatt gtctcatcat ccattgaact aaactatgac 2161 ttcaattcct caaagctgta ctctactgct aaaggaggtg ttgaccacaa gtttagctta 2221 gaaagtctca cttcctactt ttccattgag tcatccacca aaggaaatat caagggatct 2281 gtcctttccc aggaatattc aggaagtgtt gccagtgagg ccaacacata cctgaattc // LOCUS MUSAPBRBD 2354 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouse apolipoprotein (apoB) gene, partial cds (LDL receptor-binding domain). ACCESSION M35186 KEYWORDS apolipoprotein B. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2354) AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N. TITLE Sequence of the putative low-density lipoprotein receptor-binding regions of apolipoprotein B in mouse and hamster JOURNAL Gene 87, 309-310 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 2354 apolipoprotein (apoB) (LDL receptor-binding domain) (AA at 1) BASE COUNT 757 a 525 c 423 g 649 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccaac ttcctcacct ctcacataca attgaaatac ctgcttttgg caaactgcat 61 agcatcctta agatccaatc tcctctcttt atattagatg ctaatgccaa catacagaat 121 gtaacaactt cagggaacaa agcagagatt gtggcttctg tcactgctaa aggagagtcc 181 caatttgaag ctctcaattt tgattttcaa gcacaagctc aattcctgga gttaaatcct 241 catcctccag tcctgaagga atccatgaac ttctccagta agcatgtgag aatggagcat 301 gagggtgaga tagtatttga tggaaaggcc attgagggga aatcagacac agtcgcaagt 361 ttacacacag agaaaaatga agtagagttt aataatggta tgactgtcaa agtaaacaat 421 cagctcaccc ttgacagtca cacaaagtac ttccacaagt tgagtgttcc taggctggac 481 ttctccagta aggcttctct taataatgaa atcaagacac tattagaagc tggacatgtg 541 gcattgacat cttcagggac agggtcatgg aactgggcct gtcccaactt ctcggatgaa 601 ggcatacatt cgtcccaaat tagctttact gtggatggtc ccattgcttt tgttggacta 661 tccaataaca taaatggcaa acacttacgg gtcatccaaa aactgactta tgaatctggc 721 ttcctcaact attctaagtt tgaagttgag tcaaaagttg aatctcagca cgtgggctcc 781 agcattctaa cagccaatgg tcgggcactg ctcaaggacg caaaggcaga aatgactggt 841 gagcacaatg ccaacttaaa tggaaaagtt attggaactt tgaaaaattc tctcttcttt 901 tcagcacaac catttgagat tactgcatcc acaaataatg aaggaaattt gaaagtgggt 961 tttccactaa agctgactgg gaaaatagac ttcctgaata actatgcatt gtttctgagt 1021 ccccgtgccc aacaagcaag ctggcaagcg agtaccagat tcaatcagta caaatacaat 1081 caaaactttt ctgctataaa caatgaacac aacatagaag ccagtatagg aatgaatgga 1141 gatgccaacc tggatttctt aaacatacct ttaacaattc ctgaaattaa cttgccttac 1201 acggagttca aaactccctt actgaaggat ttctccatat gggaagaaac aggcttgaaa 1261 gaatttttga agacaacaaa gcaatcattt gatttgagtg taaaggctca atataaaaag 1321 aacagtgaca agcattccat tgttgtccct ctgggtatgt tttatgaatt tattctcaac 1381 aatgtcaatt cgtgggacag aaaatttgag aaagtcagaa acaatgcttt acattttctt 1441 accacctcct ataatgaagc aaaaattaag gttgataagt acaaaactga aaattccctt 1501 aatcagccct ctgggacctt tcaaaatcat ggctacacta tcccagttgt caacattgaa 1561 gtatctccat ttgctgtaga gacactggct tccaggcatg tgatccccac agcaataagc 1621 accccaagtg tcacaatccc tggtcctaac atcatggtgc cttcatacaa gttagtgctg 1681 ccacccctgg agttgccagt tttccatggt cctgggaatc tattcaagtt tttcctccca 1741 gatttcaagg gattcaacac tattgacaat atttatattc cagccatggg caactttacc 1801 tatgactttt cttttaaatc aagtgtcatc acactgaata ccaatgctgg actttataac 1861 caatcagata tcgttgccca tttcctttct tcctcttcat ttgtcactga cgccctgcag 1921 tacaaattag agggaacatc acgtctgatg cgaaaaaggg gattgaaact agccacagct 1981 gtctctctaa ctaacaaatt tgtaaagggc agtcatgaca gcaccattag tttaaccaag 2041 aaaaacatgg aagcatcagt gagaacaact gccaacctcc atgctcccat attctcaatg 2101 aacttcaagc aggaacttaa tggaaatacc aagtcaaaac ccactgtttc atcatccatt 2161 gaactaaact atgacttcaa ttcctcaaag ctgcactcta ctgcaacagg aggcattgat 2221 cacaagttca gcttagaaag tctcacttcc tacttttcca ttgagtcatt caccaaagga 2281 aatatcaaga gttccttcct ttctcaggaa tattcaggaa gtgttgccaa tgaagccaat 2341 gtatatctga attc // LOCUS RATBPTT 1035 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Rat beta-tachykinin mRNA, complete cds. ACCESSION M35277 M15191 KEYWORDS neurokinin A; substance P; tachykinin. SOURCE Rat (Sprague-Dawley) rostral portion of the caudate putamen, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1035) AUTHORS Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D. TITLE Three rat preprotachykinin mRNAs encode the neuropeptides substance P and neurokinin A JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987) STANDARD simple staff_review FEATURES from to/span description pept 100 492 beta-preprotachykinin sigp 100 156 beta-tachykinin signal peptide (3' end could be 171) matp 172 204 substance P matp 294 321 neurokinin mRNA < 1 1035 beta-preprotachykinin mRNA BASE COUNT 289 a 212 c 245 g 289 t ORIGIN 1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc 61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg 121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat 181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gcccgagccc 241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta 301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt 361 tatgggcatg gtcagatctc tcacaaaagg cataaaacag attcctttgt tggactaatg 421 ggcaaaagag ctttaaattc tgtggcttat gaaagaagcg caatgcagaa ctacgaaaga 481 aggcgtaaat aaaccctgta acgcactatc tattcatctc catctgtgtc cgcgagcagt 541 gagcggtaaa ataaaaatgt gcgctatgag gaatgattat ttatttaata tcaaatgttg 601 ttatgagtga aaaactcaaa aaagtgttta ttttttcata ttgtgccaat aagcattgta 661 attctaatgt ggtgacctcc tcagacagaa gtagaaatta gttgtaactt cagcaaagca 721 cagtgttgat ggagttgtac aagtttgcca gcgatgcaag tctccaaaga cagaaaggct 781 gctgtgaggc agtgcaggcg gctgctgctg gaggcagaga aactcctgtg tgtcttgcgc 841 ttcccttggt tgcttttatc ctaatgatgt actgagagtt tggtatctga ctctatttgt 901 atcctagcag catgtttcct gtgttgtgac tatatagaga tgtttttaaa agtttcaatg 961 tacttctctg gtcttcagtc attgtatgat gtgttgtgat agctaccatt ttaaataaaa 1021 gaatgtatct tcagg // LOCUS CHPRGIT 1051 bp ds-DNA PRI 03-JUL-1990 DEFINITION Chimpanzee rRNA gene internal transcribed spacer 1 (ITS1). ACCESSION M30947 KEYWORDS internal transcribed spacer. SOURCE Chimpanzee DNA. ORGANISM Pan troglodytes Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 1 to 1051) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.L.Gonzalez, 20-DEC-1989. BASE COUNT 69 a 411 c 425 g 146 t ORIGIN Acrocentric chromosomes 14, 15, 17, 22, 23. 1 acggagccga aggggggcgc gaggccgcgg cggcgccgcc gcgcgcttcc ctccccccca 61 ccccgccgca acgcggcgcg tgcgcgggcg gggcccgtgt gccgttcgtt cgttcgttcg 121 ctgcccggcc ccgccgccgc gagagccgag aactcgggag ggcgacgttg gggggagagc 181 gagagagaca gaaagaaggg ggcgcgtgtt cgctgcgcgt gtcgtggggc cggcggggag 241 cggtccccgg cctcgggccc gacggacgtg tgtgtcggcg ggcgcggggg cggttctcgg 301 cggcgtcacg gcgggtttgg gggggggggt ctcggtgccc tcctccccgc cggggcccgt 361 cgtccggccc cgccgcgcgc cggctccccg tcgtcggggc cgggccggat tcccgtcgcc 421 gcctccgccg cgcgccgctc cgcgccaccg ggcacggccc cgctcgctct ccccggcctt 481 cccgctaggg cgtctcgagg gtcgggggcc ggacgccggt ccccccctcc tcgtccgccc 541 ccgccgtcca ggtacctagc gcgttccggc gcggaggttt aaagacccct tgggggatcg 601 cccgtccgcc cgcgggtcgg gggcggtggt gggcccgcgg gggagtcccg tcgggagggg 661 cccggcccct cccgcgcctc ccccgcggac tccgcccccg gccggggccg cgccgcctcg 721 ccggctcggg tcgcggcggc cgtcgggtgg gggctttacc cggcggccgt cgcgtgcgcg 781 cgtgccgcgc gtgtggcgtg cgccccgcgc cgtgggggcg ggaacccccc gggcgcctgt 841 ggggtggtgt ccgcgctcgc ccctgcgtgg gcggcgcgcg cctccccgtg gtgtgaaacc 901 ttccgacccc tctccggagt ccggtcccgt ttttgctgtc tctctggccg gcctgaggca 961 accccctctc ctctgggggg gggggacgtg ccgcgccagg agggcctccc ggtgtgtttg 1021 tcgggagcgc cctcgccaaa tcgacctcgt a // LOCUS CHPRGITX 2512 bp ds-DNA PRI 03-JUL-1990 DEFINITION Chimpanzee 28S ribosomal RNA gene fragment. ACCESSION M30950 KEYWORDS 28S ribosomal RNA. SOURCE Chimpanzee DNA. ORGANISM Pan troglodytes Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 193 to 261; 375 to 945; 1042 to 1079; 1334 to 1357; 1742 to 1958; 2190 to 2204) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2512) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by I.L.Gonzalez, 20-DEC-1989. FEATURES from to/span description rRNA < 1 > 2512 28S ribosomal RNA BASE COUNT 397 a 837 c 924 g 354 t ORIGIN Chromosomes 14, 15, 17, 22, 23. 1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg 61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg 121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga 181 cccctccacc cgccctccct tccccccgcc gcccctcctc ctcctccccg gagggggcgg 241 gctccggcgg gtgcgggggt gggcgggcgg ggccgggggt ggggtcggcg ggggaccgtc 301 ccccgaccgg cgaccggccg ccgccgggcg catttccacc gcggcggtgc gccgcgaccg 361 gctccgggac ggctgggaag gcccggcggg gaaggtggct cggggggccc cgtccgtcct 421 cctcctcccc ccccgtctcc gccccccggc cccgcgtcct cccccgggag ggcgcgcggg 481 tcggggcggt ggcggcggcg gcggcggtgg cggcggtggc ggcgggaccg aaaccccccc 541 cgagtgttac agccccccgg cagcagcact cgccgaatcc cggggccgag ggagcgagac 601 ccgtcgccgc gctctccccc ctcccggcgc ccacccccgc ggggatatcc tccgcgaggg 661 gggtctcccc cgcgggggcg cgccggcgtc tcctcgtggg ggggccgggc cacccctccc 721 acggcgcgac cgctctccca cccctcctcc ccgcaacccc cctctcccgg cgacggggag 781 ggccgcgcgc gggtcggggg gcggggcgga ctgtccccag tgcgccccgg gcgggtcgcg 841 ccgtcgggcc cgggggaggt tctctcgggg ccacgcgcgc gtcccccgaa gagggggacg 901 gcggagccga gcgcacgggg tcggcggcga tgtcggccac ccacccgacc cgtcttgaaa 961 cacggaccaa ggagtctaac acgtgcgcga gtcgggggct cgcacgaaag ccgccgtggc 1021 gcaatgaagg tgaaggccgg cgcgctcgcc ggccgaggtg ggatccgagg cctctccagt 1081 ccgccgaggg cgcaccaccg gcccgtctcg cccgccgcgc cggggaggtg gagcacgagc 1141 gcacgtgtta ggacccgaaa gatggtgaac tatgcctggg cagggcaagc cagaggaaac 1201 tctggtggag gtccgtagcg gtcctgacgt gcaaatcggt cgtccgacct gggtataggg 1261 gcgaaagact aatcgaacca tctagtagct ggttccctcc gaagtttccc tcaggatagc 1321 tggcgctctc gcagacccga cgcacacccc cccacgcagt tttatccggt aaagcgaatg 1381 attagaggtc ttggggccga aacgatctca acctattctc aaactttaaa tgggtaagaa 1441 gcccggctcg ctggcgtgga gccggggtgg aatgcgagtg cctagtgggc cacttttggt 1501 aagcagaact ggcgctgcgg gatgaaccga acgccgggtt aaggcgcccg atgccgacgc 1561 tcatcagacc ccagaaaagg tgttggttga tatagacagc aggacggtgg ccatggaagt 1621 cggaatccgc taaggagtgt gtaacaactc acctgccgaa tcaactagcc ctgaaaatgg 1681 atggcgctgg agcgtcgggc ccatacccgg ccgtcgccgg cagtcgagag tggacgggag 1741 cggcgggggc ggcgcgggcg tgtgcgcgcg cgcgtgtgtg cgtgtgtgtc ggagggcggc 1801 ggcggtggcg gcgggggtgg ggtcctcccc ctcccccacg ccgcctcccc tcctcccacc 1861 caccaccgcc gccgccaccc ccgctccccg cccccggagc cccgcggacg ctacgccgcg 1921 acgagtagga gggccgctgc ggtgagcctt gaagcctagg gcgcgggccc gggtggagcc 1981 gccgcaggtg cagatcttgg tggtagtagc aaatattcaa acgagaactt tgaaggccga 2041 agtggagaag ggttccatgt gaacagcagt tgaacatggg tcagtcggtc ctgagagatg 2101 ggcgagcgcc gttccgaagg gacgggcgat ggcctccgtt gccctcggcc gatcgaaagg 2161 gagtcgggtt cagatccccg aatccggagt ggcggagatg ggcgccgcga ggcgtccagt 2221 gcggtaacgc gaccgatccc ggagaagccg gcgggagccc cggggagagt tctcttttct 2281 ttgtgaaggg cagggcgccc tggaatgggt tcgccccgag agaggggccc gtgccttgga 2341 aagcgtcgcg gttccggcgg cgtccggtga gctctcgctg gcccttgaaa atccggggga 2401 gagggtgtaa atctcgcgcc gggccgtacc catatccgca gcaggtctcc aaggtgaaca 2461 gcctctggca tgttggaaca atgtaggtaa gggaagtcgg caagccggat cc // LOCUS GORRGIT 987 bp ds-DNA PRI 03-JUL-1990 DEFINITION Gorilla rRNA gene internal transcribed spacer 1 (ITS1). ACCESSION M30948 KEYWORDS internal transcribed spacer. SOURCE Gorilla DNA. ORGANISM Gorilla gorilla Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 1 to 987) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.L.Gonzalez, 20-DEC-1989. BASE COUNT 65 a 398 c 390 g 134 t ORIGIN Chromosomes 22 and 23. 1 acggagcgaa gggcgaggcc gcggcggtgg cgccgccgcg tgcttccctc ccccccaccg 61 acgcggcgcg tgcgcgggcg gggcccgtgc cgttcgttcg ttcgttcgtt cgctgcccgg 121 ccccgccgcc gcgagagccg aggactcggg agggagacgg ggggggagaa gagaaaggag 181 gcctgtccgt gtgtgcgtgt cgtggggccg gccgcgctgg tgagcggcgg cgaggcctcc 241 ccggccgcgg cccgacgacg tgtgtgtcgg cgggtgcggg ggcggttctc ggcggcgtca 301 cggcgggttt ggggcctcgg tgccctcctc cccgccgggg cccgtcgtcc ggccccgccg 361 ccggcccccc cgtcgtcggg gccggccggg ttcccgtcgc cgccgccgcc gccgccgtcg 421 tcgcctccgc cgcgccaccg ggaccggccc cgctcgctct ccccggcctt cccgctaggg 481 cgtctcgagg gtcgggggcc ggacgccggt ccccccctcc tcgtccgccc ctccccgccg 541 ttccaggtac ctagcgcgtt ccggcgcgga ggtttaaaga ccccttgggg gatcgcccgt 601 ccgccccgtg ggtcgggggc ggtgggcccg cgggggggtc ccgtcgggag gggcccggcc 661 cctcccgcgc ctccaccgcg gactccgccc cccggccggg gccgcggcgg ccgtcgggtg 721 ggggctttac ccggcggccg tgcgcccccg cgccgtgggg gcgggaaccc ccgggcgcct 781 gtggggcgtg tcagcgctcg cccccgcgtg ggcgccgcgc ctccccgtgg tgtgaaacct 841 tccgacccct ctccggagtc cggtcccgtt tgctgtccgt ctggccggcc tgaggcaacc 901 ccccctcctc cgtggggggg gggggacgtg ccgcgccagg agggccctcc cggtgtcggg 961 agcgccctcg ccaaatcgac ctcgtta // LOCUS GORRGITX 2467 bp ds-DNA PRI 03-JUL-1990 DEFINITION Gorilla 28S ribosomal RNA gene fragment. ACCESSION M30951 KEYWORDS 28S ribosomal RNA. SOURCE Gorilla DNA. ORGANISM Gorilla gorilla Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 193 to 261; 375 to 944; 1041 to 1079; 1334 to 1354; 1737 to 1913; 2145 to 2159) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2467) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by I.L.Gonzalez, 20-DEC-1989. FEATURES from to/span description rRNA < 1 > 2467 28S ribosomal RNA BASE COUNT 386 a 819 c 912 g 350 t ORIGIN Chromosomes 22 and 23. 1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg 61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg 121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga 181 cccctccacc cgccctccct tcccccgccg cccctcctcc tcctccccgg agggggcggg 241 ctccggcggg tgcggggggt gggcgggcgg ggccgggggt ggggtcggcg ggggaccgtc 301 ccccgaccgg cgaccggccg ccgccgggcg catttccacc gcggcggtgc gccgcgaccg 361 gctccgggac ggctgggaag gcccggcggg gaaggtggct cggggggccc cgtccgtccg 421 tccgtccgtc ctcctccccc gtctccgccc cccggccccg cgtcctccct cgggaagggg 481 cgcgcgggtc ggggcggcgg cggcggcggt ggcggcggcg gcggcggcgg cgggaccgaa 541 acccccccga gtgttacagc cccccggcca gccagccatc gccgaatccc ggggccgagg 601 gagcgagacc cgtcgccgcg ctctcccccc tcccggcgcc cacccccgcg ggggtccccc 661 gcgagggggt cccccgcggg ggcgcgccgg cgtctcctcg tgggggggcc gggccacccc 721 tcccacggcg cgaccgctct cccacccctc gcttccccgc acactccccc ggcgacgggg 781 tgccgcgcgc gggtcggggg gcggggcgga ctgtccccag tgcgccccgg gcgggtcgcg 841 ccgtcgggcc cgggggaggt tctcccgggg ccacgcgcgc gtcccccgaa gagggggacg 901 gcggagcgag cgcacggggt cggcggcgat gtcggctacc cacccgaccc gtcttgaaac 961 acggaccaag gagtctaaca cgtgcgcgag tcgggggctc gcacgaaagc cgccgtggcg 1021 caatgaaggt gaaggccggc gcgctcgccg gccgaggtgg gatcccgagg cctctccggt 1081 ccgccgaggg cgcaccaccg gcccgtctcg cccgccgcgc cggggaggtg gagcacgagc 1141 gcacgtgtta ggacccgaaa gatggtgaac tatgcctggg cagggcaagc cagaggaaac 1201 tctggtggag gtccgtagcg gtcctgacgt gcaaatcggt cgtccgacct gggtataggg 1261 gcgaaagact aatcgaacca tctagtagct ggttccctcc gaagtttccc tcaggatagc 1321 tggcgctctc gcagacccct cctccccccc acgcagtttt atccggtaaa gcgaatgatt 1381 agaggtcttg gggccgaaac gatctcaacc tattctcaaa ctttaaatgg gtaagaagcc 1441 cggctcgctg gcgtggagcc gggtggaatg cgagtgcctg tgggccactt ttggtaagca 1501 gaactggcgc tgcgggatga accgaacgcc gggttaaggc gcccgatgcc gacgctcatc 1561 agaccccaga aaaggtgttg gttgatatag acagcaggac ggtggccatg gaagtcggaa 1621 tccgctaagg agtgtgtaac aactcacctg ccgaatcaac tagccctgaa aatggatggc 1681 gctggagcgt cgggcccata cccggccgtc gccggcagtc gagagtggac gggagcggcg 1741 ggggcggcgc gcgcgcgcgc gtgtggggtc ggagggcggc gtgtgggcgg tggggtcctc 1801 gcccccctcc cccgcgcctc ccctcctccc acccccgctc cccgcccccg ggagccccgc 1861 ggacgctacg ccgcgacgag taggagggcc gctgcggtga gccttgaagc ctagggcgcg 1921 ggcccgggtg gagccgccgc aggtgcagat cttggtggta gtagcaaata ttcaaacgag 1981 aactttgaag gccgaagtgg agaagggttc catgtgaaca gcagttgaac atgggtcagt 2041 cggtcctgag agatgggcga gcgccgttcc gaagggacgg gcgatggcct ccgttgccct 2101 cggccgatcg aaagggagtc gggttcagat ccccgaatcc ggagtggcgg agatgggcgc 2161 cgcgaggcgt ccagtgcggt aacgcgaccg atcccggaga agccggcggg agccccgggg 2221 agagttctct tttctttgtg aagggcaggg cgccctggaa tgggttcgcc ccgagagagg 2281 ggcccgtgcc ttggaaagcg tcgcggttcc ggcggcgtcc ggtgagctct cgctggccct 2341 tgaaaatccg ggggagaggg tgtaaatctc gcgccgggcc gtacccatat ccgcagcagg 2401 tctccaaggt gaacagcctc tggcatgttg gaacaatgta ggtaagggaa gtcggcaagc 2461 cggatcc // LOCUS ORARGIT 1070 bp ds-DNA PRI 03-JUL-1990 DEFINITION Orangutan rRNA gene internal transcribed spacer 1 (ITS1). ACCESSION M30949 KEYWORDS internal transcribed spacer. SOURCE Orangutan DNA. ORGANISM Pongo pygmaeus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 1 to 1070) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.L.Gonzalez, 20-DEC-1989. BASE COUNT 60 a 438 c 438 g 134 t ORIGIN Chromosomes 11, 12, 13, 14, 15, 16, 17, 22, 23. 1 acggagcgaa gagcgaggcc cgcggcggcg ccgccgcggc gtccttcctc gtcggccggc 61 cggccgcgtt tctcccccgc ttcccgcggc gcgtgcgcgg gcggggcccg tgccgttcgc 121 gcgcacgcgc gggcgtgcgt gcgtgcgtcg cccggccccg ccggccgcga gagccggaga 181 acctcgggag ggagagagag gggggagaga gagagcggtg tgtgtgtgcg cgcgcgcgtg 241 tctcgggggc ggccggcgcg gcggggagcg gtccccggcc gcggccccga cgtgtgtgtc 301 ggcgggcgcg ggtgcggtcc tcggcggcgt cgcggcgggg tggggggtgt ctcggtgccc 361 ctccccgccg gggcccgtcg tcccgtcccc gacccgccgg ctccgcgtcg ggggccggcc 421 gggttcccgc cgcccccgtc gcctccgcca cgccgcgcca ccgggccggg ccggcccggc 481 ccgccccgct cgctctcccc ggccttcccg ctagggcgtc tcgagggtcg ggggccggac 541 gccggtcccc gcgcctcctc gtccgccccc ccctcccccc gccgtccagg tacctagcgc 601 gttccggcgc ggaggtttaa agaccccttg ggggatcgcc cgtccgcccg tgggtcgggg 661 gcggtgggcc cgcgtgggga gtcccgtcgg gaggggcccg gcccctcccg cgcctccacc 721 gcggactccg cccccccggc cggggcgctg ccgccgccgc cgcggtcgcg gcggccgtcg 781 ggtgggggct ttacccggcg gccgtcgtgc cgtccgtcgc gcgcgtgccc cgcgccgtgg 841 gggcgggaac cccccgggcg cctgtggggt ggtgtccgcg ctcgcccccg cgtgggcggc 901 gcgcgcctcc ccgtggtgtg cgacaccttc cgacccctct ccggagtccg gtcccgtttg 961 ccgtctgact ggccggcctg aggcgacccc cccctgcggg ggggaagtgc cgcgccaggg 1021 gcgagggcct cccggtgtgt cgggggcgcc ctcgcccgat cgagctcgta // LOCUS ORARGITX 2487 bp ds-DNA PRI 03-JUL-1990 DEFINITION Orangutan 28S ribosomal RNA gene fragment. ACCESSION M30952 KEYWORDS 28S ribosomal RNA. SOURCE Orangutan DNA. ORGANISM Pongo pygmaeus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 193 to 263; 377 to 985; 1084 to 1120; 1376 to 1394; 1780 to 1933; 2165 to 2179) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2487) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by I.L.Gonzalez, 20-DEC-1989. FEATURES from to/span description rRNA < 1 > 2487 28S ribosomal RNA BASE COUNT 393 a 814 c 921 g 359 t ORIGIN Chromosomes 11, 12, 13, 14, 15, 16, 17, 22, 23. 1 gtcaacaagt accgtaaggg aaagttgaaa agaactttga agagagagtt caagagggcg 61 tgaaaccgtt aagaggtaaa cgggtggggt ccgcgcagtc cgcccggagg attcaacccg 121 gcggcgggtc cggccgtgtc ggcggcccgg cggatctttc ccgccccccg ttcctcccga 181 cccctccacc cgccctccct cccccgccgc ccctcctcct cctccccgcg gggagggggc 241 gggctccggc gggtgcgggg gtgggcgggc ggggccgggg gtggggtcgg cgggggaccg 301 tcccccgacc ggcgaccggc cgccgccggg cgcatttcca ccgcggcggt gcgccgcgac 361 cggctccggg acggctggga aggcccggtg gggaaggtgg ctcggggggc cccgtccgtc 421 cgtccgtccg tcctcctccc tcctcccccc tcgtcttccc cccggccccg cgtcctccct 481 cgggagggcg cgcgggtcgg gggcggcggc gggggtggct gctgctgctg ctgcggcggc 541 ggcgggaccg aaccccccga gtgttacagc cccggcagca gcgctcgccg aacccggggc 601 cgagggagcg agacccgtcg ccgcgctctc ccccctcccg gcgcccaccc ccgcgggggt 661 cccccgcgag ggggtccccc ccgcgggggc gcgccggcgt ctcctcgcgt ggggggccgg 721 gccgcccctc ccacggcgcg accgctctcc cacccccccc ttccccgcgc acccccggcg 781 acgggggccc gcgcgggcgg ggggggcggg gcggactgtc cccagtgcgc cccgggcggg 841 tcgcgccgtc gggcccgggg aagagagagg gagaggaggg ggttctcctc ctcctcctcc 901 cctctcgggg ccacgcgcgc gtccctcgaa gagggggacg gcggagccga gcgcacgggg 961 tcggcggcga tgtcggccac ccacccgacc cgtcttgaaa cacggaccaa ggagtctaac 1021 acgtgcgcga gtcgggggct cgcacgaaag ccgccgtggc gcaatgaagg tgaaggccgg 1081 cgcgctcgcc ggccgaggtg ggatcccgag gcctctccag tccgccgagg gcgcaccacc 1141 ggcccgtctc gcccgccgcg ccggggaggt ggagcacgag cgcacgtgtt aggacccgaa 1201 agatggtgaa ctatgcctgg gcagggcgaa gccagaggaa actctggtgg aggtccgtag 1261 cggtcctgac gtgcaaatcg gtcgtccgac ctgggtatag gggcgaaaga ctaatcgaac 1321 catctagtag ctggttccct ccgaagtttc cctcaggata gctggcgctc tcgcagactc 1381 gaccgaccga ccgcagtttt atccggtaaa gcgaatgatt agaggtcttg gggccgaaac 1441 gatctcaacc tattctcaaa ctttaaatgg gtaagaagcc cggctcgctg gcgtggagcc 1501 gggcgtggaa tgcgagtgcc tagtgggcca cttttggtaa gcagaactgg cgctgcggga 1561 tgaaccgaac gccgggttaa ggcgcccgat gccgacgctc atcagacccc agaaaaggtg 1621 ttggttgata tagacagcag gacggtggcc atggaagtcg gaatccgcta aggagtgtgt 1681 aacaactcac ctgccgaatc aactagccct gaaaatggat ggcgctggag cgtcgggccc 1741 atacccggcc gtcgccggca gtcgagagtg gacgggagcg gcgggggcgg ggtgcgtgcg 1801 ggtgtggggg tgtgtgtggg ggggggtcct ccccccccgc cactcctcct cctcccaccc 1861 ctcccccgga gcagccccgc ggacgctacg ccgcgacgag taggagggcc gctgcggtga 1921 gccttgaagc ccagggcgcg ggcccgggtg gagccgccgc aggtgcagat cttggtggta 1981 gtagcaaata ttcaaacgag aactttgaag gccgaagtgg agaagggttc catgtgaaca 2041 gcagttgaac atgggtcagt cggtcctgag agatgggcga gcgccgttcc gaagggacgg 2101 gcgatggcct ccgttgccct cggccgatcg aaagggagtc gggttcagat ccccgaatcc 2161 ggagtggcgg agacgggcgc cgcgaggcgt ccagtgcggt aacgcgaccg atcccggaga 2221 agccggcggg agccccgggg agagttctct tttctttgtg aagggcaggg cgccctggaa 2281 tgggttcgcc ccgagagagg ggcccgtgcc ttggaaagcg tcgcggttcc ggcggcgtcc 2341 ggtgagctct cgctggccct tgaaaatccg ggggagaggg tgtaaatctc gcgccgggcc 2401 gtacccatat ccgcagcagg tctccaaggt gaacagcctc tggcatgttg gaacaatgta 2461 ggtaagggaa gtcggcaagc cggatcc // LOCUS ACCRRSAA 1536 bp ss-rRNA RNA 03-JUL-1990 DEFINITION A.calcoaceticus 16S ribosomal RNA. ACCESSION M34139 KEYWORDS 16S ribosomal RNA. SOURCE A.calcoaceticus (strain 33604) ribosomal RNA. ORGANISM Acinetobacter calcoaceticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1536) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1536 16S ribosomal RNA BASE COUNT 387 a 329 c 460 g 330 t 30 others ORIGIN 1 ttaactgaag agtttgatca tggctcagat tgaacgctgg cggcaggctt aacacatgca 61 agtcgagcgg ggaaggttgc ttcggtaact gactagcggc ggacgggtga gtaatgctta 121 ggaatctgcc atttagtggg ggacaacatt ccgaanggaa tgctaatacc gcatacgtcc 181 tacaggagaa agcaggggat ctccggacct tgcgctaaat gatgagccta agtcggatta 241 gctagttggt ggggtaaagg cctaccaagg cgacgatctg tagcgggtct gagaggatga 301 tccgccacac tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata 361 ttggacaatg ggcgcaagcc ngatccagcc atgccgcgtg tgtgaagaag gccttttggt 421 tgtaaagcac tttaagcgag gaggaggctc tcttagttaa tacctaagat gagtggacgt 481 tactcgcaga ataagcaccg gctaactctg tgccagcagc cgcggtaata cagagngtgc 541 gagcgttaat cggatttact gggcgtaaag cgtgcgtagg cggcttttta agtcggatgt 601 gaaatccccg agcttaactt gggaattgca ttcgatactg ggaagctaga gtatgggaga 661 ggatggtaga attccaggtg tagcggtgaa atgcgtagag atctggagga ataccgatgg 721 cgaaggcagc catctggcct aatactgacg ctgaggtacg naagcatggg gagcaaacag 781 gattagatac cctggtagtc catgccgtaa acgatgtcta ctagccgttg gggcctttga 841 ggctttagtg gcgcagctaa cgcgataagt agactgcctg gggagtacgg tcgcaagact 901 aaaactcaaa tgaattgacg ggggcncgca caagcggtgg agcatgtggt ttaattcgat 961 gcaacgcgaa gaaccttacc tggccttgac atactagaaa ctttccagag atggattggt 1021 gccttcggga atctagatac aggtgctgca tggctgtcgt cagctcgtgt cgtgagatgt 1081 tgggttaagt cccgcaacga gcgcaaccct tttccttact tgccagcatt tcggatggga 1141 actttaagga tactgccagt gacaaactgg aggaaggcgg ggacgacgtc aagtcatcat 1201 ggcccttacg gctagggcta cacacgtgct acaatggtcg gtacaaaggg ttgctaccta 1261 gcgataggat gctaatctca aaaagccgat cgtagttcgg attggagtct gcaactcgac 1321 tccatgaagt cggaatcgct agtaatcgcg gatcagaatg ccgcggtgaa tacgttcccg 1381 ggccttgtac acaccgcccg tcacaccatg ggagtttgtt gcaccagaag tagctagcct 1441 aactgcaaag agggcggtta ccacggtgtg gccgatgact agggnnnnnn ngtaacaagn 1501 nnnnnnnnnn ngaacctgnn nnnngatcac ctcctt // LOCUS BDERRSAA 1553 bp ss-rRNA RNA 03-JUL-1990 DEFINITION B.stolpii 16S ribosomal RNA. ACCESSION M34125 KEYWORDS 16S ribosomal RNA. SOURCE B.stolpii (strain uki-2) ribosomal RNA. ORGANISM Bdellovibrio stolpii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1553) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept Microbiology, 131 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1553 16S ribosomal RNA BASE COUNT 381 a 320 c 429 g 316 t 107 others ORIGIN 1 agcatnnaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa 61 gtcgaacgtg aaagtccttc gggatgagta aagtggcgca cgggtgagta acacgtaggt 121 gacctgcctt ttagagggga ataaccagaa gaaattttgg ctaatgccgc atacgaagca 181 cggttttaag actgtgcttg aaagaatgcc tctgcatatg ngcattcgct attagatggg 241 cctgcgggac attagctagt tggtggggta aaggcctacc aaggcgacga tgtctatccg 301 gtctgagagg atgatcggac acactggaac tgagacacgg tccagactcc tacgggaggc 361 agcagtgggg aatattgcgc aatgggggaa accctgacgc agcaacgccg cgtgagtgag 421 gaaggacttc ggtctgtaaa gctctgttaa tgtggaaaaa tggcagttgg tctaataggc 481 cnattgtttg atggtacaca tagaggaagc accggctaac ttcgtgccag cagccgcggt 541 aatacgaagg gtgcnagcgt tgttcggatt tattgggcgt aaagcgcgcg taggcggacc 601 tgcaagtcag atgtgaaatc tcggggctca acctcgaaac tgcgtctgaa actacaggtc 661 tagaatctcg gagggggaag gggaatatcg catgtagggg taaaatccgt agatatgcga 721 tggaacacca gaggcgaagg cgccttcctg gacgagtatt gacgctgagg cncnnaagcg 781 tggggatcaa acaggattag ataccctggt agtccacgct gtaaacgatg aacactagat 841 attggaggat ttgacccctt cagtgtcgta gctaacgcgt caagtgttcc gcctgggaag 901 tacggtcgca agactaaaac tcaaaggaat tgnnnnnnnn nngcacaagn nnnngattat 961 gnngtttaat tcgnngcaac gcgcagaacc ttacctaggc ttgaaatcct acgaatccct 1021 tttaaacgag ggagtgctct tcggagaatg tagtgacagg cgctgcatgg ctgtcgtcag 1081 ctcgtgtcgt gagatgttgg gttaagtctc gcaacgagcg caacccccat ttttagttgc 1141 cagcattaag ttgggcactc tagaaagact gcntgggcta accaggagga aggtggggat 1201 gacgtcaagt cctcatggcc cttatgtcta gggctacaca cgtaatacaa tggtcggtac 1261 aaagggatgc gaactcgcga gggggagcca atctcaaaaa accgatctca gtccggattg 1321 gagtctgcaa ctcgactcca tgaagttgga atcgcgagta atcgcggatc agcacgccgc 1381 ggtgaatacg ttcccgggcc ttgtacacac cgcccgtcac accatgggag ttgtttttac 1441 ctgaagnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1501 nnnnnnngta acaagnnnnn nnnnnnngaa cctgnnnnnn gatcacctcc ttt // LOCUS PLTRRSAA 1525 bp ss-rRNA RNA 03-JUL-1990 DEFINITION P.staleyi 16S ribosomal RNA. ACCESSION M34126 KEYWORDS 16S ribosomal RNA. SOURCE P.staleyi (strain ATCC 27377) ribosomal RNA. ORGANISM Planctomyces staleyi Prokaryota; Bacteria; Eubacteriomycetes; Eubacteriales. REFERENCE 1 (bases 1 to 1525) AUTHORS Woese,C.R. and Oyalzu,H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 15-MAY-1990. Author address: C.R.Woese University of Illinois Dept Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1525 16S ribosomal RNA BASE COUNT 376 a 348 c 482 g 315 t 4 others ORIGIN 1 caattgaaga gtttgatcct ggctcagaat gaacgttggc ggcatggatt aggcatgcaa 61 gtcgtgcgcg atatgtagca atacatggag agcggcgaaa gggagagtaa tacgtaggaa 121 cctaccttcg ggtctgggat agcggcggga aactgccggt aataccagat gatgtttccg 181 aaccaaaggt gtgattccgc ctgaagaggg gcctacgtcg tattagctag ttggtagggt 241 aatggcctac caaggcaaag atgcgtatgg ggtgtgagag catgccccca ctcactggga 301 ctgagacact gcccagacac ctacgggtgg ctgcagtcga gaatcttcgg caatgggcga 361 aagcctgacc gagcgatgcc gcgtgcggga tgaaggcctt cgggttgtaa accgctgtcg 421 taggggatga agtgctaggg ggttctccct ctagtttgag ctgaacctag gaggaagggc 481 cggctaatct cgtgccanna gccgcggtaa tacgagaggc ccaaacgtta ttcggattta 541 ctgggcttaa agagttcgta ggcggtcttg taagtggggt gtgaaatccc tcggctcaac 601 cgaggaactg cgctccaaac tacaagactt gagggggata gaggtaagcg gaactgatgg 661 tggagcggtg aaatgcgttg atatcatcag gaacaccgga ggcgaaggcg gcttactggg 721 tcctttctga cgctgaggaa cgaaagctag gggagcaaac gggattagat accccggtag 781 tcctagccgt aaacgatgag cactggaccg gagctctgca cagggtttcg gtcgtagcga 841 aagtgttaag tgctccgcct ggggagtatg gtcgcaaggc tgaaactcaa aggaattgac 901 gggggctcac acaagcggtg gaggatgtgg cttaattcga ggctacgcga agaaccttat 961 cctagtcttg acatgcttag gaatcttcct gaaagggagg agtgctcgca agagagcctt 1021 tgcacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgtcgggt taagtccctt 1081 aacgagcgaa acccttgtcc ttagttacca gcgcgtcatg gcggggactc taaggagact 1141 gccggtgtta aaccggagga aggtggggat gacgtcaagt cctcatggcc tttatgatta 1201 gggctgcaca cgtcctacaa tggtgcacac aaagcgacgc aaactcgtga gagccagcta 1261 atcgcaaaaa atgtacctca gttcggattg caggctgcaa ctcgcctgca tgaagctgga 1321 atcgctagta atcgcgggtc agcataccgc ggtgaatntg ttcctgagcc ttgtacacac 1381 cgcccntcaa gccacgaaag tgggggggac ccaacagcgc tgccgtaacc gcaaggaaca 1441 aggcgcctaa ggtcaactcc gtgattggga ctaagtcgta acaaggtagc cgtaggggaa 1501 cctgcggctg gatcacctcc tttct // LOCUS RDCRRSAA 1478 bp ss-rRNA BCT 03-JUL-1990 DEFINITION R.purpureus 16S ribosomal RNA. ACCESSION M34132 KEYWORDS 16S ribosomal RNA. SOURCE R.purpureus (strain 6770) ribosomal RNA. ORGANISM Rhodocyclus purpureus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1478) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1478 16S ribosomal RNA BASE COUNT 367 a 345 c 470 g 289 t 7 others ORIGIN 1 tgaactgaag agtttgatcc tggctcagat tgaacgctgg cggcatgcct tacacatgca 61 agtcgaacgg taacgggncc ttcgggcgcc gaacgagtgg cgaacgggtg agtaatgcat 121 cggaacatgc cctgaagtgg gggataacgt agcgaaagtt acgctaatac cgcatattct 181 gtgagcagga aagcagggga ccttcgggcc ttgcgctttg ggagtggccg atgtcggatt 241 agctagttgg tggggtaaaa gcctaccaag gcaacgatcc gtagcgggtc tgagaggatg 301 atccgccaca ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat 361 tttggacaat gggcgaaagc ctgatccagc catgccgcgt gagtgaagaa ggccttcggg 421 ttgtaaagct ctttcggcgg ggaagaaatc gggtttccta atacggaacc cggatgacgg 481 tacccgaaga agaagcaccg gctaactacg tgccagcagc cgcggtaata cgtagggtgc 541 nagcgttaat cggaattact gggcgtaaag cgtgcgcagg cggttgtgta agacagacgt 601 gaaatccccg ggctcaacct gggaactgcg tttgtgactg cacagctaga gtacggcaga 661 ggggggtgga attccacgtg tagcagtgaa atgcgtagag atgtggagga acaccgatgg 721 cgaaggcagc cccctgggcc aatactgacg ctcatgcacg naagcgtggg gagcaaacag 781 gattagatac cctggtagtc cacgccctaa acgatgtcaa ctaggtgttg gtggggttaa 841 acccattagt gccgtagcta acgcgtgaag ttgaccgcct ggggagtacg gcggcaaggt 901 taaaactcaa aggaattgac gggganccgc acaagcggtg gatgatgtgg attaattcga 961 tgcaacgcga aaaaccttac ctacccttga catgtcagga atcctgagga gactcgggag 1021 tgcccgaaag ggnacctgaa cacaggtgct gcatggcngt cgtcagctcg tgtcgtgaga 1081 tgttgggtta agtcccgcaa cgagcgcaac ccttgtcatt aattgccatc attcagttgg 1141 gcactttaat gaaactgccg gtgacaaacc ggaggaaggt ggggatgacg tcaagtcctc 1201 atggccctta tgggtagggc ttcacacgtc atacaatggt cggtccatag ggttgcnaac 1261 ccgcgagggg gagctaatcc cagaaagccg atcgtagtcc ggattgcagt ctgcaactcg 1321 actgcatgaa gtcggaatcg ctagtaatcg cggatcagca tgtcgcggtg aatacgttcc 1381 cgggtcttgt acacaccgcc cgtcacacca tgggagcggg ttctgccaga agtagttagc 1441 ctaaccgcaa ggagggcgat taccacggca gcgttcgt // LOCUS HUMFGF2H 3365 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34185 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3365) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 256 2457 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 786 a 962 c 917 g 700 t ORIGIN 1 gcaccgagcg ccgccgggag tcgagcgccg gccgcggagc tcttgcgacc ccgccaggac 61 ccgaacagag cccgggggcg gcgggccgga gccggggacg cgggcacacg cccgctcgca 121 caagccacgg cggactctcc cgaggcggaa cctccacgcc gagcgagggt cagtttgaaa 181 aggaggatcg agctcactgt ggagtatcca tggagatgtg gagccttgtc accaacctct 241 aactgcagaa ctgggatgtg gagctggaag tgcctcctct tctgggctgt gctggtcaca 301 gccacactct gcaccgctag gccgtccccg accttgcctg aacaagatgc tctcccctcc 361 tcggaggatg atgatgatga tgatgactcc tcttcagagg agaaagaaac agataacacc 421 aaaccaaacc gtatgcccgt agctccatat tggacatccc cagaaaagat ggaaaagaaa 481 ttgcatgcag tgccggctgc caagacagtg aagttcaaat gcccttccag tgggacccca 541 aaccccacac tgcgctggtt gaaaaatggc aaagaattca aacctgacca cagaattgga 601 ggctacaagg tccgttatgc cacctggagc atcataatgg actctgtggt gccctctgac 661 aagggcaact acacctgcat tgtggagaat gagtacggca gcatcaacca cacataccag 721 ctggatgtcg tggagcggtc ccctcaccgg cccatcctgc aagcagggtt gcccgccaac 781 aaaacagtgg ccctgggtag caacgtggag ttcatgtgta aggtgtacag tgacccgcag 841 ccgcacatcc agtggctaaa gcacatcgag gtgaatggga gcaagattgg cccagacaac 901 ctgccttatg tccagatctt gaagactgct ggagttaata ccaccgacaa agagatggag 961 gtgcttcact taagaaatgt ctcctttgag gacgcagggg agtatacgtg cttggcgggt 1021 aactctatcg gactctccca tcactctgca tggttgaccg ttctggaagc cctggaagag 1081 aggccggcag tgatgacctc gcccctgtac ctggagatca tcatctattg cacaggggcc 1141 ttcctcatct cctgcatggt ggggtcggtc atcgtctaca agatgaagag tggtaccaag 1201 aagagtgact tccacagcca gatggctgtg cacaagctgg ccaagagcat ccctctgcgc 1261 agacaggtaa cagtgtctgc tgactccagt gcatccatga actctggggt tcttctggtt 1321 cggccatcac ggctctcctc cagtgggact cccatgctag caggggtctc tgagtatgag 1381 cttcccgaag accctcgctg ggagctgcct cgggacagac tggtcttagg caaacccctg 1441 ggagagggct gctttgggca ggtggtgttg gcagaggcta tcgggctgga caaggacaaa 1501 cccaaccgtg tgaccaaagt ggctgtgaag atgttgaagt cggacgcaac agagaaagac 1561 ttgtcagacc tgatctcaga aatggagatg atgaagatga tcgggaagca taagaatatc 1621 atcaacctgc tgggggcctg cacgcaggat ggtcccttgt atgtcatcgt ggagtatgcc 1681 tccaagggca acctgcggga gtacctgcag gcccggaggc ccccagggct ggaatactgc 1741 tacaacccca gccacaaccc agaggagcag ctctcctcca aggacctggt gtcctgcgcc 1801 taccaggtgg cccgaggcat ggagtatctg gcctccaaga agtgcataca ccgagacctg 1861 gcagccagga atgtcctggt gacagaggac aatgtgatga agatagcaga ctttggcctc 1921 gcacgggaca ttcaccacat cgactactat aaaaagacaa ccaacggccg actgcctgtg 1981 aagtggatgg cacccgaggc attatttgac cggatctaca cccaccagag tgatgtgtgg 2041 tctttcgggg tgctcctgtg ggagatcttc actctgggcg gctccccata ccccggtgtg 2101 cctgtggagg aacttttcaa gctgctgaag gagggtcacc gcatggacaa gcccagtaac 2161 tgcaccaacg agctgtacat gatgatgcgg gactgctggc atgcagtgcc ctcacagaga 2221 cccaccttca agcagctggt ggaagacctg gaccgcatcg tggccttgac ctccaaccag 2281 gagtacctgg acctgtccat gcccctggac cagtactccc ccagctttcc cgacacccgg 2341 agctctacgt gctcctcagg ggaggattcc gtcttctctc atgagccgct gcccgaggag 2401 ccctgcctgc cccgacaccc agcccagctt gccaatggcg gactcaaacg ccgctgactg 2461 ccacccacac gccctcccca gactccaccg tcagctgtaa ccctcaccca cagcccctgc 2521 tgggcccacc acctgtccgt ccctgtcccc tttcctgctg gcaggagccg gctgcctacc 2581 aggggccttc ctgtgtggcc tgccttcacc ccactcagct cacctctccc tccacctcct 2641 ctccacctgc tggtgagagg tggcaaagag gcagatcttt gctgccagcc acttcatccc 2701 ctcccagatg ttggaccaac acccctccct gccaccaggc actgcctgga gggcagggag 2761 tgggagccaa tgaacaggca tgcaagtgag agcttcctga gctttctcct gtcggtttgg 2821 tctgttttgc cttcacccat aagcccctcg cactctggtg gcaggtgcct tgtcctcagg 2881 gctacagcag tagggaggtc agtgcttcgt gcctcgattg aaggtgacct ctgccccaga 2941 taggtggtgc cagtggctta ttaattccga tactagtttg ctttgctgac caaatgcctg 3001 gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc ctggggccca 3061 gccccaaact gggggctctg tatatagcta tgaagaaaac acaaagtgta taaatctgag 3121 tatatattta catgtctttt taaaagggtc gttaccagag atttacccat cgggtaagat 3181 gctcctggtg gctgggaggc atcagttgct atatattaaa aacaaaaaag aaaaaaaagg 3241 aaaacgtttt taaaaaggtc atatattttt tgctactttt gctgttttat ttttttaaat 3301 tatgttctaa acctattttc agtttaggtc cctcaataaa aattgctgct gcttcaaaaa 3361 aaaaa // LOCUS HUMFGF3H 3503 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34186 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3503) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 527 2722 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 777 a 1044 c 1005 g 677 t ORIGIN 1 gcggaaccca aggacttttc tccggtccga gctcggggcg ccccgcaccg ggacggtacc 61 cgtgctgcag tcgggcacgc cgcgggcccg ccgggggcct ccgcagggcg atggagccgg 121 tctgcaagga aagtgaggcg ccgccgctgc gttctggagg aggggggcac aaggtctgga 181 gaccccgggt ggcggacggg agccctcccc ccgccccgcc tccggggcac cagctccggc 241 tccattgttc ccgcccgggc tggaggcgcc gagcaccgag cgccgccggg agtcgagcgc 301 cggccgcgga gtcttgcgac cccgccagga cccgaacaga gcccgggggc ggcgggccgg 361 agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc ccgaggcgga 421 acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg tggagtatcc 481 atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt ggagctggaa 541 gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta ggccgtcccc 601 gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc 661 ctcttcagag gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac 721 atccccagaa aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt 781 caaatgccct tccagtggga ccccaaaccc cacactgcgc tggttggaaa atggcaaaga 841 attcaaacct gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat 901 aatggactct gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta 961 cggcagcatc aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat 1021 cctgcaagca gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat 1081 gtgtaaggtg tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa 1141 tgggagcaag attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt 1201 taataccacc gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc 1261 aggggagtat acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt 1321 gaccgttctg gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga 1381 gatcatcatc tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt 1441 ctacaagatg aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa 1501 gctggccaag agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc 1561 catgaactct ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat 1621 gctagcaggg gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga 1681 cagactggtc ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga 1741 ggctatcggg ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt 1801 gaagtcggac gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa 1861 gatgatcggg aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc 1921 cttgtatgtc atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg 1981 gaggccccca gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc 2041 ctccaaggac ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc 2101 caagaagtgc atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt 2161 gatgaagata gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa 2221 gacaaccaac ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat 2281 ctacacccac cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct 2341 gggcggctcc ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg 2401 tcaccgcatg gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg 2461 ctggcatgca gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg 2521 catcgtggcc ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta 2581 ctcccccagc tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt 2641 ctctcatgag ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa 2701 tggcggactc aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc 2761 tgtaaccctc acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc 2821 tgctggcagg agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact 2881 cagctcacct ctccctccac ctcctctcca cctgctggtg agaggtggca aagaggcaga 2941 tcttttcact gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc 3001 accaggcact gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc 3061 ttcctgagct ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac 3121 tctggtggca ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc 3181 tcgattgaag gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac 3241 tagtttgctt tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg 3301 ggggcagtgt tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga 3361 agaaaacaca aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt 3421 accagagatt tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata 3481 tattaaaaac aaaaaaaaaa aaa // LOCUS HUMFGF4H 2283 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34187 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2283) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 417 1325 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 566 a 621 c 612 g 484 t ORIGIN 1 ggagcccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga ggggggcaca 61 aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct ccggggcacc 121 agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc gccgccggga 181 gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga gcccgggggc 241 ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc 301 ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg 361 tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt 421 ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta 481 ggccgtcccc gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg 541 atgatgactc ctcttcagag gagaaagaaa cagataacac caaaccaaac cgtatgcccg 601 tagctccata ttggacatcc ccagaaaaga tggaaaagaa attgcatgca gtgccggctg 661 ccaagacagt gaagttcaaa tgcccttcca gtgggacccc aaaccccaca ctgcgctggt 721 tgaaaaatgg caaagaattc aaacctgacc acagaattgg aggctacaag gtccgttatg 781 ccacctggag catcataatg gactctgtgg tgccctctga caagggcaac tacacctgca 841 ttgtggagaa tgagtacggc agcatcaacc acacatacca gctggatgtc gtggagcggt 901 cccctcaccg gcccatcctg caagcagggt tgcccgccaa caaaacagtg gccctgggta 961 gcaacgtgga gttcatgtgt aaggtgtaca gtgacccgca gccgcacatc cagtggctaa 1021 agcacatcga ggtgaatggg agcaagattg gcccagacaa cctgccttat gtccagatct 1081 tgaaggtaat catggcacca gtcttcgtgg gccagtctac tgggaaggag accactgtct 1141 cgggggctca agttcctgtg ggcaggctca gttgcccccg aatgggatca ttcctcacgc 1201 ttcaggcaca cacactccat ctcagtaggg atctagccac atcccccagg actagtaaca 1261 gaggtcacaa agtggaggtg agctgggaac agagggctgc agggatgggt ggtgctggtc 1321 tgtaataagc tttgagagca acgtcactgg ggctttgggg tcagctacac aaggaaggca 1381 tttggacccc tgccttttca ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt 1441 cttagccctg tgttctgagt tacgtacgat ctttctggca aatggggtgc atgataagag 1501 catctcttac gaagagttgg aaaaacaaat gccatatata aattctaagc catatgagga 1561 cgaggagtaa tggcattttc ttcctttttc ctctcactcc cagacattca ttgtccctga 1621 atgctccatt aatccaggga aggtaattgc ctaaatctcc agtggatctc gcaacaggaa 1681 ggaaccagaa gctgggaaag ttgtttacct ctttgtccca gagttagacc tcatcctccc 1741 ctagcttagc tgtctcagag atatactggc cctcccttct cttctctttg ctgctggtgc 1801 taaaactgct ctgtaggtca ttggccactg tctccactca caacccctgc tccagtcctg 1861 gagggagtgg gttaaacaca aatagaacat tccatttgaa gcagtgattc tttttttttt 1921 tttttttttt taatcaaatg ctttggactt ttgaagtcca cttgttctgt acttgtaaaa 1981 gggaaagaag gccgggcgca gtcgtcacgc ctgtaatccc agcactttag atcacttgag 2041 gtcaggagtt tgagaccagc ccggccaaca tggtgaaacc ccatctctac taaaaataca 2101 aaaattagct gtgcatagtg gttggcacct gtagtcccag ctactcagga ggctgaggca 2161 agctaactgc ttgaacccag aaggcagagg ttgcagtgag ctgagatcac gccactgcac 2221 tccagcctgg gtgacagagt gagtgagact ctgcgttaaa aaaaaaaaaa aaaaaaaaaa 2281 aaa // LOCUS HUMFGF5H 1625 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) secreted form mRNA, complete cds. ACCESSION M34188 KEYWORDS FGF receptor; fibroblast growth factor receptor. SOURCE Human female placenta endothelial cell line HUVEC, cDNA to mRNA, clone h5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1625) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 523 1425 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 368 a 480 c 489 g 288 t ORIGIN 1 cggaacccaa ggacttttct ccggtccgag ctcggggcgc cccgcaggcg acggtacccg 61 tgctgcagtc gggcacgccg cgggcccggg gcctccgcag ggcgatggag cccggtctgc 121 aaggaaagtg aggcgccgcc gctgcgttct ggaggagggg ggcacaaggt ctggagaccc 181 cgggtggcgg acgggagccc tccccccgcc ccgcctccgg ggcaccagct ccggctccat 241 tgttcccgcc cgggctggag gcgccgagca ccgagcgccg ccgggagtcg agcgccggcc 301 gcggagctct tgcgaccccg ccaggacccg aacagagccc gggggcggcg ggccggagcc 361 ggggacgcgg gcacacgccc gctcgcacaa gccacggcgg actctcccga ggcggaacct 421 ccacgccgag cgagggtcag tttgaaaagg aggatcgagc tcactgtgga gtatccatgg 481 agatgtggag ccttgtcacc aacctctaac tgcagaactg ggatgtggag ctggaagtgc 541 ctcctcttct gggctgtgct ggtcacagcc acactctgca ccgctaggcc gtccccgacc 601 ttgcctgaac aagatgctct cccctcctcg gaggatgatg atgatgatga tgactcctct 661 tcagaggaga aagaaacaga taacaccaaa ccaaaccccg tagctccata ttggacatcc 721 ccagaaaaga tggaaaagaa attgcatgca gtgccggctg ccaagacagt gaagttcaaa 781 tgcccttcca gtgggacccc aaaccccaca ctgcgctggt tgaaaaatgg caaagaattc 841 aaacctgacc acagaattgg aggctacaag gtccgttatg ccacctggag catcataatg 901 gactctgtgg tgccctctga caagggcaac tacacctgca ttgtggagaa tgagtacggc 961 agcatcaacc acacatacca gctggatgtc gtggagcggt cccctcaccg gcccatcctg 1021 caagcagggt tgcccgccaa caaaacagtg gccctgggta gcaacgtgga gttcatgtgt 1081 aaggtgtaca gtgacccgca gccgcacatc cagtggctaa agcacatcga ggtgaatggg 1141 agcaagattg gcccagacaa cctgccttat gtccagatct tgaaggtaat catggcacca 1201 gtcttcgtgg gccagtctac tgggaaggag accactgtct cgggggctca agttcctgtg 1261 ggcaggctca gttgcccccg aatgggatca ttcctcacgc ttcaggcaca cacactccat 1321 ctcagtaggg atctagccac atcccccagg actagtaaca gaggtcacaa agtggaggtg 1381 agctgggaac agagggctgc agggatgggt ggtgctggtc tgtaataagc tttgagagca 1441 acgtcactgg ggctttgggg tcagctacac aaggaaggca tttggacccc tgccttttca 1501 ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt cttagccctg tgtcctgagt 1561 tacgtacgat ctttctggca aatggggtgc atgataagag catctcttac gaagagttgg 1621 aaaaa // LOCUS SYNLACZA 6476 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD1.27. ACCESSION M34296 KEYWORDS lacZ. SOURCE Cloning vector pPD1.27. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6476) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3500 3501 E.coli lacZ end/SV40 start recomb 3634 3635 SV40 end/synthetic start recomb 3657 3658 synthetic end/pUC19 start recomb 4076 4077 pUC19 end/C.elegans sup-7 start recomb 4442 4443 C.elegans sup-7 end/pUC19 start BASE COUNT 1560 a 1646 c 1676 g 1594 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc agctgagcgc cggtcgctac cattaccagt tggtctggtg tcaaaaataa 3241 taataaccgg gcaggccatg tctgcccgta tttcgcgtaa ggaaatccat tatgtactat 3301 ttaaaaaaca caaacttttg gatgttcggt ttattctttt tcttttactt ttttatcatg 3361 ggagcctact tcccgttttt cccgatttgg ctacatgaca tcaaccatat cagcaaaagt 3421 gatacgggta ttatttttgc cgctatttct ctgttctcgc tattattcca accgctgttt 3481 ggtctgcttt ctgacaaact cggaacttgt ttattgcagc ttataatggt tacaaataaa 3541 gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt 3601 tgtccaaact catcaatgta tcttatcatg tctggatcga caaagtcaaa gcggccgcct 3661 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct 3721 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 3781 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 3841 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 3901 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 3961 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4021 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa 4081 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc 4141 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt 4201 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt 4261 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt 4321 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac 4381 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg 4441 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 4501 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 4561 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 4621 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 4681 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 4741 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 4801 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 4861 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 4921 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 4981 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5041 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 5101 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 5161 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 5221 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 5281 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 5341 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 5401 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 5461 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 5521 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 5581 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct 5641 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 5701 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 5761 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 5821 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 5881 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 5941 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6001 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6061 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 6121 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 6181 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 6241 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 6301 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 6361 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 6421 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZB 6096 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD8.02. ACCESSION M34297 KEYWORDS lacZ. SOURCE Cloning vector pPD8.02. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6096) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3275 3276 synthetic end/unknown DNA start recomb 3696 3697 unknown DNA end/C.elegans sup-7 end recomb 4062 4063 C.elegans sup-7 end/pUC19 start BASE COUNT 1458 a 1571 c 1616 g 1451 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcctact agtcggccgt acgggccctt aaggccgcct gatgcggtat tttctcctta 3301 cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg 3361 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt 3421 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc 3481 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat 3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 3661 tcatgagaca ataaccctga taaatgcttc aataatacaa ttttcagaat acgttttttg 3721 tgggcttggg tatattgttt ttaatgttat acttgcagtc gtgaaatttg attttcaaat 3781 ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt gtcaaaaacc aatttcaaca 3841 aattttcgtg tgagaaatac attaccagaa ggcatttttt cacacgatta gcattttgga 3901 ctactttatt aaatttttgc gtgtaatttt gaattaaatt gtattatatt actacttaaa 3961 aaacaaaaaa tttgaccact gagcggatcg aacgcccaac ctttcgatct agagtcgaac 4021 gcgctaccat tgcgccaagc agtcatgtta ttctctcttg tcattgaaaa aggaagagta 4081 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 4141 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 4201 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 4261 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 4321 gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 4381 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 4441 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 4501 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 4561 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 4621 ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 4681 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 4741 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 4801 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 4861 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 4921 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 4981 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 5041 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 5101 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 5161 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 5221 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 5281 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 5341 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 5401 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 5461 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 5521 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 5581 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 5641 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 5701 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 5761 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 5821 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 5881 agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 5941 acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 6001 tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 6061 ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZC 7376 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD8.33. ACCESSION M34298 KEYWORDS lacZ. SOURCE Cloning vector pPD8.33. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7376) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 3524 3425 unknown DNA end/C.elegans sup-7 end recomb 4555 4556 synthetic end/pUC19 start recomb 4976 4977 pUC19 end/C.elegans sup-7 start recomb 5342 5343 C.elegans sup-7 end/pUC19 start BASE COUNT 1863 a 1793 c 1814 g 1906 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat 3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat 3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac 3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg 3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca 3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca 3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc 3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt 3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg 3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa 3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga 3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa 3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa 4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg 4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg 4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa 4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa 4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc 4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta 4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa 4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca 4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt aaggccgcct 4561 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct 4621 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 4681 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 4741 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa 4981 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc 5041 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt 5101 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt 5161 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt 5221 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac 5281 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg 5341 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 5401 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 5461 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 5521 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 5581 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 5641 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 5701 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 5761 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 5821 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 5881 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5941 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 6001 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 6061 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 6121 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 6181 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 6241 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 6301 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 6361 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 6421 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 6481 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct 6541 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 6601 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 6661 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 6721 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 6781 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 6841 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6901 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6961 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 7021 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 7081 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 7141 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 7201 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 7261 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 7321 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZD 5730 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD16.43. ACCESSION M34299 KEYWORDS lacZ. SOURCE Cloning vector pPD16.43. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 5730) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3264 3265 synthetic end/unknown DNA start recomb 3513 3514 unknown DNA end/synthetic start recomb 3524 3525 synthetic end/pUC19 start BASE COUNT 1348 a 1488 c 1580 g 1314 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcctact agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga 3301 aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg 3361 gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa 3421 ctatgcggca tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca 3481 cagatgcgta aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat 3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 3661 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta 3721 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg 3781 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 3841 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 3901 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg 3961 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 4021 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 4081 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 4141 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt 4201 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag 4261 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 4321 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 4381 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 4441 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 4501 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 4561 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 4621 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 4681 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 4741 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 4801 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 4861 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 4921 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 4981 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 5041 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 5101 cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg 5161 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 5221 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 5281 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca 5341 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 5401 ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 5461 ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 5521 caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca 5581 ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc 5641 attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga 5701 gcggataaca atttcacaca ggaaacagct // LOCUS SYNLACZE 7010 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD16.51. ACCESSION M34300 KEYWORDS lacZ. SOURCE Cloning vector pPD16.51. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7010) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 4544 4545 synthetic end/pUC19 start recomb 4793 4794 pUC19 end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 4524 4525 unknown DNA end/synthetic start recomb 4804 4805 synthetic end/pUC19 start BASE COUNT 1753 a 1710 c 1778 g 1769 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat 3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat 3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac 3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg 3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca 3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca 3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc 3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt 3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg 3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa 3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga 3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa 3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa 4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg 4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg 4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa 4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa 4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc 4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta 4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa 4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca 4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt tcgtctcgcg 4561 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct 4621 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc 4681 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 4741 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcggccttaa 4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 4981 aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 5041 attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 5101 tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 5161 gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 5221 cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 5281 tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 5341 agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 5401 tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 5461 tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 5521 tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact 5581 acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg 5641 accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg 5701 tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat 5761 cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc 5821 tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat 5881 actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt 5941 tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 6001 cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 6061 gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 6121 tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt 6181 gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 6241 gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 6301 ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 6361 acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagcattg 6421 agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 6481 cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 6541 tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 6601 gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 6661 ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 6721 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 6781 cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca 6841 ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat 6901 taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg 6961 tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct // LOCUS SYNLACZF 7088 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD18.32. ACCESSION M34301 KEYWORDS lacZ. SOURCE Cloning vector pPD18.32. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7088) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 4622 4623 synthetic end/pUC19 start recomb 4871 4872 pUC19 end/synthetic start recomb 3322 3323 synthetic end/unknown DNA start recomb 4602 4603 unknown DNA end/synthetic start recomb 4882 4883 synthetic end/pUC19 start BASE COUNT 1780 a 1724 c 1797 g 1787 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct 3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc 3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc 3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata 3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta 3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac 3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg 3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa 3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc 3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt 3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg 3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta 4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttaaa 4081 attaaacatt tttctaaatt ttatatgatt tcttttaaat ttgcaaaaat tacttaaatt 4141 tgaattcccg cgcaaatgag tgacttcatt ttctgcatta ttgtgttttc cggctatatt 4201 aataggtatt tgtttgtgtt tttctttatt ttatgattcg aactccaatt tgtaaatttt 4261 cgaacatatt tccctaaaga aaaaatatga ttaatctgga aaaattggaa aattattttt 4321 caaataaaaa acaaagaaaa aaatgaagaa aaacctatta gtttggccat aaaacgcaaa 4381 aatgtcgaaa atgacgtcac tcatctgcgc gggaaatcaa gaataattcg gcctttttta 4441 tttttttgga aaatcgtaaa acatttagaa aaatttttta atagttatag tgggactgta 4501 ttctgtcatt tagggcaaaa gccagagacg ctactccacc gttaacatga attatgaata 4561 ttattgcgac aagacccaaa cattgataaa ccgcaaatct agcctactag tcggccgtac 4621 gggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc 4681 ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc 4741 gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc agagcagatt 4801 gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac 4861 cgcatcaggc ggccttaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat 4921 aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat 4981 ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata 5041 aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 5101 tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 5161 agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 5221 cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 5281 taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 5341 tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 5401 tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 5461 cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 5521 gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 5581 cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 5641 actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 5701 ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 5761 tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 5821 tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 5881 acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 5941 ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 6001 ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 6061 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 6121 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 6181 ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 6241 aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 6301 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 6361 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 6421 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 6481 cctacagcgt gagcattgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 6541 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 6601 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 6661 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 6721 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 6781 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 6841 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 6901 cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 6961 cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 7021 ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 7081 aaacagct // LOCUS SYNLACZG 6563 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD21.28. ACCESSION M34302 KEYWORDS lacZ. SOURCE Cloning vector pPD21.28. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6563) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 4097 4098 synthetic end/pUC19 start recomb 4346 4347 pUC19 end/synthetic start recomb 3322 3323 synthetic end/unknown DNA start recomb 4077 4078 unknown DNA end/synthetic start recomb 4357 4358 synthetic end/pUC19 start BASE COUNT 1587 a 1650 c 1723 g 1603 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct 3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc 3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc 3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata 3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta 3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac 3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg 3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa 3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc 3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt 3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg 3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta 4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttcct 4081 actagtcggc cgtacgggcc ctttcgtctc gcgcgtttcg gtgatgacgg tgaaaacctc 4141 tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga 4201 caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct taactatgcg 4261 gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc 4321 gtaaggagaa aataccgcat caggcggcct taagggcctc gtgatacgcc tatttttata 4381 ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt 4441 gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 4501 acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 4561 tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 4621 agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 4681 cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 4741 aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 4801 gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 4861 agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 4921 aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 4981 gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 5041 ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 5101 aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 5161 aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc 5221 tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 5281 agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 5341 ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 5401 ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 5461 ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 5521 acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 5581 agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 5641 ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 5701 cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa 5761 gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 5821 cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 5881 gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 5941 caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc ccgaagggag 6001 aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 6061 tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 6121 gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 6181 ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 6241 atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 6301 cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 6361 caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 6421 cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 6481 accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata 6541 acaatttcac acaggaaaca gct // LOCUS SYNLACZH 6562 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD22.04. ACCESSION M34303 KEYWORDS lacZ. SOURCE Cloning vector pPD22.04. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6562) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 179 180 synthetic end/E.coli trpS start recomb 263 264 E.coli trpS end/synthetic start recomb 267 268 synthetic end/E.coli lacZ start recomb 3261 3262 E.coli lacZ end/synthetic start recomb 4096 4097 synthetic end/pUC19 start recomb 4345 4346 pUC19 end/synthetic start recomb 3321 3322 synthetic end/unknown DNA start recomb 4076 4077 unknown DNA end/synthetic start recomb 4356 4357 synthetic end/pUC19 start BASE COUNT 1587 a 1650 c 1722 g 1603 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt gagggtaccg agctcagaaa aaatgactgc tccaaagaag aagcgtaagg 181 taccggtggg tgaagaccag aaacagcacc tcgaactgag ccgcgatatt gcccagcgtt 241 tcaacgcgct gtatggcgag atcgatcccg tcgttttaca acgtcgtgac tgggaaaacc 301 ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 361 gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc 421 gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag tgcgatcttc 481 ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac gatgcgccca 541 tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc acggagaatc 601 cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag gaaggccaga 661 cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac gggcgctggg 721 tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca tttttacgcg 781 ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt tatctggaag 841 atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg cataaaccga 901 ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc agccgcgctg 961 tactggaggc tgaagttcag atgtgcggcg agttgcgtga ctacctacgg gtaacagttt 1021 ctttatggca gggtgaaacg caggtcgcca gcggcaccgc gcctttcggc ggtgaaatta 1081 tcgatgagcg tggtggttat gccgatcgcg tcacactacg tctgaacgtc gaaaacccga 1141 aactgtggag cgccgaaatc ccgaatctct atcgtgcggt ggttgaactg cacaccgccg 1201 acggcacgct gattgaagca gaagcctgcg atgtcggttt ccgcgaggtg cggattgaaa 1261 atggtctgct gctgctgaac ggcaagccgt tgctgattcg aggcgttaac cgtcacgagc 1321 atcatcctct gcatggtcag gtcatggatg agcagacgat ggtgcaggat atcctgctga 1381 tgaagcagaa caactttaac gccgtgcgct gttcgcatta tccgaaccat ccgctgtggt 1441 acacgctgtg cgaccgctac ggcctgtatg tggtggatga agccaatatt gaaacccacg 1501 gcatggtgcc aatgaatcgt ctgaccgatg atccgcgctg gctaccggcg atgagcgaac 1561 gcgtaacgcg aatggtgcag cgcgatcgta atcacccgag tgtgatcatc tggtcgctgg 1621 ggaatgaatc aggccacggc gctaatcacg acgcgctgta tcgctggatc aaatctgtcg 1681 atccttcccg cccggtgcag tatgaaggcg gcggagccga caccacggcc accgatatta 1741 tttgcccgat gtacgcgcgc gtggatgaag accagccctt cccggctgtg ccgaaatggt 1801 ccatcaaaaa atggctttcg ctacctggag agacgcgccc gctgatcctt tgcgaatacg 1861 cccacgcgat gggtaacagt cttggcggtt tcgctaaata ctggcaggcg tttcgtcagt 1921 atccccgttt acagggcggc ttcgtctggg actgggtgga tcagtcgctg attaaatatg 1981 atgaaaacgg caacccgtgg tcggcttacg gcggtgattt tggcgatacg ccgaacgatc 2041 gccagttctg tatgaacggt ctggtctttg ccgaccgcac gccgcatcca gcgctgacgg 2101 aagcaaaaca ccagcagcag tttttccagt tccgtttatc cgggcaaacc atcgaagtga 2161 ccagcgaata cctgttccgt catagcgata acgagctcct gcactggatg gtggcgctgg 2221 atggtaagcc gctggcaagc ggtgaagtgc ctctggatgt cgctccacaa ggtaaacagt 2281 tgattgaact gcctgaacta ccgcagccgg agagcgccgg gcaactctgg ctcacagtac 2341 gcgtagtgca accgaacgcg accgcatggt cagaagccgg gcacatcagc gcctggcagc 2401 agtggcgtct ggcggaaaac ctcagtgtga cgctccccgc cgcgtcccac gccatcccgc 2461 atctgaccac cagcgaaatg gatttttgca tcgagctggg taataagcgt tggcaattta 2521 accgccagtc aggctttctt tcacagatgt ggattggcga taaaaaacaa ctgctgacgc 2581 cgctgcgcga tcagttcacc cgtgcaccgc tggataacga cattggcgta agtgaagcga 2641 cccgcattga ccctaacgcc tgggtcgaac gctggaaggc ggcgggccat taccaggccg 2701 aagcagcgtt gttgcagtgc acggcagata cacttgctga tgcggtgctg attacgaccg 2761 ctcacgcgtg gcagcatcag gggaaaacct tatttatcag ccggaaaacc taccggattg 2821 atggtagtgg tcaaatggcg attaccgttg atgttgaagt ggcgagcgat acaccgcatc 2881 cggcgcggat tggcctgaac tgccagctgg cgcaggtagc agagcgggta aactggctcg 2941 gattagggcc gcaagaaaac tatcccgacc gccttactgc cgcctgtttt gaccgctggg 3001 atctgccatt gtcagacatg tataccccgt acgtcttccc gagcgaaaac ggtctgcgct 3061 gcgggacgcg cgaattgaat tatggcccac accagtggcg cggcgacttc cagttcaaca 3121 tcagccgcta cagtcaacag caactgatgg aaaccagcca tcgccatctg ctgcacgcgg 3181 aagaaggcac atggctgaat atcgacggtt tccatatggg gattggtggc gacgactcct 3241 ggagcccgtc agtatcggcg gaattccaac tgagcgccgg tcgctaccat taccaacttg 3301 tctggtgtca aaaataatag gggccgctgt catcagatcg ccatctcgcg cccgtgcctc 3361 tgacttctaa gtccaattac tcttcaacat ccctacatgc tctttctccc tgtgctccca 3421 ccccctattt ttgttattat caaaaaaact tcttcttaat ttctttgttt tttagcttct 3481 tttaagtcac ctctaacaat gaaattgtgt agattcaaaa atagaattaa ttcgtaataa 3541 aaagtcgaaa aaaattgtgc tccctccccc cattaataat aattctatcc caaaatctac 3601 acaatgttct gtgtacactt cttatgtttt ttttacttct gataaatttt ttttgaaaca 3661 tcatagaaaa aaccgcacac aaaatacctt atcatatgtt acgtttcagt ttatgaccgc 3721 aatttttatt tcttcgcacg tctgggcctc tcatgacgtc aaatcatgct catcgtgaaa 3781 aagttttgga gtatttttgg aatttttcaa tcaagtgaaa gtttatgaaa ttaattttcc 3841 tgcttttgct ttttgggggt ttcccctatt gtttgtcaag agtttcgagg acggcgtttt 3901 tcttgctaaa atcacaagta ttgatgagca cgatgcaaga aagatcggaa gaaggtttgg 3961 gtttgaggct cagtggaagg tgagtagaag ttgataattt gaaagtggag tagtgtctat 4021 ggggtttttg ccttaaatga cagaatacat tcccaatata ccaaacataa ctgtttccta 4081 ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct 4141 gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac 4201 aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg 4261 catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg 4321 taaggagaaa ataccgcatc aggcggcctt aagggcctcg tgatacgcct atttttatag 4381 gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg 4441 cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga 4501 caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat 4561 ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca 4621 gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc 4681 gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca 4741 atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg 4801 caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca 4861 gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata 4921 accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag 4981 ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg 5041 gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca 5101 acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta 5161 atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct 5221 ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca 5281 gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag 5341 gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat 5401 tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt 5461 taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa 5521 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 5581 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 5641 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 5701 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 5761 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 5821 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 5881 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 5941 accgaactga gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga 6001 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 6061 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 6121 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 6181 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 6241 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 6301 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc 6361 aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc 6421 gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca 6481 ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa 6541 caatttcaca caggaaacag ct // LOCUS SYNLACZI 6567 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD22.11. ACCESSION M34304 KEYWORDS lacZ. SOURCE Cloning vector pPD22.11. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6567) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 184 185 synthetic end/E.coli trpS start recomb 268 269 E.coli trpS end/synthetic start recomb 272 273 synthetic end/E.coli lacZ start recomb 3266 3267 E.coli lacZ end/synthetic start recomb 4101 4102 synthetic end/pUC19 start recomb 4350 4351 pUC19 end/synthetic start recomb 3326 3327 synthetic end/unknown DNA start recomb 4081 4082 unknown DNA end/synthetic start recomb 4361 4362 synthetic end/pUC19 start BASE COUNT 1587 a 1651 c 1724 g 1605 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt gcttggaggg taccgagctc agaaaaaatg actgctccaa agaagaagcg 181 taaggtaccg gtgggtgaag accagaaaca gcacctcgaa ctgagccgcg atattgccca 241 gcgtttcaac gcgctgtatg gcgagatcga tcccgtcgtt ttacaacgtc gtgactggga 301 aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 361 taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 421 atggcgcttt gcctggtttc cggcaccaga agcggtgccg gaaagctggc tggagtgcga 481 tcttcctgag gccgatactg tcgtcgtccc ctcaaactgg cagatgcacg gttacgatgc 541 gcccatctac accaacgtaa cctatcccat tacggtcaat ccgccgtttg ttcccacgga 601 gaatccgacg ggttgttact cgctcacatt taatgttgat gaaagctggc tacaggaagg 661 ccagacgcga attatttttg atggcgttaa ctcggcgttt catctgtggt gcaacgggcg 721 ctgggtcggt tacggccagg acagtcgttt gccgtctgaa tttgacctga gcgcattttt 781 acgcgccgga gaaaaccgcc tcgcggtgat ggtgctgcgt tggagtgacg gcagttatct 841 ggaagatcag gatatgtggc ggatgagcgg cattttccgt gacgtctcgt tgctgcataa 901 accgactaca caaatcagcg atttccatgt tgccactcgc tttaatgatg atttcagccg 961 cgctgtactg gaggctgaag ttcagatgtg cggcgagttg cgtgactacc tacgggtaac 1021 agtttcttta tggcagggtg aaacgcaggt cgccagcggc accgcgcctt tcggcggtga 1081 aattatcgat gagcgtggtg gttatgccga tcgcgtcaca ctacgtctga acgtcgaaaa 1141 cccgaaactg tggagcgccg aaatcccgaa tctctatcgt gcggtggttg aactgcacac 1201 cgccgacggc acgctgattg aagcagaagc ctgcgatgtc ggtttccgcg aggtgcggat 1261 tgaaaatggt ctgctgctgc tgaacggcaa gccgttgctg attcgaggcg ttaaccgtca 1321 cgagcatcat cctctgcatg gtcaggtcat ggatgagcag acgatggtgc aggatatcct 1381 gctgatgaag cagaacaact ttaacgccgt gcgctgttcg cattatccga accatccgct 1441 gtggtacacg ctgtgcgacc gctacggcct gtatgtggtg gatgaagcca atattgaaac 1501 ccacggcatg gtgccaatga atcgtctgac cgatgatccg cgctggctac cggcgatgag 1561 cgaacgcgta acgcgaatgg tgcagcgcga tcgtaatcac ccgagtgtga tcatctggtc 1621 gctggggaat gaatcaggcc acggcgctaa tcacgacgcg ctgtatcgct ggatcaaatc 1681 tgtcgatcct tcccgcccgg tgcagtatga aggcggcgga gccgacacca cggccaccga 1741 tattatttgc ccgatgtacg cgcgcgtgga tgaagaccag cccttcccgg ctgtgccgaa 1801 atggtccatc aaaaaatggc tttcgctacc tggagagacg cgcccgctga tcctttgcga 1861 atacgcccac gcgatgggta acagtcttgg cggtttcgct aaatactggc aggcgtttcg 1921 tcagtatccc cgtttacagg gcggcttcgt ctgggactgg gtggatcagt cgctgattaa 1981 atatgatgaa aacggcaacc cgtggtcggc ttacggcggt gattttggcg atacgccgaa 2041 cgatcgccag ttctgtatga acggtctggt ctttgccgac cgcacgccgc atccagcgct 2101 gacggaagca aaacaccagc agcagttttt ccagttccgt ttatccgggc aaaccatcga 2161 agtgaccagc gaatacctgt tccgtcatag cgataacgag ctcctgcact ggatggtggc 2221 gctggatggt aagccgctgg caagcggtga agtgcctctg gatgtcgctc cacaaggtaa 2281 acagttgatt gaactgcctg aactaccgca gccggagagc gccgggcaac tctggctcac 2341 agtacgcgta gtgcaaccga acgcgaccgc atggtcagaa gccgggcaca tcagcgcctg 2401 gcagcagtgg cgtctggcgg aaaacctcag tgtgacgctc cccgccgcgt cccacgccat 2461 cccgcatctg accaccagcg aaatggattt ttgcatcgag ctgggtaata agcgttggca 2521 atttaaccgc cagtcaggct ttctttcaca gatgtggatt ggcgataaaa aacaactgct 2581 gacgccgctg cgcgatcagt tcacccgtgc accgctggat aacgacattg gcgtaagtga 2641 agcgacccgc attgacccta acgcctgggt cgaacgctgg aaggcggcgg gccattacca 2701 ggccgaagca gcgttgttgc agtgcacggc agatacactt gctgatgcgg tgctgattac 2761 gaccgctcac gcgtggcagc atcaggggaa aaccttattt atcagccgga aaacctaccg 2821 gattgatggt agtggtcaaa tggcgattac cgttgatgtt gaagtggcga gcgatacacc 2881 gcatccggcg cggattggcc tgaactgcca gctggcgcag gtagcagagc gggtaaactg 2941 gctcggatta gggccgcaag aaaactatcc cgaccgcctt actgccgcct gttttgaccg 3001 ctgggatctg ccattgtcag acatgtatac cccgtacgtc ttcccgagcg aaaacggtct 3061 gcgctgcggg acgcgcgaat tgaattatgg cccacaccag tggcgcggcg acttccagtt 3121 caacatcagc cgctacagtc aacagcaact gatggaaacc agccatcgcc atctgctgca 3181 cgcggaagaa ggcacatggc tgaatatcga cggtttccat atggggattg gtggcgacga 3241 ctcctggagc ccgtcagtat cggcggaatt ccaactgagc gccggtcgct accattacca 3301 acttgtctgg tgtcaaaaat aataggggcc gctgtcatca gatcgccatc tcgcgcccgt 3361 gcctctgact tctaagtcca attactcttc aacatcccta catgctcttt ctccctgtgc 3421 tcccaccccc tatttttgtt attatcaaaa aaacttcttc ttaatttctt tgttttttag 3481 cttcttttaa gtcacctcta acaatgaaat tgtgtagatt caaaaataga attaattcgt 3541 aataaaaagt cgaaaaaaat tgtgctccct ccccccatta ataataattc tatcccaaaa 3601 tctacacaat gttctgtgta cacttcttat gtttttttta cttctgataa attttttttg 3661 aaacatcata gaaaaaaccg cacacaaaat accttatcat atgttacgtt tcagtttatg 3721 accgcaattt ttatttcttc gcacgtctgg gcctctcatg acgtcaaatc atgctcatcg 3781 tgaaaaagtt ttggagtatt tttggaattt ttcaatcaag tgaaagttta tgaaattaat 3841 tttcctgctt ttgctttttg ggggtttccc ctattgtttg tcaagagttt cgaggacggc 3901 gtttttcttg ctaaaatcac aagtattgat gagcacgatg caagaaagat cggaagaagg 3961 tttgggtttg aggctcagtg gaaggtgagt agaagttgat aatttgaaag tggagtagtg 4021 tctatggggt ttttgcctta aatgacagaa tacattccca atataccaaa cataactgtt 4081 tcctactagt cggccgtacg ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa 4141 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag 4201 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta 4261 tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag 4321 atgcgtaagg agaaaatacc gcatcaggcg gccttaaggg cctcgtgata cgcctatttt 4381 tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa 4441 atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca 4501 tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 4561 aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 4621 acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 4681 acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 4741 ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg 4801 ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 4861 caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 4921 ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 4981 aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg 5041 aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 5101 tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 5161 aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 5221 cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 5281 ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga 5341 gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 5401 agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 5461 atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 5521 cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 5581 cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 5641 cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 5701 tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact 5761 tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 5821 ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 5881 aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 5941 cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag 6001 ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 6061 agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac 6121 ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca 6181 acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg 6241 cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 6301 gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 6361 tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 6421 ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt 6481 aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg 6541 gataacaatt tcacacagga aacagct // LOCUS SYNLACZJ 7242 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD26.77. ACCESSION M34305 KEYWORDS lacZ. SOURCE Cloning vector pPD26.77. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7242) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 4776 4777 synthetic end/pUC19 start recomb 5025 5026 pUC19 end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 4756 4757 unknown DNA end/synthetic start recomb 5036 5037 synthetic end/pUC19 start BASE COUNT 1780 a 1801 c 1846 g 1815 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcgaaac aaatcatctg acaccaccac cgtctgatgg atcgttctca tctccgtctc 3301 cacattatta tccgacgact acatcgacac cgaatcgaat ggaaacaagt ccggagtaca 3361 tgtttaacca tgaaatggtg ggtagatgat tattaaaatg tttaagaaaa ttaaataatt 3421 tgttttaggc accaccggtc aatgcgatgt ggtatactac accacctcct tatcaagatc 3481 caaactatcg tcatgtgcct ccaaatactg catttcaaaa tgcagagcaa atgaatggct 3541 ccttctactg ttaatctatt taattcatta atttttcatt tattgactgt atcccggatg 3601 tttcttgtcc tcccaacata tctcctaact gctcggttca ttttaaatat gctcatctca 3661 ctacatcacc cagacactgg tccccacaga gttttttgta tactatttcg ggtcattttt 3721 cttattctag actaatattg taagctataa gttgtagaat aattattgat ccaaatcaga 3781 ttaagagtat aagctttgtt ttttctcctt ttctttataa cttgttacaa tttttgaaat 3841 tccctttttt gacaggcttt tattacactg taactgtgtt tcttatcttg caaacattta 3901 atgaattgta attctttagt atcttgaggg ctttttgttt ttcgaattat tgaagctcaa 3961 agttccagtt ttactacgat ccagcgaatt ctcctcattt cgatccgatg caattgactt 4021 cagatcaata ttggttgcct gaaagaaata attgtgagca tttttgtcaa aaaacagaga 4081 actcaccatt ctcgaggctc ccgttccagg agcagtactt ggtgatggac acgtagattg 4141 attaaaccaa accaaaggtt ctttcagagt caacttacag cctcgagcgt agtccgtgat 4201 agcttctcgc agaacactga aaattggaaa tttattggaa taaaaacttt ttctgcactt 4261 tatagaataa aaaaatcatg aatttacccg aatttaacct ccgaatcgta gaccaaattg 4321 tccaagtaga tggaaatcac cttgaacatc ggatgttttt catatgctga aaataaatta 4381 atgaatttat gtaatttttt aaataattac ttttcaattt ggtgaacaat tcctgcttct 4441 ttgcataggc atctggacga gtgagtcctt tccaatcaat caatgtggtg tcgacctcga 4501 gggggggccc ggtacccagc ttttgttccc tttagtgagg gttaattccg agcttggcgt 4561 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 4621 taggagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagg taactcacat 4681 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 4741 aatgaatcgg ccaacgccta ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg 4801 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 4861 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 4921 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg 4981 tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcggcctt aagggcctcg 5041 tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg 5101 gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 5161 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 5221 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 5281 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 5341 gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 5401 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 5461 tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 5521 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 5581 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 5641 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 5701 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 5761 cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 5821 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 5881 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 5941 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 6001 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 6061 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 6121 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 6181 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 6241 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 6301 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 6361 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 6421 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 6481 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 6541 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 6601 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcat tgagaaagcg 6661 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 6721 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 6781 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 6841 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 6901 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 6961 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 7021 cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 7081 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 7141 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 7201 gtggaattgt gagcggataa caatttcaca caggaaacag ct // LOCUS SYNLACZK 6620 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD34.110. ACCESSION M34306 KEYWORDS lacZ. SOURCE Cloning vector pPD34.110. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6620) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 237 238 synthetic end/E.coli trpS start recomb 321 322 E.coli trpS end/synthetic start recomb 325 326 synthetic end/E.coli lacZ start recomb 3319 3320 E.coli lacZ end/synthetic start recomb 4154 4155 synthetic end/pUC19 start recomb 4403 4404 pUC19 end/synthetic start recomb 3379 3380 synthetic end/unknown DNA start recomb 4134 4135 unknown DNA end/synthetic start recomb 4414 4415 synthetic end/pUC19 start BASE COUNT 1592 a 1665 c 1731 g 1632 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc tcgagaaagc tggcaaaggg ctcttgtcct gctaatcgta 181 ctactcttca tcgtcatctt cgttattact gttttgttcg tcataagatc taacaaggta 241 ccggtgggtg aagaccagaa acagcacctc gaactgagcc gcgatattgc ccagcgtttc 301 aacgcgctgt atggcgagat cgatcccgtc gttttacaac gtcgtgactg ggaaaaccct 361 ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 421 gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 481 tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct 541 gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc 601 tacaccaacg taacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg 661 acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg 721 cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc 781 ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc 841 ggagaaaacc gcctcgcggt gatggtgctg cgttggagtg acggcagtta tctggaagat 901 caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact 961 acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta 1021 ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct 1081 ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc 1141 gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa 1201 ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac 1261 ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat 1321 ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat 1381 catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg 1441 aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac 1501 acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc 1561 atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc 1621 gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg 1681 aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat 1741 ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt 1801 tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc 1861 atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc 1921 cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat 1981 ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat 2041 gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc 2101 cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa 2161 gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc 2221 agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat 2281 ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg 2341 attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc 2401 gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag 2461 tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat 2521 ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac 2581 cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg 2641 ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc 2701 cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa 2761 gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct 2821 cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat 2881 ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg 2941 gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga 3001 ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat 3061 ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc 3121 gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc 3181 agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa 3241 gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg 3301 agcccgtcag tatcggcgga attccaactg agcgccggtc gctaccatta ccaacttgtc 3361 tggtgtcaaa aataataggg gccgctgtca tcagatcgcc atctcgcgcc cgtgcctctg 3421 acttctaagt ccaattactc ttcaacatcc ctacatgctc tttctccctg tgctcccacc 3481 ccctattttt gttattatca aaaaaacttc ttcttaattt ctttgttttt tagcttcttt 3541 taagtcacct ctaacaatga aattgtgtag attcaaaaat agaattaatt cgtaataaaa 3601 agtcgaaaaa aattgtgctc cctcccccca ttaataataa ttctatccca aaatctacac 3661 aatgttctgt gtacacttct tatgtttttt ttacttctga taaatttttt ttgaaacatc 3721 atagaaaaaa ccgcacacaa aataccttat catatgttac gtttcagttt atgaccgcaa 3781 tttttatttc ttcgcacgtc tgggcctctc atgacgtcaa atcatgctca tcgtgaaaaa 3841 gttttggagt atttttggaa tttttcaatc aagtgaaagt ttatgaaatt aattttcctg 3901 cttttgcttt ttgggggttt cccctattgt ttgtcaagag tttcgaggac ggcgtttttc 3961 ttgctaaaat cacaagtatt gatgagcacg atgcaagaaa gatcggaaga aggtttgggt 4021 ttgaggctca gtggaaggtg agtagaagtt gataatttga aagtggagta gtgtctatgg 4081 ggtttttgcc ttaaatgaca gaatacattc ccaatatacc aaacataact gtttcctact 4141 agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga 4201 cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa 4261 gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca 4321 tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta 4381 aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat ttttataggt 4441 taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg 4501 cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 4561 ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 4621 ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 4681 aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 4741 actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 4801 gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 4861 agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 4921 cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 4981 catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5041 aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5101 gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5161 aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5221 agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5281 ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5341 actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5401 aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 5461 gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 5521 atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg 5581 tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 5641 tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 5701 ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 5761 agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa 5821 ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 5881 tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 5941 gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6001 cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg aagggagaaa 6061 ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6121 agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6181 tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6241 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6301 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6361 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6421 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 6481 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 6541 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 6601 atttcacaca ggaaacagct // LOCUS SYNLACZL 5808 bp ds-DNA SYN 03-JUL-1990 DEFINITION Cloning vector pPD16.01. ACCESSION M34307 KEYWORDS lacZ. SOURCE Cloning vector pPD16.01. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 5808) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 3342 3343 synthetic end/unknown DNA start recomb 3591 3592 unknown DNA end/synthetic start recomb 3602 3603 synthetic end/pUC19 start BASE COUNT 1375 a 1502 c 1599 g 1332 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggcctactag tcggccgtac gggccctttc gtctcgcgcg 3361 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg 3421 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg 3481 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat 3541 gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc ggccttaagg 3601 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt 3661 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 3721 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 3781 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 3841 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 3901 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 3961 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 4021 cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc 4081 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 4141 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 4201 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 4261 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 4321 acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 4381 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac 4441 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 4501 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 4561 tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg 4621 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac 4681 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg 4741 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 4801 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 4861 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 4921 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 4981 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 5041 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 5101 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 5161 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag 5221 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 5281 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 5341 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 5401 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 5461 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 5521 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 5581 aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt 5641 aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta 5701 atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta 5761 tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct // LOCUS HUMMHDQ3L 967 bp ds-DNA PRI 03-JUL-1990 DEFINITION Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33841 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 967) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 961 967 inverted repeat B rpt 560 810 R region signal 792 797 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 530 536 TATA box BASE COUNT 254 a 235 c 222 g 256 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct 121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc 301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt 361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa 421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa 481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg 541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg 601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc 661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg 721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc 781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga 841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct 901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac 961 ccctaca // LOCUS HUMMHDQ5L 960 bp ds-DNA PRI 03-JUL-1990 DEFINITION Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33842 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 960) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 954 960 inverted repeat B rpt 555 803 R region signal 785 790 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 525 531 TATA box BASE COUNT 250 a 232 c 219 g 259 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct 121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag 301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa 361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt 421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa 481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt 541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca 601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac 661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct 721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat 781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg 841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa 901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca // LOCUS MUSMHEBF1 573 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouses MHC class II E-beta-f gene, exon 1. ACCESSION M35677 M34123 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 3 SOURCE Mouse inbred strain B10.M) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 573) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept 301 + 394 MHC E-beta-f, exon 1 IVS 395 > 573 MHC E-beta-f intron A BASE COUNT 135 a 151 c 138 g 149 t ORIGIN Chromosome 17. 1 cagctgcctc tgcctcctga gtgctgggat atgaggcatg gccagcagcc cagactgtgt 61 atccatgtaa tgaagagaac tgcaagtttc agaagggaac ctgcaaactg aatctctaac 121 taggaactga tgatgctgaa cttctttgat gctgattggc tcccagcact ggccttaccc 181 aatccagtgg caaagcagtg aatgccctgt ctcttattat cttagcaatg agtaaagaga 241 ataaagttac agtctgaagc ttgccttccc ctctgactct cgtgtctcct ctcctgcagc 301 atgatgtggc tccccagagt tccctgtgtg gcagctgtga tcctgttgct gacagtgctg 361 agccctccag tggctttggt cagagactcc agacgtaaat gcacacctca ggtgctggga 421 tgctcggggt cggggaagga aggagctaac attctcactg tccagtccaa gtccctcgaa 481 actattgata tcttctgtga gcatgcacag tcctcacatg aactctaaac tatgtcccca 541 aacagacgcc tggatgtttg tgctctcaga tct // LOCUS MUSMHEBF2 495 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouses MHC class II E-beta-f gene, exon 2. ACCESSION M35678 M34123 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 3 SOURCE Mouse inbred strain B10.M) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 495) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept + 61 + 330 MHC E-beta-f, exon 2 IVS < 1 60 MHC E-beta-f intron A IVS 331 > 495 MHC E-beta-f intron B BASE COUNT 111 a 112 c 180 g 92 t ORIGIN About 3.0 kb after segment 1; chromosome 17. 1 cagctgagag ggactcgggc atcttgtcgg cagagaagaa gataattctt gtctccacag 61 catggttttt ggaatactgt aaatctgagt gtcatttcta caacgggacg cagcgcgtgc 121 ggtttctgaa aagatacttc tacaacctgg aggagaacct gcgcttcgac agcgacgtgg 181 gcgagttccg cgcggtgacc gagctggggc ggccagacgc cgagaactgg aacagccagc 241 cggagatcct ggaggatgcg cgggccgcgg tggacacgta ctgcagatac aactatgaga 301 tcttggataa attccttgtg cggcggagag gtgagacagg acagggtggg tggggcggaa 361 ccacggtgag ggtggggctg tggggagcag cagaaggcgg tgcgcatgtg cgcaggagcc 421 gcagggaatg ctgggttccc tgcagctgga gccacaggcg cttttaagca gcctcttggc 481 aggggaacgg aattc // LOCUS MUSMHEBF3 2155 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouses MHC class II E-beta-f gene, exons 3,4,5 and 6. ACCESSION M35679 M34123 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 3 of 3 SOURCE Mouse inbred strain B10.M) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2155) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept + 122 403 MHC E-beta-f, exon 3 971 1081 MHC E-beta-f, exon 4 1484 1507 MHC E-beta-f, exon 5 1802 1815 MHC E-beta-f, exon 6 IVS < 1 121 MHC E-beta-f intron B IVS 404 970 MHC E-beta-f intron C IVS 1082 1483 MHC E-beta-f intron D IVS 1508 1801 MHC E-beta-f intron E BASE COUNT 459 a 586 c 569 g 541 t ORIGIN About 3.9 kb after segment 2; chromosome 17. 1 gatccattct ggatggatag atggaggtag gcaggcaggc aggcaggcag gcaggcatgc 61 agacagccta caaggaggac agctccaccc tcatggctcc ttctcacctc tctttctcta 121 gttgagccta cggtgactgt gtaccccaca aagacgcagc ccctggaaca ccacaacctc 181 ctggtctgct ctgtgagtga cttctaccct ggcaacattg aagtcagatg gttccggaat 241 ggcaaggagg agaaaacagg aattgtgtcc acgggcctgg tccgaaatgg agactggacc 301 ttccagacac tggtgatgct ggagacggtt cctcagagtg gagaggttta cacctgccag 361 gtggagtatc ccagcctgac cgaccctgtc acggtcgagt ggagtgagtg gtaacttcca 421 gactctgtga atgcccgccc gggtgggtgt ggtttatccc tgcctgtcag ctttctccac 481 ccacacactc tttccactgg ctttgtgctg tcctgccttt caccatggct tacagtgtag 541 gtgcgtgaag cttctacaag cacagttgcc ccctgggaag cagttatgcc cccatagact 601 catctgagcc tgccagtgac ataacaggtc ctggaatctt cttggcccct gctgcagtct 661 ctgccgttgc tgggttgtgt tcctcctgct gctgctgctg ctgacgatgg acaaggagca 721 gtgcagggtc atgactgaac tcagggacat atagtcatag ctctgccttt gctacccctc 781 agagctcagc agcttcctgt cagctcggct caggcctgtt tggttggttt ctcaacatga 841 ccaggaatgt tgacagccag atcttctaga acacacttct tccttgggct caaagctccg 901 agtctcaggg gtccggagtg gaaatgggat ttgggctaaa accctccaaa cctttggctt 961 cctttctcag aagcacagtc cacatctgca cagaacaaga tgttgagtgg agttgggggc 1021 ttcgtgctgg gcctcctctt cctcggagcg gggctgttca tctacttcag gaaccagaaa 1081 ggtaaggagc ctggtgggag ccccaactcc atagcatttc agggaaaagc catggctttg 1141 ttctcaggat gccattggcc ctgtgacctc aggtttcatt ggattctgaa tgcaacagtc 1201 tgtggttact tgatttgacc ctgaggaggg ataacacatg ggagagttaa gttgattctg 1261 gcttgagacc tgaggacaga ggaaggctgg ggggagccat gggcactgcc ggtgactgaa 1321 gctccctaag cccctccctc tgtccatgct cctcttggtt ctgtgtgctc tgggcagtat 1381 taccagagga atctcaggtg gcagctcaga gtctggggac atgtgtctgg ggacagatct 1441 gccttcatgc atgtaagcat ctattttatt ctctcttttc taggacagtc tggacttcag 1501 ccaacaggta acacccattg tcttctctca gagacagatc tgctttccct acagtatggg 1561 ggctggggtg atggactcag ggcacaaaat ggggaagact gagatcccag ggttggccag 1621 gcagttagca ctgagccttg ctccctgcac ttactgaagc ctgtgctctg aagcagcaat 1681 gactcggggc atgagaagtt cctctctgct cactgccatg ctgtaaggag aggcctgaag 1741 cagtcagaga agccactgca gagtgaggtc tggaaacagc cctgtcccct gtgctctaca 1801 ggactcctga gctgagatga agtaacaagg ctgaaggaag gagttccccc ccgtgtctcc 1861 atgccatgaa aacatgtcct gcttggccca catccctcca gagacactgc tcttccagga 1921 cctggctcct cctgattctc caccctggag atctgtgctc ctgatggctg cttatccctg 1981 acccaggcct tgcagctccc agaacagagg ccccactctt cacatctcct gtcccctttt 2041 gtcccttgcc ttttgtctgg cacttctgag ccagtctgct gtcatatgct tttttacatt 2101 tttctcaaat aaacaaataa tgaaagtcat ctgcttcata gagtttcaag cagaa // LOCUS MUSMHEBQ1 574 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouses MHC class II E-beta-q gene, exon 1. ACCESSION M35680 M34124 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 3 SOURCE Mouse inbred strain B10.M) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 574) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept 301 + 394 MHC E-beta-q, exon 1 IVS 395 > 574 MHC E-beta-q intron A BASE COUNT 132 a 152 c 143 g 147 t ORIGIN Chromosome 17. 1 cagctgcctc tgcctcctga gtgctgggat atgaggcatg gccagcagcc cagactgagt 61 atccatgtaa tgaagagaac tgcaagtttc agaaggggac ctgcaaactg aatctctaac 121 tagcaactga tgatgctgga ctcctttgat gctgattggc tcccagcact ggccttaccc 181 aatccagtgg caaagcagtg aatgccctgt ctcttattat cttagcaatg agtaaagaga 241 ataaagttac agtctgaagc ttgccttccc ctctgactcc tgtgtctcct ctcctgcagc 301 atggtgtggc tccccagagt tccctgtgtg gcagctgtga tcctgttgct gacagtgctg 361 agccctccag tggctttggt cagagactcc agacgttaag tgcacacctc aggtgctggg 421 atgctcgggg tcggggaagg aaggagctaa cattctcact gtccaggcca agtccctcgg 481 aactattgat atcttctgtg agcatgcaca gtcctcacat gaactctaaa ctatgtcccc 541 aaacagaagc ctggatgttt gtgctctcag atct // LOCUS MUSMHEBQ2 495 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouses MHC class II E-beta-q gene, exon 2. ACCESSION M35681 M34124 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 3 SOURCE Mouse inbred strain B10.G) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 495) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept + 61 + 330 MHC E-beta-q, exon 2 IVS < 1 60 MHC E-beta-q intron A IVS 331 > 495 MHC E-beta-q intron B BASE COUNT 113 a 116 c 176 g 90 t ORIGIN About 3.0 kb after segment 1; chromosome 17. 1 cagctgagag ggactcgggc atcttgtcgg cagagaagaa gataattctt gtctccacag 61 catggttttt ggaatactgt aaatctgagt gtcatttcta caacgggacg cagcgcgtgc 121 ggtttctgaa aagatacttc tacaacctgg aggagaacct gcgcttcgac agcgacgtgg 181 gcgagttccg cgcggtgacc gagctggggc ggccagacgc cgagaactgg aacagccagc 241 cggagatcct ggagcaaaag cgggccgcgg tggacacgta ctgcagacac aactatgaga 301 tcttcgataa cttccttgtg cggcggagag gtgagacagg acagggtggc tggggcggaa 361 ccacggtgag ggtggggctg tggggagcag cagaaggcgg tgcgcatgtg cgcaggagcc 421 gcagggaatg ctgggttccc tgcagctgga gccacaggcg cttttaagca gcctcttggc 481 aggggaacgg aattc // LOCUS MUSMHEBQ3 2159 bp ds-DNA ROD 03-JUL-1990 DEFINITION Mouses MHC class II E-beta-q gene, exons 3,4,5 and 6. ACCESSION M35682 M34124 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 3 of 3 SOURCE Mouse inbred strain B10.G) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2159) AUTHORS Begovich,A.B., Vu,T.H. and Jones,P.P. TITLE Characterization of the molecular defects in the mouse E-beta-f and E-beta-q genes: Implications for the origin of MHC polymorphism JOURNAL J. Immunol. 144, 1957-1964 (1990) STANDARD full staff_review FEATURES from to/span description pept + 126 407 MHC E-beta-q, exon 3 975 1085 MHC E-beta-q, exon 4 1488 1511 MHC E-beta-q, exon 5 1806 1819 MHC E-beta-q, exon 6 IVS < 1 125 MHC E-beta-q intron B IVS 408 974 MHC E-beta-q intron C IVS 1086 1487 MHC E-beta-q intron D IVS 1512 1805 MHC E-beta-q intron E BASE COUNT 459 a 589 c 572 g 539 t ORIGIN About 3.9 kb after segment 2; chromosome 17. 1 gatccattct ggatggatag atggaggtag gcaggcaggc aggcaggcag gcaggcaggc 61 atgcagacag cctacaagga ggacagctcc accctcatgg ctccttctca cctctctttc 121 tctagttgag cctacggtga ctgtgtaccc cacaaagacg cagcccctgg aacaccacaa 181 cctcctggtc tgctctgtga gtgacttcta ccctggcaac attgaagtca gatggttccg 241 gaatggcaag gaggagaaaa caggaattgt gtccacgggc ctggtccgaa atggagactg 301 gaccttccag acactggtga tgctggagac ggttcctcag agtggagagg tttacacctg 361 ccaggtggag catcccagcc tgaccgaccc tgtcacggtc gagtggagtg agtggtaact 421 tccagactct gtgaatgccc gcccgggtgg gtgtggttta tccccgcctg tcagctttct 481 ccacccacac actctttcca ctggctttgt gctgtcctgc ctttcaccat ggcttacagg 541 gtaggtgcgt gaagcttcta caagcacagt tgccccctgg gaagcagtta tgcccccata 601 gactcatctg agcctgccag tgacataaca ggtcctggaa tcttcttggc ccctgctgca 661 gtctctgccg ttgctgggtt gtgttcctcc tgctgctgct gctgctgacg atggacaagg 721 agcagtgcag ggtcatgact gaactcaggg acatatagtc atagctctgc ctttgctacc 781 cctcagagct cagcagcttc ctgtcagctc ggctcaggcc tgtttggttg gtttctcaac 841 atgaccagga atgttgactg ccagatcttc tagaacacac ttcttccttg ggctcaaagc 901 tccgagtctc aggggtccgg agtggaaatg ggatttgggc taaaaccctc caaacctttg 961 gcttcctttc tcagaagcac agtccacatc tgcacagaac aagatgttga gtggagttgg 1021 gggcttcgtg ctgggcctcc tcttcctcgg agcggggctg ttcatctact tcaggaacca 1081 gaaaggtaag gagcctggtg ggagccccaa ctccatagca tttcagggaa aagccatggc 1141 tttgttctca ggatgccatt ggccctgtga cctcaggttt cattggattc tgaatgcaac 1201 agtctgtggt tacttgattt gaccctgagg agggataaca catgggagag ttaagttgat 1261 tctggcttga gacctgagga cagaggaagg ctggggggag ccatgggcac tgccggtgac 1321 tgaagctccc taagcccctc cctctgtcca tgctcctctt ggttctgtgt gctctgggca 1381 gtattaccag aggaatctca ggtggcagct cagagtctgg ggacatgtgt ctggggacag 1441 atctgccttc atgcatgtaa gcatctattt tattctctct tttctaggac agtctggact 1501 tcagccaaca ggtaacaccc attgtcttct ctcagagaca gatctgcttt ccctacagta 1561 tgggggctgg ggtgatggac tcagggcaca aaatggggaa gactgagatc ccagggttgg 1621 ccaggcagtt agcactgagc cttgctccct gcacttactg aagcctgtgc tctgaagcag 1681 caatgactcg gggcatgaga agttcctctc tgctcactgc catgctgtaa ggagaggcct 1741 gaagcagtca gagaagccac tgcagagtga ggtctggaaa cagccctgtc ccctgtgctc 1801 tacaggactc ctgagctgag atgaagtaac aaggctgaag gaaggagttc ccccccgtgt 1861 ctccatgcca tgaaaacatg tcctgcttgg cccacatccc tccagagaca ctgctcttcc 1921 aggacctggc tcctcctgat tctccaccct ggagatctgt gctcctgatg gctgcttatc 1981 cctgacccag gccttgcagc tcccagaaca gaggccccac tcttcacatc tcctgtcccc 2041 ttttgtccct tgccttttgt ctggcacttc tgagccagtc tgctgtcata tgctttttta 2101 catttttctc aaataaacaa ataatgaaag tcatctgctt catagagttt caagcagaa // LOCUS RATHPA1 3282 bp ds-DNA ROD 03-JUL-1990 DEFINITION Rat haptoglobin (Hp) gene, exons 1,2 and 3. ACCESSION M34230 KEYWORDS haptoglobin. SEGMENT 1 of 3 SOURCE Rat (strain Wistar) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3282) AUTHORS Marinkovic,S. and Baumann,H. TITLE Structure, hormonal regulation, and identification of the interleukin-6- and dexamethasone-responsive element of the rat haptoglobin gene JOURNAL Mol. Cell. Biol. 10, 1573-1583 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1065 1069 haptoglobin (Hp), exon 1 2797 2879 haptoglobin, exon 2 3125 + 3226 haptoglobin, exon 3 pre-msg 1032 > 3282 Hp mRNA and intron IVS 1070 2796 Hp intron A IVS 2880 3124 Hp intron B IVS 3227 > 3282 Hp intron C BASE COUNT 858 a 743 c 830 g 831 t 20 others ORIGIN 1 ctaatttaaa aacgtttttt aaaacgagtg aagccattta ggttgagcgg ctacattagc 61 gtgaacagca ttccagaaca ggtgtcgggc tgaaacattg gttttctcct gggctgcgaa 121 cacagcgagt ctgccattga ggactctgtc tctacactag catgtggtgt ggctttccgc 181 taacaacaat cagaggagac acagcaggct catttcactg atttcaaatc ggaagacttt 241 tagcaacagg aagatgtcct catgggtcgg gaagcaactg tgaaacggaa ccgatttctt 301 tttactgttc tgtgggcgag actgcaggaa tttctacact ggatttaagt gattccgaga 361 taagtccaga gaagggagcc agtacaaggg tcccatgtca gtctacctat agagctttag 421 tcactctgag attgaagagc agtggaccaa gacccaataa ctcagtctgc tgcctgcaaa 481 ttccagagct ctccacaccc aggagatggt catgcttggg caggagagtt gaaaaaagaa 541 aagacttctt ttatagtctg agttaagggc tgggtcacaa gggtgtttaa aaaaaaaaaa 601 aaagagggct ggggatttag ctcagtggta gagcgcttac ctaggaagca caaggcctgg 661 gttcggtccc agctcgaaaa aaagaccaaa aaaaaaaaaa aaaaaaaaaa aaaaagagag 721 gtctcgtccc tctcccagtt aagtatcaga ttaacagccc ctattccccg tcccactctc 781 tggggttatc acactgcggt gggtgggagg ggtcgtgaag ttgctagatt tcttcatgat 841 ttgtaaaata acaccacgag gagagccaag tatgaagcaa gagctcagct cttgaaaagg 901 ggtttgcttt gtggttactg gaacagtcac tgaccttagc aaggccgaca ttgtgcaaac 961 acagaaatgg aagaaaagga ggtggggtga aaccgaagca taaaaagggt gagcaggagt 1021 cagcacagcg cacgccttct ggaaagaggt gagagaggcc cacgatgagg tgagtccaca 1081 gtccacactt ttgggcacac aatgcagatg tctctgggag agtgagaaaa tgggatgcag 1141 gaacagggcc gatgggcacc gttctgtggg agttaagccc gcagcctgca ggcgcatatg 1201 gcgagggata gagctgtgga tgcattgcaa cacactgtaa acttacctga agcgttgtga 1261 gacttttttt tttttttttg gtcttttttt tcggagctgg ggaccgaacc caggccttgc 1321 tgccttccta ggcaaagtcc gctctaccac tgagctaaat tccccaaccc cgcgttgtga 1381 gacttttgtt ttataacttg actatgcagt ttgagtgtga attttgttgg gtgaagacct 1441 caggctgaaa tgtcaaaggc aggaagtgaa gggaccagtg acaaagcccc ttcctccctg 1501 tgtccatgag agatgggcag gacagacagg gctttctatc tctaaggagg atctttccca 1561 gtgagatgaa aggttttgtt ttttaccagg catgcagcag cttcctggga tgctggctgt 1621 gctgttaaca gacttcctgc ttttaaagga acaaagacaa tagtcacaca gtctagtggc 1681 accatcaagg catccccctt cctttttaaa atcaaaatat aaagactttg aaggttacaa 1741 aaagactaga agcatagtgt ccaaaaggaa ttcctaactg gccagaatct acagggaatt 1801 ggttaccgtt taagtgtggt ctgtgtacca atggtggcca caagtcatgc tgagaggaag 1861 ccagttttct ccaggtaact tctggtttga tacacaatcc ctttttttaa aattatttat 1921 ttatttattt gtttgtttgt ttctgtgagt acactgtcgc tgtctttaga cacaccagaa 1981 gagggcatcg gatctcatta cagaggttgt gagccaccat gtgttgctgg gaattgaact 2041 caggacctct ggaagagcag tcagtcgtct taaccgctgg gaattgaact caggacctct 2101 ggaagagcag tcagtgctct taaccgctga gccctctctc cagccctgat atataacctt 2161 aagaccaaat acttatgaag taataggagc aagcacatgt gagttatata catatgtata 2221 tatttgggtc atagtgcaca cccagggatt ctagagctga ggcagggtga agtctgggag 2281 ttcaggagtt gtgacagcta gaaagatgga ctgtgtctnn nnnnnnnnnn nnnnnnnnta 2341 accttttcat tttggaattc caaaaagaga agagccaaat aaattagagc catcatcttt 2401 aagttagcta cgatgtccta acaatgtctt catagctgga acttaatgat gcgtgcagag 2461 gcttcccctt gctgacgttg tggtcaccac cagaggcaga ggcagaggca gaggctcact 2521 ttgctctgtg cctcctcccc agttggttct tgttccacct cccactctcg ggcgggagac 2581 aggcacttgt tatgtagcac tacgtaaagc cccgatcctc ctgcctcaga gtggagagct 2641 ggggtagcac atatgcttcc acactggtgc tgctttcctt cgggtcatgg tgctcccttt 2701 ctaagcttct acaaaattcc ccagtgacac cttgcttgcg tgtaatgcac aaatgcaaga 2761 agaccaactc tactccttct tgccacttct ctacagagcc ctgggagctg tcgtcactct 2821 cctgctctgg ggtcagcttt ttgctgtgga attgggcaat gatgccacag acattgaagg 2881 tgagtctcag gggtttccca ggagctgtgc accccagcag gctgtggccc tgtctgacca 2941 catcagtccc gcactgtatt aaggaagacc cagacctcct ctcgcctaga ccctcggggc 3001 ctcccggcct cagcttccac tcggtgcaag ggagtctggt gttcagggca gctccgtctc 3061 ttctggcttt gcacggggag catctgatca ccacagccct ttcctcgctt ctttctcttg 3121 gcagatgaca gctgcccaaa gcccccagag attgcaaacg gctatgtgga acacttggtt 3181 cgttatcgct gccgacagtt ctacaaacta cagaccgaag gagatggtaa ggctgtttga 3241 gcgggtaggg ctaggctgtc acaccagaac ttaagtgctg ct // LOCUS RATHPA2 482 bp ds-DNA ROD 03-JUL-1990 DEFINITION Rat haptoglobin (Hp) gene, exon 4. ACCESSION M34231 KEYWORDS haptoglobin. SEGMENT 2 of 3 SOURCE Rat (strain Wistar) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 482) AUTHORS Marinkovic,S. and Baumann,H. TITLE Structure, hormonal regulation, and identification of the interleukin-6- and dexamethasone-responsive element of the rat haptoglobin gene JOURNAL Mol. Cell. Biol. 10, 1573-1583 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 296 + 370 haptoglobin (Hp), exon 4 IVS < 1 295 Hp intron C IVS 371 > 482 Hp intron D BASE COUNT 118 a 108 c 120 g 136 t ORIGIN 1 ttaacccgtg agccgtctcc agtccaggga gtgtagtcta tctacgactt tgtacagcct 61 acattcctga caatttctaa gagcttcatt gtgtctttaa agctcccgtg gttgtcatag 121 cctccttttg ggagagacac tctttaattc cattttttca atgaggaaac tgaggacgga 181 gatgccaagg tagcttgtga ggggaagagt cttgatctga actctgacct cttcctgtcc 241 aactctttca tcaggccaca ttcattttct ctgagctcac ctccttttgt ttcaggaatc 301 tacaccttaa acagtgagaa gcaatgggtg aacccagctg ctggcgataa actccccaag 361 tgtgaggcag gtgggtgttg aggtcttaaa gcatggggct aaaatggggc catgtttctc 421 ttgtgtgcct gagtgagtaa gacagggtca gagagacacg ctgcaaagga ggacaatgac 481 ta // LOCUS RATHPA3 1245 bp ds-DNA ROD 03-JUL-1990 DEFINITION Rat haptoglobin (Hp) gene, exon 5. ACCESSION M34232 KEYWORDS haptoglobin. SEGMENT 3 of 3 SOURCE Rat (strain Wistar) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1245) AUTHORS Marinkovic,S. and Baumann,H. TITLE Structure, hormonal regulation, and identification of the interleukin-6- and dexamethasone-responsive element of the rat haptoglobin gene JOURNAL Mol. Cell. Biol. 10, 1573-1583 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 329 1107 haptoglobin (Hp), exon 5 IVS < 1 328 Hp intron D BASE COUNT 309 a 297 c 342 g 297 t ORIGIN 1 ctgcagaggc tctggaagaa tcagccacca ctgcttgcga aaccaacagt acaggaacac 61 tgcccttgcc acctgctccg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 121 tgtgtgtgtg tgtgtacgtg tgtataaata tatatatgta tatacctaca tacatatgta 181 atcgtcatca cacatacaca ttccacaatc ctcttgaaag tcaatgacag acctgaaagc 241 tgtgtacatt tcattcttag acaaagttgc cctgcagggg cctggtgtga actgctgctc 301 acatcggtct ctcctcctcc ctccgcagtg tgtgggaagc ccaagcatcc tgtggaccag 361 gtacagcgca tcatcggtgg ttccatggac gccaaaggca gctttccttg gcaggccaag 421 atgatctcca gacatggact caccactggg gccacactga tcagtgacca gtggctgctg 481 accactgccc aaaacctctt cctgaatcac agtgagaatg cgacagccaa ggacattgcc 541 cctaccttaa cactctatgt ggggaaaaac cagctggtgg agattgagaa ggtagttctc 601 caccccgagc gctctgtggt ggatatcggg ctgatcaagc tcaaacagaa agtgcttgtc 661 actgagaaag tcatgcctat ctgcctgcct tccaaagact acgtagcgcc aggccgcatg 721 ctatgtgtcc ggttgggggc gcggaatgtc aactttagat ttactgaacg tctcaagtat 781 gtcatgctgc ctgtggctga ccaggagaag tgtgagctgc actatgagaa aagcacagtg 841 cctgagaaga aaggcgctgt aactcctgtt ggggtacagc ccatcttgaa taagcatacc 901 ttctgtgctg gccttaccaa gtatgaggaa gacacttgct atggtgacgc tggcagtgcc 961 tttgccgtcc atgacacgga ggaggacacc tggtatgcag ctgggatcct gagctttgac 1021 aagagttgtg ccgtagctga gtatggtgtg tatgtgaagg caactgatct gaaggactgg 1081 gtccaggaaa caatggccaa gaactagttc agggctgact agagggctgc acacagtggg 1141 gcagggcaat tcaccctgga agaggaagta gaagggttgg ggacataatc tgagggctgc 1201 tagccctgca ttgctcagtc aataataaaa aacgagcttt ggacc // LOCUS MUSTCAXL 331 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma LD1. ACCESSION M34194 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 331 T-cell receptor alpha-chain (AA at 1) BASE COUNT 71 a 86 c 89 g 85 t ORIGIN 1 cagtcagtga cgcagcccga tgctcgtgtc actgtctctg aaggagcctc tctgcagctg 61 agatgcaagt attcctcctc tgtgacacct tatctgttct ggtatgtcct gtacccgcgg 121 caggggctgc agctgctcct caagtactat tccggagacc cagtggttca aggagtgaat 181 ggctttgagg ctgagttcag caagagtaac tcttccttcc acctgcggaa agcctccgtg 241 cactggagcg actcggctgt gtacttctgt gctgtgagca tggatggaaa tgagaaaata 301 acttttgggg ctggaaccaa actcaccatt a // LOCUS MUSTCAXM 334 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma LD3. ACCESSION M34196 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 334 T-cell receptor alpha-chain (AA at 1) BASE COUNT 97 a 84 c 75 g 78 t ORIGIN 1 cagcaggtga gacaaagtcc ccaatctctg acagtctggg aaggagagac agcaattctg 61 aactgcagtt atgaggacag cacttttgac tacttcccat ggtaccgtct gttccctggg 121 gaaagccctg cactcctgat agccatacgt ccagtgtcca ataaaaagga agatggacga 181 ttcacaatct tcttcaataa aagggagaaa aagctctcct tgcacatcac agactctcag 241 cctggagact cagctaccta cttctgtgca gcaagaagta caggctttgc aagtgcgctg 301 acatttggat ctggcacaaa agtcattgtt ctac // LOCUS MUSTCAXN 327 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-14. ACCESSION M34198 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 327) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 327 T-cell receptor alpha-chain (AA at 1) BASE COUNT 92 a 80 c 85 g 70 t ORIGIN 1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata 61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga 121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg 181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa 241 gagtcagact cggctgtgta ctactgtgct ctgagtgatc agcgggggaa gcttatcttt 301 ggacagggaa ccaagttatc tatcaag // LOCUS MUSTCAXO 324 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-27. ACCESSION M34200 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 324 T-cell receptor alpha-chain (AA at 1) BASE COUNT 92 a 80 c 83 g 69 t ORIGIN 1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata 61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga 121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg 181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa 241 gagtcagact cggctgtgta ctactgtgct ctgaggagca actatcagtt gatctggggc 301 tctgggacca agctaattat aaag // LOCUS MUSTCAXP 297 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-6. ACCESSION M34202 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 297) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 297 T-cell receptor alpha-chain (AA at 1) BASE COUNT 85 a 73 c 74 g 65 t ORIGIN 1 gtgacattat ctgaaggaac ttctctgact gtgaactgtt cctatgaaac caaacagtac 61 ccaaccctgt tctggtatgt gcagtatccc ggagaaggtc cacagctcct ctttaaagtc 121 ccaaaggcca acgagaaggg aagcagcaga gggtttgaag ccacatacaa taaagaagcc 181 acctccttcc acttgcagaa agcctcagtg caagagtcag actcggctgt gtactactgt 241 gctctgagtg atcgggggac caatacaggc aaattaacct ttggggatgg gaccgtg // LOCUS MUSTCAXQ 193 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, T-cell clone V2.1. ACCESSION M34204 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 193) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 193 T-cell receptor alpha-chain (AA at 2) BASE COUNT 56 a 45 c 49 g 43 t ORIGIN 1 cctctttaaa gtcccaaagg ccaacgagaa gggaagcagc agagggtttg aagccacata 61 caataaagaa gccacctcct tccacttgca gaaagcctca gtgcaagagt cagactcggc 121 tgtgtactac tgtgctctga gtggaggcaa taataagctg acttttggtc aaggaaccgt 181 tctgagtgtt ctg // LOCUS MUSTCAXR 333 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-33. ACCESSION M34206 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 333) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 333 T-cell receptor alpha-chain (AA at 1) BASE COUNT 84 a 96 c 78 g 74 t 1 others ORIGIN 1 gactccgtga cccagacaga aggcctggtc actgtcaccg aggggttgcc tgtgaagctg 61 aactgcacct atcagactac ttatttaact attgcctttt tctggtatgt gcaatatctc 121 aacgaagccc ctcaggtact cctgcggagc tccacagaca acaagaggac cgagcaccaa 181 gggttccacg ccactctcna taagagcagc agctccttcc atctgcagaa gtcctcagcg 241 cagctgtcag actctgccct gtactactgt gctctgagga atacaggagg tgcagataga 301 ctcacctttg ggaaaggaac tcagctgatc atc // LOCUS MUSTCAXS 339 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-7. ACCESSION M34208 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 339 T-cell receptor alpha-chain (AA at 1) BASE COUNT 86 a 103 c 75 g 75 t ORIGIN 1 gactcagtga cccagacaga aggcctggtc actctcaccg aggggttgcc tgtgatgctg 61 aactgcacct atcagactgc ttactcaact ttccttttct ggtatgtgca acatctcaat 121 gaagccccta aactactcct gaagagctcc acagacaaca agaggaccga gcaccaaggg 181 ttccacgcca ctctccataa gagcagcagc tccttccatc tgcagaagtc ctcagcgcag 241 ctgtcagact ctgccctgta ctactgtgct ctgagtgata agactggagc taacactgga 301 aagctcacgt ttggacacgg caccatcctt agggtccat // LOCUS MUSTCAXT 342 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma P1F12C4. ACCESSION M34210 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 342) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 342 T-cell receptor alpha-chain (AA at 1) BASE COUNT 80 a 102 c 79 g 81 t ORIGIN 1 gactccgtga cccagacaga aggcctggtc actctcaacg aggggttgcc tgtgatgctg 61 aactgcacct atcagactat ttactcaaat gctttccttt tctggtatgt gcactatctc 121 aatgaatccc cttggctact cctgcggagc tccacagaca acaagaggac cgagcaccaa 181 gggttccacg ccactctcca taagagcagc agctccttcc atctgcagaa gtcctcagcg 241 cagctgtcag actctgccct gtactactgt gctttgagtg agaggtctgg agctaacact 301 ggaaagctca cgtttggaca cggcaccatc cttagggtcc at // LOCUS MUSTCAXU 324 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma P1D3A6. ACCESSION M34212 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 324 T-cell receptor alpha-chain (AA at 1) BASE COUNT 82 a 80 c 81 g 81 t ORIGIN 1 cagcaagtgc agcagagccc cgcgtccttg gttctgcagg agggggagaa tgcagagctg 61 cagtgtaact tttccacatc tttgaacagt atgcagtggt tttaccaacg tcctgaggga 121 agtctcgtca gcctgttcta caatccttct gggacaaagc agagtgggag actgacatcc 181 acaacagtca tcaaagaacg tcgcagctct ttgcacattt cctcctccca gatcacagac 241 tcaggcactt atctctgtgc tatggaggct actggaggca ataataagct gacttttggt 301 caaggaaccg ttctgagtgt tata // LOCUS MUSTCAXV 210 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 1E1O. ACCESSION M34214 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 210 T-cell receptor alpha-chain (AA at 1) BASE COUNT 54 a 53 c 50 g 53 t ORIGIN 1 gggggaagtc tcgtcagcct gttctacaat ccttctggga caaagcagag tgggagactg 61 acatccacta cagtcatcaa agaacgtcgc agctctttgc acatttcctc ctcccagaca 121 acagactcag gcacttatct ctgtgctatg gcggctactg gaggcaataa taagctgact 181 tttggtcaag gaaccgttct gagtgttata // LOCUS MUSTCAXW 234 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 7/6AH1. ACCESSION M34216 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 234) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 234 T-cell receptor alpha-chain (AA at 1) BASE COUNT 55 a 59 c 58 g 62 t ORIGIN 1 atgcagtggt tttatcaacg tcctggggga agtctcgtca gcctgttcta caatccttct 61 gggacaaagc agagtgggag actgacatcc actacagtca tcaaagaacg tcgcagctct 121 ttgcacattt cctcctccca gacaacagac tcaggcactt atctctgtgc tatgggtgta 181 tctggtagct tcaataagtt gacctttgga gcagggacca gactggctgt gtgc // LOCUS MUSTCAXX 312 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 2B11. ACCESSION M34218 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 312 T-cell receptor alpha-chain (AA at 1) BASE COUNT 89 a 79 c 79 g 64 t 1 others ORIGIN 1 aatccgtggg ccctgagngt ccacgagggt gaaagtgtca cggtgaattg tagttacaag 61 acatccataa ctgccctaca gtggtacaga cagaagtcag gcgaaggccc tgcccagcta 121 atcttaatac gttcaaatga gagagagaag cgcaatggaa gactcagagc cacccttgac 181 acctccagcc agagcagctc cttgtccatc actgctactc ggtgtgaaga caccgctgtg 241 tacttctgtg ctactgagac aggcaatact agaaaacaca tctttgggct ggggacaact 301 ttgcaagtgc aa // LOCUS MUSTCBYAO 153 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma LD1. ACCESSION M34195 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 153) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 153 T-cell receptor beta-chain (AA at 1) recomb 122 123 V-region end/J-region start BASE COUNT 39 a 37 c 36 g 41 t ORIGIN 1 caaataggag atgtccctga tgggtacaag gccaccagaa caacgcaaga agacttcttc 61 ctcctgctgg aattggcttc tccctctcag acatctttgt acttctgtgc cagcagtgta 121 ggttctggaa atacgctcta ttttggagaa gga // LOCUS MUSTCBYAP 111 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma LD3. ACCESSION M34197 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 111) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 111 T-cell receptor beta-chain (AA at 1) recomb 73 74 V-region end/J-region start BASE COUNT 18 a 37 c 25 g 31 t ORIGIN 1 ttcctcctgc tggaattggc ttctccctct cagacatctt tgtacttctg tgccgcgtcc 61 ccgacaggga acaccgacta caccttcggc tcagggacca ggcttttggt a // LOCUS MUSTCBYAQ 321 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-14. ACCESSION M34199 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 321 T-cell receptor beta-chain (AA at 1) recomb 292 293 V-region end/J-region start BASE COUNT 106 a 81 c 61 g 73 t ORIGIN 1 gacccgaaaa ttatccagaa accaaaatat ctggtggcag tcacagggag cgaaaaaatc 61 ctgatatgcg aacagtatct aggccacaat gctatgtatt ggtatagaca aagtgctaag 121 aagcctctag agttcatgtt ttcctacagc tatcaaaaac ttatggacaa tcagactgcc 181 tcaagtcgct tccaacctca aagttcaaag aaaaaccatt tagaccttca gatcacagct 241 ctaaagcctg atgactcggc cacatacttc tgtgccagca gccccaagac acgtcaaaac 301 accttgtact ttggtgcggg c // LOCUS MUSTCBYAR 210 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-27. ACCESSION M34201 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 210 T-cell receptor beta-chain (AA at 1) recomb 167 168 V-region end/J-region start BASE COUNT 57 a 56 c 43 g 54 t ORIGIN 1 aagattatgt ttagctacaa taataagcaa ctcattgtaa acgaaacagt tccaaggcgc 61 ttctcacctc agtcttcaga taaagctcat ttgaatcttc gaatcaagtc tgtagagccg 121 gaggactctg ctgtgtatct ctgtgccagc agctatcgga caccccccta tgctgagcag 181 ttcttcggac cagggacacg actcaccgtc // LOCUS MUSTCBYAS 255 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-6. ACCESSION M34203 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 255) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 255 T-cell receptor beta-chain (AA at 1) recomb 219 220 V-region end/J-region start BASE COUNT 57 a 68 c 77 g 53 t ORIGIN 1 cagtatccct ggatgagctg gtatcagcag gatctccaaa agcaactaca gtggctgttc 61 actctgcgga gtcctgggga caaagaggtc aaatctcttc ccggtgctga ttacctggcc 121 acacgggtca ctgatacgga gctgaggctg caagtggcca acatgagcca gggcagaacc 181 ttgtactgca cctgcagtgc ggggactggg ggggctacta acaccttgta ctttggtgcg 241 ggcacccgac tatcg // LOCUS MUSTCBYAT 213 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, T-cell clone V2.1. ACCESSION M34205 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 213) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 213 T-cell receptor beta-chain (AA at 1) recomb 171 172 V-region end/J-region start BASE COUNT 48 a 59 c 58 g 48 t ORIGIN 1 ctgaggctga tccattattc atatggtgct ggcagcactg agaaaggaga tatccctgat 61 ggatacaagg cctccagacc aagccaagag aacttctccc tcattctgga gttggctacc 121 ccctctcaga catcagtgta cttctgtgcc agcggtggcg gccgggggag ttatgctgag 181 cagttcttcg gaccagggac acgactcacc gtc // LOCUS MUSTCBYAU 207 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-33. ACCESSION M34207 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 207) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 207 T-cell receptor beta-chain (AA at 1) recomb 163 164 V-region end/J-region start BASE COUNT 65 a 54 c 36 g 52 t ORIGIN 1 ctagagttca tgttttccta cagctatcaa aaacttatgg acaatcagac tgcctcaagt 61 cgcttccaac ctcaaagttc aaagaaaaac catttagacc ttcagatcac agctctaaag 121 cctgatgact cggccacata cttctgtgcc agcagcaaaa gggccaacga aagattattt 181 ttcggtcatg gaaccaagct gtctgtc // LOCUS MUSTCBYAV 156 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-7. ACCESSION M34209 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 156) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 156 T-cell receptor beta-chain (AA at 1) recomb 118 119 V-region end/J-region start BASE COUNT 30 a 48 c 40 g 38 t ORIGIN 1 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg 61 gcttctccct ctcagacatc tttgtacttc tgtgccagca gtgtccgggt ctgggggcct 121 gaacagtact tcggtcccgg caccaggctc acggtt // LOCUS MUSTCBYAW 132 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma P1F12C4. ACCESSION M34211 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 132) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 132 T-cell receptor beta-chain (AA at 1) recomb 94 95 V-region end/J-region start BASE COUNT 35 a 39 c 28 g 30 t ORIGIN 1 ccaagccaag agaacttctc cctcattctg gagttggcta ccccctctca gacatcagtg 61 tacttctgtg ccagcggtgc cagacaggca aacacagaag tcttctttgg taaaggaacc 121 agactcacag tt // LOCUS MUSTCBYAX 303 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma P1D3A6. ACCESSION M34213 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 303 T-cell receptor beta-chain (AA at 1) recomb 259 260 V-region end/J-region start BASE COUNT 80 a 72 c 79 g 72 t ORIGIN 1 aaggtgacag taacaggagg aaacgtgaca ttgagctgtc gccagactaa tagccacaac 61 tacatgtact ggtatcggca ggacactggg catgggctga ggctgatcca ttactcatat 121 ggtgctggca accttcaaat aggagatgtc cctgatgggt acaaggccac cagaacaacg 181 caagaagact tcttcctcct gctggaattg gcttctccct ctcagacatc tttgtacttc 241 tgtgccagca gtgcaggagc tggaaatacg ctctattttg gagaaggaag ccggctcatt 301 gtt // LOCUS MUSTCBYAY 159 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 1E1O. ACCESSION M34215 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 159) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 159 T-cell receptor beta-chain (AA at 1) recomb 113 114 V-region end/J-region start BASE COUNT 35 a 40 c 39 g 45 t ORIGIN 1 gatgtccctg atgggtacaa ggccaccaga acaacgcaag aagacttctt cctcctgctg 61 gaattggctt ctccctctca gacatctttg tacttctgtg ccagcagtgt gggttctgga 121 aatacgctct attttggaga aggaagccgg ctcattgtt // LOCUS MUSTCBYAZ 321 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 7/6AH1. ACCESSION M34217 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 321 T-cell receptor beta-chain (AA at 1) recomb 275 276 V-region end/J-region start BASE COUNT 86 a 78 c 81 g 76 t ORIGIN 1 acccaaagcc ctagaaacaa ggtgacagta acaggaggaa acgtgacatt gagctgtcgc 61 cagactaata gccacaacta catgtactgg tatcggcagg acactgggca tgggctgagg 121 ctgatccatt actcatatgg tgctggcaac cttcaaatag gagatgtccc tgatgggtac 181 aaggccacca gaacaacgca agaagacttc ttcctcctgc tggaattggc ttctccctct 241 cagacatctt tgtacttctg tgccagcagt gtgggttctg gaaatacgct ctattttgga 301 gaaggaagcc ggctcattgt t // LOCUS MUSTCBYBA 339 bp ss-mRNA ROD 03-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 2B11. ACCESSION M34219 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 339 T-cell receptor beta-chain (AA at 1) recomb 294 295 V-region end/J-region start BASE COUNT 90 a 88 c 88 g 73 t ORIGIN 1 gaggctgcag tcacccaaag ccctagaaac aaggtgacag taacaggagg aaacgtgaca 61 ttgagctgtc gccagactaa tagccacaac tacatgtact ggtatcggca ggacactggg 121 catgggctga ggctgatcca ttactcatat ggtgctggca accttcaaat aggagatgtc 181 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg 241 gcttctccct ctcagacatc tttgtacttc tgtgccagca ggagacaggg gcctagtcaa 301 aacaccttgt actttggtgc gggcacccga ctatcggtg // LOCUS CHKATHA 188 bp ss-mRNA VRT 03-JUL-1990 DEFINITION Chicken avian thymic hormone mRNA, partial cds. ACCESSION M34330 KEYWORDS avian thymic hormone; parvalbumin. SOURCE Chicken thymus, cDNA to mRNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 188) AUTHORS Palmisano,W.A. and Henzl,M.T. TITLE Partial nucleotide sequence of the parvalbumin from chicken thymus designated "avian thymic hormone" JOURNAL Biochem. Biophys. Res. Commun. 167, 1286-1293 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 188 avian thymic hormone (AA at 1) BASE COUNT 53 a 43 c 50 g 42 t ORIGIN 1 ccggatcaga tcaagaaggt ttttggaatc cttgatcagg acaagagcgg cttcattgaa 61 gaagaagagc ttcagctgtt tctgaagaac ttctcttcga gtgccagagt cctcacctct 121 gcggagacca aagctttcct ggctgcaggt gacaccgacg gcgacgacaa aataggcgta 181 gaagaatt // LOCUS DDISGSPA 1957 bp ds-DNA INV 03-JUL-1990 DEFINITION D.discoideum spore germination-specific protein (270-11) gene, complete cds. ACCESSION M33862 KEYWORDS spore germination-specific protein. SOURCE D.discoideum (strain AX-3) germinating spore, cDNA to mRNA, and DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1957) AUTHORS Giorda,R., Ohmachi,T., Shaw,D.R. and Ennis,H.L. TITLE a shared internal theronine-glutamic acid-threonine-proline repeat defines a family of dictyostelium discoideum spore germination- specific proteins JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.L.Ennis 17-APR-1990. FEATURES from to/span description pept 171 228 spore germination-specific protein, exon 1 316 1856 spore germination-specific protein, exon 2 IVS 229 315 spore germination-specific protein intron A BASE COUNT 724 a 359 c 258 g 616 t ORIGIN 1 aaataatttt attattttct tttgtaaaag taattaaata aaaaaaaaaa taaaaaataa 61 ataaaattaa ataaagtcaa ttaaaaaaaa aaaaataata taaatatata taaaataaaa 121 aaaaaaaaac aaaacaataa tagtttatga tataaatttt taataataat atgaaaaata 181 tatatagttt attcttatta tttgcattaa taagtgcaac atttgcaagt aagttgaaaa 241 aaaaaaaaaa aaaattatat tgtaaatttt aaataaaaaa caatatacta attattaatt 301 ttaaaattaa attagataat gcatttattg tacattggaa ttcagattca atttcaaaaa 361 aattaacggg acaaattggt gatacaatct ctttttatac aagtgatgga aattctcatg 421 atgtaaaaag ttcagatggt tctgtttcgt caagtgtttt ctctggtagt cttacaaatc 481 ctggaatttt caaggtaaca cttactaaag aaggtaatat tgaatttacc agttcatatg 541 atgaaggtct ttctgcaaca atagtagttt cttctggtgg tcaaattccg attacaacaa 601 cttcatcaac tacaactgat ggtagttcaa ccccttccac tccaacttca acaacttcag 661 cctcaactac tacaagtggt ggtagtgcta caacaacaac aggagaacca attactgatg 721 gttctaatgg aggcgccagt tccacaactg gcaatagcgg gacgacaggt tctgctacca 781 ctactacttc ttcttcttcc gataattccg atggcagtgt aggtacttca actacaactt 841 caccagctat cacaacttca agtgggtcaa taatcgatcc aacttcacca cctacaactg 901 attcatcctc taatagtggt ggttatggtt catcatcttc aattgaaaat ggcgtagaat 961 gtttattaac aatcactcaa gatgcatttg attcttggac atatgataat attatttaca 1021 ccgtttatca agtaaattta acaaatattg gtacactttc agttgagtct gttattctca 1081 ctccaaatga taactcttta atttaccata cttgggaatt ggtttatgat ggaacttcac 1141 tcactcttcc aacctataga aaagctggtc caatcaatcc agaggaaacc attatctttg 1201 gttatatctc tagaaatagt actgatgtta catttgcttt aagtccaaca tgttcagatt 1261 catcaagtcc aactccaact cctactgaga ctccaactga gactccaact gagactccaa 1321 ctgagactcc aactgagact ccaactgaaa ctccaactga aactccaact gaaactgaaa 1381 ctccaacacc aacaccatca agctcatcta gtgatgtaga tagtggttca tcatctgaaa 1441 ttgaaacccc aacaccaact gaaactgata ccccaacccc aacaccatca agttcttcaa 1501 gtgaaggaag tggatcatca tcagaaactc aaccaccaat tactccacca ccaaccactg 1561 gtacttcttg tttagcccaa gtccaacaaa aagttatcaa ctcatggatt aatggtgaag 1621 ttgatcatta tatacaagtt gaggctacta ttgttaacca aggttcaact ccaatttcat 1681 cttttaattt ttattctgat gctgaacaaa tttggtcagt tgaaaaaaca ggaaccaata 1741 cctataaatt accaagttgg ttctcaacaa ttccagttgg tgggtcccat acctttggtt 1801 atattgttaa atctgctgaa ttatctgacc tcgaaggagt tcaatataca tgttgatttt 1861 aaaactctct ttttgtaata ataaaaaaaa aaaaaaaatt ttttggaaat aaatttaatt 1921 ttcaaaaact agttttgatt tcactttatt taataat // LOCUS DDISGSPB 3655 bp ds-DNA INV 03-JUL-1990 DEFINITION D.discoideum pore germination-specific protein gene, complete cds. ACCESSION M33861 KEYWORDS endo-(1,4)-beta-D-glucanase; spore germination-specific protein. SOURCE D.discoideum (strain AX-3) germinating spore, cDNA to mRNA, and DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 3655) AUTHORS Giorda,R., Ohmachi,T., Shaw,D.R. and Ennis,H.L. TITLE A shared internal theronine-glutamic acid-threonine-proline repeat defines a family of Dictyostelium discoideum spore germination- specific proteins JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.L.Ennis, 17-APR-1990. FEATURES from to/span description pept 1346 1412 spore germination-specific protein, exon 1 1505 3555 spore germination-specific protein, exon 2 IVS 1413 1504 spore germination-specific protein intron BASE COUNT 1346 a 482 c 459 g 1368 t ORIGIN 1 tttttttttt ttaatatttt ttattttatt ttttttttta attattatta attattaatc 61 tttattataa acaaaatgca tatgtgttaa aattattata accaaaaatt aattaattta 121 aaaaactaag aactatagtt ctgagatttt caatagtttt tttcaaataa tatgatttct 181 ttttcaaggg tcattaaaat tatattatta gaactattta aaaaaaattc aaaagttaaa 241 tatttaactt ttgcattttt aaaaccatca attataataa ttaattattt tattattttt 301 tttttttttt tttttttttt aattattttt gttttttttt tttttttttt tttttttttt 361 ttttattaaa aaaactatga atactttaaa ttatagtttt tcattttttt attaactgat 421 cataatttaa tttaatttaa tttaatttat ttttttgtat ttaatactcg aaaaccacat 481 acccatgatt aattaaaaaa aataaaaaaa aataaaaaaa aaagaaaaag tactttttca 541 aataaaaaat gtttataaaa aaaaattttt ttttgaggcc aagttaatat ttttgggtag 601 ttaaaatact aagatttgtt ccaatttgga tttttaatgg tttttatttt taaaaataat 661 aatttaacat ttttctaatc aattttcaaa tttttttttt tataactgat ttcttttttt 721 tttattttaa ttttttttta attttttttt atttaaaaaa tatttcaagt tgtacatttc 781 cgttagaatt tcatttggaa gatattagat tttaatttaa aaacaatttt cctaaaaaat 841 aaaataaaaa atgcgaaatt taattttttt tttttattaa taattatttt gaattaaatt 901 tttttttttt tttttttttt ttcccagatt tccaatctta taaaaaggaa ttgtttttta 961 tttttttttt tttcattttc aaaaaactaa tttattagat ctttaaaaaa aaaaaaaaaa 1021 ataataataa taataaaaat aataatatta tctattatcc aaatttgttt ttgcaattaa 1081 tttcgttatt ttttttttta aaaaactcac cacatactta cacaccaaaa aataacaaaa 1141 ataataattc tattattata atcaatttat tgtagtataa gtttaacttt taaagttcta 1201 ttaaaaaaaa aaaaaaaaaa aaaaaaagaa aaaaaaaaat atataaaata ataaaacttt 1261 tgtttattat ttttatgtac tataaatttc aaattcctat atctaaattt ttaatatttc 1321 taaattttta taaattaaaa ccaatatgaa aatattgaaa aattgtatat tattaataat 1381 atttgggtta ttatcaactc aattaattaa tggtaaagta taaaaaaaaa aaaaaaaaaa 1441 aaatattata tttcttaaac aaaaaaaaaa acaaaatatt aattcttaat ttttttttta 1501 ttagcggata ccgattattg ttcattactt gaaaatgcat taatgtttta taaaatgaat 1561 agagctggtc gtttaccaga taacgatata ccatggagag gtaattcagc attgaatgat 1621 gcaagtccaa attcagctaa agatgccaat ggtgatggta atttaagtgg tggttatttt 1681 gatgctggtg atggtgttaa atttggttta ccaatggctt attctatgac tatgttgggt 1741 tggtcattca ttgaatatga atccaatatt gctcaatgtg gtttgacaag tttatacctc 1801 gatacaatta aatatggtac cgactggctt attgcagcac atactgccga taatgaattt 1861 gcaggccaag ttggtgatgg taatgttgat cattcttggt ggggtcctcc agaagatatg 1921 acaatggctc gtccaactta tatgttaaca accgaagcac caggtactga aattgcaatg 1981 gaagcagcat cagcattagc tgcagcttca atagcattta aatcttcaaa cccaacatac 2041 gctgcaactt gcttagcaca tgctaaaact cttcataatt tcgggtacac ttatcgtggt 2101 gtttattcag attccattac gaatgctcaa gctttttata attcatggtc tggctataag 2161 gatgatttag tttggggtag catttggtta tataaagcaa ctcaagattc agattattta 2221 acaaaagccg ttgcagatta tgcatcaggt ggtgttggtg gaatggcaca aggtaattct 2281 cacgattggg ataataaagc accaggttgt tgtttattat tatctaaatt agttccaacc 2341 acaagtactt ataaaactga tttcgaaggt tggttaaatt attggttacc aggtggaggt 2401 gtcacttata ctccaggtgg tttagcatgg atcagacaat ggggtccagc tcgttatgct 2461 gccactgccg ctttccttgg ttctttagct ggtactgaaa aaggcacaga tttcactcaa 2521 aaacaagttg actatttaat tggtaataat ccaaatcaac aatcatttgt agttggtatg 2581 ggtccaaatt atccaattaa tccacatcat cgtgctgccc atcattctac aactaatgat 2641 ataaataatc cagttaataa tttatacctc ttaaaaggtg ctttagttgg tggaccaggt 2701 tcaaatgatg aatatactga tgatagaact gattatattt caaatgaagt tgcaactgat 2761 tataatgctg gtttcgttgg tgcattagct tctcttgtaa atccatcttc aacttctgtt 2821 ccaaccacaa ctccaacagt aactgaaacc ccaacagaga ctccaactga gactccaact 2881 gagactccaa ctgagactcc aacagagact ccaacagaaa ctccaacaga gactccaaca 2941 gaaactccaa cagagactcc aacagaaact ccaacagaaa ctccaacaga aactccaaca 3001 gaaactccaa cagaaactcc aacagaaact ccaaccgaga ctccaactga aactgttact 3061 ccaaccccaa cagtaacacc aactgaaact ccatcaagtg gagaatcttt atcaatctat 3121 aaaagtggat taaaaaatga tttccaagat tggtcatggg gtgagcattc attaactgat 3181 acaacaaatg ttgaatctgg agaaaccaat tcaatttcat ttacaccaaa agcatatggt 3241 gcagtatttt taggatgttt cgaatgtatt gatactgata catacaataa tattgaattt 3301 gatattaatg gtggtagcag tggtgctcaa ttattaagaa taactgttgt taaaaatagt 3361 aaatctgttg gttccaaatt aattaccgat cttaatggtg gaactccaat cgaagcaaat 3421 tcatggacta aaattaaagc atcctttatt gatgacttta aagtatctgg taaagtcgat 3481 ggtatttgga ttcaagatat caaaggtgat acccaatcaa ctgtatacat aagtaatatt 3541 attgcaactg cttaaaaaaa tattaatatt aaatattaaa aaaagtataa ataaaataat 3601 cttaaattaa aaaaaataag tgttttcgaa attttctata gatatatatc taaaa // LOCUS ECOCYSXE 1396 bp ds-DNA BCT 03-JUL-1990 DEFINITION E.coli cysteine regulon 33 Kd (cysE) and 16 Kd protein (cysX) genes, complete cds. ACCESSION M34333 KEYWORDS cysE gene; cysX gene. SOURCE E.coli (strain K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1396) AUTHORS Tei,H., Murata,K. and Kimura,A. TITLE Structure and expression of cysX, the second gene in the Escherichia coli K-12 cysE locus JOURNAL Biochem. Biophys. Res. Commun. 167, 948-955 (1990) STANDARD simple staff_review FEATURES from to/span description pept 221 1042 33 Kd protein (cysE) pept 919 527 (c) 16 Kd protein (cysX) BASE COUNT 325 a 362 c 393 g 316 t ORIGIN 1 cgcgaactgg cgcatcgctt cggcgttgaa atgccaataa ccgaggaaat ttatcaagta 61 ttatattgcg gaaaaaacgc gcgcgaggca gcattgactt tactaggtcg tgcacgcaag 121 gacgagcgca gcagccacta accccaggga acctttgtta ccgctatgac ccggcccgcg 181 cagaacgggc cggtcattat ctcatcgtgt ggagtaagca atgtcgtgtg aagaactgga 241 aattgtctgg aacaatatta aagccgaagc cagaacgctg gcggactgtg agccaatgct 301 ggccagtttt taccacgcga cgctactcaa gcacgaaaac cttggcagtg cactgagcta 361 catgctggcg aacaagctgt catcgccaat tatgcctgct attgctatcc gtgaagtggt 421 ggaagaagcc tacgccgctg acccggaaat gatcgcctct gcggcctgtg atattcaggc 481 ggtgcgtacc cgcgacccgg cagtcgataa atactcaacc ccgttgttat acctgaaggg 541 ttttcatgcc ttgcaggcct atcgcatcgg tcactggttg tggaatcagg ggcgtcgcgc 601 actggcaatc tttctgcaaa accaggtttc tgtgacgttc caggtcgata ttcacccggc 661 agcaaaaatt ggtcgcggta tcatgcttga ccacgcgaca ggcatcgtcg ttggtgaaac 721 ggcggtgatt gaaaacgacg tatcgattct gcaatctgtg acgcttggcg gtacgggtaa 781 atctggtggt gaccgtcacc cgaaaattcg tgaaggtgtg atgattggcg cgggcgcgaa 841 aatcctcggc aatattgaag ttgggcgcgg cgcgaagatt ggcgcaggtt ccgtggtgct 901 gcaaccggtg ccgccgcata ccaccgccgc tggcgttccg gctcgtattg tcggtaaacc 961 agacagcgat aagccatcaa tggatatgga ccagcatttc aacggtatta accatacatt 1021 tgagtatggg gatgggatct aatgtcctgt gatcgtgccg gatgcgatgt aatcatctat 1081 ccggcctaca gtaactaatc tctcaatacc gctcccggat accccaactg tcgccaggct 1141 tcatacacca ctaccgacac cgcattggac agattcatgc tgcggctgtc cggcaccatc 1201 ggaatgcgaa ttttttgttc agcgggcagg gcatcaagaa tgctcgctgg caggccgcgt 1261 gtttccgggc cgaacatcag ataatcgcca tcctgatagc ttacggcgct gtgagcaggt 1321 gtacctttcg tggtgagggc gaacaggcgc tgggattttc tgcttcgagg aacgcgcgat 1381 agtcatgatg acgcgt // LOCUS ECOTRPP 74 bp ds-DNA SYN 03-JUL-1990 DEFINITION Expression plasmid pDS20 derivative. ACCESSION M34334 KEYWORDS . SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 74) AUTHORS Latta,M., Philit,M., Maury,I., Soubrier,F., Denefle,P. and Mayaux,J.-F. TITLE Tryptophan promoter derivatives on multicopy plasmids: A comparative analysis of expression potentials in Escherichia coli JOURNAL DNA 9, 129-137 (1990) STANDARD simple staff_review BASE COUNT 18 a 18 c 15 g 23 t ORIGIN 1 ctcaaggcgc actcccgttc tggataatgt tttttgcgcc gacatcataa cggttctggc 61 aaatattctg aaat // LOCUS HUMCYTOK 1724 bp ss-mRNA PRI 03-JUL-1990 DEFINITION Human cytokeratin 8 mRNA, complete cds. ACCESSION M34225 KEYWORDS cytokeratin 8. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1724) AUTHORS Yamamoto,R., Kao,L.-C., McKnight,C.E. and Strauss,J.F.III. TITLE Cloning and sequence of cDNA for human placental cytokeratin 8. Regulation of the mRNA in trophoblastic cells by cAMP JOURNAL Mol. Endocrinol. 4, 370-374 (1990) STANDARD simple staff_review FEATURES from to/span description pept 35 1486 cytokeratin 8 BASE COUNT 401 a 498 c 524 g 301 t ORIGIN 1 ttcggcaatt cctacctcca ctcctgcctc caccatgtcc atcagggtga cccagaagtc 61 ctacaaggtg tccacctctg gcccccgggc cttcagcagc cgctcctaca cgagtgggcc 121 cggttcccgc atcagctcct cgagcttctc ccgagtgggc agcagcaact ttcgcggtgg 181 cctgggcggc ggctatggtg gggccagcgg catgggaggc atcaccgcag ttacggtcaa 241 ccagagcctg ctgagcccct tgtccctgga ggtggacccc aacatccagg ccgtgcgcac 301 ccaggagaag gagcagatca agaccctgaa caacaagttt gcctccttca tagacaaggt 361 acggttcctg gagcagcaga acaagatgct ggagaccaag tggagcctcc tgcagcagca 421 gaagacggct cgaagcaaca tggacaacat gttcgagagc tacatcaaca accttaggcg 481 gcagctggag actctgggcc aggagaagct gaagctggag gcggagcttg gcaacatgca 541 ggggctggtg gaggacttca agaacaagta tgaggatgag atcaataagc gtacagagat 601 ggagaacgaa tttgtcctca tcaagaagga tgtggatgaa gcatacatga acaaggtaga 661 gctggagtct cgcctggaag ggctgaccga cgagatcaac ttcctcaggc agctgtatga 721 agaggagatc cgggagctgc agtcccagat ctcggacaca tctgtggtgc tgtccatgga 781 caacagccgc tccctggaca tggagagcat cattgctgag gtcaaggcac agtacgagga 841 tattgccaac cgcagccggg ctgaggctga gagcatgtac cagatcaagt atgaggagct 901 gcagagcctg gctgggaagc acggggatga cctgcggcgc acaaagactg agatctcaga 961 gatgaaccgg aacatcagcc ggctccaggc tgagattgag ggcctcaaag gccagagggc 1021 ttccctggag gccgccattg cagatgccga gcagcgtgga gagctggcca ttaaggatgc 1081 caacgccaag ttgtccgagc tggaggccgc cctgcagcgg gccaagcagg acatggcccg 1141 gcagctgcgt gagtaccagg agctgatgaa cgtcaagctg gccctggaca tcgacatcgc 1201 cacctacagg aagctgctgg agggcgagga gagcccgctg gagtctggga tgcagaacat 1261 gagtattcat acgaagacca ccggcggcta tgcgggtggt ttgagctcgg cctatgggga 1321 cctcacagac cccggcctca gctacagcct gggctccagc tttggctctg gcgcgggctc 1381 cagctccttc agccgcacca gctcctccag ggccgtggtt gtgaagaaga tcgagacacg 1441 tgatgggaag ctggtgtctg agtcctctga cgtcctgccc aagtgaacag ctgcggcagc 1501 ccctcccagc ctacccctcc tgcgctgccc cagagcctgg gaaggaggcc gctatgcagg 1561 gtagcactgg gaacaggaga cccacctgag gctcagccct agccctcagc ccacctgggg 1621 agtttactac ctggggaccc cccttgccca tgcctccagc tacaaaacaa ttcaattgct 1681 tttttttttt ttggtcccaa aataaaacct cagctagctc tgcc //
GenBank-Updates@genbank.bio.net (07/04/90)
LOCUS TRBMVAT5A 1544 bp ss-mRNA INV 04-JUL-1990 DEFINITION T.brucei variant surface glycoprotein (MVAT5) mRNA, complete cds. ACCESSION M33825 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1544) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. BASE COUNT 497 a 396 c 366 g 285 t ORIGIN 1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt 61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt 121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa 181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg 241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt 301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa 361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac 421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg 481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc 541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg 601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca 661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca 721 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca 781 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat 841 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt 901 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac 961 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa 1021 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac 1081 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc 1141 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc 1201 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat 1261 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca 1321 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc 1381 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta 1441 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa 1501 gaatttttgc tactttaaaa acttctgata tattttaaca ccta // LOCUS TRBWRATATA 1544 bp ss-mRNA INV 04-JUL-1990 DEFINITION T.brucei variant surface glycoprotein (WRATat A) mRNA, complete cds. ACCESSION M33823 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1544) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. FEATURES from to/span description pept 42 1457 variant surface glycoprotein (WRATat A) mRNA < 1 1544 WRATat A mRNA BASE COUNT 545 a 387 c 345 g 267 t ORIGIN 1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc 61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt 121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac 181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg 241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg 301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc 361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa 421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct 481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca 541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg 601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac 661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca 721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa 781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac 841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg 901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt 961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg 1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca 1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg 1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc 1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag 1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt 1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga 1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg 1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta 1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg // LOCUS TRBWRATATB 1582 bp ss-mRNA INV 04-JUL-1990 DEFINITION T.brucei variant surface glycoprotein (WRATat B) mRNA, complete cds. ACCESSION M33824 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1582) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. FEATURES from to/span description mRNA < 1 1582 WRATat B mRNA BASE COUNT 532 a 384 c 376 g 290 t ORIGIN 1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact 61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga 121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa 181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac 241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga 301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat 361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga 421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc 481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca 541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc 601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg 661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag 721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa 781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga 841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact 901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca 961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca 1021 cgacctcata acagctaaat ccggtacgac ataagcggcg caatcactcg gctaaagcaa 1081 ctacaaatcg ccaaagcaaa gctaaaagca agacggctac tgctgcaaaa cagggaacgg 1141 cagcaaacgc gacttatggc gctagcagac aagatgcaag aattgtacca agaggcctta 1201 catgacgagg ttcaactcag gaaggaagcg cagaacaaac ctcaagaaac accagattct 1261 gacaagcaaa aagcatgcga gaaatatcac aacaagtcaa aggaatgcaa agaaaatggt 1321 tgccaatgga gtggaactga agaaaccact aggaaagtgc gaagctaaac ccaaagcagg 1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac 1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt 1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact 1561 tctggtatat tttaacacct tt // LOCUS HUMTNC1 1085 bp ds-DNA PRI 04-JUL-1990 DEFINITION Human fast skeletal muscle troponin C gene, exon 1. ACCESSION M33771 J05502 KEYWORDS troponin C. SEGMENT 1 of 2 SOURCE Human HeLa cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1085) AUTHORS Gahlmann,R. and Kedes,L. TITLE Cloning, structural analysis and expression of the human fast-twitch skeletal muscle troponin C gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by R.Gahlmann, 20-APR-1990. FEATURES from to/span description pept 1033 + 1035 troponin C, exon 1 pre-msg 968 > 1085 troponin C mRNA and introns (alt.) pre-msg 969 > 1085 troponin C mRNA and introns (alt.) IVS 1036 > 1085 troponin C intron A BASE COUNT 266 a 276 c 306 g 237 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct 61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc 121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa 181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa 241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc 301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac 361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc 421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca 481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga 541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt 601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc 661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga 721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc 781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt 841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt 901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag 961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg 1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg 1081 tggac // LOCUS HUMTNC2 1773 bp ds-DNA PRI 04-JUL-1990 DEFINITION Human fast skeletal muscle troponin C gene, exons 2-6. ACCESSION M33772 J05502 KEYWORDS troponin C. SEGMENT 2 of 2 SOURCE Human HeLa cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1773) AUTHORS Gahlmann,R. and Kedes,L. TITLE Cloning, structural analysis and expression of the human fast-twitch skeletal muscle troponin C gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by R.Gahlmann, 20-APR-1990. FEATURES from to/span description pept + 51 102 troponin C, exon 2 247 390 troponin C, exon 3 479 593 troponin C, exon 4 758 894 troponin C, exon 5 1506 1537 troponin C, exon 6 pre-msg < 1 1668 troponin C mRNA and introns IVS < 1 50 troponin C intron A IVS 103 246 troponin C intron B IVS 391 478 troponin C intron C IVS 594 757 troponin C intron D IVS 895 1505 troponin C intron E BASE COUNT 358 a 455 c 628 g 331 t 1 others ORIGIN About 2.5 kb after segment 1. 1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc 61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag 121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg 181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc 241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg 301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg 361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat 421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc 481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac 541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct 601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag 661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc 721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc 781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg 841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg 901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct 961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc 1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta 1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat 1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac 1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc 1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa 1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa 1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg 1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc 1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca 1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg 1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc 1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc 1741 ggctggcacc acccggagca cccttgggga tcc // LOCUS ARRSEG11 631 bp ds-RNA VRL 04-JUL-1990 DEFINITION Adult diarrhea rotavirus segment 11 non-structural protein mRNA, complete cds. ACCESSION M34380 KEYWORDS non-structural protein. SOURCE Adult diarrhea rotavirus, cDNA to viral RNA, clone AD63, and viral RNA segment 11. ORGANISM Adult diarrhea rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 631) AUTHORS Chen,G.-M., Hung,T. and Mackow,E.R. TITLE cDNA cloning of each genomic segment of the group B rotavirus ADRV: Molecular characterization of the 11th RNA segment JOURNAL Virology 175, 605-609 (1990) STANDARD simple staff_review FEATURES from to/span description pept 59 571 non-structural protein BASE COUNT 246 a 104 c 122 g 159 t ORIGIN 1 ggtatataaa agtcagtaga cggctggaaa cgttgcacgt actactcact acccagagat 61 ggcagaagcg tctgagttca attttaccat aaagagaaag cagagaacaa tgagtgatcg 121 aagaacgaga gaggatacaa aacaaaaaaa gattgaagag aaatctgata ttgatttggt 181 tgattcagcc tctgtctatt cacaagagtc ttctagaagt aactatagtg atgcttacga 241 taaattaaaa cgtgaaccaa tggttgaaga atcaaatgac gcaaagtata ggaattttga 301 attttctgaa gatgaggaag ttcatagacc atcaagtaaa gcatcagaca aatcatacag 361 agaaatgaaa cgtaagcatg atgacattaa tacgtcagat tctattttag agaaactgtc 421 agaattaaat ctcgagattg aaaagataaa acaaatgaac caaccaatta caattgacgc 481 agcattcaac atgattcttc gaaatgtcga caatctaaca ataaggcaga agcaggcctt 541 aataaatgca attgttaatt ctatgaacta gactgtggtc tccacctgtc atctactggg 601 gtcaacctct tgagttatat ttaaaaaacc c // LOCUS BOVFESUP 1112 bp ss-mRNA MAM 04-JUL-1990 DEFINITION Bovine Rieske iron-sulfur protein mRNA, complete cds. ACCESSION M34336 KEYWORDS Rieske iron-sulfur protein. SOURCE Bovine heart, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1112) AUTHORS Usui,S., Yu,L. and Yu,C.-A. TITLE Cloning and sequencing of a cDNA encoding the Rieske iron-sulfur protein of bovine heart mitochondrial ubiquinol-cytochrome c reductase JOURNAL Biochem. Biophys. Res. Commun. 167, 575-579 (1990) STANDARD simple staff_review FEATURES from to/span description pept 16 825 Rieske iron-sulfur protein precursor sigp 16 234 Rieske iron-sulfur protein signal peptide matp 235 822 Rieske iron-sulfur protein BASE COUNT 269 a 249 c 281 g 313 t ORIGIN 1 ggagctgctg tcgccatgtt gtcggttgcc gcccgctcgc gccattcgcg cccgtcctat 61 cggccacgtc ccgcggggtg gcgcgcgctg cggccctggt acagccggtc gtccaaggag 121 tcgcctgtgc tggacttgaa gcgctccgtc ctgtgccggg agtcgctcag aggccaggcc 181 gcggcggctt tggtcgcttc cgtgagcctc aatgtccctg cttctgtccg ttattcccat 241 acagacatca aagtgcctga cttctctgac tatcgtcgcc ctgaagtgtt agatagtaca 301 aagtcttcaa aagagagcag tgaggctcgg aaaggtttct cctatttggt gactgcaaca 361 actactgtgg gtgttgcata tgctgccaag aatgtcgtct ctcagtttgt ttccagcatg 421 agtgcttctg ccgatgtgtt ggccatgtcg aaaattgaaa tcaagttatc cgatattcca 481 gaggggaaga acatggcttt caaatggaga ggcaaacccc tgttcgtgcg ccatcgaacc 541 aagaaggaaa ttgaccagga agctgcagtt gaagtgtccc agttgaggga cccacagcat 601 gacttagaac gagtaaagaa acctgagtgg gttatcttga taggtgtttg cactcatctt 661 ggttgtgtac ccattgcaaa tgcaggagat tttggtggtt attactgccc ttgccatggg 721 tcacactatg atgcgtctgg cagaatcaga aaggggcctg cacctctcaa ccttgaagtt 781 ccctcatatg agttcaccag cgatgatatg gtgattgttg gttagagaga cctggactcg 841 agttacaggc ttctttcgtc ttcatgtcac ctcagaagag ttacttgaag aacaagcctt 901 ctgtgctttg aattagttga tttgaaatat ttaagaattg ctaataatgt gtttgcaaac 961 atgtaaattg aatttaatgt tgaatacttt caagcattca cctaaaaaag ccccccatta 1021 aacattgtta tgctcaaagg gtgcagtgtc tttgatagtt aattctaatt aaaaattgta 1081 gatgagtgta taaaatattt gtgaaatctg ta // LOCUS HUMARX 3231 bp ss-mRNA PRI 04-JUL-1990 DEFINITION Human androgen-receptor mRNA, complete cds. ACCESSION M34233 KEYWORDS androgen-receptor. SOURCE Human testis and prostate cancer cell line LNCaP, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3231) AUTHORS Govindan,M.V. TITLE Specific region in hormone binding domain is essential for hormone binding and trans-activation by human androgen receptor JOURNAL Mol. Endocrinol. 4, 417-427 (1990) STANDARD simple staff_review FEATURES from to/span description pept 109 2829 androgen-receptor /hgml_locus_uid="LL0058U" /nomgen="AR" /map="Xq12" BASE COUNT 713 a 941 c 883 g 694 t ORIGIN 1 agctagctgc agcgactacc gcatcatcac agcctgttga actcttctga gcaagagaag 61 gggaggcggg gtaagggaag taggtggaag attcagccaa gctcaaggat ggaagtgcag 121 ttagggctgg gaagggtcta ccctcggccg ccgtccaaga cctaccgagg agctttccag 181 aatctgttcc agagcgtccg cgaagtgatc cagaacccgg gccccaggca cccagaggcc 241 gcgagcgcag cacctcccgg cgccagtttg ctgctgctgc agcagcagca gcagcagcag 301 cagcagcagc agcagcagca gcagcaagag actagcccca ggcagcagca gcagcagcag 361 ggtgaggatg gttctcccca agcccatcgt agaggcccca caggctacct ggtcctggat 421 gaggaacagc aaccttcaca gccgcagtcg gccctggagt gccaccccga gagaggttgc 481 gtcccagagc ctggagccgc cgtggccgcc agcaaggggc tgccgcagca gctgccagca 541 cctccggacg aggatgactc agctgcccca tccacgttgt ccctgctggc ccccactttc 601 cccggcttaa gcagctgctc cgctgacctt aaagacatcc tgagcgaggc cagcaccatg 661 caactccttc agcaacagca gcaggaagca gtatccgaag gcagcagcag cgggagagcg 721 agggaggcct cgggggctcc cacttcctcc aaggacaatt acttaggggg cacttcgacc 781 atttctgaca acgccaagga gttgtgtaag gcagtgtcgg tgtccatggg cctgggtgtg 841 gaggcgttgg agcatctgag tccaggggaa cagcttcggg gggattgcat gtacgcccca 901 cttttgggag ttccacccgc tgtgcgtccc actccttgtg ccccattggc cgaatgcaaa 961 ggttctctgc tagacgacag cgcaggcaag agcactgaag atactgctga gtattcccct 1021 ttcaagggag gttacaccaa agggctagaa ggcgagagcc taggctgctc tggcagcgct 1081 gcagcaggga gctccgggac acttgaactg ccgtctaccc tgtctctcta caagtccgga 1141 gcactggacg aggcagctgc gtaccagagt cgcgactact acaactttcc actggctctg 1201 gccggaccgc cgccccctcc gccgcctccc catccccacg ctcgcatcaa gctggagaac 1261 ccgctggact acggcagcgc ctgggcggct gcggcggcgc agtgccgcta tggggacctg 1321 gcgagcctgc atggcgcggg tgcagcggga cccggttctg ggtcaccctc agccgccgct 1381 tcctcatcct ggcacactct cttcacagcc gaagaaggcc agttgtatgg accgtgtggt 1441 ggtggtgggg gtggtggcgg cggcggcggc ggcggcggcg gcggcgaggc gggagctgta 1501 gccccctacg gctacactcg gccccctcag gggctggcgg gccaggaaag cgacttcacc 1561 gcacctgatg tgtggtaccc tggcggcatg gtgagcagag tgccctatcc cagtcccact 1621 tgtgtcaaaa gcgaaatggg cccctggatg gatagctact ccggacctta cggggacatg 1681 cgtttggaga ctgccaggga ccatgttttg cccattgact attactttcc accccagaag 1741 acctgcctga tctgtggaga tgaagcttct gggtgtcact atggagctct cacatgtgga 1801 agctgcaagg tcttcttcaa aagagccgct gaagggaaac agaagtacct gtgcgccagc 1861 agaaatgatt gcactattga taaattccga aggaaaaatt gtccatcttg tcgtcttcgg 1921 aaatgttatg aagcagggat gactctggga gcccggaagc tgaagaaact tggtaatctg 1981 aaactacagg aggaaggaga ggcttccagc accaccagcc ccactgagga gacaacccag 2041 aagctgacag tgtcacacat tgaaggctat gaatgtcagc ccatctttct gaatgtcctg 2101 gaagccattg agccaggtgt agtgtgtgct ggacacgaca acaaccagcc cgactccttt 2161 gcagccttgc tctctagcct caatgaactg ggagagagac agcttgtaca cgtggtcaag 2221 tgggccaagg ccttgcctgg cctccgcaac ttacacgtgg acgaccagat ggctgtcatt 2281 cagtactcct ggatggggct catggtgttt gccatgggct ggcgatcctt caccaatgtc 2341 aactccagga tgctctactt cgcccctgat ctggttttca atgagtaccg catgcacaag 2401 tcccggatgt acagccagtg tgtccgaatg aggcacctct ctcaagagtt tggatggctc 2461 caaatcaccc cccaggaatt cctgtgcatg aaagccatgc tactcttcag cattattcca 2521 gtggatgggc tgaaaaatca aaaattcttt gatgaacttc gaatgaacta catcaaggaa 2581 ctcgatcgta tcattgcatg caaaagaaaa aatcccacat cctgctcaag acgcttctac 2641 cagctcacca agctcctgga ctccgtgcag cctattgcga gagagctgca tcagttcact 2701 tttgacctgc taatcaagtc acacatggtg agcgtggact ttccggaaat gatggcagag 2761 atcatctctg tgcaagtgcc caagatcctt tctgggaaag tcaagcccat ctatttccac 2821 acccagtgaa gcattggaaa ccctatttcc ccaccccagc tcatgccccc tttcagatgt 2881 cttctgcctg ttataactct gcactactcc tctgcagtgc cttggggaat ttcctctatt 2941 gatgtacagt ctgtcatgaa catgttcctg aattctatct gctgggcttt ttttttctct 3001 ttctctcctt tctttttctt cttccctccc tatctaaccc tcccatggca ccttcagact 3061 ttgcttccca ttgtggctcc tatctgtgtt ttgaatggtg ttgtatgcct taaatctgtg 3121 atgatcctca tatggcccag tgtcaagttg tgcttgttta cacgcatctc tgtgccagcc 3181 acacaaaccg tttacttact taccgcaagg gaacttagag agctagaatt c // LOCUS DBIDB58 2149 bp ds-DNA PLN 04-JUL-1990 DEFINITION D.biflorus lectin DB58 gene, complete cds. ACCESSION M34271 J05280 KEYWORDS lectin. SOURCE D.biflorus 3-week old leaf and stem DNA. ORGANISM Dolichos biflorus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 2149) AUTHORS Harada,J.J., Spadoro-Tank,J., Maxwell,J.C., Schnell,D.J. and Etzler,M.E. TITLE Two lectin genes differentially expressed in Dolichos biflorus differ primarily by a 116-base pair sequence in their 5' flanking regions JOURNAL J. Biol. Chem. 265, 4997-5001 (1990) STANDARD simple staff_review FEATURES from to/span description pept 902 1729 lectin DB58 mRNA 865 > 2149 lectin DB58 mRNA BASE COUNT 671 a 474 c 294 g 710 t ORIGIN 1 gaagagatgt tgaaatttaa tatattttta ttattatgga tgaaaattta cgtgactctc 61 ttattaaaca atgattgcct tataattttt tttttcaata attaaaagat gcgttaaaaa 121 gaataaggaa aatcctcttt taatatatat aattactttt tataatactt tacgaaaaat 181 attattatta ttattattat tatattatta cagtagatta taaatagtta tatcttaaaa 241 aaaatttgtc aatatatcac tatttaaaaa ataattatag aatattcgtt atgtgagtat 301 tttttaaaag atattaataa aaggactcaa attcattctt tacctcatat ttattttagt 361 ttttttttaa atttcatttt aatttattgt gaatagtata atatataatt caattattac 421 aaatgaataa ttaaattatt aggaactgat aaagtaaagt aaaaaaagtg aagaacaata 481 tatctacatt aaaagaacat tatgatatat gtggagtttc tgcatcactt tgggaccgaa 541 agtggactat attggtaata tgtattgtac tacataaatt atatgattac caaagttata 601 tgcaccgtac tatctcaatt atatgtttcc aaaaccacaa aaaatataat tttttttctt 661 tttaactcta ttataatata ttgttttatt atttaatata aataaaacca tacaataaaa 721 gcatataaca agagaaatca ttttcaaaca cgtgtttatc tctatagatg atgtgattta 781 gattttgagt ttctatggtt aagaatacca aaacctagat gaggttttga actataaata 841 tacaacacag gcttctccat atgcatcacc catcaccata gttgagcaaa acatcaaagc 901 catggcttcc tccacagtct ctgttgtcct ctccctcttc ctgctgcttc tcacccaagc 961 atactcagct gacatccaat ccttctcctt caaaaacttc aactcatcca gcttcatcct 1021 ccaaggtgac gccaccgtat cctccagcaa gttacgactc acaaaagtta agggcaacgg 1081 ccttcccacc ctgtcttctc tcggccgcgc cttctactcc tcccccatcc aaatctatga 1141 caagtccacc ggcgccgtcg ccagctgggc cacctccttc acagccaaca ttttcgctcc 1201 caacaagtca tcctccgccg acgggattgc cttcgctttg gtacccgtcg ggtctgagcc 1261 caaaagcaac agtggctttc taggtgtttt cgacagcgat gtctacgaca actccgccca 1321 gactgtggca gtggagttcg acacgttctc caacaccgac tgggacccca ccagccgcca 1381 cattgggatc gacgtgaact ccatcaagtc tatcagaacg gcgtcgtggg gtttggccaa 1441 cggccaaaac gccgagattc tgatcaccta caacgccgcc acctcccttt tggtggcttc 1501 tctggtccac ccttctcgca gaacgagcta catcgtctcc gaaagagtgg acatcactaa 1561 tgaacttccg gagtatgtga gcattgggtt ctccgccacc accgggcttt ccgaaggcta 1621 cactgaaact cacgacgtcc tctcttggtc ttttgcttcg aagctcccag atgactccac 1681 cactgaacct ttggacattg ctagctacct cgtccgcaac gtcctctaga ctccttcttc 1741 ctccctgtga cagtgcttcc atgccataac gctctcttcc tcttcctctt catcttcctc 1801 ttcctcttcc cactccttct tctttcttcc taataatctt catctgtcac agtaattaat 1861 attatcttac cttctttcta cactctctgt tcttctttta tatgttaacc tcatctccta 1921 atcaactctg aacataatat tttaatacaa acatcacata taatatttat tataataatt 1981 acccgttaaa atgtctttta atacatatat taaaataaat atatgaagaa aatattagta 2041 tataattatt tgagaaatat aattaaattt ataatgtgaa ttaattgctc tatgagtgaa 2101 tagtccaaaa caaatatttg aaaaaggaaa aaaattgaaa atggatagg // LOCUS DBISLEC 2230 bp ds-DNA PLN 04-JUL-1990 DEFINITION D.biflorus seed lectin (SL) gene, complete cds. ACCESSION M34270 J05280 KEYWORDS lectin. SOURCE D.biflorus 28-day postanthesis seed DNA. ORGANISM Dolichos biflorus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 2230) AUTHORS Harada,J.J., Spadoro-Tank,J., Maxwell,J.C., Schnell,D.J. and Etzler,M.E. TITLE Two lectin genes differentially expressed in Dolichos biflorus differ primarily by a 116-base pair sequence in their 5' flanking regions JOURNAL J. Biol. Chem. 265, 4997-5001 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1013 1840 seed lectin mRNA 976 > 2230 seed lectin mRNA BASE COUNT 677 a 498 c 347 g 708 t ORIGIN 1 ttaacgcctt taagttttat gaagagatgg gtgcgaagat ggttgaagaa tacaggtttt 61 tcagaatgac tgggaaggct cttcaggatt ctggttaaac attcactgta acatgttatt 121 taacttattc ctattttatg atttctaata agattattaa ttatttttta taatacttta 181 cgaaaaaaaa ttattattat tattatatta ttataataga ttataaacaa ttatatctta 241 aaagaaatta tatcactatt taaaaaataa ttatagaata ttcgttatgt gagcattttt 301 taaaagatat taataaaagg actcaaattc attctttacg tcatattttt cgaaatgtat 361 tattttattt tttattaaat ttcattttaa tttattgtga atagtataat atataattat 421 gaataattaa tattattagg aagtgataaa gtaaagtaaa gaatgaagaa taatatatct 481 acactttaaa agaacattat gatatatgtc gagtttctgc atcactttgg gaccaaaagt 541 gtactatatt ggtaatatgt actgtggtac ataaattata tgataaccaa agttacatgg 601 accgtactat ctcaattata tgttaccgaa atcacaaaaa atataattat tttttctttt 661 taactctatt ataatatatt ttttattatt taatataaaa aaaaccatag gataaaagca 721 tataacaaga gaaaacaata tttaaacacg ttatctctat agcccaatag ggacggcaca 781 ccccccactt caagcgcacc tctgtgtcaa cagaatcact ctaagagtat agtcgtggaa 841 tcggtgaacc acgccagctg cttagatgcg tggcgcagag gatgtgattt agattttgag 901 tttctatggt taagattacc aaaatctaga tgaggttttc aactataaat agacaagaga 961 ggctcctcaa tatgcatcac ccatcaccat agttgagcaa aacatcaaag ccatggcttc 1021 ctccacagtc tctgttgtcc tctccctctt cctgctgctt ctcacccaag caaactcagc 1081 caacatccaa tccttctcct tcaaaaactt caactcaccc agcttcatcc tccaaggtga 1141 cgccaccgta tcctccggca agttacaact caccaaagtt aaggaaaacg gcattcccac 1201 cccgtcttct ctcggccgcg ccttctactc ctcccccatc caaatctatg acaagtccac 1261 aggcgccgtc gccagctggg ccacctcctt cacagtcaag atttcagctc ccagcaaggc 1321 atccttcgcc gatgggattg ccttcgcttt ggtacccgtc gggtctgagc ccagacgcaa 1381 cggtggctat ctaggtgttt tcgacagcga tgtctacaac aactccgccc agactgtggc 1441 agtggagttc gacacgttct ccaacagcgg ctgggacccc agcatgaagc acattgggat 1501 cgacgtgaac tccatcaagt ctatcgcaac ggtgtcgtgg gatttggcca acggcgaaaa 1561 cgccgagatt ctgatcacgt acaacgccgc cacctccctt ttggtggctt ctctggtcca 1621 cccttctcgc agaacgagct acatcctctc cgaaagagtg gacatcacta atgaacttcc 1681 cgagtatgtg agcgttgggt tctccgccac caccgggctt tccgaaggct acattgaaac 1741 tcacgacgtc ctctcttggt cttttgcttc caagctccca gatgactcca ccgctgaacc 1801 tctggacctt gctagctacc tcgtccgcaa cgtcctctag actccttctt cctccctgtg 1861 acggtgcttc catgccataa cactctcttc ctcttcctct tcctcttccc accccttctt 1921 ctttcttcat aataataatc ttcatctgtc acagtaataa tattatctta ccttctttct 1981 acactctccg ttcttctttt atattttaac ctcatctcct aatcaactct taacaacata 2041 atatttaata taaataaaat tacccgttaa aatgtctttt aatacatata ttaagataaa 2101 tatatgaaga aaagattagt atataattat ttgagaaata tgttattatg ggttaagtaa 2161 tgaatttatt ttaagaaaat atgtaattga gtaaacctgt tttcgtaaac aactttgtaa 2221 aatggaattc // LOCUS STMBLAC1 957 bp ds-DNA BCT 04-JUL-1990 DEFINITION S.badius beta-lactamase gene, complete cds. ACCESSION M34178 KEYWORDS beta-lactamase. SOURCE S.badius DNA. ORGANISM Streptomyces badius Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 957) AUTHORS Forsman,M., Haeggstroem,B., Lindgren,L. and Jaurin,B. TITLE Molecular analysis of beta-lactamases from four species of Streptomyces: Comparison of amino acid sequences with those of other beta-lactamases JOURNAL J. Gen. Microbiol. 136, 589-598 (1990) STANDARD simple staff_review FEATURES from to/span description pept 16 957 beta-lactamase precursor (EC 3.5.2.6) sigp 16 93 beta-lactamase signal peptide matp 94 954 beta-lactamase BASE COUNT 156 a 357 c 320 g 124 t ORIGIN 1 caggaggttt cgcccatgca tcacccccgc gtcaggaccg ccgtcgccgg tctggtcgcc 61 gtgctgtccc tcgtgccgct ggcggcctgc gggcagagcg actccaccgc gcccccttcg 121 tccgccaagc ccgccacgag cgcgtcggcc tcgctgccgc ggccgaagcc gtacaccggc 181 gacttcaaga agctggagcg cgagttcgac gcacggctgg gcgtgtacgc gatcgacacc 241 ggcaccggac gcgaggtgac ccacaacgac cgggcgcgct tcgcctacca ctccaccttc 301 aaggcgctcc aggccgcggt cgtcctcagc acctactccc tggacggcct cgacaagcgg 361 gtgacgtaca cccgcgagga cctggtcgcc cactctccgg tgaccgagaa gcacgtcgac 421 accggcatga cgctgaagga gctgtgcgac gcctccgtgc gctacagcga caacaccgcg 481 gccaacctcc tcttcgaccc acgtgcggcc gggcccaagg gcctggacgc ctcgctggag 541 aagctcggcg acgacatcac gcggatggac cgggaggagc cggagctgag ccgatgggtc 601 cccggtgaga agcgcgacac ctccacgccc cgggcgctgg ccgaggatct gcgcgcgttc 661 gtcctgggca aggccctgcg cgcacccgag cgggcgcagc tgacgacgtg gctgcggacg 721 aacaccaccg gggacgcggt gatcagggcg ggtgtgccgg agaactgggt cgtcggtgac 781 aagaccggga ccggcagcta ctacggggcc cgcaacgaca tcgccgtggt gtggcccccg 841 gactccgcgc ccatcgtgat cgccatcctg tcccaccgcg gtacgaagga cgcggagccg 901 gacgacgaac tcatcgccga agcggcctcc gtggtcgtcg actcgctgtc ctcctag // LOCUS STMBLAC2 948 bp ds-DNA BCT 04-JUL-1990 DEFINITION S.lavendulae beta-lactamase gene, complete cds. ACCESSION M34180 KEYWORDS beta-lactamase. SOURCE S.lavendulae DNA. ORGANISM Streptomyces lavendulae Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 948) AUTHORS Forsman,M., Haeggstroem,B., Lindgren,L. and Jaurin,B. TITLE Molecular analysis of beta-lactamases from four species of Streptomyces: Comparison of amino acid sequences with those of other beta-lactamases JOURNAL J. Gen. Microbiol. 136, 589-598 (1990) STANDARD simple staff_review FEATURES from to/span description pept 31 948 beta-lactamase precursor (gtg start codon) (EC 3.5.2.6) sigp 31 133 beta-lactamase signal peptide matp 134 945 beta-lactamase BASE COUNT 136 a 373 c 333 g 106 t ORIGIN 1 gaccgcaaga gccgtagggg agagcgcatc gtggggacca caggagcccg tccgagccgt 61 cgtgccgtac tgaccgccgc cgcgggggcg gcggtggcgg ggatccccct gggcgggtcc 121 acggccttcg cggccccgcg ggggaacccg gacgtcctgc ggcagctgcg cgccctcgaa 181 caggaacact ccgcccgcct gggcgtgtac gcccgggaca cggccaccgg ccggaccgtc 241 ctccaccgcg cggaggagcg ctttccgatg tgctcggtct tcaagaccct cgccgtcgcc 301 gccgtcctgc gcgacctcga ccgggacggc gagttcctgg ccacacgcct cttctacacc 361 gaacaggaag tgaaggactc cgggttcggg ccggtcaccg gcctgcccga gaacctcgcg 421 gccggcatga ccgtcgagcg gctgtgcgcc gccgccatct gccagagcga caacgccgcg 481 gcgaacctgc tgctccggga gctgggcggc ccggaggcgg tcacccgttt ctgccggtcg 541 gtgggggacc ggaccacccg gctggaccgc tgggagcccg aactgaactc cgccgagccc 601 ggccggctca ccgacaccac caccccgcgc gccatcgggg cgacctacgg cgaactcgtc 661 ctcggcgacg cgctcgcccc gcgggaccgc gagcggctca ccggctggct cctggcgaac 721 acgacgagca ccgaacgctt ccgcaagggc ctgcccgccg actggaccct gggcgacaag 781 accggcggcg gcgcgtacgg cacgaacaac gacgccggcg tcacctggcc cccgcaccgc 841 ccgccggtgg tgatggtcgt cctgacgacc cacgaccggc cggacgcggt cgccgacaac 901 ccgctcgtgg ccaagacggc agcgctgctg gcgtcggccc tcggctga // LOCUS STMBLAC3 951 bp ds-DNA BCT 04-JUL-1990 DEFINITION S.fradiae beta-lactamase gene, complete cds. ACCESSION M34179 KEYWORDS beta-lactamase. SOURCE S.fradiae DNA. ORGANISM Streptomyces fradiae Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 951) AUTHORS Forsman,M., Haeggstroem,B., Lindgren,L. and Jaurin,B. TITLE Molecular analysis of beta-lactamases from four species of Streptomyces: Comparison of amino acid sequences with those of other beta-lactamases JOURNAL J. Gen. Microbiol. 136, 589-598 (1990) STANDARD simple staff_review FEATURES from to/span description pept 31 951 beta-lactamase precursor (gtg start codon) (EC 3.5.2.6) sigp 31 133 beta-lactamase signal peptide matp 134 948 beta-lactamase BASE COUNT 133 a 352 c 361 g 105 t ORIGIN 1 tcgtcctgcc gtagggagaa ggggtccatc gtggatcgca ccaccgcacg tccgaaccgc 61 cgagccgtcc tcgccacagg ggtgggggcc gcgctggcgg ccaccgcggc agcggcgggc 121 cccgcgcacg ccgctccggg gcggggcgcc cgcgtcgagg ggcgcctgcg ggcgctggag 181 cgcacgcacg acgcgcggct cggcgcgttc gcgtacgaca cgggcacggg gcggacggtg 241 gcgtaccgcg cggacgagcg cttcccgatc gcgtcgatgt tcaagacgat cgccgtcgcc 301 gccgtgctgc gcgacctgga ccgggacggc gaggtcctcg cccggcgcgt ccactacacg 361 gcggactacg tgaagcggtc gggctactcc cccgtcaccg ggctgccgga gaacgtggcg 421 aacggcatga ccgtcgccga gctgtgcgag gcgacgctga cccgcagcga caacacggcg 481 gccaacctgc tcctgcgcga tctgggcggc ccgaccgcgg tgacccgttt ctgccgctcc 541 gtcggcgacc acgtcacccg gctggaccgc tgggagcccg agctgaactc ggccgagccc 601 gggagggtca ccgacaccac gtcgccgcgc gccatcggcc gcacgtacgg gcggctgatc 661 ctcggcgacc tgctggcggc gcacgaccgg gagcggctga cgcggtggat gctggacaac 721 cggacgagtg acgagcggtt ccgcaagggc ctgcccgcgg actggctgct ggcggacaag 781 acgggcggcg gcgactacgg caccaacaac gacgcggggg tcgcctggcc gcccggacgc 841 ccgccggtcg tgctggccgt ccagacgacc cggttcaccc cggacgccga ggccgacaac 901 gtcctcgtcg cggaggcggc acgcctcctg gcggaggcga tgaccgactg a // LOCUS LBKRRNA 1528 bp ss-rRNA RNA 04-JUL-1990 DEFINITION Lactobacillus kandleri 16S small subunit ribosomal RNA. ACCESSION M23038 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE Lactobacillus kandleri (strain DSM 20593) rRNA. ORGANISM Lactobacillus kandleri Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1528; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1528 16S rRNA BASE COUNT 411 a 326 c 436 g 336 t 19 others ORIGIN 1 nntaaattga gagtttgatc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc 61 aagtcgaacg cactgtggtt gaaatgagat gagaagcttg cttcaagtca aatgccaaca 121 ttgcagtgag tggcgaacgg gtgagtaaca cgtgggaaac ctncctctta gcaggggata 181 acatctggaa acaggtgcta ataccgtata acaataaaaa ccgcatggtt ttgatttaaa 241 agatggttct gctatcacta agagatggtc ccncggtgca ttagttagtt ggtgaggtaa 301 aggctcacca agacgatgat gcatagccga gttgagagac tgatcggcca caatgggact 361 gagacacggc cnatactcct acgggagnca gcagtaggga atcttccaca atggacgaaa 421 gtctgatgga gcaacgccgn gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta 481 agagaagaat gacattgaga gtaactgttc aatgtgtgac ggtatcttac cagaaaggaa 541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccnagcgtta tccggattta 601 ttgggcgtaa agcgagcgca gacggttatt taagtctgaa gtgaaagccc tcggcttaac 661 cgagnnattg ctttggaaac tggataactt gagtgcagta gaggaaagtg gaactccatg 721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctttctgga 781 ctgtaactga cgttgaggct cnnaagtgtg ggtagcaaac aggattagat accctggtag 841 tccacaccgt aaacgatgag tgctagctgt tcgagggttt ccgcccttga gtggcgaagc 901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg 961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaagnaacgc gaagaacctt 1021 accaggtctt gacatccttt gaccactcca gagatggagn tttcccttcg gggtcaaagt 1081 gacaggtggn gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccncaa 1141 cgagcgcaac ccttattgtt agttgccagc atttagttgg gcactctagc aagactgccg 1201 gtgacaaacc ggaggaaggc ggggatgacg tcaaatcatc atgcccctta tgacctgggc 1261 tacacacgtg ctacaatggc aagtacaacg agcagccaac ccgcgagggt gcgcaaatct 1321 cttaaagctt gtctcagttc ggactgtagg ctgcaactcg cctacacgaa gtcggaatcg 1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cgggncttgt acacaccgcc 1441 cgtcacacca tgagagtttg taacacccaa agtcggtgag gtaaccttta taggggccag 1501 ccgcctaagg tgggacagat gatnaggg // LOCUS LBMRRNA 1524 bp ss-rRNA RNA 04-JUL-1990 DEFINITION Lactobacillus minor, 16S small subunit ribosomal RNA. ACCESSION M23039 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE L.minor(strain DSM 20014) rRNA. ORGANISM Lactobacillus minor Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1524; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1524 16S rRNA BASE COUNT 414 a 320 c 431 g 345 t 14 others ORIGIN 1 nncaaaatga gagtttgatc ctggctcagg atnaacgctg gcggcgtgcc taatacatgc 61 aagtcgaacg ctttgtggtt caactgatat gaagagcttg ctcggatttg aagatgaaca 121 ttgcaaagag tggcgaacgg gtgagtaaca cgtgggaaac ctncctctta gcgggggata 181 acacttggaa acaagtgcta ataccgcata atactgataa ccgcatggtt atcagttgaa 241 agatggttct gctatcacta aganatggtc ccncggtgta ttagctagtt ggtaaggtaa 301 tggcttacca aggcaatgat acatagccga gttgagagac tgatcggcca caatgggact 361 gagacacggc cnatactcct acgggaggca gcagtaggga atcttccaca atggacgaaa 421 gtctgatgga gcaacgccgn gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta 481 agagaagaat gacattgaga gtaactgttc agtgtgtgac ggtatcttac cagaaaggaa 541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccaagcgtta tccggattta 601 ttgggcgtaa agcgagcgca gacggttgtt taagtctgaa gtgaaagccc acagcttaac 661 tgtggaagtg ctttggaaac tggataactt gagtgcagta gaggagagtg gaactccatg 721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctctctgga 781 ctgtaactga cgttgaggct cgnaagtgtg ggtagcaaac aggattagat accctggtag 841 tccacaccgt aaacgatgag tgctagatgt ttgagggttt ccgcccttaa gtgtcgcagc 901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg 961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaagnaacgc gaagaacctt 1021 accagctctt gacatccttt gaccacttca gagatgaagc tttcccttcg gggacaaagt 1081 gacaggtggn gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa 1141 cgagcgcaac ccttattact agttgccagc atttagttgg gcactctagt gagactgccg 1201 gtgacaaacc ggaggaaggt ggggatgacg tcaaatcatc atgcccctta tgagctgggc 1261 tacacacgtg ctacaatggc aagtacaacg agcagctaac ccgcgagggt acgcgaatct 1321 cttaaaactt gtctcagttc ggattgtagg ctgcaactcg cctacatgaa gtcggaatcg 1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cnggtcttgt acacaccgcn 1441 cgtcacacca tgagagtttg taacacccaa agccggtgag gtaacctttt aggagccaac 1501 cgtctaaggt gggacagatg atta // LOCUS LBTRRNA 1529 bp ss-rRNA RNA 04-JUL-1990 DEFINITION L.halotolerans 16S small subunit ribosomal RNA. ACCESSION M23037 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE L.halotolerans (strain DSM 20190) rRNA. ORGANISM Lactobacillus halotolerans Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1529; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1529 16S rRNA BASE COUNT 406 a 329 c 437 g 328 t 29 others ORIGIN 1 nncaaaatga gagtttgatc ctggctcagg angaacgctg gcggcgtgcc taatacatgc 61 aagtcgaacg ccttgtcgtt ctactgattt agagagcttg ctcaatactg acgtagaact 121 atacaaggag tggcgaacgg gtgagtaaca cgtgggaaac cnncctctta gtaggggata 181 acacctggaa acaggngcta ataccgcata acaacgataa ccgcatggtt atcgcttgaa 241 aggtggcgta agctaccgct aagagatggt cccgcggtgc attagttagt tggtgaggta 301 atggctcacn nagacgatga tgcatagccg agttgagaga ctgatcggcc acaatgggac 361 tgagacacgg ccnntnctcc tacgggagnc agcagtaggg aatcttccac aatggacgaa 421 agtctgatgg agcaacgcng ngtgtgtgat gaagggtttc ggctcgtaaa acactgttgt 481 aagagaagaa tgatactaag agtaactgtt tagtatgtga cggtatctta ccagaaagga 541 acggctaaat acgtgccagc agccncggta atacgtatgt tccnagcgtt atccggattt 601 attgggcgta aagcgagcgc agacggttat ttaagtctga agtgaaagcc cncagcttaa 661 ctgtggnagt gctttggaaa ctggataact tgagtgcagt agaggaaagt ggaactccat 721 gtgtagcggt gaaatgcgta gatatatgga agaacaccag cggcgaaggc ggctttctgg 781 actgtaactg acgttgaggc tcgnaagtgt gggtagcaaa caggattaga taccctggta 841 gtccacaccg taaacgatga gtgctagatg tttgggggtt tccgccccna agtgtcgtag 901 ctaacgcatt aagcactccg cctggggagt acgaccgcaa ggttgaaact caaaggaatt 961 gacggggacc ngcacaagcg gnggagcatg tggtttaatt cgaagnaacg cgaagaacct 1021 taccaggnct tgacatcctt tgaccacctc agagatgagg ctttcccttc ggggacaaag 1081 tgacaggtgg ngcatggttg tcgtcagctc gtgtcgtgag atgttgggtt aagtcccgca 1141 acgagcgcaa cccttatgat tagttgccag catttagttg ggcactctaa tcagactgcc 1201 ggtgacaaac cggaggaagg tggggatgac gtcaaatcat catgcccctt atgacctggg 1261 ctacacacgt gctacaatgg caagtacaac gagcagctaa cccgtaaggg cacgcgaatc 1321 tcttaaaact tgtctcagtt cggattgtgg gctgcaactc gcccncatga agtcggaatc 1381 gctagtaatc gcggatcagc acgccgcggt gaatacgttc ccgggtcttg tacacaccgc 1441 ncgtcacncc atgagagttt gtaacaccca aagccggtga ggtaacccgc aagggaacca 1501 gccgtctaag gtgggacaga tgattnggg // LOCUS LCORRNA 1525 bp ss-rRNA RNA 04-JUL-1990 DEFINITION L.confusus 16S small subunit ribosomal RNA. ACCESSION M23036 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE L.confusus (strain DSM 20196) rRNA. ORGANISM Lactobacillus confusus Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1525; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1525 16S rRNA BASE COUNT 407 a 325 c 440 g 339 t 14 others ORIGIN 1 nccaaattga gagtttgatc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc 61 aagtcgaacg ctttgtggtt caactgattt gaagagcttg ctcagatatg acgatggaca 121 ttgcaaagag tggcgaacgg gtgagtaacn cgtgggaaac cnacctctta gcaggggata 181 acatttggaa acagatgcta ataccgtata acaatgacaa ccgcatggtn gtnatttaaa 241 agatggttct gctatcacta agagatggtc ccgcggtgca ttagctagtt ggtaaggtaa 301 tggcttacca aggcgatgat gcatagccga gttgagagac tgatcggcca caatgggact 361 gagacacggc ccatactcct acgggaggca gcagtaggga atcttccaca atgggcgaaa 421 gcctgatgga gcaacgccnn gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta 481 agagaagaat gacattgaga gtaactgttc aatgtgtgac ggtatcttac cagaaaggaa 541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccnagcgtta tccggattta 601 ttgggcgtaa agcgagcgca gacggttatt taagtctgaa gtgaaagccc tcagctcaac 661 tgaggaattg ctttggaaac tggatgactt gagtgcagta gaggaaagtg gaactccatg 721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctttctgga 781 ctgtaactga cgttgaggct cnaaagtgtg ggtagcaaac aggattagat accctggtag 841 tccacaccgt aaacgatgag tgctaggtgt ttgagggttt ccgcccttaa gtgccgcagc 901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg 961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaannaacgc gaagaacctt 1021 accaggtctt gacatccctt gacaactcca gagatggagn gttcccttcg gggacaaggt 1081 gacaggtggn gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa 1141 cgagcgcaac ccttattact agttgccagc attcagttgg gcactctagt gagactgccg 1201 gtgacaaacc ggaggaaggt ggggatgacg tcaaatcatc atgcccctta tgacctgggc 1261 tacacacgtg ctacaatggc gtatacaacg agttgcaaac ccgcgagggt gagctaatct 1321 cttaaagtac gtctcagttc ggattgtagg ctgcaactcg cctacatgaa gtcggaatcg 1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cgggtcttgt acacaccgcc 1441 cgtcacacca tgagagtttg taacacccaa agccggtggg gtaaccttcg ggagccagcc 1501 gtctaaggtg ggacagatga ttagg // LOCUS LEURRNAC 1493 bp ss-rRNA RNA 04-JUL-1990 DEFINITION L.cremoris 16S small subunit ribosomal RNA. ACCESSION M23034 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE L.cremoris (strain DSM 20346) rRNA. ORGANISM Leuconostoc cremoris Prokaryota; Bacteria; Firmicutes; Gram-positive cocci. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1493; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1493 16S rRNA BASE COUNT 398 a 324 c 438 g 324 t 9 others ORIGIN 1 tcaaattgag agtttgatcc tggctcagga tgaacgctgg cggcgtgcct aatacatgca 61 agtcgaagcc acagcgaaag gtgcttgcac ctttcaagtg agtggcgaac gggtgagtaa 121 cacgtggaca acctgcctca aggctgggga taacatttgg aaacagatgc taataccgaa 181 taaaacttag tgtcgcatga cacaaagtta aaaggcgctt cggcgtcacc tagagatgga 241 tccgcggtgc attagttagt tggtggggta aaggcctacc aagacaatga tgcatagccg 301 agttgagaga ctgatcggcc acattgggac tgagacacgg cccaaactcc tacgggaggc 361 tgcagtaggg aatcttccac aatgggcgaa agcctgatgg agcaacgccg cgtgtgtgat 421 gaaggctttc gggtcgtaaa gcactgttgt atgggaagaa cagctagaat aggaaatgat 481 tttagtttga cggtaccata ccagaaaggg acggctaaat acgtgccagc agccgcggta 541 atacgtatgt cccgagcgtt atccggattt attgggcgta aagcgagcgc agacggttta 601 ttaagtctga tgtgaaagcc cggagctcaa ctccggnatg gcattggaaa ctggttaact 661 tgagtgcagt agaggtaagt ggaactccat gtgtngcggt ggaatgcgta gatatatgga 721 agaacaccag tggcgaaggc ggcttactgg actgcaactg acgttgaggc tcgnaagtgt 781 gggtagcaaa caggattaga taccctggta gtccacaccg taaacgatga acactaggtg 841 ttaggaggtt tccgcctctt agtgccnaag ctaacgcatt aagtgttccg cctggggagt 901 acgaccgcaa ggttgaaact caaaggaatt gacggggacc cgcacaagcg gtggagcatg 961 tggtttaatt cgaagcaacg cgaagaacct taccaggtct tgacatcctt tgaagctttt 1021 agagatagaa gtgttctctt cggagacaaa gtgacaggtg gtgcatggtc gtcgtcagct 1081 cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccttattg ttagttgcca 1141 gcattcagat gggcactcta gcgagactgc cggtgacaaa ccggaggaag gcggggacga 1201 cgtcagatca tcatgcccct tatgacctgg gctacacacg tgctacaatg gcgtatacaa 1261 cgagttgcca acccgcgagg gtgagctaat ctcttaaagt acgtctcagt tcggattgta 1321 gtctgcaact cgactacatg aagtcggaat cgctagtaat cgcggatcag cacgccgcgg 1381 tgaatacgtt cccgggtctt gtacacaccg cccgtcacac catgggagtt tgtaatgccc 1441 aaagccggtg gcctaacctt tnaggaagga gcngtctaag gnagganaga nga // LOCUS LEURRNAL 1499 bp ss-rRNA RNA 04-JUL-1990 DEFINITION L.lactis 16S small subunit ribosomal RNA. ACCESSION M23031 M23032 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE L.lactis (strain DSM 20202) rRNA. ORGANISM Leuconostoc lactis Prokaryota; Bacteria; Firmicutes; Gram-positive cocci. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1499; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1499 16S rRNA BASE COUNT 396 a 320 c 435 g 316 t 32 others ORIGIN 1 nntaaattga gagtttgatc ctggctcagg atnaacgctg gcggcgtgcc taatacatgc 61 aagtcgaagc cgcagcgaaa ggtgcttgca ccnttcaagc gagtggcgaa cgggtgagta 121 acacgtggat aacctgcctc aaggctgggg ataacatttg gaaacagatg ctaataccga 181 ataaaactta gtatcgcatg atacaaagtt caaaggcgct acggcgncac ctagagatgg 241 gnccgcggtg cattagttag ttggtggggt aaaggcctac caagacaatg atgcatagcc 301 gagttgagag actnatcggc cacattggga ctgagacacg gcccnaactc ctacgggagg 361 cagcagtagg gaatcttcca caatgggcga aagcctnatg gagcaacgcc nngtgtgtga 421 tgaaggcttt agggtcgtaa agcacngttg tatgggaaga aatgctagaa tagggaatga 481 ttctagttcg acggtaccat accagaaagg gacggctaaa tacgtgccag cagcnncggt 541 aatacgtatg tccnnagcgt tatccggatt tattgggcgt aaagcgagcg cagacggttg 601 attaagtctg atgtgaaagc cnggagctca actccggnat ggcattggaa actggttaac 661 ttgagtgttg tagaggtaag tggaactcca tgtgtagcgg tggaatgcgt agatatatgg 721 aagaacacca gtggcgaagg cggcttactg gacaacaact gacgttgagg ctcnnaagtg 781 tgggtagcaa acaggattag ataccctggt agtccacacc gtaaacgatg aatactaggt 841 gttaggaggt ttccgcctct tagtgccnaa gctaacgcat taagtattcc gcctggggag 901 tacgaccgca aggttgaaac tcaaaggaat tgacggggac cngcacaagc ggtggagcat 961 gtggtttaat tcgaagnaac gcgaagaacc ttaccaggtc ttgacatcct ttgaagcttc 1021 tagagataga agtgttctct tcggagacaa agtgacaggt ggngcatggt cgtcgtcagc 1081 tcgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccttatt gttagttgcc 1141 agcattcagt tgggcactct agcgagactg ccggtgacaa accggaggaa ggcggggacg 1201 acgtcagatc atcatgcccc ttatgacctg ggctacacac gtgctacaat ggcgtataca 1261 acgagttgcc aacccgcgag ggtgagctaa tctcttaaag tacgtctcag ttcggactgc 1321 agtctgcaac tcgactgcac gaagtcggaa tcgctagtaa tcgcggatca gcacgccgcg 1381 gtgaatacgt tcccnggnct tgtacacacc gnnnntcacn ccatgggagt ttgtaatgcc 1441 cnaagccggt ggcctaacct tatggaggga gccgtctaag gcaggacaga tgactaggg // LOCUS LEURRNAM 1554 bp ss-rRNA RNA 04-JUL-1990 DEFINITION L.mesenteroides 16S small subunit ribosomal RNA. ACCESSION M23035 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE L.mesenteroides (DSM 20343) rRNA. ORGANISM Leuconostoc mesenteroides Prokaryota; Bacteria; Firmicutes; Gram-positive cocci. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1554; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1554 16S rRNA BASE COUNT 406 a 338 c 447 g 334 t 29 others ORIGIN 1 tcaaattgag agtttgatcc tggctcagga tgaacgctgg cggcgtgcct aatacatgca 61 agtcgaagcc acagcgaaag gtgcttgcac ctttcaagtg agtggcgaac gggtgagtaa 121 cacgtggaca acctgcctca aggctgggga taacatttgg aaacagatgc taataccgaa 181 taaaacttag tgtcgcatga cacaaagtta aaaggcgctt cggcgtcacc tagagatgga 241 tccgcggtgc attagttagt tggtggggta aaggcctacc aagacaatga tgcatagccg 301 agttgagaga ctgatcggcc acattgggac tgagacacgg cccaaactcc tacgggaggc 361 tgcagtaggg aatcttccac aatgggcgaa agcctgatgg agcaacgccg cgtgtgtgat 421 gaaggctttc gggtcgtaaa gcactgttgt atgggaagaa cagctagaat aggaaatgat 481 tttagtttga cggtaccata ccagaaaggg acggctaaat acgtgccagc agccgcggta 541 atacgtatgt cccgagcgtt atccggattt attgggcgta aagcgagcgc agacggttta 601 ttaagtctga tgtgaaagcc cggagctcaa ctccggnatg gcattggaaa ctggttaact 661 tgagtgcagt agaggtaagt ggaactccat gtgtngcggt ggaatgcgta gatatatgga 721 agaacaccag tggcgaaggc ggcttactgg actgcaactg acgttgaggc tcgnaagtgt 781 gggtagcaaa caggattaga taccctggta gtccacaccg taaacgatga acactaggtg 841 ttaggaggtt tccgcctctt agtgccnaag ctaacgcatt aagtgttccg cctggggagt 901 acgaccgcaa ggttgaaact caaaggaatt gacggggacc cgcacaagcg gtggagcatg 961 tggtttaatt cgaagcaacg cgaagaacct taccaggtct tgacatcctt tgaagctttt 1021 agagatagaa gtgttctctt cggagacaaa gtgacaggtg gtgcatggtc gtcgtcagct 1081 cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccttattg ttagttgcca 1141 gcattcagat gggcactcta gcgagactgc cggtgacaaa ccggaggaag gcggggacga 1201 cgtcagatca tcatgcccct tatgacctgg gctacacacg tgctacaatg gcgtatacaa 1261 cgagttgcca acccgcgagg gtgagctaat ctcttaaagt acgtctcagt tcggattgta 1321 gtctgcaact cgactacatg aagtcggaat cgctagtaat cgcggatcag cacgccgcgg 1381 tgaatacgtt cccgggtctt gtacacaccg cccgtcacac catgggagtt tgtaatgccc 1441 aaagccggtg gcctaacctt ttaggaagga gccgtctaag gcaggacaga tgaccggggn 1501 nnnnnngtaa caagnnnnnn nnnnnngaac ctgnnnnnng atcacctcct ttct // LOCUS LEURRNAO 1510 bp ss-rRNA RNA 04-JUL-1990 DEFINITION L.oenos 16S small subunit ribosomal RNA. ACCESSION M35820 M23032 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE L.oenos (strain DSM 20252) rRNA. ORGANISM Leuconostoc oenos Prokaryota; Bacteria; Firmicutes; Gram-positive cocci. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1510; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1510 16S rRNA BASE COUNT 387 a 316 c 431 g 351 t 25 others ORIGIN 1 nttaaattga gagtttgatc ctggctcagg atgaacgctg gtggcgtgcc taatacatgc 61 aagtcgtacg tagccgctga attgatcttc gggtgaagtg aggcaatgac tagagtggcg 121 aactggtgag taacacgtna gaaacctgcc ctttagtggg ggataacatt tggaaacaga 181 tgctaatacc gcgtaacaac aaatcacacn tgtgatctgt ttgaaaggtc cttttggatc 241 gctagaggat ggtcttgcgg cgtattagct tgttggtagg gtagaagcct accaaggcaa 301 tgatgcgtag ccgagttgag agactggctg gccncattgg gactgagaca ctgcccnaac 361 tcctacggga gnctgcagta gggaattttc cgcaatgcac gaaagtgtga cggagcgacg 421 ccgcgtgtgt gatgaaggct ttcgggtcgt aaagcactgt tgtaagggaa gaataactga 481 attcagagaa agttttcagc ttgacggtac cttaccagaa agggatggct aaatacgtgc 541 cagcagccgc ggtaatacgt atgtcccnag cgttatccgg atttattggg cgtaaagcga 601 gcgcagacgg tttattaagt ctgatgtgaa atcccgaggc ccaacctcgg tactgcattg 661 gaaactgatt tacttgagtg cgatagaggc aagtggaact ccatgtgtag cggtgaaatg 721 cgtagatatg tggaagaaca ccagtggcga aagcggcttg ctagatcgta actgacgttg 781 aggctcgaaa gtatgggtag caaacgggat tagatacccc ggtagtccat accgtaaacg 841 atgggtgcta gttgttaaga ggtttccgcc tcctagtgac gtagcaaacg cattaagcac 901 ccngcctgag gagtacggcc gcaaggctga aacttaaagg aattgacggg gnnccgcaca 961 agcggtggag catgtggttt aattcgaaga tacgcnnaaa accttaccag gtcttgacat 1021 accaatgatc gcttttgtaa tgaaagcttt tcttcggaac attggataca ggtggtgcat 1081 ggtcgtcgtc agctcgtgtc gtgagatgtt gggttnagtc ccgcaacgag cgcnaccctt 1141 gttattagtt gccagcattt ngttgggcac tctaatgaga ctgccggtga tnaaccggag 1201 gaaggtgggg acgacgtcag atcatcatgc cccttatgac ctgggcaaca cacgtgctac 1261 aatgggaagt acaacgagtc gcaaaccggc gacggtaagc taatctctta aaacttctct 1321 cagttcggac tggagtctgc aactcgactc cacgaaggcg gaatcgctag tnatcgcgaa 1381 tcagcatgtc gcggtgaatn cgttcccngg ncttgtacac nccgcnngtc aaatcntggg 1441 agtcggaagt acccaaagtc gcttggctaa cttttngagg ccggtgccta aggtaaaatc 1501 gatgactggg // LOCUS LEURRNAPS 1524 bp ss-rRNA RNA 04-JUL-1990 DEFINITION L.paramesenteroides 16S small subunit ribosomal RNA. ACCESSION M23033 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE L.paramesenteroides (strain DSM 20288) rRNA. ORGANISM Leuconostoc paramesenteroides Prokaryota; Bacteria; Firmicutes; Gram-positive cocci. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1524; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1524 16S rRNA BASE COUNT 404 a 326 c 435 g 340 t 19 others ORIGIN 1 nttaaaatga gagtttganc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc 61 aagtcgaacg ctttgtcttt aactgatatg aagagcttgc tctgatttga ttttatctga 121 caaagagtgg cgaacgggtg agtaacacgt gggtaaccnr cctcttagca ggggataaca 181 tttggaaaca agtgctaata ccgtataata ccaacaaccg catggtngtn ggttgaaaga 241 tggttctgct atcactaaga gatggacccn cggtgcatta gctagttggt aaggtaacgg 301 cttaccaagg caatgatgca tagccgagtt gagagactga tcggccacaa tgggactgag 361 acacggccca tactcctacg ggagncagca gtagggaatc ttccacaatg ggcgcaagcc 421 tgatggagca acgccnngtg tgtgatgaag ggtttcggct cgtaaaacac tgttataaga 481 gaagaacggc actgagagta actgttcagt gtgtgacggt atcttaccag aaaggaacgg 541 ctaaatacgt gccagcagcc gcggtaatac gtatgttccn agcgttatcc ggatttattg 601 ggcgtaaagc gagcgcagac ggttatttaa gtctgaagtg aaagccctca gctcaactga 661 ggaatggctt tggaaactgg atgacttgag tgcagtagag gaaagtggaa ctccatgtgt 721 agcggtgaaa tgcgtagata tatggaagaa caccagtggc gaaggcggct ttctggactg 781 taactgacgt tgaggctcga aagtgtgggt agcaaacagg attagatacc ctggtagtcc 841 acaccgtaaa cgatgagtgc tagatgttcg agggtttccg cccttgagtg tcgcagctaa 901 cgcattaagc actccgcctg gggagtacga ccgcaaggtt gaaactcaaa ggaattgacg 961 gggaccngca caagcggngg agcatgtggt ttaattcgaa gnaacgcgaa gaaccttacc 1021 aggncttgac atcccttgct aatcctagaa ataggangtt cccttcgggg acaaggtgac 1081 aggtggngca tggttgtcgt cagctcgtgt cgtgagatgt tgggttaagt cccgcaacga 1141 gcgcaaccct tattattagt tgccagcatt cagttgggca ctctagtgag actgccggtg 1201 acaaaccgga ggaaggtggg gatgacgtca aatcatcatg ccccttatga cctgggctac 1261 acacgtgcta caatggcata tacaacgagt cgccaacccg cgagggtgcg ctaatctctt 1321 aaagtatgtc tcagttcgga ttgtaggctg caactcgcct acatgaagtc ggaatcgcta 1381 gtaatcgcgg atcagcacgc cgcggtgaat acgttcccgg gncttgtaca caccgcccgt 1441 cacaccatga gagtttgtaa cacccaaagc cggtggggta accttttagg agccagccgt 1501 ctaaggtggg acagatgatn aggg // LOCUS LVIRRNAA 1582 bp ss-rRNA RNA 04-JUL-1990 DEFINITION L.viridescens 16S small subunit ribosomal RNA. ACCESSION M23040 KEYWORDS 16S ribosomal RNA; small subunit ribosomal RNA. SOURCE L.viridescens (strain DSM 20410) rRNA. ORGANISM Lactobacillus viridescens Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (sites) AUTHORS Yang,D. and Woese,C.R. TITLE Phylogenetic structure of the "leuconostocs": An interesting case of a rapidly evolving organism JOURNAL Syst. Appl. Microbiol. 12, 145-149 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1582; for [1]) AUTHORS Yang,D. and Woese,C.R. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 10-MAR-1989, for release after publication. Author address: C.R.Woese Microbiology Department 131 Burrill Hall 407 S. Goodwin Ave Urbana, IL 61801 Ph: 217-333-9369 E-mail carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1582 16S rRNA BASE COUNT 426 a 334 c 436 g 351 t 35 others ORIGIN 1 tacaaaatga gagtttgatc ctggctcagg atgaacgctg gcggcgtgcc taatacatgc 61 aagtcgaacg ctttgtggtc caactgattt gaagagcttg ctcagatatg acgatggaca 121 ttgcaaagag tggcgaacgg gtgagtaaca cgtgggaaac ctacctctta gcaggggata 181 acacttggaa acaagtgcta ataccgtata acactaataa ccgcatggtt attagttaaa 241 agntggtctt gctatcacta agagatggtc ccacggtgta ttagctagtt ggtaaggtaa 301 tggcttacca aggcaatgat acatagccga gttgagagac tgatcggcca caatgggact 361 gagacacggc ccatactcct acgggaggca gcagtaggga atcttccaca atggacgaaa 421 gtctgatgga gcaacgccgc gtgtgtgatg aagggtttcg gctcgtaaaa cactgttgta 481 agagaagaat gacattgaga gtaactgttc agtgtgtgac ggtatcttac cagaaaggaa 541 cggctaaata cgtgccagca gccgcggtaa tacgtatgtt ccaagcgtta tccggattta 601 ttgggcgtaa agcgagcgca gacggttatt taagtccgaa gtgaaagccc acagcttaac 661 tgtggaagtg ctttggaaac tggataactt gagtgcagta gaggagagtg gaactccatg 721 tgtagcggtg aaatgcgtag atatatggaa gaacaccagt ggcgaaggcg gctctctgga 781 ctgtaactga cgttgaggct cgaaagtgtg ggtagcaaac aggattagat accctggtag 841 tccacaccgt aaacgatgag tgctagatgt ttgagggttt ccgcccttaa gtgtcgcagc 901 taacgcatta agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg 961 acggggaccn gcacaagcgg tggagcatgt ggtttaattc gaagcaacgc gaagaacctt 1021 accaggtctt gacatccttt gaccacttca gagatgaagc tttcccttcg gggacaaagt 1081 gacaggtggt gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa 1141 cgagcgcaac ccttattact agttgccagc attcagttgg gcactctagt gagactgccg 1201 gtgacaaacc ggaggaaggt ggggatgacg tcaaatcatc atgcccctta tgacctgggc 1261 tacacacgtg ctacaatggc aagtacaacg agcagctaac ccgcgagggt acgcgaatct 1321 cttaaaactt gtctcagttc ggattgtagg ctgcaactcg cctacatgaa gtcggaatcg 1381 ctagtaatcg cggatcagca cgccgcggtg aatacgttcc cgggncttgt acacaccgcc 1441 cgtcacacca tgagagtttg taacacccaa agccggtgag gtaacctttt aggagccaac 1501 cgtctaaggt gggacagatg attagggnnn nnnngtaaca agnnnnnnnn nnnnnnnnnn 1561 nnnnnnngat cacctccttt ct // LOCUS DIRPARAM 2545 bp ss-mRNA INV 04-JUL-1990 DEFINITION D.immitis paramyosin mRNA, partial cds. ACCESSION M29733 KEYWORDS paramyosin. SOURCE D.immitis, cDNA to mRNA. ORGANISM Dirofilaria immitis Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 2545) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 2545 paramyosin (AA at 1) BASE COUNT 949 a 420 c 544 g 632 t ORIGIN 1 gcattcggta gtatgtcggt tgccgatttg ggaagtttaa ctcgcttgga ggacaagatt 61 cgtctactgc aagaagatct cgaatccgaa cgtgaacttc gaaatagaat cgaacgagaa 121 agagctgatc ttagtgtaca actgattgca ttaactgata gacttgaaga tgctgagggt 181 actactgata gtcagattga atcaaatcgt aaacgtgaag cagaattgca aaaattacgt 241 aaattattgg aagaatcaca attagaaaat gaagatgcaa tgaatgtttt acgtaaaaag 301 catcaagatg catgtctcga ttacgctgaa caaattgaac aattacaaaa gaaaaattca 361 aagattgatc gtgaacgtca acgtctgcaa catgaagtaa ttgagcttac tgcgacaatt 421 gatcaacttc aaaaggataa gcatttggcg gaaaaagcag cggaacgttt tgaagcgcaa 481 actatcgaat tgagtaataa agttgaagat ttaaatcgac atgttaatga tttagctcaa 541 caacgtcaac gtttacaagc tgaaaataac gatcttctca aagagattca tgatcaaaaa 601 gtacaattgg ataatttgca acacgtgaaa tatcaacttg cgcaacaact tgaagaagca 661 cgtcgacgtt tagaagatgc tgaacgcgaa cgttctcaac ttcaagcaca acttcatcaa 721 gtacaattag aattggattc tgtacgtacc gcattagatg aagaatcagc agctcgagca 781 gaagctgaac ataaacttgc tttagcaaat accgagatta cccaatggaa gagtaaattt 841 gatgctgaag ttgctcttca tcatgaagag gttgaagatt tacgaaagaa aatgctccaa 901 aaacaggctg aatatgagga acaaattgaa ataatgttgc aaaaaatttc acaacttgaa 961 aaagctaaat cacgcttaca aagtgaagtt gaagtattaa ttgttgattt ggaaaaagca 1021 caaaatacaa tagcaattct tgaacgagca aaagaacaac tcgaaaaaac tgttaatgaa 1081 ttaaaggtac gtattgatga acttactgtt gaacttgaag cagcacaacg tgaagctcgt 1141 gctgcattag cagaattaca aaagatgaaa aatttatacg aaaaagcaat tgagcaaaaa 1201 gaagccttag ctcgagaaaa taaaaaatta caagatgatt tgcatgaagc aaaagaagcg 1261 cttgctgatg caaatcgtaa attgcatgag cttgatcttg aaaatgcacg attagctggt 1321 gaaatacgag aattgcaaac agctcttaag gaatctgaag ctgcacgacg tgatgctgag 1381 aatcgtgcac aacgtgcact tgctgaatta caacaacttc gtattgaaat ggaacgtcgt 1441 ttgcaagaga aagaggaaga aatggaagca cttcgtaaga atatgcaatt tgaaattgat 1501 cgacttacag ctgcattagc cgatgctgaa gcacgtatga aggcggaaat ttcacgtcta 1561 aagaagaagt atcaagcgga aattgccgaa cttgaaatga ccgttgataa tttgaatcgg 1621 gcaaatattg aagcgcaaaa gacgattaaa aaacaaagtg aacaacttaa aatattgcaa 1681 gcatcacttg aggatactca acgtcaacta cagcaaactc ttgatcagta tgcattggcg 1741 caacgtaaag tttccgcact tagcgctgaa ttagaagaat gcaaagttgc acttgataat 1801 gcaattcgtg ctcgtaaaca agctgaaatt gatcttgaag aagcgaatgc tcgtattact 1861 gatttggtat caattaacaa taatttaaca gcaattaaaa ataaattgga aactgaatta 1921 tcgaccgcac aagctgattt ggatgaagca acgaaagaat tgcatgccgc ggatgagcgg 1981 gctaatcgtg ctttagcaga tgcagctcga gctgttgaac aacttcatga ggagcaggaa 2041 cattccatga aaattgatgc tctcaggaaa tcattagaag agcaggtgaa gcaattacaa 2101 gtacaaattc aagaagctga agctgcagca ttattgggtg gtaaacgtgt aattgctaaa 2161 cttgaaacac gtatacgtga tttggaaact gctttggatg aagaaacacg tcgacataag 2221 gaaacgcaag gtgcgcttag aaagaaagat cgacgcatca aagaagttca aatgcaagtt 2281 gatgaagagc ataagatgtt tgtgatggcg caagatactg ctgataggct tcttgaaaaa 2341 cttaacattc aaaagtggca acttggagaa gctgaatctt taacgatggc taatcttcaa 2401 cgagtacgtc gatatcaacg tgaattggaa gatgctgaag gtcgtgctga tcaagctgaa 2461 agctcattac atcttattcg tgctaaacat cgttcgtcag ttgttaccgg caaaaatgct 2521 tcagcatcta aaatttacgt tctcg // LOCUS ONGPARAM1 134 bp ss-mRNA INV 04-JUL-1990 DEFINITION O.volvulus paramyosin mRNA partial cds, clone OV-1. ACCESSION M29734 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 134) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 134 paramyosin (AA at 1) BASE COUNT 53 a 23 c 28 g 30 t ORIGIN 1 ttacaaaaga aaaattcaaa aattgaccgt gaacgtcaac gtctgcaaca tgaagtaatt 61 gagcttacag cggcaattga tcaacttcaa aaggataagc atttggctga gaaagcagca 121 gaacgttttg aagc // LOCUS ONGPARAM2 135 bp ss-mRNA INV 04-JUL-1990 DEFINITION O.volvulus paramyosin mRNA partial cds, clone OV-2. ACCESSION M29735 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 135) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 135 paramyosin (AA at 2) BASE COUNT 43 a 31 c 26 g 35 t ORIGIN 1 tttgcaacat gtcaaatatc aacttgcgca acaacttgaa gaagcacgtc gtcgtttgga 61 agatgctgaa cgtgaacgtt cacaacttca agcacaactt catcaagttc agctagaatt 121 ggattccgta cgtac // LOCUS ONGPARAM3 176 bp ss-mRNA INV 04-JUL-1990 DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-3. ACCESSION M29736 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 176) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 176 paramyosin (AA at 1) BASE COUNT 74 a 20 c 36 g 46 t ORIGIN 1 aaagttgctc ttcatcatga agaggttgaa gatttacgaa agaaaatgtt gcaaaagcaa 61 gctgaatatg aagagcaaat tgaaataatg ctgcaaaaaa tttcacaact tgaaaaggct 121 aaatcacgtt tgcaaagtga agttgaagta ttgattgttg atttagaaaa agcaca // LOCUS ONGPARAM4 414 bp ss-mRNA INV 04-JUL-1990 DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-4. ACCESSION M29737 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 414) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 414 paramyosin (AA at 3) BASE COUNT 143 a 70 c 101 g 100 t ORIGIN 1 ccttagctcg agaaaataaa aagttgcaag atgatttgca tgaagctaaa gaagcgcttg 61 ctgatgcaaa tcgtaaattg catgaacttg atcttgagaa tgcacgatta gctggtgaaa 121 ttcgagaatt gcaaacagca ctcaaagaat ctgaagcagc tcgacgtgat gctgagaatc 181 gtgcacaacg tgcacttgct gaattgcaac aacttcgcat cgaaatggaa cgtcgtttgc 241 aagagaagga agaagaaatg gaagcacttc gtaagaatat gcaatttgaa attgatcgtc 301 ttacagctgc attagctgat gctgaagcac gtatgaaggc ggagattgca cgtttgaaga 361 aaaagtatca ggcggaaatt gccgaacttg aaatgaccgt tgataatttg aatc // LOCUS ONGPARAM5 145 bp ss-mRNA INV 04-JUL-1990 DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-5. ACCESSION M29738 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 145) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 145 paramyosin (AA at 3) BASE COUNT 48 a 28 c 28 g 41 t ORIGIN 1 tattgcaagc atcacttgaa gatactcaac gtcaattaca acaaactcta gatcagtatg 61 ctttggcaca acgtaaagta tctgcgctta gtgctgaatt ggaagaatgc aaagttgcac 121 ttgataatgc aattcgtgct cgtaa // LOCUS ONGPARAM6 127 bp ss-mRNA INV 04-JUL-1990 DEFINITION O.volvulus paramyosin mRNA, 3' end, clone OV-7. ACCESSION M29740 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 127) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 56 paramyosin (AA at 30 mRNA < 1 127 paramyosin mRNA BASE COUNT 44 a 20 c 24 g 39 t ORIGIN 1 ctgctggcaa aaatgcttca gcatcgaaga tttacgttct cgaagatgat cagtagatag 61 atagatggtg ataatagttg aaatcaactg atatcatctg ttaacaatca ttaaaatgtg 121 tctcatc // LOCUS ONGPARAM7 129 bp ss-mRNA INV 04-JUL-1990 DEFINITION O.volvulus paramyosin mRNA, partial cds, clone OV-6. ACCESSION M29739 KEYWORDS paramyosin. SOURCE O.volvulus, cDNA to mRNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 129) AUTHORS Limberger,R.J. and McReynolds,L.A. TITLE Filarial paramyosin: cDNA sequences from Dirofilaria immitis and Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 38, 271-280 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 129 paramyosin (AA at 3) BASE COUNT 48 a 20 c 31 g 30 t ORIGIN 1 aagtacaaat tcaagaagct gaagcagcag cattattggg tggtaaacgt gtaattgcta 61 aacttgaaac acgtatacgt gatttggaaa ctgctttgga tgaagaaaca cgtcgacata 121 aggaaacgc // LOCUS CORDTXRA 2529 bp ds-DNA BCT 04-JUL-1990 DEFINITION C.diphtheriae toxin repressor (dtxR) gene, complete cds. ACCESSION M34239 KEYWORDS regulatory protein; toxin repressor. SOURCE C.diphtheriae (strain C7) DNA, clone pHH2500.. ORGANISM Corynebacterium diphtheriae Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 1 to 1390) AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R. TITLE Molecular cloning and DNA sequence analysis of the diptheria tox regulatory element (dtxR) from Corynebacterium diptheriae JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2529) AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.R.Murphy, 11-MAY-1990. University Hospital Dept Medicine-Rm E613 88 East Newton Street Boston, MA 02118-2393 FEATURES from to/span description pept 681 1361 toxin repressor protein (dtxR) pept 25 456 ORF1 pept 1384 2370 ORF3 signal 595 629 promoter sequence binding 664 669 ribosomal binding site (put.) binding 1200 1283 DNA-binding region (put.) BASE COUNT 691 a 615 c 593 g 630 t ORIGIN 1 bp upstream of HindIII site. 1 aagctttctc gaatcaaacg cgagatgtat caacagctcg gacgtgaagc aaccaacgaa 61 gagcttgcgg aagaatctgg cattgaagaa tccaaaattg aaatgttgct tcgccagtct 121 cgtgacccag taagcctaga tatgccagtt ggtgccgatg aagaggctcc acttggtgac 181 tttatcgaag attccgaggc gactgatgcc gaatcagctg tagttgcgtc gttgcgtcac 241 tctgatatca gaacagtctt ggacactttg gagcaacgtg aacaggacgt tattcgcctt 301 cgttatgggc tcgatgatgg agttccccgc actttagatc aaattggtcg tcaatttgga 361 ctgtctcgtg aaagagttcg ccaaattgaa cgcgaggtca tgtcgaaact gcgtgacggt 421 gcccgtgccg aaaaacttcg agcatacgca caataaagcg tttgcttaga tatgcctacc 481 aataaagaca taaacgccta ttaaaagcaa tctttagatt aggcgttata attaaagtct 541 catcgaaaaa cgcgctgcgg gactacaacg caacaagaaa actattccat atttttcacg 601 ctacaattcg ttgtagattg ataggaattg atcaccagca cacaacagtc tccatggcac 661 tataaggaaa gaggcttaca atgaaggact tagtcgatac cacagagatg tacttgcgta 721 ctatctatga gctggaagaa gagggagtca cccctcttcg cgctaggatc gctgagcgtc 781 tggaacaatc tggacctaca gttagccaaa ccgttgcccg tatggagcgc gatggacttg 841 tcgttgtcgc ctcagaccgc agtctacaaa tgacaccgac aggccgcact ttagcgactg 901 cagttatgcg taaacatcgc ttagctgagc gccttcttac cgatatcatt ggtctagata 961 tcaataaagt tcacgatgaa gcctgccgct gggaacacgt tatgagtgac gaagttgaac 1021 gcaggctcgt gaaagtattg aaagatgtca gtcggtcccc cttcggaaac ccaattccag 1081 gtctcgacga actcggcgta ggcaattctg acgcggcagc ccccggaact cgcgttattg 1141 acgctgccac cagcatgccc cgcaaagtac gcattgttca gattaacgaa atctttcaag 1201 ttgaaacgga tcagtttaca cagctcctcg atgctgacat ccgtgttgga tcagaagtcg 1261 aaattgtaga tagagacggc cacatcacgt tgagccacaa tggaaaagat gtcgaactcc 1321 tcgatgatct ggctcacact attcgtatcg aagaactcta aatactaaag gcggcgaaat 1381 tagatgaaac tcctcgttac cggtggcgcc ggatacgtag gaagtgtctg ttccactgtt 1441 ctgctcgaac aggggcacga agtaacaatt gtcgataatc ttacaaccgg caatcgcgat 1501 gctgttccgc taggagccac ttttgtcgag ggtgatatca aagacgttgc ggaaaacgtt 1561 ttatcatctg attcttttga tgcggttctt cacttcgcgg ctcgttcgct tgtaggcgaa 1621 tcagttgaaa agccagatga atattggcag cacaacatgg tgaccacact cgcacttctt 1681 gatgccatga aacgtaataa cgtgcgaaac attgtctttt cttctacagc tgctacatac 1741 ggcgaacccg aaacagtccc gattactgaa gacgctccta cccacccaac gaatccatac 1801 ggcgcaacca agctgtctat cgactacgcg atcacatcat acgcacatgc atatgggttt 1861 gctgcgacga gtctgcggta cttcaacgta gctggcgcat atggtttggt gggtgaaaat 1921 cgagaaatag aaacccatct cattccgctc gttttacaag tagcactggg acaccgtgac 1981 aagattttca tgtttggtga cgactggcca actgaagatg gaacccctat ccgcgattac 2041 attcatattc gtgatcttgc agatgcccat attttggcac ttcaaagtaa cgtcgaaggc 2101 agccaccgga tctttaatct cggatccggt gaaggatatt cggtcaaaca ggtcattgat 2161 acctgcagag aagttaccgg tcatcccatc cccgctgagg tagctcctcg acgggctggt 2221 gatcctgcag tgctcattgc atcatcggct aaggcgcaat ccgaactagg atggaagcca 2281 caacgcactg atctgcacac gattgtttct gatgcttggg cttttacgtc tcaactcgga 2341 gacaaagccc atagcgcttc tcgtggataa caacgagtag gtttatcttt tgcagtccgc 2401 ggtgtgatta cttaagtatt cacaccgcgg attttacatt tgttgactca ccaaaatgtt 2461 tctgagcttc ctgactccca ttgataacaa gatcaacaat ggatccagat agacctcgat 2521 aataagctt // LOCUS ECOANSBA 1643 bp ds-DNA BCT 04-JUL-1990 DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds. ACCESSION M34234 KEYWORDS L-asparaginase II; ansB gene. SOURCE E.coli (strain K12 isolate JM108) DNA, clones lambda-ansB and pansB7. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1643) AUTHORS Bonthron,D.T. TITLE L-asparaginase II of Escherichia coli K-12: Cloning and sequencing of the ansB gene and its position on the chromosomal map JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.T.Bonthron, 11-MAY-1990. FEATURES from to/span description pept 346 1392 L-asparaginase II (ansB) (EC 3.5.1.1) sigp 346 411 L-asparaginase II signal peptide matp 412 1389 L-asparaginase II binding 335 339 ribosomal binding site (put.) site 1400 1425 transcription termination signal (put.) BASE COUNT 442 a 404 c 405 g 392 t ORIGIN 63.8 min on K12 map. 1 gcggtaaccc attacagaat gtgctgggaa gcctgggggg gctgcaatcc tcaatccaaa 61 ccgagtggaa aaagcaggaa aaagatttcc agcagtttgg caaagatgtt tgtagccgcg 121 ttgtgactct ggaagatagc cgcaaagccc tggtcgggaa tttaaaataa tcctctattt 181 taagacggca taatactttt ttatgccgtt taattcttcg ttttgttacc tgcctctaac 241 tttgtagatc tccaaaatat attcacgttg taaattgttt aacgtcaaat ttcccataca 301 gagctaaggg ataatgcgta gcgttcacgt aactggagga atgaaatgga gtttttcaaa 361 aagacggcac ttgccgcact ggttatgggt tttagtggtg cagcattggc attacccaat 421 atcaccattt tagcaaccgg cgggaccatt gccggtggtg gtgactccgc aaccaaatct 481 aactacacag tgggtaaagt tggcgtagaa aatctggtta atgcggtgcc gcaactaaaa 541 gacattgcga acgttaaagg cgagcaggta gtgaatatcg gctcccagga catgaacgat 601 aatgtctggc tgacactggc gaaaaaaatt aacaccgact gcgataagac cgacggcttc 661 gtcattaccc acggtaccga cacgatggaa gaaactgctt acttcctcga cctgacggtg 721 aaatgcgaca aaccggtggt gatggtcggc gcaatgcgtc cgtccacgtc tatgagcgca 781 gacggtccat tcaacctgta taacgcggta gtgaccgcag ctgataaagc ctccgccaac 841 cgtggcgtgc tggtagtgat gaatgacacc gtgcttgatg gccgtgacgt caccaaaacc 901 aacaccaccg acgtagcgac cttcaagtct gttaactacg gtcctctggg ttacattcac 961 aacggtaaga ttgactacca gcgtaccccg gcacgtaagc ataccagcga cacgccattc 1021 gatgtctcta agctgaatga actgccgaaa gtcggcattg tttataacta cgctaacgca 1081 tccgatcttc cggctaaagc actggtagat gcgggctatg atggcatcgt tagcgctggt 1141 gtgggtaacg gcaacctgta taaatctgtg ttcgacacgc tggcgaccgc cgcgaaaacc 1201 ggtactgcag tcgtgcgttc ttcccgcgta ccgacgggcg ctaccactca ggatgccgaa 1261 gtggatgatg cgaaatacgg cttcgtcgcc tctggcacgc tgaacccgca aaaagcgcgc 1321 gttctgctgc aactggctct gacgcaaacc aaagatccgc agcagatcca gcagatcttc 1381 aatcagtact aatcgcctcg ccccggtatc gtgccggggc tttttcactt cagactcacg 1441 tccattgcca attttaatta ccctaatgat aatcaccgga ataaattatt ccgcgtgagg 1501 tttttcgggt gaaaaagcaa tggattgttg gcacggcact gcttatgttg atgactggta 1561 atgcctgggc agatggcgaa ccgccaactg aaaatatctt aaaagatcaa ttcaaaaagc 1621 agtatcacgg cattctcaag ctt // LOCUS HUMADE5RC 1946 bp ds-DNA SYN 04-JUL-1990 DEFINITION Synthetic human/adenovirus type 5 recombination junction. ACCESSION M34061 KEYWORDS . SOURCE Human fetal bone marrow fibroblasts transformed with Adeno5/SV40 recombinant virus (cell line H13.1) DNA, clone pH13.1SV1. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 1946) AUTHORS Romani,M., De Ambrosis,A., Alhadeff,B., Purrello,M., Gluzman,Y. and Siniscalco,M. TITLE Preferential viral integration at the highly recombinogenic chromosomal site 1p36 in human cells transformed with an Adeno5/SV40 hybrid JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Romani 08-MAY-1990. FEATURES from to/span description recomb 429 430 adenovirus type 5 end/Human DNA start rpt 430 1134 tandem Alu repeats BASE COUNT 492 a 548 c 510 g 396 t ORIGIN Chromosome 1p36. 1 tttgacccgg aacgcggaaa acacctgaga aaaacacctg ggcgagtctc cacgtaaacg 61 gtcaaagtcc ccgcggccct agacaaatat tacgcgctat gagtaacaca aaattattca 121 gatttcactt cctcttattc agttttcccg cgaaaatggc caaatcttac tcggttacgc 181 ccaaaattta ctacaacatc cgcctaaaac cgcgcgaaaa ttgtcacttc ctgtgtacac 241 cggcgcacac caaaaacgtc acttttgcca catccgtcgc ttacatgtgt tccgccacac 301 ttgcaacatc acacttccgc cacactacta cgtcacccgc cccgttccca cgccccgcgc 361 cacgtcacaa actccacccc ctcattatca tattggcttc aatccaaaat aaggtatatt 421 attgatgata tataatatag aggctgaggc aggagggttg cttgagcctg ggaggctgcg 481 gctgcagtga gccatgatca caccactgca cttcagcctg ggcaacagag cgagactccg 541 tctcaaaaaa tagtaatagg ccaggcgcgg tggctcatgc cagtaatccc agcgctttgg 601 gaggctgagg caggcagatc acctgaggtc aggagttcaa gaccagcctg gccaacatgg 661 tgaaaccctg tctctactaa aaatacaaaa attagttggg catggtggcg gcgcacacca 721 gtaatcccag ctactcaaga ggctgagggc acttgggcag cagaggaggt tgtagtgagc 781 agagatcacg ccactgcact ctagcctggg caacagaggg agactctgtc tcaaaaaata 841 ataggccggg cacggtggct catgcctgta atcccagcac tttgggaggc cgaggcgggt 901 ggatcacctg agatcaggag tttaagacca gcctggccaa catggtgaaa ccccatctct 961 actaaaaata caaaaaatta gccaggcgtg gtggcgggcg tctgtaatcc cagctacttg 1021 ggaggctgaa gcaggagaat cacttgagcc caggaggcag aggttgcagt gagctgagat 1081 catgccatta cactccagcc tgggcaacaa gagtgaaact ccgtctcaaa aaataataat 1141 aataatgata ataataataa taattgcatc tattcactga gcgcctcctg tacaccaggc 1201 accatgtggc attaggggcc catgctgtca tcatattctg ccagccaccc atggggtggt 1261 ggggtaattc cccaggccac acctgtccac acctctcttc aggtactgcg tgggcatgtg 1321 cggagacggc gccaatgact gtggggccct gaaggcggct gatgtcggca tctcgctgtc 1381 ccaggcagaa gcctcagtgg tctcaccctt cacctcgagc atggccagta ttgagtgcgt 1441 gcccatggtc atcaggtaag gcaggcaggg accggtgggt gagggctcag cagggctggt 1501 cagccaacct gggtccccta atgtccgtgc cctgccaccc agggaggggc gctgttccct 1561 tgacacttcg ttcagcgtct tcaagtacat ggctctgtac agcctgaccc agttcatctc 1621 cgtcctgatc ctctacacgg tgagtatctg cagagctccg tacccagagc cgtcccagga 1681 ctagggaggg gacacagccc tgtgccttgg agctggcagg tcccagatag atgacgtggg 1741 cccagatcct gaccctgcca ctctccggct gtgtggttgg agaggtggca aggcttgggg 1801 gagatgagaa aaacaccagg cacggagcct ggtacccact aggtcctcac aaatggcagc 1861 agctgtcatg ggctaacatc ttgaccgcaa ctctgggagc tccccaatct gcagggaaga 1921 ctgaacggag gcagggccaa aaccct // LOCUS MCLPCCA 1701 bp ds-DNA BCT 04-JUL-1990 DEFINITION M.laminosus phycoerythrocyanin (PCC) alpha and beta subunit genes, complete cds, and linker protein, 5' end. ACCESSION M34254 KEYWORDS light-harvesting biliprotein; phycoerythrocyanin. SOURCE M.laminosus (strain Fischerella PCC 7603) DNA. ORGANISM Mastigocladus laminosus Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. REFERENCE 1 (bases 1 to 1701) AUTHORS Eberlein,M. and Kufer,W. TITLE Genes encoding both subunits of phycoerythrocyanin, a light- harvesting, biliprotein from the cyanobacterium Mastigocladus laminosus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Kufer, 11-MAY-1990. Botanisches Institut Ludwig-Maximilians- Universitaet Muenchen Menzingerstr. 67 D-8000 Muenchen 19 Federal Republic of Germany FEATURES from to/span description pept 279 797 phycoerythrocyanin beta-subunit pept 884 1372 phycoerythrocyanin alpha-subunit pept 1528 > 1701 phycoerythrocyanin linker protein precursor matp 1531 > 1701 phycoerythrocyanin linker protein BASE COUNT 526 a 312 c 383 g 480 t ORIGIN 1 agctttacat ttaaggatat ggcgctcaaa gaatgggagt tttacagtta ttaaataaag 61 ttcataaaaa agaatacaag taatatataa aatacgaaac taaggtttga aagtaattgc 121 aaatatcaag gcaaacgaaa agataaaggc taattgttgc acgtaaaaat ggtatttttt 181 tgatttttag gtgtagttat tggcggattt tatctggtct cggtttgtct tgattaaaaa 241 tgcaatagct gccatgaaat cgaggagaaa atagaaaaat gcttgatgct ttttccagag 301 tagttgaaca agctgataaa aaaggtgctt acctaagcaa tgatgaaata aatgctttgc 361 aggctatagt tgcagatagc aacaaacgtt tggatgtagt gaatcggctt acaagcaacg 421 cctcctcaat tgtggcgaat gcttatcgtg ctttagtagc agaacgtccg caggtattta 481 atcctggtgg tccttgtttt caccatcgca accaagctgc ttgcatccgc gatttaggat 541 ttattctgcg ttatgttacg tactccgtac ttgctggtga cactagtgtc atggacgatc 601 gctgcttgaa tggtttgcgc gaaacctatc aagctcttgg tactcccggt gatgctgtag 661 cttctggaat taaaaaaatg aaagaagctg cacttaaaat tgctaacgat cccaatggca 721 ttaccaaagg tgattgcagt caattaatgt ctgaattagc aagttatttt gaccgagcag 781 cggctgctgt tgcttaatgc agtcaaaggc gcgactaagt gacaaaattt tcaaactctt 841 ttttagtaat taatttgaca agtttttgga gaagaagcga atcatgaaga ccccgttaac 901 cgaagcgatc gccgccgccg atcttcgcgg ttcttatttg agcaacacag aattgcaagc 961 agtatttggt cgtttcaatc gcgctcgtgc tggtttagaa gcagcaagag cattcgctaa 1021 caacggtaag aaatgggcgg aagcagcagc caatcacgta tatcaaaaat ttccctacac 1081 tacccaaatg caaggacctc agtatgcttc tactccagaa gggaaagcaa agtgcgtgcg 1141 ggatatcgac cattacttac gcacaattag ctactgctgt gtagttggcg gtactggtcc 1201 cttggatgac tatgtagttg caggcttaaa ggaatttaac agcgccctgg gtttatctcc 1261 cagctggtac atagctgctc tagaatttgt ccgcgacaat cacggtttga ccggtgatgt 1321 tgctggtgaa gccaacacct acatcaacta tgcaattaat gcattgagct aaatctaaac 1381 agcgaatgtc ctttgtattt gcttaatagc taatggctga tagctgatgg ctaatggcta 1441 atggaaaaat gcaattagca attagcagtg agttattggc cattaactgt gtttgagaaa 1501 aatgataaaa atggagcgat tgttaacatg agtacatcag tcgcagaacg cctcgcaatt 1561 aaagatgaag tagacaaaaa aattgaattg cgccctaatt ggagcgaaga cgagctgcaa 1621 attgttttta aaacagccta cgagcaagtt tttggacgtc agggattata tgccagtcaa 1681 agatttgcaa ctgccgaagc t // LOCUS YSCTSDNAA 238 bp ds-DNA PLN 04-JUL-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-ii. ACCESSION M34310 KEYWORDS . SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-ii. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 238) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 146 g 92 t ORIGIN 1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg 61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg 121 gtgtggtgtg tgggtgtggt gtgtgggtgt ggtgtgggtg tgggtgtggt gtgtgtgggt 181 gtggtgtgtg ggtgtgggtg tggtgtggtg tgtgtgggtg tgtgggtgtg gtgtgtgt // LOCUS YSCTSDNAB 297 bp ds-DNA PLN 04-JUL-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-i. ACCESSION M34311 KEYWORDS . SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-i. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 297) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 184 g 113 t ORIGIN 1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg 61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg 121 gtgtggtgtg ggtgtgggtg tgggtgtggt gtgtgtgtgg gtgtggtgtg tgggtgtggg 181 tgtgtgtggg tgtgggtgtg ggtgtggtgt ggtgtgtggg tgtggtgtgt gggtgtggtg 241 tgtgtgggtg tggtgtgtgt gggtgtgggt gtgtgtgggt gtgggtgtgg tgtgtgg // LOCUS YSCTSDNAC 325 bp ds-DNA PLN 04-JUL-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-ii. ACCESSION M34312 KEYWORDS . SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-4-ii. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 325) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 199 g 126 t ORIGIN 1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg 61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg 121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg gtgtgtgggt gtggtgtggg tgtggtgtgt 181 gggtgtgtgg gtgtgggtgt gtgtggtgtg tggtgtgtgt ggtgtgggtg tggtgtgtgg 241 gtgtgggtgt ggtgtgtggg tgtgggtgtg gtgtgtgtgt gtgggtgtgg tgtgtgggtg 301 tgggtgtggt gtgtgggtgt gtggg // LOCUS YSCTSDNAD 248 bp ds-DNA PLN 04-JUL-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-i. ACCESSION M34313 KEYWORDS . SOURCE S.cerevisiae (strain XS595-6C) extrachromosomal plasmid DNA, clone YLP108CA-4-i. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 248) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 151 g 97 t ORIGIN 1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg 61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg 121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg tgtgtgtggg tgtggtgtgg gtgtggtgtg 181 tgggtgtgtg tgggtgtggt gtgtgtgggt gtgtgggtgt ggtgtgtgtg ggtgtggtgt 241 gtgggtgg // LOCUS BOVMET 1047 bp ss-mRNA MAM 04-JUL-1990 DEFINITION Bovine metalloproteinase inhibitor mRNA, complete cds. ACCESSION M32303 KEYWORDS metalloproteinase inhibitor. SOURCE Bovine aorta endothelium, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1047) AUTHORS Boone,T.C., Johnson,M.J., De Clerck,Y.A. and Langley,K.E. TITLE cDNA cloning and expression of a metalloproteinase inhibitor related to tissue inhibitor of metalloproteinases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2800-2804 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K.E.Langley, 23-FEB-1990, for release after publication. FEATURES from to/span description pept 291 953 metalloproteinase inhibitor precursor sigp 291 368 metalloproteinase inhibitor signal peptide matp 369 950 metalloproteinase inhibitor signal 1042 1047 polyA signal BASE COUNT 220 a 350 c 312 g 165 t ORIGIN 1 gaattccggc ttctatggag cactcgggac caggtccgcg gcgcgcgcac tcgctcgctc 61 gccgcccccc agccagctct cgcttccgcg ccgccagccg cgccccgcgc ctcctcgctg 121 caccccgcga cctagagcca agaaagtttg tgtggcgagt gagggccgga gaggagagcg 181 cgcccgcgga gtgccgtcca gaccagcgcg gccccggcgg agaggggagc gccccgagcc 241 caggcggcgg cggctagccc gagtccgcga cccccgcccc tccgcccgcc atgggcgccg 301 ccgcccgcag cctgccgctc gcgttctgcc tcctgctgct ggggacgctg ctcccccggg 361 ccgacgcctg cagctgctcc ccggtgcacc cgcaacaggc gttttgcaat gcagacatag 421 tgatcagggc caaagcagtc aataagaagg aggtggactc tggcaacgac atctacggca 481 accccatcaa gcggattcag tatgagatca agcagataaa gatgttcaag ggacctgatc 541 aggacataga gtttatctac acagcccccg ccgctgccgt gtgtggggtc tcgctggaca 601 ttggaggaaa gaaggagtat ctcattgcag ggaaggccga ggggaatggc aatatgcata 661 tcaccctctg tgacttcatc gtgccctggg acaccctgag tgccacccag aagaagagcc 721 tgaaccacag gtaccagatg ggctgtgagt gcaagatcac tcgatgcccc atgatcccat 781 gctacatctc ctctccggac gagtgcctct ggatggactg ggtcacggag aagaacatca 841 acggacacca ggccaagttc ttcgcctgca tcaagagaag cgacggctcc tgcgcctggt 901 accgcggagc agcacccccc aagcaggagt ttctggacat cgaggacccg taagcaggcc 961 accaggactc ctggggccaa ttgacagtgt ccaagagttc agactggtcc agctccgaca 1021 tcccttcctg gacacagcat gaataaa // LOCUS CFDCG 1291 bp ss-DNA circular VRL 04-JUL-1990 DEFINITION Coconut foliar decay virus, complete genome. ACCESSION M29963 KEYWORDS circular; complete genome. SOURCE Coconut foliar decay virus DNA, clones NMORG, pORG, and pORGSAU3A. ORGANISM Coconut foliar decay virus Viridae; ss-DNA viruses. REFERENCE 1 (bases 1 to 1291) AUTHORS Rohde,W., Randles,J.W., Langridge,P. and Hanold,D. TITLE Nucleotide sequence of a circular single-stranded DNA associated with coconut foliar decay virus JOURNAL Virology 176, 648-651 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by W.Rhode, 15-MAR-1989, for release after publication. FEATURES from to/span description pept 103 975 ORF 1 pept 314 775 ORF 2 pept 639 797 ORF 3 pept 1098 1286 ORF 4 pept 987 823 (c) ORF 5 pept 568 422 (c) ORF 6 site 40 70 stem-loop structure BASE COUNT 336 a 323 c 332 g 300 t ORIGIN 1 cgccaaaaac ctctgctaag tcccgtgcta agtataaata gccgcggggg ctagtattac 61 ccccgcggct ccccaacctc tgctaacccc gcttggctat aaatgggttc ctccattcgc 121 cgctggtgct tcactctgaa ctacgagaca gaggaagagg cggcgaatgt tgtaaggcgc 181 atcgagtccc ttaacttggt ctatgctata gttggggacg aggttgcacc ttcaactggc 241 caacgacacc tccaaggatt catccacttg aagaccggtc ggcgactgca aggattgaag 301 actgttcttg ggaatgacag gattcacctg gagccgaccc gtggttccga cgaacagaat 361 agagactact gttcgaagga acgggtgctt ctcgagcacg gagtcccgac tcgtcctgga 421 gtcaaaaggc cacgattggc ccaacgattt gctgaggaac ctgatgaact ccgcctggaa 481 gacccaggcg gataccgaag atgcgttgta cacggagctt cggtggaatg gacaagatgg 541 gccgctgaaa atccgttccc atttccatat cacaattggc agcttgaagt gctgtctgcg 601 atcggagagc cagcggacga tcgcacaatc ctctggatat gcggacgaga cggaggagac 661 gggaagtccg tgtttgccaa atatctcgga ctcaagcccg actggttcta cacatgtggt 721 ggaaccagaa aggacgtatt gtaccagtac atcgaggacc caaaacgaaa tttaatcctc 781 gatgtaccca ggtgtaattt agagtattta aattatgccc tgttagaatg tgttaagaac 841 agggcattca gttcggacaa atacgaaccc cttagttatc ttgggttcga ccatgtgcat 901 gtactcgtat ttgccaatgt cctgcctgat tatttgaaaa tcagcaggga cagaataaaa 961 ctgtggaata tttaaagtat gtgtcatcta aattacacca atacccgccc gcccacgcgc 1021 tatcgtttac atcttatgaa tatcctgccc aggccgaagg cctgggaggt gctacccggc 1081 cgaaggccgg gaacaatatg aatcgagtta tgggcgggcc cacaataaaa gattccattt 1141 ggataagaac gaatctgtta ctttgcttgc agtgcacgca accactttcc acgtcaccaa 1201 tccaggtgag tagcttgctg gagaagaaag ccgcaagcct ctatctaccg tccatttgtt 1261 tttgcgcgat cggacggctg agttgatctg g // LOCUS HUMMET 1035 bp ss-mRNA PRI 04-JUL-1990 DEFINITION Human metalloproteinase inhibitor mRNA, complete cds. ACCESSION M32304 KEYWORDS metalloproteinase inhibitor. SOURCE Human fetal aorta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1035) AUTHORS Boone,T.C., Johnson,M.J., De Clerck,Y.A. and Langley,K.E. TITLE cDNA cloning and expression of a metalloproteinase inhibitor related to tissue inhibitor of metalloproteinases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2800-2804 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K.E.Langley, 23-FEB-1990, for release after publication. FEATURES from to/span description pept 255 917 metalloproteinase inhibitor precursor /nomgen="TIMP" /map="Xp11.3-p11.23" /hgml_locus_uid="LJ0125P" sigp 255 332 metalloproteinase inhibitor signal peptide matp 333 914 metalloproteinase inhibitor signal 1011 1016 polyA signal BASE COUNT 216 a 361 c 306 g 152 t ORIGIN 1 gaattccggc ccgccgtccc ccaccccgcc gccccgcccg gcgaattgcg ccccgcgccc 61 ctcccctcgc gcccccgaga caaagaggag agaaagtttg cgcggccgag cggggcaggt 121 gaggagggtg agccgcgcgg gaggggcccg cctcggcccc ggctcagccc ccgcccgcgc 181 ccccagcccg ccgccgcgag cagcgcccgg accccccagc ggcggccccc gcccgcccag 241 ccccccggcc cgccatgggc gccgcggccc gcaccctgcg gctggcgctc ggcctcctgc 301 tgctggcgac gctgcttcgc ccggccgacg cctgcagctg ctccccggtg cacccgcaac 361 aggcgttttg caatgcagat gtagtgatca gggccaaagc ggtcagtgag aaggaagtgg 421 actctggaaa cgacatttat ggcaacccta tcaagaggat ccagtatgag atcaagcaga 481 taaagatgtt caaagggcct gagaaggata tagagtttat ctacacggcc ccctcctcgg 541 cagtgtgtgg ggtctcgctg gacgttggag gaaagaagga atatctcatt gcaggaaagg 601 ccgaggggga cggcaagatg cacatcaccc tctgtgactt catcgtgccc tgggacaccc 661 tgagcaccac ccagaagaag agcctgaacc acaggtacca gatgggctgc gagtgcaaga 721 tcacgcgctg ccccatgatc ccgtgctaca tctcctcccc ggacgagtgc ctctggatgg 781 actgggtcac agagaagaac atcaacgggc accaggccaa gttcttcgcc tgcatcaaga 841 gaagtgacgg ctcctgtgcg tggtaccgcg gcgcggcgcc ccccaagcag gagtttctcg 901 acatcgagga cccataagca ggcctccaac gcccctgtgg ccaactgcaa aaaaagcctc 961 caagggtttc gactggtcca gctctgacat cccttcctgg aaacagcatg aataaaacac 1021 tcatccccgg aattc // LOCUS HUMACEX 1141 bp ds-DNA PRI 04-JUL-1990 DEFINITION Human angiotensin-converting enzyme (ACE) gene, 5' end. ACCESSION M34434 KEYWORDS angiotensin-converting enzyme. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1141) AUTHORS Shai,S.-Y., Langford,K.G., Martin,B.M. and Bernstein,K.E. TITLE Genomic DNA 5' to the mouse and human angiotensin-converting enzyme genes contains two distinct regions of conserved sequence JOURNAL Biochem. Biophys. Res. Commun. 167, 1128-1133 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1136 > 1141 angiotensin-converting enzyme BASE COUNT 169 a 427 c 364 g 181 t ORIGIN 1 ctgtccctgg ctccttccct gatcccaccg ccagcctcac cccacggttc ctccattgcc 61 ccacctccca ctgcgccgcc gggcctctgc cagggtcaag gggcttcccc cctctggcag 121 cagacgccat ggtgccgagg tggcctccac aaccgccctg tgcgccaata ggacaagact 181 gtcctccctc ccccacactt gtcactttga gggacacgtg gatgagacag gaaaacacag 241 gggagtgtgg agacctgagg tgacttggag caagcctctc aacctgagcg gcaatttctt 301 catctgtaaa atgagggggt tgttctcatc tctgaggctt tgtgtcgctc tcaaagcctg 361 ctagcctcgg gttctaggac tctgttggga tcgtgtgtga tgttttctgc tgagcgacgg 421 cagcctgtgt cctcgggggg aaagagggca ggcgctccaa agctcctgcg ctctgtggct 481 ccccctccct cgcagcccca agccccaggt gtgccggccg ccctgagccc ctccagcacc 541 tcccggaggc gcctgcaaga cacctaaggt ccccgcctcc ctcctctccc ccccgccaca 601 cccctacccc cggcaggcga cgtccccgcc cctcgaccat ggcctggtga agaagccggc 661 caggcccgat cagccccatc cccgccgcac gagcggcgcc tgcggacagc tcctggggcc 721 ccggccttgt cactccggag gcgggaggct ccggggggtc gggctgggaa gatcgagccg 781 gaggccgcta ggctcccagg ccccggccga ggctgcgcgg ccgcacggtg ggcaggctcg 841 ggtgttccgg caaactgccg ggtccccatc ttcaaaagag aggaggccct ttctccagct 901 tcctctgcgg gagcccgacc cagccccatc ccgccacccc cgggctgcac ctcggcccct 961 ccccggcccg cgcccctgcc cggggcgggc caggaacctc ggcccgcgcc gctggggact 1021 ttggagcgga ggaggaagcg cggcggggcg ggggcggggg tgtgtcgggt tttataaccc 1081 gcagggcggc cgcggcgcag gagaaggggc agagccgagc accgcgcacc gcgtcatggg 1141 g // LOCUS MURU3REG 492 bp ss-RNA VRL 04-JUL-1990 DEFINITION Murine retrovirus SL3-2 U3 region. ACCESSION M34383 KEYWORDS long terminal repeat. SOURCE Murine retrovirus SL3-2, cDNA to viral RNA. ORGANISM Murine retrovirus SL3-2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses. REFERENCE 1 (bases 1 to 492) AUTHORS Dai,H.Y., Etzerodt,M., Baekgaard,A.J., Lovmand,S., Jorgensen,P., Kjeldgaard,N.O. and Pedersen,F.S. TITLE Multiple sequence elements in the U3 region of the leukemogenic murine retrovirus SL3-2 contribute to cell-dependent gene expression JOURNAL Virology 175, 581-585 (1990) STANDARD simple staff_review BASE COUNT 158 a 136 c 114 g 84 t ORIGIN 1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc 61 atgggaaaaa taccagagct gatgttctca gaaaaacaag aacaaggaag tacagagagg 121 ctaaaaagta ccgggactag ggccaagaac agatggtccc cagaccgcta actgcagtaa 181 cgacaggata tctgtggtta agcactaggg ccccggccca gggccaagaa cagatggtcc 241 ccagaccgct aacgacagga tatctgtggt taagcactag ggccccggcc cagggccaag 301 aacagatggt ccccagaaat agctaaaaca acaacagttt caagagaccc agaaactgtc 361 tcaaggttcc ccagatgacc ggggatcaac cccaagcctc atttaaacta accaatcagc 421 tcgcttctcg cttctgtacc cgcgcttatt gctgcccagc tctataaaaa gggtaagaac 481 cccacactcg gc // LOCUS MUSACEX 1299 bp ds-DNA ROD 04-JUL-1990 DEFINITION Mouse angiotensin-converting enzyme (ACE) gene, 5' end. ACCESSION M34433 KEYWORDS angiotensin-converting enzyme. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1299) AUTHORS Shai,S.-Y., Langford,K.G., Martin,B.M. and Bernstein,K.E. TITLE Genomic DNA 5' to the mouse and human angiotensin-converting enzyme genes contains two distinct regions of conserved sequence JOURNAL Biochem. Biophys. Res. Commun. 167, 1128-1133 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1294 > 1299 angiotensin-converting enzyme BASE COUNT 262 a 393 c 369 g 275 t ORIGIN 1 ggatcctctt tctgaactcc ataggtacca ctctcagaat cccctcctaa ccctccacca 61 gtacagcctg cttcctccag aaacacaaca gctcacacaa acccacagcc ctcaaatcct 121 gggagcccaa gcaatatgga ccacctaagt tctcagcagg tggccccttg ctccttcctc 181 ctttctccat ctctacccgt cccctgggac ctctttggga gcaagtcacc tgacaggcca 241 gcacagtctc actctgagtc cttgcaggtc acacaatctc ccatgtgaac tccatcaccc 301 catctggcaa ggacaagggt ccgcccacct ctttgggggc acagccctgg tgctcagagg 361 ttaacttcac aggctgctga tgcatttaga ggacctgatt ctccttcccc caggcctgtc 421 actttgaggg acgggtggag gagacaggga aatgactggg agtgtggaga cctgaggtga 481 cttgaagcca atctgtctca ggggtaattt ctccattttt aaaaagcagg gggttgttgt 541 aatctctaag gcacattttt ttttagcttt cagagcgtgc aactgcgagg ttctagggct 601 ctgttgggac cacacatgat ggttcttgtg agttgaacag ctgtgtatcc aaagagtagg 661 ggcaggcatg ccagtgctga tggcctagtc tatggctccc cctcatagag cagaatagac 721 tctatttcga ggcatcccaa caaccgaagc ccaaggtgaa ctggttgctg acatctgcag 781 cacctagcca aggcgcctgt caggacagct aaacttcccg cgcccacgca ggcgacccct 841 ttcctccatg acctggcaac cttaagtcca tcccggtggc accagcctgc cacccccgcc 901 cccttgtcac tcaagagccc ggaaaggctt ggagggttat gccaagcctg ctgggctctg 961 tggcctgaga ctacgtggtg gcggccagag ggcacggttg ggccgcagca gtgtttgcag 1021 cgggcaggga cccttttcta gcttcctctg agagagcccg atctagcccc atttctccat 1081 ccaggctgct ccctggcccc acagggctct ccttgcccag ggcgggcggg gaatcagccg 1141 ggttcgcgct gccggggact ttggagcgag gaggaagcgc ggaggggcgg ggaggtgggg 1201 gtgtgtcggg aggcggtggt ggctgggttt tataatccgc agggcggtcg cggcacgaga 1261 gaaggggcag cgccgcaccg cgcgcaccgc gccatgggg // LOCUS MUSOCT3 1324 bp ss-mRNA ROD 04-JUL-1990 DEFINITION Mouse octamer binding transcription factor (oct-3) mRNA, complete cds. ACCESSION M34381 KEYWORDS octamer binding transcription factor. SOURCE Mouse embryo carcinoma cell line P19, cDNA to mRNA, clone lambda-C1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1324) AUTHORS Okamoto,K., Okazawa,H., Okuda,A., Sakai,M., Muramatsu,M. and Hamada,H. TITLE A novel octamer binding transcription factor is differentially expressed in mouse embryonic cells JOURNAL Cell 60, 461-472 (1990) STANDARD simple staff_review FEATURES from to/span description pept 50 1183 octamer binding transcription factor (oct-3) mRNA 1 1324 oct-3 mRNA BASE COUNT 297 a 361 c 400 g 266 t ORIGIN 1 gtgagccgtc tttccaccag gcccccggct cggggtgccc accttcccca tggctggaca 61 cctggcttca gacttcgcct cctcaccccc accaggtggg ggtgatgggt cagcagggct 121 ggagccgggc tgggtggatt ctcgaacctg gctaagcttc caagggcctc caggtgggcc 181 tggaatcgga ccaggctcag aggtattggg gatctcccca tgtccgcccg catacgagtt 241 ctgcggaggg atggcatact gtggacctca ggttggactg ggcctagtcc cccaagttgg 301 cgtggagact ttgcagcctg agggccaggc aggagcacga gtggaaagca actcagaggg 361 aacctcctct gagccctgtg ccgaccgccc caatgccgtg aagttggaga aggtggaacc 421 aactcccgag gagtcccagg acatgaaagc cctgcagaag gagctagaac agtttgccaa 481 gctgctgaag cagaagagga tcaccttggg gtacacccag gccgacgtgg ggctcaccct 541 gggcgttctc tttggaaagg tgttcagcca gaccaccatc tgtcgcttcg aggccttgca 601 gctcagcctt aagaacatgt gtaagctgcg gcccctgctg gagaagtggg tggaggaagc 661 cgacaacaat gagaaccttc aggagatatg caaatcggag accctggtgc aggcccggaa 721 gagaaagcga actagcattg agaaccgtgt gaggtggagt ctggagacca tgtttctgaa 781 gtgcccgaag ccctccctac agcagatcac tcacatcgcc aatcagcttg ggctagagaa 841 ggatgtggtt cgagtatggt tctgtaaccg gcgccagaag ggcaaaagat caagtattga 901 gtattcccaa cgagaagagt atgaggctac aggacacctt tcccaggggg ggctgtatcc 961 tttcctctgc ccccaggtcc ccactttggc accccaggct atggaagccc ccacttcacc 1021 acactctact cagtcccttt tcctgagggc gaggcctttc cctctgttcc cgtcactgct 1081 ctgggctctc ccatgcattc aaactgaggc accagccctc cctggggatg ctgtgagcca 1141 aggcaaggga ggtagacaag agaacctgga gctttggggt taaattcttt tactgaggag 1201 ggattaaaag cacaacaggg gtggggggtg ggatggggaa agaagctcag tgatgctgtt 1261 gatcaggagc ctggcctgtc tgtcactcat cattttgttc ttaaataaag actggacaca 1321 cagt // LOCUS PP1SIM 1900 bp ds-DNA PHG 04-JUL-1990 DEFINITION Bacteriophage P1 sim region proteins, complete cds. ACCESSION M34382 KEYWORDS sim region. SOURCE Bacteriophage P1 viral DNA. ORGANISM Bacteriophage P1 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 1900) AUTHORS Maillou,J. and Dreiseikelmann,B. TITLE The sim gene of Escherichia coli phage P1: Nucleotide sequence and purification of the processed protein JOURNAL Virology 175, 500-507 (1990) STANDARD simple staff_review FEATURES from to/span description pept 470 718 sim region ORF 1 pept 791 1570 sim region ORF 2 BASE COUNT 636 a 323 c 360 g 581 t ORIGIN 1 ccccatcacg ccccaccata attcgggcgt aacgcggttt acggacacaa aaataccgca 61 atatcggaaa tctgcggttg tccgcactaa cattcaggct gtcaaaaccg gtcgcagaat 121 ttgctacgac ggtggaacta taagcctgaa cgattaaaag gtcaatatga tgcgaaaaga 181 tagcattcgc gacttaaaaa tacaaattta ttagagcatt atttgtttaa taaatacaca 241 gttggatcta ataacctctt ttttttaaag gcgaaaatat gtaccctaaa tgagttataa 301 ggcaggtgag gttataatga gaaaactatt actaccgtta ttatttatgg ctgggactgt 361 taatgcagca tcaagcgtaa aggagatttg taccgattat acgaaatacc ttgggcacgt 421 ttacgccttt gcgatcaatg actattaatc cagtgctagg agtggattta tgctgattcg 481 tttgttttta gtgctttcct ttttaacatt taatgttttt gctgatgaag ttgacttttc 541 gaaggtagat tgcaattcag tggaaacaag aaaagctctt attgaagaat ataacgaaat 601 attatcgtca tatggaataa cagtggttga ttcttataat caaaaaacta ttcagaaagg 661 aataaataaa ctggtctgtt atggggttta ccaatattca gatggctctt cggagtagtt 721 tatttataag catacccaaa tgtctgtgaa ttaattagtg agtttaaacc gattaatgag 781 tgaatggaaa atgaaattat ttaatgtaat aacattttgt tgtgctattt ttgctggaag 841 cgcgatagct gataataaat tgccagattg gctttctacc tcaaagaaag attatgattt 901 agtaagggca ttctatttgt ctggatttgc ttcgaaagca atgaacaatc aatttggtta 961 tcatttgcca tctgagttgg ttaatgattt taaagataat gaatttgctg ctcaggaaaa 1021 atggaacaca attccaattg tgtatggtga aataaaatcc ataagaatgg tgaataataa 1081 accaattgta gaattattta ctccagggga aaatgcaacg cctataaatt atatcaaatt 1141 gaaaatattg gattcaaagc aagactctct gttaaaacta aaaaaagggg atgatatata 1201 tgcagtgtgc tccggtgcta attttagctt agtgccaatt ctgagcaact gcactccagc 1261 aacagacgtc attgatgctg cactctcttt ttctggtgaa tatatgttcc ctgcttttga 1321 ttctttttca cctactaagc aaaacgtcaa atatatattc acaaatcaag atcctgttca 1381 gatgatgaat tttatagggt acctatcctt agtcgataca acgaaggata agaataaaat 1441 ggatatggtt cgtaagtgta cgccttggaa gccggaatgt tcacaacaat tcgttgatgt 1501 aatggaagga tttgatagca tcatgtataa atatgaggga gaatttaaaa actacataga 1561 attaaaatag tgtttgaaag gatagtcaat ttaagtttta acaccaacgc ccattaaagg 1621 gcttttattg ttttactcaa aacaacctga ttagtgtata accatatcat atccctcatt 1681 ccacctacac tgattacccc cagacaacaa tattcctact caatgaacaa atgactactc 1741 gtagaatcgg ttaacacacc agattctacg aggtttcaat gacaccacga caattactcg 1801 aagacgtcaa aacccgcttc acacctttga ttgcggatga accggcctta ctggaatccc 1861 tgctaagaaa agcattggga acctaccagg atagggcggg // LOCUS PSEPST1 1446 bp ds-DNA BCT 04-JUL-1990 DEFINITION Plasmid pPST1 (from Pseudomonas stutzeri) ORF1 gene, complete cds. ACCESSION M34385 KEYWORDS . SOURCE Plasmid pPST1 (from Pseudomonas stutzeri, strain MO-19) DNA. ORGANISM Pseudomonas stutzeri Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1446) AUTHORS Fujita,M., Kubota,M., Futai,M. and Amemura,A. TITLE Identification and DNA sequencing of a new plasmid (pPST1) in Pseudomonas stutzeri MO-19 JOURNAL Plasmid 22, 271-274 (1989) STANDARD simple staff_review FEATURES from to/span description pept 576 1034 ORF1 BASE COUNT 267 a 446 c 425 g 308 t ORIGIN 1 gcatgccatc gacccccgtc aataccttga gcacatgctg cgtccagcgc gtagccatcc 61 acatctgctc cgggctggcg taggcgtaaa cctgcttggc cttggcgttc tgcggcttga 121 gttccagttc tgcccgaacg tggtcaggat cgcccccggt ctgcttgccc ttttcgtaga 181 tgcgcttcat ggctgccgag gtgcgcgacc cgatgtagag cgtccgcccg tcttgctcgc 241 ggtgagagtc gcctacatgc gtgaccttga ggccgaactt gtctgccgtc tccagaccca 301 gcgcgtagag agagtcccac gcccccggct cgcagtagtc gatagtcacg tcagccctga 361 gtaggttgtg ccccctgaac tcatcccgaa cgacagcagc aaaggcgggt gcccggtcgc 421 cgcttgcaga ggcccatacg cgcgtcccga cggtgttacc gccccactgc acccgtgcaa 481 gcacagaatc gcctctgacg acgttatagg cccgttcata gccatttttc ggggtggtcg 541 gcgttacgtc cgaacgactc gaaagagccg gccagatgat caaggaaccc ggtcgggttt 601 ggcctcgatg ctggcggtgt agtagtcgaa cctcattttt cacccccaga ggctctagag 661 cgcttcgcgg ggtttttttg gctcggtgtg taggggggtt aggtgttgtg acccccgtag 721 ttacctatgc gggggttggg cggtctgcaa acatgatcca tgagagcgac gcaaaggccc 781 gagcatctcg ttcatgtgag cctctccaga ctcaaggcct agagaccatc ccggccctag 841 attcaacgtg ccagcgcccg ggctttctgg caggcccgga gaaccctttg ttggacaggg 901 ttaaccatag tcaggaactc cgagacaaga cccgacatcg tgacgccctg aacctcagaa 961 agacccctga tcgtgtcgta cacgtcctga tcgagcgtaa ccgtgattct cggtttctgt 1021 gtggccattg gtgatgcact ccgggcaaag tgatgcaccg aatgtacgca gccgaaggcg 1081 aaaggcaacc cgtgcgaatt gtttacatgt gcgcatgagc gcatgagcgc atgtgatttg 1141 tgtgcatgtg atttgtatgc atgtgcgcat actggcgcgc tacgcttgcc cgccgagccg 1201 agttgccggc tcgtaccggc gctacgctcc cggcccgatc cgggcacctc gacccggccc 1261 gacgaatttg tccgcgccgg tggacttatt tgtccgacac tagccgagca acgaacagcc 1321 cgaacgactc aatatcgccg cccagtgcct ccagcttgcg gcaatagttg ccgtactgct 1381 tagccatgaa ctcaagagcg cgctcatcat cggtaagctg gcggatcgtt ccaggcggtg 1441 ctggct // LOCUS BNACRUCA 1686 bp ss-mRNA PLN 04-JUL-1990 DEFINITION B.napus cruciferin mRNA, 3' end. ACCESSION J05233 KEYWORDS 12S storage globulin; cruciferin; seed development protein. SOURCE B.napus (strain Svaloefs Karat) seed, cDNA to mRNA, clone pCRU1. ORGANISM Brassica napus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 1686) AUTHORS Roedin,J., Ericson,M.L., Josefsson,L.-G. and Rask,L. TITLE Characterization of a cDNA clone encoding a Brassica napus 12 S protein (cruciferin) subunit: Relationship between precursors and mature chains JOURNAL J. Biol. Chem. 265, 2720-2723 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 1521 cruciferin precursor (AA at 1) sigp < 1 60 cruciferin signal peptide matp 61 948 cruciferin alpha-subunit matp 949 1518 cruciferin beta-subunit signal 1543 1548 polyA signal signal 1671 1676 polyA signal BASE COUNT 462 a 455 c 415 g 354 t ORIGIN 1 ccttgcgaga cagccgtcgc aacgttcggg gttctcctcg tcctcaacgg ctgtctcgca 61 aggcagtcgc taggggttcc tcctcagcta gggaacgcgt gtaacctcga taacttagac 121 gttctccagc ctaccgaaac tatcaagagc gaggctggtc gggtcgagta ctgggatcac 181 aacaatcctc agatccgatg tgctggtgtc tctgtctctc gtgttataat cgaacaaggc 241 ggtctctacc ttcctacctt cttcagctcc cccaaaattt caatcgttgt tcaaggaatg 301 ggtattagcg gaagagtggt ccctggatgc gcggaaacct tcatggactc gcagcctatg 361 caaggacaac aacaaggtca accatggcag ggacaacaag gacaacaggg tcagcaggga 421 caacaaggtc aacagggtca gcagggacaa caaggtcaac agggtcagca gggtcaacag 481 ggacagcagg gtcagcagca gcaagggttc cgtgacatgc accagaaggt cgaacatgtt 541 cgacatggag acatcattgc cattactgca ggctcttccc attggatcta caacaccggt 601 gaccagccac ttgtcattat ctgccttctc gacattgcca actaccaaaa ccaactcgac 661 cgcaacccaa gaacgttccg tctggccgga aacaacccac agggcggttc ccagcagcag 721 cagcaacaac aacagaacat gttgagcggg ttcgaccctc aggtcctagc ccaggcattg 781 aaaatcgacg ttaggttggc tcaggagctt cagaaccaac aagacagcag aggaaacatc 841 gttcgtgtta agggaccttt ccaggttgtg aggccgcctc ttagacagcc atacgagagt 901 gagcagtgga gacacccccg tggcccacca caaagcccac aagacaacgg cttggaggag 961 actatctgca gcatgaggac ccacgagaac attgatgacc cagcccgtgc tgacgtgtat 1021 aagcccaacc tcggccgtgt gactagcgct aacagctaca ctttacccat cttgcagtat 1081 atcagactca gcgccacccg tggcattctc cagggtaatg cgatggtgct tccgaaatac 1141 aacatgaacg cgaacgagat cttgtactgc actcaaggac aagcaaggat tcaagtggtg 1201 aacgacaacg gacagaacgt gctggaccag caggtgcaga agggacagct cgtggtcatc 1261 ccacaaggat tcgcctatgt tgtccagtcc caccaaaaca acttcgaatg gatttctttc 1321 aagacaaacg ctaacgcgat ggtcagcact ttggccggta gaacctcggc cttgagggca 1381 ttgccactag aggtcataac caacgctttc caaatttctc tcgaggaagc tagaaggatc 1441 aagttcaaca cgcttgagac cactttgact cgtgcgcgcg gtggacaacc ccagttgatc 1501 gaggagatag tcgaggctta agttaaaacg tttactttta ctaataaagt agtgacatgg 1561 ttactattgt aatggtcagt ttgtaatcat gtccactcta agtttttaac gtatgtgtaa 1621 aatatgtgtc taaagaacac gccggcacgt ctcttgtatg taacccttct aataaatacc 1681 cccccc // LOCUS BS2ORIA 2279 bp ds-DNA BCT 04-JUL-1990 DEFINITION Plasmid pBS2 origin of replication. ACCESSION M34386 KEYWORDS origin of replication. SOURCE Plasmid pBS2 DNA. ORGANISM Plasmid pBS2 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 2279) AUTHORS Darabi,A., Forough,R., Bhardwaj,G., Watabe,M., Goodarzi,G., Gross,S.C. and Watabe,K. TITLE Identification and nucleotide sequence of the minimal replicon of the low-copy-number plasmid pBS2 JOURNAL Plasmid 22, 281-286 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 195 470 ORF1 pept 1139 1948 ORF2 binding 169 177 ribosomal binding site (put.) binding 1110 1117 ribosomal binding site (put.) orgrpl 865 1008 origin of replication DNA BASE COUNT 699 a 343 c 562 g 675 t ORIGIN 1 bp upstream of HindIII site. 1 aagctttgtt gaatatgggg aatagttata accagatggg aagccttttt tcggctgtac 61 cttattatca caaggctata aaggcagcta aaataagcgg agcgaaggaa attacccagg 121 cttattatga tttagctctg attcatttta ggaacaatga aaagattgaa ggacgcggat 181 tctttggaaa agcaatggag caagctgaag aatttaacga tagtctgttt caggatttat 241 taaatgttct aaaggcatta tttattgaga ctggaagcag gcaaaaggtc atgaatgccc 301 ttgaggcgtt gagaacggga caaggctatc catattttga agagctagct ttaatcgctg 361 cagaatttta tactatggat aagcgcatgg aagattctat atatttttac aatgagatgg 421 tgtgcgctca aagacaaatt caaaggggcg atttcctgta tgaagtttaa aggtcttttt 481 tcagcggtcc ttattgtaag cctgttggtt ggagcggggt attctttcgt gcaccatgat 541 gaggtttcag ttgcttcaag gaatgcaaca tgattaagaa taaagaatcg agatttacca 601 cttaaagata tggaaaacag atcgttattg atctgttttt ttctttgtgt gctgaccatg 661 cgggcatgat gcggagataa cgctgggtgc tgcggagata atgcgggcat gggtgcgctg 721 atcgcggaga ctgaaaaggc aaaatttcaa aaaaatctcc cccctacggg ggggaagaat 781 ggttttgatc ttttgttttg ggttttaaaa aagccggctg ttttcagccg gtactttttc 841 gattttggcg aagccgaaat cgggtctttt cttatcttga tactatatag aaacaacatc 901 atttttcaaa atcagttcaa agtcttgtgt gtcaagggtt agatcggttt ttgacaggta 961 aaaactcctt ctgctattat taaggtgtcg aatcaaaata atagaatgct agagaactag 1021 ctcagaagga gtttttttgt tgatttattc atctgaaaac gattatagca tcctcgaaga 1081 taaaaccgca acaggtaaaa agcgggattg gaaggggaaa aagagacgta cgaacctcat 1141 ggcggagcat tacgaagcat tagagagtaa gattggggca ccttactatg gcaaaaaggc 1201 tgaaaaacta attagttgtg cagagtatct ttcgtttaag agagacccgg agacgggcaa 1261 gttaaaactg tatcaagccc atttttgtaa agtgaggtta tgcccgatgt gtgcgtggcg 1321 caggtcgtta aaaattgctt atcacaataa gttgatcgta gaggaagcca atcggcagta 1381 cggctgcgga tggatttttc tcacgctgac ggttcggaat gtaaagggag aacggctgaa 1441 gccacaaatt tctgagatga tggaaggctt taggaaactg ttccagtaca aaaaagtaaa 1501 aacttcggtt cttggatttt tcagagcttt agagattacc aaaaatcatg aagaagatac 1561 ataccatcct cattttcatg tgttgatacc agtaaggaaa aattattttg ggaaaaacta 1621 tattaagcag gcggagtgga cgagcctttg gaaaaaggcg atgaaattgg attacactcc 1681 gattgttgat attcgtcgag tgaaaggtaa agccaagatt gacgctgaac tgattgaaaa 1741 cgatgtgcgg gaagcaatga tggagcaaaa agctgttctc gaaatttcta aatatccggt 1801 taaggatacg gatgttgtgc gcggcaataa ggtgactgaa gacaatctga acacggtgct 1861 ttacttggat gatgcgttgg cagctcgaag gttaattgga tacggtggca ttttgaagga 1921 gatacataaa gagctgaatc ttcggtgatg cggaggacgg cgatctggtc aagattgagg 1981 aagatgatga cgaggttgca aatggtgcat tgaggttatg gcttattggc atcctggcat 2041 taaaaattac ataatcaaat aaaaaaagca gacctttaag gcgtgctttt tttaactaac 2101 ccatttgtat tgtgttgaaa tatgttttgt tcgtttttta tctttgtagc tttttatgat 2161 taattgtcta tgataaccta gggaatcttt acattgtctg tgttgaatag ttacataaac 2221 tggtttgctc ttaggatagt ttaaatatac tgttgtcatt gctccgaaaa caaaagctt // LOCUS ECOCYSBA 1407 bp ds-DNA BCT 04-JUL-1990 DEFINITION E.coli cysteine regulatory protein (cysB) gene, complete cds. ACCESSION M34332 KEYWORDS cysB gene; cysteine regulatory protein. SOURCE E.coli (mutant strain CB64) DNA, clone pLB1. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1407) AUTHORS Tei,H., Watanabe,K., Murata,K. and Kimura,A. TITLE Analysis of the Escherichia coli K-12 cysB gene and its product using the method of gene fusion JOURNAL Biochem. Biophys. Res. Commun. 167, 962-969 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 349 1323 cysteine regulatory protein (cysB) BASE COUNT 378 a 312 c 326 g 391 t ORIGIN 1 ggatccggaa ggtaataaga ccatggttcg ctttagccgt aaaaccaaac agcaatatgt 61 ctcttcggaa aaagacggaa aggcgactgg ctggtcagca ttttatgttg atggcaaatg 121 ggttgaagga aaaaaataac ctttaattct gtcaggtttt tataaacaaa gggtcgcgaa 181 agcggccctt ttttattgca tattattttt tcttcacacc tatacactaa ggctataaat 241 gatatagtgg ttatagttag cacctttttt attattaaat cgtattagtc acccgccagg 301 tgtgacgaaa aaacgatgtt ctgatggcgt ctaagtggat ggtttaacat gaaattacaa 361 caacttcgct atattgttga ggtggtcaat cataacctga atgtctcatc aacagcggag 421 ggactttaca catcacaacc cgggatcagt aaacaagtca gaatgctgga agacgagcta 481 ggcattcaaa ttttttcccg aagcggcaag cacctgacgc aggtaacgcc agcagggcaa 541 gaaataattc gtatcgctcg cgaagtcctg tcgaaagtcg atgccataaa atcggttgcc 601 ggagagcaca cctggccgga taaaggttca ctgtatatcg ccaccacgca tacccaggca 661 cgctacgcat taccaaacgt catcaaaggc tttattgagc gttatcctcg cgtttctttg 721 catatgcacc agggctcgcc gacacaaatt gctgatgccg tctctaaagg caatgctgat 781 ttcgctatcg ccacagaagc gctgcatctg tatgaagatt tagtgatgtt accgtgctac 841 cactggaatc gggctattgt agtcactccg gatcacccgc tggcaggcaa aaaagccatt 901 accattgaag aactggcgca atatccgttg gtgacatata ccttcggctt taccggacgt 961 tcagaactgg atactgcctt taatcgcgca gggttaacgc cgcgtatcgt tttcacggca 1021 acggatgctg acgtcattaa aacttacgtc cggttagggc tgggggtagg ggtcattgcc 1081 agcatggcgg tggatccggt cgccgatccc gaccttgtgc gtgttgatgc tcacgatatc 1141 ttcagccaca gtacaaccaa aattggtttt cgccgtagta ctttcttgcg cagttatatg 1201 tatgatttca ttcagcgttt tgcaccgcat ttaacgcgtg atgtcgttga tgcggctgtc 1261 gcattgcgct ctaatgaaga aattgaggtc atgtttaaag atataaaact gccggaaaaa 1321 taatttcttg cgttattttc ggcacctttt atgtagcgaa ggtgccggaa tatattctct 1381 tttgttactt atttctatac cgtgcac // LOCUS HUMARHIII 60 bp ds-DNA BAD 04-JUL-1990 DEFINITION Synthetic DNA encoding angiogenin/RNase hybrid (ARH-III), partial cds. ACCESSION M34329 KEYWORDS . SOURCE Human DNA, clone pAng2. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 60) AUTHORS Bond,M.D. and Vallee,B.L. TITLE Replacement of residues 8-22 of angiogenin with 7-21 of RNase a selectively affects protein synthesis inhibition and angiogenesis JOURNAL Biochemistry 29, 3341-3349 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 60 angiogenin/RNase protein (AA 4-23) (AA at 1) BASE COUNT 17 a 18 c 13 g 12 t ORIGIN 1 tcgaggtata caaaattcga acgccagcac atggactctt cgacaagcgc tgcctccgat // LOCUS HUMIGIIDO 1914 bp ss-mRNA PRI 04-JUL-1990 DEFINITION Human interferon-gamma-inducible indoleamine 2,3-dioxygenase (IDO) mRNA, complete cds. ACCESSION M34455 KEYWORDS indoleamine 2,3-dioxygenase. SOURCE Human diploid fibroblast cell line FS-4, cDNA to mRNA, clone C42. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1914) AUTHORS Dai,W. and Gupta,S.L. TITLE Molecular cloning, sequencing and expression of human interferon- gamma-inducible indoleamine 2,3-dioxygenase cDNA JOURNAL Biochem. Biophys. Res. Commun. 168, 1-8 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 640 1851 indoleamine 2,3-dioxygenase (IDO) (EC 1.13.11.17) BASE COUNT 563 a 396 c 427 g 528 t ORIGIN 1 tgagaagggc aaatgctatc attggaaaaa ctgacaaaag tcccaatagg aaaaataagg 61 aagtggagag ttactatgtt tctaattttt catgtgcttc tatttttttc ctacttcaga 121 gccattgact aatagttgag tataacacag gttgtgtttc cgggctgctg aaacatgaca 181 ctaatatttt caaagaactg tggaagccta aaaggaagcc aatgagaaat aactaaatga 241 gagtttagga ctgcagcctt cattttcatt caaagattta aaagtttcca taaagtaaaa 301 tgttcttctc cggccacctg ttttcatagt tctgtgtttt ccttcaggcc tttctggctt 361 cctatatggc agtaagaaaa tgatgtgctt aatgattaca aatttcatat ggaatacgaa 421 ctttcagttt gtacatatga tgcacagaga tgcttttgtg gttttattgg ttttcatatt 481 acaaacaaag aaactagaaa atgaaaccat tccaaaagtg gaagtaattt ctcactgccc 541 ctgtgataaa ctgtggtcac tggctgtggc agcaactatt ataagatgct ctgaaactct 601 tcagacactg agggggcacc agaggagcag actacaagaa tggcacacgc tatggaaaac 661 tcctggacaa tcagtaaaga gtaccatatt gatgaagaag tgggctttgc tctgccaaat 721 ccacaggaaa atctacctga tttttataat gactggatgt tcattgctaa acatctgcct 781 gatctcatag agtctggcca gcttcgagaa agagttgaga agttaaacat gctcagcatt 841 gatcatctca cagaccacaa gtcacagcgc cttgcacgtc tagttctggg atgcatcacc 901 atggcatatg tgtggggcaa aggtcatgga gatgtccgta aggtcttgcc aagaaatatt 961 gctgttcctt actgccaact ctccaagaaa ctggaactgc ctcctatttt ggtttatgca 1021 gactgtgtct tggcaaactg gaagaaaaag gatcctaata agcccctgac ttatgagaac 1081 atggacgttt tgttctcatt tcgtgatgga gactgcagta aaggattctt cctggtctct 1141 ctattggtgg aaatagcagc tgcttctgca atcaaagtaa ttcctactgt attcaaggca 1201 atgcaaatgc aagaacggga cactttgcta aaggcgctgt tggaaatagc ttcttgcttg 1261 gagaaagccc ttcaagtgtt tcaccaaatc cacgatcatg tgaacccaaa agcatttttc 1321 agtgttcttc gcatatattt gtctggctgg aaaggcaacc cccagctatc agacggtctg 1381 gtgtatgaag ggttctggga agacccaaag gagtttgcag ggggcagtgc aggccaaagc 1441 agcgtctttc agtgctttga cgtcctgctg ggcatccagc agactgctgg tggaggacat 1501 gctgctcagt tcctccagga catgagaaga tatatgccac cagctcacag gaacttcctg 1561 tgctcattag agtcaaatcc ctcagtccgt gagtttgtcc tttcaaaagg tgatgctggc 1621 ctgcgggaag cttatgacgc ctgtgtgaaa gctctggtct ccctgaggag ctaccatctg 1681 caaatcgtga ctaagtacat cctgattcct gcaagccagc agccaaagga gaataagacc 1741 tctgaagacc cttcaaaact ggaagccaaa ggaactggag gcactgattt aatgaatttc 1801 ctgaagactg tgagaagtac aactgagaaa tcccttttga aggaaggtta atgtaaccca 1861 acaagagcac attttatcat agcagagaca tctgtatgca ttcctgtcat tacc // LOCUS RATRPL35AA 451 bp ss-mRNA ROD 04-JUL-1990 DEFINITION Rat 60S ribosomal subunit protein L35 mRNA, complete cds. ACCESSION M34331 X51705 KEYWORDS ribosomal protein. SOURCE Rat, cDNA to mRNA, clone pL35-[6,9]. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 451) AUTHORS Suzuki,K., Olvera,J. and Wool,I.G. TITLE The primary structure of rat ribosomal protein L35 JOURNAL Biochem. Biophys. Res. Commun. 167, 1377-1382 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 48 419 ribosomal protein L35 mRNA < 1 451 ribosomal protein L35 mRNA BASE COUNT 138 a 126 c 121 g 66 t ORIGIN 1 tcctctttct ctcgaaccgg gcggcctcat ccgcgaacag cgcagcaatg gccaagatta 61 aggctcggga cctgcgcggc aagaagaagg aggagctgtt gaaacaactg gatgatctga 121 aggtggaact gtcccagctt cgcgtggcca aagtgacagg cggcgccgcg tccaagctct 181 ccaagatacg agtcgtacgc aaatccatcg cccgtgtcct cactgtcatt aatcagactc 241 aaaaggaaaa cctcaggaaa ttctacaagg gaaagaagta caagcccctg gacctgcgac 301 ccaagaagac aagagccatg cgccgccggc tcaccaagca tgaagagaag ctgaagacca 361 agaagcagca gcggaaggag cggctgtacc cactgcgcaa gtacgcagtc aaggcctgag 421 acgacgacaa taacaataaa gtccaaaact g // LOCUS SCMIMP23A 1058 bp ss-mRNA INV 04-JUL-1990 DEFINITION S.mansoni 23 kDa integral membrane protein (Sm23) mRNA, complete cds. ACCESSION M34453 KEYWORDS integral membrane protein. SOURCE S.mansoni (strain Puerto Rican), cDNA to mRNA, clone 3.7.2. ORGANISM Schistosoma mansoni Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda; Digenea; Strigeidida; Strigeata; Schistosomatoidea; Schistosomatidae. REFERENCE 1 (bases 1 to 1058) AUTHORS Wright,M.D., Henkle,K.J. and Mitchell,G.F. TITLE An immunogenic Mr 23,000 integral membrane protein of Schistosoma mansoni worms that closely resembles a human tumor-associated antigen JOURNAL J. Immunol. 144, 3195-3200 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 16 672 23 kDa integral membrane protein (Sm23) mRNA < 1 1058 23 kDa integral membrane protein mRNA BASE COUNT 269 a 189 c 219 g 381 t ORIGIN 1 gtcccgtttg aaagtatggc aacgttgggt actggtatgc gttgtctaaa aagttgcgta 61 ttcgttttga acatcatctg tctgttatgt tccctggtat taattggggc tggtgcctat 121 gtggaagtca agtttagtca atatggagac aatttacaca aagtctggca ggcagcacct 181 atcgctatta ttgttgtcgg agtaataatc ctgatagtga gctttctggg ttgttgtgga 241 gccataaagg aaaacgtctg catgctgtat atgtatgcgt tcttcctcgt cgtacttctg 301 attgctgagt tggccgctgc catagttgca gttgtttaca aagataggat cgattcagaa 361 attgatgcat taatgactgg tgctctggat aaaccgacaa aagagataac tgaattcatg 421 aatttgattc aatcatcatt ccattgctgt ggagccaaag gccctgatga ttatcgtggt 481 aacgtgccag cctcgtgcaa agaagagaat ttgacctata ccgaaggctg tgtatctgtc 541 tttggagcat tcttaaaacg caacctggtg attgttgcct gtgttgcatt tggtgtatgc 601 tttttccagt tgttaagcat tgtcatagcc tgctgtttgg gtcgccaaat aaaggagtat 661 gagaatgtgt aaacactgca aagaaaacag tatgatgttt atcaatcccg ttctgttttc 721 cctctggctt ttatgaaatg ttgctttttt attgcctaga taattgtgcc ttggctaata 781 atcgtgtact cgacttcgtt tacgatataa ttattatact taaatatgat actgtcattc 841 tcattgtttc acttatcatt cttataccaa atgagctgca ttgattccct ggtatttacc 901 tacaaactga tgacaaatca tcttcatttt tatggtgtgt cttataacat ttctcgttta 961 tataatttta agtctttggt ataccttttg ttttacagac cgcttatttt gccgccacag 1021 cattagacat ttcttgtata aattaattcc gttaactt // LOCUS STMMELCLAC 63 bp ds-DNA BAD 04-JUL-1990 DEFINITION S.antibioticus melC-lacZ translational fusion protein mRNA, 5' end. ACCESSION M34505 KEYWORDS . SOURCE S.antibioticus DNA, clones pIF41[3,5]. ORGANISM Streptomyces antibioticus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 63) AUTHORS Tseng,H.-C., Lin,C.-K., Hsu,B.-J., Leu,W.-M., Lee,Y.-H.W., Chiou,S.-J., Hu,N.-T. and Chen,C.W. TITLE The melanin operon of Streptomyces antibioticus: Expression and use as a marker in gram-negative bacteria JOURNAL Gene 86, 123-128 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 63 lacZ-melC fusion protein pept 49 > 63 melC protein BASE COUNT 16 a 20 c 15 g 12 t ORIGIN 1 atgaccatga ttacgccaag ctcgcgaggc ctcgagatct atcgatgcat gccggaactc 61 acc // LOCUS STMMELCTAC 101 bp ds-DNA BAD 04-JUL-1990 DEFINITION S.antibioticus tac promoter and melC gene, 5' end. ACCESSION M34506 KEYWORDS . SOURCE S.antibioticus DNA, clone pMELA1. ORGANISM Streptomyces antibioticus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 101) AUTHORS Tseng,H.-C., Lin,C.-K., Hsu,B.-J., Leu,W.-M., Lee,Y.-H.W., Chiou,S.-J., Hu,N.-T. and Chen,C.W. TITLE The melanin operon of Streptomyces antibioticus: Expression and use as a marker in gram-negative bacteria JOURNAL Gene 86, 123-128 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 87 > 101 melC protein BASE COUNT 30 a 23 c 23 g 25 t ORIGIN 1 ccctgttgac aattaatcat cggctcgtat aatgtgtgga attgtgagcg gataacaatt 61 tcacacagga aacaggatca gcttgcatgc cggaactcac c // LOCUS SUSSPERA 2150 bp ss-mRNA INV 04-JUL-1990 DEFINITION S.purpuratus speract egg protein mRNA, complete cds. ACCESSION J02896 KEYWORDS egg protein; speract. SOURCE Sea urchin (library lambda-Zap), cDNA to mRNA. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 2150) AUTHORS Ramarao,C.S., Burks,D.J. and Garbers,D.L. TITLE A single mRNA encodes multiple copies of the egg peptide speract JOURNAL Biochemistry 29, 3383-3388 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 98 988 speract protein precursor pept 206 988 speract protein precursor matp 500 529 speract protein matp 767 796 speract protein matp 800 829 speract protein matp 833 862 speract protein matp 866 895 speract protein matp 500 529 speract protein homolog (put.) matp 668 697 speract protein homolog (put.) matp 701 730 speract protein homolog (put.) matp 734 763 speract protein homolog (put.) matp 899 928 speract protein homolog (put.) matp 932 961 speract protein homolog (put.) signal 1320 1325 polyA signal signal 1463 1468 polyA signal BASE COUNT 656 a 442 c 437 g 615 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcggct actcgacttt acgctagatc tctaccgagc tttctcgtcg aatagaacaa 61 ccgaccgacc cacgtcgttg aagatgaagt tcatcgtatg cctcctggcc ctcgtggcgt 121 agcttcgggc aagcctgtca tcagcggtca ggaccagaag tacacccttg ctgattccag 181 caaggatcta gagggcattg ctcacatggc tatcgtcgac tcactttccc ctctacacat 241 ctcgctctcc tctttggaat ctgcgtggaa caacctcctc aacattgctt ctcaggaaga 301 ctactctatc ccggaacttt ccatacccaa aatcgacgtg aagagcatct tgtcatgcaa 361 gcccaagtat tctcccaagt accctgtggt cttgcagtac atctctgatc attacgtcca 421 ggtccaggac catatcgcca acgccaagga acttacagaa ggactgaaat tcgtctcgca 481 gctcatcatg tacaagaaga tcgaccacga taccctggcc tctgtctcta aaatgttgag 541 caattacttg actgactacg catcgacaat ctcatcactt aaatcagtcg tctgtcaaga 601 tcaaacagcg ccatctcatc ctatggacga gagttacatg gacacacccc tgtcaatgat 661 cttgaaaggc accatgccta ctggcgctgg agtagacaaa ggcttcgcac ttggtggtgg 721 tggagttggt aaaggattca acttgaacgg cggtggagtt ggcaaaggct ttgacttgaa 781 cggcggtgga gttggcaaag gattcgactt gaacggcggt ggagttggca aaggctttga 841 cttgaacggc ggtggagttg gcaaaggctt tgacttgaac ggcggtggag ttggcaaagg 901 tttcgctttg ggcggtggtg gagttggcaa aggattcagc ctcaccggag gaggcgtcgg 961 cagggaagtc gaaatcaaag actggtaatc gacgttctgg cgctcaatag atctgacatc 1021 aaatgccatg cgcccgtaca ggaatttcat aattgccatt tttttttaca ctgatatcaa 1081 tactttggtc gtgcctgtcc aattatacaa atatgtatta aataatagca tcaaatgcaa 1141 aaaaaaaaaa aaaaaaaaaa gggtcttgtt ttcttgagct tgcaatgatc aaccatcatc 1201 cagtgtcgtc atgatggtct gcgtcacaag agaggcgata agcaaaattt aagactcaag 1261 atggtatgtt aaactttgat cttaaggacg ttttctttgt ttcctcatca aatattgaaa 1321 ataaattttt aacaaaaaaa attacatatg cctgtaaaat ggctatttca gcatttttct 1381 ctgttttcta cgattcatat ttctttttca ttggaaaaac tagaaagaat atataaaata 1441 taaaatattt tcaaaatagt ttaataaatt atttccaaaa gacaactttg tcaaaaagaa 1501 ttgaaaaaag aagaaaatga attgaaagca aaattttctt ttcaaaatgt cagcaactga 1561 tagtgatagc cgcaggtatt acatcaggca aatcaaacag gtaaaggtca tacaatgtac 1621 cttctagatg ttactttaca agattttaga tctactttgg tatcttcttt cctttttcct 1681 ttgattgtaa actcctttca agtaatacga attaatacca taagttgcac ctgcctagcc 1741 tatacagtcg acactgccca caaacaaatc gagaagtata gaataacgca acataaaaaa 1801 ggaaagattt tggtagctgt gctcttttcc tcttttctta tctttcgaat gtttactctc 1861 ctgtactcat ttaaaatggt aaattaaaaa gagagccaga tttttatttg gaaggttgca 1921 aatcgattga cagataggtc atcagtgtag agctttgata caagtgacaa aaacggtcca 1981 tacgcttgtt gagcaatatg gttacttcat gaaggccttg tatatttcac ataatgataa 2041 caaatactag aagtcaatga tgcattgctg caaagaattt tgataacgtg actttctttg 2101 tagtctggtg cttggtaaaa tccagagacg caatgaaaat acttgaattc // LOCUS TETTCBP23A 751 bp ss-mRNA INV 04-JUL-1990 DEFINITION T.thermophila calcium-binding protein (TCBP-23) mRNA, complete cds. ACCESSION J05227 KEYWORDS calcium-binding protein. SOURCE T.thermophila (strain B1868), cDNA to mRNA, clone TCBP-23. ORGANISM Tetrahymena thermophila Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 751) AUTHORS Takemasa,T., Takagi,T., Kobayashi,T., Konishi,K. and Watanabe,Y. TITLE The third calmodulin family protein in Tetrahymena: Cloning of the cDNA for Tetrahymena calcium-binding protein of 23 kDa (TCBP-23) JOURNAL J. Biol. Chem. 265, 2514-2517 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 28 651 calcium-binding protein (TCBP-23) signal 661 666 polyA signal signal 676 681 polyA signal BASE COUNT 249 a 120 c 152 g 230 t ORIGIN 1 gtagaaagca aacaaacact aaaagaaatg gaacaccaaa tcatcaccta aaacgtctac 61 gctcctgata ctgaagccaa gcttgatgtt gctagaaagc tttttgctta gtttgattct 121 aacaagaacg gtactttaga tcctagcgaa gttgctggac tcatcaagac tacttttgaa 181 aatatgggtg ttaaggacta cagcgtcact gctgatgatg tcaagcttta catgaagagt 241 gttgatgttg ataacaacgg tcttgtttcc tactctgaat acgaagaata cgtcattgct 301 tgcctcaaga aagctggctt cgactgtgaa gttaagcaaa aggtaaaaag atctgccaaa 361 aagagagacg ctgctactga aatgaagttg gacgttgcca gaagactctt cgctaagtac 421 gactctgata agagtggtta attagaagaa aaggaagttt atggtgtcat tactgaaacc 481 tataagcaaa tgggtatgga ttacaagccc actgaagctg atgttaagct ttggatgtcc 541 atgactgata ctgataagaa tggaactgtc tctattgttg aatatgaaga tttcgtcatt 601 tctggtctta agaaggctgg tttcatggtc aaggaattca ctcaagcttg atcttatttt 661 aataaacata tttcaaataa aaattgaact gtatttgtgt atttcatgtc atttcttaaa 721 attttaaatg atttttttaa tatcgttaaa a // LOCUS TRNRESRA 480 bp ds-DNA BCT 04-JUL-1990 DEFINITION Transposon Tn2501 putative res region. ACCESSION M34377 KEYWORDS . SOURCE Transposon Tn2501 DNA, clone pTM41. ORGANISM Transposon Tn2501 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 480) AUTHORS Michiels,T. and Cornelis,G. TITLE Site-specific recombinations between direct and inverted res sites of Tn2501 JOURNAL Plasmid 22, 249-255 (1989) STANDARD simple staff_entry FEATURES from to/span description site 285 310 inverted repeat region I site 325 356 inverted repeat region II site 366 387 inverted repeat region III BASE COUNT 125 a 110 c 110 g 135 t ORIGIN 1 gcgattctgg atatgacgct actgtgaggg acattgcttt tgtccggaat aaatccaggt 61 ccacgcagat aacagagtaa tacagcaaag cccaggcgat ttgctggccg ccggtgttta 121 ttaatgagtg caatatcctg ttcattcaga aaacacattc gggtcagaat aatttcatca 181 tctggaataa ccagcaagcg ctcctgctct tcactgctca gtatctgtcg ccgtggcata 241 aatgcttccc tcgcgtatag tttccatatg ttatggattg gctatgaccg aaaccggtcg 301 gtttcgtaca ccttctaata tatctgtccg aaagtaagtg aaattcattt cgtacaggcg 361 ctataattcg gacatccatt tcgtacggaa agtttcctat gtcacgagtt tttgcttact 421 gccgggtttc gactcttgag caaaccaccg agaatcagcg aagggaaatt gaagcggcgg // LOCUS HUMALRMB 1354 bp ss-mRNA PRI 04-JUL-1990 DEFINITION Human aldose reductase mRNA, complete cds. ACCESSION J05474 KEYWORDS aldose reductase. SOURCE Human adult and fetus retina, and muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1354) AUTHORS Nishimura,C., Matsuura,Y., Kokai,Y., Akera,T., Carper,D., Morjana,N., Lyons,C. and Flynn,T.G. TITLE Cloning and expression of human aldose reductase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Nishimura, 26-APR-1990. FEATURES from to/span description pept 33 983 aldose reductase (EC 1.1.1.21) mRNA < 1 1354 aldose reductase mRNA BASE COUNT 320 a 352 c 365 g 317 t ORIGIN 1 cggtacgtgc ggccttgggg agcgcagcag ccatggcaag ccgtctcctg ctcaacaacg 61 gcgccaagat gcccatcctg gggttgggta cctggaagtc ccctccaggg caggtgactg 121 aggccgtgaa ggtggccatt gacgtcgggt accgccacat cgactgtgcc catgtgtacc 181 agaatgagaa tgaggtgggg gtggccattc aggagaagct cagggagcag gtggtgaagc 241 gtgaggagct cttcatcgtc agcaagctgt ggtgcacgta ccatgagaag ggcctggtga 301 aaggagcctg ccagaagaca ctcagcgacc tgaagctgga ctacctggac ctctacctta 361 ttcactggcc gactggcttt aagcctggga aggaattttt cccattggat gagtcgggca 421 atgtggttcc cagtgacacc aacattctgg acacgtgggc ggccatggaa gagctggtgg 481 atgaagggct ggtgaaagct attggcatct ccaacttcaa ccatctccag gtggagatga 541 tcttaaacaa acctggcttg aagtataagc ctgcagttaa ccagattgag tgccacccat 601 atctcactca ggagaagtta atccagtact gccagtccaa aggcatcgtg gtgaccgcct 661 acagccccct cggctctcct gacaggccct gggccaagcc cgaggaccct tctctcctgg 721 aggatcccag gatcaaggcg atcgcagcca agcacaataa aactacagcc caggtcctga 781 tccggttccc catgcagagg aacttggtgg tgatccccaa gtctgtgaca ccagaacgca 841 ttgctgagaa ctttaaggtc tttgactttg aactgagcag ccaggatatg accaccttac 901 tcagctacaa caggaactgg agggtctgtg ccttgttgag ctgtacctcc cacaaggatt 961 accccttcca tgaagagttt tgaagctgtg gttgcctgct cgtccccaag tgacctatac 1021 ctgtgtttct tgcctcattt ttttccttgc aaatgtagta tggcctgtgt cactcagcag 1081 tgggacagca acctgtagag tggccagcga gggcgtgtct agcttgatgt tggatctcaa 1141 gagccctgtc agtagagtag aagtctcttc cagtttgctt tgcccttctt tctaccctgc 1201 tggggaaagt acaacctgaa tacccttttc tgaccaaaga gaagcaaaat ctaccaggtc 1261 aaaatagtgc cactaacggt tgagttttga ctgcttggaa ctggaatcct ttcagcaaga 1321 cttctctttg cctcaaataa aaagtgcttt tgtg // LOCUS CHTHYPABX 2468 bp ds-DNA BCT 04-JUL-1990 DEFINITION C.psittaci hypA and hypB protein genes, complete cds. ACCESSION M25101 KEYWORDS . SOURCE C.psittaci (strain GPIC) elementary body DNA, clone pGP57. ORGANISM Chlamydia psittaci Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 2468) AUTHORS Morrison,P.R., Belland,R.J., Lyng,K. and Caldwell,H.D. TITLE Chlamydial disease pathogenesis: The chlamydial hypersensitivity antigen is a stress response protein JOURNAL J. Exp. Med. 170, 1271-1283 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.P.Morrison, 02-JUN-1989. FEATURES from to/span description pept 314 622 hypA protein pept 673 2307 hypB protein signal 83 88 -35 region signal 120 127 -10 region BASE COUNT 855 a 462 c 514 g 637 t ORIGIN 1 gaattcttaa caaagaagat aacgctctcg aatcgtacat gaacttctta aaaagtggtg 61 gctccgactt cccattagaa atcttgaaga agtccggatt ggatatggca accagcgagc 121 ccatatacaa ggccttttcc ttcatagaga gaaaaattca agagttatca tctttaattt 181 aaaacaacta aagaaaagta gcacttgata ttattaagtg ctaaaatcat tgccaaaaaa 241 cgagagactt tggtatcgtt cctgagaaac ggcaaagtct cttttagaac aagaaacaca 301 aggagcttat aacatgtcag atcaagcaac gacccttagg attaagcccc tgggcgatag 361 aattttagtg aaaagagaag aagaagattc tacagcgcgc ggcggcatca ttttacctga 421 tacagcaaag aaaaaacagg atcgagcaga ggtattagtc ctaggcactg gaaaacgaga 481 taaagatggc aacgtcctac cttttgaagt taccgtgggt gatactgttt taatagataa 541 atacgcggga caagaactta ccgttgatgg tgaggagtac gtcattgttc aggaaagcga 601 agttatggca gttctcaagt aagagaaatc attatttata gattgcaaaa agttaaggag 661 cacaaaaaaa caatggcagc aaaaaatatt aaatataacg aagacgccag aaaaaaaatc 721 cataaaggag ttaaaaccct tgcagaagct gtaaaagtaa ccttaggtcc taaaggccgt 781 catgtggtta tcgataaaag ctttggttct cctcaagtta ccaaagatgg cgtaactgtc 841 gctaaagaaa ttgagctcga agacaagcat gagaacatgg gagctcaaat ggtaaaagaa 901 gtcgctagca aaactgcaga taaagctggt gatggaacta caacagctac tgttcttgca 961 gaagctatct acagtgaagg attgagaaac gtaactgcag gcgccaatcc tatggacctc 1021 aaaagaggca ttgataaggc agtaaaagtc gttgtcgatg aaatcaaaaa aattagtaaa 1081 cccgtacaac atcacaaaga aatagctcaa gtagcgacta tttctgcaaa taatgatgct 1141 gaaatcggta atcttatcgc cgaagccatg gaaaaagttg gcaaaaacgg ctctattact 1201 gttgaagaag ctaaaggttt cgaaactgtc ctcgacgttg tcgaaggtat gaatttcaac 1261 cgcggatacc tatccagcta cttctctaca aatcctgaaa cacaagaatg tgttttagaa 1321 gaagctctcg tgcttatcta tgataaaaaa atttccggaa tcaaagattt tctaccagtt 1381 ttacaacaag tagcagaatc aggacgtccc ctacttatca ttgctgaaga tatcgaagga 1441 gaagctttag ctactttagt agtaaacaga ctacgtgctg gattcagagt gtgtgcagta 1501 aaagctcctg gatttggtga tagaagaaaa gctatgttag aagacatcgc tattttaact 1561 ggtggtcaac tcatcagcga agagcttggc atgaagcttg agaacacaac tctagctatg 1621 ttaggaaaag ctaaaaaagt catcgtttcc aaagaagata caacaattgt tgaaggtctt 1681 ggcagcaaag aagatattga atctcgctgc gaaagtatca aaaaacaaat cgaagacagt 1741 acttctgatt acgacaaaga aaaactccaa gaacgtttag ctaaactttc cggaggcgta 1801 gctgtaatcc gtgtaggagc tgctacagaa atcgaaatga aagagaaaaa agacagagta 1861 gatgatgctc agcatgcaac tcttgctgca gttgaagaag gtattctacc tggcggtggt 1921 acagctttag ttcgctgcat ccctacttta gaagctttca ttcctattct tacaaatgaa 1981 gatgagcaaa tcggagcacg tattgttctc aaagcattat ccgctccatt aaagcaaatt 2041 gcagccaatg ctggtaaaga aggcgctatc atctgtcaac aagtgctttc tcgctcctct 2101 agcgaaggct atgatgcttt acgcgatgct tacaccgaca tgattgaggc aggaattctc 2161 gatccaacta aagttacacg ttgtgcttta gaaagcgcag cttctgtagc tgggcttcta 2221 ttaacaacag aagctttaat tgccgatatt cctgaagaga aatcctcttc tgctcccgca 2281 atgccaggcg caggaatgga ttattaatcc ttaatttaga gagcattttc tctaatatta 2341 taaggtctcc tttcatccat cttgagaaga aagggggcct tttttatttt ctaatatttc 2401 tttcttcatc tatgttggaa accaagataa atcatattct catcatgcat gtttaaactt 2461 ttaaagaa // LOCUS TRBPGKM 3306 bp ds-DNA INV 04-JUL-1990 DEFINITION T.brucei phosphoglycerate kinase isozyme-8 (cPGK-8, gPGK) genes, complete cds. ACCESSION M33775 KEYWORDS phosphoglycerate kinase. SOURCE T.brucei (strain EATRO 164) DNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 3306) AUTHORS Alexander,K., Parail,A.C. and Parsons,M. TITLE An allele of Trypanosoma brucei cytoplasmic phosphoglycerate kinase is a mosaic of other alleles and genes JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Authorin submission for [1] kindly submitted by M.Parsons, 18-APR-1990. Author address: M.Parsons Seattle Biomedical Research Institute 4 Nickerson Street Seattle, WA 98109-1651 (206) 284-8846 FEATURES from to/span description pept 175 1437 phosphoglycerate kinase (cPGK-8; E.C. 2.7.2.3) pept 1792 3114 phosphoglycerate kinase (gPGK; E.C. 2.7.2.3) allele 406 406 c in allele 8; g in allele 2 allele 466 467 cg in allele 8; gc in allele 2 allele 603 604 ga in allele 8; gaaga in allele 2 allele 640 640 g in allele 8; t in allele 2 allele 729 730 tc in allele 8; ca in allele 2 allele 783 783 g in allele 8; a in allele 2 allele 840 840 a in allele 8; c in allele 2 allele 944 950 aatcgaa in allele 8; atccatt in allele 2 allele 1018 1018 g in allele 8; t in allele 2 allele 1107 1107 g in allele 8; a in allele 2 allele 1425 1428 cgaa in allele 8; tgat in allele 2 allele 1439 1457 attcatcctgttacgggcc in allele 8; gtacatcaggcgaagggtt in allele 2 allele 1483 1483 t in allele 8; g in allele 2 allele 1512 1512 t in allele 8; c in allele 2 allele 1537 1540 ttta in allele 8; ta in allele 2 allele 2015 2015 a in allele 8; g in allele 2 allele 2083 2084 cg in allele 8; gc in allele 2 allele 2136 2136 t in allele 8; c in allele 2 allele 2151 2151 t in allele 8; c in allele 2 allele 2196 2199 cttt in allele 8; gttc in allele 2 allele 2268 2268 c in allele 8; c in allele 2 allele 2400 2400 g in allele 8; a in allele 2 allele 2919 2919 c in allele 8; t in allele 2 allele 3031 3031 g in allele 8; a in allele 2 allele 3057 3057 g in allele 8; t in allele 2 allele 3196 3197 tg in allele 8; gtttg in allele 2 allele 3248 3248 t in allele 8; c in allele 2 allele 3257 3257 g in allele 8; t in allele 2 allele 3273 3274 ag in allele 8; agag in allele 2 allele 3284 3286 ttg in allele 8; tg in allele 2 allele 3297 3299 aat in allele 8; at in allele 2 BASE COUNT 863 a 659 c 833 g 951 t ORIGIN 1 aaaccgaatt cccttcccca agtctcgcag tcacttcttt tcaacgtttt ctcacttagc 61 caacatactt gtattttttg tttcaaaaca ttaaaaaatt gtaacaaggg agtttcttat 121 ttttttgaaa aaactatata tatcgatata tacttatctg atcacaaatc aaatatgtca 181 ctaaaagaga ggaagagcat taacgaatgc gatcttaagg gaaagaaggt tcttatccgt 241 gttgacttta atgttcctct ggatgatggt aatattacca acgactaccg aatccgatca 301 gctctgccag ccgtccaaaa agttctcaca gaaggcggca gctgtgttct catgagccac 361 ctcgggaggc cgaaaggcgt ttctatggct gaaggcaaag aactgcggag cgctggcggt 421 attcccgggt tcgagcagaa ggcaacactc aaaccggtag ccaagcgcct cagcgaattg 481 ttatcgagac ccgtcacatt cgcacctgac tgtctgaacg ctgcagatgt cgtctctaag 541 atgtctccgg gcgatgttgt tctgcttgaa aatgtacggt tctacaaaga agagggcagc 601 aagagcactg aggaacgtga agccatggcc aagatccttg cgtcatatgg tgatgtttac 661 atcagtgatg cttttggtac agctcaccgt gacagtgcta ccatgaccgg aattccaaag 721 attttgggtc acggtgctgc cggttatttg atggagaagg agatttcata cttcgctaag 781 gtgcttggta acccgccgcg tccgctggtt gctatcgttg gtggagcgaa agtgagcgaa 841 aagatccaac ttctggataa catgttgcag cgcatcgatt atctcttaat tggtggtgca 901 atggcataca catttctgaa ggctcagggt tacagcattg gaaaatcgaa gtgcgaggaa 961 agtaaacttg aatttgctcg atccctgctg aagaaggcgg aggaccgcaa ggtgcaggtt 1021 attcttccaa ttgatcatgt ttgccacacg gaattcaaag ctgtggattc tccattgata 1081 actgaggatc aaaacatccc tgaagggcat atggctctgg atattggtcc caagactatt 1141 gaaaaatatg ttcagacgat tgggaagtgt aagagcgcca tttggaacgg tcccatgggt 1201 gtatttgaaa tggttcctta ttccaaaggt acatttgcaa ttgcgaaagc catgggtcga 1261 ggaactcacg agcatggact catgagtatc atcggtggtg gtgacagcgc aagtgcagct 1321 gagttgagcg gtgaggcgaa gcgcatgtct catgtttcaa cgggtggtgg tgcgtctttg 1381 gaactcctcg agggcaaaac gcttcccggt gttacagtat tggacgaaaa ggagtaatat 1441 tcatcctgtt acgggcctgt tttatggaat tgtgtttttt agtccttttt atttgttggt 1501 taggtattgg ttcgtacgtg actattattt ttttttttta ggataacatt tatgtttttt 1561 ctctactcat tttatttttt ggtactatgt atcgattgca cagtaatatt tcaatcgcta 1621 tgatatttct atctttttga tcgttctact gtgtaacttt gtgtttatct ttgttacttc 1681 actctttttt cactcaaatc gtttgggctg caggctcagc tctttggtga tatcaaagca 1741 taattgctgc ggagatacgt ttttccacct aataagtaat tgtgatacaa gatgaccctt 1801 aacgagaaga agagcattaa tgaatgcgat cttaagggaa agaaggttct tatccgtgtt 1861 gactttaatg ttcccgtgaa aaacggtaag atcaccaacg actaccgaat ccgatcagct 1921 ctgccaacgc tcaagaaggt tctcacagaa ggcggcagct gtgttctcat gagccacctc 1981 gggaggccga aaggtattcc catggcgcaa gctgacaaaa tacggagcac tggcggtgtt 2041 cccgggttcc aacagaaggc aacactcaaa ccggtagcca agcgcctcag cgaactgcta 2101 ttgaggcccg tcacattcgc acctgactgc ctgaatgctg cagatgtcgt ttctaagatg 2161 tctccgggcg atgttgttct gcttgaaaat gtacgctttt acaaagaaga gggcagcaag 2221 aaggcaaaag accgtgaagc catggccaag atccttgcgt catatggcga tgtttacatc 2281 agtgatgctt ttggtacagc tcaccgtgac agtgctacca tgaccggaat tccaaagatt 2341 ttgggcaacg gtgctgccgg ttatttgatg gagaaggaga tttcatactt cgctaaggtg 2401 cttggtaacc cgccgcgtcc gctggttgct atcgttggtg gagcgaaagt gagcgacaag 2461 atccaacttc tggataacat gttgcagcgc atcgattatc tcttaattgg tggtgcaatg 2521 gcatacacat ttctgaaggc tcagggttac agcattggaa aatcgaagtg cgaggaaagt 2581 aaacttgaat ttgctcgatc cctgctgaag aaggcggagg accgcaaggt gcaggttatt 2641 cttccaattg atcatgtttg ccacacggaa ttcaaagctg tggattctcc attgataact 2701 gaggatcaaa acatccctga agggcatatg gctctggata ttggtcccaa gactattgaa 2761 aaatatgttc agacgattgg gaagtgtaag agcgccattt ggaacggtcc catgggtgta 2821 tttgaaatgg ttccttattc caaaggtaca tttgcaattg cgaaagccat gggtcgagga 2881 actcacgagc atggactcat gagtatcatc ggtggtggcg acagcgcaag tgcagctgag 2941 ttgagcggtg aggcgaagcg catgtctcat gtttcaactg gtggtggtgc gtctttggaa 3001 ctcctcgagg gcaaaacgct tcccggcgtt gcagtattgg acgaaaagtc ggcggtggtg 3061 tcgtatgcct ctgcaggtac tggaactctt tctaaccggt ggagctctct ttaaacttcc 3121 agaaaaaata tatttctgca aaatactttt ggaagtttgt cttgtcttta tagatgaagg 3181 atttgtttct ttttttgtga tgttttcaag gttaattagt tttgggggtt cgttatctta 3241 attattttgg tgggtgggag taaataaagc agaggtaaat tttttggtga cacaaaaatt 3301 gggaag //
GenBank-Updates@genbank.bio.net (07/06/90)
LOCUS MUSLRPA 3070 bp ss-mRNA ROD 06-JUL-1990 DEFINITION Mouse leukocyte common antigen-related phosphatase mRNA, complete cds. ACCESSION M36033 M33671 KEYWORDS leukocyte common antigen-related phosphatase; protein-tyrosine-phosphate phosphohydrolase; tyrosine phosphatase. SOURCE Mouse (strain (C57BL/6 X DBA/2)F1) pre-B cell line 70Z/3, cDNA to mRNA, clones lambda-mLRP-[B20,C26]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3070) AUTHORS Matthews,R.J., Cahir,E.D. and Thomas,M.L. TITLE Identification of an additional member of the protein-tyrosine-phosphatase family: Evidence for alternative splicing in the tyrosine phosphatase domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4444-4448 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.L.Thomas, 11-APR-1990. FEATURES from to/span description pept 26 2515 leukocyte common antigen-related phosphatase precursor (EC 3.13.48) sigp 26 82 leukocyte common antigen-related phosphatase signal peptide matp 83 2512 leukocyte common antigen-related phosphatase mRNA < 1 3070 leukocyte common antigen-related phosphatase mRNA site 827 936 gg in short clone lambda-mLRP-B20 BASE COUNT 874 a 754 c 732 g 710 t ORIGIN 1 ccgcccagcg ccgggctcgg tcagcatgga ttcctggttc attcttgtcc tgtttggcag 61 tggtctaata catgttagtg ccaacaatgc tactacagtt tcaccttctt taggaacgac 121 aagattaatt aaaacatcaa caacagaatt ggctaaggaa gagaataaaa cctcaaattc 181 aacctcttca gtaatttctc tttctgtggc accaacattc agcccaaacc tgactctgga 241 gcccacctat gtgactactg ttaattcttc acactctgac aatgggacca ggagggcagc 301 cagcacggaa tctggaggca ctaccatttc cccgaacgga agctggctta ttgagaacca 361 gttcacggat gccataacag aaccctggga ggggaactcc agcactgcag caaccactcc 421 agaaaccttc cccccggcag atgagacacc aattattgcg gtgatggtgg ccctgtcctc 481 tctgctagta atcgtgttta ttatcatagt tctgtacatg ttaaggttta agaaatacaa 541 gcaagctggg agtcattcca actctttccg cctgtcaaat ggccgcacgg aggatgtgga 601 gccccaaagt gtaccacttc tggccaggtc cccaagcacc aacaggaagt acccaccact 661 gcctgtggac aagctggaag aggagattaa ccggagaatg gctgatgaca ataagatctt 721 cagagaagaa ttcaacgctc tccctgcttg tcctatccag gccacctgtg aggctgcctc 781 caaggaagaa aacaaggaaa aaaaccgcta tgtaaacatc ctgccctttc tctctttagc 841 tgtgagcaag gatgcagtga aagcactgaa caaaaccact ccattgttag aaagaaggtt 901 tattgggaaa tcaaactcca gaggctgtct ctcagatgac cactctagag tgcacctgac 961 acctgttgaa ggggtcccag attctgatta catcaacgct tcattcatta atggctacca 1021 ggaaaagaac aaattcatcg ctgcacaagg accaaaagaa gaaacagtga atgacttctg 1081 gagaatgata tgggaacaaa acacagctac tattgtcatg gtgaccaacc tgaaggagag 1141 aaaggagtgt aaatgtgccc aatactggcc agaccaaggc tgctggacct atgggaatgt 1201 ccgtgtgtct gtcgaggatg tgactgttct ggtggactac acagtacgga aattctgcat 1261 ccagcaggtg ggcgacgtga ccaacaggaa accacagcgc ctcatcactc agttccactt 1321 caccagctgg ccagactttg gggtgccttt caccccaatt ggcatgctca agttcctcaa 1381 gaaggtgaag gcctgtaacc ctcagtacgc aggggctatc gtggtccact gcagtgcagg 1441 tgtagggcgc actggcacct ttgttgtcat cgatgccatg ctggacatga tgcattcgga 1501 acgcaaagtg gatgtatacg ggtttgtgag ccggatccgg gcccagcgct gccagatggt 1561 acagacagac atgcagtacg tcttcatata ccaggccctt ctggagcatt atctgtatgg 1621 ggacacagaa ctggaagtga cttctctaga aacccaccta caaaaaattt ataacaagat 1681 cccagggacg agcaacaacg ggttagagga ggagtttaag aaattaactt caatcaaaat 1741 ccagaatgac aagatgcgca cgggaaacct tccagccaac atgaagaaga accgggtttt 1801 acagatcatt ccatatgaat ttaacagagt gatcattcca gtcaaacgag gcgaagagaa 1861 cacagactat gtgaacgcat ccttcattga tggataccgg cagaaagact cctacattgc 1921 cagccagggc cctcttctcc acacgattga ggacttctgg cgaatgatct gggagtggaa 1981 gtcctgttct atcgtaatgc tgacagaact ggaagagaga ggccaggaga agtgtgccca 2041 gtactggcca tctgatggcc tggtgtccta tggagacatc acagttgagc tgaagaagga 2101 ggaggaatgt gaaagctaca ctgtccgaga cctcctggtc accaacacca gggagaacaa 2161 gagtcggcaa atccggcagt tccacttcca cggctggcct gaggtgggca tccccagcga 2221 cggcaagggc atgatcaaca tcattgcagc agtgcagaag cagcagcagc agtcggggaa 2281 ccatcccatc actgtgcact gcagtgccgg ggcaggacgg acaggaacct tctgtgcctt 2341 gagcacagtc ctggaacgtg tgaaagcaga aggaatttta gatgtcttcc aaactgtcaa 2401 gagcctgcgg ctgcagaggc cacacatggt ccagacactg gaacagtatg aattctgcta 2461 caaggtggta caggagtaca ttgacgcctt ttcagattat gccaacttca agtgacaggt 2521 gacaaggccc acagacagga gaattgcctt taatattttg taatattctg tttttgttaa 2581 tatacccaaa attgtatata tcttataact gttttagaaa tggcacatag gcttctatta 2641 cctgttaggt ggagattttg tatgtaaatg tgttagcact gatagtcctt ttccagtgtt 2701 ttattgggaa attaaatagt gtgatatttg ggttgatata atgaaatcct cagcctggaa 2761 actgggccag attgttcctt gcttcaaata tcttttccta aagaagataa acctaagact 2821 cattccaggt agctcagtgc caactaaaac aaagcacaaa gttctcagag ctcttgagga 2881 aatggttgtc tccctgtccc caggcaggcc tcttcccctc cctgtcctgt aaatatccct 2941 cccctctcca gtccaccctc atctcccacc aagatcagcc acctcaggca tggggagtaa 3001 tgagaccaga gcgcctctct ggcaccacag cagggatcgt caggtaataa acactcttga 3061 ttccctgagg // LOCUS MUSLRPB 144 bp ds-DNA ROD 06-JUL-1990 DEFINITION Mouse leukocyte common antigen-related phosphatase gene, exon N. ACCESSION M36034 M33671 KEYWORDS leukocyte common antigen-related phosphatase; protein-tyrosine-phosphate phosphohydrolase; tyrosine phosphatase. SOURCE Mouse (strain (C57BL/6 X DBA/2)F1) pre-B cell line 70Z/3 DNA, clone lambda-mLRP-B20. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 144) AUTHORS Matthews,R.J., Cahir,E.D. and Thomas,M.L. TITLE Identification of an additional member of the protein-tyrosine-phosphatase family: Evidence for alternative splicing in the tyrosine phosphatase domain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4444-4448 (1990) STANDARD full staff_review FEATURES from to/span description pept / 25 / 132 leukocyte common antigen-related phosphatase, exon N (EC 3.13.48) (AA at 27) IVS < 1 24 leukocyte common antigen-related phosphatase intron X IVS 133 > 144 leukocyte common antigen-related phosphatase intron X+1 BASE COUNT 50 a 30 c 30 g 34 t ORIGIN 1 gcttactcaa acgatcatac aaagttctct ctttagctgt gagcaaggat gcagtgaaag 61 cactgaacaa aaccactcca ttgttagaaa gaaggtttat tgggaaatca aactccagag 121 gctgtctctc aggtcagaga aaac // LOCUS HUMACOLA1 34 bp ds-DNA PRI 06-JUL-1990 DEFINITION Human alpha-1 type IX collagen (COL9A1) gene, exon 1. ACCESSION M34534 KEYWORDS alpha-1 type IX collagen. SEGMENT 1 of 4 SOURCE Human DNA, clone YMh304. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 34) AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R. TITLE The alpha-1 (IX) collagen gene gives rise to two different transcripts in mouse embryonic and human fetal RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by B.R.Olsen, 15-FEB-1990. FEATURES from to/span description pept 19 / 32 alpha-1 type IX collagen precursor, exon 1 /hgml_locus_uid="LG0170H" /nomgen="COL9A1" /map="6q12-q14" pep$ 19 / 32 alpha-1 type IX collagen precursor, exon 1 sigp 19 / 32 alpha-1 type IX collagen signal peptide sig$ 19 / 32 alpha-1 type IX collagen signal peptide IVS 33 > 34 COL9A1 intron A BASE COUNT 14 a 5 c 9 g 6 t ORIGIN 1 agaaaatcaa ctgggaaaat gaagacctgc tggt // LOCUS HUMACOLA2 88 bp ds-DNA PRI 06-JUL-1990 DEFINITION Human alpha-1 type IX collagen (COL9A1) gene, exon 6. ACCESSION M32135 KEYWORDS alpha-1 type IX collagen. SEGMENT 2 of 4 SOURCE Human DNA, clone YMh617. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 88) AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R. TITLE The alpha-1 (IX) collagen gene gives rise to two different transcripts in mouse embryonic and human fetal RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by B.R.Olsen, 15-FEB-1990. FEATURES from to/span description pept / 3 / 86 alpha-1 type IX collagen, exon 6 (AA at 3) /hgml_locus_uid="LG0170H" /nomgen="COL9A1" /map="6q12-q14" pep$ / 3 + 86 alpha-1 type IX collagen, exon 6 (AA at 3) IVS < 1 2 COL9A1 intron E IVS 87 > 88 COL9A1 intron F BASE COUNT 23 a 24 c 22 g 19 t ORIGIN Unknown number of bp after segment 1. 1 agtttgaact tcaatggatg ctgatccatt gtgaccccct gcggcccagg acagaaactt 61 gccatgagct gccagccaga ataacggt // LOCUS HUMACOLA3 74 bp ds-DNA PRI 06-JUL-1990 DEFINITION Human alpha-1 type IX collagen (COL9A1) gene, exon 1*. ACCESSION M32133 KEYWORDS alpha-1 type IX collagen. SEGMENT 3 of 4 SOURCE Human DNA, clone Ymh617. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 74) AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R. TITLE The alpha-1 (IX) collagen gene gives rise to two different transcripts in mouse embryonic and human fetal RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by B.R.Olsen, 15-FEB-1990. FEATURES from to/span description pept < 1 + 72 alpha-1 type IX collagen, exon 1* (AA at 1) /hgml_locus_uid="LG0170H" /nomgen="COL9A1" /map="6q12-q14" IVS 73 > 74 COL9A1 intron G* BASE COUNT 5 a 21 c 32 g 16 t ORIGIN Unknown number of bp after segment 2. 1 atggcctgga ctgcgcggga ccgcggggcc ctggggctgc tgctgttggg gctctgcttg 61 tgcgcggctc aagt // LOCUS HUMACOLA4 25 bp ds-DNA PRI 06-JUL-1990 DEFINITION Human alpha-1 type IX collagen (COL9A1) gene, exon 7. ACCESSION M32137 KEYWORDS alpha-1 type IX collagen. SEGMENT 4 of 4 SOURCE Human DNA, clone YMh617. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 25) AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R. TITLE The alpha-1 (IX) collagen gene gives rise to two different transcripts in mouse embryonic and human fetal RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by B.R.Olsen, 15-FEB-1990. FEATURES from to/span description pept + 3 / 23 alpha-1 type IX collagen, exon 7 /hgml_locus_uid="LG0170H" /nomgen="COL9A1" /map="6q12-q14" pep$ + 3 / 23 alpha-1 type IX collagen (COL9A1), exon 7 IVS < 1 2 COL9A1 intron G* IVS 24 > 25 COL9A1 intron H* IVS < 1 2 COL9A1 intron F IVS 24 > 25 COL9A1 intron G BASE COUNT 7 a 10 c 7 g 1 t ORIGIN Unknown number of bp after segment 3. 1 agcccagcca gaccaccgac gaggt // LOCUS MUSACOLA1 34 bp ds-DNA ROD 06-JUL-1990 DEFINITION Mouse alpha-1 type IX collagen (COL9A1) gene, exon 1. ACCESSION M34533 KEYWORDS alpha-1 type IX collagen. SEGMENT 1 of 4 SOURCE Mouse DNA, clone YMm513. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 34) AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R. TITLE The alpha-1 (IX) collagen gene gives rise to two different transcripts in mouse embryonic and human fetal RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by B.R.Olsen, 15-FEB-1990. FEATURES from to/span description pept 19 / 32 alpha-1 type IX collagen (COL9A1) precursor, exon 1 pep$ 19 / 32 alpha-1 type IX collagen precursor, exon 1 sigp 19 / 32 alpha-1 type IX collagen signal peptide sig$ 19 / 32 alpha-1 type IX collagen signal peptide IVS 33 > 34 COL9A1 intron A BASE COUNT 13 a 6 c 9 g 6 t ORIGIN 1 agagaaccca ctgggaaaat gaagaacttc tggt // LOCUS MUSACOLA2 88 bp ds-DNA ROD 06-JUL-1990 DEFINITION Mouse alpha-1 type IX collagen (COL9A1), exon 6. ACCESSION M32134 KEYWORDS alpha-1 type IX collagen. SEGMENT 2 of 4 SOURCE Mouse DNA, clones YMm[413,911,341]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 88) AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R. TITLE The alpha-1 (IX) collagen gene gives rise to two different transcripts in mouse embryonic and human fetal RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by B.R.Olsen, 15-FEB-1990. FEATURES from to/span description pept / 3 / 86 alpha-1 type IX collagen (COL9A1), exon 6 (AA at 3) pep$ / 3 + 86 alpha-1 type IX collagen, exon 6 (AA at 3) IVS < 1 2 COL9A1 intron E IVS 87 > 88 COL9A1 intron F BASE COUNT 25 a 22 c 21 g 20 t ORIGIN Unknown number of bp after segment 1. 1 agtttgaact ccagtggatg ctgattcatt gtgaccccct gagacccagg agagaaacct 61 gtcatgagct gccaatcaga atcacagt // LOCUS MUSACOLA3 80 bp ds-DNA ROD 06-JUL-1990 DEFINITION Mouse alpha-1 type IX collagen (COL9A1) gene, exon 1*. ACCESSION M32132 KEYWORDS alpha-1 type IX collagen. SEGMENT 3 of 4 SOURCE Mouse DNA, clones YMm[413,911,341]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 80) AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R. TITLE The alpha-1 (IX) collagen gene gives rise to two different transcripts in mouse embryonic and human fetal RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by B.R.Olsen, 15-FEB-1990. FEATURES from to/span description pept < 1 + 78 alpha-1 type IX collagen (COL9A1), exon 1* (AA at 1) IVS 79 > 80 COL9A1 intron F* BASE COUNT 8 a 19 c 31 g 22 t ORIGIN Unknown number of bp after segment 2. 1 atggcctggg ctgcctgggg acgaggagtg cttgggctgt cactgatgct gtctgggctc 61 cgcttgtgtg ctgctcaagt // LOCUS MUSACOLA4 25 bp ds-DNA ROD 06-JUL-1990 DEFINITION Mouse alpha-1 type IX collagen (COL9A1) gene, exon 7. ACCESSION M32136 KEYWORDS alpha-1 type IX collagen. SEGMENT 4 of 4 SOURCE Mouse DNA, clones YMm[413,911,341]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 25) AUTHORS Muragaki,Y., Nishimura,I., Henney,A., Ninomiya,Y. and Olsen,B.R. TITLE The alpha-1 (IX) collagen gene gives rise to two different transcripts in mouse embryonic and human fetal RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2400-2404 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by B.R.Olsen, 15-FEB-1990. FEATURES from to/span description pept + 3 / 23 alpha-1 type IX collagen (COL9A1), exon 7 pep$ + 3 / 23 alpha-1 type IX collagen, exon 7 IVS < 1 2 COL9A1 intron G* IVS 24 > 25 COL9A1 intron H* IVS < 1 2 COL9A1 intron F IVS 24 > 25 COL9A1 intron G BASE COUNT 8 a 7 c 7 g 3 t ORIGIN Unknown number of bp after segment 3. 1 agaccagcca gaccactgat gaggt //
GenBank-Updates@genbank.bio.net (07/07/90)
LOCUS TRBMVAT5A 1544 bp ss-mRNA INV 07-JUL-1990 DEFINITION T.brucei variant surface glycoprotein (MVAT5) mRNA, complete cds. ACCESSION M33825 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1544) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. BASE COUNT 497 a 396 c 366 g 285 t ORIGIN 1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt 61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt 121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa 181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg 241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt 301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa 361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac 421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg 481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc 541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg 601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca 661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca 721 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca 781 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat 841 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt 901 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac 961 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa 1021 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac 1081 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc 1141 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc 1201 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat 1261 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca 1321 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc 1381 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta 1441 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa 1501 gaatttttgc tactttaaaa acttctgata tattttaaca ccta // LOCUS TRBWRATATA 1544 bp ss-mRNA INV 07-JUL-1990 DEFINITION T.brucei variant surface glycoprotein (WRATat A) mRNA, complete cds. ACCESSION M33823 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1544) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. FEATURES from to/span description pept 42 1457 variant surface glycoprotein (WRATat A) mRNA < 1 1544 WRATat A mRNA BASE COUNT 545 a 387 c 345 g 267 t ORIGIN 1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc 61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt 121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac 181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg 241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg 301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc 361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa 421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct 481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca 541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg 601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac 661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca 721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa 781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac 841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg 901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt 961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg 1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca 1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg 1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc 1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag 1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt 1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga 1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg 1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta 1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg // LOCUS TRBWRATATB 1585 bp ss-mRNA INV 07-JUL-1990 DEFINITION T.brucei variant surface glycoprotein (WRATat B) mRNA, complete cds. ACCESSION M33824 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1585) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. BASE COUNT 535 a 384 c 376 g 290 t ORIGIN 1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact 61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga 121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa 181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac 241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga 301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat 361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga 421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc 481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca 541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc 601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg 661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag 721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa 781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga 841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact 901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca 961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca 1021 cgacctcata acagctaaat ccggtacgac ataagcggcg caatcactcg gctaaagcaa 1081 ctacaaatcg ccaaagcaaa gctaaaagca agacggctac tgctgcaaaa cagggaacgg 1141 cagcaaacgc gacttatggc gctagcagac aagatgcaag aattgtacca agaggcctta 1201 catgacgagg ttcaactcag gaaggaagcg cagaacaaac ctcaagaaac accagattct 1261 gacaagcaaa aagcatgcga gaaatatcac aacaagtcaa aggaatgcaa agaaaatggt 1321 tgccaatgga gtggaactga agaaaccact aggaaagtgc gaagctaaac ccaaagcagg 1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac 1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt 1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact 1561 tctggtatat tttaacacct ttaaa // LOCUS HUMMHB17W 3520 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human MHC class I HLA-Bw58 gene, complete cds. ACCESSION M11799 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human B lymphoblastoid cell line WT49 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3520) AUTHORS Ways,J.P., Coppin,H.L. and Parham,P. TITLE The complete primary structure of HLA-Bw58 JOURNAL J. Biol. Chem. 260, 11924-11933 (1985) STANDARD simple staff_review FEATURES from to/span description pept 88 160 HLA-Bw58 of MHC, exon 1 /nomgen="HLA-B" /map="6p21.3" /hgml_locus_uid="LL0111R" 290 559 HLA-Bw58 of MHC, exon 2 806 1081 HLA-Bw58 of MHC, exon 3 1657 1932 HLA-Bw58 of MHC, exon 4 2037 2153 HLA-Bw58 of MHC, exon 5 2595 2627 HLA-Bw58 of MHC, exon 6 2734 2777 HLA-Bw58 of MHC, exon 7 pre-msg 67 3386 HLA-Bw58 of MHC, mRNA and introns IVS 161 289 HLA-Bw58 intron A IVS 560 805 HLA-Bw58 intron B IVS 1082 1656 HLA-Bw58 intron C IVS 1933 2036 HLA-Bw58 intron D IVS 2154 2594 HLA-Bw58 intron E IVS 2628 2733 HLA-Bw58 intron F BASE COUNT 665 a 981 c 1099 g 775 t ORIGIN Chromosome 6p21.3. 1 tctagagaag ccaatcagtg tcgccggggt cccagttcta aagtccccac gcacccaccc 61 ggactcagaa tctcctcaga cgccgagatg cgggtcacgg cgccccgaac cgtcctcctg 121 ctgctctggg gggcagtggc cctgaccgag acctgggccg gtgagtgcgg ggtcgggagg 181 gaaatggcct ctgtggggag gagcgagggg accgcaggcg ggggcgcagg acctgaggag 241 ccgcgccggg aggagggtcg ggcgggtctc agcccctcct cgcccccagg ctcccactcc 301 atgaggtatt tctacaccgc catgtcccgg cccggccgcg gggagccccg cttcatcgca 361 gtgggctacg tggacgacac ccagttcgtg aggttcgaca gcgacgccgc gagtccgagg 421 acggagcccc gggcgccatg gatagagcag gaggggccgg agtattggga cggggagaca 481 cggaacatga aggcctccgc gcagacttac cgagagaacc tgcggatcgc gctccgctac 541 tacaaccaga gcgaggccgg tgagtgaccc cggcccgggg cgcaggtcac gactccccat 601 cccccacgta cggcccgggg tcgccccgag tctccgggtc cgagatccgc ctccctgagg 661 ccgcgggacc cgcccagacc ctcgaccggc gagagcccca ggcgcgttta cccggtttca 721 ttttcagttg aggccaaaat ccccgcgggt tggtcggggc ggggcggggc tcgggggacg 781 gggctgaccg cggggccggg gccagggtct cacatcatcc agaggatgta tggctgcgac 841 ctggggcccg acgggcgcct cctccgcggg catgaccagt ccgcctacga cggcaaggat 901 tacatcgccc tgaacgagga cctgagctcc tggaccgcgg cggacaccgc ggctcagatc 961 acccagcgca agtgggaggc ggcccgtgtg gcggagcagc tgagagccta cctggagggc 1021 ctgtgcgtgg agtggctccg cagatacctg gagaacggga aggagacgct gcagcgcgcg 1081 ggtaccaggg gcagtgggga gccttcccca tctcctatag gtcgccgggg atggcctccc 1141 acgagaagag gaggaaaatg ggatcagcgc tagaatgtcg ccctcccttg aatggagaat 1201 ggcatgagtt ttcctgagtt tcctctgagg gccccctctt ctctctagga caattaaggg 1261 atgacgtctc tgaggaaatg gaggggaaga cagtccctag aatactgatc aggggtcccc 1321 tttgacccct gcagcagcct tgggaaccgt gacttttcct ctcaggcctt gttctctgcc 1381 tcacactcag tgtgtttggg gctctgattc cagcacttct gagtcacttt acctccactc 1441 agatcaggag cagaagtccc tgttccccgc tcagagactc gaactttcca atgaatagga 1501 gattatccca ggtgcctgcg tccaggctgg tgtctgggtt ctgtgcccct tccccacacc 1561 aggtgtcctg tccattctca ggctggtcac atgggtggtc ctagggtgtc ccatgagaga 1621 tgcaaagcgc ctgaattttc tgactcttcc catcagaccc cccaaagaca cacgtgaccc 1681 accaccccgt ctctgaccat gaggccaccc tgaggtgctg ggccctgggc ttctaccctg 1741 cggagatcac actgacctgg cagcgggatg gcgaggacca aactcaggac actgagcttg 1801 tggagaccag accagcagga gatagaacct tccagaagtg ggcagctgtg gtggtgcctt 1861 ctggagaaga gcagagatac acatgccatg tacagcatga ggggctgccg aagcccctca 1921 ccctgagatg gggtaaggag ggggatgagg ggtcatatct cttctcaggg aaagcaggag 1981 cccttctgga gcccttcagc agggtcaggg cccctcgtct tcccctcctt tcccagagcc 2041 atcttcccag tccaccatcc ccatcgtggg cattgttgct ggcctggctg tcctagcagt 2101 tgtggtcatc ggagctgtgg tcgctactgt gatgtgtagg aggaagagct caggtaggga 2161 aggggtgagg ggtggggtct gggttttctt gtcccactgg gggtttcaag ccccaggtag 2221 aagtgttccc tccctcatta ctgggaagca gcatccacac aggggctaac gcagcctggg 2281 accctgtgtg ccagcactta ctcttttgtg cagcacatgt gacaatgaag gacggatgta 2341 tcaccttgat ggttgtggtg ttggggtcct gatttcagca ttcatgagtc aggggaaggt 2401 ccctgctaag gacagacctt aggagggcag ttggtccagg acccacactt gctttcctcg 2461 tgtttcctga tcctgccttg ggtctgtagt catacttctg gaaattcctt ttgggtccaa 2521 gacgaggagg ttcctctaag atcttaaggc cctgcttcct cccagtcccc tcacaggaca 2581 ttttcttccc acaggtggaa aaggagggag ctactctcag gctgcgtgta agtggtgggg 2641 gtgggagtgt ggaggagctc acccacccca taattcctcc tgtcccacgt ctcctgcggg 2701 ctctgaccag gtcctgtttt tgttctactc cagccagcga cagtgcccag ggctctgatg 2761 tgtctctcac agcttgaaaa ggtgagattc ttggggtcta gagtgggtcg ggtggcgggt 2821 ctgggggtgg gtggggcaga ggggaaaggc ctgggtaatg gggattcttt gattgggatg 2881 tttcgcgtgt gtggtgggct gtttacagtg tcatcgctta ccatgactaa ccagaatttg 2941 ttcatgactg ttgttttctg tagcctgaga cagctgtctt gtgagggact gagatgcagg 3001 atttcttcac gcctcccctt tgtgacttca agagcctctg gcatctcttt ctgcaaaggc 3061 acctgaatgt gtctgcgtcc ctgttagcat aatgtgagga ggtggagaca cagcccaccc 3121 ttgtgtccac tgtgacccct gttcccatgc tgacctgtgt ttcctcccca gtcatctttc 3181 ctgttccaga gaggtggggc tggatgtctc catctctgtc tcaactttac gtgcactgag 3241 ctgcaacttc ttacttccct actgaaaata agaatctgaa tataaatttg ttttctcaaa 3301 tatttgctat gagaggttga tggattaatt aaataagtca attcctggaa tttgagagag 3361 caaataaaga cctgagaacc ttccagaatc tgcatgttcg ctgtgctgag tctgttgcag 3421 gtggggtgtg gagaaggctg tggggggccg agtgtggacg gggcctgtgc ccatttggtg 3481 ttgagtccat catgggcttt atgtggttag tcctcagctg // LOCUS HUMTGFB1B 5075 bp ss-mRNA PRI 07-JUL-1990 DEFINITION Human transforming growth factor-beta 1 binding protein mRNA, complete cds. ACCESSION M34057 KEYWORDS transforming growth factor-beta 1 binding protein. SOURCE Human fibroblast, cDNA to mRNA, clone BPA13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5075) AUTHORS Kanzaki,T., Olofsson,A., Moren,A., Wernstedt,C., Hellman,U., Miyazono,K., Claesson-Welsh,L. and Heldin,C.-H. TITLE TGF-beta-1 binding protein: A component of the large latent complex of TGF-beta-1 with multiple repeat sequences JOURNAL Cell 61, 1051-1061 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.-H.Heldin, 08-MAY-1990. FEATURES from to/span description pept 91 4275 transforming growth factor-beta 1 binding protein precursor sigp 91 150 transforming growth factor-beta 1 binding protein signal peptide matp 151 4272 transforming growth factor-beta 1 binding protein BASE COUNT 1353 a 1122 c 1252 g 1348 t ORIGIN 1 tgaattcggc acgaggtggg ctctctcttt tcccctcttg ctcctttctt ttcttttttt 61 ctgttttttt aaaccttcca aggcaagttc atggatacta agctgatgtg tttgttgttc 121 tttttctccc tgcctccgct cctagtgagt aaccacactg gccgcatcaa ggtggtcttt 181 actccgagca tctgtaaagt gacctgcacc aagggcagct gtcagaacag ctgtgagaag 241 gggaacacca ccactctcat tagtgagaat ggtcatgctg ccgacaccct gacggccacg 301 aacttccgag tggtaatttg ccatcttcca tgtatgaatg gtggccagtg cagttcaagg 361 gacaaatgtc agtgccctcc aaatttcaca ggaaaacttt gtcagatccc agtccatggt 421 gccagcgtgc ctaaacttta tcagcattcc cagcagccag gcaaggcgtt ggggacgcat 481 gtcatccatt caacacatac cttgcctctg accgtgacta gccagcaagg agtcaaagtg 541 aaatttcctc ctaacatagt caatatccat gtgaaacatc ctcctgaagc ttccgtccag 601 atacatcagg tttcaagaat tgatggccca acaggccaga agacaaaaga agctcaacca 661 ggccaatccc aagtctcgta ccaagggctt cctgtccaga agacccagac catacattcc 721 acatactccc accagcaggt cattcctcac gtctaccccg tggctgctaa gacacagctt 781 ggccggtgct tccaggaaac cattgggtca cagtgtggca aagcgctccc tggcctttca 841 aagcaagagg actgctgtgg aactgtgggt acctcctggg gctttaacaa atgccagaaa 901 tgccccaaga aaccatctta tcatggatac aaccaaatga tggaatgcct accgggttat 961 aagcgggtta acaacacctt ttgccaagat attaatgaat gtcagctaca aggtgtatgc 1021 cctaatggtg agtgtttgaa taccatgggc agctatcgat gtacctgcaa aataggattt 1081 gggccggatc ctaccttttc aagttgtgtt cctgatcccc ctgtgatctc ggaagagaaa 1141 gggccctgtt accgacttgt cagttctgga agacagtgta tgtaccctct gtctgttcac 1201 ctcaccaagc agctctgctg ttgtagtgtg ggcaaggctg ggccacactg tgagaaatgt 1261 ccccttccag gcacagctgc ttttaaggaa atctgtcctg gtggaatggg ttatacggtt 1321 tctggcgttc atagacgcag gccaatccat caccatgtag gtaaaggacc tgtatttgtc 1381 aagccaaaga acactcaacc tgttgctaaa agtactcatc ctccacctct cccagccaag 1441 gaagagccag tggaggccct gaccttctcc cgggaacacg gggccaggag tgcggagcca 1501 gaagtggcaa ctgcaccccc tgaaaaggaa ataccttcat tggatcaaga gaaaaccaaa 1561 cttgagcctg gtcaacccca gctgtctcca ggcatttccg ctattcatct gcatccacag 1621 tttccagtag tgattgaaaa aacatcacct cctgtgcctg ttgaagtagc tcctgaagct 1681 tctacgtcta gtgccagcca agtgattgct cctactcaag tgacagaaat caatgaatgt 1741 actgtgaacc ctgatatctg tggagcagga cactgcatta acctaccagt gagatatacc 1801 tgtatatgct acgagggcta caggttcagt gaacaacaga ggaaatgtgt ggatattgat 1861 gagtgtactc aggtccaaca cctctgctcc cagggccgct gtgaaaacac cgagggaagt 1921 ttcttgtgca tttgcccagc aggatttatg gccagtgagg agggtactaa ctgcatagat 1981 gttgacgaat gcctgaggcc ggacgtctgt ggggaggggc actgtgtcaa tactgtgggg 2041 gccttccggt gtgaatactg tgacagcggg taccgcatga ctcagagagg ccgttgtgag 2101 gatattgatg aatgtttgaa tccaagcact tgtccagatg agcagtgtgt gaattctcct 2161 ggatcttacc agtgcgttcc ctgcacagaa ggattccgag gctggaatgg acagtgcctt 2221 gatgtggacg agtgcctgga accaaacgtc tgcgcaaatg gtgattgttc caaccttgaa 2281 ggctcctaca tgtgttcatg ccacaaaggc tatacccgga ctccggacca caagcactgt 2341 agagatattg atgaatgtca gcaagggaat ctatgtgtaa acgggcagtg caaaaatacc 2401 gagggctcct tcaggtgcac ctgtggacag gggtaccagc tgtcggcagc taaagaccag 2461 tgtgaagaca ttgatgaatg ccagcaccgt catctctgtg ctcatgggca gtgcaggaac 2521 actgagggct cttttcaatg tgtgtgtgac cagggttaca gagcatctgg gcttggagac 2581 cactgtgaag atatcaatga atgcttggag gacaagagtg tttgccagag aggagactgc 2641 attaatactg cagggtccta tgattgtact tgtccggatg gatttcagct agatgacaat 2701 aaaacatgtc aagatattaa tgaatgtgaa catccagggc tctgtggtcc gcaaggggag 2761 tgcctaaaca cagagggttc tttccattgt gtctgccagc agggtttctc aatctctgca 2821 gatggccgta cgtgtgaaga tattgatgaa tgtgtaaaca acactgtttg tgacagtcac 2881 gggttttgtg acaatacagc tggctccttc cgctgcctct gttatcaggg ctttcaagcc 2941 ccacaggatg ggcaagggtg tgtggatgtg aatgaatgtg aactgctcag tggggtgtgt 3001 ggtgaagcct tctgtgaaaa cgtggaaggg tccttcctgt gcgtgtgtgc tgatgaaaac 3061 caagagtaca gccccatgac tgggcagtgc cgctcccgga cctccacaga tttagatgta 3121 gatgtagatc aacccaaaga agaaaagaaa gaatgctact ataatctcaa tgacgccagt 3181 ctctgtgata atgtgttggc ccccaatgtc acgaaacaag aatgctgctg tacatcaggc 3241 gcgggatggg gagataactg cgaaatcttc ccctgcccgg tcttgggaac tgctgagttc 3301 actgaaatgt gtcccaaagg gaaaggtttt gtgcctgctg gagaatcatc ttctgaagct 3361 ggtggtgaga actataaaga tgcagatgaa tgcctacttt ttggacaaga aatctgcaaa 3421 aatggtttct gtttgaacac tcggcctggg tatgaatgct actgtaagca agggacgtac 3481 tatgatcctg tgaaactgca gtgctttgat atggatgaat gtcaagaccc cagtagttgt 3541 attgatggcc agtgtgttaa tacagagggc tcttacaact gcttctgtac tcaccccatg 3601 gtcctggatg cgtcagaaaa aagatgtata cgaccggctg agtcaaacga acaaatagaa 3661 gaaactgatg tctaccaaga tttgtgctgg gaacatctga gtgatgaata cgtgtgtagc 3721 cggcctcttg tgggcaagca gacaacgtac actgagtgct gctgtctgta tggagaggcc 3781 tggggcatgc agtgtgccct ctgccccctg aaggattcag atgactatgc tcagctgtgt 3841 aacatccccg tgacgggacg ccggcagcca tatggacggg acgccttggt tgacttcagt 3901 gaacagtata ctccagaagc cgatccctac ttcatccaag accgttttct aaatagcttt 3961 gaggagttac aggctgagga atgcggcatc ctcaatggat gtgaaaatgg tcgctgtgtg 4021 agggtccagg aaggttacac ctgcgattgc ttggatgggt atcacttgga tactgccaag 4081 atgacctgtt tcgatgtaaa tgaatgcgat gagttgaaca accggatgtc tctctgcaag 4141 aatgccaagt gcattaacac cgatggttcc tacaagtgtt tgtgtctgcc aggctacgtg 4201 ccttctgaca agccaaacta ctgcactccg ttgaataccg ccttgaattt agagaaagac 4261 agtgacctgg agtgaaacag aatctacata acctaagccc atatactctg cactgtgtaa 4321 aggaaaaggg agaaatgtat tatacttgag acattgcacc taccccggaa ggctggaaat 4381 acagaaacag catggagttg caagtcctct gaagacaatg agaggattta ggatgagccc 4441 gataggtgtg gcagaccaaa tggacatttc tctaaaaaac cagtatatat agtctgttca 4501 tatgtaaaat tcaatggaag agaggtggaa cagtgctgtt attttaaaca gaaggttgta 4561 ttattatgtt gttttgtttt ttttactatt gcttgattaa atttggcatt taaatagtgg 4621 tggaaatatt tttatataat tttcattttt tggttgtgca gttccttggc tactgttttt 4681 cttttacttc agttttttaa aaatctcaaa tgaaaaagtc ttcgatacaa tattgttaag 4741 ctgtattata agtattgtta cacagggtta tgcaattccc ggcctggagc atttttgaaa 4801 ttcaaattgt ctgtcctgtg gagcaggcag tgattttgtt ccaaaacttt gtatacacat 4861 ttggagaaaa gtactttata ttttcagtgt tttgtctgat tttaatgtcc gttcttagcc 4921 aagctgctag caggtgttaa ttggatccct ttccttcact gaaatggaag agtttataag 4981 cttacgttag tattgtaata tgtaaagtaa gcccaacaaa aatttttaaa aatttgatga 5041 tccccaatat atctaccatt gtatgttaaa taaat // LOCUS MUSPLGN 2720 bp ss-mRNA ROD 07-JUL-1990 DEFINITION Mouse plasminogen mRNA, complete cds. ACCESSION J04766 KEYWORDS plasminogen. SOURCE Mouse cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2720) AUTHORS Degen,S.J. TITLE Characterization of the cDNA coding for mouse plasminogen and localization of the gene to mouse chromosome 17 JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.J.Degen, 08-MAY-1990. FEATURES from to/span description pept 25 2463 plasminogen precursor sigp 25 81 plasminogen signal peptide matp 82 2460 plasminogen signal 2699 2704 poly-A signal BASE COUNT 731 a 707 c 710 g 572 t ORIGIN Chromosome 17. 1 ccggtgctgt tggccagtcc caacatggac cataaggaag taatccttct gtttctcttg 61 cttctgaaac caggacaagg ggactcgctg gatggctaca taagcacaca aggggcttca 121 ctgttcagtc tcaccaagaa gcagctcgca gcaggaggtg tctcggactg tttggccaaa 181 tgtgaagggg aaacagactt tgtctgcagg tcattccagt accacagcaa agagcagcaa 241 tgcgtgatca tggcggagaa cagcaagact tcctccatca tccggatgag agacgtcatc 301 ttattcgaaa agagagtgta tctgtcagaa tgtaagaccg gcatcggcaa cggctacaga 361 ggaaccatgt ccaggacaaa gagtggtgtt gcctgtcaaa agtggggtgc cacgttcccc 421 cacgtaccca actactctcc cagtacacat cccaatgagg gactagaaga gaactactgt 481 aggaacccag acaatgatga acaagggcct tggtgctaca ctacagatcc ggacaagaga 541 tatgactact gcaacattcc tgaatgtgaa gaggaatgca tgtactgcag tggagaaaag 601 tatgagggca aaatctccaa gaccatgtct ggacttgact gccaggcctg ggattctcag 661 agcccacatg ctcatggata catccctgcc aaatttccaa gcaagaacct gaagatgaat 721 tattgccaca accctgacgg ggagccaagg ccctggtgct tcacaacaga ccccaccaaa 781 cgctgggaat actgtgacat cccccgctgc acaacacccc cgcccccacc cagcccaacc 841 taccaatgtc tgaaaggaag aggtgaaaat taccgaggga ccgtgtctgt caccgtgtct 901 gggaaaacct gtcagcgctg gagtgagcaa acccctcata ggcacaacag gacaccagaa 961 aatttcccct gcaaaaatct ggaagagaac tactgccgga acccagatgg agaaactgct 1021 ccctggtgct ataccactga cagccagctg aggtgggagt actgtgagat tccatcctgc 1081 gagtcctcag catcaccaga ccagtcagat tcctcagttc caccagagga gcaaacacct 1141 gtggtccagg aatgctacca gagcgatggg cagagctatc ggggtacatc gtccactacc 1201 atcacaggga agaagtgcca gtcctgggca gctatgtttc cacacaggca ttcgaagacc 1261 ccagagaact tcccagatgc tggcttggag atgaactact gcaggaaccc ggatggtgac 1321 aagggccctt ggtgctacac cactgacccg agcgtcaggt gggaatactg caacctgaag 1381 cggtgctcag agacaggagg gagtgttgtg gaattgccca cagtttccca ggaaccaagt 1441 gggccgagcg actctgagac agactgcatg tatgggaatg gcaaagacta tcggggcaaa 1501 acggccgtca ctgcagctgg caccccctgc cagggatggg ctgcccagga gccccacagg 1561 cacagcatct tcaccccaca gacaaaccca cgggcagatc tggaaaagaa ctactgccga 1621 aacccagatg gggatgtgaa tggtccttgg tgctatacaa caaaccccag aaaactttat 1681 gactattgtg acatccccct gtgtgcatca gcatcatcct ttgagtgcgg gaaacctcag 1741 gtggaaccga agaaatgccc tgggagggtg gtgggtggct gcgtggccaa ccctcactcc 1801 tggccctggc aaatcagcct tagaacaaga tttaccggac agcacttctg tggcggtact 1861 ttaatagccc cagagtgggt tctgactgct gcccactgtt tggagaaatc ttcaagacct 1921 gaattctaca aggttatcct gggtgcgcac gaagaatata tccgtgggtt ggatgttcag 1981 gaaatatcag tagccaaact gatcttggag cccaacaacc gtgacattgc cctgctgaaa 2041 ctaagccgcc cagccaccat cacggataaa gtcattccag cttgtctgcc atctccaaat 2101 tacatggttg ctgaccggac aatatgttac atcaccggct ggggagagac tcaagggact 2161 ttcggtgccg gtcgtctcaa ggaggctcag ctgcctgtga ttgagaacaa ggtgtgcaac 2221 cgcgtcgagt atctgaacaa cagagtcaaa tccacggagc tctgtgccgg gcaactggct 2281 ggtggcgtcg acagctgcca aggcgacagt ggaggacctc tggtttgctt cgagaaggac 2341 aagtacattt tacaaggagt cacttcttgg ggtcttggct gtgctcgccc caataagcct 2401 ggtgtctacg ttcgtgtctc acggtttgtt gattggattg aaagggagat gaggaataac 2461 tgactaggtg gaaggccgag caaaacctct gcttactaaa gcttactgaa tatggggaga 2521 gggcttaggg tgtttggaaa aactgacagt aatcaaactg ggacactaca ctgaaccaca 2581 gcttcctgtc gcccctcagc ccctcccctt tttttgtatt attgtgggta aaattttcct 2641 gtctgtggac ttctggattt tgtgacaata gaccatcact gctgtgacct ttgttgaaaa 2701 taaactcgat acttactttg // LOCUS HUMMHEA 4938 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human MHC class I lymphocyte antigen (HLA-E) (HLA-6.2) gene, complete cds. ACCESSION M21533 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; lymphocyte antigen; major histocompatibility complex. SOURCE Human lymphoblastoid (cell line B-LCL) DNA, clone HLA-6.2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4938) AUTHORS Koller,B.H., Geraghty,D.E., Shimizu,Y., DeMars,R. and Orr,H.T. TITLE A novel HLA class I gene expressed in resting T lymphocytes JOURNAL J. Immunol. 141, 897-904 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.T.Orr, 23-NOV-1988. FEATURES from to/span description pept 705 768 MHC HLA-E precursor, exon 1 /nomgen="HLA-E" /map="6p21.3" /hgml_locus_uid="LT0100B" 899 1168 MHC HLA-E precursor, exon 2 1411 1686 MHC HLA-E precursor, exon 3 2308 2583 MHC HLA-E precursor, exon 4 2708 2824 MHC HLA-E precursor, exon 5 3575 3607 MHC HLA-E precursor, exon 6 3712 3752 MHC HLA-E precursor, exon 7 sigp 704 768 MHC HLA-E signal peptide matp 899 1168 MHC HLA-E 1411 1686 MHC HLA-E 2308 2583 MHC HLA-E 2708 2825 MHC HLA-E 3576 3607 MHC HLA-E 3712 3749 MHC HLA-E pre-msg < 705 4479 MHC HLA-E mRNA and introns IVS 769 898 MHC HLA-E, intron A IVS 1169 1410 MHC HLA-E, intron B IVS 1687 2307 MHC HLA-E, intron C IVS 2584 2707 MHC HLA-E, intron D IVS 2825 3574 MHC HLA-E, intron E IVS 3608 3711 MHC HLA-E, intron F IVS 3755 3919 MHC HLA-E, intron G BASE COUNT 1065 a 1332 c 1380 g 1161 t ORIGIN Chromosome 6p21.3. 1 ggcatgatgg acgcgattag ccggtaacgg atacatgaag catgaggtct ttgaagtcat 61 tggcggctct ctggtgtacg agtctcagct ggttgactac aaggcactct gaagcagatc 121 ctgttttttg ttttgttttg ttttttgttc tgtttttccc cctagacatc tcactctgct 181 gcccaggctg gagtgcagtg gtgtgatctc ggctcactgc aaccatccat cctcctgggt 241 tcaagtagat tctccatgcc tcagcctccc aagtagccag gattacagcc atgcgccacc 301 acgccggcta attttttgga cttttagtag agacagggtt tctccatatt ggtcgggctg 361 gtctcgaact cccaacctca ggtgatcagc ccgccttggc ctcccaaagt gctgagatta 421 caggcgtgag ccaccgcgcc cagccaggac taatttctaa gagtgtgcag agataccgaa 481 acctaaaagt ttaagaactg ctgattgctg ggaaactctg cagtttcccg ttcctctcgt 541 aacctggtca tgtgtccttc ttcctggata ctcatgacgc agactcagtt ctcattccca 601 atgggtgtcg ggtttctaga gaagccaatc agcgtcgcca cgactcccga ctataaagtc 661 cccatccgga ctcaagaagt tctcaggact cagaggctgg gatcatggta gatggaaccc 721 tccttttact cctctcggag gccctggccc ttacccagac ctgggcgggt gagtgcgggg 781 tcgggatgga aacggcctct accgggagta gagaggggcc ggcccggcgg gggcgaagga 841 ctcggggagc cgcgccggga ggagggtcgg gccgatctca gcccctcctc gcccccaggc 901 tcccactcct tgaagtattt ccacacttcc gtgtcccggc ccggccgcgg ggagccccgc 961 ttcatctctg tgggctacgt ggacgacacc cagttcgtgc gcttcgacaa cgacgccgcg 1021 agtccgagga tggtgccgcg ggcgccgtgg atggagcagg aggggtcaga gtattgggac 1081 cgggagacac ggagcgccag ggacaccgca cagattttcc gagtgaacct gcggacgctg 1141 cggcgctact acaatcagag cgaggccggt gagtgacccc ggccaggagc aggtcacgac 1201 ccctccccat cccccacgga cggcgcgggt cccctcgaat cttcgggtcc cagattcacc 1261 ccaaggctgc ggaacccgcc cagaccctag accggggaga gtctcaggcg cctttacccg 1321 gttctttttc agtttaggcc aaaatgccca cagggtggtg gcgacggggg cggggcttgg 1381 tgggcgggac tgactaaggg gcggggccag ggtctcacac cctgcagtgg atgcatggct 1441 gcgagctggg gcccgacagg cgcttcctcc gcgggtatga acagttcgcc tacgacggca 1501 aggattatct caccctgaat gaggacctgc gctcctggac cgcggtggac acggcggctc 1561 agatctccga gcaaaagtca aatgatgcct ctgaggcgga gcaccagaga gcctacctgg 1621 aagacacatg cgtggagtgg ctccacaaat acctggagaa ggggaaggag acgctgcttc 1681 acctgggtaa gagggtccac agggctactc tcccatctcc ttcttgggct aggactgtgc 1741 ccacagctga cagacctcaa acagtagaag aaacagggat ggaggccaga ataccactcc 1801 tcccttggat caggagaggg agctgtcacc tgaggtacag gagatcctat accacagagt 1861 gactctctta aagggccaga cctctctcag gggcaattaa ggaatctagt ctcgctggag 1921 attccatcct tcagatgaac tgatgagcag ttctctttga ctcccagtat taggaatcac 1981 gggggagttt ctctcgtgcc tgattctcag ccccacacca agagtttttg gaggtctgac 2041 tccagctttt ctcagtcact cagcatccac acaggccagg accagaaatc ccttttcacc 2101 ttctaccctg ggctagctca tcccgattct agaactttcc aaggaataag aggctatccc 2161 agatccctaa gtccaggctg gtgtcaaggt tttgtcctct tctcctacta taattgtcct 2221 cttccttctc aggatggtca catgggtgct gctggagtgt cccatgagag atacaaagtg 2281 cctgaatttt ctgactcttc ccctcagagc ccccaaagac acacgtgact caccacccca 2341 tctctgacca tgaggccacc ctgaggtgct gggccctggg cttctaccct gcggagatca 2401 cactgacctg gcagcaggat ggggagggcc atacccagga cacggagctc gtggagacca 2461 ggcctgcagg ggatggaacc ttccagaagt gggcagctgt ggtggtgcct tctggagagg 2521 agcagagata cacgtgccat gtgcagcatg aggggctacc cgagcccgtc accctgagat 2581 ggagtaagga gggggatggg aggtcatgtc tcttctcagg gaaagcggga gcccttctgg 2641 agcccttccg cagggtcagg gctgaggcct gggggtcagg gccccttacg ttcccctctt 2701 ttcccagagc cggcttccca gcccaccatc cccatcgtgg gcatcattgc tggcctggtt 2761 ctccttggat ctgtggtctc tggagctgtg gttgctgctg tgatatggag gaagaagagc 2821 tcaggtgggg aagggagaag ggtggggtct gagttttctt gtcccactgg gtgtttcaag 2881 ccctaggtaa aagtgtgtcc tgcctcgtta ctgggaagca ccatccacac acacgagcct 2941 acccagcctg gggccctgtg tgccagcacc tactcttttt ttttgagacg gagtcttggc 3001 tctgtcaccc aggctggagt gcaatggcgt ggtttcagct cactgcaacc tccgcctccc 3061 aggttcaagc aattctcctg cctcagcctc cctagtagct gggactacac atgcgtgcca 3121 ccacacctgg ctaatttttt tttttgtatt tttagtggag atggggtttc actatgttgg 3181 ccaggctggt ctcgaactcc tgactttgtg atctgcctgc ctcggcctcc caaagtgctg 3241 ggattacagt cgtgagccac cgcacccagc cgcacctact cttttgtaaa gcacctgtga 3301 caatgaagga cagatttatc accttgacga ttgtggtgat ggggacctga tcccagcagt 3361 cacaggtcac aggggaaggt ccctgctgaa gacagacctc agaagggcag ttgatccagg 3421 acccacacct gctttcttca cgtttcctga tcctgccctg ggtctgcagt cacagttcag 3481 gaaacttctc tgggatccaa aactaggagg ttcctctagg accttatggc cctgcctcct 3541 ccctggcccc tcacaggaca ttttcttcca acaggtggaa aaggagggag ctactctaag 3601 gctgagtgta agtgcggggc gggagcgtgg aggagctcgc ccaccctata attcctcctg 3661 caccacatct cctgtgggct ctgaccaggt cttgtttttg ttctacccca gggagcgaca 3721 gtgcccaggg gtctgagtct cacagcttgt aaaggtgaga ttctgggggt ctgaagtggg 3781 tggagggtgg ggcagagggg acaggactgg gttgtgggga ttttttgatt cagaattttt 3841 gagtgtgtgg tgggctgttc agagtgtcat cacttaccgt gactgacctg aatttgttca 3901 tgactatttt cttctgtagc ctgagacagc tgccttgtgt gcgactgaga tgcacagctg 3961 ccttgtgtgc gactgagatg caggatttcc tcacgcctcc cctatgtgtc ttaggggact 4021 ctggcttctc tttttgcaag ggcctctgaa tctgtctgtg tccctgttag cacaatgtga 4081 ggaggtagag aaacagtcca cctctgtgtc taccatgacc cccttcctca cactgacctg 4141 tgttccttcc ctgttctctt ttctattaaa aataagaacc tgggcagagt gcggcagctc 4201 atgcctgtaa tcccagcact tagggaggcc gaggagggca gatcacgagg tcaggagatc 4261 gaaaccatcc tggctaacac ggtgaaaccc cgtctctact aaaaaataca aaaaattagc 4321 tgggcgcaga ggcacgggcc tgtagtccca gctactcagg aggcggaggc aggagaatgg 4381 cgtcaacccg ggaggcggag gttgcagtga gccaggattg tgcgactgca ctccagcctg 4441 ggtgacaggg tgaaacgcca tctcaaaaaa taaaaattaa aaaataaaaa aagaacctgg 4501 atctcaattt aatttttcat attcttgcaa tgaaatggac ttgaggaagc taagatcata 4561 gctagaaata cagataattc cacagcacat ctctagcaaa tttagcctat tcctattctc 4621 tagcctattc cttaccacct gtaatcttga ccatatacct tggagttgaa tattgttttc 4681 atactgctgt ggtttgaatg ttccctccaa cactcatgtt gagacttaat ccctaatgtg 4741 gcaatactga aaggtggggc ctttgagatg tgattggatc gtaaggctgt gccttcattc 4801 atgggttaat ggattaatgg gttatcacag gaatgggact ggtggcttta taagaagagg 4861 aaaagagaac tgagctagca tgcccagccc acagagagcc tccactagag tgatgctaag 4921 tggaaatgtg aggtgcag // LOCUS HUMGSTPI1 1724 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human glutathione S-transferase pi (GST-pi) gene, exons 1-5. ACCESSION X08094 M33082 KEYWORDS glutathione S-transferase pi. SEGMENT 1 of 3 SOURCE Human, cell line HPB-ALL, lymphoid DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1724) AUTHORS Cowell,I.G. JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 1724) AUTHORS Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B. TITLE The structure of the human glutathione S-transferase pi gene JOURNAL Biochem. J. 255, 79-83 (1988) STANDARD simple staff_entry COMMENT See X08094 - X08096 for GST pi gene sequence; see X06547 for GST mRNA sequence. EMBL features not translated to GenBank features: key from to description SITE 116 123 TRE-box SITE 129 134 Sp1 binding site SITE 138 143 Sp1 binding site PRM 154 157 TATA-box MSG 182 214 Exon 1 MSG 499 534 Exon 2 MSG 819 925 Exon 3 MSG 1042 1129 Exon 4 MSG 1488 1591 Exon 5 [1] Author address Cowell I.G., University College and Middlesex, School of Medicine, Windeyer Building, Cleveland Street, London, W1P 6DB, United Kingdom. Submitted (13-JUL-1988) on tape to the EMBL data library by: FEATURES from to/span description pept 214 214 glutathione S-transferase pi, exon 1 /hgml_locus_uid="LQ0085R" /map="11q13-qter" /nomgen="GST3" 499 534 glutathione S-transferase pi, exon 2 819 925 glutathione S-transferase pi, exon 3 1042 1129 glutathione S-transferase pi, exon 4 1488 + 1591 glutathione S-transferase pi, exon 5 pre-msg 182 > 1724 GST3 mRNA and introns IVS 215 498 GST3 intron A IVS 535 818 GST3 intron B IVS 926 1041 GST3 intron C IVS 1130 1487 GST3 intron D IVS 1592 > 1724 GST3 intron E revision 1532 1541 atgaatgac in [1]; gtgaatgacg in [2] revision 1707 1707 n in [1]; g in [2] BASE COUNT 272 a 598 c 541 g 313 t ORIGIN 1 tcgctgcgac tccgggactc cagggcgccc ctctcggccg acgcccgggg tgcagcggcg 61 ccgggctggg gccggcggga gtccgcggga ccctccagaa gagcggccgg cgccgtgact 121 cagcactggg gcggagcggg gcgggaccac ccttataagg ctcggaggcc gcgaggcctt 181 cgctggagtt tcgccgccgc agtcttcgcc accagtgagt acgcgcggcc cgcgtccccg 241 gggatggggc tcagagctcc cagcatgggg ccaacccgca gcatcaggcc cgggctcccg 301 gcaggctcct cgcccacctc gagacccggg acggggccta ggggacccag gacgtcccca 361 gtggccgtta gcggctttca gggggcccgg agcgcctcgg ggagggatgg gaccccgggg 421 gcgggagggg ggggcagctc cactcaccgc gccttggcat cctccccggg ctccacaaat 481 tttctttgtt cgctgcagtg ccgccctaca ccgtggtcta tttcccagtt cgaggtagga 541 gcatgtgtct ggcaggaagg gaggcaggcc tggctgcagc ccacagcccc tcgcccaccc 601 ggagagatcc gaaccccctt atccctccgt cgtgtggctt ttaccccggg cctccttcct 661 gttccccgcc tctcccgcca tgcctgctcc ccgccccagt gttgtgtaaa atcttggagg 721 aacctgtttc ctgttccctc ctgcactcct gaccctcccg ggttgctgcg aggcggagtg 781 gcccggtccc cacatctcgt acttctccct ccccgcaggc cgctgcgcgg ccctgcgcat 841 gctgctggca gatcagggcc agagctggaa ggaggaggtg gtgaccgtgg agacgtggca 901 ggagggctca ctcaaagcct cctgcgtaag tgaccatgcc cgggcaaggg gagggggtgc 961 tgggccttag ggggctgtgc actaggatcg ggggacgccc aagctcagtg ccccttccct 1021 gagccatgcc tcccccaaca gctatacggg cagctcccca agttccagga cggagacctc 1081 accctgtacc agtccaatac catcctgcgt cacctgggcc gcacccttgg tgagtcttaa 1141 cctcaagtcc aggagtcagg caagcctctg cccccggagc ccttttgttt aaatcaggcc 1201 ccgcagccct ctggagggga ggaaactgag acccactgag gttacgtagt ttgccaagtc 1261 aagcctggtt gtttgcaatc cttgcctgtg ccaggctggc tcccaggtgt caggtgagct 1321 ctgagcacct ggtgtgtggc agtctctcat ccttccacgc acatcctctt cccctcctcc 1381 caggctgggg ctcatcagac agaaccccct tggcctggcc catccccaat gactgtgtgt 1441 gatcagcgcc cagtcaaccc ggcctgctcc cctccaccca accccagggc tctatgggaa 1501 ggaccagcag gaggcagccc tggtggacat ggtgaatgac ggcgtggagg acctccgctg 1561 caaatacatc tccctcatct acaccaacta tgtgtgagca tctgcaccag gttgggcact 1621 gggggctgaa caaagaaagg ggcttcttgt gccctcaccc cccttacccc tcaggtggct 1681 tgggctgacc ccttcttggg tcagggggca ggggctgggt cagg // LOCUS HUMGSTPI2 591 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human glutathione S-transferase pi (GST-pi) gene, exon 6. ACCESSION X08095 M33083 KEYWORDS glutathione S-transferase pi. SEGMENT 2 of 3 SOURCE Human, cell line HPB-ALL, lymphoid DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 591) AUTHORS Cowell,I.G. JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 591) AUTHORS Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B. TITLE The structure of the human glutathione S-transferase pi gene JOURNAL Biochem. J. 255, 79-83 (1988) STANDARD simple staff_entry COMMENT See X08094 - X08096 for GST pi gene sequence; see X06547 for GST mRNA sequence. EMBL features not translated to GenBank features: key from to description TRANSCR <1 >591 primary transcript MSG 470 577 Exon 6 [1] Author address Cowell I.G., University College and Middlesex, School of Medicine, Windeyer Building, Cleveland Street, London, W1P 6DB, United Kingdom. Submitted (13-JUL-1988) on tape to the EMBL data library by: FEATURES from to/span description pept + 470 + 577 glutathione S-transferase pi, exon 6 /hgml_locus_uid="LQ0085R" /map="11q13-qter" /nomgen="GST3" IVS < 1 469 GST3 intron E IVS 578 > 591 GST3 intron F BASE COUNT 131 a 111 c 225 g 124 t ORIGIN About 300 bp after segment 1. 1 tgggaccagt cagcagaggc agcgtgtgtg cgcgtgcgtg tgcgtgtgtg tgcgtgtgtg 61 tgtgtacgct tgcatttgtg tcgggtgggt aaggagatag agatgggcgg gcagtaggcc 121 caggtcccga aggccttgaa cccactggtt tggagtctcc taagggcaat gggggccatt 181 gagaagtctg aacagggctg tgtctgaatg tgaggtctag aaggatcctc cagagaagcc 241 agctctaaag cttttgcaat catctggtga gagaacccag caaggatgga caggcagaat 301 ggaatagaga tgagttggca gctgaagtgg acaggatttg gtactagcct ggttgtgggg 361 agcaagcaga ggagaatctg ggactctggt ggtctggcct ggggcagacg ggggtgtctc 421 aggggctggg agggatgaga gtaggatgat acatggtggt gtctggcagg aggcgggcaa 481 ggatgactat gtgaaggcac tgcccgggca actgaagcct tttgagaccc tgctgtccca 541 gaaccaggga ggcaagacct tcattgtggg agaccaggtg agcatctggc c // LOCUS HUMGSTPI3 499 bp ds-DNA PRI 07-JUL-1990 DEFINITION Human glutathione S-transferase pi (GST-pi) gene, exon 7. ACCESSION X08096 M33084 KEYWORDS glutathione S-transferase pi. SEGMENT 3 of 3 SOURCE Human, cell line HPB-ALL, lymphoid DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 499) AUTHORS Cowell,I.G. JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 499) AUTHORS Cowell,I.G., Dixon,K.H., Pemble,S.E., Ketterer,B. and Taylor,J.B. TITLE The structure of the human glutathione S-transferase pi gene JOURNAL Biochem. J. 255, 79-83 (1988) STANDARD simple staff_entry COMMENT See X08094 - X08096 for GST pi gene sequence; see X06547 for GST mRNA sequence. EMBL features not translated to GenBank features: key from to description TRANSCR <1 ? primary transcript MSG 17 ? Exon 7 [1] Author address Cowell I.G., University College and Middlesex, School of Medicine, Windeyer Building, Cleveland Street, London, W1P 6DB, United Kingdom. Submitted (13-JUL-1988) on tape to the EMBL data library by: FEATURES from to/span description pept + 17 205 glutathione S-transferase pi exon 7 /hgml_locus_uid="LQ0085R" /map="11q13-qter" /nomgen="GST3" IVS < 1 16 GST3 intron F signal 260 265 poly-A signal revision 127 128 tc in [1]; cg in [2] BASE COUNT 114 a 140 c 141 g 104 t ORIGIN About 200 bp after segment 2. 1 ctggcctgcc ctgcagatct ccttcgctga ctacaacctg ctggacttgc tgctgatcca 61 tgaggtccta gcccctggct gcctggatgc gttccccctg ctctcagcat atgtggggcg 121 cctcagcgcc cggcccaagc tcaaggcctt cctggcctcc cctgagtacg tgaacctccc 181 catcaatggc aacgggaaac agtgagggtt ggggggactc tgagcgggag gcagagtttg 241 ccttcctttc tccaggacca ataaatttct aagagaggct actatgagca ctgtgtttcc 301 tgggacgggc ttagggcgtt ctcagcctcg aggtcggtgg gagggcagag cagaggacta 361 gaaaacagct cctccagcca ccgtcagtgg cttctggagc cctcagcctg gctgtgttta 421 ctcaaccatc acaaactaga agaggaagaa aaaaaaagag agagagtaca agagaaataa 481 caatggtcct gtcctggag //
GenBank-Updates@genbank.bio.net (07/10/90)
LOCUS HUMCDR34 2412 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene, complete cds. ACCESSION M31423 KEYWORDS cerebellar-degeneration-related antigen. SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2412) AUTHORS Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A., Chaganti,R.S.K., Posner,J.B. and Old,L.J. TITLE Cerebellar degeneration-related antigen: A highly conserved neuroectodermal marker mapped to chromosomes X in human and mouse JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.-T.Chen, 17-JAN-1990 FEATURES from to/span description pept 503 1174 cerebellar-degeneration-related antigen (CDR34) BASE COUNT 743 a 334 c 669 g 666 t ORIGIN 1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct 61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat 121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg 181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga 241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg 301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag 361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg 421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga 481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt 541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt 601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga 661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat 721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt 781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga 841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct 901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt 961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga 1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt 1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat 1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg 1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa 1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac 1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa 1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt 1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact 1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat 1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct 1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga 1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct 1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac 1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg 1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata 1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa 1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc 2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt 2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta 2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga 2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta 2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat 2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt 2401 gagatgctgc ag // LOCUS RATHGF 2485 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat hepatocyte growth factor mRNA, complete cds. ACCESSION M32987 KEYWORDS hepatocyte growth factor. SOURCE Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2485) AUTHORS Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M., Shimizu,S. and Nakamura,T. TITLE Deduced primary structure of rat hepatocyte growth factor and expression of the mRNA in rat tissues JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Nakamura, 26-MAR-1990. FEATURES from to/span description pept 143 2329 hepatocyte growth factor BASE COUNT 750 a 533 c 577 g 625 t ORIGIN 1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc 61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga 121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct 181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca 241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa 301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa 361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc 421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt 481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa 541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg 601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct 661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag 721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat 781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg 841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga 901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta 961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc 1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg 1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca 1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa 1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat 1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta 1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg 1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc 1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg 1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga 1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca 1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa 1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc 1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat 1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct 1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct 1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac 1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg 2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt 2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga 2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt 2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc 2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa 2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa 2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca 2461 ttaatatatg tggcgttttc tgttg // LOCUS HUMINSGS 351 bp ds-DNA SYN 10-JUL-1990 DEFINITION Human (synthetic) insulin gene, complete cds. ACCESSION J02547 M25881 KEYWORDS artificial gene; insulin. SOURCE Synthetic human DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 79 to 351) AUTHORS Brousseau,R., Scarpulla,R., Sung,W., Hsiung,H.M., Narang,S.A. and Wu,R. TITLE Synthesis of a human insulin gene: V. Enzymatic assembly, cloning and characterization of the human proinsulin DNA JOURNAL Gene 17, 279-289 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 351) AUTHORS Georges,F., Brousseau,R., Michniewicz,J., Prefontaine,G., Stawinski,J., Sung,W., Wu,R. and Narang,S.A. TITLE Synthesis of a human insulin gene: VII. Synthesis of preproinsulin-like human DNA, its cloning and expression in M13 bacteriophage JOURNAL Gene 27, 201-211 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 351) AUTHORS Narang,S.A., Brousseau,R., Georges,F., Michniewicz,J., Prefontaine,G., Stawinski,J. and Sung,W. TITLE The human preproinsulin gene: synthesis, cloning, gene modification, and expression studies JOURNAL Can. J. Biochem. 62, 209-216 (1984) STANDARD full staff_review COMMENT In places where the human insulin amino acid sequence is identical to the rat insulin amino acid sequence, the synthetic sequence follows the published nucleotide sequence for rat (see separate entry). FEATURES from to/span description pept 6 350 synthetic preproinsulin sigp 6 77 synthetic insulin signal peptide matp 90 179 synthetic insulin B-chain matp 186 278 synthetic insulin C-chain matp 285 347 synthetic insulin A-chain BASE COUNT 65 a 93 c 100 g 93 t ORIGIN 78 bp upstream of EcoRI site. 1 aattcatggg cctatggatc cgtctactgc ctctgatcgc gctgctgatc ctctggggac 61 cggatccagc tgcggccgaa ttccggatgt ttgtcaatca gcacctttgt ggttctcacc 121 tggtggaggc tctgtacctg gtgtgtgggg aacgtggttt cttctacaca cccaagaccc 181 gtcgtgaagc tgaagacctt caagtgggtc aagttgaact tggtgggggt cctggtgcgg 241 gttctcttca acctttggct ctcgagggat cacttcaaaa gcgtggcatt gtggagcagt 301 gctgcaccag catctgctcc ctctaccaac tggagaacta ctgcaactga g // LOCUS TRFRRECF 212 bp ss-rRNA RNA 10-JUL-1990 DEFINITION Trypanosomatid (C.fasciculata) small rRNA e from the large ribosomal subunit. ACCESSION K02691 M25882 KEYWORDS ribosomal RNA. SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 212) AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W. TITLE Primary structures of four novel small ribosomal RNAs from Crithidia fasciculata JOURNAL Can. J. Biochem. 61, 38-45 (1983) STANDARD full staff_review COMMENT The large subunit of the ribosome of C.fasciculata contains six small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i) are found in ribosomes of other organisms. rRNAs e,f,g, and j were analyzed by [1]. In rRNA e the number of "g" residues at positions 91-93 and 123-124 were ambiguous. At positions 77-81 three "c"s and two "t"s were found, but the order was unclear. Positions 116 and 119 gave strong "t" bands but also consistently gave weak bands in the "u-2" track. [1] is not sure of the reason: sequencing artifact or an indication of cistron heterogeneity. No evidence of this heterogeneity was found in chemical gels. FEATURES from to/span description rRNA 1 212 ribosomal RNA e modified 125 125 p (putative) BASE COUNT 46 a 53 c 51 g 62 t ORIGIN 5' end of mature rRNA e. 1 tagtggaaat gcgaaacact tgccaggtga caaatcaatc ctcccacggt gagctttctt 61 ttcaccataa tccacatctc cggctttgct gggcttgggc ctttttactt ctcgcgttgt 121 tcggtgcggg ggcccaagat tgaaaaatgc agctctccct acgtactgtc attgttgtga 181 gttctgcgca ttaaagcaaa aacctggggt gt // LOCUS TRFRRFCF 183 bp ss-rRNA RNA 10-JUL-1990 DEFINITION Trypanosomatid (C.fasciculata) small rRNA f from the large ribosomal subunit. ACCESSION K02692 M25883 KEYWORDS ribosomal RNA. SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 183) AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W. TITLE Primary structures of four novel small ribosomal RNAs from Crithidia fasciculata JOURNAL Can. J. Biochem. 61, 38-45 (1983) STANDARD full staff_review COMMENT The large subunit of the ribosome of C.fasciculata contains six small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i) are found in ribosomes of other organisms. rRNAs e,f,g, and j are reported by [1]. FEATURES from to/span description rRNA 1 183 ribosomal RNA f BASE COUNT 41 a 49 c 57 g 36 t ORIGIN 5' end of mature rRNA f. 1 gtgagattgt gaagggatct cgcaggcatc gtgagggaag tatggggtag tacgagagga 61 actcccatgc cgtgcctcta gtttctgggg tttgtcgaac ggcaagtgcc ccgaagccat 121 cgcacggtgg ttctcggctg aacgcctcta agccagaagc caatcccaag accagatgcc 181 ccc // LOCUS TRFRRGCF 136 bp ss-rRNA RNA 10-JUL-1990 DEFINITION Trypanosomatid (C.fasciculata) small rRNA g from the large ribosomal subunit. ACCESSION K02693 M25884 KEYWORDS ribosomal RNA. SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 136) AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W. TITLE Primary structures of four novel small ribosomal RNAs from Crithidia fasciculata JOURNAL Can. J. Biochem. 61, 38-45 (1983) STANDARD full staff_review COMMENT The large subunit of the ribosome of C.fasciculata contains six small rRNAs (designated e,f,g,h,i,j), when normally only two (h,i) are found in the ribosomes of other organisms. rRNAs e,f,g, and j are reported by [1]. There was some question whether rRNA g contained 135 or 136 bp, starting with base 1 or 2 in the sequence presented below. FEATURES from to/span description rRNA 1 136 ribosomal RNA g BASE COUNT 31 a 37 c 40 g 28 t ORIGIN 5' end of mature rRNA g. 1 acaacgtccc tctccaaacg agagaatatg catgggctgg catgagcggc atgcttcact 61 ccggtggggc tcgaggggca cttacgtccc gaggcgctga accttgaggc ctgaaatttc 121 atgctctggg actaaa // LOCUS TRFRRJCF 73 bp ss-rRNA RNA 10-JUL-1990 DEFINITION Trypanosomatid (C.fasciculata) small rRNA j from the large ribosomal subunit. ACCESSION K02694 M25885 KEYWORDS ribosomal RNA. SOURCE Trypanosomatid (C.fasciculata) ribosomal RNA. ORGANISM Crithidia fasciculata Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 73) AUTHORS Schnare,M.N., Spencer,D.F. and Gray,M.W. TITLE Primary structures of four novel small ribosomal RNAs from Crithidia fasciculata JOURNAL Can. J. Biochem. 61, 38-45 (1983) STANDARD full staff_review COMMENT The large subunit of the ribosome of C.fasciculata contains six small rRNAs (designated e,f,g,h,i,j) when normally only two (h,i) are found in the ribosomes of other organisms. rRNAs e,f,g, and j are reported by [1]. There was some question whether rRNA j contained 72 or 73 bp, starting with base 1 or 2 in the sequence presented below. FEATURES from to/span description rRNA 1 73 ribosomal RNA j BASE COUNT 17 a 23 c 14 g 19 t ORIGIN 5' end of mature rRNA j. 1 tcatcgaatc gccacctaca cgactggagc ttgctccctc gtcggcctct agtatattca 61 tgatcacaag gta // LOCUS YSCRGEA 1798 bp ds-DNA PLN 10-JUL-1990 DEFINITION Yeast (S.cerevisiae) 18S ribosomal RNA gene. ACCESSION J01353 M27607 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE Yeast (S.cerevisiae + D4) DNA, clones pY1rA3 and prYC. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1798) AUTHORS Rubtsov,P.M., Musakhanov,M.M., Zakharyev,V.M., Krayev,A.S., Skryabin,K.G. and Bayev,A.A. TITLE The structure of the yeast ribosomal RNA genes. I. The complete nucleotide sequence of the 18S ribosomal RNA gene from Saccharomyces cerevisiae JOURNAL Nucleic Acids Res. 8, 5779-5794 (1980) STANDARD full staff_review REFERENCE 2 (bases 1 to 1798; revises [1]) AUTHORS Mankin,A.S., Skryabin,K.G. and Rubtsov,P.M. TITLE Identification of ten additional nucleotides in the primary structure of yeast 18S rRNA JOURNAL Gene 44, 143-143 (1986) STANDARD full staff_review FEATURES from to/span description rRNA 1 1798 18S ribosomal RNA revision 943 943 a in [2]; g in [1] revision 962 962 a in [2]; g in [1] revision 982 983 ag in [2]; ga in [1] revision 988 999 tcgaagatgatc in [2]; tc in [1] revision 1002 1002 g in [2]; a in [1] revision 1122 1123 ag in [2]; aag in [1] revision 1742 1742 a in [2]; g in [1] BASE COUNT 480 a 348 c 459 g 511 t ORIGIN 9 bp upstream of Sau3A site. 1 tatctggttg atcctgccag tagtcatatg cttgtctcaa agattaagcc atgcatgtct 61 aagtataagc aatttataca gtgaaactgc gaatggctca ttaaatcagt tatcgtttat 121 ttgatagttc ctttactaca tggtataacc gtggtaattc tagagctaat acatgcttaa 181 aatctcgacc ctttggaaga gatgtattta ttagataaaa aatcaatgtc ttcggactct 241 ttgatgattc ataataactt ttcgaatcgc atggccttgt gctggcgatg gttcattcaa 301 atttctgccc tatcaacttt cgatggtagg atagtggcct accatggttt caacgggtaa 361 cggggaataa gggttcgatt ccggagaggg agcctgagaa acggctacca catccaagga 421 aggcagcagg cgcgcaaatt acccaatcct aattcaggga ggtagtgaca ataaataacg 481 atacagggcc cattcgggtc ttgtaattgg aatgagtaca atgtaaatac cttaacgagg 541 aacaattgga gggcaagtct ggtgccagca gccgcggtaa ttccagctcc aatagcgtat 601 attaaagttg ttgcagttaa aaagctcgta gttgaacttt gggcccggtt ggccggtccg 661 attttttcgt gtactggatt tccaacgggg cctttccttc tggctaacct tgagtccttg 721 tggctcttgg cgaaccagga cttttacttt gaaaaaatta gagtgttcaa agcaggcgta 781 ttgctcgaat atattagcat ggaataatag aataggacgt ttggttctat tttgttggtt 841 tctaggacca tcgtaatgat taatagggac ggtcgggggc atcggtattc aattgtcgag 901 gtgaaattct tggatttatt gaagactaac tactgcgaaa gcatttgcca aggacgtttt 961 cattaatcaa gaacgaaagt taggggatcg aagatgatct ggtaccgtcg tagtcttaac 1021 cataaactat gccgactaga tcgggtggtg tttttttaat gacccactcg gtaccttacg 1081 agaaatcaaa gtctttgggt tctgggggga gtatggtcgc aaggctgaaa cttaaaggaa 1141 ttgacggaag ggcaccacta ggagtggagc ctgcggctaa tttgactcaa cacggggaaa 1201 ctcaccaggt ccagacacaa taaggattga cagattgaga gctctttctt gattttgtgg 1261 gtggtggtgc atggccgttt ctcagttggt ggagtgattt gtctgcttaa ttgcgataac 1321 gaacgagacc ttaacctact aaatagtggt gctagcattt gctggttatc cacttcttag 1381 agggactatc ggtttcaagc cgatggaagt ttgaggcaat aacaggtctg tgatgccctt 1441 agaacgttct gggccgcacg cgcgctacac tgacggagcc agcgagtcta accttggccg 1501 agaggtcttg gtaatcttgt gaaactccgt cgtgctgggg atagagcatt gtaattattg 1561 ctcttcaacg aggaattcct agtaagcgca agtcatcagc ttgcgttgat tacgtccctg 1621 ccctttgtac acaccgcccg tcgctagtac cgattgaatg gcttagtgag gcctcaggat 1681 ctgcttagag aagggggcaa ctccatctca gagcggagaa tttggacaaa cttggtcatt 1741 tagaggaact aaaagtcgta acaaggtttc cgtaggtgaa cctgcggaag gatcatta // LOCUS DROSHA1A 1473 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.melanogaster Sha12 protein mRNA, complete cds. ACCESSION M32660 KEYWORDS . SOURCE D.melanogaster, cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1473) AUTHORS Butler,A., Wei,A. and Salkoff,L. TITLE Shal, Shab, and Shaw: Three genes encoding potassium channels in Drosophila JOURNAL Nucleic Acids Res. 18, 2173-2174 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1473) AUTHORS Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L. TITLE Diverse K+ currents expressed by a Drosophila extended gene family which is conserved in mouse JOURNAL Science 248, 599-603 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Salkoff, 07-MAR-1990. FEATURES from to/span description pept 1 1473 Sha12 protein BASE COUNT 302 a 434 c 432 g 305 t ORIGIN Chromosome 3 left arm at locus 76B. 1 atggcctcgg tcgccgcttg gctgcccttc gcccgggcgg cggccatcgg gtgggtgccg 61 atagccaccc acccactgcc accgcccccg atgcccaagg atcgccgcaa aacggacgac 121 gagaagctcc tgatcaacgt ctccgggcgg cgcttcgaga cgtggcggaa tactttggag 181 aagtatccgg acaccctttt aggttccaat gaaagggagt tcttctacga cgaggactgc 241 aaagaatact tcttcgatcg ggacccggac atcttccggc acatactgaa ctactaccgg 301 acgggcaagc tgcactaccc gaagcacgaa tgcctcacca gctacgacga ggagctggcc 361 ttctttggaa taatgccgga tgtcattggc gattgctgct acgaggacta ccgggaccgg 421 aagcgggaga acgcggagcg gctgatggac gacaagctgt cggagaacgg ggatcagaat 481 ctgcagcagc tgaccaacat gcgccagaag atgtggcggg ccttcgagaa tccgcacacg 541 tcgacgagcg ccctggtgtt ctactatgtt acgggtttct tcatcgccgt ctccgtgatg 601 gccaacgtgg tggagacggt gccgtgtggc caccggccgg gcagagcggg aactctgccc 661 tgcggcgagc gctacaagat cgtcttcttc tgcctggata ccgcctgcgt gatgatcttt 721 acggcggagt acctacttcg actcttcgcc gcccccgatc gctgcaagtt cgtgcgctcg 781 gtgatgagca ttattgatgt ggtggccatt atgccgtact acattggcct cgggatcacc 841 gacaacgacg acgtgagcgg tgctttcgtc acgctgcgcg tgttccgtgt cttccgcata 901 ttcaagttct cgcgccactc gcaaggactt cggatcctcg gctacacgct caagtcctgc 961 gccagcgaac tgggcttcct tgtcttctcg ctggccatgg ccattatcat ctttgccacc 1021 gtcatgttct acgccgagaa gaacgtcaat ggcaccaact tcacatcgat tccggcggcc 1081 ttctggtata ccatcgtcac aatgacgacg ctgggatatg gcgacatggt gccagagaca 1141 atagctggca aaattgtggg cggcgtctgc tcgcttagcg gtgtgctggt catcgcctta 1201 cctgtacctg ttatcgtatc gaactttagt agaatctatc accagaacca gcgagcggac 1261 aagcgcaagg cgcagcggaa agctcgcctg gcgcgcatcc gcattgccaa ggcctcgtcc 1321 ggagccgcct ttgttagcaa gaagaaggcc gccgaggccc ggtgggctgc ccaggagtcg 1381 ggcatcgagc tggatgacaa ctatcgggac gaggacatct tcgagctgca gcaccatcat 1441 ttgctgcgat gtctggagaa gacaacgatg tag // LOCUS DROSHABA 2778 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.melanogaster Shab11 protein mRNA, complete cds. ACCESSION M32659 KEYWORDS . SOURCE D.melanogaster, cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2778) AUTHORS Butler,A., Wei,A. and Salkoff,L. TITLE Shal, Shab, and Shaw: Three genes encoding potassium channels in Drosophila JOURNAL Nucleic Acids Res. 18, 2173-2174 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2778) AUTHORS Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L. TITLE Diverse K+ currents expressed by a Drosophila extended gene family which is conserved in mouse JOURNAL Science 248, 599-603 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Salkoff, 07-MAR-1990. FEATURES from to/span description pept 1 2775 Shab11 protein BASE COUNT 679 a 784 c 788 g 527 t ORIGIN 1 atggtcgggc aattgcaagg tggacaggct gctggccagc aacagcaaca gcaacaagcg 61 actcagcaac agcaacactc gaagcagcag ctgcaacagc agcagcagca acagcagcaa 121 ctgcaactca agcagcatca gcagcagcaa caggacatcc tgtatcagca acataacgag 181 gcaattgcaa ttgcacgcgg actgcaggct gcaacacctg ccgacatcgg cgataatcag 241 ccgtactacg atacaagcgg taatgtcgat tgggagcggg cgatgggagc cggtggagct 301 ggtgcatatg gtggcatcgg catcggatct ctaccagcag ctggcggtgc tgcttatcac 361 cttgggccag ctaatcccgc aggcctcgtt tctcgtcact tggattacgg tgatggcggc 421 caccttgctg gcccatccgc cggtcttcct gctggagctg tgggatcagg agcaggagcg 481 ggagccggtg cgggagcatc agtcacggga tcaggatcag gagcagggac aggaacagga 541 accggagccg gatctggatc gggcagtgga gcagcaggca aggaagttcg ctacgcccct 601 ttcccagtcg catcaccaac gcactcgatt cccacaacct cccagcagat cgttggcggc 661 gtcggtggcg tgggcgtcgg tggtgccagc agccagtcga tttcgggcgg tgtacccacc 721 cacagccaga gcaacaccac cggcgctctg cagcggacac attccagatc catgtcctcc 781 ataccgccgc ccgagccgtt catgatagcc cagtcgaagg cggtcaacag ccgcgtgtcc 841 atcaacgtgg gcggggtgag gcacgaggtc ctgtggagga cgctggagcg gctgccccac 901 acgcggctcg ggcggctggg ggagtgcacc acccacgagg ccatcgtgga gctgtgcgac 961 gactactcgc tggcggacaa cgagtacttc ttcgaccgac atccgaagag cttcagctcc 1021 atcctgaact tctatcgcac cggcaagctg cacatcgtcg acgagatgtg cgtgctcgcg 1081 tttggtgatg acctggagta ctggggcgtc gacgaactgt acctggagtc ctgctgccag 1141 cacaagtacc accagcgcaa ggagaacgtt cacgaggaga tgcgtaagga ggccgagtcc 1201 ctgcggcagc gcgacgagga ggaattcggc gaaggtaaat tctccgagta ccagaagtat 1261 ctgtgggagc tcctcgagaa gcctaacact agtttcgccg cccgggttat cgcagtgata 1321 tccatactat tcatagtcct gtctaccata gccctgacgt tgaacaccct accacaacta 1381 caacacattg acaacggtac accacaggat aatccgcaat tggcaatggt tgaggccgtg 1441 tgtatcacgt ggttcactct agagtacata cttaggttta gctcctcgcc ggacaagtgg 1501 aagttcttta agggcggcct taacataatc gatctattgg caatactccc atactttgtt 1561 tcgttatttc tattggaaac gaataagaat gcaacggacc agttccagga tgtgcgtcgg 1621 gtggtgcagg tctttcgcat catgcgcatc ctgcgggtcc ttaagctggc ccgtcactca 1681 acgggcctgc agtcgttagg ctttacgctg cgtaactcat ataaggaact cggtctacta 1741 atgctgttcc tggccatggg cgttctcata ttttcttcgc tggcatattt tgccgaaaag 1801 gatgaaaagg atacaaaatt cgtttcaata ccggaagcat tttggtgggc gggtattaca 1861 atgacaactg ttggctacgg ggacatctgt cccacaactg cactgggaaa ggttattggt 1921 actgtgtgtt gcatatgcgg tgttctggtg gtcgctttgc ctattcccat catcgttaac 1981 aattttgctg aattttataa gaatcagatg cgccgcgaaa aggccctcaa gcgtcgcgag 2041 gcactcgatc gtgccaagcg cgagggcagc attgtctcct tccatcatat caatctgaaa 2101 gatgccttcg ccaagtccat ggatctcatc gatgtgattg tcgacacagg aaagcaaaca 2161 aatgtcgtgc atccgaaggg taaaagacaa agcaccccca atataggcag gcagaccctc 2221 gatgtgcaaa gcgccccagg ccacaatctc tcgcaaacgg acggcaacag caccgaaggc 2281 gagtctacca gcggacgcaa tccggccacc accggaaccg gatgctataa gaattacgac 2341 cacgtagcca acctgcgcaa ctccaacctg cacaaccgac gcggatccag ctctgagcag 2401 gatgcagtgc cgccctacag cttcgacaat cccaatgccc gccagacctc aatgatggcc 2461 atggagagct atcggcgcga cgaacaggca ctgctgcagc aacagcaaca gcagcagcaa 2521 cagatgttgc agatgcaaca gattcagcag aaggccccga acggaaatgg aggtgcaacc 2581 ggaggaggag tggccaacaa cctggccatg gtggccgcat caagtgccgg aacagccgtg 2641 gccaccgcca ccaatgccag taatgccagc aataccgccc ccgggtcaga gggcgccgag 2701 ggaggcgtga tggagatggg ggcggtgtcg atgacgacaa cctttcccag gccaagggac 2761 tgcccatcca gatgatga // LOCUS DROSHAWA 1497 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.melanogaster Shaw2 protein mRNA, complete cds. ACCESSION M32661 KEYWORDS . SOURCE D.melanogaster, cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1497) AUTHORS Butler,A., Wei,A. and Salkoff,L. TITLE Shal, Shab, and Shaw: Three genes encoding potassium channels in Drosophila JOURNAL Nucleic Acids Res. 18, 2173-2174 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1497) AUTHORS Wei,A., Covarrubias,M., Butler,A., Baker,K., Pak,M. and Salkoff,L. TITLE Diverse K+ currents expressed by a Drosophila extended gene family which is conserved in mouse JOURNAL Science 248, 599-603 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Salkoff, 07-MAR-1990. FEATURES from to/span description pept 1 1497 Shaw2 protein BASE COUNT 344 a 426 c 390 g 337 t ORIGIN Chromosome 2 left arm at locus 24B-C. 1 atgaatctga tcaacatgga ctcggaaaac agggtggtgc tcaatgtggg tggcattagg 61 cacgaaacct acaaggccac gctgaagaag attccggcta cgcgattatc gcgattaaca 121 gaggcgctgg ccaactatga tccgatactg aatgagtact tctttgatcg gcatccgggc 181 gtcttcgcac aagtgctcaa ctattacaga actggaaagc tgcattatcc cacggatgtg 241 tgcggtccgc tgtttgagga ggaattggag ttctggggcc tagactcgaa ccaagtggag 301 ccctgctgtt ggatgaccta cacacagcat cgcgacaccc aggaaaccct agccgtactc 361 gatcgtctcg atctggatac ggaaaaaccg tccgaagagg aattggcacg caaattcggc 421 ttcgaggagg actactacaa aggcacaata tcctggtggc aggaaatgaa gccgcgcatt 481 tggtccttgt tcgatgagcc ctacagttcc aatgcagcca agactattgg cgtggtttcg 541 gtgttcttca tctgcatttc gatcctgtcg ttctgcctga agacccatcc cgatatgcgg 601 gtgcccatcg tccggaacat tacagtgaaa actgcgaatg gaagtaatgg ctggtttttg 661 gacaaaacgc agaccaatgc gcacatagcc ttcttctata tcgaatgcgt gtgcaatgcc 721 tggtttacct ttgaaatatt ggtgcgcttt atctcatcgc cgaacaagtg ggagttcatc 781 aagtcatctg ttaacatcat agactacata gcgacgctta gtttttatat cgatctagtg 841 cttcagcggt tcgcatcgca cctggagaac gctgacatcc tcgagttctt ctcgatcatc 901 cgcatcatgc gtctgttcaa gctgacgcgc cactcgtccg gactgaagat cctgatccag 961 acgttccggg cctcggccaa ggagctgacc ctgctggtgt tcttcctcgt cctgggcatc 1021 gtgatcttcg ccagccttgt ctactacgcg gagcgcatcc agcccaatcc gcacaacgac 1081 ttcaacagca taccgctggg cctgtggtgg gccctggtca caatgaccac cgtcggctac 1141 ggcgacatgg cccccaaaac ctacattggc atgttcgtgg gtgccctctg cgccctggcc 1201 ggcgtactaa ccatcgcact gccagtgccc gtcatcgtca gcaacttcgc catgtactac 1261 tcgcacacgc aggccagggc caaactgcca aagaagcgga gacgagtgct tcccgtcgag 1321 cagccgcgcc agcccagact gccaggtgcc cctggtggtg tcagtggttg cggcaccccg 1381 ggctcgggtc cccactccgg tccgatggga tccggcggaa ctggaccacg tcgcatgaac 1441 aataaaacaa aggacctggt cagccccaag tcagatatgg ccttcagttt cgactaa // LOCUS SUVSATA 332 bp ss-RNA VRL 10-JUL-1990 DEFINITION Subterranean clover mottle virus satellite RNA (virusoid) sequence. ACCESSION M33000 KEYWORDS . SOURCE Subterranean clover mottle virus (isolated from Trifolium subterraneum) satellite RNA. ORGANISM Subterranean clover mottle virus Viridae; ss-RNA nonenveloped viruses; Velvet tobacco mottle virus group. REFERENCE 1 (bases 1 to 332) AUTHORS Davies,C., Haseloff,J. and Symons,R.H. TITLE Structure, self-cleavage, and replication of two viroid-like satellite RNAs (virusoids) of subterranean clover mottle virus JOURNAL Virology 177, 216-224 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.H.Symons, 20-MAR-1990. FEATURES from to/span description site 62 63 self-cleavage site site 1 120 high sequence homology with virusoid of subterranean clover mottle virus site 232 332 high sequence homology with virusoid of subterranean clover mottle virus BASE COUNT 77 a 93 c 80 g 82 t ORIGIN 1 agaggcatac cctcctcgcg gattttgaag gtgttctagc tacccaagta ttccacgctg 61 tctgtacttg tatcagtaca ctgacgagtc cctaaaggac gaaacagcgc accgcaatct 121 acgtataccc cgattcgact tgcttggagc aagcgttcga cagagtgccg cgcctggaat 181 gacgcggttc tggccacact cacccgggag gccatcgggc ggattatact agttgtcaag 241 gacctgtcgt tagttctact atacattact acactacgtg ttacttgtta ggtggcccca 301 cctcactttc gtgaaggcta gagaacgtcc ac // LOCUS SUVSATB 388 bp ss-RNA VRL 10-JUL-1990 DEFINITION Subterranean clover mottle virus satellite RNA (virusoid) sequence. ACCESSION M33001 KEYWORDS . SOURCE Subterranean clover mottle virus (isolated from Trifolium subterraneum) satellite RNA. ORGANISM Subterranean clover mottle virus Viridae; ss-RNA nonenveloped viruses; Velvet tobacco mottle virus group. REFERENCE 1 (bases 1 to 388) AUTHORS Davies,C., Haseloff,J. and Symons,R.H. TITLE Structure, self-cleavage, and replication of two viroid-like satellite RNAs (virusoids) of subterranean clover mottle virus JOURNAL Virology 177, 216-224 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.H.Symons, 20-MAR-1990. FEATURES from to/span description site 63 64 self-cleavage site site 1 120 high sequence homology with virusoid of subterranean clover mottle virus site 286 388 high sequence homology with virusoid of subterranean clover mottle virus BASE COUNT 97 a 106 c 91 g 94 t ORIGIN 1 agaggcatac cctcctcgcg gattttgaag gtgtttcagc tacccaaagt attccacgct 61 gtctgtactt atatcagtac actgacgagt ccctaaagga cgaaacagcg caccgcaact 121 tggccagacc tcgccaatca cccccacacc aagccaaaaa ccggtcccca acgcagttta 181 gtatcaagtc gtcgcatcca cgctcccgag ggaggaagtt tgcgccttga ggttctgcac 241 ggtcgtggta acaggaaaag tgttggaatg tttgaaggtc ttgcggttgt caaggaccaa 301 gtcgttagtg ttactatata ttactaccct acgtgttact ttgttaggtg gccccacctc 361 actttcgtga aggctaggaa acgtccac // LOCUS BOVCYP4SC 1073 bp ss-mRNA MAM 10-JUL-1990 DEFINITION Bovine cytochrome P450-scc mRNA fragment. ACCESSION M25920 KEYWORDS cytochrome P450-scc. SOURCE Bovine adrenal gland, cDNA to mRNA, clone pBA644. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1073) AUTHORS Chung,B.-C., Matteson,K.J., Morin,J.E., Mellon,S.H. and Miller,W.L. TITLE An approach to the molecular biology of congenital adrenal hyperplasia JOURNAL Ann. N.Y. Acad. Sci. 458, 238-251 (1985) STANDARD simple staff_entry COMMENT The coding region for cytochrome P450-scc was not indicated in [1]. BASE COUNT 259 a 243 c 193 g 378 t ORIGIN 1 taagtctgaa ttttgcaata aggaactcat gatttgaatt acagtcagct cccattcctg 61 tttttgctga ctatatagag ccttctccat ttttggctgc aaaacatata atcagtctga 121 tttggtattt atcattttgt gacataatgt gtaagagtgc ctcgtctgtt tggaaaaggt 181 agtttctatg accagtgtgt ctcttggcaa actctgttaa cctttgtctc accacttcat 241 tttgtattcc aaggcctttg tttctctgtt tctccaggta tctcttgact tcctactttt 301 accttccaat cctctaggat gaaaaggaca tctttttttt tttttttggt gtagttctag 361 aaggtcttca tagaaagggt caacttcaac ttcttaggca tcagtggtta gggcatatac 421 ttggattact gtaatgttaa atggtttgct ttggaaacta accaagatca ttctgttgct 481 tttgagattg cacccaaata ctgcattttg gactcttctg tttactatga ggactactcc 541 atttaatcta aaggattctt aggccacaat agtagatata atggtcatct gaattattat 601 aaatttatca attttcttcc attttagttc actgaattct aacttattga tgcttcattc 661 ttgccatctc ctgcttgacc atgtttttta ccttgattca tggacctgac attccaggtt 721 cctatgcaat attattctgt atagtgtcag acttactttc accaccagac atatccacaa 781 ctgtatatca tttccgtttt ggcccagctg cttcactttt tctggaacta ttcatatctg 841 ccctccactc tttcccaata gcatattgga cacattctcg aacacaggga gccgggggac 901 aggtgctggt ttcttctggc acacctgggg cagctgaaca cagtgttgac tggcagacac 961 agccccacac caaacgctcg ctaacactga cactgttccc gtgatggcca gggagccccc 1021 tccccaaaaa cctgctcctg gaagctggca ggatttgtgc cattcataag ggt // LOCUS BOVCYPC21 920 bp ss-mRNA MAM 10-JUL-1990 DEFINITION Bovine cytochrome P450-c21 mRNA fragment. ACCESSION M25921 KEYWORDS cytochrome P450-c21. SOURCE Bovine adrenal gland, cDNA to mRNA, clone pBA4.8. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 920) AUTHORS Chung,B.-C., Matteson,K.J., Morin,J.E., Mellon,S.H. and Miller,W.L. TITLE An approach to the molecular biology of congenital adrenal hyperplasia JOURNAL Ann. N.Y. Acad. Sci. 458, 238-251 (1985) STANDARD simple staff_entry COMMENT The coding region for cytochrome P450-c21 was not indicated in [1]. BASE COUNT 185 a 278 c 203 g 254 t ORIGIN 1 gttcagatgc tgtgtcccat tgggaaagtt cagcaggtta ccagggccac ggcctcagtc 61 atcctcagaa tcgctgtccc tcttggcagg gacagagcac cgcaccgcag acagcagcac 121 gtcttccacg ggcttcttgg gattctcctc caggctcgtc ttgatggctc cagactcaga 181 gcaacttcca ctccaactcg tccaaagtca ggttcatgcc accaaacacc agaggtccgg 241 ataactgagc cttgatgtca ccttcaaggt acacaaatac cgtggcagat tcctatcagg 301 gtaactgggt atgcaggtgg ttgaaatggc tttgataaac ttgacatcag gaaacttcct 361 ggcgaggtgc actcaagtgc tgatttatca gggcacagag gggaatccct tgtttgtaaa 421 ggtgcaggat gacccataag ccctcaccag ctttggtaac ttcttgaaca taatcctttc 481 cagagatttc caaaacctct ccaaatttgt tcttcagttg ggtcgctttc cattcggcca 541 gcctttgctg cctgtacatt tcaattgcac gttcgtcttc ctcattaaat tcgtcttcat 601 tatcctccag ttcttccaaa gtcatgtctt catatgtttt cacaatggac tgctggagga 661 tccgctgctc ctcttcttct gcctccttct ccagatcttt caaatcttcc tttgaaggca 721 agatgccttt tttgcgtaag atgtcattcc actcggtgtc tgcgttgggg tcctgcattt 781 tctgtcaaat cgctagggcc ctgccggcca cagccacccg gcccgtgagc tctctaccgc 841 gcacgcaggc gccactcgcc tcctctccca gcctgccctg agatctcgtc cgcccgttgg 901 ccctccttct cttggcgccg // LOCUS MUSINT4 3000 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse proto-oncogene Wnt-4 protein mRNA, complete cds. ACCESSION M32502 KEYWORDS Wnt protein; proto-oncogene. SOURCE Mouse (strain BALB/c) 8.5 day old embryo, cDNA to mRNA, (library of B.Hogan). REFERENCE 1 (bases 1 to 3000) AUTHORS Roelink,H., Wagenaar,E., Lopes da Silva,S. and Nusse,R. TITLE Wnt-3, a gene activated by proviral insertion in mouse mammary tumors, is homologous to int-1/Wnt-1 and is normally expressed in mouse embryos and adult brain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 4519, 4523 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H. Roelink, 03-MAR-1990. FEATURES from to/span description pept 46 1113 Wnt-4 protein BASE COUNT 703 a 789 c 787 g 721 t ORIGIN Chromosome 11. 1 cctcttcatg atcgccggca aacttcctcc tcggcgctgc ttctaatgga gccccacctg 61 ctcgggctgc tactcggcct cctgctcagt ggcaccaggg tcctcgctgg ctacccaatt 121 tggtggtccc tggccctggg ccagcagtac acatctctgg cctcccagcc tctgctctgc 181 ggctccatcc caggcctggt ccccaagcaa ctgcgcttct gccgcaatta catcgagatc 241 atgcccagcg tagcagaagg tgtgaagctg ggcatccagg agtgccagca tcagttccgg 301 ggccgccggt ggaactgtac caccatagat gacagcctgg ccatctttgg gcctgtcttg 361 gacaaagcca cccgtgaatc ggccttcgtg catgccatcg cctcggctgg tgtcgccttc 421 gcagtcacac gctcctgcgc tgagggaacc tccaccatct gcggctgtga ctcacatcat 481 aaggggccac ctggagaagg ctggaagtgg ggcggctgca gcgaggacgc cgacttcggg 541 gtgctggtgt cccgggaatt tgcggatgcg cgggagaaca ggccagatgc ccgctcagct 601 atgaacaagc acaacaatga agcaggccga acgaccatcc tggaccacat gcacctaaag 661 tgtaaatgcc acgggttgtc cggcagctgc gaggtgaaga cctgctggtg ggcccagccc 721 gacttccgtg ccattggcga cttcctcaag gacaagtacg acagtgcctc cgagatggtg 781 gtggagaaac accgtgagtc ccgaggctgg gtggagaccc tgcgggctaa gtacgcgctc 841 ttcaagccac ccaccgagag ggacctggtc tactacgaga actcccccaa cttttgtgag 901 cccaacccag agacgggctc ctttggtacc agggaccgga cttgcaatgt cacctcccac 961 ggcatcgatg gctgcgatct gctgtgctgt ggccggggcc acaacacgag gacggagaaa 1021 cggaaggaga aatgccattg cgtcttccac tggtgctgct atgtcagctg ccaagagtgt 1081 attcgcatct acgatgtgca cacctgcaag tagtgagcca gggcactggg aaggggtaga 1141 ttgtgcggct ggatccattc atcgaagtcc catgagaagc aggatctaga tccaggccag 1201 ccttcggcac tggccagcaa ggagcatgga ctgttgccag ctgcatgtga taaacgacct 1261 ggacccagcc ggcctcggac ggacgggcgg cttctttctc aactaacgtc tctccccctg 1321 ctctggatgg tgtacggctt tacagagggg ctttctttat ggttttacca gggtctgctg 1381 gggacagact cgaggcttac ctttgcacat gttaaagaaa ataaaaatga aaaaaaaaaa 1441 tctaccgcaa cagaacaggc tgggctagtg tgagctcttg gcctggtggg aaggacaaga 1501 ccatggcgag attctgtgtc caagctgcct ctactcgtga cattccaaga tgcctctgag 1561 gtgggaactg tgaagtagga cagagccccg cagtcccctc ttgtccgtcg actcccattt 1621 aaattggaca taccttgtcg ttctgagaaa agccatagat aggtgtagct gggatgtagt 1681 gatggggagg cccctggcca acagtgggag caagatcttg agttttgaag acctcagagt 1741 tctgggcggc ctgggaagcc atctgcagaa cagagttcct tgtgggctcc tgttttcgct 1801 agccctgttc tgccctggag cgacagtcag atctccacgc ccctttctgt tgttctacag 1861 tgtccacctt tactacgcgt tttttttttt tttttcatga tgaccttgta aataggtcag 1921 atgtggaggc aggtctcttc tggctccatc caccacaccc agaaagaatg ggctgctctg 1981 cccttctcag ccttgctaac cagcagacac cgaggagagc agcggggcac cttagagagc 2041 aatctaaaca tggttggcag gtggggaggg taaagagtcc cacttccttt gtgttagaag 2101 gcagactacc ctgcgtcctt ttctcccatt ggctgaagta accagaaaga caagagatcc 2161 ttaacaagcc cttcttccca cttgtaaaag ggatagccta tctcagttcc caaggatctg 2221 gattagatag atattcaaaa gaggcaagca gcgaatggag gcagctccca gctctgttcc 2281 cgacgcatga tggtactggc tgggtttagt aaggtgggtg gggctgcacg gatcaatcca 2341 tcaactccgt cttaaggaga atcagaaaga ggagataaaa tgggggaatg gggcagaaca 2401 aagaatttgt cctttcccgc ttctgtctag ggtctgctaa tgctggcttg acgaggggtc 2461 agccacttct ttcctgttgt gcagttggct tgccaagcag gctccagtag gcccttgcct 2521 gcactctcta ccatgtgacc atgagcactg ctctagggac acctcccatc ccttcctagc 2581 accccaaatg ccccttccca tctctccttc cagaagttgg aaatcaagtc aactggataa 2641 cgcttgtgtg agacacttga gcagaacgga tacaacaatt tacaagtctc ttcatatcta 2701 tgtattctat attaaaagtg ataaagtcat gtttccgggg cgtattcaag tagctgacaa 2761 gtaattattt aataatagta catgagcgca ttgtaattat cctcgccata gtcaggtaat 2821 agcatccaat gggaggtccc taccaacctg ctgtatccaa agttttgtaa aaagttgtag 2881 aagttgttga tctttttgat tttatattca aaaagtctct ttttataaat attatttatt 2941 atacaatgta tatacctttg agttaactaa gattatatat tatataaata tatatatatt // LOCUS DRONCDA 2294 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.melanogaster non-claret disjunctional protein (ncd) mRNA, complete cds. ACCESSION M33932 KEYWORDS non-claret disjunctional protein. SOURCE D.melanogaster (strain dp cl cn bw) 0-4 hr embryo, cDNA to mRNA, clone pNB40. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2294) AUTHORS Mcdonald,H.B. and Goldstein,L.S.B. TITLE Identification and characterization of a gene encoding a kinesin-like protein in Drosophila JOURNAL Cell 61, 991-1000 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.B.Mcdonald, 30-APR-1990. FEATURES from to/span description pept 111 2168 non-claret disjunctional protein site 180 319 alpha helical domain BASE COUNT 622 a 661 c 603 g 408 t ORIGIN 1 bp upstream of EcoRI site; chromosome 3 map position 99BC. 1 gaattgataa aatcggttgc aaggaggcag acgtatcttc taagttaggc acaacacagt 61 tggcgatgga atcccggcta ccgaaaccgt cgggcctgaa gaaaccccaa atgccgatta 121 aaaccgtgct gcccacagat cgaattcgcg caggattggg aggtggagcc gctggagcag 181 gcgccttcaa tgtcaatgcc aaccagacat actgcggcaa cttattgccg cccctctcaa 241 gggacctcaa caatctgccc caggtgctgg agcgtcgcgg aggaggagca cgtgccgcct 301 ccccagagcc catgaagttg ggccaccggg ccaagctgag acgtagccgt agcgcttgcg 361 acatcaacga actgcgtggt aacaagcgca ctgcggctgc tccttcattg cccagcattc 421 ccagcaaagt atcccgcctg ggcggtgcac tcactgtttc cagccagcga ctagtgcgtc 481 ctgcggcgcc ttcgtcaata acagcaacag ctgtcaaaag accaccagta acgcgtcctg 541 ctccacgggc tgcaggagga gcagccgcca agaaaccagc aggaacagga gcagcagctt 601 cgtcaggagc cgcggctgct gctcccaagc gcatcgctcc ctacgacttc aaggcccgct 661 tccacgatct gctagagaag cacaaggtgc ttaagacaaa gtacgaaaag caaacagagg 721 acatgggcga gctggagtcc atgcctcagc aactggagga gacgcagaac aagcttatcg 781 agacggagtc ctcgctgaag aacacccaga gcgacaacga gtgtcttcag aggcaggtga 841 agcagcatac cgccaaaatt gaaacaatca catcgacgct gggcaggacc aaagaggagc 901 tatccgagct gcaagcaata catgagaaag taaaaacgga gcatgctgct ctaagcacag 961 aagtggtgca tctgcgccag cgcaccgagg aactcctgcg ctgcaatgag cagcaggccg 1021 ccgagctgga gacctgcaaa gagcagctct tccagtcgaa catggagcgc aaagagctgc 1081 acaacacggt catggacctg cgcggcaaca tccgggtctt ctgtcgaata cgaccgccgc 1141 tggagtccga ggagaaccgt atgtgttgca cctggaccta tcacgacgag tccaccgtgg 1201 agctgcagag cattgacgca caggccaaaa gcaagatggg ccagcagatc ttctcattcg 1261 accaggtctt ccacccgctc tcctcgcagt cggacatctt cgagatggtc tcgccgctca 1321 tccagtcggc cctggatggc tacaatatct gcatctttgc ctacggacag acgggcagtg 1381 gcaagaccta cacaatggac ggagtgccgg agagtgtggg cgtcataccg cgcacggtgg 1441 atctgctctt cgactccatc cggggatatc gcaacttggg ctgggagtac gagatcaagg 1501 ccacctttct ggagatctac aacgaggtgc tctacgatct gctgagcaac gagcagaagg 1561 acatggagat tcgaatggcc aagaacaaca agaacgacat ctacgtgtcc aacataacgg 1621 aggagacggt tctggatcca aatcacctgc gccacctcat gcacacggcc aagatgaacc 1681 gtgccaccgc ctcgacagct ggcaacgagc gctcctctcg ttcccacgcg gttaccaagc 1741 ttgagctcat cggacgccat gccgaaaagc aagagatctc cgtgggttcc ataaacctgg 1801 tggatttggc cggctctgag tctcccaaga cgagcacccg gatgaccgag acaaagaaca 1861 tcaatcgctc gctatcggag ctcaccaacg taatcctggc gctgctgcag aagcaggacc 1921 acatcccgta caggaactcc aagctgacgc acctgctgat gccctcgctg ggcggcaact 1981 cgaaaacgct tatgttcatc aacgtctcgc cgttccaaga ctgtttccaa gagtccgtca 2041 agtcgctgcg cttcgcggcc tccgtaaact cctgcaaaat gaccaaggcc aagcggaatc 2101 gctacctgaa caactcggtg gccaacagca gcacacagag caacaacagc ggcagtttcg 2161 ataaataaag aatgcattct gagcccagtt ttaacaattt tcaaatttct aacctgttat 2221 tgcttaattt atgtgtgttt acttttagtg caaataaact aataaagtgc tggaaaaaaa 2281 aaaaaaaaaa aaaa // LOCUS YSCVPS1A 2457 bp ds-DNA PLN 10-JUL-1990 DEFINITION S.cerevisiae GTP-binding protein (VPS1) gene, complete cds. ACCESSION M33315 KEYWORDS GTP-binding protein. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2457) AUTHORS Rothman,J.H., Raymond,C.K., Gilbert,T., O'Hara,P.J. and Stevens,T.H. TITLE A putative GTP binding protein homologous to interferon-inducible Mx proteins performs an essential function in yeast protein sorting JOURNAL Cell 61, 1063-1074 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.O'Hara, 02-APR-1990. FEATURES from to/span description pept 318 2432 GTP-binding protein (VPS1) BASE COUNT 812 a 454 c 496 g 695 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgata gatacttgaa tcctctaata gtcgaaaaat gctcgagggt aaaccacttg 61 tgcgcttgga ctggcctagt ttccaaaacc aatgttctaa tggattgatt tcttccccaa 121 acattattaa gtggccgggt cacccaaaga cttgggcgcc gttgattcgc gtcgctttgc 181 catcaagaga acaacatatc ttccaagaca gaccgagata attcatctat ttactcctaa 241 aaaagaatta gagaggcctt ttatagcacc aaaataagga ccgtacgaaa actgcacatt 301 ttatattatc agatatcatg gatgagcatt taatttctac tattaacaag cttcaggacg 361 ctttggcgcc cttaggagga ggatctcaat ctcctattga tttaccacag atcaatgttg 421 tcggttccca gtcgtcagga aagtcgtccg ttttggagaa cattgttggt agggatttct 481 tgccaagagg tactggtatt gtcaccagga gacctttagt gttacaattg attaatagga 541 gaccaaaaaa gtcagaacat gctaaagtaa accaaactgc taatgaattg attgacttga 601 acatcaacga tgatgacaag aaaaaggatg aatcaggaaa gcaccaggaa gagggacaat 661 ctgaagacaa taaagaggaa tggggtgaat ttttgcattt acctggtaag aagttttata 721 attttgacga aattagaaag gaaatcgtca aagaaactga caaagtgaca ggtgccaatt 781 caggtatttc ttctgtgccc attaacttga gaatttattc tccgcatgtt cttactttga 841 cgttagtgga tttgcctggg ttgacgaagg ttcccgtagg tgaccaacct cctgatattg 901 aaagacaaat taaggacatg ttgttaaagt atatttcgaa accaaacgct atcatattat 961 ctgttaatgc cgctaacacc gatttagcca acagcgatgg tttgaagctg gctagagagg 1021 tcgatccaga aggaacgaga actattggtg tcttgacaaa agtcgatttg atggatcaag 1081 gtacagatgt catagatatt ttggctggaa gagtcattcc tttgagatat ggttatatcc 1141 cagttatcaa tagaggtcaa aaggatattg aacacaaaaa aacaatcaga gaagcccttg 1201 aaaacgaaag aaaatttttt gagaaccatc cctcttacag ttctaaagct cattactgtg 1261 gtacaccata tttggctaaa aagttaaact caatcttatt acaccacatt aggcaaactc 1321 tgccagaaat caaagcgaaa atcgaagcca cattgaaaaa atatcaaaac gaacttataa 1381 acttgggccc agaaactatg gattcagcta gttcggttgt tttgagcatg attactgatt 1441 tttccaatga atatgccggt atcttggacg gtgaggcgaa ggagctttcc agtcaggaac 1501 tttctggtgg tgctagaatt tcttacgtat tccatgaaac tttcaaaaat ggtgtagact 1561 ctttggatcc attcgaccag atcaaagatt ctgatatcag aaccattatg tacaatagtt 1621 caggttctgc cccatctttg tttgtcggta ccgaagcttt tgaagtttta gttaaacagc 1681 aaattagaag atttgaagaa ccatctctac gtttagttac tctggtgttt gatgaacttg 1741 ttcgtatgct aaaacagatt atttcacaac caaagtactc aaggtatcct gctctaagag 1801 aagcgatttc taatcagttc attcagttct taaaggatgc tactattcct acgaatgagt 1861 ttgttgtcga tatcatcaaa gctgaacaaa cttacatcaa tacagcccat cccgaccttt 1921 tgaagggttc tcaagcaatg gttatggtgg aagaaaaatt acatcctcgc caagtcgctg 1981 ttgacccaaa gacgggtaaa ccattaccaa cccaaccatc gtctagtaag gcgccagtta 2041 tggaagagaa atcaggattt tttggtgggt tcttctccac taaaaacaag aagaaattgg 2101 cagctttgga atccccacct cctgttttaa aagctactgg ccaaatgaca gagagggaaa 2161 caatggaaac agaagtaatc aagttgttga ttagtagtta tttctctatt gtcaaaagaa 2221 ccattgccga tattatacca aaggctttga tgcttaaatt gattgtgaaa agtaaaactg 2281 atattcagaa agttttactc gaaaaacttt acggaaagca agatattgaa gaattaacga 2341 aagaaaacga cataaccatt caaagaagaa aagaatgtaa gaagatggtc gagatattga 2401 gaaacgctag tcaaatcgtc tcctctgttt aggttttcct catctatacc ggtcgac // LOCUS R75RELAX 99 bp ds-DNA BCT 10-JUL-1990 DEFINITION Plasmid R751 relaxation region. ACCESSION M33118 KEYWORDS . SOURCE Plasmid R751 DNA. ORGANISM Plasmid R751 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 99) AUTHORS Pansegrau,W., Ziegelin,G. and Lanka,E. TITLE The origin of conjugative IncP plasmid transfer: Interaction with plasmid-encoded products and the nucleotide sequence at the relaxation site JOURNAL Biochim. Biophys. Acta 951, 365-374 (1988) STANDARD simple staff_entry BASE COUNT 32 a 26 c 22 g 19 t ORIGIN 1 gaataaggga cagtgaagat agataaccgg ctcgccggtt agctaacttc acacatcctg 61 cccgccttac ggcgttaata acaccaagga aagtctaca // LOCUS RP4RELAX 99 bp ds-DNA BCT 10-JUL-1990 DEFINITION Plasmid RP4 relaxation region. ACCESSION M33117 KEYWORDS . SOURCE Plasmid RP4 DNA. ORGANISM Plasmid RP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 99) AUTHORS Pansegrau,W., Ziegelin,G. and Lanka,E. TITLE The origin of conjugative IncP plasmid transfer: Interaction with plasmid-encoded products and the nucleotide sequence at the relaxation site JOURNAL Biochim. Biophys. Acta 951, 365-374 (1988) STANDARD simple staff_entry BASE COUNT 27 a 28 c 28 g 16 t ORIGIN 1 gaataaggga cagtgaagaa ggaacacccg ctcgcgggtg ggcctacttc acctatcctg 61 cccggctgac gccgttggat acaccaagga aagtctaca // LOCUS ACCTRPF 1466 bp ds-DNA BCT 10-JUL-1990 DEFINITION A.calcoaceticus 5'-phosphoribosyl anthranilate isomerase (trpF) and tryptophan synthase (trpB) genes, complete cds and 5' end. ACCESSION M34485 KEYWORDS 5'-phosphoribosyl anthranilate isomerase; tryptophan synthase. SOURCE A.calcoaceticus DNA. ORGANISM Acinetobacter calcoaceticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1466) AUTHORS Ross,C.M., Kaplan,J.B., Winkler,M.E. and Nichols,B.P. TITLE An evolutionary comparison of Acinetobacter calcoaceticus trpF with trpF genes of several organisms JOURNAL Mol. Biol. Evol. 7, 74-81 (1990) STANDARD simple staff_review FEATURES from to/span description pept 506 1147 5'-phosphoribosyl anthranilate isomerase pept 1149 > 1466 tryptophan synthase (trpB) (gtg start codon) BASE COUNT 430 a 297 c 334 g 405 t ORIGIN 1 gatcaagttt agttgcatct gttgaatcat cagcaaaaac agttgttgaa gaaaacccca 61 ttgcaattgc aatcgccccc actaaacggg taggctgaaa agaaatagac atgtattgtg 121 ctccatacat tcaccccacg tgaatgattg agtggataga tgtaacaagc aggtctccgg 181 actcaaatgg catctcaaaa agagacaagc atattcacct tcccacatct atgcatgcag 241 tggcgtaagt ctaaatgact tttttaatat ggtttacatt tttaccgttg cgggggcagc 301 actggatttg caccagtttc cctaaagcga atgcttttaa cttgttacga attgtgtaaa 361 gtataaagtc tgagcgaaga ttaaacaatc tgaatacgat caaattcgtt caactttgac 421 gcaaagcaca aaaattgcat tacaatactt agcccaatga tggatagatc ggctgtctgt 481 caggcaatac aatgagcttc tttctatgcg aacgcgcgca aaaatttgcg gtattacccg 541 ttcccaagat gtccaagcag cagtaagtgc aggtgcagat gccattggac tggttttttt 601 cccaccaagt cctcgacatg tttctatagc gcaagcgcaa gcattgctcc agcatattcc 661 cgcttatgtt caggtggttg gtttatttgt gaatgcaact gcggatcaaa tcaaatcagt 721 gcttgattgt gtggctttgg atgtattaca actacatggc gatgaaacgc ctgagcaatg 781 tcaagagatt gctctgcagt gcaagcgtcg ctggtataaa gccattcaag ttaaaccaga 841 gcttgatgta gttgatgaag ttcagcgtta tcaggccgct ggtgcaagtg cggtattgct 901 ggatgcgtgg catccagagc tcaaaggtgg aactggtcat caatttgatt ggtcgaagtt 961 tcccaagctg gatattccac ttattcttgc aggcggttta acgcctgaaa atgttgtaga 1021 tgccattcaa accacacacg cttttgcagt ggatgtgagc ggaggggtag aggccgcaaa 1081 aggtattaaa gataaacaac tcatcgaacg atttatgcaa ggagtccaat gtggatcagc 1141 aaaataacgt gattgactat acgcaatatc cagatgctcg tgggcatttt ggtattcatg 1201 gcggacgttt tgtatcagaa acacttatgg cggcacttga agatttagaa aatctttaca 1261 accgcatgaa aaatgacgaa cagtttctgg cagaatttga ccgcgatctt gcctattatg 1321 taggtcgtcc tagtccactt tattatgctg aacgatggtc aaagaagctc ggtggtgcgc 1381 aaatttactt aaaacgtgaa gacctgaatc atacaggttc acacaaagtt aataacacca 1441 ttggtcaggc attattggcc aagctt // LOCUS BCIGLCA 2316 bp ds-DNA BCT 10-JUL-1990 DEFINITION B.circulans beta-1,3-glucanase A1 (glcA) gene, complete cds. ACCESSION M34503 KEYWORDS beta-1,3-glucanase. SOURCE B.circulans (strain WL-12) DNA, clone pNT003. ORGANISM Bacillus circulans Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2316) AUTHORS Yahata,N., Watanabe,T., Nakamura,Y., Yamamoto,Y., Kamimiya,S. and Tanaka,H. TITLE Structure of the gene encoding beta-1,3-glucanase A-1 of Bacillus circulans WL-12 JOURNAL Gene 86, 113-117 (1990) STANDARD simple staff_review FEATURES from to/span description pept 241 2289 beta-1,3-glucanase A1 (glcA) BASE COUNT 705 a 489 c 538 g 584 t ORIGIN 1 ggaaattcaa cccacagagt atcgacaaat gatgcgccaa aacgtagaac gtgaagtaca 61 ataccacagt acaaatatat aaattgaatc aaaacccaaa aaattgggat ataacaaaaa 121 taattgtacc ttttcagcag attatcctat tcgatagaat aaagatattc ccccatgtaa 181 gcgatttcct ttatacgcat agattgggag aaactattat cctatcaaag gagggcaatt 241 atgaaaccat ctcactttac ggagaaacgg tttatgaaaa aggtacttgg tttgttctta 301 gtggttgtga tgctggctag tgttggcgtg ttgccaactt caaaagttca agcagctggg 361 accacagtta cctcaatgga gtacttctca ccagcagatg gacctgttat ttcaaaatct 421 ggcgttggca aagccagcta cggatttgtt atgcctaagt tcaatggagg ctccgctacg 481 tggaacgatg tttacagtga cgtgggtgtc aatgtgaaag tgggtaacaa ctgggttgat 541 attgatcaag ccggaggtta tatctataac caaaactggg ggcactggag cgatggcggt 601 ttcaatggct attggttcac cctttccgca acaaccgaaa ttcaactgta ctccaaagcg 661 aatggtgtta agcttgaata tcaacttgta ttccaaaaca ttaacaaaac aaccatcaca 721 gcgatgaatc cgacacaagg gccgcaaatt acagcaagtt tcacaggcgg tgcaggcttt 781 acatatccaa cgttcaacaa tgattctgcg gtaacctatg aagccgtagc ggatgatttg 841 aaggtgtatg taaaacctgt aaacagcagc tcatggattg atattgacaa taatgcagcc 901 agcggctgga tttatgatca caacttcggc caattcaccg acggtggagg aggttactgg 961 tttaacgtaa cggaatcgat caacgtcaaa ttggaatcaa agacttcttc ggctaacctt 1021 gtttatacaa ttacgtttaa tgaacctaca agaaattcat atgtcattac gccatacgaa 1081 ggaacaacct tcacagcaga tgcgaatggt tccattggaa tcccgcttcc caaaattgat 1141 gggggtgcgc caatcgccaa agaactgggc aatttcgtat atcagattaa catcaatggg 1201 caatgggtgg atttgagtaa ctccagtcag agcaagtttg catactcggc taatggctac 1261 aacaatatgt ctgatgccaa ccagtggggg tactgggccg attatatcta tggcctttgg 1321 ttccagccaa tccaggaaaa tatgcaaatc cgtatcggat atccgctgaa cggacaggcg 1381 ggtggaaata ttggcaacaa cttcgtcaac tataccttca tcggtaatcc aaatgctccg 1441 cgtccggatg tatccgatca agaggatatc tcgatcggaa caccaactga cccggctatt 1501 gcgggcatga atcttatctg gcaggatgaa tttaacggaa ctacactgga tacaagtaaa 1561 tggaactatg aaacaggtta ttatctcaat aacgatcccg ctacttgggg atggggaaat 1621 gcagagttgc agcactacac aaacagcaca caaaatgtat atgtacagga cgggaagctg 1681 aatatcaaag ccatgaacga tagcaaatct ttcccgcagg atccgaatcg gtatgcacag 1741 tattcttcag gtaagattaa caccaaggat aaactctcct tgaagtacgg cagagtagat 1801 tttcgtgcca agcttcctac aggggatggc gtttggccag cgctgtggat gcttccaaaa 1861 gattctgtat atggcacatg ggctgcatcg ggtgaaatcg atgttatgga agcaagagga 1921 cgtcttccag ggtctgtaag cggtaccata cactttggcg gacaatggcc cgtgaaccag 1981 tcttcgggtg gcgattatca cttcccagaa gggcaaactt ttgccaatga ttatcatgta 2041 tactcggtag tctgggaaga ggacaatatt aaatggtatg tcgacggcaa gtttttctat 2101 aaagtcacta accagcagtg gtattccaca gctgcaccga ataatccgaa tgctcctttc 2161 gatgagccgt tctacctcat tatgaacttg gcagtcggcg gaaacttcga cggaggccgt 2221 actccgaacg cgtccgatat cccggcaact atgcaagtgg attatgtacg tgtgtataaa 2281 gaacagtaat aaaacagccg tttccgcgat tggggt // LOCUS CHKAGLOB 1737 bp ds-DNA VRT 10-JUL-1990 DEFINITION Chicken alpha-globin gene, alpha-5HR DNA fragment. ACCESSION M34465 KEYWORDS alpha-globin. SOURCE Chicken fibroblast DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1737) AUTHORS Kalandadze,A.G., Bushara,S.A., Vassetzky,Y.S.Jr. and Razin,S.V. TITLE Characterization of DNA pattern in the site of permanent attachment to the nuclear matrix located in the vicinity of replication origin JOURNAL Biochem. Biophys. Res. Commun. 168, 9-15 (1990) STANDARD simple staff_review BASE COUNT 356 a 589 c 447 g 345 t ORIGIN 1 gcggcacggg gcggccccgg gcccggcgcg cacttactgg ccttggcggc ggggtgctcg 61 gcgccgcgct ggaaggggaa gcggaagagc agcttgttgc cgcggctgcc cgagctcaca 121 aggataacgc tgatggggct ggtgctctcg cccatgccgc cgcgccacag cgagcaccgg 181 gcgggcaacg acggacgcgg ctccgcggaa ggcggcccgg cccgcgcgac ttccgcttcc 241 gcgcctccgc cgccgccgcc ggttcccccg ggccgcggcc gagcggcggg gcggagctgc 301 gggcacagcg ctccccgggc aggtcgcgct cagaggccgg gccgccgctt cagcgccgtg 361 ccctcagtgc ggcccagcgc cgtgcccgca gcgctgccca cacgccctcg gggtgcccca 421 cggctgctgc ttgctcccgg tgcccgccgt tcctcccagc acctcgcagt gcagccgtgc 481 ctgaagtgca gcccagcacc tcacacctca gccccgggct cccagtacga ccagcaggtc 541 acgttggagt ctcttgtcct caagactgcg cagtgtctca cctttgagcc ttgtgccccc 601 cattcagccc agcacatcac actgtagccc ttacaccctc accacagcac agcacctcac 661 gttcaggccc cagcacgtca agatggagcc ctgtgccccc agacagccag catggaacca 721 tcaaatcctt agagttggaa gatgtctgaa tccttgtgcc cccagttcag cccggcacct 781 ctcacacccc actcaacact cttcagccaa gagcctacag ctcaacccag cacctcacgc 841 cacccagcag cactcccgcc atcagcccag tgcccccagt ccggatcggt acctctcatg 901 cccatgcaca gtgcaccaga tcagcctagc accactagtt cattccagca cctcacgtgc 961 ccacagccaa ccactccagc acccccggtg ccctagtcac acctctccgc tgcctcaagg 1021 ttcattccca cctcttccca catcccctca caccccctca ttattttcat gtctcgcaat 1081 ctcctttggt cacttggagt cattcagtta tgacaactcc agaactagaa gctgctggcc 1141 agcagcaagt gccacaaact gtgttccccc ggcagctctt ctggctcatt tgtcttattg 1201 tgtgtccagc tgagatcaga aagctatcgg caattatgtc agaggatggc ccagtttttc 1261 acatagattt gtctgtattt gatagcaata tttagtattt ggtgctccga gtatccccac 1321 tctggatttt tctctgcaag attcttccct tggacttcag gcagagaagg ggactgaaag 1381 ggagatgagc acccgcagtg agggcttaat ctgcacggcc attctctgca aggcaggtga 1441 taacaactga agcaagagaa gctgtcattg aggggagaga gttgttggtg agcgattaaa 1501 gagcagtcac attatcacag cagagcattc atcgtggccc agtgctgggg agctacgtta 1561 gaattgccca gtgtgtctgc ttcccagcat aactatgcat tcttcaatta aaaaactgca 1621 ggcatgtttg ccatttccag ctctcggaga tgagttaaag caaagctctg gaaacctgca 1681 agctctctga gtgctagtag aatgaaatga aagaataaag ccagatatag attctgc // LOCUS HUMPDHBA 1484 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human pyruvate dehydrogenase beta-subunit mRNA, complete cds. ACCESSION M34479 KEYWORDS pyruvate dehydrogenase. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1484) AUTHORS Ho,L. and Patel,M.S. TITLE Cloning and cDNA sequence of the beta-subunit component of human pyruvate dehydrogenase complex JOURNAL Gene 86, 297-302 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1 1080 pyruvate dehydrogenase beta-subunit precursor /hgml_locus_uid="LU0223C" /nomgen="PDHB" /map="unassigned" sigp 1 90 pyruvate dehydrogenase beta-subunit signal peptide matp 91 1077 pyruvate dehydrogenase beta-subunit mRNA < 1 1484 pyruvate dehydrogenase beta-subunit mRNA BASE COUNT 414 a 287 c 369 g 414 t ORIGIN 1 atggcggcgg tgtctggctt ggtgcggaga ccccttcggg aggtctccgg gctgctgaag 61 aggcgctttc actggaccgc gccggctgcg ctgcaggtga cagttcgtga tgctataaat 121 cagggtatgg atgaggagct ggaaagagat gagaaggtat ttctgcttgg agaagaagtt 181 gcccagtatg atggggcata caaggttagt cgagggctgt ggaagaaata tggagacaag 241 aggattattg acactcccat atcagagatg ggctttgctg gaattgctgt aggtgcagct 301 atggctgggt tgcggcccat ttgtgaattt atgaccttca atttctccat gcaagccatt 361 gaccaggtta taaactcagc tgccaagacc tactacatgt ctggtggcct tcagcctgtg 421 cctatagtct tcaggggacc caatggtgcc tcagcaggtg tagctgccca gcactcacag 481 tgctttgctg cctggtatgg gcactgccca ggcttaaagg tggtcagtcc ctggaattca 541 gaggatgcta aaggacttat taaatcagcc attcgggata acaatccagt ggtggtgcta 601 gagaatgaat tgatgtatgg ggttcctttt gaatttcctc cggaagctca gtcaaaagat 661 tttctgattc ctattggaaa agccaaaata gaaaggcaag gaacacatat aactgtggtt 721 tcccattcaa gacctgtggg ccactgctta gaagctgcag cagtgctatc taaagaagga 781 gttgaatgtg aggtgataaa tatgcgtacc attagaccaa tggacatgga aaccatagaa 841 gccagtgtca tgaagacaaa tcatcttgta actgtggaag gaggctggcc acagtttgga 901 gtaggagctg aaatctgtgc caggatcatg gaaggtcctg cgttcaattt cctggatgct 961 cctgctgttc gtgtcactgg tgctgatgtc cctatgcctt atgcaaagat tctagaggac 1021 aactctatac ctcaggtcaa agacatcata tttgcaataa agaaaacatt aaatatttag 1081 tttggacttg aatatcaagt cgttgaaatt tatttgaaat acttgctggc actgcacctg 1141 gatttgtact gcaagacctg actattcata aaggaaaacg atttctaaag caacagcagg 1201 tatttttgta cagggaagtt taaatgtgtt tgtgtatgga aaactctcca ctctcctccc 1261 ctagatgcca tgcttccttt tgtctgttac ggttgccatg ttctttgaat aacaaattat 1321 atcacatttt atcctctctc accacaagga caaagtatgg atgtggcaga gtcctgatga 1381 aagatgtatc caaacaagat aacttatatg tataaaatta aagcatataa tacacattta 1441 ctgttagttt gttttgataa ggaataaagg aatttctaac atga // LOCUS LEIGP63A 3047 bp ds-DNA INV 10-JUL-1990 DEFINITION L.chagasi major surface glycoprotein (gp63) gene, complete cds. ACCESSION M28527 KEYWORDS glycoprotein; protease; surface antigen. SOURCE L.chagasi (isolate MHOM/BR/82/BA-2C1a) DNA, clones pLc63-[1 and 2]. ORGANISM Leishmania chagasi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 3047) AUTHORS Miller,R.A., Reed,S.G. and Parsons,M. TITLE Leishmania gp63 molecule implicated in cellular adhesion lacks an Arg-Gly-Asp sequence JOURNAL Mol. Biochem. Parasitol. 39, 267-274 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.Parsons, 03-OCT-1989. FEATURES from to/span description pept 496 2295 gp63 protein BASE COUNT 423 a 1195 c 925 g 504 t ORIGIN 1 ggtacctccc ccaccccggc cctccggccc cgcgcccccg cctctgtgct gtgccgtgcc 61 ctggactccc tctcctccac ctctcctcgc ttctgtcgct ccgcctcccc gagcgacccg 121 cggcgccgcg cggtgcgtgt ctggtgcggc gagtggcggg gtgccgtccc ccctcgctgc 181 ggcacccctc cccgcgccac cacggaggca cccgtgagca cgccaacaga ccaacgcact 241 cacgtcccca tcgtcctccc ccctccccgc accagcaccg acgtgctctc cgctctccct 301 ccctcaccac ctcccctcgc accctccctt gccttctccc tgtcccctcc ctccccagat 361 ccgccaacgc atccgatccc gctacacccc cctctccccc gcccacacgc acgcgcacac 421 cgccgtgcac aagccctcgc cctcgccctc gccaccacac cccactgccc acagcgcccc 481 cgcgcctgca gagccatgtc cgtcgacagc agcagcacgc accggcaccg cagcgtcgcc 541 gcgcgcctgg tgcgcctcgc ggctgccggc gccgcagtca tcgctgctgt cggcaccgcg 601 gccgcgtggg cacacgccgg tgcggtgcag caccgctgca tccacgacgc gatgcaggca 661 cgcgtgcggc agtcggtggc gcgccaccac acggcccccg gcgccgtgtc cgcggtgggc 721 ctgccgtacg ttactctcga caccgcggcc gccgccgatc gccggccggg cagcgcgccc 781 acagtcgtgc gcgccgcgaa ctggggcgcg ctgcgcatcg ccgtctccac cgaggacctc 841 accgaccccg cctaccactg cgctcgcgtc gggcagcaca tcaagaggcg acttggcggc 901 gtcgacatat gcacggccga ggacatcctc accgacgaga agcgcgacat cctggtcaag 961 cacctcatcc cgcaggcgct gcagctgcac acggagcggc tgaaggtgcg gcaggtgcag 1021 gacaagtgga aggtgacggg catgggcgac gatgtgtgca gcgacttcaa ggtgccgccg 1081 gcgcacatca ccgatggcct gagcaacacc gacttcgtga tgtacgtcgc ctccgtgccg 1141 agcgaggagg gtgtgctggc gtgggccacg acctgccagg tgttctctga cggccatcca 1201 gccgtgggcg tcatcaacat ccccgcggcg aacattgcgt cgcggtacga ccagctggtg 1261 acgcgtgtcg tcacgcacga gatggcgcac gcgctcggct tcagcgtcgg cttcttcgaa 1321 ggcgcccgca tcctggagag catttcgaac gttcggcaca aggacttcga tgttcccgtg 1381 atcaacagca gcacggcggt ggcgaaggcg cgcgagcagt acggctgcga caccttggag 1441 tatctggaga tcgaggacca gggcggtgcg ggctccgccg ggtcgcacat caagatgcgc 1501 aacgcgcagg acgagctcat ggcgcctgcc gcagctgccg ggtactacag cgccctgacc 1561 atggccatct tccaggacct cggcttctac caggcggact tcagcaaggc cgaggtgatg 1621 ccgtggggcc ggaacgccgg ctgcgccttc ctcagcgaga agtgcatgga gcggaacatc 1681 acgaagtggc cggcgatgtt ctgcaatgag aacgaggtga ctatgcgctg ccccaccagt 1741 cgtctcagcc ttggaaagtg cggtgttacc cgtcacccgg accttccgcc gtactggcag 1801 tacttcacgg acccgtccct cgccggcatc tccgccttca tggactgctg ccctgtcgtg 1861 gagccctacg gtgatggcag ctgcgcacag cgtgcgtctg aagcgggcgc accattcaaa 1921 ggcttcaacg tcttctccga cgcggcgcgc tgcatcgatg gcgccttcag gccgaagacg 1981 agtcacggca taatcaagtc gtacgccgga ctgtgcgcca acgtgcggtg cgacacggcc 2041 acgcgcacgt acagcgtgca ggtgcacggc ggcagcggct acgccaactg cacgccgggc 2101 ctcagagttg agctgagcac cgtgagcagc gccttcgagg agggcggcta catcacgtgc 2161 ccgccgtacg tggaggtgtg ccagggcaac gtgcaggctg ccaaggacgg cggcaacgcc 2221 gcggctggtc gccgtggtcc gcgcgccgcg gcgacggcgc tgctggtggc cgcgctgctg 2281 gccgtggcgc tctagacggt ggataggacg ggtgctgatg gcgtgtcccc tgctcccccc 2341 tccctccctc cctctcgttg tctctcggaa gagctccacg ctgtcctttc atctcctcgc 2401 ctgttctacg cttgcttcgc tgcgccgctg caccgggccg gtcctcgccg accctcgcct 2461 gccctctccc cctcctctct cccgccaccc caccccgctc cccgctgcgc acggtgcctg 2521 tgcgcttgga gagaggtgca gcagcgcgcg ggagctgagg gagggagggg gtgtcgtgcg 2581 cgggtgcgca tgccttcttt cacttcctta tttgtcttct atttgttccc tgcggcaccc 2641 gcacaccccc acccgctggc ggccatccgc ggcatccgcg ggtgcgtgcg cggtgtgtct 2701 gccttctctc tcctcctttc gctctgtttc cctgtcctcg gactccccgg cgccagcgtg 2761 agctccgcag tcaccgccca cccggcgctc cggcgcggtc agcgccaccc caccccaccc 2821 cctctccccc attcgtgcgt gtctcttctc gctttttctg tttcctcttg tagcagggcg 2881 cgccgcgttg tgggagcggt ggcggcctct gcgcgcggac ggcatgcagg tcggccggga 2941 gagtctcccg ccagcgcccg cgcagcgcag agccgtcgcc cacccaccgt ctcctcccac 3001 cttcgcatgc cgccgcacta ggtgcacgtc gtcggcacga ccaccga // LOCUS PFATUBB 2833 bp ds-DNA INV 10-JUL-1990 DEFINITION P.falciparum beta-tubulin gene, complete cds. ACCESSION M28398 KEYWORDS tubulin. SOURCE P.falciparum (Brazilian strain 7G8, isolate 78G) nonsynchronous blood stage DNA, clone 768. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 2833) AUTHORS Sen,K. and Godson,G.N. TITLE Isolation of alpha and beta-tubulin genes of Plasmodium falciparum using a single oligonucleotide probe JOURNAL Mol. Biochem. Parasitol. 39, 173-182 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.N.Godson, 27-SEP-1989. FEATURES from to/span description pept 654 749 beta-tubulin, exon 1 1112 2064 beta-tubulin, exon 2 2228 2516 beta-tubulin, exon 3 IVS 750 1111 TUBB intron A IVS 2065 2227 TUBB intron B BASE COUNT 1018 a 311 c 455 g 1049 t ORIGIN 1 aattcctagt ttatttaatt taaaaattaa aagatcgaat gctcaacatt ttaaaaagaa 61 atctgtgaaa catatcttaa caagaaatgg tgtaacaaaa gaaacaatat taaatgataa 121 attaccaaag ataaatgatg aaattgacag aacatataat ggacacaaaa tggatgaaaa 181 tttacaggat aaacaaaaaa ggaatcatgg agtaaatata aaattaataa atgaatatga 241 aaatatcatg tgaagaataa attctcaaaa tcattgattg tatgacaaga ttcaagaatt 301 ggttatataa aaatatattt aggaaaagta attttgggtc atatgtatca acatttacag 361 gtgtatttgg aggtgctgca gctgttagct gtttctgcca taagtggagc ttgtataact 421 aaatttagtg ttacattggt tccggtattt gcatgttttg ggggtgtctt tgcgattatt 481 ataatattat taatattagg aacatggatg cttgttacat ggttatggca acacaaagaa 541 gtagtatttt tttttttttt taatttttac ttaatatatc ctcttacaat ataaaatatt 601 tatatattta aaaaaaaaag aaaaaatttt ctttgagatt attttattaa agaatgagag 661 aaattgttca tattcaagct ggccaatgtg gaaatcaaat aggtgcaaag ttttgggaag 721 tcatttctga tgagcatgga atagatccag taagtttaaa aaaaaaatat atttatttat 781 atgaatctgt aaacatatgt atatttatat atatatatat atatatatgg aagaataatt 841 ttgtgtgtat aatttggggt ccttcccctt tattgtattc tataaatgcc tcctttatat 901 tgataataat ttatatatgt aaacctttaa tgacgaggct tatatataaa aaccttagat 961 attataaata aatgtatatt atgtacatat gacgatatcg ctctctctat atatatatat 1021 atatatatat atatatattt atttatttat atatttattt atttatttat ttatttattt 1081 tttttttttt tttttatttt atttttttta gagtggtacc tatagtgggg acagtgactt 1141 acagttagaa agagttgacg ttttttacaa cgaagcaaca ggaggtagat atgttccaag 1201 agctatattg atggacttgg aacctggtac tatggatagt gttcgtgctg gcccctttgg 1261 tcaattattt cgtccagata attttgtgtt tggtcaaaca ggtgcaggaa ataattgggc 1321 taaaggacat tatactgaag gtgctgaatt gatagatgca gttttagatg tgcttagaaa 1381 agaagcagaa ggttgtgatt gtttacaagg atttcagatt actcattcat taggtggtgg 1441 tacaggtagt ggtatgggta ctttgttgat tagtaaaata agagaggagt atcctgatcg 1501 tattatggaa acattttctg tatttccatc accaaaagtt tctgatactg ttgttgaacc 1561 atataatgct acattatcag tccatcagtt ggttgaaaat gctgatgaag ttcaagttat 1621 cgataatgaa gctttatatg acatatgttt taggactctt aaattaacaa caccaacata 1681 tggagattta aatcaccttg tatcagctgc aatgtcaggt gtaacctgtt cgttaagatt 1741 tcctggtcaa cttaacagtg acttaagaaa attagctgtt aatttgatcc cattcccacg 1801 tttacatttc tttatgtacg ggtttgctcc tttaactagt agaggcagtc aacaatacag 1861 agccttaact gtgccggagt taacacaaca aatgttcgac gcaaaaaata tgatgtgcac 1921 aagtgatcca agacatggaa gatatttaac ggcatgtgct atgtttagag gaagaatgtc 1981 cacaaaggaa gttgacgaac aaatgttaaa cgttcaaaat aaaaactcat cttattttgt 2041 cgaatggatt cctcacaaca caaagtaaga aggaacaatt gatactagta tgcatgtttt 2101 tttgtttata tgtatttata tatatatata tatatatgta ttcatttata tattttgaaa 2161 tatacatttt acatataaat tttttttttt tctttttctt tttttttttt tttgtttttt 2221 tctttagatc aagtgtttgt gatattccac cattgggatt aaaaatggct gttacttttg 2281 taggaaactc aaccgccatt caagaaatgt ttaaaagagt ttctgatcaa tttactgcta 2341 tgtttagaag aaaagccttt ttgcactggt acaccggaga aggtatggac gagatggaat 2401 ttacagaagc tgaatcaaat atgaatgatt tagtttcaga atatcaacaa tatcaagatg 2461 ctacagcaga agaggaagga gaatttgaag aagaagaagg agacgtagaa gcctaaatct 2521 atttatattt atgaaaatat atacatatta tatatatatg tatatgtaat taacaagaat 2581 aaaaaataaa aaataaaaaa aaaataaaat aaaaaaataa aaatacataa taaaaaagta 2641 taaaataaat atctaatcat taattatata taacaatata atttaactct tttttttttt 2701 attattattg aagttatgtt cgggtatata taacatatat ataaattata tatatgttgc 2761 agtttctttt tttttttttt tttttttttt tcttatcatt tgattttaca ctcacatata 2821 tatgacatat ata // LOCUS RATADOME1 2513 bp ds-DNA ROD 10-JUL-1990 DEFINITION Rat S-adenosylmethionine decarboxylase pseudogene, complete cds. ACCESSION M34463 KEYWORDS S-adenosylmethionine decarboxylase; pseudogene. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2513) AUTHORS Pulkka,A., Keraenen,M.-R., Salmela,A., Salmikangas,P., Ihalainen,R. and Pajunen,A. TITLE Nucleotide sequence of rat S-adenosylmethionine decarboxylase cDNA. Comparison with an intronless rat pseudogene JOURNAL Gene 86, 193-199 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps 436 1379 pseudo-S-adenosylmethionine decarboxylase BASE COUNT 771 a 478 c 518 g 746 t ORIGIN 1 tctactaaac atgataaaga atttaagaaa tccatctctt cacttccagt ctatatatct 61 ttgagatgct attcaggata ctgagttaaa aaataagatt aggcttacac agcatggcgc 121 ggaacattag ctaactctca ctcaactctg acaagaaagc agcagactac atgagactga 181 actgtatctg cctttagttc caacagactc acgttcaact tttcttcacg aaaacagcca 241 gggaaatttt attagtcctt ttttaaaaat agttaatata aaattataac aacaacagca 301 gcagcaacaa caacaaggac cctgaactta gtaacacacg tggaacaaac cgtagcagcg 361 actggagcag tgggagaaga gatttaattt aggtgatttt tttggatttg ttggttgttg 421 gtcagcctca cagtgatgga agttgcacat atttttttga agggactgag aagctgctag 481 aggtctggtt ttccagacag cagtccgacg ccagccaggg acatggggat cttcatacca 541 tcccaagatc tcagtgggat gtgcttttga ggatgtgcag tcctcaacca taagtacgac 601 aaagatgcac aagcaggaag cttacacact cagtgagagt agcatgttta tacatttcat 661 gtgatactac cctcttactg aaagctctgg tttccaggtt gaagctcgct agggattacc 721 gtgggttaga ctcaattctt ttattctcat aagaatttca tgaagccctc tctccaaggg 781 tacccacacc gaaatttcca cgaagaaatc gaatttctta atgcagtttt cccaaatgga 841 gcagcatatt gtatgggaca aacgaattct gactgttggt acttatatac ttggatctcc 901 agagagccga gtcatcaaac agtcagatca accctgggaa ttctgatgag tgagcttgac 961 ctagcagtta cggaccagtt ctattgctgc aaaggatgtc actcgtgaga gtgaattcat 1021 gacctgatat caggtcattg atgacacact gtttaatcct tgcagcttct tgatgaatgg 1081 aatgaaatcg attggactag tcacatcgct ccagaagcag agttctctta tgttagcttt 1141 gaaacaaacc taagtgagac atcctatgac agcccgatca ggaaagttgg gaaattcgtc 1201 aagccaggaa aatttgtgac caccttgttt gttaatcaga gttctaaatg tcgcacaggc 1261 cattcttcat cccagaagat tgacggtttt aaacatcatg attgccaaag tgctatgctc 1321 aacgaagata aatgcaatat tgaatgtatc aaatgaaaag aattcagtct ctggtggagg 1381 gggattggag caaggatgaa tcagcccact aaagaaaact ccatggaaaa gacaggctat 1441 gcagtgcact ttaatcagct tcacacggtg cctaccatgc cttcactaac taaccaagta 1501 gtgatagaaa tgtccactaa gtcaaagcag aaatgtaata ctaagcattc tgacctcagt 1561 aagcaccacc attgccacca ttgccaccaa tttttactaa aggaaatttt gaatcaaatg 1621 aggatctgta gtttccgtct gttctgaggt cggctgttct ctttggtctt cgtttcacca 1681 tggcgctcag atgatcaaat gagtagctgc cagagggagg aatctccagg ttacttagcc 1741 tggagaatgg atgaatggat gaaacagcac aatattatga ctgtttagaa atacaggctt 1801 tcaagagtcg gcatgttagt ggcatttgta gatactgtgg aatttaagca gcaaagaaca 1861 aattggacta aatttcctat taattgccct cccactgttt cttggtagtt tctggactgg 1921 cacatcgatg tttttttttt ttttttcctt ccatatttaa aatgaagcac ttttttagca 1981 tttctaagca aagaatgcac ttggtttgta atcaagtagt tggaacgctg tctgaatgtt 2041 tactttatac accatgctga ttgaacgctt cattgaggaa gctttcagtc agttattggt 2101 ctgattctgt aatgagcaca gcacgtggtt tgaattgcca tttggaggac cagtgcttat 2161 ttaggctgga tcgcgtaaac cggtagattt tagcttgagg tttgattccc tcaccttata 2221 aaattaagaa ttctaatgtt gaaaattgca taggtttgtg tgaaacaaag cccagaagag 2281 aaactgtagg tagactagta atcttgtgta attataggtg agaagtttta gtgccgtaat 2341 ttctttgttg gcgttggact tttatcagct gaaatgtatt tctgtaccac aatgtaagct 2401 tcaataaagt ttgcttaatt gtctagtaac attaaaaaat ataagattaa tagaattgat 2461 ctcaacagta aggaaacaaa actaccttta ttattacata acataatctt tca // LOCUS RATADOMET 3102 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat S-adenosylmethionine decarboxylase mRNA, complete cds. ACCESSION M34464 M21155 J04048 M21783 KEYWORDS AdoMet decarboxylase; S-adenosylmethionine decarboxylase. SOURCE Rat prostate, cDNA to mRNA, clone pSAMr1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 232 to 1821) AUTHORS Pajunen,A., Crozat,A., Janne,O.A., Ihalainen,R., Laitinen,P.H., Stanley,B., Madhubala,R. and Pegg,A.E. TITLE Structure and regulation of mammalian S-adenosylmethionine decarboxylase JOURNAL J. Biol. Chem. 263, 17040-17049 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 3102) AUTHORS Pulkka,A., Keraenen,M.-R., Salmela,A., Salmikangas,P., Ihalainen,R. and Pajunen,A. TITLE Nucleotide sequence of rat S-adenosylmethionine decarboxylase cDNA. Comparison with an intronless rat pseudogene JOURNAL Gene 86, 193-199 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.Crozat, 27-OCT-1988. FEATURES from to/span description pept 273 1274 S-adenosylmethionine decarboxylase (EC 4.1.1.50) mRNA 1 3102 S-adenosylmethionine decarboxylase mRNA BASE COUNT 835 a 650 c 724 g 893 t ORIGIN 1 cggggaaagc agcggactac aagagactga actgtatctg cctctatttc caacggactc 61 acgttcaact ttcgctcacg aaaatagccg ggaaaatttt attagtcctt tttttaaaaa 121 aagttaatat aaaattatag caaaaaaaaa aaaaggaacc tgaactttag taacacagct 181 ggaacaatcc gcagcggcgg caggagcggc gggagaagag tttaatttag ttgattttct 241 gtggttgttg gttgttcgct agtctcacgg tgatggaagc tgcacatttt ttcgaaggga 301 ccgagaaact gctggaggtc tggttctcca gacagcagtc cgacgcaagc cagggatctg 361 gggaccttcg taccatccca agatccgagt gggatgtcct tctgaaggat gtgcagtgct 421 caatcataag tgtgacaaag actgacaagc aggaagctta tgtactcagt gagagtagca 481 tgtttgtctc caagagacgt ttcattttga agacatgtgg taccaccctc ttactgaaag 541 cactggttcc cctgttgaag cttgctaggg actacagtgg gtttgactcg attcaaagct 601 tcttttattc tcgtaagaat ttcatgaagc cttctcacca agggtaccca caccggaatt 661 tccaggaaga aatcgagttt cttaatgcaa ttttcccaaa cggagcagga tattgtatgg 721 gacgtatgaa ttctgactgt tggtacctgt acactttgga tctcccagag agccgagtaa 781 tcaatcagcc agatcaaacc ctggaaattc tgatgagtga gcttgaccca gcagttatgg 841 accagttcta catgaaagat ggtgttactg caaaggatgt cactcgtgag agtggaattc 901 gtgacctgat accaggttct gtcattgatg ccacactgtt caatccttgt ggctactcaa 961 tgaatggaat gaaatcggat ggaacatatt ggactattca catcactcca gaaccagaat 1021 tttcttatgt tagctttgaa acaaacctaa gtcagacctc ctatgatgac ctgatcagga 1081 aagttgtgga agtcttcaag ccaggaaaat ttgtgaccac cttgtttgtt aatcagagtt 1141 ctaagtgtcg cacagtgctt tcttcgcccc agaagattga cggtttcaaa cgtcttgatt 1201 gccagagcgc tatgttcaac gattacaatt ttgtttttac cagttttgct aagaaacagc 1261 aacaacagag ttgattagga aaaatgaaaa agaaaaaacg caaaaagaga agacacacag 1321 gaggtggtgg ctgctttcta gatgttgatc ctgggggcca tgctgaccgt gaccaccacc 1381 ttgtagctgc agaaagccct aggtgtaatg atagtgtaat cattttgaag tgtatgcatt 1441 attatatcaa ggagttagat atcttgcatg aatgctctct tctgtgttta ggtgttctat 1501 gccactcttg ctgtggaact gaagtgcatg tagaaaagaa ctctgactgt atgaatcttt 1561 acgacacttg tgaaaacgat tcgacttggt ttatgcacag cgtaatattt ctgcaggcat 1621 cgtccaaaat cccccacaga caaggctttc gtccccatta gatgcggcct cagctgacca 1681 ttggcgactg ttctatttgc tgccagagtt tttacatcca gttacctcca ctttctagag 1741 catattctct actaatgttc aaaaccgatt tctacttcat acgggtgtct tatgcaatgg 1801 caattaaagt tttcttccac aagttgagtc tttgtaagga aatgattcca gttgcttgtt 1861 ttgtgttcta ctgttttagt aattgctcct gcatttatag tcctatggtt tttcactacc 1921 cctgatgaag caatacacgg tcacactgtg ggcttacatt gtaatcttca ccccagatgg 1981 gagctcagag acggtccctt gctcattttt ccctaagatg tagaatgtgg ccttgctatt 2041 ggcatgccct tctgtggaag ataaatgatg gaagtgaaag tatcccgggg gtgagcaagg 2101 agaaccaccc catggcagtg atgggcttgg cagtgcactc cgagctctca cagtggagtg 2161 cccaccatgc cttcactaac tcactgagca gtgataggat gcccaccaag tcagagcaga 2221 aatctaaccc taaggattct cacctcggta agtgccgcca ttgccaccac tttactaaag 2281 gaagtttccg ctcagaggag agtctgtact tcccgcctgt cctaatgtca gctgttctct 2341 ctggtctttc accatggcgt tcagatgctc aaatgaatgg ctgatcggcc gcagggagga 2401 ctctccgggt tactgggcct ggagaatgga gaaacaggca cggtattctg acagttaatg 2461 gcaccagaga tgcgggcttt caagagctgg cctgttagtg gcatttttaa gcagaaaaga 2521 gcaaactaga cgaagttccc tatttattgc cctcccactg tttccttggc agtttctgga 2581 ctggcgcaat gatgccttgt tccttccgta tttataacga agctaaaaag cgtttctaag 2641 catggagtct acttggtttg aaatcaagtg gttggaacac tgtctggatt tttactttac 2701 gcagtgttga ttgaacgctt cgttggggaa gccttcagtc cgcttcatcg gtctgttctg 2761 taatgagcac agcacaccta gtttgaattg ctgtttggag ggccagtgct tatttgagct 2821 gggtcttgta acccagtaga ttttggcttg aggtctgact cccccatctt acgaaattaa 2881 gaattctaat gttggaaatt gcatagggtt tgcgtggaaa aaagcccagg gaaaaaaaaa 2941 aaaaaacaga aggcggacta gtgatctagt gtgattacag gcggggaagt tttggtgcca 3001 taatttcttt gttggtgttg gacttttaat cagctgaaat gtatttctgt accacaatgt 3061 aagcttcaat aaaagtttgc ttaattgtct agtaacatcc ag // LOCUS HUMCD38 1407 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human lymphocyte differentiation antigen CD38 mRNA, complete cds. ACCESSION M34461 KEYWORDS cell surface glycoprotein; lymphocyte differentiation antigen CD38; membrane glycoprotein. SOURCE Human PHA-treated peripheral blood cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1407) AUTHORS Jackson,D.G. and Bell,J.I. TITLE Isolation of a cDNA encoding the human CD38 (T10) molecule, a cell surface glycoprotein with an unusual discontinuous pattern of expression during lymphocyte differentiation JOURNAL J. Immunol. 144, 2811-2815 (1990) STANDARD simple staff_review FEATURES from to/span description pept 70 972 lymphocyte differentiation antigen CD38 /hgml_locus_uid="LZ0047A" /nomgen="CD38" /map="4" BASE COUNT 381 a 332 c 326 g 368 t ORIGIN 1 ctaaagctct cttgctgcct agcctcctgc cggcctcatc ttcgcccagc caaccccgcc 61 tggagcccta tggccaactg cgagttcagc ccggtgtccg gggacaaacc ctgctgccgg 121 ctctctagga gagcccaact ctgtcttggc gtcagtatcc tggtcctgat cctcgtcgtg 181 gtgctcgcgg tggtcgtccc gaggtggcgc cagacgtgga gcggtccggg caccaccaag 241 cgctttcccg agaccgtcct ggcgcgatgc gtcaagtaca ctgaaattca tcctgagatg 301 agacatgtag actgccaaag tgtatgggat gctttcaagg gtgcatttat ttcaaaacat 361 ccttgcaaca ttactgaaga agactatcag ccactaatga agttgggaac tcagaccgta 421 ccttgcaaca agattcttct ttggagcaga ataaaagatc tggcccatca gttcacacag 481 gtccagcggg acatgttcac cctggaggac acgctgctag gctaccttgc tgatgacctc 541 acatggtgtg gtgaattcaa cacttccaaa ataaactatc aatcttgccc agactggaga 601 aaggactgca gcaacaaccc tgtttcagta ttctggaaaa cggtttcccg caggtttgca 661 gaagctgcct gtgatgtggt ccatgtgatg ctcaatggat cccgcagtaa aatctttgac 721 aaaaacagca cttttgggag tgtggaagtc cataatttgc aaccagagaa ggttcagaca 781 ctagaggcct gggtgataca tggtggaaga gaagattcca gagacttatg ccaggatccc 841 accataaaag agctggaatc gattataagc aaaaggaata ttcaattttc ctgcaagaat 901 atctacagac ctgacaagtt tcttcagtgt gtgaaaaatc ctgaggattc atcttgcaca 961 tctgagatct gagccagtcg ctgtggttgt tttagctcct tgactccttg tggtttatgt 1021 catcatacat gactcagcat acctgctggt gcagagctga agattttgga gggtcctcca 1081 caataaggtc aatgccagag acggaagcct ttttccccaa agtcttaaaa taacttatat 1141 catcagcata cctttattgt gatctatcaa tagtcaagaa aaattattgt ataagattag 1201 aatgaaaatt gtatgttaag ttacttcctt tagagcacaa tggatctcga gggatcttcc 1261 atacctacca gttctgcgcc tgcgagtcgc ggccgcatct agaggatctt tgtgaaggaa 1321 ccttacttct gtggtgtgac ataattggac aaactaccta tagagattta aagctctaag 1381 gtaaatataa aatttttaag tgtataa // LOCUS MUSCD28 1492 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse glycoprotein CD28 mRNA, complete cds. ACCESSION M34563 KEYWORDS glycoprotein CD28. SOURCE Mouse lymphoma T cell line EL4, cDNA to mRNA, clone lambda-SSD1.5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1492) AUTHORS Gross,J.A., St John,T. and Allison,J.P. TITLE The murine homologue of the T lymphocyte antigen CD28: Molecular cloning and cell surface expression JOURNAL J. Immunol. 144, 3201-3210 (1990) STANDARD simple staff_review FEATURES from to/span description pept 57 713 glycoprotein CD28 precursor sigp 57 113 glycoprotein CD28 signal peptide matp 114 710 glycoprotein CD28 BASE COUNT 401 a 355 c 332 g 404 t ORIGIN 1 acacactctg ccttgctcac agaggagggg ctgcagccct ggccctcatc agaacaatga 61 cactcaggct gctgttcttg gctctcaact tcttctcagt tcaagtaaca gaaaacaaga 121 ttttggtaaa gcagtcgccc ctgcttgtgg tagatagcaa cgaggtcagc ctcagctgca 181 ggtattccta caaccttctc gcaaaggaat tccgggcatc cctgtacaag ggcgtgaaca 241 gcgacgtgga agtctgtgtc gggaatggga attttaccta tcagccccag tttcgctcga 301 atgccgagtt caactgcgac ggggatttcg acaacgaaac agtgacgttc cgtctctgga 361 atctgcacgt caatcacaca gatatttact tctgcaaaat tgagttcatg taccctccgc 421 cttacctaga caacgagagg agcaatggaa ctattattca cataaaagag aaacatcttt 481 gtcatactca gtcatctcct aagctgtttt gggcactggt cgtggttgct ggagtcctgt 541 tttgttatgg cttgctagtg acagtggctc tttgtgttat ctggacaaat agtagaagga 601 acagactcct tcaagtgact accatgaaca tgactccccg gaggcctggg ctcactcgaa 661 agccttacca gccctacgcc cctgccagag actttgcagc gtaccgcccc tgacagggac 721 ccctatccag aagcccgccg gctggtaccc gtctacctgc tcatcatcac tgctctggat 781 aggaaaggac agcctcatct tcagccggcc actttggacc tctactgggc caccaatgcc 841 aactatttta gagtgtctag atctaacatc atgatcatct tgagactctg gaatgaatga 901 cagaagcttc tatggcagga taaagtctgt gtggcttgac ccaaactcaa gcttaataca 961 tttattgact tgattgggga agttagagta gagcaatcaa aaagatcatt cattcagcct 1021 tgggaagtca atttgcaggc tcctggatga gccctgcccc gttttcactt gccagcacat 1081 ttcagtcatg tggtgtgata gccaaagatg ttttggacag agaagaaagg atagaaaaac 1141 cttctctttg gctaagttgg tgtttggggt ggggataggt tagagtatag tacttaacta 1201 tttgaaaaat aatgaaaaca cttttttcac tcatgaaatg agccacttag ctcctaaata 1261 gtgttttcct gttagtttag aaagttgtgg acatattttt ttaatgattt ctgaccattt 1321 ttaatcacat tgactcatgg aatggcctca aagcaccccc cagtgcttct ttcctcattc 1381 ccggtcatgg gaactcagta ttattaatag tcacaacatg atttcagaac tagatagccc 1441 tcccacacca agaagaatgt gagaggaagt aaggtcactt tatgtaaaaa cg // LOCUS MUSIGHAAU 294 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 123) mRNA V region, partial cds. ACCESSION M34523 KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (BALB/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig gamma-chain V-region (AA at 1) BASE COUNT 83 a 61 c 77 g 73 t ORIGIN 1 caggtccagc tgcagcagtc tgggccagag gtggtgaggc ctggggtctc agtgaagatt 61 tcctgcaagg gttccgacta cacattcact gattatgcta tgcactgggt gaagcagagt 121 catgcaaaga gtctagagtg gattggagtt attagtactt acaatggtaa tacaaactac 181 aaccagaagt ttaagggcaa ggccacaatg actgtagaca aatcctccag cacagcctat 241 atggaacttg ccagattgac atctgaggat tctgccatgt attactgtgt acgt // LOCUS MUSIGHAAV 294 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 126) mRNA V region, partial cds. ACCESSION M34524 KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (BALB/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig gamma-chain V-region (AA at 1) BASE COUNT 83 a 69 c 77 g 65 t ORIGIN 1 gaggtccagc tgcaacagtc tggacctgag ctggtgaagc ctggggcttc agtgaagata 61 tcctgcaaga cttctggata cacattcact gaatacacca tgcactgggt gaagcagagc 121 catggaaaga gccttgagtg gattggaggt attaatccta acaatggtgg ttctaactac 181 aaccagaagt tcaagggcaa ggccacattg actgtagaca agtcctccag cacagcctac 241 atggagctcc gcagcctgac atctgaggat tctgcagtct attactgtgc aaga // LOCUS MUSIGHAAW 294 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 125) mRNA V region, partial cds. ACCESSION M34525 KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig gamma-chain V-region (AA at 1) BASE COUNT 83 a 66 c 74 g 71 t ORIGIN 1 cagatccagt tggtgcagtc tggacctgaa ctgaagaagc ctggagagac agtcaagatc 61 tcctgcaagg cttctggtta taccttcaca gactattcaa tgcactgggt gaagcaggct 121 ccaggaaagg gtttaaagtg gatggactgg ataaacactg agactggtgt gccaacatat 181 gcagatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat 241 ttgcagatca acgacctcaa aaatgaggac acggctacat atttctgtac taga // LOCUS MUSIGHAAX 294 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig gamma-chain (anti-insulin Ab 127) mRNA V region, partial cds. ACCESSION M34526 KEYWORDS gamma-immunoglobulin; immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig gamma-chain V-region (AA at 1) BASE COUNT 81 a 80 c 61 g 72 t ORIGIN 1 gatgtgcagc ttcaggaggt aggacctgac ctggtgaaac cttctcagtc actttcactc 61 acctgcactg tcactggcta ctccatcacc agtggttata gctggcactg gatccggcag 121 tttccaggaa acaaactgga atggatgggc tacatacact acagtgatag ctctaactac 181 aacccatctc tcaaaagtcg aatctctatc actcgagaca catccaagaa ccagttcttc 241 ctgcagttga attctgtgac tactgaggac acagccacat attactgtgc aagg // LOCUS MUSIGKABI 300 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig kappa-chain (anti-insulin Ab 123) mRNA V region, partial cds. ACCESSION M34527 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 300 Ig kappa-chain V-region (AA at 1) BASE COUNT 74 a 86 c 70 g 70 t ORIGIN 1 caaattgttc tcacccagtc tccagcaatc atgtctgcat ctccagggga gaaggtcacc 61 atgacctgca gtgccagctc aagtgtaagt tacatgcact ggtaccagca gaagtcaggc 121 acctccccca aaagatggat ttatgacaca tccaaactgg cttctggagt ccctgctcgc 181 ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagcagcat ggaggctgaa 241 gatgctgcca cttattactg ccagcagtgg agtagtaaac cacccatcac gttcggtgct // LOCUS MUSIGKABJ 300 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig kappa-chain (anti-insulin Ab 126) mRNA V region, partial cds. ACCESSION M34528 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 300 Ig kappa-chain V-region (AA at 1) BASE COUNT 77 a 80 c 66 g 77 t ORIGIN 1 gatattgtgc taactcagtc tccagccacc ctgtctgtga ctccaggaga tagcgtcagt 61 ctttcctgca gggccagcca aagtattagc aacaacctac actggtatca acaaaaatca 121 catgagtctc caaggcttct catcaagtat gcttcccagt ccatctctgg gatcccctcc 181 aggttcagtg gcagtggatc agggacagat ttcactctca gtatcaacag tgtggagact 241 gaagattttg gaatgtattt ctgtcaacag agtaacagct ggcctcacac gttcggctcg // LOCUS MUSIGKABK 312 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig kappa-chain (anti-insulin Ab 127) mRNA V region, partial cds. ACCESSION M34529 KEYWORDS immunoglobulin heavy-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-region (AA at 1) BASE COUNT 78 a 81 c 77 g 76 t ORIGIN 1 gacattgtgc tgacccaatc tccagcttct ttggctgtgt ctctagggca gagggccacc 61 atatcctgca gagccagtga aagtgttgat agttatggca atagttttat gcactggtac 121 cagcagaaac caggacagcc acccaaactc ctcatctatc gtgcatccaa cctagaatct 181 gggatccctg ccaggttcag tggcagtggg tctaggacag acttcaccct caccattaat 241 cctgtggagg ctgatgatgt tgcaagctat tactgtcagc aaagtaatga ggaacctccc 301 acgttcggag gg // LOCUS MUSIGKABL 312 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V region, partial cds. ACCESSION M34530 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Ewulonu,U.K., Nell,L.J. and Thomas,J.W. TITLE V-H and V-L gene usage by murine IgG antibodies that bind autologous insulin JOURNAL J. Immunol. 144, 3091-3098 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-region (AA at 1) BASE COUNT 71 a 96 c 69 g 76 t ORIGIN 1 caaattgttc tcacccagtc tccaacaatc atgtctgcat ctctagggga acgggtcacc 61 atgacctgca ctgccagctc aagtgtaagt tccagttact tgcactggta ccagcagaag 121 ccaggatcct cccccaaact ctggatttat agtacatcca acctggcttc tggagtccca 181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgccag cagtatcatc gttccccacc cacgttcggt 301 gctgggacca ag // LOCUS HUMINSR01 2085 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 1. ACCESSION M23100 M32822 KEYWORDS Alu repetitive sequence; insulin receptor. SEGMENT 1 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1933) AUTHORS Seino,S., Seino,M., Nishi,S. and Bell,G.I. TITLE Structure of the human insulin receptor gene and characterization of its promoter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 114-118 (1989) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 2085) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept 1824 + 1923 human insulin receptor precursor, exon 1 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" sigp 1824 1904 human insulin receptor signal peptide matp 1905 + 1923 human insulin receptor pre-msg 1541 > 2085 hINSR mRNA and introns (alt.) pre-msg 1542 > 2085 hINSR mRNA and introns (alt.) pre-msg 1548 > 2085 hINSR mRNA and introns (alt.) IVS 1924 > 2085 hINSR intron A rpt < 1 76 Alu repeat BASE COUNT 417 a 631 c 702 g 335 t ORIGIN Chromosome 19p13.3-13.2. 1 agatctggcc attgcactcc agcctgggca acagagaaaa actccatcta aaaaaaaaaa 61 aaaaaaaaaa aaaaaacaga gagagagaga gagagagaga gaaggaaacg gaactggggg 121 gaggatttgc aaaaatatgg ttagggatgg cacttcagag atgaagccat cctggagtgt 181 tacgggcaag ggaaatgctg gggcaaagcc ccagaggcag gaataggttt ggcctgttgc 241 atgaacagtg ggtccagctc ctagcaaact gtttattgaa tgaaagaaga atgaatgcct 301 tgggtctagg gttgtgctgg gcgctttctt aagttttctt tcccgggtac ctccccagaa 361 ctggcatgca ggtattatta aacccattac acaagtgaaa ctggcccaga gacagaaaag 421 tccctggtcc aagaccacac aggagtgagg ggtggaggaa ccctcctccc attgagttct 481 ggctttccta tactgaaagc cccttcctct cctgcagtaa ggtaggtgga accgctgtcc 541 cgccttgttg gtgaatgtcg ttgctagact tcagacacat acaggctggt ctgctgaaaa 601 tcagagatgt ccacctgcgc cctattcgag gtctccggcg tcttctttgg cgtcgtcttt 661 gccctttcag aagcgtctgc acatttttcc aggtgtcatt tctccaactt gaacacaggg 721 agcgcactgg gcacgcgggc acgtggctgt ccccaggggc ctggcttggg tctcgcccct 781 gggccggggc gcacgcgcgg gcgggacatc tgggggcgcc cacgcgctct gggacgagtg 841 tcgctggcca ggcccggact gaggaaaggc gagtgagaca ctactcgcct ggggtgcaaa 901 atttaaggga gtgaaaaaaa aaaaaaaaga aagaaaccaa aaccacctcg agtcaccaaa 961 ataaacattt taatgcagta ttttttaaaa aatcaacagg aatcctccaa agcccactat 1021 gaacaaaata gcaaaatggt agagaaagga tctgtgccgc tgcgtcgggc ctgtggggcg 1081 cctccggggg tctgaaactg gaggagactc ggggctgtag ggcgcgcgga tctggggcgc 1141 gccctcggtc ccggcgcgcc cagggcctcc cgcgcggggc ccggcacagg gaggcgggga 1201 ggcgggcggg gcggggcggg accgggcggc acctccctcc cctgcaagct ttccctccct 1261 ctcctgggcc tctcccgggc gcagagtccc ttcctaggcc agatccgcgc cgccttttcc 1321 cgcggcccgc acggggccca gctgacgggc cgcgttgttt acgggccgga gcagccctct 1381 ctcccgccgc ccgcccgcca cccgccagcc caggtgcccg cccgccagtc agctagtccg 1441 tcggtccgcg cgtccctctg tcccggagcc cgcagatcgc gacccagagc gcgcggggcc 1501 gagagccgag agacagtccc gggcgcagcg cggagctccg ggccccgaga tcctgggacg 1561 gggcccgggc cgcagcggcc ggggggtcgg ggccaccacc gcaagggcct ccgctcagta 1621 tttgtagctg gcgaagccgc gcgcgccctt cccggggctg cctctgggcc ctccccggca 1681 ggggggctgc ggcccgcggg tcgcgggcgt ggaagagaag gacgcgcggc ccccagcgcc 1741 tcttgggtgg ccgcctcgga gcatgacccc cgcgggccag cgccgcgcgc tctgatccga 1801 ggagaccccg cgctcccgca gccatgggca ccgggggccg gcggggagcg gcggccgcgc 1861 cgctgctggt ggcggtggcc gcgctgctac tgggcgccgc gggccacctg taccccggag 1921 agggtgagtc tgggggcgcg ggcgtgggcg gggagcgccg cgatggggag aggaccccac 1981 ccaagccaaa atcgatcccc cgcttgtgga ctgagaaccc tccccagggg cggggggcgg 2041 tggccaggac ggtagctcct gcatcgcgta gggggagcgg gaagc // LOCUS HUMINSR02 928 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 2. ACCESSION M32823 KEYWORDS insulin receptor. SEGMENT 2 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 928) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 174 + 725 human insulin receptor precursor, exon 2 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 174 + 725 human insulin receptor pre-msg < 1 > 927 hINSR mRNA and introns IVS < 1 173 hINSR intron A IVS 726 > 927 hINSR intron B BASE COUNT 218 a 234 c 237 g 239 t ORIGIN About 25.0 kbp downstream of segment 1. 1 tactttacag agaaagctac tcatcccggc tggctgcaga gtttacaggg cccgggatga 61 aaacacaggg cccaggtttc ctgtccatga agccggctct gcccctgatc cttctgatgc 121 atccaccgtg cgtctgctca cctgtcttgc tttctgttca ttttctcttg tagtgtgtcc 181 cggcatggat atccggaaca acctcactag gttgcatgag ctggagaatt gctctgtcat 241 cgaaggacac ttgcagatac tcttgatgtt caaaacgagg cccgaagatt tccgagacct 301 cagtttcccc aaactcatca tgatcactga ttacttgctg ctcttccggg tctatgggct 361 cgagagcctg aaggacctgt tccccaacct cacggtcatc cggggatcac gactgttctt 421 taactacgcg ctggtcatct tcgagatggt tcacctcaag gaactcggcc tctacaacct 481 gatgaacatc acccggggtt ctgtccgcat cgagaagaac aatgagctct gttacttggc 541 cactatcgac tggtcccgta tcctggattc cgtggaggat aattacatcg tgttgaacaa 601 agatgacaac gaggagtgtg gagacatctg tccgggtacc gcgaagggca agaccaactg 661 ccccgccacc gtcatcaacg ggcagtttgt cgaacgatgt tggactcata gtcactgcca 721 gaaaggtacg ccggggatac agggttctaa gcagtgtctc gtgccttgtt ctagaaagct 781 taaaatgttt tatggcttaa aaatgttaaa tggtcattag gtaggggccg gggaatagtg 841 ggtggtggca ttcactagcc cagggagtgg cagacatttt ctgtaaagac tcagatagta 901 gatacttcag attttgcagg ccatatgg // LOCUS HUMINSR03 639 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 3. ACCESSION M32824 KEYWORDS insulin receptor. SEGMENT 3 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 639) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 114 + 435 human insulin receptor precursor, exon 3 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 114 + 435 human insulin receptor pre-msg < 1 > 639 hINSR mRNA and introns IVS < 1 113 hINSR intron B IVS 436 > 639 hINSR intron C BASE COUNT 134 a 171 c 163 g 171 t ORIGIN About 25.0 kbp downstream of segment 2. 1 gatccagaat tgctgcatat gcagacagga attggacaaa gccatttatt tatttattta 61 tttatttatt tatttattta tttatttccc tctctctctc tctctctctc cagtttgccc 121 gaccatctgt aagtcacacg gctgcaccgc cgaaggcctc tgttgccaca gcgagtgcct 181 gggcaactgt tctcagcccg acgaccccac caagtgcgtg gcctgccgca acttctacct 241 ggacggcagg tgtgtggaga cctgcccgcc cccgtactac cacttccagg actggcgctg 301 tgtgaacttc agcttctgcc aggacctgca ccacaaatgc aagaactcgc ggaggcaggg 361 ctgccaccaa tacgtcattc acaacaacaa gtgcatccct gagtgtccct ccgggtacac 421 gatgaattcc agcaagtgag ttctggatgt gggtctgggg ggcagccgag aggagaagga 481 acgtggggtt ggttgtgacg atgccgcttg ttaaaactgt gtgcaaaccc agggttaatt 541 ggctatgagt gaggtctctg ctctcagatg ctacttttgc accctgtttt ggtcctgggc 601 ttgggagtgg gagttgacta cctttttctc taaaggacc // LOCUS HUMINSR04 663 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 4. ACCESSION M32825 KEYWORDS insulin receptor. SEGMENT 4 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 663) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 318 + 466 human insulin receptor precursor, exon 4 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 318 + 466 human insulin receptor pre-msg < 1 > 663 hINSR mRNA and introns IVS < 1 317 hINSR intron C IVS 467 > 663 hINSR intron D BASE COUNT 159 a 195 c 171 g 138 t ORIGIN About 15.0 kbp downstream of segment 3. 1 ccaacatggt aaccccgtct ctactcaaaa atacaaaaat tagccaggca cggtggcggg 61 cacctataat cccagctact gtggaggctg aggcaggaga atctcttgaa cccagaaggc 121 agaggttgca gtgagctgag atcgcaccac tgcactccag cctgggcaac agagcgagac 181 tctgtcacac aaacacacac acacacacaa agaaatacca tatcaggcag aaagatgcct 241 gagatgtctg aaggaccttg gataccgtga cacccccctc ccctttctct ttctctctct 301 ctctgctccg tccttagctt gctgtgcacc ccatgcctgg gtccctgtcc caaggtgtgc 361 cacctcctag aaggcgagaa gaccatcgac tcggtgacgt ctgcccagga gctccgagga 421 tgcaccgtca tcaacgggag tctgatcatc aacattcgag gaggcagtga gtgtctctgt 481 gtgggcgtcg ggggtgcctg ttgggctcca tgtccctctg agctgtgagc ggggaagaaa 541 agcagtgcag accctgctgc gtgctcctac agcactttta ggatggtcgt tcagtggctc 601 ccccatggat agaaccatgc tgggagtctg cctcaaaacc tgaaatgaac agctcagtct 661 tcc // LOCUS HUMINSR05 410 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 5. ACCESSION M32826 KEYWORDS insulin receptor. SEGMENT 5 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 410) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 188 + 332 human insulin receptor precursor, exon 5 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 188 + 332 human insulin receptor pre-msg < 1 > 410 hINSR mRNA and introns IVS < 1 187 hINSR intron D IVS 333 > 410 hINSR intron E BASE COUNT 105 a 80 c 100 g 125 t ORIGIN About 3.0 kbp downstream of segment 4. 1 gggcagaagt atgcttgacc catttaagga atgctaagga cttcagattg tgttctaagc 61 atgatgagtt ttgagctggg tatgtccagt catttgcagc ctgagggtta tcttctcacc 121 atggagaatc atgagaagat tgaaatatgt ctatagaaac ccactggata ttctctcctt 181 tccttagaca atctggcagc tgagctagaa gccaacctcg gcctcattga agaaatttca 241 gggtatctaa aaatccgccg atcctacgct ctggtgtcac tttccttctt ccggaagtta 301 cgtctgattc gaggagagac cttggaaatt gggtacgtgg gcctgattgt gtgtatggcc 361 tgagtgctaa ctaggaagtt cgtgtattag aacaacttaa ggattttttt // LOCUS HUMINSR06 554 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 6. ACCESSION M32827 KEYWORDS insulin receptor. SEGMENT 6 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 554) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 189 + 403 human insulin receptor precursor, exon 6 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 189 + 403 human insulin receptor pre-msg < 1 > 554 hINSR mRNA and introns IVS < 1 188 hINSR intron E IVS 404 > 554 hINSR intron F BASE COUNT 154 a 129 c 130 g 141 t ORIGIN About 1.0 kbp downstream of segment 5. 1 ggccatgaaa acttcctcaa cttcctctgt tatccacatt caacaaatat gtgttgagta 61 tgtgccaagc aagtggagag gattaggcac gtagcactga acaagatcaa ctccgagcat 121 ggccacacca tcttggagtt gtagaagacc agccgttgaa tgactagatg tgtgtgtttt 181 ttccatagga actactcctt ctatgccttg gacaaccaga acctaaggca gctctgggac 241 tggagcaaac acaacctcac catcactcag gggaaactct tcttccacta taaccccaaa 301 ctctgcttgt cagaaatcca caagatggaa gaagtttcag gaaccaaggg gcgccaggag 361 agaaacgaca ttgccctgaa gaccaatggg gaccaggcat cctgtaagtc actggtcccc 421 aacctttttg gcacgaggga ccggtttagt ggaagatggt ttttccatgg actggtggtg 481 ggtggggatg gtttcagcat gattcaagtg cattacattt actatgcact ttattcctat 541 tatgattaca ttgt // LOCUS HUMINSR07 592 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 7. ACCESSION M32828 KEYWORDS insulin receptor. SEGMENT 7 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 592) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 277 + 403 human insulin receptor precursor, exon 7 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 277 + 403 human insulin receptor pre-msg < 1 > 592 hINSR mRNA and introns IVS < 1 276 hINSR intron F IVS 404 > 592 hINSR intron G BASE COUNT 125 a 144 c 144 g 179 t ORIGIN About 1.0 kbp downstream of segment 6. 1 ttgcgcgggt acagactgcg cttattcagt tgactgtctg gctgagtcaa gtcattggct 61 tacgtgagtg tgagtggcca agttgcaaaa ctggctctta cctttgaatc ttcccccatt 121 catactcagc caggcacatg gggaggagac ccttaaggga atagcagcat cacctctgcc 181 ttctcacggt ccctccagga agtgtggggg tcccaggctt tggtctgaaa ctacactgaa 241 atagctcatt tttgcctttt gttttaactt ttccaggtga aaatgagtta cttaaatttt 301 cttacattcg gacatctttt gacaagatct tgctgagatg ggagccgtac tggccccccg 361 acttccgaga cctcttgggg ttcatgctgt tctacaaaga ggcgtaagta gaagagttag 421 agagacgctg aggaggcgag ggctggctgg ctctgtgctt gctacgtttg tgctccaatc 481 tgcccctctt gggttcctgt ctatctccct cctcctcctg gaataaatat cttaggttcc 541 tttttacaat ctcaccagtc gatggcatgc aaagtcaata gtgtctgctt tt // LOCUS HUMINSR08 401 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 8. ACCESSION M32829 KEYWORDS insulin receptor. SEGMENT 8 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 401) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 124 + 374 human insulin receptor precursor, exon 8 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 124 + 374 human insulin receptor pre-msg < 1 > 401 hINSR mRNA and introns IVS < 1 123 hINSR intron G IVS 375 > 401 hINSR intron H BASE COUNT 90 a 98 c 112 g 101 t ORIGIN About 3.0 kbp downstream of segment 7. 1 cattagattg ttgggtgagt aacatgtgac cctatgggat gtaacttccc aggcctcatc 61 tgcacggcac tcagtgtgac ggtcttgtaa gggtaactgc cttctgctgt tttgtcttga 121 aagcccttat cagaatgtga cggagttcga tgggcaggat gcgtgtggtt ccaacagttg 181 gacggtggta gacattgacc cacccctgag gtccaacgac cccaaatcac agaaccaccc 241 agggtggctg atgcggggtc tcaagccctg gacccagtat gccatctttg tgaagaccct 301 ggtcaccttt tcggatgaac gccggaccta tggggccaag agtgacatca tttatgtcca 361 gacagatgcc accagtgagt gtgtcttggg aatgtgaatt c // LOCUS HUMINSR09 420 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 9. ACCESSION M32830 KEYWORDS insulin receptor. SEGMENT 9 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 420) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 106 + 273 human insulin receptor precursor, exon 9 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 106 + 273 human insulin receptor pre-msg < 1 > 420 hINSR mRNA and introns IVS < 1 105 hINSR intron H IVS 274 > 420 hINSR intron I BASE COUNT 85 a 125 c 94 g 116 t ORIGIN About 3.0 kbp downstream of segment 8. 1 ggtgccctca tgatgtcttt aacttgtgtg tcccccgcca tcctcccacc agctttcttt 61 gcacactgtt tctcatgatg gacccgtttc ctttctccct ggcagacccc tctgtgcccc 121 tggatccaat ctcagtgtct aactcatcat cccagattat tctgaagtgg aaaccaccct 181 ccgaccccaa tggcaacatc acccactacc tggttttctg ggagaggcag gcggaagaca 241 gtgagctgtt cgagctggat tattgcctca aaggtgagtg caggcagctg tgctaggatc 301 ggtggggttt gcacacgtgt gtctgatgca ctttgcttca cctctaggga agcagctatc 361 tcttcctgtg tctcagtgtc ggaaggcaca cacacacact ccattctatc tcatatgaaa // LOCUS HUMINSR10 517 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 10. ACCESSION M32831 KEYWORDS insulin receptor. SEGMENT 10 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 517) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 187 + 388 human insulin receptor precursor, exon 10 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 187 + 388 human insulin receptor pre-msg < 1 > 517 hINSR mRNA and introns IVS < 1 186 hINSR intron I IVS 389 > 517 hINSR intron J BASE COUNT 83 a 88 c 194 g 152 t ORIGIN About 11.0 kbp downstream of segment 9. 1 tttgtggtgt gtgtatgtgt ggtgtgttgt gtgatgtgtg tggtgtgtgt gtgggggggt 61 gtgtggtgtg tgtatgtgtg gtgtgtgtgg tgtgtgtgtg tggtgtgtgt gtgtgggggg 121 ggtgtgtgtg tgtatgtgtg ttcagccgca gagacttgag cccccctttt ctgtttcttt 181 ctccagggct gaagctgccc tcgaggacct ggtctccacc attcgagtct gaagattctc 241 agaagcacaa ccagagtgag tatgaggatt cggccggcga atgctgctcc tgtccaaaga 301 cagactctca gatcctgaag gagctggagg agtcctcgtt taggaagacg tttgaggatt 361 acctgcacaa cgtggttttc gtccccaggt caggacttgg cgctgggctc tcttagtggg 421 tgccaattgg cttggtgttg gtggaaggtc attacttagg gaccgagagg tagtgggagg 481 gagagacggc agaaccctgg gtggagtctg aatggag // LOCUS HUMINSR11 343 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 11. ACCESSION M32832 KEYWORDS insulin receptor. SEGMENT 11 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 343) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 123 + 158 human insulin receptor precursor, exon 11 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 123 + 158 human insulin receptor pre-msg < 1 > 343 hINSR mRNA and introns IVS < 1 122 hINSR intron J IVS 159 > 343 hINSR intron K BASE COUNT 68 a 97 c 98 g 80 t ORIGIN About 2.0 kbp downstream of segment 10. 1 tggtccaggg tcaaagccag ggtgccctta ctcggacaca tgtggcctcc aagtgtcaga 61 gcccagtggt ctgtctaatg aagttccctc tgtcctcaaa ggcgttggtt ttgtttccac 121 agaaaaacct cttcaggcac tggtgccgag gaccctaggt atgactcacc tgtgcgaccc 181 ctggtgcctg ctccgcgcag ggccggcggc gtgccaggca gatgcctcgg agaacccagg 241 ggtttctctg gctttttgca tgcggcgggc agctgtgctg gagagcagat gcttcaccaa 301 ttcagaaatc caatgccttc actctgaaat gaaatctggg cat // LOCUS HUMINSR12 719 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 12. ACCESSION M32833 KEYWORDS insulin receptor. SEGMENT 12 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 719) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 161 + 435 human insulin receptor precursor, exon 12 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 161 + 435 human insulin receptor pre-msg < 1 > 719 hINSR mRNA and introns IVS < 1 160 hINSR intron K IVS 436 > 719 hINSR intron L BASE COUNT 137 a 198 c 195 g 189 t ORIGIN About 8.0 kbp downstream of segment 11. 1 ggtcattcct ggcagtctgt attgtaatcc atgttcccca ttgctgcacc ctcctgcgct 61 ctgatctttc ttcttaatca agccttttat tctccagtgt cactttttta aaaaaaatga 121 tggtgatggt gtcatcatac atgtcctact gtcgttccag gccatctcgg aaacgcaggt 181 cccttggcga tgttgggaat gtgacggtgg ccgtgcccac ggtggcagct ttccccaaca 241 cttcctcgac cagcgtgccc acgagtccgg aggagcacag gccttttgag aaggtggtga 301 acaaggagtc gctggtcatc tccggcttgc gacacttcac gggctatcgc atcgagctgc 361 aggcttgcaa ccaggacacc cctgaggaac ggtgcagtgt ggcagcctac gtcagtgcga 421 ggaccatgcc tgaaggtagg gctgctggtc cggggtccga gtgtcatggg tgggacatca 481 aggctgactt tttgtttgag acggagcctt gctctgtcgc ccaggctgga gtacagtggt 541 gcgacctcag ctcactccag cctctgccac ctatgtcaag tgattccctg cttcagcctc 601 ccaagtagct gggactacag gtgtctgcca ccacgcccag ctaatttttg tatttttagt 661 agagatgggg tttcaccata ttgcccaggc tggtcttgaa ctcctgggct caagtgatc // LOCUS HUMINSR13 439 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 13. ACCESSION M32834 KEYWORDS insulin receptor. SEGMENT 13 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 439) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 93 + 232 human insulin receptor precursor, exon 13 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 93 + 232 human insulin receptor pre-msg < 1 > 439 hINSR mRNA and introns IVS < 1 92 hINSR intron L IVS 233 > 439 hINSR intron M BASE COUNT 98 a 114 c 105 g 122 t ORIGIN About 1.0 kbp downstream of segment 12. 1 gtcaccagcc caaggttgca ccatggacag gtggcagaag tgggatctca tccaagagtt 61 acatccctgc ctctcacttc ctctccttac agccaaggct gatgacattg ttggccctgt 121 gacgcatgaa atctttgaga acaacgtcgt ccacttgatg tggcaggagc cgaaggagcc 181 caatggtctg atcgtgctgt atgaagtgag ttatcggcga tatggtgatg aggtaaggcc 241 cttgactctt gggcatgccc ctgcaccact tcagcatgcc ccttcagagt tgcacttggt 301 acctccttcc tctgctgaaa ttttgattcc agtgcttctc tcatcaggta ctgtgctatt 361 agtacttaaa gccttgatac ctgacttcgc aggaagatgg gtcagaaatg ccaatctacc 421 agcttgttac ttttcttag // LOCUS HUMINSR14 386 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 14. ACCESSION M32835 KEYWORDS insulin receptor. SEGMENT 14 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 386) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 85 + 244 human insulin receptor precursor, exon 14 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 85 + 244 human insulin receptor pre-msg < 1 > 386 hINSR mRNA and introns IVS < 1 84 hINSR intron M IVS 245 > 386 hINSR intron N BASE COUNT 62 a 123 c 115 g 86 t ORIGIN About 6.0 kbp downstream of segment 13. 1 tggctgtgag ctccctgcga ggggtggaca ctcccagatg tgcaaagctc agccaccctc 61 cttctcctcc tctcttcctc ccaggagctg catctctgcg tctcccgcaa gcacttcgct 121 ctggaacggg gctgcaggct gcgtgggctg tcaccgggga actacagcgt gcgaatccgg 181 gccacctccc ttgcgggcaa cggctcttgg acggaaccca cctatttcta cgtgacagac 241 tattgtaagt ctccatggca gcctcagctg actggggctg tgcttagcac tgagcatggt 301 gggacattgc aggggatgac ttggagaggc cgcagtgctg gccctggcct tgactctcag 361 gcctatcagc tgctgcggtg cttgcc // LOCUS HUMINSR15 429 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 15. ACCESSION M32836 KEYWORDS insulin receptor. SEGMENT 15 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 429) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 92 + 194 human insulin receptor precursor, exon 15 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 92 + 194 human insulin receptor pre-msg < 1 > 428 hINSR mRNA and introns IVS < 1 91 hINSR intron N IVS 195 > 428 hINSR intron O BASE COUNT 117 a 67 c 82 g 163 t ORIGIN About 3.0 kbp downstream of segment 14. 1 cccacccatt ccaggagtgg atgtgatttt tgatgtgaac tttgttggaa acacattgat 61 atgaaacata tattttctta ttctatttca gtagacgtcc cgtcaaatat tgcaaaaatt 121 atcatcggcc ccctcatctt tgtctttctc ttcagtgttg tgattggaag tatttatcta 181 ttcctgagaa agaggtgagt tcagtgagtt cagtggtgtg ctgggaacag ttggttctct 241 gggggaaaac atgccttgat ataggtatag gcatatttaa gtttattatg aattttgctg 301 atataggatg tgtaacatgc aatttacaga taattgtcat aatatgatat acacaactct 361 ttattgtaaa ttccctctag acagttgatt ctcacagaat gtttttattg attttttttt 421 ttgcccaaa // LOCUS HUMINSR16 480 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 16. ACCESSION M32837 KEYWORDS insulin receptor. SEGMENT 16 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 480) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 261 + 328 human insulin receptor precursor, exon 16 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 261 + 328 human insulin receptor pre-msg < 1 > 480 hINSR mRNA and introns IVS < 1 260 hINSR intron O IVS 329 > 480 hINSR intron P BASE COUNT 123 a 131 c 109 g 117 t ORIGIN About 2.0 kbp downstream of segment 15. 1 aaaaacaaaa acaaaaacaa aacaaaaaaa aaaccaccca gggagggatg agtgctccca 61 tgttgatgca cttacatacc tgtctgatgg gcttccattc aaaacataaa ggtcccccat 121 ccctgcccta gactgcatct aggattatgg ggattctgct ggtaagggct gccatttgcc 181 ttggggagtc ttgtatgaaa cacctttctg cagagtccca tgagaatctc aagctaacgt 241 gcctcgtttt cctcctccag gcagccagat gggccgctgg gaccgcttta cgcttcttca 301 aaccctgagt atctcagtgc cagtgatggt gagtaccatc ccttccctgt gggtggccag 361 aaccctactc atcagcttcc tttgccttca ccattgagtg agagtgaagg atgggttccc 421 cagggaggcc aagaaaagcc ctcttattca tttgagcttg ccaaactgcc cttgctgcag // LOCUS HUMINSR17 485 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 17. ACCESSION M32838 KEYWORDS insulin receptor. SEGMENT 17 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 485) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 136 + 380 human insulin receptor precursor, exon 17 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 136 + 380 human insulin receptor pre-msg < 1 > 485 hINSR mRNA and introns IVS < 1 135 hINSR intron P IVS 381 > 485 hINSR intron Q BASE COUNT 96 a 119 c 162 g 108 t ORIGIN About 1.0 kbp downstream of segment 16. 1 cccggcatgg gtcctggatc acagaactca tttcatgagt gttttcgagg gggtttgggt 61 gagggcttgg gtggaaggtg gctgcagacc cccaagggat cctccaagga tgctgtgtag 121 ataagtaaga agtagtgttt ccatgctctg tgtacgtgcc ggacgagtgg gaggtgtctc 181 gagagaagat caccctcctt cgagagctgg ggcagggctc cttcggcatg gtgtatgagg 241 gcaatgccag ggacatcatc aagggtgagg cagagacccg cgtggcggtg aagacggtca 301 acgagtcagc cagtctccga gagcggattg agttcctcaa tgaggcctcg gtcatgaagg 361 gcttcacctg ccatcacgtg gtgagtccag tgggggtggg acatgggctg gctttcctga 421 cccttccctt tctctgcctc ctcctcctgc acagagcgac agaggacaca gggtgtatcc 481 tccta // LOCUS HUMINSR18 287 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 18. ACCESSION M32839 KEYWORDS insulin receptor. SEGMENT 18 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 287) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 117 + 227 human insulin receptor precursor, exon 18 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 117 + 227 human insulin receptor pre-msg < 1 > 287 hINSR mRNA and introns IVS < 1 116 hINSR intron Q IVS 228 > 287 hINSR intron R BASE COUNT 51 a 85 c 98 g 53 t ORIGIN About 2.0 kbp downstream of segment 17. 1 acgctgcatc caggccacag ggtgctgtgt gtgacataga caccagggag ggaggagaac 61 cctggtgagt cgaatcacgg accctcctcc aagaaccctg gttgcttgct ctgcaggtgc 121 gcctcctggg agtggtgtcc aagggccagc ccacgctggt ggtgatggag ctgatggctc 181 acggagacct gaagagctac ctccgttctc tgcggccaga ggctgaggta agctgcttcg 241 ggggacccag cggggtactc ggtggagcac ccgctcctgg cctcctc // LOCUS HUMINSR19 322 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 19. ACCESSION M32840 KEYWORDS insulin receptor. SEGMENT 19 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 322) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Bell, 14-MAR-1990. FEATURES from to/span description pept + 45 + 204 human insulin receptor precursor, exon 19 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 45 + 204 human insulin receptor pre-msg < 1 > 322 hINSR mRNA and introns IVS < 1 44 hINSR intron R IVS 205 > 322 hINSR intron S BASE COUNT 81 a 76 c 79 g 86 t ORIGIN About 0.5 kbp downstream of segment 18. 1 gatcccagtg ctgctgaaac accaaccccg tgtttctgtt ttagaataat cctggccgcc 61 ctccccctac ccttcaagag atgattcaga tggcggcaga gattgctgac gggatggcct 121 acctgaacgc caagaagttt gtgcatcggg acctggcagc gagaaactgc atggtcgccc 181 atgattttac tgtcaaaatt ggaggttcgt ctggctttct gctttgaaaa cataacgacc 241 caggccaggt ttgatttcag aaggaagttg tctataatga gccgttaagt cttttctgat 301 aatataaagg ggcaagtact tc // LOCUS HUMINSR20 288 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 20. ACCESSION M32841 KEYWORDS insulin receptor. SEGMENT 20 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 288) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 115 + 244 human insulin receptor precursor, exon 20 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 115 + 244 human insulin receptor pre-msg < 1 > 288 hINSR mRNA and introns IVS < 1 114 hINSR intron S IVS 245 > 288 hINSR intron T BASE COUNT 61 a 55 c 102 g 70 t ORIGIN About 0.5 kbp downstream of segment 19. 1 gacgtgggcc aggtgaaccc ctcttagggc tctgtgagag gtggggcagt caaggtggca 61 gatgctagga ccaaggctga aggttaagag cgtgtgaacc ttttgtgttg tcagactttg 121 gaatgaccag agacatctat gaaacggatt actaccggaa agggggcaag ggtctgctcc 181 ctgtacggtg gatggcaccg gagtccctga aggatggggt cttcaccact tcttctgaca 241 tgtggtgagt tgtgtgtgga tgggtggatg gacgctgggc ttgaattc // LOCUS HUMINSR21 407 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 21. ACCESSION M32842 KEYWORDS insulin receptor. SEGMENT 21 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 407) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 101 + 235 human insulin receptor precursor, exon 21 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 101 + 235 human insulin receptor pre-msg < 1 > 407 hINSR mRNA and introns IVS < 1 100 hINSR intron T IVS 236 > 407 hINSR intron U BASE COUNT 73 a 75 c 118 g 141 t ORIGIN About 1.0 kbp downstream of segment 20. 1 ttgcgtgtgt gtgtgcgttt gcgtgtgtgt gtttgcgcgc gcgcgtgtgt gtgtgtgtct 61 aaatggcttc tttgttacta ctatcaactg tcatcggcag gtcctttggc gtggtccttt 121 gggaaatcac cagcttggca gaacagcctt accaaggcct gtctaatgaa caggtgttga 181 aatttgtcat ggatggaggg tatctggatc aacccgacaa ctgtccagag agagtgtaag 241 tgtagaaagg gtttaaggtg tgtgaggtgt tcgttgaaag ggtattgccc tttacacgtg 301 tgcttggttt tgcctttcct atgtctacac gctcaccgtg tttgcatgct gtatgttaca 361 ggtgtgtttg tgtttgcata gcttgtcttt acatgcatgc ttgcatt // LOCUS HUMINSR22 873 bp ds-DNA PRI 10-JUL-1990 DEFINITION Human insulin receptor (hINSR) gene, exon 22. ACCESSION M32972 KEYWORDS insulin receptor. SEGMENT 22 of 22 SOURCE Human fetal liver DNA, clone lambda-hINSR-[1-13]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 873) AUTHORS Seino,S., Seino,M. and Bell,G.I. TITLE Human insulin-receptor gene: Partial sequence and amplification of exons by polymerase chain reaction JOURNAL Diabetes 39, 123-128 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.I.Bell, 14-MAR-1990. FEATURES from to/span description pept + 83 437 human insulin receptor precursor, exon 22 /hgml_locus_uid="LG0007M" /nomgen="INSR" /map="19p13.3-p13.2" matp + 83 434 human insulin receptor pre-msg < 1 873 hINSR mRNA and introns IVS < 1 82 hINSR intron U BASE COUNT 199 a 217 c 234 g 223 t ORIGIN About 2.0 kbp downstream of segment 21. 1 ctgcagggac aagagtgggg gtttgggagg atgcgtggca gggcccccag actcacccag 61 gacgtgtcct tctgccccgc agcactgacc tcatgcgcat gtgctggcaa ttcaacccca 121 agatgaggcc aaccttcctg gagattgtca acctgctcaa ggacgacctg caccccagct 181 ttccagaggt gtcgttcttc cacagcgagg agaacaaggc tcccgagagt gaggagctgg 241 agatggagtt tgaggacatg gagaatgtgc ccctggaccg ttcctcgcac tgtcagaggg 301 aggaggcggg gggccgggat ggagggtcct cgctgggttt caagcggagc tacgaggaac 361 acatccctta cacacacatg aacggaggca agaaaaacgg gcggattctg accttgcctc 421 ggtccaatcc ttcctaacag tgcctaccgt ggcgggggcg ggcaggggtt cccattttcg 481 ctttcctctg gtttgaaagc ctctggaaaa ctcaggattc tcacgactct accatgtcca 541 gtggagttca gagatcgttc ctatacattt ctgttcatct taaggtggac tcgtttggtt 601 accaatttaa ctagtcctgc agaggattta actgtgaacc tggagggcaa ggggtttcca 661 cagttgctgc tcctttgggg caacgacggt ttcaaaccag gattttgtgt tttttcgttc 721 cccccacccg cccccagcag atggaaagaa agcacctgtt tttacaaatt cttttttttt 781 tttttttttt tttttttttg ctggtgtctg agcttcagta taaaagacaa aacttcctgt 841 ttgtggaaca aaatttcgaa agaaaaaacc aaa // LOCUS BT1NAMTA 1091 bp ds-DNA PHG 10-JUL-1990 DEFINITION Bacteriophage T1 DNA N-6-adenine-methyltransferase (M.T1) gene, complete cds. ACCESSION J05393 KEYWORDS DNA N-6-adenine-methyltransferase. SOURCE Bacteriophage T1 DNA. ORGANISM Bacteriophage T1 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1091) AUTHORS Schneider-Scherzer,E., Auer,B., de Groot,E.J. and Schweiger,M. TITLE Primary structure of a DNA (N-6-adenine)-methyltransferase from Escherichia coli virus T1 JOURNAL J. Biol. Chem. 265, 6086-6091 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 171 824 DNA N-6-adenine-methyltransferase (M.T1) pept 824 1072 pot. protein HP 83 binding 161 164 ribosomal binding site (put.) binding 813 816 ribosomal binding site (put.) signal 141 146 TATA box BASE COUNT 345 a 205 c 266 g 275 t ORIGIN 1 aaaagggaag tttctcaaaa aggtccggga gcgtggcggc ttctctgccg tcgcatacgg 61 attcgggcaa ttcaagatcg caatttacga aatgatgaaa tagcactttt tgttaaaact 121 gccgggatgg aatctggcat tattatctca ccaaaacgag aggaataaaa atgaaagact 181 ttaatgatat cgaaactatc gactttgcag aaactggttg ctcattcact cgcgaagcaa 241 tagcatcagg cggttattat caggcattga aaacgccaac ctgtaaagag atttcagggc 301 gtcgatacaa ggggacaaat acccctgacg ctgttcgtga tttatggtca actccgcgag 361 aggttattgc ataccttgag ggtcgttatg ggaaatatga tctcgacgct gcggcaagcg 421 aagaaaataa agtttgcgag aagttttact ctcaggaaac aaactgctta aaacgttggt 481 ggggaaagaa taagcacgtt tggttaaatc ctccttatag ccgacctgat atatttgtca 541 actctactgc gtggtttact gaagcgcggc agaacgcagc tgaaataatc tggattgaag 601 cggacttgac tgaggatatt gacggcaatg aatacgcacg atccggtcgc ctggctttca 661 tatccggtga aactggaaag gccgtagacg gtaataacaa aggttcggta atttttatta 721 tgcgcgaact taaagaaggt gaggtgcaac agactcacta catcccaatc acaagcattt 781 gcccttcggt gaaaaacaaa cgagcaaagg tgaggaaagt atgatgagcg aaaaaatggt 841 tcctgttaaa ttaactgagc aaggtttatg gctactttat cgagctacgt gctgcgaaat 901 tatggagcga aacggattga ctcaggatgt tattggttgc gatctgtggg agttcactag 961 ttctcttgat atgcttttcg atgagataaa aaatgaatac atagagaact ggccttcaat 1021 catacagaaa gacgtggaag aacttaaagc tgatacaatc gtacagcact aattgctaaa 1081 actacccggc g // LOCUS STVBLSG 1130 bp ds-DNA BCT 10-JUL-1990 DEFINITION Streptoverticillum sp. blasticidin S-acetyltransferase (bls) gene, complete cds. ACCESSION M34537 KEYWORDS blasticidin S-acetyltransferase. SOURCE Streptoverticillum sp. (strain JCM4673) DNA. ORGANISM Streptoverticillum sp. Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1130) AUTHORS Perez-Gonzalez,J.A., Ruiz,D., Esteban,J.A. and Jimenez,A. TITLE Cloning and characterization of the gene encoding a blasticidin S acetyltransferase from Streptoverticillum sp JOURNAL Gene 86, 129-134 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 147 557 blasticidin S-acetyltransferase (bls) (147 could be 225) binding 136 139 ribosomal binding site (put.) BASE COUNT 151 a 420 c 394 g 165 t ORIGIN 1 gatcagcgcc ggcccccacc ggcactgtgc atcagcgtac ggccggggta cgacaacgga 61 agcggattgg caaaactgcc tggccccggt gtttatggtg agctttatgt tcagtattga 121 ggcggtgaac gacccggaac gacgcgatgt tgtccttgcc acggttgcag accgtcaacg 181 acgaacgttc gcccgccctg cgggcgttgc ggcgcacgcc ggtgatggag gcgcggccgc 241 tggaggtgta cgccacgtac gcctgcggcg agcgcgggga gctggcgggc gggctcgtcg 301 gtcatgtgca gtggcaatgg ctgcacgtgg acctgctgtg ggtggacgcg ggggcccgcg 361 gggcggggct gggctcgcgg ttgatcgcgc gggcggaggc ccgcgcccgg gaggagttcg 421 gctgcatcgg cagccaggtg gagacctggg acttccaggc gccggggttc taccagcggg 481 tggggtatcg cctcgcggcg agcatcccgg actatccgcc cgggatcacg agccacctgc 541 tggtgaagga gctttgaggc gccccgtcag gggcgcgggg ccgttactcc ggggctgcgc 601 cccggacccc cgggtggcgc gtcgactgcg ggccggtggg ggcttgtcgc gcagttcccc 661 gcgcccctta cggggcgcct ggtcgcgccc acgcggcgga gccgcatatc gagcacagcc 721 ccgcgcccct tacggggcgc tgctctaggc cacccgccgt gccccctccc ccgccgccgt 781 gccgaacagt cgtgccgtcc ccagtgcctc ggtgaccacc ttggtcaccc tttcctcatc 841 tgccccatcc accaaggcga ttgccgagcc gccgaagccg ccgcccgtca tccgggcccc 901 cagggccccc gccttcaccg ccgtctccac caccacgtcc aattccgcac aggacacccg 961 gaagtcgtcg cgcagcgagg cgtgcccctc cgtcagcagt gggcccacag ccctcgcatc 1021 ccccgcggcc agcagggccg cgacccgctc cacccggtcg ttctccgtca ccacgtgacg 1081 gaccaaggcg cgctccgcgg caggcaactc acccagtgcc gcctgcagac // LOCUS HUMGAPDH 1268 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) mRNA, complete cds. ACCESSION M33197 KEYWORDS glyceraldehyde-3-phosphate dehydrogenase. SOURCE Human lung cancer cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1268) AUTHORS Tokunaga,K., Nakamura,Y., Sakata,K., Fujimori,K., Ohkubo,M., Sawada,K. and Sakiyama,S. TITLE Enhanced expression of a glyceraldehyde-3-phosphate dehydrogenase gene in human lung cancers JOURNAL Cancer Res. 47, 5616-5619 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 61 1068 glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.1.12) /hgml_locus_uid="LM0055R" /nomgen="GAPD" /map="12p13" mRNA < 1 1268 GAPDH mRNA BASE COUNT 295 a 385 c 326 g 262 t ORIGIN 1 gttcgacagt cagccgcatc ttcttttgcg tcgccagccg agccacatcg ctcagacacc 61 atggggaagg tgaaggtcgg agtcaacgga tttggtcgta ttgggcgcct ggtcaccagg 121 gctgctttta actctggtaa agtggatatt gttgccatca atgacccctt cattgacctc 181 aactacatgg tttacatgtt ccaatatgat tccacccatg gcaaattcca tggcaccgtc 241 aaggctgaga acgggaagct tgtcatcaat ggaaatccca tcaccatctt ccaggagcga 301 gatccctcca aaatcaagtg gggcgatgct ggcgctgagt acgtcgtgga gtccactggc 361 gtcttcacca ccatggagaa ggctggggct catttgcagg ggggagccaa aagggtcatc 421 atctctgccc cctctgctga tgcccccatg ttcgtcatgg gtgtgaacca tgagaagtat 481 gacaacagcc tcaagatcat cagcaatgcc tcctgcacca ccaactgctt agcacccctg 541 gccaaggtca tccatgacaa ctttggtatc gtggaaggac tcatgaccac agtccatgcc 601 atcactgcca cccagaagac tgtggatggc ccctccggga aactgtggcg tgatggccgc 661 ggggctctcc agaacatcat ccctgcctct actggcgctg ccaaggctgt gggcaaggtc 721 atccctgagc tgaacgggaa gctcactggc atggccttcc gtgtccccac tgccaacgtg 781 tcagtggtgg acctgacctg ccgtctagaa aaacctgcca aatatgatga catcaagaag 841 gtggtgaagc aggcgtcgga gggccccctc aagggcatcc tgggctacac tgagcaccag 901 gtggtctcct ctgacttcaa cagcgacacc cactcctcca cctttgacgc tggggctggc 961 attgccctca acgaccactt tgtcaagctc atttcctggt atgacaacga atttggctac 1021 agcaacaggg tggtggacct catggcccac atggcctcca aggagtaaga cccctggacc 1081 accagcccca gcaagagcac aagaggaaga gagagaccct cactgctggg gagtccctgc 1141 cacactcagt cccccaccac actgaatctc ccctcctcac agttgccatg tagacccctt 1201 gaagagggga ggggcctagg gagccgcacc ttgtcatgta ccatcaataa agtaccctgt 1261 gctcaacc // LOCUS MUSMK2P 728 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK) mRNA, complete cds. ACCESSION M35833 J05447 KEYWORDS MK protein; retinoic acid-responsive protein. SOURCE Mouse (strain BALB/c) adult liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites) AUTHORS Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T. TITLE Structure of a retinoic acid-responsive gene, MK, which is transiently activated during the differentiation of embryonal carcinoma cells and the mid-gestation period of mouse embryogenesis JOURNAL J. Biol. Chem. 265, 9441-9443 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 728; for [1]) AUTHORS Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.Shyuichiro, 20-APR-1990, for release after publication. FEATURES from to/span description pept 44 466 retinoic acid-responsive protein MK precursor sigp 44 109 retinoic acid-responsive protein MK signal peptide matp 110 463 retinoic acid-responsive protein MK mRNA 1 728 MK2 mRNA BASE COUNT 184 a 211 c 206 g 127 t ORIGIN 1 caggccggag cgggagggag cgaagcatcg agcagtgagc gagatgcagc accgaggctt 61 cttccttctc gcccttcttg ccctcttggt ggtcacgtcc gcggtggcca aaaaaaaaga 121 gaaggtgaag aagggcagcg agtgttcgga gtggacctgg gggccctgca cccccagcag 181 caaggactgc ggcatgggct tccgcgaggg tacctgtggg gcccagaccc agcgcgtcca 241 ttgcaaggtg ccctgcaact ggaagaagga atttggagcc gactgcaaat acaagtttga 301 gagctggggg gcgtgtgatg ggagcactgg caccaaagcc cgccaaggga ccctgaagaa 361 ggcgcggtac aatgcccagt gccaggagac catccgcgtg actaagccct gcacctccaa 421 gaccaagtca aagaccaaag ccaagaaagg aaaaggaaag gactaagtca ggaggccaga 481 gagcctccgg cctcgcctgg agcctgaacg gagccctcct ctcccacagg cccaagatat 541 aacccaccag tgccttttgt cttcctgtca gctctgtcaa tcacgcctgt cctctcacgc 601 ccacaccaag tgcccaaagt ggggagggac aagagattct ggaaagtgag cctccccata 661 ccctcttttg ttctccccac cctgatactt gttattaaga aatgaataaa ataaactcac 721 ttttttcc // LOCUS MUSMKPG 2929 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK) gene, complete cds. ACCESSION M34094 J05447 KEYWORDS MK protein; alternative splicing; retinoic acid-responsive protein. SOURCE Mouse (strain BALB/c) adult liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2929) AUTHORS Matsubara,S., Tomomura,M., Kadomatsu,K. and Muramatsu,T. TITLE Structure of a retinoic acid-responsive gene, MK, which is transiently activated during the differentiation of embryonal carcinoma cells and the mid-gestation period of mouse embryogenesis JOURNAL J. Biol. Chem. 265, 9441-9443 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.Shyuichiro, 20-APR-1990, for release after publication. FEATURES from to/span description pept 1298 1373 retinoic acid-responsive protein (MK) precursor, exon 1 1491 1649 retinoic acid-responsive protein (MK) precursor, exon 2 1766 1927 retinoic acid-responsive protein (MK) precursor, exon 3 2631 2656 retinoic acid-responsive protein (MK) precursor, exon 4 sigp 1298 1363 retinoic acid-responsive protein (MK) signal peptide matp 1364 1373 retinoic acid-responsive protein (MK) 1491 1649 retinoic acid-responsive protein (MK) 1766 1927 retinoic acid-responsive protein (MK) 2631 2653 retinoic acid-responsive protein (MK) pre-msg 463 2918 MK3 mRNA and introns (minor alt.) pre-msg 1007 2918 MK2 mRNA and introns (major alt.) pre-msg 1048 2918 MK1 mRNA and introns (minor alt.) IVS 816 1296 MK3 intron A IVS 1052 1296 MK2 intron A IVS 1374 1490 MK1 intron A, and MK2 and MK3 intron B IVS 1650 1765 MK1 intron B, and MK2 and MK3 intron C IVS 1928 2630 MK1 intron C, and MK2 and MK3 intron D signal 85 91 GC box signal 143 149 GC box signal 274 280 GC box signal 598 604 GC box signal 852 858 GC box signal 910 916 GC box signal 939 945 GC box BASE COUNT 660 a 771 c 930 g 568 t ORIGIN 1 tggccaccaa catctcagat cacttcggga gatgggtctg ccccgatcct gacctctgcc 61 tagggcctta ggctcacagc gcctggggcg gagctgattt tcccgctcct gcagggatga 121 taacaatgaa agtaaaagag gtggggcggg ggccaggctt gggttctttg gtcttttggc 181 cctgtgccct ggagcagtcc cctccccctg gcttgtactg gggggggggg gggggatctg 241 cttgaggtga gcctgaggcc ccagggtcag gggtgggcgg ttatcacctc cgggggaagc 301 ccggtctgga acttctcaga cagctcttgt cagcgacaag atttaccaaa ctcatttcta 361 tgtgcttccc catccccccc aacgcccttc cctcctcctc ctcccccaaa cctgcactag 421 aaaaaggctc tcgagccttg ctcacccgga gccatctgag gtcccaggta cccagctccc 481 tgccacatca gagacccttc ttgcactctg agtgaactga ttaaaaaaaa aaaaaaaaaa 541 aaaaaaccaa gccggaggtg agccgggcct cgaagggaag gttcgcgggt gcggtggccg 601 ccccgagcct gtgacaccag gacatactcc cggggcccgc ggtgggcaag cgaagtggtg 661 acctgagagc tgacaggctg cgagagggaa aagtatagac aggcctagac caggggaagg 721 ggaggggata gagagctggg cctgctacga ggggacctga gccagaagcg cactggtaaa 781 accgaactcc aggaccagag acccagagat cagaggtgag aggcacagac gcgggagtcc 841 cggctcggcg aggggcggga gtggaggcgg ggactagggg ggtctgggga ggtgcgggtt 901 tggggggagg gggcgggtcc ttccacggga tggggggagg ggcgggggcc catgtgaccg 961 gctcagaccg gttctggaga caaaaggggc cttagcggcc ttagcgggac aggccggagc 1021 gggagggagc gaagcatcga gcagtgagcg agtgagcgca cgcagtggct gtggccccag 1081 tcccttcagg cggctgctct gccaccaagg gggctgaggt gggggtgggg gtacgctgag 1141 acatcggttc caagtcctcc ctccgtctcc cccttgtcgg tccgacgttt tgggcctgga 1201 aagtgggaca agtcagtcaa gggtgggagg tccttcccgc ggttcctagc ggagaagaga 1261 ctaggcgaga aactctaacc caggttttac ccctaggatg cagcaccgag gcttcttcct 1321 tctcgccctt cttgccctct tggtggtcac gtccgcggtg gccaaaaaaa aaggtgatgg 1381 gataggatgg gctcaggagt aaaagctggg gtgggcaggt gaggcaggcc gtgtgaccaa 1441 gtgctggtcc ggcacgccat gtccttaact ttgttccttg cgccctgtag agaaggtgaa 1501 gaagggcagc gagtgttcgg agtggacctg ggggccctgc acccccagca gcaaggactg 1561 cggcatgggc ttccgcgagg gtacctgtgg ggcccagacc cagcgcgtcc attgcaaggt 1621 gccctgcaac tggaagaagg aatttggagg tgaggtggcg cgcgggagga gggcgggaag 1681 ccagagggta tgtccttata aaccggaggc agggaggaca tccacaaccc tcctgtctct 1741 caccgtgggg ccactctccc atcagccgac tgcaaataca agtttgagag ctggggggcg 1801 tgtgatggga gcactggcac caaagcccgc caagggaccc tgaagaaggc gcggtacaat 1861 gcccagtgcc aggagaccat ccgcgtgact aagccctgca cctccaagac caagtcaaag 1921 accaaaggtc agcgaatatg gtggggttgt gggccaggct actccatgct ctgtctctgc 1981 agagcagtct taaagttagg aatgggcagg cacttgaggg ccactctcag gagatgctaa 2041 accctctgcc caagtaggaa ctactctttc tgttggatca tccgacctgg gttcctggga 2101 aaggcttgtc tttgtcaact gaggaaggtg gggtgggatc agggaggagt taactctgcg 2161 cttaaaacta tggaaaggcc tgtcccaaag gtacatgctg ctacctgact cccaacagct 2221 attgaggcca gcagggcaga ggtgactctg cccatttccc cggtgaggaa cttggagtac 2281 tctgatccta gatgaaaata gaaagttgaa agtcaggctt ggtagctcgt gcctgtaaaa 2341 agcggcactt caggactgag gcagtaacac tgccttgagt tcaaggttac agactgagag 2401 acttgagagt ctgtctttaa aggggggggg ggggcgcgag ggttaaaaag ttgaacgaat 2461 aaagaaagat ttcatatcac atggctgccc tttcccacca cttccaggtg aactggtcag 2521 tcaccactag ggggcaggat tttctctcct tgatggacat gtctgcgttg tctggtgagt 2581 ccgagctagg tcacccaccg cactaatgca tctccgttat tgttttccag ccaagaaagg 2641 aaaaggaaag gactaagtca ggaggccaga gagcctccgg cctcgcctgg agcctgaacg 2701 gagccctcct ctcccacagg cccaagatat aacccaccag tgccttttgt cttcctgtca 2761 gctctgtcaa tcacgcctgt cctctcacgc ccacaccaag tgcccaaagt ggggagggac 2821 aagagattct ggaaagtgag cctccccata ccctcttttg ttctccccac cctgatactt 2881 gttattaaga aatgaataaa ataaactcac ttttttccaa taaaagctt // LOCUS MUSCRRY01 676 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 1. ACCESSION M34164 KEYWORDS complement receptor. SEGMENT 1 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 676) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept 414 + 531 complement receptor (Crry; liver) precursor, exon 1 sigp 414 530 complement receptor (liver) signal peptide (put.) matp 531 + 531 complement receptor (liver) pep$ 414 + 660 complement receptor (spleen) precursor, exon 1 sigp 414 530 complement receptor (spleen) signal peptide (put.) matp 531 + 660 complement receptor (spleen) IVS 532 > 676 Crry intron A IVS 661 > 676 Crry intron A' BASE COUNT 148 a 162 c 193 g 173 t ORIGIN 1 atccgaattc atcataagga aataggttct tactgtatac tagacagggt atgcaactgt 61 cagctcactg ttgcagatta gggttaggct ccacccttgc agatttttaa aaggagtaag 121 gccgggctat atgccaaacc gagttcccat aatgccttgt tttctttgga gtcgaaggtt 181 cctgcaagtg gaaaacttcc tggagctgac ctactaggta ttgaaccagt ttctgcattg 241 ctgaatcaat ctcccaaggg taattccaca gaaatcccag gggcttggag taaacaagac 301 cgcgcctagc ccagctagag gaagttttat tccggaaccc agcgccattt ctgggtggga 361 ctgctttcta caccatttgc cgtaaaacgt tgtttgagaa cggtgtgagg ggaatggagg 421 tctcttctcg gagttcagag cctctggatc cggtgtggct ccttgtagcc ttcggccggg 481 gaggagtcaa gctagaagtt ttgctgctgt tcttgctgcc atttactttg ggtgagctgc 541 ggggaggcct ggggaagcac ggacacacgg ttcaccggga acccgcggta aataggctct 601 gcgcagactc caaacgctgg tctgggctgc ctgtgagtgc tcagcgcccc tttcccatgg 661 gtgagcgtgg ggcgcc // LOCUS MUSCRRY02 200 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 2. ACCESSION M34165 KEYWORDS complement receptor. SEGMENT 2 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 200) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 190 complement receptor (Crry; liver) precursor, exon 2 matp 11 + 190 complement receptor (liver) pep$ + 11 + 190 complement receptor (spleen) precursor, exon 2 matp + 11 + 190 complement receptor (spleen) IVS < 1 10 Crry intron A IVS < 1 10 Crry intron A' IVS 191 > 200 Crry intron B BASE COUNT 60 a 51 c 35 g 54 t ORIGIN Undetermined number of base pairs after segment 1. 1 cattcaacag gtcactgccc agccccatca cagcttcctt ctgccaaacc tataaatcta 61 actgatgaat ccatgtttcc cattggaaca tatttgttgt atgaatgtct cccaggatat 121 atcaagaggc agttctctat cacctgcaaa caagactcaa cctggacgag tgctgaagat 181 aagtgtatac gtgagtaact // LOCUS MUSCRRY03 120 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 3. ACCESSION M34166 KEYWORDS complement receptor. SEGMENT 3 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 120) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 110 complement receptor (Crry; liver) precursor, exon 3 matp + 11 + 110 complement receptor (liver) pep$ + 11 + 110 complement receptor (spleen) precursor, exon 3 matp + 11 + 110 complement receptor (spleen) IVS < 1 10 Crry intron B IVS 111 > 120 Crry intron C BASE COUNT 36 a 21 c 25 g 38 t ORIGIN Undetermined number of base pairs after segment 2. 1 tttttcatag gaaaacaatg taaaactcct tcagatcctg agaatggctt ggtacatgta 61 cacacaggca ttcagtttgg atcccgtatt aattatactt gtaatcaagg gtgagttggc // LOCUS MUSCRRY04 104 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 4. ACCESSION M34167 KEYWORDS complement receptor. SEGMENT 4 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 104) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 96 complement receptor (Crry; liver) precursor, exon 4 matp + 11 + 96 complement receptor (liver) pep$ + 11 + 96 complement receptor (spleen) precursor, exon 4 matp + 11 + 96 complement receptor (spleen) IVS < 1 10 Crry intron C IVS 97 > 104 Crry intron D BASE COUNT 20 a 20 c 27 g 37 t ORIGIN Undetermined number of base pairs after segment 3. 1 ctgtgtgtag ataccgcctc attggttcct cctctgctgt atgtgtcatc actgatcaaa 61 gtgttgattg ggatactgag gcacctattt gtgagtgtaa gttg // LOCUS MUSCRRY05 422 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 5. ACCESSION M34168 KEYWORDS complement receptor. SEGMENT 5 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 422) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 412 complement receptor (Crry; liver) precursor, exon 5 matp + 11 + 412 complement receptor (liver) pep$ + 11 + 412 complement receptor (spleen) precursor, exon 5 matp + 11 + 412 complement receptor (spleen) IVS < 1 10 Crry intron D IVS 413 > 422 Crry intron E BASE COUNT 106 a 100 c 104 g 112 t ORIGIN Undetermined number of base pairs after segment 4. 1 ctttgcccag ggattccttg tgagataccc ccaggcattc ccaatggaga tttcttcagt 61 tcaaccagag aagactttca ttatggaatg gtggttacct accgctgcaa cactgatgcg 121 agagggaagg cgctctttaa cctggtgggt gagccctcct tatactgtac cagcaacgat 181 ggtgaaattg gagtctggag cggccctcct cctcagtgca ttgaactcaa caaatgtact 241 cctcctccct atgttgaaaa tgcagtcatg ctgtctgaga acagaagctt gttttcctta 301 agggatattg tggagtttag atgtcaccct ggctttatca tgaaaggagc cagcagtgtg 361 cattgtcagt ccctaaacaa atgggagcca gagttaccaa gctgcttcaa gggtaagctc 421 ga // LOCUS MUSCRRY06 206 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 6. ACCESSION M34169 KEYWORDS complement receptor. SEGMENT 6 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 206) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 196 complement receptor (Crry; liver) precursor, exon 6 matp + 11 + 196 complement receptor (liver) pep$ + 11 + 196 complement receptor (spleen) precursor, exon 6 matp + 11 + 196 complement receptor (spleen) IVS < 1 10 Crry intron E IVS 197 > 206 Crry intron F BASE COUNT 61 a 33 c 58 g 54 t ORIGIN Undetermined number of base pairs after segment 5. 1 ctaattgcag gagtgatatg tcgtctccct caggagatga gtggattcca gaaggggttg 61 ggaatgaaaa aagaatatta ttatggagag aatgtaacct tggaatgtga ggatgggtat 121 actctagaag gcagttctca aagccagtgc cagtctgatg gcagctggaa tcctcttctg 181 gccaaatgtg tatctcgtaa gtacaa // LOCUS MUSCRRY07 44 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 7. ACCESSION M34170 KEYWORDS complement receptor. SEGMENT 7 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 44) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 34 complement receptor (Crry; liver) precursor, exon 7 matp + 11 + 34 complement receptor (liver) pep$ + 11 + 34 complement receptor (spleen) precursor, exon 7 matp + 11 + 34 complement receptor (spleen) IVS < 1 10 Crry intron F IVS 35 > 44 Crry intron G BASE COUNT 8 a 8 c 10 g 18 t ORIGIN Undetermined number of base pairs after segment 6. 1 tctctttcag gctcaatcag tggtctaatt gttggtaagt tctg // LOCUS MUSCRRY08 96 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 8. ACCESSION M34171 KEYWORDS complement receptor. SEGMENT 8 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 96) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 86 complement receptor (Crry; liver) precursor, exon 8 matp + 11 + 86 complement receptor (liver) pep$ + 11 + 86 complement receptor (spleen) precursor, exon 8 matp + 11 + 86 complement receptor (spleen) IVS < 1 10 Crry intron G IVS 87 > 96 Crry intron H BASE COUNT 27 a 10 c 19 g 40 t ORIGIN Undetermined number of base pairs after segment 7. 1 tcctgtttag gaattttcat tgggataatc gtctttattt tagtcatcat tgttttcatt 61 tggatgattc tgaagtataa aaaacggtga gtaaag // LOCUS MUSCRRY09 125 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 9. ACCESSION M34172 KEYWORDS complement receptor. SEGMENT 9 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 125) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 115 complement receptor (Crry; liver) precursor, exon 9 matp + 11 + 115 complement receptor (liver) pep$ + 11 + 115 complement receptor (spleen) precursor, exon 9 matp + 11 + 115 complement receptor (spleen) IVS < 1 10 Crry intron H IVS 116 > 125 Crry intron I BASE COUNT 46 a 24 c 25 g 30 t ORIGIN Undetermined number of base pairs after segment 8. 1 taccaattag caataccaca gatgaaaagt ataaagaagt gggtattcat ttaaattata 61 aagaagacag ctgtgtccgc cttcagtctc tgctcacaag tcaggagaac agcaggtaca 121 tatgc // LOCUS MUSCRRY10 128 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry) gene, exon 10. ACCESSION M34173 KEYWORDS complement receptor. SEGMENT 10 of 10 SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 128) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 11 56 complement receptor (Crry; liver) precursor, exon 10 matp + 11 53 complement receptor (liver) pep$ + 11 56 complement receptor (spleen) precursor, exon 10 matp + 11 53 complement receptor (spleen) IVS < 1 10 Crry intron I BASE COUNT 41 a 30 c 20 g 37 t ORIGIN Undetermined number of base pairs after segment 9. 1 tttgctgaag taccactagc ccagcacgga attcactcac tcaagaagtc tcctaaatag 61 cagcaacgtg aaatgagaac atgctctgtc tgtatcactt ttaaaataaa ctgtttcctt 121 ttaagatc // LOCUS MUSCRRYPS 1272 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse complement receptor (Crry-ps) pseudogene DNA fragment. ACCESSION M34174 KEYWORDS complement receptor; pseudogene. SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1272) AUTHORS Paul,M.S., Aegerter,M., Cepek,K., Miller,M.D. and Weis,J.H. TITLE The murine complement receptor gene family: III. The genomic and transcriptional complexity of the Crry and Crry-ps genes JOURNAL J. Immunol. 144, 1988-1996 (1990) STANDARD simple staff_review BASE COUNT 377 a 256 c 276 g 363 t ORIGIN 1 tgcccagccc catcacagct tccttctgcc aaacctataa atctaactga tgaatccatg 61 tttcccattg gaacatctgt gaaatatgaa tgtctcccag gatatatcaa gaggcagttc 121 tctatcacct gcaaacaaga ctcaacctgg acgagtgctg aagataagtg tatacgaaaa 181 caatgtaaaa ctcctttaga tcctcagaat ggcttggtac atgtacacac aggcattcag 241 tttggatccc gtattaatta tacttgtaat aaaggatacc gcctcattgg ttcctcctct 301 gctgtatgtg tcatcactga tcaaagtgtt gattgggata ctgaggcacc tatttgtgag 361 tggattcctt gtgatatacc cccaggcatt cccaatggag atttcttcag ttcaactaga 421 gaagactttc attatggaat ggtggttacc taccgctgca acactgatgc gagagggaag 481 gcgctcttta acctggtggt tatactgtac cagcaacgat ggtgaaattg gagtctggag 541 tggccctcct cctcagtgca ttggattcaa caaatgtact cctcctccct atgttgaaaa 601 tgcagtcatg ctgtctgaga acagaagctt gttttcctta agggatattg tggagtttag 661 atgtcaccct ggctttatca tgaaaggagc cagcagtgtg cattgtcagt ccctaaacaa 721 atgggagcca gagttaccaa gctgcttcaa gggagtgata tgtcgtctcc ctcaggagat 781 gagtggattc cagaaggggt tgggaatgaa aaaagaatat tattatggag agaatgtaac 841 cttggaatgc gaggatgggt atactctaga aggcagttct caaagccagt gtcagtctga 901 tggcagctgg aatcctcttc tggccaaaag tgtatcgcgc tcaatcagtg gtctaattgt 961 tggaattttc attgggatga tcatctttat tttattcatc attgttttca tttggatgat 1021 tctgaagtat aaaaaacgca ataccacaga tgaaaagtat aaagaagtgg gtattcattt 1081 aaattataaa ggagacagct gtgtctgcct tcagtctctg ctcacaagtc aggagaacag 1141 cactaccact agcccagcac agaattcact cgctcaagaa gtctcctaaa tagcagcaac 1201 gtgaaatgag aacatgtctt tctgtatcat ttttaaaata aactatttct tttaagaaaa 1261 aaaagaaaga aa // LOCUS BSURGRRNB 7430 bp ds-DNA BCT 10-JUL-1990 DEFINITION B.subtilis rrnB operon with 23S rRNA, 16SrRNA, 5S rRNA and tRNA gene cluster: Val-, Thr-, Lys-, Leu-cug-, Gly-ggc-, Leu-uua-, Arg-, Pro-, Ala-, Met-, Ile-, Ser-uca-, Met-f-, Asp-, Phe-, His-, Gly-gga-, Ile-, Asn-, Ser-agc- and Glu-tRNA. ACCESSION K00637 M10606 X00007 KEYWORDS 23S ribosomal RNA; 5S ribosomal RNA; ribosomal RNA; transfer RNA; transfer RNA-Ala; transfer RNA-Arg; transfer RNA-Asn; transfer RNA-Asp; transfer RNA-Glu; transfer RNA-Gly; transfer RNA-His; transfer RNA-Ile; transfer RNA-Leu; transfer RNA-Lys; transfer RNA-Met; transfer RNA-Phe; transfer RNA-Pro; transfer RNA-Ser; transfer RNA-Thr; transfer RNA-Val. SOURCE B.subtilis 168 DNA, library of Ferrari et al, clone pBC204 [1]; clone pGS227 [2]; clone pGS332 [3]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 4897 to 7430) AUTHORS Green,C.J. and Vold,B.S. TITLE Sequence analysis of a cluster of twenty-one tRNA genes in Bacillus subtilis JOURNAL Nucleic Acids Res. 11, 5763-5774 (1983) STANDARD simple staff_review REFERENCE 2 (bases 1 to 1168) AUTHORS Stewart,G.C. and Bott,K. TITLE DNA sequence of the tandem ribosomal RNA promoter for B.subtilis operon rrnB JOURNAL Nucleic Acids Res. 11, 6289-6300 (1983) STANDARD simple staff_review REFERENCE 3 (bases 1 to 7430; revises [1],[2]) AUTHORS Green,C.J., Stewart,G.C., Hollis,M.A., Vold,B.S. and Bott,K.F. TITLE Nucleotide sequence of the Bacillus subtilis ribosomal RNA operon, rrnB JOURNAL Gene 37, 261-266 (1985) STANDARD simple staff_review REFERENCE 4 (sites for [1],[2] and [3]) AUTHORS Su,S.L. and Dubnau,D. TITLE Binding of Bacillus subtilis ermC' methyltransferase to 23S rRNA JOURNAL Biochemistry 29, 6033-6042 (1990) STANDARD simple staff_entry COMMENT Draft entry and sequence in computer readable form for [1],[2],[3] kindly provided by K.F.Bott, 26-DEC-1985. The RNAs, encoded by the sequence presented below, are probably transcribed as one polycistronic unit, including the tRNA region, because there are no obvious terminator stem loop structures until after the end of the tRNA region at positions 7245-7272 and 7392-7413 [3]. [1] notes that though the Ile-tRNA-nau sequence has the methionine anticodon "cau", it is highly homologous to Ile-tRNA-gau; the "c" in the wobble position may be post-transcriptionally modified to recognize "aua" codons. Promoter P1 is located at positions 184-189 (-35 region) and 207-213 (-10 region), and P2 at 276-281 (-35 region) and 299-304 (-10 region). A third promoter region could be at positions 5517-5522. A potential stem-loop structure, necessary for processing of the mature 16S rRNA, is found at positions 327-360 [2]. FEATURES from to/span description rRNA 485 2034 16S rRNA rRNA 2203 5129 23S rRNA rRNA 5185 5300 5S rRNA tRNA 5322 5397 Val-tRNA tRNA 5430 5504 Thr-tRNA tRNA 5543 5618 Lys-tRNA tRNA 5629 5715 Leu-tRNA-cug tRNA 5721 5795 Gly-tRNA-ggc tRNA 5810 5895 Leu-tRNA-uua tRNA 5905 5981 Arg-tRNA tRNA 5997 6073 Pro-tRNA tRNA 6079 6151 Ala-tRNA tRNA 6172 6248 Met-tRNA tRNA 6251 6327 Ile-tRNA-nau tRNA 6334 6425 Ser-tRNA-uca tRNA 6443 6519 Met-tRNA-f tRNA 6531 6607 Asp-tRNA tRNA 6620 6695 Phe-tRNA tRNA 6712 6788 His-tRNA tRNA 6799 6872 Gly-tRNA-gga tRNA 6888 6964 Ile-tRNA-gau tRNA 6975 7049 Asn-tRNA tRNA 7053 7143 Ser-tRNA-agc tRNA 7169 7240 Glu-tRNA revision 504 504 c in [3]; t in [2] revision 571 573 tcc in [3]; tc in [2] revision 5029 5031 gga in [3]; ga in [1] anticdn 5355 5357 Val-tRNA anticodon tac anticdn 5463 5465 Thr-tRNA anticodon tgt anticdn 5576 5578 Lys-tRNA anticodon ttt anticdn 5663 5665 Leu-tRNA-cug anticodon cag anticdn 5753 5755 Gly-tRNA-ggc anticodon gcc anticdn 5844 5846 Leu-tRNA-uua anticodon taa anticdn 5939 5941 Arg-tRNA anticodon acg anticdn 6031 6033 Pro-tRNA anticodon tgg anticdn 6112 6114 Ala-tRNA anticodon tgc revision 6165 6167 act in [3]; at in [1] anticdn 6206 6208 Met-tRNA anticodon cat anticdn 6285 6287 Ile-tRNA-nau anticodon cat anticdn 6370 6372 Ser-tRNA-uca anticodon tga anticdn 6477 6479 Met-tRNA-f anticodon cat anticdn 6565 6567 Asp-tRNA anticodon gtc anticdn 6653 6655 Phe-tRNA anticodon gaa anticdn 6746 6748 His-tRNA anticodon gtg anticdn 6831 6833 Gly-tRNA-gga anticodon tcc anticdn 6922 6924 Ile-tRNA-gau anticodon gat anticdn 7007 7009 Asn-tRNA anticodon gtt anticdn 7087 7089 Ser-tRNA-agc anticodon gct anticdn 7202 7204 Glu-tRNA anticodon ttc BASE COUNT 1906 a 1694 c 2125 g 1705 t ORIGIN 65 bp upstream of MboI site; 280 degrees on the B.subtilis map. 1 ctttaatgct ccccttgtgg tcatcagtat ttagttcgtt tcacatacaa gaaaacgaaa 61 aaaacaacaa gatcacatga ctgatgtata tgttctttta agaaacttat atgatacaca 121 cgctttagaa atcatggcga ggattatagt ttatttgttt tatagatttt ttttaaaaaa 181 ctattgcaat aaataaatac aggtgttata ttattaaacg tcgctgatgc acagcggaca 241 caactagatg cttcaaaaca acttgaaaaa agttgttgac aaaaaagaag ctgaatgtta 301 tattagtaaa gctgcttcat tgagaagtaa cgaaatgatc tttgaaaact aaacaagaca 361 aaacgtacct gttaattcag tttttaaaaa tcgcactgcg atgtgcgtat catcaaacag 421 ggcctgcacg acgcaggtca cacaggtgtc gccgcaggat gcggtgaact taacctgtga 481 tccatttatc ggagagtttg atcctggctc aggacgaacg ctggcggcgt gcctaataca 541 tgcaagtcga gcggacaggt gggagcttgc tccgatgtta gcggcggacg ggtgagtaac 601 acgtgggtaa cctgcctgta agactgggat aactccggga aaccggggct aataccggat 661 ggttgtttga accgcatggt tcaaacataa aaggtggctt cggctaccac ttacagatgg 721 acccgcggcg cattagctag ttggtgaggt aacggctcac caaggcaacg atgcgtagcc 781 gacctgagag ggtgatcggc cacactggga ctgagacacg gcccagactc ctacgggagg 841 cagcagtagg gaatcttccg caatggacga aagtctgacg gagcaacgcc gcgtgagtga 901 tgaaggtttt cggatcgtaa agctctgttg ttagggaaga acaagtaccg ttcgaacagg 961 gcggtacctt gacggtacct aaccagaaag ccacggctaa ctacgtgcca gcagccgcgg 1021 taatacgtag gtggcaagcg ttttccggaa ttattgggcg taaagggctc gcaggcggtt 1081 tcttaagtct gatgtgaaag cccccggctc aaccggggag ggtcattgga aactggggaa 1141 cttgagtgca gaagaggaga gtggaattcc acgttgtagc ggtgaaatgc gtagagatgt 1201 ggaggaacac cagtggcgaa ggcgactctc tggtctgtaa ctgacgctga ggagcgaaag 1261 cgtggggagc gaacaggatt agataccctg gtagtccacg ccgtaaacga tgagtgctaa 1321 gtgttagggg gtttccgccc cttagtgctg cagctaacgc attgagcact ccgcctgggg 1381 agtacggtcg caagactgaa actcaaagga attgacgggg gcccgcacaa gcggtggagc 1441 atgtggttta attcgaagca acgcgaagaa ccttactagg tcttgacatc ctctgacaat 1501 cctagagata ggacgtcccc ttcggggcag agtgacaggt ggtgcatggt tgtcgtcagc 1561 tcgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccttgat cttagttgcc 1621 agcattcagt tgggcactct aaggtgactg ccggtgacaa accggaggaa ggtggggatg 1681 acgtcaaatc atcatgcccc ttatgacttg ggctacacac gtgctacaat ggacagaaca 1741 aagggcagcg aaccgcgagg ttaagccaat cccacaaatc tgttctcagt tcggatcgca 1801 gtctgcaact cgactgcgtg aagctggaat cgctagtaat cgcggatcag catgccgcgg 1861 tgaatacgtt cccgggcctt gtacacaccg cccgtcacac cacgagagtt tgtaacaccc 1921 gaagtcggtg aggtaacctt ttaggagcca gccgccgaag gtgggacaga tgattggggt 1981 gaagtcgtaa caaggtagcc gtatcggaag gtgcggctgg atcacctcct ttctaaggat 2041 attatacgga atataagacc caaggtctta taaacagaac gttccctgtc ttgtttagtt 2101 ttgaaggatc attccttcga aacgtgttct ttgaaaacta gataacagta gacatcacat 2161 tcaattagta acacaagata tcacatagtg attcttttta acggttaagt tagaaagggc 2221 gcacggtgga tgccttggca ctaggagccg atgaaggacg ggacgaacac cgatatgctt 2281 cggggagctg taagcaagct ttgatccgga gatttccgaa tggggaaacc caccactcgt 2341 aatggagtgg tatccatatc tgaattcata ggatatgaga aggcagaccc ggggaactga 2401 aacatctaag tacccggaga agagaaagca aatgcgattc cctgagtagc ggcgacgaac 2461 acgggatcag cccaaaccaa gaggcttgcc tctgtggttg taggacactc tgtacggagt 2521 tacaaaagaa cgaggtagat gaagaggtct ggaaagggcc cgccatagga ggtaacagcc 2581 ctgtagtcaa aacttcgttc tctcctgagt ggatcctgag tacggcggaa cacgtgaaat 2641 tccgtcggaa tccgggagga ccatctccca aggctaaata ctccctagtg accgatagtg 2701 aaccagtacc gtgagggaaa ggtgaaaagc accccggaag gggagtgaaa gagatcctga 2761 aaccgtgtgc ctacaagtag tcagagcccg ttaacggtga tggcgtgcct tttgtagaat 2821 gaaccggcga gttacgatcc cgtgcaaggt taagcagaag atgcggagcc gcagcgaaag 2881 cgagtctgaa tagggcgcat gagtacgtgg tcgtagaccc gaaaccaggt gatctaccca 2941 tgtccagggt gaagttcagg taacactgaa tggaggcccg aacccacgca cgttgaaaag 3001 tgcggggatg aggtgtgggt aggggtgaaa tgccaatcga acctggagat agctggttct 3061 ctccgaaata gctttagggc tagcctcaag gtaagagtct tggaggtaga gcactgattg 3121 gactaggggc cctcaccggg ttaccgaatt cagtcaaact ccgaatgcca atgacttatc 3181 cttgggagtc agactgcgag tgataagatc cgtagtcgaa agggaaacag cccagaccgc 3241 cagctaaggt cccaaagtat acgttaagtg gaaaaggatg tggagttgct tagacaacca 3301 ggatgttggc ttagaagcag ccaccattta aagagtgcgt aatagctcac tggtcgagtg 3361 actctgcgcc gaaaatgtac cggggctaaa cgtatcaccg aagctgcgga ctgttcttcg 3421 aacagtggta ggagagcgtt ctaagggctg tgaagccaga ccggaaggac tggtggacgg 3481 cttagaagtg agaatgccgg tatgagtagc gaaaagaggg gtgagaatcc ctccaccgaa 3541 tgcctaaggg ttcctgagga aggctcgtcc gctcagggtt agtcgggacc taagccgagg 3601 ccgaaaggcg taggcgatgg acaacaggtt gatattcctg taccacctcc tcaccatttg 3661 agcaatgggg ggtcgcagga ggatagggta agcgcggtat tggatatccg cgtccaagca 3721 gttaggctgg gaaataggca aatccgtttc ccataaggct gagctgtgat ggcgagcgaa 3781 atatagtagc gaagttcctg attccacact gccaagaaaa gcctctagcg aggtgagagg 3841 tgcccgtacc gcaaaccgtc acaggtaggc gaggagagaa tcctaaggtg atcgagagaa 3901 ctctcgttaa ggaactcggc aaaatgaccc cgtaacttcg ggagaagggg tgctctgtta 3961 gggtgcaagc ccgagagagc cgcagtgaat aggcccaggc gactgtttag caaaaacaca 4021 ggtctctgcg aagccgtaag gcgaagtata ggggctgacg cctgcccggt gctggaaggt 4081 taagaggagc gcttagcgta agcgaaggtg cgaattgaag ccccagtaaa cggcggccgt 4141 aactataacg gtcctaaggt agcgaaattc cttgtcgggt aagttccgac ccgcacgaaa 4201 ggcgcaacga tctgggcgct gtctcaacga gagactcggt gaaattatag tacctgtgaa 4261 gatgcaggtt acccgcgaca ggacggaaag accccgtgga gctttactgc agcctgatat 4321 tgaatgttgg tacagcttgt acaggatagg taggagcctt ggaaaccgga gcgccagctt 4381 cggtggaggc atcggtggga tactaccctg gctgtattga ccttctaacc ccccgccctt 4441 atcgggcggg gagacagtgt caggtgggca gtttgactgg ggcggtcgcc tcctaaaagg 4501 taacggaggc gcccaaaggt tccctcagaa tggttggaaa tcattcgcag agtgtaaagg 4561 cacaagggag cttgactgcg agacctacaa gtcgagcagg gacgaaagtc gggcttagtg 4621 atccggtggt tccgcatgga agggccatcg ctcaacggat aaaagctacc ccggggataa 4681 caggcttatc tcccccaaga gctccacatc gacggggagg tttggcacct cgatgtcggc 4741 tcatcgcatc ctggggctgt agtcggtccc aagggttggg ctgttcgccc attaaagcgg 4801 tacgcgagct gggttcagaa cgtcgtgaga cagttcggtc cctatccgtc gcgggcgctg 4861 gaaatttgag aggagctgtc cttagtacga gaggaccggg atggacgcac cgctggtgta 4921 ccagttgttc tgccaagggc atcgctgggt agctatgtgc ggacgggata agtgctgaaa 4981 gcatctaagc atgaagcccc cctcaagatg agatttccca ttccgcaagg aagtaagatc 5041 cctgaaagat gatcaggttg ataggtctga ggtggaagtg tggcaacaca tggagctgac 5101 agatactaat cgatcgagga cttaaccata tttttgaatg atgtcacacc tgttatctag 5161 ttttgagaga acactctcaa tttgtttggt ggcgatagcg aagaggtcac acccgttccc 5221 ataccgaaca cggaagttaa gctcttcagc gccgatggta gtcgggggtt tccccctgtg 5281 agagtaggac gccgccaagc aattgcacgt tagtgcaata tggaggatta gctcagctgg 5341 gagagcatct gccttacaag cagagggtcg gcggttcgag cccgtcatcc tccaccattt 5401 ttcattatac atatcggttt tacatatatg ccggtgtagc tcaattggta gagcaactga 5461 cttgtaatca gtaggttggg ggttcaagtc ctcttgccgg caccactttt atatgatata 5521 atattcaagt ctattgtaag aagagccatt agctcagttg gtagagcatc tgacttttaa 5581 tcagagggtc gaaggttcga gtccttcatg gctcaccatt tacatgttgc ggatgtggcg 5641 gaattggcag acgcgctaga atcaggctct agtgtcttta cagacgtggg ggttcaagtc 5701 ccttcatccg caccatttct gcggaagtag ttcagtggta gaacaccacc ttgccaaggt 5761 gggggtcgcg ggttcgaatc ccgtcttccg ctccaactat accatccacg ccggggtggt 5821 ggaattggca gacacacagg acttaaaatc ctgcggtagg tgactaccgt gccggttcaa 5881 gtccggccct cggcattaag ttttgcgccc gtagctcaat tggatagagc gtttgactac 5941 ggatcaaaag gttaggggtt cgactcctct cgggcgcgcc atgatctata tgaaatcggg 6001 aagtagctca gcttggtaga gcacatggtt tgggaccatg gggtcgcagg ttcgaatcct 6061 gtcttcccga ccattcttgg ggccttagct cagctgggag agcgcctgct ttgcacgcag 6121 gaggtcagcg gttcgatccc gctaggctcc acttgatttc aaaaactatt tggcggtgta 6181 gctcagctgg ctagagcgta cggttcatac ccgtgaggtc gggggttcga tcccctccgc 6241 cgctaccaat ggacctttag ctcagttggt tagagcagac ggctcataac cgtccggtcg 6301 taggttcgag tcctacaagg tccaccacta tacggaggaa tacccaagtc tggctgaagg 6361 gatcggtctt gaaaaccgac agggtgtcaa agcccgcggg ggttcgaatc cctcttcctc 6421 cgccatacat attcctaatc atcgcggggt ggagcagttc ggtagctcgt cgggctcata 6481 acccgaaggt cgcaggttca aatcctgccc ccgcaaccaa attttaaaat ggtccggtag 6541 ttcagttggt tagaatgcct gcctgtcacg caggaggtcg cgggttcgag tcccgtccgg 6601 accgccattt aaatacttag gctcggtagc tcagttggta gagcaacgga ctgaaaatcc 6661 gtgtgtcggc ggttcgattc cgtcccgagc caccatttat caatatgctt tggcggttgt 6721 ggcgaagtgg ttaacgcacc agattgtggc tctggcattc gtgggttcga ttcccatcaa 6781 tcgccccaaa taaaaattgc gggtgtagtt tagtggtaaa acctcagcct tccaagctga 6841 tgtcgtgggt tcgattccca tcacccgctc catttctata tcgtcatggg cctgtagctc 6901 agctggttag agcgcacgcc tgataagcgt gaggtcgatg gttcgagtcc attcaggccc 6961 accatgactt ttgttccaca gtagctcagt ggtagagcta tcggctgtta accgatcggt 7021 cgcaggttcg aatcctgcct gtggagccaa atggagaagt actcaagtgg ctgaagaggc 7081 gcccctgcta agggtgtagg tcgtgtaagc ggcgcgaggg ttcaaatccc tccttctccg 7141 ccatatgatt acagatatca taattatcgg cccgttggtc aagcggttaa gacaccgccc 7201 tttcacggcg gtaacacggg ttcgaatccc gtacgggtca tcccagaagc cttgcatatc 7261 ctgcaaggtt tttttgtttt tataaatcat gtatatgtct tagattttgt tctttatttt 7321 aaaaacagac tacaaaaatc tccatatatt tcgtttttct tcagaaaatg aagttaattg 7381 tctataagta taagccgttt cagggaaagg gctttttttt atttcttcga // LOCUS ECOAROCX 1690 bp ds-DNA BCT 10-JUL-1990 DEFINITION E.coli chorismate synthase (aroC) gene, complete cds. ACCESSION M33021 KEYWORDS aroC gene; chorismate synthase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1690) AUTHORS White,P.J., Millar,G. and Coggins,J.R. TITLE The overexpression, purification and complete amino acid sequence of chorismate synthase from Escherichia coli K12 and its comparison with the enzyme from Neurospora crassa JOURNAL Biochem. J. 251, 313-322 (1988) STANDARD simple staff_review FEATURES from to/span description pept 492 1562 chorismate synthase (EC 4.6.1.4) BASE COUNT 403 a 467 c 466 g 354 t ORIGIN 1 gtcgacgcgg tggatatctc tccagacgcg ctggcggttg ctgaacagaa catcgaagaa 61 cacggtctga tccacaacgt cattccgatt cgttccgatc tgttccgcga cttgccgaaa 121 gtgcagtacg acctgattgt cactaacccg ccgtatgtcg atgcgaagat atgtccgacc 181 tgccaaacaa taccgccacg agccggaact gggcctggca tctggcactg acggcctgaa 241 actgacgcgt cgcattctcg gtaacgcggc agattacctt gctgatgatg gcgtgttgat 301 ttgtgaagtc ggcaacagca tggtacatct tatggaacaa tatccggatg ttccgttcac 361 ctggctggag tttgataacg gcggcgatgg tgtgtttatg ctcaccaaag agcagcttat 421 tgccgcacga gaacatttcg cgatttataa agattaagta aacacgcaaa cacaacaata 481 acggagccgt gatggctgga aacacaattg gacaactctt tcgcgtaacc accttcggcg 541 aatcgcacgg gctggcgctc ggctccatcg tcgatggtgt tccgccagcc attccgctga 601 cggaagcgga cctgcaacat gacctcgacc gtcgtcgccc tgggacatcg cgctatacca 661 cccagcgccg cgagccggat caggtcaaaa ttctctccgg tgtttttgaa ggcgttacta 721 ccggcaccag cattggcttg ttgatcgaaa acactgacca gcgctctcag gattacagtg 781 cgattaagga cgttttccgt ccaggccatg ccgattacac ctacgaacaa aaatacggtc 841 tgcgcgatta tcgcggcggt ggacgttctt ccgcccgcga aaccgccatg cgcgtggcgg 901 caggagctat tgccaaaaaa tatctcgccg agaaatttgg tattgaaatc cgtggctgcc 961 tgacccagat gggcgacatt ccgctggata tcaaagactg gtcgcaggtc gagcaaaatc 1021 cgtttttttg cccggacccc gacaaaatcg acgcgttaga cgagttgatg cgtgcgctga 1081 aaaaagaggg cgactccatc ggcgctaaag tcaccgttgt tgccagtggc gttcctgccg 1141 gacttggcga gccggtcttt gaccgcctgg atgctgacat cgcccatgcg ctgatgagca 1201 tcaacgcggt gaaaggcgtg gaaattggcg acggctttga cgtggtggcg ctgcgcggca 1261 gccagaaccc cgatgaaatc accaaagacg gtttccagag caaccatgcg ggcggcattc 1321 tcggcggtat cagcagcggg cagcaaatca ttgcccatat ggcgctgaaa ccgacctcca 1381 gcattaccgt gccgggtcgt accattaacc gctttggcca agaagttgag atgatcacca 1441 aaggccgtca cgatccctgt gtcgggatcc gcgcagtgcc gatcgcagaa gcgaatgctg 1501 gcgatcgttt taatggatca cctgttacgg caacgggcgc aaaatgccga tgtgaagact 1561 gatattccac gctggtaaaa aatgaataaa accgcgattg cgctgctggc tctgcttgcc 1621 agtagcgcca gcctggcagc gacggcgtgg caaaaaataa cccaacctgt gccgggtagc 1681 gccaaatcga // LOCUS PFAMSA2 819 bp ds-DNA INV 10-JUL-1990 DEFINITION P.falciparum 45 kD merozoite surface antigen (MSA 2) gene, complete cds. ACCESSION M28891 KEYWORDS integral membrane protein; surface antigen. SOURCE P.falciparum DNA, clone 3D7. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 819) AUTHORS Smythe,J.A., Peterson,M.G., Coppel,R.L., Saul,A.J., Kemp,D.J. and Anders,R.F. TITLE Structural diversity in the 45-kilodalton merozoite surface antigen of Plasmodium falciparum JOURNAL Mol. Biochem. Parasitol. 39, 227-234 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by J.A. Smythe, 06-OCT-1989. FEATURES from to/span description pept 1 819 45 kD merozoite surface antigen precursor sigp 1 60 45 kD merozoite surface antigen signal peptide matp 61 819 45,000 merozoite surface antigen rpt 157 228 12 base repeat rpt 301 321 9 base repeat BASE COUNT 304 a 157 c 143 g 215 t ORIGIN 1 atgaaggtaa ttaaaacatt gtctattata aatttcttta tttttgttac ctttaatatt 61 aaaaatgaaa gtaaatatag caacacattc ataaacaatg cttataatat gagtataagg 121 agaagtatgg cagaaagtaa gccttctact ggtgctggtg gtactgctgg tggtagtgct 181 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg ttctggtgat 241 ggtaatggtg cagatgctga gggaagttca agtactcccg ctactaccac aactaccaaa 301 actaccacaa ctaccacaac tactaatgat gcagaagcat ctaccagtac ctcttcagaa 361 aatccaaatc ataaaaatgc cgaaacaaat ccaaaaggta aaggagaagt tcaagaacca 421 aatcaagcaa ataaagaaac tcaaaataac tcaaatgttc aacaagactc tcaaactaaa 481 tcaaatgttc cacccactca agatgcagac actaaaagtc ctactgcaca acctgaacaa 541 gctgaaaatt ctgctccaac agccgaacaa actgaatccc ccgaattaca atctgcacca 601 gagaataaag gtacaggaca acatggacat atgcatggtt ctagaaataa tcatccacaa 661 aatacttctg atagtcaaaa agaatgtacc gatggtaaca aagaaaactg tggagcagca 721 acatccctct taaataactc tagtaatatt gcttcaataa ataaatttgt tgttttaatt 781 tcagcaacac ttgttttatc ttttgccata ttcatataa // LOCUS PFAMSA2X 864 bp ds-DNA INV 10-JUL-1990 DEFINITION P.falciparum 45,000 merozoite surface antigen (MSA2) gene, complete cds. ACCESSION M28892 KEYWORDS integral membrane protein; surface antigen. SOURCE P.falciparum (isolate Indochina 1) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 864) AUTHORS Smythe,J.A., Peterson,M.G., Coppel,R.L., Saul,A.J., Kemp,D.J. and Anders,R.F. TITLE Structural diversity in the 45-kilodalton merozoite surface antigen of Plasmodium falciparum JOURNAL Mol. Biochem. Parasitol. 39, 227-234 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by J.A. Smythe, 06-OCT-1989. FEATURES from to/span description pept 1 864 45 kD merozoite surface antigen precursor sigp 1 60 45 kD merozoite surface antigen signal peptide matp 61 864 45 kD merozoite surface antigen rpt 169 312 12 base repeat sequence rpt 379 397 9 base repeat sequence BASE COUNT 288 a 157 c 179 g 240 t ORIGIN 1 atgaaggtaa ttaaaacatt gtctattata aatttcttta tttttgttac ctttaatatt 61 aaaaatgaaa gtaaatatag caacacattc ataaacaatg cttataatat gagtataagg 121 agaagtatga cagaaagtaa tcctcctact ggtgctagtg gtagtgctgg tggtagtgct 181 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg tggtagtgct 241 ggtggtagtg ctggtggtag tgctggtggt agtgctggtg gtagtgctgg tggtagtgct 301 ggtggtagtg ctggttctgg tgatggtaat ggtgctaatc ctggtgcaga tgctgagaga 361 agtccaagta ctcccgctac taccacaact accacaacta ctaatgatgc agaagcatct 421 accagtacct cttcagaaaa tccaaatcat aataatgccg aaacaaatca agcaaataaa 481 gaaactcaaa ataactcaaa cgttcaacaa gactctcaaa ctaaatcaaa tgttccaccc 541 actcaagatg cagacactag aagtcctact gcacaacctg aacaagctga aaattctgct 601 ccaacagccg aacaaactga atcccccgaa ttacaatctg caccagagaa taaaggtaca 661 ggacaacatg gacatatgca tggttctaga aataatcatc cacaaaatac ttctgatagt 721 caaaaagaat gtaccgatgg taacaaagaa aactgtggag cagcaacatc cctcttaaat 781 aactctagta atattgcttc aataaataaa tttgttgttt taatttcagc aacacttgtt 841 ttatcttttg ccatattcat ataa // LOCUS XELRASX 1143 bp ss-mRNA VRT 10-JUL-1990 DEFINITION X.laevis ras protein mRNA, complete cds. ACCESSION M34657 KEYWORDS ras protein. SOURCE X.laevis defolliculated oocyte, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1143) AUTHORS Andeol,Y., Gusse,M. and Mechali,M. TITLE Characterization and expression of a Xenopus ras during oogenesis and development JOURNAL Dev. Biol. 139, 24-34 (1990) STANDARD simple staff_review FEATURES from to/span description pept 196 756 ras protein mRNA < 1 1143 ras protein mRNA BASE COUNT 346 a 275 c 279 g 243 t ORIGIN 1 gaattcgcca gtgttacaga atgggagttc tgaggcgctg tgactaatcc cccccacccc 61 cgcatattgg ggaaatccac cggcgggcag aaagccagag ggagaactaa ggggggccaa 121 accaaaggaa aacgcaggag ccaaagcctc cagaaacaca gggatccgtg acgagcccga 181 gtcggtgctg gtgaaatgac ggagtacaaa ctggtggtgg ttggtgctgg aggcgtgggg 241 aagagcgcac tcacaatcca gctcattcag aaccattttg tggacgagta tgatcctact 301 attgaggact cgtataggaa gcaggtggtg atagacgggg agacctgcct cctagatatc 361 ctggacactg cggggcaaga ggaatacagc gctatgaggg atcagtacat gcgcacggga 421 gaaggctttc tctgtgtctt tgctattaac aacacaaagt ccttcgagga cgtccatcat 481 tacagggaac agattaacag agttaaagat tccgatgacg ttcccatggt gttagttggt 541 aacaaatgcg acctcccatc ccggactgtg gacacaaagc aagcgcagga actggcaaag 601 agctatggta ttccttttat agagacctct gccaaaacta gacagggagt cgaagacgcc 661 ttctataccc tagtccgtga aatccgcaag cacaaggaga agatcagcaa cgggaaaaag 721 aaaaagtcct ccaaaaggaa gtgtgtcgtt ctttaacgtg ccaacctgcc cccccctgcc 781 atcctcgtgg atcagagaaa accgtgccat cacacacctg aagtcaaaga aaaaaaaagt 841 gtggactttt gtcgttgctg tggaaaccat tgaattgcca tgaaatttaa aaaaaaaacc 901 aaaacattga ccacttattt taacacaacc gataaatggc acaggctgtg ccccaatcgt 961 gtatatattc ttcatgaaca aactgtttta tcagaaagac agatgcaata gccccttctt 1021 tttaccccaa ttaaccctcc tcctggtttc tatttctccc tggaaaagac gttggtcgac 1081 cagaggggaa gaacctgccc aggcctttct tacagcccca tttgaataaa gattgaaaca 1141 ctc // LOCUS HUMSPTB 6765 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human beta-spectrin (SPTB) mRNA, complete cds. ACCESSION J05500 KEYWORDS beta-spectrin; spectrin. SOURCE Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and V252. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 6765) AUTHORS Winkelmann,J.C., Chang,J.G., Tse,W.T., Marchesi,V.T. and Forget,B.G. TITLE Full length sequence of the cDNA for human erythroid beta-spectrin JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Winkelmann, 08-MAY-1990. FEATURES from to/span description pept 96 6509 beta-spectrin /nomgen="SPTB" /map="14" /hgml_locus_uid="LS0033T" mRNA < 1 6765 SPTB mRNA signal 6716 6722 poly-A signal BASE COUNT 1627 a 1822 c 2144 g 1172 t ORIGIN Chromosome 14q23-q24. 1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag 61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa 121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg 181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg 241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg 301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca 361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc 421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac 481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc 541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag 601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag 661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta 721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca 781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc 841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct 901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc 961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg 1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca 1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca 1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca 1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag 1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg 1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc 1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata 1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga 1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg 1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc 1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg 1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg 1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga 1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg 1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc 1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg 1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg 2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc 2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc 2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt 2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga 2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc 2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg 2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc 2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat 2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgtcaggc ctgcgggagc 2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg 2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga 2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca 2761 ggttcgacat cctggaccag gagatgaaga ccttcatgac tcagattgat ggtgtgaacc 2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc 2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg 2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt 3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag 3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg 3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga 3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat 3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg 3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca 3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg 3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc 3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg 3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg 3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat 3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga 3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg 3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg 3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc 3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct 3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc 4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc 4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc 4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa 4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg 4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcgatcagac gacccgggca 4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga 4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg 4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa 4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg 4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta 4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg 4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg 4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc 4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt 4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct 4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc 4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg 5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg 5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca 5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa 5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc 5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca 5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg 5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc 5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga 5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca 5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg 5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg 5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc 5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct 5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc 5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca 5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc 6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt 6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc 6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg 6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca 6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg 6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga 6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg 6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac 6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg 6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg 6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag 6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa 6721 attttattgt ataaatatat caccatttac atgaggggaa acact // LOCUS STYEUTBC 2526 bp ds-DNA BCT 10-JUL-1990 DEFINITION S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes, complete cds. ACCESSION J05518 KEYWORDS ethanolamine ammonia-lyase. SOURCE S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2526) AUTHORS Faust,L.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M. TITLE Cloning, sequencing and expression of the genes encoding the alcohol-dependent ethanolamine ammonia-lyase of Salmonella typhimurium JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.M.Babior, 08-MAY-1990. FEATURES from to/span description pept 141 1499 ethanolamine ammonia-lyase (eutB) pept 1518 2378 ethanolamine ammonia-lyase (eutC) binding 130 133 ribosome binding site binding 1507 1510 ribosome binding site BASE COUNT 563 a 687 c 779 g 497 t ORIGIN 1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga 61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt 121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt 181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg 241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa 301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc 361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca 421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc 481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc 541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc 601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg 661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc 721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga 781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc 841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct 901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt 961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct 1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg 1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc 1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga 1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc 1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc 1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg 1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg 1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat 1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg 1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc 1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg 1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc 1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg 1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg 1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc 1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg 1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga 2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact 2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca 2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg 2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc 2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg 2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga 2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg 2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt 2521 gcctcc // LOCUS XELPCNA 1018 bp ss-mRNA VRT 10-JUL-1990 DEFINITION X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete cds. ACCESSION M34080 KEYWORDS nuclear protein; proliferating cell nuclear antigen. SOURCE X.laevis oocyte, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1018) AUTHORS Leibovici,M., Gusse,M., Bravo,R. and Mechali,M. TITLE Characterization and developmental expression of Xenopus proliferating cell nuclear antigen (PCNA) JOURNAL Dev. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Leibovici, 08-MAY-1990. FEATURES from to/span description pept 28 813 proliferating cell nuclear antigen (PCNA) mRNA < 1 1018 PCNA mRNA BASE COUNT 284 a 223 c 237 g 274 t ORIGIN 1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc 61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca 121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact 181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag 241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg 301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt 361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa 421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc 481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca 541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa 601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat 661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca 721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg 781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc 841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa 901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa 961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac // LOCUS YSCAAC2A 1333 bp ds-DNA PLN 10-JUL-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete cds. ACCESSION M34076 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1333) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator in yeast JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. FEATURES from to/span description pept 235 1158 ADP/ATP-translocator protein (AAC2) BASE COUNT 388 a 209 c 301 g 435 t ORIGIN 1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc 61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg 121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa 181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt 241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt 301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat 361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt 421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat 481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg 541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct 601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc 661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg 721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc 781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct 841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg 901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga 961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa 1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg 1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc 1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc 1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag 1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc 1321 agccactgtt gtt // LOCUS YSCAAC3 1308 bp ds-DNA PLN 10-JUL-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete cds. ACCESSION M34075 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1308) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator in yeast JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. FEATURES from to/span description pept 78 1034 ADP/ATP-translocator protein (AAC3) BASE COUNT 353 a 228 c 263 g 464 t ORIGIN 1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata 61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc 121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg 181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag 241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca 301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta 361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt 421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg 481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat 541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg 601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac 661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga 721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt 781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa 841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga 901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag 961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta 1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc 1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata 1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa 1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc 1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt // LOCUS ECODKSA 1273 bp ds-DNA BCT 10-JUL-1990 DEFINITION E.coli dnaK suppressor (dksA) gene, complete cds. ACCESSION M34945 KEYWORDS dnaK suppressor. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1273) AUTHORS Kang,P.J. and Craig,E.A. TITLE Identification and characterization of a new Escherichia coli gene that is a dosage-dependent suppressor of a dnaK deletion mutation JOURNAL J. Bacteriol. 172, 2055-2064 (1990) STANDARD simple staff_review FEATURES from to/span description pept 229 441 ORF 1 pept 619 1074 dnaK suppressor (dksA) BASE COUNT 343 a 301 c 333 g 296 t ORIGIN 1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa 61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc 121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata 181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg 241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt 301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg 361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa 421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg 481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg 541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc 601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt 661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa 721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa 781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac 841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac 901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc 961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg 1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag 1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga 1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct 1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt 1261 acgcatagaa gat // LOCUS STFCYCLI 2180 bp ss-mRNA INV 10-JUL-1990 DEFINITION Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds. ACCESSION M33880 KEYWORDS cyclin B. SOURCE Starfish (A.pectinifera) egg, cDNA to mRNA, clone lambda-gt10-cyc10. ORGANISM Asterina pectinifera Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa; Stelleroidea; Asteroidea; Spinulosida; Asterinidae. REFERENCE 1 (bases 1 to 2180) AUTHORS Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T. TITLE The starfish egg mRNA responsible for meiosis reinitiation encodes cyclin JOURNAL Dev. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by T.Kishimoto, 11-MAY-1990. FEATURES from to/span description pept 126 1313 cyclin B (CYC) mRNA < 126 2180 cyclin B mRNA BASE COUNT 650 a 440 c 482 g 608 t ORIGIN 1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg 61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga 121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt 181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc 241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag 301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca 361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa 421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct 481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac 541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt 601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc 661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc 721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc 781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg 841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt 901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca 961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca 1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag 1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc 1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat 1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca 1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg 1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt 1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt 1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg 1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa 1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt 1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga 1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc 1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca 1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac 1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat 1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag 1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat 2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg 2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt 2161 gtaataaagt taatggaaat // LOCUS CHKMTTGHA 90 bp ds-DNA ORG 10-JUL-1990 DEFINITION Chicken mitochondrial His-tRNA gene. ACCESSION M34496 M34497 KEYWORDS transfer RNA-His. SOURCE Chicken (strain white leghorn) liver mitochondrial DNA. ORGANISM Mitochondrion Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae; Gallus gallus. REFERENCE 1 (bases 1 to 90) AUTHORS L'Abbe,D., Lang,B.F., Desjardins,P. and Morais,R. TITLE Histidine tRNA from chicken mitochondria has an uncoded 5'-terminal guanylate residue JOURNAL J. Biol. Chem. 265, 2988-2992 (1990) STANDARD simple staff_entry COMMENT the "n"s in the tRNA sequence are probably modified bases. FEATURES from to/span description tRNA 11 79 His-tRNA anticdn 41 43 His-tRNA anticodon gtg variant 10 10 t in DNA, n in tRNA variant 18 18 t in DNA; n in tRNA variant 19 19 a in DNA; n in tRNA variant 26 26 c in DNA; n in tRNA variant 27 27 c in DNA; n in tRNA variant 35 35 t in DNA; n in tRNA variant 45 45 t in DNA; n in tRNA BASE COUNT 27 a 22 c 17 g 24 t ORIGIN 1 acccctctat gcaaacatag tttaacccaa acattagatt gtgattctaa aaataggagt 61 ttaaccctcc ttgttcgccg aggggaggcc // LOCUS DDISAS1A 2145 bp ss-mRNA INV 10-JUL-1990 DEFINITION D.discoideum GTP-binding protein (SAS1) gene, complete cds. ACCESSION M34456 KEYWORDS GTP-binding protein. SOURCE D.discoideum, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 2145) AUTHORS Saxe,S.A. and Kimmel,A.R. TITLE SAS1 and SAS2, GTP-binding protein genes in Dictyostelium discoideum with sequence similarities to essential genes in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 2367-2378 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1095 1706 GTP-binding protein (SAS1) BASE COUNT 929 a 249 c 238 g 727 t 2 others ORIGIN 1 gggaattatt aggacatcag gtttaaaacc tattcagaca ccagaataca atttgaattg 61 agcggcaacg ttcctttcac tctgcactac atcagcatta ttagagagaa aggttgaaaa 121 acctctatcg aaggtggtgg aattgctgag aagtaacagc aataaataaa acattcaaac 181 cgatagatga gaggttcaaa atccatctag ttagtagggc taaaaaacta caaatcataa 241 acccgatccg atacctaaga ctcctttttt tttttttttt tttttaataa atcaaataat 301 cacatgacct tggagtcttg gtctgcccac gaatttaaag tgcaaagttt attttattta 361 aactgggtgc atgcaaacat tactctatcg accgatttat ccaattttaa tactaaaatc 421 ttaaaaacca gaaagaanna ataataataa taataataat aataataata ataataataa 481 taataataat aataataata ataataataa taataataat aataataata ataataataa 541 taataataat aataataata ataataacaa ccttatttga aaattcaaat taaaaaaaaa 601 agaaatagct ttacatttta aaattaaaat tcataaataa aaccattata aaaatattga 661 agtatatcaa taggtttaat ttaattattg tttatttaat aaaaaaaaaa aaaaaaaaaa 721 aaaattattt aatcggttca atttaacttt ttcgaagaat tatttttttt aagaaaacat 781 ttcaacccaa aaaaataaaa aaaataaaaa aataaaaatt taaatcgaat ggttgaaatg 841 ttttcttaaa aaaacaaaaa ttaaaataaa ttttattttt tttgaattaa atttcaattc 901 agcaattcaa taattttaac gttttcactt catcaaaaat tataaataga atattaaaca 961 caacacaaca caactatcca aactaaaaca attaaaatca aaactctaat tttttataaa 1021 aatttattta ttttctcatc tcaataaaaa catttaaaaa cataattggt aatatagata 1081 tttttttcaa aataatgact tctccagcaa caaataaacc agcagcctac gattttttag 1141 ttaaattact tttaattggg gatagtggtg taggaaagtc atgtctttta ttacgttttt 1201 ctgatggttc tttcacacca agtttcatcg ctactattgg tatcgatttc aaaattcgta 1261 caattgaatt agagggtaaa agaattaaat tacaaatttg ggacactgca ggtcaagaaa 1321 gattcagaac tatcactaca gcatactatc gcggtgctat gggtatccta ttggtttatg 1381 atgtcactga tgaaaaatct tttggtagca ttagaaattg gattagaaat atcgagcaac 1441 atgcttcaga ctcagttaat aaaatgttaa tcggtaataa atgtgatatg accgaaaaga 1501 aagttgttga tagctcaaga ggtaaatcac ttgcagacga atatggtatt aaatttttag 1561 aaacttctgc caaaaacagt gtaaatgtag aggaagcctt tattggttta gcaaaagata 1621 ttaaaaaacg tatgattgat acaccaaatg atcctgatca taccatatgc attactccaa 1681 acaataagaa aaatacttgt tgttaaattg gggccatttt aattttcaca ttattagatg 1741 aaaaaaaaaa aaaaaaaaaa ctaaaattaa aagtaaaaaa cacttttttt tatttaaaaa 1801 tattattttt cattagtcat gaatggttac gtctaaacga tctaatattt ctctatagta 1861 gtgaattatt gcttcatgaa ttttagtgaa aagtttagct taataataat aataataata 1921 ataataataa taataataat aataataata ataataataa ataataataa caattttaaa 1981 attaaatatc caatgttgaa tattttaagt caaaaataat aataataatt ggaatgtatt 2041 ttaaaattaa aattcataaa taaactatta attattgttt attgccttta atggctaacc 2101 tattttttat agtttaaaaa taatttataa ttaatttttt taaat // LOCUS DDISAS2A 989 bp ds-DNA INV 10-JUL-1990 DEFINITION D.discoideum GTP-binding protein (SAS2) gene, complete cds. ACCESSION M34457 KEYWORDS GTP-binding protein. SOURCE D.discoideum DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 989) AUTHORS Saxe,S.A. and Kimmel,A.R. TITLE SAS1 and SAS2, GTP-binding protein genes in Dictyostelium discoideum with sequence similarities to essential genes in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 2367-2378 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 49 675 GTP-binding protein (SAS2) BASE COUNT 421 a 141 c 142 g 285 t ORIGIN 1 atcaatcaat aaactacaaa tttataatat agatattttt tcgaaataat gacttctcca 61 gcaacaaata aatcagcagc ctacgattat ttaattaaat tacttttaat cggtgatagt 121 ggtgtaggta aatcatgtct tttattacgt ttttctgaag attctttcac accaagtttc 181 atcactacta ttggtatcga tttcaaaatt cgtacaattg aattggaagg taaaagaatt 241 aaattacaaa tttgggatac tgcaggtcaa gaaagattca gaactatcac tacagcatac 301 tatcgtggtg ctatgggtat cctattggtt tatgatgtca ctgatgaaaa atcttttggt 361 aacattagaa attggattag aaatatcgag caacatgcta cagactctgt taataaaatg 421 ttaatcggta ataaatgtga tatggctgaa aagaaagttg ttgatagctc aagaggtaaa 481 tcacttgcag acgaatatgg tattaaattt ttagaaacct cagccaaaaa cagtataaat 541 gtagaggaag cctttattag tttagcaaaa gatattaaaa aacgtatgat tgatacacca 601 aatgaacaac cacaagttgt tcaaccaggt acaaatcttg gtgcaaataa caataagaaa 661 aaagcttgtt gttaaattgg gtgctatttt aattttcaca ttatattatt agataaaaat 721 aaaaaaaaaa aaaaaaatct taaaaaaaaa aaaaaaagtc atcaaaatta ttcacctaaa 781 aaaataacat ataaaccctg ggtttcaagg cagaggatga ttcacttaca acaacaacaa 841 caacaaccaa caacaacaac aacaaccaac aacaacaact aacaacaaca acaaataata 901 ataataataa aaataataat aataaatccc caagttgtga agttgtgttg aaattaataa 961 gagtgggagg tttatatcgc ataaataac // LOCUS HUMLAMBA 2850 bp ss-mRNA PRI 10-JUL-1990 DEFINITION Human lamin B mRNA, complete cds. ACCESSION M34458 KEYWORDS intermediate filament; lamin B. SOURCE Human T-cell line MOLT-4, cDNA to mRNA, clone LAM-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2850) AUTHORS Pollard,K.M., Chan,E.K.L., Grant,B.J., Sullivan,K.F., Tan,E.M. and Glass,C.A. TITLE In vitro posttranslational modification of lamin B cloned from a human T-cell line JOURNAL Mol. Cell. Biol. 10, 2164-2175 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 342 2102 lamin B mRNA < 342 2850 lamin B mRNA signal 2834 2839 polyA signal BASE COUNT 776 a 614 c 748 g 712 t ORIGIN 1 cgcgagcagg agacggcggc gggcgaaccc tgctgggcct ccagtcaccc tcgtcttgca 61 ttttcccgcg tgcgtgtgtg agtgggtgtg tgtgttttct tacaaagggt atttcgcgat 121 cgatcgattg attcgtagtt cccccccgcg cgcctttgcc ctttgtgctg taatcgagct 181 cccgccatcc caggtgcttc tccgttcctc taaacgccag cgtctggacg tgagcgcagg 241 tcgccggttt gtgccttcgg tccccgcttc gccccctgcc gtcccctcct tatcacggtc 301 ccgctcgcgg cctcgccgcc ccgctgtctc cgccgcccgc catggcgact gcgacccccg 361 tgccgccgcg gatgggcagc cgcgctggcg gccccaccac gccgctgagc cccacgcgcc 421 tgtcgcggct ccaggagaag gaggagctgc gcgagctcaa tgaccggctg gcggtgtaca 481 tcgacaaggt gcgcagcctg gagacggaga acagcgcgct gcagctgcag gtgacggagc 541 gcgaggaggt gcgcggccgt gagctcaccg gcctcaaggc gctctacgag accgagctgg 601 ccgacgcgcg acgcgcgctc gacgacacgg cccgcgagcg cgccaagctg cagatcgagc 661 tgggcaagtg caaggcggaa cacgaccagc tgctcctcaa ctatgctaag aaggaatctg 721 atcttaatgg cgcccagatc aagcttcgag aatatgaagc agcactgaat tcgaaagatg 781 cagctcttgc tactgcactt ggtgacaaaa aaagtttaga gggagatttg gaggatctga 841 aggatcagat tgcccagttg gaagcctcct tagctgcagc caaaaaacag ttagcagatg 901 aaactttact taaagtagat ttggagaatc gttgtcagag ccttactgag gacttggagt 961 ttcgcaaaag catgtatgaa gaggagatta acgagaccag aaggaagcat gaaacgcgct 1021 tggtagaggt ggattctggg cgtcaaattg agtatgagta caagctggcg caagcccttc 1081 atgagatgag agagcaacat gatgcccaag tgaggctgta taaggaggag ctggagcaga 1141 cttaccatgc caaacttgag aatgccagac tgtcatcaga gatgaatact tctactgtca 1201 acagtgccag ggaagaactg atggaaagcc gcatgagaat tgagagcctt tcatcccagc 1261 tttctaatct acagaaagag tctagagcat gtttggaaag gattcaagaa ttagaggact 1321 tgcttgctaa agaaaaagac aactctcgtc gcatgctgac agacaaagag agagagatgg 1381 cggaaataag ggatcaaatg cagcaacagc tgaatgacta tgaacagctt cttgatgtaa 1441 agttagccct ggacatggaa atcagtgctt acaggaaact cttagaaggc gaagaagaga 1501 ggttgaagct gtctccaagc ccttcttccc gtgtgacagt atcccgagca tcctcaagtc 1561 gtagtgtacg tacaactaga ggaaagcgga agagggttga tgtggaagaa tcagaggcga 1621 gtagtagtgt tagcatctct cattccgcct cagccactgg aaatgtttgc atcgaagaaa 1681 ttgatgttga tgggaaattt atccgcttga agaacacttc tgaacaggat caaccaatgg 1741 gaggctggga gatgatcaga aaaattggag acacatcagt cagttataaa tatacctcaa 1801 gatatgtgct gaaggcaggc cagactgtta caatttgggc tgcaaacgct ggtgtcacag 1861 ccagcccccc aactgacctc atctggaaga accagaactc gtggggcact ggcgaagatg 1921 tgaaggttat attgaaaaat tctcagggag aggaggttgc tcaaagaagt acagtcttta 1981 aaacaaccat acctgaagaa gaggaggagg aggaagaagc agctggagtg gttgttgagg 2041 aagaactttt ccaccagcag ggaaccccaa gagcatccaa tagaagctgt gcaattatgt 2101 aaaattttca actgtcttcc tcaaaataaa gaagtatggt aatctttacc tgtatacagt 2161 gcagagcctt ctcagaagca cagaatattt ttatatttcc tttatgtgaa tttttaagct 2221 gcaaatctga tggccttaat ttcctttttg acactgaaag ttttgtaaaa gaaatcatgt 2281 ccatacactt tgttgcaaga tgtgaattat tgacactgaa cttaataact gtgtactgtt 2341 cggaaggggt tcctcaaatt ttttgacttt ttttgtatgt gtgttttttc ttttttttta 2401 agttcttatg aggaggggag ggtaaataaa ccactgtgcg tcttggtgta atttgaagat 2461 tgccccatct agactagcaa tctcttcatt attctctgct atatataaaa cggtgctgtg 2521 agggagggga aaagcatttt tcaatatatt gaacttttgt actgaatttt tttgtaataa 2581 gcaatcaagg ttataatttt ttttaaaata gaaattttgt aagaaggcaa tattaaccta 2641 atcaccatgt aagcactctg gatgatggat tccacaaaac ttggttttat ggttacttct 2701 tctcttagat tcttaattca tgaggagggt gggggaggga ggtggaggga gggaagggtt 2761 tctctattaa aatgcattcg ttgtgttttt taagatagtg taacttgctt aaatttctta 2821 tgtgacatta acaaataaaa aagctctttt // LOCUS VIBANGRA 4379 bp ds-DNA BCT 10-JUL-1990 DEFINITION V.anguillarum trans-acting transcriptional activator (angR), S-acyl fatty acid synthesis thioesterase-like protein genes, complete cds, and outer membrane protien (omp), 3' end. ACCESSION M34504 KEYWORDS S-acyl fatty acid synthesis thioesterase-like protein; outer membrane protein; trans-acting transcriptional activatior. SOURCE V.anguillarum DNA, clone pJHC-A103. ORGANISM Vibrio anguillarum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 4379) AUTHORS Farrell,D.H., Mikesell,P., Actis,L.A. and Crosa,J.H. TITLE A regulatory gene, angR, of the iron uptake system of Vibrio anguillarum: Similarity with phage P22 cro and regulation by iron JOURNAL Gene 86, 45-51 (1990) STANDARD simple staff_entry COMMENT Fur protein is a product of the ferric uptake regulatory gene (fur). FEATURES from to/span description pept < 1 275 outer membrane receptor protein pept 361 3507 trans-acting transcriptional activator (angR) pept 3504 4262 S-acyl fatty acid synthase thioesterase-like protein (ORF6) binding 154 158 Fur binding site binding 348 353 ribosome binding site (put.) binding 3488 3493 ribosome binding site (put.) signal 34 39 -35a region (put.) signal 63 68 -10a region (put.) signal 109 114 -35b region (put.) signal 126 131 -10b region (put.) signal 3387 3392 -35c region (put.) signal 3405 3410 -10c region (put.) signal 3443 3448 -35d region (put.) signal 3456 3461 -10d region (put.) BASE COUNT 1320 a 914 c 902 g 1243 t ORIGIN 1 ggaacctacc agtgatgcgt caacttactc ttattggtca agcaaattac atgtcagagc 61 aatatattga tgcacaaaac actcaatcac tgtctgcaca gactattttt gatttaggtg 121 ctcgctataa ctctaccatc gccaatcaaa gtgtcatttg gcgtcttgcg gtcaacaacg 181 taaccgatga agcatattgg actaccaccc attacgctag ccttgcgttg ggtgcccctc 241 gtacggtgat gctatctgct acagcggatt tttaatctcg gtcaattttg cccttgacct 301 ttctggttaa gggcattcgt cttccccttc cccccatttg gctttttatg agaatttaga 361 atgaatcaaa atgaacatcc cttcgctttc cctgagacaa aattaccttt aacctccaat 421 caaaattggc agttatcaac ccaaagacag cgtactgaaa aaaaatcgat taccaatttt 481 acgtatcagg aatttgatta cgaaaacatt tcgagggaca cattagaacg ctgcctcaca 541 acaataatta agcatcaccc aatattcgga gctaagctca gtgacgactt ctacctccat 601 tttccgagca aaactcacat tgaaaccttt gcagttaatg acttaagtaa tgccttaaaa 661 caagatattg ataaacagtt ggccgatacg cgttctgcag taacgaaaag ccgctcacaa 721 gcgataatct ctatcatgtt tagtatattg ccaaaaaaca taatcaggct tcatgtacgc 781 ttcaactcag ttgttgtaga taatccaagt gttacgcttt tttttgagca gcttactcag 841 ttattatcgg gaagtcccct ttctttttta aatcaagaac agactatctc cgcatacaat 901 cacaaagtta ataatgagtt gcttagtgtt gatcttgagt ccgcaagatg gaatgaatat 961 attctaacac tacctagttc agcaaacctt cccacaattt gtgaacccga aaaactggat 1021 gaaaccgata tcactcgcag gtgcattaca ctgtcacaaa ggaagtggca gcagttggtt 1081 actgttagca aaaaacataa tgtcacaccg gagataactc ttgccagtat attttcgacc 1141 gttttatcac tctgggggca tcaaaaatac ctcatgatga gatttgatat caccaaaatc 1201 aatgactaca cgggcatcat aggccagttt accgaacctt tattagtggg tatgtccggc 1261 tttgagcaga gctttctttc tcttgttaaa aacaaccaaa aaaagttcga agaagcttat 1321 cattatgacg ttaaagtacc tgtttttcag tgtgttaata aattatctaa tatttcggat 1381 tctcaccgtt atcctgctaa tatcactttt tctagcgagc ttttaaacac aaaccatagc 1441 aaaaaagctg tatggggatg tcgtcaatca gccaatactt ggctttcttt acatgctgta 1501 atcgagcaag aacaacttgt cttacaatgg gacagccaag acgcaatctt cccaaaagac 1561 atgatcaaag atatgttaca tagttacacc gatttattag acttactcag ccaaaaagat 1621 gtcaactggg cacagccttt accaactttg ctgccaaaac atcaggagtc catacgcaat 1681 aaaataaatc aacagggaga cctagaacta actaaagaac tcctccatca gcgttttttt 1741 aaaaacgtag agtccacccc taatgctctt gcgattatcc atggtcaaga gtcattagat 1801 tatataactt tagcaagcta cgccaagagt tgtgcgggtg cactaaccga agctggagta 1861 aaatcaggag accgcgttgc tgtcactatg aataaaggca ttggtcaaat agtggcagta 1921 ttgggaatat tatatgctgg ggctatttat gttcctgtct ctctagatca accacaagaa 1981 aggcgggaaa gtatttatca aggtgctgga attaacgtta ttcttattaa cgaatcagat 2041 agtaaaaatt ccccttcaaa tgatcttttc tttttcctgg actggcaaac agcgataaag 2101 agtgagccaa tgcgtagccc tcaagatgtc gcgccaagtc aaccagccta tattatctac 2161 acatcaggct caacaggaac ccctaaggga gtggtgattt ctcaccaagg cgctcttaat 2221 acatgtatcg cgatcaatcg acgttatcaa attgggaaaa atgatcgagt attggctctt 2281 tcagcactac attttgacct ttcggtatac gacatctttg gcctactttc tgccggcggc 2341 actatcgtat tagtcagtga gcttgaaaga cgtgacccga ttgcttggtg tcaagcaatt 2401 gaggagcata atgtcaccat gtggaatagc gtcccagcat tatttgatat gttattaact 2461 tacgctactt gctttaactc tatcgctccc tcaaaactcc gtttaaccat gctttcggga 2521 gactggattg gattagattt accgcagcgt tatcgcaatt atcgtgtaga tggccaattt 2581 attgcgatgg gaggagccac cgaagcatcg atatggtcaa acgtctttga cgtagagaaa 2641 gttccgatgg agtggcgctc tatcccttat ggctatcctc tacctagaca acaatatcga 2701 gttgtcgatg acttggggcg agattgccca gattgggtag ctggcgaact ttggattggt 2761 ggtgacggta tcgcactggg gtattttgac gatgaattga aaacgcaagc tcagttttta 2821 catattgatg gccatgcttg gtatcgtact ggtgacatgg gctgttattg gccagatggt 2881 actcttgagt tcttggggcg aagagacaag caggtcaaag taggaggtta cagaattgag 2941 ttgggagaaa tcgaagttgc actcaataat ataccggggg tgcagcgtgc ggttgctatc 3001 gcagtgggca ataaagacaa aactctagca gcattcatcg ttatggattc ggagcaagca 3061 ccaatagtta cagcgccgtt ggatgcagaa gaagttcaac ttttgttgaa caaacaactg 3121 cctaactaca tggttcccaa acgcataatt ttccttgaaa ccttccccct aaccgctaat 3181 ggtaaagtcg atcataaagc tctaactcga atgactaacc gagaaaagaa aacatctcaa 3241 agcataaata aacctattat tactgcgagt gaagatagag tagccaaaat ttggaatgac 3301 gttcttggtc ctacagaact ctataaatcg agtgatttct ttttgtcggg aggagatgca 3361 tacaacgcaa tagaggtagt caaacgttgt cataaagctg gctatctaat caagctatca 3421 atgttgtacc gttattctac gattgaagct ttcgctatta tcatggaccg ttgtcgatta 3481 gcacctcagg aagaggctga gttatgagcc ctttaatcaa acttgcagcc tcttcgaggc 3541 tgcatgatgc aactcattat gttttatgcc cttttgcagg aggtggtagt ggtgcattta 3601 gacactggcg tacattatcc cttgaaaatg aagtgatttc ggtaatgctt tatcctggta 3661 gagaatttcg tatagacgac ccaacagtca taaacatcgg cacattagca gaagaaatga 3721 tccaagcttt aaaaacctgt aatcaacgaa tagaagatac gatcattgtc ggtcatagta 3781 tgggcgcgca agtggcgtat gaagcaagta aaaaactagt aaatcagggg ctatttctga 3841 aagggctgat catctctggt tgtcaagctc ctcatatcaa agggcgaagg ttactaggtg 3901 aatgcgatga taaaaccttt attcataatc tagtcgagat tggagggtgt gatccaagtt 3961 tagctaaaag tccagagtgg tggccgatat ttctgccagc tttgagggcg gactttacgg 4021 ctacagaaca gtatattttc acatcacttc caaatgataa ggaaggcctt cctatcccaa 4081 ctctattgat ttcaggtgat caagatagag aagctaactt ttcagaaata gaagagtgga 4141 aactttggtg taataaagtc gttgatcatt tagtggtcga gggcgggcat ttctatataa 4201 cagagcaacc tcaaatgatg cttgaatgca tccgggcttt atcaaccgaa acgactgcct 4261 aatactaagg ttcggttgat agatttttag acaaacaact tcaaacgaca agggtatgca 4321 tttaagcaat gcataccctg ggcttttcga tcaacactat tacttggttt ccggaattc // LOCUS VIBLUXABG 3200 bp ds-DNA BCT 10-JUL-1990 DEFINITION P.leiognathi luciferase alpha (luxA), beta (luxB) subunit, and gamma protein (luxG) genes, complete cds. ACCESSION M34564 KEYWORDS gamma protein; luciferase. SOURCE P.leiognathi (strain 554) DNA, clone pPHL[6,11,12]. ORGANISM Photobacterium leiognathi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 3200) AUTHORS Illarionov,B.A., Blinov,V.M., Donchenko,A.P., Protopopova,M.V., Karginov,V.A., Mertvetsov,N.P. and Gitelson,J.I. TITLE Isolation of bioluminescent functions from Photobacterium leiognathi: Analysis of luxA, luxB, luxG and neighboring genes JOURNAL Gene 86, 89-94 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 145 ORF1 pept 182 1246 luciferase alpha-subunit (luxA) pept 1295 2272 luciferase beta-subunit (luxB) pept 2293 2979 gamma protein (luxG) pept 3081 > 3200 ORF2 binding 171 174 ribosomal binding site (put.) binding 1284 1287 ribosomal binding site (put.) binding 2281 2284 ribosomal binding site (put.) binding 3072 3075 ribosomal binding site (put.) BASE COUNT 1092 a 616 c 586 g 906 t ORIGIN 1 tcgagcagcc attggcttag acagtgaagt gattgattta gttgatgata ttagtgagcc 61 aaactttgaa gatctcacca ttattacagt taatgaacgt cgtttgaaaa ataaaattga 121 aaacgaaatg ttcgctagcg cttaaaccaa tacctattca agtcatcaaa aggaaaagat 181 aatgaaattt ggcaatattt gtttctcata ccagccacca ggtgaatctc ataaagaagt 241 catggatcgc tttgttcgtc ttggcgttgc ttcagaagaa ttaaacttcg acaccttctg 301 gacacttgag caccacttca ctgaattcgg cctaacaggt aacttatatg ttgcttgtgc 361 caatattctt ggtcgtacca aaaaacttaa cgtcggcaca atgggtatcg tactaccaac 421 agctcaccct gctcgccaaa tggaagatct actgctactg gatcaaatgt caaaaggacg 481 ttttaacttt ggtgtagtac gtggtctata ccataaagat ttccgggtat ttggtgttac 541 gatggaagat tctcgttcga tcactgaaga tttccataaa atgatcatgg acggctctaa 601 atcaggcgtt ttacacactg atggtaaaaa cattgaattc ccagatgtaa atgtctatcc 661 agaggcctac ctagacaaga tccctacttg tatgacagcg gaatctgcgg cgacaacgac 721 ctggctagca gaacgtggtt tgccaatggt actgagctgg atcatcacca ccagcgagaa 781 aaaagcacag atggaactat acaatgaaat tgcagctgag catgggcacg atattcacaa 841 tatcgaccac agcatgacct tcatctgttc cgttaatgaa gatccagaaa aagcagaaag 901 tgtctgccgt gacttcctat caaactggta cgagtcctac accaatgcga ccaatatctt 961 taaagacagt aaccaaactc gtggttatga ctatcacaaa ggtcaatggc gtgactttgt 1021 actacaaggc cataccgata cccgtcgtcg tcttgattac agtaataacc taaaccctgt 1081 tggtacacct gaaaaatgta ttgaaattat ccagcgagat atcgatgcaa cagggatcaa 1141 caacatcacc cttggttttg aagcaaacgg ttctgagcaa gaaatcatcg catcgatgga 1201 acgcttcatg acacaagtgg cgccatacct aaaagatccg aaataaactg ccacattaaa 1261 gccattgaat taaattataa ataaggaaaa aaacatgaat tttggattat tctttctgaa 1321 ctttcagctc aaaggtatga catctgaagc agtactagac aacatgatcg atactattgc 1381 tttggttgat aaagacgagt accacttcaa aaccgcattt gtgaacgaac accatttttc 1441 taaaaacggt atcgttgggg cacctatgac agctgcaagt tttctactag gtttaactga 1501 acgccttcat attggttcat tgaatcaagt gatcaccact caccacccag tccgtattgc 1561 agaagaagct agcttacttg atcaaatgtc agatgggcgt tttattcttg ggttaagtga 1621 ttgtgttagt gatttcgaga tggacttctt taaacgccaa cgagatagcc aacaacaaca 1681 attcgaagcc tgttacgaaa ttctaaatga cggtatcact accaactact gttatgcgaa 1741 taatgacttt tataacttcc caaaaatctc tatcaaccca cactgtatta gtaaagaaaa 1801 cctaaaacag tatattttag cgaccagcat gggcgtggtg gaatgggctg cgaaaaaagg 1861 gttaccactg acttaccgct ggagtgatac gctggcagaa aaagaaaatt actatcaacg 1921 ttatttaact gtcgccgctg aaaataatgt cgacattact catgttgatc accaattccc 1981 attacttgtt aacattaatc cggatcgtga tattgctaaa caagaaatgc gtgactatat 2041 ccgtggttat attgctgaag cttacccaaa tacagatcaa gaagaaaaaa ttgaagagct 2101 aattaagcaa catgcggttg gtacagaaga tgaatattat gaatcatcta aatatgcttt 2161 agaaaaaaca ggttcaaaga atgtattgct atcttttgaa tcaatgaaaa ataaagccgc 2221 tgtcatcgac cttattaata tggttaatga aaaaatcaag aaaaatctat aataaataac 2281 aggataataa aaatgacaaa atggaattat ggcgtcttct tccttaattt ttaccatgta 2341 ggacagcaag agccatcatt aaccatgagc aatgcgttag aaacattacg tattatagat 2401 gaagatacat ctatctatga tgttgttgca tttagcgaac accacataga taaaagctac 2461 aatgatgaaa cgaaattagc gccatttgtt agccttggca aacaaattca tattttagcc 2521 accagccctg aaacggttgt aaaagcggct aaatatggga tgccactact gtttaaatgg 2581 gatgatagtc aacaaaagcg tatcgaatta ttaaaccatt accaagcagc tgcggctaaa 2641 tttaatgtcg atattgcagg tgttcgtcat cgattaatgt tatttgtcaa tgttaatgac 2701 aacccaacgc aagccaaagc tgagcttagc atttacttag aagattacct ctcttacacc 2761 caagcagaaa catccattga tgaaatcatc aatagcaatg ctgcaggcaa cttcgatacg 2821 tgtttacatc acgttgctga aatggctcaa ggtttaaata ataaagtcga tttcttattt 2881 tgctttgaat cgatgaaaga tcaagagaat aaaaaatcac taatgattaa ctttgataaa 2941 cgcgttatta attatagaaa agaacacaac cttaactaat tcagttaagt caatttaaat 3001 taaaacttcg tcaatcattg tcattattaa tggcagtgtg gcttcttacg ctgccattaa 3061 attttttatt aaggtgtaat atgactactt tattagatat tgatactaac gatattattg 3121 ttagttcaga actcgatgat attattttct catcatcacc gtttacatta acctttgatg 3181 agcaagaaaa attaaagcaa // LOCUS YSCSLP1A 3456 bp ss-mRNA PLN 10-JUL-1990 DEFINITION S.cerevisiae vacuolar function expression protein (SLP1) gene, complete cds. ACCESSION M34474 KEYWORDS . SOURCE S.cerevisiae, cDNA to mRNA, clone pYKK101. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3456) AUTHORS Wada,Y., Kitamoto,K., Kanbe,T., Tanaka,K. and Anraku,Y. TITLE The SLP1 gene of Saccharomyces cerevisiae is essential for vacuolar morphogenesis and function JOURNAL Mol. Cell. Biol. 10, 2214-2223 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 692 2767 SLP1 protein signal 536 544 TATA box BASE COUNT 1136 a 588 c 691 g 1041 t ORIGIN 1 ctgcagctaa tcacgtgctc acatctttac tcaatgagat tgatggtgtt gaagagttaa 61 agggtgtagt tattgtagcg gcgacgaata gacctgatga aatagatgct gctcttctaa 121 ggcctggtag gttagataga cacatttacg ttggccctcc agacgtaaac gcccgcttgg 181 aaatcttaaa gaagtgcaca aagaaattta atacagaaga gtctggagtc gatcttcatg 241 aattggcaga ccgtacagaa ggttattccg gagctgaagt tgtgctgctt tgtcaagaag 301 cgggcttggc tgccataatg gaagatttag atgtcgcaaa agtggaatta cgtcattttg 361 agaaagcttt taaaggaatt gctaggggca ttactccaga aatgctctct tattatgaag 421 agtttgctct aagaagcggt tcatcttcgt aagcttgttc atagtcaatt cttttccttt 481 gtgtgctcaa taatagtaga tagaaattat actgaactcc ggtcattttg tataatatat 541 taatcacttc acacgaacat acataaataa aatatcataa aggttagcaa attggaacta 601 gttatatgtt aattagttaa aagatagaaa attcgagaaa ggaagaaaaa gctgatattg 661 cccatctcca actttatcaa atcatttcac gatgaataga ttttggaata ctaagaaatt 721 ttcattaaca aatgccgatg gactatgtgc taccttaaat gagatatctc aaaatgatga 781 agttcttgtg gttcaaccaa gtgtattgcc agtactcaat agtttgctaa ctttccaaga 841 tttgactcaa tcaactcctg taaggaaaat tacgttactc gatgatcagc taagtgacga 901 tttaccgagt gccttaggca gcgttccgca aatggatctt atttttctta ttgatgtcag 961 aacatctctc cgactccctc cacaactgct tgatgctgct caaaagcaca atttatcatc 1021 tttgcatata atatactgtc gatggaaacc gtctttccaa aatactttgg aggatacaga 1081 gcaatggcaa aaggatggtt tcgatttgaa ttcaaaaaaa acacatttcc ctaacgtcat 1141 tgaatctcag ttaaaggagc tatcgaacga atataccctt tacccttggg atctcttgcc 1201 cttcccacag attgatgaaa atgttctatt gactcattcc ctttataaca tggaaaatgt 1261 aaacatgtat tatcccaact tacgttcttt gcagagtgcc acagagtcaa tactggttga 1321 tgatatggtc aattcgttgc agagcttgat ttttgaaact aatagtatca taacaaatgt 1381 tgtgtcgata ggtaatctgt ctaagagatg tagccatctt ttgaagaaac gaatcgatga 1441 gcatcaaaca gagaatgatt tattcatcaa gggtacgctt tatggtgaac gaaccaactg 1501 tggactagaa atggacttga ttatcttgga aaggaatacc gatcctataa cgccattgtt 1561 gacacaactt acgtatgcag gaatactaga tgatctatat gaattcaatt ctggcataaa 1621 gataaaggag aaagacatga acttcaatta taaggaagat aaaatatgga atgatttgaa 1681 atttttaaat tttgggtcga ttgggccgca gttaaataaa ttggcaaagg aactacaaac 1741 gcaatatgat acaaggcata aagccgagag cgtacatgaa atcaaagaat tcgttgattc 1801 cttaggttct ttgcaacaaa ggcaagcttt tttgaaaaat cacacaacct tatcatccga 1861 cgttttgaaa gtggtagaga ctgaagagta cggatctttc aataaaatct tagagttaga 1921 gctggaaatt ttgatgggaa atacacttaa taacgacatt gaagatatta tactcgagtt 1981 gcagtaccag tacgaggttg atcaaaagaa gattctcaga ttaatctgtt tattgtctct 2041 ttgtaaaaat tcacttcgag aaaaggatta tgaatatcta agaaccttta tgatcgactc 2101 ttggggcatt gaaaaatgct ttcaacttga atcattggct gagttaggat ttttcactag 2161 caaaacggga aaaactgatt tgcatattac aacaagtaag tcaacaagat tacagaaaga 2221 ataccgttat atttcacaat ggttcaatac agtacccata gaagacgagc atgctgccga 2281 taaaatcaca aatgagaacg atgacttctc ggaagccact tttgcttaca gtggtgtagt 2341 gcccttgaca atgagactgg ttcagatgtt atatgatagg tctatcttgt tccataatta 2401 ttcctcgcag cagcctttta tactgtcaag agaacctaga gtttctcaaa cggaggattt 2461 aattgaacag ttatatggag actcacatgc gatcgaagag agtatatggg tcccgggaac 2521 cattacaaaa aagatcaatg caagcatcaa gagcaataat agacggtcca tagacggatc 2581 taatgggaca tttcatgctg cagaggatat tgcactcgta gtattcctcg gaggtgtaac 2641 aatgggtgaa atagctataa tgaagcattt gcaaaaaata ctaggtaaaa aaggtatcaa 2701 taaaaggttt atcatcatcg ccgatggctt gatcaatggc acaaggatca tgaactctat 2761 atcttaatta ttatatgata gatttgttaa ttttttgtat atgcaaatgt gcttttttca 2821 ccaaacggtt tgcaccaatc atacgagaga agtgttcggt gtttacggaa aagctagggg 2881 actaagaaaa attgaaaata aaggctgaca gcagtagaaa ccattgtgct ggcttagtga 2941 tttataagaa tggttaatta gttttgtatc ctttattttc tagatagagc cacagagcaa 3001 actaaacaga aaagttatcc atttccatta cgcaatgttg tgccaacaga tgattagaac 3061 gacagctaag agaagtagca atatcatgac cagacctatt atcatgaaga ggtcagtaca 3121 cttcaaagac ggtgtgtatg aaaatatccc attcaaagtc aaaggaagaa agacacctta 3181 cgccttatct catttcgggt tcttcgctat tggatttgct gttccatttg ttgcctgcta 3241 tgttcaattg aaaaagtcag gtgcttttta aaacaccccc ctaagttgaa ggatagatgt 3301 gtgtacatag cgtgcttggt tgagacgttt tagagtgtgt tctttgctat tcctaggtgc 3361 gcatatcatc gttttattta tttgtacaat tttcttttca tatattcata atcctctcct 3421 tgtgccttcg tattgagacg gcgggaaaga aggatc // LOCUS CHKMHBLBA 2405 bp ds-DNA VRT 10-JUL-1990 DEFINITION Chicken MHC class II B-LBII-beta gene, complete cds. ACCESSION M29763 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Chicken (haplotype B12) DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2405) AUTHORS Zoorob,R., Behar,G., Kroemer,G. and Auffray,C. TITLE Organization of a functional chicken class II B gene JOURNAL Immunogenetics 31, 179-187 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 16; 2332 to 2405) AUTHORS Zoorob,R., Behar,G., Kroemer,G. and Auffray,C. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Zoorob, 06-NOV-1989, for release after publication. Author address: R.Zoorob Institut d'Embryologie Cellulaire et Moleculaire du CNRS et du College de France 49bis av. de la Belle Gabrielle F-94736 Nogent sur Marne France FEATURES from to/span description pept 828 918 MHC B-LII-beta chain, exon 1 1127 1396 MHC B-LII-beta chain, exon 2 1483 1764 MHC B-LII-beta chain, exon 3 1847 1957 MHC B-LII-beta chain, exon 4 2049 2072 MHC B-LII-beta chain, exon 5 2175 2188 MHC B-LII-beta chain, exon 6 pre-msg 808 2331 MHC B-LII-beta chain mRNA and intron IVS 919 1126 MHC B-LII-beta chain intron A IVS 1397 1482 MHC B-LII-beta chain intron B IVS 1765 1846 MHC B-LII-beta chain intron C IVS 1958 2048 MHC B-LII-beta chain intron D IVS 2073 2174 MHC B-LII-beta chain intron E BASE COUNT 380 a 728 c 902 g 395 t ORIGIN 1 ggatccatgg gtgacgtaag gatgaggttc cagcacatat tggacccttc tgcgtttgca 61 tggagggatc ttcgggggat ctttgtgatc ttcagtgatt ttcagtggtc tttggtggtc 121 ttcagtgctc ttcgttggtc tttgacaaag atgcagagga gcaccgctcc cagacggacc 181 ccccggggac cccatttgtc gccatcccca ctgggacatg cagccattga ccacagccct 241 ccggctgcga ccacccaact gattccttat ccaaagtcca ctctttgcac acttacctcc 301 aatttagtga taaggatgtg gcgtgggacc gtcccaatgg ccgcacacaa gtccaggtag 361 atgatatggg atgaccatga agggatcaca gagaggaaca cggggtgacc acgaggagca 421 acgaaggaaa cgctgagtga ccacgggcag aaaatggtgt gaccattagg ggacaacgag 481 agggaacaga agtagtaagg agtgagaatg gggtgacaaa gaggtgacca tggcataact 541 ttgataagac cattgggtga ccgcagggtg atggccatac catggggtga gcactggatg 601 accatggagg tcattggagg accatcgggt gggacgaggg ccgtggggac acccgtgggg 661 cggtgggacg ggggcagagt gtcagaagga gccccgcggc gcagaactct gcctggagac 721 gggtgacgcc gcccggcgcc gccgccgctc attggccctc cccgcccggc cccgggctcg 781 cggctggcgc ggggtgccgg gtcccccatc gtccggcggc agcagccatg gggagcgggc 841 gcgtcccggc ggcgggggcc gtgctggtgg cactgctggc gctgggagcc cggccggccg 901 ccggcacgcg gccctcgggt gagctcggag ccgcggcgcg gggacggcgc tgcgtccccc 961 ccggagaaac ccccggagcc cttctggccg tgcgcagcgc tcggggctgc ggggggacgg 1021 agggcggggg ggggcggcgg agccgtgggg ggcagcgggg ccggggaggg ggcggggggt 1081 gtggcggggg gcggctgtgt gccctgaccg tgccctctgc ccgcagcgtt cttcttctgc 1141 ggtgcgatat ccgagtgcca ctacctgaac ggcaccgagc gggtgaggta tctgcaaagg 1201 tacatctaca accggcagca gttcacgcac ttcgacagcg acgtggggaa atttgtggcc 1261 gattcaccgc tgggtgagcc gcaagctgaa tactggaaca gcaacgccga gcttctggag 1321 aaccgaatga atgaagtgga caggttctgc cggcacaact acgggggtgt ggagtccttc 1381 acggtgcaga ggagcggtga gtgccgcggg gcgcagcgcg gacggacggg caggcgccgc 1441 gctctggcgg tcggtccgca gcgctccccc cgtgccccgc agtggagccc aaggtgaggg 1501 tctcggcgct gcagtcgggc tccctgcccg aaaccgaccg tctggcgtgc tacgtgacgg 1561 gcttctaccc gccggagatc gaggtgaagt ggttcctgaa cgggcgggag gagacggagc 1621 gcgtggtgtc cacggacgtg atgcagaacg gggactggac gtaccaggtg ctggtggtgc 1681 tggagaccgt cccgcggcgc ggggacagct acgtgtgccg ggtggagcac gccagcctgc 1741 ggcagcccat cagccaggcg tggggtaagg cccccgggcc ctgccccgcc gcggggggag 1801 cgggagcgcg gcccgccgcg ctgagccgcc gccttcgtcc ccgcagagcc gccggcggac 1861 gcgggcagga gcaagctgct gacgggcgtg gggggcttcg tgctggggct cgtcttcctg 1921 gcgctggggc tcttcgtgtt cctgcgcggt cagaaaggtg agcgctgggg aggggggctg 1981 cgcggggggg gtcgggagcg gggggtgggg ggcagcgtcc gcgctgacct cgtctcgctg 2041 tgtttcaggg cgccccgtcg ccgccgctcc aggtaacgtc ccgttcccat tcccgttccc 2101 gttcccgttc ccgttccgcg ctgcgagcgg ccccgatccc ggcgcggggc tcagctctgc 2161 ccgtctcccc gcagggatgc tgaattagct gctgccccgc cgagccgctg cacccgcacc 2221 ccccgctctc ccggccgtcg cctcggctct ccctcgggct gccaccgcgt ccgttggaga 2281 tgtcgccacg atgcacgctt cgtccccatc ctaataaacg cgctgacttt gaccccgctg 2341 ttcgctgccc gtgaatcatt ggggactttc cgtcgcgtgg gaggagggga gggaagtgaa 2401 agctt // LOCUS CHKMHBLIIB 444 bp ds-DNA VRT 10-JUL-1990 DEFINITION Chicken MHC class II B-LBIII-beta gene, exon 1. ACCESSION M29764 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Chicken (haplotype B12) DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 444) AUTHORS Zoorob,R., Behar,G., Kroemer,G. and Auffray,C. TITLE Organization of a functional chicken class II B gene JOURNAL Immunogenetics 31, 179-187 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Zoorob, 06-NOV-1989, for release after publication. FEATURES from to/span description pept 125 / 215 MHC B-LIII-beta chain, exon 1 pre-msg 105 > 444 MHC B-LIII-beta chain mRNA and intron IVS 216 > 444 MHC B-LIII-beta chain intron A BASE COUNT 50 a 156 c 176 g 62 t ORIGIN 1 ctgatcgggg tacccgcaac ggagatctgc ctggagacgg gtgatgccgc ccagcccagg 61 cactcactgc tccagagcag cggcgcgggc tgccggcacc cttcctcctc ctccggcagc 121 agccatgggg agcggccgtg tcctggtggc cggggccgtg ctggtagcac tggtggcgct 181 gggagcacgg caggccgccg gcacgcggcc ctcaggtgag ctcggagtcc cggtgtgggg 241 atggtgcagg gtggtccctc ccggtgtctc ccggcgccca ccccagcccc gtgcgcagcg 301 ctcggagctc cgcggctcag gatgccggcg acagcgcgtc cgcagccgtc gtgggcgtgg 361 ggggcacggg acggagcgcg gacgggagtg gctttcgggt ctgccgaggg gcagctggct 421 cctgacggtg ccccctcccc gcag // LOCUS RATLY6A 1221 bp ds-DNA ROD 10-JUL-1990 DEFINITION Rat Ly6-A antigen gene, exon 2. ACCESSION M30692 KEYWORDS antigen. SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA and DNA, (library of Clontech), clone RK6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1221) AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U. TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated from rat kidney JOURNAL Immunogenetics 31, 104-111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Friedman, 14-DEC-1989, for release after publication. FEATURES from to/span description pept / 497 834 Ly6-A antigen, exon 2 (put.) (AA at 499) IVS < 1 496 Ly6-A intron A (no splice consensus) signal 1204 1209 poly-A signal BASE COUNT 286 a 315 c 314 g 306 t ORIGIN 1 gtagtccggc tgctggctga gttgtaaggc aggagggagg ctgggtgtgt tttgtcttgc 61 atgtagccct ctctgcagag ggcctggctt cactcacaca agcctggtaa catctggtac 121 atcgaactct aagaatcggc aagcccactg ctgccgtctc cttaagagtt catttaggga 181 gtctgtcagg aacttgggca ggagtccaca ctaagggaag cttacttccc aaacagtggt 241 gctgggtgga aagtggagga ctcatgagaa cccctagttt aagactttta gagaagcagt 301 ctgaagcact gtggagatgt ggtcccatcg ccatcctgga gtagggataa ttttgcccag 361 gagccccagc aatgggtcag aggagcaaaa cgacgacagc tgtaagtggt ctcagaagat 421 gctagaggaa acagaagatg aactggcagc tgagacttgg cggtaactta ctggcttcga 481 cactatgcgt gttactctca gggcctaaac tgctacaatt gcacgatgat cccatttggt 541 aatacctgct catcaactgc tacctgcccc taccctgatg gagtctgtgc tattcaggtg 601 gcagaagttg ttatgagctc tgtaagacag aaagtaaagg accatatttg ccttcccgtc 661 tgcccaacga gtcctcaaac aaccgagatc ctgggtactg ttgtcgacat gaagatttcc 721 tgttgcaata cagatctttg caacgcagca gggcccactg gaggcagcac ctggaccatg 781 gcaagggtgc ttctgttcag cctgggctca ttcctcctgc agaccttgct gtaatggctc 841 ctccaaggcc ccgccaccct tgtcctttta tcctcatgtg taatcactcc tccctggagc 901 cctctagtga taaattctga gtaatagaaa ctctgaggtg ggggtagggt gtggaacacc 961 ttgtttcaac tctatagccc ctgctgggta ggtgccccac tcccctctct agggctttca 1021 gatatgtact tcctggaatg ccattatgtt gtggtttgct gctcttggcc ctggaggcat 1081 gtggacagca cggggaagag acagaaaccc aaggcactgt gtgaccacct ccatccatac 1141 ataaaaatct ggggtcctgc agggttccca cacatgcctc tcaacatccc cctatttgag 1201 tccaataaac tctctgttct c // LOCUS RATLY6B 905 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat Ly6-B antigen mRNA, complete cds. ACCESSION M30689 KEYWORDS antigen. SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA, (library of Clontech), clone RK10. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 905) AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U. TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated from rat kidney JOURNAL Immunogenetics 31, 104-111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Friedman, 14-DEC-1989, for release after publication. FEATURES from to/span description pept 103 510 Ly6-B antigen (put.) signal 878 882 poly-A signal signal 599 604 poly-A signal BASE COUNT 206 a 248 c 213 g 238 t ORIGIN 1 ctcttgctct cctccagcca caagtggtct cagaagatgc tagaatgtag aggaaacaga 61 agatgaactg gcaggttttg cctgtgcgcc ccttctcaga ggatgaacag atcttgtgct 121 atgaagtcct gtgtgctcat ccttctcctg gccctactgt gtgcagaaag agctcagggg 181 ctaaactgct acaattgcac gatgatccca tttggtaata cctgctcatc aactgctacc 241 tgcccctacc ctgatggagt ctgcactatt caggtggcag aagttgttgt gagctctgta 301 agactgaaag taaagagcaa tctctgcctt cccggctgcc ccaagagtcc tcaaacacct 361 gaggtcctcg gtaccgttgt ccatgtgaat actgactgtt gcaatacaga tctttgcaac 421 gcagcaggtc ccactggagg cagcacgtgg accatggcag gggtgcttct gttcatcctg 481 ggctcagtcc tcctgcagac cttgctgtga tggaccctcc aaggccctgc cacccttgtc 541 cttttatcct tatgtgtaat cactccttcc tggagccctc tagtgataaa ttctgagtaa 601 taaaaattca gaggggggat tgagtgtgga acaccttgtt gcaactctat agccactgct 661 ggataggttc cccactcccc tctctagggc tttcagatat gtacttccta gaatgccatt 721 gtgttttggt ttgctgctct tggccctgga ggcaggggac agcacgggga agaggcagaa 781 acccaaggca ctgtgacacc acctccatcc atacataaaa atctggggtt ctgcagggtt 841 cccacacatg cctctgaaca tccccctatt tgagtccaat aaactctctg ttctcccacg 901 gaatt // LOCUS RATLY6C 931 bp ds-DNA ROD 10-JUL-1990 DEFINITION Rat Ly6-C antigen gene, complete cds. ACCESSION M30690 KEYWORDS antigen. SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA and DNA, (library of Clontech), clone RK3. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 931) AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U. TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated from rat kidney JOURNAL Immunogenetics 31, 104-111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Friedman, 14-DEC-1989, for release after publication. FEATURES from to/span description pept 76 262 Ly6C antigen, exon 1 (put.) 342 559 Ly6C antigen, exon 2 (put.) IVS 263 341 Ly6C antigen intron A (no splice consensus) signal 927 931 poly-A signal BASE COUNT 202 a 257 c 224 g 248 t ORIGIN 1 gccctgggac gtaattggaa gtctattaac tggctccaat ttccaaggtt ttctctgtgc 61 accccttctc tgaggatgaa cagttcttgc gctatgaagt cctgtatgct catctttttc 121 ctggccctac tgtgtgcaga aagagctcag ggcctaaagt gctacagttg catagaagtc 181 ccacttaatg ctaactgctc aacagctacc tgcccctact ctgatggagt gtgtgtttct 241 caggtgttag aagctgtaga gggtctccta gatgcaactt cccagggaac tgcaagagtc 301 tgagaggctg gttgcccttt ttgctctgcc actgagtgat cgctctgtaa gacggacagc 361 aaagagcaat ctctgccttc caatctgccc caagtttcct caaagaaccg agatcctggg 421 taccgttgtc tacacgaagg tttcctgttg caatacagat ctttgcaatg cagcaggtcc 481 cactggaggc agcacctgga ccgtggcagg ggtgcttctg ttcagcctgg gctcagtcct 541 cctggagacc ttgctgtgat ggcccctcca aggccccgcc acccttgtcc ttttagcctc 601 atgtgtaatc actcctctga agccctctag tgataaattc tgagtaatag aaactcccag 661 gtgggggtag ggtgtggaac accttgattc aactctatag cccctgctgg gtaggtgccc 721 cactcccctc tctaggactt tcagatctgt acttcctgga atgccattgt gttgtggttt 781 gctgctcttg gccctggagg cacatggaca gcacagggaa gaggcagaaa cccaaggcac 841 tgtgacacca cccccatcca tacataaaaa tctggggttc tgcagggttc ccacacatgc 901 ctctcaaggt tcccctattt tagtccaata a // LOCUS RATLY6CA 783 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat Ly6-C antigen mRNA, exon 2. ACCESSION M30691 KEYWORDS antigen. SOURCE Rat (strain Sprague-Dawley) adult kidney, cDNA to mRNA, (library of Clontech), clone RK11. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 783) AUTHORS Friedman,S., Palfree,R.G.E., Sirlin,S. and Haemmerling,U. TITLE Analysis of three distinct Ly6-A-related cDNA sequences isolated from rat kidney JOURNAL Immunogenetics 31, 104-111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Friedman, 14-DEC-1989, for release after publication. FEATURES from to/span description pept / 1 403 Ly6-C antigen, exon 2 (put.) (AA at 2) signal 771 783 poly-A signal BASE COUNT 173 a 219 c 187 g 204 t ORIGIN 1 gaacagttct tgcgctatga agtcctgtat gctcatcttt ttcctggccc tactgtgtgc 61 agaaagagct cagggcctaa agtgctacag ttgcatagaa gtcccactta atgctaactg 121 ctcaacagct acctgcccct actctgatgg agtgtgtgtt tctcaggtgt tagaagctgt 181 agagggctct gtaagacgga cagcaaagag caatctctgc cttccaatct gccccaagtt 241 tcctcaaaga accgagatcc tgggtaccgt tgtctacacg aaggtttcct gttgcaatac 301 agatctttgc aatgcagcag gtcccactgg aggcagcacc tggaccgtgg caggggtgct 361 tctgttcagc ctgggctcag tcctcctgga gaccttgctg tgatggcccc tccaaggccc 421 cgccaccctt gtccttttag cctcatgtgt aatcactcct ctgaagccct ctagtgataa 481 attctgagta atagaaactc ccaggtgggg gtagggtgtg gaacaccttg attcaactct 541 atagcccctg ctgggtaggt gccccactcc cctctctagg actttcagat ctgtacttcc 601 tggaatgcca ttgtgttgtg gtttgctgct cttggccctg gaggcacatg gacagcacag 661 ggaagaggca gaaacccaag gcactgtgac accaccccca tccatacata aaaatctggg 721 gttctgcagg gttcccacac atgcctctca aggttcccct attttagtcc aataaactct 781 ctg // LOCUS RATTAG1 5040 bp ss-mRNA ROD 10-JUL-1990 DEFINITION Rat axonal glycoprotein (TAG-1), mRNA, complete cds. ACCESSION M31725 KEYWORDS glycoprotein. SOURCE Rat 13 day old embryo spinal cord axon, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5040) AUTHORS Furley,A.J., Morton,S.B., Manalo,D., Karagogeos,D., Dodd,J. and Jessell,T.M. TITLE The axonal glycoprotein TAG-1 is an immunoglobulin superfamily member with neurite outgrowth-promoting activity JOURNAL Cell 61, 157-170 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Furley,A.J.W., 30-JAN-1990, for release after publication. FEATURES from to/span description pept 224 3346 axonal glycoprotein (TAG-1) precursor sigp 224 313 axonal glycoprotein signal peptide matp 314 3343 axonal glycoprotein BASE COUNT 1144 a 1486 c 1397 g 1013 t ORIGIN 1 gaattcccgc ccgctgccgc cacgccagga cagccagtgg ctaaggccgg cggggcaagc 61 agccctgagg ctggcagcag ggtctgctca ccaggcggcc gcagcagtgc cccagccaac 121 acccttcccg cactctaggt gtgcctgagt ctccagttga ttctcccgga gcggagctgc 181 ggctcctctc ttttggactc tgcctctgcc tgaaagaccc accatgggga cacacgccag 241 gaaaaaggca agcttgctgc tgctggtgct ggccacagtg gccctggtct cctctccagg 301 atggagtttt gcccagggaa ccccagctac ctttggaccc atcttcgaag agcaacccat 361 tggcctgcta ttcccagagg agtctgcaga ggatcaggtg acactggcgt gccgtgcccg 421 tgctagccct ccagccacct acaggtggaa gatgaatggc acagatatga acctggaacc 481 tggctcccgt caccagctga tggggggcaa cctggtcatc atgagcccca ccaagacaca 541 ggatgctggt gtctaccagt gcctagcctc caacccagta ggcactgtgg tcagcaagga 601 ggctgtcctc cgctttggct ttctacagga attctccaag gaggagagag accctgtgaa 661 aacccatgag ggctggggag tgatgctgcc ctgtaacccg cctgcccatt acccaggttt 721 gtcctaccgc tggctcctca acgagttccc caacttcatc ccaacggatg ggcgacactt 781 cgtgtcccag actacaggaa acctgtacat cgcccggacc aatgcctcag acctgggcaa 841 ctactcttgt ttggctacca gccacatgga cttttccacc aagagtgtct tcagcaaatt 901 tgcgcagctc aacctggctg cggaagatcc ccgactcttc gctcccagta tcaaagctcg 961 gttccccccg gagacctacg cactagttgg gcagcaagtc accctggagt gctttgcctt 1021 tgggaacccg gttccccgga tcaagtggcg caaagtggat ggttccttgt cccctcagtg 1081 ggccacagct gagcccaccc tgcagatccc cagcgtgagc tttgaagacg agggtaccta 1141 tgaatgtgag gcagagaact ccaagggtcg tgacaccgtc cagggacgca tcatcgtgca 1201 agctcagcct gagtggctaa aggtgatctc agacacagag gccgacattg gctccaactt 1261 acgttggggc tgtgcagcag caggcaaacc ccggcccatg gtgcgctggc tgagaaacgg 1321 ggaacctctg gcctcccaga accgggtgga ggtcttggct ggggacctgc gattctctaa 1381 gctgagcctg gaggactctg gcatgtacca gtgtgtggct gaaaacaagc atggcaccat 1441 ctatgccagt gctgagctgg ctgtacaagc tctggcccca gacttcaggc agaaccctgt 1501 gagacggctg atccctgcag ctcgaggcgg agagatcagc atcctgtgcc agcctcgcgc 1561 agccccaaaa gctacaatac tttggagcaa gggtactgag attttgggga acagtaccag 1621 agtgactgtc acttccgatg gcaccttgat catcagaaac atcagccgat ccgatgaagg 1681 caaatatacc tgctttgctg agaacttcat gggcaaagcc aacagtaccg ggatcctgtc 1741 cgtgcgcgat gcaaccaaga tcaccctggc tccctccagt gctgacatca acgtgggtga 1801 caacctgacc ctacaatgtc atgcctcgca cgaccccact atggacctca cgttcacctg 1861 gaccctggat gatttcccta ttgactttga taagcctgga ggtcactacc ggagagccag 1921 tgcgaaggag accattgggg acctgactat cctcaacgcc cacgtacgcc atggagggaa 1981 gtacacatgc atggcccaga ctgtggtaga tggtacatcc aaggaggcca cagtcctggt 2041 ccgaggtccc ccaggtcccc cagggggtgt ggtggtgaga gacatcggag acaccaccgt 2101 tcagcttagc tggagtcgtg gctttgacaa ccacagcccc attgccaagt acacgctgca 2161 agctcgtact ccaccctcgg ggaaatggaa gcaggttcgg accaatcctg tgaatatcga 2221 gggtaatgcc gagactgccc aggtgctggg tctcatgcct tggatggact atgagtttcg 2281 ggtttcagct agcaacatct tgggcactgg ggagcccagc gggccctcca gcaaaatccg 2341 cactaaggaa gcagtcccct cagtggcacc atcgggactc agtggagggg gaggagcccc 2401 tggagagctc atcatcaact ggactcccgt gtcacgggag taccagaacg gagacggctt 2461 cggctacctg ctgtccttcc gcaggcaagg cagctccagc tggcagactg cccgggtgcc 2521 tggcgctgat gcgcagtact tcgtctacgg caatgacagc atccagccct acacaccctt 2581 tgaggtcaag atccgaagct acaatcgccg gggggatggg cccgagagcc tcactgcgtt 2641 agtgtactca gcagaggaag agcccagggt ggcccctgcc aaggtctggg ccaaggggtc 2701 ctcatcttca gagatgaacg tgagctggga gcctgtgcta caagacatga acggcattct 2761 cctgggatat gagattcgct actggaaagc cggggacaac gaagcagccg ctgaccgagt 2821 gaggacagca gggctagaca ccagtgcccg agtcactggc ctgaacccca acaccaaata 2881 ccacgtaact gtgagggcct acaaccgggc cggcactgga cccgctagcc cttcagctga 2941 tgccatgacc gtgaagcccc cgccacggag acctcctggc aacatctcct ggactttctc 3001 aagctccagt ctcagcctta agtgggaccc tgtggttcct ctccgaaatg aatctacggt 3061 cactggctac aagatgctgt atcagaatga tttgcaccca actcctacgc tccacctcac 3121 cagcaagaac tggatagaaa taccagtacc cgaagacatt ggccacgctc tggtacagat 3181 tcgaaccaca gggcctggag gggatgggat ccccgcagaa gtccacattg tgagaaatgg 3241 aggcacaagc atgatggtgg agagcgccgc cgcccgccct gcccatcccg gacctgcgtt 3301 ctcctgcatg gtgatattga tgctcgctgg ctaccagaag ctctgatctc aacactgccc 3361 gccacgccca agctggacac ccaccctaac agacacagcg gctgaccaca gctccctttc 3421 gtccaaggtg gtccaacact gtgcctgagc gtggttggct tagacaccta ctcccaacag 3481 taccctttat gtaggaggta ggatattcct attctgccac aggatagaac catgcgagga 3541 aattttcttt aagtcaagag gcactgggca gtgacttcca tgataatagt actaggccta 3601 atgcctggac cccttggggt cttggtcgaa aggaacgggc ctttgattaa gcagatggtc 3661 ctttggggcc acaagtggca ctgccatctg agatcagagt accaggccca gcaggaacat 3721 gggcagcagt ggggtattgt tttccctcta tgaagcagag ggacctcttc tagtcctcac 3781 tggagaagca ccatggttgg tcccgacacg gtcttccatg actccctggc ttcctcggta 3841 gccaaggaca aggccctggg ttactgggga tagaagctca aaagggttga gaggctaccc 3901 cacccgatgg aaaggggcac cagcctaagc ccattggcca tcctggtggc actgccctct 3961 cagccagcac tgccaagcca atcctgtcgt cctccagatg gaatggtgga gtgacagagc 4021 cacttcaggt ggctatgtga ctaaagggct tgcctcgagg agttgccttg cctcatcaag 4081 atgcttcctt catggaccct ccagggtacg ggcaggagat gtccatctga acgctactct 4141 cttcccttca gctctgctgc aaacttgtgc ctgcctccac ctcccacaac tgcaggcccc 4201 agaaatcagc tctcaacaca gcatccattc tttgtcctgg gatagagagg catccgagaa 4261 gggccagcat caaagtggcc ctgcctgctt ccaggaatat cctccatcac ctggccacac 4321 ctgctcccca gaactgcctg gactactctc ttcagtcccc acaagaaaaa gggttaataa 4381 gggggggggg ggtggcctgc cttgagttct gggtagttac cagggataga ccagactacg 4441 ggagctgaag aagccttata acttgactta tccgtaccct acacttaaca gacgaggaaa 4501 tggaggtgca gaagggttag ggacttcttg ggggtcacat ggtctgtaag gacaaggcat 4561 ggtcagcaca gggtctcctc cccacctgtg ggaggctcta tagagagagg gaggatgttg 4621 agcagtcaca gcctgtcctc taggactctg gaggactctg gaggaggagc cctctgcttc 4681 aagaggttct ggctggtgag atggacaaat gagctccaac caaggcatag gcagattcca 4741 ggagtcaatg gcctggggca gccttctgct gggaactcgg cagggagcac tgtctggaag 4801 cctctcgggc ttgctcattt caagaagagg ccaaagcaag gacagagttc cttagacgag 4861 gaccctgcag cagcacgacc agaaaacccc agtgtccacg ccctcagccc acgggggcag 4921 cagagcaggc atttcaagat gcacttgccc tgctgctcct taggccattt ctgtagttta 4981 cagttagagc tctattttgt tatgggtttt taaacttcaa gccttgctct gtttttctgg // LOCUS MUSADAM01 2308 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 1 (non-coding). ACCESSION M34242 J04767 KEYWORDS adenosine deaminase. SEGMENT 1 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2308) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pre-msg 885 > 2308 adenosine deaminase (ADA) mRNA and introns IVS 1008 > 2308 ADA intron A BASE COUNT 479 a 615 c 698 g 516 t ORIGIN Chromosome 2. 1 cccacctcaa ggtgcgcaca agttacttaa ggaacttgct acaatatagc cctgctcccg 61 cccccaaaat cccaccaaac ctagagtatg gttctaaaca gctcacctgt taagtctcct 121 tggccaatcc tctagaagtt gaccatagta tgaagttttc tgcagcgtag tttttttctg 181 cccccctttc actactgtgt ctgagcacat gtgctgtgct ttgtagctga aactggcttt 241 attgctgcag aaaccagtcc actgtattta cccacagcac tgatgtgagc attctaaata 301 catctcgatg cgtgggcata tttatccagc gtaactgccc caggagagat gaactgtgtg 361 ttcctgtcca ccccctgtat cagcacctga gactagtctc agagtctctc tcacacacaa 421 cagtgttctc tgcatcccac ccgccctcac ctggtgaact ccggcagtcg ccgctaaatc 481 tccctaatta cacacttctt ctgccttgtg attctgcaac aagtgggtct atccctcaaa 541 atccagcccc ataaggcttc aggactgtgt ggctccagct tcagcctgca caaagtaggc 601 gcccaagcaa cactggaagc ctcggtactg aaggggcccg gaaggggcag gtgagacatt 661 ggagtcacgt ctgcaggggg ctcacctggg agcttcctag ggtgtagcca gcagggaagg 721 tctggggttc agaattccgg gaaatgcgcg ccagagttgc aggcgggggg gggggggggg 781 ggggggcggg gccgtggctc cggaaggcgg ggtctctctg tgggcgtagc gtgggcgggg 841 ctgtgcgggg cagcccggta aaaaagagcg tggcgggccg cggtctctga gagccatcgg 901 gaagcgaccc tgccagcgag ccaacgcaga cccagagagc ttcggcggag agaaccggga 961 acacgctcgg aaccatggcc cagacacccg cattcaacaa acccaaagta agcaccgagg 1021 ggctccgttg ccagggttct gtcgggctgt cccggggctt agcggggccc acctttggcg 1081 cctttaacct agaagcatgg agtggcaggg ggactcccgc aggcatctcc cctcgaccca 1141 ggccttagct tgcttccggg atgtcgagcg agagacgatg tggcagggag tgtccagaag 1201 ggctccgttg ccagggttct gtcgggctgt cccggggctt agcggggccc acctttggcg 1261 cctttaacct agaagcatgg agtggcaggg ggactcccgc aggcatctcc cctcgaccca 1321 ggccttagct tgcttccggg atgtcgagcg agagacgatg tggcagggag tgtccagaac 1381 ctgggggtgt ctctggtcgg ccttcgggtt cggctgctgt ctatgcgaac ctgggagtgc 1441 ctccagtcgg ccttcgggtt cggctgctgc ctatgccctg tgccctggag gtctcagcct 1501 cgctgtctgc caatgggcat ccagtgcggc ggggctgcac agctgtgtgg gactgggcta 1561 ggacctgggt gtctgagccc cagtagaatg gggcccaggg tctctagctg ttaaatgttc 1621 agtgtatggc tttatactta agtgttatga ttactttctg ggcaacaggt aacctaggtt 1681 tgtgggtgcg cccgtgggaa aatctatgat ccaaaccaga aaaggaaggg atagaggctt 1741 cagggtgcca ggaggaaccc ctacacatac tgaccgtttg gccatatggg tttatttggg 1801 atgaagtttt agcccattga ccccagagga gaacccttta tctgtctttc tgcaagctgt 1861 ggcttcttgg aaacagggag actccaggtc cccaaggcca gatttgcagc ccttacagat 1921 tctgtctagt cagccaggca aattgaactg gtcagcagaa gtgtgggact gagaactcag 1981 ggggagggat cagagacagt cacccttaga cttacccctc caagaaacag atgctgagtg 2041 gggggcgggg tggcagacgt atgaatcccg tgtgcatgtt gtgtcatata tgcgtgcatg 2101 gagggagcgg gagggaagat gggcagtggg cctgtattcc atgcacttac catagggaac 2161 acactctgcc cctctagcta gaggctagaa gggcagggca agtcttccta cccaaccaat 2221 gcctgctgca catcttgtct ggtggctcct gaccacagtt ggtgctctta gacatcaaag 2281 ggtgagtttt cttttgatgg tctgaatt // LOCUS MUSADAM02 207 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 2 (non-coding). ACCESSION M34243 J04767 KEYWORDS adenosine deaminase. SEGMENT 2 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 207) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pre-msg < 1 > 207 adenosine deaminase (ADA) mRNA and introns IVS < 1 78 ADA intron A IVS 141 > 207 ADA intron B BASE COUNT 39 a 58 c 54 g 56 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 1. 1 gctcctcggg ctctgtggtg gcttctgagg tgtcctctgg ctctgtggta tctcacgctc 61 tttttctgtc ccttgcaggt agagttacac gtccacctgg atggagccat caagccagaa 121 accatcttat actttggcaa gtaagtccaa ggacaaccac agaccttccc aggattgcag 181 agcgtgtaca gctcttcttg gggggcc // LOCUS MUSADAM03 382 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 3 (first expressed exon). ACCESSION M34244 J04767 KEYWORDS adenosine deaminase. SEGMENT 3 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 382) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pept 235 + 299 adenosine deaminase (ADA, EC 3.5.4.4), exon 3 (first expressed exon) IVS < 1 176 ADA intron B IVS 300 > 382 ADA intron C BASE COUNT 86 a 117 c 94 g 85 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 2. 1 aacacacaca tgcctgatgc cagcaaagga ggcctgaagg cattggtacc cctggaatta 61 gagttacagc tggtcatggg cctccatgtg ggtctcgtct tctgcaagaa cagccagtgt 121 gctcttaccc accaagccct ggtgcagccc ctcacccttg actttatttt taggaggaag 181 agaggcatcg ccctcccggc agatacagtg gaggagctgc gcaacattat cggcatggac 241 aagcccctct cgctcccagg cttcctggcc aagtttgact actacatgcc tgtgattgcg 301 taagttgctc cccaaccctt gtgccccaca gtagcatcca tccctataac caaggtcagg 361 cctgagctgc tgctgtacaa gg // LOCUS MUSADAM04 346 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 4. ACCESSION M34245 J04767 KEYWORDS adenosine deaminase. SEGMENT 4 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 346) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pept + 93 + 227 adenosine deaminase (ADA), exon 4 IVS < 1 92 ADA intron C IVS 228 > 346 ADA intron D BASE COUNT 77 a 85 c 111 g 73 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 3. 1 acagttgtag ttacctcgtt ggctactaga cgtcccaagg agctgagaaa ggttgccaac 61 ctgtgttctt cttcccttcc caggggctgc agagaggcca tcaagaggat cgcctacgag 121 tttgtggaga tgaaggcaaa ggagggcgtg gtctatgtgg aagtgcgcta tagcccacac 181 ctgctggcca attccaaggt ggacccaatg ccctggaacc agactgagtg agtgacatca 241 ctggaggggg ctgtgctgag cggggctctg agctgaggat ggagtgctta gagccctggc 301 ctggtccatg gactcagagc gactcagctc agtcctaagt gcacga // LOCUS MUSADAM05 385 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 5. ACCESSION M34246 J04767 KEYWORDS adenosine deaminase. SEGMENT 5 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 385) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pept + 115 + 230 adenosine deaminase (ADA), exon 5 IVS < 1 114 ADA intron D IVS 231 > 385 ADA intron E BASE COUNT 84 a 115 c 94 g 92 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 4. 1 tctccatcta gaaatagaag ggcagagaga catcactaca tccctgctcc agttccatgg 61 ctgcccatgg tcttcccttg gcctaaagtc ctccctcttc ctctctccac acagagggga 121 cgtcacccct gatgacgttg tggatcttgt gaaccagggc ctgcaggagg ggaggcaagc 181 atttggcatc aaggtccggt ccattctgtg ctgcatgcgc caccagccca gtgagtaccg 241 ccgcaccctg ctggctgcct ggcctataac aaggtggacc gactatccag cgtccccacc 301 tcgtatttct agagttttct aaaaaacacc tgtgaacttt tggtgactct ggtgagtcct 361 taacaggaaa ttgggacttg cacag // LOCUS MUSADAM06 189 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 6. ACCESSION M34247 J04767 KEYWORDS adenosine deaminase. SEGMENT 6 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 189) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pept + 18 + 145 adenosine deaminase (ADA), exon 6 IVS < 1 17 ADA intron E IVS 146 > 189 ADA intron F BASE COUNT 37 a 39 c 75 g 38 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 5. 1 ggcccgtgcc cctgcaggct ggtcccttga ggtgttggag ctgtgtaaga agtacaatca 61 gaagaccgtg gtggctatgg acttggctgg ggatgagacc attgaaggaa gtagcctctt 121 cccaggccac gtggaagcct atgaggtggg cctgagaagg ggagggtggc cctgggggag 181 cttgggtag // LOCUS MUSADAM07 307 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exons 7 and 8. ACCESSION M34248 J04767 KEYWORDS adenosine deaminase. SEGMENT 7 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 307) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pept + 11 82 adenosine deaminase (ADA), exon 7 164 + 265 adenosine deaminase, exon 8 IVS < 1 10 ADA intron F IVS 83 163 ADA intron G IVS 266 > 307 ADA intron H BASE COUNT 68 a 86 c 85 g 68 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 6. 1 tcccttccag ggcgcagtaa agaatggcat tcatcggacc gtccacgctg gcgaggtggg 61 ctctcctgag gttgtgcgtg aggtaaggag ccagtgaccc cgggcctctt cttcctgatt 121 ctgttcctgt ccctggactc acctcctctc tgcttctcca caggctgtgg acatcctcaa 181 gacagagagg gtgggacatg gttatcacac catcgaggat gaagctctct acaacagact 241 actgaaagaa aacatgcact ttgaggtgag acgccaaggc agagagagtg agctctggct 301 accccgt // LOCUS MUSADAM08 249 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 9. ACCESSION M34249 J04767 KEYWORDS adenosine deaminase. SEGMENT 8 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 249) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pept + 112 + 176 adenosine deaminase (ADA), exon 9 IVS < 1 111 ADA intron H IVS 177 > 249 ADA intron I BASE COUNT 56 a 77 c 60 g 56 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 7. 1 ctgaggcaat gaagcacaaa gctatccaga atagaacctc agctgggctc agccctgacc 61 agtctggccc cggccactat gccagccagc cacacatcct gccccttgca ggtctgcccc 121 tggtccagct acctcacagg cgcctgggat cccaaaacga cgcatgcggt tgttcggtga 181 gatctggttc cgggacccat tttgttttga ttccggaatt cacctatagt gagtcgtata 241 aattcgtaa // LOCUS MUSADAM09 340 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 10. ACCESSION M34250 J04767 KEYWORDS adenosine deaminase. SEGMENT 9 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 340) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pept + 141 + 270 adenosine deaminase (ADA), exon 10 IVS < 1 140 ADA intron I IVS 271 > 340 ADA intron J BASE COUNT 81 a 77 c 92 g 90 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 8. 1 ttaatacgag aatgcaaccc tttgtgttgt ctaaggttgt ataaagatgg aagagggagg 61 tggtggaagg gcagtgatgg ttcttggagt gaagaggctc tctctctctc tcttttcttc 121 ctgcctggcc cctcccccag cttcaagaat gataaggcca actactcact caacacagac 181 gaccccctca tcttcaagtc caccctagac actgactacc agatgaccaa gaaagacatg 241 ggcttcactg aggaggagtt caagcgactg gtgagtatgt gtgagctatg agcctgacac 301 tggcccaggt gtgtgtgtgt gtgtatatgt gtgtgtgtgt // LOCUS MUSADAM10 279 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 11. ACCESSION M34251 J04767 KEYWORDS adenosine deaminase. SEGMENT 10 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 279) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pept + 88 171 adenosine deaminase (ADA), exon 11 IVS < 1 87 ADA intron J IVS 188 > 279 ADA intron K BASE COUNT 67 a 88 c 61 g 63 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 9. 1 ggatctgttt cccccactat gatgcccttg cccttgctaa cagggctgct tccttccttg 61 tcctgactcc atgtttcccc cttctagaac atcaacgcag cgaagtcaag cttcctccca 121 gaggaagaga agaaggaact tctggaacgg ctctacagag aataccaata gccaccacag 181 actgacggta cgcttgtgca gggcgcaata accaccccac cacactgtcc tccttaactc 241 tgtgcgattg tggcagaagt cttgggcagg agcacacct // LOCUS MUSADAM11 442 bp ds-DNA ROD 10-JUL-1990 DEFINITION Mouse adenosine deaminase (ADA) gene, exon 12 (non-coding). ACCESSION M34252 J04767 KEYWORDS adenosine deaminase. SEGMENT 11 of 11 SOURCE Mouse lung fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 442) AUTHORS Al-Ubaidi,M.R., Ramamurthy,V., Maa,M.-C., Ingolia,D.E., Chinsky,J.M., Martin,B.D. and Kellems,R.E. TITLE Structural and functional analysis of the murine adenosine deaminase gene JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Al-Ubaidi, 11-MAY-1990. FEATURES from to/span description pre-msg < 1 > 287 adenosine deaminase (ADA) mRNA and introns IVS < 1 69 ADA intron K signal 282 287 poly-A signal BASE COUNT 97 a 111 c 114 g 120 t ORIGIN Chromosome 2; undetermined number of base pairs after segment 10. 1 ttctgtgctt ctaccatgcc ttacatgtca tgagacctga cctttctatt tctctgactt 61 gaccagcagg gcgggtcccc tgaagatggc aaggccactt ctctgagcct catcctgtgg 121 ataaagtctt tacaactctg acatattgac cttcattcct tccagacctt ggagaggcca 181 ggtctgtcct ctgattggat atcctggcta ggtcccaggg gacttgacaa tcatgcacat 241 gaattgaaaa ccttccttct aaagctaaaa ttatggtgtt caataaagca gctggtgact 301 ggtatcttgc agcacatggt gaatacggtc tcggggctgc tggctaggat gctaagaaag 361 gaggagcctg ggccctacgc tgagtgtcag gtctggggag ctagggtctc ttccgcaggt 421 cgactctaga gatccccggg ct // LOCUS TRBMVAT5A 1664 bp ss-mRNA INV 10-JUL-1990 DEFINITION T.brucei MVAT5-like variant surface glycoprotein mRNA, complete cds. ACCESSION M33825 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1664) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. 169, 730-736 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. FEATURES from to/span description pept 43 1596 MVAT5-like variant surface glycoprotein BASE COUNT 533 a 435 c 395 g 301 t ORIGIN 1 tttctgtact atattgcaga agcaacactg agaactccac agatgatagg aaaagccttt 61 attattttat ctttacttaa cgagctgcca acgccgacgg cagcacaagc ggcacagggt 121 ggtgccctcg gaaaagacgt atggctacct ctcgctaaat tcacggcgac ggccgcgaaa 181 atcccaggca gggcggcaaa gctgcttcaa gacaggtcgg cccaaatagt taaccttatg 241 aaactccaag ttcaggcaga catatgcctc aacaaagcag cgtcagaggt gagcgcactt 301 gggtggcagg cgctcgctgt tgcaatagca gcagacatcg gcagcctgca aagcttgcaa 361 cagcagagga gtgaagaggc aatagcggcc gcggcagctg ccgaattcgc tcggggccac 421 gcagcggaat tcttcaaagt agctgcggca gtccaaagcg ccgccaatag cggctgcctg 481 acaacaaaca ataaaggtgg cgcagccggc agcgtgataa acggattctc gacactcggc 541 accgcggagc agccagcaat cggcgctaca tcgacggctc acgtcggcga cgacataacg 601 gcgataacaa caacagggtt cagcgaccta gcagcaacag acggcatacg caccgactca 661 ctaacagcgg acacaaactg cgttcttttc aagggaggca gcgatggacc actaacgaca 721 gcaaacttcg gccagtcgat ccctttcgca ggcggctatc taacaaggaa cccgacagcc 781 aacacagcca gcagcgccga cggtacggac tttgtaagca accccgaaga cagcaagata 841 gcaggcataa aagtctacag ggacgcccac gccgccgcag cgaaaatacg cacagcggca 901 accttcggct cgaccttcac cgacttcaag aagctggacc aggctaagaa gtcagtccat 961 ttgcgcgcag cagtaaagaa cataattctc ggcaaacctg acggatccgt agacgacctt 1021 tccggcgaaa tagacacaaa gataaaccag gtattcggcg aggaccaaga aacattccac 1081 agcaggtttt gggatcaact aacaaaagta aaagtggaaa aggcggcgag tggacaagaa 1141 gaaacgacac tcgatgcaat cacttctttt gcagccttaa gccgagctcg gacttattac 1201 tccacgaaag tgatcaaagg tttgagagat aagatatcct cactagaaat taaaaattcc 1261 aaaacggaag ttaaagtcac tgacgccgac tgcaacaaac accaatcaaa agacaaatgc 1321 gcagccccat gcaaatggaa cgagaatacc actgacataa acaaaaaatg ctcattagat 1381 cccgtaaaag cgacagaaca gcaagcagcc cagacagcag gagcaggaga aggagctgca 1441 ggaacaacaa cagataaatg caaagataag aaaaaggatg actgcaaatc tccggactgc 1501 aaatgggagg gtgaaacttg caaagattcc tctattctcc taaacaaaca attcgcccta 1561 atggtttctg cagcctttgt ggccttgctt ttttaatttt ttccccctct ttttcttaaa 1621 gaatttttgc tactttaaaa acttctgata tattttaaca ccta // LOCUS TRBWRATATA 1544 bp ss-mRNA INV 10-JUL-1990 DEFINITION T.brucei WRATat A variant surface glycoprotein mRNA, complete cds. ACCESSION M33823 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1544) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. 169, 730-736 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. FEATURES from to/span description pept 42 1457 WRATat A variant surface glycoprotein mRNA < 1 1544 WRATat A mRNA BASE COUNT 545 a 387 c 345 g 267 t ORIGIN 1 gaacagtttc tgtactatat tgcggacaaa tctagaaggc catgtccgtt ctgtttctgc 61 tcctagcaat aacacgaaca gcctcggtga aagcagcgga aggagaccag gcggctgatt 121 ttttgccttt atgcgaagcc tggcaggcaa ctaaagcgct agcaaatgcg gcgtataaac 181 tcccgccgtt tccaccagat ctgacagaca tactaaactt taacataact gtggctcccg 241 aggaatggaa agcaatcttt acagatggcg gatctgacaa cacatgggaa agattcgccg 301 aaggacacaa gaatactcta aatggcggca actggaaaac aagatgggaa catatcaagc 361 aagcaaggca agatacaaaa gaagcttcgt caccgtggaa cgcgttaaac agcaaattaa 421 taaacacagc cacagtcaat accaccagag cctacatagc aagcatagca gacgaagcct 481 tcgacctata ccaggggaca cagacacccc tacaaacacc caaagccttg gaagccgcca 541 gcctagcaga agcagcgaaa gcaatacttt gctcagaccc cctaaagcca acagccgacg 601 ggcaggcatg cacagatata acagcgacgc caagcaaagc ggcaacatgc ccaactggac 661 gaagcagcaa gggaggggcg ccaataggac tagatacggt ctgtctctgc tcaacaaaca 721 aaccaagtat gcatagcaga cgacgaaaag cggcagcagt gatgaccgac ggacaactaa 781 aagacggcat cctcaagaaa ttattagcgg cgtgcccaaa aaagccaacc ctaaatgaac 841 cagcagcagc cgcccgccac gcagtaacgg tactcgcaac acggctagct caaaaagttg 901 cgcgcgccga agaaggccaa ataattctcg gaaccagagc cgaaacggac tgcgctagtt 961 cgggatcagc ctgtgtagaa tatactaact ttttcaaaga tggcgatggc ttagcagctg 1021 ttccctgggt gaagaagctg ctggcggcgg cagattttta cgacacaatc gaaaagcgca 1081 aagaaagcga caaaaacgcc gcgacagcaa tagcagccct caaatctgct ttaatcaggg 1141 aatttagaag accaggacaa gaacaaacac tggcaacaac aggaactaaa agcagcagcc 1201 cccaaagcac ccaacaaaaa gcatccgaag ccgaagcaaa ttgcaatgac aaagccaaag 1261 aaactgaatg caactcccca tgcaaatggg ataaggaaga aaaggatgag aaaaaaaggt 1321 gcaagctgag tgaggaaggc aaacaagcag aaaaagaaaa ccaagaaggg aaagatggga 1381 aagcaaacac cacaggaagc agcaattctt ttgtcattaa aacttcccct cttttgcttg 1441 cagttttgct tctttaatcc ctccccctcc ctttaaaatt tttgataaaa atttttgcta 1501 cttgaaaaac tttctcatat attttaacac ctaaaagttt cccg // LOCUS TRBWRATATB 1585 bp ss-mRNA INV 10-JUL-1990 DEFINITION T.brucei WRATat B variant surface glycoprotein mRNA, complete cds. ACCESSION M33824 KEYWORDS variant surface glycoprotein. SOURCE Trypanosoma brucei rhodesiense, cell line WRATat1, cDNA to mRNA. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 1585) AUTHORS Reddy,L.V., Hall,T. and Donelson,J.E. TITLE Sequences of three VSG mRNAs expressed in a mixed population of Trypanosoma brucei rhodesiense JOURNAL Biochem. Biophys. Res. Commun. 169, 730-736 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Donelson, 23-APR-1990. FEATURES from to/span description pept 41 1504 WRATat B variant surface glycoprotein BASE COUNT 535 a 384 c 376 g 290 t ORIGIN 1 aacagtttct gtactatatt gcagtttcgc gttcagctta atgtggataa tcttggcact 61 gctaacttta gctgggtccc gcgtcgccca tggggcaggt aagaatgtca acggcgttga 121 gttcaacctt ttttgtcaca tagctaacat gctaaacgcg gaaaagatcg aagacgacaa 181 aactgatggc ctagaccgcc aagctgccga ggcatggacg gcaatcgaca gcatatttac 241 agtaacagcc aacgaaagct actacagtga aggaccagcc agcgcagcaa atacgaccga 301 cgaaaaccag gatgccaagc cggaacgggt agcaaaatgg gtgcagaaac gcaaccaaat 361 agacaaaatc gcagctcctg gtaatgagaa aaacggaaaa tacgcgcgac gaccaaggga 421 cagaatgtca gcagcaacag gagcgaaact cgatacggtt ttcacactcg cttcggaggc 481 acgagtccga ctaatgcaga tagacacaga gatagcaaca aataaacaag aaatcaggca 541 gcagctagga ctgcattgct cggaggggca aggcaagggt cagagcagaa accagcatcc 601 ggataatgcc gcattcgcaa gcgactactc aactgcgtgc aaaggatcga caggaccagg 661 aaaaagtctt gcgaacgacc tagtatgtat ctgcagcact gacaccagcc aagcccaaag 721 cacactacag atgtgcacga gcatcgacga tgcgaacagc ttattcagta ccctacacaa 781 acgaagccaa tgccaaggcg attttccttg ccctcatcgg gtttgtgcta agacagccga 841 aacaagcgag ctgacggaaa ccaacataaa caactgtgta acggctttta cagcgacact 901 gggcagacat acaaagagtt cggccacaaa tgaaggggcc tatgtctttg ggagcggaca 961 gaacagcggc gacgagtgca acgggggagc agcaacaggg caatcctgtg tcagctatca 1021 cgacctcata acagctaaat ccggtacgac actaagcggc gcaatcactc ggctaaagca 1081 actacaaatc gccaaagcaa agctaaaagc aagacggcta ctgctgcaaa acagggaacg 1141 gcagcaaacg cgacttatgg cgctagcaga caagatgcaa gaattgtacc aagaggcctt 1201 acatgacgag gttcaactca ggaaggaagc gcagaacaaa cctcaagaaa caccagattc 1261 tgacaagcaa aaagcatgcg agaaatatca caacaagtca aaggaatgca aagaaaatgg 1321 ttgccaatgg agtggaactg aagaaaccat aggaaagtgc gaagctaaac ccaaagcagg 1381 aacagaagcc gcaacaacgg gaccaggaga gagagatgca ggagccactg caaacaccac 1441 aggaagcagc aattcttttg tcattaaaac ttcccctctt ttgtttgcat ttttgctttt 1501 ttaatttttc ccctcaaatt tccccctctt ttttaaaatt tttctttcta cttggaaact 1561 tctggtatat tttaacacct ttaaa //
GenBank-Updates@genbank.bio.net (07/11/90)
LOCUS CDCXYNAB 6067 bp ds-DNA BCT 11-JUL-1990 DEFINITION C.saccharolyticum xylanase A (XynA), beta-xylosidase (XynB) and acetyl esterase (XynC) genes, complete cds. ACCESSION M34459 KEYWORDS acetyl esterase; beta-xylosidase; xylanase. SOURCE C.saccharolyticum DNA, clone pNZ1400. ORGANISM Caldocellum saccharolyticum Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 6067) AUTHORS Luethi,E., Love,D.R., McAnulty,J., Wallace,C., Caughey,P.A., Saul,D. and Bergquist,P.L. TITLE Cloning, sequence analysis, and expression of genes encoding xylan-degrading enzymes from the thermophile "Caldocellum saccharolyticum" JOURNAL Appl. Environ. Microbiol. 56, 1017-1024 (1990) STANDARD simple staff_review FEATURES from to/span description pept 195 1223 xylanase/beta-xylosidase (XynA) precursor sigp 195 293 xylanase/beta-xylosidase signal peptide matp 294 1220 xylanase/beta-xylosidae pept 1257 2057 acetyl esterase (XynC) pept 2198 2491 ORF 3 pept 2491 3429 ORF 4 pept 3445 4911 beta-xylosidase (XynB) (3445 could be 3463) pept 5439 > 6067 ORF 6 BASE COUNT 2230 a 787 c 1243 g 1807 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccccgc aaagcctaaa ataagtacat ttagaatgat ggcagaaaat ggttatatta 61 cccttgaatt tacgttaagt aaaaatgctg tggtgctttt tgaggtaagc aaggttgtag 121 atgagtcaga tacttatata ggacttgacg atagtaaaat accaggttat tagttgcttt 181 ataaaataaa aggaatgagg tgtttaattg tgtgcgaaaa tttagagatg ctaaacttat 241 cattagcaaa aacatacaaa gattacttta aaataggtgc tgcagtaact gcgaaagatt 301 tagaaggagt tcatagggat attcttttga agcattttaa tagcctcaca ccagaaaatg 361 ccatgaagtt tgaaaatatt catccagaag agcagagata taattttgaa gaggttgcca 421 ggataaaaga gtttgcaatt aaaaatgaca tgaagttaag aggacataca tttgtttggc 481 ataatcaaac tccggggtgg gtgtttttag ataagaatgg ggaagaagcc tcaaaagagt 541 tagttattga aaggttaaga gagcatataa aaactttgtg tgagagatac aaggatgtag 601 tatatgcgtg ggatgtggtg aacgaagcag tagaagataa aacagaaaag cttttgcgag 661 aatcaaactg gagaaaaatt attggagatg attatattaa aattgctttt gagatagcaa 721 gagaatatgc aggagatgca aagttatttt ataacgatta taacaatgaa atgccttata 781 aattagaaaa aacctacaaa gttctaaaag agcttttaga aagaggtact ccaatagatg 841 gaattggtat acaagcacac tggaatatat gggataaaaa tcttgttagt aatttaaaaa 901 aggctataga agtatatgct tccttaggtt tagaaattca tattacagaa cttgacattt 961 cagtatttga gtttgaagat aagaggactg acttgtttga accaaccccg gaaatgcttg 1021 aactacaagc aaaagtatat gaagatgtat ttgcagtttt tcgagaatat aaagatgtaa 1081 taacttctgt tacattatgg ggtattagcg acagacacac atggaaagat aacttccctg 1141 taaagggtcg aaaagattgg cctctcttat tcgacgtaaa tggaaaacca aaagaagcct 1201 tgtacaggat attaagattt taaagatttt ttaacgaaga aaggggttct tttaatatgg 1261 ctatcatgca aatcaacttt tattcaaaga tgttgaaaaa gaacacaaca attttggcca 1321 ttttacccgt agataaacca gataagaaat tccagaaaga tgttgatagt gaaaatttga 1381 aaaccttata tcttttgcat ggttatgctg gtaactacat ggattggttg tgtggagccc 1441 gaattgttga attatcaatg cgatataatg ttgctgtgtt tctgccatca ggtgaaaata 1501 gtttttattt agatgatgaa gaaaaggaag aatattttgg tgaatttgtg ggaaatgaaa 1561 ttatagaatt tacaagaagc gtttttccta ttcctcaaaa aagggaaaaa acttttattg 1621 gcggtttatc aatgggaggt tacggtgctc ttagaaatgg gcttaaatat aacaagaatt 1681 ttgtaggtat aatagcttta tcatcagcac taataattca taagattgca ggtattccta 1741 aggattatag gaatgcttat gcaagttata actattatag acgagtgttt ggagacctaa 1801 actctttaat aggtagcgat aaagacataa atgccttagt tactaagcta aaacaagaaa 1861 aaggtagtat tccaaaaata tacatggcat gcggcagaga tgacttttta gttcaagaaa 1921 acagagattt atttaatttt ttgaaaaatg aaggtataga cgtggtttat gaggaagacg 1981 aaggtggaca tgactgggat ttttggaaca aatatattgc aaatgctttt gagtggatga 2041 gtaaggtttc tgattaagtc ttcacgtacc ctgttttaag ttttacaaat agatttgtgg 2101 ggtgaatagg tttttttaac actattttat taaggaagag gatgaaaaat aaaaaaagtg 2161 gacaaatttc ttgttaattg taattacatg cattgcaatg gttttctttt ttacatcgtg 2221 tactattcag tctgctatag agcagaagaa aactgttgag gaaatcttgg gaaaaatagg 2281 tgagagtgag gacaaaacaa attcaagggg gcaaccagca acaatgaaag aggatgaagt 2341 tgaagataat cctttaaaag atgtatataa agattatttc ctggttggag cagcaattaa 2401 tggctattct gttgaaactg ctgctatcaa tcatcctggt atggctgcaa ttttgaaaaa 2461 aactttaaca gtacaaccct atctaatttg atgaaacaac aatacctttt agattatgaa 2521 gctacaaaag caagtaaaaa tggaatgcca gtgtgtaaat ttgacagctg cattcctgct 2581 ttacaatttt gtaaggaaaa tggcataaaa atgagaggac atgtgttagt atggcataat 2641 cagacaccag aatggttttt ccacaaagac tatgatgtat cgaaaccact tgtagatgct 2701 gctactatgg aacgccggtt ggaaagttat atcaaacagg taattgaatt ttgtcaaaaa 2761 aattatcccg gtgtagtcta ttgctgggat gttgttaacg aagctatact tgatgatggt 2821 tcatggagag aaatcaataa taattggtat accattatga aagaaaagta tgtggaaaag 2881 gcattttatt atgcaagaaa atatgccaaa aaagatgttg ccctgtttta caatgattac 2941 aatgtttttc tccctgcaaa gagagaagca atttataatc ttgctcagaa acttaaagaa 3001 aaaggattga ttgacgggtt gggtcttcaa cctacagtag gcttgaatta tcctgaatta 3061 gattctgatg atatagattc attcaaaacg acattagaaa catttgcaaa acttggctta 3121 caaattcata ttactgagtt aaattttgaa ataaagggag atgagagcaa tcgtactcct 3181 gaaaatctca aaaaacaagc agataggtat tacgaaatga tgaagttatt attgaaggaa 3241 gatactgata atggtgggcc ttgcaacata acttgtgtta ctgtttttgg tatctgtgac 3301 gattatccac tatataaaaa ttttaagcag tgcatgtatc tttgggataa aaattgcaat 3361 cctaaaccat gtttttattc atttctccaa gcaggtttag actggaaagc atctttatta 3421 agcaaataag aatgaacaac acttatggag aggaggaaaa taatgaaaat aactattaat 3481 tatggaaaga gacttgggaa aataaacaaa ttttgggcaa aatgtgttgg aagctgtcat 3541 gctacaactg cgttaagaga agactggcga aagcaattaa aaaaatgtcg tgacgaactt 3601 ggttttgagt atattcgatt tcatggttgg ttgaatgatg atatgagtgt ttgttttaga 3661 aatgatgatg ggctactttc attctcattc ttcaacatag attctataat tgattttctt 3721 ttggagatag gtatgaaacc atttattgaa ctgagcttta tgccagaagc gttagcgtca 3781 ggtacaaaga cagttttcca ttacaaagga aatataacac cgccgaaatc ttatgaagaa 3841 tggggtcagc tgattgagga gttagcaagg catcttatta gcagatatgg gaaaaatgaa 3901 gtaagagaat ggttttttga ggtatggaac gaaccaaatc taaaggattt cttctgggca 3961 ggaacaatgg aagaatattt taagctttac aaatatgctg cttttgcaat aaagaaagtg 4021 gactctgaac taagggtagg tggaccagct actgcaatcg atgcatggat acctgaacta 4081 aaagattttt gtacaaaaaa tggtgttcca atagatttta tttcaacgca tcaatatcca 4141 acagatttag cattcagtac aagctcaaat atggaagagg ctatggcaaa agcaaagaga 4201 ggtgaattag cagagagggt aaaaaaggct ttagaggaag catatccatt gcctgtttac 4261 tacactgaat ggaataactc tccaagtcct cgagacccat atcacgacat accttacgat 4321 gctgctttta ttgtaaaaac aataattgac attatagatt taccacttgg gtgttattct 4381 tattggacat ttacagatat ctttgaagaa tgtggacaga gttctttacc ttttcatggg 4441 ggattcgggc ttctaaatat tcatggtata ccaaaaccat cctatagagc atttcaaatt 4501 ttagataaac taaacggtga gaggattgag atagagtttg aagataaaag cccaaccatt 4561 gattgtatag ctgtccagaa tgagagagag ataatacttg tgatctcaaa ccataatgtt 4621 ccgctgtctc ctattgatac cgaaaatata aaagttgttt taaaaggtat tgagaattgc 4681 cgagaagttt ttgttgagag aatagatgaa tataatgcca atccaaaaag agtatggctt 4741 gaaatgggca gtcctgcgta tctcaataga gaacagattg aggagttgat aaaagcatca 4801 gaactaaaga aagagaaagt ttcatggggg attgtgaata ataatgaaat tacatttgat 4861 ttaagtgttt tacctcactc agttgtggct gttacaatta agaatggtta gtgaaatgtt 4921 aagagagaaa agcaattttg tatatctctt ttaattttta cctttgacac atcaaacaat 4981 ctaaattaaa attaaagtat agtgttttgc atactcaaca tagtataaat tatataaggg 5041 taacattaat accctttttg tttttgtaag ggggtgtttt tgtggcaaag cacacgcaaa 5101 aaggtaaatc agctgccaca gccgccgtgt cagacaaaga aaaagcaagg tttgttccta 5161 aaaatattca agctgagata aaagaaaaga ttaaagacac tggtgaaaaa gtagcaaagg 5221 ctgagggtaa ggacaaagca cttttacagt taaagctgga gagcaacaaa aaggttgata 5281 agaaaaaatt caaaaaggat agaagtgttg agaggaataa aacttcatta aatagatttt 5341 taagtttaga taaaattaaa tccctatatt caaaagagat acataataaa ctttcacaca 5401 tctttgaaga tgcagtttct gaggtttata gaattttaat ggggctaaag tatatcaaaa 5461 aggcgccaaa ttacaccgaa attgttctga aggcaaagat attttcaacc ttgattttga 5521 tgattgtaat attattttta atcaacaaaa tgccttctac atacaaaaaa gcgtatgcag 5581 ttgttttgaa caatcagatt gtagggtatg tgaaggacaa gactgaagca caaaaccttc 5641 ttacccagat taaaaaagaa gtagaggaaa gacacaatac agacagtttc attttacaaa 5701 gtaagcttca actaaagagc attgagcctg gtcaatatcg tgagacaagg gttgatgagc 5761 tgaaaaatac tatcatagaa aaggggaagg tccttgtaaa aaggtatgct atttttgtta 5821 attcaaaacc atattttgta tttgaaaatc cacaaactcc aaataatatt cttaacaagc 5881 taaaaaaggt ctattataat gacaaggcat cacaggcaaa attcttagag aaggtagaaa 5941 taaaaccagt ttatgtctca ccagctatta aagtagctga tgaagctact gccttaacaa 6001 agattatgtt tgggaaagac caggtaatag aatatacagt caaggaagga gatactcttt 6061 gggatcc // LOCUS PFAAMA1 2307 bp ds-DNA INV 11-JUL-1990 DEFINITION P.fragile apical membrane antigen 1 (AMA1/AG352) gene, complete cds. ACCESSION M29898 KEYWORDS apical membrane antigen. SOURCE P.fragile (Nilgiri strain) DNA, from Macaca mulatta, clone AG352VATV1. ORGANISM Plasmodium fragile Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 2250) AUTHORS Peterson,M.G., Nguyen-Dinh,P., Marshall,V.M., Elliott,J.F., Collins,W.E., Anders,R.F. and Kemp,D.J. TITLE Apical membrane antigen of Plasmodium fragile JOURNAL Mol. Biochem. Parasitol. 39, 279-284 (1990) STANDARD full staff_review REFERENCE 2 (bases 2251 to 2307) AUTHORS Peterson,M.G., Nguyen-Dinh,P., Marshall,V.M., Elliott,J.F., Collins,W.E., Anders,R.F. and Kemp,D.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.Marshall, 15-NOV-1989. FEATURES from to/span description pept 275 1963 apical membrane antigen 1 (AMA1/AG352) precursor sigp 284 322 put. apical membrane antigen 1 signal peptide matp 323 1960 apical membrane antigen 1 BASE COUNT 794 a 445 c 499 g 569 t ORIGIN 1 taagttccct ttctacaccc ggatgcctcc tagagcaaat aggagtttca agcgtttaca 61 tgtaatttac caagcgtttg taattttgca actttgcaat ttttctactg cgcaagtttg 121 taaccgtgaa gctgctcacc tgtgtgacgg ccaattttta ccaacggtta aacctgttag 181 tggctatttt tttctcgccc ccctcctgat tgatgtgcag agggagagaa ccaaatagct 241 gcctttttct tgagtcacaa tttaacaaca caatatgaat aaaatatact gcatactgtt 301 tttaagtgcc cagtgccttg tgcacatggg taagtgcgag ccaaaccaga agccgagcag 361 gctgacccgc agcgctaaaa acgttttgtt ggaacaggag cctatggttg agagaagtac 421 acgaatgagt aacccatgga aagcattcat ggaaaagtac gatatcgaaa aaacacacag 481 ttctggtatt cgagtagatt taggggaaga tgcagaagtg ggaaattcca gctatagaat 541 accagcagga aaatgtcctg tttttggaaa gggtatcgtt atacagaatt ctgaggttag 601 tttcttaaca cctgtagcta caggcaatca aaagttgaag gatggaggtt tcgcctttcc 661 acaagcaaat gatcatattt cccctatatc cataaaaaac cttagagaaa ggtataaaga 721 gaatccagat ttgatgaagc taaacgattt agctttgtgt aaaactcatg cagccagctt 781 tgtaatggaa atggataaaa attcgtccta tagacaccca gctgtatatg atgaagataa 841 aaaaatatgt tacatgttgt atttatcagc gcaagaaaat atgggtccaa gatactgtag 901 taaagatgca gaaaataaag atgctatgtt ttgcttcaag ccagataaaa atgaaacatt 961 tgaccatctt gcctatttaa gcaaaaatgt ggttaatgat tggcaaaaca aatgcccccg 1021 taaaaattta ggaaattcta aatttggatt atgggtggat ggaaactgtg aagaaatccc 1081 atacgttcaa gacgtgcagg caaaggatct acgcgaatgt aacagaatcg ttttcgaagc 1141 tagcgcttca gatcaaccaa ctcagtacga agaagaacta accgattatc aaaaaataca 1201 agaaggcttt agacaaaacg atcagggtat gattaaaagt gcttttcttc cagtaggtgc 1261 attcaactcg gacaatttta agagtaaagg aagaggatat aactgggcaa atttcgatac 1321 tgaaaataag gtttgttacc tttttaatgc caaacccact tgcctcatta atgacaaaaa 1381 ctttatcgca acaacagcgt tatctcatcc ccaagaagta gacaatgagt ttccatgcag 1441 catatacaaa gatgaaatgg aaagggaaat gaggaaagaa tcgaggaaca tgagtctgta 1501 caatgttgat aaggcacgga ttgttctgcc aaggatattt atctccaacg ataaggacag 1561 tctcaaatgt ccatgcgcac cagaacacat taccaacagt acctgcaact tttacgtttg 1621 taactgtgta gagaaaaggg cagaaattaa agaaaataac gaagtggcca taaaggaaga 1681 atttaagcaa gattaccaat acgcgcaagg tgaatccaaa aatcagatgc tcctaattat 1741 tatcggaata actggaggtg tgtgtgtggt cgcactggct tccatgtttt acttcaggaa 1801 gaaagctcac aatgataagt atgacaagat ggagcaggca gacgggtacg ggaaacccac 1861 caccaggaaa gacgagatgc tcgaccccga ggcgtccttc tggggtgaag aaaagcgggc 1921 ctcccacacc acccctgtgc tgatggagaa gccttactac tgagcgggga agcaaccgaa 1981 ttggtgaggg cctctttggt cgtaaacaaa gtgggggtgc ctcacaatgc atattttcaa 2041 cccgcgtcat gtaaaaaaga aaaacgagac acacccagct ggccaacaaa ttgcccacaa 2101 gggaggagaa atggagcaag ctaaaattgg gctattgtca tcatcaccag ttaccgagga 2161 aatgaaaaca acaacaaaaa aaaacgtaac acatggtaaa gtaactgatt ggttaagcaa 2221 agccgagtga aaatttaccc cacttgcgat ttaaaagcat gatttgcctc caccaaatgg 2281 acctctccac tattaatatt accggag // LOCUS RICAAMYA 1553 bp ss-mRNA PLN 11-JUL-1990 DEFINITION Rice alpha-amylase mRNA, complete cds, clone pOS103. ACCESSION M24286 KEYWORDS 1,4-alpha-D-glucan glucanohydrolase; alpha-amylase. SOURCE Rice (strain M202), cDNA to mRNA, clone pOS103. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1553) AUTHORS O'Neil,S.D., Kumagai,M.H., Majumdar,A., Huang,N., Sutliff,T.D. and Rodriguez,R.L. TITLE The alpha-amylase genes in Oryza sativa: Characterization of cDNA clones and mRNA expression during seed germination JOURNAL Mol. Gen. Genet. 221, 235-244 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Kumagai, 25-APR-1989. Author address: M.H.Kumagi UC Davis, Dept. of Genetics, Davis, Ca. 95616 FEATURES from to/span description pept 34 1338 alpha-amylase (EC 3.2.1.1) BASE COUNT 340 a 486 c 446 g 281 t ORIGIN 1 atcaatcatc catctccgaa gtgtgtctgc agcatgcagg tgctgaacac catggtgaac 61 aaacacttct tgtccctttc ggtcctcatc gtcctccttg gcctctcctc caacttgaca 121 gccgggcaag tcctgtttca gggattcaac tgggagtcgt ggaaggagaa tggcgggtgg 181 tacaacttcc tgatgggcaa ggtggacgac atcgccgcag ccggcatcac ccacgtctgg 241 ctccctccgc cgtctcactc tgtcggcgag caaggctaca tgcctgggcg gctgtacgat 301 ctggacgcgt ctaagtacgg caacgaggcg cagctcaagt cgctgatcga ggcgttccat 361 ggcaagggcg tccaggtgat cgccgacatc gtcatcaacc accgcacggc ggagcacaag 421 gacggccgcg gcatctactg cctcttcgag ggcgggacgc ccgactcccg cctcgactgg 481 ggcccgcaca tgatctgccg cgacgacccc tacggcgatg gcaccggcaa cccggacacc 541 ggcgccgact tcgccgccgc gccggacatc gaccacctca acaagcgcgt ccagcgggag 601 ctcattggct ggctcgactg gctcaagatg gacatcggct tcgacgcgtg gcgcctcgac 661 ttcgccaagg gctactccgc cgacatggca aagatctaca tcgacgccac cgagccgagc 721 ttcgccgtgg ccgagatatg gacgtccatg gcgaacggcg gggacggcaa gccgaactac 781 gaccagaacg cgcaccggca ggagctggtc aactgggtcg atcgtgtcgg cggcgccaac 841 agcaacggca cggcgttcga cttcaccacc aagggcatcc tcaacgtcgc cgtggagggc 901 gagctgtggc gcctccgcgg cgaggacggc aaggcgcccg gcatgatcgg gtggtggccg 961 gccaaggcga cgaccttcgt cgacaaccac gacaccggct cgacgcagca cctgtggccg 1021 ttcccctccg acaaggtcat gcagggctac gcatacatcc tcacccaccc cggcaaccca 1081 tgcatcttct acgaccattt cttcgattgg ggtctcaagg aggagatcga gcgcctggtg 1141 tcaatcagaa accggcaggg gatccacccg gcgagcgagc tgcgcatcat ggaagctgac 1201 agcgatctct acctcgcgga gatcgatggc aaggtgatca caaagattgg accaagatac 1261 gacgtcgaac acctcatccc cgaaggcttc caggtcgtcg cgcacggtga tggctacgca 1321 atctgggaga aaatctgagc gcacgatgac gagactctca gtttagcaga tttaacctgc 1381 gatttttacc ctgaccggta tacgtatata cgtgccggca acgagctgta tccgatccga 1441 attacggatg caattgtcca cgaagtactt cctccgtaaa taaagtagga tcagggacat 1501 acatttgtat ggttttacga ataatgctat gcaataaaat ttgcactgct taa // LOCUS RICAAMYB 1682 bp ss-mRNA PLN 11-JUL-1990 DEFINITION Rice alpha-amylase mRNA, complete cds, clone pOS137. ACCESSION M24287 KEYWORDS 1,4-alpha-D-glucan glucanohydrolase; alpha-amylase. SOURCE Rice (strain M202), cDNA to mRNA, clone pOS137. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1682) AUTHORS O'Neil,S.D., Kumagai,M.H., Majumdar,A., Huang,N., Sutliff,T.D. and Rodriguez,R.L. TITLE The alpha-amylase genes in Oryza sativa: Characterization of cDNA clones and mRNA expression during seed germination JOURNAL Mol. Gen. Genet. 221, 235-244 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Kumagai, 25-APR-1989. Author address: M.H.Kumagi UC Davis, Dept. of Genetics, Davis, Ca. 95616 FEATURES from to/span description pept 78 1382 alpha-amylase (EC 3.2.1.1) BASE COUNT 355 a 491 c 519 g 317 t ORIGIN 1 atccatcatc tacaagagat cgatcagtag tggttagcag caactcacta tcgaacacgg 61 tttcagctta cacagatatg aagaacacca gcagcttgtg tttgctgctc ctcgtggtgc 121 tctgcagctt gacctgtaac tcgggtcaag cacaggtcct cttccagggt ttcaactggg 181 agtcgtggaa gcagcagggt ggctggtaca acatgttgaa aggccaagtc gacgacatcg 241 ccaaggccgg ggtcacccac gtctggctgc cgccgccgtc gcactccgtg gcgcgagggt 301 acatgccggg gcgtctctac gacctggacg cgtccaagta cggcacggcg gcggagctca 361 agtcgctgat cgcggcgttc cacgggaagg gcgtccagtg cgtcgccgac gtcgtgatca 421 accaccggtg cgccgagaag aaggacgccc gcggcgtgta ctgcgtgttc gagggcggga 481 cgcgcgaccg cctcgactgg ggccccggca tgatctgcag cgacgacacg cagtactccg 541 acggcacggg ccaccgcgac accggcgagg ggttcggcgc ggcgcccgac atcgaccacc 601 tcaacccgcg cgtccagcgg gagctcaccg actggctcaa ctggctcaag tccgacgtcg 661 gcttcgacgg ctggcgcctc gacttcgcca agggatactc cacggacatc gctaagatgt 721 acgtcgagag ctgcaagccg ggcttcgtcg tcgccgagat atggaactcg ctgagctaca 781 acggcgacgg caagccggcg gccaaccagg accagggccg gcaggagctg gtgaactggg 841 tgaacgccgt cggcgggccg gcgatgacgt tcgacttcac caccaagggc ctcctgcagg 901 cgggcgtcca gggcgagctg tggcggctgc gcgacggcaa cggcaaggcg cccggcatga 961 tcgggtggct gccagagaag gccgtcacgt tcgtcgacaa ccacgacacc ggctcgacgc 1021 agaagctttg gccgttcccc tccgacaagg tcatgcaggg ctacgcctac atcctcaccc 1081 accccggagt cccctgcatc ttctacgacc acatgttcga ctggaacctg aagcaggaga 1141 taaccgcgct ggcggcgatc agggagagga acggcatcaa cgccgggagc aagctccgga 1201 tcgtcgtcgc cgacgccgac gcatacgtcg ccgtcgtcga cgagaaggtc atggtgaaga 1261 tcgggacgag gtacgacgtg ggcaacgcgg tgccgtcgga tttccatcag acggtgcacg 1321 gcaaggacta cagcgtctgg gagaaggggt ccctccgcgt cccggcgggg cggcacctat 1381 agcgggctca agccctaaac tgaacgggat agtcatgctc aaaccagttt ctacacggca 1441 agaatttact gattcttata ctttttcagt caattaaatt atggttttta tatatgtaat 1501 tttgtatccg attgtagcgt tcgaataagt aggcaggctc tctagcctct aggttaattg 1561 cgggcatatg tagcttgcca gttaattgtg tttgtatcac gcagtttgta accgttggtg 1621 catatatatg tcaggttcag gatgcagtaa aaaatcatac tgcaccgatc agtgagtttt 1681 ta // LOCUS HUMCEAPX 494 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human cell adhesion protein (SQM1) mRNA, complete cds. ACCESSION M33374 KEYWORDS cell adhesion protein. SOURCE Human squamous carcinoma cell line SCC25, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 494) AUTHORS Wong,Y.-C., Tsao,S.-W., Kakefuda,M. and Bernal,S.D. TITLE cDNA cloning of a novel cell adhesion protein expressed in human squamous carcinoma cells JOURNAL Biochem. Biophys. Res. Commun. 166, 984-992 (1990) STANDARD simple staff_review FEATURES from to/span description pept 36 443 cell adhesion protein (SQM1) mRNA < 1 494 SQM1 mRNA BASE COUNT 107 a 154 c 163 g 70 t ORIGIN 1 ccctcggtgc tgcagggatc tgcaggactg cagccatggg ggcgcacctg gtccggcgct 61 acctgggcga tgcttcggtg gagcccgacc ccctgcagat gccaaccttc ccgccagact 121 acggcttccc cgaacgcaag gagcgcgaga tggtggccac acagcaggag atgatggacg 181 cgagtgaggc tcagctgcgg gactactgcg cccaccacct catccggctg ctcaagtgca 241 agcgtgacag cttcccaagt tgctggcctg caagcaggaa gcggcacgac tcgggactac 301 tgcgcaccgc aagctatgtg atgcgcatga aggagtttga gcgggacgag ggctgctcca 361 gcggaagaag cggcgggaga agaaggcggc aaatctgcaa aggccaggga cccggggaag 421 tggaccccaa ggtggccctg taggggtgca ccccccaccc tatggaccag tcaaataaaa 481 ccttcaggcc cctc // LOCUS REOCEAP1 1463 bp ds-RNA VRL 11-JUL-1990 DEFINITION Reovirus sp. (serotype ST1) sigma-1 protein gene, complete cds. ACCESSION M32860 KEYWORDS cell attachment protein; sigma-1 protein. SOURCE Reovirus sp. (serotype ST1), cDNA to viral RNA. ORGANISM Reovirus sp. Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1463) AUTHORS Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K. TITLE Identification of conserved domains in the cell attachment proteins of the three serotypes of Reovirus JOURNAL Virology 174, 399-409 (1990) STANDARD simple staff_review FEATURES from to/span description pept 14 1426 sigma-1 protein BASE COUNT 426 a 291 c 369 g 377 t ORIGIN 1 gctattcgcg cctatggatg catctctcat tacagagata cggaaaatag tactccaact 61 atctgtatca agcaatggct cccagtcaaa agaaatcgag gaaatcaaga aacaagtcca 121 ggtcaacgtt gatgatatca gggctgccaa tattaaactc gacggacttg gaagacagat 181 tgctgacatc agcaatagca tctcaaccat tgagtcaaga ttgggtgaga tggataatcg 241 acttgtgggt atctcgagtc aggtcacgca attatctaac tcagttagcc agaacactca 301 gagcatatcc tcattgggtg acagaatcaa tgctgtcgaa ccacgagttg acagtctgga 361 tacggtcacg tctaatctca ctggacgaac atccactttg gaggcagatg ttggaagctt 421 acggacagaa ctagcagcgc taacaacacg ggtgacaact gaggttacaa ggttagatgg 481 tctaatcaat agtggccaga attcgattgg tgagctatcc acaagactat ccaatgtgga 541 gacgtctatg gtgacgacgg ctggacgggg actgcagaaa aacggaaaca ccttgaacgt 601 cattgtaggt aatggaatgt ggtttaatag ttctaatcaa ttgcagctcg acctttcggg 661 gcaatcaaaa ggggtgggat ttgtcggcac aggaatggtg gttaagattg atactaatta 721 ttttgcttac aatagtaatg gagagattac attggtgagt caaatcaatg aattgccatc 781 gcgcgtatca acactggaat cagcgaaaat cgattcagtt ttacctccat taaccgtacg 841 cgaagcgagc ggcgtacgta ccctgagctt tggttatgat acgagcgatt ttacaatcat 901 caactccgta ctgtcgttac ggtcacgttt gactcttccg acatacaggt accctctgga 961 gctcgacaca gcaaataata gagtgcaggt ggcagatcgt tttggcatgc gcacgggtac 1021 ttggacggga caattgcaat atcagcaccc acaattgagt tggagagcaa atgtcacttt 1081 gaatttgatg aaggtggatg attggttggt gttgagcttt tctcagatga cgactaactc 1141 aataatggca gatgggaaat ttgtgattaa ttttgtgtct gggttatctt ctggatggca 1201 gacgggggat actgaaccat cgtcaactat tgatccattg tctacgacat ttgccgcggt 1261 ccaatttcta aataacggtc aacgcattga tgcgtttagg atcatgggag tatcggaatg 1321 gacggatgga gaattagaga ttaagaatta tggtggcaca tacaccggtc atactcaagt 1381 atattgggct ccgtggacga tcatgtatcc atgcaatgtg aggtgaatct agcgcgaacc 1441 ctcggcacaa ggggtcaatc atc // LOCUS REOCEAP2 1440 bp ss-RNA VRL 11-JUL-1990 DEFINITION Reovirus sp. (serotype ST2) sigma-1 protein gene, complete cds. ACCESSION M32861 KEYWORDS cell attachment protein; sigma-1 protein. SOURCE Reovirus sp. (serotpe ST2), cDNA to viral RNA. ORGANISM Reovirus sp. Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1440) AUTHORS Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K. TITLE Identification of conserved domains in the cell attachment proteins of the three serotypes of reovirus JOURNAL Virology 174, 399-409 (1990) STANDARD simple staff_review FEATURES from to/span description pept 14 1402 sigma-1 protein BASE COUNT 384 a 316 c 381 g 359 t ORIGIN 1 gctattcgca ctcatgtcgg atctagtgca gctcataaga agggagatct tactgttaac 61 tgggaatgga gaatcagcca actcgaaaca cgagatcgag gaaattaaga aacaaattaa 121 agacatctct gctgatgtca acaggatcag taacatcgtt gattcaatcc aaggacaact 181 gggtggatta tctgtacgcg tgtcagccat tgaatcggga gttagtgaga acggcaatcg 241 aattgataga ctcgagcgag atgtctccgg catatcggct agcgttagcg gaatcgattc 301 gcgtttatcc gagctgggtg accgagtcaa tgttgcagaa cagcgaattg gccagttgga 361 tacagtcacg gataatctcc ttgagcgagc atcaagactg gaaactgaag tatcagccat 421 tactaatgac cttggatcat tgaatacgag gctgacgact gaattgaacg atgtccgcca 481 aactattgct gcgatagaca cgcgtctcac gacactggag accgatgccg tgacgtcggt 541 tggtcaaggg cttcagaaga ctgggaactc gattaaggtt attgtgggta cggggatgtg 601 gttcgaccgc aataatgttc tgcagttatt cttatcgaac cagcagaaag ggttgggatt 661 catagacaat ggaatggtag tgaaaataga tacccagtat ttcagcttcg atagcaatgg 721 caacataact ctgaacaaca acataagtgg tctgccggcg cgaacaggtt ccctcgaggc 781 atctcgtatc gatgtggtag cgccaccgct tgtgatacag tctactggta gcactcggct 841 actgcgtctc atgtacgagg ctgtggactt cgtggttact aacaacgttc tcacactgag 901 aaatcgatcg gtcacgccaa cattcaagtt tcctctggag ttgaatagtg ctgataactc 961 agtgagcatt catagaaatt accgcattag acttgggcaa tggtcaggtc aattggaata 1021 tcacacgccg agtttgcgtt ggaatgctcc cgtcacggtt aatttgatgc gagtagacga 1081 ttggctcatt ttgagtttta ctcggttttc gacgagcggc atcttagcgt caggaaagtt 1141 tgtattgaac ttcgtaactg gtttgtctcc agggtgggcg actgggagta ccgagccctc 1201 gacaactact aacccactgt caacgacgtt tgctgcaatt cagttcatca atgggtcatc 1261 tcgcgtagac gcctttagaa tcttgggagt cgcagagtgg aatgccgggg aactagagat 1321 cacgaatcat ggcggaacat atacagcgca taccaatgtc gactgggcgc cgatgaccat 1381 tatgtaccca tgtctgggct gaggatccgg gtgctccact cggcacagtg gcgactcatc // LOCUS REOCEAP3 1416 bp ss-RNA BAD 11-JUL-1990 DEFINITION Reovirus sp. (serotype ST3) sigma-1 protein gene, complete cds. ACCESSION M32862 KEYWORDS cell attachment protein; sigma-1 protein. SOURCE Reovirus sp. (serotype ST3) viral DNA. ORGANISM Reovirus sp. Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1416) AUTHORS Duncan,R., Horne,D., Cashdollar,L.W., Joklik,W.K. and Lee,P.W.K. TITLE Identification of conserved domains in the cell attachment proteins of the three serotypes of reovirus JOURNAL Virology 174, 399-409 (1990) STANDARD simple staff_review COMMENT Secondary reference. Please see: Proc. Natl. Acad. Sci. U.S.A. 82, 24-28 (1985), accession m10262. FEATURES from to/span description pept 13 1380 sigma-1 protein BASE COUNT 376 a 301 c 365 g 374 t ORIGIN 1 gctattggtc ggatggatcc tcgcctacgt gaagaagtag tacggctgat aatcgcatta 61 acgagtgata atggagcatc actgtcaaaa gggcttgaat caagggtctc ggcgctcgag 121 aagacgtctc aaatacactc tgatactatc ctccggatca cccagggact cgatgatgca 181 aacaaacgaa tcatcgctct tgagcaaagt cgggatgact tggttgcatc agtcagtgat 241 gctcaacttg caatctccag attggaaagc tctatcggag ccctccaaac agttgtcaat 301 ggacttgatt cgagtgttac ccagttgggt gctcgagtgg gacaacttga gacaggactt 361 gcagacgtac gcgttgatca cgacaatctc gttgcgagag tggatactgc agaacgtaac 421 attggatcat tgaccactga gctatcaact ctgacgttac gagtaacatc catacaagcg 481 gatttcgaat ctaggatatc cacgttagag cgcacggcgg tcactagcgc gggagctccc 541 ctctcaatcc gtaataaccg tatgaccatg ggattaaatg atggactcac gttgtcaggg 601 aataatctcg ccatccgatt gccaggaaat acgggtctga atattcaaaa tggtggactt 661 cagtttcgat ttaatactga tcaattccag atagttaata ataacttgac tctcaagacg 721 actgtgtttg attctatcaa ctcaaggata ggcgcaactg agcaaagtta cgtggcgtcg 781 gcagtgactc ccttgagatt aaacagtagc acgaaggtgc tggatatgct aatagacagt 841 tcaacacttg aaattaattc tagtggacag ctaactgtta gatcgacatc cccgaatttg 901 aggtatccga tagctgatgt tagcggcggt atcggaatga gtccaaatta taggtttagg 961 cagagcatgt ggataggaat tgtctcctat tctggtagtg ggctgaattg gagggtacag 1021 gtgaactccg acatttttat tgtagatgat tacatacata tatgtcttcc agcttttgac 1081 ggtttctcta tagctgacgg tggagatcta tcgttgaact ttgttaccgg attgttacca 1141 ccgttactta caggagacac tgagcccgct tttcataatg acgtggtcac atatggagca 1201 cagactgtag ctatagggtt gtcgtcgggt ggtgcgcctc agtatatgag taagaatctg 1261 tgggtggagc agtggcagga tggagtactt cggttacgtg ttgagggggg tggctcaatt 1321 acgcactcaa acagtaagtg gcctgccatg accgtttcgt acccgcgtag tttcacgtga 1381 ggatcagacc accccgcggc actggggcat ttcatc // LOCUS RATGLYSN 2386 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Rat glycogen synthase mRNA, complete cds. ACCESSION J05446 KEYWORDS UDP glucose:glycogen 4-alpha-D-glucosyltransferase; glycogen synthase. SOURCE Rat adult liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2386) AUTHORS Bai,G., Zhang,Z., Werner,R., Nuttall,F.Q., Tan,A.W.H. and Lee,E.Y.C. TITLE The primary structure of rat liver glycogen synthase deduced by cDNA cloning: Absence of phosphorylation sites 1a and 1b JOURNAL J. Biol. Chem. 265, 7843-7848 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Y.C.Lee, 16-MAR-1990. FEATURES from to/span description pept 46 2157 glycogen synthase (EC 2.4.1.11) mRNA < 1 2386 glycogen synthase mRNA signal 2365 2370 poly-A signal BASE COUNT 657 a 581 c 579 g 569 t ORIGIN 1 ctgcaaccgg tccccttcgg caccagacac acagctggac gaagaatgct caggggccgc 61 tccttgtctg tgacgtccct tggtgggctc cctgcatggg aagctgaaag actccccgtg 121 gaagacttat tgctttttga agtttcctgg gaagtgacca acaaagttgg gggcatctgt 181 actgtgatcc agagcaaagc caaaaccaca gccaatgaat ggggagagaa ttacttcctg 241 ataggtccgt attttgagca taatgtgaag actcaagtag agccatgcag gcccgccaac 301 gatgccgtca gaaaagctgt ggatgccatg aacaaacatg gctgccaggt gcattttgga 361 agatggctga tagaagggag tccgtatgtg gtgctttttg acatcagctc ctcagtgtgg 421 aacctggaca ggtggaaggg agacttctgg gaagcatgtg gcgttggcat ccctcacgac 481 gaccgagaag ccaatgacat gctcatattt gggtctttaa ctgcctggtt cttaaaggag 541 gtgacggacc atgcagacgg taaacacgtc attgcccaat tccatgaatg gcaggctgga 601 actgggctga tcctttctcg tgccaggaaa ctccccatcg ccacaatatt tacaacccat 661 gccacactgc tggggcggta tctctgtgca gcaaatattg acttctacaa ccagcttgat 721 aagttcaaca tagacaaaga ggccggggag aggcagattt atcaccgcta ctgcatggag 781 cgggcttccg tgcactgtgc gcacgtgttt accacagtgt cagaaatcac agccatcgag 841 gcggacgaca tgctgaagag gaagcctgat gtggtgactc caaacggctt gaacgttaag 901 aagttttctg cggtgcacga atttcaaaat ctccatgcca catacaaggc caggatacag 961 gattttgttc gaggtcattt ctatggccac ctggacttcg atcttgaaaa gacgttattt 1021 cttttcattg ctgggaggta tgagttctcc aacaagggag cagacatctt cctagaatcc 1081 ttatccaggc tcaatttcct cctaaggatg cataagagta acgtcactgt ggtagtgttt 1141 ttcatcatgc ctgccaagac aaacaatttc aacgtggaaa ccctgaaggg ccaggcggtg 1201 cggaaacagc tgtgggacac tgtgcactgt atgaaggaaa agtttggcaa gaaactctac 1261 gatgggttat taagaggaga aatacccgac atgaatagta ttttggatcg agatgactta 1321 acaattatga aaagagccat tttttcaact cagagacact ctttgcctcc tgtgaccact 1381 cacaatatga tcgacgattc cacggatccc atcctcagca ccattcgacg aattggactt 1441 ttcaacaatc gcacagacag agtcaaggtg attttacacc cagaattcct gtcctccacc 1501 agccccctac taccaatgga ttatgaagag tttgtccgag gctgtcacct tggggtattt 1561 ccatcatact atgagccctg gggttacacg ccagccgaat gcacagtgat gggcatcccc 1621 agtgtgacta cgaacctctc tggtttcggg tgtttcatgc aggagcatgt ggctgaccct 1681 accgcgtacg gtatttatat cgtcgacagc gtccgctctc cagatgattc ttgcaaccag 1741 ctgactcagt ttctctatgg gttctgtaaa cagtcccgcc gccaaagaat catccagagg 1801 aaccgcaccg agaggctctc agatcttctg gactggagat acctgggcag atattaccag 1861 catgccagac atctgacact gagcagggct tttccagaca aattctacct ggagcccaca 1921 tccccaccaa cgacggatgg ctttaagtat cccaggccct cctcagtacc accttcccca 1981 tcaggatccc agacttcaag tcctcagagc agcgatgtgg aaaacgaagg ggatgaggat 2041 gagagatatg atgaggaaga ggaggctgag agggaccggc taaacatcaa gtcaccattt 2101 tccctgaacc acatcccaaa ggggaagaaa aagcttcatg gagaatataa gaactgagct 2161 caaatgaaat gattccaaat ccacaagaaa atgagctgag cccaagtcca tccctgatgc 2221 ataccgacag atatttacag aatgacgtcg gaaatctaga atctgtgtcc agatcactga 2281 tagtaacttg tagccaccga catgtgtcac cgtactgtga tggtactttt gttgtctaat 2341 tggaaatttc aatctgttat tgataataaa ttaccaaatc taaatg // LOCUS RABCYP2C16 2006 bp ss-mRNA MAM 11-JUL-1990 DEFINITION Rabbit cytochrome P450IIC16 (CYP2C16) mRNA, complete cds. ACCESSION M29968 KEYWORDS cytochrome P450; monooxygenase. SOURCE Rabbit (strain New Zealand White) adult liver, cDNA to mRNA. ORGANISM Oryctolagus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 2006) AUTHORS Hassett,C. and Omiecinski,C.J. TITLE Sequence and gene expression of rabbit cytochrome P450 IIC16: Comparison ti highly related family members JOURNAL Nucleic Acids Res. 18, 1429-1434 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Hassett, 20-NOV-1989. Author Address [1]: C.Hasset University of Washington Department of Environmental Health SC-34 Seattle, WA 98195 FEATURES from to/span description pept 66 1529 cytochrome P450IIC16 (CYP2C16) mRNA < 1 2006 CYP2C16 mRNA signal 1985 1989 poly-A signal BASE COUNT 524 a 474 c 472 g 536 t ORIGIN 1 cggcatcggt accaaataag atagacagtg ctactcagaa atccaagaaa atggtggaag 61 aaataatgga tccagttgtg gtcctggtgt tgggtctctg ctgtttgctt ctcctttcac 121 actggaagca aaattccggg agggggaagc tccctcccgg ccccactcct ttccccatta 181 ttggaaatat tctccagata gatgctaagg acatcagcaa atccctaact aagttctcag 241 aacgctatgg ccccgtgttc actgtgtatc tgggcatgaa gcccgctgta gtgctgcatg 301 gataccaggc agtgaaggag gccctggttg atcttggaga ggagtttgct ggaagaggca 361 gttttcctat gcttgataaa gttagtaagg gactcggaat cgttttcacc aatggaaaga 421 gatggaaaga gatccggcgc ttctcgctca tgaccctgcg gaatttcggg atggggaaga 481 ggagcattga ggaccgagtt caagaggagg cccgctgcct ggtggaggag ctgagaaaaa 541 ccaacgcctc accctgtgat cccaccttta tcctgggctg tgctccctgc aatgtgatct 601 gctccattat tttccataat cgctttgatt ataaagatga ggagtttctt aaactattgg 661 aaaaattcaa tgaaaatgtt aggattctga gttctccatg gttgcaggtc tgcaataatt 721 tccctgctct tattgattac ttaccaggaa gtcataagac cttactaaag aattctgatt 781 atgtgaaaaa ttttattatg gagaaagtga aggaacacca aaaattcctg gatgttaaca 841 atcctcggga ctttatagat tgtttcttga tcaaaatgga gcaggaaaac catttggagt 901 tcactcttga aagcttggta accactgtgt ttgatttgtt tggagctggg actgagacaa 961 cgagcacaac gctgagatac tccctcctgc tcctgctgaa gcaccccgag gtcgcagata 1021 aagtgcagga ggagattgag cgtgtgattg gcaggcaccg gagcccctgc atgcaggaca 1081 ggagccgcat gccttacaca gatgccgtaa tacatgagat ccagagattc attgacctgg 1141 tccccaataa tctgccccac acagtgaccc gtgacattaa attcagaaac tactttatcc 1201 ccaagggtac ggacatcatg acatcactga catccgtgct acatgatgaa aaagcatttc 1261 ctaacccaaa ggtatttgac cctggacact ttctggatga gagtggcaac ttcaagaaga 1321 gtgactactt catgcctttc tcagcaggaa aacggatctg tgtgggagag gccctggccc 1381 gcatggagct gtttttgttc ctgacctcca ttttgcagaa ctttaaactg caatctctgg 1441 ttgagccaaa ggacctggac atcactgcag ttctcaatgg atttgtttct gtgccacctt 1501 cgttccagct ctgcttcatt cctgtttgaa aaggagcaga ctggcttcta ctgtgccatc 1561 atttcaaagg cattgcccat caccttactg catttgagac acttctttaa cttttctcac 1621 atcttactat tcccttaaga tctagtgaaa acctaacttc tgtgggtgat cccctgagac 1681 tgcctgccct gaccatgcaa gaggtagaga gggcatggca agccatgctc ctgggaggga 1741 ccccacagcc tggctgctgg caggtggcgg gacccaggca catttctctc cattcctgcc 1801 tgtcaggtaa actgctccta gctgtgtcca aagcccatca agaaagctac cgtaggctat 1861 gtgaccttca agatgattgt aggagcatat cagtaccaat attgcctcta tcctatagaa 1921 ttagtactgc cctgaattag ttacaccctt tctgcctgcc ctttagaaag tgtgcatgct 1981 cattaataaa gtggatgcat tcactg // LOCUS HUMGAPA 4307 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human GTPase-activating protein ras p21 (GAP) mRNA, complete cds. ACCESSION M23379 KEYWORDS GTPase-activating protein. SOURCE Human placenta, cDNA to mRNA, clone 101. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4307) AUTHORS Trahey,M., Wong,G., Halenbeck,R., Rubinfeld,B., Martin,G.A., Ladner,M., Long,C.M., Crosier,W.J., Watt,K., Koths,K. and McCormick,F. TITLE Molecular cloning of two types of GAP complementary DNA from human placenta JOURNAL Science 242, 1697-1700 (1988) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by C.M.Long, 31-MAR-1989. For sequence of clone 16 refer to M23612. FEATURES from to/span description pept 119 3262 ras p21 GTP-ase-activating protein (GAP) BASE COUNT 1280 a 856 c 957 g 1214 t ORIGIN 1 cctcagcctg gggagctgaa ggggagacgc gtctgggtgg ggctgctcgg agcccgggcc 61 tggtggcccc tggggctccc gggcgggcag ggtagggcag agtagagcgg gcttcaacat 121 gatggcggcc gaggccggca gtgaggaggg cggcccggta acagccggag ctggaggagg 181 cggcgcggca gcgggctcca gtgcctatcc cgcagtgtgt cgggtgaaga tacccgcggc 241 cctgcctgtg gcagccgccc cctatcctgg gctggtggag accggagtgg ctggaactct 301 gggtggcgga gccgctttgg ggtcagagtt cctaggagcc gggtctgtgg caggggcact 361 ggggggagct ggactgacag ggggaggtac tgctgctggc gtagctggtg ctgctgctgg 421 cgtggccggt gctgctgttg ctggacctag tggagacatg gctctcacca aactgcccac 481 ttcgttgctt gctgagactc tcgggccagg cggcggtttt ccccctctgc cccctccccc 541 ttacctgccc cctttggggg cgggcctcgg gacagtggac gaaggtgact ctctggatgg 601 accagaatac gaggaggaag aggtggccat accgttgacc gctcctccaa ctaaccagtg 661 gtatcacgga aaacttgaca gaacgatagc agaagaacgc ctcaggcagg cagggaagtc 721 tggcagttat cttataagag agagtgatcg gaggccaggg tcctttgtac tttcatttct 781 tagccagatg aatgttgtca accattttag gattattgct atgtgtggag attactacat 841 tggtggaaga cgtttttctt cactgtcaga cctaataggt tattacagtc atgtttcttg 901 tttgcttaaa ggagaaaaat tactttaccc agttgcacca ccagagccag tagaagatag 961 aaggcgtgta cgagctattc taccttacac aaaagtacca gacactgatg aaataagttt 1021 cttaaaagga gatatgttca ttgttcataa tgaattagaa gatggatgga tgtgggttac 1081 aaatttaaga acagatgaac aaggccttat tgttgaagac ctagtagaag aggtgggccg 1141 ggaagaagat ccacatgaag gaaaaatatg gttccatggg aagatttcca aacaggaagc 1201 ttataattta ctaatgacag ttggtcaagt ctgcagtttt cttgtgaggc cctcagataa 1261 tactcctggc gattattcac tttatttccg gaccaatgaa aatattcagc gatttaaaat 1321 atgtccaacg ccaaacaatc agtttatgat gggaggccgg tattataaca gcattgggga 1381 catcatagat cactatcgaa aagaacagat tgttgaagga tattatctta aggaacctgt 1441 accaatgcag gatcaagaac aagtactcaa tgacacagtg gatggcaagg aaatctataa 1501 taccatccgt cgtaaaacaa aggatgcctt ttataaaaac attgttaaga aaggttatct 1561 tctgaaaaag ggcaaaggaa aacgttggaa aaatttatat tttatcttag agggtagtga 1621 tgcccaactt atttattttg aaagcgaaaa acgagctacc aaaccaaaag gattaataga 1681 tctcagtgta tgttctgtct atgtcgttca tgatagtctc tttggcaggc caaactgttt 1741 tcagatagta gttcagcact ttagtgaaga acattacatc ttttactttg caggagaaac 1801 tccagaacaa gcagaggatt ggatgaaagg tctgcaggca ttttgcaatt tacggaaaag 1861 tagtccaggg acatccaata aacgccttcg tcaggtcagc agccttgttt tacatattga 1921 agaagcccat aaactcccag taaaacattt tactaatcca tattgtaaca tctacctgaa 1981 tagtgtccaa gtagcaaaaa ctcatgcaag ggaagggcaa aacccagtat ggtcagaaga 2041 gtttgtcttt gatgatcttc ctcctgacat caatagattt gaaataactc ttagtaataa 2101 aacaaagaaa agcaaagatc ctgatatctt atttatgcgc tgccagttga gccgattaca 2161 gaaagggcat gccacagatg aatggtttct gctcagctcc catataccat taaaaggtat 2221 tgaaccaggg tccctgcgtg ttcgagcacg atactctatg gaaaaaatca tgccagaaga 2281 agagtacagt gaatttaaag agcttatact gcaaaaggaa cttcatgtag tctatgcttt 2341 atcacatgta tgtggacaag accgaacact actggccagc atcctactga ggatttttct 2401 tcacgaaaag cttgaatcgt tgttgttatg cacactaaat gacagagaaa taagcatgga 2461 agatgaagcc actaccctat ttcgagccac aacacttgca agcaccttga tggagcagta 2521 tatgaaagcc actgctacac agtttgttca tcatgctttg aaagactcta ttttaaagat 2581 aatggaaagc aagcagtctt gtgagttaag tccatcaaag ttagaaaaaa atgaagatgt 2641 gaacactaat ttaacacacc tattgaacat actttcagag cttgtggaga aaatattcat 2701 ggcttcagaa atacttccac cgacattgag atatatttat gggtgtttac agaaatctgt 2761 tcagcataag tggcctacaa ataccaccat gagaacaaga gttgttagtg gttttgtttt 2821 tcttcgactc atctgtcctg ccatcctgaa tccacggatg ttcaatatca tctcagattc 2881 tccatctcct attgctgcaa gaacactgat attagtggct aaatctgtgc agaacttagc 2941 aaatcttgtg gaatttggag ctaaggagcc ctacatggaa ggtgtcaatc cattcatcaa 3001 aagcaacaaa catcgtatga tcatgttttt agatgaactt gggaatgtac ctgaacttcc 3061 ggacactaca gagcattcta gaacggacct gtcccgtgat ttagcagcat tgcatgagat 3121 ttgcgtggct cattcagatg aacttcgaac gctcagtaat gagcgtggtg cacagcagca 3181 cgtattgaaa aagcttctgg ctataacaga actgcttcaa caaaaacaaa accagtatac 3241 aaaaaccaat gatgtcaggt agcagccttc gccccagtgt tctgcatgga ttcagcatgt 3301 ccaacatggt aattcacttc agtttaatgt ctcctttgct cttgccaaaa aatagcacac 3361 ttttccacat tccagtgatg tgtgagctat gcaaacaaaa tccaagattc tgctggtgaa 3421 taactatgcc agcaaccttg taagctatct gtgcaggata tttgcactat ttccacatgg 3481 aatcaatctt taacaacctc tgagccttgg tgtacagacc acctttcaca aaacgaaatg 3541 ctatgactgt atcttgatat ctcgaacttt caaaatatat tttcagtaca cccagttgcc 3601 aaagttttgc tgtctcttag agaaagaact atgaaatcaa ctgacaagaa acacattctt 3661 attgacaatt gtgtataact ggattgcaga ctgttcttac tgtaactact tcctgattag 3721 gaatatgacc atttgactgt tcaatgatta tttgtattta cagtttccag agtttgtcat 3781 tataatagga acaatctttg ctgtatactt ttaaaaaata ctctgctatt tctcttgctg 3841 gaactgttga aagaaaatat atagaatgat ctattgctca tcagctttat tttttaaaca 3901 tacgacttat tttgttgaaa ttgtcaaaga ctgtatttag atctcataat gctttgttaa 3961 atgtttacaa gtaaatagtt tgaattcagt aaatattatt ggttgttgta ttgatcaatg 4021 catgttaccc attcaaccat tttatagact accaatttct tttatgttaa ctagaatgct 4081 tttgttaaaa gttatttgtt cattatttgt gctacccctt tgattatgca gacaacctca 4141 tcagctgcct aacttatcca tctttgaact tctgactact tgttgtatct gctggatatt 4201 tagttcaact gtatagtttt atttacttct gtatgtgtat ttttgtgaag tattcacaaa 4261 ggttaagtta aaataaaacc aagggatatc ttgcaaaaaa aaaaaaa // LOCUS HUMGAPB 3456 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human GTPase-activating protein ras p21 (GAP) mRNA, complete cds. ACCESSION M23612 KEYWORDS GTP-ase-activating protein. SOURCE Human placenta, cDNA to mRNA, clone 16. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3456) AUTHORS Trahey,M., Wong,G., Halenbeck,R., Rubinfeld,B., Martin,G.A., Ladner,M., Long,C.M., Crosier,W.J., Watt,K., Koths,K. and McCormick,F. TITLE Molecular cloning of two types of GAP complementary DNA from human placenta JOURNAL Science 242, 1697-1700 (1988) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by C.M.Long, 31-MAR-1989. For sequence of clone 101 refer to M23379. FEATURES from to/span description pept 100 2712 ras p21 GTP-ase-activating protein (GAP) site 49 51 5' in frame termination codon BASE COUNT 1134 a 640 c 687 g 995 t ORIGIN 1 ggaagaggtg gccataccgt tgaccgctcc tccaactaac cagtaagtta agactgctgt 61 tcaggaattt gggaagctgg ctccagaaaa gaagtggaaa tgaaggggtg gtatcacgga 121 aaacttgaca gaacgatagc agaagaacgc ctcaggcagg cagggaagtc tggcagttat 181 cttataagag agagtgatcg gaggccaggg tcctttgtac tttcatttct tagccagatg 241 aatgttgtca accattttag gattattgct atgtgtggag attactacat tggtggaaga 301 cgtttttctt cactgtcaga cctaataggt tattacagtc atgtttcttg tttgcttaaa 361 ggagaaaaat tactttaccc agttgcacca ccagagccag tagaagatag aaggcgtgta 421 cgagctattc taccttacac aaaagtacca gacactgatg aaataagttt cttaaaagga 481 gatatgttca ttgttcataa tgaattagaa gatggatgga tgtgggttac aaatttaaga 541 acagatgaac aaggccttat tgttgaagac ctagtagaag aggtgggccg ggaagaagat 601 ccacatgaag gaaaaatatg gttccatggg aagatttcca aacaggaagc ttataattta 661 ctaatgacag ttggtcaagt ctgcagtttt cttgtgaggc cctcagataa tactcctggc 721 gattattcac tttatttccg gaccaatgaa aatattcagc gatttaaaat atgtccaacg 781 ccaaacaatc agtttatgat gggaggccgg tattataaca gcattgggga catcatagat 841 cactatcgaa aagaacagat tgttgaagga tattatctta aggaacctgt accaatgcag 901 gatcaagaac aagtactcaa tgacacagtg gatggcaagg aaatctataa taccatccgt 961 cgtaaaacaa aggatgcctt ttataaaaac attgttaaga aaggttatct tctgaaaaag 1021 ggcaaaggaa aacgttggaa aaatttatat tttatcttag agggtagtga tgcccaactt 1081 atttattttg aaagcgaaaa acgagctacc aaaccaaaag gattaataga tctcagtgta 1141 tgttctgtct atgtcgttca tgatagtctc tttggcaggc caaactgttt tcagatagta 1201 gttcagcact ttagtgaaga acattacatc ttttactttg caggagaaac tccagaacaa 1261 gcagaggatt ggatgaaagg tctgcaggca ttttgcaatt tacggaaaag tagtccaggg 1321 acatccaata aacgccttcg tcaggtcagc agccttgttt tacatattga agaagcccat 1381 aaactcccag taaaacattt tactaatcca tattgtaaca tctacctgaa tagtgtccaa 1441 gtagcaaaaa ctcatgcaag ggaagggcaa aacccagtat ggtcagaaga gtttgtcttt 1501 gatgatcttc ctcctgacat caatagattt gaaataactc ttagtaataa aacaaagaaa 1561 agcaaagatc ctgatatctt atttatgcgc tgccagttga gccgattaca gaaagggcat 1621 gccacagatg aatggtttct gctcagctcc catataccat taaaaggtat tgaaccaggg 1681 tccctgcgtg ttcgagcacg atactctatg gaaaaaatca tgccagaaga agagtacagt 1741 gaatttaaag agcttatact gcaaaaggaa cttcatgtag tctatgcttt atcacatgta 1801 tgtggacaag accgaacact actggccagc atcctactga ggatttttct tcacgaaaag 1861 cttgaatcgt tgttgttatg cacactaaat gacagagaaa taagcatgga agatgaagcc 1921 actaccctat ttcgagccac aacacttgca agcaccttga tggagcagta tatgaaagcc 1981 actgctacac agtttgttca tcatgctttg aaagactcta ttttaaagat aatggaaagc 2041 aagcagtctt gtgagttaag tccatcaaag ttagaaaaaa atgaagatgt gaacactaat 2101 ttaacacacc tattgaacat actttcagag cttgtggaga aaatattcat ggcttcagaa 2161 atacttccac cgacattgag atatatttat gggtgtttac agaaatctgt tcagcataag 2221 tggcctacaa ataccaccat gagaacaaga gttgttagtg gttttgtttt tcttcgactc 2281 atctgtcctg ccatcctgaa tccacggatg ttcaatatca tctcagattc tccatctcct 2341 attgctgcaa gaacactgat attagtggct aaatctgtgc agaacttagc aaatcttgtg 2401 gaatttggag ctaaggagcc ctacatggaa ggtgtcaatc cattcatcaa aagcaacaaa 2461 catcgtatga tcatgttttt agatgaactt gggaatgtac ctgaacttcc ggacactaca 2521 gagcattcta gaacggacct gtcccgtgat ttagcagcat tgcatgagat ttgcgtggct 2581 cattcagatg aacttcgaac gctcagtaat gagcgtggtg cacagcagca cgtattgaaa 2641 aagcttctgg ctataacaga actgcttcaa caaaaacaaa accagtatac aaaaaccaat 2701 gatgtcaggt agcagccttc gccccagtgt tctgcatgga ttcagcatgt ccaacatggt 2761 aattcacttc agtttaatgt ctcctttgct cttgccaaaa aatagcacac ttttccacat 2821 tccagtgatg tgtgagctat gcaaacaaaa tccaagattc tgctggtgaa taactatgcc 2881 agcaaccttg taagctatct gtgcaggata tttgcactat ttccacatgg aatcaatctt 2941 taacaacctc tgagccttgg tgtacagacc acctttcaca aaacgaaatg ctatgactgt 3001 atcttgatat ctcgaacttt caaaatatat tttcagtaca cccagttgcc aaagttttgc 3061 tgtctcttag agaaagaact atgaaatcaa ctgacaagaa acacattctt attgacaatt 3121 gtgtataact ggattgcaga ctgttcttac tgtaactact tcctgattag gaatatgacc 3181 atttgactgt tcaatgatta tttgtattta cagtttccag agtttgtcat tataatagga 3241 acaatctttg ctgtatactt ttaaaaaata ctctgctatt tctcttgctg gaactgttga 3301 aagaaaatat atagaatgat ctattgctca tcagctttat tttttaaaca tacgacttat 3361 tttgttgaaa ttgtcaaaga ctgtatttag atctcataat gctttgttaa atgtttacaa 3421 gtaaatagtt tgaattcagt aaatattaaa aaaaaa // LOCUS YSCSDH 1665 bp ds-DNA PLN 11-JUL-1990 DEFINITION S.cerevisiae succinate dehydrogenase iron-protein subunit (SDH) gene, complete cds. ACCESSION J05487 KEYWORDS succinate dehydrogenase iron-protein subunit. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1665) AUTHORS Lombardo,A., Carine,K. and Scheffler,I.E. TITLE Cloning and characterization of the iron-sulfur subunit gene of succinate dehydrogenase from Saccharomyces cerevisiae JOURNAL J. Biol. Chem. 265, 10419-10423 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.E.Scheffler, 13-APR-1990. FEATURES from to/span description pept 738 1538 succinate dehydrogenase iron-protein subunit (SDH) (EC 1.3.99.1) signal 585 589 CAAT box signal 616 622 TATA box BASE COUNT 532 a 349 c 345 g 439 t ORIGIN Chromosome VII. 1 atcttacaag taacttaagt caaggcgtga aaagtaccac cactgtgctt gacatgcaaa 61 agggttgcag agtgcgtcta ccaaggtacg tggaccatga tcaaatcatc aagccttatg 121 atctacgtga ggcccaagga caatactggc tcaagaccgt gaatggagga gtattatgaa 181 tgatgaaatc ctgtcgcacg tatattgcca ggcaaagaac tagcagtaat tgtgtcatgt 241 cagcacattg ctgaggtgca aatggccacc caagagctta ttggagcaca ggatatcttc 301 atcagggaat tacattggaa agatccggtc ttcaaattaa ctcaagtctc aatccgaata 361 cttcattccc atcagcgatc ctgaagaacg tcggtccttg tacaggaaca tcgccattgc 421 tgttagagaa tacaataagt actgtgaagc tatcctatga tcacatatga aagtatatac 481 ccgcttttgt acactatgta gctataattc aatcgtatta ttgtacgtcc gcacgaccat 541 gccttagaaa tatccgcagc gcgcaaaagg cggcctcgca ttggcccaat tagctccggt 601 gtaaaaaggg caaactatat aagggattaa tgactttcta tgagaatgcc aaaaaatgtt 661 aggctaaagg aagggattga aaggaatata gttgagctat actttcttga aatactggag 721 tatacatatt tatagggatg ttgaacgtgc tattgagaag gaaggccttt tgtttggtga 781 cgaagaaggg tatggctact gccacaacag ctgcagctac gcataccccc agattgaaaa 841 cttttaaagt ttacagatgg aatccagacg agccaagtgc taaacctcat ttacagtcat 901 atcaagtgga tctgaatgac tgtgggccca tggtacttga tgcgctgtta aagatcaaag 961 acgaacagga ttctacccta acttttagaa gatcatgtag agaaggtatc tgcggttcat 1021 gtgccatgaa cattggcggt agaaacacgc tagcttgtat atgtaagatc gaccagaacg 1081 aatccaaaca actcaagatc tatccattac cccacatgtt tattgtcaaa gatttggtac 1141 ctgatttaac taacttctac caacaataca aatctatcca accttactta cagagatcat 1201 cgtttccaaa ggatggaacg gaagtgctac aaagtattga agatcgtaag aaactggatg 1261 gtctttacga atgtattctg tgtgcatgct gctctacttc atgtccatcg tactggtgga 1321 accaagaaca gtatttgggc cctgccgtgc taatgcaagc ctaccgttgg ctaattgact 1381 ctagagacca agctacaaag acaagaaagg ccatgctaaa caactccatg tcattgtaca 1441 gatgtcacac catcatgaac tgtactagaa cttgtccaaa gggcttgaat cctggtttgg 1501 ctattgctga aattaagaaa tctttggcat ttgcctagac tatcagaaaa acagctagcc 1561 ccgaagaact cagaagcctc tcaaatgatt ttggcactaa taaaagcacc aactattatt 1621 attattattt tcaaggacga aactcaccat tctcacacat tcctt // LOCUS BOVPDEAP 585 bp ss-mRNA MAM 11-JUL-1990 DEFINITION Bovine cone photoreceptor cyclic nucleotide phosphodiesterase alpha'-subunit (PDE), partial cds. ACCESSION M33140 M29465 KEYWORDS cone photoreceptor cyclic nucleotide phosphodiesterase. SOURCE Bovine dark-adapted frozen retina, cDNA to mRNA, clone BC-alpha-1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 585) AUTHORS Charbonneau,H., Prusti,R.K., LeTrong,H., Sonnenburg,W.K., Mullaney,P.J., Walsh,K.A. and Beavo,J.A. TITLE Identification of a noncatalytic cGMP-binding domain conserved in both the cGMP-stimulated and photoreceptor cyclic nucleotide phosphodiesterases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 288-292 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 585 cone photoreceptor cyclic nucleotide phosphodiesterase alpha'-subunit (AA at 1) BASE COUNT 198 a 115 c 126 g 146 t ORIGIN 1 agagaagtca tcttttataa aatcatcgat tacattttac atggaaaaga agagatcaaa 61 gtcattccga cacctcccat ggaccactgg actctcatta gtgggttgcc aacatatgtt 121 gctgaaaatg gatttatctg caacatgctg aacgccccgg cggatgaata cttcacgttt 181 cagaaaggac ctgtagatga aactggctgg gtcattaaaa atgtcttgtc cctgcctatt 241 gtcaacaaaa aggaagacat cgtgggcgta gctacatttt acaacaggaa ggatggaaag 301 ccttttgatg aatatgatga gcacatcgct gagactctca cacagtttct tggatggtct 361 ctcttaaata ctgacaccta tgagaaaatg aataagctgg agaacagaaa ggacatagcc 421 caggaaatgc tcatgaacca caccaaggct acacctgatg agatcaagtc tattttgaaa 481 tttaaagaga agttaaatat agatgtaatt gaagactgtg aagaaaaaca gcttgtcaca 541 attttgaagg aggacctgcc agacccacgg actgcagacc tgtat // LOCUS CHKG1CLSE 240 bp ds-DNA VRT 11-JUL-1990 DEFINITION Chicken delta-1-crystallin gene, intron 3 lens-specific enhancer cor segments B3 and B4. ACCESSION M33954 KEYWORDS delta-1-crystallin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 240) AUTHORS Goto,K., Okada,T.S. and Kondoh,H. TITLE Functional cooperation of lens-specific and nonspecific elements in the delta-1-crystallin enhancer JOURNAL Mol. Cell. Biol. 10, 958-964 (1990) STANDARD simple staff_review FEATURES from to/span description site 17 116 core segment B3 site 112 235 core segment B4 BASE COUNT 61 a 52 c 53 g 74 t ORIGIN 1 gtcagtgagg tgtgctcagc atgacctgcc ctcccaccct cttcagactg aacattcctg 61 aggaattgtt tcagtatgaa ttaggaatat tctttttcca atggcacttg ggatcccttt 121 gtgtctggct gcctgagtta gtagaagaca atgcacaata ttgtataggg gtgaagaaga 181 gtcagccact aagcactttt tctgaaatat tcattgttgt tgctcaccta ccatggacaa // LOCUS CHKOVAL 9206 bp ds-DNA VRT 11-JUL-1990 DEFINITION Chicken ovalbumin gene, complete cds. ACCESSION J00895 KEYWORDS ovalbumin. SOURCE Chicken oviduct DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1343 to 8906) AUTHORS Woo,S.L.C., Beattie,W.G., Catterall,J.F., Dugaiczyk,A., Staden,R., Brownlee,G.G. and O'Malley,B.W. TITLE Complete nucleotide sequence of the chicken chromosomal ovalbumin gene and its biological significance JOURNAL Biochemistry 20, 6437-6446 (1981) STANDARD full staff_review REFERENCE 2 (bases 1043 to 1562; 2675 to 4732; 8885 to 9206) AUTHORS Benoist,C., O'Hare,K., Breathnach,R. and Chambon,P. TITLE The ovalbumin gene-sequence of putative control regions JOURNAL Nucleic Acids Res. 8, 127-142 (1980) STANDARD full staff_review REFERENCE 3 (bases 1357 to 1389; 2941 to 3052; and ivs junctions) AUTHORS Breathnach,R., Benoist,C., O'Hare,K., Gannon,F. and Chambon,P. TITLE Ovalbumin gene: evidence for a leader sequence in mRNA and DNA sequences at the exon-intron boundaries JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 4853-4857 (1978) STANDARD full staff_review REFERENCE 4 (bases 1282 to 1420; 2952 to 2999) AUTHORS Gannon,F., O'Hare,K., Perrin,F., LePennec,J.P., Benoist,C., Cochet,M., Breathnach,R., Royal,A., Garapin,A., Cami,B. and Chambon,P. TITLE Organisation and sequences at the 5' end of a cloned complete ovalbumin gene JOURNAL Nature 278, 428-434 (1979) STANDARD full staff_review REFERENCE 5 (bases 1343 to 8906; exons only) AUTHORS McReynolds,L., O'Malley,B.W., Nisbet,A.D., Fothergill,J.E., Givol,D., Fields,S., Robertson,M. and Brownlee,G.G. TITLE Sequence of chicken ovalbumin mRNA JOURNAL Nature 273, 723-728 (1978) STANDARD full staff_review REFERENCE 6 (bases 2675 to 5042) AUTHORS Robertson,M.a., Staden,R., Tanaka,Y., Catterall,J.F., O'Malley,B.W. and Brownlee,G.G. TITLE Sequence of three introns in the chick ovalbumin gene JOURNAL Nature 278, 370-372 (1979) STANDARD full staff_review REFERENCE 7 (bases 1 to 1042) AUTHORS Heilig,R., Muraskowsky,R. and Mandel,J.L. TITLE The ovalbumin gene family: The 5' end region of the X and Y genes JOURNAL J. Mol. Biol. 156, 1-19 (1982) STANDARD full staff_review REFERENCE 8 (bases 5576 to 5624) AUTHORS Lai,E.C., Woo,S.L.C., Dugaiczyk,A. and O'Malley,B.W. TITLE The ovalbumin gene: Alleles created by mutations in the intervening sequences of the natural gene JOURNAL Cell 16, 201-211 (1979) STANDARD simple staff_entry REFERENCE 9 (bases 443 to 611) AUTHORS Schweers,L.A., Frank,D.E., Weigel,N.L. and Sanders,M.M. TITLE The steroid-dependent regulatory element in the ovalbumin gene does not function as a typical steroid response element JOURNAL J. Biol. Chem. 265, 7590-7595 (1990) STANDARD simple staff_entry COMMENT Eight exons reported. Sequence homologies with adenovirus early 1a, adenovirus major late, chicken conalbumin, chkx and chky genes noted for 5' flanking sequence. OV1.3 is identical to OV1.8 except that intron E is shorter by 522 nucleotides [8]. FEATURES from to/span description pept 2996 3163 ovalbumin, exon 1 3415 3465 ovalbumin, exon 2 4047 4175 ovalbumin, exon 3 4576 4693 ovalbumin, exon 4 5652 5794 ovalbumin, exon 5 6126 6281 ovalbumin, exon 6 7864 8259 ovalbumin, exon 7 pre-msg 1343 8906 oval mRNA and introns IVS 1390 2978 oval intron A IVS 3164 3414 oval intron B IVS 3466 4046 oval intron C IVS 4176 4575 oval intron D IVS 4694 5651 oval intron E IVS 5795 6125 oval intron F IVS 6282 7863 oval intron G allele 1282 1282 g may be c [1] allele 1309 1309 g may be a [1] allele 1376 1376 g may be c [1],[7] allele 1385 1385 g may be a [1],[7] allele 1393 1393 a may be g [1] allele 3010 3010 t may be c [1] allele 3154 3154 a may be g [1] allele 5747 5747 g may be a [1] allele 8032 8032 can be g [5] conflict 1471 1471 c in [1]; t in [2],[7] conflict 1523 1523 a in [1]; t in [2],[7] conflict 1538 1539 at in [1]; a in [2] conflict 2681 2681 t in [1],[6]; g in [2] conflict 2696 2696 a in [1],[6]; g in [2] conflict 3552 3552 a in [1],[6]; g in [2] conflict 3683 3683 c in [1],[6]; t in [2] conflict 3934 3934 a in [1],[6]; c in [2] conflict 3960 3960 a in [1],[6]; g in [2] conflict 3994 3994 t in [1],[6]; c in [2] conflict 4207 4208 tt in [1],[6]; ttt in [2] conflict 4396 4396 t in [1],[6]; c in [2] conflict 5636 5636 c in [1]; g in [3] conflict 6100 6100 t in [1]; tc in [3] conflict 6119 6119 t in [1]; tg in [3] conflict 8492 8505 gactcacagtactg in [1]; g in [5] site 443 611 steroid-dependent regulatory element [9] BASE COUNT 2994 a 1749 c 1721 g 2742 t ORIGIN 5 bp upstream of PstI site. 1 ctgcagactg acatgcattt cataggtaga gataacattt actgggaagc acatctatca 61 tcataaaaag caggcaagat tttcagactt tcttagtggc tgaaatagaa gcaaaagacg 121 tgattaaaaa caaaatgaaa caaaaaaaat cagttgatac ctgtggtgta gacatccagc 181 aaaaaaatat tatttgcact accatcttgt cttaagtcct cagacttggc aaggagaatg 241 tagatttcta cagtatatat gttttcacaa aaggaaggag agaaacaaaa gaaaatggca 301 ctgactaaac ttcagctagt ggtataggaa agtaattctg cttaacagag attgcagtga 361 tctctatgta tgtcctgaag aattatgttg tacttttttc ccccattttt aaatcaaaca 421 gtgctttaca gaggtcagaa tggtttcttt actgtttgtc aattctatta tttcaataca 481 gaacaatagc ttctataact gaaatatatt tgctattgta tattatgatt gtccctcgaa 541 ccatgaacac tcctccagct gaatttcaca attcctctgt catctgccag gccattaagt 601 tattcatgga agatctttga ggaacactgc aagttcatat cataaacaca tttgaaattg 661 agtattgttt tgcattgtat ggagctatgt tttgctgtat cctcagaaaa aaagtttgtt 721 ataaagcatt cacacccata aaaagataga tttaaatatt ccagctatag gaaagaaagt 781 gcgtctgctc ttcactctag tctcagttgg ctccttcaca tgcatgcttc tttatttctc 841 ctattttgtc aagaaaataa taggtcacgt cttgttctca cttatgtcct gcctagcatg 901 gctcagatgc acgttgtaga tacaagaagg atcaaatgaa acagacttct ggtctgttac 961 tacaaccata gtaataagca cactaactaa taattgctaa ttatgttttc catctctaag 1021 gttcccacat ttttctgttt tcttaaagat cccattatct ggttgtaact gaagctcaat 1081 ggaacatgag caatatttcc cagtcttctc tcccatccaa cagtcctgat ggattagcag 1141 aacaggcaga aaacacattg ttacccagaa ttaaaaacta atatttgctc tccattcaat 1201 ccaaaatgga cctattgaaa ctaaaatcta acccaatccc attaaatgat ttctatggcg 1261 tcaaaggtca aacttctgaa gggaacctgt gggtgggtca caattcaggc tatatattcc 1321 ccagggctca gccagtgtct gtacatacag ctagaaagct gtattgcctt tagcagtcaa 1381 gctcgaaagg taagcaactc tctggaatta ccttctctct atattagctc ttacttgcac 1441 ctaaacttta aaaaattaac aattattgtg ctatgtgttg tatctttaag ggtgaagtac 1501 ctgcgtgata ccccctataa aaacttctca cctgtgtatg cattctgcac tattttatta 1561 tgtgtaaaag ctttgtgttt gttttcagga ggcttattct ttgtgcttaa aatatgtttt 1621 taatttcaga acatcttatc ctgtcgttca ctatctgata tgctttgcag tttgcttgat 1681 taacttctag ccctacagag tgcacagaga gcaaaatcat ggtgttcagt gaattctggg 1741 gagttatttt aatgtgaaaa ttctctagaa gtttaattcc tgcaaagtgc agctgctgat 1801 cactacacaa gataaaaatg tggggggtgc ataaacgtat attcttacaa taatagatac 1861 atgtgaactt atatacagaa aagaaaatga gaaaaatgtg tgtgtgtata ctcacacacg 1921 tggtcagtaa aaacttttga ggggtttaat acagaaaatc caatcctgag gccccagcac 1981 tcagtacgca tataaagggc tgggctctga aggacttctg actttcacag attatataaa 2041 tctcaggaaa gcaactagat tcatgctggc tccaaaagct gtgctttata taagcacact 2101 ggctatacaa tagttgtaca gttcagctct ttataataga aacagacaga acaagtataa 2161 atcttctatt ggtctatgtc atgaacaaga attcattcag tggctctgtt ttatagtaaa 2221 cattgctatt ttatcatgtc tgcatttctc ttctgtctga atgtcaccac taaaatttaa 2281 ctccacagaa agtttatact acagtacaca tgcatatctt tgagcaaagc aaaccatacc 2341 tgaaagtgca atagagcaga atatgaatta catgcgtgtc tttctcctag actacatgac 2401 cccatataaa ttacattact tatctattct gccatcacca aaacaaaggt aaaaatactt 2461 ttgaagatct actcatagca agtagtgtgc aacaaacaga tatttctcta catttatttt 2521 tagggaataa aaataagaaa taaaatagtc agcaagcctc tgctttctca tatatctgtc 2581 caaacctaaa gtttactgaa atttgctctt tgaatttcca gttttgcaag cctatcagat 2641 tgtgttttaa tcagaggtac tgaaaagtat caatgaattc tagctttcac tgaacaaaaa 2701 tatgtagagg caactggctt ctgggacagt ttgctaccca aaagacaact gaatgcaaat 2761 acataaatag atttatgaat atggttttga acatgcacat gagaggtgga tatagcaaca 2821 gacacattac cacagaatta ctttaaaact acttgttaac atttaattgc ctaaaaactg 2881 ctcgtaattt actgttgtag cctaccatag agtaccctgc atggtactat gtacagcatt 2941 ccatccttac attttcactg ttctgctgtt tgctctagac aactcagagt tcaccatggg 3001 ctccatcggt gcagcaagca tggaattttg ttttgatgta ttcaaggagc tcaaagtcca 3061 ccatgccaat gagaacatct tctactgccc cattgccatc atgtcagctc tagccatggt 3121 atacctgggt gcaaaagaca gcaccaggac acaaataaat aaggtgagcc tacagttaaa 3181 gattaaaacc tttgccctgc tcaatggagc cacagcactt aattgtatga taatgtccct 3241 tggaaactgc atagctcaga ggctgaaaat ctgaaaccag agttatctaa aagtgtggcc 3301 acctccaact cccagagtgt tacccaaatg cactagctag aaatcttgaa actggattgc 3361 ataacttctt tttgtcataa ccattatttc agctactatt attttcaatt acaggttgtt 3421 cgctttgata aacttccagg attcggagac agtattgaag ctcaggtaca gaaataattt 3481 cacctccttc tctatgtccc tttcctctgg aagcaaaata cagcagatga agcaatctct 3541 tagctgttcc aagccctctc tgatgagcag ctagtgctct gcatccagca gttgggagaa 3601 cactgttcat aagaacagag aaaaagaagg aagtaacagg ggattcagaa caaacagaag 3661 ataaaactca ggacaaaaat accgtgtgaa tgaggaaact tgtggatatt tgtacgctta 3721 agcaagacag ctagatgatt ctggataaat gggtctggtt ggaaaagaag gaaagcctgg 3781 ctgatctgct ggagctagat tattgcagca ggtaggcagg agttccctag agaaaagtat 3841 gagggaatta cagaagaaaa acagcacaaa attgtaaata ttggaaaagg accacatcag 3901 tgtagttact agcagtaaga cagacaggat gaaaaatagt tttgtaaaca gaagtatcta 3961 actactttac tctgttcata cactacgtaa aacttactaa gtaataaaac tagaataaca 4021 acatctttct ttctctttgt attcagtgtg gcacatctgt aaacgttcac tcttcactta 4081 gagacatcct caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta 4141 gactttatgc tgaagagaga tacccaatcc tgccagtaag ttgctctaaa atctgatctg 4201 agtgtattcc atgccaaagc tctaccattc tgtaatgcaa aaacagtcag agttccacat 4261 gtttcactaa gaaaatttct ttttctcttg tttttacaaa tgaaagagag gacaaataac 4321 atttctctat caccgacctg aaactctaca gtcttcagag aatgaatggc ttgctaaaag 4381 aatgtcaaat cttactatac agctatttca tattacacta ctaaatacac tataaggcat 4441 agcatgtagt aatacagtgt aaaatagctt tttacactac tatattatta atatctgtta 4501 attccagtct tgcatttcac atttgcaaaa cgttttgaaa ttcgtatctg aaagctgaat 4561 actcttgctt tacaggaata cttgcagtgt gtgaaggaac tgtatagagg aggcttggaa 4621 cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc ctgggtagaa 4681 agtcagacaa atggtaaggt agaacatgct ttgtacatag tgagagttgg ttcaccctaa 4741 tactgagaac ttggatatag ctcagccagc gtgctttgcg ttcaagctta ccagagctgt 4801 tgtatgcctg ttaagcaggg catacagtca tgaggctctt gaaaaatctt aacagacaaa 4861 gggcaatgga aaatcggagt taagggatgg tagggataaa atgcatagaa agaggtacca 4921 caattttgat ttttgcccta atgcctctct gcgtggttcc tcaatttttc tacttcattc 4981 ctcatctcct cagagcattc ctttccctca tgcttgaaac acagatgaaa gactgtgaat 5041 tctaactgag atgaaaacat ccacaaccac acaacctctg gtgtggagtc acattctgtg 5101 aaggcaaaaa ctaggccacg taatctatgc gtgcaagcta cgcgtaagct atgtgtgtga 5161 caggacaatg tgaggaacat actatgtgca caaggactgc agaataaaca ggagcaaagt 5221 ttttgaagaa aacagagtaa aatcctgttt tcctcttttg ttacattctt tacatatatc 5281 tcaaatttcc tctttggtta gaagcaagta atatttatgt ttcttggtac tgtttgggtt 5341 gaagaccatt ctgggataag agaaattcca gtggttcttc ccctaatcat aaaatgtcag 5401 gtttagtttt tttgtaacac agaaatctct tcatctttta tcttttgttg tgattcttga 5461 tagagagaga aacaagactt actgacaata gcagcaagaa aatcaatctt ggaagaacaa 5521 gattgcaatt gcaaaaacaa accaatgtcc ttgcccctac atcctcttcc ccataaattc 5581 tacattctct atctaccttg tgcttgccaa catgatatac gtaaactctc ttttcctatt 5641 cattcttaaa ggaattatca gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc 5701 aatggttctg gttaatgcca ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga 5761 agacacacaa gcaatgcctt tcagagtgac tgaggtatat gggcatacct tagagatgta 5821 atctagaatt tatgaagaga gtagacatgt tgttatatga acactgcatt agcgtatctg 5881 ctcatttgtc tgcatctctt tcagacactg tgttaaaagc agggaatttt ccttatgtct 5941 ctctcgtcac aatattcctg acattgcaaa gctcctgaga aataacttca gattccactt 6001 ttcctaggaa ggcttctgga tgagaactaa tcatcttaac tgtaactaga catttctgca 6061 tccaagaata atctttgtta aaactatatt ctctctctct tttttttttt tttttggttc 6121 tccagcaaga aagcaaacct gtgcagatga tgtaccagat tggtttattt agagtggcat 6181 caatggcttc tgagaaaatg aagatcctgg agcttccatt tgccagtggg acaatgagca 6241 tgttggtgct gttgcctgat gaagtctcag gccttgagca ggtatggccc tagaagttgg 6301 cttcagaata ttaaaaacac atggaaattt agctgttgta aagctctttt caacacagtt 6361 atcctaaaac atttaaccag cacaaatttc atcatgattc aatatgtgat tgttgcatag 6421 aagtgtagat ttgtcccact gggtcctgca atagcccatg ctgagcatgg cttgctgaaa 6481 gaactgcttt agagggtgaa aagtttgaca cagcagacaa gatgattctc acctaagcag 6541 ctgttactgt agtggcttga actctaaagg tcttgtatct ccattcctgt gcactgagga 6601 gcttcttgga aagttcatat aaggtttact agttctaact attatctcat ttggtggcac 6661 tcaatgtgct ttgttcacgt cttcataaat taatctatct aaaaattgga tgtggttaaa 6721 gcaatttcag aaataacatg tacataatgt acaattattg atatgaacag aacacaggca 6781 tagcatattg taattaggag gactgtagtt attttgaata ggaaacacaa tgtaataaat 6841 gagaattcat tgaaatgtta gtatgctaac tcaatctaaa ttataaagat aaagaggcat 6901 ttaatcacag ctagatttcc atcacttgtg acagacaggc atatgaatga ttatgtacag 6961 ctctaggaaa aaaagtatgt aggaaaacta gtacattttg attagaaagt ctgaaaatga 7021 ggtgccttga tcaaagagaa tacgtgtgtt tgagaaaaaa aaagtttgga tagaggtggt 7081 aagagagaat atattgaaat ggtgtttcta caaactgcca tggccagatt tgtgtaagag 7141 acattcagta agtaggcaag gaaagaaata ttactaggta caaagcaaca tcagtaatac 7201 caaaagaaac caattattcc agatgccaat ctcgtaatag ggttaagaga tttccacccc 7261 tctagtggtc accagtgcaa ccagtaactt tgctaattta cattttcttt ttttaaatgg 7321 cagatatagc tttgaactga gtgatcatga actggtactg tgtaatagat gaagacatac 7381 ttgacgacta aacttctgat ttttaaaaac tcaaattctc ttgaaagatc agttcccagt 7441 ctagtaacag ctgatagttt aagtatcagt aattggctac cattaacaac tggctcctga 7501 gaggtcttaa atgtagagac agctttaaac tcaaaagcac agagtgattt ttagaataga 7561 tttcccaagc aaagaaaata aacagggagg agctttaagg gagtagccat ctcattatta 7621 ttattattta aagaaatggc agcaagccta caaaagaaaa ataagacaga gcagagaaga 7681 aagagtcatg gtatgctttt ctatcttagc aaaattaatc tctacatgcc taggaaaaag 7741 ccatgacaag agcaatcagt tcaaaaggtg tatgcaaaaa accacataat agtaactagt 7801 actgcattgc caggaaggaa gttatgtcgc cattccatgg atctcattct catttccttg 7861 cagcttgaga gtataatcaa ctttgaaaaa ctgactgaat ggaccagttc taatgttatg 7921 gaagagagga agatcaaagt gtacttacct cgcatgaaga tggaggaaaa atacaacctc 7981 acatctgtct taatggctat gggcattact gacgtgttta gctcttcagc caatctgtct 8041 ggcatctcct cagcagagag cctgaagata tctcaagctg tccatgcagc acatgcagaa 8101 atcaatgaag caggcagaga ggtggtaggg tcagcagagg ctggagtgga tgctgcaagc 8161 gtctctgaag aatttagggc tgaccatcca ttcctcttct gtatcaagca catcgcaacc 8221 aacgccgttc tcttctttgg cagatgtgtt tccccttaaa aagaagaaag ctgaaaaact 8281 ctgtcccttc caacaagacc cagagcactg tagtatcagg ggtaaaatga aaagtatgtt 8341 atctgctgca tccagacttc ataaaagctg gagcttaatc tagaaaaaaa atcagaaaga 8401 aattacactg tgagaacagg tgcaattcac ttttccttta cacagagtaa tactggtaac 8461 tcatggatga aggcttaagg gaatgaaatt ggactcacag tactgagtca tcacactgaa 8521 aaatgcaacc tgatacatca gcagaaggtt tatgggggaa aaatgcagcc ttccaattaa 8581 gccagatatc tgtatgacca agctgctcca gaattagtca ctcaaaatct ctcagattaa 8641 attatcaact gtcaccaacc attcctatgc tgacaaggca attgcttgtt ctctgtgttc 8701 ctgatactac aaggctcttc ctgacttcct aaagatgcat tataaaaatc ttataattca 8761 catttctccc taaactttga ctcaatcatg gtatgttggc aaatatggta tattactatt 8821 caaattgttt tccttgtacc catatgtaat gggtcttgtg aatgtgctct tttgttcctt 8881 taatcataat aaaaacatgt ttaagcaaac acttttcact tgtagtattt gaagtacagc 8941 aaggttgtgt agcagggaaa gaatgacatg cagaggaata agtatggaca cacaggctag 9001 cagcgactgt agaacaagta ctagtgggtg agaagttgaa caagagtccc ctacaagcaa 9061 cttaatctaa taagctagtg gtctacatca gctaaaagag catagtgagg gatgaaattg 9121 gttctccttt ctaagcatca cctgggacaa ctcatctgga gcagtgtgtc caatctgccg 9181 ctgccctgat ctcggctggg gtgatg // LOCUS PMUCEN 150 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage Mu wild type DNA fragment with a gyrase cleavage site. ACCESSION M32302 KEYWORDS . SOURCE Bacteriophage Mu (wild type) DNA. ORGANISM Bacteriophage mu Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Pato,M., Howe,M. and Higgins,P. TITLE DNA gyrase binds to a centrally located replication enhancer (CEN) in the bacteriophage Mu genome JOURNAL Unpublished (1990) In Press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.L.Pato, 23-FEB-1990. FEATURES from to/span description site 49 50 DNA gyrase cleavage site in complimentary strand mut 44 44 a in wt; g in Mu nuB103 mut 50 50 g in wt; c in Mu nuB1 BASE COUNT 40 a 38 c 27 g 45 t ORIGIN Map position at 18.0 kb. 1 acgcgtcagc gccgctctga ggcaataaac agaatcaggc ataaaatcag ccgcacagat 61 tttttaaaac gcgccacggg atttttaaac cggtatttaa cggtgtatga atcccgtttt 121 atcttccttt cactttcttt ctccagtact // LOCUS RATRNRTR 2577 bp ds-DNA ROD 11-JUL-1990 DEFINITION Rat snRNP-associated polypeptide N, complete cds. ACCESSION J05497 KEYWORDS snRNP-associated polypeptide N. SOURCE Rat male adult (Fisher) DNA, clones rgV and rgIII2. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2577) AUTHORS Schmauss,C. and Lerner,M.R. TITLE The closely related small nuclear ribonucleoprotein polypeptides N and B/B' are distinguishable by antibodies as well as by differences in their mRNAs and gene structures JOURNAL J. Biol. Chem. 265, 10733-10739 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.Schmauss, 13-APR-1990. FEATURES from to/span description pept 1314 2036 snRNP-associated polypeptide N signal 774 779 TATA box signal 698 702 CAAT box BASE COUNT 822 a 537 c 581 g 637 t ORIGIN 1 taactagaga actgagaaca gaatccctgt tagaggaatt agagaccaaa ttaaaagagg 61 tgaaggggct tgcaacccca ttagaacaac aatgccaacc aaccagagct cccagggact 121 aaaccactac ccaaagacta cacatggctc caactgcata tgtagcagag gatggcctag 181 ttgggcacaa tggaaagaga agcccttgga cctgcccagg ctggacccac cagtgtagag 241 taatgtctga ggggtagaag caggtggttg ggaatgggaa tacccttata tgtgaaggga 301 agcaggatga ggtagggaac ttatgttcgt aaaccaggaa agggaataac gtttgaaatg 361 taaataaaaa tatatccagt gaaaaaaaaa ctgaagtcta taataaaagc ttttaatcct 421 ctcagccctt aataaaagtt aattatatta cttatgttaa aaaaaacata aaacagcatg 481 gtattgtact tttttttttc agacaaaagg tctatggcac acagccaaat cagtgacctc 541 tggggcacaa tttccagaaa tcaacatcct agagttgacc tctggcttcc atgtatacgt 601 gcatgcacac acacatatgc atatacatac aaaattacat atatgcagtt gtctaaatca 661 tatgaagaat ttcaagttgt tttaagttta atatcagcaa atacatgcaa atgtgattat 721 aagaagctgg atggaatcct gagttgttga ctaaagagct aagaaggggc aattataaaa 781 caaaaatgac acatgaaatt ccacccgagg ttagaaataa ttaaagaagg ccattgcggc 841 aagtctagca cagagagtag agggtgctgg aggatgacag acggttggtt ctgaggaggg 901 attttgcaac gaatggagcg aggaagggat cgtttacact tgagaagaac tactgaacag 961 cacgtcccag agattgaggt ccaggtcaaa cgtagaagga cagcctcact gagcaaccaa 1021 gagtgtcact tgtacccacg gcattctcag caacagcaaa ttcctgtggt ggatttccag 1081 gcagaactga gacaggcgtt cttagctgag acaccaagag gtggttaaag cagtattgga 1141 acttcaaggt ggtggaagtc aacaaacaca ggacctatcc actgattgtg aaactttggt 1201 caagcttaca ctgtgttaat aaccctgcat caaaccttta tttattgccc ttccccaagt 1261 tttaaggatc ttgtaatttt agtgttgaca actgctattg tggaacagca atcatgactg 1321 tgggtaagag tagcaagatg ctgcagcata ttgactatag aatgagatgt atcctgcaag 1381 atggaagatt cttcattggc acctttaagg cttttgacaa gcatatgaat ttgatcctct 1441 gtgattgtga tgagttcagg aagatcaagc caaagaatgc aaaacagcca gaacgtgaag 1501 aaaaacgggt tttgggtctg gtcttgctac gtggagagaa cttggtttcc atgacagtgg 1561 agggtccacc tcctaaagat actggcattg ctcgtgtgcc acttgctagt gctgcaggtg 1621 gccctggtgt tggaagagca gctggcagag gagtaccagc aggtgtacct attccccaag 1681 ctcctgctgg attagcaggc cctgtccgag gagtgggagg cccatcccag caggtcatga 1741 ccccacaggg aagaggcact gttgcagctg ctgctgttgc tgctactgct agcattgcag 1801 gagccccaac ccagtacccg ccaggacggg gaactccacc tccacctgta ggcagagcaa 1861 ccccacctcc aggcattatg gctcctccac ctggaatgag accacccatg ggcccaccaa 1921 ttggacttcc ccctgctcaa gggagaccta taggcatgcc ccctccagga atgagactcc 1981 ctcctccagg aattagaggc ccacctcccc caggaatgcg tccaccaaga ccctaagata 2041 cagttgataa atctcagccc ttctctttcc ctacaatgct tcttgtgaaa ttgtgtcgcc 2101 tgcaagcttt tgacccctct tactgcatta actatagata ataaatacat agcgcaattg 2161 aattgaaaaa aaaagaaata attaaagaaa gtaagtcaca atgactattt gctattgaca 2221 ttttttttaa atgcccgaat gagagccagt ggagacgata gaaagtccag aagaagctaa 2281 gataatttca aaacacataa tgtcagtaga acgagggaag gtaagaaccc acagaacaca 2341 agaaaccact catgaaactc ctcacacaca ggaagaaaag gaagaatgta atttttaaaa 2401 aaaaagttat agtcaagtta aactatattt tctcattggt ttttttttgt gactttgtat 2461 ttatttttat gtttctttgt gtatattgta catgtctcag tcaaaggcca acggtgagtg 2521 ttttcctcta aaaaacctta ttgtttaaga cagggtctct tcctgagctc agaattc // LOCUS WUCSSP 1323 bp ds-DNA INV 11-JUL-1990 DEFINITION W.bancrofti species specific DNA fragment. ACCESSION M27140 KEYWORDS . SOURCE W.bancrofti DNA, clone IWb35. ORGANISM Wuchereria bancrofti Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 1323) AUTHORS Dissanayake,S. and Piessens,W.F. TITLE Cloning and characterization of a Wuchereria bancrofti-specific DNA sequence JOURNAL Mol. Biochem. Parasitol. 39, 147-150 (1990) STANDARD simple staff_entry BASE COUNT 399 a 204 c 251 g 469 t ORIGIN 1288 bp upstream of SacI site. 1 gatctctgtt tcattatacc gagtaaatat tggagaaaag aaaaatttgt tcaacgtgtt 61 aaagattaac ttgctttcta tataatggaa acattttgca tattggatta gtcagtaaat 121 taataatgga caattgtgat aagtaaaact aaaaagacat cgtcactctc ttccttatta 181 tagcatttcc ttgcttaaaa ccacttgcga cgtcactttt tgttataaat catatggtga 241 atacttttcc tcatttaaga tcgtttatta gcttttgcat tacaaattgt tcattttagt 301 tgtgaacgca ttttgtacat ttaaatgctt gctttagaat tttaggtttc aactggtacg 361 tttatgccgt ttatatgaaa ttatgggata acaaagaaaa ataaagataa agaagtaaaa 421 attcgaatga ttaaatgaat tattagtacc ctgattgcta tagccctttt ctacgttttg 481 gcaagaagtc ccaaattggt tctcactttt cagaatgaaa atttttagtt gtttatagcg 541 ccaaaagaaa tgattaacag cagtttggct ttgtggacgg aatgatatgc ttttctgcat 601 acctttcata aattggaaaa aacaaaataa tttggctaag agtgaatgga gtattcgttc 661 gtttgtgata ttttcaatgt ttgttgatgt atattcgaag cgtctctgct cactactgtc 721 aaaccctttt taagaacgtt gcttctacgg tcactgggca gctactacgt attgagtgag 781 cgatatgaaa agaatataca gtatctaatg actgccaatg tcaaataaat ttttgtatcg 841 tcactcagcg gtcacaaatg tttcataaat atttcacatg cattctattt taggttcaaa 901 tatgctttta aaattctgct aaatttgcaa actaacgaga ttttgtttgg cagctcttct 961 tatgataacg cagttcaatc ctggtggtga agaatttgcc acagtcttcg cattttggat 1021 gaggttcatg cgtttgtttg tgtttgtgaa atgttgattt atggtcgaat gtccgcccac 1081 aaccgggtac tttgcattca tagatgaaag gctgaccgtg tgtttcctat gtgttatata 1141 ttcgttgtaa ttgttcgtga tcaataggaa acaactggca ggatggcaga ttttaataca 1201 accatatcaa taattatatt aaatgtaaat gttctagctg ggtagagtgg cgtgcatctg 1261 tagtctcggc cacttggaag actgagctca ggaagattac ttgcacccag gagcttgagg 1321 agc // LOCUS YSCHXT2 2890 bp ds-DNA PLN 11-JUL-1990 DEFINITION S.cerevisiae high affinity hexose transporter-2 (HXT2) gene, complete cds. ACCESSION M33270 KEYWORDS high affinity hexose transporter-2. SOURCE S.cerevisiae (isogenic strain to S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2890) AUTHORS Kruckeberg,A.L. and Bisson,L.F. TITLE The HXT2 gene of Saccharomyces cerevisiae is required for high affinity glucose transport JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.L.Kruckeberg, 26-MAR-1990. FEATURES from to/span description pept 818 2443 high affinity hexose transporter-2 BASE COUNT 808 a 571 c 535 g 976 t ORIGIN 1 aaaaagaaat attattcatt actatcaaga taccgtagaa aagaaaaaga accggggatg 61 aataataaca aaacgggctg ctttttcttt ttctctttct ttttcatttg gtccctctcc 121 actctttctc cacgtggctt tgcttcccgt atttttcttc gtcagagaga ctacatgata 181 gtccaaagaa aagaaacagg ggggacgaag aagaggagag gaaaaaccaa aatataattt 241 tccgtgaaat agattctttt tctccactgc acgacttctt ctcctcccac aaaaaatgac 301 gcctcataga cagccccgca gcttcacttt taagtttctt tttctcctca cggcgcaacc 361 gctaacttaa gctaatcctt atgaatccgg agaaaagcgg ggtcttttaa ctcaataaaa 421 ttttccgaaa tcctttttcc tacgcgtttt cttcgggaac tagataggtg gctcttccac 481 ctgtttttcc atcattttag tttttcgcaa gccatgcgtg ccttttcgtt tttgcgatgg 541 cgaacgaggg ctggaaaaat taacggtacg ccgcctaacg atagtaatag gccacgcaac 601 tggcgtggac gacaacaata agtcgcccat tttttatgtt ttcaaaacct agcaaccccc 661 accaaacttg tcatcgttcc cggattcaca aatgatataa aaagcgatta caattctaca 721 ttctaaccag atttgagatt tcctctttct caattcctct tatattagat tataagaaca 781 acaaattaaa ttacaaaaag acttataaag caacataatg tctgaattcg ctactagccg 841 cgttgaaagt ggctctcaac aaacttctat ccactctact ccgatagtgc agaaattaga 901 gacggatgaa tctcctattc aaaccaaatc tgaatacact aacgctgaac tcccagcaaa 961 gccaatcgcc gcatattgga ctgttatctg tttatgtcta atgattgcat ttggtgggtt 1021 tgtctttggt tgggatactg gtaccatctc tggttttgtt aatcaaaccg atttcaaaag 1081 aagatttggt caaatgaaat ctgatggtac ctattatctt tcggacgtcc ggactggttt 1141 gatcgttggt atcttcaata ttggttgtgc ctttggtggg ttaaccttag gacgtctggg 1201 tgatatgtat ggacgtagaa ttggtttgat gtgcgtcgtt ctggtataca tcgttggtat 1261 tgtgattcaa attgcttcta gtgacaaatg gtaccaatat ttcattggta gaattatctc 1321 tggtatgggt gtcggtggta ttgctgtcct atctccaact ttgatttccg aaacagcacc 1381 aaaacacatt agaggtacct gtgtttcttt ctatcagtta atgatcactc taggtatttt 1441 cttaggttac tgtaccaact atggtactaa agactactcc aattcagttc aatggagagt 1501 gcctttgggt ttgaactttg ccttcgctat tttcatgatc gctggtatgc taatggttcc 1561 agaatctcca agattcttag tcgaaaaagg cagatacgaa gacgctaaac gttctttggc 1621 aaaatctaac aaagtcacca ttgaagatcc aagtattgtt gctgaaatgg atacaattat 1681 ggccaacgtt gaaactgaaa gattagccgg taacgcttct tggggtgagt tattctccaa 1741 caaaggtgct attttacctc gtgtgattat gggtattatg attcaatcct tacaacaatt 1801 aactggtaac aattacttct tctattatgg tactactatt ttcaacgccg tcggtatgaa 1861 agattctttc caaacttcca tcgttttagg tatagtcaac ttcgcatcca ctttcgtggc 1921 cttatacact gttgataaat ttggtcgtcg taagtgtcta ttgggtggtt ctgcttccat 1981 ggccatttgt tttgttatct tctctactgt cggtgtcaca agcttatatc caaatggtaa 2041 agatcaacca tcttccaagg ctgccggtaa cgtcatgatt gtctttacct gtttattcat 2101 tttcttcttc gctattagtt gggccccaat tgcctacgtt attgttgccg aatcctatcc 2161 tttgcgtgtc aaaaatcgtg ctatggctat tgctgttggt gccaactgga tttggggttt 2221 cttgattggt ttcttcactc ccttcattac aagtgcaatt ggattttcat acgggtatgt 2281 cttcatgggc tgtttggtat tttcattctt ctacgtgttt ttctttgtct gtgaaaccaa 2341 gggcttaaca ttagaggaag ttaatgaaat gtatgttgaa ggtgtcaaac catggaaatc 2401 tggtagctgg atctcaaaag aaaaaagagt ttccgaggaa taagagatta tacttaaact 2461 agcactgatt tttttaaggc taatggctac taatacttta atagatgatc ttcatacttt 2521 tttatttaac gatttttaat gatgttttta tttgtaccac tcatttatct agattttttt 2581 aatactgatc aaatcttacg gactcgacgt taaaaagttc ctacatacgt ctggtacttg 2641 aaacgctgct tcgaggtatt gacactataa gaatacgatc caaatactta caccgcatgt 2701 aaaaatatgc cgacaatatg aatacttgtt gatgaatgat atttgatttt aatccggcaa 2761 tttacctcct ttatataatc caataattgt tgataattag tggttaggtt gcagtactaa 2821 taagaattaa gacaaatatt cttctactat ataaaaggtg caaacaaaac acacgccgat 2881 cggccatact // LOCUS RATGAH 1003 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Rat L-glutamine amidohydrolase mRNA, 3' end. ACCESSION J05499 KEYWORDS L-glutamine amidohydrolase. SOURCE Rat (strain Sprague-Dawley) liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1003) AUTHORS Smith,E.M. and Watford,M. TITLE Molecular cloning of a cDNA for rat hepatic glutaminase: Sequence similarity to kidney-type glutaminase JOURNAL J. Biol. Chem. 265, 10631-10636 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Waterford, 08-MAY-1990. FEATURES from to/span description pept < 1 660 L-glutamine amidohydrolase (EC 3.5.1.2) BASE COUNT 270 a 243 c 258 g 232 t ORIGIN 1 ttccggatgt gtctgtcacc cccgttagac aagctgggga acagccacag gggcatcagc 61 ttctgccaga agttggtgtc tctgtttaac ttccacaact acgacaacct gcggcactgt 121 gctcggaagt tagacccacg gagggaaggg ggggaagttc ggaacaagac cgtggtgaac 181 ctgttatttg ctgcatatag tggagatgtc tcagctcttc gaaggtttgc cttgtctgcc 241 gtggatatgg agcagaagga ctatgattcc cgcacagccc tacatgtggc ggcagcggaa 301 ggacacattg acgttgtcaa gtttctgatc gaggcttgca aagtgaatcc ttttgtcaag 361 gacaggtggg gcaacattcc cctggatgat gccctgcagt tcaatcacct ggaggtggtc 421 aaactgcttc aggattacca tgactcctac atgctgtctg agactcaagc tgaggtacag 481 ctgagactct gtcaaaagag aactgagaga gcatgtgtga gcacaggcca gggcagcccg 541 tgctcaagaa aaagcatgag cgggccacaa tttaacccaa ggccaccaaa aatactattg 601 caagctgctt cagtgggatc aacacagcca tctggtgaca caggccagtg ttttctgtga 661 gaatcaaaat gccccattcc ctcatcggac agcacagaga aaagcttcag tggacacctg 721 agcagagcta gccacggaga cctcaaggta tagcttaagt gacatcctcc accagaaagt 781 agcccaggct tttacccagg tccccatttc aacttccttg gagagcgtct agctacatgc 841 atatgtatct gtcacagagc aagagaggtg ggtgagagcc caatcacctg gctttagaaa 901 tctgcagaga tctgtccatc ttagccaaga catgctgcta ctgctgacag gagttttata 961 gacaaagtat tttgtgttca aataaacttt aattaccgga att // LOCUS CEACAEVA 264 bp ds-DNA VRL 11-JUL-1990 DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene, complete cds. ACCESSION M34092 KEYWORDS tat protein. SOURCE Caprine arthritis-encephalitis lentivirus (strain Cork) DNA, from goat synovial membrane, clone pCol.9. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis- encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 08-MAY-1990. Author address: W.P.Cheevers Washington State University Dept Veterinary Microbiology Pullman, WA 94164-7040 email: b384@wsuvmsl.bitnet FEATURES from to/span description pept 1 264 tat protein BASE COUNT 97 a 40 c 80 g 47 t ORIGIN 1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta 61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg 121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg 181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta 241 tgtaacccgg ggtggggaac ataa // LOCUS CEACAEVB 264 bp ds-DNA VRL 11-JUL-1990 DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene, complete cds. ACCESSION M34093 KEYWORDS tat protein. SOURCE Caprine arthritis-encephalitis lentivirus (strain G63) DNA, from goat synovial membrane, clone pC63-49. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis- encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 08-MAY-1990. Author address: W.P.Cheevers Washington State University Dept Veterinary Microbiology Pullman, WA 94164-7040 email: b384@wsuvmsl.bitnet FEATURES from to/span description pept 1 264 tat protein BASE COUNT 99 a 43 c 75 g 47 t ORIGIN 1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata 61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg 121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg 181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta 241 tgtaacccgg ggtgggaaac ataa // LOCUS FLAPR834HA 540 bp ss-RNA VRL 11-JUL-1990 DEFINITION Influenza A/PR/8/34, hemagglutinin (seg 4) gene. partial cds. ACCESSION M34335 KEYWORDS glycoprotein; hemagglutinin. SOURCE Influenza A/PR/8/34 RNA, passed in bovine MBDK cells, originally from human. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 540) AUTHORS Bressoud,A., Whitcomb,J., Pourzand,C., Haller,O. and Cerutti,P. TITLE Rapid detection of influenza virus H1 by the polymerase chain reaction JOURNAL Biochem. Biophys. Res. Commun. 167, 425-430 (1990) STANDARD simple staff_review COMMENT Sequence reported is + strand. FEATURES from to/span description pept < 1 > 540 hemagglutinin (AA at 2) BASE COUNT 179 a 121 c 123 g 117 t ORIGIN 1 cctactggtc ctgttatgtg cacttgcagc tgcagatgca gacacaatat gtataggcta 61 ccatgcgaac aattcaaccg acactgttga cacagtactc gagaagaatg tgacagtgac 121 acactctgtt aacctgctcg aagacagcca caacggaaaa ctatgtagat taaaaggaat 181 agccccacta caattgggga aatgtaacat cgccggatgg ctcttgggaa acccagaatg 241 cgacccactg cttccagtga gatcatggtc ctacattgta gaaacaccaa actctgagaa 301 tggaatatgt tatccaggag atttcatcga ctatgaggag ctgagggagc aattgagctc 361 agtgtcatca ttcgaaagat tcgaaatatt tcccaaagaa agctcatggc ccaaccacaa 421 cacaaacgga gtaacggcag catgctccca tgaggggaaa agcagttttt acagaaattt 481 gctatggctg acggagaagg agggctcata cccaaagctg aaaaattctt atgtgaacaa // LOCUS HUMGPPSBAA 355 bp ds-DNA PRI 11-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein C-D gene, intron C1. ACCESSION M34422 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, clone PS-beta-G C. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 355) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description IVS 1 355 pregnancy-specific beta-1 glycoprotein intron C1 BASE COUNT 110 a 64 c 62 g 119 t ORIGIN Chromosome 19. 1 gtaagtggat cccagcatcg ttggcaatag ggttttaggt ggagtctatc tggcattcag 61 agaagagtca ggaaaacaat tgtattccca gcctgtgtcc catgggcaca agcaaatccc 121 aaattctcct cctgaaccct ccaaatttgt ctaagaactt cgaaaacttt aacaaacagg 181 ctgatatctt cataatattc ccagcctaga ccaagcagga agaacattga tttcattgaa 241 ataattgata ataatgaaga taatgttttt atgattttta tttgaaaatt tgctgattct 301 ttaaatggtt tgttttctac attgatggaa tttttctctt ttaatctatc tacag // LOCUS HUMGPPSBD 1418 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M34421 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1418) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE Structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept 79 1359 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 79 180 pregnancy-specific beta-1 glycoprotein signal peptide matp 181 1356 pregnancy-specific beta-1 glycoprotein mRNA 1 1418 pregnancy-specific beta-1 glycoprotein mRNA BASE COUNT 408 a 398 c 296 g 316 t ORIGIN Chromosome 19. 1 cagctgacag ccgtgctcag acagcttctg gatcctaggc tcatctccac agaggagaac 61 acgcaggcag cagagaccat ggggcccctc ccagcccctt cctgcacaca gcgcatcacc 121 tggaaggggc tcctgctcac agcatcactt ttaaacttct ggaacccgcc caccactgcc 181 gaagtcacga ttgaagccca gccacccaaa gtttctgagg ggaaggatgt tcttctactt 241 gtccacaatt tgccccagaa tcttcctggc tacttctggt acaaagggga aatgacggac 301 ctctaccatt acattatatc gtatatagtt gatggtaaaa taattatata tgggcctgca 361 tacagtggaa gagaaacagt atattccaac gcatccctgc tgatccagaa tgtcacccgg 421 aaggatgcag gaacctacac cttacacatc ataaagcgag gtgatgagac tagagaagaa 481 attcgacatt tcaccttcac cttatacttg gagactccca agccctacat ctccagcagc 541 aacttaaacc ccagggaggc catggaggct gtgcgcttaa tctgtgatcc tgagactctg 601 gacgcaagct acctatggtg gatgaatggt cagagcctcc ctgtgactca caggttgcag 661 ctgtccaaaa ccaacaggac cctctatcta tttggtgtca caaagtatat tgcaggaccc 721 tatgaatgtg aaatacggaa cccagtgagt gccagtcgca gtgacccagt caccctgaat 781 ctcctcccga agctgcccat cccctacatc accatcaaca acttaaaccc cagggagaat 841 aaggatgtct tagccttcac ctgtgaacct aagagtgaga actacaccta catttggtgg 901 ctaaacggtc agagcctccc cgtcagtccc ggggtaaagc gacccattga aaacaggata 961 ctcattctac ccagtgtcac gagaaatgaa acaggaccct atcaatgtga aatacgggac 1021 cgatatggtg gcctccgcag taacccagtc atcctaaatg tcctctatgg tccagacctc 1081 cccagaattt acccttcatt cacctattac cgttcaggag aaaacctcga cttgtcctgc 1141 ttcacggaat ctaacccacc ggcagagtat ttttggacaa ttaatgggaa gtttcagcaa 1201 tcaggacaaa agctctttat cccccaaatt actagaaatc atagcgggct ctatgcttgc 1261 tctgttcata actcagccac tggcaaggaa atctccaaat ccatgacagt caaagtctct 1321 ggtccctgcc atggagacct gacagagtct cagtcatgac tgcaacaact gagacactga 1381 gaaaaagaac aggctgatac cttcatgaaa ttcaagac // LOCUS HUMGPPSBE 1856 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M34420 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1856) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE Structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept 82 1368 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 82 183 pregnancy-specific beta-1 glycoprotein signal peptide matp 184 1365 pregnancy-specific beta-1 glycoprotein mRNA 1 1856 pregnancy-specific beta-1 glycoprotein mRNA BASE COUNT 544 a 476 c 364 g 472 t ORIGIN Chromosome 19. 1 gcacagctga gagccatgct caggaagttt ctggatccta ggctcagctc cacagaggag 61 aacacgcagg cagcagagac catggggccc ctctcagccc ctccctgcac acagcgcatc 121 acctggaagg ggctcctgct cacagcatca cttttaaact tctggaaccc gcctaccact 181 gcccaagtca cgattgaagc cgagccaacc aaagtttcca aggggaagga cgttcttcta 241 cttgtccaca atttgcccca gaatcttgct ggctacatct ggtacaaagg gcaaatgaag 301 gacctctacc attacattac atcatacgta gtagatggtc aaataattat atatgggcct 361 gcatacagtg gacgagaaac agtatattcc aatgcatccc tgctgatcca gaatgtcacc 421 cgggaggacg caggatccta caccttacac atcgtaaagc gaggtgatgg gactagagga 481 gaaactggac atttcacctt caccttatac ctggagactc ccaagccctc catctccagc 541 agcaacttat accccaggga ggacatggag gctgtgagct taacctgtga tcctgagact 601 ccggacgcaa gctacctgtg gtggatgaat ggtcagagcc tccctatgac tcacagcttg 661 cagttgtcca aaaacaaaag gaccctcttt ctatttggtg tcacaaagta cactgcagga 721 ccctatgaat gtgaaatacg gaacccagtg agtgccagcc gcagtgaccc agtcaccctg 781 aatctcctcc cgaagctgcc caagccctac atcaccatca acaacttaaa ccccagggag 841 aataaggatg tcttagcctt cacctgtgaa cctaagagtg agaactacac ctacatttgg 901 tggctaaatg gtcagagcct cccggtcagt cccagggtaa agcgacccat tgaaaacagg 961 atcctcattc tacccagtgt cacgagaaat gaaacaggac cctatcaatg tgaaatacag 1021 gaccgatatg gtggcatccg cagttaccca gtcaccctga atgtcctcta tggtccagac 1081 ctccccagaa tttacccttc attcacctat taccattcag gagaaaacct ctacttgtcc 1141 tgcttcgcgg actctaaccc accagcagaa tattcttgga caattaatgg gaagtttcag 1201 ctatcaggac aaaagctctt tatcccccag attactacaa agcatagcgg gctctatgct 1261 tgctctgttc gtaactcagc cactggcatg gaaagctcca aatccatgac agtcaaagtc 1321 tctgctcctt caggaacagg acatcttcct ggccttaatc cattatagca gccgtgatgt 1381 catttctgta tttcaggaag actggcagac agttgctttc attcttcctc aaagtattta 1441 ccatcagcta cagtccaaaa ttgctttttg ttcaaggaga tttatgaaaa gactctgaca 1501 aggactcttg aatacaagtt cctgataact tcaagatcat accactggac taagaacttt 1561 caaaatttta atgaacaggc tgatacttca tgaaattcaa gacaaagaaa aaaacccaat 1621 tttattggac taaatagtca aaacaatgtt ttcataattt tctatttgaa aatgtgctga 1681 ttctttgaat gttttattct ccagatttat gcactttttt tcttcagcaa ttggtaaagt 1741 atacttttgt aaacaaaaat tgaaacattt gcttttgctc cctaagtgcc ccagaattgg 1801 gaaactattc aggagtattc atatgtttat ggtaataaag ttatctgcac aagttc // LOCUS HUMGPPSBF 2004 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M23575 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone pSP1-i. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2004) AUTHORS Rooney,B.C., Horne,C.H.W. and Hardman,N. TITLE Molecular cloning of a cDNA for human pregnancy-specific beta-1-glycoprotein: Homology with human carcinoembryonic antigen and related proteins JOURNAL Gene 71, 439-449 (1988) STANDARD simple staff_review FEATURES from to/span description pept 232 1518 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 232 333 pregnancy-specific beta-1 glycoprotein signal peptide matp 334 1515 pregnancy-specific beta-1 glycoprotein BASE COUNT 579 a 510 c 406 g 509 t ORIGIN 1 gggcgggaca actggtctga gtactatggc tgattttcgc tgtctggcat tgagaagcca 61 cacgcccctt ttgcttagga ggcctctctg ctggaggatg acgatggcat ggtttatcta 121 aggccactga caagtcatca atataggaca gcacagctga gagccatgct caggaagttt 181 ctggatccta ggctcagctc cacagaggag aacacgcagg cagcagagac catggggccc 241 ctctcagccc ctccctgcac acagcgcatc acctggaagg ggctcctgct cacagcatca 301 cttttaaact tctggaaccc gcctaccact gcccaagtca cgattgaagc cgagccaacc 361 aaagtttcca aggggaagga cgttcttcta cttgtccaca atttgcccca gaatcttgct 421 ggctacatct ggtacaaagg gcaaatgaag gacctctacc attacattac atcatacgta 481 gtagatggtc aaataattat atatgggcct gcatacagtg gacgagaaac agtatattcc 541 aatgcatccc tgctgatcca gaatgtcacc cgggaggacg caggatccta caccttacac 601 atcgtaaagc gaggtgatgg gactagagga gaaactggac atttcacctt caccttatac 661 ctggagactc ccaagccctc catctccagc agcaacttat accccaggga ggacatggag 721 gctgtgagct taacctgtga tcctgagact ccggacgcaa gctacctgtg gtggatgaat 781 ggtcagagcc tccctatgac tcacagcttg cagttgtcca aaaacaaaag gaccctcttt 841 ctatttggtg tcacaaagta cactgcagga ccctatgaat gtgaaatacg gaacccagtg 901 agtgccagcc gcagtgaccc agtcaccctg aatctcctcc cgaagctgcc caagccctac 961 atcaccatca acaacttaaa ccccagggag aataaggatg tcttagcctt cacctgtgaa 1021 cctaagagtg agaactacac ctacatttgg tggctaaatg gtcagagcct cccggtcagt 1081 cccagggtaa agcgacccat tgaaaacagg atcctcattc tacccagtgt cacgagaaat 1141 gaaacaggac cctatcaatg tgaaatacag gaccgatatg gtggcatccg cagttaccca 1201 gtcaccctga atgtcctcta tggtccagac ctccccagaa tttacccttc attcacctat 1261 taccattcag gagaaaacct ctacttgtcc tgcttcgcgg actctaaccc accagcagaa 1321 tattcttgga caattaatgg gaagtttcag ctatcaggac aaaagctctt tatcccccag 1381 attactacaa agcatagcgg gctctatgct tgctctgttc gtaactcagc cactggcatg 1441 gaaagctcca aatccatgac agtcaaagtc tctgctcctt caggaacagg acatcttcct 1501 ggccttaatc cattatagca gccgtgatgt catttctgta tttcaggaag actggcagac 1561 agttgctttc attcttcctc aaagtattta ccatcagcta cagtccaaaa ttgctttttg 1621 ttcaaggaga tttatgaaaa gactctgaca aggactcttg aatacaagtt cctgataact 1681 tcaagatcat acatggacta agaactttca aaattttaat gaacaggctg atacttcatg 1741 aaattcaaga caaagaaaaa aacccaattt tattggacta aatagtcaaa acaatgtttt 1801 cataattttc tatttgaaaa tgtgctgatt ctttgaatgt tttattctcc agatttatgc 1861 actttttttc ttcagcaatt ggtaaagtat acttttgtaa acaaaaattg aaacatttgc 1921 ttttgctccc taagtgcccc agaattggga aactattcag gagtattcat atgtttatgg 1981 taataaagtt atctgcacaa accc // LOCUS HUMLEUELA 920 bp ss-mRNA PRI 11-JUL-1990 DEFINITION Human elastase/medullasin mRNA, complete cds. ACCESSION M34379 KEYWORDS elastase; medullasin. SOURCE Human leukemic cell line ML3, cDNA to mRNA, clone pSRHLE. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 920) AUTHORS Okano,K., Aoki,Y., Shimizu,H. and Naruto,M. TITLE Functional expression of human leukocyte elastase (HLE)/medullasin in eukaryotic cells JOURNAL Biochem. Biophys. Res. Commun. 167, 1326-1332 (1990) STANDARD full staff_review FEATURES from to/span description pept 39 842 elastase/medullasin precursor (EC 3.4.21.37) sigp 39 119 elastase/medullasin signal peptide matp 126 839 elastase/medullasin BASE COUNT 141 a 329 c 287 g 163 t ORIGIN 1 gcacggaggg gcagagaccc cggagcccca gccccaccat gaccctcggc cgccgactcg 61 cgtgtctttt cctcgcctgt gtcctgccgg ccttgctgct ggggggcacc gcgctggcct 121 cggagattgt ggggggccgg cgagcgcggc cccacgcgtg gcccttcatg gtgtccctgc 181 agctgcgcgg aggccacttc tgcggcgcca ccctgattgc gcccaacttc gtcatgtcgg 241 ccgcgcactg cgtggcgaat gtaaacgtcc gcgcggtgcg ggtggtcctg ggagcccata 301 acctctcgcg gcgggagccc acccggcagg tgttcgccgt gcagcgcatc ttcgaaaacg 361 gctacgaccc cgtaaacttg ctcaacgaca tcgtgattct ccagctcaac gggtcggcca 421 ccatcaacgc caacgtgcag gtggcccagc tgccggctca gggacgccgc ctgggcaacg 481 gggtgcagtg cctggccatg ggctggggcc ttctgggcag gaaccgtggg atcgccagcg 541 tcctgcagga gctcaacgtg acggtggtga cgtccctctg ccgtcgcagc aacgtctgca 601 ctctcgtgag gggccggcag gccggcgtct gtttcgggga ctccggcagc cccttggtct 661 gcaacgggct aatccacgga attgcctcct tcgtccgggg aggctgcgcc tcagggctct 721 accccgatgc ctttgccccg gtggcacagt ttgtaaactg gatcgactct atcatccaac 781 gctccgagga caacccctgt ccccaccccc gggacccgga cccggccagc aggacccact 841 gagaagggct gcccgggtca cctcagctgc ccacacccac actctccagc atctggcaca 901 ataaacattc tctgttttgt // LOCUS MSGIS6110 1360 bp ds-DNA BCT 11-JUL-1990 DEFINITION M.tuberculosis-50 complex IS6110 insertion sequence-like element. ACCESSION M29899 KEYWORDS insertion sequence. SOURCE M.tuberculosis (strain H37RV) DNA (cosmid library pHC79), clone I21. ORGANISM Mycobacterium tuberculosis Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae. REFERENCE 1 (bases 1 to 1360) AUTHORS Thierry,D., Cave,M.D., Eisenach,K.D., Crawford,J.T., Bates,J.H., Gicquel,B. and Guesdon,J.L. TITLE IS6110 an IS-like element of Mycobacterium tuberculosis-50 complex JOURNAL Nucleic Acids Res. 18, 188-188 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Thierry 15-NOV-1989. BASE COUNT 269 a 439 c 432 g 220 t ORIGIN 1 cgatgaaccg ccccggcatg tccggagact ccagttcttg gaaaggatgg ggtcatgtca 61 ggtggttcat cgaggaggta cccgccggag ctgcgtgagc gggcggtgcg gatggtcgca 121 gagatccgcg gtcagcacga ttcggagtgg gcagcgatca gtgaggtcgc ccgtctactt 181 ggtgttggct gcgcggagac ggtgcgtaag tgggtgcgcc aggcgcaggt cgatgccggc 241 gcacggcccg ggaccacgac cgaagaatcc gctgagctga agcgcttagc ggcgggacaa 301 cgccgaattg cgaagggcga acgcgatttt aaagaccgcg tcggctttct tcgcggccga 361 gctcgaccgg ccagcacgct aattaacggt tcatcgccga tcatcagggc caccgcgagg 421 gccccgatgg tttgcggtgg ggtgtcgagt cgatctgcac acagctgacc gagctgggtg 481 tgccgatcgc cccatcgacc tactacgacc acatcaaccg ggagcccagc cgccgcgagc 541 tgcgcgatgg cgaactcaag gagcacatca gccgcgtcca cgccgccaac tacggtgttt 601 acggtgcccg caaagtgtgg ctaaccctga accgtgaggg catcgaggtg gccagatgca 661 ccgtcgaacg gctgatgacc aaactcggcc tgtccgggac cacccgcggc aaagcccgca 721 ggaccacgat cgctgatccg gccacagccc gtcccgccga tctcgtccag cgccgcttcg 781 gaccaccagc acctaaccgg ctgtgggtag cagacctcac ctatgtgtcg acctgggcag 841 ggttcgccta cgtggccttt gtcaccgacg cctacgtcgc aggatcctgg gctggcgggt 901 cgcttccacg atggccacct ccatggtcct cgacgcgatc gagcaagcca tctggacccg 961 ccaacaagaa ggcgtactcg acctgaaaga cgttatccac catacggata ggggatctca 1021 gtacacatcg atccggttca gcgagcggct cgccgaggca ggcatccaac cgtcggtcgg 1081 agcggtcgga agctcctatg acaatgcact agccgagacg atcaacggcc tatacaagac 1141 cgagctgatc aaacccggca agccctggcg gtccatcgag gatgtcgagt tggccaccgc 1201 gcgctgggtc gactggttca accatcgccg cctctaccag tactgcggcg acgtcccgcc 1261 ggtcgaactc gaggctgcct actacgctca acgccagaga ccagccgccg gctgaggtct 1321 cagatcagag agtctccgga ctcaccgggg cggttcacga // LOCUS MUSIGHAAT 348 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig J558 family active H-chain mRNA V-J3 region from hybridoma CE5, partial cds. ACCESSION M34119 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene. SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 348 Ig H-chain V-J3 region (AA at 1) recomb 294 295 J-region end/D-region start recomb 303 304 D-region end/J-region start BASE COUNT 89 a 82 c 97 g 80 t ORIGIN Chromosome 12. 1 caggttcagc tgcagcagtc tggagctgaa ctgatgaagc ctggggcctc agtgaagata 61 tcctgcaagg ctactggcta cacattcagt aagtactgga tagagtgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagag attttacctg gaagtggtag tactaaccat 181 gatgagaagt tcaagggcaa ggccacattc actgcagata catcctccaa cacagcctac 241 atgcaactca gcagcctgac atctgaggac tctgccgtct attactgtgc aagagacggt 301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca // LOCUS MUSIGKCSR 321 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma GA733, partial cds. ACCESSION M34120 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma GA733, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain (AA at 1) BASE COUNT 87 a 83 c 74 g 77 t ORIGIN Chromosome 6. 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagtgtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatccgacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcaccaa cattatatta ctcctcggac gttcggtgga 301 ggcaccaaac tggaaatcaa a // LOCUS MUSIGKCSS 321 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma C017-1A, partial cds. ACCESSION M34121 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma C017-1A, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain (AA at 1) BASE COUNT 89 a 77 c 78 g 77 t ORIGIN Chromosome 6. 1 aacattgtaa tgacccaatc tcccaaatcc atgtccatgt cagtaggaga gagggtcacc 61 ttgacctgca aggccagtga gaatgtggtt acttatgttt cctggtatca acagaaacca 121 gagcaatctc ctaaactctt gatttacggg gcctccaacc ggtacactgg ggtccccgat 181 cgcttcacag gtagtggatc tgcaacagat ttcactctga ccattagtag tgtgcaagct 241 gaagaccttg cagattatca ctgtggacag ggttacagct atccgtacac gttcggaggg 301 gggaccaagc tggaaataaa a // LOCUS MUSIGKCST 318 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma CE5, partial cds. ACCESSION M34122 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 318 Ig kappa-chain (AA at 1) BASE COUNT 84 a 78 c 80 g 76 t ORIGIN Chromosome 6. 1 gacattgtga tgacccagtc tcagaaattc atgtccacat cagtaggaga cagggtcggc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaatca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgag 181 cgcttcgctg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcatcaa cattatagta ctcggacgtt cggtggaggc 301 accaagctgg aaatcaaa // LOCUS PSERRSAA 1517 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.aeruginosa 16S ribosomal RNA. ACCESSION M34133 KEYWORDS 16S ribosomal RNA; ribosomal RNA; small subunit ribosomal RNA. SOURCE P.aeruginosa (strain 25330) ribosomal RNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1517) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept of Microbiology 407 S. Goodwin Avenue Urbana, IL 61801 email: carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1517 16S rRNA BASE COUNT 379 a 334 c 470 g 301 t 33 others ORIGIN 1 ntactgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 61 gtcgagcgga tgaagggagc ttgctcctgg attcagcggc ggacgggtga gtaatgccta 121 ggaatctgcc tgatagtggg ggataacgtc cggaaacggg cgctaatacc gcatacgtcc 181 tgagggagaa agggggggat cctcggacct cacgctatcn gatgagccta ggtcggatta 241 gctagttggt ggggtaaagg cctaccaagg cgacgatccg taactggtct gagaggacga 301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaata 361 ttggacaatg ggcgaaagcc ngatccagcc atgccgcgtg tgtgaagaag gtcttcggat 421 tgtaaagcac tttaagttgg gaggaagggc agtaagttaa taccttgctg ttttgacgtt 481 accaacagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac gaagggtgcg 541 agcgttaatc ggaattactg ggcgtaaagc gcgcgtaggt ggttcagcaa gttggatgtg 601 aaatccccgg gctcaacctg ggaactgcat ccnaaactac tgagctagag tacggtagag 661 ggtggtggaa tttcctgtgt agcggtgaaa tgcgtagata taggaaggaa caccagtggc 721 gaaggcgacc acctggactg atactgacac tgaggtgcga aagcgtgggg agcaaacagg 781 attagatacc ctggtagtcc acgccgtaaa cgatgtcgac tagccgttgg gatccttgag 841 atcttagtgg cgcagctaac gcgataagtc gaccgcctgg ggagtacggc cgcaaggtta 901 aaactcaaat gaattgacgg gggcnngcac aagcggtgga gcatgtggtt taattcgaag 961 caacgcgaag aaccttacct ggccttgaca tgctgagaac tttccagaga tggattggtg 1021 ccttcgggaa ctcagacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt 1081 gggttaagtc ccgtaacgag cgcaaccctt gtccttagtt accagcacct cgggtgggca 1141 ctctaaggag actgccggtg acaaaccgga ggaaggtggg gatgacgtca agtcatcatg 1201 gcccttacgg cnagggctac acacgtgcta caatggtcgg tacaaagggt tgcgaagccg 1261 cgaggtggag ctaatcccat aaaaccgatc gtagtccgga tcgcagtctg caactcgact 1321 gcgtgaagtc ggaatcgcta gtaatcgtga atcagaatgt cacggtgaat acgttcccgg 1381 gccttgtaca caccgcccgt cacaccatgg gagtgggttg ctccagaagt agctagtcta 1441 accgcaaggg ggacggttac cacggagtga ttcatgnnnn nnnnnnnnnn gtaacaagnn 1501 nnnnnnnnnn gaacctg // LOCUS RATNESTIN 5946 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Rat nestin mRNA, complete cds. ACCESSION M34384 KEYWORDS intermediate filament protein; nestin. SOURCE Rat (strain E15) embryo central nervous system, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5946) AUTHORS Lendahl,U., Zimmerman,L.B. and McKay,R.D.G. TITLE CNS stem cells express a new class of intermediate filament protein JOURNAL Cell 60, 585-595 (1990) STANDARD simple staff_review FEATURES from to/span description pept 128 5545 nestin (128 could be 161) mRNA 1 5495 nestin mRNA BASE COUNT 1600 a 1401 c 1871 g 1074 t ORIGIN 1 tgctggagtt ctccgcttcc gctgggtcac tgtcgccgct acttcttttc aacccctaaa 61 agctccacgg gccactccct tctctagtgc tccacgtccg cttgccctcg ggggccagac 121 cagcgacatg gagggttgcg tcggggaaga atcttttcag atgtgggagc tcaatcgacg 181 cctggaggcc tacctgaccc gggtcaagac gctagaagag caaaaccagc tgctcagcgc 241 cgagcttggg ggactccggg cgcagtccgg agacacctcc tggagagccc gagccgatga 301 cgagctggca tccctgcgga tcctcgtcga tcagcgctgg cgggagaagc tcgaggctga 361 ggtgcagcgc gacaaccttg cggaagagct ggagagcgtg gcgggccggt gccagcaggt 421 gcggcttgct cgggagcgga ccgtccagga ggccgcctgc agccggcgcg cactcgaggc 481 ggagaagaat gcgcggggct ggctgagcac ccaggcggcc gagctggagc gggagttaga 541 ggctctgcga gccgcgcacg aggaggagcg cgcacacctg aacgcccagg ccgcctgtgc 601 gcctcgccgg ccccccgcac cgccccaccg gatccccggt ccggcccccg aagtcgagga 661 tctggccagg cgactaggcg aagtgtggcg cggggcggtg cgtgactacc aggagcgcgt 721 ggctcacatg gaaagctcgc tgggtcaggc acgcgagcgg ctgagccaag ccgtgcgggg 781 cgctcgggag tgtcgcttag aggtgcaaca gctgcaggct gatcgcgaca gcctccagga 841 gcgcagagaa gcgctggaac agagattgga aggccgctgg caggaccggc tgcaggccac 901 tgataagttc cagctggctg tggaagccct ggagcaggag aagcaaggtc tacagagtca 961 gatcgctcag atcctggaag gtgggcagca actggcacac ctcaagatgt cccttagtct 1021 ggaggtggct acatacagga ctctgctgga ggctgagaac tctcggttgc agacacctgg 1081 acgaggttcc caggcttctc ttggctttct ggaccccaag ctgaagccga atttccttgg 1141 gataccagag gaccagtacc tgggatctgt gctccctgcc ctcagcccca catccttccc 1201 ttcccccttg cctaataccc ttgagactcc tgtgacagcc ttcctgaaga ctcaggagtt 1261 ccttcaggcc agaaccccca ccttggccag cactcccatc ccacctatat ctgaggctcc 1321 ctgtcctcca aatgcagagg tgagagccca ggaggtccct ctttctctgc tccagacaca 1381 ggctccagag cccctttggc tgaaggccac agtgcctagt tcttctgcta tcctcccaga 1441 actagaggaa cctgggggca agcagcaggg tcacttccct gatgatctga cctccttagc 1501 cacaaacctc aaccctcacc accctacttt agaggctaaa gatggagaat ccagtgagtc 1561 tagagtttct agcatattcc aggaagatga ggggcaaatc tgggaactgg tagagaaaga 1621 agcagatata gaggtaaaag tagaaaacag ctcagcccag aaaacacaag aaagtggtct 1681 ggacacagaa gaaacccagg attcccaggg acctttgcag aaggaaacac tgaaggctct 1741 aggagaggag ccactgatgt ctctgaaaat ccagaactat gagacagcag ggaaagagaa 1801 ttgcaattct tctacagaag gccacctggg aacactagaa ggcccagaaa aagaaaagca 1861 aataccacta aagtctttag aagaaaagaa tgtagagtca gagaaaactc tagaaaatgg 1921 ggttcctgta ctatctgagc ttttaggaaa agaagacaca agaacagagg atcaagaatt 1981 aatgtctcct aaaggtacac taaagagatt ttcatctcta ggaaaggaaa gtcaagaagt 2041 agtgaggcct tcaaaagagg ggaacctaga atcatggaca gcttttaaag aggagagcca 2101 acacccactg ggatttccag gagctgagga ccagatgctt gagagactgg tagagaaaga 2161 ggatcagagc ttcccaaggt ctccagagga agaggaccag gaggcatgta gacctctgca 2221 gaaagagaat caggaaccac tagggtatga agaagcagag ggccagatac ttgagagact 2281 gatagaaaaa gagagtcagg agtccctgag gtctccagaa gaagaggacc aggaggcagg 2341 tagatctctg cagaaagaga atcaggagcc actagggtat gaagaagcag aggaccagat 2401 gcttgagaga ctgatagaaa aagagagtca ggagtccctg aagtctccag aagaaaacca 2461 gaggattggg aagcctctag aaagagagaa tcagaaatct ctgaggtatc ttgaagaaaa 2521 ccaggagact tttgtaccac tagaaagcag gaaccagagg ccactgagat ctctagaagt 2581 agaagaggag gagcagagaa ttgtgaaacc tctagaaaaa gtgagtcagg attccctcgg 2641 atctctagca gaagagaatg tgcagccact gaggtatctg gaagaagatg actgcataaa 2701 taagagcctt ctagaagaca agactcacaa gtccttgggg tctcttgaag atagaaatgg 2761 ggatagcatt attataccac aagaaagtga gacccaggtt tcattgaggc ctccagaaga 2821 ggaggaccag aggattgtga accatctaga aaaagaaagt caggagttct cgaggtcttc 2881 agaagaagaa gagcaggtga tggagagatc tctagaagga gagaaccatg aatcactgag 2941 ttctgtagaa aaagaggacc agatggttga gagccaacta gagaaagaga gtcaggactc 3001 agggaagtct cttgaagatg agagccagga gacctttgga cctctggaaa aagagaatgc 3061 agagtccctg agatctctag caggacagga ccaagaggaa cagaagcttg aacaagagac 3121 ccaacaaaca ctgagggctg tagggaatga gcagatggca gtgagcccac cagaaaaggt 3181 ggatccagag ttaccgaagc ctcttggaaa tgaccaggaa atagctagat ctcttggaaa 3241 agagaatcaa gagtcactag tgtcactgaa agaaaaaggt atagagacag tgaagtcttt 3301 agaaacagag atcatagaac cactggagac tgcagaagag gacctggaaa gaaggaagtc 3361 tatagatact caggagccat tgtggtctac tgaagtggct agagagacag tagaacctcc 3421 agaagatgag cccccaggat cgctagggtc tgtggatgag aaccgagaga cactgacatc 3481 ccttgaaaag gagagtcaag aactgagctc tctgggcaag tggaacgtag agaccagggt 3541 agaggacagt cagcagtgcc tgcaagtaga agagggtctg caggaggaac agcaccaaga 3601 gtctctgaga gaggtgaagc aggagctgcc tagctctgga aatcaacagc ggtgggagga 3661 tgtggtggag ggcaaagcag tgggtcagga agcacctctg gcaaccacag gagtgggaac 3721 tgaggataag gcagagttgc atctgagggg gcaaggtgga gaggaagaag ctgcagcaga 3781 gggagagctg ttgcaggata ttgtggggga ggcctggagt ctggggagct ctgagcccaa 3841 ggagcagagg gtccctgctg aggccctcga caacctggaa ggaggggcct tagaggtccc 3901 agttgctcag tcaatgccag aggtgacaga gcgagatgag gatagagccc aagcaggtga 3961 acaagactcc atagaggtga cccttgggtt agaggctgcc agaactggac tggaactcga 4021 gcaggaagtg gtagggctag aggacccaag gcattttgcc agggaggagg ccattccccc 4081 atccctgggg gaggaaagtg tgaaggcaaa gatagctcag ggcttggaag ggcctggaaa 4141 ggaaccaaaa gaggcaggtg ctctggactc ggggatcctt gaattgccca agactagcag 4201 cgaggctctg gaatgccagg gccatgaaga gtctgagtcc atggagggct gggaagaaga 4261 ggaggcctca ctggagactt cagatcatga gggcagtgat gcccctcagc ccaggccccc 4321 agaaacagaa gaagatgagg gtgcacaggc agcactgaca gcccctggtc ccaagctctt 4381 ggaaccctgt tcacccatcc caatcctgac agatgcccat gagctgcagc cccaggctga 4441 ggggatccag gaggctggct ggcagccaga agctgggtct gaagcactag aaagggtaga 4501 aaatgagcca gagtttggtc ttggggagat cccggagggc ctccaggatt gggaagaggg 4561 cagagaagaa agcgaggcag atgatctagg ggaaactctc cctgactcta ctcccctggg 4621 cctctacctg aggtcccctg cttctccaaa gtgggatctg gctggagaac agaggctttc 4681 ccctcaaggg gatgccggga aggaagactg gggtcctgct gtccccgctg cccagggcct 4741 cagtggtcca ccggaagagg aggaggagca aggccatggc tctgacctat catctgagga 4801 gtttgaggac ctagggactg aggcctctct tcttccaggg gttcccaagg aggtggcaga 4861 tcacgtgggc caagtgcccc cggtactgca gcctgcatgc tgggatcagg gtggggaatc 4921 tgatgggttt gctgatgagg aagaaagtgg ggaggaggga gaggaagaag atgctgatga 4981 ggaaggagca gagtcaggag ctcagtggtg ggggtcaggg gcctctggtg gaggctgcaa 5041 ggtccaggat attgcccaaa gaggagaccc ggtacaggag tctgtgggtg tcagtggtct 5101 ctgggatgat ggcttgagag gtgctgcagc taatgttcct gccctagaga tggtatctca 5161 ggacagtgct gagccttctg ggtcagagga gtctgagtct gcttccttgg agggggagga 5221 aggtcaagtg actgaccatt tagatgctcc ccaggaggtg accagcatgg tcccgggggt 5281 aggagatgcc tttgacattg gtggccagag ccccaacttg gactcagaac aagtgaatgg 5341 gaaaatggag aatggactag aacaggctga ggggcaggtg gtcctggatg gggacgagga 5401 tcaagaactc ctattacagg gacaggaggt gggtgctcta aaggttcctt tggtagcatc 5461 tcctgtgcat ctaggcccaa gccagcccct gaagttcact ctgagtgggg tagatgggga 5521 ttcctggtcc tcaggggaag actagaaact gcccctctgg ctctgaggat gtactggtgg 5581 ggatgtccct ccctgctctg ggtgaccact cttagctttg ataacttgac ccatggtatt 5641 tgtcctggag agttgtggct gggctgagca agggaggtga gatcctcctg aaggctcagg 5701 agttccaggc ctatagttct accccctctt tcttctgtgg ctcacctgct ggaagaggcc 5761 tgggcccaga gctttcccac aaggctgttc tggccacagc ttgctagcct tgcctaccac 5821 ctgcacaagg tctggtctgg tgtatgacca ggggagctga gggcagcatt tatctgaccc 5881 ttcatctcag cctgctgaga gcttgttcct ctcttcctcc ctgaataaag ccgtatccct 5941 acctac // LOCUS SYNCMPA 1885 bp ds-DNA BCT 11-JUL-1990 DEFINITION Synechococcus sp. 42-kD membrane protein (cmpA) gene, complete cds. ACCESSION M32999 KEYWORDS membrane protein. SOURCE Synechococcus sp. (strain PCC 7942) DNA. ORGANISM Synechococcus sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 1885) AUTHORS Omata,T., Carlson,T.J., Ogawa,T. and Pierce,J. TITLE Sequencing and modification of the gene encoding the 42 kilodalton protein in the cytoplasmic membrane of Synechococcus PCC 7942 JOURNAL Plant Physiol. 93, 305-311 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Pierce, 20-MAR-1990. FEATURES from to/span description pept 141 1493 49-kD membrane protein (cmpA) BASE COUNT 452 a 508 c 487 g 438 t ORIGIN 1 ggttatcagc cttatcggtc tggaataacc agttggccta aagtcatgca gacagagcgt 61 ttctgcgcct ctcgtgaagc aattcgcaca acttgtccat ctttagaggc atctcctgtt 121 gtgggatgta ggggagacgt atgaacgaat ttcaaccagt caatcgtcgt cagtttctgt 181 tcacgctcgg agcaaccgct gctagcgcta ttttgctgaa gggttgcggt aatcctcctt 241 ccagtagcgg cggcgggact tctagtacaa ctcagccaac tgctgcaggg gcgagtgatc 301 tggaagtcaa gacaatcaaa ttgggctaca tccccatctt tgaagcggct ccactgatca 361 ttggccgcga aaaaggcttt tttgccaaat atggcttgga tgttgaagtc tcgaaacaag 421 ccagctgggc agctgctcgc gataacgtca ttctcggttc tgctggtggc ggcatcgatg 481 gcggtcagtg gcaaatgccg atgcctgcct tgctaacgga aggtgcgatc agcaacggtc 541 aaaaagttcc catgtatgtc ttggcttgct tgagcaccca aggcaatggc atcgctgttt 601 ccaatcagct caaggcccaa aatctgggct tgaagctagc gcccaaccgc gactttatcc 661 tcaactaccc gcaaactagc ggccggaagt tcaaagcatc ctacaccttc ccgaacgcca 721 accaagactt ctggattcgc tattggtttg cagctggcgg tatcgatcct gataaagaca 781 ttgaactctt gaccgttccc agcgcagaaa ctctacaaaa tatgcgcaat ggcacgatcg 841 attgcttcag taccggcgat ccctggccgt cgcggattgc caaagatgac atcggctatc 901 aagctgcgct gacaggtcaa atgtggcctt accaccccga ggaattcttg gcgctgcgag 961 cagactgggt agacaaacat ccgaaagcta cgctcgcctt gctgatgggc ttgatggaag 1021 cgcagcaatg gtgcgatcag aaagcaaatc gggcagagat ggccaagatc ctctccggtc 1081 gcaacttctt taacgtgccg gtttcgatcc tgcagccgat tctggaaggt caaatcaaag 1141 ttggagcaga cggaaaagat ctcaacaact ttgatgccgg cccgctcttc tggaagagtc 1201 cgcgcggcag tgtctcctat ccctacaaag ggctcaccct ctggttcttg gtggagtcga 1261 tccgctgggg cttcaacaag caagtgctac ctgacattgc agccgcccag aaactcaacg 1321 atcgcgtgac tcgtgaagac ctctggcaag aggcagccaa gaaattaggg gtgcccgctg 1381 cggatatccc aaccggatcg actcgcggta ccgagacctt ctttgatggc atcacctaca 1441 acccagacag tccgcaagct tatctccaaa gcttgaagat taaacgggca taagtagggg 1501 cttcaatcat caaccttagt tcagtcacta tcaggagata gacagaccat ggttactgca 1561 cgggaaacaa gacgaaacgg aagtcgtcct tctggcttaa aaaaatggcg tcagaaactc 1621 gatggcatct tgctaccgct agcaggaatt ttgggtttcc tcatcatttg gcagatcttt 1681 tctagcacgg gcaacccgct tgcccggccc tgctcagtct cttcacagaa gagagaacac 1741 gcgagttgct gccctatccc ttcttggatc gcggcgggct tgataaaggt ctgttctggc 1801 agacgtatcg cttagttctg acgcgggtgg cccagggctt ttcgatccgc agccatcatc 1861 ggcatcggca tttccgttgg aattc // LOCUS ECOOXYR 1264 bp ds-DNA BCT 11-JUL-1990 DEFINITION E.coli oxyR regulatory protein gene, complete cds. ACCESSION J04553 KEYWORDS oxyR gene; regulatory protein. SOURCE E.coli (strain K12, CSH50) DNA, clones pAQ17 and pMomR1200. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1264) AUTHORS Christman,M.F., Storz,G. and Ames,B.N. TITLE Oxyr, a positive regulator of hydrogen peroxide-inducible genes in Escherichia coli and Salmonella typhimurium, is homologous to a family of bacterial regulatory proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3484-3488 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by G.Storz, 23-MAY-1989. FEATURES from to/span description pept 203 1120 oxyR protein mRNA 170 > 1264 oxyR mRNA signal 133 138 -35 region signal 157 162 -10 region binding 191 195 ribosome binding site BASE COUNT 289 a 308 c 368 g 299 t ORIGIN 89.6 minutes on K12 map. 1 ggatcctgga gatccgcaaa agttcacgtt ggctttagtt attcgagttg agaaactctc 61 gaaacgggca gtgacttcaa gggttaaaag aggtgccgct ccgtttctgt gagcaattat 121 cagtcagaat gcttgatagg gataatcgtt cattgctatt ctacctatcg ccatgaacta 181 tcgtggcgat ggaggatgga taatgaatat tcgtgatctt gagtacctgg tggcattggc 241 tgaacaccgc cattttcggc gtgcggcaga ttcctgccac gttagccagc cgacgcttag 301 cgggcaaatt cgtaagctgg aagatgagct gggcgtgatg ttgctggagc ggaccagccg 361 taaagtgttg ttcacccagg cgggaatgct gctggtggat caggcgcgta ccgtgctgcg 421 tgaggtgaaa gtccttaaag agatggcaag ccagcagggc gagacgatgt ccggaccgct 481 gcacattggt ttgattccca cagttggacc gtacctgcta ccgcatatta tccctatgct 541 gcaccagacc tttccaaagc tggaaatgta tctgcatgaa gcacagaccc accagttact 601 ggcgcaactg gacagcggca aactcgattg cgtgatcctc gcgctggtga aagagagcga 661 acgattcatt gaagtgccgt tgtttgatga gccaatgttg ctggctatct atgaagatca 721 cccgtgggcg aaccgcgaat gcgtaccgat ggccgatctg gcaggggaaa aactgctgat 781 gctggaagat ggtcactgtt tgcgcgatca ggcaatgggt ttctgttttg aagccggggc 841 ggatgaagat acacacttcc gcgcgaccag cctggaaact ctgcgcaaca tggtggcggc 901 aggtagcggg atcactttac tgccagcgct ggctgtgccg ccggagcgca aacgcgatgg 961 ggttgtttat ctgccgtgca ttaagccgga accacgccgc actattggcc tggtttatcg 1021 tcctggctca ccgctgcgca gccgctatga gcagctggca gaggccatcc gcgcaagaat 1081 ggatggccat ttcgataaag ttttaaaaca ggcggtttaa accgtttaac gcagctaccc 1141 gatagcttcc gccatcgtcg ggtagttaaa ggtggtgttg acgaagtact caatagtgtt 1201 gccgccacct ttctgttcca taatcgcctg accgatatga ataatttcgg cgagcgcgct 1261 cgcc // LOCUS CLLRRE 1860 bp ss-rRNA RNA 11-JUL-1990 DEFINITION C.sapidus 18S rRNA, 3' end. ACCESSION M34360 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE C.sapidus rRNA. ORGANISM Callinectes sapidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; Brachyrhyncha; Portunoidea; Portunidae. REFERENCE 1 (bases 1 to 1860) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-113 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1860 18S ribosoma RNA BASE COUNT 359 a 375 c 421 g 367 t 338 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct 61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt 121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta 181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc 241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc 301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc 361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac 421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac 481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat 541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct 601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg 661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc 721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata 841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc 901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc 961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg 1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt 1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt 1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac 1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg 1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct 1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc 1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga 1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag 1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc 1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc 1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn 1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn // LOCUS HUMNCSRC 138 bp ds-DNA PRI 11-JUL-1990 DEFINITION Human membrane-associated tyrosine protein kinase (C-SRC) gene, exons 3, 4, NI, and NII, partial cds. ACCESSION M34469 KEYWORDS membrane-associated tryosine protein kinase. SOURCE Human adult brain DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 138) AUTHORS Pyper,J.M. and Bolen,J.B. TITLE Identification of a novel neuronal C-SRC exon expressed in human brain JOURNAL Mol. Cell. Biol. 10, 2035-2040 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 29 membrane-associated tyrosine protein kinase (C-SRC), exon 3 (AA at 1) 30 47 membrane-associated tyrosine protein kinase, exon NI 48 80 membrane-associated tyrosine protein kinase, exon NII 81 > 136 membrane-associated tyrosine protein kinase, exon 4 variant 117 117 a or g variant 135 135 c or t BASE COUNT 36 a 40 c 40 g 22 t ORIGIN 1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc 61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga 121 cagacaggct acatcccc // LOCUS MHVAPEPA 1000 bp ss-RNA VRL 11-JUL-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34435 KEYWORDS peplomer protein. SOURCE Murine (strain JHM-DL) RNA. ORGANISM Murine coronavirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1000) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review BASE COUNT 258 a 192 c 227 g 317 t 6 others ORIGIN 1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag 61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa 121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct 181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa 241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta 301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt 361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca 421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac 481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg 541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga 601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg 661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt 721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt 781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa 841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa 901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag 961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa // LOCUS MHVAPEPB 843 bp ss-RNA VRL 11-JUL-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34436 KEYWORDS peplomer protein. SOURCE Murine (strain A59) RNA. ORGANISM Murine coronavirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 843) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review FEATURES from to/span description site 400 401 deletion site BASE COUNT 227 a 153 c 183 g 280 t ORIGIN 1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag 61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa 121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct 181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa 241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta 301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt 361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt 421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc 481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc 541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat 601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt 661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc 721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact 781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc 841 gaa // LOCUS MHVSP 3780 bp ss-mRNA VRL 11-JUL-1990 DEFINITION Mouse hepatitis virus surface protein S mRNA, complete cds. ACCESSION X04797 M34437 KEYWORDS glycoprotein; membrane glycoprotein; surface glycoprotein; surface projection glycoprotein. SOURCE Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones pJMS1010, pJS112, and pJS92. ORGANISM Murine hepatitis virus A59 Unclassified. REFERENCE 1 (bases 1 to 3780) AUTHORS Schmidt,I., Skinner,M. and Siddell,S. TITLE Nucleotide Sequence of the Gene Encoding the Surface Projection Glycoprotein of Coronavirus MHV-JHM JOURNAL J. Gen. Virol. 68, 47-56 (1987) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description SITE 24 30 put. region of homology preceeding mRNA 5' initiation codons SITE 121 129 pot. N-glycosylation site SITE 208 216 pot. N-glycosylation site SITE 430 438 pot. N-glycosylation site SITE 604 612 pot. N-glycosylation site SITE 1099 1107 pot. N-glycosylation site SITE 1333 1341 pot. N-glycosylation site SITE 1636 1644 pot. N-glycosylation site SITE 1732 1740 pot. N-glycosylation site SITE 1756 1764 pot. N-glycosylation site SITE 1825 1833 pot. N-glycosylation site SITE 1900 1914 put. proteolytic cleavage site SITE 1972 1980 pot. N-glycosylation site SITE 2023 2031 pot. N-glycosylation site SITE 2293 2301 pot. N-glycosylation site SITE 2440 2448 pot. N-glycosylation site SITE 3139 3147 pot. N-glycosylation site SITE 3301 3309 pot. N-glycosylation site SITE 3331 3339 pot. N-glycosylation site SITE 3388 3396 pot. N-glycosylation site SITE 3436 3444 pot. N-glycosylation site SITE 3499 3507 pot. N-glycosylation site SITE 3553 3654 pot. transmembrane domain SITE 3622 3675 cysteine-rich region SITE 3655 3660 charge cluster SITE 3715 3723 pot. N-glycosylation site FEATURES from to/span description pept 31 3738 surface protein S precursor sigp 31 60 surface protein S signal peptide matp 61 3736 surface protein S BASE COUNT 1030 a 718 c 800 g 1232 t ORIGIN 1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc 61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat 121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact 181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct 241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg 301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac 361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt 421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct 481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat 541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag 601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt 661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg 721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct 781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt 841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt 901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt 961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa 1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact 1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg 1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt 1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt 1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt 1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat 1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat 1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt 1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct 1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc 1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct 1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt 1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat 1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc 1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt 1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc 1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat 2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt 2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat 2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct 2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct 2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac 2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag 2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa 2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg 2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct 2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta 2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac 2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag 2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg 2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca 2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc 2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc 3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata 3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca 3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg 3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat 3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg 3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt 3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc 3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag 3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa 3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt 3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa 3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt 3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt // LOCUS MUSIGLAZ 713 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds. ACCESSION M34598 M29013 J03562 KEYWORDS constant region; immunoglobulin; immunoglobulin light chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone Y31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 713) AUTHORS Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A. TITLE Structure of a third murine immunoglobulin lambda light chain variable region that is expressed in laboratory mice JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987) STANDARD simple staff_entry REFERENCE 2 (bases 2 to 450) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review REFERENCE 3 (bases 266 to 429) AUTHORS Sanchez,P. and Cazenave,P.-A. TITLE A new variable region in mouse immunoglobulin lambda light chains JOURNAL J. Exp. Med. 166, 265-270 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 12 > 713 Ig lambda chain precursor V-x,J-2,C-2 region sigp 12 68 Ig lambda chain signal peptide matp 69 > 713 Ig lambda chain recomb 379 380 V-region end/J2-region start recomb 414 415 J2-region end/C2-region start BASE COUNT 175 a 190 c 159 g 189 t ORIGIN 1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt 61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag 121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc 181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca 241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta 301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa 361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc 421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca 481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga 541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca 601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca 661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct // LOCUS MUSIGLVD 681 bp ds-DNA ROD 11-JUL-1990 DEFINITION Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds. ACCESSION M34597 KEYWORDS constant region; germline; immunoglobulin light chain; joining exon; lambda-immunoglobulin; variable region. SOURCE Mouse (strain Balb/c AnPt) liver DNA, clone 30X2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 681) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review FEATURES from to/span description pept 59 104 Ig lambda-chain precursor Vx-J2-C2 region, exon 1 209 531 Ig lambda-chain precursor Vx-J2-C2 region, exon 2 sigp 59 104 Ig lambda-chain signal peptide 209 219 Ig lambda-chain signal peptide matp 220 528 Ig lambda-chain IVS 105 208 Ig lambda-chain Vx-J2-C2 region intron A recomb 530 531 Vx-region end/J2-region start recomb 565 566 J2-region end/C2-region start BASE COUNT 179 a 156 c 136 g 210 t ORIGIN Chromosome 16. 1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat 61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt 121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc 181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca 241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag 301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg 361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct 421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa 481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa 541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta 601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa 661 tcctcttgga cttcgtaggg c // LOCUS PAERRE 1877 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.kadiakensis 18S rRNA, 3' end. ACCESSION M34359 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.kadiakensis rRNA. ORGANISM Palaemonetes kadiakensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; Palaemonoidea; Palaemonidae. REFERENCE 1 (bases 1 to 1877) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-113 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1877 18S ribosoma RNA BASE COUNT 339 a 321 c 399 g 313 t 505 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt 61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt 121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac 181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg 241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca 301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct 361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna 421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga 481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac 541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn 601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac 841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna 901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta 961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta 1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa 1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt 1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc 1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt 1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn 1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg 1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna 1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg 1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca 1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact 1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag 1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnnnnn // LOCUS PBESVBRA 584 bp ds-DNA INV 11-JUL-1990 DEFINITION P.berghei telomeric repeat region subfragment alpha DNA. ACCESSION M34601 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 584) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 461 541 27 bp repeats BASE COUNT 205 a 41 c 85 g 251 t 2 others ORIGIN 1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac 61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc 121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata 181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat 241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat 301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa 361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt 421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg 481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg 541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga // LOCUS PBESVBRB 593 bp ds-DNA INV 11-JUL-1990 DEFINITION P.berghei telomeric repeat region subfragment a DNA. ACCESSION M34602 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 593) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 484 510 27 bp repeat motif BASE COUNT 209 a 40 c 97 g 247 t ORIGIN 1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca 61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac 121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa 181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt 241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat 301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat 361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat 421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg 481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt 541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg // LOCUS PEURRE 1902 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.aztecus 18S rRNA, 3' end. ACCESSION M34362 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.aztecus rRNA. ORGANISM Penaeus aztecus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea; Penaeidae. REFERENCE 1 (bases 1 to 1902) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1902 18S ribosoma RNA BASE COUNT 340 a 369 c 427 g 332 t 434 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt 61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt 121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg 181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc 241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg 301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc 361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag 481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga 541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga 601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg 661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc 721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn 781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn 841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg 901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg 961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna 1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn 1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg 1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa 1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn 1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg 1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac 1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta 1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct 1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg 1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact 1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg 1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt 1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn // LOCUS POCRRE 1874 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.ascensionis 18S rRNA, 3' end. ACCESSION M34358 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.ascensionis rRNA. ORGANISM Procaris ascensionis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; Procaridoidea; Procarididae. REFERENCE 1 (bases 1 to 1874) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1874 18S ribosoma RNA BASE COUNT 298 a 291 c 331 g 266 t 688 others ORIGIN 1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct 61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt 121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac 181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt 241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac 301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn 361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc 481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa 541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn 601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt 781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg 841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat 901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat 961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga 1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac 1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt 1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca 1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt 1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta 1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct 1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag 1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca 1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn 1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg 1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg 1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnn // LOCUS PRARRE 1869 bp ss-rRNA RNA 11-JUL-1990 DEFINITION P.leonensis 18S rRNA, 3' end. ACCESSION M34363 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.leonensis rRNA. ORGANISM Procambarus leonensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea; Astacoidea; Cambaridae. REFERENCE 1 (bases 1 to 1869) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1869 18S ribosoma RNA BASE COUNT 409 a 412 c 459 g 419 t 170 others ORIGIN 1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt 61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt 121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat 181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt 241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc 301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca 361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg 421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt 481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact 541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt 601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag 661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc 721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn 781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa 841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg 901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac 961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg 1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg 1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg 1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa 1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta 1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact 1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg 1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg 1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac 1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa 1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn 1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt 1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa 1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnn // LOCUS PVIC1RPTA 711 bp ds-DNA INV 11-JUL-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28745 M25758 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK247) sporozoite DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 711) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 711 circumsporozoite protein (AA at 1) site 1 15 region 1 site 697 711 region 2 site 502 597 post repeat variable region site 598 696 post repeat constant region BASE COUNT 261 a 151 c 226 g 73 t ORIGIN 1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat 61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca 121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca 181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga 241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc 301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa 361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga 421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat 481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca 541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga 601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta 661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c // LOCUS PVIC1RPTB 657 bp ds-DNA INV 11-JUL-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28746 M25759 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK210) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 657) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 657 circumsporozoite protein (AA at 1) site 1 15 region 1 site 643 657 region 2 rpt 16 477 27 bp repeats site 478 544 3' post repeat variable region site 545 642 3' post repeat constant region BASE COUNT 229 a 135 c 224 g 69 t ORIGIN 1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat 61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa 121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca 181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat 241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca 301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga 361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca 421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga 481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca 541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa 601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc // LOCUS SHV2A 554 bp ss-RNA VRL 11-JUL-1990 DEFINITION Simian hepatitis A virus segment 2A-encoded protein mRNA, partial cds. ACCESSION M34085 KEYWORDS . SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 554) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL Unpublished (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. Author address: E.A.Brown 547 Burnett-Womack CB# 7030 Chapel Hill, NC 27599 FEATURES from to/span description pept < 1 > 554 segment 2A-encoded protein (AA at 1) BASE COUNT 184 a 71 c 138 g 161 t ORIGIN Segment 2A; map position 3108-3662. 1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg 61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag 121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa 181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg 241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg 301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg 361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat 421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt 481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg 541 tcaggaatac agga // LOCUS SHVVP1CP 2373 bp ss-RNA VRL 11-JUL-1990 DEFINITION Simian hepatitis A virus capsid protein VP1 mRNA, partial cds. ACCESSION M34084 KEYWORDS capsid protein VP1. SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed in cwll line BS-C-1. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 2373) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL J. Virol. 63, 4932-4937 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. FEATURES from to/span description pept < 1 > 2373 capsid protein VP1 (AA at 1) BASE COUNT 661 a 447 c 491 g 774 t ORIGIN Segment P1; map position 735-3107. 1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg 61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact 121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca 181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga 241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa 301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg 361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca 421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct 481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc 541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat 601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct 661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca 721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt 781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat 841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca 901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag 961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa 1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg 1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt 1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc 1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg 1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag 1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat 1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct 1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct 1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag 1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta 1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca 1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg 1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt 1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg 1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg 1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca 1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg 2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat 2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc 2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg 2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg 2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat 2341 gatcctcgaa ctgaagagga tcgtaaattt gaa // LOCUS STNRRE 1885 bp ss-rRNA RNA 11-JUL-1990 DEFINITION S.hispidus 18S rRNA, 3' end. ACCESSION M34361 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE S.hispidus rRNA. ORGANISM Stenopus hispidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea; Stenopodidae. REFERENCE 1 (bases 1 to 1885) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1885 18S ribosoma RNA BASE COUNT 327 a 336 c 430 g 368 t 424 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt 61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact 121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn 181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta 241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag 301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt 361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga 421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn 481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag 541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag 601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn 661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc 721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg 841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg 901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg 961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt 1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt 1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg 1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat 1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg 1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt 1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga 1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng 1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt 1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc 1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg 1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg 1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn 1861 nnnnnnnnnn nnnnnnnnnn nnnnn // LOCUS MSQMUD76A 124 bp ds-DNA BAD 11-JUL-1990 DEFINITION A.dirus DNA probe pMU-D76. ACCESSION M34656 KEYWORDS . SOURCE A.dirus (Strain D) wild-caught female DNA, clone pMU-D76. ORGANISM Anopheles dirus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 124) AUTHORS Panyim,O., Yasothornsrikul,S., Tungpradubkul,S., Baimai,V., Rosenberg,R., Andre,R.G. and Green,C.A. TITLE Identification of isomorphic malaria vectors using a DNA probe JOURNAL Am. J. Trop. Med. Hyg. 38, 47-49 (1988) STANDARD simple staff_review BASE COUNT 32 a 33 c 34 g 25 t ORIGIN 1 gatctgcact cggcgtgaat ttggttacca tcgaatgtgc ggaaaaagtt ttaccccgtg 61 cgcagtgcgg aacacgccag acttgttaca cacggaaacg gaccacgaac gtgttacgcg 121 cacg // LOCUS ACCCITSYN 1895 bp ds-DNA BCT 11-JUL-1990 DEFINITION A.anitratum citrate synthase gene, complete cds. ACCESSION M33037 KEYWORDS citrate synthase. SOURCE A.anitratum DNA, clone pLJD1. ORGANISM Acinetobacter anitratum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1895) AUTHORS Donald,L.J. and Duckworth,H.W. TITLE Expression and base sequence of the citrate synthase gene of Acinetobacter anitratum JOURNAL Biochem. Cell Biol. 65, 930-938 (1987) STANDARD simple staff_review FEATURES from to/span description pept 264 1538 citrate synthase precursor matp 267 1535 citrate synthase signal 120 125 -35 signal signal 141 146 -10 signal signal 1560 1585 pot. transcription termination signal BASE COUNT 534 a 406 c 357 g 598 t ORIGIN 1 gtactcaacg cttaattttt ttctgcacgt tcttcttgaa ttgacttatg ataccatccc 61 gatgcagtga ttttactgac tttttttgct cgggtcttga tgactaactc tctgtgggaa 121 cgtcattttt tatccataag tataattgac aaaatttcag tactcactaa tcttatagca 181 aattttgaca ccgtctgatt cgcacatgag aaaattagga tttcgagtca gataatcatt 241 caccaggaca ggagatctat tgaatgtctg aagcaactgg caaaaaagcc gtattacatc 301 ttgatggcaa agaaattgaa ttaccaattt acagtggcac attaggtccc gatgtaatcg 361 acgttaaaga tgtattggcc tcaggtcact ttacttttga tcctggtttt atggcgacag 421 cttcatgcga gtctaaaatc acatttatcg atggtgacaa aggtatttta ttacaccgcg 481 gttacccgat tgaccagtta gcgactcaag cagactacct tgaaacttgt tatttattat 541 taaatggcga gttaccaact gctgaacaaa aagttgagtt cgatgcgaaa gttcgtgctc 601 atactatggt tcatgatcaa gttagccgtt tcttcaatgg tttccgtcgt gatgctcacc 661 ctatggcaat catggttggt gtagtaggcg cattatctgc tttctatcac aacaaccttg 721 acattgaaga catcaaccac cgcgaaatta ctgcgattcg tttgattgct aaaattccaa 781 cgcttgctgc ttggagctac aaatatactg taggtcagcc attcatctat ccacgtaatg 841 acttaaatta cgcggaaaac ttcttacaca tgatgtttgc aactcctgca gaccgtgact 901 acaaagtaaa ccctgttctt gctcgtgcaa tggatcgtat ctttacgctt cacgctgacc 961 acgaacaaaa cgcgtctact tctacagttc gtcttgctgg ttctactggt gcgaatccat 1021 atgcgtgtat ctctgctggt atctctgctc tttggggtcc tgcacacggt ggtgcgaacg 1081 aagcagttct taaaatgctt gatgaaatcg gtagcgttga aaatgttgct gagttcatgg 1141 aaaaagttaa acgcaaagaa gttaaactta tgggcttcgg tcaccgcgtt tacaaaaact 1201 tcgatccacg cgctaaagtg atgaagcaaa cttgtgacga agttcttgaa gcattaggta 1261 tcaatgatcc tcaattagcg cttgctatgg aacttgaacg tattgcattg aacgacccgt 1321 actttgttga acgtaaactt taccctaacg tagacttcta ctctggtatc atccttaaag 1381 cgattggtat cccaacagaa atgtttaccg ttatcttcgc tcttgcacgt acagttggct 1441 ggatcagtca ctggttagaa atgcacagcg gtccttacaa aattggtcgt cctcgtcagc 1501 tttacactgg tgaagtgcaa cgtgacatca agcgttaata ttcgaaagaa tattaatgta 1561 aaaagctgcc taatggcagt tttttttata aataagtttt aaaagttatt cttcttcaaa 1621 catatttaat aagtgatgac taataccatc agctcttagc caagccaact cataacttgc 1681 ttcggccaaa gctaaaatac gtctttcaaa ctcagtccat acttgtttaa cttgcgcttc 1741 tgaatcccta aaccactgtc atagctaaat gcttattctt ttcacatatt tttaaggcat 1801 ggtagagttt agccctttac tcgccccttc attaacctga cacgtttacc taatataaat 1861 ccttctacat gctgtagact gggaacatag gtacc // LOCUS ECOGUAC 1991 bp ds-DNA BCT 11-JUL-1990 DEFINITION E.coli GMP reductase (guaC) gene, complete cds. ACCESSION M33020 KEYWORDS GMP reductase. SOURCE E.coli (strain K12) DNA, clone pDS89. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1991) AUTHORS Andrews,S.C. and Guest,J.R. TITLE Nucleotide sequence of the gene encoding the GMP reductase of Escherichia coli K12 JOURNAL Biochem. J. 255, 35-43 (1988) STANDARD simple staff_review FEATURES from to/span description pept 210 1250 GMP reductase (guaC) (E.C. 1.6.6.8) mRNA 25 < 1250 guaC mRNA (put.) mRNA 90 < 1250 guaC mRNA (put.) mRNA 126 < 1250 guaC mRNA (put.) BASE COUNT 493 a 494 c 508 g 496 t ORIGIN 1 gaattcatca tgattatcaa aacgttaaaa atgagtgcac gaaagcgaaa ttgatgaaac 61 gttcgctcac tatttaccag gtaaatttat gggattgtag cgtaaaaaaa gacaatttcg 121 cagtcttgcg ccgcattgat tagtgcgtat gatagcgtca ctggagttgc gctcttaccc 181 ttatagccat taaccccagg aatccgcaca tgcgtattga agaagatctg aagttaggtt 241 ttaaagacgt tctcatccgc cctaaacgct ccactcttaa aagccgttcc gatgttgaac 301 tggaacgtca attcaccttc aaacattcag gtcagagctg gtccggcgtg ccgattatcg 361 ccgcaaatat ggacaccgta ggcacatttt ctatggcctc tgcgctggct tcttttgata 421 ttttgactgc tgtgcataaa cactattctg tcgaagagtg gcaagcgttt atcaacaatt 481 cttccgctga tgtgctgaaa catgtgatgg tttctaccgg tacgtctgat gcggatttcg 541 aaaaaactaa acagattctc gacctgaacc cggcattaaa cttcgtttgt attgacgtgg 601 cgaatggtta ttccgaacac ttcgtgcagt tcgttgcgaa agcgcgtgaa gcgtggccga 661 ccaaaaccat ttgtgctggt aacgtagtga ctggtgaaat gtgtgaggag cttatcctct 721 caggtgccga tatcgttaaa gttggcattg gcccaggttc tgtttgtaca actcgcgtca 781 aaacaggcgt cggttatccg caactttctg cggtaatcga atgtgccgat gctgcgcacg 841 gtctgggcgg aatgatcgtc agcgatggtg gctgcaccac gccgggcgat gtggcgaaag 901 cctttgcgcg tgccgatttc gtcatgcttg gcggcatgct ggcgggccac gaagagagcg 961 gcggtcgcat cgttgaggag aacggcgaga aatttatgct gttctacggc atgagctccg 1021 agtctgcgat gaaacgtcac gttggcggcg ttgcggaata tcgcgcagca gaaggtaaaa 1081 ccgttaagct gccgctgcga ggcccggttg aaaataccgc gcgagatatt ttgggcggcc 1141 tgcgttcagc ttgtacatac gttggggctt cacgcctgaa agagctgacc aagcgcacca 1201 cgtttattcg tgtgcaggaa caagaaaacc gcatcttcaa caacctgtaa tctcccaacg 1261 ctggcgtgga gcaacacgcc acggttatcc catcccactc atcgcatcgc ctaaatggaa 1321 aattggcaga tacattgcca ccaccagcgt accaataatt cctcccgtta tgatcagcaa 1381 cgcggttcag taaggctgcg aggttatccg ccagcgccat tgtgttttcc cgatgatgat 1441 gggcgaggtt gtctaacatg agatccagag agccggatgc ctctcctgtt ctcactaatt 1501 gcaaacagag cgggctaaac tcaccggtat tttttagcgc cagccagatg ggttgaccgt 1561 tactgatatc gtgctggatt tgtgtcagaa gttgcaccca gtacgggcag cgcattgttt 1621 ctctgacgct ctctacgccc tgtaaaaaag taatgcctgc actttgtgtc agcgccagaa 1681 tcgtaaagat ctgcgtgagt ttttgtcccc gcatcagtga acccataatc gggatgcgta 1741 acagcaattt ctgccgcact ataagccagg tcggtcggcg catcagcaac ttattggcta 1801 tcgccagcag aaagccgaac acaccagcag ccagctccat tcgccactaa agtctgccag 1861 cgtcatgatc ccctgcgtta gtgccggtag tggggtgttg aaggtcttat agatagcggc 1921 aaactccggc agacacaaaa tgcagcattg ccacaaccac catgattagc catcgctaaa 1981 atgatgatgg g // LOCUS HUMDKERB 8815 bp ds-DNA PRI 11-JUL-1990 DEFINITION Human cytokeratin 8 (CK8) gene, complete cds. ACCESSION M34482 KEYWORDS cytokeratin 8. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 8815) AUTHORS Krauss,S. and Franke,W.W. TITLE Organization and sequence of the human gene encoding cytockeratin 8 JOURNAL Gene 86, 241-249 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1113 1436 cytokeratin 8, exon 1 3972 4180 cytokeratin 8, exon 2 4809 4869 cytokeratin 8, exon 3 5344 5439 cytokeratin 8, exon 4 5958 6248 cytokeratin 8, exon 5 7113 7333 cytokeratin 8, exon 6 7492 7550 cytokeratin 8, exon 7 8380 8567 cytokeratin 8, exon 8 IVS 1437 3971 CK8 intron A IVS 4181 4808 CK8 intron B IVS 4870 5343 CK8 intron C IVS 5440 5957 CK8 intron D IVS 6249 7112 CK8 intron E IVS 7334 7491 CK8 intron F IVS 7551 8379 CK8 intron G signal 1007 1011 TATA box signal 8779 8784 poly-A signal BASE COUNT 1868 a 2324 c 2481 g 2142 t ORIGIN 1 tcaacggatc tcgctctttt ttttctttgg agatggaatc tcgctctgtc gcccaggctg 61 gagtgcagtg gcaagtctca gctcactgca actctgcctc ccgggttcaa gtgattctcc 121 tgcctcagcc tcctgagtag ctgggattac accatggcca gctaattttt gtatttttag 181 tagagatggg gtttcaccat gttggtcagg cttgtcttga actcctgacc tcgtgatccg 241 cctacctcag cctcccaaag tgctgggatt acaggcgtgc acagcgtgcc ctggccttgg 301 atctcttttt atcttgcacc ttcagatgta gagggacgac agccactgtg tgtgtatgtg 361 tatgtgtgtg tgtgtgtgtg tgtgtgcgcg tgtgatgttt attcactcat ttatttattc 421 attcattcat tccacaaata tctacccaga ccctcttggc actgcaccag gtcgtagggg 481 tagaacagta acctggaaag atgaggcaaa tggttgattt cagattcaag gctttggact 541 ccagctgttc tgtcatccag ctcaggcagg ccctcataat cgcttcaatc agggagaaca 601 caggagagtt tctctggggt gtcggcagct cagaggagac ccaaatacta ggagacccct 661 tttcccatgc ttcccagtcc tccagtttat ttcccccagg aaggagggag acaagaccca 721 gagtcagggt tgtagtggct gggcggccca ggcaagtctg cttgttacac gacttgtgcc 781 aggacaggat ttcttccagt ttcatattca ctgaactgcc ttttcctggg tttctggggg 841 tggtgctgga gtgggctcca gggttggaac gggcccttgc gacgcgtctc tgctgccccc 901 acctgagtct gccccgaggt ggcaggtgac gggttcacgc gacgcctctg gcctagccac 961 tcaggtacga ggcctttccc ccactccccg gggctgggat ctcttttata aaaggccatt 1021 cctgagagct ctcctcacca agaagcagct tctccgctcc ttctaggatc tccgcctggt 1081 tcggcccgcc tgcctccact cctgcctcta ccatgtccat cagggtgacc cagaagtcct 1141 acaaggtgtc cacctctggc ccccgggcct tcagcagccg ctcctacacg agtgggcccg 1201 gttcccgcat cagctcctcg agcttctccc gagtgggcag cagcaacttt cgcggtggcc 1261 tgggcggcgg ctatggtggg gccagcggca tgggaggcat caccgcagtt acggtcaacc 1321 agagcctgct gagccccctt gtcctggagg tggaccccaa catccaggcc gtgcgcaccc 1381 aggagaagga gcagatcaag accctcaaca acaagtttgc ctccttcata gacaaggtga 1441 gggtcccctg cgtggctgac tgtgccccgc agcccctttc tcctggtagt cccggtccct 1501 atgcacatct ccagccccca gctggcgtcc tgctgggcct cacccgccct gggcacactc 1561 tcccttccat cctccgacct cacccctccc gtgcaccttg gtttgggctg ggtgagggtg 1621 gggagagggt ctggacagcc gggatgaatc ctggggcttc cttcttccct tttaaactgg 1681 agggtcttgg aagagagaga caacttaagg gtacagccta gttcccacca cccctctcta 1741 caaatcccgt tcttcctcag gtcattctgt cccaaattat aaaaaataat agcggttatt 1801 gttctcaccc caacccagtt ctgaccgtct tttaacgtat gcctgcggca gtcccagctg 1861 ttcgggacta ccctcctcca ggttcgcctc ttcgccagca ctacccaagg ctccccagtg 1921 gtgcctttgt gatttttttt ctttcttttt tttacatagg ggtttggtgt gattctagca 1981 ttctaggaga aggaagtggg tgtctcggtt caaacgggca aatattgatt gaggcctttg 2041 gccgccggag gcctgagtgc gggggtcaca gaatgagtca tacggcccct ggcccggcag 2101 cgtgggcggg gccgagggcg gggtgagggc tgcgggcagc agtctgcggg acgctctcct 2161 ccactggcgg agctcggcgt cgggggcggt gtgggtgggg tggggtgggg tggggtgggc 2221 tggggtgggg tggaggaggc gagggcctgg cctcggaaag cccatgcagg attcaaagtc 2281 tcctgggacg ccgcccgggg tttacgtcct gttaagttta tggcttcaga taacgcggtc 2341 gcccaccaac gcccctcgcc cattcagccc gtgtcccttt ctcggcgtcc tgtccctgct 2401 gcccccagcc tcggctccac tttccacaca gcaggagcca gggccgggtt ttgcagcctg 2461 ggactccgct gcctgagccc cggcccccgg cggccccgag gattgggccc ttcacgctga 2521 ctggctcctg ggaggcattg tgggaacggg aggagggaaa tcctggggca gagtaagccg 2581 ggaggaaccg gagccccagg aacccagtgg tcgggggccc tcgctgtcca agcgcctgga 2641 cttgacttgt tgactgcgtt ttgctagccc tggggtcctt atagagagca gctaagcata 2701 ggctttggaa tctgaattct tggtctgcac tcgtctgccg gttcctggtt atggactccc 2761 ttgccaagtc ttatttcctc atctataaaa tgaatatgag agcccctaaa tccatatagc 2821 aaaagttttt gccttattca aacttacata tgtaaagagt tcagcagtgc ttggcccaca 2881 ttccattagg ataagatgtt ataatcactt ttttttaaaa aataattttg gggcagaatg 2941 actggggaag aaagcgattt gcagagagtg gtggagggaa ctaggctgta cccttaaaag 3001 atttctgtcc cctccagttt agaaggagtt acaagttttt ttgtttgttt gagacagagt 3061 tactctgtgc ccaggctgga gtgcagtggt gtgatctcag ctcactgcaa cgctccgctt 3121 cctgggttca agcgattctc ctgcctcagc caccgagtag ctgggactac aagtgcgtgc 3181 acagcccggt taattttgta attattgtag gcaaggttca atatgttggc aggctggtct 3241 cgaactctga cttcagaaat ccgcctgcct tgaccaccca aagtgctgga attacagcgt 3301 gagcctccac gcccggcctc tttttcaatc ttaacatctt tagaaaggtt ggctattttt 3361 ggccgggcgc gggcttacgc ctataatccc agcactttgg gaggccaagg cgggccaatc 3421 acaaggtcag gagttcgaga ccatcctgcc taagacggtg aaaccctgtc tctactaaaa 3481 atacaaaaaa attagtgggg cgtggtggca cgcacggctg cctgtagccc cagccactcg 3541 ggaggctgag gcaggggcag gagaatggca tgaacttggg aggcggagct tgcagtgagc 3601 tgagatcttg cactgcactc tagcctgggc cggagactcc caaagaaagc ttggctattt 3661 ttattgatgt gtaatataca acctatgtaa atgaagttag gcctattggt ttgcaaatgc 3721 agctttaaca taattacctt acctgtctcc ttcccctacc caatgctgag ggacattgct 3781 ccccacctca ccatcatgcc atgctttctc cccctggtca taggtgatct ttccagaaca 3841 gctaaccagg tgcctggggt ctggagactt actgcttgag gagtgaatta agagaaaaga 3901 ctgcttgctt tcctccagac tttgagccct ggcctgatgt agaccttttt gctctctcct 3961 ccttcgtata ggtacggttc ctggagcagc agaacaagat gctggagacc aagtggagcc 4021 tcctgcagca gcagaagacg gctcgaagca acatggacaa catgttcgag agctacatca 4081 acaaccttag gcggcagctg gagactctgg gccaggagaa gctgaagctg gaggcggagc 4141 ttggcaacat gcaggggctg gtggaggact tcaagaacaa gtgagcaact ccaccctcca 4201 cccaactgaa gtcacctgct ctcctccacc ccttgacctt gggactaagt ccatggccct 4261 ctgttgtggg aagtgcagtc ctatctaatt agggtgacca cctgatgagg tttctcggac 4321 agtctgtgtt tatgccaggt tctagcacat tgttgatagt acccacccct ttcaatctaa 4381 ctgtctggat ttgaagaaca aattatgtgt caatgttgac atggtaaacc tgagacggga 4441 gagataggca gcctgtgggc ctcacttttg tacttaacat tctggcccct ctttagtctt 4501 gacccttgac ctctagcaaa ctctagaaag ttctgtctga ggtctcatgt caggccctgc 4561 tgttaacact ctcaaggtgt ccaatccgat gtgtattcat ggatttggag agagatttcc 4621 tgcttcccac gggctaaggg aggggtgagg gtggagaggg cagctgggga aggcagaagg 4681 accagccttc tcatatcctc atctctgtga actgaatttc ctgatttcac aacgcccctg 4741 tctcccaaaa gaccaagggc aacctccctt ttgccttcat cctctaattg taagtctttt 4801 cctcacaggt atgaggatga gatcaataag cgtacagaga tggagaacga atttgtcctc 4861 atcaagaagg tgagggagtc tcccttctcc tatctggaca ctggaggctg gggctcagag 4921 actcagacca agaagctttc tgggttttgt ccctaaatat tcctaagtag tgggacaaac 4981 tcatttatgt aaacatttgg gtgcacagaa aggtagacaa ggatggagtg gtaggtgcat 5041 ttggacagaa ctcttgacat cggtgttggg acatggttca gaaaacagag cagtagaact 5101 ggagatctgg ctctagaagg ctccctagag aaggaggtgg aagagggtgt gttgcaggaa 5161 gcagaggtga aggtgtgtgg gctgagaatg cacatgtgat gggcagaggc tgggctggaa 5221 gatcaatcca caaagtggca actagaaagt cctgtgacca ggccattggg tggaccttgg 5281 gagccccttg gttggggttg ggtgtggaaa cccagctcag gctcccctct cctcatcccc 5341 caggatgtgg atgaagctta catgaacaag gtagagctgg agtctcgcct ggaagggctg 5401 accgacgaga tcaacttcct caggcagcta tatgaagagg tatgttcctg gtcgcaggag 5461 agtgagggtc cccagccttg tcagcgcctc caccctgaga ctcaaccaga ggctcctccc 5521 agcccccagc acactaataa gacaaaggac cccactgctg actaattaca gccaccaata 5581 tttgctcggc tagtatttat tgggtctata tgttctgtcc ctcgcatgag gtgagtcatt 5641 accccatttc acagacgaga aagtgggctc agagaagtga aataacgtat ccaaggtcat 5701 catagggtgt ggtgattcag cagcaactct gtccccaaag cccttgttcc taatctttga 5761 gctgcattgg atccctctgt gcacctagta ttggtgaccc agttcctttt tcaggaactt 5821 tgcccctctc cctgaccctg actcccacct gctcctctcc tctgctgccc ctgtcttata 5881 cctaagaaag gctgttgtgg aaaagggggc tcctgtgtgc agagacaggg cctcaccact 5941 tgccctcttc cccacaggag atccgggagc tgcagtccca gatctcggac acatctgtgg 6001 tgctgtccat ggacaacagc cgctccctgg acatggacag catcattgct gaggtcaagg 6061 cacagtacga ggatattgcc aaccgcagcc gggctgaggc tgagagcatg taccagatca 6121 agtatgagga gctgcagagc ctggctggga agcacgggga tgacctgcgg cgcacaaaga 6181 ctgagatctc tgagatgaac cggaacatca gccggctcca ggctgagatt gagggcctca 6241 aaggccaggt atgggccggg ttgggggtgg gagggttcct tggacacaat cctggtgaga 6301 ggagataatg taggaagagt gaagtttctg ggagtcgggg aaggaatcct agaccagggt 6361 tcaggagttg gaggggcagc cacagttcag cttctcagtc tgcttctgag aagcaaaggg 6421 atgcagggaa ggtcccttgg gccaggacag aggtgaaagg ggactggggc aggtatgttg 6481 gggactcgtg atacatgctc caagcctgct ttaatcagtc atatgcatca ggggtaaggt 6541 tgagctctgc tgctttaagg aaagtctaga acccagggat ctagtccagt tagggtaggg 6601 ggaccttaca gtgtcgcagg tcgagaaggg tgtggagggg aagcacctgg aaactgctca 6661 tgtctccctg atctgcttcc ttagtctcgt ttatttattt atttattttt gagacagagt 6721 cttgctctgt cgcccaggct ggagtgcagt ggcgtgatct cggctcactg caagctccgc 6781 ctcctgggtt cacactattc tcctgactca gcctcctgag tagctgggac tacaggcgcc 6841 cgcaccaggc tggctaattt tttttgtatt tttgctagag acggggtttc actgtgttag 6901 ccaggactcg tcgatctcct gaccttgtga tctgcccgcc tcgcctccca aagtgctggg 6961 attacaggca tgagcactgt gcccggccct tagtctcatt aattgagctg gggagtcagc 7021 ctagtgtgtg gaggacctga gggagggtgg acgcacggag gaagagaagg catacccaac 7081 ctgacctact tacctgtccc ctacccacag agagggcttc cctggaggcc gccattgcag 7141 atgccgagca gcgtggagag ctggccatta aggatgccaa cgccaagttg tccgagctgg 7201 aggccgccct gcagcgggcc aagcaggaca tggcgcggca gctgcgtgag taccaggagc 7261 tgatgaacgt caagctggcc ctggacatcg agatcgccac ctacaggaag ctgctggagg 7321 gcgaggagag ccggtgggtg tgggtacctc tgaccggacc tgcttcccta tccctgggac 7381 ctggggtggg gacggtggga gccccctgaa gccccttgga cttggggtcc tgttgttctg 7441 ggccaagaag ggctaggagt tggtcctgac accccatttg acagggtaca ggctggagtc 7501 tgggatgcag aacatgagta ttcatacgaa gaccaccggc ggctatgcag gtggtgtccc 7561 agggccctgg atgagggcgg gaggcagggc cagggaggct cagctccagg gagggggctg 7621 tgctcagtcg ctcacagtga cctcagcctg agcactcatg ttcttgggag aatcctaggg 7681 tggggaggca catattcagg gaactccagt aataacttta ttacttagta acttcatatt 7741 agaagataca ccaataacca tagctgtgtg ccaggcactt gcgtaagtat cctacaggtt 7801 ttatgtgatt tattttattt attaatttaa tttaattttt ttgagacgaa gtctcgctgt 7861 caccaagctg agtgcagtgc tgatctcagc tcactgtaac ctcacctcct gggttcaaga 7921 gattctcctc cgtcaggcct cccaagtagc tgggactaca ggcgcatacc accatgccca 7981 tgctaatttt tgtattttta gtagagacgg ggtttcactg tgttgggcag gctggtctcg 8041 aactcctgac cttgtgatca gtgctgggat tacaggcatg agacactggg cctggctgta 8101 atttattttt tatatgacac ctgtaaacgt cttcagttga ggaaggctga ggtgcagcta 8161 aatgtccaag ctgacacagg ctatatatat ggcagctgtt ttccaccctg ctcctggttt 8221 tccctgacag ttctggagta gtgaaccatg caatcactga tcaggagagc tgggttaacc 8281 tccatccctg gggctatgtt gggaatgagc agggagaagg gcatggagcc tgccatggtg 8341 ggcttctgta ctcatgtggc tacctctgtc cctcaccagg tggtctgagc tcggcctatg 8401 ggggctcaca agccggcctc agctacagcc tgggctccag ctttggctct ggcgcgggct 8461 ccagctcctt cagccgcacc agctcctcca gggccgtggt tgtgaagaag atcgagacac 8521 gtgatgggaa gctggtgtct gagtcctctg acgtcctgcc caagtgaaca gctgcggcag 8581 cccctcccag cctacccctc ctgcgctgcc ccagagcctg ggaaggaggc cgctatgcag 8641 ggtagcactg ggaacaggag acccacctga ggctcagccc tagccctcag cccacctggg 8701 gagtttacta cctggggacc ccccttgccc atgcctccag ctacaaaaca attcaattgc 8761 tttttttttt tggtccaaaa taaaacctca gctagctcgc cgaatgtcct tgctt // LOCUS HUMSRU30S 179 bp ss-RNA RNA 11-JUL-1990 DEFINITION Human 30S small nuclear ribonucleotide protein pre-mRNA complex, exons 1 and 2 (partial). ACCESSION M34493 KEYWORDS small nuclear ribonucleoprotein. SOURCE Human Hela cell pre-mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 179) AUTHORS Pruzan,R., Furneaux,H., Lassota,P., Hong,G.Y. and Hurwitz,J. TITLE Assemblage of the prespliceosome complex with separated fractions isolated from Hela cells JOURNAL J. Biol. Chem. 265, 2804-2813 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 55 small nuclear ribonucleotide protein (snRNP), exon 1 (AA at 2) 142 > 179 small nuclear ribonucleotide protein, exon 2 pre-msg < 1 > 179 snRNP pre-mRNA complex IVS 56 141 30S small nuclear RNA intron A BASE COUNT 30 a 48 c 46 g 55 t ORIGIN 1 aatacacgga attcactctc ttccgcatcg ctgtctgcga gggccagctg ttggggtgag 61 tgtgacctgc acgtctaggg cgcagtagtc cagggtttcc ttgatgatgt catacttatc 121 ctgtcccttt tttttccaca gctcgcggtt gaggacaaac tcttcgcggt ctttccagt // LOCUS K5TPA1PRO 307 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor DNA. ACCESSION M34486 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1266. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 307) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 304 > 307 bacteriophage BK5-2 mRNA BASE COUNT 80 a 56 c 43 g 128 t ORIGIN 1 gatcaaggtg tgtaggtgta atctctagct taggaacgct tttgatacag aacgtgtgat 61 tgtccgtttt taactttctt gttttgtcat cttcataaac tcacaaagtt tatttttgga 121 acaaattttt cttttttatc gtatgacgta acttttttca tttggtccat cataagcttt 181 tttaatattg tcagcttttg ctttttcgac gttctctacc gacgctttca aaatctttaa 241 tgaaaaaaac cgtaaccatc gaatttttct tccatatttt caaagaatcc gttactatct 301 aacgatc // LOCUS K5TPA3PRO 182 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor Pa3 DNA. ACCESSION M34488 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1268. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 182) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 169 > 182 promotor region mRNA BASE COUNT 57 a 21 c 24 g 80 t ORIGIN 1 ttttcagaat atgaagttaa aagttctcta atatttttat ccgttaaaga gtatcctata 61 aataaaattg gggattctgt taagtttgac aatatttttc gcatttacta atgctaattt 121 agattcatta tttttataat cctcactagt tatacatata gtatttgggt ttttgactga 181 tc // LOCUS K5TPF1PRO 177 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor Pf1 DNA. ACCESSION M34490 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1262. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 177) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description pept 149 > 177 ORF mRNA 110 > 177 ORF mRNA BASE COUNT 63 a 29 c 37 g 48 t ORIGIN 1 cctttattct tcgtgcaagg aggcgcaaga tggtcaaaac ttacaaaccg attgatttta 61 acagaaaatg taagattgga gttactaaaa cagtaactta ctccaactgg aggtaagatt 121 gaaaaaattg acccaggaac ggttttaaat gttcgatttc gcggctaaaa tgagatc // LOCUS K5TPF2PRO 1209 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor Pf2 and an ORF, partial cds. ACCESSION M34487 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1261. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1209) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 249 > 1209 bacteriophage BK5-2 mRNA BASE COUNT 377 a 213 c 235 g 384 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattctgaa tatggttcgt aaccctatgg catttctcaa tactctttca tctaaaactg 61 aaactagcgg aagtgatagt gctgctggac ttactattcc gcaagatatc cgtactatga 121 ttaacacatt ggttcgccaa tatgactcac tacaacaata tgtacgtgtt gagagtgttt 181 ctacttcaaa cggtagtcgt gtatatgaaa aatggactga tgtaactccg ttgactgtaa 241 tggatgcaga agatggaaaa attcctgatc ttgataatcc acgtttggac aattattaaa 301 tacttgatta aacgttatgc gggaatcatc aatgccaact aatacattgc ttaaagatac 361 agcagaaaat attcttgcat ggttatcaag ctggattgct aagaaagtgg ttgtgactcg 421 taaccaagcg attattgcag caatgggtac agttcctaaa aaaccaacaa tcgctaaatt 481 tgatgatgtt attactatga ttaatacatc tgttgatcct gcgattatcg ccacttcaag 541 tcttttgact aaccagtcag ggttgaataa acttgctttg gttaaaactg ctgaaggtaa 601 atatttgctc gaaccagacc caacaaaacc taattcatat ctaattaaag gtaaaaaagt 661 tattgttgtt gcagatcgct ggcttccaaa tagtggatca acagtttatc cactttacta 721 tggagatatg tcgcaagcta ttacattgtt tgaccgtgaa aacatgtcat tacttccaac 781 aaatattggt gctggtgcat ttgaaactga tactactaaa attcgtgtaa tcgatcgctt 841 cgatgttaaa actgctgact cagaagcttt agttgctggt tcacttactg caattgcaga 901 ccaagtaggt aattttactg caggaaagta ggtaatttat gacagtaact gttgatgact 961 tactagatca gttatcagaa gatgatgatc gcaaaccgca acttcaaatt tatttgatac 1021 agcaaaagca tatgtgaaaa atgcagtgag ttctgataca gttgatgctc catttttcag 1081 tgtagaaaac gtttatccga tttatgatgt agctgttctt agctattcta tggatttgtg 1141 gattaatcgt tctacgacta tgccgcctac tacggctgta gatcacatgg ttggtcagtt 1201 gagaggcct // LOCUS K5TPG2PRO 195 bp ds-DNA PHG 11-JUL-1990 DEFINITION Bacteriophage BK5-T promotor Pg2 DNA. ACCESSION M34489 KEYWORDS . SOURCE Bacteriophage BK5-T DNA from Lactococcus lactis, clone pMU1265. ORGANISM Bacteriophage BK5-T Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 195) AUTHORS Lakshmidevi,G., Davidson,B.E. and Hillier,A.J. TITLE Molecular characterization of promoters of the Lactococcus lactis subsp. cremoris temperate bacteriophage BK5-T and identification of a phage gene implicated in the regulation of promoter activity JOURNAL Appl. Environ. Microbiol. 56, 934-942 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 154 > 195 promotor region mRNA BASE COUNT 71 a 28 c 40 g 56 t ORIGIN 1 agagatttac gaaaagttga gtgctttagc tgaaattgat agacttttcc attggtctag 61 ccatttacat caagaacgat tacaatttgt tagtaaatat ccaaatgtta tggaaaaata 121 cagacaagca aactaaggag ggtatattga atgaccgaca aactaatatc gctggtcatc 181 aaagtgtgtg actgg // LOCUS MUSH2A 1805 bp ds-DNA ROD 11-JUL-1990 DEFINITION Mouse (H-2a haplotype) DNA fragment. ACCESSION D90007 KEYWORDS . SOURCE Mouse (strain B10.A, haplotype H-2a) DNA, clone B10.A.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1805) AUTHORS Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T., Steinmetz,M. and Moriwaki,K. TITLE Recombinational hotspot specific to female meiosis in the mouse major histocompatibility complex JOURNAL Immunogenetics 31, 79-88 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihiko Shiroishi National Institute of Genetics Yata-1111 Mishima, Shizuoka 411 Japan Phone: 0559-75-0771 FEATURES from to/span description site 1 288 MT-consensus rpt 1227 1242 TCTG repeat LTR 1533 1568 Xenotropic solitary LTR sequence BASE COUNT 391 a 400 c 409 g 605 t ORIGIN Chromosome 17. 1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt 61 ggcgccattc ctgggctggc agtcctggtt tctataagaa agcaggctga gcaagtgatg 121 aggacgcccc tccatggcct ctgcatcagc tcctgcctcc agattcctgt cctgatttct 181 tcggtgacta acagctatgt ggaagtgtaa acaggatgaa cgctttcctt cccaggtagc 241 tttggtcctg gggtttcatt gcagtaatag taaccctaga tgggacaaga ctttgatcaa 301 gtgttccctt tcattgtccc cttcctgtag acatgacttc tcttcctata gacagtctct 361 cctctgcttt cctggacatg taattttttt ttttgagaca aggtcattct tgttgtctat 421 tcttgactgg ctttgaattc agaatctgca ggctctgcct ctctggtaac atgtaacatt 481 ttccatatgt aacattttta ccagccattt cccagtaaat gagttacttc atttgaggtt 541 ttgtcttaaa tccccgtgag caatgttttg ttagtttcca aagcacgagg attctaagtg 601 tctatttgtt gctaagttgc caggctgtta cagagcacag tttctgggac cctggctctc 661 tgaaactgac tagggattgc tttagtataa acataaacca ctgggactct ggctctttga 721 aactgactag ggattgcttt agtacaagta taaaccactc agtcctggtc ctacttggct 781 tcaaaagttg aatatcgctt ttggtatttg agatggagat ttaaagatgg aattttatta 841 gtcttctgcc tggttttctt tctttctttg ctcttactgc cttgtggctc agaaccagct 901 gttgcctgtt tgatagtttg tgaccaatac ctgtactgtt aaattggcca tttgagaact 961 caaaaagtcc caacttgtag tgttttcggt ttccatggtc ttagatattt ccactgcaga 1021 caacatcaag ttgccagtgg ttaacaactg tctttcagaa ctctcaagta tttcggtggg 1081 tctgccagcc cttgtaacgt agcgccacgt ggtatatgct tatttgtctg tctgtctgtc 1141 tgttgtgcaa gatgcctgtg tgccctgagg tcagaggaca gcttcaaggg ctctccattc 1201 ttccctgacc acgtggatcc agggaataga actttgacca ttacccacgg gccatgttat 1261 ttcttgacag ttctgttgta catttgtttt agtctttggc tttatttatt tttctcaccc 1321 tcagtttccc tttgtctcag atgctttttt ttttttttta aatcttgcct tgggagatgt 1381 ttcaaactct tggaacgaat gatacagttg tttgattgat agaacgaagc cttccagtgt 1441 gaatgcgttt gcatttcagc ttgttgctgg ctggctgtgt ggtgctggtt cagacatgtc 1501 acaggcttga ggtgttaagg ctaactgagt tcggagagtc cccacctgac cccttctccg 1561 ttcccctcac cagggagacc tccctcctgg ctgcagttga gcagggtgca ccggggctgg 1621 tttcagggca ggctggtagt cttctgactc tgctcactgg ccactttcag ttcctgcttt 1681 ctgaatccta tccagagttc tcagtggtca tcagactctg gagaggacga ggggaagggg 1741 tgggctctta aactatcatt tatatttaaa aaaaattaaa caacagagtt agaagcagat 1801 ccagg // LOCUS MUSH2B 1634 bp ds-DNA ROD 11-JUL-1990 DEFINITION Mouse (H-2b haplotype) DNA fragment. ACCESSION D90008 KEYWORDS . SOURCE Mouse (strain C57BL/10, haplotype H-2b) DNA, clone B10.30. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1634) AUTHORS Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T., Steinmetz,M. and Moriwaki,K. TITLE Recombinational hotspot specific to female meiosis in the mouse major histocompatibility complex JOURNAL Immunogenetics 31, 79-88 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihiko Shiroishi National Institute of Genetics Yata-1111 Mishima, Shizuoka 411 Japan Phone: 0559-75-0771 FEATURES from to/span description site 1 285 MT-consensus rpt 1128 1143 TCTG repeat LTR 1534 1569 Xenotropic solitary LTR sequence BASE COUNT 340 a 373 c 367 g 554 t ORIGIN Chromosome 17. 1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt 61 ggcgccattc ctgggctggc agtcctggct tctataagaa agcaggctga gcaagtgatg 121 acgcccctcc atggcctctg catcagctcc tgcctccaga ttcctgtcct gatttcttcg 181 gtgactaaca gctatgtgga agtgtaaaca ggatgaacgc tttccttccc aggtagcttt 241 ggtcctgggg tttcattgca gtaatagtaa ccctagatgg gacaagactt tgatcaagcg 301 ttccctttca ttgtcccctt cctgtagaca tgacttctct tcctatagac agtctcccct 361 ctgctttcct ggacacggaa tttttttttt tttttttttg agacaaggtc tttcttgtct 421 attctcgact ggctttgaat tcagaatctg cagctctgcc tctctagtaa catgtagcat 481 tttccatatg taacattttt accagccatt tcccagtaaa tgagttactt catttggggt 541 tttatcctaa atccccgtga gcaatgtttt gttagtttcc aaagcacgag gattctaagt 601 gtctatttgt tgccaagttg ccaggctgtt acagagcaca gtttctggga ccctggctct 661 ctgaaactga ctagggattg ctttagtata aacataaacc actgggactc tggctctttg 721 aaactgacta gggattgctt tagtacaagt ataaaccact cagtcctggt cctacttggc 781 ttcaaaagtt gaatatcgca tttggtattt gagatggaga tttaaagacg gaattttatt 841 agtcttctgc ctggttttct ttctttcttt gctcttactg ccttgtggct cagaaccagc 901 tgttgcctgt ttgatagttt gtgaccaata cctgtactgt taaattggcc atttgagaac 961 tcaaaaagtc ccaacttgta gtgttttcgg tttccatggt cttagatatt tccactgcag 1021 acaacatcaa gttgccagtg gttaacaact gtctttcaga actctcaagt gtttcggtgg 1081 gtctgccagc ccttgtaacg tagcgccacg tggtatatgc ttatttgtct gtctgtctgt 1141 ctgttgtgca agatgcctgt gtgccctgag gtcagaggac agcttcaagg gctctgcatt 1201 cttccctgac cacgtggatc cagggaatag aactttgacc attacccacg ggccatgtta 1261 tttcttgaca gttctgttgt acatttgttt tagtctttgg ctttatttat ttttctcacc 1321 ctcagtttcc ctttgtctca gatgcttttt tttttttttt aatcttgcct ctgggagatg 1381 tttcaaactc ttggaacgaa tgatacagtt gtttgattga tagaacgaag ccttccagtg 1441 tgaatgcgtt tgcatttcag cttgttgctg gctggctgtg tggtgctggt tcagacatgt 1501 cacaggcttg aggtgttaag gctaactgag ttcggagagt ccccacctga ccccttctcc 1561 gttcccctca ccagggagac ctccctcctg gctgcagttg agcagggtgc accggggctg 1621 gtttcagggc atgc // LOCUS MUSH2WM7 1630 bp ds-DNA ROD 11-JUL-1990 DEFINITION Mouse (H-2wm7 haplotype) DNA fragment. ACCESSION D90009 KEYWORDS . SOURCE Mouse (strain B10.MOL-SGR, haplotype H-2wm7) DNA, clone SGR.31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1630) AUTHORS Shiroishi,T., Hanzawa,N., Sagai,T., Ishiura,M., Gojobori,T., Steinmetz,M. and Moriwaki,K. TITLE Recombinational hotspot specific to female meiosis in the mouse major histocompatibility complex JOURNAL Immunogenetics 31, 79-88 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihiko Shiroishi National Institute of Genetics Yata-1111 Mishima, Shizuoka 411 Japan Phone: 0559-75-0771 FEATURES from to/span description site 1 288 MT-consensus rpt 1126 1141 TCTG repeat LTR 1533 1568 Xenotropic solitary LTR sequence BASE COUNT 342 a 363 c 370 g 555 t ORIGIN Chromosome 17. 1 acgtctggaa caactttcta aattagtgat tgatagggga gggccaagcc cattgtgggt 61 ggcgccattc ctgggctggc agtcctggtt tctataagaa agcaggctga gcaagtgatg 121 aggacgcccc tccatggcct ctgcatcagc tcctgcctcc agattcctgt cctgatttct 181 tcggtgacta acagctatgt ggaagtgtaa acaggatgaa cgctttcctt cccaggtagc 241 tttggtcctg gggtttcatt gcagtaatag taaccctaga tgggacaaga ctttgatcaa 301 gtgttccctt tcattgtccc cttcctgtag acatgacttc tcttcctata gacagtctct 361 cctctgcttt actggacatg taattttttt tttgagacaa ggtcattctt gttgtctatt 421 cttgactggc tttgaattca gaatctgcag gctctgcctc tctggtaaca tgtaacattt 481 tccatatgta acatttttac cagccatttc ccagtaaatg agttacttca tttgaggttt 541 tgtcttaaat ccccgtgagc aatgttttgt tagtttccaa agcacgagga ttctaagtgt 601 ctatttgttg ctaagttgcc aggctgttac agagcacagt ttctgggacc ctggctctct 661 gaaactgact agggattgct ttagtataaa cataaaccac tgggactctg gctctttgaa 721 actgactagg gattgcttta gtacaagtat aaaccactca gtcctggtcc tacttggctt 781 caaaagttga atatcgcttt tggtatttga gatggagatt taaagatgga attttattag 841 tcttctgcct ggttttcttt ctttctttgc tcttactgcc ttgtggctca gaaccagctg 901 ttgcctgttt gatagtttgt gaccaatacc tgtactgtta aattggccat ttgagaactc 961 aaaaagtccc aacttgtagt gttttcggtt tccatggtct tagatatttc cactgcagac 1021 aacatcaagt tgccagtggt taacaactgt ctttcagaac tctcaagtgt ttcggtgggt 1081 ctgccagccc ttgtaacgta gcgccacgtg gtatatgctt atttgtctgt ctgtctgtct 1141 gttgtgcaag atgccggtgt gccctgaggt cagaggacag cttcaagggc tctgcattct 1201 tccctgacca cgtggatcca gggaacagaa ctttgaccat tatccacggg ccatgttatt 1261 tcttgacagt tctgttgtac atttgtttta gtctttggct ttatttattt ttctcaccct 1321 cagtttccct ttgtctcaga tgcttttttt ttttttttta atcttgcctc tgggagatgt 1381 ttcaaactct tggaacgaat gatacagttg tttgattgat agaacgaagc cttccagtgt 1441 gaatgcgttt gcatttcagc ttgttgctgg ctggctgtgt ggtgctggtt cagacatgtc 1501 acaggcttga ggtgttaagg ctaactgagt tcggagagtc cccacctgac cccttctccg 1561 ttcccctcac cagggagacc tccctcctgg ctgcagttga gcagggtgca ccggggctgg 1621 tttcagggca // LOCUS MUSMHH2IE 576 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Mouse MHC class II I-E-beta-1 (haplotype H2b/K) gene, partial cds. ACCESSION M28408 KEYWORDS cell surface glycoprotein; class II gene; integral membrane glycoprotein; major histocompatibility complex. SOURCE Mouse (strain B10 (3R)) adult spleen (haplotype H2b/k), cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 576) AUTHORS Gorski,J. and Hayes,C.E. TITLE The I-J-disparate mouse strains B10.A(3R) and B10.A(5R) have identical I-E beta sequences JOURNAL Immunogenetics 39, 127-129 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted J.Gorski, 27-SEP-1989. The sequence for Mouse (strain B10.A (5R)) is identical to [1]. FEATURES from to/span description pept < 1 > 576 H2-I-E-beta (AA at 1) BASE COUNT 140 a 149 c 178 g 109 t ORIGIN Chromosome 17. 1 gtcagagact ccagaccatg gtttttggaa tactgtaaat ctgagtgtca tttctacaac 61 gggacgcagc gcgtgcggct tctggaaaga tacttctaca acctggagga gaacctgcgc 121 ttcgacagcg acgtgggcga gttccgcgcg gtgaccgagc tggggcggcc agacgccgag 181 aactggaaca gccagccgga gttcctggag caaaagcggg ccgaggtgga cacggtgtgc 241 agacacaact atgagatctc ggataaattc cttgtgcggc ggagagttga gcctacggtg 301 actgtgtacc ccacaaagac gcagcccctg gaacaccaca acctcctggt ctgctctgtg 361 agtgacttct accctggcaa cattgaagtc agatggttcc ggaatggcaa ggaggagaaa 421 acaggaattg tgtccacggg cctggtccga aatggagact ggaccttcca gacactggtg 481 atgctggaga cggttcctca gagtggagag gtttacacct gccaggtgga gcatcccagc 541 ctgaccgacc ctgtcacggt cgagtggaaa gcacac // LOCUS RATFAPS 1271 bp ss-mRNA ROD 11-JUL-1990 DEFINITION Rat testis-specific farnesyl pyrophosphate synthetase mRNA, complete cds. ACCESSION M34477 KEYWORDS farnesyl pyrophosphate synthetase. SOURCE Rat adult (Sprague-Dawley), cDNA to mRNA, clone TF1.4. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1271) AUTHORS Teruya,J.H., Kutsunai,S.Y., Spear,D.H., Edwards,P.A. and Clarke,C.F. TITLE Testis-specific transcriptional initiation sites of rat farnesyl pyrophosphate synthetase mRNA JOURNAL Mol. Cell. Biol. 10, 2315-2326 (1990) STANDARD simple staff_review FEATURES from to/span description pept 158 1219 farnesyl pyrophosphate synthetase mRNA 1 1271 farnesyl pyrophosphate synthetase mRNA BASE COUNT 329 a 319 c 347 g 276 t ORIGIN 1 ttatatttgg gttctgccta ctgagccggg agtctgggaa ctacaactcc cagagtgctg 61 agcggatgca cgctctgctt ttaggtgtaa gccgcaaaca tcttggaccc cgggagaatc 121 cgcgttgaag cacagagcat ttagctcctc tgtcagaatg aatggggacc agaaactgga 181 tgttcataac caagaaaagc agaatttcat ccagcacttc tcccagattg tcaaggtgct 241 gactgaggat gaactgggac acccagagaa gggagatgct attacccgga tcaaagaggt 301 cctggagtac aacactgtag gaggcaagta caatcggggt ctgacggtgg tacagacctt 361 ccaggaactg gtggaaccaa ggaaacagga tgctgagagc ctacagcggg ccctgacggt 421 gggctggtgt gtagaactgc tccaggcttt cttcctcgtg ttagatgaca tcatggactc 481 ttcccacact cgccgggggc agatctgctg gtatcagaag ccgggcatag gcttggatgc 541 catcaacgat gctctgcttc tggaagccgc tatctaccgc ctgcttaagt tctactgcag 601 ggagcagccc tactacctca acctgctgga gctctttcta cagagttcct atcagactga 661 gatcgggcag actctcgacc tcatcacagc accccagggc caagtggatc ttggtagata 721 cactgaaaag aggtacaaat ctatcgtcaa gtacaagaca gctttctact ctttctacct 781 gcctatcgcg gctgccatgt acatggctgg aattgatggg gagaaggaac acgctaatgc 841 cctgaagatc ctgctggaga tgggcgagtt cttccagatc caggacgact accttgatct 901 ctttggagac cccagtgtga ccggaaaggt cggcactgac atccaggaca acaaatgcag 961 ctggctggtg gttcagtgtc tgctacgagc cactcctcag cagcgccaga tcttagagga 1021 gaattatggg cagaaggacc cagaaaaagt ggcgcgggtg aaagcactgt acgaggagct 1081 ggatctgcgg agtgtgttct tcaagtacga ggaagacagt tacaaccgcc tcaagagtct 1141 catagagcag tgctccgcgc ccctgccccc atccatcttc ctggaactag caaacaagat 1201 ctacaagcgg agaaagtaac ctcgaattgt agaggctgcg agggaggggt ctcaataaat 1261 tattgttcaa c // LOCUS TTHRPEGL 2340 bp ds-DNA BCT 11-JUL-1990 DEFINITION Thermus thermophilus trpL, anthranilate synthase I and II (trpE and trpG) genes, complete cds. ACCESSION X07744 KEYWORDS anthranilate synthase I; anthranilate synthase II; trpE gene; trpG gene; trpL gene. SOURCE Thermus thermophilus (strain HB8 (ATCC 27634) DNA. ORGANISM Thermus thermophilus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 2340) AUTHORS Sato,S., Nakada,Y., Kanaya,S. and Tanaka,T. TITLE Molecular cloning and nucleotide sequence of Thermus thermophilus HB8 trpE and trpG JOURNAL Biochim. Biophys. Acta 950, 303-312 (1988) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description PRM 37 42 pot. -35 region PRM 60 65 pot. -10 region [1] Author address: Sato S., Mitsubishi Kasei, Institute of Life Sciences, 11, Minamiooya Machida-Shi, Tokyo, Japan. Submitted (24-MAY-1988) on tape to the EMBL data library. FEATURES from to/span description pept 72 107 trpL protein pept 169 1557 anthranilate synthase I (trpE) (EC 4.1.3.27) pept 1603 2217 anthranilate synthase II (trpG) BASE COUNT 351 a 764 c 855 g 370 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccgggc cctggagggg cggccccttt agcccctgga cagggccccc gtgtcccgct 61 atcctgaggc catggccctt ccctccgccc tctggtggcc cggctaggcc ccggggcggg 121 aggcctttcc ccggggcaca ccccggggct ttgtttttgg gggacggcat ggagcggatc 181 cgaccttacc gcaaaacctt cctcgcggac ctggagaccc cggtgaccgc ctacctgaag 241 cttgccgaga aggctccggt gagcttcctt ttggagtcgg tggagcgggg gcgccaaagc 301 cgcttctcca tcgtgggggt gggggcgcgg cgcaccttcc gcctgaagga cggggtcttc 361 acggtgaacg gggagcgggt ggaaacccgt gatcccttgc gcgccctcta cgagagggtc 421 tacgccccct tggagcgcca ccccgacctc ccccccttct tcggcggggt ggtgggctac 481 gccgcctacg acctcgtccg ctactacgaa aggcttccga gcctcaagcc cgacgacctc 541 ggcctccccg acctcctctt cgtggagccc gaggtggtgg ccgtctttga ccacctgaag 601 aacctcctcc acctcgtggc cccagggagg gaccccgagg aggcggaggc ccgcctcttt 661 tgggcggaga ggcggctcaa gggccccttg cccggggtgc cgggggagag ggcggggggg 721 agggcccgct tccaggcgga cttttcccgg gaggcctacc tggaggcggt gaggagggcc 781 ctggactaca tccgggcggg ggacatcttc caggtggtcc tctccttgag gctctcctcc 841 cccctcaccg tccacccctt cgccctctac cgggcgctga ggagcgtgaa cccgagcccc 901 tacatgggct acctggacct gggggaggtg gtcttggtct cggcgagccc ggaaagcctc 961 ctccgctcgg acggccgaag ggtggtcacc cggcccatcg cgggcacgag gccgaggggg 1021 aaggacgagg aggaggacaa aaggcttgcc gaggagctcc ttagggacga gaaggaggtc 1081 gcggagcacg tgatgcttct ggacctctcc cgcaacgaca tcggccgggt cgccgccttc 1141 ggcacggtgc gggtcctcga gcccctccac gtggagcact actcccacgt gatgcacctg 1201 gtctccacgg tggagggcat cttggccgag gggaagaccc ccctggacgc cctggccagc 1261 gtgctgccca tggggacggt ctccggggcc ccgaagatcc gggccatgga gatcattgaa 1321 gaactggagc cccaccgccg ggggccctac gggggaagct tcggctacct cgcctacgac 1381 ggggccatgg acatggccct caccctgcgc accttcgtgg tggcgaaggg gtggatgcac 1441 gtccaggcgg gggcggggat cgtggcggac tcggtgccgg agagggagta cgaggagtgc 1501 tggaacaagg cgcgggcgct cctcaaggcg gtggagatgg cggaggcggg gctgtgatcc 1561 caccccatgc cggcaggggc ccggtaagga ggcctggtag gcatggctgc taacggagcg 1621 aaggggagaa aggttatgag ggtcttggtg gtggacaact acgacagctt cacctacaac 1681 ctggtgcagt acctggggga gctcggggcg gagcccatcg tgtggcggaa cgaccgcttc 1741 cggctggagg aggtggaggc cctggacccg gaccggatcc tcatcagccc ggggccttgc 1801 accccctttg aggcggggct ttccgtcccc ttggtccagc gctacgcccc ccgctacccc 1861 atcctggggg tctgcctcgg acaccaggcc atcggggcgg ccttcggggg gaaggtggtc 1921 cccgcccccg tcctcatgca cggcaaggtg agccccatcc accacgacgg caccggggtc 1981 ttccgggggc tagatagccc cttccccgcc acccgctacc actccctggc ggtggtggag 2041 gtgccggagg ccctcgtggt gaacgcctgg gcggaggagg cgggggggcg gacggtgatg 2101 ggcttccgcc accgggacta ccccacccac ggggtgcagt tccacccgga aagctacctt 2161 acggaggcgg gtaaactcat cctcaagaac ttcctggagg acccatggac gcggtgaaga 2221 aggccattct gggcgaggtt ttggaggaag aggaggccta cgaggtcatg cgggccctga 2281 tggcggggga ggtctccccg gtgcgggcgg cggggctttt ggtggccttg agcctgaggg // LOCUS XELGBBBLI 6777 bp ds-DNA VRT 11-JUL-1990 DEFINITION X.laevis beta-L-I globin gene, upstream region. ACCESSION M34470 KEYWORDS beta-L-I. SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 6777) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta- globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD simple staff_review BASE COUNT 2121 a 1247 c 1228 g 2181 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcaaag cttttttttt tattaaacag ttttattgta ttttcaaacg aaaaacaagc 61 agaggtaaga cagtcaacag gttaacatta atgctgcgtg aagggtacta tacattgtgt 121 ttacattaca acttgttgga cattgatatg tcacttctgt gaatttgaag ctttacattt 181 aactaaaatt tgtgatggaa tgtctaacct gcatcccagt ccaaagaaat ttcaaagtag 241 aagatgacat aatgttggta gagatagtga tgagcggatt ttttgccagg tatggatttt 301 ggcaaaattc cgcgcttctt cgtctgcaat tttttttccc aaaactgcag caaaaatcca 361 ccataaccaa aaagtcacaa agacaaaatt gtcgcagaga caagaaagtc acagtaagac 421 ttgatgctcc tgattcactt gcactgacca caccactgta ttaaagggca gagaggggac 481 tataatgcag agacccatgg tccaggctcc tatgaccatg gggtctgctg tatagctgag 541 ctagttacac cagtaaggcc aaaacaaaat ggacttgcat gctggaataa tatgcaaaac 601 tgctgcagtg cctgttttta atctgctggg taaagagtgc aattacaaat gttaggataa 661 ttgcttatca tactctgctg catacactta ggggcccatt tacttagctc gagtgaagga 721 atagaggaaa aaaacttaga atttcgattg ttttttttgg ctacttcgac ttcgacctta 781 gacttcgaat cgaacgattc aaactaaaaa tcgtttgact attcgaccat tcgatagtca 841 aagtactgtc tctttaagaa aaaactttga ccacctagtt cgccacatta aagctaccga 901 agtcaatgtt agcctatggg gaaggtcccc atatgctttg ctagcttttt ttggtcaaaa 961 ataaaccatt cgatcgatgg attaaaatcc ttcgaatcga tcgaacgaat aatgctaaat 1021 cctttgactt cgatattcga actcgaagga tttaacttcg acagtcgaaa atcgagggtt 1081 aattaaccct cgatattcga ccttaagtaa atttgcactt attattgcaa atatttgggt 1141 ccatgacaga gtcatctgta tataatgtga aattacaaat actggtgcct cccctgtttt 1201 actttgctct atgtgagaaa aataatggag tcagtgccat acatatcctt gtgtgtatgg 1261 tggaaattgt agatgtcttg ggggcaaatt tactaaaggg cgaagtggct aacgctaggg 1321 aaaattcgcc agcgttacgt caatttgcca cttcgacaat ttagtttacg gttaccatgg 1381 cgaaaattcg ctagcaatgt aaatagacca gcgcaacttc acaccctaac gctggcgaag 1441 tcaggatgcc cacattcccc ctacatttcc taacatatgg cacctgaatt atactagggg 1501 cacatgtgta gggctttttt taagtttccc tgggcctctg tagtgttatg tatttgctgc 1561 agcaatatac atgtatacaa atttccaatc ggtagcgtaa cctcgaaccg ctgatcgtaa 1621 catcactagc gcaacttcgc aaatgattgg taacttgtgt gcaacttcgg atcttcgtga 1681 atttgcgcag ccactgcgaa gctatgcctg gcgaagtgcg gcgaatgcaa gtctcgggat 1741 ctccgcaggt aagtaaattt gccccatggt cagaggcaag gccagattat gtactaggtg 1801 acctaagaat caatactgtc cattctaaaa gtgcaagttc ataagtgccc gcaactacag 1861 aaacaatagg ggagaactaa caatctgttg taaacaacat tacaaggttg gctccctcat 1921 tgtttatatt atagctgtat aactgtaatg atgagtacga tctaagatat aatgaatctt 1981 attgcaggca aaacaatcct gttgattaat taatgcttaa attatcagaa attacaaaaa 2041 cctcaggtcc tgtgcattct ggataacagg tcccatacct gtactaaaac atgggaccag 2101 ggtgtctgca ttgatcaatc acctctttta tgattgtttt gggccatcac tctacttcaa 2161 gatgctgatg atatattacc aataaatgtt atattatata cttaaaaatc ttaattgaat 2221 taatatagtc aaatccttga tggagacaga cctagtagta tcatggataa taaaactagc 2281 aacagcaagc attggcccga cttgccatct tggagtcttg aaggaatctt ccacctttga 2341 ggaaaattgg agacagcttg tctatttttc aacctcttct aatatctaat tgaagaagat 2401 ccttacatac tgtatgtggt ggaaaatgca tgtttcttta aagatatgct gattgttgca 2461 ccaatctttg ctcaaagatc ttataagaaa tctttaagca tgactgtctg caactatgac 2521 tattataaaa tcctttccat gtagagtttt catccttttt gtgggtcaaa ggctgcccct 2581 cagcaatatc aggggaatga aattaaagtc acaaagagca aaacaattcg caccaatagg 2641 actaaaaatc cacatctcgc aatgcaatat tgttccttaa actgttattg taattgcgaa 2701 ttttaattgg ccattgcgga ttttaattgc gcactcttaa gaagtgcttg aagttgtcgt 2761 aatcttttgg agcaaacata acgacttttt cattaagagg tttaattaca ttgacgcatt 2821 ggcgcaaact ataaaatttg caaatggtct tccactgtcg gaagtggtcg caaaacagtt 2881 tctgggctcg caaaagctat attaaatttg cgaaagcaaa atgtgttcgc gcaaaggtat 2941 aacttttgca ttgcgaatag ttttccgtta gcaactttta ttgcattccc ctgtaaatat 3001 ctaataagca tggcctcgag cccaaaagac acctttttag gtaaagaaat aaatggggat 3061 ttcattctat aagtaattga atttgcacta aatattagta agtcggtttc ttgccctact 3121 ccaaccaaac tcaagaactt tcatttatta aagcacaaga aactctaact cacatattaa 3181 caaatagtta tagttggtca aattgtagct cagttaaggg tatattatat atttctgttt 3241 gttcgggtgt ggtgggccag tttttgaaac agtcaactgt tttacttaca gcagatgtcc 3301 aggtggcttg ccatatcttt gtcaaaaaca aatatattgt cagtattgtt ttttcaacat 3361 ctgccttagt tagataagaa ttgacaatat agaaccaagg gtatctaaaa atgctgctct 3421 gccttgcaat ctaatggtgg gtgggtccaa tgatttgtgt atttgcctga aaaaaaggga 3481 atattgttct ctcccttacc ttttttccca aagaaattgt ttcttttaat gtgtccaaaa 3541 tacagcaact tcagtcttgt gatttgagct tcaagtgaga tcagagacat gatttgctca 3601 gggatccatt tgtttgtgtt cctttcttcc acagtattct caaaagtctt tactaaaacc 3661 aaaatttatt agtacatttc cttgtactgc cacatttaca tctattaaga gtgacatcaa 3721 atactataac tggacaattc ccaaagtaac ctcagtacat gttaaaatat cgttgacgtc 3781 ttccatgtct cattctaagt gtcaatctgc tacttgacta taagattttt gttgtttata 3841 agtgacccag taaggcaaaa gctatacata actagctgcc cacaaactgg ccaatataaa 3901 gggagaagga aaattgttgt tcccactgga gttgttcccc tggttgggga aaaaatacta 3961 ttttgtatac aaaatgctgt tctggggtca ccaggagaga gcttctgatg ttcagggcca 4021 ggtagtgaca taagcctgag aataagactt aggggcacat tcaagctcgg gtgaatgaat 4081 agagggaaaa aaactcctcg actatcgaat tggcgtaaat tcgcctgagt agaatgattc 4141 aaatagattg agcgaaaaaa cgctgcgact attcgcccat cgatagtcga agtattgtct 4201 cttttaaaaa tcatttgact gcctacttcg ccagataaaa cctaccgaat tgctttaaaa 4261 gcctatggga aagtcccata ggcttctttt ctacgttttt gatcgaataa aaaggcattc 4321 gatcgaatat tcgatcgaat gaaaatcctt cgattgaata ttcgatcgtg cccattcgat 4381 tattcgccag cgcgtaaatt tgcccgaatt ccctattcga ttccattctc cagtcgaatt 4441 tcgagggatt taacccctcg aaattcgacc cttgatacat ctgcccctta gtgtgccaac 4501 ttgctcattg tgtgcatgtg tgtgacatgc cataaggctc tcttattaag cgcatgtatg 4561 tgatgaaaca taaccatccc cactgggagc tccttcatgg tttagcagaa tagcgctcac 4621 taccagcttt ttattcaaaa actgatattg tttccctcaa ccagagtata agctctatta 4681 gcttgcacca tcagtggggg aatttttttt cccctattag gtttccttta agctgcaaac 4741 ttgacctctc cttcccatct gcagtatatt gaccaatata agggaccaac cccacagtaa 4801 gatatctatt gtgtatgttt caaaatccca ttaggtaagg acagtacatt tatgtggtcc 4861 ctataggccc tcattatgat ctaattattg ggtcaatccg tcgtttttgg tacagtggtg 4921 ccagccttga actagagtgg taaaagaggg ctttgttggc tctttgagca tatcatagag 4981 ccttcagcaa aagttcactt tttaaatgta caccaatgaa tggagatttt tgaggccccc 5041 aaaattgtat tgctgtagat cctgcaacag ccaatgatcc ctttatctgc tctgaaatct 5101 tttttgtcgc tgctgctgct actggttaaa tacagtatag ttgaaaaaat ataggctttg 5161 agaataaaac ctgatgttca tttgcttttt aattattact ttacatcccc tttaaaaata 5221 tatacacatc actattccat gcattacact catttttaat tagacaaatc tataagaaat 5281 tctgcgagat gacacttttc atgataagca ttttgtaaaa ttgtaatatg ttcagttttt 5341 ttttttaaaa gttcaatgcc acactttatt tcaaaatgta ttaaggtgca gtaattatat 5401 taaataaatg tattgtaggg tacatgaata tatgtaacat ttaaaatgtg tgtttatgca 5461 cttctttcaa gtacagtaca tttgcactgt gatcaaatat taatttgaac tttaacagtc 5521 ctatctctac acctttatct tgtcctgggg atcagtctgt tttttagtga tatcttgtaa 5581 cacagaactt taaacaaaag ggctccgttt tgcacgtaga cctgtttgtg aatccatggc 5641 aattctgcca cctaaagcat acataacatt tagcatcttt ttttggtgtt ttttagacag 5701 atgatggtat agccatttgt gcaaataaaa tcagatattt tatcccaaat tatttgtgct 5761 gttagttgta tagggtttca acaaaatatc ttatttatca tttagagcaa atacttatgt 5821 gttacagtat ctgcaagtag tcaagtttga gcttaaaatt cccataattc ataattaagg 5881 ggatggctta gtataaaaaa acgtggaaaa aaaaacgtgt acagttatgc ttttatattg 5941 ccttgtaagt tcttttttat actattatta ttttaatgac cacgttttga attattgcat 6001 ggatttatga aaaccagttt aattgcaaag aggctcctaa aaattattta ttataagtta 6061 aaatttagta tatgcgtgca tgtatatgta acaatgcact ctcatatcta gtaaaaatca 6121 aagttgaagt aaagtgtata actaagtttg acctttctca ggcattaatg atcccagagg 6181 aaggccacac tatgtgacca aaacattgga ctacatttat taaatacatt taccttgatt 6241 tcttcaacac aatttgaaag ttcctccatg agctaatata aatttataaa gagagagagt 6301 gagtaaaaca tttttatcag aaaacagtgg cagagtaaat tctttcatac ttacaaaaga 6361 gtgctactat gcgcaacatt aacttgacat ttttgaattg tacctaatgc aattcatgat 6421 atttaaattg aatacattaa ttttaattat ttaattgtcc tgaaatctct acaggttcaa 6481 aaaaataatt ccatttatta catttatttt gtacacttaa ttatctactg ttaagtgtca 6541 caattgccct catttgatgt gggtttaagt ttcatgttgt tataaagaat caactttaca 6601 atttaagaac tatatggcat tccacatata caaaagatat attagcttaa ggttaaaaat 6661 ttattttgaa ggcaataggg tggggtggag gaaaaaaaat atgacacagc agaaatgcac 6721 aatgggtgtg actcagcatg gccatataaa gcaaggccaa caactcaaag gaacagc // LOCUS XELHBBBAI 2027 bp ds-DNA VRT 11-JUL-1990 DEFINITION X.laevis beta-A-I globin gene, upstream region. ACCESSION M34471 KEYWORDS beta-A-I. SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2027) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta- globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD simple staff_review BASE COUNT 792 a 266 c 284 g 685 t ORIGIN 1 bp upstream of BglII site. 1 agatcttgat acgttaactt tactagaaaa taatttaaac cccaatagcc tggttttgct 61 tccaatatgg tttaattata ccttagtttt caggataatg gatctttctg taatttggat 121 cttcatgcct taactgtacc agaaaatcat ttaaacttta aataaaccca atttgcttcc 181 agtacagttt aattatatct tagtttggat aagtacaagg tactgtttta ttattacagt 241 gaaaaaggta atcattttaa aaaaaaaata tatattattt ggataaaatg gagtctatgt 301 gtgatggcct ttccgtaatt ctcggtttct ggcaaacgga tctcatacct gtaataggta 361 tataaaaaac acacattaaa aaatactaca tatatattta tattcttttt tttttttaaa 421 gtgtgtaaat tcatgtcttt aaaataataa aatgtattta tatatatata tatatatata 481 tatatatata tatatatata tatatatata tatatatata tacttcaaca aaaaatttgc 541 caaattcata catacaaaaa aaataaaata ataattttaa ataattgaat ctgtctagct 601 gtttatattc tctgctctgc tggatctgac tcctgaaaaa atgtgcagaa gccatttgat 661 ttacagagct ggaggagaat ggctacatta gtttaaaagc cagaaccagg agaggatgca 721 ggcaacaaaa atggatacac acaaattaac gtctattaca attatattta caaataacct 781 taaagccaac ttttttaaaa attattatat attgtaaagt tgcttagaaa ccaatttttt 841 acttataggg agcaaaaaat agggagatcc tgtaaaacag aagctgcacc aaacatagat 901 caagctatcg agctttccat acgtatacat ttatttgaaa ggcactgtta aggagccacg 961 gtgctgtaca gtgcataaaa gtacaatata tatatataaa agtatacaca gggaagacaa 1021 atcacacaat gaatatacac agagctcata tcagaacaaa cagcttaagt gctttgtggt 1081 aagagacaca gtgggaagga ggtccctgtc ccgtagagct tacagtctta cagagctcag 1141 ttcaatccat atgactcaac catttaatta ataaaataat ttgcctttta atcattaatt 1201 aattccacac ttccatgtat aaatggaata tatatgtaag atttatatgt aatagctata 1261 tatgtaagat ttgatatttt ttttgtagga ataaaatgaa aatcaggcaa ataaaaaaca 1321 acatatatat gtttaaaaac ggtgttaatt tctatgcaac atgacatgaa aaagactttt 1381 caatattttt acatatgtat acataagata tatgattgaa ccatttgaat aataaaatag 1441 cttttgcctt accatcatta attattctac catactgtat gtaaaaagca ttgctatatg 1501 taagatttga ttatattttg ttgtaggaat aaaatgaatt ccaggcatat aaaaacacat 1561 ttataaaaaa catttataaa aaacactaca tatacatata catatatata tatatatata 1621 tatatatata tatatatatg tatatatata agtttaaaaa gtgtgttaat ttataatgtc 1681 tttctggaaa tagaatttca cacttcattg tatacaaaat tattaatatt tgtaatattt 1741 gattatatta tgttgtaggg ataaaatgaa taccaggcat ataaaaacac actttaaaaa 1801 aaaaaaaata catagataat aatttaataa tttgtattta ttttttctta atattctagc 1861 tctgctgtaa taaaaaaaac atgcatctaa aagtggtgcc aaatgggagg gtacaaatgg 1921 gctgggcaaa tgtaacgtgt gcttatccta gccaatcaac aggcagagtg gaaaggggca 1981 gtgcatcctt acagctacat aaagtctgat ggatggagaa ttagagc // LOCUS XELHBBBLII 910 bp ds-DNA VRT 11-JUL-1990 DEFINITION X.laevis beta-L-II globin gene, upstream region. ACCESSION M34472 KEYWORDS beta-L-II. SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 910) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta- globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD simple staff_review BASE COUNT 322 a 152 c 128 g 308 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccttg tctggaaacc agttatccag agggctccaa attatggaaa ggccatctcc 61 catagactca attttaatca aattattatt attttttttt tacaaattaa tgcctttttc 121 aatgtattaa taaaacagta ccttgatccc aaaattggag gcaaaacaat ccagtttgtt 181 ttatttaatg tttaaatatt ttttttaata ttgttttgat ccaaattaca gaaaggcccc 241 ttatccagaa aaacctccat ttaggataag gataacaggt ccaatacatt cataccctgt 301 acaaatctat gctatgttta attacttata aatagatcca catttcaatg gatatttcta 361 gaatatcgta ataacggtat atacttgttc aaagacaaac acatttaatg acctatgcct 421 aactggaata acagtcaagg aaatttaatg gaataatagg tatttcggag ctttccattt 481 attaacccta caaacaacta gttgttgttt caggaaacag cagtagttct atttggctta 541 catcttgaac aaaagcaaag ttgctatagt tttctttttc gtgtaaggaa agaaatgact 601 tgtgtcttta tctctacatt aaaaatgtat ctgccacaca gaatactttc tttttttaac 661 ttatctatag ataacgtatg tgcacccaaa ttgtagctgt gttacatcag cataattaag 721 tgcacacatg aagaaaaaaa atgacagatt gacaaaatgt tatattatat ggtaaggtct 781 cttggataat agcccttatc agtcataact ggttacaaat acagaaaaaa tgaggtgaca 841 cagcataaat gatatgaata cgtcactaac ttacacccct ataaatcaca aggttaaaat 901 attttttttt // LOCUS CLONEUR 4835 bp ds-DNA BCT 11-JUL-1990 DEFINITION C.botulinum neurotoxin gene, complete cds. ACCESSION M30196 KEYWORDS neurotoxin. SOURCE C.botulinum (strain 62A, subtype A) DNA. ORGANISM Clostridium botulinum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 4835) AUTHORS Binz,T., Kurazono,H., Wille,M., Frevert,J., Wernars,K. and Niemann,H. TITLE The complete sequence of the botulinum type A neurotoxin and its comparison with other Clostridial neurotoxins JOURNAL J. Biol. Chem. 265, 9153-9158 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Niemann, 29-NOV-1989. FEATURES from to/span description pept 358 4248 neurotoxin mRNA 240 > 4835 neurotoxin mRNA signal 344 349 ribosome binding site site 4400 4432 potential terminator BASE COUNT 1934 a 517 c 756 g 1628 t ORIGIN 1 aagcttctaa atttaaatta ttaagtataa atccaaataa acaatatgtt caaaaacttg 61 atgaggtaat aatttctgta ttagataata tggaaaaata tatagatata tctgaagata 121 atagattgca actaatagat aacaaaaata acgcaaagaa gatgataatt agtaatgata 181 tatttatttc caattgttta accctatctt ataacggtaa atatatatgt ttatctatga 241 aagatgaaaa ccataattgg atgatatgta ataatgatat gtcaaagtat ttgtatttat 301 ggtcatttaa ataattaata atttaattaa ttttaaatat tataagaggt gttaaatatg 361 ccatttgtta ataaacaatt taattataaa gatcctgtaa atggtgttga tattgcttat 421 ataaaaattc caaatgcagg acaaatgcaa ccagtaaaag cttttaaaat tcataataaa 481 atatgggtta ttccagaaag agatacattt acaaatcctg aagaaggaga tttaaatcca 541 ccaccagaag caaaacaagt tccagtttca tattatgatt caacatattt aagtacagat 601 aatgaaaaag ataattattt aaagggagtt acaaaattat ttgagagaat ttattcaact 661 gatcttggaa gaatgttgtt aacatcaata gtaaggggaa taccattttg gggtggaagt 721 acaatagata cagaattaaa agttattgat actaattgta ttaatgtgat acaaccagat 781 ggtagttata gatcagaaga acttaatcta gtaataatag gaccctcagc tgatattata 841 cagtttgaat gtaaaagctt tggacatgaa gttttgaatc ttacgcgaaa tggttatggc 901 tctactcaat acattagatt tagcccagat tttacatttg gttttgagga gtcacttgaa 961 gttgatacaa atcctctttt aggtgcaggc aaatttgcta cagatccagc agtaacatta 1021 gcacatgaac ttatacatgc tggacataga ttatatggaa tagcaattaa tccaaatagg 1081 gtttttaaag taaatactaa tgcctattat gaaatgagtg ggttagaagt aagctttgag 1141 gaacttagaa catttggggg acatgatgca aagtttatag atagtttaca ggaaaacgaa 1201 tttcgtctat attattataa taagtttaaa gatatagcaa gtacacttaa taaagctaaa 1261 tcaatagtag gtactactgc ttcattacag tatatgaaaa atgtttttaa agagaaatat 1321 ctcctatctg aagatacatc tggaaaattt tcggtagata aattaaaatt tgataagtta 1381 tacaaaatgt taacagagat ttacacagag gataattttg ttaagttttt taaagtactt 1441 aacagaaaaa catatttgaa ttttgataaa gccgtattta agataaatat agtacctaag 1501 gtaaattaca caatatatga tggatttaat ttaagaaata caaatttagc agcaaacttt 1561 aatggtcaaa atacagaaat taataatatg aattttacta aactaaaaaa ttttactgga 1621 ttgtttgaat tttataagtt gctatgtgta agagggataa taacttctaa aactaaatca 1681 ttagataaag gatacaataa ggcattaaat gatttatgta tcaaagttaa taattgggac 1741 ttgtttttta gtccttcaga agataatttt actaatgatc taaataaagg agaagaaatt 1801 acatctgata ctaatataga agcagcagaa gaaaatatta gtttagattt aatacaacaa 1861 tattatttaa cctttaattt tgataatgaa cctgaaaata tttcaataga aaatctttca 1921 agtgacatta taggccaatt agaacttatg cctaatatag aaagatttcc taatggaaaa 1981 aagtatgagt tagataaata tactatgttc cattatcttc gtgctcaaga atttgaacat 2041 ggtaaatcta ggattgcttt aacaaattct gttaacgaag cattattaaa tcctagtcgt 2101 gtttatacat ttttttcttc agactatgta aagaaagtta ataaagctac ggaggcagct 2161 atgtttttag gctgggtaga acaattagta tatgatttta ccgatgaaac tagcgaagta 2221 agtactacgg ataaaattgc ggatataact ataattattc catatatagg acctgcttta 2281 aatataggta atatgttata taaagatgat tttgtaggtg ctttaatatt ttcaggagct 2341 gttattctgt tagaatttat accagagatt gcaatacctg tattaggtac ttttgcactt 2401 gtatcatata ttgcgaataa ggttctaacc gttcaaacaa tagataatgc tttaagtaaa 2461 agaaatgaaa aatgggatga ggtctataaa tatatagtaa caaattggtt agcaaaggtt 2521 aatacacaga ttgatctaat aagaaaaaaa atgaaagaag ctttagaaaa tcaagcagaa 2581 gcaacaaagg ctataataaa ctatcagtat aatcaatata ctgaggaaga gaaaaataat 2641 attaatttta atattgatga tttaagttcg aaacttaatg agtctataaa taaagctatg 2701 attaatataa ataaattttt gaatcaatgc tctgtttcat atttaatgaa ttctatgatc 2761 ccttatggtg ttaaacggtt agaagatttt gatgctagtc ttaaagatgc attattaaag 2821 tatatatatg ataatagagg aactttaatt ggtcaagtag atagattaaa agataaagtt 2881 aataatacac ttagtacaga tatacctttt cagctttcca aatacgtaga taatcaaaga 2941 ttattatcta catttactga atatattaag aatattatta atacttctat attgaattta 3001 agatatgaaa gtaatcattt aatagactta tctaggtatg catcaaaaat aaatattggt 3061 agtaaagtaa attttgatcc aatagataaa aatcaaattc aattatttaa tttagaaagt 3121 agtaaaattg aggtaatttt aaaaaatgct attgtatata atagtatgta tgaaaatttt 3181 agtactagct tttggataag aattcctaag tattttaaca gtataagtct aaataatgaa 3241 tatacaataa taaattgtat ggaaaataat tcaggatgga aagtatcact taattatggt 3301 gaaataatct ggactttaca ggatactcag gaaataaaac aaagagtagt ttttaaatac 3361 agtcaaatga ttaatatatc agattatata aacagatgga tttttgtaac tatcactaat 3421 aatagattaa ataactctaa aatttatata aatggaagat taatagatca aaaaccaatt 3481 tcaaatttag gtaatattca tgctagtaat aatataatgt ttaaattaga tggttgtaga 3541 gatacacata gatatatttg gataaaatat tttaatcttt ttgataagga attaaatgaa 3601 aaagaaatca aagatttata tgataatcaa tcaaattcag gtattttaaa agacttttgg 3661 ggtgattatt tacaatatga taaaccatac tatatgttaa atttatatga tccaaataaa 3721 tatgtcgatg taaataatgt aggtattaga ggttatatgt atcttaaagg gcctagaggt 3781 agcgtaatga ctacaaacat ttatttaaat tcaagtttgt atagggggac aaaatttatt 3841 ataaaaaaat atgcttctgg aaataaagat aatattgtta gaaataatga tcgtgtatat 3901 attaatgtag tagttaaaaa taaagaatat aggttagcta ctaatgcatc acaggcaggc 3961 gtagaaaaaa tactaagtgc attagaaata cctgatgtag gaaatctaag tcaagtagta 4021 gtaatgaagt caaaaaatga tcaaggaata acaaataaat gcaaaatgaa tttacaagat 4081 aataatggga atgatatagg ctttatagga tttcatcagt ttaataatat agctaaacta 4141 gtagcaagta attggtataa tagacaaata gaaagatcta gtaggacttt gggttgctca 4201 tgggaattta ttcctgtaga tgatggatgg ggagaaaggc cactgtaatt aatctcaaac 4261 tacatgagtc tgtcaagaat tttctgtaaa catccataaa aattttaaaa ttaatatgtt 4321 taagaataac tagatatgag tattgtttga actgcccctg tcaagtagac aggtaaaaaa 4381 ataaaaatta agatactatg gtctgatttc gatattctat cggagtcaga ccttttaact 4441 tttcttgtat cctttttgta ttgtaaaact ctatgtattc atcaattgca agttccaatt 4501 agtcaaaatt atgaaacttt ctaagataat acatttctga ttttataatt tcccaaaatc 4561 cttccatagg accattatca atacatctac caactcgaga catactttga gttgcgccta 4621 tctcattaag tttattcttg aaagatttac ttgtatattg aaaaccgcta tcactgtgaa 4681 aaagtggact agcatcagga ttggaggtaa ctgctttatc aaaggtttca aagacaagga 4741 cgttgttatt tgattttcca agtacatagg aaataatgct attatcatgc aaatcaagta 4801 tttcactcaa gtacgccttt gtttcgtctg ttaac //
GenBank-Updates@genbank.bio.net (07/12/90)
LOCUS BOVB1A 781 bp ss-mRNA MAM 12-JUL-1990 DEFINITION Cow beta-crystallin (p-Beta 25/23) mRNA, complete cds. ACCESSION M33010 KEYWORDS beta-crystallin; crystallin. SOURCE Cow lens cortex, cDNA to mRNA, clone p-Beta 25/23. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 781) AUTHORS Gorin,M.B. and Horwitz,J. TITLE Cloning and characterization of a cow beta crystallin cDNA JOURNAL Curr. Eye Res. 3, 939-948 (1984) STANDARD simple staff_review FEATURES from to/span description pept 6 653 beta-crystallin BASE COUNT 217 a 185 c 187 g 192 t ORIGIN 1 tccagatgga gacccagact gtgcagcagg agctgaaatc ccttccaacc accaagatgg 61 ctcaaactaa ccccatgccg gggtctgtgg ggccatggaa gattaccatc tatgaccagg 121 agaacttcca gggcaagaga atggaattca ccagctcctg cccaaatgtc tctgagcgca 181 attttgacaa cgtccggtct ctcaaggtgg aatgtggcgc ctgggttggt tatgagcata 241 ccagcttctg tgggcaacag tttgtcctgg agagaggaga gtaccctcgc tgggatgcct 301 ggagcgggag taatgcctat cacattgagc gcctcatgtc cttccgcccc atctgttcag 361 ctaatcataa ggagtctaag attacaattt ttgagaaaga aaatttcatt ggacgccaat 421 gggaaatctg tgatgactac ccctccttgc aagccatggg ttggcccaac aacgaagttg 481 gctctatgaa gatacaatgt ggagcctggg tttgctacca gtatcctggg taccgtggct 541 atcagtatat cttggaatgt gaccatcatg gaggagacta caaacactgg agagagtggg 601 gttctcatgc ccagacttcc cagattcaat ccattcgccg tatccaacag tagtggatta 661 aaagctccaa gtaagaattc ctcaagcatg agaccttcct aaacaatcta gaataaaata 721 tatgttctgc tgatattgct tccaaatgtt agctgctgaa atccacaata aatgtcatta 781 a // LOCUS CFICENB 439 bp ds-DNA BCT 12-JUL-1990 DEFINITION C.fimi endoglucanase B (cenB) gene, 5' end. ACCESSION M33026 KEYWORDS endoglucanase; endoglucanase B. SOURCE C.fimi DNA. ORGANISM Cellulomonas fimi Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 1 to 439) AUTHORS Owolabi,J.B., Beguin,P., Kilburn,D.G., Miller,R.C.Jr. and Warren,R.A.J. TITLE Expression in Escherichia coli of the Cellulomonas fimi structural gene for endoglucanase B JOURNAL Appl. Environ. Microbiol. 54, 518-523 (1988) STANDARD simple staff_review FEATURES from to/span description pept 275 > 439 endoglucanase B (cenB) precursor sigp 275 373 endoglucanase B signal peptide matp 374 > 436 endoglucanase B BASE COUNT 58 a 173 c 154 g 54 t ORIGIN 1 ggatcccgcg cccggcgcga gcccgcaacc cacgcgccca cggatcgggc ctcacgagcc 61 cgacgttggc ggccgggccg gggggcgacc tcgagaccga ggagcccccg cgtgaggcga 121 cgttggccgc gcacgccgct ggtgagcggg ctgaatcgtt tagggcgttg acctgcggac 181 ggacccgtct ggacgatgcg ccaggcgtcg tgcgggtgcg actgcggaca gcacgggtcg 241 ccgaccacca ctcccgtgcc cggaagagga ccccatgctc cgccaagtcc cacgcacgct 301 cgtcgcgggt ggctccgccc tcgccgtcgc cgtcggggtg ctcgtcgccc cgctcgcgac 361 cggcgcggcc gccgcgccca cctacaacta cgccgaggcc ctgcagaagt cgatgttctt 421 ctaccaggcg cacggctcc // LOCUS RATLACTAS 250 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat lactase-phlorizin hydrolase mRNA, partial cds. ACCESSION M34730 KEYWORDS lactase; lactase-phlorizin hydrolase. SOURCE Rat (strain Sprague-Dawley CD) newborn, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 250) AUTHORS Bueller,H.A., Kothe,M.J.C., Goldman,D.A., Grubman,S.A., Sasak,W.V., Matsudaira,P.T., Montgomery,R.K. and Grand,R.J. TITLE Coordinate expression of lactate-phlorizin hydrolase mRNA and enzyme levels in rat intestine during development JOURNAL J. Biol. Chem. 265, 6978-6983 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 250 lactase-phlorizin hydrolase (AA at 2) BASE COUNT 66 a 62 c 66 g 56 t ORIGIN 1 agaaaggatc ttctaccaca aaacctatat caacgaggct ctgaaagcct acaagctgga 61 tggtgtggac cttcgagggt actctgcctg gacgctgatg gacgacttcg agtggctgct 121 tggctacacc atgagatttg gattgtatca cgttgacttt aatcatgtga gcagacctcg 181 cacagcaaga gcctcagcca gatactatgc agaggtcatt gccaacaatg gcatgcccct 241 ggccgggaag // LOCUS BOVARRB 1945 bp ss-mRNA MAM 12-JUL-1990 DEFINITION Cow beta-arrestin mRNA, complete cds. ACCESSION M33601 KEYWORDS beta-arrestin; inhibitor. SOURCE Cow adult brain cortex, cDNA to mRNA, clone pBARRESTIN-1/1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (sites) AUTHORS Lohse,M.J., Benovic,J.L., Codina,J., Caron,M.G. and Lefkowitz,R.J. TITLE Beta-arrestin: A protein that regulates beta-adrenergic receptor function JOURNAL Science 248, 1547-1550 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1945; for [1]) AUTHORS Lohse,M.J., Benovic,J.L., Codina,J., Caron,M.G. and Lefkowitz,R.J. JOURNAL Unpublished (1990) See COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.J.Lohse, 06-APR-1990, for release after publication. Author address [1]: M.J.Lohse Howard Hughes Medical Institute, Box 3821, Duke University Medical center Durham, NC 27710 FEATURES from to/span description pept 97 1353 beta-arrestin BASE COUNT 419 a 590 c 556 g 380 t ORIGIN 1 gttccgggaa ccggctggcc cgcgcccctc ctgtcggccg gggattttcc agcctgggcg 61 ctgacgccgc ggacctcccc gcggccgcct cggaccatgg gcgacaaagg gacgcgggtg 121 ttcaagaagg cgagccccaa tggaaagctc accgtctatc tgggaaagcg ggactttgtg 181 gaccacatcg acctcgtgga gcccgtggat ggagtggttc ttgtggatcc ggagtatctc 241 aaggagagga gagtctatgt gacgctgacc tgcgccttcc gctacggccg ggaggacctg 301 gatgtcctgg gcctgacctt tcgcaaggac ctgtttgtgg ccaacgtgca gtctttcccg 361 ccggcccctg aggacaagaa gcccctgacg cggctgcagg agcgcctcat caagaagctg 421 ggcgagcatg cctacccttt cacctttgag atccctccga acctcccatg ctctgtgact 481 ttgcagccgg gacctgaaga tacagggaag gcctgcggtg tggactacga agtgaaagcc 541 ttctgtgcgg agaacctgga ggagaagatc cacaagcgga attctgtgcg cctggtcatc 601 cggaaggttc agtatgcccc agagaggcct ggcccccagc ccacggccga gaccaccagg 661 cagttcctca tgtcagacaa gcccttgcat ctggaggcct ccctggacaa ggagatctac 721 taccacggag aacccatcag tgtcaacgtc catgtcacca acaacaccaa caagacggtg 781 aagaagatca agatctcggt gcgccagtat gcagacatct gtctgttcaa cacagcccag 841 tacaagtgcc ctgtggccat ggaagaggct gatgacacag tggcacccag ctctacgttc 901 tgcaaggtct acacgctgac ccccttcctg gccaacaatc gagagaagcg gggcctcgcc 961 ctggacggga agctcaaaca cgaggacacg aacctggcct ccagcaccct gttgagggaa 1021 ggagccaacc gggagatcct gggcatcatt gtttcctaca aagtgaaagt gaagctggtg 1081 gtgtctcgtg gcggcctgtt gggagatctt gcatccagtg atgtggccgt ggaactgcct 1141 ttcaccctaa tgcaccccaa gcccaaagag gaacccccac accgggaagt tccagagcac 1201 gagacgccgg tagataccaa tctcatagaa cttgacacca acgatgacga cattgtgttt 1261 gaggactttg cccgccagag actaaaaggc atgaaggatg acaaggagga agaggaggat 1321 ggtaccggct ctccgcggct caacgacaga tagactgggg ctgccctccc tccgggcagc 1381 tccaggtcca ctctcatgca ctaggatgct tgttcgtctt cttcctgtcc tggctccccc 1441 tcccctttgt tcttccagtt tctaccaggg ggccccagcg gtcttccagg tcacggtggc 1501 gaacccctgg cctcaggatt ggcccccatc accatgccaa cagggccaca ggcagcaccc 1561 tcaccctctc actgcatcac ttctccattc cccctctttt cctattgacc cccagacagg 1621 ccagcacagc tctggccttc ggatttgact cgggatgggg agcagaaagg ggaagatggg 1681 gcacaagggc ttggcgaggt ggggatgggg gctcaagacg cgtgagagga tgtggccact 1741 gtcccaggtg atgaatacag ttctggcagc taaaacatga ccgctttgaa ggccaccctc 1801 ctctggctgg gaggggacag acccatggat agattgtcca cacagatttg ctcgaagttc 1861 agacctacca aacagctgtc ttcttcttcc ctcgtccctg ccccctgttc ctctgtggct 1921 gacagtgacc ttggtgaagg tttgt // LOCUS BBVRNA3 389 bp ss-RNA VRL 12-JUL-1990 DEFINITION Black beetle virus RNA3 proteins B1 and B2 genes, complete cds. ACCESSION M33065 KEYWORDS . SOURCE Black beetle virus. ORGANISM Black beetle virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Nodaviridae. REFERENCE 1 (bases 1 to 389) AUTHORS Guarino,L.A., Ghosh,A., Dasmahapatra,B., Dasgupta,R. and Kaesberg,P. TITLE Sequence of the black beetle virus subgenomic RNA and its location in the viral genome JOURNAL Virology 139, 199-203 (1984) STANDARD simple staff_review FEATURES from to/span description pept 10 318 B1 protein pept 20 340 B2 protein BASE COUNT 118 a 120 c 98 g 53 t ORIGIN 1 tcgttaccaa tgttaaacga tgccaagcaa actcgcgcta atccaggaac ttcccgaccc 61 cattcaaacg gcggtggaag cagccatggg aatgagctac caagacgcac cgaacaacgt 121 gcgcagggac ctcgacaacc tgcacgcttg cctaaacaag gcaaaactaa cggtaagtcg 181 gatggtaaca tcactgctgg agaaacccag cgtggtggca tacctagagg gaaaggcccc 241 cgaggaggca aaaccaacac tcgaagaacg cctccgaaag ctggagctca gccacagcct 301 tccaacaacc ggaagtgacc ccccacccgc aaaactgtag gtggctctta ggagcaccca 361 cacccgttct agcccgaaag ggcagaggt // LOCUS MUSURNAA 54 bp ss-uRNA ROD 12-JUL-1990 DEFINITION Mouse small nuclear RNA. ACCESSION M34036 KEYWORDS small nuclear RNA. SOURCE Mouse plasmacytoma cell line P301 small nuclear RNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 54) AUTHORS Chernokhvostov,V.V. and Georgiev,G.P. TITLE Complexes of nuclear matrix DNA with proteins tightly bound to DNA contain a specific small-size RNA of a novel type JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.V.Chernokhvostov, 04-MAY-1990. or address:V.V.Chernokhvostov . of Molecular Biology, USSR Acad. Sci. Vavilova str., 32 117984, Moscow USSR FEATURES from to/span description uRNA 1 54 small nuclear RNA BASE COUNT 19 a 12 c 13 g 10 t ORIGIN 1 agaagacacc ctgatttaac ttctggtatc ggaagatgca agagccgaac caga // LOCUS RATCYP2A1 18820 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat hepatic steroid hydroxylase IIA1 (CYP2A1) gene, complete cds. ACCESSION M33312 KEYWORDS B2 repetitive sequence; LINE repetitive sequence; cytochrome P450; hepatic steroid hydroxylase IIA1. SOURCE Rat (strain Sprague Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 18820) AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J. TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2 genes and regional localization of the CYP2A gene subfamily on mouse chromosome 7 JOURNAL Biochemistry 29, 1329-1341 (1990) STANDARD simple staff_review FEATURES from to/span description pept 4573 4749 hepatic steroid hydroxylase IIA1 (CYP2A1), exon 1 5050 5212 hepatic steroid hydroxylase IIA1, exon 2 7638 7787 hepatic steroid hydroxylase IIA1, exon 3 8005 8165 hepatic steroid hydroxylase IIA1, exon 4 9386 9562 hepatic steroid hydroxylase IIA1, exon 5 12760 12898 hepatic steroid hydroxylase IIA1, exon 6 13340 13527 hepatic steroid hydroxylase IIA1, exon 7 13960 14101 hepatic steroid hydroxylase IIA1, exon 8 17010 17191 hepatic steroid hydroxylase IIA1, exon 9 pre-msg 4545 17380 CYP2A1 mRNA and introns IVS 4750 5049 CYP2A1 intron A IVS 5213 7637 CYP2A1 intron B IVS 7788 8004 CYP2A1 intron C IVS 8166 9385 CYP2A1 intron D IVS 9563 12759 CYP2A1 intron E IVS 12899 13339 CYP2A1 intron F IVS 13528 13959 CYP2A1 intron G IVS 14102 17009 CYP2A1 intron H rpt 3750 4026 B2 repeat BASE COUNT 5274 a 4402 c 3695 g 5449 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattctagt acggtagccc tggctttcat caactagtta gtgccaaata tttgagaaaa 61 gttacaggtt caagctaata aaagttgcag agagtataaa agaatgcaga ttagacaaga 121 aaaaattaat tagagccctt ctagccaaca aagcctcaga tccaggagaa aagactacca 181 tagaaatggc caaaggctta tttatcaaag aaactgggct cagtggcagc aggatgacca 241 ccttgcctgt gtttattgtt gccacagcac tgttggataa agatgcaaat aaattaactt 301 tgggacagaa gttgatcatg actgctcctc cccctgcccc cgcaattctg attgaggctc 361 agtaatgcct acatgcttca ttatcacact ttactaatca gccctggctg agatattttc 421 cagccacctg tttccctgaa ccctgcgact cttccaccca accctgactt gggctgtcca 481 cttcatcaat tcgatgaggt tcaggcccag atacacaata ccagacctta cttgaggaac 541 tctcatccat cagaaacaga gcatacctgg ttcacggaca gaagtagctt catccataag 601 ggtcagagga gaacaggggc agcaataaca acagaaggaa aggtaatctg gactcagtct 661 cttccttcca ggcacttcaa ctcaaaagaa caaactaaga acattaacac aagtcctcat 721 catgggaaaa ggactggctg ttagcatctg cagggacagc cagatatgca tttacaacta 781 ctcatgtgca cggagccatt tacaaggaac aacagctcct aacagcagaa ggaaaaactg 841 tcagaaataa agataaaact ctacaggaca tcaaaaagaa accaatggct agagataaca 901 gtctgactga taaggcttct aagaaggtag ccttaaagga aacagccaac actagattgg 961 ccattgtcct acctgaacca cctagagtaa ctgataaatg cagaaaaaga aattaaatgg 1021 gctggtgatg gtcaactgaa ggtaagcgaa tattgcccac ttctcagaaa gatccacagt 1081 cactcacttg ggagtaaaat gaatgacaaa ctttaaagtt tgccagctga ccaacactca 1141 caggaagccc aaacatccaa attcctgact gcgagttaag agacttggag cctactggga 1201 aaattgattt ttttcagaaa tcaagcaaga aagatatggc tcaaaatatc tgctgatatt 1261 tgtagacatt ttttttcgag atagatggat agatgtacta ggagagatct ttccgatgtt 1321 tggagcacct aaggtaacag gatcagacaa tgggcctgtc ttcatatctc aggtaagtca 1381 gggacttgct aagatcttgg ggactaattg gaaactccat tgttcatatc atccccagag 1441 ttcagggcag gtagaaagga tgaatagaac tctaaaagag accttaacag aattagcctt 1501 ggagactggt ggggactggg tgaggctctt tccctttgcc ctatattagg tgtacatggc 1561 attctagctc ccattgtatc tagcctacag ttggtagcta ttacagaact gaaaaatgat 1621 aatttaagat ttaaggtcag agctaccaaa taggctcatg aatttgtttg ggcctaaatt 1681 atgtaccttc tgtgaagcag gcctggttcc agaaccacac aagtcaaaag agactgggtc 1741 tctatgaaga gatttcacca aggtgcgact aaacccatgt gaaaatggca attcatcatc 1801 ctgttgatca tgatcaccac ctgggtgtac aacaaccaca ccagaccagt tcctccaatg 1861 aagaactctg cctggctcca gctgtaccaa aatagagggt tcaaaaggac accaagccct 1921 tcaagttaaa gttgactcag tctcagtcct gagtctcttg cccctgctaa ctctatgtct 1981 atatatactg tatgtcttag atccccccct gttaggaagg taccctagct ggatccttga 2041 taattttact tttatttctg acttttggcc cctgtatttt aagttgctta gtagtttata 2101 agagaattca gtcaagttaa ttatcttaag gcaacactat ctacagctgg aagcagggaa 2161 gcaagcatat gagttagaag actataagct tcaagatcaa agctatgcta aaagaaaagg 2221 ggggaatgaa aagccagagt tggggtcaat ctgaggccaa tgagaaaaac ccaccattaa 2281 catccaagca cagaacgacc cttctcttcc agaaagagta aagctagttt agttcctgga 2341 acagctacaa gccaaactgt tgaacaaagc cacatgtaac tccccatcca acctccagaa 2401 agtcccagaa tggcacactg accacaagtc attttggagg ttacttcacc ccactaatag 2461 tagtactctt cctagttact gttgtgcaaa ttctgcccca attgtttgta aggtatatac 2521 agacccagtt agagtctgct cagggtcttc tctttctgaa agggagtcaa ccccgacgca 2581 ttaaaataaa gctagtcttg gttttgcatt gattagcacc tccttgagtc tcactcaagg 2641 ggtcccggaa agggtcagat tagacctcat atacctctga gcacagcttg tatggtgact 2701 aagatacagg atacccacag gctgggatta gagagtttaa accaaagatc tttcatccat 2761 gtgctccatg cctgccctgt gcccaggggg aaacatggat tctaattaca gaagcctccc 2821 taaggatctt aatgggaacc aagtaggaga cttttccagt tagaagcctt ctgacaactg 2881 gggtttcccc atattggtag tttaggttgt tatttcacaa aactacaatt ccttcaccaa 2941 ctggagttct gagttattct cctctagtct ggaaaatgat ctgctaaaat atagctgtgg 3001 ttttctaccc ttttcaaagc catacataga cagggaaggt tgcccatcct tccctgaagt 3061 tgaagatcct tttagaagtc aatgcaccca tcagtggtga taaatgcctt taatcccagt 3121 atgcagcaaa ctctgtgagt ttgacgccaa attggtctac agagtgtgtt ctagaacagt 3181 cagagctaaa gagagaaaca ctctgtggaa aagaaagaaa gaaaaaagaa aggaaggaaa 3241 gaaggaagga aggaaggaag gaaggaagga aggaaggaag gaaggaaaaa gaaaggaagg 3301 aaggaaaaag gtacagagag agggaaagag ggagggagaa aaataacata tatgaagaca 3361 cagtacagga ccaatctggg ctcaggtgcc cactttagtc tcctactgga attttcatcc 3421 acttgtacca gaaactcagc acccacagat ccttcttgcc atgtgacctt ccagtccata 3481 gtttggaatc tttcctgttt tccttactaa tatttttctc ctaataaaaa gactaaacca 3541 tctagactct aggactccag agatgactct gtgggtaaga gcacttgttg ctcttgtaga 3601 agacccaggt tttattccta gaacccacat ggtggcttac aaccatgtgt gaccccattt 3661 ccaaaggatt cctctaaatc ttttggcatc tttggacagt gtgcccaatt gttacccaga 3721 cttaaatgga agagaaacct tcatattaca taaaaaatta acacaataag atccataaag 3781 aaatatataa aagaagataa tcttttttaa aaaggataca ttggatacat tgccaggcct 3841 ggagagatgg ctcagtggtt aagagcactg acagctcttc cagaggtcct gagttcaaat 3901 tcccagcaac cacatggtga ctcacaatta tctgtaatgg gattcaatgc ccactactgg 3961 tgtgtctgaa gacagtgaca gtgtactcat atacatgaaa gaaataaatg aatcttgaga 4021 acaacattta atgcctgaag ccatgtttcc tgtattgttc cagtcaaatc taagaatgtg 4081 aattctatca cagaccacaa catttacatc tatgagggct ttcttcatga gctcaaccat 4141 acaaatatag atttttagtt ctagatttga tctggtggac ccagaaatgg acagcctcct 4201 gataatagcc acagtcccca atacagcacc aatctcatca tagcatttga agagtgtatt 4261 atatgttggc ctgttcacct tgtccactaa aaccctcagc ttggtccacc aaagcctctt 4321 tgactgcatt gtatcaacac accaaagcac accgaggctt taagaatttt gaagtaagcc 4381 tgccacccag aggtcttcct atttgcccat gttgtgggtg ttgcaacaaa gacagggtca 4441 gtgttaggag ataggattgg agggtaaaag actcaactag acaaacagga gcaaaggcca 4501 tcctgtgtcc ctgggagtat aaaggtacta tctcagcctt ggctatcagc ctgtcaatcc 4561 tcactggcca ctatgctgga cacaggactg cttctggtgg tcatattggc ctccctgagc 4621 gtcatgctct tggtgtccct ctggcagcag aaaatcaggg ggagattgcc tccaggaccc 4681 actcctttgc ctttcattgg aaattatctg cagctgaata caaaagacgt atacagttcc 4741 atcacacagg tatcactgga tgaggggatg gatgggacat gggagcacaa gaggctgtga 4801 tgttttgcat gttttgtggc agaagattca tagaggaatc caaagtcttg tattagtgga 4861 gtttagaaag ataaggagct atttcaagtc tttggtttgt tgtttgttgt ttgttttgtt 4921 tgttgtttgt ttctttgttt ctttgttttt tcaatcattt atttgtagag taacacataa 4981 tctgacctct gtgtactggt ccagttcagt gaataagtca tctaacagcc cccatctacc 5041 ccacatcagc tcagtgagcg ctatggtcct gtgttcacca tccaccttgg gcctcgccgg 5101 gttgtggtgc tttatggata cgatgcagtc aaagaggctt tggtggacca agctgaggag 5161 ttcagtggac gaggcgaaca ggctacctac aatacactct tcaaaggcta tggtgaggag 5221 gataccacat tggggaacat gcccaaggac atttgttggc gtcatttaag tagccttcat 5281 actaactcat ctctccctca aggctgtaca gagttctctg aatttctctc catatccatg 5341 ttgaatgttg gctctcattg tgaccctccc tagcatttct gagattgaaa acagactttt 5401 gcaaattctg tgggttcttt cttccatcct tctctaccgt tttcttccgc cctttctacc 5461 acctatcact agataggaaa gaaaaggaga tagaggtgaa aggggacatt actgttagat 5521 tatttcctgc tgattaggag tgacgagctc cttagggaaa gttttatctt ctctgtcagg 5581 atatctaatt tcttcttgtt gttatttctt tacataagac tacttaacaa atcacaagca 5641 acagcaacta accaatagcc aaaaccaatt tctcagggtc cttgcattta cacaaccttg 5701 aggagtccca gtatcctgag tgtcacacac tctcagaaac tatctgcagc tggcaaaatc 5761 ataacctcct gctttggaca acctgaacca gccccatatg ccatacctgg gagtaaacag 5821 aaacatattt ctataatagt tctgtatttt tcaaagaaat caaatttctt actacatctg 5881 gccattgctg ctcttctctc tctctctctc tctctctctc tctctctctc tctcacacac 5941 acacacacac acacacacac aaacacacac acacaaacac acacacacaa acacacacac 6001 acaaacacac acacacacaa acacacacac acaaacacac acacacgcac gcacacacac 6061 acaacctctc ggcattctcc tagatggatg actccttttt aatttagctg atatttttat 6121 ccttcttaaa catttatcca cacacagagc atcagttgca ggtctcaggc attcactcct 6181 gatgcctctg gattggtttt ttagattctt tgttcttact tttccatcta tgggtgctgg 6241 gctctcaagc acatctctgc acagtgtgtg tgcctggtgc ccatggaagc aaaaagatgg 6301 agtcagatct cctgaactcc aggggttccc tgagttccag ggttatgagc tgccaggtga 6361 gtgctggggt acaagcacag gtcctctgca aggtcagcca gtgctcttga gtgcagagcc 6421 agctttgctg ccccccactg cctatatttt taaatgctgt tttacatact ccatgtgttg 6481 tccctaagat gtgtataatg cttatagaac gtcacagtct ggtaagtgct ggccaaagct 6541 acagaagtat aaaatggcct tgaacagcaa aacactggtt ataagcaaga aaggtcaaaa 6601 taaagagaaa atccacaaag agccaaatat ctttataaca ttaattctgt agttaaaatt 6661 taacacagag agtgtatctc gttccttgaa gaactgaagg acacacaaat gactacttct 6721 acctagggtc aaaatatagc ggtgactaca gctcaagaca cacaaaacca gagtcaagaa 6781 tcagggagtg gtaataaaat aataaaaaat cctggctcag ggtttcttcc cacctttccc 6841 tgatgaaagg cacacacagc ctttatattt tagtctgcct tatgcagcac aatagctggg 6901 cagctgccta ccctccatgc tgttagaatc cattttccta ttgaaagccc caagttaata 6961 ctttacaagt ttctttatac catatttgct attcttgacc caactgagga gcccttttgg 7021 ccacactgtc ttggcccata gcacatggtg tctctccttc taccttctgc tctttcttct 7081 tccatggctt ccacagaggc tcctcaatcc cattctcctt cctcatgctc tctagcccca 7141 gaaaactaag caccacaagt ctcttctccc agctattagc tgctgacatc tttatttacc 7201 aatcagaatg aactgcgggc aggatcactc agacaaacta cagactccaa atcttagagg 7261 ccaacactta ctgttatagg aaacaataaa agacaaaaac ctcaacacca gggtatgttt 7321 ctgggtaggc tgtccttgct ttaatgggga tttgctgttt tcagaaaatg ctcaatattg 7381 attgattttg ccatttccag gaccctttgc tgcattctgt ctgtaagtct ctttttattt 7441 gcctggctga cttgtttcaa ctttctttct ctgactgtgt ctgatgcaca gtctgtgttt 7501 gtgtcttttg tgtccttgcc atttctatcc aactttgtct cttttctttc ccccttagaa 7561 cccctttcca gggtgggcct catccatcct cagcctcagt ctacttctcc tgacccctta 7621 tatttatatc tctacaggcg tggcattcag cagtggggag cgggcaaaac aactcaggcg 7681 cctctctata gccacattga gagattttgg tgtgggcaag cgtggtgtag aggagcgtat 7741 cctggaggag gcaggctatt tgatcaagat gttgcagggc acttgtggta agcaagagac 7801 cattaagtgt ttgggcaaga gaaagaacat ccctgacacc tagaccctat gggttgtgga 7861 taagaagggc ggggaagacc gcctaccaaa ccatccccag aatctggtgc tgagagattg 7921 gtgcctcact ccaattccca caccatctgc taactcttct ccctcataat gccaatgtct 7981 tccaaacaat gtcacccctc tcaggagccc ccattgaccc caccatctac ctgagcaaaa 8041 cagtctccaa tgttattagc tccattgtct tcggggaacg cttcgactat gaggacacgg 8101 agttcctgtc actgctgcag atgatgggtc aaatgaacag atttgcagct tcacccacag 8161 ggcaggtaac agatccagct ctgccaattg tccttatagt gtcccacatt gaccatacca 8221 acaaagggca aggaccaccc tgactctcat ggctacaaac aaaagctccc ctcaaaaaca 8281 gaagctcccc tcaaaaccag cctttacttc agaaaactga acctttacat cagagcccac 8341 agaagctatc cagtgctcac aatctaatgt cctctggata tctcagtagc ctgagaacac 8401 agccctctgc ttgactctct tccctgggca ggtttctcca gcttaacctc taataaatcc 8461 tctatgtggt cctcctgaaa atttagacaa ctgcccaagg gatacaagtg accacctctg 8521 gccccctcct ccaatcctga acacctacct agttctgcaa aactgtggtc agtaaagcta 8581 ttcagtccat acacccagtt ctccccaaag atcccactga cacaatggca caaaagtcac 8641 ctgttgtctc aggtaaattc aggaatgagt agacaggcac ctcaaccaag gcaaccaagc 8701 acagacctct ggatggactg tttccccaaa cacccatatg tctcccagct acacacaacc 8761 cacatcaaga caatatctga caggtgtgtc tcacacctta taacctgaac caccccacca 8821 tgaagacctg actatgtgaa aaaccgattc taatctcaaa caaatatcaa gacatctaat 8881 cttagccctc tcaaatgccc aaacatatag atacttgatt cactgcgaca ctcatgtcct 8941 gaatactaga aacctggagt aatggtctga tccaaaaatc agttaaataa ctgaatgtct 9001 actaatgttc ccttttgatc cagttcattg ggattgtaag acaatgacct tcattcttta 9061 aatcacctag aaaactgtgg tctctggggc ctctgacagt tcagtggttt aagagcatgc 9121 actgctcatc ctgaggaccg agttcagttc ccactaccta tgctgaacat ttcaaaactc 9181 tatgggagta cacctgcacc gtgcacataa ttaaaagtaa aatattcaaa cgaatataaa 9241 gagttctttc aagagtggag gtgctgtttg ttgcaattca tcctaacata aatacatgaa 9301 cacctggatg gatcccttga gactcgaccc actcccacgg gtgttgccac tgacaagcct 9361 tttcttttct cctcccaccc cccagctcta tgacatgttc cattcagtga tgaagtacct 9421 gcctggacca cagcaacaga tcatcaaggt tactcagaaa ctggaagact tcatgataga 9481 gaaagtgagg cagaaccata gtaccctgga ccccaattcc ccaaggaact tcattgactc 9541 ctttctcatc cgcatgcaag aggtgatccc aatcatggtg gatggaatgt ctaaaacagg 9601 gcagctctaa atcatcctag aaaaggagga ggaatatagg cccattaagt gcccatgatt 9661 ctcctcacag tcccggttat agttaaacct cactctttca cctgttgagc cttatccaag 9721 ccagggtatg ggttagcaaa ttaccatgac aaccgatatt ccagtgttcc cctatgagac 9781 actgttttca gtgttcaact acttagcatg cactgaagct actgtcgaag accctgtgga 9841 gcctaaactt cgcaaagagg gaaagtgtgc ccagacttgc atgctgactt tatggagaca 9901 gaaaactata cagccttgcc tctatggctc tcaggctttt actattagcc acatggtctc 9961 tagcatttca tatctctgtt aggaaataca catcagtaca catcagtggc ctaagacctg 10021 ggtttttttt tcttttgtct gttctagtaa tttttttatt gtttttcatt tttgtgtttt 10081 tttcttttat tggatttttt atttctattt cagatattat cccctttctt ggtttccctt 10141 ccagaaacct gctatctcct catgcttcta tgaggattct ctcccaccca cacaacactc 10201 cctgccacct ccctgtgctg acattcccct acactggggc atcgagccca gacaggacca 10261 agggtctctc ctcccattga tacccaacaa ggccatcctc tgttatacat atggctgaag 10321 caataggtac atccctgtgt actcttggga tggtttagtc actgggagct ctggtgggtc 10381 tggttggtta atattgttgt tcttcttata gggtggcaaa ccccttcagc tccttcagtc 10441 ctttctctaa ctcctccata tgggaccatt ttctcagttc aatggttgac tgcaagcatc 10501 tgcctctgta attgtcacgc tctgcagagc ctctcaggag acagctatat gaggatcctg 10561 tcaacatata tttcttggca tccacaatat tgtgtgagtt tagaggatgt caatgggatg 10621 aatccacctg tagggcagtc tctgaatggc ctttccttca gactctgctc caaactttgt 10681 ctttgtattt ccttctttga gtatttttgt tccccctttc aagaaggact gaagcatact 10741 cacttgagtc tttcttcttc ttgagtttca tgtggtctct gaattctatc ttgggtattc 10801 caagtttttg gactaatatt tacttctcag tgagtgcata ccatgtgttg ggttacctca 10861 cttaggatga tattttttag ttccatccat ttgcctaaga atttcatgaa gtcattattt 10921 ttaatagcag tgtagtactc cattgtgtaa atttactata ttttttgtat atatttctct 10981 gttgaagaac atctagtttc tttccagctt ctggctatta taaataaggc tgttatgaac 11041 atagtggaga gtgtgtcttt gttatatgtt ggagcatctt ttgagtatat gcccaggaat 11101 ggtatagctg agtcctcaca taatactatg tccaattttc tgaggaacct ccaggatgat 11161 atccagagtg gttgtatcaa attacaatcc accaacaatg gaggagtgtt actctttctc 11221 cacatcctta ccagcatctg ttgtcacctt cgtttttgac ctttgccatt ctaactggtg 11281 tgaggtggaa tctcagagtt gttttgattt gcatttccct gatgactaag gaggttgaac 11341 atttctttag gtacttctca accatattcc taagctgaga attctttgct tagctcttta 11401 ctccattttt aatggggtta tttgattctc tggagtctaa cttcttgagt tctttgtata 11461 tatttaacat tagccctcta tcggatgtgg gattggtaaa gatcttttcc caatctgttg 11521 gttgtcgatt tgtcctaatg acagtgtcct ttgccttaca gaagctttgc aactttatga 11581 agtagtattt gtcaattctt gatcttagag cataagccat tggtgttttg tttaggaaac 11641 tctccctggt gcccatgtgt tcaagaccct ttcccacttt ctgttctatt agttccagtg 11701 tatctggttt tattttagtt taattttatt tttcttggat aattatgtat tacacatcaa 11761 atgttattcc ctttgtcccc tctctcatat ccccttcccc tccctctgcc tctatgggga 11821 tgctaccacc cccatccacc cactcccacc tcaaccccct agcattccct tacattgaga 11881 aaaagagcct tcactagacc aagggctttt cctcctattg atgctggaca atgccatcct 11941 ttgctacata tgcagctgaa gccacgggtc cttccatggg tacgctttgg ttggtggttt 12001 aggccctggg agctctcgtg gagtctggtt ggttagttga tattattctt ccatccctaa 12061 aatgaatgac agtcacctag acagagaaat gagcaaagct tctcatgcaa acccaagact 12121 gctaacacag cctggagatc tttttccaac gattggtctg gaccctatga gaactagatc 12181 caaaggaaat tgcagaagtg ctgcctattg catccctctc ctccatgagg aacttaatcc 12241 acagttgacg gctgtttaga gacgatgaaa taatattcct ttgcagtgtg gctactagta 12301 aattgacctt tctcaagtaa agaacccctc gcccatatgc atgcagccac acctaattat 12361 aagcagttac ccacaacacc cccaacaaac aggaaaatag gaaggagact tattaggaat 12421 aagaaatggt tcaaaaaaat ggaaagtaga aaataataga ggggaatacg tttaaagtgc 12481 atttcatgta tacgtctgaa aaataaggac tcaaggttca gtgggtatgg aaggggattc 12541 atctgggagg gtttggagga ggggtatgaa tatattcaca atacaataaa tgaaattctc 12601 aaagaattaa taaaattatt tataaaagaa ttactagaaa tgtttcagaa aattaaaacc 12661 cttaatgttc cccaaggatg acaaaatgat agatttatgc cctctctctc tctctctctc 12721 tctctctctc tctctctctc tctctctctc tctctgcagg agaaaaatgg caattcagag 12781 ttccacatga agaacctagt gatgacaaca ctaagcctct tctttgctgg gtctgagaca 12841 gtcagctcca cactacgcta cggcttcctt ctactcatga agcatccaga tgtggagggt 12901 gaggctggct atgtggcagg gaagttggga accgcagact ctccaactgc ttacaaccta 12961 acaatgaccc tcacttctcc caggttcctg gatgctcagt catgctcagc tatgcagaga 13021 caggggcata ttaaatgcat aaacacagtt ctcacaaact taaaatatta gacattccca 13081 aattgatttc actctgactt ccagatctct gctctctgtt ctcttccctg actcctgctt 13141 cttctcccca ccatgattct gtcacgaaaa ggataaaatg accctgtcca gcatttaggt 13201 atggatatat gtttaaatgg tttaaatgca tgttatttac agagacatgt aatacatgca 13261 gtggtacaca tgtgaactat tccacctgct ttgaggcctc tggattttta aaaatacccc 13321 atctccgctt gtctttcagc caaggtccat gaggaaattg agcaggtgat cggcaggaac 13381 cgacagcctc agtatgagga ccacatgaag atgccctaca cccaggctgt gatcaatgag 13441 atccaaagat tttctaactt ggctcccttg ggcattcctc gaaggattat caagaacaca 13501 accttccgtg gcttcttcct ccccaaggta gcagccatgc ccatccagga ggggcctcca 13561 gcccacttac tgatgcttca gggcttcttt ccatctgtag ctatctaact ccactctaat 13621 tcctccaacc aaagaattca tccacatgtc cccaaattct tgtccagctg ctttgaactc 13681 cattttctat ctactcttct gccttgctac cttccaatct ctcaactcct gggctagagg 13741 caaaggcctg ctgtcacact aacaccctat cttagcacat gatcccctgg agctcaaatc 13801 tccaattgct gatggcacat atcgtagccc ctcaaatctc ctattcccta atgccttttc 13861 ctgaggagac ctccaactct gtgccttgca gttgtctata tttggacatc ctttctccat 13921 caacccatct tctaaaatct cctttcttcc ctcttccagg gcaccgatgt gttccctata 13981 ttaggttctc tgatgacaga cccaaagttc ttccctagcc ccaaagactt cgacccccag 14041 aacttcctgg atgacaaggg acagttgaag aaaaatgctg ctttcctccc tttctccact 14101 ggtaaggaga cagtgggtta ttgaactact gttcacacca acatgggtag cacatgccag 14161 cttccctgtc tgtgatgctg cctagaatca ggctaaccag gtatagcccc tgcacctccc 14221 aagcaccaga catgctggat gcaggtgaga ggatccctgg gaccagtgat ctgtgtcaga 14281 gaccggggag gggttgggaa taccaacttt cctaggtgat gctcatgcaa gcaatttctt 14341 cacactcttt ctaatgcagc ttttaaataa ttgtttgttt ttctttattt tttaagtaat 14401 ttatttaatg tgcaatggtg tgaggttgtc agatgccttg gaactgaact tatagatgat 14461 tatgagctgc catgtggctg ctgagaattg aaccttggat cttcagaaga acagacagtg 14521 ctcttaacca atgagccatc tcccagcccc atcttcagac tcttaaaagt gggataacaa 14581 ccaggtggta taggtgcatg cctttaacca cagtactggt ggatatctga gttcaacacc 14641 agcctgggac tatagagtga gttacaggac aacccaggct acatggagga aaccatgact 14701 tcaaaaacta aaaataaata aataataggt aggtagatag atagatagat agatacatac 14761 atacatagat acatagatac atagatacat agatacatag atacatagat acatagatac 14821 atagatacat agatacatag atagatgcat agatagatac atagatagat agagacatag 14881 atagatgcat agatagatac atagatagat agatgcatag atagatagat acatagatag 14941 aaagatgcat gtatacatac atgcatgcat acatagataa atagatgact cataaaaaat 15001 taaaagaata aaaaaataaa caaggccaca gcagagcatc tacatttgag aggataatta 15061 ataattgata gaggaagcat ctgtactcca tattgctcca gcctaaaatg agttgtccca 15121 cgttgtgtgt agggacacca gggttttaag agggttagga gcctttccta atgatccctc 15181 atgctccagt atagcagccc cttctccttt tttttttctt tttttcttta ttaacttgag 15241 tatttcttat taacatttcg agtgttattc cctttcccgg tttccaggcc aacatccccc 15301 taatccctcc ccctcccctt ctttatgggt gttcccctcc ccaccctccc cccattgccg 15361 cgctcccccc aacaatcaca ttcacagggg gttcagtctt agcaggacca aggacttccc 15421 cttccattgg tgctcttact aggctattca ttgctaccta tgaggttgga gtccagggtc 15481 agtccatgta tagtctttag gtagtggctt agtccctgga agctctggtt ggttggcatt 15541 gttgttcata tggggtttcg agtcccttca agctcttcca gttctttctc tgattccttc 15601 aacgggggtc ctattctccc acccttcccc cactgccgcc ctccccccaa caatcacgtt 15661 cactggggct gaaccccatt tttaataggg ttatttgtct ccctgcggtc taacttcttg 15721 agttctttgt atattttgga tataagccct ctatctgttg taggattggt aaagatcttt 15781 tcccaatctg ttggttgccg ttttgtccta accacagtgt ctttgcctta cagaagcttt 15841 gcagttttat gagatcccat ttgtcgattc ttgatcttag agcataagcc attggtgttt 15901 tgttcaggaa attttctcca gtgcccatgt gttcaagatg cttccccact ttttttccta 15961 ttagtttgag tgtatctggt ttgatgtgga ggtccttgat ccacttggac ttaagctttg 16021 tacagcgtga taagcatgga tcaatctgca ttcttctaca tgttgacctc cagttgaacc 16081 agcaccattt gctgaaaatg ctatcttttt tccattgaat ggttttggcc cctttgtcaa 16141 aaatcaagtg accataggta ggtgggttca tttctgagtc ttcaattcta ttccattgat 16201 ctatctgtct gtctctgtac caataccatg cagtttttat cactattgct ctgtaatact 16261 gcttgagttc agggatagtg attccccctg aagtcctttt attgttgagg atagttttag 16321 ctatcctggg ttttttgtta ttctagatga atttgcaaat tgttctgtct aactctttga 16381 agaattggat tggtattttg atggggattg cattgaatct gtagatcgct tttggtaaaa 16441 tggtcatttt tactagatta atcctgccaa tccatgaaca tgggagatct ttccatcttc 16501 tgaggtcttc ttcaatttct ttcttcagcg tcttgaagtt cttattgtac agatctttta 16561 cttgcctggt taaagtcaca ccaaggtatt ttatattatt tgggactatt atgaagggta 16621 tcgtttccct aatttctttc tcggcttgtt tctcttttgt gtagaggaag gcaactgatt 16681 tatttgagtt aattttatac ccagccactt tgctgaagtt gtttatcagc tttagtagtt 16741 ctctggtgga acttttggga tcacttaaat acactatcat gtcatctgca aatagtgata 16801 ttttgacttc ttcttttcca atctttatcc ccttgatctc cttttgttgt ctgattgctc 16861 tggcttgaac ttcaagaact atattgaata agtagggaga gagtgcagcc ccttctcttt 16921 aagagaacac agctttgcac ttggcactga ggcaaggcag cggtgagagc ttccttccca 16981 actgtgctcc ttccctctct cctcttcagg gaagcgattc tgcttgggag atggcctggc 17041 taagatggag ctcttcctgc tgctcaccac tattttacag aacttccgtt tcaagttccc 17101 aatgaaacta gaagacatca acgagtcccc caaacccttg gggtttacca ggatcatacc 17161 aaagtacacc atgagcttca tgcccatctg attctgagtt gaatcaaggt ggggcaagag 17221 ggagagagag cctgaagtgg ggccagggtg caggtggaga gaacagggga ggtgaagatg 17281 agggttaaga agggaccaca cccatggaag aaacacaaaa gacttctcac tttggtaaaa 17341 ttgtaacagt cctaataaaa agaaagaaat actcagtggg cagcagtaac aacaactgag 17401 actcatgggg caaaggtggc tcacctctgc agaagctgtc ctgtccttct ctcagtcctc 17461 tacacaagag cagcatgtcc ccaagtccaa cgtacaggtt gcaaagatgg aacttacaaa 17521 tttgaaccta aactgaggtg gaaaaaactc aagttagcta ggattgatgt tttggactct 17581 atcaccagca ttcaggaggg agggaacatg gctctctacc atgtctgcca ggactacaca 17641 gtgagagctt atctcaaaag aaaaaagaaa aaaagaaaaa aatttatata tatatatata 17701 tatatgtata tatatgtata tatatatatg gagagagaga gagagagaga gagagagaga 17761 gagagagaga gagagagaga gagtttgcat tgtacatgat cagggaaata ataaaaacta 17821 gtttgacagt cacataccag tgggttctaa tttatcaaac tccaccccca cccccactgc 17881 cactgctgcc ctatgaagga actgaacaga agcttaactt tccttgggcc atttcgacag 17941 ctgttgtgtc atcaaggctt ctgttttcct atggagacac tacacatggg acagagagga 18001 taacagggag ctcatgactg agagaccttc aggccaaagc acttgaacct ttgtttatcc 18061 tgtttattct gaattttctg cttctgggct ctcatttccc caccattaaa atgagaatat 18121 caatatttac agctgcactg catctctttt tggagtgatt cctggtaact aagaaataag 18181 tagaaaatgg aaggatgaaa tccaccagga ggtttgagta aattccactg tgggaaacac 18241 aggggactgt gggatggcaa ggatgagagc tggaaagaat gcaaggccac actatgtctc 18301 atgcatattt tatatctttt ttatattctt tatatctttg tagtgttttt attagcctac 18361 aaagaaatac atttctcact ggcaacttct tacatatata tcactaccta tgttctcatt 18421 cactttcctt cgctggtctt ggcctcttcg caaaattatt caccggtaat ttattcacac 18481 tttctaattt ttgagcatgg tgcattccag taagatttaa tctctgtggc catggtgttt 18541 cacagctctg taacactgaa gcacattcat catcaactgc actgaagtca tcaacttaag 18601 aagcaaagga ggattcttct ggtctccatc tgcgcccaga gctaagtctg ccccacaacc 18661 ctccagattc aaaacctccc cagacagagc tagtcctcca ggagtgctct cactactaag 18721 gccacaagtg agaccccatt tcccttcaat accgatccaa agaggagccc accagatacc 18781 aggtaccaaa gttaaatgag gatccgttga cctgcaggtc // LOCUS RATCYP2A21 7247 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 1 and 2. ACCESSION M33313 KEYWORDS LINE repetitive sequence; cytochrome P450; hepatic steroid hydroxylase IIA2. SEGMENT 1 of 3 SOURCE Rat (strain Sprague Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 7247) AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J. TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2 genes and regional localization of the CYP2A gene subfamily on mouse chromosome 7 JOURNAL Biochemistry 29, 1329-1341 (1990) STANDARD simple staff_review FEATURES from to/span description pept 5556 5732 hepatic steroid hydroxylase IIA2 (CYP2A2), exon 1 6198 + 6360 hepatic steroid hydroxylase IIA2, exon 2 pre-msg 5528 > 7247 CYP2A2 mRNA and introns IVS 5733 6197 CYP2A2 intron A IVS 6361 > 7247 CYP2A2 intron B rpt < 1 649 LINE repeat rpt 1120 2122 LINE repeat rpt 5852 6037 dre repeat rpt 5852 5863 5' direct repeat rpt 6026 6037 3' direct repeat BASE COUNT 2306 a 1563 c 1530 g 1848 t ORIGIN 1 aactatcctc aacaataaaa ggacttctca gggaatcact atccctgaac tcaagcagta 61 ttacagagca atagtgatta aaaactgcat ggtattggta cagagacata cagatagacc 121 aatggaatag aactgaagac ccaaaaatga acccaagcac ctatggtcac ttgatttttg 181 acaaaggaac caaaaccatc caatggaaaa aagatagcat tttcagcaaa tggtgctggt 241 tcaactggag gtcagtatgt agaagaatgc agatcaatac attcttatca ccctgtacaa 301 agcttaagtc caaatggatc aaggacctcc acatcaaacc agatacactc aaactaatag 361 gagaaaaagt ggggaagcat ctcgaacaca tgggcactgg agaaaaatcc ctgaacaaaa 421 taccagtggc ctatgctcta agatcaagaa tcgacaaatg ggatttcata aaactacaaa 481 gcttctgtaa ggccaaggac actgttgtta ggacaaaacg gcaaccaaca gattgtgaaa 541 acatctttac caatcctaaa actgatagag gctcatatcc aaaatataca aagaactcat 601 gaagttagag tgcagggaga caaataaccc tattaaaaaa tggggttcat gggtgtagat 661 ctctcctgag agacacaccc agaatacagc atattcatat gcgaatgcca gcagcaatcc 721 actgaactga gaatgggacc cccgttgaag gaatcagaga aaggactgga agagcttgaa 781 ggggctcgag accccatatg aacaataatg tcaaccaacc agagcttcca gggactaagc 841 tattacccaa agactgtaca tggagtgacc ctgggctcca actgcataag tagcaatgaa 901 tagcctagta agagcacagt ggaaagggaa gcccttagtc ctgccaagac tgaaccccca 961 gtgaatgtga ttgttggggg gaggacagta atgggtggag gatggggagg ggaacaccaa 1021 tatagagggg agggggagga gttaggggga atgttggcct ggaaactggg aaagggagta 1081 acaatcgaaa tgtaaataag aaatactcaa gttgataaag ataaaaaaaa agtgaggttc 1141 agagctaaac aatgaattca cagctgagga atgccaaatg gctgagaagc accaaagaaa 1201 tgttcaacat ctttagtcat aagggaaatg caaatcaaaa caaccctgag attctacctc 1261 acaccagtca gaatggctaa gatcaaaaac tcaggtgaca ccaaatgctg gcgaggatgt 1321 ggagaaagag gaatactcct ccattgttgg taggattgca gactgctaca accattctgg 1381 aaatcagtct ggaggttcct cataaaattg gacatagatc tacctgagga cccagctcta 1441 cctctcttgg gcatataccc aaaagatgca ccaacatata acaaagacac atgctccact 1501 gtgttcatag cagccttatt tataatgggc agaagctgga aagaacccag atgcccttca 1561 acagaggaat ggatacagaa aatgtagtac atctacacaa tggaatacta ctccgctatc 1621 aaaaacaatg actttatgat attcataggc aaatggatgg aactcgaaaa tatcatcctt 1681 agtgaggtaa cccaatcaca gaaaaacata catggcatgc actcattggt aagtggatat 1741 tagcccaaat gctcaaatta ccctagatgc acagaacaca tgaaactcaa gaaggatgac 1801 caaaatgcgg atgcttcact ccttctttaa aacaggaaca agaataccct tgggagagga 1861 tagggaggca aagtttagaa cagaggcaga acgaacaccc attcagagcc tgcccacatg 1921 tggcccatac atatatagcc accaaactag ataagatgga tgaagcaaag aagtgcaggc 1981 tgacaggaga tctatgtaga tagatctctc ctgaaagaca cagccagaat acagcaaata 2041 cataggcgaa taccagcagc aaaccactga actgagaatg ggaccctgtt gaaggaatta 2101 gagaaaggac tgaatgttgt tgtaaaaata taaaaataaa gagtaatgtt ggtcttttac 2161 cccgctaggt atcttggcgg aaacacatcc cagccacgca ctttcctaca ctcaaaccct 2221 cacataaaag aacacacaac acaataatct ttgacccaat tggtaagata taattgccta 2281 cttaaacata caaagcccgg taccatccat cccttgagaa cattaataac aatttgtaaa 2341 tacacagagc agaatcttaa catcaccagc tatcttgtcc tgccacggct tctccgcccc 2401 tctctccctc ctgtctcttc ctctctccct tagtctcctc ctcttcctta aaacttctct 2461 cccgcccatc cttccttctc ctccaatgac aggcctcctt ctatcctgta cctgcccctc 2521 accagtactt tacaaattca gtggagaggt ggttctggtg aagtcacctg agttctgagt 2581 ccttgactag gcagctgtcc ttggggcagt ggaattagca tcaaaataca gtaacttcag 2641 ggcaaaccag aataactgaa agagcttgaa ggggcttgaa accccatatg aacaacaatg 2701 tcaaccaacc agagcttcca gggactaaga ctatacaagg actgaccctg ggctccaact 2761 gcataggtag caatgaatag cctagtaagg ccaccagtgg aaggggaagc ccttggtact 2821 gccaagactg aacccccagt gaatgtgatt gttgggggag gacggtaatg gggcgaggat 2881 ggggagggga acaccaatat agaggggagg gggagaggga gggggatgtt ggcctgaaaa 2941 cctggaaagg gaataacaat tgtaatgtaa ataagaaatg gctcagtggt taggagcact 3001 gactgctctc ccataggttc tgagttcaaa tcccagcaac cacatggtgg ctcacagcca 3061 tttgtatggg atccgattcc ctcttctggt gtgtctgaag acagcaacat tgtacttata 3121 aatgaataaa caaataaata aatctttaaa aaaaagaaat acccaattta ataaagatgg 3181 agaacaaaaa acaagaagat acattgctag ggctagagac atggctcagc agttaagagc 3241 actgactgct cttccagagg tcctgagttc aattcccaga accacatgat ggctcacaac 3301 aatctgcaat gggattcaaa gatcacttct ggtgtgtcta aagacagtga caatgtactc 3361 atatacatga aagaaagaat gaaatcttta aaactttcaa aagctgaaga catgctccct 3421 atattattcc aggcaaatcg aagaatttga attctatcac aaactacaat actcacatca 3481 atgagggttc ttttcatgtg ctcaaccaca caaatgtaga tttttagtta tggatttgat 3541 ctggggaacc tagacatgga cagtctccag ataatgccca cagttaccaa tacagcatcc 3601 ctctcaacat agcctttgaa gagtgtgcta taggttggcc tgttcacctc atccacttaa 3661 ctcctcagct tggtctccca aagcctcttt gactgcatcc cattcataaa ggaccacaac 3721 ccagtgaggc tttaagaatt ttgaagtact ggcagcagcc tatgccctgg ggacccctga 3781 gcatctcacc agttccaggt cggagactcg gctacatacg atggcaccga acccagatac 3841 tcactggaaa ggaccgtacc tggtgctgct gaccaccctg acagccatca actctcagcc 3901 ctcaccagcc gtgtactagc tgttggggct gagagctggg acctagagct gggaccagtt 3961 cttcaaaaag ctccctagac ttaatttcat gtttgccccg ggttttatca agataggtgt 4021 ggggataggc ttgatttcta ttacaaatga tgtaacattg catatgttag tactcctaac 4081 acttcttggg actgtgcctc agggatcaca atctgtataa gtttagaagt tctaaaagct 4141 agtcatgacc ttggtgtgta ggtttagata gtgtccagat tggaatcctg atgctaaaga 4201 cttagtaaga cacaaaaaaa ggagttgaga attacttagg gctaaggcta tctaggtgct 4261 gcaagggcag cacaaggaca tctgctgttg caatgcaagg cttatagaga attcagaact 4321 gccatttagg agtaattaaa gactccatga ataaacttag agaaaggtta gacaaaaggc 4381 agacagagaa gcgcatcagg gatggtttga gagctggttt agtagatctc cttggatgac 4441 tactctggta ttttccctta tgggaccctt cttagttttg cttctgcttc tgattatagg 4501 tccatgtgtg ttagagaaac tagttaatag gtttgactcc tacaaaaaga tagagacgct 4561 caacaaggtt ggtttgagtc ttggttcact cggtctccct ggatgactac cctactctct 4621 gctatatggc tgggccatta ctaataattt tcttggtttt agtttttgga ccctgcgtga 4681 caaacaggtt aattgctttt gttacaaatc gagtgagtgc tgtgcggttg gttctgagac 4741 aacagtacca gtcagttagg acaactggtg agaccaaata agagacttga tatcaaaatt 4801 ctaagattag aattacttag tagaagaaga ggggaatgaa aggaaaatta tacagattta 4861 aggtttaaaa atatgaagtt aaaagagtat gtttcaactc aggactaaac actgtgaaaa 4921 gcaagtccag gcagccccgc cctgccgcta gaactaacag accataaaag gaaaggaatg 4981 cagaacagac caggagtacc ggatctgact cacaggccac ctggcaggaa gagataagcc 5041 cccagccccc gacatccagg acgccccaaa cctgccaatg tgtgtagcta taccttatta 5101 cctcatcatg tgaaatagcc aatcatatgt gaacatgtct atgtgcctcg tttgaatcca 5161 ccaatccccg taactatgca tctgcttctg tacgcccgct tctgcttccc caatccctat 5221 aaaagcccca tgctggagct gctgggcgcg caagtcctcc gaagagactg tgtgcctgca 5281 ggtacctgtg ttttccaata aaccctcttg ctgattgcaa aaaaaaaaaa aaaaaaaaaa 5341 aaaaaaaaaa gaattttgaa gtaagcctgc cacctttctt cctatttgcc catgttgtgg 5401 gtgttgcaac aaagactggg tcaatgttag aaaatagggt tgggaggcaa aagactcaac 5461 tagacaaaca ggagcaaagg ccatcctgtg tccctgggag tataaaggta ctatctcagc 5521 cttggctatc agtctgtcca tcctcactgg ctactatgct ggacacagga ctgctcctgg 5581 tggtcatact ggcctcccta agtgtcatgt tcttggtgtc cctctggcag cagaaaatca 5641 gggagagatt gcctccagga cccactcctt tgcctttcat tggaaattat ctgcagctga 5701 atatgaaaga cgtatacagt tccatcacac aggtatcact ggatgagggg atggatggga 5761 catgggagtc caagaagctg ggttgttttg catgttttgt ggcagaagat tcatagagta 5821 aatccaaagt cttgtattca tggagtttag aaagataagg agcgggctgg agagatggct 5881 cagcggttaa gagcaccatg tgctcttcca aaggtcctga gttcaaatcc cagtaaccac 5941 atggtggctc acaaccatct ataatgagat ctggtgccct cttcttgtat tcttaatcat 6001 aataaataaa taaatctaaa aaaataagat aaggagctat ttcaactctt tggtttgttg 6061 tttgcattcg tttgtttgtt tgtttgtttg tttgtttttc aatcatttat ttgtagaata 6121 acacataatc tgacctctgt gtactggtct agttcagtga ataagtcatc taacagcctc 6181 catctaccca acatcagctc agtgagcgct atggtcctgt gttcaccatt caccttgggc 6241 ctcgacggat tgttgtgctt tatggatacg atgcagtcaa agaggctttg gtggaccaag 6301 ctgaggagtt cagtggacgt ggcgaactgc ctacctttaa tatactcttc aaaggctatg 6361 gtgaggagga taccacattg gggagcatgc ccaaggacat ttgttggcct catttaagta 6421 gccttcatcc taactcatct ttcccctcaa ggctgtacat agtcctctga tttttctctc 6481 catattcaag ttgaatgttg cttcttattg tgacccttcc tagtctttct atgattctct 6541 gtgggtgctt cctttcattc ttcttcaccc ttttcttcca ttctttaacc ctcataatac 6601 taggtaggag ataaaaagag atagaggaaa aaggggacac tattgttaga ctacttcctt 6661 ctgagaggta atgagttcct tagggcaagt ttgatcatct cagtcaggat atctaatttc 6721 ttcttcctgt tgttactttg cacaaggcga cttaacaaag cacagccaac agcaaccaac 6781 caacaaccaa aaccaatctc tcaaggccct tgcattaaaa taacctctga ggaatcccca 6841 gtatcctaag ggtcacactc tcagaaacta tctgcagtag gcaaaatcat acccctgcta 6901 gagcacaaaa taaatcatag gtctctgctt tggacaatct gattcatccc catattgcat 6961 acctggaatt aaaaaaacat attcctataa tatttctgta tttgtcaaaa aaaaacaaaa 7021 ttcttttttt tttatcttta agtaatactc caactttatt gaataaagga ataaatggag 7081 ttttcaagtt ttcccatcat ggttattttt aaagccacct gatacatgac agtacttatc 7141 aaaacaagat gtttatctat ttttgtcatt tgtatttttg cttaatttta tattcataat 7201 atatttaaat taactaatag ttcatggtaa cacttggcca cacaggt // LOCUS RATCYP2A22 4753 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 3,4 and 5. ACCESSION M33325 KEYWORDS cytochrome P450; hepatic steroid hydroxylase IIA2. SEGMENT 2 of 3 SOURCE Rat (strain Sprague Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4753) AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J. TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2 genes and regional localization of the CYP2A gene subfamily on mouse chromosome 7 JOURNAL Biochemistry 29, 1329-1341 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 1815 1964 hepatic steroid hydroxylase IIA2 (CYP2A2), exon 3 2182 2342 hepatic steroid hydroxylase IIA2, exon 4 3567 + 3743 hepatic steroid hydroxylase IIA2, exon 5 IVS < 1 1814 CYP2A2 intron B IVS 1965 2181 CYP2A2 intron C IVS 2343 3566 CYP2A2 intron D IVS 3744 > 4753 CYP2A2 intron E BASE COUNT 1258 a 1204 c 911 g 1380 t ORIGIN About 5 kb after segment 1. 1 aatcaatagt ttttaagcta ctaacccttt ctagagatga tgaaaataga aaactggaag 61 aatgcctagg tagcaaatga ccttggaagt tagggactaa aaatttaagt ccacatctgt 121 gcaagataaa aattaactct tagtttgcat aagctcttat ttttttcata agtcttattt 181 gtttttttat ctttattaac ttgagtattt cttatttaca tttcgattat tattcccctt 241 cccagtttct gggtggatga ctccttttta acttagctga tatttttatt cttcttaaac 301 atttatccac acacagagca tcagtcgcag gtctgaggca caccctgcta gtgcctctgg 361 attgttttta aagatcattt gctcttactt ttctatctat gggtgttttg cttatgtgta 421 tatgtgtaca caagtctggt gcccatggaa gcaaaaagat ggagtcagat ctcctgaact 481 ccaggggttc catgagttct ataagctgtc aggcgagtgc tggggttcaa gcacaggtcc 541 tctgcaaggt cagccagtgc tcttgagtgc agagccagct ttgctgtcca tccccccgcc 601 cccgcgcatg tatttttaaa tgttgtttta catatgtcat gtgttgtccc taagatgtgt 661 ataatgctta tagaacatta cagtctggta agtgctggcc aaagttacag aagtataaaa 721 tggccttgag cagcaaaaca ttggttataa gcaagaaagt tcaaaataaa gagaaaatcc 781 acaaagagcc aaatatcttt ataacattaa ttctgtggtt gcgatttaac accaaggggg 841 tatctgtttc cctgaactaa ggggcacaga aatggctact actacttagg gtcaaaatag 901 tgactacagc tcaggacaca taagcaaaac cagagccaaa gaccagggag tggtaataaa 961 ataataaaaa atcctggctc agggattcgt cccacctttc cctggtgaaa gacacacaca 1021 gcctttatat tttagtctgc cttatgcagc acaatagctg ggcagctgcc taccctccat 1081 gttgttagaa tccatttccc tatcaatagc cttgagttga tactttacaa atttccatat 1141 tccatttttg ctgttcttaa cccaatttaa cagccttctg ggccacaatc tcttggccct 1201 tagcacatgg tatctctcct ttgcccttct tctctttctt cttccttggc ttccacggaa 1261 gctcctcggt cccattctcc ttcctcatgc tctagccaag gaaacctaaa cccctcctat 1321 gtcccttctc cccagctatt agctgctggc atctttattt accaaccaaa gtaaatgggg 1381 gcagagtccc ccaggctaag ggcagattcc aaatcttaga aggcagcacg aagcagtata 1441 gtaaacagta aaagaaaaaa acgcaacacc agagtacgtt tctatgtatg ctgtccttgc 1501 tttaatgtgg agtttctgtt ttcagaaaat gctcaaattt ggttctttta gccatgtcag 1561 cgacctggag cagcattctg agtctctctg cttctgtctg taactctctg tttccttgcc 1621 tggctgactt gttccaactt tcttactctg actgtgtctg ctgcagagcc tctgttcgtt 1681 tcttcagtgt tcttgccatc tcaatcccat ctttgtctct tttctttcct ctaagaaggc 1741 ctttccagca tgggcctggg ccttcctcag cctcagacta cctcacccca acacccatgt 1801 tcatgtctct acaggttttt cattgagcaa tgtggaacag gccaagcgta tcaggcgctt 1861 caccatagcc acattgagag attttggtgt gggcaagcgt gatgtacagg agtgtatcct 1921 ggaggaggca ggctatttga tcaagacgtt gcagggcact tgtggtaagc aagagaccat 1981 taagtgtttg ggcaagagaa agaacatccc tgacacctag accctatggg ttgtggagaa 2041 ggaggacggc gaagaccgcc taccaaacca tctccagaat ctggtgctga gagattggtg 2101 cctcactcca attcccacac catctgctaa ctcttctccc tcataatgcg aatgtcatcc 2161 aaacaatgtc acccctctca ggagccccca ttgacccttc catctacctg agcaaaacag 2221 tctccaatgt cattaactcc attgtcttcg ggaaccgctt cgactatgag gacaaagagt 2281 tcttgtcact gttggagatg atcgatgaaa tgaatatatt tgcagcctca gccacagggc 2341 aggtaaaaga ttccagctct gccaattgtg cttataatgt cctacattgg ccataccgac 2401 aaagggcaag gactacccca acgctcatgt ccacaaacat tcccctcaaa aacagaagct 2461 cccctcaaaa ccaaccttta ccttcagaaa actgaacctt tacatcagag cccacaggag 2521 ctatccagtg ctcacaatct aatgacctct ggatatctca agggcctgag aacaaagccc 2581 tctgcttggc tctcttccct gggcaggttt cccccgctta aattctgaca gatcctctgt 2641 gtggtcgtcc tgaaagttga gacacctgcc caagggagac aagtgatcac ctcaggcccc 2701 ctcctccaat cctgagcacc tacctggttc tgcaaaacta tggccagtaa agtcattcac 2761 actggacaca ctgctctccc aaaagatctc actggcacca tgacacgaga gtcacctgct 2821 tgtctcaggt aaattcagga atgagtagac aggaacctca accaaggcaa ccaagcacag 2881 acctctagat ggactgtttc cccaaacacc catacgactg ccaaccagcc acacacagtc 2941 caattcaaaa aggtctgaca ggtgtgtccc acaccttata acccgaacca tcttatcctg 3001 aatactttac tatgtggaaa acagattcta atctcaaaca aatatcaaga gatctaaatt 3061 cagccttctt tggtgcccaa acatctaaat acttgagtca ctgtgataac cctggcctga 3121 acacaggaaa cctggattaa tggtctaatc aaaaaatcaa ttgaatagtt gaatgtctgc 3181 taatgtcccc ttttgatcca gctcatccag attgtaggac aatgaccctc attctttaaa 3241 tcaactagaa aattgcagtc tctggggctt cagactgttc agtagtttaa gagcatgtac 3301 tgctcatcct gaggacctga gttcagttcc cagtacgtat gctggacatt gcacagctca 3361 aggggagtac acctgcactc gtgcacataa ttaaaagtaa aatattcaaa tgaatataaa 3421 gagttctttc aagagtggag gtgctgtttg ttgcaattca tcctaacata aatacatgaa 3481 cacctggatg aatgacttaa tacaagtgcc actcccactc aatgttgcca ctgacaagcc 3541 ttttcttttc tcctcccacc ccccagctct atgacatgtt ccattcagtg atgaagtacc 3601 tgcctggacc acagcaacag atcatcaagg ttactcagaa actggaagac ttcatgatag 3661 agaaagtgag gcagaaccat agtaccctgg accccaattc cccaaggaac ttcattgact 3721 cctttctcat ccgcatgcaa gaggtgatcc caatcatggt ggatggaatg tctaagactg 3781 agcagctgga aatcacccta gaaaaggagg aggaatataa gcccattaag tgcccatgat 3841 tctcctcaca gtcccggtta tagttaaacc tcactctttc acctgttgag ccttatccaa 3901 gccagggtat gggttagcaa attaccatga caaccgatat tccagtgttc ccctatgaga 3961 cactgttttc agtattcaac tacttagcat gcactgaagc aactgtcgaa gaccctgtgg 4021 agcctaaatt tcgcaaggag ggaaagtgtg cccagacttg catgctaact tcatgcagac 4081 agaaaactgc ttgcctctat ggctctcagg attttactat tagccacctg gactctagca 4141 tttcatatct ctgttagaaa atacatatca atacacaacc ctgaactggg caacctgggt 4201 tgttgtattt tttcttctat tatctgctct agtaattatg tattgttttt tattttaatg 4261 ttgtttttct tttttttttc atctttatta aattgaagat ttcttattta catttaaatt 4321 gttattcccc ttcccggttt ccaggccaac attctctaac ccctcccctt ccccttctat 4381 atgggcttcc ccttcatatc ctccccccat taccaccctt cccccaacaa tcacgttcac 4441 tgggtgttca gtcttggcag gacccggggc ttccccttcc actggtgctc ttacaagcct 4501 cattgcttcc tatgaggttg gagcccaggg tcagtccatg tgtagtcgtc gggtagtggc 4561 ttagtccctg gaagctctgg ttgcttagca ttgttgttca tatagggtct cgaccccttc 4621 aagctcttac actcctttcg ctgattcctt caacgggggt cccgttctca gttcagtggt 4681 ttgctcctgg catttgccta tgtatttgct gtattctggc tgtgtctctc aggagagatc 4741 cgttgacctg cag // LOCUS RATCYP2A23 5080 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat hepatic steroid hydroxylase IIA2 (CYP2A2) gene, exons 6,7,8 and 9. ACCESSION M34392 KEYWORDS cytochrome P450; hepatic steroid hydroxylase IIA2. SEGMENT 3 of 3 SOURCE Rat (strain Sprague Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5080) AUTHORS Matsunaga,T., Nomoto,M., Kozak,C.A. and Gonzalez,F.J. TITLE Structure and in vitro transcription of the rat CYP2A1 and CYP2A2 genes and regional localization of the CYP2A gene subfamily on mouse chromosome 7 JOURNAL Biochemistry 29, 1329-1341 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 1322 1460 hepatic steroid hydroxylase IIA2 (CYP2A2), exon 6 1876 2063 hepatic steroid hydroxylase IIA2, exon 7 2496 2637 hepatic steroid hydroxylase IIA2, exon 8 3636 3817 hepatic steroid hydroxylase IIA2, exon 9 pre-msg < 1 4588 CYP2A2 mRNA and introns IVS < 1 1321 CYP2A2 intron E IVS 1461 1875 CYP2A2 intron F IVS 2064 2495 CYP2A2 intron G IVS 2638 3635 CYP2A2 intron H BASE COUNT 1470 a 1191 c 970 g 1449 t ORIGIN About 15 kb after segment 1. 1 gaattctttg tatatattgg acaatagccc tctatcagat gtacaattgg taaagagctt 61 ttcccaatct gttggttgtc gttttgtcct aataacagtg tcctttgcct tacagaagct 121 ttgcaatttt atgaagtccc atttgttgat tcttgatctt agagcataag cccttggtgt 181 tctcttcagg aaattctccc atgtgccctg tgttcaaggc tcttacccgc tttctcttct 241 attagtttca gtgcatctgg ttttatttta attttgtttt atttttcttg tatatttttg 301 tacttacact tcaaatgcta tctcctttgt acattctctg atatctcctc cctgtcccca 361 tgcttctatg aggatgctct cacttccacc cacccactcc cacctcaatg ccttgacatt 421 cacctacatt ggggaaatgg gcctttactg gaccaaggac ttttcctcct attaatgatg 481 gacaatgcca tcctctgcta tatatacagc tgaagccatg cttccctcca tttgtactct 541 ttggttgggg gtttagtctc tgggagctct gagggaagag tctggttggt tgataatttt 601 gctcttccag ccatgaaatg aaagacagtc acctatacag agaaacaagc aaagcttctc 661 ctgcaaacca aagattccaa acacaacctg gacattgctt ttccaaccat tggtctggac 721 actttgagaa ctagatacaa agaaaattcc agaagtgctg ccacttgggt ccatttctga 781 ggaatttaat ccacagttga tggctgctta gagatgatga aatcatattc ctttgcagtg 841 tggctactag taaattgccc tttctcaagt gaagaaccac tcacccatat gcatgcagcc 901 acacctaatt ataagcagat ctccccccaa ataaaaacag gaaaatatga ggaagactta 961 ttagaaatta gaaatggttc aataaaataa aaatagagat aatggagggg aatatgttta 1021 aggtgcattt cacatatatg tctgaaaaat gaagactcaa gattcagtgg gtatggaatg 1081 ggattcatct gggagggctt gagggagggg tgtgaatgta ttcacagtac aataaatgaa 1141 attctcaaag aaataataaa aatatttata caataatgac tagaaatgtt ttagaaaatt 1201 aaaaccctta gtgttcccca aaaggagtac aaaatgataa atagatttgc gttctctctc 1261 tctctgtctc tgtctctgtc tctgtctctc tgtctctctc tctctctctc tcccccccca 1321 ggagaaatat gttaattcag aattccacat gaacaaccta gtgatgtcat cattaggcct 1381 cctctttgct gggactgggt cagtcagctc cacgctatac catggtttcc tgctactcat 1441 gaagcatcca gatgtggaag gtgaggctgg ctgtgtggca aggaagttgg gaaccccaga 1501 ttctccaacc tgacaatgac cctcacctct cccagatccc tggatgctca gacatcctga 1561 ctatgcagac acagaggcat attaaatgca taaacagagt actaagttaa aatattaaac 1621 attctgaaat tgatttccca ctgactgcca gatccctgtt ctctgttccc tgacttctcc 1681 ttctccccac catgatttgg tcatgaaaag gataaaatga tcctggccag catttaggta 1741 tggatgtatg tatagatggt ctaaatgcat gtttacagag acatgtaata catacagtgg 1801 tacacatgtg aactattcca catgctttga ggtctctgga tttttagaaa cagcccatct 1861 tcctttgtct tccagccaag gtccatgagg aaattgagcg agtgatcggc aggaaccgac 1921 agcctcagta tgaggaccac atgaagatgc cctacaccca ggctgtgatc aatgagatcc 1981 aaagattttc taacttggct cccttgggca ttcctcgaag gattatcaag aacacaacct 2041 tccgtggctt cttcctcccc aaggtgcagc caggcccacc caagtagggg cctccaaccc 2101 actccctgat gcttcagggc ctctttccat ctacagccat ctaactcaac tctaattcct 2161 ccaaccaaag aattcaccca catgtcccca acttcttgtc acactgcttt gaactccaag 2221 ttctatctga tcttctgcct tactactatc caatctctca actcctgggc taacacacta 2281 acacattatc tcagaacatg attcccctag agctcaaatc tccaatttct ggtggcacgc 2341 atcacagccc ctcaaaactc ctattcccta atgccctttc ctcaggagac ccccaactct 2401 gtgcctttcc gttctcttca tttggacact agcaccactt ggggtccttt ctccatcaac 2461 ccatcttctc aaatttcctt tctttcctct tccagggcac cgatgtattc cctataatag 2521 gttctctgat gacagaacca aagttcttcc ctaaccacaa agacttcaac ccccagcact 2581 tcctggatga caagggacag ttgaagaaga atgctgcatt tctccctttt tccattggta 2641 aggagacagt gggttattag accactgctc ataccaacag ggataactca tgccagttcc 2701 catctctgtg attctgccta gcatcaggct aaccaggtac aatccctgca cctcccaagc 2761 accacgactc aggtcaaagt atcaatgaga tcagtgatct ctttcagaga ctgggaagcg 2821 gttcagaaca ccaaatttcc caggtcatgc tcatgcaagc aatttcttca tactcttttt 2881 aaagcagttt taaatgattt ttttgttatt ttttaataat tcatctaatg tgcattggtg 2941 tgaggttgtc agattcatta gaactggact tatagacatt ttatctgcca tgtgggtgct 3001 gagaattgaa ccttggttct tcagaagagc agacagtgct cttaaccagt gagccatctc 3061 ccagccccat attcaaattt taaaagggga taacaaccag gtggtggtgg tacatgtctt 3121 taaacccagt actcaagaag cagaagcagg tggatatcta agttcaatgc cagctggatc 3181 tatagagtaa gttagaagaa aacccagact aaatggagga aaccctgact taaaaaacta 3241 aaaataaata aataatagat agatagatgc atgcatgtat acatacatat atgcatacct 3301 acatgcatgc atacatagat acatagatga ctcagagata attagatgaa taaataaata 3361 aacaagacca cagcaggcat ccacatctga gaataaaatt aataattggt agaggaagca 3421 tctggactcc atattgcttc agcctacaat gagttgcccc actttgtgtg tagggacact 3481 ggggttctga gagggttagg aacctttcct aatgatcact catgctccag gttagcaccc 3541 cttttcccta agagaacaag gctgctcact gggtactgag ggaaagaagt gagatcttgc 3601 tccaagtctg tgctccttac ttctctcctc tttaggaaag cgattctgct tgggagatag 3661 cctggctaaa atggagctct tcctgctgct caccaccatc ttgcagaact tccgttttaa 3721 gttcccaatg aatctagaag acatcaacga gtaccccagt cccatagggt ttaccaggat 3781 cataccaaat tacaccatga gcttcatgcc catctgattc tgagttgaat caaggtgggg 3841 caagagggag ggagagcctg aagtggggcc agggtgcagg tggagagaac agagaagatg 3901 aagatgaggg ttaagaaggg accacaccca tggaagaaac acaaaagact tctcagtttg 3961 gtaaaattgt aacagtccta ataaaaagaa agaaacaccc agtaggcagc agtaacaaca 4021 actgagactc atggggcaaa ggtggctcac ctctgcagaa gctgtcctgc ccttctctca 4081 ctcagtcctc tacacaagag cagcatgtcc ccaagcccaa cgtacaggtt caaaagatag 4141 aacttaaaaa atttgaacct aaactgaggt ggaaaagaca cagttagcta ggattgacac 4201 attggactct atcaccagca ttcaggaggg agggaacatg gctccctagg aggcctgcca 4261 gaattacaaa gtgaaactca tctcaaaaaa ggaacaacag aaaataaaat ttcaaattga 4321 tttctcttag accataagag tccagatctg tatccaaagc tatttggtta tattttttgt 4381 tattgttgtt ttgtttacac attgtgtttt tctttcggtt tgtaagtctg tttgggatat 4441 ttaatttaca tttactgatt agtgtgggtg gtagggcata ccatggctca aatgtggaaa 4501 ccaaagaaaa gcttttggaa gtgtcatctc ccttacaata cgtgtgtcca agaactcaaa 4561 ttcagacaat aaagcttgat agcaagcact tctacctact gagacatcta actggccaat 4621 ttagggagtt tattttaatt tatttactta ctaatttata tgaatataag tcctctatct 4681 gcatggccac ctgcgtggca gacgaaggca tcagatcact ttacagaagg ttgagtccac 4741 ccagtggtgg atggaaattg aactcaggac ttctagaagc cgtcaaattt tgagccacct 4801 cttcaacccc ttaaacaagt ttcttaaggt caccctttcc tcaaatgaaa caacaaggac 4861 ttggaatatt ttaacataac ctgagtcctc ctacctgagg tgttgtttct acaagcctgg 4921 caggcaactg atctacctcc aacatacact ttccaacagt cttgctttct catccacacc 4981 ttaatcacct gacacctgtt ggcctcagcc cctgtgccag gtaagtccat tttgtctgac 5041 tcagtcagtc tgggagacaa aaatcccttt gacagaattc // LOCUS ECOUGRE 108 bp ds-DNA SYN 12-JUL-1990 DEFINITION Synthetic uteroglobin (UG) mRNA expressed in E.coli, 5' end. ACCESSION M34596 KEYWORDS uteroglobin. SOURCE E.coli DNA, clone pLE103-1. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 108) AUTHORS Miele,L., Cordella-Miele,E. and Mukherjee,A.B. TITLE High level bacterial expression of uteroglobun, a dimeric eukaryotic protein with two interchain disulfide bridges, in its natural quaternary structure JOURNAL J. Biol. Chem. 265, 6427-6435 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 91 > 108 synthetic uteroglobin binding 77 81 ribosomal binding site (put.) signal 10 26 phi-10 promoter BASE COUNT 37 a 23 c 19 g 29 t ORIGIN 1 gatccaaatt aatacgactc actataggga gaccacaacg gtttccctct agaaataatt 61 ttgtttaact ttaagaagga gatatacacc atggctgcag ccaagctt // LOCUS HCVCG3PE 12283 bp ss-RNA VRL 12-JUL-1990 DEFINITION Hog cholera virus polyprotein mRNA, complete cds. ACCESSION M31768 KEYWORDS envelope glycoprotein E1. SOURCE Hog cholera virus (strain Brescia), cDNA to viral RNA, passed in SK-6 cells. ORGANISM Hog cholera virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae; Mucosal disease virus group. REFERENCE 1 (bases 1 to 12283) AUTHORS Moormann,R.J.M., Warmerdam,P.A.M., van der Meer,B., Schaper,W.M.M., Wensvoort,G. and Hulst,M.M. TITLE Molecular cloning and nucleotide sequence of Hog cholera virus strain Brescia and location in the genome of the sequence encoding envelope protein E1 JOURNAL Virology 177, 184-198 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Moorman,R.J.M., 01-FEB-1990, for release after publication. FEATURES from to/span description pept 361 12057 hog cholera virus protein precursor matp 2428 3538 envelope glycoprotein E1 (put.) BASE COUNT 3850 a 2559 c 3182 g 2692 t ORIGIN 1 agttcattct cgtgtacatg attggacaaa tcaaaatctc aatttggttc agggcctccc 61 tccagcgacg gccgagctgg gctagccatg cccacagtag gactagcaaa cggagggact 121 agccgtagtg gcgagctccc tgggtggtct aagtcctgag tacaggacag tcgtcagtag 181 ttcgacgtga gcagaagccc acctcgagat gctatgtgga cgagggcatg cccaagacac 241 accttaacct agcgggggtc gttagggtga aatcacacca tgtgatggga gtacgacctg 301 atagggtgct gcagaggccc actattaggc tagtataaaa atctctgctg tacatggcac 361 atggagttga atcattttga acttttatac aaaacaaaca aacaaaaacc aatgggagtg 421 gaggaaccgg tatacgatgt aacggggaga ccattgtttg gagacccaag tgaggtacac 481 ccacaatcaa cattgaagct accacatgat agggggagag gcaacatcaa aacaacactg 541 aagaatctac ctaggagagg tgactgcagg agtggcaacc acctaggccc ggttagtggg 601 atatatgtaa agcccggccc tgtcttttat caggactaca tgggcccagt ctatcataga 661 gcccctctgg agttttttga cgaagcacag ttctgtgagg tgaccaaaag gataggtagg 721 gtgacaggta gtgacggaaa gctttaccat atatacgtgt gcatcgatgg ttgcatcctg 781 ctgaagctag ccaagagggg cgagccaaga accctgaagt ggattagaaa tctcaccgac 841 tgtccattgt gggttaccag ttgttctgat gatggtgcaa gtgcaagtaa agagaagaaa 901 ccagatagga tcaacaaggg taaattaaag atagccccaa aagagcatga gaaggacagc 961 aggactaagc cacctgatgc tacgattgta gtggaaggag taaaatacca ggtcaaaaag 1021 aaaggtaaag ttaagggaaa gaatacccaa gacggcctgt accacaacaa gaataaacca 1081 ccagaatcta ggaagaaatt agaaaaagcc ctattggcat gggcagtgat agcaattatg 1141 ttataccaac ctgttgcagc cgaaaatata actcaatgga acctgagaga caacggtacc 1201 aatggtatcc agcacgctat gtaccttaga ggagtcagca gaagcttgca tgggatctgg 1261 ccagaaaaaa tatgcaaagg agtccccacc tacctggcca cagacacgga actgagagaa 1321 atacagggaa tgatggtagc cagcgagggg acaaactata cgtgctgtaa gttacagaga 1381 catgaatgga acaaacatgg atggtgtaac tggtataaca tagacccctg gatacagtta 1441 atgaatagaa cccaagcaaa cttggcagaa ggccctccga gcaaggagtg cgccgtgact 1501 tgcaggtacg ataaaaatgc tgacattaac gtggtcaccc aggccagaaa caggccaacc 1561 accctaactg gctgcaagaa agggaccaat ttttcttttg cgggtacagt tatagagggc 1621 ccatgtaatt tcaacgtttc tgtcgaggat atcttatatg gggatcatga gtgtggcagt 1681 ctactccagg atacggctct atacctagta gatggaatga ccaacactat agagagagcc 1741 aggcagggag ccgcgagggt gacatcttgg ctagggaggc aactccgcat tgccgggaag 1801 aggttggagg gcagaagcaa aacctggttt ggtgcctatg ccctatcacc ttattgtaat 1861 gtgacaacga aaatagggta catatggtac actaacaact gtaccccggc ttgcctcccc 1921 aaaaatacaa agataatagg ccccggtaaa tttgacacta acgcggaaga cggaaagatt 1981 ctccatgaga tggggggcca cctatcagaa tttctgctgc tctctctggt cgttctgtct 2041 gacttcgccc ctgaaacagc cagcgcgtta tacctcattt tgcactacgt gatccctcaa 2101 tcccatgaag aacctgaagg ctgtgacaca aaccagctga atttaacagt ggaactcagg 2161 actgaagacg tgataccatc atcagtctgg aatgttggca aatatgtgtg tgttagacca 2221 gactggtggc catatgaaac caaggtggct ttgttatttg aagaggcagg acaggtcgta 2281 aagttagcct tgcgggcact gagggattta accagggtct ggaatagcgc atcaaccacg 2341 gcattcctca tctgcttgat aaaagtatta agaggacagg tcgtgcaagg tgtgatatgg 2401 ctgttactgg taactggggc acaaggccgg ctagcctgca aggaagatca caggtacgct 2461 atatcaacaa ccaatgagat agggctacat ggggccgaag gtctcactac cacctggaaa 2521 gaatacaacc acaatttgca actggatgat gggaccgtca aggccatctg catggcaggt 2581 tcctttaaag tcacagcact taatgtggtt agtaggaggt atctggcatc attacataag 2641 gacgctttac ccacttccgt gacattcgag ctcctgttcg acgggaccag cccattgacc 2701 gaggaaatgg gagatgactt cgggttcgga ctgtgtccgt atgatacgag ccctgtagtc 2761 aagggaaaat acaacacaac cttgttgaat ggtagtgcat tctacctagt ttgcccaata 2821 gggtggacgg gtgttataga gtgcacggca gtgagcccga caactctgag aacagaagtg 2881 gtaaagacct tcagaagaga gaaacccttt ccgtacagaa gggattgtgt gaccactaca 2941 gtggaaaatg aagatctatt ctactgtaaa tgggggggca attggacatg tgtgaaaggt 3001 gaaccagtga cctacacggg ggggccagta aaacaatgca gatggtgtgg cttcgacttc 3061 aatgagcctg acggactccc acactacccc ataggtaagt gcattttggc aaatgagaca 3121 ggttacagaa tagtggattc aacggactgt aacagagatg gcgttgtaat cagcacagag 3181 gggagtcatg agtgcttgat tggtaacaca actgtcaagg tgcatgcatt agatgaaaga 3241 ctaggcccta tgccatgcag gcctaaggag atcgtctcta gtgcgggacc tgtaaggaaa 3301 acttcctgta cattcaacta cgcaaaaact ctgaggaaca ggtattatga gcccagggac 3361 agctatttcc aacaatatat gctcaagggc gagtatcagt actggtttga tctggatgtg 3421 accgaccgcc actcagatta cttcgcagaa ttcattgtct tggtggtggt ggcactgttg 3481 ggaggaagat atgtcctgtg gctaatagtg acctacatag ttctaacaga acaactcgcc 3541 gctggtctac agttaggcca gggtgaggta gtgttaatag ggaacttaat cacccacaca 3601 gatattgagg ttgtagtata tttcttactg ctctatttgg tcatgagaga tgagcctata 3661 aagaaatgga tactactgct gttccatgct atgaccaaca atccagttaa gaccataaca 3721 gtggcactgc tcatggttag cggggttgcc aagggtggaa agatagatgg tggttggcag 3781 cggctgccgg agaccaactt tgatatccaa ctcgcgctga cagttatagt agtcgctgtg 3841 atgttgctgg caaagaaaga tccgactacc gtccccttgg ttataacggt ggcaaccctg 3901 agaacggcta agataactaa tggacttagt acagatctag ccatagctac agtgtcaaca 3961 gctttgctaa cctggaccta cattagtgac tattataaat acaagacctt gctacagtac 4021 cttattagca cagtgacagg tatcttcttg ataagggtac tgaagggggt aggtgagtta 4081 gatttacaca ccccaacctt accatcttac agacccctct tcttcatcct cgtgtacctc 4141 atttccactg cagtggtaac aagatggaat ctggacatag ccggattgct gctgcagtgt 4201 gtcccaaccc ttttaatggt tttcacgatg tgggcagaca tccttaccct gatcctcata 4261 ctgcctactt acgagttgac aaaactatat tacctcaagg aagtgaagat tggggcagaa 4321 aggggctggt tgtggaagac caacttcaag agggtaaatg acatatacga agttgaccaa 4381 gctggtgagg gggtgtacct tttcccatca aaacaaaaga caggtacaat aacaggtact 4441 atgttgccac tgatcaaagc catactcata agttgcatca gcaataagtg gcaatttata 4501 tatctattgt acttgatatt cgaagtgtct tactaccttc acaagaagat catagatgaa 4561 atagcaggag ggaccaactt catctcgaga cttgtagccg ctctgatcga agccaattgg 4621 gcctttgaca acgaagaagt tagaggttta aagaagttct tcttgctgtc tagtagggtt 4681 aaagaactga tcatcaaaca caaagtgagg aatgaagtga tggtccactg gtttggcgac 4741 gaagaggtct atgggatgcc gaagctggtt ggcttagtca aggcagcaac actgagtaaa 4801 aataaacatt gtattttgtg caccgtctgt gaaaacagag agtggagagg agaaacctgc 4861 ccaaaatgcg gccgttttgg gccaccagtg acctgtggca tgaccctagc cgactttgaa 4921 gaaaaacact ataagaggat tttctttaga gaggatcaat cagaagggcc ggttagggag 4981 gagtatgcag ggtatctgca atatagagcc agagggcaat tattcctgag gaatctcccg 5041 gtgctagcaa caaaagtcaa gatgctcctg gtcggaaatc ttgggacgga ggtgggggat 5101 ttggaacacc ttggctgggt gctcagaggg cctgccgttt gcaagaaggt taccgaacat 5161 gagaaatgca ccacatccat aatggacaaa ttaactgctt tcttcggtgt tatgccaagg 5221 ggcaccacac ctagagcccc tgtgagattc cccacctctc tcttaaagat aagaaggggg 5281 ctggaaactg gctgggcgta cacacaccaa ggtggcatca gttcagtgga ccatgtcact 5341 tgtgggaaag acttactggt atgtgacact atgggccgga caagggttgt ttgccaatca 5401 aataacaaga tgacagacga gtccgagtat ggagttaaaa ctgactccgg atgcccggag 5461 ggagctaggt gttacgtgtt caaccgagag gcagttaata tatccgggac taaaggagct 5521 atggtccact tacaaaaaac tggaggagaa ttcacctgtg tgacagcatc agggactccg 5581 gccttctttg atctcaagaa cctcaaaggc tggtcagggc taccgatatt tgaggcatca 5641 agtggaagag tagtcggcag ggttaaggtc gggaagaatg aggactctaa accaaccaag 5701 cttatgagtg gaatacaaac agtctccaaa agtaccacag acttgacaga aatggtaaag 5761 aaaataacaa ccatgaacag gggagaattc agacaaataa cccttgccac aggtgccgga 5821 aaaaccacgg aactccctag atcagtcata gaagagatag gaaggcataa gagggtcttg 5881 gtcttgatcc ctctgagggc ggcagcagag tcagtatacc aatatatgag acaaaaacac 5941 ccaagcatag cattcaactt gaggataggg gagatgaagg aaggggacat ggccacaggg 6001 ataacctatg cctcatatgg ttacttctgt cagatgccac aacctaagct gcgagccgcg 6061 atggttgagt actccttcat attccttgat gagtaccact gttccacccc cgaacaattg 6121 gctatcatgg gaaagatcca cagattttca gagaacctgc gggtagtagc catgaccgca 6181 acaccagcag gcacggtaac aactacaggg caaaaacacc ctatagaaga atacatagcc 6241 ccagaagtga tgaaggggga agacttaggt ccagagtact tggacatagc tggactaaag 6301 ataccagtag aggagatgaa gagtaacatg ctggtctttg tgcccacaag gaacatggct 6361 gtagagacgg caaagaaact gaaagctaag ggttataact caggctacta ttatagtgga 6421 gaggatccat ctaacctgag ggtggtaaca tcacagtccc cgtacgtggt ggtagcaacc 6481 aacgcaatag aatcaggtgt tactctccca gacttggatg tggtcgtcga cacagggctt 6541 aagtgtgaaa agaggatacg gctgtcacct aagatgccct tcatagtgac gggcctgaag 6601 agaatggctg tcacgattgg ggaacaagcc cagagaaggg ggagagttgg gagagtgaag 6661 cctgggagat actacaggag tcaagaaacc cccgttggtt ccaaagatta ccattacgac 6721 ctactgcaag cacagaggta cggtatagaa gatgggataa acatcaccaa atcttttaga 6781 gagatgaatt atgattggag cctttatgag gaggatagtc tgatgattac acaattggaa 6841 atcctcaaca atctgttgat atcagaagag ctaccaatgg cagtaaaaaa tataatggcc 6901 aggactgacc acccagaacc aatccaactg gcgtacaaca gctacgaaac gcaggtgcca 6961 gttctattcc caaaaataaa aaatggagag gtgactgaca gttacgataa ctataccttc 7021 ctcaacgcaa gaaagctggg ggatgatgta ccaccctacg tgtatgccac agaggatgag 7081 gacttagcgg tagagctgct gggcttagac tggccggacc ctgggaacca aggaaccgtg 7141 gaggctggta gagcactaaa acaagtagtt ggtctatcaa cagctgagaa cgccctgtta 7201 gtagctttat tcggctatgt aggatatcag gcactctcaa agaggcatat accagtagtc 7261 acagacatat attcaattga agatcacagg ttggaagaca ccacacacct acagtatgcc 7321 ccgaatgcta tcaagacgga ggggaaggag acagaattga aggagctagc tcagggggat 7381 gtgcagagat gtatggaagc tatgactaat tatgcaagag atggcatcca attcatgaag 7441 tctcaggcac tgaaagtgaa agaaaccccc acttacaaag agacaatgga caccgtggcg 7501 gactatgtaa agaagttcat ggaggcactg gcggacagca aagaagacat cataaaatat 7561 gggttgtggg ggacgcacac agccttatat aagagcatcg gtgctaggct tgggaacgag 7621 actgcgttcg ctaccctggt cgtgaaatgg ctggcatttg ggggagaatc aatagcagac 7681 catgtcaaac aagcggccac agacttggtc gtttactata tcatcaacag acctcagttc 7741 ccaggagaca cggagacaca acaggaagga aggaaatttg tagccagcct actggtctca 7801 gccctggcta cttacactta caaaagctgg aattacaata atctgtccaa gatagttgaa 7861 ccggctttgg ctactctgcc ctatgccgcc acagctctca agctattcgc ccccactcga 7921 ttggagagcg ttgtcatact gagtaccgca atctacaaaa cctacctatc aatcaggcgc 7981 ggaaaaagcg atggtttgct aggcacaggg gttagtgcgg ctatggaaat catgtcacaa 8041 aacccagtat ctgtgggtat agcggtcatg ctaggggtgg gggccgtagc ggcccacaat 8101 gcaatcgaag ccagtgagca gaagagaaca ctactcatga aagtttttgt aaagaacttc 8161 ttggatcagg cagccactga tgaattagtc aaggagagcc ctgagaaaat aataatggct 8221 ttgtttgaag cagtgcagac agtcggcaac cctcttagac tggtatacca cgtttacgga 8281 gttttttaca aagggtggga ggcaaaagag ttggcccaaa ggacagccgg taggaatctt 8341 ttcactttga taatgtttga ggctgtggaa ctactgggag tagatagcga aggaaagatc 8401 cgccagctat caagcaatta catactagag ctcctgtata agttccgtga cagtatcaag 8461 tccagcgtga ggcagatggc aatcagctgg gcccctgccc cttttagttg tgattggaca 8521 ccgacggatg acagaatagg gcttccccaa gataatttcc tccgagtgga gacaaaatgc 8581 ccctgtggtt acaagatgaa agcagttaag aattgtgctg gggagttgag actcttagag 8641 gaggaaggct catttctctg caggaataaa ttcgggagag gttcacggaa ctacagggtg 8701 acaaaatact atgatgacaa tctatcagaa ataaagccag tgataagaat ggaaggacat 8761 gtggaactct actacaaggg agccactatt aaactggatt tcaacaacag taaaacaata 8821 ttggcaaccg ataaatggga ggtcgatcac tccactctgg tcagggtgct caagaggcac 8881 acaggggctg gatattgtgg ggcatacctg ggtgagaaac cgaaccacaa acatctgata 8941 gagagggact gcgcaaccat caccaaagat aaggtttgtt ttctcaagat gaagagaggg 9001 tgtgcattta cttatgactt atcccttcac aaccttaccc ggctgattga attggtacac 9061 aagaataact tggaagacaa agagattcct gccgttacgg tcacaacctg gctggcttac 9121 acatttgtaa atgaagatat agggaccata aaaccagcct tcggggagaa aataacacca 9181 gagatgcagg aggagataac cttgcagcct gctgtattgg tggatgcaac tgacgtgacc 9241 gtgaccgtgg taggggaaac ccctactatg actacagggg agaccccaac aacgttcacc 9301 agctcaggtc cagacccgaa aggccaacaa gttttaaaac tgggtgtagg tgaaggccaa 9361 taccccggga ctaatccaca gagagcaagc ctgcacgaag ccatacaaag cgcagatgaa 9421 aggccctctg tgctgatatt ggggtctgat aaagccacct ctaatagagt gaaaactgta 9481 aagaatgtga aggtatacag aggcagggac ccactagaag tgagagatat gatgaggagg 9541 ggaaagatcc tagtcatagc cctgtctagg gttgataatg ctctattgaa atttgtagat 9601 tacaaaggca cctttttaac tagagagacc ctggaggcat taagtttggg taggccaaaa 9661 aagaaaaaca taaccaaggc agaagcacag tggttgctgc gcctcgaaga ccaaatggaa 9721 gagctacccg attggttcgc agccggggaa cccatttttc tagaggccaa tattaaacat 9781 gacaggtatc atctggtagg ggatatagct actatcaaag agaaagccaa acaattgggg 9841 gctacagact ctacaaagat atccaaggag gttggtgcaa aagtatattc tatgaaattg 9901 agtaattggg tgatgcaaga agaaaacaaa cagagcaact tgaccccctt atttgaagag 9961 ctcctacagc agtgtccacc cggaggccaa aacaaaactg cacatatggt ctctgcttac 10021 caactagctc aagggaactg gatgccaacc agctgccatg tttttatggg gaccatatct 10081 gccagaagga ctaagaccca tccatatgaa gcatatgtca agttaaggga gttggtagag 10141 gaacacaaga tgaaaacatt gtgtcccgga tcaagtctgc gtaacgacaa tgaatgggta 10201 attggcaaga tcaaatacca gggcaacctg aggaccaaac acatgttgaa ccccggcaag 10261 gtggcagagc aactgcacag agaaggacac agacacaatg tgtataacaa gacaataggc 10321 tcagtgatga cagctactgg catcaggttg gagaagttgc ccgtggttag ggcccagaca 10381 gacacaacca acttccacca agcaataagg gataagatag acaaggaaga gaatctacag 10441 accccgggtt tacataagaa actaatggaa gttttcaatg cattgaaacg acccgagtta 10501 gagtcctcct atgacgctgt ggaatgggag gaattggaga gaggaataaa cagaaagggt 10561 gctgctggtt tctttgaacg caaaaacata ggggagatat tggattcaga gaaaattaaa 10621 gtagaagaga ttattgacaa tctgaaaaag ggtagaaata tcaaatacta tgaaaccgca 10681 atcccaaaaa atgaaaagag ggatgtcaat gatgactgga ccgcaggtga ctttgtggac 10741 gagaagaaac ccagagtcat acaataccct gaagcaaaaa caaggctggc catcaccaag 10801 gtgatgtata agtgggtgaa gcagaagcca gtagtcatac ccgggtatga agggaagaca 10861 cctctgttcc aaatttttga caaagtaaag aaggaatggg atcaattcca aaatccagtg 10921 gcagtgagct tcgacactaa ggcgtgggac acccaggtga ccacaaatga tctggagctg 10981 ataaaggaca tacaaaagta ctacttcaag aagaaatggc ataaatttat tgacaccctg 11041 actatgcata tgtcagaagt acccgtaatc actgctgatg gggaggtgta tataaggaaa 11101 gggcaaagag gtagtggaca gcccgacaca agcgcaggca acagcatgct aaatgtgtta 11161 acaatggttt atgccttctg cgaggccaca ggggtaccct acaagagttt tgacagggtg 11221 gcaaaaattc atgtgtgtgg ggacgatggt ttcctgatca cagagagagc tctcggcgag 11281 aaattcgcaa gcaagggagt ccaaatcctg tatgaagctg ggaagcccca gaagatcact 11341 gaaggggaca aaatgaaagt ggcctaccaa tttgctgata ttgagttttg ctcccataca 11401 ccaatacaag taaggtggtc agataacact tctagctaca tgccagggag aaatacaacc 11461 acaatcctgg ctaaaatggc cacaaggtta gattccagtg gtgagagggg taccatagcg 11521 tacgagaaag cagtagcatt cagcttcctg ctaatgtatt cctggaaccc actaatcaga 11581 aggatttgct tattggtact atcaactgaa ctgcaagtga aaccagggaa gtcaaccact 11641 tactattatg aaggggaccc gatatctgcc tacaaggaag tcatcggcca caatcttttc 11701 gatctcaaga gaacaagctt cgagaagctg gccaagttaa atctcagcat gtccgtactc 11761 ggggcctgga ctagacacac cagcaaaaga ctactacaag actgtgtcaa tatgggtgtt 11821 aaagagggca actggttagt caatgcagac agactggtga gtagtaagac tggaaatagg 11881 tatgtacctg gagaaggcca caccctgcaa gggagacatt atgaagaact ggcgttggca 11941 agaaaacaga tcaacagctt ccaagggaca gacaggtaca atctaggccc aatagtcaac 12001 atggtgttaa ggaggctgag agtcatgatg atgaccctga tagggagagg ggtatgagtg 12061 cgggtgaccc gcgatctgga cccgtcagta ggaccctatt gtagataaca ctaatttttt 12121 atttatttag atattactat ttatttattt atttatttat tgaatgagta agaactggta 12181 caaactacct catgttacca cactacactc attttaacag cactttagct ggaaggaaaa 12241 ttcctgacgt ccacagttgg actaaggtaa tttctaacgg ccc // LOCUS HUMC6A2A1 2159 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human alpha-2 collagen type VI, alpha-2 collagen type VI-a, and alpha-2 collagen type VI-a' gene, exons 6,5,4 and 3. ACCESSION M34571 KEYWORDS alpha-2 collagen type VI; alternative splice. SEGMENT 1 of 3 SOURCE Human leukocyte DNA, clone D1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2159) AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L. TITLE Alternative splicing of the human alpha-2(VI) collagen gene generates multiple mRNA transcripts which predict three protein variants with distinct carboxyl termini JOURNAL J. Biol. Chem. 265, 6473-6480 (1990) STANDARD simple staff_entry FEATURES from to/span description pept / 75 120 alpha-2 collagen type VI, exon 6 273 425 alpha-2 collagen type VI, exon 5 590 1042 alpha-2 collagen type VI, exon 4 1307 + 1345 alpha-2 collagen type VI-a, exon 3 pept / 75 120 alpha-2 collagen type VI-a, exon 6 273 425 alpha-2 collagen type VI-a, exon 5 590 1042 alpha-2 collagen type VI-a, exon 4 1307 + 1345 alpha-2 collagen type VI-a', exon 3 pept / 75 120 alpha-2 collagen type VI-a', exon 5 273 425 alpha-2 collagen type VI-a', exon 4 590 1042 alpha-2 collagen type VI-a', exon 3 1307 + 1345 alpha-2 collagen type VI-a', exon 2 pre-msg < 1 > 2159 alpha-2cVI mRNA and introns IVS < 1 74 intron E IVS 121 272 intron D IVS 426 589 intron C IVS 1043 1306 intron B IVS 1346 > 2159 intron A BASE COUNT 391 a 653 c 749 g 366 t ORIGIN 1 tgtccggacc ccagccagac tgctgtgaac tcttctgggc ccggggactg ccctgcctgc 61 cgtgtgcatt gcaggagtgt gacgtcatga cctacgtgag ggagacctgc gggtgctgcg 121 gtgaggcact gcccacggca gggtcggggc ccatgcaccg ggtggagggc gggagtgcag 181 cagggctggg tcatcgctgg gtcctgcatg tgcacgtgac cctagggtct gaggtctccc 241 ggtacccccc gatgaccctg ccaccccccc agactgtgag aagcgctgtg gcgccctgga 301 cgtggtcttc gtcatcgaca gctccgagag cattgggtac accaacttca cactggagaa 361 gaacttcgtc atcaacgtgg tcaacaggct gggtgccatc gctaaggacc ccaagtccga 421 gacaggtcac ggggcagggc gggtgcagca ttgcgggggg ccgcgggcgc gtgggaggcg 481 atgagatggg agaagtccag acgcgtccct ccaacgaggg cctctgcatg gctggggatg 541 ccccagaccc cgaggcctct ggcaacgacc tcacgcgtgc ggcttgcagg gacgcgtgtg 601 ggcgtggtgc agtacagcca cgagggcacc tttgaggcca tccagctgga cgacgaacat 661 atcgactccc tgtcgagctt caaggaggct gtcaagaacc tcgagtggat tgcaggtggc 721 acctggacac cctcagccct caagtttgcc tacgaccgcc tcatcaagga gagccggcgc 781 cagaagacac gtgtgtttgc ggtggtcatc acggacgggc gccacgaccc tcgggacgat 841 gacctcaact tgcgggcgct gtgcgaccgc gacgtcacag tgacggccat cggcatcggg 901 gacatgttcc acgagaagca cgagagtgaa aacctctact ccatcgcctg cgacaagcca 961 cagcaggtgc gcaacatgac gctgttctcc gacctggtcg ctgagaagtt catcgatgac 1021 atggaggacg tcctctgccc gggtgtacgt gtgggcgcgg ggcagtcagg ccgaggagca 1081 gcaggcccca gccgcgtcta gcgtgaccgc cagggacacc cctcacctga gggacgaatg 1141 tgcagcccaa ggatcttggg ctgtgggtgg gaaggggtcg gcctctcggg ctgcagggca 1201 gacgcgccag ctcgaccctg agcctgtcta ggcagatcag tgaacggccg ctgagggttc 1261 gctagggact gaccctggcc tggccggcct ctctcctctc ttccagaccc tcagatcgtg 1321 tgcccagacc ttccctgcca aacaggtaat gcagggacct gagccaccac cccagactag 1381 caaagcagcc ctggtgtcct tcctcctcga gggccgggct gggggagggg ccgtgcaggg 1441 acccgggggc ggcggacgac tgcggaggct gctccttagg gagatggccc caggatggca 1501 gcacagggga ggaggggctt ggggaaggca ggctcccagg aacgcaggaa cagcatcacg 1561 aggccatgag gtgggtgctg ctagcctggc gctgtgctcg gcatgtggcc actggtcttg 1621 aaggcccacc atgggcttgc agtctccctc agctgccgcc cagctcccat gggctggccg 1681 tgcatgtgcc accggaggaa gccctggatc agtgagtgaa accatcccgg ggtggaagca 1741 ctgacacccc ccagcaccag caggtcttgc tccaaccctg gcctgcctcg atcgagctgc 1801 agctgcggct ctcatctctg ggagtggggg agcccatgtc cggatgattg gcccagcgtg 1861 gtgtgaagct ggagctgggg gtgccgttca gctgctgctg gactggtgct gcccccatgg 1921 tgcactgctg caaccgttgc tgggcccaca ggaggtcccc gggggcggtt atgtagctga 1981 gtccccctca ttgagccgtc cccttccagg agtgtgaggg tggggatgcc atggagacag 2041 ggtgggaggg tccagactga gaggaccaca gggtaggaaa cctccaaggg tctgctggta 2101 ctaagtcagc ccttctcagc actcgggatc gcgatgtgcg atcgagagtc catggggag // LOCUS HUMC6A2A2 1348 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human alpha-2 collagen type VI and alpha-2 collagen type VI-a gene, exons 2a and 2b. ACCESSION M34572 KEYWORDS alpha-2 collagen type VI; alternative splice. SEGMENT 2 of 3 SOURCE Human leukocyte DNA, clone D1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1348) AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L. TITLE Alternative splicing of the human alpha-2(VI) collagen gene generates multiple mRNA transcripts which predict three protein variants with distinct carboxyl termini JOURNAL J. Biol. Chem. 265, 6473-6480 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 437 462 alpha-2 collagen type VI, exon 2a pept + 730 1025 alpha-2 collagen type VI-a, exon 2b pre-msg < 1 1336 alpha-2cVI mRNA and introns IVS < 1 436 intron A (alt. splice site) IVS < 1 729 intron A (alt. splice site) signal 1285 1291 polyA signal BASE COUNT 238 a 457 c 398 g 255 t ORIGIN About 1.0 kbp downstream from segment 1. 1 tctggctact ggtgacacac tgctgtgcct gccctggcct tctccagaca gccctgtcca 61 cccaaagccc agccaccctg gcctgcagca ggcctgtgga gttctcagtt gcgtggggac 121 cagagggtgc tggagaaaca aaccagacgc agctgaaggc agtcagggca gggcgcaatc 181 agcgataaga gctgcatagg ggccacagcg taacctgagc tccagtcggt ggaaagaaaa 241 ggcagagacg ttgcagaggc caggtctgct caggggaaga cagttctggg tgtagaggac 301 tcacatccca gagaggctga ggaagggttt accacgcaag cttctcattc gggactcttg 361 aggggtggct ggggtcttcc tggcgacggg ctgcggcact gaagccctac tggagtttgg 421 cctgtctccg gcacaggttt ggacggagct gttttgtgct gaaaggtttt ctcggggtcc 481 gtggtgtccc ccaaaggtgc caccgtgcgg gtctcctagc tccctgccag cttcctgtcc 541 ctgtgctcac tgcccccacg cctcctgcca aggccgagcc acacacccgc tccacctgca 601 tttcctctac cgactcgcca gcccaaatgc cgctcttcac tctggcctcg ctgagcggct 661 gcccgaggag gagctctagg ccgacgccca ccgcaggcct tacagtcgtc tctggacgct 721 cccttgcaga tgcaccgtgg cctggcggcg agcccccggt caccttcctc cgcacggaag 781 aggggccgga cgccaccttc cccaggacca ttcccctgat ccaacagttg ctaaacgcca 841 cggagctcac gcaggacccg gccgcctact cccagctggt ggccgtgctg gtctacaccg 901 ccgagcgggc caagttcgcc accggggtag agcggcagga ctggatggag ctgttcattg 961 acacctttaa gctggtgcac agggacatcg tgggggaccc cgagaccgcg ctggccctct 1021 gctaaagccc gggcacccgc ccagccgggc tgggccctcc ctgccacact agcttcccag 1081 ggctgccccc gacaggctgg ctctcagtgg aggccgagag atctggaatc ggggtcagcg 1141 gggctacagt ccttccaggg gctctggggc agctcccagc ctcttcccat gctggtggcc 1201 accgtgtccc ttgctgcggc tgcatcttcc agtctctcct ccgtcttcca gtggccgctc 1261 tctttataag aaccctggtc attgaattta aggcccaccc caagtccaga atgacctcgc 1321 aagaccctta actcactccc gtctgcag // LOCUS HUMC6A2A3 1174 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human alpha-2 collagen type VI-a' gene, exon 1. ACCESSION M34573 KEYWORDS alpha-2 collagen type VI; alternative splice. SEGMENT 3 of 3 SOURCE Human leukocyte DNA, clone D1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L. TITLE Alternative splicing of the human alpha-2(VI) collagen gene generates multiple mRNA transcripts which predict three protein variants with distinct carboxyl termini JOURNAL J. Biol. Chem. 265, 6473-6480 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 140 738 alpha-2 collagen type VI-a', exon 1 pre-msg < 1 1028 alpha-2cVI mRNA and introns IVS < 1 139 intron A (alt. splice site) signal 1010 1015 polyA signal BASE COUNT 189 a 439 c 364 g 182 t ORIGIN About 2.1 kbp downstream of segment 2. 1 ctgcagaaac gccccgcaga gcccagtggt ctgtgaggtt gcaggcaggg tgcgaatgga 61 agggacaggt gcggggctgg cacctgcccg gtcctgccca cctctcctcc gcccagcccg 121 cacctgcggt ctcccacaga gctgtccgtg gcacagtgca cgcagcggcc cgtggacatc 181 gtcttcctgc tggacggctc cgagcggctg ggtgagcaga acttccacaa ggcccggcgc 241 ttcgtggagc aggtggcgcg gcggctgacg ctggcccgga gggacgacga ccctctcaac 301 gcacgcgtgg cgctgctgca gtttggtggc cccggcgagc agcaggtggc cttcccgctg 361 agccacaacc tcactgccat ccacgaggcg ctggagacca cacaatacct gaactccttc 421 tcgcacgtgg gcgcaggcgt ggtgcacgcc atcaatgcca tcgtgcgcag cccgcgtggc 481 ggggcccgga ggcacgcaga gctgtccttc gtgttcctca cggacggcgt cacgggcaac 541 gacagtctgc acgagtcggc gcactccatg cgcaacgaga acgtggtacc caccgtgctg 601 gccttgggca gcgacgtgga catggacgtg ctcaccacgc tcagcctggg tgaccgcgcc 661 gccgtgttcc acgagaagga ctatgacagc ctggcgcaac ccggcttctt cgaccgcttc 721 atccgctgga tctgctagcg ccgccgcccg ggccccgcag tcgagggtcg tgagcccacc 781 ccgtccatgg tgctaagcgg gcccgggtcc cacacggcca gcaccgctgc tcactcggac 841 gacgccctgg gcctgcacct ctccagctcc tcccacgggg tccccgtagc cccggccccc 901 gcccagcccc aggtctcccc aggccctccg caggctgccc ggcctccctc cccctgcagc 961 catcccaagg ctcctgacct acctggcccc tgagctctgg agcaagccca ataaaggctt 1021 tgaacccatt gcgtgcctgc gagcttctgt gcgcaggaga gacctcaaag gtgtcttgtg 1081 gccaggaggg aaacactgca gctgtcgctc gcccaccagg gtcaatggct cccccgggcc 1141 cagcctgacc tcctaggaca tcaactgcag gtgc // LOCUS HUMC6A2AA 888 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human alpha-2 collagen type VI mRNA, 3' end. ACCESSION M34570 KEYWORDS alpha-2 collagen type VI. SOURCE Human fibroblast, cDNA to mRNA, clone F221. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 888) AUTHORS Saitta,B., Stokes,D.G., Vissing,H., Timpl,R. and Chu,M.-L. TITLE Alternative splicing of the human alpha-2(VI) collagen gene generates multiple mRNA transcripts which predict three protein variants with distinct carboxyl termini JOURNAL J. Biol. Chem. 265, 6473-6480 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 54 alpha-2 collagen type VI BASE COUNT 136 a 329 c 249 g 174 t ORIGIN 1 gtgtgcccag accttccctg ccaaacaggt ttggacggag ctgttttgtg ctgaaaggtt 61 ttctcggggt ccgtggtgtc ccccaaaggt gccaccgtgc gggtctccta gctccctgcc 121 agcttcctgt ccctgtgctc actgccccca cgcctcctgc caaggccgag ccacacaccc 181 gctccacctg catttcctct accgactcgc cagcccaaat gccgctcttc actctggcct 241 cgctgagcgg ctgcccgagg aggagctcta ggccgacgcc caccgcaggc cttacagtct 301 tctctggacg ctcccttgca gatgcaccgt ggcctggcgg cgagcccccg gtcaccttcc 361 tccgcacgga agaggggccg gacgccacct tccccaggac cattcccctg atccaacagt 421 tgctaaacgc cacggagctc acgcaggacc cggccgccta ctcccagctg gtggccgtgc 481 tggtctacac cgccgagcgg gccaagttcg ccaccggggt agagcggcag gactggatgg 541 agctgttcat tgacaccttt aagctggtgc acagggacat cgtgggggac cccgagaccg 601 cgctggccct ctgctaaagc ccgggcaccc gcccagccgg gctgggccct ccctgccaca 661 ctagcttccc agggctgccc ccgacaggct ggctctcagt ggaggcccag agatctggaa 721 tcggggtcag cggggctaca gtccttccag gggctctggg gcagctccca gcctcttccc 781 atgctggtgg ccaccgtgtc ccttgctgcg gctgcatctt ccagtctctc ctccgtcttc 841 cagtggccgc tctctttata agaaccctgg tcattgaatt taaggccc // LOCUS PPH47CG 7726 bp ds-DNA VRL 12-JUL-1990 DEFINITION Human papillomavirus type 47 (HPV-47) +-sense strand. ACCESSION M32305 KEYWORDS . SOURCE Human papillomavirus type 47 DNA, clone pTZ18R. ORGANISM Human papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 7726) AUTHORS Kiyono,T., Adachi,A. and Ishibashi,M. TITLE Genome organization and taxonomic position of human papillomavirus type 47 inferred from its DNA sequence JOURNAL Virology 177, 401-405 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by T.Kiyono, 23-FEB-1990, for release after publication. FEATURES from to/span description pept 966 981 E1/E4 fusion protein, exon 1 3324 4000 E1/E4 fusion protein, exon 2 pept 208 678 ORF E6 pept 668 979 ORF E7 pept 966 2783 ORF E1 pept 2725 4245 ORF E2 pept 3086 4000 ORF E4 pept 4334 5890 ORF L2 pept 5903 7447 ORF L1 pre-msg 198 4465 HPV-47-1 mRNA and intron IVS 982 3323 HPV-47-1 intron pre-msg < 1 4465 HPV-47-2 mRNA and intron IVS 1359 2677 HPV-47-2 intron signal 4424 4429 polyA signal BASE COUNT 2369 a 1517 c 1727 g 2113 t ORIGIN 1 aacggtaagt ttgcattaat gtaccaggtg cggtacagat catttcacaa tggatattat 61 tgttgccaac taccatagtc ataatcaagt tcttgcctgt atcgttttcg taccttacct 121 acagtatttt atattaatat ataaataaat aaatatataa atgtgtattt atttctcagg 181 ctcagttctt tgcaattatt aagacaaatg gctcagaagg ctttggaaca gactacagtt 241 aaagaggaaa agctagaact acctactact attagaggct tagctcaatt gttagacata 301 cctttagtag attgtttgct accttgcaac ttttgtggca gatttcttga ctatttagaa 361 gtttgtgaat ttgattataa aaagcttact ttaatttgga aagactacag tgtttatgcc 421 tgctgccgtt tgtgctgctc agcaactgcc acatatgaat ttaatgtttt ttatcaacaa 481 acagtgttag gtagagatat tgagctagct acaggccttt ccatttttga gattgacata 541 aggtgtcata cctgcctgtc atttcttgac attattgaaa agttagatag ctgtggaaga 601 ggacttccct ttcacaaagt aagaaacgcc tggaagggtg tttgtaggca gtgtaagcat 661 ttttacaatg attggtaaag aggtcaccgt gcgagatatt gttctggagt taagtgaggt 721 tcaacctgaa gtattaccag ttgacctgtt ttgcgacgag gaattaccaa atgaacaaca 781 ggcggaggag gagctagaca tcgacagagt cgttttcaaa gtgattgcac cgtgcggttg 841 cagctgctgc gaggtcaagc ttcgcatttt tgtgaacgca acaaaccgtg gcatcaggac 901 atttcaggaa cttttgactg gtgatctgca gctcctctgc ccagagtgcc gtgggaactg 961 caaacatggc ggattctaaa ggtagtacat ctaaagaagg gtttggtgat tggtgtattt 1021 tggaagctga ctgtagtgat gttgaggatg atttgggaca attatttgag agagatacag 1081 actcagatat ctcggacctg ttagacaatt gtgacctgga tcagggcaat tcacgggaac 1141 tatttcatca acaggagtgt aagcaaagcg aggagcaatt acaaaaacta aaacgaaagt 1201 atcttagtcc aaaagctgtc gcgcagctta gtccgcgtct tgagtcaatt tcattgtcac 1261 ctcagcagaa atccaagaga aggctctttg cagagcaaga cagcggactc gagttaacct 1321 ttaacaatga agctgaagat gttactcctg aggtggaggt accggctata gactctcggc 1381 cggatgatga tgagggagga tcaggggatg tagatattca ttatacagca ttgttgcgtt 1441 ccagcaacca aaaggccaca ttactggcaa aattcaaaca agcgtttggg gtaggcttta 1501 atgaattgac aagacaattc aaaagctaca aaacctgctg taatcattgg gttgtatccg 1561 tatatgcagt ccatgatgat ctatttgaaa gctcaaagca gctgttgcaa cagcattgtg 1621 actatatatg ggtccgtggg atagatgcaa tgtcattata tctattgtgt tttaaggcgg 1681 gaaaaaatcg tgggacagtt cataagctaa ttaccacaat gttaaatgtg catgagcaac 1741 agatattgtc tgagcctcca aagttaagaa atacagctgc tgcattattt tggtacaaag 1801 gatgtatggg acctggagtg ttcacccacg gtccttaccc tgaatggatt gcacaattaa 1861 ccattttggg ccataagagt gctgaggcaa gtgcgtttga tctgtcagtc atggttcaat 1921 gggcatttga taacaatctg tttgaggagg cagacattgc atacggatat gcaagactgg 1981 caccagagga tagcaatgca gttgcatggc ttgcacataa taaccaagct aaatatgtta 2041 gagaatgtgc tatgatggtt cgatactaca aaaaggggca aatgagagat atgagcatgt 2101 ctgagtggat atatacaagg atacatgaag tagagggaga aggacagtgg tctagcattg 2161 ttaaattttt aagatatcaa gaaataaatt ttatttcatt tttggctgct ttaaaagatt 2221 tattacattc agtacctaaa cgcaattgta ttttattcca tggccctcca aatacaggaa 2281 agtcatcgtt tggaatgtcc ttaataaaag ttctaagggg gagagtatta tcatttgtaa 2341 actccaaaag tcagttttgg ttgcagcctc ttggagaatg taaaatagca ttattagatg 2401 atgttacaga tccatgttgg gtgtatatgg atcaatattt aagaaatggg ttagatgggc 2461 attttgtgtc tttggattgt aaatatagag cacccatgca aacaaagttt ccacctttaa 2521 tacttacatc taatattaat gtacatgcag agaccaatta tagataccta catagtagaa 2581 ttaagggttt tgaatttaaa aatccatttc ctatgaaagc agataataca cctcaatttg 2641 agttaactga ccaaagctgg aaatcttttt ttacaaggct ttggacacac ttagacctga 2701 gtgaccaaga agacgagggc gaacatggag aatctcagcg agcgtttcaa tgctctgcaa 2761 gaacagctaa tgaacattta tgaagctgca gaacagacat taaaggcaca aattttacat 2821 tggcagacat tgcgaaaaga agctgtgaca ctctactttg ctaggcagaa aggcataaat 2881 aggttgggat accaaccagt gcctgcatta gcaatatctg aggcaagggc caaagaggct 2941 atatatatgg tgttgcagtt agagtcgcta caaaaatcag cgtttgcttt ggagccttgg 3001 accttagtgg acactagtac agagactttt aagagtgctc cagaaaatca ttttaaaaag 3061 gggcctgtac ctgtggaggt gatatatgac aaagatgaag caaatgctaa tttgtatact 3121 atgtggacat ttgtgtatta catggattca gatgatgtgt ggcataagac aacaagtggg 3181 gtcaatcaaa ctggcattta ctacctatat ggaacattta aacactatta tgtgttattt 3241 gctgatgatg caaagagata tagtgctact ggagaatggg aagttaaagt taataaggaa 3301 actgtgttta ctcctgtcac tagctccaca ccaccagggt caccaggagg acaaacagac 3361 ccagacacct cctccaagac ccccaccacc accacagccg ccactgacac ctcgcccaga 3421 cgccaatcca tcaataaaca gtcacaacaa accgaaacca aacgaagagg gtacggacgg 3481 agaccatcaa gcagaacaag gcgaccgcaa acgcaccaaa ggcgatccag atccagatcc 3541 cggtcgcggt ccagttctca aacccactct tccaccacca ccaccaccac cacctacagg 3601 tccaggtcta cgtcgctcaa caagactcgt gctcgttcca ggtcaaggtc cacctccaga 3661 tctaccagca ccaccagtag aaggggaggt agagggtcat ccacaaggca aagatcgcga 3721 tcaccctcca cctacacctc aaaacggtca cgggaaggaa acacaagggg cagagggagg 3781 gggagacaag ggagagcagg gagcagtggg gggagagagc agcgacggag aaggagatca 3841 ttctcaacct cccctgactc ctccaaacga gtcagacggg agtctcctaa ataccgtggc 3901 gtgtctccta gcgaggtggg aaagcaactt cgatcagttg gtgcaaaaca ttcagggcga 3961 cttggaaggt tattggagga agctagggac cccccagtaa ttcttgtgcg aggggacgca 4021 aacacattaa aatgctttcg caacagagca aggaacaaat atagagggct ttttagatca 4081 ttcagcacta cattttcctg ggtagctgga gatagcattg agcgtctagg caggtccaga 4141 atgctcatta gcttttcctg cctcactcag agaagggatt ttgatgatgc tgtcaaatat 4201 ccaaaaggag tcgagtggtc atatggtagt cttgatagcc tttaacaagc attaacgctg 4261 ctttgctact aactgctatt aacaaccaca gctttttttt tacgtttttt tattttactg 4321 attttgtact gcaatggcgc gtgctagaag ggtcaaacgt gactctgtaa cacatatata 4381 tcagacctgc aaacaggcag gcacttgccc ctcggacgtt gttaataaag ttgagcaaac 4441 aacagttgct gacaatattt tgaaatatgg cagtgctggt gtcttttttg gaggccttgg 4501 cataggaaca ggccgaggga ctgggggtgc tactgggtac gtgccacttg gggaaggtcc 4561 tggtgtccgt gtgggaggaa ccccaacggt tgtaaggcct tctcttgttc ctgaagcaat 4621 tggaccagtt gatattttac ccattgacac aatcgcacct gtcgagccta ctgcttcatc 4681 tttagtccca ttaacagagt cgtctggtgc tgatttactt cccggtgaag ttgaaactat 4741 agccgaaata catcctattc ctgaaggtcc gacaatcgac tcccctgtag tcaccacaac 4801 gacaggttcc agtgctgttc tggaagtggc tccagaacct gtacccccta cacgtgttag 4861 aattgctaga acacaatatc ataatccctc ttttcagata ctcactgaat caacacctgc 4921 gcagggcgag agttctcttg ctgaccatat tttggtcacc tcagggtctg gtggacaaag 4981 gataggcggt gatataacag acgaaattga acttactgag tttccaagca gatatacatt 5041 tgaaatagaa gaacccaccc ctccacgaaa aagtagcaca ccattacaaa ctgtagcctc 5101 tgcagtaagg cgacggggct tctcattaac aaatagaaga ttggtacaac aagtagctgt 5161 agacaatcct ttatttttaa gtcaaccttc taagatggta agattctcat ttgacaatcc 5221 agcttttgaa gaagaggtta ccaatatttt tgaacaggat gttaacagct ttgaagaacc 5281 tccagacagg gattttcttg atattaaaca attgggccgt cctcaatatt ctacaacacc 5341 agcaggttat attagggtaa gcagactagg aactcgaggc accattcgca ctcgttctgg 5401 tgcacaaata ggttctcagg tacactttta tagagattta agttctataa atactgagga 5461 tccaatagaa ctacagcttt tagggcagca ttctggagat gctactattg ttcaaggtcc 5521 tgtagaaagc acatttatag atatggacat tgctgaaaac cctttatctg aaacaataga 5581 tgcttcatct aatgatttac ttttggatga gactgtggag gattttagtg ggtcccaatt 5641 agtaattgga aatcgaagga gtacaacatc atatactgtt cccagatttg agactactag 5701 aagtagttcc tattatgttc aagacacaga tggttattat gttgcttacc cagagtcacg 5761 ggacactatt gatattattt accctacacc tgaattacct gtagttgtca ttcacaccca 5821 tgacaattct ggagactttt acttacatcc tagtcttaga aggcgtaagc gtaaaagaaa 5881 atatttgtga tttgcattgc agatggcagt gtggcactcg gctaacggta aagtatacct 5941 tcctccatca acaccagtgg ccagggttca aagcacggat gaatacatac aaaggactaa 6001 tatctattat catgcaaata ctgaccgcct tttaacagta ggacatccat atttcaatgt 6061 atacaataat aatggaacta cattagaggt tccaaaagta tcaggtaatc agcatagggt 6121 gtttcgctta aaattgccag atcctaatag atttgctcta gcggacatgt ctgtatacaa 6181 ccctgacaaa gaacgcttgg tgtgggcctg caggggtcta gaaattggaa ggggtcaacc 6241 tttaggtgtt ggcagtactg gtcacccata ttttaataag gtaaaagata cagaaaacag 6301 taattcctat atcacaaact caaaagatga cagacaagac acctcttttg atcctaaaca 6361 aatacagatg tttattgtgg gctgcactcc atgtattggc gaacactggg ataaggcaga 6421 gccttgtggg gaacagcaaa ctggtctttg tcctcctatt gaattaaaaa acacatacat 6481 tcaggatggc gacatggcag acattggttt tggcaacatt aatttcaagg ccttacaaca 6541 cagtaggtct gatgttagtc ttgacattgt aaatgaaact tgcaagtacc cggattttct 6601 caaaatgcaa aatgatgttt atggggatgc ttgctttttt tatgctcgta gagagcaatg 6661 ttatgccaga catttttttg ttagaggggg aaaaacaggt gatgacatac caggagcaca 6721 ggttggcaat ggtaatatga aaaatcaatt ttacattcct ggtgctacgg gtcaggctca 6781 gagcactata ggtaatgcca tgtatttccc aactgtcagt ggctcactag tctctagtga 6841 tgctcaactg tttaacaggc cattctggct ccaaagggct cagggtcata ataatggcat 6901 tctgtgggct aatcaaatgt ttgtcacagt tgtagacaac acaagaaata caaatttcag 6961 catctctgtt tactctcagg caggggacat aaaggatata caggattata atgcagacaa 7021 ttttagagag tatcaaagac atgtggagga atatgaaatt tctgtaatat tacaattgtg 7081 caaagttcct ttaaaagcag aagttttagc acaaattaat gccatgaatt cgtctctttt 7141 agaggaatgg cagttaggat ttgtgcctac tccagacaac cctattcagg atacatatag 7201 atatctagaa tctttggcca ctaggtgtcc tgaaaagtct cctccaaaag agaaggttga 7261 cccctacaaa ggtttaaact tttgggatgt cgatatgaca gagcgccttt ccctggattt 7321 agatcaatat tcattaggta gaaagttctt attccaggct ggattacagc agacgaccgt 7381 aaacggtaca aaaacaactc cttacagggg gtccatcaga ggaacaaagc gcaaacgaaa 7441 aaattgaaga tgaccgtttt cggtacagat tgtttaactt ttacacagta ttcaaggaat 7501 gtctgtttac tgtgactaag tgtaactctg ccaaagaaac aaccgcaccc ggtacacgta 7561 ttcagcttgt tgccaaaaca gataagcttg gcagtcagaa cacaccgtgt tcgtcgcaac 7621 acgctcggat taggtcttct gccaaaagaa atttaatctt gttatcgttt ttggcgatca 7681 catttggcac cgcgggcagc tgttttggca ctacaagaca accgtt // LOCUS RUBCG 9755 bp ss-RNA VRL 12-JUL-1990 DEFINITION Rubella virus complete genome encoding nonstructural protein, capsid protein, glycoproteins E1 and E2, complete cds. ACCESSION M15240 M18901 M32735 KEYWORDS C protein; glycoprotein; glycoprotein E1; glycoprotein E2; hemagglutinin. SOURCE Rubella virus (strain Therien) cDNA to genomic RNA and cDNA to mRNA, clones pRUB1025[1010,1012,1002,1006,1015,1001]. ORGANISM Rubella virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Rubivirus. REFERENCE 1 (bases 8155 to 9754) AUTHORS Frey,T.K., Marr,L.D., Hemphill,M.L. and Dominguez,G. TITLE Molecular cloning and sequencing of the region of the rubella virus genome coding for glycoprotein E1 JOURNAL Virology 154, 228-232 (1986) STANDARD full staff_review REFERENCE 2 (bases 5917 to 9754; revises [1]) AUTHORS Frey,T.K. and Marr,L.D. JOURNAL Unpublished (1987) STANDARD full staff_review REFERENCE 3 (bases 5247 to 8366) AUTHORS Frey,T.K. and Marr,L.D. TITLE Sequence of the region coding for virion proteins C and E2 and the carboxy terminus of the nonstructural proteins of rubella virus: comparison with alphaviruses JOURNAL Gene 62, 85-99 (1988) STANDARD full staff_review REFERENCE 4 (bases 1 to 9755) AUTHORS Domminguez,G., Wang,C.-Y. and Frey,T.K. TITLE Sequence of the genome RNA of rubella virus: Evidence for genetic rearrangement during togavirus evolution JOURNAL Virology 177, 225-258 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence in [2] kindly provided by T.K.Frey, 01-JUN-1987. Draft entry and computer-readable sequence for [4] kindly submitted by G.Dominguez, 09-MAR-1990, for release after publication. Glycoprotein E1 contains the viral hemagglutinin activity. Multiple copies of the C protein comprise the nucleocapsid. FEATURES from to/span description pept 39 6656 nonstructural polyprotein precursor pept 6505 9696 structural polyprotein precursor matp 6505 7404 capsid protein (C) matp 7405 8250 glycoprotein E2 matp 8251 9693 glycoprotein E1 mRNA 6428 9755 subgenomic RNA BASE COUNT 1457 a 3781 c 3007 g 1510 t ORIGIN 1 atggaagcta tcggacctcg cttaggactc ccattcccat ggagaaactc ctagatgagg 61 ttcttgcccc cggtgggcct tataacttaa ccgtcggcag ttgggtaaga gaccacgtcc 121 gatcaattgt cgagggcgcg tgggaagtgc gcgatgttgt taccgctgcc caaaagcggg 181 ccatcgtagc cgtgataccc agacctgtgt tcacgcagat gcaggtcagt gatcacccag 241 cactccacgc aatttcgcgg tatacccgcc gccattggat cgagtggggc cctaaagaag 301 ccctacacgt cctcatcgac ccaagcccgg gcctgctccg cgaggtcgct cgcgttgagc 361 gccgctgggt cgcactgtgc ctccacagga cggcacgcaa actcgccacc gccctggccg 421 agacggccag cgaggcgtgg cacgctgact acgtgtgcgc gctgcgtggc gcaccgagcg 481 gccccttcta cgtccaccct gaggacgtcc cgcacggcgg tcgcgccgtg gcggacagat 541 gcttgctcta ctacacaccc atgcagatgt gcgagctgat gcgtaccatt gacgccaccc 601 tgctcgtggc ggttgacttg tggccggtcg cccttgcggc ccacgtcggc gacgactggg 661 acgacctggg cattgcctgg catctcgacc atgacggcgg ttgccccgcc gattgccgcg 721 gagccggcgc tgggcccacg cccggctaca cccgcccctg caccacacgc atctaccaag 781 tcctgccgga caccgcccac cccgggcgcc tctaccggtg cgggccccgc ctgtggacgc 841 gcgattgcgc cgtggccgaa ctctcatggg aggttgccca acactgcggg caccaggcgc 901 gcgtgcgcgc cgtgcgatgc accctcccta tccgccacgt gcgcagcctc caacccagcg 961 cgcgggtccg actcccggac ctcgtccatc tcgccgaggt gggccggtgg cggtggttca 1021 gcctcccccg ccccgtgttc cagcgcatgc tgtcctactg caagaccctg agccccgacg 1081 cgtactacag cgagcgcgtg ttcaagttca agaacgccct gtgccacagc atcacgctcg 1141 cgggcaatgt gctgcaagag gggtggaagg gcacgtgcgc cgaggaagac gcgctgtgcg 1201 catacgtagc cttccgcgcg tggcagtcta acgccaggtt ggcggggatt atgaaaggcg 1261 cgaagtgcgc cgccgactct ttgagcgtgg ccggctggct ggacaccatt tgggacgcca 1321 ttaagcggtt cctcggtagc gtgcccctcg ccgagcgcat ggaggagtgg gaacaggacg 1381 ccgcggtcgc cgccttcgac cgcggccccc tcgaggacgg cgggcgccac ttggacaccg 1441 tgcaaccccc aaaatcgccg ccccgccctg agatcgccgc gacctggatc gtccacgcag 1501 ccagcgaaga ccgccattgc gcgtgcgctc cccgctgcga cgtcccgcgc gaacgtcctt 1561 ccgcgcccgc cggccagccg gatgacgagg cgctcatccc gccgtggctg ttcgccgagc 1621 gccgtgccct ccgctgccgc gagtgggatt tcgaggctct ccgcgcgcgc gccgatacgg 1681 cggccgcgcc cgccccgccg gctccacgcc ccgcgcggta ccccaccgtg ctctaccgcc 1741 accccgccca ccacggcccg tggctcaccc ttgacgagcc gggcgaggct gacgcggccc 1801 tggtcttatg cgacccactt ggccagccgc tccggggccc tgaacgccac ttcgccgccg 1861 gcgcgcatat gtgcgcgcag gcgcgggggc tccaggcttt tgtccgtgtc gtgcctccac 1921 ccgagcgccc ctgggccgac gggggcgcca gagcgtgggc gaagttcttc cgcggctgcg 1981 cctgggcgca gcgcttgctc ggcgagccag cagttatgca cctcccatac accgatggcg 2041 acgtgccaca gctgatcgca ctggctttgc gcacgctggc ccaacagggg gccgccttgg 2101 cactctcggt gcgtgacctg cccgggggtg cagcgttcga cgcaaacgcg gtcaccgccg 2161 ccgtgcgcgc tggcccccgc cagtccgcgg ccgcgtcacc gccacccggc gaccccccgc 2221 cgccgcgccg cgcacggcga tcgcaacggc actcggacgc tcgcggcact ccgccccccg 2281 cgcctgcgcg cgacccgccg ccgcccgccc ccagcccgcc cgcgccaccc cgcgctggtg 2341 acccggtccc tcccattccc gcggggccgg cggatcgcgc gcgtgacgcc gagctggagg 2401 tcgcctgcga gccgagcggc ccccccacgt caaccagggc agacccagac agcgacatcg 2461 ttgaaagtta cgcccgcgcc gccggacccg tgcacctccg agtccgcgac atcatggacc 2521 caccgcccgg ctgcaaggtc gtggtcaacg ccgccaacga ggggctactg gccggctctg 2581 gcgtgtgcgg tgccatcttt gccaacgcca cggcggccct cgctgcaaac tgccggcgcc 2641 tcgccccatg ccccaccggc gaggcagtgg cgacacccgg ccacggctgc gggtacaccc 2701 acatcatcca cgccgtcgcg ccgcggcgtc ctcgggaccc cgccgccctc gaggagggcg 2761 aagcgctgct cgagcgcgcc taccgcagca tcgtcgcgct agccgccgcg cgtcggtggg 2821 cgtgtgtcgc gtgccccctc ctcggcgctg gcgtctacgg ctggtctgct gcggagtccc 2881 tccgagccgc gctcgcggct acgcgcaccg agcccgtcga gcgcgtgagc ctgcacatct 2941 gccaccccga ccgcgccacg ctgacgcacg cctccgtgct cgtcggcgcg gggctcgctg 3001 ccaggcgcgt cagtcctcct ccgaccgagc ccctcgcatc ttgccccgcc ggtgacccgg 3061 gccgaccggc tcagcgcagc gcgtcgcccc cagcgacccc ccttggggat gccaccgcgc 3121 ccgagccccg cggatgccag gggtgcgaac tctgccggta cacgcgcgtc accaatgacc 3181 gcgcctatgt caacctgtgg ctcgagcgcg accgcggcgc caccagctgg gccatgcgca 3241 ttcccgaggt ggttgtctac gggccggagc acctcgccac gcattttcca ttaaaccact 3301 acagtgtgct caagcccgcg gaggtcaggc ccccgcgagg catgtgcggg agtgacatgt 3361 ggcgctgccg cggctggcat ggcatgccgc aggtgcggtg caccccctcc aacgctcacg 3421 ccgccctgtg ccgcacaggc gtgccccctc gggcgagcac gcgaggcggc gagctagacc 3481 caaacacctg ctggctccgc gccgccgcca acgttgcgca ggctgcgcgc gcctgcggcg 3541 cctacacgag tgccgggtgc cccaagtgcg cctacggccg cgccctgagc gaagcccgca 3601 ctcatgagga cttcgccgcg ctgagccagc ggtggagcgc gagccacgcc gatgcctccc 3661 ctgacggcac cggagatccc ctcgaccccc tgatggagac cgtgggatgc gcctgttcgc 3721 gcgtgtgggt cggctccgag catgaggccc cgcccgacca cctcctggtg tcccttcacc 3781 gtgccccaaa tggtccgtgg ggcgtagtgc tcgaggtgcg tgcgcgcccc gaggggggca 3841 accccaccgg ccacttcgtc tgcgcggtcg gcggcggccc acgccgcgtc tcggaccgcc 3901 cccacctctg gcttgcggtc cccctgtctc ggggcggtgg cacctgtgcc gcgaccgacg 3961 aggggctggc ccaggcgtac tacgacgacc tcgaggtgcg ccgcctcggg gatgacgcca 4021 tggcccgggc ggccctcgca tcagtccaac gccctcgcaa aggcccttac aatatcaggg 4081 tatggaacat ggccgcaggc gctggcaaga ctacccgcat cctcgctgcc ttcacgcgcg 4141 aagaccttta cgtctgcccc accaatgcgc tcctgcacga gatccaggcc aaactccgcg 4201 cgcgcgatat cgacatcaag aacgccgcca cctacgagcg ccggctgacg aaaccgctcg 4261 ccgcctaccg ccgcatctac atcgatgagg cgttcactct cggcggcgag tactgcgcgt 4321 tcgttgccag ccaaaccacc gcggaggtga tctgcgtcgg tgatcgggac cagtgcggcc 4381 cacactacgc caataactgc cgcacccccg tccctgaccg ctggcctacc gagcgctcgc 4441 gccacacttg gcgcttcccc gactgctggg cggcccgcct gcgcgcgggg ctcgattatg 4501 acatcgaggg cgagcgcacc ggcaccttcg cctgcaacct ttgggacggc cgccaggtcg 4561 accttcacct cgccttctcg cgcgaaaccg tgcgccgcct tcacgaggct ggcatacgcg 4621 catacaccgt gcgcgaggcc cagggtatga gcgtcggcac cgcctgcatc catgtaggca 4681 gagacggcac ggacgttgcc ctggcgctga cacgcgacct cgccatcgtc agcctgaccc 4741 gggcctccga cgcactctac ctccacgagc tcgaggacgg ctcactgcgc gctgcggggc 4801 tcagcgcgtt cctcgacgcc ggggcactgg cggagctcaa ggaggttccc gctggcattg 4861 accgcgttgt cgccgtcgag caggcaccac caccgttgcc gcccgccgac ggcatccccg 4921 aggcccaaga cgtgccgccc ttctgccccc gcactctgga ggagctcgtc ttcggccgtg 4981 ccggccaccc ccattacgcg gacctcaacc gcgtgactga gggcgaacga gaagtgcggt 5041 acatgcgcat ctcgcgtcac ctgctcaaca agaatcacac cgagatgccc ggaacggaac 5101 gcgttctcag tgccgtttgc gccgtgcggc gctaccgcgc gggcgaggat gggtcgaccc 5161 tccgcactgc tgtggcccgc cagcacccgc gcccttttcg ccagatccca cccccgcgcg 5221 tcactgctgg ggtcgcccag gagtggcgca tgacgtactt gcgggaacgg atcgacctca 5281 ctgatgtcta cacgcagatg ggcgtggccg cgcgggagct caccgaccgc tacgcgcgcc 5341 gctatcctga gatcttcgcc ggcatgtgta ccgcccagag cctgagcgtc cccgccttcc 5401 tcaaagccac cttgaagtgc gtagacgccg ccctcggccc cagggacacc gaggactgcc 5461 acgccgctca ggggaaagcc ggccttgaga tccgggcgtg ggccaaggag tgggttcagg 5521 ttatgtcccc gcatttccgc gcgatccaga agatcatcat gcgcgccttg cgcccgcaat 5581 tccttgtggc cgctggccat acggagcccg aggtcgatgc gtggtggcag gcccattaca 5641 ccaccaacgc catcgaggtc gacttcactg agttcgacat gaaccagacc ctcgctactc 5701 gggacgtcga gctcgagatt agcgccgctc tcttgggcct cccttgcgcc gaagactacc 5761 gcgcgctccg cgccggcagc tactgcaccc tgcgcgaact gggctccact gagaccggct 5821 gcgagcgcac aagcggcgag cccgccacgc tgctgcacaa caccaccgtg gccatgtgca 5881 tggccatgcg catggtcccc aaaggcgtgc gctgggccgg gattttccag ggtgacgata 5941 tggtcatctt cctccccgag ggcgcgcgca gcgcggcact caagtggacc cccgccgagg 6001 tgggcttgtt tggcttccac atcccggtga agcacgtgag cacccctacc cccagcttct 6061 gcgggcacgt cggcaccgcg gccggcctct tccatgatgt catgcaccag gcgatcaagg 6121 tgctttgccg ccgtttcgac ccagacgtgc ttgaagaaca gcaggtggcc ctcctcgacc 6181 gcctccgggg ggtctacgcg gctctgcctg acaccgttgc cgccaatgct gcgtactacg 6241 actacagcgc ggagcgcgtc ctcgctatcg tgcgcgaact taccgcgtac gcgggggcgc 6301 ggcctcgacc acccggccac catcggcgcg ctcgaggaga ttcagacccc ctacgcgcgc 6361 gccaatctcc acgacgccga ctaacgcccc tgtacgtggg gcctttaatc ttacctactc 6421 taaccaggtc atcacccacc gttgtttcgc cgcatctggt gggtacccaa cttttgccat 6481 tcgggagagc cccagggtgc ccgaatggct tctactaccc ccatcaccat ggaggacctc 6541 cagaaggccc tcgaggcaca atcccgcgcc ctgcgcgcgg aactcgccgc cggcgcctcg 6601 cagtcgcgcc ggccgcggcc gccgcgacag cgcgactcca gcacctccgg agatgactcc 6661 ggccgtgact ccggagggcc ccgccgccgc cgcggcaacc ggggccgtgg ccagcgcagg 6721 gactggtcca gggccccgcc ccccccggag gagcggcaag aaactcgctc ccagactccg 6781 gccccgaagc catcgcgggc gccgccacaa cagcctcaac ccccgcgcat gcaaaccggg 6841 cgtgggggct ctgccccgcg ccccgagctg gggccaccga ccaacccgtt ccaagcagcc 6901 gtggcgcgtg gcctgcgccc gcctctccac gaccctgaca ccgaggcacc caccgaggcc 6961 tgcgtgacct cgtggctttg gagcgagggc gaaggcgcgg tcttttaccg cgtcgacctg 7021 catttcacca acctgggcac ccccccactc gacgaggacg gccgctggga ccctgcgctc 7081 atgtacaacc cttgcgggcc cgagccgccc gctcacgtcg tccgcgcgta caatcaacct 7141 gccggcgacg tcaggggcgt ttggggtaaa ggcgagcgca cctacgccga gcaggacttc 7201 cgcgtcggcg gcacgcgctg gcaccgactg ctgcgcatgc cagtgcgcgg cctcgacggc 7261 gacagcgccc cgcttccccc ccacaccacc gagcgcattg agacccgctc ggcgcgccat 7321 ccttggcgca tccgcttcgg tgccccccag gccttccttg ccgggctctt gctcgccacg 7381 gtcgccgttg gcaccgcgcg cgccgggctc cagccccgcg ctgatatggc ggcacctcct 7441 acgctgccgc agcccccctg tgcgcacggg cagcattacg gccaccacca ccatcagctg 7501 ccgttcctcg ggcacgacgg ccatcatggc ggcaccttgc gcgtcggcca gcattaccga 7561 aacgccagcg acgtgctgcc cggccactgg ctccaaggcg gctggggttg ctacaacctg 7621 agcgactggc accagggcac tcatgtctgt cataccaagc acatggactt ctggtgtgtg 7681 gagcacgacc gaccgccgcc cgcgaccccg acgcctctca ccaccgcggc gaactccacg 7741 accgccgcca cccccgccac tgcgccggcc ccctgccacg ccggcctcaa tgacagctgc 7801 ggcggcttct tgtctgggtg cgggccgatg cgcctgcgcc acggcgctga cacccggtgc 7861 ggtcggttga tctgcgggct gtccaccacc gcccagtacc cgcctacccg gtttggctgc 7921 gctatgcggt ggggccttcc cccctgggaa ctggtcgtcc ttaccgcccg ccccgaagac 7981 ggctggactt gccgcggcgt gcccgcccat ccaggcgccc gctgccccga actggtgagc 8041 cccatgggac gcgcgacttg ctccccagcc tcggccctct ggctcgccac agcgaacgcg 8101 ctgtctcttg atcacgccct cgcggccttc gtcctgctgg tcccgtgggt cctgatattt 8161 atggtgtgcc gccgcgcctg tcgccgccgc ggcgccgccg ccgccctcac cgcggtcgtc 8221 ctgcaggggt acaacccccc cgcctatggc gaggaggctt tcacctacct ctgcactgca 8281 ccggggtgcg ccactcaagc acctgtcccc gtgcgcctcg ctggcgtccg ttttgagtcc 8341 aagattgtgg acggcggctg ctttgcccca tgggacctcg aggccactgg agcctgcatt 8401 tgcgagatcc ccactgatgt ctcgtgcgag ggcttggggg cctgggtacc cgcagcccct 8461 tgcgcgcgca tctggaatgg cacacagcgc gcgtgcacct tctgggctgt caacgcctac 8521 tcctctggcg ggtacgcgca gctggcctct tacttcaacc ctggcggcag ctactacaag 8581 cagtaccacc ctaccgcgtg cgaggttgaa cctgccttcg gacacagcga cgcggcctgc 8641 tggggcttcc ccaccgacac cgtgatgagc gtgttcgccc ttgctagcta cgtccagcac 8701 cctcacaaga ccgtccgggt caagttccat acagagacca ggaccgtctg gcaactctcc 8761 gttgccggcg tgtcgtgcaa cgtcaccact gaacacccgt tctgcaacac gccgcacgga 8821 caactcgagg tccaggtccc gcccgacccc ggggacctgg ttgagtacat tatgaattac 8881 accggcaatc agcagtcccg gtggggcctc gggagcccga attgccacgg ccccgattgg 8941 gcctccccgg tttgccaacg ccattcccct gactgctcgc ggcttgtggg ggccacgcca 9001 gagcgccccc ggctgcgcct ggtcgacgcc gacgaccccc tgctgcgcac tgcccctgga 9061 cccggcgagg tgtgggtcac gcctgtcata ggctctcagg cgcgcaagtg cggactccac 9121 atacgcgctg gaccgtacgg ccatgctacc gtcgaaatgc ccgagtggat ccacgcccac 9181 accaccagcg acccctggca tccaccgggc cccttggggc tgaagttcaa gacagttcgc 9241 ccggtggccc tgccacgcac gttagcgcca ccccgcaatg tgcgtgtgac cgggtgctac 9301 cagtgcggta cccccgcgct ggtggaaggc cttgcccccg ggggaggcaa ttgccatctc 9361 accgtcaatg gcgaggacct cggcgccgtc ccccctggga agttcgtcac cgccgccctc 9421 ctcaacaccc ccccgcccta ccaagtcagc tgcgggggcg agagcgatcg cgcgaccgcg 9481 cgggtcatcg accccgccgc gcaatcgttt accggcgtgg tgtatggcac acacaccact 9541 gctgtgtcgg agacccggca gacctgggcg gagtgggctg ctgcccattg gtggcagctc 9601 actctgggcg ccatttgcgc cctcccactc gctggcttac tcgcttgctg tgccaaatgc 9661 ttgtactact tgcgcggcgc tatagcgcct cgctagtggg cccccgcgcg aaacccgcac 9721 taggccacta gatccccgca cctgttgctg tatag // LOCUS XEL68KSA 2009 bp ss-mRNA VRT 12-JUL-1990 DEFINITION X.laevis 68 kDa serum albumin mRNA, complete cds. ACCESSION M18350 KEYWORDS serum albumin. SOURCE X.laevis adult liver hepatocyte (lambda-ZAP library), cDNA to mRNA, clone pX1A14. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2009) AUTHORS Moskaitis,J.E., Sargent,T.D., Smith,L.H.Jr., Pastori,R.L. and Schoenberg,D.R. TITLE Xenopus laevis serum albumin: Sequence of the cDNAs encoding the 68 and 74 kDa peptides, relationship of the 74 kDa albumin to alpha-fetoprotein, and the regulation of albumin gene expression by thyroid hormone during development JOURNAL Mol. Endocrinol. 3, 464-473 (1989) STANDARD full staff_review REFERENCE 2 (bases 1842 to 2009) AUTHORS Schoenberg,D.R., Moskaitis,J.E., Smith,L.H. and Pastori,R.L. TITLE Extranuclear estrogen-regulated destabilization of Xenopus laevis serum albumin mRNA JOURNAL Mol. Endocrinol. 3, 805-814 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Schoenberg, 14-NOV-1988. Draft entry and computer-readable sequence for [2] kindly provided by D.Schoenberg, 18-DEC-1989. FEATURES from to/span description pept 39 1859 68 kDa serum albumin precursor sigp 39 110 68 kDa serum albumin signal peptide matp 111 1856 68 kDa serum albumin signal 1986 1994 poly-A signal site 1 1 cap site BASE COUNT 699 a 388 c 393 g 529 t ORIGIN 87 bp upstream of HinfI site. 1 aggcttctca gaggtcccca cccaatacat ctccagtcat gaagtggatc accctcattt 61 gtctgttaat tagctccact ttaatagaat caagaataat tttcaaaaga gatacagatg 121 tagaccatca caagcatatt gctgacatgt acaatttatt gactgagcgg accttcaaag 181 gacttacatt ggctattgtc tcacagaatc tccagaaatg ttcattggag gagctgtcta 241 aactggtgaa tgaaattaat gactttgcca aatcctgtac aggaaacgac aaaactcctg 301 agtgtgaaaa acccataggc accctgtttt atgacaaact ctgcgcagat ccaaaagtgg 361 gtgttaatta tgagtggagc aaagagtgct gttctaagca agatccagag agagcacagt 421 gcttcagggc acatagagtt tttgaacata atccagtaag gcctaaacct gaggaaactt 481 gtgcattatt caaagaacac cctgatgatc ttctctcagc attcatacat gaagaggcga 541 gaaaccatcc agacctttat cccccagcag tactattatt aacacagcaa tatggcaaac 601 ttgttgaaca ttgttgtgaa gaagaagaca aggataaatg ctttgcagaa aagatgaagg 661 aactgatgaa acacagtcat tctattgaag ataagcaaaa acatttctgc tggattgtaa 721 ataattatcc tgaaagagtt attaaagcac taaatttggc cagagtgagc cacagatatc 781 ctaagcctga tttcaagctt gcccataaat ttaccgagga gactacacac ttcattaagg 841 attgttgtca tggggacatg tttgaatgca tgacagagag gctggagctt tctgagcata 901 cctgtcaaca taaagatgag ttatcaacaa aacttgaaaa atgctgtaac ttacctttgc 961 ttgagcgtac atactgcatt gtcaccttgg aaaatgatga cgttcctgct gaattatcaa 1021 agccaattac agaatttaca gaggaccctc atgtttgtga gaagtatgct gagaataaaa 1081 gtttcttaga gatatctcca tggcagagtc aagaaacacc agaattgtct gaacaattcc 1141 ttttgcaatc tgcaaaagaa tatgaatctt tgctgaacaa gtgctgcttt tcagacaatc 1201 ctcctgaatg ctacaaggat ggagctgaca gatttatgaa tgaagccaag gagagatttg 1261 catatttgaa acaaaactgt gatatcttgc atgaacatgg agaatatctc tttgaaaatg 1321 aattgctcat aagatacaca aagaaaatgc cccaagtgtc agatgaaaca ttgattggaa 1381 tagcacacca aatggcagat attggtgagc actgctgtgc cgtacctgaa aatcaaagga 1441 tgccatgtgc agaaggagac cttaccattc tcattggaaa aatgtgtgaa aggcaaaaga 1501 agacatttat aaataaccac gttgctcatt gctgcactga ctcatattct gggatgcgtt 1561 catgctttac tgctcttggt ccagatgagg actatgtacc acccccagtt actgatgaca 1621 catttcactt tgacgacaag atatgcactg ctaatgataa agaaaaacag catatcaaac 1681 agaaattcct tgtgaagctg attaaagtta gtcctaaatt ggaaaaaaat cacattgatg 1741 aatggctgct ggaattcctt aagatggtac agaaatgctg tactgcagat gaacaccagc 1801 catgttttga tacagagaaa ccagtactga ttgaacactg tcaaaaactc catccataag 1861 agtccataag agcaaagacc agtcttcaaa ctcactgagg aacaccttcc atctctcaaa 1921 cacaagaaaa aaaagttcct tcagctgaaa agagcatttg cttagagcat tcaactgtgt 1981 gttgtaataa ataaagcatt ttaaaaaat // LOCUS XEL74KSA 1957 bp ss-mRNA VRT 12-JUL-1990 DEFINITION Xenopus laevis 74 kDa serum albumin mRNA, complete cds. ACCESSION M21442 KEYWORDS serum albumin. SOURCE X.laevis adult liver hepatocyte, cDNA to mRNA, clone pX1A74.1. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1957) AUTHORS Moskaitis,J.E., Sargent,T.D., Smith,L.H.Jr., Pastori,R.L. and Schoenberg,D.R. TITLE Xenopus laevis serum albumin: Sequence of the complementary deoxyribonucleic acids encoding the 68- and 74-kilodalton peptides and the regulation of albumin gene expression by thyroid hormone during development JOURNAL Mol. Endocrinol. 3, 464-473 (1989) STANDARD full staff_review REFERENCE 2 (bases 1801 to 1957) AUTHORS Schoenberg,D.R., Moskaitis,J.E., Smith,L.H. and Pastori,R.L. TITLE Extranuclear estrogen-regulated destabilization of Xenopus laevis serum albumin mRNA JOURNAL Mol. Endocrinol. 3, 805-814 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Schoenberg, 14-NOV-1988. Draft entry and computer-readable sequence for [2] kindly provided by D.Schoenberg, 18-DEC-1989. FEATURES from to/span description pept < 1 1818 74 kDa serum albumin (AA at 1) sigp < 1 66 74 kDa serum albumin signal peptide matp 67 1815 74 kDa serum albumin signal 1935 1944 poly-A signal BASE COUNT 667 a 383 c 382 g 525 t ORIGIN 1 tggatcaccc tgatttgtct gttaattagc tcctctttca ttgaatcaag gatacttttc 61 aaaagagata cagatgcaga ccatcacaag catattgctg atgtatacac cgcattgact 121 gagcggacct tcaaaggact tacattggct attgtctctc agaatctcca gaaatgttcg 181 ttggaggagt tatctaagct ggtgaatgaa ataaatgact ttgccaaatc ctgtattaat 241 gacaaaactc ctgagtgtga aaaaccagtg ggcaccctgt tttttgacaa actctgtgca 301 gatccagcag tgggtgttaa ttatgagtgg agcaaagagt gctgtgccaa gcaagatcca 361 gagagggctc agtgcttcaa ggcgcacaga gatcatgaac atacttcaat aaagcctgaa 421 cctgaggaaa cctgcaaatt actcaaagaa caccctgatg atcttctctc agcgttcatt 481 catgaagagg caagaaacca tccagacctt tatccaccag cagtattagc attaaccaag 541 caatatcaca aacttgctga acattgttgt gaagaagaag acaaggaaaa atgcttctca 601 gaaaagatga agcaacttat gaaacaatct cattccattg aagataagca acatcatttc 661 tgctggattc tggataattt tcctgaaaaa gttcttaaag cactaaattt ggccagagtg 721 agccacagat atcctaaagc tgaattcaag cttgcccata attttactga ggaggttaca 781 cactttatta aagattgttg ccatgacgac atgtttgaat gcatgactga gaggctggag 841 cttactgagc atacctgtca acataaagat gagttatcat caaaacttga aaaatgctgt 901 aatatacctt tgcttgagcg tacatactgc attgtcacct tggaaaatga tgacgttcct 961 gctgaattgt ctcagccaat tacagaattt acagaggacc ctcatgtgtg tgagaagtat 1021 gctgagaata acgaagtttt cttaggaaga tatctccatg ctgtgtcaag aaaacaccag 1081 gaattgtctg aacaattcct tttgcaatct gcaaaagaat atgaatcttt gctgaacaag 1141 tgctgcaaaa cagacaatcc tcctgaatgc tacaaggatg gagctgacag atttatgaat 1201 gaagccaagg agagatttgc atatttgaaa caaaactgtg atatcttgca tgaacatgga 1261 gaatatctct ttgaaaatga attgctcata agatacacaa agaaaatgcc ccaagtgtca 1321 gatgaaacat tgattggaat agcacaccaa atggcagata ttggtgagca ctgctgtgcc 1381 gtacctgaaa atcaaaggat gccatgtgca gaaggagacc ttaccattct cattggaaaa 1441 atgtgtgaaa ggcaaaagaa gacatttata aataaccacg ttgctcattg ctgcactgac 1501 tcatattctg ggatgcgttc atgctttact gctcttggtc cagatgagga ctatgtacca 1561 cccccagtta ctgatgacac atttcacttt gacgacaaga tatgcactgc taatgataaa 1621 gaaaaacagc atatcaaaca gaaattcctt gtgaagctga ttaaagttag tcctaaattg 1681 gaaaaaaatc acattgatga atgttctgct gaattcctta agatggtaca gaaatgctgt 1741 actgcagatg aacaccagcc atgttttgat acagagaaac cagtactgat tgaacactgt 1801 caaaaactcc atccataaga gtccattaga gcaaaggcca gccttcaaac tcactgagga 1861 acatcttcca tctctcacat gaaaaaagtt tcctccatct gaaaagaaaa tttgttcatt 1921 caactgtctg ttgaaataaa taaagcgttt aaaatat // LOCUS MUSHOX28A 216 bp ds-DNA ROD 12-JUL-1990 DEFINITION Mouse homeobox protein gene Hox-2.8, partial cds. ACCESSION M34004 KEYWORDS homeobox protein. SOURCE Mouse (strain CBA) DNA, clone YNOTHOX-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 216) AUTHORS Rubock,M.J., Larin,Z., Cook,M., Papalopulu,N., Krumlauf,R. and Lehrach,H. TITLE A yeast artificial chromosome containing the mouse homeobox cluster Hox-2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4751-4755 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Krumlauf, 04-MAY-1990, for release after publication. FEATURES from to/span description pept < 1 > 216 Hox-2.8 homeobox protein (AA at 1) site 34 216 homeobox Hox-2 BASE COUNT 51 a 68 c 66 g 31 t ORIGIN Chromosome 11D. 1 ggccccggat tgccagaatg cggcggcagc ggctcccgca gactgcgcac ggcctacacc 61 aacacgcaac tgctggagct ggagaaggag ttccacttca ataagtacct gtgccggccg 121 cgtcgcgtcg agatcgctgc cttgctggac ctcaccgaaa ggcaggtcaa agtctggttc 181 cagaaccgac gcatgaaaca caagcggcag acggag // LOCUS MUSHOX29A 183 bp ds-DNA ROD 12-JUL-1990 DEFINITION Mouse homeobox protein gene Hox-2.9, partial cds. ACCESSION M34005 KEYWORDS homeobox protein. SOURCE Mouse (strain CBA) DNA, clone YNOTHOX-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 183) AUTHORS Rubock,M.J., Larin,Z., Cook,M., Papalopulu,N., Krumlauf,R. and Lehrach,H. TITLE A yeast artificial chromosome containing the mouse homeobox cluster Hox-2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4751-4755 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Krumlauf, 04-MAY-1990, for release after publication. FEATURES from to/span description pept < 1 > 183 Hox-2.9 homeobox protein (AA at 1) site 1 183 homeobox Hox-2 BASE COUNT 50 a 53 c 54 g 26 t ORIGIN Chromosome 11D. 1 cccggcggtc tccgcacaaa cttcaccacg cgccagctga cggagctgga gaaggaattt 61 catttcaaca aatacctgag ccgtgcccgg agggtggaga tcgccgccac cctggagctc 121 aatgaaacgc aggtgaagat ctggttccag aaccggcgca tgaagcagaa gaaacgcgag 181 cga // LOCUS BCCIPMD 1101 bp ds-DNA BCT 12-JUL-1990 DEFINITION B.coagulans 3-isopropylmalate dehydrogenase gene, complete cds. ACCESSION M33099 KEYWORDS 3-isopropylmalate dehydrogenase. SOURCE B.coagulans (ATCC 7051) DNA. ORGANISM Bacillus coagulans Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1101) AUTHORS Sekiguchi,T., Ortega-Cesena,J., Nosoh,Y., Ohashi,S., Tsuda,K. and Kanaya,S. TITLE DNA and amino-acid sequences of 3-isopropylmalate dehydrogenase of Bacillus coagulans. Comparison with the enzymes of Saccharomyces cerevisiae and Thermus thermophilus JOURNAL Biochim. Biophys. Acta 867, 36-44 (1986) STANDARD simple staff_review FEATURES from to/span description pept 1 1101 3-isopropylmalate dehydrogenase BASE COUNT 288 a 255 c 328 g 230 t ORIGIN 1 atgaaaatga aactggccgt actgcccggc gatgggatcg ggccggaagt gatggatgca 61 gcgatccgcg ttttaaaaac agtgttggac aatgacgggc atgaagccgt ttttgaaaat 121 gcgctgattg ggggcgccgc cattgatgaa gcggggacgc ccctaccgga agaaacgctt 181 gacatttgcc gcaggagcga tgccattttg ctcggcgcgg taggggggcc gaaatgggat 241 cataacccgg cttccctccg cccggaaaaa ggcctgctcg ggctccggaa agaaatgggg 301 ctgtttgcga acctgcgccc ggttaaagca tatgccacac ttttaaacgc atcgccttta 361 aaacgggaac gtgtggaaaa cgtcgatctt gttattgtcc gcgaactgac gggcggcctc 421 tattttgggc gcccgagtga aaggcgcggg ccgggcgaga atgaagtggt agacacgctt 481 gcctatacaa gggaagagat tgaaagaatt attgagaaag cattccagct tgcccaaatc 541 agaagaaaaa aactggcatc cgtcgataag gcgaatgtgc tggaatcaag cagaatgtgg 601 cgcgaaattg cggaagaaac cgcgaaaaag tatccggacg tggaattgag ccatatgctt 661 gtcgactcaa cttcgatgca gctgattgca aatccgggcc aatttgatgt cattgtaaca 721 gagaatatgt tcggcgatat tttaagcgat gaagcgtccg tgattaccgg cagcctcggc 781 atgttgccat ccgcaagcct ccgttccgac cggttcggca tgtatgaacc ggtccacggc 841 tccgcgccgg atattgccgg gcagggaaaa gccaacccgc tcgggacagt gctgtcagcg 901 gctttgatgc tccgttattc gttcgggctt gagaaagaag cggcggccat tgaaaaagca 961 gtggatgatg tgcttcaaga cggctattgt acaggcgatt tgcaggtggc aaacggaaaa 1021 gtggtcagta caattgagct cacagaccgg ctgatcgaaa aattaaataa cagcgcagcc 1081 ggtccgcgca tttttcaata a // LOCUS DROSGS3A 151 bp ds-DNA SYN 12-JUL-1990 DEFINITION D.melanogaster synthetic Sgs-3 glue protein gene/Adh gene, 5' flank. ACCESSION M34726 KEYWORDS alcohol dehydrogenase; glue protein. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 151) AUTHORS Roark,M., Raghavan,K.V., Todo,T., Mayeda,C.A. and Meyerowitz,E.M. TITLE Cooperative enhancement at the Drosophila Sgs-3 locus JOURNAL Dev. Biol. 139, 121-133 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 131 > 151 synthetic Sgs-3 glue protein mRNA recomb 130 131 Adh DNA end/Sgs-3 synthetic DNA start BASE COUNT 51 a 25 c 36 g 39 t ORIGIN 1 gtcgacccaa aagtatcaaa caaaggggag aaggcttgtg tttgcataat cgaaatactg 61 actccatttt tagaattgca gtttcagtga aagcgtacct ataaaaaggt gaggtatccg 121 caagaaaagt atcagtttgt ggtaccgagc t // LOCUS MZESOD3A 1037 bp ss-mRNA PLN 12-JUL-1990 DEFINITION Z.mays manganese superoxide dismutase (SOD-3) mRNA, complete cds. ACCESSION M33119 KEYWORDS manganese superoxide dismutase; superoxide dismutase. SOURCE Z.mays (strain W64A), cDNA to mRNA, clone pSod3.1c. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1037) AUTHORS White,J.A. and Scandalios,J.G. TITLE Isolation and characterization of a cDNA for mitochondrial manganese superoxide dismutase (SOD-3) of maize and its relation to other manganese superoxide dismutases JOURNAL Biochim. Biophys. Acta 951, 61-70 (1988) STANDARD simple staff_review FEATURES from to/span description pept 46 753 manganese superoxide dismutase (SOD-3) (EC 1.15.1.1) BASE COUNT 237 a 259 c 296 g 245 t ORIGIN 1 gaattccacg cacccaggag atacagcgag cgagcgacca aagccatggc tctccgcacc 61 ctggcatcga agaaggtcct atccttcccg ttcggcggcg cgggccggcc gttggcggcg 121 gcggcgtctg cgaggggggt gacgacggtc acactccccg acctctccta cgacttcggc 181 gcgctggaac cggccatctc gggggagatc atgcgcttgc accaccaaaa gcaccacgcc 241 acctacgtcg ccaactacaa caaggcgctg gagcagcttg aaactgccgt ctccaagggc 301 gacgcctccg ctgtcgtcca gctgcaggcg gcgatcaagt tcaacggcgg cggtcatgtg 361 aaccattcaa tcttctggaa gaacctcaag cccattagcg aaggtggcgg ggagccgcct 421 catgggaaac ttggctgggc catcgatgag gattttggtt cgtttgaggc acttgtaaag 481 aagatgaatg cagaaggcgc tgctttccaa gggtctggat gggtgtggtt agctttggat 541 aaagaggcaa aaaaggtttc agttgaaaca acagctaatc aggatcctct ggtgactaaa 601 ggtgcaagct tggttccgct gttggggatt gatgtctggg aacatgcata ctacctgcag 661 tacaagaatg ttaggccgga ttacctgaac aacatctgga aggtgatgaa ctggaaatat 721 gctggagagg tgtacgaaaa tgttcttgct tgaattgtct taacggacaa tacacatctg 781 cgcgcgcggg tttcggctgt ttgatcatgt gaaataaaga tggacctgtc tagcggctgg 841 accttgtgta catttcactg agatagacta atggacggcc tgccgatttt gttcgtcctg 901 cttgcgtgct actctgtctc tgctcctagt ttttggcatc atgtttatgt tgagcaaggt 961 gatgcccaag ggaagccatt cccactcttg tctccattaa taaaatcagc tgagcttccg 1021 atgtttgctt ggaattc // LOCUS RATA2UGLBA 300 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat alpha-2u-globulin gene, 5' end. ACCESSION M33213 KEYWORDS alpha-2u-globulin. SOURCE Rat DNA, clone 91. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Kurtz,D.T., McCullough,L., Bishop,D.K. and Manos,M.M. TITLE DNA sequences required for hormonal induction of rat alpha-2u-globulin genes JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 985-988 (1983) STANDARD simple staff_review FEATURES from to/span description pept 292 > 300 alpha-2u-globulin mRNA 233 > 300 alpha-2u-globulin mRNA BASE COUNT 108 a 58 c 76 g 58 t ORIGIN 1 acccactaat ttttcgtggg aatatgtttt gcgaaatgta tgagtgatag aatcaatcca 61 taggagatga catcgccaag tttcaaaagg gcaggaacaa tcgtggcttc acatcagtac 121 atggaaaaca ttccacaaag cctgagaaga atggaaggcc catatgagaa ggaaaaaaaa 181 acaccgaaac ccagagagag tataaagacg agcaaagtgc tggaggtgga gtgtgggcac 241 catcagcaga gggattgtcc cgacagagag gcaattctat tccctaccaa catgaagctg // LOCUS SHRRGBA 409 bp ds-DNA INV 12-JUL-1990 DEFINITION Brine shrimp 5.8S ribosomal RNA gene. ACCESSION M33097 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SOURCE Brine shrimp DNA. ORGANISM Artemia sp. Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Branchiopoda; Sarsostraca; Anostraca; Artimiidae. REFERENCE 1 (bases 1 to 409) AUTHORS Vaughn,J.C., Sperbeck,S.J. and Hughes,M.J. TITLE Molecular cloning and characterization of ribosomal RNA genes from the brine shrimp: Nucleotide sequence analysis and evolution of the 5.8 S rRNA gene region and its flanking nucleotides JOURNAL Biochim. Biophys. Acta 783, 144-151 (1984) STANDARD simple staff_review FEATURES from to/span description rRNA 177 339 5.8S ribosomal RNA BASE COUNT 104 a 82 c 116 g 107 t ORIGIN 1 ggtgaaaaat agtcatattg gggacgagag tggcttcttg tgattcaagg atcatggata 61 ccactccgcg agactaaagg gagtgaaggt gagcttgccc caacagagca tggcttgagg 121 tgtgcaaggg tgcaattgca ttggccttgt ttgagggaga atttgaaaca ttcaatagaa 181 tgacccttga ggatggatca cttggctcac attacgaaga cgaacgcagc tagacgcgtg 241 attccatgcg aactgcagga cacatggaac gtctatattt tgaacgcaaa ttgcatgtcc 301 agcctttgag cttggactac gtctggctga gagacggatg tttttatcat tcggtcatct 361 gggtataccg tcactgcgag gctccttgct tctatagggc cgttgatcg // LOCUS GLATSAA 3053 bp ss-mRNA INV 12-JUL-1990 DEFINITION G.lamblia trophozoite surface antigen (TSA 417) mRNA, complete cds. ACCESSION M33641 KEYWORDS major surface protein; trophozoite surface antigen. SOURCE G.lamblia (strain WB, ATCC 30957), cDNA to mRNA, clone pFDG417. ORGANISM Giardia lamblia Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae. REFERENCE 1 (bases 1 to 3053) AUTHORS Gillin,F.D., Hagblom,P., Harwood,J., Aley,S.B., Reiner,D.S., McCaffery,M., So,M. and Guiney,D. TITLE Isolation and expression of the gene for a major surface protein of Giardia lamblia JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4463-4467 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by S.B.Aley, 08-APR-1990. FEATURES from to/span description pept 205 2346 trophozoite surface antigen protein precursor (TSA 417) sigp 205 255 trophozoite surface antigen protein signal peptide matp 256 2343 trophozoite surface antigen binding 188 193 ribosomal binding site (put.) signal 67 72 TATA box signal 2359 2364 misc. signal signal 2831 2837 polyA signal BASE COUNT 785 a 777 c 865 g 626 t ORIGIN 1 gaattcttac gctatgtacg gcttatattg acaggattgc tacaggctat gaatactatg 61 ctagagtata aacatgtatc cacggcgatc tgggggtctt ctcggagact agtggccagt 121 taccatggac acgcaagaag ctgtctgtgg tagcctggcc ccgggctttg cgttggaagc 181 gccacccagc aggtcggcgg cctaatgttc ggcagatttt tgctcgcgat cgtcatcctt 241 cagctggcac ggacagcctg cacccaagaa gctgacgatg gaaagtgtaa aacgtgtggc 301 gtcaccattg gtcaagacac ttggtgctct gagtgcaacg gagcaaacta cgcccccgtg 361 aacggccagt gtgtagacgt caacgctgag gggccaagca aaacgctttg tccgcaacat 421 agcgcaggga agtgcacgca gtgcggaggc aactcattca tgtacaagga cggctgttat 481 tccagcggag aaggccttcc tggacacagc ctgtgcttaa gttccgacgg agatggcgta 541 tgcaccgagg cggccccggg gtactttgct ccggtgggag cggcgaacac tgaacagtct 601 gtgatcgcat gtggcgatac aactggagta acaatagcag ctggcggaaa cacatacaag 661 ggcattgctg actgcgcaga atgcagcgcc cctgacgcaa cagccggcgc tgaggccggc 721 aaggttgcaa cgtgtaccaa gtgtggagtc agtaagtatc tcaaggataa cgtgtgcgta 781 gataaagccc aatgtaattc tggtagcact aataagttcg ttgcagttga tgattctgag 841 aatggcaaca agtgtgtttc ttgcagcgat aacctcaatg gtggcgttgc caattgcgac 901 acctgtagct acgatgagca atctaagaag atcaagtgta caaaatgcac cgataacaac 961 tacctgaaaa ccacaagcga aggcacgtcg tgcgtacaaa aagaccaatg caaagacggc 1021 ttcttcccca aggatgacag cagtgcagga aataaatgcc tcccttgtaa tgacagcacc 1081 gacggaattg ccaattgcgc cacgtgtgct ctggttagtg gccgatcagg ggctgccctc 1141 gttacatgct ccgcctgcac ggatggatac aagcctagtg ccgacaaaac tacgtgcgag 1201 gcggtaagca actgcaagac ccccggatgc aaggcgtgca gcaacgaagg aaaggagaac 1261 gaggtctgca cagactgtga tggtagcaca tacctcacgc cgacaagcca gtgcatagac 1321 agctgcgcta agattggaaa ctactatgga gccaccgaag gagcaaagaa actctgtaaa 1381 gagtgcactg cggctaactg caagacttgc gatgatcagg ggcagtgcca agcatgcaac 1441 gacgggttct ataaaaacgg cgacgcgtgc tctccgtgcc acgaaagctg caagacatgc 1501 agcgcaggca ctgccagcga ctgcaccgag tgtcccaccg gaaaagcact caggtacggg 1561 gacgacggta ctaagggcac gtgcggagaa ggctgcacaa cgggcacagg agcaggagca 1621 tgcaagacgt gtgggctcac tatcgatggc gctagctact gctctgagtg cgccacaacg 1681 acagaatatc ctcaaaatgg cgtctgtgca ccaaaggcta gccgcgccac acctacgtgc 1741 aacgactcgc ctattcagaa tggtgtttgt ggaacgtgtg ccgataacta ctttaagatg 1801 aacggagggt gctatgaaac agtcaagtat cccggtaaga cggtttgcat tagtgcacca 1861 aatggtggta cgtgtcaaaa agctgcagat ggttacaagt tggattcagg tacccttaca 1921 gtttgttctg aagggtgtaa ggaatgtgct agcagtaccg actgtactac gtgtctggac 1981 ggatatgtaa agagtgcaag tgcgtgcaca aagtgtgacg ctagctgcga aacatgtaat 2041 ggagcagcta caacatgtaa ggcgtgtgct acgggatact acaagaccgc atcaggagaa 2101 ggtgcgtgca cgtcttgtga aagtgatagc aacggagtca ctggtattaa gggctgccta 2161 aactgcgccc ctccgcccaa caataaaggt tccgtcctct gctacctcat aaaggatagc 2221 ggtagcacca acaagagcgg gctctccact ggtgccatag cgggtatctc cgtcgctgtc 2281 atcgttgttg tcggcggcct catcggcttc ctctgctggt ggttcctctg cagggggaag 2341 gcgtagatgt acttagatag taaaccgtca tcgatgggtc tgctcggtgt ctgttcctgc 2401 tagcacagac agcagggtct cagccagtgc accaagcatc aggcgtgtgg atgaatgttt 2461 ggcttatcca gtagcgccct tgcgtgtcca cgggctcaca tgtgaccaac agtgctgtac 2521 aggtaggtag agaccagacc acggatccca tgcactgaat gcaactcctt tgcagccgtg 2581 atgggtcagt tgtggcaatt tataagacaa aacgagggcc ctgtccatcg cacagtccct 2641 tgcagcgctt ccagacgcgg agctggcggc ggtcctgcac tacctcgccg agttccgtgg 2701 gccagaggtc ttcggggact gccttcagac cttgctcagc tcgacgagcc ccgggacgag 2761 accctcaggc ttgcggcaca gaaagacata cgcggcttcc tcgaaaggat cgacagaaag 2821 gactcagctc aataaatgcc actcttcacg tcctcgttcc gcggtacatc gtgtagctgt 2881 acatccagtg gaacttttcg actcagaagt gaagttgatg agctctgtgg tagataactt 2941 ctcatggcct ggagtggcag acactgtgag cagctgattg gcatgcaatt cacaccctag 3001 acgcggtgga gagatacccc cgtccatcca ttacaaacaa gtcccacaag ctt // LOCUS HUMPLG01 1272 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 1. ACCESSION M33272 J05286 KEYWORDS plasminogen. SEGMENT 1 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1272) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept 1077 + 1125 plasminogen precursor, exon 1 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" sigp 1077 + 1125 plasminogen signal peptide pre-msg 957 > 1272 PLG mRNA and introns IVS 1126 > 1272 PLG intron A BASE COUNT 391 a 231 c 269 g 381 t ORIGIN 1 gaattccgca gacattccac ccaagaccat tgggctccca cctctactct tttgccagtt 61 aatgaatagg caggaatttc actgcctgga aagaggaaca atgctttctg gtccttattt 121 cacatctaaa atagagaggt caattgattt attcctaaat atctttgaac actaaaatag 181 aagttttaca gcatatatac tacctggttg ctctagactt aagccaggga aaagtacaga 241 ttcaacattt aaaattgaga tagacgcttt ccacttaatg ctaccagtct tgctttattt 301 catgagaatg agaatataat aatatggcat acgttcattt gggggaaaga ttgatgtctt 361 ataacataat ttataattac agaaaacatg tgagttcact gggaataaat aaattttgaa 421 gataataaga tactttcact tatgtcataa tttctatgtc atttggtgta ggatgtagag 481 atattaacgt ttacacctaa ctcaagtttg tcatctaaga cctgaaaggg ttttgtctat 541 cagctgcacc cctgggtaga gacacaacct tggggaaggc ctcagcccca tccctcgtac 601 agcaggaatg agaacagccc tgcctgttgg gaagcttgag ggaggctatg gacgtgcagc 661 gcttggcaga aggtctcgtc atggaaggtt ccagcaaatg tgagatactt ttatgatttc 721 attttctcca aaagaaaggg aataagagaa gaggggagga aataagacta attgcgagag 781 ataaagtaca agggtgaggg aaggaataag gagacatgac ggcagcgtgg agcagccgag 841 gggggagatt gctttcacca cttcccagca tctattgcag attccaccct caaacatttt 901 gtaaggactc tttattcaag gtaacgtttg aaccctgctg agccagtggc atgggtctct 961 gagagaatca ttaacttaat ttgactatct ggtttgtgga tgcgtttact ctcatgtaag 1021 tcaacaacat cctgggattg ggacccactt tctgggcact gctggccagt cccaaaatgg 1081 aacataagga agtggttctt ctacttcttt tatttctgaa atcaggtaag acatagtttt 1141 tttaaattat aataattatt ttttctccca caatgtagta aaaatacata tgccatggct 1201 ttatgtgcaa ttcatttaat ttttgattca tgaaacttcc agttgaaaat cttgtataag 1261 attgaggaat tc // LOCUS HUMPLG02 161 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron A (partial). ACCESSION M33273 J05286 KEYWORDS plasminogen. SEGMENT 2 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 161) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 161 plasminogen intron A /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 52 a 46 c 20 g 43 t ORIGIN Unknown number of base pairs after segment 1. 1 gaattcaccc atttaggcat acaatccaat ggatttcaag atattgagag ttgtgcagcc 61 accatcagaa taaattttaa aactattcat acccccaaaa acgcactcca ctctccttag 121 ctgttacccc aatctgcagc ttctggcaac cactaatcta c // LOCUS HUMPLG03 376 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 2. ACCESSION M33274 J05286 KEYWORDS plasminogen. SEGMENT 3 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 376) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 196 + 331 plasminogen (PLG) precursor, exon 2 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" sigp 196 203 plasminogen signal peptide matp 204 + 331 plasminogen IVS < 1 195 PLG intron A IVS 332 > 376 PLG intron B BASE COUNT 114 a 74 c 78 g 110 t ORIGIN Unknown number of base pairs after segment 2. 1 tctttattta tgtccaaatg cccgactgtg tgttcttaac taaacatttt gattcatagc 61 tacccattct acttccagta aacagaaagt tttatttggt taatgctaac caaatagatt 121 aaaaggaagt catgacaatt agacattgac attgatttac tgaccattta ttccacttgg 181 atctcccacc tctaggtcaa ggagagcctc tggatgacta tgtgaatacc cagggggctt 241 cactgttcag tgtcactaag aagcagctgg gagcaggaag tatagaagaa tgtgcagcaa 301 aatgtgagga ggacgaagaa ttcacctgca ggtatttcca ttgtcgttgc acctacgcag 361 gaatctgtaa ttcaga // LOCUS HUMPLG04 291 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 3. ACCESSION M33275 J05286 KEYWORDS plasminogen. SEGMENT 4 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 291) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 46 + 152 plasminogen (PLG) precursor, exon 3 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 46 + 152 plasminogen IVS < 1 45 PLG intron B IVS 153 > 291 PLG intron C BASE COUNT 88 a 60 c 51 g 92 t ORIGIN Unknown number of base pairs after segment 3. 1 taaataaaga aaaatactta ttggatttcc tgcttcgttc tgcagggcat tccaatatca 61 cagtaaagag caacaatgtg tgataatggc tgaaaacagg aagtcctcca taatcattag 121 gatgagagat gtagttttat ttgaaaagaa aggtgagtac attttcttcc tcctcctcct 181 actgtcctcc ccatcctccc actcttcctc tttctctatt ctatctttaa tttatgagac 241 cagaggagga aggcactatc gtgttataaa actgaattct gagttaggac a // LOCUS HUMPLG05 69 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron C (partial). ACCESSION M33276 J05286 KEYWORDS plasminogen. SEGMENT 5 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 69) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 69 plasminogen intron C /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 26 a 9 c 13 g 21 t ORIGIN Unknown number of base pairs after segment 4. 1 aagtgcagat taaatctaaa ctttatctgg tgaagttatt agttcttaca agtagcaagc 61 aaacggtaa // LOCUS HUMPLG06 57 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron C (partial). ACCESSION M33277 J05286 KEYWORDS plasminogen. SEGMENT 6 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 57) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 57 plasminogen intron C /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 18 a 12 c 7 g 20 t ORIGIN Unknown number of base pairs after segment 5. 1 agtgcaacat ctacaataat tactttcctt atttttgaag tggaccatat ctcgaca // LOCUS HUMPLG07 341 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 4. ACCESSION M33278 J05286 KEYWORDS plasminogen. SEGMENT 7 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 341) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 167 + 281 plasminogen (PLG) precursor, exon 4 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 167 + 281 plasminogen IVS < 1 166 PLG intron C IVS 282 > 341 PLG intron D BASE COUNT 89 a 73 c 77 g 102 t ORIGIN Unknown number of base pairs after segment 6. 1 tggctcagtt tactgcagcc tttttgcaga tgcaaaagat gatcttttag aaagcagaaa 61 cagggggtct ggtgcatgag atctttttct caacgtgact atgctgtgca gaccttcatg 121 tggtgtcttg tgaaagactt tgaccactgt gtggacttcc cttcagtgta tctctcagag 181 tgcaagactg ggaatggaaa gaattacaga gggacgatgt ccaaaacaaa aaatggcatc 241 acctgtcaaa aatggagttc cacttctccc cacagaccta ggtaagacat tccctttcat 301 ctttgtgttc atctactgta aagttgtccc tctgtgtctg t // LOCUS HUMPLG08 354 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 5. ACCESSION M33279 J05286 KEYWORDS plasminogen. SEGMENT 8 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 354) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 69 + 208 plasminogen (PLG) precursor, exon 5 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 69 + 208 plasminogen IVS < 1 68 PLG intron D IVS 209 > 354 PLG intron E BASE COUNT 101 a 83 c 72 g 98 t ORIGIN Unknown number of base pairs after segment 7. 1 ttctgccttg ctaatagcaa gctgattttt agaatatagt ctaagtgctt cttttccatc 61 ctccccagat tctcacctgc tacacacccc tcagagggac tggaggagaa ctactgcagg 121 aatccagaca acgatccgca ggggccctgg tgctatacta ctgatccaga aaagagatat 181 gactactgcg acattcttga gtgtgaaggt caggagtggt tctagaaaat gttttcattt 241 ctgcccttca cctgtaaaat aatttgttgt aaagcccctt cccacaggga tgttattaat 301 aattgagtaa cgtattcacc tctgggaaag aagcaaaacc ccagaattaa cctg // LOCUS HUMPLG09 206 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 6. ACCESSION M33280 J05286 KEYWORDS plasminogen. SEGMENT 9 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 206) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 56 + 176 plasminogen (PLG) precursor, exon 6 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 56 + 176 plasminogen IVS < 1 55 PLG intron E IVS 177 > 206 PLG intron F BASE COUNT 51 a 57 c 39 g 59 t ORIGIN Unknown number of base pairs after segment 8. 1 ttcatccatt tcagttttct tcttcctctc tgtccttcct tcccactctg tccagaggaa 61 tgtatgcatt gcagtggaga aaactatgac ggcaaaattt ccaagaccat gtctggactg 121 gaatgccagg cctgggactc tcagagccca cacgctcatg gatacattcc ttccaagtaa 181 gtctcactgg gaaaaacatt ccatgt // LOCUS HUMPLG10 100 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron F (partial). ACCESSION M33281 J05286 KEYWORDS plasminogen. SEGMENT 10 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 100 plasminogen intron F /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 29 a 17 c 26 g 28 t ORIGIN Unknown number of base pairs after segment 9. 1 ccaaaatgat aaggtcactg attctgttga gtgattttta cacatgtaaa ctgttagaaa 61 aacagtgctt ggcagccggg catggtggca catgctgtag // LOCUS HUMPLG11 247 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 7. ACCESSION M33282 J05286 KEYWORDS plasminogen. SEGMENT 11 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 247) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 68 + 186 plasminogen (PLG) precursor, exon 7 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 68 + 186 plasminogen IVS < 1 67 PLG intron F IVS 187 > 247 PLG intron G BASE COUNT 70 a 63 c 51 g 63 t ORIGIN Unknown number of base pairs after segment 10. 1 cttgaaaaag agtcttatcc atgaatgtaa atgttcagtg ctactaaaat ctttcttgtc 61 cattcagatt tccaaacaag aacctgaaga agaattactg tcgtaacccc gatagggagc 121 tgcggccttg gtgtttcacc accgacccca acaagcgctg ggaactttgc gacatccccc 181 gctgcagtga gtatgatgca cacccagatt ccaggatttg gacctgccct gttcttgaaa 241 tcaaaag // LOCUS HUMPLG12 244 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 8. ACCESSION M33283 J05286 KEYWORDS plasminogen. SEGMENT 12 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 244) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 47 + 209 plasminogen (PLG) precursor, exon 8 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 47 + 209 plasminogen IVS < 1 46 PLG intron G IVS 210 > 244 PLG intron H BASE COUNT 68 a 72 c 45 g 59 t ORIGIN Unknown number of base pairs after segment 11. 1 ctcaaaaaat atatatattc attgtaactt attttgccca ttcaagcaac acctccacca 61 tcttctggtc ccacctacca gtgtctgaag ggaacaggtg aaaactatcg cgggaatgtg 121 gctgttaccg tgtccgggca cacctgtcag cactggagtg cacagacccc tcacacacat 181 aacaggacac cagaaaactt tccctgcaag taagtcccct ccagtctcat tctgctgcta 241 tgga // LOCUS HUMPLG13 217 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 9. ACCESSION M33284 J05286 KEYWORDS plasminogen. SEGMENT 13 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 217) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 36 + 181 plasminogen (PLG) precursor, exon 9 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 36 + 181 plasminogen IVS < 1 35 PLG intron H IVS 182 > 217 PLG intron I BASE COUNT 64 a 56 c 52 g 45 t ORIGIN Unknown number of base pairs after segment 12. 1 ttggaaagct aaactcacaa tcacttcttt ttcagaaatt tggatgaaaa ctactgccgc 61 aatcctgacg gaaaaagggc cccatggtgc catacaacca acagccaagt gcggtgggag 121 tactgtaaga taccgtcctg tgactcctcc ccagtatcca cggaacaatt ggctcccaca 181 ggtaagcaag ggtatgggag cttactgagg gcccaag // LOCUS HUMPLG14 409 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 10. ACCESSION M33285 J05286 KEYWORDS plasminogen. SEGMENT 14 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 409) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 132 + 291 plasminogen (PLG) precursor, exon 10 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 132 + 291 plasminogen IVS < 1 131 PLG intron I IVS 292 > 409 PLG intron J BASE COUNT 116 a 102 c 73 g 118 t ORIGIN Unknown number of base pairs after segment 13. 1 tctgtctgct aatacagaaa agagaacagt cataattctc agaggctacc gtactgtttt 61 tgtcataaat tgcttcatgc ttcttttttt tcagtaattg ttaagcttga tttcttttat 121 tttaatttca gcaccacctg agctaacccc tgtggtccag gactgctacc atggtgatgg 181 acagagctac cgaggcacat cctccaccac caccacagga aagaagtgtc agtcttggtc 241 atctatgaca ccacaccggc accagaagac cccagaaaac tacccaaatg cgtatgtctt 301 tgatttttac tgtaagaggg gcatcagcca actgaaattt ctgttaaaag agccatgctt 361 catgcttcaa gccaacttcc taggaccaaa tttctcttag acccagaat // LOCUS HUMPLG15 266 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 11. ACCESSION M33286 J05286 KEYWORDS plasminogen. SEGMENT 15 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 266) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 60 + 241 plasminogen (PLG) precursor, exon 11 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 60 + 241 plasminogen IVS < 1 59 PLG intron J IVS 242 > 266 PLG intron K BASE COUNT 66 a 68 c 65 g 67 t ORIGIN Unknown number of base pairs after segment 14. 1 ctgggtgccc ctgaatattc tcccacctct tgtgacctgt attgttttgg aatttccagt 61 ggcctgacaa tgaactactg caggaatcca gatgccgata aaggcccctg gtgttttacc 121 acagacccca gcgtcaggtg ggagtactgc aacctgaaaa aatgctcagg aacagaagcg 181 agtgttgtag cacctccgcc tgttgtcctg cttccaaatg tagagactcc ttccgaagaa 241 ggtaagaaat ctgtggctgg acatct // LOCUS HUMPLG16 224 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 12. ACCESSION M33287 J05286 KEYWORDS plasminogen. SEGMENT 16 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 224) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 26 + 174 plasminogen (PLG) precursor, exon 12 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 26 + 174 plasminogen IVS < 1 25 PLG intron K IVS 175 > 224 PLG intron L BASE COUNT 57 a 56 c 56 g 55 t ORIGIN Unknown number of base pairs after segment 15. 1 aatcatccat tttttccctg tacagactgt atgtttggga atgggaaagg ataccgaggc 61 aagagggcga ccactgttac tgggacgcca tgccaggact gggctgccca ggagccccat 121 agacacagca ttttcactcc agagacaaat ccacgggcgg gtctggaaaa aaatgtaagc 181 cactttgatt tggactcttt ggccttttgc tcaccaatct ttgc // LOCUS HUMPLG17 223 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 13. ACCESSION M33288 J05286 KEYWORDS plasminogen. SEGMENT 17 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 223) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 31 + 124 plasminogen (PLG) precursor, exon 13 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 31 + 124 plasminogen IVS < 1 30 PLG intron L IVS 125 > 223 PLG intron M BASE COUNT 56 a 44 c 58 g 65 t ORIGIN Unknown number of base pairs after segment 16. 1 gctggagctt acatgccttc ttgttttcag tactgccgta accctgatgg tgatgtaggt 61 ggtccctggt gctacacgac aaatccaaga aaactttacg actactgtga tgtccctcag 121 tgtggtaggt tgccttcttt ttggtaagga aactgcttac ttaatatgga tttgcaacaa 181 aaaaggaaaa gggcttctga gcagactgct tctggggagg aga // LOCUS HUMPLG18 296 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 14. ACCESSION M33289 J05286 KEYWORDS plasminogen. SEGMENT 18 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 296) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 126 + 246 plasminogen (PLG) precursor, exon 14 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 126 + 246 plasminogen IVS < 1 125 PLG intron M IVS 247 > 296 PLG intron N BASE COUNT 74 a 72 c 69 g 81 t ORIGIN Unknown number of base pairs after segment 17. 1 atgattttac tatttagttc ggcctttaag atgtcaaaaa ctcagtgctt ggaatttgtc 61 tcgaattaca ccacaaaatt gctaccttgt ctcaaatggg atttctttcc caccttgtgc 121 cacagcggcc ccttcatttg attgtgggaa gcctcaagtg gagccgaaga aatgtcctgg 181 aagggttgta ggggggtgtg tggcccaccc acattcctgg ccctggcaag tcagtcttag 241 aacaaggtaa gaacaggccc agaaacgatt tatactgtcc ctccacgtaa gccctg // LOCUS HUMPLG19 361 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 15. ACCESSION M33290 J05286 KEYWORDS plasminogen. SEGMENT 19 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 361) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 66 + 140 plasminogen (PLG) precursor, exon 15 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 66 + 140 plasminogen IVS < 1 65 PLG intron N IVS 141 > 361 PLG intron O BASE COUNT 93 a 77 c 80 g 111 t ORIGIN Unknown number of base pairs after segment 18. 1 ttctgtacaa tggagcagaa caaagtatca atttaactaa aatttgaact aaatcctctt 61 tccaggtttg gaatgcactt ctgtggaggc accttgatat ccccagagtg ggtgttgact 121 gctgcccact gcttggagaa gtatgtttag gggacaattg acatgaagtc ttgtcttaaa 181 tactttttct gtccttcttt tcctcctttc ctcctttcct ttctcactct tcctcccttc 241 cttctctggc tgtgacacta gggaccaggc cagggcaatt ggataagaga gaagggaagg 301 gtttctagaa agaaactgca gaggaaagac acagtacaga tgattttgtg ggcctgaata 361 a // LOCUS HUMPLG20 331 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 16. ACCESSION M34272 J05286 KEYWORDS plasminogen. SEGMENT 20 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 331) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 41 + 181 plasminogen (PLG) precursor, exon 16 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 41 + 181 plasminogen IVS < 1 40 PLG intron O IVS 182 > 331 PLG intron P BASE COUNT 80 a 81 c 79 g 91 t ORIGIN Unknown number of base pairs after segment 19. 1 ctggaccata ttttcctctt gacatcctca tcttttctag gtccccaagg ccttcatcct 61 acaaggtcat cctgggtgca caccaagaag tgaatctcga accgcatgtt caggaaatag 121 aagtgtctag gctgttcttg gagcccacac gaaaagatat tgccttgcta aagctaagca 181 ggtactcgtt cacctgtggt cttcacccca cgctggtgaa gatatttgct ttatgtctgg 241 gttttatggg ccatggcact gcatggcagt ggggaggaac tgtctatcac atgaaaggct 301 caagggcttt ggggacagca tcaatcttca a // LOCUS HUMPLG21 251 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 17. ACCESSION M34273 J05286 KEYWORDS plasminogen. SEGMENT 21 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 251) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 41 + 147 plasminogen (PLG) precursor, exon 17 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 41 + 147 plasminogen IVS < 1 40 PLG intron P IVS 148 > 251 PLG intron Q BASE COUNT 71 a 61 c 51 g 68 t ORIGIN Unknown number of base pairs after segment 20. 1 gcagagcagt caaacataac tgctgatgct tttctttcag tcctgccgtc atcactgaca 61 aagtaatccc agcttgtctg ccatccccaa attatgtggt cgctgaccgg accgaatgtt 121 tcatcactgg ctggggagaa acccaaggtg agataaattc cattgcccac ataacgaatt 181 ggttttgacc tacagtccat gtgacaaaat gatcattttg gagaaagctg tgcaaattcc 241 tatccatgaa t // LOCUS HUMPLG22 101 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, intron Q (partial). ACCESSION M34274 J05286 KEYWORDS plasminogen. SEGMENT 22 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 101) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review FEATURES from to/span description IVS < 1 > 101 plasminogen intron Q /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" BASE COUNT 24 a 34 c 24 g 19 t ORIGIN Unknown number of base pairs after segment 21. 1 agaagggtgc tccctcacac aactacagca gtccaggtga tgcacccact gcccaatgct 61 tggtagtcaa gaggagcttc ctccctgcag ctctgcccag a // LOCUS HUMPLG23 254 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 18. ACCESSION M34275 J05286 KEYWORDS plasminogen. SEGMENT 23 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 254) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 69 + 214 plasminogen (PLG) precursor, exon 18 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 69 + 214 plasminogen IVS < 1 68 PLG intron Q IVS 215 > 254 PLG intron R BASE COUNT 61 a 54 c 63 g 76 t ORIGIN Unknown number of base pairs after segment 22. 1 tgttctggaa tatcctcctg aatgtgtttt gggtgcagtt gccatttctt tcatcttttt 61 aaacacaggt acttttggag ctggccttct caaggaagcc cagctccctg tgattgagaa 121 taaagtgtgc aatcgctatg agtttctgaa tggaagagtc caatccaccg aactctgtgc 181 tgggcatttg gccggaggca ctgacagttg ccaggtaagc aaagatcaag agaccaaagt 241 tagtcttgtg ctct // LOCUS HUMPLG24 1236 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human plasminogen gene, exon 19. ACCESSION M34276 J05286 KEYWORDS plasminogen. SEGMENT 24 of 24 SOURCE Human leukocyte and lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1236) AUTHORS Petersen,T.E., Martzen,M.R., Ichinose,A. and Davie,E.W. TITLE Characterization of the gene for human plasminogen, a key proenzyme in the fibrinolytic system JOURNAL J. Biol. Chem. 265, 6104-6111 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ichinose, 26-MAR-1990, for release after publication. FEATURES from to/span description pept + 41 202 plasminogen (PLG) precursor, exon 19 /hgml_locus_uid="LW0013Z" /nomgen="PLG" /map="6q26-q27" matp + 41 199 plasminogen pre-msg < 1 427 PLG mRNA and introns (alt.) pre-msg < 1 458 PLG mRNA and introns (alt.) pre-msg < 1 1184 PLG mRNA and introns (alt.) IVS < 1 40 PLG intron R BASE COUNT 365 a 233 c 297 g 341 t ORIGIN Unknown number of base pairs after segment 23. 1 agcctaaccc tcacatgcat ttttctctcc ctctgtatag ggtgacagtg gagggcctct 61 ggtttgcttc gagaaggaca aatacatttt acaaggagtc acttcttggg gtcttggctg 121 tgcacgcccc aataagcctg gtgtctatgt tcgtgtttca aggtttgtta cttggattga 181 gggagtgatg agaaataatt aattggacgg gagacagagt gacgcactga ctcacctaga 241 ggctgggacg tgggtaggga tttagcatgc tggaaataac tggcagtaat caaacgaaga 301 cactgtcccc agctaccagc tacgccaaac ctcggcattt tttgtgttat tttctgactg 361 ctggattctg tagtaaggtg acatagctat gacatttgtt aaaaataaac tctgtactta 421 actttgattt gagtaaattt tggttttggt cttcaacatt ttcatgctct ttgttcaccc 481 caccaatttt aaatgggcag atggggggat ttagctgctt ttgataagga acagctgcac 541 aaaggactga gcaggctgca aggtcacaga ggggagagcc aagaagttgt ccacgcattt 601 acctcatcag ctaacgaggg cttgacatgc atttttactg tctttattcc tgacactgag 661 atgaatgttt tcaaagctgc aacatgcatg gggagtcatg cgaaccgatt ctgttattgg 721 gaatgaaatc tgtcaccgac tgcttgactt gagcccaggg gacacagagc agagagctgt 781 atatgatgga gtgaaccggt ccatggatgt gtaacacaag accaactgag agtctgaatg 841 ttattctggg gcacacgtga gtctaggatt ggtgccaaga gcatgtaaat gaacaacaag 901 caaatattga aggtggacca cttatttccc attgctaatt gcctgcccgg ttttgaaaca 961 gtctgcagta cacacggtga caggagaatg acctgtggga gagatacatg tttagaagga 1021 agagaaagga caaaggcaca cgttttacca tttaaaatat tgttaccaaa caaaaatatc 1081 cattcaaaat acaatttaac aatgcaacag tcatcttaca gcagagaaat gcagagaaaa 1141 gcaaaactgc aagtgactgt gaataaaggg tgaatgtagt ctcaaatcct caaagagctg 1201 tgtttatttc attgacaaat agattatttg tattca // LOCUS PARGANTI1 162 bp ds-DNA INV 12-JUL-1990 DEFINITION P.primaurelia G surface antigen gene, 5' end. ACCESSION M11194 KEYWORDS G surface antigen; surface antigen. SEGMENT 1 of 2 SOURCE P.primaurelia macronucleus DNA. ORGANISM Paramecium primaurelia Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae. REFERENCE 1 (bases 1 to 162) AUTHORS Meyer,E., Caron,F. and Baroin,A. TITLE Macronuclear structure of the G surface antigen gene of Paramecium primaurelia and direct expression of its repeated epitopes in Escherichia coli JOURNAL Mol. Cell. Biol. 5, 2414-2422 (1985) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by F.Caron, 24-OCT-1985. FEATURES from to/span description pept 19 > 162 G surface antigen BASE COUNT 55 a 23 c 25 g 59 t ORIGIN 1 tgaattttaa tacttttaat gaataataaa ttcatcatat tctcattgtt gcttgcttta 61 gtagcaagtc aaacatacag tttaacatca tgcacatgtg cataattgtt atcagaagga 121 gattgcatca aaaatgtttc acttggatgt tcatgggata ca // LOCUS PARGANTI2 798 bp ds-DNA INV 12-JUL-1990 DEFINITION P.primaurelia G surface antigen gene, partial cds. ACCESSION M11193 KEYWORDS G surface antigen; surface antigen. SEGMENT 2 of 2 SOURCE P.primaurelia macronucleus DNA. ORGANISM Paramecium primaurelia Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae. REFERENCE 1 (bases 1 to 798) AUTHORS Meyer,E., Caron,F. and Baroin,A. TITLE Macronuclear structure of the G surface antigen gene of Paramecium primaurelia and direct expression of its repeated epitopes in Escherichia coli JOURNAL Mol. Cell. Biol. 5, 2414-2422 (1985) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by F.Caron, 24-OCT-1985. FEATURES from to/span description pept < 1 > 798 G surface antigen (AA at 1) rpt 1 222 direct repeat 1 rpt 223 444 direct repeat 2 rpt 445 666 direct repeat 3 rpt 667 > 798 direct repeat 4 BASE COUNT 262 a 153 c 177 g 206 t ORIGIN 1 tgtgcttcaa ttactggaac aggattaacc actgctattt gtggaactta tgatgcaggt 61 tgtgtggcaa atgttaacgg aacagcttgt taagaaaaat tagcaacatg tgatttgtat 121 ttaactcaaa actcttgttc tacctcggca gctgcagcaa cagcagataa atgtgcatgg 181 agtggaaccg cttgccttgc agttacaact gttggtaccc attgtgctta tgttactgga 241 actggactta ctgatttaat atgtgcagca tataatgcaa attgtacagc taataaagct 301 ggaacagcat gtcaggagaa aaaggctact tgcaatttat acacaacaga agccacctgt 361 tcaacatcag cagctgcagc aacagcagat aaatgcgcat ggagtggagc agcttgcctt 421 gcagtaacaa ctgttgctac agagtgtgct tatgttactg gaactggact tactgattta 481 atatgtgcag catataatgc aaattgtaca gctaataaag ctggaacagc atgtcaggag 541 aaaaaggcta cttgcaattt atacacaaca gaagccacct gttcaacatc agcagctgca 601 gcaacagcag ataaatgcgc atggagtgga gcagcttgcc ttgcagtaac aactgttgct 661 acagagtgtg cttatgttac tggaactgga ctaacaaatg caatatgtgc agcatataat 721 gcaaattgta cagctaataa agctggaaca gcatgtcagg agaaaaaggc tacttgcaat 781 ttatacacaa cagaagcc // LOCUS BOVCASA 1123 bp ss-mRNA MAM 12-JUL-1990 DEFINITION Bovine alpha-s1-casein mRNA, complete cds. ACCESSION M33123 KEYWORDS alpha-s1-casein. SOURCE Bovine (strain Holstein) lactating mammary gland, cDNA to mRNA, clone p-alpha-s1 C228. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1123) AUTHORS Nagao,M., Maki,M., Sasaki,R. and Chiba,R. TITLE Isolation and sequence analysis of bovine alpha-s1-casein cDNA clone JOURNAL Agric. Biol. Chem. 48, 1663-1667 (1984) STANDARD simple staff_review FEATURES from to/span description pept 64 708 alpha-s1-casein precursor sigp 64 108 alpha-s1-casein signal peptide matp 109 705 alpha-s1-casein mRNA < 1 1123 alpha-s1-casein mRNA BASE COUNT 331 a 247 c 223 g 322 t ORIGIN 1 tcacttcgac catcaaccca gcttgctgtt cttcccagtc ttgggttcaa gatcttgaca 61 accatgaaac ttctcatcct tacctgtctt gtggctgttg ctcttgccag gcccaaacat 121 cctatcaagc accaaggact ccctcaagaa gtcctcaatg aaaatttact caggtttttt 181 gtggcacctt ttccagaagt gtttggaaag gagaaggtca atgaactgag caaggatatt 241 gggagtgaat caactgagga tcaagccatg gaagatatta agcaaatgga agctgaaagc 301 atttcgtcaa gtgaggaaat tgttcccaat agtgttgagc agaagcacat tcaaaaggaa 361 gatgtgccct ctgagcgtta cctgggttat ctggaacagc ttctcagact gaaaaaatac 421 aaagtacccc agctggaaat tgttcccaat agtgctgagg aacgacttca cagtatgaaa 481 gagggaatcc atgcccaaca gaaagaacct atgataggag tgaatcagga actggcctac 541 ttctaccctg agcttttcag acaattctac cagctggatg cctatccatc tggtgcttgg 601 tattacgttc cactaggcac acaatacact gatgccccat cattctctga catccctaat 661 cccattggct ctgagaacag tgaaaagact actatgccac tgtggtgaag agtcaagtga 721 attctgaggg actccacagt tatggtcttt gatgggtctg aaaattccat gctctacatg 781 tcgcctcatc tacatgtcaa accattcatc caaaggcttc aactgctgtt ttagaacagg 841 gcaatctcaa actgaggcac tccttgatgc tctactgtat tttagatagt gtaacatcct 901 taagtgaaat tgtcctaaca gcttgttacc taaattccag tagtatcatg ctggtataaa 961 ggccactgag tcaaagggaa ttaaagtctt cattaaattt ctgtatggaa aatgttttaa 1021 aagcctttga atcacttctc ctgtaagtgc catcatatca aataattgtg tgcattaact 1081 gagattttgt ctttcttctt ttcaataaat tacattttaa ggc // LOCUS BPHINTXIS 1741 bp ds-DNA PHG 12-JUL-1990 DEFINITION Bacteriophage phi-11 integrase (int) and excisionase (xis) genes, complete cds. ACCESSION M34832 KEYWORDS excisionase; integrase. SOURCE Bacteriophage phi-11 DNA. ORGANISM Bacteriophage phi-11 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1741) AUTHORS Ye,Z.-H., Buranen,S.L. and Lee,C.Y. TITLE Sequence analysis and comparison of int and xis genes from Staphylococcal bacteriophages l54a and phi-11 JOURNAL J. Bacteriol. 172, 2568-2575 (1990) STANDARD simple staff_review FEATURES from to/span description pept 267 67 (c) excisionase (xis) pept 379 1425 integrase (int) BASE COUNT 650 a 264 c 263 g 564 t ORIGIN 1 cctatgccag caccagtgaa actctattat gcatggtatt aaaatcgaag agtacaattc 61 gataattcaa acattatttg acgaaatagc taagctgtct aatgtatata agtctcttaa 121 taaacagtaa gcaaaatcgg attcttcatt acataccgaa tattcatcat aaacactgac 181 tgcatcttct aagacatttt ttaaaattct aatgtcttca ttcgttaaaa ctaattcatt 241 gaaattatga ttgtttttaa atgtcataac atcacctact ttttatttta ttatatcaca 301 tttagtacct agtactaaat ttcgggtagc ccgcctaccc ttattatttt ttgccaattt 361 tgaggaggga gaagcaaaat gccagtatat aaggatgata atacaggtaa atggtatttt 421 tccattagat ataaagatgt atacggtaat aacaaacgaa aaatgaagcg tgggtttgaa 481 cgtaagaaag atgccaaact agctgaaagc gaatttatac aaaatgttaa atatggatac 541 tcggacaatc aaccctttga atatatattt tttgatcgtt taaaaaatga aaatctttct 601 gcacgctcaa tagaaaagcg aactacagaa tataatactc acataaaaga aaggttcgga 661 aatatcccta ttggcaaaat cactactacg caatgtactg ctttcaggaa ttatttgtta 721 aacgatgcag gtctttctgt tgactatgca cgatctgtgt gggcaggttt taaagcagtt 781 atcaattacg ccaaaaagca ttacaagctc ttatacgacc ccacattatc ggtaactcct 841 attcccagaa caaaaccaca agctaaattt atcactcgtg aagaatttga tgaaaaagta 901 gaacaaatca caaatgatac ttctcgtcag ctaactagac tgttatttta ttctggtctt 961 agaataggag aagctttagc tttgcagtgg aaagattacg ataaaataaa aggcgaaatt 1021 gacgtaaata agaaaatcaa tttaagtaat agaaaaattg aatataatct aaaaaaagaa 1081 agctctaaag ggataatacc tgtaccaaat ttaattagag agatgcttaa aaacatgtat 1141 aatgaatctt ctaaaagata taaatatttt gacgaaaact attttatatt cgggggttta 1201 gaacctatta gatacgttac ttattcgtat cattttaaat ctgtattccc gaatctaaaa 1261 atacaccatt taagacactc gtacgctagc tatttaatta ataatggtgt agatatgtat 1321 ttattaatgg aattaatgag gcattctaac attacagaaa caattcaaac gtactctcat 1381 ttatatactg ataaaaaaca tcaagctatg agcatatttg attaaacggt atcaaattgg 1441 tatcaaataa caattaagga gtttataaaa tgcgtaataa caagcctaaa ataagtattc 1501 aaaacgaccc atgggaagtg aaatttatat acatttaaat ttcatgagac aataaacgtt 1561 gatttaatgc gtttttttgc cttttttatt ttccttattt tttctgtttt acaacaaaat 1621 ggtatcaaaa atggtatcat ttgtagttat tttagcttca catattaaaa caaccacact 1681 cctaaattaa taggtggtgt ggttttgttg gttgtgtggg gataaaaata accgcatcag 1741 t // LOCUS BSTNPRAS 3510 bp ds-DNA BCT 12-JUL-1990 DEFINITION B.stearothermophilus neutral protease (nprS), and transcriptional activator (nprA) genes, complete cds. ACCESSION M34237 KEYWORDS neutral protease; transcriptional activator. SOURCE B.stearothermophilus (strain TELNE) DNA, clone pSP53. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3510) AUTHORS Nishiya,Y. and Imanaka,T. TITLE Cloning and nucleotide sequences of the neutral protease gene and its transcriptional activator gene from Bacillus stearothermophilus JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Nishiya, 11-MAY-1990. FEATURES from to/span description pept 181 1401 transcriptional activator (nprA) pept 1750 3405 neutral protease (nprS) precursor sigp 1570 2274 neutral protease (nprS) signal and propeptide matp 2275 3219 neutral protease (nprS) BASE COUNT 1177 a 581 c 737 g 1015 t ORIGIN 1 tacggtcttc agacatttct attcctatag cccaaatgag tagttccttt tggaggagaa 61 aatgtgtata atttttagta aatttatatt agtaaaaaat taagaaggag taggtattat 121 ttgaagattg gtgatcgctt aaaattttcc cgtatcaaac ataagttaac gcaagaggaa 181 gtggctgacg gaattatttc cgtatcatat ttatcaaaaa ttgaaaacaa tcaagtggtt 241 ccaagtgaag aagtgcttcg cctcctttgt caacggttgg gaatcaacaa tatcctgaaa 301 aatagacaag atgaattaac aagtaaattg ttattatggt acaaaacgat tacggataaa 361 aaccgacagg aagcagcccg gatgtacgag gaaatcaaac gaactttcga tgacgtccag 421 ggggcggaat ccatcgctta ctttctgttg tttgaaatgc gctatcactt gttattaaaa 481 gatattcata ctgtcgaagc gttgttgatc aaattaaggg aattgtatga cacctttgat 541 gatgtgatga agtattatta ttataaattt ttaggtctac tttactattg caaggaaaaa 601 tatgaagatg ctttggaata ttataaaaag gcggagcagc gatttcgaag ccaatcattt 661 gaaaaatggg aagaagctga tttgcattat ttactagcgc ttgtttatag ccggctctgg 721 agaatattag gctgtattaa ctatgcgcag catgctttag cgatttacca atccgaatac 781 gatttaaagc gaagcgctga atgccacatt ttacttggta tttgttacag aaggtacgga 841 gaagtagatc aagcgatcga atgctattca ttggcccata aaattgccca aatcattaat 901 gataccgaat tattaggtac gattgagcat aacctaggct acttaatgtc aatgaaacat 961 gagcattatg aagccattca gcattataag aagagtttgc tgtataagcg aaactcttca 1021 ttacaagcta gatttattac gttgttttct ctcatcaaag aatattatgt ttccaaaaac 1081 tataaaaaag cattagccaa tgtagaggaa agtttgcagc ttctcaagag ggaaaaagat 1141 gggatgacaa cgtattatga atattatctt catttcacag tttatcaata tttactatca 1201 gaagatattt cggaaaatga atttgaaaca tttatgaaag atcgagtgct cccttatttt 1261 caaaggttta aaaaatatga agatgttgca caatacgctg aatacttggc aatctattac 1321 gagaaacgtc ataagtataa actagcaagc aaattctata aaatgagtta tcaatttcta 1381 aaaaatatga taaatattta ggagggattt ttttgaaaaa gcttttatta ggaatcatga 1441 cgtttggtat tatgagttta cttgttctca ttggtagtga ccaagaacca aaatatgtgg 1501 caaaagacga acatccgcct ccaaccatca tcattgcagc gaaagatgaa catccaccag 1561 caacgattat ttgaagagga ataagcaaaa agacagctag ttttctagct gtcttttttc 1621 atgcatagga aaatgtgaaa aaaacgtagg gaattatcaa ctatatcaga ctctattttt 1681 cccaatacaa aatactgtaa aatattgtgt ttaatattct aaatacaaag aataaaggag 1741 gatgaaaaaa tgaaaaggaa aatgaaaatg aaattagtac gttttggtct tgcagcagga 1801 ctagcggccc aagtattttt tttaccttac aatgcgctgg cttcaacgga acacgttaca 1861 tggaaccaac aatttcaaac ccctcaattc atctccggtg atctgctgaa agtgaatggc 1921 acatccccag aagaactcgt ctatcaatat gttgaaaaaa acgaaaacaa gtttaaattt 1981 catgaaaacg ctaaggatac tctacaattg aaagaaaaga aaaatgataa ccttggtttt 2041 acgtttatgc gcttccaaca aacgtataaa gggattcctg tgtttggagc agtagtaact 2101 gcgcacgtga aagatggcac gctgacggcg ctatcaggga cactgattcc gaatttggac 2161 acgaaaggat ccttaaaaag cgggaagaaa ttgagtgaga aacaagcgcg tgacattgct 2221 gaaaaagatt tagtggcaaa tgtaacaaag gaagtaccgg aatatgaaca gggaaaagac 2281 accgagtttg ttgtttatgt caatggggac gaggcttctt tagcgtacgt tgtcaattta 2341 aactttttaa ctcctgaacc aggaaactgg ctgtatatca ttgatgccgt agacggaaaa 2401 attttaaata aatttaacca acttgacgcc gcaaaaccag gtgatgtgaa gtcgataaca 2461 ggaacatcaa ctgtcggagt gggaagagga gtacttggtg atcaaaaaaa tattaataca 2521 acctactcta cgtactacta tttacaagat aatacgcgtg gaaatgggat tttcacgtat 2581 gatgcgaaat accgtacgac attgccggga agcttatggg cagatgcaga taaccaattt 2641 tttgcgagct atgatgctcc agcggttgat gctcattatt acgctggtgt gacatatgac 2701 tactataaaa atgttcataa ccgtctcagt tacgacggaa ataatgcagc tattagatca 2761 tccgttcatt atagccaagg ctataataac gcattttgga acggttcgca aatggtgtat 2821 ggcgatggtg atggtcaaac atttattcca ctttctggtg gtattgatgt ggtcgcacat 2881 gagttaacgc atgcggtaac cgattataca gccggactca tttatcaaaa cgaatctggt 2941 gcaattaatg aggcaatatc tgatattttt ggaacgttag tcgaatttta cgctaacaaa 3001 aatccagatt gggaaattgg agaggatgtg tatacacctg gtatttcagg ggattcgctc 3061 cgttcgatgt ccgatccggc aaagtatggt gatccagatc actattcaaa gcgctataca 3121 ggcacgcaag ataatggcgg ggttcatatc aatagcggaa ttatcaacaa agccgcttat 3181 ttgattagcc aaggcggtac gcattacggt gtgagtgttg tcggaatcgg acgcgataaa 3241 ttggggaaaa ttttctatcg tgcattaacg caatatttaa caccaacgtc caactttagc 3301 caacttcgtg ctgccgctgt tcaatcagcc actgacttgt acggttcgac aagccaggaa 3361 gtcgcttctg tgaagcaggc ctttgatgcg gtaggggtga aataaagtgg tatctcatca 3421 gtgggggatt ttttcctcca ctgatgtttt gtttgtgatc ttttaatgat gtattggggt 3481 gcaaaatgcc caaaggctta taatgttgat // LOCUS HSEGP14 3347 bp ds-DNA VRL 12-JUL-1990 DEFINITION Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete cds. ACCESSION M34861 KEYWORDS glycoprotein 14. SOURCE Equine herpesvirus type 1 DNA. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 3347) AUTHORS Guo,P. TITLE Characterization of the gene and an antigenic determinant of equine herpesvirus type-1 glycoprotein 14 with homology to gB-equivalent glycoproteins of other herpesviruses JOURNAL Gene 87, 249-255 (1990) STANDARD simple staff_review FEATURES from to/span description pept 300 3239 glycoprotein 14 (gp14) BASE COUNT 885 a 891 c 851 g 720 t ORIGIN 1 tacaacggtt gaaacgtggt gtacgcatct caagagacta gctcgtttat gataactgcg 61 gctaaaggtg aattggtcaa ttagcgaagt ttcaaaggtt ttattgcttt gaagggagtg 121 acaggtgtga cggccacgca gcggctggcg tgaaatatat cggggagctc atcctagccg 181 ccgcagtatt ctcctcggtt ttccactgtg gagaggtgcc tcctgcgcgc agatcgtacc 241 tacccggact ccgcgccaca gtgctgcgtg agcggcattt acataaccta cgaggcgtca 301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg 361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct 421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa 481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct 541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt 601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac 661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc 721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg 781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct 841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat 901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga 961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata 1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt 1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg 1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg 1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt 1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc 1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag 1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact 1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg 1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tcttacttta 1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag 1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta 1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc 1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg 1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca 1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa 1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg 1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac 2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg 2101 cgtggtgtcc cctccaaaac aaagagcggc ccctatggaa cgaaatggtg aagattaccc 2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg 2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc 2281 gctcgatgga cagtaacact tgctactccc gcccccccgt aacatttaca attactaaga 2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg 2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag 2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg 2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc 2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac 2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg 2701 acaataccgc agtgattatc aggggatcgc cagctttttc aagggcctgg gtaaagtggg 2761 ggaggccgtg ggaacgctcg ttctcggcgc gcggcgctgt tgtttcaacc gtatctggaa 2821 tagcttgctt tttaaacaac ccatttgggg ggctagccat cggcctgctg gtaatcgccg 2881 gcctggtagc tgcgtttttt gcttacagat atgtaatgca gatccgcagt aaccccatga 2941 aagctctata ccccataaca acaaaggcct tgaaaaacaa agccaaaact tcctacggcc 3001 agaacgagga ggacgatggg agcgactttg atgaggccaa gcttgaagag gctcgcgaaa 3061 tgatcaaata catgtctatg gtttcggccc tggaaaagca ggaaaagaaa gctataaaga 3121 aaaacagtgg ggttggcctg atcgccagta acgtctcaaa gctggccctg cgaaggcgcg 3181 gtcccaaata tacccgactc caacagaacg ataccatgga aaatgaaaaa atggtttaaa 3241 catgtttaat aaatattatg acacgtactc aaagtgtgac ctcatatttg cataaccact 3301 tctagttccg gcccaaggat atttaagcct agtatctccg ccgaagg // LOCUS HUMHBGAA 1227 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human A-gamma-globin gene, 3' end. ACCESSION M33200 KEYWORDS A-gamma-globin. SOURCE Human (hereditary persistence of fetal hemoglobin individual II-1) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1227) AUTHORS Gelinas,R.E., Rixon,M., Magis,W. and Stamatoyannopoulos,G. TITLE Gamma gene promoter and enhancer structure in Seattle variant of hereditary persistence of fetal hemoglobin JOURNAL Blood 71, 1108-1112 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 3 A-gamma-globin (AA at 1) /hgml_locus_uid="LK0092S" /nomgen="HBG1" /map="11p15.5" mut 794 794 t in wt; c in mutant mut 970 970 c in wt; a in mutant mut 1186 1186 a in wt; g in mutant BASE COUNT 366 a 204 c 294 g 363 t ORIGIN 1 tgagcctctt gcccatgatt cagagctttc aaggataggc tttattctgc aagcaataca 61 aataataaat ctattctgct gagagatcac acatgatttt cttcagctct tttttttaca 121 tctttttaaa tatatgagcc acaaagggtt tatattgagg gaagtgtgta tgtgtatttc 181 tgcatgcctg tttgtgtttg tggtgtgtgc atgctcctca tttattttta tatgagatgt 241 gcattttgtt gagcaaataa aagcagtaaa gacacttgta cacgggagtt ctgcaagtgg 301 gagtaaatgg tgtaggagaa atccggtggg aagaaagacc tctataggac aggacttctc 361 agaaacagat gttttggaag agatgggaaa aggttcagtg aagacctggg ggctggattg 421 attgcagctg agtagcaagg atggttctta atgaagggaa agtgttccaa gctttaggaa 481 ttcaaggttt agtcaggtgt agcaattcta ttttattagg aggaatacta tttctaatgg 541 cacttagctt ttcacagccc ttgtggatgc ctaagaaagt gaaattaatc ccatgccctc 601 aagtgtgcag attggtcaca gcatttcaag ggagagacct cattgtaaga ctctggggga 661 ggtggggact taggtgtaag aaatgaatca gcagaggctc acaagtcagc atgagcatgt 721 tatgtctgag aaacagacca gcactgtgag atcaaaatgt agtgggaaga atttgtacaa 781 cattaattgg aaggtttact taatggaatt tttgtatagt tggatgttag tgcatctcta 841 taagtaagag tttaatatga tggtgttacg gacctaatgt ttgtgtctcc tcaaaattca 901 catgctgaat ccccaactcc caactgacct tatctgtggg ggaggctttt gaaaagtaat 961 taggtttagc tgagctcata agagcagatc cccatcataa aattattttc cttatcagaa 1021 gcagagagac aagccatttc tctttcctcc cggtgaggac acagtgagaa gtccgccatc 1081 tgcaatccag gaagagaacc ctgaccacga gtcagccttc agaaatgtga gaaaaaactc 1141 tgttgttgaa gccacccagt cttttgtatt ttgttatagc accttacact gagtaaggca 1201 gatgaagaag gagaaaaaaa taagctt // LOCUS HUMHBQ1A 1114 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human theta-1-globin gene, complete cds. ACCESSION M33022 KEYWORDS theta-1-globin. SOURCE Human black female with alpha-thal-2 heterozygosity white cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1114) AUTHORS Gonzalez-Redondo,J.M., Han,I.S., Gu,Y.-C. and Huisman,T.H.J. TITLE Nucleotide sequence of the human theta-1-globin gene JOURNAL Biochem. Genet. 26, 207-211 (1988) STANDARD simple staff_review FEATURES from to/span description pept 359 453 theta-1-globin, exon 1 /hgml_locus_uid="LV0155X" /nomgen="HBQ1" /map="16p13.3" 538 742 theta-1-globin, exon 2 852 980 theta-1-globin, exon 3 IVS 454 537 theta-1-globin intron A IVS 743 851 theta-1-globin intron B BASE COUNT 166 a 386 c 393 g 169 t ORIGIN 1 atcccagtta ctcgggaggc tgaggcagga gaatcgtttg aacccgggag gcggaggttg 61 cagtgagccg gaatggcgcc actgcactca ccgcacccgg ccaatttttg tgtttttagt 121 agagactaaa taccatatag tgaacaccta agacgggggg ccttggatcc agggcgattc 181 agagggcccc ggtcggagct gtcggagatt gagcgcgcgc ggtcccggga tctccgacga 241 ggccctggac ccccgggcgg cgaagctgcg gcgcggcgcc ccctggaggc cgcgggaccc 301 ctggccggtc cgcgcaggcg cagcggggtc gcagggcgcg gcgggttcca gcggggggat 361 ggcgctgtcc gcggaggacc gggcgctggt gcgcgccctg tggaagaagc tgggcagcaa 421 cgtcggcgtc tacacgacag aggccctgga aaggtgcggc aggctgggcg cccccgcccc 481 caggggccct ccctccccaa gccccccgga cgcgcctcac ccacgttcct ctcgcaggac 541 cttcctggct ttccccgcca cgaagaccta cttctcccac ctggacctga gccccggctc 601 ctcacaagtc agagcccacg gccagaaggt ggcggacgcg ctgagcctcg ccgtggagcg 661 cctggacgac ctaccccacg cgctgtccgc gctgagccac ctgcacgcgt gccagctgcg 721 agtggacccg gccagcttcc aggtgagcgg ctgccgtgct gggcccctgt ccccgggagg 781 gccccggcgg ggtgggtgcg gggggcgtgc ggggcgggtg caggcgagtg agccttgagc 841 gctcgccgca gctcctgggc cactgcctgc tggtaaccct cgcccggcac taccccggag 901 acttcagccc cgcgctgcag gcgtcgctgg acaagttcct gagccacgtt atctcggcgc 961 tggtttccga gtaccgctga actgtgggtg ggtggccgcg ggatccccag gcgaccttcc 1021 ccgtgtttga gtaaagcctc tcccaggagc agccttcttg ccgtgctctc tcgaggtcag 1081 gacgcgagag gaaggcgccg cccctcccca agga // LOCUS HUMITIH1A 1149 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human inter-alpha-trypsin inhibitor heavy chain mRNA, partial cds. ACCESSION M33033 KEYWORDS inter-alpha-trypsin inhibitor heavy chain. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1149) AUTHORS Salier,J.-P., Diarra-Mehrpour,M., Sesbouee,R., Bourguignon,J. and Martin,J.-P. TITLE Human inter-alpha-trypsin inhibitor: Isolation and characterization of heavy (H) chain cDNA clones coding for a 383 amino-acid sequence of the H chain JOURNAL Biol. Chem. Hoppe-Seyler 369, 15-18 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 1149 inter-alpha-trypsin inhibitor heavy chain (AA at 1) /hgml_locus_uid="LE0221G" /nomgen="ITIH1" /map="3p21.2-p21.1" BASE COUNT 332 a 292 c 253 g 272 t ORIGIN 1 ggaggcacaa acatcaacga agcactccta cgggcaatct tcattttgaa tgaagccaat 61 aacttgggac tgttagaccc caactccgtc tcgctgatca ttttggtttc tgatggagat 121 ccaacagtgg gcgaactaaa actgtcaaaa attcagaaaa acgttaagga gaacatccaa 181 gacaatatct ccttgttcag tttgggcatg ggatttgatg tggactatga ttttttgaag 241 agactgtcca atgaaaacca tggaattgca caaaggattt atggaaacca ggacacgtct 301 tcccagctta agaaattcta caaccaggtc tccactccat tgctccggaa tgttcagttc 361 aactatcccc atacatcagt cacggacgtc actcaaaaca atttccataa ctactttgga 421 ggctcagaga ttgtggtggc aggaaaattt gaccctgcta aattggatca aatagagagc 481 gttatcacgg cgacttcggc taacacgcag ttagtcttgg agaccctggc ccagatggac 541 gacttgcagg attttctatc gaaagacaag catgcagatc ccgatttcac caggaaactg 601 tgggcctatc taaccatcaa ccaactgcta gctgaacgaa gcctggctcc tacagctgcc 661 gccaagagaa gaattacaag atcgatcctg cagatgtctc tagaccacca cattgtgact 721 ccgctgacct cgctggtgat cgagaacgag gctggggatg agcgcatgct ggcggatgcc 781 ccaccgcagg atccctcctg ctgctcaggg gccctgtatt acggcagcaa agtggttcca 841 gattccaccc cgtcttgggc caatccttca gcaacgcccg tgatctccat gctggcacaa 901 ggatctcagg tgctagagtc cacgccaccc ccacatgtga tgagagttga aaatgaccca 961 cattccatca tttatctacc aaaaagccaa aagaacattt gtttcaatat tgactcagaa 1021 cctggaaaaa tcctcgacct ggcttctgac ccagaatcag gaattgtagt caacggtcag 1081 cttgttggtg ccaagaagcc caacaatgga aaactaagca cctattttgg aaaactggga 1141 ttttatttc // LOCUS HUMPTHROM 327 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human thrombin mRNA, 5'end. ACCESSION M33031 KEYWORDS serine protease; thrombin. SOURCE Human, cDNA to mRNA, clone pIIH13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 327) AUTHORS MacGillivray,R.T.A., Irwin,D.M., Guinto,E.R. and Stone,J.C. TITLE Recombinant genetic approaches to functional mapping of thrombin JOURNAL Ann. N.Y. Acad. Sci. 485, 73-79 (1986) STANDARD simple staff_review FEATURES from to/span description pept 28 > 327 thrombin precursor /hgml_locus_uid="LD0134L" /nomgen="F2" /map="11p11-q12" sigp 28 156 thrombin signal peptide matp 157 > 327 prothrombin BASE COUNT 60 a 97 c 109 g 61 t ORIGIN 1 ccgtagtgac ccaggagctg acacactatg gcccgcatcc gaggcttgca gctgcctggc 61 tgcctggccc tggctgccct gtgtagcctt gtgcacagcc agcatgtgtt cctggctcct 121 cagcaagcac ggtcgctgct ccagcgggtc cggcgagcca acaccttctt ggaggaggtg 181 cgcaagggca acctggagcg agagtgcgtg gaggagacgt gcagctacga ggaggccttc 241 gaggctctgg agtcctccac ggctacggat gtgttctggg ccaagtacac agcttgtgag 301 acagcgagga cgcctcgaga taagctt // LOCUS MUSCC3A 312 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse complement component C3 mRNA, partial cds. ACCESSION M33032 KEYWORDS complement component C3. SOURCE Mouse liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Fey,G.H., Wiebauer,K. and Domdey,H. TITLE Amino acid sequences of mouse complement C3 derived from nucleotide sequences of cloned cDNA JOURNAL Ann. N.Y. Acad. Sci. 421, 307-312 (1983) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 complement component C3 precursor (AA at 1) matp < 1 27 complement component C3-beta subunit (AA at 1) matp 40 273 complement component C3-alpha subunit matp 274 > 312 complement component C3-alpha' subunit BASE COUNT 90 a 77 c 90 g 55 t ORIGIN 1 gatcttgagt gcaccaagcc agcagcccgc cgccgtcgct cagtacagtt gatggaaaga 61 aggatggaca aagctggtca gtacactgac aagggtcttc ggaagtgttg tgaggatggt 121 atgcgggata tccctatgag atacagctgc cagcgccggg cacgcctcat cacccagggc 181 gagaactgca taaaggcctt catagactgc tgcaaccaca tcaccaagct gcgtgaacaa 241 cacagaagag accacgtgct gggcctggcc aggagtgaat tggaggaaga cataattcca 301 gaagaagata tt // LOCUS MUSN038A 1260 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse nucleolar protein N038 mRNA, complete cds. ACCESSION M33212 KEYWORDS nucleolar protein N038. SOURCE Mouse teratocarcinoma stem cell line F9, cDNA to mRNA, clone lambda-FML-185.19. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1260) AUTHORS Schmidt-Zachmann,M.S. and Franke,W.W. TITLE DNA cloning and amino acid sequence determination of a major constituent protein of mammalian nucleoli: Correspondence of the nucleoplasmin-related protein N038 to mammalian protein B23 JOURNAL Chromosoma 96, 417-426 (1988) STANDARD simple staff_review FEATURES from to/span description pept 79 957 nucleolar protein N038 mRNA < 1 1260 nucleolar protein N038 mRNA BASE COUNT 419 a 214 c 306 g 321 t ORIGIN 1 ggcgcgtctg ttctgtggaa caggaggcag ttgttttccg tccggcttct cccacaccga 61 agtgcgcgcc tccacctcat ggaagactcg atggatatgg acatgagtcc tcttaggcct 121 cagaactacc ttttcggctg tgaactaaag gctgacaaag actatcactt taaagtggat 181 aatgatgaaa atgagcacca gttgtcatta agaacggtca gtttaggagc aggggcaaaa 241 gatgagttac acatcgtaga ggcagaagca atgaactatg aaggcagtcc aattaaagta 301 acactggcaa ctttgaaaat gtctgtacaa ccaacagttt ccctaggggg ctttgaaatt 361 acaccacctg tggtcttacg gttgaagtgt ggttcagggc ctgtgcacat tagtggacag 421 catctagtag ctgtagagga agatgcagag tctgaagatg aagatgagga ggacgtaaaa 481 ctcttaggca tgtctggaaa gcgatctgct cctggaggtg gtaacaaggt tccacagaaa 541 aaagtaaaac ttgatgaaga tgatgaggac gatgatgagg acgatgagga tgatgaggat 601 gatgatgatg atgattttga tgaagaggaa actgaagaaa aggtcccagt gaagaaatct 661 gtacgagata ccccagccaa aaatgcacaa aaatcaaacc aaaatggaaa agacttaaaa 721 ccatcaacac cgagatcaaa gggtcaagag tccttcaaaa aacaggaaaa gactcctaaa 781 acaccaaaag gacctagttc tgtagaagac attaaggcaa aaatgcaagc aagtatagaa 841 aaaggcggtt ctcttcccaa agtggaagcc aagttcatta attatgtgaa gaattgtttc 901 cggatgactg accaggaggc tattcaagat ctctggcagt ggaggaaatc tctttaagaa 961 aagggtttaa acagtttgaa atattctgtc ttcatttctg taatagttaa tatctggctg 1021 tcctttttat aatgcaaagt gagaactttc cctactgtgt ttgataaatg ttgtccaggt 1081 tcacttgcca agaatgtgtt gtctaaaatg cctgtttagt tttcaaggat ggaactccac 1141 cctttacttg gttttaagta tgtatggaat gttatgatag gacatagtaa tagtggtcag 1201 atgtggaaat ggtagggaga caaatataca tgtgaaataa actcagtatt ttaataaagt // LOCUS RATPOS 1804 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat type-2A protein phosphatase catalytic subunit mRNA, complete cds. ACCESSION M33114 KEYWORDS type-2A protein phosphatase catalytic subunit. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1804) AUTHORS Kitagawa,Y., Tahira,T., Ikeda,I., Kikuchi,K., Tsuiki,S., Sugimura,T. and Nagao,M. TITLE Molecular cloning of cDNA for the catalytic subunit of rat liver type 2A protein phosphatase, and detection of high levels of expression of the gene in normal and cancer cells JOURNAL Biochim. Biophys. Acta 951, 123-129 (1988) STANDARD simple staff_review FEATURES from to/span description pept 114 1043 type-2A protein phosphatase catalytic subunit mRNA < 1 1804 type-2A protein phosphatase catalytic subunit mRNA BASE COUNT 482 a 391 c 437 g 494 t ORIGIN 1 ctggggccgc aggaagcacc ccggggagcg gcggcggcgt gtgcgtgtgg cccgggtgcg 61 ggcggcggcg cgggagcagc gcagagcggc agccggttcg ggcgggcggc atcatggacg 121 agaagttgtt caccaaggag ctggaccagt ggatcgagca gctgaacgag tgcaagcagc 181 tctccgagtc ccaggtcaag agcctctgcg agaaggctaa agaaatcctg acaaaagaat 241 ctaatgttca ggaggttcga tgtccagtca ctgtgtgtgg agatgtgcat gggcaatttc 301 atgacctcat ggaactcttt agaattggtg gtaaatcacc agatacaaat tacttgttta 361 tgggagacta tgtggacaga ggatattact cagttgaaac agttacactg cttgtagctc 421 ttaaggttcg ttaccgagag cgtatcacca tactccgagg gaatcacgag agcagacaga 481 tcacacaagt ttatggtttc tacgatgagt gtttaaggaa atacggaaat gcaaatgttt 541 ggaaatactt cacagacctt tttgactacc ttcctctcac tgccttggtg gatgggcaga 601 tcttctgtct acatggtggt ctttcaccat ccatagacac actggatcac atccgagcac 661 ttgatcgcct acaagaagtt cctcatgagg gtccaatgtg tgacttgctg tggtcagatc 721 cagatgaccg tggtggctgg gggatatctc ctcggggagc tggttatacc tttggccaag 781 atatttctga gacatttaat catgccaatg gcctcacgtt ggtgtccaga gctcaccagc 841 tggtgatgga gggatataac tggtgccatg accggaatgt agtaacaatt ttcagtgctc 901 caaactattg ctatcgttgt ggtaaccaag ctgcaatcat ggaacttgat gacactctta 961 agtattcttt cttgcagttc gatccagcac ctcgtagagg cgagccacat gtcactcgtc 1021 gtaccccaga ctacttcctg taatgaaagt ttaaccttgt acagtattgc catgaacacc 1081 gtctgttgac ctaatggaat cgggaagagc agcagtaact ccaaagtgtc agaaatagtt 1141 aacattcaaa cttgtttcca cacggaccaa aagatgtgcc atataaaata caaagcctct 1201 tgtcatcaac agccgtgacc actttagaat gaaccagttc attgcatgct gacgcgacat 1261 tgttggtcaa gaatccagtt tctggcatag cgctatttgt agttactttt gctttcttga 1321 gagactgcag atctaggatg taacattaac acctgtgagt ccagttgact tccacttagc 1381 tgtagcttac tcagcatgac tgtagatgag gatagcaaac aatcattgga gcttaatgaa 1441 catttttaaa tgagtaccaa ggcctcccct cttgttgtgt tctttcaggg atactattaa 1501 tttaattgta tgatttctct gcactcagtt tctcccttct caaatctcgg ccccgcgttg 1561 ttctttgtta ctgtcagaaa acctggtgag ttgttttgaa cagaactgtc tccctcctgt 1621 aagatgatgt actgcacaag tcaccgcagt gttttcataa taaacttgag aactgagaaa 1681 gtcaggtttg aattgtatca gtgggcacga ctggtgctgt ttattaaaca agataaatct 1741 attgatcaat ttcagaattt gtagaattcc aggtaaagaa aaataaagat caaggccact 1801 atat // LOCUS RATSCP2 1409 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat sterol carrier protein-2 (SCP-2)mRNA, complete cds. ACCESSION M34728 KEYWORDS sterol carrier protein-2. SOURCE Rat liver, cDNA to mRNA, clone SP43. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1409) AUTHORS Billheimer,J.T., Strehl,L.L., Davis,G.L., Strauss,J.F.III. and Davis,L.G. TITLE Characterization of a cDNA encoding rat sterol carrier protein-2 JOURNAL DNA Cell Biol. 9, 159-165 (1990) STANDARD simple staff_review FEATURES from to/span description pept 307 1128 sterol carrier protein-2 (SCP-2) mRNA < 1 1409 sterol carrier protein-2 mRNA BASE COUNT 387 a 300 c 390 g 332 t ORIGIN 2 bp upstream of EcoRI site. 1 ggaattccga acaaaggttg aacactttgc aaaaattgga tggaaaaatc ataaacactc 61 agttaataac ccgtattccc agttccaaga tgaatacagc ttagatgaga taatgaaatc 121 aaggccagtt ttcgattttc tgactgtctt acaatgctgt cccacctcag atggtgccgc 181 agcagcaatt gtgtctagtg aggagtttgt gcagaagcat ggcctgcagt ccaaagctgt 241 ggaaattgtg gcacaggaga tggtgactga catgcccagt acatttgaag aaaaagtgtt 301 attaaaatgg ttggctatga tatgagtaaa gaagctgcca ggaagtgcta tgagaagtcc 361 ggcctgggtc ccagtgatgt cgacgtgata gagcttcacg attgcttctc taccaatgaa 421 ctcctgactt atgaagcact ggggctctgt ccagaaggac aaggtggagc actggtggac 481 agaggggaca acacttacgg aggaaagtgg gtcataaacc ctagtggagg cctcatctcc 541 aagggacacc cactgggtgc cacaggtctg gctcagtgcg cggagctctg ctggcagctg 601 agaggcgaag ccggaaagag gcaggttcct ggggcaaagg tggctctgca gcacaattta 661 ggccttggag gagctgctgt tgtcaccctc tacagaatgg gttttcccga agctgccagc 721 tccttcagaa cgcaccagat ttcagctgct cccaccagct ctgcagggga tggattcaag 781 gcaaatctca tttttaagga aatcgagaag aagcttgaag aggaagggga agagttcgtg 841 aagaaaatcg gtggcatttt tgccttcaaa gtgaaggatg gccccggggg caaagaagct 901 acgtgggtgg tggacgtgaa gaacggcaaa ggatcggtgc ttccggattc agataagaag 961 gctgactgca caatcaccat ggctgactca gacttgctgg ctttgatgac tggtaaaatg 1021 aaccctcagt cggccttctt tcaaggtaaa ctgaaaattg ccggtaacat gggcctggcc 1081 atgaaactgc aaagcctgca gcttcagccg gacaaagcta agctgtgaag agtccctttg 1141 gcaacctcag gacatcaaga tgagatgtgt ggatacgtag aaatccacgt ctccctgtca 1201 ggacttagac tgacacttcc tgaatagcat gagatagatt tcttgctagg tggctatggc 1261 caattgtatt tcccccaagc tgggggtgca aagggcctcc caggctacac tgctgctttg 1321 aggacttgca ttctactgtg cttcatgaag ctactatgtt aatgatggtt tggggtaaac 1381 ttgagtttca gaataaagtt cagaatagt // LOCUS SYNPSBAII 556 bp ds-DNA BCT 12-JUL-1990 DEFINITION Synechococcus sp. photosystem II D1 protein (psbAII) gene, 5' end. ACCESSION M34833 KEYWORDS D1 protein; photosystem II. SOURCE Synechococcus (strain PCC 7942) DNA. ORGANISM Synechococcus sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 556) AUTHORS Bustos,S.A., Schaefer,M.R. and Golden,S.S. TITLE Different and rapid responses of four cyanobacterial psbA transcripts to changes in light intensity JOURNAL J. Bacteriol. 172, 1998-2004 (1990) STANDARD simple staff_review FEATURES from to/span description pept 81 425 ORF1 pept 527 > 556 photosystem II D1 protein (psbAII) mRNA 59 > 556 psbAII mRNA (alt.) mRNA 478 > 556 psbAII mRNA (alt.) BASE COUNT 136 a 145 c 142 g 133 t ORIGIN 1 ttccgtgacg gctactgcca gcatgccgag cctgatgtgt gacacctaag atcactccag 61 ttctctttgg aaactggctg atgagtgaag acaccatctt tggcaagatc atccggcgcg 121 agattccagc agacattgtt tatgaagatg atctctgtct ggcttttcga gatgtggcac 181 cccaagcgcc ggttcacatt ctggtgattc ccaagcaacc aattgccaac cttttggaag 241 cgacagcaga acatcaagcg ctgctgggtc atttgttgct gactgtaaag gcgatcgcgg 301 cccaagaagg actcaccgag ggctaccgca ccgtgattaa cacgggccct gcgggtgggc 361 aaaccgttta ccacctgcat attcacttac tgggcgggcg atcgctggct tggccgcccg 421 gctgagaaaa gtctgaaagt tctttacaaa actcaatctg cttgttagat tttactcacg 481 aggctattaa gtctcgtaaa tagttcaact aaggactcat cgcaaaatga cgactgcatt 541 gcagcggcgc gagagc // LOCUS ABCAARAA 1624 bp ds-DNA BCT 12-JUL-1990 DEFINITION A.aceti acetic acid resistance protein (aarA) gene, complete cds. ACCESSION M34830 KEYWORDS acetic acid resistance protein. SOURCE A.aceti (strain 10-8) DNA, clone pAR1611. ORGANISM Acetobacter aceti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 1624) AUTHORS Fukaya,M., Takemura,H., Okumura,H., Kawamura,Y., Horinouchi,S. and Beppu,T. TITLE Cloning of genes responsible for acetic acid resistance in acetobacter aceti JOURNAL J. Bacteriol. 172, 2096-2104 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 185 1495 acetic acid resistance protein (aarA) signal 1508 1545 transcription termination signal binding 171 176 ribosomal binding site (put.) BASE COUNT 400 a 446 c 404 g 374 t ORIGIN 1 gcatgcattt gcacacattc gcgcgaccct aagcccaaaa aactgtggtt ttccaagcat 61 actcctttcc gataacgctt cgtttatcgc tggcaacctt ccggtttcct tttgaatgag 121 tgacaaagtg tgacgagcag gccgcagcag cgaccgtggc ccaaccatgc agaaggaaac 181 actaatgagc gcgtcgcaga aagaaggtaa gctatctacc gctaccattt cggttgatgg 241 aaaatccgcc gaaatgcctg tgctttcagg cactctggga ccggatgtta tcgacatccg 301 caaacttccg gcgcaactgg gcgttttcac gtttgaccca ggttacgggg aaacagcggc 361 ctgcaacagc aaaatcacct ttattgatgg tgataaaggc gttctgctgc accgtggtta 421 ccctattgcg cagctggacg aaaatgcttc ctacgaagaa gttatttatc tgcttttgaa 481 tggcgaactg cccaacaagg tgcagtacga caccttcacc aacaccctta caaaccatac 541 gctgctgcac gagcagatcc gtaacttctt taacggcttc cggcgtgatg cccacccaat 601 ggccattctg tgtggtacgg ttggggcttt gtctgccttc tacccagatg ccaacgatat 661 tgccattccc gccaatcggg atctggccgc catgcggctg attgccaaaa tcccaaccat 721 tgcggcatgg gcttacaaat acacgcaggg tgaagccttt atctacccgc ggaatgatct 781 gaactacgca gaaaacttcc tgtccatgat gttcgcgcgc atgtccgaac cttacaaggt 841 caaccctgtt ctggcccgcg ccatgaaccg gattctgatt ctgcatgccg atcatgagca 901 gaatgcctct acctccaccg tacgtctggc tggttctaca ggggccaatc cgtttgcctg 961 tattgctgcg ggcattgccg ctctgtgggg acctgcacat ggtggcgcaa acgaagctgt 1021 gctgaaaatg ctggcccgta ttggcaagaa agaaaatatt cctgccttta tcgcacaggt 1081 gaaggacaag aacagcggcg taaagctgat gggctttggc caccgcgttt acaagaactt 1141 cgacccacgt gcgaagatca tgcagcagac ctgccacgaa gtgctgacag aacttggcat 1201 taaggatgat ccgctgctgg atctggcggt tgagctggaa aagattgctc tgagcgatga 1261 ttacttcgtg cagcgcaaac tttacccgaa tgtggatttc tactctggca tcattctcaa 1321 ggccatgggc atccccacca gtatgtttac tgtgctgttt gccgtagccc gcaccaccgg 1381 ctgggtgagc cagtggaagg aaatgattga agaaccgggc cagcgtatca gccgccctcg 1441 ccagctttat attggcgcac cgcagcgtga ctatgtgccg cttgccaaac gctaaaacag 1501 actaacccaa aaagccgact tcccgtaagg aaagtcggct ttttgtttgc acgctgtttc 1561 caaaaaaata gggcggcaga gcgaataaac gctacctagc cttcaggcat aaaaaaacgc 1621 atgc // LOCUS BOVBADPTA 708 bp ss-mRNA MAM 12-JUL-1990 DEFINITION Cow beta adaptin mRNA, partial cds. ACCESSION M34177 J05273 KEYWORDS beta adaptin. SOURCE Cow brain, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 708) AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and Parham,P. TITLE Conservation and diversity in families of coated vesicle adaptins JOURNAL J. Biol. Chem. 265, 4814-4820 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 708 beta adaptin (AA at 1) BASE COUNT 198 a 159 c 175 g 176 t ORIGIN 1 gctgtgaaga aagtgattgc tgctatgact gtggggaaag acgttagctc tctctttcca 61 gatgtagtga actgtatgca gacggataat ctggaactga agaagcttgt gtatctctac 121 ttgatgaact atgccaagag tcagccagac atggccatca tggctgtcaa cagctttgtg 181 aaggattgtg aagatcccaa tcctctgatt cgagctttgg cagtcagaac catggggtgc 241 atccgggtgg acaagataac agagtatctc tgtgagcccc tccgcaagtg cttaaaggat 301 gaagatccct acgtccggaa gacagcagca gtctgcgtgg caaaactcca tgacatcaat 361 gcccagatgg tggaagatca gggatttctg gattctctgc gggatctcat agcagattca 421 aatccaatgg tggtggctaa tgctgtagca gcactatctg aaatcagtga atctcacccc 481 aacagcaact tactcgatct gaatccacag aacattaata agctactgac agccctgaat 541 gagtgcaccg aatggggcca gattttcatc ctggactgct tatctaatta caatcctaaa 601 gatgaccggg aggctcagag catctgtgag cgggtaactc cccggttatc tcatgccaac 661 tcagcagtgg tgctttcagc agtaaaagtc ctaatgaaat ttttggaa // LOCUS BSUSENSA 1773 bp ds-DNA BCT 12-JUL-1990 DEFINITION B.subtilis transcription regulatory protein (senS) gene, complete cds. ACCESSION M34826 M30611 KEYWORDS transcription regulatory protein. SOURCE B.subtilis (strain DB2) DNA, clone pWL[77,80]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1773) AUTHORS Wang,L.-F. and Doi,R.H. TITLE Complex character of senS, a novel gene regulating expression of extracellular-protein genes of Bacillus subtilis JOURNAL J. Bacteriol. 172, 1939-1947 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.H.Doi, 11-DEC-1989, for release after publication. FEATURES from to/span description pept 1486 1683 transcription regulatory protein senS binding 1470 1477 ribosomal binding site (put.) site 1455 1474 transcription termination signal site 1671 1698 rho-independent transcription terminator BASE COUNT 460 a 397 c 388 g 528 t ORIGIN 1 agttcttgga aattctgatt ttcgatatct ggcgaattta cgtagtctcc catcgtttct 61 ttcgaaaggg acgttctcag cccctcaatc cagcggacat tttgtctttt ttctccaggg 121 gatgtccagt ttgttaagta ttcctgggcg atgattgcgt cacgataata aaatgccgtt 181 tggtcgggag cgacccgtcc ggctgccccg ccgagtgctt gctgccagac actggcgttt 241 tgattcggag cgtgctctaa aaagtgtttt attgttgaga tcgcacgttc tgataatggc 301 ttttcaatga aagagccgga gcgtttcatt ttttgaggct gattgcctcc cgggctgtta 361 aaaaaggtta ccgcttcaat gaatggcgtt gtttttacca ttccgcttga cggacttcct 421 gctttcaata aaggctttaa cagttttttt aactctgttt ttggcccgac aaattggccg 481 agggcttcta tgcggtttac ttctttaggc caaaactcta ttgatgatgt aagccggtca 541 tctgtatacg gggcccagtt ctgccacgtg ttatatactt cctcaaaatc atcccatccc 601 catgtaatag aaaaaatcga cacttgagag atgggcactg ctttaaatgt catggaggtg 661 actatgccga aattgcctcc tccgcctccc tgagacgccc aaaatgtgga tgatttgaac 721 agctgactgt aatcagatca gcgccctctt tttcgtctgc tacgatcatc tcaagctgca 781 cgaggctgtc gcaagtaaga ccggcagccc ttgttaaaag tccaattccc cctccgagag 841 ttaaacctgt gagccctaca ttagcaatgg tgcctgcggg aagcgtcagg ccgtattgcc 901 agagtgtccg atagacttct cccaattcag cccccgcttc aatataggcc agctttttat 961 cctgattcac agttattttt ttcatctcgc ttaaatcaat aacaagaccg ttatttaaaa 1021 gggaaaagtt ctcatagctg tgtctgccgc ctctaatacg gaaaggcaca cggttttcac 1081 gcgcccattt cagcgcattg agtgcatcct gtttgttttg gcaaaacaca atgatgtcag 1141 atcctttcta agcttaggtt aatattggtt cttgcttcgt tatagtccgg atcatcccgt 1201 gtcacgatac gtccggtcaa ttttgtcttt tccacactcc cacatctctt tctctcgtat 1261 tctagtttct ctagcttatg cgtcagggga aaagagtgta taaggaaaaa gcggggatgc 1321 aatctgatac agtgtcaaca ccctcaaaaa atagttgaca ggtcggtatt gtatgaatta 1381 acatggtcag tacaaatttt tcaaatttat cgcgctgatc ggaacaccga aggctcttat 1441 cgtttagata agggcctttt ttgtatgaaa aaggggggat tattgatggg agtcaaaaaa 1501 gaaaagggga gaaaacgatt caggaagcga aaaacctacg ggaatcagat tttgccgctt 1561 gagctgctga ttgaaaaaaa caaacgagag attataaaca gcgcggaact catggaagaa 1621 atttatatga agattgatga gaagcatacg caatgtgtaa ctaaatataa aaaaacccgc 1681 tgactacaac gggtttttgc atttctccat taagaatctt ttttaatcgg caatccaagg 1741 ccttctgcca cgcgttttcc gtattcagga tcc // LOCUS CHITDNA 176 bp ds-DNA INV 12-JUL-1990 DEFINITION C.thummi telomeric DNA. ACCESSION M33211 KEYWORDS telomeric DNA. SOURCE C.thummi heat-shocked larvae, cDNA to mRNA, clone lambda-Cth5. ORGANISM Chironomus thummi Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Chironomidae. REFERENCE 1 (bases 1 to 176) AUTHORS Carmona,M.J., Morcillo,G., Galler,R., Martinez-Salas,E., de la Campa,A.G., Diez,J.L. and Edstroem,J.E. TITLE Cloning and molecular characterization of a telomeric sequence from a temperature-induced Balbiani ring JOURNAL Chromosoma 92, 108-115 (1985) STANDARD simple staff_entry BASE COUNT 63 a 34 c 27 g 52 t ORIGIN Chromosome III. 1 aattctagaa aaatcgagtt ttttcgaaaa catgaaaatt ttttttctct catcctagaa 61 caagtgtttt agacctcaaa acagatgtga acataaaagt gatgtattga caaaagttgc 121 tccaaactga gatgcatcca acgtgatatc gatatcccat gtacccccct atggaa // LOCUS ECOSUHBA 1017 bp ds-DNA BCT 12-JUL-1990 DEFINITION E.coli extragenic suppressor (suhB) gene, complete cds. ACCESSION M34828 KEYWORDS extragenic suppressor; suhB gene. SOURCE E.coli DNA, clone pRY61. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1017) AUTHORS Yano,R., Nagai,H., Shiba,K. and Yura,T. TITLE A mutation that enhances synthesis of sigma-32 and suppresses temperature-sensitive growth of the rpoH15 mutant of Escherichia coli JOURNAL J. Bacteriol. 172, 2124-2130 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 194 997 suhB protein signal 122 127 -35 region signal 145 150 -10 region BASE COUNT 244 a 278 c 259 g 236 t ORIGIN 55 min on K12 map. 1 catggcacgg gcaacagaac ccatattgcc ggtgtgtgac gtctccacca gcacaattcg 61 aatattttgc agcattgtct ttcttcatct aaagattatt cacgcatctt atcataaaac 121 gaagacagat gccgatctcg ctgctatact ctgcgccgtt ttcccgttct ttaacatcca 181 gtgagagaga ccgatgcatc cgatgctgaa catcgccgtg cgcgcagcgc gcaaggcggg 241 taatttaatt gccaaaaact atgaaacccc ggacgctgta gaagcgagcc agaaaggcag 301 taacgatttc gtgaccaacg tagataaagc tgccgaagcg gtgattatcg acacgattcg 361 taaatcttac ccacagcaca ccatcatcac cgaagaaagc ggtgaacttg aaggtactga 421 tcaggatgtt caatgggtta tcgatccact ggatggcact accaacttta tcaaacgtct 481 gccgcacttc gcggtatcta tcgctgttcg tatcaaaggc cgcaccgaag ttgctgtggt 541 atacgatcct atgcgtaacg aactgttcac cgccactcgc ggtcagggcg cacagctgaa 601 cggctaccga ctgctcggca gcaccgctcg cgatctcgac ggtactattc tggcgaccgg 661 cttcccgttc aaagcaaaac agtacgccac tacctacatc aacatcgtcg gcaaactgtt 721 caacgaatgt gcagacttcc gtcgtaccgg ttctgcggcg ctggatctgg cttacgtcgc 781 tgcgggtcgt gttgacggtt tctttgaaat cggtctgcgc ccgtgggact tcgccgcagg 841 cgagctgctg gttcgtgaag cgggcggcat cgtcagcgac ttcaccggtg gtcataacta 901 catgctgacc ggtaacatcg ttgctggtaa cccgcgcgtt gttaaagcca tgctggcgaa 961 catgcgtgac gagttaagcg acgctctgaa gcgttaatga ctcaggcggg tgatatc // LOCUS HUMBADPTA 5701 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human beta adaptin mRNA, complete cds. ACCESSION M34175 J05273 KEYWORDS beta adaptin. SOURCE Human fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5701) AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and Parham,P. TITLE Conservation and diversity in families of coated vesicle adaptins JOURNAL J. Biol. Chem. 265, 4814-4820 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 178 2991 beta adaptin mRNA < 1 5701 beta adaptin mRNA signal 5683 5688 polyA signal BASE COUNT 1528 a 1373 c 1284 g 1516 t ORIGIN 1 ctgcccacca tctttgtccc tggcaaagtg ggttttgcgc agtggcttag acctagaaaa 61 gaatcgtgac gggcaggaaa ccattacacc accacctggg ctgtgctctc cggctcccgc 121 cgccaccccc gccctcgcct tcgcctccgc tccggtgcac attaaagatc caaagtcatg 181 actgactcca agtatttcac aaccaataaa aaaggagaaa tatttgaact aaaagctgaa 241 ctcaacaatg aaaagaaaga aaagagaaag gaggctgtga agaaagtgat tgctgctatg 301 accgtgggga aggatgttag ttctctcttt ccagacgtag tgaactgtat gcagactgac 361 aatctggaac taaagaagct tgtgtatctc tacttgatga actacgccaa gagtcagcca 421 gacatggcca tcatggctgt aaacagcttt gtgaaggact gtgaagatcc taatcctttg 481 attcgagcct tggcagtcag aaccatgggg tgcatccggg tagacaaaat tacagaatat 541 ctctgtgagc cgctccgcaa gtgcttgaag gatgaggatc cctatgttcg gaaaacagca 601 gcagtctgcg tggcaaaact ccatgatatc aatgcccaaa tggtggaaga tcagggattt 661 ctggattctc tacgggatct catagcagat tcaaatccaa tggtggtggc taatgccgta 721 gcggcattat ctgaaatcag tgagtctcac ccaaacagca acttacttga tctgaaccca 781 cagaacatta ataagctgct gacagccctg aatgaatgca ctgaatgggg ccagattttc 841 atcctggact gcctgtctaa ttacaaccct aaagatgatc gggaggctca gagcatctgt 901 gagcgggtaa ctccccggct atcccatgcc aactcagcag tggtgctttc agcggtaaaa 961 gtcctaatga agtttctaga attgttacct aaggattctg actactacaa tatgctgctg 1021 aagaagttag cccctccact tgtcactttg ctgtctgggg agccagaagt gcagtatgtc 1081 gccctgagga acatcaactt aattgtccag aaaaggcctg aaatcttgaa gcaggaaatc 1141 aaagtcttct ttgtgaagta caatgatccc atctatgtta aactagagaa gttggacatc 1201 atgattcgtt tggcatctca agccaacatt gctcaggttc tggcagaact gaaagaatat 1261 gctacagagg tggatgttga ctttgttcga aaagctgtgc gggccattgg acggtgtgcc 1321 atcaaggtgg agcaatctgc agagcgctgt gtaagcacat tgcttgatct aatccagacc 1381 aaagtgaatt atgtggtcca agaagcaatt gttgtcatca gggacatctt ccgcaaatac 1441 cccaacaagt atgaaagtat catcgccact ctgtgtgaga acttagactc gctggatgag 1501 ccagatgctc gagcagctat gatttggatt gtgggagaat atgctgaaag aattgacaat 1561 gcagatgagt tactagaaag cttcctggag ggttttcacg atgaaagcac ccaggtgcag 1621 ctcactctgc ttactgccat agtgaagctg tttctcaaga aaccatcaga aacacaggag 1681 ctagtccagc aggtcttgag tttggcaaca caggattctg ataatcctga ccttcgagac 1741 cggggctata tttattggcg ccttctctca actgaccctg ttacagctaa agaagtagtc 1801 ttgtctgaga agccactgat ctctgaggag acggacctta ttgagccaac tctgctggat 1861 gagctaatct gccacattgg ttctttggcc tctgtgtatc ataagcctcc caatgctttt 1921 gtggaaggaa gtcatggaat tcatcgtaaa cacttgccaa ttcatcatgg gagcactgat 1981 gcaggtgaca gccctgttgg cactaccact gcaacgaacc tggaacagcc tcaggttatc 2041 ccctctcaag gtgatcttct aggggatctt ttaaaccttg acctcggtcc cccagtcaat 2101 gtgccacagg tgtcctccat gcagatggga gcagtggatc tcctaggagg aggactagat 2161 agtctggtgg gacaatcctt catcccatca tcggtgcctg caacctttgc tccttcacct 2221 acacctgctg tggtcagcag tggactgaat gacctgtttg aactctccac agggataggc 2281 atggcacctg gtggatatgt ggctcctaag gctgtctggc tacctgcagt aaaggctaaa 2341 ggcttggaga tttccggaac atttactcac cgccaagggc acatctatat ggaaatgaac 2401 ttcaccaata aagctctgca gcacatgaca gattttgcaa tccagtttaa caaaaatagc 2461 tttggtgtca tccccagcac tcctctggcc atccatacac cactgatgcc aaaccagagc 2521 attgatgtct ccctgcctct caataccttg ggcccagtca tgaagatgga acctctgaat 2581 aacctccagg tggctgtgaa aaacaatatc gatgtcttct acttcagctg cctcatccca 2641 ctcaatgtgc tttttgtaga agatggcaaa atggagcgcc aggtcttcct tgcaacatgg 2701 aaggatattc ccaatgaaaa tgaacttcag tttcagatta aggaatgtca tttaaatgct 2761 gacactgttt ccagcaagtt gcaaaacaac aatgtttata ctattgccaa gaggaatgtg 2821 gaagggcagg acatgctgta ccaatccctg aagctcacta atggcatttg gattttggcc 2881 gaactacgta tccagccagg aaaccccaat tacacgctgt cactgaagtg tagagctcct 2941 gaagtctctc aatacatcta tcaggtctac gacagcattt tgaaaaacta acaagactgg 3001 tccagtaccc ttcaaccatg ctgtgatcgg tgcaagtcaa gaactcttaa ctggaagaaa 3061 ttgtattgct gcgtagaatc tgaacacact gaggccacct agcaaggtag taactagtct 3121 aacctgtgct aacattaggg cacaacctgt tggatagttt tagcttcctg tgaacatttg 3181 taaccactgc ttcagtcacc tcccacctct tgccacctgc tgctgctatc tgtccttact 3241 tgtgggcttc tccatgctgt gccaatggct ggctttttct acaccctctt ttgagtgtag 3301 tttggtattt tgtaattgag agctcatttc aaaagcagaa aaagacaaca aatattaaag 3361 caaggaaaag tgtaactgaa acactgcact ttactgtttt atacttttgt acatatgaga 3421 aatcaaggga ttagtgcaac cagtagaagg cattgaaatg actgtcatta accacacagt 3481 cctggaggca gagatgcagt tacctaccct agcttttgat gggttctctt acctgtagta 3541 gccttatccc tggtcatttg gattttcagt ttgctttttt ctttttttcc cctccaaact 3601 ccttttcctt ggccaagcct tcatgcttcc ccctttccat attataatct catttgattg 3661 ctctgcagtt gggaacggtg atcttcttga atgatgtttc agtgtgcaaa aactatagag 3721 cctgtcagca ccaaagctga cagaagttat accttactcc tttcctttcc cctgaacaaa 3781 cctgctaatc ccactaattc aggaatttga gtagagatgg ggaacaagaa cccagatgct 3841 gtcccctcac cccctctcct gtatttctca ggtccagttc aaatctaaaa ttctactttt 3901 agagttgaaa cagagtaata acttatctaa ccctcttttc ctacaaagga gaaagataaa 3961 aggcacaaag gttaccgcca aggcccgtca gctgtgtagt ggcaaagccg agaccgagtc 4021 tcctaagtcc ccgtcagtgt ggttttcacc acaggactgt ctcttgtcgt tttcccctaa 4081 tgccttctcc tgccttttct gtgcctagtt tttggctctt cacatattcc atattgattt 4141 tgacgctctg tatattggca tcaggtggca gctgaatatc ttttgaatta ctcgaaggta 4201 aagccagatg ccagaatgaa ggtgtagcca gtgtttccca tatgcccctg gagccccact 4261 tattgaggcc agcagaatag gtgcagagat gaagtgagct tagagatgtt gcaaatgctc 4321 tttatccctt cagctctctg atctgctctt tcttcatgat acttagtctg cagggcatat 4381 taagatcatc ccagaggttc aggcagttcc tgtcatctct gaaaagactg ggggatatga 4441 aatcttcccc ctaccccact taatgcgttg gatatgattt ttcaaagaat gcttcatgcc 4501 caaaatacca gcctgtttag cagtgttaca ctgtttgatc tgcgggcact tgttgcattg 4561 cctggcaccc aatattcagg gtccatgact aagactggtc ttctcagatg ccctgcttaa 4621 atcaggggca cttcaggctc cacaggcgtc atgttggact gagacctaac tcactggact 4681 cagaggagga atcgtggaaa acaagagcaa aactacccca cacccctatt tcatgtctga 4741 aataaccctg tttcatacca gttgcaaagc ttgtggggag cggtcccaca aagcactttc 4801 ttaaaccttg agaatctcca agagaaaaat atttggggaa ggagggagga aatatgtccc 4861 ttgcacacca cccctgaagc acatggcagt aggaaacagc ataggattgt atgtgggagg 4921 tggataggtc ggtgatgtgt ggagcggaaa agcaggttgg taaagttccc ttcttgggac 4981 ttattcctgg agtcagtgga tacaagtagt gcagaaggtt cacactgcaa atagtgttct 5041 catctcaaag caaactatca ttccagaagg aaaagtgtgt cagggcaagc agacaacaca 5101 atttcctatc agaatatgtc cctcaacccc cgaaacaagg cttctctcag cctccccacc 5161 agtgatggat aacagctcct attctcagct gacctgactg agccaaccca tgaactcttc 5221 actccttggg gaagccacct cccatcacac ccctgagcag agttagggag gaattctact 5281 tcccataaaa ggacctctcc tgagaggcaa aacctgttgc ctccaccacg gcttccctct 5341 tggctcattc caagcttggc caaattgggg aagtgggatg gaggttgccc tgcatccccc 5401 ctcctctgcc tgagtgtgtc tttgtaatgt cagctggcat catacaaaga gcaggagaag 5461 caaacaccca gaactctttt gctggtcaga gattccctga gtgtctgtcc tcacccaagc 5521 ctgctctgtg tctgtgttgt gaagcttgag actctggaaa gaaatgggga gggggggcag 5581 gggaaatgtt gccctaagaa tgcttctcat tcctctgttc ttattgggtc ctgtttttcg 5641 ggagggtggg ggttggggga agcttgacct tgtgtcttcg tcaataaact cacatttaca 5701 c // LOCUS HUMCD59A 1671 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human lymphocytic antigen CD59/MEM43 mRNA, complete cds. ACCESSION M34671 X15861 KEYWORDS CD59 antigen; cell surface antigen; integral membrane protein. SOURCE Human peripheral blood monocyte, cDNA to mRNA, clone R18.. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 108 to 443) AUTHORS Sawada,R., Ohashi,K., Okano,K., Hattori,M., Minato,N. and Naruto,M. TITLE Complementary DNA sequence and deduced peptide sequence for CD59/MEM43 antigen, the human homologue of murine lymphocyte antigen Ly-6c JOURNAL Nucleic Acids Res. 17, 6728-6728 (1989) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 1671) AUTHORS Sawada,R., Ohashi,K., Anaguchi,H., Okazaki,H., Hattori,M., Minato,N. and Naruto,M. TITLE Isolation and expression of the full-length cDNA encoding CD59 antigen of human lymphocytes JOURNAL DNA 9, 213-220 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer readable copy for sequence [1] kindly provided by Naruto,M., 17-JUL-1989. [1] Author address: Naruto,M. Basic Research Laboratories Toray Industries Inc 1111 Tebiro Kamakura 248, Japan. FEATURES from to/span description pept 30 416 antigen CD59 precursor (CD59) /hgml_locus_uid="LY0169B" /nomgen="CD59" /map="11pter-p13" sigp 30 104 CD59 signal peptide matp 105 413 CD59 protein mRNA < 1 1671 CD59 mRNA signal 527 532 polyA signal BASE COUNT 434 a 347 c 390 g 500 t ORIGIN 1 ggcgccgcca ggttctgtgg acaatcacaa tgggaatcca aggagggtct gtcctgttcg 61 ggctgctgct cgtcctggct gtcttctgcc attcaggtca tagcctgcag tgctacaact 121 gtcctaaccc aactgctgac tgcaaaacag ccgtcaattg ttcatctgat tttgatgcgt 181 gtctcattac caaagctggg ttacaagtgt ataacaagtg ttggaagttt gagcattgca 241 atttcaacga cgtcacaacc cgcttgaggg aaaatgagct aacgtactac tgctgcaaga 301 aggacctgtg taactttaac gaacagcttg aaaatggtgg gacatcctta tcagagaaaa 361 cagttcttct gctggtgact ccatttctgg cagcagcctg gagccttcat ccctaagtca 421 acaccaggag agcttctccc aaactccccg ttcctgcgta gtccgctttc tcttgctgcc 481 acattctaaa ggcttgatat tttccaaatg gatcctgttg ggaaagaata aaattagctt 541 gagcaacctg gctaagatag aggggctctg ggagactttg aagaccagtc ctgtttgcag 601 ggaagcccca cttgaaggaa gaagtctaag agtgaagtag gtgtgacttg aactagattg 661 catgcttcct cctttgctct tgggaagacc agctttgcag tgacagcttg agtgggttct 721 ctgcagccct cagattattt ttcctctggc tccttggatg tagtcagtta gcatcattag 781 tacatctttg gagggtgggg caggagtata tgagcatcct ctctcacatg gaacgctttc 841 ataaacttca gggatcccgt gttgccatgg aggcatgcca aatgttccat atgtgggtgt 901 cagtcaggga caacaagatc cttaatgcag agctagagga cttctggcag ggaagtgggg 961 aagtgttcca gatagcaggg catgaaaact tagagaggta caagtggctg aaaatcgagt 1021 ttttcctctg tctttaaatt ttatatgggc tttgttatct tccactggaa aagtgtaata 1081 gcatacatca atggtgtgtt aaagctattt ccttgccttt ttttattgga atggtaggat 1141 atcttggctt tgccacacac agttacagag tgaacactct actacatgtg actggcagta 1201 ttaagtgtgc ttattttaaa tgttactggt agaaaggcag ttcaggtatg tgtgtatata 1261 gtatgaatgc agtggggaca ccctttgtgg ttacagtttg agacttccaa aggtcatcct 1321 taataacaac agatctgcag gggtatgttt taccatctgc atccagcctc ctgctaactc 1381 ctagctgact cagcatagat tgtataaaat acctttgtaa cggctcttag cacactcaca 1441 gatgtttgag gctttcagaa gctcttctaa aaaatgatac acacctttca caagggcaaa 1501 ctttttcctt ttccctgtgt attctagtga atgaatctca agattcagta gacctaatga 1561 catttgtatt ttatgatctt ggctgtattt aatggcatag gctgactttt gcagatggag 1621 gaatttcttg attaatgttg aaaaaaaacc cttgattata ctctgttgga c // LOCUS HUMKER19PA 1586 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human keratin K19 pseudogene. ACCESSION M33101 KEYWORDS keratin K19; pseudogene. SOURCE Human, cDNA to mRNA, clone IF7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1586) AUTHORS Savtchenko,E.S., Schiff,T.A., Jiang,C.-K., Freedberg,I.M. and Blumenberg,M. TITLE Embryonic expression of the human 40-kD keratin: Evidence from a processed pseudogene sequence JOURNAL Am. J. Hum. Genet. 43, 630-637 (1988) STANDARD simple staff_entry FEATURES from to/span description pept.ps 120 1310 keratin 19 pseudogene signal 1412 1418 polyA signal BASE COUNT 377 a 405 c 470 g 334 t ORIGIN 1 attgataaac atataatctg atatttatgt aaagtagcta ttttttaaaa aaagtatggc 61 tcctccctcg aatcgcagcc tctgggacca gggtcgctcc atccgtcgtc cgcctcgcca 121 tgacttccta cacgtatcgc cagtcgtagg ccaagtagtc cttctggggc ctgggtggtg 181 gctccgtgag ttttgtggca gaggttgcct ttcgcgcgct cagcatgcac tgggcctctg 241 gaggccgcgg cgtgtccgtg tcctccgccc gcttcgtgtc tgtcctcgtc ctccttgggg 301 ggctacggcg gcgtcttggc cgtgtcctac gggctgctgg cgggcaacga gaagctcaat 361 atgcagaacc tcagcgaccc tctggcctcc tacctggaca aggtgggcgc cctggaggac 421 gccaacggca aactggaggt gaagatccgc gactggtacc agaagcaggg gcccgggcct 481 cccgtgacta cagccactct acaagactat ccaggacctg cggtacaaga ttcttggtgc 541 caccattgag aactccagga ttgtcctgga gatcgacaac gcccgtctgg ctgcagatga 601 cttccgaacc aagagtgaga cggagcaggc tctgcgcatg agcggaggcc gacatcaacg 661 gcctgcgcag ggtgctggac gagctgaccc tggccattac cgacctggag atgcagatct 721 aaggcctgaa ggaagagctg gcctacctga agaagaacca tgagaaggaa atcagtgggc 781 tgaggggcca agtgggaggc caggtcagtg gggaggtgga ttcggctcag ggcacctatc 841 tcgccaagat cctgagttac atgcgaacgc aatacgaggt catggcggac aacaactgga 901 aggatgctga agcctggttc accagccgga ctgaagaatt gaaccgggag gtcgctggcc 961 acacagatca gctccagatg agccggtcca aggtcgctga cctgcggcgc accctccagg 1021 gtcttgagct ggagctgcag tcacggctga gcatgaaagc cgccttggaa gccacactgg 1081 cagaaacgga ggcgcgcttt ggagtccact tggcgcagat ccagccgctg atcaactgta 1141 ttgaagccca gctgggcgat gtgcgagctg atagtgagcg gcagaatcag gattaacagc 1201 agttcatgga catcaagtcg cggctggagc aggagatctc cacctaccgc agcctgctcg 1261 agggccagaa ggatcactac aacaacctgt ccgcctccaa ggtcctctga ggcagcaggc 1321 taaggggctt ctactgtcct ttggagggtg tctcctgggt agggggatgg gaaggaaggg 1381 acccttaccc cctgctcttc ccctgatctg ccaataaaat tttatggtcc aaggggaaaa 1441 aaaaaaaaaa aaaaaatata tatatatata tatatatata tatatatgtg tgtgtgtgtg 1501 tgtgtgtgtg tgtatatata cgtgtgtgtg tatatatata tatatgaaaa acaatacatg 1561 ctcgttgtag aaatgtggaa acatgg // LOCUS HUMLOX15A 2671 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human 15-lipoxygenase mRNA, complete cds. ACCESSION M23892 KEYWORDS 15-lipoxygenase. SOURCE Human reticulocyte, cDNA to mRNA, clone 15LOX. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2671) AUTHORS Sigal,E., Craik,C.S., Highland,E., Grunberger,D., Costello,L.L., Dixon,R.A.F. and Nadel,J.A. TITLE Molecular cloning and primary structure of human 15-lipoxygenase JOURNAL Biochem. Biophys. Res. Commun. 157, 457-464 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 4 1992 15-lipoxygenase mRNA < 1 2671 15-lipoxygenase mRNA BASE COUNT 580 a 743 c 718 g 630 t ORIGIN 1 aagatgggtc tctaccgcat ccgcgtgtcc actggggcct cgctctatgc cggttccaac 61 aaccaggtgc agctgtggct ggtcggccag cacggggagg cggcgctcgg gaagcgactg 121 tggcccgcac ggggcaagga gacagaactc aaggtggaag taccggagta tctggggccg 181 ctgctgtttg tgaaactgcg caaacggcac ctccttaagg acgacgcctg gttctgcaac 241 tggatctctg tgcagggccc cggagccggg gacgaggtca ggttcccttg ttaccgctgg 301 gtggagggca acggcgtcct gagcctgcct gaaggcaccg gccgcactgt gggcgaggac 361 cctcagggcc tgttccagaa acaccgggaa gaagagctgg aagagagaag gaagttgtac 421 cggtggggaa actggaagga cgggttaatt ctgaatatgg ctggggccaa actatatgac 481 ctccctgtgg atgagcgatt tctggaagac aagagagttg actttgaggt ttcgctggcc 541 aaggggctgg ccgacctcgc tatcaaagac tctctaaatg ttctgacttg ctggaaggat 601 ctagatgact tcaaccggat tttctggtgt ggtcagagca agctggctga gcgcgtgcgg 661 gactcctgga aggaagatgc cttatttggg taccagtttc ttaatggcgc caaccccgtg 721 gtgctgaggc gctctgctca ccttcctgct cgcctagtgt tccctccagg catggaggaa 781 ctgcaggccc agctggagaa ggagctggag ggaggcacac tgttcgaagc tgacttctcc 841 ctgctggatg ggatcaaggc caacgtcatt ctctgtagcc agcagcacct ggctgcccct 901 ctagtcatgc tgaaattgca gcctgatggg aaactcttgc ccatggtcat ccagctccag 961 ctgccccgca caggatcccc accacctccc cttttcttgc ctacggatcc cccaatggcc 1021 tggcttctgg ccaaatgctg ggtgcgcagc tctgacttcc agctccatga gctgcagtct 1081 catcttctga ggggacactt gatggctgag gtcattgttg tggccaccat gaggtgcctg 1141 ccgtcgatac atcctatctt caagcttata attccccacc tgcgatacac cctggaaatt 1201 aacgtccggg ccaggactgg gctggtctct gacatgggaa ttttcgacca gataatgagc 1261 actggtgggg gaggccacgt gcagctgctc aagcaagctg gagccttcct aacctacagc 1321 tccttctgtc cccctgatga cttggccgac cgggggctcc tgggagtgaa gtcttccttc 1381 tatgcccaag atgcgctgcg gctctgggaa atcatctatc ggtatgtgga aggaatcgtg 1441 agtctccact ataagacaga cgtggctgtg aaagacgacc cagagctgca gacctggtgt 1501 cgagagatca ctgaaatcgg gctgcaaggg gcccaggacc gagggtttcc tgtctcttta 1561 caggctcggg accaggtttg ccactttgtc accatgtgta tcttcacctg caccggccaa 1621 cacgcctctg tgcacctggg ccagctggac tggtactctt gggtgcctaa tgcaccctgc 1681 acgatgcggc tgcccccgcc aaccaccaag gatgcaacgc tggagacagt gatggcgaca 1741 ctgcccaact tccaccaggc ttctctccag atgtccatca cttggcagct gggcagacgc 1801 cagcccgtta tggtggctgt gggccagcat gaggaggagt atttttcggg ccctgagcct 1861 aaggctgtgc tgaagaagtt cagggaggag ctggctgccc tggataagga aattgagatc 1921 cggaatgcaa agctggacat gccctacgag tacctgcggc ccagcgtggt ggaaaacagt 1981 gtggccatct aagcgtcgcc accctttggt tatttcagcc cccatcaccc aagccacaag 2041 ctgacccctt cgtggttata gccctgccct cccaagtccc accctcttcc catgtcccac 2101 cctccctaga ggggcacctt ttcatggtct ctgcacccag tgaacacatt ttactctaga 2161 ggcatcacct gggaccttac tcctctttcc ttccttcctc ctttcctatc ttccttcctc 2221 tctctcttcc tctttcttca ttcagatcta tatggcaaat agccacaatt atataaatca 2281 tttcaagact agaatagggg gatataatac atattactcc acacctttta tgaatcaaat 2341 atgatttttt tgttgttgtt aagacagagt ctcactttga cacccaggct ggagtgcagt 2401 ggtgccatca ccacggctca ctgcagcctc agcgtcctgg gctcaaatga tcctcccacc 2461 tcagcctcct gagtagctgg gactacaggc tcatgccatc atgcccagct aatatttttt 2521 tattttcgtg gagacggggc ctcactatgt tgcctaggct ggaaatagga ttttgaaccc 2581 aaattgagtt taacaataat aaaaagttgt tttacgctaa agatggaaaa gaactaggac 2641 tgaactattt taaataaaat attggcaaaa g // LOCUS MUSBPGALA 334 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse beta-galactoside-binding lectin (L-14.5) mRNA, 5' end. ACCESSION M33214 KEYWORDS beta-galactoside-binding lectin. SOURCE Mouse (strain C57BL/6) 12 day old embryo melanoma cell line UV-2237-IP, cDNA to mRNA, clone L3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Raz,A., Carmi,P. and Pazerini,G. TITLE Expression of two different endogenous galactoside-binding lectins sharing sequence homology JOURNAL Cancer Res. 48, 645-649 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 16 > 334 14 kDa beta-galactoside-binding lectin (L-14.5) BASE COUNT 86 a 94 c 91 g 63 t ORIGIN 1 gaattgggta caatcatggc ctgtggtctg gtggatcagc aagctgaatc tcaaactggg 61 gcaatgtctc aaagttcggg gcagaggtgg acctcggacg acaggagctt tgtgctgacc 121 ctgggaaaag acagcaacaa ccgttgccta cacttcaatc ctcgcttcaa tgcccatgga 181 gacgccaaca ccattctgtg taacaccaag gaagatggga cctggggaac cgaacaccgg 241 gaacctgcct tccccttcca gcccgggagc atcacagagt gtgcatgcac ctttgaccag 301 gctgacctga ccatgcaagc tgccagacgg acat // LOCUS MUSBPGALB 621 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse beta-galactoside-binding lectin (L-34) mRNA, 3' end. ACCESSION M33215 KEYWORDS beta-galactoside-binding lectin. SOURCE Mouse (strain C57BL/6) 12 day old embryo melanoma cell line UV-2237-IP3, cDNA to mRNA, clone M5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 621) AUTHORS Raz,A., Carmi,P. and Pazerini,G. TITLE Expression of two different endogenous galactoside-binding lectins sharing sequence homology JOURNAL Cancer Res. 48, 645-649 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 420 34 kDa beta-galactoside-binding lectin (L-34) BASE COUNT 164 a 171 c 154 g 132 t ORIGIN 1 cccagggcaa cctggggcac ctggggccat ccccagtgct cctggaggct atcctgctgc 61 tggcccttat ggtgtccccg ctggaccact gacgtgccct atgacctgcc cttgcctgga 121 ggagtcatgc cccgcatgct gatcacaatc atgggcacag tgaaacccaa cgcaaacagg 181 attgttctag atttcaggag agggaatgat gttgccttcc actttaaccc ccgcttcaat 241 gagaacaaca gaagactcat tgtgtgtaac acgaagcagg acaataactg gggaaaggaa 301 gaaagacagt cagccttccc ctttgagagt ggaaaaccat tcaaaataca agtcctggtt 361 gcagctgacc attcaggttg cggtcacgat gctcactact gcagtacaac catcggatga 421 agaacctccg ggaaatcagc caactggcga tcagtggtga cataaccctg caccagcgct 481 gaaccagcgc catgatctaa gccagaaggg gcggcaccga aaccggccct gtgtgcctta 541 ggagtgggaa actttgcatt tctctctcct tatccttctt gtaagacatc catttaataa 601 agtctcatgc tgagagaaaa g // LOCUS MUSP32A 1510 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Mouse tumor-induced 32 kD protein (p32) mRNA, complete cds. ACCESSION M33203 KEYWORDS tumor-induced protein. SOURCE Mouse (strain BALB/c) fibroblast cell line 3T3 A31, cDNA to mRNA, clone pMp32S. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1510) AUTHORS Kageyama,H., Hiwasa,T., Tokunaga,K. and Sakiyama,S. TITLE Isolation and characterization of a complementary DNA clone for a M-r 32,000 protein which is induced with tumor promoters in BALB/c 3T3 cells JOURNAL Cancer Res. 48, 4795-4798 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 75 944 32 kD protein (p32) mRNA < 1 1510 p32 mRNA signal 1491 1496 poly-A signal BASE COUNT 352 a 420 c 381 g 357 t ORIGIN 1 ccgcgcagag ccgtctcgag catagcccgg agcctgaatc gagcagaacc agcctgaact 61 agcccagtcc ggtgatggag cgtccacagc ccgacagcat gccccaggat ttgtctgagg 121 ccttgaagga ggccaccaag gaggtacaca tccaagccga gaatgctgag ttcatgaaga 181 actttcagaa gggtcaggtg tccagagaag gctttaagct ggtgatggct tccttgtacc 241 atatctacac ggccctggaa gaggagatag agcgcaacaa gcagaaccca gtctatgccc 301 cactctactt ccctgaggag ctgcaccgaa gggctgccct ggagcaggac atggccttct 361 ggtatgggcc tcactggcag gaaatcatcc cttgcacgcc agccacacag cactatgtaa 421 agcgtctcca cgaggtgggg cgcactcacc ctgagctgct ggtggcccac gcatataccc 481 gctacctggg tgacctctca gggggtcagg tcctgaagaa gattgcacag aaggccatgg 541 ccttgcccag ctctggggag ggcctggctt tttttacctt cccgaacatc gacagcccca 601 ccaagttcaa acagctctat cgtgctcgaa tgaacactct ggagatgaca cctgaggtca 661 agcacagggt gacagaagag gctaagaccg ccttcctgct caacattgag ctgtttgagg 721 agctgcaggt gatgctgaca gaggaacaca aagaccagag tccctcacag atggcgtcac 781 ttcgtcagag gcctgctagc ctggtgcaag atactgcccc tgcagagaca ccccgaggga 841 aaccccagat cagcactagc tcatcccaga caccgctcct ccagtgggtc ctcactctca 901 gcttcctgtt ggcaacagtg gcagtgggaa tttatgccat gtaaatgcaa tactggcccc 961 caggggctgt gaactctgtc caatgtggcc ttctctctgt aagggagaat cttgcctggc 1021 tctcttctct tgggcctcta agaaagcttt tggggtccct agcccactcc ctgtgtttcc 1081 tttctctctg gaatggaggg agatacctga cacagttccc tcaccaaaag cacatccagc 1141 cagtggcctg aactttgaaa ccagcagccc caaatcctgc agcagagccc caaaactggc 1201 ctgtaaaagc agctgttctg agcccagtgc ccatggttgt aagcatccat gttgactgac 1261 cacgactgct gtcccccagt gccatggcca ctttgatatc cgtttccaga catttctgtc 1321 tcgtatttct gtcttgtttt ttattatttc cccagttcta ccagagtaat ggtattttgt 1381 tgttttgttt tgtcttgttt ttcctaacaa agtggggcta tcttttgagg ggtgggtggg 1441 aaagaattat ttaatagttg taaccttggt ctctaacttc tgtgtgaaat aataaatggc 1501 attatctaac // LOCUS PASLKTCABD 7742 bp ds-DNA BCT 12-JUL-1990 DEFINITION P.haemolytica leukotoxin gene cluster, complete cds. ACCESSION M24197 M34943 M34944 KEYWORDS LktA membrane protein; cytolysin; hemolysin; leukotoxin. SOURCE P.haemolytica (strain PHL101) DNA, clones lambda-sh132 and pSH224. ORGANISM Pasteurella haemolytica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 7742) AUTHORS Highlander,S.K., Chidambaram,M., Engler,M.J. and Weinstock,G.M. TITLE DNA sequence of the Pasteurella haemolytica leukotoxin gene cluster JOURNAL DNA 8, 15-28 (1989) STANDARD full staff_review REFERENCE 2 (bases 46 to 276 and 3576 to 3813) AUTHORS Highlander,S.K., Engler,M.J. and Weinstock,G.M. TITLE secretion and expression of the Pasteurella haemolytica leukotoxin JOURNAL J. Bacteriol. 172, 2343-2350 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.K.Highlander, 24-APR-1989. FEATURES from to/span description pept 216 719 leukotoxin (LktC) pept 735 3596 leukotoxin membrane protein (LktA) pept 3670 5796 leukotoxin (LktB) pept 5808 7244 leukotoxin (LktD) mRNA 186 > 3596 lktCA mRNA signal 173 178 -10 region promoter binding 205 209 lktC ribosomal binding site (put.) binding 722 726 lktA ribosomal binding site (put.) binding 3660 3665 lktB ribosomal binding site (put.) binding 5797 5802 lktD ribosomal binding site (put.) BASE COUNT 2573 a 1357 c 1568 g 2244 t ORIGIN 1 taatattaca atgtaattat tttgtttaat ttccctacat tttgtataac tttaaaacac 61 tcctttttct cttctgatta tataaaagac aaaaaataca atttaagcta caaaaaacaa 121 caaaaaacaa caaaaaacac gacaataaga tcgagtaatg attatattat gttataattt 181 ttgacctaat ttagaataat tatcgagtgc aaattatgaa tcaatcttat tttaacttac 241 taggaaacat tacttggcta tggatgaact cctccctcca caaagaatgg agctgtgaac 301 tactagcacg caatgtgatt cctgcaattg aaaatgaaca atatatgcta cttatagata 361 acggtattcc gatcgcttat tgtagttggg cagatttaaa ccttgagact gaggtgaaat 421 atattaagga tattaattcg ttaacaccag aagaatggca gtctggtgac agacgctgga 481 ttattgattg ggtagcacca ttcggacatt ctcaattact ttataaaaaa atgtgtcaga 541 aataccctga tatgatcgtc agatctatac gcttttatcc aaagcagaaa gaattaggca 601 aaattgccta ctttaaagga ggtaaattag ataaaaaaac agcaaaaaaa cgttttgata 661 catatcaaga agagctggca acacgactta aaaatgaatt taattttatt aaaaaataga 721 aggagacatc ccttatggga actagactta caaccctatc aaatgggcta aaaaacactt 781 taacggcaac caaaagtggc ttacataaag ccggtcaatc attaacccaa gccggcagtt 841 ctttaaaaac tggggcaaaa aaaattatcc tctatattcc ccaaaattac caatatgata 901 ctgaacaagg taatggttta caggatttag tcaaagcggc cgaagagttg gggattgagg 961 tacaaagaga agaacgcaat aatattgcaa cagctcaaac cagtttaggc acgattcaaa 1021 ccgctattgg cttaactgag cgtggcattg tgttatccgc tccacaaatt gataaattgc 1081 tacagaaaac taaagcaggc caagcattag gttctgccga aagcattgta caaaatgcaa 1141 ataaagccaa aactgtatta tctggcattc aatctatttt aggctcagta ttggctggaa 1201 tggatttaga tgaggcctta cagaataaca gcaaccaaca tgctcttgct aaagctggct 1261 tggagctaac aaattcatta attgaaaata ttgctaattc agtaaaaaca cttgacgaat 1321 ttggtgagca aattagtcaa tttggttcaa aactacaaaa tatcaaaggc ttagggactt 1381 taggagacaa actcaaaaat atcggtggac ttgataaagc tggccttggt ttagatgtta 1441 tctcagggct attatcgggc gcaacagctg cacttgtact tgcagataaa aatgcttcaa 1501 cagctaaaaa agtgggtgcg ggttttgaat tggcaaacca agttgttggt aatattacca 1561 aagccgtttc ttcttacatt ttagcccaac gtgttgcagc aggtttatct tcaactgggc 1621 ctgtggctgc tttaattgct tctactgttt ctcttgcgat tagcccatta gcatttgccg 1681 gtattgccga taaatttaat catgcaaaaa gtttagagag ttatgccgaa cgctttaaaa 1741 aattaggcta tgacggagat aatttattag cagaatatca gcggggaaca gggactattg 1801 atgcatcggt tactgcaatt aataccgcat tggccgctat tgctggtggt gtgtctgctg 1861 ctgcagccgg ctcggttatt gcttcaccga ttgccttatt agtatctggg attaccggtg 1921 taatttctac gattctgcaa tattctaaac aagcaatgtt gagcacgttg caaataaaaa 1981 ttcataacaa aattgtagaa tgggaaaaaa ataatcacgg taagaactac tttgaaaatg 2041 gttacgatgc ccgttatctt gcgaatttac aagataatat gaaattctta ctgaacttaa 2101 acaaagagtt acaggcagaa cgtgtcatcg ctattactca gcagcaatgg gataacaaca 2161 ttggtgattt agctggtatt agccgtttag gtgaaaaagt ccttagtggt aaagcctatg 2221 tggatgcgtt tgaagaaggc aaacacatta aagccgataa attagtacag ttggattcgg 2281 caaacggtat tattgatgtg agtaattcgg gtaaagcgaa aactcagcat atcttattca 2341 gaacgccatt attgacgccg ggaacagagc atcgtgaacg cgtacaaaca ggtaaatatg 2401 aatatattac caagctcaat attaaccgtg tagatagctg gaaaattaca gatggtgcag 2461 caagttctac ctttgattta actaacgttg ttcagcgtat tggtattgaa ttagacaatg 2521 ctggaaatgt aactaaaacc aaagaaacaa aaattattgc caaacttggt gaaggtgatg 2581 acaacgtatt tgttggttct ggtacgacgg aaattgatgg cggtgaaggt tacgaccgag 2641 ttcactatag ccgtggaaac tatggtgctt taactattga tgcaaccaaa gagaccgagc 2701 aaggtagtta taccgtaaat cgtttcgtag aaaccggtaa agcactacac gaagtgactt 2761 caacccatac cgcattagtg ggcaaccgtg aagaaaaaat agaatatcgt catagcaata 2821 accagcacca tgccggttat tacaccaaag ataccttgaa agctgttgaa gaaattatcg 2881 gtacatcaca taacgatatc tttaaaggta gtaagttcaa tgatgccttt aacggtggtg 2941 atggtgtcga tactatttac ggtaacgacg gcaatgaccg cttatttggt ggtaaaggcg 3001 atgatattct cgatggtgga aatggtgatg attttatcga tggcggtaaa ggcaacgacc 3061 tattacacgg tggcaagggc gatgatattt tcgttcaccg taaaggcgat ggtaatgata 3121 ttattaccga ttctgacggc aatgataaat tatcattctc tgattcgaac ttaaaagatt 3181 taacatttga aaaagttaaa cataatcttg tcatcacgaa tagcaaaaaa gagaaagtga 3241 ccattcaaaa ctggttccga gaggctgatt ttgctaaaga agtgcctaat tataaagcaa 3301 ctaaagatga gaaaatcgaa gaaatcatcg gtcaaaatgg cgagcggatc acctcaaagc 3361 aagttgatga tcttatcgca aaaggtaacg gcaaaattac ccaagatgag ctatcaaaag 3421 ttgttgataa ctatgaattg ctcaaacata gcaaaaatgt gacaaacagc ttagataagt 3481 taatctcatc tgtaagtgca tttacctcgt ctaatgattc gagaaatgta ttagtggctc 3541 caacttcaat gttggatcaa agtttatctt ctcttcaatt tgctagagca gcttaatttt 3601 taatgattgg caactctata ttgtttcaca cattatagat tgccgtttta ttttataaaa 3661 ggagacaata tggaagctaa ccatcaaagg aatgatcttg gtttagttgc cctcactatg 3721 ttggcacaat accataatat ttcgcttaat ccggaagaaa taaaacataa atttgatctt 3781 gacggaaaag ggctttcttt aactgcttgg cttttagctg caaaatcgtt agcgttgaaa 3841 gcgaaacaca ttaaaaaaga gatttcccgc ttacacttgg tgaatttacc ggcattagtt 3901 tggcaagata acggtaaaca ttttttattg gtaaaagtgg ataccgataa taaccgctat 3961 ttaacttaca atttggaaca agatgctcca caaattctgt caacagacga atttgaagcc 4021 tgctatcaag ggcagttaat tttggtcacg tccagagctt ccgtagtagg tcaattagca 4081 aagttcgatt tcacctggtt tattccggcg gtgatcaaat accgaaaaat ctttctagaa 4141 accttgattg tttcgatctt tttgcaaatt tttgccctaa ttacaccgct attcttccaa 4201 gttgttatgg ataaagtact ggtgcatcga ggtttttcaa ccttgaatat cattacggtt 4261 gccttagcta ttgtgatcat ctttgaaatt gtactaagtg gtttgagaac ctatgttttt 4321 tctcatagca ctagccgtat tgatgttgaa ttaggcgcta aattatttcg acatttatta 4381 tcactaccca tttcttattt tgaaaacaga cgagttggag atacagtcgc tagggttaga 4441 gaattagatc aaattcgtaa tttccttacc ggacaagcat taacctcggt gttagatctc 4501 ttattctctt ttatcttttt tgccgtaatg tggtattaca gcccaaaatt aaccttggta 4561 attcttggtt cattgccctg ctatatttta tggtcaattt ttattagtcc gattttaaga 4621 cggcgtttag atgagaaatt tgcccgaagt gctgataacc aagcattctt agttgagtcg 4681 gtaacagcca tcaatatgat taaagcgatg gcggttgctc cacaaatgac ggatacatgg 4741 gataaacagc tggcaagcta tgttttcatc agtttccgtg tcaccgtatt agcaaccatt 4801 gggcaacaag gtgtacaact tattcaaaaa accgttatgg tgattaacct ttggttaggg 4861 gcacacttag ttatttcagg cgatctgagt attgggcaat taattgcctt taatatgcta 4921 tcagggcaag tgattgcacc ggtgattcgg ctggctcagc tctggcaaga tttccaacaa 4981 gttgggattt ccgtcactcg cttaggtgat gttttaaact ctccaaccga acaatatcaa 5041 ggcaaattat cactaccaga aataaaaggc gatatctcat ttaaaaatat ccgctttaga 5101 tataaaccag atgcaccaac tattttaaat aatgtgaatt tagaaattag gcaaggagaa 5161 gtgattggga ttgttggacg ttccggttca ggcaaaagta ctctgactaa attactgcaa 5221 cgtttttata ttcctgaaaa tgggcaggtt ttgattgatg gacatgatct agccttagct 5281 gatccaaact ggctacgccg tcaaataggt gtagtgctgc aagataatgt gttattaaac 5341 cgcagtatcc gagaaaatat tgcgctatca gatccaggaa tgccaatgga gcgagtaatt 5401 tatgcagcaa aattagcagg ggctcacgat tttatttcag aattgcgtga aggttatacc 5461 accattgtgg gtgaacaagg agcggggctt tcaggcgggc aacgccaacg gattgcgatt 5521 gctcgagctt tggtaaacaa cccgaaaatc ctgatttttg atgaggcaac cagtgccctc 5581 gattacgaat ctgagcatat tattatgcaa aatatgcaaa aaatatgcca aggcagaacc 5641 gtgattttga ttgcacatcg tttatcgacc gtcaaaaatg cggatcgaat tattgtgatg 5701 gaaaaggggg aaattgttga gcaaggcaag caccacgaat tactgcaaaa cagtaacgga 5761 ctttattcct acttacacca attacaactt aattaagaag gaaaacaatg aaaatatggc 5821 ttagtggtat ttatgaattt ttcctacgct ataaaaacat ttgggcagaa gtatggaaaa 5881 ttcgtaaaga attagaccac ccaaacagaa aaaaagacga aagtgaattt ttaccggcac 5941 atttagaact gattgaaacc ccggtttcta aaaaaccacg tctaattgct tatttgatta 6001 tgctattttt agttgtggca attgtgcttg ccagtgtaag caaagttgaa attgtggcga 6061 ctgctcccgg taaattaact tttagtggca gaagtaaaga aattaaaccg attgaaaacg 6121 ccattgtaca agaaattttc gttaaagatg ggcagtttgt ggaaaaaggg caattattag 6181 tcagcttaac tgcattgggt tctgatgcag atatcaaaaa gaccatggct tcactttctt 6241 tagctaaact ggagacctat cgctaccaaa ctttgcttac tgccattgaa aaagagtcct 6301 tgccggtgat tgatttatct agaaccgaat ttaaagattc atcggaagaa gatcgactac 6361 gtattaaaca cttaattgag gagcaataca ccacttggca aaaacaaaaa acacagaaaa 6421 ctttagcgta taagcgtaaa gaggctgaaa aacaaacaat atttgcctat gtccgtaaat 6481 atgaaggtgc aacacgtatt gaacaagaaa aattaaaaga ctttaaggca ctttataaac 6541 agaagtcttt atctaagcac gaacttcttg cgcaagaaaa taaattaatt gaggctcaga 6601 atgcagtagc tgtttatcgc tcaaaattaa atgaattaga aaatgatcta ctcaatgtaa 6661 aagaagaact tgaattgatc acgcaattct ttaaaagcga tgtgttggaa aaattaaagc 6721 aacatattga aaatgaacgc caacttcggc tcgagttaga aaaaaataat caacgcagac 6781 aggcctcgat gatcagagca ccggtttccg gtacggttca gcaactgaaa attcacacta 6841 taggtggtgt tgttacgact gctgaaacct tgatgatcat tgtgccggaa gacgatgtgt 6901 tagaggccac cgctctggtt ccaaacaaag atatcggctt tgttgcagca gggcaggagg 6961 tgattattaa agtggaaact ttcccttata cacgctatgg ttatctaact ggtcgaatta 7021 aacatattag cccggatgcg attgaacaac ctaatgtagg cttagttttt aatgcaacta 7081 tagctataga taggaagaat ctaacatcgc ctgatgggcg aaaaattgat ttgagttcag 7141 gtatgacaat aactgctgaa atcaaaaccg gtgaacggag tgtaatgagt tatttactca 7201 gcccattaga agaatctgtc acagaaagtt taagggaacg ctaatcgaac caaaacaaag 7261 ccataaaagc cattttgagc ttttatggct ttatttttta gtccacaagc ggacaaaaaa 7321 gcccaatttt ttacactttt ataacaaatt gttctaacta aaaattacta attcttttct 7381 tttatagcga tctctatttc atttcattaa cattgactag aagggattat gagcctaagc 7441 attacgaatc tttctcttgg ctaccgcaaa aatcagcaaa ggcttatttg aaaagcacgg 7501 tgtcgaggtg gaaaaaccgg tgatgtttcg cagctgggct cagttggtgg aagcttttta 7561 agtggcaatg tgaacgtggt gcatctgctt tcgcctatga gtttgtgggc gaaatatgga 7621 gcaaatgctc cggtgaaagc ggtaatgtgg aatcacttgg caggttcggc tttaacggtt 7681 cgccctgaaa tcaacagtat tgccgaactc tccggcaaaa cggtagaact tccgttttgg 7741 ta // LOCUS RATBADPTA 3477 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat beta adaptin mRNA, complete cds. ACCESSION M34176 J05273 KEYWORDS beta adaptin. SOURCE Rat lymphocyte, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3477) AUTHORS Ponnambalam,S., Robinson,M.S., Jackson,A.P., Peiperl,L. and Parham,P. TITLE Conservation and diversity in families of coated vesicle adaptins JOURNAL J. Biol. Chem. 265, 4814-4820 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 72 2885 beta adaptin mRNA < 1 3477 beta adaptin mRNA BASE COUNT 914 a 851 c 822 g 890 t ORIGIN 1 cggggctgtg ctctctgact gccgccgcca ccccgcccct tgcctccggt tcacgctgaa 61 gatccagaat catgactgac tccaagtact tcacaaccaa taagaaggga gaaatctttg 121 aattaaaagc tgaactcaac aatgaaaaga aagaaaagag gaaggaggct gtgaagaaag 181 tgattgctgc tatgactgtg gggaaagacg ttagctctct cttcccagat gtggtgaact 241 gtatgcagac tgacaacctg gaactaaaga agcttgtgta cctctatctg atgaactatg 301 ccaagagtca gccagacatg gccatcatgg ctgtcaacag ctttgtgaag gattgtgaag 361 accccaatcc tttgattcga gccttggcag ttagaaccat gggatgcatc cgggtggaca 421 agattacaga gtatctctgt gaacccctcc gcaagtgctt gaaggatgaa gacccctatg 481 ttcggaaaac agcagcagta tgcgtggcaa aactccatga tatcaatgcc cagatggtgg 541 aagatcaggg atttctggat tctctgcggg atctcatagc agattcaaac ccaatggtgg 601 tggctaatgc tgtagcagca ttgtctgaga tcagtgagtc tcacccaaac agcaacttac 661 ttgatctgaa ccctcagaat atcaataagc tgctcacagc cctgaatgag tgcactgagt 721 ggggccagat tttcatcttg gactgcctgt ctaattacaa ccctaaagat gaccgggaag 781 ctcagagcat ctgtgagcga gtgacgcctc ggctctctca tgccaattct gcagtggtgc 841 tttcagcagt aaaagttctg atgaagtttc tagagttgtt acccaaggac tctgactact 901 acaatatgct gctaaagaag ctagcgcctc cacttgtcac tttgctctct ggggagccag 961 aagtgcagta tgttgccctg aggaacatca acctaattgt ccagaaaagg cctgaaatct 1021 tgaagcagga aatcaaggtc ttctttgtga agtacaatga tcctatctat gttaaactag 1081 agaagttaga catcatgatt cgtcttgcat cccaagccaa cattgctcag gttctggcag 1141 aactgaagga atatgccact gaagttgatg tggactttgt tcgcaaagct gtgagggcca 1201 ttggacggtg tgccatcaaa gtggagcaat cagcagaacg ctgtgtgagc acactgcttg 1261 atctaatcca gaccaaagta aattatgtgg tccaagaggc aattgttgtc atcagggaca 1321 tcttccgaaa ataccccaac aagtatgaga gcattatcgc cacgctgtgt gagaacttgg 1381 actccctgga tgaacccgat gcccgagcgg ctatgatttg gattgtagga gagtatgctg 1441 aaagaatcga taatgccgat gagttactag agagcttcct ggaaggtttt catgatgaaa 1501 gcacccaggt gcagctcacg ttgcttaccg ccatagtgaa actgtttctc aagaagccat 1561 cagaaacaca ggagctggtc caacaggtct tgagcttggc cacacaggat tctgataatc 1621 ctgaccttcg agatcggggt tatatttatt ggcgccttct ttcaactgac cctgtgacag 1681 ccaaagaagt agtgttgtct gagaagccat tgatctctga ggaaacagac ctcattgaac 1741 ctaccctcct ggatgagctc atctgccaca ttggttcttt ggcctccgtg taccataaac 1801 ctccgaatgc ttttgtggaa gggagccatg gcattcatcg caaacacttg ccaattcacc 1861 atgggagcac tgatgcaggt gatagccctg ttggcaccac cactgcaacc aacctggaac 1921 agcctcaggt catcccctct caaggtgacc ttctggggga tcttttaaat cttgacctgg 1981 gtcccccagt gaatgtcccg caagtgtcct ccatgcagat gggagcagtg gatcttttag 2041 gaggaggact ggatagcctg gtaggacagt ccttcatccc gtcatcagtg cctgcaacct 2101 tcgctccttc acctactcct gctgtggtca gcagtggtct gaatgacctg tttgagcttt 2161 ccactgggat aggcatggca cctggcggat atgtggctcc taaggcagtc tggctacctg 2221 ctgtaaaggc taaaggcttg gagatttcgg ggacgtttac tcaccgccaa gggcacatct 2281 atatggaaat gaacttcacc aacaaagctc tgcagcacat gacggatttt gccatccagt 2341 ttaacaagaa tagcttcggt gtcatcccga gcactccctt ggccatacat actccgctga 2401 tgccaaacca gagcattgat gtgtctctgc ctctcaacac cttgggccca gtcatgaaga 2461 tggagcctct gaataacttg caggtggctg ttaaaaacaa tattgatgtc ttctacttca 2521 gctgcctcat cccactcaat gtgctttttg tagaagatgg caaaatggaa cgccaggtct 2581 tccttgcgac gtggaaggat attcccaatg aaaatgagct ccaatttcag attaaggagt 2641 gtcatttaaa cgctgacaca gtttccagca agttgcaaaa caacaatgtt tacactatcg 2701 ccaagaggaa tgtggagggg caggacatgc tgtaccagtc cctgaagctc actaatggca 2761 tttggatttt ggcagagctg cggatccagc caggaaaccc caattatacg ctgtcgctga 2821 agtgtagagc ccctgaagtc tctcagtaca tctatcaggt ctacgacagc attttgaaaa 2881 actaataaat gggtccagtc agcctgtaat cagtgcaagc cacgaactct taactgaaag 2941 acactgtatt gttgtgtaga gcctgaaccc aaaccctgcg gtacccaccc cggtagtggc 3001 cagtcatttt gtgctgatat tagcactcac cccattggta ggttagcttc ccgtgacatc 3061 tccttccact atcgcccacc tctgccacct gccgctgctc tctgtcctta gttgtgagtt 3121 cctctgtgct gtgccaatgg ctagcctttt ctacaccctc ttttgagtgt agtttgatat 3181 tttgtaatcg aaagctcatt tcacaagcag aaaaaggcaa caagttaatt agagcgagga 3241 agagtgtcac tgaaacatac actgcacctt attgttttat atttttgtac agatgagata 3301 gatattgagg tagaacgctg agtagaaagg gtgactgacc ctcctcagac acagtcttat 3361 tggagacata tggccctggc cccttctggg caaggagagg cgaccccact cctggtcttt 3421 tgcattttca ccttggccac gccttccagc tctcttatgc ccatgctctc tcatttg // LOCUS RATPSPB 1620 bp ss-mRNA ROD 12-JUL-1990 DEFINITION Rat pulmonary surfactant-associated glycoprotein A (SP-A) mRNA, complete cds. ACCESSION M33201 KEYWORDS pulmonary surfactant protein A. SOURCE Rat fetal lung, cDNA to mRNA, clone SP-A [0.9, 1.6]. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1620) AUTHORS Fisher,J.H., Emrie,P.A., Shannon,J., Sano,K., Hattler,B. and Mason,R.J. TITLE Rat pulmonary surfactant protein A is expressed as two differently sized mRNA species which arise from differential polyadenylation of one transcript JOURNAL Biochim. Biophys. Acta 950, 338-345 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 56 802 pulmonary surfactant protein A (56 could be 29) BASE COUNT 425 a 400 c 396 g 399 t ORIGIN 1 cagatatcca cacagcctgc aggtctgtat gtggaagcca ctggggatag tagccatgtc 61 actgtgttct ttggccttca ccctcttctt gactgttgtc gctggtatca agtgcaatgt 121 gacagacgtt tgtgctggaa gccctgggat ccctggagct cctggaaacc atggtctgcc 181 tggcagagac gggagagacg gtgtcaaagg agaccctgga cctccaggtc ccatgggccc 241 tcctggagga atgccaggtc ttcctggacg cgatgggctg cccggaggac ctggtgcacc 301 tggaggacgt ggagacaagg gagagcctgg agaaaggggc ctgccaggat ttccagctta 361 cctggatgag gagctccaga ctgaactcta tgagatcaaa catcagattc tgcaaacaat 421 gggagtcctc agcttgcaag gatccatgct gtcagtgggg gataaagtct tttccaccaa 481 tgggcagtca gtcaactttg ataccattaa agagatgtgt accagagcag gaggcaacat 541 tgctgtcccg aggactcctg aggagaacga ggccattgca agtattgcga agaagtacaa 601 caactatgtc tacttgggca tgattgaaga ccagactcct ggagacttcc actacctgga 661 tggggcttct gtgaactaca ccaactggta cccaggagaa cccaggggtc agggcaaaga 721 aaagtgtgta gaaatgtata cagatgggac atggaatgat aggggctgcc tgcagtaccg 781 gctggctgtt tgtgaatttt gatcaagcaa ttagacgaaa agatgaaccc tcacactgcc 841 tctatcctga tgattcatct ggtctgtaaa accctgcaac tacctttact tgtggccttc 901 agtaattaga agcatctttt gtcacccccg ctcccacata gttcccaaac acttctccat 961 attcattagc aatcctgagt gtttccctag agtcccatct gagcgttcat tcaaggtagc 1021 cattgtaaac cttggccttg accatgagat ggatagatac ttcctttttc ctcactttat 1081 ccagtcttca tttataaatg gtggccatga agacccagca tggaaggacc ctctaactaa 1141 gtgctgccct ctgacctttc cacccttctg tagctcggtg tcccaggatt tagaagtcca 1201 ggttaaacat aggggatttc tgggaaagcc tagtatgtgg gtgcaggcca cattcatgcc 1261 atctgtatcc atggctttca aggcaaacat tgtctctaag aagccagaga accaggagaa 1321 ccaggtagga ccaggtagta ctgggggaac ataaactcac ttggtttggc atgtatggct 1381 cctccttggg tctggaggtg ccatcttgac cttgaactaa cagcagccac cctgggtttt 1441 gagagaacga ccttcccagc ccagacccca actcaagtaa tttcctgcta acagacacag 1501 cctcagttca ctttacatca ctgaggcatt catgatacga actgcaatct gttttctcct 1561 ctcgtgagtt caatcagcta ttcattaaag tcaactgcat tcaaaaaaaa aaaaaaaaaa // LOCUS FSBCRYGM1 613 bp ss-mRNA VRT 12-JUL-1990 DEFINITION Carp gamma-crystallin (gamma-m1) mRNA, complete cds. ACCESSION X12902 M33115 KEYWORDS crystallin; gamma-crystallin. SOURCE Cyprinus carpio lens, cDNA to mRNA. ORGANISM Cyprinus carpio Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Cypriniformes; Cyprinoidei; Cyprinidae. REFERENCE 1 (bases 1 to 613) AUTHORS Chang,T., Jiang,Y.-J., Chiou,S.-H. and Chang,W.-C. TITLE Carp gamma-crystallins with high methionine content: Cloning and sequencing of the complementary DNA JOURNAL Biochim. Biophys. Acta 951, 226-229 (1988) STANDARD simple staff_review COMMENT [1] Author address Chang W.-G., Institute of Biological Chemistry, Academia Sinica, P.O. Box 23-106, Taipeh 10098, Taiwan R.O.C.. Submitted (09-SEP-1988) on tape to the EMBL data library. FEATURES from to/span description pept 34 570 gamma-crystallin (gamma-m1) BASE COUNT 166 a 133 c 167 g 147 t ORIGIN 1 ctgaagcact gagataaaca accctctacc atcatgggca agatcatctt ctacgaggac 61 aggaacttcc agggccgcag ctatgactgc atgagcgact gctctgatat ctcctcttac 121 ctcagccgcg ttggttcaat cagggtggag agtggttgtt tcatggtcta tgagcgcaac 181 agctacatgg ggaaccagtt cttcctgagg aggggcgagt accatgatat gcagcgcatg 241 atgagcatgg gcatgatgtt tgacactatc agatcctgcc gcatgattcc tccatacagg 301 ggttcctaca gaatgaggat ctacgagagg gacaccttcg gaggacagat gcacgaggtg 361 atggatgact gtgacaacat catggaacgt taccgtatgt ctgactggca gtcttgtcat 421 gtgatggacg gccactggct cttctatgag cagccacact acagaggcag aatgtggtac 481 ttcaggcctg gagagtacag gagcttcaga gatatgggat acagcaacat gagattcatg 541 agcatgaggc gtatcactga tatgtgttaa actgctagaa tatagaagga attaaagtgt 601 tattctcaga act // LOCUS FSBCRYGM2 554 bp ss-mRNA VRT 12-JUL-1990 DEFINITION Carp gamma-crystallin (gamma-m2) mRNA, complete cds. ACCESSION X12903 M33116 KEYWORDS crystallin; gamma-crystallin. SOURCE Cyprinus carpio lens, cDNA to mRNA. ORGANISM Cyprinus carpio Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Cypriniformes; Cyprinoidei; Cyprinidae. REFERENCE 1 (bases 1 to 554) AUTHORS Chang,T., Jiang,Y.-J., Chiou,S.-H. and Chang,W.-C. TITLE Carp gamma-crystallins with high methionine content: Cloning and sequencing of the complementary DNA JOURNAL Biochim. Biophys. Acta 951, 226-229 (1988) STANDARD simple staff_review COMMENT [1] Author address Chang W.-G., Institute of Biological Chemistry, Academia Sinica, P.O. Box 23-106, Taipeh 10098, Taiwan R.O.C.. Submitted (09-SEP-1988) on tape to the EMBL data library. FEATURES from to/span description pept 7 528 gamma-crystallin gamma-m2 (AA 1 - 173) BASE COUNT 142 a 124 c 150 g 138 t ORIGIN 1 tggcccatga aggtcacctt ttatgaggac aggaacttcc agggtcgctc ttatgactgt 61 atgagcgact gtgccgattt ctcctcctac atgagccgct gtcactcttg cagagtgcac 121 agcggatgct ggatgatgta cgatcaaccc aactacatgg gaaatcagta tttctttagg 181 aggggagagt atgctgatta catgtctatg tttggaatga gcaactgcat caggtcctgc 241 cgtatgatcc ctatgcacag gggatcctac agaatgagga tctacgagag ggagaacttc 301 atgggccaga tgtacgaaat ggccgatgac tgtgacagta tcatggaccg ttaccgcatg 361 cctcactgcc agtcctgcca tgtgatggac ggccactggc tcatgtatga gcagccccac 421 tacagaggca ggatgtggta cttcaggcct ggagagtaca ggagcttcag caatatgggt 481 ggaatgagat tcatgagcat gaggcgtatc atggactcct ggtactagag tttatattaa 541 taaaataact cctc // LOCUS HUMIL2A1 940 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human interleukin 2 gene, exons 1 and 2. ACCESSION M33199 KEYWORDS interleukin; interleukin 2. SEGMENT 1 of 2 SOURCE Human DNA, clones Lm HIG[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 940) AUTHORS Nishino,N., Obaru,K., Maeda,S., Shimada,K. and Onoue,K. TITLE Organization of the DNA regions flanking the human interleukin 2 gene JOURNAL Biomed. Res. 6, 197-205 (1985) STANDARD simple staff_review FEATURES from to/span description pept 629 775 interleukin 2, exon 1 /nomgen="IL2" /map="4q26-q27" /hgml_locus_uid="LT0164X" 865 / 924 interleukin 2, exon 2 IVS 776 864 IL2 intron A IVS 925 > 940 IL2 intron B BASE COUNT 313 a 181 c 137 g 309 t ORIGIN 1 cttcaactca ataagcattt taagtattct aatcttagta tttctctagc tgacatgtaa 61 gaagcaatct atcttattgt atgcaattag ctcattgtgt ggataaaaag gtaaaaccat 121 tctgaaacag gaaaccaata cacttcctgt ttaatcaaca aatctaaaca tttattcttt 181 tcatctgttt actcttgctc ttgtccacca caatatgcta ttcacatgtt cagtgtagtt 241 ttatgacaaa gaaaattttc tgagttactt ttgtatcccc acccccttaa agaaaggagg 301 aaaaactgtt tcatacagaa ggcgttaatt gcatgaatta gagctatcac ctaagtgtgg 361 gctaatgtaa caaagaggga tttcacctac atccattcag tcagtctttg ggggtttaaa 421 gaaattccaa agagtcatca gaagaggaaa aatgaaggta atgttttttc agacaggtaa 481 agtctttgaa aatatgtgta atatgtaaaa cattttgaca cccccataat atttttccag 541 aattaacagt ataaattgca tctcttgttc aagagttccc tatcactctc tttaatcact 601 actcacagta acctcaactc ctgccacaat gtacaggatg caactcctgt cttgcattgc 661 actaagtctt gcacttgtca caaacagtgc acctacttca agttctacaa agaaaacaca 721 gctacaactg gagcatttac ttctggattt acagatgatt ttgaatggaa ttaatgtaag 781 tatatttcct ttcttactaa aattattaca tttagtaatc tagctggaga tcatttctta 841 taacaatgca ttatactttc ttagaattac aagaatccca aactcaccag gatgctcaca 901 tttaagtttt acatgcccaa gaaggtaagt acaatatttt // LOCUS HUMIL2A2 569 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human interleukin 2 (IL-2) gene, 3' flank. ACCESSION M33198 KEYWORDS Alu repetitive sequence; interleukin; interleukin 2. SEGMENT 2 of 2 SOURCE Human DNA, clones Lm HIG[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 569) AUTHORS Nishino,N., Obaru,K., Maeda,S., Shimada,K. and Onoue,K. TITLE Organization of the DNA regions flanking the human interleukin 2 gene JOURNAL Biomed. Res. 6, 197-205 (1985) STANDARD simple staff_review FEATURES from to/span description rpt 136 449 Alu-repeat /nomgen="IL2" /map="4q26-q27" /hgml_locus_uid="LT0164X" rpt 130 135 5' insertion target sequence rpt 450 455 3' insertion target sequence BASE COUNT 204 a 131 c 96 g 138 t ORIGIN Unknown number of bp after segment 1. 1 agcttcaata agatccaatg aatattctag attctatttg tcttctgaag acagcttaat 61 ctaatttaga taaaaataac atcatccaga gcctctacac tatttcagac acatgtagca 121 tcagcttaaa aattatgaaa cctactggct aacacgtgaa accttgtcac taccaaaaat 181 acaaaaaaaa aaaaattagc tgagtgtggt ggcgggcgcg tagtcccagc tactcaggag 241 gctgaggcag gagaatggcg tgaacttggt aggcagagct gcagtgagcc aagatcgtgc 301 cattgcactc cagcctgggt gacagagcaa gactccatct caaaaaaaaa aaaaaaaaaa 361 aagagacctg ctaacacaca cacacacaca cacacacaca ctctctctct ctctctctct 421 ctctctctct ctctctctct ctctctctca aattaagttg ggcggcaagg ggaaacaata 481 aacatctcca acataggatt caagtgtagt tataagatac agactttaac taatataata 541 tgttcaagaa aataaagcat catatctag // LOCUS TOMCPKA 103 bp ss-rRNA ORG 12-JUL-1990 DEFINITION Tomato chloroplast 4.5S ribosomal RNA. ACCESSION M33098 KEYWORDS 4.5S ribosomal RNA; ribosomal RNA. SOURCE Tomato (strain Mill) chloroplast ribosomal RNA. ORGANISM Chloroplast Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae; Lycopersicon esculentum. REFERENCE 1 (bases 1 to 103) AUTHORS Zhen-Qi,C., Xiao,X. and E,-Sheng.W. TITLE The nucleotide sequence of 4.5 S rRNA from tomato chloroplasts JOURNAL Biochim. Biophys. Acta 866, 89-91 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 103 4.5S ribosomal RNA BASE COUNT 29 a 19 c 32 g 23 t ORIGIN 1 gaaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagtg 61 atgtatgcag ctgaggcatc ctaacagatc ggtagacttg aac // LOCUS EBOMAY 157 bp ss-RNA VRL 12-JUL-1990 DEFINITION Ebola virus 3' proximal protein gene, 5' end. ACCESSION M33062 KEYWORDS . SOURCE Ebola virus (strain MAY; Zaire 1976) RNA. ORGANISM Ebola virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae. REFERENCE 1 (bases 1 to 157) AUTHORS Kiley,M.P., Wilusz,J., McCormick,J.B. and Keene,J.D. TITLE Conservation of the 3' terminal nucleotide sequences of Ebola and Marburg virus JOURNAL Virology 149, 251-254 (1986) STANDARD simple staff_review FEATURES from to/span description pept 53 > 157 3'proximal protein BASE COUNT 56 a 22 c 31 g 48 t ORIGIN 1 gggcacacaa aaagaaagaa gaatttttag gatcttttgt gtgcgaataa ctatgaggaa 61 gattaataat ttcctctcat tgaaatttga tgatcggaat ttgaaattga aattgttgat 121 ctgtaatcac accgttgatt cagagccaca cacaagt // LOCUS ECOBISCASD 3337 bp ds-DNA BCT 12-JUL-1990 DEFINITION E.coli biotin sulfoxide reductase (bisC) gene, complete cds. ACCESSION M34827 KEYWORDS biotin sulfoxide reductase; bisC gene. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3337) AUTHORS Pierson,D.E. and Campbell,A. TITLE Cloning and nucleotide sequence of bisC, the structural gene for biotin sulfoxide reductase in Escherichia coli JOURNAL J. Bacteriol. 172, 2194-2198 (1990) STANDARD simple staff_review FEATURES from to/span description pept 577 2757 biotin sulfoxide reductase (bisC) BASE COUNT 764 a 856 c 927 g 790 t ORIGIN 1 tatccccgct gcgggttacg ctaacaccag tgccgcgcat tttgtcgcgc agttcgcttc 61 ctgcacatcc atgtaataac caacgccgcc gcccagagct gcgcctgctg ctgcgccaat 121 cagcgcgcct ttaccgcgat ctttcttcga agaagagagc gcaccaatac ccgcgcccac 181 gagagagccc agacctgcgc cgatagcaga tttacctgct tcgcgttcgc cggtgtaagg 241 gttagttgtg cagccagata ccgccagagc gccactcact acggcggcaa taagataaac 301 acgtttcttc attgttaatc cttaataacc tttttattct ttgccacggg ttccgtggcg 361 ggagattatg ccgcgtgaac atgaagatta ttcctgggaa tactcggaaa tttgtaagta 421 atatttaact gctcaataca tctaaccttt caggagtctt cggtttggcc aactcatcct 481 cacgatattc cgttctgact gccgccattg ggggcccatg ctggttgaaa ccgacggcga 541 aaccgtgttt agctgcgtgg cgcgttagcc acaggaatgg aaaactcctt gcagagcgcg 601 gttcgcgacc aggttcacag caatacgcgg gtacgatttc caatggtgcg aaaaggcttt 661 cttgcgtcac cggaaaaccc gcaaggcatt cgtgggcagg atgaatttgt tcgcgtgagt 721 tgggatgagg cgctggatct tattcaccaa caacataaac gcattcgtga ggcttatggt 781 ccggcatcga tttttgctgg ttcctacggc tggcgttcaa acggcgtgct gcataaggcc 841 tcgacattat tacaacgcta tatggcgctg gcaggcggtt ataccgggca tctgggggat 901 tattcgaccg gcgcggcaca ggcgatcatg ccgtatgtcg tgggtggtag tgaagtttat 961 caacagcaga ccagttggcc gctggtgctg gaacatagcg atgtcgtggt gctgtggagt 1021 gctaacccac tcaatacgct gaaaattgcg tggaatgcat ccgatgagca ggggctttct 1081 tacttttctg cactgcgtga cagcgggaaa aagctgatct gcattgatcc aatgcgatcg 1141 gaaaccgtcg atttctttgg cgataaaatg gagtgggtgg caccgcacat gggcaccgat 1201 gttgcgctga tgctggggat cgcccatacg ctggtggaaa atggttggca cgacgaagcg 1261 tttctggcgc gttgcaccac aggttatgcc gtcttcgcct cttatttgct gggcgagagt 1321 gacggaatag cgaaaaccgc cgaatgggca gcagagattt gtggtgttgg cgcagcgaaa 1381 atccgcgagc tggcggctat tttccaccaa aataccacca tgctgatggc aggctgggga 1441 atgcagcgcc aacagtttgg tgagcaaaaa cactggatga tcgtcacgct ggcagcaatg 1501 ttggggcaaa tcggcacacc cggcggcggt tttggtcttt cttaccattt tgccaatggt 1561 ggtaacccca cgcggcgttc tgcggtgctc tcttccatgc agggcagctt gccgggtggc 1621 tgcgatgcgg tggataaaat ccctgttgcc cgcattgttg aagcactgga aaaccctggt 1681 ggcgcatatc aacacaacgg tatgaaccga catttcccgg atattcgttt tatctggtgg 1741 gcgggcggtg ccaactttac tcatcatcag gataccaatc gcctgatccg tgcctggcaa 1801 aaaccggagc tggtggtgat ctctgaatgc ttctggacgg cggcggcaaa acacgcggat 1861 atcgttctgc ctgcgactac ctcttttgag cgtaatgatc tcaccatgac cggtgattac 1921 agtaatcagc atctggtgcc gatgaagcaa gtggtgccgc cacgctatga agcgcgtaat 1981 gattttgatg tttttgccga gttaagtgaa cgctgggaga agggcggtta tgcacgtttt 2041 acggaaggaa aaagtgagct gcaatggctg gaaacgtttt ataacgttgc ccgacagcgc 2101 ggggcaagcc agcaggttga attgccgcca tttgctgagt tctggcaagc caaccagtta 2161 attgagatgc cggaaaaccc ggacagcgag cggtttattc gcttcgctgc atttttgccg 2221 cgatccgctg gcgatccgtt aaaaacgcag cgcaagattg aaatcttctc acagcgtatt 2281 gccgattacg gttacccgga ttgccctggg catccaatgt ggctggagcc ggacgaatgg 2341 cagggcaatg ccgaaccaga acagttgcag gtactttctg cccatccggc gcaccgcctg 2401 cacagccagc tgaattacag ttctctgcgc gaattgtacg cggtggcaaa tcgtgagcct 2461 gtcaccattc atcctgacga tgcccaggag cgcggcatac aagatggcga tactgttcgg 2521 ttgtggaacg cacgcgggca aattcttgcc ggagcggtca ttagcgaggg aattaaacct 2581 ggcgtgattt gcattcacga aggggcatgg ccggatctgg atttaaccgc tgacggtatt 2641 tgtaaaaacg gcgcagtgaa cgtgctgacc aaagatctcc ccagctcgcg gctgggaatg 2701 gctgtgcggg taatacggcg ctggcatggc tggaaaaata caacggtccg gaactgacac 2761 ttacagcgtt tgaaccaccg gccagctcat aatccatgtg ggtagttggg tttcatcctg 2821 ccatgcgcaa tcgacaatgt gaaaaccctg tgcctggtaa aaatttatcg ccggttgatt 2881 tttttgataa acctccagca tcaggtgggg atggcgctgc tgcacatact gcatcagcgc 2941 cttaccaata ccgcgcctga cggccttcgg tgcgacaaac atcgctgcca gaaatcggcc 3001 ttccataatg ctgacaaaac cgagaagctt accgtcttct tcccagaccc agttttgcgc 3061 gttggcaaga taggcatccg caccagcgga atgcagtcac gccagtaatt cgcttttata 3121 aagggatgcc cccaggttgt actttccagc cacagttcga ggatcgcggg gagttctgaa 3181 cgttgcgctt cccgaatcat ggtttatttc ccggatagca acagcagcca accacatgat 3241 cattcaccag cccacatgcc tgcataaagg gagtaacaga ttgtggtgcc gacaaactta 3301 aaaccacgtt ttttcagtgc cttagatagg gcgttcg // LOCUS HUMARXA 1335 bp ss-mRNA PRI 12-JUL-1990 DEFINITION Human aldose reductase mRNA, complete cds. ACCESSION M34720 KEYWORDS aldehyde reductase; aldose reductase. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1335) AUTHORS Grundmann,U., Bohn,H., Obermeier,R. and Amann,E. TITLE Cloning and prokaryotic expression of a biologically active human placental aldose reductase JOURNAL DNA Cell Biol. 9, 149-157 (1990) STANDARD simple staff_review FEATURES from to/span description pept 14 964 aldose reductase (EC 1.1.1.21) mRNA < 1 1335 aldose reductase mRNA BASE COUNT 319 a 347 c 356 g 313 t ORIGIN 1 gagcgcagca gccatggcaa gccgtctcct gctcaacaac ggcgccaaga tgcccatcct 61 ggggttgggt acctggaagt cccctccagg gcaggtgact gaggccgtga aggtggccat 121 tgacgtcggg taccgccaca tcgactgtgc ccatgtgtac cagaatgaga atgaggtggg 181 ggtggccatt caggagaagc tcagggagca ggtggtgaag cgtgaggagc tcttcatcgt 241 cagcaagctg tggtgcacgt accatgagaa gggcctggtg aaaggagcct gccagaagac 301 actcagcgac ctgaagctgg actacctgga cctctacctt attcactggc cgactggctt 361 taagcctggg aaggaatttt tcccattgga tgagtcgggc aatgtggttc ccagtgacac 421 caacattctg gacacgtggg cggccatgga agagctggtg gatgaagggc tggtgaaagc 481 tattggcatc tccaacttca accatctcca ggtggagatg atcttaaaca aacctggctt 541 gaagtataag cctgcagtta accagattga gtgccaccca tatctcactc aggagaagtt 601 aatccagtac tgccagtcca aaggcatcgt ggtgaccgcc tacagccccc tcggctctcc 661 tgacaggccc tgggccaagc ccgaggaccc ttctctcctg gaggatccca ggatcaaggc 721 gatcgcagcc aagcacaata aaactacagc ccaggtcctg atccggttcc ccatgcagag 781 gaacttggtg gtgatcccca agtctgtgac accagaacgc attgctgaga actttaaggt 841 ctttgacttt gaactgagca gccaggatat gaccacctta ctcagctaca acaggaactg 901 gagggtctgt gccttgttga gctgtacctc ccacaaggat taccccttcc atgaagagtt 961 ttgaagctgt ggttgcctgc tcgtccccaa gtgacctata cctgtgtttc ttgcctcatt 1021 tttttccttg caaatgtagt atggcctgtg tcactcagca gtgggacagc aacctgtaga 1081 gtggccagcg agggcgtgtc tagcttgatg ttggatctca agagccctgt cagtagagta 1141 gaagtctctt ccagtttgct ttgcccttct ttctaccctg ctggggaaag tacaacctga 1201 ataccctttt ctgaccaaag agaagcaaaa tctaccaggt caaaatagtg ccactaacgg 1261 ttgagttttg actgcttgga actggaatcc tttcagcaag acttctcttt gcctcaaata 1321 aaaagtgctt ttgtg // LOCUS HUMARXB 652 bp ds-DNA PRI 12-JUL-1990 DEFINITION Human aldose reductase gene, partial cds. ACCESSION M34721 KEYWORDS aldehyde reductase; aldose reductase. SOURCE Human placenta DNA, clone lambda-gt11-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 652) AUTHORS Grundmann,U., Bohn,H., Obermeier,R. and Amann,E. TITLE Cloning and prokaryotic expression of a biologically active human placental aldose reductase JOURNAL DNA Cell Biol. 9, 149-157 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 27 aldose reductase, exon X (AA at 1) (EC 1.1.1.21) 626 > 652 aldose reductase, exon X+1 IVS 28 625 aldose reductase intron X BASE COUNT 142 a 164 c 184 g 162 t ORIGIN 1 gccaagcaca ataaaactac agcccaggta cagccacttc aggtgttgct gaccgtccac 61 aactgcctgc attcctgaca gtcctgttag ccaagaggag gaagtgactg agcctgttac 121 accctcacag gaagtatggt taggggtcct caagtacaga gtggaaaggg cacagatcgg 181 ggttttagaa gactctggca tgggctctta gattaatagt gcctgccccc actactgcaa 241 gggtgactgc cacgagggcc agcgcttgtt cattcatgtg gaacctcatc tgtacaaatg 301 taagagctct tagccgtgca gggaatgttc tttctcctga gtggtagtgt gcatttctag 361 ccagtggagg gcctcatgtg gtctcatgat atgcctgaga cactgaagcg tgtggcacag 421 tggctagcgc aggactctgg agtcagatct ggacctgaat gcgtcgccta cctgttgcta 481 gctgtgacct gacatcttgg agcccctctc tgatcacctg tggagttcta gcacgtcctt 541 ctgcaggttg tgtgtgtgag agactgagat gatgggtgcg agtgcctggc atgtatacac 601 actcactgtc tccttgggct cacaggtcct gatccggttc cccatgcaga gg // LOCUS MRV3TERM 59 bp ss-RNA VRL 12-JUL-1990 DEFINITION Marburg virus 3'terminal region of genome. ACCESSION M36065 KEYWORDS . SOURCE Marburg virus RNA. ORGANISM Marburg virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae. REFERENCE 1 (bases 1 to 59) AUTHORS Kiley,M.P., Wilusz,J., McCormick,J.B. and Keene,J.D. TITLE Conservation of the 3' terminal nucleotide sequences of Ebola and Marburg virus JOURNAL Virology 149, 251-254 (1986) STANDARD simple staff_review BASE COUNT 15 a 10 c 5 g 29 t ORIGIN 1 tctgtgtgtt ttgttctcta ctactaaaac acatagtata tttatttctt cttataatc // LOCUS RATQRED1 431 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat quinone reductase gene, exon 1. ACCESSION M33038 KEYWORDS quinone reductase. SEGMENT 1 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 431) AUTHORS Bayney,R.M. and Pickett,C.B. TITLE Rat liver NAD(P)H:quinone reductase: Isolation of a quinone reductase structural gene and prediction of the NH2 terminal sequence of the protein by double-stranded sequencing of exons 1 and 2 JOURNAL Arch. Biochem. Biophys. 260, 847-850 (1988) STANDARD simple staff_review FEATURES from to/span description pept 244 + 250 quinone reductase, exon 1 pre-msg 170 > 431 quinone reductase mRNA and introns IVS 251 > 431 quinone reductase intron A BASE COUNT 91 a 125 c 119 g 96 t ORIGIN 1 taacttggta tcctcccccc agcgcctctg ggctggcaat ccagccccgc cctcgctggc 61 tgccctgcac agtgggctgg gccggaaaag caagatataa agcctgaaag tgctcagtac 121 agctcgcact agcctaggct gtggcacgca ggatctttcc gaagcatttc agggtcgtcc 181 tggcaaccag ctgctcagcc aatcagcgct tgacactacg atccgccccc aacttctgga 241 gccatggcgg gtgagtatgg ctccaactcc agcctaattc atcctgagga ggatgtaggg 301 gcttgctatg gggtttgttc cttgcctcga agttgaaaag tgtagagatt aggatcctgg 361 atgagcctcg gtgagtcccc ggaaggagag cttcttctca gaaccatagg tgcagattat 421 tctgcagccc c // LOCUS RATQRED2 410 bp ds-DNA ROD 12-JUL-1990 DEFINITION Rat quinone reductase gene, exon 2. ACCESSION M33039 KEYWORDS quinone reductase. SEGMENT 2 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 410) AUTHORS Bayney,R.M. and Pickett,C.B. TITLE Rat liver NAD(P)H:quinone reductase: Isolation of a quinone reductase structural gene and prediction of the NH2 terminal sequence of the protein by double-stranded sequencing of exons 1 and 2 JOURNAL Arch. Biochem. Biophys. 260, 847-850 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 108 / 272 quinone reductase, exon 2 IVS < 1 107 quinone reductase intron A IVS 273 > 410 quinone reductase intron B BASE COUNT 103 a 111 c 102 g 94 t ORIGIN 1 agaaactaag gtggggaacg tgtctggtcc caagcacttt tagattaggg actcacccgt 61 cctgtttgga ttttctttcc tcacctcctc acgtacgcct taaacagtga gaagagccct 121 gattgtattg gcccacgcag agaggacatc attcaactat gccatgaagg aggctgctgt 181 ggaggctctg aagaagaaag gatgggaggt ggtcgaatct gacctctatg ctatgaactt 241 taaccccctc atttccagaa acgacatcac aggtaagaat cgtctccctc cactgacagt 301 ggaccacgtg acccagcctc agcccctctt gcctcccaac aggggagccg aaggactcgg 361 agaactttca gtaccctgtt gagtcatctc tggcgtataa ggaaggccgc // LOCUS RHAFIXA 1040 bp ds-DNA BCT 12-JUL-1990 DEFINITION A.caulinodans nitrogen fixation protein (nifO and fixA) genes, complete cds and 5'end. ACCESSION M35122 KEYWORDS nitrogen fixation protein. SOURCE A.caulinodans (strain ORS571) DNA. ORGANISM Azorhizobium caulinodans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 1040) AUTHORS Kaminski,P.A., Norel,F., Desnoues,N., Kush,A., Salzano,G. and Elmerich,C. TITLE Characterization of the fixABC region of Azorhizobium caulinodans ORS571 and identification of a new nitrogen fixation gene JOURNAL Mol. Gen. Genet. 214, 496-502 (1988) STANDARD simple staff_review FEATURES from to/span description pept 328 657 nitrogen fixation protein (nifO) pept 939 > 1040 nitrogen fixation protein (fixA) BASE COUNT 181 a 349 c 328 g 182 t ORIGIN 1 gagctcggcc tctatgacat cgacgccagc gcggtgaacg tcgcgcacgt gcccgtcatt 61 ccggacgaga acgaggtgag cggcgtcgat atcgtcgtcc gcctgcgtcg cacgggccgc 121 tgagggacgc tccgcctgtc gccttcgggg cacccgcatc cgcgtagcag cgcggccgcc 181 tcccgcggac tggccatcgc cagtactggc acgggcattg cttggacctc atccgtgccc 241 cgacatcggg gcaacgggtt cgcccgccaa agcgaccgga tgagttcacc tcatccgatt 301 acgcaccaga ctttcaggag acggagcatg gcgaccgccg gcggcatcct cgatcagctc 361 aacaaggcat ccagcgcgga agacttcttc gcgctgctcg aggtcgatta cgatccccaa 421 gtggtgaatg tggtgcgcct gcatatcctg cggcgcatgg gccagtatct ggtcagcgag 481 aatttcgaag gccaggcgga tgacgccatc cgcgcccggt gcaaagaggt gctggaacag 541 gcctatgcgg acttcctcgc ctcctcgccc ttgcaggagc gggtgttcaa ggtgctgaag 601 gaggccgccc agccgccgaa gcccaagccg atggtatcgc tcaccgttct caagtgacgt 661 tccccccctc ccgcgtcctt caaggcggcc tgcacccggc aggccgccct tcgcgtttca 721 gggcgcgggc gggtggtgag gggccacggg caagacgcgc ctgtcgcatt ccgacgcggg 781 tggcggacgt tcctgtcggc ggcggagccg gggcggaaag cgcattgtgg catgccagac 841 agccctttga tttcatgcgc gttttcgggc tgaaagacag ttggtacgac acttgctcat 901 tcctccccaa gagcccaacc gttccgggag cgaacgcaat gcacatcgtc gtctgcatca 961 agcaggttcc tgactccgcg cagatccgcg tgcaccccgt gacgaacacc atcatgcgtc 1021 agggtgtgcc cacgatcatc // LOCUS STMPPG 200 bp ds-DNA BCT 12-JUL-1990 DEFINITION S.griseus brown pigment production gene, 5' flank. ACCESSION M35117 KEYWORDS brown pigment production protein. SOURCE S.griseus (strain TK21) DNA, clone pARC1. ORGANISM Streptomyces griseus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 200) AUTHORS Horinouchi,S., Nishiyama,M., Nakamura,A. and Beppu,T. TITLE Construction and characterization of multicopy expression-vectors on Streptomyces spp JOURNAL Mol. Gen. Genet. 210, 468-475 (1987) STANDARD simple staff_review FEATURES from to/span description mRNA 112 > 200 brown pigment production protein mRNA BASE COUNT 22 a 79 c 71 g 28 t ORIGIN 1 gatcgtccat ggtggccatc ccaccatccg ccgcgccggg gcggcgagcg cgtttcgctg 61 ggcggacacg ctccccttgc cggtgctagc gcgaccgcgc tagcgtggtc gggtgcccaa 121 gatccgtatg acgcccctga ccgaccggcg ttcggccggt tcctgaagca cgcccccgac 181 cgcgcggccg gccgggccgg // LOCUS SYNGPCNA 111 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end. ACCESSION M35111 KEYWORDS nucleoprotein. SOURCE Synthetic DNA, clone pACRP1-LCM WE N. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 111) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 92 > 111 nucleoprotein BASE COUNT 37 a 17 c 21 g 36 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaaccg gatcctaggc atttgattgc 61 gcttttattt ggaaattcat tgtgtgacaa aatgtctttg tccaaagaag t // LOCUS SYNGPCNB 111 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus glycoprotein precursor gene, 5' end. ACCESSION M35112 SOURCE Synthetic DNA, clone pACRP1-LCM WE G. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 111) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 109 > 111 glycoprotein precursor BASE COUNT 33 a 20 c 25 g 33 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaaccg gatcctaggc tttttggatt 61 gcgctttcct ttaggacaac tgggtgctgg attctatcca gtaaaaggat g // LOCUS SYNGPCNC 131 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end. ACCESSION M35113 KEYWORDS nucleoprotein. SOURCE Synthetic DNA, clone pACRP5-LCM WE N. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 131) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 112 > 131 nucleoprotein BASE COUNT 41 a 19 c 24 g 47 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgccg 61 gatcctaggc atttgattgc gcttttattt ggaaattcat tgtgtgacaa aatgtctttg 121 tccaaagaag t // LOCUS SYNGPCND 131 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus glycoprotein precursor gene, 5' end. ACCESSION M35114 KEYWORDS glycoprotein precursor. SOURCE Synthetic DNA, clone pACRP5-LCM WE G. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 131) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 129 > 131 glycoprotein precursor BASE COUNT 37 a 22 c 28 g 44 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgccg 61 gatcctaggc tttttggatt gcgctttcct ttaggacaac tgggtgctgg attctatcca 121 gtaaaaggat g // LOCUS SYNGPCNE 155 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus nucleoprotein gene, 5' end. ACCESSION M35115 KEYWORDS nucleoprotein. SOURCE Synthetic DNA, clone pACRP6-LCM WE N. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 155) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 136 > 155 nucleoprotein BASE COUNT 53 a 22 c 26 g 54 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgtaa 61 cagttttgta ataaaaaaac cccggatcct aggcatttga ttgcgctttt atttggaaat 121 tcattgtgtg acaaaatgtc tttgtccaaa gaagt // LOCUS SYNGPCNF 155 bp ds-DNA SYN 12-JUL-1990 DEFINITION Lymphocytic choriomeningitis virus glycoprotein precursor gene, 5' end. ACCESSION M35116 KEYWORDS glycoprotein precursor. SOURCE Synthetic DNA, clone pACRP6-LCM WE G. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 155) AUTHORS Matsuura,Y., Possee,R.D. and Bishop,D.H.L. TITLE Expression of the S-coded genes of lymphocytic choriomeningitis arenavirus using a baculovirus vector JOURNAL J. Gen. Virol. 67, 1515-1529 (1986) STANDARD simple staff_review FEATURES from to/span description pept 153 > 155 glycoprotein precursor BASE COUNT 49 a 25 c 30 g 51 t ORIGIN 1 tggagataat taaaatgata accatctcgc aaataaataa gtattttact gttttcgtaa 61 cagttttgta ataaaaaaac cccggatcct aggctttttg gattgcgctt tcctttagga 121 caactgggtg ctggattcta tccagtaaaa ggatg // LOCUS XELD7 1051 bp ss-mRNA VRT 12-JUL-1990 DEFINITION X.laevis pot. developmental protein (D7) mRNA, complete cds. ACCESSION M35119 KEYWORDS developmental protein D7. SOURCE X.laevis, cDNA to mRNA, clones D7.1 and D7.0. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1051) AUTHORS Smith,R.C., Dworkin,M.B. and Dworkin-Rastl,E. TITLE Destruction of a translationally controlled mRNA in Xenopus oocytes delays progesterone-induced maturation JOURNAL Genes Dev. 2, 1296-1306 (1988) STANDARD simple staff_review FEATURES from to/span description pept 27 863 pot. developmental protein D7 BASE COUNT 318 a 240 c 222 g 271 t ORIGIN 1 gaaaaccggg acgtttgggc tgcaatatgg aatttgatga gctgatgcag tgcccatatg 61 acaaaaatca tatgattcgg cccagccggt ttccctacca ccttgttaaa tgcagagaga 121 ataatcgtgc agcagctaaa attctagcaa cttgcccata taatgcccgc cacagagtcc 181 ctaaacagga gcttgatctg cacatggcca gctgtgaata cagggtgacc atggagccca 241 tttctgctgc attttcacat cagaaggtgg agacctcaac atggcaaagc cctccttgtg 301 aagaggtctg ggaaactgac gaagatcccg tgtcaaggcc aaagcccttt attttaaatg 361 attttactcc ttctcagcct tttaatatgt cagaaggtga tggaaatatg ccgtatactg 421 gaataagcag caactacaga cctgaagtcc aacctatgaa ttcagtcatg caagtaaagc 481 aaaatcaacc tgaacctgag ccttttacct ccagtgagcg aaactatgat ccacgatcca 541 aggaaccacc caatccaaag caacctgcag tgaatggcta caaacctgca actacaaata 601 caaacccatg gtgcaggcaa acgggaggat cgaggggagc tgctcctcca aagttgggtg 661 ctaaatcctc agatgagggg ccaagaaata aggaatttcc cactccaaag gcgaacttga 721 tgaatgagta cgtacctgta gcagcaaatg caaatccatg gtgcaggcaa ccaggagggt 781 ccagtgctgc ttcagaacct ttgggtgttg actccttcga tgagtggcca tgccttggac 841 gccagccatg ggttagaaag taaatcttca ctttaaaaac aggactttca tctgaacctg 901 ttcctgactt gtccaactcc tggattttta aaatttgttg tgaagttgcc atttagtatt 961 tttgtacaaa attttaacag ccttcatttt tacatattaa gctttttatc acaaatataa 1021 tactaattta cttgaatgtt atttgttaac c // LOCUS YSCNUP1 4986 bp ds-DNA PLN 12-JUL-1990 DEFINITION S.cerevisiae nucleoporin (NUP1) gene, complete cds.. ACCESSION M33632 KEYWORDS nuclear pore complex protein; nucleoporin. SOURCE S.cerevisiae (strain S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 4986) AUTHORS Davis,L.I. and Fink,G.R. TITLE The NUP1 gene encodes an essential component of the yeast nuclear pore complex JOURNAL Cell 61, 965-978 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by L.I.Davis, 06-APR-1990, for release after publication. FEATURES from to/span description pept 1001 4231 nucleoporin (NUP1) (put.) BASE COUNT 1590 a 1036 c 991 g 1369 t ORIGIN Chromosome XV; 14 cm prox. to ADE2. 1 gaattcatca gtgaactctt catcattcaa aaacacccaa tcatagttga acttggagtt 61 aaatctatct tccacggatt taatagactc agccaacgaa tatagatctc tattacgcac 121 tagagtgaca aaagtggcct tttcccgagg accggtatac tttggtagtt ccactttcga 181 gtatttatag cccgagccgg aagatagcac ggtactgtac tttgggcttg gggcagagcc 241 tctcgcatac tgagctccat gaaagaacac atacacagta aacacggcga ccagaagaag 301 tcctaatttt ttgtaaacag gctgcttgct agctgggatc ataatcttcg ccattttggt 361 tattgactct atcccttaaa aactcttctg atggagtact ttacttcgat tgcttaacga 421 aatctttgtg aggaaaaaga tatctcttaa aattagaaag tacaatagtc tagcgtatta 481 tactaagaat ctgcaaaaaa gaagcaagaa ggcaccacct attatagacc tttgacacga 541 agtctctctg gagtgctttg gcctacgtgt gcggtactcg tttacatggg acaaccacgg 601 tttttttttt ggtgttactg gaggtataca gtgcgtatat ccacttgtac gacaagagat 661 ttacactaca ccgcgtaaag aaaacgccga caccaaatat aagtcacgtg tatgcaaagc 721 ctattttatg ccctaatttt caagccccgg tttttacgcc ctagttttta tatttagggt 781 ttgtcgttgc acgtgatcaa tggttcgtat tatgtgacat tgaaatgctt tttcatttta 841 atttttttct ttgacgaaat ttcgtaatgt caagaaacac ttaaagaaaa taagtgatga 901 ggaactcaat aaggacacta cgtagcggtg caaatacgat aggatattag cctcgaaagg 961 gttataggga cagagagtga gcgacaattt ttagtcattc atgtcttcaa acacttcttc 1021 tgtgatgtct tctccacgtg tcgaaaagag atcgttttct tccactttaa aatcattctt 1081 cacaaacccc aataaaaaac ggccatcgag caagaaagtt ttcagttcaa acctctcata 1141 cgcgaatcat ttggaggaat cagatgttga agacacactg catgttaata agagaaagag 1201 ggtgtccggt acatcacagc atagcgacag cttaactcag aacaacaata atgcgccaat 1261 tataatatat ggaaccgaaa acactgagag accgccgctt ttgccaattt tgcccattca 1321 aagactgagg ttattaaggg agaagcagag ggtgagaaat atgcgtgagc ttggattaat 1381 tcaatcaact gaatttccat ctattacatc gtcggttata ttgggctctc aaagtaaaag 1441 cgatgaggga ggatcgtacc tatgcacatc atctactcct tcccctatta aaaacggttc 1501 ttgcactagg cagttggccg gaaaaagcgg tgaagacacc aatgttggac tacccattct 1561 caaatcattg aaaaatagat ccaatagaaa aaggtttcat agtcagtcaa aggggaccgt 1621 gtggtcagca aattttgaat atgatttgtc agaatatgac gctatacaaa aaaaggataa 1681 caaggataag gaaggtaacg ctggcggtga tcagaagaca agcgagaata gaaataatat 1741 taagagtagt atttcaaatg gcaatctggc tacaggccct aacctgacaa gcgaaattga 1801 agacctacgt gcagacatca actctaatag gttatcgaat cctcaaaaaa atctactttt 1861 aaaaggacca gcttccacag ttgcaaaaac tgcccctatt caggagagct ttgttcccaa 1921 ttcagagcgc tctggtacgc ctacgttaaa gaaaaatatt gagcccaaaa aggacaaaga 1981 aagtattgtt ttgcccaccg taggttttga ctttatcaag gacaatgaga ctccatctaa 2041 gaaaacttct cctaaggcaa cttcttctgc aggtgcagtc tttaaatcga gtgtagaaat 2101 gggaaaaacc gataagtcaa cgaaaactgc cgaggcgcct accttatcat tcaattttag 2161 ccaaaaggct aataaaacta aggctgtcga caatactgtc ccttccacaa ctttattcaa 2221 ttttggtggt aaatcagata ccgttacttc tgccagtcaa ccttttaaat ttggaaagac 2281 atccgaaaaa agtgaaaatc atacagaatc agacgcgcct ccaaaatcaa ctgctccaat 2341 attttctttt ggtaaacaag aagagaatgg tgatgaaggt gatgatgaaa atgagcccaa 2401 aagaaaaagg cgtttacctg ttagcgagga tacaaacacc aagcctttat tcgatttcgg 2461 caagaccggt gatcaaaagg agaccaaaaa gggagagtca gaaaaggacg catcagggaa 2521 accaagcttt gtctttggtg caagtgataa gcaagctgaa ggtacaccat tatttacatt 2581 cggaaaaaaa gctgatgtaa caagcaatat tgactcctct gcacaattta cctttggtaa 2641 agccgccacc gcgaaagaaa cacacaccaa accatctgag acacctgcca caatagtcaa 2701 gaagcctact tttacttttg ggcagtcaac aagtgaaaat aagatctctg agggaagtgc 2761 gaaacctaca ttctctttct ctaagtcaga ggaggaacgt aagagtagtc caatttcaaa 2821 cgaagcagct aaaccctcgt tttcgtttcc gggcaagcct gttgatgttc aagcaccgac 2881 ggatgataag actctcaagc caactttttc ttttactgaa cctgctcaaa aagattcatc 2941 tgttgtttcg gaacctaaaa agccctcctt tacgtttgcg tcttcaaaaa cctcacaacc 3001 aaagccattg ttttcatttg gtaagtcaga tgcagctaaa gaaccaccag gctctaacac 3061 ctcattttct ttcactaaac ctcctgctaa tgagacagat aaaagaccta caccgccatc 3121 tttcaccttt ggcggttcca caacaaataa tacaacaacc actagcacaa aaccatcttt 3181 tagttttggg gctcccgagt cgatgaagtc gacagcaagt acagcggcag caaatacgga 3241 gaagctatca aatggctttt cctttacaaa gttcaatcac aataaagaaa agtcaaactc 3301 tccaacttct ttcttcgatg gttctgcttc ctcaacgccg attcctgtct tgggtaagcc 3361 aacagacgct actggtaata caacatctaa atctgcattt tcattcggta ctgctaacac 3421 caatggtacc aatgcctcag caaactccac atcattctcg tttaacgccc ctgctactgg 3481 taacggcaca actactactt ccaatacctc aggaaccaat atagcgggta catttaacgt 3541 aggaaaaccg gatcaaagta tcgctagtgg caatacgaat ggagcgggct cggcatttgg 3601 cttttcgagc tcaggaacag cagcaactgg tgcagcttct aatcaatctt catttaattt 3661 tggaaacaat ggtgcagggg gtctcaatcc ttttacatca gcaacttcgt caactaatgc 3721 taatgctggt ttattcaata aacctccttc cacgaatgca caaaatgtca atgttccctc 3781 tgcttttaat tttacgggaa ataattcaac gcctggtggc ggctctgtat ttaatatgaa 3841 cggcaacact aatgctaata cggtgtttgc cggctctaat aaccaaccac atcaatcgca 3901 aaccccatct ttcaatacaa acagctcatt cacgccatca acagttccta atattaattt 3961 tagcggattg aatggcggaa ttactaatac cgcgaccaat gcattaaggc caagtgatat 4021 atttggtgcg aatgctgcct ctggttccaa ttcaaacgta acaaatccat catccatttt 4081 tgggggggca ggtggtgtgc cgacaacttc ttttgggcag ccgcagtcag cccctaatca 4141 gatggggatg ggaacaaata atggcatgag catgggcggt ggtgttatgg cgaacagaaa 4201 gattgcaaga atgaggcact ctaaaaggta aatatcggtt atgtaattag gtattgtgtt 4261 gcttttctga aggatatata ttcctatatt ccttcaactt atacactgca tatgaaactt 4321 cttgagaagt tttaaataat tggtcttttt tttaatcggg tacaggcgag ataatggata 4381 cctgttctag ggtaggcaaa tttggaggca agagcccaaa attgttttgt tttgacctgg 4441 tttctctcag ttaacatatt atgaatagca tcttacaaat aatttacgag cgaaactttc 4501 gatctgcaaa gcgctccttt tttctttccg aaaaaataca gtaaaataag gtctagtaca 4561 cagcctgatc tacaattagt gttcttttct gtttacttcg atcagctgtt tagctttgcc 4621 acttttctag tagctccaca cgctgtgaaa gtaacgagac atagaagtca cggatggatt 4681 tgaaaaggga ttggttacga tggaaaatta caataggtag tgggcctggc agtattgtgc 4741 tggatttccc atcatttttg gtagggtgcg tattcacaac aatgatgggg ccaatattac 4801 agaagcttat aggtaaactt ttggtaggtt taataacggt atgcaaattc ttagttatta 4861 ttggaagcat tgtcttcgtt atcggagttg catcaaagaa atacacatat gacgatttca 4921 aagtaagtat aaaaagatca ggagagccag gggaatcaca tgatatgagg actgaaccaa 4981 aaagaa //
GenBank-Updates@genbank.bio.net (07/13/90)
LOCUS TMGMPCPA 1413 bp ss-RNA VRL 13-JUL-1990 DEFINITION Tobacco mild green mosaic virus movement and coat protein genes, complete cds. ACCESSION M34236 KEYWORDS coat protein; movement protein. SOURCE Tobacco mild green mosaic virus (strain PV 228), cDNA to viral RNA. ORGANISM Tobacco mild green mosaic virus Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Tobamovirus. REFERENCE 1 (bases 1 to 1413) AUTHORS Nejidat,A., Cellier,F., Holt,C.A., Gafny,R., Eggenberger,A.L. and Beachy,R.N. TITLE Examination of movement function complementation between two tobamoviruses in a recombinant virus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Nejidat, 11-MAY-1990. FEATURES from to/span description pept 51 851 movement protein pept 857 1336 coat protein BASE COUNT 456 a 214 c 331 g 412 t ORIGIN 1 tgttaacaaa tttttgtgtg ataaattttt atttagaact ttgtttttaa atggctgtta 61 gtctcagaga tactgtcaaa attagcgagt tcattaatct ttcgaaacag gatgagatac 121 ttccggcatt catgactaag gtcaagagtg ttagaatatc gactgtggac aagattatgg 181 ctgttaagaa tgatagtctt tctgatgtag atttacttaa aggtgttaag ttagttaaga 241 atggttatgt atgcttagct ggtttggtag tgtctgggga gtggaatctc ccggataact 301 gccgtggtgg tgtcagtgtt tgtattgtag acaagagaat gaaaagaagt aatgaggcaa 361 cgctgggtgc ttatcacgcc cctgcttgca aaaagaattt ttcctttaag ctaatcccta 421 attattcaat aacatccgag gatgctgaga agaacccgtg gcaagtgtta gtgaatatca 481 aaggagtggc tatggaagaa ggatactgtc ctttatcttt ggagttcgtt tcaatttgtg 541 tagtacataa aaataatgta aagaaaggtt tgagggaacg tattttgaga gtaacagacg 601 actcgccaat tgaactcact gaaaaagttg ttgaggagtt cgtggatgaa gtaccaatgg 661 ctgtgaaact cgaaaggttc cggaaaacaa aaaagggaaa aaaaagaaaa aaagaaaaaa 721 aaaagagagt ggtaggtaat agtgttaata ataagaaaat aaataatagt ggtaagaagg 781 gtttgaaagt tgaggaaatt gaggataatg taagtgatga cgagtctatc gcgtcatcga 841 gtacgtttta atcaacatgc cttatacaat caactctccg agccaatttg tttacttaag 901 ttccgcttat gcagatcctg tgcagctgat caatctgtgt acaaatgcat tgggtaacca 961 gtttcaaacg caacaagcta ggacaacagt ccaacagcaa tttgcggatg cctggaaacc 1021 tgtgcctagt atgacagtga gatttcctgc atcggatttc tatgtgtata gatataattc 1081 gacgcttgat ccgttgatca cggcgttatt aaatagtttt gatactagaa atagaataat 1141 agaggttgat aatcaacccg caccgaatac tactgaaatc gttaacgcga ctcagagggt 1201 agacgatgcg actgtagcta taagggcttc aatcaataat ttagctaatg aactggttcg 1261 tggaactggc atgttcaatc aagcaggctt tgagactgct agtggacttg tctggaccac 1321 aactccggct acttagctat tgttgtgaga tttcctaaaa taaagtcgct gatactttaa 1381 attcagggtg gctgatacca aaatcagcag tgg // LOCUS CLOFTHFSA 1965 bp ds-DNA BCT 13-JUL-1990 DEFINITION C.thermoaceticum formyltetrahydrofolate synthetase (FTHFS) gene, complete cds. ACCESSION J02911 KEYWORDS forymltetrahydrofolate synthetase. SOURCE C.thermoaceticum DNA. ORGANISM Clostridium thermoaceticum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1965) AUTHORS Lovell,C.R., Przybyla,A. and Ljungdahl,L.G. TITLE Primary structure of the thermostable formyltetrahydrofolate synthetase from Clostridium thermoaceticum JOURNAL Biochemistry 29, 5687-5694 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Lovell, 13-APR-1990, for release after publication. FEATURES from to/span description pept 56 1735 formyltetrahydrofolate synthetase (FTHFS) (ttg start codon) (EC 6.3.4.3) mRNA < 1 1760 FTHFS mRNA binding 41 44 ribosomal binding site (put.) signal 1761 1785 pot. transcription termination signal BASE COUNT 459 a 576 c 532 g 398 t ORIGIN 1 aatatgtgac gtgcaaccgg atgttcactt tttcgaatga gaggagtgga ttacattgtc 61 caaggtaccc agtgatattg agattgccca ggcagccaaa atgaaaccgg tcatggaact 121 ggcccgggga ctgggcatcc aagaggacga ggtcgagctt tatggtaagt acaaggccaa 181 gatctccctc gatgtctatc gtcgcctcaa agacaagcct gacgggaaac taatcctggt 241 aaccgccatt acccctactc cggccggcga agggaaaact actaccagtg tcggtctcac 301 cgatgccctg gctcgcctgg ggaaaagggt gatggtctgc ctgcgggagc cctccctggg 361 acccagcttt ggtatcaaag gcggtgccgc cggcggtggt tatgcccagg tagtacccat 421 ggaagatatc aacctgcact tcaccggcga tatccacgcc gtcacctatg cccacaacct 481 gctggcggcc atggtggata accacctgca gcagggtaac gtcctgaata ttgatccccg 541 taccatcacc tggcgccggg tcatcgacct taatgaccgg gctctgagga acatagtcat 601 cggcctgggt ggcaaagcca acggcgtacc gcgggagaca gggtttgaca tctccgttgc 661 ctcggaggtt atggcctgcc tgtgcctggc cagcgacctc atggatctca aggaacgttt 721 cagccgcatt gttgtcggct acacctatga cggcaaaccg gtcaccgccg gcgatctgga 781 ggcccagggt tccatggctc ttctcatgaa ggacgccatt aaacccaacc tggtccaaac 841 cctggagaat acgccggcct ttatccacgg tggtcccttc gccaatatcg cccacggttg 901 caacagcatt atcgcaacca agacggccct gaaactggcg gattatgtcg tgacggaagc 961 cggtttcggt gccgacctgg gtgccgagaa gttctatgac gttaaatgcc gttatgccgg 1021 ctttaaaccc gatgccacag tcatcgtggc taccgtccgc gccctcaaga tgcacggcgg 1081 cgtacccaaa tcagacctgg ccactgaaaa cctggaagcc ctgcgggaag gctttgccaa 1141 cctggagaaa cacatcgaaa atatcggcaa gttcggcgta ccggcagtcg tggccatcaa 1201 tgccttcccc accgataccg aggccgagct aaatctcctc tacgagttgt gcgccaaagc 1261 tggggccgaa gttgccctct cggaagtctg ggctaagggc ggcgaaggcg gtctggaact 1321 tgcccggaag gtgttgcaga ccctggagag caggccatcc aacttccatg tcctctacaa 1381 cctggacctg agtattaaag acaaaattgc caaaatcgcc accgagatct acggggccga 1441 cggcgtcaac tatacggccg aagccgacaa agctatccag cgttatgaat ccctgggcta 1501 cggcaacctg ccggtggtca tggccaagac ccaatactcc ttttccgatg acatgaccaa 1561 gctcgggcgg ccgcggaact ttaccatcac cgtgcgcgag gtgcgcctct cggccggagg 1621 caggcttatc gtccccatca ccggcgccat aatgaccatg cccgggctgc ccaaacgccc 1681 ggcggcctgc aacatcgaca tcgatgccga cggcgtcatt accggtcttt tctagtatgc 1741 ctggagtaaa gcctatccct accccactcc tccagggagt ggggtttttg ctgtctatat 1801 gagaatgata taactaagaa gctggttttc caagccgtac ctatcatagg ggctgcgcca 1861 ccgccatgaa ccacgaaaat gcaaaagata tcattcatgt tagaactctg ttaccgagag 1921 gaacgattat gtctggcatg acgagataga gtataccgta gcgat // LOCUS HUMGLIBSA 176 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A1. ACCESSION M34816 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 176) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 31 53 human zinc finger protein binding site binding 94 116 human zinc finger protein binding site binding 157 176 human zinc finger protein binding site BASE COUNT 68 a 50 c 36 g 22 t ORIGIN 1 aagaacacag gaaccaactg caagagctcc cgaagaccac ccacaatgat ggttgtatgt 61 cagaaggaca caggaaccaa ctgcaagagc tcccaaagac cacccacaat gatggttgta 121 tgtcagaagg acacaggaac caactgcaag agctcccaaa aaccacccac aatgat // LOCUS HUMGLIBSB 166 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A2. ACCESSION M34817 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 166) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 27 49 human zinc finger protein binding site binding 90 112 human zinc finger protein binding site BASE COUNT 60 a 47 c 37 g 22 t ORIGIN 1 acacaggaac caactgcaag agctcccgaa gaccacccac aatgatggtg gtatgttgaa 61 aggacacagg aaccaactgc aagagctccc aaagaccacc cacaatgatg gttgtgtgtc 121 aaaaggacac aggaaccaac tgtaagagct cctgaagacc acccac // LOCUS HUMGLIBSC 313 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A3. ACCESSION M34818 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 313) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 1 23 human zinc finger protein binding site binding 64 86 human zinc finger protein binding site binding 127 149 human zinc finger protein binding site binding 190 212 human zinc finger protein binding site binding 253 275 human zinc finger protein binding site BASE COUNT 110 a 79 c 75 g 49 t ORIGIN 1 caaagaccac ccacaatgat ggttgtatgt cagaaggaca caggaaccaa ctgcaagagc 61 tcccaaaaac cacccacaat gatggtggta tgtcagatgg acacaggaac caactgcaag 121 agctcccgaa gaccacccac aatgatggtg gtatgttgaa aggacacagg aaccaactgc 181 aagagctccc aaagaccacc cacgatgatg gttgtgtgtc aaaaggacac aggaaccaac 241 tgtaagagct cctgaagacc acccacaatg atggttttat gtcagaagga tacaggaacc 301 agctgcaagg gct // LOCUS HUMGLIBSD 184 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A4. ACCESSION M34819 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 184) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 32 54 human zinc finger protein binding site binding 95 117 human zinc finger protein binding site binding 158 180 human zinc finger protein binding site BASE COUNT 67 a 46 c 41 g 30 t ORIGIN 1 aaaagacaca ggaaccaact gcaagagctc ccaaagacca cccacaatga tggttgtgtg 61 tcaaaaggac acaggaacca actgtaagag ctcctgaaga ccacccacaa tgatggtttt 121 atgtcagaag gatacaggaa ccagctgcaa gagctcctga agaccaccca caatgatggt 181 tgta // LOCUS HUMGLIBSE 62 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region B1. ACCESSION M34820 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone B1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 62) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 11 33 human zinc finger protein binding site BASE COUNT 18 a 26 c 11 g 7 t ORIGIN 1 aaaccagacg cgtggaccac ccaagacgaa attcacactc gaccctagcc ccgccagccc 61 tt // LOCUS HUMGLIBSF 67 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region C1. ACCESSION M34821 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone C1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 67) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 16 29 human zinc finger protein binding site BASE COUNT 15 a 26 c 15 g 11 t ORIGIN 1 agacccaggc acacagatag accacccagc ttcaggtggg ggaccaccct tgcatcccct 61 ctctgct // LOCUS MLVU3RA 383 bp ss-RNA VRL 13-JUL-1990 DEFINITION Murine leukemia virus CWM-T-15.4 U3 region genomic RNA. ACCESSION M34722 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWM-T-15.4), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 383) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 112 178 MuLV promoter region binding 131 144 CCAAT-binding transcription factor binding 147 152 Moloney leukemia virus binding protein b binding 156 167 enhancer core binding protein binding 350 362 CCAAT-binding transcription factor binding 363 372 glucocorticoid response element BASE COUNT 127 a 105 c 84 g 67 t ORIGIN 1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat cttgcaaggc 61 atggaaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc 121 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccctagc 181 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg 241 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc 301 gcttattgct gcccagctct ataaaaaggg taaaaacccc acactcggcc ggcccagggc 361 caagaacaga tggtccccag aaa // LOCUS MLVU3RB 348 bp ss-RNA VRL 13-JUL-1990 DEFINITION Murine leukemia virus CWM-T-15 U3 region genomic RNA. ACCESSION M34723 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWM-T-15), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 348) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 77 177 MuLV promoter region binding 96 109 CCAAT-binding transcription factor binding 112 117 Moloney leukemia virus binding protein b binding 121 132 enhancer core binding protein binding 142 154 CCAAT-binding transcription factor binding 155 164 glucocorticoid response element BASE COUNT 115 a 97 c 76 g 60 t ORIGIN 1 ctgcagtaac gccatcttgc aaggcatgga aaaataccag agctgatgtt ctcagaaaaa 61 caagaacaag gaagtacaga gaggctggaa agtaccggga ctagggccaa acaggatatc 121 tgtggtcaaa cacctgggcc ccggcccagg gccaagaaca gatggtcccc agaaatagct 181 aaaacaacaa cagtttcaag agacccaaaa actgtctcaa ggttccccag atgaccgggg 241 atcaacccca agcctcattt aaactaacca atcagctcgc ttctcgcttc tgtacccgcg 301 cttattgctg cccagctcta taaaaagggt aaaaacccca cactcggc // LOCUS MLVU3RC 445 bp ss-RNA VRL 13-JUL-1990 DEFINITION Murine leukemia virus CWM-T-25a U3 region genomic RNA. ACCESSION M34724 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWN-T-25a), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 445) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 77 274 MuLV promoter region binding 96 109 CCAAT-binding transcription factor binding 168 179 CCAAT-binding transcription factor binding 238 252 CCAAT-binding transcription factor binding 138 143 Moloney leukemia virus binding protein b binding 209 214 Moloney leukemia virus binding protein b binding 147 158 enhancer core binding protein binding 218 229 enhancer core binding protein binding 180 189 glucocorticoid response element binding 252 263 glucocorticoid response element BASE COUNT 142 a 126 c 104 g 73 t ORIGIN 1 ctgcagtaac gccattttgc aaggcatgaa aaagtaccag agctgaagtt ctcagaaaaa 61 caagaacaag gaagtacaga gaggctaaaa agtacccggc ccagggccaa gaacagatgg 121 tcccagaccg ctaacgacag gatatctgtg gttaagcact agggccccgc ccagggccaa 181 gaacagatgg tccccagacc gctaacgaca ggatatctgt ggttaagcac tagggccccg 241 gcccagggcc aagaacagat ggtccccaga aatagctaaa acaacaacag tttcaagaga 301 cgcagaaact gtctcaaggt tccccagatg accggggatc aaccccaagc ctcatttaaa 361 ctaaccaatc agctcgcttc tcgcttctgt acccgcgctt attgctgccc agctctataa 421 aaagggtaag aaccccacac tcggc // LOCUS MLVU3RD 409 bp ss-RNA VRL 13-JUL-1990 DEFINITION Murine leukemia virus CWM-T-25b U3 region genomic RNA. ACCESSION M34725 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWN-T-25b), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 409) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 112 238 MuLV promoter region binding 131 144 CCAAT-binding transcription factor binding 174 179 Moloney leukemia virus binding protein b binding 183 194 enhancer core binding protein binding 204 215 CCAAT-binding transcription factor binding 216 225 glucocorticoid response element BASE COUNT 133 a 114 c 91 g 71 t ORIGIN 1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc 61 atgggaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc 121 taaaaggtac ccggcccagg gccaagaaca gatggtcccc agaccgctaa cgacaggata 181 tctgtggtta agcactaggg ccccggccag ggccaagaac agatggtccc cagaaatagc 241 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg 301 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc 361 gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggc // LOCUS TTHISOR 6434 bp ds-DNA BCT 13-JUL-1990 DEFINITION T.thermophilus insertion sequences Is1000A and Is1000B. ACCESSION M33159 KEYWORDS insertion sequence. SOURCE T.thermophilus (strain HB8, ATCC 27634), clones pNZ1557 and 1559. ORGANISM Thermus thermophilus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 6434) AUTHORS Ashby,M.K. and Bergquist,P.L. TITLE Cloning and sequence of IS1000 from Thermus thermophilus HB8 JOURNAL Plasmid (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.K.Ashby, 22-MAR-1990. FEATURES from to/span description pept 666 106 (c) ORFA pept < 706 1347 ORFB pept < 1251 1961 ORFC (AA at 1251) pept < 2007 3026 ORFD (AA at 2007) pept 3711 4664 ORF 1A pept 5173 6126 ORF 1B trns 3516 4711 IS1000A trns 4978 6173 IS1000B BASE COUNT 1060 a 2367 c 2037 g 970 t ORIGIN 1 ctcgccgccc tcctcctcgg cgggggccaa gcccacctgc tcttcgccct cgccctcagc 61 ctcggcctcc tttggccccc ccggtgagct cggccaggac cgccttcaag gccctttcca 121 ggcctcgagg ctccccgtgt tctccaagac ccaggtcgcc cgcttgcgct tctcctcctc 181 gggcatctgg gcccgctccc gggccaggac ctcctcccgg gaaagccccg agcgcgccat 241 cacccgcctc acccgctcct ccaggggggc cgccaccaag agggtcccgt ggagcctccc 301 ctcccacccc ttctcaaaga ggagggggat ctccagaaag acgagggggg cctctaggcg 361 ggaaagctcc tccataagaa gccgcctgac ctccgggtgg accacggcct cgagggcctt 421 gagcctctcc gggtcggaaa agacgagccg ggcgagggcc ctccggtcca gcctcccccc 481 caccaccgcc tcggggaaga ggcgcttcag ctcctcctcc ttgttctccc gggcccgggc 541 ggcgagcgcg tccaggtcca agacggggta gccccaggac ctgaggaggg cggccacggt 601 gctcttgccg ctgccgatgt tcccggtgat gccgataata atggggtgct tcgcctcgtg 661 gcccatatgg acttcccctt ttataccccc aaggacgcct tccccgtggg cggggcggtg 721 cgggacctcc tcctgggaag aaggccaacg gacctggact acgccgcctt ggaccccgaa 781 ggggcggccg aggaggcaaa gaggcgcctc gggggaagcc tcttccccct ggaccccaag 841 cggggccact accgcctggt ggtgggggaa aggaccctgg acttcacccc cttggagggg 901 aggctggagg aggacctcct ccggcgggac taccgggtga acgccctcct ctggaagggg 961 ggcgccgtct tcggcctaaa gggggtggag gaggacctga ggcggcgcct cctcgtcccc 1021 gtgcgggagg aaaacctcta ccaggaccac ctgcgaagcc tccggggggt ccgcctcgcc 1081 gccaccttgg gcttcgggct tccccagagg acccgggagg cccttggccg ccacgcccgg 1141 ttcctccagg cccaccccga ggccctcccc gcgcgggaaa gggtcaagga ggagcttgcc 1201 cggctcctcc tctccccgag ggcggcttcg gcctgcgcct tctggagagg gtggggctcc 1261 ttggggtcta cctccccgag ctcgccctcc tcgtgggcct ccatcagggc ggggtgcacc 1321 accttcccgc ctgggaccac accctgagcg cggtcttcca cctcctctgg ctctggcccg 1381 aggcccccct cgaggcccgc ctcgccgccc tcttccacga cgtgggcaag cccctcaccc 1441 gccgctttga ccccgaggtg ggccgcttcc gcttcctggg gcacgccgag gtgggggcgg 1501 agatcgcccg ggcaagcctc ttttggctcc gcttccccaa ggaggtggtg gagcgggcgg 1561 cgggcctggt gcgccgccac atggaccgcc tccccgagga gaggaaggcc ctgcgccgct 1621 tcttcctgag gcgccaggac ctcctcccgg acctcgtcta cctcatggcg gcggaccgcc 1681 tggccacgag gggcgtggag cgcgaggcct gggaggtgct cgggcgctac gaagaggtcc 1741 tgaaagaccc cctgccgcaa aggcccctcc tctccgggga ggaggtgatg gcccttctgg 1801 gcctccaaga aggccccgag gtgggaaggg ccctaaaggc cctcctcgag gcccaggccg 1861 aaggccgggt ggggaccaag gaggaggccc gggcctttct cctatattgg aggggtggaa 1921 gggaggctca ggcttcggga accccagatc acccccattg aggggggctt cctggtctcc 1981 gaccctacgg ggtctacgag aagcccctgg ccctcaccga gggcgggctt ttcctcctct 2041 ccctcatgga ggggaggacc ctggaggagg tgcaggagga ggtgttcaag cgccacgggg 2101 tcctggtgcc gaagaaggag ctggaagacc tggcgaaggc cctggaggag gcgggcctcc 2161 tcctcacgga gaaggtggag gcgaggctca aagaggagga ggaaaagctc aagcgggagc 2221 ggcccatgcg cctcgccggg ctctcctatc ccgagggcga gcgggaggcc cgggccttcc 2281 tcgaggcctt ccgggcgagc tacccggggg aaggcgagga ggcccgggtc ctcctcatgc 2341 cccacctgga gccctcccgc gtccccgagg tctacggggc ggccctcgcc gccctggaga 2401 agaccccgcc ccctgagcgc atctacctgg tgggggtcgc ccacaggccg ctgaaggaga 2461 aggccgccgc ccttcccgtg cccttccaga cccccttcgg ccccgccctg ccggaccttc 2521 ccgccctcca ggccttagac gccctccttc cctttgagct cttcaacacg cccctcgcct 2581 tccgggagga gcacagcctg gagcttcccc tcttcttcct gaaggggcgg ttccccgagg 2641 cccgcgtcct ccccctcctc gtggccaggc gaagcccgga gctcggggag gccctgaagg 2701 tggtcctgcg ggacttcccg ggccttctcg tcctcgccgt ggacctctcc cacgtggggc 2761 cccgcttcgg ggacaccccc ctcacccgca ccctggcgga ggaggcgagg aggcgggacc 2821 tgggcttcct ggaaaggctc gcggaagggg agccggaggc cgccctcgcc ttcctggggg 2881 ccaaccccac ccgcatagac ggggtggagg tggtggcaag cctcctcccc ctcctccggg 2941 aaaggaaggg gaaggtcctg gcccaccgcc tggacctcga ggcccccacc ctaagcgccg 3001 tgggggcggg caccctggtc ctctaaagag ccccagattt ggaacgacgc gaaggcccgg 3061 gcgccgaacc gcccgggccc cttttctggg agcgcccctt tgggaaaggg cccggaaggt 3121 gggtgggcag gcgcgccggg gcctgcgtga aagcctccta gcgcgagggg tacaggcctt 3181 cacccacgac aaaggggtcg cttgatgggc aagacagtaa tccccttacg gggctcaatc 3241 ccttgcaacc cttaggagct acctttttgc gcttcccgtg acgagcgtcg caatcccctt 3301 acggggctca atcccttgca acttccttct aaggaggaat gggcatcatt cacaatcagc 3361 aagtcgcaat ccccttacgg ggctcaatcc cttgcaacgg attcgcctcg gcccgcctag 3421 ggactggatc tatgaccccg tcgcaatccc cttacggggc tcaatccctt gcaaccattg 3481 gtggaggcgc gtgataagcg agtggaggtg aggtggtcgc atgtatcttg ccccctgagg 3541 gaggccccac cgccccggtt gccgggccac cacgcccgcg ggattgatcg gtagccaccc 3601 accccggcct ccccgccgca cagtatcccg gtcgtcaaag accgcatcca catggaaggc 3661 ccacagggtc gggtggccac ctccctcagg ttaccaaaag gagagagccc atgaccttcg 3721 ccggcattga cgtcagcaaa acccacctgg acctggccct cgtctccaac tcccccaaac 3781 ccacccgcct ccgcttcccc aactcccctg aaggccgtca agccctcctc gccgccctcg 3841 cccaccacaa ccccgcctgg gtcgccctgg agcccacggg cgcttaccac ctccccctcc 3901 tcaagctcct ggcagaaaac cgcctccagg tggccctggt caacccctac catctcgccg 3961 ccttccgcaa ggccaaggga gaacgccaga agaccgaccg ccaagacgcc ctcctcctcg 4021 cccgctacgc ccaggtctac cacggggagc tctgggccta caccctcccc ccagaagccc 4081 tccgtgagct caaagccctg gtgggctacc gggaggactt ggccgggcgg gaaagggcca 4141 tcctcaacca gatggaggcg gcggagtggg cggggagcaa ggaggtcctc gccctcctcc 4201 aaaaggagct ggcctgcgtg aaggggcttc tcggggaggt ggaggccagg atccaggccc 4261 tcctcgccac cctccccgag gccgaggtcc tgatggccct gcccggggtg gggccccagg 4321 tggcggcagc ggtgctggcc ctcctgcccc cagagctctg gggccgggcg aagagggcgg 4381 cctcctacgc ggggctcatc cccgagcggg aggagtcggg aaagagcgtg gagaggagtc 4441 ggctctccaa aaaagggcct cccctcctgc ggcgaaagct ctacatgggc gccctggtgg 4501 cggtgcgcca tgacccggag atgcgggcct tctaccaccg cctgctctcg cggggaaaga 4561 gaaagaagca ggcgttggtg gccgtggccc acaagctcct caggcggatg atgggaaggc 4621 tcagggagta ctacgcgacc cagctagatc aaggggtcgc ttgacaggca agacagtatc 4681 aatcccctta cggggctcaa tcccttgcaa ctccgacggc aagatcatcg gggtgcggcg 4741 gctcctcgcc gtcgcaatcc ccttacgggg ctcaatccct tgcaacgcca gtctatgacc 4801 aagcccctag gctccttcac tttgaagtgt cgcaatcccc ttacggggct caatcccttg 4861 caacacctca tgggggagta agataggcgg gaaaggaggg aggtcgcaat ccccttacgg 4921 ggctcaatcc cttgcaacca cgtgaagacg tccctggcct cggtcgtgag ggacgtggtc 4981 gcatgtatct tgccccctga gggaggcccc accgccccgg ttgccgggcc accacgcccg 5041 cgggattgat cggtagccac ccaccccggc ctccccgccg cacagtatcc cggtcgtcaa 5101 agaccgcatc cacatggaag gcccacaggg tcgggtggcc acctccctca ggttaccaaa 5161 aggagagagc ccatgacctt cgccggcatt gacgtcagca aaacccacct ggacctggcc 5221 ctcgtctcca actcccccaa acccacccgc ctccgcttcc ccaactcccc tgaaggccgt 5281 caagccctcc tcgccgccct cgcccaccac aaccccgcct gggtcgccct ggagcccacc 5341 agcgcctacc acctccccct cctcaagctc ctggcagaaa accgcctcca ggtggccctg 5401 gtcaacccct accacctcgc cgccttccgc aaggccaagg gagaacgcca gaagaccgac 5461 cgccaagacg ccctcctcct cgcccgctac gcccaggtct accacgaaga cctccgggcc 5521 tacaccctac ccccagaaac cctccgggag ctcaaagccc tggtgggcta ccgggaggac 5581 ttggccgggc gggaaaggac catcctcaac cagatggagg cggcggagtg ggcggggagc 5641 aaggaggtcc tcgccctcct ccagaaggag ctggcctgcg tgaaggggct tctcggggag 5701 gtggaggcca ggatccaggc cctcctcgcc accctccccg aggccgaggt cctgatggcc 5761 ctgcccgggg tggggcccca ggtggcggca gcggtgctgg ccctcctgcc cccagagctc 5821 tggggccggg cgaagagggc ggcctcctac gcggggctca tccccgagcg ggaggagtcg 5881 ggaaagagcg tggagaggag tcggctctcc aaaaaagggc ctcccctcct gcggcgaaag 5941 ctctacatgg gcgccctggt ggcggtgcgc catgacccgg agatgcgggc cttctaccac 6001 cgcctgctct cgcggggaaa gagaaagaag caggcgttgg tggccgtggc ccacaagctc 6061 ctcaggcgga tgatgggaag gctcagggag tactacgcaa cccagctaga tcaaggggtc 6121 gcttgacagg caagacagta tcaatcccct tacggggctc aatcccttgc aacgaacctc 6181 ctcgggccca ccgagggggt aatcctcgca gagtcgcaat ccccttacgg ggctcaatcc 6241 cttgcaacct gaaccagaag gccctctggg cggcggcgta cgccgtggcg tcgcaatccc 6301 cttacggggc tcaatccctt gcaacgcgat tggcgtcgcc ctccccgtta tcgcttacga 6361 gtcgcatgta tcttgccccc tgagggaggc cccaccgccc cggttgccgg gccaccacgc 6421 ccgcgggatt gatc // LOCUS MLUDNAA 4171 bp ds-DNA BCT 13-JUL-1990 DEFINITION M.luteus ribonuclease P (rnpA), 50S ribosomal subunit protein L34 (rpmH), DNA biosynthesis initiation protein (dnaA), and DNA polymerase III beta subunit (dnaN) genes, complete cds. ACCESSION M34006 KEYWORDS 50S ribosomal subunit protein L34; DNA biosynthesis initiation protein; DNA polymerase III beta subunit; ribonuclease P. SOURCE M.luteus (strain IF03333) DNA. ORGANISM Micrococcus luteus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 4171) AUTHORS Fujita,M.Q., Yoshikawa,H. and Ogasawara,N. TITLE Structure of the dnaA region of Micrococcus luteus: Conservation and variations among eubacteria JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by N.Ogasawara, 05-MAY-1990. FEATURES from to/span description pept 435 37 (c) ribonuclease P (rnpA) pept 601 464 (c) 50S ribosomal subunit protein L34 (rpmH) pept 944 2491 DNA biosynthesis initiation protein (dnaA) pept 3242 > 4171 DNA polymerase III beta subunit (dnaN) signal 843 848 dnaA -35 signal signal 866 871 dnaA -10 signal BASE COUNT 672 a 1558 c 1361 g 580 t ORIGIN 1 ctcgagcacg acgaagggcg agggcgtgac ggtcatctac gcgtgctccg aggcggcggg 61 caggtgcggc ttcagggcct tgccgagcgc gccgacggtc tcgcggcgca gcagcgcgta 121 gtccgcctcg gcggcggcag gcagcgcgcg gacctgcacc agaaccggca ggtcgcgcag 181 cgggggcagg cgcatctgct ccgcgacgac ggcgcgcagc cgccgcttca cccggttgcg 241 ggtcaccgcg ttccccacgg ccttggacac gacgaacccg gcccgcggcc gaggggcgct 301 cggcgacgtg gaccgggtct gatcggggtc cgtggcgacg ctcaccacga cggtgcgccg 361 acccgcgcgg gtgccggtgc gaccgaggtg acggaactcc gcgggcgtgc ggacgcgacg 421 gtcacggggc agcacggggc gcgccgtccc tggaccgagg tcctcaggcg gacagctcgg 481 cgcggttctt gccgcggcga gcggacagga tggcgcggcc ggcgcgggtg cgcatgcggg 541 cacggaagcc gtgcttgcgg gcacgacggc ggttgttggg ctggaacgtg cgcttggtca 601 ctgtgatctc tccacgacgt gttgtgtgcc ttcgaccgtc gacctgagca ggtggaccag 661 ggggaaggac cggtgaggct ctgacatgaa atccccggcg ggcacgatca accgacaccc 721 gggggaccag acaactctag tggccggggt acgcaggggt caaacgggtg cgcgcggatc 781 ccggacgccc gtgggcgccc gggacggtcg tccacaggat gataggggtt gtccaccgtc 841 tgtgcacaac cctgtggaca accgttagac tccgagcacc gtcgtccgtc tcgcgcagcc 901 gcccccgtcc ccggggacgt ctgccccagg aaaggaagcc tccgtggtgg cagaccaggc 961 cgtgctcagc tcgtggcgct ccgtcgtggg ctccctcgag gacgacgcgc gggtcagcgc 1021 ccgcctcatg ggcttcgtct acctggccca gccgcagggc ctcatcggca atacgctcct 1081 gctggccgtg ccgaacgaga ccacccgcga gacgctccag ggcacccagg tggccgacgc 1141 cctcacggac gccctgaccc aggagttccg cgaggagatc ctgctggcca tctcgatcga 1201 cgcgaacctg cagccgccgc ggaccccctc ctccgaggcc cgccgctcct ccctcgccgg 1261 cgggccgtcc ggcgccgcgg cgccggacgt cgagctgccc ccggccgcga ccgccgccac 1321 ctcccgccgc gccgtcgccg aggagctgcc gggtttccgc atcgagccgc cggccgacgt 1381 cgtgcccgcc gcgaacgcgg ccccgaacgg gaacggcaag ccgacccccg cgccgccgtc 1441 gacgtccgcg gagaccagcc gcctcaacga ccgctaccac ttcgagacgt tcgtgatcgg 1501 ctcgtccaac cggttcgcgc atgcggccgc gaacgcggtg gccgaggcgc cggcgaaggc 1561 gtacaacccg ctgttcatct acggcgagtc cggcctgggc aagacgcacc tgctgcacgc 1621 gatcgggcac tacgcccgcc gcctctaccc gggcttgcgg gtgcggtacg tgaactcgga 1681 ggagttcacc aacgacttca tcaactccat ccgccacgac gagggcgcct cgttcaagca 1741 ggtctaccgc aacgtggaca tcctgctgat cgacgacatc cagttcctgg cggacaagga 1801 ggcgacggtc gaggagttct tccacacctt caacacgctc tacaacaaca acaagcaggt 1861 ggtcatcacc tcggacctgc cgcccaagca gctctccggg ttcgaggacc ggctgcgctc 1921 ccgcttcgag tgggggctga tcacggacat ccagccgccg gacctcgaga cgcgcatcgc 1981 gatcctccgg aagaaggcgg aggccgaggg gctcgtggcc ccgccggagg cgctggagta 2041 catcgcctcg cgcatctcca cgaacatccg cgagctcgag ggtgcgctga tccgcgtgac 2101 cgcgttcgcc tcgctcaacc ggcagaccgt ggacatcgag ctggccgagc atgtgctcaa 2161 ggacctgatc acggacgaga cggcgcacga gatcacgccg gagctgatcc tgcacgccac 2221 gggggagtac ttcaatctca ccctcgagga gctgaccagc aagtcccgca cccgcacgct 2281 ggtgacggcg cggcagatcg ccatgtacct gctgcgcgag ctgaccgaga tgtcgctgcc 2341 caagatcgga caggtcctgg gcggccgcga ccacaccacc gtcatccacg cggaccgcaa 2401 gatccgcgag ctgatggccg agcggcggac gatctacaac caggtcaccg agctcaccaa 2461 cgagatcaag cggaaacagc gcggcgcctg agccacccgc cgtcccaccc tcgagtcccg 2521 gttcgaacct tcgaccggga ctcgagcctt cgtccacatt gtccaccggt gtggacaact 2581 ctgtggagaa cccccgtcgc tgtggacaac ctccgcccct gtggatggcg gaatcaggcc 2641 gctcacgccg cgtccacacc cctgggggcg acgacggtgc acaggctggt gaatcacacg 2701 gatgccattc ggcggtccac acgtatccac agatctccac agccgaaaac cgcatgatct 2761 cgcgggttcc gacgggtggg gacgcaccct gtggacagga atgtgggcaa ctggggggtg 2821 gatccgggga cgagtctgga caactccggt cgaacctgtg cagagcaggg ggaggacctc 2881 cggggctcgt ccacaccgga tgcacacccc gaacgagcgg gcgtccacac ggttccccca 2941 accggatccg catgctgacg cggcagaacg gcgttctcca caggatccac acccgctacc 3001 actccttctg atcccccttc agagattccg ccaagcaaca tcatctccgc cgcctcgcac 3061 cgagctgagg gccccgagag cccgggctgc gcccgagcac tcagaatcct tggaatcaca 3121 cgcttgtcat tcgcttccga cctgacaccc gccgcgcgga ccgcctatgc tgaggcccgc 3181 gtgtcgccgt tcccccgggc ccgcaacccc cgtcatcacc cccagagagg cagtgcagct 3241 cgtgaagttc accgtcgaac gcgacatcct caccgacgcc gtctcctggg ccgcccgctc 3301 cctgtccccg cgcccgccgg tgccggtgct ctcgggtctg ctgatcacgg ccgaggccgg 3361 cgtcgtgtcc atcgcgagct tcgactacga gacctccgcg cgcctggaga tcgaggcgga 3421 cgtcgagacc gcgggccagg tgctcgtctc cggccgtctg ctgaacgaca tcgtccgctc 3481 cctgccgcag gcccaggtca cggtggagct cgacggcggc aaggtgctcg tcacctgccg 3541 atcctcccgc ttctccctgg ccacgatgcc cgtgggcgac tacccggccc tgcccgagct 3601 gccggccccc gcaggcaccg tggacggggc cgcgttcgcc cacgccgtcg cccaggtcac 3661 cgtcgccgcc tccaaggacg acaccctgcc gatcctcacc gccgtgaagg tggagatcga 3721 gggcgacacc ataaccttcc tggccacgga ccgctaccgg ctggccatga aggagatccg 3781 ctggaccccg gcggacccgt ccatctcgac gtcgctgctc atcaaggcgc gcacgctcac 3841 cgaggtcgcc aagtccctcg gctccggcgg cgacctggag atcctcctcg gtcagaccgc 3901 cgacctcgtg ggcttcgcct ccggtggccg ccgcaccacc tccgtgctcg tggacggcga 3961 gtaccccaag atccgctcgc tcttccccga gtccagcccc atccaggccg tggtggacac 4021 cgccgccctg gtcgaggcgt cccgccgcgt ggccctcgtg gccgagcgca acaccgccct 4081 gcgcatggtg ttcacggagg gccaggtcac cctggacgcc ggcaccggcg atgacgcgag 4141 cgccaacgag tccgtgccgt gcacgctcga g // LOCUS CE1CG 6646 bp ds-DNA circular BCT 13-JUL-1990 DEFINITION Plasmid ColE1, complete genome. ACCESSION J01566 M33100 KEYWORDS cea gene; colicin; colicin E1; colicin E1 immunity; complete genome; kil gene; origin of replication. SOURCE Plasmid ColE1 DNA. ORGANISM Plasmid Colicin E1 Prokaryota; Bacteria. REFERENCE 1 (bases 1008 to 1370) AUTHORS Tomizawa,J.-I., Ohmori,H. and Bird,R.E. TITLE Origin of replication of colicin E1 plasmid DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 74, 1865-1869 (1977) STANDARD full staff_review REFERENCE 2 (bases 1410 to 1599) AUTHORS Bastia,D. TITLE Determination of restriction sites and the nucleotide sequence surrounding the relaxation site of ColE1 JOURNAL J. Mol. Biol. 124, 601-639 (1978) STANDARD full staff_review REFERENCE 3 (bases 1 to 1613; 6577 to 6646) AUTHORS Oka,A., Nomura,N., Morita,M., Sugisaki,H., Sugimoto,K. and Takanami,M. TITLE Nucleotide sequence of small ColE1 derivatives: Structure of the regions essential for autonomous replication and colicin E1 immunity JOURNAL Mol. Gen. Genet. 172, 151-159 (1979) STANDARD full staff_review REFERENCE 4 (bases 623 to 832) AUTHORS Morita,M. and Oka,A. TITLE The structure of a transcriptional unit on colicin E1 plasmid JOURNAL Eur. J. Biochem. 97, 435-443 (1979) STANDARD full staff_review REFERENCE 5 (bases 1 to 192; 384 to 519; 6644 to 6646) AUTHORS Patient,R.K. TITLE Characterization of in vitro transcription initiation and termination sites in Col E1 DNA JOURNAL Nucleic Acids Res. 6, 2647-2665 (1979) STANDARD full staff_review REFERENCE 6 (bases 4783 to 5353) AUTHORS Ebina,Y., Kishi,F., Miki,T., Kagamiyama,H., Nakazawa,T. and Nakazawa,A. TITLE The nucleotide sequence surrounding the promoter region of colicin E1 gene JOURNAL Gene 15, 119-126 (1981) STANDARD full staff_review REFERENCE 7 (bases 1 to 6646) AUTHORS Chan,P.T., Ohmori,H., Tomizawa,J.-I. and Lebowitz,J. TITLE Nucleotide sequence and gene organization of ColE1 DNA JOURNAL J. Biol. Chem. 260, 8925-8935 (1985) STANDARD full staff_review REFERENCE 8 (bases 57 to 123) AUTHORS Zhiang,S., Yan,L. and Zubay,G. TITLE Regulation of gene expression in plasmid ColE1: Delayed expression of the kil gene JOURNAL J. Bacteriol. 170, 5460-5467 (1988) STANDARD simple staff_entry REFERENCE 9 (bases 639 to 753) AUTHORS Tomizawa,J.-I. TITLE Control of ColE1 plasmid replication: The process of binding of RNA I to the primer transcript JOURNAL Cell 38, 861-870 (1984) STANDARD full staff_review REFERENCE 10 (bases 5046 to 5110) AUTHORS Parker,R.C. TITLE Mitomycin C-induced bidirectional transcription from the colicin E1 promoter region in plasmid ColE1 JOURNAL Biochim. Biophys. Acta 868, 39-44 (1986) STANDARD full staff_entry COMMENT The L (light) strand is shown. This sequence was kindly supplied over computer network by J.Lebowitz (04-APR-1985). The open reading frames noted by [7] are named in FEATURES by the molecular mass of their predicted products or by the names given by [7]. FEATURES from to/span description pept 400 59 (c) colicin E1 immunity protein (imm) pept 448 585 kil protein pept 1609 1746 mob1 protein (gtg start codon) pept 1803 1612 (c) RNA I inhibition modulator protein (rom; gtg start codon) pept 4362 3943 (c) entry exclusion protein 2 (exc2) pept 4862 4443 (c) entry exclusion protein 1 (exc1) pept 5140 62 colicin E1 protein (cea) RNA 641 > 641 II RNA RNA 751 645 (c) I RNA ((3' end +/- 1 bp) [4] mRNA 1449 > 3936 mob mRNA (5' end putative) [5],[7] mRNA 5056 > 585 colE1-kil mRNA [7] mRNA 409 < 59 (c) imm mRNA [5],[7] mRNA 4887 < 3943 (c) exc mRNA [5],[7] binding 5063 5093 LexA binding site [7] ORF 605 895 10.2 kd ORF [7] (ttg start codon) ORF 1241 870 (c) 13.8 kd ORF [7] (gtg start codon) ORF 1189 1350 6.0 kd ORF [7] ORF 1408 1238 (c) 6.4 kd ORF [7] ORF 1347 1526 6.9 kd ORF [7] (gtg start codon) ORF 1709 1500 (c) 7.6 kd ORF [7] ORF 1609 1746 mob1 ORF (gtg start codon) ORF 1843 2190 mob2 ORF ORF 2180 3733 mob3 ORF ORF 2214 2399 mob4 ORF (gtg start codon) ORF 2868 3386 mob6 ORF ORF 3393 3626 mob7 ORF (gtg start codon) ORF 3657 3818 mob8 ORF ORF 3766 3936 mob9 ORF (gtg start codon) ORF 4258 4647 14.5 kd ORF [7] ORF 5435 5653 8.8 kd ORF [7] ORF 5852 6013 6.7 kd ORF [7] conflict 83 84 aa in [3],[7]; a in [5] conflict 99 100 aa in [3],[7]; a in [5] conflict 135 135 a in [3],[7]; g in [5] conflict 144 144 a in [3],[7]; g in [5] conflict 174 174 a in [3],[7]; g in [5] conflict 484 485 aa in [3],[7]; a in [5] conflict 491 492 tt in [3],[7]; t in [5] conflict 506 512 ctaacta in [3],[7]; caca in [5] conflict 517 519 cct in [3],[7]; cgg in [5] conflict 528 528 c in [7]; g in [3] conflict 652 653 aa in [3],[7]; aaa in [4] conflict 1111 1111 g in [7]; t in [1],[3] orgrpl 1197 1197 replication origin (+/- 1 bp) [1] conflict 1366 1366 a in [3],[7]; g in [1] conflict 1416 1417 cc in [3],[7]; c in [2] conflict 1454 1454 c in [3],[7]; t in [2] cutss 1466 1465 (c) relaxation (nic) cut site [2] conflict 1581 1582 aa in [3],[7]; a in [2] conflict 4784 4787 ctcc in [7]; cgc in [6] BASE COUNT 1760 a 1421 c 1798 g 1667 t ORIGIN 2 bp downstream of EcoRI site. 1 ttctatgctc ctatattgat aagaataaac ttaatactat aaatgaggtg ttagggattt 61 aattattctt tattgatata aaaagtccta gcaatccaaa tgggattgct aggaccaaac 121 aaagtagatt atatagcata aataggttta attttgctac gggggcgtta tttaggtttt 181 ttcttctttc gaaaaaatct ttctttatga agttaaaagc tatgtattca atagcatatt 241 ttgaatatgg acatagaata gtgcttatca ctattgcata tagcatctta tctgacacaa 301 ggaaataata cccttcgctg ttttttgtta taaggtatat atatataagt gtgcagtaca 361 ggccaaataa aatatttttt atgtagtatc ttaagctcat aaattaaacc tcgccatata 421 ttcttttcat tttataagga tcgagttatg aggaaaagat tttttgtggg aatattcgcg 481 ataaacctcc ttgttggatg tcaggctaac tatatacctg atgttcaggg agggaccatc 541 gcaccatcct cctcttctaa actgacgggg atcgcggttc agtagaaaag attaaaggat 601 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 661 taccaacggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 721 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag tcgggccact 781 acttcaagaa ctctgtagca ccgtttgtgc catcatcgct ctgctaatcc ggttaccagt 841 ggctgctgcc agtggcgtta aggcgtgcct taccgggttg gactcaagac gatagttacc 901 ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg 961 aacgacctac accgaactga gataccaaca gcgtgagcta tgagaaagcg ccacgcttcc 1021 cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac 1081 gagggagctt ccagggggaa acgcctggta gctttatagt cctgtcgggt ttcgccacct 1141 ctgacttgag cgtctatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc 1201 ctgctacgtg gccttcttcc tgttcctggt cttttgctca catgttcttt ccggccttat 1261 cccctgattc tgtggataac tgtgttaccg tttttgtgtg agtcagtacc gctcgccgca 1321 gtcgaacgac cgagcgtagc gagtcagtga gcgaggaagc ggaaaagcgc ctggacgtgc 1381 attttctcct tacgcatctg tgcggcattt cacacccggc atggcgtact tttcatacaa 1441 tccgcactga tgccgcatgg ttaagccagt atacactccg ctatcgctac gtgactgggt 1501 cagggctgcg ccccgacacc cgctaaaacc tgctgacgcg ccctgacggg cttgtcagct 1561 cccggcatcc gctcacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt 1621 ttcaccgtca tccccgaaac gtgcgaggca gctgcggtaa agctcatcgg cgtggtcgtg 1681 aagcgattca caaatatcgg cctgttcatc tgcgtccagt tcgttgagct tctccagcag 1741 cgttaatgtc tggcttctga taaagcgggc catgttaagg gcggtttttt cctgtttagt 1801 cactgatgcc tccgtgtaag ggggatttct gttcatgggg taatgatacc gatgaaacgc 1861 gagaggatgc tcacaatacg ggttactgat gatgaacatg cccggttact tgaacgctgt 1921 gagggtaaac aactggcggt atggatgcgg cgggtctgcc tgggggagcc ggttgcccgt 1981 tccggaaaac tgccgacact ggcaccgccg ttactgcgtc agctggccgc catcggaaat 2041 aacctgaatc agacagcccg taaggtgaac agcgggcagt ggtcttccgg tgaccgggtt 2101 caggtggtgg ccgcactgat ggccatcggg gatgagctgc gccggctgcg tctggctgtc 2161 agggaacagg gggcgcggga tgatagttaa atttcatgcc aggggaaaag gtggtggcag 2221 tggtccggtt gattacctgc tggggaggga gcgtaaccgc gaaggcgcaa cggtgcttca 2281 gggtataccg gaagaagtcc gggaactcat cgatgccacg ccatttgcga agaaatacac 2341 gtccggtgtt ctgtcgttcg cggagaagga gctgccgccg ggaggacgtg aaaaagtgat 2401 ggcgagcttt gagcgtgtac tgatgcccgg tctcgaaaaa aatcagtaca gcatcctgtg 2461 ggtggagcac caggacaagg gacggcttga gctgaatttt gtcattccga acatggagct 2521 acagaccgga aaacgcctcc agccgtacta cgaccgcgca gacaggccta gaattgatgc 2581 ttggcagacg ctgttaaatc accattacgg gctgcatgac ccgaacgccc cggagaaccg 2641 caggacgctg acactccctg ataacctgcc tgaaacgaaa caggcgcttg ctgagggcgt 2701 cacgcgaggt atagatgcac tttaccatgc cggagagata aaaggccgtc aggatgtgat 2761 tcaggcgctc actgaggcgg ggctggaagt ggtcagggtg acgcgaagca gtatcagcat 2821 tgcagatccg aacggcggga agaatatcag gctgaaagga gcattttatg agcaatcttt 2881 tgcagacggg cgcggagttc gagaaaaagc tgaaagagag agccgaatct acagagaaaa 2941 tgctgaacaa cgagttcagg aggctcggcg aatctgtaag cgaggctgtg acatcaaacg 3001 agacgaaaat cagagacgct atagccctgt tcacagcctc gacagaggaa tcgctggaaa 3061 aacaccggga aggggtgaaa gaggcgatga tgcagcacag gagggacgtg ttaaagctgg 3121 cagggaatac gggcatgatg ttactgggga tagtctttct cctgtttacc gcgagtggcg 3181 ggacgctctg gtatcttgga gggaggatac aggcgaacct ggaagaaatc aggaagcagg 3241 aagagacatt gcagaaactg aacgcgaaga catggggcgt ggagtttgtg caggacggga 3301 acaggaaatt ccttgtcctt ccgtacggga aatcagcgga ggtgattccc tttcagggga 3361 aagagtgggt acatctgaag gagtgacaca gagtgacaga gctggaaaca catttgctga 3421 acgccttaga gcagctgcaa caggactata tgcagcggct gagcgaatgg gagagcgcct 3481 tcgtggaatt gcagaagatg ttttcgctta cgcaacggga caacgcgatg ctgaacgagc 3541 gggtcatgca gttgagtcag caggtgcaac acttgagcga gcagacagaa cgcttgagcc 3601 agttatacag cgagaactgg agataagaga ggaacggctg atacaggagc gcgaacatgt 3661 gttatcccta gaacgggagc gtcagccgga aatacaggaa cgcacgctgg atggcccttc 3721 gctgggatgg tgaaaccatg aaaaatggca gcttcagtgg attaagtggg ggtaatgtgg 3781 cctgtaccct ctggttgcat aggtattcat acggttaaaa tttatcaggc gcgatcgcgc 3841 agtttttagg gtggtttgtt gccattttta cctgtctgct gccgtgatcg cgctgaacgc 3901 gttttagcgg tgcgtacaat taagggatta tggtaaatcc acttactgtc tgccctcgta 3961 gccatcgaga taaaccgcag aaatcgtgtc agccagcagc ctggattttt tcggggtaag 4021 gtctttgctg cggtaatcag tacccgcaaa ttgagttgtg ccccggagtg agtttaaaaa 4081 ttcttggctg tgaaactgct gaatgcgttg ttgcacatca ctccgtgatt ttcctgtcgc 4141 tttgtcttta gctcccatat cccagaactg ccgaaaaaaa ggcaccatca ttcgtgttgt 4201 gtcggctttt tgggtggaaa agttgggatc aaaaccgtca tcagaagcat atacgtaatg 4261 ccgggtatgt tgttccacgc tgccctttgg ggtacaggct gtgagagcta ctgttgctgt 4321 tatcatggcg ataagacaaa gtttgtttcc ggtacgtctc atgagtgtct ccctacctta 4381 aagtatttat tctcggtgag ttattatccg aagactattt tatcatcagt ctctgacagc 4441 gttcagcgac cgaacaacct ggaccagaat ccccgtttct tttccagttc cagtgcctgc 4501 ctgagctggg ctatctcatt ttgtagctgt tcccgttctg cttcctggcg tctgcgatcc 4561 atatcctgtg cctgtttatc ctcaagcatc agcgtcaggc attgtttcag ctcattcagt 4621 tcccggagaa tgcgttctgt ctgctgatca tgtggatttt ctgcatgtcc ctcactgtgc 4681 ctttctggtg tctcattctg ctttaattcg ccgtatgccc ggatcagttc actggtttca 4741 aactcccgtc gtccgtccct gccggtacgg taactgacaa ggcctccgcg catatcacgg 4801 taaaactgtg aacgcgatct gcctgtcatt tttagtgcgt cccggagtgt gtgccatgcc 4861 ataaagtgac agtgtcccat agatgtctca tctcatagtt tcagtaaaac ataatgaggt 4921 ctgagaacgg taatgtttgt gctggttttt gtggcatcgg gcgagaatag cgcgtggtgt 4981 gaaagactgt ttttttgatc gttttcacaa aaatggaagt ccacagtctt gacagggaaa 5041 atgcagcggc gtagctttta tgctgtatat aaaaccagtg gttatatgta cagtatttat 5101 ttttaactta ttgttttaaa agtcaaagag gattttataa tggaaaccgc ggtagcgtac 5161 tataaagatg gtgttcctta tgatgataag ggacaggtaa ttattactct tttgaatggt 5221 actcctgacg ggagtggctc tggcggcgga ggtggaaaag gaggcagtaa aagtgaaagt 5281 tctgcagcta ttcatgcaac tgctaaatgg tctactgctc aattaaagaa aacacaggca 5341 gagcaggctg cccgggcaaa agctgcagcg gaagcacagg cgaaagcaaa ggcaaacagg 5401 gatgcgctga ctcagcgcct gaaggatatc gtgaatgagg ctcttcgtca caatgcctca 5461 cgtacgcctt cagcaacaga gcttgctcat gctaataatg cagctatgca ggcggaagca 5521 gagcgtttgc gccttgcgaa agcagaagaa aaagcccgta aagaagcgga agcagcagaa 5581 aaggcttttc aggaagcaga acaacgacgt aaagagattg aacgggagaa ggctgaaaca 5641 gaacgccagt tgaaactggc tgaagctgaa gagaaacgac tggctgcatt gagtgaagaa 5701 gctaaagctg ttgagatcgc ccaaaaaaaa ctttctgctg cacaatctga agtggtgaaa 5761 atggatggag agattaagac tctcaattct cgtttaagct ccagtatcca tgcccgtgat 5821 gcagaaatga aaacgctcgc tggaaaacga aatgaactgg ctcaggcatc cgctaaatat 5881 aaagaactgg atgagctggt caaaaaacta tcaccaagag ccaatgatcc gcttcagaac 5941 cgtccttttt ttgaagcaac cagacgacgg gttggggccg gtaagattag agaagaaaaa 6001 caaaaacagg taacagcatc agaaacacgt attaaccgga taaatgctga tataactcag 6061 atccagaagg ctatttctca ggtcagtaat aatcgtaatg ccggtatcgc tcgtgttcat 6121 gaagctgaag aaaatttgaa aaaagcacag aataatctcc ttaattcaca gattaaggat 6181 gctgttgatg caacagttag cttttatcaa acgctgactg aaaaatatgg tgaaaaatat 6241 tcgaaaatgg cacaggaact tgctgataag tctaaaggta agaaaatcgg caatgtgaat 6301 gaagctctcg ctgcttttga aaaatacaag gatgttttaa ataagaaatt cagcaaagcc 6361 gatcgtgatg ctatttttaa tgcgttggca tcggtgaagt atgatgactg ggctaaacat 6421 ttagatcagt ttgccaagta cttgaagatt acggggcatg tttcttttgg atatgatgtg 6481 gtatctgata tcctaaaaat taaggataca ggtgactgga agccactatt tcttacatta 6541 gagaagaaag ctgcagatgc aggggtgagt tatgttgttg ctttactttt tagcttgctt 6601 gctggaacta cattaggtat ttggggtatt gctattgtta caggaa // LOCUS COGTUB1A 2672 bp ds-DNA PLN 13-JUL-1990 DEFINITION C.graminicola beta-tubulin-1 (TUB1) gene, complete cds. ACCESSION M34491 KEYWORDS beta-tubulin-1. SOURCE C.graminicola DNA. ORGANISM Colletotrichum graminicola Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Diaporthales; Diaporthaceae. REFERENCE 1 (bases 1 to 2672) AUTHORS Panaccione,D.G. and Hanau,R.M. TITLE Characterization of two divergent beta-tubulin genes from Colletotrichum graminicola JOURNAL Gene 86, 163-170 (1990) STANDARD simple staff_review FEATURES from to/span description pept 662 673 beta-tubulin-1 (TUB1), exon 1 740 763 beta-tubulin-1, exon 2 818 843 beta-tubulin-1, exon 3 905 946 beta-tubulin-1, exon 4 1014 1068 beta-tubulin-1, exon 5 1124 1908 beta-tubulin-1, exon 6 1962 2355 beta-tubulin-1, exon 7 IVS 674 739 TUB1 intron A IVS 764 817 TUB1 intron B IVS 844 904 TUB1 intron C IVS 947 1013 TUB1 intron D IVS 1069 1123 TUB1 intron E IVS 1909 1961 TUB1 intron F BASE COUNT 632 a 754 c 660 g 626 t ORIGIN 1 ggatccggct cggcgttacg gactggaaac ccaaagggca acaatgagta gagcacttag 61 caaccgtgca acaacaacac catagccagg cagctgtgta tccaggggaa cgaacccatt 121 ggtccaaggt gttaccaagc ctctaggcaa taatctccca gttacggcca tacaaacctg 181 attcatgatg cctagaattc tgagcaaagc aaaacagagc atccatgtac caatcatttc 241 tgtggatata tactcaacgc ggactagctt ggtctctttt gggcgccgcc tccgagagaa 301 tgggcatcag ccataggtcc ctttcttttt tccttttccc atccccctcc cctccctacc 361 catcactctc tcttccaaag agtaagcaaa actgtgctgc cggagtaggc agccgtcgaa 421 tcagaaaggc agccattcaa cgatgcgcaa acatgtgggg attggaaatc cagaccgcaa 481 agttacactc tgctcgggcc tatccgatac gcagcgataa gttgtcggtc cttggatagg 541 ggatttgttg tgatttacag gcatatgtat ctataagatg accacctgtc ttgagtagcg 601 gggaaaatct ctgctttcca ccatcaactt ttgacaatcc acaccattcc caaagttcat 661 aatgcgtgag attgtgagtt gtaccttgtc cacgtaccta ttgaggtatt gattctcagc 721 tgactcgggg atccaacaga tacacctcca aaccggccaa tgtgtacgta tctttcgctg 781 ccgaatggat gagccctgac tgacgaagaa tctgtagggc aaccaagtcg gtacagcatt 841 ctggtatgag gatgggaccg tcgatgcacc ttcaagagat gagactgatg ttggtcacta 901 ccaggcaaac cattcacggc gagcacgggc ttgatcaaga tggagtgtga gtctcaccgc 961 catgtaacgc gacgtcacct cgctcatcga gcgtcaagtt gacattttca tagattccgc 1021 ggctctgatg agcagcagtc cgagaggctg agcgtctact tcaccgaggt aactcgatac 1081 agtcgtcgcg cacttctccc tattcctttc tgacacgatg caggctgcca aacaaaagta 1141 cgttcctcgt gcagtcctgg ttgaccttga gcccgcaacc atggatgcaa tccgttccgg 1201 tccccttggt gacttcttcc gtcccgacaa catggtctac ggacagtccg gcgcgggtaa 1261 caactgggca aagggccact acaccgaagg cgccgagctg gttgaccaag tcctcgacgt 1321 cgtgcgccgc gaagccgaag cctgcgacag tctgcagggt ttccagatca cgcactccct 1381 cggtggcggc accggctctg gcatgggcac cctcctcatc gcaaaggtcc gcgaggaatt 1441 ccctgatcgt atgatggcga ccttctctgt cctgccgtct gccaaggtct ctgaggtcgt 1501 tgtcgagccg tacaatgcga ccttgtcgat tcatcagctc gtcgaaaact cggacgagac 1561 gttctgtatc gacaacgagg ccctgtatga tatctgccgc cgcaccctga agcaagcgca 1621 cccgtcctac ggacatctca accacctggt gtccagggtc atgtccggct tgacgacggg 1681 cttccgcttc ccggggcaat tgaacgccga cctccgcaag ctggccgtga acctcgtccc 1741 cttccccaga cttcacttct tcaccgtcgg attcgcgccg ctcacctcct cggcctcctt 1801 ctcaaacctg ggtatcgccg agctcacgca gcagatgttc gacccgaaga acgtcatgtt 1861 ggcgtctgat ttccgtgacg gacgcttctt gacttgctcc accatgttgt gagttggtcc 1921 cctcatgtcg cttgatgaat tctttgctaa cgtaccttta gccgtggaaa ggtgtccatg 1981 aagcaggtcg aggaacagat ccaggccatc aagaacaaga actcagccaa cttcgtggag 2041 tggatcccga ataacatcca gactgctcat tgctctgtgc cgcccaaggg actggacgta 2101 tcatcgacgt tcattggaaa ctccaccgca atccagaaca gtttccgcag agtcggcgac 2161 cagttcagcc tcatgttccg gagaaaggct ttcttgcatt ggtacactgg agagggcatg 2221 gatgagatgg aattcactga ggccgagtct aacatgaacg atctggtatc cgagtaccag 2281 cagtaccagg atgcaggcat ggatgacgag tacggcgaag agtacgaaga cgaggctcct 2341 gctgaggagg aatgagaagg caagacaact cttcgatgtc cttcccgagg tgaagttact 2401 ttatattcaa ggtagttgat ggtctttgaa ccttttccaa tcactgtaca cacccaatcc 2461 accatcatag tccgagcagc tcacttgaag ttattttccc tttcgactga gaaggcaata 2521 gagttgttct atctcgttgc ttgtgcagac aacgaagtgg atcgttacgc aacccctagt 2581 cttgacttgc aaatcatatc gtcccaagat ccagatggct attagttcat ggatcggtac 2641 ccttgtagat ccacctgaag tggctccccg gg // LOCUS COGTUB1B 2614 bp ds-DNA PLN 13-JUL-1990 DEFINITION C.graminicola beta-tubulin-2 (TUB2) gene, complete cds. ACCESSION M34492 KEYWORDS beta-tubulin-2. SOURCE C.graminicola DNA. ORGANISM Colletotrichum graminicola Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Diaporthales; Diaporthaceae. REFERENCE 1 (bases 1 to 2614) AUTHORS Panaccione,D.G. and Hanau,R.M. TITLE Characterization of two divergent beta-tubulin genes from Colletotrichum graminicola JOURNAL Gene 86, 163-170 (1990) STANDARD simple staff_review FEATURES from to/span description pept 441 452 beta-tubulin-2 (TUB2), exon 1 655 678 beta-tubulin-2, exon 2 738 763 beta-tubulin-2, exon 3 840 881 beta-tubulin-2, exon 4 942 996 beta-tubulin-2, exon 5 1068 1858 beta-tubulin-2, exon 6 1917 2310 beta-tubulin-2, exon 7 IVS 453 654 TUB2 intron A IVS 679 737 TUB2 intron B IVS 764 839 TUB2 intron C IVS 882 941 TUB2 intron D IVS 997 1067 TUB2 intron E IVS 1859 1916 TUB2 intron F BASE COUNT 561 a 802 c 607 g 644 t ORIGIN 1 cccgggtctt ggatcggaga gaacagagga gcagaatagg gattattggg taggcaggta 61 cgtgcccagg taacacgtcg tgcctgccaa tcaacgccga tctgagtcct cgacttgttg 121 cgacccacca tcaaatagtg cctgggaatg gtctttcccc aatcaggatc gtgcacggat 181 cccaatagta aacaagcgag ctgcaccctt ttcctctctg gcctgtctct gggcacctgg 241 cctgtggcgg tgagcgaaat cacgtttacc cgcaaaacaa aaatcaacaa cctttcccct 301 acctaaccac accttaatct catccacttt ccaaccacgt ccaccttgga agcttcgcgc 361 agctctcaag cacctcccac tcgtcctctc ctttgctcca tcgtcggcct agtcagctca 421 agagcttttc tccattcaca atgcgtgaga ttgtaagtcc ttcccctcaa tcattcgtaa 481 caaataaacc tgcgaccgac gcgtttggcg acgaatcgtc ggccttgccc ctgaacgtac 541 cccgccgaca tttccaccca acactggtcc tcaccgaaga cgaccacgat tgccatcacc 601 gacagtatgc accttgggga tatatcgctg accattgatt gttatactcg ataggttcac 661 ctccagaccg gccagtgcgt aagtcttctc tgatcccaac caacaatcca aggtgcgggg 721 ctaacttctt tgaatagggt aaccagattg gtgctgcctt ttggtgcgta gccagaccga 781 catcatcgac ttcggcgaga ttggcctcga aaagacattg gatattaata cgggcacagg 841 caaaacatct ctggcgagca cggcctcgac agcaatggcg tgtatgttgc caacctccag 901 atctggccac ttcctcgagt tcaccgctaa tttctcaaca gttacaatgg cacctctgag 961 ctccagctcg agcgcatgag cgtctacttc aacgaagttt gttatcctag ccccccccag 1021 gaagcagaca aacctattga tgaatactga ccttgtcacg tacccaggct tccggcaaca 1081 agtatgtccc tcgcgccgtc ctcgtcgact tggagcccgg taccatggat gctgttcgcg 1141 ccggcccctt cggccagctt ttccgccccg acaacttcgt ctttggccag tccggtgccg 1201 gcaacaactg ggccaagggt cactacaccg aaggagctga gcttgtcgac caggtccttg 1261 acgtcgtccg ccgcgaggct gagggctgcg actgccttca gggcttccag attacccact 1321 cccttggtgg aggtactggt gccggtatgg gtactctgtt gatctccaag attcgcgagg 1381 agttccccga ccgcatgatg gctacctttt ccgtcgttcc ctcccctaag gtttccgaca 1441 ccgttgttga gccctacaac gccactctct ccgtccacca gctggttgag aactccgatg 1501 agaccttctg cattgataac gaggctctct acgacatctg catgcgtact cttaagctct 1561 ctaacccctc gtacggcgac ctgaaccatc tcgtctctgc cgtcatgtcc ggtgtcacta 1621 cctgcctgcg tttccctggt cagctgaact ctgacctgcg taagctggcc gtcaacatgg 1681 ttcctttccc ccgtcttcac ttcttcatgg tcggattcgc tcccctgacc agccgtggtg 1741 cccactcttt ccgcgctgtc agcgttcctg agctcaccca gcagatgttc gaccccaaga 1801 acatgatggc tgcctctgac ttccgcaacg gtcgctacct gacctgctct gccatcttgt 1861 aagtgtcatt tccagccaac ctacaatgtc agcacttgct aacagctgcc tctcagccgt 1921 ggtaaggtcg ccatgaagga cgtcgaggac cagatgcgca acgtcctgaa caagaactct 1981 tcttacttcg tcgagtggat ccccaacaat gtccagaccg ccctctgctc cattcctccc 2041 cgcggcctca agatgtcctt cacctttgtc ggtaactcta ccgccatcca ggagctcttc 2101 aagcgtgtcg gtgagcagtt cactgccatg ttccgtcgca aggctttctt gcattggtac 2161 actggtgagg gtatggacga gatggagttc actgaggctg agtccaacat gaacgacttg 2221 gtctctgagt accagcagta ccaggacgct ggtgttgatg aggaggagga ggagtacgag 2281 gatgacgccc ccctggagga ggaggtttaa gcgttgtctg aaaatgctgt gccaccttgg 2341 ccatgtcttc acccaacccg tctgcggtgg catttcgctt cactattcca gctctgcact 2401 ggaaatgggc ttctagatat acctctctta gtagttcgcc tggcgtatca aaatgagtac 2461 gaagaatcag agattactct gtacaaatta ttggcaacat caaatgcata gttttatggc 2521 aattgcgaca cctctaatct tgccagagtt caagtattcg tatcttcttt cgtgactgac 2581 agattatcta tgtttaaacg tggacaccct cgag // LOCUS DDICNPA 6372 bp ds-DNA INV 13-JUL-1990 DEFINITION D.discoideum cyclic nucleotide phosphodiesterase gene, complete cds. ACCESSION M23449 KEYWORDS cyclic nucleotide phosphodiesterase. SOURCE D.discoideum (strain NC4; cell line AX3-K) DNA, clones pGP-1 and pC34. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 6372) AUTHORS Podgorski,G.J., Franke,J., Faure,M. and Kessin,R.H. TITLE The cyclic nucleotide phosphodiesterase gene of Dictyostelium discoideum utilizes alternate promoters and splicing for the synthesis of multiple mRNAs JOURNAL Mol. Cell. Biol. (1989) In press STANDARD full staff_review REFERENCE 2 (bases 3380 to 4379) AUTHORS Faure,M., Franke,J., Hall,A.L., Podgorski,G.J. and Kessin,R.H. TITLE The cyclic nucleotide phosphodiesterase gene of Dictyostelium discoideum contains three promoters specific for growth, aggregation, and late development JOURNAL Mol. Cell. Biol. 10, 1921-1930 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.H.Kessin, 23-MAR-1989. FEATURES from to/span description pept 4127 4146 cyclic nucleotide phosphodiesterase, first expressed exon (EC 3.1.4.17) 4297 5635 cyclic nucleotide phosphodiesterase, exon 2 pre-msg 1163 6060 CNP mRNA and introns IVS 1780 2266 CNP intron A (no splice consensus at 2266) IVS 2377 4077 CNP intron B IVS 4147 4296 CNP intron C signal 6015 6020 poly-A signal BASE COUNT 2615 a 750 c 504 g 2503 t ORIGIN 1 tgatcatttt gtaaatcttt caattaaatt tcaacttaca cattaataat aataaaaata 61 ataataataa taatatcaga cattcacagg atttttataa atccaatttt atttttcatt 121 gatgtcgttt tattaattga cattcaaatt gtgaataact gacaataaat aaaacagttc 181 atatatttat ttgtttgaaa aaaaaaaaaa aaaaaaaaaa tcaaagttgc atgtgcgagt 241 gtaaattggg gtttttttta taaatctaaa ttagttattt tacaatttta gaaccactac 301 caaattacaa aaaaaaaaaa aaaaaaacca ctcatttatt tttttttttt tttttttttt 361 ttttatcaca tagattccaa taaaattgtg aaacataaag aggaaaacga taatatggat 421 tattagaaaa taatcaatac tttttttttt tttttttttt tgtgttttat tggtttttaa 481 ttaaaagttt tgggggtttt ttattttaat tttttttttt tttgaaaaac atactcactc 541 acacaaaacg caatcaataa atctattttt tgaaaataaa aaaaatcttt tttttttttt 601 tttttttttt ttttctagtt ttcaaaaaat tcatttaatc gcattacatt ataaaataaa 661 aaaattcggt gtcttaaatc tcccaaaatc ttgaatgatt tgtgtgtgtt aaaataataa 721 tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaatt ccaactttta gatccccaat 781 cacataggtg tgatatatct aagttattga aaattgtgtt gatattgttt ttcttacttt 841 tttttttttt ttaaaataac cactttactt tctaaatatt gcatcgcaat gggggtatta 901 aaagttttgt tatttttatt tacacatctt gatttcaaga taaaaatttt tagtgtaaat 961 atatcaaaca ccatagagac ttttcacatt atcaagtgtt tttgactctc tatcacactc 1021 acaaatctga tttttatttt tatttttatt tttttttttt ttttatttta tttttttttt 1081 attttatttt atttttctcc aaccacaatt gatttcaaaa aaaaaaaagt ctataaaatc 1141 aaaaaaaaat ttttttttta attaatcaaa aacaaaaaca aaaaacaaaa ataaataaaa 1201 aaaaaaaaaa aaaacttgta acattaataa taataatttg taattttaaa taccaacttc 1261 aattccatat tacttttttt gtatcattat ttatatatct caatctattt taaatcacca 1321 ttgtatcact actaaaaaaa aaaaaaaaaa aaaaatcaaa aatcaaaaat ccaaatttat 1381 aaaaaaaaaa aaaaaaatca aaattgtaaa aaaataacaa tatcaccctt ttttctgtat 1441 tattacttgt atttgtaaca ctataaatta aataaataaa aatttaattt tgtatcatta 1501 ccacactatt aatttttttt ttacaatttt tgatcaataa taaatcgtta aaaaaaaaaa 1561 aaaaaaaaat ttttaataat aaaataaatt aataccaaaa ataatcaacc aaccaaccaa 1621 aaattattta attatttatt aaccatttta tcataattaa taaataaaaa aaaaaaaaaa 1681 ataaaaaaaa aacatcaaaa aaaaaaaata aaaaaacaat accatatcaa attatctctc 1741 gttctgtact attactcctt aaaaaattta ttaaaaaagg tatgtaaata atataattat 1801 tattatttta tttattatta tgattccatt taaatgtttc aaaatttggt ttttttattt 1861 tttccttatt ttcgttcgct ttgaccaatc attttttttt attattttaa tttttaatta 1921 tttttcgggt tcattccaac atgacaatgg aaaaaattaa aaaagggtat aggaaaagcg 1981 caaatcaaaa ggaaaaaaaa aaatatcttg cgtttgccaa tttttatttt tttttatttt 2041 tttttttatt tttttttttt ttgaaatcaa tcaaaatttc caaccttata ataatccaat 2101 ttaatcacaa atccagtgat tttatcgttg tgtgtgtgtg gtctgatttt aaaatcacca 2161 actgtttttt tttataatta tttttttttt attttatttt tcaatttctt ttcttttttt 2221 tttttttatt ttttaatatc ttttattttt tttttttttt ttttttttat tattattaat 2281 aatcataaaa ctgtaatagt acttaaattg taacaaaata aaaatataaa aataataata 2341 ataatttaac aataagaata gcaacaatta ttcattgttt gtatttaatc aatatttaga 2401 atttttatta tttttttttt atatttatta tttattatct atctagaata caagtagaaa 2461 atataattat ataattttat aatacccatt ataatatatt ttattttatt ttattttatt 2521 ttattttttt ttctcctaaa atcacccaaa caccccacac acaattttat gtaatattca 2581 ataacaacat tgattttcac accactcact gcccaattca attcaaacga tagtgagtaa 2641 ttaattttgt gtttagatgg tagatagaga tgatgtattt tattttttta tatcatccca 2701 agttccatat aaatcaacat tttatgatgt ttatttgttt ttcaatataa catttttatt 2761 taataaaaaa atttaattaa atcatatata acactactta ataaaaacat caataatcac 2821 accacccact accaataata aaaaataaat aaaaaaaata aataaataaa aaattaatta 2881 aatgtttata tttttataac accacacata cgcccacact ccctctttct ccccactact 2941 aaacttgata tttttattgt tttttttttt atacaataaa acattgattg ttatattttt 3001 aattttatga ttaaaaaaaa aaaaaaaaaa caaaaacaaa aactatatat aaaaataccc 3061 aaaaataaaa tatttttaag ggtagagtga taataattaa tcatacacaa aataatgaaa 3121 gtcacaagta gttataaaaa aataaaaaaa aaaaaaaaaa aaaggaaaaa aaaaaaaagt 3181 aataataata ataagaataa taaaaataat aaaaataatt taaaccctat atattgacat 3241 aatggatagc tctataatgt ataatagtag tatatttatt ttaaattata tataaaaata 3301 atttacgatt tcaaagatcg aaaccaatta taacactact gttatgaaac aaatgtgtat 3361 attttacact aacctcgtcg atatctattg tgtgtaacaa ttaggggttt tattatttcc 3421 ttttttttta aaaaaatttt tttttttttt ttatttacta ctatttttaa aaaaaaaaaa 3481 aaaaaaaaaa aaaaaaaaaa gataatttat tttgatcttt tatgtgttga tacacctttt 3541 catgtttaca cacaaacaaa aaacactcga taaattttat tattattttt tatttgtttt 3601 taatttaatt gatattaaaa aaaaaaaaat taaaaaaaaa aaaaaaaaaa aattttaagt 3661 gatatataat ttttaaaatt ttttatttat ttattttttt aaaaaattaa tttttcaaac 3721 aataatacat tcaaaattaa aacaataata acagattcaa aatatcattt ggttttgttt 3781 tttttttttt ttttatttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaattata 3841 catataaata tttataacaa taacccacaa acgccacaca ctcacttatt ttttttaatt 3901 ttaaatttta tttattttat ttatttattt atttatttaa tcaaaaaaaa aaaaaaaaaa 3961 aaaactaaaa ttaaaattac taacttattt ttttttttat aaatatatat aaaaaaaaaa 4021 aaaaaataaa attttttttt tttccaatat ctttatcttt tttttttatt ataaaagaag 4081 aaagaattaa tatagttcga ataactacaa aaaatataca aaaaaaatgg cattaaataa 4141 aaaattgtaa gtggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaattta aaaatagaaa 4201 ttgatttgtt gtttatacta attttttttt tttttttttg gaattttttt tttggaaatt 4261 tttatttttt attttttttt tttttcccaa aaatagaatt agtttattac ttttaatttt 4321 tataatttta aatattgtaa attcccatca acaagaagat tgtgatgatg acgatgaaga 4381 tataggaatt tcagcagaga gatcagaaag aagatcagtt aaaaacagta acgatggtag 4441 taatttttat aatttaaatg attattatac accagagaat tggaatcatt atagtggatc 4501 atttgcaact aaagattgta gggatgcatc atacataacc ataccattgg gtacaactgg 4561 tggattggat gaaggtaatc tctcatcatt cctcttgaca aagaaaggta gtaatctttt 4621 cattgcattg gatgcaggta cagtatggca aggtgttaga agattaacca ccttcaaata 4681 tttcaataca ctcttcaata ttacctatcc atcatgggca gttttaccag agcaaagaac 4741 ctcatggttc ctcaagaaac acgttatgag ttatttcatt ggtcacagtc atttagatca 4801 tgtaggtggt ttaattttag tttcaccaga ggattattta gcaaagaatt ggatcgatgt 4861 tcaaccacca atcaacaatg gtatcatggg tctcattcgt aaattagggt tcaaaccaac 4921 tgatttcacc tcttcaagta tcctccaaaa gaaaaccatc atgggtttac catcaaccat 4981 caattcaatc tctactaatc ttttcaataa tcaagtttgg ccaaatcttc catcatttgg 5041 tcgttatcaa tactttagtt tagcaagtgg cattgaatat ccattcacag agttggtccc 5101 atacaatgcc accactatgt ctttggtcgc caatgaattc cctttctctg tcaaagttaa 5161 accatttgaa ttgtgtcacg ataatttaat atcaacttct ttcctcttca ctgatagtat 5221 ctctggtgaa caaatcgctt tcttctctga tactggtgtt ccatcctctg tcgcttgtga 5281 ttgggaaggt aaaatctatg ccgtctggaa acaaattaaa atcgataaat taaaagctat 5341 ctacatcgaa acttctttcc caaataatac tccagactct gctatgtttg gtcatctccg 5401 tccaagagat gtcatgaaat taatggatca actcttggtt caatccattc aaacctctcc 5461 accaatgact aatctcaaac atgttaaatt aatcatcgaa cacattaaac cacaagttgc 5521 tgaagatcca aatggttgga caactcaaag agtaatctat caacaattaa aagaagcaaa 5581 taataatggt gttagaatta ttattccaaa tcaaggtgat ccaatttgta tttaaataat 5641 taattaatat aatatataat aattaaacat atttatttat ttattccacc aaaaaaaaaa 5701 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa attaaaaaaa aaaaaaaaaa aaaaaaatta 5761 aataataata ataactaaaa tagaaaacaa taccaaaaaa aaaaacaaaa aaaaaaaaaa 5821 aaaccaatcg acaatttttt tttttttttt tctctaaaac aaatacataa aaaaaatcat 5881 cttataaata actcctcctc ctttcccctc cttctataat tattcagttt cattaattgt 5941 atagattttt tttttttaag tcaaccaacc tccttttttt ttaaaccaca aaaaaaaaaa 6001 aaaaaaaaaa aaaaaataaa ccattttatt tttttttttc caaaaaaatt ttccagtttt 6061 tttttttttt tattttagaa acattttatt tttactttta aaaaatatgg gtgtttattt 6121 tgagaaatat attttttatt taataacaac accctttaag gaaagataga taataattaa 6181 ttatattttt attttacata atgcattttt tttttttttt ttgggtttta ttttattatt 6241 ttttttttta ttgttttaat attattattt ttttttcata aaactttttg agaatcacta 6301 tggaagtcac ttaaattatt attttctctt tcgttatcat cttcttggaa atcaacttgt 6361 tcatcttgat ca // LOCUS HUMCPGISL 2520 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human nested gene protein gene, complete cds. ACCESSION M34677 KEYWORDS coagulation factor VIII; nested gene protein. SOURCE Human 4X B cell line (ACTT GM1202) DNA, clone p542. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2520) AUTHORS Levinson,B., Kenwrick,S., Lakich,D., Hammonds,G.Jr. and Gitschier,J. TITLE A transcribed gene in an intron of the human factor VIII gene JOURNAL Genomics 7, 1-11 (1990) STANDARD simple staff_review FEATURES from to/span description pept 585 1268 CpG island protein, exon 1 1271 1681 CpG island protein, exon 2 IVS < 1 > 2520 coagulation factor VIII, intron 22 mRNA 528 2265 CpG island protein mRNA (5' end = end of cDNA) binding 523 532 Sp1 binding site binding 543 552 AP4 binding site BASE COUNT 418 a 843 c 761 g 498 t ORIGIN Chromosome X. 1 cccgggaaag tcctcagcat gatggtgttc gttggggcgg gtgggcctca agaccatgac 61 tgcagtgttg cgacaggacg gacttctaag gcaaccgggg tggggacgga gaggtggacc 121 cggtgcttgg caaggggccc tggaccggaa gacggaggac tgagtagtcg gggcgggggg 181 ggggggggac aacagtctag aaatgcgcag acatggaatt agggtcacga gtttcgtttg 241 ggtcatgttc agatggagag gctgtgggat gcctttagca acctagacca cttgcattgc 301 agccgagcta gcagcagctg gtgccccgcc aggccccgcg taccttccca gggtcttgga 361 tccgcatccc cgagaccctc gatttgcata cgccgctcac agcccggcca ggccccgcct 421 ccgatcccgc tctttgcacc tcatcagggc aaggggcagg ctcgggctgg ggagccggcg 481 cgggcacggc ggcagcccgc cgagggggac gcagggcacg tcgccccgcc ccggcccgcc 541 cgccagctgc ggcggcgcgt cgggaagtgc ttggggcggc gagcatggcg gcagcggctg 601 caggcctggg cggcggcggc gccggcccgg gacccgaggc cggggacttc ctggcccgct 661 accggctggt atcgaacaag ctgaagaagc ggttcctgcg gaagccgaac gtggcggagg 721 ccggcgagca gttcggacag ctgggccggg agctgcgcgc ccaggagtgt ctgccctacg 781 cggcctggtg ccagctggcg gtggcgcgct gccagcaggc gctcttccac gggcccgggg 841 aggcgctggc cctcaccgag gccgcccgcc tcttcctgcg gcaggagcgc gacgcgccag 901 cgctggtctg ccccgccgcc tacggggagc cgctgcaggc cgccgccagc gccctgggcg 961 ccgcggtgcg tctgcacctc gagctgggcc agccggccgc cgccgccgcc ctctgcctcg 1021 agctggccgc cgccctgcgc gacctgggcc agccggccgc cgccgccggt cacttccagc 1081 gcgccgccca gctccagctg ccccagctgc ccctggccgc gctgcaggcg cttggcgagg 1141 ccgcctcctg ccagctgctg gcgcgcgact acaccggcgc cctggcggtc ttcacgcgca 1201 tgcagcgcct ggcgcgggag cacggcagcc acccggtgca gtcactgccg ccgcccccgc 1261 cgccggcacc ccagcccggg cccggggcga cgcccgccct accggccgcg ctgcttcctc 1321 cgaactccgg ctcggcggcg ccctctcccg cgccctgggc gccttctcgg acgtgctggt 1381 ccgctgcgag gtgtcccgcg tgctgctgct gctcctcctg caaccaccgc ccgccaagct 1441 gctgccggag cacgcccaga ccctggagaa gtactcctgg gaggcttttg acagccacgg 1501 gcaggagagc agcggccagc ttcccgagga gctctttctg ctgctccagt ctttggtcat 1561 ggctacccac gaaaaggaca cggaagccat caagtcgctg caggtggaga tgtggccact 1621 gttgactgct gagcagaacc acctccttca cctcgttctg caagaaacca tctcccccta 1681 ggacagggag tctgatccat cccattcacc cagtgacttc tttttgccca ggcctggact 1741 ttttgcatca gtcacgttaa ccagatgact ttgcctgtta ccaaacctca tgcatccacg 1801 tttgcgtctg gggaggaata aaaagacatc gttcccgctt ctgcgttttg ttattcctac 1861 tgccgccata ggaattattt cgttggctga acgttaccag caccccgaga acacattttg 1921 atagaatcag agtagaggac atggctgtct tctaaaaagc cacgacatga aaatgacaat 1981 ccctttcgtc tccttcctcc gctgcttcca cctaacgcag cctcctgcct ccgcctttgt 2041 ttcatagtga ggattttatt ttgcacggcg ccctccctct aaatacctac cctagatgat 2101 ttcatcctgc ccctcacttc ttaacatatt cctgtgtctt tgtaatggcc aaatttctcc 2161 ttccactcgt ccgcacggta cgtcttcatg ggagtcattt tattccttac agcttccttg 2221 taaccacagc cctaaatcca tgataaagct actcttgtac tcttcactca tgtgtggttt 2281 tggtgtccgt cgtctataaa atgtgactct cttacctact tgtcaggagg ccagcagcag 2341 aaccagatag ctgtttgggc tccaggatgt attcgttcct ttgaaatttg cgctatgcct 2401 cctatgtggg caacgaacct gcagagatca tctatgatgt cctccgtact cgggtgactg 2461 caatgcagat acatccagtg cagcgggtct ggtgccgatg catcgatttg cattgaattc // LOCUS HUMIGHCY 372 bp ss-mRNA PRI 13-JUL-1990 DEFINITION Human Ig active mu-chain mRNA Vv-D-J6-region, partial cds. ACCESSION M34678 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; variable region. SOURCE Human CD5+ EBV transformed B-cell, cDNA to mRNA, clone EBV 383. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 372) AUTHORS van der Heijden,R.W.J., Bunschoten,H., Pascual,V., Uytdehaag,F.G.C.M., Osterhaus,A.D.M.E. and Capra,J.D. TITLE Nucleotide sequence of a human monoclonal anti-idiotypic antibody specific for a rabies virus-neutralizing monoclonal idiotypic antibody reveals extensive somatic variability suggestive of an antigen-driven immune response JOURNAL J. Immunol. 144, 2835-2839 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 372 Ig mu-cahin Vv-D-J6-region (AA at 1) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" recomb 294 295 Vv-region end/D-region start recomb 316 317 D-region end/J6-region start BASE COUNT 84 a 107 c 108 g 73 t ORIGIN 1 gaggtgcagc tggtgcagtc tggagcagaa gtgaaaaagc cgggggagtc tctgaagatc 61 tcctgtaagg gttctggata cagctttagc acctcctgga tcggctgggt gcgccagatg 121 cccgggaaag gcctcgagtg gctggggacc atccatcctg gtgactctga cacaagaaat 181 agcccctcct tccaaggcca ggtcaccatc tcagtcgaca agtccatcag cacggcctcc 241 ctgcagtggg gcagcctgaa ggcctcggac accggtatat attactgtgc gagactgata 301 acaggtcctc gacactacta ctactacgct atggacgtct ggggccaagg gaccacggtc 361 accgtctcct ca // LOCUS HUMIGHCZ 306 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human Ig expressed mu-chain gene Vv-region. ACCESSION M34680 KEYWORDS immunoglobulin heavy chain; mu-immunoglobulin; variable region. SOURCE Human CD5+ EBV transformed B-cell DNA, clone EBV 383ex. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 306) AUTHORS van der Heijden,R.W.J., Bunschoten,H., Pascual,V., Uytdehaag,F.G.C.M., Osterhaus,A.D.M.E. and Capra,J.D. TITLE Nucleotide sequence of a human monoclonal anti-idiotypic antibody specific for a rabies virus-neutralizing monoclonal idiotypic antibody reveals extensive somatic variability suggestive of an antigen-driven immune response JOURNAL J. Immunol. 144, 2835-2839 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 305 Ig mu-chain Vv-region precursor (AA at 1) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" sigp < 1 12 Ig mu-chain Vv-region signal peptide matp 13 > 305 Ig mu-chain Vv-region BASE COUNT 66 a 81 c 97 g 62 t ORIGIN 1 ggagtctgtg ccgaggtgca gctggtgcag tctggagcag aggtgaaaaa gcccggggag 61 tctctgaaga tctcctgtaa gggttctgga tacagcttta gcaccttctg gatcggctgg 121 gtgcgccaga tgcccgggaa aggcctggag tggctgggga ccatccatcc tggtgactct 181 gacacaagaa atagcccgtc cttccaaggc caggtcacca tctcagtcga caagtccatc 241 agcacggcct tcctgcagtg gggcagcctg aaggcctcgg acaccggtat atattactgt 301 gcgaga // LOCUS HUMIGHVV 306 bp ds-DNA PRI 13-JUL-1990 DEFINITION Human Ig germline mu-chain gene Vv-region. ACCESSION M34679 KEYWORDS immunoglobulin heavy chain; mu-immunoglobulin; variable region. SOURCE Human CD5+ EBV transformed B-cell DNA, clone 251. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 306) AUTHORS van der Heijden,R.W.J., Bunschoten H Pascual,V., Uytdehaag,F.G.C.M., Osterhaus,J.D.M.E. and Capra,J.D. TITLE Nucleotide sequence of a human monoclonal anti-idiotypic antibody specific for a rabies virus-neutralizing monoclonal idiotypic anti- body reveals extensive somatic variability suggestive of an antigen-driven immune response JOURNAL J. Immunol. 144, 2835-2839 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 305 Ig mu-chain Vv-region precursor (AA at 1) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" sigp < 1 12 Ig mu-chain Vv-region signal peptide matp 13 > 305 Ig mu-chain Vv-region BASE COUNT 67 a 83 c 95 g 61 t ORIGIN 1 ggagtctgtg ccgaggtgca gctggtgcag tctggagcag aggtgaaaaa gcccggggag 61 tctctgaaga tctcctgtaa gggttctgga tacagcttta ccagctactg gatcggctgg 121 gtgcgccaga tgcccgggaa aggcctggag tggatgggga tcatctatcc tggtgactct 181 gataccagat acagcccgtc cttccaaggc caggtcacca tctcagccga caagtccatc 241 agcaccgcct acctgcagtg gagcagcctg aaggcctcgg acaccgccat gtattactgt 301 gcgaga // LOCUS MUSPLII 874 bp ss-mRNA ROD 13-JUL-1990 DEFINITION Mouse placental lactogen II (PL-II) mRNA, complete cds. ACCESSION M14647 KEYWORDS placental lactogen II. SOURCE Mouse (strain Balb/c) placenta, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 874) AUTHORS Jackson,L.L., Colosi,P., Talamantes,F. and Linzer,D.I.H. TITLE Molecular cloning of mouse placental lactogen cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 8496-8500 (1986) STANDARD simple staff_review FEATURES from to/span description pept 60 728 placental lactogen II (mPL-II) precursor sigp 60 152 placental lactogen II (mPL-II) signal peptide matp 153 725 placental lactogen II (mPL-II) mRNA < 1 874 mPL-II mRNA BASE COUNT 234 a 212 c 190 g 238 t ORIGIN 1 agcactcggg gaacagcagc cttctggtgt tgtcagcact tcaagagaac tcctcagaga 61 tgaagctgtc tttgagtcaa ccatgctcct tctctggggc actcctgttg ctggcagtgt 121 caaacctact tgtctgggag aaggtgactt ccttaccaaa ttatcgatta cccactgaaa 181 gcctatacca acgtgtgatt gtggtgtcac acaacgccca tgatcttgct tcaaaagcct 241 tcatggaatt tgaaatgaag tttggtagga cagcttggac ctatggcctg atgttaagcc 301 cctgccacac tgctgcaatc cttactccag aaaacagcga gcaagtccac cagacaacat 361 cggaagacct tctgaaagtg tccatcacta ttttacaagc ctgggaagag cctctgaaac 421 acatggtggc agcagtggct gctcttccac atgtacctga tactctgctg tcaagaacaa 481 aggagttgga ggaaagaatt caagggcttc tggaaggact gaagatcata ttcaataggg 541 tttacccagg agctgttgca agtgactata ctttctggtc tgcatggtca gatttgcagt 601 catctgatga atccactaag aacagtgctc ttagaacctt atggcggtgc gtgcgcaggg 661 atacacataa agttgacaat tacctcaagg tcctgaagtg tcgtgatgtt cataacaaca 721 actgctgaac atctatccct cttctctgtc tcttagaagg tccctcgtga tctagacctt 781 caaagcacca atgaatttcc ccttctttgg tgcctttcta gattgagttg tcatcttacc 841 cccaaataaa ctgattcttt gcaaatgcta aatt // LOCUS MUSRARGA 2715 bp ss-mRNA ROD 13-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma (mRAR-gamma-A) mRNA, complete cds. ACCESSION M34476 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse embryo, cDNA to mRNA, clone lambda-mE13.20. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2715) AUTHORS Giguere,V., Shago,M., Zirngibl,R., Tate,P., Rossant,J. and Varmuza,S. TITLE Identification of a novel isoform of the retinoic acid receptor gamma expressed in the mouse embryo JOURNAL Mol. Cell. Biol. 10, 2335-2340 (1990) STANDARD simple staff_review FEATURES from to/span description pept 253 1629 retinoic acid receptor gamma mRNA 1 2715 RAR-gamma-A mRNA BASE COUNT 578 a 839 c 741 g 557 t ORIGIN 1 cggggctctg cggccttcct tcccccgccc tcccctccag cagtttccac caggtccctc 61 acctcagcct ggcccagtat gtaggaggga atctctgcag aggccagagg gatccttgga 121 aacccactgg acagaccagg cagggtgggc acggagcctc ccaggccagg gcagtgggca 181 tgggcggggg ctgtagctga agaccacccc cgcctgctgc agagtccaag ggattcccac 241 gccgcagcta ccatggccac caataaggag agactctttg cgcccggtgc cctggggcct 301 ggatctggtt acccaggagc aggcttccca ttcgccttcc caggtgcact cagagggtcg 361 ccaccatttg agatgctgag ccctagcttc cggggcctgg gccagcctga cctccccaag 421 gagatggctt ctctctcggt ggagacacag agcaccagct cggaggagat ggtacccagc 481 tctccctcac ccccaccacc tcctcgggtc tataagccat gctttgtatg caatgacaag 541 tcttctggct accactatgg ggtcagctcc tgtgaaggct gcaagggctt cttcagacgc 601 agcattcaga aaaacatggt gtatacatgt caccgtgaca aaaactgtat catcaacaag 661 gtcaccagaa atcgatgcca gtactgcagg ctacaaaagt gtttcgaagt gggcatgtcc 721 aaggaagctg taaggaacga tcgaaacaag aagaaaaagg aggtaaaaga ggagggctcg 781 cccgacagct atgaactgag tccacagtta gaggaactca tcaccaaggt cagcaaagcc 841 caccaggaga cttttccctc actctgccag ctgggcaagt acaccacgaa ctccagtgca 901 gatcaccggg tgcagctgga cctggggctg tgggacaagt tcagcgagct ggccaccaaa 961 tgcatcatca agattgtgga gtttgcgaag cggctgcctg gttttacagg gctcagcatt 1021 gccgaccaga tcacgctgct caaggctgct tgtctggaca tcctaatgct gcggatctgt 1081 acaaggtata ccccagagca ggacactatg acattctcgg atgggctgac cctgaaccga 1141 acccagatgc acaatgctgg ctttgggccc cttacagacc tcgtctttgc ctttgccggg 1201 cagctgctgc ccctggagat ggatgacacc gagactgggc tacttagtgc tatctgcctc 1261 atctgtggag accgaatgga cctggaagag cccgagaagg tggacaagct gcaggagccc 1321 ctgctggaag ccctgaggct ctatgcccgg cgacgagacc cagccaaacc ctacatgttc 1381 ccaaggatgc tgatgaaaat caccgacctc cggggcatca gcactaaggg agcagaaagg 1441 gctataaccc tgaagatgga gattccaggc ccgatgccac ccctgatccg agagatgctg 1501 gagaacccgg agatgtttga ggacgactcc tcgaagcctg gcccccaccc caaggcttcc 1561 agtgaggacg aagctccagg gggccagggc aaaaggggcc aaagtcccca acctgaccag 1621 gggccctgac ctaccccgtt gtggggttgg gccccaggca gcagactgac catttcccag 1681 ataccgccag tgactggggg aggacctgcc ccgcccactc tccacccttt taatgagctc 1741 gttattttgc caaagtttct aggggtgcct gtgttcatcc ctgtcctgtt ctaactggct 1801 ccctctacag tcccgcgggg atctgctgca cacctaccag aagagctgga gaaggggcga 1861 gcctgggtct agactctaaa atctcagcac tgcctttcgg attcaggcca caggctcccg 1921 acgcaagagg aagccccgcc ttcccatagc cttttcctct gccaggtgct tgggcctctg 1981 ggagcaaaca ggaacactag agaccaaaag gggggccccc cccgagggga gggctgagcc 2041 acccttgtgc cccaccctga gcgcctgact ctgtgtgatg aatctgccgc gctcagtgcc 2101 cccggtgccc catcctaatc ctcctgccgg gtgtaggggc aggctggggc ctgcgttttc 2161 tagggtgggg cctgagggga agggagaatg tgagcccgag cccgtctgta cctctgctga 2221 agtacagcac tcatcctggg ccctccatta tataagcctc tcccttactt ctgtctcgtg 2281 ccttggcctc ccgccctcat ctcagccatc aggcagggcc ctcctacact acagaggggc 2341 caggggtccc tctcctagtg ccttctaccc ttgatccccc cagagcagct tggccccggg 2401 gaggggagat gctgcttagc tgatcccacc ttgacccaga ggaagcctct atttatttat 2461 tagcttttgt ttacaccctg gaaatgaccc cttcctccag gggtcttggg agggggagcc 2521 cagggcctct gtgacccctt cctttctttc cccaagcccc cagtttatat ttagctgcca 2581 aataagattc ctgcaggctc ccccatttct ctggggggtt agggtgctgt cccctcccct 2641 ctgtttatat ctcctacccc gctgttcgca tattgctgag ttttctattt ttgcaaaata 2701 aagtgatgga aactc // LOCUS MUSRARGB 2496 bp ss-mRNA ROD 13-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma (RAR-gamma-B) mRNA, complete cds. ACCESSION M34475 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse embryo, cDNA to mRNA, clone lambda-mE7.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2496) AUTHORS Giguere,V., Shago,M., Zirngibl,R., Tate,P., Rossant,J. and Varmuza,S. TITLE Identification of a novel isoform of the retinoic acid receptor gamma expressed in the mouse embryo JOURNAL Mol. Cell. Biol. 10, 2335-2340 (1990) STANDARD simple staff_review FEATURES from to/span description pept 67 1410 retinoic acid receptor gamma mRNA 1 2496 RAR-gamma-B mRNA BASE COUNT 533 a 751 c 684 g 528 t ORIGIN 1 aggaggggga atgggctgcg ggtccttggt gttctagcac ccagtttctc tccaagccgg 61 gtcgcgatgt acgactgcat ggaatcgttt gtcccgggtc cgcgacggct gtatggggcg 121 gccgggcccg gggccggctt actacgcaga gccactggca gctcctgctt cgccggactt 181 gagtcttttg cctgggcaca gcctgccagt ctacaatcgg tggagacaca gagcaccagc 241 tcggaggaga tggtacccag ctctccctca cccccaccac ctcctcgggt ctataagcca 301 tgctttgtat gcaatgacaa gtcttctggc taccactatg gggtcagctc ctgtgaaggc 361 tgcaagggct tcttcagacg cagcattcag aaaaacatgg tgtatacatg tcaccgtgac 421 aaaaactgta tcatcaacaa ggtcaccaga aatcgatgcc agtactgcag gctacaaaag 481 tgtttcgaag tgggcatgtc caaggaagct gtaaggaacg atcgaaacaa gaagaaaaag 541 gaggtaaaag aggagggctc gcccgacagc tatgaactga gtccacagtt agaggaactc 601 atcaccaagg tcagcaaagc ccaccaggag acttttccct cactctgcca gctgggcaag 661 tacaccacga actccagtgc agatcaccgg gtgcagctgg acctggggct gtgggacaag 721 ttcagcgagc tggccaccaa atgcatcatc aagattgtgg agtttgcgaa gcggctgcct 781 ggttttacag ggctcagcat tgccgaccag atcacgctgc tcaaggctgc ttgtctggac 841 atcctaatgc tgcggatctg tacaaggtat accccagagc aggacactat gacattctcg 901 gatgggctga ccctgaaccg aacccagatg cacaatgctg gctttgggcc ccttacagac 961 ctcgtctttg cctttgccgg gcagctgctg cccctggaga tggatgacac cgagactggg 1021 ctacttagtg ctatctgcct catctgtgga gaccgaatgg acctggaaga gcccgagaag 1081 gtggacaagc tgcaggagcc cctgctggaa gccctgaggc tctatgcccg gcgacgagac 1141 ccagccaaac cctacatgtt cccaaggatg ctgatgaaaa tcaccgacct ccggggcatc 1201 agcactaagg gagcagaaag ggctataacc ctgaagatgg agattccagg cccgatgcca 1261 cccctgatcc gagagatgct ggagaacccg gagatgtttg aggacgactc ctcgaagcct 1321 ggcccccacc ccaaggcttc cagtgaggac gaagctccag ggggccaggg caaaaggggc 1381 caaagtcccc aacctgacca ggggccctga cctaccccgt tgtggggttg ggccccaggc 1441 agcagactga ccatttccca gataccgcca gtgactgggg gaggacctgc cccgcccact 1501 ctccaccctt ttaatgagct cgttattttg ccaaagtttc taggggtgcc tgtgttcatc 1561 cctgtcctgt tctaactggc tccctctaca gtcccgcggg gatctgctgc acacctacca 1621 gaagagctgg agaaggggcg agcctgggtc tagactctaa aatctcagca ctgcctttcg 1681 gattcaggcc acaggctccc gacgcaagag gaagccccgc cttcccatag ccttttcctc 1741 tgccaggtgc ttgggcctct gggagcaaac aggaacacta gagaccaaaa ggggggcccc 1801 ccccgagggg agggctgagc cacccttgtg ccccaccctg agcgcctgac tctgtgtgat 1861 gaatctgccg cgctcagtgc ccccggtgcc ccatcctaat cctcctgccg ggtgtagggg 1921 caggctgggg cctgcgtttt ctagggtggg gcctgagggg aagggagaat gtgagcccga 1981 gcccgtctgt acctctgctg aagtacagca ctcatcctgg gccctccatt atataagcct 2041 ctcccttact tctgtctcgt gccttggcct cccgccctca tctcagccat caggcagggc 2101 cctcctacac tacagagggg ccaggggtcc ctctcctagt gccttctacc cttgatcccc 2161 ccagagcagc ttggccccgg ggaggggaga tgctgcttag ctgatcccac cttgacccag 2221 aggaagcctc tatttattta ttagcttttg tttacaccct ggaaatgacc ccttcctcca 2281 ggggtcttgg gagggggagc ccagggcctc tgtgacccct tcctttcttt ccccaagccc 2341 ccagtttata tttagctgcc aaataagatt cctgcaggct cccccatttc tctggggggt 2401 tagggtgctg tcccctcccc tctgtttata tctcctaccc cgctgttcgc atattgctga 2461 gttttctatt tttgcaaaat aaagtgatgg aaactc //
GenBank-Updates@genbank.bio.net (07/14/90)
LOCUS RATCYP7 3535 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat cholesterol 7-alpha-hydroxylase (CYP7) mRNA, complete cds. ACCESSION J05430 KEYWORDS cholesterol 7-alpha-hydroxylase. SOURCE Rat (Sprague Dawley) adult male liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3535) AUTHORS Jelinek,D.F., Andersson,S., Slaughter,C.A. and Russell,D.W. TITLE Cloning and regulation of cholesterol 7-alpha-hydroxylase, the rate -limiting enzyme in bile acid biosynthesis JOURNAL J. Biol. Chem. 265, 8190-8197 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W.Russell, 07-MAR-1990, for release after publication. FEATURES from to/span description pept 30 1541 cholesterol 7-alpha hydroxylase (CYP7) (EC 1.14.13.17) rpt 2195 2313 Alu repeat BASE COUNT 1035 a 705 c 696 g 1099 t ORIGIN 1 cgctttggaa attttcctgc ttttgcaaaa tgatgactat ttctttgatt tggggaattg 61 ccgtgttggt gagctgttgc atatggttta ttgttggaat aaggagaagg aaagctggtg 121 aacctccttt ggagaacggg ttgattccgt acctgggctg tgctctgaaa tttggatcta 181 atcctcttga gttcctaaga gctaatcaaa ggaagcatgg tcacgttttt acctgcaaac 241 tgatggggaa atatgtccat ttcatcacaa actccctgtc ataccacaaa gtcttatgtc 301 atggaaaata ttttgactgg aaaaaatttc attacactac ttctgcgaag gcatttggac 361 acagaagcat tgacccaaat gatggaaata ccacggaaaa tataaacaac acttttacca 421 aaaccctcca gggagatgct ctgtgttcac tttctgaagc catgatgcaa aacctccaat 481 ctgtcatgag acctcctggc cttcctaaat caaagagcaa tgcctgggtc acggaaggga 541 tgtatgcctt ctgttaccga gtgatgtttg aagctggcta tctaacactg tttggcagag 601 atatttcaaa gacagacaca caaaaagcac ttattctaaa caaccttgac aacttcaaac 661 aatttgacca agtctttccg gcactggtgg caggccttcc tattcacttg ttcaagaccg 721 cacataaagc tcgggaaaag ctggctgagg gattgaagca caagaacctg tgtgtgaggg 781 accaggtctc tgaactgatc cgtctacgta tgtttctcaa tgacacgctc tccacctttg 841 acgacatgga gaaggccaag acgcacctcg ctattctctg ggcatctcaa gcaaacacca 901 ttcctgcaac cttttggagc ttatttcaaa tgatcaggag tcctgaagca atgaaagcag 961 cctctgaaga agtgagtgga gctttacaga gtgctggcca agagctcagc tctggaggga 1021 gtgccattta cttggatcaa gtgcaactga atgacctgcc ggtactagac agcatcatca 1081 aggaggctct gaggctttcc agtgcatcct tgaatatccg cacagctaag gaggacttca 1141 ctctccatct tgaggacggt tcctataaca tccgaaaaga tgacatgata gctctttatc 1201 cacagttaat gcacttggat cctgaaatct acccagaccc tttgactttc aaatatgacc 1261 ggtaccttga tgaaagcggg aaagcaaaga ccaccttcta cagtaatgga aacaagctga 1321 agtgtttcta catgcccttc ggatcaggcg cgacaatatg tcctggaaga ctctttgccg 1381 tccaagaaat caagcagttt ttgatcctga tgctctcctg ctttgaactg gagtttgtgg 1441 agagccaagt caagtgtccc cctctagacc agtcccgggc aggcttggga attttgccac 1501 cactacatga tattgagttt aaatataaac tgaaacactg atacgtggtt ggaagaagcg 1561 aacactggat gatgtcactt ggcggctgag agtcatcact aaacaggcct tcgggaccaa 1621 tgctcactga tgcgccctag cgactggatt agtgggaaga actttgttct cgctgcccac 1681 attcctgggt gttcacatag ctggggccag agcttcatca ctttcagaaa gcaatgtctt 1741 ttgtatttat tttcaaaatg aagatattcc aattggcagg atatttttcc taaggaaatt 1801 gctttatatt tttatgaaaa ctaccaatta attatgaaag ggcttgaaat tcacgtttta 1861 gtgaaattac tgatttttca ctagtaaggt tcttcaggtg tgaaactgta ttataaaaat 1921 gttgtaatgg gtcacactgt gctttgcata aaggtaaagg aaactatgtt tcagcctttt 1981 ctgtgtctat gagcttcgaa aataatctta ctgttctaga aacactgggg aggtttcgac 2041 atgctctcgc tatattttat tttactgttg ctagaaattt tcattccagt tttcaactac 2101 cttatctttc ccccattttg acatgcatgc caatgagaag agtatttttt aggaattaac 2161 aaggcacctc ccagaaccct accctgagac ttttaagcct ttaatcccag cactcgagaa 2221 gtagagccag gcagatctct gagtctgagg ttattctggt ctacatcagc tccagacaag 2281 ccaggactac agaatgggat cttgtctaaa aaatacagct aatctttatg tcataactga 2341 ttatgaatca acctaaaaga taaattttca atcaggactc agagaaaatg agcaattaaa 2401 aaacttagct ctgaggtatg tggaattcat taagtacaag ttgacattac atgttcttta 2461 aaaatagttt atgttttatc tctaaatgcc ctgcagatga agaataataa tgaaaagttg 2521 aataatactg tttaaacact aagtgcaata atgctttggt aatgtacttt aagagaatca 2581 ttagccgtgc cagttttact aaaatatatt tatatgtaaa ttatatttat ctttttctta 2641 taccataaat ataaaaatat tgcaacattt agtaatttta aaattatata cctttcagaa 2701 aatgatgtat gatgtttgta tgttttttaa ctttgaacag aacatttaaa ttattcatct 2761 acggtgattt ttatcttatt tatttctttt tgtctcattc atatcttgaa gaaatccaaa 2821 aatatctgaa ggaatcgctc actcaaatgt ctccctatgg ttacagaaaa attcaatacc 2881 atgtttttgt cctcggggac tgaagcaggg tgtcgtgggt gcgagcagag gctcctgctg 2941 cagcgagctt tatccacggg actccttaaa cttttaaaat cttatcacta ttatcatgca 3001 tttattacct aagtaggata tttccctttc ctttttcatt tcagcagagt cccttagcaa 3061 cccaggctga ctgggaccct ccatgtagct taagctgtga actcactgta cttcctgttt 3121 tcacttattt taggaagtaa ttttccctat cagaaatttt aattgtttag atgatgtata 3181 agagtaacac aattctgtta tatactaatc tgtagtaaac taaatttgtt cttagaacaa 3241 gtttgatgac tctcaaattg aatgtatcca tacatctttc catggcttct tgaatgccca 3301 tttctcatac acagaatgat gggtttcacg gtgatgtctt cctttcatgt ctttattctt 3361 gtgcggtgat ggttggcaaa tgatacccat ggagcaaggt tactcttcct atttctgtgc 3421 agcctaagtg ttaagaataa tttttaaata cttggaggga aggcacattt tgtgtcatat 3481 gtgaagtgac atgtgacaca cagactagca aatccatgag taaaatttta ttggg // LOCUS PDECYT550 1047 bp ds-DNA BCT 14-JUL-1990 DEFINITION P.denitrificans cytochrome c550 gene, complete cds, and iso-cytochrome oxidase subunit I (iso-COI) gene, 5' end. ACCESSION M27304 KEYWORDS cytochrome c550; cytochrome oxidase. SOURCE P.denitrificans (strain NCIB 8944) DNA, clone pRS59. ORGANISM Paracoccus denitrificans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1047) AUTHORS Van Spanning,R.J.M., Wansell,C., Harms,N., Oltmann,L.F. and Stouthamer,A.H. TITLE Mutagenesis of the gene encoding cytochrome c-550 of Paracoccus denitrificans and analysis of the resultant physiological effects JOURNAL J. Bacteriol. 172, 986-996 (1990) STANDARD full staff_review REFERENCE 2 (sites; amino acid correction for [1]) AUTHORS Van Spanning,R.J.M., Wansell,C., Harms,N., Oltmann,L.F. and Stouthamer,A.H. TITLE Errata: Mutagenesis of the gene encoding cytochrome c-550 of Paracoccus denitrificans and analysis of the resultant physiological effects JOURNAL J. Bacteriol. 172, 3534-3534 (1990) STANDARD full staff_review COMMENT Printed sequence for [1] kindly submitted by R.J.M.Van Spanning, 25-AUG-1989. FEATURES from to/span description pept 235 702 cytochrome c550 precursor pept 874 > 1047 iso-cytochrome oxidase subunit I sigp 235 294 signal peptide cytochrome c550 matp 295 699 cytochrome c550 ORF < 1 90 open reading frame (AA at 1) rpt 734 743 inverted repeat copy A rpt 757 766 inverted repeat copy B BASE COUNT 215 a 340 c 309 g 183 t ORIGIN 1 bp upstream of SalI site. 1 gtcgacatgg gcctgccggt cggcatggat gacctggacc tggccctgcg ccgcagcttc 61 gcacggaact ttccgccgct cgccggctga ttttcggccc tgcgacatat tctccctcag 121 ccgttctacg acgtttttcc tgtattctgc ccgcttggca catgatagcc ctgtcaatcg 181 gggaagcgga gaggtccgtt gcccggacat gagcgatacc caagaggaaa cgcgatgaag 241 atcagcatct atgccactct cgccgccatc accctcgccc tgcccgctgc ggcccaggat 301 ggcgacgccg ccaaaggcga gaaagaattc aacaagtgca aggcttgcca catgatccag 361 gcgccggacg gcaccgacat catcaagggc ggcaagaccg ggcccaacct ttacggcgtc 421 gtcggccgca agatcgcctc ggaggagggc ttcaaatacg gcgaaggcat cctcgaggtc 481 gccgaaaaga accccgacct gacctggacc gaggccgacc tgatcgaata cgtcaccgac 541 cccaagccct ggctggtcaa gatgaccgac gacaagggcg ccaagaccaa gatgaccttc 601 aagatgggca agaaccaggc cgacgtggtg gccttcctgg cccagaactc gcccgatgcg 661 ggcggcgacg gcgaggctgc ggccgagggc gaatcgaact gatcccgacg cctccgagat 721 gtctcgacgg ccccgcccct gcgtgcgaca aaagcgcgca ggggcgcgac ttttagtcct 781 ttcttcgcat tgcgaggggt aggattgtgg tctagaaaca ggcgagtccg tcggcctttg 841 cgccggccag ccgcatgtct agggagtcca cgcatggcag acgcagccgt tcacggccac 901 ggtgaccatc atgacacccg cgggttcttc acccgctggt tcatgtcaac aaaccacaag 961 gatatcggta tcctttacct gttcacggcc ggcatcgtcg gcctgatctc ggtatgcttc 1021 accgtctata tgcggatgga actgcag // LOCUS ECOSPEC 2330 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli ornithine decarboxylase (speC) and cyclic AMP receptor protein genes, complete cds. ACCESSION M33766 KEYWORDS cyclic AMP receptor protein; ornithine decarboxylase; speC gene. SOURCE E.coli (strain K12) DNA, clone pLC20-5. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2330) AUTHORS Barroso,L., Moore,R., Wright,J., Patel,T. and Boyle,S.M. TITLE Analysis and sequence of the speC (ornithine decarboxylase) gene of Escherichia coli JOURNAL Unpublished (1990) STANDARD unannotated staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.M.Boyle, 22-MAY-1990. Author address: S.M.Boyle Virginia Polytechnic Institute and State University College of Veterinary Medicine, VPI & SU Blacksburg, VA 24061 FEATURES from to/span description pept 86 2281 ornithine decarboxylase (speC) (EC 4.1.1.17) pept 719 630 (c) cyclic AMP receptor protein (CRP) mRNA 15 2289 speC mRNA (alt., primer extension) mRNA 16 2289 speC mRNA (alt., S1 mapping) mRNA 27 2289 speC mRNA (alt., primer extension) mRNA 30 2289 speC mRNA (alt., S1 mapping) mRNA 41 2289 speC mRNA (alt., S1 mapping) mRNA 42 2289 speC mRNA (alt., primer extension) signal 14 19 -10 region signal 733 728 (c) -10 region signal 755 750 (c) -35 region signal 2289 2320 rho-independent transcription termination signal binding 364 367 pyridoxal-PO4 binding site binding 66 72 GTP phosphoryl binding site binding 263 266 GTP phosphoryl binding site binding 379 382 GTP guanine binding site BASE COUNT 508 a 546 c 664 g 612 t ORIGIN 1 ctgcaggaca acagtaaaat cagagcgttt ctgcttttac tgatgtctgg cggtcggagc 61 tggtgaccag tttgacccat atctcatggg gcagggtttt ccaccttgtc cggtattctt 121 acttccccga aacgggtttg cgcttatgaa atcaatgaat attgccgcca gtagtgaact 181 ggtatcccga ctttcttctc atcgtcgcgt ggtggcgttg ggagatactg attttacgga 241 cgtcgcggca gtcgtcatta ccgctgcgga tagtcgcagt ggcattcttg cgttgcttaa 301 gcgcaccggt tttcatctac cggtgttttt gtattccgaa catgctgttg aattacctgc 361 gggcgttacg gcggtaatca acggcaacga gcagcagtgg ctggagctgg aatccgcagc 421 ctgtcagtat gaagagaatt tgctgccacc gttttatgac acgctgacgc agtacgttga 481 gatgggcaac agcacctttg cttgccctgg acatcaacat ggtgcgtttt ttaaaaagca 541 tcctgccgga cgccattttt acgatttctt tggtgagaac gtctttcgcg ccgatatgtg 601 taacgctgac gtaaaattgg gcgatctgct tattcatgaa ggatcggcga aagatgcgca 661 gaaattcgca gccaaagtct ttcatgccga taaaacctat tttgtgctga acggcacatc 721 ggcagcgaat aaagtggtga cgaatgcgct gttaacgcgt ggcgatctgg tgctcttcga 781 ccgtaacaac cataagtcga atcatcacgg cgcgctgatt caggcggggg cgacgccggt 841 ctatctggaa gcttcacgca acccgtttgg tttcattggc ggtattgatg cgcactgttt 901 taatgaagag tatctgcgcc agcaaattcg cgacgttgcg ccagaaaaag ccgacctgcc 961 gcgcccgtat cgcctggcga ttattcagct gggaacctat gacggcactg tctataacgc 1021 ccgtcaggtg atcgataccg ttgggcatct gtgtgattac attctgtttg attccgcgtg 1081 ggtcggttat gaacaattta tcccgatgat ggcggatagc tcgccgctgc tgttagaact 1141 taacgaaaac gatccgggga tctttgtgac tcagtcggtg cacaaacagc aggcgggatt 1201 ctcacagacg tcgcagatcc ataaaaaaga taaccatatc cgcggacagg cgcgtttttg 1261 cccgcataag cggttgaata acgcctttat gctccatgct tctaccagcc ctttctatcc 1321 gctgtttgct gcactggatg ttaacgccaa aattcatgaa ggggagagtg ggcgtcggct 1381 gtgggctgag tgtgttgaga tagggattga agcgcgcaag gctattcttg cgcgctgtaa 1441 gctgttccgc ccgtttatcc cgcccgttgt tgatggcaaa ttgtggcagg attatccgac 1501 atcagtgtta gccagcgacc gccgtttttt cagttttgag ccgggggcga agtggcacgg 1561 ctttgaagga tatgccgcgg atcagtattt tgttgatccg tgcaagctgt tactcactac 1621 accaggtatc gatgccgaaa ccggcgaata tagcgacttt ggcgttccgg cgacgattct 1681 ggcgcactat ctgcgtgaga acggcattgt gccggagaag tgcgatctca actccattct 1741 gtttttatta actccggcgg aaagccacga gaagctggca caactggtgg cgatgctggc 1801 gcaatttgaa cagcatattg aggatgactc gccgctggtt gaggtgttgc cgagcgttta 1861 taacaagtat ccggtgcgct atcgcgacta caccctgcgc cagttgtgtc aggagatgca 1921 cgatctgtat gtcagtttcg acgtcaaaga cctacaaaaa gcgatgttcc gccagcagag 1981 tttcccgtca gtggtgatga acccccagga tgcgcatagc gcttatattc gcggtgacgt 2041 ggagttggtg cggattcgtg atgccgaagg gcgaattgcg gcagaagggg cgttgcctta 2101 tccacctggc gtgctttgcg tggtacccgg ggaagtctgg ggtggggcgg ttcaacgtta 2161 tttccttgca ctggaagaag gggtgaattt gttgccggga ttttcgccgg agctgcaagg 2221 tgtttatagc gaaaccgatg cggatggcgt gaaacggttg tacggttatg tgttgaagta 2281 agaataaaaa aaacgggtca ccttctggcg acccgttttt ctttgcgtaa // LOCUS HUMLPLAS 557 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human L-plastin mRNA, 5' end. ACCESSION M34426 KEYWORDS L-plastin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 557) AUTHORS Lin,C.-S., Aebersold,R.H. and Leavitt,J. TITLE Correction of the N-terminal sequences of the human plastin isoforms by using anchored polymerase chain reaction: Identification of a potential calcium-binding domain JOURNAL Mol. Cell. Biol. 10, 1818-1821 (1990) STANDARD simple staff_review FEATURES from to/span description pept 138 > 557 L-plastin BASE COUNT 174 a 111 c 133 g 139 t ORIGIN 1 acttcctgcc ttgtgaccac acacccaggc ttgacaaagc tgttctgcag atcagaaaga 61 aggggttcct ggtcatacac cagtactacc aaggacagct tttttcctgc aagatctgtt 121 acctaaagca ataaaaaatg gccagaggat cagtgtccga tgaggaaatg atggagctca 181 gagaagcttt tgccaaagtt gatactgatg gcaatggata catcagcttc aatgagttga 241 atgacttgtt caaggctgct tgcttgcctt tgcctgggta tagagtacga gaaattacag 301 aaaacctgat ggctacaggt gatctggacc aagatggaag gatcagcttt gatgagttta 361 tcaagatttt ccatggccta aaaagcacag atgttgccaa gacctttaga aaagcaatca 421 ataagaagga agggatttgt gcaatcggtg gtacttcaga gcagtctagc gttggcaccc 481 aacactccta ttcagaggaa gaaaagtatg cctttgtcaa ctggataaac aaagccctgg 541 aaaatgatcc tgattgt // LOCUS HUMTPLAS 516 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human T-plastin mRNA, 5' end. ACCESSION M34427 KEYWORDS T-plastin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 516) AUTHORS Lin,C.-S., Aebersold,R.H. and Leavitt,J. TITLE Correction of the N-terminal sequences of the human plastin isoforms by using anchored polymerase chain reaction: Identification of a potential calcium-binding domain JOURNAL Mol. Cell. Biol. 10, 1818-1821 (1990) STANDARD simple staff_review FEATURES from to/span description pept 88 > 516 T-plastin BASE COUNT 169 a 85 c 125 g 137 t ORIGIN 1 aaagattccg aggtgcagaa gttgtctgag tgggttggtc ggcggcagtc gggccagacc 61 caggactctg cgactttaca tctttaaatg gatgagatgg ctaccactca gatttccaaa 121 gatgagcttg atgaactcaa agaggccttt gcaaaagttg atctcaacag caacggattc 181 atttgtgact atgaacttca tgagctcttc aaggaagcta atatgccatt accaggatat 241 aaagtgagag aaattattca gaaactcatg ctggatggtg acaggaataa agatgggaaa 301 ataagttttg acgaatttgt ttatattttt caagaggtaa aaagtagtga tattgccaag 361 accttccgca aagcaatcaa caggaaagaa ggtatttgtg ctctgggtgg aacttcagag 421 ttgtccagcg aaggaacaca gcattcttac tcagaggaag aaaaatatgc ttttgttaac 481 tggataaaca aagctttgga aaatgatcct gattgt // LOCUS MUSICSBP 2544 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Mouse interferon consensus sequence binding protein mRNA, complete cds. ACCESSION M32489 KEYWORDS DNA binding protein; interferon consensus sequence binding protein. SOURCE Mouse (strain Balb/c; haplotype H-2d) neonatal spleen and lung, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2544) AUTHORS Driggers,P.H., Ennist,D.L., Gleason,S.L., Mak W,-H., Marks,M.S., Levi,B.-Z., Flanagan,J.R., Appella,E. and Ozato,K. TITLE An interferon gamma-regulated protein that binds the interferon-inducible enhancer element of major histocompatibility complex class I genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3743-3747 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Ozato, 23-MAR-1990, for release after publication. FEATURES from to/span description pept 21 1295 interferon consensus sequence binding protein binding 21 362 DNA binding site BASE COUNT 584 a 680 c 739 g 541 t ORIGIN 1 gcgtgggaac cggcggcagg atgtgtgacc ggaacggcgg gcggcggctg cggcagtggc 61 tgatcgaaca gatcgacagc agcatgtacc cggggctgat ctgggaaaat gatgagaaga 121 ccatgttccg tatcccctgg aagcatgccg gcaagcagga ttacaatcag gaggtggatg 181 cttccatctt caaggcctgg gcagttttta aagggaagtt taaagaggga gacaaagctg 241 aaccagccac gtggaagacg aggttacgct gtgctctgaa caagagccca gattttgaag 301 aagtgactga ccggtcccag ctggacattt ctgagccata taaagtttac cgaattgtcc 361 ccgaggaaga acaaaaatgc aagctgggcg tggcacctgc aggctgcatg agcgaagttc 421 ctgagatgga gtgtggccgc tcagagattg aggagctgat caaggaacct tctgtggatg 481 agtacatggg tatgaccaag aggagcccat ccccaccaga ggcctgcagg agccagatcc 541 tccctgactg gtgggtccag cagcccagtg caggcctgcc actggtgacc ggatatgccg 601 cctatgacac acaccattca gctttctccc agatggtcat cagcttctac tacgggggca 661 agctggtggg ccaggccacc accacctgcc ttgaaggctg ccgtctctcc ctgagccagc 721 cggggctgcc taagttgtat gggccggatg gcctggaacc cgtgtgcttt ccgacggccg 781 acaccatccc cagtgagcgg cagaggcagg tgacccggaa gctgtttggg cacctggaac 841 gtggcgtgct actgcacagc aaccgcaagg gcgtgttcgt gaagcggctg tgccagggcc 901 gcgtgttctg cagcggcaac gcggtggtgt gcaagggcag gcccaacaag ctggagcggg 961 acgaggtggt gcaggtcttt gacaccaacc agttcatccg agagctgcag caattctacg 1021 ccacccagag ccgcctacct gacagcaggg tggtcctgtg cttcggggag gagtttccgg 1081 acactgtgcc cttgcgctcc aaactcattc tggtgcaggt agagcagctg tatgccaggc 1141 agctggtgga ggaagcgggc aagagctgcg gtgctggctc cctgatgcca gccctggagg 1201 agccccagcc ggaccaggct ttccgcatgt ttccggatat ctgtacctca caccagagac 1261 ccttttttag agaaaatcaa cagatcaccg tctaagcctc agtccgggca ccccacctcg 1321 cctgagctca agcttcaaga gtctgtgact aagagaattc cgaaaggatg tggagccctc 1381 tgactggggt gggcgggtgt cctccaaggg gcctccggaa gcccacagag ggatgcgctc 1441 ctgctcaggc aggtgtcaga agcttgcagg ggctgtggcc gcaacctgtg attaaagcat 1501 tccttttcct tacactattt cacccctcac cactaatggc tggcctttct gtgtgctgag 1561 gtctttcgac agttcaaatc atctggtggc agcagactcg cctttgccct tctgcggccg 1621 agggcggaga tttatgactt tctctgcttg gttggagaag aagaatcttt actattcagc 1681 ttcttttctt ttggccagaa ctctgaaaaa aaaaaaaaac tcttttttaa gaccaatatt 1741 tgtattctca caggctcagc tgtcaatcac ttgagacctt ccctgtaaag tggggcagat 1801 tttaaatatg ggtgtagata ctcgttgcag ccttcgcagg aattttggtt gtggttcatt 1861 gattcacaca gactctgtgt cagctgacag ggctgtgtgg ggcatcaaag gaggaccagg 1921 cactgtggag aagacccatt cactggcatc tcacccttcc ttgtccagct ccatacccag 1981 tcctaagacc cagtgaaaag ccacgtccaa actgtgctct gggctcatca gtgcccaccc 2041 acgtaccagg gaaaggcaca cacccatccc agtgggcaca gagcggaatg tccccctacc 2101 gcaccatttg cgcccccaat ctggctgtcc aacctagttt gtaagtaatc taaatcagtg 2161 actatagccc cgcctaaggg acacttcccg gaggagggag ccgctgaaaa ggagttagtt 2221 tgagggtcag tacacaacag gggcagaaag ccaagcagat gtgggggcag ggaggtcatc 2281 atctgctttt gtctgagaga aggagagctt ctccgtttgt tcaactttgt aacaagctgg 2341 gttacatgct ccacgcagct agagaagcct aggtgctctg cattccctgg ggaactgcag 2401 gaaagcctta cctgctgact gttgctctgg ggaaaagcct gagggtccag agcagctaca 2461 agctacaggc cataccttac aacctgaaaa gctaaggacc acggtgacct tcccggctac 2521 tgtgtgaagg tgctgggtgg ggcc // LOCUS DDIUMPCMP 694 bp ss-mRNA INV 14-JUL-1990 DEFINITION D.discoideum UMP-CMP kinase mRNA, complete cds. ACCESSION M34568 KEYWORDS UMP-CMP kinase. SOURCE D.discoideum, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 694) AUTHORS Wiesmueller,L., Noegel,A.A., Barzu,O., Gerisch,G. and Schleicher,M. TITLE cDNA-derived sequence of UMP-CMP kinase from Dictyostelium discoideum and expression of the enzyme in Escherichia coli JOURNAL J. Biol. Chem. 265, 6339-6345 (1990) STANDARD simple staff_review FEATURES from to/span description pept 54 638 UMP-CMP kinase mRNA < 1 694 UMP-CMP kinase mRNA BASE COUNT 287 a 74 c 110 g 223 t ORIGIN 1 tttaaatata taaattttgt aaaacaactt caaatttttt aaattcaaga atgatggaaa 61 aaagtaaacc aaatgttgta tttgttttag gtggaccagg ttcaggtaaa ggtactcaat 121 gcgccaacat tgtcagagat tttggatggg tacatttatc agcaggtgat ttattacgtc 181 aagaacaaca atctggttca aaagatggtg aaatgattgc aacaatgatc aaaaatggtg 241 aaatcgttcc atcaattgta actgtaaaat tattaaagaa tgctattgat gctaatcaag 301 gtaaaaactt tttagttgat ggtttcccac gtaatgaaga aaataataat tcttgggaag 361 aaaatatgaa agattttgtt gatacaaaat ttgttttatt ctttgattgt ccagaagaag 421 ttatgactca aagattatta aagagaggtg aatcaagtgg tagatcagat gataatattg 481 aatcaattaa aaagagattc aatacattca acgttcaaac taaattagtt atcgatcatt 541 ataataaatt tgacaaagtt aaaatcattc cagcaaatag agatgttaat gaagtttata 601 atgatgttga aaaccttttt aaatcaatgg gtttttaaaa aaaaaaaaaa aaaaaccaat 661 aaattttaaa attactaaat aataaatata cccc // LOCUS HUMBGALXX 2409 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human beta-galactosidase (GLB1) mRNA, complete cds. ACCESSION M34423 KEYWORDS beta-galactosidase. SOURCE Human testis, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2409) AUTHORS Yamamoto,Y., Hake,C.A., Martin,B.M., Kretz,K.A., Ahern-Rindell,A.J., Naylor,S.L., Mudd,M. and O'Brien,J.S. TITLE Isolation, characterization, and mapping of a human acid beta- galactosidase cDNA JOURNAL DNA 9, 119-127 (1990) STANDARD simple staff_review FEATURES from to/span description pept 61 2094 beta-galactosidase precursor (EC 3.2.1.23) /nomgen="GLB1" /map="3pter-p21" /hgml_locus_uid="LA0059G " sigp 61 129 beta-galactosidase signal peptide (put.) matp 130 2091 beta-galactosidase BASE COUNT 566 a 635 c 613 g 595 t ORIGIN 1 gaattcgggc gcgaagcggc cggcctgggc gccgactgca gagccgggag gctggtggtc 61 atgccggggt tcctggttcg catcctcctt ctgctgctgg ttctgctgct tctgggccct 121 acgcgcggct tgcgcaatgc cacccagagg atgtttgaaa ttgactatag ccgggactcc 181 ttcctcaagg atggccagcc atttcgctac atctcaggaa gcattcacta ctcccgtgtg 241 ccccgcttct actggaagga ccggctgctg aagatgaaga tggctgggct gaacgccatc 301 cagacgtatg tgccctggaa ctttcatgag ccctggccag gacagtacca gttttctgag 361 gaccatgatg tggaatattt tcttcggctg gctcatgagc tgggactgct ggttatcctg 421 aggcccgggc cctacatctg tgcagagtgg gaaatgggag gattacctgc ttggctgcta 481 gagaaagagt ctattcttct ccgctcctcc gacccagatt acctggcagc tgtggacaag 541 tggttgggag tccttctgcc caagatgaag cctctcctct atcagaatgg agggccagtt 601 ataacagtgc aggttgaaaa tgaatatggc agctactttg cctgtgattt tgactacctg 661 cgcttcctgc agaagcgctt tcgccaccat ctgggggatg atgtggttct gtttaccact 721 gatggagcac ataaaacatt cctgaaatgt ggggccctgc agggcctcta caccacggtg 781 gactttggaa caggcagcaa catcacagat gctttcctaa gccagaggaa gtgtgagccc 841 aaaggaccct tgatcaattc tgaattctat actggctggc tagatcactg gggccaacct 901 cactccacaa tcaagaccga agcagtggct tcctccctct atgatatact tgcccgtggg 961 gcgagtgtga acttgtacat gtttataggt gggaccaatt ttgcctattg gaatggggcc 1021 aactcaccct atgcagcaca gcccaccagc tacgactatg atgccccact gagtgaggct 1081 ggggacctca ctgagaagta ttttgctctg cgaaacatca tccagaagtt tgaaaaagta 1141 ccagaaggtc ctatccctcc atctacacca aagtttgcat atggaaaggt cactttggaa 1201 aagttaaaga cagtgggagc agctctggac attctgtgtc cctctgggcc catcaaaagc 1261 ctttatccct tgacatttat ccaggtgaaa cagcattatg ggtttgtgct gtaccggaca 1321 acacttcctc aagattgcag caacccagca cctctctctt cacccctcaa tggagtccac 1381 gatcgagcat atgttgctgt ggatgggatc ccccagggag tccttgagcg aaacaatgtg 1441 atcactctga acataacagg gaaagctgga gccactctgg accttctggt agagaacatg 1501 ggacgtgtga actatggtgc atatatcaac gattttaagg gtttggtttc taacctgact 1561 ctcagttcca atatcctcac ggactggacg atctttccac tggacactga ggatgcagtg 1621 cgcagccacc tggggggctg gggacaccgt gacagtggcc accatgatga agcctgggcc 1681 cacaactcat ccaactacac gctcccggcc ttttatatgg ggaacttctc cattcccagt 1741 gggatcccag acttgcccca ggacaccttt atccagtttc ctggatggac caagggccag 1801 gtctggatta atggctttaa ccttggccgc tattggccag cccggggccc tcagttgacc 1861 ttgtttgtgc cccagcacat cctgatgacc tcggccccaa acaccatcac cgtgctggaa 1921 ctggagtggg caccctgcag cagtgatgat ccagaactat gtgctgtgac gttcgtggac 1981 aggccagtta ttggctcatc tgtgacctac gatcatccct ccaaacctgt tgaaaaaaga 2041 ctcatgcccc cacccccgca aaaaaacaaa gattcatggc tggaccatgt atgatgatga 2101 aagcctgtgt ctttgaggga ttctaccctg aacatacctc acagatcctc cctgtcatgc 2161 cacatttcac tgattggaat gtggaaatgg aaaaggaatt taggatgtgc attttcacct 2221 gaggtttccc tgcatccctg cagtgccaaa gccccacctt cagggaccac ctggaatgtg 2281 tgaggggctg acagcacagt aacgtgcata catatctgca gggctggaat ggaagcttta 2341 aaggtggtag tgatttttat tttggaagaa tcatgttacc tttttgttaa ataaaatttg 2401 cccgaattc // LOCUS HUMGAA 3441 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human acid alpha-glucosidase (GAA) mRNA, complete cds. ACCESSION M34424 KEYWORDS acid alpha-glucosidase; acid maltase. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3441) AUTHORS Martiniuk,F., Mehler,M., Tzall,S., Meredith,G. and Hirschhorn,R. TITLE Sequence of the cDNA and 5'-flanking region for human acid alpha- glucosidase, detection of an intron in the 5' untranslated leader sequence, definition of 18-bp polymorphisms, and differences with previous cDNA and amino acid sequences JOURNAL DNA 9, 85-94 (1990) STANDARD simple staff_review FEATURES from to/span description pept 33 2891 acid alpha-glucosidase /nomgen="GAA" /map="17q23" /hgml_locus_uid="LZ0096H" BASE COUNT 612 a 1170 c 1027 g 632 t ORIGIN 1 gcctgtagga gctgtccagg ccatctccaa ccatgggagt gaggcacccg ccctgctccc 61 accggctcct ggccgtctgc gccctcgtgt ccttggcaac cgctgcactc ctggggcaca 121 tcctactcca tgatttcctg ctggttcccc gagagctgag tggctcctcc ccagtcctgg 181 aggagactca cccagctcac cagcagggag ccagcagacc agggccccgg gatgcccagg 241 cacaccccgg ccgtcccaga gcagtgccca cacagtgcga cgtccccccc aacagccgct 301 tcgattgcgc ccctgacaag gccatcaccc aggaacagtg cgaggcccgc ggctgctgct 361 acatccctgc aaagcagggg ctgcagggag cccagatggg gcagccctgg tgcttcttcc 421 cacccagcta ccccagctac aagctggaga acctgagctc ctctgaaatg ggctacacgg 481 ccaccctgac ccgtaccacc cccaccttct tccccaagga catcctgacc ctgcggctgg 541 acgtgatgat ggagactgag aaccgcctcc acttcacgat caaagatcca gctaacaggc 601 gctacgaggt gcccttggag accccgcatg tccacagccg ggcaccgtcc ccactctaca 661 gcgtggagtt ctccgaggag cccttcgggg tgatcgtgcg ccggcagctg gacggccgcg 721 tgctgctgaa cacgacggtg gcgcccctgt tctttgcgga ccagttcctt cagctgtcca 781 cctcgctgcc ctcgcagtat atcacaggcc tcgccgagca cctcagtccc ctgatgctca 841 gcaccagctg gaccaggatc accctgtgga accgggacct tgcgcccacg cccggtgcga 901 acctctacgg gtctcaccct ttctacctgg cgctggagga cggcgggtcg gcacacgggg 961 tgttcctgct aaacagcaat gccatggatg tggtcctgca gccgagccct gcccttagct 1021 ggaggtcgac aggtgggatc ctggatgtct acatcttcct gggcccagag cccaagagcg 1081 tggtgcagca gtacctggac gttgtgggat acccgttcat gccgccatac tggggcctgg 1141 gcttccacct gtgccgctgg ggctactcct ccaccgctat cacccgccag gtggtggaga 1201 acatgaccag ggcccacttc cccctggacg tccagtggaa cgacctggac tacatggact 1261 cccggaggga cttcacgttc aacaaggatg gcttccggga cttcccggcc atggtgcagg 1321 agctgcacca gggcggccgg cgctacatga tgatcgtgga tcctgccatc agcagctcgg 1381 gccctgccgg gagctacagg ccctacgacg agggtctgcg gaggggggtt ttcatcacca 1441 acgagaccgg ccagccgctg attgggaagg tatggcccgg gtccactgcc ttccccgact 1501 tcaccaaccc cacagccctg gcctggtggg aggacatggt ggctgagttc catgaccagg 1561 tgcccttcga cggcatgtgg attgacatga acgagccttc caacttcatc aggggctctg 1621 aggacggctg ccccaacaat gagctggaga acccacccta cgtgcctggg gtggttgggg 1681 ggaccctcca ggcggccacc atctgtgcct ccagccacca gtttctctcc acacactaca 1741 acctgcacaa cctctacggc ctgaccgaag ccatcgcctc ccacagggcg ctggtgaagg 1801 ctcgggggac acgcccattt gtgatctccc gctcgacctt tgctggccac ggccgatacg 1861 ccggccactg gacgggggac gtgtggagct cctgggagca gctcgcctcc tccgtgccag 1921 aaatcctgca gtttaacctg ctgggggtgc ctctggtcgg ggccgacgtc tgcggcttcc 1981 tgggcaacac ctcagaggag ctgtgtgtgc gctggaccca gctgggggcc ttctacccct 2041 tcatgcggaa ccacaacagc ctgctcagtc tgccccagga gccgtacagc ttcagcgagc 2101 cggcccagca ggccatgagg aaggccctca ccctgcgcta cgcactcctc ccccacctct 2161 acacactgtt ccaccaggcc cacgtcgcgg gggagaccgt ggcccggccc ctcttcctgg 2221 agttccccaa ggactctagc acctggactg tggaccacca gctcctgtgg ggggaggccc 2281 tgctcatcac cccagtgctc caggccggga aggccgaagt gactggctac ttccccttgg 2341 gcacatggta cgacctgcag acggtgccaa tagaggccct tggcagcctc ccacccccac 2401 ctgcagctcc ccgtgagcca gccatccaca gcgaggggca gtgggtgacg ctgccggccc 2461 ccctggacac catcaacgtc cacctccggg ctgggtacat catccccctg cagggccctg 2521 gcctcacaac cacagagtcc cgccagcagc ccatggccct ggctgtggcc ctgaccaagg 2581 gtggggaggc ccgaggggag ctgttctggg acgatggaga gagcctggaa gtgctggagc 2641 gaggggccta cacacaggtc atcttcctgg ccaggaataa cacgatcgtg aatgagctgg 2701 tacgtgtgac cagtgaggga gctggcctgc agctgcagaa ggtgactgtc ctgggcgtgg 2761 ccacggcgcc ccagcaggtc ctctccaacg gtgtccctgt ctccaacttc acctacagcc 2821 ccgacaccaa ggtcctggac atctgtgtct cgctgttgat gggagagcag tttctcgtca 2881 gctggtgtta gccgggcgga gtgtgttagt ctctccagag ggaggctggt tccccaggga 2941 agcagagcct gtgtgcgggc agcagctgtg tgcgggcctg ggggttgcat gtgtcacctg 3001 gagctgggca ctaaccattc caagccgccg catcgcttgt ttccacctcc tgggccgggg 3061 ctctggcccc caacgtgtct aggagagctt tctccctaga tcgcactgtg ggccggggcc 3121 tggagggctg ctctgtgtta ataagattgt aaggtttgcc ctcctcacct gttgccggca 3181 tgcgggtagt attagccacc cccctccatc tgttcccagc accggagaag ggggtgctca 3241 ggtggaggtg tggggtatgc acctgagctc ctgcttcgcg cctgctgctc tgccccaacg 3301 cgaccgctgc ccggctgccc agagggctgg atgcctgccg gtccccgagc aagcctggga 3361 actcaggaaa attcacagga cttgggagat tctaaatctt aagtgcaatt atttttaata 3421 aaaggggcat ttggaatcaa a // LOCUS HUMGAAX 298 bp ds-DNA PRI 14-JUL-1990 DEFINITION Human acid alpha-glucosidase (GAA) gene, 5' flank. ACCESSION M34425 KEYWORDS acid alpha-glucosidase; acid maltase. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 298) AUTHORS Martiniuk,F., Mehler,M., Tzall,S., Meredith,G. and Hirschhorn,R. TITLE Sequence of the cDNA and 5'-flanking region for human acid alpha- glucosidase, detection of an intron in the 5' untranslated leader sequence, definition of 18-bp polymorphisms, and differences with previous cDNA and amino acid sequences JOURNAL DNA 9, 85-94 (1990) STANDARD simple staff_review FEATURES from to/span description IVS 274 > 298 acid alpha-glucosidase, intron A /hgml_locus_uid="LZ0096H" /nomgen="GAA" /map="17q23" BASE COUNT 38 a 88 c 128 g 44 t ORIGIN 1 cgtgcggagg tgagccgggc cggggctgcg gggcttccct gagcgcgggc cgggtcggtg 61 gggcggtcgg ctgcccgcgc ggcctctcag tgggaaagct gaggttgtcg ccggggccgc 121 gggtggaggt cggggatgag gcagcaggta ggacagtgac ctcggtgacg cgaaggaccc 181 cggccacctc taggttctcc tcgtccgccc gttgttcagc gagggaggct ctgggcctgc 241 cgcagctgac ggggaaactg aggcacggag cgggtgagac acctgacgtc tgccccgc // LOCUS HUMMAK 171 bp ds-DNA PRI 14-JUL-1990 DEFINITION Human male germ cell-associated kinase (mak) gene, exon N. ACCESSION M35863 M34569 KEYWORDS kinase. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 171) AUTHORS Matsushime,H., Jinno,A., Takagi,N. and Shibuya,M. TITLE A novel mammalian protein kinase gene (mak) is highly expressed in testicular germ cells at and after meiosis JOURNAL Mol. Cell. Biol. 10, 2261-2268 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 25 / 157 male germ cell-associated kinase (mak), exon N (AA at 27) IVS < 1 24 mak intron X IVS 158 > 171 mak intron X+1 BASE COUNT 48 a 30 c 38 g 55 t ORIGIN 1 tttttttctc cgtatatcat caaggctttt ttcataggga catgaaacca gaaaacttgc 61 tttgtatggg tccagagctt gtgaaaattg ctgattttgg acttgcaaga gaattaaggt 121 cacagccacc atacactgac tatgtatcta ccagatggtg agtaggggtt a // LOCUS RATMAK 2372 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat male germ cell-associated kinase (mak) mRNA, complete cds. ACCESSION M35862 M34569 KEYWORDS kinase. SOURCE Rat, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2372) AUTHORS Matsushime,H., Jinno,A., Takagi,N. and Shibuya,M. TITLE A novel mammalian protein kinase gene (mak) is highly expressed in testicular germ cells at and after meiosis JOURNAL Mol. Cell. Biol. 10, 2261-2268 (1990) STANDARD simple staff_review FEATURES from to/span description pept 237 2105 male germ cell-associated kinase (mak) mRNA < 1 2372 mak mRNA BASE COUNT 683 a 575 c 574 g 540 t ORIGIN 1 tctgaatatg tggagactat tatagcaaaa tggactcagt gtctctttgc cagattcctt 61 attgctgtat cccatgagcc tacctgacct tgaggaagag ccgtgtttaa tgatgacaag 121 cctgtctccc gtgtagtgac aaaaaggagg aggaccacaa ggaaagacgg atttagattt 181 ctactcagca aggaagcgag ctctgtcagc aacctctctg gattaccctt cccaagatga 241 accgatatac aaccatgagg cagctggggg acggcacgta cgggagtgtg cttatgggca 301 agagcaatga gtctggggag ctggtggcaa tcaagaggat gaagagaaag ttctattctt 361 gggatgagtg tatgaacttg cgagaagtta agtccctgaa gaaactcaat catgccaatg 421 tgattaaact aaaagaagtt atcagagaaa atgaccatct ttattttata tttgaatata 481 tgaaagaaaa cctctatcag ctaatgaaag acagaaacaa actgttccct gagtcagtca 541 tcagaaatat tatgtatcaa atactacagg ggctggcatt tatccacaaa catggctttt 601 ttcacaggga catgaaacct gagaatttgc tttgcatggg tccagagctg gtgaagattg 661 ctgattttgg acttgcgaga gaattaagat cacagccacc atatactgac tatgtgtcta 721 ccagatggta ccgtgctcct gaagttttgc taaggtcttc agtgtacagc tctcccattg 781 acgtgtgggc cgtgggaagt ataatggccg agctatatac gtttagaccg cttttcccag 841 ggaccagtga agttgatgag atctttaaaa tttgccaagt gttagggact cccaagaaaa 901 gtgactggcc ggaggggtac cagctggcat cctccatgaa cttccgcttt ccccagtgca 961 ttcctataaa cctgaaaact ctcattccca atgccagtag tgaggctatt cagcttatga 1021 cagaaatgct taactgggat ccaaagaaac ggccaactgc aagccaggca ctgaagcacc 1081 catattttca agtcggtcag gtattgggcc cttctgcaca ccatctggat gcaaaacaga 1141 ctttgcacaa gcagctgcag cctccagagc caaagccatc ttcctctgaa cgggatccta 1201 agcctttgcc aaacatcctt gatcagcctg ccgggcagcc ccagccaaaa cagggccacc 1261 aaccactgca ggccattcag ccaccacaga acacagtggt tcagccacct ccaaagcagc 1321 agggtcacca taagcaacca caaacgatgt ttccaagtat cgtcaaaacc ataccaacga 1381 atccagtcag cacagtaggc cataaggggg cccggagacg gtggggtcag acagtcttca 1441 agtctggaga cagctgtgac aacatcgagg actgcgactt gggagcctcc cactccaaga 1501 agccgagcat ggatgccttc aaggaaaaaa agaagaagga gtctccattt cggtttccag 1561 aagcaggact cccagtctcc aaccacttga agggggaaaa tagaaattta catgcatcct 1621 taaaatctga cacaaacttg tcaactgctt caaccgctaa gcagtactat ttgaaacaat 1681 caagatacct tccgggtgtg aaccccaaga acgtgtcttt ggtagctggc ggcaaggata 1741 taaattcaca ctcttggaat aatcagctat ttcctaagtc tctgggatcc atgggggcgg 1801 acctcgcttt caagaggagt aacgcagcag ggaaccttgg gagctacagc gcttacagcc 1861 agacgggatg cgtgccttcc tttctcaaga aagaagtggg atcagctggc cagaggatcc 1921 acttggcgcc tctgggtgcg tcggctgccg actatacctg gagcacaaaa actggccgag 1981 gccagttttc aggacgaact tacaatccca cagccaaaaa tctcaatatt gtgaaccgca 2041 cacagccagt cccctcggtg cacgggagga cagactgggt ggctaagtac ggaggccacc 2101 ggtaggagaa gtggaagcct gaagcattgc tccgtagagg acaatcacgc cccttgatcc 2161 tgggagacgt ctacagagtc tatttctacc gagttccaca ggacgcacac aaccgtgggc 2221 gcctcagaga ccggaagtca gctcccgatt tcttcctttt ccggaaatgc aatgcatttt 2281 cttaacttgt ggccagcagt gctgatgcag caccgtgctg aaacttttga gcgggacttt 2341 taaagagtat tgaataaatt atttgccaaa gt // LOCUS BNRCFIA 1930 bp ds-DNA BCT 14-JUL-1990 DEFINITION B.fragilis imipenem-cefoxitin-hydrolyzing enzyme (cfiA) gene, complete cds. ACCESSION M34831 KEYWORDS imipenem-cefoxitin-hydrolyzing enzyme. SOURCE B.fragilis (strain TAL2480) DNA. ORGANISM Bacteroides fragilis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Anaerobic gram-negative straight, curved and helical rods; Bacteroidaceae. REFERENCE 1 (bases 1 to 1930) AUTHORS Thompson,J.S. and Malamy,M.H. TITLE Sequencing the gene for an imipenem-cefoxitin-hydrolyzing enzyme (cfiA) from Bacteroides fragilis TAL2480 reveals strong similarity between cfiA and Bacillus cereus beta-lactamase II JOURNAL J. Bacteriol. 172, 2584-2593 (1990) STANDARD simple staff_review FEATURES from to/span description pept 533 1282 imipenem-cefoxitin-hydrolyzing enzyme precursor sigp 533 586 imipenem-cefoxitin-hydrolyzing enzyme signal peptide matp 587 1279 imipenem-cefoxitin-hydrolyzing enzyme BASE COUNT 563 a 435 c 434 g 498 t ORIGIN 1 gatcaataag cagttcatct tgtctgcagt attgaaatca taggagttgt ccagaagcag 61 atttataggt gatgttctta caagtcagtt cttcgatagc acgcaatatg gtgtctgcgc 121 tgcaagtgcg aagagttgga tgaagagaca aatgtttcat caggtgagtt gtaacatcct 181 caatacatga gccgccacaa agatatacgc acatcagaga gcgtagaatt tcgctatatt 241 gataaccaaa catagtgcat ctcaatccca aggtggaatc tatggtttga gctaaaagag 301 catcaaattg ctccataata gaaaaaattc ctccaaaagg agtgagtttc tcagatttta 361 tttgtacttt tgccatgtca tattagagtt ttgcttgtct tcttttcgca acactaaggt 421 aagtgaaaat tctgacatgg caaaatcctg ggcaactttt tgttgctcag gcacttataa 481 ataatgttaa actatagtgt tgcggaatta aggcatcata taaaagaata aaatgaaaac 541 agtatttatc cttatctcca tgcttttccc tgtcgcagtt atggcacaga aaagcgtaaa 601 aatatccgat gacatcagta tcacccaact ctcggacaaa gtgtacactt atgtatccct 661 cgccgaaatc gaaggatggg gtatggtacc ttccaacggg atgattgtta tcaacaacca 721 ccaggcagcg ttgctggaca caccgatcaa tgacgcacaa acggaaatgc tggtcaactg 781 ggtgacagac tctttgcatg ccaaagtcac cacgtttatc ccgaaccact ggcacggcga 841 ttgtattggc ggactgggtt acctgcaaag gaaaggtgtc caatcatacg cgaaccagat 901 gacgatagac ctcgccaagg aaaaagggtt gcccgtaccg gaacatggat tcaccgattc 961 actgaccgtc agcttggacg gcatgcctct ccaatgttat tatttaggag gcggacatgc 1021 gaccgacaat atcgtggttt ggctgccgac agagaatatc ctttttggcg gatgtatgct 1081 taaagacaac caggcgacaa gcatcggcaa catctcggac gcggacgtga cggcatggcc 1141 gaaaactctc gataaggtaa aagccaagtt cccctcggcc cgttacgtcg tgcccggaca 1201 tggcgactat ggcggaaccg aactgataga gcataccaag cagatcgtga accaatatat 1261 agaaagcact tcaaagccat agcccgaatt gtaggcatct acccggtatt ccatgaacga 1321 agaggcgaaa gcttttatcc cgtccacaga atgtaacccg gccacaaacg agacataaaa 1381 atatcaattg aacaacacaa cagatccttg tttcatctac ggcgtcgcag ttaggtcttt 1441 ttttatcaat tgttaagcgt ttcttcccga ttattagcga tctttgccat caaaaaataa 1501 acgatatgaa ttggattatc ctaacgattg ccggatgctt tgaggtgctg ttcactttct 1561 gcatggggaa agtgaaagaa acaaccggaa cgaaatgtac ctgtggttca ccggattcct 1621 gatcgcaact tttataagta tgttactact ggtgaaagcc acccagaatc tgcccatcgg 1681 cacagcttat gccgtgtgga caggaatagg cgctgtgggt acggtgctcg taggtatcct 1741 tgtgtttagc gaacccgtca gttttggaga ctcttcttta tagtgatgct gatcagttca 1801 ataatcgggc tgaagatggt ttcgcattga gggggaaatt catccctcag cacctccggc 1861 aattccgcca acgaacggat catcagatac tcctccgacg atgaatctcc tcgtgttcac 1921 atcgctcagc // LOCUS CLTSTRRA 142 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34684 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-A. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 142) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 36 77 STRR1 sequence rpt 78 119 STRR1 sequence BASE COUNT 28 a 64 c 19 g 31 t ORIGIN 1 tatttattcc catcacccaa tccccaatcc ccagtcccca ttacccctta tcccagaggg 61 gccccgagtt ccccaatccc cattacccct tatcccagag ggtccccgag ttccccagtc 121 cccagtcccc aatcctgact gg // LOCUS CLTSTRRB 73 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34685 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-B. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 73) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 19 a 33 c 5 g 16 t ORIGIN 1 ggagttctcc aatccccaat ccccagtccc caatccccat tccccattcc ccattcccct 61 caaactaaag aat // LOCUS CLTSTRRC 64 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34686 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-C. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 64) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 16 a 26 c 2 g 20 t ORIGIN 1 tttttccaaa tccccaatcc ctagtcccca atccctagtc cccaatccct aatcatttca 61 attc // LOCUS CLTSTRRD 49 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34687 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-C. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 49) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 9 a 23 c 5 g 12 t ORIGIN 1 ttcctctgct tccccagtcc ccagtcccca gtccccaccc ataaaagtt // LOCUS CLTSTRRE 74 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 1 (STRR1). ACCESSION M34688 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 1-E. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 74) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 6 a 33 c 3 g 32 t ORIGIN 1 aggttttgta ttttcccttt cccctttcac cttttcccct tcccctttca cctttcccct 61 ttccccaccc ctat // LOCUS CLTSTRRF 75 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34689 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-A. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 75) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 14 a 8 c 15 g 38 t ORIGIN 1 ctggtttgtt ttgttattgg tcattggtca tttgtcattg gtcatttgtc attagttatt 61 ggtaattcat atcaa // LOCUS CLTSTRRG 64 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34690 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-B. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 64) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 10 a 10 c 11 g 33 t ORIGIN 1 tttttgttgt ttgtcatttg tcattggtca tttgtcatta gtcctttgtc attcaaaccg 61 agtt // LOCUS CLTSTRRH 77 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34691 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-C. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 77) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 18 a 6 c 17 g 36 t ORIGIN 1 ccaatttgta ttagtcattt gttattagtc atttgtcatt tgtcatttgg ggtttgtaat 61 tggtaatagg aggaatt // LOCUS CLTSTRRI 85 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34692 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-D. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 85) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 25 a 0 c 25 g 35 t ORIGIN 1 atggggtaat tggtaattgg taattggtaa ttggtaattg gtaattggta attggtaatt 61 ggtaattggt aattggtaat agatt // LOCUS CLTSTRRJ 80 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 2 (STRR2). ACCESSION M34693 KEYWORDS s. SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 2-E. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 80) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 13 a 13 c 17 g 37 t ORIGIN 1 tcccatgtca tttgtcattt gtcattggtt attggttatt ggtcatttgt catttggggt 61 ttggtaattg acctcaactc // LOCUS CLTSTRRK 76 bp ds-DNA BCT 14-JUL-1990 DEFINITION Callothrix sp. short tandem repeat family 3 (STRR3). ACCESSION M34694 KEYWORDS . SOURCE Callothrix sp. (strain PCC 7601) DNA, clone 3. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Mazel,D., Houmard,J., Castets,A.M. and Tandeau de Marsac,N. TITLE Highly repetitive DNA sequences in Cyanobacterial genomes JOURNAL J. Bacteriol. 172, 2755-2761 (1990) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 25 a 15 c 15 g 21 t ORIGIN 1 ttttaagtct gtgaacagtc aacagtcaac agtcaacagt taacagttga aaggacacag 61 cagtgctatg tccttt // LOCUS CPACM559B 1076 bp ds-DNA ORG 14-JUL-1990 DEFINITION C.paradoxa cyanelle cytochrome b-559 (psbE, psbF, psbI, and psbJ) genes, complete cds. ACCESSION M35129 KEYWORDS cytochrome b-559. SOURCE C.paradoxa cyanelle DNA. ORGANISM Cyanelle Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa. REFERENCE 1 (bases 1 to 1076) AUTHORS Cantrell,A. and Bryant,D.A. TITLE Nucleotide sequence of the genes encoding cytochrome b-559 from the cyanelle genome of Cyanophora paradoxa JOURNAL Photosyn. Res. 16, 65-81 (1988) STANDARD simple staff_review FEATURES from to/span description pept 246 470 cytochrome b-559 apoprotein (psbE) pept 510 638 cytochrome b-559 apoprotein (psbF) pept 648 764 cytochrome b-559 apoprotein (psbI) pept 787 909 cytochrome b-559 apoprotein (psbJ) signal 926 951 pot. transcription termination signal BASE COUNT 367 a 142 c 160 g 407 t ORIGIN 1 tctttaagta gtcagatatc tggaaatact tggaaaaaag aaacaagtac agacaatatt 61 ccttcaaatt tctataaaat tagttttata gataaagaag taggtttata ctaggaaatc 121 agggaacact tttacgatat gtatcattat aatagaataa ttaatatctt ataatttgaa 181 taaaatcttt tattatacat aaaagcatca tatatatata tatataattt ttaggagtga 241 tttatatgtc tggaggaact actggcgaac gcccattttc tgacattgtt actagtattc 301 gttattgggt tattcatact gtaactattc cattctttat tgttgcaggt tggctttttg 361 taagtactgg tttagcttat gatgtatttg gtactccaag accagatgaa tacttcactg 421 aagaacgtca agaagtacca attattaatc aacgtttttc aactaattaa tcgttattaa 481 tttatagttt ctaaaagagg aaaaaggaaa tgaataatcc taatcaaccg gtttcttatc 541 caatttttac agttagatgg ttagcaattc atgctattgg aattccagct gtatttttta 601 ttggatctat tactgcaatg caatttattc aacgatagga gatatatatg gttagccaaa 661 atcctaatag acaaaaagtt gaattaaatc gtacttccct attctgggga ttacttttaa 721 ttttcgtatt agcaatttta ttctctagct acatttttaa ctaaatttct tcaataagga 781 gtctttatgg caaatactgg tggacgcatt cctttatggc ttgttgctac agttgcaggt 841 ttagcagcta ttggtgtact aggaatcttt ttctatggtg gttactctgg tttaggttcc 901 tctatttaat aaaatcaatt taagattttt tgtaataaat taattaaaaa aaaatagaga 961 gttaaatttt tttaactctc tattctttag acatttataa ctaataaaag aaattaagca 1021 atttcatctg tttcaatata aacaaataat aatgccataa cattgctggt aagaca // LOCUS KVSM121M 612 bp ds-RNA VRL 14-JUL-1990 DEFINITION Killer virus of S.cerevisiae (class M1, type 1), M1-2 region mRNA fragment. ACCESSION M35150 KEYWORDS . SOURCE Killer virus of S.cerevisiae (strain A364A x S7) class M1 type 1, cDNA to viral RNA, clones pEH-[2,3]. ORGANISM Killer virus of S.cerevisiae Viridae; ds-RNA nonenveloped viruses. REFERENCE 1 (bases 1 to 612) AUTHORS Georgopoulos,D.E., Hannig,E.M. and Leibowitz,M.J. TITLE Sequence of the M1-2 region of killer virus double-stranded RNA JOURNAL Basic Life Sci. 40, 203-213 (1986) STANDARD simple staff_review BASE COUNT 205 a 140 c 141 g 125 t 1 others ORIGIN 1 aaagaaagaa cnaaaagaaa cagaaaaaga gagaacagga caacaaacgc aacaaaacac 61 aaacacaagc acactcacct tgagtctaac tggtggcacg cagcatatct caccctgaga 121 ctaactggcg gcaggcgacc gtgagcatac agcatgcccc actcgattcg agacgcgatt 181 cgcgctcgta ggtatcgagc ggctacgttg agctattatg gcagtgacat gcgattcgcg 241 cactgccaag atcagctcag caaagttaag accagtatcg gatatggtag actactacaa 301 ttcgcacagg tatgagattc tcagtctagt gtatggatga gtagttgagc caatgaatct 361 agggtttaaa ttactatgca ttgacatata acaggtacaa gcgtagataa tacttactag 421 gccccagccg gtacaccctg tattgaataa atacgactat ttggccaggt ctggacgggg 481 cagtcgaatt actaggttga gcacacacac gtgaatcaca caacataaca gtgtaggaac 541 ataatgtgcc attcgtagtc tgagacgccg ctagcctggt ttaatgcaac agcatagaag 601 aaacacacat ca // LOCUS PIGHMG2 1153 bp ss-mRNA MAM 14-JUL-1990 DEFINITION Pig non-histone chromosomal protein (HMG2) mRNA, complete cds. ACCESSION J02895 KEYWORDS non-histone chromosomal protein. SOURCE Pig thymus, cDNA to mRNA, clone pCD-MK02. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1153) AUTHORS Shirakawa,H., Tsuda,K.-i. and Yoshida,M. TITLE Primary structure of non-histone chromosomal protein HMG2 revealed by the nucleotide sequence JOURNAL Biochemistry 29, 4419-4423 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly provided by M.Yoshida, 07-DEC-1989. FEATURES from to/span description pept 144 776 non-histone protein HMG2 precursor matp 147 773 non-histone protein HMG2 mRNA < 1 1153 HMG2 mRNA BASE COUNT 361 a 224 c 301 g 267 t ORIGIN 53 bp upstream of HaeIII site. 1 acaaaccagt tcacgccgga gccgcaggta ggcagcgtcg cggtcggacc cggccgctgc 61 gcgggagcct gaggagaagc taccaccagg caagagaccc ctccgggccc cggtggacgc 121 gtcgtcgccg gcgccgcagc accatgggga agggcgaccc caacaagccg cggggcaaga 181 tgtcctcgta cgccttcttc gtgcagacct gccgggagga gcacaagaag aaacaccccg 241 attcctcggt caacttcgcc gagttctcca agaagtgctc cgagcgatgg aagactatgt 301 ctgccaagga aaagtccaag tttgaagata tggcaaaaag tgacaaagct cgctatgacc 361 gggagatgaa aaattacgtc cctcccaagg gtgacaagaa gggcaagaaa aaagatccca 421 atgctcccaa aaggcctcca tctgccttct tcctgttttg ctctgaacat cgcccaaaga 481 tcaaaagtga acaccctggc ttatccattg gggatactgc aaagaaattg ggtgaaatgt 541 ggtctgagca gtcagccaaa gataaacaac cgtatgaaca gaaagcagct aagctaaagg 601 aaaaatatga aaaggatatt gctgcatacc gtgccaaggg taagggtgaa gcaggaaaga 661 agggccctgg caggccaaca ggctctaaga agaagaatga accagaagat gaggaggaag 721 aggaagagga ggaagaagat gaagacgagg aggaagaaga tgaagatgag gaataaatgg 781 ctatcctgta atgttgtgtg tggagtgtgt gtgtgtgctc aggcaattat tttgctaaga 841 atgtgaattc aagtgcagct caatattagc ttcagtataa aaactgtaca gatttttgta 901 tagctaataa gattctttgt agagaaaata ctttttttaa aagtgcaggt tgcagctttt 961 tgaggggcta ctacatacag ttagatttta aagcttctga tgttgaatgt ttctaaatat 1021 ttaatggttt ctttaatttc ttgtgtatgg taacagcaaa ctcgtagaaa taagtatcaa 1081 tagcaaattt tgggttttct agaatgttgc attttgtttt ttaaaaaaat tttgtaataa 1141 aattatgtat att // LOCUS PSEHUTH 2249 bp ds-DNA BCT 14-JUL-1990 DEFINITION P.putida histidine ammonia-lyase (hutH) gene, complete cds. ACCESSION M35140 KEYWORDS histidase; histidine ammonia-lyase. SOURCE P.putida (ATCC 12633) DNA, clone pMC4. ORGANISM Pseudomonas putida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2249) AUTHORS Consevage,M.W. and Phillips,A.T. TITLE Sequence analysis of the hutH gene encoding histidine ammonia-lyase in Pseudomonas putida JOURNAL J. Bacteriol. 172, 2224-2229 (1990) STANDARD simple staff_review FEATURES from to/span description pept 316 1848 histidine ammonia-lyase (hutH) precursor (gtg start codon (E.C. 4.3.1.3) matp 319 1845 histidine ammonia-lyase (hutH) BASE COUNT 392 a 777 c 699 g 381 t ORIGIN 1 ccatggtggt ggcgtgggca tgggcttctc tcagcactcg ggcatggtca tcgtctgcga 61 cggcaccgat gaagccgccg agcgcatcgc ccgtgtactg accaacgacc cagggactgg 121 cgtcatgcgc cacgccgatg ccggttatga catcgccatc gactgcgcca aggagcaggg 181 cctggacctg ccgatgatca ctggctgatt gccacgcttt ggatcgcacc ccgccagggg 241 tgatactgaa caacaagaag gagcgcgcag gcacccacga acggcctgcg gctccacgcg 301 atttggagta gtaacgtgac cgaactcacc ctcaagcccg gcaccctgac cctggcccag 361 ctgcgcgcga tccatgccgc gcccgtgcgc ctgcaactgg atgccagcgc cgcgccggcc 421 atcgatgcca gcgtcgcctg tgtcgagcag atcattgccg aagaccgcac tgcctacggc 481 atcaacaccg gtttcggcct gctggcctcg acccgcatcg ccagccacga cctggaaaac 541 ctgcagcgct cgctggtgct gtcccacgcc gctggcattg gcgcgccgct ggatgacgat 601 ctggtgcggt tgatcatggt gctgaaaatc aacagcctca gccgtggctt ctccggcatt 661 cgccgcaaag tcatcgatgc gctgatcgcc ctggtcaacg ccgaagtcta cccgcacatc 721 ccgctgaaag gctcggtggg tgcttccggc gacctggcgc cgctggcgac catgtcgctg 781 gtgctgctgg gcgaaggcaa agcccgctac aagggccagt ggctgtcggc caccgaagcc 841 ctggcggttg ccggcctcga accgctgacc ctggctgcca aagagggcct ggccctgctc 901 aacggcaccc aggcgtccac cgcctatgcc ctgcgtggcc tgttctatgc cgaagacttg 961 tacgctgcgg ctatcgcctg cggcggcctg agcgtcgaag ccgtactggg ctcgcgttcg 1021 ccgttcgatg cgcgtattca cgaagcgcgt ggccagcgcg gccagatcga cactgccgcg 1081 tgcttccgcg acctgctggg cgattccagc gaagtgtcgc tgtcgcacaa gaactgcgac 1141 aaggtccagg acccgtactc gctgcgctgc cagccgcagg tcatgggcgc gtgcctgacc 1201 cagctgcgcc aggctgccga ggtgctgggc atcgaagcca acgccgtgtc ggacaacccg 1261 ctggtgttcg ctgccgaggg tgacgtgatt tccggtggca acttccacgc agaaccggtg 1321 gccatggccg ccgacaacct ggccctggcc atcgccgaaa tcggttcgct cagcgagcgc 1381 cgcatctcgc tgatgatgga caagcacatg tcccagctgc cgccgttcct ggtggaaaac 1441 ggtggggtca actccggctt catgatcgcc caggtcaccg ctgccgctct ggccagcgag 1501 aacaaggccc tgtcgcaccc gcacagcgtc gacagcctgc cgacctcggc caaccaggaa 1561 gaccacgtat cgatggcccc ggctgccggc aagcgcctgt gggaaatggc cgaaaacacc 1621 cgtggcgtgc ctgccatcga atggctgggc gcatgccagg gcctggacct gcgcaaaggc 1681 ctgaagactt cggccaagct ggagaaagca cgccaggcgc tgcgcagcga agtggcgcac 1741 tacgaccgtg accgtttctt cgcgccggac atcgaaaagg ctgtggaact gttggccaag 1801 ggtagtttga ccggtttgct gccggcaggt gtgctgccaa gcctgtaatg ccccctgggg 1861 cgctgcgcct tcgcagcaca aggctgctcc tgcaaaagcg cacgcgccct tgtacggaca 1921 gcctgtgctg cgatgggctg catgccagcc ccctgacctc acgaccgcag gacgattcaa 1981 ggacgccgac acgcaacaag ctcaaggtct caagtgcggg ctaaatgccc gccacatccg 2041 tttcatggcc ctcggttctg ccatcggaac cgggctgttc tacggctccg cctcagccat 2101 ccagatggcc ggcccggccg tgctgctggc ttacctgatc ggcggtgccg ctgtgttcat 2161 ggtcatgcgc gccctcggcg aaatggccgt gcacaacccg gtggccggct ccttcggcca 2221 ctacgccagt acctacctcg gccccatgg // LOCUS RATGABAAD 1799 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat GABA-A receptor delta subunit mRNA, complete cds. ACCESSION M35162 KEYWORDS GABA-A receptor delta subunit; gamma-aminobutyric acid receptor. SOURCE Rat adult brain, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1799) AUTHORS Zhao,Z.-Y. and Joho,R.H. TITLE Isolation of distantly related members in a multigene family using the polymerase chain reaction technique JOURNAL Biochem. Biophys. Res. Commun. 167, 174-182 (1990) STANDARD simple staff_review REFERENCE 2 (sites; corrections to figure 3 for [1]) AUTHORS Zhao,Z.-Y. and Joho,R.H. TITLE Isolation of distantly related members in a multigene family using the polymerase chain reaction technique JOURNAL Biochem. Biophys. Res. Commun. 168, 887-887 (1990) STANDARD simple staff_review FEATURES from to/span description pept 50 1399 GABA-A receptor delta subunit BASE COUNT 385 a 512 c 500 g 402 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccctg gggctagccg gacctgtccc gcgcacagcc cgcaaggcca tggacgttct 61 gggctggctg ctgctgccgc tccttctgct gtgcacgcag ccgcaccatg gcgccagagc 121 aatgaatgac attggggact acgtgggctc caacctggag atatcctggc tccccaacct 181 ggatggacta atggagggct acgcccgaaa cttccgacca ggcattggag gtcctccagt 241 gaatgtggcg cttgccctag aggtggccag cattgaccac atctcagaag caaatatgga 301 atacaccatg acagtgttcc tgcaccagag ctggcgagac agcaggctgt cctacaacca 361 taccaacgag accctgggcc tggatagccg cttcgtggac aagctgtggc tccctgacac 421 cttcattgtg aatgccaagt ctgcctggtt ccatgatgtg accgtggaaa acaagcttat 481 ccgcctacag cccgacggtg tgattttata cagcatccgc atcacctcca cagtggcctg 541 tgacatggac cttgccaagt accccatgga cgagcaggag tgcatgctgg acctggagag 601 ctatggctac tcttctgagg acattgtcta ttattggtca gaaaaccagg agcagatcca 661 cgggctggac aggctgcaac tggcccagtt cactatcacc agttaccgct tcaccacgga 721 gctgatgaac ttcaaatcag ctggccagtt ccctcgactc agcttacact tccagcttcg 781 gaggaaccgg ggtgtctaca tcatccagtc ttacatgccc tctgtcctcc tggttgccat 841 gtcctgggtc tccttctgga ttagccaagc agcagtgcct gccagagtat ctctaggcat 901 caccactgtg ctgacaatga ccacactcat ggttagtgcc cgctcctccc tcccgcgggc 961 ttctgctatc aaggctctgg atgtgtattt ctggatctgc tatgtcttcg tgtttgctgc 1021 cctggtggag tatgcatttg cccacttcaa tgctgactac aggaagaaac ggaaagccaa 1081 ggtcaaggtc acgaagccaa gggcagagat ggacgtgagg aacgccattg tcctcttctc 1141 cctctctgct gctggggtca gccaggagtt ggctatctcc cgccgtcaag gccgggtccc 1201 tgggaacctc atgggttcct ataggtctgt agaagtggag gcaaagaagg agggggggtc 1261 ccgcccagga ggcccaggag gcatccgttc cagactcaaa cccatcgatg cagacaccat 1321 cgacatctat gcccgcgctg tgttcccggc agcctttgca gcagtcaaca tcatctactg 1381 ggcggcgtat accatgtgac ggcagtgcgt acaccacatg acggctttac atgtaccacg 1441 tgcggtggtt gccacaaact cctggaggaa cagccctctc cgtggatccc aaagtgggac 1501 agagaggagc gtccgctccc tgggctttac ctcaattttg ctctaccagc ctgcacttcc 1561 tgtagatgga gcaggatggg ctttgcacag agggccatta gttggcctgt ccctgaagct 1621 gagcctgcac taaggtgccc ttgggagctg tgaagtttta gactcaggat gggcctgatt 1681 tctgggcctt tgctctgtgg gatcaggatc agagagaaag agagtagggg tggatgacca 1741 ctctcatggt gaactggagg agaagggcat tatgaggcct ctttggtccc agggaattc // LOCUS RPVCTSI 2195 bp ds-DNA VRL 14-JUL-1990 DEFINITION Raccoonpox virus Sal1 end fragment DNA. ACCESSION M23018 KEYWORDS . SOURCE Raccoonpox virus (strain V71-I-85A) DNA, passed in human tk- 143 cells. ORGANISM Raccoonpox virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 2195) AUTHORS Parsons,B.L. and Pickup,D.J. TITLE Tandemly repeated sequences are present at the ends of the DNA of Raccoonpox virus JOURNAL Virology 161, 45-53 (1987) STANDARD simple staff_review BASE COUNT 880 a 225 c 373 g 717 t ORIGIN 1 ataattaatt taattttaaa tattttattt aatgtctaga aaaaaatgtg taacccagac 61 tgtagagaac tctctaatgt ggtgaaaaaa tcaatccctt aatcgagact atcagaaaga 121 gacgaaacat tttttagtga gacattagaa agagaaagag acgaaacatt ttttagtgag 181 acattagaaa gagaaagaga cgaaacattt ttagtgagac attagaaaga gaaatatttt 241 tttgtaaaac ttttttatga ctccattaga aagagatgaa acatttttag tgagacatta 301 gaaagagaaa tatttttttg taaaactttt ttatgactcc attagaaaga gacgaaacat 361 ttttagtgag acattagaaa gagaaatatt tttttgtaaa acttttttat gactccatta 421 gaaagagacg aaacattttt agtgagacat tagaaagaga aatatttttt tgtaaaacgt 481 tttcttaggt caaaagtcaa tgttttaaaa tatggactag aatatgtctg tgtaaaaact 541 tgtgaaattt tgtaccaaag aaaaacgtga gcagtatccc ctacatggat tttactagat 601 catttatgta acaaaaaatt atattattcg tgtacgttaa ttataaaaat attgtatgat 661 atatcgaggt ccattagcaa aaggtgttga tagacaatta atcgtgtgtg atgtaaccat 721 gattattaca gcagaaagag taagagtatt ttttgtaaaa cttttttatg actccattag 781 aaagagaaag agacgaaaca tttttagtga gacattagaa agagacgaaa catttttttt 841 gtaaaacttt tttatgactc cattagaaag agaaagagac gaaacatttt tagtgagaca 901 ttagaaagag aaatattttt tgtaaaactt tttttatgac tccattagaa agagaaatat 961 tttttgtaaa acttttttat gactccatta gaaagagacg aaacattttt attgagacat 1021 tagaaagaga aagagacgaa acattttttt gtaaaacttt tttatgactc cattagaaag 1081 agaaagagac gaaacatttt tagtgataca ttagaaagag aaatattttt tgtaaaactt 1141 ttttatgact ccattagaaa gagatgaaac atttttagtg agacattaga aagagaaaga 1201 gacgaaacat tttttttgta aaactttttt tatgactcca ttagaaagag aaagagacga 1261 aacattttta gtgagacatt agaaagataa agagacgaaa catttttttt gtaaaacttt 1321 ttttatgact ccattagaaa gagaaagaga cgaaacattt ttagtgagac attagaaaga 1381 gacgaaacat ttttagtgag acattagaaa gataaagaga cgaaacattt tttttgtaaa 1441 acttttttat gactccatta gaaagagaaa gagacgaaac atttttagtg agacattaga 1501 aagagacgaa acattttttt tgtaaaactt tttttatgac tccattagaa agagacgaaa 1561 catttttagt gagacattag aaagataaag agacgaaaca ttttttttgt aaaacttttt 1621 tatgactcca ttagaaagag aaagagacga aacattttta gtgagacatt agaaagagaa 1681 atattttttg taaaactttt ttatgactcc attagaaaga gatgaaacat ttttagtgag 1741 acattagaaa gagaaatatt tttttgtaaa acttttttat gactccatta gaaagagatg 1801 aaacattttt agtgagacat tagaaagaga aagagacgaa acattttttt ttgtaaaact 1861 tttttatgac tccattagaa agagatgaaa catttttagt gagacattag aaagagaaag 1921 agacgaaaca tttttttttt gtaaaacttt ttatgactcc attagaaaga gatgaaacat 1981 ttttagtgag acattagaaa gagaaagagt gacacgataa aacttgttac ttattgttag 2041 agaaaaagaa tcgactttaa tgcgaataca tttttaaatt tcaaaccgga aataagtgtc 2101 cggtacactt ttaaaattaa acgtatgtcg gaatccacaa ctaatcctga ggattacacg 2161 attcagacac acgatttaag ttttgtgtcg tcgac // LOCUS STYFLICDA 612 bp ds-DNA BCT 14-JUL-1990 DEFINITION S.muenchen phase 1 flagellin (antigen-determining region) (fliCd) gene, partial cds. ACCESSION M34824 KEYWORDS phase 1 flagellin antigen d. SOURCE S.muenchen (strain RKS3121) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 612) AUTHORS Smith,N.H., Beltran,P. and Selander,R.K. TITLE Recombination of Salmonella phase 1 flagellin genes generates new serovars JOURNAL J. Bacteriol. 172, 2209-2216 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 612 phase 1 flagellin (fliCd) (AA at 1) BASE COUNT 207 a 114 c 137 g 154 t ORIGIN 1 attgatttaa aagaaattag ctctaaaaca ctgggacttg ataagcttaa tgtccaggat 61 gcctacaccc cgaaagaaac tgctgtaacc gttgataaaa ctacctataa aaatggtaca 121 gatactatta cagcccagag caatactgat atccaaactg caattggcgg tggtgcaacg 181 ggggttactg gggctgatat caaatttaaa gatggtcaat actatttaga tgttaaaggc 241 ggtgcttctg ctggtgttta taaagccact tatgatgaaa ctacaaagaa agttaatatt 301 gatacgactg ataaaactcc gttagcaact gcggaagcta cagctattcg gggaacggcc 361 actataaccc acaaccaaat tgctgaagta acaaaagagg gtgttgatac gaccacagtt 421 gcggctcaac ttgctgctgc aggggttact ggtgccgata aggacaatac tagccttgta 481 aaactatcgt ttgaggataa aaacggtaag gttattgatg gtggctatgc agtgaaaatg 541 ggcgacgatt tctatgccgc tacatatgat gagaaacagg tacaattact gctaaacaac 601 cactatacag at // LOCUS STYFLICI 585 bp ds-DNA BCT 14-JUL-1990 DEFINITION S.typhimurium phase 1 flagellin (antigen-determining region) (fliCi) gene, partial cds. ACCESSION M34822 KEYWORDS phase 1 flagellin antigen i. SOURCE S.typhimurium (strain RKS3333) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 585) AUTHORS Smith,N.H., Beltran,P. and Selander,R.K. TITLE Recombination of Salmonella phase 1 flagellin genes generates new serovars JOURNAL J. Bacteriol. 172, 2209-2216 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 585 phase 1 flagellin (fliCi) (AA at 1) BASE COUNT 180 a 108 c 142 g 155 t ORIGIN 1 atcgatctga agcagatcaa ctctcagacc ctgggtctgg atacgctgaa tctgcaacaa 61 aaatataagg tcagcgatac ggctgcaact gttacaggat atgccgatac tacgattgct 121 ttagacaata gtacttttaa agcctcggct actggtcttg gtggtactga ccagaaaatt 181 gatggcgatt taaaatttga tgatacgact ggaaaatatt acgccaaagt taccgttacg 241 gggggaactg gtaaagatgg ctattatgaa gtttccgttg ataagacgaa cggtgaggtg 301 actcttgctg gcggtgcgac ttccccgctt acaggtggac tacctgcgac agcaactgag 361 gatgtgaaaa atgtacaagt tgcaaatgct gatttgacag aggctaaagc cgcattgaca 421 gcagcaggtg ttaccggcac agcatctgtt gttaagatgt cttatactga taataacggt 481 aaaactattg atggtggttt agcagttaag gtaggcgatg attactattc tgcaactcaa 541 aataaagatg gttccataag tattaatact acgaaataca ctgca // LOCUS STYFLICR 579 bp ds-DNA BCT 14-JUL-1990 DEFINITION S.heidelberg phase 1 flagellin (antigen-determining region) (fliCr) gene, partial cds. ACCESSION M34823 KEYWORDS phase 1 flagellin antigen r. SOURCE S.heidelberg (strains RKS539) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 579) AUTHORS Smith,N.H., Beltran,P. and Selander,R.K. TITLE Recombination of Salmonella phase 1 flagellin genes generates new serovars JOURNAL J. Bacteriol. 172, 2209-2216 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 579 phase 1 flagellin (fliCr) (AA at 1) BASE COUNT 181 a 118 c 126 g 154 t ORIGIN 1 atcgatctga agcagatcaa ctctcagacc ctgggtctgg atacgctgaa tgttcaacaa 61 aaatataagg tcagcgatac ggctgcaact gtcactggct atacagattc tgctactgct 121 attgacaaat ctacgtttgc tgcatcagca actaccttag gtggtactcc tgctattact 181 ggtgatctga agtttgatga tactactgga aaatattacg ctgatgtttc aggtactacg 241 gctaaagatg gtgtttatga agtaacagtt gcagccgatg gaaaagtcac tttaactggc 301 acaccaacag gaccaattac tgctggcttc ccttcaactg caacaaaaga tgttaaacaa 361 actcagcaag aaaacgctga tttgacagag gccaaagccg cattgacagc agcgggtgtt 421 gcagcggccg gcacagcatc tgttgttaag atgtcttata ctgataataa cggtaaaact 481 attgatggtg gtttagcagt taaggtaggc gatgattact attctgcaac tcaaaataaa 541 gatggttcca taagtattaa tactacgaaa tacactgca // LOCUS HUMFGF1A 3343 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds. ACCESSION M34641 KEYWORDS FGF receptor-1; fibroblast growth factor receptor; transmembrane glycoprotein; tyrosine kinase. SOURCE Human teratocarcinoma cell line Tera-2, clone 13, cDNA to mRNA, clone 1:1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3343) AUTHORS Wennstroem,S., Sandstroem,C. and Claesson-Welsh,L. TITLE cDNA cloning and expression of a human FGF receptor which binds acidic and basic FGF JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by L.Claesson-Welsh, 25-MAY-1990. Author address: L.Claesson-Welsh Ludwig Institute for Cancer Research Biomedical Center Box 595 S-751 24 Uppsala SWEDEN FEATURES from to/span description pept 10 2472 FGF receptor-1 precursor sigp 10 72 FGF receptor-1 signal peptide matp 73 2469 FGF receptor-1 BASE COUNT 766 a 957 c 911 g 709 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca 61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct 121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg 181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac 241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc 301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat 361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag 421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa 481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct 541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct 601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct 661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc 721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca 781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg 841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag 901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc 961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat 1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg 1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc 1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg 1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag 1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct 1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg 1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc 1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg 1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac 1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg 1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc 1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca 1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac 1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc 1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata 1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac 1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac 2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc 2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg 2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca 2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc 2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc 2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag 2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc 2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc 2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg 2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct 2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc 2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg 2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc 2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc 2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc 2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac 3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc 3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat 3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc 3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga 3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt 3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc // LOCUS BSUSPCR 2852 bp ds-DNA BCT 14-JUL-1990 DEFINITION B.subtilis spectinomycin resistance (spc) genes, complete cds. ACCESSION M31102 KEYWORDS spectinomycin resistance. SOURCE B.subtilis (strain 1A241) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2852) AUTHORS Yoshikawa,H. and Doi,R.H. TITLE The nucleotide sequence of the Bacillus subtilis spectinomycin resistance gene region JOURNAL Nucleic Acids Res. 18, 1647-1647 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.H.Doi, 04-JAN-1990. FEATURES from to/span description pept 52 549 spc ORF1 pept 563 742 spc ORF2 pept 773 1213 spc ORF3 pept 1215 2450 spc ORF4 (ttg start codon) pept 2505 > 2852 spc ORF5 BASE COUNT 830 a 535 c 673 g 814 t ORIGIN 1 tgatgcgctc gtgaagctgg acttaaattt taataaaaga aggagggaca catgattatg 61 cgtcgtattg acccaagcaa attagagtta gaagaacgct tagttacggt taaccgcgta 121 gcgaaaggtg gtcgtcgttt ccgcttcgca gctctagtcg ttgtcggtga caaaaacgga 181 cacgtaggat tcggtactgg taaagcacaa gaagtaccag aagcgattcg caaagctgtt 241 gaagatgcga aaaagaattt gattgaagta ccaatggttg gaactacaat tccacacgaa 301 atcatcggac gtttcggtgc aggtaacatc ttgttaaaac ctgcttctga aggtactgga 361 gttatcgctg gaggccctgt acgtgcggta cttgagctag ctggtgtagc tgatatcctt 421 tctaagtctt taggttctaa cacaccgatc aacatgattc gtgcaacact tcaaggttta 481 agtgaactta aacgtgctga agacgttgcg aagcttcgtg gaaaatctgt agaagaactg 541 ttaggataag gagggaacat taatggctaa attagaaatt accctcaaac gaagtgtaat 601 cggtcgcccg gaagaccaac gcgttactgt aagaacactt ggtttaaaga aaacaaacca 661 aactgttgtt catgaagaca acgctgcgat ccgtggtatg attaataaag tatctcattt 721 agtttctgtt aaagaacaat aaaaatattt gatcggataa ggaggtgtcc taatgaaact 781 tcatgaatta aaaccttcag aaggttcacg caaaacgcgt aatcgcgtag gtcgtggtat 841 tggttctggc aacggtaaaa cagctggtaa aggtcacaaa ggtcaaaacg ctcgttctgg 901 cggcggtgta cgccctggat tcgagggggg acaaatgcct ttattccaac gtcttcctaa 961 acgtggtttc actaacatca accgcaagga atacgctgta gtcaacctag acaaattgaa 1021 cggttttgca gaaggaacgg aagtcactcc tgaacttctt ctagagactg gtgttattag 1081 caaacttaat gcaggagtaa agattcttgg caacggtaaa ttagagaaaa aattaactgt 1141 aaaagccaat aaattctctg cttctgcgaa agaagctgtt gaagctgctg gcggtacagc 1201 tgaggtgatc taacttgttt aaaacaatct ccaactttat gcgtgtgagt gatatcagga 1261 ataaaatcat attcacttta ctcatgctta tcgtctttcg cataggtgcg tttattcctg 1321 tgccttacgt taacgctgaa gcgttacagg cacagtctca aatgggtgtt tttgatctcc 1381 ttaatacatt tggcggcggt gcgctttacc aattttccat tttcgcaatg ggaattactc 1441 cttatatcac ggcttcgatc atcattcagc tgcttcagat ggatgtggta ccgaagttta 1501 ccgagtggtc taagcaaggt gaagttggcc gccgtaaatt agctcagttc acaaggtact 1561 ttacgattgt gcttggtttc atccaagcgt taggtatgtc atatggattc aacaatctgg 1621 caaacggtat gctgatcgaa aaatccggtg tatcgacata tcttatcatt gctttagtgc 1681 tcactggcgg aactgccttt ttaatgtggc ttggggaaca aattacttct catggagtag 1741 gcaacggaat atcgatcatt atcttcgcgg ggattgtgtc tagtattcca aaaacaattg 1801 ggcaaatata tgagactcaa tttgtcggca gcaacgatca gttgtttatt catattgtga 1861 aagtcgcact tcttgtgatt gcgattttag cagttattgt tggagttatt ttcattcagc 1921 gaggtcagtc tacacacctt ccattgaaag tgaatcctgc aggggttatt ccggtaatct 1981 ttgcggttgc gtttttgata acgccgcgga cgatcgcgtc attctttgga acaaacgatg 2041 tgacaaagtg gattcaaaac aactttgata atacgcatcc ggtgggtatg gcgatatatg 2101 ttgcgttgat tattgccttt acgtactttt atgcttttgt acaggtaaac cctgaacaaa 2161 tggctgataa ccttaaaaaa cagggtggct atatcccggg ggttcgtcca gggaaaatga 2221 ctcaagatag aattacgagc attttgtatc gacttacgtt tgtgggttct atattcttag 2281 ccgtgatttc cattcttcct atctttttca ttcaattcgc tggattgcct caaagtgcac 2341 aaattggcgg aacatctttg ttaattgttg tcggggtagc cttggagaca atgaaacaac 2401 tagaaagcca gttggtgaaa cgaaactacc gtggatttat gaaaaactag aggaaatgga 2461 tttatccatt ccctcttaat aaagagagga cgggggaaat ccgaatgaac ttagtcttaa 2521 tggggcttcc tggtgccggt aaaggcacac agggcgaacg aattgttgag gattatggga 2581 ttcctcatat ctcaacagga gatatgttcc gtgctgctat gaaagaagaa acaccactcg 2641 gactcgaagc aaaatcttat attgataagg gagagctcgt acctgacgaa gtcacaatcg 2701 gaattgtaaa agagagactt ggcaaagatg attgtgaaag aggttttctt ctggacggat 2761 ttccgcgaac agtcgctcaa gccgaagctc ttgaagaaat tcttgaggaa tacggcaagc 2821 cgattgatta tgtcattaac attgaagtcg ac // LOCUS RATHDNFNT 1020 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat neurotrophin-3 (HDNF/NT-3) mRNA, complete cds. ACCESSION M34643 KEYWORDS neurotrophic factor; neurotrophin-3. SOURCE Rat adult (strain Sprague-Dawley) hippocampus, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1020) AUTHORS Ernfors,P., Ibanez,C.F., Ebendal,T., Olson,L. and Persson,H. TITLE Molecular cloning and neurotrophic activities of a protein with structural similarities to b-nerve growth factor: Developmental and topographical expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.P.Ernfors, 18-MAY-1990. FEATURES from to/span description pept 73 849 neurotrophin-3 (HDNF/NT-3) precursor sigp 73 120 neurotrophic factor/neurotrophin-3 (HDNF/NT-3) signal peptide matp 490 846 neurotrophic factor/neurotrophin-3 BASE COUNT 320 a 229 c 226 g 245 t ORIGIN 1 gtcgacgtcc ctggaaatag tcatacggat gccatggtta cttctgccac gatcttacag 61 gtgaacaagg tgatgtccat cttgttttat gtgatatttc ttgcttatct ccgtggcatc 121 caaggcaaca acatggatca aaggagtttg ccagaagact ctctcaattc cctcattatc 181 aagttgatcc aggcggatat cttgaaaaac aagctctcca agcagatggt agatgttaag 241 gaaaattacc agagcaccct gcccaaagca gaggcaccca gagaaccaga gcagggagag 301 gccaccaggt cagaattcca gccgatgatt gcaacagaca cagaactact acggcaacag 361 agacgctaca attcaccccg ggtcctgctg agtgacagca cccctttgga gccccctccc 421 ttatatctaa tggaagatta tgtgggcaac ccggtggtaa ccaatagaac atcaccacgg 481 aggaaacgct atgcagagca taagagtcac cgaggagagt actcagtgtg tgacagtgag 541 agcctgtggg tgaccgacaa gtcctcagcc attgacattc ggggacacca ggttacagtg 601 ttgggagaga tcaaaaccgg caactctcct gtgaaacaat atttttatga aacgaggtgt 661 aaagaagcca ggccagtcaa aaacggttgc agggggattg atgacaaaca ctggaactct 721 cagtgcaaaa cgtcgcaaac ctacgtccga gcactgactt cagaaaacaa caaactcgta 781 ggctggcgct ggatacgaat agacacttcc tgtgtgtgtg ccttgtcaag aaaaatcgga 841 agaacatgaa ttggcatctg tccccacata taaattatta ctttaaatta tatgatatgc 901 atgtagcata taaatgttta tattgttttt atatattata agttgacctt tatttattaa 961 acttcagcaa cccttacagt atataagctt ttttcataat cgggctgctc aaaaaaaaaa // LOCUS ABCRRAA 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION Acetobacter sp. (strain MB 58) 5S ribosomal RNA, complete sequence. ACCESSION M34766 KEYWORDS 5S ribosomal RNA. SOURCE Acetobacter sp. (strain MB 58) rRNA. ORGANISM Acetobacter sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 27 a 40 c 32 g 17 t 2 others ORIGIN 1 gatctggtgg ccatggcggg agcaaatcag ccgatcccat cccgaactcg gccgtcaaat 61 gccccagcgc ccatgatact ctgcctcaag gcacggaaaa gtcggtcgcc gccagayy // LOCUS ABCRRAB 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION Acetobacter sp. (strain 914) 5S ribosomal RNA, complete sequence. ACCESSION M34767 KEYWORDS 5S ribosomal RNA. SOURCE Acetobacter sp. (strain 914) rRNA. ORGANISM Acetobacter sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 27 a 42 c 29 g 18 t 2 others ORIGIN 1 nntctgatgg ccatggcggg agcaaatcac ccgatcccat cccgaactcg gccgtcaaat 61 gccccagcgc ccatgatact ctgcctcaag gcacggaaaa gtcggtcgcc gccagatc // LOCUS ANCRRAA 117 bp ss-rRNA RNA 14-JUL-1990 DEFINITION A.aquaticus 5S ribosomal RNA, complete sequence. ACCESSION M34775 KEYWORDS 5S ribosomal RNA. SOURCE A.aquaticus (strain Ma) rRNA. ORGANISM Ancylobacter aquaticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Nonprosthecate bacteria. REFERENCE 1 (bases 1 to 117) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 23 a 40 c 32 g 22 t ORIGIN 1 cctggtggcc attgcgagga gccccaaccc gatcccatcc cgaactcggc cgttaaactc 61 ctcagcgccg atggtactat gtctcaagac ctgggagagt aggtcgctgc caggcct // LOCUS BLTRRAA 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION B.viscosus 5S ribosomal RNA, complete sequence. ACCESSION M34774 KEYWORDS 5S ribosomal RNA. SOURCE B.viscosus (strain D7) rRNA. ORGANISM Blastobacter viscosus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Nonprosthecate bacteria. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 23 a 38 c 35 g 22 t ORIGIN 1 gcctggtggc tatggcgagg agtccgaacc cgatcccatc ccgaactcgg ccgttaaact 61 cctcagcgcc aatggtactt cgtctcaaga cgcgggagag taggtcgctg ccaggcct // LOCUS HYVRRAA 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION H.vulgare 5S ribosomal RNA, complete sequence. ACCESSION M34768 KEYWORDS 5S ribosomal RNA. SOURCE H.vulgare (strain NP-160) rRNA. ORGANISM Hyphomicrobium vulgare Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Prosthecate bacteria. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 22 a 34 c 37 g 26 t ORIGIN 1 gacctggtga ttatggcggg tggtctgcac ccgatctcat tccgaactcg gccgtgaaac 61 cgcccagcgc caatggtact tcgtcttaag acgcgggaga gtaggtcgtc gccaggtct // LOCUS HYVRRAB 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION Hyphomicrobium sp. 5S ribosomal RNA, complete sequence. ACCESSION M34769 KEYWORDS 5S ribosomal RNA. SOURCE Hyphomicrobium sp. (strain G10) rRNA. ORGANISM Hyphomicrobium sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Prosthecate bacteria. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 20 a 34 c 39 g 26 t ORIGIN 1 gacctggtga ttatggcggg gtggctgcac ccgatcccat tccgaactcg gccgtgaaac 61 gcccctgcgc cgatggtact tcgtcttaag acgcgggaga gtaggtcgtt gccaggtct // LOCUS MAGRRAA 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.agile 5S ribosomal RNA, complete sequence. ACCESSION M34759 KEYWORDS 5S ribosomal RNA. SOURCE M.agile (strain A20) rRNA. ORGANISM Methylomonas agile Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 29 a 29 c 36 g 22 t 3 others ORIGIN 1 gcttggtggc catagcgagc gtgaaccacc cgatcccatc ccgaactcgg aagtgaaacc 61 gcttagcgcc gatgatagtg cgacaggttg tcgtgcgaaa gtagggaatt gtmaagcnn // LOCUS MAGRRAB 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.rubra 5S ribosomal RNA, complete sequence. ACCESSION M34760 KEYWORDS 5S ribosomal RNA. SOURCE M.rubra (strain 15S) rRNA. ORGANISM Methylomonas rubra Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 30 a 31 c 33 g 23 t 2 others ORIGIN 1 gcttggtgac catagcaagc gtgaaccacc cgatcccatc ccgaactcgg aagtgaaacc 61 gcttagcgcc gatgatagtg tggcaggttg ccatgtgaaa gtaggtcatc gcmaagctn // LOCUS MAGRRAC 117 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.methanica 5S ribosomal RNA, complete sequence. ACCESSION M34761 KEYWORDS 5S ribosomal RNA. SOURCE M.methanica (strain 12) rRNA. ORGANISM Methylomonas methanica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 117) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 27 a 30 c 32 g 24 t 4 others ORIGIN 1 ncttggtgac catagcgagc gtgraccacc cgatcccatc ccgaactcgg aagtgaaacc 61 gctttgcgcc gatgatagtg tggcagtttg ccatgtgaaa gtaggtcatc gcmaagn // LOCUS MLCRRAA 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.capsulatus (strain ATCC 19069) 5S ribosomal RNA, complete sequence. ACCESSION M34758 KEYWORDS 5S ribosomal RNA. SOURCE M.capsulatus (strain ATCC 19069) rRNA. ORGANISM Methylococcus capsulatus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 30 a 36 c 36 g 16 t ORIGIN 1 gcctggcggc cagagcgagc gggaaccacc cgatcccatc ccgacctcgg aagtgaaacc 61 gcttagcgcc gatgatagtg caagaatacc tgtgcgaaag tagggaactg ccaggctc // LOCUS MLCRRAB 117 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.capsulatus (strain Y) 5S ribosomal RNA, complete sequence. ACCESSION M34762 KEYWORDS 5S ribosomal RNA. SOURCE M.capsulatus (strain Y) rRNA. ORGANISM Methylococcus capsulatus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 117) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 31 a 28 c 31 g 23 t 4 others ORIGIN 1 nyttggtgac catagcgagc gtgacccacc cgatcccatc ccgaactcgg aagtgaaacc 61 gcttagtgcc gatgatagtg tggcaatttg ccatgcgaaa gtagggaatt gcmaaan // LOCUS MTBRRAA 115 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.organophilum 5S ribosomal RNA, complete sequence. ACCESSION M34770 KEYWORDS 5S ribosomal RNA. SOURCE M.organophilum (strain NP-220) rRNA. ORGANISM Methylobacterium organophilum Prokaryota; Bacteria; Methylmonadaceae. REFERENCE 1 (bases 1 to 115) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 115 5S ribosomal RNA BASE COUNT 23 a 41 c 34 g 17 t ORIGIN 1 ccggcggcct gagcggtgtg ccagaacccg atcccatctc gaactcggcc gttaaacaca 61 ccagcgccca tggtactgtg ctccaagaca cgggagagtc ggtcgccgcc aggct // LOCUS MTBRRAB 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.extorquens 5S ribosomal RNA, complete sequence. ACCESSION M34771 KEYWORDS 5S ribosomal RNA. SOURCE M.extorquens (strain AM1) rRNA. ORGANISM Methylobacterium extorquens Prokaryota; Bacteria; Methylmonadaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 24 a 39 c 37 g 18 t ORIGIN 1 gcctggcggc ctgagcggtg tggaagaacc cgatcccatc tcgaactcgg ccgttaaaca 61 caccagcgcc gatggtactg tgtctcaaga cacgggagag tcggtcgccg ccaggccc // LOCUS MYLRRAA 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.trichosporium 5S ribosomal RNA, complete sequence. ACCESSION M34763 KEYWORDS 5S ribosomal RNA. SOURCE M.trichosporium (strain 4E) rRNA. ORGANISM Methylosinus trichosporium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 28 a 29 c 33 g 24 t 5 others ORIGIN 1 nncttggtga ctatagtgag cgtgaaccac ccgatcccat cccgaactcg gaagtgaaac 61 cgcttagcgc cgatgatagt gtggccgttt gccatgcgaa agtagggcat tgcmaagyn // LOCUS MYMRRAA 115 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.methylotrophus 5S ribosomal RNA, complete sequence. ACCESSION M34765 KEYWORDS 5S ribosomal RNA. SOURCE M.methylotrophus (strain NCIB 10515) rRNA. ORGANISM Methylophilus methylotrophus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 115) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 115 5S ribosomal RNA BASE COUNT 29 a 33 c 28 g 25 t ORIGIN 1 cttggggaca atagcggttt ggaaccaccc cttcccatct cgaacagggc cgtgaaacga 61 acttgcgcca atgatagtgt actcttcgta tgcgaaagta ggtcatcccc aagcc // LOCUS MYPRRAA 119 bp ss-rRNA RNA 14-JUL-1990 DEFINITION M.parvus 5S ribosomal RNA, complete sequence. ACCESSION M34764 KEYWORDS 5S ribosomal RNA. SOURCE M.parvus (strain 492) rRNA. ORGANISM Methylocystis parvus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria. REFERENCE 1 (bases 1 to 119) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 28 a 32 c 33 g 22 t 4 others ORIGIN 1 nncttggtga ccatagcgag cgtgaaccac ccgatcccat cccgaactcg gaagtgaaac 61 cgcttagcgc cgatgatagt gtggccgttt gccatgcgaa agtagggcat tgcmaagcn // LOCUS PSERRAA 118 bp ss-rRNA RNA 14-JUL-1990 DEFINITION P.stutzeri 5S ribosomal RNA, complete sequence. ACCESSION M34776 KEYWORDS 5S ribosomal RNA. SOURCE P.stutzeri (strain 8) rRNA. ORGANISM Pseudomonas stutzeri Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 23 a 35 c 35 g 25 t ORIGIN 1 gcctggtggc tatggcgagg aggcctaacc cgatcccatc ccgaactcgg ccgttaaact 61 cctcagtgcc gatggtacta tgtcttaaga cctgggagag taggtcgtcg ccaggcct // LOCUS RECRRAA 116 bp ss-rRNA RNA 14-JUL-1990 DEFINITION R.vacuolatum 5S ribosomal RNA, complete sequence. ACCESSION M34773 KEYWORDS 5S ribosomal RNA. SOURCE R.vacuolatum (strain Rv) rRNA. ORGANISM Renobacter vacuolatum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Nonprosthecate bacteria. REFERENCE 1 (bases 1 to 116) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 116 5S ribosomal RNA BASE COUNT 23 a 36 c 32 g 25 t ORIGIN 1 ctggtggcca ttgcgaggag ccccaattcg atctcactcc gaactcggcc gttaaactcc 61 tcagcgccga tggtactatg tctcaagacc tgggagagta ggtcgctgcc aggcct // LOCUS TUMRRAA 116 bp ss-rRNA RNA 14-JUL-1990 DEFINITION T.mutans 5S ribosomal RNA, complete sequence. ACCESSION M34772 KEYWORDS 5S ribosomal RNA. SOURCE T.mutans (strain U2) rRNA. ORGANISM Tuberoidobacter mutans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Nonprosthecate bacteria. REFERENCE 1 (bases 1 to 116) AUTHORS Bulygina,E.S., Galchenko,V.F., Govorukhina,N.I., Netrusov,A.I., Nikitin,D.I., Trotsenko,Y.A. and Chumakov,K.M. TITLE Taxonomic studies of methylotrophic bacteria by 5S ribosomal RNA sequencing JOURNAL J. Gen. Microbiol. 136, 441-446 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 116 5S ribosomal RNA BASE COUNT 23 a 36 c 33 g 24 t ORIGIN 1 ctggtggcca ttgcgaggag cctgaacccg actccactcc gaactcggcc gttaaactcc 61 tcagcgccga tggtactatg tctcaagacc tgggagagta ggtcgctgcc aggtct // LOCUS ANAPETF1A 1682 bp ds-DNA BCT 14-JUL-1990 DEFINITION A.variabilis ferredoxin (petF1) gene, complete cds. ACCESSION M35126 KEYWORDS ferredoxin. SOURCE A.variabilis (ATCC 29413; PCC 7937) DNA, clone pVA1. ORGANISM Anabaena variabilis Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Nostocaceae. REFERENCE 1 (bases 1 to 1682) AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M., Van Arkel,G. and Weisbeek,P. TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and Synechococcus sp. PCC 7942: Structure and regulation JOURNAL Photosyn. Res. 18, 179-204 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 794 1093 ferredoxin (petF1) binding 782 785 ribosomal binding site (put.) BASE COUNT 480 a 353 c 322 g 527 t ORIGIN 1 aagcttgatg taagccacct aaaagttgaa tgtgttcgac gtggagtaat ttttgctgtg 61 ctttggctaa ggcaatcagt tcttctgctt cagttaaatc tacagataaa ggatattcta 121 caattacgtg tttgccggca gttagagcag cacggcgatc gcaccatgat cacgattgat 181 ggtacaaata accactaaat ctatatcttc tcgttctact aactgttgcc agccagttat 241 cgcttcagcc tggtaatctt gagccaaagc ctgggttcgc tccagtgtac tacccgctat 301 ggcgactagg tgcgatcgct tatcctccaa aaaggcttct gcccgcagtt ttgctgcata 361 ccctgtccaa ctatgccaat acgtattgtt gcttgtgcca aagctacctc cgactcctat 421 ataatcctca ccagtcctgc taccaagcac agtaataaat ttttattttt caaaaaaagc 481 taggttaacg cagtaccagc ctaactgaag gttatccctg ttccagaact tttttatcat 541 attcttgatt caagccataa atttttctaa taactgtggg tttaacgtct ttaaagtata 601 aaaaaaattt attgttatct gctaactaaa tttcattact aatcgtggtg gatttcaagt 661 tatatacttt gatttttccc gtagtatcag aattgaacta aatttaattc tgtggctaat 721 cccctgagaa tagccgctaa gttctgcttt agcataactt atactgccga ttacaaaaga 781 gaggattacg gaaatggcaa cttttaaagt tacattgatc aacgaagcag aaggaaccag 841 caacacaatt gacgttcctg atgatgagta tattttagac gctgccgaag aacagggtta 901 tgacctaccc ttttcctgtc gtgcaggtgc ttgctccacc tgcgccggta aactagtatc 961 cggtactgtt gaccagtctg accaatcatt cttggatgac gatcaaatcg aagctggata 1021 tgtattgacc tgtgttgctt atccaacctc tgatgtaacc atccaaaccc acaaagaaga 1081 agacctctac taagagtcaa gttagcttta aaaatagagg ctagaggcta gcagctacag 1141 actagtccct agccttttta ttttttttca ttagggaaca gtctattggg actgggtgta 1201 agggtgtaag ggtatagggg tgtaatagtt tcaaacattt atactctttt caacccttga 1261 tatttcgttt tcatccgtaa gtcccacaaa aaataaatta tccaaaattg atggtttggt 1321 agggtgcgtc aatagaaatc atttctgagt gtatttaggc tctatcgcac tgacggacac 1381 tacattttgg atattttttg atctgaaagt cccttaaaat ttgatttctt cattcaggct 1441 gtgaagttgt gtaccaggat aataaaattg gagaattttg ctgttagacc aacctagttt 1501 agctaaagtt tgagcgccag tttgacttaa gcccactcca tgtcctaaac caccaccaat 1561 aaaagcgtat cccacaactc ttctttgcct ttgttgagag gtagtatgta aaaaagcgtg 1621 cttcttggcg cagcaaaggc actacgcact tcatctttgt gtagggtaaa aatgcccata 1681 tc // LOCUS ANIPETF1A 1074 bp ds-DNA BCT 14-JUL-1990 DEFINITION A.nidulans ferredoxin (petF1) gene, complete cds. ACCESSION M35127 KEYWORDS ferredoxin. SOURCE A.nidulans (strain R2, PCC7942) DNA, clone pA612. ORGANISM Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. REFERENCE 1 (bases 1 to 1074) AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M., Van Arkel,G. and Weisbeek,P. TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and Synechococcus sp. PCC 7942: Structure and regulation JOURNAL Photosyn. Res. 18, 179-204 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 504 803 ferredoxin (petF1) binding 493 496 ribosomal binding site (put.) BASE COUNT 234 a 298 c 281 g 261 t ORIGIN 1 ctgcagccgc tgcatgcgca gggcgttcga tttgagggct ttttgtggaa catcttggcg 61 cacatcatcg aagctctaac accgaggctg tctgagttgg ctagtcctgg cagctgggca 121 atctttagcg gcttgctaac cagtcaagcc gacactgtca gcgtcacttt ggaagagtac 181 ttttgggtga tccgcgatcg cgccagtcag ggagattggt gtcgtttggt cgcggatttt 241 cggccggaac gataaatctc actaatgctt agcttagagg gcttactggg agcgggccga 301 gtttgagccg tgattacccc tacgaacttt ccggccacgc tccattgctt agacataaaa 361 ttcccttatg tctagactgg cgattgatag catttctcgc ggcgcagttc gccctttggc 421 aacccatagt atcaatggga aaggtacggg caggctgtca atcgatgagc tctgccaccc 481 caaaagcgat agaggacacg ctcatggcaa cctacaaggt tacgctcgtc aatgctgccg 541 aaggcttgaa caccacgatc gacgtggctg acgataccta catcttggac gccgctgaag 601 agcaaggcat tgacctgcct tactcctgcc gtgctggtgc ttgctcgacc tgtgctggca 661 aagtcgtctc tggtaccgtc gaccaatcgg atcaatcctt cttggatgac gaccaaattg 721 cagcaggctt tgtcctgacc tgcgtcgcct atccgacctc cgatgtgacg atcgaaaccc 781 acaaagaaga agacctctac taagtcttgc ttcgactgct tcaatcctta gaattcaaat 841 caaattgcgg cttccaaatt gggagtcgct tttttatcgc cgtagtcagc agcgatcgcc 901 tcactaccgt ccagaaacag cctgcgatcg ccctgactta acgcccctcc cctagcaatc 961 ctgactcggt agagttcagg ccaaggactc aaggttgagg cgatcgttca gcaacgtctt 1021 gctctaggcg tccggcgtga ctgttgtggg gaagaggcga tcgccttcct gcag // LOCUS ASAASPPA 1764 bp ds-DNA PLN 14-JUL-1990 DEFINITION A.awamori aspergillopepsin A (pepA) gene, complete cds. ACCESSION M34454 KEYWORDS aspartic proteinase; aspergillopepsin. SOURCE A.awamori (NRRL3112 mutant strain UVK143f) DNA. ORGANISM Aspergillus awamori Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 1764) AUTHORS Berka,R.M., Ward,M., Wilson,L.J., Hayenga,K.J., Kodama,K.H., Carlomagno,L.P. and Thompson,S.A. TITLE Molecular cloning and deletion of the gene encoding aspergillopepsin A from Aspergillus awamori JOURNAL Gene 86, 153-162 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 177 496 aspergillopepsin (pepA), exon 1 548 825 aspergillopepsin, exon 2 878 1126 aspergillopepsin, exon 3 1186 1523 aspergillopepsin, exon 4 sigp 177 308 aspergillopepsin signal peptide matp 455 496 aspergillopepsin 548 825 aspergillopepsin 878 1126 aspergillopepsin 1186 1523 aspergillopepsin pre-msg 124 1764 pepA mRNA and introns (put.) pre-msg 125 1764 pepA mRNA and introns (put.) pre-msg 127 1764 pepA mRNA and introns (put.) IVS 497 547 pepA intron A IVS 826 877 pepA intron B IVS 1127 1185 pepA intron C signal 54 58 TATA box BASE COUNT 387 a 515 c 445 g 417 t ORIGIN 1 gtcgacttgg atgatggaga gttgatcgag gtcaatgagg agaggcttgc aagtataaga 61 agagactgct cgaccagcag aatggatctt cttgttcatc aaccaagagt ccaaggcttc 121 tttgtctggt tctatctctt ctccgaactc tcttgcttga cattctcgtg gtcaaaatgg 181 tcgtcttcag caaaaccgct gccctcgttc tgggtctgtc ctccgccgtc tctgcggcgc 241 cggctcctac tcgcaagggc ttcaccatca accagattgc ccggcctgcc aacaagaccc 301 gcaccatcaa cctgccaggc atgtacgccc gttccctggc caagtttggc ggtacggtgc 361 cccagagcgt gaaggaggct gccagcaagg gtagtgccgt gaccacgccc cagaacaatg 421 acgaggagta cctgactccc gtcactgtcg gaaagtccac cctccatctg gactttgaca 481 ccggatctgc agatctgtaa gcttccctgc tcgggtgttc gggcaaatcg tgactaacct 541 ggactagctg ggtcttctcg gacgagctcc cttcctcgga gcagaccggt cacgatctgt 601 acacgcctag ctccagcgcg accaagctga gcggctacac ttgggacatc tcctacggtg 661 acggcagctc ggccagcgga gacgtgtacc gggatactgt cactgtcggc ggtgtcacca 721 ccaacaagca ggctgttgaa gcagccagca agatcagctc cgagttcgtt cagaacacgg 781 ccaatgacgg ccttttggga ctggccttta gctccatcaa cactggtgag tcaatcctac 841 atcagccggg ttgacctacc tgctgaccga tagacagtcc agccaaaggc gcagacaacc 901 ttcttcgaca ccgtcaagtc ccagctggac tctccccttt tcgccgtgca gctgaagcac 961 gacgcccccg gtgtttacga ctttggctac atcgatgact ccaagtacac cggttctatc 1021 acctacacgg atgccgatag ctcccagggt tactggggct tcagcaccga cggctacagt 1081 atcggtgacg gcagctccag ctccagcggc ttcagcgcca ttgctggtaa gaaccgcctt 1141 catttaacac acaacttgtc cacctcttta ctaactagtg tatagacacc ggtaccaccc 1201 tcatcctcct cgatgacgaa atcgtctccg cctactacga gcaggtttct ggcgcctcag 1261 gagagacgga agccggtggc tacgttttct cttgctcgac caacccccct gacttcactg 1321 tcgtgattgg cgactacaag gccgttgttc cgggcaagta catcaactac gctcccatct 1381 cgactggcag ctccacctgc tttggcggta tccagagcaa cagcggtctg ggactgtcca 1441 tcctgggtga tgttttcttg aagagccagt acgtggtctt caactctgag ggccctaagc 1501 tgggattcgc cgctcaggct tagattatcc actgaagtgg agtctatgat ctgctgattg 1561 atccctcgac gatgaactac atgtggaaat gcatagcaga cgagggtgat ggtgatgatg 1621 ttgatttgat gatgacccgt acatacttga tgaagctcgg tacatatgca atgtgactgt 1681 atctatgtga tgatatatgt atcatctcat aacttttggc tatgagtgca ggataacact 1741 gaaccagtag tagtactttc cact // LOCUS BSUPSIF2A 2758 bp ds-DNA BCT 14-JUL-1990 DEFINITION B.subtilis protein synthesis initiation factor 2 (infB) gene, complete cds. ACCESSION M34836 KEYWORDS protein synthesis initiation factor 2. SOURCE B.subtilis (strain RS410) DNA, clones lambda-JET[1,2],pUK, and pJET1. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2758) AUTHORS Shazand,K., Tucker,J., Chiang,R., Stansmore,K., Sperling-Petersen,H.U., Grunberg-Manago,M., Rabinowitz,J.C. and Leighton,T. TITLE Isolation and molecular genetic characterization of the Bacillus subtilis gene (infB) encoding protein synthesis initiation factor 2 JOURNAL J. Bacteriol. 172, 2675-2687 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 381 2531 protein synthesis initiation factor 2 (infB) binding 369 375 ribosomal binding site (put.) BASE COUNT 917 a 535 c 662 g 644 t ORIGIN 1 atcgatgacc agattttcga tgaattgctg gaactggcgg aaaaggtgaa aaaataaaat 61 gtctggaatg gaatggtttc ccttgctggg tctggccaat cgagctcgta aggtcgtgtc 121 aggcgaagac ttggtaataa aagaaatcag gaatgcgcgt gcaaagcttg tcctgcttac 181 agaggatgca tcatctaaca cagcaaaaaa agtaactgac aagtgcaatt attataaagt 241 cccttataaa aaagtcgaga gtcgcgcggt tcttggacgc tctatcggta aagaagcccg 301 tgtcgttgtc gccgtcactg accaaggttt tgcgaataag ctgatcagct tgctcgatta 361 atatttttgg gggtgaacga atggctaaaa tgagagtata cgaatatgca aaagcgttaa 421 atgtttcaag taaggaaatt ttgaccgcac tgaagaacat ggatttagaa gtgaataatc 481 acatggccat gcttgaagaa aaggccatta aaaagctaga tgccaaatat aaaaaaggcg 541 gcgcacgtgc taaatctcaa aagccagcag aaacgaacaa aaacaaacag ccacaagggg 601 ttaatcagca atcagctgga aatcaaccaa ataaaattcg agacggaaag aagaatgacg 661 tgcagaataa tcaatttaac aaaaacaaga agaataacaa caacaaaaaa aataaacgca 721 acaacaacaa taataaaaac caacatcagc aaaagcctgt aaagccgaaa aaagagcttc 781 ctgagaaaat tacattctct ggcactttaa cagttggcgc acttgctgaa gagcttggca 841 aagagccttc agaactcatt aaaaagctga tgcttcttgg cgtaatggca accattaacc 901 aagagcttga taaagacaca atcgaactca ttgcatcaga atatggtgtt gaaacagaag 961 aggtcattgt gcttgaagaa acagagctgg aaaaatacga agagcctgat aatgaagagg 1021 atcttgaaat tcgtcctcct gtcgtgacaa tcatgggcca cgttgaccat gggaaaacaa 1081 cgcttcttga cagcatccgt aaaacaaagg ttgttgaagg ggaagcaggc ggaatcactc 1141 agcatatcgg cgcttatcaa attgaagaga acggcaagaa aatcacgttc ttggatacac 1201 cgggccacgc cgcatttaca acaatgcgcg cacgcggtgc agaagtaact gatattacga 1261 ttcttgtcgt agctgccgat gacggtgtca tgccgcaaac agttgaagcg attaaccatg 1321 caaaagcagc agaggttcca atcatcgttg ctgtgaataa aatagataaa gaatccgcaa 1381 accctgaccg tgtaatgcaa gaactgacgg aatacggact tgtccctgaa gcttggggcg 1441 gagaaaccat ttttgtgccg ctttccgctt taactggaaa aggcattgat gagctcgttg 1501 aaatgatttt gcttgtcagt gaggtagagg aactgaaagc gaatccgaat cgtcaggcaa 1561 aaggaacggt tattgaagct gaactcgata aaggcagagg gtcagtcgcg acgttgctcg 1621 tacagactgg aacactgcat gtcggtgatc cgatcgtagt cggcaataca tttggccgtg 1681 tccgtgcaat ggtcaacgac attggccgcc gtgtgaaaac tgccggcccg tcaactccgg 1741 ttgaaattac cggtttgaat gatgtccctc aagcgggaga ccaattcctt gtctttaagg 1801 atgaaaaaac agctcgttct gtcggtgaag cccgtgcttc gaaacagctt gaagagcagc 1861 gcagcgataa agcgaagctc agtcttgatg atttatttga gcaaattaag caaggtgatg 1921 taaaagacat caacctcatc gtaaaagctg acgttcaagg atctgctgaa gctttaacgg 1981 ctgcacttca aaaaattgaa gtagaaggcg ttaaagtgaa aatcatccat acaggcgttg 2041 gtgcgattac tgaatcagac attatcttgg catctgcttc caatgcaatt gttatcgggt 2101 ttaatgtgag accggacgga aatgctaaga gtacggctga agctgaaaat gtagatattc 2161 gacttcaccg tatcatttac aaagtaatcg acgagattga agctgccatg aaaggtatgc 2221 ttgatcctga atatgaagaa aaagtaattg gtcaagtaga agtacgccaa acattcaaag 2281 tatctaaaat cggtacaatt gccggcggat atgttactga aggaaccatt acgcgcgaca 2341 gcggcctccg tttaattcgt gacggcgtcg tcatctttga aggcgaagta gatgttctga 2401 aacgctttaa agacgatgtg aaagaagttt cacaaggcta tgaatgtggt attacaatta 2461 agaaatacaa tgacattcgt gaaggtgaca tccttgaagc gtttgtcatg caagaaattg 2521 aaagaacgtg atcggatttg cggagtgtga atgcatcatt tatgatgcag gatcgctaaa 2581 agaaaagcgt gccgttctga agcggatttt aaccagggtt caaaacaagt tcaatgtttc 2641 gatttcggag attggctatc aggacacctg gcaaagaacc agcttcggaa tcgccgctgt 2701 ttcttcctct cgcgttcaaa cagaaaaaga actgcagcgc gtcctagcgt ttatcgat // LOCUS CIBRCRA 1845 bp ds-DNA BCT 14-JUL-1990 DEFINITION Plasmid Colicin Ib-P9 replication control region replicon initiator protein (repZ) gene, complete cds. ACCESSION M34837 KEYWORDS replicon initiator protein. SOURCE Plasmid Colicin Ib-P9 (from E.coli) DNA, clone pCH10-delta-1. ORGANISM Plasmid Colicin Ib-P9 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1845) AUTHORS Hama,C., Takizawa,T., Moriwaki,H., Urasaki,Y. and Mizobuchi,K. TITLE Organization of the replication control region of plasmid Co1Ib-P9 JOURNAL J. Bacteriol. 172, 1983-1991 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 455 1486 repZ protein mRNA 157 1762 repZ protein mRNA (alt.) mRNA 158 1762 repZ protein mRNA (alt.) mRNA 363 < 1 (c) inc RNA binding 442 448 ribosomal binding site (put.) signal 377 372 (c) inc -10 region signal 400 395 (c) inc -35 region mut 353 353 c in wild type; t in inc3 mut 374 374 t in wild type; c in inc2 mut 400 400 a in wild type; c in inc1 BASE COUNT 505 a 467 c 490 g 383 t ORIGIN 1 gatctaaagc agaaaaatct gcttttaaaa atagatttta tttttttgac gcaggtcaag 61 attaacttat tagaggtatc agtgaggagg cactggaaga gaagagatcg ttgtaatgct 121 tttcaaatta acgtaaagcg ggtatatttc ggctgttatt agctgtgcag agggtggcac 181 tctgtggagc aaagcggcga aagccggacg gcagaatgcg ccataaggca ttcaggagag 241 atggcatgta cgggcagtaa gtcagaagac tgaagatgtt ccggaagcca taaaaggaaa 301 acccccacta tctttcttac gaacttggcg gaaagacgaa agatagtggg ggcctcacag 361 aatacgggta aagtataatg aaaccgtacc agagattcaa ccctgtgcag tgtataaata 421 cacggcacaa tcgctccgcc ataagcgaca gcttgtggca ggtctgaaga atactccata 481 taacgcagta cactggagtc agttagcacc cgaagagcag atccgtttct gggaagacta 541 tgaagcggga agggcgacca ctttcctggt tgaaccggaa aggaagcgca cgaagcgtcg 601 tcgcggtgag cactccacca aacccaaatg cgaaaatccg tcctggtatc gtcctgagcg 661 ctataaagcg ctgaaagggc agctcgggca cgcctataac cgtctggtga aaaaggaccc 721 ggtgaccggc gagcagagcc tgcgcatgcg catgtcgcgg catccttttt atgtacagaa 781 acggacgttc gtcggtcgca aatacgcttt ccggccggaa aaacaacgtc tcctcgatgc 841 catctggccg gttctggtca gcttcagtga tgccggcaca cacaccgtgg ggatgagcgt 901 cactcggctg gctgaagaaa tcagcccgaa agacagcgag ggacatgtta ttccggaact 961 ggaagtgacg gtctcccgtc tttcccgtct gctggccgaa caggtgcgct ttggtgtgct 1021 gggtgtgtca gaggaaacaa tgtgggaccg tgaacatcgt cagcgtctgc cacgttacgt 1081 ctggataaca ccggcaggct ggcagatgct gggcgtcgac atggtaaaac ttcacgaaca 1141 gcagcagaaa cgactgcgtg aaagtgaaat ccgccagcag ctcattcggg aaggtgttct 1201 gcgtgaggat gaagatatct ccgttcatgc ggccagaaaa cgctggtatc tgcagcgcag 1261 ccaggatgcc ctgaagaagc gccgagaaaa agccgcagcc agcaagcgtg ccaaccgcct 1321 gaagaaatta cctgttgacc agcagattta tgagatggct gagtatctcc ggaagcgtct 1381 gccgccggat gaagcctatt tctgttccga tgaccatctg aagcgactgg ccatcaggga 1441 gttgcgtcag cttgaactga cgctggctgc cccgccaccg cactagacag catcattccc 1501 tcagcactga atcatcgcca gcccctccgg ggctttcggc gcagattccg ctcagcccaa 1561 aatccttagt agtcacctta aatcccctca gaggggcata tctgcccata aaaccacgca 1621 tcagtcatca gaacgtggcc acgttgtttc agttatccac ataaatccgc aaacaaagaa 1681 ttttaagaag ctgcaaacct gaaacagcaa acctgcaata tattcttaac cccatcattt 1741 aatcccctgc gttgcttcgc cgcagggaaa gtctttatct ctgaaaccac tgtgaacaaa 1801 tacaaaaagg ccttcgcttg cagcgggcaa ggccgcgccg gaatt // LOCUS ECOHEMBA 1085 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli porphobilinogen synthase (hemB) gene, complete cds. ACCESSION M35121 KEYWORDS 5-aminolevulinic acid dehydratase; hemB gene; porphobilinogen synthase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1085) AUTHORS Echelard,Y., Dymetryszyn,J., Drolet,M. and Sasarman,A. TITLE Nucleotide sequence of the hemB gene of Escherichia coli K12 JOURNAL Mol. Gen. Genet. 214, 503-508 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 80 1054 porphobilinogen synthase (hemB) signal 15 20 hemB -35 region signal 41 46 hemB -10 region BASE COUNT 254 a 277 c 301 g 253 t ORIGIN 1 tcgcgacaac tttcgtaaaa catccctacc ctgcttcagg tatactatgc ccctcgattc 61 cacaaacatc aggcagacca tgacagactt aatccaacgc cctcgtcgcc tgcgcaaatc 121 tcctgcgctg cgcgtatgtt tgaagagaca acacttagcc ttaacgacct ggtgttgccg 181 atctttgttg aagaagaaat tgacggacta caaagccgtt gaagccatgc caggcgtgat 241 gcgcattcca gagaaacatc tggcacgcga aattgaacgc atcgccaacg ccggtattcg 301 ttccgtgatg acttttggca tctctcacca taccgatgaa accggcagcg atgcctggcg 361 ggaagatgga ctggtggcgc gtatgtcgcg catctgcaag cagaccgtgc cagaaatgat 421 cgttatgtca gacacctgct tctgtgaata cacttctcac ggtcactgcg gtgtgctgtg 481 cgagcatggc gtcgacaacg acgcgactct ggaaaattta ggcaagcaag ccgtggttgc 541 agctgctgca ggtgcagact tcatcgcccc ttccgccgcg atggacggcc aggtacaggc 601 gattcgtcag gcgctggacg ctgcgggatt taaagatacg gcgattatgt cgtattcgac 661 caagttcgcc tcctcctttt atggcccgtt ccgtgaagct gccggaagcg cattaaaagg 721 cgaccgcaaa agctatcaga tgaacccaat gaaccctcgt gaggcgattc gtgaatcact 781 gctggatgaa gcccagggcg cagactgcct gatggttaaa cctgctggag cgtacctcga 841 catcgtgcgt gagctgcgtg aacgtactga attgccgatt ggcgcgtatc aggtgagcgg 901 tgagtatgcg atgattaagt tcgccgcgct ggcgggtgct atagatgaag agaaagtcgt 961 gctcgaaagc ttaggttcga ttaagcgtgc gggtgcggat ctgattttca gctactttgc 1021 gctggatttg gctgagaaga agattctgcg ttaattttgt ttcctctcag aaagggggag 1081 ttaac // LOCUS ECOSTMA 1418 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli F factor (stm) gene, complete cds. ACCESSION M35124 KEYWORDS F factor; stm gene. SOURCE E.coli DNA, clones pEG[157,159,175,176,220]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1418) AUTHORS Golub,E.I. and Panzer,H.A. TITLE The F factor of Escherichia coli carries a locus of stable plasmid inheritance stm, similar to the parB locus of plasmid RI JOURNAL Mol. Gen. Genet. 214, 353-357 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 830 1009 F factor (stm) signal 1013 1039 pot. transcription termination signal signal 1074 1105 pot. transcription termination signal BASE COUNT 279 a 364 c 429 g 345 t 1 others ORIGIN 1 atcgctgtct gacctgcgat ttttcatgcc gtccctgacc gcagaagaac tgcacggcaa 61 ccgtcttcag tggctgtatg gcatcgatgt gctgattgaa acacagggcg aggtctgcct 121 ccttcccctt cccggtgatg cagcagagcg gtcgtttccg tcggtgcggt ttcgtgtcag 181 ggagcgcagt cgccataaat ccgcgctggt catgcagaag tacagccgcc agcaggcacg 241 ggaggcagag cagaaagccc gtgcgtacca ggcgctggtc gctcaggctg aaatcgaact 301 ggcattccat tcgccggaaa cagtcggaag ctggcatgca cgctggagtg accgggtggc 361 agagcatgac ctggaaaccc tgttctggca gtggggcgag cgttttccgt cactggccgg 421 aatggaacgc tggcagtggc aggatatgcc gttcgtggca ggttattgcg gaacgcagtc 481 ttgcagcaag agaggccggt cacgctgtcc gtgagatgag cgtgatgtgc cgaacaaact 541 ccggaggcag cgtgatgcgg gcaacaatca cacagattac ccgtaaacag cctgaatgag 601 cgggttattt tcaggaaaag tatgtgtggt cagcgtgcag ggatatggct atgatgtgcc 661 cggcgcttga ggctttctgc ctcatgacgt gaaggtggtt tgttgccgtg ttgtgtggca 721 gaaggacaaa agccccatag ttaatttttc attaacccac gaggcccctg catgcttaga 781 caacatcagg atagcctctt actgcgcttt cgacgaggag aagaaggcca tgaaactacc 841 acgcagctct cttgtctggt gtgtgttgat cgtgtgtctc acactgttga tattcactta 901 tctgacacga aaatcgctgt gcgagattcg ttacagagac ggatacaggg aggtngcggc 961 cggatacagg gaggtggcgg ctttcatggc ttacgaatcc ggtaagtagc aacctggagg 1021 cgggcgcagg cttttcaggg ctgatgctgg tctgactgca ctgatgcctt tataaagggg 1081 ctgctggttc gccggtggcc cctttctcct tgctggcatt gtacgggcat gaacaaccag 1141 actgccagga aggccgtttc cctctgggaa cggggaagaa cgtgcgcctg cggccgttct 1201 tttcccccgc cttctctgtt tatgacctgt cagaaatcac atcaccttcc tttgattatt 1261 ctctttttct ctgttctgac tctgattact gctgccctga ttcctgtctg ttttgttgtt 1321 ctcctggcag tgtcttttcc gggcgtcctg gcggggcggc tggcgtgaac ggagccagcg 1381 aagctgtctc tggcgggctt ccatccctga cgggaaac // LOCUS HUMGPIIBA 3333 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human platelet glycoprotein IIb (GPIIb) mRNA, complete cds. ACCESSION M34480 KEYWORDS platelet glycoprotein IIb. SOURCE Human megakaryocytes, cDNA to mRNA, clone IIb[3,4]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3333) AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and Marguerie,G. TITLE GPIIb and GPIIIa amino acid sequences deduced from human megakaryocyte cDNAs JOURNAL Mol. Biol. Rep. 14, 27-33 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 33 3152 platelet glycoprotein IIb (GPIIb) mRNA 1 3333 GPIIb mRNA BASE COUNT 626 a 998 c 1040 g 669 t ORIGIN 1 attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag 61 ccctctggct tctggagtgg gtgctgctgc tcttgggagc ttgtgctgcc cctccagcct 121 gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt 181 ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg 241 ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga 301 gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg 361 taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca 421 gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga 481 ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc 541 gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg 601 attttagctg ggacaagcgt tactgtgaag cgggcttcag ctcggtggtc actcaggccg 661 gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc 721 cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc 781 agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg 841 tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca 901 cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc 961 ggctgcgcgc agagcagatg gcgtcgtatt ttgggcattc agtcgctgtc actgacgtca 1021 acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggac agccgggcag 1081 accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg 1141 cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg 1201 ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc 1261 cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc 1321 tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct 1381 tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag 1441 cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc 1501 agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca 1561 agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc 1621 ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc 1681 ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctggacctg gatctgggcg 1741 gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc 1801 gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg 1861 gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg 1921 tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga 1981 cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca 2041 acgagggcga gggggcctat gaagcagagc tggcggtgca cctgccccag ggcgcccact 2101 acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg 2161 agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga 2221 taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct 2281 tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg 2341 acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc 2401 tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca 2461 aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc 2521 tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac 2581 agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact 2641 gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca 2701 gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa 2761 gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc 2821 agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc 2881 tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg 2941 tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct 3001 tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc 3061 tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggcaca 3121 ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga 3181 gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt 3241 ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc 3301 ccctcctgct gcctaataaa gagactgagc cct // LOCUS HUMGPIIIAA 4487 bp ss-mRNA PRI 14-JUL-1990 DEFINITION Human platelet glycoprotein IIIa (GPIIIa) mRNA, complete cds. ACCESSION M35999 KEYWORDS platelet glycoprotein IIIa. SOURCE Human megakaryocytes, cDNA to mRNA, clone IIIa[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4487) AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and Marguerie,G. TITLE GPIIb and GPIIIa amino acid sequences deduced from human megakaryocyte cDNAs JOURNAL Mol. Biol. Rep. 14, 27-33 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 19 2385 plate glycoprotein IIIa (GPIIIa) mRNA < 1 2603 GPIIIa mRNA BASE COUNT 1046 a 1090 c 1186 g 1165 t ORIGIN 1 ccgcgggagg cggacgagat gcgagcgcgg ccgcggcccc ggccgctctg ggtgactgtg 61 ctggcgctgg gggcgctggc gggcgttggc gtaggagggc ccaacatctg taccacgcga 121 ggtgtgagct cctgccagca gtgcctggct gtgagcccca tgtgtgcctg gtgctctgat 181 gaggccctgc ctctgggctc acctcgctgt gacctgaagg agaatctgct gaaggataac 241 tgtgccccag aatccatcga gttcccagtg agtgaggccc gagtactaga ggacaggccc 301 ctcagcgaca agggctctgg agacagctcc caggtcactc aagtcagtcc ccagaggatt 361 gcactccggc tccggccaga tgattcgaag aatttctcca tccaagtgcg gcaggtggag 421 gattaccctg tggacatcta ctacttgatg gacctgtctt actccatgaa ggatgatctg 481 tggagcatcc agaacctggg taccaagctg gccacccaga tgcgaaagct caccagtaac 541 ctgcggattg gcttcggggc atttgtggac aagcctgtgt caccatacat gtatatctcc 601 ccaccagagg ccctcgaaaa cccctgctat gatatgaaga ccacctgctt gcccatgttt 661 ggctacaaac acgtgctgac gctaactgac caggtgaccc gcttcaatga ggaagtgaag 721 aagcagagtg tgtcacggaa ccgagatgcc ccagagggtg gctttgatgc catcatgcag 781 gctacagtct gtgatgaaaa gattggctgg aggaatgatg catcccactt gctggtgttt 841 accactgatg ccaagactca tatagcattg gacggaaggc tggcaggcat tgtccagcct 901 aatgacgggc agtgtcatgt tggtagtgac aatcattact ctgcctccac taccatggat 961 tatccctctt tggggctgat gactgagaag ctatcccaga aaaacatcaa tttgatcttt 1021 gcagtgactg aaaatgtagt caatctctat cagaactata gtgagctcat cccagggacc 1081 acagttgggg ttctgtccat ggattccagc aatgtcctcc agctcattgt tgatgcttat 1141 gggaaaatcc gttctaaagt cgagctggaa gtgcgtgacc tccctgaaga gttgtctcta 1201 tccttcaatg ccacctgcct caacaatgag gtcatccctg gcctcaagtc ttgtatggga 1261 ctcaagattg gagacacggt gagcttcagc attgaggcca aggtgcgagg ctgtccccag 1321 gagaaggaga agtcctttac cataaagccc gtgggcttca aggacagcct gatcgtccag 1381 gtcacctttg attgtgactg tgcctgccag gcccaagctg aacctaatag ccatcgctgc 1441 aacaatggca atgggacctt tgagtgtggg gtatgccgtt gtgggcctgg ctggctggga 1501 tcccagtgtg agtgctcaga ggaggactat cgcccttccc agcaggacga gtgcagcccc 1561 cgggagggtc agcccgtctg cagccagcgg ggcgagtgcc tctgtggtca atgtgtctgc 1621 cacagcagtg actttggcaa gatcacgggc aagtactgcg agtgtgacga cttctcctgt 1681 gtccgctaca agggggagat gtgctcaggc catggccagt gcagctgtgg ggactgcctg 1741 tgtgactccg actggaccgg ctactactgc aactgtacca cgcgtactga cacctgcatg 1801 tccagcaatg ggctgctgtg cagcggccgc ggcaagtgtg aatgtggcag ctgtgtctgt 1861 atccagccgg gctcctatgg ggacacctgt gagaagtgcc ccacctgccc agatgcctgc 1921 acctttaaga aagaatgtgt ggagtgtaag aagtttgacc ggggagccct acatgacgaa 1981 aatacctgca accgttactg ccgtgacgag attgagtcag tgaaagagct taaggacact 2041 ggcaaggatg cagtgaattg tacctataag aatgaggatg actgtgtcgt cagattccag 2101 tactatgaag attctagtgg aaagtccatc ctgtatgtgg tagaagagcc agagtgtccc 2161 aagggccctg acatcctggt ggtcctgctc tcagtgatgg gggccattct gctcattggc 2221 cttgccgccc tgctcatctg gaaactcctc atcaccatcc acgaccgaaa agaattcgct 2281 aaatttgagg aagaacgcgc cagagcaaaa tgggacacag ccaacaaccc actgtataaa 2341 gaggccacgt ctaccttcac caatatcacg taccggggca cttaatgata agcagtcatc 2401 ctcagatcat tatcagcctg tgccaggatt gcaggagtcc ctgccatcat gtttacagag 2461 gacagtattt gtggggaggg atttcggggc tcagagtggg gtaggttggg agaatgtcag 2521 tatgtggaag tgtgggtctg tgtgtgtgta tgtgggggtc tgtgtgttta tgtgtgtgtg 2581 ttgtgtgtgg gagtgtgtaa tttaaaattg tgatgtgtcc tgataagctg agctccttag 2641 cctttgtccc agaatgcctc ctgcagggat tcttcctgct tagcttgagg gtgactatgg 2701 agctgagcag gtgttcttca ttacctcagt gagaagccag ctttcctcat caggccattg 2761 tccctgaaga gaagggcagg gctgaggcct ctcattccag aggaagggac accaagcctt 2821 ggctctaccc tgagttcata aatttatggt tctcaggcct gactctcagc agctatggta 2881 ggaactgctg gcttggcagc ccgggtcatc tgtacctctg cctcctttcc cctccctcag 2941 gccgaaggag gagtcaggga gagctgaact attagagctg cctgtgcctt ttgccatccc 3001 ctcaacccag ctatggttct ctcgcaaggg aagtccttgc aagctaattc tttgacctgt 3061 tgggagtgag gatgtctggg ccactcaggg gtcattcatg gcctggggga tgtaccagca 3121 tctcccagtt cataatcaca acccttcaaa gatttgcctt attggcagct ctactctgga 3181 ggtttgttta gaagaagtgt gtcaccctta ggccagcacc atctctttac ctcctaattc 3241 cacaccctca ctgctgtaga catttgctat gacctgggga tgtctctcat gaccaaatgc 3301 ttttcctcaa agggagagag tgctattgta gagccagagg tctggcccta tgcttccggc 3361 ctcctgtccc tcatccatag cacctccaca tacctggccc tgagccttgg tgtgctgtat 3421 ccatccatgg ggctgattgt atttaccttc tacctcttgg ctgccttgtg aaggaattat 3481 tcccatgagt tggctgggaa taagtgccag gatggaatga tgggtcagtt gtatcagcac 3541 gtgtggcctg ttcttctatg ggttacaacc tcatttaact cagtctttaa tctgagaggc 3601 cacagtgcaa ttttatttta tttttctcat gatgaggttt tcttaactta aaagaacatg 3661 tatataaaca tgcttgcatt atatttgtaa atttatgtgt atggcaaaga aggagagcat 3721 aggaaaccac acagacttgg gcagggtaca gacactccca cttggcatca ttcacagcaa 3781 gtcactggcc agtggctgga tctgtgaggg gctctctcat gatagaaggc tatggggata 3841 gatgtgtgga cacattggac ctttcctgag gaagagggac tgttcttttg tcccagaaaa 3901 gcagtggctc cattggtgtt gacatacatc caacattaaa agccaccccc aaatgcccaa 3961 gaaaaaaaga aagacttatc aacatttgtt ccatgagcag aaaactggag ctctggcctc 4021 agtgttacag ctaaataatc tttaattaag gcaagtcact ttcttcttct taaagctgtt 4081 tctagtttga gaaatgatgg gattttagca gccagtcttg aaggtctctt tcagtatcaa 4141 cattctaaga tgctgggact tactgtgtca tcaaatgtgc ggttaagatt ctctgggata 4201 ttgatactgt ttgtgttttt agttgggaga tctgagagac ctggctttgg caagagcaga 4261 tgtcattcca tatcaccttt ctcaatgaaa gtctcattct atcctctctc caaacccgtt 4321 ttccaacatt tgttaatagt tacgtctctc ctgatgtagc acttaagctt catttagtta 4381 ttatttcttt cttcactttg cacacatttg catccacata ttagggaagg aataagtagc 4441 tgcaaactat ctattcctgt attattgtgt taacattgag ataaacc // LOCUS MUSCDX1A 1597 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Mouse Cdx-1 homeobox protein mRNA, complete cds. ACCESSION M35125 KEYWORDS Cdx-1 homeobox protein. SOURCE Mouse 8.5 day embryo, cDNA to mRNA, clone BH8. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1597) AUTHORS Duprey,P., Chowdhury,K., Dressler,G.R., Balling,R., Simon,D., Guenet,J.-L. and Gruss,P. TITLE A mouse gene homologous to the Drosophila gene caudal is expressed in epithelial cells from the embryonic intestine JOURNAL Genes Dev. 2, 1647-1654 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 158 736 Cdx-1 homeobox protein site 389 571 Cdx-1 homeobox BASE COUNT 326 a 523 c 444 g 304 t ORIGIN 1 gccgcgcccg ctgggccctc cgacctatgc gccccctggc ccggcgcccg cgcccgcagt 61 accccgactt cgcgggttac acgcacgtgg agccggcgcc cgcgccctcc gacctgggct 121 gcgcttccct gcgcccaagg acgactgggc agctgctatg gcccgggccc cacggcctca 181 gcccgcagcc cggccccgct ggccttcggg ccctccggac tttagcccgg tgcccgcgcc 241 tcccgggcct ggtcccggca tcctagcgca gtccctcggg gctccgggcg caccgtcctc 301 gccaggagcg ccaaggcgga cgccctacga atggatgcgg cgcagcgtgg cggctgcagg 361 cggctgtggc agcggtaaga cccgaaccaa ggacaagtca cgtgtggtct acacagacca 421 ccaacgccta gagctggaaa aggagtttca ctacagccgg tacatcacta tccggcgcaa 481 gtccgagctg gctgctaacc tggggctcac agagcggcag gtaaagatct ggttccagaa 541 ccgccgggcc aaggagcgca aagtaaacaa gaagaaacag cagcagcagc agcccctgcc 601 tcccacacag ctgcccctgc ccctggatgg cactcccaca ccatcagggc cacccctagg 661 aagtctatgc cctactaatg ctggccttct gggcaccccc tccccagtgc ccgtcaagga 721 ggagtttcta ccctagcccc ttgcagcctg gggtctaggg atctagggac ttgaatgctg 781 ggcagctggc cttttcgggg cccaaggacg tgttctgagt gcagccctgc cttggccctt 841 tggggtcact gtggacaaac tgcctaccta ggacaagtta gcttgccctc ttcctgcctt 901 ccgttggctg ggctgtgtgg tgagcctgtt ggaccaagga gctcttccag ctcttgtgtc 961 ctagggcaca cagggtgatg gggaagccca attagaagtc tcaatcacct ggacatttcc 1021 caaagattca gaggctcaac gagctgtcaa gacaagattg aggcttcagc gtctcctcca 1081 agggttccag agtgaggtgg gaggctggta tctggaccag actggctctg gagagtaggc 1141 ggccattgaa acaaggtggt acaccacagt ctagactccc atgtgaaggg aagatgctat 1201 ctgcccacct gtctcctctt ccagcctcac tttcacctgc cctagcaccc catgctctgg 1261 ccctccccag cctggagcta caaagccatc aggactgaca tgaggtagag gctgggcaat 1321 tggccactgc tcttgagaga gaactcgccc ccaccttagt gtccagggag gcctgcggat 1381 ggagagaagg cactatgagt ctacctacct gcacatgccc accttcccac aggctgccca 1441 tcacacccca tttagccccc acaccatggg ggactcgcag ctgtacaggg cttatgtagg 1501 agtgtaatct cttggatgca gcttcagaaa taagttttcc cccttttaaa caatttataa 1561 aaatcataca aggcattaaa gaatttttaa aaagttt // LOCUS MUSHSP70B 3518 bp ds-DNA ROD 14-JUL-1990 DEFINITION Mouse heat shock protein 70.1 (hsp70.1) gene, complete cds. ACCESSION M35021 KEYWORDS heat shock protein 70.1. SOURCE Mouse (strain AJ) kidney DNA, clone pM[1.2,2.3]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3518) AUTHORS Hunt,C. and Calderwood,S. TITLE Characterization and sequence of a mouse hsp70 gene and its expression in mouse cell lines JOURNAL Gene 87, 199-204 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 806 2734 heat shock protein 70.1 (hsp70.1) signal 507 512 inverted CCAAT box signal 547 552 TATA box BASE COUNT 827 a 993 c 1014 g 684 t ORIGIN 1 gatctcttct atttccctat tcaaacctaa aatgaagagg gagggggaga catggacaag 61 caagcattcc acaggcgccc ctgcccaacg ctgtcactca aaccaggacc caatcacaga 121 ctttttagcc aagccttatc ccgcctctct tgagaaactt tctgcgtccg ccatcctgta 181 ggaaggattt gtacacttta aactccctcc ctggtctgag tcccacactc tcaccaccca 241 gcaccttcag gagctgaccc ttaacagctt cacccacagg gaccccgaag ttgcgtcgcc 301 tccgcaacag tgtcaatagc agcaccagca cttccccaca ccctccccct caggaatccg 361 tactctctag cgaaccccag aaacctctgg agagttctgg acaagggcgg aacccacaac 421 tccgattact caagggaggc ggggaagctc caccagacgc gaaactgctg gaagattcct 481 ggccccaagg cctcctccgg ctcgctgatt ggcccagcgg agagtgggcg gggccggtga 541 agactcctta aaggcgcagg gcggcgagca gggcaccaga cgctgacagc tactcagaat 601 caaatctggt tccatccaga gacaagcgaa gacaagagaa gcagagcgag cggcgcgttc 661 ccgatcctcg gccaggacca gccttcccca gagcatccac gccgcggagc gcaaccttcc 721 caggagcatc cctgccgcgg agcgcaactt tccccggagc atccacgccg cggagcgcag 781 ccttccagaa gcagagcgcg gcgccatggc caagaacacg gcgatcggca tcgacctggg 841 caccacctac tcgtgcgtgg gcgtgttcca gcacggcaag gtggagatca tcgccaacga 901 ccagggcaac cgcacgaccc ccagctacgt ggccttcacc gacaccgagc gcctcatcgg 961 ggacgccgcc aagaaccagg tggcgctgaa cccgcagaac accgtgttcg acgcgaagcg 1021 gctgatcggc cgcaagttcg gcgatgcggt ggtgcagtcc gacatgaagc actggccctt 1081 ccaggtggtg aacgacggcg acaagcccaa ggtgcaggtg aactacaagg gcgagagccg 1141 gtcgttcttc ccggaggaga tctcgtccat ggtgctgacg aagatgaagg agatcgctga 1201 ggcgtacctg ggccacccgg tgaccaacgc ggtgatcacg gtgcccgcct acttcaacga 1261 ctctcagcgg caggccacca aggacgcggg cgtgatcgcc ggtctaaacg tgctgcggat 1321 catcaacgag cccacggcgg ccgccatcgc ctacgggctg gaccggaccg gcaagggcga 1381 gcgcaacgtg ctcatcttcg acctgggggg cggcacgttc gacgtgtcca tcctgacgat 1441 cgacgacggc atcttcgagg tgaaggccac ggcgggcgac acgcacctgg gaggggagga 1501 cttcgacaac cggctggtga gccacttcgt ggaggagttc aagaggaagc acaagaagga 1561 catcagccag aacaagcgcg cggtgcggcg gctgcgcacg gcgtgtgaga gggccaagag 1621 gacgctgtcg tccagcaccc aggccagcct ggagatcgac tctctgttcg agggcatcga 1681 cttctacaca tccatcacgc gggcgcggtt cgaagagctg tgctcggacc tgttccgcgg 1741 cacgctggag cccgtggaga aggccctgcg cgacgccaag atggacaagg cgcagatcca 1801 cgacctggtg ctggtgggcg gctcgacgcg catccccaag gtgcagaagc tgctgcagga 1861 cttcttcaac gggcgcgacc tgaacaagag catcaacccg gacgaggcgg tggcctacgg 1921 ggcggcggtg caggcggcca tcctgatggg ggacaagtcg gagaacgtgc aggacctgct 1981 gctgctggac gtggcgccgc tgtcgctggg cctggagact gcgggcggcg tgatgacggc 2041 gctcatcaag cgcaactcca ccatccccac caagcagacg cagaccttca ccacctactc 2101 ggacaaccag cccggggtgc tgatccaggt gtacgagggc gagagggcca tgacgcgcga 2161 caacaacctg ctggggcgct tcgagctgag cggcatcccg ccggcgccca ggggcgtgcc 2221 gcagatcgag gtgaccttcg acatcgacgc caacggcatc ctgaacgtca cggccaccga 2281 caagagcacc ggcaaggcca acaagatcac catcaccaac gacaagggcc gcctgagcaa 2341 ggaggagatc gagcgcatgg tgcaggaggc cgagcgctac aaggccgagg acgaggtgca 2401 gcgcgacagg gtggccgcca agaacgcgct cgagtcctat gccttcaaca tgaagagcgc 2461 cgtggaggac gagggtctca agggcaagct cagcgaggct gacaagaaga aggtgctgga 2521 caagtgccag gaggtcatct cctggctgga ctccaacacg ctggccgaca aggaggagtt 2581 cgtgcacaag cgggaggagc tggagcgggt gtgcagcccc atcatcagtg ggctgtacca 2641 gggtgcgggt gctcctgggg ctgggggctt cggggcccag gcgccgccga aaggagcctc 2701 tggctcagga cccaccatcg aggaggtgga ttagaggcct ctgctggctc tcccggtgtg 2761 gtctagaaaa cagactcttt gcacttgata gctgcttggg caccgattac tgtcaaggtt 2821 atttaaagtc ttcttcatgg ttcagtttaa agttacagtc tttcttaagg taattgcgtt 2881 gactgttaaa ttttgtatgc atatatatat atatatatat atatatatat atattcaaat 2941 atattcaaag taatgttggg agcagcactg tgcactgtac caggggatta tgttttatag 3001 ctaatgatgt gtaaagtcta aagatttttt tgtaattttt atatcagtgt tccagtagcc 3061 tgggaagaca tatagtctag ctgcccagtt ccctggagat ggtcatctct aagacaaagt 3121 gtcttaaaca aacgtcttgg cactgtgtac tacataactt tactcttttg tacttaaaac 3181 tttatctgct tgtccatgtt aaggttttgt ggtataacca gtatgttctt tgcatttaat 3241 ctaagtaggt taaagatggt gtatccttcc tgcatacatg tctacactgc caccctgtgt 3301 acattttttt ctttgcatca ctacaaacta atgaaaaaaa cttttatgac ttaaatattc 3361 aaaataaaag gttacaagta tattttgtct gtttgtatgt tggaagggct aatggattct 3421 gggcttctgt ggatttctta agtttttttt aagatttatt attatatgtg aacacattgt 3481 agctatcttc agacacacca gaaaagggca tcagatct // LOCUS RDVS10 1319 bp ds-RNA VRL 14-JUL-1990 DEFINITION Rice dwarf virus segment S10 mRNA, encoding an ORF, complete cds. ACCESSION D00241 KEYWORDS S10 RNA. SOURCE Rice dwarf virus, cDNA to viral RNA. ORGANISM Rice dwarf virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus. REFERENCE 1 (bases 1 to 1319) AUTHORS Omura,T., Minobe,Y. and Tsuchizaki,T. TITLE Nucleotide sequence of segment S10 of the rice dwarf virus genome JOURNAL J. Gen. Virol. 69, 227-231 (1988) STANDARD simple automatic COMMENT Three contiguous termination codons were detected at the third triplet downstream from the first stop codon. Corresponds to DDBJ (rel3.0) entry <RDVS1O>(D00241) dated 20-JUN-1988 FEATURES from to/span description pept 27 1088 ORF BASE COUNT 354 a 279 c 284 g 402 t ORIGIN 5' end of the plus-sense strand of segment S10. 1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc 61 atcatgagct cctttgtgct cacgaaggac caagtattat ttccaagttt gatgcaatta 121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag 181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact 241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg 301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt 361 ttatctttat ccctgagact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc 421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt 481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc 541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag 601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat 661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac 721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt 781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc 841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc 901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca 961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta 1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca 1081 gttcctgatc gcactgatga tagtgctttt catttatcaa aatattacgg tggggtcatc 1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac 1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccataggtg ggttagagtc 1261 gtgtgcgcac gcacttagtt ctactaatgc gtgcccccac actgcgcagg gattctgat // LOCUS RDVS10A 1320 bp ds-RNA VRL 14-JUL-1990 DEFINITION Rice dwarf virus segment 10 mRNA, encoding an ORF, complete cds. ACCESSION M35118 KEYWORDS . SOURCE Rice dwarf virus, cDNA to viral RNA, clones pRD[545,112,367]. ORGANISM Rice dwarf virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus. REFERENCE 1 (bases 1 to 1320) AUTHORS Uyeda,I., Matsumura,T., Sano,T., Ohshima,K. and Shikata,E. TITLE Nucleotide sequence of rice dwarf virus genome segment 10 JOURNAL Proc. Jpn. Acad. 63, 227-230 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 27 1088 ORF BASE COUNT 357 a 280 c 282 g 401 t ORIGIN 1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc 61 atcatgagct cctttctgct cacgaaggac caagtattat ttccaagttt gatgcaatta 121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag 181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact 241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg 301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt 361 ttatctttat ccctgaaact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc 421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt 481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc 541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag 601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat 661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac 721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt 781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc 841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc 901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca 961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta 1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca 1081 gttcctaatc gcactgatga tagtgcttat catttatcaa aatattacgg tggggtcatc 1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac 1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccatagggt gggttagagt 1261 cgtgtgcgca cgccacttag ttctactaat gcgtgccccc acatgcgcag ggattctgat // LOCUS SRVAGSM 3651 bp ss-RNA VRL 14-JUL-1990 DEFINITION Sapporo rat virus mRNA M segment, encoding an ORF, complete cds. ACCESSION M34882 KEYWORDS . SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero E6 cell line C1008. ORGANISM Sapporo rat virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Hantavirus. REFERENCE 1 (bases 1 to 3651) AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and Schmaljohn,C.S. TITLE Coding properties of the S and the M genome segments of Sapporo rat virus: Comparison to other causative agents of hemorrhagic fever with renal syndrome JOURNAL Virology 176, 114-125 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 47 3448 M segment ORF BASE COUNT 1119 a 692 c 755 g 1085 t ORIGIN 1 tagtagtaga ctccgcaaga aacagcagtt aaagaacaat aggatcatgt ggagtttgct 61 attactggcc gctttagttg gccaaggctt tgcattaaaa aatgtatttg acatgagaat 121 tcagttgccc cactcagtca actttgggga aacaagtgtg tcaggctata cagaatttcc 181 cccactctca ttacaggagg cagaacagct agtgccagag agctcatgca acatggacaa 241 ccaccagtca ctctcaacaa taaataaatt aaccaaggtc atatggcgga aaaaagcaaa 301 tcaggaatca gcaaaccaga attcatttga agttgtggaa agtgaagtca gctttaaagg 361 gttgtgtatg ttaaagcata gaatggttga agaatcatat agaaatagga gatcagtaat 421 ctgttatgat ctagcctgta atagtacatt ctgtaaacca actgtttata tgattgttcc 481 tatacatgct tgcaacatga tgaaaagctg tttgattggc cttggcccct acagaatcca 541 ggttgtctat gaaaggacat actgcactac gggtatattg acagaaggaa aatgctttgt 601 ccctgacaag gctgttgtca gtgcattgaa aagaggcatg tatgctatag caagcataga 661 gacaatctgc ttttttattc atcagaaagg gaatacatat aagatagtga ctgccattac 721 atcagcaatg ggctccaaat gtaataatac agatactaaa gttcaaggat attatatctg 781 tattattggt ggaaactccg cccctgtata tgcccctgct ggtgaagact tcagagcaat 841 ggaggttttt tctgggatta ttacatcacc acatggagaa gaccatgacc tacccggcga 901 agaaatcgca acgtaccaga tttcagggca gatagaggca aaaatccctc atacagtgag 961 ctccaaaaac ttaaaattga ctgcttttgc aggtattcca tcatactcat caactagtat 1021 attggctgct tcagaagatg gtcgtttcat atttagtcct ggtttatttc ctaacctaaa 1081 tcagtcagtc tgtgacaaca atgcactccc tttaatctgg aggggcctaa ttgatttaac 1141 gggatactat gaggcagtcc acccttgcaa tgtgttctgt gtcttatcag gaccaggtgc 1201 ttcatgtgag gccttttcag aaggaggtat tttcaatatt acttctccaa tgtgtctggt 1261 gtctaagcaa aatagattta gagcagctga gcagcagatt agctttgtct gccaaagagt 1321 tgatatggat attatagtgt actgtaatgg tcagaaaaaa acaatcctaa caaaaacatt 1381 agttataggc caatgtattt atactattac aagtctcttt tcactgttac caggggttgc 1441 ccattctatt gctattgagt tgtgtgttcc agggtttcat ggctgggcca cagctgcact 1501 tttgattaca ttctgcttcg gctgggtatt gattcctgca tgtacattag ctattctttt 1561 agtccttaag ttctttgcaa atatccttca tacaagcaat caagagaacc gattcaaagc 1621 cattctacgg aaaataaagg aggagtttga aaaaacaaag ggttccatgg tttgtgagat 1681 ctgtaagtat gagtgtgaaa cattaaagga attgaaggca cataacctat catgtgttca 1741 aggagagtgc ccatattgct ttacccactg tgaaccgaca gaaactgcaa ttcaggcaca 1801 ttacaaagtt tgtcaagcca cccaccgatt cagagaagat ttaaaaaaga ctgtaactcc 1861 tcaaaatatt gggccaggct gttaccgaac actaaatctt tttaggtata aaagtaggtg 1921 ttatattctg acaatgtgga ctcttcttct cattattgaa tccatcctct gggcagcaag 1981 tgcagcagaa atcccccttg tccctctctg gacagataat gctcatggcg ttgggagtgt 2041 tcctatgcat acggatcttg aattagactt ctctttgcca tccagttcta agtacacata 2101 caaaagacat ctcacaaacc cagttaatga ccaacagagt gtctcattgc atatagaaat 2161 tgaaagtcaa ggcattggtg ctgctgttca tcatcttgga cattggtatg atgcaagatt 2221 gaatctaaaa acctcatttc attgttatgg tgcctgcaca aaatatcaat acccatggca 2281 cactgcaaaa tgccattttg agaaagatta tgagtatgaa aatagctggg cttgcaaccc 2341 cccagattgc ccaggggttg gtacaggttg tactgcttgt ggattatatc tagatcaatt 2401 gaagccggta ggaacagcct ttaaaattat aagtgtaaga tacagtagaa aagtgtgcgt 2461 gcagtttggt gaagaacacc tttgtaaaac aattgatatg aatgattgct ttgtgactag 2521 gcatgccaaa atatgtataa ttgggactgt atctaagttt tctcaaggtg acactctact 2581 atttctgggg cccatggaag gaggtggtat aatctttaaa cactggtgta catctacctg 2641 tcactttgga gaccctggtg atgtcatggg tccaaaagat aaaccattta tttgccctga 2701 attcccaggg caatttagga aaaaatgtaa ctttgccaca actccagttt gtgaatatga 2761 tggaaacatt atatcaggct ataagaaagt acttgcaaca attgattctt tccaatcatt 2821 taacacaagc aatatacact tcactgatga gagaattgaa tggagagacc ctgatggcat 2881 gcttcgggat catattaata ttgttatttc taaagatatt gattttgaaa atttggctga 2941 gaatccttgt aaagtagggc tccaggcagc aaacatagaa ggtgcctggg gttcaggtgt 3001 cgggtttaca ctcacatgca aggtgtctct cacagaatgc ccaacatttc ttacatcaat 3061 aaaggcctgt gacatggcaa tttgttatgg tgcagaaagt gtgacactct cacgaggaca 3121 aaatactgtc aaaattaccg ggaaaggtgg ccatagtggt tcttcattca aatgctgtca 3181 tgggaaagaa tgttcatcaa ctggcctcca agccagtgca ccacatctgg ataaggtaaa 3241 tggtatctct gagttagaaa acgagaaagt ttatgatgac ggtgcacctg aatgtggcat 3301 tacttgttgg tttaaaaaat caggtgaatg ggttatgggt ataatcaatg ggaactgggt 3361 tgtcctaatt gtcttgtgtg tactgctgct cttttctctt atcctgttga gcatcttgtg 3421 tcctgttaga aagcataaaa aatcataaat cccacctaac aatcttcaca tcatgtatcg 3481 attttcaaac actttatcat ttagaactta acttggcact actatctgat aactgacttt 3541 catttttatt tttatatgga ttaattacta aaaaaaatac tctcttctat ctcccaatct 3601 tttattgatt caccggggtg ctgtcttgac atctggcggc gtctactact a // LOCUS SRVAGSS 1769 bp ss-RNA VRL 14-JUL-1990 DEFINITION Sapporo rat virus mRNA S segment, encoding an ORF, complete cds. ACCESSION M34881 KEYWORDS . SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero E6 cell line C1008. ORGANISM Sapporo rat virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Hantavirus. REFERENCE 1 (bases 1 to 1769) AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and Schmaljohn,C.S. TITLE Coding properties of the S and the M genome segments of Sapporo rat virus: Comparison to other causative agents of hemorrhagic fever with renal syndrome JOURNAL Virology 176, 114-125 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 43 1332 S segment ORF BASE COUNT 569 a 331 c 410 g 459 t ORIGIN 1 tagtagtaga ctccctaaag agctactaca ctaacaagaa aaatggcaac tatggaagaa 61 atccagagag aaatcagtgc tcacgagggg cagcttgtga tagcacgcca gaaggtcaag 121 gatgcagaaa agcagtatga gaaggatcct gatgacttaa acaagagggc actgcatgat 181 cgggagagtg tcgcagcttc aatacaatca aaaattgatg aattgaagcg ccaacttgcc 241 gacagattgc agcagggaag aacatccggg caggaccggg atcctacagg ggtagagcca 301 ggtgatcatc ttaaggaaag atcagcacta agctacggga atacactgga cctgaatagt 361 cttgacattg atgaacctac aggacagaca gctgattggc tgaccataat tgtctatctg 421 acatcattcg tggtcccgat catcttgaag gcactgtaca tgttaacaac acgaggtagg 481 cagacttcaa aggacaacaa ggggatgagg atcagattca aggatgacag ctcatatgag 541 gatgtcaatg gaatcagaaa gcccaaacat ctgtatgtgt caatgccaaa cgcccaatcc 601 agcatgaagg ctgaagagat aacaccagga agattccgca ctgcagtatg tggactatat 661 cctgcacaga taaaggcaag gaatatggta agccctgtca tgagtgtagt tgggttcttg 721 gcactggcaa aagactggac atcgagaatt gaagaatggc tcggtgcacc ctgcaaattc 781 atggcggagt ctcttattgc cgggagttta tctgggaatc ctgtgaatcg tgactatatc 841 agacagagac aaggtgcact tgcagggatg gagccaaagg aatttcaagc cctcaggcaa 901 cattcaaagg atgctggatg tacactagtt gaacatattg agtcaccatc atcaatatgg 961 gtgtttgctg gggcccctga taggtgtcca ccaacatgct tgtttgtcgg agggatggct 1021 gaattaggtg ccttcttttc tatacttcag gatatgagga acacaatcat ggcttcaaaa 1081 actgtgggca cagctgatga aaagcttcga aagaaatcat cattctatca atcatacctc 1141 agacgcacac aatcaatggg aatacaactg gaccagagga taattgttat gtttatggtt 1201 gcctggggaa aggaggcagt ggacaacttt catctcggtg atgacatgga tccagagctt 1261 cgtagcctgg ctcagatctt gattgaccag aaagtgaagg aaatctcaaa ccaggaacct 1321 atgaaattat aagtacataa atatataatc aatactaact ataggttaag aaatactaat 1381 cattagttaa taagaatata gatttattga ataatcatat taaataatta ggtaagttaa 1441 ctagtattta gttaagttag ctaattgatt tatatgattg tcacaattaa atgtaatcat 1501 aagcacaatc actgccatgt ataatcacgg gtatacgggt ggttttcata tggggaacag 1561 ggtgggctta gggccaggtc accttaagtg accttttttt gtatatatgg atgtagattt 1621 caattgatcg aatactaatc ctactgtcct cttttctttt cctttctcct tctttactaa 1681 caacaacaaa ctacctcaca ccttaatata tactacttta ttaagttgtt aagttgtgtc 1741 tttttgggga gtaagggagt ctactacta // LOCUS TRNSTIIA 1315 bp ds-DNA BCT 14-JUL-1990 DEFINITION Transposon Tn4521 (from E.coli) heat-stable toxin II gene, complete cds. ACCESSION M35123 KEYWORDS heat-stable toxin II; insertion sequence; transposase. SOURCE Transposon Tn4521 (from Escherichia coli) DNA. ORGANISM Transposon Tn4521 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1315) AUTHORS Hu,S.T. and Lee,C.H. TITLE Characterization of the transposon carrying the STII gene of enterotoxigenic Escherichia coli JOURNAL Mol. Gen. Genet. 214, 490-495 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 879 400 (c) heat-stable toxin II (STII) trns 88 607 insertion sequence IS2 (bases 17-536) trns 608 970 insertion sequence IS2 (bases 969-1327) BASE COUNT 313 a 349 c 320 g 333 t ORIGIN 1 gccgccggca gatggagctg gatattctaa aaaagacccg ggcatcagta tcagtcacct 61 gaacaacaga gaaaaaacaa agcttgcttt ccagacacct gttatcactt aacccattac 121 tggcccgctg ccgtagatat tcccgtggcg agcgataacc cagtgcacta tgcggatgcc 181 attcgttata atgctcgaac gcctctgcaa ggttctttgc tgccgttaac ccgtctggtt 241 tgggcatgat actgatgtaa tcacgcttta tcgttttcac gaagctctct gctatgccgt 301 tactctccgg actccgcacc gccgtgttct tcggttcaag tcccaacatc cgggcgaact 361 ggcgtgtttc attagcccgg tagcatgaac cattatccgt cagccattcc actggcgata 421 tcggaagctc gctgccgaag gggcgtacca ccgctcccag catgacgtcc tgtactgttt 481 cactgtcgaa gccgcccgta gtaaccgccc agtgcagtgc ctcacggtca cagcagtcca 541 gcgcgaacgt gactcgcagt ttttctccgt tatcacagcg gaactcgaac ccgtcagagc 601 accatcgttt tttttgccga gcaggcgctg gagttcttta atctgcttca tggcagcagc 661 aagttcagag gcaggaacga cctgttctcc tgcggccaca gcagtaagac ttcccttctg 721 gtattgcttg cgccagagaa ataactggct ggctgccgca ccgtgttgcc gggtaacaag 781 ggagcccgtc attcccggtt caaagctctg ctgaacaata gcgatctttt cctgtgtggt 841 acgccgtctg cgtttctccg gccctaagac atcactcatc tgctctccaa tgactagtct 901 aaaaactagt attaagacta tcacttaaat aagtgatatt ggttgtctgg agattcaggg 961 ggccagtcta cggcctatgt cggtctggtg tcaaaacagt caggctccgg agataaagtt 1021 cgactactgg gaataagcaa gtgtggtgat acttatctcc gaacattatt tattcatgga 1081 gcaagggcgg cagcattgct ggtcaaagaa ctagggccat ggatacacaa cttaagatac 1141 aacgactgtc tggtgtttca gggggccagc tcaaacattt ggcaaagata tacgtcttct 1201 ttacaatccg gaggtgttaa cggctctact tgtatcagct acgttctggt tatacgtaat 1261 agcataagaa tggtcgtgcg catacacgct ctggaaagca gaactagctg gacga // LOCUS AVOCYP 1500 bp ss-mRNA PLN 14-JUL-1990 DEFINITION Avocado cytochrome P-450LXXIA1 (cyp71A1) mRNA, complete cds. ACCESSION M32885 KEYWORDS cytochrome P-450; cytochrome P-450LXXIA1. SOURCE Avocado (strain Hass) ripe fruit mesocarp, cDNA to mRNA, clone pAVOd8 and pAVOc8, and DNA (bases 1-16). ORGANISM Persea americana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Magnoliidae; Laurales; Lauraceae. REFERENCE 1 (bases 1 to 1500) AUTHORS Bozak,K.R., Yu,H., Sirevag,R. and Christoffersen,R.E. TITLE Sequence analysis of ripening-related cytochrome P-450 cDNAs from avocado fruit JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3904-3908 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.E.Christoffersen, 16-MAR-1990, for release after publication. FEATURES from to/span description pept 21 1436 cytochrome P-450LXXIA1 (cyp71A1) BASE COUNT 412 a 343 c 354 g 391 t ORIGIN 1 tcactctaga gtaataatcc atggctatct tagtgtctct gctcttctta gccattgctc 61 tcaccttctt cctcctaaaa ctcaacgaaa aacgagagaa gaaaccgaac ctaccccctt 121 ctcctcccaa ccttcccatc atcggaaacc tccatcagct cggtaatctc ccacaccgct 181 ctcttcgctc ccttgcaaac gaactcggac ccttgatact cctccatctg ggtcacatcc 241 ccactctcat agtctccacc gctgagattg ccgaagagat cttgaaaacc catgatctca 301 tctttgctag ccgaccatcc acaactgcag ctcgacgaat cttctatgac tgcaccgacg 361 tggcgttctc tccctacggc gaatattgga ggcaagtaag gaagatctgt gtactcgagc 421 tccttagcat aaagagagtc aactcctacc gttcgatcag ggaagaagag gtgggcctca 481 tgatggagag gatctctcaa tcttgctcga caggtgaagc tgttaatcta tcagagctgt 541 tgctattgct atcaagcggc acaataacaa gggttgcttt tgggaagaag tacgaaggag 601 aagaagaaag gaagaacaag tttgcggatc ttgcaactga gttgacaact cttatgggag 661 ctttcttcgt gggagactac tttccttcgt ttgcatgggt tgatgttcta acagggatgg 721 atgcgaggtt gaagagaaat catggtgaat tggatgcttt tgtggatcac gtaattgatg 781 accatctcct tagtagaaaa gcgaacggct cagatggagt ggagcagaaa gatttagtgg 841 atgtgctgct acatctgcag aaggattcct cactcggcgt ccatctcaac agaaataacc 901 ttaaagccgt catcttggac atgttctctg gtggaactga tacgacggct gtgaccttag 961 aatgggctat ggcagagctt ataaagcatc ccgatgtgat ggagaaagcc caacaagagg 1021 taagaagagt tgtggggaaa aaagcaaagg tggaagaaga agatcttcat cagttgcact 1081 acttaaaact aatcatcaaa gagactctgc gcttgcatcc tgttgctcca ttattagttc 1141 cacgcgagtc cacaagggat gtagttataa ggggctatca tattcctgca aagacaagag 1201 tctttattaa tgcatgggca ataggaagag accccaagtc atgggagaat gctgaggaat 1261 ttctcccaga gagatttgtc aataattctg ttgatttcaa ggggcaagat tttcaactta 1321 ttccttttgg agcagggagg aggggctgtc cggggattgc atttggcatc agttcagttg 1381 agatttctct tgccaatctc ttgtactggt tcaactggga attacctggg atctaaccaa 1441 agaagatctg gacacgctcg aagccgttgg gataaccgtt cacatgaagt ttcctctgca // LOCUS ECODGTP 2760 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli dGTP triphosphohydrolase (dgt) gene, complete cds, and periplasmic protease (htrA) gene, 5'end. ACCESSION M31772 KEYWORDS dGTP triphosphohydrolase; dGTPase; deoxyguanosine triphosphate triphosphohydrolase; dgt gene; htrA gene; periplasmic protease. SOURCE E.coli (strain K12, isolate HR42) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2760) AUTHORS Wurgler,S.M. and Richardson,C.C. TITLE Structure and regulation of the gene for dGTP triphosphohydrolase from Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2740-2744 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.M. Wurgler, 01-FEB-1990, for release after publication. FEATURES from to/span description pept 1066 2583 dGTP triphosphohydrolase (dgt) (EC 3.1.5.1) pept 2713 > 2760 periplasmic protease (hrtA) pept 958 323 (c) ORF signal 991 996 -35 region signal 1014 1019 -10 region site 1025 1046 regulatory signal binding 1054 1065 ribosome binding site signal 2590 2630 pot. transcription termination signal BASE COUNT 711 a 665 c 706 g 678 t ORIGIN 3.6 minutes on K12 map. 1 cagctggtca acctgccgct cggcattacc tccacgccag gcaatcacca gatcgggttt 61 cagcgcgaca atgcgttcca gattcatccc ctgccaggcg gaaacctgct caatcttttg 121 cgcttgtgga ggatagtcgg aatagctgct gaccccaacc ggcgtgatcc cggcggcaaa 181 ggcaagttca gtgttggcgg gagaaagcgt gatgacgcgc ggcgcggcgt tgagccacag 241 tggcgcaaga aaagacatgg ggcgaccagc gccctgaaca gtgacttagc catgtgcaag 301 tttctgcacc agtgactaac catcaggctg gactgtttag cggcaacagc aggaactcat 361 cgaagctaag atgagactgt tgatcggcca cgtcggagat ggcgcgtacg acaacaaacg 421 ggacgttgaa attgtggcag acatgggcga ttgccgtcgc ttccatctct acagcaatgg 481 cctgtgggaa gttgtggcgg attttcgcca gaccaacaga accgttgatg aaagcgtcgc 541 cgctaacaat caggccacgt acagcgttaa gattcagttc ggcaatgcag gcctcagcgg 601 cagcgatcag tttatcgtca gctttaaagc ctgccggaca gcctggtaac tgaccgtatt 661 cataaccaaa tgccgtgaca tccgcgtcgt gataacgtgc ttcgtccgag acaacgatat 721 cgcccacttt caacgttggt gccaggccac cggcagaacc ggtgttaata atcacatctg 781 gcttgcagtg ttccaacagc aaagtggcac ccagcgccgc agcgacttta ccgatgcccg 841 atttcagaag cgcaacctcg gttccattca gttggccggt atagatttcg caaccgccga 901 gactgatagt ttgacggttt tcgattttgt cacgcagcag cgtaacttct tcttccattg 961 caccaatgat gccgattttc atagatttac tcgcgataag cccgatttga aggcatagtt 1021 taccatgcgc ttacggggaa gcgtatttct cacgcgggag aggacatggc acagattgat 1081 ttccgaaaaa aaataaactg gcatcgtcgt taccgttcac cgcagggcgt taaaaccgaa 1141 catgagatcc tgcggatctt cgagagcgat cgcgggcgta tcatcaactc tccggcaatt 1201 cgtcgtctgc aacaaaagac ccaggttttt ccactggagc gcaatgccgc cgtgcgcacg 1261 cgtcttaccc actcgatgga agtccagcag gtggggcgct acatcgccaa agaaatttta 1321 agccgtctga aagagcttaa attactggaa gcatacggcc tggatgaact gaccggtccc 1381 tttgaaagca ttgttgagat gtcatgcctg atgcacgata tcggcaatcc gccgtttggt 1441 cattttggcg aagcggcgat aaatgactgg tttcgccaac gtttgcaccc ggaagatgcc 1501 gaaagccagc ctctgactga cgatcgctgc agcgtggcgg cactacgttt acgggacggg 1561 gaagaaccgc ttaacgagct gcggcgcaag attcgtcagg acttatgtca ttttgagggg 1621 aatgcacaag gcattcgcct ggtgcataca ttgatgcgga tgaatctcac ctgggcacag 1681 gttggcggta ttttaaaata tacccgtccg gcgtggtggc gtggcgaaac gcctgagaca 1741 catcactatt taatgaaaaa gccgggttat tatctttctg aagaagccta tattgcccgg 1801 ttgcgtaaag aacttaattt ggcgctttac agtcgttttc cattaacgtg gattatggaa 1861 gctgccgacg acatctccta ttgtgtggca gaccttgaag atgcggtaga gaaaagaata 1921 tttaccgttg agcagcttta tcatcatttg cacgaagcgt ggggccagca tgagaaaggt 1981 tcgctctttt cgctggtggt tgaaaatgcc tgggaaaaat cacgctcaaa tagtttaagc 2041 cgcagtacgg aagatcagtt ttttatgtat ttacgggtaa acaccctaaa taaactggta 2101 ccctacgcgg cacaacgatt tattgataat ctgcctgcga ttttcgccgg aacgtttaat 2161 catgcattat tggaagatgc cagcgaatgc agcgatcttc ttaagctata taaaaatgtc 2221 gctgtaaaac atgtgtttag ccatccagat gtcgagcggc ttgaattgca gggctatcgg 2281 gtcattagcg gattattaga gatttatcgt cctttattaa gcctgtcgtt atcagacttt 2341 actgaactgg tagaaaaaga acgggtgaaa cgtttcccta ttgaatcgcg cttattccac 2401 aaactctcga cgcgccatcg gctggcctat gtcgaggctg tcagtaaatt accgtcagat 2461 tctcctgagt ttccgctatg ggaatattat taccgttgcc gcctgctgca ggattatatc 2521 agcggtatga ccgacctcta tgcgtgggat gaataccgac gtctgatggc cgtagaacaa 2581 taaccaggct tttgtaaaga cgaacaataa atttttacct tttgcagaaa ctttagttcg 2641 gaacttcagg ctataaaacg aatctgaaga acacagcaat tttgcgttat ctgttaatcg 2701 agactgaaat acatgaaaaa aaccacatta gcactgagtc gactggctct gagtttaggt // LOCUS RATHMGCOA 1994 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat mitochondrial 3-hydroxy-3-methylglutaryl-CoA synthase mRNA, complete cds. ACCESSION M33648 KEYWORDS 3-hydroxy-3-methylglutaryl-CoA synthase. SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone lambda-cMS1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1994) AUTHORS Ayte,J., Gil-Gomez,G., Haro,D., Marrero,P.F. and Hegardt,F.G. TITLE Rat mitochondrial and cytosolic 3-hydroxy-3-methylglutaryl-CoA synthases are encoded by two different genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3874-3878 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.G.Hegardt, 06-APR-1990, for release after publication. FEATURES from to/span description pept 50 1576 3-hydroxy-3-methylglutaryl-CoA synthase precursor (EC 4.1.3.5) sigp 50 160 3-hydroxy-3-methylglutaryl-CoA synthase signal peptide matp 161 1573 3-hydroxy-3-methylglutaryl-CoA synthase mRNA 1 1994 3-hydroxy-3-methylglutaryl-CoA synthase mRNA BASE COUNT 493 a 563 c 505 g 433 t ORIGIN 1 atctctccca ggggctgtgg actgctggct ttctgttgat accttagaga tgcagcggct 61 tttggctcca gcaaggcggg tcctgcaagt gaagagagtc atgcaggaat cttcgctctc 121 acccgctcac ctgctccccg cagcccagca gaggttttct acaatccctc ctgctcccct 181 ggccaaaact gatacatggc caaaagatgt gggcatcctt gccctggagg tctactttcc 241 agcccaatat gtggaccaaa ctgacctgga gaagttcaac aatgtggaag cagggaagta 301 cacagtgggc ttgggccaga cccgtatggg cttctgttcg gtccaggagg acatcaactc 361 cttgtgcctc acagtggtgc agaggctgat ggaacgcaca aagctgccat gggatgccgt 421 aggccgcctg gaagtgggca cggaaaccat cattgacaag tccaaggctg tcaagacagt 481 gctcatggag ctcttccagg attcaggcaa cactgacatc gagggcatag ataccaccaa 541 cgcctgctat ggtggcactg cctccctctt caacgctgcc aactggatgg agtccagcta 601 ctgggatggt cgctatgccc tggtggtctg tggtgatatc gcagtctacc caagtggtaa 661 cccccgcccc acaggtggtg ccggggctgt ggcaatgctg attgggccca aggccccgct 721 agtcctggaa caagggctga ggggaaccca catggagaac gcctatgact tctacaaacc 781 aaacttggcc tcagagtatc cactggtgga tgggaagctg tctatccagt gctacctgcg 841 ggccttggac cgatgctatg cagcttaccg caggaaaatc cagaatcagt ggaagcaagc 901 tggaaacaac cagcctttca ccctcgatga cgtgcaatat atgatcttcc acacaccctt 961 ttgcaagatg gtccagaaat ccctagctcg gctgatgttc aatgacttcc tgtcatctag 1021 cagtgacaag cagaacaact tatacaaggg tctagaggcc ttcaagggtc taaagctgga 1081 agaaacctac accaacaagg atgttgacaa ggctctgctg aaggcctccc tggacatgtt 1141 caacaagaaa accaaggcct ccctttacct ctccacaaac aatgggaaca tgtacacctc 1201 gtccctctac gggtgcctgg cctcacttct ctcccaccac tctgcccaag aattggccgg 1261 ctccaggatt ggagccttct cctacggctc aggcttagca gcaagtttct tctcatttcg 1321 agtgtccaag gacgcttccc caggttcccc tctggagaag ctggtgtcta gtgtgtcaga 1381 tctgcccaaa cgtctagact cccggagacg catgtcccct gaggaattca cagaaataat 1441 gaatcagaga gagcaatttt accacaaggt gaacttctct ccccctggtg acacaagcaa 1501 cctcttccca ggcacttggt accttgaacg agtggatgag atgcaccgca gaaaatatgc 1561 ccggcgtccc gtctaaggag accaatccat acaaccattc cccggggaaa gaatgtgagc 1621 agagccgtta cccaaacggc ttccacttaa aattccaccc acagcagtga acggtgaata 1681 gacacagcga ccccatagga tctgctccgc ggtgaagggc ctccctctgt ggatcctggg 1741 tgaccctccc tgaagcagtg agcaccacag gttctgctgt ggaccagagc ccccctgtgg 1801 agagggagaa agaaagggga gccgctgacc tgcagggata cagaccttcc ccacagcctg 1861 gcagccgccc gtttgttgca gcttattatc agactgtggg ctatcatagt tcatgctcgt 1921 ttcttaaagt ttcccgagaa tttctaaaat tttgtatcta aacttttaat atggcgatta 1981 aaaggagaga agga // LOCUS ECOSLTOXI 1761 bp ds-DNA BCT 14-JUL-1990 DEFINITION E.coli Shiga-like toxin II (SLTIIva) gene, complete cds. ACCESSION M29153 KEYWORDS Shiga-like toxin II. SOURCE E.coli (serotype O128:B12, isolate H.I.8) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1761) AUTHORS Gannon,V.P.J., Teerling,C., Masri,S.A. and Gyles,C.L. TITLE Molecular cloning and nucleotide sequence of another variant of the Escherichia coli Shiga-like toxin II family JOURNAL J. Gen. Microbiol. 136, 1125-1135 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.P.J.Gannon, 20-SEP-1989, for release after publication. FEATURES from to/span description pept 254 1213 Shiga-like toxin II (SLTIIva)-A subunit precursor sigp 254 319 Shiga-like toxin II-A subunit signal peptide matp 320 1210 Shiga-like toxin II-A subunit pept 1226 1489 Shiga-like toxin II-B subunit precursor sigp 1226 1282 Shiga-like toxin II-B subunit signal peptide matp 1283 1486 Shiga-like toxin II-B subunit signal 61 66 -35 region signal 82 87 -10 region binding 241 247 A subunit binding site binding 1215 1221 B subunit binding site signal 1659 1265 rho-independent transcription termination signal BASE COUNT 487 a 316 c 396 g 562 t ORIGIN 1 atatgtgaga gattctctat tctataaaaa tataaaccat atacccgtaa tcttaaataa 61 ttgatactat tttccggaag ctatagtgca tcattatctg ctttctgttc ctgataaaca 121 gggaagtaat acattgtgca gtcatttcct gtgaggggat ttgtttgtaa taaatttctg 181 ctaaaaacat tccggacttc ttgcgaggta ttattcctgg cgctgccggt tcagactgat 241 ggagtgttaa tatatgcgac atatattatt aaagctggtg ttgttttttt gtgtttgctt 301 gtcttcagca tcttatgcag atgagtttac tgtggatttc tcttcgcaaa agagctatgt 361 tgattcattg aatagtataa ggtcggcaat atccactcca cttggaaata tatctcaggg 421 tggtgtttct gtttcagtaa ttaatcatgt tctaggcgga aactatatat cattgaatgt 481 tagaggcctt gatccatata gcgagagatt taaccacctc cgtttaataa tggaacggaa 541 taacttatat gttgcaggct ttattaatac tgaaacgaat atcttttaca gattctccga 601 tttctcacat atttcagtgc ctgatgtgat aactgtttcc atgacgacgg acagcagtta 661 ttcatcatta cagcgaatcg cagatctgga acgtacaggg atgcagattg ggcgtcattc 721 actggttggt tcatatctgg atttaatgga gttcagagga cgttccatga cccgcgcatc 781 atccagagct atgctgcgtt ttgtcacagt gatagcagaa gctctgcgat tcagacaaat 841 acagcgggga ttccgaccgg cgctgtctga ggcatctccg ctttatacaa tgacggctca 901 ggatgttgac cttaccctga actggggaag aataagtaat gttcttccag agtacagagg 961 agaggaaggg gtaagaatcg gtaggatatc ttttaatagt ctttctgcga ttctcggaag 1021 tgttgcggtc atccttaatt gccactcaac cggaagttat tcagttcgtt ccgtgagcca 1081 aaaacagaaa acagaatgcc agattgttgg agacagggcg gccattaaag taaataatgt 1141 tttgtgggaa gcgaatacaa tcgctgcttt attaaatcgc aagcctcagg atcttactga 1201 accaaaccaa taacaggggg tgaatatgaa gaagatgatt attgcagttt tattcggtct 1261 cttttctgct aattccatgg cggcggattg tgctgtagga aaaattgagt tttccaagta 1321 taatgaggat gataccttta ctgtgaaggt gtcaggaaga gaatactgga cgaacagatg 1381 gaatttgcag ccattgttac aaagtgctca gctgacaggg atgactgtaa caatcatatc 1441 taatacctgc agttcaggct caggctttgc ccaggtgaag tttaactgag aatctacggt 1501 ttatttatgc gcgtcttttg tttctggacg cagatattat tagtgttgtg gatgctgatt 1561 aaatttggtc agtgttttcg ttaaagtcat ataaatacag gggcgtcacg cccctttttg 1621 gtctgtagtt gggtgaagat catcacttta ctttgctcaa ataactcagc acttgccggt 1681 ttacccttaa ggcaatacct acgttccata aaagaagcat caattaggat gctcgtatac 1741 ttgttatgtg ctttaaatgt t // LOCUS RATCHOL7H 3545 bp ss-mRNA ROD 14-JUL-1990 DEFINITION Rat cholesterol 7-alpha-hydroxylase mRNA, complete cds. ACCESSION J05460 KEYWORDS cholesterol 7-alpha-hydroxylase; cholesterol 7-alpha-monooxygenase. SOURCE Rat (strain Wistar King) male 8 week old liver, cDNA to mRNA, clone p7-alpha-11. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3545) AUTHORS Noshiro,N., Nishimoto,M. and Okuda,K. TITLE Rat liver cholesterol 7-alpha-hydroxylase: Pretranslational regulation for circadian rhythm JOURNAL J. Biol. Chem. 265, 10036-10041 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.Noshiro, 30-MAR-1990, for release after publication. FEATURES from to/span description pept 36 1547 cholesterol 7-alpha-hydroxylase (EC 1.14.13.17) BASE COUNT 1035 a 710 c 697 g 1103 t ORIGIN 1 ggtctcccct ttggaaattt tcctgctttt gcaaaatgat gactatttct ttgatttggg 61 gaattgccgt gttggtgagc tgttgcatat ggtttattgt tggaataagg agaaggaaag 121 ctggtgaacc tcctttggag aacgggttga ttccgtacct gggctgtgct ctgaaatttg 181 gatctaatcc tcttgagttc ctaagagcta atcaaaggaa gcatggtcac gtttttacct 241 gcaaactgat ggggaaatat gtccatttca tcacaaactc cctgtcatac cacaaagtct 301 tatgtcatgg aaaatatttt gactggaaaa aatttcatta cactacttct gcgaaggcat 361 ttggacacag aagcattgac ccaaatgatg gaaataccac ggaaaatata aacaacactt 421 ttaccaaaac cctccaggga gatgctctgt gttcactttc tgaagccatg atgcaaaacc 481 tccaatctgt catgagacct cctggccttc ctaaatcaaa gagcaatgcc tgggtcacgg 541 aagggatgta tgccttctgt taccgagtga tgtttgaagc tggctatcta acactgtttg 601 gcagagatat ttcaaagaca gacacacaaa aagcacttat tctaaacaac cttgacaact 661 tcaaacaatt tgaccaagtc tttccggcac tggtggcagg ccttcctatt cacttgttca 721 agaccgcaca taaagctcgg gaaaagctgg ctgagggatt gaagcacaag aacctgtgtg 781 tgagggacca ggtctctgaa ctgatccgtc tacgtatgtt tctcaatgac acgctctcca 841 cctttgacga catggagaag gccaagacgc acctcgctat tctctgggca tctcaagcaa 901 acaccattcc tgcaaccttt tggagcttat ttcaaatgat caggagtcct gaagcaatga 961 aagcagcctc tgaagaagtg agtggagctt tacagagtgc tggccaagag ctcagctctg 1021 gagggagtgc catttacttg gatcaagtgc aactgaatga cctgccggta ctagacagca 1081 tcatcaagga ggctctgagg ctttccagtg catccttgaa tatccgcaca gctaaggagg 1141 acttcactct ccatcttgag gacggttcct ataacatccg aaaagatgac atgatagctc 1201 tttatccaca gttaatgcac ttggatcctg aaatctaccc agaccctttg actttcaaat 1261 atgaccggta ccttgatgaa agcgggaaag caaagaccac cttctacagt aatggaaaca 1321 agctgaagtg tttctacatg cccttcggat caggcgcgac aatatgtcct ggaagactct 1381 ttgccgtcca agaaatcaag cagtttttga tcctgatgct ctcctgcttt gaactggagt 1441 ttgtggagag ccaagtcaag tgtccccctc tagaccagtc ccgggcaggc ttgggaattt 1501 tgccaccact acatgatatt gagtttaaat ataaactgaa acactgatac gtggttggaa 1561 gaagcgaaca ctggatgatg tcacttggcg gctgagagtc atcactaaac aggccttcgg 1621 gaccaatgct cactgatgcg ccctagcgac tggattagtg ggaagaactt tgttctcgct 1681 gcccacattc ctgggtgttc acatagctgg ggccagagct tcatcacttt cagaaagcaa 1741 tgtcttttgt atttattttc aaaatgaaga tattccaatt ggcaggatat ttttcctaag 1801 gaaattgctt tatattttta tgaaaactac caattaatta tgaaagggct tgaaattcac 1861 gttttagtga aattactgat ttttcactag taaggttctt caggtgtgaa actgtattat 1921 aaaaatgttg taatgggtca cactgtgctt tgcataaagg taaaggaaac tatgtttcag 1981 ccttttctgt gtctatgagc ttcgaaaata atcttactgt tctagaaaca ctggggaggt 2041 ttcgacatgc tctcgctata ttttatttta ctgttgctag aaattttcat tccagttttc 2101 aactacctta tctttccccc attttgacat gcatgccaat gagaagagta ttttttagga 2161 attaacaagg cacctcccag aaccctaccc tgagactttt aagcctttaa tcccagcact 2221 cgagaagtag agccaggcag atctctgagt ctgaggttat tctggtctac atcagctcca 2281 gacaagccag gactacagaa tgggatcttg tctaaaaaat acagctaatc tttatgtcat 2341 aactgattat gaatcaacct aaaagataaa ttttcaatca ggactcagag aaaatgagca 2401 attaaaaaac ttagctctga ggtatgtgga attcattaag tacaagttga cattacatgt 2461 tctttaaaaa tagtttatgt tttatctcta aatgccctgc agatgaagaa taataatgaa 2521 aagttgaata atactgttta aacactaagt gcaataatgc tttggtaatg tactttaaga 2581 gaatcattag ccgtgccagt tttactaaaa tatatttata tgtaaattat atttatcttt 2641 ttcttatacc ataaatataa aaatattgca acatttagta attttaaaat tatatacctt 2701 tcagaaaatg atgtatgaat gtttgtatgt tttttaactt tgaacagaac atttaaatta 2761 ttcatctacg gtgattttta tcttatttat ttctttttgt ctcattcata tcttgaagaa 2821 atccaaaaat atctgaagga atcgctcact caaatgtctc cctatggtta cagaaaaatt 2881 caataccatg tttttgtcct cggggactga agcagggtgt cgtgggtgcc gagcagaggc 2941 tcctgctgca gcgagcttta tccacgggac tccttaaact tttaaaatct tatcactatt 3001 atcatgcatt tattacctaa gtaggatatt tccctttcct ttttcatttc agccgagtcc 3061 cttagcaacc caggctgact gggaccctcc atgtagctta agctgtgaac tcactgtact 3121 tcctgttttc acttatttta ggaagtaatt ttccctatca gaaattttaa ttgtttagat 3181 gatgtataag agtaacacaa ttctgttata tactaatctg tagtaaacta aatttgttct 3241 tagaacaagt ttgatgactc tcaaattgaa tgtatccata catctttcca tggcttcttg 3301 aatgcccatt tctcatacac agaatgatgg gtttcacggt gatgtcttcc tttcatgtct 3361 ttattcttgt gcggtgatgg ttggcaaatg atacccatgg agcaaggtta ctcttcctat 3421 ttctgtgcag cctaagtgtt aagaataatt tttaaatact tggagggaag gcacattttg 3481 tgtcatatgt gaagtgacat gtgacacaca gactagcaaa tccttgagta aaattttatt 3541 gggat // LOCUS HUMCTLA1A 4751 bp ds-DNA PRI 14-JUL-1990 DEFINITION Human granzyme B (CTLA-1) gene, complete cds. ACCESSION M28879 KEYWORDS cytotoxic T-lymphocyte-associated serine esterase 1; granzyme B. SOURCE Human (patient B) blood DNA, clone lambda-B30. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4751) AUTHORS Haddad,P., Clement,M.-V., Bernard,O., Larsen,C.-J., Degos,L., Sasportes,M. and Mathieu-Mahul,D. TITLE Structural organization of the hCTLA-1 gene encoding human granzyme B JOURNAL Gene 87, 265-271 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Haddad, 11-MAY-1990, for release after publication. FEATURES from to/span description pept 1161 1215 granzyme B, exon 1 /hgml_locus_uid="LQ0170R" /nomgen="CTLA1" /map="14q11-q13" 2256 2403 granzyme B, exon 2 2857 2992 granzyme B, exon 3 3200 3460 granzyme B, exon 4 4105 4248 granzyme B, exon 5 IVS 1216 2255 CTLA-1 intron A IVS 2404 2856 CTLA-1 intron B IVS 2993 3199 CTLA-1 intron C IVS 3461 4104 CTLA-1 intron D signal 4338 4343 poly-A signal BASE COUNT 1211 a 1292 c 1119 g 1129 t ORIGIN 1 gaattctata ttttgagata taccattcct catagaaaaa tttcctcaca gaaaatataa 61 aggtggaaac aaatcacaag aatcgaacca tgtagagaga cttagttgtc ttttaacaga 121 attgggcacg ggctgttcag aaacaacaat ctttcacatc cattataatg atagcattag 181 tgtagtttgt ttagcaaatg tttactgcga gcctgttatg tgctgagcct gctatgtaag 241 aagtgtggct ctctggacag gagacagaat actaaacaac acaactactg atctttggct 301 gcctggcatg cttcctcact tcatatggta tcagcaattt agcaccacaa acgtccttta 361 gagaccagcc ctttctcatt cttggttcta gtggcttgag tagactgacc ccactaccca 421 agtggatttg actcctagca attcattaat ctagcccata aatgtcaagt acaggacttt 481 attgaagcat tcagaaagag gaatagggga tgttagaatc tctagaaagg aagctatgat 541 aataaatggg ttgctagatg ggtctagtag atggtggcca tgctttgtta ctgccttgtg 601 tattgtgcta ccatagccct ccccaaactg tactctggct cctggcattt ccgtctcttc 661 aaccagatgg tcagctctct aagtgaagga gacacatctc caacatgctt ggttctagca 721 caacagaagg gctcaaacac atacctgcta aagaaactat cctgatggat ttagcagcat 781 ggccatgagg cattggcggt tctatcactg ggaactcagg tttctggtgc tccagtacct 841 ctactggctg ataccacatc ctacagttca cttcataggc ttgggttcct gctctgggct 901 gaataggtgg tccactctga gtcatcagct gtggtgatga tgtggtcact gcatgattct 961 cacacaagca cccagaggac gtcatcaggc agaggcagtg ggggtgggca gcatttacag 1021 aaaatctgtg atgagacacc acaaaaccag aggggaacat gaagtcactg agcctgctcc 1081 acctctttcc tctcccaaga gctaaaagag agcaaggagg aaacaacagc agctccaacc 1141 agggcagcct tcctgagaag atgcaaccaa tcctgcttct gctggccttc ctcctgctgc 1201 ccagggcaga tgcaggtgag tgaccgtctt ccaacctcgg ggcccaaccc atcccacagg 1261 tctcctgccc tttctccaca ttcctgatcc atctatctac caggaatgtt ctgaactcca 1321 gctcccattc taccaagacc ccccaagtgt gatgctggat aagctatcag caggaatggc 1381 agagcagcag gccattctca agaagagcca gtgggtacta tcccttcccc agagcccacc 1441 tttgtcacct ggagagtagg actttcctag aagtaaatgg cagaggatgg gaaactagaa 1501 aagagaaata ttaaattatt ctagagtagg cctggcttct gtttctggga taagacaggt 1561 gcttctctca ctgtacttag gagagaaacc cagagctcag ctgacagcag aattggtaca 1621 atcactgtcc tcagaacact gttaatgtgt ttgctcagtc ccattctcca actctgcttt 1681 tcttccctgg cctttggtgg ctcccctctt tccaaggatg aggcactacg gcaggcccca 1741 gcttccctgc tttctagaat tccaccagca ctgctctacc agccctcatc cagaggctaa 1801 ctggagccag tccatcatgc agccatgaac atttactggg cacccactac atgtcaggct 1861 ctaggaaaca ggatatgaca gtatctagat ccctccactt acaccctggc cattagaaag 1921 cagcactatc ctagacacca caggactcat aagggtcttg gaaactcacc tgaaacaaag 1981 caaagtcagg agaggaatga tcaggagcct ctgggatttc actgtcccta agacaggtat 2041 gctcgccttc aactacatat ggaagaaaga tttacagacc aaagtctgct gttcttccct 2101 ttttcagagc aggaaattga agccccttcc tccaggccac tcccaactcc aggctatccc 2161 aggctcccaa atgcccagga gttctggagc cactaagcag gtgcccaccc agcagattcc 2221 atgggtgccc acaagcagac agacttttcc ttcaggggag atcatcgggg gacatgaggc 2281 caagccccac tcccgcccct acatggctta tcttatgatc tgggatcaga agtctctgaa 2341 gaggtgcggt ggcttcctga tacaagacga cttcgtgctg acagctgctc actgttgggg 2401 aaggtgagga gcagaaaaca gcccacaccc tcctggaaac actccacaga gacccctgcc 2461 ttcttcccaa ggagctccct gggctcctgt gaacacacat gccaggaggt ctccttagag 2521 ggtgagaaaa gggcagttaa gtttgtggag agaggggaag gttggttcca gaggtgctgc 2581 tgaagtaaga aacagcagag tgaccaagcc tgccatattt agaactgggg gcatactttg 2641 gcatagaata caaactgaag caattccacc tgtgtttcta gggggaaccg aaccctgaga 2701 aacctggtgc aattaccaga attccaattc ctggggaccg actgtcctta atttcccctc 2761 agctgcagcc ctgccccagc tgtcacctgc tcttcactgt ctctgggctg tatacactgt 2821 gactccaccc ccatcctcac tctgctctct gtgcagctcc ataaatgtca ccttgggggc 2881 ccacaatatc aaggaacagg agccgaccca gcagtttatc cctgtgaaaa gagccatccc 2941 ccatccagcc tataatccta agaacttctc caatgacatc atgctactgc aggtgaggca 3001 cactcctgcc actcttgctc ttcttggtcc agttggttcc actccccctg gaatgccggc 3061 ccttccctcc tttccatcct ggcctcttgg ttagttccta tgcctcagag gagagaggga 3121 agattgtgca gccccatcac tgtgtcgggg cccagaagtt cgttggctga cctggacttt 3181 cttgcctctt ccccaccagc tggagagaaa ggccaagcgg accagagctg tgcagcccct 3241 caggctacct agcaacaagg cccaggtgaa gccagggcag acatgcagtg tggccggctg 3301 ggggcagacg gcccccctgg gaaaacactc acacacacta caagaggtga agatgacagt 3361 gcaggaagat cgaaagtgcg aatctgactt acgccattat tacgacagta ccattgagtt 3421 gtgcgtgggg gacccagaga ttaaaaagac ttcctttaag gtaagactat gcacctgcct 3481 ggattggctc ttgggagaaa gatgtttggg gaatatctga gacctggaga ctcaagtagt 3541 gggggactcc ttcacccact agactgtgat atttctctct ggaaagagaa aaggggacta 3601 gactgagctg gggagaaatt agggcctctg caaacttacc aagaggcctt atggtggatg 3661 gtgccttctt tggaaggatg aatttgcaac actccaccca ctccaggtca cagatattag 3721 gaaactgtgc ccatgggggt gcagctaatt ataaccaggt gtgtcttcag aggctggtac 3781 ccaacgtggt taatgggctg gtcctccatg gtggacatca gccctccttg cccacttctg 3841 ggtccttaaa cagccaacgg tcccacatac ctccgatctc aggatctggg ggacatgacg 3901 gaggctggcc cctgggatga ggtgaagcag taacaatgtc cagggccaga gcttggcagc 3961 tggggccacc agcggcctgc cctgccctct ggtctcccac atgtaggctg tgcaagttgg 4021 ccttttctaa aagggggctt gagatggaag agagggcagg acccggagga gcatcagctc 4081 agtccttcca ctctctattc acagggggac tctggaggcc ctcttgtgtg taacaaggtg 4141 gcccagggca ttgtctccta tggacgaaac aatggcatgc ctccacgagc ctgcaccaaa 4201 gtctcaagct ttgtacactg gataaagaaa accatgaaac gctactaact acaggaagca 4261 aactaagccc ccgctgtaat gaaacacctt ctctggagcc aagtccagat ttacactggg 4321 agaggtgcca gcaactgaat aaatacctct tagctgagtg gaaagctggt ttcttgttta 4381 ttcattgacc ctcattctca ggcaccacat ctgcgctatg caggccaatg acacaatttt 4441 gctgttttct gctttctcct ctcccctcac cccttgccac ctccccaaac ccccacatga 4501 agctgatact cagctccttc ctatccacac cagtttctcc agggcctgcc ttctgccaag 4561 gctgaagctg agcaccatca ggagacaaca tggaccactt tggtcctggg gctttgggta 4621 aacttcttac ctccttctcc agtgttacat tgacagagaa aaaagggata ataccatggg 4681 acctaactcc tcatcccact ggggctcctc attctcccct gggcttagtt tctctaccct 4741 cctctgagct c //
GenBank-Updates@genbank.bio.net (07/17/90)
LOCUS BPEP69OMP 3000 bp ds-DNA BCT 17-JUL-1990 DEFINITION B.pertussis omp69A gene for P.69 outer membrane protein, complete cds. ACCESSION J04560 KEYWORDS membrane protein; outer membrane protein. SOURCE B.pertussis (CN2992) DNA. ORGANISM Bordetella pertussis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 3000) AUTHORS Charles,I.G., Dougan,G., Pickard,D., Chatfield,S., Smith,M., Novotny,P., Morrissey,P. and Fairweather,N.F. TITLE Molecular cloning and characterization of protective outer membrane protein P.69 from Bordetella pertussis JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3554-3558 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by N.F.Fairweather 26-JUL-1989. FEATURES from to/span description pept 145 2877 P.69 outer membrane protein precursor sigp 145 246 P.69 outer membrane protein signal peptide matp 247 2277 P.69 outer membrane protein BASE COUNT 453 a 1019 c 1096 g 432 t ORIGIN 2 bp upstream of ClaI site. 1 atcgatgata cgtcgctgta acacgacaaa tagcgtgcgt tgcagcggtt ctggatggcg 61 ttattcgtac ttttgctgcg cccattcttc cctgttccat cgcggtgcgg gcatggcggg 121 cgtctgctct ccacctggca tccaatgaac atgtctctgt cacgcattgt caaggcggcg 181 cccctgcgcc gcaccacgct ggccatggcg ctgggcgcgc tgggcgccgc cccggcggcg 241 catgccgact ggaacaacca gtccatcgtc aagaccggtg agcgccagca tggcatccat 301 atccagggct ccgacccggg cggcgtacgg accgccagcg gaaccaccat caaggtaagc 361 ggccgtcagg cccagggcat cctgctagaa aatcccgcgg ccgagctgca gttccggaac 421 ggcagtgtca cgtcgtcggg acagttgtcc gacgatggca tccggcgctt tctgggcacc 481 gtcaccgtca aggccggcaa gctggtcgcc gatcacgcca cgctggccaa cgttggcgac 541 acctgggacg acgacggcat cgcgctctat gtggccggcg aacaggccca ggccagcatc 601 gccgacagca ccctgcaggg cgctggcggc gtgcagatcg agcgcggcgc caatgtcacg 661 gtccaacgca gcgccatcgt cgacgggggc ttgcatatcg gcgccctgca gtcattgcag 721 ccggaagacc ttccgcccag ccgggtggtg ctgcgcgaca ccaacgtgac cgccgtgccc 781 gccagcggcg cgcccgcggc ggtgtctgtg ttgggggcca gtgagcttac gctcgacggc 841 gggcacatca ccggcgggcg ggcagcgggg gtggcggcca tgcaaggggc ggtcgtgcat 901 ctgcagcgcg cgacgatacg gcgcggggac gcgcttgccg gcggtgcggt tcccggcggt 961 gcggttcccg gtggtgcggt tcccggcggc ttcggtcccg gcggcttcgg tcccgtcctc 1021 gacggctggt atggcgtgga cgtatcgggc tccagcgtgg agctcgccca gtcgatcgtc 1081 gaggcgccgg agctgggcgc cgcaatccgg gtgggccgcg gcgccagggt gacggtgccg 1141 ggcggcagct tgtccgcacc gcacggcaat gtcatcgaga ccggcggcgc gcgtcgcttt 1201 gcgcctcaag ccgcgcccct gtcgatcacc ttgcaggccg gcgcgcatgc ccaggggaaa 1261 gcgctgctgt accgggtcct gccggagccc gtgaagctga cgctgaccgg gggcgccgat 1321 gcgcagggcg acatcgtcgc gacggagctg ccctccattc ccggcacgtc gatcgggccg 1381 ctcgacgtgg cgctggccag ccaggcccga tggacgggcg ctacccgcgc ggtcgactcg 1441 ctgtccatcg acaacgccac ctgggtcatg acggacaact cgaacgtcgg tgcgctacgg 1501 ctggccagcg acggcagcgt cgatttccag cagccggccg aagctgggcg gttcaaggtc 1561 ctgacggtca atacgctggc gggttcgggg ctgttccgca tgaatgtctt cgcggacctg 1621 gggctgagcg acaagctggt cgtcatgcag gacgccagcg gccagcacag gctgtgggtc 1681 cgcaacagcg gcagcgagcc ggccagcgcc aacaccctgc tgctggtgca gacgccacta 1741 ggcagcgcgg cgacctttac ccttgccaac aaggacggca aggtcgatat cggtacctat 1801 cgctatcgat tggccgccaa cggcaatggg cagtggagcc tggtgggcgc gaaggcgccg 1861 ccggcgccca agcccgcgcc gcagccgggt ccccagccgc cgcagccgcc gcagccgcag 1921 ccggaagcgc cggcgccgca accgccggcg ggcagggagt tgtccgccgc cgccaacgcg 1981 gcggtcaaca cgggtggggt gggcctggcc agcacgctct ggtacgccga aagcaatgcg 2041 ttgtccaagc gcctgggcga gttgcgcctg aatccggacg ccggcggcgc ctggggccgc 2101 ggcttcgcgc aacgccagca gctggacaac cgcgccgggc ggcgcttcga ccagaaggtg 2161 gccggcttcg agctgggcgc cgaccacgcg gtggcggtgg ccggcggacg ctggcacctg 2221 ggcgggctgg ccggctatac gcgcggcgac cgcggcttca ccggcgacgg cggcggccac 2281 accgacagcg tgcatgtcgg gggctatgcc acatatatcg ccgacagcgg tttctacctg 2341 gacgcgacgc tgcgcgccag ccgcctggag aatgacttca aggtggcggg cagcgacggg 2401 tacgcggtca agggcaagta ccgcacccat ggggtgggcg cctcgctcga ggcgggccgg 2461 cgctttaccc atgccgacgg ctggttcctc gagccgcagg ccgagctggc ggtattccgg 2521 gccggcggcg gtgcgtaccg cgcggccaac ggcctgcggg tgcgcgacga aggcggcagc 2581 tcggtgctgg gtcgcctggg cctggaggtc ggcaagcgca tcgaactggc aggcggcagg 2641 caggtgcagc catacatcaa ggccagcgtg ctgcaggagt tcgacggcgc gggtacggta 2701 cacaccaacg gcatcgcgca ccgcaccgaa ctgcgcggca cgcgcgccga actgggcctg 2761 ggcatggccg ccgcgctggg ccgcggccac agcctgtatg cctcgtacga gtactccaag 2821 ggcccgaagc tggccatgcc gtggaccttc cacgcgggct accggtacag ctggtaaagc 2881 gaggagggtc tatcccccgc ggaggagttt ttcctggagc ttggccggtg ccagtctcca 2941 ggctcaggcg gccagggcct gcgggccggg caggccgcgc tggtgctggc cgaaccattg // LOCUS ATHTRPB 4225 bp ds-DNA PLN 17-JUL-1990 DEFINITION A.thaliana tryptophan synthase beta subunit (trpB) gene, complete cds. ACCESSION M23872 KEYWORDS tryptophan synthase. SOURCE A.thaliana (strain Landsberg erecta) DNA, clone lambda-M1-g, and cDNA to mRNA (strain Columbia), clone lambda-CD-2. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 882) AUTHORS Berlyn,M.B. JOURNAL Unpublished (1989) Yale Univ, 165 Prospect, New Haven CT, 06511. STANDARD full staff_review REFERENCE 2 (bases 883 to 4225) AUTHORS Berlyn,M.B., Last,R.L. and Fink,G.R. TITLE A gene encoding the tryptophan synthase beta subunit of Arabidopsis thaliana JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 4604-4608 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by M.B.Berlyn, 18-APR-1989. FEATURES from to/span description pept 1517 1858 tryptophan synthase beta subunit, exon 1 (EC 4.2.1.20) 2109 2465 tryptophan synthase beta subunit, exon 2 2549 2991 tryptophan synthase beta subunit, exon 3 3084 3178 tryptophan synthase beta subunit, exon 4 3275 3450 tryptophan synthase beta subunit, exon 5 IVS 1859 2108 tryptophan synthase intron A IVS 2466 2548 tryptophan synthase intron B IVS 2992 3083 tryptophan synthase intron C IVS 3179 3274 tryptophan synthase intron D site 3643 3645 poly-A addition site BASE COUNT 1252 a 733 c 817 g 1423 t ORIGIN 1 gaattctttc atatctcctg caaagttctt gatatcaata ctccagcagt aactaagact 61 tagactcttg agcgtaggag agtttgataa caaagactcg gcctctgtga gcttgatcca 121 accaatagag agctttctag gcaatcccga gtttttgaac ttggagggat caagcccaca 181 cgcgtaaatc tttagtgatt cgagatttgt gtttaaaatc cgaattaaaa cctaatcaaa 241 ttaaaactaa accaaaccaa atacaatcca aaattaaact aattttggtt gagtttggtt 301 atagttttac taaatccaaa ttaacagaac ataaccaaac ccgaagattt ttagagtctt 361 tagaatttta aggtgatttt agataaaaga gattaaacac aaatctcgaa aactaaagaa 421 agagtttttg aaaattttta agtgttttca tgtaaagtgg atttctctgt gttttctgca 481 ttctgcggat tataactcct atgttttttt tctccgtcaa ttatatgtgt ttattttctc 541 tattttcttt tatttttatt tttattctct atattagggt ttagtttatg aaaacttttt 601 gttatctata taggcttggg ggatgtattt aaattagaat ttaaagtgat ttgagttctt 661 tgagttttta aataatttta acgattttaa aaaagttcgt atgatttttg taaaatctat 721 taaaatctca ccttaaatca tgggatttgg atttctgtat tttgaactaa gaaaatcctc 781 tcaaatcctc caaaatcatt aaaattcaaa tccacaaatt gttctgaata acagtgaatt 841 ttaaggtgga ttttgaaata attagttcaa taacactgaa tttcatgaga ttttttaaaa 901 tacatgtttg aataacatat gatttataaa ttctacacaa atcttttaaa attctaattt 961 caatacattg tttttgaaag tgttattgac tcttgccaat atagtatccc aattcccaac 1021 ttgtgtttca ttttttcatc tatctaataa acaattagat gaacacaaaa aaatattggt 1081 aggtgatggc tcaattggat atgtttttga aaaccatgtg ttaaaaactt aaaatactat 1141 ccaacttacc ccagtcctac caactttttt tttcttctct tggtctgctt acatgtgtct 1201 gcttatatct ccaaaaggaa atagatatat aaaaattcaa atttaaatat ttgcgatttg 1261 ttaaatttta atcaatattt aatttttgtt tttttttgtt tttttttatg aagacaacaa 1321 ataaccaaat ttatcaaatc tgatcaaagc agatttagga ttttacaaat atattttttt 1381 aatatgaatt ttgtggtcag attttgacca attctctttg aaaaaaaaaa aaatctatct 1441 ataaaaacat gtgttacttt gaaaggatat ttcaaggaga agaatatatt tgactcagag 1501 agagattcat tcagtaatgg cagcctcagg cacctctgct actttcagag cctccgtttc 1561 ttcagctcct tcctcttctt cccaattgac ccatttgaaa tcacccttca aagctgtcaa 1621 atatacgcct ctgccatcgt ctcgctccaa gtcatcatcc ttctccgtct cctgcaccat 1681 cgccaaggac ccgcctgttc tcatggccgc cggatctgac ccggccctgt ggcaacgacc 1741 cgattcgttc ggtcggtttg ggaagtttgg tgggaagtat gtacctgaaa cccttatgca 1801 cgctctatct gagcttgaat ccgctttcta tgctcttgcc accgacgatg atttccaggt 1861 agagaccttg ttcttcgtga ttagataaaa atgagacatt ttgcgttgtg ggtcgttgga 1921 attgtagcgt tatcatttgc gggttgtgat aaagttcgga tcttgttgat gggatatgtt 1981 gattctaggg catttgaatg aaattggttc tgttcattta tgaaatggtt agaaatatgg 2041 ttttggttcc tctggttcaa tgtctcattc atgtcctttg tgtatttgtg tgaatgaatc 2101 tttgttagag agagttggct ggaatcttga aggactatgt gggtagagaa agtcctctgt 2161 attttgcaga gaggcttacg gagcattaca ggcgcgagaa tggcgaaggg cctcttatat 2221 acttgaagag agaagacttg aatcacacag gagctcacaa gattaacaac gctgtggctc 2281 aggctcttct tgctaagcgg ttggggaaga agaggattat tgctgagaca ggagcgggtc 2341 aacatggtgt ggctacagct actgtgtgtg cccgttttgg tttggagtgt attatctata 2401 tgggtgctca agatatggag agacaagcac tcaatgtgtt cagaatgcga cttcttggtg 2461 ccgaggtgat gcttaggatt catttccatc tcttttaaat gtgataaact ttgttactga 2521 accttttttt gggtgtctgg tgctataggt gagaggagtc cactctggaa cagcgacatt 2581 gaaggatgcg acatctgaag cgataagaga ttgggtgaca aatgtggaga ctacccatta 2641 catattggga tctgtggcgg gtcctcatcc ttaccccatg atggtcagag actttcacgc 2701 tgtgattggt aaagaaacaa ggaaacaagc gttggagaaa tggggcggga agccggatgt 2761 cttggtggct tgtgtcggtg gtggttcaaa cgctatggga ctcttccatg aatttgtgaa 2821 tgacacagag gtccggatga tcggtgtgga agcagcggga ttcggattgg acagtggcaa 2881 acacgctgct acattgacaa agggagatgt tggtgtcctc catggagcta tgagttactt 2941 gctgcaagat gatgatggac aaatcattga accacactcc atcagtgcag ggtaagtgaa 3001 attctctatt cagatccggt aaccataata agtctactca gttagccacg gctgcatgaa 3061 catggtttga ttttctctta cagattggac taccctggag tcggacccga gcacagtttc 3121 tttaaagaca tgggacgggc tgaatactat agcataaccg acgaagaagc gttggaaggt 3181 aatgtaccga tgcatagtct ttagttctgt atttgaaaat ctgttctttg tttcataact 3241 tacctttgca ttccctttgg tttgtggtct gtagcgttca agagagtgtc gcggttagag 3301 ggaatcattc cagcactgga gacctcacac gcactagctt acctcgagaa gctatgtccc 3361 acattatcag acgggacgag agtggtcttg aacttcagcg gaagaggaga taaagatgtt 3421 cagacagtgg ctaaatatct tgatgtttga agagagaaac acttatgttt gtttttcagt 3481 tattcagaca gaatgaaatc ctttttgtac ttgtaatctg tttgtacttt acttctctct 3541 ggtcagtacg tttccaacaa aattggaaat agaaaaagat tacgaagaga acgaaatttg 3601 aataaatgta ttatcaataa aataaaaaac atttcatatt ggaaatagtc ttatgttcct 3661 caatctgctt tctttttcgt tttcaaaacg aaatcctact tttgcagata caaacatgag 3721 atatgatctc agaactctaa tagcttattt atctcaaatt tcatacgaga tgttacattg 3781 gaaagaggtc caaattgcta gaagacatat atattcgcat atagtatacc aatctgccaa 3841 agttatgttc cccttgtgaa ttgtgatcct taagatttaa gaaaaaatag aagtggtgac 3901 tttgatgtat gaaaaataat aatataagca aagttgcaaa atccaataga gaaagaatct 3961 tcttttagta tatccttcac caaacatatc tctctgcctc ttctgaaatc ccaagagaca 4021 tgtctctctc tctcttttcc aagtccgaac caaatctaat gtatcaacat tacttcttct 4081 gaagtcccaa gagacattgc tctctctctt ttccaagtct gaaccaaatc taatgcatca 4141 acattacttt tgtaataaaa ttcaataagg attagctgat tactgttgtc actctttggt 4201 ttcttttcct ttgttaatag aattc // LOCUS HS5E1P 2590 bp ds-DNA VRL 17-JUL-1990 DEFINITION Murine cytomegalovirus early transcription unit e1 protein gene, complete cds. ACCESSION M35146 KEYWORDS . SOURCE Murine cytomegalovirus (salivary strand strain Smith; ATCC VR-194) DNA, passed in mouse (BALB/c) embryonal fibroblast cell line B25. ORGANISM Murine cytomegalovirus Viridae; ds-DNA enveloped viruses; Herpesviridae; Betaherpesvirinae. REFERENCE 1 (bases 1 to 2590) AUTHORS Buehler,B., Keil,G.M., Weiland,F. and Koszinowski,U.H. TITLE Characterization of the murine cytomegalovirus early transcription unit e1 that is induced by immediate-early proteins JOURNAL J. Virol. 64, 1907-1919 (1990) STANDARD simple staff_review FEATURES from to/span description pept 163 955 early transcription unit e1, exon 1 1049 1225 early transcription unit e1, exon 2 1552 1574 early transcription unit e1, exon 3 pre-msg 44 2548 e1 mRNA and introns IVS 956 1048 e1 intron A IVS 1226 1551 e1 intron B signal 16 20 TATA signal signal 2548 2553 Poly-A signal BASE COUNT 564 a 732 c 751 g 543 t ORIGIN 1 gcgtgatgca gactttataa atcgcaagcc ggagcggatc ctaggctcat ttcttctgcg 61 atcttcgcat ttagcagacc aaatgctgat agttcctgcg tcgcggtaga ttacgtgccc 121 acttttctcg tcgcgaccgg tgaaaagacc ttcgttcgga ccatggccgc gccagatcga 181 cgcggatcgc ccatcgtttc gagacgacgt tacttcacct tccggaactc gagtcggacg 241 ctgcatcaga atgtcacccg catgttcgac ctccgtcagt ggacctacga gagcgcccgc 301 gttttggact gcacgaacgg cgagggccgc accgagaact gggggcccgg ctggctgtgc 361 gcgaccatca tgcagtctcc ggacagcggc tcgtcgtccg gatcggcggc ccagggatgc 421 atgtctctgg acatcacctc ggacgacgga ccggagaagg tcatgaacat gttccaccgc 481 gggagcatcg tctgtaacaa gaccgtctct tcggtcgcga cgccgccggg gtccgacggc 541 acgggcgcca gcctcttgac tctggtggcg gacggcagct tgctgcaagt catcctggtc 601 gaacactcgc cgggcgccca ccgcgccgaa gccgacgatg tcgccgagag cgtcggcaac 661 gccgcggtgg ccgccgccgt gcagtcggcc acttcgatgg gtagctcgag ctccgtcggc 721 ggcgccggtg gcggcggctt caattcttcg gaagaacgtc gccgcgataa agactcgtcc 781 aaatctcacg atgacgaccg acgtaagaga ttcgaactct gcaacggtac ccttcatagg 841 gagcggacct cgggtagctc cggatcgagc ggcggcggcg gcggcagcgg cagcggaggc 901 agcagcggcc tcagtacaaa gcagaaggag cgcagcagga ggctcgaaga ggaatgtaag 961 tagttctgat cggggtttta ctggtcgctt tccccctcga ccttcggtga accgtcacgg 1021 gactaatgtt gttactctgt tcggaaaggt tctccacgca gcggggggga gcccaagcgc 1081 cagaagaccc accacgatcc gagaccggag agggatctgg aaccgcctag gagcagtacc 1141 actgtcgatg gtaatgcaga cgaagctgga gcggtctcct ttttgaacag ttatgctgct 1201 agttcgctgt ccgccgtcag cgatggtgcg ttaccaccct cctctccggt gactatagag 1261 acaccaaggt ttaatgactt gatagaatcc gaggaggaag acgatggaac tcggtgttcg 1321 tctgccgatg gtacggcgac ggtaaccgag accggggcgt ctgccgagat cgagcaaacg 1381 ttcaccgacg gtgggctagc atgcatgagg tccttaattg actcgcgggc cctcatggat 1441 ctcgccttta cgagccgcat gacggcggtg aagtcgggat ctgagggagc gggcaacacc 1501 tgtcctgggg aacagaggct ttctaactgc aatcttcctc cgaattcaca ggaggacaac 1561 aacccgcgtt ttgaaggcgt ctttaattcc ctgatgagac tcttagacga gtataaagac 1621 aagagcacca gctcgagttc cgccgccgta ccgggatcca gttgtacagg caccacgcag 1681 acagagggga tgcctcacca caggggtatc ttcgacaggg acacatccgc cgctcagcag 1741 aaacctgtat gtgagatacg gcccttcatc gacatagctc gctgtgtcga gccggcggcg 1801 gcgccgaccg cttcgcgctc cagagccacc acggccgccg ctcgcaggcg gggaagaggc 1861 tcgcagcctc ggaggaacag tcgggtcgcg gcgcgcgtcg cccaggtcga tccgaccgac 1921 ccacccgcag ctgctgctgc cgccgccgcc ggtcccacca cgcggggaag gagcaggagt 1981 aggaggggaa gacgaggccc agacctcacg gaggacgggt tggagattgt cgagacgggc 2041 gccaccgccg gcactctggc cgtttccgag gaggaaaccg cgatggccgc cgctatgtta 2101 gaggatatgg tcgacttgga caatgtgttc gatgatctta attgatctca tgaaagatgg 2161 ataaagaaat gatgctctgt aaactctaac tgactgttgt tatcgatcgc ctcgtctcgc 2221 tcgggtcttc tgccgatgct gccgtctgta agcacgcaag caagcactcg ctctatagca 2281 gtagaatatg tatggatgat atacttactg ttgttgttca cttactatta tcacctatat 2341 aaaacttgta ttcatgatac ttaccactga aaaaattgtg tatgtctttt tatacttttg 2401 accaccgtta ttgatgataa gtatactctt ctaactgtgt gtgtctctgt ctaccgggcc 2461 gatggagata ttatccctgg tccccctctg tctgtgctaa caacataaca tccatgtaaa 2521 taaaattaat atttttaata ttttatcaat aaaaaccaca catttgttac aataaaccgc 2581 gctcgccttt // LOCUS HSE14GP 3351 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete cds. ACCESSION M35145 KEYWORDS glycoprotein 14. SOURCE Equine herpesvirus type 1 DNA. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 3351) AUTHORS Guo,P., Goebel,S., Perkus,M.E., Taylor,J., Norton,E., Allen,G., Languet,B., Desmettre,P. and Paoletti,E. TITLE Coexpression by vaccinia virus recombinants of equine herpesvirus 1 glycoproteins gp13 and gp14 results in potentiated immunity JOURNAL J. Virol. 64, 2399-2406 (1990) STANDARD simple staff_review FEATURES from to/span description pept 300 3242 glycoprotein 14 (gp14) precursor sigp 300 554 glycoprotein 14 (gp14) signal peptide matp 555 3239 glycoprotein 14 (gp14) signal 3251 3256 Poly-A signal BASE COUNT 882 a 892 c 858 g 719 t ORIGIN 1 aacgttgggt tgttaccgca tctcaaggag gaactagctc ggtttatgat tactgcggct 61 aaaggtaatt ggtcaattag cgagtttcaa aggttttatt gctttgaggg agtgacaggt 121 gtgacggcca cgcagcggct ggcgtggaaa tatatcgggg agctcatcct agccgccgca 181 gtattctcct cggttttcca ctgtggagag gtgcgcctcc tgcgcgcaga tcgtacctac 241 ccggactcca gcggcgcaca gcgctgcgtg agcggcattt acataaccta cgaggcgtca 301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg 361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct 421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa 481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct 541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt 601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac 661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc 721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg 781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct 841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat 901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga 961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata 1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt 1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg 1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg 1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt 1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc 1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag 1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact 1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg 1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tctacgttta 1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag 1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta 1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc 1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg 1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca 1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa 1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg 1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac 2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg 2101 cgtggtgtac cctccaaaac aaagagcgga ccctatggaa cgaaatggtg aagattaacc 2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg 2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc 2281 gctcgatgga cagtaacacg tgctactccc gcccccccgt aacatttaca attactaaga 2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg 2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag 2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg 2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc 2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac 2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg 2701 acaataccgc agtgattatg caggggatcg ccagcttttt caagggcctg ggtaaagtgg 2761 gggaggccgt gggaacgctc gttctcggcg ccgccggcgc tgttgtttca accgtatctg 2821 gaatagcttc gtttttaaac aacccatttg gggggctagc catcggcctg ctggtaatcg 2881 ccggcctggt agctgcgttt tttgcttaca gatatgtaat gcagatccgc agtaacccca 2941 tgaaagctct ataccccata acaacaaagg ccttgaaaaa caaagccaaa acttcctacg 3001 gccagaacga ggaggacgat gggagcgact ttgatgaggc caagcttgaa gaggctcgcg 3061 aaatgatcaa atacatgtct atggtttcgg ccctggaaaa gcaggaaaag aaagctataa 3121 agaaaaacag tggggttggc ctgatcgcca gtaacgtctc aaagctggcc ctgcgaaggc 3181 gcggtcccaa atatacccga ctccaacaga acgataccat ggaaaatgaa aaaatggttt 3241 aaacatgttt aataaatatt atgacacgta ctcaaagtgt gacctcatat ttgcataacc 3301 actttctagt tccggcccca aggatattta agcctagtat ctccgccgag g // LOCUS MLV3ENV 232 bp ds-DNA VRL 17-JUL-1990 DEFINITION Murine leukemia virus p15-gag protein gene, complete cds. ACCESSION M35144 KEYWORDS p15-gag protein. SOURCE Murine leukemia virus (strain Emv-3) DNA. ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 232) AUTHORS Mercer,J.A., Lee,K.H., Nexo,B.A., Jenkins,N.A. and Copeland,N.G. TITLE Mechanism of chemical activation of expression of the endogenous ecotropic murine leukemia provirus Emv-3 JOURNAL J. Virol. 64, 2245-2249 (1990) STANDARD simple staff_review FEATURES from to/span description pept 17 > 232 p15-gag protein BASE COUNT 51 a 57 c 53 g 50 t 21 others ORIGIN 1 gaccgcttct aaaaacatgg gaccgaccgt aaccacccct ctgagtctga ccctagaaca 61 ctgggaagat gtccagcgca tcgcgtccaa tcagtccgta gatgtcaaga agagacgctg 121 ggtcaccttc tgctctgccg agtggccaac tttcggtgta gggtggccac aagatggtac 181 ttttaatnnn nnnnnnnnnn nnnnnnnnaa atctaaggtg ttctctcctg gt // LOCUS CTRTPDC 1740 bp ss-mRNA PLN 17-JUL-1990 DEFINITION C.roseus tryptophan decarboxylase mRNA, complete cds. ACCESSION M25151 J04521 KEYWORDS decarboxylase; tryptophan decarboxylase. SOURCE C.roseus leaf, cDNA to mRNA. ORGANISM Catharanthus roseus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Gentianales; Apocynaceae. REFERENCE 1 (bases 1 to 1740) AUTHORS De Luca,V., Marineau,C. and Brisson,N. TITLE Molecular cloning and analysis of cDNA encoding a plant tryptophan decarboxylase: Comparison with animal dopa decarboxylases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2582-2586 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Brisson, 01-JUN-1989 FEATURES from to/span description pept 70 1572 tryptophan decarboxylase (EC 4.1.1.28) mRNA 1 1740 tryptophan decarboxylase mRNA BASE COUNT 505 a 355 c 359 g 521 t ORIGIN Unreported. 1 ctctctctct ctctctctaa gactttctct ctctacacat acacctacac cagaaaaaag 61 aaaaaaataa tgggcagcat tgattcaaca aatgtagcca tgtccaattc tccagttgga 121 gaatttaagc cacttgaagc tgaggaattc cgaaaacaag cccatcgtat ggtagatttc 181 atagccgatt attacaaaaa tgtggaaaca tatccggtcc ttagcgaagt cgaacctgga 241 tatctccgaa aacgtatccc cgaaaccgct ccttacctcc ccgaaccact tgacgacatc 301 atgaaagata ttcagaagga tattatccca ggaatgacaa attggatgag ccctaatttt 361 tatgcatttt ttcctgccac tgttagttca gctgcctttt taggagaaat gttgtctact 421 gccctaaatt cagtaggctt tacttgggtt tcttcaccag ccgccaccga attagaaatg 481 attgttatgg attggttggc tcagatcctt aaactcccca aatctttcat gttttcaggt 541 accggtggcg gcgtcatcca aaacaccact agcgagtcca ttctttgtac aatcattgcc 601 gcccgggaaa gggccctgga gaagctcggt cccgatagta ttggaaaact tgtctgttac 661 ggatccgatc aaacccatac catgttcccc aaaacttgca aattggcggg aatttatccg 721 aataatatta ggttaatacc tacgaccgtc gaaacggatt tcggcatctc acctcaagtt 781 ctacgaaaaa tggtcgagga tgacgtggcg gccggatatg taccgctgtt cttatgcgct 841 accctgggta ccacctcgac cacggctacc gatcctgtgg actcactttc tgaaatcgct 901 aacgagtttg gtatttggat ccacgtggat gctgcttatg cgggaagcgc ctgtatatgt 961 cccgagttta gacattactt ggatggaatc gaacgagttg actcactgag tctgagtcca 1021 cacaaatggc tactcgctta cttagattgc acttgcttgt gggtcaagca accacatttg 1081 ttactaaggg cactcactac gaatcctgag tatttaaaaa ataaacagag tgatttagac 1141 aaagttgtgg acttcaaaaa ttggcaaatc gcaacgggac gaaaatttcg gtcgctgaaa 1201 ctttggctca ttttacgtag ctatggagtt gttaatttac agagtcatat tcgttctgac 1261 gtcgcaatgg gcaaaatgtt cgaagaatgg gttagatcag actccagatt cgaaattgtg 1321 gtaccgagaa acttttctct tgtttgtttt agattaaaac ctgacgtttc gagtttacat 1381 gtagaagaag tgaataagaa acttttggac atgcttaact cgacgggacg agtttatatg 1441 actcatacta ttgtgggagg catatacatg ctaagactgg ctgttggctc atcgctaact 1501 gaagaacatc atgtacgccg tgtttgggat ttgattcaaa aattaaccga tgatttgctc 1561 aaagaagctt gatgaataag taagggtttt tttttaattt ttttttaaat tttatatttg 1621 ctgattgttt gaagagttta aaaataaagt gatttgtaaa ggtttattgt actcaaacaa 1681 tcatgcaatt aattatatgt attaattatg acatgagaat aaaatagaat ttgtgtgtgc // LOCUS WHTAMYA 2397 bp ds-DNA PLN 17-JUL-1990 DEFINITION Wheat alpha-amylase gene, complete cds. ACCESSION M16991 KEYWORDS alpha-amylase. SOURCE Wheat (strain Chinese Spring) distal half-grain DNA, clone lambda-Amy3/33. ORGANISM Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 2397) AUTHORS Baulcombe,D.C., Huttly,A.K., Martienssen,R.A., Barker,R.F. and Jarvis,M.G. TITLE A novel wheat alpha-amylase gene (alpha-Amy3) JOURNAL Mol. Gen. Genet. 209, 33-40 (1987) STANDARD full staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by D.C.Baulcombe, 13-AUG-1987. The region bounded by nucleotides 975 and 1032 might be a signal peptide. FEATURES from to/span description pept 963 1052 alpha-amylase, exon 1 1138 2028 alpha-amylase, exon 2 2133 2393 alpha-amylase, exon 3 pre-msg 908 > 2397 alpha-amylase mRNA and introns (5' end +/- 3 bp) IVS 1053 1137 alpha-amylase intron A IVS 2029 2132 alpha-amylase intron B BASE COUNT 613 a 615 c 599 g 570 t ORIGIN 874 bp upstream of SstI site 1 aacattcaca atgcaaaatc tatatcatca gatccatcat gaaatatatt ttcatattat 61 atattttttg gtattgtagc tgcttttttc cttgaataca cacgagtgtg atatatatat 121 ttttggtatt gtagctgctt taattatata tatatatata tatatataca tatatatata 181 tatatataat taaagaagat gatgggctaa gaagcccttg gtgttgtaga tgttgatatt 241 ttttgatata attttggtca aactttataa agtttggatt ttcagaaatc taacacagag 301 agtaaaaaga aacggaagga gtatataata agagttctag cagccgctcc tccaactgca 361 cccaactgca ccctcgcctg agccggagcg gcgaggggga ggctaggtcg ccgcgccacc 421 atgtaaaaag attctttttt tatctttacc gactctagcc tcaccttgta cacacaaaaa 481 caagccaatc gaggtttttt tctagaatca acatcaaacc atccggtttt atgtgaaccg 541 cccggttcat cggttttttt tcaaaaaacg tctgattcga ccgggttttc ctggttggac 601 tcaatcggtc caccagactt aaagaactgc aggaatcgcc ggttccggtt ttcaaaacta 661 tgagcagagt gcttgtctag tgcagatata catggagagt aattaaacgg cagtaattaa 721 tagacaaaac tttagctcct aatccatttt atctcagaag ccagcggtcc ttatccatgc 781 ggatatcttt ggagaaaaat taaggagaat tgatagtttt tactaggcac ctactaatct 841 ctaagctcca agtacctata aatacaagcg agctcaagca ttgccaaacc aaggagtagt 901 cgtcggtcca ggagtccaga tttcatagga agtagaggcg agctgcactg cagactgtag 961 ctatgggcaa gcactctgct actctctgtg gccttctggt tgttgtgctt tgtctcgcct 1021 ccagcttagc acaggctcaa attcttttcc aggtgagaaa atgcatgcat cttcaagttt 1081 catttgcacc cagaatcttt catatgatga aatgtgcttt ggtgtttcgt tggccagggg 1141 tttaattggg aatcgtggaa gacacaaggc gggtggtaca agttcatgca agggaaggtg 1201 gaggagatcg ccagcaccgg cgccacccac gtctggctcc ccccgccgtc gcaatccgtg 1261 tcgccggagg gctatctgcc ggggcagctc tacaacctca actccaagta cggcagcggg 1321 gccgacctca agtcgctgat ccaggcgttc cgcggcaaga acatctcgtg cgtcgccgac 1381 atcgtcatca accaccgctg cgccgacaag aaggacggcc gcggcgtcta ctgcatcttc 1441 gagggcggga cgtccgacaa ccgcctcgac tggggccccg acgagatctg cagcgacgac 1501 accaagtact ccaacggccg cggccacagg gacaccggcg gtggcttcga cgccgcgccc 1561 gacatcgacc acctcaaccc gcgggtccag agggagctct ccgcctggct caactggctc 1621 aaaaccgacc tcggcttcga cggatggcgc ctcgacttcg ccaagggata ctccgcggcc 1681 atggccaaga tctacgtcga caacagcaag ccggcgttcg tcgttggcga gctctacgac 1741 cgggaccgac agctgctcgc gaactgggtg cggggcgtcg gcgggccggc cacggcgttt 1801 gacttcccca ccaagggcgt tctccaggag gccgtccagg gcgatctggg gaggatgcgc 1861 ggcagcgacg gcaaggcgcc cggtatgatt gggtggatgc cggagaagac cgtcacgttc 1921 atcgacaacc acgacactgg gtcgacgcag aggctgtggc cattcccctc ggacaaggtc 1981 atgcagggct acgcctacat cctcacacac ccgggcatac catgcatcgt aagtagtagc 2041 acactacaca acctcaccat aacatttcgc atcaaacgta ccccacgatg tttgtgatct 2101 gaacttacaa ctacttggtt ttgcgcgcgc agttctacga ccatgtgttc gactggaaac 2161 tgaagcagga gatcaccgca ctggctacgg tcaggtcaag gaacgggatc catccgggta 2221 gcacactgga tatcctcaaa gccgagggcg atctctatgt tgccaagatt gggggcaagg 2281 ttataaccaa gatcgggtca agatacaata ttggcgacaa tgtgatcccc tcgggtttca 2341 agattgcagc taaaggcaac aactattgtg tctgggagaa gagcggcctc tgaattc // LOCUS STAFEMA 3446 bp ds-DNA BCT 17-JUL-1990 DEFINITION S.aureus factor essential for expression of methicillin resistance (femA) gene, complete cds, and trpA gene, 3' end. ACCESSION M23918 KEYWORDS femA protein. SOURCE S.aureus (strain NCTC 8325; organism BB270) DNA. ORGANISM Staphylococcus aureus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 3446) AUTHORS Berger-Baechi,B., Barberis-Maino,L., Straessle,A. and Kayser,F.H. TITLE FemA, a host mediated factor essential for methicillin resistance in Staphylococcus aureus: Molecular cloning and characterization JOURNAL J Antimicrob Chemother 23, 671-673 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by B.Berger-Baechi, 11-APR-1989. FEATURES from to/span description pept < 1 294 trpA protein (AA at 1) pept 577 1878 FemA protein pept 1897 3156 ORF 419 protein pre-msg 514 > 3446 FemA mRNA rpt 361 390 large inverted repeat, left end rpt 405 434 large inverted repeat, right end rpt 416 423 small inverted repeat, left end rpt 430 437 small inverted repeat, right end BASE COUNT 1281 a 448 c 617 g 1100 t ORIGIN 3 bp upstream of EcoRV site; segment 18 of S.aureus chromosome. 1 gatatcgtat cccatgcgga aggctttatt tatactgtga cgatgaatgc gacaacaggg 61 caaaacggtg cgtttcatcc agaattaaaa cgaaaaattg agtcaattaa agcgatagcc 121 aatgtgccag ttgtcgcagg atttggtata agaacaccac aacatgttgc agatataaaa 181 gaggttgcag atggcattgt cattggtagc gaaatcgtta agcgatttaa atctaacacg 241 cgtgaggaaa tcattaaata tttacaatct atccaacaaa cattgaataa ttaagtttac 301 ttgatttaaa aaaattaggc gaatactgtt tgaaaaagtg aaaaacggtg aattataaaa 361 ttgaatacaa tttcaaaaaa agtaatatga gcaaacccaa acgttcatat tacttttttt 421 gaaattgtat tcaaaaatct aaatattact ataaaagtat acgcaattaa agcgtttatg 481 ttttagtttt aacattaact attgtatact tatttagatt agatttatta tttttgacat 541 ttgcagaggg gaaatagaaa aactgcaaat acggaaatga aattaattaa cgagagacaa 601 ataggagtaa tgataatgaa gtttacaaat ttaacagcta aagagtttgg tgcctttaca 661 gatagcatgc catacagtca tttcacgcaa actgttggcc actatgagtt aaagcttgct 721 gaaggttatg aaacacattt agtgggaata aaaaacaata ataacgaggt cattgcagct 781 tgcttactta ctgctgtacc tgttatgaaa gtgttcaagt atttttattc aaatcgcggt 841 ccagtgattg attatgaaaa tcaagaactc gtacactttt tctttaatga attatcaaaa 901 tatgttaaaa aacatcgttg tctataccta catatcgatc catatttacc atatcaatac 961 ttgaatcatg atggcgagat tacaggtaat gctggtaatg attggttctt tgataaaatg 1021 agtaacttag gatttgaaca tactggattc cataaaggat ttgatcctgt gctacaaatt 1081 cgttatcact cagtgttaga tttaaaagat aaaacagcag atgacatcat taaaaatatg 1141 gatggactta gaaaaagaaa cacgaaaaaa gttaaaaaga atggtgttaa agtaagattt 1201 ttatctgaag aagaactacc aatttttaga tcatttatgg aagatacgtc agaatcaaaa 1261 gcttttgctg atcgtgatga caaattttac tacaatcgct taaaatatta caaagaccgt 1321 gtgttagtac ctttagcgta tatcaacttt gatgaatata ttaaagaact aaacgaagag 1381 cgtgatattt taaataaaga tttaaataaa gcgttaaagg atattgaaaa acgtcctgaa 1441 aataaaaaag cacataacaa gcgagataac ttacaacaac aacttgatgc aaatgagcaa 1501 aagattgaag aaggtaaacg tctacaagaa gaacatggta atgaattacc tatctctgct 1561 ggtttcttct ttatcaatcc atttgaagtt gtttattatg ctggtggtac atcaaatgca 1621 ttccgtcatt ttgccggaag ttatgcagtg caatgggaaa tgattaatta tgcattaaat 1681 catggcattg accgttataa tttctatggt gttagtggta aatttacaga agatgctgaa 1741 gatgctggtg tagttaaatt caaaaaaggt tacaatgctg aaattattga atatgttggt 1801 gactttatta aaccaattaa taaacctgtt tacgcagcat ataccgcact taaaaaagtt 1861 aaagacagaa ttttttagga agggaattat caaaacatga aatttacaga gttaactgtt 1921 accgaatttg acaactttgt acaaaatcca tcattggaaa gtcattattt ccaagtaaaa 1981 gaaaatatag ttacccgtga gaatgatggc tttgaagtag ttttattagg tattaaagac 2041 gacaataaca aagtaattgc agcaagcctt ttctctaaaa ttcctactat gggaagttat 2101 gtttactatt cgaatcgtgg tccagtaatg gatttttcag atttaggatt agttgattat 2161 tatttaaaag agttagataa atatttacag caacatcaat gtttatatgt taaattagat 2221 ccgtattggt tatatcatct atatgataaa gatatcgtgc catttgaagg tcgcgagaaa 2281 aatgatgccc tagtaaactt gtttaaatca catggttacg agcatcatgg ctttacaact 2341 gagtatgata catcgagcca agtacgatgg atgggcgtat taaaccttga aggtaaaaca 2401 cccgaaacat tgaaaaagac atttgatagt caacgtaaac gtaatattaa taaagcgata 2461 aactatggtg ttaaagtcag attccttgaa cgtgatgagt tcaatctttt cttagattta 2521 tatcgtgaaa ctgaagagcg tgctggattt gtatcaaaaa cagatgatta tttttataac 2581 tttattgaca catatggaga taaagtatta gtaccattag catatattga ccttgatgaa 2641 tatgtgttaa agttgcaaca ggaattgaat gacaaagaaa atcgtcgtga tcaaatgatg 2701 gcgaaagaaa acaaatcaga taagcaaatg aagaaaattg cagaattaga taagcaaatt 2761 gatcatgatc agcatgaatt attgaatgca agtgaattga gcaaaacgga cggcccaatt 2821 ctaaaccttg cttctggcgt ttattttgca aatgcatatg aagtgaatta tttctctggt 2881 ggttcatcag aaaaatataa tcaatttatg ggaccataca tgatgcattg gtttatgatt 2941 aactattgct tcgataatgg ctatgatcgt tataatttct atggtttatc aggtgatttt 3001 acggaaaaca gtgaagatta tggcgtatac cgctttaaac gtggatttaa tgtacaaatc 3061 gaagaattaa taggggattt ctataaacca attcataaag tgaaatattg gttgttcaca 3121 acattggata aattacgtaa aaaattaaag aaatagatgt aaatgatgtt agggcttata 3181 gttattgata ctataggctc ttttttatat gttttttaat aaagcctcga ataatgatat 3241 attaaatgac aagtttaaca agtgattgca ataaatataa atatgaattg cacaacctat 3301 ttgtaaatga ttgagttgag tatacttttt aattcattca atatgactca aaacaagcca 3361 atatgtatta tatttttggt gaataaacct tagcataaca atttatgtct gattatatat 3421 gtgtgtgttt gcaagtgtgt gatatc // LOCUS WTVPNS11A 1172 bp ds-RNA VRL 17-JUL-1990 DEFINITION Wound Tumor Virus genomic segment S10 encoding Pns11 nonstructural polypeptide, complete cds. ACCESSION M24114 J04352 KEYWORDS polypeptide Pns11. SOURCE Wound Tumor Virus, cDNA to viral RNA. ORGANISM Wound tumor virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus. REFERENCE 1 (bases 1 to 1172) AUTHORS Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L. TITLE Complete nucleotide sequence of wound tumor virus genomic segments encoding nonstructural polypeptides JOURNAL Virology 171, 222-228 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.L.Nuss, 20-APR-1989. FEATURES from to/span description pept 25 1068 polypeptide Pns11 BASE COUNT 359 a 222 c 236 g 355 t ORIGIN 1 ggtatttttg ttggatctgc tgtgatggat gcttctgtcg acaggatcac aaatctacat 61 tttgaaatcc tcgctaaagc gggtggacat gaaatacacc aaaaatatga agcaattagg 121 aagttgaacc ttacgggtga ttcttcaaag agtaatatct ctgtatctgc aaggtcagca 181 atattgaaat gggctgacgc taaacaagga tacatagcca gccaactgga cgatcgagat 241 tatggcgacc tcatcgccaa agctgtaatc tttgtcccga tgtctgttat aacaggaggg 301 aaaaatccga aggatttaat accttacgga gttgtcgctg cagtgttaat ttttgttcct 361 gaaacgttga cactactgga tgagatagtt attaatttaa tgcatgataa gaaacctttg 421 tcatcaattt tattgacaaa gatactcaga gacatgaaaa ttgatgtctg tggtagtaat 481 tttgattcat tttattactg cccaatttcg aggtataata gacatattat taagctagct 541 ggtgcattac ctcaaatgcc gactagcgtt agattatcag ttaatgattt agctcgagta 601 gctatttccg aagtacataa ccagcttatt tcggataaac agatgttttt caaactacca 661 actggttttt cgccaaaagt gcattgttta aaagtacttt gcactactga aatggaaatt 721 tttcagaaat gggttagaac gttcatgtct gatagaccga acgaattcat ctattctgac 781 cagtttaata tattatctag gaccacctac ttttcttcag atgatccttt ttcatttttt 841 actctttgga ggggatggtc aacctacaaa gaaatactct ctcaagatca agcttcaagc 901 tttcttgaag ccataggaag tggaaagccg ttgagaagtt caatcgctac ttttccatca 961 atgtttgatg aaggtgctat ttacatcaga tatgaatgga taactccaaa agattcagca 1021 aattcgaaga aggctggctc ttcagcaccg agcgcgccta aaatgtagaa gatttgtcct 1081 tgagattcgc atctggagca gcccggccta gtcacccgga gccagatcga gctaaaactc 1141 gagttgaatt ccccagctcc aacaaacatg at // LOCUS WTVPNS4A 2565 bp ds-RNA VRL 17-JUL-1990 DEFINITION Wound Tumor Virus genomic segment S4 encoding Pns4 nonstructural polypeptide, complete cds. ACCESSION M24117 J04352 KEYWORDS polypeptide Pns4. SOURCE Wound Tumor Virus, cDNA to viral RNA. ORGANISM Wound tumor virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus. REFERENCE 1 (bases 1 to 2565) AUTHORS Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L. TITLE Complete nucleotide sequence of wound tumor virus genomic segments encoding nonstructural polypeptides JOURNAL Virology 171, 222-228 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.L.Nuss, 20-APR-1989. FEATURES from to/span description pept 64 2262 polypeptide Pns4 BASE COUNT 809 a 463 c 625 g 668 t ORIGIN 1 ggtattgatc ccactttcta gatcagtgag ctagtgaagt aaggttcaaa actaatcatc 61 gccatgaagc aagtagccgg gtttattggt atgagtcaaa agaacaaagg aattcagcaa 121 aaccagtggc attccggtcc gccacaggga ctgctttcag gacaaacgaa agctgaacaa 181 ggtaccagca gtcagcaagc tggagtgaat caaggagaga attccaaatc tggggcggta 241 gtccagagga aaatgccaat gaggagagac gttaattctg ccgcacaacg tcaacatgta 301 aggaggttgg tgtacacacc atactttcca aatgaaacat ggaatatatc aacattcaaa 361 aatggagcta aagacgtaga acggtcggtt atatcgacgc tgaacaacgt cgctgagaat 421 agagtcatgg acaactgtac ttccagagtc atttttgaga tgactcaaat acaatttgag 481 tcacttccag atataataag aaatgaattt actagagttg gggatgacgc cttgaaatgg 541 gcggtccctg aagatttgaa atctgcagat cttgaccata tgatggtcgt gaaattatca 601 acagaaggca cgatataccc aacgacgctt atttttccag gtggatgtag tgggatggct 661 aaattgaaat cagtctactc attcttggaa agtcagctag aaagaatagt cacaccgaca 721 ccctcggtgt ctttgaagta tgttacaagc tgggctgagc acttatttga tctgtgctcc 781 ggacagctga tcaattcaca aaacgagcgt gtagacaaac tgcttggata catgatctgg 841 gatattgaaa aagctatcac tttgactgat caagtgatcg catgttacaa ccaccctgaa 901 gttgtgctga ggaggctagg ggcctcggat attgcatgcg ctgtgttagc tggggagtct 961 gtggtgaagc tcacccgcct agcgttatcc aagtctcctg ttgatggctg ctcatgctgt 1021 agaatattgg agctgattct aaatttacca tcacgtaagc ccaacgataa ggtgccgcag 1081 gtaccattgg acatactatt cgcatctgtc tatagatacg tatcagccat gtgcatgggg 1141 cgagtgctga atgggcgcat tgacgcatcg gggatacagt cgactgatca tgcgactgct 1201 tcgatcaagc tgaatgatat aattgtaaat gatcttgagt taaggagtat gggtgttgat 1261 aaaacgtcat cattcagagg aacgcaatct atgagggcgt tctacgttcc tgagaattta 1321 gcaggctcta tactagatag gattaacgtg ttggtcatgc gccattttgg aatcctgcat 1381 atgtggggtt ttaatggtgt agtacttcaa aatcaagagg ggtactgtga ctatcacatt 1441 ataacaggat taaaccactt aacaactata acgcatacta actccatggt tgcagtacat 1501 tggggtactg aatcaaggat ggataatata tttgaaataa aagcaagaac tcttccaacc 1561 gcatccgaga cgatgataac attgatcgaa aatgccttga aggagcaact gacatcaata 1621 gtgaaagacg gcctcaggaa gggagtaagt ttttcagtga aaagaaatat aaatgactct 1681 cgctttggat ttgagactaa ttcatcccca gcgatatttc tgaaactcag agacatgcta 1741 aaaagagcta agcctttctc agacctgtta tccttggcat tgtcaaaggt aataaaaaaa 1801 gagaacgcta tgatacaacg gtcaatcacc acagtggaag tggccgttgc cattaaaatg 1861 aaagtctatg ggttggatga gtatgtcagc ctgatgaaag ttgagaagaa ggaagtggaa 1921 agtggaagcc ttccacttca agaatttctg aaattaaaat caaatgcagc aggggcacag 1981 tcatcgactg ttgcagttaa gatgaaagaa gaagaagtca atagcaaagc ctattgcctg 2041 atatcagaaa ctattgttgt aaacatggac gcagtgagaa gtgcatgtgg agtagtgcaa 2101 tcagagaatc tggtcataaa gtcagaactt tcaggaccag aattgagtga aagtgtaact 2161 tcagggttga tggagttact aggaagaaat gccggaccaa gtaaatcttg ggctgaccaa 2221 gttgaagaag cggaaaatga agaggaaaaa caaaaggagt gagtgtgctc catttctgga 2281 tcggagtgtc ttgcgaggct cgggtctata gatttataaa catgtattat aattaatatt 2341 attagtggta atatattgca tttgtaaata gtgtgcttat ctcattaggc ctgccattag 2401 ggcggggtcg tgatgaaccc tcaggactat accacgtccc attttagctc gctatagcaa 2461 atttgagtat caacataagt gactggagtt taaaccctct tagctttatt cttgagcgaa 2521 gctataatga cgatggttgg tctagggctg atggatcatc atgat // LOCUS WTVPNS7A 1700 bp ds-RNA VRL 17-JUL-1990 DEFINITION Wound Tumor Virus genomic segment S6 encoding Pns7 nonstructural polypeptide, complete cds. ACCESSION M24116 J04352 KEYWORDS polypeptide Pns7. SOURCE Wound Tumor Virus, cDNA to viral RNA. ORGANISM Wound tumor virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Phytoreovirus. REFERENCE 1 (bases 1 to 1700) AUTHORS Anzola,J.V., Dall,D.J., Xu,Z. and Nuss,D.L. TITLE Complete nucleotide sequence of wound tumor virus genomic segments encoding nonstructural polypeptides JOURNAL Virology 171, 222-228 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.L.Nuss, 20-APR-1989. FEATURES from to/span description pept 45 1607 polypeptide Pns7. BASE COUNT 551 a 298 c 338 g 513 t ORIGIN 1 ggtattttct cctgaaatac ttgacctggc tagggttcgg tgaaatgaag atttccgact 61 tctgtttcgc ctctgcaaat gatggatcat acactttaaa agcattttcg gaacttaatg 121 aatacaagga tgtagtgaaa cttgtctctg atgaaaagat tggagttgga tttcactgct 181 acaatttagg tctgatgaat atagttgaag atttttccgg aaatcttgac aacgaatctt 241 acttgaccag taaggtaggc aaaagaatgg cttccgagct agtcactgca tactcaaaat 301 ttgggtctac tagcagccgg accttacata gctcattaaa tctaccagtc gtaaatataa 361 cttcattacc aacatcccaa gctaaagatc tcaaaccgaa tcattcattg gatgataagg 421 gatctatgct acggactcaa atacattcaa ttctgactgg aaacggtccg ctaactataa 481 aacgaaggat agatgcgttt tattatagcg cctcaagtat attcactaga cacatgactt 541 cgaaatatgc taatcctgga tcaaatgtgc ctcaaaggtt ttcatttata ccagattgtg 601 caatgaataa gaaaccaacg ctatttctag agaatagaga taacgaactc caagattcca 661 tgaccatcat gttaatgttg ggacaggtct tttcggatgc tttaacatat tatctaaatg 721 cctcaatact atacggtatc ctaggtagaa ttgaatccaa agtgcaggtt gatttaccag 781 ccatcactct tgaatccgtt catgttacaa ataatttgga aatctctccg gccgcatttg 841 ctctaatagc gtcagtctgg cttgataaag ctgaaatatt atcaaaatta aatgcgatag 901 acttcatagt gtctcctgaa gataatgaag atagaatatc aaatttactg aaactgatgc 961 tgccagttca gtctaataat ataacggttg aaaaatctga cactaggttc tcggtaaccc 1021 attcagatgg atttatgagg tattatatgt gtttttctaa acacgaattt gattatggag 1081 accaccttga gtcttttggc atcccagtgc tgcgtgtaag actaggtaaa ccaatatcaa 1141 atgaactgaa caagccgatg ctggtaatgt tcaagaagca tgaatcaata tctagcataa 1201 atgttagata tcaagtaaga ggtggaagta tacctaaatt caggacctca gaatttagaa 1261 gagatatagg aatgttagtt gccaattctc ggtttatggc gactgatatt actttgatat 1321 tgtctacatt ttatccattc actcaggaaa cggataaatt atttattgaa caacacataa 1381 aagaaatttt tcttgatatg tatccatgga ttgataaact aacttcagcg gacgctaaat 1441 ctgaggttaa tatatcatat gggaatttag tgctatactc atatggagag ttagtcaaaa 1501 atagtatatt tattgcaatg atggacaatt gcaaggacgc ccggaatagt ttttctcggg 1561 ctgatatgag agagatccaa gcatttgtag cagcgtttac acagtgagta agattttttc 1621 atgaacctgg agcttgcgag gccaccagca gctaaaaccc gtaggtggag gttgtataca 1681 tggtgagtgg ggagcatgat // LOCUS ATUVIRD 3662 bp ds-DNA BCT 17-JUL-1990 DEFINITION A.tumefaciens virD1/D2 protein genes, complete cds. ACCESSION M33673 KEYWORDS virulence protein. SOURCE A.tumefaciens plasmid pTiC58 DNA, from Nicotiana tabacum. ORGANISM Agrobacterium tumefaciens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 3662) AUTHORS Wang,K., Herrera-Estrella,A.H. and Van Montagu,M. TITLE Overexpression of virD1/D2 genes in A.tumefaciens enhances T-complex formation and plant transformation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by A.Herrera-Estrella, 11-APR-1990. Author address [1]: A.Herrera-Estrella Rijksuniversiteit Gent Department of Genetics Ledeganckstraat 35, Ghent, East Flanders Belgium B-9000 e-mail [1]: jecop@gengenp.rug.ac.be FEATURES from to/span description pept 648 1091 unknown protein 1 pept 1125 2468 unknown protein 2 pept 2483 3079 unknown protein 3 ORF 3118 > 3662 unknown protein 4 BASE COUNT 870 a 975 c 1042 g 774 t 1 others ORIGIN 1 gtcgacaatg cttcatcgat atcgagcgga gttaacatgg tcgggatcag aagcaggttt 61 gagctggcaa tgatcgtgtt gttgagttcg ctcgaaccac catgcgtatc ggccagcgca 121 taatcaaatc cttggagttc ggcgtcctca taggccgcct caaggagtgc catttcctcg 181 gcggcgtaga cttcgcagaa ggagccctag gtattgctgc gaagggcgtt ttctttccat 241 cgcgttagtg gtcggttttc atcagcgtcg aagagagcca atcgtttgcc gtcacttgca 301 aaggcggcgc aaagacccat gagtgccgtg gttttgccgg ctcctccctt gaaggagcaa 361 aatgtcagaa gtttcatgtc cttatcctgt cgattttgtg aagcggaagt gtgtctgtac 421 ttttatttgt gtgtatgatt ttgcgataat tcataagtaa tgtagtaatt acctgatttt 481 atatttcaat tttattgtaa tataatttca attgtaataa tataaaaata aatatccctt 541 atgtgttctt gatttcgttt tgtatatggc tagattccca tctgccacga cgaggaaatg 601 ctacggcggg gcaagttcag atctttccgt cttctatgga ggaagctatg tcgcaaggca 661 gtaggcccac ctcaagtgac attgccgtca accagcgcga atgcgtgaag gttgaaggct 721 tcaaggtcgt cagtacccga ttaagatcgg ccgaatatga gagtttttct catcaggcac 781 gcttgctggg cctctccgac agcatggcca tacgggttgc ggtgcgccgc attggtggct 841 ttcttgaaat cgacgcagag actcgtcata ggatggaggc catactacaa tccataggaa 901 cactctcaag caacattgcc gcgctgctat ctgcctatgc cgaaaatccg acaatggatt 961 tggaggcttt gcgagctgaa cgtatcgcct tcggtaaatc tttcgctgac ctcgacggct 1021 tgctccgttc cattttgtcc gtatcacggc ggcggatcga cggttgctcg ctgctgaaag 1081 acgccttgta gcactgacgt agcacttggc ggggaacata ttcgatgccc gatcgagctc 1141 aagttatcat tcgcattgtg ccgggaggtg gcaccaagac ccttcaacaa attatcaatc 1201 agttggagta tctatcccgg aagggcaggc tggagctgca gcgttcagcc cgacatctcg 1261 atattcccct gccaccggat caaatccacg aacttgcccg aagctgggtt caagagactg 1321 gaacttatga cgaaagtcag ccagacgagg aaaggcaaca ggagttgacc acccatatta 1381 ttgtaagctt ccccgccggt acaagccagg tagcggctta tgcggcgagc cgggagtggg 1441 cagccgagat gtttgggtca ggcgcagggg ggggccgata caactatctt acggccttcc 1501 acatcgatcg cgaccaccca catctgcatg tcgtcgtcaa tcggcgcgaa cttttaggac 1561 acggctggct gaagatatct cggcgccatc cccaactgaa ttacgacgcc ctgcgcataa 1621 agatggccga gatttcactt cgtcatggca ttgccctcga tgcgagccga cgagcagaac 1681 gtggcatcac cgagcggccg atcacttatg cccaatatcg gcgccttgag cgggagcagg 1741 ctcgccaaat ccgtttcgaa gacgcggatt tggaacagtc gtcgccgcaa ggagatcatc 1801 cagaattcag ccaacctttc gatacatccc catttgaagc atccgcgggc ggaccggagg 1861 acatgcctcg gcccaacaat cggcagaatg agtcgcaagt tcatctccag gagccagctg 1921 gtgtcagcaa cgaagccggt gtccttgtgc gggttgcatt ggagacggag cgccttgctc 1981 aaccattcgt ttccgaaacc attctcgcgg acgacatagg gagcggctct tcgcgtgttg 2041 ccgagggccg tgtggagagc gcaaaccgca ctcccgatat tcctcgcgca gcaactgaag 2101 ctgccacgca cacgacacac gaccggcagc ggcgtgcaaa gcgtcctcat gatgacgacg 2161 gagggccgag tggagcaaaa cgtgtgacat tggaaggcat cgcggttggg ccccagcgaa 2221 cggccggcga acaggatggc agtagtggcc ccttagaacg gcaagctgga acgtctcggc 2281 catctccacc gacggccacg acgcgggcca gcaccgcaac cgattcattg tctgctacag 2341 cccacctcca gcaacggaga ggtgtccttt caaagcgtcc gcgtgaagat gatgatggag 2401 aaccgagtga acgcaaacgc gagagagatg agcgcagcaa ggacgggcgt gggggaaata 2461 ggagatagag aaggaccgag tgatggcaaa tgaagagttc accagacact atgcgtggcc 2521 cgttcctgtg gcttcgaatg atgaagggcg tgggaccgcg cggatcccca tccaggcaca 2581 atcaatcgtt gctggagaag acggtcggga cacttcggtc ccaacggctt tgtcgcgacc 2641 gccaattgaa gatatgccgc acggcgtcca agaaacatcg gcgagtggcg gacgactggg 2701 agcggcccgt ctgcgggatt ccgtaatccc gccaggaata tccgaagccc gcacggacct 2761 atccgcaatt ttgcggaaaa aaagcggttc tttccgcacc ggtatgcagt atctgcgtgg 2821 gcttgaacgg gaaaattttg ataaacaaga cagggaagcc agtgcgttgc cagatttaag 2881 tgcaaggggc ataaagcgac cgcgcgaaat tgagtatccc ggcaatgcaa gcggattaac 2941 cataaagagg caagacggct taggcataga gatcaatact atctcggcat cttcgcccgt 3001 gaaccggggc cgcgcattcg tcgaactggc aaggcgcgcc ggaaccgggc gtgtacaatg 3061 ttcagccatc ggcagataga gcacagaact ctgcgcagga aagttcgacg tttcccgatg 3121 gcacttctgt ttccgctctc tattcaggcc cactcgcaga atggttcgag agggatactg 3181 gcagcgagac gaccaggaat tcaggcaaca ccatttcgtc accgctccga gggctggagg 3241 agttcggcga ttccgcggac agccggtatc ttgggcgcga agctcagagt ctttcagtta 3301 ccgtaacaac gcctaattcg aatgctgagg caagttctca tagcgcacac actgaaactc 3361 tcgacgatgt cagcagtgac cgctcgagcg aacaggggag gggtcccctt ggcgccgcaa 3421 tccttggatc ccatcatgac cttttcnccg cgtgcgcaga agttatcaca aacaaaccgc 3481 gattcccctg aactgaccga cgccgaccta gcaaaagtcg atgcggtgtt cgaatctctc 3541 tccaaggggc ccccggcggg ggagagcgcc gcaccggact ttcgtgagcg cggaccaggc 3601 agcgctttcc agaaagaagg cgtgtccgac agggcgaacg gggtgcccac taactgggag 3661 gt // LOCUS CHKFIBAB 2730 bp ss-mRNA VRT 17-JUL-1990 DEFINITION Chicken alpha-fibrinogen mRNA, 3' end, and ORF, complete cds. ACCESSION M34096 KEYWORDS alpha-fibrinogen; fibrinogen. SOURCE Chicken (strain White Leghorn) liver parenchymal cell line, cDNA to mRNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2730) AUTHORS Weissbach,L. and Grieninger,G. TITLE Bipartite mRNA for chicken alpha-fibrinogen potentially encodes an amino acid sequence homologous to beta- and gamma-fibrinogens JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5198-5202 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Grieninger, 08-MAY-1990. FEATURES from to/span description pept < 1 1518 alpha-fibrinogen (AA at 1) ORF 1830 2582 ORF mRNA < 1 2730 alpha-fibrinogen mRNA BASE COUNT 825 a 574 c 678 g 653 t ORIGIN 1 acgatactct gtgtgttgct ctgcctcaat ttagcttggg ctcaggatgg gaagaccacc 61 tttgaaaagg agggcggagg cgggcgtggc cccaggatcc tagagaacat gcacgagtcg 121 tcctgcaagt atgagaagaa ctggcctatc tgtgtggatg atgactgggg tacaaaatgt 181 ccatcatgct gcagaatgca aggaataatt gacgacacag accagaatta cagtcaaaga 241 atagacaata ttaggcagca actcgcagac agtcaaaaca agtataaaac atccaatcgg 301 gtaattgtgg aaaccataaa tatactaaag cctggtctgg agggtgccca gcagctcgat 361 gagaattacg gtcatgtgtc aacagaactg aggaggagaa ttgtgacatt aaagcagcga 421 gttgccactc aagtgaacag aattaaagct ctgcagaaca gcatccagga gcaggtggtg 481 gaaatgaagc gcttggaggt ggacattgat attaagatac gagcttgcaa agggagctgt 541 gctagaagtt ttgattacca ggtggacaaa gaaggctatg acaacatcca gaagcatctt 601 acccaggcca gctccattga tatgcaccca gactttcaaa caaccacctt gagcacactg 661 aaaatgaggc ctcttaagga ctcgaatgtt cctgaacatt ttaagctcaa gccttcacca 721 gaaatgcaag ctatgagtgc atttaataac atcaaacaga tgcaagtggt attagaaaga 781 ccagaaacag accacgtggc agaagcaagg ggggattctt caccttccca cactggcaaa 841 ttaattactt cttctcacag gagagagagc cctagcctgg tagacaaaac ctcctctgcc 901 tcctctgtcc atagatgcac cagaactgtc accaagaaag tcattagtgg ccctgatggt 961 cctagagaag aaatagttga aaaaatggtt tcctctgatg gctcagactg ctcccatcta 1021 caaggaggac gagaaggaag cacgtaccat tttagtggga caggtgactt tcacaagcta 1081 gacaggcttc tgccagatct agagtcattc ttcacccatg actctgtatc cactagtagt 1141 aggcactcaa ttggctcaag caccagcagc catgtgactg gggcaggcag cagccaccta 1201 ggcactggag ggaaagacaa attcacagat ttaggggagg aggaagaaga tgactttgga 1261 ggacttcagc catctggatt tgctgctggc agtgcgagtc actccaagac tgtactgacc 1321 agctcctctt ctagtttcaa caagggaggc tccacttttg aaaccaagtc actaaagacc 1381 cgtgaaacat ctgagcagct aggtggggtg caacatgatc agagtgcaga ggacacccca 1441 gactttaagg cgcgcagctt cagaccagca gcaatgagta caaggagatc ctacaatggg 1501 aaaggtacac agaagtagta attgaggtag tggagccaaa ttcaattcat aaccaaactg 1561 acagaatgtt tttagatact gcagtataat agtgtgatga taaaatgcca gaactgtgtc 1621 tgtttgttgc cactttggag ataaaagaga actatatcac taatagtcag gtattgcata 1681 gacaattagc aatataagat ctgtgtaagg ttttcctctc tcaattcttt ttaatgttgc 1741 ctgtcactaa accttatgtt gtccaagaca catttaacaa attttcctca gatctattgc 1801 tgcttggatt cctctgaagt ttttattaag ctttactgat aacttctgtc aaaccagatc 1861 aacttttttt tttagactgc gatgatatcc gccagaaaca cacttctggt gccaaaagtg 1921 gcattttcaa aatcaagcca gagggatcca ataaggtttt gtcagtttat tgcgaccaag 1981 agaccacttt gggaggatgg ctattgatcc aacagagaat ggatggatca gtgaatttta 2041 accgtacgtg gcaagactac aggagaggtt tcggcagcgt ggatggcaaa gggcaaggag 2101 agttgtggct gggcaatgag aatatccact tgctgactca gaacgatact ctgcttcgag 2161 tagagttaga ggactgggat ggaaatgctg catatgcaga gtacatcgtg caggtaggga 2221 ctgaagcaga aggctatgcc ctgactgtgt cctcctatga ggggactgct ggggatgctc 2281 tggttgctgg ctggctggag gagggctctg agtacacatc ccatgcccag atgcagttca 2341 gcacttttga ccgggaccag gaccactggg aggagagctg tgcagaggtg tacgggggtg 2401 gctggtggta caacagctgc caggcagcca acctcaatgg catttactat ccagggggcc 2461 attatgaccc caggtacaat gttccatatg agatcgagaa tggagtagtc tggataccat 2521 ttagagcctc cgattattcc ctcaaagttg ttagaatgaa aatcagaccc ctggaaaccc 2581 tgtagaagga cgggcattta atatatgtgt cacaactaaa agttggaaac atcttttata 2641 tacctgtgtg cacgatgtac ctttacccta tgaatttgaa ggcagcatgg cacatctatg 2701 gcatacaaaa taaacggatt aattattaac // LOCUS ECOHYA 6023 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli hya operon encoding hydrogenase isozyme 1 large and small subunit (hyaB and A) genes and hyaC-F protein genes, complete cds. ACCESSION M34825 KEYWORDS hyaA gene; hyaB gen; hyaC gene; hyaD gene; hyaE gene; hyaF gene; hydrogenase. SOURCE E.coli (strain LE392) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 6023) AUTHORS Menon,N.K., Robbins,J., Peck,H.D.Jr., Chatelus,C.Y., Choi,E.-S. and Przybyla,A.E. TITLE Cloning and sequencing of a putative Escherichia coli [NiFe] hydrogenase-1 operon containing six open reading frames JOURNAL J. Bacteriol. 172, 1969-1977 (1990) STANDARD simple staff_review FEATURES from to/span description pept 188 1306 hydrogenase-1 small subunit (hyaA) precursor sigp 188 322 hydrogenase-1 small subunit (hyaA) signal peptide matp 323 1303 hydrogenase-1 small subunit (hyaA) pept 1303 3096 hydrogenase-1 large subunit (hyaB) pept 3115 3822 hyaC protein pept 3819 4406 hyaD protein pept 4403 4801 hyaE protein pept 4798 5655 hyaF protein signal 5657 5683 pot. transcription termination signal BASE COUNT 1334 a 1607 c 1720 g 1362 t ORIGIN 1 ttttgtgcaa aagtttcact acgctttatt aacaatactt tctggcgacg tgcgccagtg 61 cagaaggatg agctttcgtt ttcagcatct cacgtgaagc gatggtttgc cttgctacag 121 ggacgtcgct tgccgaccat aagcgcccgg tgtcctgccg gtgtcgcaag gaggagagac 181 gtgcgatatg aataacgagg aaacatttta ccaggccatg cggcgtcagg gcgttacccg 241 gcgcagcttt ctcaaatatt gtagtctggc tgccacgtcg ctgggattag gcgcgggaat 301 ggcaccaaag attgcctggg cgctggagaa caaaccgcgc attccggtgg tatggatcca 361 cggtctggaa tgcacctgct gtaccgaatc ttttatccgc tccgctcacc cactggcgaa 421 ggacgtcatc ctttccctga tttccctcga ttacgacgat actttgatgg ctgccgccgg 481 aacccaggcg gaagaagtct ttgaagacat catcacgcaa tacaatggca aatatatcct 541 cgcagtagaa ggtaatccgc cgctgggcga gcaggggatg ttctgtatca gcagcggtcg 601 accgtttatt gagaaactca aacgtgccgc tgccggagcc agcgcgatta tcgcctgggg 661 aacctgcgcg tcctggggct gcgtgcaggc cgcgcgaccc aatccgacgc aggcaacgcc 721 tatcgacaaa gtcatcaccg acaaacccat tatcaaagta cctggctgcc cgccgatccc 781 ggatgtgatg agcgccatca ttacttacat ggtgaccttt gatcgcttgc cagatgtcga 841 cagaatgggc cgtccgctga tgttctatgg tcagcgaatc cacgataaat gctatcgccg 901 cgcccacttc gacgccggag agttcgtcca gagttgggat gatgacgctg cccgcaaagg 961 ttactgcctg tacaaaatgg gctgcaaagg gcctaccacc tataacgcct gttcctccac 1021 acgctggaat gatggcgttt ctttcccaat ccagtctggt cacggctgcc tgggctgtgc 1081 ggaaaatggt ttctgggatc gcggttcgtt ctacagccgc gtggtcgata ttccgcaaat 1141 gggtactcat tccaccgccg ataccgtcgg tttaaccgcg cttggcgtgg tggcagcggc 1201 tgttggtgtg cacgcagtcg ccagcgccgt tgaccagcgc agacgtcata accagcaacc 1261 tacagaaacc gaacatcagc caggcaatga ggataaacag gcatgagcac tcagtacgaa 1321 actcagggat acaccatcaa taatgccgga cgccgcctgg tggtcgaccc gattacgcgc 1381 atcgaaggcc acatgcgctg cgaagtgaat attaacgatc agaatgtgat caccaatgcc 1441 gtctcctgcg gcaccatctt tcgcgggctg gagatcatcc tacaagggcg cgacccgcgc 1501 gatgcgcggg cgttcgttga acgtatctgc ggcgtctgta ctggcgtaca cgccctggct 1561 tcggtttacg ccatcgaaga tgctatcggt attaaagtgc cggacaacgc caatatcatc 1621 cgcaacatta tgctggcaac gctctggtgc cacgatcatc tggtgcactt ctatcagctt 1681 gccgggatgg actggatcga tgtgttagat gcgctgaaag ccgacccgcg gaaaacctcc 1741 gaactggcgc aaagtctctc ctcttggccg aaatcatccc ctggctattt cttcgacgta 1801 caaaaccgcc tgaaaaaatt tgttgaaggc gggcagttgg ggatcttccg caatggctac 1861 tgggggcacc cgcagtacaa actgccgcca gaagctaacc tgatgggctt tgcccactat 1921 ctcgaagctc tcgatttcca gcgtgaaatt gtcaaaatcc acgcggtctt tggcggtaaa 1981 aacccgcatc caaactggat tgtcggcggg atgccttgcg ccatcaacat tgacgaaagc 2041 ggcgcggtcg gggcagtcaa tatggaacgc ctgaacctgg tgcagtcaat tatcacccgc 2101 acggcggact tcattaacaa cgtgatgatc cccgacgcct tagccatcgg tcagttcaac 2161 aaaccgtgga gcgaaatcgg cactggtctt tctgataaat gcgttctcag ctacggcgca 2221 ttcccggata ttgccaacga ctttggcgag aaaagtctgc tgatgcctgg cggcgcggtg 2281 attaacggcg acttcaacaa tgtgctgcca gtggatttgg ttgatccgca gcaggtgcag 2341 gagtttgtcg accacgcctg gtatcgatat cccaacgatc aggtcgggcg tcatccgttc 2401 gatggcatca ccgacccgtg gtacaacccc ggcgatgtca aaggcagcga taccaacatt 2461 cagcagctga atgaacagga acgctactcg tggatcaaag cgccacgctg gcgcggtaac 2521 gcgatggaag tggggccgct ggcgcgcacg ttaatcgctt atcacaaagg cgatgctgcg 2581 accgttgagt cggtcgatcg catgatgtcg gcgttgaacc tgccgctttc cggtatccag 2641 tcaacgttag gccgcatttt gtgccgcgcg cacgaagcgc agtgggccgc aggtaagttg 2701 cagtatttct tcgacaagct gatgaccaac ctgaaaaacg gcaatctcgc cactgcttcc 2761 acggaaaaat gggaacctgc aacctggccg acagagtgcc gtggtgtcgg ttttaccgaa 2821 gcgccgcgcg gggcgttagg ccactgggcc gccattcgcg atggcaagat tgatctctac 2881 cagtgcgtgg tgccgaccac ctggaacgcc agcccgcgcg atcccaaagg gcagattggc 2941 gcttatgaag cggcgctgat gaacaccaaa atggcgatcc ccgagcaacc gctggagatc 3001 ctgcgtactc tgcacagctt tgacccgtgc ctcgcctgtt caacacacgt gctgggcgac 3061 gacggtagcg agctgatctc cgtgcaggtg cgttaacagc gaaggagaat catcatgcaa 3121 cagaaaagcg acaacgttgt cagccactat gtctttgaag cgccagtgcg catctggcac 3181 tggttgacgg tgttatgcat ggcggtgttg atggtcaccg gatactttat cggcaagccg 3241 ctaccttccg tcagcggcga ggcgacgtat ctgttctata tgggctacat caggttaatt 3301 cacttcagcg ccgggatggt ttttaccgtg gttttgctga tgcggatcta ctgggctttt 3361 gttggcaatc gatactcccg cgagctgttt atcgtgccgg tatggcgtaa aagctggtgg 3421 cagggcgtgt ggtatgaaat ccgctggtat ctgtttctgg caaaacgtcc gagtgccgat 3481 ataggccata atcccatcgc ccaggcggcg atgttcggct atttcctgat gtcggtcttt 3541 atgatcatca ctggttttgc gctgtacagc gaacacagcc agtacgctat ttttgcgccg 3601 ttccgttatg tggtggaatt tttctactgg acgggtggca actcaatgga cattcacagc 3661 tggcatcggc tggggatgtg gctgattggc gcgtttgtga tcggtcatgt ctacatggcg 3721 ctgcgtgaag acatcatgtc cgacgacacg gtgatctcca ccatggtcaa cggctaccgt 3781 agccacaaat ttggcaaaat aagtaacaag gagcgttcat gagcgagcaa cgcgtggtgg 3841 tcatggggct gggcaacctg ctgtgggccg atgaaggctt cggcgtgcgg gtggcggaac 3901 ggctgtatgc ccattaccac tggcccgagt atgtggagat tgtcgatggc ggtactcagg 3961 gactgaactt gctggggtat gtcgaaagcg ccagccatct gttgattctc gatgccattg 4021 actacgggct ggaacctgga acgctgcgaa cctatgccgg agaacgcatt ccggcttatc 4081 tcagcgcgaa gaaaatgagc ctgcatcaga acagtttctc cgaagtgttg gcgctggcgg 4141 atatccgcgg acatctgcca gcacatattg ccctcgtcgg tctgcaaccc gcaatgctcg 4201 acgactacgg cggtagcctg agcgaactgg cacgggagca actgcccgct gcggaacagg 4261 cggcgctggc gcagcttgct gcgtggggaa ttgtgccgca accggctaat gaatcgcgct 4321 gtctcaatta tgactgtctg tcgatggaaa attacgaagg cgttcgcttg cgccagtacc 4381 ggatgacaca ggaggagcag ggatgagcaa cgacacgcca tttgatgcgt tgtggcaacg 4441 aatgctggcg cgcggctgga cgccagtcag tgaatcccgt cttgacgact ggcttacgca 4501 agcgccagac ggcgtggtgt tattaagcag tgacccgaaa cgcacgccag aggtcagcga 4561 taatccggta atgattggcg aattactgcg cgagtttccc gactatacat ggcaggtggc 4621 gattgctgac cttgagcaga gcgaagccat cggcgatcgc tttggcgtct ttcgctttcc 4681 tgccacttta gtgtttaccg gcggaaacta tcgcggcgtg ctgaatggta ttcacccgtg 4741 ggcggaactg ataaacctga tgcgcgggct tgtcgaaccg cagcaggagc gtgcctcatg 4801 agcgaaactt ttttccatct gctggggcca ggaacgcaac cgaacgatga cagtttcagc 4861 atgaatccac tgccgatcac ctgtcaggtg aatgatgaac cgagtatggc ggccctggag 4921 caatgtgctc acagcccgca ggtgattgcg ctgttaaacg agttacaaca tcaactaagc 4981 gaacgccaac cgccgttggg cgaggtgctg gcagtcgatc tgttaaatct caacgccgac 5041 gatcgtcact ttatcaatac gcttctcggg gaaggggaag tgtcagtgcg cattcagcag 5101 gctgacgaca gtgaaagtga aatacaggag gcgatcttct gcggattatg gcgggtgcgc 5161 agacgtcgcg gcgaaaagtt gctggaggac aaactggagg ctggctgcgc gccgctggcg 5221 ttgtggcagg cggcaacgca aaatctcttg ccgacagatt cgctgttacc gccgcccatt 5281 gatggcctga tgaatggcct accgttggcg catgagttac tggcacatgt acgtaacccc 5341 gacgcgcagc cgcacagcat taatctgacg caattaccca tcagcgaggc tgatcggctt 5401 tttctctcac gtctctgtgg gccgggaaat attcagattc gtaccattgg ctatggcgag 5461 agctatatca acgccacggg gttacgccat gtctggcatt tacgctgtac ggacacctta 5521 aaaggcccgt tactggaaag ttatgaaatc tgcccaatac cggaagtggt gctggcagcg 5581 ccagaagatt tggtcgactc tgcgcagcgg cttagcgagg tatgtcagtg gctggcggaa 5641 gctgcaccga cgtaaaaaga cggtaagtat cgctttcagt cttatgaata tcgcaatcgg 5701 cgaatacctc tggtcgtaga gtttcaggat aaagagggag atctaccatt atcgggttat 5761 ttttctctct tcgcctacag gagtgcgcat gtgggatgtc attgatttat cgcgctggca 5821 gtttgctctg accgcgctgt atcacttttt atttgtaccc cttaccctgg gctgattttt 5881 ttgctggcta ttatggaaac catttacgtg gtcaccggca aaaacaatct accggcaaaa 5941 caatctaccg cgatatgacg cgcttctggg gtaagctctt cggtatcaat tttgctcttg 6001 gcgtggctac ccggcctaac cat // LOCUS HUMFDXC 1174 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human ferredoxin mRNA, complete cds. ACCESSION M34788 KEYWORDS ferredoxin. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept 19 573 ferredoxin BASE COUNT 337 a 211 c 267 g 359 t ORIGIN Chromosome 11q13-qter. 1 agttcccgac cgcgggcgat ggctgccgct gggggcgccc ggctgctgcg cgccgcttct 61 gctgtcctcg gcggcccggc cggccggtgg ctgcaccacg ctgggtcccg cgctggatcc 121 agcggcctgc tgaggaaccg ggggccgggc ggtagcgcgg aggcgagccg gtcgctgagc 181 gtgtcggcgc gggcccggag cagctcagaa gataaaataa cagtccactt tataaaccgt 241 gatggtgaaa cattaacaac caaaggaaaa gttggtgatt ctctgctaga tgttgtggtt 301 gaaaataatc tagatattga tggctttggt gcatgtgagg gaaccctggc ttgttcaacc 361 tgtcacctca tctttgaaga tcacatatat gagaagttag atgcaatcac tgatgaggag 421 aatgacatgc tcgatctggc atatggacta acagacagat cacggttggg ctgccaaatc 481 tgtttgacaa aatctatgga caatatgact gttcgagtgc ctgaaacagt ggctgatgcc 541 agacaatcca ttgatgtggg caagacctcc tgaactagaa caaataggaa tattttcatg 601 gaattttacc tatttttata attattattt cttaaagtga ttaaatgaga acatggatga 661 gtggacttca tattatgact agctttacta ttttaattca ccttgcataa ctactgaatt 721 ttgtcattct tgaaagtatg caatttttat tttggttata ttacaaaaat gtcaatcaaa 781 tattaaaaaa tagttaatgt gatagaaaaa ccttacatat ttttttctta tgtttgttta 841 gcgactttag caaaatgttt tcatataatc tcatctgttt acctagaaga taggttaagg 901 aaatatatta ttattcctgt ttgatgtggg tgaaggcaga gatctaacct ggcttgttta 961 gggccatacc actaattaga aaatctgtgc tagaacctgt gtcttattcc tataagctat 1021 gtgttcagac tgaaactgga gaaattatga ctattttatt tatagtagta gttaaatctg 1081 aatgtgtatg gacaaaaata tttaattgct gagtaaactg cttaacttca aagatagtta 1141 ttgaccttat aaataaatat ttcaaaattt tgat // LOCUS HUMFDXPSA 676 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin pseudogene, complete cds. ACCESSION M34787 KEYWORDS ferredoxin; pseudogene. SOURCE Human DNA, clone h2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 676) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps 19 569 pseudo-ferredoxin BASE COUNT 171 a 149 c 192 g 164 t ORIGIN Chromosome 20. 1 agttcccgac cgcgggcgat ggctgccgct gggggcgccc ggctgctgcg cgccgcttct 61 gctgtcctcg gcgacccggc cggccggtgg ctgcaccacg ccgggtcccg cgctggagcc 121 agcggcctgc tgaggagccg gggaccgggc cggagcgcgg aggcaagccg gccgctgagc 181 gtgtcggcgg gggcgcggag cagctcagaa gataaagcga cagtccactt tataaactgt 241 gatggtgaaa cattaacaac caaaggaaaa gttggtgatt ctctgctaga cgttgtggtt 301 gaaaataatc cagatattga tggctttggt gcatgtgagg gaactctaac ttgtttaacc 361 tgtcatctca tctttgaaga tcacatatat gagaagttag atgcaatcac tgatgaggag 421 aatcacatgc tcgatctggc atatggacta acagatcaca gttgggctgc caaatctgtt 481 tgacaaaatc tatggacaat atgactgttc gagtgcctga aacagtggct gatgccagac 541 aatccattga tgtgggcaag acctcctgaa ctagaacaaa taggaatatt ttcatggaat 601 tttacctatt tttataatta ttatttaagt gataaaatga gtacatggat gagtggattc 661 tgtattatga ctagct // LOCUS HUMFDXPSB 976 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin pseudogene, complete cds. ACCESSION M34789 KEYWORDS ferredoxin; pseudogene. SOURCE Human DNA, clone h3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 976) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps < 1 386 pseudo-ferredoxin BASE COUNT 340 a 156 c 168 g 312 t ORIGIN Chromosome 21. 1 ttcagtgaag ggcccagagc agctcggaag ataaaataac agtccacttt ataaaccaac 61 agtgaaacat taacaaacaa aggaaaagtt gatgactctt tgctagatgt tgtggttgaa 121 aataatgtag atattgatgg gtttgttgca tggcagggaa ccttggctct gcttgtcacc 181 tcatttttga aaagcacata tttgagaagt tagacacaat cactgacaag gagattgaca 241 tgcttgatct ggcatatgga ctgacagaca ggtcagagtt gtattgccaa atctgtttga 301 caaaatctat gggccatatg actgtctgag tacctgatgg agtggccagt gccagacaat 361 ccactgatat gggcaagacc acctaagcag aataaatagc aatgctttca ctaaatttta 421 cctattttat aattatttct tatataatta aataaaaaca tacatgaatg aacttattat 481 tatgaccaac tttactgttt taattcacct tgtataacta ctgaatttta tagttctgaa 541 aatacgcaat ttttatattt gttacattag aaaaaggtca gttaaatatt agaaaataat 601 taatatgaca ataccttaca aattttgcct tacgtttgtt tagcaacttt tagcaaaatg 661 ttttcacatg gccttatgtc tgcttaccta gaaagtaaat tttttaaaaa agatattatt 721 atccctgttt tatgtgagtg aaggcaaaga cctaaaatgg cttgttaagg gccatccaac 781 taattagaaa atatgtacac acctgcatct tgttactata aggtatatgt tagactgaaa 841 ctggagaaat tatgactatc ttacagcagc aattagatct gactgcatat ggacaaaaac 901 attaattgct cagtgaactg cttaactgac agttattaac cttacacata aagatttcaa 961 aattttgatt cagtga // LOCUS HUMFRDXA1 350 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin gene, exon 3. ACCESSION M34783 KEYWORDS ferredoxin. SEGMENT 1 of 2 SOURCE Human DNA, clone h4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 350) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 45 + 174 ferredoxin, exon 3 IVS < 1 44 ferredoxin intron B IVS 175 > 350 ferredoxin intron C BASE COUNT 99 a 56 c 78 g 117 t ORIGIN Chromosome 11q13-qter. 1 ctgatgtaga agggactatg ttcagtgttt gttgcttttg tcaggtgcat gtgagggaac 61 cctggcttgt tcaacctgtc acctcatctt tgaagatcac atatatgaga agttagatgc 121 aatcactgat gaggagaatg acatgctcga tctggcatat ggactaacag acaggtaaga 181 tttttggact gcttcaattg taataataat ctgggaacat agatgttgta ttatgttgtc 241 tatgtaagac cagacccata aatgtaatag tgttctacca ggttaaataa cagtcacaga 301 ttttgagtat cagatatgtg gcatttctct ttgttttcac tgttctacag // LOCUS HUMFRDXA2 1260 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin gene, exon 4. ACCESSION M34785 KEYWORDS ferredoxin. SEGMENT 2 of 2 SOURCE Human DNA, clone h4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1260) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 33 147 ferredoxin, exon 4 IVS < 1 32 ferredoxin intron C BASE COUNT 410 a 196 c 241 g 413 t ORIGIN About 5.3 kb after segment 1; chromosome 11q13-qter. 1 actaaaccat accttccccc ttttccatac agatcacggt tgggctgcca aatctgtttg 61 acaaaatcta tggacaatat gactgttcga gtgcctgaaa cagtggctga tgccagacaa 121 tccattgatg tgggcaagac ctcctgaact agaacaaata ggaatatttt catggaattt 181 tacctatttt tataattatt atttcttaaa gtgattaaat gagaacatgg atgagtggac 241 ttcatattat gactagcttt actattttaa ttcaccttgc ataactactg aattttgtca 301 ttcttgaaag tatgcaattt ttattttggt tatattacaa aaatgtcaat caaatattaa 361 aaaatagtta atgtgataga aaaaccttac atattttttt cttatgtttg tttagcgact 421 ttagcaaaat gttttcatat aatctcatct gtttacctag aagataggtt aaggaaatat 481 attattattc ctgtttgatg tgggtgaagg cagagatcta acctggcttg tttagggcca 541 taccactaat tagaaaatct gtactagaac ctgtgtctta ttcctacaag ctatgtgttc 601 agactgaaac tggagaaatt atgactattt tatttatagt agtagttaaa tctgaatgtg 661 tatggacaaa aatatttaat tgctcagtaa actgcttaac ttcaaagata gttattaacc 721 ttataaataa atatttcaaa attttgattc ggaagactaa gtctggacgt agacattata 781 atgctatcaa agaagtttga tctctgtttt gactaaacta gaggaaaaat gattggatgt 841 gtttattctt ttctaagcag aatggtttaa ctttgtactc tttgaaaaat aatgctgatt 901 tataaatctc tgcctataac agaatggaaa ccttatgaat gaattgtgtt tctctgtcct 961 gagctggaga agggaatgag caggctgaca cgttgcacag ccccaggtgg cgccattctc 1021 tcacgcaagg atggggctgc agggtgagca gcgtgggctg cagtgtgtca gtcccaggag 1081 tgagggagtg gcaagcacca cagattacca cgtatgtgtg gaagacattc gtacccttat 1141 ctttactata aataaattca taaaagttaa caaaggggta cacagtatgg tctttggaaa 1201 tataataaaa catcaactaa cttggactaa ttgtgaggaa gagcagaaca aattagtaga // LOCUS HUMFRDXB1 350 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin gene, exon 3. ACCESSION M34784 KEYWORDS ferredoxin. SEGMENT 1 of 2 SOURCE Human DNA, clone h1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 350) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 45 + 174 ferredoxin, exon 3 IVS < 1 44 ferredoxin intron B IVS 175 > 350 ferredoxin intron C BASE COUNT 99 a 56 c 78 g 117 t ORIGIN Chromosome 11q13-qter. 1 ctgatgtaga agggactatg ttcagtgttt gttgcttttg tcaggtgcat gtgagggaac 61 cctggcttgt tcaacctgtc acctcatctt tgaagatcac atatatgaga agttagatgc 121 aatcactgat gaggagaatg acatgctcga tctggcatat ggactaacag acaggtaaga 181 tttttggact gcttcaattg taataataat ctgggaacat agatgttgta ttatgttgtc 241 tatgtaagac cagacccata aatgtaatag tgttctacca ggttaaataa cagtcacaga 301 ttttgagtat cagatatgtg gcatttctct ttgttttcac tgttctacag // LOCUS HUMFRDXB2 1269 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human ferredoxin gene, exon 4. ACCESSION M34786 KEYWORDS ferredoxin. SEGMENT 2 of 2 SOURCE Human DNA, clone h1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1269) AUTHORS Chang,C.-Y., Wu,D.-A., Mohandas,T.K. and Chung,B.-C. TITLE Structure, sequence, chromosomal location, and evolution of the human ferredoxin gene family JOURNAL DNA Cell Biol. 9, 205-212 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 32 146 ferredoxin, exon 4 IVS < 1 31 ferredoxin intron C BASE COUNT 407 a 197 c 246 g 419 t ORIGIN About 5.3 kb after segment 1; chromosome 11q13-qter. 1 actaaaccat accttcccct tttccataca gatcacggtt gggctgccaa atctgtttga 61 caaaatctat ggacaatatg actgttcgag tgcctgaaac agtggctgat gccagacaat 121 ccattgatgt gggcaagacc tcctgaacta gaacaaatag gaatattttc atggaatttt 181 acctattttt ataattatta tttcttaaag tgattaaatg agaacatgga tgagtggact 241 tcatattatg actagcttta ctattttaat tcaccttgca taactactga attttgtcat 301 tcttgaaagt atgcaatttt tattttggtt atattacaaa aatgtcaatc aaatattaaa 361 aaatagttaa tgtgatagaa aaaccttaca tatttttttc ttatgtttgt ttagcgactt 421 tagcaaaatg ttttcatata atctcatctg tttacctaga agataggtta aggaaatata 481 ttattattcc tgtttgatgt gggtgaaggc agagatctaa cctggcttgt ttagggccat 541 accactaatt agaaaatctg tgctagaacc tgtgtcttat tcctataagc tatgtgttca 601 gactgaaact ggagaaatta tgactatttt atttatagta gtagttaaat ctgaatgtgt 661 atggacaaaa atatttaatt gctgagtaaa ctgcttaact tcaaagatag ttattgacct 721 tataaataaa tatttcaaaa ttttgattcg gaagactaag tctggacgta gacattataa 781 tgctatcaaa gaagtttgat ctctgttttg actaaactag aggaaaaatg attggatgtg 841 tttattcttt tctaagcaga atggtttaac tttgtactct ttgaaaaata atgctgattt 901 ataaatctct gcctataaca gaatggaaac cttatgaatt gtttgtgttt ctctgtcctg 961 agctcggtac ccctggagaa gggaatgagc aggctgacac gttgcacagc cccaggtggc 1021 gccattctct cacgcaagga tggggctgca gggtgagcag cgtgggctgc agtgtgtcag 1081 tcccaggagt gagggagtgg caagcaccac agattaccac gtatgtgtgg aagacattcg 1141 tactcttatc tttactataa ataaattcat aaaagttaac aaaggggtac acagtatggt 1201 ctttggaaat ataataaaac atcaactaac ttggactaat tgtgaggaag agcagaacaa 1261 attagtaga // LOCUS RATC1H4SY 3146 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat cytoplasmic C-1-tetrahydrofolate synthase mRNA, complete cds. ACCESSION J05519 KEYWORDS C-1-tetrahydrofolate synthase. SOURCE Rat adult liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3146) AUTHORS Thigpen,A.E., West,M.G. and Appling,D.R. TITLE Rat C-1-tetrahydrofolate synthase: cDNA isolation, tissue-specific levels of the mRNA, and expression of the protein in yeast JOURNAL J. Biol. Chem. 265, 7907-7913 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Appling, 18-MAY-1990. FEATURES from to/span description pept 121 2928 C-1-tetrahydrofolate synthase mRNA < 1 3146 C-1-tetrahydrofolate synthase mRNA signal 3138 3143 poly-A signal BASE COUNT 845 a 759 c 845 g 697 t ORIGIN 1 cgcactgagt ttccccggga gatccctgag cacgtgggtt gtgctgtccg gtaagcgcac 61 ggagggactg gaacctgaat tttgctggtg ttcttcgtgg gtcgcagact agctaaggcc 121 atggcaccag cgggaatcct gaatggaaaa gtggtctccg cgcaaatcag gaatctactg 181 aagacccagt ttactcagat gcaggagcag gtacctggct tcacccctgg cctggcaatc 241 ctgcaggttg gcgacagaga tgattccaat ctttatataa atgtgaagct gaaggctgct 301 caagagatcg ggatcaaagc cactcacatt aaattaccaa gaacttccac agagtcggag 361 gtgttaaaat acgtcatctc cctgaatgag gatgccactg tgcacggatt catagtacag 421 ctgcctttag actccgagaa ttccattaac acagaggcag tcatcaatgc cattgcccct 481 gagaaggatg tggacgggtt gacgagcatc aatgccggga agcttgccag aggtgatcta 541 aaggactgct tcattccgtg cacacccaaa ggatgcttgg aactcatcaa agagacagga 601 gtgcagattg caggaaggca cgccgtggtg gtcgggcgaa gtaaaatagt tggtgcaccc 661 atgcacgacc tgcttctgtg gaacaatgcc acggtgacca cctgccactc caagactgcc 721 gatctggaca aggaggtaaa taaaggagac attctggtgg ttgcaacagg acagcctgag 781 atggtgaaag gggagtggat caagcctgga gctgtggtca tagactgtgg gatcaattat 841 gttccagatg atacaaaacc aaatggaagg aaagttgtgg gtgacgtagc atatgacgag 901 gccaaggaga aggcgagctt tatcacgcct gtccccggtg gcgtggggcc catgaccgtg 961 gcaatgctga tgcagagcac agtagagagc gcacagcgct tcctgaagaa atttaagcca 1021 gggaagtgga caattcagta taacaagctg aacctcaaga cgcctgtacc aagtgacatt 1081 gctatatcac gatcttgcaa acccaagctc atcggtaacc tggcccgaga aattgggcta 1141 ctcactgagg aggtggaatt gtatggagaa acaaaggcca aggtcttact gtcagcacta 1201 gatcgcctga agcatcagcc agatgggaaa tacgttgtgg tgactggaat tactccaaca 1261 cccctgggag aagggaagag cacaaccacc atcgggcttg tgcaagccct tggcgcccac 1321 ctgcatcaga atgtttttgc gtgtgtgcga cagccttctc agggccccac ctttgggata 1381 aaaggtggcg ctgcaggagg cggctattcc caggtcattc ctatggaaga gtttaatctc 1441 cacctcactg gtgacatcca cgccatcact gccgctaata accttgtggc tgctgctatt 1501 gacgctcgga tatttcatga gctgacccag acagacaagg ctctctttaa tcgtctggta 1561 ccctcagtaa atggagtaag aaagttctct gacatccaaa tccgcaggtt acggaggcta 1621 ggcattgaga agactgaccc cgctgcgctg acagatgatg agataaacag atttgcaaga 1681 ctagatattg atccagaaac cataacatgg cagagagtgc tggatactaa cgatagattc 1741 ctgaggaaga tcaccattgg acaggctcca accgagaaag gccacacacg cacggcccag 1801 tttgatatct ctgtggccag tgaaatcatg gcagtcctgg ccctcactag ttctctggaa 1861 gacatgagag caaggctggg caaaatggtg gtagcgtcca gtaagaaagg ggagcctatc 1921 agctgtgaag acctgggcgt gagcggggcg ctgacggtgc tcatgaagga tgcgatcaag 1981 cccaatctca tgcagacctt agagggcacg ccggtatttg tccatgctgg gccttttgcc 2041 aacattgcac atgggaattc ctccatcatt gcagaccgga ttgcactcaa gctggttggc 2101 cctgagggct ttgtagtgac agaagcagga ttcggagcag acataggaat ggaaaagttc 2161 ttcaacatca agtgccggta ttctggtctc cagcctcatg tggtggttct tgttgccact 2221 gtcagggctc ttaagatgca cgggggtggc cccacggtca ccgctggact gcctcttccc 2281 aaggcttaca cagaagagga cctggacctg gtggaaaagg gcttcagtaa cttgaggaaa 2341 cagatagaaa atgctagaat gtttggagtg cctgtcgtgg tggccatgaa tgcattcaag 2401 acagatacag atactgagct ggacctcatc ggccgcctct ccagagagca tggggctttc 2461 gatgctgtca agtgcaccca ctgggcagaa gggggccagg gagccttagc cctggctcag 2521 gctgtccaga gagcttcaca ggcccccagc agcttccagc tcctctatga cctcaagctc 2581 tcagttgaag ataaaatcag gattattgca cagaagatct acggggccga tgacatcgaa 2641 ttgctccctg aagcacaaaa caaagcagaa atctacacaa agcagggctt tgggaatcta 2701 cccatctgca tggccaaaac acacttgtct ttatctcaca acccagagca aaaaggggtg 2761 cctactggct tcgttctgcc catccgggac atccgggcca gcgttggggc tggtttcctg 2821 taccctttag taggaacgat gagcacaatg cctggactcc ctactcgacc ctgtttttat 2881 gatatcgatt tggaccctga aactgaacaa gtgaatggat tgttttaaac agatcttcca 2941 tctccaagag gctactctgt ctggccagtg tctgttcagg cccactgagg aagtgtgcag 3001 agaagtcttg aaagtctgtg cctgccctga agagcttcag aaatagtgga agtttctcta 3061 aagcctttca tagccttaat tcagatcatg tataaattaa cataaatcat gcctatttac 3121 atagtaaaag tccacagaat aaatga // LOCUS NGOARGF 1260 bp ds-DNA BCT 17-JUL-1990 DEFINITION N.gonorrhoeae ornithine transcarbamoylase (argF) gene, complete cds. ACCESSION M34930 KEYWORDS ornithine transcarbamoylase. SOURCE N.gonorrhoeae (strain CDC 50) DNA. ORGANISM Neisseria gonorrhoeae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1260) AUTHORS Martin,P.R., Cooperider,J.W. and Mulks,M.H. TITLE Sequence of the argF gene encoding ornithine transcarbamoylase from Neisseria gonorrhoeae JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Mulks, 06-JUN-1990. Author address [1]: M.H.Hulks Michigan State University Department of Microbiology and Public Health 367 Giltner Hall, East Lansing, MI 43324 e-mail [1]: 21667CFM@MSU.bitnet FEATURES from to/span description pept 141 1136 ornithine transcarbamoylase (argF) (EC 2.1.3.3) binding 128 133 ribosome binding site signal 1172 1204 transcription termination signal site 39 48 DNA uptake sequence site 54 63 DNA uptake sequence site 713 722 DNA uptake sequence site 1175 1184 DNA uptake sequence site 1192 1201 DNA uptake sequence BASE COUNT 350 a 350 c 319 g 241 t ORIGIN 1 tccccgtcat ccgtatggag taagggattg accgcaatgc cgtctgaaca accttcagac 61 ggcattgcaa cattccgcta acccttcttt ccgcaaacgc tgcaaatacg gcgttcacgc 121 cccacataaa ggaaacgaca gtgaacctga aaaaccgcca ttttctgaaa cttttggact 181 tcacgccgga agaaatcacc gcctacctcg accttgccgc cgagttgaaa gacgccaaaa 241 aggcagggcg cgagattcag cggatgaaag ggaaaaacat cgccctgatt tttgaaaaaa 301 catccacgcg cacacgctgt gcgtttgaag tcgccgcacg cgaccaaggc gcggatcgaa 361 cctatctgga accgtccgcc agccaaatcg ggcacaaaga aagcatcaaa gacacggcgc 421 gcgtcttagg cagaatgtac gatgccatcg aatatcgcgg cttcgctcag gaaactgtcg 481 aagaattggc aaaatatgcg ggcgtacccg tgttcaacgg gctgaccaac gagttccatc 541 ccacacaaat gcttgccgac gcactgacta tgcgcgaaca cagcggcaaa cctttgaacc 601 aaaccgcgtt tgcctacgtc ggcgacgcgc gttacaacat gggcaattcc ctgctgattt 661 taggggcaaa attggggatg gacgtgcgta tcggcgcacc tcaaagcctg tggccgtctg 721 aaggcattat cgccgccgca cacgccgccg ccaaagaaac cggtgcaaaa attaccctga 781 ccgaaaacgc gcatgaagcc gtcaaaggtg tcggtttcat tcatactgac gtatgggtca 841 gcatgggcga gccgaaagaa gtctggcagg aacgcatcga tttgctgaaa gattaccgcg 901 ttacgcccga actgatggcg gcatcgggca atccgcaagt caaattcatg cactgcctgc 961 ccgccttcca caaccgcgaa accaaagtcg gcgaatggat ttacgaaacc ttcgggctga 1021 acggtgtgga agttacagaa gaagtattcg aaagtccggc cggcatcgtg ttcgatcagg 1081 cggaaaaccg tatgcacacg attaaagcgg taatggtcgc ggctctgggc gactgacaga 1141 actgtgcctg tttaaattca tccgcaacac agataccgtc tgaacacgat gttcagacgg 1201 tatccatata acaaactgcc tacacgatgt gtaggcagtc ccgtttgaaa acaatcagtt // LOCUS PFACSPA 1218 bp ds-DNA INV 17-JUL-1990 DEFINITION P.falciparum CS gene encoding circumsporozoite protein, complete cds. ACCESSION M22982 J04650 KEYWORDS circumsporozoite protein. SOURCE P.falciparum (strain NF54) DNA, clone 1.3A. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1218) AUTHORS Caspers,P., Gentz,R., Matile,H., Pink,J.R. and Sinigaglia,F. TITLE The circumsporozoite protein gene from NF54, a Plasmodium falciparum isolate used in malaria vaccine trials JOURNAL Mol. Biochem. Parasitol. 35, 185-190 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly provided by P.Caspers, 14-MAR-1989. FEATURES from to/span description pept 1 1218 circumsporozoite protein BASE COUNT 531 a 259 c 168 g 260 t ORIGIN 1 atgatgagaa aattagctat tttatctgtt tcttcctttt tatttgttga ggccttattc 61 caggaatacc agtgctatgg aagttcgtca aacacaaggg ttctaaatga attaaattat 121 gataatgcag gcactaattt atataatgaa ttagaaatga attattatgg gaaacaggaa 181 aattggtata gtcttaaaaa aaatagtaga tcacttggag aaaatgatga tggaaataac 241 gaagacaacg agaaattaag gaaaccaaaa cataaaaaat taaagcaacc agcggatggt 301 aatcctgatc caaatgcaaa cccaaatgta gatcccaatg ccaacccaaa tgtagatcca 361 aatgcaaacc caaatgtaga tccaaatgca aacccaaatg caaacccaaa tgcaaaccca 421 aatgcaaacc caaatgcaaa cccaaatgca aacccaaatg caaacccaaa tgcaaaccca 481 aatgcaaacc caaatgcaaa cccaaatgca aacccaaatg caaacccaaa tgcaaaccca 541 aatgcaaacc caaatgcaaa cccaaatgca aaccccaatg caaatcctaa tgcaaaccca 601 aatgcaaacc caaacgtaga tcctaatgca aatccaaatg caaacccaaa cgcaaacccc 661 aatgcaaatc ctaatgcaaa ccccaatgca aatcctaatg caaatcctaa tgccaatcca 721 aatgcaaatc caaatgcaaa cccaaacgca aaccccaatg caaatcctaa tgccaatcca 781 aatgcaaatc caaatgcaaa cccaaatgca aacccaaatg caaaccccaa tgcaaatcct 841 aataaaaaca atcaaggtaa tggacaaggt cacaatatgc caaatgaccc aaaccgaaat 901 gtagatgaaa atgctaatgc caacagtgct gtaaaaaata ataataacga agaaccaagt 961 gataagcaca taaaagaata tttaaacaaa atacaaaatt ctctttcaac tgaatggtcc 1021 ccatgtagtg taacttgtgg aaatggtatt caagttagaa taaagcctgg ctctgctaat 1081 aaacctaaag acgaattaga ttatgcaaat gatattgaaa aaaaaatttg taaaatggaa 1141 aaatgttcca gtgtgtttaa tgtcgtaaat agttcaatag gattaataat ggtattatcc 1201 ttcttgttcc ttaattag // LOCUS NPA25K 900 bp ds-DNA VRL 17-JUL-1990 DEFINITION Autographa californica nuclear polyhedrosis virus 25K protein gene, complete cds. ACCESSION M22537 M23428 KEYWORDS 25K protein. SOURCE Autographa californica nuclear polyhedrosis virus DNA, clone HindIII-I. ORGANISM Autographa californica nuclear polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 900) AUTHORS Beames,B. and Summers,M.D. TITLE Location and nucleotide sequence of the 25K protein missing from baculovirus few polyhedra (FP) mutants JOURNAL Virology 168, 344-353 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.D.Beames, 10-FEB-1989. The 25K gene product is not needed for replication of the virus in tissue culture. It is believed to be important in the viral occlusion process but this has not yet been experimentally shown. Disruption of the gene leads to few polyhedra (FP) phenotype. FEATURES from to/span description pept 115 753 25K protein recomb 63 64 IFP1.6 insertion site recomb 586 587 IFP2.2 insertion site BASE COUNT 334 a 151 c 165 g 250 t ORIGIN Map position 36.6 to 37.5. 1 aagaaaacat gattacgtct gtacacgaac gcgtattaac gcagagtgca aagtataaga 61 gggttaaaaa atatatttta cgcaccatat acgcatcggg ttgatatagt taatatggat 121 caatttgaac agttgattaa cgtgtctctg ctcaagtcgt tgatcaaaac gcaaatcgac 181 gaaaatgtgt cggacaatat caagtcgatg agcgaaaaac taaaaaggct agaatacgac 241 aatctcacag acagcgttga gatatacggt attcacgaca gcaggctgaa taataaaaaa 301 attagaaact attatttaaa aaagatttgc gctttactcg acctaaactt taaacacgtc 361 atagaatctt cgtttgacaa aaaccacatt gtggccaagc tgtgtgacgc gacgcgcgct 421 aaagaatggc aaaccaagtc gcgcgagcgt cgacttaaaa attttaattt aaacattaat 481 tacgacgggc ccgtaaaaat atttgtggcc gccacagcgg agcaaaagct gttacttaaa 541 aaaactcgcg acgctctgtt gccgttttac aaatacattt ccatttgtaa aaacggtgtt 601 atggtgagac gcgacgaaaa gagtcgcgtg tttattgtta aaaatgagca aaatatagaa 661 tatctcaaag ccaacaaata ttacgctttt catagcgata gcgtcgataa ttttgagtct 721 gaaaacgata gcgaaaaaat gcttcaaatt taattaaatt taattaataa ttaattttag 781 tatcaagttg aatctcgtaa tgtaagaaaa aaaacactaa tattggttgt gattcttgcc 841 agacgtccat tttatgtgat aaataaaaca caaacacaaa gtttttatat tgtcttttat // LOCUS SHPHSKB3C 413 bp ds-DNA MAM 17-JUL-1990 DEFINITION Sheep BIIIB4 high sulfur keratin gene, 3' end. ACCESSION M21101 J03062 KEYWORDS BIIIB high-sulphur wool keratin. SOURCE Sheep (Dorset horn strain) DNA, clone pSWK31. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 413) AUTHORS Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E. TITLE The keratin BIIIB gene family: Isolation of cDNA clones and structure of a gene and a related pseudogene JOURNAL Genomics 4, 182-191 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.J.Frenkel, 24-OCT-1989. FEATURES from to/span description pept < 1 51 BIIIB4 high-sulfur keratin (AA at 1) BASE COUNT 112 a 99 c 84 g 118 t ORIGIN 15 bp upstream of PvuII site. 1 actacctaca ctcagcccag ctgtgagccc tgcatcccaa gctgctgctg accgacggct 61 acctcaccca gtgcctgaca gagtcaaccc agaagcttta gtgctcacct gtctcagtac 121 ctgcaactaa ttatgtctcc gctttcaaag ttggaacaag gcattatcac agacaaccct 181 cacaaaaaac aaaccaagag actttcgatg gccgtgtagt ggacatcagt gaacaaggac 241 agctggagta ggtagatgcc tacaggtttc ccagagttgt tcagttcgtt agtgttaaat 301 tgtatctttc tttggtgctt tgggaaattc gtttccagtc ttgaatggta tctttctgga 361 aattaaggaa cttcttcatg attattcaat aaagtttaca tctctggtat aat // LOCUS SHPHSKB3D 1080 bp ss-mRNA MAM 17-JUL-1990 DEFINITION Sheep BIIIB3 high-sulfur keratin pseudogene mRNA. ACCESSION M21102 J03062 KEYWORDS BIIIB high-sulphur wool keratin. SOURCE Sheep (Dorset horn strain), cDNA to mRNA, clone psi-BIIIB3A. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1080) AUTHORS Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E. TITLE The keratin BIIIB gene family: Isolation of cDNA clones and structure of a gene and a related pseudogene JOURNAL Genomics 4, 182-191 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.J.Frenkel, 24-OCT-1989. FEATURES from to/span description pept.ps / 191 459 BIIIB3 high-sulfur keratin BASE COUNT 310 a 226 c 195 g 349 t ORIGIN 74 bp upstream of AvaII site. 1 aaatgtcaaa ttgattataa aaattttagt ggaaaacggt ggcataaata aaaaagagag 61 gaaaactttt ggtggaccaa cacctgatga agcggatata taaagagccc caaagtgaga 121 ggagacattt cgtcctggat agttcgtctt tcactccaag atgccaaaag aaaccaaggt 181 tcccggtgcc acggcttgct gagctcccct ctgctgcagt gttcccacca gccccgccac 241 tactatctgc tcctctgaca aattctgggc atgagtctgc ctgcccagct cctgctcaca 301 tatggtttgg ttactggaga caacctgctg tgacagctgc ccaaagcctg cacattcctc 361 aaccctatgt gccaacctgc ttcctgctca actctaccca gcccaccctg gaccaggaaa 421 ccatcagcct cacagccctg tatcccaagc tgctgctaac tgatgtttga ctcactcagt 481 gcctggcaat aacacagaag ctatctgttt ggcattcact tgctcagtat tttatcacat 541 attgaggtag acccaatggc atagatatgg aaggcctact ttttatctta atggaaagaa 601 aataaataaa ctttatgctt atttggctaa caattttttc atttgggcag gtgaatgtca 661 tctatgtcaa aatagtatta aagtctataa gacttcagac cctattttat tggtcatatt 721 gcttcctgga ccccatttat tgtattcggg attttcatag aggagaaata attttgatgg 781 ttttcctaat aagctatttc tgtggcaaaa aagagaaaga gatgtgaaat tttgttgagt 841 gctttttctg tatctatgga agtaattatt tagttttttt tatttaatct tttacatttt 901 aaagtttata actatgttca ttttgattga ttttaaaaat gttaaaccaa ccttcttttc 961 ctacgaaacc cagattgata taatgtattt tccctttcat acatgattga ctttgatttg 1021 ctacattaaa aacatttatt tatttggcta cactggatct tagttgcagc agatgagatc // LOCUS SHPHSKB3E 1047 bp ss-mRNA MAM 17-JUL-1990 DEFINITION Sheep BIIIB4 high-sulfur keratin mRNA, complete cds. ACCESSION M21103 J03062 KEYWORDS BIIIB high-sulphur wool keratin. SOURCE Sheep (Dorset horn strain), cDNA to mRNA, clone BIIIB4. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1047) AUTHORS Frenkel,M.J., Powell,B.C., Ward,K.A., Sleigh,M.J. and Rogers,G.E. TITLE The keratin BIIIB gene family: Isolation of cDNA clones and structure of a gene and a related pseudogene JOURNAL Genomics 4, 182-191 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.J.Frenkel, 24-OCT-1989. FEATURES from to/span description pept 289 585 BIIIB4 high-sulfur keratin BASE COUNT 298 a 294 c 193 g 262 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttgtgt tccagtgaca aatagcaaga gtaatttgtt cagatttagc agtgaactca 61 aacaaaggtt aattaggaaa ataaagactc tttacaagca tctaaatgtc aaaccagtca 121 aaggataatg ggtacactaa cataaacaca caggaaagaa gccttccggt tggaccaaca 181 ccactggaga gggtatataa gagcctcaga gcagaaagag agattctcac gccaggagac 241 tcgtcttctt taaaaccaaa tcaataaaaa acacagcttc ccaacaccat ggcttgctgt 301 gcccgcctct gctgcagcgt ccccaccacc cccgccacca ccatctgctc ctctgacaaa 361 ttctgcagat gtggagtctg tctgcccagc acctgcccac acacagtctg gttcctggag 421 ccaacctgct gtgacaaccg ccccccacct tgccacattc ctcagccctc tgtgcccacc 481 tgcttcctgc tcaactcttc ccagcccacc ccaggcctgg aaagcatcaa cctcacaacc 541 tacactcagc ccagctgtga gccctgcatc ccaagctgct gctgaccgac ggctgcctca 601 cccactgcct gacagagtca acccagaagc tttagtgctc acctgtctca gtacctgcaa 661 ctaattatgt ctccgctttc aaagttggaa caaggcatta tcacagacaa ccctcacaaa 721 aaacaaacca agagactttc gatggccgtg tagtggacat cagtgaacaa ggacagctgg 781 agtaggtaga tgcctacagg tttcccagcg ttgttcagtt ccttcgtgtt aaattgtatc 841 tttctttggt gctttgggaa ttctgtttcc agtcttgaat cgtatctttc tggaaattga 901 ggagcttctt catgattatt ctaataaagt ttccatctct ggcataacat aaatgtctat 961 aggtatttcc atttattttt gttaacacat caaatctata tcttaactcc aaaggacacc 1021 attttaacaa actgagaaca tctgcag // LOCUS HUMRARG2 250 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human retinoic acid receptor gamma 2 mRNA, 5' end. ACCESSION M32074 KEYWORDS retinoic acid receptor gamma. SOURCE Human fetal skin, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 250) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Chambon,P., 13-FEB-1990, for release after publication. FEATURES from to/span description pept 62 > 250 retinoic acid receptor gamma 2 mRNA 1 > 250 retinoic acid receptor gamma 2 mRNA BASE COUNT 38 a 77 c 87 g 48 t ORIGIN Chromosome 12. 1 ggggaatcgg ctgcgggtcc ttggtgtttc cagcacccag tttcccttaa gccgggtcgc 61 gatgtacgac tgtatggaaa cgtttgcccc gggtccgcga cggctgtacg gggcggccgg 121 gcccggggcc ggcttgctgc gcagagccac cggcggctcc tgtttcgccg gacttgaatc 181 ttttgcctgg ccgcaacccg ccagcctgca atcggtggag acacagagca ccagctcaga 241 ggagatggtg // LOCUS MUSRARG1 620 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 1 mRNA, 5' end. ACCESSION M32068 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse F9 cell, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 620) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Chambon,P., 13-FEB-1990, for release after publication. FEATURES from to/span description pept 399 > 620 retinoic acid receptor gamma 1 mRNA 1 > 620 retinoic acid receptor gamma 1 mRNA BASE COUNT 108 a 217 c 192 g 102 t 1 others ORIGIN Chromosome 15. 1 attttggggc ggtcacgtgg gcgggcgggc tccgagaggc cccgggattg tcccagccta 61 gagccgtgcc cccctgaagc cccccattac cgcgagtcac taacaccgcg gtgctccatc 121 cccgagaccg cccgacgccg ggacctgggg ctctgcggcc ttccttcccc cgccctcccc 181 tccagcagtt tccaccaggt ccctcacctc agcctggccc agtatgtagg agggactctc 241 tgcangaggc cagagggatc cttggaaacc cactggacag accaggcagg gtgggcacgg 301 agcctcccag gccagggcag tgggcatggg cgggggctgt agctgaagac cacccccgcc 361 tgctgcagag tccaagggat tcccacgccg cagctaccat ggccaccaat aaggagagac 421 tctttgcgcc cggtgccctg gggcctggat ctggttaccc aggagcaggc ttcccattcg 481 ccttcccagg tgcactcaga gggtcgccac catttgagat gctgagccct agcttccggg 541 gcctgggcca gcctgacctc cccaaggaga tggcttctct ctcggtggag acacagagca 601 ccagctcgga ggagatggta // LOCUS MUSRARG2 479 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 2 mRNA, 5' end. ACCESSION M32069 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse F9 L cell, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 479) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by P.Chambon, 13-FEB-1990, for release after publication. FEATURES from to/span description pept 291 > 479 retinoic acid receptor gamma 2 mRNA 1 > 479 retinoic acid receptor gamma 2 mRNA BASE COUNT 75 a 163 c 143 g 98 t ORIGIN Chromosome 15. 1 tctaggcttt gcttttacaa aaggtctcca cagagctagc tgcctaggca ccctaagtga 61 ggcagactga ggcagaggct acagcttcgc cttcgcgccc cgcccagttc ttttcccttg 121 ctggggatcc ccctcctccc cgcccatccc ttacccccca tgcatctcag cgccctacgc 181 tagccctccc cctcccccct ccaggagcgg ggcgctgcgg ggggaggagg gggaatgggc 241 tgcgggtcct tggtgttcta gcacccagtt tctctccaag ccgggtcgcg atgtacgact 301 gcatggaatc gtttgtcccg ggtccgcgac ggctgtatgg ggcggccggg cccggggccg 361 gcttactacg cagagccact ggcagctcct gcttcgccgg acttgagtct tttgcctggg 421 cacagcctgc cagtctacaa tcggtggaga cacagagcac cagctcggag gagatggta // LOCUS MUSRARG3 646 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 3 mRNA, 5' end. ACCESSION M32070 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse 11.5 day embryo F9 L cell, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 646) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Chambon, 13-FEB-1990, for release after publication. FEATURES from to/span description pept 496 > 646 retinoic acid receptor gamma 3 mRNA < 1 > 646 retinoic acid receptor gamma 3 mRNA BASE COUNT 151 a 150 c 184 g 161 t ORIGIN Chromosome 15. 1 tccccacccc caccgccacc aactgcttcc agggaactct gggattacta gagtcaggag 61 tgagccctaa ccttttcagt tctatgcccc tccccgcccc ctttaaaaat gtgtatggtg 121 ttctgtctat atgtattttc gcgtgccatt tttgtgcctg gtgcccatgg aagatggaag 181 agggaaccgg ttctgttaga actggcgatt acaaatggtt gtaaactacc atgtagatgc 241 tgggaatgga accctggtcc tctgggagag cagccactgc tcttacctgc tgagtcccaa 301 ccaatcttca actttatgga gcagaagcag agaagttaac tccttggcct ggaatttgca 361 gaagtgaacg ttaatgtaga agagttggct ttatgggggt ggggatgggg tagggggcag 421 tggtggggcc tgaaatccca acaagctaca aagagtggtg gtctgggctt tccagggagt 481 acctgttaag ggcttatgca caagggtgac aacagcggtc accagcaggt cccaagaaag 541 agaggccatg ggatgagggt gcttctgctc agcttctgct tatcttctca tgctgctttt 601 cattcagcgg tggagacaca gagcaccagc tcggaggaga tggtac // LOCUS MUSRARG4 309 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 4 mRNA, 5' end. ACCESSION M32071 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse 9.5 day embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 309) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Chambon, 13-FEB-1990, for release after publication. FEATURES from to/span description pept 300 > 309 retinoic acid receptor gamma 4 mRNA < 1 > 309 retinoic acid receptor gamma 4 mRNA BASE COUNT 77 a 81 c 100 g 51 t ORIGIN Chromosome 15. 1 gatgacaatc gtaaggcccc caaatatcca tcttccccct ctctgtaatg ggagatgcca 61 ggcttggggc ccaggaaccc ctttccagag cggcaataaa aagctgggga tggagaggca 121 gtaaaaataa agtccagatg agaagcggct ttgaagcctg ggccacagtg cctggcgggg 181 gtgggctgtt ggaaggggct gagagaggag cctgggacac cccccccccc aaatttactg 241 ggggttggca ccgcactcgc agtctagcgg tggagacaca gagcaccagc tcggaggaga 301 tggtaccca // LOCUS MUSRARG5 111 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 5 mRNA, 5' end. ACCESSION M32072 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse F9 cell, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 111) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Chambon,P., 13-FEB-1990, for release after publication. FEATURES from to/span description pept 102 > 111 retinoic acid receptor gamma 5 mRNA 1 > 111 retinoic acid receptor gamma 5 mRNA BASE COUNT 23 a 26 c 37 g 25 t ORIGIN Chromosome 15. 1 tgaagcttct ccttgggggc tggattttgg attgaagagt cagcccctgg caccttgggg 61 atatccttgc ggtggagaca cagagcacca gctcggagga gatggtaccc a // LOCUS MUSRARG6 420 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid receptor gamma 6 mRNA, 5' end. ACCESSION M32073 KEYWORDS retinoic acid receptor gamma. SOURCE Mouse 9.5 day embryo F9, P19, and L cells, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 420) AUTHORS Kastner,P., Krust,A., Mendelsohn,C., Garnier,J.M., Zelent,A., Leroy,P., Staub,A. and Chambon,P. TITLE Murine isoforms of retinoic acid receptor gamma with specific pattern of expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2700-2704 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Chambon,P., 13-FEB-1990, for release after publication. FEATURES from to/span description pept 407 > 420 retinoic acid receptor gamma 6 mRNA 1 > 420 retinoic acid receptor gamma 6 mRNA BASE COUNT 84 a 121 c 128 g 87 t ORIGIN Chromosome 15. 1 tgaggcagcg ctctgtgggt agggagggcg tcagtgcagg agagagtacc acccacgctc 61 aggcctgtgg ggaccccagg atgggctgaa agtgagggcc ggaaaggcct tccaggcttc 121 cccaaacctc ccagcaccta ccattcaggc aaccccaccc ccagtcttga ataaactccc 181 tgcacccttc cgcccccttt cttttgaggg ggaatctaac tccagcagga ttcttatgct 241 aattgggtgc gtggggggtg gggtgggtgg agaaggcttc cctcttgtaa ggtggggagc 301 tggtctggaa cccccaacct ataggctctt ctgtcctctc actaccttgg gtctcagtat 361 ggacttgtga ccagcggtgg agacacagag caccagctcg gaggagatgg tacccagctc // LOCUS PMGMTF1 190 bp ds-DNA ORG 17-JUL-1990 DEFINITION P.magellanicus mitochondrial fragment I DNA. ACCESSION M34468 KEYWORDS . SEGMENT 1 of 3 SOURCE P.magellanicus mitochondrial DNA. ORGANISM Mitochondrion Placopecten magellanicus Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia; Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea; Pectinidae; Placopecten magellanicus. REFERENCE 1 (bases 1 to 190) AUTHORS La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E. TITLE Molecular characterization of a repeat element causing large-scale size variation in the mitochondrial DNA of the sea scallop Placopecten magellanicus JOURNAL Mol. Biol. Evol. 7, 45-64 (1990) STANDARD simple staff_review BASE COUNT 66 a 63 c 14 g 47 t ORIGIN 1 ctttcaaaca caaaaaacac aaaaagctcg ccaataaact ttccaatttt ttaacactac 61 agctaagttt tccccattct ttcacctcaa gcgaagggaa atctatatcc ccactcccaa 121 caatctactg gcatactact tatactacca aagaaactca tctacccccc atccacccca 181 cattggtacc // LOCUS PMGMTF2 1449 bp ds-DNA ORG 17-JUL-1990 DEFINITION P.magellanicus mitochondrial fragment II DNA. ACCESSION M34466 KEYWORDS . SEGMENT 2 of 3 SOURCE P.magellanicus mitochondrial DNA. ORGANISM Mitochondrion Placopecten magellanicus Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia; Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea; Pectinidae; Placopecten magellanicus. REFERENCE 1 (bases 1 to 1449) AUTHORS La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E. TITLE Molecular characterization of a repeat element causing large-scale size variation in the mitochondrial DNA of the sea scallop Placopecten magellanicus JOURNAL Mol. Biol. Evol. 7, 45-64 (1990) STANDARD simple staff_review BASE COUNT 405 a 403 c 193 g 448 t ORIGIN About 500 bp after segment 1. 1 ctgcagcttt ttttttcctt ccccgaaatt ttacccccca ataaaaatgg ataaaaaccc 61 cctccaaaaa ataagaaaaa accggaaaac gcaattttcg tgtaaaaaaa aggtgttttt 121 cgaaaaaaaa aaataaaatc tgtacctcgc cgaccaaaaa actttttcat tttttcaaca 181 aatttttcaa ctttttaatc atcaaaccga gcacttttta taaacataaa ccacgaccaa 241 ctgaagggaa aaaaaacact ttattttggg cacttttctc aaaaaagctt cagctagaat 301 tgccagaaat ccttgcccaa attcgacaaa ttttaggtac ctttgttttt actgcgaagg 361 taaagttaaa cttcgcccaa tcccctctac cagcgagggg ggaataatcc ccactttcag 421 gctataagcc taacgttctt actacactac ctcgaccccc cattcctact gcccagcccc 481 ttctctcact aaaccatcca ccccccactt tccaattttt ttaacactac agctaagttt 541 tccccattct ttcacctcaa gcgagaaggg aaatctatat ccccactccc aacaatctac 601 tggcatacta cttatactac caaaagaaac tcatctaccc cccatccacc ccacattggt 661 accagaacgt tcctacaaac cctttcttcg ctataggggg aaggaacctt cttccttccc 721 ctcggacccc ttcctcgccc cctcccccct cccatcccca acccagctat cctttcccct 781 cccccctagc ccccctacac gcttaactgg tagtcccccc ccttcctccc cccctccctg 841 tgcgggtcaa cgaacctgtc tctactccgc tcttgatcaa acctcgaaaa aaaaaatctt 901 tttttttttt tttttttcgg tctccaagga ctccgtggcc gagaccagga ttctaccctg 961 tagggaaagt ggagggtaag gggttaaata tttatcccgg gcggggggaa cccggaatta 1021 tcctttataa ccaatttcaa cccaatttat ggctccgtac caaaacgcta tttgccccac 1081 ccccctctga ggctttttga ttcagaaaac acgagattta gtgatatttt tcataatatg 1141 gcacaagcaa ttcactgaag gcctaaattt ccgaccctgg aacaattttt ttcaaaaatt 1201 tgacatctaa aagtcacctg aatttacagc cttttttttg attttcaaat ttttgccgtt 1261 tttggacccc gttttttttg actttttttt tgtgtttttt tgtgtttttt tttcaccctt 1321 ttttttgcca aaaattacca tttttgcttg gctacaataa gtttacaaag ccggctataa 1381 aaaattttta gaaaaacccg aaaatatcag ttttgagttt tgaaatgaac tttttttttt 1441 tgactgcag // LOCUS PMGMTF3 746 bp ds-DNA ORG 17-JUL-1990 DEFINITION P.magellanicus mitochondrial fragment III DNA. ACCESSION M34467 KEYWORDS . SEGMENT 3 of 3 SOURCE P.magellanicus mitochondrial DNA. ORGANISM Mitochondrion Placopecten magellanicus Eukaryota; Animalia; Eumetazoa; Mollusca; Bivalvia; Pteriomorphia; Eupteriomorphia; Ostreoida; Pectinina; Pectinacea; Pectinidae; Placopecten magellanicus. REFERENCE 1 (bases 1 to 746) AUTHORS La Roche,J., Snyder,M., Cook,D.I., Fuller,K. and Zouros,E. TITLE Molecular characterization of a repeat element causing large-scale size variation in the mitochondrial DNA of the sea scallop Placopecten magellanicus JOURNAL Mol. Biol. Evol. 7, 45-64 (1990) STANDARD simple staff_review BASE COUNT 260 a 213 c 99 g 174 t ORIGIN About 500 bp after segment 2. 1 ctgcagcttt ttttttcctt ccccgaaatt ttacccccca ataaaaatgg ataaaaaccc 61 cctccaaaaa ataagaaaaa accggaaaac gcaattttcg tgtaaaaaaa aggtgttttt 121 cgaaaaaaaa aaataaaatc tgtacctcgc cgaccaaaaa actttttcat tttttcaaca 181 aatttttcaa ctttttaatc atcaaaccga gcacttttta taaacataaa ccacgaccaa 241 ctgaagggaa aaaaaacact ttattttggg cacttttctc aaaaaagctt cagctagaat 301 tgccagaaat ccttgcccaa attcgacaaa ttttaggtac ctttgttttt actgcgaagg 361 taaagttaaa cttcgcccaa tcccctctac cagcgagggg ggaataatcc ccactttcag 421 gctataagcc taacgttctt actacactac ctcgaccccc cattcctact gcccagcccc 481 ttctctcact aaaccatcca ccccccactt tcctccggga aaaaaaaaaa aaagaaggga 541 tctcatataa ccccatctta ccgctccggc acaaccaata gacgattcac aagtagagcc 601 cagccctcct tgcgacccta accctgaaag taccgaagag acccgcaacc atcacaaaca 661 aaaacatccg gcctaaagga accaaacccc aacaccagtc cgaggggtta tcgtttcaat 721 accgcatgca tagaacaaac ctgcag // LOCUS HUMCREB 1026 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human active transcription factor CREB mRNA, complete cds. ACCESSION M34356 KEYWORDS active transcription factor CREB; alternative splicing. SOURCE Human T-cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1026) AUTHORS Berkowitz,L.A. and Gilman,M.Z. TITLE Two distinct forms of active transcription factor CREB JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Z.Gilman, 16-MAY-1990. FEATURES from to/span description pept 1 1026 active transcription factor CREB-B pept 1 261 active transcription factor CREB-A 304 1026 active transcription factor CREB-A BASE COUNT 332 a 249 c 233 g 212 t ORIGIN 1 atgaccatgg aatctggagc cgagaaccag cagagtggag atgcagctgt aacagaagct 61 gaaaaccaac aaatgacagt tcaagcccag ccacagattg ccacattagc ccaggtatct 121 atgccagcag ctcatgcaac atcatctgct cccaccgtaa ctctagtgca gctgcccaat 181 gggcagacag ttcaagtcca tggagtcatt caggcggccc agccatcagt tattcagtct 241 ccacaagtcc aaacagttca gtcttcctgt aaggacttaa aaagactttt ctccggaaca 301 cagatttcaa ctattgcaga aagtgaagat tcacaggagt cagtggatag tgtaactgat 361 tcccaaaagc gaagggaaat tctttcaagg aggccttcct acaggaaaat tttgaatgac 421 ttatcttctg atgcaccagg agtgccaagg attgaagaag agaagtctga agaggagact 481 tcagcacctg ccatcaccac tgtaacggtg ccaactccaa tttaccaaac tagcagtgga 541 cagtatattg ccattaccca gggaggagca atacagctgg ctaacaatgg taccgatggg 601 gtacagggcc tgcaaacatt aaccatgacc aatgcagcag ccactcagcc gggtactacc 661 attctacagt atgcacagac cactgatgga cagcagatct tagtgcccag caaccaagtt 721 gttgttcaag ctgcctctgg agacgtacaa acataccaga ttcgcacagc acccactagc 781 actattgccc ctggagttgt tatggcatcc tccccagcac ttcctacaca gcctgctgaa 841 gaagcagcac gaaagagaga ggtccgtcta atgaagaaca gggaagcagc tcgagagtgt 901 cgtagaaaga agaaagaata tgtgaaatgt ttagaaaaca gagtggcagt gcttgaaaat 961 caaaacaaga cattgattga ggagctaaaa gcacttaagg acctttactg ccacaaatca 1021 gattaa // LOCUS PFAATUBII 3706 bp ds-DNA INV 17-JUL-1990 DEFINITION P.falciparum alpha-tubulin II gene, complete cds. ACCESSION M34390 KEYWORDS alpha-tubulin II. SOURCE P.falciparum haploid intraerythrocytic cell line K1 (Thailand) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 2779) AUTHORS Holloway,S.P., Gerousis,M., Delves,C.J., Sims,P.F.G., Scaife,J.G. and Hyde,J.E. TITLE The tubulin genes of the human malaria parasite Plasmodium falciparum: Their chromosomal location and sequence analysis of the alpha-tubulin II gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 2780 to 3706) AUTHORS Holloway,S.P., Gerousis,M., Delves,C.J., Sims,P.F.G., Scaife,J.G. and Hyde,J.E. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hyde, 22-MAY-1990. FEATURES from to/span description pept 749 810 alpha-tubulin II, exon 1 1022 1142 alpha-tubulin II, exon 2 1262 2431 alpha-tubulin II, exon 3 IVS 811 1021 alpha-tubulin II intron A IVS 1143 1261 alpha-tubulin II intron B BASE COUNT 1390 a 347 c 502 g 1467 t ORIGIN Chromosome 4. 1 gaattcctta tatatgtact cgttttatat ttttttccta taatttcaac atattaaaaa 61 gtactgaata taacattata tatttacata ttcaagttat atttattttc tttttctttt 121 ttcttgatat attatattat ttgattattt ttttttcttt taatatgtgg cttatattta 181 caaaaatagt aatatatata tgttaatatg taaagaaaca gaaatgccat tgtatgaaat 241 ataaaagttt caattttttt aaaaaatata aaaaaagaaa gaaaataaca aataaaaata 301 aatatattat atatatgtgt ttatattttt atttttatat ataattatat tttttcttta 361 tgatatatat tattatatct ttataaagaa gacataaaag aaacagattg attgaagttt 421 ttcttgcttt aatataagaa agtaatttca taatgaaaat aatatatata tatgtatatg 481 atattttaat ttataattat aatgagtttg taaaaatata ttttatataa gtttatatgt 541 attttttata tatatatata tatatatata tggctttgga taatataata tatatatata 601 aataaataaa tatatatata tatatatata tatatatgta tatttattta catatttatt 661 taatataggt acaaacatag acctacatat ttttatattt atttatttta ttttatatta 721 acatttttta tctaacgttt tacgaacgat gagagaagtc attagtattc atgttggaca 781 ggctggtatt caaataggaa atgcttgctg gtaaaaaaaa aaaaaaaaaa aaaaaaaaaa 841 aaaaaatata tatatatata tataaatgca tgtgtgttgt atatttttct tcttcatttt 901 tatttttaaa gagatacata atgtttaatg tattattcaa tgtgttgtgg taaaaacatg 961 tgtataaatt ataattaaaa atatacatat atatatatat atatatattt tttttttata 1021 gggaattgtt ttgccttgaa catggaattc aaccggatgg gcagatgcca agtgaccaag 1081 tcgttgctgg tggtgatgat gcctttaata catttttctc agaaacggga gctggaaaac 1141 atgtaataat aaaaaaaaaa aaaaaaaaat atatacatat atatatatat atatatatat 1201 atttatttat ttatttattt acttatacat gtgtgcttct ttttgtcgtt actcatttta 1261 ggtaccacgt tgtgtgttcg ttgatttaga acccaccgtc gttgacgaag ttcgaacagg 1321 aacgtatcgt cagctgtttc accctgaaca actaatatct ggaaaagagg atgcagcaaa 1381 taatttcgca aggggacatt ataccatagg aaaagaaatt gttgatgtat gtttggatag 1441 ggttcgaaag ttggctgata attgcactgg attacaagga tttttgatgt ttaatgcagt 1501 aggtggaggt acaggtagtg gtcttggttg tttattatta gaaaggttgg caatagatta 1561 tggaaagaaa tcaaaattaa atttttgttc gtggccatct cctcaagtat cgacagctgt 1621 tgtagagcct tataattctg tattatcaac acattcattg ttagaacata cagatgtggc 1681 aattatgctc gataacgaag caatatatga tatatgtaag aaaaatttag atatagaaag 1741 gccaacctat actaacttga atagattgat tgctcaagtt atctcttcat taacagcatc 1801 tttaagattt gatggtgctt tgaatgttga tgtaacagaa tttcagacta atttagtacc 1861 atatcctaga attcacttta tgttatcatc atatgctcca atcataagtg ctgagaaggc 1921 atatcacgag caattgtcgg tttctgaaat aacgaattct gcctttgagc ctgcatctat 1981 gatggcaaag tgtgatccca gacatggaaa atatatggct tgttgtttaa tgtatagagg 2041 agatgtagta ccaaaggatg ttaatgctgc cgtcgcaact attaagacta agagatctat 2101 acaattcgtt gattggtgtc ctacgggatt taaatgtgga atcaattatc agccccctac 2161 cgtggttcca ggaggagatt tagcaaaagt tatgagagct gtttgcatga tcagcaactc 2221 aacagcaatt gccgaagtat tctcacgaat ggaccaaaaa tttgatttga tgtatgcgaa 2281 aagggctttc gttcattggt atgtaggtga aggtatggaa gaaggagaat ttagtgaagc 2341 tagagaggat ttggctgcct tggaaaagga ttatgaagag gtaggaatag aatcgaatga 2401 tggagaggga gaagatgagg gatatgaatg aacataaata taaatggata aataaataaa 2461 tatatatata tatatatata tatatatata tatatatatg tatatctact tataattatt 2521 tttttttttt atctcatacg tcctatattc ccaaaatatt atatatatgt tcataataat 2581 tatatggaac atatatatcc tcttatcata ttgtatattt attcgttgta aaaaataaaa 2641 aaaaatatac aagaatatta atattttttt ctatgtattt acaccttcat tagtttaata 2701 ttcaagaaaa ataatactta ctagaaataa ataaaatata tatatatata tatatgtatg 2761 tatatatgtg tgtatgtacc attaattagt aagtagtaat ttattatgca atttatttat 2821 tatatatata tatatatata tatatatata gtatcaaatt aaatatacga caattcgtaa 2881 gtacattctt gaaaaaataa atgtatagat ataatattat tttttttata caaatttata 2941 tttgtagaat atatattttt ttccctttat tattatgcat tatataataa tatatacatg 3001 ataatatgca tacatataca tattatatat atatatgtat gtatgtatat atgtcgtgtg 3061 ataatttatc tttatgctaa taatacatta cacatataaa tacttattta tatctatata 3121 tatttttttt atatacgaca ataatttatg taaacattaa gttacatatt aaagttcgat 3181 taataaatat ataataattt tttttatttt attatatcta tttttccaat gtataaataa 3241 tatatgaaat atactatatg ctatataata tgagcacaca taagatttgg tgttatataa 3301 atgtagaagt aatattaata gtatcagacg aaaagaacat tttttttttt tttttttttt 3361 tttttttggt ccttttaaat tgatataaaa aactaaactt tagagtttct ttaaaatgag 3421 acatttttta caacttaata taatttaatt tttttttttt tttttttcat atttattaga 3481 aatgaatata taggttaaca catatatatt atatatatat atattaacat gaaaaggtta 3541 aaaaaaaata tggaaaatac aaacatataa ttatatacat atatattata tatatgtatg 3601 tatctttttt atgttttaat ttggtcacgt ggttaatttt atatgatgct aaaaaaattt 3661 tcttccattt aagaaacagg aattaacaaa aaatttaagg gaattc // LOCUS XELFIMB1 1341 bp ss-mRNA VRT 17-JUL-1990 DEFINITION X.laevis integumentary mucin B.1 (FIM-B.1) mRNA, complete cds. ACCESSION J02910 KEYWORDS integumentary mucin; mucin. SOURCE X.laevis skin, cDNA to mRNA, clones pREP1218 and pXGL-3'-1. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1341) AUTHORS Probst,J.C., Gertzen,E.-M. and Hoffmann,W. TITLE An integumentary mucin (FIM-B.1) from Xenopus laevis homologous with von Willebrand factor JOURNAL Biochemistry 29, 6240-6244 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Hoffmann, 26-APR-1990. FEATURES from to/span description pept < 1 1197 integumentary mucin (FIM-B.1; AA at 1) mRNA < 1 1341 FIM-B.1 mRNA signal 1319 1324 poly-A signal rpt 1 279 repeat sequence BASE COUNT 472 a 290 c 268 g 311 t ORIGIN 1 cctgcacctt cagaaacaac aggagagtca acacctgcac cttcagaaac aacaggagag 61 tcaacacctg caccttcaga aacaacagga gagtcaacac ctgcaccttc agaaacaaca 121 ggagagtcaa cacctgcacc ttcagaaaca acaggagagt caacacctgc accttcagaa 181 acaacagtac catcaggaga gtcaacacct gcaccttcag aaacaacagt accatcagta 241 ccatcaggag agtcaacacc tgcaccttca gaaacaacag agctgagaat catcccacca 301 gaagtatcta cagttgcagt gccagtaaca actggacaga taactcctgc agtaacaact 361 gaacacagca cagaagaaat actgacactt cctccgccag tggtaggacc agttcttcca 421 gcaaagccaa cagtagatat ctccaaatat acaaacacta caactactaa atcgactgtc 481 cctacaacta ccattcctcc taaagctact tgctgtggat cgtctggaga gtcagtacaa 541 gcaggtcata tgtggcaaac tggttgtgat gtgtgcacat gcaatggaac ctctggaaaa 601 acacaatgtg caccacgcca gtgtgaaaag gagataatct gtaagagcga cgaaaggagg 661 gtactgagaa aacctgggaa atcatgctgt ggatattgcg agccgctgac atgtaaacat 721 aatggaacag aatataagct tggggctacc tttatagaca aatcgaatcc ttgtatcact 781 tatagatgtg acgcatcagg cttgacagtg aatgttaaat catgcccaaa tgaacaagtt 841 tgcagtaaga gtgaaagaac ctatgattca gatggctgtt gtttctcatg tgacacatca 901 tgtaaaccag tccctgcgac agtgggtatc caaggggagt acgattatca gaatgaaaaa 961 acaaactgct ctgctaatat cataatggca aaatgttctg gtcaatgcca acataaattg 1021 acctatgata caattgacaa taaagttgtt accaaatgtc ggtgttgtaa ggcagaccgt 1081 gttgaaccaa gaaaagctca tctggtgtgt gataatggaa agaaaaaaat atataaatac 1141 aaacatatca catcttgcaa atgcacatct tgtaccgctt acaatataag actgtgaaga 1201 ctttcatgca tttgctaact gtcagctttt gcatgtacaa taaaataaac atgggggata 1261 tattggaggc agagatatct gttatggtac ataaagtatt ttttgtctgt tattgaaaaa 1321 taaatggttt ttttcagctg t // LOCUS YSCANBI 2156 bp ds-DNA PLN 17-JUL-1990 DEFINITION S.cerevisiae ANB1 locus encoding protein synthesis initiation factor (eIF-4D) gene, complete cds. ACCESSION J05455 KEYWORDS protein synthesis initiation factor eIF-4D. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2156) AUTHORS Mehta,K.D., Leung,D., Lefebvre,L. and Smith,M. TITLE The ANB1 locus of Saccharomyces cerevisiae encodes the protein synthesis initiation factor eIF-4D JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Dr. K.D.Mehta, 16-MAR-1990. FEATURES from to/span description pept 569 1042 protein synthesis initiation factor (eIF-4D) mRNA 516 > 1042 protein synthesis initiation factor mRNA (alt.) mRNA 526 > 1042 protein synthesis initiation factor mRNA (alt.) mRNA 540 > 1042 protein synthesis initiation factor mRNA (alt.) signal 395 400 TATA box BASE COUNT 602 a 422 c 369 g 763 t ORIGIN 3 bp upstream of SmaI site. 1 cccgggaatt ttagattcag gtaggaaatt gattacataa atactgttac cctgaatcat 61 attcgacgat gtcgtctcac acggaaatat aattcatttc ttggttttcc aaaaaaattt 121 tcattttttt tcactttttt gtttcgtcct cctttttttt tttttttatt ttttttcctg 181 tgttcacctt tttttttttt ttttcagttt acaactttct gcattctttt ctgtgttttt 241 tttttttcgt ttttccattg ttcgttcgtt gcctgttttt ttgccctatt gttctcgagc 301 ctaaaaattt tttcctttcc tgctttcctt tcttcgttca aagtttccta ttccattgtt 361 ctcttcggta aactcattgt tgtcggaact cagatatatt caggtcaatt tactgtactt 421 caattgactt ttttcttgaa atttcaactt gccttttcaa cttgttcttc ttttttaatc 481 ttattctaca ctttagttcc cttaccttgt tcctaattat tgtctagcaa aaagaaaaca 541 tacacctatt tcattcacac actaaaacat gtctgacgaa gaacacacct ttgaaaatgc 601 tgacgctggt gcctccgcca cctacccaat gcaatgttct gccttgagaa agaatggttt 661 cgttgtcatc aaaggtagac catgtaagat tgtcgacatg tccacttcca agaccggtaa 721 gcacggtcac gccaaggtcc atttggttac ccttgatatc ttcactggta agaagttgga 781 agatttgtct ccatccactc acaacttgga agttccattt gtcaagagaa gcgaatacca 841 attgttggat attgatgatg gctacttgtc cttgatgacc atggacggtg aaaccaaaga 901 cgatgtcaag gcaccagaag gtgaattggg tgacagcatg caagctgctt tcgatgaagg 961 taaggatttg atggtcacta tcatttctgc catgggtgaa gaagctgcca tctccttcaa 1021 ggaagctcca agatctgatt agatatttta gatttctata gcttacaaaa tacgcgatgt 1081 ttttttgatc tagaaatctt atttatacat tttatataat attataataa tcataatcat 1141 aattattcat cctaactaac ttcgttctgt ttcagctacg aagttgcctt cattatcggt 1201 acgttcgtgt taattggtgt ttgataccat agtcacttat gacgacttac agaacttttt 1261 tttttgacgg atattttccc ctcaatatgc cgaggatgat aactttataa attgccttct 1321 ttcggttcct tttttctttt taatttagct gacgggaaca tgtgtatata ccccagaaga 1381 atttaaacag aaattttttt gagctatttt acgaaaatat taaaaactag ggctacagat 1441 atcagagaaa aaattactta tatgatgaca ataacttaat atcacagtca ctatcgttct 1501 cgcctttata tgcgcaatga cagcattcga cttttcctta gaagggatcc gatcaacgaa 1561 acttcaaaaa gagtagaagt cgaaatattt tttgcacgac gagggtgaca aaccgtgcta 1621 tgtccacaaa tgaggccctg ctttcccaag ataaattgcg taaacatata attgttacaa 1681 ggatccgatc aacgaaactt caaaaagagt agaagtcgaa atattttttg cacgacgagg 1741 gtgacaaacc gtgctatgtc cacaaatgag gccctgcttt cccaagataa attgcgtaaa 1801 catataattg ttacaaatcc aaggctctgt gatttcttca ccaagtgtag ttgtgatgta 1861 tagtgtgtag gataggtaat gaccggtaaa tcaagctggt aaaaagtcct aataacaagc 1921 ccgaatctgg aataatggta atattcccat gccaggtgga cagtgttgtc cttacacaaa 1981 ctcaagagat taaaggcaac ttggccgagt ggttaaggcg aaagattaga aatcttttgg 2041 gctttgcccg cgcaggttcg agtcctgcag ttgtcgttat tttttatttc cctttctttt 2101 cgtacataca caatgcacag tcttcaaagt aaaataccaa cgtatgtatc aagatc // LOCUS BSUBMRX 1402 bp ds-DNA BCT 17-JUL-1990 DEFINITION B.subtilis multidrug-resistant protein (BMR) gene, complete cds. ACCESSION M33768 KEYWORDS multidrug-resistance. SOURCE B.subtilis DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1402) AUTHORS Neyfakh,A.A., Bidnenko,V.E. and Chen,L.B. TITLE Efflux-mediated multidrug resistance in bacteria: Similarities and dissimilarities with mammalian system JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.A.Neyfahh, 20-APR-1990. FEATURES from to/span description pept 195 1364 multidrug-resistance protein (BMR) signal 130 135 -35 promoter signal 155 160 -10 promoter BASE COUNT 353 a 300 c 323 g 426 t ORIGIN About 216 min. on B.subtilis map. 1 ggcggatatg gatggagaag tttatacaca tacacccgca gtcattcagg tcctgccaca 61 gcatatcgac atgcttgttc ccgcaaatga ataactgtca taaggcttta gaaagatttt 121 gcaaatccgt tgactctccc ctaggaggag gtcttacagt ataagggata cacacttgaa 181 aaaggggaag tcatatggag aagaaaaata ttaccttaac tatattatta accaatttat 241 ttattgcttt tttggggatc gggcttgtga ttccagtaac gccgaccatt atgaatgaat 301 tgcatttatc ggggaccgcg gtcggctata tggttgcctg cttcgctatt acacagctca 361 ttgtctcacc aatagccgga cgatgggttg atcgcttcgg gcgcaagatc atgatcgtaa 421 tcggcctgtt gttctttagt gtgtcggagt ttttgttcgg cattggaaaa acagttgaga 481 tgttatttat cacccgtatg ctgggcggta tcagcgcacc gttcattatg cccggggtca 541 cggcttttat tgcagatatc acgaccatta aaacacggcc aaaagcgctc ggttatatgt 601 cagccgctat ttcaacagga tttattatcg gccccggcat cgggggattt ttagcagaag 661 tccattcccg gctgcctttt ttctttgcgg cagcttttgc actgttagca gccattttat 721 caatcctcac gctgcgcgag ccggaacgaa accctgaaaa tcaggaaata aaaggacaga 781 agacaggctt taaacgaatt tttgccccca tgtatttcat agcttttctc attatcttaa 841 tttcgtcttt tggtttagca tcatttgaat ctttatttgc attattcgtg gatcataaat 901 tcggatttac ggccagcgac attgccatta tgattacagg aggagcgatt gttggcgcca 961 ttacgcaagt cgtcttattc gaccgcttca caagatggtt tggcgaaatt catttaattc 1021 ggtacagctt aattctctcg acgagtctgg tattcttgct gacaacggta cattcatatg 1081 ttgcgattct gctggtgaca gtcaccgtat ttgtcggatt tgatctcatg cggcctgcgg 1141 taacgactta cctgtcaaag attgcgggaa atgaacaggg gtttgccggc ggtatgaatt 1201 caatgtttac aagtatcggc aatgtattcg ggcctattat cggcggaatg ctgttcgata 1261 tagatgtaaa ctatcctttc tactttgcaa cggtcacctt agccataggg attgcactga 1321 ccattgcttg gaaagcgcct gcacatctta aagccagcac gtgataagaa gcgcattctt 1381 tgtgtactgc aaagaatgcg ct // LOCUS ECOMOR 1500 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli morphology and auto-aggregation control protein (mor) gene, complete cds. ACCESSION M34102 KEYWORDS auto-aggregation control protein; morphology control protein. SOURCE E.coli (strain K12) DNA, clones pSRW[220,226]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1500) AUTHORS Warne,S.R., Varley,J.M., Boulnois,G.J. and Norton,M.G. TITLE Identification and characterization of a gene that controls colony morphology and auto-aggregation in Escherichia coli K12 JOURNAL J. Gen. Microbiol. 136, 455-462 (1990) STANDARD simple staff_review FEATURES from to/span description pept 203 1120 morphology control protein (put.) signal 133 138 -35 region signal 157 162 -10 region rpt 1135 1164 region of dyad symmetry rpt 1167 1211 region of dyad symmetry BASE COUNT 338 a 364 c 428 g 370 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcctgga gatccgcaaa agttcacgtt ggctttagtt attcgagttg agaaactctc 61 gaaacgggca gtgacttcaa gggttaaaag aggtgccgct ccgtttctgt gagcaattat 121 cagtcagaat gcttgatagg gataatcgtt cattgctatt ctacctatcg ccatgaacta 181 tcgtggcgat ggaggatgga taatgaatat tcgtgatctt gagtacctgg tggcattggc 241 tgaacaccgc cattttcggc gtgcggcaga ttcctgccac gttagccagc cgacgcttag 301 cgggcaaatt cgtaagctgg aagatgagct gggcgtgatg ttgctggagc ggaccagccg 361 taaagtgttg ttcacccagg cgggaatgct gctggtggat caggcgcgta ccgtgctgcg 421 tgaggtgaaa gtccttaaag agatggcaag ccagcagggc gagacgatgt ccggaccgct 481 gcacattggt ttgattccca cagttggacc gtacctgcta ccgcatatta tccctatgct 541 gcaccagacc tttccaaagc tggaaatgta tctgcatgaa gcacagaccc accagttact 601 ggcgcaactg gacagcggca aactcgattg cgtgatcctc gcgctggtga aagagagcga 661 agcattcatt gaagtgccgt tgtttgatga gccaatgttg ctggctatct atgaagatca 721 cccgtgggcg aaccgcgaat gcgtaccgat ggccgatctg gcaggggaaa aactgctgat 781 gctggaagat ggtcactgtt tgcgcgatca ggcaatgggt ttctgttttg aagccggggc 841 ggatgaagat acacacttcc gcgcgaccag cctggaaact ctgcgcaaca tggtggcggc 901 aggtagcggg atcactttac tgccagcgct ggctgtgccg ccggaggcca aacgcgatgg 961 ggttgtttat ctgccgtgca ttaagccgga accacgccgc actattggcc tggtttatcg 1021 tcctggctca ccgctgcgca gccgctatga gcagctggca gaggccatcc gcgcaagaat 1081 ggatggccat ttcgataaag ttttaaaaca ggcggtttaa accgtttaac gcagctaccc 1141 gatagcttcg ccatcgtcgg gtagttaaag gtggtgttga cgaagtactc aatagtgttg 1201 ccgccacctt tctgttccat aatcgcctga ccgatatgaa taatttcggc agcgcgctcg 1261 ccaaagcagt gaatacccag aatctctttt gtttcccgat ggaacaaaat tttcagcgtg 1321 cccacgttca tgccgacgat ttgtgcgcgt gccagatgtt taaactgggc gcggcccact 1381 tcatatggca ctttcattgc ggtcagctgc tgttcggttt tgcccacaga gctgatttcc 1441 gggatggtgt aaataccggt agggatatct tcaatcagat gtgcggtggc ttcgcctttt // LOCUS ECOTGMETY 800 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli Met-tRNA-f2 (metY) gene, 5' end. ACCESSION M28401 KEYWORDS transfer RNA-Met. SOURCE E.coli (strain K12) DNA, clones p50L and p50. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 800) AUTHORS Granston,A.E., Tompson,D.L. and Friedman,D.I. TITLE Identification of a second promoter for the metY-nusA-infB operon of Escherichia coli JOURNAL J. Bacteriol. 172, 2336-2342 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.E.Granston, 27-SEP-1989. FEATURES from to/span description tRNA 682 758 Met-tRNA-f2 (metY) anticdn 716 718 Met-tRNA anticodon atg signal 560 565 -35 region promoter P1 signal 584 589 -10 region promoter P1 signal 638 643 -35 region promoter P0 signal 661 666 -10 region promoter P0 mRNA 596 > 800 P1-associated mRNA BASE COUNT 210 a 191 c 190 g 209 t ORIGIN 5 bp upstream of KpnI site. 1 ggtaccagtc acggcgcggc cagcgcgtcg ttttgaaata ggtcaggccg ccggtggtgt 61 tatgaaatgc gccacactga atagcggcaa taccttcggc caccagttgt ttgcggcagt 121 cgatcagacg tgcgttctcc gcgccgtatt ccatggcacg acgagggatc gcatcataat 181 cctcttcgtc tggctggccc aggtttgcag tatatgcata aggaaccgct cccttttgtc 241 gcatccacag cagtgcggca ctggtgtcca gaccgccaga aaaagcgata ccaatacgtt 301 gacctaccgg gagatgcttg agaatcgtcg tcataaaata acaccctgct taattaactg 361 atgatgagcc tggatttccg ctctcactga atttttatgc aaaataaatg agttttcatt 421 taatcatctt ttatcggaga caggaagagt ttagtgtgtt ttttgtaaaa taatgcgctt 481 aagggagagc aggagaaggc aaaagtattc aacaaatgaa agtgaactgg atattcattc 541 acatgattag caataaacgt tgacaaaatg tggcgtggat cactataatg cctgcagatt 601 ttacgtcccg tctcggtaca ccaaatccca gcagtatttg cattttttac ccaaaacgag 661 tagaatttgc cacgtttcag gcgcggggtg gagcagcctg gtagctcgtc gggctcataa 721 cccgaagatc gtcggttcaa atccggcccc cgcaaccact ttcccttaga gtcctttttc 781 aaatatactg tgaagacttc // LOCUS RAT5HT1A 2314 bp ds-DNA ROD 17-JUL-1990 DEFINITION Rat 5-hydroxytryptamine-1a receptor (5-HT-1a) gene, complete cds. ACCESSION J05276 KEYWORDS 5-hydroxytryptamine-1a receptor. SOURCE Rat DNA, clone D. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2314) AUTHORS Albert,P.R., Zhou,Q.-Y., Van Tol,H.H.M., Bunzow,J.R. and Civelli,O. TITLE Cloning, functional expression, and mRNA tissue distribution of the rat 5-hydroxytryptamine-1a receptor gene JOURNAL J. Biol. Chem. 265, 5825-5832 (1990) STANDARD simple staff_review FEATURES from to/span description pept 123 1391 5-hydroxytryptamine-1a receptor (15-HT-1a) BASE COUNT 528 a 617 c 591 g 578 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccctc cctctgaaac tccccagaaa agaaggaaga ccctaggggg aggggcgagg 61 gagatcccct tgcagttctt tcccccctcc cttcccccac cccagcggga gtgcaggcag 121 gcatggatgt gttcagtttt ggccagggca acaacaccac agcgtcccag gagccctttg 181 ggaccggcgg caacgttact agcatctccg acgtgacctt cagctaccaa gtgatcacct 241 ctctgttgct gggtactctc attttctgcg cggtgctcgg caatgcctgc gtggttgctg 301 ccatcgccct ggagcgctcc ctccagaatg tggccaacta tctcatcggc tccttggcgg 361 ttactgatct catggtatca gtgctggtgc tgcccatggc tgctctgtac caggtgctca 421 acaagtggac cctgggccag gtcacctgcg acctgtttat cgccctggat gtactgtgct 481 gcacctcgtc catcctgcac ctgtgcgcca tcgcgctaga caggtactgg gctatcaccg 541 accctataga ctatgtgaac aaaaggacgc cccggcgcgc cgctgcgctg atctcgctca 601 cttggctcat tggctttctc atctccatcc cgcccatgct gggctggcgc accccggaag 661 accgatcgga ccccgacgcg tgcaccatca gcaaggacca cggctacacc atctactcca 721 ctttcggcgc tttctatatc ccgctgttgc tcatgctggt tctctacggg cgcatcttca 781 gagccgcacg cttccgaatc cgcaagactg tcaggaaggt ggaaaagaag ggagcaggca 841 ccagcttagg aacttcgtcg gcaccgcccc ccaagaagag cctgaacgga cagccaggta 901 gtggggactg gaggcgctgt gctgagaaca gggcggtggg gactccgtgc actaatgggg 961 cagtgaggca gggtgacgac gaggccaccc tggaggtgat cgaggtgcac cgagtgggca 1021 actccaaaga gcaccttcct ctgcccagcg agtcaggatc taactcctat gcccccgcct 1081 gcttggagag aaaaaatgag cgcaatgctg aagcaaagcg caagatggcc ttggcccgtg 1141 aaaggaagac ggtgaagact ctgggcatca tcatgggcac cttcatcctc tgctggctgc 1201 cgtttttcat tgtggctctg gtcctgcctt tctgtgaaag cagctgccac atgcctgcgt 1261 tgttgggtgc cataattaac tggctaggct actccaactc cctgctcaat ccggttattt 1321 atgcttattt caacaaagac tttcaaaacg cttttaagaa gataatcaag tgcaagttct 1381 gccgccgatg atgatgatga tggtggtggt aatagtggtg actatggaga aggaggagga 1441 ggaggaagag gaggaggaag aagaggaaga ggaggaagag tgtagggttt acaggaccct 1501 ccccattcac tcagtaccct agccctggaa gcaacaccta agataacttg cttttcctag 1561 aacatgcagc ttctcagcac atggctttca gaccttgttc tctctatatc tcgtttcaag 1621 gggacagtac tctttgcaga aatcttagag aaccactagt atctgtgtga aaagttattg 1681 gcttggtctt tggccctagg atcagttttg gtagttgtaa taacatcctc cctattcaaa 1741 atcccaattc ttcacgatgg aattttaaac cttggtgatt caaggtagaa agccgaaaca 1801 gaaaatctaa actgtcctga catctgccag gcccactaga actgtcactc tctgccctac 1861 ttctggaact ttgaggatca agactgctct gtctctcact ccctcttgct ctagaaacat 1921 ctgttccctt tgggtgtcaa tctacactgt tgaactggtc tccatgctgg tctggaaaag 1981 cagttggaag taactactct gggttatgtg ctctggaaat ttttgaatag caaaagatcc 2041 cccccacccc tgcacaacac ccactttctg tatttttttt aaattttttt ttaatttttg 2101 aactcaaaat caagtggcaa aagtgtctgt tggagtggac ttataagtag gtctggctcg 2161 taggaagagc caggaaggtc tgtggatggt ttcttgtcct ggtgttaaca ctcaccaaag 2221 aaactgacaa tcagactatt tggaaggaag agttttattc ctacagtttg ttggcctttc 2281 tgtaagttct ggcaaggctc tgtgtcttct gcag // LOCUS RATFCGRE 1341 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat Fc-gamma receptor mRNA, complete cds. ACCESSION M32062 KEYWORDS Fc-gamma receptor. SOURCE Rat (strain Fisher 344) natural killer cell, cDNA to mRNA, clone 6a6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1341) AUTHORS Zeger,D.L., Hogarth,P.M. and Sears,D.W. TITLE Characterization and expression of an Fc-gamma receptor cDNA cloned from rat natural killer cells JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3425-3429 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.W. Sears, 12-FEB-1990. FEATURES from to/span description pept 50 853 Fc-gamma receptor BASE COUNT 374 a 346 c 319 g 302 t ORIGIN 1 ccagcaattt catctcctag acctcatcag actctgaccc agttcttgaa tgactttgga 61 gacccagatg tttcagaatg cacattctgg aagccaatgg ctactcccac cactgacaat 121 gttgctgctg tttgcttttg cagacaggca gacgggagat cttctgaagg ctgtggtgaa 181 acgtgatccc ccatggatcc aggtgctcaa ggacgacact gtgacgctga cgtgcgaagg 241 gacccacaat cctggaaact cttctaccca gtggttccac aaccagagct ccacctgggg 301 ccaggtccaa gccagctaca cgtttaaggc cacagtcaat gacagtggag aataccggtg 361 ccgaatggcg cacaccagcc tcagcgaccc catacatctg gaagtgattt ctgactggct 421 gctgctccag acccctcaac tggtgtttga ggaaggggaa accatcacat taaggtgcca 481 tagctggaag aacaaacagc tgactaaggt cttgttgttc cagaatggaa aacctgtgag 541 gtattattat caaagcagca atttttcaat ccccaaagcc aaccacagtc acagtgggaa 601 ctactactgc aaagcatatc taggaaggac aatgcatgtg tccaagcctg tcaccatcac 661 tgtccaaggt tcagcaaccg cgtccaccag ctctctagtg tggttccatg ccgctttctg 721 cctagtgatg tgcctcctgt ttgcagtgga caccggcctg tatttctgtg tacggagaaa 781 tcttcaaacc tcgggggagg actggaggaa atccctgtca gtcggaaagt acaaggctcc 841 acaggacaaa tgacatccca tcgtatggct aaacagcggc agcttctttt cagccacacc 901 gccgcttcat ctacagcctt cctttgaaag caacttacca ccaggctgga tatttggtcc 961 tgcaatcaca actgctagga ggctgaggca ggaggatcac caaaggctgc ccgggtttta 1021 gagaagagag agtgcaagtc tatcctggat aacccaataa gaccctgggt ttagatggct 1081 tatcaggaaa gagcacccat tgctaagctc acagacaaga ccacagattg atccctaagg 1141 acacagggca aagggggaga acaaattcct gaaagttgtc ctctgacctc cacacgtcca 1201 ccatggcaga tgcacacaat aaacaaatta aaatgtaata tatatatttt aaacaagagg 1261 tggggaaagg ctgggggttg gtggcacagt ttttatccag gacacatgat gctctggatt 1321 ttgacctcct ataataaaac g // LOCUS YEPYOPYER 1527 bp ds-DNA BCT 17-JUL-1990 DEFINITION Y.pestis virulence determinant (yopE) and yopE regulator (yerA) genes, complete cds. ACCESSION M34279 KEYWORDS virulence detrminant; yopE regulator. SOURCE Y.pestis DNA, clone pYV019. ORGANISM Yersinia pestis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1527) AUTHORS Forsberg,A. and Wolf-Watz,H. TITLE Genetic analysis of the yopE region of Yersinia spp.: Identification of a novel conserved locus, yerA, regulating yopE expression JOURNAL J. Bacteriol. 172, 1547-1555 (1990) STANDARD simple staff_review FEATURES from to/span description pept 476 84 (c) yopE regulator (yerA) pept 670 1329 virulence determinant (yopE) BASE COUNT 426 a 329 c 339 g 433 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccgaca ttactaagtg agcgttgtat ctaatcctgg gggcaggtca ttccgtataa 61 taaggcaaca accaaaaatc tactcaacta aatgaccgtg gtggtgagat tagtgatgag 121 gtttgtagcc gttcagcccc ctgcaccagc atctcaagct gagtatatag tgagttatta 181 tccaggctgt tcaatggttg tcgattccat aacactgggt gccccccaac ctcgtcccag 241 gataagatgg gttttaatat atcttgactg aatatattat ggctaagtaa ggtttccttt 301 tcatcattat tgtcaagaga aggtagggta aacattaata tttgcccgac aggatgctct 361 gttatatggc aggcgaattc cccaactttg acaccgataa ccggttcaat agtatctgga 421 atagacaacg aaagttgttg aaataattga gtgatagctt gttcaaatga atacattatg 481 atctcataat agttagataa aatatcaact taaccaaagc actctcggca gaccatcaat 541 tttagcctat aatttttagt ttttgttttg tctaatataa caacaaaaac agcagcgatt 601 ttttatatag ccatcggcta ttttcccact aagataacct tgttttaata gccaaggtaa 661 taaatagtca tgaaaatatc atcatttatt tctacatcac tgcccctgcc gacatctgtg 721 tcaggatcta gcagcgtagg agaaatgtct gggcgctcag tctcacagca aacaagtgat 781 caatatgcaa acaatctggc cgggcgcact gaaagccctc agggttccag cttagccagc 841 cgtatcattg agaggttatc atcagtggcc cactctgtga ttgggtttat ccaacgcatg 901 ttctcggagg ggagccataa accggtggtg acaccagcac ccacacctgc acaaatgcca 961 agtcctacgt ctttcagtga cagtatcaag caacttgctg ctgagacgct gccaaaatac 1021 atgcagcagt tgaatagctt ggatgcagag atgctgcaga aaaatcatga tcagttcgct 1081 acgggcagcg gccctcttcg tggcagtatc actcaatgcc aagggctgat gcagttttgt 1141 ggtggggaat tgcaagctga ggccagtgcc atcttaaaca cgcctgtttg tggtattccc 1201 ttctcgcagt ggggaactat tggtggggcg gccagcgcgt acgtcgccag tggcgttgat 1261 ctaacgcagg cagcaaatga gatcaaaggg ctggcgcaac agatgcagaa attactgtca 1321 ttgatgtgat atggataaaa acaaggggat agtgtttccc cctttttcta tcaatattgc 1381 gaatatcttc gtccctgatc tttcaggggc gaatcgtttt ttagcatgct cattgttaga 1441 atttctgact tatctctctt ctgtattact actcatgctc tggaaaatcc tgaacatgta 1501 tatctatgga ttgatgcagc actcgag // LOCUS YEPYOPYER1 1762 bp ds-DNA BCT 17-JUL-1990 DEFINITION Y.enterocolitica virulence determinant (yopE) and yopE regulator (yerA) genes, complete cds. ACCESSION M34278 KEYWORDS virulence determinant; yopE regulator. SEGMENT 1 of 2 SOURCE Y.enterocolitica DNA, clone pYV8081. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1762) AUTHORS Forsberg,A. and Wolf-Watz,H. TITLE Genetic analysis of the yopE region of Yersinia spp.: Identification of a novel conserved locus, yerA, regulating yopE expression JOURNAL J. Bacteriol. 172, 1547-1555 (1990) STANDARD simple staff_review FEATURES from to/span description pept 609 217 (c) yopE regulator (yerA) pept 803 1462 virulence determinant (yopE) rpt 1468 > 1762 insertion sequence rpt 1464 1467 5' insertion target sequence BASE COUNT 497 a 395 c 382 g 488 t ORIGIN 1 ccggtactcc tcattaccaa tgaggaggcg gtgattccac caagtgaatg ggcttaatac 61 aagtctttta cattaaaaat tgttaaaaaa cgggagataa aagtcaacac tccaacttgg 121 tgttaacttt tactgagcga aatctgatat tgctggcacc acaaatttat aggttatcgc 181 tatttccgta taataaggca acaaccaaaa atctactcaa ctaaatgacc gtggtggtga 241 gattagtgat gaggtttgta gccgttcagc cccctgtacc agcatttcaa gctgagtata 301 tagcgagtta ttatccaggt tgttcaatgg ttggcgattc cataacactg ggtgcccccc 361 gacctcgtcc caagataaga tgggttttaa tatatcttga ctgaatatat tatggctaag 421 taaggtttcc ttttcattat tattgtcaag agaaggcagg gtaaacatta atatttgccc 481 gacaggatgc tctgttatat ggcaggcgaa ttccccaact ttgacaccga taaccggttc 541 aatagtatct ggaatagaca gcgaaagttg ttgaaataat tgagtgatag cttgttcaaa 601 tgaatacatt atgatctcct aatagttaga taaaatatca agttaaccaa agcactctcg 661 gcagaccatc aatttgagcc tataattttt agtttttatt ttgtctaata taacaacaaa 721 aacagcagcg attttttata tagccaccgg ctattttccc actaagataa ccttgtttta 781 atagccaagg gaataaatag tcatgaaaat atcatcattt atttctacat cactgcccct 841 gccgacatca gtgtcaggat ctagcagcgt aggagaaatg tctgggcgct cagtctcaca 901 gcaaaaaagt gagcaatatg caaacaatct ggctgggcgc actgaaagcc ctcagggttc 961 cagcttagcc agccgtatca ctgagaagtt atcatcaatg gcccactctg cgattgaatt 1021 tatcaaacgc atgttctcgg aggggagcca taaaccggtg gtgacaccag cacccacgcc 1081 tgcacaaatg ccaagtccta cgtctttcag tgatagtatc aagcaacttg ctgctgagac 1141 gctgccaaaa tacatgcagc agttgagtag cttggatgca gagacgctgc agaaaaatca 1201 tgatcagttc gccacgggca gcggccctct tcgtggcagt atcactcaat gccaagggct 1261 gatgcagttt tgtggtgggg aattgcaagc tgaggccagt gccatcttaa acacgcctgt 1321 ttgtggtatt cccttctcgc agtggggaac tattggtggg gcggccagcg cgtacgtcgc 1381 cagtggcgtt gatctaacgc aggcagcaaa tgagctcaaa gggctggcgc aacagatgca 1441 tcaattactg tcattgatgt gatatggtga ctagtcctgc tataggttga cacttttcag 1501 ccttaaaacg cctgatgaac ttcatcgggc gttttgtatt ttaacgacag atgcggtcgc 1561 cgcgtgttgt agatttccac tgactccctg accatctttc ttgcctggcc aatatcttcc 1621 ggcttaacca gcagatattc catttttaat atcccattga cccgctccgc cagcgcattc 1681 tgataacaat catatccatc cgtcatcgaa cagatgacac cgtgacgttg atgtaacgcc 1741 tgatattctg ttgaacagta ct // LOCUS YEPYOPYER2 546 bp ds-DNA BCT 17-JUL-1990 DEFINITION Y.enterocolitica DNA with insertion element, 3' end. ACCESSION M34280 KEYWORDS insertion element. SEGMENT 2 of 2 SOURCE Y.enterocolitica DNA, clone pYV8081. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 546) AUTHORS Forsberg,A. and Wolf-Watz,H. TITLE Genetic analysis of the yopE region of Yersinia spp.: Identification of a novel conserved locus, yerA, regulating yopE expression JOURNAL J. Bacteriol. 172, 1547-1555 (1990) STANDARD simple staff_review FEATURES from to/span description rpt < 1 242 insertion element rpt 243 246 3' insertion target sequence BASE COUNT 144 a 119 c 109 g 174 t ORIGIN About 650 bp after segment 1. 1 gatccgtgac ccagaaagac aaatccgtca ttaacttgcg tagccacttc aggacggtag 61 aacggccttg aataccgttg cggtcttgcg ctgtcggtaa gtcatttccc ttttttcgac 121 ctgttcgaca acggccaatt taaaggatag aggataatgc cgttgcgtgc gtttaacata 181 ttggttcatc acattttcct aagtacatga gttaaatgtg tcaacgctat ttaggacggg 241 tcatggataa aaacaagggg atagtgtttc cccctttttc tctcaatatt gcgaatatct 301 tcgtccctga tctttcaggg gcgaatcgtt ttttagcatg ctcattgtta gaatttctga 361 cttatctctc ttctgtatta ctactcatgc tctggaaaat cctgggcgtc tatattctat 421 ggattgatgc aacactcgag aaatcaaaat atcattgcta agcgttatat agtatatacc 481 gtgcttttta tatgaaaacg gcgcatgtca gagcaaatcc agttacactc agcccctaac 541 tctaga // LOCUS HUMCDR34 2412 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human cerebellar-degeneration-related antigen (CDR34) gene, complete cds. ACCESSION M31423 KEYWORDS cerebellar-degeneration-related antigen. SOURCE Human neuroblastoma BE(2)-88n cell line DNA, clone lambda CDR34. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2412) AUTHORS Chen,Y.-T., Rettig,W.J., Yenamandra,A.K., Kozak,C.A., Chaganti,R.S.K., Posner,J.B. and Old,L.J. TITLE Cerebellar degeneration-related antigen: A highly conserved neuroectodermal marker mapped to chromosomes X in human and mouse JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3077-3081 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.-T.Chen, 17-JAN-1990 FEATURES from to/span description pept 503 1174 cerebellar-degeneration-related antigen (CDR34) BASE COUNT 743 a 334 c 669 g 666 t ORIGIN 1 atgttggttc ataagatctg gtctataagg aggaatgtcc cattaaatgt ttttgaagct 61 aattcaacta gaagcagaaa tagttgagtt ggaagatttt ctgtagagtg attttaacat 121 gggaaggctc agacagggga agcctagatt tgaaaaggcc tggacctggg gaaaggctgg 181 caagatctgg actatagaac atgttagaat actgatattc gcagacacct ggaagactga 241 atgtcagaag atcagcacac tggagacgtt ggaagacatg gatattgagc cagttgatgg 301 aagactgggt agttgttgga agacatcaag gtgctggaag acacagcagc atgctggaag 361 acctggagat gttggaagac gagcagactc ctggaagccc tggagatgct gcaagacctg 421 gagatatagg aagacactgg actttgttgc gagcttagtt ggaagacata tatttttgga 481 agacgtggat tttctggaag acatggcttg gttggaagac gtggattttc tggaagacgt 541 acctttgttg gaagacatac ctttgttgga agacgtacct ttgttggaag acgtaccttt 601 gttggaagac acaagtaggc tggaagacat taatttgatg gaagacatgg ctttgttgga 661 agacgtggat ttgctggaag acacggattt cctggaagac ctggattttt cggaagctat 721 ggatttgagg gaagacaagg attttctgga agacatggat agtctggaag acatggcttt 781 gttggaagac gtggacttgc tggaagacac ggatttcctg gaagacccgg attttttgga 841 agctatagat ttaagggaag acaaggattt tctggaagac atggatagtc tggaagacct 901 gaggccattg gaagatgtgg attttctgga agacatggct tttttggaag acgtagattt 961 tcaggaagac ccaaattatc cggaagactt ggattgttgg gaagacgtgg attttctgga 1021 agactggagg ttactggaag acatggattt tctggaagac atggattttc tggaagacgt 1081 ggatcttcag gaagacatat attggctgga agacctggat tttttccgga agatgtggat 1141 tgactggaag acctggattt ggtggaagac gtagattttc tggaagacac tgactgactg 1201 gaagacactg attgactgga agacctggat ttctttctgg aagacactga ttgactggaa 1261 gatctagatt tttctggaag aactagattt actggaagac ttggatttgg tggaagacac 1321 agatttttct ggaagacatg gattagctgg aagatctgta tttgatggaa gaccttgaaa 1381 ttattggaag acatggattt cctggaagac gtggattttc ctggaagatc tggatttggt 1441 ggaagaccag taattgctgg aagactggat ttgctggaag acttgattta ctggaagact 1501 tggagcttct tggaagacat ggattgtccg gaagacatgg attgtctgga agatgtggat 1561 tttctggaag ctcaggatta tctggaagac cttgagatta ttggaacact tgaagtcgct 1621 ggaagacccg agttgttgga agaccttgta cacaggtgcc atcggaactc ctgacattga 1681 aacattgtaa gcacaggata ttgagacatt gcaagccttg attttaagac atggtactct 1741 ggacattgat atttctgagg ccctgaacat tgggatatta atattggaag tcatagacac 1801 tgaaatctct ggaaattaga gatattgtaa gtcctgtacc ttggaactcc taaatactgg 1861 cagatataaa caacagcaga tgtagacatt tataaatcct aaaatgagaa gccctggata 1921 ttgggagaca ttggtaagca tggatacttg acatatttat gtcaaaaaga cagtttggaa 1981 gaattaaatt ttaaagatgc tccatgtcaa gaatactggc agcctggaca atatgagacc 2041 aggatattaa gaggtctatt cattcagaca ttgaggatat tgatgtacct gaaagttctt 2101 gcaggtattt aaagacttga gcattggagg aattggcgat aaaaatacac tgtaaaacta 2161 gaaagtagga gacatttaaa aatgtaaaaa ctgaatgatg taagtgctgg aagacattga 2221 agaatctaga agacctgtat ataggagaca ttggaggatt aggaccatgg ccgacttgta 2281 atttagaact ctggattctg aaagacaaga cctggacttt gaagaagggt tgttggagat 2341 attagaagac ctaaattttt aatgacttga atactgggag tttagaaaac aagggcattt 2401 gagatgctgc ag // LOCUS RATHGF 2485 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat hepatocyte growth factor mRNA, complete cds. ACCESSION M32987 KEYWORDS hepatocyte growth factor. SOURCE Rat (strain Wistar) adult liver, clones RBC[1,3] and RAC[1,2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2485) AUTHORS Tashiro,K., Hagiya,M., Nishizawa,T., Seki,T., Shimonishi,M., Shimizu,S. and Nakamura,T. TITLE Deduced primary structure of rat hepatocyte growth factor and expression of the mRNA in rat tissues JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3200-3204 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Nakamura, 26-MAR-1990. FEATURES from to/span description pept 143 2329 hepatocyte growth factor BASE COUNT 750 a 533 c 577 g 625 t ORIGIN 1 gtttagtcct agatctttcc agttaatcac acaacaaact tagctcatcg caataaaagc 61 agctcagaac cgaccggctt gcaacaggat tctttcagcc cggcatctcc tgcagaggga 121 tcagcctgct cgaactgcaa gcatgatgtg ggggaccaaa cttctgccgg tcctgttgct 181 gcagcatgtc ctgctgcacc tcctcctgct tcctgtcacc atcccctatg cagaaggaca 241 gaagaagaga agaaatactc ttcatgaatt caaaaagtca gcaaaaacta ctcttaccaa 301 ggaagaccca ttagtgaaga ttaaaaccaa aaaagtgaac tctgcagatg agtgtgccaa 361 caggtgcatc agaaacaagg gctttccatt cacttgcaag gcctttgttt ttgataagtc 421 gagaaaacga tgctactggt atcctttcaa tagtatgtca agtggagtga aaaaagggtt 481 tggccatgaa tttgacctct atgaaaacaa agactatatt agaaattgca tcattggtaa 541 aggaggcagc tataagggga cagtatccat cactaagagt ggcatcaagt gccagccttg 601 gaattccatg atcccccatg aacacagctt tttgccttcg agctatcgcg gtaaagacct 661 acaggaaaac tactgtcgaa atcctcgagg ggaagaaggg ggaccctggt gtttcacaag 721 caatccagag gtacgctacg aagtctgtga cattcctcag tgttcagaag ttgaatgcat 781 gacctgcaac ggtgaaagct acagaggtcc catggatcac acagaatcag gcaagacatg 841 tcagcgctgg gatcagcaga caccacaccg gcacaaattc ttgccggaaa gatatcccga 901 caagggcttt gatgataatt attgccgcaa tcccgatggc aagccgaggc catggtgcta 961 cactcttgac cctgacaccc cttgggagta ttgtgcaatt aaaatgtgcg ctcacagtgc 1021 tgtgaatgag actgatgttc ccatggaaac aactgaatgt ataaaaggcc aaggagaagg 1081 ttacagggga accaccaata ccatttggaa tggaattccg tgtcagcgtt gggattcgca 1141 gtacccccac aagcatgaca tcactcccga gaacttcaaa tgcaaggacc ttagagaaaa 1201 ttattgccgc aatccggatg gggctgaatc accatggtgt tttaccactg atccaaacat 1261 ccgagttggt tactgctctc aaattcccaa atgtgacgtg tcaagtggac aagattgtta 1321 tcgtggcaat gggaaaaact acatgggcaa cttatccaaa acaaggtctg gactcacatg 1381 ttccatgtgg gacaagaata tggaggattt acaccgtcat atcttctggg agccagacgc 1441 tagcaagttg actaagaatt actgccggaa ccccgatgac gacgcccatg gaccttggtg 1501 ctacacaggg aatcctctcg ttccttggga ttattgccct atttcccgtt gtgaaggaga 1561 tactacacct acaattgtca atttggacca tcctgtaata tcctgtgcca aaacaaaaca 1621 actgcgagtt gtaaatggca ttccaacaca aacaacagta gggtggatgg ttagtttgaa 1681 atacaggaat aaacacatct gtgggggatc attgataaag gaaagttggg ttcttactgc 1741 aaggcaatgt tttccagcta gaaacaaaga cttgaaagac tatgaagctt ggcttggaat 1801 ccatgatgtc catgagagag gcgaggagaa acgcaaacag atcttaaaca tttcccagct 1861 agtctatgga cctgaaggct cagatttggt tttactgaag cttgctcgcc ctgcaatcct 1921 ggataacttt gtcagtacaa ttgatttacc tagttatggc tgtacaatcc ctgaaaagac 1981 tacttgcagt atttacggct ggggctacac tggattgatc aacgcagatg gtttattacg 2041 agtagctcat ctgtatatta tggggaatga gaaatgcagt cagcaccatc aaggcaaggt 2101 gactttgaat gagtctgaat tatgtgctgg ggctgaaaag attggatcag gaccttgtga 2161 gggagattat ggtggcccac tcatttgtga acaacacaaa atgagaatgg ttcttggtgt 2221 cattgttcct ggtcgtggat gtgccatccc aaatcgtcct ggtatttttg ttcgagtagc 2281 atattatgca aaatggatac acaaagtaat tttgacatac aagttgtaat agccatagaa 2341 gaggccagtg tatttgaagc atccatggat acaggaagat ttccaagact tcaggattaa 2401 aatgtcacct aaaacaatcc taaaacaact acttgagtgt tgtgagtgtt cagatactca 2461 ttaatatatg tggcgttttc tgttg // LOCUS MUSIMPD 1707 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse IMP dehydrogenase mRNA, complete cds. ACCESSION M33934 KEYWORDS IMP dehydrogenase. SOURCE Mouse, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1707) AUTHORS Tiedemann,A.A. and Smith,J.M. TITLE Isolation and DNA sequence of a cDNA encoding mouse IMP dehydrogenase JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.M.Smith 27-APR-1990. Author address: Dr. J.M. Smith Seattle Biomedical Research Institute 4 Nickerson St. Seattle, WA 98109 FEATURES from to/span description pept 67 1611 IMP dehydrogenase (EC 1.2.1.14) BASE COUNT 457 a 408 c 451 g 391 t ORIGIN 1 cctctgcggc gcggtcctcg gagcggcgcg gttctcggag ccacgcgtct gtcttcctcc 61 gtggtcatgg cggactacct gattagcgga ggcacctctt acgtgccgga cgacgggctc 121 acagcgcagc agctcttcaa ctgcggggac ggcctcacct acaatgattt tctcattctt 181 cctgggtata tcgacttcac tgcagatcag gtggacttga cgtctgctct aactaagaag 241 attacactaa agaccccatt ggtttcctca cccatggaca ctgtcacaga ggctggaatg 301 gccatcgcga tggcgcttac aggaggtatt ggtttcatcc accacaactg cacacctgaa 361 ttccaggcca atgaagttcg gaaagtgaag aaatacgaac agggattcat cactgacccc 421 gtggtcctta gccccaagga tcgtgtacgc gatgtttttg aggccaaagc caggcatggc 481 ttctgtggta tccccatcac agatacaggc cggatgggga gtcgattggt gggcatcatc 541 tcctcaaggg acattgattt cctcaaggag gaagagcatg accggttctt ggaagagatc 601 atgactaaga gggaagattt ggtggtcgcc cctgccggcg tcactctgaa agaggcaaat 661 gagattctgc agcgaagtaa aaagggaaag ttgcccattg tgaatgaaaa tgatgagctg 721 gtagccatca ttgcccggac agacctaaag aagaatcgtg attaccccct ggcctccaaa 781 gatgccaaga agcaactgct gtgtggggca gccattggca ctcatgagga tgacaagtat 841 aggctggact tactggccct tgctggtgtg gatgtagtgg ttttggactc ttcccaggga 901 aactccatct tccaaatcaa tatgatcaaa tacatcaagg agaagtatcc cagtctacag 961 gtcattggag gcaatgtagt cactgctgcg caagccaaga acctcataga tgcaggtgta 1021 gatgctttgc gagtcggcat gggaagtggt tccatctgca tcacccagga agtgttggcc 1081 tgtgggcggc cccaagccac agcagtgtac aaggtctctg agtatgcccg tcgctttggt 1141 gttcctgtta ttgctgatgg aggaatccaa aatgtgggtc atattgccaa agctttggct 1201 cttggggctt ccacagtcat gatgggctcc ctcctggctg ccaccactga ggcccctggc 1261 gagtacttct tctcagatgg gatccggctg aagaaatacc gaggtatggg ttctcttgat 1321 gccatggaca aacatctcag cagccagaac cgatacttca gtgaagctga caaaatcaaa 1381 gtggcccaag gagtttcagg ggcagtgcag gacaaggggt ctatccacaa gttcgttcct 1441 tacctgattg ctggcatcca gcattcctgt caagacattg gtgccaagag tttaacccaa 1501 gtcagagcca tgacgtactc gggggagctt aaatttgaga agaggacatc ctctgctcag 1561 gtggaaggtg gcgtccacag cctccattcg tacgagaaac ggcttttctg aaaacagatc 1621 cagtatatgc cttgaatttt tcaataaaag tttgggaaaa aaaaagtgaa aaaaaaaaaa 1681 aaaaaaaaaa aaaaaaaaaa aaaaaaa // LOCUS NEUPSAS 3681 bp ds-DNA PLN 17-JUL-1990 DEFINITION N.crassa arginine-specific carbamoyl phosphate synthetase (arg-2) small subunit gene, complete cds. ACCESSION J05512 KEYWORDS carbamoyl phosphate synthetase. SOURCE N.crassa (strain 74-OR23-IVA)asexual phase DNA, clone pAR231, and cDNA to mRNA, clone pAB1. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 3681) AUTHORS Orbach,M.J., Sachs,M.S. and Yanofsky,C. TITLE The Neurospora crassa arg-2 locus: Structure and expression of the gene encoding the small subunit of arginine-specific carbamoyl phosphate synthetase JOURNAL J. Biol. Chem. 265, 10981-10987 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.S.Sachs, 07-MAY-1990. FEATURES from to/span description pept 1451 1630 arginine-specific carbamoyl phosphate synthetase small subunit, exon 1 (arg-2) (EC 6.4.5.5) 1687 2868 arginine-specific carbamoyl phosphate synthetase small subunit, exon 2 pept 870 902 ORF, exon 1 1270 1282 ORF, exon 2 1359 1387 ORF, exon 3 pre-msg 752 3082 arg-2 mRNA and introns pre-msg 809 3082 arg-2 mRNA and introns (alt.) IVS 1631 1686 arg-2 intron A IVS 903 1269 ORF intron A IVS 1283 1358 ORF intron B signal 161 166 TGACTC box signal 568 573 TGACTC box signal 1008 1013 TGACTC box signal 1307 1312 TGACTC box BASE COUNT 760 a 1093 c 863 g 965 t ORIGIN 1 ggatccagac gctatatttg attgttccct cttctgtcag cgtgcggcct ccgttgtccg 61 tgcagccgat tacgacggcc ggctctggcg caacctgggt ttgggagctt cagaagcttg 121 aaagcgaggc gaatcgcgac tgtacaagtc cccgcaactt gagtcaccga aagcgcgggg 181 tatccaccat ctttatctgc ggcggtaggt aggcggggac tttgcggggg ctttgcggga 241 cccgaccact ccaactccga gattcgttgg gggtgccccg ggcaggagat gccgagctgg 301 agtcgcgttg tttgtgaatg tcgtggctgg ggttgtgggc ataggttgcg atgtgtagtg 361 tacgtagtac atagtaggta tgttgaacac cggatgggtg attccagttg tgaggctccg 421 ggactagtag gggtggtaac cgtgatggtc aaagggcatg tggaacttgc agcgtcccaa 481 cgaagtccag gtccccagtc atcccaacgc cgcgatcaat cacccctcgg tccccgtaaa 541 tcggcctttg ggactgccta gcagcgggag tcaccgagtc ctggaccaat caggtcggcg 601 ctgaccctgt cactgcggca gaaaaaagac aaagagctca aaaaattcta ccagatccaa 661 tcaaatgtcg ccctttttct atttgcctca cccactttcg caccctcttt tatctccaag 721 tttctcccct cgacactaca actcactccg tcatcaactc ttgcccatct acaaaacaca 781 caataacccc aatcgcttcc cccgactttc ggactctttc gtccaacctt ctttcttcag 841 ctgaacttgt cttgtcgcaa tctgccctta tgaacgggcg cccgtcagtc ttcacctctc 901 aggtacgata acgcgatagc ccctccatcg tgctggtatc cagccaccat ccacccagtc 961 catcaaagaa gttcccccgg atgagcaatg aagcccccat accagcctga ctcgcgatgc 1021 tgtatcggag tagttgggac gacccgccag taacccacca acctgggtgc ttggcatcat 1081 gccgctctga gttggttttg ggagatgtgg ccagcggtga tccccaaagt tcagctcttc 1141 acggattccg attgcatatc tctcaagatc gcgcagacag ccatcctggg ggagacttct 1201 tctcctcgct tctgccgctt gccgtctgtt tgtagtgtct gatcaaaact aattctgttg 1261 cctttgtagg attacctctc aggtccgcat ccccgccgac tcccgttgac tcaccctttt 1321 ccggctggaa tactgacgat gaattttgtg tgttacagac catctgtgga gagcccttaa 1381 cgcataagag cctctcatca cccagcagcc gtaccaatca ccaccgcacc ccatcaccat 1441 tcaagtcaag atgttctctc gcttggccgc tcgtctcccc aaggcttctg ccctcaatgg 1501 cgtcgccgcc cgtcaggttc gcaacctgag ccagcccgcc attaccggca gcaagggcag 1561 gaacatgccc gcccgtgagc cgcgcactac tgccgctgcc accggcgccg aggccacctt 1621 cactatcagg gtgcgtacct acaactagtc ttctctcgtg gctaaaccta acattttctc 1681 gctcaggatg gccccgtttt ccagggtacc gccttcggtg ccaacaccaa catttctggt 1741 gaagccgttt tcaccacctc ccttgttggt taccccgagt cgatgaccga tccctcgtac 1801 cgcggccaga ttttggtctt cacccagccc ttgattggca actacggtgt cccttcgaat 1861 gagcgtgacg agttcaacct cctaagtact tcgagtctcc ccacatccag tgcgccggta 1921 tcgttgtctc cgatgtcgct acccagtaca gccactggac tgctgttcag agcttgggcg 1981 agtggtgcgc cagcgagggc attcccgcca ctctccggtg ttgacactcg tgcgattgtc 2041 acctacctcc gtgagcaggg ttcctctctc gccaggatct ccattggcga cgagtacgac 2101 gccgatgagg atgagggctt cattgaccct ggccagatca accttgtcaa gcgcgtgtcc 2161 accaaggctc cctttgtggt taccaacccc aacgccaagt tccacgtcgc tctgatcgac 2221 tgcggtgtca aggagaacat ccttcgtagc ttggtcagcc gtggcgcctc cgtcaccgtc 2281 ttcccctaca attaccccat ccacaaggtt gccgagaact tcgatggtgt cttcatttcc 2341 aacggccctg gtgatcccac ccactgccag gagactgttt acaacctcgc caagcttatg 2401 gagacttccc ctattcccat catgggtatc tgccttggtc accagcttct tgctctcgct 2461 gttggtgcca agaccatcaa gctaagtatg gtaatccgtg cccacaacat cccggctctt 2521 gacttgacca ctggtcagtg ccacatcacc tcgcagaacc acggttacgc cgtcgatatc 2581 agcaccctcc ctagcgactt caaggagtat ttcgtcaacc ttaacgacgg ctccaacgag 2641 ggtatgatgc acaagacccg ccccatcttc tctacccagt tccaccccga ggctaagggt 2701 ggtcccatgg actcttccta cctctttgac aagtacatgg agaatgtcga gcttttcaag 2761 agcaactccc aggtctaccg tgacaacagg cctactcagt tcatgattga cattctcagc 2821 aaggagcgtg tcggcgttga gcccactcct ctttctaacg ctgcttaacg gacgagaatg 2881 tctttgcatt ttcttagtga acggtgtttg tcattggtct tgggttattg ttttccagca 2941 caaaggatat gggttttgta aaagggactt tcttgggcga cgtcctctcg ttattaatgg 3001 tttagggtgt atatgatatc ttttgttact acttttgaat gcagaatcag ataagtttag 3061 tcggcattag gctgatatgg cgcagagagg acaatatcta gttgatacat acccatgtgc 3121 tatcacgtct aaacgtcttc cagtcgttgt attttaccca ttcctggtga ttgccgtatg 3181 tcttccagtc cagtacctcc cctaattcgg tgcccttcaa cattcagggt tatgttgaca 3241 gcgtgctgta taagcatggt tgccggggaa catggtgtca ttgactggta agcagggcag 3301 cctctggttc tgtgacacct ggttttccct ggtcttgatt tcaaactatt atcaaacaaa 3361 tcagcgtcgt cttagtagat ggcaggtatc ttagaacaga aagtagttat tcttcgtcgg 3421 atacgactcc aaaatcatat acgaacaagt ggatgcttta tactatgctt ctttttactg 3481 taatcttggt ctcgctcttc gcttagcctt cttcacctgg taaccggtat aggtaccttc 3541 gcaccctcaa ttggaaggat gcatcaaccc ctccatgcgc tgcttatcct gctatctatc 3601 catacatttc tatgttttta cagtgtagaa tatttcttgt gtatcagggt ccgtcccgcg 3661 atgctacccc ctgtcctgca g // LOCUS HUMOSTRO 1425 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human osteopontin mRNA, complete cds. ACCESSION J04765 KEYWORDS osteopontin; phosphoprotein. SOURCE Human adult osteoblast, cDNA to mRNA, clone HOP-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1425) AUTHORS Young,M.F., Fisher,L.W., McBride,O.W. and Termine,J.D. TITLE cDNA cloning, chromosomal location and RFLP analysis of human osteopontin JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Young, 04-MAY-1990. FEATURES from to/span description pept 68 1012 osteopontin precursor /hgml_locus_uid="LK0223T" /nomgen="OPN" /map="unassigned" sigp 68 115 osteopontin signal peptide matp 176 1012 osteopontin mRNA < 1 1425 osteopontin mRNA BASE COUNT 449 a 293 c 296 g 387 t ORIGIN Chromosome 4q13. 1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact 61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc 121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga 181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca 241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc 301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga 361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca 421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac 481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg 541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga 601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt 661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga 721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca 781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc 841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt 901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct 961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa 1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa 1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg 1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa 1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg 1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact 1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag 1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag // LOCUS MUS25OAS 1412 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse 2'-5' oligo A synthetase mRNA, complete cds. ACCESSION M33863 KEYWORDS 2'-5' oligo A synthetase. SOURCE Mouse cell line L-929, cDNA to mRNA, clone L3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1412) AUTHORS Coccia,E.M., Nissim,A. and Chebat,J. TITLE A full-length murine 2-5A synthetase cDNA transfected in NIH-3T3 cells impairs EMCV but not VSV replication JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.Coccia, 26-APR-1990. FEATURES from to/span description pept 36 1139 2'-5' oligo A synthetase signal 1389 1394 poly-A signal BASE COUNT 340 a 371 c 390 g 311 t ORIGIN 1 ccaggctggg agacccagga agctccagac ttagcatgga gcacggactc aggagcatcc 61 cagcctggac gctggacaag ttcatagagg attacctcct tcccgacacc acctttggtg 121 ctgatgtcaa atcagccgtc aatgtcgtgt gtgatttcct gaaggagaga tgcttccaag 181 gtgctgccca cccagtgagg gtctccaagg tggtgaaggg tggctcctca ggcaaaggca 241 ccacactcaa gggcaggtca gacgctgacc tggtggtgtt ccttaacaat ctcaccagct 301 ttgaggatca gttaaaccga cggggagagt tcatcaagga aattaagaaa cagctgtacg 361 aggttcagca tgagagacgt tttagagtca agtttgaggt ccagagttca tggtggccca 421 acgcccggtc tctgagcttc aagctgagcg ccccccatct gcatcaggag gtggagtttg 481 atgtgctgcc agcctttgat gtcctgggtc atgttaatac ttccagcaag cctgatccca 541 gaatctatgc catcctcatc gaggaatgta cctccctggg gaaggatggc gagttctcta 601 cctgcttcac ggagctccag cggaacttcc tgaagcagcg cccaaccaag ctgaagagtc 661 tcatccgcct ggtcaagcac tggtaccaac tgtgtaagga gaagctgggg aagccattgc 721 ctccacagta cgccctagag ttgctcactg tctttgcctg ggaacaaggg aatggatgtt 781 atgagttcaa cacagcccag ggcttccgga ccgtcttgga actggtcatc aattatcagc 841 atcttcgaat ctactggaca aagtattatg actttcaaca ccaggaggtc tccaaatacc 901 tgcacagaca gctcagaaaa gccaggcctg tgatcctgga cccagctgac ccaacaggga 961 atgtggccgg tgggaaccca gagggctgga ggcggttggc tgaagaggct gatgtgtggc 1021 tatggtaccc atgttttatt aaaaaggatg gttcccgagt gagctcctgg gatgtgccga 1081 cggtggttcc tgtacctttt gagcaggtag aagagaactg gacatgtatc ctgctgtgag 1141 cacagcagca cctgcccagg agactgctgg tcaggggcat ttgctgctct gctgcaggcc 1201 catgacccag tgagggaggg ccccacctgg catcagactc cgtgcttctg atgcctgcca 1261 gccatgtttg actcctgtcc aatcacagcc agccttcctc aacagattca gaaggagagg 1321 aaagaacaca cgcttggtgt ccatctgtcc acctgttgga aggttctgtc tgacaaagtc 1381 tgatcaacaa taaaccacag caggtgccgt ca // LOCUS ACARGA 1212 bp ds-DNA INV 17-JUL-1990 DEFINITION A.castellanii 5S RNA. ACCESSION M34003 KEYWORDS 5S RNA. SOURCE A.castellanii (strain Neff) trophozoite DNA, clone pAc5S.1. ORGANISM Acanthamoeba castellanii Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina; Acanthamoebidae. REFERENCE 1 (bases 1 to 1212) AUTHORS Zwick,M.G., Wiggs,M. and Paule,M.R. TITLE Cloning, sequencing and genomic organization of a 5S RNA gene from the eukaryotic protist Acanthamoeba castellanii JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Paule, 04-MAY-1990. Author address: M.R.Paule Colorado State University, Dept. Biochemistry, Fort Collins, CO 80523 FEATURES from to/span description RNA 489 610 5S RNA precursor RNA 489 607 5S RNA BASE COUNT 195 a 374 c 329 g 314 t ORIGIN 1 ctgcaggaga ttcgtcgggc tcgtgccgcg atcgtcttga tgatcgagac cgacggcccg 61 ttcctcactc cgcgcacctt gcgccttgcc cgcgccgcaa cgagccgtct tacctcaccc 121 acgtcctaca gagtacttac atcctccaat gtgttgtgtg cgtgtcatgt gcccaatgta 181 tgtggtgtgt gcggtgtgtg cggtgtgtga ggtgtgtggt gtgcgcgtgt catgtctgtt 241 acacgttctg ttgaccaaca ttttttatgg tgtgtacagc ggtcgccgac tgcaccggac 301 gatcggtaga ggaggtggcc ggtgaaacca cgcagaccgc ggtgcgcttc ttcggtctgg 361 accaaatccg taaggagtga atgtagcaca cccctcccct ccatgtcatt gaagcccaag 421 ttagtgtgca atttaaatga tggtgtgacc aagagggatc acatgggcga agtccctggt 481 aggctgatgg atacggccat actgcgcaga aagcaccgct tcccatccga acagcgaagt 541 taagctgcgc caggcggtgt tagtactggg gtgggcgacc acccgggaat ccaccgtgcc 601 gtatcctcct ttttgcttct tcactctcgc cactattgct tcaggtcgcc tcatcgcctg 661 ccacagctga ctacaaactg ggtgagccaa acctgccttt tgcactggtc cagtgtacaa 721 ggctggaaca actttgtttc acacttgctg aggtcaatca atccagcctt gttggttgag 781 gccacaggag ggatctgcca cctttcacac catctcgtcg agaaggctgc ttgacgcttc 841 ttcgctgtcc ttcggggctg atgtctcgac tgtgtgtccg tcttcatcgc gattgtcact 901 tgcgttgttg ttgttgcggc tttcgtccct gtcctcatcg ccgtcgtcgt cttcatcgct 961 gttgtcgcgc gactcactcc gcaaaccgtc tggccgtgtg ctgctcctcc tcttcatcgt 1021 tgctgtcatc tcggtcgctg cggccacgcc tcgaagaaga cagcccgccc tcttcggccg 1081 cctcggtccc gcccctcgca aagcggatct cgcctcgtcg gctgccgctg gccgtcgctc 1141 cgcctggcgt cgttgtcttc cctcgcatcg tcctcaaact tgtcgtgcgt ccacttgtcg 1201 gcaaatgtcg ac // LOCUS ENTOMPX 664 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.cloacae outer membrane protein X (OmpX) gene, complete cds. ACCESSION M33878 KEYWORDS outer membrane protein X. SOURCE E.cloacae (strain 2249-1) DNA. ORGANISM Enterobacter cloacae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 664) AUTHORS Stoorvogel,J., van Bussel,M.J.A.W.M., Tommassen,J. and van de Klundert,J.A.M. TITLE Molecular characterization of an Enterobacter cloacae outer membrane protein OmpX JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Stoorvogel, 26-APR-1990. Author address: J.Stoorvogel Dept of Medical Microbiol, Univ Hospital, P.O.Box 9600, 2300 RC Leiden, THE NETHERLANDS FEATURES from to/span description pept 83 601 outer membrane protein X precursor sigp 83 151 outer membrane protein X signal peptide matp 152 598 outer membrane protein X signal 18 25 -35 region signal 43 48 -10 region binding 74 77 ribosome binding site signal 620 649 transcription termination signal BASE COUNT 171 a 156 c 162 g 175 t ORIGIN 1 atcgtgaatg acaaagagtt tcacagattt ttttgatata tttaaaactt acggacttac 61 ttgaagcaca tttgaggtgg ttatgaaaaa aattgcatgt ctttcagcac tggcagctgt 121 tctggctgtt tccgcaggta ccgctgtagc ggcaacttct actgtaactg gtggttacgc 181 tcagagcgat atgcagggcg tgatgaacaa aaccaacggt ttcaacctga agtaccgtta 241 cgagcaagac aacaacccgc tgggtgtgat cggttctttc acttacaccg agaaagatcg 301 tactgaaaat ggctcttaca ataaaggtca gtactacggc atcaccgcgg gtcctgctta 361 ccgcctgaat gactgggcaa gcatctacgg tgttgtaggt gttggctacg gtaaattcca 421 gcagaccgaa aaccagggtc tgaaccgtac tgcaagcaac agcgactacg gtttctccta 481 tggcgcaggt atgcagttca acccaatcga aaacgttgct ctggacttct cctatgagca 541 gagccgtatc cgcaacgttg acgttggcac ctggatcgcg ggcgtaggtt accgcttcta 601 atcacttcgg tgagccagta aaaaatccgc ccaatgtggc ggattttttt tgtttgtacc 661 ctct // LOCUS NEUMPPX 2038 bp ss-mRNA PLN 17-JUL-1990 DEFINITION N.crassa matrix processing peptidase (MPP) mRNA, complete cds. ACCESSION J05484 KEYWORDS matrix processing peptidase. SOURCE N.crassa, cDNA to mRNA. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 2038) AUTHORS Schneider,H., Arretz,M., Wachter,E. and Neupert,W. TITLE Matrix processing peptidase of mitochondria: Structure-function relationships JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Neupert, 17-APR-1990. FEATURES from to/span description pept 41 1774 matrix processing peptidase BASE COUNT 439 a 626 c 552 g 421 t ORIGIN 1 cccacattac gctgccgcat cacaattcct tgttgcagcc atgctgaatc gcttccggcc 61 agcgcggcta gtagcccaat cctccagatg cttgcccttg acgagggcgc gggcaggtcc 121 cttgcccgtt aacaatgcca ggactttggc tacgagagcc gctgctgtca acaccaagga 181 accgaccgaa cgcgacaaca tcaccactct ctccaatggt gtccgtgtcg cttccgagga 241 ccttcccgat gccttctccg gtgtaggtgt ctacatcgac gcggggtccc gatatgagaa 301 cgactatgtc cggggtgcca gtcacatcat ggaccggcta gccttcaagt ctacaagtgc 361 gaggactgcg gacgaaatgc tcgaaactgt tgagaagctc ggtggtaaca ttcagtgcgc 421 ttcttcgcgc gagtctatga tgtaccaggc ggccaccttc aacaaggcta ttcccaccgc 481 tgttgagctc atggccgaga ccatccgcga tcccaagctt acggacgagg agctggaggg 541 acagatcatg acggcgcaat atgaggtcaa cgagatctgg tccaaggccg aactgatcct 601 gcccgagttg gtgcacatgg ctgccttcaa ggacaacact cttggcaacc cgttgctttg 661 tcccaaggag aggttggatt acatcaaccg ggatgtcatc caaacatacc gcgacgcttt 721 ctacaggccc gagcgccttg ttgttgcctt tgctggtgtg cctcatgaga gggccgtcaa 781 gctcgcagag aagtactttg gtgatatgaa ggcctccgat gctcccggtc tctcgaggac 841 aggttccgaa acctccgtcg actcgctagt gtccgagtcc agcgaggcct cgagtgaatc 901 ttcatcatcc tcctcggact cttccgagtc gagtggcggg ctgctctcca agcttttctc 961 tcccaaggcc aagaaagcca cccccaaccc cttcctcacc cgggtaccta ttagcaccga 1021 agacttgact cggcctgctc actacacagg cggtttcctc accctcccat cacagccccc 1081 accgctcaac cccaaccttc ccacatttac tcacatacag ctcgccttcg agggcctcgc 1141 catctcggac gacgacatct acgccctcgc caccctgcag accctcctcg gcggcggcgg 1201 ctccttctct gccggcggtc ccggcaaggg catgtactcg cgtctctaca ctaacgttct 1261 caaccagcac ggctgggttg agtcctgcgt ggccttcaac cactcataca cggactcggg 1321 tctcttcggc atcgccgcct cgtgctaccc gggtcgcacc ctgcccatgc tccaggtcat 1381 gtgccgcgag ctgcacgccc tcaccaccga ccatggctac tcggccctgg gcgagctcga 1441 ggtttcgcgc gccaagaacc agctccgcag cagcctcctg atgaacctcg agagccgcat 1501 ggtcgagctc gaggatctgg gccgccaagt tcaggttcac ggtcgcaaga tcccggtccg 1561 cgagatgacg cgccgtatca acgagctgac ggtcaaggac ctccgaaggg tcgctaagcg 1621 cgtggttggt ggcatggcga ataacgccgg ccagggaagc ggtgcgccga cggtggtgct 1681 gcaggaggcg acggtgcaag gactcaagac tacggagctg gggtgggatc agatccagga 1741 tacaattgct cagtggaagc tcggtagacg gtaaacgttt gtcaagggga aaaaaagagt 1801 agggcgtgga gaagttatgt aagaggagcg ctgtattgaa cttggcgaca cgcacacacc 1861 ggaacgataa aggcgtttta ggttccccac gagcataggg aagaggctag atggttgctc 1921 tgtacaatcg caacttttct tggtgagtta tacaagatgt gtccaggtac atctttgcct 1981 taccatactg tacgatagca atgaagattt tctgatatat caaaagtcaa aagtcaaa // LOCUS XELXOTCH 9163 bp ss-mRNA VRT 17-JUL-1990 DEFINITION X.laevis Xotch protein mRNA, complete cds. ACCESSION M33874 KEYWORDS Xotch. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 9163) AUTHORS Coffman,C., Harris,W. and Kintner,C. TITLE Xotch, the Xenopus homolog of the Drosophila notch gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Coffman, 26-APR-1990. Author address: C.R.Coffman University of California Dept. of Biology-B022, La Jolla, CA 92093 FEATURES from to/span description pept 346 7917 Xotch protein mRNA < 1 9163 Xotch mRNA signal 9146 9151 poly-A signal rpt 409 4632 EGF-like repeats rpt 4684 5025 lin-12/Notch repeats rpt 5953 6591 cdc-10/SWI-6 repeats BASE COUNT 2499 a 2192 c 2188 g 2284 t ORIGIN 1 gaattccgga cgcctggtgg aactaaacag actggaatgc actggaacac ggaagcctat 61 gcggggatac attgtagcgc tgagcccttc cgctgtaaca ttgtagcgct gagccgctca 121 tacactgtag cgctgtaact tcaggctgtg cgcaaactga gtgcgctatt ctctctccaa 181 tctggaaact ctgtcgcttt tggaccaggg attgcactga cttggattat cttcaccttt 241 ctctaaggga tttatagatt tttgccttgg gacctttatc ctccccttgt cctgggacag 301 ggaccttcat caggatctat gttcctatag tgagactgca ggaaaatgga tcggataggg 361 ctggctgtgc tgctctgctc cctgcctgtc ctcacacaag gtcttcgatg cactcagact 421 gcagaaatgt gcctgaacgg tgggaggtgc gaaatgaccc ctggtggaac tggggtgtgc 481 ctatgcggta atttgtattt cggtgaaagg tgccagttcc ccaatccctg caccataaag 541 aatcagtgta tgaactttgg aacctgcgag cctgtgctac aggggaacgc catagacttc 601 atctgccact gccccgtggg tttcaccgac aaagtgtgcc tgactcccgt ggataacgcc 661 tgcgtcaaca acccgtgccg gaatggagga acctgtgaac ttttgaatag cgtgactgaa 721 tacaaatgca ggtgtccccc aggatggaca ggggattcgt gccagcaggc agatccatgt 781 gcctctaatc catgtgccaa tggtgggaag tgtttgccct ttgagatcca gtatatctgc 841 aagtgtcccc ctggattcca tggagctacc tgcaagcagg acattaatga gtgcagccaa 901 aatccctgca agaatggcgg ccagtgcatc aatgagtttg gatcataccg ctgcacctgt 961 cagaacaggt ttactggaag aaactgtgat gagccatatg tcccctgcaa cccatcgccg 1021 tgcttgaatg gagggacctg ccgccaaact gatgacacct cctatgattg cacttgtctt 1081 ccaggcttta gtggacaaaa ttgtgaagag aacattgacg actgcccgag caacaactgc 1141 cgcaatggtg ggacctgtgt ggatggagtc aacacctata actgccagtg tccacctgac 1201 tggacaggcc aatactgcac ggaggatgtg gacgaatgcc aactgatgcc aaacgcttgc 1261 caaaatgggg gaacctgcca caacacctac ggtggataca actgtgtttg tgtcaacggc 1321 tggaccggag aggactgcag cgagaacatt gacgattgtg ccaacgccgc ctgccatagt 1381 ggtgccacct gtcatgatcg cgtggcatct ttctactgcg agtgcccaca cggccggaca 1441 ggtctccttt gccacctgga taatgcctgt atcagcaatc cctgcaacga aggctccaac 1501 tgtgacacca acccagtcaa cgggaaagcc atttgcacct gcccacctgg gtacactggt 1561 cctgcttgca acaatgatgt ggatgagtgc tctctgggtg ccaatccatg tgagcatggt 1621 gggagatgta ccaacaccct gggttccttc cagtgcaact gcccccaggg atatgcagga 1681 ccccgctgcg agattgatgt caatgaatgt ctttccaacc cgtgccaaaa tgattccaca 1741 tgcctggacc agattggaga gtttcagtgt atatgtatgc caggttacga aggcctttac 1801 tgtgaaacca acatagatga atgtgccagc aacccctgcc tacataatgg caagtgcatt 1861 gataaaatca acgagttccg ctgcgactgt cccacaggct tcagtggaaa cctctgccag 1921 catgactttg atgagtgtac cagcactccg tgcaaaaatg gagccaaatg cttggatggt 1981 ccaaattctt acacctgcca atgtacagaa ggtttcaccg ggcgccactg cgaacaagac 2041 atcaatgaat gcattcctga tccgtgccac tacggcacct gcaaagacgg cattgctacc 2101 ttcacatgcc tgtgccggcc tgggtacacg gggcgtctgt gtgataatga catcaatgag 2161 tgcctgagca agccgtgcct aaacgggggg cagtgtacag accgagagaa tggatacata 2221 tgcacttgtc cgaaaggaac cacaggggtg aattgcgaaa ccaaaataga tgactgtgcc 2281 agtaacctgt gcgataatgg aaagtgcatt gacaagattg atggctacga gtgcacctgc 2341 gagcctggtt acacaggcaa gttgtgcaat attaacatta acgaatgtga cagcaatcca 2401 tgccgcaatg gagggacttg caaagaccag atcaatggct tcacatgtgt gtgcccagat 2461 ggctaccatg accacatgtg cctgtctgaa gtaaatgaat gcaacagcaa tccatgtatc 2521 cacggggcgt gccatgatgg agtcaatggg tataaatgtg actgtgaagc aggatggagc 2581 ggtagcaatt gtgacatcaa taacaatgag tgtgaatcca acccgtgtat gaatggtggt 2641 acctgcaaag acatgaccgg agcatacatc tgtacttgca aagcaggatt tagtgggcca 2701 aactgccaaa ctaacatcaa cgaatgttcc tctaacccct gcttgaacca tggaacgtgt 2761 attgatgatg ttgctggtta caaatgcaac tgcatgttac cttatacagg tgccatttgt 2821 gaggcagttt tggctccgtg tgcaggcagt ccttgtaaaa atggtggcag atgtaaagaa 2881 tcagaagact ttgaaacctt ctcttgtgaa tgtccacccg ggtggcaagg gcaaacatgt 2941 gagattgaca tgaatgaatg cgtaaatcgc ccttgccgta acggtgcaac gtgtcaaaac 3001 accaatggca gctacaagtg caactgcaaa cctggttaca ctgggcggaa ctgcgagatg 3061 gacattgatg actgccaacc aaatccttgt cacaatggag gatcctgttc tgacggcatt 3121 aatatgtttt tctgtaactg cccggccggg ttccgggggc ctaagtgcga agaggacatt 3181 aacgagtgtg ccagtaaccc gtgcaaaaat ggagccaact gcactgattg tgtaaacagc 3241 tacacatgca cctgccagcc tggatttagt ggcatccact gtgaaagcaa cacccctgac 3301 tgcaccgaga gctcgtgctt taatggtggc acctgcattg atgggatcaa tacgtttaca 3361 tgccagtgcc cacctggatt taccggaagc tactgccaac atgatatcaa tgaatgtgac 3421 tccaagcctt gtttgaacgg aggcacctgc caggacagtt acgggacata caagtgcacc 3481 tgtccgcaag gatacactgg gctgaactgc cagaacctgg tgcgttggtg tgactcttct 3541 ccctgtaaaa atggaggaaa gtgctggcag acaaacaact tttatcgttg tgagtgcaag 3601 agtggctgga ctggtgttta ctgtgatgtc cccagtgtct cctgcgaggt tgctgctaag 3661 cagcaaggtg tggatatcgt tcatctgtgt cgcaattctg gcatgtgtgt ggacactgga 3721 aacacacatt tctgccgctg tcaggctggc tacacaggca gttactgcga ggaacaggtg 3781 gacgaatgct cgccgaatcc ttgtcagaac ggtgccacat gcacagacta cttaggtggc 3841 tactcttgcg agtgtgttgc cggctaccat ggagtaaact gctccgagga gatcaatgag 3901 tgtttgtccc acccgtgcca gaatggagga acctgtattg atctcattaa cacctacaaa 3961 tgctcctgcc ccagaggaac tcaaggtgtg cactgtgaaa ttaatgtgga tgattgcacg 4021 cccttctacg actccttcac cttggaaccc aaatgcttca acaatggcaa atgtatagac 4081 cgcgtgggtg gctataactg catctgccct ccaggctttg taggggaacg ctgtgaaggc 4141 gatgtgaacg agtgcctgtc taatccctgt gactcccgtg gcacccagaa ctgcattcaa 4201 ctggtgaatg attaccggtg tgagtgccgg cagggattca caggaaggcg ctgtgaatct 4261 gttgtggatg gttgtaaggg aatgccctgc cgaaacggtg gaacatgtgc tgtcgccagc 4321 aatacagaac gtggatttat ttgcaaatgc cctcctgggt ttgacggagc cacttgcgag 4381 tatgattcac gaacctgcag caaccttcgc tgccagaatg gcggcacttg catttcggtg 4441 ctgacgagtt ccaaatgcgt gtgctcggaa ggctatactg gcgccacatg tcagtaccct 4501 gttatcagtc cttgtgcttc ccacccttgt tacaatggag gaacctgcca gttctttgct 4561 gaggaacctt tcttccagtg cttctgtcca aaaaacttta acgggctctt ctgccatatc 4621 ttggattatg agtttcccgg gggactaggc aagaatatta ctcctcctga taatgacgat 4681 atctgcgaaa acgagcagtg ctctgagctg gctgacaaca aagtctgcaa cgccaactgc 4741 aacaaccatg cttgcggatg ggatggagga gactgctctc tcaacttcaa tgacccatgg 4801 aagaactgca cccagtcgct gcagtgctgg aaatatttca atgatggcaa atgcgattct 4861 cagtgcaaca acactgggtg tctgtacgac gggtttgact gccagaaagt tgaggttcag 4921 tgcaaccctt tgtatgacca gtactgcaag gatcactttc aagatggcca ctgcgaccaa 4981 ggctgtaaca atgcagagtg cgaatgggat ggcctggact gcgccaacat gccagagaac 5041 ctggctgaag gcaccctggt gttggtcgtc ctcatgcccc ctgaaaggct taagaacaac 5101 tctgtcaatt tcctccgtga gctgagccga gtgctccaca caaatgtggt gttcaagaaa 5161 gacagcaaag gcgaatacaa gatctaccct tactatggca atgaggaaga acttaaaaaa 5221 catcacatca agaggtcaac tgattactgg tcggatgctc cttctgccat cttcagcaca 5281 atgaaagaat ccattttact aggaaggcat cgtagggagt tggatgaaat ggaagttagg 5341 gggtcaatcg tatacttgga aatagacaac cgccagtgtt acaaatcctc ctctcagtgc 5401 ttcaacagtg ctacagatgt tgccgcattc cttggggctt tggcctccct tgggagtctg 5461 gacacccttt cctataagat tgaggcagtg aaaagtgaga acatggagac ccccaagccc 5521 tctactctct atcccatgtt atccatgttg gtcattccgt tattaatcat cttcgtcttc 5581 atgatggtca tcgttaataa gaagcgtcgc cgtgaacacg acagctttgg ttccccgacg 5641 gctttattcc aaaagaaccc agcaaaaaga aacggcgaga cgccttggga ggattctgtt 5701 ggcttaaagc ccattaagaa tatgacagat ggctctttta tggacgataa tcagaatgaa 5761 tggggagatg aggagaccct ggaaaacaag aggttcaggt ttgaagagca agtgattctc 5821 ccagaacttg ttgacgacaa aactgacccc cgacagtgga cacggcagca ccttgacgcc 5881 gctgatctgc gcatttcatc catggcccca acgccgccgc agggagagat tgaggccgac 5941 tgtatggatg tcaatgtccg tggccctgat ggcttcaccc cacttatgat tgcctcctgt 6001 agcggaggtg gattagagac tggaaacagc gaagaggaag aggatgcttc agctaatatg 6061 atttccgact tcattgggca gggtgcccaa ctgcataacc aaaccgaccg cactggcgag 6121 acagcacttc atctggccgc aagatacgca cgtgctgatg cagccaagcg cctgttggaa 6181 tcgagtgcgg acgccaacgt ccaggataac atgggcagga ctcctctcca tgcagcagtg 6241 gcggctgatg ctcagggcgt attccagatt ctcattcgga accgagcgac agacttagac 6301 gcccgcatgt ttgatggcac tacccctctg atcctggccg ctcggctggc cgtggaaggg 6361 atggtggagg agcttatcaa tgctcatgca gatgtcaacg ctgttgatga atttggaaaa 6421 tctgctttgc attgggcagc ggctgtgaat aacgttgatg ctgcagctgt gcttctcaag 6481 aatagtgcaa ataaggacat gcaaaacaac aaggaagaga catccctgtt cttggccgca 6541 agagaaggca gctacgaaac tgccaaagtc cttttggatc actacgccaa ccgtgacatc 6601 acagaccaca tggatcggct gcctcgtgac atcgcccaag aacgcatgca ccacgacatt 6661 gttcacctgc tggatgaata taaccttgtg aagagcccaa cgctgcacaa tggtccgttg 6721 ggagcaacga cattatcacc tcccatctgc tcccctaatg gttacatggg gaacatgaag 6781 ccttctgttc agagcaagaa agcccgcaag cccagtatca aaggtaatgg ctgcaaagag 6841 gccaaagagc tgaaagccag aaggaaaaaa tctcaagatg ggaaaacaac tctcttggat 6901 tctggcagtt ctggagtgtt gtccccagtg gactccctgg agtcaacaca tggatactta 6961 tcagatgtgt cttctcctcc gctgatgacc tctccgtttc agcagtctcc atccatgcct 7021 ctgaaccact tgacaagcat gccagagtcc cagcttggca tgaatcacat aaacatggcc 7081 accaagcagg aaatggcagc aggttccaac agaatggctt ttgatgccat ggtgccacgt 7141 ctgacccatc tcaatgcctc aagccctaat accatcatga gcaatggatc catgcatttc 7201 actgtgggag gagctccgac tatgaacagc caatgtgact ggttagctag gctgcagaat 7261 gggatggtcc agaatcagta tgacccaatc agaaatggca tccaacaagg caatgctcaa 7321 caagctcaag ctcttcagca tggccttatg acctcgctcc ataatggtct gccagcaaca 7381 actctctccc aaatgatgac ctatcaggcc atgcccaaca caaggctagc caatcagcca 7441 catctaatgc aagcccagca aatgcaacag cagcaaaact tgcagttgca ccagagcatg 7501 cagcaacaac atcacaattc cagcacgacc tctactcaca tcaactcacc attctgcagc 7561 agtgacataa gccagacgga cctgcagcaa atgtcaagca acaacattca ttcagtaatg 7621 ccccaggaca ctcagatatt tgctgcatct ctgccttcca atcttacgca gtctatgaca 7681 actgcacaat ttttaacccc cccttcccag catagctact cctccccaat ggacaataca 7741 ccaagccatc aactacaagt accagaccac ccgttcctga cgccttctcc tgagtcacct 7801 gaccagtggt caagctcctc ccctcattcc aatatgtctg actggtcaga aggaatatca 7861 agtcctccca cgagtatgca gcctcagcgc acccacatac ctgaagcttt caagtaaaaa 7921 aaaaaaaagt ttaaaaaaat gtaaaatatt tttaaagaca ctgagagaga ctttaagaga 7981 ctgaaggaaa tttttatatg gtttttatac ttaaaataac agaacatttg aattttctag 8041 tatttattta tatatacgtt tgacctaaaa cactgccctt ttatttataa gctttttttc 8101 atttttagga caaaatattt taacttcttg ccttgaaagt ttttcagttc taaatcttat 8161 gaaattggtt cctgcctggt attgaaaacg gcaatgtatt tattttttat ttacctgaat 8221 agtatacagg aacaaaccac tggggtgggg gggttatcgg gatgtgtatt tagcagaaaa 8281 aagattttct ataaaatgaa atctttcagg ttttcattta tagcactaaa aagattccag 8341 tattaatttt aaattaaatc atgaagaaga tgctccgaat acctctcgct gtcagggagg 8401 tgcccaatag gttttttttt tttgtttttt ttttggaaaa cctgaaactc ttggtaagta 8461 cagaaagaaa gcataaaata caggagggca cgcctaggtc tgatgcattc tcaatgggaa 8521 taaaatcctg aaaagtgact cattttaaaa aattatattt gttagaaaaa acaaactgcc 8581 attttgaatc ccttgtgtcg catggggtat tggtcagaaa atgcatgcaa tttttggtac 8641 aagcccgtgg catattttct acagtttttt tttttttttt tttttttttg taaatatcta 8701 tggtggattt tgtttcaaaa atctatttgt atatcctgat ttatattgct tatccgagta 8761 ggagtgcagg ctggatggaa ccatgccgaa gacaaatcgc agattggaca ttgtttttta 8821 tttttttgcc tacagagact atttgcacat gaacaacaac aaaaaaaaga agatatattt 8881 aaatataaaa ttttatacaa acgaaaaaac aaaaaacaaa aactgtcgat ttgcctctag 8941 ttccacttct tctgtgatac gatgcagttg ggcagttcat tccttatgcc gtactgtaca 9001 aagatctgta gaatttagtt gtttttcctg aacaaataaa tatgtaatgt atcgagcaga 9061 aatgtagttt acaaaaaaaa ctatttgttg ttgttgctgt taacacttct gtaaacaaat 9121 tttttttggt ttgtttattt gttgtaataa aattgtacaa aat // LOCUS TRCTC26 638 bp ss-mRNA INV 17-JUL-1990 DEFINITION T.cruzi AMP-inducible protein (TC26) mRNA, partial cds. ACCESSION M34062 KEYWORDS AMP-inducible protein. SOURCE T.cruzi (strain X10/4) trypomastigote, cDNA to mRNA, clone G26. ORGANISM Trypanosoma cruzi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 638) AUTHORS Heath,S., Hieny,S. and Sher,A. TITLE A cyclic AMP inducible gene expressed during the development of infective stages of Trypanosoma cruzi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Heath, 08-MAY-1990. FEATURES from to/span description pept < 1 > 638 AMP-inducible protein (TC26) (AA at 1) BASE COUNT 154 a 174 c 168 g 142 t ORIGIN 1 accccaccag accaactggt aatggtagcg accggcgctc agcttggaat tccgtgcgct 61 gcttctagcc ctagagaagc tgatgatccc ccgtatccga cacaggcgta caaccctgct 121 tgtggttacg gacagtcagt ctcttctagc ggctctaaac aagggcccgc tcaatcagac 181 agactggacg gaggatcaga tctggcggcg tctcttgaca ctgacgtgtg ctggctggtc 241 gtgcacctgc agttttgtta cggacattgt ggagtacatg ctaacgagct tgcagatcag 301 tatgcgagga actatggaaa gtggacaata cacggagcaa ggaatcgcac ctttatggca 361 tacggatctg ctgacgtgtt ttactaccca gctcaccaac aagtggcgta ctaccattcg 421 tcaagacact catcgctacc tgctttgcgg cacaaggcca tcagatctcg cggtaaggac 481 ctgatcactc aggaagttct acaccgtcag aactggttca cctcgcaagg gcaaggtgcg 541 gggaatctga gctctggggc cgactatctg ggccgtgaga gattgcacga accaatgccg 601 tttctgcaac atctcaccgg aacagtctgc atatatat // LOCUS DROANNIX 1104 bp ss-mRNA INV 17-JUL-1990 DEFINITION D.melanogaster annexin IX mRNA, 3' end. ACCESSION M34068 J05501 KEYWORDS annexin IX. SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-6. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1104) AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and Suedhof,T.C. TITLE Two novel annexins from Drosophila melanogaster: Cloning, characterization and differential expression in development JOURNAL J. Biol. Chem. 265, 11382-11388 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Suedhof, 04-MAY-1990. FEATURES from to/span description pept < 1 891 annexin IX (AA at 1) signal 1091 1096 poly-A signal BASE COUNT 281 a 300 c 306 g 217 t ORIGIN 1 attctgcgca aggcgatgaa gggcttcggc accgacgaga aggccatcat cgagatcctg 61 gccaggcgtg gcatcgtcca gcgtttggag atcgctgagg cgttcaagac ctcgtacggc 121 aaggatctga tctcggacct caagtccgag ctgggcggca agttcgagga tgttatcctg 181 gctctgatga cgccgctgcc ccagttctat gcccaggagc tgcacgacgc catctcggga 241 ctgggaaccg acgaggaggc catcatcgag atcctctgca cgctgtccaa ctacggcatt 301 aagaccattg cccagttcta cgagcagagc ttcggcaagt ccctagagtc cgacctaaag 361 ggcgacacca gtggccactt caagcggctg tgcgtctcgc tcgtccaggg caaccgggat 421 gagaaccagg gcgtggacga ggccgcggcc atcgccgatg cccaggctct gcacgacgcc 481 ggtgagggac agtggggcac agatgagtcc accttcaact cgatcctgat cacccgctcc 541 taccagcagc tgcgccagat cttcctcgaa tacgagaatc tgtcgggcaa cgacatcgag 601 aaggccatca agcgggagtt tagcggctcc gtggagaagg gtttcctggc catcgtcaag 661 tgctgcaagt ccaagatcga ctacttttcg gagcgcctgc acgactccat ggccggcttg 721 ggcaccaagg acaagacgct gatccgcatc atcgtcagcc ggtcggagat cgatctgggt 781 gacatcaagg aggcattcca gaacaagtac ggcaagagct tggagtcctg gatcaaggag 841 gatgccgaga ccgatattgg atacgtcctg gtcactctta cggcttggta gacggaagca 901 gccggaatat ccgaatatct atgagcaata ccccactgtt caagtagaaa atgccaaaaa 961 aaaaaacgtt gcatttcccc aaaaaaaagt ataacaaaag cgaagaacaa atggagttgg 1021 tctatataca gtagttgtga tgtgttctaa aaatccaatc tacaaaacgc ttagtatttt 1081 ccctctgtgc aataatcgga attc // LOCUS DROANNX 1192 bp ss-mRNA INV 17-JUL-1990 DEFINITION D.melanogaster annexin X mRNA, complete cds. ACCESSION M34069 J05501 KEYWORDS annexin X. SOURCE D.melanogaster adult head, cDNA to mRNA, clone pD3-16. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1192) AUTHORS Johnston,P.A., Perin,M.S., Reynolds,G.A., Wasserman,S.A. and Suedhof,T.C. TITLE Two novel annexins from Drosophila melanogaster: Cloning, characterization and differential expression in development JOURNAL J. Biol. Chem. 265, 11382-11388 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Suedhof , 04-MAY-1990. FEATURES from to/span description pept 91 1056 annexin X signal 1175 1184 poly-A signal BASE COUNT 271 a 348 c 363 g 210 t ORIGIN Chromosome 93B or 19A-4,7. 1 gaattccaaa agtcccagga gaaagactga ttcgtgtgaa gtcgtctact gaagagccac 61 aaggaaccca aggaatcttc cagctgcata atggaataca aacccgtgcc cacggttaag 121 gacgcagctc ccttcgacgc ctcccaggac gcccaggtgc tgcgggcggc gatgaaggga 181 ttcggcaccg acgagcagga aatcatcgac gtgctcgtcg gcaggagcaa ccagcagagg 241 cagacgatca aggcggttta cgaagcggag ttcgagcgcg acctggtgga cgatcttaag 301 gacgagctgg gaggcaagtt cgaggacgtg atcgtgggtc taatgatgcc accagtggag 361 tacctgtgca agcaactgca cgccgccatg gcgggcatcg gaaccgagga ggccacgctc 421 gtcgagatcc tgtgcaccaa gaccaacgag gagatggccc agatcgtggc cgtctacgag 481 gagcgctacc agcgcccgct ggccgagcag atgtgcagcg agacctccgg ctttttccgc 541 cgcctgctca cgctgatcgt gaccggagta cgtgacggac tggacacgcc cgtcgacgtc 601 ggtcaggcca aggagcaggc cgcccagctc tactcggccg gcgaggccaa gctgggaacg 661 gacgaggagg tcttcaaccg gatcatgtcg cacgccagct tcccgcagct gcgacttgtc 721 ttcgaggagt acaaggtgct ctccgggcag accatcgagc aggccatcaa gcacgagatg 781 tccgacgagc tgcacgaggc catgatggcc atagttgagt gcgtccagtc accggcggcc 841 ttcttcgcca accgcctcta caaggccatg aatggcgccg gcaccgatga cgccacgctc 901 atccgcatca tcgtcagccg ctcggagatc gacctggaga ccattaagca ggagttcgag 961 cggatctaca accgtacgct gcacagcgcc gtggtggacg cggagacctc tggtgactac 1021 aagcgggccc tgacagccct acttggatcc gcctaggccc gaggatgtgg cagctggtcc 1081 gcccaatatt ttattcgtgt taatagcttt gatcgtagtg tgccttttag gaaaatcgct 1141 tttaatgtcg tctgcgcatg cgcacactgt tggcaataaa taaacggaat tc // LOCUS ECODKSA 1273 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli dnaK suppressor (dksA) gene, complete cds. ACCESSION M34945 KEYWORDS dnaK suppressor. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1273) AUTHORS Kang,P.J. and Craig,E.A. TITLE Identification and characterization of a new Escherichia coli gene that is a dosage-dependent suppressor of a dnaK deletion mutation JOURNAL J. Bacteriol. 172, 2055-2064 (1990) STANDARD simple staff_review FEATURES from to/span description pept 229 441 ORF 1 pept 619 1074 dnaK suppressor (dksA) BASE COUNT 343 a 301 c 333 g 296 t ORIGIN 1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa 61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc 121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata 181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg 241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt 301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg 361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa 421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg 481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg 541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc 601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt 661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa 721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa 781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac 841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac 901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc 961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg 1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag 1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga 1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct 1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt 1261 acgcatagaa gat // LOCUS STFCYCLI 2180 bp ss-mRNA INV 17-JUL-1990 DEFINITION Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds. ACCESSION M33880 KEYWORDS cyclin B. SOURCE Starfish (A.pectinifera) egg, cDNA to mRNA, clone lambda-gt10-cyc10. ORGANISM Asterina pectinifera Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa; Stelleroidea; Asteroidea; Spinulosida; Asterinidae. REFERENCE 1 (bases 1 to 2180) AUTHORS Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T. TITLE The starfish egg mRNA responsible for meiosis reinitiation encodes cyclin JOURNAL Dev. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Kishimoto, 11-MAY-1990. FEATURES from to/span description pept 126 1313 cyclin B (CYC) mRNA < 126 2180 cyclin B mRNA BASE COUNT 650 a 440 c 482 g 608 t ORIGIN 1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg 61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga 121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt 181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc 241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag 301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca 361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa 421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct 481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac 541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt 601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc 661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc 721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc 781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg 841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt 901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca 961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca 1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag 1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc 1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat 1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca 1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg 1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt 1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt 1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg 1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa 1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt 1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga 1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc 1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca 1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac 1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat 1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag 1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat 2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg 2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt 2161 gtaataaagt taatggaaat // LOCUS MUSH2A1X 929 bp ds-DNA ROD 17-JUL-1990 DEFINITION Mouse histone H2A.1 gene, complete cds. ACCESSION M33988 KEYWORDS histone H2A.1. SOURCE Mouse (BALB/c) DNA, clone 53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 929) AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and Sch|mperli,D. TITLE Structure of a mouse histone gene cluster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Sch|mperli, 02-MAY-1990. Author address: Dr.Schmperli Zoologisches Institut, Baltzerstrasse 4 CH-3012 Bern SWITZERLAND FEATURES from to/span description pept 164 556 histone H2A.1 mRNA 119 624 histone H2A.1 mRNA BASE COUNT 238 a 247 c 222 g 222 t ORIGIN 1 gagctcaaat tctggctttc tattgggtac gatatattaa ccaatgggag aaacacaaac 61 agaatacctc cagttagtat aaatgcttgc tgttcagttg cagaatttac tatatattct 121 tttcctttct ctgctttgcc tttactgata cttaaacgca tacatgtctg gacgcggaaa 181 gcaagggtgc aaggcccgcg ctaaggccaa gacccgctcc tcccgggccg gcctgcagtt 241 ccccgtgggc cgcgtgcacc ggctgctccg caagggcaac tactcggagc gcgtgggcgc 301 cggcgccccg gtgtacctgg cggctgtgct ggagtacctg acggccgaga tcctggagct 361 ggcgggcaat gcggcccgcg acaacaagaa gacgcgcatc atcccgcgcc acctgcagct 421 ggccatccgc aacgacgagg agctcaacaa gctgctgggc cgcgtgacca tcgcgcaggg 481 cggcgtcctg cccaacatcc aggccgtgct gctgcccaag aagaccgaga gccaccacaa 541 ggccaagggg aagtaatctg gcgattgtct gtactgccca gttgaaagtt aaccaaaaca 601 aaggctcttt tcagagccac ccacatcttt ccataaaatg agctgccacc tcgtgaaacg 661 ttcttccact acagttttta tactacatat gaaaaagtta cgaagtagct ttcaatctta 721 gtaaattgat tttaatactg ttagtccctg cgataaatct tacgaccttc cttagtttga 781 gtcaaaagtg tgtaagagat gaaaccttta gaacatacta taaattttta gtagaaattt 841 ggcacccagg tttgtcattc acgtcacgat tgtctagagc ataatggtag taagggctaa 901 gggccattaa atcccacttc catagtttc // LOCUS MUSH2A1XX 693 bp ds-DNA ROD 17-JUL-1990 DEFINITION Mouse histone H3.2 gene, complete cds. ACCESSION M33989 KEYWORDS histone H3.2. SOURCE Mouse (strain BALB/c) DNA, clone 53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 693) AUTHORS Gruber,A., Streit,A., Reist,M., Benninger,P., Bvhni,R. and Sch|mperli,D. TITLE Structure of a mouse histone gene cluster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Sch|mperil, 02-MAY-1990. Author address: D.Schlmperil Zoologisches Institut, Baltzerstrasse 4 CH-3012 Bern SWITZERLAND FEATURES from to/span description pept 151 561 histone H3.2 mRNA 116 611 histone H3.2 mRNA BASE COUNT 159 a 206 c 184 g 144 t ORIGIN 1 caggatttag aagcagaggc tgaccaatcc caacaaagcg cgggcccttt gaatgttctt 61 cggtccaata gcggatagtc tgattgtata aaaggtggac agcgccttgc agctcactat 121 agtgtcagtc tattttccct tgttaaagtc atggctcgta ctaagcagac cgctcgtaag 181 tccaccggtg gcaaggcccc gcgtaagcag ctggccacca aggccgcccg caagagcgcc 241 ccggccaccg gcggcgtgaa gaagcctcac cgctaccgtc ccggcaccgt ggcgctgcgc 301 gagatccggc gctaccagaa gtcgaccgag ctgctgatcc gcaagctgcc gttccagcgc 361 ctggtgcgcg agatcgcgca ggacttcaag accgacctgc gcttccagag ctcggctgtc 421 atggctctgc aggaggcgag cgaggcctac cttgtgggtc tgtttgagga caccaacctg 481 tgcgccatcc acgccaagcg tgtcaccatc atgcccaagg acatccagct cgcccgccgc 541 attcgtgggg agagggctta aacgaaacaa agtgcaaact ggaacccaaa ggctcttttc 601 agagccaccc acagtttctg taaaagcagc tgttactttt ctactgtctt ggaaactttg 661 tttttcaaag atttatctag ttacaaagta tac // LOCUS RATGGSLEC 1358 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat Gal/GalNAc-specific lectin mRNA, complete cds. ACCESSION J05495 KEYWORDS Gal/GalNAc-specific lectin. SOURCE Rat (strain Wistar) adult peritoneal macrophage, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1358) AUTHORS Kurata,M.Ii., Itoh,H., Yamashina,N. and Kawasaki,I.T. TITLE Molecular cloning and sequence analysis of cDNA encoding the macrophage lectin specific for galactose and N-acetylgalactosamine JOURNAL J. Biol. Chem. 265, 11295-11298 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Kawasaki, 20-APR-1990, for release after publication. FEATURES from to/span description pept 172 1092 Gal/GalNAc-specific lectin precursor sigp 172 195 Gal/GalNAc-specific lectin signal peptide matp 196 1089 Gal/GalNAc-specific lectin signal 1330 1335 poly-A signal BASE COUNT 340 a 325 c 365 g 328 t ORIGIN 1 ccacctctgg ccactgctgc acagggaagc cacgatttca cacagttcca ggacttacct 61 cagcttcaca ctgcttctga cactctcctt ggtcccagct cagtatctgt ctggattctg 121 gtgtctcggt ttcagttcag atagccctag aggccttgtg tttgtgcaag catgacaatg 181 gcatatgaaa acttccagaa cttggggagc gaagagaaaa accaagaggc tggtaaagct 241 cctccccagt ccttcctgtg caatatcctc tcttggaccc acctcctcct gttctccctg 301 ggccttagcc tcctgctgct ggtggttatc tccgtgattg gatcccaaaa ttcccagtta 361 aggagggacc tagaaaccct gagaaccact ttagacaaca ccacctccaa caccaaggct 421 gaactacagg ccctggcctc caggggtgac agcttgcaaa caggaatcaa ttctctgaaa 481 gtggaggtgg atgatcatgg gcaggaactg caggcaggcc gaggcttgag ccagaaggtg 541 gcttctctgg agagcacagt ggagaagaag gagcagactc tcagaacaga tctatctgaa 601 ataaccgatc gtgtgcaaca gctggggaag gacttgaaga ccctgacatg tcagctggcc 661 agcctcaaga acaacggctc agcagtggcc tgctgccccc ttcactggat ggagcatgaa 721 ggcagctgct actggttctc tcagtctggg aagccgtggc ctgaagctga caagtactgc 781 cagctggaga attctaacct agtggtggtc aactccctgg cggagcagaa ttttctacag 841 actcacatgg gcagtgtggt cacttggatt ggcctaacgg accaaaatgg gccctggcga 901 tgggtggacg ggaccgacta tgagaaaggc tttacgcact gggccccaaa gcagccagat 961 aactggtatg gacacgggct gggaggaggc gaggactgtg cacacttcac ctcagatggt 1021 cgctggaatg atgacgtttg ccagaggccc taccgctggg tctgtgagat gaagctggcc 1081 aaggacagct aggagcactc tccaagtaat ttatttcttc agtggtttta gctggggaag 1141 cctggagttt ggactcctcc tatcccattt caatgttatt tgggactttt ctttttgtca 1201 aaaaatttaa gtcaaggaga aaggattgtg tttgagatgg tagtgtggtg tccggaagaa 1261 tggggttatt gaaatctgtg gtattttgta cagagtgccg cttattgtag gactttttaa 1321 tgtaacaaaa ataaaatact gttggtttta aaaaaaac // LOCUS MLUDNAA 4171 bp ds-DNA BCT 17-JUL-1990 DEFINITION M.luteus ribonuclease P (rnpA), 50S ribosomal subunit protein L34 (rpmH), DNA biosynthesis initiation protein (dnaA), and DNA polymerase III beta subunit (dnaN) genes, complete cds. ACCESSION M34006 KEYWORDS 50S ribosomal subunit protein L34; DNA biosynthesis initiation protein; DNA polymerase III beta subunit; ribonuclease P. SOURCE M.luteus (strain IF03333) DNA. ORGANISM Micrococcus luteus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 4171) AUTHORS Fujita,M.Q., Yoshikawa,H. and Ogasawara,N. TITLE Structure of the dnaA region of Micrococcus luteus: Conservation and variations among eubacteria JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Ogasawara, 05-MAY-1990. FEATURES from to/span description pept 435 37 (c) ribonuclease P (rnpA) pept 601 464 (c) 50S ribosomal subunit protein L34 (rpmH) pept 944 2491 DNA biosynthesis initiation protein (dnaA) pept 3242 > 4171 DNA polymerase III beta subunit (dnaN) signal 843 848 dnaA -35 signal signal 866 871 dnaA -10 signal BASE COUNT 672 a 1558 c 1361 g 580 t ORIGIN 1 ctcgagcacg acgaagggcg agggcgtgac ggtcatctac gcgtgctccg aggcggcggg 61 caggtgcggc ttcagggcct tgccgagcgc gccgacggtc tcgcggcgca gcagcgcgta 121 gtccgcctcg gcggcggcag gcagcgcgcg gacctgcacc agaaccggca ggtcgcgcag 181 cgggggcagg cgcatctgct ccgcgacgac ggcgcgcagc cgccgcttca cccggttgcg 241 ggtcaccgcg ttccccacgg ccttggacac gacgaacccg gcccgcggcc gaggggcgct 301 cggcgacgtg gaccgggtct gatcggggtc cgtggcgacg ctcaccacga cggtgcgccg 361 acccgcgcgg gtgccggtgc gaccgaggtg acggaactcc gcgggcgtgc ggacgcgacg 421 gtcacggggc agcacggggc gcgccgtccc tggaccgagg tcctcaggcg gacagctcgg 481 cgcggttctt gccgcggcga gcggacagga tggcgcggcc ggcgcgggtg cgcatgcggg 541 cacggaagcc gtgcttgcgg gcacgacggc ggttgttggg ctggaacgtg cgcttggtca 601 ctgtgatctc tccacgacgt gttgtgtgcc ttcgaccgtc gacctgagca ggtggaccag 661 ggggaaggac cggtgaggct ctgacatgaa atccccggcg ggcacgatca accgacaccc 721 gggggaccag acaactctag tggccggggt acgcaggggt caaacgggtg cgcgcggatc 781 ccggacgccc gtgggcgccc gggacggtcg tccacaggat gataggggtt gtccaccgtc 841 tgtgcacaac cctgtggaca accgttagac tccgagcacc gtcgtccgtc tcgcgcagcc 901 gcccccgtcc ccggggacgt ctgccccagg aaaggaagcc tccgtggtgg cagaccaggc 961 cgtgctcagc tcgtggcgct ccgtcgtggg ctccctcgag gacgacgcgc gggtcagcgc 1021 ccgcctcatg ggcttcgtct acctggccca gccgcagggc ctcatcggca atacgctcct 1081 gctggccgtg ccgaacgaga ccacccgcga gacgctccag ggcacccagg tggccgacgc 1141 cctcacggac gccctgaccc aggagttccg cgaggagatc ctgctggcca tctcgatcga 1201 cgcgaacctg cagccgccgc ggaccccctc ctccgaggcc cgccgctcct ccctcgccgg 1261 cgggccgtcc ggcgccgcgg cgccggacgt cgagctgccc ccggccgcga ccgccgccac 1321 ctcccgccgc gccgtcgccg aggagctgcc gggtttccgc atcgagccgc cggccgacgt 1381 cgtgcccgcc gcgaacgcgg ccccgaacgg gaacggcaag ccgacccccg cgccgccgtc 1441 gacgtccgcg gagaccagcc gcctcaacga ccgctaccac ttcgagacgt tcgtgatcgg 1501 ctcgtccaac cggttcgcgc atgcggccgc gaacgcggtg gccgaggcgc cggcgaaggc 1561 gtacaacccg ctgttcatct acggcgagtc cggcctgggc aagacgcacc tgctgcacgc 1621 gatcgggcac tacgcccgcc gcctctaccc gggcttgcgg gtgcggtacg tgaactcgga 1681 ggagttcacc aacgacttca tcaactccat ccgccacgac gagggcgcct cgttcaagca 1741 ggtctaccgc aacgtggaca tcctgctgat cgacgacatc cagttcctgg cggacaagga 1801 ggcgacggtc gaggagttct tccacacctt caacacgctc tacaacaaca acaagcaggt 1861 ggtcatcacc tcggacctgc cgcccaagca gctctccggg ttcgaggacc ggctgcgctc 1921 ccgcttcgag tgggggctga tcacggacat ccagccgccg gacctcgaga cgcgcatcgc 1981 gatcctccgg aagaaggcgg aggccgaggg gctcgtggcc ccgccggagg cgctggagta 2041 catcgcctcg cgcatctcca cgaacatccg cgagctcgag ggtgcgctga tccgcgtgac 2101 cgcgttcgcc tcgctcaacc ggcagaccgt ggacatcgag ctggccgagc atgtgctcaa 2161 ggacctgatc acggacgaga cggcgcacga gatcacgccg gagctgatcc tgcacgccac 2221 gggggagtac ttcaatctca ccctcgagga gctgaccagc aagtcccgca cccgcacgct 2281 ggtgacggcg cggcagatcg ccatgtacct gctgcgcgag ctgaccgaga tgtcgctgcc 2341 caagatcgga caggtcctgg gcggccgcga ccacaccacc gtcatccacg cggaccgcaa 2401 gatccgcgag ctgatggccg agcggcggac gatctacaac caggtcaccg agctcaccaa 2461 cgagatcaag cggaaacagc gcggcgcctg agccacccgc cgtcccaccc tcgagtcccg 2521 gttcgaacct tcgaccggga ctcgagcctt cgtccacatt gtccaccggt gtggacaact 2581 ctgtggagaa cccccgtcgc tgtggacaac ctccgcccct gtggatggcg gaatcaggcc 2641 gctcacgccg cgtccacacc cctgggggcg acgacggtgc acaggctggt gaatcacacg 2701 gatgccattc ggcggtccac acgtatccac agatctccac agccgaaaac cgcatgatct 2761 cgcgggttcc gacgggtggg gacgcaccct gtggacagga atgtgggcaa ctggggggtg 2821 gatccgggga cgagtctgga caactccggt cgaacctgtg cagagcaggg ggaggacctc 2881 cggggctcgt ccacaccgga tgcacacccc gaacgagcgg gcgtccacac ggttccccca 2941 accggatccg catgctgacg cggcagaacg gcgttctcca caggatccac acccgctacc 3001 actccttctg atcccccttc agagattccg ccaagcaaca tcatctccgc cgcctcgcac 3061 cgagctgagg gccccgagag cccgggctgc gcccgagcac tcagaatcct tggaatcaca 3121 cgcttgtcat tcgcttccga cctgacaccc gccgcgcgga ccgcctatgc tgaggcccgc 3181 gtgtcgccgt tcccccgggc ccgcaacccc cgtcatcacc cccagagagg cagtgcagct 3241 cgtgaagttc accgtcgaac gcgacatcct caccgacgcc gtctcctggg ccgcccgctc 3301 cctgtccccg cgcccgccgg tgccggtgct ctcgggtctg ctgatcacgg ccgaggccgg 3361 cgtcgtgtcc atcgcgagct tcgactacga gacctccgcg cgcctggaga tcgaggcgga 3421 cgtcgagacc gcgggccagg tgctcgtctc cggccgtctg ctgaacgaca tcgtccgctc 3481 cctgccgcag gcccaggtca cggtggagct cgacggcggc aaggtgctcg tcacctgccg 3541 atcctcccgc ttctccctgg ccacgatgcc cgtgggcgac tacccggccc tgcccgagct 3601 gccggccccc gcaggcaccg tggacggggc cgcgttcgcc cacgccgtcg cccaggtcac 3661 cgtcgccgcc tccaaggacg acaccctgcc gatcctcacc gccgtgaagg tggagatcga 3721 gggcgacacc ataaccttcc tggccacgga ccgctaccgg ctggccatga aggagatccg 3781 ctggaccccg gcggacccgt ccatctcgac gtcgctgctc atcaaggcgc gcacgctcac 3841 cgaggtcgcc aagtccctcg gctccggcgg cgacctggag atcctcctcg gtcagaccgc 3901 cgacctcgtg ggcttcgcct ccggtggccg ccgcaccacc tccgtgctcg tggacggcga 3961 gtaccccaag atccgctcgc tcttccccga gtccagcccc atccaggccg tggtggacac 4021 cgccgccctg gtcgaggcgt cccgccgcgt ggccctcgtg gccgagcgca acaccgccct 4081 gcgcatggtg ttcacggagg gccaggtcac cctggacgcc ggcaccggcg atgacgcgag 4141 cgccaacgag tccgtgccgt gcacgctcga g // LOCUS HUMLYAM1 1192 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 1. ACCESSION M32406 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 1 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1192) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.-J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept 1040 + 1042 leukocyte adhesion molecule-1 (LAM-1), exon 1 mRNA 931 > 1192 leukocyte adhesion molecule-1 mRNA IVS 1043 > 1192 leukocyte adhesion molecule-1 intron A BASE COUNT 410 a 185 c 326 g 271 t ORIGIN Chromosome 1q22-25. 1 agagagctgt tataaagatt aaacaatata ataaatatgg cgcgtgagct tcagagtttt 61 tgttgttgtt attattattt tcctaaaaat gcaaatctga tttgcatttg actcattgac 121 tcacatcagt gggtcttcct ttttattgtc cttcatcata tgggtcctaa tttcacatgc 181 agtcttataa aaccatctca ttttatagtc caagaatatt aaaggtactt gtaggctccc 241 aaacctacac ggtgaaaagc tagagagcat gggctctctt caggggttaa cttcaggaag 301 tgccactaac aaggacgtcc actaggtggt gagcaaggaa agacggaggt gaaggaaccg 361 aaacgagtca agtccactgc ttagctctac tgaagttttg caaacatcat aaatatgtct 421 gaaatgcagt tttgatttgt agtatttgca atttccaagg gccatttacc acaggtagcc 481 aagagttagt ttagcattta tgaaaaagat aggggagggt ggtggttaag aaggaggtgg 541 aggagagagt gaaggaggaa gaggagaaca agaaccaaac aaaaacaaga acaagaacaa 601 gtagaagaag aggagcaggg aggaaaaaga agaggaagaa gaacagcaac aacaatgagt 661 gaaggaggag gagggtaagg aaagatgcat aggagaatgg aaggaaggat agaaaggagg 721 gaaggaagag agaatctagt cacattactt tctgatcagc agttcatttt tgtctcagtg 781 ggaggcaata gaggccagtc taggaaaggg gtggggaaag aggaaagaga agtgcaggag 841 gaaggggagg cccaagggga ggaggaggag gatgtgagac tgggttagag aaatgaaaga 901 aagcaaggct ttctgttgac attcagtgca gtctacctgc agcacagcac actccctttg 961 ggcaaggacc tgagaccctt gtgctaagtc aagaggctca atgggctgca gaagaactag 1021 agaaggacca agcaaagcca tggtgagcct ttcagcctaa aagacgttta gatgctcaga 1081 tagaaactct tggggttgta gaggcaggtg gcaaggatag gaatcacccc atttcaattc 1141 tggttttaaa taatatagaa actaaacatt ttctcagacc ctcaaaaaaa gt // LOCUS HUMLYAM2 363 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 2. ACCESSION M32407 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 2 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 363) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 229 + 310 leukocyte adhesion molecule-1 (LAM-1), exon 2 mRNA < 1 > 363 leukocyte adhesion molecule-1 mRNA IVS < 1 228 leukocyte adhesion molecule-1 intron A IVS 311 > 363 leukocyte adhesion molecule-1 intron B BASE COUNT 115 a 51 c 69 g 128 t ORIGIN Chromosome 1q22-25. 1 cactgagact aagcgtaaaa taaatagaac aaacaaactg tgcatcagtt ctgatgtaaa 61 tttgaagtaa ttttcatcta tgtctgagaa acctgttacc tcagacaggg ttagtagaca 121 tatgtgtttt attctgatta ttaagaaagt tgtaagcacc acctcaaagg ctataaatgt 181 gtggtttaag ggtatacatc taaatataat tttgtatttc atttgcagat atttccatgg 241 aaatgtcaga gcacccagag ggacttatgg aacatcttca agttgtgggg gtggacaatg 301 ctctgttgtg gtatgttatg atatttatat atcactaagt ctattttact tatattcatt 361 ttt // LOCUS HUMLYAM3 531 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 3. ACCESSION M32408 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 3 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 531) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 73 + 459 leukocyte adhesion molecule-1 (LAM-1), exon 3 mRNA < 1 > 531 leukocyte adhesion molecule-1 mRNA IVS < 1 72 leukocyte adhesion molecule-1 intron B IVS 460 > 531 leukocyte adhesion molecule-1 intron C BASE COUNT 172 a 106 c 138 g 115 t ORIGIN Chromosome 1q22-25. 1 ctggagtagt gctaggttct ttttagctgt aacattatgt aagtctgcat aggtcacact 61 gatgtcttgc agatttcctg gcacatcatg gaaccgactg ctggacttac cattattctg 121 aaaaacccat gaactggcaa agggctagaa gattctgccg agacaattac acagatttag 181 ttgccataca aaacaaggcg gaaattgagt atctggagaa gactctgcct ttcagtcgtt 241 cttactactg gataggaatc cggaagatag gaggaatatg gacgtgggtg ggaaccaaca 301 aatctctcac tgaagaagca gagaactggg gagatggtga gcccaacaac aagaagaaca 361 aggaggactg cgtggagatc tatatcaaga gaaacaaaga tgcaggcaaa tggaacgatg 421 acgcctgcca caaactaaag gcagccctct gttacacagg tagggagtga caagacggct 481 atgctgcctc agactcagga agggccacgg ttaagagaat actcagattt a // LOCUS HUMLYAM4 832 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 4. ACCESSION M32409 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 4 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 832) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 161 + 268 leukocyte adhesion molecule-1 (LAM-1), exon 4 mRNA < 1 > 832 leukocyte adhesion molecule-1 mRNA IVS < 1 160 leukocyte adhesion molecule-1 intron C IVS 269 > 832 leukocyte adhesion molecule-1 intron D BASE COUNT 241 a 163 c 150 g 278 t ORIGIN Chromosome 1q22-25. 1 aaaattttag ccatatgatt tttatgctat gaatttacca aataaacctt tcctgattat 61 ttaaatcatc tcagacaaaa ggttatctat gtctaaagaa atgactttga gtactaaaat 121 gtaatcacat taaaatattt tttttctgac ctccttaaag cttcttgcca gccctggtca 181 tgcagtggcc atggagaatg tgtagaaatc atcaataatt acacctgcaa ctgtgatgtg 241 gggtactatg ggccccagtg tcagtttggt aagtctcttt cctttctttg cttcttctta 301 ggtaaagtca caggaatcat tatagcttat catgaagctg gttggaacaa aatgatacta 361 gccactctga gaaatgggaa gttttgatca gaaagctctg ctttcacaat attgttacct 421 ttccgtaaag atttcataag tcagcatgaa gtttcgattc acttctcaac aagtcttttt 481 gagtaccaca agaagcacag tgttgggata aagctgtcag ggttacaata aggaattagc 541 atggtagatt cccgctctca agaagctcac gatctaatga gcttgttaga ttaattagaa 601 ctctaaggtc tggaagaaac tatgccattt atcattagga ggctgagtta cccagaaagt 661 atcttgcttt ttccttctag tagttccttt ccttcttgca gttctccaca cttaacacat 721 gtgctctgta gcacactgac tttgctggtg gccttctctc tcattttgca catggccaaa 781 aaacatgtca tctttaagac attgttcaaa gacagtttct tctaggaagc tt // LOCUS HUMLYAM5 712 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 5. ACCESSION M32410 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 5 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 712) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 377 + 562 leukocyte adhesion molecule-1 (LAM-1), exon 5 mRNA < 1 > 712 leukocyte adhesion molecule-1 mRNA IVS < 1 376 leukocyte adhesion molecule-1 intron D IVS 563 > 712 leukocyte adhesion molecule-1 intron E BASE COUNT 200 a 145 c 139 g 228 t ORIGIN Chromosome 1q22-25. 1 ctctgatgtg atagttattt cccgactaag ctggtcattc ccagttacac ctatttggct 61 ttaaggattc tcactacaga taatactgaa gataataata tgaagactag ctaatgttta 121 cttagaattt ctgatgagtc aggctttgtt ctaacgtcct tgacttatgc taattgaatt 181 acatttagtt tccatatcaa tttgataaag ataacacaat ttcattattc ctcttatata 241 gatgaagaaa ctgaagttgg agggttcaag taaccttgtt taaaggcaca tggttatcaa 301 gtggcagggc taggattcaa atccaggcgt cagttcctct taactcttcc ccatactgtt 361 tctttcccta ttgaagtgat tcagtgtgag cctttggagg ccccagagct gggtaccatg 421 gactgtactc accctttggg aaacttcagc ttcagctcac agtgtgcctt cagctgctct 481 gaaggaacaa acttaactgg gattgaagaa accacctgtg gaccatttgg aaactggtca 541 tctccagaac caacctgtca aggtgagtaa cttcagacta gaggttttgt catgcaatcc 601 tgggcttaca gtcagaacat tcagtagaag tttgctgaga agtcaaactt aggatcctaa 661 tttaacctaa cttttgttta acctactgtg atgtttctca aaggacttat tc // LOCUS HUMLYAM6 451 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 6. ACCESSION M32411 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 6 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 451) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 240 + 425 leukocyte adhesion molecule-1 (LAM-1), exon 6 mRNA < 1 > 451 leukocyte adhesion molecule-1 mRNA IVS < 1 239 leukocyte adhesion molecule-1 intron E IVS 426 > 451 leukocyte adhesion molecule-1 intron F BASE COUNT 121 a 107 c 89 g 134 t ORIGIN Chromosome 1q22-25. 1 gagggtcacc ttagctaggg cagcagcctg gagtagctac tcctctcccc acagctttca 61 atgcttcctt gccttcatct ctcattcacc acccaccatc attctcaaga aaataaagcc 121 tggaagcaat atcacaagta atgtagtcag gcagctttgg ctaaaaatcc aaagctcaag 181 ggagggtctc tactcagaaa tactgttttg tctttttttt tttttctttt tcattgaagt 241 gattcagtgt gagcctctat cagcaccaga tttggggatc atgaactgta gccatcccct 301 ggccagcttc agctttacct ctgcatgtac cttcatctgc tcagaaggaa ctgagttaat 361 tgggaagaag aaaaccattt gtgaatcatc tggaatctgg tcaaatccta gtccaatatg 421 tcaaagtgag taagtttgtc ctggaactga a // LOCUS HUMLYAM7 543 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 7. ACCESSION M32412 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 7 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 543) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 300 + 428 leukocyte adhesion molecule-1 (LAM-1), exon 7 mRNA < 1 > 543 leukocyte adhesion molecule-1 mRNA IVS < 1 299 leukocyte adhesion molecule-1 intron F IVS 429 > 543 leukocyte adhesion molecule-1 intron G BASE COUNT 174 a 99 c 122 g 148 t ORIGIN Chromosome 1q22-25. 1 tatcagaact aagaaagctt gggctgcagg tcgactctag gtgcattttc aggaactcta 61 tgaaccacaa atctgggcat tgagattctg taggcattag actagcaagg ctggtcagtc 121 tttgcctatg ctgtagactc atcaggggcc ttcccatgcc agtttcctca tctgtcaaat 181 ggcatcattt gggctactac tgggagatgt aaggaggaaa aaagtcaaat atcatgagat 241 agactaagga aataatgctg gtggtctcat gctatgtgcc ttactgattt ctctttcaga 301 attggacaaa agtttctcaa tgattaagga gggtgattat aaccccctct tcattccagt 361 ggcagtcatg gttactgcat tctctgggtt ggcatttatc atttggctgg caaggagatt 421 aaaaaaaggt atgtgagttt aacttcacat gaaaagaaca caactttaaa gtgaaaaaga 481 aaaaaaaaag aaacccacag gaaattaaat gtgatagatc aacacaagca ggatgccaag 541 ctt // LOCUS HUMLYAM8 524 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 8. ACCESSION M32413 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 8 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 524) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 380 + 398 leukocyte adhesion molecule-1 (LAM-1), exon 8 mRNA < 1 > 524 leukocyte adhesion molecule-1 mRNA IVS < 1 379 leukocyte adhesion molecule-1 intron G IVS 399 > 524 leukocyte adhesion molecule-1 intron H BASE COUNT 146 a 104 c 103 g 171 t ORIGIN Chromosome 1q22-25. 1 tagtttacag tattagcagc tgtccctcaa ggaagaatct gcaggtagat gagatgcaga 61 ttgggtggga taaacacttg aatgacatat tgggtcttgc caccaggcaa tttagcaatt 121 ctgtcttctt gagtagcacg gagatggaat ggaacctcag gaggcatctg catcaacatg 181 tctgttctgt attagtgtct accactgttt attaagccag ttcctcaaat ctcctttgac 241 acagataggg tccacctaac aaatacctaa tatacttcaa aagacagttt tgagagtggg 301 agtcttcctt ctcccttact tgaaaaactt taaattgtct aatttttgct aatgcctttt 361 tctctatttt ctatttcagg caagaaatcc aagagaaggt aagttttatt agtggcgagg 421 agtttccaca tctgctgatt cattctctac ttcttaagtt acttctgctc tagctagaca 481 catacccata gtagttatta ctgggtctat caatgacaga tagg // LOCUS HUMLYAM9 1696 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human leukocyte adhesion molecule-1 (LAM-1), exon 9. ACCESSION M32414 KEYWORDS leukocyte adhesion molecule-1. SEGMENT 9 of 9 SOURCE Human mature lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1696) AUTHORS Ord,D.C., Ernst,T.J., Zhou,L.J., Rambaldi,A., Spertini,O., Griffin,J. and Tedder,T.F. TITLE Structure of the gene encoding the human leukocyte adhesion molecule-1 (TQ1, Leu-8) of lymphocytes and neutrophils JOURNAL J. Biol. Chem. 265, 7760-7767 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Tedder, 01-MAR-1990. FEATURES from to/span description pept + 310 328 leukocyte adhesion molecule-1 (LAM-1), exon 9 mRNA < 1 1414 leukocyte adhesion molecule-1 mRNA IVS < 1 309 leukocyte adhesion molecule-1 intron H BASE COUNT 443 a 387 c 292 g 574 t ORIGIN Chromosome 1q22-25. 1 ataagcatca ctaaagagct tgttaggggt gcagaatctc aggctccact cagacctact 61 gaatcagagt ctgcatttta acaccatctc tgagtggtaa ggacatgaaa atctgagaag 121 tgctgctact agggtttgct tacatttgtt catcttcaga ggttcctaaa gcctggcctc 181 ttgtctgaga tttccagctg aaagcatttc cttgctcctc ttctcatctc taatgaatat 241 ttacctttac tactaacact ccaagttttg caatttttaa actcttatta tcttttgttt 301 ttctttcagt atgaatgacc catattaaat cgcccttggt gaaagaaaat tcttggaata 361 ctaaaaatca tgagatcctt taaatccttc catgaaacgt tttgtgtggt ggcacctcct 421 acgtcaaaca tgaagtgtgt ttccttcagt gcatctggga agatttctac ctgaccaaca 481 gttccttcag cttccatttc acccctcatt tatccctcaa cccccagccc acaggtgttt 541 atacagctca gctttttgtc ttttctgagg agaaacaaat aagaccataa agggaaagga 601 ttcatgtgga atataaagat ggctgacttt gctctttctt gactcttgtt ttcagtttca 661 attcagtgct gtacttgatg acagacactt ctaaatgaag tgcaaatttg atacatatgt 721 gaatatggac tcagttttct tgcagatcaa atttcgcgtc gtcttctgta tacgtggagg 781 tacactctat gaagtcaaaa gtctacgctc tcctttcttt ctaactccag tgaagtaatg 841 gggtcctgct caagttgaaa gagtcctatt tgcactgtag cctcgccgtc tgtgaattgg 901 accatcctat ttaactggct tcagcctccc caccttcttc agccacctct ctttttcagt 961 tggctgactt ccacacctag catctcatga gtgccaagca aaaggagaga agagagaaat 1021 agcctgcgct gttttttagt ttgggggttt tgctgtttcc ttttatgaga cccattccta 1081 tttcttatag tcaatgtttc ttttatcacg atattattag taagaaaaca tcactgaaat 1141 gctagctgca actgacatct ctttgatgtc atatggaaga gttaaaacag gtggagaaat 1201 tccttgattc acaatgaaat gctctccttt cccctgcccc cagacctttt atccacttac 1261 ctagattcta catattcttt aaatttcatc tcaggcctcc ctcaacccca ccacttcttt 1321 tataactagt cctttactaa tccaacccat gatgagctcc tcttcctggc ttcttactga 1381 aaggttaccc tgtaacatgc aattttgcat ttgaataaag cctgcttttt aagtgttaac 1441 tagtttgcct agtttgttat tttgaaaatt gatcatatgt tttgttttct ccccagtgag 1501 ttacatgctc cttcagggca gagtttgtgt cagatccctg gagtatctag tgcattactt 1561 gacactcaat aaatgaatgt tcaaataaat cagaaagagc atacagtgca ctgctgatat 1621 aagtttcagc atccctcttt ctctatggca tctgatgacc tgggtcagat atcacctaat 1681 gtcaacagct gaattc // LOCUS RATALADPS 1612 bp ds-DNA ROD 17-JUL-1990 DEFINITION Rat delta-aminolevulinate dehydratase processed pseudogene, complete cds. ACCESSION J04764 KEYWORDS delta-aminolevulinate dehydratase; pseudogene. SOURCE Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1612) AUTHORS Bishop,T.R., Frelin,L.P. and Boyer,S.H. TITLE Isolation and characterization of a rat delta-aminolevulinate dehydratase processed pseudogene JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.R.Bishop, 20-APR-1990. FEATURES from to/span description pept.ps 271 1264 pseudo-delta-aminolevulinate dehydratase rpt 101 108 8 bp flanking direct repeat rpt 1385 1392 8 bp flanking direct repeat BASE COUNT 404 a 411 c 418 g 379 t ORIGIN 1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt 61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt 121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca 181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta 241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc 301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa 361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct 421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga 481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca 541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg 601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat 661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg 721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat 781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac 841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat 901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga 961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca 1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct 1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa 1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc 1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga 1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa 1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga 1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta 1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt 1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg 1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at // LOCUS BOVPDEAP 585 bp ss-mRNA MAM 17-JUL-1990 DEFINITION Bovine cone photoreceptor cyclic nucleotide phosphodiesterase alpha'-subunit (PDE), partial cds. ACCESSION M33140 M29465 KEYWORDS cone photoreceptor cyclic nucleotide phosphodiesterase. SOURCE Bovine dark-adapted frozen retina, cDNA to mRNA, clone BC-alpha-1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 585) AUTHORS Charbonneau,H., Prusti,R.K., LeTrong,H., Sonnenburg,W.K., Mullaney,P.J., Walsh,K.A. and Beavo,J.A. TITLE Identification of a noncatalytic cGMP-binding domain conserved in both the cGMP-stimulated and photoreceptor cyclic nucleotide phosphodiesterases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 288-292 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 585 cone photoreceptor cyclic nucleotide phosphodiesterase alpha'-subunit (AA at 1) BASE COUNT 198 a 115 c 126 g 146 t ORIGIN 1 agagaagtca tcttttataa aatcatcgat tacattttac atggaaaaga agagatcaaa 61 gtcattccga cacctcccat ggaccactgg actctcatta gtgggttgcc aacatatgtt 121 gctgaaaatg gatttatctg caacatgctg aacgccccgg cggatgaata cttcacgttt 181 cagaaaggac ctgtagatga aactggctgg gtcattaaaa atgtcttgtc cctgcctatt 241 gtcaacaaaa aggaagacat cgtgggcgta gctacatttt acaacaggaa ggatggaaag 301 ccttttgatg aatatgatga gcacatcgct gagactctca cacagtttct tggatggtct 361 ctcttaaata ctgacaccta tgagaaaatg aataagctgg agaacagaaa ggacatagcc 421 caggaaatgc tcatgaacca caccaaggct acacctgatg agatcaagtc tattttgaaa 481 tttaaagaga agttaaatat agatgtaatt gaagactgtg aagaaaaaca gcttgtcaca 541 attttgaagg aggacctgcc agacccacgg actgcagacc tgtat // LOCUS CHKG1CLSE 240 bp ds-DNA VRT 17-JUL-1990 DEFINITION Chicken delta-1-crystallin gene, intron 3 lens-specific enhancer cor segments B3 and B4. ACCESSION M33954 KEYWORDS delta-1-crystallin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 240) AUTHORS Goto,K., Okada,T.S. and Kondoh,H. TITLE Functional cooperation of lens-specific and nonspecific elements in the delta-1-crystallin enhancer JOURNAL Mol. Cell. Biol. 10, 958-964 (1990) STANDARD simple staff_review FEATURES from to/span description site 17 116 core segment B3 site 112 235 core segment B4 BASE COUNT 61 a 52 c 53 g 74 t ORIGIN 1 gtcagtgagg tgtgctcagc atgacctgcc ctcccaccct cttcagactg aacattcctg 61 aggaattgtt tcagtatgaa ttaggaatat tctttttcca atggcacttg ggatcccttt 121 gtgtctggct gcctgagtta gtagaagaca atgcacaata ttgtataggg gtgaagaaga 181 gtcagccact aagcactttt tctgaaatat tcattgttgt tgctcaccta ccatggacaa // LOCUS CHKOVAL 9206 bp ds-DNA VRT 17-JUL-1990 DEFINITION Chicken ovalbumin gene, complete cds. ACCESSION J00895 KEYWORDS ovalbumin. SOURCE Chicken oviduct DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1343 to 8906) AUTHORS Woo,S.L.C., Beattie,W.G., Catterall,J.F., Dugaiczyk,A., Staden,R., Brownlee,G.G. and O'Malley,B.W. TITLE Complete nucleotide sequence of the chicken chromosomal ovalbumin gene and its biological significance JOURNAL Biochemistry 20, 6437-6446 (1981) STANDARD full staff_review REFERENCE 2 (bases 1043 to 1562; 2675 to 4732; 8885 to 9206) AUTHORS Benoist,C., O'Hare,K., Breathnach,R. and Chambon,P. TITLE The ovalbumin gene-sequence of putative control regions JOURNAL Nucleic Acids Res. 8, 127-142 (1980) STANDARD full staff_review REFERENCE 3 (bases 1357 to 1389; 2941 to 3052; and ivs junctions) AUTHORS Breathnach,R., Benoist,C., O'Hare,K., Gannon,F. and Chambon,P. TITLE Ovalbumin gene: evidence for a leader sequence in mRNA and DNA sequences at the exon-intron boundaries JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 4853-4857 (1978) STANDARD full staff_review REFERENCE 4 (bases 1282 to 1420; 2952 to 2999) AUTHORS Gannon,F., O'Hare,K., Perrin,F., LePennec,J.P., Benoist,C., Cochet,M., Breathnach,R., Royal,A., Garapin,A., Cami,B. and Chambon,P. TITLE Organisation and sequences at the 5' end of a cloned complete ovalbumin gene JOURNAL Nature 278, 428-434 (1979) STANDARD full staff_review REFERENCE 5 (bases 1343 to 8906; exons only) AUTHORS McReynolds,L., O'Malley,B.W., Nisbet,A.D., Fothergill,J.E., Givol,D., Fields,S., Robertson,M. and Brownlee,G.G. TITLE Sequence of chicken ovalbumin mRNA JOURNAL Nature 273, 723-728 (1978) STANDARD full staff_review REFERENCE 6 (bases 2675 to 5042) AUTHORS Robertson,M.a., Staden,R., Tanaka,Y., Catterall,J.F., O'Malley,B.W. and Brownlee,G.G. TITLE Sequence of three introns in the chick ovalbumin gene JOURNAL Nature 278, 370-372 (1979) STANDARD full staff_review REFERENCE 7 (bases 1 to 1042) AUTHORS Heilig,R., Muraskowsky,R. and Mandel,J.L. TITLE The ovalbumin gene family: The 5' end region of the X and Y genes JOURNAL J. Mol. Biol. 156, 1-19 (1982) STANDARD full staff_review REFERENCE 8 (bases 5576 to 5624) AUTHORS Lai,E.C., Woo,S.L.C., Dugaiczyk,A. and O'Malley,B.W. TITLE The ovalbumin gene: Alleles created by mutations in the intervening sequences of the natural gene JOURNAL Cell 16, 201-211 (1979) STANDARD simple staff_entry REFERENCE 9 (bases 443 to 611) AUTHORS Schweers,L.A., Frank,D.E., Weigel,N.L. and Sanders,M.M. TITLE The steroid-dependent regulatory element in the ovalbumin gene does not function as a typical steroid response element JOURNAL J. Biol. Chem. 265, 7590-7595 (1990) STANDARD simple staff_entry COMMENT Eight exons reported. Sequence homologies with adenovirus early 1a, adenovirus major late, chicken conalbumin, chkx and chky genes noted for 5' flanking sequence. OV1.3 is identical to OV1.8 except that intron E is shorter by 522 nucleotides [8]. FEATURES from to/span description pept 2996 3163 ovalbumin, exon 1 3415 3465 ovalbumin, exon 2 4047 4175 ovalbumin, exon 3 4576 4693 ovalbumin, exon 4 5652 5794 ovalbumin, exon 5 6126 6281 ovalbumin, exon 6 7864 8259 ovalbumin, exon 7 pre-msg 1343 8906 oval mRNA and introns IVS 1390 2978 oval intron A IVS 3164 3414 oval intron B IVS 3466 4046 oval intron C IVS 4176 4575 oval intron D IVS 4694 5651 oval intron E IVS 5795 6125 oval intron F IVS 6282 7863 oval intron G allele 1282 1282 g may be c [1] allele 1309 1309 g may be a [1] allele 1376 1376 g may be c [1],[7] allele 1385 1385 g may be a [1],[7] allele 1393 1393 a may be g [1] allele 3010 3010 t may be c [1] allele 3154 3154 a may be g [1] allele 5747 5747 g may be a [1] allele 8032 8032 can be g [5] conflict 1471 1471 c in [1]; t in [2],[7] conflict 1523 1523 a in [1]; t in [2],[7] conflict 1538 1539 at in [1]; a in [2] conflict 2681 2681 t in [1],[6]; g in [2] conflict 2696 2696 a in [1],[6]; g in [2] conflict 3552 3552 a in [1],[6]; g in [2] conflict 3683 3683 c in [1],[6]; t in [2] conflict 3934 3934 a in [1],[6]; c in [2] conflict 3960 3960 a in [1],[6]; g in [2] conflict 3994 3994 t in [1],[6]; c in [2] conflict 4207 4208 tt in [1],[6]; ttt in [2] conflict 4396 4396 t in [1],[6]; c in [2] conflict 5636 5636 c in [1]; g in [3] conflict 6100 6100 t in [1]; tc in [3] conflict 6119 6119 t in [1]; tg in [3] conflict 8492 8505 gactcacagtactg in [1]; g in [5] site 443 611 steroid-dependent regulatory element [9] BASE COUNT 2994 a 1749 c 1721 g 2742 t ORIGIN 5 bp upstream of PstI site. 1 ctgcagactg acatgcattt cataggtaga gataacattt actgggaagc acatctatca 61 tcataaaaag caggcaagat tttcagactt tcttagtggc tgaaatagaa gcaaaagacg 121 tgattaaaaa caaaatgaaa caaaaaaaat cagttgatac ctgtggtgta gacatccagc 181 aaaaaaatat tatttgcact accatcttgt cttaagtcct cagacttggc aaggagaatg 241 tagatttcta cagtatatat gttttcacaa aaggaaggag agaaacaaaa gaaaatggca 301 ctgactaaac ttcagctagt ggtataggaa agtaattctg cttaacagag attgcagtga 361 tctctatgta tgtcctgaag aattatgttg tacttttttc ccccattttt aaatcaaaca 421 gtgctttaca gaggtcagaa tggtttcttt actgtttgtc aattctatta tttcaataca 481 gaacaatagc ttctataact gaaatatatt tgctattgta tattatgatt gtccctcgaa 541 ccatgaacac tcctccagct gaatttcaca attcctctgt catctgccag gccattaagt 601 tattcatgga agatctttga ggaacactgc aagttcatat cataaacaca tttgaaattg 661 agtattgttt tgcattgtat ggagctatgt tttgctgtat cctcagaaaa aaagtttgtt 721 ataaagcatt cacacccata aaaagataga tttaaatatt ccagctatag gaaagaaagt 781 gcgtctgctc ttcactctag tctcagttgg ctccttcaca tgcatgcttc tttatttctc 841 ctattttgtc aagaaaataa taggtcacgt cttgttctca cttatgtcct gcctagcatg 901 gctcagatgc acgttgtaga tacaagaagg atcaaatgaa acagacttct ggtctgttac 961 tacaaccata gtaataagca cactaactaa taattgctaa ttatgttttc catctctaag 1021 gttcccacat ttttctgttt tcttaaagat cccattatct ggttgtaact gaagctcaat 1081 ggaacatgag caatatttcc cagtcttctc tcccatccaa cagtcctgat ggattagcag 1141 aacaggcaga aaacacattg ttacccagaa ttaaaaacta atatttgctc tccattcaat 1201 ccaaaatgga cctattgaaa ctaaaatcta acccaatccc attaaatgat ttctatggcg 1261 tcaaaggtca aacttctgaa gggaacctgt gggtgggtca caattcaggc tatatattcc 1321 ccagggctca gccagtgtct gtacatacag ctagaaagct gtattgcctt tagcagtcaa 1381 gctcgaaagg taagcaactc tctggaatta ccttctctct atattagctc ttacttgcac 1441 ctaaacttta aaaaattaac aattattgtg ctatgtgttg tatctttaag ggtgaagtac 1501 ctgcgtgata ccccctataa aaacttctca cctgtgtatg cattctgcac tattttatta 1561 tgtgtaaaag ctttgtgttt gttttcagga ggcttattct ttgtgcttaa aatatgtttt 1621 taatttcaga acatcttatc ctgtcgttca ctatctgata tgctttgcag tttgcttgat 1681 taacttctag ccctacagag tgcacagaga gcaaaatcat ggtgttcagt gaattctggg 1741 gagttatttt aatgtgaaaa ttctctagaa gtttaattcc tgcaaagtgc agctgctgat 1801 cactacacaa gataaaaatg tggggggtgc ataaacgtat attcttacaa taatagatac 1861 atgtgaactt atatacagaa aagaaaatga gaaaaatgtg tgtgtgtata ctcacacacg 1921 tggtcagtaa aaacttttga ggggtttaat acagaaaatc caatcctgag gccccagcac 1981 tcagtacgca tataaagggc tgggctctga aggacttctg actttcacag attatataaa 2041 tctcaggaaa gcaactagat tcatgctggc tccaaaagct gtgctttata taagcacact 2101 ggctatacaa tagttgtaca gttcagctct ttataataga aacagacaga acaagtataa 2161 atcttctatt ggtctatgtc atgaacaaga attcattcag tggctctgtt ttatagtaaa 2221 cattgctatt ttatcatgtc tgcatttctc ttctgtctga atgtcaccac taaaatttaa 2281 ctccacagaa agtttatact acagtacaca tgcatatctt tgagcaaagc aaaccatacc 2341 tgaaagtgca atagagcaga atatgaatta catgcgtgtc tttctcctag actacatgac 2401 cccatataaa ttacattact tatctattct gccatcacca aaacaaaggt aaaaatactt 2461 ttgaagatct actcatagca agtagtgtgc aacaaacaga tatttctcta catttatttt 2521 tagggaataa aaataagaaa taaaatagtc agcaagcctc tgctttctca tatatctgtc 2581 caaacctaaa gtttactgaa atttgctctt tgaatttcca gttttgcaag cctatcagat 2641 tgtgttttaa tcagaggtac tgaaaagtat caatgaattc tagctttcac tgaacaaaaa 2701 tatgtagagg caactggctt ctgggacagt ttgctaccca aaagacaact gaatgcaaat 2761 acataaatag atttatgaat atggttttga acatgcacat gagaggtgga tatagcaaca 2821 gacacattac cacagaatta ctttaaaact acttgttaac atttaattgc ctaaaaactg 2881 ctcgtaattt actgttgtag cctaccatag agtaccctgc atggtactat gtacagcatt 2941 ccatccttac attttcactg ttctgctgtt tgctctagac aactcagagt tcaccatggg 3001 ctccatcggt gcagcaagca tggaattttg ttttgatgta ttcaaggagc tcaaagtcca 3061 ccatgccaat gagaacatct tctactgccc cattgccatc atgtcagctc tagccatggt 3121 atacctgggt gcaaaagaca gcaccaggac acaaataaat aaggtgagcc tacagttaaa 3181 gattaaaacc tttgccctgc tcaatggagc cacagcactt aattgtatga taatgtccct 3241 tggaaactgc atagctcaga ggctgaaaat ctgaaaccag agttatctaa aagtgtggcc 3301 acctccaact cccagagtgt tacccaaatg cactagctag aaatcttgaa actggattgc 3361 ataacttctt tttgtcataa ccattatttc agctactatt attttcaatt acaggttgtt 3421 cgctttgata aacttccagg attcggagac agtattgaag ctcaggtaca gaaataattt 3481 cacctccttc tctatgtccc tttcctctgg aagcaaaata cagcagatga agcaatctct 3541 tagctgttcc aagccctctc tgatgagcag ctagtgctct gcatccagca gttgggagaa 3601 cactgttcat aagaacagag aaaaagaagg aagtaacagg ggattcagaa caaacagaag 3661 ataaaactca ggacaaaaat accgtgtgaa tgaggaaact tgtggatatt tgtacgctta 3721 agcaagacag ctagatgatt ctggataaat gggtctggtt ggaaaagaag gaaagcctgg 3781 ctgatctgct ggagctagat tattgcagca ggtaggcagg agttccctag agaaaagtat 3841 gagggaatta cagaagaaaa acagcacaaa attgtaaata ttggaaaagg accacatcag 3901 tgtagttact agcagtaaga cagacaggat gaaaaatagt tttgtaaaca gaagtatcta 3961 actactttac tctgttcata cactacgtaa aacttactaa gtaataaaac tagaataaca 4021 acatctttct ttctctttgt attcagtgtg gcacatctgt aaacgttcac tcttcactta 4081 gagacatcct caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta 4141 gactttatgc tgaagagaga tacccaatcc tgccagtaag ttgctctaaa atctgatctg 4201 agtgtattcc atgccaaagc tctaccattc tgtaatgcaa aaacagtcag agttccacat 4261 gtttcactaa gaaaatttct ttttctcttg tttttacaaa tgaaagagag gacaaataac 4321 atttctctat caccgacctg aaactctaca gtcttcagag aatgaatggc ttgctaaaag 4381 aatgtcaaat cttactatac agctatttca tattacacta ctaaatacac tataaggcat 4441 agcatgtagt aatacagtgt aaaatagctt tttacactac tatattatta atatctgtta 4501 attccagtct tgcatttcac atttgcaaaa cgttttgaaa ttcgtatctg aaagctgaat 4561 actcttgctt tacaggaata cttgcagtgt gtgaaggaac tgtatagagg aggcttggaa 4621 cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc ctgggtagaa 4681 agtcagacaa atggtaaggt agaacatgct ttgtacatag tgagagttgg ttcaccctaa 4741 tactgagaac ttggatatag ctcagccagc gtgctttgcg ttcaagctta ccagagctgt 4801 tgtatgcctg ttaagcaggg catacagtca tgaggctctt gaaaaatctt aacagacaaa 4861 gggcaatgga aaatcggagt taagggatgg tagggataaa atgcatagaa agaggtacca 4921 caattttgat ttttgcccta atgcctctct gcgtggttcc tcaatttttc tacttcattc 4981 ctcatctcct cagagcattc ctttccctca tgcttgaaac acagatgaaa gactgtgaat 5041 tctaactgag atgaaaacat ccacaaccac acaacctctg gtgtggagtc acattctgtg 5101 aaggcaaaaa ctaggccacg taatctatgc gtgcaagcta cgcgtaagct atgtgtgtga 5161 caggacaatg tgaggaacat actatgtgca caaggactgc agaataaaca ggagcaaagt 5221 ttttgaagaa aacagagtaa aatcctgttt tcctcttttg ttacattctt tacatatatc 5281 tcaaatttcc tctttggtta gaagcaagta atatttatgt ttcttggtac tgtttgggtt 5341 gaagaccatt ctgggataag agaaattcca gtggttcttc ccctaatcat aaaatgtcag 5401 gtttagtttt tttgtaacac agaaatctct tcatctttta tcttttgttg tgattcttga 5461 tagagagaga aacaagactt actgacaata gcagcaagaa aatcaatctt ggaagaacaa 5521 gattgcaatt gcaaaaacaa accaatgtcc ttgcccctac atcctcttcc ccataaattc 5581 tacattctct atctaccttg tgcttgccaa catgatatac gtaaactctc ttttcctatt 5641 cattcttaaa ggaattatca gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc 5701 aatggttctg gttaatgcca ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga 5761 agacacacaa gcaatgcctt tcagagtgac tgaggtatat gggcatacct tagagatgta 5821 atctagaatt tatgaagaga gtagacatgt tgttatatga acactgcatt agcgtatctg 5881 ctcatttgtc tgcatctctt tcagacactg tgttaaaagc agggaatttt ccttatgtct 5941 ctctcgtcac aatattcctg acattgcaaa gctcctgaga aataacttca gattccactt 6001 ttcctaggaa ggcttctgga tgagaactaa tcatcttaac tgtaactaga catttctgca 6061 tccaagaata atctttgtta aaactatatt ctctctctct tttttttttt tttttggttc 6121 tccagcaaga aagcaaacct gtgcagatga tgtaccagat tggtttattt agagtggcat 6181 caatggcttc tgagaaaatg aagatcctgg agcttccatt tgccagtggg acaatgagca 6241 tgttggtgct gttgcctgat gaagtctcag gccttgagca ggtatggccc tagaagttgg 6301 cttcagaata ttaaaaacac atggaaattt agctgttgta aagctctttt caacacagtt 6361 atcctaaaac atttaaccag cacaaatttc atcatgattc aatatgtgat tgttgcatag 6421 aagtgtagat ttgtcccact gggtcctgca atagcccatg ctgagcatgg cttgctgaaa 6481 gaactgcttt agagggtgaa aagtttgaca cagcagacaa gatgattctc acctaagcag 6541 ctgttactgt agtggcttga actctaaagg tcttgtatct ccattcctgt gcactgagga 6601 gcttcttgga aagttcatat aaggtttact agttctaact attatctcat ttggtggcac 6661 tcaatgtgct ttgttcacgt cttcataaat taatctatct aaaaattgga tgtggttaaa 6721 gcaatttcag aaataacatg tacataatgt acaattattg atatgaacag aacacaggca 6781 tagcatattg taattaggag gactgtagtt attttgaata ggaaacacaa tgtaataaat 6841 gagaattcat tgaaatgtta gtatgctaac tcaatctaaa ttataaagat aaagaggcat 6901 ttaatcacag ctagatttcc atcacttgtg acagacaggc atatgaatga ttatgtacag 6961 ctctaggaaa aaaagtatgt aggaaaacta gtacattttg attagaaagt ctgaaaatga 7021 ggtgccttga tcaaagagaa tacgtgtgtt tgagaaaaaa aaagtttgga tagaggtggt 7081 aagagagaat atattgaaat ggtgtttcta caaactgcca tggccagatt tgtgtaagag 7141 acattcagta agtaggcaag gaaagaaata ttactaggta caaagcaaca tcagtaatac 7201 caaaagaaac caattattcc agatgccaat ctcgtaatag ggttaagaga tttccacccc 7261 tctagtggtc accagtgcaa ccagtaactt tgctaattta cattttcttt ttttaaatgg 7321 cagatatagc tttgaactga gtgatcatga actggtactg tgtaatagat gaagacatac 7381 ttgacgacta aacttctgat ttttaaaaac tcaaattctc ttgaaagatc agttcccagt 7441 ctagtaacag ctgatagttt aagtatcagt aattggctac cattaacaac tggctcctga 7501 gaggtcttaa atgtagagac agctttaaac tcaaaagcac agagtgattt ttagaataga 7561 tttcccaagc aaagaaaata aacagggagg agctttaagg gagtagccat ctcattatta 7621 ttattattta aagaaatggc agcaagccta caaaagaaaa ataagacaga gcagagaaga 7681 aagagtcatg gtatgctttt ctatcttagc aaaattaatc tctacatgcc taggaaaaag 7741 ccatgacaag agcaatcagt tcaaaaggtg tatgcaaaaa accacataat agtaactagt 7801 actgcattgc caggaaggaa gttatgtcgc cattccatgg atctcattct catttccttg 7861 cagcttgaga gtataatcaa ctttgaaaaa ctgactgaat ggaccagttc taatgttatg 7921 gaagagagga agatcaaagt gtacttacct cgcatgaaga tggaggaaaa atacaacctc 7981 acatctgtct taatggctat gggcattact gacgtgttta gctcttcagc caatctgtct 8041 ggcatctcct cagcagagag cctgaagata tctcaagctg tccatgcagc acatgcagaa 8101 atcaatgaag caggcagaga ggtggtaggg tcagcagagg ctggagtgga tgctgcaagc 8161 gtctctgaag aatttagggc tgaccatcca ttcctcttct gtatcaagca catcgcaacc 8221 aacgccgttc tcttctttgg cagatgtgtt tccccttaaa aagaagaaag ctgaaaaact 8281 ctgtcccttc caacaagacc cagagcactg tagtatcagg ggtaaaatga aaagtatgtt 8341 atctgctgca tccagacttc ataaaagctg gagcttaatc tagaaaaaaa atcagaaaga 8401 aattacactg tgagaacagg tgcaattcac ttttccttta cacagagtaa tactggtaac 8461 tcatggatga aggcttaagg gaatgaaatt ggactcacag tactgagtca tcacactgaa 8521 aaatgcaacc tgatacatca gcagaaggtt tatgggggaa aaatgcagcc ttccaattaa 8581 gccagatatc tgtatgacca agctgctcca gaattagtca ctcaaaatct ctcagattaa 8641 attatcaact gtcaccaacc attcctatgc tgacaaggca attgcttgtt ctctgtgttc 8701 ctgatactac aaggctcttc ctgacttcct aaagatgcat tataaaaatc ttataattca 8761 catttctccc taaactttga ctcaatcatg gtatgttggc aaatatggta tattactatt 8821 caaattgttt tccttgtacc catatgtaat gggtcttgtg aatgtgctct tttgttcctt 8881 taatcataat aaaaacatgt ttaagcaaac acttttcact tgtagtattt gaagtacagc 8941 aaggttgtgt agcagggaaa gaatgacatg cagaggaata agtatggaca cacaggctag 9001 cagcgactgt agaacaagta ctagtgggtg agaagttgaa caagagtccc ctacaagcaa 9061 cttaatctaa taagctagtg gtctacatca gctaaaagag catagtgagg gatgaaattg 9121 gttctccttt ctaagcatca cctgggacaa ctcatctgga gcagtgtgtc caatctgccg 9181 ctgccctgat ctcggctggg gtgatg // LOCUS PMUCEN 150 bp ds-DNA PHG 17-JUL-1990 DEFINITION Bacteriophage Mu wild type DNA fragment with a gyrase cleavage site. ACCESSION M32302 KEYWORDS . SOURCE Bacteriophage Mu (wild type) DNA. ORGANISM Bacteriophage mu Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Pato,M., Howe,M. and Higgins,P. TITLE DNA gyrase binds to a centrally located replication enhancer (CEN) in the bacteriophage Mu genome JOURNAL Unpublished (1990) In Press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.L.Pato, 23-FEB-1990. FEATURES from to/span description site 49 50 DNA gyrase cleavage site in complimentary strand mut 44 44 a in wt; g in Mu nuB103 mut 50 50 g in wt; c in Mu nuB1 BASE COUNT 40 a 38 c 27 g 45 t ORIGIN Map position at 18.0 kb. 1 acgcgtcagc gccgctctga ggcaataaac agaatcaggc ataaaatcag ccgcacagat 61 tttttaaaac gcgccacggg atttttaaac cggtatttaa cggtgtatga atcccgtttt 121 atcttccttt cactttcttt ctccagtact // LOCUS RATRNRTR 2577 bp ds-DNA ROD 17-JUL-1990 DEFINITION Rat snRNP-associated polypeptide N, complete cds. ACCESSION J05497 KEYWORDS snRNP-associated polypeptide N. SOURCE Rat male adult (Fisher) DNA, clones rgV and rgIII2. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2577) AUTHORS Schmauss,C. and Lerner,M.R. TITLE The closely related small nuclear ribonucleoprotein polypeptides N and B/B' are distinguishable by antibodies as well as by differences in their mRNAs and gene structures JOURNAL J. Biol. Chem. 265, 10733-10739 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Schmauss, 13-APR-1990. FEATURES from to/span description pept 1314 2036 snRNP-associated polypeptide N signal 774 779 TATA box signal 698 702 CAAT box BASE COUNT 822 a 537 c 581 g 637 t ORIGIN 1 taactagaga actgagaaca gaatccctgt tagaggaatt agagaccaaa ttaaaagagg 61 tgaaggggct tgcaacccca ttagaacaac aatgccaacc aaccagagct cccagggact 121 aaaccactac ccaaagacta cacatggctc caactgcata tgtagcagag gatggcctag 181 ttgggcacaa tggaaagaga agcccttgga cctgcccagg ctggacccac cagtgtagag 241 taatgtctga ggggtagaag caggtggttg ggaatgggaa tacccttata tgtgaaggga 301 agcaggatga ggtagggaac ttatgttcgt aaaccaggaa agggaataac gtttgaaatg 361 taaataaaaa tatatccagt gaaaaaaaaa ctgaagtcta taataaaagc ttttaatcct 421 ctcagccctt aataaaagtt aattatatta cttatgttaa aaaaaacata aaacagcatg 481 gtattgtact tttttttttc agacaaaagg tctatggcac acagccaaat cagtgacctc 541 tggggcacaa tttccagaaa tcaacatcct agagttgacc tctggcttcc atgtatacgt 601 gcatgcacac acacatatgc atatacatac aaaattacat atatgcagtt gtctaaatca 661 tatgaagaat ttcaagttgt tttaagttta atatcagcaa atacatgcaa atgtgattat 721 aagaagctgg atggaatcct gagttgttga ctaaagagct aagaaggggc aattataaaa 781 caaaaatgac acatgaaatt ccacccgagg ttagaaataa ttaaagaagg ccattgcggc 841 aagtctagca cagagagtag agggtgctgg aggatgacag acggttggtt ctgaggaggg 901 attttgcaac gaatggagcg aggaagggat cgtttacact tgagaagaac tactgaacag 961 cacgtcccag agattgaggt ccaggtcaaa cgtagaagga cagcctcact gagcaaccaa 1021 gagtgtcact tgtacccacg gcattctcag caacagcaaa ttcctgtggt ggatttccag 1081 gcagaactga gacaggcgtt cttagctgag acaccaagag gtggttaaag cagtattgga 1141 acttcaaggt ggtggaagtc aacaaacaca ggacctatcc actgattgtg aaactttggt 1201 caagcttaca ctgtgttaat aaccctgcat caaaccttta tttattgccc ttccccaagt 1261 tttaaggatc ttgtaatttt agtgttgaca actgctattg tggaacagca atcatgactg 1321 tgggtaagag tagcaagatg ctgcagcata ttgactatag aatgagatgt atcctgcaag 1381 atggaagatt cttcattggc acctttaagg cttttgacaa gcatatgaat ttgatcctct 1441 gtgattgtga tgagttcagg aagatcaagc caaagaatgc aaaacagcca gaacgtgaag 1501 aaaaacgggt tttgggtctg gtcttgctac gtggagagaa cttggtttcc atgacagtgg 1561 agggtccacc tcctaaagat actggcattg ctcgtgtgcc acttgctagt gctgcaggtg 1621 gccctggtgt tggaagagca gctggcagag gagtaccagc aggtgtacct attccccaag 1681 ctcctgctgg attagcaggc cctgtccgag gagtgggagg cccatcccag caggtcatga 1741 ccccacaggg aagaggcact gttgcagctg ctgctgttgc tgctactgct agcattgcag 1801 gagccccaac ccagtacccg ccaggacggg gaactccacc tccacctgta ggcagagcaa 1861 ccccacctcc aggcattatg gctcctccac ctggaatgag accacccatg ggcccaccaa 1921 ttggacttcc ccctgctcaa gggagaccta taggcatgcc ccctccagga atgagactcc 1981 ctcctccagg aattagaggc ccacctcccc caggaatgcg tccaccaaga ccctaagata 2041 cagttgataa atctcagccc ttctctttcc ctacaatgct tcttgtgaaa ttgtgtcgcc 2101 tgcaagcttt tgacccctct tactgcatta actatagata ataaatacat agcgcaattg 2161 aattgaaaaa aaaagaaata attaaagaaa gtaagtcaca atgactattt gctattgaca 2221 ttttttttaa atgcccgaat gagagccagt ggagacgata gaaagtccag aagaagctaa 2281 gataatttca aaacacataa tgtcagtaga acgagggaag gtaagaaccc acagaacaca 2341 agaaaccact catgaaactc ctcacacaca ggaagaaaag gaagaatgta atttttaaaa 2401 aaaaagttat agtcaagtta aactatattt tctcattggt ttttttttgt gactttgtat 2461 ttatttttat gtttctttgt gtatattgta catgtctcag tcaaaggcca acggtgagtg 2521 ttttcctcta aaaaacctta ttgtttaaga cagggtctct tcctgagctc agaattc // LOCUS WUCSSP 1323 bp ds-DNA INV 17-JUL-1990 DEFINITION W.bancrofti species specific DNA fragment. ACCESSION M27140 KEYWORDS . SOURCE W.bancrofti DNA, clone IWb35. ORGANISM Wuchereria bancrofti Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 1323) AUTHORS Dissanayake,S. and Piessens,W.F. TITLE Cloning and characterization of a Wuchereria bancrofti-specific DNA sequence JOURNAL Mol. Biochem. Parasitol. 39, 147-150 (1990) STANDARD simple staff_entry BASE COUNT 399 a 204 c 251 g 469 t ORIGIN 1288 bp upstream of SacI site. 1 gatctctgtt tcattatacc gagtaaatat tggagaaaag aaaaatttgt tcaacgtgtt 61 aaagattaac ttgctttcta tataatggaa acattttgca tattggatta gtcagtaaat 121 taataatgga caattgtgat aagtaaaact aaaaagacat cgtcactctc ttccttatta 181 tagcatttcc ttgcttaaaa ccacttgcga cgtcactttt tgttataaat catatggtga 241 atacttttcc tcatttaaga tcgtttatta gcttttgcat tacaaattgt tcattttagt 301 tgtgaacgca ttttgtacat ttaaatgctt gctttagaat tttaggtttc aactggtacg 361 tttatgccgt ttatatgaaa ttatgggata acaaagaaaa ataaagataa agaagtaaaa 421 attcgaatga ttaaatgaat tattagtacc ctgattgcta tagccctttt ctacgttttg 481 gcaagaagtc ccaaattggt tctcactttt cagaatgaaa atttttagtt gtttatagcg 541 ccaaaagaaa tgattaacag cagtttggct ttgtggacgg aatgatatgc ttttctgcat 601 acctttcata aattggaaaa aacaaaataa tttggctaag agtgaatgga gtattcgttc 661 gtttgtgata ttttcaatgt ttgttgatgt atattcgaag cgtctctgct cactactgtc 721 aaaccctttt taagaacgtt gcttctacgg tcactgggca gctactacgt attgagtgag 781 cgatatgaaa agaatataca gtatctaatg actgccaatg tcaaataaat ttttgtatcg 841 tcactcagcg gtcacaaatg tttcataaat atttcacatg cattctattt taggttcaaa 901 tatgctttta aaattctgct aaatttgcaa actaacgaga ttttgtttgg cagctcttct 961 tatgataacg cagttcaatc ctggtggtga agaatttgcc acagtcttcg cattttggat 1021 gaggttcatg cgtttgtttg tgtttgtgaa atgttgattt atggtcgaat gtccgcccac 1081 aaccgggtac tttgcattca tagatgaaag gctgaccgtg tgtttcctat gtgttatata 1141 ttcgttgtaa ttgttcgtga tcaataggaa acaactggca ggatggcaga ttttaataca 1201 accatatcaa taattatatt aaatgtaaat gttctagctg ggtagagtgg cgtgcatctg 1261 tagtctcggc cacttggaag actgagctca ggaagattac ttgcacccag gagcttgagg 1321 agc // LOCUS YSCHXT2 2890 bp ds-DNA PLN 17-JUL-1990 DEFINITION S.cerevisiae high affinity hexose transporter-2 (HXT2) gene, complete cds. ACCESSION M33270 KEYWORDS high affinity hexose transporter-2. SOURCE S.cerevisiae (isogenic strain to S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2890) AUTHORS Kruckeberg,A.L. and Bisson,L.F. TITLE The HXT2 gene of Saccharomyces cerevisiae is required for high affinity glucose transport JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.L.Kruckeberg, 26-MAR-1990. FEATURES from to/span description pept 818 2443 high affinity hexose transporter-2 BASE COUNT 808 a 571 c 535 g 976 t ORIGIN 1 aaaaagaaat attattcatt actatcaaga taccgtagaa aagaaaaaga accggggatg 61 aataataaca aaacgggctg ctttttcttt ttctctttct ttttcatttg gtccctctcc 121 actctttctc cacgtggctt tgcttcccgt atttttcttc gtcagagaga ctacatgata 181 gtccaaagaa aagaaacagg ggggacgaag aagaggagag gaaaaaccaa aatataattt 241 tccgtgaaat agattctttt tctccactgc acgacttctt ctcctcccac aaaaaatgac 301 gcctcataga cagccccgca gcttcacttt taagtttctt tttctcctca cggcgcaacc 361 gctaacttaa gctaatcctt atgaatccgg agaaaagcgg ggtcttttaa ctcaataaaa 421 ttttccgaaa tcctttttcc tacgcgtttt cttcgggaac tagataggtg gctcttccac 481 ctgtttttcc atcattttag tttttcgcaa gccatgcgtg ccttttcgtt tttgcgatgg 541 cgaacgaggg ctggaaaaat taacggtacg ccgcctaacg atagtaatag gccacgcaac 601 tggcgtggac gacaacaata agtcgcccat tttttatgtt ttcaaaacct agcaaccccc 661 accaaacttg tcatcgttcc cggattcaca aatgatataa aaagcgatta caattctaca 721 ttctaaccag atttgagatt tcctctttct caattcctct tatattagat tataagaaca 781 acaaattaaa ttacaaaaag acttataaag caacataatg tctgaattcg ctactagccg 841 cgttgaaagt ggctctcaac aaacttctat ccactctact ccgatagtgc agaaattaga 901 gacggatgaa tctcctattc aaaccaaatc tgaatacact aacgctgaac tcccagcaaa 961 gccaatcgcc gcatattgga ctgttatctg tttatgtcta atgattgcat ttggtgggtt 1021 tgtctttggt tgggatactg gtaccatctc tggttttgtt aatcaaaccg atttcaaaag 1081 aagatttggt caaatgaaat ctgatggtac ctattatctt tcggacgtcc ggactggttt 1141 gatcgttggt atcttcaata ttggttgtgc ctttggtggg ttaaccttag gacgtctggg 1201 tgatatgtat ggacgtagaa ttggtttgat gtgcgtcgtt ctggtataca tcgttggtat 1261 tgtgattcaa attgcttcta gtgacaaatg gtaccaatat ttcattggta gaattatctc 1321 tggtatgggt gtcggtggta ttgctgtcct atctccaact ttgatttccg aaacagcacc 1381 aaaacacatt agaggtacct gtgtttcttt ctatcagtta atgatcactc taggtatttt 1441 cttaggttac tgtaccaact atggtactaa agactactcc aattcagttc aatggagagt 1501 gcctttgggt ttgaactttg ccttcgctat tttcatgatc gctggtatgc taatggttcc 1561 agaatctcca agattcttag tcgaaaaagg cagatacgaa gacgctaaac gttctttggc 1621 aaaatctaac aaagtcacca ttgaagatcc aagtattgtt gctgaaatgg atacaattat 1681 ggccaacgtt gaaactgaaa gattagccgg taacgcttct tggggtgagt tattctccaa 1741 caaaggtgct attttacctc gtgtgattat gggtattatg attcaatcct tacaacaatt 1801 aactggtaac aattacttct tctattatgg tactactatt ttcaacgccg tcggtatgaa 1861 agattctttc caaacttcca tcgttttagg tatagtcaac ttcgcatcca ctttcgtggc 1921 cttatacact gttgataaat ttggtcgtcg taagtgtcta ttgggtggtt ctgcttccat 1981 ggccatttgt tttgttatct tctctactgt cggtgtcaca agcttatatc caaatggtaa 2041 agatcaacca tcttccaagg ctgccggtaa cgtcatgatt gtctttacct gtttattcat 2101 tttcttcttc gctattagtt gggccccaat tgcctacgtt attgttgccg aatcctatcc 2161 tttgcgtgtc aaaaatcgtg ctatggctat tgctgttggt gccaactgga tttggggttt 2221 cttgattggt ttcttcactc ccttcattac aagtgcaatt ggattttcat acgggtatgt 2281 cttcatgggc tgtttggtat tttcattctt ctacgtgttt ttctttgtct gtgaaaccaa 2341 gggcttaaca ttagaggaag ttaatgaaat gtatgttgaa ggtgtcaaac catggaaatc 2401 tggtagctgg atctcaaaag aaaaaagagt ttccgaggaa taagagatta tacttaaact 2461 agcactgatt tttttaaggc taatggctac taatacttta atagatgatc ttcatacttt 2521 tttatttaac gatttttaat gatgttttta tttgtaccac tcatttatct agattttttt 2581 aatactgatc aaatcttacg gactcgacgt taaaaagttc ctacatacgt ctggtacttg 2641 aaacgctgct tcgaggtatt gacactataa gaatacgatc caaatactta caccgcatgt 2701 aaaaatatgc cgacaatatg aatacttgtt gatgaatgat atttgatttt aatccggcaa 2761 tttacctcct ttatataatc caataattgt tgataattag tggttaggtt gcagtactaa 2821 taagaattaa gacaaatatt cttctactat ataaaaggtg caaacaaaac acacgccgat 2881 cggccatact // LOCUS CHKARCASR 2392 bp ss-mRNA VRT 17-JUL-1990 DEFINITION Chicken 5-aminoimidazole ribonucleotide (AIR) carboxylase-5- aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR) synthetase, complete cds. ACCESSION M31764 KEYWORDS 5-aminoimidazole ribonucleotide carboxylase; 5-aminoimidazole-4-N-succinocarboxamide ribonucleotide synthetase. SOURCE Chicken liver, cDNA to mRNA, clone pZD1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2392) AUTHORS Chen,Z., Dixon,J.E. and Zalkin,H. TITLE Cloning of a chicken liver cDNA encoding 5-aminoimidazole ribonucleotide carboxylase and 5-aminoimidazole-4-n-succinocarboxamide ribonucleotide synthetase by functional complementation of Escherichia coli pur mutants JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3097-3101 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Zalkin, 01-FEB-1990. FEATURES from to/span description pept 59 1339 5-aminoimidazole ribonucleotide (AIR) carboxylase-5- aminoimidazole-4-N-succinocarboxamide ribonucleotide (SAICAR) synthetase matp 59 835 SAICAR synthetase domain matp 836 1336 AIR carboxylase domain site 2341 2347 poly-A signal BASE COUNT 696 a 466 c 527 g 702 t 1 others ORIGIN 1 ggcgctcagt ttctttcctg tgccttaggt gcctgtcgcc gccgccgctg tcgccgccat 61 ggcccccgcc gcctcagaac tgaaacttgg taaaaaagtt aatgagggta aaacgaaaga 121 agtgtatgag ctgccggata tcccgggatg cgttctgatg cagtccaagg accaaataac 181 agcggggaat gccgctagga aggaccggat ggaggggaag gctgccatct ccaacaccac 241 caccagctgc gtgttccagc tgctgcagga agcaggaatc aaaacggctt ttgtcaggaa 301 acagagtgac acagctttca tagcagctca ctgtgaaatg atcccaattg aatgggtctg 361 cagaagaatt gctactggct ctttcctcaa aagaaaccct ggtgtcaaag aaggctataa 421 gttttaccca cctaaaattg agatgtttta caaggatgat gccaataatg atccacagtg 481 gtctgaggag cagctaattg aagcaaaatt ctcttttgct ggacttacta ttggcaagac 541 tgaagtggat attatggctc gttctactca agctattttt gagatcctgg aaaaatcatg 601 gcagccccaa aactgcactc tggtggacct gaagattgaa tttggtgtta atattttgac 661 caaagaaatt gttcttgctg atgttattga taatgattca tggagactgt ggccatcggg 721 agacagaagc cagcagaagg acaaacagtc ctatcgagat ctgaaggaag tgactcctga 781 agcattgcag atggttaaga gaaactttga atgggttgca gaaagagtag agttgcttct 841 gaaaacaaag agccaaggta gagttgtggt attgatggga tctacttctg accttggcca 901 ctgtgagaaa ataaagaagg catgtgcaac ctttggaatt ccttgtgagt taagagtaac 961 ctccgctcac aaagggccag atgaaactct gaggatcaaa gcagaatatg aaggagatgg 1021 aatcccgact gtgtttgttg cagtagctgg cagaagcaat ggtttagggc cagtaatgtc 1081 tggtaacact gcttaccctg ttgtcaactg tcctcccctc tcatctgact ggggtgctca 1141 ggatgtgtgg tcctctctca gactgcccag tggtcttggc tgtcctacta ctctgtcacc 1201 tgaaggagct gctcagtttg ctgcccagat ttttggttta aacaaccact tggtatgggc 1261 caaactgcga tcaaacatgt taaatacatg gatctctttg aagcaggctg acaaaaaatt 1321 gcgggagtgc accttgtaag tccaaccaac aagtaactcc tcctcgctac acaaagatag 1381 tggcgtgcat atgcatttgt atcaggattt gcagttggat gatgtgagac atcttttctt 1441 acagaaggaa gcatcttcag agcagcagct gagtctgcta tgtcctcatt ccttttcttg 1501 tgtatgtttt tggttttttt agtatagcca cttagtaaaa gatgagaagg taatatttta 1561 gttatccttc tgttgatcca aaatagtgct gttctgctct ttagtgtata ctgttaacta 1621 ctgatgatgg ttatgtcaac acaatgattg atttggtgtg atttaaatag aaaccccctt 1681 ttttttcccc ctgtaacagt aacattcttc catagtaata atctatgtcc tgaagcttat 1741 taactcaatg ggactgctaa ctaaaactag tgctgatgta gtgctgtaat aatggtatgc 1801 tttcttttag caataaaatg gcaacatatg ctgtattaat aagaaaatag gctaagaatg 1861 agtaaaagcc tgaaagaaat aactgcctta tttaaaagaa aaacatatat tttttcatgt 1921 gtatgtgtgt atgtatatac accccatcag taactttacc acatcatttg agtgagttaa 1981 cagttctctt gtttgtgaga agctctaatt ttgaatggaa ccataactag gcttgttgcc 2041 tacttgctgt ataactcttg actggtccag tcttttgtcc ttcaaatagt agacagatct 2101 actgtgtatg tatgtgatgc tgtcaaatta tccttataca catcaatgta tctatattcc 2161 ctcactctgt ttctatggga attaaactga actgaaggtg ggaataaaca gtaaagcaac 2221 agaatacacg ggcaagccta agcatgactg tcttgntctt tgcacacaac attctttaaa 2281 gtaggtactg attccaagaa atatgcaatt actcaatgta tgtacaattt atgcttttat 2341 aattaaataa actatttaat gcaaggaatt cgatatcaag cttatcgata cc // LOCUS ECORAFRO 1289 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli raffinose repressor (rafR) gene, complete cds, and alpha-galactosidase (rafA) gene, 5' end. ACCESSION M29849 KEYWORDS alpha-galactosidase; raffinose repressor. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1289) AUTHORS Aslanidis,C. and Schmitt,R. TITLE Regulatory elements of the raffinose operon: Nucleotide sequences of operator and repressor genes JOURNAL J. Bacteriol. 172, 2178-2180 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Schmitt, 13-NOV-1989. FEATURES from to/span description pept 166 1176 raffinose repressor pept 1270 > 1289 alpha-galactosidase binding 156 161 ribosome binding site binding 1258 1263 ribosome binding site signal 120 126 -35 region signal 144 149 -10 region signal 1212 1216 -35 region signal 1235 1240 -10 region site 1196 1213 raf operator site 1217 1234 raf operator BASE COUNT 317 a 283 c 326 g 363 t ORIGIN 1 aagcttatta tgcttccatc ggaaacaatg atctggcagc ctgagttcac agataaaata 61 tctctccagg aaacccgggg ctgttcatca tgcaagtctg tcgattactg gctttggtga 121 cggaattttc tggatttccg gcttagaacc acagcaggag ataatatgtc acttaaagcg 181 attgccacga cactcggtat ttctgtcacc actgtcagtc gggctcttgg aggcttttca 241 gatgtggctg cttctacccg tgagcgcgtg gaagcggaag cacgtcgacg aggttaccgc 301 cctaatacac aggcaagaag actcaaaacc ggtaaaaccg atgctatcgg tctggtttat 361 cctgaaaatg atgtgccgtt taacagcggt gtttttatgg atatggtcag ttgcatcagc 421 agggaacttg cttatcatga tattgactta ctgctgatcg ctgatgatga gcatgcagac 481 tgccacagct atatgcggct tgttgaaagt cgcagaattg atgctcttat cattgcacat 541 actctggatg acgatccccg tatcacacat cttcataaag caggtattcc gtttctggct 601 cttggacggg taccgcaggg cttgccctgt gcgtggtttg actttgataa tcatgccgga 661 acctggcagg caacccagaa gctgattgct ttgggacata agagtattgc gctgttgagc 721 gagaacactt cacattctta tgttattgca agacgtcagg gatggcttga tgcactgcat 781 gagcatggac tgaaagatcc attgttgcgg ctggtttctc ccacgcgacg agcgggctat 841 ctggctgtga tggagttaat gtcattaccg gcgccaccaa cagctattat tactgacaat 901 gacctgagtg gagatggtgc ggctatggcg ctgcagttga gagggcgtct ttcagggaaa 961 gaagctgtat ctctggttgt atatgatggt ttgcctcagg acagcattat tgagctggat 1021 gtggctgctg ttattcagtc aacacgaagt ctcgttggtc gtcagatttc tgacatggtg 1081 tatcagataa tcaatggtgc atcaccagaa tcactgcaga taacctggac accgatattt 1141 taccctggta gcacggttca ttctccttcc ttctgatttt ttatccagat cacacaaccg 1201 aaacgttttg gttgatgttc gaaacgtttc ggatcaacag taagacatac ctgaaagcgg 1261 agatgtctaa tgatttcaaa gtactgcag // LOCUS HSEDILSTD 351 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 subfragments from near the left terminus of STD DNA. ACCESSION M34145 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone Sma2. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 351) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_review BASE COUNT 55 a 117 c 127 g 52 t ORIGIN 1 cgccgctcac gtcacactcc tccccaaaga gggtattcgg ggggtcgctc ggcggctcca 61 ggaaccgctg ctcgtaccgg tcctcggagt tcagtgcccg cagcgtggcg cgcagcggga 121 agctctgctc gggcagcagc tgcaggaggg actgccagca ctcgagtgtg gggcgcgtgt 181 gcgtgtaccc catggcgtaa aaatccagca gcgccctgcg cagggtgcgc gagtcctcgg 241 tcacgtggat cgaggaggtg gatatccccc tgaacaagcg gttgacgtcc cccgcgaggc 301 ggttcacatc ggggcgccac ggcggggcgc agaacgcccc tggcccccgg g // LOCUS HSEDISA 633 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 defective interfering particle DNA. ACCESSION M34143 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone pHSII. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 633) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_entry BASE COUNT 124 a 184 c 202 g 123 t ORIGIN 1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac 61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa 121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt 181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa 241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc acgtcacact 301 cctccccaaa gagggtattc ggggggtcgc tcggcggctc caggaaccgc tgctcgtacc 361 ggtcctcgga gttcagtgcc cgcagcgtgg cgcgcagcgg gaagctctgc tcgggcagca 421 gctgcaggag ggactgccag cactcgagtg tggggcgcgt gtgcgtgtac cccatggcgt 481 aaaaatccag cagcgccctg cgcagggtgc gcgagtcctc ggtcacgtgg atcgaggagg 541 tggatatccc cctgaacaag cggttgacgt cccccgcgag gcggttcaca tcggggcgcc 601 acggcggggc gcagaacgcc cctggccccc ggg // LOCUS HSEDISTD 290 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 subfragments from the inverted repeat of STD DNA. ACCESSION M34144 KEYWORDS defective interfering particle. SOURCE Equine herpesvirus type 1 (strain Kentucky A) DNA, clone xbaG.. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 290) AUTHORS Yalamanchili,R.R., Raengsakulrach,B., Baumann,R.P. and O'Callaghan,D.J. TITLE Identification of the site of recombination in the generation of the genome of DI particles of Equine herpesvirus type 1 JOURNAL Virology 175, 448-455 (1990) STANDARD simple staff_review BASE COUNT 69 a 72 c 77 g 72 t ORIGIN 1 ttgtggaata attaaactgt atttattgat gagtaacaca aaacaagttt cctgggaaac 61 acactccaca gtttttttaa aagatttggt tacagtaaaa gtatttgccg tgcaggtaaa 121 ccggaacgag ggtgtaggcc gatacaaggc tgcaggtatc tgccttgcat cgccgcttgt 181 gcgcgtctat cgcctcgagg gttcccgcca gacaggctcc aggtacgtag tcggctagaa 241 cgcgcccgtc gggtcccagt gcgtccctgg acacagtttc ggcgccgctc // LOCUS HUMAR 1230 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) mRNA, complete cds. ACCESSION M30704 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SOURCE Human cell line MCF-7 breast carcinoma, cDNA to mRNA, clones lambda-AR1 and lambda-AR2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1230) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept 210 968 amphiregulin precursor /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" sigp 210 266 amphiregulin signal peptide matp 510 962 amphiregulin (alt.) matp 528 962 amphiregulin (alt.) mRNA 1 1230 amphiregulin mRNA BASE COUNT 375 a 273 c 285 g 297 t ORIGIN 1 agacgttcgc acacctgggt gccagcgccc cagaggtccc gggacagccc gaggcgccgc 61 gcccgccgcc ccgagctccc caagccttcg agagcggcgc acactcccgg tctccactcg 121 ctcttccaac acccgctcgt tttgcggcag ctcgtgtccc agagaccgag ttgccccaga 181 gaccgagacg ccgccgctgc gaaggaccaa tgagagcccc gctgctaccg ccggcgccgg 241 tggtgctgtc gctcttgata ctcggctcag gccattatgc tgctggattg gacctcaatg 301 acacctactc tgggaagcgt gaaccatttt ctggggacca cagtgctgat ggatttgagg 361 ttacctcaag aagtgagatg tcttcaggga gtgagatttc ccctgtgagt gaaatgcctt 421 ctagtagtga accgtcctcg ggagccgact atgactactc agaagagtat gataacgaac 481 cacaaatacc tggctatatt gtcgatgatt cagtcagagt tgaacaggta gttaagcccc 541 cccaaaacaa gacggaaagt gaaaatactt cagataaacc caaaagaaag aaaaagggag 601 gcaaaaatgg aaaaaataga agaaacagaa agaagaaaaa tccatgtaat gcagaatttc 661 aaaatttctg cattcacgga gaatgcaaat atatagagca cctggaagca gtaacatgca 721 aatgtcagca agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca 781 tgattgacag tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg 841 tgatcctcac agctgttgct gttattacag tccagcttag aagacaatac gtcaggaaat 901 atgaaggaga agctgaggaa cgaaagaaac ttcgacaaga gaatggaaat gtacatgcta 961 tagcataact gaagataaaa ttacaggata tcacattgga gtcactgcca agtcatagcc 1021 ataaatgatg agtcggtcct ctttccagtg gatcataaga caatggaccc tttttgttat 1081 gatggtttta aactttcaat tgtcactttt tatgctattt ctgtatataa aggtgcacga 1141 aggtaaaaag tattttttca agttgtaaat aatttattta atatttaatg gaagtgtatt 1201 tattttacag ctcattaaac ttttttaacc // LOCUS HUMAR1 968 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 1. ACCESSION M30698 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 1 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 968) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept 859 + 919 amphiregulin precursor, exon 1 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" sigp 859 915 amphiregulin signal peptide pre-msg 649 > 968 amphiregulin mRNA and introns IVS 920 > 968 amphiregulin intron A BASE COUNT 206 a 280 c 257 g 225 t ORIGIN 1 gaattcatat ccacctggct ttgaacatta tcggctgtga gatggtgtag gtaaaatttt 61 aagtgcataa tttggcaata ataaatcatc aataaatatt aatgttgatg aggcccctgg 121 gccacataaa gaaataggga gtgaggggat ttgaaattct ggccacttca cagaaatggg 181 tgggaagggg ctcttgattg agatagaagc ccatcctaca tgaagcaatt cctcattgag 241 ttctctcgtc ctttatcctt gttggaaaca tcaggcaaag tcactcttgg tcttaaagta 301 cttttacatc taaatacgga actcttctat ttaatccctg tctgttgtag atgttaagta 361 tacaaagagg ttgtcagagt ttgaaacatc tggacttctg tcaggtacta gctccggaac 421 tccagtcctg ctcgccctca aaaacggctt gcagctagag gtttaagttc cacttcctct 481 cagcgaatcc ttacgcacga gggaggcggg gcgtgtgtcc tccgcgcgtg gttttcgggt 541 agcaccttct ggggcgccgc ctgcctccac ccacggccgg gccttgacgt catgggctgc 601 ggccccctcc cggctgagcc tataaagcgg caggtgcgcg ccgccctaca gacgttcgca 661 cacctgggtg ccagcgcccc agaggtcccg ggacagcccg aggcgccgcg cccgccgccc 721 cgagctcccc aagccttcga gagcggcgca cactcccggt ctccactcgc tcttccaaca 781 cccgctcgtt ttgcggcagc tcgtgtccca gagaccgagt tgccccagag accgagacgc 841 cgccgctgcg aaggaccaat gagagccccg ctgctaccgc cggcgccggt ggtgctgtcg 901 ctcttgatac tcggctcagg tgaggattca acggcgctga actgctgggc tctcctccca 961 tggcaggt // LOCUS HUMAR2 348 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 2. ACCESSION M30699 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 2 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 348) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 51 + 299 amphiregulin precursor, exon 2 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp 290 + 299 amphiregulin pre-msg < 1 > 348 amphiregulin mRNA and introns IVS < 1 50 amphiregulin intron A IVS 300 > 348 amphiregulin intron B BASE COUNT 88 a 75 c 86 g 99 t ORIGIN 1 agcaccctac tttacctttt cgttttcttc ctttattccc tcccctgcag gccattatgc 61 tgctggattg gacctcaatg acacctactc tgggaagcgt gaaccatttt ctggggacca 121 cagtgctgat ggatttgagg ttacctcaag aagtgagatg tcttcaggga gtgagatttc 181 ccctgtgagt gaaatgcctt ctagtagtga accgtcctcg ggagccgact atgactactc 241 agaagagtat gataacgaac cacaaatacc tggctatatt gtcgatgatt cagtcagagg 301 tgagtagggg ataaagcaaa aatatggcct gtgagatgtg ggtttata // LOCUS HUMAR3 302 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 3. ACCESSION M30700 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 3 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 302) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 52 + 252 amphiregulin precursor, exon 3 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp + 52 + 252 amphiregulin matp 60 + 252 amphiregulin (alt.) pre-msg < 1 > 302 amphiregulin mRNA and introns IVS < 1 51 amphiregulin intron B IVS 253 > 302 amphiregulin intron C BASE COUNT 125 a 46 c 54 g 77 t ORIGIN 1 aattatattc aagtttgaga gactcttgtc aataaatctt ttctttttta gttgaacagg 61 tagttaagcc cccccaaaac aagacggaaa gtgaaaatac ttcagataaa cccaaaagaa 121 agaaaaaggg aggcaaaaat ggaaaaaata gaagaaacag aaagaagaaa aatccatgta 181 atgcagaatt tcaaaatttc tgcattcacg gagaatgcaa atatatagag cacctggaag 241 cagtaacatg cagtaagttt tcctaaagca tatagatttt tgtatttcta gcaccatgtc 301 tg // LOCUS HUMAR4 254 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 4. ACCESSION M30701 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 4 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 254) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 51 + 204 amphiregulin precursor, exon 4 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" matp + 51 91 amphiregulin (both alt.) pre-msg < 1 > 254 amphiregulin mRNA and introns IVS < 1 50 amphiregulin intron C IVS 205 > 254 amphiregulin intron D BASE COUNT 81 a 42 c 53 g 78 t ORIGIN 1 cacaccgcac gtgagtgtga ttataatttt taaatgtgaa ttgcttgcag aatgtcagca 61 agaatatttc ggtgaacggt gtggggaaaa gtccatgaaa actcacagca tgattgacag 121 tagtttatca aaaattgcat tagcagccat agctgccttt atgtctgctg tgatcctcac 181 agctgttgct gttattacag tccagtaagt atgacataac ttacaaattc ttaataaaat 241 aatgggaggt taat // LOCUS HUMAR5 217 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, exon 5. ACCESSION M30702 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 5 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 217) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pept + 50 143 amphiregulin precursor, exon 5 /hgml_locus_uid="LW0168B" /nomgen="AREG" /map="4q13-q21" pre-msg < 1 > 217 amphiregulin mRNA and introns IVS < 1 49 amphiregulin intron D BASE COUNT 89 a 26 c 41 g 61 t ORIGIN 1 tatagatgaa tagaaccttg ataacattag aatgccttgt tctctgaagg cttagaagac 61 aatacgtcag gaaatatgaa ggagaagctg aggaacgaaa gaaacttcga caagagaatg 121 gaaatgtaca tgctatagca taactgaaga taaaattaca ggtttgagtt ttaaaatata 181 tctttagatc atatcctata attttgaaaa atttaac // LOCUS HUMAR6 354 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human amphiregulin (AR) gene, 3' flanking region. ACCESSION M30703 KEYWORDS amphiregulin; growth regulator; tumor inhibitory factor. SEGMENT 6 of 6 SOURCE Human breast carcinoma cell line MCF-7 DNA, clones lambda-ARH[6,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 354) AUTHORS Plowman,G.D., Green,J.M., McDonald,V.L., Neubauer,M.G., Disteche,C.M., Todaro,G.J. and Shoyab,M. TITLE The amphiregulin gene encodes a novel epidermal growth factor- related protein with tumor-inhibitory activity JOURNAL Mol. Cell. Biol. 10, 1969-1981 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 18-DEC-1989. FEATURES from to/span description pre-msg < 1 294 amphiregulon mRNA and intron BASE COUNT 104 a 48 c 52 g 150 t ORIGIN 1 gtaacatttt gttttatttt attattttat tttattttat tttctcacag gatatcacat 61 tggagtcact gccaagtcat agccataaat gatgagtcgg tcctctttcc agtggatcat 121 aagacaatgg accctttttg ttatgatggt tttaaacttt caattgtcac tttttatgct 181 atttctgtat ataaaggtgc acgaaggtaa aaagtatttt ttcaagttgt aaataattta 241 tttaatattt aatggaagtg tatttatttt acagctcatt aaactttttt aaccaaacaa 301 attgagagtt tgaatattag ttctgatatt gcaagactcc agtgtacttt tctc // LOCUS HUMROSSAA 1890 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human Ro ribonucleoprotein autoantigen (Ro/SS-A), complete cds. ACCESSION M32294 KEYWORDS Ro ribonucleoprotein autoantigen. SOURCE Human T-cell line Jurkat, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1890) AUTHORS McCauliffe,D.P., Lux,F.A., Lieu,T.-S., Sanz,I., Hanke,J., Newkirk,M.M., Bachinski,L.L., Itoh,Y., Siciliano,M.J., Reichlin,M., Sontheimer,R.D. and Capra,J.D. TITLE Molecular cloning, expression and Chromosome 19 localization of a human Ro/SS-A auto antigen JOURNAL J. Clin. Invest. 85, 1379-1391 (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.P.McCauliffe, 23-FEB-1990. FEATURES from to/span description pept 67 1320 Ro ribonucleoprotein autoantigen (Ro/SS-A) precursor sigp 67 117 Ro ribonucleoprotein autoantigen signal peptide matp 118 1317 Ro ribonucleoprotein autoantigen site 1881 1886 poly-A signal (put.) BASE COUNT 464 a 497 c 520 g 409 t ORIGIN Chromosome 19. 1 ccgtactgca gagccgctgc cggagggtcg ttttaaaggg ccgcgttgcc gccccctcgg 61 cccgccatgc tgctatccgt gccgctgctg ctcggcctcc tcggcctggc cgtcgccgag 121 cccgccgtct acttcaagga gcagtttctg gacggagacg ggtggacttc ccgctggatc 181 gaatccaaac acaagtcaga ttttggcaaa ttcgttctca gttccggcaa gttctacggt 241 gacgaggaga aagataaagg tttgcagaca agccaggatg cacgctttta tgctctgtcg 301 gccagtttcg agcctttcag caacaaaggc cagacgctgg tggtgcagtt cacggtgaaa 361 catgagcaga acatcgactg tgggggcggc tatgtgaagc tgtttcctaa tagtttggac 421 cagacagaca tgcacggaga ctcagaatac aacatcatgt ttggtcccga catctgtggc 481 cctggcacca agaaggttca tgtcatcttc aactacaagg gcaagaacgt gctgatcaac 541 aaggacatcc gttgcaagga tgatgagttt acacacctgt acacactgat tgtgcggcca 601 gacaacacct atgaggtgaa gattgacaac agccaggtgg agtccggctc cttggaagac 661 gattgggact tcctgccacc caagaagata aaggatcctg atgcttcaaa accggaagac 721 tgggatgagc gggccaagat cgatgatccc acagactcca agcctgagga ctgggacaag 781 cccgagcata tccctgaccc tgatgctaag aagcccgagg actgggatga agagatggac 841 ggagagtggg aacccccagt gattcagaac cctgagtaca agggtgagtg gaagccccgg 901 cagatcgaca acccagatta caagggcact tggatccacc cagaaattga caaccccgag 961 tattctcccg atcccagtat ctatgcctat gataactttg gcgtgctggg cctggacctc 1021 tggcaggtca agtctggcac catctttgac aacttcctca tcaccaacga tgaggcatac 1081 gctgaggagt ttggcaacga gacgtggggc gtaacaaagg cagcagagaa acaaatgaag 1141 gacaaacagg acgaggagca gaggcttaag gaggaggaag aagacaagaa acgcaaagag 1201 gaggaggagg cagaggacaa ggaggatgat gaggacaaag atgaggatga ggaggatgag 1261 gaggacaagg aggaagatga ggaggaagat gtccccggcc aggccaagga cgagctgtag 1321 agaggcctgc ctccagggct ggactgaggc ctgagcgctc ctgccgcaga gcttgccgcg 1381 ccaaataatg tctctgtgag actcgagaac tttcattttt ttccaggctg gttcggattt 1441 ggggtggatt ttggttttgt tcccctcctc cactctcccc caccccctcc ccgccctttt 1501 tttttttttt tttaaactgg tattttatcc tttgattctc cttcagccct cacccctggt 1561 tctcatcttt cttgatcaac atcttttctt gcctctgtgc cccttctctc atctcttagc 1621 tcccctccaa cctggggggc agtggtgtgg agaagccaca ggcctgagat ttcatctgct 1681 ctccttcctg gagcccagag gagggcagca gaagggggtg gtgtctccaa ccccccagca 1741 ctgaggaaga acggggctct tctcatttca cccctccctt tctcccctgc ccccaggact 1801 gggccacttc tgggtggggc agtgggtccc agattggctc acactgagaa tgtaagaact 1861 acaaacaaaa tttctattaa attaaatttt // LOCUS PRD1XII 1701 bp ds-DNA PHG 17-JUL-1990 DEFINITION Bacteriophage PRD1 early protein genes XII and XIX, complete cds. ACCESSION M30146 KEYWORDS early protein XII; early protein XIX. SOURCE Bacteriophage PRD1 (Viradea strain) DNA. ORGANISM Bacteriophage PRD1 Viridae; ds-DNA nonenveloped viruses; Tectiviridae. REFERENCE 1 (bases 1 to 1701) AUTHORS Gerendasy,D. and Ito,J. TITLE Nucleotide sequence and transcription of the right early region of bacteriophage PRD1 JOURNAL J. Bacteriol. 172, 1889-1898 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Gerendasy, 27-NOV-1989. FEATURES from to/span description pept 242 724 early protein XII pept 797 955 early protein XIX (put.) mRNA 179 732 early protein XII mRNA mRNA 181 1100 early protein XIX mRNA binding 230 233 ribosome binding site binding 785 788 ribosome binding site rpt 1 111 inverted terminal repeat site 145 172 right early promoter BASE COUNT 419 a 447 c 390 g 445 t ORIGIN 1 gggggatacg tgcccctccc cacctacccg cgcccctaac atttttattt ccgtctgtca 61 ataccccctg catccgatag gcccgaacta tcacaaacgg aaaagcgata ggaataaaat 121 tttgatttat tttcaccata actattgcac gtcctaaaaa tgtgcgtata atttcaatca 181 tcgaaagggc aaacacaaaa cccccgccga taacttccac ttaactttaa aggtaactat 241 catggaaatc gtaagcaagc tgactctgaa aaccattggc gctcaaccca agccacatag 301 cgtaaaagaa aataccgcgc tggcttccat ctatggccgc gttcgcggta agaaagttgg 361 tcaatccacc tttggcgact tcatcaagtt tgaaggtgaa tttgaaggcg tgaatatcgc 421 cactggtgaa gtgttccgtt ccggtgcgct gattctgccg aaggtactgg aaagcctgct 481 tgccggtgcc gtggatggtg aaaacacggt tgattttgcg gttgaaattt gggccaagcc 541 ttccgaaaag ggcaacactg gttatgaata cggtgtcaag ccgctgattg aacccgccgc 601 atcggatgaa ctggccgcgc ttcgcaatca ggttaaggcc gcgctgcctg cccctgccgc 661 tgccggtgaa gccgctgccg aagccaagcc cgccgccaag gccaaggcca aggccgaagc 721 ctaaacagcg caccacggcc ccggctgata ccagcgcggg gccattcttt acccttcaaa 781 ttcaaaggta acaaacatgg aaaagcaaac cgaaaatacc cgccctgaat gcccgaaggc 841 attttatttt gtatccattc ccggcgactt tgggcaaacc ccttttgctt catcgcttat 901 gtatggtagc accgcgcttg ccgctgttat caagtcaagg gcgctatccg cgtagtaagc 961 gaacattttg atttgcggtt tgctgataac ggttttacgc ctgctggcgt tacgcaagct 1021 gaatggcttg gcaagctgat tactgaaact ttcggctttc gcttggaact gtttctttaa 1081 accagttttt tatttcttga atttgaaggc cgcatgattc ccttgcggcc ttttcttttg 1141 cgtatgcttc caccaaatcc cgccacgtta taaacggata aacaggcgcg gcgcattcag 1201 ttagcaggct tgacggcgga accaaataaa ccgtttcaat tttcggggct gttgtcgaac 1261 aagccgcgaa catcggcagg cataggacta tcaagaattt gtttagtttc atcgcttacc 1321 cttcccaatc cgctaattgt tcggctaagt ttgccgctga taattgtctg gccttcctga 1381 attgccgcga ttgctttcgc atcgcttgcg cgtactgatt gaagggtttc caccgccttg 1441 gcgtttgcgg tcgccttttc ttcaatggct ttaatttcat tttgaagccc cgttattttt 1501 ttagccgcta aaattgaata tccaaagtta gcaagcatta aaagcgcaac cgcaatagcg 1561 gccacgcttt tcaggttaga caaaacccat ttaatcattg aaggtacgcc ccggcttgtc 1621 attgtcggaa tctatcggcc ccggctgatt gataggggca atcttgcttt gcacaaattg 1681 cgagaaaacc gccttgacaa a // LOCUS PSEALALPQ 3141 bp ds-DNA BCT 17-JUL-1990 DEFINITION P.aeruginosa exopolysaccharide alginate regulatory protein (algP and algQ) genes, complete cds. ACCESSION M32077 KEYWORDS alginate regulatory protein P; alginate regulatory protein Q. SOURCE P.aeruginosa (strain PAO, isolate PA02003) DNA, from patient PA02003. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 3141) AUTHORS Konyecsni,W.M. and Deretic,V. TITLE DNA sequence and expression analysis of algP and algQ, components of the multigene system transcriptionally regulating mucoidy in Pseudomonas aeruginosa: algP contains multiple direct repeats JOURNAL J. Bacteriol. 172, 2511-2520 (1990) STANDARD full staff_review COMMENT draft entry and computer-readable sequence [1] kindly submitted by V.Deretic, 16-FEB-1990. FEATURES from to/span description pept 781 1263 alginate regulatory protein Q pept 2079 3137 alginate regulatory protein P (put.) mRNA 661 1310 algQ mRNA mRNA 2054 3141 algP mRNA BASE COUNT 607 a 1026 c 1009 g 499 t ORIGIN 1 bp upstream of BamHI site; map position 9 min. 1 ggatccgcca tgcccctggc cagcccccgt cagcttttcc ttctcgcgtt cctggcctgc 61 gtcgccatca tgggcggggc gctgtacctg gaacatgtgg ttggcctgga ggcctgcccg 121 ctgtgcgtcg tgcagcggat cttcttcatc ctgatcggcc tgacctgcct tgctggcgcg 181 atccaggggc ccggcctgcg tgggcggcgt atctactccg tgctggtgtt cctgctcgct 241 ctcggcggcg gggccacggc cgcccgccag gtatggttgc agaccgttcc gctggaccaa 301 ctgccggcct gcctgcccag cctcgactac atgatgcagg cgcttccctt ccaggaagtg 361 atccgcctgg tcctgcatgg caccgcggat tgtgcccagg tgagctggac gctattcacc 421 ctgagcattc ccgaatggag cctgctggcg ttcgttgcct atctcggctt ctccatcgtg 481 cagttcctcc gacgtgcctg atgcgggctg tcgcgtggtc tttcgtcagc tctcgagcgt 541 cggccgtatc gatttggcga aaaccagagc ggatgcggga taccgatcaa acatccgtat 601 gaatttttct gtacgggcgt cttgatgggg tcatcccggg ggcataatct ggctttccat 661 agcgccggag tccgcccctt gcacaggcgt tctcccacag agcgcagaga gtgcgtcagc 721 aatgcgtctc ggagcggacg atgattctcc gggacgtccg aagagggaag cgaggacaac 781 atgctcgaaa gctgccgtaa tgcccaagag cggtgggggg gtgtgcacca actgatcgat 841 cgctggctgc atgagcgcca gcaactggtg caggccttcg atgccctgag cggtattcag 901 gcaccggcgc ccaacgccga agaactccag catttctgcc aattgctgct ggactatgta 961 tctgccggac acttcgaggt ctacgagcaa ctgacggcgg aaggcaaggc cttcggcgat 1021 cagcgcggcc tggagctggc caagcagatc ttcccccggc tggaagccat caccgaatcc 1081 gcgctgaact tcaacgaccg ctgcgacaac ggcgattgcc gtgaaggagc ctgcctcatc 1141 gcggagctga aggtcctgcg gcaacagttg cacgaacgct tcgagctgga ggactgcctc 1201 atcgaagtcc tgcacaacgc ccacagccag agcggcgcag agggttcggc agtaccggtc 1261 tgattcaccg gcgagtcgtt acgaaacggc gcgtctccat gcgggagagc gccgtttttg 1321 catttcagcg gaagcccagc aggtcgatct cgaagaccag cggagcgtcg ggcgggatca 1381 ggtcgccagc gccttcgtga ccataggctt gcgccgaagg aatcaccact cgccagcgcg 1441 cgccgacagg catcgctcgc aacgctgtcc gccatccctc tatcacactg tccagggcga 1501 accactcggc gctttcgctc tggtcgaaca cttgaccgtc ggcaagcagg ccgcggtaac 1561 ggacgtgtac ctgggttgcc gcgccgatgc cgttaccttg gccacgacgt aattcgctga 1621 cgaggacgcc gccggtcaat tcgcggacgc cgaagcgggc tttttcgttg gccaggaagc 1681 gggcctcggc gggtgtggtc cgcggtgttt ccgtcgtggc gttctcgtgt tgcagcagaa 1741 gttgttcgat gcgttcgggg gggatttcaa gtgcctcgcc gcgataggcc tggcgcagtc 1801 cgagaagtag ctcggatagc tctaggccgg gcatttcctg ctgtaggcgc atacccaggc 1861 gggcgcctac ggcataggcc agttcgtcct tgggaggcgc ggcgaaaagc ggcagcgcag 1921 tagaaaaaga ccgatgaaaa ggcggcgcgg catgcggttt ctccaaacga atcggcgggc 1981 aatcctggcg cacgtcgctt gtagcacatc gaacccgttg gcgagagggg gtgtgcgggt 2041 ctagtatggg cgcaaccacg tccgcctgga ggcacgtcat gtcggccaac aagaagcccg 2101 tcaccacccc cttgcacctg ttgcagcaac tttcccacag ccttgtcgag cacctggaag 2161 atgcgtgcaa acaagcgctg gtcgattcgg aaaagctcct ggccaaactt gaaaagcaac 2221 gtggcaaagc ccaggaaaag ctgcacaagg ctcgcaccaa gctgcaggat gctgccaagg 2281 ccggcaagac caaggcacag gccaaggcgc gcgagaccat cagcgacctg gaagaggcgt 2341 tggataccct gaaggcccgg caggcggaca cccgtaccta catcgtcggc ctcaagcgtg 2401 acgtacagga aagcctcaag ctggcgcagg gtgtcggcaa ggtgaaggaa gctgctggca 2461 aggctctgga gagccgcaag gcgaaacccg cgaccaaacc tgctgcgaag gcggcagcca 2521 agcctgcggt gaaaaccgta gcggcgaagc ctgcggccaa gccggctgcg aagcctgctg 2581 cgaaaccggc ggccaagcct gcgacgaaaa ccgcggcagg caagcccgca gccaagccga 2641 cggcgaagcc tgctgcgaaa ccggcggcca agcccgcggc gaaaaccgca gccgcgaagc 2701 ccgcagccaa gccggcggcg aagcctgtgg cgaaaccggc ggccaagcct gcggcgaaaa 2761 ccgcagccgc gaagcccgcc gccaagccgg cagcgaagcc tgtcgcgaaa ccgacggcca 2821 agcccgcggc gaaaaccgca gccgcgaagc ccgcagccaa gccagctgcg aagcctgcgg 2881 cgaaaccggc ggccaagcct gtggcgaaat ccgcggccgc gaagcctgca gccaagccgg 2941 ctgcgaagcc tgcggcgaaa ccggcggcca agcctgcggc gaaacccgta gccgcgaaac 3001 ctgccgcaac caagcccgcc accgctcctg ctgcgaagcc tgcggcgact cccagcgccc 3061 cggcagccgc ctccagcgct gcttcggcaa cgcctgccgc gggcagcaac ggcgccgccc 3121 cgaccagcgc ctcctaaggc g // LOCUS STYAROF 1235 bp ds-DNA BCT 17-JUL-1990 DEFINITION S.typhimurium tyrosine sensitive 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase (aroF) gene, complete cds. ACCESSION M31302 KEYWORDS 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase. SOURCE S.typhimurium (strain LT2) DNA, clone pGM59. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1235) AUTHORS Muday,G.K. and Herrmann,K.M. JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 165 to 1235) AUTHORS Muday,G.K. and Herrmann,K.M. TITLE Regulation of the Salmonella typhimurium aroF gene in Escherichia coli JOURNAL J. Bacteriol. 172, 2259-2266 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.M.Herrmann, 13-JAN-1990. Author address: K.M.Herrmann Purdue University Biochemistry department West Lafayette IN, 47907 (317) 494-1653 FEATURES from to/span description pept 165 1235 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase (E.C. 4.1.2.15) site 1 20 operator aroFo2 site 52 71 operator aroFo1 site 69 113 promoter site 75 94 TyrR box-1 BASE COUNT 309 a 286 c 343 g 297 t ORIGIN 1 gtgtaaagtt tttgatacga aatgagggag attaatcgac gattgcgcat ggtgtaaagt 61 ttattttacg aaatatggat tgaaatcttt actttttgtg gtatggttta aacatcctcg 121 tggaggatca actatcgcaa acaagaaaag acaggatcgc catcatgcaa aaagacgcgc 181 tgaataacgt acgtatcacc gatgaacagg tattaatgac gccggagcag cttaaagcgg 241 cctttccctt gagcctggcg caggaagccc agatagcgca gtcccgggga atcatttctg 301 acattattgc cgggcgcgat ccgcgtctct tggtggtatg cggtccttgt tctattcacg 361 atcctgaaac cgctctggaa tatgcccgtc gatttaaagc ccttgccgca gaggtcagcg 421 atagcctcta tctggtaatg cgcgtctatt ttgaaaagcc gcgaactacc gtcggctgga 481 aagggctgat taacgatcct cacatggatg gctcatttga tgtggaagcc gggttgaaaa 541 tagcgcgtca gctactggtg gaactggtga atatggggtt gccattggcg accgaagcgt 601 tggatccgaa cagcccgcaa tacctgggcg atctgtttag ctggtcggcg ataggggcgc 661 gcacaaccga atcgcaaacc caccgcgaaa tggcgtctgg tctttctatg ccggtcgcgt 721 ttaaaaacgg cacggatggc agcctggcga cagcgattaa cgccatgcgc gccgctgcgc 781 aacctcatcg ttttgttggc attaaccagg ccggtcaggt tgcgttattg caaacccagg 841 gaaatccgca tggccatgtg attctgcgtg gcggcaaagc gccgaactat agcccggcag 901 atgtcgctca gtgtgaaaaa gagatggaac aggcgggact acgtccttcg ctgatggtag 961 attgcagtca tggtaactcc aataaagatt atcgccgcca gccagccgtt gccgaatctg 1021 tggttgcgca gattaaagat ggcaatcgtt caatcattgg cttaatgatt gaaagtaata 1081 ttcatgaggg taaccagtct tccgaacagc cgcgcagcga aatgaagtat ggcgtttccg 1141 tcaccgatgc ttgtattagc tgggagatga ccgatgccct gttacgtgaa attcataaag 1201 atttgagcgg ccagctggcg gtgcgcgtcg cataa // LOCUS TRBANTAT 18598 bp ds-DNA INV 17-JUL-1990 DEFINITION T.brucei expression site variant specific antigen 4 - 7 genes, complete cds. ACCESSION M33720 KEYWORDS expression site variant specific antigen. SOURCE T.brucei (strain EATRO 1125), cDNA to mRNA, clone AnTat 1.3A. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 18598) AUTHORS Pays,E., Tebabi,P., Pays,A., Coquelet,H., Revelard,P., Salmon,D. and Steinert,M. TITLE The genes and transcripts of an antigen gene expression site from T.brucei JOURNAL Cell 57, 835-845 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Pays, 20-JAN-1989. FEATURES from to/span description pept 1174 2196 expression site variant specific antigen 7 pept 3528 4733 expression site variant specific antigen 6 pept 8519 9772 expression site variant specific antigen 5 pept 14450 18157 expression site variant specific antigen 4 mRNA 986 2676 ESAG-7 mRNA mRNA 3483 5056 ESAG-6 mRNA mRNA 8260 11304 ESAG-5 mRNA trns 5305 5808 rime retroposon BASE COUNT 5298 a 3468 c 4707 g 5125 t ORIGIN 1 gcatgcgaag agcaggggtg caacggaaga gtgaaaggat tagaggaaat agtggaatta 61 gcatgattgg agattgtgag ggttaggaaa aacgtgacga gggaaatggt aactggacat 121 cgtgcgcgag gatggcgaag gcttgaagcg tgtagagagc gtgtgtaaat ggcccaaacg 181 gtgtgggtgg ggtaataacg cagagtgctg aggtgtttga caagttcccg cgaggtgctg 241 gggaggagga agcaagacag taaaatcgtc gtgagacgtg gctgtgctcg ggaatccgta 301 caaatatact gaaaattatt gatggttgat atgcgtggga acgacgcagg atgatgctgc 361 atcgccaaga tgcttcggga gaacaggaaa ataacgacga atcaacactg gtgagttgat 421 actataaaaa tcgcgtgggc tttgctgctt cgctttcata aatgccacct gcaatgaatt 481 tggagggtgg aggtgagctg ccctgtgtat gggtgggtat tgcagatgca gcggcgcttt 541 tgggtgattt ttggtttaca gtttgctgtt gttgggaagg ttggtggtgg caacaaaatt 601 ggagaaaatg atttcatctg gaaacctact tcaaaacccc ataaacaatc ggacctcgaa 661 gtgcaagtgt ctcccagaac gtaaacaagg ccttgaagac gaccagcaga tgaagcacag 721 caacagcgac agaggtggac tgagacgaaa gtataccata tgactatgcc acaagaattg 781 atgggtgatt ctgccaacca cggacgcagt caaaaacata gaaggagctg tagtgaatat 841 aaaaccactc atattttcta caaatattag agctttatta cttaagaacc tgttttagca 901 ccaaggttgc gtgtgtattg ccgttgctgt tacttattgt tgagttggta tttgaggaaa 961 atgttttttc aggtgcgtcg ttgttatata cggctgtgag ttggtgatgt ttatttgaat 1021 gatagcttga taggataaga tgaaataata ctaagcatgt attcttctgg cgtgtcgccg 1081 aaatggtaat ctatgttacc catagcacga aaaaattttt ttactcagta acgaagcgga 1141 aagttttaaa aagcataatt taaaaaagtg ataatgatga agttttggtt tgtgctgttg 1201 gctcttttgg gaaaagaaac acatgcatat tatgaaaata aaaggaatgc attaaatgca 1261 accgccgcta ataaagtgtg tgggctatcg acctatctta aaggaatagc gcacagagta 1321 aacagcgaga gtgccgtggt tacggaaaaa ctatcagatt tgaaaatgag aagcatccag 1381 ttgcagttat cagtaatgcg gaacagagtg ccttctggcg agcaggattg taaagacatc 1441 aggacactct tgaaaacagt attgaggaat gagtttacat tccagcagga gttggaggaa 1501 atgaggaacg catccgcgtt agcagcagct gcagctggga tagcagctgg aagactggaa 1561 gagtggattt ttgtatttgc acaggcagca ggcgggtcat cacagttttg cataagtgtg 1621 gggaagcata tcccggctga gcacggcaac ttgcaagaat gttttgatgg aataattgga 1681 cctgaaaccc tttacaaaat tgaggattca cgtgtaaaag agtcggcgca gaaaagcttg 1741 caactccatg aagtgttatc atccatttcc ttcagtagct tgggtgcgga aagtattgtt 1801 gagcaaggag agaacagagg atgtaaccta atgcggacgg ctgacggagg tttgctgaaa 1861 gatatttgtt tgaattgcaa cttcacatgg ggtggtggag tgatgaattt cggatcctgt 1921 gtggcgggga acctaaaaat aaaaggggga gaatacggtg atgtcagttc ccacgatgtg 1981 gtacggtgga ccgaagatcc cagtaaagtg agcatattta aggatgtcat ccgcctgttt 2041 gcgcgcttcc aagaagcaaa aaatgcagtg atgaataaaa taaaaactac tgtggatgaa 2101 ttggcgaaat gcatagggca gaaagaggtt gagctcacta atgatcagct ttacgaggaa 2161 ttcgaggcaa tacaaaagta cctagggtct ctgtgaaatg gagtaaaagc gaattcaact 2221 atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg gcagcagtac 2281 atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga gtgctgtgaa 2341 ggaaggatgc gacagaagct gcgctgctta gcgtgaaaga ttatggtaat ggagggttgt 2401 gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg ggcaaaagta 2461 acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa ggaaagatgt 2521 gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg agatgtaagg 2581 ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa tttagttaca 2641 acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt ctactgttta 2701 atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc gaatttcact 2761 tctccagaat ctttatttta gtgcgctata gatgatagtc caaagagaag gaattacaat 2821 tattataacc catccaaagc tatgtcagag actcctcttc tccgagtgat tcttttctat 2881 gtcacttgtg gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg aaatggggcg 2941 tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt atgatgcata 3001 gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta ggtggaagcg 3061 gctaatgatt tgtcggcacc accaacagag gaagttaatt gttgacgtag ccgtttgtga 3121 atgagataga aggttattgg gtggaacatt atcttggaaa gagtttttct tccactaaaa 3181 atgtgtagag tggtggtttg gagaaaatgg tttagatatt gaggaggaaa caaggatttg 3241 aggtgcaagt tgatgcgctg taagatttgt attgccgttg ctgttactta ttgttgagtt 3301 ggtatttgag gaaaatgttt tttcaggtgt gtcgttgtta tatacggctg tgagttggtg 3361 atgtttattt gaatgatagc ttgataggat aagatgaaat aatactaagc atgtattctt 3421 ctggcgtgtc gccgaaatgg taatctatgt tacccatagc acgaaaaaat ttttttactc 3481 agtaacgaag cggaaagttt taaaaagcat aatttaaaaa agtgataatg atgaagtttt 3541 ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa aataaaagga 3601 atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgacctat cttaaaggaa 3661 tagcgcacag agtaaacagc gagagtgccg tggttacgga aaaactatca gatttgaaaa 3721 tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct ggcgagcagg 3781 attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt acattccagc 3841 aggagttgga ggaaatgagg aacgcatccg cgttagcagc agctgcagct gggatagcag 3901 ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg tcatcacagt 3961 tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa gaatgttttg 4021 atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta aaagagtcgg 4081 cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat agcttgggtg 4141 cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga acagatactg 4201 acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt ggtggagtga 4261 tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa tacggtgatg 4321 ctacagagta cggtgcagta cggtggaccg aagatcccag taaagtgagc atatttaagg 4381 atgtcatccg cctgtttgcg cgcttccaag aagcaaaaaa tgaagtgatg aataaaataa 4441 aaactactgt ggatgaattg gcgaaatgca tagggcagaa agaggttgag ctcactgatg 4501 atcagcttta cgaggaattt atctgggaga ccatacacag attggagctg tcaaagagag 4561 tgagtgaaca actctcttta ggtgaagaag aggagacgat cctgaaatct aactatactg 4621 cagaaccggt ccgtggacct ttcacggtag cggggtccaa cgcggcagca gtacatttga 4681 gtgtttctac cgctgcactt tgtttttcag ttttattgtt gggagtgctg tgaaggaagg 4741 atgcgacaga agctgcgctg cttagcgtga aagattatgg taatggaggg ttgtgaaaga 4801 ttgggggaac aaaaacctat ttcttttatt tgggggaaca aatgggcaaa agtaacgtaa 4861 gtttccagtg ggagtggtat gtgtgtgtgt atggggctgg ctaaggaaag atgtgagttc 4921 ggcatgtggt atgtacaagc tacgaaaacg tgtgaaacaa aacgagatgt aaggggaaaa 4981 tgtaacaacc aactatgtta aatttcagga gactattttt caaatttagt tacaacaaag 5041 taaatgtcaa ataatgccaa ctataaagga aaactagact aattctactg tttaatgtga 5101 cgagaattgt ggcgtgaaac ctagtaactg cttatcatgg cgtcgaattt cacttctcca 5161 gaatctttat tttagtgcgc tatagatgat agtccaaaga gaaggaatta caattattat 5221 aacccatcca aagctatgtc agagactcct cttctccgag tgattctttt ctatgtctct 5281 tgtggaaggt tgatattttt gctaccccct ggcgatgccg gccacctcaa cgtggtgcca 5341 gggtccagtg ccccgtatca ttgggggaag ccaagagcca gcagcgttcc tttcatgggg 5401 aacactgctg tgctccggct acggcatcat acagcacagg gatcagcagc gtcttgctgg 5461 gacaccgttt ttcatttgtc ggtccctggg cacgtgccag cgtgccatca gcagtatcat 5521 ccgcgctaag atgctgctgt ccggtgatgt ggacctccaa aaaaggattg ccatttggca 5581 tcttttggag agtccagggt gggaggcttc tcgccccatc tgctgtattc cgttcaactg 5641 cggagctaca acaaaaaatt atagagggtg tgttaggatg aatgaaaaag ggagactctg 5701 ccacagtcgc cagaccgata gcatctcagg gctctacggt gttggctgat ggccgcgcca 5761 gaggggggga aactctcacg aaggcacgaa gaaaattcaa aaaaaaattt ttgctaccac 5821 aggctgtaag gataacgctg gagatggagc cgactgtggt ttttcttgtg aatgcgtatc 5881 ttcgtaaccg gttgttaaga agggatcgga cgaaggaatg tgtctgagta ggaggtaatg 5941 tcgatgtttt ctttccattg gatgtagtaa taatctccaa aatttgttgg aaacaatata 6001 atattattgc aataaatact tgaaaagtta aattattgat ggattgactt gagtatcatg 6061 attccaatga acactctcat gatgcttaga atagcaaacg tatgaacaat caacgtagcg 6121 tcatgtagac atggacattt tttgtaatga tacgactgct attgcacatc tgaaaatttc 6181 tatgacggac cagtgccgtt aatcaccgta caattgaatt caggatgaaa aatgataaat 6241 cagtgctcta atgtatgatg cagctgtcaa ggcaatgttt ctgcacaaaa agagtttaac 6301 acactgccat ttgcatgcga ataaaacaat gagtatatga agcggaggat cgcatggcaa 6361 tgataaaatc gaacactgtg gctgcgagac gcagggccac ccggaagcac cgggtaacaa 6421 agcgagttca ggagtcgtcg aaagtggcaa aagtcagcac caaaccaagt tgctttttta 6481 aaaagatcag caaacgtatt tctcactgac ttatacagca ttcaaaggaa atgaacttag 6541 cagcaggaca agggtgcaaa ctaaggcatc gattctgtat ggctgcaaaa gtgaagacat 6601 caagcaaaca aaggtaactt atttaccctt ataggagcag tggaaaaagc agcggactcg 6661 tatggagata acctgacgaa aatactacca gttttagtca caggaagcga tatcctgtca 6721 aaatgccggc agccactcaa tggcggtcaa aacttgatta aacagttgaa aaaaacagcg 6781 gaatcaaaag gaaccttgcc cgtgctcacg gtggtagata aagcaacgaa aaatgtaaag 6841 gtagaattta acggtaacaa ttcgaacaaa accgtactat cgtgcaaaaa tatttcagca 6901 caaaatcccg acaaactttg cggcgacaac acagcagaca acatggccaa gtgcacgaaa 6961 gcttccaaag cagcttttgg cgctgatggt gtatatttca aaagaaaaaa gggcaaacct 7021 tgactcagca gctataataa agtcagctag aaccggcgaa acagcggaag cgctgagcaa 7081 acttaaaaca ctggacatta tcggtgattt tatattgcgt ttatttgtcg aaaagctcat 7141 ggacatgacc gtggatgagt acacgactca aagagagact aacgacaaaa aacgctgcag 7201 tggcaaaagt agcgtgcctg tacaggggac ggagaacttg ataataatgg taatattaat 7261 cttagaagaa tgttgtgagt atgtgtcact tttacagtga tagttgtgaa atattcggaa 7321 tgacagttgt aatggaggtg aaagaagctg ttaggttgaa tttggggaac catctatgta 7381 tacaaagaag cacaagggtt gtgatagctt atttatatga tggagaagcc atgcatggta 7441 aaaaaaaaca acaagtgata tgtcggtgtt tgtggctaat gtagcataaa gactaacagc 7501 tacagcagtg catcaaccga tggagagacg ctttcagaat tttcaaatgt aagaaagtca 7561 gtaaattaca aagaagtggt tgagttgata ctcagaaaag tgggactaac aaactgttga 7621 tgaatgaagg tgatgccata cgtaccaggt acacaggtca gcaatatgat gctcactgaa 7681 aaagagagat ggtgtgctac tggcgggatg tcgctggaaa agtcttagca agtgtagttg 7741 cagcaccgag cacacacact catgggagtt atcaaagtat gaaaacgcaa aggaagctga 7801 tgtgtgtgtg tgtgtttttg tgtgtgtttg caggaaacaa atgtatcagc ttcgacatct 7861 ttccgcaaga agtagaagtt tcagttgtgg tcactgacgc attacgtgga ttgttgggtt 7921 gaaacaacac aaatgtggca accatttagg atgttttggc gctttggcga cggttgtact 7981 gcagtttgta tgccccttca agaaacaagg aatggtgtga aagcgttgtt tttttttaag 8041 tcaagtatgt gcattcaaga aatcatatca cagtgttacg cagtgctttg tcgtcatggg 8101 aggaccttcc tgcattgaga gtgcatgtaa ggaaacggca acaggatgct atggctccga 8161 atttttttta agtatctgct tttcagacca cttgcgttga tcgacttcag ttatgttaca 8221 ctccgcacat cttttttgct atgttttttt ctactttagt tgcacttagt gtagccaaca 8281 ttaacttaaa gtacaaactc tatcctaagc aggattcggt gttatttcta ttcgacgatg 8341 tcattggtat cgagcgcacc cgttttggtg ttcatttccg ttgtaacact tctgtgttgt 8401 atcatcggct ctgggcaacg gtctagtagt ccctttacgg aaccggcact caaaccaggg 8461 ggcattaagg ttgctattca agaggcggct gttacgccat tgttaccagt acttgcgtat 8521 gagttcgaaa ggtttatgga gaatatcacc attcccgagc aaaagggtta accgtgtgag 8581 tgtagaagag acttattttc gtaacgtgac ggtcggaagc gcgacagtta aattcgaaga 8641 accaaataaa atcattttga agttttggaa cgtgagcgct actgtgccgt ttacgcggtt 8701 tgtataccac tcgttttggt gttacctgta tccttgcagt gggtcggcac aggcggaaat 8761 acgcaacggc agtgtggcac tgtggctcga cgtgtccgca ggaagggggg gactactaga 8821 catacatgta gggagttcgg aaattggcat gcgtgaccca ctgataactc ttatcggtga 8881 agggaaatca aagatgccta agtggcttgg cgggagagtt aaagatatgt atgataagga 8941 tgtgcttccc aagcttgggc atcatataat tactgcggtg aatcaaattt tagccaacaa 9001 agcaaaggag atatttcata tgtttccaat tgtctttgtc aacagttcta aaattgaata 9061 tggacaaatg agacttgaac tggtcgtgct tccagatgcc gcagacaaat taatgcttac 9121 ggagaatgtt tttttcccgc ggcacgcttt tccaaacttt cctgttgctg ttgtttcttc 9181 gtttacggct ttgaataata tgcttcgttt gatgattaaa agtggccact tggtggtgcg 9241 tgtgcccttt ccgctgaggt atgttatctc cagcaacgcg ctcagcgtca gttggaccgt 9301 cttttgctct ggatgtgcct ctgaggctac attcgaacta aaaacagccc cgtggttgaa 9361 gtccctaaat aagaaactgt ttacatttaa ctatcgggat gtcgaagttg ccgtggacct 9421 gctgccaagg ggtggtgccc ccatttcctt gtttagtatg ttaatgaatg tgtctgccca 9481 ggcggcgcat atcgctcttg ttgacggtgc cacccatgcg aatttggatt ctgttgatac 9541 taatgtgagt gtcacatcct cacgtatcga tggcctggac tcaagtacaa tgaacacaaa 9601 aattcgtgat ttgatcaact tgtcttggat caacctgaat gttacatata catttcctgc 9661 tcccttcgac ttgtgtacga aacacgtgaa tattacaagt gaatgctatg tagcgggatt 9721 taaccttgtg cgagctcttg gttcactcag tatactccct cacctccgat gaaggattgc 9781 gacattcctg gtcactttgt aaaacacact tccatgcgct tatgtggaag ggggtaacac 9841 tcaaacctat cttgctttcc cctgtttacg cacaacttca catacgcact ttctctttta 9901 cgagatgttt gcatgccgca acaaccaccc tgcgcaatcg acgagaagct atccgtgttt 9961 acgggtgatc cttcgaatta tggcggagga gtaaaaacct catgatatgt gaagtgctgt 10021 ggctgaagaa ggaaataaga taaaaagggt atttttcatt gacccgacca ttactggttt 10081 tgactttttg cccgaggcga gaaggcaaat ccattgttga ttggctcatg aatgtttccg 10141 tgactatacg cgccaagcct tggatatgtc gtgggctttc ggcatatgtt gataatataa 10201 cgactgccag ttggctgagt gacggctcaa agtgttagag gcgagtgtca atggaagata 10261 tatctattga tttggtaaat aatattggat tcgagattaa ctgttcaagt gtgctacaac 10321 accagatagc atgttcgtag tgtaggggtg gcaatagaag tgatggattt atcacgacta 10381 attgttctat tggcaacgct tgtgtctcgt gactgctatt ggttgattgc tgatgtttcg 10441 atttgagttg tttcgtggcg gcacgaaaag tatcatcatg tgaagtaggt taaggggaaa 10501 agaaacagat atataaagag cctttcgtgt tgctcccaag gaattgtata aatctctgag 10561 cgagatttgt gctcggtaat gggttcaaga gtgtgtatat acaaatacta tattaagagt 10621 ggtgattgag accaagtgga gggtcgaaag agatccactt aaagccatgg aaagtgtgcg 10681 gttgtagagg gtagaaaact aagaaaaggc aaatgataaa tagtattata ggaaatgcag 10741 agtgatagca agagagaggg agatggagta accaatggtt taaaaacaag catgacgatg 10801 ggcagtatta tacccataag gatcaaagag ggtcaaaaaa atacaaaatg ggttcagtgc 10861 acaagagatg tgaagatgct acaactgata cggagaagcc tagaatgtga aaattcagaa 10921 aataatacta tatgaaaata gtgagagtga caactcacaa tacaaaacaa actgatacgg 10981 gagaagatta tgctaaacat aaaagtgaaa aaagctagaa tataagctga taaaacacca 11041 taaactacat tgatttgagt ttcatgatat aaaacgtgga tgctctattc agcagaggat 11101 tctagtaaat tttttaatat atcttaaagt aatagtgaac gtagaataaa tttctgttcc 11161 aatacacaga cgagaaaaat aaagaattga acgggtagca aatttaagta acttgcgaca 11221 atatttttgc tgaagctaac ttctaaaaca aaaatgcaaa tacgtgttca aaaatatagt 11281 gaggaggata atcttttaat tacaaacgca tacatcatga taatttcaag gcattgtttg 11341 catatgtaaa cattgcttta ttaactgagt gcagttttct ggtaacctat ttcttttcat 11401 agttttcttt cataaccctc tctaatattt taaggtatat taggaaatga aatttgttac 11461 cttgtgaact ttactattac caattatatc acatttagtg tgtttaagtg caatatacct 11521 tcaagatgaa gaagtaagtg acatgacaaa tatgacacat aatggtacta agaaacccag 11581 agactttgtg gttgtcgttc aaactcatgc aacacaagga tggtgtaggc tcatattgag 11641 tgcgaaagta agccatgtgg aaattattac aattgaaatg ggaggtttgt acagtcatgt 11701 caaaaggcct aaatgggtta tggagttttt agagaaacag aatgcaagtg atgaggatat 11761 tttaatagct aaggatggat ctgatgttat agttggtgat ggaaataagt atgagagttc 11821 tgtggaatat tttgtgtaaa aaagcaccct acagtaagga caaattttat gaagaggata 11881 ttcagaaaaa ggatataggg taatgttatt gatttttgag atggccacaa aagtggagga 11941 aaagaaaagt ctaccaactc tactaaaatg gcctttaact caaaaagcat acttaaatgg 12001 aggtggctta attgccagag tttgggcatt tagaaggttt gaatatgcat ttggagagtt 12061 actaaaggga agtgaggagt ggtggtcaga gcggagtatt tacaaacctt tgttaatatg 12121 gagtgcaatt caggaagaag ctgttggaca gagatttgtt ttgaagagag gaatgattgg 12181 attggattac gaagagagat ttttttctat agaaagtagt ggggtaattg gagaagcccc 12241 attcattcat tttcctggac agcctattgc ttgggaggag aaagcgagac tgattgtgaa 12301 aaatcttagt tgatataaaa ggttgaaagg aagtgaattt aattcgaagg atatggggaa 12361 aagtagaaac ttatgtagta gatggagaga attgggaatt tcgatatgaa agaatttgtg 12421 gggacgcggt caaggagaat gacttgttta aggcaaagaa actgtaaggg atacaaaact 12481 aaagattttc ttgcggctga gtcagtagtc taagcaaatg tgggaccaac acgcatggcc 12541 aagatggccg cacaagtcta tgttgacatg aaaatcctgg ggcagcaaca aagagaagct 12601 ggaagacaaa cctcaggggt accctaacta ttcggctgaa atggtatgtc gtgacggcaa 12661 tatcgatatt ggtttcgggg gtggtaacac gatggacgca aaatttagaa gcttctatta 12721 cctctccata aaccatgatc tcaaagctag agaagcacaa aacaataaca caactagtga 12781 cactatgcgt tttttatgca cctgcacaga agaagtaacc gacgcttgga ccgaagatgc 12841 gaatacatat tcggcatctg cagtataaaa ggtaattcag ctgctagaag gaaaaaattc 12901 ctgaagaccc cgctgtgcca gagatatgta aacaactgtt tgaaatacat gcactcaata 12961 tgtcactatc agacaagacc tggcaggaca agttcataaa ggctcaacga agagccactg 13021 cagtggggtg acgaagccaa caaacataag accactaata cggagtggca cgaagtgtgg 13081 cagcgctggg tctaagccag gcaaagacta aaaaataacg aatactgagg gcgataaagt 13141 aaggaaaaat gccgcctcta aatggcaaag ttcaaaagcg gcacaggccc gagtcaagct 13201 aaactctttg ctgatccatg cggaagcctt gaacgctgag ctggaaaaaa agggcaaata 13261 atggctaacg gcaaaaaaca atagaagtgg acgttcttgc agccctctac ggaacgccca 13321 cggcaaaagc agcgtttggg agcagtggct tattgtcatc atcatttttt tgttgttttc 13381 gccactagtg gttttctttg tttgcgtatt tgatatctcc ttcgtcacta ttcttatctt 13441 tatcgtctgg aatatgggtg agatatcatt cccggtgtgt tttgcagtca tatggcgtaa 13501 catccacctg tgttgatatt ctttacagca tatcttacat gcttctcttt ccaacagtgc 13561 ttcctgtagt ggtgtgcttg gctccccttt actcttctcc ggtatgcact tatatctggt 13621 gagccaggtt ttgcttttta tagacctctt ggaagcagag acatttgtat gcttctttta 13681 ctttctcatg ttcaacttcg ctgcttattt tcctgcattt ccgtttgttg tgtcacttca 13741 gtttgctttt ctacgatgtt taaggcgtat atgttcctag tcaactagcc atacgtggca 13801 tactgctttg ggcaacattg ccacttgtag acttttttac cgaattatct tcttctcgct 13861 tccttcctct tttgacgctc tgtccttact actggagttt taggaggctg ttgtggcatg 13921 tgccttgtgc ttcttttatg accaacaagt gttattttta cagtcgtttt tgcactggtc 13981 gtgtgtggcg aaaaatacgc ccccgctcct ccaaccaatt agtggcaaag gggcggctcc 14041 gttgaggcag cactgcttgt cctcttaatg gtacattgtt gttttaccta tcgctggttc 14101 actccaccgc acgcggtgca taccttggga gttctgtact gttttggggg tggttttgtg 14161 cattgtaaca gccgcctctg acacaacaag cacagtttca cgcctgcgtt actggcattt 14221 tatgcctgtg gtggaacaac gtgtgtgttg acgtccgtaa atcttaatat ggcttgttgc 14281 gtgctaccgt ttatattttt cccccttcga tatgggcatc ccgttcttgt cgttacgtgg 14341 ttcccagttt cacctaacaa cccttgccct ctcttaccca gcagccgctc cctttcgttc 14401 gttccagtgt aacaaccact ttcacgcgta gactatcaaa atcactgcga tgaatatgct 14461 tcacttgagc gaccgcaatg cctcactcgc accgagtggc ggggaacatt cccttccgac 14521 gggaggagct gtgtgccgtg acgccatgga catacttccc gtgatactga gagctccggt 14581 cgcgttgtta cttctgctcg tcgtcctgcc gcaattgtca gtgggcgccg aggctaatgt 14641 gacagtcaag gttctctccg ccacgtggaa ctggtacatg ccgcggaagt atgtaaccgc 14701 gattaatgct ggcttcaacg cctcactgaa gtcacgaaac tggacagtcg ctggaagcgt 14761 caacgttcag gtagtttacc ccagcaacct tgacctgatg cccgaagatt tcattaaaaa 14821 gcagttggaa ttggaaactg accagaataa gattgttata gtgtatggac ccctcggtga 14881 caaaagtgta atgcactcga ttccacattt gatgaatcac agagtagtgg cctttgggtt 14941 gataactggg tccactttta tccgacaatg gaatccgtac ctttactttc tgcgtgccga 15001 ccccgctgct gaaacattgg ttctcattcg gtattcgcta tgccagttgc gcgtgctgcg 15061 cttgggcttc atgtaccttc agggtgttca ttatggcgac gaggagtatg cattgacagt 15121 gaacgtgatg tctcggatgg gttatgaact gcatggtgtg ttcaccgtga tgagtccaga 15181 cggtaaaccc gctcctgatg ctgagtttaa agaggtgttt gagcgcttcg ccactgccct 15241 tcctcaggcc attattgtct ttggtgcgcc agtggatgat accgcgaagt tcttgatgat 15301 gatggcagtg gatgagcgaa tcgctcggtc gtatattctt agtccttcat ctgtccagtt 15361 gtcactcata gaaatgtggc aactcgctct ggaagctgca ggggcctcat tcgcgcccgg 15421 acagctactt ttcacaggca ccaatccgct tgcgaaggac agccagtaca cagcaatcaa 15481 gcgttttcaa gaagttatga gtgaatacct aaaggctcac gtcggcgaaa ccaacatcac 15541 tgaagcagat tatttcctga cccacgatct tgagggagag ttgatggtgt atgggtggat 15601 ttctggggag gtgttatctc aggcacttag caaccttgaa tggctcaagg accgcgctac 15661 atttgtcagg tcgctatacg accaacggcg ctacgtcatc aatgacattg tgatcggtga 15721 ctatggcggt acgtgtgagg gggacgcggc caagcatggc gccacctgtg agtgtaacca 15781 aggcagcaag gcggtgtatg tgaaagagat gttggaaaat gggcagaaga cgtcggtgcg 15841 aagtggcttc acagttctga aagcttcgct gtgctatacg gattcttcgg agcttcacgg 15901 gccactggat ggactcgtcg ttttcatgaa agacgacgac atagcgtcga aagcagctgc 15961 gctgtggcag aaggggacct cccatcttgt cggcaagggt gatttgggat actcggacag 16021 gttctttctg cacgcattta atactaccat agctgaggca gcgaacgatc ttcggcgtga 16081 ccaaggtgaa agaatcgtaa ccgctgtgtt tggccccgtg accgaggcga tgttggacac 16141 gccgaatata acctttatcg atcctctgga acttaaacca cggctgaata agttcagaag 16201 gaatgtgatt catctctcgc ccaccctgga acaacaactt tacgtccttt catcgtacct 16261 cgcgggcgcc ggggtcggta atgttgatgc cgtgatctgc agcaacgaag cggatggaat 16321 tgcagatttt ctgcggagtt cactaactga attcgctgtc tcgctacgtt ccgctgtcat 16381 acgcgaagat ggggaagatg ttggtaaata ccttccgatg agtgggaccg tgtttgtcat 16441 tggcctttcc gttccagatg tgaaggaaat cgcgaggaaa cttgaagaac gcaacgattt 16501 gcgtgtgatt gttctcttcg gggaattttc attcctgtat gatctgtttg ccacggccct 16561 caataacacc gcaggcgccg cacgtcttgt gtttgccacc agcttgccac attggggtga 16621 tactgaaacc tcatcgaaga ctgctcaact gttccatgat gttgaaaaag actcacggtt 16681 atggacaccc ctatcagttc tggcattcgc aaccggtcga ttgatgcggg taatacttct 16741 tcatgtagaa gaaatgagtc cagaaacgtt ggtaaatttc ttctataccg attcttccat 16801 cgtctccgat gacatgcgct acggtgtatt cgacgacaca aaatgcgtcg acaccgcaaa 16861 taaactttcg aagaatggtt gtgcctcaaa ctatggtgca acgcagatat cggtgtggtc 16921 tatggcccgt gctctgaacg cttctattcc tccgcttaca aatcccatga caccgtcaat 16981 gactttcaga aactctaacg cgggaaggat ctccggggcg tcgctggtgg gcatcatcat 17041 cggtggtgcc cttgctttgt ttcttgtggt ggctctgggc gtggttccct actttttcct 17101 gcgcaacacc gtgataacaa tctgcaccaa agatgacaga cccgtaacgc taatatttac 17161 tgacattgag agcagtacgg cgttgtgggc tgcacaccct gaggtgatgc ctgatgccgt 17221 tgcgacacat caccgcttaa ttcgcacatt gatttccaag tatgaatgct acgaagtcaa 17281 gactgttgga gattctttta tgatagcgag taagagtcct ttcgctgccg tccaactcgc 17341 acaggaacta cagctgtgtt tcttgcacca tgactgggga acaaatgcga ttgatgaatc 17401 ctaccagcag tttgagcagc agcgtgcgga agatgacagt gattacacac cgccaactgc 17461 tcggttggac cctaaagtgt acagtcgttt gtggaatggc cttcgagtgc gtgttggaat 17521 ccacaccggg ctgtgcgaca tccgacgcga tgaagtaacg aagggatatg actattatgg 17581 acgtacatcc aatatggcag caaggacgga gagtgtagca aatggtggtc aggtgctgat 17641 gacgcatgcg gcgtacatgt cgctgtcagc tgaggagcgt cagcaaattg atgtcactgc 17701 acttggtgat gtgccgctac gtggtgtgcc aaagcctgtg gaaatgtacc ggttgaatgc 17761 cgtgcctggt cgtacctttt ccgttctccg tcttgagctt gagcttctta acgatgatga 17821 ggatcagact acaacttcct gtagtgacca cagttcttca cggacagatc tgagtgtagc 17881 tgcacaaact attgctgctt cccttcagtc gctcttgggt accttcaccc ctgctcagcg 17941 ccagaaggca ttaataccct tttgcgaacg gtggcgtgtt ccgcttcctc aaaaggtcgg 18001 gaacgtgtgg gacgatgacg gttgtcagga agttgtacgg cgtgttgctg caaaggtggg 18061 ccgtgttatg gattttggaa cgaggaagcc ctccagcagt gtgacatcgt ggaaaggggt 18121 ggaagtgtct tctcaggtgg aggagcggct gctataatgc ttgcttcatc ctcaagttcc 18181 tcctgcgttg atgggcattg tgttaccgta caattgattg acttggaaaa tgacagcgct 18241 accacctaat attaccctca tatcattgct gtatttgcgc tctgctattg aagtgaatga 18301 atgagttatt ttatatgatg tgtgcaatct tttctttata ttcaattcgt ataaagtgga 18361 cattataact ttccctcccc attcttgtgc ctgaaatgat cttctcaaac ggtaaatgct 18421 ttaaatacat ttgtacatga agatggtttt tgtttctgac atggctactc tccttttgag 18481 aactgtgttt gaagattgta cattttcttg tgtttgatgt gcctttgaat gtgagtatcc 18541 tcgccacttt tgataatctc cctcgtttaa ataatatatt tattgactga ataagctt // LOCUS JS3PRSBS 81 bp ds-DNA BCT 17-JUL-1990 DEFINITION Plasmid pJS37 promoter region DNA. ACCESSION M34567 KEYWORDS . SOURCE Plasmid pJS37 DNA, from B.subtilis (strain MB11). ORGANISM Plasmid pJS37 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 81) AUTHORS Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M. TITLE Comparative expression of the pC194 cat gene in Streptococcus pneumoniae, Bacillus subtilis and Escherichia coli JOURNAL Gene 86, 71-79 (1990) STANDARD simple staff_review FEATURES from to/span description pept 61 81 leader peptide (put.) mRNA 37 > 81 cat mRNA mRNA 64 > 81 cat mRNA binding 46 51 ribosomal binding site (put.) signal 34 62 promoter region 1 signal 5 34 promoter region 2 signal 57 62 P1 -10 region signal 34 39 P1 -35 region signal 29 34 P2 -10 region signal 5 10 P2 -35 region BASE COUNT 36 a 5 c 11 g 29 t ORIGIN 1 atttttaaaa tatatattta tgttacagta atattgactt ttaaaaaagg attgattcta 61 atgaagaaag cagacaagta a // LOCUS JS3PRSEC 62 bp ds-DNA BCT 17-JUL-1990 DEFINITION Plasmid pJS37 promoter region DNA. ACCESSION M34566 KEYWORDS . SOURCE Plasmid pJS37 DNA, from E.coli (strain C600). ORGANISM Plasmid pJS37 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 62) AUTHORS Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M. TITLE Comparative expression of the pC194 cat gene in Streptococcus pneumoniae, Bacillus subtilis and Escherichia coli JOURNAL Gene 86, 71-79 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 52 > 62 cat mRNA signal 21 43 promoter region PT signal 43 48 PT -10 region signal 21 26 PT -35 region BASE COUNT 23 a 3 c 14 g 22 t ORIGIN 1 ccgatgatga agaaaagaat ttgaaactta gtttatatgt ggtaaaatgt tttaattggg 61 at // LOCUS JS3PRSSP 109 bp ds-DNA BCT 17-JUL-1990 DEFINITION Plasmid pJS37 promoter region DNA. ACCESSION M34565 KEYWORDS . SOURCE Plasmid pJS37 DNA, from S.pneumoniae (strain 708). ORGANISM Plasmid pJS37 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 109) AUTHORS Ballester,S., Alonso,J.C., Lopez,P. and Espinosa,M. TITLE Comparative expression of the pC194 cat gene in Streptococcus pneumoniae, Bacillus subtilis and Escherichia coli JOURNAL Gene 86, 71-79 (1990) STANDARD simple staff_review FEATURES from to/span description pept 33 53 leader peptide (put.) signal < 1 6 promoter region 1 signal 6 34 promoter region 2 binding 18 23 ribosomal binding site (put.) BASE COUNT 47 a 13 c 16 g 33 t ORIGIN 1 taatattgac ttttaaaaaa ggattgattc taatgaagaa agcagacaag taagcctcct 61 aaattcactt tagataaaaa tttaggaggc atatcaaatg aactttaat // LOCUS HUMTNC1 1085 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human fast skeletal muscle troponin C gene, exon 1. ACCESSION M33771 J05502 KEYWORDS troponin C. SEGMENT 1 of 2 SOURCE Human HeLa cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1085) AUTHORS Gahlmann,R. and Kedes,L. TITLE Cloning, structural analysis and expression of the human fast-twitch skeletal muscle troponin C gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Gahlmann, 20-APR-1990. FEATURES from to/span description pept 1033 + 1035 troponin C, exon 1 pre-msg 968 > 1085 troponin C mRNA and introns (alt.) pre-msg 969 > 1085 troponin C mRNA and introns (alt.) IVS 1036 > 1085 troponin C intron A BASE COUNT 266 a 276 c 306 g 237 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct 61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc 121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa 181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa 241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc 301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac 361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc 421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca 481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga 541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt 601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc 661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga 721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc 781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt 841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt 901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag 961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg 1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg 1081 tggac // LOCUS HUMTNC2 1773 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human fast skeletal muscle troponin C gene, exons 2-6. ACCESSION M33772 J05502 KEYWORDS troponin C. SEGMENT 2 of 2 SOURCE Human HeLa cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1773) AUTHORS Gahlmann,R. and Kedes,L. TITLE Cloning, structural analysis and expression of the human fast-twitch skeletal muscle troponin C gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Gahlmann, 20-APR-1990. FEATURES from to/span description pept + 51 102 troponin C, exon 2 247 390 troponin C, exon 3 479 593 troponin C, exon 4 758 894 troponin C, exon 5 1506 1537 troponin C, exon 6 pre-msg < 1 1668 troponin C mRNA and introns IVS < 1 50 troponin C intron A IVS 103 246 troponin C intron B IVS 391 478 troponin C intron C IVS 594 757 troponin C intron D IVS 895 1505 troponin C intron E BASE COUNT 358 a 455 c 628 g 331 t 1 others ORIGIN About 2.5 kb after segment 1. 1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc 61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag 121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg 181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc 241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg 301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg 361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat 421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc 481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac 541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct 601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag 661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc 721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc 781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg 841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg 901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct 961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc 1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta 1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat 1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac 1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc 1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa 1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa 1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg 1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc 1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca 1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg 1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc 1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc 1741 ggctggcacc acccggagca cccttgggga tcc // LOCUS HUMMHDQ3L 967 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33841 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 967) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 961 967 inverted repeat B rpt 560 810 R region signal 792 797 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 530 536 TATA box BASE COUNT 254 a 235 c 222 g 256 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct 121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc 301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt 361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa 421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa 481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg 541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg 601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc 661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg 721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc 781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga 841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct 901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac 961 ccctaca // LOCUS HUMMHDQ5L 960 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33842 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 960) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 954 960 inverted repeat B rpt 555 803 R region signal 785 790 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 525 531 TATA box BASE COUNT 250 a 232 c 219 g 259 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct 121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag 301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa 361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt 421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa 481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt 541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca 601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac 661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct 721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat 781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg 841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa 901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca // LOCUS SYNLACZA 6476 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD1.27. ACCESSION M34296 KEYWORDS lacZ. SOURCE Cloning vector pPD1.27. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6476) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3500 3501 E.coli lacZ end/SV40 start recomb 3634 3635 SV40 end/synthetic start recomb 3657 3658 synthetic end/pUC19 start recomb 4076 4077 pUC19 end/C.elegans sup-7 start recomb 4442 4443 C.elegans sup-7 end/pUC19 start BASE COUNT 1560 a 1646 c 1676 g 1594 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc agctgagcgc cggtcgctac cattaccagt tggtctggtg tcaaaaataa 3241 taataaccgg gcaggccatg tctgcccgta tttcgcgtaa ggaaatccat tatgtactat 3301 ttaaaaaaca caaacttttg gatgttcggt ttattctttt tcttttactt ttttatcatg 3361 ggagcctact tcccgttttt cccgatttgg ctacatgaca tcaaccatat cagcaaaagt 3421 gatacgggta ttatttttgc cgctatttct ctgttctcgc tattattcca accgctgttt 3481 ggtctgcttt ctgacaaact cggaacttgt ttattgcagc ttataatggt tacaaataaa 3541 gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct agttgtggtt 3601 tgtccaaact catcaatgta tcttatcatg tctggatcga caaagtcaaa gcggccgcct 3661 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct 3721 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 3781 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 3841 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 3901 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 3961 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4021 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa 4081 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc 4141 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt 4201 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt 4261 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt 4321 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac 4381 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg 4441 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 4501 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 4561 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 4621 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 4681 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 4741 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 4801 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 4861 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 4921 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 4981 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5041 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 5101 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 5161 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 5221 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 5281 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 5341 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 5401 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 5461 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 5521 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 5581 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct 5641 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 5701 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 5761 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 5821 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 5881 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 5941 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6001 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6061 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 6121 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 6181 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 6241 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 6301 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 6361 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 6421 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZB 6096 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD8.02. ACCESSION M34297 KEYWORDS lacZ. SOURCE Cloning vector pPD8.02. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6096) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3275 3276 synthetic end/unknown DNA start recomb 3696 3697 unknown DNA end/C.elegans sup-7 end recomb 4062 4063 C.elegans sup-7 end/pUC19 start BASE COUNT 1458 a 1571 c 1616 g 1451 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcctact agtcggccgt acgggccctt aaggccgcct gatgcggtat tttctcctta 3301 cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc tgctctgatg 3361 ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt 3421 gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc 3481 agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat 3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 3661 tcatgagaca ataaccctga taaatgcttc aataatacaa ttttcagaat acgttttttg 3721 tgggcttggg tatattgttt ttaatgttat acttgcagtc gtgaaatttg attttcaaat 3781 ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt gtcaaaaacc aatttcaaca 3841 aattttcgtg tgagaaatac attaccagaa ggcatttttt cacacgatta gcattttgga 3901 ctactttatt aaatttttgc gtgtaatttt gaattaaatt gtattatatt actacttaaa 3961 aaacaaaaaa tttgaccact gagcggatcg aacgcccaac ctttcgatct agagtcgaac 4021 gcgctaccat tgcgccaagc agtcatgtta ttctctcttg tcattgaaaa aggaagagta 4081 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 4141 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 4201 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 4261 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 4321 gtattgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 4381 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 4441 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 4501 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 4561 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 4621 ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 4681 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 4741 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 4801 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 4861 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 4921 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 4981 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 5041 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 5101 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 5161 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 5221 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 5281 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 5341 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 5401 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 5461 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 5521 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 5581 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 5641 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 5701 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 5761 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 5821 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 5881 agcgcccaat acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc 5941 acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc 6001 tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa 6061 ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZC 7376 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD8.33. ACCESSION M34298 KEYWORDS lacZ. SOURCE Cloning vector pPD8.33. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7376) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 3524 3425 unknown DNA end/C.elegans sup-7 end recomb 4555 4556 synthetic end/pUC19 start recomb 4976 4977 pUC19 end/C.elegans sup-7 start recomb 5342 5343 C.elegans sup-7 end/pUC19 start BASE COUNT 1863 a 1793 c 1814 g 1906 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat 3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat 3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac 3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg 3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca 3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca 3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc 3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt 3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg 3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa 3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga 3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa 3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa 4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg 4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg 4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa 4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa 4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc 4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta 4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa 4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca 4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt aaggccgcct 4561 gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat ggtgcactct 4621 cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc caacacccgc 4681 tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 4741 ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgagacgaaa 4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatacaa 4981 ttttcagaat acgttttttg tgggcttggg tatattgttt ttaatgttat acttgcagtc 5041 gtgaaatttg attttcaaat ttgtagaaaa atcaagaaaa taattgcaac attcgcttgt 5101 gtcaaaaacc aatttcaaca aattttcgtg tgagaaatac attaccagaa ggcatttttt 5161 cacacgatta gcattttgga ctactttatt aaatttttgc gtgtaatttt gaattaaatt 5221 gtattatatt actacttaaa aaacaaaaaa tttgaccact gagcggatcg aacgcccaac 5281 ctttcgatct agagtcgaac gcgctaccat tgcgccaagc agtcatgtta ttctctcttg 5341 tcattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt 5401 tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc 5461 tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat 5521 ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct 5581 atgtggcgcg gtattatccc gtattgacgc cgggcaagag caactcggtc gccgcataca 5641 ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg 5701 catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa 5761 cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg 5821 ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga 5881 cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg ttgcgcaaac tattaactgg 5941 cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt 6001 tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg 6061 agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc 6121 ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca 6181 gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc 6241 atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat 6301 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 6361 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 6421 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 6481 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct 6541 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 6601 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 6661 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 6721 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 6781 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 6841 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 6901 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 6961 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 7021 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 7081 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 7141 agtgagcgag gaagcggaag agcgcccaat acgcaaaccg cctctccccg cgcgttggcc 7201 gattcattaa tgcagctggc acgacaggtt tcccgactgg aaagcgggca gtgagcgcaa 7261 cgcaattaat gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc 7321 ggctcgtatg ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagct // LOCUS SYNLACZD 5730 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD16.43. ACCESSION M34299 KEYWORDS lacZ. SOURCE Cloning vector pPD16.43. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 5730) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 3264 3265 synthetic end/unknown DNA start recomb 3513 3514 unknown DNA end/synthetic start recomb 3524 3525 synthetic end/pUC19 start BASE COUNT 1348 a 1488 c 1580 g 1314 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcctact agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga 3301 aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg 3361 gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa 3421 ctatgcggca tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca 3481 cagatgcgta aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat 3541 ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg 3601 gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc 3661 tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta 3721 ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg 3781 ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg 3841 gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac 3901 gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg 3961 acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt 4021 actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg 4081 ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac 4141 cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt 4201 gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag 4261 caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 4321 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 4381 ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 4441 tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 4501 ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 4561 ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 4621 ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 4681 tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 4741 cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 4801 taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 4861 gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 4921 acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 4981 ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 5041 ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 5101 cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg 5161 aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 5221 gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 5281 gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca 5341 gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 5401 ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 5461 ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 5521 caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca 5581 ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc 5641 attaggcacc ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga 5701 gcggataaca atttcacaca ggaaacagct // LOCUS SYNLACZE 7010 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD16.51. ACCESSION M34300 KEYWORDS lacZ. SOURCE Cloning vector pPD16.51. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7010) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 4544 4545 synthetic end/pUC19 start recomb 4793 4794 pUC19 end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 4524 4525 unknown DNA end/synthetic start recomb 4804 4805 synthetic end/pUC19 start BASE COUNT 1753 a 1710 c 1778 g 1769 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggggccgc tgtcatcaga tcgccatctc gcgcccgtgc ctctgacttc taagtccaat 3301 tactcttcaa catccctaca tgctctttct ccctgtgctc ccacccccta tttttgttat 3361 tatcaaaaaa acttcttctt aatttctttg ttttttagct tcttttaagt cacctctaac 3421 aatgaaattg tgtagattca aaaatagaat taattcgtaa taaaaagtcg aaaaaaattg 3481 tgctccctcc ccccattaat aataattcta tcccaaaatc tacacaatgt tctgtgtaca 3541 cttcttatgt tttttttact tctgataaat tttttttgaa acatcataga aaaaaccgca 3601 cacaaaatac cttatcatat gttacgtttc agtttatgac cgcaattttt atttcttcgc 3661 acgtctgggc ctctcatgac gtcaaatcat gctcatcgtg aaaaagtttt ggagtatttt 3721 tggaattttt caatcaagtg aaagtttatg aaattaattt tcctgctttt gctttttggg 3781 ggtttcccct attgtttgtc aagagtttcg aggacggcgt ttttcttgct aaaatcacaa 3841 gtattgatga gcacgatgca agaaagatcg gaagaaggtt tgggtttgag gctcagtgga 3901 aggtgagtag aagttgataa tttgaaagtg gagtagtgtc tatggggttt ttgccttaaa 3961 tgacagaata cattcccaat ataccaaaca taactgttta aaattaaaca tttttctaaa 4021 ttttatatga tttcttttaa atttgcaaaa attacttaaa tttgaattcc cgcgcaaatg 4081 agtgacttca ttttctgcat tattgtgttt tccggctata ttaataggta tttgtttgtg 4141 tttttcttta ttttatgatt cgaactccaa tttgtaaatt ttcgaacata tttccctaaa 4201 gaaaaaatat gattaatctg gaaaaattgg aaaattattt ttcaaataaa aaacaaagaa 4261 aaaaatgaag aaaaacctat tagtttggcc ataaaacgca aaaatgtcga aaatgacgtc 4321 actcatctgc gcgggaaatc aagaataatt cggccttttt tatttttttg gaaaatcgta 4381 aaacatttag aaaaattttt taatagttat agtgggactg tattctgtca tttagggcaa 4441 aagccagaga cgctactcca ccgttaacat gaattatgaa tattattgcg acaagaccca 4501 aacattgata aaccgcaaat ctagcctact agtcggccgt acgggccctt tcgtctcgcg 4561 cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct 4621 tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc 4681 gggtgtcggg gctggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 4741 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcggccttaa 4801 gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac 4861 gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat 4921 acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg 4981 aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc 5041 attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga 5101 tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga 5161 gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg 5221 cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca tacactattc 5281 tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac 5341 agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact 5401 tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca 5461 tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg 5521 tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa ctggcgaact 5581 acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg 5641 accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg 5701 tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat 5761 cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc 5821 tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat 5881 actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt 5941 tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc 6001 cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt 6061 gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac 6121 tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt 6181 gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct 6241 gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga 6301 ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac 6361 acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagcattg 6421 agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt 6481 cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc 6541 tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg 6601 gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc 6661 ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 6721 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 6781 cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca 6841 ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat 6901 taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc ttccggctcg 6961 tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct // LOCUS SYNLACZF 7088 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD18.32. ACCESSION M34301 KEYWORDS lacZ. SOURCE Cloning vector pPD18.32. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7088) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 4622 4623 synthetic end/pUC19 start recomb 4871 4872 pUC19 end/synthetic start recomb 3322 3323 synthetic end/unknown DNA start recomb 4602 4603 unknown DNA end/synthetic start recomb 4882 4883 synthetic end/pUC19 start BASE COUNT 1780 a 1724 c 1797 g 1787 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct 3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc 3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc 3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata 3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta 3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac 3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg 3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa 3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc 3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt 3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg 3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta 4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttaaa 4081 attaaacatt tttctaaatt ttatatgatt tcttttaaat ttgcaaaaat tacttaaatt 4141 tgaattcccg cgcaaatgag tgacttcatt ttctgcatta ttgtgttttc cggctatatt 4201 aataggtatt tgtttgtgtt tttctttatt ttatgattcg aactccaatt tgtaaatttt 4261 cgaacatatt tccctaaaga aaaaatatga ttaatctgga aaaattggaa aattattttt 4321 caaataaaaa acaaagaaaa aaatgaagaa aaacctatta gtttggccat aaaacgcaaa 4381 aatgtcgaaa atgacgtcac tcatctgcgc gggaaatcaa gaataattcg gcctttttta 4441 tttttttgga aaatcgtaaa acatttagaa aaatttttta atagttatag tgggactgta 4501 ttctgtcatt tagggcaaaa gccagagacg ctactccacc gttaacatga attatgaata 4561 ttattgcgac aagacccaaa cattgataaa ccgcaaatct agcctactag tcggccgtac 4621 gggccctttc gtctcgcgcg tttcggtgat gacggtgaaa acctctgaca catgcagctc 4681 ccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc ccgtcagggc 4741 gcgtcagcgg gtgttggcgg gtgtcggggc tggcttaact atgcggcatc agagcagatt 4801 gtactgagag tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac 4861 cgcatcaggc ggccttaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat 4921 aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat 4981 ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata 5041 aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct 5101 tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa 5161 agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa 5221 cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt 5281 taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg 5341 tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca 5401 tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa 5461 cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt 5521 gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc 5581 cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa 5641 actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 5701 ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 5761 tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 5821 tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 5881 acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga 5941 ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat 6001 ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 6061 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 6121 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 6181 ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 6241 aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 6301 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 6361 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 6421 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 6481 cctacagcgt gagcattgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 6541 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 6601 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 6661 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 6721 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 6781 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 6841 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc 6901 cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 6961 cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 7021 ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 7081 aaacagct // LOCUS SYNLACZG 6563 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD21.28. ACCESSION M34302 KEYWORDS lacZ. SOURCE Cloning vector pPD21.28. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6563) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 4097 4098 synthetic end/pUC19 start recomb 4346 4347 pUC19 end/synthetic start recomb 3322 3323 synthetic end/unknown DNA start recomb 4077 4078 unknown DNA end/synthetic start recomb 4357 4358 synthetic end/pUC19 start BASE COUNT 1587 a 1650 c 1723 g 1603 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggggccgctg tcatcagatc gccatctcgc gcccgtgcct 3361 ctgacttcta agtccaatta ctcttcaaca tccctacatg ctctttctcc ctgtgctccc 3421 accccctatt tttgttatta tcaaaaaaac ttcttcttaa tttctttgtt ttttagcttc 3481 ttttaagtca cctctaacaa tgaaattgtg tagattcaaa aatagaatta attcgtaata 3541 aaaagtcgaa aaaaattgtg ctccctcccc ccattaataa taattctatc ccaaaatcta 3601 cacaatgttc tgtgtacact tcttatgttt tttttacttc tgataaattt tttttgaaac 3661 atcatagaaa aaaccgcaca caaaatacct tatcatatgt tacgtttcag tttatgaccg 3721 caatttttat ttcttcgcac gtctgggcct ctcatgacgt caaatcatgc tcatcgtgaa 3781 aaagttttgg agtatttttg gaatttttca atcaagtgaa agtttatgaa attaattttc 3841 ctgcttttgc tttttggggg tttcccctat tgtttgtcaa gagtttcgag gacggcgttt 3901 ttcttgctaa aatcacaagt attgatgagc acgatgcaag aaagatcgga agaaggtttg 3961 ggtttgaggc tcagtggaag gtgagtagaa gttgataatt tgaaagtgga gtagtgtcta 4021 tggggttttt gccttaaatg acagaataca ttcccaatat accaaacata actgtttcct 4081 actagtcggc cgtacgggcc ctttcgtctc gcgcgtttcg gtgatgacgg tgaaaacctc 4141 tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga 4201 caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct taactatgcg 4261 gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc 4321 gtaaggagaa aataccgcat caggcggcct taagggcctc gtgatacgcc tatttttata 4381 ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt 4441 gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 4501 acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 4561 tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 4621 agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 4681 cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 4741 aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 4801 gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 4861 agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 4921 aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 4981 gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 5041 ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 5101 aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 5161 aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc 5221 tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 5281 agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 5341 ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 5401 ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 5461 ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 5521 acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 5581 agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 5641 ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 5701 cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa 5761 gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 5821 cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 5881 gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 5941 caccgaactg agatacctac agcgtgagca ttgagaaagc gccacgcttc ccgaagggag 6001 aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 6061 tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 6121 gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 6181 ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 6241 atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 6301 cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 6361 caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 6421 cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 6481 accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata 6541 acaatttcac acaggaaaca gct // LOCUS SYNLACZH 6562 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD22.04. ACCESSION M34303 KEYWORDS lacZ. SOURCE Cloning vector pPD22.04. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6562) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 179 180 synthetic end/E.coli trpS start recomb 263 264 E.coli trpS end/synthetic start recomb 267 268 synthetic end/E.coli lacZ start recomb 3261 3262 E.coli lacZ end/synthetic start recomb 4096 4097 synthetic end/pUC19 start recomb 4345 4346 pUC19 end/synthetic start recomb 3321 3322 synthetic end/unknown DNA start recomb 4076 4077 unknown DNA end/synthetic start recomb 4356 4357 synthetic end/pUC19 start BASE COUNT 1587 a 1650 c 1722 g 1603 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt gagggtaccg agctcagaaa aaatgactgc tccaaagaag aagcgtaagg 181 taccggtggg tgaagaccag aaacagcacc tcgaactgag ccgcgatatt gcccagcgtt 241 tcaacgcgct gtatggcgag atcgatcccg tcgttttaca acgtcgtgac tgggaaaacc 301 ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 361 gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggc 421 gctttgcctg gtttccggca ccagaagcgg tgccggaaag ctggctggag tgcgatcttc 481 ctgaggccga tactgtcgtc gtcccctcaa actggcagat gcacggttac gatgcgccca 541 tctacaccaa cgtaacctat cccattacgg tcaatccgcc gtttgttccc acggagaatc 601 cgacgggttg ttactcgctc acatttaatg ttgatgaaag ctggctacag gaaggccaga 661 cgcgaattat ttttgatggc gttaactcgg cgtttcatct gtggtgcaac gggcgctggg 721 tcggttacgg ccaggacagt cgtttgccgt ctgaatttga cctgagcgca tttttacgcg 781 ccggagaaaa ccgcctcgcg gtgatggtgc tgcgttggag tgacggcagt tatctggaag 841 atcaggatat gtggcggatg agcggcattt tccgtgacgt ctcgttgctg cataaaccga 901 ctacacaaat cagcgatttc catgttgcca ctcgctttaa tgatgatttc agccgcgctg 961 tactggaggc tgaagttcag atgtgcggcg agttgcgtga ctacctacgg gtaacagttt 1021 ctttatggca gggtgaaacg caggtcgcca gcggcaccgc gcctttcggc ggtgaaatta 1081 tcgatgagcg tggtggttat gccgatcgcg tcacactacg tctgaacgtc gaaaacccga 1141 aactgtggag cgccgaaatc ccgaatctct atcgtgcggt ggttgaactg cacaccgccg 1201 acggcacgct gattgaagca gaagcctgcg atgtcggttt ccgcgaggtg cggattgaaa 1261 atggtctgct gctgctgaac ggcaagccgt tgctgattcg aggcgttaac cgtcacgagc 1321 atcatcctct gcatggtcag gtcatggatg agcagacgat ggtgcaggat atcctgctga 1381 tgaagcagaa caactttaac gccgtgcgct gttcgcatta tccgaaccat ccgctgtggt 1441 acacgctgtg cgaccgctac ggcctgtatg tggtggatga agccaatatt gaaacccacg 1501 gcatggtgcc aatgaatcgt ctgaccgatg atccgcgctg gctaccggcg atgagcgaac 1561 gcgtaacgcg aatggtgcag cgcgatcgta atcacccgag tgtgatcatc tggtcgctgg 1621 ggaatgaatc aggccacggc gctaatcacg acgcgctgta tcgctggatc aaatctgtcg 1681 atccttcccg cccggtgcag tatgaaggcg gcggagccga caccacggcc accgatatta 1741 tttgcccgat gtacgcgcgc gtggatgaag accagccctt cccggctgtg ccgaaatggt 1801 ccatcaaaaa atggctttcg ctacctggag agacgcgccc gctgatcctt tgcgaatacg 1861 cccacgcgat gggtaacagt cttggcggtt tcgctaaata ctggcaggcg tttcgtcagt 1921 atccccgttt acagggcggc ttcgtctggg actgggtgga tcagtcgctg attaaatatg 1981 atgaaaacgg caacccgtgg tcggcttacg gcggtgattt tggcgatacg ccgaacgatc 2041 gccagttctg tatgaacggt ctggtctttg ccgaccgcac gccgcatcca gcgctgacgg 2101 aagcaaaaca ccagcagcag tttttccagt tccgtttatc cgggcaaacc atcgaagtga 2161 ccagcgaata cctgttccgt catagcgata acgagctcct gcactggatg gtggcgctgg 2221 atggtaagcc gctggcaagc ggtgaagtgc ctctggatgt cgctccacaa ggtaaacagt 2281 tgattgaact gcctgaacta ccgcagccgg agagcgccgg gcaactctgg ctcacagtac 2341 gcgtagtgca accgaacgcg accgcatggt cagaagccgg gcacatcagc gcctggcagc 2401 agtggcgtct ggcggaaaac ctcagtgtga cgctccccgc cgcgtcccac gccatcccgc 2461 atctgaccac cagcgaaatg gatttttgca tcgagctggg taataagcgt tggcaattta 2521 accgccagtc aggctttctt tcacagatgt ggattggcga taaaaaacaa ctgctgacgc 2581 cgctgcgcga tcagttcacc cgtgcaccgc tggataacga cattggcgta agtgaagcga 2641 cccgcattga ccctaacgcc tgggtcgaac gctggaaggc ggcgggccat taccaggccg 2701 aagcagcgtt gttgcagtgc acggcagata cacttgctga tgcggtgctg attacgaccg 2761 ctcacgcgtg gcagcatcag gggaaaacct tatttatcag ccggaaaacc taccggattg 2821 atggtagtgg tcaaatggcg attaccgttg atgttgaagt ggcgagcgat acaccgcatc 2881 cggcgcggat tggcctgaac tgccagctgg cgcaggtagc agagcgggta aactggctcg 2941 gattagggcc gcaagaaaac tatcccgacc gccttactgc cgcctgtttt gaccgctggg 3001 atctgccatt gtcagacatg tataccccgt acgtcttccc gagcgaaaac ggtctgcgct 3061 gcgggacgcg cgaattgaat tatggcccac accagtggcg cggcgacttc cagttcaaca 3121 tcagccgcta cagtcaacag caactgatgg aaaccagcca tcgccatctg ctgcacgcgg 3181 aagaaggcac atggctgaat atcgacggtt tccatatggg gattggtggc gacgactcct 3241 ggagcccgtc agtatcggcg gaattccaac tgagcgccgg tcgctaccat taccaacttg 3301 tctggtgtca aaaataatag gggccgctgt catcagatcg ccatctcgcg cccgtgcctc 3361 tgacttctaa gtccaattac tcttcaacat ccctacatgc tctttctccc tgtgctccca 3421 ccccctattt ttgttattat caaaaaaact tcttcttaat ttctttgttt tttagcttct 3481 tttaagtcac ctctaacaat gaaattgtgt agattcaaaa atagaattaa ttcgtaataa 3541 aaagtcgaaa aaaattgtgc tccctccccc cattaataat aattctatcc caaaatctac 3601 acaatgttct gtgtacactt cttatgtttt ttttacttct gataaatttt ttttgaaaca 3661 tcatagaaaa aaccgcacac aaaatacctt atcatatgtt acgtttcagt ttatgaccgc 3721 aatttttatt tcttcgcacg tctgggcctc tcatgacgtc aaatcatgct catcgtgaaa 3781 aagttttgga gtatttttgg aatttttcaa tcaagtgaaa gtttatgaaa ttaattttcc 3841 tgcttttgct ttttgggggt ttcccctatt gtttgtcaag agtttcgagg acggcgtttt 3901 tcttgctaaa atcacaagta ttgatgagca cgatgcaaga aagatcggaa gaaggtttgg 3961 gtttgaggct cagtggaagg tgagtagaag ttgataattt gaaagtggag tagtgtctat 4021 ggggtttttg ccttaaatga cagaatacat tcccaatata ccaaacataa ctgtttccta 4081 ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg tgatgacggt gaaaacctct 4141 gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc gggagcagac 4201 aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggctggctt aactatgcgg 4261 catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg 4321 taaggagaaa ataccgcatc aggcggcctt aagggcctcg tgatacgcct atttttatag 4381 gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg 4441 cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga 4501 caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat 4561 ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca 4621 gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc 4681 gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca 4741 atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg 4801 caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca 4861 gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata 4921 accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag 4981 ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg 5041 gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca 5101 acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta 5161 atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct 5221 ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca 5281 gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag 5341 gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat 5401 tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt 5461 taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa 5521 cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 5581 gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 5641 gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 5701 agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 5761 aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 5821 agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 5881 cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 5941 accgaactga gatacctaca gcgtgagcat tgagaaagcg ccacgcttcc cgaagggaga 6001 aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 6061 ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 6121 cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 6181 gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 6241 tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 6301 agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc 6361 aaaccgcctc tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc 6421 gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca 6481 ccccaggctt tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa 6541 caatttcaca caggaaacag ct // LOCUS SYNLACZI 6567 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD22.11. ACCESSION M34304 KEYWORDS lacZ. SOURCE Cloning vector pPD22.11. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6567) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 184 185 synthetic end/E.coli trpS start recomb 268 269 E.coli trpS end/synthetic start recomb 272 273 synthetic end/E.coli lacZ start recomb 3266 3267 E.coli lacZ end/synthetic start recomb 4101 4102 synthetic end/pUC19 start recomb 4350 4351 pUC19 end/synthetic start recomb 3326 3327 synthetic end/unknown DNA start recomb 4081 4082 unknown DNA end/synthetic start recomb 4361 4362 synthetic end/pUC19 start BASE COUNT 1587 a 1651 c 1724 g 1605 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt gcttggaggg taccgagctc agaaaaaatg actgctccaa agaagaagcg 181 taaggtaccg gtgggtgaag accagaaaca gcacctcgaa ctgagccgcg atattgccca 241 gcgtttcaac gcgctgtatg gcgagatcga tcccgtcgtt ttacaacgtc gtgactggga 301 aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 361 taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 421 atggcgcttt gcctggtttc cggcaccaga agcggtgccg gaaagctggc tggagtgcga 481 tcttcctgag gccgatactg tcgtcgtccc ctcaaactgg cagatgcacg gttacgatgc 541 gcccatctac accaacgtaa cctatcccat tacggtcaat ccgccgtttg ttcccacgga 601 gaatccgacg ggttgttact cgctcacatt taatgttgat gaaagctggc tacaggaagg 661 ccagacgcga attatttttg atggcgttaa ctcggcgttt catctgtggt gcaacgggcg 721 ctgggtcggt tacggccagg acagtcgttt gccgtctgaa tttgacctga gcgcattttt 781 acgcgccgga gaaaaccgcc tcgcggtgat ggtgctgcgt tggagtgacg gcagttatct 841 ggaagatcag gatatgtggc ggatgagcgg cattttccgt gacgtctcgt tgctgcataa 901 accgactaca caaatcagcg atttccatgt tgccactcgc tttaatgatg atttcagccg 961 cgctgtactg gaggctgaag ttcagatgtg cggcgagttg cgtgactacc tacgggtaac 1021 agtttcttta tggcagggtg aaacgcaggt cgccagcggc accgcgcctt tcggcggtga 1081 aattatcgat gagcgtggtg gttatgccga tcgcgtcaca ctacgtctga acgtcgaaaa 1141 cccgaaactg tggagcgccg aaatcccgaa tctctatcgt gcggtggttg aactgcacac 1201 cgccgacggc acgctgattg aagcagaagc ctgcgatgtc ggtttccgcg aggtgcggat 1261 tgaaaatggt ctgctgctgc tgaacggcaa gccgttgctg attcgaggcg ttaaccgtca 1321 cgagcatcat cctctgcatg gtcaggtcat ggatgagcag acgatggtgc aggatatcct 1381 gctgatgaag cagaacaact ttaacgccgt gcgctgttcg cattatccga accatccgct 1441 gtggtacacg ctgtgcgacc gctacggcct gtatgtggtg gatgaagcca atattgaaac 1501 ccacggcatg gtgccaatga atcgtctgac cgatgatccg cgctggctac cggcgatgag 1561 cgaacgcgta acgcgaatgg tgcagcgcga tcgtaatcac ccgagtgtga tcatctggtc 1621 gctggggaat gaatcaggcc acggcgctaa tcacgacgcg ctgtatcgct ggatcaaatc 1681 tgtcgatcct tcccgcccgg tgcagtatga aggcggcgga gccgacacca cggccaccga 1741 tattatttgc ccgatgtacg cgcgcgtgga tgaagaccag cccttcccgg ctgtgccgaa 1801 atggtccatc aaaaaatggc tttcgctacc tggagagacg cgcccgctga tcctttgcga 1861 atacgcccac gcgatgggta acagtcttgg cggtttcgct aaatactggc aggcgtttcg 1921 tcagtatccc cgtttacagg gcggcttcgt ctgggactgg gtggatcagt cgctgattaa 1981 atatgatgaa aacggcaacc cgtggtcggc ttacggcggt gattttggcg atacgccgaa 2041 cgatcgccag ttctgtatga acggtctggt ctttgccgac cgcacgccgc atccagcgct 2101 gacggaagca aaacaccagc agcagttttt ccagttccgt ttatccgggc aaaccatcga 2161 agtgaccagc gaatacctgt tccgtcatag cgataacgag ctcctgcact ggatggtggc 2221 gctggatggt aagccgctgg caagcggtga agtgcctctg gatgtcgctc cacaaggtaa 2281 acagttgatt gaactgcctg aactaccgca gccggagagc gccgggcaac tctggctcac 2341 agtacgcgta gtgcaaccga acgcgaccgc atggtcagaa gccgggcaca tcagcgcctg 2401 gcagcagtgg cgtctggcgg aaaacctcag tgtgacgctc cccgccgcgt cccacgccat 2461 cccgcatctg accaccagcg aaatggattt ttgcatcgag ctgggtaata agcgttggca 2521 atttaaccgc cagtcaggct ttctttcaca gatgtggatt ggcgataaaa aacaactgct 2581 gacgccgctg cgcgatcagt tcacccgtgc accgctggat aacgacattg gcgtaagtga 2641 agcgacccgc attgacccta acgcctgggt cgaacgctgg aaggcggcgg gccattacca 2701 ggccgaagca gcgttgttgc agtgcacggc agatacactt gctgatgcgg tgctgattac 2761 gaccgctcac gcgtggcagc atcaggggaa aaccttattt atcagccgga aaacctaccg 2821 gattgatggt agtggtcaaa tggcgattac cgttgatgtt gaagtggcga gcgatacacc 2881 gcatccggcg cggattggcc tgaactgcca gctggcgcag gtagcagagc gggtaaactg 2941 gctcggatta gggccgcaag aaaactatcc cgaccgcctt actgccgcct gttttgaccg 3001 ctgggatctg ccattgtcag acatgtatac cccgtacgtc ttcccgagcg aaaacggtct 3061 gcgctgcggg acgcgcgaat tgaattatgg cccacaccag tggcgcggcg acttccagtt 3121 caacatcagc cgctacagtc aacagcaact gatggaaacc agccatcgcc atctgctgca 3181 cgcggaagaa ggcacatggc tgaatatcga cggtttccat atggggattg gtggcgacga 3241 ctcctggagc ccgtcagtat cggcggaatt ccaactgagc gccggtcgct accattacca 3301 acttgtctgg tgtcaaaaat aataggggcc gctgtcatca gatcgccatc tcgcgcccgt 3361 gcctctgact tctaagtcca attactcttc aacatcccta catgctcttt ctccctgtgc 3421 tcccaccccc tatttttgtt attatcaaaa aaacttcttc ttaatttctt tgttttttag 3481 cttcttttaa gtcacctcta acaatgaaat tgtgtagatt caaaaataga attaattcgt 3541 aataaaaagt cgaaaaaaat tgtgctccct ccccccatta ataataattc tatcccaaaa 3601 tctacacaat gttctgtgta cacttcttat gtttttttta cttctgataa attttttttg 3661 aaacatcata gaaaaaaccg cacacaaaat accttatcat atgttacgtt tcagtttatg 3721 accgcaattt ttatttcttc gcacgtctgg gcctctcatg acgtcaaatc atgctcatcg 3781 tgaaaaagtt ttggagtatt tttggaattt ttcaatcaag tgaaagttta tgaaattaat 3841 tttcctgctt ttgctttttg ggggtttccc ctattgtttg tcaagagttt cgaggacggc 3901 gtttttcttg ctaaaatcac aagtattgat gagcacgatg caagaaagat cggaagaagg 3961 tttgggtttg aggctcagtg gaaggtgagt agaagttgat aatttgaaag tggagtagtg 4021 tctatggggt ttttgcctta aatgacagaa tacattccca atataccaaa cataactgtt 4081 tcctactagt cggccgtacg ggccctttcg tctcgcgcgt ttcggtgatg acggtgaaaa 4141 cctctgacac atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag 4201 cagacaagcc cgtcagggcg cgtcagcggg tgttggcggg tgtcggggct ggcttaacta 4261 tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa taccgcacag 4321 atgcgtaagg agaaaatacc gcatcaggcg gccttaaggg cctcgtgata cgcctatttt 4381 tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa 4441 atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca 4501 tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 4561 aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 4621 acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 4681 acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 4741 ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg 4801 ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 4861 caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 4921 ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 4981 aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg 5041 aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 5101 tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 5161 aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 5221 cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 5281 ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga 5341 gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 5401 agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 5461 atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 5521 cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 5581 cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 5641 cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 5701 tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact 5761 tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 5821 ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 5881 aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 5941 cctacaccga actgagatac ctacagcgtg agcattgaga aagcgccacg cttcccgaag 6001 ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg 6061 agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac 6121 ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca 6181 acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg 6241 cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc 6301 gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa 6361 tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt 6421 ttcccgactg gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt 6481 aggcacccca ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg 6541 gataacaatt tcacacagga aacagct // LOCUS SYNLACZJ 7242 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD26.77. ACCESSION M34305 KEYWORDS lacZ. SOURCE Cloning vector pPD26.77. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7242) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 102 103 synthetic end/E.coli trpS start recomb 186 187 E.coli trpS end/synthetic start recomb 190 191 synthetic end/E.coli lacZ start recomb 3184 3185 E.coli lacZ end/synthetic start recomb 4776 4777 synthetic end/pUC19 start recomb 5025 5026 pUC19 end/synthetic start recomb 3244 3245 synthetic end/unknown DNA start recomb 4756 4757 unknown DNA end/synthetic start recomb 5036 5037 synthetic end/pUC19 start BASE COUNT 1780 a 1801 c 1846 g 1815 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggta 61 ccgagctcag aaaaaatgac tgctccaaag aagaagcgta aggtaccggt gggtgaagac 121 cagaaacagc acctcgaact gagccgcgat attgcccagc gtttcaacgc gctgtatggc 181 gagatcgatc ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 241 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 301 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgctttgc ctggtttccg 361 gcaccagaag cggtgccgga aagctggctg gagtgcgatc ttcctgaggc cgatactgtc 421 gtcgtcccct caaactggca gatgcacggt tacgatgcgc ccatctacac caacgtaacc 481 tatcccatta cggtcaatcc gccgtttgtt cccacggaga atccgacggg ttgttactcg 541 ctcacattta atgttgatga aagctggcta caggaaggcc agacgcgaat tatttttgat 601 ggcgttaact cggcgtttca tctgtggtgc aacgggcgct gggtcggtta cggccaggac 661 agtcgtttgc cgtctgaatt tgacctgagc gcatttttac gcgccggaga aaaccgcctc 721 gcggtgatgg tgctgcgttg gagtgacggc agttatctgg aagatcagga tatgtggcgg 781 atgagcggca ttttccgtga cgtctcgttg ctgcataaac cgactacaca aatcagcgat 841 ttccatgttg ccactcgctt taatgatgat ttcagccgcg ctgtactgga ggctgaagtt 901 cagatgtgcg gcgagttgcg tgactaccta cgggtaacag tttctttatg gcagggtgaa 961 acgcaggtcg ccagcggcac cgcgcctttc ggcggtgaaa ttatcgatga gcgtggtggt 1021 tatgccgatc gcgtcacact acgtctgaac gtcgaaaacc cgaaactgtg gagcgccgaa 1081 atcccgaatc tctatcgtgc ggtggttgaa ctgcacaccg ccgacggcac gctgattgaa 1141 gcagaagcct gcgatgtcgg tttccgcgag gtgcggattg aaaatggtct gctgctgctg 1201 aacggcaagc cgttgctgat tcgaggcgtt aaccgtcacg agcatcatcc tctgcatggt 1261 caggtcatgg atgagcagac gatggtgcag gatatcctgc tgatgaagca gaacaacttt 1321 aacgccgtgc gctgttcgca ttatccgaac catccgctgt ggtacacgct gtgcgaccgc 1381 tacggcctgt atgtggtgga tgaagccaat attgaaaccc acggcatggt gccaatgaat 1441 cgtctgaccg atgatccgcg ctggctaccg gcgatgagcg aacgcgtaac gcgaatggtg 1501 cagcgcgatc gtaatcaccc gagtgtgatc atctggtcgc tggggaatga atcaggccac 1561 ggcgctaatc acgacgcgct gtatcgctgg atcaaatctg tcgatccttc ccgcccggtg 1621 cagtatgaag gcggcggagc cgacaccacg gccaccgata ttatttgccc gatgtacgcg 1681 cgcgtggatg aagaccagcc cttcccggct gtgccgaaat ggtccatcaa aaaatggctt 1741 tcgctacctg gagagacgcg cccgctgatc ctttgcgaat acgcccacgc gatgggtaac 1801 agtcttggcg gtttcgctaa atactggcag gcgtttcgtc agtatccccg tttacagggc 1861 ggcttcgtct gggactgggt ggatcagtcg ctgattaaat atgatgaaaa cggcaacccg 1921 tggtcggctt acggcggtga ttttggcgat acgccgaacg atcgccagtt ctgtatgaac 1981 ggtctggtct ttgccgaccg cacgccgcat ccagcgctga cggaagcaaa acaccagcag 2041 cagtttttcc agttccgttt atccgggcaa accatcgaag tgaccagcga atacctgttc 2101 cgtcatagcg ataacgagct cctgcactgg atggtggcgc tggatggtaa gccgctggca 2161 agcggtgaag tgcctctgga tgtcgctcca caaggtaaac agttgattga actgcctgaa 2221 ctaccgcagc cggagagcgc cgggcaactc tggctcacag tacgcgtagt gcaaccgaac 2281 gcgaccgcat ggtcagaagc cgggcacatc agcgcctggc agcagtggcg tctggcggaa 2341 aacctcagtg tgacgctccc cgccgcgtcc cacgccatcc cgcatctgac caccagcgaa 2401 atggattttt gcatcgagct gggtaataag cgttggcaat ttaaccgcca gtcaggcttt 2461 ctttcacaga tgtggattgg cgataaaaaa caactgctga cgccgctgcg cgatcagttc 2521 acccgtgcac cgctggataa cgacattggc gtaagtgaag cgacccgcat tgaccctaac 2581 gcctgggtcg aacgctggaa ggcggcgggc cattaccagg ccgaagcagc gttgttgcag 2641 tgcacggcag atacacttgc tgatgcggtg ctgattacga ccgctcacgc gtggcagcat 2701 caggggaaaa ccttatttat cagccggaaa acctaccgga ttgatggtag tggtcaaatg 2761 gcgattaccg ttgatgttga agtggcgagc gatacaccgc atccggcgcg gattggcctg 2821 aactgccagc tggcgcaggt agcagagcgg gtaaactggc tcggattagg gccgcaagaa 2881 aactatcccg accgccttac tgccgcctgt tttgaccgct gggatctgcc attgtcagac 2941 atgtataccc cgtacgtctt cccgagcgaa aacggtctgc gctgcgggac gcgcgaattg 3001 aattatggcc cacaccagtg gcgcggcgac ttccagttca acatcagccg ctacagtcaa 3061 cagcaactga tggaaaccag ccatcgccat ctgctgcacg cggaagaagg cacatggctg 3121 aatatcgacg gtttccatat ggggattggt ggcgacgact cctggagccc gtcagtatcg 3181 gcggaattcc aactgagcgc cggtcgctac cattaccaac ttgtctggtg tcaaaaataa 3241 taggcgaaac aaatcatctg acaccaccac cgtctgatgg atcgttctca tctccgtctc 3301 cacattatta tccgacgact acatcgacac cgaatcgaat ggaaacaagt ccggagtaca 3361 tgtttaacca tgaaatggtg ggtagatgat tattaaaatg tttaagaaaa ttaaataatt 3421 tgttttaggc accaccggtc aatgcgatgt ggtatactac accacctcct tatcaagatc 3481 caaactatcg tcatgtgcct ccaaatactg catttcaaaa tgcagagcaa atgaatggct 3541 ccttctactg ttaatctatt taattcatta atttttcatt tattgactgt atcccggatg 3601 tttcttgtcc tcccaacata tctcctaact gctcggttca ttttaaatat gctcatctca 3661 ctacatcacc cagacactgg tccccacaga gttttttgta tactatttcg ggtcattttt 3721 cttattctag actaatattg taagctataa gttgtagaat aattattgat ccaaatcaga 3781 ttaagagtat aagctttgtt ttttctcctt ttctttataa cttgttacaa tttttgaaat 3841 tccctttttt gacaggcttt tattacactg taactgtgtt tcttatcttg caaacattta 3901 atgaattgta attctttagt atcttgaggg ctttttgttt ttcgaattat tgaagctcaa 3961 agttccagtt ttactacgat ccagcgaatt ctcctcattt cgatccgatg caattgactt 4021 cagatcaata ttggttgcct gaaagaaata attgtgagca tttttgtcaa aaaacagaga 4081 actcaccatt ctcgaggctc ccgttccagg agcagtactt ggtgatggac acgtagattg 4141 attaaaccaa accaaaggtt ctttcagagt caacttacag cctcgagcgt agtccgtgat 4201 agcttctcgc agaacactga aaattggaaa tttattggaa taaaaacttt ttctgcactt 4261 tatagaataa aaaaatcatg aatttacccg aatttaacct ccgaatcgta gaccaaattg 4321 tccaagtaga tggaaatcac cttgaacatc ggatgttttt catatgctga aaataaatta 4381 atgaatttat gtaatttttt aaataattac ttttcaattt ggtgaacaat tcctgcttct 4441 ttgcataggc atctggacga gtgagtcctt tccaatcaat caatgtggtg tcgacctcga 4501 gggggggccc ggtacccagc ttttgttccc tttagtgagg gttaattccg agcttggcgt 4561 aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca 4621 taggagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagg taactcacat 4681 taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt 4741 aatgaatcgg ccaacgccta ctagtcggcc gtacgggccc tttcgtctcg cgcgtttcgg 4801 tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta 4861 agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg 4921 gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg 4981 tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcggcctt aagggcctcg 5041 tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg 5101 gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 5161 atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 5221 agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 5281 ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 5341 gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 5401 gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 5461 tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 5521 acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 5581 aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 5641 cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 5701 gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 5761 cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 5821 tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 5881 tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 5941 ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 6001 tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 6061 gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 6121 ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 6181 tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 6241 agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 6301 aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 6361 cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 6421 agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 6481 tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 6541 gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 6601 gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcat tgagaaagcg 6661 ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 6721 gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 6781 ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 6841 ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 6901 acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 6961 gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 7021 cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 7081 gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 7141 gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 7201 gtggaattgt gagcggataa caatttcaca caggaaacag ct // LOCUS SYNLACZK 6620 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD34.110. ACCESSION M34306 KEYWORDS lacZ. SOURCE Cloning vector pPD34.110. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6620) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 237 238 synthetic end/E.coli trpS start recomb 321 322 E.coli trpS end/synthetic start recomb 325 326 synthetic end/E.coli lacZ start recomb 3319 3320 E.coli lacZ end/synthetic start recomb 4154 4155 synthetic end/pUC19 start recomb 4403 4404 pUC19 end/synthetic start recomb 3379 3380 synthetic end/unknown DNA start recomb 4134 4135 unknown DNA end/synthetic start recomb 4414 4415 synthetic end/pUC19 start BASE COUNT 1592 a 1665 c 1731 g 1632 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc tcgagaaagc tggcaaaggg ctcttgtcct gctaatcgta 181 ctactcttca tcgtcatctt cgttattact gttttgttcg tcataagatc taacaaggta 241 ccggtgggtg aagaccagaa acagcacctc gaactgagcc gcgatattgc ccagcgtttc 301 aacgcgctgt atggcgagat cgatcccgtc gttttacaac gtcgtgactg ggaaaaccct 361 ggcgttaccc aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc 421 gaagaggccc gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggcgc 481 tttgcctggt ttccggcacc agaagcggtg ccggaaagct ggctggagtg cgatcttcct 541 gaggccgata ctgtcgtcgt cccctcaaac tggcagatgc acggttacga tgcgcccatc 601 tacaccaacg taacctatcc cattacggtc aatccgccgt ttgttcccac ggagaatccg 661 acgggttgtt actcgctcac atttaatgtt gatgaaagct ggctacagga aggccagacg 721 cgaattattt ttgatggcgt taactcggcg tttcatctgt ggtgcaacgg gcgctgggtc 781 ggttacggcc aggacagtcg tttgccgtct gaatttgacc tgagcgcatt tttacgcgcc 841 ggagaaaacc gcctcgcggt gatggtgctg cgttggagtg acggcagtta tctggaagat 901 caggatatgt ggcggatgag cggcattttc cgtgacgtct cgttgctgca taaaccgact 961 acacaaatca gcgatttcca tgttgccact cgctttaatg atgatttcag ccgcgctgta 1021 ctggaggctg aagttcagat gtgcggcgag ttgcgtgact acctacgggt aacagtttct 1081 ttatggcagg gtgaaacgca ggtcgccagc ggcaccgcgc ctttcggcgg tgaaattatc 1141 gatgagcgtg gtggttatgc cgatcgcgtc acactacgtc tgaacgtcga aaacccgaaa 1201 ctgtggagcg ccgaaatccc gaatctctat cgtgcggtgg ttgaactgca caccgccgac 1261 ggcacgctga ttgaagcaga agcctgcgat gtcggtttcc gcgaggtgcg gattgaaaat 1321 ggtctgctgc tgctgaacgg caagccgttg ctgattcgag gcgttaaccg tcacgagcat 1381 catcctctgc atggtcaggt catggatgag cagacgatgg tgcaggatat cctgctgatg 1441 aagcagaaca actttaacgc cgtgcgctgt tcgcattatc cgaaccatcc gctgtggtac 1501 acgctgtgcg accgctacgg cctgtatgtg gtggatgaag ccaatattga aacccacggc 1561 atggtgccaa tgaatcgtct gaccgatgat ccgcgctggc taccggcgat gagcgaacgc 1621 gtaacgcgaa tggtgcagcg cgatcgtaat cacccgagtg tgatcatctg gtcgctgggg 1681 aatgaatcag gccacggcgc taatcacgac gcgctgtatc gctggatcaa atctgtcgat 1741 ccttcccgcc cggtgcagta tgaaggcggc ggagccgaca ccacggccac cgatattatt 1801 tgcccgatgt acgcgcgcgt ggatgaagac cagcccttcc cggctgtgcc gaaatggtcc 1861 atcaaaaaat ggctttcgct acctggagag acgcgcccgc tgatcctttg cgaatacgcc 1921 cacgcgatgg gtaacagtct tggcggtttc gctaaatact ggcaggcgtt tcgtcagtat 1981 ccccgtttac agggcggctt cgtctgggac tgggtggatc agtcgctgat taaatatgat 2041 gaaaacggca acccgtggtc ggcttacggc ggtgattttg gcgatacgcc gaacgatcgc 2101 cagttctgta tgaacggtct ggtctttgcc gaccgcacgc cgcatccagc gctgacggaa 2161 gcaaaacacc agcagcagtt tttccagttc cgtttatccg ggcaaaccat cgaagtgacc 2221 agcgaatacc tgttccgtca tagcgataac gagctcctgc actggatggt ggcgctggat 2281 ggtaagccgc tggcaagcgg tgaagtgcct ctggatgtcg ctccacaagg taaacagttg 2341 attgaactgc ctgaactacc gcagccggag agcgccgggc aactctggct cacagtacgc 2401 gtagtgcaac cgaacgcgac cgcatggtca gaagccgggc acatcagcgc ctggcagcag 2461 tggcgtctgg cggaaaacct cagtgtgacg ctccccgccg cgtcccacgc catcccgcat 2521 ctgaccacca gcgaaatgga tttttgcatc gagctgggta ataagcgttg gcaatttaac 2581 cgccagtcag gctttctttc acagatgtgg attggcgata aaaaacaact gctgacgccg 2641 ctgcgcgatc agttcacccg tgcaccgctg gataacgaca ttggcgtaag tgaagcgacc 2701 cgcattgacc ctaacgcctg ggtcgaacgc tggaaggcgg cgggccatta ccaggccgaa 2761 gcagcgttgt tgcagtgcac ggcagataca cttgctgatg cggtgctgat tacgaccgct 2821 cacgcgtggc agcatcaggg gaaaacctta tttatcagcc ggaaaaccta ccggattgat 2881 ggtagtggtc aaatggcgat taccgttgat gttgaagtgg cgagcgatac accgcatccg 2941 gcgcggattg gcctgaactg ccagctggcg caggtagcag agcgggtaaa ctggctcgga 3001 ttagggccgc aagaaaacta tcccgaccgc cttactgccg cctgttttga ccgctgggat 3061 ctgccattgt cagacatgta taccccgtac gtcttcccga gcgaaaacgg tctgcgctgc 3121 gggacgcgcg aattgaatta tggcccacac cagtggcgcg gcgacttcca gttcaacatc 3181 agccgctaca gtcaacagca actgatggaa accagccatc gccatctgct gcacgcggaa 3241 gaaggcacat ggctgaatat cgacggtttc catatgggga ttggtggcga cgactcctgg 3301 agcccgtcag tatcggcgga attccaactg agcgccggtc gctaccatta ccaacttgtc 3361 tggtgtcaaa aataataggg gccgctgtca tcagatcgcc atctcgcgcc cgtgcctctg 3421 acttctaagt ccaattactc ttcaacatcc ctacatgctc tttctccctg tgctcccacc 3481 ccctattttt gttattatca aaaaaacttc ttcttaattt ctttgttttt tagcttcttt 3541 taagtcacct ctaacaatga aattgtgtag attcaaaaat agaattaatt cgtaataaaa 3601 agtcgaaaaa aattgtgctc cctcccccca ttaataataa ttctatccca aaatctacac 3661 aatgttctgt gtacacttct tatgtttttt ttacttctga taaatttttt ttgaaacatc 3721 atagaaaaaa ccgcacacaa aataccttat catatgttac gtttcagttt atgaccgcaa 3781 tttttatttc ttcgcacgtc tgggcctctc atgacgtcaa atcatgctca tcgtgaaaaa 3841 gttttggagt atttttggaa tttttcaatc aagtgaaagt ttatgaaatt aattttcctg 3901 cttttgcttt ttgggggttt cccctattgt ttgtcaagag tttcgaggac ggcgtttttc 3961 ttgctaaaat cacaagtatt gatgagcacg atgcaagaaa gatcggaaga aggtttgggt 4021 ttgaggctca gtggaaggtg agtagaagtt gataatttga aagtggagta gtgtctatgg 4081 ggtttttgcc ttaaatgaca gaatacattc ccaatatacc aaacataact gtttcctact 4141 agtcggccgt acgggccctt tcgtctcgcg cgtttcggtg atgacggtga aaacctctga 4201 cacatgcagc tcccggagac ggtcacagct tgtctgtaag cggatgccgg gagcagacaa 4261 gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg gctggcttaa ctatgcggca 4321 tcagagcaga ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta 4381 aggagaaaat accgcatcag gcggccttaa gggcctcgtg atacgcctat ttttataggt 4441 taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg 4501 cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 4561 ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt 4621 ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga 4681 aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga 4741 actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat 4801 gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca 4861 agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt 4921 cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac 4981 catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct 5041 aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga 5101 gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac 5161 aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat 5221 agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg 5281 ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc 5341 actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc 5401 aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg 5461 gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta 5521 atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg 5581 tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 5641 tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 5701 ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 5761 agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa 5821 ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 5881 tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 5941 gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 6001 cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg aagggagaaa 6061 ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 6121 agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 6181 tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 6241 ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 6301 ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 6361 ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 6421 accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 6481 ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc attaggcacc 6541 ccaggcttta cactttatgc ttccggctcg tatgttgtgt ggaattgtga gcggataaca 6601 atttcacaca ggaaacagct // LOCUS SYNLACZL 5808 bp ds-DNA SYN 17-JUL-1990 DEFINITION Cloning vector pPD16.01. ACCESSION M34307 KEYWORDS lacZ. SOURCE Cloning vector pPD16.01. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 5808) AUTHORS Fire,A.Z., Harrison,S. and Dixon,D. TITLE A modular set of lac-Z fusion vectors for studying gene expression in C.elegans JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Z.Fire, 11-MAY-1990. Author address: A.Z.Fire Carnegie Inst of Washington Dept Embryology 115 West Univ Parkway Baltimore, MD 21210 email: AZF@JHUIGF.BITNET FEATURES from to/span description recomb 57 58 pUC19 end/synthetic start recomb 180 181 synthetic end/E.coli trpS start recomb 264 265 E.coli trpS end/synthetic start recomb 268 269 synthetic end/E.coli lacZ start recomb 3262 3263 E.coli lacZ end/synthetic start recomb 3342 3343 synthetic end/unknown DNA start recomb 3591 3592 unknown DNA end/synthetic start recomb 3602 3603 synthetic end/pUC19 start BASE COUNT 1375 a 1502 c 1599 g 1332 t ORIGIN 1 atgaccatga ttacgccaag cttgcatgcc tgcaggtcga ctctagagga tccccgggat 61 tggccaaagg acccaaaggt atgtttcgaa tgatactaac ataacataga acattttcag 121 gaggaccctt ggagggtacc gagctcagaa aaaatgactg ctccaaagaa gaagcgtaag 181 gtaccggtgg gtgaagacca gaaacagcac ctcgaactga gccgcgatat tgcccagcgt 241 ttcaacgcgc tgtatggcga gatcgatccc gtcgttttac aacgtcgtga ctgggaaaac 301 cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat 361 agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg 421 cgctttgcct ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt 481 cctgaggccg atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc 541 atctacacca acgtaaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat 601 ccgacgggtt gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag 661 acgcgaatta tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg 721 gtcggttacg gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc 781 gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa 841 gatcaggata tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg 901 actacacaaa tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct 961 gtactggagg ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt 1021 tctttatggc agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt 1081 atcgatgagc gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg 1141 aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc 1201 gacggcacgc tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa 1261 aatggtctgc tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag 1321 catcatcctc tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg 1381 atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg 1441 tacacgctgt gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac 1501 ggcatggtgc caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa 1561 cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg 1621 gggaatgaat caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc 1681 gatccttccc gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt 1741 atttgcccga tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg 1801 tccatcaaaa aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac 1861 gcccacgcga tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag 1921 tatccccgtt tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat 1981 gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat 2041 cgccagttct gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg 2101 gaagcaaaac accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg 2161 accagcgaat acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg 2221 gatggtaagc cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag 2281 ttgattgaac tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta 2341 cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag 2401 cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg 2461 catctgacca ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt 2521 aaccgccagt caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg 2581 ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg 2641 acccgcattg accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc 2701 gaagcagcgt tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc 2761 gctcacgcgt ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt 2821 gatggtagtg gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat 2881 ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc 2941 ggattagggc cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg 3001 gatctgccat tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc 3061 tgcgggacgc gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac 3121 atcagccgct acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg 3181 gaagaaggca catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc 3241 tggagcccgt cagtatcggc ggaattccaa ctgagcgccg gtcgctacca ttaccaactt 3301 gtctggtgtc aaaaataata ggcctactag tcggccgtac gggccctttc gtctcgcgcg 3361 tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg 3421 tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg 3481 gtgtcggggc tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat 3541 gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc ggccttaagg 3601 gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt 3661 caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 3721 attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 3781 aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 3841 tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 3901 agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 3961 gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 4021 cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc 4081 agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 4141 taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 4201 tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 4261 taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 4321 acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 4381 ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac 4441 cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 4501 agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 4561 tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg 4621 agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac 4681 tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg 4741 ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 4801 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 4861 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 4921 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 4981 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 5041 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 5101 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 5161 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag 5221 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 5281 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 5341 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 5401 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 5461 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 5521 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 5581 aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt 5641 aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta 5701 atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta 5761 tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct // LOCUS MUSTCAXL 331 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma LD1. ACCESSION M34194 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 331 T-cell receptor alpha-chain (AA at 1) BASE COUNT 71 a 86 c 89 g 85 t ORIGIN 1 cagtcagtga cgcagcccga tgctcgtgtc actgtctctg aaggagcctc tctgcagctg 61 agatgcaagt attcctcctc tgtgacacct tatctgttct ggtatgtcct gtacccgcgg 121 caggggctgc agctgctcct caagtactat tccggagacc cagtggttca aggagtgaat 181 ggctttgagg ctgagttcag caagagtaac tcttccttcc acctgcggaa agcctccgtg 241 cactggagcg actcggctgt gtacttctgt gctgtgagca tggatggaaa tgagaaaata 301 acttttgggg ctggaaccaa actcaccatt a // LOCUS MUSTCAXM 334 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma LD3. ACCESSION M34196 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 334 T-cell receptor alpha-chain (AA at 1) BASE COUNT 97 a 84 c 75 g 78 t ORIGIN 1 cagcaggtga gacaaagtcc ccaatctctg acagtctggg aaggagagac agcaattctg 61 aactgcagtt atgaggacag cacttttgac tacttcccat ggtaccgtct gttccctggg 121 gaaagccctg cactcctgat agccatacgt ccagtgtcca ataaaaagga agatggacga 181 ttcacaatct tcttcaataa aagggagaaa aagctctcct tgcacatcac agactctcag 241 cctggagact cagctaccta cttctgtgca gcaagaagta caggctttgc aagtgcgctg 301 acatttggat ctggcacaaa agtcattgtt ctac // LOCUS MUSTCAXN 327 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-14. ACCESSION M34198 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 327) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 327 T-cell receptor alpha-chain (AA at 1) BASE COUNT 92 a 80 c 85 g 70 t ORIGIN 1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata 61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga 121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg 181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa 241 gagtcagact cggctgtgta ctactgtgct ctgagtgatc agcgggggaa gcttatcttt 301 ggacagggaa ccaagttatc tatcaag // LOCUS MUSTCAXO 324 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-27. ACCESSION M34200 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 324 T-cell receptor alpha-chain (AA at 1) BASE COUNT 92 a 80 c 83 g 69 t ORIGIN 1 gactcagtga ctcagacgga aggtcaagtg gccctctcag aagaggactt tcttacgata 61 cactgcaact actcagcctc agggtaccca gctctgttct ggtatgtgca gtatcccgga 121 gaagggccac agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggg 181 tttgaagcca catacaataa agaagccacc tccttccact tgcagaaagc ctcagtgcaa 241 gagtcagact cggctgtgta ctactgtgct ctgaggagca actatcagtt gatctggggc 301 tctgggacca agctaattat aaag // LOCUS MUSTCAXP 297 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-6. ACCESSION M34202 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 297) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 297 T-cell receptor alpha-chain (AA at 1) BASE COUNT 85 a 73 c 74 g 65 t ORIGIN 1 gtgacattat ctgaaggaac ttctctgact gtgaactgtt cctatgaaac caaacagtac 61 ccaaccctgt tctggtatgt gcagtatccc ggagaaggtc cacagctcct ctttaaagtc 121 ccaaaggcca acgagaaggg aagcagcaga gggtttgaag ccacatacaa taaagaagcc 181 acctccttcc acttgcagaa agcctcagtg caagagtcag actcggctgt gtactactgt 241 gctctgagtg atcgggggac caatacaggc aaattaacct ttggggatgg gaccgtg // LOCUS MUSTCAXQ 193 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, T-cell clone V2.1. ACCESSION M34204 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 193) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 193 T-cell receptor alpha-chain (AA at 2) BASE COUNT 56 a 45 c 49 g 43 t ORIGIN 1 cctctttaaa gtcccaaagg ccaacgagaa gggaagcagc agagggtttg aagccacata 61 caataaagaa gccacctcct tccacttgca gaaagcctca gtgcaagagt cagactcggc 121 tgtgtactac tgtgctctga gtggaggcaa taataagctg acttttggtc aaggaaccgt 181 tctgagtgtt ctg // LOCUS MUSTCAXR 333 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-33. ACCESSION M34206 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 333) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 333 T-cell receptor alpha-chain (AA at 1) BASE COUNT 84 a 96 c 78 g 74 t 1 others ORIGIN 1 gactccgtga cccagacaga aggcctggtc actgtcaccg aggggttgcc tgtgaagctg 61 aactgcacct atcagactac ttatttaact attgcctttt tctggtatgt gcaatatctc 121 aacgaagccc ctcaggtact cctgcggagc tccacagaca acaagaggac cgagcaccaa 181 gggttccacg ccactctcna taagagcagc agctccttcc atctgcagaa gtcctcagcg 241 cagctgtcag actctgccct gtactactgt gctctgagga atacaggagg tgcagataga 301 ctcacctttg ggaaaggaac tcagctgatc atc // LOCUS MUSTCAXS 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma MT1-7. ACCESSION M34208 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 339 T-cell receptor alpha-chain (AA at 1) BASE COUNT 86 a 103 c 75 g 75 t ORIGIN 1 gactcagtga cccagacaga aggcctggtc actctcaccg aggggttgcc tgtgatgctg 61 aactgcacct atcagactgc ttactcaact ttccttttct ggtatgtgca acatctcaat 121 gaagccccta aactactcct gaagagctcc acagacaaca agaggaccga gcaccaaggg 181 ttccacgcca ctctccataa gagcagcagc tccttccatc tgcagaagtc ctcagcgcag 241 ctgtcagact ctgccctgta ctactgtgct ctgagtgata agactggagc taacactgga 301 aagctcacgt ttggacacgg caccatcctt agggtccat // LOCUS MUSTCAXT 342 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma P1F12C4. ACCESSION M34210 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 342) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 342 T-cell receptor alpha-chain (AA at 1) BASE COUNT 80 a 102 c 79 g 81 t ORIGIN 1 gactccgtga cccagacaga aggcctggtc actctcaacg aggggttgcc tgtgatgctg 61 aactgcacct atcagactat ttactcaaat gctttccttt tctggtatgt gcactatctc 121 aatgaatccc cttggctact cctgcggagc tccacagaca acaagaggac cgagcaccaa 181 gggttccacg ccactctcca taagagcagc agctccttcc atctgcagaa gtcctcagcg 241 cagctgtcag actctgccct gtactactgt gctttgagtg agaggtctgg agctaacact 301 ggaaagctca cgtttggaca cggcaccatc cttagggtcc at // LOCUS MUSTCAXU 324 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma P1D3A6. ACCESSION M34212 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 324 T-cell receptor alpha-chain (AA at 1) BASE COUNT 82 a 80 c 81 g 81 t ORIGIN 1 cagcaagtgc agcagagccc cgcgtccttg gttctgcagg agggggagaa tgcagagctg 61 cagtgtaact tttccacatc tttgaacagt atgcagtggt tttaccaacg tcctgaggga 121 agtctcgtca gcctgttcta caatccttct gggacaaagc agagtgggag actgacatcc 181 acaacagtca tcaaagaacg tcgcagctct ttgcacattt cctcctccca gatcacagac 241 tcaggcactt atctctgtgc tatggaggct actggaggca ataataagct gacttttggt 301 caaggaaccg ttctgagtgt tata // LOCUS MUSTCAXV 210 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 1E1O. ACCESSION M34214 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 210 T-cell receptor alpha-chain (AA at 1) BASE COUNT 54 a 53 c 50 g 53 t ORIGIN 1 gggggaagtc tcgtcagcct gttctacaat ccttctggga caaagcagag tgggagactg 61 acatccacta cagtcatcaa agaacgtcgc agctctttgc acatttcctc ctcccagaca 121 acagactcag gcacttatct ctgtgctatg gcggctactg gaggcaataa taagctgact 181 tttggtcaag gaaccgttct gagtgttata // LOCUS MUSTCAXW 234 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 7/6AH1. ACCESSION M34216 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 234) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 234 T-cell receptor alpha-chain (AA at 1) BASE COUNT 55 a 59 c 58 g 62 t ORIGIN 1 atgcagtggt tttatcaacg tcctggggga agtctcgtca gcctgttcta caatccttct 61 gggacaaagc agagtgggag actgacatcc actacagtca tcaaagaacg tcgcagctct 121 ttgcacattt cctcctccca gacaacagac tcaggcactt atctctgtgc tatgggtgta 181 tctggtagct tcaataagtt gacctttgga gcagggacca gactggctgt gtgc // LOCUS MUSTCAXX 312 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active alpha-chain mRNA V-region, partial cds, from hybridoma 2B11. ACCESSION M34218 KEYWORDS T-cell receptor; T-cell receptor alpha-chain; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 312 T-cell receptor alpha-chain (AA at 1) BASE COUNT 89 a 79 c 79 g 64 t 1 others ORIGIN 1 aatccgtggg ccctgagngt ccacgagggt gaaagtgtca cggtgaattg tagttacaag 61 acatccataa ctgccctaca gtggtacaga cagaagtcag gcgaaggccc tgcccagcta 121 atcttaatac gttcaaatga gagagagaag cgcaatggaa gactcagagc cacccttgac 181 acctccagcc agagcagctc cttgtccatc actgctactc ggtgtgaaga caccgctgtg 241 tacttctgtg ctactgagac aggcaatact agaaaacaca tctttgggct ggggacaact 301 ttgcaagtgc aa // LOCUS MUSTCBYAO 153 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma LD1. ACCESSION M34195 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 153) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 153 T-cell receptor beta-chain (AA at 1) recomb 122 123 V-region end/J-region start BASE COUNT 39 a 37 c 36 g 41 t ORIGIN 1 caaataggag atgtccctga tgggtacaag gccaccagaa caacgcaaga agacttcttc 61 ctcctgctgg aattggcttc tccctctcag acatctttgt acttctgtgc cagcagtgta 121 ggttctggaa atacgctcta ttttggagaa gga // LOCUS MUSTCBYAP 111 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma LD3. ACCESSION M34197 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma LD3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 111) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 111 T-cell receptor beta-chain (AA at 1) recomb 73 74 V-region end/J-region start BASE COUNT 18 a 37 c 25 g 31 t ORIGIN 1 ttcctcctgc tggaattggc ttctccctct cagacatctt tgtacttctg tgccgcgtcc 61 ccgacaggga acaccgacta caccttcggc tcagggacca ggcttttggt a // LOCUS MUSTCBYAQ 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-14. ACCESSION M34199 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-14, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 321 T-cell receptor beta-chain (AA at 1) recomb 292 293 V-region end/J-region start BASE COUNT 106 a 81 c 61 g 73 t ORIGIN 1 gacccgaaaa ttatccagaa accaaaatat ctggtggcag tcacagggag cgaaaaaatc 61 ctgatatgcg aacagtatct aggccacaat gctatgtatt ggtatagaca aagtgctaag 121 aagcctctag agttcatgtt ttcctacagc tatcaaaaac ttatggacaa tcagactgcc 181 tcaagtcgct tccaacctca aagttcaaag aaaaaccatt tagaccttca gatcacagct 241 ctaaagcctg atgactcggc cacatacttc tgtgccagca gccccaagac acgtcaaaac 301 accttgtact ttggtgcggg c // LOCUS MUSTCBYAR 210 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-27. ACCESSION M34201 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-27, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 210 T-cell receptor beta-chain (AA at 1) recomb 167 168 V-region end/J-region start BASE COUNT 57 a 56 c 43 g 54 t ORIGIN 1 aagattatgt ttagctacaa taataagcaa ctcattgtaa acgaaacagt tccaaggcgc 61 ttctcacctc agtcttcaga taaagctcat ttgaatcttc gaatcaagtc tgtagagccg 121 gaggactctg ctgtgtatct ctgtgccagc agctatcgga caccccccta tgctgagcag 181 ttcttcggac cagggacacg actcaccgtc // LOCUS MUSTCBYAS 255 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-6. ACCESSION M34203 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 255) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 255 T-cell receptor beta-chain (AA at 1) recomb 219 220 V-region end/J-region start BASE COUNT 57 a 68 c 77 g 53 t ORIGIN 1 cagtatccct ggatgagctg gtatcagcag gatctccaaa agcaactaca gtggctgttc 61 actctgcgga gtcctgggga caaagaggtc aaatctcttc ccggtgctga ttacctggcc 121 acacgggtca ctgatacgga gctgaggctg caagtggcca acatgagcca gggcagaacc 181 ttgtactgca cctgcagtgc ggggactggg ggggctacta acaccttgta ctttggtgcg 241 ggcacccgac tatcg // LOCUS MUSTCBYAT 213 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, T-cell clone V2.1. ACCESSION M34205 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell lymphoid clone V2.1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 213) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 213 T-cell receptor beta-chain (AA at 1) recomb 171 172 V-region end/J-region start BASE COUNT 48 a 59 c 58 g 48 t ORIGIN 1 ctgaggctga tccattattc atatggtgct ggcagcactg agaaaggaga tatccctgat 61 ggatacaagg cctccagacc aagccaagag aacttctccc tcattctgga gttggctacc 121 ccctctcaga catcagtgta cttctgtgcc agcggtggcg gccgggggag ttatgctgag 181 cagttcttcg gaccagggac acgactcacc gtc // LOCUS MUSTCBYAU 207 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-33. ACCESSION M34207 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-33, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 207) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 207 T-cell receptor beta-chain (AA at 1) recomb 163 164 V-region end/J-region start BASE COUNT 65 a 54 c 36 g 52 t ORIGIN 1 ctagagttca tgttttccta cagctatcaa aaacttatgg acaatcagac tgcctcaagt 61 cgcttccaac ctcaaagttc aaagaaaaac catttagacc ttcagatcac agctctaaag 121 cctgatgact cggccacata cttctgtgcc agcagcaaaa gggccaacga aagattattt 181 ttcggtcatg gaaccaagct gtctgtc // LOCUS MUSTCBYAV 156 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma MT1-7. ACCESSION M34209 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma MT1-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 156) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 156 T-cell receptor beta-chain (AA at 1) recomb 118 119 V-region end/J-region start BASE COUNT 30 a 48 c 40 g 38 t ORIGIN 1 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg 61 gcttctccct ctcagacatc tttgtacttc tgtgccagca gtgtccgggt ctgggggcct 121 gaacagtact tcggtcccgg caccaggctc acggtt // LOCUS MUSTCBYAW 132 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma P1F12C4. ACCESSION M34211 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1F12C4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 132) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 132 T-cell receptor beta-chain (AA at 1) recomb 94 95 V-region end/J-region start BASE COUNT 35 a 39 c 28 g 30 t ORIGIN 1 ccaagccaag agaacttctc cctcattctg gagttggcta ccccctctca gacatcagtg 61 tacttctgtg ccagcggtgc cagacaggca aacacagaag tcttctttgg taaaggaacc 121 agactcacag tt // LOCUS MUSTCBYAX 303 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma P1D3A6. ACCESSION M34213 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma P1D3A6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 303 T-cell receptor beta-chain (AA at 1) recomb 259 260 V-region end/J-region start BASE COUNT 80 a 72 c 79 g 72 t ORIGIN 1 aaggtgacag taacaggagg aaacgtgaca ttgagctgtc gccagactaa tagccacaac 61 tacatgtact ggtatcggca ggacactggg catgggctga ggctgatcca ttactcatat 121 ggtgctggca accttcaaat aggagatgtc cctgatgggt acaaggccac cagaacaacg 181 caagaagact tcttcctcct gctggaattg gcttctccct ctcagacatc tttgtacttc 241 tgtgccagca gtgcaggagc tggaaatacg ctctattttg gagaaggaag ccggctcatt 301 gtt // LOCUS MUSTCBYAY 159 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 1E1O. ACCESSION M34215 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 1E1O, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 159) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 159 T-cell receptor beta-chain (AA at 1) recomb 113 114 V-region end/J-region start BASE COUNT 35 a 40 c 39 g 45 t ORIGIN 1 gatgtccctg atgggtacaa ggccaccaga acaacgcaag aagacttctt cctcctgctg 61 gaattggctt ctccctctca gacatctttg tacttctgtg ccagcagtgt gggttctgga 121 aatacgctct attttggaga aggaagccgg ctcattgtt // LOCUS MUSTCBYAZ 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 7/6AH1. ACCESSION M34217 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 7/6AH1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 321 T-cell receptor beta-chain (AA at 1) recomb 275 276 V-region end/J-region start BASE COUNT 86 a 78 c 81 g 76 t ORIGIN 1 acccaaagcc ctagaaacaa ggtgacagta acaggaggaa acgtgacatt gagctgtcgc 61 cagactaata gccacaacta catgtactgg tatcggcagg acactgggca tgggctgagg 121 ctgatccatt actcatatgg tgctggcaac cttcaaatag gagatgtccc tgatgggtac 181 aaggccacca gaacaacgca agaagacttc ttcctcctgc tggaattggc ttctccctct 241 cagacatctt tgtacttctg tgccagcagt gtgggttctg gaaatacgct ctattttgga 301 gaaggaagcc ggctcattgt t // LOCUS MUSTCBYBA 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse T-cell receptor active beta-chain mRNA V-J-region, partial cds, from hybridoma 2B11. ACCESSION M34219 KEYWORDS T-cell receptor; T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/c) T-cell hybridoma 2B11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Taylor,A.H., Haberman,A.M., Gerhard,W. and Caton,A.J. TITLE Structurally diverse T cells can recognize an influenza antigen/MHC complex in the same common orientation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.H.Taylor, 16-MAY-1990. Author address: A.H.Taylor Wistar Institute of Anatomy and Biology 3601 Spruce Street Philadelphia, PA 19104 Tel: (215) 898-3839 FEATURES from to/span description pept < 1 > 339 T-cell receptor beta-chain (AA at 1) recomb 294 295 V-region end/J-region start BASE COUNT 90 a 88 c 88 g 73 t ORIGIN 1 gaggctgcag tcacccaaag ccctagaaac aaggtgacag taacaggagg aaacgtgaca 61 ttgagctgtc gccagactaa tagccacaac tacatgtact ggtatcggca ggacactggg 121 catgggctga ggctgatcca ttactcatat ggtgctggca accttcaaat aggagatgtc 181 cctgatgggt acaaggccac cagaacaacg caagaagact tcttcctcct gctggaattg 241 gcttctccct ctcagacatc tttgtacttc tgtgccagca ggagacaggg gcctagtcaa 301 aacaccttgt actttggtgc gggcacccga ctatcggtg // LOCUS CEACAEVA 264 bp ds-DNA VRL 17-JUL-1990 DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene, complete cds. ACCESSION M34092 KEYWORDS tat protein. SOURCE Caprine arthritis-encephalitis lentivirus (strain Cork) DNA, from goat synovial membrane, clone pCol.9. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis- encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 08-MAY-1990. Author address: W.P.Cheevers Washington State University Dept Veterinary Microbiology Pullman, WA 94164-7040 email: b384@wsuvmsl.bitnet FEATURES from to/span description pept 1 264 tat protein BASE COUNT 97 a 40 c 80 g 47 t ORIGIN 1 atgagtgaag aactgcctca aagaagggag acacatccag aagaacttgt aaggaacgta 61 cgggaaagag aaagggatac atggcaatgg acaagcatca gagtacctga ggaaatactg 121 caaagatggc ttgctatgct taggtcaggc agaaatagaa agaaagtgta tagagaaatg 181 caaaaatgga tgtggataca tcccaagggg cctgtgatta gggcctgtgg atgcagacta 241 tgtaacccgg ggtggggaac ataa // LOCUS CEACAEVB 264 bp ds-DNA VRL 17-JUL-1990 DEFINITION Caprine arthritis-encephalitis lentivirus tat protein gene, complete cds. ACCESSION M34093 KEYWORDS tat protein. SOURCE Caprine arthritis-encephalitis lentivirus (strain G63) DNA, from goat synovial membrane, clone pC63-49. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 264) AUTHORS Jackson,M.K., Knowles,D.P., Stem,T.A., Harwood,W.G., Robinson,M.M. and Cheevers,W.P. TITLE Genetic structure of the pol-env region of the Caprine arthritis- encephalitis lentivirus genome: Possible role in trans-activation of the viral long terminal repeat JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.P.Cheevers, 08-MAY-1990. Author address: W.P.Cheevers Washington State University Dept Veterinary Microbiology Pullman, WA 94164-7040 email: b384@wsuvmsl.bitnet FEATURES from to/span description pept 1 264 tat protein BASE COUNT 99 a 43 c 75 g 47 t ORIGIN 1 atgagtgaaa gactgcctca aagaagggaa gtacatccag aggaacgtgt aaggaacata 61 tgggaaagag aaagggacac atggcaatgg acaagcatca gagtacctga agaaatactg 121 caaagatggc tcgctatgct taggtcaggc agaaatagaa acaaagtgta tagagaaatg 181 caaaaatgga tgtcgataca tcccaaggcg cctgtgatta ggccttgtgg atgcagacta 241 tgtaacccgg ggtgggaaac ataa // LOCUS FLAPR834HA 540 bp ss-RNA VRL 17-JUL-1990 DEFINITION Influenza A/PR/8/34, hemagglutinin (seg 4) gene. partial cds. ACCESSION M34335 KEYWORDS glycoprotein; hemagglutinin. SOURCE Influenza A/PR/8/34 RNA, passed in bovine MBDK cells, originally from human. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 540) AUTHORS Bressoud,A., Whitcomb,J., Pourzand,C., Haller,O. and Cerutti,P. TITLE Rapid detection of influenza virus H1 by the polymerase chain reaction JOURNAL Biochem. Biophys. Res. Commun. 167, 425-430 (1990) STANDARD simple staff_review COMMENT Sequence reported is + strand. FEATURES from to/span description pept < 1 > 540 hemagglutinin (AA at 2) BASE COUNT 179 a 121 c 123 g 117 t ORIGIN 1 cctactggtc ctgttatgtg cacttgcagc tgcagatgca gacacaatat gtataggcta 61 ccatgcgaac aattcaaccg acactgttga cacagtactc gagaagaatg tgacagtgac 121 acactctgtt aacctgctcg aagacagcca caacggaaaa ctatgtagat taaaaggaat 181 agccccacta caattgggga aatgtaacat cgccggatgg ctcttgggaa acccagaatg 241 cgacccactg cttccagtga gatcatggtc ctacattgta gaaacaccaa actctgagaa 301 tggaatatgt tatccaggag atttcatcga ctatgaggag ctgagggagc aattgagctc 361 agtgtcatca ttcgaaagat tcgaaatatt tcccaaagaa agctcatggc ccaaccacaa 421 cacaaacgga gtaacggcag catgctccca tgaggggaaa agcagttttt acagaaattt 481 gctatggctg acggagaagg agggctcata cccaaagctg aaaaattctt atgtgaacaa // LOCUS HUMGPPSBAA 355 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein C-D gene, intron C1. ACCESSION M34422 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, clone PS-beta-G C. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 355) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description IVS 1 355 pregnancy-specific beta-1 glycoprotein intron C1 BASE COUNT 110 a 64 c 62 g 119 t ORIGIN Chromosome 19. 1 gtaagtggat cccagcatcg ttggcaatag ggttttaggt ggagtctatc tggcattcag 61 agaagagtca ggaaaacaat tgtattccca gcctgtgtcc catgggcaca agcaaatccc 121 aaattctcct cctgaaccct ccaaatttgt ctaagaactt cgaaaacttt aacaaacagg 181 ctgatatctt cataatattc ccagcctaga ccaagcagga agaacattga tttcattgaa 241 ataattgata ataatgaaga taatgttttt atgattttta tttgaaaatt tgctgattct 301 ttaaatggtt tgttttctac attgatggaa tttttctctt ttaatctatc tacag // LOCUS HUMGPPSBD 1418 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M34421 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1418) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE Structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept 79 1359 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 79 180 pregnancy-specific beta-1 glycoprotein signal peptide matp 181 1356 pregnancy-specific beta-1 glycoprotein mRNA 1 1418 pregnancy-specific beta-1 glycoprotein mRNA BASE COUNT 408 a 398 c 296 g 316 t ORIGIN Chromosome 19. 1 cagctgacag ccgtgctcag acagcttctg gatcctaggc tcatctccac agaggagaac 61 acgcaggcag cagagaccat ggggcccctc ccagcccctt cctgcacaca gcgcatcacc 121 tggaaggggc tcctgctcac agcatcactt ttaaacttct ggaacccgcc caccactgcc 181 gaagtcacga ttgaagccca gccacccaaa gtttctgagg ggaaggatgt tcttctactt 241 gtccacaatt tgccccagaa tcttcctggc tacttctggt acaaagggga aatgacggac 301 ctctaccatt acattatatc gtatatagtt gatggtaaaa taattatata tgggcctgca 361 tacagtggaa gagaaacagt atattccaac gcatccctgc tgatccagaa tgtcacccgg 421 aaggatgcag gaacctacac cttacacatc ataaagcgag gtgatgagac tagagaagaa 481 attcgacatt tcaccttcac cttatacttg gagactccca agccctacat ctccagcagc 541 aacttaaacc ccagggaggc catggaggct gtgcgcttaa tctgtgatcc tgagactctg 601 gacgcaagct acctatggtg gatgaatggt cagagcctcc ctgtgactca caggttgcag 661 ctgtccaaaa ccaacaggac cctctatcta tttggtgtca caaagtatat tgcaggaccc 721 tatgaatgtg aaatacggaa cccagtgagt gccagtcgca gtgacccagt caccctgaat 781 ctcctcccga agctgcccat cccctacatc accatcaaca acttaaaccc cagggagaat 841 aaggatgtct tagccttcac ctgtgaacct aagagtgaga actacaccta catttggtgg 901 ctaaacggtc agagcctccc cgtcagtccc ggggtaaagc gacccattga aaacaggata 961 ctcattctac ccagtgtcac gagaaatgaa acaggaccct atcaatgtga aatacgggac 1021 cgatatggtg gcctccgcag taacccagtc atcctaaatg tcctctatgg tccagacctc 1081 cccagaattt acccttcatt cacctattac cgttcaggag aaaacctcga cttgtcctgc 1141 ttcacggaat ctaacccacc ggcagagtat ttttggacaa ttaatgggaa gtttcagcaa 1201 tcaggacaaa agctctttat cccccaaatt actagaaatc atagcgggct ctatgcttgc 1261 tctgttcata actcagccac tggcaaggaa atctccaaat ccatgacagt caaagtctct 1321 ggtccctgcc atggagacct gacagagtct cagtcatgac tgcaacaact gagacactga 1381 gaaaaagaac aggctgatac cttcatgaaa ttcaagac // LOCUS HUMGPPSBE 1856 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M34420 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone PS-beta-G A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1856) AUTHORS Streydio,C., Swillens,S., Georges,M., Szpirer,C. and Vassart,G. TITLE Structure, evolution and chromosomal localization of the human pregnancy-specific beta-1 glycoprotein gene family JOURNAL Genomics 6, 579-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept 82 1368 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 82 183 pregnancy-specific beta-1 glycoprotein signal peptide matp 184 1365 pregnancy-specific beta-1 glycoprotein mRNA 1 1856 pregnancy-specific beta-1 glycoprotein mRNA BASE COUNT 544 a 476 c 364 g 472 t ORIGIN Chromosome 19. 1 gcacagctga gagccatgct caggaagttt ctggatccta ggctcagctc cacagaggag 61 aacacgcagg cagcagagac catggggccc ctctcagccc ctccctgcac acagcgcatc 121 acctggaagg ggctcctgct cacagcatca cttttaaact tctggaaccc gcctaccact 181 gcccaagtca cgattgaagc cgagccaacc aaagtttcca aggggaagga cgttcttcta 241 cttgtccaca atttgcccca gaatcttgct ggctacatct ggtacaaagg gcaaatgaag 301 gacctctacc attacattac atcatacgta gtagatggtc aaataattat atatgggcct 361 gcatacagtg gacgagaaac agtatattcc aatgcatccc tgctgatcca gaatgtcacc 421 cgggaggacg caggatccta caccttacac atcgtaaagc gaggtgatgg gactagagga 481 gaaactggac atttcacctt caccttatac ctggagactc ccaagccctc catctccagc 541 agcaacttat accccaggga ggacatggag gctgtgagct taacctgtga tcctgagact 601 ccggacgcaa gctacctgtg gtggatgaat ggtcagagcc tccctatgac tcacagcttg 661 cagttgtcca aaaacaaaag gaccctcttt ctatttggtg tcacaaagta cactgcagga 721 ccctatgaat gtgaaatacg gaacccagtg agtgccagcc gcagtgaccc agtcaccctg 781 aatctcctcc cgaagctgcc caagccctac atcaccatca acaacttaaa ccccagggag 841 aataaggatg tcttagcctt cacctgtgaa cctaagagtg agaactacac ctacatttgg 901 tggctaaatg gtcagagcct cccggtcagt cccagggtaa agcgacccat tgaaaacagg 961 atcctcattc tacccagtgt cacgagaaat gaaacaggac cctatcaatg tgaaatacag 1021 gaccgatatg gtggcatccg cagttaccca gtcaccctga atgtcctcta tggtccagac 1081 ctccccagaa tttacccttc attcacctat taccattcag gagaaaacct ctacttgtcc 1141 tgcttcgcgg actctaaccc accagcagaa tattcttgga caattaatgg gaagtttcag 1201 ctatcaggac aaaagctctt tatcccccag attactacaa agcatagcgg gctctatgct 1261 tgctctgttc gtaactcagc cactggcatg gaaagctcca aatccatgac agtcaaagtc 1321 tctgctcctt caggaacagg acatcttcct ggccttaatc cattatagca gccgtgatgt 1381 catttctgta tttcaggaag actggcagac agttgctttc attcttcctc aaagtattta 1441 ccatcagcta cagtccaaaa ttgctttttg ttcaaggaga tttatgaaaa gactctgaca 1501 aggactcttg aatacaagtt cctgataact tcaagatcat accactggac taagaacttt 1561 caaaatttta atgaacaggc tgatacttca tgaaattcaa gacaaagaaa aaaacccaat 1621 tttattggac taaatagtca aaacaatgtt ttcataattt tctatttgaa aatgtgctga 1681 ttctttgaat gttttattct ccagatttat gcactttttt tcttcagcaa ttggtaaagt 1741 atacttttgt aaacaaaaat tgaaacattt gcttttgctc cctaagtgcc ccagaattgg 1801 gaaactattc aggagtattc atatgtttat ggtaataaag ttatctgcac aagttc // LOCUS HUMGPPSBF 2004 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human pregnancy-specific beta-1 glycoprotein mRNA, complete cds. ACCESSION M23575 KEYWORDS beta-1 glycoprotein. SOURCE Human placenta, cDNA to mRNA, clone pSP1-i. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2004) AUTHORS Rooney,B.C., Horne,C.H.W. and Hardman,N. TITLE Molecular cloning of a cDNA for human pregnancy-specific beta-1-glycoprotein: Homology with human carcinoembryonic antigen and related proteins JOURNAL Gene 71, 439-449 (1988) STANDARD simple staff_review FEATURES from to/span description pept 232 1518 pregnancy-specific beta-1 glycoprotein precursor /hgml_locus_uid="LG0073P" /nomgen="PSBG1" /map="19q13.1-q13.2" sigp 232 333 pregnancy-specific beta-1 glycoprotein signal peptide matp 334 1515 pregnancy-specific beta-1 glycoprotein BASE COUNT 579 a 510 c 406 g 509 t ORIGIN 1 gggcgggaca actggtctga gtactatggc tgattttcgc tgtctggcat tgagaagcca 61 cacgcccctt ttgcttagga ggcctctctg ctggaggatg acgatggcat ggtttatcta 121 aggccactga caagtcatca atataggaca gcacagctga gagccatgct caggaagttt 181 ctggatccta ggctcagctc cacagaggag aacacgcagg cagcagagac catggggccc 241 ctctcagccc ctccctgcac acagcgcatc acctggaagg ggctcctgct cacagcatca 301 cttttaaact tctggaaccc gcctaccact gcccaagtca cgattgaagc cgagccaacc 361 aaagtttcca aggggaagga cgttcttcta cttgtccaca atttgcccca gaatcttgct 421 ggctacatct ggtacaaagg gcaaatgaag gacctctacc attacattac atcatacgta 481 gtagatggtc aaataattat atatgggcct gcatacagtg gacgagaaac agtatattcc 541 aatgcatccc tgctgatcca gaatgtcacc cgggaggacg caggatccta caccttacac 601 atcgtaaagc gaggtgatgg gactagagga gaaactggac atttcacctt caccttatac 661 ctggagactc ccaagccctc catctccagc agcaacttat accccaggga ggacatggag 721 gctgtgagct taacctgtga tcctgagact ccggacgcaa gctacctgtg gtggatgaat 781 ggtcagagcc tccctatgac tcacagcttg cagttgtcca aaaacaaaag gaccctcttt 841 ctatttggtg tcacaaagta cactgcagga ccctatgaat gtgaaatacg gaacccagtg 901 agtgccagcc gcagtgaccc agtcaccctg aatctcctcc cgaagctgcc caagccctac 961 atcaccatca acaacttaaa ccccagggag aataaggatg tcttagcctt cacctgtgaa 1021 cctaagagtg agaactacac ctacatttgg tggctaaatg gtcagagcct cccggtcagt 1081 cccagggtaa agcgacccat tgaaaacagg atcctcattc tacccagtgt cacgagaaat 1141 gaaacaggac cctatcaatg tgaaatacag gaccgatatg gtggcatccg cagttaccca 1201 gtcaccctga atgtcctcta tggtccagac ctccccagaa tttacccttc attcacctat 1261 taccattcag gagaaaacct ctacttgtcc tgcttcgcgg actctaaccc accagcagaa 1321 tattcttgga caattaatgg gaagtttcag ctatcaggac aaaagctctt tatcccccag 1381 attactacaa agcatagcgg gctctatgct tgctctgttc gtaactcagc cactggcatg 1441 gaaagctcca aatccatgac agtcaaagtc tctgctcctt caggaacagg acatcttcct 1501 ggccttaatc cattatagca gccgtgatgt catttctgta tttcaggaag actggcagac 1561 agttgctttc attcttcctc aaagtattta ccatcagcta cagtccaaaa ttgctttttg 1621 ttcaaggaga tttatgaaaa gactctgaca aggactcttg aatacaagtt cctgataact 1681 tcaagatcat acatggacta agaactttca aaattttaat gaacaggctg atacttcatg 1741 aaattcaaga caaagaaaaa aacccaattt tattggacta aatagtcaaa acaatgtttt 1801 cataattttc tatttgaaaa tgtgctgatt ctttgaatgt tttattctcc agatttatgc 1861 actttttttc ttcagcaatt ggtaaagtat acttttgtaa acaaaaattg aaacatttgc 1921 ttttgctccc taagtgcccc agaattggga aactattcag gagtattcat atgtttatgg 1981 taataaagtt atctgcacaa accc // LOCUS HUMLEUELA 920 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human elastase/medullasin mRNA, complete cds. ACCESSION M34379 KEYWORDS elastase; medullasin. SOURCE Human leukemic cell line ML3, cDNA to mRNA, clone pSRHLE. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 920) AUTHORS Okano,K., Aoki,Y., Shimizu,H. and Naruto,M. TITLE Functional expression of human leukocyte elastase (HLE)/medullasin in eukaryotic cells JOURNAL Biochem. Biophys. Res. Commun. 167, 1326-1332 (1990) STANDARD full staff_review FEATURES from to/span description pept 39 842 elastase/medullasin precursor (EC 3.4.21.37) sigp 39 119 elastase/medullasin signal peptide matp 126 839 elastase/medullasin BASE COUNT 141 a 329 c 287 g 163 t ORIGIN 1 gcacggaggg gcagagaccc cggagcccca gccccaccat gaccctcggc cgccgactcg 61 cgtgtctttt cctcgcctgt gtcctgccgg ccttgctgct ggggggcacc gcgctggcct 121 cggagattgt ggggggccgg cgagcgcggc cccacgcgtg gcccttcatg gtgtccctgc 181 agctgcgcgg aggccacttc tgcggcgcca ccctgattgc gcccaacttc gtcatgtcgg 241 ccgcgcactg cgtggcgaat gtaaacgtcc gcgcggtgcg ggtggtcctg ggagcccata 301 acctctcgcg gcgggagccc acccggcagg tgttcgccgt gcagcgcatc ttcgaaaacg 361 gctacgaccc cgtaaacttg ctcaacgaca tcgtgattct ccagctcaac gggtcggcca 421 ccatcaacgc caacgtgcag gtggcccagc tgccggctca gggacgccgc ctgggcaacg 481 gggtgcagtg cctggccatg ggctggggcc ttctgggcag gaaccgtggg atcgccagcg 541 tcctgcagga gctcaacgtg acggtggtga cgtccctctg ccgtcgcagc aacgtctgca 601 ctctcgtgag gggccggcag gccggcgtct gtttcgggga ctccggcagc cccttggtct 661 gcaacgggct aatccacgga attgcctcct tcgtccgggg aggctgcgcc tcagggctct 721 accccgatgc ctttgccccg gtggcacagt ttgtaaactg gatcgactct atcatccaac 781 gctccgagga caacccctgt ccccaccccc gggacccgga cccggccagc aggacccact 841 gagaagggct gcccgggtca cctcagctgc ccacacccac actctccagc atctggcaca 901 ataaacattc tctgttttgt // LOCUS MSGIS6110 1360 bp ds-DNA BCT 17-JUL-1990 DEFINITION M.tuberculosis-50 complex IS6110 insertion sequence-like element. ACCESSION M29899 KEYWORDS insertion sequence. SOURCE M.tuberculosis (strain H37RV) DNA (cosmid library pHC79), clone I21. ORGANISM Mycobacterium tuberculosis Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae. REFERENCE 1 (bases 1 to 1360) AUTHORS Thierry,D., Cave,M.D., Eisenach,K.D., Crawford,J.T., Bates,J.H., Gicquel,B. and Guesdon,J.L. TITLE IS6110 an IS-like element of Mycobacterium tuberculosis-50 complex JOURNAL Nucleic Acids Res. 18, 188-188 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Thierry 15-NOV-1989. BASE COUNT 269 a 439 c 432 g 220 t ORIGIN 1 cgatgaaccg ccccggcatg tccggagact ccagttcttg gaaaggatgg ggtcatgtca 61 ggtggttcat cgaggaggta cccgccggag ctgcgtgagc gggcggtgcg gatggtcgca 121 gagatccgcg gtcagcacga ttcggagtgg gcagcgatca gtgaggtcgc ccgtctactt 181 ggtgttggct gcgcggagac ggtgcgtaag tgggtgcgcc aggcgcaggt cgatgccggc 241 gcacggcccg ggaccacgac cgaagaatcc gctgagctga agcgcttagc ggcgggacaa 301 cgccgaattg cgaagggcga acgcgatttt aaagaccgcg tcggctttct tcgcggccga 361 gctcgaccgg ccagcacgct aattaacggt tcatcgccga tcatcagggc caccgcgagg 421 gccccgatgg tttgcggtgg ggtgtcgagt cgatctgcac acagctgacc gagctgggtg 481 tgccgatcgc cccatcgacc tactacgacc acatcaaccg ggagcccagc cgccgcgagc 541 tgcgcgatgg cgaactcaag gagcacatca gccgcgtcca cgccgccaac tacggtgttt 601 acggtgcccg caaagtgtgg ctaaccctga accgtgaggg catcgaggtg gccagatgca 661 ccgtcgaacg gctgatgacc aaactcggcc tgtccgggac cacccgcggc aaagcccgca 721 ggaccacgat cgctgatccg gccacagccc gtcccgccga tctcgtccag cgccgcttcg 781 gaccaccagc acctaaccgg ctgtgggtag cagacctcac ctatgtgtcg acctgggcag 841 ggttcgccta cgtggccttt gtcaccgacg cctacgtcgc aggatcctgg gctggcgggt 901 cgcttccacg atggccacct ccatggtcct cgacgcgatc gagcaagcca tctggacccg 961 ccaacaagaa ggcgtactcg acctgaaaga cgttatccac catacggata ggggatctca 1021 gtacacatcg atccggttca gcgagcggct cgccgaggca ggcatccaac cgtcggtcgg 1081 agcggtcgga agctcctatg acaatgcact agccgagacg atcaacggcc tatacaagac 1141 cgagctgatc aaacccggca agccctggcg gtccatcgag gatgtcgagt tggccaccgc 1201 gcgctgggtc gactggttca accatcgccg cctctaccag tactgcggcg acgtcccgcc 1261 ggtcgaactc gaggctgcct actacgctca acgccagaga ccagccgccg gctgaggtct 1321 cagatcagag agtctccgga ctcaccgggg cggttcacga // LOCUS MUSIGHAAT 348 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig J558 family active H-chain mRNA V-J3 region from hybridoma CE5, partial cds. ACCESSION M34119 KEYWORDS diversity exon; immunoglobulin heavy chain; processed gene. SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 348 Ig H-chain V-J3 region (AA at 1) recomb 294 295 J-region end/D-region start recomb 303 304 D-region end/J-region start BASE COUNT 89 a 82 c 97 g 80 t ORIGIN Chromosome 12. 1 caggttcagc tgcagcagtc tggagctgaa ctgatgaagc ctggggcctc agtgaagata 61 tcctgcaagg ctactggcta cacattcagt aagtactgga tagagtgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagag attttacctg gaagtggtag tactaaccat 181 gatgagaagt tcaagggcaa ggccacattc actgcagata catcctccaa cacagcctac 241 atgcaactca gcagcctgac atctgaggac tctgccgtct attactgtgc aagagacggt 301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca // LOCUS MUSIGKCSR 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma GA733, partial cds. ACCESSION M34120 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma GA733, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain (AA at 1) BASE COUNT 87 a 83 c 74 g 77 t ORIGIN Chromosome 6. 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagtgtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatccgacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcaccaa cattatatta ctcctcggac gttcggtgga 301 ggcaccaaac tggaaatcaa a // LOCUS MUSIGKCSS 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma C017-1A, partial cds. ACCESSION M34121 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma C017-1A, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 321 Ig kappa-chain (AA at 1) BASE COUNT 89 a 77 c 78 g 77 t ORIGIN Chromosome 6. 1 aacattgtaa tgacccaatc tcccaaatcc atgtccatgt cagtaggaga gagggtcacc 61 ttgacctgca aggccagtga gaatgtggtt acttatgttt cctggtatca acagaaacca 121 gagcaatctc ctaaactctt gatttacggg gcctccaacc ggtacactgg ggtccccgat 181 cgcttcacag gtagtggatc tgcaacagat ttcactctga ccattagtag tgtgcaagct 241 gaagaccttg cagattatca ctgtggacag ggttacagct atccgtacac gttcggaggg 301 gggaccaagc tggaaataaa a // LOCUS MUSIGKCST 318 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig active kappa-chain mRNA V-region from hybridoma CE5, partial cds. ACCESSION M34122 KEYWORDS immunoglobulin light chain; kappa-immunoglobulin; processed gene. SOURCE Mouse (Balb/c) hybridoma CE5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Caton,A.J., Herlyn,D., Ross,A.H. and Koprowski,H. TITLE Identical D region sequences expressed by murine monoclonal antibodies specific for a human tumor-associated antigen JOURNAL J. Immunol. 144, 1965-1968 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 318 Ig kappa-chain (AA at 1) BASE COUNT 84 a 78 c 80 g 76 t ORIGIN Chromosome 6. 1 gacattgtga tgacccagtc tcagaaattc atgtccacat cagtaggaga cagggtcggc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaatca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgag 181 cgcttcgctg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcatcaa cattatagta ctcggacgtt cggtggaggc 301 accaagctgg aaatcaaa // LOCUS PSERRSAA 1517 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.aeruginosa 16S ribosomal RNA. ACCESSION M34133 KEYWORDS 16S ribosomal RNA; ribosomal RNA; small subunit ribosomal RNA. SOURCE P.aeruginosa (strain 25330) ribosomal RNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1517) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept of Microbiology 407 S. Goodwin Avenue Urbana, IL 61801 email: carl@ninja.life.uiuc.edu FEATURES from to/span description rRNA 1 1517 16S rRNA BASE COUNT 379 a 334 c 470 g 301 t 33 others ORIGIN 1 ntactgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 61 gtcgagcgga tgaagggagc ttgctcctgg attcagcggc ggacgggtga gtaatgccta 121 ggaatctgcc tgatagtggg ggataacgtc cggaaacggg cgctaatacc gcatacgtcc 181 tgagggagaa agggggggat cctcggacct cacgctatcn gatgagccta ggtcggatta 241 gctagttggt ggggtaaagg cctaccaagg cgacgatccg taactggtct gagaggacga 301 tcagtcacac tggaactgag acacggtcca gactcctacg ggaggcagca gtggggaata 361 ttggacaatg ggcgaaagcc ngatccagcc atgccgcgtg tgtgaagaag gtcttcggat 421 tgtaaagcac tttaagttgg gaggaagggc agtaagttaa taccttgctg ttttgacgtt 481 accaacagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac gaagggtgcg 541 agcgttaatc ggaattactg ggcgtaaagc gcgcgtaggt ggttcagcaa gttggatgtg 601 aaatccccgg gctcaacctg ggaactgcat ccnaaactac tgagctagag tacggtagag 661 ggtggtggaa tttcctgtgt agcggtgaaa tgcgtagata taggaaggaa caccagtggc 721 gaaggcgacc acctggactg atactgacac tgaggtgcga aagcgtgggg agcaaacagg 781 attagatacc ctggtagtcc acgccgtaaa cgatgtcgac tagccgttgg gatccttgag 841 atcttagtgg cgcagctaac gcgataagtc gaccgcctgg ggagtacggc cgcaaggtta 901 aaactcaaat gaattgacgg gggcnngcac aagcggtgga gcatgtggtt taattcgaag 961 caacgcgaag aaccttacct ggccttgaca tgctgagaac tttccagaga tggattggtg 1021 ccttcgggaa ctcagacaca ggtgctgcat ggctgtcgtc agctcgtgtc gtgagatgtt 1081 gggttaagtc ccgtaacgag cgcaaccctt gtccttagtt accagcacct cgggtgggca 1141 ctctaaggag actgccggtg acaaaccgga ggaaggtggg gatgacgtca agtcatcatg 1201 gcccttacgg cnagggctac acacgtgcta caatggtcgg tacaaagggt tgcgaagccg 1261 cgaggtggag ctaatcccat aaaaccgatc gtagtccgga tcgcagtctg caactcgact 1321 gcgtgaagtc ggaatcgcta gtaatcgtga atcagaatgt cacggtgaat acgttcccgg 1381 gccttgtaca caccgcccgt cacaccatgg gagtgggttg ctccagaagt agctagtcta 1441 accgcaaggg ggacggttac cacggagtga ttcatgnnnn nnnnnnnnnn gtaacaagnn 1501 nnnnnnnnnn gaacctg // LOCUS RATNESTIN 5946 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat nestin mRNA, complete cds. ACCESSION M34384 KEYWORDS intermediate filament protein; nestin. SOURCE Rat (strain E15) embryo central nervous system, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5946) AUTHORS Lendahl,U., Zimmerman,L.B. and McKay,R.D.G. TITLE CNS stem cells express a new class of intermediate filament protein JOURNAL Cell 60, 585-595 (1990) STANDARD simple staff_review FEATURES from to/span description pept 128 5545 nestin (128 could be 161) mRNA 1 5495 nestin mRNA BASE COUNT 1600 a 1401 c 1871 g 1074 t ORIGIN 1 tgctggagtt ctccgcttcc gctgggtcac tgtcgccgct acttcttttc aacccctaaa 61 agctccacgg gccactccct tctctagtgc tccacgtccg cttgccctcg ggggccagac 121 cagcgacatg gagggttgcg tcggggaaga atcttttcag atgtgggagc tcaatcgacg 181 cctggaggcc tacctgaccc gggtcaagac gctagaagag caaaaccagc tgctcagcgc 241 cgagcttggg ggactccggg cgcagtccgg agacacctcc tggagagccc gagccgatga 301 cgagctggca tccctgcgga tcctcgtcga tcagcgctgg cgggagaagc tcgaggctga 361 ggtgcagcgc gacaaccttg cggaagagct ggagagcgtg gcgggccggt gccagcaggt 421 gcggcttgct cgggagcgga ccgtccagga ggccgcctgc agccggcgcg cactcgaggc 481 ggagaagaat gcgcggggct ggctgagcac ccaggcggcc gagctggagc gggagttaga 541 ggctctgcga gccgcgcacg aggaggagcg cgcacacctg aacgcccagg ccgcctgtgc 601 gcctcgccgg ccccccgcac cgccccaccg gatccccggt ccggcccccg aagtcgagga 661 tctggccagg cgactaggcg aagtgtggcg cggggcggtg cgtgactacc aggagcgcgt 721 ggctcacatg gaaagctcgc tgggtcaggc acgcgagcgg ctgagccaag ccgtgcgggg 781 cgctcgggag tgtcgcttag aggtgcaaca gctgcaggct gatcgcgaca gcctccagga 841 gcgcagagaa gcgctggaac agagattgga aggccgctgg caggaccggc tgcaggccac 901 tgataagttc cagctggctg tggaagccct ggagcaggag aagcaaggtc tacagagtca 961 gatcgctcag atcctggaag gtgggcagca actggcacac ctcaagatgt cccttagtct 1021 ggaggtggct acatacagga ctctgctgga ggctgagaac tctcggttgc agacacctgg 1081 acgaggttcc caggcttctc ttggctttct ggaccccaag ctgaagccga atttccttgg 1141 gataccagag gaccagtacc tgggatctgt gctccctgcc ctcagcccca catccttccc 1201 ttcccccttg cctaataccc ttgagactcc tgtgacagcc ttcctgaaga ctcaggagtt 1261 ccttcaggcc agaaccccca ccttggccag cactcccatc ccacctatat ctgaggctcc 1321 ctgtcctcca aatgcagagg tgagagccca ggaggtccct ctttctctgc tccagacaca 1381 ggctccagag cccctttggc tgaaggccac agtgcctagt tcttctgcta tcctcccaga 1441 actagaggaa cctgggggca agcagcaggg tcacttccct gatgatctga cctccttagc 1501 cacaaacctc aaccctcacc accctacttt agaggctaaa gatggagaat ccagtgagtc 1561 tagagtttct agcatattcc aggaagatga ggggcaaatc tgggaactgg tagagaaaga 1621 agcagatata gaggtaaaag tagaaaacag ctcagcccag aaaacacaag aaagtggtct 1681 ggacacagaa gaaacccagg attcccaggg acctttgcag aaggaaacac tgaaggctct 1741 aggagaggag ccactgatgt ctctgaaaat ccagaactat gagacagcag ggaaagagaa 1801 ttgcaattct tctacagaag gccacctggg aacactagaa ggcccagaaa aagaaaagca 1861 aataccacta aagtctttag aagaaaagaa tgtagagtca gagaaaactc tagaaaatgg 1921 ggttcctgta ctatctgagc ttttaggaaa agaagacaca agaacagagg atcaagaatt 1981 aatgtctcct aaaggtacac taaagagatt ttcatctcta ggaaaggaaa gtcaagaagt 2041 agtgaggcct tcaaaagagg ggaacctaga atcatggaca gcttttaaag aggagagcca 2101 acacccactg ggatttccag gagctgagga ccagatgctt gagagactgg tagagaaaga 2161 ggatcagagc ttcccaaggt ctccagagga agaggaccag gaggcatgta gacctctgca 2221 gaaagagaat caggaaccac tagggtatga agaagcagag ggccagatac ttgagagact 2281 gatagaaaaa gagagtcagg agtccctgag gtctccagaa gaagaggacc aggaggcagg 2341 tagatctctg cagaaagaga atcaggagcc actagggtat gaagaagcag aggaccagat 2401 gcttgagaga ctgatagaaa aagagagtca ggagtccctg aagtctccag aagaaaacca 2461 gaggattggg aagcctctag aaagagagaa tcagaaatct ctgaggtatc ttgaagaaaa 2521 ccaggagact tttgtaccac tagaaagcag gaaccagagg ccactgagat ctctagaagt 2581 agaagaggag gagcagagaa ttgtgaaacc tctagaaaaa gtgagtcagg attccctcgg 2641 atctctagca gaagagaatg tgcagccact gaggtatctg gaagaagatg actgcataaa 2701 taagagcctt ctagaagaca agactcacaa gtccttgggg tctcttgaag atagaaatgg 2761 ggatagcatt attataccac aagaaagtga gacccaggtt tcattgaggc ctccagaaga 2821 ggaggaccag aggattgtga accatctaga aaaagaaagt caggagttct cgaggtcttc 2881 agaagaagaa gagcaggtga tggagagatc tctagaagga gagaaccatg aatcactgag 2941 ttctgtagaa aaagaggacc agatggttga gagccaacta gagaaagaga gtcaggactc 3001 agggaagtct cttgaagatg agagccagga gacctttgga cctctggaaa aagagaatgc 3061 agagtccctg agatctctag caggacagga ccaagaggaa cagaagcttg aacaagagac 3121 ccaacaaaca ctgagggctg tagggaatga gcagatggca gtgagcccac cagaaaaggt 3181 ggatccagag ttaccgaagc ctcttggaaa tgaccaggaa atagctagat ctcttggaaa 3241 agagaatcaa gagtcactag tgtcactgaa agaaaaaggt atagagacag tgaagtcttt 3301 agaaacagag atcatagaac cactggagac tgcagaagag gacctggaaa gaaggaagtc 3361 tatagatact caggagccat tgtggtctac tgaagtggct agagagacag tagaacctcc 3421 agaagatgag cccccaggat cgctagggtc tgtggatgag aaccgagaga cactgacatc 3481 ccttgaaaag gagagtcaag aactgagctc tctgggcaag tggaacgtag agaccagggt 3541 agaggacagt cagcagtgcc tgcaagtaga agagggtctg caggaggaac agcaccaaga 3601 gtctctgaga gaggtgaagc aggagctgcc tagctctgga aatcaacagc ggtgggagga 3661 tgtggtggag ggcaaagcag tgggtcagga agcacctctg gcaaccacag gagtgggaac 3721 tgaggataag gcagagttgc atctgagggg gcaaggtgga gaggaagaag ctgcagcaga 3781 gggagagctg ttgcaggata ttgtggggga ggcctggagt ctggggagct ctgagcccaa 3841 ggagcagagg gtccctgctg aggccctcga caacctggaa ggaggggcct tagaggtccc 3901 agttgctcag tcaatgccag aggtgacaga gcgagatgag gatagagccc aagcaggtga 3961 acaagactcc atagaggtga cccttgggtt agaggctgcc agaactggac tggaactcga 4021 gcaggaagtg gtagggctag aggacccaag gcattttgcc agggaggagg ccattccccc 4081 atccctgggg gaggaaagtg tgaaggcaaa gatagctcag ggcttggaag ggcctggaaa 4141 ggaaccaaaa gaggcaggtg ctctggactc ggggatcctt gaattgccca agactagcag 4201 cgaggctctg gaatgccagg gccatgaaga gtctgagtcc atggagggct gggaagaaga 4261 ggaggcctca ctggagactt cagatcatga gggcagtgat gcccctcagc ccaggccccc 4321 agaaacagaa gaagatgagg gtgcacaggc agcactgaca gcccctggtc ccaagctctt 4381 ggaaccctgt tcacccatcc caatcctgac agatgcccat gagctgcagc cccaggctga 4441 ggggatccag gaggctggct ggcagccaga agctgggtct gaagcactag aaagggtaga 4501 aaatgagcca gagtttggtc ttggggagat cccggagggc ctccaggatt gggaagaggg 4561 cagagaagaa agcgaggcag atgatctagg ggaaactctc cctgactcta ctcccctggg 4621 cctctacctg aggtcccctg cttctccaaa gtgggatctg gctggagaac agaggctttc 4681 ccctcaaggg gatgccggga aggaagactg gggtcctgct gtccccgctg cccagggcct 4741 cagtggtcca ccggaagagg aggaggagca aggccatggc tctgacctat catctgagga 4801 gtttgaggac ctagggactg aggcctctct tcttccaggg gttcccaagg aggtggcaga 4861 tcacgtgggc caagtgcccc cggtactgca gcctgcatgc tgggatcagg gtggggaatc 4921 tgatgggttt gctgatgagg aagaaagtgg ggaggaggga gaggaagaag atgctgatga 4981 ggaaggagca gagtcaggag ctcagtggtg ggggtcaggg gcctctggtg gaggctgcaa 5041 ggtccaggat attgcccaaa gaggagaccc ggtacaggag tctgtgggtg tcagtggtct 5101 ctgggatgat ggcttgagag gtgctgcagc taatgttcct gccctagaga tggtatctca 5161 ggacagtgct gagccttctg ggtcagagga gtctgagtct gcttccttgg agggggagga 5221 aggtcaagtg actgaccatt tagatgctcc ccaggaggtg accagcatgg tcccgggggt 5281 aggagatgcc tttgacattg gtggccagag ccccaacttg gactcagaac aagtgaatgg 5341 gaaaatggag aatggactag aacaggctga ggggcaggtg gtcctggatg gggacgagga 5401 tcaagaactc ctattacagg gacaggaggt gggtgctcta aaggttcctt tggtagcatc 5461 tcctgtgcat ctaggcccaa gccagcccct gaagttcact ctgagtgggg tagatgggga 5521 ttcctggtcc tcaggggaag actagaaact gcccctctgg ctctgaggat gtactggtgg 5581 ggatgtccct ccctgctctg ggtgaccact cttagctttg ataacttgac ccatggtatt 5641 tgtcctggag agttgtggct gggctgagca agggaggtga gatcctcctg aaggctcagg 5701 agttccaggc ctatagttct accccctctt tcttctgtgg ctcacctgct ggaagaggcc 5761 tgggcccaga gctttcccac aaggctgttc tggccacagc ttgctagcct tgcctaccac 5821 ctgcacaagg tctggtctgg tgtatgacca ggggagctga gggcagcatt tatctgaccc 5881 ttcatctcag cctgctgaga gcttgttcct ctcttcctcc ctgaataaag ccgtatccct 5941 acctac // LOCUS CLLRRE 1860 bp ss-rRNA RNA 17-JUL-1990 DEFINITION C.sapidus 18S rRNA, 3' end. ACCESSION M34360 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE C.sapidus rRNA. ORGANISM Callinectes sapidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; Brachyrhyncha; Portunoidea; Portunidae. REFERENCE 1 (bases 1 to 1860) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-113 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1860 18S ribosoma RNA BASE COUNT 359 a 375 c 421 g 367 t 338 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct 61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt 121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta 181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc 241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc 301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc 361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac 421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac 481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat 541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct 601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg 661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc 721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata 841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc 901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc 961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg 1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt 1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt 1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac 1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg 1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct 1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc 1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga 1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag 1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc 1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc 1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn 1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn // LOCUS HUMNCSRC 138 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human membrane-associated tyrosine protein kinase (C-SRC) gene, exons 3, 4, NI, and NII, partial cds. ACCESSION M34469 KEYWORDS membrane-associated tryosine protein kinase. SOURCE Human adult brain DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 138) AUTHORS Pyper,J.M. and Bolen,J.B. TITLE Identification of a novel neuronal C-SRC exon expressed in human brain JOURNAL Mol. Cell. Biol. 10, 2035-2040 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 29 membrane-associated tyrosine protein kinase (C-SRC), exon 3 (AA at 1) 30 47 membrane-associated tyrosine protein kinase, exon NI 48 80 membrane-associated tyrosine protein kinase, exon NII 81 > 136 membrane-associated tyrosine protein kinase, exon 4 variant 117 117 a or g variant 135 135 c or t BASE COUNT 36 a 40 c 40 g 22 t ORIGIN 1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc 61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga 121 cagacaggct acatcccc // LOCUS MHVAPEPA 1000 bp ss-RNA VRL 17-JUL-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34435 KEYWORDS peplomer protein. SOURCE Murine (strain JHM-DL) RNA. ORGANISM Murine coronavirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1000) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review BASE COUNT 258 a 192 c 227 g 317 t 6 others ORIGIN 1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag 61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa 121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct 181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa 241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta 301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt 361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca 421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac 481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg 541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga 601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg 661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt 721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt 781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa 841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa 901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag 961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa // LOCUS MHVAPEPB 843 bp ss-RNA VRL 17-JUL-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34436 KEYWORDS peplomer protein. SOURCE Murine (strain A59) RNA. ORGANISM Murine coronavirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 843) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review FEATURES from to/span description site 400 401 deletion site BASE COUNT 227 a 153 c 183 g 280 t ORIGIN 1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag 61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa 121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct 181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa 241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta 301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt 361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt 421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc 481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc 541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat 601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt 661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc 721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact 781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc 841 gaa // LOCUS MHVSP 3780 bp ss-mRNA VRL 17-JUL-1990 DEFINITION Mouse hepatitis virus surface protein S mRNA, complete cds. ACCESSION X04797 M34437 KEYWORDS glycoprotein; membrane glycoprotein; surface glycoprotein; surface projection glycoprotein. SOURCE Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones pJMS1010, pJS112, and pJS92. ORGANISM Murine hepatitis virus A59 Unclassified. REFERENCE 1 (bases 1 to 3780) AUTHORS Schmidt,I., Skinner,M. and Siddell,S. TITLE Nucleotide Sequence of the Gene Encoding the Surface Projection Glycoprotein of Coronavirus MHV-JHM JOURNAL J. Gen. Virol. 68, 47-56 (1987) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description SITE 24 30 put. region of homology preceeding mRNA 5' initiation codons SITE 121 129 pot. N-glycosylation site SITE 208 216 pot. N-glycosylation site SITE 430 438 pot. N-glycosylation site SITE 604 612 pot. N-glycosylation site SITE 1099 1107 pot. N-glycosylation site SITE 1333 1341 pot. N-glycosylation site SITE 1636 1644 pot. N-glycosylation site SITE 1732 1740 pot. N-glycosylation site SITE 1756 1764 pot. N-glycosylation site SITE 1825 1833 pot. N-glycosylation site SITE 1900 1914 put. proteolytic cleavage site SITE 1972 1980 pot. N-glycosylation site SITE 2023 2031 pot. N-glycosylation site SITE 2293 2301 pot. N-glycosylation site SITE 2440 2448 pot. N-glycosylation site SITE 3139 3147 pot. N-glycosylation site SITE 3301 3309 pot. N-glycosylation site SITE 3331 3339 pot. N-glycosylation site SITE 3388 3396 pot. N-glycosylation site SITE 3436 3444 pot. N-glycosylation site SITE 3499 3507 pot. N-glycosylation site SITE 3553 3654 pot. transmembrane domain SITE 3622 3675 cysteine-rich region SITE 3655 3660 charge cluster SITE 3715 3723 pot. N-glycosylation site FEATURES from to/span description pept 31 3738 surface protein S precursor sigp 31 60 surface protein S signal peptide matp 61 3736 surface protein S BASE COUNT 1030 a 718 c 800 g 1232 t ORIGIN 1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc 61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat 121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact 181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct 241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg 301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac 361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt 421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct 481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat 541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag 601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt 661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg 721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct 781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt 841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt 901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt 961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa 1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact 1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg 1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt 1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt 1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt 1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat 1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat 1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt 1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct 1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc 1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct 1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt 1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat 1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc 1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt 1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc 1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat 2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt 2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat 2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct 2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct 2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac 2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag 2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa 2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg 2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct 2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta 2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac 2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag 2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg 2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca 2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc 2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc 3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata 3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca 3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg 3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat 3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg 3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt 3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc 3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag 3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa 3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt 3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa 3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt 3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt // LOCUS MUSIGLAZ 713 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds. ACCESSION M34598 M29013 J03562 KEYWORDS constant region; immunoglobulin; immunoglobulin light chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone Y31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 713) AUTHORS Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A. TITLE Structure of a third murine immunoglobulin lambda light chain variable region that is expressed in laboratory mice JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987) STANDARD simple staff_entry REFERENCE 2 (bases 2 to 450) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review REFERENCE 3 (bases 266 to 429) AUTHORS Sanchez,P. and Cazenave,P.-A. TITLE A new variable region in mouse immunoglobulin lambda light chains JOURNAL J. Exp. Med. 166, 265-270 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 12 > 713 Ig lambda chain precursor V-x,J-2,C-2 region sigp 12 68 Ig lambda chain signal peptide matp 69 > 713 Ig lambda chain recomb 379 380 V-region end/J2-region start recomb 414 415 J2-region end/C2-region start BASE COUNT 175 a 190 c 159 g 189 t ORIGIN 1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt 61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag 121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc 181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca 241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta 301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa 361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc 421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca 481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga 541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca 601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca 661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct // LOCUS MUSIGLVD 681 bp ds-DNA ROD 17-JUL-1990 DEFINITION Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds. ACCESSION M34597 KEYWORDS constant region; germline; immunoglobulin light chain; joining exon; lambda-immunoglobulin; variable region. SOURCE Mouse (strain Balb/c AnPt) liver DNA, clone 30X2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 681) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review FEATURES from to/span description pept 59 104 Ig lambda-chain precursor Vx-J2-C2 region, exon 1 209 531 Ig lambda-chain precursor Vx-J2-C2 region, exon 2 sigp 59 104 Ig lambda-chain signal peptide 209 219 Ig lambda-chain signal peptide matp 220 528 Ig lambda-chain IVS 105 208 Ig lambda-chain Vx-J2-C2 region intron A recomb 530 531 Vx-region end/J2-region start recomb 565 566 J2-region end/C2-region start BASE COUNT 179 a 156 c 136 g 210 t ORIGIN Chromosome 16. 1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat 61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt 121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc 181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca 241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag 301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg 361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct 421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa 481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa 541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta 601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa 661 tcctcttgga cttcgtaggg c // LOCUS PAERRE 1877 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.kadiakensis 18S rRNA, 3' end. ACCESSION M34359 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.kadiakensis rRNA. ORGANISM Palaemonetes kadiakensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; Palaemonoidea; Palaemonidae. REFERENCE 1 (bases 1 to 1877) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-113 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1877 18S ribosoma RNA BASE COUNT 339 a 321 c 399 g 313 t 505 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt 61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt 121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac 181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg 241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca 301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct 361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna 421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga 481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac 541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn 601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac 841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna 901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta 961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta 1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa 1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt 1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc 1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt 1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn 1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg 1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna 1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg 1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca 1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact 1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag 1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnnnnn // LOCUS PBESVBRA 584 bp ds-DNA INV 17-JUL-1990 DEFINITION P.berghei telomeric repeat region subfragment alpha DNA. ACCESSION M34601 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 584) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 461 541 27 bp repeats BASE COUNT 205 a 41 c 85 g 251 t 2 others ORIGIN 1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac 61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc 121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata 181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat 241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat 301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa 361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt 421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg 481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg 541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga // LOCUS PBESVBRB 593 bp ds-DNA INV 17-JUL-1990 DEFINITION P.berghei telomeric repeat region subfragment a DNA. ACCESSION M34602 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 593) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 484 510 27 bp repeat motif BASE COUNT 209 a 40 c 97 g 247 t ORIGIN 1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca 61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac 121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa 181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt 241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat 301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat 361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat 421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg 481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt 541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg // LOCUS PEURRE 1902 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.aztecus 18S rRNA, 3' end. ACCESSION M34362 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.aztecus rRNA. ORGANISM Penaeus aztecus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea; Penaeidae. REFERENCE 1 (bases 1 to 1902) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1902 18S ribosoma RNA BASE COUNT 340 a 369 c 427 g 332 t 434 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt 61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt 121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg 181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc 241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg 301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc 361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag 481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga 541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga 601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg 661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc 721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn 781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn 841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg 901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg 961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna 1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn 1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg 1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa 1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn 1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg 1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac 1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta 1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct 1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg 1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact 1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg 1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt 1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn // LOCUS POCRRE 1874 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.ascensionis 18S rRNA, 3' end. ACCESSION M34358 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.ascensionis rRNA. ORGANISM Procaris ascensionis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Caridea; Procaridoidea; Procarididae. REFERENCE 1 (bases 1 to 1874) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1874 18S ribosoma RNA BASE COUNT 298 a 291 c 331 g 266 t 688 others ORIGIN 1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct 61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt 121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac 181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt 241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac 301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn 361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc 481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa 541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn 601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt 781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg 841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat 901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat 961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga 1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac 1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt 1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca 1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt 1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta 1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct 1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag 1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca 1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn 1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg 1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg 1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnn // LOCUS PRARRE 1869 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.leonensis 18S rRNA, 3' end. ACCESSION M34363 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.leonensis rRNA. ORGANISM Procambarus leonensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea; Astacoidea; Cambaridae. REFERENCE 1 (bases 1 to 1869) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1869 18S ribosoma RNA BASE COUNT 409 a 412 c 459 g 419 t 170 others ORIGIN 1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt 61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt 121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat 181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt 241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc 301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca 361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg 421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt 481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact 541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt 601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag 661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc 721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn 781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa 841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg 901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac 961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg 1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg 1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg 1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa 1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta 1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact 1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg 1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg 1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac 1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa 1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn 1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt 1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa 1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnn // LOCUS PVIC1RPTA 711 bp ds-DNA INV 17-JUL-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28745 M25758 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK247) sporozoite DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 711) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 711 circumsporozoite protein (AA at 1) site 1 15 region 1 site 697 711 region 2 site 502 597 post repeat variable region site 598 696 post repeat constant region BASE COUNT 261 a 151 c 226 g 73 t ORIGIN 1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat 61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca 121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca 181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga 241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc 301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa 361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga 421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat 481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca 541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga 601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta 661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c // LOCUS PVIC1RPTB 657 bp ds-DNA INV 17-JUL-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28746 M25759 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK210) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 657) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 657 circumsporozoite protein (AA at 1) site 1 15 region 1 site 643 657 region 2 rpt 16 477 27 bp repeats site 478 544 3' post repeat variable region site 545 642 3' post repeat constant region BASE COUNT 229 a 135 c 224 g 69 t ORIGIN 1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat 61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa 121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca 181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat 241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca 301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga 361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca 421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga 481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca 541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa 601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc // LOCUS SHV2A 554 bp ss-RNA VRL 17-JUL-1990 DEFINITION Simian hepatitis A virus segment 2A-encoded protein mRNA, partial cds. ACCESSION M34085 KEYWORDS . SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 554) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL Unpublished (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. Author address: E.A.Brown 547 Burnett-Womack CB# 7030 Chapel Hill, NC 27599 FEATURES from to/span description pept < 1 > 554 segment 2A-encoded protein (AA at 1) BASE COUNT 184 a 71 c 138 g 161 t ORIGIN Segment 2A; map position 3108-3662. 1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg 61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag 121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa 181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg 241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg 301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg 361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat 421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt 481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg 541 tcaggaatac agga // LOCUS SHVVP1CP 2373 bp ss-RNA VRL 17-JUL-1990 DEFINITION Simian hepatitis A virus capsid protein VP1 mRNA, partial cds. ACCESSION M34084 KEYWORDS capsid protein VP1. SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed in cwll line BS-C-1. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 2373) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL J. Virol. 63, 4932-4937 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. FEATURES from to/span description pept < 1 > 2373 capsid protein VP1 (AA at 1) BASE COUNT 661 a 447 c 491 g 774 t ORIGIN Segment P1; map position 735-3107. 1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg 61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact 121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca 181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga 241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa 301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg 361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca 421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct 481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc 541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat 601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct 661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca 721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt 781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat 841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca 901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag 961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa 1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg 1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt 1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc 1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg 1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag 1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat 1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct 1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct 1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag 1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta 1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca 1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg 1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt 1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg 1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg 1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca 1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg 2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat 2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc 2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg 2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg 2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat 2341 gatcctcgaa ctgaagagga tcgtaaattt gaa // LOCUS STNRRE 1885 bp ss-rRNA RNA 17-JUL-1990 DEFINITION S.hispidus 18S rRNA, 3' end. ACCESSION M34361 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE S.hispidus rRNA. ORGANISM Stenopus hispidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea; Stenopodidae. REFERENCE 1 (bases 1 to 1885) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustraceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1885 18S ribosoma RNA BASE COUNT 327 a 336 c 430 g 368 t 424 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt 61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact 121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn 181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta 241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag 301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt 361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga 421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn 481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag 541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag 601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn 661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc 721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg 841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg 901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg 961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt 1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt 1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg 1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat 1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg 1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt 1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga 1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng 1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt 1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc 1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg 1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg 1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn 1861 nnnnnnnnnn nnnnnnnnnn nnnnn // LOCUS SIVGAGFA1 541 bp ss-RNA VRL 17-JUL-1990 DEFINITION Simian immunodeficiency virus gag protein (smmgagFA1) gene, partial cds. ACCESSION M33754 KEYWORDS gag protein. SOURCE Simian immunodeficiency virus (isolate smm) RNA, from Cercocebus atys peripheral blood mononuclear cells. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 541) AUTHORS Villinger,F., Powell,J.D., Jehuda-Cohen,T., Neckelmann,N., Vuchetich,M., De,B.K., Folks,T.M., McClure,H.M. and Ansari,A.A. TITLE Detection of occult SIVsmm infection in asymptomatic seronegative nonhuman primates and evidence for variation in SIV gag sequence between endogenous and in vitro propagated virus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.J.Villinger, 18-APR-1990. Author address: F.J.Villinger Emory University, Department of Biology Winship, Cancer Center Atlanta GA, 30322 Ph: (404) 248-4555. FEATURES from to/span description pept < 1 > 541 gag protein (AA at 1) BASE COUNT 191 a 109 c 126 g 115 t ORIGIN 1 tttcaagcgc tatcagaagg ctgcactccc tatgacatca atcaaatgct aaattgtgta 61 ggagaacatc aggcagccat gcaaattatt agagagatta taaatgaaga agctgctgat 121 tgggatctac aacacccgca gccaggtcca ataccagcag tacaactcag agagccaaga 181 ggatcagata ttgcaggaac cactagtaca gtggacgagc aaattcagtg gatgtatagg 241 caacaaaatc ctgtaccagt gggaaatatt tacaggagat ggatccaact tggattacaa 301 aagtgtgtca ggatgtataa tccaacaaac attctggacg taaaacaggg gccgaaagag 361 ccattccaga gctatgtaga caggttctac aaaagcttaa gagcagaaca gacagatcca 421 gcagtgaaga attggatgac ccaaacactg ctgattcaaa atgctaaccc agattgtaaa 481 ttggtgctca agggtctggg tatgaatccc actttagaag aaatgctgac agcctgtcag 541 g // LOCUS SIVGAGFA2 335 bp ss-RNA VRL 17-JUL-1990 DEFINITION Simian immunodeficiency virus gag protein (smmgagFA2) gene, partial cds. ACCESSION M33755 KEYWORDS gag protein. SOURCE Simian immunodeficiency virus (isolate smm) RNA, from Cercocebus atys peripheral blood mononuclear cells. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 335) AUTHORS Villinger,F., Powell,J.D., Jehuda-Cohen,T., Neckelmann,N., Vuchetich,M., De,B.K., Folks,T.M., McClure,H.M. and Ansari,A.A. TITLE Detection of occult SIVsmm infection in asymptomatic seronegative nonhuman primates and evidence for variation in SIV gag sequence between endogenous and in vitro propagated virus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.J.Villinger, 18-APR-1990. Author address: F.J.Villinger Emory University, Department of Biology Winship, Cancer Center Atlanta GA, 30322 FEATURES from to/span description pept < 1 > 335 gag protein (AA at 1) BASE COUNT 121 a 70 c 72 g 72 t ORIGIN 1 tttcaagcgc tatcagaagg ctgcactccc tatgacatca atcaaatgct aaattgtgta 61 ggagaacatc aggcagccat gcaaattatt agagagatta taaataaaga agctgctgat 121 tgggatttac aacacccgca gccaggtcca ataccagcag gacaactcag agagccaaga 181 ggatcagata ttgcaggaac cactagtaca gtggacgagc aaattcagtg gatgtatagg 241 caacaaaatc ctgtaccagt cggaaacatt tacaggagat ggatccaact tgcattacaa 301 aagtgtgtca ggatgtacaa tccaacaaac attct // LOCUS NEUAMTR 5928 bp ds-DNA PLN 17-JUL-1990 DEFINITION N.crassa mating type protein gene, complete cds. ACCESSION M33876 KEYWORDS mating type protein. SOURCE N.crassa (strain 74-ORS-A) DNA. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 5928) AUTHORS Grotelueschen,J., Metzenberg,R.L. and Glass,N.L. TITLE The Neurospora crassa A mating type region JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.L.Glass, 26-APR-1990. FEATURES from to/span description pept 4121 4283 mating type protein, exon 1 4343 5046 mating type protein, exon 2 IVS 4284 4342 mating type protein intron A signal 3858 3906 GC signal signal 3945 3949 CAAT box signal 5319 5326 poly-A signal BASE COUNT 1470 a 1459 c 1526 g 1473 t ORIGIN 1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg 61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga 121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct 181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct 241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag 301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa 361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga 421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa 481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg 541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg 601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg 661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc 721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg 781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc 841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg 901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc 961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat 1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa 1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc 1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg 1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt 1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa 1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc 1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca 1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat 1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac 1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca 1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta 1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga 1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt 1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa 1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac 1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc 1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc 2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg 2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga 2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg 2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca 2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca 2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc 2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca 2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat 2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag 2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc 2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca 2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg 2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag 2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg 2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata 2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg 3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca 3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca 3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt 3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt 3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca 3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag 3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac 3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta 3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca 3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac 3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg 3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca 3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt 3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt 3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc 3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa 3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc 4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt 4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat 4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc 4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt 4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat 4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag 4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat 4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg 4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg 4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg 4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag 4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat 4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt 4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct 4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc 4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat 4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg 5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca 5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc 5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg 5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg 5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg 5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg 5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca 5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa 5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct 5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg 5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag 5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg 5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc 5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc 5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac // LOCUS HUMNCADH 3451 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human N-cadherin mRNA, complete cds. ACCESSION M34064 KEYWORDS N-cadherin; cell adhesion molecule; transmembrane protein. SOURCE Human muscle, cDNA to mRNA, clones lambda-[4-10,1-5,13,14]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3451) AUTHORS Walsh,F.S., Barton,C.H., Putt,W., Moore,S.E., Kesell,D., Spurr,N. and Goodfellow,P.N. TITLE The N-cadherin gene maps to human Chromosome 18 and is not linked to the E-cadherin gene JOURNAL J. Neurochem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.H.Barton, 08-MAY-1990. FEATURES from to/span description pept < 1 2247 N-cadherin (AA at 1) BASE COUNT 1041 a 703 c 724 g 983 t ORIGIN Chromosome 18. 1 gactgggtca tccctccaat caacttgcca gaaaactcca ggggaccttt tcctcaagag 61 cttgtcagga tcaggtctga tagagataaa aacctttcac tgcggatacg tgtaactggg 121 ccaggagctg accagcctcc aactggtatc ttcattctca accccatctc gggtcagctg 181 tcggtgacaa agcccctgga tcgccagcag aatgcccggt ttcatttagg ggcacatgca 241 gtagatatta atggaaatca agtggagacc cccattgaca ttgtcatcaa tgttattgac 301 atgaatgaca acagacctga gttcttacac caggtttgga atgggacagt tcctgaggga 361 tcaaagcctg gaacatatgt gatgaccgta acagcaattg atgctgacga tcccaatgcc 421 ctcaatggga tgttgaggta cagaatcgtg tctcaggctc caagcacccc ttcacccaac 481 atgtttacaa tcaacaatga gactggtgac atcatcacag tggcagctgg acttgatcga 541 gaaaaagtgc aacagtatac gttaataatt caagctacag acatggaagg caatcccaca 601 tatggccttt caaacacagc cacggccgtc atcacagtga cagatgtcaa tgacaatcct 661 ccagagttta ctgccatgac gttttatggt gaagttcctg agaacagggt agacatcata 721 gtagctaatc taactgtgac cgataaggat caaccccata caccagcctg gaacgcagtg 781 tacagaatca gtggcggaga tcctactgga cggttcgcca tccagaccga cccaaacagc 841 aacgacgggt tagtcaccgt ggtcaaacca atcgactttg aaacaaatag gatgtttgtc 901 cttactgttg ctgcagaaaa tcaagtgcca ttagccaagg gaattcagca cccgcctcag 961 tcaactgcaa ccgtgtctgt tacagttatt gacgtaaatg aaaaccctta ttttgccccc 1021 aatcctaaga tcattcgcca agaagaaggg cttcatgccg gtaccatgtt gacaacattc 1081 actgctcagg acccagatcg atatatgcag caaaaatatt taagatacac taaattatct 1141 gatcctgcca attggctaaa aatagatcct gtgaatggac aaataactac aattgctgtt 1201 ttggaccgag aatcaccaaa tgtgaaaaac aatatatata atgctacttt ccttgcttct 1261 gacaatggaa ttcctcctat gagtggaaca ggaacgctgc agatctattt acttgatatt 1321 aatgacaatg cccctcaagt gttacctcaa gaggcagaga cttgcgaaac tccagacccc 1381 aattcaatta atattacagc acttgattat gacattgatc caaatgctgg accatttgct 1441 tttgatcttc ctttatctcc agtgactatt aagagaaatt ggaccatcac tcggcttaat 1501 ggtgattttg ctcagcttaa tttaaagata aaatttcttg aagctggtat ctatgaagtt 1561 cccatcataa tcacagattc gggtaatcct cccaaatcaa atatttccat cctgcgcgtg 1621 aaggtttgcc agtgtgactc caacggggac tgcacagatg tggacaggat tgtgggtgcg 1681 gggcttggca ccggtgccat cattgccatc ctgctctgca tcatcatcct gcttatcctt 1741 gtgctgatgt ttgtggtatg gatgaaacgc cgggataaag aacgccaggc caaacaactt 1801 ttaattgatc cagaagatga tgtaagagat aacattttaa aatatgatga agaaggtgga 1861 ggagaagaag accaggacta tgacttgagc cagctgcagc agcctgacac tgtggagcct 1921 gatgccatca agcctgtggg aatccgacga atggatgaaa gacccatcca cgccgagccc 1981 cagtatccgg tccgatctgc agccccacac cctggagaca ttggggactt cattaatgag 2041 ggccttaaag cggctgacaa tgaccccaca gctccaccat atgactccct gttagtgttt 2101 gactatgaag gcagtggctc cactgctggg tccttgagct cccttaattc ctcaagtagt 2161 ggtggtgagc aggactatga ttacctgaac gactgggggc cacggttcaa gaaacttgct 2221 gacatgtatg gtggaggtga tgactgaact tcagggtgaa cttggttttt ggacaagtac 2281 aaacaatttc aactgatatt cccaaaaagc attcagaagc taggctttaa ctttgtagtc 2341 tactagcaca gtgcctgctg gaggctttgg cataggctgc aaaccaattt gggctcagag 2401 ggaatatcag tgatccatac tgtttggaaa aacactgagc tcagttacac ttgaatttta 2461 cagtacagaa gcactgggat tttatgtgcc tttttgtacc tttttcagat tggaattagt 2521 tttctgttta aggctttaat ggtactgatt tctgaaacga taagtaaaag acaaaatatt 2581 ttgtggtggg agcagtaagt taaaccatga tatgcttcaa cacgcttttg ttacattgca 2641 tttgctttta ttaaaataca aaattaaaca aacaaaaaaa ctcatggagc gattttatta 2701 tcttggggga tgagaccatg agattggaaa atgtacatta cttctagttt tagactttag 2761 tttgtttttt ttttttttca ctaaaatctt aaaacttact cagctggttg caaataaagg 2821 gagttttcat atcaccaatt tgtagcaaaa ttgaattttt tcataaacta gaatgttaga 2881 cacattttgg tcttaatcca tgtacacctt tttatttctg tatttttcca cttcactgta 2941 aaaatagtat gtgtacataa tgttttattg gcatacgtct atggagaagt gcagaaactt 3001 cagaacatgt gtatgtatta tttggactat ggattcaggt tttttgcatg tttatatctt 3061 tcgttatgga taaagtattt acaaaacagt gacatttgat tcaattgttg agctgtagtt 3121 agaatactca atttttaatt tttttaattt ttttattttt tattttcttt ttggtttggg 3181 gagggagaaa agttcttagc acaaatgttt tacataattt gtaccaaaaa aaaaaaaaaa 3241 ggaaaggaaa gaaaggggtg gcctgacact ggtggcacta ctaagtgtgt gtttttttaa 3301 aaaaaaaatg gaaaaaaaaa agcctttaaa ctggagagac ttctgacaac agctttgcct 3361 ctgtattgtg taccagaata taaatgatac acctctgacc ccagcgttct gaataaaatg 3421 ctaattttgg ataacaaaaa aaggggaatt c // LOCUS HUMFGF2H 3365 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34185 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3365) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 256 2457 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 786 a 962 c 917 g 700 t ORIGIN 1 gcaccgagcg ccgccgggag tcgagcgccg gccgcggagc tcttgcgacc ccgccaggac 61 ccgaacagag cccgggggcg gcgggccgga gccggggacg cgggcacacg cccgctcgca 121 caagccacgg cggactctcc cgaggcggaa cctccacgcc gagcgagggt cagtttgaaa 181 aggaggatcg agctcactgt ggagtatcca tggagatgtg gagccttgtc accaacctct 241 aactgcagaa ctgggatgtg gagctggaag tgcctcctct tctgggctgt gctggtcaca 301 gccacactct gcaccgctag gccgtccccg accttgcctg aacaagatgc tctcccctcc 361 tcggaggatg atgatgatga tgatgactcc tcttcagagg agaaagaaac agataacacc 421 aaaccaaacc gtatgcccgt agctccatat tggacatccc cagaaaagat ggaaaagaaa 481 ttgcatgcag tgccggctgc caagacagtg aagttcaaat gcccttccag tgggacccca 541 aaccccacac tgcgctggtt gaaaaatggc aaagaattca aacctgacca cagaattgga 601 ggctacaagg tccgttatgc cacctggagc atcataatgg actctgtggt gccctctgac 661 aagggcaact acacctgcat tgtggagaat gagtacggca gcatcaacca cacataccag 721 ctggatgtcg tggagcggtc ccctcaccgg cccatcctgc aagcagggtt gcccgccaac 781 aaaacagtgg ccctgggtag caacgtggag ttcatgtgta aggtgtacag tgacccgcag 841 ccgcacatcc agtggctaaa gcacatcgag gtgaatggga gcaagattgg cccagacaac 901 ctgccttatg tccagatctt gaagactgct ggagttaata ccaccgacaa agagatggag 961 gtgcttcact taagaaatgt ctcctttgag gacgcagggg agtatacgtg cttggcgggt 1021 aactctatcg gactctccca tcactctgca tggttgaccg ttctggaagc cctggaagag 1081 aggccggcag tgatgacctc gcccctgtac ctggagatca tcatctattg cacaggggcc 1141 ttcctcatct cctgcatggt ggggtcggtc atcgtctaca agatgaagag tggtaccaag 1201 aagagtgact tccacagcca gatggctgtg cacaagctgg ccaagagcat ccctctgcgc 1261 agacaggtaa cagtgtctgc tgactccagt gcatccatga actctggggt tcttctggtt 1321 cggccatcac ggctctcctc cagtgggact cccatgctag caggggtctc tgagtatgag 1381 cttcccgaag accctcgctg ggagctgcct cgggacagac tggtcttagg caaacccctg 1441 ggagagggct gctttgggca ggtggtgttg gcagaggcta tcgggctgga caaggacaaa 1501 cccaaccgtg tgaccaaagt ggctgtgaag atgttgaagt cggacgcaac agagaaagac 1561 ttgtcagacc tgatctcaga aatggagatg atgaagatga tcgggaagca taagaatatc 1621 atcaacctgc tgggggcctg cacgcaggat ggtcccttgt atgtcatcgt ggagtatgcc 1681 tccaagggca acctgcggga gtacctgcag gcccggaggc ccccagggct ggaatactgc 1741 tacaacccca gccacaaccc agaggagcag ctctcctcca aggacctggt gtcctgcgcc 1801 taccaggtgg cccgaggcat ggagtatctg gcctccaaga agtgcataca ccgagacctg 1861 gcagccagga atgtcctggt gacagaggac aatgtgatga agatagcaga ctttggcctc 1921 gcacgggaca ttcaccacat cgactactat aaaaagacaa ccaacggccg actgcctgtg 1981 aagtggatgg cacccgaggc attatttgac cggatctaca cccaccagag tgatgtgtgg 2041 tctttcgggg tgctcctgtg ggagatcttc actctgggcg gctccccata ccccggtgtg 2101 cctgtggagg aacttttcaa gctgctgaag gagggtcacc gcatggacaa gcccagtaac 2161 tgcaccaacg agctgtacat gatgatgcgg gactgctggc atgcagtgcc ctcacagaga 2221 cccaccttca agcagctggt ggaagacctg gaccgcatcg tggccttgac ctccaaccag 2281 gagtacctgg acctgtccat gcccctggac cagtactccc ccagctttcc cgacacccgg 2341 agctctacgt gctcctcagg ggaggattcc gtcttctctc atgagccgct gcccgaggag 2401 ccctgcctgc cccgacaccc agcccagctt gccaatggcg gactcaaacg ccgctgactg 2461 ccacccacac gccctcccca gactccaccg tcagctgtaa ccctcaccca cagcccctgc 2521 tgggcccacc acctgtccgt ccctgtcccc tttcctgctg gcaggagccg gctgcctacc 2581 aggggccttc ctgtgtggcc tgccttcacc ccactcagct cacctctccc tccacctcct 2641 ctccacctgc tggtgagagg tggcaaagag gcagatcttt gctgccagcc acttcatccc 2701 ctcccagatg ttggaccaac acccctccct gccaccaggc actgcctgga gggcagggag 2761 tgggagccaa tgaacaggca tgcaagtgag agcttcctga gctttctcct gtcggtttgg 2821 tctgttttgc cttcacccat aagcccctcg cactctggtg gcaggtgcct tgtcctcagg 2881 gctacagcag tagggaggtc agtgcttcgt gcctcgattg aaggtgacct ctgccccaga 2941 taggtggtgc cagtggctta ttaattccga tactagtttg ctttgctgac caaatgcctg 3001 gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc ctggggccca 3061 gccccaaact gggggctctg tatatagcta tgaagaaaac acaaagtgta taaatctgag 3121 tatatattta catgtctttt taaaagggtc gttaccagag atttacccat cgggtaagat 3181 gctcctggtg gctgggaggc atcagttgct atatattaaa aacaaaaaag aaaaaaaagg 3241 aaaacgtttt taaaaaggtc atatattttt tgctactttt gctgttttat ttttttaaat 3301 tatgttctaa acctattttc agtttaggtc cctcaataaa aattgctgct gcttcaaaaa 3361 aaaaa // LOCUS HUMFGF3H 3503 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34186 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3503) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 527 2722 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 777 a 1044 c 1005 g 677 t ORIGIN 1 gcggaaccca aggacttttc tccggtccga gctcggggcg ccccgcaccg ggacggtacc 61 cgtgctgcag tcgggcacgc cgcgggcccg ccgggggcct ccgcagggcg atggagccgg 121 tctgcaagga aagtgaggcg ccgccgctgc gttctggagg aggggggcac aaggtctgga 181 gaccccgggt ggcggacggg agccctcccc ccgccccgcc tccggggcac cagctccggc 241 tccattgttc ccgcccgggc tggaggcgcc gagcaccgag cgccgccggg agtcgagcgc 301 cggccgcgga gtcttgcgac cccgccagga cccgaacaga gcccgggggc ggcgggccgg 361 agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc ccgaggcgga 421 acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg tggagtatcc 481 atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt ggagctggaa 541 gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta ggccgtcccc 601 gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc 661 ctcttcagag gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac 721 atccccagaa aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt 781 caaatgccct tccagtggga ccccaaaccc cacactgcgc tggttggaaa atggcaaaga 841 attcaaacct gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat 901 aatggactct gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta 961 cggcagcatc aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat 1021 cctgcaagca gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat 1081 gtgtaaggtg tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa 1141 tgggagcaag attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt 1201 taataccacc gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc 1261 aggggagtat acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt 1321 gaccgttctg gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga 1381 gatcatcatc tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt 1441 ctacaagatg aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa 1501 gctggccaag agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc 1561 catgaactct ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat 1621 gctagcaggg gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga 1681 cagactggtc ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga 1741 ggctatcggg ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt 1801 gaagtcggac gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa 1861 gatgatcggg aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc 1921 cttgtatgtc atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg 1981 gaggccccca gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc 2041 ctccaaggac ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc 2101 caagaagtgc atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt 2161 gatgaagata gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa 2221 gacaaccaac ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat 2281 ctacacccac cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct 2341 gggcggctcc ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg 2401 tcaccgcatg gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg 2461 ctggcatgca gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg 2521 catcgtggcc ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta 2581 ctcccccagc tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt 2641 ctctcatgag ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa 2701 tggcggactc aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc 2761 tgtaaccctc acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc 2821 tgctggcagg agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact 2881 cagctcacct ctccctccac ctcctctcca cctgctggtg agaggtggca aagaggcaga 2941 tcttttcact gccagccact tcatcccctc ccagatgttg gaccaacacc cctccctgcc 3001 accaggcact gcctggaggg cagggagtgg gagccaatga acaggcatgc aagtgagagc 3061 ttcctgagct ttctcctgtc ggtttggtct gttttgcctt cacccataag cccctcgcac 3121 tctggtggca ggtgccttgt cctcagggct acagcagtag ggaggtcagt gcttcgtgcc 3181 tcgattgaag gtgacctctg ccccagatag gtggtgccag tggcttatta attccgatac 3241 tagtttgctt tgctgaccaa atgcctggta ccagaggatg gtgaggcgaa ggccaggttg 3301 ggggcagtgt tgtggccctg gggcccagcc ccaaactggg ggctctgtat atagctatga 3361 agaaaacaca aagtgtataa atctgagtat atatttacat gtctttttaa aagggtcgtt 3421 accagagatt tacccatcgg gtaagatgct cctggtggct gggaggcatc agttgctata 3481 tattaaaaac aaaaaaaaaa aaa // LOCUS HUMFGF4H 2283 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) transmembrane form mRNA, complete cds. ACCESSION M34187 KEYWORDS FGF receptor; fibroblast growth factor receptor; transmembrane tyrosine kinase. SOURCE Human umbilical vein endothelial cell line HUVEC, cDNA to mRNA, clone h4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2283) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 417 1325 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 566 a 621 c 612 g 484 t ORIGIN 1 ggagcccggt ctgcaaggaa agtgaggcgc cgccgctgcg ttctggagga ggggggcaca 61 aggtctggag accccgggtg gcggacggga gccctccccc cgccccgcct ccggggcacc 121 agctccggct ccattgttcc cgcccgggct ggaggcgccg agcaccgagc gccgccggga 181 gtcgagcgcc ggccgcggag ctcttgcgac cccgccagga cccgaacaga gcccgggggc 241 ggcgggccgg agccggggac gcgggcacac gcccgctcgc acaagccacg gcggactctc 301 ccgaggcgga acctccacgc cgagcgaggg tcagtttgaa aaggaggatc gagctcactg 361 tggagtatcc atggagatgt ggagccttgt caccaacctc taactgcaga actgggatgt 421 ggagctggaa gtgcctcctc ttctgggctg tgctggtcac agccacactc tgcaccgcta 481 ggccgtcccc gaccttgcct gaacaagatg ctctcccctc ctcggaggat gatgatgatg 541 atgatgactc ctcttcagag gagaaagaaa cagataacac caaaccaaac cgtatgcccg 601 tagctccata ttggacatcc ccagaaaaga tggaaaagaa attgcatgca gtgccggctg 661 ccaagacagt gaagttcaaa tgcccttcca gtgggacccc aaaccccaca ctgcgctggt 721 tgaaaaatgg caaagaattc aaacctgacc acagaattgg aggctacaag gtccgttatg 781 ccacctggag catcataatg gactctgtgg tgccctctga caagggcaac tacacctgca 841 ttgtggagaa tgagtacggc agcatcaacc acacatacca gctggatgtc gtggagcggt 901 cccctcaccg gcccatcctg caagcagggt tgcccgccaa caaaacagtg gccctgggta 961 gcaacgtgga gttcatgtgt aaggtgtaca gtgacccgca gccgcacatc cagtggctaa 1021 agcacatcga ggtgaatggg agcaagattg gcccagacaa cctgccttat gtccagatct 1081 tgaaggtaat catggcacca gtcttcgtgg gccagtctac tgggaaggag accactgtct 1141 cgggggctca agttcctgtg ggcaggctca gttgcccccg aatgggatca ttcctcacgc 1201 ttcaggcaca cacactccat ctcagtaggg atctagccac atcccccagg actagtaaca 1261 gaggtcacaa agtggaggtg agctgggaac agagggctgc agggatgggt ggtgctggtc 1321 tgtaataagc tttgagagca acgtcactgg ggctttgggg tcagctacac aaggaaggca 1381 tttggacccc tgccttttca ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt 1441 cttagccctg tgttctgagt tacgtacgat ctttctggca aatggggtgc atgataagag 1501 catctcttac gaagagttgg aaaaacaaat gccatatata aattctaagc catatgagga 1561 cgaggagtaa tggcattttc ttcctttttc ctctcactcc cagacattca ttgtccctga 1621 atgctccatt aatccaggga aggtaattgc ctaaatctcc agtggatctc gcaacaggaa 1681 ggaaccagaa gctgggaaag ttgtttacct ctttgtccca gagttagacc tcatcctccc 1741 ctagcttagc tgtctcagag atatactggc cctcccttct cttctctttg ctgctggtgc 1801 taaaactgct ctgtaggtca ttggccactg tctccactca caacccctgc tccagtcctg 1861 gagggagtgg gttaaacaca aatagaacat tccatttgaa gcagtgattc tttttttttt 1921 tttttttttt taatcaaatg ctttggactt ttgaagtcca cttgttctgt acttgtaaaa 1981 gggaaagaag gccgggcgca gtcgtcacgc ctgtaatccc agcactttag atcacttgag 2041 gtcaggagtt tgagaccagc ccggccaaca tggtgaaacc ccatctctac taaaaataca 2101 aaaattagct gtgcatagtg gttggcacct gtagtcccag ctactcagga ggctgaggca 2161 agctaactgc ttgaacccag aaggcagagg ttgcagtgag ctgagatcac gccactgcac 2221 tccagcctgg gtgacagagt gagtgagact ctgcgttaaa aaaaaaaaaa aaaaaaaaaa 2281 aaa // LOCUS HUMFGF5H 1625 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor receptor (FGFr) secreted form mRNA, complete cds. ACCESSION M34188 KEYWORDS FGF receptor; fibroblast growth factor receptor. SOURCE Human female placenta endothelial cell line HUVEC, cDNA to mRNA, clone h5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1625) AUTHORS Johnson,D.E., Lee,P.L., Lu,J. and Williams,L.T. TITLE Diverse forms of a receptor for acidic and basic fibroblast growth factors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Johnson, 10-MAY-1990. Author address: D.E.Johnson University of California San Francisco 4th and Parnassus Howard Hughes Medical Institute San Francisco, CA 94143 (415) 476-4297 FEATURES from to/span description pept 523 1425 fibroblast growth factor receptor (FGFr) transmembrane form BASE COUNT 368 a 480 c 489 g 288 t ORIGIN 1 cggaacccaa ggacttttct ccggtccgag ctcggggcgc cccgcaggcg acggtacccg 61 tgctgcagtc gggcacgccg cgggcccggg gcctccgcag ggcgatggag cccggtctgc 121 aaggaaagtg aggcgccgcc gctgcgttct ggaggagggg ggcacaaggt ctggagaccc 181 cgggtggcgg acgggagccc tccccccgcc ccgcctccgg ggcaccagct ccggctccat 241 tgttcccgcc cgggctggag gcgccgagca ccgagcgccg ccgggagtcg agcgccggcc 301 gcggagctct tgcgaccccg ccaggacccg aacagagccc gggggcggcg ggccggagcc 361 ggggacgcgg gcacacgccc gctcgcacaa gccacggcgg actctcccga ggcggaacct 421 ccacgccgag cgagggtcag tttgaaaagg aggatcgagc tcactgtgga gtatccatgg 481 agatgtggag ccttgtcacc aacctctaac tgcagaactg ggatgtggag ctggaagtgc 541 ctcctcttct gggctgtgct ggtcacagcc acactctgca ccgctaggcc gtccccgacc 601 ttgcctgaac aagatgctct cccctcctcg gaggatgatg atgatgatga tgactcctct 661 tcagaggaga aagaaacaga taacaccaaa ccaaaccccg tagctccata ttggacatcc 721 ccagaaaaga tggaaaagaa attgcatgca gtgccggctg ccaagacagt gaagttcaaa 781 tgcccttcca gtgggacccc aaaccccaca ctgcgctggt tgaaaaatgg caaagaattc 841 aaacctgacc acagaattgg aggctacaag gtccgttatg ccacctggag catcataatg 901 gactctgtgg tgccctctga caagggcaac tacacctgca ttgtggagaa tgagtacggc 961 agcatcaacc acacatacca gctggatgtc gtggagcggt cccctcaccg gcccatcctg 1021 caagcagggt tgcccgccaa caaaacagtg gccctgggta gcaacgtgga gttcatgtgt 1081 aaggtgtaca gtgacccgca gccgcacatc cagtggctaa agcacatcga ggtgaatggg 1141 agcaagattg gcccagacaa cctgccttat gtccagatct tgaaggtaat catggcacca 1201 gtcttcgtgg gccagtctac tgggaaggag accactgtct cgggggctca agttcctgtg 1261 ggcaggctca gttgcccccg aatgggatca ttcctcacgc ttcaggcaca cacactccat 1321 ctcagtaggg atctagccac atcccccagg actagtaaca gaggtcacaa agtggaggtg 1381 agctgggaac agagggctgc agggatgggt ggtgctggtc tgtaataagc tttgagagca 1441 acgtcactgg ggctttgggg tcagctacac aaggaaggca tttggacccc tgccttttca 1501 ttgcccgaaa ccagagcctt tccaccaagc gtttcccagt cttagccctg tgtcctgagt 1561 tacgtacgat ctttctggca aatggggtgc atgataagag catctcttac gaagagttgg 1621 aaaaa // LOCUS HUMINTB1A 1146 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human integrin beta-1 subunit mRNA, 3' end (cytoplasmic domain). ACCESSION M34189 KEYWORDS integrin; integrin beta-1 subunit cytoplasmic domain. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1146) AUTHORS Altruda,F., Cervella,P., Tarone,G., Botta,C., Balzac,F., Stefanuto,G. and Silengo,L. TITLE A human integrin beta-1 subunit with a unique cytoplasmic domain JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Altruda, 10-MAY-1990. FEATURES from to/span description pept < 1 117 integrin beta-1 subunit (AA at 1) (cytoplasmic domain) BASE COUNT 373 a 184 c 222 g 367 t ORIGIN 1 aagcttttaa tgataattca tgacagaagg gagtttgcta aatttgaaaa ggagaaaatg 61 aatgccaaat gggacacggt aagttacaaa acatccaaaa agcaaagtgg cttataaagt 121 aaatgtaata ctcctaagac ttatgtatta gctgtcaggc tgattattaa agtcctttct 181 aagtatttta ttcccccaaa agtttcttac tcaaggaatt tgcatttagt gaaaaacaga 241 aagcatccta aatatatccc attgaaacaa aacattgatt ataagcatgt atattctggt 301 tcatgtggcc gatattttta tttctttaat gattttgatc ctaaatctgc cttttcatct 361 aatgtgaagt agaatcctaa ataatgttat ctgtgtagca agctattcaa tgggaaagct 421 gcttctttct ttaaaacaaa caaacaaaaa aaaccttcag tggaaagcca aattccaaaa 481 ggttatatac caagcttgtc caactcgcag ctcgtcggcc aggacatgca gcccagaata 541 gctttgaatg tggccccaac acaaatttgt aaactttctt agaaattgta attattatta 601 ttattttttt ttggtaactt tttttaaagc tcatcagcta tcgttagtgt attttatgtg 661 tggcccaaga cagttcttct tcttgccagt gtggcccagg gaagccaaaa gattggacac 721 ccctgctata tactatatga ttccatttag aggacattct ggaaaagcaa aactgtaggg 781 gcaaaaatca gtggttgcta ggggctggaa tgggggaaag tgttgaccac agaggggcgt 841 aagggatctt ccttgggatg acttgattgt gggtggattt atgtatttga aaactcacag 901 aactatgtac tttaaaaaga tgtatgttcc tctatgaaaa ttatatctca gtaaactttg 961 gcttataaaa atcttaaaag ccctaagtga ccgaaaggtt atgttagcat tgagtgcttt 1021 gaaatatgga gtcagagggt ggggtaacca aatgttggcc tttgtgtatt catcttttga 1081 tacaagaaag caatgccaat cttcagtatt tttaaattgt aaatgaattt tgtagttccc 1141 gaattc // LOCUS RATIRF1A 2078 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat interferon regulatory factor 1 (IRF-1) mRNA, complete cds. ACCESSION M34253 KEYWORDS interferon regulatory factor 1; transcription factor. SOURCE Rat cell line Nb2-11c T-cell, cDNA to mRNA, clones 25,4b. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2078) AUTHORS Yu-Lee,L.-Y., Hrachovy,J.A., Stevens,A.M. and Schwarz,L.A. TITLE Interferon regulatory factor 1 is an immediate-early gene under transcriptional regulation by prolactin in Nb2 T cells JOURNAL Mol. Cell. Biol. 3087, 3094 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.-Y.Yu-Lee, 11-MAY-1990. FEATURES from to/span description pept 198 1184 interferon regulatory factor 1 (IRF-1) BASE COUNT 542 a 544 c 540 g 452 t ORIGIN 1 ctcgacgaag gagtaggacg agctctcact gtctgagcca aaccgaaccg ggccgagctg 61 agccgaggtc agcggtggcc agaggaaccc agcatctcgg gcatcattcg ctccgtgcac 121 gcatcgtgta cctacaccgc aactccgtgc ctcattcccg ggtaccctct gtgactcgct 181 cctgcagcaa agccaccatg cctatcactc ggatgcgaat gagaccctgg ctagagatgc 241 agattaattc caaccaaatt ccagggctga gctggatcaa taaagaagag atgatcttcc 301 agatcccatg gaagcatgct gccttgcacg gttgggatat caacaaggat gcctgtctgt 361 tccggagctg ggccattcac acaggccgat acaaagctgg ggaaaaagag ccagatccca 421 agacttggaa ggcaaacttc cggtgtgcca tgaactccct accagacatc gaggaagtga 481 aggaccagag caggaacaag ggcagctctg ctgtacgcgt gtaccggatg ctgccacccc 541 tcaccaagaa ccagaggaaa gagagaaagt ccaagtccag ccgtgacact aagagcaaaa 601 ccaagaggaa gctgtgcgga gattctagcc ctgacacctt atctgacgga ctgagcagct 661 ctactctgcc tgatgaccac agcagttaca cagctcaggg atacctgggt caggacttgg 721 acatggacag ggacattacc ccagctctgt caccgtgcgt cgtcagcagc agtctctctg 781 agtggcatat gcagatggac atcatgccag acagcaccac tgatctgtac aacttgcagg 841 tgtcgcccat gccctccacc tctgaagctg caacagatga ggatgaggaa gggaagttac 901 ctgaggacat catgaagctc tttgaacagt ctgagtggca gccgacgcac gtggatggca 961 agggatactt gctcaatgaa ccaggagccc aactctctac tgtctatgga gacttcagct 1021 gcaaggagga accagagatc gacagccctg gaggggacat cgagataggc atacagcgtg 1081 tcttcacaga gatgaagaat atggaccccg tcatgtggat ggacaccctg ctgggcaact 1141 ctaccaggcc gccctccatt caggctattc cttgtgcacc ataatttggg tccctgaccc 1201 gttcttgccc tcctgagtga gctaggtcca gcatcatggt ggctgtgata caacataaag 1261 ctaaacttcc gtggacccct tgatgtggca aaacataatc ccattgccaa gcagggaagg 1321 gaccaaacca tcctccttgg gtcagtggac tgactcttca gagcttagga ggcagggtct 1381 aagtttttca agctggtcct gactcctagg aagatggatt ggcgttctga ggttagtgtg 1441 aggcagagga cctggacgga agttaccttc tagctctttg aaagcttcat tgcttagaga 1501 gggtctcacc actgggctgg cctgggggat agaccagcgc ccacagaaga gcattgcact 1561 ggccttaggg ctggctccac actgggagac aattgcacta agtcctattc ccaaagaact 1621 gctgcccttc ccaaccgagc cctgggatgg ttctagagcc agtgaaatgt gaaggaaaaa 1681 atggggtcct gtgagggttg tctcccttag cctcagaggg attctgcctc actccctgct 1741 ccagctgtgg ggctcaggaa aaaaaaatgg cactttctct gtggactttg ccacatttct 1801 gatcagaagt gtacactaac atttctccca agtcttggcc tttgcattta tttatatagt 1861 gccttgccct gtgcctgctg tctctcctca ggcctcagca gtcctcagca ggcccaggga 1921 gggggttgtg agcgccttgg cgtgactctg aacattggaa acgccaccta actactaagt 1981 tgtgtctgat ctcgtgtgga tctgtgtaaa tatgtatatt catcttttta taaaaaccta 2041 agttgtttaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS ACCRRSAA 1536 bp ss-rRNA RNA 17-JUL-1990 DEFINITION A.calcoaceticus 16S ribosomal RNA. ACCESSION M34139 KEYWORDS 16S ribosomal RNA. SOURCE A.calcoaceticus (strain 33604) ribosomal RNA. ORGANISM Acinetobacter calcoaceticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1536) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1536 16S ribosomal RNA BASE COUNT 387 a 329 c 460 g 330 t 30 others ORIGIN 1 ttaactgaag agtttgatca tggctcagat tgaacgctgg cggcaggctt aacacatgca 61 agtcgagcgg ggaaggttgc ttcggtaact gactagcggc ggacgggtga gtaatgctta 121 ggaatctgcc atttagtggg ggacaacatt ccgaanggaa tgctaatacc gcatacgtcc 181 tacaggagaa agcaggggat ctccggacct tgcgctaaat gatgagccta agtcggatta 241 gctagttggt ggggtaaagg cctaccaagg cgacgatctg tagcgggtct gagaggatga 301 tccgccacac tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata 361 ttggacaatg ggcgcaagcc ngatccagcc atgccgcgtg tgtgaagaag gccttttggt 421 tgtaaagcac tttaagcgag gaggaggctc tcttagttaa tacctaagat gagtggacgt 481 tactcgcaga ataagcaccg gctaactctg tgccagcagc cgcggtaata cagagngtgc 541 gagcgttaat cggatttact gggcgtaaag cgtgcgtagg cggcttttta agtcggatgt 601 gaaatccccg agcttaactt gggaattgca ttcgatactg ggaagctaga gtatgggaga 661 ggatggtaga attccaggtg tagcggtgaa atgcgtagag atctggagga ataccgatgg 721 cgaaggcagc catctggcct aatactgacg ctgaggtacg naagcatggg gagcaaacag 781 gattagatac cctggtagtc catgccgtaa acgatgtcta ctagccgttg gggcctttga 841 ggctttagtg gcgcagctaa cgcgataagt agactgcctg gggagtacgg tcgcaagact 901 aaaactcaaa tgaattgacg ggggcncgca caagcggtgg agcatgtggt ttaattcgat 961 gcaacgcgaa gaaccttacc tggccttgac atactagaaa ctttccagag atggattggt 1021 gccttcggga atctagatac aggtgctgca tggctgtcgt cagctcgtgt cgtgagatgt 1081 tgggttaagt cccgcaacga gcgcaaccct tttccttact tgccagcatt tcggatggga 1141 actttaagga tactgccagt gacaaactgg aggaaggcgg ggacgacgtc aagtcatcat 1201 ggcccttacg gctagggcta cacacgtgct acaatggtcg gtacaaaggg ttgctaccta 1261 gcgataggat gctaatctca aaaagccgat cgtagttcgg attggagtct gcaactcgac 1321 tccatgaagt cggaatcgct agtaatcgcg gatcagaatg ccgcggtgaa tacgttcccg 1381 ggccttgtac acaccgcccg tcacaccatg ggagtttgtt gcaccagaag tagctagcct 1441 aactgcaaag agggcggtta ccacggtgtg gccgatgact agggnnnnnn ngtaacaagn 1501 nnnnnnnnnn ngaacctgnn nnnngatcac ctcctt // LOCUS BDERRSAA 1553 bp ss-rRNA RNA 17-JUL-1990 DEFINITION B.stolpii 16S ribosomal RNA. ACCESSION M34125 KEYWORDS 16S ribosomal RNA. SOURCE B.stolpii (strain uki-2) ribosomal RNA. ORGANISM Bdellovibrio stolpii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1553) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept Microbiology, 131 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1553 16S ribosomal RNA BASE COUNT 381 a 320 c 429 g 316 t 107 others ORIGIN 1 agcatnnaga gtttgatcct ggctcagaac gaacgctggc ggcgtgccta acacatgcaa 61 gtcgaacgtg aaagtccttc gggatgagta aagtggcgca cgggtgagta acacgtaggt 121 gacctgcctt ttagagggga ataaccagaa gaaattttgg ctaatgccgc atacgaagca 181 cggttttaag actgtgcttg aaagaatgcc tctgcatatg ngcattcgct attagatggg 241 cctgcgggac attagctagt tggtggggta aaggcctacc aaggcgacga tgtctatccg 301 gtctgagagg atgatcggac acactggaac tgagacacgg tccagactcc tacgggaggc 361 agcagtgggg aatattgcgc aatgggggaa accctgacgc agcaacgccg cgtgagtgag 421 gaaggacttc ggtctgtaaa gctctgttaa tgtggaaaaa tggcagttgg tctaataggc 481 cnattgtttg atggtacaca tagaggaagc accggctaac ttcgtgccag cagccgcggt 541 aatacgaagg gtgcnagcgt tgttcggatt tattgggcgt aaagcgcgcg taggcggacc 601 tgcaagtcag atgtgaaatc tcggggctca acctcgaaac tgcgtctgaa actacaggtc 661 tagaatctcg gagggggaag gggaatatcg catgtagggg taaaatccgt agatatgcga 721 tggaacacca gaggcgaagg cgccttcctg gacgagtatt gacgctgagg cncnnaagcg 781 tggggatcaa acaggattag ataccctggt agtccacgct gtaaacgatg aacactagat 841 attggaggat ttgacccctt cagtgtcgta gctaacgcgt caagtgttcc gcctgggaag 901 tacggtcgca agactaaaac tcaaaggaat tgnnnnnnnn nngcacaagn nnnngattat 961 gnngtttaat tcgnngcaac gcgcagaacc ttacctaggc ttgaaatcct acgaatccct 1021 tttaaacgag ggagtgctct tcggagaatg tagtgacagg cgctgcatgg ctgtcgtcag 1081 ctcgtgtcgt gagatgttgg gttaagtctc gcaacgagcg caacccccat ttttagttgc 1141 cagcattaag ttgggcactc tagaaagact gcntgggcta accaggagga aggtggggat 1201 gacgtcaagt cctcatggcc cttatgtcta gggctacaca cgtaatacaa tggtcggtac 1261 aaagggatgc gaactcgcga gggggagcca atctcaaaaa accgatctca gtccggattg 1321 gagtctgcaa ctcgactcca tgaagttgga atcgcgagta atcgcggatc agcacgccgc 1381 ggtgaatacg ttcccgggcc ttgtacacac cgcccgtcac accatgggag ttgtttttac 1441 ctgaagnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1501 nnnnnnngta acaagnnnnn nnnnnnngaa cctgnnnnnn gatcacctcc ttt // LOCUS PLTRRSAA 1525 bp ss-rRNA RNA 17-JUL-1990 DEFINITION P.staleyi 16S ribosomal RNA. ACCESSION M34126 KEYWORDS 16S ribosomal RNA. SOURCE P.staleyi (strain ATCC 27377) ribosomal RNA. ORGANISM Planctomyces staleyi Prokaryota; Bacteria; Eubacteriomycetes; Eubacteriales. REFERENCE 1 (bases 1 to 1525) AUTHORS Woese,C.R. and Oyalzu,H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 15-MAY-1990. Author address: C.R.Woese University of Illinois Dept Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1525 16S ribosomal RNA BASE COUNT 376 a 348 c 482 g 315 t 4 others ORIGIN 1 caattgaaga gtttgatcct ggctcagaat gaacgttggc ggcatggatt aggcatgcaa 61 gtcgtgcgcg atatgtagca atacatggag agcggcgaaa gggagagtaa tacgtaggaa 121 cctaccttcg ggtctgggat agcggcggga aactgccggt aataccagat gatgtttccg 181 aaccaaaggt gtgattccgc ctgaagaggg gcctacgtcg tattagctag ttggtagggt 241 aatggcctac caaggcaaag atgcgtatgg ggtgtgagag catgccccca ctcactggga 301 ctgagacact gcccagacac ctacgggtgg ctgcagtcga gaatcttcgg caatgggcga 361 aagcctgacc gagcgatgcc gcgtgcggga tgaaggcctt cgggttgtaa accgctgtcg 421 taggggatga agtgctaggg ggttctccct ctagtttgag ctgaacctag gaggaagggc 481 cggctaatct cgtgccanna gccgcggtaa tacgagaggc ccaaacgtta ttcggattta 541 ctgggcttaa agagttcgta ggcggtcttg taagtggggt gtgaaatccc tcggctcaac 601 cgaggaactg cgctccaaac tacaagactt gagggggata gaggtaagcg gaactgatgg 661 tggagcggtg aaatgcgttg atatcatcag gaacaccgga ggcgaaggcg gcttactggg 721 tcctttctga cgctgaggaa cgaaagctag gggagcaaac gggattagat accccggtag 781 tcctagccgt aaacgatgag cactggaccg gagctctgca cagggtttcg gtcgtagcga 841 aagtgttaag tgctccgcct ggggagtatg gtcgcaaggc tgaaactcaa aggaattgac 901 gggggctcac acaagcggtg gaggatgtgg cttaattcga ggctacgcga agaaccttat 961 cctagtcttg acatgcttag gaatcttcct gaaagggagg agtgctcgca agagagcctt 1021 tgcacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgtcgggt taagtccctt 1081 aacgagcgaa acccttgtcc ttagttacca gcgcgtcatg gcggggactc taaggagact 1141 gccggtgtta aaccggagga aggtggggat gacgtcaagt cctcatggcc tttatgatta 1201 gggctgcaca cgtcctacaa tggtgcacac aaagcgacgc aaactcgtga gagccagcta 1261 atcgcaaaaa atgtacctca gttcggattg caggctgcaa ctcgcctgca tgaagctgga 1321 atcgctagta atcgcgggtc agcataccgc ggtgaatntg ttcctgagcc ttgtacacac 1381 cgcccntcaa gccacgaaag tgggggggac ccaacagcgc tgccgtaacc gcaaggaaca 1441 aggcgcctaa ggtcaactcc gtgattggga ctaagtcgta acaaggtagc cgtaggggaa 1501 cctgcggctg gatcacctcc tttct // LOCUS RDCRRSAA 1478 bp ss-rRNA BCT 17-JUL-1990 DEFINITION R.purpureus 16S ribosomal RNA. ACCESSION M34132 KEYWORDS 16S ribosomal RNA. SOURCE R.purpureus (strain 6770) ribosomal RNA. ORGANISM Rhodocyclus purpureus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1478) AUTHORS Woese,C.R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Woese, 09-MAY-1990. Author address: C.R.Woese University of Illinois Dept. Microbiology 131 Burrill Hall 407 S. Goodwin Ave. Urbana, IL 61801 (217) 333-9369 FEATURES from to/span description rRNA 1 1478 16S ribosomal RNA BASE COUNT 367 a 345 c 470 g 289 t 7 others ORIGIN 1 tgaactgaag agtttgatcc tggctcagat tgaacgctgg cggcatgcct tacacatgca 61 agtcgaacgg taacgggncc ttcgggcgcc gaacgagtgg cgaacgggtg agtaatgcat 121 cggaacatgc cctgaagtgg gggataacgt agcgaaagtt acgctaatac cgcatattct 181 gtgagcagga aagcagggga ccttcgggcc ttgcgctttg ggagtggccg atgtcggatt 241 agctagttgg tggggtaaaa gcctaccaag gcaacgatcc gtagcgggtc tgagaggatg 301 atccgccaca ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat 361 tttggacaat gggcgaaagc ctgatccagc catgccgcgt gagtgaagaa ggccttcggg 421 ttgtaaagct ctttcggcgg ggaagaaatc gggtttccta atacggaacc cggatgacgg 481 tacccgaaga agaagcaccg gctaactacg tgccagcagc cgcggtaata cgtagggtgc 541 nagcgttaat cggaattact gggcgtaaag cgtgcgcagg cggttgtgta agacagacgt 601 gaaatccccg ggctcaacct gggaactgcg tttgtgactg cacagctaga gtacggcaga 661 ggggggtgga attccacgtg tagcagtgaa atgcgtagag atgtggagga acaccgatgg 721 cgaaggcagc cccctgggcc aatactgacg ctcatgcacg naagcgtggg gagcaaacag 781 gattagatac cctggtagtc cacgccctaa acgatgtcaa ctaggtgttg gtggggttaa 841 acccattagt gccgtagcta acgcgtgaag ttgaccgcct ggggagtacg gcggcaaggt 901 taaaactcaa aggaattgac gggganccgc acaagcggtg gatgatgtgg attaattcga 961 tgcaacgcga aaaaccttac ctacccttga catgtcagga atcctgagga gactcgggag 1021 tgcccgaaag ggnacctgaa cacaggtgct gcatggcngt cgtcagctcg tgtcgtgaga 1081 tgttgggtta agtcccgcaa cgagcgcaac ccttgtcatt aattgccatc attcagttgg 1141 gcactttaat gaaactgccg gtgacaaacc ggaggaaggt ggggatgacg tcaagtcctc 1201 atggccctta tgggtagggc ttcacacgtc atacaatggt cggtccatag ggttgcnaac 1261 ccgcgagggg gagctaatcc cagaaagccg atcgtagtcc ggattgcagt ctgcaactcg 1321 actgcatgaa gtcggaatcg ctagtaatcg cggatcagca tgtcgcggtg aatacgttcc 1381 cgggtcttgt acacaccgcc cgtcacacca tgggagcggg ttctgccaga agtagttagc 1441 ctaaccgcaa ggagggcgat taccacggca gcgttcgt // LOCUS RATSVPIIA 4161 bp ds-DNA ROD 17-JUL-1990 DEFINITION Rat seminal vesicle secretion II protein (SVS II) gene, complete cds. ACCESSION J05443 KEYWORDS seminal vesicle secretion II protein. SOURCE Rat (strain CHARLES RIVER) male seminal vesicle epithelial cell DNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4161) AUTHORS Harris,S.E., Harris,M.A., Johnson,C.M., Bean,M.F., Dodd,J.G., Matusik,R.J., Carr,S.A. and Crabb,J.W. TITLE Structural characterization of the rat seminal vesicle secretion II protein and gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.E.Harris, 25-APR-1990. FEATURES from to/span description pept 2065 2137 seminal vesicle secretion II protein (SVS II) precursor, exon 1 2377 3548 seminal vesicle secretion II protein precursor, exon 2 sigp 2065 2130 seminal vesicle secretion II protein signal peptide matp 2131 3545 seminal vesicle secretion II protein signal 1934 1946 CAAT box signal 2013 2020 TATA box BASE COUNT 1243 a 902 c 822 g 1194 t ORIGIN 1 tttcgatcca atgtgtggat tactcaccaa gtgtctgtct ttctttcttt ctctctttct 61 ctctttcttc ctcccttcct tccttccttc cttccttcct tccttccttc cttccttcct 121 ttctgttcaa ttgctcgttt ctcccttcat ctctcgccag tataccgcac actcaaactt 181 aaattttcat ttcaatgcgt tctcttctgg cacgtgcagc ataattacac tcatgattgt 241 caactccgtg atctgtttgc acaccttacc ccccccccca aggttttatc tgcatttaaa 301 aaaaagggat tatcaagaaa tttattctta attcagaaat gtgatcaaag ttgtcagatc 361 cgttctttac tgcctcctgt tggaaaaaaa aaatatccag ttcctggatt tttctaaaac 421 acagaaaaga gacctgggac aggggtatag gattgagcag gcatggtgag caattttata 481 ctgaatagat tcattgtgac ttaccggtct cctgagggaa ataatcactt ttcccaggta 541 gagagcagcc tagcaagaga tcagagtgca agcataaaac ccatgtgctt tataagtgta 601 tttattttat gcattttctg tttataagga catgagtgga ctttttattt gtcccttcca 661 tacaggacta cctagactat tgggatggga tgactgaaaa tatgttttca agtagacttc 721 cttccggaac taccttcata tggttctgaa ggcaaagtgg aacactgcac gggtgtcctc 781 ttctcccaag aacttggcca tggcgtcgtc gttttgagtc tatgtctgag ccacgaatgc 841 cataacagcc cttcctgtta ctctcacagt ggcacagagc tgtttctaaa caagaaggaa 901 gtcttccatc ttgtgtcagg atgctaatga cgtcaccaat ggcagtaagt gttcaccaca 961 gcccgttgct aaggcaatta tgttatccct cctgtcagag tttcctgtat taaaatatac 1021 tgagtttaat tttatgtcgg attccatgac atacattcag caaggaaacc aacagtatct 1081 tttgttcttt caacagtgat ttcctgtcac catttaactg ttgtctcgcc cccattcttt 1141 aaaatgtctc tgcacctcac cttgcctccc agatacactc ccaaactcat ttccctggac 1201 acacttgaaa tgttgctgct agcaagccac agctaccacg tcttctctgt cagggttcta 1261 gacaactcat ctaaagcagc accaggtctc tcttagaaat cagacatcgg atgtcatggt 1321 catagtatac ctcacagcta ctttggacat tcatgggccc agtattattt tccagggctg 1381 aggtttaact caagagcctc atgctcacat ggctggtggt ctggccacac agctatgact 1441 cgtctccatt tattcttcaa acttttattc ggagctccgt tgtgttctgt tgtctcctgt 1501 gcctttctat atgtgtgact gctcctttgc ctgtaaatga gaagctatgt caaattcaac 1561 gtaaaaaagg caacttcatg ggcttctgtg agatagcatg ctaaacagtg tcagctccac 1621 tacactgtga ccaggaaaat ttgatcaggc cctggttact ctcggagcat aaaagaagaa 1681 aaaaaaatct cttccccgct ctactctgga ttttgtttga aaataaaagg tccaatctgt 1741 ccttataaaa catgcataga ataaatatac tagaaaacac actttgtttg caaagggtat 1801 gtgataaagt cagagggttg ataaagattt gctgaggctt atgacataga aaaggtccct 1861 gacattgcat ccctgtgcaa agtacctggg aacattacca atgtccccaa ctgtgcagag 1921 gggaggaagt tgacatttag agataatttt tttaaaaaag caggcagtgc ttttgtagtg 1981 tcagttatat ctgtaataca tccagctaga gatatataaa tgtgaaagtc agctcagctc 2041 tcagtgaagg tccttcttga caagatgaag tcctctgtct tcattctatc tctgttcctc 2101 cttctggaaa gacaggcagc tgtggttgga cagtatggtg agtagggaga tggtgactag 2161 agggaaagtc actcagggag aatgttttta agggtgctct gggagtagca gatcctttca 2221 taggggaatt tttttttaaa tgagacctaa ttcttctcta ctgaaaacca aaacccttgt 2281 gggaacatca atggttttat gaggaaattt tggaaatgag acttggaagg actgtgcaga 2341 tcatgtaact taaaccttcc tcctctcaat taccaggtgg gacaaaaggt cacttccaga 2401 gcagctcatc agggtttatg cttggtcaga aaggccacct caattttggg ctcaaaggag 2461 gaagtgagga agcagctgaa gaaagcattt tcatgcaatc acaacaccag atgttcggcc 2521 aggatggtgg tgacatggcg cagacaagtg tttcacaaga gcatacaggt gtaaaggggg 2581 ccgcgatttg tcgtaaagga caagtatccc aattgaaatc ccaagaatcc caaataaaat 2641 cctttagaca agtaaaatcc agtggacagc tgaaatctgg aggatcccaa ttaaaatcct 2701 ttggacaagt gaaatccagt gagtcccaat taaaatcctt tggccaagtg aaagccagtg 2761 ggtcccaatt aaaatccttc ggacaagtga aagccagtgg gtcccaatta aaatcctatg 2821 gacaaatgaa atccagtggg tcccaagtga aatcctttgg acaaatgaaa tccagtgggt 2881 cccaagtaaa atcctttgga caaatgaaag ccagtgagtc ccaaataaaa tcctttggac 2941 aaagaaaatc ccaaggtggt caactacaat cctatggcca aatgaaatcc tatgggcaga 3001 cgaaatccct agaatcccag gccaaatcct tcggacaagt aaagtcccaa agtggccaaa 3061 tgaaatcctc ctatggtcag agaaaatcct atggtgaaga gactcaactg aagtctttcg 3121 accaagatgc ccaactaaaa tcctatggtc aacaaaaatc ccaaaaacaa tcctccttta 3181 gccaagtaaa atctcaaagt gcccaactaa agtcctttgg ccaacaaaaa tccctcaaag 3241 ggttttctca acaaactcaa cagaaaggat ttgccatgga tgaagatttg tcacaagtgc 3301 ggaaacaatt tgacgatgat gacctctctg tacaacagaa gtctacccaa cagatgaaaa 3361 cagaggaaga cttatcccaa tttggacaac aacgacaatt tggacaagaa cgctcccaat 3421 cctataaagg atatcttgca caatacagaa agaaattaca ggaacaacaa caacagaaaa 3481 attttaatca ggataacttt tttacaaagg gaggggcagg cctatatcag gctcaactta 3541 agggataaca tattcactga gcaactgaag accaagatca atgtcaaggt atgttccacc 3601 aagtaggaag atattatcca aatttacttg tggtatatag gaatcctgga tccattatgg 3661 attgataccc atttgttact atcagtagaa gtattgttac acacttttag aaggatgaag 3721 aacagaccct ggtaaaatga gtccttgtag agtaaaggca gagtaagcaa gctaagttaa 3781 caattggtcc tgaattacta cattcaggga gcacttttca gtgcttctct gagcacagac 3841 agtttatatt attaaatgtg taccacctat gcaatcatat ttaacatttc atgatggaat 3901 cttatttgtt cttacacttt gacttgataa aaaaaaaaga ttggtttctt gcttatattg 3961 gtataaggtg gtattgcagc tgagctcttt ctctacacca gtgcgtgttc ttgagtcccc 4021 tgggacctct gctttccatc acaatccatg gggttaagga ttagctgctt ttccatcaga 4081 tggaagattt ggttacaaag atctctgcct ggagcagaca ctatattcag ttgtatgtcc 4141 aatggtgacc ctgttgaatt c // LOCUS CREAPCYN 577 bp ss-mRNA PLN 17-JUL-1990 DEFINITION C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds. ACCESSION J05524 KEYWORDS apoplastocyanin. SOURCE C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone PC6-2. ORGANISM Chlamydomonas reinhardtii Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae. REFERENCE 1 (bases 1 to 577) AUTHORS Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and Bogorad,L. TITLE Isolation and characterization of a complementary DNA clone for an algal pre-apoplastocyanin JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Hill, 11-MAY-1990. FEATURES from to/span description pept 22 459 apoplastocyanin (PC6-2) precursor sigp 22 162 apoplastocyanin signal peptide matp 163 456 apoplastocyanin BASE COUNT 91 a 189 c 171 g 126 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc 61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt 121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg 181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag 241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc 301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc 361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac 421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat 481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac 541 gcagagagcg tgtagctctg taactttttc ggaattc // LOCUS MHVNSGII 870 bp ss-RNA VRL 17-JUL-1990 DEFINITION Murine hepatitis virus non-structural protein gene-2 (NS2). ACCESSION M34035 KEYWORDS non structural protein. SOURCE Murine hepatitis virus (strain MHV-JHM), cDNA to viral RNA. ORGANISM Murine hepatitis virus A59 Unclassified. REFERENCE 1 (bases 1 to 870) AUTHORS Schwarz,B., Routledge,E. and Siddell,S.G. TITLE The coronavirus MHV 30 kDa non-structural protein NS2 is not essential for virus replication in transformed murine cells JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.G.Siddell, 04-MAY-1990. Author address: S.G.Siddell Inst of Virology Univ of Wuerzburg Versbacherstrasse 7 8700 Wuerzburg FEATURES from to/span description pept 40 837 non-structural protein-2 (NS2) BASE COUNT 269 a 147 c 187 g 267 t ORIGIN 1 gcgatagcct agtaaatgtt aaataaatct atacttgtca tggctgcgag aatggccttt 61 gctgacaagc ctaatcattt tataaacttt cctctagccc aatttagtgg ctttatgggt 121 aagtatttaa agcttcagtc tcaacttgtg gaaatgggtt tggactgtaa attacaaaag 181 gtaccacatg ttagtattac cctgcttgac attaaagcag accaatacaa acaggtggaa 241 tttgcaatac aagaaataat agatgatctg gcggcatatg agggagatat tgtctttgac 301 aaccctcata tgcttggcag atgtcttgtt cttgatgtta aaggatttga agagttgcat 361 gaagatattg ttgaaattct ccgcagaagg ggttgcactg cagatcaatc cagacaatgg 421 attccgcact gcactgtggc ccaatttgat gaagaaaaag aaataaaaga aatgcaattc 481 tattttaaat tgcccttcta tctcaagcat aacaacctac ttacggatgc taggcttgag 541 cttgtgaaga taggttcttc caaagtaggt gggttttatt gtagtgaact aagtatttgg 601 tgtggtgaga gactttgtta caagccccca acccccaaat tcagtgatat atttggctat 661 tgctgcatag ataaaatacg tggtgattta gaaataggag acctaccgcc agatgatgag 721 gaagcgtggg ccgagctaag ttaccactat caaagaaaca cctacttctt cagacatgtg 781 cacgataata gtatctattt tcgtaccgta tgtagaatga agggttgtat gtgttgattt 841 gtttttacac tattagtgta ataaacttat // LOCUS HUMFGF1A 3343 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human fibroblast growth factor (FGF) receptor-1 mRNA, complete cds. ACCESSION M34641 KEYWORDS FGF receptor-1; fibroblast growth factor receptor; transmembrane glycoprotein; tyrosine kinase. SOURCE Human teratocarcinoma cell line Tera-2, clone 13, cDNA to mRNA, clone 1:1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3343) AUTHORS Wennstroem,S., Sandstroem,C. and Claesson-Welsh,L. TITLE cDNA cloning and expression of a human FGF receptor which binds acidic and basic FGF JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Claesson-Welsh, 25-MAY-1990. Author address: L.Claesson-Welsh Ludwig Institute for Cancer Research Biomedical Center Box 595 S-751 24 Uppsala SWEDEN FEATURES from to/span description pept 10 2472 FGF receptor-1 precursor sigp 10 72 FGF receptor-1 signal peptide matp 73 2469 FGF receptor-1 BASE COUNT 766 a 957 c 911 g 709 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcggga tgtggagctg gaagtgcctc ctcttctggg ctgtgctggt cacagccaca 61 ctctgcaccg ctaggccgtc cccgaccttg cctgaacaag cccagccctg gggagcccct 121 gtggaagtgg agtccttcct ggtccacccc ggtgacctgc tgcagcttcg ctgtcggctg 181 cgggacgatg tgcagagcat caactggctg cgggacgggg tgcagctggc ggaaagcaac 241 cgcacccgca tcacagggga ggaggtggag gtgcaggact ccgtgcccgc agactccggc 301 ctctatgctt gcgtaaccag cagcccctcg ggcagtgaca ccacctactt ctccgtcaat 361 gtttcagatg ctctcccctc ctcggaggat gatgatgatg atgatgactc ctcttcagag 421 gagaaagaaa cagataacac caaaccaaac cccgtagctc catattggac atccccagaa 481 aagatggaaa agaaattgca tgcagtgccg gctgccaaga cagtgaagtt caaatgccct 541 tccagtggga ccccaaaccc cacactgcgc tggttgaaaa atagcaaaga attcaaacct 601 gaccacagaa ttggaggcta caaggtccgt tatgccacct ggagcatcat aatggactct 661 gtggtgccct ctgacaaggg caactacacc tgcattgtgg agaatgagta cggcagcatc 721 aaccacacat accagctgga tgtcgtggag cggtcccctc accggcccat cctgcaagca 781 gggttgcccg ccaacaaaac agtggccctg ggtagcaacg tggagttcat gtgtaaggtg 841 tacagtgacc cgcagccgca catccagtgg ctaaagcaca tcgaggtgaa tgggagcaag 901 attggcccag acaacctgcc ttatgtccag atcttgaaga ctgctggagt taataccacc 961 gacaaagaga tggaggtgct tcacttaaga aatgtctcct ttgaggacgc aggggagtat 1021 acgtgcttgg cgggtaactc tatcggactc tcccatcact ctgcatggtt gaccgttctg 1081 gaagccctgg aagagaggcc ggcagtgatg acctcgcccc tgtacctgga gatcatcatc 1141 tattgcacag gggccttcct catctcctgc atggtggggt cggtcatcgt ctacaagatg 1201 aagagtggta ccaagaagag tgacttccac agccagatgg ctgtgcacaa gctggccaag 1261 agcatccctc tgcgcagaca ggtaacagtg tctgctgact ccagtgcatc catgaactct 1321 ggggttcttc tggttcggcc atcacggctc tcctccagtg ggactcccat gctagcaggg 1381 gtctctgagt atgagcttcc cgaagaccct cgctgggagc tgcctcggga cagactggtc 1441 ttaggcaaac ccctgggaga gggctgcttt gggcaggtgg tgttggcaga ggctatcggg 1501 ctggacaagg acaaacccaa ccgtgtgacc aaagtggctg tgaagatgtt gaagtcggac 1561 gcaacagaga aagacttgtc agacctgatc tcagaaatgg agatgatgaa gatgatcggg 1621 aagcataaga atatcatcaa cctgctgggg gcctgcacgc aggatggtcc cttgtatgtc 1681 atcgtggagt atgcctccaa gggcaacctg cgggagtacc tgcaggcccg gaggccccca 1741 gggctggaat actgctacaa ccccagccac aacccagagg agcagctctc ctccaaggac 1801 ctggtgtcct gcgcctacca ggtggcccga ggcatggagt atctggcctc caagaagtgc 1861 atacaccgag acctggcagc caggaatgtc ctggtgacag aggacaatgt gatgaagata 1921 gcagactttg gcctcgcacg ggacattcac cacatcgact actataaaaa gacaaccaac 1981 ggccgactgc ctgtgaagtg gatggcaccc gaggcattat ttgaccggat ctacacccac 2041 cagagtgatg tgtggtcttt cggggtgctc ctgtgggaga tcttcactct gggcggctcc 2101 ccataccccg gtgtgcctgt ggaggaactt ttcaagctgc tgaaggaggg tcaccgcatg 2161 gacaagccca gtaactgcac caacgagctg tacatgatga tgcgggactg ctggcatgca 2221 gtgccctcac agagacccac cttcaagcag ctggtggaag acctggaccg catcgtggcc 2281 ttgacctcca accaggagta cctggacctg tccatgcccc tggaccagta ctcccccagc 2341 tttcccgaca cccggagctc tacgtgctcc tcaggggagg attccgtctt ctctcatgag 2401 ccgctgcccg aggagccctg cctgccccga cacccagccc agcttgccaa tggcggactc 2461 aaacgccgct gactgccacc cacacgccct ccccagactc caccgtcagc tgtaaccctc 2521 acccacagcc cctgctgggc ccaccacctg tccgtccctg tcccctttcc tgctggcagg 2581 agccggctgc ctaccagggg ccttcctgtg tggcctgcct tcaccccact cagctcacct 2641 ctccctccac ctcctctcca cctgctggtg agaggtgcaa agaggcagat ctttgctgcc 2701 agccacttca tcccctccca gatgttggac caacacccct ccctgccaca gcatcgcctg 2761 gagggcaggg agtgggagcc aatgaacagg catgcaagtg agagcttcct gagctttctc 2821 tgtcggtttg gtctgttttg ccttcaccca taagcccctc gcactctggt ggcaggtgcc 2881 ttgtcctcag ggctacagca gtagggaggt cagtgcttcg tgcctcgatt gaaggtgacc 2941 tctgccccag ataggtggtg cagtggctta ttaattccga tactagtttg ctttgctgac 3001 caaatgcctg gtaccagagg atggtgaggc gaaggccagg ttgggggcag tgttgtggcc 3061 ctggggccag ccccaaactg ggggctctgt atatagctat gaagaaaaca caaagtgtat 3121 aaatctgagt atatatttac atgtcttttt aaaagggtcg ttaccagaga tttacccatc 3181 gggtaagatg ctcctggtgg ctgggaggca tcagttgcta tatattaaaa acaaaaaaga 3241 aaaaaaagga aaatgttttt aaaaaggtca tatatttttt gctacttttg ctgttttatt 3301 tttttaaatt atgttctaaa ctcgtgccgc tcgtgccgaa ttc // LOCUS ECOFADA 1594 bp ds-DNA BCT 17-JUL-1990 DEFINITION E.coli 3-ketoacyl-coenzyme A thiolase (fadA) DNA. ACCESSION J05498 KEYWORDS 3-ketoacyl-coenzyme A thiolase. SOURCE E.coli (strain K12) DNA, clone pK52. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1594) AUTHORS Yang,S.-Y., Yang,X.-Y.H., Healy-Louie,G., Schulz,H. and Elzinga,M. TITLE Nucleotide sequence of the fadA gene: Primary structure of 3- ketoacyl-coenzyme A thiolase from Escherichia coli and the structural organization of the fadAB operon JOURNAL J. Biol. Chem. 265, 10424-10429 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-Y.Yang, 11-MAY-1990. FEATURES from to/span description pept 110 1276 3-ketoacyl-coenzyme A thiolase (fadA) precursor (E.C. 2.3.1.16) sigp 110 139 3-ketoacyl-coenzyme A thiolase signal peptide matp 140 1273 3-ketoacyl-coenzyme A thiolase BASE COUNT 329 a 425 c 489 g 351 t ORIGIN 86 min on K12 map. 1 tgaagtgccg gaaggtctgc gtaataaagc gcgtcataac gaaccgtact atcctccggt 61 tgagccagcc cgtccggttg gcgacctgaa aacggcttaa ggagtcacaa tggaacaggt 121 tgtcattgtc gatgcaattc gcaccccgat gggccgttcg aagggcggtg cttttcgtaa 181 cgtgcgtgca gaagatctct ccgctcattt aatgcgtagc ttgctggcgc gtaacccggc 241 gctggaagcg gcggccctcg acgatattta ctggggttgt gtgcagcaga cgctggagca 301 gggttttaat atcgcccgta acgcggcgct gctggcagaa gtaccacact ctgtcccggc 361 ggttaccgtt aatcgcttgt gtggttcatc catgcaggca ctgcatgacg cagcacgaat 421 gatcatgact ggcgatgcgc aggcatgtct ggttggcggc gtggagcata tgggccatgt 481 gccgatgagt cacggcgtcg attttcaccc cggcctgagc cgcaatgtcg ccaaagcggc 541 gggcatgatg ggcttaacgg cagaaatgct ggcgcgtatg cacggtatca gccgtgaaat 601 gcaggatgcc tttgccgcgc ggtcacacgc ccgcgcctgg gccgccacgc agtcggccgc 661 atttaaaaat gaaatcatcc cgaccggtgg tcacgatgcc gacggcgtcc tgaagcagtt 721 taattacgac gaagtgattc gcccggaaac caccgtggaa gccctcgcca cgctgcgtcc 781 ggcgtttgat ccagtaaacg gtatggtaac ggcgggcaca tcttctgcac tttccgatgg 841 cgcagctgcc atgctggtga tgagtgaaag ccgcgcccat gaattaggtc ttaagccgcg 901 cgctcgtgtg cgttcgatgg cggtcgttgg ttgtgaccca tcgattatgg gttacggccc 961 ggttccggcc tcgaaactgg cgctgaaaaa agcggggctt tctgccagcg atatcggcgt 1021 gtttgaaatg aacgaagcct ttgccgcgca gatcctgcca tgtattaaag atctgggact 1081 aattgagcag attgacgaga agatcaacct caacggtggc gcgatcgcgc tgggtcatcc 1141 gctgggttgt tccggtgcgc gtatcagcac cacgctgctg aatctgatgg aacgcaaaga 1201 cgttcagttt ggtctggcgg acggatgtgt atcgggtctg ggtcagggta ttgcgacggt 1261 gtttgagcgg gtttaaccaa cttgccggat tgcggccgtt gaacggcctt atccggtcta 1321 ccgatccggc accaatgtag gcctgataag acgcgccagc gtcgcatcag gcatcatgca 1381 ccaattgccg gatgcggcgt aaacgcctta tccggctacg gttcggacta tttgtaggca 1441 cgataagacg cgacagcgtc gcatcaggca tcatgcacca attgccggat gcggcgtaaa 1501 cgccttatcc ggcctacggt tcggactatt tgtaggcacg ataagacgcg acagcgtcgc 1561 atcgggcatg gtttgccaga tacgtcatag atct // LOCUS HSSCLS 1645 bp ds-DNA VRL 17-JUL-1990 DEFINITION Herpesvirus saimiri collagen-like protein mRNA, complete cds. ACCESSION M31964 KEYWORDS collagen-like protein. SOURCE Herpesvirus saimiri (strain 484-77), cDNA to mRNA. ORGANISM Herpesvirus saimiri Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 1645) AUTHORS Geck,P., Whitaker,S.A., Medveczky,M.M. and Medveczky,P.G. TITLE Expression of collagen-like sequences by a tumor virus, herpesvirus saimiri JOURNAL J. Virol. 64, 3509-3515 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P. Medveczky, 07-FEB-1990. FEATURES from to/span description pept 101 400 collagen-like protein pept 580 1224 ORF2 mRNA 56 1248 collagen-like protein mRNA BASE COUNT 545 a 363 c 393 g 344 t ORIGIN Map position 98-100. 1 gagtttccaa aatgtagtaa gctaacatat aaatatcagc agaaaataga gtaacacaag 61 aaacagctaa caagagcaac acaaacacgc ttgaaagaaa atggcaagcg agccaaacct 121 aagatatcca actgaagaaa caggagatcg aggacctcaa ggacctccag gacctccagg 181 accccaagga cctccaggac cccaaggacc tccaggaccc caaggacctc caggacccca 241 aggacctcca ggaccccaag gacctccagg accccaagga cctccaggac ccccaggacc 301 tccaggacct tcagactcca ggattgtttg taacaaactt attgcttgga atcataattt 361 tactcttatt aattatagtt gcgatcttac tggtgtctaa attagtagta aactaagaac 421 agagcaagca gtatagaaga gagtaattat gctagcttaa tttcatctag tagtcagttg 481 tgtagaactt tttattagta aatctagaag tgttaagaaa aaggtataga aagtagtgac 541 ctttattagc tgcatcttca tatctctaaa ttagtaagta tggaaaacca gagagaagaa 601 atagaactaa cagaaattcc agagactgag aaaaaaagaa cagctgaaga aaaattatta 661 tcatgctctg ctgaaactgc tgaagaaaag gtatcattat gctctgaaga aactacagac 721 acgtcaagca gttctagcag tgagcaaaca cctgctccta tagaagtaaa cgttaacatt 781 caaacgagca catacttgcc tcagaacgcg gcaacaaact tgaacagttt gtacacaagt 841 tttgaagacg caagggcaca aggaaaagga ctggtgcgac acaattcgga cgatttaaaa 901 agttttttag aaaaataccc accagattat agaaaaccta aacgggactt gtcagaatca 961 tgggatccag gtatgccgaa accaactttg cctcctaggc ctgcgaacct aggtgcaagc 1021 caggcaagta ctgtaagacg gcatgtaaga gaacaaaatt ttaaacagct tagagaacgt 1081 aaagctaatg aagggaaaat agtaaaagat ttgaaaagac ttgagtacaa agttaatata 1141 attctttgtt tagtagttgt tatattagct ataatattat tactaactgg gctaagtata 1201 ctgttcataa gaataaaaag ttaatttaac aaagaattta acaaatatgt gtgtgcatgg 1261 tcattaaaag aagcgggccc caagcagtgg gcgctgggca gctggaaaaa tgtcctgggc 1321 gtggctagag cgagctcggg gggcaaagcg agcagcgagc ggctgcggag ggagggggaa 1381 gggattgagc tgccccgccc ctctgggctg agattccaga gcccctgttg tgtccccagg 1441 ccaccgcagg cccggggctg tgcccctgtg cttggagcca gtccccagag ccctggctag 1501 gccgaggcgc tgctagagta agggtcccac aactgttcag gcccccccag ggggagagcc 1561 cagcgcagcg ctctgccggg tagggggggg gtagctggct ccccccccac ctctgcgtga 1621 agcagagtgc atgcagcccc tgcag // LOCUS MUSAGP1A 777 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA. ACCESSION M34648 J05488 KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein; non-specific immunosuppressant; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-1A. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 777) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 39 662 alpha-1-acid glycoprotein (AGP) precursor sigp 39 92 alpha-1-acid glycoprotein (AGP) signal peptide matp 93 659 alpha-1-acid glycoprotein (AGP) signal 747 752 AGP poly-A signal BASE COUNT 225 a 189 c 186 g 177 t ORIGIN Chromosome 4. 1 atcttttcca agccctggtg cctctgagtg tcctaagcat ggcactgcac atgattcttg 61 tcatgttgag cctcctgccg ctgttggaag ctcagaaccc agaacatgtc aacatcacta 121 taggcgaacc tatcaccaat gagaccctga gctggctctc tgacaaatgg tttttcattg 181 gtgcggctgt cctaaaccct gattaccggc aggaaattca aaagatgcag atggtatttt 241 ttaacattac ccccaacttg ataaatgaca cgatggagct tcgagagtat cacaccatag 301 atgaccactg tgtctataac tctacacatc taggaatcca gagagagaat gggaccctct 361 ccaagtatgt aggaggagta aaaatctttg cagacctgat agtgctgagg aaacatgggg 421 ccttcatgct tgcctttgac ctgaaggatg agaagaaacg gggactgtcc ctcaatgcca 481 aaaggccaga tatcaccccg gagctgcggg aagtattcca gaaggctgtc aaacacgtgg 541 gcatggatga atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg 601 agaagaagca gcttgagctg gagaaggaga ccaagaaaga tcctgaggaa ggccaggcat 661 gaactcagct ctctgaactc catgggctgt ccccaggctc accaaacccc accccctcct 721 gtgcactttg attctgtctc tgcaacaata aaggtttgct gacacagtca aaaaccg // LOCUS MUSAGP1X 778 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA. ACCESSION M34647 J05488 KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein; non-specific immunosuppressant; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-1X. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 778) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 39 662 alpha-1-acid glycoprotein (AGP) precursor sigp 39 92 alpha-1-acid glycoprotein (AGP) signal peptide matp 93 659 alpha-1-acid glycoprotein (AGP) signal 747 752 AGP poly-A signal BASE COUNT 224 a 191 c 187 g 176 t ORIGIN Chromosome 4. 1 atctcttcca ggccctggtg cctctgagtg tcctaagcat ggcactgcac atgattcttg 61 tcatgttgag cctcctgccg ctgttggaag ctcagaaccc agaacatgtc aacatcacta 121 taggcgaacc tatcaccaat gagaccctag gctggctatc tgacaaatgg tttttcattg 181 gtgcggctgt cctaaaccct gattaccggc aggaaattca aaagatgcag atggtatttt 241 ttaacattac ccccaacttg ataaatgaca cgatggagct tcgagagtat cacaccatag 301 atgaccactg tgtctataac tccacacatc taggaatcca gagagagaat gggaccctct 361 ccaagtatgt aggaggagta aaaatctttg cagacctgat agtgctgagg aaacatgggg 421 ccttcatgct tgcctttgac ctgaaggatg agaagaaacg gggactgtcc ctcaatgcca 481 aaaggccaga tatcaccccg gagctgcggg aagtattcca gaaggctgtc aaacacgtgg 541 gcatggatga atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg 601 agaagaagca gcttgagctg gagaaggaga ccaagaaaga tcctgaggaa ggccaggcat 661 gaactcagct ctctgaactc catgggctgt ccccaggctc accaaacccc accccctcct 721 gtgcactttg attctgtctc tgcaacaata aaggtttgct gacactgtca aaaacccg // LOCUS MUSAGP2AX 765 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA. ACCESSION M34649 J05488 KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein; non-specific immunosuppressant; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clones lambda-[2A,2X]. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 765) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 29 652 alpha-1-acid glycoprotein (AGP) precursor sigp 29 82 alpha-1-acid glycoprotein (AGP) signal peptide matp 83 649 alpha-1-acid glycoprotein (AGP) signal 735 740 AGP poly-A signal BASE COUNT 226 a 185 c 185 g 169 t ORIGIN Chromosome 4. 1 gggcctggtg cctctgagtg ccctcagcat ggcactgcac acggttctta tcatgttgag 61 cctcctgccg atgttggaag ctcagaaccc agaacatgcc aacatcacta taggcgaacc 121 tatcaccaat gagaccctag gctggctctc tgacaaatgg tttttcatgg gcgcagcttt 181 cagaaaactc gagtacaggc aggcaattca aatgatgcag actgaatttt tttaccttac 241 caccaacttg ataaacgaca caatagagct tcgggagtct caaacaatag gtgaccagtg 301 tgtctataac tccacccatc taggattcca gagagaaaat gggaccttct ccaagtatga 361 aggaggagta gaaacctttg cccacctgat agtgctaagg aaacatgggg ccttcatgct 421 tgcctttgat ctcaatgatg agaagaaacg gggactgtcc ctctatgcca aaaggccaga 481 tatgaccctg gagctacggg aagtattcca gaaggctgtc aaacacgtgg gcatggatga 541 atcagaaatc atatttgtcg actggaaaaa ggataagtgt ggtcagcagg agaagaagca 601 gcttgagttg gggaaggaga ccaagaaaga tcctgaggaa ggccaggcat gaactcagct 661 ctgaactcca tgggctatcc ccaggctcac caaaccccac cccctcctgt gcactttgat 721 tctgtctctg caacaataaa gatttgctga aagagtcaaa aaccg // LOCUS MUSAGP3X 766 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) mRNA. ACCESSION M34646 J05488 KEYWORDS acute phase serum protein; alpha-1-acid glycoprotein; non-specific immunosuppressant; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte, cDNA to mRNA, clone lambda-3X. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 766) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 31 654 alpha-1-acid glycoprotein (AGP) precursor sigp 31 84 alpha-1-acid glycoprotein (AGP) signal peptide matp 85 651 alpha-1-acid glycoprotein (AGP) signal 737 742 AGP poly-A signal BASE COUNT 228 a 186 c 185 g 167 t ORIGIN Chromosome 4. 1 cggggcctgg tgcctctgag tgccctcagc atggcactgc acacggttct tatcatgttg 61 agcctcctgc cgatgttgga agctcagaac ccagaacatg ccaacatcac tataggcgaa 121 cctatcacca atgagaccct aggctggctc tctgacaaat ggtttttcat gggcgcacgt 181 ttcagaaaac tcgagtacag gcaggcaatt caaatgatgc agactgaatt tttttacctt 241 accaccaact tgataaacga cacaatagag cttcgggagt ctcaaacaat aggtgaccag 301 tgtgtctata actccaccca tctaggattc cagagagaaa atgggacctt ctccaagtat 361 gaaggaggag tagaaacctt tgcccacctg atagtgctaa ggaaacatgg ggccttcatg 421 cttgcctttg acctcaatga tgagaagaaa cggggactgt ccctctatgc caaaaggcca 481 gatatgaccc tggagctacg ggaagtattc cagaaggctg tcaaacacgt gggcatggat 541 gaatcagaaa tcatatttgt cgactggaaa aaggataagt gtggtcagca ggagaagaag 601 cagcttgagt tggggaagga gaccaagaaa gatcctgagg aaggccaggc atgaactcag 661 ctctgaactc catgggctat ccccaggctc accaaacccc accccctcct gtgcactttg 721 attctgtctc tgcaacaata aagatttgct gaaagaaaaa aaaccg // LOCUS MUSAGP7A 1348 bp ds-DNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) gene 7, 5' flank and exon 1. ACCESSION M34645 J05488 KEYWORDS alpha-1-acid glycoprotein; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte DNA, clones lambda-[11B,27A]. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1348) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 1232 / 1348 alpha-1-acid glycoprotein (AGP), exon 1 mRNA 1194 > 1348 AGP mRNA signal 1161 1166 TATA box BASE COUNT 312 a 316 c 356 g 364 t ORIGIN Chromosome 4. 1 gaattcacac aggccttgct catctggcct ctgcaggcaa aggacaggag aagcagggga 61 ggcacatgtg agcagaagcc tgggcagact ctggaatggt atgcagcaag taccctgggg 121 agaccttagc tggatgtcca cttccctgcc tgcacccttg cccttagaat ccacaggctc 181 tggatacaca gcacagccac aggaagcttt ggggatttct tttccctatc ttaagtttat 241 tgttttcttc atttggttgt ataacaaaaa gagattttta aatttgctaa tgggtgcaag 301 tgtgagtgtg tgtgtgtgtg tgtgtgtgtt tctgtgtgtg tgtgtgtgtg tgtgtgtgtg 361 tgtgtgtgtg tgtgtgtgtg agagagagag agagagagga agggacagag tgaaatagat 421 catgacactg tgagcatgtt ctaacacgtt tgctgtattg atgttttgaa gcagctcact 481 gtgatctcaa gctcccagca atgtctctgt gcctcagcgg gagtcaccag gcctgtccct 541 gcaatacttc tagagattgt tgattatgta gacaaagatc aggtaggaac tagaaccaga 601 tattgactgg atgctgggca cagaccctgg gttgggcata cactgtccta ggcagatcct 661 ttcctgctgt agatactggg agctttgctg aactcgatgg tcaactcaga ttcacccctt 721 tttgggcact tggctgcctt caggctgggt aggagtccgc aggaacatca cactcctttg 781 aaaaccatcc atctttattc ttggccctca acttgagccc ctaagtgtct tctatgttca 841 ctatgaaccc tgacctggga ccccttctta tcacgcttcg ggggggtgga gttgatgtag 901 tgtaggtttc actcctgtta gacagcttca tgggataaga gagaaagggg accgctgtct 961 gggacctaag tatcatcagg ctagcccttt acctaccttg accatgaatc agccactctg 1021 gtgtaggcag aagccgactg agggagctgc acaaacctgg cttgagggaa catttggggc 1081 aagacattta tcaagtgctg gtgactttgt ggcactgctc taacgccctg ctgcagtcac 1141 atgccctccc cacatcctgt tataaaagtc actgcaccct ccaggcacca gttatctctt 1201 cccagcactg gtgcctctga gtgccgtcag catggagcta cacacagttc ttgtgatgtt 1261 gagcctcctg ctgctgttgg aagctcagaa cccagaacat gccaacatca acataggcaa 1321 acctatcaac aatgagaccc tgagctgg // LOCUS MUSAGP8A 2094 bp ds-DNA ROD 17-JUL-1990 DEFINITION M.caroli alpha-1-acid glycoprotein (AGP) gene 8, 5' flank and exon 1. ACCESSION M34644 J05488 KEYWORDS alpha-1-acid glycoprotein; transmembrane protein; tyrosine kinase. SOURCE M.caroli adult hepatocyte DNA, clones lambda-[15A,11B]. ORGANISM Mus caroli Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2094) AUTHORS Prowse,K.R. and Baumann,H. TITLE Molecular characterization and acute-phase expression of the multiple Mus caroli alpha-1-acid glycoprotein (AGP) genes: Differences in glucocorticoid stimulation and regulatory elements between the rat and mouse AGP genes JOURNAL J. Biol. Chem. 265, 10201-10209 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.R.Prowse, 18-MAY-1990. FEATURES from to/span description pept 1978 / 2094 alpha-1-acid glycoprotein (AGP), exon 1 mRNA 1940 > 2094 AGP mRNA signal 1907 1912 TATA box BASE COUNT 554 a 502 c 464 g 574 t ORIGIN Chromosome 4. 1 gaattcacac aggccttgct catctggcct ctgcaggcaa aggacaggag aagcagggga 61 ggcacatgtg agcagaagcc tgggcagact ctggaatggt atgcagcaag taccctgggg 121 agaccttagc tggtgtccac tccctgcctg cacccttgcc cttagaatcc acaggctctg 181 gatgcacagc acagccacag gaagctttgg agatatctaa tccgtttctt aagtttatta 241 ttttccatca tttgctttta taacaaaaaa tagatatttt atatctgcca gtgtgtgcaa 301 atatgtttat gtgcacgtgt acgtgtgcgc ctgtgtgtaa gagacataac agtgacaaag 361 aggcacaaac agagagatta tgacttctgg gcactgctaa cttgggacag aacatatatc 421 attggtgctt ttgggctcct catggttatg ctgtttctcc tctgggaaga gagaaacttt 481 cttaagactg tataaataca acaaaacaca tatgttttgg gtctccagct cagcaacttc 541 tcctaggccc tgtacagccc actccatggt cagggggctc tcaggaccat ctgtggcccc 601 ttccttttgc cctaggtggt cactgtcact gacctgatta atatcattac aattagtctg 661 atcctctaga acgccctgta aacaggctca cacaacccca cctctgggcg atctgctggc 721 aataatgttg tcgagctgca tttgtatggt caaatctatc agtctttcat ttctctctgt 781 tgctaagcaa caccatgctt gcatgaaatg tcacaaccca tttctcattt tcctggtggt 841 aggcacttaa gagcttggag tttgtggcta ctttgaataa tgccattaga acatttacac 901 acatgatact ggaggtgact catgttttgt tttcctttct gttggtagac taatttttaa 961 aggcaatatg ttgctctgac acttataaag aggggaagca taaaatacaa aatgccacag 1021 gataaaagaa ggttctcgtg aaattagaac acagtaagtg aatgacaatt atttatcatt 1081 aaaacaaaaa ccctgaatac acagaaacat gagaaattgt ggaggacaca ggaagtagat 1141 tactcaactg tgagcgtgtt ctcatatgtt tggtgctttt gatgttttga aagagctcac 1201 ggtgatctca agctcctagc aatgactccg agcctcagtg agagtcacca ggcctgtccc 1261 tagcaataca tttcgagagt gttgattatg tagacaaaaa tcaggctata tctatcaagc 1321 aggaaccaga accagatatt gactggttgc tgtgcacaga ccctggattg ggcacacact 1381 attctagaca gatcctttcc tgctgtagat acagggagct ttgctgaact acattttcaa 1441 ctcagattca tccctctttt ttgggcattt ggctgcctct agcgtgcgta ggggtcccca 1501 gaaacatcac actcctttgg aaactaatcc atctttgtcc ttggccctta acttgagccc 1561 ctaaatgact tctaagttca ctatgaaccc tgaccaggac cccttcatag tccccttggt 1621 gataaataga ataggtctca ctcctgctag gcagcttcat gggataagac agatatggga 1681 ccatgactgg gatctaagta ttatcaggct agcccagtat ccaccttgac catgaatcag 1741 ccactctggt gtaggtcagg agcctgtgtc ggggccggct ggctgaggga gctgcacaaa 1801 gctggcttga gggaacattt tgcgcaagac atttcccaag tgctggggag attgtgccag 1861 ggctctagag gccctgctgc agtcccatgc cctccccaca ttctgttata aaagccactg 1921 caccctccat ccaccaatta tctcttccaa gccctggtgc ctctgagtgt cctaagcatg 1981 gcactgcaca tgattcttgt catgttgagc ctcctgccgc tgttggaagc tcagaaccca 2041 gaacatgtca acatcactat aggcgaacct atcaccaatg agaccctagg ctgg // LOCUS MUSMK1 933 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK-1) mRNA, complete cds. ACCESSION M19662 M34326 J05473 KEYWORDS DNA-binding protein; regulatory protein; retanoic acid-responsive protein; secreted protein. SOURCE Mouse (strain 129/Sv) HM-1 embryonal carcinoma, cDNA to mRNA, clone pO1-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 33 to 932) AUTHORS Kadomatsu,K., Tomomura,M. and Muramatsu,T. TITLE cDNA cloning and sequencing of a new gene intensely expressed in early differentiation stages of embryonal carcinoma cells and in mid-gestation period of mouse embryogenesis JOURNAL Biochem. Biophys. Res. Commun. 151, 1312-1318 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 933) AUTHORS Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T. TITLE A retinoic acid responsive gene, MK, found in the teratocarcinoma system: Heterogeneity of the transcript and the nature of the translation JOURNAL J. Biol. Chem. 265, 10765-10770 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Kadomatsu, 08-JUL-1988. Draft entry and computer-readable sequence for [2] kindly submitted by M.Tomomura and T.Muramatsu, 16-MAY-1990. The expression of MK-1 mRNA increases in the early stages of retinoic acid-induced differentiation of embryonal carcinoma cells. The putative peptide product may act as a DNA-binding protein regulating gene expression. FEATURES from to/span description pept 249 671 retanoic acid-responsive protein (MK-1) precursor sigp 249 314 retanoic acid-responsive protein signal peptide matp 315 668 retanoic acid-responsive protein mRNA < 1 910 MK-1 mRNA BASE COUNT 216 a 272 c 270 g 175 t ORIGIN 1 gagtgagcgc acgcagtggc tgtggcccca gtcccttcag gcggctgctc tgccaccaag 61 ggggctgagg tgggggtggg ggtacgctga gacatcggtt ccaagtcctc cctccgtctc 121 ccccttgtcg gtccgacgtt ttgggcctgg aaagtgggac aagtcagtca agggtgggag 181 gtccttcccg cggttcctag cggagaagag actaggcgag aaactctaac ccaggtttta 241 cccctaggat gcagcaccga ggcttcttcc ttctcgccct tcttgccctc ttggtggtca 301 cgtccgcggt ggccaaaaaa aaagagaagg tgaagaaggg cagcgagtgt tcggagtgga 361 cctgggggcc ctgcaccccc agcagcaagg actgcggcat gggcttccgc gagggtacct 421 gtggggccca gacccagcgc gtccattgca aggtgccctg caactggaag aaggaatttg 481 gagccgactg caaatacaag tttgagagct ggggggcgtg tgatgggagc actggcacca 541 aagcccgcca agggaccctg aagaaggcgc ggtacaatgc ccagtgccag gagaccatcc 601 gcgtgactaa gccctgcacc tccaagacca agtcaaagac caaagccaag aaaggaaaag 661 gaaaggacta agtcaggagg ccagagagcc tccggcctcg cctggagcct gaacggagcc 721 ctcctctccc acaggcccaa gatataaccc accagtgcct tttgtcttcc tgtcagctct 781 gtcaatcacg cctgtcctct cacgcccaca ccaagtgccc aaagtgggga gggacaagag 841 attctggaaa gtgagcctcc ccataccctc ttttgttctc cccaccctga tacttgttat 901 taagaaatga ataaaataaa ctcacttttt tcc // LOCUS MUSMK2 730 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK-2) mRNA, complete cds. ACCESSION M34327 J05473 KEYWORDS DNA-binding protein; regulatory protein; retanoic acid-responsive protein; secreted protein. SOURCE Mouse HM-1 embryonal carcinoma, cDNA to mRNA, clone pB2-25. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 730) AUTHORS Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T. TITLE A retinoic acid responsive gene, MK, found in the teratocarcinoma system: Heterogeneity of the transcript and the nature of the translation JOURNAL J. Biol. Chem. 265, 10765-10770 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Tomomura and T.Muramatsu, 16-MAY-1990. FEATURES from to/span description pept 44 466 retanoic acid-responsive protein (MK-2) precursor sigp 44 109 retanoic acid-responsive protein signal peptide matp 110 463 retanoic acid-responsive protein mRNA < 1 705 MK-2 mRNA BASE COUNT 186 a 211 c 206 g 127 t ORIGIN 1 caggccggag cgggagggag cgaagcatcg agcagtgagc gagatgcagc accgaggctt 61 cttccttctc gcccttcttg ccctcttggt ggtcacgtcc gcggtggcca aaaaaaaaga 121 gaaggtgaag aagggcagcg agtgttcgga gtggacctgg gggccctgca cccccagcag 181 caaggactgc ggcatgggct tccgcgaggg tacctgtggg gcccagaccc agcgcgtcca 241 ttgcaaggtg ccctgcaact ggaagaagga atttggagcc gactgcaaat acaagtttga 301 gagctggggg gcgtgtgatg ggagcactgg caccaaagcc cgccaaggga ccctgaagaa 361 ggcgcggtac aatgcccagt gccaggagac catccgcgtg actaagccct gcacctccaa 421 gaccaagtca aagaccaaag ccaagaaagg aaaaggaaag gactaagtca ggaggccaga 481 gagcctccgg cctcgcctgg agcctgaacg gagccctcct ctcccacagg cccaagatat 541 aacccaccag tgccttttgt cttcctgtca gctctgtcaa tcacgcctgt cctctcacgc 601 ccacaccaag tgcccaaagt ggggagggac aagagattct ggaaagtgag cctccccata 661 ccctcttttg ttctccccac cctgatactt gttattaaga aatgaataaa ataaactcac 721 ttttttccaa // LOCUS MUSMK3 1039 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse retinoic acid-responsive protein (MK-3) mRNA, complete cds. ACCESSION M34328 J05473 KEYWORDS DNA-binding protein; regulatory protein; retanoic acid-responsive protein; secreted protein. SOURCE Mouse HM-1 embryonal carcinoma, cDNA to mRNA, clones pB3-30 and FK-3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1039) AUTHORS Tomomura,M., Kadomatsu,K., Matsubara,S. and Muramatsu,T. TITLE A retinoic acid responsive gene, MK, found in the teratocarcinoma system: Heterogeneity of the transcript and the nature of the translation JOURNAL J. Biol. Chem. 265, 10765-10770 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Tomomura and T.Muramatsu, 16-MAY-1990. FEATURES from to/span description pept 355 777 retanoic acid-responsive protein (MK-3) precursor sigp 355 420 retanoic acid-responsive protein signal peptide matp 421 774 retanoic acid-responsive protein mRNA < 1 1016 MK-3 mRNA BASE COUNT 278 a 292 c 302 g 167 t ORIGIN 1 cccaggtacc cagctccctg ccacatcaga gacccttctt gcactctgag tgaactgatt 61 aaaaaaaaaa aaaaaaaaaa aaaaccaagc cggaggtgag ccgggcctcg aagggaaggt 121 tcgcgggtgc ggtggccgcc ccgagcctgt gacaccagga catactcccg gggcccgcgg 181 tgggcaagcg aagtggtgac ctgagagctg acaggctgcg agagggaaaa gtatagacag 241 gcctagacca ggggaagggg aggggataga gagctgggcc tgctacgagg ggacctgagc 301 cagaagcgca ctggtaaaac cgaactccag gaccagagac ccagagatca gaggatgcag 361 caccgaggct tcttccttct cgcccttctt gccctcttgg tggtcacgtc cgcggtggcc 421 aaaaaaaaag agaaggtgaa gaagggcagc gagtgttcgg agtggacctg ggggccctgc 481 acccccagca gcaaggactg cggcatgggc ttccgcgagg gtacctgtgg ggcccagacc 541 cagcgcgtcc attgcaaggt gccctgcaac tggaagaagg aatttggagc cgactgcaaa 601 tacaagtttg agagctgggg ggcgtgtgat gggagcactg gcaccaaagc ccgccaaggg 661 accctgaaga aggcgcggta caatgcccag tgccaggaga ccatccgcgt gactaagccc 721 tgcacctcca agaccaagtc aaagaccaaa gccaagaaag gaaaaggaaa ggactaagtc 781 aggaggccag agagcctccg gcctcgcctg gagcctgaac ggagccctcc tctcccacag 841 gcccaagata taacccacca gtgccttttg tcttcctgtc agctctgtca atcacgcctg 901 tcctctcacg cccacaccaa gtgcccaaag tggggaggga caagagattc tggaaagtga 961 gcctccccat accctctttt gttctcccca ccctgatact tgttattaag aaatgaataa 1021 aataaactca cttttttcc // LOCUS YSCVPS33P 2695 bp ds-DNA PLN 17-JUL-1990 DEFINITION S.cerevisiae vacuolar biogenesis and vacuolar protein sorting protein (VPS33/SLP1), complete cds. ACCESSION M34638 KEYWORDS vacular biogenesis protein; vacuolar protein sorting protein. SOURCE S.cerevisiae (strain GRF88) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2695) AUTHORS Banta,L.M., Vida,T.A., Herman,P.K. and Emr,S.D. TITLE Characterization of the yeast Vps33p, a protein required for vacuolar protein sorting and vacuole biogenesis JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.M.Banta, 18-MAY-1990. Author address: L.M.Banta California Insitute of Technology 147-75 Div of Biology Pasadena, CA 91125 FEATURES from to/span description pept 401 2476 Vps33p protein signal 245 252 TATA box binding 480 498 ATP-binding domain A (put.) binding 661 679 ATP-binding domain B (put.) BASE COUNT 916 a 445 c 522 g 812 t ORIGIN 1 gtcaagaagc gggcttggct gccataatgg aagatttaga tgtcgcaaaa gtggaattac 61 gtcattttga gaaagctttt aaaggaattg ctaggggcat tactccagaa atgctctctt 121 attatgaaga gtttgctcta agaagcggtt catcttcgta agcttgttca tagtcaattc 181 ttttcctttg tgtgctcaat aatagtagat agaaattata ctgaactccg gtcattttgt 241 ataatatatt aatcacttca cacgaacata cataaataaa atatcataaa ggttagcaaa 301 ttggaactag ttatatgtta attagttaaa agatagaaaa ttcgagaaag gaagaaaaag 361 ctgatattgc ccatctccaa ctttatcaaa tcatttcacg atgaatagat tttggaatac 421 taagaaattt tcattaacaa atgccgatgg actatgtgct accttaaatg agatatctca 481 aaatgatgaa gttcttgtgg ttcaaccaag tgtattgcca gtactcaata gtttgctaac 541 tttccaagat ttgactcaat caactcctgt aaggaaaatt acgttactcg atgatcagct 601 aagtgacgat ttaccgagtg ccttaggcag cgttccgcaa atggatctta tttttcttat 661 tgatgtcaga acatctctcc gactccctcc acaactgctt gatgctgctc aaaagcacaa 721 tttatcatct ttgcatataa tatactgtcg atggaaaccg tctttccaaa atactttgga 781 ggatacagag caatggcaaa aggatggttt cgatttgaat tcaaaaaaaa cacatttccc 841 taacgtcatt gaatctcagt taaaggagct atcgaacgaa tatacccttt acccttggga 901 tctcttgccc ttcccacaga ttgatgaaaa tgttctattg actcattccc tttataacat 961 ggaaaatgta aacatgtatt atcccaactt acgttctttg cagagtgcca cagagtcaat 1021 actggttgat gatatggtca attcgttgca gagcttgatt tttgaaacta atagtatcat 1081 aacaaatgtt gtgtcgatag gtaatctgtc taagagatgt agccatcttt tgaagaaacg 1141 aatcgatgag catcaaacag agaatgattt attcatcaag ggtacgcttt atggtgaacg 1201 aaccaactgt ggactagaaa tggacttgat tatcttggaa aggaataccg atcctataac 1261 gccattgttg acacaactta cgtatgcagg aatactagat gatctatatg aattcaattc 1321 tggcataaag ataaaggaga aagacatgaa cttcaattat aaggaagata aaatatggaa 1381 tgatttgaaa tttttaaatt ttgggtcgat tgggccgcag ttaaataaat tggcaaagga 1441 actacaaacg caatatgata caaggcataa agccgagagc gtacatgaaa tcaaagaatt 1501 cgttgattcc ttaggttctt tgcaacaaag gcaagctttt ttgaaaaatc acacaacctt 1561 atcatccgac gttttgaaag tggtagagac tgaagagtac ggatctttca ataaaatctt 1621 agagttagag ctggaaattt tgatgggaaa tacacttaat aacgacattg aagatattat 1681 actcgagttg cagtaccagt acgaggttga tcaaaagaag attctcagat taatctgttt 1741 attgtctctt tgtaaaaatt cacttcgaga aaaggattat gaatatctaa gaacctttat 1801 gatcgactct tggggcattg aaaaatgctt tcaacttgaa tcattggctg agttaggatt 1861 tttcactagc aaaacgggaa aaactgattt gcatattaca acaagtaagt caacaagatt 1921 acagaaagaa taccgttata tttcacaatg gttcaataca gtacccatag aagacgagca 1981 tgctgccgat aaaatcacaa atgagaacga tgacttctcg gaagccactt ttgcttacag 2041 tggtgtagtg cccttgacaa tgagactggt tcagatgtta tatgataggt ctatcttgtt 2101 ccataattat tcctcgcagc agccttttat actgtcaaga gaacctagag tttctcaaac 2161 ggaggattta attgaacagt tatatggaga ctcacatgcg atcgaagaga gtatatgggt 2221 cccgggaacc attacaaaaa agatcaatgc aagcatcaag agcaataata gacggtccat 2281 agacggatct aatgggacat ttcatgctgc agaggatatt gcactcgtag tattcctcgg 2341 aggtgtaaca atgggtgaaa tagctataat gaagcatttg caaaaaatac taggtaaaaa 2401 aggtatcaat aaaaggttta tcatcatcgc cgatggcttg atcaatggca caaggatcat 2461 gaactctata tcttaattat tatatgatag atttgttaat tttttgtata tgcaaatgtg 2521 ctttttttca ccaaacggtt tgcaccaatc atacgagaga agtgttcggt gtttacggaa 2581 aagctagggg actaagaaaa attgaaaata aaggctgaca gcagtagaaa ccattgtgct 2641 ggcttagtga tttataagaa tggttaatta gttttgtatc ctttaatttt ctaga // LOCUS BOVCASA 1123 bp ss-mRNA MAM 17-JUL-1990 DEFINITION Bovine alpha-s1-casein mRNA, complete cds. ACCESSION M33123 KEYWORDS alpha-s1-casein. SOURCE Bovine (strain Holstein) lactating mammary gland, cDNA to mRNA, clone p-alpha-s1 C228. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1123) AUTHORS Nagao,M., Maki,M., Sasaki,R. and Chiba,R. TITLE Isolation and sequence analysis of bovine alpha-s1-casein cDNA clone JOURNAL Agric. Biol. Chem. 48, 1663-1667 (1984) STANDARD simple staff_review FEATURES from to/span description pept 64 708 alpha-s1-casein precursor sigp 64 108 alpha-s1-casein signal peptide matp 109 705 alpha-s1-casein mRNA < 1 1123 alpha-s1-casein mRNA BASE COUNT 331 a 247 c 223 g 322 t ORIGIN 1 tcacttcgac catcaaccca gcttgctgtt cttcccagtc ttgggttcaa gatcttgaca 61 accatgaaac ttctcatcct tacctgtctt gtggctgttg ctcttgccag gcccaaacat 121 cctatcaagc accaaggact ccctcaagaa gtcctcaatg aaaatttact caggtttttt 181 gtggcacctt ttccagaagt gtttggaaag gagaaggtca atgaactgag caaggatatt 241 gggagtgaat caactgagga tcaagccatg gaagatatta agcaaatgga agctgaaagc 301 atttcgtcaa gtgaggaaat tgttcccaat agtgttgagc agaagcacat tcaaaaggaa 361 gatgtgccct ctgagcgtta cctgggttat ctggaacagc ttctcagact gaaaaaatac 421 aaagtacccc agctggaaat tgttcccaat agtgctgagg aacgacttca cagtatgaaa 481 gagggaatcc atgcccaaca gaaagaacct atgataggag tgaatcagga actggcctac 541 ttctaccctg agcttttcag acaattctac cagctggatg cctatccatc tggtgcttgg 601 tattacgttc cactaggcac acaatacact gatgccccat cattctctga catccctaat 661 cccattggct ctgagaacag tgaaaagact actatgccac tgtggtgaag agtcaagtga 721 attctgaggg actccacagt tatggtcttt gatgggtctg aaaattccat gctctacatg 781 tcgcctcatc tacatgtcaa accattcatc caaaggcttc aactgctgtt ttagaacagg 841 gcaatctcaa actgaggcac tccttgatgc tctactgtat tttagatagt gtaacatcct 901 taagtgaaat tgtcctaaca gcttgttacc taaattccag tagtatcatg ctggtataaa 961 ggccactgag tcaaagggaa ttaaagtctt cattaaattt ctgtatggaa aatgttttaa 1021 aagcctttga atcacttctc ctgtaagtgc catcatatca aataattgtg tgcattaact 1081 gagattttgt ctttcttctt ttcaataaat tacattttaa ggc // LOCUS BPHINTXIS 1741 bp ds-DNA PHG 17-JUL-1990 DEFINITION Bacteriophage phi-11 integrase (int) and excisionase (xis) genes, complete cds. ACCESSION M34832 KEYWORDS excisionase; integrase. SOURCE Bacteriophage phi-11 DNA. ORGANISM Bacteriophage phi-11 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1741) AUTHORS Ye,Z.-H., Buranen,S.L. and Lee,C.Y. TITLE Sequence analysis and comparison of int and xis genes from Staphylococcal bacteriophages l54a and phi-11 JOURNAL J. Bacteriol. 172, 2568-2575 (1990) STANDARD simple staff_review FEATURES from to/span description pept 267 67 (c) excisionase (xis) pept 379 1425 integrase (int) BASE COUNT 650 a 264 c 263 g 564 t ORIGIN 1 cctatgccag caccagtgaa actctattat gcatggtatt aaaatcgaag agtacaattc 61 gataattcaa acattatttg acgaaatagc taagctgtct aatgtatata agtctcttaa 121 taaacagtaa gcaaaatcgg attcttcatt acataccgaa tattcatcat aaacactgac 181 tgcatcttct aagacatttt ttaaaattct aatgtcttca ttcgttaaaa ctaattcatt 241 gaaattatga ttgtttttaa atgtcataac atcacctact ttttatttta ttatatcaca 301 tttagtacct agtactaaat ttcgggtagc ccgcctaccc ttattatttt ttgccaattt 361 tgaggaggga gaagcaaaat gccagtatat aaggatgata atacaggtaa atggtatttt 421 tccattagat ataaagatgt atacggtaat aacaaacgaa aaatgaagcg tgggtttgaa 481 cgtaagaaag atgccaaact agctgaaagc gaatttatac aaaatgttaa atatggatac 541 tcggacaatc aaccctttga atatatattt tttgatcgtt taaaaaatga aaatctttct 601 gcacgctcaa tagaaaagcg aactacagaa tataatactc acataaaaga aaggttcgga 661 aatatcccta ttggcaaaat cactactacg caatgtactg ctttcaggaa ttatttgtta 721 aacgatgcag gtctttctgt tgactatgca cgatctgtgt gggcaggttt taaagcagtt 781 atcaattacg ccaaaaagca ttacaagctc ttatacgacc ccacattatc ggtaactcct 841 attcccagaa caaaaccaca agctaaattt atcactcgtg aagaatttga tgaaaaagta 901 gaacaaatca caaatgatac ttctcgtcag ctaactagac tgttatttta ttctggtctt 961 agaataggag aagctttagc tttgcagtgg aaagattacg ataaaataaa aggcgaaatt 1021 gacgtaaata agaaaatcaa tttaagtaat agaaaaattg aatataatct aaaaaaagaa 1081 agctctaaag ggataatacc tgtaccaaat ttaattagag agatgcttaa aaacatgtat 1141 aatgaatctt ctaaaagata taaatatttt gacgaaaact attttatatt cgggggttta 1201 gaacctatta gatacgttac ttattcgtat cattttaaat ctgtattccc gaatctaaaa 1261 atacaccatt taagacactc gtacgctagc tatttaatta ataatggtgt agatatgtat 1321 ttattaatgg aattaatgag gcattctaac attacagaaa caattcaaac gtactctcat 1381 ttatatactg ataaaaaaca tcaagctatg agcatatttg attaaacggt atcaaattgg 1441 tatcaaataa caattaagga gtttataaaa tgcgtaataa caagcctaaa ataagtattc 1501 aaaacgaccc atgggaagtg aaatttatat acatttaaat ttcatgagac aataaacgtt 1561 gatttaatgc gtttttttgc cttttttatt ttccttattt tttctgtttt acaacaaaat 1621 ggtatcaaaa atggtatcat ttgtagttat tttagcttca catattaaaa caaccacact 1681 cctaaattaa taggtggtgt ggttttgttg gttgtgtggg gataaaaata accgcatcag 1741 t // LOCUS BSTNPRAS 3510 bp ds-DNA BCT 17-JUL-1990 DEFINITION B.stearothermophilus neutral protease (nprS), and transcriptional activator (nprA) genes, complete cds. ACCESSION M34237 KEYWORDS neutral protease; transcriptional activator. SOURCE B.stearothermophilus (strain TELNE) DNA, clone pSP53. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3510) AUTHORS Nishiya,Y. and Imanaka,T. TITLE Cloning and nucleotide sequences of the neutral protease gene and its transcriptional activator gene from Bacillus stearothermophilus JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Nishiya, 11-MAY-1990. FEATURES from to/span description pept 181 1401 transcriptional activator (nprA) pept 1750 3405 neutral protease (nprS) precursor sigp 1570 2274 neutral protease (nprS) signal and propeptide matp 2275 3219 neutral protease (nprS) BASE COUNT 1177 a 581 c 737 g 1015 t ORIGIN 1 tacggtcttc agacatttct attcctatag cccaaatgag tagttccttt tggaggagaa 61 aatgtgtata atttttagta aatttatatt agtaaaaaat taagaaggag taggtattat 121 ttgaagattg gtgatcgctt aaaattttcc cgtatcaaac ataagttaac gcaagaggaa 181 gtggctgacg gaattatttc cgtatcatat ttatcaaaaa ttgaaaacaa tcaagtggtt 241 ccaagtgaag aagtgcttcg cctcctttgt caacggttgg gaatcaacaa tatcctgaaa 301 aatagacaag atgaattaac aagtaaattg ttattatggt acaaaacgat tacggataaa 361 aaccgacagg aagcagcccg gatgtacgag gaaatcaaac gaactttcga tgacgtccag 421 ggggcggaat ccatcgctta ctttctgttg tttgaaatgc gctatcactt gttattaaaa 481 gatattcata ctgtcgaagc gttgttgatc aaattaaggg aattgtatga cacctttgat 541 gatgtgatga agtattatta ttataaattt ttaggtctac tttactattg caaggaaaaa 601 tatgaagatg ctttggaata ttataaaaag gcggagcagc gatttcgaag ccaatcattt 661 gaaaaatggg aagaagctga tttgcattat ttactagcgc ttgtttatag ccggctctgg 721 agaatattag gctgtattaa ctatgcgcag catgctttag cgatttacca atccgaatac 781 gatttaaagc gaagcgctga atgccacatt ttacttggta tttgttacag aaggtacgga 841 gaagtagatc aagcgatcga atgctattca ttggcccata aaattgccca aatcattaat 901 gataccgaat tattaggtac gattgagcat aacctaggct acttaatgtc aatgaaacat 961 gagcattatg aagccattca gcattataag aagagtttgc tgtataagcg aaactcttca 1021 ttacaagcta gatttattac gttgttttct ctcatcaaag aatattatgt ttccaaaaac 1081 tataaaaaag cattagccaa tgtagaggaa agtttgcagc ttctcaagag ggaaaaagat 1141 gggatgacaa cgtattatga atattatctt catttcacag tttatcaata tttactatca 1201 gaagatattt cggaaaatga atttgaaaca tttatgaaag atcgagtgct cccttatttt 1261 caaaggttta aaaaatatga agatgttgca caatacgctg aatacttggc aatctattac 1321 gagaaacgtc ataagtataa actagcaagc aaattctata aaatgagtta tcaatttcta 1381 aaaaatatga taaatattta ggagggattt ttttgaaaaa gcttttatta ggaatcatga 1441 cgtttggtat tatgagttta cttgttctca ttggtagtga ccaagaacca aaatatgtgg 1501 caaaagacga acatccgcct ccaaccatca tcattgcagc gaaagatgaa catccaccag 1561 caacgattat ttgaagagga ataagcaaaa agacagctag ttttctagct gtcttttttc 1621 atgcatagga aaatgtgaaa aaaacgtagg gaattatcaa ctatatcaga ctctattttt 1681 cccaatacaa aatactgtaa aatattgtgt ttaatattct aaatacaaag aataaaggag 1741 gatgaaaaaa tgaaaaggaa aatgaaaatg aaattagtac gttttggtct tgcagcagga 1801 ctagcggccc aagtattttt tttaccttac aatgcgctgg cttcaacgga acacgttaca 1861 tggaaccaac aatttcaaac ccctcaattc atctccggtg atctgctgaa agtgaatggc 1921 acatccccag aagaactcgt ctatcaatat gttgaaaaaa acgaaaacaa gtttaaattt 1981 catgaaaacg ctaaggatac tctacaattg aaagaaaaga aaaatgataa ccttggtttt 2041 acgtttatgc gcttccaaca aacgtataaa gggattcctg tgtttggagc agtagtaact 2101 gcgcacgtga aagatggcac gctgacggcg ctatcaggga cactgattcc gaatttggac 2161 acgaaaggat ccttaaaaag cgggaagaaa ttgagtgaga aacaagcgcg tgacattgct 2221 gaaaaagatt tagtggcaaa tgtaacaaag gaagtaccgg aatatgaaca gggaaaagac 2281 accgagtttg ttgtttatgt caatggggac gaggcttctt tagcgtacgt tgtcaattta 2341 aactttttaa ctcctgaacc aggaaactgg ctgtatatca ttgatgccgt agacggaaaa 2401 attttaaata aatttaacca acttgacgcc gcaaaaccag gtgatgtgaa gtcgataaca 2461 ggaacatcaa ctgtcggagt gggaagagga gtacttggtg atcaaaaaaa tattaataca 2521 acctactcta cgtactacta tttacaagat aatacgcgtg gaaatgggat tttcacgtat 2581 gatgcgaaat accgtacgac attgccggga agcttatggg cagatgcaga taaccaattt 2641 tttgcgagct atgatgctcc agcggttgat gctcattatt acgctggtgt gacatatgac 2701 tactataaaa atgttcataa ccgtctcagt tacgacggaa ataatgcagc tattagatca 2761 tccgttcatt atagccaagg ctataataac gcattttgga acggttcgca aatggtgtat 2821 ggcgatggtg atggtcaaac atttattcca ctttctggtg gtattgatgt ggtcgcacat 2881 gagttaacgc atgcggtaac cgattataca gccggactca tttatcaaaa cgaatctggt 2941 gcaattaatg aggcaatatc tgatattttt ggaacgttag tcgaatttta cgctaacaaa 3001 aatccagatt gggaaattgg agaggatgtg tatacacctg gtatttcagg ggattcgctc 3061 cgttcgatgt ccgatccggc aaagtatggt gatccagatc actattcaaa gcgctataca 3121 ggcacgcaag ataatggcgg ggttcatatc aatagcggaa ttatcaacaa agccgcttat 3181 ttgattagcc aaggcggtac gcattacggt gtgagtgttg tcggaatcgg acgcgataaa 3241 ttggggaaaa ttttctatcg tgcattaacg caatatttaa caccaacgtc caactttagc 3301 caacttcgtg ctgccgctgt tcaatcagcc actgacttgt acggttcgac aagccaggaa 3361 gtcgcttctg tgaagcaggc ctttgatgcg gtaggggtga aataaagtgg tatctcatca 3421 gtgggggatt ttttcctcca ctgatgtttt gtttgtgatc ttttaatgat gtattggggt 3481 gcaaaatgcc caaaggctta taatgttgat // LOCUS HSEGP14 3347 bp ds-DNA VRL 17-JUL-1990 DEFINITION Equine herpesvirus type 1 glycoprotein 14 (gp14) gene, complete cds. ACCESSION M34861 KEYWORDS glycoprotein 14. SOURCE Equine herpesvirus type 1 DNA. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 3347) AUTHORS Guo,P. TITLE Characterization of the gene and an antigenic determinant of equine herpesvirus type-1 glycoprotein 14 with homology to gB-equivalent glycoproteins of other herpesviruses JOURNAL Gene 87, 249-255 (1990) STANDARD simple staff_review FEATURES from to/span description pept 300 3239 glycoprotein 14 (gp14) BASE COUNT 885 a 891 c 851 g 720 t ORIGIN 1 tacaacggtt gaaacgtggt gtacgcatct caagagacta gctcgtttat gataactgcg 61 gctaaaggtg aattggtcaa ttagcgaagt ttcaaaggtt ttattgcttt gaagggagtg 121 acaggtgtga cggccacgca gcggctggcg tgaaatatat cggggagctc atcctagccg 181 ccgcagtatt ctcctcggtt ttccactgtg gagaggtgcc tcctgcgcgc agatcgtacc 241 tacccggact ccgcgccaca gtgctgcgtg agcggcattt acataaccta cgaggcgtca 301 tgtcctctgg ttgccgttct gtcggcggct ccacatgggg caattggcgc ggagacggtg 361 gtgatttacg acagcgacgt gttctctctc ctgtatgcag tgctccagca gctggctcct 421 ggatcgggag ccaactaggc aatgttggaa acttactcgc caccccccac ccgctgggaa 481 agccggcatc atcgagggtg ggcacaatag ttctagcctg tttgttgctt tttggaagct 541 gtgttgttag agccgtaccc accacgccaa gccccccaac tagtactccc acttccatgt 601 caacgcactc ccatgggaca gtagacccta cgctgctccc cacagaaacg cccgacccac 661 tcagactggc tgtgcgcgag tccggtatac tcgctgagga tggagacttt tacacctgcc 721 caccgcctac cggatccacc gtcgtacgca tcgaaccacc tagaacttgc cccaagtttg 781 accttgggag aaacttcacg gaggggattg ctgttatttt taaggaaaac atcgctccct 841 acaaattcag ggcaaacgta tactacaagg acatcgttgt aacacgtgtg tggaaaggat 901 acagccatac gtccctgtcc gacagataca atgacagggt tccggtttcg gtggaggaga 961 tcttcggtct catcgacagt aagggaaaat gttcgtcaaa ggccgagtac ctcagagata 1021 acatcatgca ccacgcgtac cacgacgacg aggacgaggt ggagcttgat ttggtgccgt 1081 ccaagtttgc aactccgggg gccagagcct ggcagaccac caacgatact acgtcttacg 1141 tggggtggat gccatggagg cactacacgt caacgtctgt caactgcatc gtcgaggagg 1201 tggaggcgcg gtccgtctac ccctacgact ccttcgccct gtccaccggt gatattgtgt 1261 acgcgtctcc gttttacggc ctgagggctg ccgctcgcat agagcacaat agctacgcgc 1321 aggagcgttt caggcaagtt gaagggtaca ggccccgcga cttagacagt aaactacaag 1381 ccgaagagcc ggttaccaaa aattttatca ctaccccgca tgtcaccgtc agctggaact 1441 ggaccgagaa gaaagtcgag gcgtgtacgc tgaccaaatg gaaagaggtc gacgaactcg 1501 tcagggacga gttccgcggg tcctacagat ttactattcg atccatctcg tcttacttta 1561 tcagtaacac tactcaattt aagttggaaa gtgcccccct tactgaatgt gtatccaaag 1621 aagcaaagga agccatagac tcgatataca aaaagcagta cgagtctacg cacgtcttta 1681 gcggtgatgt ggaatattac ctggcacgcg gggggttctt aattgcattc agacctatgc 1741 tctccaacga actcgccagg ctgtacctga acgagcttgt gagatctaac cgcacctacg 1801 acctaaaaaa tctattgaac cccaatgcaa acaataacaa taacaccacg cgaagacgca 1861 ggtctctcct gtcagtacca gaacctcagc caacccaaga tggtgtgcat agagaacaaa 1921 ttctacatcg cttgcacaaa cgagcagtgg aggcaacggc aggtaccgat tcttccaacg 1981 tcaccgccaa acagctggag ctcatcaaaa ccacgtcgtc tatcgagttt gccatgctac 2041 agtttgcata cgatcacatc caatcccacg tcaatgaaat gctaagtaga atagcaactg 2101 cgtggtgtcc cctccaaaac aaagagcggc ccctatggaa cgaaatggtg aagattaccc 2161 cgagcgccat agtctccgca acccttgacg agcgagttgc agcgagggtc ctgggggacg 2221 tgatagctat aacgcactgc gccaaaatag agggcaacgt gtacttgcaa aactccatgc 2281 gctcgatgga cagtaacact tgctactccc gcccccccgt aacatttaca attactaaga 2341 atgcaaacaa cagagggtcg atagaaggcc agctgggaga ggagaacgag attttcacgg 2401 agcgcaagct gatcgagccg tgcgccctca atcagaagcg ctactttaag tttggcaaag 2461 agtacgttta ctacgagaac tacacgttcg tccgcaaagt gccccccacg gaaatcgagg 2521 ttatcagcac gtacgttgaa ctaaacttga cccttttgga agaccgcgag tttctgcccc 2581 tggaggtgta cacgcgggct gagctggagg acaccggcct gctagactac agcgaaatac 2641 agcgccgcaa ccagctccac gctctcaggt tttacgacat cgacagcgtg gtcaacgtgg 2701 acaataccgc agtgattatc aggggatcgc cagctttttc aagggcctgg gtaaagtggg 2761 ggaggccgtg ggaacgctcg ttctcggcgc gcggcgctgt tgtttcaacc gtatctggaa 2821 tagcttgctt tttaaacaac ccatttgggg ggctagccat cggcctgctg gtaatcgccg 2881 gcctggtagc tgcgtttttt gcttacagat atgtaatgca gatccgcagt aaccccatga 2941 aagctctata ccccataaca acaaaggcct tgaaaaacaa agccaaaact tcctacggcc 3001 agaacgagga ggacgatggg agcgactttg atgaggccaa gcttgaagag gctcgcgaaa 3061 tgatcaaata catgtctatg gtttcggccc tggaaaagca ggaaaagaaa gctataaaga 3121 aaaacagtgg ggttggcctg atcgccagta acgtctcaaa gctggccctg cgaaggcgcg 3181 gtcccaaata tacccgactc caacagaacg ataccatgga aaatgaaaaa atggtttaaa 3241 catgtttaat aaatattatg acacgtactc aaagtgtgac ctcatatttg cataaccact 3301 tctagttccg gcccaaggat atttaagcct agtatctccg ccgaagg // LOCUS HUMHBGAA 1227 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human A-gamma-globin gene, 3' end. ACCESSION M33200 KEYWORDS A-gamma-globin. SOURCE Human (hereditary persistence of fetal hemoglobin individual II-1) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1227) AUTHORS Gelinas,R.E., Rixon,M., Magis,W. and Stamatoyannopoulos,G. TITLE Gamma gene promoter and enhancer structure in Seattle variant of hereditary persistence of fetal hemoglobin JOURNAL Blood 71, 1108-1112 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 3 A-gamma-globin (AA at 1) /hgml_locus_uid="LK0092S" /nomgen="HBG1" /map="11p15.5" mut 794 794 t in wt; c in mutant mut 970 970 c in wt; a in mutant mut 1186 1186 a in wt; g in mutant BASE COUNT 366 a 204 c 294 g 363 t ORIGIN 1 tgagcctctt gcccatgatt cagagctttc aaggataggc tttattctgc aagcaataca 61 aataataaat ctattctgct gagagatcac acatgatttt cttcagctct tttttttaca 121 tctttttaaa tatatgagcc acaaagggtt tatattgagg gaagtgtgta tgtgtatttc 181 tgcatgcctg tttgtgtttg tggtgtgtgc atgctcctca tttattttta tatgagatgt 241 gcattttgtt gagcaaataa aagcagtaaa gacacttgta cacgggagtt ctgcaagtgg 301 gagtaaatgg tgtaggagaa atccggtggg aagaaagacc tctataggac aggacttctc 361 agaaacagat gttttggaag agatgggaaa aggttcagtg aagacctggg ggctggattg 421 attgcagctg agtagcaagg atggttctta atgaagggaa agtgttccaa gctttaggaa 481 ttcaaggttt agtcaggtgt agcaattcta ttttattagg aggaatacta tttctaatgg 541 cacttagctt ttcacagccc ttgtggatgc ctaagaaagt gaaattaatc ccatgccctc 601 aagtgtgcag attggtcaca gcatttcaag ggagagacct cattgtaaga ctctggggga 661 ggtggggact taggtgtaag aaatgaatca gcagaggctc acaagtcagc atgagcatgt 721 tatgtctgag aaacagacca gcactgtgag atcaaaatgt agtgggaaga atttgtacaa 781 cattaattgg aaggtttact taatggaatt tttgtatagt tggatgttag tgcatctcta 841 taagtaagag tttaatatga tggtgttacg gacctaatgt ttgtgtctcc tcaaaattca 901 catgctgaat ccccaactcc caactgacct tatctgtggg ggaggctttt gaaaagtaat 961 taggtttagc tgagctcata agagcagatc cccatcataa aattattttc cttatcagaa 1021 gcagagagac aagccatttc tctttcctcc cggtgaggac acagtgagaa gtccgccatc 1081 tgcaatccag gaagagaacc ctgaccacga gtcagccttc agaaatgtga gaaaaaactc 1141 tgttgttgaa gccacccagt cttttgtatt ttgttatagc accttacact gagtaaggca 1201 gatgaagaag gagaaaaaaa taagctt // LOCUS HUMHBQ1A 1114 bp ds-DNA PRI 17-JUL-1990 DEFINITION Human theta-1-globin gene, complete cds. ACCESSION M33022 KEYWORDS theta-1-globin. SOURCE Human black female with alpha-thal-2 heterozygosity white cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1114) AUTHORS Gonzalez-Redondo,J.M., Han,I.S., Gu,Y.-C. and Huisman,T.H.J. TITLE Nucleotide sequence of the human theta-1-globin gene JOURNAL Biochem. Genet. 26, 207-211 (1988) STANDARD simple staff_review FEATURES from to/span description pept 359 453 theta-1-globin, exon 1 /hgml_locus_uid="LV0155X" /nomgen="HBQ1" /map="16p13.3" 538 742 theta-1-globin, exon 2 852 980 theta-1-globin, exon 3 IVS 454 537 theta-1-globin intron A IVS 743 851 theta-1-globin intron B BASE COUNT 166 a 386 c 393 g 169 t ORIGIN 1 atcccagtta ctcgggaggc tgaggcagga gaatcgtttg aacccgggag gcggaggttg 61 cagtgagccg gaatggcgcc actgcactca ccgcacccgg ccaatttttg tgtttttagt 121 agagactaaa taccatatag tgaacaccta agacgggggg ccttggatcc agggcgattc 181 agagggcccc ggtcggagct gtcggagatt gagcgcgcgc ggtcccggga tctccgacga 241 ggccctggac ccccgggcgg cgaagctgcg gcgcggcgcc ccctggaggc cgcgggaccc 301 ctggccggtc cgcgcaggcg cagcggggtc gcagggcgcg gcgggttcca gcggggggat 361 ggcgctgtcc gcggaggacc gggcgctggt gcgcgccctg tggaagaagc tgggcagcaa 421 cgtcggcgtc tacacgacag aggccctgga aaggtgcggc aggctgggcg cccccgcccc 481 caggggccct ccctccccaa gccccccgga cgcgcctcac ccacgttcct ctcgcaggac 541 cttcctggct ttccccgcca cgaagaccta cttctcccac ctggacctga gccccggctc 601 ctcacaagtc agagcccacg gccagaaggt ggcggacgcg ctgagcctcg ccgtggagcg 661 cctggacgac ctaccccacg cgctgtccgc gctgagccac ctgcacgcgt gccagctgcg 721 agtggacccg gccagcttcc aggtgagcgg ctgccgtgct gggcccctgt ccccgggagg 781 gccccggcgg ggtgggtgcg gggggcgtgc ggggcgggtg caggcgagtg agccttgagc 841 gctcgccgca gctcctgggc cactgcctgc tggtaaccct cgcccggcac taccccggag 901 acttcagccc cgcgctgcag gcgtcgctgg acaagttcct gagccacgtt atctcggcgc 961 tggtttccga gtaccgctga actgtgggtg ggtggccgcg ggatccccag gcgaccttcc 1021 ccgtgtttga gtaaagcctc tcccaggagc agccttcttg ccgtgctctc tcgaggtcag 1081 gacgcgagag gaaggcgccg cccctcccca agga // LOCUS HUMITIH1A 1149 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human inter-alpha-trypsin inhibitor heavy chain mRNA, partial cds. ACCESSION M33033 KEYWORDS inter-alpha-trypsin inhibitor heavy chain. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1149) AUTHORS Salier,J.-P., Diarra-Mehrpour,M., Sesbouee,R., Bourguignon,J. and Martin,J.-P. TITLE Human inter-alpha-trypsin inhibitor: Isolation and characterization of heavy (H) chain cDNA clones coding for a 383 amino-acid sequence of the H chain JOURNAL Biol. Chem. Hoppe-Seyler 369, 15-18 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 1149 inter-alpha-trypsin inhibitor heavy chain (AA at 1) /hgml_locus_uid="LE0221G" /nomgen="ITIH1" /map="3p21.2-p21.1" BASE COUNT 332 a 292 c 253 g 272 t ORIGIN 1 ggaggcacaa acatcaacga agcactccta cgggcaatct tcattttgaa tgaagccaat 61 aacttgggac tgttagaccc caactccgtc tcgctgatca ttttggtttc tgatggagat 121 ccaacagtgg gcgaactaaa actgtcaaaa attcagaaaa acgttaagga gaacatccaa 181 gacaatatct ccttgttcag tttgggcatg ggatttgatg tggactatga ttttttgaag 241 agactgtcca atgaaaacca tggaattgca caaaggattt atggaaacca ggacacgtct 301 tcccagctta agaaattcta caaccaggtc tccactccat tgctccggaa tgttcagttc 361 aactatcccc atacatcagt cacggacgtc actcaaaaca atttccataa ctactttgga 421 ggctcagaga ttgtggtggc aggaaaattt gaccctgcta aattggatca aatagagagc 481 gttatcacgg cgacttcggc taacacgcag ttagtcttgg agaccctggc ccagatggac 541 gacttgcagg attttctatc gaaagacaag catgcagatc ccgatttcac caggaaactg 601 tgggcctatc taaccatcaa ccaactgcta gctgaacgaa gcctggctcc tacagctgcc 661 gccaagagaa gaattacaag atcgatcctg cagatgtctc tagaccacca cattgtgact 721 ccgctgacct cgctggtgat cgagaacgag gctggggatg agcgcatgct ggcggatgcc 781 ccaccgcagg atccctcctg ctgctcaggg gccctgtatt acggcagcaa agtggttcca 841 gattccaccc cgtcttgggc caatccttca gcaacgcccg tgatctccat gctggcacaa 901 ggatctcagg tgctagagtc cacgccaccc ccacatgtga tgagagttga aaatgaccca 961 cattccatca tttatctacc aaaaagccaa aagaacattt gtttcaatat tgactcagaa 1021 cctggaaaaa tcctcgacct ggcttctgac ccagaatcag gaattgtagt caacggtcag 1081 cttgttggtg ccaagaagcc caacaatgga aaactaagca cctattttgg aaaactggga 1141 ttttatttc // LOCUS HUMPTHROM 327 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human thrombin mRNA, 5'end. ACCESSION M33031 KEYWORDS serine protease; thrombin. SOURCE Human, cDNA to mRNA, clone pIIH13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 327) AUTHORS MacGillivray,R.T.A., Irwin,D.M., Guinto,E.R. and Stone,J.C. TITLE Recombinant genetic approaches to functional mapping of thrombin JOURNAL Ann. N.Y. Acad. Sci. 485, 73-79 (1986) STANDARD simple staff_review FEATURES from to/span description pept 28 > 327 thrombin precursor /hgml_locus_uid="LD0134L" /nomgen="F2" /map="11p11-q12" sigp 28 156 thrombin signal peptide matp 157 > 327 prothrombin BASE COUNT 60 a 97 c 109 g 61 t ORIGIN 1 ccgtagtgac ccaggagctg acacactatg gcccgcatcc gaggcttgca gctgcctggc 61 tgcctggccc tggctgccct gtgtagcctt gtgcacagcc agcatgtgtt cctggctcct 121 cagcaagcac ggtcgctgct ccagcgggtc cggcgagcca acaccttctt ggaggaggtg 181 cgcaagggca acctggagcg agagtgcgtg gaggagacgt gcagctacga ggaggccttc 241 gaggctctgg agtcctccac ggctacggat gtgttctggg ccaagtacac agcttgtgag 301 acagcgagga cgcctcgaga taagctt // LOCUS MUSCC3A 312 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse complement component C3 mRNA, partial cds. ACCESSION M33032 KEYWORDS complement component C3. SOURCE Mouse liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Fey,G.H., Wiebauer,K. and Domdey,H. TITLE Amino acid sequences of mouse complement C3 derived from nucleotide sequences of cloned cDNA JOURNAL Ann. N.Y. Acad. Sci. 421, 307-312 (1983) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 complement component C3 precursor (AA at 1) matp < 1 27 complement component C3-beta subunit (AA at 1) matp 40 273 complement component C3-alpha subunit matp 274 > 312 complement component C3-alpha' subunit BASE COUNT 90 a 77 c 90 g 55 t ORIGIN 1 gatcttgagt gcaccaagcc agcagcccgc cgccgtcgct cagtacagtt gatggaaaga 61 aggatggaca aagctggtca gtacactgac aagggtcttc ggaagtgttg tgaggatggt 121 atgcgggata tccctatgag atacagctgc cagcgccggg cacgcctcat cacccagggc 181 gagaactgca taaaggcctt catagactgc tgcaaccaca tcaccaagct gcgtgaacaa 241 cacagaagag accacgtgct gggcctggcc aggagtgaat tggaggaaga cataattcca 301 gaagaagata tt // LOCUS MUSN038A 1260 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse nucleolar protein N038 mRNA, complete cds. ACCESSION M33212 KEYWORDS nucleolar protein N038. SOURCE Mouse teratocarcinoma stem cell line F9, cDNA to mRNA, clone lambda-FML-185.19. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1260) AUTHORS Schmidt-Zachmann,M.S. and Franke,W.W. TITLE DNA cloning and amino acid sequence determination of a major constituent protein of mammalian nucleoli: Correspondence of the nucleoplasmin-related protein N038 to mammalian protein B23 JOURNAL Chromosoma 96, 417-426 (1988) STANDARD simple staff_review FEATURES from to/span description pept 79 957 nucleolar protein N038 mRNA < 1 1260 nucleolar protein N038 mRNA BASE COUNT 419 a 214 c 306 g 321 t ORIGIN 1 ggcgcgtctg ttctgtggaa caggaggcag ttgttttccg tccggcttct cccacaccga 61 agtgcgcgcc tccacctcat ggaagactcg atggatatgg acatgagtcc tcttaggcct 121 cagaactacc ttttcggctg tgaactaaag gctgacaaag actatcactt taaagtggat 181 aatgatgaaa atgagcacca gttgtcatta agaacggtca gtttaggagc aggggcaaaa 241 gatgagttac acatcgtaga ggcagaagca atgaactatg aaggcagtcc aattaaagta 301 acactggcaa ctttgaaaat gtctgtacaa ccaacagttt ccctaggggg ctttgaaatt 361 acaccacctg tggtcttacg gttgaagtgt ggttcagggc ctgtgcacat tagtggacag 421 catctagtag ctgtagagga agatgcagag tctgaagatg aagatgagga ggacgtaaaa 481 ctcttaggca tgtctggaaa gcgatctgct cctggaggtg gtaacaaggt tccacagaaa 541 aaagtaaaac ttgatgaaga tgatgaggac gatgatgagg acgatgagga tgatgaggat 601 gatgatgatg atgattttga tgaagaggaa actgaagaaa aggtcccagt gaagaaatct 661 gtacgagata ccccagccaa aaatgcacaa aaatcaaacc aaaatggaaa agacttaaaa 721 ccatcaacac cgagatcaaa gggtcaagag tccttcaaaa aacaggaaaa gactcctaaa 781 acaccaaaag gacctagttc tgtagaagac attaaggcaa aaatgcaagc aagtatagaa 841 aaaggcggtt ctcttcccaa agtggaagcc aagttcatta attatgtgaa gaattgtttc 901 cggatgactg accaggaggc tattcaagat ctctggcagt ggaggaaatc tctttaagaa 961 aagggtttaa acagtttgaa atattctgtc ttcatttctg taatagttaa tatctggctg 1021 tcctttttat aatgcaaagt gagaactttc cctactgtgt ttgataaatg ttgtccaggt 1081 tcacttgcca agaatgtgtt gtctaaaatg cctgtttagt tttcaaggat ggaactccac 1141 cctttacttg gttttaagta tgtatggaat gttatgatag gacatagtaa tagtggtcag 1201 atgtggaaat ggtagggaga caaatataca tgtgaaataa actcagtatt ttaataaagt // LOCUS RATPOS 1804 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat type-2A protein phosphatase catalytic subunit mRNA, complete cds. ACCESSION M33114 KEYWORDS type-2A protein phosphatase catalytic subunit. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1804) AUTHORS Kitagawa,Y., Tahira,T., Ikeda,I., Kikuchi,K., Tsuiki,S., Sugimura,T. and Nagao,M. TITLE Molecular cloning of cDNA for the catalytic subunit of rat liver type 2A protein phosphatase, and detection of high levels of expression of the gene in normal and cancer cells JOURNAL Biochim. Biophys. Acta 951, 123-129 (1988) STANDARD simple staff_review FEATURES from to/span description pept 114 1043 type-2A protein phosphatase catalytic subunit mRNA < 1 1804 type-2A protein phosphatase catalytic subunit mRNA BASE COUNT 482 a 391 c 437 g 494 t ORIGIN 1 ctggggccgc aggaagcacc ccggggagcg gcggcggcgt gtgcgtgtgg cccgggtgcg 61 ggcggcggcg cgggagcagc gcagagcggc agccggttcg ggcgggcggc atcatggacg 121 agaagttgtt caccaaggag ctggaccagt ggatcgagca gctgaacgag tgcaagcagc 181 tctccgagtc ccaggtcaag agcctctgcg agaaggctaa agaaatcctg acaaaagaat 241 ctaatgttca ggaggttcga tgtccagtca ctgtgtgtgg agatgtgcat gggcaatttc 301 atgacctcat ggaactcttt agaattggtg gtaaatcacc agatacaaat tacttgttta 361 tgggagacta tgtggacaga ggatattact cagttgaaac agttacactg cttgtagctc 421 ttaaggttcg ttaccgagag cgtatcacca tactccgagg gaatcacgag agcagacaga 481 tcacacaagt ttatggtttc tacgatgagt gtttaaggaa atacggaaat gcaaatgttt 541 ggaaatactt cacagacctt tttgactacc ttcctctcac tgccttggtg gatgggcaga 601 tcttctgtct acatggtggt ctttcaccat ccatagacac actggatcac atccgagcac 661 ttgatcgcct acaagaagtt cctcatgagg gtccaatgtg tgacttgctg tggtcagatc 721 cagatgaccg tggtggctgg gggatatctc ctcggggagc tggttatacc tttggccaag 781 atatttctga gacatttaat catgccaatg gcctcacgtt ggtgtccaga gctcaccagc 841 tggtgatgga gggatataac tggtgccatg accggaatgt agtaacaatt ttcagtgctc 901 caaactattg ctatcgttgt ggtaaccaag ctgcaatcat ggaacttgat gacactctta 961 agtattcttt cttgcagttc gatccagcac ctcgtagagg cgagccacat gtcactcgtc 1021 gtaccccaga ctacttcctg taatgaaagt ttaaccttgt acagtattgc catgaacacc 1081 gtctgttgac ctaatggaat cgggaagagc agcagtaact ccaaagtgtc agaaatagtt 1141 aacattcaaa cttgtttcca cacggaccaa aagatgtgcc atataaaata caaagcctct 1201 tgtcatcaac agccgtgacc actttagaat gaaccagttc attgcatgct gacgcgacat 1261 tgttggtcaa gaatccagtt tctggcatag cgctatttgt agttactttt gctttcttga 1321 gagactgcag atctaggatg taacattaac acctgtgagt ccagttgact tccacttagc 1381 tgtagcttac tcagcatgac tgtagatgag gatagcaaac aatcattgga gcttaatgaa 1441 catttttaaa tgagtaccaa ggcctcccct cttgttgtgt tctttcaggg atactattaa 1501 tttaattgta tgatttctct gcactcagtt tctcccttct caaatctcgg ccccgcgttg 1561 ttctttgtta ctgtcagaaa acctggtgag ttgttttgaa cagaactgtc tccctcctgt 1621 aagatgatgt actgcacaag tcaccgcagt gttttcataa taaacttgag aactgagaaa 1681 gtcaggtttg aattgtatca gtgggcacga ctggtgctgt ttattaaaca agataaatct 1741 attgatcaat ttcagaattt gtagaattcc aggtaaagaa aaataaagat caaggccact 1801 atat // LOCUS RATSCP2 1409 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat sterol carrier protein-2 (SCP-2)mRNA, complete cds. ACCESSION M34728 KEYWORDS sterol carrier protein-2. SOURCE Rat liver, cDNA to mRNA, clone SP43. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1409) AUTHORS Billheimer,J.T., Strehl,L.L., Davis,G.L., Strauss,J.F.III. and Davis,L.G. TITLE Characterization of a cDNA encoding rat sterol carrier protein-2 JOURNAL DNA Cell Biol. 9, 159-165 (1990) STANDARD simple staff_review FEATURES from to/span description pept 307 1128 sterol carrier protein-2 (SCP-2) mRNA < 1 1409 sterol carrier protein-2 mRNA BASE COUNT 387 a 300 c 390 g 332 t ORIGIN 2 bp upstream of EcoRI site. 1 ggaattccga acaaaggttg aacactttgc aaaaattgga tggaaaaatc ataaacactc 61 agttaataac ccgtattccc agttccaaga tgaatacagc ttagatgaga taatgaaatc 121 aaggccagtt ttcgattttc tgactgtctt acaatgctgt cccacctcag atggtgccgc 181 agcagcaatt gtgtctagtg aggagtttgt gcagaagcat ggcctgcagt ccaaagctgt 241 ggaaattgtg gcacaggaga tggtgactga catgcccagt acatttgaag aaaaagtgtt 301 attaaaatgg ttggctatga tatgagtaaa gaagctgcca ggaagtgcta tgagaagtcc 361 ggcctgggtc ccagtgatgt cgacgtgata gagcttcacg attgcttctc taccaatgaa 421 ctcctgactt atgaagcact ggggctctgt ccagaaggac aaggtggagc actggtggac 481 agaggggaca acacttacgg aggaaagtgg gtcataaacc ctagtggagg cctcatctcc 541 aagggacacc cactgggtgc cacaggtctg gctcagtgcg cggagctctg ctggcagctg 601 agaggcgaag ccggaaagag gcaggttcct ggggcaaagg tggctctgca gcacaattta 661 ggccttggag gagctgctgt tgtcaccctc tacagaatgg gttttcccga agctgccagc 721 tccttcagaa cgcaccagat ttcagctgct cccaccagct ctgcagggga tggattcaag 781 gcaaatctca tttttaagga aatcgagaag aagcttgaag aggaagggga agagttcgtg 841 aagaaaatcg gtggcatttt tgccttcaaa gtgaaggatg gccccggggg caaagaagct 901 acgtgggtgg tggacgtgaa gaacggcaaa ggatcggtgc ttccggattc agataagaag 961 gctgactgca caatcaccat ggctgactca gacttgctgg ctttgatgac tggtaaaatg 1021 aaccctcagt cggccttctt tcaaggtaaa ctgaaaattg ccggtaacat gggcctggcc 1081 atgaaactgc aaagcctgca gcttcagccg gacaaagcta agctgtgaag agtccctttg 1141 gcaacctcag gacatcaaga tgagatgtgt ggatacgtag aaatccacgt ctccctgtca 1201 ggacttagac tgacacttcc tgaatagcat gagatagatt tcttgctagg tggctatggc 1261 caattgtatt tcccccaagc tgggggtgca aagggcctcc caggctacac tgctgctttg 1321 aggacttgca ttctactgtg cttcatgaag ctactatgtt aatgatggtt tggggtaaac 1381 ttgagtttca gaataaagtt cagaatagt // LOCUS SYNPSBAII 556 bp ds-DNA BCT 17-JUL-1990 DEFINITION Synechococcus sp. photosystem II D1 protein (psbAII) gene, 5' end. ACCESSION M34833 KEYWORDS D1 protein; photosystem II. SOURCE Synechococcus (strain PCC 7942) DNA. ORGANISM Synechococcus sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Chroococcales. REFERENCE 1 (bases 1 to 556) AUTHORS Bustos,S.A., Schaefer,M.R. and Golden,S.S. TITLE Different and rapid responses of four cyanobacterial psbA transcripts to changes in light intensity JOURNAL J. Bacteriol. 172, 1998-2004 (1990) STANDARD simple staff_review FEATURES from to/span description pept 81 425 ORF1 pept 527 > 556 photosystem II D1 protein (psbAII) mRNA 59 > 556 psbAII mRNA (alt.) mRNA 478 > 556 psbAII mRNA (alt.) BASE COUNT 136 a 145 c 142 g 133 t ORIGIN 1 ttccgtgacg gctactgcca gcatgccgag cctgatgtgt gacacctaag atcactccag 61 ttctctttgg aaactggctg atgagtgaag acaccatctt tggcaagatc atccggcgcg 121 agattccagc agacattgtt tatgaagatg atctctgtct ggcttttcga gatgtggcac 181 cccaagcgcc ggttcacatt ctggtgattc ccaagcaacc aattgccaac cttttggaag 241 cgacagcaga acatcaagcg ctgctgggtc atttgttgct gactgtaaag gcgatcgcgg 301 cccaagaagg actcaccgag ggctaccgca ccgtgattaa cacgggccct gcgggtgggc 361 aaaccgttta ccacctgcat attcacttac tgggcgggcg atcgctggct tggccgcccg 421 gctgagaaaa gtctgaaagt tctttacaaa actcaatctg cttgttagat tttactcacg 481 aggctattaa gtctcgtaaa tagttcaact aaggactcat cgcaaaatga cgactgcatt 541 gcagcggcgc gagagc // LOCUS ACCPCACGH 2391 bp ds-DNA BCT 17-JUL-1990 DEFINITION A.calcoaceticus gamma-carboxymuconolactone decarboxylase (pcaC) and protocatechuate 3,4-dioxygenase alpha and beta subunit (pcaGH) genes, complete cds. ACCESSION M33798 KEYWORDS gamma-carboxymuconolactone decarboxylase; protocatechuate 3,4-dioxygenase. SOURCE A.calcoaceticus DNA, clone pZR2. ORGANISM Acinetobacter calcoaceticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 2391) AUTHORS Hartnett,C., Neidle,E.L., Ngai,K.-L. and Ornston,N. TITLE DNA sequences of genes encoding Acinetobacter calcoaceticus protocatechuate 3,4-dioxygenase: Evidence indicating shuffling of genes and of DNA sequences within genes during their evolutionary divergence JOURNAL J. Bacteriol. 172, 956-966 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 324 728 gamma-carboxymuconolactone decarboxylase (pcaC) (EC 4.1.1.44) pept 750 1463 protocatechuate 3,4-dioxygenase, alpha subunit (pcaH) (EC 1.13.11.3) pept 1491 2120 protocatechuate 3,4-dioxygenase, beta subunit (pcaG) (EC 1.13.11.3) ORF < 1 203 ORF (AA at 3) binding 312 315 ribosomal binding site (put.) binding 737 740 ribosomal binding site (put.) binding 1478 1482 ribosomal binding site (put.) BASE COUNT 670 a 463 c 533 g 725 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttatca aacccaacgt tacttgctct gtttcatttt atgtgcaggt attgccgtca 61 atggtgctca atcttcaatg ccagtattga gtgcgcgttt ctatccaacg caatgtcgtg 121 caacaggtgt ggcatggatg tctggaattg gccgttttgg tcgccgtatt tggtgcatgg 181 attggtgcag ttttattagg taataactgg tcatttacca tgatcctgag catgttgata 241 tcctgctcgt gccgctgcaa ttgcaatttt tgttaaatcc cttgttggca catacagatg 301 caacttgatg tgaggtttct cttatgaatg atgaacaacg ttataaacaa gggctggaag 361 tccgtacaga agttctgggt gaaaagcatg ttaaccgttc acttgagaac ctaaacgact 421 tcaatcagga ttttcaaaac tttattagcc gttttgcatg gggagaggtt tggtctcgtc 481 ctggacttcc tcgtcatacg cgtagtttgg tcacaattgc agtgttgttg gcgcttgggc 541 gtgaagatga gctgcgtatc gatttacgtg cctgtttcaa caatggtgtg accaaagatg 601 aattaaaaga gttgattcta cattgttcac tttatgcagg cttacccgct tcaaatgctg 661 cgatgcatat ggcagaagag gtctttaagg acttgggaat tgcacctgaa aaagtaaaca 721 aggattaatg gatttagagg gagtagaaaa tgtctcaaat tatttgggga gcttacgctc 781 agtcgaatac cgaagatcat ccgccagctt atcgaccagg ttataaaacc agtgtgttac 841 gctcgcctaa gaatcgactc atttctattg cagaaacttt aagtgaagtc actgcaccac 901 attttagtgc tgataaattt ggtccaaaag acaatgactt aatcttgaac tatgccaaag 961 atggtttgcc gattggtgaa cgtgtcattg tgcatggtta tgtacgtgac cagtttggtc 1021 gacctgtaaa aaatgcactt gttgaagtat ggcaagccaa tgcatctggt cgttatcgtc 1081 atccaaatga tcaatatatc ggtgccatgg atcctaattt cggtggttgt ggccgtatgt 1141 tgactgatga taatggttat tatgtattcc gtacgattaa accaggtcca tacccatggc 1201 gtaaccgtat caatgaatgg cgtcctgctc acattcactt ctctttaatt gcagatggtt 1261 gggcacaacg tttaatttcg cagttctatt ttgaaggcga tacattgatt gattcttgcc 1321 caatcctgaa aaccattcct tctgagcaac aacgtcgtgc gctaattgca ttagaagaca 1381 agagcaattt cattgaagct gacagccgtt gttatcgctt tgacattacg cttcgtggtc 1441 gcgcggactt acttcgaaaa tgacttaact taatggatgg agtaaaaaac atgaatggtt 1501 ggaattttca ggaactcaaa gaaaccccat ctcaaacagg tggtccatat gtccatattg 1561 gtcttttgcc aaaacaagcc aatatcgaag tatttgaaca caatttaggt aacaaccttg 1621 tacaggacaa tacgcaaggt caacgcatcc gattagaagg tcaagtgttt gatgggctta 1681 gtttacctct acgtgacgta ctgattgaaa tctggcaagc agataccaat ggcgtttacc 1741 caagtcaagc tgatactcaa ggtaaacaag tcgatccgaa ctttttgggt tggggccgta 1801 caggtgcaga ttttggtaca ggcttctgga gttttaatac cattaagcca ggtgctgttc 1861 caggccgtaa aggttcaact caagcaccac atatttcact gattattttt gcgcgtggta 1921 tcaacattgg tcttcacact cgtgtctatt ttgatgacga agcagaagca aatgcaaaag 1981 atcctgtttt aaacagtatc gagtgggcga cacgtcgtca aacactggtt gccaaacgtg 2041 aagagcgtga tggtgaagtg gtttatcgtt ttgatattcg tattcaaggt gaaaacgaaa 2101 cggtattttt cgatatctaa atttttgact atggaacggg ccgtatatct ttcggtatac 2161 agcctccata attttacaag gaaacgtaaa tgttcaatac aaacaaactg gttaaacaac 2221 tttgtctaag tggtgcgttg ttgatgtcat tgcctatatt gtcaacgacc tatgcggctg 2281 aaaatactgt accagcatct aaatcaactt atgttgtaaa aaatttaaac attggtgatt 2341 tacctgttaa aaccctcgtg ccaatcactg ctaaaacaag agagcaagct t // LOCUS HAMSAA3A 530 bp ss-mRNA ROD 17-JUL-1990 DEFINITION M.auratus serum amyloid A (SAA3) mRNA, complete cds. ACCESSION M33431 KEYWORDS serum amyloid A protein. SOURCE M.auratus (strain LVG) embryo fibroblast, cDNA to mRNA, clone pAA825. ORGANISM Mesocricetus auratus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 530) AUTHORS Gervais,C. and Suh,M. TITLE Serum amyloid A protein-related mRNA expression in herpes simplex virus type 2 transformed hamster cells JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Suh, 30-MAR-1990. FEATURES from to/span description pept 42 410 serum amyloid A signal 510 515 polyA signal BASE COUNT 147 a 122 c 140 g 121 t ORIGIN 1 ggggacctct agagtcactc accaggcatc tcgaaagcag gatgaagcct ttccttgcca 61 tcattttttg cttcttgatc ctgggagttg acagccaaag gtggttccag ttcatgaaag 121 aagctggcca agggagtaca gacatgtgga gagcttactc tgacatgaga gaggccaact 181 ggaaaaactc agataaatac ttccatgccc gggggaacta tgatgctgcc aaaaggggac 241 caggaggagc ctgggctgct aaagtgatca gtgatgccag agagggtatt cagaggttca 301 caggacgtgg agcagcggac tcaagagctg accagtttgc caacaaatgg ggccggagtg 361 gtaaagaccc caaccacttt cgacctgctg gcctgcctag taaatactga gtttcatctt 421 catgctattc ccagccatgc accccaaggt tgcatttact agattaagtt agttcctgag 481 ctctggacgt gtaagcaccc gaaatttaga ataaatacct gtgaaatgct // LOCUS HEIBEXBA 795 bp ds-DNA BCT 17-JUL-1990 DEFINITION H.influenza (strain RM153) capsulation protein (bexB) gene, 5' end. ACCESSION M33787 KEYWORDS capsulation protein. SOURCE H.influenza (type b, strain RM153) DNA. ORGANISM Haemophilus influenzae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 795) AUTHORS Kroll,J.S. and Moxon,E.R. TITLE Capsulation in distantly related strains of Haemophilus influenzae type b: genetic drift and gene transfer at the capsulation locus JOURNAL J. Bacteriol. 172, 1374-1379 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 795 capsulation protein (bexB) BASE COUNT 188 a 133 c 177 g 297 t ORIGIN 1 atgcaatatg gtgataaaac aacatttaaa cagtcattag ctattcaagg acgggtgatt 61 aacgcgttgc ttatgcgtga aataatcact cgttatgggc gtcaaaatat tggctttttt 121 tggctatttg ttgagccttt gctcatgact ttttttattg taatgatgtg gaaatttatc 181 cgtgctgata aattttctac tttaaatatg attgcctttg tgatgacggg gtatccaatg 241 gcaatgatgt ggcgaaatgc ttcaaaccgt gcgattggtt ccatttctgc aaacttgagt 301 ttactttatc accgaaatgt tcgagtgcta gataccattt ttactcgcgt gttgcttgag 361 gttgctggtg cttctattgc tcaaattctt tttatggcaa tcttagtcat gattgattgg 421 attgatgcac cgcatgatgt gttttatatg ctcattgcat ggttcctcat ggcaatgttt 481 gcttttggct tggggttaat catttgtgcg attgctcagc aatttgatgt gtttggtaaa 541 atttggggaa cactcagttt cgttttatta ccaatttctg gcgcattctt cttcgtccat 601 aaccttccag cacaggcaca atctattgca ctttggtttc cgatgattca tggtacagaa 661 atgttccgcc atggctattt tggcgataca gtggttactt acgaaagtat aggctttttg 721 gttgtgagtg atctagcctt gttgttactg ggcttagtga tggttaaaaa ctttagtaaa 781 ggagttgagc cacaa // LOCUS HEIBEXBB 795 bp ds-DNA BCT 17-JUL-1990 DEFINITION H.influenza (strain RM926) capsulation protein (bexB) gene, 5' end. ACCESSION M33788 KEYWORDS capsulation protein. SOURCE H.influenza (type b, strain RM926 ATCC 8468) DNA, clones lambda-926.[8,11,14]. ORGANISM Haemophilus influenzae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 795) AUTHORS Kroll,J.S. and Moxon,E.R. TITLE Capsulation in distantly related strains of Haemophilus influenzae type b: genetic drift and gene transfer at the capsulation locus JOURNAL J. Bacteriol. 172, 1374-1379 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 795 bexB protein BASE COUNT 188 a 128 c 178 g 301 t ORIGIN 1 atgcaatatg gtgatcaaac aacgtttaaa cagtcattag cgattcaagg aagagtgatt 61 aatgcattac ttatgcgtga aataatcact cgttatgggc gcaaaaatat tggctttcta 121 tggctatttg ttgagccttt actcatgact ttttttattg tgatgatgtg gaaatttatt 181 cgtgctgata aattttcaac tttaaatatg attgcttttg tcatgaccgg gtatcctatg 241 gcgatgatgt ggcgaaatgc ttctaatcga gctattggtt cgatttctgc aaacttgagt 301 ttactttatc accgaaatgt tcgagtgcta gatactatct ttactcgagt attgcttgag 361 gttgcaggtg cttcgattgc tcaaatcctt tttatggcag ttttagtgct tattggttgg 421 attgatgcac cacgtgatgt gttttatatg cttatggcat ggtttctcat ggctatgttt 481 gcttttgcct taggccttat tatttgtgcg gtagctcaac aatttgatgt gttcggtaaa 541 atttggggaa ccttaagttt cgtattgtta ccaatttctg gtgcattttt ctttgttcac 601 aaccttccat cacaggctca atctatcgcg ctttggctcc caatgattca tggtacggaa 661 atgtttcgtc atggctactt tggggatacg gtcgttactt atgagagtat cggattttta 721 gtcgtaagtg atttggcctt attgctaatg ggcttggtga tggttaaaaa ctttagtaag 781 gggattgagc cacaa // LOCUS PSEARCDA 1502 bp ds-DNA BCT 17-JUL-1990 DEFINITION P.aeruginosa membrane protein arcD (complete cds) and arcA (5' end) genes. ACCESSION M33223 KEYWORDS membrane protein. SOURCE P.aeruginosa (strain PAO1) DNA, clone pME183. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1502) AUTHORS Luethi,E., Baur,H., Gamper,M., Brunner,F., Villeval,D., Mercenier,A. and Haas,D. TITLE The arc operon for anaerobic arginine catabolism in Pseudomonas aeruginosa contains an additional gene, arcD, encoding a membrane protein JOURNAL Gene 87, 37-43 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Haas, 26-MAR-1990. FEATURES from to/span description pept 24 1472 membrane protein (arcD) pept 1494 > 1502 membrane protein (arcA) binding 8 15 ribosome binding site (put.) BASE COUNT 247 a 507 c 443 g 305 t ORIGIN 1 attcctatag ggagattccg aaaatgtccc aagaatccag ccaaaaactc cgactaggag 61 cgctaacggc tctggttgtc ggctcgatga tcggaggggg catcttctcc ctcccgcaaa 121 acatggcagc cagcgccgat gtcggcgccg tcctgatcgg atgggcgatc accgccgtcg 181 gcatgctcac cctcgctttc gtcttccaga ccctcgccaa ccgcaagccc gaactcgacg 241 gcggcgtgta cgcctacgcc aaggccggct tcggcgacta catgggtttc tcctcggcct 301 ggggctactg gatcagcgcc tggctgggca acgtcggcta cttcgtcctg ttgttcagca 361 ccctcggcta cttcttcccg atcttcggca agggcgacac cgtcgcggcg atcgtctgcg 421 cctcggtcct gctctgggcc ctgcacttcc tggtgctgcg cgggatcaag gaggcggcgt 481 tcatcaacac cgtcaccacc gtggccaagg tcgtgccgct gttcctgttc atcctgatct 541 gcctgttcgc cttcaagctg gacatcttca ccgccgacat ctggggcaag agcaacccgg 601 acctgggcag cgtgatgaac caggtgcgca acatgatgct ggtcaccgtc tgggtgttca 661 tcggcatcga gggcgcgagc atcttctcct cccgcgcgga aaaacgttcc gacgtcggca 721 aggccaccgt gatcggcttc atcaccgtcc tgctcctgct ggtgctggtc aacgtgctgt 781 ccatgggcgt gatgacccag ccggaactgg ccaagctgca gaacccgtcg atggcgctgg 841 tactcgagca tgtggtcggc cactggggcg ccgtgctgat cagcgtcggc ctgctgatct 901 cgctgctggg cgcgctgctc tcctgggtgc tgctgtgcgc cgagatcatg ttcgccgccg 961 ccaaggacca caccatgccg gagttcctgc gccgcgagaa cgccaaccag gtgccggcca 1021 acgccctgtg gctgaccaac atctgcgtac aggtgttcct ggtggtggtg ttcttcacct 1081 cgggcgaccc ggacggcatg gacccgtaca ccaagatgct gctcctggcc acctcgatga 1141 tcctgatccc gtacttctgg tccgccgcct atggcctgct gctgaccctg aagggcgaga 1201 cctacgagaa cgatgcccgc gaacgcagca aggacctggt catcgccggc atcgccgtgg 1261 cctacgcggt ctggctgctc tacgcggggg gcctgaagta cctgctgctg tccgccctgc 1321 tgtatgcgcc cggcgcgatc cttttcgcca aggccaagca cgaggtcggc cagccgatct 1381 tcaccggaat cgaaaaactg atcttcgccg cagtcgtcat tggcgccctg gtggctgcct 1441 acggcctcta cgacggtttc ctcaccctct gattactcca ctggagacgt gaaatgagca 1501 cg // LOCUS TOMCD4A 5429 bp ds-DNA PLN 17-JUL-1990 DEFINITION Tomato ATP-dependent protease (CD4A) gene, complete cds. ACCESSION M32603 KEYWORDS ATP-dependent protease. SOURCE Tomato (L.esculentum) DNA. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 5429) AUTHORS Gottesman,S. and Pichersky,E. TITLE A conserved regulatory subunit for ATP-dependent protease in prokaryotes and eukaryotes JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Pichersky,E., 08-MAR-1990. FEATURES from to/span description pept 787 1357 ATP-dependent protease (CD4A), exon 1 1461 1576 ATP-dependent protease (CD4A), exon 2 1828 1950 ATP-dependent protease (CD4A), exon 3 2034 2219 ATP-dependent protease (CD4A), exon 4 2853 3068 ATP-dependent protease (CD4A), exon 5 3248 3465 ATP-dependent protease (CD4A), exon 6 3768 3843 ATP-dependent protease (CD4A), exon 7 3926 4015 ATP-dependent protease (CD4A), exon 8 4125 5309 ATP-dependent protease (CD4A), exon 9 pre-msg 1 5429 CD4A mRNA and introns IVS 1358 1460 CD4A intron A IVS 1577 1827 CD4A intron B (no splice consensus) IVS 1951 2033 CD4A intron C IVS 2220 2852 CD4A intron D IVS 3069 3247 CD4A intron E IVS 3466 3767 CD4A intron F IVS 3844 3925 CD4A intron G (no splice consensus) IVS 4016 4124 CD4A intron H BASE COUNT 1521 a 972 c 1221 g 1715 t ORIGIN 1 tcgaagttgt aaaggttggt cttttgcttc tacttgtttg gactagtttc tttccaagta 61 atggggttat taagttggta ttgttgtgga ttttgacagg atatgggggt atgatgtagc 121 tctcatgttt atattttcaa ttaagcaatg tagttctggt tgactagttg tggtgctaga 181 attcccaatt aacctcttca ctgaattact cctagtgctg aataaaactg ccttatctca 241 agggtgtggg ttgttgttgt attctagagg gtttgtttat ataatcccat tggaaccaag 301 ttaacctatg caataggaat tatatattca agattctgct agtttagaaa ctaaagtcct 361 tgcagcttgg ctgctttatc ctgtatgaaa tagttaccat cctgtggttg catagttttc 421 caggtcttct tgttttgagg ctttattatt cattttactg atgatataac ttttatgatg 481 aaatttgaat ttagttttaa gcttatgcgc taggtgtatg ttgctcaaca tgcattcgca 541 gagttttttg aagtatggca ttctcataat gatcctattt gtctacatta tgtcatctaa 601 agaaattaac cttcaacggt taatggttta caatggtcca aaaaaagtag ccttcaatgg 661 ttattagtga tggttttgtt gatgtctgct aactatatgt atatgttatg taatatgatc 721 cttggtactt cttccattgc cactaatgat cttttcttat atgctatcat gtcaggttgc 781 aggtccatga tggctagagc tttagttcag tcaacaaaca ttctaccttc agttgctggt 841 gaaagggctg gacaattcaa tggatcccgg aaagatcaaa gaactgttag aatgctatgt 901 aatgtaaaat gttgttcctc aaggctgaac aattttgcag gactgcgagg gtgcaacgca 961 ttagatacac tactagtaaa atctggagaa actctccatt cgaaagtggc agctgcaact 1021 tttgtcagac ggccacgagg gtgccgattt gtcccaaaag caatgtttga gcggttcact 1081 gagaaagcaa taaaagtcat tatgcttgcc caagaagagg ccagacgact aggtcacaac 1141 tttgttggga cagaacagat cctgctgggt cttattggtg agggaactgg tattgctgct 1201 aaggttctta aatccatggg aattaatttg aaagatgctc gtgtggaagt agagaagatc 1261 attggcaggg gtagtggatt tattgcagtg gaaatccctt ttactcctcg tgcaaagcgt 1321 gttctagaac tatctctgga ggaagcccgc caactaggta cccttatcta tcatttcagc 1381 cttttgttgg atggcctaac tgtcatgcga actcaaaatt catttctttg ttgcatctgt 1441 tgattggttt tgtgatgcag ggcataatta tattggctcc gagcacttgt tacttggact 1501 gttgcgcgaa ggtgaaggtg tggctgcccg tgttcttgaa aacttgggtg ctgaccccac 1561 taacattcgc acacaggcaa gtgattttcc tttctcaaac tggtttgaaa tgcttttctt 1621 gaagccaagg gtctaatcag aaacagcctc tctacctcca caagtagggg taaggtctgc 1681 atacatctta ccttcccaga tgccacctgt gggattacac tgggtatgtt gttgttgttg 1741 tttacttctg atgggggtga gggtatttta aattgcttga cttcattgta atcagtttct 1801 cttttttctt gtaaaataaa taaataggtg atacggatgg ttggcgagag tagtgaggct 1861 gttggtgcta gtgttggagg tggaacttct ggcctaaaga tgcctacatt ggaggagtac 1921 ggcaccaatt tgacaaagtt agctgaagag gtacatagaa agataagtcc gtcataatgt 1981 ggcaagtgat tggctctctt caattatttg atatggtcat attatatttt caggggaaac 2041 tagatcctgt agttggaaga caggctcaaa tcgagcgtgt tactcaaatc ttgggtcgcc 2101 gtaccaaaaa caacccttgc cttattggag aaccaggtgt tggaaaaact gctattgccg 2161 aagggctagc tcaaagaatt gctaatggtg atgtccctga aacaatagaa ggaaagaagg 2221 tcacaggggt gtatgataaa ttgcataaca tgttttatgc ttgagctttt tttctcttta 2281 tcaatttcat ttcaactgtg tggcagatcc agaagctgtg ctacattctt agatagaaag 2341 acttcaaatc atcattttta taatgtctta gaatttgatt ttatacagag ggtaattttt 2401 taaattggaa gtaaatattc tgcagacaaa tatatataca caaatattgt tatctgatcc 2461 cccatataac acccccaccc cccaccccaa aagcctctag gccaaaaatt tacgcgattc 2521 taagagtatc caatcaatat ttttggtagg cgtgccaaaa atttacagaa acatttgaat 2581 gtcccactaa cttgtacgaa caggttttca aatttcaacc gccttgagcc actacatatt 2641 cttcattgtc tgatatatgg actcaaggca tatatcaggt ttagttccct cttacttcaa 2701 ttctagtcta actgaagtca tagggaagat gaacctattt atctaaatct ttttatatgc 2761 aactattttg atgaggtata ccctcctttt tcaaaaaatg tgtatatcct tgtcattata 2821 cgagtgtatg acttgctttg ggtgatggtc aggtcataac acttgatatg ggattgcttg 2881 ttgctgggac aaaataccgt ggagagtttg aggaaagact gaagaaacta atggaggaaa 2941 ttaaacaaag tgatgaaata atactgttta ttgatgaagt acacacattg attggagctg 3001 gagctgcaga gggagcaatt gatgctgcaa acatcttgaa acctgccctg gctagaggcg 3061 agctacaggt tcgttatcgt ttcgacctgt agtatgttgt tccttggaat ttcaatgagc 3121 actatgaaat ctaccttaaa tttggtgttt aatcattatc ttggtccgag tacatcttgg 3181 taaaactaca aaagaatttc aaagttggtg ttaattaaga agtattttgc ttggtttcac 3241 ttgtcagtgt attggagcca caaccctgga tgagtacaga aagcacattg agaaagaccc 3301 tgcactagag agaaggttcc agccagttaa agtccctgaa ccttctgtgg atgaaaccat 3361 acagatcttg aaagggcttc gggagaggta tgagattcat cacaagctcc attacactga 3421 tgaggcaata gaggccgctg ccaagctttc tcaccagtac atcaggtatt attattaatc 3481 atatattcat tttccaaagt atcttccgta gtcaagttgt tggtacattt tgactttggg 3541 atttcacggc aaactaattt attcttgtat gatttaagat atatccttct caataataaa 3601 actcttagga taagcggctt ttcttttgta tcagtgtctg aataggctga cgcacggttt 3661 atctcctgca aattgcacat ttatctccac tccaaaacaa tcatcttttt ccgtgattta 3721 tctgaagtaa agcattaata tgtttttgag ttgaaccttt attccagtga ccgctttctg 3781 cctgataaag caattgactt gattgatgaa gctggttccc gtgttcgact tcgtcatgca 3841 caggcataga gtactctttt tgtcttactc tgtcctgcca tttcttccta agggagtcat 3901 gacccgttat attgatacta tctagctccc tgaggaagca agagagctgg aaaaggagct 3961 tcgccagatt acaaaggaga aaaatgaagc tgtccgcggt caagattttg agaaggtact 4021 ggtttcctct cgtcctgtaa aaaaattcct ctggactgat tatttcttag aaaacattga 4081 gattttcagg tgtgcaattt taatcctctt tatgcattgg acaggctggg gagttgcgtg 4141 atagagaaat ggatcttaag gcacaaatct cagccttgat agacaaaaac aaagagaaga 4201 gcaaggcaga atctgaggca ggagatgcag caggtcccat agtgacagag gcagatattc 4261 agcacattgt ctcttcctgg accggcattc ctgtagagaa ggtctcaact gacgaatctg 4321 atcgcctcct aaaaatggaa gaaacacttc atacccgagt cattggccag gatgaagcag 4381 taaaagccat tagtcgcgct atcagacgtg cccgtgttgg cctcaagaat cccaaccgac 4441 ctattgccag tttcatcttt tctggtccaa ctggtgttgg gaagtcagaa ctggcaaagt 4501 ctttagcaac gtactacttt ggttctgaag aagcaatgat tcggcttgat atgagtgagt 4561 ttatggaaag acacactgtg tctaaactca tcggttcacc ccctgggtat gttggttaca 4621 ctgaaggtgg tcaattaact gaagctgtta ggcgtcgacc ttatacagtt gtgctctttg 4681 atgagattga gaaggctcat cctgatgtct tcaacatgat gcttcaaatt cttgaagatg 4741 gaagattgac agacagcaaa ggcagaactg tcgacttcaa gaatacactc ctcatcatga 4801 catcgaatgt tggaagcagt gtgatagaga aaggaggtcg tcgtataggt tttgatctag 4861 attttgacga gaaggatagt agttacaacc gtatcaagag cttggtgact gaggagttga 4921 aacagtactt caggccagag tttttaaaca gattgagtga gatgattgta ttccgtcagc 4981 tcactaagtt ggaggtgaag gagattgccg atatcatgct taaggaggtc tttgtgaggt 5041 tgaagaataa ggagatagaa ctccaagtga cagagaggtt tagagatagg gtagttgatg 5101 aaggatataa cccaagctat ggagctagac cattgaggag agctattatg agactgctag 5161 aggatagtat ggccgagaag atgcttgcag gtgagatcaa agaaggtgat tcagtaattg 5221 tggatgtgga ttcagatggc aacgtcaccg ttctcaatgg cactagtggt gctccctcag 5281 attcagctcc tgagcctatc cttgtgtaga tcaattcttc atttagctct acaaatttgt 5341 tcttcgtaat gttgcttttg gatgtcttgg ctaagcaaca tgtgagtaat tttttgggtg 5401 acaggctgat atattcttcc aagttgact // LOCUS TOMCD4B 5846 bp ds-DNA PLN 17-JUL-1990 DEFINITION Tomato ATP-dependent protease (CD4B) gene, complete cds. ACCESSION M32604 KEYWORDS ATP-dependent protease. SOURCE Tomato (L.esculentum) DNA. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 5846) AUTHORS Gottesman,S. and Pichersky,E. TITLE A conserved regulatory subunit for ATP-dependent protease in prokaryotes and eukaryotes JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Pichersky,E., 08-MAR-1990. FEATURES from to/span description pept 1111 1675 ATP-dependent protease (CD4B), exon 1 1896 2011 ATP-dependent protease (CD4B), exon 2 2133 2255 ATP-dependent protease (CD4B), exon 3 2326 2511 ATP-dependent protease (CD4B), exon 4 3020 3235 ATP-dependent protease (CD4B), exon 5 3445 3662 ATP-dependent protease (CD4B), exon 6 3978 4053 ATP-dependent protease (CD4B), exon 7 4138 4227 ATP-dependent protease (CD4B), exon 8 4354 5535 ATP-dependent protease (CD4B), exon 9 pre-msg 1 5846 CD4B mRNA and introns IVS 1676 1895 CD4B intron A IVS 2012 2132 CD4B intron B (no splice consensus) IVS 2256 2325 CD4B intron C IVS 2512 3019 CD4B intron D IVS 3236 3444 CD4B intron E IVS 3663 3977 CD4B intron F IVS 4054 4137 CD4B intron G (no splice consensus) IVS 4228 4353 CD4B intron H BASE COUNT 1592 a 1067 c 1283 g 1904 t ORIGIN 1 aagcttctgt agcccaaagt tttgtccttt cacttcttgt ctactcctca tactgtctct 61 tccaaattca acagttgcca ctgtcgattt ttcagcagta aatttacctt gctcaatatc 121 aactccccat agcccaccca aacttctctt catttttata ataatgacgt tgcatgtttc 181 atacctcgcc taggaatgtg ttcatcgaat ttccgtcatt gtcacatcat atctgttctt 241 gtcctttact ctttttcctt ttctttcctg tactctcgtt cgtgttgtta ggatttgtac 301 tgaaaccgtt gggttatcag ttttacagac ctttaagcag tctctctagg gtttactaag 361 agctggaaat ttgaataatc tgatcaaaat tgctgaggtt cttttaatcc ttctaaagat 421 ctctgtagca gagtggtgaa aagcggaaca gggggaatac tcggtacaaa tggtgttgtg 481 ttcaattgca aagaccaaac ctcccattct aatgtgttgc aacaaactat taggggagtt 541 ttaaggaaga gaaaatgggg gataatgcta cacaatgcta acctattata atcatctatt 601 tagtttgcaa accccacata tctctgtgtg tattgtccta tttgttcaat ctattcatga 661 aaccttggct actcaatcct ggattgcgca cgtactactc agcagttgcg tagtgcccta 721 gttcttatat cttggagggt agactgaatt tggtttatgt caatgaatca tgtttcacac 781 agatggttac atattttatt acatagaatc tatatgtcat gcatcgactc ctgtcgagta 841 tggatacttc ctacttcttc ttggaactct ctgattgagt tatctttctg ctaatggact 901 gtgcctcact atttttatct tcctctggat tggaaaatat ggacattcac ttgttatcaa 961 tttttgaatt aataatttgt tgatatgttg gctgattgat gtgtcattga cttcatatac 1021 tgttatgggt ttgtcatgtg gaagatttga tattttcgat tatctaagga gagctttttt 1081 gattattttc ttatcctggg caggtccatc atggctagag ctctagttca gtcaacaagc 1141 atcccatcgt cagttgctgg tgaaaggaca accaaattca atggatctgg gaaaacaaaa 1201 agagctgtta caatgctatg caatgcacaa tcatcttcac ttacactgag ggattttaca 1261 ggattgcgag gatgcaacgc aatagataca ctagttagat ctggagaaac tctccaatcc 1321 aaagtagctg ctgcaactta tgtcagacgg ccacgaggtt gccgatttgt accaaaagca 1381 atgtttgagc gcttcacaga gaaggcgata aaagtcatta tgcttgcaca agaagaggcc 1441 agacgacttg gtcacaattt tgttggcact gagcagatct tgttgggtct aattggtgag 1501 ggaactggta ttgctgccaa ggttcttaaa tcgatgggga tcaatttgaa agatgctcgt 1561 gtggaagtgg aaaagataat tggaaggggt agtgggtttg ttgctgttga gatccctttt 1621 acccctcgtg ccaagcgtgt tctggaactc tctctggagg aagcccgcca gctaggtatc 1681 cttttttgat tgtctgtatt tgttgtatta cctttgggac atgtctaaaa gtggttctcc 1741 ggttgagtct tattctcatg ttgctcgttg acacctttgg tttttcccaa aattcattga 1801 ttttctttct ctttgcttgc ctttctggtc ttctacatgc accagcatga ttatttgtct 1861 aaatatttat tccatttata aactggttgg ttcagggcat aactatattg gttcggagca 1921 cttgctactt ggattgctac gtgaaggtga aggtgtggct gcccgtgttc ttgaaaactt 1981 gggtgctgac cccagcaaca tccgcactca ggcaagtagt agatctcaca tgcttcctct 2041 gggtagtcta atattgtgct tttgtgggaa gttggatttt ttggtttcat catgaccagt 2101 gttcattggt tttattcttc ttctaaaaat aggtgatccg gatggttggt gagagtaatg 2161 aggctgttgg tgctagcgtt ggaggtggaa cttctggcca aaaaatgcca acactggagg 2221 agtatggaac aaatttgaca aaattggcgg aagaggtaat ctaaaaaact atatttgtga 2281 tgatatgtga acagttatga ataattgtcc atttttggtt tgtagggaaa attggacccc 2341 gttgttggaa gacagccgca aattgaacgg gtcactcaaa tcttgggtcg gcggactaag 2401 aacaaccctt gtcttattgg tgaaccaggt gttggcaaaa cagctattgc tgagggtttg 2461 gcacaaagaa ttgcaaatgg tgatgtccct gaaacaattg aggggaagaa ggtacgtcat 2521 gtggatagct tgcagaatgt gatgtttacc tgtgcccttt cttctcccat ttgttgtatc 2581 atatggtgaa cccagaagct gagcagggtt atgaggtgca gtctagtggt attttcgatt 2641 cacaaggatt gggttcaaat gaatgtgcat tctatctcca ctaaattgaa atatgtcaaa 2701 ttatattttc agtgtctgtg ttctcttgaa tttttttata tataaattct tccgtgctac 2761 ttgttctctt ggcatatttc tttttagtag attgatttgt cgatctatgc ttataactaa 2821 ttgagaaaaa acgattagta tcccttgttg aacttaattt cttagctcaa actatcaaac 2881 tgaagtgatt tcaagtgaga aaggtccacc acctattctt gtgaatctaa tccatctgaa 2941 actttgtcaa atgtttcgtg ctatgatata gctacctttc tcgttgttcc taagagtcgc 3001 tttttttggt gataattagg tgataactct tgatatggga ttgcttgttg ctggaacaaa 3061 ataccgtgga gagtttgagg aaaggctaaa aaagttgatg gaggaaatta agcagagtga 3121 tgaaataata ctatttatcg atgaagtgca cacattgatt ggagctggag cagcagaggg 3181 ggccatcgat gctgcaaaca tcttgaaacc tgccctagct cgaggtgaac tacaggtaag 3241 ttcctccttc gcttgttctt tcctttctgt tctttctaga aattaaaaat gagaactaag 3301 aatatttctg ttaacaactc aatgttttag aagatatatc attgttttaa catgcatttt 3361 accgcctcat aagaagttaa aatgactgct cagttaaagc aggagaaaaa acttctacat 3421 gtgtatttac ttggcttctt gcagtgtatc ggagctacta cactggatga atacagaaag 3481 catattgaga aagatcctgc actagagagg agattccaac cagttaaggt ccctgaacct 3541 actgttgatg aaactataca gattctgaaa gggcttcgtg agaggtatga aattcatcac 3601 aaacttcgtt acactgacga agacttagtg gctgctgctc agctctcata ccagtacatc 3661 aggtatctaa tctctaccat tttcctatgt cttcttgtac gtttttactc ttttctcgtc 3721 ctgaattgtt gatgcattgg taccttgaag aaatatgttt gctttaagac ggtagacctg 3781 aagttcaact tattttatga tagttaagtg tgataataga agttggttta cttttgttta 3841 tcagcatcta attaggtttt tatgcttatc tctagaaaag attgtcctag attccacgat 3901 atagattttc gtataaagtt tttttttcct tgttctgcta agtgatacat cttttcactt 3961 gcgttttcac atttcagtga ccgatttctg cctgacaaag caattgattt gattgatgaa 4021 gctggttctc gtgttcgact tcgccatgct caggcatgaa atgctctctt agtctctctt 4081 ttaatctgtc attgccctac tatttgttga gtgacaatgc tctgttgttg cctccagctc 4141 cccgaggaag caaaagagct cgagaaagaa cttcgtcaga taacaaagga gaagaatgaa 4201 gctgttcgtg gtcaagattt cgaaaaggta ctgagtttgt catgtacatc taaagaaccc 4261 ctcaaaaatg gttgttatat agcccaacta ttttcttatc aagtattata attaaagaat 4321 tttctaatga catggttttg gattaatgga caggctgggg aattacgtga tagagaaatg 4381 gatctgaagg cacagatcac agccctcata gacaaaaaca aagaggtgag caaggctgaa 4441 agtgaggctg cagatacagg tccacttgtg acagaagcag atattcagca catcgtctct 4501 tcttggactg gtatccctgt tgagaaggtc tccactgacg aatctgaccg tctcctaaaa 4561 atggaagaaa cacttcacac acgaatcatt ggccaggatg aagctgtcaa agccattagt 4621 cgcgctattc gacgtgcacg agttgggctc aagaatccca acagacctat tgctagtttc 4681 atcttttccg gtcccactgg tgttgggaaa tcagaactgg caaaggcact ggctgcatat 4741 tactttggtt ctgaagaagc aatgatccgg cttgatatga gtgagttcat ggaaagacac 4801 actgtctcta aactcattgg atccccccct ggttatgttg gttacacaga gggtggtcaa 4861 ttgactgaag ctgttaggcg tcgaccttat acagttgtgc tctttgatga gattgagaag 4921 gctcatcctg atgtcttcaa catgatgctt caaattcttg aagatggaag attgacagac 4981 agcaagggta gaactgtaga tttcaagaac acacttctca tcatgacatc aaatgttgga 5041 agtagtgtaa ttgagaaagg tggccgtcgt ataggttttg atctcgacct tgatgagaag 5101 gatagcagtt acaatcgcat caagagctta gtgacagaag aattgaagca gtacttcagg 5161 ccagagtttt tgaacagatt ggatgagatg attgtattcc gtcagctcac taagttagag 5221 gttaaggaga tagctgatat catgcttaag gaggtctttg agaggttgaa agttaaggag 5281 atagaacttc aagtgacaga gaggtttaga gacagggtgg ttgacgaggg atacaaccct 5341 agctatggag cacgacctct gagaagagct attatgagac ttttagagga cagcatggct 5401 gagaaaatgc ttgcaaatga gattaaagaa ggtgattcag ttatcgtgga cgttgattca 5461 gatggcaatg tgactgtcct caatggaagt agtggtactc cctcggatcc ggctcctgag 5521 cctatccccg tttagatcag gtgatcacat tttagctcaa cagtctctgt gtggtctttt 5581 aagagttcat ttgtgttggc cactaagctt tgtggggctt atgaagcaac ttttgagttt 5641 atctctggga atgctcaggt aaatcttgcc acgtcgagaa ggcagcatcc tttctatgtt 5701 gatcttaatg attttacaga agtatactgg attaagattg tagttggtca caagaaagta 5761 ttgtattaac aagacactcc ttatcaaaaa catatctagt ttattgtagg cgacagttga 5821 atcgttaatc ctcgtattgt ctatcc // LOCUS PLYORIA 375 bp ds-DNA VRL 17-JUL-1990 DEFINITION Human polyomavirus BK (strain WWT) origin of replication. ACCESSION M34048 KEYWORDS origin of replication. SOURCE Polyomavirus BK (strain WWT) DNA, from human urine. ORGANISM Polyomavirus BK Unclassified. REFERENCE 1 (bases 1 to 375) AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two type of control regions can be found among naturally occurring BK virus strains JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 375 regulatory region site 143 210 P block site 211 249 Q block site 250 302 R block BASE COUNT 123 a 86 c 95 g 71 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt 301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta 361 aactggacaa aggcc // LOCUS PLYORIB 424 bp ds-DNA VRL 17-JUL-1990 DEFINITION Human polyomavirus BK (strain UT) origin of replication. ACCESSION M34049 KEYWORDS origin of replication. SOURCE Human polyomavirus BK (strain UT) DNA, from human urine. ORGANISM Polyomavirus BK Unclassified. REFERENCE 1 (bases 1 to 424) AUTHORS Sundsfjord,A., Johansen,T., Flegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two type of control regions can be found among naturally occurring BK virus strains JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 424 regulatory region site 143 210 P block site 211 249 Q block site 250 261 R block site 263 315 P block site 316 349 Q block site 350 361 R block BASE COUNT 134 a 103 c 105 g 82 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa 301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc 361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa 421 ggcc // LOCUS ECOCLPPA 1236 bp ds-DNA BCT 17-JUL-1990 DEFINITION Escherichia coli ATP-dependent clp protease proteolytic component (clpP) gene, complete cds. ACCESSION J05534 KEYWORDS ATP-dependent protease. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1236) AUTHORS Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J., Bowers,B. and Gottesman,S. TITLE Sequence and structure of ClpP, the proteolytic component of the ATP-dependent Clp protease of E.coli JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Gottesman, 23-APR-1990. FEATURES from to/span description pept 378 1001 ATP-dependent protease (clpP) mRNA 279 > 1236 clpP mRNA pept < 1 133 ORF signal 244 249 -35 region signal 267 272 -10 region signal 165 193 transcription termination signal binding 367 372 ribosome binding site (put.) BASE COUNT 329 a 291 c 318 g 298 t ORIGIN 1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga 61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa 121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc 181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg 241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac 301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt 361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg 421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt 481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta 541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc 601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc 661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt 721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga 781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg 841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat 901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg 961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg 1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt 1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc 1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg 1201 cgcaagctga ttgccggtcc atccgtgtat atctgc // LOCUS LACNISC 1222 bp ds-DNA BCT 17-JUL-1990 DEFINITION L.lactis insertion sequence IS981. ACCESSION M33933 KEYWORDS insertion sequence. SOURCE L.lactis (strain LM0230) DNA, clone pKM40. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1222) AUTHORS Polzin,K.M. and McKay,L.L. TITLE Identification, DNA sequence and distribution of IS981, a new, high copy number insertion sequence in the lactococci related to the IS2/IS3 family of IS-elements JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.M.Polzin 27-APR-1990. University of Minnesota Dept. of Food Science 1334 Eckles Avenue, St. Paul, MN 55108 e-mail: kayla@molbio.cbs.umn.edu FEATURES from to/span description pept 58 318 ORF1 pept 330 1169 ORF2 binding 44 50 ribosomal binding site (put.) rpt 1 40 5' inverted repeat rpt 1183 1222 3' inverted repeat signal 33 38 -10 region signal 269 274 -35 region signal 294 299 -10 region BASE COUNT 442 a 233 c 206 g 341 t ORIGIN 132 bp upstream of PvuII site. 1 gatgttatcc ttaaatctta gagtcactat tgtataattt agacaaagga caaaaacatg 61 caaaaacgct actcaaaaga atttaaagaa acccttatcg ccttctatca ttctggtcaa 121 tccgtcaccc agctgtctaa agaatatgac gtggcccctg caacaattta taaatggata 181 gacctctact ctaaatctaa tgaaagctcc gtctctaaag ctgattttct agaattaaaa 241 agacaactgg ctaaagttaa ggaagaacga gacatcttaa aaaaagtatt gaccatattc 301 gccgagaaaa agaagtgagt gctgcggata tggctcaaac catacaaact ttagcactca 361 atgtcagact aagctgtcaa ctccttgatg ttcctgaatc aagttattat gaacggatta 421 accgacaccc atctaaaact caattaagga gacaatacct gtcactcaaa atttctcaac 481 tcttcaatgc taaccgagga atctatggtg ctcctaaaat tcatcatctt ctacttaaac 541 aaggggaaaa agtcgggtta aaactggtac agaagctaat gaagcaactt caactcaagt 601 ctgtagtcat taagaaattt aagcctggat actcactaag tgatcacatc aatcgaaaaa 661 atctcataca gactgaacct acaaagaaaa ataaggtttg gtcaaccgac attacttata 721 ttcctactca acaaggatgg gcttatctct caaccattat ggatcgttat actaaaaaag 781 tcattgcttg ggatttgggc aagcgaatga ctgtagaatt agtgcaaaga actttaaata 841 aggccattaa atcacaagac tatccagaag ctgttattct tcattctgac caaggaagcc 901 agtatacgag tctagagtat gaagagttgc ttaagtatta tgggatgact cactctttca 961 gtcgaagggg atacccttat cataatgcca gtcttgaatc ttggcatgga catttaaaaa 1021 gagagtgggt gtaccaattt aaatataaga actttgaaga agcctatcag agtattttct 1081 ggtacatcga agccttttat aattcaaaac gaatccatca aagtttaggg tatcttacac 1141 ctaatcaatt tgaaaaggta agtgcttaaa ataaatagat taaaattcta cgtttgttac 1201 tctaaaaact tgacttaacg tc // LOCUS RATSHIIA 3064 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Rat K+ channel protein (KShIIA) mRNA, complete cds. ACCESSION M34052 KEYWORDS K+ channel protein. SOURCE Rat brain, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3064) AUTHORS McCormack,T., Vega-Saenz de Miera,E.V. and Bernardo,R. TITLE Molecular cloning of a member of a novel class of K+ channel genes in mammals JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Bernardo, 07-MAY-1990. FEATURES from to/span description pept 80 1771 K+ channel protein (KShIIa) BASE COUNT 777 a 718 c 729 g 840 t ORIGIN 1 gccacggtga acatctggcc cacgagagct ttaacttggt gctgtgttcg ccttccctag 61 tcatgtctga gccacagaga tgggcaagat cgagaacaac gagagggtga tcctcaatgt 121 cggaggcacc aggcacgaaa cctaccgcag cactctcaag acccttcctg gaactcgcct 181 ggcccttctc gcctcctctg aacctcaggg cgactgcctg actgctgcgg gtgacaagct 241 gcagccgctg ccccctccgc tgtctccacc gccgcgaccg cctcccttgt cccctgtccc 301 cagcggctgc ttcgagggcg gcgcaggcaa ctgcagttcg cacggtggca atggcagcga 361 ccaccctggg ggaggccgcg aattcttctt cgatcgccac ccaggagtct tcgcctatgt 421 gctcaactac taccgcacgg gcaagctgca ctgccccgcc gacgtgtgtg gaccgctctt 481 cgaggaagag ctggcattct ggggcatcga tgagaccgac gtggagccct gctgctggat 541 gacctacagg cagcaccggg acgcggagga ggccctggat atcttcgaga cacccgacct 601 catcggaggc gaccctggtg atgatgagga cctagggggc aagagactgg gcattgagga 661 tgctgcgggg ctgggaggac ccgatggcaa gtctggccgc tggaggaagc tgcagcctcg 721 catgtgggct ctctttgagg acccctattc atccagagcc gctaggttta ttgcttttgc 781 ttctctgttc ttcattttgg tttccatcac aaccttttgc ctggagacac acgaagcttt 841 caatattgtt aaaaacaaga cagagccagt catcaacggc accagcgctg ttctccagta 901 tgaaatcgaa acggatcctg ccttgacata tgtggaagga gtgtgtgtgg tgtggtttac 961 ttttgaattt ttagtccgta ttgttttctc gcccaataaa cttgagttca tcaaaaatct 1021 attgaacatc attgactttg tggccatcct ccccttctac ttagaggtgg gactcagcgg 1081 gctgtcttcc aaagcggcta aagatgtgct cggctttctc agggtggtta ggtttgtgag 1141 gatcctgaga atcttcaagc ttacccgcca tttcgtaggt ctgagagtgc tcggacacac 1201 tcttcgtgcg agcaccaatg aatttttgtt gctgatcatc tttctggctc tgggagtttt 1261 gatattcgct acgatgatct actacgctga gcgagtaggg gctcaaccta atgatccctc 1321 agcgagtgag cacacacagt tcaaaaacat ccccattggt ttctggtggg ctgtggtgac 1381 catgactacc ttaggctatg gggatatgta cccccaaaca tggtcaggga tgttggtggg 1441 ggccttgtgt gctctggctg gagtgctgac catagctatg cctgtgcccg tcattgtcaa 1501 caattttggg atgtactact ccttggcaat ggcgaagcag aaacttccaa gaaaaagaaa 1561 gaagcacatt cctcctgccc ctctggcaag ctcacctaca ttttgcaaga cagaattaaa 1621 catggcttgt aacagtaccc agagtgacac atgtctgggc aaagaaaacc ggcttctgga 1681 acataacaga tcagtgttat caggtgacga cagtacagga agtgagccgc cattatcacc 1741 ttccggaaag gctccccatc agacgctcta gtaccagaga caaaaacaga agaggggaaa 1801 catgtttcct gttgacgaca ggtgattaca cgtgcgcttc tgatggagga atcaggaaag 1861 ataactgcaa agatgttgtc attactggtt acacgcaagc cgaggccaga tctcttactt 1921 aatgacttgg gaaaggcaca aaacatgaaa gaaagtgttg tacagaattt atcatggatt 1981 tttgcctgct gaaaatggga cattgagatt tagccattca aggattgtac tggaaacttc 2041 tgctactgaa tgtgaccgga tgtgaccggt ttgtgttcag aagagttccc gcattctctg 2101 aggcatttaa agcttgtaaa gaaatggtgg ctggaactga aatgggcttt gccaaggaaa 2161 tattctgctt gtcgacagac cagagtccac aaaacaactg taaataccaa cgtgtgcatg 2221 ggtccaccct tctcacagtc tcatagcaca gggagccaca ttcattgtca gcatctgtga 2281 agcttcaaga aatgcccaca aggctctgaa tggctcctct cgggctcatg gtggttttgc 2341 tgcaaattac agagggcagt gaatccaatg tagtataggt ctgttttcat ttcttttatt 2401 tgtacattac gtgttgtatc tacaaacacc gattgcctca ttttcttgca aaccacttat 2461 atatccgctt gtgaactatg ttgtaaactt ttagtgtcta catagaaatc taaccatttc 2521 ttagtcatgt attcatttag ttctggaacg ctgttactat ctagaagaca gtcatcacca 2581 gtattcatcg tataagtctt ttgtcattgt tacctcaatt ataaatatta aaaatataaa 2641 ttctggcaat gagaatattt ttttattaaa tgatcaagga aaatgtcagt atatagtaga 2701 atattatcaa attatatcct aaaatgtcta ttttgcataa aagagatatt cttcaatcga 2761 ttcctttttt tcgtgagctt tgtggctaat gaagcgtgtg tttgtcttca tcactgctgt 2821 agttgaaact gtgtaagagt ttttcatctt gcttaatcaa tatttccaga atctcttagt 2881 tcccctggga ttctgaatat actctatagc ctaattataa acccctgtat cgtgtacctt 2941 ttgtgaccat ctcaaggtgc atgcctaatc gttgtgataa gccaatggac atgttactaa 3001 ctgaaatgaa gaataaaagt caaacgatct ggggataaac ttgaatccta tctgattaaa 3061 tgat // LOCUS HUMROS1 7375 bp ss-mRNA PRI 17-JUL-1990 DEFINITION Human transmembrane tyrosine-specific protein kinase (ROS1) mRNA, complete cds. ACCESSION M34353 KEYWORDS oncogene; transmembrane tyrosine-specific protein kinase. SOURCE Human glioblastoma cell line SW-1088, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 7375) AUTHORS Birchmeier,C., O'Neil,K., Riggs,M. and Wigler,M. TITLE Characterization of ROS1 cDNA from a human glioblastoma cell line JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4799-4803 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Wigler, 15-MAY-1990. FEATURES from to/span description pept 207 7250 transmembrane tyrosine-specific protein kinase precursor /hgml_locus_uid="LN0124W" /nomgen="ROS1" /map="6q21-q22" sigp 207 314 transmembrane tyrosine-specific protein kinase signal peptide matp 208 7247 transmembrane tyrosine-specific protein kinase BASE COUNT 2203 a 1496 c 1605 g 2071 t ORIGIN 1 ccgcattcaa gctttcaagc attcaaaggt ctaaatgaaa aaggctaagt attatttcaa 61 aaggcaagta tatcctaata tagcaaaaca aacaaagcaa aatccatcag ctactcctcc 121 aattgaagtg atgaagccca aataattcat atagcaaaat ggagaaaatt agaccggcca 181 tctaaaaatc tgccattggt gaagtgatga agaacattta ctgtcttatt ccgaagcttg 241 tcaattttgc aactcttggc tgcctatgga tttctgtggt gcagtgtaca gttttaaata 301 gctgcctaaa gtcgtgtgta actaatctgg gccagcagct tgaccttggc acaccacata 361 atctgagtga accgtgtatc caaggatgtc acttttggaa ctctgtagat cagaaaaact 421 gtgctttaaa gtgtcgggag tcgtgtgagg ttggctgtag cagcgcggaa ggtgcatatg 481 aagaggaagt actggaaaat gcagacctac caactgctcc ctttgcttct tccattggaa 541 gccacaatat gacattacga tggaaatctg caaacttctc tggagtaaaa tacatcattc 601 agtggaaata tgcacaactt ctgggaagct ggacttatac taagactgtg tccagaccgt 661 cctatgtggt caagcccctg caccccttca ctgagtacat tttccgagtg gtttggatct 721 tcacagcgca gctgcagctc tactcccctc caagtcccag ttacaggact catcctcatg 781 gagttcctga aactgcacct ttgattagga atattgagag ctcaagtccc gacactgtgg 841 aagtcagctg ggatccacct caattcccag gtggacctat tttgggttat aacttaaggc 901 tgatcagcaa aaatcaaaaa ttagatgcag ggacacagag aaccagtttc cagttttact 961 ccactttacc aaatactatc tacaggtttt ctattgcagc agtaaatgaa gttggtgagg 1021 gtccagaagc agaatctagt attaccactt catcttcagc agttcaacaa gaggaacagt 1081 ggctcttttt atccagaaaa acttctctaa gaaagagatc tttaaaacat ttagtagatg 1141 aagcacattg ccttcggttg gatgctatat accataatat tacaggaata tctgttgatg 1201 tccaccagca aattgtttat ttctctgaag gaactctcat atgggcgaag aaggctgcca 1261 acatgtctga tgtatctgac ctgagaattt tttacagagg ttcaggatta atttcttcta 1321 tctccataga ttggctttat caaagaatgt atttcatcat ggatgaactg gtatgtgtct 1381 gtgatttaga gaactgctca aacatcgagg aaattactcc accctctatt agtgcacctc 1441 aaaaaattgt ggctgattca tacaatgggt atgtctttta cctcctgaga gatggcattt 1501 atagagcaga ccttcctgta ccatctggcc ggtgtgcaga agctgtgcgt attgtggaga 1561 gttgcacgtt aaaggacttt gcaatcaagc cacaagccaa gcgaatcatt tacttcaatg 1621 acactgccca agtcttcatg tcaacatttc tggatggctc tgcttcccat ctcatcctac 1681 ctcgcatccc ctttgctgat gtgaaaagtt ttgcttgtga aaacaatgac tttcttgtca 1741 cagatggcaa ggtcattttc caacaggatg ctttgtcttt taatgaattc atcgtgggat 1801 gtgacctgag tcacatagaa gaatttgggt ttggtaactt ggtcatcttt ggctcatcct 1861 cccagctgca ccctctgcca ggccgcccgc aggagctttc ggtgctgttt ggctctcacc 1921 aggctcttgt tcaatggaag cctcctgccc ttgccatagg agccaatgtc atcctgatca 1981 gtgatattat tgaactcttt gaattaggcc cttctgcctg gcagaactgg acctatgagg 2041 tgaaagtatc cacccaagac cctcctgaag tcactcatat tttcttgaac ataagtggaa 2101 ccatgctgaa tgtacctgag ctgcagagtg ctatgaaata caaggtttct gtgagagcaa 2161 gttctccaaa gaggccaggc ccctggtcag agccctcagt gggtactacc ctggtgccag 2221 ctagtgaacc accatttatc atggctgtga aagaagatgg gctttggagt aaaccattaa 2281 atagctttgg cccaggagag ttcttatcct ctgatatagg aaatgtgtca gacatggatt 2341 ggtataacaa cagcctctac tacagtgaca cgaaaggcga cgtttttgtg tggctgctga 2401 atgggacgga tatctcagag aattatcacc tacccagcat tgcaggagca ggggctttag 2461 cttttgagtg gctgggtcac tttctctact gggctggaaa gacatatgtg atacaaaggc 2521 agtctgtgtt gacgggacac acagacattg ttacccacgt gaagctattg gtgaatgaca 2581 tggtggtgga ttcagttggt ggatatctct actggaccac actctattca gtggaaagca 2641 ccagactaaa tggggaaagt tcccttgtac tacagacaca gccttggttt tctgggaaaa 2701 aggtaattgc tctaacttta gacctcagtg atgggctcct gtattggttg gttcaagaca 2761 gtcaatgtat tcacctgtac acagctgttc ttcggggaca gagcactggg gataccacca 2821 tcacagaatt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactata 2881 gtggtcggct gttctggatc aatggcttta ggattatcac aactcaagaa ataggtcaga 2941 aaaccagtgt ctctgttttg gaaccagcca gatttaatca gttcacaatt attcagacat 3001 cccttaagcc cctgccaggg aacttttcct ttacccctaa ggttattcca gattctgttc 3061 aagagtcttc atttaggatt gaaggaaatg cttcaagttt tcaaatcctg tggaatggtc 3121 cccctgcggt agactggggt gtagttttct acagtgtaga atttagtgct cattctaagt 3181 tcttggctag tgaacaacac tctttacctg tatttactgt ggaaggactg gaaccttatg 3241 ccttatttaa tctttctgtc actccttata cctactgggg aaagggcccc aaaacatctc 3301 tgtcacttcg agcacctgaa acagttccat cagcaccaga gaaccccaga atatttatat 3361 taccaagtgg aaaatgctgc aacaagaatg aagttgtggt ggaatttagg tggaacaaac 3421 ctaagcatga aaatggggtg ttaacaaaat ttgaaatttt ctacaatata tccaatcaaa 3481 gtattacaaa caaaacatgt gaagactgga ttgctgtcaa tgtcactccc tcagtgatgt 3541 cttttcaact tgaaggcatg agtcccagat gctttattgc cttccaggtt agggccttta 3601 catctaaggg gccaggacca tatgctgacg ttgtaaagtc tacaacatca gaaatcaacc 3661 catttcctca cctcataact cttcttggta acaagatagt ttttttagat atggatcaaa 3721 atcaagttgt gtggacgttt tcagcagaaa gagttatcag tgccgtttgc tacacagctg 3781 ataatgagat gggatattat gctgaagggg actcactctt tcttctgcac ttgcacaatc 3841 gctctagctc tgagcttttc caagattcac tggtttttga tatcacagtt attacaattg 3901 actggatttc aaggcacctc tactttgcac tgaaagaatc acaaaatgga atgcaagtat 3961 ttgatgttga tcttgaacac aaggtgaaat atcccagaga ggtgaagatt cacaatagga 4021 attcaacaat aatttctttt tctgtatatc ctcttttaag tcgcttgtat tggacagaag 4081 tttccaattt tggctaccag atgttctact acagtattat cagtcacacc ttgcaccgaa 4141 ttctgcaacc cacagctaca aaccaacaaa acaaaaggaa tcaatgttct tgtaatgtga 4201 ctgaatttga gttaagtgga gcaatggcta ttgatacctc taacctagag aaaccattga 4261 tatactttgc caaagcacaa gagatctggg caatggatct ggaaggctgt cagtgttgga 4321 gagttatcac agtacctgct atgctcgcag gaaaaaccct tgttagctta actgtggatg 4381 gagatcttat atactggatc atcacagcaa aggacagcac acagatttat caggcaaaga 4441 aaggaaatgg ggccatcgtt tcccaggtga aggccctaag gagtaggcat atcttggctt 4501 acagttcagt tatgcagcct tttccagata aagcgtttct gtctctagct tcagacactg 4561 tggaaccaac tatacttaat gccactaaca ctagcctcac aatcagatta cctctggcca 4621 agacaaacct cacatggtat ggcatcacca gccctactcc aacatacctg gtttattatg 4681 cagaagttaa tgacaggaaa aacagctctg acttgaaata tagaattctg gaatttcagg 4741 acagtatagc tcttattgaa gatttacaac cattttcaac atacatgata cagatagctg 4801 taaaaaatta ttattcagat cctttggaac atttaccacc aggaaaagag atttggggaa 4861 aaactaaaaa tggagtacca gaggcagtgc agctcattaa tacaactgtg cggtcagaca 4921 ccagcctcat tatatcttgg agagaatctc acaagccaaa tggacctaaa gaatcagtcc 4981 gttatcagtt ggcaatctca cacctggccc taattcctga aactcctcta agacaaagtg 5041 aatttccaaa tggaaggctc actctccttg ttactagact gtctggtgga aatatttatg 5101 tgttaaaggt tcttgcctgc cactctgagg aaatgtggtg tacagagagt catcctgtca 5161 ctgtggaaat gtttaacaca ccagagaaac cttattcctt ggttccagag aacactagtt 5221 tgcaatttaa ttggaaggct ccattgaatg ttaacctcat cagattttgg gttgagctac 5281 agaagtggaa atacaatgag ttttaccatg ttaaaacttc atgcagccaa ggtcctgctt 5341 atgtctgtaa tatcacaaat ctacaacctt atacttcata taatgtcaga gtagtggtgg 5401 tttataagac gggagaaaat agcacctcac ttccagaaag ctttaagaca aaagctggag 5461 tcccaaataa accaggcatt cccaaattac tagaagggag taaaaattca atacagtggg 5521 agaaagctga agataatgga tgtagaatta catactatat ccttgagata agaaagagca 5581 cttcaaataa tttacagaac cagaatttaa ggtggaagat gacatttaat ggatcctgca 5641 gtagtgtttg cacatggaag tccaaaaacc tgaaaggaat atttcagttc agagtagtag 5701 ctgcaaataa tctagggttt ggtgaatata gtggaatcag tgagaatatt atattagttg 5761 gagatgattt ttggatacca gaaacaagtt tcatacttac tattatagtt ggaatatttc 5821 tggttgttac aatcccactg acctttgtct ggcatagaag attaaagaat caaaaaagtg 5881 ccaaggaagg ggtgacagtg cttataaacg aagacaaaga gttggctgag ctgcgaggtc 5941 tggcagccgg agtaggcctg gctaatgcct gctatgcaat acatactctt ccaacccaag 6001 aggagattga aaatcttcct gccttccctc gggaaaaact gactctgcgt ctcttgctgg 6061 gaagtggagc ctttggagaa gtgtatgaag gaacagcagt ggacatctta ggagttggaa 6121 gtggagaaat caaagtagca gtgaagactt tgaagaaggg ttccacagac caggagaaga 6181 ttgaattcct gaaggaggca catctgatga gcaaatttaa tcatcccaac attctgaagc 6241 agcttggagt ttgtctgctg aatgaacccc aatacattat cctggaactg atggagggag 6301 gagaccttct tacttatttg cgtaaagccc ggatggcaac gttttatggt cctttactca 6361 ccttggttga ccttgtagac ctgtgtgtag atatttcaaa aggctgtgtc tacttggaac 6421 ggatgcattt cattcacagg gatctggcag ctcgaaattg ccttgtttcc gtgaaagact 6481 ataccagtcc acggatagtg aagattggag actttggact cgccagagac atctataaaa 6541 atgattacta tagaaagaga ggggaaggcc tgctcccagt tcggtggatg gctccagaaa 6601 gtttgatgga tggaatcttc actactcaat ctgatgtatg gtcttttgga attctgattt 6661 gggagatttt aactcttggt catcagcctt atccagctca ttccaacctt gatgtgttaa 6721 actatgtgca aacaggaggg agactggagc caccaagaaa ttgtcctgat gatctgtgga 6781 atttaatgac ccagtgctgg gctcaagaac ccgaccaaag acctactttt catagaattc 6841 agaaccaact tcagttattc agaaattttt tcttaaatag catttatcag tgcagagatg 6901 aagcaaacaa cagtggagtc ataaatgaaa gctttgaagg tgaagatggc gatgtgattt 6961 gtttgaattc agatgacatt atgccagttg ttttaatgga aacgaagaac cgagaagggt 7021 taaactatat ggtacttgct acagaatgtg gccaaggtga agaaaagtct gagggtcctc 7081 taggctccca ggaatctgaa tcttgtggtc tgaggaaaga agagaaggaa ccacatgcag 7141 acaaagattt ctgccaagaa aaacaagtgg cttactgccc ttctggcaag cctgaaggcc 7201 tgaactatgc ctgtctcact cacagtggat atggagatgg gtctgattaa tagcgttgtt 7261 tgggaaatag agagttgaga taaacactct cattcagtag ttactgaaag aaaactctgc 7321 tagaatgata aatgtcatgg tggtctataa ctccaaataa acaatgcaac gttcc // LOCUS MBOPIVML 969 bp ds-DNA BCT 17-JUL-1990 DEFINITION M.lacunata pilin gene inverting protein (PivML) gene, complete cds. ACCESSION M34367 KEYWORDS pilin gene inverting protein. SOURCE M.lacunata (ATCC 17956) DNA, clone pMxL1. ORGANISM Moraxella lacunata Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 969) AUTHORS Marrs,C.F., Rozsa,F.W., Hackel,M., Stevens,S.P. and Glasgow,A.C. TITLE Identification, cloning and sequencing of Piv, a new gene involved in inverting the pilin gene of M.lacunata JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.F.Marrs, 17-MAY-1990. FEATURES from to/span description pept 1 969 pilin gene inverting protein (PivML) BASE COUNT 318 a 236 c 165 g 250 t ORIGIN 1 atgtctaaaa cttacattgg gattgatatc gccaaaaaca catttgatgc ctgctttatt 61 acccataaca cttggcaaaa ctgtacattc acaaataacc aacagggttt tatagagcta 121 acattgtgga ttcaagcaca tcattataac acatctacct tacatttgat tattgaggca 181 acgggggctt attgggaaaa attagctcat tgggcaatct ctcatcatca taaggtaagt 241 attgtcaacc ccttatatat tcatgcatac gccaaaagcc taggcatacg caccaagaca 301 gataagcagg atgccattct cttggcacgc tatggtgcaa aagaaaatcc acctttatgg 361 cagcctaaat cagacaacga aataaaactg actgcccttc taaaacaacg tgaacaccac 421 aaacggcaac tcatcaaaga acgcactcga caggaagctc tgtctattta tgtgaaatcg 481 tatactgatg acaacattcg tcattggagt gacagcatca cacagctcga tcatcaaatt 541 tggcaactca tcaattgcac gcccgagcta aattaccgtg ccagcctact tgccacaata 601 cctggaattg gcaaaaagac actgccacat ctactggttg tcattggaga tggttcatca 661 tttcaaagtg ccaaacatct agcaagctat gctgggcttg cccctcgtca tcatcaatca 721 ggcattagta ttcacaagca gtcttccatt ggatttagtg gacaaaaaga actacgttca 781 gcccttttta tgcccgctgt catagtcagc tttggtcgat atcccgcttt tcaaaagttc 841 gtcaaacgca tggaacaaaa aggcaaaacc aaaaaacaaa ttattatcgc catcatgcgc 901 aaattactca ccatcagcta tgccgttatt cgccaaaatc gcccttttga caaacgcatt 961 catgaataa // LOCUS STYMKAD 915 bp ds-DNA BCT 17-JUL-1990 DEFINITION Plasmid pEX102 mouse killing agent D (mkaD) gene, complete cds. ACCESSION M34355 KEYWORDS mouse killing agent D. SOURCE Plasmid pEX102 (from Salmonella typhimurium, strain TML R66) DNA. ORGANISM Plasmid pEX102 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 915) AUTHORS Taira,S. and Rhen,M. TITLE Nucleotide sequence of mkaD, a virulence associated gene of Salmonella typhimurium containing variable and constant region JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.P.Taira, 16-MAY-1990. FEATURES from to/span description pept 132 857 mouse killing agent D (mkaD) BASE COUNT 273 a 181 c 213 g 248 t ORIGIN 1 ggaccgaaat atcatttttt cgcctgataa aaaatgaggt tttctggata actaatcgtt 61 ttattaaaaa aaaactgaga atttatatct aataatatgg cgatatatcc atatcgcaaa 121 ggagatttcc catgcccata aataggccta atctaaatct aaacatccct cctttgaata 181 ttgtagctgc ttatgatggg gcggaaatac catctacaaa taagcacctg aaaaataatt 241 tcaactcctt gcacaaccaa atgcggaaga tgccggtatc ccactttaaa gaggcgctgg 301 atgtgcctga ctattcaggg atgcgccaga gtggtttctt tgctatgagc caaggttttc 361 agctgaataa ccatggttac gatgttttca tccatgctcg tcgagaatca cctcagtctc 421 agggcaaatt tgccggtgac aagttccaca tcagtgtgct cagggatatg gtgccacaag 481 catttcaagc gctgtccgga ttgctgtttt cagaggacag tccggtagat aagtggaaag 541 tgaccgatat ggagaaggtc gttcaacaag cccgtgttag cctgggcgct cagttcacgt 601 tgtatataaa accagaccag gaaaattcgc agtacagtgc gtcgtttctc cacaagacac 661 ggcaatttat agagtgtctg gaatccagac tatccgaaaa tggggttatt tcaggacagt 721 gtcctgagtc agacgttcat cctgaaaatt ggaaatatct cagttatcgt aatgaactac 781 gaagtgggcg tgatggtggc gaaatgcaga gacaggcttt acgtgaggaa ccgttttatc 841 gtttgatgac agagtaagta tgggtttggg gagcaacgga acagtaaacg ccgttaaaca 901 gctattttaa atgct // LOCUS TOBEXT 3994 bp ds-DNA PLN 17-JUL-1990 DEFINITION N.plumbaginifolia extensin (ext) gene, complete cds. ACCESSION M34371 KEYWORDS cell wall protein; extensin. SOURCE N.plumbaginifolia leaf DNA. ORGANISM Nicotiana plumbaginifolia Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 3994) AUTHORS De Loose,M.R.R., Gheyson,G., Tire,C., Gielen,J., Villaroel,R., Genetello,C., Van Montagu,M., Depicker,A. and Inze,D. TITLE The extensin signal peptide allows secretion of heterologous proteins from protoplasts JOURNAL Unpublished (1990) STANDARD unannotated staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.R.De Loose, 18-MAY-1990. Author address: M.R.R.De Loose Laboratorium voor Genetika, Wetenschappen, K.L.Ledeganckstraat, Gent, Oost-Vlaanderen, BELGIUM FEATURES from to/span description pept 1590 2840 extensin precursor sigp 1590 1652 extensin signal peptide (could end at 1667) matp 1653 2837 extensin (could start at 1668) mRNA 1560 3342 extensin mRNA rpt 1740 1824 tandem 85 bp repeats rpt 1740 1758 19 bp repeats signal 1532 1537 TATA box signal 3323 3329 poly-A signal BASE COUNT 1264 a 992 c 550 g 1188 t ORIGIN 1 cgcccgggga tcctcctgga tgccattttt cctttgcttt tggatacctt ttcgcaactt 61 tagtccattg cttgaacatg gttttttcta cctcttgtct gttcccatga tgataaacta 121 tgatagctaa cattttcagt aatattggat tgaattagat atactttgaa ttattgagct 181 acaaaattcg ttgaatattt tgccctatcc cccacccaaa agaaaataat tcataccttt 241 atttggtaat aaaagtgggt cacatgcaca gtttttctcg tcttctctag attaaactct 301 ttggaaatga cctactgaaa atactacaca taaaattccc cactcccctc caagttgggg 361 gagggtgggg gttggtgggt tttttttccc cttgaatcca ttcagccggc ttatgcctcc 421 cgaggtggtg ggaggtttaa tttgaaccct taccctatag ttaatatcga aatagataat 481 acaaaggagg ggaacataaa accaaaacct ccgacaaaac atcaaagtgt aggagttcat 541 gatatacgct tgattgaaag tgcacaatgg tctttgcata ctatcaaagt atcaattggt 601 tcttgaatta tatctcttaa tataatgtat tgtgtttaat ttctctacta tcccctattt 661 tataagacta aaagatcctg acatgcttct tgaacacatg tgaaggttag ttaactggtc 721 agaagtacac aagaattaat ttgtacacct atgtttctca ccagaatgcc cataaattac 781 atcaattctt caacatcaag agatcgaaaa acttaacaat tgttctaata gttgaactga 841 gtcgtccgat ccatatccga ttcttcacta gaagcattaa tcatatagag caatttaact 901 taatttacta tattggtaat catttacata gttaagttat aaaaccttgg agcgacacaa 961 ggatttacac taatcatgat tgaatattaa ctttactcaa tttatcaatg tctgatacaa 1021 gtgaattaat ttctctcttt gtgatttcgg tagaaaatta atgtcaagtt tcaaagtttt 1081 tctttttgaa attagtcata catgtgaata gaacattaat ttaagttaaa gaatcatata 1141 aaaaagttag ccgctttgat ttctgtaact gaaatcgtgc agagatgagg catccacact 1201 tgtttttcaa agcttcagta ctatttaaaa acaaagacag taaaaaggag actttttctt 1261 tgacattatt gcatcagaaa tagtatagct gccataatag tttattcctt tgcttagctt 1321 gcagcctcta tcaaacaaaa aaatacacca gctcaagtca atttgagccg acaacatgac 1381 aaaaccaaat caaatatgca tttttttttt tttttttttt tttttttttt tttttttttt 1441 tttacacttt ggtaggtgtt aagtaatcta gtgagacttt taccttcatt tatgaaaatc 1501 ttgaaaaggg taattgtcta attgaaagct atataaaggg gtcgaagtga agcttaagag 1561 gacaacaact tttctcattt gtttcaaaga tgggaaaaat ggcttctcta tttgccacat 1621 ttttagtggt tttagtgtca cttagcttag cttctgaaag ctcagcaaat tatcaatact 1681 catctccacc accacctaag aagccatacc acccttcacc aacaccatat tatcctgcac 1741 cagtttataa atctccacca ccaccaattc cggtttataa gtcaccacca ccacccaaga 1801 agccatacta ccctccacac accccagttt acaaatctcc accaccacca actccagttt 1861 acaagtcgcc accaccaccc aagaagccac actaccctcc acacacaccc gtttacaagt 1921 ctccaccacc acctactcca gtttacaagt caccaccatc acccaagaag ccacactatc 1981 ctccacacac tccggtttac aagtcgccac caccaccaac tcccgtttac aagtcgccac 2041 caccacccaa gaagccacac tatccaccac acaccccggt ttacaagtcg ccaccaccac 2101 caactcccgt ttacaagtcg ccaccaccac caaagaagcc acactatcct ccacacacac 2161 cagtttacaa gtcaccacca ccacccaaga agccatacta tccaccacac accccagttt 2221 acaagtcgcc accaccacca actccagttt acaagtcacc accaccacct aagaagccat 2281 accacccttc accaacaccg tatcatcctt caccagttta caagtcccct ccaccaccaa 2341 ctccagttta caagtcacca ccaccaccta agaagccata ccacccttca ccaacaccgt 2401 atcatccatc accagtttac aagtcccctc caccaccaac tccagtttac aagtcaccac 2461 caccacctaa gaagccatac cacccttcac caacaccgta tcatccatca ccagtttaca 2521 agtcccctcc accaccaact ccagtttaca agtcaccacc accacctaag aagccatacc 2581 acccttcacc aacaccgtat catcctgcac cagtttacaa gtccccacca ccaccaactc 2641 cagtttacaa gtcaccaccg ccaccagtga agccatacca tccttcacca acaccatacc 2701 atcctgcacc agtttataag tctccaccac caccaactcc tgtttacaag tcacctccac 2761 caccaactcc agtctacaag tctccaccac cacaccaccc ctatgtttac gcttctcctc 2821 ctcctcccta ccattactaa gaagtgacat cacaaagctg aaggtaatta aggcaaatta 2881 attaaaacat tttcaaaatg atatgtcagg aatgcaaaga aaattgcata actttgttta 2941 tgcaaaacat taatataata taaagaactg cgctctctgc ttattaacta tttttttagc 3001 ttcatttatt agttactagc tgttcagaat tttcagtatc ttttgatatt acaaagaacc 3061 taatcagaca atgtatattc ttatgcagga aaagcagaat gttgagctaa aagaaaggct 3121 ttttccattt tcgagaggca acgagaagaa aagaagaaga aaaaatagag taaataataa 3181 gccccacggg aggcgaagtt cttttgtagc ttcatgttgt ctaagctatt gatattgttt 3241 gtaccctata ttttatttct ctgtctttgt gtatgttttg ttcaatttcg atctccttgc 3301 aaaatgcaga gattatgagc tgaataaagt aagttaatat tattatacgt gttaattttc 3361 tactcctctc tctagccagc cttttgtttt ctctttttct tatttcattt tctttatatc 3421 aaaccatttt aggagagggc caaggagtga tccagtaaaa cacgaagatt agaagaaact 3481 tcccttcttt ttttttatct gaaaaaaatt taaagtcgag atttatctct ttttgtaatg 3541 gaatcatttc cacagttatg actattttca aattgatatc gccagaccta acaaattaaa 3601 ttactccttc cgctcatctg gccgtacact tctgctccat ctggccgtac agttctgctc 3661 catctggcag tgtacggaga atggataact taatattcag tatgaatttg gacataaaat 3721 cttctaagat tgctagtgaa ttttgtgagc gttattgtgt taaaacaaaa gacagaaaat 3781 tattagaaga attaaagaaa tatgaaaata gtatagaatc agaaatattt cgagtccaca 3841 agttttttgt gtgttcttaa ggaatgtttt aacccctcac acgttgccaa ggtaatggat 3901 taaatcctcc caagataaaa cggaataaac cttcctgcaa cagtggcaat acaaactgca 3961 ggtcgactct agggatcccc gggcgagcga attc // LOCUS MUSIGHABN 336 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34574 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 336) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 336 Ig heavy-chain V-D-J region (AA at 1) recomb 276 277 V-region DNA end/D-region DNA start recomb 285 286 D-region DNA end/J-region DNA start BASE COUNT 86 a 76 c 95 g 79 t ORIGIN Chromosome 12. 1 actggaggag gcttggtgca acctgggggg tcacggggac tctcttgtga aggctcaggg 61 ttcactttta gtggcttctg gatgagctgg gttcgacaga cacctgggaa gaccctggag 121 tggattggag acattaattc tgatggcagt gcaataaact acgcaccatc cataaaggat 181 cgattcacta tcttcagaga caatgacaag agcaccctgt acctgcagat gagcaatgtg 241 cgatcggagg acacagccac gtatttctgt atgagatata gtaactactg gtacttcgat 301 gtctggggca caaggaccac ggtcaccgtc tcctca // LOCUS MUSIGHABO 357 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34575 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 357 Ig heavy-chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 306 307 D-region DNA end/J-region DNA start BASE COUNT 89 a 77 c 106 g 85 t ORIGIN Chromosome 12. 1 gaagtgcagc tgttggagac tggaggaggc ttggtgcaac ctggggggtc acggggactc 61 tcttgtgaag gctcagggtt cacttttagt ggcttctgga tgagctgggt tcgacagaca 121 cctgggaaga ccctggagtg gattggagac attaattctg atggcagtgc aataaactac 181 gcagcatcca taaaggatcg attcactatc ttcagagaca atgacaagag caccctgtac 241 ctgcagatga gcaatgtgcg atcggaggac acagccacgt atttctgtat gagatatgat 301 ggttactact ggtacttcga tgtctggggc acaaggacca cggtcaccgt ctcctca // LOCUS MUSIGHABP 354 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34576 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH34. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 303 304 D-region DNA end/J-region DNA start BASE COUNT 88 a 78 c 104 g 83 t 1 others ORIGIN Chromosome 12. 1 gaantgcagc tgttggagac tggaggaggc ttggtgcaac ctggggggtc acggggactc 61 tcttgtgaag gctcagggtt cacttttagt ggcttctgga tgagctgggt tcgacagaca 121 cctgggaaga ccctggagtg gattggagac attaattctg atggcagtgc aataaactac 181 gcaccatcca taaaggatcg attcactatc ttcagagaca atgacaagag caccctgtac 241 ctgcagatga gcaatgtgcg atcggaggac acagccacgt atttctgtat gagatatggt 301 aactactggt acttcgatgt ctggggcaca gggaccacgg tcaccgtctc ctca // LOCUS MUSIGHABQ 366 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34577 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 366) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 366 Ig heavy chain V-D-J-region (AA at 1) recomb 300 301 V-region DNA end/D-region DNA start recomb 318 319 D-region DNA end/J-region DNA start BASE COUNT 89 a 84 c 97 g 96 t ORIGIN Chromosome 12. 1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagtctc 61 tcctgtgcag cttctggatt caccttcact gattactaca tgagctgggt ccgccagcct 121 ccagggaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca 181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc 241 ctctatcttc aaatgaatgc cctgagagct gaggacagtg ccacttatta ctgtgcaaga 301 tattactacg gtagtagcta tgctatggac tactggggtc aaggaacctc agtcaccgtc 361 tcctca // LOCUS MUSIGHABR 360 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34578 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 360 Ig heavy chain V-D-J-region (AA at 1) recomb 300 301 V-region DNA end/D-region DNA start recomb 315 316 D-region DNA end/J-region DNA start BASE COUNT 85 a 87 c 93 g 95 t ORIGIN Chromosome 12. 1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagtctc 61 tcctgtgcag cttctggatt caccttcact gattactaca tgagctgggt ccgccagcct 121 ccagggaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca 181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc 241 ctctatcttc aaatgaatgc cctgagagct gaggacagtg ccacttatta ctgtgcaaga 301 tccctctatg atggttactt tgactactgg ggccaaggca ccactctcac agtctcctca // LOCUS MUSIGHABS 360 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34579 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 360 Ig heavy chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 309 310 D-region DNA end/J-region DNA start BASE COUNT 90 a 93 c 93 g 83 t 1 others ORIGIN Chromosome 12. 1 caggtccaac tgcagcngcc tggggctgag ctggtgaggc ctgggtcttc agtgaagctg 61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcattgggt gaagcagagg 121 cctatacaag gccttgaatg gattggtaac attgaccctt ctgatagtga aactcactac 181 aatcaaaagt tcaaggacaa ggccacattg actgtagaca aatcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgcggtct attactgtgc aagagggatt 301 tatagtaact actggtactt cgatgtctgg ggcacaggga ccacggtcac cgtctcctca // LOCUS MUSIGHABT 357 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34580 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 357 Ig heavy chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 306 307 D-region DNA end/J-region DNA start BASE COUNT 90 a 94 c 89 g 82 t 2 others ORIGIN Chromosome 12. 1 caggtccaac tgcancancc tggggctgag ctggtgaggc ctgggtcttc agtgaagctg 61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcattgggt gaagcagagg 121 cctatacaag gccttgaatg gattggtaac attgaccctt ctgatagtga aactcactac 181 aatcaaaagt tcaaggacaa ggccacattg actgtagaca aatcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgcggtct attactgtgc aagatactat 301 ggtaactact ggtacttcga tgtctggggc acagggacca cggtcaccgt ctcctca // LOCUS MUSIGHABU 366 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34581 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 366) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 366 Ig heavy chain V-D-J-region (AA at 1) recomb 294 295 V-region DNA end/D-region DNA start recomb 321 322 D-region DNA end/J-region DNA start BASE COUNT 95 a 93 c 96 g 81 t 1 others ORIGIN Chromosome 12. 1 caggtccaac tgcagcagcc tgggactgaa ctggtnaagc ctggggcttc agtgaagctg 61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg 121 cctggacaag gccttgagtg gattggaaat attaatccta gcaatggtgg tactaactac 181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aatcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagagattac 301 tacggtagta gctgggggta ctactttgac tactggggcc aaggcaccac tctcacagtc 361 tcctca // LOCUS MUSIGHABV 321 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34582 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH27. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 321 Ig heavy chain V-D-J-region (AA at 1) recomb 258 259 V-region DNA end/D-region DNA start recomb 270 271 D-region DNA end/J-region DNA start BASE COUNT 81 a 90 c 63 g 81 t 6 others ORIGIN Chromosome 12. 1 aaaccctnac agtnactctt nnttacctgc tctattactg ntttccccat caccagtggt 61 tactactgga tctggatccg tcagtcacct gggaaacccc tagaatggat ggggtacatc 121 actcatagtg gggaaacttt ctacaaccca tctctccaga gccccatctc cattactaga 181 gaaacgtcaa agaaccagtt cttcctccaa ttgaactctg tgaccacaga ggacacagcc 241 atgtattact gtncaggaga ctatgatggt tactggtact tcgatgtctg gggcacaggg 301 accacggtca ccgtctcctc a // LOCUS MUSIGHABW 336 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig heavy-chain mRNA V-D-J region, partial cds. ACCESSION M34583 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH32. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 336) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 336 Ig heavy chain V-D-J-region (AA at 1) recomb 273 277 V-region DNA end/D-region DNA start recomb 285 286 D-region DNA end/J-region DNA start BASE COUNT 82 a 99 c 71 g 82 t 2 others ORIGIN Chromosome 12. 1 ggacctggcc tggtgaaacc ctcacagtna ctcttcctta cctgctctat tactggtttc 61 cccatcacca gtggttacta ctggatctgg atccgtcagt cacctgggaa acccctagaa 121 tggatggggt acatcactca tagtggggaa actttctaca acccatctct ccagagcccc 181 atctccatta ctagagaaac gtcaaagaac cagttcttcc tccaattgaa ctctgtgacc 241 acagaggaca cagccatgta ttactgtnca ggagacaccg atggttactg gtacttcgat 301 gtctggggca cagggaccac ggtcaccgtc tcctca // LOCUS MUSIGKABQ 306 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34586 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 306) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 306 Ig kappa-chain V-J-region (AA at 1) recomb 284 285 V-region DNA end/J-region DNA start BASE COUNT 75 a 85 c 77 g 69 t ORIGIN Chromosome 6. 1 caaattgttc tcacccagtc tccagcaatc atgtctgcat ctccagggga gaaggtcacc 61 atgacctgca gtgccagctc aagtgtaagt tacatgcact ggtaccagca gaagtcaggc 121 acctccccca aaagatggat ttatgacaca tccaaactgg cttctggagt ccctgctcgc 181 ttcagtggca gtgggtctgg gacctcttac tctctcacaa tcagcagcat ggaggctgaa 241 gatgctgcca cttattactg ccagcagtgg agtagtaacc cgtggacgtt cggtggaggc 301 accaag // LOCUS MUSIGKABR 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34588 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin: processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 339 Ig kappa-chain V-J-region (AA at 1) recomb 302 303 V-region DNA end/J-region DNA start BASE COUNT 89 a 81 c 81 g 86 t 2 others ORIGIN Chromosome 6. 1 gatgttttga tgacccaaac tccactctcc ctncctgtca gtcttggaga tcaagcctcc 61 atctcttgca gatctagtca gagcattgta catagtaatg gaaacaccta tttagaatgg 121 tacctgcaga aaccaggcca gtctccaaag ctcctnatct acaaagtttc caaccgattt 181 tctggggtcc cagacaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc 241 agcagagtgg aggctgagga tctgggagtt tattactgct ttcaaggttc acatgttccg 301 tggacgttcg gtggaggcac caagctggaa atcaaacgg // LOCUS MUSIGKABS 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34589 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 339 Ig kappa-chain V-J-region (AA at 1) recomb 302 303 V-region DNA end/J-region DNA start BASE COUNT 89 a 79 c 82 g 86 t 3 others ORIGIN Chromosome 6. 1 gatgttttga tgacncaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc 61 atctcttgca gatctagtca gagcattgta catagtaatg gaaacaccta tttagaatgg 121 tacctgcaga aaccaggcca gtctccaaag ctcctnatct acaaagtttc caaccgattt 181 tctggggtcc caganaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc 241 agcagagtgg aggctgagga tctgggagtt tattactgct ttcaaggttc acatgttccg 301 tggacgttcg gtggaggcac caagctggaa atcaaacgg // LOCUS MUSIGKABT 323 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34590 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 323) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 323 Ig kappa-chain V-J-region (AA at 1) recomb 288 289 V-region DNA end/J-region DNA start BASE COUNT 96 a 81 c 70 g 76 t ORIGIN Chromosome 6. 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaagat tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagcaa cctggagcaa 241 gaagatattg ccacttactt ttgccaacag ggtaatacgc ttcctccgac gttcggtgga 301 ggcaccaagc tggaaatcaa acg // LOCUS MUSIGKABU 339 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34591 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH27. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 339 Ig kappa-chain V-J-region (AA at 1) recomb 290 291 V-region DNA end/J-region DNA start BASE COUNT 84 a 91 c 81 g 82 t 1 others ORIGIN Chromosome 6. 1 gagaatgtgc tcacccagtc tccaaccacc atggctgcat ctcccggnga gaagatcact 61 atcacctgca gtgccagctc aagtataagt tccaattact tgcattggta tcagcagaag 121 ccaggattct cccctaaact cttgatttat aggacatcca atctggcttc tggagtccca 181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaattgg caccatggag 241 gctgaagatg ttgccactta ctactgccag cagggtagta gtataccacg cacgttcggc 301 tcggggacaa agttggaaat aaaacgggct gatgctgca // LOCUS MUSIGKABV 327 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34592 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH32. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 327) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 327 Ig kappa-chain V-J-region (AA at 1) recomb 290 291 V-region DNA end/J-region DNA start BASE COUNT 82 a 87 c 80 g 78 t ORIGIN Chromosome 6. 1 gagaatgtgc tcacccagtc tccaaccacc atggctgcat ctcccgggga gaagatcact 61 atcacctgca gtgccagctc aagtataagt tccaattact tgcattggta tcagcagaag 121 ccaggattct cccctaaact cttgatttat aggacatcca atctggcttc tggagtccca 181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaattgg caccatggag 241 gctgaagatg ttgccactta ctactgccag cagggtagta gtataccgta cacgttcgga 301 ggggggacca agctggaaat aaaacgg // LOCUS MUSIGKABW 324 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig kappa-chain mRNA V-J region, partial cds. ACCESSION M34593 KEYWORDS immunoglobulin heavy-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH34. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 324 Ig kappa-chain V-J-region (AA at 1) recomb 287 288 V-region DNA end/J-region DNA start BASE COUNT 96 a 76 c 80 g 72 t ORIGIN Chromosome 6. 1 gacatcaaga tgacccagtc tccatcctcc atgtatgcat cgctgggaga gagagtcact 61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca 121 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca 181 agattcagtg gcagtggatc tgggcaagat tattctctaa ccatcagcag cctggagtct 241 gacgatacag caacttatta ctgtctacag catggtgaga gcccgtacac gttcggaggg 301 gggaccaagc tggaaataaa acgt // LOCUS MUSIGLBLP 349 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig lambda-chain mRNA V-J region, partial cds. ACCESSION M34587 KEYWORDS immunoglobulin heavy-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 349) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 349; revises [1]) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 349 Ig lambda-chain V-J-region (AA at 1) recomb 293 294 V-region DNA end/J-region DNA start BASE COUNT 87 a 94 c 81 g 86 t 1 others ORIGIN Chromosome 16. 1 caggctgttg tgactcagga atctgcactc accacatcac ctggtggaac agtcatactc 61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa 121 aaaccagatc atttattcac tggtctaata ggtggtacca gcaaccgagc tccaggtgtt 181 cctgtcagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca 241 cagactgagg atgatgcaat gtatttctgt gctctatggt acagcaccca ttatgttttc 301 ggcggtggaa ccaaggtcac tgtcctaggt cagcccaagt ccantccca // LOCUS MUSIGLBN 330 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig lambda-chain mRNA V-J region, partial cds. ACCESSION M34584 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 330) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 330; revises [1]) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 330 Ig lambda-chain V-J-region (AA at 1) recomb 291 292 V-region DNA end/J-region DNA start BASE COUNT 87 a 87 c 79 g 77 t ORIGIN Chromosome 16. 1 caggctgttg tgactcagga atctgcactc accacatcac ctggtgaaac agtcacactc 61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa 121 aaaccagatc atttattcac tggtctaata ggtggtacca acaaccgagc tccaggtgtt 181 cctgccagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca 241 cagactgagg atgaggcaat atatttctgt gctctatggt acagcaacca ctgggtgttc 301 ggtggaggaa ccaaactgac tgtcctaggc // LOCUS MUSIGLBO 330 bp ss-mRNA ROD 17-JUL-1990 DEFINITION Mouse Ig lambda-chain mRNA V-J region, partial cds. ACCESSION M34585 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain B10.H-2aH-4bp/Wts) from CH lymphoma B cell line, cDNA to mRNA, clone CH2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 330) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. TITLE Restricted Ig variable region gene expression among Ly-1+ B cell lymphomas JOURNAL J. Immunol. 141, 2788-2796 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 330; revises [1]) AUTHORS Pennell,C.A., Arnold,L.W., Haughton,G. and Clarke,S.H. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by C.A.Pennell, 20-APR-1990. FEATURES from to/span description pept < 1 > 330 Ig lambda-chain V-J-region (AA at 1) recomb 291 292 V-region DNA end/J-region DNA start BASE COUNT 87 a 86 c 78 g 79 t ORIGIN Chromosome 16. 1 caggctgttg tgactcagga atctgcactc accacatcac ctggtgaaac agtcacactc 61 acttgtcgct caagtactgg ggctgttaca actagtaact atgccaactg ggtccaagaa 121 aaaccagatc atttattcac tggtctaata ggtggtacca acaaccgagc tccaggtgtt 181 cctgccagat tctcaggctc cctgattgga gacaaggctg ccctcaccat cacaggggca 241 cagactgagg atgaggcaat atatttctgt gctctatggt acagcaacca tttggtgttc 301 ggtggaggaa ccaaactgac tgtcctaggc //
GenBank-Updates@genbank.bio.net (07/18/90)
LOCUS HUMELA2 906 bp ss-mRNA PRI 18-JUL-1990 DEFINITION Human elastase 2 mRNA, complete cds. ACCESSION M16631 KEYWORDS elastase. SOURCE Human pancreas, cDNA to mRNA (library of R.Weiss), clones hpe2-lambda-[4,10]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 906) AUTHORS Fletcher,T.S., Shen,W.-F. and Largman,C. TITLE Primary structure of human pancreatic elastase 2 determined by sequence analysis of the cloned mRNA JOURNAL Biochemistry 26, 7256-7261 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.S.Fletcher, 05-AUG-1987. A poly-adenylation signal is located at positions 891-896. FEATURES from to/span description pept 22 831 elastase 2 precursor sigp 22 69 elastase 2 signal peptide matp 70 828 elastase 2 BASE COUNT 197 a 274 c 253 g 182 t ORIGIN 161 bp upstream of PstI site. 1 aaacagtccc agggacacac catgataagg acgctgctgc tgtccacttt ggtggctgga 61 gccctcagtt gtggggaccc cacttaccca ccttatgtga ctagggtggt tggcggtgaa 121 gaagcgaggc ccaacagctg gccctggcag gtctccctgc agtacagctc caatggcaag 181 tggtaccaca cctgcggagg gtccctgata gccaacagct gggtcctgac ggctgcccac 241 tgcatcagct cctccaggac ctaccgcgtg gggctgggcc ggcacaacct ctacgttgcg 301 gagtccggct cgctggcagt cagtgtctct aagattgtgg tgcacaagga ctggaactcc 361 aaccaaatct ccaaagggaa cgacattgcc ctgctcaaac tggctaaccc cgtctccctc 421 accgacaaga tccagctggc ctgcctccct cctgccggca ccattctacc caacaactac 481 ccctgctacg tcacgggctg gggaaggctg cagaccaacg gggctgttcc tgatgtcctg 541 cagcagggcc ggttgctggt tgtggactat gccacctgct ccagctctgc ctggtggggc 601 agcagcgtga aaaccagtat gatctgtgct gggggtgatg gcgtgatctc cagctgcaac 661 ggagactctg gcgggccact gaactgtcag gcgtctgacg gccggtggca ggtgcacggc 721 atcgtcagct tcgggtctcg cctcggctgc aactactacc acaagccctc cgtcttcacg 781 cgggtctcca attacatcga ctggatcaat tcggtgattg caaataacta accaaaagaa 841 gtccctggga ctgtttcaga cttggaaagg tcacagaagg aaaataatat aataaagtga 901 caactc // LOCUS CLOFTHFSA 1965 bp ds-DNA BCT 18-JUL-1990 DEFINITION C.thermoaceticum formyltetrahydrofolate synthetase (FTHFS) gene, complete cds. ACCESSION J02911 KEYWORDS formyltetrahydrofolate synthetase. SOURCE C.thermoaceticum DNA. ORGANISM Clostridium thermoaceticum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1965) AUTHORS Lovell,C.R., Przybyla,A. and Ljungdahl,L.G. TITLE Primary structure of the thermostable formyltetrahydrofolate synthetase from Clostridium thermoaceticum JOURNAL Biochemistry 29, 5687-5694 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.R.Lovell, 13-APR-1990, for release after publication. FEATURES from to/span description pept 56 1735 formyltetrahydrofolate synthetase (FTHFS) (ttg start codon) (EC 6.3.4.3) mRNA < 1 1760 FTHFS mRNA binding 41 44 ribosomal binding site (put.) signal 1761 1785 pot. transcription termination signal BASE COUNT 459 a 576 c 532 g 398 t ORIGIN 1 aatatgtgac gtgcaaccgg atgttcactt tttcgaatga gaggagtgga ttacattgtc 61 caaggtaccc agtgatattg agattgccca ggcagccaaa atgaaaccgg tcatggaact 121 ggcccgggga ctgggcatcc aagaggacga ggtcgagctt tatggtaagt acaaggccaa 181 gatctccctc gatgtctatc gtcgcctcaa agacaagcct gacgggaaac taatcctggt 241 aaccgccatt acccctactc cggccggcga agggaaaact actaccagtg tcggtctcac 301 cgatgccctg gctcgcctgg ggaaaagggt gatggtctgc ctgcgggagc cctccctggg 361 acccagcttt ggtatcaaag gcggtgccgc cggcggtggt tatgcccagg tagtacccat 421 ggaagatatc aacctgcact tcaccggcga tatccacgcc gtcacctatg cccacaacct 481 gctggcggcc atggtggata accacctgca gcagggtaac gtcctgaata ttgatccccg 541 taccatcacc tggcgccggg tcatcgacct taatgaccgg gctctgagga acatagtcat 601 cggcctgggt ggcaaagcca acggcgtacc gcgggagaca gggtttgaca tctccgttgc 661 ctcggaggtt atggcctgcc tgtgcctggc cagcgacctc atggatctca aggaacgttt 721 cagccgcatt gttgtcggct acacctatga cggcaaaccg gtcaccgccg gcgatctgga 781 ggcccagggt tccatggctc ttctcatgaa ggacgccatt aaacccaacc tggtccaaac 841 cctggagaat acgccggcct ttatccacgg tggtcccttc gccaatatcg cccacggttg 901 caacagcatt atcgcaacca agacggccct gaaactggcg gattatgtcg tgacggaagc 961 cggtttcggt gccgacctgg gtgccgagaa gttctatgac gttaaatgcc gttatgccgg 1021 ctttaaaccc gatgccacag tcatcgtggc taccgtccgc gccctcaaga tgcacggcgg 1081 cgtacccaaa tcagacctgg ccactgaaaa cctggaagcc ctgcgggaag gctttgccaa 1141 cctggagaaa cacatcgaaa atatcggcaa gttcggcgta ccggcagtcg tggccatcaa 1201 tgccttcccc accgataccg aggccgagct aaatctcctc tacgagttgt gcgccaaagc 1261 tggggccgaa gttgccctct cggaagtctg ggctaagggc ggcgaaggcg gtctggaact 1321 tgcccggaag gtgttgcaga ccctggagag caggccatcc aacttccatg tcctctacaa 1381 cctggacctg agtattaaag acaaaattgc caaaatcgcc accgagatct acggggccga 1441 cggcgtcaac tatacggccg aagccgacaa agctatccag cgttatgaat ccctgggcta 1501 cggcaacctg ccggtggtca tggccaagac ccaatactcc ttttccgatg acatgaccaa 1561 gctcgggcgg ccgcggaact ttaccatcac cgtgcgcgag gtgcgcctct cggccggagg 1621 caggcttatc gtccccatca ccggcgccat aatgaccatg cccgggctgc ccaaacgccc 1681 ggcggcctgc aacatcgaca tcgatgccga cggcgtcatt accggtcttt tctagtatgc 1741 ctggagtaaa gcctatccct accccactcc tccagggagt ggggtttttg ctgtctatat 1801 gagaatgata taactaagaa gctggttttc caagccgtac ctatcatagg ggctgcgcca 1861 ccgccatgaa ccacgaaaat gcaaaagata tcattcatgt tagaactctg ttaccgagag 1921 gaacgattat gtctggcatg acgagataga gtataccgta gcgat // LOCUS HUMGLIBSA 176 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A1. ACCESSION M34816 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 176) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 31 53 human zinc finger protein binding site binding 94 116 human zinc finger protein binding site binding 157 176 human zinc finger protein binding site BASE COUNT 68 a 50 c 36 g 22 t ORIGIN 1 aagaacacag gaaccaactg caagagctcc cgaagaccac ccacaatgat ggttgtatgt 61 cagaaggaca caggaaccaa ctgcaagagc tcccaaagac cacccacaat gatggttgta 121 tgtcagaagg acacaggaac caactgcaag agctcccaaa aaccacccac aatgat // LOCUS HUMGLIBSB 166 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A2. ACCESSION M34817 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 166) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 27 49 human zinc finger protein binding site binding 90 112 human zinc finger protein binding site BASE COUNT 60 a 47 c 37 g 22 t ORIGIN 1 acacaggaac caactgcaag agctcccgaa gaccacccac aatgatggtg gtatgttgaa 61 aggacacagg aaccaactgc aagagctccc aaagaccacc cacaatgatg gttgtgtgtc 121 aaaaggacac aggaaccaac tgtaagagct cctgaagacc acccac // LOCUS HUMGLIBSC 313 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A3. ACCESSION M34818 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 313) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 1 23 human zinc finger protein binding site binding 64 86 human zinc finger protein binding site binding 127 149 human zinc finger protein binding site binding 190 212 human zinc finger protein binding site binding 253 275 human zinc finger protein binding site BASE COUNT 110 a 79 c 75 g 49 t ORIGIN 1 caaagaccac ccacaatgat ggttgtatgt cagaaggaca caggaaccaa ctgcaagagc 61 tcccaaaaac cacccacaat gatggtggta tgtcagatgg acacaggaac caactgcaag 121 agctcccgaa gaccacccac aatgatggtg gtatgttgaa aggacacagg aaccaactgc 181 aagagctccc aaagaccacc cacgatgatg gttgtgtgtc aaaaggacac aggaaccaac 241 tgtaagagct cctgaagacc acccacaatg atggttttat gtcagaagga tacaggaacc 301 agctgcaagg gct // LOCUS HUMGLIBSD 184 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region A4. ACCESSION M34819 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone A4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 184) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 32 54 human zinc finger protein binding site binding 95 117 human zinc finger protein binding site binding 158 180 human zinc finger protein binding site BASE COUNT 67 a 46 c 41 g 30 t ORIGIN 1 aaaagacaca ggaaccaact gcaagagctc ccaaagacca cccacaatga tggttgtgtg 61 tcaaaaggac acaggaacca actgtaagag ctcctgaaga ccacccacaa tgatggtttt 121 atgtcagaag gatacaggaa ccagctgcaa gagctcctga agaccaccca caatgatggt 181 tgta // LOCUS HUMGLIBSE 62 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region B1. ACCESSION M34820 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone B1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 62) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 11 33 human zinc finger protein binding site BASE COUNT 18 a 26 c 11 g 7 t ORIGIN 1 aaaccagacg cgtggaccac ccaagacgaa attcacactc gaccctagcc ccgccagccc 61 tt // LOCUS HUMGLIBSF 67 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human zinc finger protein (GLI) DNA binding region C1. ACCESSION M34821 KEYWORDS GLI protein binding site. SOURCE Human DNA, clone C1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 67) AUTHORS Kinzler,K.W. and Vogelstein,B. TITLE The GLI gene encodes a nuclear protein which binds specific sequences in the human genome JOURNAL Mol. Cell. Biol. 10, 634-642 (1990) STANDARD simple staff_entry FEATURES from to/span description binding 16 29 human zinc finger protein binding site BASE COUNT 15 a 26 c 15 g 11 t ORIGIN 1 agacccaggc acacagatag accacccagc ttcaggtggg ggaccaccct tgcatcccct 61 ctctgct // LOCUS MLVU3RA 383 bp ss-RNA VRL 18-JUL-1990 DEFINITION Murine leukemia virus CWM-T-15.4 U3 region genomic RNA. ACCESSION M34722 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWM-T-15.4), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 383) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 112 178 MuLV promoter region binding 131 144 CCAAT-binding transcription factor binding 147 152 Moloney leukemia virus binding protein b binding 156 167 enhancer core binding protein binding 350 362 CCAAT-binding transcription factor binding 363 372 glucocorticoid response element BASE COUNT 127 a 105 c 84 g 67 t ORIGIN 1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat cttgcaaggc 61 atggaaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc 121 tggaaagtac cgggactagg gccaaacagg atatctgtgg tcaagcacta gggccctagc 181 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg 241 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc 301 gcttattgct gcccagctct ataaaaaggg taaaaacccc acactcggcc ggcccagggc 361 caagaacaga tggtccccag aaa // LOCUS MLVU3RB 348 bp ss-RNA VRL 18-JUL-1990 DEFINITION Murine leukemia virus CWM-T-15 U3 region genomic RNA. ACCESSION M34723 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWM-T-15), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 348) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 77 177 MuLV promoter region binding 96 109 CCAAT-binding transcription factor binding 112 117 Moloney leukemia virus binding protein b binding 121 132 enhancer core binding protein binding 142 154 CCAAT-binding transcription factor binding 155 164 glucocorticoid response element BASE COUNT 115 a 97 c 76 g 60 t ORIGIN 1 ctgcagtaac gccatcttgc aaggcatgga aaaataccag agctgatgtt ctcagaaaaa 61 caagaacaag gaagtacaga gaggctggaa agtaccggga ctagggccaa acaggatatc 121 tgtggtcaaa cacctgggcc ccggcccagg gccaagaaca gatggtcccc agaaatagct 181 aaaacaacaa cagtttcaag agacccaaaa actgtctcaa ggttccccag atgaccgggg 241 atcaacccca agcctcattt aaactaacca atcagctcgc ttctcgcttc tgtacccgcg 301 cttattgctg cccagctcta taaaaagggt aaaaacccca cactcggc // LOCUS MLVU3RC 445 bp ss-RNA VRL 18-JUL-1990 DEFINITION Murine leukemia virus CWM-T-25a U3 region genomic RNA. ACCESSION M34724 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWN-T-25a), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 445) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 77 274 MuLV promoter region binding 96 109 CCAAT-binding transcription factor binding 168 179 CCAAT-binding transcription factor binding 238 252 CCAAT-binding transcription factor binding 138 143 Moloney leukemia virus binding protein b binding 209 214 Moloney leukemia virus binding protein b binding 147 158 enhancer core binding protein binding 218 229 enhancer core binding protein binding 180 189 glucocorticoid response element binding 252 263 glucocorticoid response element BASE COUNT 142 a 126 c 104 g 73 t ORIGIN 1 ctgcagtaac gccattttgc aaggcatgaa aaagtaccag agctgaagtt ctcagaaaaa 61 caagaacaag gaagtacaga gaggctaaaa agtacccggc ccagggccaa gaacagatgg 121 tcccagaccg ctaacgacag gatatctgtg gttaagcact agggccccgc ccagggccaa 181 gaacagatgg tccccagacc gctaacgaca ggatatctgt ggttaagcac tagggccccg 241 gcccagggcc aagaacagat ggtccccaga aatagctaaa acaacaacag tttcaagaga 301 cgcagaaact gtctcaaggt tccccagatg accggggatc aaccccaagc ctcatttaaa 361 ctaaccaatc agctcgcttc tcgcttctgt acccgcgctt attgctgccc agctctataa 421 aaagggtaag aaccccacac tcggc // LOCUS MLVU3RD 409 bp ss-RNA VRL 18-JUL-1990 DEFINITION Murine leukemia virus CWM-T-25b U3 region genomic RNA. ACCESSION M34725 KEYWORDS U3 region; promoter region. SOURCE Murine leukemia virus (strain CWN-T-25b), cDNA to genomic RNA, from mice (strain CWB). ORGANISM Murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 409) AUTHORS Thomas,C.Y., Coppola,M.A., Holland,C.A. and Massey,A.C. TITLE Oncogenicity and U3 region sequences of class II recombinant MuLVs of CWD mice JOURNAL Virology 176, 166-177 (1990) STANDARD simple staff_entry FEATURES from to/span description signal 112 238 MuLV promoter region binding 131 144 CCAAT-binding transcription factor binding 174 179 Moloney leukemia virus binding protein b binding 183 194 enhancer core binding protein binding 204 215 CCAAT-binding transcription factor binding 216 225 glucocorticoid response element BASE COUNT 133 a 114 c 91 g 71 t ORIGIN 1 aatgaaagac cccttcataa ggcttagcca gctaactgca gtaacgccat tttgcaaggc 61 atgggaaaat accagagctg atgttctcag aaaaacaaga acaaggaagt acagagaggc 121 taaaaggtac ccggcccagg gccaagaaca gatggtcccc agaccgctaa cgacaggata 181 tctgtggtta agcactaggg ccccggccag ggccaagaac agatggtccc cagaaatagc 241 taaaacaaca acagtttcaa gagacccaga aactgtctca aggttcccca gatgaccggg 301 gatcaacccc aagcctcatt taaactaacc aatcagctcg cttctcgctt ctgtacccgc 361 gcttattgct gcccagctct ataaaaaggg taagaacccc acactcggc // LOCUS SIVPBJA 125 bp ss-RNA VRL 18-JUL-1990 DEFINITION Simian immunodeficiency virus upstream enhancer region. ACCESSION M31325 KEYWORDS . SOURCE Simian (mangabey) immunodeficiency virus (isolate PBj14), clone 4.41. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 125) AUTHORS Dewhurst,S., Embretson,J.E., Anderson,D.C., Mullins,J.I. and Fultz,P.N. TITLE Sequence analysis and acute pathogenicity of molecularly cloned SIV JOURNAL Nature 345, 636-640 (1990) STANDARD simple staff_review BASE COUNT 40 a 25 c 38 g 22 t ORIGIN 1 gaagaaaaca agctaagaca gcagggactt tccacaaaga cagcagggac tttccacaaa 61 ggggatgtta cggggaggta ctggggagga gctggctgga acgcccacgt attctctgta 121 taaat // LOCUS CPARRAA 119 bp ss-rRNA RNA 18-JUL-1990 DEFINITION C.paradoxa cellular 5S ribosomal RNA. ACCESSION M33029 KEYWORDS 5S ribosomal rRNA. SOURCE C.paradoxa rRNA. ORGANISM Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Maxwell,E.S., Liu,J. and Shively,J.M. TITLE Nucleotide sequence of Cyanophora paradoxa cellular and cyanelle- associated 5S ribosomal RNAs JOURNAL Ann. N.Y. Acad. Sci. 503, 559-561 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 25 a 35 c 37 g 22 t ORIGIN 1 gtgtacggct atactaccgg aaaagcgccc gttcccgtcc gattacgaga gcctagcccg 61 gtcaggcccg actagtacta gggtggggga ccacctggga acatcgggtg ctgtacact // LOCUS CPARRAB 118 bp ss-rRNA RNA 18-JUL-1990 DEFINITION C.paradoxa cyanellar 5S ribosomal RNA. ACCESSION M33030 KEYWORDS 5S ribosomal rRNA. SOURCE C.paradoxa cyanellar rRNA. ORGANISM Cyanelle Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa. REFERENCE 1 (bases 1 to 118) AUTHORS Maxwell,E.S., Liu,J. and Shively,J.M. TITLE Nucleotide sequence of Cyanophora paradoxa cellular and cyanelle- associated 5S ribosomal RNAs JOURNAL Ann. N.Y. Acad. Sci. 503, 559-561 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 31 a 26 c 31 g 30 t ORIGIN 1 cctggtgctt attgcgtggt ggaaccacac cgatccatac cgaactcggt tgtgaaacat 61 tactgcggcg acaatacttg aagggtaact ttctggaaag atagcttagt gccaggat // LOCUS SNDZSTR 15384 bp ss-RNA VRL 18-JUL-1990 DEFINITION Sendai virus nucleoprotein (NP), phosphoprotein (P), matrix protein (M), F protein, hemagglutinin-neuraminidase (HN), and L protein genes, complete cds. ACCESSION M30202 M30203 M30204 M19661 KEYWORDS F protein; L protein; hemagglutinin-neuraminidase; matrix protein; nucleoprotein; phosphoprotein. SOURCE Sendai virus (Z strain, F1-R mutant, and ts-f1 mutant) RNA. ORGANISM Parainfluenza virus type 1 Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Paramyxoviridae; Paramyxovirus. REFERENCE 1 (sites) AUTHORS Tashiro,M., Pritzer,E., Khoshnan,M.A., Yamakawa,M., Kuroda,K., Klenk,H.-D., Rott,R. and Seto,J.T. TITLE Characterization of a pantropic variant of Sendai virus derived from a host range mutant JOURNAL Virology 165, 577-583 (1988) STANDARD full staff_review REFERENCE 2 (bases 4866 to 6560; for [1]) AUTHORS Tashiro,M., Pritzer,E., Khoshnan,M.A., Yamakawa,M., Kuroda,K., Klenk,H.-D., Rott,R. and Seto,J.T. JOURNAL Unpublished (1988) STANDARD full staff_review REFERENCE 3 (bases 1 to 15384) AUTHORS Middleton,Y., Tashiro,M., Thai,T., Oh,J., Seymour,J., Pritzer,E., Klenk,H.-D., Rott,R. and Seto,J.T. TITLE Nucleotide sequence analyses of the genes encoding the HN, M, NP, P, and L proteins of two host range mutants of Sendai virus JOURNAL Unpublished (1989) California State University at Los Angeles, Dept. of Microbiology, Los Angeles, CA 90032-8745 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] and [2] kindly submitted by J.T.Seto, 08-JUL-1988, for [3], 27-NOV-1989. Author Address: J.T.Seto California State University at Los Angeles Department of Microbiology Los Angeles, CA 90032-8745 FEATURES from to/span description pept 120 1694 nucleoprotein (NP) pept 1844 3550 phosphoprotein (P) pept 3669 4715 matrix protein (M) pept 4866 6563 F protein pept 6693 8420 hemagglutinin-neuraminidase (HN) pept 8556 15242 L protein variant 310 310 c in strain Z and F1-R mutant; t in ts-f1 mutant variant 350 350 c in strain Z and F1-R mutant; t in ts-f1 mutant variant 1580 1580 t in strain Z and F1-R mutant; c in ts-f1 mutant variant 1586 1586 g in strain Z and ts-f1 mutant; a in F1-R mutant variant 2307 2307 g in strain Z; a in F1-R mutant and ts-fl mutant variant 4020 4020 a in strain Z and F1-R mutant; c in ts-fl mutant variant 4051 4051 g in strain Z and F1-R mutant; a in ts-fl mutant variant 4297 4297 c in strain Z and F1-R mutant; t in ts-fl mutant variant 5053 5053 t in strain Z and F1-R mutant; g in ts-fl mutant variant 5176 5176 g in strain Z and F1-R mutant; a in ts-fl mutant variant 5208 5208 c in strain Z and F1-R mutant; t in ts-fl mutant variant 5212 5212 a in strain Z and F1-R mutant; g in ts-fl mutant variant 5700 5700 a in strain Z and F1-R mutant; g in ts-fl mutant variant 5708 5708 t in strain Z and F1-R mutant; c in ts-fl mutant variant 6530 6530 a in strain Z and F1-R mutant; c in ts-fl mutant variant 10429 10429 g in strain Z; a in F1-R mutant and ts-fl mutant variant 15262 15262 c in strain Z and F1-R mutant; a in ts-fl mutant BASE COUNT 4595 a 3329 c 3756 g 3704 t ORIGIN 1 accaaacaag agaaaaaaca tgtatgggat atgtaatgaa gttatacagg attttagggt 61 caaagtatcc accctgagga gcaggttcca gaccctttgc tttgctgcca aagttcacga 121 tggccgggtt gttgagcacc ttcgatacat ttagctctag gaggagcgaa agtattaata 181 agtcgggagg aggtgctgtt atccccggcc agaggagcac agtctcagtg ttcgtactag 241 gcttaagtgt gactgatgat gcagacaagt tattcattgc aactaccttc ctagctcact 301 cattggacac agataagcag cactctcaga gaggagggtt cctcgtctcc ctgcttgcca 361 tggcttacag tagtccagaa ttgtacttga caacaaacgg agtaaacgcc gatgtcaaat 421 atgtgatcta caacatagag aaagacccta agaggacgaa gacagacgga ttcattgtga 481 agacgagaga tatggaatat gagaggacca cagaatggct gtttggacct atggtcaaca 541 agagcccact cttccagggt caacgggatg ctgcagaccc tgacacactc cttcaaatct 601 atgggtatcc tgcatgccta ggagcaataa ttgtccaagt ctggattgtg ctggtgaagg 661 ccatcacaag cagcgccggc ttaaggaaag ggttcttcaa caggttagag gcgttcagac 721 aagacggcac cgtgaaaggt gccttagttt tcactgggga gacagttgag gggataggct 781 cggttatgag atctcagcaa agccttgtat ctctcatggt tgagaccctt gtgactatga 841 atactgcaag atctgatctc accacattag agaagaacat ccagatagtt gggaactaca 901 tccgagatgc agggctggct tccttcatga acactattaa atatggggtg gagacaaaga 961 tggcagctct aacgttgtca aacctgaggc ccgatattaa taagcttaga agcctcatag 1021 acacctacct gtcaaaaggc cccagagctc cctttatctg tatcctcaag gaccctgttc 1081 atggtgaatt tgctccaggc aattatcctg cactatggag ttacgccatg ggagtcgccg 1141 tcgtacagaa caaggcaatg cagcagtacg tcacagggag gacatacctt gatatggaaa 1201 tgttcttact aggacaagcc gtggcaaagg atgctgaatc gaagatcacg agtgccttgg 1261 aagatgagtt aggagtgacg gatacagcca aggggaggct cagacatcat ctggcaaact 1321 tgtccggtgg ggatggtgct taccgcaaac caacaggcgg tggtgcaatt gaggtagctc 1381 tagacaatgc cgatatcgac ctagaaacaa aagctcatgc ggaccaggac gctaggggtt 1441 ggggtggaga tagtggtgaa agatgggcac gtcaggtgag tggtggccac tttgtcacac 1501 tacatggggc tgaacggtta gaggaggaaa ccaatgatga ggatgtatca gacatagaga 1561 gaagaatagc catgagactt gcagagagac ggcaagagga ttctgcaacc catggagatg 1621 aaggccgcaa taacggtgtc gatcacgacg aagatgacga taccgcagca gtagctgggg 1681 taggaggaat ctaggatcat acgaggcttc aaggtacttg atccgtagta agaaaaactt 1741 agggtgaaag ttcatccacc gatcggctca ggcaaggcca cacccaaccc caccgaccac 1801 acccagcagt cgagacagcc acggcttcgg ctacacttac cgcatggatc aagatgcctt 1861 cattcttaaa gaagattctg aagttgagag ggaggcgcca ggaggacgag agtcgctctc 1921 ggatgttatc ggattcctcg atgctgtcct gtcgagtgaa ccaactgaca tcggagggga 1981 cagaagctgg ctccacaaca ccatcaacac tccccaagga ccaggctctg ctcatagagc 2041 caaaagtgag ggcgaaggag aagtctcaac accgtcgacc caagataatc gatcaggtga 2101 ggagagtaga gtctctggga gaacaagcaa gccagaggca gaagcacatg ctggaaacct 2161 tgataaacaa aatatacacc gggcctttgg gggaagaact ggtacaaact ctgtatctca 2221 ggatctgggc gatggaggag actccggaat ccttgaaaat cctccaaatg agagaggata 2281 tccgagatca ggtattgaag atgaaagcag agagatggct gcgcaccctg ataagagggg 2341 agaagaccaa gctgaaggac ttccagaaga ggtacgagga ggtacatccc tacctgatga 2401 aggagaaggt ggagcaagta ataatggaag aagcatggag cctggcagct cacatagtgc 2461 aagagtaact ggggtcctgg tgattcctag ccccgaactt gaagaggctg tgctacggag 2521 gaacaaaaga agacctacca acagtgggtc caaacctctt actccagcaa ccgtgcctgg 2581 cacccggtcc ccaccgctga atcgttacaa cagcacaggg tcaccaccag gaaaaccccc 2641 atctacacag gatgagcaca tcaactctgg ggacaccccc gccgtcaggg tcaaagaccg 2701 gaaaccacca atagggaccc gctctgtctc agattgtcca gccaacggcc gcccaatcca 2761 cccgggtcta gagaccgact caacaaaaaa gggcatagga gagaacacat catctatgaa 2821 agagatggct acattgttga cgagtcttgg tgtaatccag tctgctcaag aattcgaatc 2881 atcccgagac gcgagttatg tgtttgcaag acgtgcccta aagtctgcaa actatgcaga 2941 gatgacattc aatgtatgcg gcctgatcct ttctgccgag aaatcttccg ctcgtaaggt 3001 agatgagaac aaacaactgc tcaaacagat ccaagagagc gtggagtcat tccgggatat 3061 ttataagaga ttctctgagt atcagaaaga acagaactca ttgctgatgt ccaacctatc 3121 tacacttcat atcatcacag atagaggtgg caagactgac aacacagact cccttacaag 3181 gtccccctcc gtttttgcaa aatcaaaaga gaacaagact aaggctacca ggtttgaccc 3241 atctatggag accctagaag atatgaagta caaaccggac ctaatccgag aggatgaatt 3301 tagagatgag atccgcaacc cggtgtacca agagagggac acagaaccta gggcctcaaa 3361 cgcatcacgt ctcctcccct ccaaagagaa gcccacaatg cactctctca ggctcgtcat 3421 agagagcagt cccctaagca gagctgagaa agcagcatat gtgaaatcat tatccaagtg 3481 caagacagac caagaggtta aggcagtcat ggaactcgta gaagaggaca tagagtcact 3541 gaccaactag atcccgggtg aggcatccta ccatcctcag tcatagagag atccaatcta 3601 ccatcagcat cagccagtaa agattaagaa aaacttaggg tgaaagaaat ttcacctaac 3661 acggcgcaat ggcagatatc tatagattcc ctaagttctc atatgaggat aacggtactg 3721 tggagcccct gcctctgaga actggtccgg ataagaaagc catcccccac atcaggattg 3781 tcaaggtagg agtccctcct aaacatggag tgagatacct agatttattg ctcttgggtt 3841 tctttgagac accgaaacaa acaaccaatc tagggagcgt atctgacttg acagagccga 3901 ccagctactc aatatgcggc tccgggtcgt tacccatagg tgtggccaaa tactacggga 3961 ctgatcagga actcttaaag gcctgcaccg atctcagaat tacggtgagg aggactgtta 4021 gagcaggaga gatgatcgta tacatggtgg gttcgattgg tgctccactc ctaccatggt 4081 caggcaggct gagacaggga atgatattta atgcaaacaa ggtcgcacta gctccccaat 4141 gcctccctgt ggacaaggac ataagactca gagtggtgtt tgtcaatggg acatctctag 4201 gggcaatcac catagccaag atcccaaaga cccttgcaga ccttgcattg cccaactcta 4261 tatccgttaa tttactggtg acactcaaga ccgggacctc cacagaacaa aagggggtac 4321 tcccagtact tgatgatcaa ggggagaaaa agctcaattt tatggtgcac ctcgggttga 4381 tcaggagaaa ggtcgggaag atatactctg ttgagtactg caagagcaag attgagagaa 4441 tgcggctgat tttctcactt gggttaatcg gcggtataag cttccatgtt caggttattg 4501 ggacactatc taagacattc atgagtcagc tcgcatggaa gagggcagtc tgcttcccat 4561 taatggatgt gaatccccat atgaacatgg tgatttgggc ggcatctgta gaaatcacag 4621 gcgtcgatgc ggtgttccaa ccggccatcc ctcgtgattt ccgctactac cctaatgttg 4681 tggctaagaa catcggaagg atcagaaagc tgtaaatgtg cacccatcag agacctgcga 4741 caatgcccca agcagacacc acctggcagt cggagccacc gggtcactcc ttgtcttaaa 4801 taagaaaaac ttagggataa agtcccttgt gagtgcttgg ttgcaaaact ctccccttgg 4861 gaaacatgac agcatatatc cagagatcac agtgcatctc aacatcacta ctggttgttc 4921 tcaccacatt ggtctcgtgt cagattccca gggataggct ctctaacata ggggtcatag 4981 tcgatgaagg gaaatcactg aagatagctg gatcccacga atcgaggtac atagtactga 5041 gtctagttcc ggtggtagac tttgagaatg ggtgcggaac agcccaggtt atccagtaca 5101 agagcctact gaacaggctg ttaatcccat tgagggatgc cttagatctt caggaggctc 5161 tgataactgt caccagtgat acgacacaaa atgccggtgc tccacagccg aaattcttcg 5221 gtgctgtgat tggtactatc gcacttggag tggcgacatc agcacaaatc accgcaggga 5281 ttgcactagc cgaagcgagg gaggccaaaa gagacatagc gctcatcaaa gaatcgatga 5341 caaaaacaca caagtctata gaactgctgc aaaacgctgt gggggaacaa attcttgctc 5401 taaagacact ccaggatttc gtgaatgatg agatcaaacc cgcaataagc gaattaggct 5461 gtgagactgc tgccttaaga ctgggtataa aattgacaca gcattactcc gagctgttaa 5521 ctgcgttcgg ctcgaatttc ggaaccatcg gagagaagag cctcacgctg caggcgctgt 5581 cttcacttta ctctgctaac attactgaga ttatgaccac aatcaggaca gggcagtcta 5641 acatctatga tgtcatttat acagaacaga tcaaaggaac ggtgatagat gtggatctaa 5701 agagatatat ggtcaccctg tctgtgaaga tccctattct ttctgaagtc ccaggtgtgc 5761 tcatacacaa ggcatcatct atttcttaca acatagacgg ggaggaatgg tatgtgactg 5821 tccccagcca tatactcagt cgtgcttctt tcttaggggg tgcagacata accgattgtg 5881 ttgagtccag attgacctat atatgcccca gggatcccgc acaactgata cctgacagcc 5941 agcaaaagtg tatcctgggg gacacaacaa ggtgtcctgt cacaaaagtt gtggacagcc 6001 ttatccccaa gtttgctttt gtgaatgggg gcgttgttgc taactgcata gcatccacat 6061 gtacctgcgg gacaggccga agaccaatca gtcaggatcg ctctaaaggt gtagtattcc 6121 taacccatga caactgtggt cttataggtg tcaatggggt agaattgtat gctaaccgga 6181 gagggcacga tgccacttgg ggggtccaga acttgacagt cggtcctgca attgctatca 6241 gacccgttga tatttctctc aaccttgctg atgctacgaa tttcttgcaa gactctaagg 6301 ctgagcttga gaaagcacgg aaaatcctct cggaggtagg tagatggtac aactcaagag 6361 agactgtgat tacgatcata gtagttatgg tcgtaatatt ggtggtcatt atagtgatca 6421 tcatcgtgct ttatagactc agaaggtcaa tgctaatggg taatccagat gaccgtatac 6481 cgagggacac atacacatta gagccgaaga tcagacatat gtacacaaaa ggtgggtttg 6541 atgcaatggc tgagaaaaga tgatcacgac cattatcaga tgtcttgtaa agcaggcatg 6601 gtattcgttg agatctgtat ataataagaa aaacttaggg tgaaagtgag gtcgcgcggt 6661 actttagctt tcacctcaaa caagcacaga tcatggatgg tgataggggc aaacgtgact 6721 cgtactggtc tacttctcct agtggtagca ctacaaaatt agcatcaggt tgggagaggt 6781 caagtaaagt tgacacatgg ttgctgattc tctcattcac ccagtgggct ttgtcaattg 6841 ccacagtgat catctgtatc ataatttctg ctagacaagg gtatagtatg aaagagtact 6901 caatgactgt agaggcattg aacatgagca gcagggaggt gaaagagtca cttaccagtc 6961 taataaggca agaggttata gcaagggctg tcaacattca gagctctgtg caaaccggaa 7021 tcccagtctt gttgaacaaa aacagcaggg atgtcatcca gatgattgat aagtcgtgca 7081 gcagacaaga gctcactcag cactgtgaga gtacgatcgc agtccaccat gccgagggaa 7141 ttgccccact tgagccacat agtttctgga gatgccctgt cggagaaccg tatcttagct 7201 cagatcctga aatctcattg ctgcctggtc cgagcttgtt atctggttct acaacgatct 7261 ctggatgtgt taggctccct tcactctcaa ttggcgaggc aatctatgcc tattcatcaa 7321 atctcattac acaaggttgt gctgacatag ggaaatcata tcaggtcctg cagctagggt 7381 acatatcact caattcagat atgttccctg atcttaaccc cgtagtgtcc cacacttatg 7441 acatcaacga caatcggaaa tcatgctctg tggtggcaac cgggactagg ggttatcagc 7501 tttgctccat gccgactgta gacgaaagaa ccgactactc tagtgatggt atcgaggatc 7561 tggtccttga tgtcctggat ctcaaaggga gaactaagtc tcaccggtat cgcaacagcg 7621 aggtagatct tgatcacccg ttctctgcac tataccccag tgtaggcaac ggcattgcaa 7681 cagaaggctc attgatattt cttgggtatg gtggactaac cacccctctg cagggtgata 7741 caaaatgtag gacccaagga tgccaacagg tgtcgcaaga cacatgcaat gaggctctga 7801 aaattacatg gctaggaggg aaacaggtgg tcagcgtgat catccaggtc aatgactatc 7861 tctcagagag gccaaagata agagtcacaa ccattccaat cactgaaaac tatctcgggg 7921 cggaaggtag attattaaaa ttgggtgatc gggtgtacat ctatacaaga tcatcaggct 7981 ggcactctca actgcagata ggagtacttg atgtcagcca ccctttgact atcaactgga 8041 cacctcatga agccttgtct agaccaggaa ataaagagtg caattggtac aataagtgtc 8101 cgaaggaatg catatcaggc gtatacactg atgcttatcc attgtcccct gatgcagcta 8161 acgtcgctac cgtcacgcta tatgccaata catcgcgtgt caacccaaca atcatgtatt 8221 ctaacactac taacattata aatatgttaa ggataaagga tgttcaatta gaggctgcat 8281 ataccacgac atcgtgtatc acgcattttg gtaaaggcta ctgctttcac atcatcgaga 8341 tcaatcagaa gagcctgaat accttacagc cgatgctctt taagactagc atccctaaat 8401 tatgcaaggc cgagtcttaa atttaactga ctagcaggct tgtcggcctt gctgacacta 8461 gagtcatctc cgaacatcca caatatctct cagtctctta cgtctctcac agtattaaga 8521 aaaacccagg gtgaatggga agcttgccat aggtcatgga tgggcaggag tcctcccaaa 8581 acccttctga catactctat ccagaatgcc acctgaactc tcccatagtc agggggaaga 8641 tagcacagtt gcacgtcttg ttagatgtga accagcccta cagactgaag gacgacagca 8701 taataaatat tacaaagcac aaaattagga acggaggatt gtcccctcgt caaattaaga 8761 tcaggtctct gggtaaggct cttcaacgca caataaagga tttagaccga tacacgtttg 8821 aaccgtaccc aacctactct cacgaattac ttaggcttga tataccagag atatgtgaca 8881 aaatccgatc cgtcttcgcg gtctcggatc ggctgaccag ggagttatct agtgggttcc 8941 aggatctttg gttgaatatc ttcaagcaac taggcaatat agaaggaaga gaggggtacg 9001 atccgttgca ggatatcggc accatcccgg agataactga taagtacagc aggaatagat 9061 ggtataggcc attcctaact tggttcagca tcaaatatga catgcggtgg atgcagaaga 9121 ccagaccggg gggacccctt gatacctcta attcacataa cctcctagaa tgcaaatcat 9181 acactctagt aacatacgga gatcttgtca tgatactgaa caagttgaca ttgacagggt 9241 atatcctaac ccctgagctg gtcttgatgt attgtgatgt tgtagaagga aggtggaata 9301 tgtctgctgc agggcatcta gataagaagt ccattgggat aacaagcaaa ggtgaggaat 9361 tatgggaact agtggattcc ctcttctcaa gtcttggaga ggaaatatac aatgtcatcg 9421 cactattgga gcccctatca cttgctctca tacaactaaa tgatcctgtt atacctctac 9481 gtggggcatt tatgaggcat gtgttgacag agctacagac tgttttaaca agtagagacg 9541 tgtacacaga tgctgaagca gacactattg tggagtcgtt actcgccatt ttccatggaa 9601 cctctattga tgagaaagca gagatctttt ccttctttag gacatttggc caccccagct 9661 tagaggctgt cactgccgcc gacaaggtaa gggcccatat gtatgcacaa aaggcaataa 9721 agcttaagac cctatacgag tgtcatgcag ttttttgcac tatcatcata aatgggtata 9781 gagagaggca tggcggacag tggcccccct gtgacttccc tgatcacgtg tgtctagaac 9841 taaggaacgc tcaagggtcc aatacggcaa tctcttatga atgtgctgta gacaactata 9901 caagtttcat aggcttcaag tttcggaagt ttatagaacc acaactagat gaagatctca 9961 caatatatat gaaagacaaa gcactatccc ccaggaagga ggcatgggac tctgtatacc 10021 cggatagtaa tctgtactat aaagccccag agtctgaaga gacccggcgg cttattgaag 10081 tgttcataaa tgatgagaat ttcaacccag aagaaattat caattatgtg gagtcaggag 10141 attggttgaa agacgaggag ttcaacatct cgtacagtct caaagagaaa gagatcaagc 10201 aagagggtcg tctattcgca aaaatgactt ataagatgcg agccgtacag gtgctggcag 10261 agacactact ggctaaagga ataggagagc tattcaggga aaatgggatg gttaagggag 10321 agatagacct acttaaaaga ttgactactc tttctgtctc aggcgtcccc aggactgatt 10381 cagtgtacaa taactctaaa tcatcagaga agagaaacga aggcatggga aataagaact 10441 ctggggggta ctgggacgaa aagaagaggt ccagacatga attcaaggca acagattcat 10501 caacagacgg ctatgaaaca ttaagttgct tcctcacaac agacctcaag aaatactgct 10561 taaactggag atttgagagt actgcattgt ttggtcagag atgcaacgag atatttggct 10621 tcaagacctt ctttaactgg atgcatccag tccttgaaag gtgtacaata tatgttggag 10681 atccttactg tccagtcgcc gaccggatgc atcgacaact ccaggatcat gcagactctg 10741 gcattttcat acataatcct agggggggca tagaaggtta ctgccagaag ctgtggacct 10801 taatctcaat gagtgcaatc cacctagcag ctgtgagagt gggtgtcagg gtctctgcaa 10861 tggttcaggg tgacaatcaa gctatagccg tgacatcaag agtacctgta gctcagactt 10921 acaagcagaa gaaaaatcat gtctatgagg agatcaccaa atatttcggt gctctaagac 10981 acgtcatgtt tgatgtaggg cacgagctaa aattgaacga gaccatcatt agtagcaaga 11041 tgtttgtcta tagtaaaagg atatactatg atgggaagat tttaccacag tgcctgaaag 11101 ccttgaccaa gtgtgtattc tggtccgaga cactggtaga tgaaaacaga tctgcttgtt 11161 cgaacatctc aacatccata gcaaaagcta tcgaaaatgg gtattctcct atactaggct 11221 actgcattgc gttgtataag acctgtcagc aggtgtgcat atcactaggg atgactataa 11281 atccaactat cagcccgacc gtaagagatc aatactttaa gggtaagaat tggctgagat 11341 gtgcagtgtt gattccagca aatgttggag gattcaacta catgtctaca tctagatgct 11401 ttgttagaaa tattggagac cccgcagtag cagccctagc tgatctcaaa agattcatca 11461 gagcggatct gttagacaag caggtattat acagggtcat gaatcaagaa cccggtgact 11521 ctagttttct agattgggct tcagaccctt attcgtgtaa cctcccgcat tctcagagta 11581 taactacgat tataaagaat atcactgcta gatctgtgct gcaggaatcc ccgaatcctc 11641 tactgtctgg tctcttcacc gagactagtg gagaagagga tctcaacctg gcctcgttcc 11701 ttatggaccg gaaagtcatc ctgccgagag tggctcatga gatcctgggt aattccttaa 11761 ctggagttag ggaggcgatt gcagggatgc ttgatacgac caagtctcta gtgagagcca 11821 gcgttaggaa aggaggatta tcatatggga tattgaggag gcttgtcaat tatgatctat 11881 tgcagtacga gacactgact agaactctca ggaaaccggt gaaagacaac atcgaatatg 11941 agtatatgtg ttcagttgag ctagctgtcg gtctaaggca gaaaatgtgg atccacctga 12001 cttacgggag acccatacat gggctagaaa caccagaccc tttagagctc ttgaggggaa 12061 tatttatcga aggttcagag gtgtgcaagc tttgcaggtc tgaaggagca gaccccatct 12121 atacatggtt ctatcttcct gacaatatag acctggacac gcttacaaac ggaagtccgg 12181 ctataagaat cccctatttt ggatcagcca ctgatgaaag gtcggaagcc caactcgggt 12241 atgtaagaaa tctaagcaaa cccgcaaagg cggccatccg gatagctatg gtgtatacgt 12301 gggcctacgg gactgatgag atatcgtgga tggaagccgc tcttatagcc caaacaagag 12361 ctaatctgag cttagagaat ctaaagctgc tgactcctgt ttcaacctcc actaatctat 12421 ctcataggtt gaaagatacg gcaacccaga tgaagttctc tagtgcaaca ctagtccgtg 12481 caagtcggtt cataacaata tcaaatgata acatggcact caaagaagca ggggagtcga 12541 aggatactaa tctcgtgtat cagcagatta tgctaactgg gctaagcttg ttcgagttca 12601 atatgagata taagaaaggt tccttaggga agccactgat attgcactta catcttaata 12661 acgggtgctg tataatggag tccccacagg aggcgaatat ccccccaagg tccacattag 12721 atttagagat tacacaagag aacaataaat tgatctatga tcctgatcca ctcaaggatg 12781 tggaccttga gctatttagc aaggtcagag atgttgtaca cacagttgac atgacttatt 12841 ggtcagatga tgaagttatc agagcaacca gtatctgtac tgcaatgacg atagctgata 12901 caatgtctca attagataga gacaacttaa aagagatgat cgcactagta aatgacgatg 12961 atgtcaacag cttgattact gagtttatgg tgattgatgt tcctttattt tgctcaacgt 13021 tcgggggtat tctagtcaat cagtttgcat actcactcta cggcttaaac atcagaggaa 13081 gggaagaaat atggggacat gtagtccgga ttcttaaaga tacctcccac gcagttttaa 13141 aagtcttatc taatgctcta tctcatccca aaatcttcaa acgattctgg aatgcaggtg 13201 tcgtggaacc tgtgtatggg cctaacctct caaatcagga taagatactc ttggccctct 13261 ctgtctgtga atattctgtg gatctattca tgcacgattg gcaagggggt gtaccgcttg 13321 agatctttat ctgtgacaat gacccagatg tggccgacat gaggaggtcc tctttcttgg 13381 caagacatct tgcataccta tgcagcttgg cagagatatc tagggatggg ccaagattag 13441 aatcaatgaa ctctctagag aggctcgagt cactaaagag ttacctggaa ctcacatttc 13501 ttgatgaccc ggtactgagg tacagtcagt tgactggcct agtcatcaaa gtattcccat 13561 ctactttgac ctatatccgg aagtcatcta taaaagtgtt aaggacaaga ggtataggag 13621 tccctgaagt cttagaagat tgggatcccg aggcagataa tgcactgtta gatggtatcg 13681 cggcagaaat acaacagaat attcctttgg gacatcagac tagagcccct ttttgggggt 13741 tgagagtatc caagtcacag gtactgcgtc tccgggggta caaggagatc acaagaggtg 13801 agataggcag atcaggtgtt ggtctgacgt taccattcga tggaagatat ctatctcacc 13861 agctgaggct ctttggcatc aacagtacta gctgcttgaa agcacttgaa cttacctacc 13921 tattgagccc cttagttgac aaggataaag ataggctata tttaggggaa ggagctgggg 13981 ccatgctttc ctgttatgac gctactcttg gcccatgcat caactattat aactcagggg 14041 tatactcttg tgatgtcaat gggcagagag agttaaatat atatcctgct gaggtggcac 14101 tagtgggaaa gaaattaaac aatgttacta gtctgggtca aagagttaaa gtgttattca 14161 acgggaatcc tggctcgaca tggattggga atgatgagtg tgaggctttg atttggaatg 14221 aattacagaa tagctcgata ggcctagtcc actgtgacat ggagggagga gatcataagg 14281 atgatcaagt tgtactgcat gagcattaca gtgtaatccg gatcgcgtat ctggtggggg 14341 atcgagacgt tgtgcttata agcaagattg ctcccaggct gggcacggat tggaccaggc 14401 agctcagcct atatctgaga tactgggacg aggttaacct aatagtgctt aaaacatcta 14461 accctgcttc cacagagatg tatctcctat cgaggcaccc caaatctgac attatagagg 14521 acagcaagac agtgttagct agtctcctcc ctttgtcaaa agaagatagc atcaagatag 14581 aaaagtggat cttaatagag aaggcaaagg ctcacgaatg ggttactcgg gaattgagag 14641 aaggaagctc ttcatcaggg atgcttagac cttaccatca agcactgcag acgtttggct 14701 ttgaaccaaa cttgtataaa ttgagcagag atttcttgtc caccatgaac atagctgata 14761 cacacaactg catgatagct ttcaacaggg ttttgaagga tacaatcttc gaatgggcta 14821 gaataactga gtcagataaa aggcttaaac taactggtaa gtatgacctg tatcctgtga 14881 gagattcagg caagttgaag acaatttcta gaagacttgt gctatcttgg atatctttat 14941 ctatgtccac aagattggta actgggtcat tccctgacca gaagtttgaa gcaagacttc 15001 aattgggaat agtttcatta tcatcccgtg aaatcaggaa cctgagggtt atcacaaaaa 15061 ctttattaga caggtttgag gatattatac atagtataac gtatagattc ctcaccaaag 15121 aaataaagat tttgatgaag attttagggg cagtcaagat gttcggggcc aggcaaaatg 15181 aatacacgac cgtgattgat gatggatcac tgggtgatat cgagccatat gacagctcgt 15241 aataattagt ccctatcgtg ccgaacgatc gaagctccgc ggtacctgga agtcttggac 15301 ttgtccatat gacaatagta agaaaaactt acaagaagac aagaaaattt aaaaggatac 15361 atatctctta aactcttgtc tggt // LOCUS HUMKRASPO 111 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human K-ras proto-oncogene, exon 1. ACCESSION M34904 KEYWORDS K-ras proto-oncogene. SOURCE Human normal fetal liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 111) AUTHORS Santos,E., Martin-Zanca,D., Reddy,E.P., Pierotti,M.A., Della Porta,G. and Barbacid,M. TITLE Malignant activation of a K-ras oncogene in lung carcinoma but not in normal tissue of the same patient JOURNAL Science 223, 661-664 (1984) STANDARD simple staff_entry FEATURES from to/span description pept 1 / 111 K-ras p21 protein, exon 1 BASE COUNT 35 a 16 c 30 g 30 t ORIGIN 1 atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 61 atacagctaa ttcagaatca ttttgtggac gaatatgatc caacaataga g // LOCUS HUMGLYE 498 bp ss-mRNA PRI 18-JUL-1990 DEFINITION Human glycophorin E mRNA, complete cds. ACCESSION M29610 J05183 M29620 KEYWORDS glycophorin. SOURCE Human erythroleukemia proerythroblast cell line K562, cDNA to mRNA, clone pHGpE-7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 498) AUTHORS Kudo,S. and Fukuda,M. TITLE Identification of a novel human glycophorin, glycophorin E, by isolation of genomic clones and complementary DNA clones utilizing polymerase chain JOURNAL J. Biol. Chem. 265, 1102-1110 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Kudo 31-OCT-1989. FEATURES from to/span description pept 57 293 glycophorin E signal 470 481 polyA signal BASE COUNT 128 a 107 c 94 g 169 t ORIGIN 1 agttgtcttt ggtagttttt ttgcactaac ttcaggagcc agctcgtgat ctcaggatgt 61 atggaaaaat aatctttgta ttactattgt cagaaattgt gagcatatca gcatcaagta 121 ccactggtgt ggcaatgcac acttcaacct cttcttcagt cacaaagagt tacatctcat 181 cacagacaaa tgggataaca ctcattaatt ggtgggcgat ggctcgtgtt atttttgagg 241 tgatgcttgt tgttgttgga atgatcatct taatttctta ctgtattcga tgactgataa 301 aggcatgagg atgtggcctg catgctgcct gatcttgcct agaaccagct gcacctgctg 361 ttctcttgtt atgcaaactg gctgcacctg ctattccttt gcttatgccc caacccttgg 421 ctatcctaac tccctgttct cctgcctatt actgtattct ctacttctaa ataaaaataa 481 aacaaaatac aaattatt // LOCUS BAMSYNAMY 85 bp ds-DNA SYN 18-JUL-1990 DEFINITION Synthetic B.amyloliquefaciens alpha-amylase gene promoter region. ACCESSION M34931 KEYWORDS . SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 85) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 72 > 85 synthetic alpha-amylase BASE COUNT 34 a 8 c 19 g 24 t ORIGIN 1 actcggggtt gttattattt tatcgatatg taaaatataa tttctagaag aaaagaaggt 61 ggagaggaaa catgatccaa aaacg // LOCUS PMP5END 139 bp ss-RNA VRL 18-JUL-1990 DEFINITION Papaya mosaic virus, 5' end. ACCESSION M35054 KEYWORDS . SOURCE Papaya mosaic virus RNA. ORGANISM Papaya mosaic potexvirus Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Potexvirus. REFERENCE 1 (bases 1 to 139) AUTHORS Lok,S. and Abouhaidar,M.G. TITLE The nucleotide sequence of the 5' end of papaya mosaic virus RNA: Site of in vitro assembly initiation JOURNAL Virology 153, 289-296 (1986) STANDARD simple staff_entry BASE COUNT 58 a 23 c 30 g 27 t 1 others ORIGIN 1 ggaaaagaaa cacaaagcaa agcaaagcaa agcaactcaa ataaancata tttggtcaag 61 gcacttggta atcaaacggg cacaacccta gattaatgat atggcaaatt tgaggagtgt 121 tttcgaacag ttgaacgag // LOCUS MZEEMA 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Z.mexicana 180-bp knob-specific repeat region. ACCESSION M32533 KEYWORDS . SOURCE Z.mexicana (Teosinte) 6-day-old seedling DNA, clone pEm1. ORGANISM Zea mexicana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review COMMENT Zea mexicana is also known as Zea mays subspecies mexicana. BASE COUNT 58 a 38 c 39 g 45 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccattttc aataatatcg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat // LOCUS MZEEMB 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Z.mexicana 180-bp knob-specific repeat region. ACCESSION M32534 KEYWORDS . SOURCE Z.mexicana (Teosinte) 6-day-old seedling DNA, clone pEm2. ORGANISM Zea mexicana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review COMMENT Zea mexicana is also known as Zea mays subspecies mexicana. BASE COUNT 60 a 40 c 38 g 42 t ORIGIN 1 ggccacacaa accccatttt tgtcgaaaat agccatgaac gaccattttc aataatatcg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctacgac aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat // LOCUS MZEZMA 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32521 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-5. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 40 c 39 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat // LOCUS MZEZMB 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32522 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-21. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 40 c 39 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat // LOCUS MZEZMC 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32523 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-10. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 41 c 38 g 43 t ORIGIN 1 ggccacacaa cccccatttt tttcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gacgatcaat // LOCUS MZEZMD 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32524 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-25. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 39 c 40 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcgtctggtc gatgatcaat // LOCUS MZEZME 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32525 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm2-3(I). ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 40 c 38 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctagtc gatgatcaat // LOCUS MZEZMF 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32526 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-7. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 40 c 40 g 41 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aaaaataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctgtggtc gacgatcaat // LOCUS MZEZMG 179 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32527 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-26. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 179) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 38 c 38 g 44 t ORIGIN 1 ggccacaaac ccccattttt gtcgaaaata gccatgaacg accattttca ataataccga 61 aggctaacac ctatggattt ttgaccaaga aatggtctcc accagaaatc caagaatgtg 121 atctatggca aggaaacata tgtggggtga ggtgtatgag cctctagtcg atgatcaat // LOCUS MZEZMH 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32528 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-14. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 61 a 39 c 38 g 42 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccatttac aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaaaaatgt 121 gatctatgga aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat // LOCUS MZEZMI 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32529 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-13. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 61 a 40 c 36 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat aaccatgaac gaccattttc aataatatcg 61 aaggctaaca cctatggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 aatctacgac aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat // LOCUS MZEZMJ 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32530 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm4-1. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 35 c 38 g 48 t ORIGIN 1 ggccacacaa cccgcatttt ttttgaaaat agccatgaat gaccattttc aataatatta 61 gaggctaaca cctacgagtt tttaaccaag aaatggtctc caccagaaat tcaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gacgatcaat // LOCUS MZEZMK 179 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32531 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm2-1(I). ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 179) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 62 a 39 c 37 g 41 t ORIGIN 1 ggccacacaa cccccatttt ttcaaaaata gccatgaacg accattttca aaaatacgaa 61 aggctaacac gtacggattt ttgaccatga aatggtctcc accagaaatc cgaaaatgag 121 atctacgaca aggaaacata tgtggggtga ggtgtatgag cctctggtcg atgatcaat // LOCUS MZEZML 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION Maize 180-bp knob-specific repeat region. ACCESSION M32532 KEYWORDS . SOURCE Maize 6-day-old seedling DNA, clone pZm2-1(II). ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 34 c 40 g 47 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaat gaccattttc aaaaatacga 61 aaggctaaca cgtacggatt tttgaccatg aaatggtttt caccagaaat tcaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcctctggtc gatgatcaat // LOCUS TRD2AA 177 bp ds-DNA PLN 18-JUL-1990 DEFINITION T.dactyloides 180-bp knob-specific repeat region. ACCESSION M32535 KEYWORDS . SOURCE T.dactyloides 6-day-old seedling DNA, clone 2. ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 177) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 60 a 38 c 40 g 39 t ORIGIN 1 ggccacacaa cccccattta tgtcggaaat agcgatgaac gaccattttc aataatacga 61 aggtaacacc tacgaaattt tgaccaagat aggtctcaac cagaaatcca agaatgtgac 121 ctatggcaag gaaacatatg tggggtgagg tgtacgagcc tctggtcgat gatcaat // LOCUS TRD2AB 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION T.dactyloides 180-bp knob-specific repeat region. ACCESSION M32536 KEYWORDS . SOURCE T.dactyloides 6-day-old seedling DNA, clone 4. ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 59 a 40 c 40 g 41 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agcgatgaac gaccattttc aataatacca 61 aaggctaaca cctatggatt ttggaccaag aaatggtctc caccagaaat ccaagaatgt 121 gacctatggc aaggaaacat atgtggggtg aggtgtacga gcctctggtc gatgatcaat // LOCUS TRD2AC 180 bp ds-DNA PLN 18-JUL-1990 DEFINITION T.dactyloides 180-bp knob-specific repeat region. ACCESSION M32537 KEYWORDS . SOURCE T.dactyloides 6-day-old seedling DNA, clone 7. ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 60 a 41 c 37 g 42 t ORIGIN 1 ggccacacaa cccccatttt tgtttaaaat agccaagaac gaccattttc aataataccg 61 aatgctaaca cctatggatt attgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gacctatggc aaggaaacat atgtggggtg aggtgtacga gcctccggtc gatgatcaat // LOCUS TRD2AD 179 bp ds-DNA PLN 18-JUL-1990 DEFINITION T.dactyloides 180-bp knob-specific repeat region. ACCESSION M32538 KEYWORDS . SOURCE T.dactyloides 6-day-old seedling DNA, clone 8. ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 179) AUTHORS Dennis,E.S. and Peacock,W.J. TITLE Knob heterochromatin homology in maize and its relatives JOURNAL J. Mol. Evol. 20, 341-350 (1984) STANDARD simple staff_review BASE COUNT 58 a 39 c 38 g 44 t ORIGIN 1 ggccacacaa cccccatttt ttttgaaaat agccatgaac gaccattttc aataataccg 61 aaggtaacac ctacggattt ttgaccaaga aatggtctcc accagaaatc caagaatgtg 121 acctatggca aggaaacata tgtggggtga ggtgtatgag cctctggtcg atgatcaat // LOCUS VSVNNJ 1329 bp ss-mRNA VRL 18-JUL-1990 DEFINITION Vesicular stomatitis virus (New Jersey) nucleocapsid protein mRNA, complete cds. ACCESSION K02379 M35062 KEYWORDS nucleocapsid protein; protein N. SOURCE VSV (New Jersey serotype; Ogden strain), cDNA to mRNA, clones 53 and 77. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Banerjee,A.K., Rhodes,D.P. and Gill,D.S. TITLE Complete nucleotide sequence of the mRNA coding for the N protein of vesicular stomatitis virus (New Jersey serotype) JOURNAL Virology 137, 432-438 (1984) STANDARD full staff_review REFERENCE 2 (bases 1234 to 1329) AUTHORS Rhodes,D.P. and Banerjee,A.K. TITLE Poly(A)-adjacent sequence of the 14.5 S mRNA of vesicular stomatitis virus (New Jersey serotype) JOURNAL Virology 105, 297-300 (1980) STANDARD simple staff_entry FEATURES from to/span description pept 14 1282 nucleocapsid protein mRNA 1 1329 N mRNA conflict 1264 1264 a in [1]; g in [2] BASE COUNT 443 a 271 c 281 g 334 t ORIGIN 182 bp upstream of BstNI site. 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcca taattcagcc 61 taaactaccg gccaatgagg atccggtaga gtatccagct gattacttca aaaataacac 121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttaaggg cctatgtcta 181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc 241 gttaaaggga gtagaaggaa cattagacag agactgggtt agctttggaa gaacgatcgg 301 aaagagggaa gagaatgtca aaatcttcga tctagtcaaa gttgaggaac tgaagacagc 361 tcttcctgat gggaaatcgg atccagatcg atcagctgaa gatgataaat ggcttcctat 421 ttacatccta ggcctatatc gagtgggtag atccaaagtt acagattaca gaaagaaact 481 cctggacggt ctagaaaatc agtgtagagt ggcatcaacc agatttgaaa gccttgttga 541 agatggtctt gacttctttg acatctggga aaatgatccc aacttcacca aaatagttgc 601 tgcagtggat atgttttttc acatgttcaa gaagcatgaa cgtgctccaa tcagatatgg 661 aaccatagtc tcacgattca aagactgtgc agcacttgct acatttggac atctcagcaa 721 ggttagcgga ctttcaattg aagacctaac aacatgggtt ctgaatcgag aagtcgcaga 781 cgaactatgc caaatgatgt acccaggcca agaaattgac aaggcagatt catacatgcc 841 atatatgatt gatttcgggt tatcccaaaa atctccttat tcatcagtga agaatccagc 901 tttccacttc tggggtcaac tggccgcact cttgctaaga tcaactcgag caaaaaatgc 961 tcggcaaccc gatgacattg agtacacttc gctgacttgt gcaagtttac tactgtcatt 1021 tgcagttgga tcttcagcag acattgaaca acaattctac atcggagagg acaaatacac 1081 aacagagaaa gatgacagtc tcaagaaatc tgatgtccca ccgaaaggaa gaaatgttgt 1141 agactggctc ggttggtatg atgacaatgg agggaaaccc acaccggata tgctcaactt 1201 cgcacgaaga gcagtcagtt cgttgcagtc actccgcgag aaaacaattg gcaaatacgc 1261 aaaagtagaa tttgacaaat gaccaggttt gacatatcaa tatcaatatt atattctgtg 1321 ctatatatg // LOCUS TNCPOLY 3660 bp ss-RNA VRL 18-JUL-1990 DEFINITION Tobacco necrosis virus RNA-dependent RNA polymerase and coat protein genes, complete cds. ACCESSION M33002 KEYWORDS RNA-dependent RNA polymerase; coat protein; polymerase. SOURCE Tobacco necrosis virus (strain A, isolate TNV-A-FM1B) Wk6 cell line, cDNA to viral RNA, clones pFM2[0,1,2,3]. ORGANISM Tobacco necrosis virus Viridae; ss-RNA nonenveloped viruses. REFERENCE 1 (bases 1 to 3660) AUTHORS Meulewaeter,F., Seurinck,J. and van Emmelo,J. TITLE Genome structure of tobacco necrosis virus strain A JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Meulewaeter, 20-MAR-1990, for release after publication. FEATURES from to/span description pept 60 665 RNA-dependent RNA polymerase (put.) /transl_except=(666..668) 669 2234 RNA-dependent RNA polymerase pept 2218 2436 ORF 3 pept 2440 2610 ORF 4 pept 2613 3443 coat protein pept 3467 3646 ORF 6 site 666 668 readthrough codon BASE COUNT 1034 a 890 c 895 g 841 t ORIGIN 1 agtattcata ccaagaatac caaataggtg caaggcctta ctcagctaaa gagtctaaaa 61 tggagctacc aaaccaacac aagcaaacgg ccgccgaggg tttcgtatct ttcctaaact 121 ggctatgcaa cccatggaga cgacagcgaa cagtcaacgc tgcagttgcg ttccaaaaag 181 atcttctcgc cattgaggat tccgagcatt tggatgacat caatgagtgt ttcgaggagt 241 ctgctggggc acaatctcag cgaactaagg ttgtcgccga cggagcatat gcccccgcaa 301 aatccaacag gacccgccga gttcgtaagc agaagaagca caagtttgta aaatatcttg 361 tcaacgaagc tcgtgccgag tttggattgc ccaaaccaac tgaggcaaac agacttatgg 421 tccaacattt cttgctcaga gtgtgcaagg attggggcgt tgttactgcc cacgtacacg 481 gcaatgttgc actagctttg ccactggtgt tcatcccaac ggaagatgat ctgctatcac 541 gagcattgat gaacacacat gctactagag ccgctgtacg aggcatggac aatgtccaag 601 gggaggggtg gtggaacaat aggttgggga ttgggggcca ggtcggactg gccttccggt 661 ccaaataggg gtgccttgaa aggaggccag gattctccac gtccgtttcg cgtggggaac 721 atcctgatct ggtggtcata ccatcagggc gccctgagaa acagcgtcag ttgttacgct 781 atagtggtat aggcggccat ttattaatcg gcatccacaa caactctctt tccaacctgc 841 gtaggggctt gatggaaaga gtattctatg tcgaggggcc caatgggctt caagacgccc 901 ctaagcccgt caagggagct tttcgaaccc ttgataagtt tcgtgatctc tatactaaaa 961 atagttggcg tcatacccct gtaactagtg aacaattcct aatgaattac acgggcagga 1021 aactgactat ttacagagag gcggttgata gtttgtcgca tcaacccctt agctcacgag 1081 atgcgaaact aaagacattc gtgaaggccg aaaaattaaa tctttctaag aagcctgacc 1141 ctgctcccag ggtcatccaa cctagatcgc ctcggtataa cgtttgtttg ggcaggtacc 1201 tccgacatta tgagcatcac gcgtttaaaa ccattgccaa gtgctttggg gaaatcacgg 1261 tcttcaaagg gtttactctg gagcaacaag gggaaatcat gcgctcgaag tggaataaat 1321 atgttaatcc cgtcgcagtc ggactcgacg ccagtcgttt cgaccaacac gtgtctgttg 1381 aagcactcga gtatgagcat gaattttacc tcagagacta cccaaatgat aaacagctaa 1441 aatggctgct aaagcagcaa ttgtgcaacg taggaacggc attcgccagt gacggcatta 1501 taaaatacaa gaagaagggt tgtagaatga gcggagacat gaacacgagc ttgggcaact 1561 gcattctaat gtgcgccatg gtctacgggt tgaaagaaca cttaaacatc aatttgtccc 1621 ttgcaaataa tggggatgac tgcgtcattg tctgtgagaa agcggattta aagaaattga 1681 caagcagcat cgagccatat ttcaagcagt ttggattcaa gatggaagtg gaaaaacccg 1741 tggatatatt tgagcgcata gaattttgcc aaacccaacc tgtgttcgat ggatcccagt 1801 acatcatggt acgcaaacct tctgtggtaa catctaaaga cgtcactagc cttatcccat 1861 gtcaaacgaa agcacaatac gcagaatggc tgcaagctgt aggtgagtgt ggcatgagca 1921 ttaacggtgg gattcctgtc atgcagaatt tctaccaaaa gctccaaact ggcatccgcc 1981 gcacaaaatt caccaagacc ggcgagttcc agacgaacgg attggggtat cactctagat 2041 atatgcatag agtggcccgg gttccttcgc ctgaaacccg tttatccttc tatctagctt 2101 tcggtatcac accagacctc caagaagcat tggagatctt ctatgatacc cacaagcttg 2161 agttggatga tgttatccca actgatacct accaagtgtc aggagagcat ttgatcaatg 2221 gattaccaaa ctgatgtaac ggaggacaat gtgcaaatac gcggtcgggc taggagcgtt 2281 gagggtaaga aacacaatgg ttcgggatta actggcgtta agcgtcacgc ggtgagcgaa 2341 acatctcaga aatcacagca aggtactggc aatggaacta tgaccaatat agccgaagaa 2401 cagaccatta ccgtgacata caactttaac ttttaagtta tggctgcgtg tcgctgttgt 2461 gatacttcac caggtattac actattccct tactttgcaa ttctcatcct tatattggca 2521 atacttgttg tagggactcc caatcaacaa tatcaccatt ctccaagcac ttacgagtac 2581 aagactcaac acatttcgat cgcaaaatag acatggcagg aaagaagaac aacaacaacg 2641 gtcagtatat aatactgcgt actccagagc aacaggtgga gatagaccag cgcaacgccc 2701 gtcgtgctca aatgggtcgc atgaagaagg ctagacagcc cgttcagcga tacttacagc 2761 aacacgggtt gcgaaacgga ttgtccggta gagggggcta catagtggct cccacctccg 2821 ggggggttgt cactcgaccc atagtgccga aattctccaa caggggagat tccactatag 2881 tccgtaacac tgagattttg aacaaccaaa tcttagcggc gctaggcgca ttcaatacaa 2941 caaactccgc actgattgca gcagcaccat catggctggc tagcatcgct gatctttaca 3001 gtaaatacag atggctctca tgtgagatca tctacattcc aaaatgcccc accaccacca 3061 gtggatcaat tgccatggct ttcacatacg acagaaatga cgctgcaccc accgcaaggg 3121 ctcagctgtc acaatcttac aaggccatca attttccacc gtatgcggga tacgacggag 3181 cagcatattt gaattcgaac cagggagctg ggtcagccat cgccgttcaa cttgatgtta 3241 ccaagttgga caagccatgg taccccacta tctcctctgc cggcttcggg gcgctcagcg 3301 tcctcgatca gaaccaattc tgccccgcgt cccttgtggt cgctagcgat gggggacccg 3361 ctactgctac tccagcaggg gaccttttca tcaagtacgt gattgagttc attgaaccaa 3421 tcaacccaac aatgaacgtc tagttctttg tactgtaact tggctaatgc ctaaggtgga 3481 gtcacaccat tggagacgga gacggatcct gggaaacagg cttgacgggc ggggggtggt 3541 gcccccgacg acgcatcact ccggatacca atggtacacc actatggcag ggtctgccaa 3601 ggtcttgtgc accaagaacc cctggaaacg ggggggaggg gggtagcaca tatcatccag // LOCUS BLYPROZ 75 bp ss-mRNA PLN 18-JUL-1990 DEFINITION Barley protein Z mRNA, partial cds. ACCESSION M35065 KEYWORDS albumin; protein Z. SOURCE Barley, cDNA to mRNA, clone paz1-1. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 75) AUTHORS Rasmussen,S.K., Hopp,H.E., Brandt,A., Svendsen,I. and Hejgaard,J. TITLE A cDNA clone for protein Z, a major barley endosperm albumin JOURNAL Carlsberg Res. Commun. 49, 385-390 (1984) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 75 protein Z (AA at 1) BASE COUNT 12 a 19 c 19 g 25 t ORIGIN 1 cccctgaagg tggatttggt ggattttgtc gcgaatcacc ctttcctctt ccttattcgg 61 gaagacattg ccggt // LOCUS BSURPOF 3078 bp ds-DNA BCT 18-JUL-1990 DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB), complete cds. ACCESSION M34995 KEYWORDS RNA polymerase; minor sigma-37 factor. SOURCE B.subtilis (strains 168 and Marburg), clones lambda-[21,gt11,1,11,21,31]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 368 to 2297) AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W. TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus subtilis RNA polymerase: Isolation, nucleotide sequence, chromosomal locus, and cryptic function JOURNAL J. Bacteriol. 169, 771-778 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 3078) AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W. TITLE Similar organization of the sigB and spoIIA operons encoding alternate sigma factors of Bacillus subtilis RNA polymerase JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.W.Price, 18-FEB-1987. Draft entry and computer-readable sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990. FEATURES from to/span description pept 200 529 ORF V pept 526 1008 ORF W pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg start codon) pept 1762 2361 ORF X mRNA 131 2368 sigB operon mRNA signal 131 168 sigB operon promoter signal 2368 2401 sigB transcription termination signal binding 188 194 ORF V ribosomal binding site binding 511 519 ORF W ribosomal binding site binding 954 961 sigB ribosomal binding site binding 1750 1753 ORF X ribosomal binding site BASE COUNT 999 a 575 c 755 g 749 t ORIGIN 40 degrees on B.subtilis map. 1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt 61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt 121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg 181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat 241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct 301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat 361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg 421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt 481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct 541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg 601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc 661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg 721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa 781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac 841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat 901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg 961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga 1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg 1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca 1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc 1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg 1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc 1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga 1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg 1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac 1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca 1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct 1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat 1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga 1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta 1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct 1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac 1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc 1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg 2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt 2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc 2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata 2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta 2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat 2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt 2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg 2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac 2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga 2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac 2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca 2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag 2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa 2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt 2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt 2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc 3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg 3061 caaactgctg cgaagctt // LOCUS HUMERPR 1624 bp ss-mRNA PRI 18-JUL-1990 DEFINITION Human erythropoietin receptor mRNA, complete cds. ACCESSION M34986 KEYWORDS erythropoietin receptor. SOURCE Human cell line OCI-M1, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1624) AUTHORS Ehrenman,K.St. and St John,T. TITLE Sequence of the human erythropoietin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.St.John, 07-JUN-1990. Author address: T.St.John Hutchinson Cancer Res Center 1124 Columbia St. Seattle, WA Email: TSTJOHN@FHCRCVAX FEATURES from to/span description pept 1 1527 erythropoietin receptor precursor sigp 1 72 erythropoietin receptor signal peptide matp 73 1524 erythropoietin receptor BASE COUNT 276 a 537 c 498 g 313 t ORIGIN 1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct 61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc 121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg 181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc 241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt 301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta 361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat 421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc 481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac 541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag 601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc 661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg 721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg 781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag 841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac 901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc 961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg 1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc 1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg 1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc 1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct 1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca 1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct 1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta 1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg 1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata 1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc 1621 agga // LOCUS HUMGFP40H 4379 bp ds-DNA PRI 18-JUL-1990 DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete cds. ACCESSION M30135 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor; megakeryoblastic leukemia cell growth factor. SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4379) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 12-DEC-1989. FEATURES from to/span description pept 796 909 P40 T-cell and mast cell growth factor precursor, exon 1 1022 1057 P40 T-cell and mast cell growth factor precursor, exon 2 1140 1172 P40 T-cell and mast cell growth factor precursor, exon 3 2461 2592 P40 T-cell and mast cell growth factor precursor, exon 4 4096 4215 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 796 849 P40 T-cell and mast cell growth factor signal peptide matp 850 909 P40 T-cell and mast cell growth factor 1022 1057 P40 T-cell and mast cell growth factor 1140 1172 P40 T-cell and mast cell growth factor 2461 2592 P40 T-cell and mast cell growth factor 4096 4212 P40 T-cell and mast cell growth factor IVS 910 1021 hP40 intron A IVS 1058 1139 hP40 intron B IVS 1173 2460 hP40 intron C IVS 2593 4095 hP40 intron D signal 742 747 TATA box signal 4354 4359 polyA signal rpt 2965 3265 Alu repeat binding 370 376 AP-2 binding site binding 624 629 IRF-1 binding site binding 627 633 AP-1 binding site BASE COUNT 1208 a 921 c 904 g 1346 t ORIGIN 1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt 61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc 121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca 181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct 241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact 301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc 361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag 421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa 481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt 541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt 601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa 661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt 721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc 781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc 841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac 901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat 961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta 1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac 1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg 1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg 1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat 1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt 1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt 1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag 1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa 1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc 1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg 1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc 1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc 1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat 1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac 1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc 1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg 1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt 2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc 2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga 2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc 2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa 2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa 2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt 2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag 2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg 2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac 2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt 2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc 2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc 2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct 2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag 2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa 2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg 3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg 3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat 3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca 3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc 3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt 3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac 3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg 3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc 3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt 3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca 3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca 3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact 3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga 3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt 3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc 3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc 3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg 4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa 4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac 4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg 4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc 4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga 4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa // LOCUS MEFDNABP 1710 bp ds-DNA BCT 18-JUL-1990 DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds. ACCESSION M34778 KEYWORDS DNA binding protein HMfB. SOURCE M.fervidus DNA. ORGANISM Methanothermus fervidus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (sites; for [2]) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. TITLE DNA binding protein HMf, isolated from the hypothermophilic archaeal Methanothermus fervidus is most closely related to histones JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1710) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Sandman, 01-JUN-1990. Author address: K.Sandman Department of Microbiology Ohio State University 484 W. 12th Ave. Columbus, Ohio 43210 E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU FEATURES from to/span description pept 578 787 DNA binding protein HMfB pept 1662 805 (c) ORF285 signal 450 457 promoter signal 523 530 promoter signal 790 814 transcription termination signal BASE COUNT 619 a 263 c 265 g 561 t 2 others ORIGIN 1 bp upstream of HindIII site. 1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa 61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag 121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag 181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg 241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat 301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat 361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa 421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt 481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt 541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat 601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct 661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag 721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa 781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat 841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc 901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt 961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat 1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa 1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg 1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata 1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc 1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc 1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc 1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat 1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc 1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt 1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt 1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa 1681 gttttgtttt acccattgct tctaaatatt // LOCUS MUSINT1A 5607 bp ds-DNA ROD 18-JUL-1990 DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds. ACCESSION K02593 M34750 KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene. SOURCE Mouse (BALB/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1098 to 5606) AUTHORS Van Ooyen,A. and Nusse,R. TITLE Structure and nucleotide sequence of the putative mammary oncogene int-1; Proviral insertions leave the protein-encoding domain intact JOURNAL Cell 39, 233-240 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 5607) AUTHORS Nusse,R., Theunisseen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A., Otte,A., Schuuring,E. and Van Ooyen,A. TITLE The promoter of the Wnt-1 (int-1) oncogene and its mechanism of activation by insertion of proviral DNA of the mouse mammary tumor virus JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [2] kindly submitted by R.Nusse, 31-MAY-1990. FEATURES from to/span description pept 1779 1882 Wnt-1 protein (exon 1) 2452 2705 Wnt-1 protein (exon 2) 3279 3544 Wnt-1 protein (exon 3) 4002 4490 Wnt-1 protein (exon 4) IVS 1883 2451 Wnt-1 cds intron A IVS 2706 3278 Wnt-1 cds intron B IVS 3545 4001 Wnt-1 cds intron C site 1133 1134 MMTV insertion site in tumor 17 [1] site 1394 1395 MMTV insertion site in tumor 35 [1] site 1631 1632 MMTV insertion site in tumor 102 [1] site 4495 4496 MMTV insertion site in tumor 53 [1] mRNA 1428 5398 Wnt-1 mRNA (alt.) mRNA 1595 5398 Wnt-1 mRNA (alt.) signal 1562 1566 TATA box signal 1573 1577 TATA box BASE COUNT 1188 a 1681 c 1519 g 1219 t ORIGIN Chromosome 15. 1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc 61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac 121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac 181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa 241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg 301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg 361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac 421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct 481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact 541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac 601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag 661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg 721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga 781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca 841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact 901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc 961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca 1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg 1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga 1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg 1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg 1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc 1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc 1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg 1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt 1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa 1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag 1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact 1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt 1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc 1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg 1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg 1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag 1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct 2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc 2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct 2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt 2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg 2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg 2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg 2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt 2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga 2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat 2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg 2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa 2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc 2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag 2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg 2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg 2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc 3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag 3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag 3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact 3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac 3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat 3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc 3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg 3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc 3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg 3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg 3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag 3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag 3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt 3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca 3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac 3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt 3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc 4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc 4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc 4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg 4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat 4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt 4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc 4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg 4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc 4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg 4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc 4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga 4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc 4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac 4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac 4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc 4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa 4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc 5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc 5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac 5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga 5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac 5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct 5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat 5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca 5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga 5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat 5581 gaccctaaat gagaccaact agggatc // LOCUS MUSP40M 3808 bp ds-DNA ROD 18-JUL-1990 DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete cds. ACCESSION M30136 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor. SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210 DNA, clone lambda-S40.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3808) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 21-NOV-1989. FEATURES from to/span description pept 644 757 P40 T-cell and mast cell growth factor precursor, exon 1 846 881 P40 T-cell and mast cell growth factor precursor, exon 2 965 997 P40 T-cell and mast cell growth factor precursor, exon 3 2184 2315 P40 T-cell and mast cell growth factor precursor, exon 4 3393 3512 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 644 697 P40 T-cell and mast cell growth factor signal peptide matp 698 757 P40 T-cell and mast cell growth factor 846 881 P40 T-cell and mast cell growth factor 965 997 P40 T-cell and mast cell growth factor 2184 2315 P40 T-cell and mast cell growth factor 3393 3509 P40 T-cell and mast cell growth factor IVS 758 845 mP40 intron A IVS 882 964 mP40 intron B IVS 998 2183 mP40 intron C IVS 2316 3392 mP40 intron D signal 589 594 TATA box binding 227 233 AP-2 binding site binding 475 480 IRF-1 binding site binding 478 484 AP-1 binding site BASE COUNT 1012 a 903 c 869 g 1024 t ORIGIN Chromosome 13. 1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca 61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt 121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag 181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc 241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga 301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt 361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt 421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga 481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg 541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta 601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat 661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg 721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga 781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac 841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag 901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt 961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc 1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct 1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat 1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga 1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa 1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac 1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac 1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc 1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca 1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta 1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac 1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc 1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct 1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta 1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat 1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg 1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc 1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg 2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa 2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt 2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt 2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa 2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca 2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc 2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag 2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg 2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct 2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag 2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg 2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag 2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat 2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc 2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct 2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc 3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga 3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg 3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga 3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa 3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt 3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc 3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc 3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga 3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg 3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag 3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga 3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac 3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat 3781 aggtaggatg gagcaaagac ttggtaat // LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 18-JUL-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete cds. ACCESSION M29938 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone ZQ30-1. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1105) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 13 777 MHC SLA-DQ alpha chain precursor sigp 13 81 MHC SLA-DQ alpha chain signal peptide matp 82 774 MHC SLA-DQ alpha chain mRNA < 1 1105 MHC SLA-DQ mRNA BASE COUNT 254 a 290 c 255 g 306 t ORIGIN 1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc 61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta 121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag 181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa 241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg 301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact 361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac 421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt 481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc 541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat 601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag 661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc 721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc 781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa 841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc 901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg 961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc 1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1081 aaacaataaa tcacttcatg aggtt // LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 18-JUL-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete cds. ACCESSION M29939 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 69 836 MHC SLA-DQ alpha chain precursor sigp 69 137 MHC SLA-DQ alpha chain signal peptide matp 138 833 MHC SLA-DQ alpha chain BASE COUNT 273 a 313 c 266 g 322 t ORIGIN 1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg 61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga 121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg 181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat 241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta 301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca 361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt 421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca 481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt 541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca 601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata 661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga 721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct 781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca 841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac 901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct 961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga 1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc 1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1141 aaacaataaa tcacttcatg aggtttttgt actg // LOCUS PIGMHDQBA 212 bp ds-DNA MAM 18-JUL-1990 DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end. ACCESSION M29944 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 212) AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F., Gustafsson,K. and LeGuern,C. TITLE Class II genes of miniature swine: II. Molecular identification and characterization of B (beta) genes from SLA-c haplotype JOURNAL Immunogenetics 31, 1-6 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept < 1 12 MHC SLA-DO beta chain BASE COUNT 38 a 58 c 52 g 64 t ORIGIN 1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg 61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc 121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct 181 cttacccttc tgcctttctt cctcctactt gt // LOCUS PP2LEND 266 bp ds-DNA PHG 18-JUL-1990 DEFINITION Bacteriophage P2 left end DNA. ACCESSION M34756 KEYWORDS . SOURCE Bacteriophage P2 DNA. ORGANISM Bacteriophage P2 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 266) AUTHORS Ziermann,R. and Calendar,R. TITLE Characterization of the cos sites of Bacteriophages P2 and P4 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Ziermann, 05-JUN-1990. Author address: R.Ziermann University of California Dept. Mol. Cell Biology Berkeley, CA 94720 Email: calendar@garnet.berkeley.edu FEATURES from to/span description site 1 19 cohesive end of phage DNA site 20 266 left end of phage DNA BASE COUNT 41 a 58 c 99 g 68 t ORIGIN Map position 0.0-0.8%. 1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct 61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt 121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg 181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc 241 gagccagccg tttatctcgc ggatcc // LOCUS RCABCHH 314 bp ds-DNA BCT 18-JUL-1990 DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and F108) genes, 3' end, and 5' end respectively. ACCESSION M34843 KEYWORDS . SOURCE R.capsulata DNA. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 314) AUTHORS Yang,Z. and Bauer,C.E. TITLE Rhodobacter capsulatus genes involved in early steps of the bacteriochlorophyll biosynthetic pathway JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.E.Bauer, 04-JUN-1990. FEATURES from to/span description pept < 1 79 bacteriochlorophyll biosynthesis protein (bchH) (AA at 2) pept 108 > 314 bacteriochlorophyll biosynthesis protein (F108) BASE COUNT 59 a 87 c 119 g 49 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga 61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg 121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg 181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg 241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca 301 agcgcgtgct gcag // LOCUS TREANTAT 999 bp ds-DNA ORG 18-JUL-1990 DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle. ACCESSION M34848 KEYWORDS . SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA. ORGANISM Kinetoplast Trypanosoma evansi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma evansi. REFERENCE 1 (bases 1 to 999) AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S., Steinert,M. and Hamers,R. TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Bajyana Songa, 05-JUN-1990. FEATURES from to/span description site 367 516 kinetoplast minicircle common region (ORI) BASE COUNT 391 a 78 c 203 g 327 t ORIGIN 1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt 61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt 121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg 181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt 241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt 301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat 361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg 421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt 481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag 541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag 601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca 661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat 721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac 781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa 841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg 901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat 961 aaaataatgg ataaaagagt ttataggtga agttgaagt // LOCUS VIBSCRRE 1524 bp ds-DNA BCT 18-JUL-1990 DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds, and sucrose transport (scrA) protein, 5' end. ACCESSION M35009 KEYWORDS Scr repressor protein; sucrose transport protein. SOURCE V.alginolyticus DNA. ORGANISM Vibrio alginolyticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1524) AUTHORS Blatch,G.L. and Woods,D.R. TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr repressor-encoding gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Blatch, 10-JUN-1990. Author address: G.L.Blatch University of Cape Town Dept. Microbiology Private Bag Rondebosch 7700 SOUTH AFRICA Email: bltgre01.uctvax@f4.n494.z5.fidonet.org FEATURES from to/span description pept 36 < 1 (c) sucrose transport protein (scrA) pept 271 555 Scr repressor protein (scrR) pept 791 1090 ORF5 binding 264 267 scrR ribosomal binding site binding 780 785 ORF5 ribosomal binding site signal 178 183 scrA -10 region signal 203 208 scrA -35 region signal 190 195 scrR -10 region signal 162 167 scrR -10 region signal 762 767 ORF5 -35 region signal 742 747 ORF5 -35 region site 78 91 repressor operator site (put.) site 229 242 repressor operator site (put.) site 12 71 ScrR helix-turn-helix motif BASE COUNT 466 a 344 c 321 g 393 t ORIGIN 1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc 61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg 121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt 181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc 241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg 301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa 361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc 421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg 481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt 541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga 601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt 661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac 721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag 781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt 841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga 901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg 961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga 1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta 1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc 1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt 1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt 1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat 1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct 1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca 1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa 1501 gccaattgtg atattgaaga agct // LOCUS XELNRP1A 2431 bp ss-mRNA VRT 18-JUL-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA, complete cds. ACCESSION M34894 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2431) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 74 1117 pot. RNA-binding protein (nrp-1B) mRNA < 1 2431 nrp-1B mRNA signal 2407 2418 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence site 921 941 alternative exon (put.) BASE COUNT 663 a 522 c 543 g 703 t ORIGIN 1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt 61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc 121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact 181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt 241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa 301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt 361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt 421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga 481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac 541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa 601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt 661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat 721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc 781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc 841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc 901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag 961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag 1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc 1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct 1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct 1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga 1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat 1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg 1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat 1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta 1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa 1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat 1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac 1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg 1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc 1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat 1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc 1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga 1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct 2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg 2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc 2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat 2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg 2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca 2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata 2401 gttcaaaata aactatgaat tttattcatt t // LOCUS XELNRP1B 2324 bp ss-mRNA VRT 18-JUL-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA, complete cds. ACCESSION M34895 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2324) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 29 1051 pot. RNA-binding protein (nrp-1B) mRNA < 1 2324 nrp-1B mRNA signal 2306 2311 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence BASE COUNT 647 a 467 c 520 g 690 t ORIGIN 1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc 61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac 121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt 181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca 241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga 301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat 361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca 421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg 481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt 541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc 601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat 661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag 721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac 781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg 841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc 901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat 961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat 1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg 1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct 1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat 1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta 1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa 1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa 1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct 1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa 1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc 1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact 1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg 1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt 1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct 1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag 1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag 1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt 1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta 2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac 2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt 2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat 2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa 2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc // LOCUS YSJSACDHY 2569 bp ds-DNA PLN 18-JUL-1990 DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds. ACCESSION M34929 KEYWORDS saccharopine dehydrogenase. SOURCE Y.lipolytica (strain W29) DNA. ORGANISM Yarrowia lipolytica Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2569) AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and Gaillardin,C. TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia lipolytica JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.M.Fournier, 06-JUN-1990. ORF1 is probably not translated, since a lacZ fusion/expression experiment did not produce results. FEATURES from to/span description pept 649 1929 ORF1 pept 1764 655 (c) saccharopine dehydrogenase mRNA 582 2060 ORF1 mRNA (alt.) mRNA 598 2060 ORF1 mRNA (alt.) mRNA 599 2060 ORF1 mRNA (alt.) mRNA 602 2060 ORF1 mRNA (alt.) mRNA 606 2060 ORF1 mRNA (alt.) mRNA 609 2060 ORF1 mRNA (alt.) mRNA 1807 639 (c) ORF2 mRNA (alt.) mRNA 1809 639 (c) ORF2 mRNA (alt.) mRNA 1811 639 (c) ORF2 mRNA (alt.) mRNA 1812 639 (c) ORF2 mRNA (alt.) mRNA 1814 639 (c) ORF2 mRNA (alt.) signal 521 528 TATA box signal 565 575 TATA box BASE COUNT 668 a 573 c 666 g 662 t ORIGIN 1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag 61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt 121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca 181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa 241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag 301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc 361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc 421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt 481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca 541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt 601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc 661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt 721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc 781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc 841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac 901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt 961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca 1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt 1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc 1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt 1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata 1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc 1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc 1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct 1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg 1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg 1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc 1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt 1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag 1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg 1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag 1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg 1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga 1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa 2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca 2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg 2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc 2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct 2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag 2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga 2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat 2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat 2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc // LOCUS MVISAA1A 772 bp ss-mRNA MAM 18-JUL-1990 DEFINITION Mink serum amyloid A1 mRNA, complete cds. ACCESSION M34953 J05445 KEYWORDS serum amyloid A1. SOURCE Mink liver, cDNA to mRNA, clone pmiSAA1. ORGANISM Mustela vison Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Musteloidea; Mustelidae. REFERENCE 1 (bases 1 to 772) AUTHORS Marhaug,G., Husby,G. and Dowton,S.B. TITLE Mink serum amyloid A protein--expression and primary structure based on cDNA sequences JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.O.Marhaug, 07-JUN-1990. FEATURES from to/span description pept 29 418 serum amyloid A1 mRNA < 1 772 serum amyloid A1 BASE COUNT 176 a 205 c 200 g 191 t ORIGIN 1 cagctctgct ccaccagaac tcagcaccat gaagcttttc acgggcctca tcttctgctc 61 cttggtcctg ggggtcagca gtcaatggta ttcattcatt ggtgaggctg ctcaaggagc 121 ttgggacatg tacagagcct actctgacat gatagaagcc aagtacaaaa attcggacaa 181 atacttccat gcccgcggga actatgatgc tgcacaaagg ggccctgggg gtgcgtgggc 241 tgctaaagtc atcagtgacg ccagagagcg ttctcagaga atcacagacc ttatcaagta 301 tggagacagc ggccacggag tggaggactc gaaggctgac caagctgcca acgaatgggg 361 ccggagtggc aaagacccca accacttccg accccctggc ctgcctgaca agtactgagc 421 tttcccttgg ctctgccctg gggagatggg ctgtgaggcc cctgaaggca agaacagttg 481 ctgagttaga gttcctgaat tgtataccct tcctactgaa tactttaaag agcacattaa 541 aatgtttagt aaatgcttgt gaatgggcac ctggatggcc cagttggtta agcggctgtc 601 ttcggctcat gtcatgatcc tagggtcctg ggaccgagtt ctccatgggg ctccttcctg 661 ggcagggagt ctgcttctct gcctgcctct ctgcctctgc ctgccactct gcctgcttgt 721 gtgctctctc tccctctttc tctgacaaat aaataaataa ataaaatctt tt // LOCUS MVISAA2A 552 bp ss-mRNA MAM 18-JUL-1990 DEFINITION Mink serum amyloid A2 mRNA, complete cds. ACCESSION M34954 J05445 KEYWORDS serum amyloid A2. SOURCE Mink liver, cDNA to mRNA, clone pmiSAA2. ORGANISM Mustela vison Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Musteloidea; Mustelidae. REFERENCE 1 (bases 1 to 552) AUTHORS Marhaug,G., Husby,G. and Dowton,S.B. TITLE Mink serum amyloid a protein - expression and primary structure based on cDNA sequences JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.O.Marhaug, 07-JUN-1990. FEATURES from to/span description pept 31 420 serum amyloid A2 precursor sigp 31 84 serum amyloid A2 signal peptide matp 85 417 serum amyloid A2 mRNA < 1 552 serum amyloid A2 mRNA BASE COUNT 129 a 144 c 161 g 118 t ORIGIN 1 agcagctctg ctccaccaga actcagcacc atgaagcttt tcacgggcct catcttctgc 61 tccttggtcc tgggggtcag cagtcaatgg tattcattca ttggtgaggc tgttcaaggg 121 gcttgggaca tgtacagagc ctactctgac atgagagaag ccaattacaa aaattcggac 181 aaatacttcc atgcccgtgg gaactatgat gctgcacaaa ggggccctgg gggcgcctgg 241 gctgctaaag tcatcagtga cgccagagag cgttctcaga gagtcacaga ccttttcaag 301 tatggagaca gcggccacgg agtggaggac tcgaaggctg accaggctgc caacgaatgg 361 ggccggagtg gcaaagaccc caaccacttc cgaccctctg gcttgcctga caagtactga 421 gcttcccctt ggctctgtcc tggggggatg ggctgtgagg accctgggga cagggacacc 481 cagtcattga gttctctgtc cccagaggct agtggagggc acctaatagg tgtctaataa 541 atgcttactg ag // LOCUS TFERUBISCO 2172 bp ds-DNA BCT 18-JUL-1990 DEFINITION Pot. Thiobacillus sp. ribulose 1,5 bisphosphate carboxylase (RUBISCO) large and small subunit genes, complete cds. ACCESSION M34536 KEYWORDS ribulose 1,5 bisphosphate carboxylase. SOURCE Pot. Thiobacillus sp. DNA, clones pAH5.0 and pAH196. ORGANISM Thiobacillus sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Colorless sulfur bacteria. REFERENCE 1 (bases 1 to 2172) AUTHORS Stein,J.L., Haygood,M. and Felbeck,H. TITLE Nucleotide sequence and expression of a deep sea ribulose 1,5 bisphosphate carboxylase gene cloned from a chemoautotrophic bacterial endosymbiont JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Stein, 23-MAY-1990. Author address: J.L.Stein Scripps Institution of Oceanography UC San Diego, A-002 A-002, Marine Biol Res Div. La Jolla, CA 92093 The taxonomy of this organism could not be determined. It can only live as a symbiont of Alvinoconcha hessleri, a gastropod. FEATURES from to/span description pept 201 1616 ribulose 1,5 bisphosphate carboxylase large subunit (EC 4.1.1.39) pept 1685 2050 ribulose 1,5 bisphosphate carboxylase small subunit signal 19 24 pot. -35 region signal 43 49 pot. -10 region binding 188 194 ribosome binding site binding 1673 1676 ribosome binding site BASE COUNT 441 a 741 c 641 g 349 t ORIGIN 1 atccttatcc tgaccctcaa tacataatcg tgagcttatg gctataatca gaaagtctga 61 gttcacttta tagatgacct ctcccatgct cactctcaat cctgagtgat tcactcggat 121 tggacccgat aaccgtgcgg gctgtcggac gctaggcgtc cggtgcccct cgaacaccga 181 gcctctggag aggacaagca atggcgaaga aatacgatgc gggcgtaaaa gactaccgcg 241 aaacctactg gatgccggac tacacgccga aggagaccga cctgctggcg tgcttcaaga 301 tcattccgca gccgggcgtg ccgcgcgagg aagcccgggc cgccgtggcc gccgagtcgt 361 ccaccggcac ctggaccacc gtgtggaccg acctgctgac cgacctcgac cactacaagg 421 gccgtgccta cgccatcgag gacgtgccgg gcgacgagga ggccttctat gccttcatcg 481 cctaccccat cgacctgttc gaggaaggct cggtggtcaa cgtgtttacc tctctggtgg 541 gcaacgtgtt cggcttcaag gccatccgcg ccctgcgtct ggaggacgtg cgcttcccca 601 tcgcatacgt catgacctgc aacggtccgc cgcacggcat ccaggtcgag cgcgacatca 661 tgaacaagta cggccgtccg ctgttgggct gcaccatcaa gcccaagctg ggcctgtcgg 721 ccaagaacta cggtcgcgcc gtgtacgaat gcctgcgcgg cggcctggac ttcaccaagg 781 acgacgagaa cgtcaactcc cagcccttca tgcgctggcg gcacggtttc gacttcgtga 841 tggaggccat cgagaaggcc gagcgcgaga ccggcgagcg caagggtcac tacctgaacg 901 tcaccgcgcc caccccggac gagatgttca agcgtgccga gtacgccaag gagatcggcg 961 cgccgatcat catgcacgac tacatcaccg gcggcttcac cgccaacacc ggcctggcgc 1021 agtggtgccg tgacaacggc gtgctgctgc acatccaccg cgccatgcac gccgtgctcg 1081 accgcaaccc gcaccacggc atccacttcc gcgtgctgac caagatcctg cgcctgtccg 1141 gcggcgacca cctgcacacc ggcaccgtgg tcggcaagct ggagggcgac cgcgaggcga 1201 ccctcggctg gatcgacctg ctgcgcgagt cctacatcaa ggaagaccgt tcgcgcggca 1261 tcttcttcga ccaggactgg ggctccatgc ccggcgtctt tgcggcctgc tccggcggta 1321 tccacgtctg gcacatgccg gcgctggtca ccatcttcgg cgagcatgcc gtgctccagt 1381 tcggtggcgg caccctgggt cacccctggg gcaacgccgg cgccgcggcc aaccgggtgg 1441 cgctggaagc ctgcgtcgag gctcgcaacg agggccacga gctggagaaa gagggcaagg 1501 acatcctcat ccaggccgcc aagcacagcc ccgaactcaa gaccgcgatg gagacctgga 1561 aggaaatcaa gttcgagttc gacaccgtgg acaaactgga cgtggcgcac aaataagcgc 1621 cccgccaccc cgggccggtc gtggctggcc gcgccacgca tccccaattt tcgaggtatc 1681 aacaatgagc gaaatccaag actacaactc gagcgtcagc gatccgtcga gccgcaagtt 1741 cgagaccttc tcctacctgc ccgaactcgg cgtggaaaag atccgcaagc aggtcgagta 1801 catcgtcagc aagggctgga acccggccgt cgagcacacc gagccggaga acgccttcga 1861 ccactactgg tacatgtgga agctgccgat gttcggcgaa accgacgtgg acgccatcct 1921 ggccgaggcc gaggcatgcc acaaggcgca tccctcgcat cacgtgcgcc tgatcggcta 1981 cgacaactat gcccagtcgc aaggcactgc catggtgatc ttccgcggcc ccatcagcgc 2041 caagtgctga gccaccgggg ccgtaaacac gcgactgccg aggtctgccg gcccccgtcg 2101 catggcgacc cccgagggca tccgccaggt gcgtcacctc gcgatgttct tccgaacatt 2161 ccaataagcc ag // LOCUS LSVGPCN 3417 bp ss-RNA VRL 18-JUL-1990 DEFINITION Lassa virus glycoprotein (GPC) and nucleoprotein (N) genes, complete cds. ACCESSION M36544 M33879 KEYWORDS glycoprotein; nucleoprotein. SOURCE Lassa virus (strain GA391), cDNA to viral RNA. ORGANISM Lassa virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Arenaviridae. REFERENCE 1 (bases 1 to 3417) AUTHORS Wilson,S.M. and Clegg,J.C.S. TITLE Sequence analysis of the S RNA of Lassa virus JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.C.S.Clegg, 26-APR-1990. Author address:J.C.S.Clegg PHLS Centre for Applied Microbiology and Research Porton Down, Salisbury SP4 OJG UNITED KINGDOM FEATURES from to/span description pept 71 1543 glycoprotein (GPC) pept 3315 1603 (c) nucleoprotein (N) BASE COUNT 944 a 801 c 706 g 966 t ORIGIN 1 gcaccgggga tcctaggcat ttaggattgc gcattttaaa acctcctttt tggaaagtgt 61 cgcaatcagg atgggacaga ttgtgacatt cttccaagaa gttcctcatg ttattgagga 121 agtgatgaat attgtcctta ttgcactatc catcctagca attctgaagg gactatacaa 181 tgttgccacg tgtggcttga tagggcttgt cacattcctt ctactttcag gaaggtcatg 241 ctcactgatc tacaaaggga cttacgagct gcaaaccctt gagttgaata tggagactct 301 taatatgacc atgccgctat cctgcacaaa gaacaacagt catcattata taagggtggg 361 gaatgagaca ggacttgagc tcaccttgac caacaccagc attcttaatc acaaattctg 421 taacctctct gatgcccaca aaaggaatct ttatgatcac agcctcatga gtatcatctc 481 tacctttcat ctgtccattc ccaacttcaa tcaatatgag gcaatgagct gcgatttcaa 541 tggggggaaa atcactgtgc aatacaacct gagtcatagc ttcgcagtgg atgcagcagg 601 tcactgcggc acacttgcaa atggtgtctt acaaacattt atgagaatgg cttggggagg 661 gagttatatt gctcttgatt ctggacgcgg taactgggac tgtataatga ccagttacca 721 atatctaatc attcagaata caacctggga tgaccactgc caattttcca gaccatcacc 781 tattggctac cttgggcttc tctcacaaag aactagagac atatacatta gcagaaggtt 841 gttggggaca ttcacctgga cactatcaga ctcagaggga aatgaaacac cagggggata 901 ctgccttact agatggatgt tgattgaggc cgaattaaag tgctttggaa acactgcagt 961 agccaagtgc aatgagaaac atgatgaaga attttgtgac atgctaaggt tgttcgattt 1021 caacaaacag gccataagga ggctcaaaac agaggcccaa atgagcatac agctgatcaa 1081 caaggctgtc aatgctttaa taaatgatca gctcatcatg aagaaccact tgagagacat 1141 catgggcata ccatattgta actacagcag atattggtac cttaaccaca catcaacagg 1201 aaagacctca ctaccaaggt gttggcttat ctcaaatgga tcatatctca atgagaccaa 1261 gttttcagat gacatcgaac aacaagctga caacatgata acagagatgc tacaaaagga 1321 atacatagat agacagggca aaactccact ggggttagtt gacctatttg tttttagcac 1381 aagtttctat ctgataagca tctttctcca cctggtcaaa ataccaaccc atagacacat 1441 tgtaggtaaa ccttgcccaa aaccccacag gctgaaccac atgggcatct gctcctgtgg 1501 tctatacaaa cagccaggtg tgcctgtcag atggaagagg tgaaatccca cagggccccc 1561 gtgacccacc gccaattggc ggtgggtcac gggggcgtcc atctacagga cgactttagg 1621 tgttgaagtt ctgaacacca tgtctctagg gagcacaact ctcaggactt ttgcatcaag 1681 tcctcctgaa actgctgcat caaacataat gcaatccatc agtgcacaat gaggagttat 1741 ttcctcttta ccacctctct tctttttctc cacaactacc ccagtgtgca tgtgacatag 1801 atccttgaat tgatcccaaa cagcattctc aaactttctt gaatctgctt tactaagagt 1861 gatgtcaatc agttttatgt ctctcctccc ttgtgactca aggagttttc tgatatcctc 1921 tgatccttgg caagtgatga ccatgttccg aggaagggct tctatcactg cactggttaa 1981 cccaggttgg gcagcaaaca aatcagtcac atccctacca tgtgaatact ttgcatcttg 2041 tttgaattgc ttcaaatctg ttggctccct aaagaaatgt acatagcaac ccgagctcgg 2101 ttgataaagg gctatctcaa ctgggtcttc tggtcttcct tcaatatcca tccaggtttt 2161 tgcgttggga atcaagttga agcacttgaa atctttgaga gtcatcaact gagagtagat 2221 taatcctgca gcaaagcctg cagactgtaa tgacttgttg gaccccccat ttccaatttt 2281 ctggggcttg ttgtcttgct caaggtccac cacagtattt tcccatgctc ttcccacaat 2341 cgaggtcctt gatgcaatat aggaccatcc gtctcctgag agacagatct tgtataggat 2401 gttctcataa gggttccttt cacccggtgt gtctgataca aacattccca gactcttctt 2461 aactttcaag attgacttca agataccatc catggtctga ggtgaaacct taatagtctc 2521 taacatgtta ccaccatcaa gcatgcaggc ccctgctttg acagcagcac ccaaactgaa 2581 attgtaacca gagatgttga gtgaactttt cttagtgtca atcatattca gaattggatg 2641 actctgagac aatctgtcga gatcagatga gttggggtat ttggctgtgt aaatcagccc 2701 taaatctgtc aaagcttgaa cggcatcatt caggtccact tgcccctgtt tggtcaggca 2761 tgctaaagtt aggcttggca ttgttccgaa ctgattgttg agtaactctg catttttgac 2821 atcccaaact ctcacaatgc cattactggc accctttccc cctccactta ctccagtcat 2881 gccaatcatg ttcaaaaggg ctctcctttg atcaagctgt tgtgaactca aattccccat 2941 ataaactcct gaactcaaag gcctttctgt tctggtgact tttgatttca gtttttctaa 3001 atcagcggcc aggattagta gatcgtctga acttaaggtt ccaactctta agacactttt 3061 ctgctgtgtg gatttgagct caacaagatt gttgactgct tgattgagat ctctcagtcg 3121 ttttaggtcg ccatcatctc ttttctgctt gcgcatcaat ctctgaacat tactgacctc 3181 ggagaagtca agaccatgaa ggagagcttg agcgtcttta actacctgca actttatgtt 3241 ggaacagtag ccagatagtt cccttcttag ggattgagtc cacaagcatg acctcacttc 3301 cttggaagca ctcattgtcg tgatggttgt ctgacccttg agtgggtctt gaatgtggtc 3361 actccaaagg tttgattagt gcaaagcgca atccaatagc ctaggatcca ctgtgcg // LOCUS MOPGPCNC 3419 bp ss-RNA VRL 18-JUL-1990 DEFINITION Mopeia virus glycoprotein (GPC) and nucleocapsid protein genes, complete cds. ACCESSION M33879 KEYWORDS glycoprotein; nucleocapsid protein; small RNA segment. SOURCE Mopeia virus (strain 800150), cDNA to viral RNA. ORGANISM Mopeia virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Arenaviridae. REFERENCE 1 (bases 1 to 3419) AUTHORS Wilson,S.M. and Clegg,J.C.S. TITLE Sequence analysis of the S RNA of the African arenavirus Mopeia: An unusual secondary structure feature in the intergenic region JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.C.S.Clegg, 26-APR-1990. Author address: J.C.S.Clegg PHLS Centre for Applied Microbiology and Research Porton Down, Salisbury SP4 OJG, UNITED KINGDOM FEATURES from to/span description pept 47 1516 glycoprotein pept 3352 1640 (c) nucleocapsid protein BASE COUNT 875 a 819 c 736 g 989 t ORIGIN 1 gtggatccta ggctttttgg ttgcgcattt ctagagcatc tcggagatgg ggcagatagt 61 caccttcttt caagaggtgc cacacatcct tgaagaagtg atgaacattg tgctgatgac 121 cctctcaatc ttggccatcc taaagggcat ctacaatgtg atgacctgtg gaatcatcgg 181 tttgataaca tttttgtttt tgtgtgggag atcatgctca agcatctata aggacaacta 241 tgagttcttc tctctcgacc tcgacatgtc ttcactgaat gcaacgatgc ctctctcctg 301 ctcaaagaac aactcccatc actacatcca agttgggaat gagacaggcc tagagctgac 361 attgacaaac actagcataa ttaatcataa gttttgcaac ctgtctgatg ctcacaggag 421 gaatctttat gacaaagcac ttatgtcaat cttgacaaca ttccacctaa gcattccaga 481 ttttaaccaa tatgaggcaa tgtcatgtga ttttaacgga gggaagatct cagttcaata 541 caacctgtcc cactcaaatt atgtggatgc cggcaaccac tgtggcacta ttgcaaatgg 601 cattatggat gtttttagaa gaatgtattg gagcacctcc ctttcggttg cctctgacat 661 aagtgggact caatgcatac agaccgatta taagtatttg attattcaga acacatcatg 721 ggaggatcat tgcatgttct caagaccctc acccatggga tttttaagcc ttctgtcaca 781 gagaaccagg aatttctaca tctcaagaag acttttgggt ctttttacat ggactttgag 841 tgactcggaa gggaacgaca tgccaggtgg ttactgtctc acaaggtcta tgctaatagg 901 gcttgatctg aaatgctttg ggaacactgc cattgcgaag tgtaatcagg cacatgatga 961 agaattctgt gacatgctcc gcctttttga ttttaataaa caagcaatta gcaaactgag 1021 gtctgaagtt cagcagagca taaatttgat aaataaagct gtcaacgccc tcatcaatga 1081 tcaactggtc atgaggaacc atttgagaga tctaatgggt attccttatt gcaactactc 1141 taagttttgg tacttgaatg atactaggac agggagaaca tccctcccaa agtgttggct 1201 tgtgaccaat gggtcatacc taaatgaaac ccagttctca acagaaattg agcaagaagc 1261 caataacatg ttcactgaca tgttgaggaa ggagtatgag aaaaggcaga gcacaacacc 1321 tctggggtta gtagatcttt ttgttttctc cactagcttt tatttgatct ccgtgttcct 1381 ccacctaatc aaaattccaa cacatagaca cataaaaggc aagccttgcc cgaaaccaca 1441 cagactcaat cacatggcaa tctgttcctg tggcttctat aagcaaccag gtctccccac 1501 acaatggaaa aggtgaagaa caggggcacc cccgagaccc accgccttcg gcggtgggtc 1561 tcgggggcct ggagtgactg ttccttgatg gcctccacga cccaccgcca gaggcggtgg 1621 gtcgtggagg tgtcccttct tacaggacaa ctctgggagg acctgttcta aacaccaggt 1681 ctctactcaa gactggtctg ggggttggga tttgtggact ccctatgact gctgcttcaa 1741 acatgagaca atccagcaat gcacagtgtg gtgtcacttc ttctttgcca cctctcttct 1801 tcttttgcgt tacaatcccc gtatgcattc tacaaaggtg tttgtattca tcccagatct 1861 tatcctcaaa ttttcttgca tcgtctttct gcatggacac atcaatcagt tttatgtccc 1921 tcctgttctg ggagtcaaga agctttctga tgtcatcagc accttgacac gacaagacca 1981 tgttctttgg caggctctca attacagcac ttgtcaaccc cggctgaact gagaataggt 2041 cttggatgtc aatgccatga gagtgtttgg agtcttgttt gaattgttta atgtctgttg 2101 gttccctgta aaaatgaata tactgaccat tattaggttg atagatagct atctccacgg 2161 ggtcttcagg tctcccttct atgtcaatcc atgtcttagc atttggatcc aattgcaaca 2221 tgctgtcttt gagttccatt gtctgagagt aggtcaaacc agcaggcatg ccttgggcct 2281 gagcgagtct gttggagtta cctttgtcag gctttgcatt ctgttgcaca tcaccactca 2341 gatcaacagt tgtgttatcc caagccctgc cgacgatcga tgttcttgat gctatgtaag 2401 gccaaccctc tccagaaaga catagtttgt agaggagatt ttcataaggg ttcctttccc 2461 ctggcgtgtc agagacaaac attcccaaag aacgcttgac tttcaaggca gccttcaaaa 2521 ttccttccaa attggatttt gttactttga tggtctccag catgtttccg ccgtctatta 2581 gacaagcacc agctttcacc gcagctgaca ggctaaaatt gtaccctgat atgtttatgg 2641 cacttttctt gggctcaatg atccccaaga ttgggtgggt ctgagagagt ttgtctaagt 2701 cagacatgtt tggatacttt gcagtgtaca caagccccaa gtctgacaac gattggatca 2761 catcattcag atctgcctgc ccttgtttgc acatgcaagc aatcgtcagg cttggcattg 2821 tcccaaactg attgtttaaa agctctgggt ttcgaacatc ccacactcta acaatgccat 2881 ctccgagagt gtttcctgcc cggaacccgc cacccattcc gaccatctgt agcaaaaccc 2941 tcctctgttc tagctgctga gctgttaggt tgcccatata gactccagca gcaagaggcc 3001 tctcacctct gatgactttt gctttgagtc tgtccaggtc ggcagcaagg acgaggaggt 3061 catcagaggt tagtgtcccc actctcaaaa cattcttctg ttggactgac tttaactcaa 3121 ctagattgtt cactgcctgg tttaggtccc tcaatctttt caggtcagag tcatccctct 3181 tctcctttct catcaacctt tgaacattgg caacttcaga gaagtccagc ccatgaagaa 3241 gtgcttgagc atccttgatg acttggatct ttatgttgga gcagtaccct gagagttctc 3301 tcctcaggct ctgtgtccac aagaaggact tcacctcctt tgaattggac attgttggag 3361 gcttttcggt gcagagctga ttccaaaggc gcaatcaatt agcctaggat ccactgtgc // LOCUS HIV2D194 9399 bp ss-RNA VRL 18-JUL-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2), complete proviral genome. ACCESSION J04542 KEYWORDS . SOURCE Human immunodeficiency virus type 2, isolate D194. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (sites; for [2]) AUTHORS Kuehnel,H., von Briesen,H., Dietrich,U., Adamski,M., Mix,D., Biesert,L., Kreutz,R., Immelmann,A., Henco,K., Meichsner,C., Andreesen,R., Gelderblom,H. and Ruebsamen-Waigmann,H. TITLE Molecular cloning of two West African human immunodeficiency virus type 2 isolates that replicate well in macrophages: A Gambian isolate, from a patient with neurologic acquired immunodeficiency syndrome, and a highly divergent Ghanian isolate JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2383-2387 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 9399) AUTHORS Kuehnel,H., von Briesen,H., Dietrich,U., Adamski,M., Mix,D., Biesert,L., Kreutz,R., Immelmann,A., Henco,K., Meichsner,C., Andreesen,R., Gelderblom,H. and Ruebsamen-Waigmann,H. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT The gag coding sequence appears to be distorted at the 5' end by a frameshift; the env cds has a premature stop codon. The 5' ends of the pol and gag genes have not been determined. FEATURES from to/span description pept < 851 2053 gag polyprotein (AA at 851) pept < 1657 4878 pol polyprotein (AA at 1657) pept 4808 5455 vif protein pept 5283 5621 vpX protein pept 5621 5938 vpR protein pept 5784 6079 tat protein, exon 2 (first expressed exon) 8212 8308 tat protein, exon 3 (AA at 8213) pept 6010 6079 rev protein, exon 2 (first expressed exon) 8212 8453 rev protein, exon 3 (AA at 8214) pept 6086 6886 env ORF pept 8471 9244 nef protein LTR 1 239 5' LTR LTR 8856 9399 3' LTR rpt 1 113 R repeat 5' copy binding 243 259 primer (Lys-tRNA) binding site BASE COUNT 3206 a 1924 c 2357 g 1911 t 1 others ORIGIN 1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gctccacgct tgcttgctta aagacctctt aataaagctg ccagttagaa gcaagttaag 121 tgtgtgttcc catctctcct agtcgccgcc tggtcattcg gtgttcatct gagtaacaag 181 accctggtct gttaggaccc ttcccgcttt gagaatccaa ggcaggaaaa tccctagcag 241 gttggcgccc gaacagggac ttgaaagagg actgagaagc cctggaacac ggctgagtga 301 aggcagtaag ggcggcagga acaaaccacg acggagtgct cctagaaaag cgcgggccga 361 ggtaccgaag cggcgtgtgg agcgggagtg aaagaggcct ccgggtgaag gtaagtacct 421 acaccgaaaa ctgtagccag aaaaggcttg ttatcctacc tttagacagg tagaagattg 481 tgggagatgg gcgcgagaaa ctccgtcttg agagggaaaa aagcagacga attagaaaaa 541 gttaggttac ggcccaacgg aaagaaaaga tacaggttaa aacatgttgt gtgggcagcg 601 aatgaattag acagattcgg attggcagag agcctgttgg aatcaaaaga aggttgccaa 661 aagattctta aagttttaga accattagta ccaacagggt cagaaaattt aaaaagcctt 721 tttaataccg tctgcgtcat ttggtgcttg cacgcagaag agaaagtgaa agatactgaa 781 gaagcaaaga aactagcaca gagacatcta gtggcagaaa caggaactgc agagaaaatg 841 ccaaatataa gtagaccaac agcaccacct agtgggaaag ggaggaaact tccccgtgca 901 acaggcaggc ggcaactata tccatgtgcc gctgagcccc cgaactctaa atgcttgggt 961 aaaattagta gaggaaaaga agttcggggc agaagtagtg ccaggatttc aggcactctc 1021 agaaggctgc acgccctatg atatcaatca aatgcttaat tgtgtgggcg atcaccaagc 1081 agctatgcaa ataatcagag aaattattaa tgaggaagca gcagattggg atgcgcagca 1141 cccaatacca ggccccttac cagcagggca gcttagagac ccaagggggt ctgacatagc 1201 aggaacaaca agcacagtag atgaacagat ccagtggatg tataggcaac caaatcccgt 1261 gccggtaggg aacatctaca ggagatggat ccagataggg ctacagaaat gtgtcaggat 1321 gtacaaccca actaacatct tagatgtgaa gcagggacca aaagaatcgt tccagagcta 1381 tgtagacaga ttctacaaaa gcctaagggc agaacaaaca gacccggctg taaaaaattg 1441 gatgacccaa acgctgctaa tacagaatgc caacccagac tgcaagttag tattaaaagg 1501 actagggatg aatcccaccc tagaggagat gctgactgcc tgccagggag taggcggacc 1561 aagccagaaa gccagactaa tggctgaagc cctaaaggag gctttgacgc cagcccctat 1621 cccatttgca gcagcccaac aaagaagggc aattaggtgt tggaattgtg gaaaggaggg 1681 acactcggcg aaacagtgcc gagcacccag aagacagggc tgctggaagt gtggcaagtc 1741 aggacacatc atggcaaact gcccggaaag acaggcaggt tttttaggga tgggcccacg 1801 gggaaagcag ccccgcaact tccccgcggc ccaagctcct caggggctga taccaacagc 1861 acccccaata gatccagcag tggacctgtt ggagaaatat atgcagcaag ggagaaagca 1921 gagagagcag agggagagac catacaagga ggtgacggag gacttactgc acctcgagca 1981 gggagagacg ccccacagag gggcgacaga ggacttgcta cacctcaatt ctctctttgg 2041 aaaagaccag tagtcacagc attcatcgag gatcagccgg tagaagtctt actagacaca 2101 ggagctgatg actcaatagt agcaggaata gagttagggg acaattacac tccaaaaata 2161 gtggggggaa tagggggatt cataaatacc aaagaatata aaaatgtaga aataaaggta 2221 ctaaataaaa gagtaagagc caccataatg acaggagata ccccaatcaa catttttggc 2281 agaaatattc tggcaacctt aggcatgtca ttaaacctac cagtcgccaa gttagaccca 2341 ataaaagtaa cattgaagcc agggaaagat ggaccaaggc tgaaacaatg gcccctaaca 2401 aaagaaaaaa tagaagcact aaaagaaatt tgtgaaaaaa tggaaaggga gggccaacta 2461 gaagaagcac ctccaactaa tccttataat acccccacat ttgcaattaa gaaaaaggac 2521 aagaacaaat ggagaatgct aatagatttt agagaactaa acagggtgac tcaagatttc 2581 acagaaattc agctaggaat tccacacccg gcaggattag ccaaaaagaa aaggattact 2641 gtactagatg taggggatgc ctacttttcc ataccactac atgaagattt taggcaatat 2701 actgcattta ccctaccatc agtaaacaat gcagagccag aaaaaagata tgtatataag 2761 gtcttaccac aaggatggaa aggatcacca gcaatctttc aattcatgat gaggcaaatc 2821 ttagaacctt tcagaaaagc aaacccagac gtcattctca tccaatacat ggatgatatc 2881 ttaatagcta gtgacaggac gggtttagag catgacaaag tagtcctgca actaaaagaa 2941 cttctgaatg gcctagggtt ctctacccca gatgagaagt tccaaaagga ccctccgttt 3001 caatggatgg gctatgaatt gtggccaact aaatggaaac tgcagaaaat acaattacct 3061 cagaaagaaa tatggacagt caatgacatc caaaaactag taggagtttt gaactgggcg 3121 gcgcagatct atccagggat aaaaaccaag catttatgta aattgattag aggaaaaatg 3181 acactcacag aggaagtaca gtggacagag ttagcagagg cagaactaga agaaaacaaa 3241 attatcttaa gtcaggaaca agagggatcc tactatcagg aagaagaaga actagaagca 3301 acagtcatca aaagccaaga caatcagtgg gcatacaaaa tacaccaggg agagagggtt 3361 ctaaaagtag gaaagtatgc gaagataaaa aatactcata ccaatggggt cagactacta 3421 gcacaagtag tccaaaaaat aggaaaggaa gcactggtca tttggggacg agtgccaaaa 3481 tttcacctac cggtagagag agacacctgg gagcaatggt gggataacta ctggcaagta 3541 acatgggtcc cagagtggga cttcgtatct accccaccac tggtcaggtt gacatttaac 3601 ttggtaggag atcctatacc aggcacagag accttttaca cagatggatc atgcaataga 3661 cagtcaaaag aaggaaaagc aggatatgta acagatagag ggagagacag ggtaagagta 3721 ttagagcaaa catccaatca gcaagcagaa ctagaagcct ttgcgatggc actggcagac 3781 tcaggtccca aggttaatat catagtagac tcacagtatg taatggggat agtagcaggc 3841 caaccaacag agtcagaaaa tagaatagta aaccaaatca ttgaggacat gataaagaaa 3901 gaagcagtct atgttgcatg ggtcccagcc cataaaggca taggaggaaa ccaggaagta 3961 gaccatttag taagtcaggg catcagacaa gtattattcc tggaaaagat agagcccgct 4021 caagaagaac acgaaaaata tcatagcaat ataaaagaac taacccataa atttggaata 4081 ccccaactag tggcaagaca gatagtaaac acatgtgccc aatgccaaca gaaaggagaa 4141 gccatacatg ggcaagtaaa tgcagaaata ggcgtttggc aaatggactg cacacactta 4201 gaaggaaaaa tcattatagt agcagtgcat gttgcaagtg gattcataga agcagaagtc 4261 atcccacagg aatcaggaag gcagacagca ctcttcctat taaaactggc cagtaggtgg 4321 ccaataacac acttgcacac agacaatggc cccaacttca cttcacagga agtgaagatg 4381 gtggcatggt ggataggtat agagcaatcc tttggagtac cttacaatcc acaaagccag 4441 ggagtagtag aagcaatgaa tcaccaccta aaaaatcaga taagtagaat tagagaacag 4501 gcaaatacaa tagaaacaat agtactaatg gcagttcatt gcatgaattt taaaagaagg 4561 ggaggaatag gggatatgac cccagcagaa agactaatta acatgatcac cacagaacaa 4621 gaaatacaat tcctccaaag aaaaaattca aattttaaaa aattccaggt ctattacaga 4681 gaaggcagag atcagctgtg gaaaggacct ggagagctac tgtggaaggg agacggagca 4741 gtcatagtca aggtaggggc ggacataaaa gtagtaccaa gaaggaaggc caagattatc 4801 agggactatg gaggaaggca agaactggat agtagttccc acctggaggg tgccagggag 4861 gatggagagg tggcatagcc ttgtcaagca cctgaagtac agaacaaaag acttagagga 4921 ggtgcgctat gttccccatc acaaggtagg atgggcatgg tggacttgca gcagggtaat 4981 attcccacta gaaggagaaa gtcatctaga gatacaggca tattggaacc taacaccaga 5041 aaaaggatgg ctctcctctc attcagtaag gttaacctgg tatacagaaa agttctggac 5101 agatgttacc ccagactgtg cagactccct aatacacagc acttatttct cttgctttac 5161 ggcaggtgaa gtaagaagag ccatcagagg ggaaaagtta ttgtcctgct gcaactaccc 5221 ccaagctcat aaagcacagg taccatcact tcaataccta gccctagtgg tagtgcaaca 5281 aaatggcaga ccccagagaa agggtgccgc caggaaacag tggagaagag accattggag 5341 aggccttcga gtggctagac aggactatag aagccttaaa cagggaggca gtgaaccatc 5401 tgccccgaga gctcattttc caggtgtggc aaaggtcctg ggcatattgg catgatgaac 5461 aagggatgtc aacaagttac acaaagtata gatatttgtg cataatgcag aaagctgtgt 5521 atatacattt caagaagggg tgcacttgcc tggggagagg acatggcccg ggaggatgga 5581 gaccaggacc tccccctcct ccccctccag gtctagtcta atgactgaag caccaacaga 5641 gtttccccca gaagatggga ccccacggag agagctaggg agtacctggg taatagaaac 5701 tctgaaggaa atcaaggaag aagccttaaa acattttgat ccctgcttgc taattgctct 5761 tggcaactat atctataata gacatggaga cacccttgaa ggagccagag agctcattag 5821 agtcctacaa cgagccctct tcgtgcacat cagagcggga tgtgaccgct caagaaaggg 5881 ccaaacaagg agaagagctc cttgcccagc tgcaccgacc cctagaggca tgcactaact 5941 catgctattg taagcagtgc agttaccatt gccagctgtg tttcttgaaa aaagggctcg 6001 ggatatggta tgcgcgacag ggcagacgaa gaaggactcc aagaaaaact aagactcatc 6061 cgcctcctgc atcagataag taagtatgga gcctggtagg aatcagctgc ttgttgccat 6121 tttattaact agtgcttgct taatatattg caaacaatat gtgactgttt tctatggcat 6181 acccgcgtgg agaaatgcat ctattcccct attttgtgca accaaaaata gagatacttg 6241 ggggaccatc cagtgcttgc cagacaatga tgattatcag gaaataacct taaatgtgac 6301 agaagctttt gatgcatggg ataatacagt aacagaacaa gcaatagaag atgtctggag 6361 actgtttgag acatcaataa aaccatgtgt caagttgacg cccctatgtg tggcgatgaa 6421 ttgtaatata acttcaggga ctaccgcgac cccgagtcca ccaaacatta caataataga 6481 tgaaaattct acctgtatag gcgacaacaa ctgcacagga ttagggaaag aagaggtggt 6541 tgagtgtgag ttcaatatga cggggctaga acaagataag aaaaggaagt ataatgacgc 6601 atggtactca agagatgtgg tttgtgacaa gacaaacgga acaggcacat gttacatgag 6661 acattgcaac acatcagtca tcaaagagtc atgtgacaag cactattggg atgctatgaa 6721 gtttagatac tgtgcaccac cgggttttgc cctactaaga tgcaatgata ccaactattc 6781 aggctttgaa cctaagtgct ctaaagtagt agctgcttca tgcacaagga tgatggaaac 6841 gcaaacttct acttggtttg gctttaatgg cactagagca gaatagaaca tatatctatt 6901 ggcatggtaa gataatagga ctatcattag cttaaacagt attataatct cacaatgcat 6961 tgtaagagac caggaaataa gacagttgta ccaataacac ttatgtcagg gcgaaggttt 7021 cactctcggc cagtctacaa caaaaaacct gggcaggcat ggtgttggtt tcaaggcaac 7081 tggatagaag ccatgcggga ggtgaagcaa acccttgcaa aacatcccag gtacggagga 7141 acaaatgata caggaaaaat taactttacg aagccaggaa taggttcaga cccagaagtg 7201 acatacatgt ggactaactg cagaggagaa tttctctact gtaatatgac ttggttcctc 7261 aattgggtag aaaataagac gaaccaaaca cacggcaact atgcgccatg ccatataagg 7321 cagataatta acacctggca taaggtaggg acaaatgtat atttgcctcc tagggaaggg 7381 gagttgacct gcaattcaac agtaaccagc ataattgcta acattgactc agatggaaat 7441 cagaccaaca ttacctttag tgcagaagtg gcagaactgt accgattaga attgggggac 7501 tacaaattga tagaagtaac accaattccg ttcgcaccta caaaagagaa aagatattcc 7561 tcggctccag tgaggaacaa aagaggtgtg ttcgtgctag ggttcttggg ttttctcgca 7621 gcagcaggtt ctgcaatggg cggcncgtcc ttgacgctgt cggctcagtc ccggacttta 7681 ctggccggga tagtgcagca acagcaacag ctgttggacg tggtcaagag acaacaagaa 7741 atgttgcgat tgaccgtctg gggaacgaaa aatctccagg caagagtcac tgctatcgag 7801 aaatacttaa aggaccaggc acagctaaat tcatggggat gtgcgtttag gcaggtctgc 7861 cacactactg taccatgggt aaatgactcc ttaacacctg actggaacaa tatgacatgg 7921 caggaatggg aaaaacgagt ccactaccta gaggcaaata tcagtcaaag tttagaacag 7981 gcacaaattc aacaagaaaa gaatatgtat gaactacaaa aactaaatag ctgggatgtc 8041 tttggcaact ggtttgattt gacctcctgg atcaaatata ttcaatatgg agtttatata 8101 gtagtaggaa taataggttt aagaatagcc atatatatag tgcaattgtt aagtagactt 8161 agaaagggct ataggcctgt tttctcctcc ccccccggtt atctccaaca gatccatatc 8221 cacacggaca ggggacagcc agccaacgaa gaaacagaag aagacgccgg agacgacagt 8281 ggtttcggct tgtggccttg gccactaaac tacatacaat tcctgatcca cctactgact 8341 cgcctcttga ccgggctata caacagctgc aggggcttac tatccaagaa ctccccgacc 8401 cgccgactga tctcccagag tctaacagca atcagggact ggctgagact taaggcggcc 8461 tacctgcaat atgggtgcga gtggatccaa gaagcgttcc gagcattcgc aaggactgcg 8521 agagagacta ttgcgggcgc gtggaggggg ttatgtgaag cagcgcaacg catcgggagg 8581 ggaatcctcg cagtcccaag aaggatcagg cagggagcag aaatcgccct cctgtgaggg 8641 acagcagtat cagcagggag agttcatgaa caccccatgg agaaccccag cagcaatagg 8701 gcagaaaaat tcatataagc agcaaaatat ggatgatgta gattctgatg atgatgacct 8761 agtgggagtt cctgttatgc caagagtacc gctgagagaa atgacctata aactggcaat 8821 agatatgtca cattttataa aagaaaaagg aggactggaa gggatatttt acagtaggga 8881 gagacataga atcctagact tgttcctaga aaaggaggaa gggataatac cagattggca 8941 gaattatact catgggccag gaacaaggta cccaatgtac ttcgggtggc tgtggaaact 9001 agtaccagta gacatctcac aagaggcaga ggaagtagag accaactgct tagtacaccc 9061 agcacaaaca agcagatatg atgacgagca tggggagaca ctagtttggc ggtttgaccc 9121 catgctggcc tatagttaca aggccttcat tctgcaccca gaagaatttg ggcacaagtc 9181 aggattgcca gagaaagagt ggaaggcaaa actgaaagca agagggatac catatagtga 9241 ataacaggaa caaccatact tggtcagggc aggaaatagc tactaagaac agctgagact 9301 gcagggactt tccagaaggg gctgtaacca agggagggac atgggaggag ctggtgtcat 9361 attctctgta taaatgtacc cgcttcttgc attgtattc // LOCUS ANAHETA 2608 bp ds-DNA BCT 18-JUL-1990 DEFINITION Anabaena sp. heterocyst differentiation protein (hetA) gene, complete cds. ACCESSION M31722 KEYWORDS differentiation protein; heterocyst differentiation protein. SOURCE Anabaena sp. (strain PCC 7120) DNA. ORGANISM Anabaena sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Nostocaceae. REFERENCE 1 (bases 54 to 2608) AUTHORS Holland,D. and Wolk,C.P. TITLE Identification and characterization of hetA, a gene that acts early in the process of morphological differentiation of heterocysts JOURNAL J. Bacteriol. 172, 3131-3137 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2608) AUTHORS Holland,D. and Wolk,C.P. JOURNAL Unpublished (1990) MSU-DOE Plant Res. Lab., M.S.U., Lansing, MI STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Wolk,C.P., 30-JAN-1990. FEATURES from to/span description pept 198 2021 heterocyst differentiation protein (hetA) ORF 2077 > 2608 unidentified reading frame (ORF2) rpt 1973 2000 direct repeat rpt 2075 2158 direct repeat BASE COUNT 793 a 510 c 513 g 792 t ORIGIN 1 taaaaatcct atataggact catatttgat ttctgaaaaa ccaaattggc tagaagcttg 61 ctatacagag ctttgtttct cagtatacgt agcaaaattc aagtacgatt cctatagatg 121 aattactaca ctttagttaa aaattaggtt gtcacggttc aaaaaattgt cttagaaaat 181 atacaacgca tttaaaaatg ccaaaatcac cacataaatt atttaaagct aatagcttct 241 ggaaagagaa caatttaata ttgcgagaaa taaaacattt tcgcaaaata gctatattag 301 ccgtaatatt ttcattttta gctgccagtt ttgaaggtgt tagtattggt tttttactgt 361 catttttaca aaaattgact agtcctaatg atccaattca aacaggaatt agctgggttg 421 acatgatttt ggctgctgat gcttggccga ttccccccat atacaggata tctctactaa 481 ttttgttgag tacctggatg cgtgctacct tcaattattt tggcggagta tacactgaat 541 cagcccaact taatttagca gatcgcttac ataagcaaat ttttgagcaa ttacaagctc 601 tgaggttgag ttactttgct caaactcgtt cgggtgaact cattaatacg attactacag 661 aaattgaaag aatcaaacag ggttttagtg ggttagcttt tgtgttaacc agaatcatga 721 ccgtttgcgt ttactttgtt gtgatgtttt ctatatcatg gcaactttcg attatttctg 781 tgctgatatt tttactgtta gcagtgggat tatctacgct gaataagcga gtcagagaaa 841 ctagctttgg catttctcat gctaatgctc aatttacggc tgtagctgta gagttcatca 901 atgggattcg gacaattcaa gcttttggga cacaagaatt tgaacggcaa cgtttctata 961 aagcgagtac taatcaactg aatgcagcta tcaaagttgt tttagcttgg acactagtaa 1021 aacctatagc tgaaggtata gctactacgg ttttaattag tttgattgtg atttccttcg 1081 cgacttttac attaccagtt gcctcattat tgacattctt ctttgtttta gtcagggtga 1141 ttcctaatat tcaagatatt aatggtacgg ttgcttttct gagtacttta caagggtcat 1201 cagaaaacat taaaaatatt ttgcagacga ataataaacc ttacttgaaa aatggcaaac 1261 ttcatttcca agggttgaag cgttcaattg atttagtatc cgtagatttt ggttatactg 1321 ctgataattt agtgttgaat aatatcactc tgacaattga gcgtggtaaa acaacagctc 1381 tggtgggtgc atcgggcgct ggtaaaacca cattagctga tttaattccc cgattttacg 1441 atccgacaga gggacagatt ttagtagatg ggcttgatgt acagtacttt gaaatcaatt 1501 ccctccgccg caaaatggct gtagttagtc aagatacatt tattttcaac acttctatta 1561 gagacaatat cgcctacggt acatctgggg cgagtgaagc ggaaattaga gaagtagcgc 1621 ggctagcaaa tgcgttgcaa tttatcgaag aaatgcccga agggtttgat actaagttag 1681 gcgatcgcgg tgtccgttta tctggaggac agagacaacg gattgcgatc gctcgtgcat 1741 tactccgaga tcccgaaatc ctcattcttg acgaagccac cagcgcccta gattcagtct 1801 ccgagcgatt aattcaggag tctatagaaa aactttccgt gggtagaaca gtaattgcga 1861 tcgctcacag actctccaca attgccaaag cagataaggt tgtggtgatg gaacaagggc 1921 gaattgttga gcagggaaat tatcaagaac ttctagaaca acgcggaaag ctctggaaat 1981 atcaccagat gcaacacgaa tcaggacaga ctaattcgta atatcaattc aaaattcaaa 2041 attcaaaatt caaaattagg gaagccgagc agaatcatgg ttttggggta tgtatctgtc 2101 ccattctttt ttcaaatcgg tataactccc caatccccaa tccccaatct ccagtcccca 2161 atccccaatc cccaatcccc aatccccaat ccccagtccc caatccccaa tcccatgaaa 2221 atttccgtca tcatctcgaa ttacaactat gctcgttatc tttctagagc aatcaactct 2281 gttctcgctc aaactcactc agacattgaa atcgttatcg tagatgatgg ttctacagat 2341 aacagccgtg atgttattac ccaactgcaa gaacaagcac cggataaaat caagcccatc 2401 tttcaagcaa atcaaggaca gggaggcgct ttcaatgcgg ggtttgcggc ggcgactggc 2461 gaagtcgtag cttttcttga tgcagacgat gtgtggaaac ctcataaatt acagcgtatt 2521 gttgaggtat ttcagacatc ggatgtagtt ggtgtcatgc accatttgga tatcatagat 2581 ggcaatgaca aaacaattga tcaagctt // LOCUS BOVTRLA 85 bp ss-tRNA RNA 18-JUL-1990 DEFINITION Bovine amber suppressor Leu-tRNA-cag. ACCESSION M35066 KEYWORDS transfer RNA-Leu. SOURCE Calf liver tRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 85) AUTHORS Valle,R.P.C., Morch,M.-D. and Haenni,A.-L. TITLE Novel amber suppressor tRNAs of mammalian origin JOURNAL EMBO J. 6, 3049-3055 (1987) STANDARD simple staff_entry FEATURES from to/span description tRNA 1 85 Leu-tRNA anticdn 35 37 Leu-tRNA anticodon cag modified 6 6 m2g modified 10 10 m2g modified 12 12 ac4c modified 17 17 gm modified 19 19 d modified 21 21 p modified 27 27 m22g modified 33 33 um modified 38 38 m1g modified 39 39 p modified 45 45 um modified 47 47 p modified 50 50 m3c modified 57 57 m5c modified 64 64 p modified 67 67 m1a BASE COUNT 14 a 25 c 28 g 18 t ORIGIN 5' end of mature tRNA. 1 gtcaggatgg ccgagcggtc taaggcgctg cgttcaggtc gcagtctccc tggaggcgtg 61 ggttcgaatc ccacttctga cacca // LOCUS BOVTRLB 86 bp ss-tRNA RNA 18-JUL-1990 DEFINITION Bovine amber suppressor Leu-tRNA-caa. ACCESSION M36600 KEYWORDS transfer RNA-Leu. SOURCE Calf liver tRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 86) AUTHORS Valle,R.P.C., Morch,M.-D. and Haenni,A.-L. TITLE Novel amber suppressor tRNAs of mammalian origin JOURNAL EMBO J. 6, 3049-3055 (1987) STANDARD simple staff_entry FEATURES from to/span description tRNA 1 86 Leu-tRNA anticdn 35 37 Leu-tRNA anticodon caa modified 6 6 m2g modified 10 10 m2g modified 12 12 ac4c modified 19 19 d modified 21 21 p modified 27 27 m22g modified 33 33 um modified 38 38 m1g modified 39 39 p modified 45 45 um modified 47 47 p modified 50 50 m3c modified 58 58 m5c modified 65 65 p modified 68 68 m1a BASE COUNT 16 a 24 c 28 g 18 t ORIGIN 5' end of mature tRNA. 1 gtcaggatgg ccgagcggtc taaggcgctg cgttcaagtc gcagtctcgc atggaggcgt 61 gggttcgaat cccacttctg acacca // LOCUS BRPMBOI 62 bp ds-DNA INV 18-JUL-1990 DEFINITION B.malayi MboI repeat consensus sequence DNA. ACCESSION M34369 KEYWORDS . SOURCE B.malayi (strain sub-periodic) DNA. ORGANISM Brugia malayi Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Gnathostomatidae. REFERENCE 1 (bases 1 to 62) AUTHORS Natarajan,S., Werner,C., Cameron,M. and Rajan,T.V. TITLE Isolation and characterization of a repetitive DNA element from the genome of the human filarial parasite, Brugia malayi JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.Natarajan, 18-MAY-1990. author address: S.Natarajan University of Connecticut Dept. of Pathology Farmington, CT 06032 BASE COUNT 24 a 12 c 4 g 22 t ORIGIN 1 ccatttctct acagatataa caatatcact agaagacatt ttgattaatt cattaactca 61 ta // LOCUS DUKARGIN 1568 bp ss-mRNA VRT 18-JUL-1990 DEFINITION Duck argininosuccinate lyase/delta 2-crystallin mRNA, complete cds. ACCESSION M35132 KEYWORDS argininosuccinate lyase/delta 2-crystallin. SOURCE Duck 14-day old embryo lens, cDNA to mRNA, clone DC5-1. ORGANISM Anas platyrhynchos Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Anseriformes; Anatidae. REFERENCE 1 (bases 1 to 1568) AUTHORS Wistow,G. and Piatigorsky,J. TITLE Gene conversion and splice-site slippage in the argininosuccinate lyase/delta-crystallins of the duck lens. Members of an enzyme superfamily JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.J.Wistow, 26-MAR-1990. FEATURES from to/span description pept 74 1480 argininosuccinate lyase/delta 2-crystallin signal 1538 1543 poly-A signal site 86 91 6 inserted bases at splice site slip BASE COUNT 389 a 398 c 439 g 342 t ORIGIN 1 ccggggctgc actgcggaca cggttgcacc aggtgctggg atcgccgcac acgcgagcaa 61 aacgtcctct gagatggcat ctgaggcgag gggagataaa ctttggggag gaagattcag 121 tggaagcaca gatcccatca tggagaagct caactcttcc attgcctatg accagaggct 181 gtctgaagtt gatatccagg ggagcatggc ttatgccaaa gctctggaga aggctgggat 241 cctaactaag actgagctgg agaagatcct gagtggcctg gaaaagatct ctgaggaatg 301 gtctaaggga gtctttgtgg tgaaacagag cgatgaggat atccacactg ccaacgaacg 361 cagactgaag gagctgattg gtgatatcgc tggaaagctg cacaccggaa gaagcaggaa 421 tgatcaggtt gtgactgact tgaagctgtt catgaagaat tccctctcta tcatctccac 481 tcacctgctg cagctcatca agaccctggt ggaacgtgct gccatagaaa ttgatgttat 541 cttgcctggc tacacccacc tgcagaaggc ccagcccatc agatggagcc agttcttgct 601 cagccatgct gttgccctga cccgtgactc tgagcgcctg ggagaggtga agaagaggat 661 caacgtcctg cccctgggaa gcggtgctct ggctggcaac cccctggaca ttgacagaga 721 gatgctgcgt agcgagctgg agtttgcttc catcagcctg aacagcatgg atgccattag 781 tgagagagac tttgtggtgg aattcctctt ccttgccacc ctgctgatga tccaccttag 841 caagatggct gaagatctca taatctacag caccagcgag tttggcttcc tgaccctttc 901 tgatgccttc agcactggca gcagcctgat gcctcagaag aagaaccctg atagcctgga 961 actgatccgc agcaaggctg gacgtgtgtt tggacggctg gcttcaattc tcatggttct 1021 caaaggactt ccaagcacct acaacaagga cctgcaggag gacaaggagg ctgtctttga 1081 tgttgtggac accctgactg ctgtgctcca ggttgccact ggggtgatct ctaccctcca 1141 gatcagcaag gagaacatgg agaaggctct gacccctgag atgctggcta ctgacctggc 1201 tctctacttg gttcgcaaag gagtgccgtt cagacaagcc cacactgctt cggggaaggc 1261 cgtccacctt gccgagacca aaggcatcac catcaataaa ctcagcctgg aggacctgaa 1321 gagcatcagc ccccagttca gcagcgatgt ctcccaggtc ttcaactttg tcaacagcgt 1381 ggagcaatac actgccctgg cgggtaccgc caagagcagc gtgactaccc agatcgagca 1441 gctgagggag ctgatgaaga agcagaagga acaagcttag agtgtgggga gatttcccgt 1501 ggatgtagtg ttttgcttat cacactaatg cagagttaat aaacactgtg gtgtattgta 1561 gttcactc // LOCUS DUKARGINX 1562 bp ss-mRNA VRT 18-JUL-1990 DEFINITION Duck argininosuccinate lyase/delta 1-crystallin mRNA, complete cds. ACCESSION M35133 KEYWORDS argininosuccinate lyase/delta 1-crystallin. SOURCE Duck 14-day old embryo lens, cDNA to mRNA, clone DC6-2. ORGANISM Anas platyrhynchos Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Anseriformes; Anatidae. REFERENCE 1 (bases 1 to 1562) AUTHORS Wistow,G. and Piatigorsky,J. TITLE Gene conversion and splice-site slippage in the argininosuccinate lyase/delta-crystallins of the duck lens. Members of an enzyme superfamily JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.J.Wistow, 26-MAR-1990. FEATURES from to/span description pept 71 1471 argininosuccinate lyase/delta 1-crystallin signal 1529 1534 poly-A signal BASE COUNT 393 a 400 c 427 g 342 t ORIGIN 1 gggctccact gcagacacgg ttgtaccagg tgccgggatc gccgcacacc cgagcaaaac 61 gtcgtctgag atggcatccg agggggataa acttatggga ggaaggtttg tcggaagcac 121 ggatcccatc atgcagatgc tcagcacttc catatccact gaacagaggc tgtctgaggt 181 tgatatccag gcaagcattg cttatgccaa agccctggag aaggctggga tcctaactaa 241 gactgaactg gagaagatcc tgagtggcct ggaaaagatc tctgaggaat tgtctaaggg 301 agtcattgta gtgacccaaa gcgatgagga tatccaaact gccaatgaac gcagactgaa 361 ggagctgatt ggtgatattg ctggaaagct gcacactgga agaagcagga acgaacaggt 421 tgtgactgac ttgaagctgt tcatgaagaa ttccctctct atcatctcca ctcacctgct 481 gcagctcatc aagaccctgg tggaacgtgc tgccatagaa attgatgtta tcttgcctgg 541 ctacacccac ctgcagaagg cccagcccat cagatggagc cagttcttgc tcagccatgc 601 tgttgccctg acccgtgact ctgagcgcct gggagaggtg aagaagagga tcaacgtcct 661 gcccctggga agcggtgctc tggctggcaa ccccctggac attgacagag agatgctgcg 721 tagcgagctg gagtttgctt ccatcagcct gaacagcatg gatgccatta gtgagagaga 781 ctttgtggtg gaattcctct ctgttgccac cctgctgctg atccacctta gcaagatggc 841 tgaagatctc atcatctaca gcaccagcga gtttggcttc ctgacccttt ctgatgcctt 901 cagcactggc agcagcctga tgcctcagaa gaagaaccct gatagcctgg aactgatccg 961 cagcaaggct ggacgtgtgt ttggacggct ggcttcaatt ctcatggttc tcaaaggact 1021 tccaagcacc tacaacaagg acctgcagga agacaaggag gctgtcattg atgttgtgga 1081 caccctgact gctgtgctcc aggttgccac tggggtgatc tctaccctcc agatcagcaa 1141 ggagaacatg gagaaggctc tgacccctga gatgctggct actgacctgg ctctctactt 1201 ggttcgcaaa ggaatgccat tcagacaagc ccacactgct tcggggaagg ccgtccacct 1261 tgccgagacc aaaggcattg ccatcaataa tctcaccctg gaggacctga agagcatcag 1321 ccccctgttc agcagcgatg tgtcccaggt cttcaacttt gtcaacagcg tggagcagta 1381 cactgccctg ggcggtaccg ccaagagcag cgtgactacc cagattgagc agctgaggga 1441 gctgatgaag aagcagaagg aacaagctta gagtgtgggg agatttcccg tggatgcagc 1501 attgtgctta tcacactaat gcagagttaa taaacactgt ggtgtattgt agttcactca 1561 aa // LOCUS VACFGENE 6185 bp ds-DNA VRL 18-JUL-1990 DEFINITION Vaccinia virus F1L, F2L, F3L, F4L, F5L, F6L, F7L, F8L, and F9L mRNAs, complete cds. ACCESSION M34368 KEYWORDS . SOURCE Vaccinia virus (strain WR), cDNA to viral mRNA, clone HindIII F. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 6185) AUTHORS Roseman,N.A. and Slabaugh,M.B. TITLE The vaccinia virus HindIII D fragment: Nucleotide sequence of the left 6.2-kb JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.A.Roseman, 17-MAY-1990. Author address: N.A.Roseman Oregon State University Biochemistry and Biophysics Corvallis, Oregon 97331 FEATURES from to/span description pept 119 757 F9L protein pept 817 1014 F8L protein pept 1162 1404 F7L protein pept 1420 1644 F6L protein pept 1674 2642 F5L protein pept 2674 3633 F4L protein pept 3644 5086 F3L protein pept 5110 5553 F2L protein pept 5565 > 6185 F1L protein BASE COUNT 2175 a 946 c 1138 g 1926 t ORIGIN 1 gaattcatca tgtgtaccaa aacagactgt gataaatata gattaaaggt ttccattctt 61 cacccaatta gtttcttgga aaaatttatt atgagagaca ttttctcaga ctggataaat 121 ggcggaaact aaagagttta aaactttgta taatcttttt atagatagtt atttacaaaa 181 attagctcaa cattctatcc ctactaatgt cacttgtgct attcatatag gagaggttat 241 aggacagttt aaaaattgcg cgctccgaat aactaacaaa tgcatgagta attctcgact 301 tagtttcaca ctcatggttg aatcatttat tgaagtgatt tcattgcttc cggaaaagga 361 tagaagacgt atcgctgaag aaataggaat agatctagac gatgtaccta gtgcggtatc 421 caagctagaa aagaactgta atgcgtatgc ggaggttaat aatattatag atatacagaa 481 attagatatc ggagaatgtt cggctccgcc cggtcaacat atgcttttac agatagttaa 541 tacaggatcc gcggaacgaa attgtggttt acagacaatt gttaagtcct taaataaaat 601 atacgttcca cctattatcg aaaaccgatt gccgtattac gatccgtggt ttctagtggg 661 tgtagcaatt attctagtta tttttactgt agctatttgt tctattagac gaaatctggc 721 tcttaaatac agatacggaa cgtttttata cgtttaatta ataaaaaaat ttaattacaa 781 ggtatataat agtactccat ctacgcaatc gcgataatgg agggatctaa acgcaaacac 841 gacagtcggc gactacaaca agaacaggag cagcttcgtc cacgtacacc gccatcatat 901 gaagaaattg caaaatatgg acactcattt aacgtgaaaa gatttacgaa tgaagaaatg 961 tgtcttaaga atgattatcc acgaattata tcatataatc ctccaccaaa atagagtata 1021 tatatatcat catttcatga tgtatactac tgacatagtt tcaatgtgaa cttttcactt 1081 tcttgccggt tatgaagaat attttttatt ttaatggtca ttactaatcg tatattataa 1141 ttgaaaatgg attagtttaa tatgacgctc gtcatgggat cctgctgtgg tagattctgt 1201 gacgctaaga ataagaataa gaaggaagat gtagaagagg gaagagaagg atgttacaat 1261 tataagaacc ttaatgatct ggatgaatcc gaagcacgtg tagaatttgg accattatat 1321 atgataaatg aagaaaaatc agacataaat acattggata taaaaagaag atatagacac 1381 acgatagagt ctgtatattt ctaaaagttt ttataaaaaa tgagtaaaat actcacgttt 1441 gttaaaaata agataattga cttgattaat aatgaccaaa ttaaatattc tagagttata 1501 atgatagaag agtccgatag tcttttaccg gttgatgagg tgcatgctaa ccacggattt 1561 gactgtgtgg agatgataga tgaaaatata agcaatgaga atatcgaaca gtataaaacc 1621 gaatcttttt ttacaataaa ttgaaatcaa aacatttatt aaaccgcatc aagatgggta 1681 cgaacggcgt tagagtattt gtcattttat atttgttggc tgtatgcgga tgtatcgaat 1741 acgacgtaga cgataatgta catatttgta cccacactaa cgtgtcacat attaatcaca 1801 ctagttggta ttataatgat aaggttatag cgctagccac cgaggataaa acttctggtt 1861 atatatcatc attcataaaa cgcgttaata tctcattaac ttgtttaaat atatcgagtt 1921 tgcggtacga agattctggt acatacaaag gagtatcgca tctaaaagat ggagtcatcg 1981 ttacaactac tatgaatata tctgtaaagg ctaatatcat tgacttgact ggtagagtgc 2041 gttatctaac cagaaattat tgcgaagtta aaatacgatg cgaaataaca tctttcgcgc 2101 ttaatggttc tactacacca ccacatatga tattaggaac agtagataaa tggaaatatc 2161 ttccatttcc tacagatgat tatagatacg taggggaact gaaaagatat atatctggaa 2221 acccatatcc aacagagtcg ctagcgttag aaattagctc gacgtttaat cggtttacta 2281 tcgttaaaaa tttgaacgat gacgagtttt cttgttatct gttttcacaa aattatagtt 2341 tccataaaat gttgaatgtg cgtaacattt gtgaatccga atgggaggcg ttaaataata 2401 ataacgataa ttcatcctcc atgcccgctt cccacaacaa tctcgcaaac gatttatcta 2461 gtatgatgtc acaattacaa aatgataatg atgataataa cgattattca gcacccatga 2521 atgtcgataa tttaattatg atagtactaa taacaatgct atcaataata cttgtaatta 2581 ttgtagtgat tgcggcgata tcgatgtaca aaagatccaa gtacaggcat atagataact 2641 gaaaaaaaat ttattgttat tgttaattta gttatggaac ccatccttgc accaaatcca 2701 aatagatttg ttattttccc aatccaatat tatgacatct ggaacatgta taaaaaggca 2761 gaggcatcat tttggacagt ggaagaagta gatatatcta aagatatcaa tgattggaat 2821 aaactaacac cagacgaaaa atattttata aaacatgtat tggcgttttt tgcagccagt 2881 gacggaatag tgaatgaaaa tttggcggaa cgattttgta cagaagtaca gattaccgag 2941 gctagatgtt tctacggatt tcagatggcc attgaaaaca ttcattcgga aatgtatagt 3001 cttttgatcg atacttatgt taaagatagt aatgaaaaaa actatctctt taatgccata 3061 gaaacgatgc cttgtgtaaa aaagaaggcc gattgggctc aaaagtggat acatgacagc 3121 gccggttatg gagagagact tattgccttt gctgcagtag aaggaatctt cttttctgga 3181 tcattcgctt ccatattttg gcttaaaaag cgtggcctaa tgcccggact cacgttttcc 3241 aacgaactaa ttagtagaga cgagggtctg cactgcgatt tcgcatgttt gatgtttaaa 3301 catttattgc atccaccgag tgaagaaacc gttagatcta ttataacaga tgcggtatcc 3361 attgaacaag aatttcttac tgcggctctt ccagttaaac ttataggaat gaattgtgaa 3421 atgatgaaaa catatataga attcgtcgcg gatagattga tttctgaatt gggatttaaa 3481 aaaatttata atgttaccaa tccgtttgat ttcatggaaa atatatcatt ggaaggaaaa 3541 actaattttt tcgaaaaacg tgtgggtgaa taccaaaaaa tgggagttat gtctcaagaa 3601 gataatcatt tttctttaga tgttgacttt taaagaaaca taaatgccga tatttgttaa 3661 tactgtgtac tgtaagaata tattagcatt gtctatgact aagaaattca aaacaattat 3721 tgatgctata ggtggcaata taatagtcaa ttctacgata ttgaaaaagt tatctcctta 3781 ctttcgcaca catttacgtc aaaaatacac gaaaaataaa gatccagtta cttgggtttg 3841 tctagacctt gacattcaca gtttaacttc tatagttatt tactcgtata ctggaaaggt 3901 atatatagat agtcataacg tcgtcaattt attacgtgct tctatattaa cctctgtaga 3961 atttatcatc tacacttgta taaactttat cttacgagat tttagaaagg aatattgtgt 4021 cgagtgttac atgatgggta tagaatacgg actatccaat ctcttatgtc atactaaaaa 4081 ctttattgcc aaacactttt tggaactgga agatgacatc atagacaatt ttgattatct 4141 atctatgaaa cttattctag aaagcgatga actaaatgtt ccagatgagg attatgtagt 4201 tgattttgtc attaagtggt atataaagcg aagaaataaa ttaggaaatc tgctactcct 4261 tatcaaaaat gtaatcaggt caaattatct ttctcccaga ggtataaata atgtaaaatg 4321 gatactagac tgtaccaaaa tatttcattg tgataaacaa ccacgcaaat catacaagta 4381 tccattcata gagtatccta tgaacatgga tcaaattata gatatattcc atatgtgtac 4441 aagtactcat gttggagaag tagtatatct catcggtgga tggatgaaca atgaaataca 4501 taacaatgct atagcggtaa attatatatc aaacaattgg attccaattc ctccgatgaa 4561 tagccccaga ctgtatgcta gcgggatacc cgctaacaat aaattatacg tagtaggagg 4621 tctaccaaat cccacatctg ttgagcgttg gttccacggg gatgctgctt gggttaatat 4681 gccgagtctt ctgaaaccta gatgtaatcc agcagtggca tccataaaca atgttatata 4741 cgtaatggga ggacattctg aaactgatac aactacagaa tatttgctac ccaatcatga 4801 tcagtggcag tttggaccat ccacttatta tcctcattat aaatcatgcg cgttagtgtt 4861 cggtagaagg ttattcttgg ttggtagaaa tgcggaattt tattgtgaat ccagcaatac 4921 atggactctg atagatgatc ctatttatcc gagggataat ccagaattga tcatagtgga 4981 taataaactg ctattgatag gaggatttta tcgtgaatcg tatatagata ctatagaagt 5041 gtacaatcat cacacttatt catggaatat atgggatggt aaataatttt gaaataaaat 5101 attagtttta tgttcaacat gaatattaac tcaccagtta gatttgttaa ggaaactaac 5161 agagctaaat ctcctactag gcaatcacct tacgccgccg gatatgattt atatagcgct 5221 tacgattata ctatccctcc aggagaacga cagttaatta agacagatat tagtatgtcc 5281 atgcctaagt tctgctatgg tagaatagct cctaggtctg gtctgtccct aaaaggcatt 5341 gatataggag gcggtgtaat agacgaagat tataggggaa acataggagt cattcttatt 5401 aataatggaa aatgtacgtt taatgtaaat actggagata gaatagctca gctaatctat 5461 caacgtatat attatccaga actggaagaa gtacaatctc tagatagtac aaatagagga 5521 gatcaagggt ttggatcaac aggacttaga taataaacaa tagtatgttg tcgatgttta 5581 tgtgtaataa tatcgtagat tatgtagatg atatagataa tggtatagta caggatatag 5641 aagatgaggc tagcaataat gttgatcacg actatgtata tccacttcca gaaaatatgg 5701 tatatagatt tgacaagtcc actaacatac tcgattatct atcaacggaa cgggaccatg 5761 taatgatggc tgttcgatac tatatgagta aacaacgttt agacgacttg tatagacagt 5821 tgcccacaaa gactagatca tatatagata ttatcaacat atattgtgat aaagttagta 5881 atgattataa tagggacatg aatatcatgt atgatatggc atctacaaaa tcatttacag 5941 tttatgacat aaataacgaa gttaatacta tactaatgga taacaagggg ttgggtgtaa 6001 gattggcgac aatttcattt ataaccgaat tgggtagacg atgtatgaac ccagtagaaa 6061 cgataaaaat gtttactcta ctatcgcata ctatatgcga tgattatttt gtagattata 6121 taacggacat ttcaccacca gataatacca tccctaacac tagcacgcgt gaatatctaa 6181 agctt // LOCUS BSUCCCA 1172 bp ds-DNA BCT 18-JUL-1990 DEFINITION B.subtilis 13 kD cytochrome c-550 (cccA) gene, complete cds. ACCESSION J05569 KEYWORDS cytochrome c-550. SOURCE B.subtilis (strain w168) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1172) AUTHORS von Wachenfeldt,C. and Hederstedt,L. TITLE Bacillus subtilis 13 kDa cytochrome c-550 encoded by cccA, consists of a membrane-anchor and a heme-domain JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.O.S.von Wachenfeldt, 12-JUN-1990. FEATURES from to/span description pept 542 904 cytochrome c-550 (cccA) BASE COUNT 400 a 170 c 264 g 338 t ORIGIN 223 min on B.subtilis map. 1 cagctggaag atgtgcttga tacgttaact gatcgtgaag aaaatgtatt gcgtcttcga 61 ttcggtcttg atgacggccg tacaagaaca ttagaagagg tcggcaaagt atttggagta 121 acgagagagc gtattcgaca aatcgaagcc aaagcgttgc ggaaactaag acatcctagc 181 agaagtaaac gtttgaaaga tttccttgaa taagatggaa cgggtcttga agatccgttc 241 ttcttttttt aaaaagatat atggataata tgcctttatt ttactgaaaa atgatgtcat 301 ttgcaaatga acattgtggt gaaaaatttc aaaatctaat tccatatttt ctattgtaag 361 cgtatacaat acattataca atagaataaa aaggatatta gagatttagg catgtttcta 421 ttcaattttg tataaagtgt gaataaaaaa cttttgtata gcaatccatt tactttttgt 481 aaaaataagt tagaattaga agtgtttaca tagggggaag gatcaaaaag ggggaaggga 541 aatgaaatgg aacccgctta ttccattttt gctgatcgct gttttaggaa ttggtctaac 601 tttcttttta tcagtaaaag gacttgatga ctctcgggag attgcgagcg gaggagaaag 661 caaatctgct gaaaagaaag atgcaaacgc ttcaccagaa gaaatttaca aggcaaactg 721 catcgcatgc cacggtgaaa attatgaagg tgtttcgggt cctagcttaa aaggtgtagg 781 ggataagaaa gatgtcgctg aaattaaaac aaagatcgaa aaaggcggta atggaatgcc 841 ttccggtctt gttccggccg ataagctgga tgatatggca gagtgggtgt caaaaattaa 901 ataaaagaac tatttttctc tttatacatt cacaggccgc cgagatgttc tcgacagcct 961 tttatttttg actgctgcgt gcagacccgc atcaaaagaa ataatttcaa aatcatttgc 1021 ttcagcttgg attccgggct gattttttat atgatagtaa aggaaaaaat gattgagaaa 1081 agttgtgatg acaagtgaac gaattaaaat tatctaaacg attgcaaaca gtagcggagt 1141 acataccgaa tggagcggtg atggccgata tc // LOCUS ECOGACAR 1630 bp ds-DNA BCT 18-JUL-1990 DEFINITION E.coli glutamate and aspartate carrier (gltP) gene, complete cds. ACCESSION M32488 KEYWORDS glutamate and aspartate carrier. SOURCE E.coli (strain K12-BK9MDG) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1630) AUTHORS Wallace,B., Yang,Y.-J., Hong,J. and Lum,D. TITLE Cloning and sequencing of a gene encoding a glutamate and aspartate carrier of E.coli K-12 JOURNAL J. Bacteriol. 172, 3214-3220 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Y. Hong, 02-MAR-1990. FEATURES from to/span description pept 151 1338 glutamate and aspartate carrier (gltP) BASE COUNT 330 a 391 c 472 g 437 t ORIGIN 1 caagggttgc gcaacatacc gcgcaaatga tactgatcat aagcgttaaa aaaatctaca 61 aaccaacgca acacaattca tgccctggca gtatgtcacg ttctcgcgtt tctgaacggg 121 gaacggcgct ccattgagga agtcattcat atgaaaaata taaatttcag cctggcctgg 181 cagattctgt ttgctatggt gctgggcatt ctcctgggaa gctacctgca ctaccatagc 241 gacagccgcg actggctggt cgtcaatttg ctctctccgg cgggtgatat cttcatccat 301 ctgattaaaa tgattgttgt gccgattgtg atctccacgc tggtggtggg tatcgcgggt 361 gttggtgatg ccaaacagct cgggcgtatt ggcgcgaaaa ccattatcta cttcgaggtg 421 atcaccaccg tcgccatcat tttgggatca ctctggcgaa ccgtcttcca gcccggtgcc 481 ggggtggata tgtcgcagtt ggcgaccgtc gatatctcga aatatcagag cactacggaa 541 gcggtacaaa gcagttccca cggcattatg ggcacgattt tgtcgctggt gccgacgaac 601 attgtggcgt cgatggcgaa aggcgaaatg ctgccgatca tctttttctc ggtgctgttt 661 ggtctggggc tttcttccct gcccgcgacg catcgtgaac cgctggtgac cgtgttccgc 721 tccatctctg aaaccatgtt taaagtgact cacatggtga tgcgttatgc accggtgggt 781 gtgtttgcgc tgattgcggt gacggtggct aactttggtt tctcgtctct gtggccactg 841 gcgaaactgg tgctgctggt gcatttcgcc attctgttct tcgcgctggt agtgctggga 901 attgtggcgc gcctgtgcgg gttaagcgtc tggatcctga ttcgtattct gaaagatgag 961 ctgattctgg cgtactccac tgccagctct gaaagcgtcg tgccgcgaca tatgagaaga 1021 tggaagccta cggagcaccg gtgtcgatca ccagtttcgt ggtgccgacc ggttaacctc 1081 ttttaccttg atggttcgac gctgtatcaa agtattgccg ctatcttcat cgcgcagttg 1141 tatggcattg acctgtccat ctggcaggaa atcattctgg tcgtgacgct gatggtgacc 1201 tcgaaaggga ttgctggcgt gcctggcgtg tcgtttgtgg tgttgctggc aacgctgggt 1261 agcgtaggta tccgctggaa ggtctggcgt ttattgctgg tgttgaccgt atcctcgaca 1321 tggcgcgtac tgccgtgaac gtggtgggta atgcgctggc ggtgctggtg attgccaagt 1381 gggaacacaa atttgaccgt aagaaagcgc tggcttatga gcgtgaagtg ctgggcaaat 1441 ttgataaaac tgcggatcaa taattgaaga ttgccgggga tatccacccg gcaatgtgtg 1501 aatgcctgat gcgacgcttg ccgcgtctta tcaggcctac gccagacagc gcaatagcct 1561 gatttagcgt gattttgtag gtcggataag gcgtttatgc cgcatccgac atcaacgcct 1621 gatgcgacgc // LOCUS ECOLYSU 3034 bp ds-DNA BCT 18-JUL-1990 DEFINITION E.coli lysyl-tRNA synthetase (lysU) gene, complete cds. ACCESSION M30630 KEYWORDS lysyl-tRNA synthetase. SOURCE E.coli (strain K12) isolate CS520 DNA, clone pLC4-5. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3034) AUTHORS Clark,R.L. and Neidhardt,F.C. TITLE Roles of the two lysyl-tRNA synthetases of Escherichia coli: Analysis of nucleotide sequences and mutant behavior JOURNAL J. Bacteriol. 172, 3237-3243 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.L.Clark, 12-DEC-1989. FEATURES from to/span description pept 316 1821 lysyl-tRNA synthetase (lysU) (E.C. 6.1.1.6) signal 194 199 -35 signal signal 216 221 -10 signal BASE COUNT 850 a 654 c 665 g 865 t ORIGIN 93.55 min on K12 map. 1 cgatcatcgt cgtattggcc tttgctaccc gttttctgtt tagcacgccg acgaatatga 61 tacaggagag caacgattaa tgtcgtaaaa acaattggtt atgaataaat taactttcag 121 tttataacac taacaaatcg agattaataa gtcatttaaa ccattttgat ggttatttat 181 tagtgatatc aacttgaggt aagcgttagt ttcgataaga taaactgagt tactaatagt 241 cgaggcagat aatacagtgt accgatctgt ctctttatct acgctaaatt gaaagctgga 301 tttagaggaa ccaaaatgtc tgaacaagaa acacggggag ccaatgaggc tattgatttt 361 aacgatgaac tgagaaatcg ccgcgaaaaa ctggcggcac tacgtcagca aggtgtggcg 421 tttcccaatg attttcgccg cgaccatacc tctgaccagt tgcacgaaga gtttgatgcg 481 aaggataacc aggaactgga atccttaaac attgaagtct cggttgctgg ccgaatgatg 541 acccgtcgta tcatggggaa agcctccttt gtaacgttgc aggatgtcgg tggccgtatt 601 caactgtacg ttgcaagaga tagcctgcca gaaggtgttt ataacgatca gtttaaaaaa 661 tgggatctgg gtgacattat cggtgccggt acgctgttta agacgcaaac gggtgagctt 721 tccattcact gtactgagct gcgcctgctg actaaagcac tacgtccttt accagataaa 781 ttccatggtc tgcaggatca ggaagtccgt tatcgtcaac gttatctgga cctcatcgct 841 aacgataaat cccgtcaaac gtttgttgtc cgttcaaaaa ttctggccgc tatccgtcaa 901 ttcatggtcg cgcgcggctt tatggaagta gaaaccccga tgatgcaggt aattccaggt 961 ggggcatctg ctcgcccgtt tattacccat cataatgctc tggatttaga tatgtatgct 1021 cgtatcgcgc cggagctgta tctgaaacgt ctggttgtag gcggttttga acgggtattc 1081 gaacacgtaa ctcgtaatga aggtattcgt cgccataatc ctgagttcac aatgatggaa 1141 ctctacatgg cgtatgcgga ttaccacgat ttgattgaac tgacagagtc actgttccgc 1201 accctggcac aagaggttct gggtaccact aaagtcactt atggcgagca tgtgtttgat 1261 ttcggcaaac cgtttgaaaa actcaccatg cgcgaagcaa tcaaaaaata tcgtccagaa 1321 accgatatgg ccgacctgga taattttgat gctcgtaaag cattagctga atctatcggt 1381 attacggtag agaaaagctg ggggttggga cgtagtgtca cagagatctt tgatgaagtg 1441 gtagaaggac atgtgattca gccatccttt attacggaat atccggcaga agtgtccccg 1501 ctggcacgtc gtaatgatgt taacccggaa atcaccgacc gttttgaatt cttcatcggt 1561 ggtcgtgaaa tcggtaatgg ttttagcgaa ttaaacgacg cagaagatca ggctgaacgt 1621 ttccaggaac aggttaatgc taaagctgca ggtgacgacg aagccatgtt ctatgacgaa 1681 gattacgtga ctgcgctgga atatggtctg ccgccaaccg ctggtctggg tattggtatc 1741 gaccgaatga ttatgctgtt tactaacagc catactattc gcgacgttat tctcttcccg 1801 gcgatgcgcc cacagaaata aatttcactt taatgaacga agcagtcagg cgactgcttc 1861 gttcatcgtt ccgcaaatcc agttttgagg agagacgctt cgtttcacga taaccagcgg 1921 gtccactcaa ctacagctcc taaaagagag aaaaaagtaa ctcgacaggc agactcattg 1981 gccaggttat tccgaccagg aatgcactaa gtaaacgaat atgtgttttc taatgataga 2041 gataaaccag agtaatcgca ggttattcac gccgaatacg ttcacacatc acataaaaaa 2101 ttaatatttt agagaatagt gccatttttt atgttcttct tatccttcct ggctacttgt 2161 aaaactgacg gaaatagtca tcaactgtca taatgttgag gtcaaaagca atatccggta 2221 agtctaagca tttgagcctg attaaatctt cgatttcact tgcggctaac gatgtcgtta 2281 ttgcgaagga gttttcattc aattgtcgtg gtatcccata ttcatcagcc acgatagagg 2341 agaaattcat atcatacata catcaatgtt cttaatctta agaagagtgg ggaataagat 2401 gatggataat caaaagaaac aacataagta ttgaacttgt ttttacctcc ataatatgac 2461 tcctatcaat attaaacgct ctggggacgc aaaacttatt aaccccgatg atgattctca 2521 tataggagat attctttttt ttgctcagta aaggggatgg acttgggaca aaaaaacgcg 2581 ctctctgccc attctttgtg catcccccta cccttcagat gctctccttg catcacatcg 2641 cggagagaat gaaaaatatc gtgactctgt actctcccat caactgcgaa tatcatcata 2701 ctcccgcgtt ttatcaaatt catgtttcgc aaatgggcat aatgggataa tttttcgttt 2761 ttcccgacgc attttttcca cgactttcgc aaccagctgt taccaatccc cgttgccttt 2821 caggctttca tcgacatcgt atgttcgata actcgctaaa tctccgtcgg tcggcacaaa 2881 gacaatttca gcgatttgct tgccttgttt gtcattaatg taaaatttat tgtggccttc 2941 gcgtatttcc attttaccct cgcttattta tgacggtatt tcagcgcgcc gctcgggcac 3001 gtatcatcac tttaaccaca gtggcgacgt cgac // LOCUS PSEBCD 4007 bp ds-DNA BCT 18-JUL-1990 DEFINITION P.aeruginosa pilin biogenesis protein (pilB, pilC and pilD) genes, complete cds. ACCESSION M32066 KEYWORDS pilin biogenesis protein. SOURCE P.aeruginosa (strain PAK) DNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 4007) AUTHORS Nunn,D., Bergman,S. and Lory,S. TITLE Products of three accessory genes, pilB, pilC and pilD are required for biogenesis of Pseudomonas aeruginosa pili JOURNAL J. Bacteriol. 172, 2911-2919 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.N.Nunn, 12-FEB-1990. FEATURES from to/span description pept 199 1899 pilin biogenesis protein (pilB) pept 1903 3123 pilin biogenesis protein (pilC) pept 3127 3999 pilin biogenesis protein (pilD) mRNA 8 < 1 (c) pilA mRNA BASE COUNT 882 a 1244 c 1110 g 771 t ORIGIN 1 cgcctaaccc taccaagcac ttaccatgcc aaataccgtt cacatagcga gcaaggccgt 61 cttatgccct tcccgaggag actcaaactt aaaacggaca ggatgtgaca ctttttgaca 121 gccggactcc cccgatttgg cagaccctgt tatctgcgtt ataaggcact tcaatagccc 181 tctgcgattt ccttctccat gaacgacagc atccaactga gcggtctgtc ccgacagctc 241 gtccaagcca atctactcga cgaaaagacc gccctgcagg cccagacgca ggcgcagcgc 301 aacaagctgt cgctggtgac gcacctggtg cagaacaagc tggtaagcgg cctggccctg 361 gccgaattgt ccgccgagca gttcggcatc gcctattgcg acctgaacag cctggacagg 421 gaaagctttc cgcgggacgc catcagcgaa aagctggtcc gccagcaccg ggtcattccc 481 ctgtggcggc ggggcaacaa gttgttcgta ggcatctccg acgccgccaa ccaccaggcc 541 atcaacgacg tccagttcag cactggcctg acgaccgagg ccattctggt cgaggacgac 601 aagctcggcc ttgccatcga caagctattc gagaatgcaa ccgacggctt ggcaggtctc 661 gatgacgtgg acctggaagg actggatgta ggcgtcaagg aaacgtccgg acaagaagat 721 actggcgcgg aggcagacga cgctcccgtc gtccgcttcg tcaacaagat gctgctggac 781 gctatcaaag gcgggtcttc cgacttgcac ttcgagccct acgagaagat ctaccgggtg 841 cgcttccgta ccgacggcat gcttcacgaa gtggccaagc cgccgatcca gttggccagt 901 cgtatctctg ctcgtctcaa ggtaatggcc ggcctggata tctccgaacg acgcaagccg 961 caggacggac ggatcaagat gcgcgtgtcg aagaccaagt ccatcgactt ccgcgtcaac 1021 accctgccga ccctgtgggg cgagaagatc gtgatgcgga tcctcgactc ctccagcgcg 1081 cagatgggca tcgacgccct gggctacgag gaggaccaga aggaactcta cctggccgca 1141 ctcaagcagc cgcagggcat gatcctggtc accggcccca ccggctcggg caagacggta 1201 tcgctgtaca ccggcctgaa catcctcaat accaccgaca tcaacatttc caccgccgaa 1261 gacccggtgg agatcaacct ggaaggcatc aaccaggtca acgtcaatcc gcgccagggc 1321 atggacttct cccaggcgct gcgcgccttc ctgcgccagg acccggacgt gatcatggtc 1381 ggcgagatcc gcgacctgga gaccgccgag atcgccatca aggcggcgca gaccgggcat 1441 atggtgatgt ccaccctgca caccaacagc gccgccgaga ccctgacccg cctgctgaac 1501 atgggcgtac cggcgttcaa cctggcgacc tcggtgaacc tgatcatcgc ccagcgcctt 1561 gcgcgaaaac tctgttcgca ctgcaagaaa gagcacgacg taccgaagga aaccctgctt 1621 cacgaaggtt tcccagaaga gttgatcggc accttcaagc tgtattcacc agtgggctgc 1681 gaccattgca agaacggtta caagggccgt gtcggtattt atgaagtagt taaaaacacc 1741 ccggccctgc agcggattat catggaggaa ggcaactcca tcgagatcgc cgagcaagcc 1801 cgcaaagaag gcttcaacga tctgcgcacc tcaggcctgc tgaaagccat gcaggggatc 1861 accagcctgg aggaagtcaa ccgcgtgacc aaggattaat ccatggcgga caaagcgtta 1921 aaaaccagcg ttttcatctg ggagggcacc gacaagaaag gcgccaaggt caagggcgaa 1981 ctgaccgggc agaatcccat gctggtgaaa gcccatctgc gcaagcaagg catcaatccg 2041 ctcaaggtac gcaagaaagg tatctccctg ctgggcgcag gcaagaaagt gaaacccatg 2101 gacatcgccc tgttcacccg gcagatggcg accatgatgg gcgctggcgt tcccctcctg 2161 caatcgttcg acatcatcgg cgagggcttc gacaacccca acatgcgcaa gcttgtggat 2221 gaaatcaaac aggaagtttc ctcaggtaac agcctagcca actccttgag aaaaaagccc 2281 cagtattttg acgagcttta ttgcaacctg gtagatgcag gggaacagtc tggcgccttg 2341 gaaaaccttc tcgatcgggt ggcaacctat aaagaaaaga cggaatcact gaaagccaag 2401 atcaaaaagg cgatgaccta tcccattgcc gtcatcattg tcgcactgat tgtatctgcg 2461 atcctcctga ttaaagtggt tccacaattt cagtcggtct ttgaaggttt cggcgcggaa 2521 cttcccgcct ttacccagat gattgtcaat ctatcggagt tcatgcagga gtggtggttc 2581 ttcatcatac tggcgatagc gatatttggc tttgcattca aagaattgca taaacgctca 2641 caaaaattcc gtgacacact cgacagaacg atcctcaaac ttcccatttt cggaggcatc 2701 gtctacaaat ctgcggtcgc ccgttatgca cggaccttgt ccacgacctt cgccgcgggt 2761 gttcccctgg tcgatgcgct cgactccgtc tccggagcga ccggcaatat cgtgttcaag 2821 aacgcggtca gcaagatcaa gcaagacgtt tccaccggca tgcagctcaa cttctccatg 2881 cgcaccacca gcgtctttcc caacatggcg atccagatga ccgccatcgg cgaggagtcc 2941 ggttcgctcg atgagatgct gagcaaagtc gccagctact acgaagagga agtcgacaac 3001 gccgtggaca acctcaccac gctcatggaa ccgatgatca tggccgttct cggcgtactg 3061 gttggcggtc tgatcgtggc catgtacctt ccgatcttcc aactcggcaa cgtcgtcgga 3121 taactgatgc ccctcctcga ctacctggcc agccatccgc tggcctttgt cttgtgcgcc 3181 atcctccttg gcctgctggt cggcagcttc ctcaacgtcg tcgtgcatcg cctgccgaag 3241 atgatggagc gcaactggaa ggccgaagcc cgcgaagcgc tcgggctgga gccggaaccc 3301 aagcaggcga cctacaacct ggtgctgccg aattcggcgt gcccgcgctg cggccacgag 3361 atccgcccct gggaaaacat cccgctggtc agctacctgg cgctgggcgg caagtgctcg 3421 tcctgcaagg ccgccatcgg caaacgctac cccctggtgg aactggccac cgcgctgctt 3481 tccggctacg tcgcctggca tttcggcttc acctggcagg cgggcgcgat gctcctgctg 3541 acctggggtc tgctggcgat gagcctgatc gatgccgatc accaactgct gccggacgtg 3601 ctggtgctgc cgctgctctg gctgggcctg atcgccaacc atttcggcct cttcgccagc 3661 ctcgacgatg cgctgttcgg cgcggtgttc ggctacctga gcctgtggtc ggtgttctgg 3721 ctgttcaagc tggtgaccgg caaggagggc atgggctacg gcgacttcaa gctgctggcc 3781 atgctcggtg cctggggtgg ctggcagatc ctgccgctga ccatcctgct gtcgtcgctg 3841 gtcggggcga tcctcggggt gatcatgctg cgcctgcgca acgccgaaag cggcacgccg 3901 atccccttcg gtccctatct ggctattgcc gggtggattg ctttgctctg gggtgatcaa 3961 ataacccgga cctatctgca attcgccgga ttcaaatgac ccaaccc // LOCUS RADHRPER 955 bp ds-DNA SYN 18-JUL-1990 DEFINITION Synthetic horseradish peroxidase isoenzyme C (HRP-C) subunit alpha-1 DNA, complete cds. ACCESSION J05552 KEYWORDS peroxidase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 955) AUTHORS Smith,A.T., Santama,N., Dacey,S., Edwards,M., Bray,R.C., Thornely,R.N.F. and Burke,J.F. TITLE Expression of a synthetic gene for horseradish peroxidase C in E.coli and folding and activation of the recombinant enzyme with calcium and heme JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.T.Smith, 13-JUN-1990. FEATURES from to/span description pept 11 940 synthetic horseradish peroxidase isoenzyme C (HRP-C) subunit alpha-1 (E.C. 1.11.1.7) BASE COUNT 246 a 269 c 221 g 219 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttaacc atgcagttaa cccctacatt ctacgacaat agctgtccca acgtgtccaa 61 catcgttcgc gacacaatcg tcaacgagct cagatccgat cccaggatcg ctgcttcaat 121 attacgtctg cacttccatg actgcttcgt gaatggttgc gacgctagca tattactgga 181 caacaccacc agtttccgca ctgaaaagga tgcattcggg aacgctaaca gcgccagggg 241 ctttccagtg atcgatcgca tgaaggctgc cgttgagtca gcatgcccac gaacagtcag 301 ttgtgcagac ctgctgacta tagctgcgca acagagcgtg actcttgcag gcggaccgtc 361 ctggagagtg ccgctcggtc gacgtgactc cctacaggca ttcctagatc tggccaacgc 421 caacttgcct gctccattct tcaccctgcc ccagctgaag gatagcttta gaaacgtggg 481 tctgaatcgc tcgagtgacc ttgtggctct gtccggagga cacacatttg gaaagaacca 541 gtgtaggttc atcatggata ggctctacaa tttcagcaac actgggttac ctgaccccac 601 gctgaacact acgtatctcc agacactgag aggcttgtgc ccactgaatg gcaacctcag 661 tgcactagtg gactttgatc tgcggacccc aaccatcttc gataacaagt actatgtgaa 721 tctagaggag cagaaaggcc tgatacagag tgatcaagaa ctgtttagca gtccaaacgc 781 cactgacacc atcccactgg tgagaagttt tgctaactct actcaaacct tctttaacgc 841 cttcgtggaa gccatggacc gtatgggtaa cattacccct ctgacgggta cccaaggcca 901 gattcgtctg aactgcagag tggtcaacag caactcttaa taaggatccg aattc // LOCUS RK2KORC 1701 bp ds-DNA BCT 18-JUL-1990 DEFINITION Plasmid RK2 kil regulatory protein (korC) (complete cds), kleA (5' end), and Tn1 bla protein (3' end) genes. ACCESSION M32794 KEYWORDS Tn1 bla protein; kil regulatory protein; kil-kor regulon; kleA protein; korC protein. SOURCE Plasmid RK2 DNA, passed in E.coli MV10. ORGANISM Plasmid RK2 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1701) AUTHORS Kornacki,J.A., Burlage,R.S. and Figurski,D.H. TITLE The kil-kor regulon of broad host-range Plasmid RK2: Nucleotide sequence, polypeptide product and expression of regulatory gene korC JOURNAL J. Bacteriol. 172, 3040-3050 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by D.H.Figurski, 13-MAR-1990. FEATURES from to/span description pept < 1 320 Tn1 bla protein pept 1169 1426 korC protein pept 1652 > 1701 kleA protein pept.ps 468 1146 interrupted ORFX trns < 1 467 Tn1 remnant mRNA 1153 1509 korC mRNA mRNA 1637 > 1701 kleA mRNA binding 1153 1161 korC ribosome binding site binding 1637 1641 kleA ribosome binding site signal 1568 1597 kleA promoter signal 1509 1526 pot. transcription termination signal site 1459 1471 KorB box site 1552 1562 KorA box site 1577 1593 KorC box BASE COUNT 359 a 481 c 523 g 338 t ORIGIN Map position 4.3'-6.0'. 1 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 61 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 121 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtggatctc 181 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 241 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 301 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 361 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 421 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccttt tatttccgaa 481 acgggctatc ggtcccatta cgaccatgcg cgcggcggca tgacggtgga ccaggtagcg 541 gacggggttt tgcgcgccct gttgcgctcg catcgccggt atctggacgc ccgcgaccag 601 gaccgccttg ccgacgagcc actgccggcg tggctcgcgg gtatcacgcc accgccgcgc 661 cgcgtgcggg ccgtggtcga ggactggcgg aagccggacg aactgccgcc gggcttcgcc 721 tgggttgatg ccgtgctgcc ggcgcaccag gcattcatcg cccgcaaatg ggcggccagc 781 gccaaggcga agctcgcggc cgcgcgtgcg aaagctcagg agccggccgg gcagcgcagg 841 gagccggtta caccggccaa gccggagccg gagccggcca aggacgagga cgcgccggcg 901 tggccggcga cgttctttcc tggcctgcgg tgcgagattg tgagcgtgca tcacccggtt 961 tttgccaagg aaatcgggaa gcacgtcatc attacgaaga tcagtcccga aacgcgacag 1021 gtgtgggcgc acgacgacaa gccgccgcgc tatcgcatca atcgcaacgg tagaaaggtt 1081 tgtgaatatg acccgcgttg catcgagtcg tgctacggct acgaccaatt gcgggccgcc 1141 atttgataac gattaggaga aaagcagcat gagcgacgtg aatatccggc ttgagtgcct 1201 gcgcccggcg gaacgctggg tgcagccgac cggcgcagaa atccgggaag tcttgcactt 1261 ggccggcctc accggcggac aggctgcgcg catcttgggc ttgggtgcca agggcgaccg 1321 cacggtgcgg cgttgggttg gcgaggattc gccgatcccc tatgccgcct gggcgatcct 1381 ttgcgatcta gcggggattg gggcgatctg gaaaggccag ggctgacgcc cgcaggactt 1441 tacgccaagg gagaggactt tagcggctaa aaccgccaac cctgatcgtt gcgatatgct 1501 gccagcagcc cggcccgcaa gccgggtttt ttcgtgcgcc cgccgcaaat tgtttagcta 1561 aatttccttg actatctagg gcataatgcc ctaatatagc aatccaaggc cgggcacttc 1621 gcccaggtca gcaaccggag gatcaacccc gatgaaaagc aagatcatgt cttggctcga 1681 tgagctgccc ggcgcggccg c // LOCUS STMSUAC 1380 bp ds-DNA BCT 18-JUL-1990 DEFINITION S.griseolus herbicide-inducible cytochrome P-450-SU1 (suaC) gene, complete cds. ACCESSION M32238 KEYWORDS cytochrome P-450-SU1. SOURCE S.griseolus (strain ATCC 11796) DNA, subclone pCAO302. ORGANISM Streptomyces griseolus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1380) AUTHORS Omer,C.A., Lenstra,R., Litle,P.J., Dean,C., Tepperman,J.M., Leto,K.J., Romesser,J.A. and O'Keefe,D.P. TITLE Genes for two herbicide-inducible cytochromes P-450 from Streptomyces griseolus JOURNAL J. Bacteriol. 172, 3335-3345 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Omer, 23-FEB-1990. FEATURES from to/span description pept 128 1348 cytochrome P-450-SU1 (suaC) binding 117 120 ribosomal binding site BASE COUNT 236 a 503 c 445 g 196 t ORIGIN 1 gcggacaggg ggactcctga agatgtctga tagaggccgt tgcgttctct acgggggcaa 61 gtctatgctc cgaaatagag aacatggcgt tctttaaagg tgagaattct tgaatcggag 121 tggaccgatg accgataccg ccacgacgcc ccagaccacg gacgcacccg ccttcccgag 181 caaccggagc tgtccctacc agttaccgga cggctacgcc cagctccggg acacccccgg 241 ccccctgcac cgggtgacgc tctacgacgg ccgtcaggcg tgggtggtga ccaagcacga 301 ggccgcgcgc aaactgctcg gcgacccccg gctgtcctcc aaccggacgg acgacaactt 361 ccccgccacg tcaccgcgct tcgaggccgt ccgggagagc ccgcaggcgt tcatcggcct 421 ggacccgccc gagcacggca cccggcggcg gatgacgatc agcgagttca ccgtcaagcg 481 gatcaagggc atgcgccccg aggtcgagga ggtggtgcac ggcttcctcg acgagatgct 541 ggccgccggc ccgaccgccg acctggtcag tcagttcgcg ctgccggtgc cctccatggt 601 gatctgccga ctcctcggcg tgccctacgc cgaccacgag ttcttccagg acgcgagcaa 661 gcggctggtg cagtccacgg acgcgcagag cgcgctcacc gcgcggaacg acctcgcggg 721 ttacctggac ggcctcatca cccagttcca gaccgaaccg ggcgcgggcc tggtgggcgc 781 tctggtcgcc gaccagctgg ccaacggcga gatcgaccgt gaggaactga tctccaccgc 841 gatgctgctc ctcatcgccg gccacgagac cacggcctcg atgacctccc tcagcgtgat 901 caccctgctg gaccaccccg agcagtacgc cgccctgcgc gccgaccgca gcctcgtgcc 961 cggcgcggtg gaggaactgc tccgctacct cgccatcgcc gacatcgcgg gcggccgcgt 1021 cgccacggcg gacatcgagg tcgaggggca cctcatccgg gccggcgagg gcgtgatcgt 1081 cgtcaactcg atagccaacc gggacggcac ggtgtacgag gacccggacg ccctcgacat 1141 ccaccgctcc gcgcgccacc acctcgcctt cggcttcggc gtgcaccagt gcctgggcca 1201 gaacctcgcc cggctggagc tggaggtcat cctcaacgcc ctcatggacc gcgtcccgac 1261 gctgcgactg gccgtccccg tcgagcagtt ggtgctgcgg ccgggtacga cgatccaggg 1321 cgtcaacgaa ctcccggtca cctggtgacg ggggagaggg gcaaggacat gaccatgcgg // LOCUS STMSUBC 1441 bp ds-DNA BCT 18-JUL-1990 DEFINITION S.griseolus cytochrome P-450-SU2 (subC) gene, complete cds. ACCESSION M32239 KEYWORDS cytochrome P-450-SU2. SOURCE S.griseolus (strain ATCC 11796) DNA, subclone pCAO304. ORGANISM Streptomyces griseolus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1441) AUTHORS Omer,C.A., Lenstra,R., Litle,P.J., Dean,C., Tepperman,J.M., Leto,K.J., Romesser,J.A. and O'Keefe,D.P. TITLE Genes for two herbicide-inducible cytochromes P-450 from Streptomyces griseolus JOURNAL J. Bacteriol. 172, 3335-3345 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.A.Omer, 23-FEB-1990. FEATURES from to/span description pept 195 1406 cytochrome P-450-SU2 (subC) binding 182 186 ribosomal binding site BASE COUNT 207 a 605 c 432 g 197 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccggcc accgcccgac ccgtccgcac tccgccccgc cgaccgtcgt ccatccgccc 61 ctgcggccat gcggtttgag ccaacctcgg tgctgccgcg atctgccctt ccctcccccg 121 ccgggcctgc gttagcgtga cgacatctta attacctaag ttaggtaatt agctcacgcg 181 gaaggaccgg ccgcatgacg accgcagaac gcaccgctcc ccccgacgcc ctcaccgtcc 241 cggccagccg cgcccccggc tgccccttcg accccgcgcc cgacgtcacc gaggcggccc 301 gcaccgaacc ggtcacccgg gccaccctct gggacggctc ctcctgctgg ctggtgacgc 361 gccatcagga cgtccgcgcg gtcctcggcg acccgcgctt cagcgccgac gcccaccgca 421 ccggcttccc cttcctgacc gccggcggcc gcgagatcat cggcaccaac ccgaccttcc 481 tgcgcatgga cgacccggag cacgcccgac tgcgccggat gctcaccgcc gacttcatcg 541 tcaagaaggt cgaggcgatg cgccccgagg tgcagcgcct cgccgacgac ctggtcgacc 601 ggatgaccac cggacgcacc tccgccgacc tggtcaccga gttcgcgctg ccgctgccgt 661 ccctggtgat ctgcctgctg ctcggcgtcc cctacgagga ccacgcgttc ttccaggagc 721 gcagccgggt cctgctcacc ctgcggtcca ctcccgagga agtccgggcc gcccaggacg 781 agttgctgga gtacctcgcc cggctcgccc ggaccaagcg ggagcggccg gacgacgcca 841 tcatcagccg cctggtcgcc cgcggcgagc tcgacgacac ccagatcgcc accatgggac 901 gcctgttgct ggtcgccggc cacgagacga ccgccaacat gaccgcgctc tccaccctcg 961 tgctgctgcg caaccccgac caactcgccc ggctgcgcgc cgaacccgcg ctcgtcaagg 1021 gcgccgtcga ggagctgctg cgctacctga cgatcgtgca caacggcgtt ccccggatcg 1081 ccaccgagga cgtgctcatc ggcggccgca ccatcgccgc cggcgagggc gtcctgtgca 1141 tgatcagctc cgccaaccgg gacgccgagg tgttccccgg cggcgacgac ctcgacgtgg 1201 cccgcgacgc ccgccgccac gtggccttcg gcttcggcgt ccaccagtgc ctgggacagc 1261 cgttggccag ggtggagctc cagatcgcca tcgaaacgct gctgcgccgc ctgccggacc 1321 tgcggctggc cgtgccccac gaggagatcc cgttccgcgg cgacatggcg atctacgggg 1381 tccactcgct gccgatcgcc tggtagcccg ggcgccccca ccaccgacca ccacgcaccc 1441 a //
GenBank-Updates@genbank.bio.net (07/19/90)
LOCUS HUMPPPB1A 3215 bp ss-mRNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) mRNA, complete cds. ACCESSION M33689 KEYWORDS protein phosphotyrosyl phosphatase. SOURCE Human placenta, cDNA to mRNA, (library of Clontech), clone lambda-16-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3215) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept 73 1380 protein phosphotyrosyl phosphatase 1B (EC 3.1.3.48) BASE COUNT 818 a 828 c 801 g 768 t ORIGIN Chromosome 20q13.1-q13.2. 1 gcgcgacgcg gcctagagcg gcagacggcg cagtgggccg agaaggaggc gcagcagccg 61 ccctggcccg tcatggagat ggaaaaggag ttcgagcaga tcgacaagtc cgggagctgg 121 gcggccattt accaggatat ccgacatgaa gccagtgact tcccatgtag agtggccaag 181 cttcctaaga acaaaaaccg aaataggtac agagacgtca gtccctttga ccatagtcgg 241 attaaactac atcaagaaga taatgactat atcaacgcta gtttgataaa aatggaagaa 301 gcccaaagga gttacattct tacccagggc cctttgccta acacatgcgg tcacttttgg 361 gagatggtgt gggagcagaa aagcaggggt gtcgtcatgc tcaacagagt gatggagaaa 421 ggttcgttaa aatgcgcaca atactggcca caaaaagaag aaaaagagat gatctttgaa 481 gacacaaatt tgaaattaac attgatctct gaagatatca agtcatatta tacagtgcga 541 cagctagaat tggaaaacct tacaacccaa gaaactcgag agatcttaca tttccactat 601 accacatggc ctgactttgg agtccctgaa tcaccagcct cattcttgaa ctttcttttc 661 aaagtccgag agtcagggtc actcagcccg gagcacgggc ccgttgtggt gcactgcagt 721 gcaggcatcg gcaggtctgg aaccttctgt ctggctgata cctgcctctt gctgatggac 781 aagaggaaag acccttcttc cgttgatatc aagaaagtgc tgttagaaat gaggaagttt 841 cggatggggc tgatccagac agccgaccag ctgcgcttct cctacctggc tgtgatcgaa 901 ggtgccaaat tcatcatggg ggactcttcc gtgcaggatc agtggaagga gctttcccac 961 gaggacctgg agcccccacc cgagcatatc cccccacctc cccggccacc caaacgaatc 1021 ctggagccac acaatgggaa atgcagggag ttcttcccaa atcaccagtg ggtgaaggaa 1081 gagacccagg aggataaaga ctgccccatc aaggaagaaa aaggaagccc cttaaatgcc 1141 gcaccctacg gcatcgaaag catgagtcaa gacactgaag ttagaagtcg ggtcgtgggg 1201 ggaagtcttc gaggtgccca ggctgcctcc ccagccaaag gggagccgtc actgcccgag 1261 aaggacgagg accatgcact gagttactgg aagcccttcc tggtcaacat gtgcgtggct 1321 acggtcctca cggccggcgc ttacctctgc tacaggttcc tgttcaacag caacacatag 1381 cctgaccctc ctccactcca cctccaccca ctgtccgcct ctgcccgcag agcccacgcc 1441 cgactagcag gcatgccgcg gtaggtaagg gccgccggac cgcgtagaga gccgggcccc 1501 ggacggacgt tggttctgca ctaaaaccca tcttccccgg atgtgtgtct cacccctcat 1561 ccttttactt tttgcccctt ccactttgag taccaaatcc acaagccatt ttttgaggag 1621 agtgaaagag agtaccatgc tggcggcgca gagggaaggg gcctacaccc gtcttggggc 1681 tcgccccacc cagggctccc tcctggagca tcccaggcgg gcggcacgcc agacagcccc 1741 ccccttgaat ctgcagggag caactctcca ctccatattt atttaaacaa ttttttcccc 1801 aaaggcatcc atagtgcact agcattttct tgaaccaata atgtattaaa attttttgat 1861 gtcagccttg catcaagggc tttatcaaaa agtacaataa taaatcctca ggtagtactg 1921 ggaatggaag gctttgccat gggcctgctg cgtcagacca gtactgggaa ggaggacggt 1981 tgtaagcagt tgttatttag tgatattgtg ggtaacgtga gaagatagaa caatgctata 2041 atatataatg aacacgtggg tatttaataa gaaacatgat gtgagattac tttgtcccgc 2101 ttattctgct ccctgttatc tgctagatct agttctcaat cactgctccc ccgtgtgtat 2161 tagaatgcat gtaaggtctt cttgtgtcct gatgaaaaat atgtgcttga aatgagaaac 2221 tttgatctct gcttactaat gtgccccatg tccaagtcca acctgcctgt gcatgacctg 2281 atcattacat ggctgtggtt cctaagcctg ttgctgaagt cattgtcgct cagcaatagg 2341 gtgcagtttt ccaggaatag gcatttgcct aattcctggc atgacactct agtgacttcc 2401 tggtgaggcc cagcctgtcc tggtacagca gggtcttgct gtaactcaga cattccaagg 2461 gtatgggaag ccatattcac acctcacgct ctggacatga tttagggaag cagggacacc 2521 ccccgccccc cacctttggg atcagcctcc gccattccaa gtcgacactc ttcttgagca 2581 gaccgtgatt tggaagagag gcacctgctg gaaaccacac ttcttgaaac agcctgggtg 2641 acggtccttt aggcagcctg ccgccgtctc tgtcccggtt caccttgccg agagaggcgc 2701 gtctgcccca ccctcaaacc ctgtggggcc tgatggtgct cacgactctt cctgcaaagg 2761 gaactgaaga cctccacatt aagtggcttt ttaacatgaa aaacacggca gctgtagctc 2821 ccgagctact ctcttgccag cattttcaca ttttgccttt ctcgtggtag aagccagtac 2881 agagaaattc tgtggtggga acattcgagg tgtcaccctg cagagctatg gtgaggtgtg 2941 gataaggctt aggtgccagg ctgtaagcat tctgagctgg cttgttgttt ttaagtcctg 3001 tatatgtatg tagtagtttg ggtgtgtata tatagtagca tttcaaaatg gacgtactgg 3061 tttaacctcc tatccttgga gagcagctgg ctctccacct tgttacacat tatgttagag 3121 aggtagcgag ctgctctgct atgtccttaa gccaatattt actcatcagg tcattatttt 3181 ttacaatggc catggaataa accattttta caaaa // LOCUS HUMPPPB1A1 276 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x. ACCESSION M33688 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 1 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 276) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept / 34 + 243 protein phosphotyrosyl phosphatase 1B, exon x (EC 3.1.3.48) (AA at 34) pre-msg < 1 > 276 PTP1B mRNA and introns IVS < 1 33 PTP1B intron x-1 IVS 244 > 276 PTP1B intron x BASE COUNT 56 a 77 c 69 g 74 t ORIGIN Chromosome 20q13.1-q13.2. 1 ctttagaatc tactagatga ttttctcttt cagacccaag aaactcgaga gatcttacat 61 ttccactata ccacatggcc tgactttgga gtccctgaat caccagcctc attcttgaac 121 tttcttttca aagtccgaga gtcagggtca ctcagcccgg agcacgggcc cgttgtggtg 181 cactgcagtg caggcatcgg caggtctgga accttctgtc tggctgatac ctgcctcttg 241 ctggtaagga ggcctcgcgg gtgccctggg gagctc // LOCUS HUMPPPB1A2 453 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+1. ACCESSION M33687 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 2 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 453) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 236 + 397 protein phosphotyrosyl phosphatase 1B, exon x+1 (EC 3.1.3.48) pre-msg < 1 > 453 PTP1B mRNA and introns IVS < 1 235 PTP1B intron x IVS 398 > 453 PTP1B intron x+1 BASE COUNT 104 a 118 c 111 g 120 t ORIGIN Chromosome 20q13.1-q13.2. 1 ggggaggtcc cagactctta accagatctc ttgtgaatgc attgcctcag ggaggcacca 61 agcctttcat gaggacctgt ccccctgacc cagacacctc ccacccagcc ccacctccaa 121 cactagggat cacatttcag catgagattg ggaggggaca gacatctaac ggtgttatta 181 acgttgccct tgagaattgg acctggctga cttatatctc ctctctggct ttcagatgga 241 caagaggaaa gacccttctt ccgttgatat caagaaagtg ctgttagaaa tgaggaagtt 301 tcggatgggg ctgatccaga cagccgacca gctgcgcttc tcctacctgg ctgtgatcga 361 aggtgccaaa ttcatcatgg gggactcttc cgtgcaggtc agcattgcct ttgtttgaat 421 ccaggtgtga ccattttaac ttttttgtct ttg // LOCUS HUMPPPB1A3 426 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+2. ACCESSION M33686 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 3 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 426) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 46 + 269 protein phosphotyrosyl phosphatase 1B, exon x+2 (EC 3.1.3.48) pre-msg < 1 > 426 PTP1B mRNA and introns IVS < 1 45 PTP1B intron x+1 IVS 270 > 426 PTP1B intron x+2 BASE COUNT 115 a 122 c 109 g 80 t ORIGIN Chromosome 20q13.1-q13.2. 1 gaagtgaaca ctaatagact tccttcctct tgctgctctt tcaaggatca gtggaaggag 61 ctttcccacg aggacctgga gcccccaccc gagcatatcc ccccacctcc ccggccaccc 121 aaacgaatcc tggagccaca caatgggaaa tgcagggagt tcttcccaaa tcaccagtgg 181 gtgaaggaag agacccagga ggataaagac tgccccatca aggaagaaaa aggaagcccc 241 ttaaatgccg caccctacgg catcgaaagg taatatattg ggtccagctt gttggggtga 301 ggggaaatga cttctgttct agaaacacac gctggtactg aaaccctgtg atgcagcctc 361 tgttggcaag cagcgcttcg catccttggg aacagggcgc tggaccaaca cccactccac 421 tggtgg // LOCUS HUMPPPB1A4 732 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+3. ACCESSION M33685 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 4 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 732) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 402 + 597 protein phosphotyrosyl phosphatase 1B, exon x+3 (EC 3.1.3.48) pre-msg < 1 > 732 PTP1B mRNA and introns IVS < 1 401 PTP1B intron x+2 IVS 598 > 732 PTP1B intron x+3 BASE COUNT 164 a 193 c 205 g 170 t ORIGIN Chromosome 20q13.1-q13.2. 1 tctgtagctc taaagaatga gatctggtgt actgatgtgg ccagacattg caattgcagt 61 acatgagaag gcaaatcata cagtagtgtg tacaccagtg agtcctccag ccagataaat 121 cctcacagtg accagtcgcc caggcacctt gtgaacccta ccctgggtgt gggtgctatc 181 tgaagtacct gggggagggg gtgacaagtg gacttcaggc tgatgtggcc ctggcctggc 241 cctccctcca agcagagggg gctggcacgc tggaaggtta acatcatcca actctgtcta 301 cacgtggctt gttttttcct agaattcctg ccacaatagc agcatccttg ccattcattt 361 tctccaaagt gagtacccat ctctgccctc tgattcctca gcatgagtca agacactgaa 421 gttagaagtc gggtcgtggg gggaagtctt cgaggtgccc aggctgcctc cccagccaaa 481 ggggagccgt cactgcccga gaaggacgag gaccatgcac tgagttactg gaagcccttc 541 ctggtcaaca tgtgcgtggc tacggtcctc acggccggcg cttacctctg ctacagggta 601 tgtttccact gacagacgcg ctgggcagat gctcgtgtgc agagagcact ggccgctagc 661 ccgatggtag gattcagttc tgtggtgcat ctgagccagt ctcagaagaa acagatcaag 721 gttttaagtc tg // LOCUS HUMPPPB1A5 365 bp ds-DNA PRI 19-JUL-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+4. ACCESSION M33684 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 5 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 365) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 266 289 protein phosphotyrosyl phosphatase 1B, exon x+4 (EC 3.1.3.48) pre-msg < 1 > 365 PTP1B mRNA and introns IVS < 1 265 PTP1B intron x+3 BASE COUNT 80 a 101 c 88 g 96 t ORIGIN Chromosome 20q13.1-q13.2. 1 tacctcctaa gacttttacg gttttaaata ttttacctct ttccaggtgg catctgagta 61 catcagatgg ttttgcaaaa tgcaaacaat tttttccttg gggatgattt ttggggagag 121 ggggctactg taaaaaataa aaccaaaacc ccctttgctc cctcggaggt tgaagttgcc 181 ggggggtgtg gccggggtca tgcatgaggc gacagcactg caggtgcggg tctgggctca 241 tctgaactgt ttggtttcat tccagttcct gttcaacagc aacacatagc ctgaccctcc 301 tccactccac ctccacccac tgtccgcctc tgcccgcaga gcccacgccc gactagcagg 361 catgc // LOCUS HB3HBLA 1319 bp ds-DNA PHG 19-JUL-1990 DEFINITION Bacteriophage HB-3 amidase (hbl) gene, complete cds. ACCESSION M34652 KEYWORDS amidase. SOURCE Bacteriophage HB-3 (host Streptococcus pneumoniae) DNA. ORGANISM Bacteriophage HB-3 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1319) AUTHORS Romero,A., Lopez,R. and Garcia,P. TITLE Sequence of the Streptococcus pneumoniae bacteriophage HB-3 amidase reveals high homology with the major host autolysin JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Romero, 25-MAY-1990. Consejo superior de Investigaciones Clentificas Centro de Investigaciones Biologicas Velazquez 144 Madrid, 28006 SPAIN FEATURES from to/span description pept 298 1254 amidase (hbl) binding 283 289 ribosomal binding site (put.) site 1285 1305 transcription stop sequence BASE COUNT 412 a 268 c 334 g 305 t ORIGIN 1 aagcttttta acagtagcag taggcggtat tgtaaaagca gtaaaagatt atcttttgcg 61 taaaggcgga gagaaagcgg tgatcatcgc tgaaattcta gctaaaatgc agttcatgcc 121 gttgagcaag tagcttcaga gactggctat aagggcgaag aaaagctgga gcaggctcgt 181 gctaaagtcc gtgctgagct tacaaaatac aatattagca tgactgacaa aaacttagac 241 accttcgtag agtcagcagt gaagcagatg aatgacgcat ggaaaggacg atagggaatg 301 gatatcgata gaaacagact acgtacaggc ttgccccagg ttggggtgca gccttatcga 361 caagtacatg ctcactcaac aggtaaccgc aactcaaccg tacagaatga agcggattat 421 cactggcgga aagacccaga attaggtttt ttctcgcacg ttgttgggaa ctttcgcatc 481 atgcaggtcg gacctgtgaa caacggaagt tgggatgttg ggggcggttg gaatgctgag 541 acctatgcag cggttgaact gattgaaagc cattcaacta aggaagagtt tatggctgac 601 tatcgcctct atatcgaatt gctacgcaat ctagcggacg aagcaggctt gccgaagact 661 cttgatacag acgacttggc aggtatcaag acgcatgaat actgtaccaa taaccaacca 721 aacaaccact cagaccatgt ggatccatat ccatatcttg caagttgggg cattagccgt 781 gaacagttta agcaagacat cgaaaacggc ttgagcgctg caacaggctg gcagaaaaat 841 ggcactggct actggtacgt acattcagac ggctcttatt caaaagataa gtttgagaaa 901 atcaacggta cctggtatta tttcgatggc tcaggctata tgctttcaga ccgctggaag 961 aagcacacag acggtaattg gtactacttt gaccaatcag gcgaaatggc cacaggctgg 1021 aagaaaatcg ctgacaagtg gtactatttt gatgtagaag gtgccatgaa gacaggctgg 1081 gtcaagtaca aggacacttg gtactactta gacgctaaag aaggcgccat ggtatcaaat 1141 gcctttatcc agtcagcgga cggaacaggc tggtactacc tcaaaccaga cggaacactg 1201 gcagacaagc cagagttcac agtagagcca gatggcttga ttacagttaa ataaatagaa 1261 aggaaacttt ctaaattgtt cttcaccgca gctcaggctt acggtttttt tgttttaaa // LOCUS FIBGLUC 1426 bp ds-DNA BCT 19-JUL-1990 DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene, complete cds. ACCESSION M33676 KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase. SOURCE F.succinogenes (strain S85) DNA, clone PJI5. ORGANISM Fibrobacter succinogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1426) AUTHORS Teather,R.M. and Erfle,J.D. TITLE DNA sequence of a Fibrobacter succinogenes mixed linkage beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene JOURNAL J. Bacteriol. 172, 3837-3841 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.M.Teather, 11-APR-1990. FEATURES from to/span description pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase precursor (EC 3.2.1.73) sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase signal peptide matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase binding 132 137 ribosome binding site signal 62 66 -35 region signal 85 90 -10 region BASE COUNT 371 a 346 c 335 g 374 t ORIGIN 1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt 61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt 121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc 181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt 241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg 301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc 361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc 421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct 481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac 541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac 601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt 661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat 721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat 781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt 841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc 901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa 961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct 1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac 1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt 1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg 1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt 1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa 1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt 1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa // LOCUS PP1BOFFO 931 bp ds-DNA PHG 19-JUL-1990 DEFINITION Bacteriophage P1 regulatory protein (bof) gene, complete cds. ACCESSION M33224 KEYWORDS regulatory protein. SOURCE Bacteriophage P1 viral DNA. ORGANISM Bacteriophage P1 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 931) AUTHORS Schaefer,T.S. and Hays,J.B. TITLE The bof gene of bacteriophage P1. DNA sequence and evidence for roles in regulation of phage c1 and ref genes JOURNAL J. Bacteriol. 172, 3269-3277 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.B.Hayes, 26-MAR-1990. FEATURES from to/span description pept > 544 789 Bof regulatory protein site 541 543 potential ttg start codon for Bof BASE COUNT 260 a 211 c 243 g 217 t ORIGIN Map position 9-10. 1 gggtaactgg tggattatcg agacaaaaca caacgtggcg gacgttctgg ccgtcatcca 61 acaatacgca taacaggagc gcccggttcg cgctgcgcat aatatggcca cactatctga 121 tacaataaaa ccgaataaaa catatcttga ggcggtactg cgtacggcat tattaggaaa 181 gacagaagac gaatacgttg atttcttcct gtcagggcta cgcgggcgat tactgaaaaa 241 tccccgcctg taccgcagct atggcccata ctggcggaaa ttaaaaaatt attactggag 301 cgacggttat ggtaatttcg gtcgtctcgt tgaccgtgac gttcgcaaat tttaccgtta 361 tgaccgcccg gcgctaacac tcatagccgc gacgctctac agccatgagc gttttgataa 421 tggtcagata tactcagcct ggcatttact gccagtccct gaagaagttg acgaccagga 481 ctatgagttt gagtcttacg atttggaagt tgaagccttg gcacaggctg gagagaaaac 541 ttgaaaaagc gatactacac agtaaagcat gggacgctac gagcattaca agagtttgct 601 gacaagcata acgttgaggt gcgcagggaa gggggaagta aagctctgcg catgtaccgt 661 ccggacggga aatggcggac ggtcgtcgat ttcaaaacaa acagtgttcc ccagggcgtc 721 cgtgaccggg cattcgaaga atgggagcag atcatcatag ataatgcatt gcttctcaat 781 gcggattaaa cttccccaaa ttagggctgt ttgctcaccg agcatcgctc aaagaagcac 841 gattcttcaa acatatagat agtgatagtg ccacaacttc tggctctaac gggctgggga 901 ggcggcgctt tgttgctaaa tgatctggtt t // LOCUS STRTEE6 2508 bp ds-DNA BCT 19-JUL-1990 DEFINITION S.pyogenes trypsin-resistant surface T6 protein (tee6) gene, complete cds. ACCESSION M32978 KEYWORDS surface protein; trypsin-resistant surface T6 protein. SOURCE S.pyogenes (strain D471, sub-species M-type 6) DNA. ORGANISM Streptococcus pyogenes Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 2508) AUTHORS Schneewind,O., Jones,K.F. and Fischetti,V.A. TITLE Sequence and structural characteristics of the trypsin-resistant T6 surface protein of group A streptococci JOURNAL J. Bacteriol. 172, 3310-3317 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.Schneewind, 18-MAR-1990. FEATURES from to/span description pept 719 2332 trypsin-resistant surface T6 protein (tee6) precursor sigp 719 784 trypsin-resistant surface T6 protein signal peptide matp 785 2329 trypsin-resistant surface T6 protein binding 706 709 ribosome binding site signal 2353 2382 transcription terminator BASE COUNT 929 a 364 c 480 g 735 t ORIGIN 1 aagcttcaga tgaagcctat gagaagtata aggataacga aggaagatat agcgaaatgg 61 gagattccga tactgattat ggaaccaacc aaactagttc tggaaaaggt ggtttgcctt 121 ctaattcaga tgcttcggtt aattatatgg cagatggtcg tgaacagaaa ttaccttata 181 agcacccagt gattcaggtc aaaacagtac caatcacgtt taccaaagta gatgctgaca 241 acaaccagaa aaaacttgca ggtgttgagt ttgaactccg taaagaggac aagaagatcg 301 tctgggaaaa gggaacaaca ggttcaaatg gccaactcaa ctttaagtac cttcaaaaag 361 gcaaaaccta ttatctgtat gagacgaagg caaaacttgg atacactctt ccagaaaatc 421 catgggaagt tgccgttgct aacaacggtg atataaaagt aaaacacccg attgaaggtg 481 aattgaagtc aaaagatggc tcttacatga ttaaaaatta taagatttat cagttgccat 541 cgtctggggg aagaggaagt caaattttca ttatagttgg tagcatgaca gcaactgtag 601 cattattatt ttatagacgc caacacagga aaaagcaata ttaaattaat gatcatattt 661 attgacaaac aggagagaaa cagtgagaga gaagatatta ataacagcaa aaaaactaat 721 gctagcttgt ttagctatct tagcggtagt agggcttgga atgacaagag tatcagcttt 781 atcaaaagat gatactgcac aactaaagat aacaaatatt gaaggtgggc caacagtaac 841 actttataaa ataggagaag gtgtttacaa cactaatggt gattctttta ttaactttaa 901 atatgctgag ggggtttctt taactgaaac aggacctaca tcacaagaaa ttactactat 961 tgcaaatggt attaatacgg gtaaaataaa gccttttagt actgaaaacg ttagtatttc 1021 taatggaaca gcaacttata atgcgagagg tgcatctgtt tatattgcat tattaacagg 1081 tgcgacagat ggccgtacct acaatcctat tttattagct gcatcttata atggtgaggg 1141 aaatttagtt actaaaaata ttgattccaa atctaattat ttatatggac aaacaagtgt 1201 tgcaaaatca tcattaccat ctattacaaa gaaagtaacc gggacaatag atgacgtgaa 1261 taaaaagact acctcgttag gaagtgtatt gtcttattcg ctgacatttg aattaccaag 1321 ttataccaaa gaagcagtca ataaaacagt atatgtttct gataatatgt cggaaggtct 1381 tacttttaac tttaatagtc ttacagtaga atggaaaggt aagatggcta atattactga 1441 agatggttca gtaatggtag aaaatacaaa aatcggaata gctaaggagg ttaataacgg 1501 ttttaattta agttttattt atgatagttt agaatctata tcaccaaata taagttataa 1561 agctgttgta aacaataaag ctattgttgg tgaagagggt aatcctaata aagctgaatt 1621 cttctattca aataatccaa caaaaggtaa tacatacgat aatttagata ggaagcctga 1681 taaagggaat ggtattacat ccaaagaaga ttctaaaatt gtttatactt atcaaatagc 1741 gtttagaaaa gttgatagtg ttagtaagac cccacttatt ggtgcaattt ttggagttta 1801 tgatactagt aataaattaa ttgatattgt tacaaccaat aaaaatggat atgctatttc 1861 aacacaagta tcttcaggaa aatataaaat taaggaatta aaagctccta aaggttattc 1921 attgaataca gaaacttatg aaattacggc aaattgggta actgctacag tcaagacaag 1981 tgctaattca aaaagtacta cttatacatc tgataaaaat aaggcgacag ataattcaga 2041 gcaagtagga tggttaaaaa atggtatatt ctattctata gatagtagac ctacaggaaa 2101 tgatgttaaa gaggcttata ttgaatctac taaggcttta actgatggaa caactttctc 2161 aaaatcgaat gaaggttcag gtacagtatt attagaaact gacatcccta acaccaagct 2221 aggtgaatta ccttcgacag gtagcattgg tacttacctc tttaaagcta ttggttcggc 2281 tgctatgatt ggtgcaattg gtatttatat tgttaaacgt cgtaaagctt aatgctttca 2341 aaagtcgaaa tcaatcgaga ctgtctttat gcggtctcga tttttaatga taaggaactg 2401 ctatgacaga aagactaaaa aatctaggga tactcttatt atttttattg ggaacagcca 2461 tttttcttta ccctacgcta agtagtcagt ggaatgccta tcgtgatc // LOCUS HALHPA 1317 bp ds-DNA BCT 19-JUL-1990 DEFINITION H.volcanii histidinol-phosphate-aminotransferase (hisC) gene, complete cds. ACCESSION M33161 KEYWORDS histidinol-phosphate-aminotransferase. SOURCE H.volcanii (strain DSM 3757) cell line WFD 18 DNA, clone 477. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 1317) AUTHORS Conover,R.K. and Doolittle,W.F. TITLE Characterization of a gene involved in histidine biosynthesis in Halobacterium (Haloferax) volcanii: Isolation and rapid mapping by transformation of an auxotroph with cosmid DNA JOURNAL J. Bacteriol. 172, 3244-3249 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.K.Conover, 22-MAR-1990. FEATURES from to/span description pept 121 1206 histidinol-phosphate-aminotransferase (hisC) (EC 2.6.1.9) BASE COUNT 234 a 504 c 410 g 169 t ORIGIN 1 agtcgttcgg gcggcgctcg gctgacggcc gtcggtcgtc gcgtccccaa cccgaccccc 61 taccgccacg tccgacccgg agtacgcacc cttaagaacc gcgacccgca ttttccgacc 121 atgcaaccac gggacctctc cgcgcacgct ccctacgtac ccggccgcgg gacagaggag 181 gtcgcccgcg aactcggaat ggaccccgag gacctgacga aactctcctc gaacgagaac 241 ccccacggcc cgagtccgaa ggcggtcgcc gccatcgaag acgccgcgcc gaccgtgagc 301 gtctacccga agaccgccca cacggacctg accgaacgcc tcgccgacaa gtggggcctc 361 gcacccgaac aggtgtgggt gtctcccggc gcggacggct ctatcgacta cctgacccgc 421 gcggtgctcg aaccggacga ccggattctc gaacccgcgc ccggcttttc gtactactcg 481 atgagcgccc gctaccacca cggcgacgcc gtccagtacg aggtgtcgaa ggacgacgac 541 ttcgaacaga ccgccgacct cgtcctcgac gcctacgacg gcgagcgcat ggtctacctc 601 acaacgccgc acaaccccac cggttccgtg ctcccgcggg aggaactcgt cgaactggcc 661 gagtcggtcg aagagcacac gctcctcgtc gtcgacgagg cctacggcga gttcgccgag 721 gagccgtcgg ccatcgacct cttgtcggag tacgacaacg tcgcggccct gcggacgttc 781 tcgaaggcgt acgggctggc cggcctccgc atcggctacg cctgcgtgcc cgaggcgtgg 841 gccgacgcct acgcccgcgt gaacacgccg ttcgccgcca gcgaggtcgc ctgccgcgcc 901 gcgctcgccg cgctcgacga cgaggaacac gtcgagaaat ccgtcgagtc ggcccggtgg 961 tcccgcgact atctccgcga acacctcgac gcgccgacgt gggaaagcga gggcaacttc 1021 gtcctcgtcg aggtcggcga cgccacggcc gtcaccgagg ccgcccagcg cgagggcgtc 1081 atcgtccgcg actgcgggag cttcggcctg ccggagtgca tccgcgtctc ctgcggcacg 1141 gaaacccaga ccaagcgcgc cgtggacgtg ctcaaccgca tcgtctcgga ggtgccgacg 1201 gcgtgagaga cgacgacacc ggcacgcccg gcaccggaaa gaccacggcg accgagccgg 1261 tcgccgccga cctcgacctc gacgtggtcc acctcaaccg actcgtgaaa gacgagg // LOCUS BOVGOA 472 bp ss-mRNA MAM 19-JUL-1990 DEFINITION B.taurus go-alpha mRNA, 3' end. ACCESSION J02900 KEYWORDS go-alpha. SOURCE B.taurus retina, cDNA to mRNA, clone GO3.1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (sites) AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M., Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M. TITLE Multiple forms of go-alpha mRNA: Analysis of the 3'-untranslated regions JOURNAL Biochemistry 29, 5069-5076 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 472) AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M., Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by S.R.Price, 12-JUN-1990. FEATURES from to/span description pept < 1 3 go-alpha (AA at 1) BASE COUNT 130 a 133 c 88 g 121 t ORIGIN 1 tgacctcttg tcctgtatag caacctattt ggtaatgatt ccagcactca cagaaaagct 61 tgcacacata cacacacacc ccacccctcc ccactaacaa atgcaagttg gtaaacaaat 121 tccaaaaagg cataacaaac cttatatata tagacaaata tatattaaag ttttttagtc 181 tgtactagaa agagcttcag acagaactga ccaccattcc attgctcatc aatttcctgg 241 gacagcacct gagcgtgcgc ttacgcgcgt acacacacat agacacgcac tgcgatacaa 301 gtcctgattt gggagtccgt ccttttaaaa acagccacat gctttcacgc tctgagaccc 361 acccgtttct gtgagcaggg ggagggcaag gaaagccctg gcctcagtcc agccttttct 421 ctgcttccac ctgctcaggc tgtgtgctct tggttctgtc ctgcacttgt gt // LOCUS CAJCAT 1334 bp ds-DNA BCT 19-JUL-1990 DEFINITION C.coli plasmid C-589 chloramphenicol acetyltransferase (cat) gene, complete cds. ACCESSION M35190 KEYWORDS chloramphenicol acetyltransferase. SOURCE C.coli plasmid C-589 DNA. ORGANISM Campylobacter coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1334) AUTHORS Wang,Y. and Taylor,D.E. TITLE Chloramphenicol resistance in Campylobacter coli, nucleotide sequence, expression and cloning vector construction JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Taylor, 15-JUN-1990. FEATURES from to/span description pept 309 932 chloramphenicol acetyltransferase (cat) mRNA 277 > 932 chloramphenicol acetyltransferase mRNA signal 242 271 promoter binding 297 301 ribosome binding site signal 960 1006 transcriptional termination signal BASE COUNT 433 a 232 c 282 g 387 t ORIGIN 1 attcccacaa cgccggaaac aagccgtgcc acgagcttat aataaaagag ggaagagaag 61 cgtatttttc ctcacttccg gtgaaggata tcgagaaaaa tctaaatgat aacggaattc 121 cgtcgtcggt atcgtatgga gcggacaacg agtaaaagag tgaccgccga gataacccat 181 tgctcggcgg tgttcctttc caagttaatt gcgtgatata gattgaaaag tggatagatt 241 tatgatatag tggatagatt tatgatataa tgagttatca acaaatcgga atttacggag 301 gataaatgat gcaattcaca aagattgata taaataattg gacacgaaaa gagtatttcg 361 accactattt tggcaatacg ccctgcacat atagtatgac ggtaaaactc gatatttcta 421 agttgaaaaa ggatggaaaa aagttatacc caactctttt atatggagtt acaacgatca 481 tcaatcgaca tgaagagttc aggaccgcat tagatgaaaa cggacaggta ggcgtttttt 541 cagaaatgct gccttgctac acagtttttc ataaggaaac tgaaaccttt tcgagtattt 601 ggactgagtt tacagcagac tatactgagt ttcttcagaa ctatcaaaag gatatagacg 661 cttttggtga acgaatggga atgtccgcaa agcctaatcc tccggaaaac actttccctg 721 tttctatgat accgtggaca agctttgaag gctttaactt aaatctaaaa aaaggatatg 781 actatctact gccgatattt acgtttggga agtattatga ggagggcgga aaatactata 841 ttcccttatc gattcaagtg catcatgccg tttgtgacgg ctttcatgtt tgccgttttt 901 tggatgaatt acaagacttg ctgaataaat aaaatcccag tttgtcgcac tgataaaaac 961 cctttaggaa ctaaagggcg cacttctata ctctctgtcg agagtagtgc gtcctgcgga 1021 gcttcattcc cggtcagcgc gcttatcaat atatctatag aatgggcaaa gcataaaaac 1081 ttgcatggac taatgcttga aacccaggac aataacctta tagcttgtaa attctatcat 1141 aattgtggtt tcaaaatcgg ctccgtcgat actatgttat acgccaactt tgaaaacaac 1201 tttgaaaaag ctgttttctg gtatttaagg ttttagaatg caaggaacag tgaattggag 1261 ttcgtcttgt tattaattag cttcttgggg tatctttaaa tactgtagaa agaggaagga 1321 aataataaat ggct // LOCUS CLOCBA 5120 bp ds-DNA BCT 19-JUL-1990 DEFINITION C.acetobutylicum beta-D-galactosidase (cbgA) and beta-D-galactosidase regulatory protein (cbgR) genes, complete cds. ACCESSION M35107 KEYWORDS beta-D-galactosidase; beta-D-galactosidase regulatory protein. SOURCE C.acetobutylicum (strain NCIB2951) DNA. ORGANISM Clostridium acetobutylicum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1260 to 5120) AUTHORS Hancock,K.R., Rockman,E., Pearce,L., Maddox,I.S. and Scott,D.B. TITLE Clostridium acetobutylicum beta-galactosidase gene, cbgA, is positively regulated in Escherichia coli by a novel regulatory gene, cbgR JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 5120) AUTHORS Scott,D.B., Hancock,K.R., Pearce,L. and Maddox,I.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by D.B.Scott, 11-JUN-1990. Author address:D.B.Scott: Molecular Genetics Unit Department of Microbiology and Genetics Massey University Palmerston North, New Zealand E-mail:D.B.Scott@massey.ac.nz FEATURES from to/span description pept 1560 4253 beta-D-galactosidase (cbgA) pept 4500 4805 beta-D-galactosidase regulatory protein (cbgR) BASE COUNT 1921 a 683 c 876 g 1640 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccttt tcatatatat ctttaatatt tctactggaa tagaagaggt tgctcaatac 61 aaaaaatgct tctttaaaac tatttgaaac tacttctgaa atattttcta gcttactaaa 121 tagagaatta taatttttat catcaaaatt tagaattaca actatgattt cgttttcaat 181 attagcaatt tgtatattat aattgctatt taatccgtct aaagaaaatt ctttgccgat 241 ttctgaaatt gtaaaatcaa taatttcatg gcgtttgcta taattatcat atatttcttt 301 gcgtttaaac caaataagca aaatgattga aaagtaaata tgtatcaaag tagttaaagt 361 caggatcatg tcaaaacctg atataaggcg atttaaggcg ctattagtga gacttaaaga 421 gtttccttct aaagtatttc ttttcatttt tattgaaatc ttttttagag tacttaataa 481 ctcagaagga tttagagaag gttttaaaat ataatcaaca gcaccatttt gaaaagatga 541 tttaacatat tcaaaatcgc tataactact taagatgata attcttatct taggatattt 601 gtcctgcaca aatttagcta attcaacccc atttatttgg ggcattacaa catcagaaat 661 tataatgtca ggaatatcct tttttatcat ttccagagct tcttgaccat tagaagcctg 721 tcctataatt tgaaagcctt ctttttccca atcaatcata tgagttatgc cttgccgcat 781 aataaattca tcatcaacaa ctaaattttt actatattcg ttcaatagta tagcacccct 841 tattctaaaa ttaccacaac atagataaat attgcttaat actattatac cttatagatt 901 tattgtatgt atctgtatac gttacgttaa ttcatctaca aatttatatg agttttggtt 961 gcacttttag agaaaatctt tttgtctatg gtcttattgt cctataatgg tcaaatcatc 1021 tttaccaaag tctcttgatt taaagagata aaaacaccac tgatccatta ttcctcattt 1081 tggtaatgaa cctatgcggt tgaagatatt aatcagatgt ctaaatactt tagaaaaaaa 1141 gacctttact aatatcttca atatttacac ccctattcta aaattaccac aagatagata 1201 aatattgctt aatactgatt ataccttata gattaaaggt tttcaattaa acaataaatt 1261 actttagtaa agtttagtaa aatataattg attttttact aaaaagataa taaaatgaaa 1321 ctataaattt agttaatagc ataaatctaa catcagaaga taggataaat taaagaagta 1381 atgtaattga ttacgaaaca aaatctcata ttaatattag cccataattt ttttattctc 1441 atatatgttt aagtattaat taaatgtgac tttataaaaa ggttgcattt agttaatacg 1501 attaacaact ttaatttaaa aaagcaataa ctctacaaag tgaaagtgag ggggtaagta 1561 tgattaataa taaaccgtca ttagattggc tagaaaatcc ggaaatattt agagttaata 1621 gaatagatgc tcattctgat acttggtttt atgaaaaatt tgaggatgtt aaattagaag 1681 acaccatgcc tcttaagcaa aatttaaatg gaaaatggag attttcatat agtgaaaatt 1741 catcattaag aattaaagag ttttataagg atgagtttga cgtaagttgg attgattata 1801 ttgaagttcc aggtcatatt cagcttcaag gatatgataa atgtcaatat attaatacta 1861 tgtatccttg ggaaggtcac gatgaattaa gaccacctca tatttcaaaa acatataatc 1921 cggtgggaag ctatgtaaca ttttttgaag ttaaagatga actcaaaaat aagcagactt 1981 ttatttcttt tcaaggtgtt gaaacagcat tttacgtatg ggtaaatgga gaatttgtag 2041 gatatagcga agatacattt acaccatcag aatttgatat tactgattat ttaagagagg 2101 gagaaaataa acttgcagtt gaggtttata aaaggagtag cgcaagttgg atagaagatc 2161 aagatttctg gagattttca ggcatcttta gagatgtata tttatatgca gttccagaaa 2221 ctcatgtaaa tgatatattt ataaaaacag atttatatga cgatttcaaa aacgcaaagt 2281 taaatgctga acttaaaatg attggaaatt cagaaacaac agttgaaaca tatttagaag 2341 ataaagaagg aaataaaata gctatatctg aaaagattcc gttctctgat gagttgactt 2401 tatatttaga tgcgcaaaat ataaacctat ggagtgcaga agagcctaac ttatatacac 2461 tttatatttt agtgaataaa aaagatggta atttaattga ggttgtaact caaaagatag 2521 ggtttaggca ctttgaaatg aaggataaaa ttatgtgtct aaaatggaaa cgtattatct 2581 ttaaaggcgt aaaccgtcac gaatttagcg caagacgtgg acgctcaatt acgaaagagg 2641 acatgttgtg ggatattaag ttcttgaaac aacacaatat taatgctgtt agaacatcac 2701 attatccaaa tcaaagttta tggtacagac tttgcgatga atacgggatt tatttaatag 2761 atgaaacaaa tttagaaagc catggttcat ggcaaaagat ggggcagatt gaaccatcat 2821 ggaatgtgcc aggaagtctt ccacagtggc aggcagcagt tttagatcga gcatcatcaa 2881 tggttgaaag agataaaaat catccatctg tacttatttg gtcatgtggt aatgaatcct 2941 atgcgggtga agatatttat cagatgtcta aatactttag aaaaaaagat ccttcacgtt 3001 tagtgcacta tgaaggggta actagatgca gagaatttat gacacgacga catgaaagta 3061 gaatgtatgc aaaggcagca gaaatagaag aatatcttaa tgataatccg aagaaacctt 3121 atatacagct gcgatacatg cactcaatgg gtaactcaac tggtggaatg atgaaataca 3181 cagaacttga agataaatat ttgatgtatc aaggtggatt catttgggat tacggcgatc 3241 aggcgttgta tagaaaactt ccagatggaa aagaagttct agcttatgga ggagacttta 3301 cagatcgtcc aacagactat aatttctctg gaaatggttt gatttatgca gatagaacta 3361 tatcacctaa agcacaggaa gttaagtatc tatatcaaaa cgtaaaatta gaaccagatg 3421 aaaaaggggt gactattaag aatcaaaatc tttttgttaa tactgataaa tatgatttat 3481 actatatcgt tgaaagagat ggaaaactaa taaaagatgg ttatctaaat gtatctgtag 3541 ctccagatga agaaaaatat atagaacttc caataggaaa ttacaatttt cctgaagaaa 3601 ttgtacttac aacctcatta agattagcac aagctacact ttgggcagaa aaaggatatg 3661 aaatagcatt tggacaaaag gttattaaag aaaaatcaga tatgaataat cataattcag 3721 agtctaaaat gaagatcatt catggagatg taaacatagg ggttcacgga aaagatttca 3781 aggctatatt ctctaaacaa gagggaggaa tcgtatcctt gagatataat aataaggagt 3841 ttataacgag aacgccaaaa actttctatt ggagagcaac aacagataat gatagaggaa 3901 atagacatga atttagatgc agtcaatggc tggctgctac tatggggcag aagtatgtgg 3961 atttttcagt tgaggaattt gatgagaaga ttacattata ttatacttat caattgccaa 4021 cagtgccatc tactaatgtt aagataactt atgaagtatc tggagaagga ataattaaag 4081 taaatgttaa gtataaagga gttagcggat tacctgaatt gcctgtacta ggaatggatt 4141 ttaaattatt agccgaattt aattcattta gctggtatgg aatggggcca gaagaaaact 4201 atatagacag atgtgaaggt gcaaaacttg gaatatatga gagtacacaa tagaaaatct 4261 atcaaggtat ttagtaccac aagaatgtgg taacaggata ggaactagat gggtagtagt 4321 taaaaatcat aagaatgaag gtcttaaatt tacttatgtt aaagttccat ttgaatttag 4381 tgttttacca tacagcagca tggaattaga aaattcactt catatagaag aattaccatc 4441 tgttaatttt acacattgtg aatataatag gtaaacaaat gggtgttggc ggagatgcaa 4501 tgctggggag caccatgata cctaaattct gtatagattc aagtaaggat ttagaatata 4561 gttttataat ttctaaaatt atactacgca catatgggaa ctatagatat ccaaaacaaa 4621 acttagactt atgcaataat ttacgaaagg acaggtactc tgttgtttcg gttactaaga 4681 ataagttgag gctttctaac atcataagtt gcaccatttc agcatgctcc cgagacaagc 4741 tcgtgacaag caaaaatgga acaacttatg atgaagaaat gcctgcaaca tattctttaa 4801 tgtaacactg cacaaaagag tacctgtcct ttctgatata gcagattttt caagctataa 4861 gtatatctca cgaaatcata aatattttga ttccgaaaag ctatgaaaat atcgctgaag 4921 gttctaagca gctggttgtg tgcaccttag catgctccaa ctttcagttt gacaagctaa 4981 aatggaacaa tctacagctc aagaaacttt aacagctcat tttcaaatgt tttctacaca 5041 aatatattta tatttctagt gaagatatga aattaaattt ttagcgactt tgtaaatatg 5101 ttaatctaat atacgaattc // LOCUS ECOPNCB 1490 bp ds-DNA BCT 19-JUL-1990 DEFINITION E.coli nicotinic acid phosphoribosyl transferase (pncB) gene, complete cds. ACCESSION J05568 KEYWORDS nicotinic acid phosphoribosyl transferase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1490) AUTHORS Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J., Meesters,H.A.R. and Witholt,B. TITLE Variation of cofactor levels in Escherichia coli: Sequence analysis and expression of the pncB gene encoding nicotinic acid phosphoribosyl transferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1490; revises [1]) AUTHORS Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J., Meesters,H.A.R. and Witholt,B. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by P.Terpstra, 31-MAY-1990. FEATURES from to/span description pept 216 1418 nicotinic acid phosphoribosyl transferase (pncB) (EC 2.4.2.11) mRNA 158 > 1490 nicotinic acid phosphoribosyl transferase mRNA signal 124 129 -35 region signal 146 151 -10 region rpt 170 185 inverted repeat binding 197 202 ribosome binding site signal 1426 1450 rho-independent transcription termination signal revision 56 57 gc in [2]; cg in [1] revision 191 191 t in [2]; tt in [1] BASE COUNT 348 a 374 c 364 g 404 t ORIGIN 1 tgttgcgtaa tgcgtatgca gaatcttcat cttttcaggt acaaacgcct ttattgctac 61 atttttataa catacagcgc gtaatgccat cgaccagaaa ggtggcatat ggtgtgatcg 121 gggttcaata aattgcgaaa caaggtatac tccagcagtt cctgaagatg tttattgtac 181 taaacgctcc tgtacgagga cgctactgcg cacctatgac acaattcgct tctcctgttc 241 tgcactcgtt gctggataca gatgcttata agttgcatat gcagcaagcc gtgtttcatc 301 actattacga tgtgcatgtc gcggcggagt ttcgttgccg aggtgacgat ctgctgggta 361 tttatgccga tgctattcgt gaacaggttc aggcgatgca gcacctgcgc ctgcaggatg 421 atgaatatca gtggctttct gccctgcctt tctttaaggc cgactatctt aactggttac 481 gcgagttccg ctttaacccg gaacaagtca ccgtgtccaa cgataatggc aagctggata 541 ttcgtttaag cggcccgtgg cgtgaagtca tcctctggga agttcctttg ctggcggtta 601 tcagtgaaat ggtacatcgc tatcgctcac cgcaggccga cgttgcgcaa gccctcgaca 661 cgctggaaag caaattagtc gacttctcgg cgttaaccgc cggtcttgat atgtcgcgct 721 tccatctgat ggattttggc acccgtcgcc gtttttctcg cgaagtacaa gaaaccatcg 781 ttaagcgtct gcaacaggaa tcctggtttg tgggcaccag caactacgat ctggcgcgtc 841 ggctttccct cacgccgatg ggaacacagg cacacgaatg gttccaggca catcagcaaa 901 tcagcccgga tctagccaac agccagcgag ctgcacttgc tgcctggctg gaagagtatc 961 ccgaccaact tggcattgca ttaaccgact gcatcactat ggatgctttc ctgcgtgatt 1021 tcggtgtcga gttcgctagt cggtatcagg gcctgcgtca tgactctggc gacccggttg 1081 aatggggtga aaaagccatt gcacattatg aaaagctggg aattgatcca cagagtaaaa 1141 cgctggtttt ctctgacaat ctggatttac gcaaagcggt tgagctatac cgccacttct 1201 cttcccgcgt gcaattaagt tttggtattg ggactcgcct gacctgcgat atcccccagg 1261 taaaacccct gaatattgtc attaagttgg tagagtgtaa cggtaaaccg gtggcgaaac 1321 tttctgacag ccctggcaaa actatctgcc atgataaagc gtttgttcgg gcgctgcgca 1381 aagcgttcga ccttccgcat attaaaaaag ccagttaata tcatcaggga gctaatcggc 1441 tccctttttt tacctttaat tccgaaatct ttcgctgcat ttgcgaattc // LOCUS NEUCCON13 2728 bp ds-DNA PLN 19-JUL-1990 DEFINITION N.crassa conidiation-specific protein (con-13) gene, complete cds. ACCESSION M35120 KEYWORDS conidiation-specific protein. SOURCE N.crassa (strain 74-OR23-1A) DNA, clone pCon10a. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 2728) AUTHORS Hager,K.M. and Yanofsky,C. TITLE Genes expressed during conidiation in Neurospora crassa: Molecular characterization of con-13 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.M.Hager, 12-JUN-1990. Author address:K.M.Hager: Dept. of Physiology UCLA Medical School 10833 Le Conte Avenue Los Angeles, CA 90024-1751 E-mail:COTRAN%VXBIO.SPAN@STAR.STANFORD.EDU FEATURES from to/span description pept 1009 1275 conidiation-specific protein (con-13), exon 1 1333 1847 conidiation-specific protein, exon 2 1910 2150 conidiation-specific protein, exon 3 pre-msg 922 2367 con-13 mRNA and introns (alt.) pre-msg 927 2367 con-13 mRNA and introns (alt.) pre-msg 936 2367 con-13 mRNA and intron (alt.) pre-msg 946 2367 con-13 mRNA and intron (alt.) IVS 1276 1332 con-13 intron A IVS 1848 1909 con-13 intron B site 2364 2367 polyadenylation site BASE COUNT 653 a 695 c 720 g 660 t ORIGIN Linkage group IV. 1 gatctcatca tctgaaacgc cgcctgagtc aatgactctt ggcaatcggg ctctgcgtcc 61 ggctagatag acagcgtccc actgatacag acttggtaag ctgccacagt tgccaagttt 121 ttatatcgat tattctttga acttccaagg acagtcttca agggcgcttt ctgtctcagc 181 atcgggagat atgacgcccg tggttcgtat accaatggtt cggcactaag gcgctgcatt 241 tgactcggag atattgacgc ctgccccctt ttgagaggag actgagtgag cgaggcccaa 301 tactatcacc acagttgcgg ttagctgccg agacttatcg gtcaacaccg aaatattggc 361 ccagaagggc aacaaaacgg gctgtcgatg gcttgcaacc attgatatcc ctgattgcca 421 ttcctacact accgcccatt cttcattcaa acctgactct cttactccct ttacagtcta 481 gcagatctgg acgtacctgc atgtaatgcg gccaacgggg ctggtaagct gaacacacca 541 ttcggagcgg ctggcaagtc tgtcatgccc gatcgacagc acatgtacta gactatctta 601 agcctagttc cgtgttcaga aacatccggt ttgattgcga atcaacagta cattgatgtt 661 catccaccgg actctaaacc gatcagctaa ttgttggcgg agcggagttc atcgcgggcg 721 taggaaacaa ggttgatgtt acccgtaaat ggaaatcgtg cttcgctcac ggcgttgctc 781 cgaagtaggg tgaagaggtc cgttggctgt gatggtttgc gctggtgtgt gtcaacgctt 841 agtgatgctg gtgatccaac tccgatccaa atgacaaagc aatgcatata agaaggactg 901 ggcatcacca acagcgcaac ggcggcagac acgaagccct agctcgacaa gcagccttca 961 taccccgacc aaaaagtcac acttgtcgta ccgtaacctc gtcgcaagat gccccaggct 1021 catttcttcg cgttgctgct tgcagccgtt gtaccggccg ttttggcgga cggtcccccg 1081 gaatcgatgg gcgagaagtt cagcggcctc aacgttctgg atgggaacgg cggacttcaa 1141 agtttgaccc cgacacccta caccataagt caatggcctt ggggtactgt acccaagctg 1201 tgctatgaca cgtctgtcaa caacaagtac tgcaacccgt acgatctcga agtatacgat 1261 gtcagataca cggatgtagg taaaagactt gcctcggatt cggaacctgt gcttacctta 1321 acttgacaat agtgccccat tcccaccacc gtctgccgat gcaagaactc acctatggcc 1381 atagacacca ttgcgcagcg tgtcggccaa ctccctgtca aggctcgcca gtataatggc 1441 tatgtgtcca gctttgcggg agacatgtgc tcagcctaca gcgatagctt caacaactac 1501 ttctttggcg actgcggcaa ttccgagtcc gtcttcttcc atgagctcag ccacaacctt 1561 gaccgtcacg ttgcaggggc gtccatcaac gattggtact ccctttcgca agactggaag 1621 gataccgttg ccaaggacac ttgcgtcgca gaccactatt ccaaggccag ctggctcgag 1681 gcatatgccc aggtgggagt catggctgga tacgatgcta cggtacagtc tatctatacc 1741 caaaatgtcg gctgtatggt caatcaggtc aagaaggtgg ttggacagtt gaacagtgtc 1801 tggcgtaaac agcctgggca gatgtgcgat cgttactgga tcaaggagta agtttctttc 1861 aacaagaccc attttcttga tgaccctgtg ctgaccggaa tgtaaacagc accacggttt 1921 gcatgggacc tgatgcggaa gccagtggcc actgtcaagc atccaaagct gatgtcgcgg 1981 cggagtctgg tggtgtaaac ccagtgttgc cggacgggca gcagaagaag cacgacgcct 2041 tggtcaagga gcttcagcgt cacgccgagg ccgcggccgg catttcttcc ggaaaaccgg 2101 cggccgatag aaagaccaag ggtaagaagg gtaccaaatt cagggtctga agcgggaact 2161 atgatcgatt ccaggtcctg ggctctagct gtgagttcag tcagggtgtt gaggaagttg 2221 cgaggcctca gttgtgagcg acgtcatcaa accgtctcct tttgggataa tgataacctt 2281 ttatttctgg ataactggga caggttaggc tgtctttgtc gatagactag gtacgtaaga 2341 attgatttga tgcttgttcg atgcttttaa gttgttgtcg cttgtggttg cgaggtagtc 2401 ggcaggtttg tttggataga cgggagacgc ccactcgcac ccagggcgat gaataacgaa 2461 ggccgatggc tctttccatg tgggaaatac acaagtctgg cattgtccac ttgtttgtct 2521 tcgagcgggg ttacgatttc tgtcaagccc tttgctcctt tcttccgaga acaaaggaag 2581 ttttcgatcc agatcgccaa catccgaaaa gggaggaata gttcgatcga tgtaccttga 2641 cggctcggcc atcgatctga tctgcatttc ccactctgga ttccagggga agggtcatat 2701 gatggaaacg agatcgaaac ccattgag // LOCUS VVUVVHAB 2237 bp ds-DNA BCT 19-JUL-1990 DEFINITION V.vulnificus cytolysin (vvhA) and vvhB gene (pot.), complete cds. ACCESSION M34670 KEYWORDS cytolysin; cytotoxin; hemolysin; toxin. SOURCE V.vulnificus (strain EDL174) DNA, clone pCVD702. ORGANISM Vibrio vulnificus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 2237) AUTHORS Yamamoto,K., Wright,A.C., Kaper,J.B. and Morris,J.G. TITLE The cytolysin gene of Vibrio vulnificus: Sequence and relationship to Vibrio cholerae El Tor hemolysin JOURNAL Infect. Immun. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.B.Kaper, 29-MAY-1990. FEATURES from to/span description pept 745 2160 cytolysin (vvhA) precursor sigp 745 804 cytolysin signal peptide matp 805 2157 cytolysin pept 237 743 pot. cytolysin (vvhB) signal 55 60 -35 region signal 80 85 -10 region signal 87 92 -35 region signal 110 115 -10 region signal 184 189 -35 region signal 206 211 -10 region signal 2185 2219 transcription termination signal binding 54 69 CRP binding site binding 59 74 Fur binding site binding 185 199 Fur binding site binding 226 231 vvhB ribosome binding site binding 730 735 vvhA ribosome binding site BASE COUNT 639 a 498 c 509 g 591 t ORIGIN 1 tatattagat cacttttaaa acaataatag atcagatatt aatctgttga ttttgtgata 61 atgagccaaa aaatactttt attttattta tatgaaatat tttcaggatt attaataaat 121 agccaacagg attttggtgc atatctattc tcaaggacga accaaacaat ctccatacaa 181 atattaatgt tatggagaaa ataacaataa taacccttac tcgtaatgag gaatctatgc 241 ttaataacaa aaatagaaat gtaggacgcc ttaccctact ctgctgtttg tttgcggcga 301 atacttttgc tgatgttcaa attttgggca gcgaaagtga gctttcacaa accattgccg 361 atcagtacca acaaaatgtc acgctgttta acggccagct aaacagtaat gatgtgttgt 421 atgtcaatgt aggaacagca accgatgacg aaatcactca agcaaaaagt catatcatct 481 ccggtagcac cgtggtgatt gatttgactc aaattgctgg tgacgacgca aggcttgatt 541 ggagccaaaa actcactggt ttaggactgt cagcgcctgt tgtggttacg ggggtttatc 601 aaggcgacgc cttagtcaat gcgattgtca gcgatgtcac cgacgagaat gacaacccaa 661 tcaacgatcc ccaagccgag ttagagagcg ttaaactttc tctcactcat gccctagacc 721 gcttccaatc tgagggaaaa taagatgaaa aaaatgactc tgtttaccct ttctctttta 781 cgtaccgcgg tacaggttgg cgcacaagaa tatgtgccga ttgttgagaa acctatttac 841 atcaccagct caaagattaa gtgtgtgttg cacacaagcg gtgatttcaa cgccacacga 901 gactggtgta atgcgggtgc ttccatcgat gttcgcgtca atgtggcaca aatgcgctcg 961 gtacaatcgg caacgtcaga tggttttact cctgacgcca aaattgtccg tttcaccgtc 1021 gatgccgaca agcctggcac gggtattcat ttggttaacg agctacagca agatcacagc 1081 tggttccaga gttgggcaaa ccgccgcact tacattggtc cattcgccag cagttacgac 1141 ctttgggtga aacccgtttc tggttacaca ccgaaaaaag cccgtgacct accgcagaat 1201 gagaacaaaa actaccaaca ccgcgatact tacggttact ccatcggtat taacggcaaa 1261 gtaggtgcgg aagtgaacaa agacggcccg aaagtgggtg gcgaagtcag tggctcattt 1321 acctacaact actcgaagac cttggtgttt gatacaaaag actatcgcat caacaaccgt 1381 tcatcattga gtgattttga tatttcattc gagcgtgaat ttggggaatg tgatgaactg 1441 cgccgccaag agcttggatg ctatttcacc gccgctcact ggggcagtgg ctgggtattt 1501 gataagacga agttcaaccc tatctcttat tccaacttca aaccgaacta tgacgttttg 1561 tacgaagcgc ccgtgtctga aactggcgta acggattttg agatgggcgt gaaactcaac 1621 tatcgtgcac gctttggtac cgttcttcct tcagcgctgt tttcggttta cggctctgcg 1681 ggctcgtcaa ccaacagcag tactgtgaaa caacgtattc gcatcgactg gaatcaccca 1741 ctgtttgaag cggaacgaca cgttacactg cagtcactga gcaacaacga tctctgcctg 1801 gatgtttatg gtgagaacgg tgacaaaacg gttgcgggtg gttcggttaa cggctggagc 1861 tgtcacggca gttggaacca agtttggggc ctagataaag aagaacgtta tcgtagccga 1921 gtggcatccg atcgttgttt gaccgtaaac gcagacaaaa cgctcacagt cgaacagtgt 1981 ggtgcgaact tagcacagaa atggtattgg gaaggcgata agctcattag ccgctatgtt 2041 gatggcagta atactcgcta ccttctaaac attgttggtg gtcgtaatgt tcaagtaacc 2101 cctgaaaatg aagcaaatca ggcgcgttgg aaacccacat tacaacaagt caaactctag 2161 gctctgttga ccttagcgat atccaaacgc tccctgtata ctagggagcg tttttcttta 2221 ttcgccatct attcgtc // LOCUS CHKMTPEPCK 3571 bp ss-mRNA ORG 19-JUL-1990 DEFINITION Chicken mitochondrial phosphoenolpyruvate carboxykinase (PEPCK-M) mRNA, complete cds. ACCESSION J05419 KEYWORDS phosphoenolpyruvate carboxykinase. SOURCE Chicken 3-day old liver mitochondrion, cDNA to mRNA. ORGANISM Mitochondrion Gallus domesticus Unclassified. REFERENCE 1 (bases 1 to 3571) AUTHORS Weldon,S.L., Rando,A., Matathias,A.S., Hod,Y., Kalonick,P.A., Savon,S., Cook,J.S. and Hanson,R.W. TITLE Mitochondrial phosphoenolpyruvate carboxykinase from the chicken: Comparison of the cDNA and protein sequences with the cytosolic isozyme JOURNAL J. Biol. Chem. 265, 7308-7317 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.L.Weldon, 17-MAY-1990. FEATURES from to/span description pept 28 1950 phosphoenolpyruvate carboxykinase precursor (EC4.1.1.32) matp 28 126 phosphoenolpyruvate carboxykinase signal peptide matp 127 1947 phosphoenolpyruvate carboxykinase rpt 2025 2123 large repeat copy A rpt 2124 2223 large repeat copy B rpt 2224 2316 large repeat copy C rpt 2317 2416 large repeat copy D rpt 2417 2508 large repeat copy E rpt 2519 2543 small repeat copy A rpt 2547 2577 small repeat copy B rpt 2580 2611 small repeat copy C rpt 2745 2777 small repeat copy D rpt 2780 2811 small repeat copy E rpt 2847 2878 small repeat copy F rpt 2040 2050 GCCAAGATGGC 11 bp repeat rpt 2105 2115 GCCAAGATGGC 11 bp repeat rpt 2205 2215 GCCAAGATGGC 11 bp repeat rpt 2298 2308 GCCAAGATGGC 11 bp repeat rpt 2072 2082 TCCAAGATGGC 11 bp repeat rpt 2139 2149 TCCAAGATGGC 11 bp repeat rpt 2265 2275 TCCAAGATGGC 11 bp repeat rpt 2332 2342 TCCAAGATGGC 11 bp repeat rpt 2424 2434 TCCAAGATGGC 11 bp repeat rpt 2524 2534 TCCAAGATGGC 11 bp repeat rpt 2558 2568 TCCAAGATGGC 11 bp repeat rpt 2657 2667 TCCAAGATGGC 11 bp repeat rpt 2691 2701 TCCAAGATGGC 11 bp repeat rpt 2724 2734 TCCAAGATGGC 11 bp repeat rpt 2758 2768 TCCAAGATGGC 11 bp repeat rpt 2792 2802 TCCAAGATGGC 11 bp repeat rpt 2825 2835 TCCAAGATGGC 11 bp repeat rpt 2859 2869 TCCAAGATGGC 11 bp repeat rpt 2926 2936 TCCAAGATGGC 11 bp repeat rpt 2983 2993 TCCAAGATGGC 11 bp repeat rpt 3023 3033 TCCAAGATGGC 11 bp repeat rpt 3057 3067 TCCAAGATGGC 11 bp repeat rpt 3114 3124 TCCAAGATGGC 11 bp repeat rpt 3234 3244 TCCAAGATGGC 11 bp repeat BASE COUNT 589 a 1077 c 1197 g 708 t ORIGIN 1 tcctcgccta tactgggaca atttataatg ttttggttaa gagggggggc gcagagttgt 61 aggggggggg aaactgagga cagaatgcag cgcgggatgt ggggcgtggg cctggcccgg 121 cgcaggctga gcacgtcgct gtcggcgctg ccggcggccg cgcgggattt cgtggaggag 181 gcggtccggc tgtgcaggcc cagggaggtt ctgctgtgcg atgggtccga ggaggagggg 241 aaggagctgc tcagagggct gcaggacgac ggggtgctgc atccgctgcc caaatacgac 301 aactgctggt tggctcgcac cgacccccgg gacgtggctc gggtgcaaag caagacggtg 361 ttggtaaccc ccgaacagag cgacgccgtc cccccacccc ccccatccgg gtccccccaa 421 ttggggaact ggatgagccc caatgctttc caggcagctg tgcaggagcg tttccccgga 481 tgcatggcag gccgccccct ctacgtcatc ccattcagca tgggcccccc cacgtccccc 541 ttggccaaac tgggggttca ggtgaccgac tccccctacg tggtgctctc catgcgcatt 601 atgacccgcg tgggccccgc ggtgctgcag cgcctcgacg acgacttcgt ccgctgcctc 661 cactctgtgg ggcggcctct gcccctcacc gagcccctgg tgagctcgtg gccgtgcgac 721 cggtcccgtg tcctggttgc ccacatcccc tcggagcgcc ggatcgtctc cttcggttcg 781 ggatacggcg gcaattcgct gctgggcaag aagtgcttcg cgctggccat cgcgtcccgc 841 atggcccagc agcagggctg gctggccgag cacatgctga ttttgggggt gacgtccccc 901 agcggtgaga agcgttacat ggcggcggcc tttcccagcg cctgcgggaa aaccaacctg 961 gccatgatga cccccagcct gccgggttgg cggatccact gcgttgggga cgacattgcg 1021 tggatgaagt tcgatgatcg agggcgcctc cgcgccatca accccgagcg tggctttttt 1081 ggggtggccc cggggacgtc gtcgcgcacc aaccccaacg ccatggccac catcgcccgc 1141 aacaccatct tcaccaacgt ggggctgcga agcgatggcg gcgtctactg ggacggcctg 1201 gatgagccca cggagcccgg ggtcacctac acctcctggc tgggcaagcc gtggaagcac 1261 ggtgaccccg agccgtgcgc ccaccccaac tcccgtttct gcgccccggc cgatcagtgc 1321 cccattatgg acccgcgttg ggacgacccg gaaggagttc ccatcgacgc catcatcttc 1381 ggggggcgcc gaccccgcgg agtgccgttg gtggtggagg cctttgggtg gcgccacgga 1441 gttttcatgg gcagcgcaat gaggagcgaa gccaccgccg ccgccgagca caaaggcggc 1501 cgtttgatgc acgacccctt cgccatgagc ccctttttcg gctacaacgc ggggcgttac 1561 ctggaacatt ggctgtctac gggtctccgg agcaacgccc gcctcccccg tctgttccac 1621 gttaattggt tcctccgaga taacgaaggt cgcttcgttt ggcccggctt cggtcacaac 1681 gcccgcgtct tggcttggat cttcgggagg atccagggga gggacactgc ccggcccacc 1741 cccatcggtt gggtacccaa agaaggggat ttggacctgg gggggctgcc gggggtcgat 1801 tactcccaac tgttccctat ggagaagggc ttttgggagg aggagtgcag gcagctgagg 1861 gagtattacg gggagaactt cggggccgat ctgcccaggg atgtcatggc ggagctggag 1921 ggcctggagg agagggtgag gaagatgtga ggggtcgggg tggggctgag ggaaaggatg 1981 gggggaggtt gggggggctg tggggggcga ggtgggggct ggcggtgggg gttggtgagg 2041 ccaagatggc ccatcggtat gggttggccg ttccaagatg gctgccgccg ctatgagttg 2101 gtcagccaag atggccgccg acagtgtggg ttggtgggtc caagatggct gccatcggta 2161 tgggttggcc gttccaagat gctgccgccg ctacgagttg gtcagccaag atggccgccg 2221 acagcgtggg tccaagaagg ccgccatcat tacgggttgg ccgttccaag atggctgccg 2281 ccactacgag ttggtcagcc aagatggcca ccgacagtgt gggttggtgg gtccaagatg 2341 gctgccatca gtatgggttg gccattccaa gatcgtgccg ccgctacgag ttggtcagcc 2401 aagatggctg ccgacagcgt gggtccaaga tggccgccat cattacgggt tggccgttcc 2461 aagatcgtgc cgccactacg agttggtcag ccaagatggc caccgacagc gttggttggt 2521 gggtccaaga tggctgccat cattgtgggt tggccgttcc aagatggccg ccatcactgt 2581 gggttggccg ttccaaggtg gctgccatct ttgtgggtcg gtgggcccat gatggctgcc 2641 atcgtgggtt ggctgttcca agatggctgc cagcagcgtg agatgactgt tccaagatgg 2701 ctgccaccac tatcagttgg ccatccaaga tggccgccaa cagcgtgggt tggtgggtcc 2761 aagatggccg ccatcactgt gggttggccg ttccaagatg gctgccgcca ctatgagttg 2821 gccatccaag atggctgcca gcaggatggg ttggtgggtc caagatggct gccaccataa 2881 tgcattggcc agacaagatg gccaccagca gcatgggatt gccgatccaa gatggccgcc 2941 ctacctggga aggagccccc tgcctgctca ttggctgagc gctccaagat ggctgccatt 3001 ccacgtcctc gttggttgac catccaagat ggctgccacc cccacagagt ggccgatcca 3061 agatggccgc cccgcctggg agggatcctc ctgccctctc attggctgag cgttccaaga 3121 tggctgccat tccacgtcct cattggttga ccatccacga tggctgctgc cttcctctcc 3181 attggctacc catctaagat ggctgctctc ctttgtcctg attggctggc caccccaaga 3241 tggctgctcg tgcccatcct ggctgctcat tggttcctgc agagctgtgg tgcctcccaa 3301 ttggtcgggg ccatttgata gtgggacttc tgggcgccat cttggagtga cgtcacactg 3361 tgagcaacgc tgcgttccta ctggcttgcc gcagcctccc atgaccaatg gctgtgtccg 3421 cttggttgcg aacgccctcg cctaatcaca gcgtcccgtt ggccgagcgg agcgtcctga 3481 ttggccgagc tcttcccctt gtccaaacgg cagcttccca ttggctgtgc tcatctcaat 3541 ggcctatcag agccgcccgt ggacctcaga a // LOCUS HUMPANMU 4139 bp ss-mRNA PRI 19-JUL-1990 DEFINITION Human pancreatic mucin mRNA, complete cds. ACCESSION J05582 KEYWORDS pancreatic mucin; tumor-associated antigen. SOURCE Human pancreatic tumor cell line HPAF-CD11, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4139) AUTHORS Lan,M., Batra,S., Qi W,-N., Metzgar,R. and Hollingsworth,M. TITLE Cloning and sequencing of a human pancreatic tumor mucin JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Hollingsworth, 08-JUN-1990. FEATURES from to/span description pept 74 3841 pancreatic mucin precursor sigp 74 136 pancreatic mucin signal peptide matp 137 3838 pancreatic mucin mRNA < 1 4139 pancreatic mucin mRNA rpt 453 2880 tandem repeat rpt 299 452 5' degenerate tandem repeat rpt 2881 2957 3' degenerate tandem repeat signal 4118 4123 poly-A signal BASE COUNT 632 a 1910 c 1055 g 542 t ORIGIN 1 ccgctccacc tctcaagcag ccagcgcctg cctgaatctg ttctgccccc tccccaccca 61 tttcaccacc accatgacac cgggcaccca gtctcctttc ttcctgctgc tgctcctcac 121 agtgcttaca gttgttacag gttctggtca tgcaagctct accccaggtg gagaaaagga 181 gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg ctgtgagtat 241 gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca ctcagggaca 301 ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca cctggggaca 361 ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc cgccagccca 421 cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc ccccagccca 481 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 541 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 601 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 661 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 721 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 781 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 841 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 901 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 961 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1021 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1081 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1141 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1201 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1261 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1321 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1381 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1441 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1501 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1561 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1621 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1681 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1741 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1801 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1861 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1921 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 1981 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2041 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2101 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2161 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2221 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2281 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2341 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2401 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2461 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2521 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2581 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2641 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2701 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2761 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2821 cggtgtcacc tcggccccgg acaccaggcc ggccccgggc tccaccgccc ccccagccca 2881 tggtgtcacc tcggccccgg acaacaggcc cgccttgggc tccaccgccc ctccagtcca 2941 caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg tgcacaacgg 3001 cacctctgcc agggctacca caaccccagc cagcaagagc actccattct caattcccag 3061 ccaccactct gatactccta ccacccttgc cagccatagc accaagactg atgccagtag 3121 cactcaccat agctcggtac ctcctctcac ctcctccaat cacagcactt ctccccagtt 3181 gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc agtttaattc 3241 ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca tttctgaaat 3301 gtttttgcag atttataaac aagggggttt tctgggcctc tccaatatta agttcaggcc 3361 aggatctgtg gtggtacaat tgactctggc cttccgagaa ggtaccatca atgtccacga 3421 cgtggagaca cagttcaatc agtataaaac ggaagcagcc tctcgatata acctgacgat 3481 ctcagacgtc agcgtgagtg atgtgccatt tcctttctct gcccagtctg gggctggggt 3541 gccaggctgg ggcatcgcgc tgctggtgct ggtctgtgtt ctggttgcgc tggccattgt 3601 ctatctcatt gccttggctg tctgtcagtg ccgccgaaag aactacgggc agctggacat 3661 ctttccagcc cgggatacct accatcctat gagcgagtac cccacctacc acacccatgg 3721 gcgctatgtg ccccctagca gtaccgatcg tagcccctat gagaaggttt ctgcaggtaa 3781 cggtggcagc agcctctctt acacaaaccc agcagtggca gccgcttctg ccaacttgta 3841 gggcacgtcg ccgctgagct gagtggccag ccagtgccat tccactccac tcaggttctt 3901 caggccagag cccctgcacc ctgtttgggc tggtgagctg ggagttcagg tgggctgctc 3961 acagcctcct tcagaggccc caccaatttc tcggacactt ctcagtgtgt ggaagctcat 4021 gtgggcccct gaggctcatg cctgggaagt gttgtggggg ctcccaggag gactggccca 4081 gagagccctg agatagcggg gatcctgaac tggactgaat aaaacgtggt ctcccactg // LOCUS DOGSRP9A 1271 bp ss-mRNA MAM 19-JUL-1990 DEFINITION C.lupus signal recognition particle 9 protein (SRP9) mRNA, complete cds. ACCESSION M34952 KEYWORDS signal recognition particle protein. SOURCE C.lupus (strain Madin-Darby) kidney, cDNA to mRNA. ORGANISM Canis lupus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 1271) AUTHORS Strub,K. and Walter,P. TITLE Assembly of the alu domain of the signal recognition particle (SRP): Dimerization of the two protein components is required for efficient binding to SRP RNA JOURNAL Mol. Cell. Biol. 10, 777-784 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Strub, 07-JUN-1990. FEATURES from to/span description pept 59 319 signal recognition particle 9 protein BASE COUNT 372 a 232 c 254 g 413 t ORIGIN 1 gcccacctac cacctacctc gggcggccag aaaccgatgc ggggggccca gcggcaagat 61 ggcgcagtac cagacttggg aggagttcag ccgcgcggcc gagaaactct acctcgccga 121 ccctatgaag gcacgtgtgg ttctcaaata taggcattct gatgggagtt tgtgtattaa 181 agtaacagat gatttagttt gtttggtgta tagaacagac caagcccaag atgtaaagaa 241 gattgagaaa ttccacagtc aactaatgcg actcatggta gccaaggaat cccgcagtgt 301 tgccatggaa acggactgac gggtttgaaa tgaagatcct tcatgttctt aggagtaaat 361 atcttttgaa tcagaaaaag tgttgggaaa gaaaatatgt aactaagtgg gctcttcaga 421 agtggggaga tcattttttg tactttgttt tttaatgttt actttagaga gctaggaacg 481 tacatgcttt cggtgaaagc ctttatttat ttttggaaat tcagtaaaag gcagttcttc 541 cttaaattta gttaatctgt ctttaaaaga aaattaaatt taaccatttt gctggattgt 601 tgtatttctt ttggagcata aaatttgtgc tattgatgac caacaaacaa acataaaata 661 tagtaattgg aattacctgt gcacagcagt gtacctatgt ataatatagt aattagtctc 721 agttctatct aaaagtaatc atggaaatga gtatgcttta cctaaaactt ttccaaactt 781 aaactgtatt tttgaatgta aggaatttgt agtatcgtta gcttgttgag cagggacttg 841 ctttaatcta gtttccagtg ctcaaaaaca actgcattta cttgaagtgc atgaacagat 901 gatcactagt ggactgaacc accatattac gcaagtattt gcctgcagat ttcccatcta 961 tattttctca gaagggctaa agattatttg aactgttaaa tctttgccat atgtctgtgc 1021 cactcctgcc tgtttctccc tgtacttaac caaggtgttg aacatgactg tcacaactgt 1081 tagttaaatc tttgcatatg tctgtgccac tcctgcctgt ttctccctgt acttaaccaa 1141 ggtgttgaac atgactgtca caactgttat ttttttcatt aagtcagaag gatatcattt 1201 gatatttatc atataattgt aacctcagtt ttaccatctc aatgtaatgt tcacatgttg 1261 ttcctacatt a // LOCUS PCHPMMMSA 6409 bp ss-mRNA INV 19-JUL-1990 DEFINITION P.chabaudi major merozoite surface antigen mRNA, complete cds. ACCESSION M34947 KEYWORDS major merozoite surface antigen; surface antigen. SOURCE P.chabaudi chabaudi (strain IPP-C1), cDNA to mRNA, and DNA, clone IPP-C1/C. ORGANISM Plasmodium chabaudi Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 6409) AUTHORS Deleersnijder,W., Hendrix,D., Bendahman,N., Hanegreefs,J., Brijs,L., Hamers-Casterman,C. and Hamers,R. TITLE Molecular cloning and sequence analysis of the gene encoding the major surface antigen of Plasmodium chabaudi chabaudi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Deleersnijder, 06-JUN-1990. FEATURES from to/span description pept 667 6024 major merozoite surface antigen precursor sigp 667 723 major merozoite surface antigen signal peptide matp 724 6021 major merozoite surface antigen rpt 324 365 tandem repeat region BASE COUNT 2681 a 991 c 922 g 1814 t 1 others ORIGIN 1 tctagataat atattttttg tatgcatgct aaaattaatt atacatatat taaatagatt 61 tgtgcgaatc tttatgtgtg caagttattt tttttaataa taattatcca tataccacat 121 tatttatttg tgtaccgtta aatatttatt ttctaagcga tttttctcct taaattatat 181 tttttntgat catttttttt ttttttttgg aaaatcggga gcataaaaaa tatatattac 241 actttataaa ttttttatac acatttgttt attttatttt atatatattt tttaacacat 301 ttttattttg aaatgatatg atcaattata aaaaaacaat aacataataa tagtaataat 361 ttttttttgt acgatatata aaattatgca tttttatttt tatagtaagt taaaaagtgt 421 attatatgta cgtattttgt ttaacagaac ggaaattaga aaaaacacaa taaaacttat 481 atatatatgt gtaattagtg tatgtgtata tatttgtcaa cattataaat gatataattg 541 aacttcaata tttattttta cacaaattag tactaatata aaatgcaaaa gtaatgtacc 601 tttgtgtgta ttaattttag cattataatt tattccactc tgtatattag ttaagtttcg 661 ttgaaaatga aggcgatcgg acttttgttt tctttcgttt tttttgctat atattgcaaa 721 tctgaaacaa taggagttta caatgatctc gttcataagt tagaaaagtt agaagaatta 781 tcagtagaag gattagaact atttcaaaaa agtcaagtaa ttgtaaatgc acaatcacca 841 gaaacacctg ttgatccatt tacaaaccct gaatttgcac aaaagttaca accatttatt 901 ttaaaatttg aagaattagg atttacagaa caaacagagt tagtcaattt aataaaaact 961 ttaggcccaa ataaatatgg actaaaatat ttaattgaaa gtaaagaaga atttaacgaa 1021 ttaatgcacg caataaattt ttactatgac gtgcttagag ataaattgaa tgatatgtgt 1081 gcaaataact attgtgaaat tcctgaacat cttaaaatta atgttgaaga aatcgaaatg 1141 cttaagaaag ttgtcttagg ctatagaaaa ccaattgaaa atattcaaga tgatcttgta 1201 aaattagaag aatatattgc aagaaataaa gcaactgctg aaaccttaaa cactcttatt 1261 actgaagaaa caaaaaaaat aacacctgaa gaagaaacag attgcaacga tactaattgc 1321 gacaatacta aatatggaaa gaaaaaagca atatatcaag ctatgtacaa tgttatattt 1381 tacaaaaagc aattagctga aataaaaaaa gtcatcgaag tcttagaaaa gagagttgct 1441 acattaaaga agaacgaagc cataaaacca ttgttacaac aaatcgaagc tatcagaggt 1501 ccacctgctg tcactgaagg acaaatagct acagaaggaa gcagcgaaga aacaaaacaa 1561 aatagtacag aatcatctaa cacaaaaacg actactactg acaaagctgt tacaacccaa 1621 accgctacta aagcaactgg tacagaaaca aatactggta cagaaacaaa tactggcaca 1681 gaaacaaata ctgccacagg aacaactact gccacaggaa caactactgc cacaggaaca 1741 cctactgtca ctgaaccagt tcaagtgcca gccgttcaag ttcttacaga agaagaaaaa 1801 gcaaaaaaaa tagctgaact ttatgctcaa attaaagaaa ttgcaaaaac tataaaattc 1861 aatttagacg gaatatttgt cgatccagtt gaattagaat attacaaaaa agaaaaaaaa 1921 aatgaaagct gccattcaac ttcatcttgc cacaaaaata aaacacctga aactgtaata 1981 ccattaaatg tacgttatcc aaatggtatt agctacccat taactgaaga agttgtttac 2041 agcaaaattg ctcataatgc cgctgaaaca acttatggtg atttaacaaa tgtcgataat 2101 acagccataa cagaagattt aaccacaaat gaacaagcaa gaaaaaattt aattaaagct 2161 attaaaaaga aaatcgaagc agaagaacaa aaattagtag aattaaaaga tgattatgat 2221 actaaacttg cagcatttaa tggacaaaaa actccattca aagaagcagc taaaaaattt 2281 tatgaatcca aatttagaaa taaattgact actgacattt ttgacgattt taaaacaaaa 2341 agaactgaat atatgaacaa gaaagctgca ttagtaggtt gtgaatatgg aaatactcaa 2401 caactcatta ataaattaaa taaacaactt aattatttac aagattatgg attaagaaaa 2461 gaaatagtta acactgaaat tgaatatttt tcaaacaaaa aatcagaatt acaatataat 2521 attaatagat tagcaaatgc tgttcaagca aaacaaaata tattagttgc atcaaaacat 2581 attccacttt caacacttgt tgaattacaa atccaaaaat ctttattaac aaaactaatt 2641 gaacaattaa ataaaactga attttcttta aataaagctc acttaaaaga caagatatac 2701 gttccacaaa catatggtaa agaaggaaaa ccagaaccat actacttaat agctataaaa 2761 aaagaaattg acagacttgc caaatttatt cctaaaattg atgatatgat tgagaaagag 2821 aaacaaaaaa tggaacaaga acatgtagct accggagaat ctgaacaagc ctcttctgcc 2881 tctggtactg gatcatccac agaaaccaca tcacaaacag caccagccgt tccagctgca 2941 cccgcaccag cagaaaaggc aaaagaagga acagaatcaa cagaagaaac cccagcagca 3001 tcaaaaccag ccgaaggtgc agcatccaca ggtgcaacca ccccaacaga acaagaagct 3061 gcaccaacag aacaagaagc acaacctgca gcacctgaaa caccagcaga ggtaccagca 3121 ccaaccacgc ctgcagctcc agcaactcca gccgcaccag cagcacccgc aaaaccagtt 3181 atgacaaaat tatattacct tgaaaaatta aagaaatttt tagcattctc atatgcatgc 3241 cataaatatg ttttattaca aaactctacc ataaacaaag atgctttaag caaatatgct 3301 cttacaccag aagaagataa aataagaaca ttaaagagat gcagtgaatt agatgtatta 3361 ttagctattc aaaataatat gcctactatg tattcacttt atgaaaatgt agttgatggt 3421 ttacaaaaca tttacactga attatatgaa aaagaaatga tgtatcatat atataactta 3481 aaagataaaa acccagctgt taaagcttta ttagtaaaag ctggcgtcat tgatccagaa 3541 ccagtagccc caacaccagc agtaccagca ccagaaactg caccagaaac tgcaccagaa 3601 actgcaccag aaacaccagc acaagaagct ccacaacaac cagaatcggc acaagcacca 3661 gaagcagcaa ctgaaacaac aacaccagcc gaatcggcat caacagaacc aacaccaaaa 3721 gcacctacag caacacccac atctgaaaca gtaacacaag aaggaacaac accagcagca 3781 ccaaaagcac aagaaggagc atcatcatca gcaccagcac aaccagcccc agcaaaacca 3841 gcacctgcac aaacagtaac agggcaatca acaaacgttg aaggaagtac tcaagtaaga 3901 gcagaaagtg aagacgaaat gtttgtcgat gattttgaag tagacaattt ttacaaatct 3961 tacttacaac aagttgatgg aaataatact caattcatag attttataaa atctaaaaaa 4021 gaattaatca atgcattgac ccctgaaaaa gttaaccaat tatatcttga tattgcacac 4081 ttaaaggaat tatcagaaca ttactataat cgttattata aatataaatt aaaattagaa 4141 agattatatc aaaaacatga acaaattgaa gcagctaacc aaaaagttaa agaaattagc 4201 gtattaaaat cccgattatt aaaaagaaaa aaatatatta atggtacatt ttatgtatta 4261 tctggttttg caaatttctt taacaagaga agagaagctg aaaagcaata tgtagataac 4321 gcaataaaaa atactgatat gttattgaaa tactacaaag ctcgtagtaa atattttact 4381 tctgaagctg ttcctttaaa aacattaact aaaacatcaa ttgacagaga agccaactac 4441 ttgaaaatcg aaaaattcag agcatacagc cgattagaat taagattaaa gaaaaatatt 4501 aacttaggaa aagaaagaat tacatatgta tctggtggtt tacaccatgt atttgaagaa 4561 tttaaagaac ttttaaaaaa taaaggttat accggaaaaa ctaaccctga aaatgctcct 4621 gaagttatca aggcattcga acaatataaa gaattacttc caaagggagc aacaactcca 4681 gctccagtag ttgcacctgt agttgctcca gccccagcta cagcagcccc agcagctgac 4741 gcaccagtac cagcagccgc agccgcagcc gcatcaggat caggatcagc agccacaaca 4801 gaaggagaag ccgctacaac agtagttgca agcagcgata atgatgatga tgacgatgat 4861 gatatggatc aaattgcaaa tgctcaatcc acagacgaag aagtaaaaga tattcttgat 4921 gcatttaaaa gtgaaaatga atatatatac acaaagagct taggtaacac atataaatca 4981 tttaaaaaac acatgttaaa agaattttca atgattaaag aagacataat gactggatta 5041 aactataaat tagaaaaaag aaatgatttc cttgatgtat taagctatga attagcttta 5101 ttcaaagata taaataccaa caaatttgtt gttaaaaacc cataccaatt attagataat 5161 gataagaaag acaaacaaat gataaactta aaatatgcca ttaaaggtgt aactgaagat 5221 atcgaaacag ctactgatgg aattgaattc tttaacaaaa tgattgaatt atacaaacct 5281 caattaaacg cagttaatga acaaattgct gccataggaa cagaacctac cgatgccgaa 5341 aaaaagaaat acgctccaat ctttgaagat cttaaaggat tatatgaaac catattgaac 5401 ggagcagaag aattttcaga attattacaa cacaaacttg aaaactataa aattgaaaaa 5461 gctggatttg acattttaat ggcaaattta gaaacataca taagaattga cgaaaaactt 5521 gaagacttcg tagaaagtgc agaaaaaaat aaacacattg cctcaatagc tttaaataac 5581 ttaaacaaat ctggtttagt aactgaaggt gaatcaaaga aaatattagc aaaaatgctt 5641 aacatggatg ccatggattt attaggtata ggttctaatc atgtatgtat tagtacaagt 5701 actcctgaca atgctggatg ctttagatat gatgatggta cagaagaatg gagatgttta 5761 ttaggtttca aaaaagatga tgatggtaat agatgtgtag cagatgatgc tcctgtttgt 5821 aataacaaca atggtggatg tgataaaaat gctgattgta gagaagtaga aaatacagat 5881 agggatcctt ccaaaaaaat tgtatgtact tgtaaagaac caaacccaaa tgcatattat 5941 gctggtgtat tctgtagttc ttccggattt atgggattat caattttatt gatcatcaca 6001 ttaattgtat ttaatttatt ttaaataaat gattaaaata tttgttgcat tttatatttt 6061 tcctatatat attttaaaag ttgtataata catttgaaat atatattttg gcataaattg 6121 tatatttttt attatataaa aaaatatata tatataattt ttaataaaca tttttaaata 6181 aacgtacatg tgttttagta taggaaattt tgtatgactt taaaatatga tgatactatt 6241 ttttttaaat gtatagtaaa ttaatttatt tttatttttt atacaatata ttgtatgtgt 6301 gttctttatt actattattt tataagtata taaaataaag ctattttttt ttttttttta 6361 acttcaaaca tatttagtaa cttttttatt taaagaatag ccggaattc // LOCUS SHPMHCA 588 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, 3' end. ACCESSION M34672 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone SC17. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 588) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept < 1 287 MHC protein (AA at 3) BASE COUNT 120 a 161 c 157 g 150 t ORIGIN 1 ccaggaagtg ggcggccctg gtggttcctt ctggagagga gcacacatac acgtgccgtg 61 tgcagcacga ggggcttcag gagcctaccc tgagatggga acctcctcag acctccttcc 121 tcaccattgg catcattggt ctggatctcc tcgtggttgc tgtggtggct ggagctgtga 181 gctggatgaa gaagctctca ggtgaaaaaa gacggacgta cacacaggct gcaagcagtg 241 acagtgccca gggctctgat gtgtctctca cggtccctaa agtgtgaaac gctgccttgt 301 gggactgagt gatgctgcat cccgcaatgt gacgtcagat cctggacccc tctttctcgg 361 ctgcatccga atgtgtctgt gctcctagta gcataacatg aggagctggg gagactggtc 421 acccctgccc accacacccc cttctccgct gacctgtgtt ctcctccctg atacactgtc 481 ctgttccagc agagacaggg ctgggccgtg tcatcgctgt ctttgcttca tatgcactta 541 gtaatgatgt cttatttcat ctttgaaaat aaaatctgta tatatatc // LOCUS SHPMHCB 841 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, 3' end. ACCESSION M34673 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone SCI89. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 841) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept < 1 543 MHC protein (AA at 1) BASE COUNT 179 a 235 c 238 g 189 t ORIGIN 1 gaggactacc tggagggccg gtgcgtggag tggctccgca gatacctgga gaccgggaag 61 gacacgctgc tgccgcagac ccttccaaag gcacatgtga cccgacaccc catctctgag 121 cgtgaggtac ccttgaggtg ctgggccctg ggcttctacc ctgaggagat ctcactgacc 181 tggcagcgca atggggagga ccagacccag gacatggagc tcgtggagac caggccttca 241 ggagatggaa ccttccagaa gagggcggcc ctggtggtgc cttctgaaga ggagcagaga 301 tacacgtgcc atgtgcagca cgaggggctt caggagctca ccctgagatg ggaacctcct 361 cagacctcct tcctcaccaa gggcatcatt gttggcctgg ttctcctcgt gctggctgtg 421 gtggctggag ctgtgatctg gaggaagaag tgctcaggtg aaaaaagagg cacctatacc 481 caggcttcaa acaatgacat gtgcccaggc tctgatgtgt ctctcacagt tcctaaagtg 541 tgagacgctg ccttgtggga ctgagtgatg ctgtatccca ctatgtgatg tcagatccct 601 gacccctctt tctgcagctg catctgaacg ttgtctgtgc tccatgtagc ataacgtgag 661 gagctgggga gattggtcac ccctgcccac cacaccccct cccgcctgga cctatgtctc 721 ctccctgata cactgtccta atccagcaga gagggcctgg ctgtctccat ccctgtcttg 781 cttcatgtgc actgagtaat gatgtcttat acccttattg aaaataaaat ctgtatatat 841 g // LOCUS SHPMHCC 995 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, 3' end. ACCESSION M34674 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone PSCI16. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 995) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept < 1 537 MHC protein (AA at 1) site 466 626 unspliced intron BASE COUNT 203 a 248 c 288 g 254 t 2 others ORIGIN 1 gaccctccaa aggcacatgt ggcccatcac cccatctctg accgtgaggt caccctgagg 61 tgctgggccc tgggcttcta ccctgaggag atctcactga cctggcagcg tgacggggag 121 gaccagactc aggacatgga gtttgtggag accaggcctt caggggatgg aaccttccag 181 aagtgggcgg ccctggtggt gccttctgga gaggagcaga gatacacgtg ccgtgtgcag 241 cacgaggggc ttcaggagcc cctcaccctg agatgggaat ctcctcagcc ctccgtcctc 301 accatgggca tcattgttgg cctggttctc ctcgtggtgg ctgtggtggc tggagctgtg 361 atctggatga agaagcgctc aggtgaaaaa ggacggatct acacccaggc tgcaagcatg 421 tacagtgccc agggctctga tgtgtctctc acggttccta aaggtgaggc cctggagtgt 481 ctagattgga aggagcattg gggcagaggg gacacactgg gtggcggggg tctctgagtg 541 ggacatgtga gcatgtcggg ggctgtggag aatatcagcc cttacatgac tgacctgaac 601 tggctcctga ttcttttctc tcacagtgtg agacagctgc cttgtgggga ctgagtgatg 661 cttggtccca ctttgtgatg tcagatcgcc ggacccctct ttcttcagct gcatctgaat 721 gtgtctgtgc tcctattagc ataacatgag aagttgggga gactggtcac ccttgcccac 781 tgtacgctgt ccccaccctg acctgtgttc tcctccctga tccaccatcc tgttcagcga 841 gacgggctgg gccatcttca ttgctatctt tgcttcacat gcactgagta atgatgtctt 901 atttccttat tgaaaataaa ttctgtatat atatgaatct attttttcta attggtgcca 961 tgaaagggnn ttggataata aaatgagaat tcgat // LOCUS SHPMHCD 1050 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, 3' end. ACCESSION M34675 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone PSCI11. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1050) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept < 1 747 MHC protein (AA at 1) BASE COUNT 216 a 292 c 315 g 227 t ORIGIN 1 ggcgggtctc acaccatcca ggcgatgtac ggctgcgaag tgggacctga cgggcgtctc 61 ctccgcgggt atgagcagtt cgcctacgaa ggcagagatt acctcgccct gaacgaggac 121 ctgcgctcct ggaccgcggc ggacacggcg gctcagatca ccaagcgcaa gtgggaggcg 181 gcaggtgagg cggcgcgtgt gaggatctac ctggagggca cgtgcgtgga gtggctccgc 241 agacacctgg agaccgggaa ggacacgctg ctgcccgcag accctccaaa ggcacatgtg 301 acccaacacc ccatcactga gcgtgaggtc accctgaggt gctgggcctt gggcttctac 361 cctgaggaga tctcactaac ctggcagcac aatgaggagg accagaccca ggacatggag 421 cttgtgaaga ccaggccttc aggggatgga accttccaga agtgggcagc cctggtggtg 481 ccttctggaa aggagcagag atacacgtgc cgtgtgcagc acgaggggct tcaggagccc 541 ctcaccctga gatgggcacc tcctcagacc tccttcctca ccatgggcat cattgttggc 601 ctggttctcc tcgtggtgac tgtggtggct ggagctgtga tctggaggaa gaagcgctca 661 ggtgaaaaaa gacagaccta tacccaggct gcaagcagtg acagtgccca gggctctgat 721 gtgtctctta tggttcctaa agtgtgagac agctgccttg tggggactga gtgatgcttg 781 gtcccattct gtgacatcag atcttgggac ccctctttct gcaggggcat ctgaatgtgt 841 ctgtgctcct attagtataa catgaggagt tggggagact ggtcacccct gcccactgca 901 caccgtcccc accctgacct gtgttctcct tcctgatcca ctgtcctgtt gcagcagaga 961 cgcctgggcc ctctccatca ctgtctttgc ttcatatgca ctgagtaatg atgtgttatt 1021 tcctttttga aaataaaatc tgtatatatg // LOCUS SHPMHCE 1396 bp ss-mRNA MAM 19-JUL-1990 DEFINITION Sheep MHC class I protein gene, complete cds. ACCESSION M34676 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Sheep 8-week old, cDNA to mRNA, clone PSCI12. ORGANISM Ovis aries Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1396) AUTHORS Grossberger,D., Hein,W. and Marcuz,A. TITLE Class I major histocompatibility complex cDNA clones from sheep thymus: Alternative splicing could make a long cytoplasmic tail JOURNAL Immunogenetics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Grossberger, 30-MAY-1990. FEATURES from to/span description pept 4 1110 MHC protein precursor sigp 4 89 MHC protein signal peptide matp 90 1107 MHC protein BASE COUNT 278 a 403 c 434 g 281 t ORIGIN 1 cccatgacca gaggattgcg agtaatgggg ccgcgaaccc tcctgttgct gctctcggga 61 gtcctggtcc tgaccgagat ccgggcgggc ccccactcca tgaggtattt cagcaccgcc 121 gtgtcccgcg ccggcgccgg ggagccccgg tacctggaag tcggctacgt ggacgacacg 181 cagttcgtgc ggttcgacag cgacgccccg gatccgaaga tggagcagag ggagccgtgg 241 atgaagcagg tggggccgga gtattgggat cggaacacgc gaaatcccaa gggcaacgca 301 cagactttcc gagtgggcct gaccatcctg cgcggctact acaaccagag cgagaccggg 361 tctcacacct ggcagtgtat gtacggctgc gacgtggggc cggacgggcg tctcctccgc 421 gggttcatgc agttcggcta cgacggcaga gattacatcg ccctgaacga ggacctgcgc 481 tcctggaccg cggcggacac ggcggctcag gtcacccagc gcaagtggga gaaggaaggt 541 gcggcggacc actacaggaa ctacgtggag ggcacgtgcg tggagtgcgt gcgcagatac 601 ctggagatcg ggaaggaaca gctgcagcga gcagaccctc caaaggcaca tgtgacccat 661 caccccatct ctggccatga tgtcaccctg aggtgctggg ccctgggctt ctaccctgag 721 gagatctcac tgacctggca gcgcaatggg gaggaccagt tgcaggacat ggagcttgtg 781 gagactaggc cttcagggga tggaaccttc cagaagtggg cggcccttgg tggtgcttct 841 ggagaggagc agagatacac gtgccatgtg cagcatgagg ggcttcagga gcccctcacc 901 ctgagatggg aacctcctca gacctccttc ctcacttcct caatgggcat cattgttggc 961 ctggttctcc tcgtcatggt ggctgtggtg gctgcagctg tgatctggag gaagaagtgc 1021 tcaggtgaaa aaagagggac ctatacccag gcttcaagca atgacagtgc ccagggttct 1081 gatgtgtctc tcacggttca taaagtgtga gacagtgatg ctgcatcccg ctatgtgcca 1141 tcagatcccc ggacccctct ttctgaagct gcatctgcac gtgtctgtgc tcctagtagc 1201 ataacgtgag gagttgggga gaccgttcac ccctgcccac cgcgccccct cctgccctga 1261 cctgtgttct cctccctgat ccactgtcct gttccagcag cagacagggc tgggccgtct 1321 ccatccctgt ctttgcttcg tatgcactga gtaatgatgt cttatttcct tattgaaaat 1381 aaaatctgta tgtatg // LOCUS YSPNMT1A 3787 bp ds-DNA PLN 19-JUL-1990 DEFINITION S.pombe no message in thiamine protein (nmt1) gene, complete cds. ACCESSION J05493 KEYWORDS . SOURCE S.pombe DNA. ORGANISM Schizosaccharomyces pombe Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3787) AUTHORS Maundrell,K. TITLE nmt1 of fission yeast: A highly transcribed gene completely repressed by thiamine JOURNAL J. Biol. Chem. 265, 10857-10864 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Maundrell, 07-JUN-1990. FEATURES from to/span description pept 1499 2539 no message in thiamine protein (nmt1) mRNA 1430 2681 nmt1 mRNA signal 1396 1403 TATA box BASE COUNT 1218 a 657 c 690 g 1222 t ORIGIN 1 ggctcattta taatctagca ctttatacct tttacctgac tgttgggttg tttatctgac 61 ctcataaaag aaagtgtcgt tttggaaaaa ttagcattac attgagggtc ctccgctaat 121 gctcctgcga aaaatgattt taattttgga tgttttttca gaaataaaat gaaaattagc 181 ttgatataat atcaaccggc agcgagtaat agatttaaaa taaatttgat taattaaaaa 241 aatttgttgt tttaagcaag ccattttgct aaaatcaaag gtaatggaag agtatttccg 301 aaaaatctca acacatgtga atgatcagaa aattatcgcc ataaaagaca gaataagtca 361 tcagcggttg tttcatttcc tatatttttt ttttattttt ttatttttta ataagggaaa 421 atttaacgtc taaggataca gaagattgtt agcacattaa agtaataaag gcttaagtag 481 taagtgcctt agcatgttat tgtatttcaa aggacataat ctaaaataat aacaatatca 541 tttctcacaa gttattcaat tttctttttt ttttctaata atatcaagaa tgtattattt 601 gtttgacata agtcaactaa tttatttaat atgctggatt aatcttgcag acatgtaaat 661 taacaagttt tagtcaaata acgttgaagt ttcaatgaac tcaaataatt tctctttttt 721 tttatataac catatgtcta atctgattta tattttccgc aggatcaact gaagttatga 781 catttggatt ggatcactta taaccttggt cgccaaataa tacaaaaatc agcgttataa 841 aacaaagaag gtttttgtta agaaattaat cctctttctt gataagaaag ttgaaccgaa 901 attgcagata ctgatatatg aaaataatac ccacaatttt gggaatagcg caagcctcaa 961 tttaaacaat aggtgaggac acatgataat gacctcaatg attgttagaa gaaaagagcc 1021 tcattacaaa atcgaaaaat gaatggttgg gtacaagttt ccaaaacatg gtaaagtgga 1081 ctttgcgtat gagacgtaaa tagaaaaaaa cacttgttat atgttttcta gaattattgt 1141 tgtctcttta tggttggatg atgcaaaata gtaatttcgg ttagttgctg taaaacacca 1201 cgagacaaat agatatggat atttattaaa tcaggaaaaa cgtaactctc ggctactgga 1261 tggttcagtc acccaacgat tactggggag agaaaacagg gcaaaagcaa agcttaaagg 1321 aatccgattg tcattcggca atgtgcagcg aaactaaaaa ccggataatg gacctgttaa 1381 tcgaaacatt gaagatatat aaaggaagag gaatcctggc atatcatcaa ttgaataagt 1441 tgaattaatt atttcaatct cattctcact ttctgactta tagtcgcttt gttaaatcat 1501 gtctactaac aagatcactt tcctcacaaa ctgggaggcc actccttacc atttgcccat 1561 ctttcttgct caaactcgcg gatactatga gcgtgaaggg attgaggttg ctattctcga 1621 gcctaccaac ccttccgacg ttacagcatt gattggttct ggtaaggttg acatgggatt 1681 aaaggccatg atccatactt tagctgctaa ggctcgcgga taccctgtca ccagttttgg 1741 atctttgtta aatgagcctt tcactggctt aattactttg aagggtaatg gcatcaacga 1801 cttcaaggac attaaaggaa agcgtattgg ctacgttggt gagtttggaa agatccaact 1861 cgatgacttg tgcagcaagt tcggtttgtc tccttctgat tatactgcta ttcgctgtgg 1921 tatgaacatt gcccctgcca tcatcaatgg tgaaatcgat ggcggcattg gcattgaatg 1981 catgcaacaa gtcgagcttg agcgctggtg cgtctcccaa ggccgcccaa ggtctgatgt 2041 ccaaatgttg cgtattgatc gattagccaa cttaggttgc tgctgtttct gtaccatttt 2101 gtatattgca catgatgaat tcattgctaa acatcccgac aagatcaagg ccttcttacg 2161 tgctatccat tctgctactt tggatatgct taaagatcct gtccaaacct acaaggagta 2221 cattcacttc aagcgtgaaa tgggatccga acttcatcgg gaacaatttg aacgttgctt 2281 tgcatatttc tcacatgaca tctctaacgt ccccagagat tggaacaagg ttaccaatta 2341 ttccaagcgt ttgggcatca tcccccaaga ttttgagccc aactgtacta acggttactt 2401 gacctgggaa cttgaccccg atgagaagga tcccatgggc aaacaagaag ccattgccga 2461 gatccaagat gaaattaagc aaaagggagg tgtcttcagc ggcaactcac ttcgttatgt 2521 cgagcctgcc aacctttaaa aggaatgtct cccttgccag tactgctagg gtttttcttt 2581 caaactatgg aagcccattc aagctgcata ttacgatttt gtttttcgct tttagaaagt 2641 ggtttagatg agataataga aaaattcttg atctccgaca acgagtactt ttattttttt 2701 tgctaatcac tttactcaat attagctcga aatcgtagaa acgtagacgg gtgcgggata 2761 ccgagtggtg tagttaagaa tttttataaa ccacgtggcc caaaaatatg aacccaaaac 2821 gtttatacat gagtatactt taagaaggct ataccccttc gtgttagatg tagttttagc 2881 tacccaaccc gagtctatga gcttgacttc agatgtagaa ggcattaaat cgttttgaat 2941 attaattaaa aaacgatgaa aattaaatat ttaaaagcaa tcatacgctg aaaatttagt 3001 gctgtggcta atccttcaac atggaaatgc cataaaagtg actttgacaa aaaaaaaagt 3061 atatacaggt agtaaactca tctacttcat tgactttgtt tacagcatgt ggaaggagga 3121 atatttattg ctaaatcgta gtttaacatt caataagtaa tactattgaa attcgacaag 3181 attggccgca tggatgaaaa agaggcattt tgctttggga gaattagttc aaattagaac 3241 tgaaaaaaaa aactttacga ggcaaaaatg tcggattgag atcgtaaaag ttcgctcgtc 3301 gtcttttgct ttgtgattgt tttcatggat acatcttgct ggatatttaa attttagtac 3361 tatgtataag atattctata aatgttttat cacccaaacc tgttagcgcc ttcttaattc 3421 tattcaatct ggcttttgct ctgagactac ttcttggact ttcactactt gttagttata 3481 cggaatttgt gtaattagaa gtgaaataat cctttctatt agtaatgcaa acaaaaatca 3541 attggaaagc aaatttacac atacttgctg tatcgccttc gactatcttt tcattgcata 3601 ccatgatttt agacgtttat acttaagcaa ttaaaaggtt ttgattcaat cataaacata 3661 attatccttg ataaaaaaag aattatacac attgttctct ttatttgact tcgaactgtt 3721 taacatcgaa acggtcagat gatacaccca ttcctccaat gtaatccctg gcttcttggg 3781 caagctt // LOCUS CP7CPL 1470 bp ds-DNA PHG 19-JUL-1990 DEFINITION Bacteriophage Cp-7 muramidase (cpl7) gene. ACCESSION M34779 KEYWORDS muramidase. SOURCE Bacteriophage Cp-7 [from S.pneumoniae] DNA. ORGANISM Bacteriophage Cp-7 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1470) AUTHORS Garcia,P., Garcia,J.L., Garcia,E., Sanchez-Puelles,J.M. and Lopez,R. TITLE Modular organization of the lytic enzymes of Streptococcus pneumoniae and its bacteriophages JOURNAL Gene 86, 81-88 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 286 1314 muramidase BASE COUNT 483 a 270 c 318 g 399 t ORIGIN 1 cagctggaca ggcttaaaag gagttatcaa acataccctt acattcattt tttactactt 61 tgtagcggta ttcttgacct atattcacgc tatggcagtc ggtcagattt tgctggttat 121 cattaactta tactatgctt tgtcaatcat ggaaaatctt gctgttatgg gtgtatttat 181 tcccaagttt atgacggcaa gggtgcaaga agagttacag aaatacacag cacaactaga 241 cgcagggaaa gacctgctag aagaatttaa aggagaaaag aaataatggt taagaaaaat 301 gatttatttg tagacgttgc aagccatcaa ggctacgaca tttcaggaat tttagaagaa 361 gcagggacaa caaacacaat tattaaagtg tcagaaagta caagctattt aaacccttgc 421 ttgtctgctc aagtgagcca gtcaaatcct atcgggtttt atcattttgc ttgctttggt 481 ggaaatgaag aagaagcaga agcagaagca cgctatttcc ttgataacgt gcctacacaa 541 gttaaatacc ttgtactaga ttatgaagac catgcaagcg caagcgtaca aagaaacact 601 accgcgtgct tacgctttat gcaaatgatc gcagaagctg gatatacacc tatttattat 661 agttacaaac cgtttacgct tgataatgtg gactatcagc agattttagc acagttccct 721 aattctctat ggattgcagg ctatggctta aatgatggta cagctaactt tgaatacttt 781 ccaagcatgg acggtatcag atggtggcaa tattctagta acccgtttga caagaatatt 841 gtactgttag atgatgagaa agaagataat ataaacaatg aaaacactct aaaaagcctt 901 accacagtag ccaacgaggt cattcaggga ctttggggca acggtcaaga acgttatgac 961 agtttagcga atcgagggta tgacccccaa gcggttcaag acaaagtgaa tgaaatctta 1021 aacgctagag aaattgcaga ccttaccaca gtagccaacg aggtcattca gggactttgg 1081 ggcaacggtc aagaacgtta tgacagttta gcgaatcgag ggtatgaccc ccaagcggtt 1141 caagacaaag tgaatgaaat cttaaacgct agagaaattg cagaccttac cacagtagcc 1201 aacgaggtca ttcagggact ttggggcaac ggtcaagaac gttatgacag tttagcgaat 1261 cgagggtatg acccccaagc ggttcaagac aaagtgaatg aattactttc ataacaagta 1321 aaagctagta gaaattttct actagctatt tttatattct gctatgattt tataggcgtc 1381 ctcatctggg ttatccagag caatggagca aatggcagac aggacagctg ttcatctgat 1441 tgtatttctg taaatagtga ttttctagct // LOCUS CP9CPL 1253 bp ds-DNA PHG 19-JUL-1990 DEFINITION Bacteriophage Cp-9 muramidase (cpl9) gene. ACCESSION M34780 KEYWORDS muramidase. SOURCE Bacteriophage Cp-9 [from S.pneumoniae] DNA. ORGANISM Bacteriophage Cp-9 Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 1253) AUTHORS Garcia,P., Garcia,J.L., Garcia,E., Sanchez-Puelles,J.M. and Lopez,R. TITLE Modular organization of the lytic enzymes of Streptococcus pneumoniae and its bacteriophages JOURNAL Gene 86, 81-88 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 52 1071 muramidase BASE COUNT 403 a 213 c 294 g 343 t ORIGIN 1 agtagacgca ggaaaagacc tgctagaaga atttaaagga gaaaagaaat aatggttaag 61 aaaaatgatt tatttataga cgtatcaagc cacaacggtt acgatataac aggaatttta 121 gagcagatgg gaacaacaaa cacgattgtt aaaatctcag aaagtacgac ctatttaaac 181 ccttgcttgt ctgctcaagt ggaacagtct acccctattg gcttttatca cttcgcacgc 241 tttggcggag acgtagcaga agctgaaaga gaagcgcagt ttttccttga caacgtgcct 301 acacaagtta aataccttgt attggactat gaagacgacc caagcggaaa cgcacaagcc 361 aacactaacg catgcttacg ctttatgcag atgattgcag acgctggata tacacctatt 421 tattatagtt ataaaccttt cacgcttgat aatgtggact atcagcagat tttagcacag 481 ttccctaatt ctctctggat tgcagggtat ggcttgaatg atggaaacgc tgattttgaa 541 tattttccat ctatggacgg gataagatgg tggcagtatt ctagtaaccc gtttgacaag 601 aatattgtac tgttagacga tgaagaagac gaaaagccaa agactgctgg aacgtggaaa 661 caagacagta agggctggtg gttcagacgc aataacggta gtttccctta taataaatgg 721 gaaaaaatcg ggggtgtgtg gtactacttc gatagtaaag gatattgctt aacgagcgaa 781 tggctcaaag ataatgaaaa atggtactac ctcaaggaca acggcgctat ggtgactggt 841 tgggtgctag tcgggtcaga gtggtattat atggacgatt caggtgcaat ggttactggt 901 tgggtcaaat acaagaataa ctggtactat atgacaaatg aacgtggtaa catggtttct 961 aatgaattta ttaaatctgg aaaaggctgg tatttcatga acacaaacgg agagcttgca 1021 gacaatccaa gctttacaaa agaaccagac ggacttataa cggtagcata aaaagaaaag 1081 ctagtagaaa ctttctacta gctgttttta tattctgcaa tgattttata agcgtcttcg 1141 tctgggttgt ccagagcgat ggagcagatg gcagacagaa ccgctgttca tctgattgta 1201 tttctgtagg tagtgatttt ctaggctgtt atgttgctga tgtgctttat acc // LOCUS YSCTY31A 5510 bp ds-DNA PLN 19-JUL-1990 DEFINITION S.cerevisiae Ty3-1 retrotransposon integrase gene, complete cds, and Cys-tRNA gene. ACCESSION M34549 KEYWORDS integrase; transfer RNA-Cys; transposable element; transposon. SOURCE S.cerevisiae (strain AB950) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 5510) AUTHORS Hansen,L.J. and Sandmeyer,S.B. TITLE Characterization of a transpositionally active Ty3 element and identification of the Ty3 integrase protein JOURNAL J. Virol. 64, 2599-2607 (1990) STANDARD unannotated staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.B.Sandmeyer, 24-MAY-1990. FEATURES from to/span description pept 536 1408 integrase tRNA 105 31 (c) Cys-tRNA mRNA 343 > 5510 integrase mRNA site 121 462 5' sigma element site 5132 5471 3' sigma element rpt 121 128 5' inverted terminal repeat rpt 455 462 3' inverted terminal repeat rpt 5132 5139 5' inverted terminal repeat rpt 5464 5471 3' inverted terminal repeat rpt 116 120 5' insertion target sequence rpt 463 467 3' insertion target sequence rpt 5127 5131 5' insertion target sequence rpt 5472 5476 3' insertion target sequence BASE COUNT 1955 a 1306 c 919 g 1330 t ORIGIN 1 aactttcatg gaaggaccac ctagttaata aaaagctcgc actcaggatc gaactaagga 61 ccaacagatt tgcaatctgc tgcgctacca ctgcgccata cgagcttgat tttctgaaag 121 tgttgtatct caaaatgaga tatgtcagta tgacaatacg tcaccctgaa cgttcataaa 181 acacatatga aacaacctta taacaaaacg aacaacatga gacaaaaccc gaccttccct 241 agctgaacta cccaaagtat aaatgcctga acaattagtt tagatccgag attccgcgct 301 tccaccactt agtatgattc atattttata taatatataa gataagtaac attccgtgaa 361 ttaatctgat aaactgtttt gacaactggt tacttcccta agactgttta tattaggatt 421 gtcaagacac tccggtatta ctcgagcccg taatacaaca cctggtagcg ttaaaggtta 481 ctaattgttc aaacgaacca tcgaaaagcc gaacctagct acaccacacc ccagtatgag 541 ctttatggat caaatcccag gaggaggaaa ttatccaaaa ctcccagtag aatgccttcc 601 taacttcccg atccaaccat ctttgacctt cagaggtaga aatgactcgc ataaactgaa 661 aaactttatc tccgaaataa tgttaaacat gtctatgata tcttggccga atgatgccag 721 tcgtattgtg tactgcagaa gacatttatt aaaccccgct gctcagtggg ctaatgactt 781 tgtacaagaa caaggtatac ttgaaataac attcgacaca ttcatacaag gattatatca 841 gcatttctat aagccaccag atatcaataa aatctttaat gcaatcacgc aactttccga 901 agctaaactt ggtattgagc gtctcaacca acgattcaga aagatttggg acagaatgcc 961 accagacttc atgaccgaaa aagctgccat aatgacatat actaggctat tgacaaagga 1021 aacctataat attgtcagaa tgcacaaacc agagacatta aaagacgcca tggaagaggc 1081 ttaccagaca actgcactaa ctgaaagatt cttcccagga ttcgaacttg atgctgatgg 1141 agacactatc atcggtgcca caacccactt acaagaagaa tacgactctg actatgattc 1201 agaagataat ctgacccaga atggatacgt ccataccgta aggacaagaa gatcttacaa 1261 taaaccaatg tcaaatcatc gaaacaggag aaataacaac ccatctagag aagaatgtat 1321 aaaaaatcgg ctatgcttct attgtaagaa agagggacat cgcctgaacg aatgtagagc 1381 acgtaaggcg agttctaacc gatcttgaac tcgaatcaaa agaccaacaa actcctttta 1441 tcaaaacctt accaattgta cactatatcg ccatccccga gatggacaat accgccgaaa 1501 aaaccataaa aatacaaaac acgaaagtaa aaaccctgtt tgacagtgga tcacccacgt 1561 catttatccg aagagatatt gtagaacttc tcaaatacga aatctacgag acccctccac 1621 tccgttttag aggattcgta gccaccaaat ccgccgttac atccgaagca gtcaccattg 1681 acctcaaaat caatgacctg catataactt tagccgcgta catactggat aacatggact 1741 accaattgtt aattggaaat ccaatcttac gccgctaccc gaaaatcctg cacacagtac 1801 tgaataccag agagagcccc gactccttaa agcccaagac ttatcgctcc gaaaccgtta 1861 ataacgttag aacctactcc gctggtaatc gtggtaaccc cagaaacata aaactgtctt 1921 ttgcccccac cattctcgaa gcaactgacc cgaaatccgc tggtaatcgt ggtgactcca 1981 gaaccaaaac cctgtctctt gcaaccacta ctcctgcagc aattgacccg cttacgaccc 2041 ttgataaccc aggtagtact caaagtacat ttgcgcaatt cccgatacct gaagaagcga 2101 gcatcctaga agaggatgga aaatactcca acgttgtctc aaccattcag agtgtagaac 2161 ctaatgctac tgatcacagc aataaggaca ccttttgcac tttgccagtt tggttacaac 2221 agaagtatag agagatcata cgtaatgatc tcccaccaag acctgccgac attaataaca 2281 tccccgtaaa acatgatatt gaaattaaac ctggcgcaag actacctcga ctacagccat 2341 accatgttac agaaaagaac gaacaagaaa tcaacaaaat agttcaaaaa ctgctcgata 2401 acaagttcat tgttccctca aagtcgcctt gcagctcccc tgtagtcctc gtcccgaaga 2461 aagacggtac cttccgactc tgcgtcgatt accgcaccct gaacaaagct accatctccg 2521 acccattccc attacccaga atcgacaacc tattgagccg tattggaaat gcccagatat 2581 ttaccacgct agatttgcat agtggttacc accagatccc gatggaaccc aaagaccgct 2641 acaaaaccgc ctttgtcaca ccatccggta agtatgaata taccgtcatg ccatttggct 2701 tagtcaatgc acctagtaca ttcgcaagat acatggctga tacatttaga gacctgagat 2761 tcgtcaatgt ttaccttgat gatatattaa tattctccga atctccagaa gaacattgga 2821 aacatttaga cacggtacta gaaagattaa agaacgagaa cctcattgtt aagaagaaaa 2881 aatgtaaatt tgcatctgaa gaaactgagt ttttaggcta tagtattgga atccagaaaa 2941 tagctccact acagcacaaa tgtgcagcaa tccgagactt tccgacgcct aaaacagtaa 3001 aacaagcaca gagattttta ggaatgatta attactacag acgattcatt ccaaattgct 3061 ccaagattgc acagccaatc caactgttta tttgtgacaa aagtcaatgg acagaaaaac 3121 aagacaaggc aattgataaa ctaaaagacg ccttgtgtaa ctcccccgtc ctagtaccat 3181 tcaacaacaa agcaaactac cgacttacaa cagacgcctc aaaagacggc attggtgctg 3241 ttctagaaga agtcgacaac aagaacaaac ttgttggtgt cgtcggttac ttctctaaat 3301 ccttagagag tgcccagaaa aactatcctg ctggcgaatt agaactactt ggaattatca 3361 aagcactcca ccacttccga tatatgcttc acggaaagca tttcacgtta agaacagacc 3421 acattagttt gttatcatta caaaacaaga acgaacccgc acgacgcgtg caacgctggt 3481 tagatgacct agccacatat gacttcacct tagaatacct agctggaccc aagaacgttg 3541 tcgcagatgc catatcccgt gccgtatata ctataacccc cgaaacatcc cgacctatcg 3601 acacagaaag ctggaaatct tactacaaat cagacccatt atgtagtgct gtcttaattc 3661 atatgaaaga attgacacaa cacaacgtca cacctgaaga tatgtcagcc ttccgtagtt 3721 accagaagaa actcgaacta tcagagacct tccgaaagaa ttattcccta gaagacgaaa 3781 tgatctatta ccaagaccga ctagtagtac caataaaaca acagaacgca gttatgagac 3841 tatatcatga ccatacctta tttggaggac attttggtgt aacagtgacc cttgcgaaaa 3901 tcagcccaat ttactattgg ccaaaattac aacattcgat catacaatac atcaggacct 3961 gcgtacaatg tcaactaata aaatcacacc gaccacgctt acatggacta ttacaaccac 4021 tccctatagc agaaggaaga tggcttgata tatcaatgga ttttgtgaca ggattacccc 4081 cgacatcaaa taacttgaat atgatcctcg tcgtagttga tcgtttttcg aaacgcgctc 4141 acttcatagc tacaaggaaa accttagacg caacacaact aatagatcta ctctttcgat 4201 acattttttc atatcatggt tttcccagga caataaccag tgatagagat gtccgtatga 4261 ccgccgacaa atatcaagaa ctcacgaaaa gactaggaat aaaatcgaca atgtcttccg 4321 cgaaccaccc ccaaacagat ggacaatccg aacgaacgat acagacatta aacaggttac 4381 taagagccta tgcttcaacc aatattcaga attggcatgt atatttacca caaatcgaat 4441 ttgtttacaa ttctacacct actagaacac ttggaaaatc accatttgaa attgatttag 4501 gatatttacc gaatacccct gctattaagt cagatgacga agtcaacgca agaagtttta 4561 ctgccgtaga acttgccaaa cacctcaaag cccttaccat ccaaacgaag gaacagctag 4621 aacacgctca aatcgaaatg gaaactaata acaatcaaag acgtaaaccc ttattgttaa 4681 acataggaga tcacgtatta gtgcatagag atgcatactt caagaaaggt gcttatatga 4741 aagtacaaca aatatacgtc ggaccatttc gagttgtcaa gaaaataaac gataacgcct 4801 acgaactaga tttaaactct cacaagaaaa agcacagagt tattaatgta caattcctga 4861 aaaagtttgt ataccgtcca gacgcgtacc caaagaataa accaatcagc tccactgaaa 4921 gaattaagag agcacacgaa gttactgcac tcataggaat agatactaca cacaaaactt 4981 acttatgtca catgcaagat gtagacccaa cactttcagt agaatactca gaagctgaat 5041 tttgccaaat tcccgaaaga acacgaagat caatattagc caactttaga caactctacg 5101 aaacacaaga caaccctgag agagaggaag atgttgtatc tcaaaatgag atatgtcagt 5161 atgacaatac gtcaccctga acgttcataa aacacatatg aaacaacctt ataacaaaac 5221 gaacaacatg agacaaaacc cgaccttccc tagctgaact acccaaagta taaatgcctg 5281 aacaattagt ttagatccga gattccgcgc ttccaccact tagtatgatt catattttat 5341 ataatatata agataagtaa cattccgtga attaatctga taaactgttt tgacaactgg 5401 ttacttccct aagactgttt atattaggat tgtcaagaca ctccggtatt actcgagccc 5461 gtaatacaac agaaagttcc attttggatg ctctatttat gggaatatga //
GenBank-Updates@genbank.bio.net (07/20/90)
LOCUS SYNM13MP18 7249 bp ds-DNA SYN 20-JUL-1990 DEFINITION Cloning vector M13mp18 DNA. ACCESSION M11454 KEYWORDS . SOURCE Synthetic DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 7249) AUTHORS Yanisch-Perron,C., Vieira,J. and Messing,J. TITLE Improved M13 phage cloning vectors and host strains: Nucleotide sequences of the M13mp18 and pUC19 vectors JOURNAL Gene 33, 103-119 (1985) STANDARD simple staff_entry BASE COUNT 1768 a 1538 c 1533 g 2410 t ORIGIN 1 aatgctacta ctattagtag aattgatgcc accttttcag ctcgcgcccc aaatgaaaat 61 atagctaaac aggttattga ccatttgcga aatgtatcta atggtcaaac taaatctact 121 cgttcgcaga attgggaatc aactgttaca tggaatgaaa cttccagaca ccgtacttta 181 gttgcatatt taaaacatgt tgagctacag caccagattc agcaattaag ctctaagcca 241 tccgcaaaaa tgacctctta tcaaaaggag caattaaagg tactctctaa tcctgacctg 301 ttggagtttg cttccggtct ggttcgcttt gaagctcgaa ttaaaacgcg atatttgaag 361 tctttcgggc ttcctcttaa tctttttgat gcaatccgct ttgcttctga ctataatagt 421 cagggtaaag acctgatttt tgatttatgg tcattctcgt tttctgaact gtttaaagca 481 tttgaggggg attcaatgaa tatttatgac gattccgcag tattggacgc tatccagtct 541 aaacatttta ctattacccc ctctggcaaa acttcttttg caaaagcctc tcgctatttt 601 ggtttttatc gtcgtctggt aaacgagggt tatgatagtg ttgctcttac tatgcctcgt 661 aattcctttt ggcgttatgt atctgcatta gttgaatgtg gtattcctaa atctcaactg 721 atgaatcttt ctacctgtaa taatgttgtt ccgttagttc gttttattaa cgtagatttt 781 tcttcccaac gtcctgactg gtataatgag ccagttctta aaatcgcata aggtaattca 841 caatgattaa agttgaaatt aaaccatctc aagcccaatt tactactcgt tctggtgttc 901 tcgtcagggc aagccttatt cactgaatga gcagctttgt tacgttgatt tgggtaatga 961 atatccggtt cttgtcaaga ttactcttga tgaaggtcag ccagcctatg cgcctggtct 1021 gtacaccgtt catctgtcct ctttcaaagt tggtcagttc ggttccctta tgattgaccg 1081 tctgcgcctc gttccggcta agtaacatgg agcaggtcgc ggatttcgac acaatttatc 1141 aggcgatgat acaaatctcc gttgtacttt gtttcgcgct tggtataatc gctgggggtc 1201 aaagatgagt gttttagtgt attctttcgc ctctttcgtt ttaggttggt gccttcgtag 1261 tggcattacg tattttaccc gtttaatgga aacttcctca tgaaaaagtc tttagtcctc 1321 aaagcctctg tagccgttgc taccctcgtt ccgatgctgt ctttcgctgc tgagggtgac 1381 gatcccgcaa aagcggcctt taactccctg caagcctcag cgaccgaata tatcggttat 1441 gcgtgggcga tggttgttgt cattgtcggc gcaactatcg gtatcaagct gtttaagaaa 1501 ttcacctcga aagcaagctg ataaaccgat acaattaaag gctccttttg gagccttttt 1561 ttttggagat tttcaacgtg aaaaaattat tattcgcaat tcctttagtt gttcctttct 1621 attctcactc cgctgaaact gttgaaagtt gtttagcaaa accccataca gaaaattcat 1681 ttactaacgt ctggaaagac gacaaaactt tagatcgtta cgctaactat gagggttgtc 1741 tgtggaatgc tacaggcgtt gtagtttgta ctggtgacga aactcagtgt tacggtacat 1801 gggttcctat tgggcttgct atccctgaaa atgagggtgg tggctctgag ggtggcggtt 1861 ctgagggtgg cggttctgag ggtggcggta ctaaacctcc tgagtacggt gatacaccta 1921 ttccgggcta tacttatatc aaccctctcg acggcactta tccgcctggt actgagcaaa 1981 accccgctaa tcctaatcct tctcttgagg agtctcagcc tcttaatact ttcatgtttc 2041 agaataatag gttccgaaat aggcaggggg cattaactgt ttatacgggc actgttactc 2101 aaggcactga ccccgttaaa acttattacc agtacactcc tgtatcatca aaagccatgt 2161 atgacgctta ctggaacggt aaattcagag actgcgcttt ccattctggc tttaatgaag 2221 atccattcgt ttgtgaatat caaggccaat cgtctgacct gcctcaacct cctgtcaatg 2281 ctggcggcgg ctctggtggt ggttctggtg gcggctctga gggtggtggc tctgagggtg 2341 gcggttctga gggtggcggc tctgagggag gcggttccgg tggtggctct ggttccggtg 2401 attttgatta tgaaaagatg gcaaacgcta ataagggggc tatgaccgaa aatgccgatg 2461 aaaacgcgct acagtctgac gctaaaggca aacttgattc tgtcgctact gattacggtg 2521 ctgctatcga tggtttcatt ggtgacgttt ccggccttgc taatggtaat ggtgctactg 2581 gtgattttgc tggctctaat tcccaaatgg ctcaagtcgg tgacggtgat aattcacctt 2641 taatgaataa tttccgtcaa tatttacctt ccctccctca atcggttgaa tgtcgccctt 2701 ttgtctttag cgctggtaaa ccatatgaat tttctattga ttgtgacaaa ataaacttat 2761 tccgtggtgt ctttgcgttt cttttatatg ttgccacctt tatgtatgta ttttctacgt 2821 ttgctaacat actgcgtaat aaggagtctt aatcatgcca gttcttttgg gtattccgtt 2881 attattgcgt ttcctcggtt tccttctggt aactttgttc ggctatctgc ttacttttct 2941 taaaaagggc ttcggtaaga tagctattgc tatttcattg tttcttgctc ttattattgg 3001 gcttaactca attcttgtgg gttatctctc tgatattagc gctcaattac cctctgactt 3061 tgttcagggt gttcagttaa ttctcccgtc taatgcgctt ccctgttttt atgttattct 3121 ctctgtaaag gctgctattt tcatttttga cgttaaacaa aaaatcgttt cttatttgga 3181 ttgggataaa taatatggct gtttattttg taactggcaa attaggctct ggaaagacgc 3241 tcgttagcgt tggtaagatt caggataaaa ttgtagctgg gtgcaaaata gcaactaatc 3301 ttgatttaag gcttcaaaac ctcccgcaag tcgggaggtt cgctaaaacg cctcgcgttc 3361 ttagaatacc ggataagcct tctatatctg atttgcttgc tattgggcgc ggtaatgatt 3421 cctacgatga aaataaaaac ggcttgcttg ttctcgatga gtgcggtact tggtttaata 3481 cccgttcttg gaatgataag gaaagacagc cgattattga ttggtttcta catgctcgta 3541 aattaggatg ggatattatt tttcttgttc aggacttatc tattgttgat aaacaggcgc 3601 gttctgcatt agctgaacat gttgtttatt gtcgtcgtct ggacagaatt actttacctt 3661 ttgtcggtac tttatattct cttattactg gctcgaaaat gcctctgcct aaattacatg 3721 ttggcgttgt taaatatggc gattctcaat taagccctac tgttgagcgt tggctttata 3781 ctggtaagaa tttgtataac gcatatgata ctaaacaggc tttttctagt aattatgatt 3841 ccggtgttta ttcttattta acgccttatt tatcacacgg tcggtatttc aaaccattaa 3901 atttaggtca gaagatgaaa ttaactaaaa tatatttgaa aaagttttct cgcgttcttt 3961 gtcttgcgat tggatttgca tcagcattta catatagtta tataacccaa cctaagccgg 4021 aggttaaaaa ggtagtctct cagacctatg attttgataa attcactatt gactcttctc 4081 agcgtcttaa tctaagctat cgctatgttt tcaaggattc taagggaaaa ttaattaata 4141 gcgacgattt acagaagcaa ggttattcac tcacatatat tgatttatgt actgtttcca 4201 ttaaaaaagg taattcaaat gaaattgtta aatgtaatta attttgtttt cttgatgttt 4261 gtttcatcat cttcttttgc tcaggtaatt gaaatgaata attcgcctct gcgcgatttt 4321 gtaacttggt attcaaagca atcaggcgaa tccgttattg tttctcccga tgtaaaaggt 4381 actgttactg tatattcatc tgacgttaaa cctgaaaatc tacgcaattt ctttatttct 4441 gttttacgtg ctaataattt tgatatggtt ggttcaattc cttccataat tcagaagtat 4501 aatccaaaca atcaggatta tattgatgaa ttgccatcat ctgataatca ggaatatgat 4561 gataattccg ctccttctgg tggtttcttt gttccgcaaa atgataatgt tactcaaact 4621 tttaaaatta ataacgttcg ggcaaaggat ttaatacgag ttgtcgaatt gtttgtaaag 4681 tctaatactt ctaaatcctc aaatgtatta tctattgacg gctctaatct attagttgtt 4741 agtgcaccta aagatatttt agataacctt cctcaattcc tttctactgt tgatttgcca 4801 actgaccaga tattgattga gggtttgata tttgaggttc agcaaggtga tgctttagat 4861 ttttcatttg ctgctggctc tcagcgtggc actgttgcag gcggtgttaa tactgaccgc 4921 ctcacctctg ttttatcttc tgctggtggt tcgttcggta tttttaatgg cgatgtttta 4981 gggctatcag ttcgcgcatt aaagactaat agccattcaa aaatattgtc tgtgccacgt 5041 attcttacgc tttcaggtca gaagggttct atctctgttg gccagaatgt cccttttatt 5101 actggtcgtg tgactggtga atctgccaat gtaaataatc catttcagac gattgagcgt 5161 caaaatgtag gtatttccat gagcgttttt cctgttgcaa tggctggcgg taatattgtt 5221 ctggatatta ccagcaaggc cgatagtttg agttcttcta ctcaggcaag tgatgttatt 5281 actaatcaaa gaagtattgc tacaacggtt aatttgcgtg atggacagac tcttttactc 5341 ggtggcctca ctgattataa aaacacttct caagattctg gcgtaccgtt cctgtctaaa 5401 atccctttaa tcggcctcct gtttagctcc cgctctgatt ccaacgagga aagcacgtta 5461 tacgtgctcg tcaaagcaac catagtacgc gccctgtagc ggcgcattaa gcgcggcggg 5521 tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt 5581 cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg 5641 ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga 5701 tttgggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac 5761 gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc 5821 tatctcgggc tattcttttg atttataagg gattttgccg atttcggaac caccatcaaa 5881 caggattttc gcctgctggg gcaaaccagc gtggaccgct tgctgcaact ctctcagggc 5941 caggcggtga agggcaatca gctgttgccc gtctcgctgg tgaaaagaaa aaccaccctg 6001 gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 6061 cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct 6121 cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 6181 tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg aattcgagct 6241 cggtacccgg ggatcctcta gagtcgacct gcaggcatgc aagcttggca ctggccgtcg 6301 ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 6361 atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 6421 agttgcgcag cctgaatggc gaatggcgct ttgcctggtt tccggcacca gaagcggtgc 6481 cggaaagctg gctggagtgc gatcttcctg aggccgatac ggtcgtcgtc ccctcaaact 6541 ggcagatgca cggttacgat gcgcccatct acaccaacgt aacctatccc attacggtca 6601 atccgccgtt tgttcccacg gagaatccga cgggttgtta ctcgctcaca tttaatgttg 6661 atgaaagctg gctacaggaa ggccagacgc gaattatttt tgatggcgtt cctattggtt 6721 aaaaaatgag ctgatttaac aaaaatttaa cgcgaatttt aacaaaatat taacgtttac 6781 aatttaaata tttgcttata caatcttcct gtttttgggg cttttctgat tatcaaccgg 6841 ggtacatatg attgacatgc tagttttacg attaccgttc atcgattctc ttgtttgctc 6901 cagactctca ggcaatgacc tgatagcctt tgtagatctc tcaaaaatag ctaccctctc 6961 cggcattaat ttatcagcta gaacggttga atatcatatt gatggtgatt tgactgtctc 7021 cggcctttct cacccttttg aatctttacc tacacattac tcaggcattg catttaaaat 7081 atatgagggt tctaaaaatt tttatccttg cgttgaaata aaggcttctc ccgcaaaagt 7141 attacagggt cataatgttt ttggtacaac cgatttagct ttatgctctg aggctttatt 7201 gcttaatttt gctaattctt tgccttgcct gtatgattta ttggatgtt // LOCUS RATCDOL 1458 bp ss-mRNA ROD 20-JUL-1990 DEFINITION Rat cysteine dioxygenase mRNA, complete cds. ACCESSION M35266 KEYWORDS cysteine dioxygenase. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1458) AUTHORS Hosokawa,Y., Matsumoto,A., Oka,J., Itakura,H. and Yamaguchi,K. TITLE Isolation and characterization of a cDNA for rat liver cysteine dioxygenase JOURNAL Biochem. Biophys. Res. Commun. 168, 473-478 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 190 792 cysteine dioxygenase (EC 1.13.11.20) BASE COUNT 396 a 325 c 343 g 394 t ORIGIN 1 gagggtttag gctggtctcc ggtgacctcc tagtcctaaa tcttgatacc cttgcaagag 61 ctttgagcgt gtggggtccc gggcgttcgg ggtcccgggt gtgtgcggtt tgtatagcct 121 gaagccgggg tcctccgcgc tcgcgtcctc cgcagctgga ctgaagagac gcgtcccagc 181 cctgcgggga tggaacggac cgagctgctg aagccccgga ccctggccga cctcatccga 241 atcttgcatg agctcttcgc cggggacgaa gtcaatgtgg aggaggtgca ggctgtgctg 301 gaagcctacg agagcaatcc tgccgagtgg gctttgtatg ccaaattcga tcaatacagg 361 tatacccgaa accttgtgga tcaaggaaat gggaagttta atctgatgat tctgtgctgg 421 ggtgaagggc atggcagcag tattcacgat cacacggact cccactgctt tttgaagctg 481 ctgcaaggaa atctaaagga gacattgttt gactggcctg acaagaaatc caacgagatg 541 atcaagaagt ctgaaagaac tttgagggaa aatcagtgtg cctacattaa tgattctatt 601 ggcttacatc gagtagagaa cgtcagccac acagagcctg ctgtgagcct tcacttgtac 661 agtccacctt tcgatacatg ccatgccttt gaccaacgaa cagggcataa aaacaaagtc 721 accatgacat tccacagcaa atttggaatc agaactccat ttacaacttc aggttcactg 781 gagaacaact aagacctgcc aagcctttca aagttttgct tctgggtcgt tggaatgttt 841 taccttggat aagagaggcc acccatcatt tgctgtccag ttatacattt taataagtcc 901 atgctcagtg tgtatactaa ggaagcaaac catcccctga gctatgcagg agaaaaatcc 961 cactaaagaa aaagtcactt gatttttaat agccaaatca ccttgctccc agttcttctg 1021 tcttctaact ccatggaaat tctattggga gttctcagtg gggttttttt tcaaccttag 1081 gaaagcactt ctggtctctg aactctaata atcaataagt aaaaatgaag aaaccacaag 1141 ctatcacatg tctgttttca tacctggaag tctaagtgtg gaaatcttta atttactttg 1201 tatgttctta atgtttgaca agaatttttt taaatcttgg ttttcagttt tttcaaccct 1261 gtttgacaaa ttcctatgct gtggagacta gggatgcaga tagcagtttg gtgtttggta 1321 gtgaacagca gtggggccag aaatgtgcat gtatccagac ctcctgcaaa taaaaactga 1381 aactcatgtg taatgtgtgc caccacctta agctgccacc aaaattgcca aacgacttta 1441 ataaaactgg atttgaga // LOCUS HUMMYOHP 6024 bp ss-mRNA PRI 20-JUL-1990 DEFINITION Human perinatal myosin heavy chain mRNA, complete cds. ACCESSION M36769 M35250 Y00821 KEYWORDS myosin; myosin heavy chain. SOURCE human fetal muscle, cDNA to mRNA, clone pSMHCP. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 6024) AUTHORS Leinwand,L.A. TITLE ; JOURNAL Unpublished (1989) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 2651 to 6024) AUTHORS Feghali,R. and Leinwand,L.A. TITLE Molecular genetic characterization of a developmentally regulated human perinatal myosin heavy chain JOURNAL J. Cell Biol. 108, 1791-1797 (1989) STANDARD simple automatic REFERENCE 3 (bases 1 to 2650) AUTHORS Karsch-Mizrachi,I., Feghali,R., Shows,T.B. and Leinwand,L.A. TITLE Generation of a full-length human perinatal myosin heavy chain cDNA JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Author address: L.A.Leinwand Albert Einstein College of Medicine 1300 Morris Park Avenue Bronx, N.Y. 10461, USA. Submitted (12-APR-1989) on tape to the EMBL data library. Draft entry and computer-readable sequence for [3] kindly submitted by I.Karsch-Mizrachi, 18-JUN-1990. FEATURES from to/span description pept 74 5887 myosin heavy chain BASE COUNT 1961 a 1300 c 1586 g 1177 t ORIGIN 1 gtggaacact tctgaacctg catttttatc tggaactcca gaagcagaat cctttgctaa 61 ataaatcgca gccatgagtg cgagctcaga cgctgagatg gctgtttttg gcgaagctgc 121 tccctacctt cgaaaatcag aaaaggagcg gattgaggcc caaaacaagc cgtttgatgc 181 taaaacatct gtctttgtgg cggagcccaa ggaatcctat gtgaagagca ctatacaaag 241 caaagaagga gggaaagtaa ccgtaaagac tgaaggtgga gcaactctaa ctgtcaggga 301 agaccaagtc ttccctatga accctccgaa atatgacaaa attgaggaca tggccatgat 361 gactcatcta cacgagcctg gagtgctgta caacctcaaa gagcgctatg cagcctggat 421 gatctacacc tactcaggcc tcttctgtgt caccgtcaac ccctacaagt ggctgccggt 481 gtacaagccc gaggtggtgg ctgcctacag aggcaaaaag cgccaggagg ccccgcccca 541 catcttctcc atctctgaca atgcctatca gttcatgttg actgatcgag agaatcagtc 601 catcctgatc accggagaat ctggtgccgg aaagactgtg aacaccaagc gtgtcatcca 661 atactttgca acaattgcag ttactggaga gaagaagaag gatgaatctg gcaaaatgca 721 ggggactctg gaagatcaaa tcatcagcgc caatccccta ctggaggcct ttggcaatgc 781 caaaaccgtg aggaatgaca actcctctcg ctttggtaaa ttcattagaa tccactttgg 841 tactacaggg aagctggcat ctgctgatat agaaacatat cttttagaaa agtccagagt 901 tactttccag ctaaaggcgg aaagaagcta ccatattttt tatcagatca cttccaataa 961 gaagccagat ctaattgaaa tgctcctgat caccaccaac ccatatgact atgccttcgt 1021 cagtcagggg gagatcacag ttcccagtat tgatgaccaa gaagagttga tggccactga 1081 tagtgccatt gacatcctgg gcttcactcc tgaagagaaa gtgtccatct ataaactcac 1141 aggggctgtg atgcattatg ggaacatgaa attcaagcaa aagcagcgtg aggagcaagc 1201 tgagccagat ggcacagaag tcgctgacaa ggcagcctat ctccagagtc tgaactctgc 1261 agacctactc aaagccctct gctaccctag ggtcaaggtt ggcaatgagt atgtcaccaa 1321 aggccagact gtgcagcagg tgtacaatgc cgtgggtgct ctggccaaag ccgtctacga 1381 gaagatgttc ctgtggatgg tcacccgcat caaccagcag ctggacacca agcagcccag 1441 gcagtacttc atcggggtct tggacattgc tggctttgaa atctttgatt ttaacagcct 1501 ggagcagctg tgcatcaact tcaccaacga gaaactgcaa cagtttttca accaccacat 1561 gtttgtgcta gagcaggagg agtacaagaa ggaaggcatc gagtggacgt tcattgactt 1621 tgggatggac ctggctgcct gcattgagct cattgagaag ccactgggca tcttctccat 1681 cctggaagag gagtgcatgt tccctaaggc cacggacacc tccttcaaga acaagctgta 1741 tgaccagcac ctgggcaagt ctgccaactt ccagaagccc aaggtggtca aaggcaaggc 1801 tgaggcccac ttctctctga ttcactatgc tggcactgtg gactacaaca ttactggctg 1861 gctggacaaa aataaggacc ccctgaatga tactgtggtt gggctgtacc agaagtctgc 1921 aatgaagact ctagccagtc tcttttccac gtatgctagt gctgaagcag atagcagcgc 1981 gaagaaaggt gctaagaaaa agggctcttc tttccagact gtgtctgccc ttttcaggga 2041 aaatttaaat aaattgatga cgaatctgag gagcacacac cctcacttcg tacggtgtat 2101 cattcccaat gaaaccaaaa ctcctggggc aatggaacat gaacttgtgt tgcaccagct 2161 gaggtgtaat ggtgtgctgg aaggcatccg catctgtagg aaaggattcc caagcagaat 2221 cttatatggt gatttcaaac aaagatacaa ggttttaaat gcaagtgcta ttccagaggg 2281 acagttcatt gacagcaaga aggcttctga gaaacttctt gcatctattg atattgatca 2341 tactcaatat aaatttggac ataccaaggt tttcttcaaa gctggacttc tgggtcttct 2401 ggaagaaatg agagatgaaa aattagccca aattataaca agaacacaag ctgtctgtag 2461 gggattccta atgagggtag aatatcagaa gatgttgcaa aggagagaag cacttttctg 2521 catccagtat aatgtccgtg ccttcatgaa cgtcaagcac tggccctgga tgaaactctt 2581 tttcaagatt aagcccctcc tcaagagtgc agagaccgag aaagagatgg ccaccatgaa 2641 ggaagaattc cagaaaacca aagatgaact cgccaagtca gaggcaaaac ggaaggagct 2701 agaggaaaaa atggtcactc tcttaaaaga gaaaaatgac ctgcaactcc aggttcaatc 2761 tgaagcagat agcttggctg atgcagagga aaggtgtgag caactgatta aaaacaaaat 2821 ccaacttgag gccaaaatca aagaggtgac tgaaagagct gaggaggagg aagagatcaa 2881 tgctgagctg acagccaaga agagaaaact ggaggatgaa tgttcagaac tcaagaaaga 2941 cattgatgac cttgagctga cactggccaa ggttgagaag cagaaacatg ccacggagaa 3001 caaggtgaaa aatcttacag aagagatggc aggcctggat gaaaccattg caaaactgtc 3061 caaggagaag aaggctctcc aagagaccca ccagcagacc ctggatgacc tgcaggcaga 3121 ggaggacaaa gtcaacatcc tgaccaaagc taaaaccaag ctagaacagc aagtggatga 3181 tcttgaaggg tctctggaac aagaaaagaa gcttcgaatg gatctagaaa gagcaaagcg 3241 gaaactggag ggtgacctca aattggccca agaatccaca atggatatgg aaaatgacaa 3301 acagcaactt gatgaaaagc ttgaaaagaa agaatttgaa atcagcaatt tgataagcaa 3361 aattgaagat gagcaagctg tagaaattca actacagaag aagatcaaag agttgcaggc 3421 ccgcattgag gagctggggg aagaaatcga ggcagagagg gcgtcccgag ccaaagcgga 3481 gaagcagcgc tctgacctct cccgggaact ggaggagatc agcgagaggc tggaagaagc 3541 cggtggggca acttctgctc aggtggaatt gaacaagaag cgggaggctg agtttcagaa 3601 actgcgcagg gacctggagg aggccaccct gcagcatgaa gctatggtgg ctgctcttcg 3661 gaagaagcac gcagacagta tggctgagct tggggagcag attgacaact tgcagcgggt 3721 caaacagaag ctggagaagg agaagagtga gctgaagatg gagactgatg acctcagcag 3781 taacgcagag gccatttcca aagccaaggg ccaccttgaa aagatgtgcc gctctctaga 3841 agatcaagtg agtgggctta agaccaagga agaggagcag cagcggctga tcaatgacct 3901 cacagcacag agagcgcgcc tgcagacaga agcgggtgaa tattctcgac aattagatga 3961 gcaagatgct ttagtctctc agctttcaag gagcaagcaa gcatctactc agcagattga 4021 agagctgaaa catcaactag aggaagaaac taaagccaag aacgccctgg cacacgccct 4081 gcagtcctcc cgccatgact gcgacctgct gcgggaacag tatgaggaag agcaggaagg 4141 caaagctgag ctgcagaggg cgctgtccaa ggccaacagt gaggttgccc agtggagaac 4201 caaatacgag acggatgcca tccagcgcac agaggagctg gaggaggcca agaaaaagtt 4261 ggcccagcgc ctgcaagaag ctgaggaaca tgtagaagct gtgaacgcca aatgtgcttc 4321 ccttgagaag acgaagcagc ggctccagaa tgaagttgaa gacctcatgc ttgatgtgga 4381 aaggtctaat gcagcctgtg cagcccttga taagaagcaa aggaactttg acaaggtcct 4441 atcagaatgg aagcagaagt atgaggaaac tcaggctgaa cttgaggcct cccagaagga 4501 gtcacgttct cttagcactg agctgttcaa ggtgaagaat gtctatgagg aatccctgga 4561 tcaactcgaa acgctaagaa gagcacataa gaacttgcaa caggagattt ctgacctcac 4621 tgagcagatt gcagagggag gaaagcaaat tcatgaattg gagaaaataa agaagcaagt 4681 agaacaagag aaatgtgaaa ttcaggctgc tttagaggaa gcagaggcat ctcttgaaca 4741 tgaagaagga aagattctgc gtatccagct tgagttaaac caagtcaagt ctgaagttga 4801 tagaaaaatc gcagaaaagg atgaggaaat tgaccagctg aagagaaacc acactagagt 4861 cgtggagaca atgcagagca cgctggatgc agagattaga agcagaaatg atgctctgag 4921 agtcaagaag aaaatggaag gagatctgaa tgaaatggaa atccagctga accatgccaa 4981 tcgcttagct gcagagagtt taaggaacta caggaacacc caaggaatcc tgaaggaaac 5041 ccagctccac ctggatgatg ctctccgggg ccaggaggac ctcaaggaac agctggcaat 5101 tgtggagcgc agagccaacc tgctgcaggc tgagatcgag gagctgtggg ccactctgga 5161 acagacagag agaagcagga aaatcgccga acaggagctc ctggatgcca gtgagcgtgt 5221 ccagctcctc cacacccaga ataccagtct cattaacacc aagaagaaat tagaaaatga 5281 cgtttcccaa ctccaaagtg aagtggaaga agtaatccaa gaatcacgca atgcagaaga 5341 gaaagccaag aaggccatca ctgatgctgc catgatggct gaggagctga agaaggaaca 5401 ggacaccagc gcccacctgg agcggatgaa gaagaacctg gagcagacgg tgaaggacct 5461 gcagcatcgt ctagatgagg ccgagcagct ggcgctgaag ggtgggaaga agcagatcca 5521 gaaactggag gccagggtac gtgagcttga aggagaggtt gaaaatgaac agaaacgtaa 5581 tgcagaggct gttaaaggtt tacggaaaca tgaccgacga gtaaaagaac tcacctacca 5641 gactgaagaa gatcgcaaga atgttctcag gctgcaggac ttggtagata aattacaggc 5701 gaaggtgaaa tcatacaaga gacaagctga ggaggctgag gaacaatcca atgctaatct 5761 atctaaattc cgcaaactcc agcatgagct ggaggaggcc gaggaacggg ctgacattgc 5821 tgagtcccag gtcaacaaat tgcgagtgaa gagccgagag gttcacacaa aaatcagtgc 5881 agagtaaaca cacctgcctg atgctatcaa gaggctgaag aaaggcacaa aatgtgctat 5941 ttttggtcac ttgctttatg acgtttattt tcctgttaaa gctgaataaa taaaaactac 6001 agtaaatgta tacattaaaa aaaa // LOCUS PT4MRH 1266 bp ds-DNA PHG 20-JUL-1990 DEFINITION Bacteriophage T4 rpoH modulating protein (mrh) gene, complete cds. ACCESSION M30001 KEYWORDS Mrh protein; rpoH modulating protein. SOURCE Bacteriophage T4 (strain GT7) DNA, clone pMF28,h37. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 1266) AUTHORS Frazier,M.W. and Mosig,G. TITLE The bacteriophage T4 gene mrh whose product inhibits late T4 gene expression in an E.coli rpoH (sigma-32) mutant JOURNAL Gene 88, 7-14 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.W.Frazier, 20-NOV-1989. FEATURES from to/span description pept 782 1132 rpoH modulating protein ORF 96 302 mrh ORF soc-1 site 39 68 early promoter 14.95 BASE COUNT 444 a 167 c 234 g 421 t ORIGIN 1 ctcaaggact ccttcgggag tcctttttca tttaaatggt ttactttcca aaatgagtat 61 ggtataatag aaattatctt atagaggaga gtactatgtt aaatcgttgg attaaaccaa 121 atgaagattt ggatattatc atttcacgac atgtaatgaa gaaatatgaa ctacaaccat 181 ggtctacaga agttgttgtg cattcattta tgatgtacgc agatggttct gtcgaattta 241 atgtagagat tcgatatgat tatggtgaga agcaagtcga attcaaaaga ggctttttgt 301 aatgtttatc tttaattggt ttaaaagttt ctttacggat tttttctcta caactctggg 361 gaaggtgtag ttcctatttc aaatgactac cttcctttaa ctgtagttga atatgtttat 421 atgggagatg gaacagtaga agcagttact atgacttatg aagaagccca agaatattat 481 aaaaatcctt ggcgctggtc aacacctact acatcatcta acacacagaa tacacagtct 541 agttctgatt catatgatac taatgttcct gttcatgtat gggcaggtga ttcatgtgga 601 agttcttgtg attctagttg ttcatctaca tcttgtgatt gaggaaaatt atggaagcaa 661 ttttgtttga aatgtctatc aatagtatta gtatggcgtt tgtcaaggat gttccaatta 721 ctgtagcagt aatgattgat aaaagacaat aatatgtatc tagtagaaga ttttatttca 781 aatgccaatt ccagaagatg ttgaaataaa acttaaaaag atcggaagga ttgaaactgt 841 acaaaattct ccatttatgg caattcaagc atttactaaa tctaactata ttgatgttcg 901 tgaagcatat tataataata aaccgttatc cttctattca tatgattcaa tatatgattg 961 gaaaatagat aaaggaaata aatttataat tacggacgaa agtgcgctat catattttat 1021 tacctcttta tggaataatt taaatccaaa tttgctaaaa attcataaat ttgacgatgc 1081 tcctactatt gttttaggta aaacgaatga aagttctgaa gaagatgctt gaatggttca 1141 atagaccaaa ctcaatgtat attgatgacg gttgggttga acaagcaaat aaagaaatca 1201 gaatgaatca gaagaagaat ggatgaaatc aatgattatg ctgagaaaga aaagaaacta 1261 gaacgc // LOCUS BOVPER 1295 bp ss-mRNA MAM 20-JUL-1990 DEFINITION Bovine peripherin mRNA, complete cds. ACCESSION J02884 KEYWORDS peripherin; photoreceptor protein. SOURCE Bovine retina, cDNA to mRNA, clones lambda.[5,4,8,17]. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1295) AUTHORS Connell,G.J. and Molday,R.S. TITLE Molecular cloning, primary structure and orientation of the vertebrate photoreceptor protein peripherin in the rod outer segment disk membrane JOURNAL Biochemistry 29, 4691-4698 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Connell, 20-OCT-1989. FEATURES from to/span description pept < 1 1023 peripherin BASE COUNT 272 a 368 c 379 g 276 t ORIGIN 1 aaatttgacc agaagaagcg ggtcaagttg gcccaagggc tctggctcat gaactggttc 61 tccgtgttgg ctggtatcat catcttcggc ttagggctgt tcctgaagat tgaactccgg 121 aagagaagcg atgtgatgaa caattctgag agccattttg tgcccaattc cttgatcggg 181 gtgggggtgc tgtcctgtgt cttcaattct ctggctggca agatctgtta cgacgccctg 241 gaccctgcca agtacgccaa gtggaagccc tggctgaagc cgtacctggc cgtgtgtgtc 301 ctcttcaacg tggtcctctt cctggtggcc ctctgctgct tcctcctgcg gggctcgctg 361 gagagtacgc tggcccacgg actcaagaac ggcatgaaat tctatcggga cacggacacc 421 ccaggccggt gtttcatgaa gaagaccatc gacatgctgc agatcgagtt caagtgctgc 481 ggcaacaacg gctttcggga ctggtttgag attcagtgga tcagcaaccg ctatctggat 541 ttttcctcca aagaagtcaa agatcgcatc aagagcaatg tggacgggcg gtacctggtg 601 gacggtgtcc ccttcagctg ctgcaacccc aactcaccgc ggccctgcat ccagtaccag 661 ctcaccaaca actctgcgca ctacagctac gatcaccaga cggaggagct caacctgtgg 721 ctgcgtggct gcagggccgc cctgctgagc tattacagca acctcatgaa tactacaggc 781 gctgtgacgc tcctcgtttg gctctttgag gtgaccatca ctgttgggct acgctacctg 841 cacacggcgc tggaaggcat ggccaacccc gaagaccctg agtgcgagag tgagggctgg 901 cttctggaga agagcgtgcc ggagacctgg aaggcctttc tggagagtgt gaagaagctg 961 ggcaagggca accaggtgga agccgagggc gaggacgcag gccaggcccc ggcggcaggc 1021 tgacggccct gcggccccct cccctctgca cactgaaaag tagtggactc caggaacttc 1081 ggataccccc cggatcccat cagaatctcc caaagagggc ggccgtctca cagaggctct 1141 tctggatgtg ggatttaata ttcagggccc ctaaaaactt ctgtggaatg aatgtctcac 1201 tgtgagtgca tgagtcctgc tcagccccac cgccctggat gcaccttcac ggtgggctca 1261 tggagggcct ctcagcaggt gactctgtgg tcaca // LOCUS HAMRPS14 5300 bp ds-DNA ROD 20-JUL-1990 DEFINITION C.griseus ribosomal protein S14 (RPS14) gene, complete cds. ACCESSION M35008 KEYWORDS ribosomal protein S14. SOURCE C.griseus (haplotype EmtB wild type) ovary transformed fibroblast cell line CHO DNA. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 5300) AUTHORS Diaz,J.-J., Rhoads,D. and Roufa,D.J. TITLE Genetic analysis of a vital mammalian housekeeping locus using CHO cells that express a transfected mutant allele JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.J.Roufa, 08-JUN-1990. Author address: D.J.Roufa Kansas State University Division of Biology Ackert Hall Manhattan, Kansas 66506 FEATURES from to/span description pept 1977 2125 ribosomal protein S14, exon 2 (first expressed exon) 2576 2737 ribosomal protein S14, exon 3 4074 4150 ribosomal protein S14, exon 4 4724 4791 ribosomal protein S14, exon 5 pre-msg 179 4841 ribosomal protein S14 mRNA and introns IVS 220 1974 ribosomal protein S14 intron A IVS 2126 2575 ribosomal protein S14 intron B IVS 2738 4073 ribosomal protein S14 intron C IVS 4151 4723 ribosomal protein S14 intron D signal 4819 4824 poly-A signal BASE COUNT 1315 a 1121 c 1415 g 1449 t ORIGIN 1 agatctggtc acagcgacac gcagcacccg gagtccttaa taagccatct tggtcgccgg 61 caggcttagc ctccttggaa accgcaatag aggcttcttc cgggagccgg gattccctcc 121 gtgacctcag aggtacactt ccggaatacg gaagtgaccc gtctgcccgc cccttccttc 181 tctctctttc cggtggagaa gtctggagac gacgttcagg tagggaggcc cggcggccca 241 caatcgtgcg gcatcctctg gccaggggcg tctgcgggct tggggtcgcg gtttgtgcgc 301 ggtaaccgag ccaggtgcgt gatggtgacg ggcggctgtc cgctcgccgc gatccacccg 361 gcctggggct gcgccgtgtc cgggaggcgg ccttctggcc tgagatcgga aggtctgagg 421 tggaagtgcg gggcctgcct gggccttgcc gtcacgccac agtcgtcgcc cgagcgttcg 481 gggagcgtgt ccctagtcgc gggtgggccg tcaccggcac cccattagat tcttaggggc 541 cgtcgacgac ctgcacctaa cctcatgccc ctcatgtgga aaatacgagc agcgttcgta 601 tctgcccctt ctgtaaaccg aagcgtaaaa tgggatgtca gaaaacctgg tcacgcagga 661 ttagagtgct tctaacccat ggtggttacc cctccgagtc ggacacattg cctgttgggg 721 tgccccctag tagagtagga ctgatgccat atgtttacat ttctctgaat agccctctcg 781 tctccactga ggacggcaaa gctacactga acagatgtca tgtcttatgg aatcgtgttt 841 gttagcctaa tttgagggta tgagcaggtg actacatcct ggacctgccc ttgtgctttt 901 ggcaaataag tcccaaatag tgtgtgaggg aggcactaca gagatcccag gtgtggttga 961 aacttttaag gcctaggagg aactttgtgg agtaggagca tcttggttct aggaggagta 1021 taagacaggc tttggtgcag tattgtcctg gacagactta gcctcgggtt ggtcaaaggc 1081 atgcctgcag ctttggcaaa aaaaaaaaaa gcctctttgc ttgtgtattt tccctgggtc 1141 ctgtgactgt acttacatta ttgactaggc aggaaggggt tacatgtcag gtgtatttgc 1201 agttcataac aatagcaaaa ttagagttat gaagtagtag caatgacttt ttgtgtgggg 1261 ggcaccacaa tatgagaatt gtactaaagg gtcacagcgt taggttgaga accaaagggt 1321 agactcactt gatgctgaac actgaatttt aggtatggat atctttacca ttaaacttcc 1381 tggaaggcat accctgacca tcctgggact tgctacgtag accataacta cgatatcctc 1441 ctgcctgtga ctcccaggtg ctggggctaa agttatacac aagcacacat tgctgtggca 1501 gtgcttttta atgtttatta ctagtaaagg gtgaaagatt tggttctaaa cagtgagagg 1561 ctgtcagatt aagaattgtg tgtggacagg gagtccgtta atctgcagtt ggcttcagtt 1621 gtgtcatgtt aatcaaataa acttcccagc ttaaggagag ggagagggga gttgtgaatg 1681 gggctttgcc ccaacacctg agttcagcat cctctatttt ggataactac ttgtgaaagt 1741 ttgaggtgca tattgttcat tgctggaatc ttaggctcat cctgaaaatg cttggagctt 1801 ttgtttagaa attgtctcct ctcttggcgc ctccctttta gttacaactc agtaattccc 1861 caagtacaaa tatgttggat attagagagc caaaggagct tgggttttcc tttagtaggc 1921 taaagtattt ggggatgttt tgtctgttaa catggaggtt ctccttccac tcagaaatgg 1981 cacctcgcaa gggaaaggaa aagaaggaag aacaggtcat cagcctcgga cctcaagtgg 2041 ctgaaggaga gaatgtattt ggtgtctgcc acatttttgc atctttcaac gacacctttg 2101 tccatgttac tgatctttct ggcaagtgag tacttgggga catgaagaca taaaaccaag 2161 cctcacaggg actgaggtgg gagaaaaagg cataggagtt tatgggattc cctctaaaga 2221 attactgatt ttcttatttt cccataaatt ataagagaaa acagttgaat gtagctgagg 2281 attcatccac agaaaatcac cgcaaatagg aagtggaagc agaggatcag aaactcaatt 2341 atcctcagct acacgcacat gtaaaatata tgcattatgg aagtgattat ggggataaaa 2401 tgcaaaacta attgttgtat taacagggag tttgtctaga tagggtcctt ggacccctgt 2461 aattatacac aactcctgtc ttgttcagaa ccaggatgtg tgcttgttgg ggttcagcaa 2521 gtgtttcctg tgtccttgtt ggaaaatgaa ggcatggatc tgcttttgtc tctagggaaa 2581 ctatctgccg ggtgactggt ggaatgaagg taaaggctga cagagatgag tcttctcctt 2641 atgcagccat gttggctgcc caggatgttg cccagaggtg caaggagctg ggcatcactg 2701 ccctgcatat caaactccga gctacgggag gaaacaggta tggaagtcta gggctggaca 2761 tttgtaagtt cactgggctt gggggtggat ctgagatttg agtctctgct gtgtctgaga 2821 gcaagccgtt ttaggaatgt atgtatgttc ttcgtcaggc atggtagctg atacctgtaa 2881 tcccagcact ccggaaccaa tataggagga ggatctaaag tttggtgcta acctgggcta 2941 cacagtgagc ctcaaaaata agatagtttg ttcctttatg tttgtccctg tagagtgctg 3001 cttttatttt ttttttttta aagcatacat gagactgact cttccccagc accaggtaga 3061 ctgggtgtgg tagtatacac ctgttgtctc agaaagtgga ggtgaggaga gctagagttt 3121 caagttggtt attggttgca tattgagtag gtggccagct tgagctgaac agtgatgctt 3181 aaatgatgaa ctgggggtgc atagtccaga tagagacaaa acttcaagag aagcatcttg 3241 gggtgagaaa gaagttgggc tttacaggga tgattggtgt aagatatgat ctaagaagca 3301 gggatggagc ctctggggtc ttaagtttta caaaacagag caagggcttg ggcatgaaaa 3361 ctagaatagg tggctactga gtggaaagat actgcatagg aagccctaag gaagttacat 3421 actggatgta ggagctggga agtaggcatc acagcaaggg gaggacagcc acaggccagg 3481 tgactatccc ctcccccccc cacacacaca catgtacact tttaaaggat ttgtaggcat 3541 ggtggctcac acctttaatc ccagcatctc ttggggggcg tccccccaca aaagtgggaa 3601 aaattacatg cgtatgttgc ctgtgtgcat gcatgcgtgc attatcggtt ccctgtggag 3661 gacagaagag gaccatgaat ctcttggaac tggtgttgct gagccacctc tccaggcctc 3721 agttagcagt tttgaaaggg tttggttttg ttttgttttt tcgagacagg gtttcttctg 3781 tgtagctttg gagcctatcc tggcactggc tttggagacc aggctggcct caaactcaca 3841 gagatccgac tgcctcccga gtgctgggat taaaggtgtg caccaacgca ggatttggtt 3901 ttgttgtgag attgtataat atatgaaaac cgtagcagcc tacaaagtct agaattttgt 3961 gtaatcccct cacagtgtgc agatctttaa ctatagcccc accaccattc agaagtgaat 4021 taaaaccatg tgaacattcc cactgacttg accaacaatc cttcattttt taggaccaag 4081 acccctggac ctggagccca gtcagccctc agagctcttg ctcgttcagg gatgaagatt 4141 gggcggattg gtaagtgacc cttctagcta acattgggtt tattttgaag cattgacccc 4201 ttccaaacta tgtgatgtgc atattggttc aagcttgtct ggtgactggt gcagttaact 4261 tttgttaacc tgtaaaagac attgagagcg ttgatttctg taaatgcccc ttaactgtta 4321 tattttccca tttgcaattg ggaagactta aaaaacactt gtgaatctgt cctagcagcc 4381 tttttccctg ctagctaaat gttaagggta tgcccttggg acagtccaag tgaataattg 4441 cacacacctg gcctaaagag caaacagcca caggtgctca gtgtaggggc tctgagttag 4501 aaacttgaag gagtccttcc cctctgatgt ttgtgtgcct tctcagagcc tgtgttgtcc 4561 tcgtccaatg tgggttgaga tgaggaaatg acaggtgagg tgggcccttg tttattgacc 4621 actggtgata catttgggga agaataaagt ctgctagtta gtctaatggc tagggaggct 4681 gtgtttatgt agctatgggc aatgacattt tttcttcttg cagaggatgt cacccccatc 4741 ccctctgaca gcacccgaag gaagggtggt cgtcgtggtc gccgtctgtg aacaggactt 4801 ctcaaattat tttctgttaa taaattgctt tgtataagct attttggttc tgatgtttgt 4861 ttgttttgag gcaggatctc tctccatgtc tatgtagaag agtctggcct ccaactcaga 4921 tccacatacc tgtctctgcc tcccaagtac tggggataaa ggaatgcacc accacacccc 4981 ggtgacctta ggggccaccc agggcaacac caaggcagtg ttttgagggg atgttaagat 5041 tgcatgtagg agcttctaat gtaggttggg ggttggcaaa cctgggtttg caggctggct 5101 ttctgagaat gggaacctga gagaggagat gctgaggctg ccactctggc attaatttgc 5161 cagccagtat tttcaaagtg gaactgttag tagcccatgc tgtcatggga cagctaaaca 5221 ccacacacac acacacccca agcagggtag tagagatcac aagttggaag ccagtttaat 5281 atggtctact tggggaattc // LOCUS GPIFCGR 1494 bp ss-mRNA ROD 20-JUL-1990 DEFINITION Guinea pig Fc-gamma-1/gamma-2 receptor mRNA, complete cds. ACCESSION M35272 KEYWORDS Fc-gamma receptor; Fc-gamma-1/gamma-2 receptor. SOURCE Guinea pig peritoneal macrophage, cDNA to mRNA, clone D-3. ORGANISM Cavia sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Hystricomorpha; Caviidae; Caviinae. REFERENCE 1 (bases 1 to 1494) AUTHORS Tominaga,M., Sakata,A., Ohmura,T., Yamashita,T., Koyama,J. and Onoue,K. TITLE The structure and expression of the guinea pig Fc receptor for IgG1 and IgG2 (Fc-gamma-1/gamma-2-R) JOURNAL Biochem. Biophys. Res. Commun. 168, 683-689 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 100 972 Fc-gamma-1/gamma-2 receptor precursor sigp 100 147 Fc-gamma-1/gamma-2 receptor signal peptide matp 148 969 Fc-gamma-1/gamma-2 receptor mRNA < 1 1494 Fc-gamma-1/gamma-2 receptor mRNA BASE COUNT 401 a 369 c 348 g 376 t ORIGIN 1 cagcttgctg cagaagtgat ggcgatccct tcgttcttac ctgtccttgg caccaagagt 61 caccgagctg actataagcc cttgcagact ttgagccaca tgctgctgtg gataactgtg 121 ctattcctgg ctcctgttgc tgggacctct gcagaccccc cgaaggctgt ggtgagactg 181 gagcccccat ggatccaggt gctccgggga gaccgcgtga ctctgacctg cgagggtgcc 241 cccagccctg ggaaccactc cacccagtgg ctccacaacg ggagactcat ccctacccag 301 gtcctgccca gctaccggtt cactgccaag ggcaatgaca gcggagagta caggtgccag 361 gtcctgccca gctaccggtt cactgccaag ggcaatgaca gcggagagta caggtgccag 421 gcgggcggga ccagtctcag cgaccctgtg cgtctggacg tgatttctga ctggctggtg 481 ctccagactt ctcaactgat tttccaggag ggggacgtca tcgtgctgcg gtgccacagc 541 tggaataact ggcctttggc caaggtcaca ttctaccaca atggggtagc caagaaatat 601 ttctctatca gtaaaaattt ctccatccca caagcaaacc acagtcacag tggtgcttac 661 aactgcacgg gattaatagg aaggacatct cacacatcac cgcctgtgac catcactgtc 721 caagggccca agtcaagcga ctcttcaatg gtggtgataa ttgtggctgc agtcattggg 781 attgctacag cggccattgt tgttgctgta gtagctatca tctgcctcaa gaaaaagcag 841 cctccagcca atctctctga tcctgaggag gtcgctaaat ctgaggttga aaatacaatc 901 acctattcac ttctcaagca cccggaagct caggatgatg acacagagca tgactatcag 961 aaccacattt aatctccatt atctggccct gggatttggg ggagaaaaat caagaagtga 1021 agatctgcta tctccaggcc taaggttccc ttggagaggt cgagaggatg ctgaagttca 1081 aagaaggagc aggatttttc cagagtcctg tatgtgagtc ctaaagttct ttggcctgac 1141 actaacagaa aatatgaact ctgaaggctg gctgattctg tgcctcagca cttccctaca 1201 tcagggctgt tatacagccc cacagccaac aaaatgataa aattaatatt gctaagagat 1261 tttaacaaca tgtgacatgc ctacattatg ggtaacatga gaaaaattac ataagtatat 1321 atgatttcag aagtgataaa atcaactaac atctaccaac atattaaaaa tgattgtttc 1381 agggtgatag aattatcagt ggtttttgtt ctttcttatt ttcctacaaa tctataagtt 1441 tattttccta taaatcctat aaatcatgta ctgtatttgt aataaaatat tatg // LOCUS NPOCAP 2104 bp ds-DNA VRL 20-JUL-1990 DEFINITION Orgyia pseudotsugata polyhedrosis virus capsid protein (P39) gene, complete cds. ACCESSION M21042 KEYWORDS capsid protein; structural protein. SOURCE Orgyia pseudotsugata polyhedrosis virus DNA, clone cosmid #58. ORGANISM Orgyia pseudotsugata polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 2104) AUTHORS Blissard,G.W., Quant-Russell,R.L., Rohrmann,G.F. and Beaudreau,G.S. TITLE Nucleotide sequence, transcriptional mapping, and temporal expression of the gene encoding p39, a major structural protein of the multicapsid nuclear polyhedrosis virus of Orgyia pseudotsugata JOURNAL Virology 168, 354-362 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by G.W.Blissard, 03-NOV-1988. The p39 protein is found in both the PDV and BV virion phenotypes; P39 mRNAs are approximately 2600 nt in length, are expressed late (24-48 hpi), and have two major transcription initiation sites. At each major transcription start site, two transcripts are initiated within the ATAAG consensus. Transcripts initiate at positions 2 and 3 of the ATAAG consensus. FEATURES from to/span description pept 124 1179 capsid protein mRNA 27 > 1179 capsid protein mRNA (alt.) mRNA 28 > 1179 capsid protein mRNA (alt.) mRNA 75 > 1179 capsid protein mRNA (alt.) mRNA 76 > 1176 capsid protein mRNA (alt.) BASE COUNT 495 a 606 c 586 g 417 t ORIGIN Map position 57-58. 1 ccggcgcagc gcaccgtttc tttatataag aatctaaaat tagatacaag aggtcttggc 61 taaaattgat tgtataagag atttcttgct ctatcaaaac gtcggccccc atcgccgcag 121 aaaatggcgc tcgtgtcacc cggggtgtcg tcgcgacggt cgaccaacca ctgcatcttt 181 ggcgcgattg aaccgttcga ctcgtgcgtc acgtacaggt cgccatgctc gtcggacgcg 241 tcggtggacg acggctggtt catctgcgac taccacctga agttgcgctt caaaatgtcc 301 aaaatggtgc tgcccattta cgacgaagac gacaaccagt acaagcgcac catagcgcgg 361 catttggtgg ggcacaaaga aaggggcgta aagcgcattc tggtgcccac cagagccaac 421 tacatgaccg tgttcaattt gccgggcatg atgctggccg agcagctgat ttttcatttg 481 atctacgaca accgtttgga agtcaatcgc atttgcgcct cgttgaaaaa caacgaaaac 541 ttcattgaca acacgtacag cgttgtggag agcgtgtaca gcgcgacgcg caacattctg 601 tcgctcacag acccgcaagc gtactgctcg cgcgtggcca acgacgacgt gcgctttttc 661 gacgcgaacg tagtcgacaa caactaccag gcgggcaacg gcgacaccgt gtttaacaac 721 atgcccgggt ttttgcgcaa cctaatcagg cgcgccgtgg cgcccgaaac gctgcagatc 781 gacagcgaag acctgcgcct gcgaaactgc aacacctgtg tgattaacaa caccggcttg 841 gtagcaaccg tgacaaacac cgagctgtac aacccggtgc gcagctcgga catcattaaa 901 acggggccca atcgcttgca gattagaaac gttttgaagt ttgagggcga cacacgcgcg 961 ttggagcgca ccttgggccg ctacgaggaa taccccatgt acgtgccctt gtttttgggc 1021 taccagctgg tcaacctgca aaacgacatt ttgcgcgcca acaacttttt acccgcgccg 1081 tttggcgtgc cgcaggccgt aaacaacctg gaggcccaag cgcccgcagc gcccgcaccc 1141 gcagcgcccg cgcccgcgcc cgcagcgccc gttgtttaat tccgttgcca tggacacggt 1201 gaggctgcag tgccacaaat ttgctgctcg gtgggcgaaa tcaaaaatta ttttttgcag 1261 cccgtcgacg caatcaccat tttgcccatt gtggagctgc acacgtgccg gcaccaattg 1321 tgcgtcatgt gcgtgcgcaa aattgcgcag cgcggccgag acaagcgcgt cgagtgcccc 1381 atgtgccgcc gcaaaaacgc gcactttaac gtgtacagcg tgaaccgtaa ttcggtggat 1441 gtgttgcgct gcagcgtggc cgacgtgcgc gagcacgggc gctttggcgg attggcggac 1501 gccgcgtcgc tggctcgtgg gctgtttgag cccagcttgc ttgaagccga gcccgcgccg 1561 gacaactcgt ttggcccaaa cgagttacaa cttgtgctaa aacgtctaaa ggcgcaaatt 1621 gaggcgcaaa caagaactaa ttatgatttg caactgcaag caactgcttt ggagcgaaca 1681 atcgaagaag caaacgaccg cttgggtaaa agccgcggcg actacagcga cgcgtgcaag 1741 ctgatggacg agcttcgcgg cgacaggctg cgcgcggagc gcgcggtcaa ggcgctcgcc 1801 gacgcgcacg cccagtgggc ggacaagaac gccaaaatgc ggcgcgagaa cgacaggttg 1861 acaaatgaaa acattggttt gattcgggac aacaatttat ttaaacaaaa cactgcacgc 1921 aaacgtaaaa tcgctccgta accgcctttg tttgcgcgtc aaagtccacc tcgtagacgt 1981 gtcgctcgtt tacgctgccg cagcggtccc agttgtccgt cgtgcaccac gccgcattgt 2041 gtcggcggca caaatacagc gcttcgttga attgcgccgg cggcagattg aagcggccga 2101 gctc // LOCUS RATPPTA 981 bp ss-mRNA ROD 20-JUL-1990 DEFINITION Rat alpha-tachykinin mRNA, complete cds. ACCESSION M34184 KEYWORDS alpha-tachykinin; substance P. SOURCE Rat (Sprague-Dawley) rostral portion of the caudate putamen complex, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 981) AUTHORS Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D. TITLE Three rat preprotachykinin mRNAs encode the neuropeptides substance P and neurokinin A JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Krause, 11-MAY-1990. FEATURES from to/span description pept 100 438 preprotachykinin sigp 100 156 tachykinin signal peptide (156 could be 171) matp 271 303 substance P mRNA < 1 981 tachykinin mRNA BASE COUNT 271 a 203 c 234 g 273 t ORIGIN 1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggagcgccca gcaagtgcgc 61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg 121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat 181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gccggagccc 241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta 301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt 361 tatgggcatg gtcagatctc tcacaaaatg gcttatgaaa gaagcgcaat gcagaactac 421 gaaagaaggc gtaaataaac cctgtaacgc actatctatt catctccatc tgtgtccgcg 481 agcagtgagc ggtaaaataa aaatgtgcgc tatgaggaat gattatttat ttaatatcaa 541 atgttgttat gagtgaaaaa ctcaaaaaag tgtttatttt ttcatattgt gccaataagc 601 attgtaattc taatgtggtg acctcctcag acagaagtag aaattagttg taacttcagc 661 aaagcacagt gttgatggag ttgtacaagt ttgccagcga tgcaagtctc caaagacaga 721 aaggctgctg tgaggcagtg caggcggctg ctgctggagg cagagaaact cctgtgtgtc 781 ttgcgcttcc cttggttgct tttatcctaa tgatgtactg agagtttggt atctgactct 841 atttgtatcc tagcagcatg tttcctgtgt tgtgactata tagagatgtt tttaaaagtt 901 tcaatgtact tctctggtct tcagtcattg tatgatgtgt tgtgatagct accattttaa 961 ataaaagaat gtatcttcag g // LOCUS RATPPTB 1035 bp ss-mRNA ROD 20-JUL-1990 DEFINITION Rat beta-tachykinin mRNA, complete cds. ACCESSION M15191 KEYWORDS neurokinin A; substance P; tachykinin. SOURCE Rat (Sprague-Dawley) rostral portion of the caudate putamen complex, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1035) AUTHORS Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D. TITLE Three rat preprotachykinin mRNAs encode the neuropeptides substance P and neurokinin A JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987) STANDARD simple staff_review FEATURES from to/span description pept 100 492 beta-preprotachykinin sigp 100 156 beta tachykinin signal peptide (156 could be 171) matp 271 303 substance P matp 391 420 neurokinin A mRNA < 1 1035 beta-preprotachykinin mRNA BASE COUNT 289 a 212 c 245 g 289 t ORIGIN 1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc 61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg 121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat 181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gcccgagccc 241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta 301 atgggcaaac gggatgctga ttcctcaatt gaaaaacaag tggccctgtt aaaggctctt 361 tatgggcatg gtcagatctc tcacaaaagg cataaaacag attcctttgt tggactaatg 421 ggcaaaagag ctttaaattc tgtggcttat gaaagaagcg caatgcagaa ctacgaaaga 481 aggcgtaaat aaaccctgta acgcactatc tattcatctc catctgtgtc cgcgagcagt 541 gagcggtaaa ataaaaatgt gcgctatgag gaatgattat ttatttaata tcaaatgttg 601 ttatgagtga aaaactcaaa aaagtgttta ttttttcata ttgtgccaat aagcattgta 661 attctaatgt ggtgacctcc tcagacagaa gtagaaatta gttgtaactt cagcaaagca 721 cagtgttgat ggagttgtac aagtttgcca gcgatgcaag tctccaaaga cagaaaggct 781 gctgtgaggc agtgcaggcg gctgctgctg gaggcagaga aactcctgtg tgtcttgcgc 841 ttcccttggt tgcttttatc ctaatgatgt actgagagtt tggtatctga ctctatttgt 901 atcctagcag catgtttcct gtgttgtgac tatatagaga tgtttttaaa agtttcaatg 961 tacttctctg gtcttcagtc attgtatgat gtgttgtgat agctaccatt ttaaataaaa 1021 gaatgtatct tcagg // LOCUS RATPPTG 990 bp ss-mRNA ROD 20-JUL-1990 DEFINITION Rat gamma-tachykinin mRNA, complete cds. ACCESSION M34183 KEYWORDS gamma-tachykinin; neurokinin; substance P. SOURCE Rat (Sprague-Dawley) rostral portion of the caudate putamen complex, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 990) AUTHORS Krause,J.E., Chirgwin,J.M., Carter,M.S., Xu,Z.S. and Hershey,A.D. TITLE Three rat preprotachykinin mRNAs encode the neuropeptides substance P and neurokinin A JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 881-885 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Krause, 11-MAY-1990. FEATURES from to/span description pept 100 447 preprotachykinin sigp 100 156 tachykinin signal peptide (156 could be 171) matp 271 303 substance P matp 346 375 neurokinin mRNA < 1 990 tachykinin mRNA BASE COUNT 275 a 202 c 238 g 275 t ORIGIN 1 tcgaccagct ccactccagc accgcggcgg aggagagcga ggacgcccag gcaagtgcgc 61 acctgcggag catcaccggg tccgaccgca aaatccaaca tgaaaatcct cgtggcggtg 121 gcggtctttt ttctcgtttc cactcaactg tttgcagagg aaatcggtgc caacgatgat 181 ctaaattatt ggtccgactg gtccgacagt gaccaaatca aggaggcaat gccggagccc 241 tttgagcatc ttcttcagag aatcgcccga agacccaagc ctcagcagtt ctttggatta 301 atgggcaaac gggatgctgg gcatggtcag atctctcaca aaaggcataa aacagattcc 361 tttgttggac taatgggcaa aagagcttta aattctgtgg cttatgaaag aagcgcaatg 421 cagaactacg aaagaaggcg taaataaacc ctgtaacgca ctatctattc atctccatct 481 gtgtccgcga gcagtgagcg gtaaaataaa aatgtgcgct atgaggaatg attatttatt 541 taatatcaaa tgttgttatg agtgaaaaac tcaaaaaagt gtttattttt tcatattgtg 601 ccaataagca ttgtaattct aatgtggtga cctcctcaga cagaagtaga aattagttgt 661 aacttcagca aagcacagtg ttgatggagt tgtacaagtt tgccagcgat gcaagtctcc 721 aaagacagaa aggctgctgt gaggcagtgc aggcggctgc tgctggaggc agagaaactc 781 ctgtgtgtct tgcgcttccc ttggttgctt ttatcctaat gatgtactga gagtttggta 841 tctgactcta tttgtatcct agcagcatgt ttcctgtgtt gtgactatat agagatgttt 901 ttaaaagttt caatgtactt ctctggtctt cagtcattgt atgatgtgtt gtgatagcta 961 ccattttaaa taaaagaatg tatcttcagg //
GenBank-Updates@genbank.bio.net (07/21/90)
LOCUS BSUGLPKD 4259 bp ds-DNA BCT 21-JUL-1990 DEFINITION B.subtilis glycerol kinase (glpK) and glycerol-3-phosphate dehydrogenase (glpD) genes, complete cds. ACCESSION M34393 KEYWORDS glycerol kinase; glycerol-3- phosphate dehydrogenase. SOURCE B.subtilis (starin W168) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 4259) AUTHORS Holmberg,C., Beijer,L., Rutberg,B. and Rutberg,L. TITLE Glycerol catabolism in Bacillus subtilis: Nucleotide sequence of the genes encoding glycerol kinase (glpK) and glycerol-3- phosphate dehydrogenase (glpD) JOURNAL Unpublished (1990) University of Lund, Dept. of Microbiology, Solvegatan 21, S-223 62 Lund , SWEDEN STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.M.E.Holmberg, 22-MAY-1990. FEATURES from to/span description pept < 1 679 ORF 1 (AA at 2) pept 698 2188 glycerol kinase (glpK) (EC 2.7.1.30) pept 2329 3996 glycerol-3-phosphate dehydrogenase (glpD) (EC 1.1.99.5) pept 4128 > 4259 ORF 4 binding 682 690 ribosome binding site binding 2315 2321 ribosome binding site binding 4116 4121 ribosome binding site signal 2243 2303 transcription termination signal signal 4000 4033 transcription termination signal signal 4067 4072 -35 region signal 4090 4095 -10 region BASE COUNT 1269 a 853 c 1092 g 1045 t ORIGIN 1 ccatggcgga tacgcggttg gcggcatcag cggtgcccat ttgaatccgg cgctaacgat 61 agcgcttgca tttgtaggag attttccttg gaaagaggtt ccggtttata ttgcggcgca 121 aatgatcgga gcgatcatcg gtgcggtgat tatttatctg cattacctcc cgcactggaa 181 gtcaacggat gatcccgctg ccaagctggg tgttttctca acagggccca gcattccgca 241 tacatttgca aacgttttaa gcgaagtgat tgggacattt gtccttgtac ttggaatctt 301 ggccataggt gcaaatcaat ttacagaagg acttaatcct ttaatcgtcg gtttcctcat 361 tgtagcaatc ggtatttctc tgggaggcac caccggctat gctatcaatc ctgcacgtga 421 cttaggtccg cggatcgccc acgcttttct tccgattccg gggaagggct catcaaactg 481 gaaatacgca tgggttccgg tagtcggccc gattttaggc ggatcattcg gcggtgtatt 541 ttacaacgct gcatttaaag ggcacatcac aagcagcttc tggattgtaa gcgttatatt 601 ggttgtggta ttgttaggac tctatgttta tacgaaatca cattctgcta aaacattatc 661 aaattctaaa tatatttaat caaaggggag acatcttatg gaaacgtaca ttttatcctt 721 agatcagggg acgacaagtt caagagcgat tctgtttaat aaagaaggca aaattgtcca 781 ctctgctcaa aaggaattta cacaatactt cccgcatcct ggctgggttg agcataatgc 841 caatgaaatt tggggctctg tcctcgcggt tatcgcctca gtcatctctg aatcaggaat 901 cagcgcttct caaattgccg gcatcggcat cacgaaccag cgcgagacga cggttgtgtg 961 ggataaagat acaggaagtc ctgtctataa tgcaatcgtt tggcagtcca gacagacgtc 1021 cggcatttgt gaggaacttc gtgaaaaagg atataatgat aaattcagag aaaaaacagg 1081 gcttttaatc gatccttact tctccggcac gaaggtgaag tggattttag acaatgtgga 1141 aggcgcaaga gaaaaagcgg aaaaaggcga gctgctgttt ggaacgattg atacgtggct 1201 catttggaaa atgtcaggcg gaaaagcgca tgtgaccgat tactccaatg cctcaagaac 1261 actgatgttt aatatttacg atttaaaatg ggacgatcaa ctgctcgaca ttctaggcgt 1321 accgaaatcc atgctccctg aagtgaagcc gtcctctcat gtgtatgcgg agactgttga 1381 ttatcacttc ttcggaaaaa atatcccgat tgctggagcg gcaggcgacc agcagtccgc 1441 attgttcggc caggcatgct ttgaagaagg catggggaaa aacacttacg gcacaggatg 1501 tttcatgctg atgaataccg gggaaaaagc aattaagtcc gaacatgggc ttttgacaac 1561 aatcgcttgg ggcattgacg gaaaagtgaa ctatgcgtta gaagggagca tttttgtcgc 1621 aggctctgcc atccagtggc ttagagacgg tttgagaatg ttccaggatt catcgctaag 1681 cgaatcttat gcagaaaaag tggattcaac tgacggcgtg tatgttgttc cagcatttgt 1741 cggactggga acgccttact gggacagcga tgtgcgcggt tcggttttcg gcctgacaag 1801 agggacaaca aaagagcact ttatccgtgc gacactggag tcattggctt atcagaccaa 1861 agatgtgctt gacgcaatgg aagcagattc aaacatttca ttaaagacgc tccgtgtaga 1921 cggaggagct gtaaaaaaca atttcctaat gcagttccaa ggagacctgt tgaatgttcc 1981 tgtggagcgc ccggaaatta atgaaacgac tgcacttggc gcggcttatt tggcgggtat 2041 cgctgtggga ttctggaagg accgttctga aatcgcgaac cagtggaatc tggataaacg 2101 gtttgagcct gaattggaag aagaaaaacg aaatgagctg tataaaggct ggcaaaaagc 2161 cgtgaaagca gctatggctt ttaaataaag taatactatg gtataatggt tacaagttaa 2221 taagaacggt cctgagatga ggagagacca cagcaccaaa gtgtaagcat gcactttggc 2281 tgttgtggtc tctttttcta tttaccgtga caacaaggag gaaacgtaat gatgaatcat 2341 caattttcaa gtcttgaaag agatcgcatg ctgacagaca tgacgaaaaa aacatatgac 2401 ctatttatta tcggaggagg aattacagga gccggaacag ctcttgacgc ggcatcaagg 2461 ggaatgaagg tcgcactaag cgaaatgcag gactttgcgg cgggaacatc aagccgatca 2521 acgaagctgg tacatggcgg cttgcgctat ttaaaacaat ttgaagtgaa aatggtcgct 2581 gaggtaggaa aagagcgggc gattgtgtat gaaaacggcc cgcatgtcac tacaccggaa 2641 tggatgctgc ttccgtttca taaaggcggc acattcggtt catttacaac atctattggg 2701 ttaagggttt atgacttcct tgcaggtgtg aaaaagtcag aacgaagaag catgctttca 2761 gcaaaagaaa cgctgcaaaa agagcctttg gtgaaaaaag acggcttaaa aggcggcggg 2821 tactatgtgg aataccgcac tgacgatgcg agactgacca tcgaagtcat gaaggaagcg 2881 gttaaattcg gggcagagcc tgtgaattac tccaaagtga aggagcttct ttacgaaaaa 2941 ggcaaagccg tcggcgtatt aattgaagat gtgctgacaa agaaagaata taaagtgtat 3001 gcgaaaaaaa ttgtcaatgc tacaggccct tgggtcgatc agctcagaga aaaagaccat 3061 tcgaaaaacg gaaagcattt gcagcataca aaaggcattc accttgtatt tgaccagtct 3121 gtctttccgc tgaaacaggc tgtatatttc gatacacctg atggccggat ggtatttgcg 3181 attcctcgtg aaggcaaaac atacgtggga acaacagaca ctgtttacaa agaggcgctg 3241 gagcatccgc ggatgacaac ggaggatcgt gattatgtca tcaaatcaat caattacatg 3301 ttcccggaac tgaatatcac tgcgaatgac atcgaatcca gctgggcggg gctgcgtccg 3361 ctgattcatg aagaaggcaa agatccttct gaaatttcac ggaaagacga gatttggaca 3421 tctgactcag gcctgatcac cattgccggc ggaaaactga ctggatacag aaaaatggcg 3481 gacgacatcg ttgatcttgt ccgtgatcgc ttaaaagaag agggcgaaaa ggattttgga 3541 ccatgtaaaa cgaaaaacat gccaatctct ggcgggcacg tcggcggttc gaaaaatctt 3601 atgtccttcg ttaccgcgaa aacaaaagaa ggaattgcag ccggtttatc agaaaaagac 3661 gcaaaacagc ttgcgatcag atacggctct aacgtagatc gcgtctttga ccgggtagaa 3721 gcgctgaaag atgaggccgc gaaacgcaac attccggttc atattcttgc tgaggcagaa 3781 tacagtatag aagaagagat gactgcaacc cctgctgact tctttgtccg cagaacggga 3841 cgtttatttt ttgatatcaa ttgggtaaga acatataaag atgccgttat tgattttatg 3901 agcgagcgat tccaatggga tgagcaggcg aaaaacaaac atacagaaaa cctcaacaag 3961 cttttacacg atgcggtcgt accgcttgag caataaatca taacgggctg tctgcagccc 4021 gttatttctt tttacgtgcc gaaaggggga gatctcaggt tatcaagtga cagaggcacc 4081 cgcttcatat aaaataatgg cataagctga tacataggag gacgaatatg acttggagaa 4141 agagctatga acgctggaaa cagacagaac atttagatct ggaattaaaa gagcgcctta 4201 ttgaattaga gggagatgaa caggcccttg aggactgttt ctataaagac cttgaattc // LOCUS CLCMTPLAS 379 bp ds-DNA PLN 21-JUL-1990 DEFINITION C.purpurea mitochondrial plasmid terminal 5' inverted repeat (TIR). ACCESSION M35177 KEYWORDS . SOURCE C.purpurea (strain K) mitochondrial plasmid pClK1 DNA. ORGANISM Claviceps purpurea Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Clavicipitales; Clavicipitaceae. REFERENCE 1 (bases 1 to 379) AUTHORS Duevell,A., Hessberg-Stutzke,H., Oeser,B., Rogmann-Backwinkel,P. and Tudzynski,P. TITLE Structural and functional analysis of mitochondrial plasmids in Claviceps purpurea JOURNAL Mol. Gen. Genet. 214, 128-134 (1988) STANDARD simple staff_review BASE COUNT 97 a 76 c 64 g 142 t ORIGIN 1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt 61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact 121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa 181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac 241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta 301 aactttccgc taaaacccgc acgtcttcgt ctgcatggta tgggcgttag attctattct 361 acaatgtctt ccgccaaaa // LOCUS CLCMTPLASX 379 bp ds-DNA PLN 21-JUL-1990 DEFINITION C.purpurea mitochondrial plasmid 3' terminal inverted repeat (TIR). ACCESSION M35178 KEYWORDS . SOURCE C.purpurea (strain K) mitochondrial plasmid pClK1 DNA. ORGANISM Claviceps purpurea Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Clavicipitales; Clavicipitaceae. REFERENCE 1 (bases 1 to 379) AUTHORS Duevell,A., Hessberg-Stutzke,H., Oeser,B., Rogmann-Backwinkel,P. and Tudzynski,P. TITLE Structural and functional analysis of mitochondrial plasmids in Claviceps purpurea JOURNAL Mol. Gen. Genet. 214, 128-134 (1988) STANDARD simple staff_review BASE COUNT 105 a 75 c 62 g 137 t ORIGIN 1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt 61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact 121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa 181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac 241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta 301 aactttccgc taaaacccgc acgtcttact ttagctcaac aaagaaagcg tagtgaactt 361 aaaaaagcac gtcttgctc // LOCUS DDISP60 328 bp ds-DNA INV 21-JUL-1990 DEFINITION D.discoideum spore coat 60 (sp60) gene, 5' flank. ACCESSION M34546 KEYWORDS spore coat 60. SOURCE D.discoideum (strain Ax4) DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 328) AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F. TITLE Coordinate regulation of the spore coat genes in Dictyosteiium discoideum JOURNAL Dev. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Fosnaugh, 24-MAY-1990. BASE COUNT 168 a 15 c 11 g 134 t ORIGIN 1 aaaacataga acaaatttaa ataattattt tttattaaat tgtattattt ttttttatta 61 tttatttatt ttttattttc aaaaaaaaaa aaaaaaagtt aacagacaaa aagtataatc 121 tatttaatta ttcaaaaaaa aaaatattaa atcattgtag tattttgttc atattcaaat 181 taaatatcaa tacaattaaa aaaagttaaa atgtaattat taatatatat atatatttat 241 caatatataa tttaatataa aaaaaaaaaa gaaatttaat aaacataaat atttataaaa 301 tcataaattt aaaataaata attttttt // LOCUS DDISP70 563 bp ds-DNA INV 21-JUL-1990 DEFINITION D.discoideum spore coat 70 (sp70) gene, 5' flank. ACCESSION M34547 KEYWORDS spore coat 70. SOURCE D.discoideum (strain Ax4) DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 563) AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F. TITLE Coordinate regulation of the spore coat genes in Dictyosteiium discoideum JOURNAL Dev. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by K.Fosnaugh, 24-MAY-1990. BASE COUNT 207 a 73 c 47 g 236 t ORIGIN 1 gaagtgtgag ttgcgcagtg tgttagtgtg ggtttaatga gagtgttttt tttttttttt 61 tttttttttt tgaacgtaca tatttatttg tttttattaa tatttcataa atgaatttgt 121 aaatataaaa ataaattgtt tttttttatt ttattttttt gattatgaca ataatttcaa 181 taaaatgtaa aatatgataa aaataatttt aattattgaa acacacacac tcactcactt 241 tttcccacta gtattacact ttccaatgaa aaataaatta atattgaaaa agaaggaaat 301 ggaaaaaaat gaaaaataaa aataattaaa taaacaataa tattacaaac acactatttc 361 tttctcaccc acacaaccaa tatactaaca cttctcatac acccaccaac acacctacac 421 ccagttcttc acattcatac acaaagttta atattacact ttaaaaaaca ttcatattaa 481 tgatttttat gatttattta tttattaatt tatttaattg tgtattttga ttattgctct 541 ctacttttta tatatttttt ttc // LOCUS DDISP96 682 bp ds-DNA INV 21-JUL-1990 DEFINITION D.discoideum spore coat 96 (sp96) gene, 5' flank. ACCESSION M34548 KEYWORDS spore coat 96. SOURCE D.discoideum (strain Ax4) DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 682) AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F. TITLE Coordinate regulation of the spore coat genes in Dictyosteiium discoideum JOURNAL Dev. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by K.Fosnaugh, 24-MAY-1990. BASE COUNT 321 a 37 c 63 g 261 t ORIGIN 1 tgaaaaaaaa aaaattttgt gtgtgatagt ttgtcaaaaa tcatttgaaa ttaaaatcat 61 attataatat atattaaaaa aatgttgtat ctttaatttt ttttcatact agtgtgttaa 121 atgaattaaa gtgtggtaat gaataaatga aaaaaaaaaa aaaattaaag tgagagagtg 181 tgtaaaagtg gtaatgaatg tgtgtttgag tgtgttaaaa atgtgtttac aaaatataat 241 aatttaaatt cattcatatt caactatgga gaaagcgtgt tttcaaacac aataaatgtg 301 taatgaaaaa tacaattaaa taattattta attttttgta tgattaaaaa aaaaagagta 361 aattagtagt gccaccctaa aacaataata ataataatta aataatatta atatttattt 421 aattttttta tttaattttt tttcaaaaca attcataata taaatttatt cttttttttt 481 tttttttttt tttttaaaaa tattcaactc agtaataata ataataaaaa taataacaat 541 aataaggaat tataatcaaa taataatata gctataatct ataattacta tagtaaaatt 601 acgtatacat atataaataa ttattattat cattgttttt aaaaataaat aaataaataa 661 taaaaaaaaa aaaaaaaaaa aa // LOCUS GLAA2GIA 1870 bp ds-DNA INV 21-JUL-1990 DEFINITION Giardia lamblia alpha-2 giardin gene, complete cds. ACCESSION M34550 KEYWORDS alpha-2 giardin. SOURCE G.lamblia (strain Portland-1) trophozoite DNA, clone pG2. ORGANISM Giardia lamblia Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae. REFERENCE 1 (bases 1 to 1213) AUTHORS Alonso,R.A. and Peattie,D.A. TITLE Genomic organization of alpha giardin genes in Giardia lamblia JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1214 to 1870) AUTHORS Alonso,R.A. and Peattie,D.A. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.A.Alonso, 25-MAY-1990. Author address: R.A.Alonso Dept of Tropical Public Health Harvard School of Public Health 665 Huntington Avenue, Boston, MA 02115 FEATURES from to/span description pept 215 1105 alpha-2 giardin BASE COUNT 466 a 468 c 507 g 429 t ORIGIN 1 atactgcaga tacacctcat ggtgtacatg ctggcggcaa gtatcgtata aactctatgg 61 agacatgggg cagaccttgt ctcgctgctt tgacgagcga atggccccta ttttgcgcac 121 gctatggcgc gtcacagatg cataggataa gcgaactcat gatggaaatt caaattacct 181 taaaatatat tttctgagcg cttacattta gaaaatgccg aagctatccc agatcgtcgc 241 tgacatgaag caggccatcg atgccaagga cgaggcccag atcgcgttca tagcgtcgga 301 atactctgca gatgcgcgtc agaggattgc gcaggggtac agggaccaat acggcaagga 361 gctccccgac gacatcaaga aggcgctcaa gggcggctcc gaggagagcc tcctgatgga 421 tctcttcagc gatcgtcacg aggtccgcgc ccagcacatc cgcgacgcgc tttccggcaa 481 gaacgaccac atggccttct tcgacacagt cattctgtgc acccccgaag actggcacga 541 gaccgtcgcc gcctacacga ggatgttcaa gaagcccctc gtcgaggact tcatgaagga 601 tgtcggccgc aaggagaatt ggtgtctttt catggagaag tggatggctc acgagcgcac 661 tagccgtgaa ggcagccctg acgaggaagc agagaaactg aacaaggcgt tcagtgagag 721 cgaccacgac tacatttcca gcttcatggc tggggtgcct cccgaagaat acaagtccat 781 caacaccagc ttcaagagcc ttacaggaaa gggcatcgac caggccttcg ccaccatcta 841 tacggggact gactactact ccctgtactg cgcccacttt gctcttcttg gcatgcacaa 901 gcttgccgcc tacctggtca actgcgcctg caacgataag ggcgacgaga agcgcatgag 961 gaggatcacg ggcatgatgg tcgacaagtg cttggctgca aagtatgcct acaagaccta 1021 cgggagtatg aaagccgacg tcgaaaggtg ctttgataag cgcatggctc cgatcctctg 1081 taccctctgg cgcctaaggg agtgagcgcc tttactgcgg ggtttccttt cgctagtgaa 1141 ttgcctagcg ggtgtttact gccttctgtt tctttttgcc ctgtcctggc ggtccaagat 1201 gctatcctgc gatgagaggt ctttcgagaa cggatgcgag aagtctggga ttgctacatg 1261 ccgtccacct gactgtgagg gtgagtagga ctggagtagc ggcaaggagt gggagctcat 1321 gttgtccata gagctgtatg cctgatcaat ctgcccattt cctgcttgtc ctctgcgggt 1381 tttttgtttc agatctgagc tgcctaaccg agaggggccg tactctatgc ttagttctct 1441 attaaccaac tgaacagcct ttgctcgctt gcttgccagt gtatatttat atctatattt 1501 caagaacaga gagaggaccg tgatgaagag gagcaggatg agtgtcgcaa cggaccagaa 1561 cacgacaata agggctatag acatcttaga atgataacca acaaagctta tgaactcctc 1621 tccactgaaa gaagaggaaa gataccgagt ctcgtaaaac tgaatggtag tcctatcgat 1681 ggtgtgcccc cagatagggt ctactatcgg ctggttggta agattgcttg cagcggtatc 1741 gacagctgcc acaaccacta agtagtcaag gctcgtgctt gtacgaagga aggtcaacgg 1801 gaatgcgaag catggaccag gtgttatttg cttgctcgat aaactcagga gtatatattt 1861 cagtattgtc // LOCUS KAEHUTC 1625 bp ds-DNA BCT 21-JUL-1990 DEFINITION K.aerogenes histidine utilization repressor C (hutC) gene, complete cds. ACCESSION M34604 KEYWORDS histidine utilization repressor C. SOURCE K.aerogenes (strain W-70) DNA. ORGANISM Klebsiella aerogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1625) AUTHORS Schwacha,A. and Bender,R.A. TITLE Nucleotide sequence of the gene encoding the repressor for histidine utilization genes of Klebsiella aerogenes JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.A.Bender, 29-MAY-1990. FEATURES from to/span description pept < 1 669 histidine utilization repressor G (AA at 1) pept 731 1456 histidine utilization repressor C (hutC) pept 1599 > 1625 histidine utilization repressor U signal 667 686 hutC promoter signal 1522 1493 Pc promoter signal 1531 1557 Puh promoter binding 1475 1495 CRP binding site site 34 61 helix-turn-helix BASE COUNT 311 a 476 c 508 g 330 t ORIGIN 1 gtcgacggcg agcagttaga agccgcgcat caggcgctgc gcgaggcggt ggccgactgt 61 cagcgggcgg gtaaacgcac gctggtcctg ggcggcggcc acgagacggc cttcggccac 121 ggcgctgggg tactggatgc cttcccaggc gaaaaagtgg gcattattaa tctcgatgcg 181 catctggatc tgcgctttgc cgactgcgcc agctccggga cgccgttccg ccagttggcg 241 ctggagtgtg acgcgcagca gcgcggtttt cactatacct gcatcggggt tagccgggcg 301 gcgaacaccc aggcgctatg ggatgaagcg gcgcgccgcc aggtggctat cgttgaagat 361 ctggaggtgc tgacagcctt cgaaactcgc gtgttgccgg agcttgagcg caatatcgcg 421 caattcgacc gcctgtatct gactatcgat ctcgacgtgc tgccggcgcg agaaatgccg 481 gcggtgtcgg ccccggcggc gctgggcgtg ccgctgggca cgctgctgcg catcgttgag 541 ccgctgtgcc gcagcggtaa gctgcaggcg gtggatctgg tggagtttaa cccgctgttt 601 gacattgacg gtcagggcgc tcgcgcggcg gcccgtgtgg catggcaaat cgcccattgg 661 tggcgctagc gatccgcgct attattttcg tttcgccgct ccgcttatta cgcataagga 721 aagccaggcc atgtttgcac aacaacctcg ttccgcgccc gcgcctttct atgaaaaggt 781 gaagcaggcg atcagcgaaa agatccacag cggcgtctgg cgaccgcatg accgtatccc 841 gtcggaagct gagctggtgg cccagttcgg cttcagccgg atgaccatca atcgcgcgct 901 gcgcgagctg acggacgaag gcctgctggt caggttgcaa ggggtgggaa cctttgtcgc 961 cgagccgaaa gggcagtcgg cgctgttcga ggtgcgcagt attgccgcag aaattgtcgc 1021 ccgtcaccat cagcaccgct gtgaggtact cctgcttgag gagacccggg cggatcatat 1081 tcaggcgacg gcgctcagcg tcccggaagg cacccgcatc tttcactcgc tgatggtgca 1141 ttacgaaaac gaggtgccgg tgcaaatcga agatcgctgc gtcaatgccg cagtggtacc 1201 ggactatctg catcaggact acaccgccac cacgccccat gattacctgt cgctgatcgc 1261 gccgttaacc gaaggtgaac atattgttga agcggtgcag gccacggcgg aagagtgcgc 1321 gctgctgcat attcacgccc acgatccgtg cctgctgatc cgtcgtcgca cctggtcgac 1381 aacccacatc gtctcccacg cgcgcctgct gttccccggt agccgatatc gtctgcaggg 1441 gcgctttggc tcctgatcca cgctggcgtc agcaaaacgt gattgctgac gcaatataac 1501 aaaattgtat catttctgtt aaatcctggc ttgcgcatgc ttgtatagac aagtatatgt 1561 atctacgtaa acaacgttta ttgtcaggag aaaccccgat gtcgcaaagc aaatatcgcc 1621 agctg // LOCUS MUSMHH2KBM 66 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse major histocompatibility complex antigen H-2Kbm29 mRNA. ACCESSION M34542 KEYWORDS major histocompatibility complex antigen. SOURCE Mouse ((C3H.CAS3(R4)XB6)F2, mutant #6414(bm29)), liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 66) AUTHORS Horton,R.M., Loveland,B.E., Parwani,A., Pease,L.R. and Fischer Lindahl,K. TITLE Immunological characterization and molecular analysis of the H-2Kbm29 mutant JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Weiss,E., Golden,L., Zakut,R., Mellor,A., Fahrner,K., Kvist,R. and Flavell,R.A. TITLE The DNA sequence of the H-2Kb gene: Evidence for gene conversion as a mechanism for the generation of polymorphism in histocompatibility antigens JOURNAL EMBO J. 2, 453-462 (1983) STANDARD full automatic COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.M.Horton, 24-MAY-1990. Mutation in codon 89 changes bp 43 and 44 to "gc", otherwise this sequence is identical to reference 2. FEATURES from to/span description pept < 1 > 66 MHC H-2Kbm29 (AA at 3) BASE COUNT 14 a 22 c 19 g 11 t ORIGIN 1 cgagtggacc tgaggaccct gctcggctac tacaaccaga gcgcgggcgg ctctcacact 61 attcag // LOCUS YSCKGD2 2112 bp ds-DNA PLN 21-JUL-1990 DEFINITION S.cerevisiae dihydrolipoyl transsuccinylase (KGD2) gene, complete cds. ACCESSION M34531 KEYWORDS alphaketogluterate dehydrogenase complex; dihydrolipoyl transsuccinylase. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2112) AUTHORS Repetto,B. and Tzagoloff,A. TITLE Structure and regulation of KGD2, the structural gene for yeast dihydrolipoyl transsuccinylase JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Tzagoloff, 29-MAY-1990. FEATURES from to/span description pept 560 1987 dihydrolipoyl transsuccinylase mRNA 306 > 1987 dihydrolipoyl transsuccinylase mRNA mRNA 451 > 1987 dihydrolipoyl transsuccinylase mRNA (alt.) signal 196 200 CAAT box signal 228 232 CAAT box BASE COUNT 649 a 435 c 425 g 603 t ORIGIN 1 tcgaacgtct cgtggaaagc cttgaccgtt gcgtctttac tcgtctctat tcaaccctgt 61 ggtcctatta gaaccgatct ttgttccatc gtcattacta cttcttgtct cttctctctt 121 ggggaataat aaacaaaaca attccagcca cccgcatctc cgtcaaagac gatattaaca 181 accctttttt tgcagccaat caaaactaag gtcctgcccg gagtgagcca ataaagcggc 241 cacaaattac gcaggaatct atggttccta aagcttattc ttattctaca ttgacaccat 301 tctactacag ggctcggatc cacacttggt ctatcagcaa atgcaaatcc tttttccctc 361 aactaggcct acttttcata tataatctct ttttgagctt caggggagga aatatctaac 421 gtaaaaaaaa gaaaaatccg aaataaaaaa ttaggcagct atctttaacg gtatatatca 481 gtgaatagat atattgttat aaacttcact accacatttg ttacaaccaa agacacaact 541 tcagataatt atttaaacaa tgctttccag agcgacgcgt actgcagctg ccaaatcctt 601 agtaaaatct aaagtggcta gaaatgttat ggctgcttct ttcgtcaaga gacatgcttc 661 tacaagtttg ttcaaacaag ctaacaaggt cgaatcctta ggttcaatat atttatccgg 721 caagaaaatt tcagttgcgg cgaatccgtt ctccataact agcaatcgtt ttaaatctac 781 ctctattgaa gttcctccga tggcagagtc cctgactgaa ggctctttaa aggaatatac 841 taaaaacgtt ggtgatttta ttaaggagga cgagctgttg gccactattg agaccgataa 901 aattgatatt gaggtcaatt cgccagtatc aggtactgtt acgaagctaa atttcaaacc 961 agaggacact gtcactgttg gtgaggagtt agctcaggtc gagcctggtg aagcacctgc 1021 tgagggttct ggagaatcta agccagagcc taccgaacaa gcggagcatc gcaaggtgtc 1081 gccgcaaggg aaaactcaag tgaggaaacg gcttcaaaga aagaagctgc tccaaagaaa 1141 gaagccgctc caaagaaaga agttacagaa ccaaaaaagg actgatcaac caaagaagac 1201 cgtctctaag gcgcaggaac ccccagtagc ctctaactct ttcacaccat ttccacgtac 1261 agaaaccagg gtcaaaatga accgtatgag attgaggatt gccgaaagat taaaagagtc 1321 tcaaaacact gctgcttcct taaccacatt caacgaagtt gacatgtcag ctttgatgga 1381 aatgaggaaa ctgtataaag atgagattat taagaagacc ggtactaaat tcggattcat 1441 gggtcttttc tccaaagcat gtaccttggc cgccaaggat attccagccg tcaatggtgc 1501 cattgaaggt gaccagattg tttatcgtga ttacacagat atttctgttg ctgtggccac 1561 tccaaagggt ttggttaccc ccgtcgttcg taatgcagag tcattgagtg ttttagatat 1621 tgagaacgaa attgttcgct tgagtcataa agcgcgtgat ggcaaattaa ccctagaaga 1681 tatgacgggt ggtactttca ccatatctaa tggtggtgtt tttggttcat tatacggtac 1741 tcctatcatc aattcaccac aaacagccgt cctaggcttg catggtgtca aagagagacc 1801 tgtcactgtt aatggacaaa ttgtctcaag accaatgatg tacttggctt tgacttatga 1861 tcatagattg ctagatggtg agaagctgtt atccttcttg aagactgtta aagagttgat 1921 tgaagaccct agaaaatgtt gttatggtga tttgaaattt gcagcccata ccaatctgat 1981 ttcatgatta ttcttgtccg ctattactgt gtgaatctct tctatttaag tatgatattt 2041 aaagatatgc tatgatgata aatggaacta tgaattaatg ttttcttttt ataatgagat 2101 atggaaaagc gc // LOCUS PHUDLTN 781 bp ss-mRNA VRT 21-JUL-1990 DEFINITION P.bicolor [D-ala-2]deltorphin mRNA, complete cds. ACCESSION M34560 KEYWORDS [D-ala-2]deltorphin. SOURCE P.bicolor bicolor skin, cDNA to mRNA, clone AD2. ORGANISM Phyllomedusa bicolor Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Neobatrachia; Bufonoidea; Phyllomedusidae. REFERENCE 1 (bases 1 to 781) AUTHORS Richter,K., Egger,R., Negri,L., Corsi,R., Sevrini,C. and Kreil,G. TITLE cDNA encoding [D-ala-2]deltorphin precursors from skin of Phyllomedusa bicolor also contain genetic information for three dermorphin-related opoid peptides JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4836-4839 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Kreil, 18-MAY-1990. FEATURES from to/span description pept 30 713 pre pro[D-ala-2]deltorphin matp 348 368 [D-ala-2]deltorphin II matp 456 476 [D-ala-2]deltorphin I matp 564 584 [D-ala-2]deltorphin I matp 672 692 [D-ala-2]deltorphin I mRNA < 1 781 [D-ala-2]deltorphin mRNA BASE COUNT 323 a 69 c 233 g 156 t ORIGIN 1 tcagtacttc ctgaataaca agacccaaca tgtctttctt gaagaaatcg cttctgcttg 61 tacttttcct tggattagtg tcccattccg tttgtaaaga agagaaaaga gagactgaag 121 aggagaatga aaatgaggaa gaaaatcatg aagtgggaag tgagatgaag agatatgcgt 181 tctggtatcc gaatagagac actgaagaga agaatgaaaa tgaggaagaa aatcaggaag 241 agggaagtga gatgaagaga tatgcgttcg gctatccgaa aagagagcct gaagaggaaa 301 atgagaatga ggaagaaaat catgaagagg gaagtgagat gaagagatat gcgtttgaag 361 ttgtgggagg agaagctaag aaaatgaaaa gagaacctga agaggaaaat gagaatgagg 421 aagaaaatca tgaagaggga agtgagatga agagatatgc gtttgacgtt gtgggaggag 481 aagctaagaa aatgaaaaga gagcctgaag aggaaaatga gaatgaggaa gaaaatcatg 541 aagagggaag tgagatgaag agatatgcgt ttgacgttgt gggaggagaa gctaagaaaa 601 tgaaaagaga gcctgaagag gaaaatgaga atgaggaaga aaatcatgaa gagggaagtg 661 agatgaagag atatgcgttt gacgttgtgg gaggagaagc taagaaaatg taatatttca 721 taacttaaag gagcacaatt atcagttata tgccaaacat atattaaatg atagataact 781 t // LOCUS HSBGPG3A 1829 bp ds-DNA VRL 21-JUL-1990 DEFINITION Bovine herpesvirus type 1 glycoprotein gIII gene, complete cds. ACCESSION M27491 KEYWORDS glycoprotein. SOURCE Bovine herpesvirus type 1 DNA, clones pDR[7,123] and p7-123. ORGANISM Bovine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae. REFERENCE 1 (bases 1 to 1829) AUTHORS Fitzpatrick,D.R., Babiuk,L.A. and Zamb,T.J. TITLE Nucleotide sequence of bovine herpesvirus type 1 glycoprotein gIII, a structural model for gIII as a new member of the immunoglobulin superfamily, and implications for the homologous glycoproteins of other herpesviruses JOURNAL Virology 173, 46-57 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.J/Zamb, 31-AUG-1989. FEATURES from to/span description pept 160 1725 glycoprotein gIII precursor sigp 178 222 put. glycoprotein gIII signal peptide matp 223 1723 glycoprotein gIII signal 25 28 CAAT box signal 61 64 TATA box BASE COUNT 247 a 737 c 597 g 248 t ORIGIN 97 bp upstream of BamHI site; between map units 0.122 and 0.135. 1 cgcgcctgca gccgcgcgtg tgctcaatcc cggaccacga aagcacaaaa cggacgccct 61 taaaaatgta gcccgcgccg cggtcgcggc catcttggat ccacccgcgc gcacgaccgc 121 cgagagaccg ccagcccgag acctcgccgc gcgtccgcca tgggcccgct ggggcgagcg 181 tggctgatcg cagctatttt cgcctgggcg ctcctgtctg cccggcgggg gctcgccgag 241 gaggcggaag cctcgccctc gcctccgccc tccccgtgcc caaccgagac ggaaagctcc 301 gctgggacca ccggcgcaac gccccccacg cccaacagcc ccgacgctac gccagaggac 361 agcacgcccg gtgctactac gcccgtgggg acgccggagc cgccgtccgt gtccgagcac 421 gacccgcccg ttaccaacag cacgccgccg cccgccccgc ccgaggacgg gcgacccggc 481 ggcgctggca acgcgagccg cgatgggcga cctagcggcg gggggcggcc tcgccccccg 541 cggccgagca aagccccgcc gaaggagcgc aagtggatgc tctgcgagcg cgaggccgtg 601 gccgcctcgt acgccgagcc gctgtacgtg cactgcggcg tggccgacaa cgccactggc 661 ggtgcgcgcc tggagctctg gtttcagcgc gtgggcaggt tccgctccac gcgcggcgac 721 gacgaggccg tgcgcaaccc ctttccgcgg gccccgcccg tgctgctgtt cgtagcccag 781 aacggctcga tcgcgtaccg tagcgcggag ctgggcgaca actatatttt cccttcgccc 841 gccgaccccc gcaacttgcc cctgaccgtg cgctccctga cggccgccac cgagggcgtg 901 tacacttggc gccgcgacat gggcaccaag tcacagcgca aggtcgtgac cgtcacgacg 961 caccgcgcgc ccgctgtttc cgtcgaaccc cagccagcgc tagaaggcgc cggctacgcg 1021 gccgtgtgcc gcgccgccga gtactacccg ccgcgctcca cgcgcctgca ctggttccgc 1081 aacggctacc ccgtggaggc tcggcacgcg cgcgacgtct ttacggtcga cgactccggg 1141 ctcttttcgc gcacgtccgt ccttacgctc gaggacgcga cgccaaccgc ccacccgccc 1201 aacctgcgct gcgacgtctc ctggttccag agcgctaaca tggagcgccg cttttacgcg 1261 gctggcacgc cggccgttta ccgcccgccc gagctgcgcg tgtacttcga gggcggcgag 1321 gccgtctgcg aggcgcgctg cgtccccgag gggcgcgtct ccctgcgctg gacggtgcgc 1381 gacggcatcg ccccgtcgcg cactgagcag accggcgtct gcgccgagcg gcccgggctg 1441 gtaaacctgc gcggcgtgcg cctgctttct acaaccgacg ggcccgtcga ctacacctgc 1501 accgccactg gctacccggc accgctgccc gagttctccg cgaccgccac gtacgacgcc 1561 tcgcccggcc taatcggaag ccccgtcctc gtcagcgtcg tggccgtcgc ctgcggtctc 1621 ggcgccgtgg ggctcctgct ggtggcggcc tcgtgcctgc ggcgcaaggc ccgggtaatc 1681 caacccggtc ttactcgcgc tcgcgccctc ggctccgcgc cctagacgac cggcacggcc 1741 tggaggcgct ggcggctgcc ggtgccgctc acaccgcgcg ccacaaccgc gacgtgtggc 1801 agcgcttttc ccgcgtctgc gaggccggc // LOCUS DOGCHAMC 933 bp ss-mRNA MAM 21-JUL-1990 DEFINITION Dog mast cell chymase mRNA, complete cds. ACCESSION J02904 KEYWORDS chymase. SOURCE Dog skin mastocytoma cell line BR, cDNA to mRNA. ORGANISM Canis sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 933) AUTHORS Caughey,G.H., Raymond,W.W. and Vanderslice,P. TITLE Dog mast cell chymase: Molecular cloning and characterization JOURNAL Biochemistry 29, 5166-5171 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.W.Raymond, 17-MAY-1990. FEATURES from to/span description pept 10 759 mast cell chymase precursor sigp 10 72 mast cell chymase signal peptide matp 73 756 mast cell chymase mRNA < 1 933 mast cell chymase mRNA signal 916 921 polyA signal BASE COUNT 233 a 272 c 229 g 199 t ORIGIN 1 cctgagaaga tgcattgtct tcctctcacc ctgctgctcc ttctcctatg ttccagagca 61 gaagctgagg agatcatcgg gggcacagag tccaagccac actcccgccc ttacatggcc 121 cacctggaaa tcctcactct ccggaatcac ctggcgtctt gtggtggttt cctgataagg 181 cggaacttcg tactgacagc tgctcactgt gcaggaaggt tcataatggt caccctggga 241 gcccataaca tacaaaagaa agaagacacg tggcagaagc ttgaggtcat aaaacaattc 301 cctcacccaa aatatgatga ccttactctt cgccatgaca tcatgctact gaagttgaag 361 gagaaggcca acctgaccct ggccgtgggg accctccccc tctcacccca gttcaacttc 421 gtcccacctg ggagaatgtg ccgggtggct ggctggggaa aaagacaagt caatggatca 481 ggctctgaca ctctgcaaga ggtgaagctg aggctcatgg acccccaggc ctgtagacac 541 tacatggctt ttgaccacaa cctccagctg tgtgttggca atcccaggaa gacaaaatct 601 gcatttaagg gagattcagg gggccctctt ctgtgtgctg gggtagccca gggaattgtg 661 tcctatgggc agaatgatgc aaagccccct gctgtcttca cccgaatctc ccactaccgg 721 ccctggatca ataaggttct gaagcagaat aaagcctgaa tggagcctgg gcaagcctga 781 ggggaaatct ggaaccagac ctgagcaggc tctctgccac ttactctgga gctgcctcta 841 gtctctactg aggccccact acatccctca gacccaagag ggctcctcca ggtcacagag 901 tcctcagtaa acctcaataa agatccagct tcc // LOCUS MUSIGHACQ 340 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds. ACCESSION M34732 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H210-56, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 340) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 340 Ig H-chain VDJ-region (AA at 1) recomb 291 292 V-region end/D-region start recomb 298 299 D-region end/J-region start BASE COUNT 80 a 79 c 93 g 88 t ORIGIN 1 gtggagtctg gaggaggctt ggtacagcct gggggttctc tgagactctc ctgtgcaact 61 tctgggttca ccttcactga ttactacatg agctgggtcc gccagcctcc aggaaaggca 121 cttgagtggt tgggttttat tagaaacaaa gctaatggtt acacaacaga gtacagtgca 181 tctgtgaagg gtcggttcac catctccaga gataattccc aaagcatcct ctatcttcaa 241 atgaacaccc tgagagctga ggacagtgcc acttattact gtgcaagaga tagcggcggg 301 tttgcttact ggggccaagg gactctggtc actgtctctg // LOCUS MUSIGHACR 286 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds. ACCESSION M34733 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-17, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 286) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 286 Ig H-chain VDJ-region (AA at 1) recomb 234 235 V-region end/D-region start recomb 238 239 D-region end/J-region start BASE COUNT 74 a 70 c 68 g 74 t ORIGIN 1 acttctgggt tcaccttcac tgattactac atgagctggg tccgccagcc tccaggaaag 61 gcacttgagt ggttgggttt tattagaaac aaagctaatg gttacacaac agagtacagt 121 gcatctgtga agggtcggtt caccatctcc agagataatt cccaaagcat cctctatctt 181 caaatgaaca ccctgagagc tgaggacagt gccacttatt actgtgcaag agatcaacgc 241 ctggtttgct tactggggcc aagggactct ggtcactgtc tctgca // LOCUS MUSIGHACS 354 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds. ACCESSION M34734 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H36-12, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 354 Ig H-chain VDJ-region (AA at 1) recomb 303 304 V-region end/D-region start recomb 311 312 D-region end/J-region start BASE COUNT 85 a 79 c 93 g 90 t 7 others ORIGIN 1 gaggtgaagt tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagactc 61 tcctgtgcaa cctctggttt taccttcacg aattactaca tgnnnnnnnt ccgccagcct 121 ccaggaaagg cacttgagtg gttgggtgtt attagaaaca aagctaatgg tgacacaaca 181 gagtataatg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc 241 ctctatcttc aaatgaacac cctgagagct gaggacagtg ccacttatta ctgtgcaaga 301 gatagcggtc cgtttgctta ctggggccaa gggactctgg tcactgtctc tgca // LOCUS MUSIGHACT 312 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds. ACCESSION M34735 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig H-chain VDJ-region (AA at 2) recomb 262 263 V-region end/D-region start recomb 268 269 D-region end/J-region start BASE COUNT 95 a 69 c 73 g 75 t ORIGIN 1 gcctaaaggg tcattgaaac tctcatgtgc agcctctgga ttcaccttca ataccaatgc 61 catgaactgg gtccgccagg ctccaggaaa gggtttggaa tgggttgctc gcataagaag 121 taaaagtaat aattatgcaa catattatgc cgattcagtg aaagacaggt tcaccatctc 181 cagagatgat tcacaaagca tgctctatct gcaaatgaac aacttgaaaa ctgaggacac 241 agccatgtat tactgtgtga gagcggcgta ctatgctatg gactactggg gtcaaggaac 301 ctcagtcacc gt // LOCUS MUSIGHACU 353 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds. ACCESSION M34736 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H230-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 353) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 353 Ig H-chain VDJ-region (AA at 1) recomb 294 295 V-region end/D-region start recomb 308 309 D-region end/J-region start BASE COUNT 87 a 83 c 100 g 83 t ORIGIN 1 gaagtgcagc tggtggagtc tgggggaggc ttagtgaagc ctggagggtc cctgaaactc 61 tcctgtgcag cctctggatt cactttcagt gactattaca tgtattgggt tcgccagact 121 ccggaaaaga ggctggagtg ggtcgcaacc attagtgatg gtggtagtta cacctactat 181 ccagacagtg tgaaggggcg attcaccatc tccagagaca atgccaagaa caacctgtac 241 ctgcaaatga gcagtctgaa gtctgaggac acagccatgt attactgtgc aaggtacggg 301 acccttattg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctc // LOCUS MUSIGHACV 351 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds. ACCESSION M34737 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H35-D1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 351) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 351 Ig H-chain VDJ-region (AA at 1) recomb 294 295 V-region end/D-region start recomb 300 301 D-region end/J-region start BASE COUNT 89 a 82 c 99 g 81 t ORIGIN 1 gaagtgcagc tggtggagtc tgggggaggc ttagtgaagc ctggagggtc cctgaaactc 61 tcctgtgcag cctctggatt cactttcagt gactattaca tgtattgggt tcgccagact 121 ccggaaaaga ggctggagtg ggtcgcaacc attagtgatg gtggtagtta cacctactat 181 ccagacagtg tgaaggggcg attcaccatc tccagagaca atgccaagaa caacctgtac 241 ctgcaaatga gcagtctgaa gtctgaggac acagccatgt attactgtgc aaggtattac 301 tacggtagga ttgactactg gggccaaggc accactctca cagtctccgc a // LOCUS MUSIGHACW 348 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig H-chain mRNA VDJ-region, partial cds. ACCESSION M34738 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma L11-1A1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 348 Ig H-chain VDJ-region (AA at 1) recomb 291 292 V-region end/D-region start recomb 309 310 D-region end/J-region start BASE COUNT 84 a 97 c 80 g 87 t ORIGIN 1 gatgtgcagc ttcaggagtc gggacctggc ctggtgaaac cttctcagtc tctgtccctc 61 acctgcactg tcactggcta ctcaatcacc agtgattatg cctggaactg gatccggcag 121 tttccaggaa acaagctgga gtggatgggc tacataaact acagtggtta cactacctac 181 aacccatctc tcaaaagtcg aatctctatc actcgagaca catccaagga ccagttcttc 241 ctgcagttga attctgtgac tactgaggac acagccacat attactgtgc attactacgg 301 ctacgtgcgg cttactgggg ccaagggact ctggtcactg tctctgca // LOCUS MUSIGKACN 341 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34739 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H144-17B7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 341) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 341 Ig kappa-chain VJ-region (AA at 1) recomb 302 303 V-region end/J-region start BASE COUNT 92 a 83 c 87 g 79 t ORIGIN 1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact 61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc 121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg 181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc 241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat 301 ccgctcacgt tcggtgctgg gaccaagctg gagctgaaac g // LOCUS MUSIGKACO 341 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34740 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H230-7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 341) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 341 Ig kappa-chain VJ-region (AA at 1) recomb 302 303 V-region end/J-region start BASE COUNT 92 a 82 c 87 g 79 t 1 others ORIGIN 1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact 61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc 121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg 181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc 241 atnagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat 301 ccgctcacgt tcggtgctgg gaccaagctg gagctgaaac g // LOCUS MUSIGKACP 338 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34741 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H210-56, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 338) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 338 Ig kappa-chain VJ-region (AA at 1) recomb 302 303 V-region end/J-region start BASE COUNT 91 a 82 c 86 g 79 t ORIGIN 1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact 61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc 121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg 181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc 241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat 301 ccgctcacgt tcggtgctgg gaccaagctg gagctgaa // LOCUS MUSIGKACQ 341 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34742 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-17, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 341) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 341 Ig kappa-chain VJ-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 92 a 83 c 86 g 80 t ORIGIN 1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact 61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc 121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg 181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc 241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat 301 cctctcacgt tcggtgctgg gaccaagctg gagctgaaac g // LOCUS MUSIGKACR 341 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34743 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H220-3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 341) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 341 Ig kappa-chain VJ-region (AA at 1) recomb 304 305 V-region end/J-region start BASE COUNT 96 a 81 c 84 g 80 t ORIGIN 1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact 61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc 121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg 181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc 241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatagttat 301 cctctcacgt tcggctcggg gacaaagttg gaaataaaac g // LOCUS MUSIGKACS 335 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34744 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H35-D1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 335) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 335 Ig kappa-chain VJ-region (AA at 1) recomb 302 303 V-region end/J-region start BASE COUNT 89 a 82 c 84 g 79 t 1 others ORIGIN 1 gacattgtga tgacacagtc tccatcctcc ctgactgtga cagcaggaga gaaggtcact 61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa ctacttgacc 121 tggtaccagc agaaaccagg gcagcctcct aaactgttga tctactgggc atccactagg 181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc 241 atcagcagtg tgcaggctga agacctngca gtttattact gtcagaatga ttatagttat 301 ccgctcacgt tcggtgctgg gaccaagctg gagct // LOCUS MUSIGKACT 333 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34745 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma L11-1A1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 333) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 333 Ig kappa-chain VJ-region (AA at 1) recomb 296 297 V-region end/J-region start BASE COUNT 90 a 80 c 86 g 77 t ORIGIN 1 gtgatgacac agtctccatc ctccctgact gtgacagcag gagagaaggt cactatgagc 61 tgcaagtcca gtcagagtct gttaaacagt ggaaatcaaa agaagtactt gacctggtac 121 cagcagaaac cagggcagcc tcctaaactg ttgatctact gggcatccac tagggaatct 181 ggggtccctg atcgcttcac aggcagtgga tctggaacag atttcactct caccatcagc 241 agtgtgcagg ctgaagacct ggcagtttat tactgtcaga atgattatag ttatccgctc 301 acgttcggtg ctgggaccaa gctggagctg aaa // LOCUS MUSIGKACU 339 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34746 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma IC5-2A6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 339 Ig kappa-chain VJ-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 92 a 80 c 84 g 83 t ORIGIN 1 gacattgtga tgacacagtc tccatcttcc ctgactgtga cagcaggaga gaaggtcact 61 atgagctgca agtccagtca gagtctgtta aacagtggaa atcaaaagaa cttcttgacc 121 tggtaccagc agaaaccagg acagtctcct aaactgttga tctactgggc atccactagg 181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc 241 atcagcagtg tgcaggctga agacgtggca gtttattact gtcagaatga ttatacttat 301 cctctcacgt tcggtgctgg gaccaagctg gagctgaaa // LOCUS MUSIGKACV 318 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34747 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H36-8, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 318 Ig kappa-chain VJ-region (AA at 2) recomb 283 284 V-region end/J-region start BASE COUNT 85 a 79 c 77 g 77 t ORIGIN 1 tccatcctcc ctgactgtga cagcaggaga gaaggtcact atgagttgca agtccagtca 61 gagtctctta aacagtggaa ctcaaaagaa ctacttgacc tggtaccagc agaaaccagg 121 gcagcctcct aaactattga tctactgggc atccactagg gaatctgggg tccctgatcg 181 cttcacaggc agtggatctg gaacagattt cactctcacc atcatcagtg tgcagactga 241 agacctggca gtttatttct gtcagaatga ttatagttat ccgctcacgt tcggtgctgg 301 gaccaagctg gagctgaa // LOCUS MUSIGKACW 246 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34748 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H36-12, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 246) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 246 Ig kappa-chain VJ-region (AA at 2) recomb 211 212 V-region end/J-region start BASE COUNT 64 a 64 c 59 g 59 t ORIGIN 1 ccgtggaaat caaaagaact acttgacctg gtaccagcag aaaccagggc agcctcctaa 61 actcttaatc tactgggcat ccactaggga atctggggtc cctgatcgct tcacaggcag 121 tggatctgga acagatttca ctctcaccat cagcagtgtg cgggctgaag acctggcagt 181 ttattactgt cagaatgatt atcattatcc gctcactttc ggtgctggga ccaagctgga 241 cctgaa // LOCUS MUSIGKACX 176 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA VJ-region, partial cds. ACCESSION M34749 KEYWORDS immunoglobulin light chain; joining exon; kappa-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) adult spleen anti-HA hybridoma H37-41, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 176) AUTHORS Clarke,S.H., Staudt,L.M., Kavaler,J., Schwartz,D., Gerhard,W.U. and Weigert,M.G. TITLE V region gene usage and somatic mutation in the primary and secondary responses to influenza virus hemagglutinin JOURNAL J. Immunol. 144, 2795-2801 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 176 Ig kappa-chain VJ-region (AA at 1) recomb 138 139 V-region end/J-region start BASE COUNT 41 a 45 c 47 g 43 t ORIGIN 1 tgggcatcca ctagggaatc tggggtccct gatcgcttca caggcagtgg atctggaaca 61 gatttcactc tcaccatcac caatgtgcag gctgaagacc tggcagttta ttactgtcag 121 aatgattaca cttatccgct cacgttcggt gctgggacca agctggagct gagacg // LOCUS MUSMHH2A 876 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse MHC class I H-2K-kml mRNA (H-2K-kml haplotype), partial cds. ACCESSION M34932 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse (strain CBA.M523) lymphoma cell line RDM-4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 876) AUTHORS Martinko,J.M., Solheim,J.C. and Geliebter,J. TITLE The H-2K-km1 mutation: A single nucleotide substitution is responsible for multiple functional differences in a class I MHC molecule JOURNAL Mol. Immunol. 25, 267-274 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 876 MHC H-2K-kml mRNA (AA at 1) BASE COUNT 180 a 255 c 286 g 155 t ORIGIN 1 ggcccacatt cgctgaggta tttccacacc gccgtgtccc ggcccggcct cgggaagccc 61 cggttcatct ctgtcggcta cgtggacgac acgcagttcg tgcgcttcga cagcgacgcg 121 gagaatccga ggtatgagcc gcgggtgcgg tggatggagc aggtggagcc cgagtattgg 181 gagcggaaca cgcagatcgc caagggcaat gagcagattt tccgagtgaa cctgaggacc 241 gcgctgcgct actacaacca gagcgcgggc ggctctcaca cgttccaacg gatgtacggc 301 tgtgaggtgg ggtcggactg gcgcctcctc cgcgggtacg agcagtacgc atacgacggc 361 tgcgattaca tcgccctgaa cgaagacctg aaaacgtgga cggcggccga catggcggcg 421 ctgatcacca aacacaagtg ggagcaggct ggtgctgcag agagagaccg ggcctacctg 481 gagggcacgt gcgtggagtg gctccgcaga tacctgcagc tcgggaacgc gacgctgccg 541 cgcacagatt ccccaaaggc ccatgtgacc cgtcacagca gacctgaaga taaagtcacc 601 ctgaggtgct gggccctggg cttctaccct gctgacatca ccctgacctg gcagttgaat 661 ggggaggagc tgacccagga catggagctt gtggagacca ggcctgcagg ggatggaacc 721 ttccagaagt gggcatctgt ggtggtgcct cttgggaagg agcagtatta cacatgccat 781 gtgtaccatc aggggctgcc tgagcccctc accctgagat gggagcctcc tccatccact 841 gtctccaaca cggtaatcat tgctgttctg gttgtc // LOCUS PEALEGAB 137 bp ss-mRNA PLN 21-JUL-1990 DEFINITION P.sativum legumin mRNA, 3' flank. ACCESSION M34908 KEYWORDS legumin; storage protein. SOURCE P.sativum, cDNA to mRNA, clones pDUB[1,3]. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 137) AUTHORS Lycett,G.W., Delauney,A.J. and Croy,R.R.D. TITLE Are plant genes different? JOURNAL FEBS Lett. 153, 43-46 (1983) STANDARD simple staff_review FEATURES from to/span description mRNA < 1 137 legumin mRNA BASE COUNT 53 a 20 c 24 g 40 t ORIGIN 1 tagatttcgc accaaatcaa tgaaagtaat gaataagaaa actaaggctt agatgccttt 61 gttacttgtg taaaataact cgagtcatgt acctttttgc ggaaacagaa taaataaaag 121 gtaaaatttc agtgctc // LOCUS PEAVICA 81 bp ss-mRNA PLN 21-JUL-1990 DEFINITION P.sativum vicilin mRNA, 3' flank. ACCESSION M34909 KEYWORDS storage protein; vicilin. SOURCE P.sativum, cDNA to mRNA, clone pDUB2. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 81) AUTHORS Lycett,G.W., Delauney,A.J. and Croy,R.R.D. TITLE Are plant genes different? JOURNAL FEBS Lett. 153, 43-46 (1983) STANDARD simple staff_review BASE COUNT 32 a 11 c 13 g 25 t ORIGIN 1 taatgagaga tcaaatattt tgcatgtatg ctataaagaa ctatagctca taatgagcaa 61 ggaataaaac atcgttctct t // LOCUS ECOALKPH 52 bp ds-DNA SYN 21-JUL-1990 DEFINITION Synthetic E.coli alkaline phosphatase gene, partial cds. ACCESSION M34498 KEYWORDS alkaline phosphatase. SOURCE E.coli (strain AW1043) DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 52) AUTHORS Chou,M.M. and Kendall,D.A. TITLE Polymeric sequences reveal a functional interrelationship between hydrophobicity and length of signal peptides JOURNAL J. Biol. Chem. 265, 2873-2880 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 52 synthetic alkaline phosphatase precursor (AA at 1) sigp < 1 > 52 synthetic alkaline phosphatase precursor signal peptide BASE COUNT 9 a 15 c 10 g 18 t ORIGIN 1 tcgactctgc ttctgctgct cttgctgtta ctgttaaccc ctgtgacaaa ag // LOCUS ECOALKPHX 52 bp ds-DNA SYN 21-JUL-1990 DEFINITION Synthetic E.coli alkaline phosphatase gene, partial cds. ACCESSION M34499 KEYWORDS alkaline phosphatase. SOURCE E.coli (strain AW1043) DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 52) AUTHORS Chou,M.M. and Kendall,D.A. TITLE Polymeric sequences reveal a functional interrelationship between hydrophobicity and length of signal peptides JOURNAL J. Biol. Chem. 265, 2873-2880 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 52 synthetic alkaline phosphatase precursor (AA at 1) sigp < 1 > 52 synthetic alkaline phosphatase precursor signal peptide BASE COUNT 8 a 10 c 17 g 17 t ORIGIN 1 tcgactgtcg tggttgtcgt agttgtggtg gtcgttaccc ctgtgacaaa ag // LOCUS ECOALKPHXX 52 bp ds-DNA SYN 21-JUL-1990 DEFINITION Synthetic E.coli alkaline phosphatase gene, partial cds. ACCESSION M34500 KEYWORDS alkaline phosphatase. SOURCE E.coli (strain AW1043) DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 52) AUTHORS Chou,M.M. and Kendall,D.A. TITLE Polymeric sequences reveal a functional interrelationship between hydrophobicity and length of signal peptides JOURNAL J. Biol. Chem. 265, 2873-2880 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 52 synthetic alkaline phosphatase precursor (AA at 1) sigp < 1 > 52 synthetic alkaline phosphatase precursor signal peptide BASE COUNT 10 a 18 c 17 g 7 t ORIGIN 1 tcgactgcag cagctgcagc tgcggctgcc gcggcgaccc ctgtgacaaa ag // LOCUS ECOALKPHY 52 bp ds-DNA SYN 21-JUL-1990 DEFINITION Synthetic E.coli alkaline phosphatase gene, partial cds. ACCESSION M34501 KEYWORDS alkaline phosphatase. SOURCE E.coli (strain AW1043) DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 52) AUTHORS Chou,M.M. and Kendall,D.A. TITLE Polymeric sequences reveal a functional interrelationship between hydrophobicity and length of signal peptides JOURNAL J. Biol. Chem. 265, 2873-2880 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 52 synthetic alkaline phosphatase precursor (AA at 1) sigp < 1 > 52 synthetic alkaline phosphatase precursor signal peptide BASE COUNT 8 a 11 c 25 g 8 t ORIGIN 1 tcgactggcg gtgggggcgg cggaggtggt ggcggtaccc ctgtgacaaa ag // LOCUS CPCURA5A 2029 bp ds-DNA BCT 21-JUL-1990 DEFINITION C.neoformans orotidine monophosphate pyrophosphorylase (URA5) gene, complete cds. ACCESSION M34606 KEYWORDS orotidine monophosphate pyrophosphorylase. SOURCE C.neoformans (strain B-3501) asexual phase cell line B-3501 DNA, clone pURA5g2. ORGANISM Cryptococcus neoformans Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina; Filobasidiales. REFERENCE 1 (bases 1 to 2029) AUTHORS Edman,J.C. and Kwon-Chung,K.J. TITLE Isolation of the URA5 gene from Cryptococcus neoformans var neoformans and its use as a selective marker for transformation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Edman, 29-MAY-1990. Author address: J.C.Edman University of California San Francisco Hormone Research Institute San Francisco, CA 94143-0534 email: edman@socrates.ucsf.edu FEATURES from to/span description pept 714 814 orotidine monophosphate pyrophosphorylase (URA5), exon 1 (E.C. 2.4.2.10) 867 1095 orotidine monophosphate pyrophosphorylase, exon 2 1145 1492 orotidine monophosphate pyrophosphorylase, exon 3 pre-msg < 714 1527 URA5 mRNA and introns IVS 815 866 URA5 intron A IVS 1096 1144 URA5 intron B BASE COUNT 512 a 515 c 479 g 523 t ORIGIN 1 gatcttgggg atggtattga agacgatcca gaaaaagtct ctccatccct cccacgttcg 61 ctgttcggtt tctttgcaaa gagcgaagtt gctcgacctt tgacagaccc taaaatctgg 121 tcgtattttg cgttcaaaga tcgcttactg cgaccgcgat tttttagagg tggatttttt 181 gttgcgagac tagcgcatcg gataggcatg ggtgatatac gtttgcgttc cttatgaggg 241 gtatcatcag tggcaaatta ccgactagat gaagaagacg acctacaccc cgctaaccgg 301 tgtccgatcg acatgatcca cggtgtccaa agcgaagaag gacatgatga ttggagttcg 361 cttacctttg gtataaacca gtgttgtttc tgaagaggtg aggaagaaaa tgatctaggg 421 caataacaga aatcgtttga agaagaagta aaagaaaaga cacacgtcag acaagcatca 481 tcatcattca ttttgaatac tcatcacggc accttcttcc ttgcatgcgc attattcatg 541 atgagatgct cgcagccgcc ggatttcggc tgcaatcaag aacgatatcg cgtacggcga 601 tgtccgcctc cactccatct tccaacaacc accacagtgc ccgataacac tacgacgaaa 661 attgttcttt tttggatctt gtcgacaacg gcgaccaaaa caagaacttg atcatgtcct 721 cccaagccct cgactccgcc aaagttgcct tcatcgaggc tgccatcgaa catggcgtgc 781 ttcttttcgg caactttacc ttgaagtccg gccggtgagc catattgcag cgcttcacag 841 tccaatcgaa tctgacatgt gttcagccaa tccccttact tcttcaatgc cggtctcctt 901 tactcttcat cgcttctctc aactaccgct caggcttacg ccaaggtact ttcctcttct 961 aggattcctg actttgacgt cctcttcggc ccagcttaca agggtatctc cttggctgct 1021 gtctccgctg taagccttta tcagcaaacc ggcaaagata tcggctactg ctacaacagg 1081 aaggagaaga aggacgtgag tctgtcctaa ccagtgcgac agcgatgagc tcataagcca 1141 gtagcacggt gagggcggta ctatggtcgg tgcgcctctc aagggacgaa tcgtcatcat 1201 cgacgatgtt ctcacctctg gcaaggccat ccgtgaagct attgacattc tcaaggcctc 1261 ccctgaagcg aagcttgtcg gaattgtcca gcttgtcgac agacaagaga aaggccagag 1321 cggtagcggc aagagtaccg tacaggaggt tgaggaagag ttcggtgtgc ctgtcgagcc 1381 tattattggt ttggacgaca ttgtgaagta cttagaaagc tccggcaagt gggaaaagga 1441 gctgcaagag gtcaggaagt acagggcgga gtacggtgtt cagaggtctt aagggttttc 1501 ttcttaaatg cacgggttta ggtctagcta atcaagttcc gacatattac aagtttgtaa 1561 gcttgtatca aaggaactta agtacaggca ggcgtcctga ggcgacaaag gaagctgtaa 1621 tatgattgtt ggctgtcaat cttcatcgta tctactttgt caatactgac ttcaatgacc 1681 caataataca attttattag tgttgaccca gaatggttag caggaaactc cccttctctt 1741 cctctcaatc ccaatcatac ttcatatctc ctgctccccc catttccgtc ttcctcgatg 1801 actccctggt cccatccctc ccacctcctg gaggcaagct ggagcacctg gacctaatgg 1861 gtcgtcgccc caaattgcct ccacctctta agacaatcat cgtccaatca aactctgaac 1921 tatcttccaa gccaatggcc ggatctggac acaagatgat gtcgaagccg cttgatgtgc 1981 cagggttggt ccgtggcctg gagacgtaag agcgggtagt actgggatc // LOCUS ECOCHEA 2190 bp ds-DNA BCT 21-JUL-1990 DEFINITION E.coli cheA protein (complete cds), motB protein (3' end), and cheW protein (5' end) genes. ACCESSION M34669 KEYWORDS cheA gene; cheW gene; motB gene. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2190) AUTHORS Kofoid,E.C. and Parkinson,J.S. TITLE Tandem translational starts in the cheA locus of E.coli JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.C.Kofoid, 01-JUN-1990. Author address: E.C.Kofoid University of Utah Dept of Biology Salt Lake City, UT 84112 email: kofoid@bioscience.utah.edu FEATURES from to/span description pept < 1 104 MotB protein pept 109 2073 CheA protein (alt.) pept 400 2073 CheA protein (alt.) pept 2094 > 2190 CheW protein BASE COUNT 575 a 546 c 609 g 460 t ORIGIN 42 min on K12 map. 1 atgaaaacgc cgaaagccag aatgagccag taagcgccct ggaaaaacct gaggttgcac 61 cacaggtcag tgttcccaca atgccatcag ccgaaccgag gtgacagcgt gagcatggat 121 ataagcgatt tttatcagac attttttgat gaagcggacg aactgttggc tgacatggag 181 cagcatttgc tggttttgca gccggaagcg ccagatgccg aacaattgaa tgccatcttt 241 cgggctgccc actcgatcaa aggaggggca ggaacttttg gcttcagcgt tttgcaggaa 301 accacgcatc tgatggaaaa cctgctcgat gaagccagac gaggtgagat gcaactcaac 361 accgacatta tcaatctgtt tttggaaacg aaggacatca tgcaagaaca gctcgacgct 421 tataaacagt cgcaagagcc ggatgccgcc agcttcgatt atatctgcca ggccttgcgt 481 caactggcat tagaagcgaa aggcgaaacg ccatccgcag tgacccgatt aagtgtggtt 541 gccaaaagtg aaccgcaaga tgagcagagt cgcagtcagt cgccgcgacg aattatcctt 601 tcgccgctga aggccgggga agtcgacctg ctggaagaag aactgggaca tctgacaacg 661 ttaactgacg tggtgaaagg ggcggattcg ctctcggcaa tattaccggg cgacatcgcc 721 gaagatgaca tcacagcggt actctgtttt gtgattgaag ccgatcagat tacctttgaa 781 acagtagaag tctcgccaaa aatatccacc ccaccagtgc ttaaactggc agccgaacaa 841 gcgccaaccg gccgcgtgga gcgggaaaaa acgacgcgca gcaatgaatc caccagcatc 901 cgtgtagcgg tagaaaaggt tgatcaatta attaacctcg tcggcgagct ggttatcacc 961 cagtccatgc ttgcccagcg ttccagcgaa ctggacccgg ttaatcatgg tgatttgata 1021 accagcatgg ggcagttaca acgtaacgcc cgtgatttgc aggaatcagt gatgtcgatt 1081 cgcatgatgc cgatggaata tgtttttagt cgctatcccc ggctggtgcg tgatctggcg 1141 ggaaaactcg gcaagcaggt agaactgacg ctggtgggca gttctactga actcgacaaa 1201 agcctgatag aacgcattat cgacccgctg acccacctgg tacgcaatag cctcgatcac 1261 ggtattgaac tgccagaaaa acggctcgcc gcaggtaaaa acagcgtcgg aaatttaatt 1321 ctgtctgccg aacatcaggg cggcaacatt tgcattgaag tgaccgacga tggggcgggg 1381 ctaaaccgtg agcgaattct ggcaaaagcg gcctcgcaag gtttgactgt cagcgaaaac 1441 atgagcgacg acgaagtcgc gatgctgata tttgcacctg gcttctccac ggcagagcag 1501 gtcaccgacg tctccgggcg cggcgtcggc atggacgtcg ttaaacgtaa tatccagaag 1561 atgggcggtc atgtcgaaat ccagtcgaag cagggtactg gcactacgat ccgcatttta 1621 ctgccgctga cgctggccat cctcgacggc atgtccgtac gcgttgcgga tgaagttttc 1681 attctgccgc tgaatgctgt tatggaatca ctgcaacccc gtgaagccga tctccatcca 1741 ctggccggcg gcgagcgggt gctggaagtg cggggtgaat atctgcccat cgtcgaactg 1801 tggaaagtgt tcaacgtcgc gggcgcgaaa accgaagcca cccagggaat tgtggtgatc 1861 ttacaaagtg gcggtcgccg ctacgccttg ctggtggatc aattaattgg tcaacaccag 1921 gttgtggtta aaaaccttga aagtaactat cgcaaagtcc ccggcatttc tgctgcgacc 1981 attcttggcg acggcagcgt ggcactgatt gttgatgtct ccgccttgca ggcgataaac 2041 cgcgaacaac gtatggcgaa caccgccgcc tgaatgagta aaaaggtaac aatatgaccg 2101 gtatgacgaa tgtaacaaag ctggccagcg agccgtcacg gcaggaattt ctggtattta 2161 cccttggtga tgaagagtac ggtattgatg // LOCUS HAMHSP60A 2726 bp ds-DNA ROD 21-JUL-1990 DEFINITION C.griseus chaperonin (HSP60) non-functional pseudogene. ACCESSION M34562 KEYWORDS chaperonin; pseudogene. SOURCE C.griseus cell line CHO DNA, clone GC-1. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2726) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description IVS 1738 1824 HSP60 intron A BASE COUNT 915 a 513 c 579 g 719 t ORIGIN 1 catggtgatt tttctttaga tattctgggc ctcctgaaac tgacaagatt ctgtaaagca 61 aaagacataa tcaacaagat aagacagcag tgcaaacaat gggaaaagtt cttcaccaac 121 cccatatcta gccaagggca aatttctaaa acatacaaag aactgatgaa agcagtcaac 181 aaaacaccaa ataatcgccc tcactctccg ccaccagcct gctttgccag atgagaccag 241 tgtcccgggc gctggctcct catgtcactc gggcctatgc caatgatgta aaatgtgggg 301 caaatccttg atccttaaag cttcaagttg tagacctttt agccgatgct gtagctgtta 361 caatggggcc aaagggaaga acagtgatta ttgaacagag ttgaggaagt cccaaagtaa 421 caaaagatgg ggtcattttt acaaaggcaa ttgatctaaa ggatacatac aaataatatc 481 agagctaaac ttgttcaagt tgttgccaat aactcaaatg aagaggctgg gatggcacta 541 ccactgctac tgttctggca agttctattg ccaacgaagg ctttgagaag atcagcaaag 601 ggtctaatcc agtagaaatc tggagaagtg tgatgttggc tgttgatgct gtaattgctg 661 aactaacaaa acaatctaaa catgtgacaa cccatgaaga aatttctcag gttgctacaa 721 tttctgcaaa tggacacata gacattggaa acatcatatc tgatgcaatg aaacatgttg 781 gagggaaaga cgtcatcaca ctgaaggata gaaataccct gaatgatgaa ttagaaatta 841 ttggaggcat gaagtttgaa agaagatgta tttctcagac atcaataagt caaaaatgtg 901 aattccaaga tgcctatgtt ctattgagtg aaaagaaaat ttctattgtc gagcccatta 961 ctgctcatag gaaacaaatg gtgatatttg ctgaagatgt tgatggagaa gctctaagca 1021 cactggtttt gaacaggcta aaagctggtc ttcagatcgt agcagttaaa gctccaggtt 1081 tttggggaca ataggaagaa taagcttaaa gatacggcta ttgctactga tggtgagcaa 1141 tttggagaac aggggctaaa actaaatatt gatgatggtc aagttcatga tttaggaaaa 1201 tttggagagg tcattgtccc caaagatgat gccatgcttt tgaaaggaac tgatgactaa 1261 actcaaattg agaaaccaat tcaagaaatc actaagcagc tagacatcac aactagtgaa 1321 tatgaaaagg aaaagctaaa cgaaacactt gctaaacttt cagatggtgc agctgtgttt 1381 aaggttggag gaacaaggga tgttgaagtg aaggagaaga aagacagagt tacaaatgct 1441 ctcaatgcta caagagcagc tgttgaaaag ggcattgttt taggacttag ctgtgctctg 1501 cctctgtgca tcccagcctt ggattcctta aagccttcaa atgaagatca gaaaataggt 1561 atagaaatta ttaaaagagc actcaaaatt cctgcagtga caattgctga gaatgcaggt 1621 gttgaataat ctttgagagt tgagaaaatt ctgcagggtt cctagggagt tggttattag 1681 gctatgcccg caaattttgt gaacatggta gaaaaaggaa tcattaatcc aacaaaggta 1741 aatgagtcag caattcttaa acattggatg gcttgttcta tggtatgtgt tttcattaat 1801 cattttataa acattttttt gtaggttgta agaactgctt tagtagatgc ctccttgcta 1861 acttcagcag aagctgtagt gacagaaatt ccaaaagaag agaaggacct tggaatgcgc 1921 acacatgcct ttaatcctag cacttgcgag gcagaggcag gcagatatct gtgagttcga 1981 ggccagcctg gactccggag tgagtgccag gatagtctcc aaagctatcc aaagagaaac 2041 cctgtcttga aaaacaaaca aacaaaaaag acatttgtac attcctgaaa aaaacccaaa 2101 taatccaatt aaactagggg tcagcctaat aaagataccc gaataggttt ttacatctta 2161 agagaaaaat ctactaataa atgacttgga gcaggtcctt cacagacaat atttagtgag 2221 agtaggtagc ctcatgtggg ttagtagttc cagttgcttc aaaatgccct caccttttgt 2281 cataggctca tgtggaggat ctactccatt ggatggtgat attaaaagta gcacatgggt 2341 acatgcctta agatattaag atccccaagt aaaaactgca tctcagatgc tctctgtttc 2401 ttgggccatg atatgggcag tgagatttct gtcacacttg ccaggaaaca cttatgatct 2461 gtctgtcaca atctcttata ttgtcaagta cttaaccaaa cacaatcacc catgacctct 2521 catatcctcc tggacttaga caggaggcac aagtctactc aggtataaat atttttactt 2581 ctgatgcccc acagtaacct acaatggctt cagtccagta cacaggtaag taaacccacc 2641 aatataccat tccagctgat tgataacctc ttagatacat cccatatgct gcagttccca 2701 aatggagaca acccaaagtt tcttga // LOCUS HAMHSP70A 2243 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Chinese hamster heat shock protein (cognate form of hsp70) mRNA, complete cds. ACCESSION M34561 KEYWORDS heat shock protein; stress protein. SOURCE Chinese hamster cell line CHO, cDNA to mRNA, clone RS 23. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2243) AUTHORS Ahmad,S., Ahuja,R., Venner,T.J. and Gupta,R.S. TITLE Identification of a protein altered in mutants resistant to microtubule inhibitors with the constitutive form of the major heat shock protein (hsp70) JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN% GUPTAR@,SSCVAX.McMASTER.CA FEATURES from to/span description pept 50 1990 heat shock protein (hsp70) mRNA < 1 2243 hsp70 mRNA signal 2220 2225 poly-A signal BASE COUNT 654 a 486 c 551 g 552 t ORIGIN 1 gttttgtgtg gtctcgtcgc cagcacagcc gggcctacag caagcaacca tgtctaaggg 61 acctgcagtt ggtattgatc ttggcaccac ctactcctgt gtgggtgtct tccagcatgg 121 aaaggtggag ataattgcca atgaccaggg taaccgaacc acgccaagct atgttgcttt 181 tactgacaca gaacgattaa ttggggatgc cgcgaagaat caggttgcaa tgaaccccac 241 caacacagtt tttgatgcca aacgtctgat cggacgtagg tttgatgatg ctgttgttca 301 gtctgatatg aagcactggc ccttcatggt ggtgaacgat gcaggcaggc ccaaggtcca 361 agtagaatac aaaggagagg caaaaagttt ctacccagag gaggtgtcct ctatggttct 421 gaccaagatg aaggaaattg cagaagccta tctcgggaag actgttacca atgctgtggt 481 cacagtgcca gcctacttca atgattctca gcgacaggct acaaaagatg ctggaactat 541 tgctggcctg aatgtcctgc gaattatcaa tgaaccaact gctgctgcta ttgcttatgg 601 gctagataag aaggttggag ctgaaaggaa tgtgctcatt tttgacttgg gaggtggtac 661 ttttgatgtg tctatcctca ctattgagga tggaattttt gaagtcaaat caacagccgg 721 agacacccac ttgggtggag aagactttga taacagaatg gtcaaccatt tcattgctga 781 gtttaagcga aacgacaaga aggacatcag cgagaacaag agagctgtca ggcgtctgcg 841 tactgcctgt gaacgggcaa agcgcaccct ctcctccagc acccaggcca gtattgagat 901 tgattctctc tatgagggaa ttgatttcta tacctccatt acccgtgctc gattcgaaga 961 attgaatgct gacctgttcc gaggcactct ggaccctgta gagaaggccc ttcgagatgc 1021 taaactagac aagtcacaga tccatgacat tgtcttggtg ggtggttcca ccagaatccc 1081 caagattcag aagcttctgc aagacttctt caatggaaaa gaactgaata agagcattaa 1141 ccccgatgaa gctgttgctt atggtgcagc tgtccaggcg gccattctat ctggagacaa 1201 gtctgagaat gttcaggatt tgttgctctt ggatgtcact cctctttccc ttggcattga 1261 aactgctggt ggagtcatga ctgtcctcat caagcgcaat accaccatcc ctaccaagca 1321 gacgcagact ttcactacct attctgacaa tcagcctggt gtgctcattc aggtctatga 1381 aggcgaaagg gccatgacca aggataacaa cctgcttgga aagtttgaac ttacaggcat 1441 ccctccagca ccccgtggtg ttcctcagat tgaagttact tttgacattg atgccaatgg 1501 catcctcaat gtctctgctg tagataagag cacaggcaag gagaacaaga tcactatcac 1561 caacgacaag ggccgcttga gcaaggagga tattgaacgc atggtccagg aagctgagaa 1621 gtacaaagct gaagatgaga agcagagaga taaggtttcc tccaagaatt cactggagtc 1681 ctatgctttc aacatgaaag caacagttga agacgagaaa cttcaaggca agatcaatga 1741 tgaggacaaa cagaagattc tagacaagtg taatgaaatc atcagttggc tggataagaa 1801 tcagactgca gagaaggaag aatttgaaca ccagcagaaa gaactggaga aagtctgcaa 1861 ccccatcatt accaagctgt accagagtgc tggtggcatg cctggaggaa tgcccggggg 1921 cttccctggt ggtggagctc ctccatctgg tggtgcttct tcagggccta ccattgaaga 1981 ggtggattaa gtcagtccaa gtagaggtgt agcattgttc cacagggaaa catttgaagg 2041 acccaaattt gcaaagtttc atggcagttt ttaaatttaa actgttaaca ataaattact 2101 gggcattctc aatacttgaa catggaacat gtacacagga aataaaacat tgctctttat 2161 aagcactgta tcctaagtgg aaaactcaat gtctaatatg gtggaaaatg gaatgtctaa 2221 ataaaactat ttaaattggc acc // LOCUS HAMTCP1A 1845 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Chinese hamster T-complex protein 1 (TCP-1) mRNA, complete cds. ACCESSION M34665 KEYWORDS T-complex protein 1. SOURCE Chinese hamster cell line CHO, cDNA to mRNA, clone pRG 501. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1845) AUTHORS Ahmad,S. and Gupta,R.S. TITLE Cloning of a Chinese hamster ovary cell protein homologous to the mouse T-complex protein TCP-1: Structural similarity to the ubiquitous "Chaperonin" family of heat shock proteins JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 01-JUN-1990. Author address: R.S.Gupta McMaster University 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR@,SSCVAX.McMASTER.CA FEATURES from to/span description pept 89 1759 T-complex protein 1 signal 1818 1823 poly-A signal BASE COUNT 530 a 368 c 472 g 475 t ORIGIN 1 gttccccgct gtggtggccg cagtccggcc tctgagcgga gatcccggcc gtccgcggtg 61 ttagcgaggc cagccaggtt ccctaacgat ggagggccct ttgtccgtgt tcggggaccg 121 cagcactgga gaggcgatcc gctcccagaa tgttatggct gcagcttcca ttgccaacat 181 tgtgaaaagt tctctggggc cagttggctt ggataaaatg ttggtggatg acattggtga 241 tgtaaccatt accaacgatg gggccaccat cctgaagtta ctggaggtag aacaccctgc 301 agctaaagtt ctttgtgagc tggctgactt gcaagacaaa gaagtgggag atggaactac 361 ctctgtggta attattgcag cggagcttct gaaaaatgca gatgaactag tcaaacagaa 421 aattcatcca acatcagtta ttagtggcta tcggcttgcc tgcaaggaag cagtgcgcta 481 tatcagtgag aacctaatta ttaacacaga cgaacttgga agagactgtc tgattaatgc 541 cgctaagaca tccatgtctt ccaaaattat tggaataaat ggtgatttct ttgctaatat 601 ggtggtagat gctgtacttg ctgtcaaata cacagatctg agaggccagc ctcgatatcc 661 agtcaattct gttaatattc tgaaagccca tggaagaagt caggtagaaa gcatgctgat 721 caatggctat gcactgaact gtgtggtggg atctcagggc atgcccaaga ggatagttaa 781 tgcaaaaatt gcatgtcttg acttcagcct gcagaaaaca aaaatgaagc ttggtgtgca 841 ggtggttatt acagatcctg agaaattgga ccaaattaga cagagagaat cagacatcac 901 caaggagaga attgaaaaga ttctggcgac tggggccaat gttattctga ccactggtgg 961 gatagatgat atgtgtctga agtactttgt ggaggctggt gccatggctg ttagacgagt 1021 tctaaaaagg gatctcaagc gtatcgctaa agcttctgga gcaagtatcc tgtccacact 1081 ggccaatttg gaaggcgaag aaacttttga agctaccatg ctggggcaag cagaagaagt 1141 ggtccaggag agaatttgtg atgatgagct gatcttaatc aaaaatacca aagctcgtac 1201 atctgcatca atcatcttac gaggagcaaa tgatttcatg tgtgatgaaa tggagcgatc 1261 tttacatgat gctctttgtg tggtgaagag agttttggag tcaaaatctg tagtcccagg 1321 aggaggtgct gtagaagccg ccctgtccat ataccttgaa aactatgcaa ccagtatggg 1381 gtctcgggaa cagcttgcta ttgcagagtt tgcaaggtct cttcttgtta ttcctaatac 1441 actggcagtg aatgctgccc aagactccac tgacctggtt gccaagttac gagcttttca 1501 caatgaggct caagttaacc cagaacgtaa aaacctgaag tggattggtc ttgatttgat 1561 caatggaaaa ccacgagaca acaagcaagc aggggtgttt gaaccaacca tagttaaagt 1621 gaagagcctg aagtttgcaa cagaagctgc aatcactatt cttcgaatcg atgatctgat 1681 aaaattacac ccggaaacta aagacgataa acatggaagt tatgaaaatg ctgttcactc 1741 tggagccctt gatgactgat tggatttccc ttttatttat agcagtgtca ggtgcaatgt 1801 agccttgggt atctcacatt aaagtacagt gagctgtcca cctga // LOCUS HUMHP601A 1941 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human chaperonin (HSP60) non-functional pseudogene 1. ACCESSION M34660 KEYWORDS chaperonin; pseudogene. SOURCE Human fetal liver DNA, clone PS2-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 111 to 1941) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1941) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description BASE COUNT 624 a 329 c 446 g 541 t 1 others ORIGIN 1 atatgaaata aagcctttat gtgctctacc taaattaata aatttattta ttttttttng 61 tatatttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct 121 gcctcgccat gcgacatgtc ctgccagcac cccacagaaa tgcttccatt ccattaccca 181 cagtctttca ccagatgaga ccagtgtcca ggctactggc tcctcacctc acttgaagtg 241 atggtaaaga tgtaaaattt ggtgctgatg tccgagcctt aatgcttaaa agtgtagacc 301 tttttgtcga tgtagccatt acaatggggc caaagggaag aacagtaatt attgaacata 361 gctgggcaag tcccaaagta acaaaagatg gtgtgactga tgcaaagtca attgacttaa 421 aagataaata taaaagcatt ggagctaaac ttgtccaaga tgttgccaat aacacagatg 481 aagagactgg ggatggcact atcacttgct gttgtactgg tatgctctat ttccaaataa 541 ggtttccaga aggttagcaa aggtgctaat ccagtggaaa tcaggagagg tgtatgttag 601 ctgatgctgt aattgctgaa cttaaaaagc agtctaaacc tgtgaccaaa cctgaagaaa 661 ttgcacaggt tgctacaatt tctgcaaatt gagacaaaga aattggcaac atcatctctg 721 atgcaatgaa aaaagttgga agaaagggca tcatcacagt aaaggatgga aaaacactga 781 ctgtagaatt agaaattatt gaaggcatga aatttgatcg aggatatatt tctccatact 841 ttattaatac atcaaaaggt gagaaatgtg aattccagga tgcctatgtt ctgttgcatg 901 aaaagcaaaa tttctacgtg ctccacgtcc attgtaactg ctcttgcaaa ttgccaatgc 961 ttactgtaag cctttggtca taattgctgg agatattgat ggagaagctc taactacact 1021 catcctgaat aggctaaagg ttggtcttca ggttgtggca gtcaaggctc cagggtttgg 1081 tgacaataga aagaaccagc ttaaagatac ggttattgct actggtggta cagtgtttgg 1141 agaagagggc tcacactaaa tcttgaagac gttcagcctc gtgacgtagg agagttggag 1201 aggtactgtg atcaagatat gccatgctct taaaagcaaa aggtaacaag tctcaaatta 1261 aaaatgtgtt caagaaatca ttgaccagtc agatgtcaca actatgtaat tgaaaaggaa 1321 aaagtgagtg gagaaacttt cagatggagt agctgtgctg aaggttggtg gcaagtgatg 1381 ttgaagttaa tgaagagaaa gacagagtta taggtgcact taatgctaca agagctgctg 1441 ttgaagaagg cattgtttag ggagggggtt gtgccctgct tcgatgcatt ccagccttgg 1501 actcattcac tccagctaat gaagataaaa taattggtat agaaattatt aaaagaacac 1561 tcaaaattcc agcaatgaca attcgtaaga atgcaggtgt tgatgcattt ttgatagttg 1621 agaaaattat gtaaagttcc tcagaagttg gttatgatac tatgttagga gatgtcgtga 1681 atatggtgga aaaagacatt attgacccaa caaaggttgt gagaactgct tcattggatg 1741 ctgctggcat gccctctata ttaactacag cagctgttgt agtcacagaa attcctaaag 1801 aagggaacag ccctggaatg ggtgcatgtg gaatgggagg tggcctgttc taattcctag 1861 aatagtgctt tacctttatt aatgaattgt gataggaacc caaggcagtg ttcctcacca 1921 ataacttcag agaagtcagt t // LOCUS HUMHP602A 1920 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human chaperonin (HSP60) non-functional pseudogene 2. ACCESSION M34662 KEYWORDS chaperonin; pseudogene. SOURCE Human fetal liver DNA, clone PS2-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 117 to 1920) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1920) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description BASE COUNT 619 a 324 c 431 g 544 t 2 others ORIGIN 1 tggattatgn aaatnaaagc ctttatgtgc tctgcctaaa ttaatatttt attttttttt 61 gatacttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct 121 gcctcacctc accatgcaca tgtcctgcca ccaccccaca gaaatgcttc acttacccac 181 agtctttcgc cagatgagac cagtgtccag ggtactggct cctcaccctc acttgaagga 241 tggtaaagat gtaaaatttg gtgctgatgt ctgcgcctta atgcttcaaa gtgtagactt 301 ttgtcagtaa ttattgaaca tagctgggaa gtcccaaagt aacaaaagat ggtctgactg 361 atgcaaagtc aattgactta aaggataaat ataaaagcat tggagctaaa cttctccaag 421 atgttgccta aaacacaaca aatggagaga ctggggatgg cactatcact gctgctgtac 481 tggtatgctc tatttccaaa taagccttcc agaaggttag caaaggtgct aatccagtgg 541 aaatcatcag agagtgtgat gttagctgtt gatgctgtaa ttgctgaact taaaaagcag 601 tctaaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga 661 cacaaagaaa attggtaaca tcatctctga tgcaatgaaa aagtttggaa gaaagggcat 721 catcacagta aaggatggaa aaacactgac tgatgaatta gaaattattg aaggcatgaa 781 aaatttgatc gaggatatat ttctccatac tttcttaata catcaaaagg tgagaaatgt 841 gaattccagg atgcctatgt tctactgagt gaaaagaaca tttctggtgt ccagtccatt 901 gtacctgctc ttgaaattgc cagtgcttac cttaagcctt tggtcacaat tgctgaagat 961 attgatgaag aaacgttaag tacgctcatc ttgaataggc taaaagttgg tcttcaggtt 1021 gtagcagcca aagtttcagg ttttggtgac aacggagcca gcttaaagat atggctattg 1081 ctactggtag tgcagtgttt ggagaagagg ggttaaccct aatcttgaag atgctcagcc 1141 tcatgactca ggaaaatttg gagaggtcat tgtgccaaag tagtgctatg tatttaaaag 1201 gaaaagatga caaagcttaa attgaaaaat gtattcaaga aatcattaag tagttagata 1261 tcataactag tggatatgaa aagaaaaaac tgaatgaata tatggcaaaa ctttcagatg 1321 gagtagccgt gttgatggtt ggttggtgag acaagtgatg ttcaggtgaa agacaagaaa 1381 gacagattta ccgatgtcct taatgctaca agagctgcta ttgaagaagg cattgttttg 1441 ggagggggtt gtgccctgct tcgagcattc cactcattca ctccagctaa tgaagataaa 1501 aataatggca tagaaattat taaaagaaca ctcaaaattc cagcaatgac tattgctaag 1561 aatgcaggtg ttgatttttg atagttgaga aaattatgca aagttcctca gaagttggtt 1621 atgatactat gttaggagat gtcgtgaata tggtggaaaa agacattatt gacccaacaa 1681 aggttgtgag aactgcttta ttggatgctg cttgcatggc tctctattaa ctacagcagc 1741 tgttgtagtc acaaaaattc ctaaagaagg gaacagccct ggaatgggtg caatgtgtgg 1801 aatgggaggt ggcctgttct aattcctaga atagtgcttt acctttatta atgaattgtg 1861 atagggagcc caaggcgtgt tcctcaccaa taacttcaga gaagtcagtt ggagaaaatg // LOCUS HUMHP603A 1912 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human chaperonin (HSP60) non-functional pseudogene 3. ACCESSION M34661 KEYWORDS chaperonin; pseudogene. SOURCE Human fetal liver DNA, clone PS5-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 99 to 1912) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1912) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description BASE COUNT 614 a 319 c 449 g 527 t 3 others ORIGIN 1 ctttatgtgc tcttanctaa attaataatt tatttanntt tttttgtata tttgctatgt 61 cacctttaga aagtaagagg cagctctcac ttgctgctga tgacctgcct cgccatgcac 121 atgtctgcca gcaccccaca gaaatgcttc cattacccac agtctttcac cagatgagac 181 cagtgtccag ggtactggct cctcacctca cttgaagtga tggtaaagat gtaaaatttc 241 ctgctgatgt ccgagcctta atgcttcaaa gtgtagactt ttttgtcgat gctgtagcca 301 ttacaatggg gccaaaggga agaacagtaa ttattgaaca tagctgggga agtcccaaag 361 taacaaaaga tggtgtaact gatgcaaagt caattgactt aaaggataaa tataaaagca 421 ttggagctaa acttgtccaa gatgttgcca ataacacaga tgaagagact gggggatggc 481 actatcactg ctgctgtact ggtatgctct atattcaaat aaggcttcca gaaggttagc 541 aaaggtgcta atccagtgga aatcaagaga ggtgtgatgt tagctgttga tgctgtaatt 601 gctgaactta aaaagcagtg taaacctgtg accaaacctg aaattgcaca ggttgctaca 661 atttctgcaa atggagacaa agaaattggt aacatctctg atgcaatgaa aaagtttgga 721 agaaagggca tcacagtaaa ggatggaaaa acactgactg atgaattaga aattattgaa 781 ggcatgaaaa tttgatcgag gatatatttc tccatacttt attaatacat caaaaggtga 841 gaaatgtgaa ttccaggatg cctatgttct gttgcatgaa aagaaaattt ctagtgtcca 901 gtccattgta actgctcttg aaattgccaa tgcttactgt aagcctttgg tcataattgc 961 tggagacatt gatggagaag ctctaactac actcatcctg aataggctaa aggttggtct 1021 tcaggttgtg gcagtcaaag ctccagggtt tgggtacaat agaaagaacc agcttaaaga 1081 tacggtcatt gctactggtg gtacagtgtt tggagaagag ggctgacact aaatcttgaa 1141 gacgttcagc ctcgtgacgt aggagaagtt ggagaggtca ctgtgaacaa agattatgcc 1201 atgctcttaa aaggaaaagg taacaagtct caaattgaaa aatgtgttca agaaatcatt 1261 gaccagtcag atgtcacaac tagtgaatac gaaaaggaaa aagtgagtgg agaaactttc 1321 agatggagta gctgtgctga aggttgttgg tgggacaagt gatgttgaag tgaatgaaga 1381 gaaagacaga gttacaggtg cacttaatgc tacaagagct gctgttgaag aaggcattgt 1441 ttagggaggg ggttgtgccc tgcttcgatg cattccagcc ttggactcat tcactccagc 1501 taatgaagat aaataattgg tatagaaatt attaaaagaa cactcaaaat tccagcaatg 1561 acaattgcta agaatgcagc tgttgatggc attttgatag ttgagaaaat tatgtaaagt 1621 gcctcagaag ttggttatga tactatgtta ggagatgtcg tgaatatggt gaaaaagaca 1681 ttattgaccc aacaaaggtt gtgagaactg cttcattgga tgctgctggc atggcctctc 1741 tattaactac agcagctgtt gtagtcacag aaattcctaa agaagagaac agccctggaa 1801 taggtgcaat gtgtggaatg ggaggtggcc tgttcaaatt cctagaatag tgctttacct 1861 ttattaatga attgtgatag gaacccaagg cagtgttcct caccaataac tt // LOCUS HUMHP604A 2019 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human chaperonin (HSP60) non-functional pseudogene 4. ACCESSION M34663 KEYWORDS chaperonin; pseudogene. SOURCE Human fetal liver DNA, clone PS5-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 123 to 2019) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2019) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description BASE COUNT 657 a 327 c 479 g 556 t ORIGIN 1 agagaaatgt aatatgaaat aaagctttat gtgctctacc taaattaata atttatttat 61 ttttttttgt atatttgcta tgtcaccttt agaaagtaag aggcagctct cacttgctga 121 ttgatgacct gatgatgacc tgcctcgcca tgcacatgtc ctgccagcac cccacagaaa 181 tgcttccatt acccacagtc tttcaccaga tgagaccagt gtccaggcta ctggctactg 241 gctcctcacc tcacttgaag tgatggtaaa gatgtaaaat ttggtgctga tgtccgagcc 301 ttaatgcttc aaagtgtata gcttttgtcg agctgttagc cattacaaat gggccaaagg 361 gaagaacagt agattattga acatagctgg ggaagtccca aagtaacaaa agagtggtgt 421 gactgatgca aaggcaattg acttaaagga taaatataaa agcattggag ttaaacttgt 481 ccaagatgtt gccaataaca cagatgaaga agactgggga tggcactatc actgctgctg 541 tactggtatg ctctatttcc aaataaggct tccagaaggt tagcaaaggt gctaatccgt 601 ggaaatcagg agaggtgtga tgttagctgt tgatgctgta attgctgaac ttaaaaagca 661 gtctaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga 721 gacaaagaaa ttggtaacat catctctgat gcaatgaaaa actttggaag aaagggcatc 781 atcacagtaa aggatggaaa aacactgact gatgaattag aaattattga aggcatgaaa 841 atttgatcga ggatatattt ctccatactt tattaataca tcaaaaggtg agaaatgtga 901 attccaggat gcctatgttc tgttgcatga aaagaaaatt tctagtgtcc agtccattgt 961 aactgctctt gaaattgcca atgcttagtg taagcctttg gtcatatatt gctggagaca 1021 ttgatggaga agctctaact acactcatcc tgaataagct aaaggttggt cttcaggttg 1081 tggcagtcaa agctaaagct ccagggtttg gtgacaatag aaagaaccag cttaaagata 1141 cggttattgc tactggtggt acagtgtttg gagaagaggg ctgacactaa atcttgaaga 1201 cgttcagcct cgtgacgtag gagaagttgg agaggtcact gtgatcaaga ttatgccatg 1261 ctcttaaaag gaaaaggtaa caagtctcaa attgaaaaat gtgtgcaaga aatcgttgac 1321 cagtcagatg tcacaactat gtaatacgaa aaggaaaaag tgagtgaaga aactttcaga 1381 tggagtagct gtgctgaagg ttggtgggac aagtgatgtt gaagtgaatg aagagaaaga 1441 cagagttata ggtgcactta atgctacaag agctgctgtt gaagaaggca ttgtttaggg 1501 agggggttgt gccctgcttc gatgcattcc agccttggac tcattcactc cagctaatga 1561 agataaaata attggtatag aaattattaa aagaacactc aaaattccag caatgacaat 1621 tgctaagaat gcaggtgttg atgcattttt gatagttgag aaaattatgt aaagttcctc 1681 agaagttggt tatgatacta tgttaggaga tgtcgtgaat atggtggaaa aagacattat 1741 tgacccaaca aaggttgtga gaactgcttc attggatgct gctgcatggc ctctctatta 1801 ttaactacag cagctgttgt agtcacagaa attcctaaag aagggaacag ccctggaatg 1861 ggtggaatgg gtgcaatgat atgtggaatg ggaggtggcc tgttctaatt cctagaatag 1921 tgctttacct ttattaatga attgtgatag gaaacccaag gcagtgttcc tcaccaataa 1981 cttcagagaa gtcagttgga gaaaatgaag aaaaaggct // LOCUS HUMHSP60A 2202 bp ss-mRNA PRI 21-JUL-1990 DEFINITION Human chaperonin (HSP60) mRNA, complete cds. ACCESSION M34664 KEYWORDS chaperonin. SOURCE Human placenta cDNA to mRNA, and DNA, clone PGEM-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2202) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description pept 25 1746 chaperonin (HSP60) BASE COUNT 699 a 371 c 538 g 594 t ORIGIN 1 cacgcttgcc gccgccccgc agaaatgctt cggttaccca cagtctttcg ccagatgaga 61 ccggtgtcca gggtactggc tcctcatctc actcgggctt atgccaaaga tgtaaaattt 121 ggtgcagatg cccgagcctt aatgcttcaa ggtgtagacc ttttagccga tgctgtggcc 181 gttacaatgg ggccaaaggg aagaacagtg attattgagc agggttgggg aagtcccaaa 241 gtaacaaaag atggtgtgac tgttgcaaag tcaattgact taaaagataa atacaagaac 301 attggagcta aacttgttca agatgttgcc aataacacaa atgaagaagc tggggatggc 361 actaccactg ctactgtact ggcacgctct atagccaagg aaggcttcga gaagattagc 421 aaaggtgcta atccagtgga aatcaggaga ggtgtgatgt tagctgttga tgctgtaatt 481 gctgaactta aaaagcagtc taaacctgtg accacccctg aagaaattgc acaggttgct 541 acgatttctg caaacggaga caaagaaatt ggcaatatca tctctgatgc aatgaaaaaa 601 gttggaagaa agggtgtcat cacagtaaag gatggaaaaa cactgaatga tgaattagaa 661 attattgaag gcatgaagtt tgatcgaggc tatatttctc catactttat taatacatca 721 aaaggtcaga aatgtgaatt ccaggatgcc tatgttctgt tgagtgaaaa gaaaatttct 781 agtatccagt ccattgtacc tgctcttgaa attgccaatg ctcaccgtaa gcctttggtc 841 ataatcgctg aagatgttga tggagaagct ctaagtacac tcgtcttgaa taggctaaag 901 gttggtcttc aggttgtggc agtcaaggct ccagggtttg gtgacaatag aaagaaccag 961 cttaaagata tggctattgc tactggtggt gcagtgtttg gagaagaggg attgaccctg 1021 aatcttgaag acgttcagcc tcatgactta ggaaaagttg gagaggtcat tgtgaccaaa 1081 gacgatgcca tgctcttaaa aggaaaaggt gacaaggctc aaattgaaaa acgtattcaa 1141 gaaatcattg agcagttaga tgtcacaact agtgaatatg aaaaggaaaa actgaatgaa 1201 cggcttgcaa aactttcaga tggagtggct gtgctgaagg ttggtgggac aagtgatgtt 1261 gaagtgaatg aaaagaaaga cagagttaca gatgccctta atgctacaag agctgctgtt 1321 gaagaaggca ttgttttggg agggggttgt gccctccttc gatgcattcc agccttggac 1381 tcattgactc cagctaatga agatcaaaaa attggtatag aaattattaa aagaacactc 1441 aaaattccag caatgaccat tgctaagaat gcaggtgttg aaggatcttt gatagttgag 1501 aaaattatgc aaagttcctc agaagttggt tatgatgcta tggctggaga ttttgtgaat 1561 atggtggaaa aaggaatcat tgacccaaca aaggttgtga gaactgcttt attggatgct 1621 gctggtgtgg cctctctgtt aactacagca gaagttgtag tcacagaaat tcctaaagaa 1681 gagaaggacc ctggaatggg tgcaatgggt ggaatgggag gtggtatggg aggtggcatg 1741 ttctaactcc tagactagtg ctttaccttt attaatgaac tgtgacagga agcccaaggc 1801 agtgttcctc accaataact tcagagaagt cagttggaga aaatgaagaa aaaggctggc 1861 tgaaaatcac tataaccatc agttactggt ttcagttgac aaaatatata atggtttact 1921 gctgtcattg tccatgccta cagataattt attttgtatt tttgaataaa aaacatttgt 1981 acattcctga tactgggtac aagagccatg taccagtgta ctgctttcaa cttaaatcac 2041 tgaggcattt ttactactat tctgttaaaa tcaggatttt agtgcttgcc accaccagat 2101 gagaagttaa gcagcctttc tgtggagagt gagaataatt gtgtacaaag tagagaagta 2161 tccaattatg tgacaacctt tgtgtaataa aaatttgttt aa // LOCUS MUSGCANF 4049 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse guanylate cyclase/atrial natriuretic factor mRNA, complete cds. ACCESSION J05504 KEYWORDS atrial natriuretic factor; guanylate cyclase. SOURCE Mouse (strain C57B1/6J) Leydig cell line MA-10, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4049) AUTHORS Pandey,K.N. and Singh,S. TITLE Molecular cloning and expression of murine guanylate cyclase/atrial natriuretic factor receptor cDNA JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.N.Pandey, 31-MAY-1990. FEATURES from to/span description pept 348 3521 guanylate cyclase/atrial natriuretic factor signal 4029 4034 poly-A signal BASE COUNT 879 a 1164 c 1140 g 866 t ORIGIN 1 cagaaaccct cccaaactcc tatagccaca cacacctttc ccggccaaga tccaaacaaa 61 cctctacttt cctcttccct aggagccaga ctcccttcgg gtgctgcgct cgctctcacc 121 tgctctaaag cacctccgct ctcggacgct cccaattccg ccctcctgct cgacggcggg 181 acagtcgcag cctcggcagg cagcttgctc tcgccgctgc ggcttcaacc cagccccctc 241 cctcgctacg gctgggcgct cttgactccc gaccctcgcc tctgagcccg aggacggcga 301 tcagaccatg gtgacagcgc tgctccgtcg ctgcgctcgc tgaggccatg ccgcgttccc 361 gacgcgtccg tccgcgccta agggcgctgc tgctgctacc gccgctgctg ctgctccgaa 421 gcggccacgc gagcgacctg accgtggctg tggtgctgcc cgtgaccaac acctcgtacc 481 cgtggtcctg ggcgcgtgta gggccggcgg tggaactggc tctcgggagg gtgaaggctc 541 ggccggactt gctgccgggt tggacggtcc gtatggtgct gggcagcagc gagaacgcgg 601 cgggcgtctg ctccgacacc gctgcaccgc tggccgcggt ggatctcaag tgggagcaca 661 gccccgccgt gttcctgggc cccggctgcg tatactctgc tgccccggtg gaccgcttca 721 ccgcgcactg gcggttgccg ctgctgacgg ctggcgcccc ggctctgggc atcggggtga 781 aggatgagta cgcgttaacc acccgcacag gacccagcca tgtcaagctg ggcgacttcg 841 tgacggcgct gcatcgacgg ctgggctggg agcaccaggc gcttgtgctc tatgcagatc 901 ggctgggcga cgaccggccg tgcttcttca tagtggaggg gctgtacatg cgggtgcgtg 961 agcgactcaa catcacagta aatcaccagg agttcgtcga gggcgacccg gaccactaca 1021 ccaagctact gcggaccgtg cagcgcaagg gcagagttat ctacatctgc agttctccgg 1081 atgccttcag gaatctgatg cttttggccc tggatgctgg cctgactggg gaggactatg 1141 ttttcttcca cctggatgtg tttgggcaaa gccttcaggg tgctcagggc cctgttccag 1201 agaagccctg ggaaagagac gatgggcagg ataggagagc ccgccagcgc tttcaggctg 1261 caaaaattat tacttacaaa gaacccgata atcctgagta cttggaattc ctgaagcagc 1321 taaaactctt ggctgacaag aaattcaact tcaccatgga ggatggcctg aaaaatatca 1381 tcccagcatc cttccatgac gggctcctgc tctatgtcca ggcagtgaca gagactctgg 1441 cacagggggg cactgtcact gatggagaga acatcactca gcggatgtgg aaccgaagct 1501 tccaaggtgt gacaggatac ctgaaaattg atagaaatgg agatcgggac actgattctc 1561 ctctctggga tatggacccc gagacaggtg ccttcagggt tgtcctgaac tttaatggta 1621 cttcccagga gctgatggct gtgtcagaac acagattata ctggcctctg ggatacccac 1681 ctcctgacat ccctaaatgt ggctttgaca atgaggaccc agcctgcaac caagaccact 1741 tttccacact ggaggttctg gctttggtgg gcagcctctc tctggttagc tttctgatcg 1801 tgtctttctt catatacagg aagatgcagc tggaaaagga gctggtctca gagttgtggc 1861 gggtgcgctg ggaggacttg cagcccagca gcctggagag gcaccttcgg agcgctggca 1921 gtcggctgac cctgagtggg cgaggctcca attatggctc cctgctaacc acggagggcc 1981 agttccaagt ctttgccaag acagcatact ataagggcaa cctcgtggct gtgaaacgtg 2041 tgaaccggaa acgcattgag ttgacacgaa aagtcctgtt tgaacttaaa catatgcggg 2101 atgtgcagaa tgagcaattg accagatttg tgggagcttg taccgaccct cccaacatct 2161 gtatcctcac agagtactgt ccccgtggaa gcctacagga cattctagag aatgagagta 2221 ttaccctgga ctggatgttt cggtactcac tcaccaatga cattgtcaag ggaatgctct 2281 ttctacacaa cggggccatt tgttcccatg ggaacctcaa gtcatccaac tgcgtggtag 2341 atggacgttt tgtgttaaag atcacagact atgggctcga gagcttcaga gacccggagc 2401 cagagcaagg acacaccctc tttgccaaaa aactgtggac tgcacctgag ctcctgcgaa 2461 tggcttcccc acctgcccgt ggctcccaag ctggggatgt ctacagtttt ggtatcatcc 2521 ttcaggaaat tgccctaaga agtggggtct tctatgtgga aggtttggac ctcagcccaa 2581 aagagatcat tgagcgtgtg actcggggtg agcagccccc attccgacct tccatggatc 2641 tgcagagcca cctggaggaa ctggggcagc tgatgcagag gtgctgggca gaggatcctc 2701 aggagcggcc accctttcaa cagatccgcc tggcgctgcg caagttcaac aaggagaaca 2761 gcagcaacat cctggacaac ctgctgtcac gcatggaaca gtacgccaac aacctggagg 2821 aactggtaga ggagagaaca cagccttatc tggaggagaa gcgcaaagct gaggccctgc 2881 tttaccagat tctgcctcac tctgtggctg agcagctgaa gagaggcgag acagtccagg 2941 ctgaggcatt tgatagtgtt actatctatt tcagtgatat cgtgggcttt acagctcttt 3001 cagcagagag cacacccatg caggtggtca ccctgctcaa tgatctgtac acctgttttg 3061 atgctgtcat agacaacttt gatgtgtaca aggtagagac cattggtgat gcttacatgg 3121 tggtatcagg gctcccagtg aggaatggac agctccatgc ccgagaggta gcccgaatgg 3181 cacttgcact gctcgatgct gtacgctcct tccgcatcgg ccataggccc caggaacagc 3241 tgcgcttgcg cattggaatt cacacaggtc ctgtgtgtgc tggtgtggta gggctaaaga 3301 tgccccgata ctgcctcttt ggagacacag tcaacacagc ttcaagaatg gagtctaatg 3361 gggaagccct caggatccac ttgtcttcgg agaccaaggc tgtgctggaa gagttcgatg 3421 gtttcgagct ggagctccga ggggatgtgg aaatgaaggg caaaggcaag gttcgttcct 3481 attggctcct cggggaccgg ggatgcagct ctcgagcctg acctactgcc ctgctattcc 3541 ttgtcacctc ccctccctat cccagcaatg acacgggtct ccaacttccc cctctcccac 3601 agcagctcag ccactgtgga aagattaggg acctaaccag cgcagtcatc agatgtgacc 3661 tctgagagag gatggagatg gtggggactg gagggggact cctaagttta tagggctgac 3721 tgaaataccc agtcactccc gtagcacatg ccccgccccc cccccgcccc cccactcagc 3781 tgcctagcag acagtgattc cttctgccgc cctcaactta gctccactgt gagttagagg 3841 gagggaaatt gccacctgaa ggaaagagaa aagagattct cggggtttgc aggaggcagg 3901 cagtcctgtg tcacaaatac tcccctcact cccagtccac cacctgcccc accgacttcc 3961 cttcccacac agtgcactga ggagaagaga ggcatggggt tgccttgctt ctcctatgag 4021 caaaacccat taaagtcttt attcctgtg // LOCUS MUSMHH2DR 1415 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse MHC class I H-2Dr protein mRNA, complete cds. ACCESSION M34961 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse (strain B10.RIII) spleen, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1415) AUTHORS Zeling,C. and Pease,L.R. TITLE Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Zeling, 29-MAY-1990. Author address: C.Zeling Mayo Clinic 323 Guggenheim 200 1st St. SW Rochester, MN 55905 email: Pease@mayo.edu FEATURES from to/span description pept 1 1089 MHC H-2Dr protein precursor sigp 1 72 MHC H-2Dr protein signal peptide matp 73 1086 MHC H-2Dr protein BASE COUNT 298 a 398 c 442 g 277 t ORIGIN 1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact 61 cagacccgcg cgggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc 121 ctcggggagc cccggtacat agaagtcggc tacgtggacg acacggagtt cgtgcgcttc 181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag 241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagat tttccgagtg 301 aacctgagga ccctgctcag ctactacaac cagagcaagg gcggctctca cacgatccag 361 tggatgttcg gctgtgacgt ggagtcggac gggcgcctcc tccgcgggta ccagcagtac 421 gcatacgacg gctgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg 481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg ctggtgctgc agagagaaag 541 agggcctacc tagagggcac gtgcgtggag tggctcctca gacacctgaa gaacgggaac 601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaaa 661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc 721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca 781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctctcgggaa ggagcagaat 841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct 901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg 961 gccattattg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa 1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt 1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc 1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc 1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc 1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga 1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag 1381 aataagaatt tgaatgtggg tggctggaga gatgg // LOCUS MUSMHH2DS 1415 bp ss-mRNA ROD 21-JUL-1990 DEFINITION M.musculus MHC class I H-2Ds protein mRNA, complete cds. ACCESSION M34962 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE M.musculus (strain B10.RIII) spleen, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1415) AUTHORS Zeling,C. and Pease,L.R. TITLE Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Zeling, 29-MAY-1990. Author address: C.Zeling Mayo Clinic 323 Guggenheim 200 1st St. SW Rochester, MN 55905 email: Pease@mayo.edu FEATURES from to/span description pept 1 1089 MHC H-2Dr protein precursor sigp 1 72 MHC H-2Dr protein signal peptide matp 73 1086 MHC H-2Dr protein BASE COUNT 292 a 401 c 444 g 278 t ORIGIN 1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact 61 cagacccgcg ctggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc 121 ctcggggagc cccggtacat agaagtcggc tacgtggaca acacggagtt cgtgcgcttc 181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag 241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagag tttccgagtg 301 aacctgagga ccgcgctccg ctactacaac cagagcaagg gcggctctca cacggtccag 361 gtgatgtctg gctgtgacgt ggggtcggac gggcgcctcc tccgcgggta ccagcagcac 421 gcatacgacg gccgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg 481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg gtggtgatgc agagcattac 541 aaggcctacc tagagggcac gtgcgtggag tggctcctca gatacctgga gctcgggaac 601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaga 661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc 721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca 781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctcttgggaa ggagcagaat 841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct 901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg 961 gccattactg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa 1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt 1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc 1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc 1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc 1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga 1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag 1381 aataagaatt tgaatgtggg tggctggaga gatgg // LOCUS HUMCETP 1787 bp ss-mRNA PRI 21-JUL-1990 DEFINITION Human cholesteryl ester transfer protein mRNA, complete cds. ACCESSION M30185 KEYWORDS cholesteryl ester transfer protein; transfer protein. SOURCE Human adult liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1787) AUTHORS Drayna,D., Jarnagin,A.S., McLean,J., Henzel,W., Kohr,W., Fielding,C. and Lawn,R. TITLE Cloning and sequencing of human cholesteryl ester transfer protein cDNA JOURNAL Nature 327, 632-634 (1987) STANDARD simple staff_review FEATURES from to/span description pept 131 1612 cholesteryl ester transfer protein precursor /nomgen="CETP" /map="16q21" /hgml_locus_uid="LF0130L" sigp 131 181 cholesteryl ester transfer protein signal peptide matp 182 1609 cholesteryl ester transfer protein mRNA < 1 1787 CETP mRNA BASE COUNT 397 a 531 c 456 g 403 t ORIGIN 1 gtgaatctct ggggccagga agaccctgct gcccggaaga gcctcatgtt ccgtgggggc 61 tgggcggaca tacatatacg ggctccaggc tgaacggctc gggccactta cacaccactg 121 cctgataacc atgctggctg ccacagtcct gaccctggcc ctgctgggca atgcccatgc 181 ctgctccaaa ggcacctcgc acgaggcagg catcgtgtgc cgcatcacca agcctgccct 241 cctggtgttg aaccacgaga ctgccaaggt gatccagacc gccttccagc gagccagcta 301 cccagatatc acgggcgaga aggccatgat gctccttggc caagtcaagt atgggttgca 361 caacatccag atcagccact tgtccatcgc cagcagccag gtggagctgg tggaagccaa 421 gtccattgat gtctccattc agaacgtgtc tgtggtcttc aaggggaccc tgaagtatgg 481 ctacaccact gcctggtggc tgggtattga tcagtccatt gacttcgaga tcgactctgc 541 cattgacctc cagatcaaca cacagctgac ctgtgactct ggtagagtgc ggaccgatgc 601 ccctgactgc tacctgtctt tccataagct gctcctgcat ctccaagggg agcgagagcc 661 tgggtggatc aagcagctgt tcacaaattt catctccttc accctgaagc tggtcctgaa 721 gggacagatc tgcaaagaga tcaacgtcat ctctaacatc atggccgatt ttgtccagac 781 aagggctgcc agcatccttt cagatggaga cattggggtg gacatttccc tgacaggtga 841 tcccgtcatc acagcctcct acctggagtc ccatcacaag ggtcatttca tctacaagaa 901 tgtctcagag gacctccccc tccccacctt ctcgcccaca ctgctggggg actcccgcat 961 gctgtacttc tggttctctg agcgagtctt ccactcgctg gccaaggtag ctttccagga 1021 tggccgcctc atgctcagcc tgatgggaga cgagttcaag gcagtgctgg agacctgggg 1081 cttcaacacc aaccaggaaa tcttccaaga ggttgtcggc ggcttcccca gccaggccca 1141 agtcaccgtc cactgcctca agatgcccaa gatctcctgc caaaacaagg gagtcgtggt 1201 caattcttca gtgatggtga aattcctctt tccacgccca gaccagcaac attctgtagc 1261 ttacacattt gaagaggata tcgtgactac cgtccaggcc tcctattcta agaaaaagct 1321 cttcttaagc ctcttggatt tccagattac accaaagact gtttccaact tgactgagag 1381 cagctccgag tccatccaga gcttcctgca gtcaatgatc accgctgtgg gcatccctga 1441 ggtcatgtct cggctcgagg tagtgtttac agccctcatg aacagcaaag gcgtgagcct 1501 cttcgacatc atcaaccctg agattatcac tcgagatggc ttcctgctgc tgcagatgga 1561 ctttggcttc cctgagcacc tgctggtgga tttcctccag agcttgagct agaagtctcc 1621 aaggaggtcg ggatggggct tgtagcagaa ggcaagcacc aggctcacag ctggaaccct 1681 ggtgtctcct ccagcgtggt ggaagttggg ttaggagtac ggagatggag attggctccc 1741 aactcctccc tatcctaaag gcccactggc attaaagtgc tgtatcc // LOCUS HUMGHRA04 289 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human growth hormone receptor gene, exon 5. ACCESSION M28461 M26401 KEYWORDS growth hormone binding-protein; growth hormone receptor; transmembrane protein. SEGMENT 4 of 10 SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 289) AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R., Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I. TITLE Characterization of the human growth hormone receptor gene and the demonstration of a partial gene deletion in two patients with Laron-type dwarfism JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.I.Wood, 02-AUG-1989. FEATURES from to/span description pept + 59 + 231 growth hormone receptor, exon 5 /nomgen="GHR" /map="5p13-p12" /hgml_locus_uid="LA0219G" pre-msg < 1 > 289 growth hormone receptor mRNA and introns IVS < 1 > 58 GHR Intron D IVS 232 > 289 GHR Intron E BASE COUNT 90 a 45 c 55 g 99 t ORIGIN About 6 kb after segment 3. 1 acttaagcta caacatgatt tttggaacaa ttaatctttt tttaaccctt cattttagga 61 acactcaaga atggactcaa gaatggaaag aatgccctga ttatgtttct gctggggaaa 121 acagctgtta ctttaattca tcgtttacct ccatctggat accttattgt atcaagctaa 181 ctagcaatgg tggtacagtg gatgaaaagt gtttctctgt tgatgaaata ggtaaatcac 241 aggtttttgt ttcatttgac atagttttag actaaataaa tggggaagc // LOCUS HUMGHRA06 362 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human growth hormone receptor gene, exon 7. ACCESSION M28463 M26401 KEYWORDS growth hormone binding-protein; growth hormone receptor; transmembrane protein. SEGMENT 6 of 10 SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 362) AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R., Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I. TITLE Characterization of the human growth hormone receptor gene and the demonstration of a partial gene deletion in two patients with Laron-type dwarfism JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.I.Wood, 02-AUG-1989. FEATURES from to/span description pept + 75 + 240 growth hormone receptor, exon 7 /nomgen="GHR" /map="5p13-p12" /hgml_locus_uid="LA0219G" pre-msg < 1 > 362 growth hormone receptor mRNA and introns IVS < 1 74 GHR Intron F IVS 241 > 362 GHR Intron G BASE COUNT 117 a 58 c 76 g 111 t ORIGIN Undetermined number of bp after segment 5. 1 atacctgtag tgttcattgg cattgagttg ttgactcttt ggccaatatg gcgtttatat 61 ttttgtcttg aaagatggac cctatattga caacatcagt tccagtgtac tcattgaaag 121 tggataagga atatgaagtg cgtgtgagat ccaaacaacg aaactctgga aattatggcg 181 agttcagtga ggtgctctat gtaacacttc ctcagatgag ccaatttaca tgtgaagaag 241 gtaaaagaaa taaaagatta aaatagtagc taacctggct tttgtcaata taacagttga 301 ttcacccctg cactggtagt gtgttgtcca aatcaaaata tattaacatc agatatcagg 361 at // LOCUS HUMGHRA07 212 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human growth hormone receptor gene, exon 8. ACCESSION M28464 M26401 KEYWORDS growth hormone binding-protein; growth hormone receptor; transmembrane protein. SEGMENT 7 of 10 SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 212) AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R., Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I. TITLE Characterization of the human growth hormone receptor gene and the demonstration of a partial gene deletion in two patients with Laron-type dwarfism JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.I.Wood, 02-AUG-1989. FEATURES from to/span description pept + 59 + 149 growth hormone receptor, exon 8 /nomgen="GHR" /map="5p13-p12" /hgml_locus_uid="LA0219G" pre-msg < 1 > 212 growth hormone receptor mRNA and introns IVS < 1 58 GHR Intron G IVS 150 > 212 GHR Intron H BASE COUNT 53 a 33 c 43 g 83 t ORIGIN Undetermined number of bp after segment 6. 1 gaaactgtgc ttcaactagt cgtaattctg aaagcgaaat attcttgtgt gtttgcagat 61 ttctactttc catggctctt aattattatc tttggaatat ttgggctaac agtgatgcta 121 tttgtattct tattttctaa acagcaaagg taggatgtag gaaggtagta ttctttggta 181 ccttctgtac cagttgtgtt agaccttgcc at // LOCUS HUMGHRA08 198 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human growth hormone receptor gene, exon 9. ACCESSION M28465 M26401 KEYWORDS growth hormone binding-protein; growth hormone receptor; transmembrane protein. SEGMENT 8 of 10 SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 198) AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R., Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I. TITLE Characterization of the human growth hormone receptor gene and the demonstration of a partial gene deletion in two patients with Laron-type dwarfism JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.I.Wood, 02-AUG-1989. FEATURES from to/span description pept + 63 + 132 growth hormone receptor, exon 9 /nomgen="GHR" /map="5p13-p12" /hgml_locus_uid="LA0219G" pre-msg < 1 > 198 growth hormone receptor mRNA and introns IVS < 1 62 GHR Intron H IVS 133 > 198 GHR Intron I BASE COUNT 68 a 35 c 30 g 65 t ORIGIN About 4 kb after segment 7. 1 gctataattg agaatatgta gcttttaaga tgtcaaaacc aaaattttta tatgttttca 61 aggattaaaa tgctgattct gcccccagtt ccagttccaa agattaaagg aatcgatcca 121 gatctcctca aggtaactaa taattttatc taaagttgta gctagtacta attaacacct 181 gaagactcct gtcatatg // LOCUS HUMGHRA09 1881 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human growth hormone receptor gene, exon 10. ACCESSION M28466 M26401 KEYWORDS growth hormone binding-protein; growth hormone receptor; transmembrane protein. SEGMENT 9 of 10 SOURCE Human DNA, clones GG.33, GG.48, GG.20, GG.9, GG.47, GG.19. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1881) AUTHORS Godowski,P.J., Leung,D.W., Meacham,L.R., Galgani,J.P., Hellmiss,R., Keret,R., Rotwein,P.S., Parks,J.S., Laron,Z. and Wood,W.I. TITLE Characterization of the human growth hormone receptor gene and the demonstration of a partial gene deletion in two patients with Laron-type dwarfism JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 8083-8087 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.I.Wood, 02-AUG-1989. FEATURES from to/span description pept + 64 1035 growth hormone receptor, exon 10 /nomgen="GHR" /map="5p13-p12" /hgml_locus_uid="LA0219G" pre-msg < 1 > 1881 growth hormone receptor mRNA and introns IVS < 1 63 GHR Intron I BASE COUNT 607 a 366 c 362 g 546 t ORIGIN About 0.5 kb after segment 8. 1 gctaattcat ttaattatta tgagtttctt ttcatagatc ttcattttct ttctattttc 61 taggaaggaa aattagagga ggtgaacaca atcttagcca ttcatgatag ctataaaccc 121 gaattccaca gtgatgactc ttgggttgaa tttattgagc tagatattga tgagccagat 181 gaaaagactg aggaatcaga cacagacaga cttctaagca gtgaccatga gaaatcacat 241 agtaacctag gggtgaagga tggcgactct ggacgtacca gctgttgtga acctgacatt 301 ctggagactg atttcaatgc caatgacata catgagggta cctcagaggt tgctcagcca 361 cagaggttaa aaggggaagc agatctctta tgccttgacc agaagaatca aaataactca 421 ccttatcatg atgcttgccc tgctactcag cagcccagtg ttatccaagc agagaaaaac 481 aaaccacaac cacttcctac tgaaggagct gagtcaactc accaagctgc ccatattcag 541 ctaagcaatc caagttcact gtcaaacatc gacttttatg cccaggtgag cgacattaca 601 ccagcaggta gtgtggtcct ttccccgggc caaaagaata aggcagggat gtcccaatgt 661 gacatgcacc cggaaatggt ctcactctgc caagaaaact tccttatgga caatgcctac 721 ttctgtgagg cagatgccaa aaagtgcctc cctgtggctc ctcacatcaa ggttgaatca 781 cacatacagc caagcttaaa ccaagaggac atttacatca ccacagaaag ccttaccact 841 gctgctggga ggcctgggac aggagaacat gttccaggtt ctgagatgcc tgtcccagac 901 tatacctcca ttcatatagt acagtcccca cagggcctca tactcaatgc gactgccttg 961 cccttgcctg acaaagagtt tctctcatca tgtggctatg tgagcacaga ccaactgaac 1021 aaaatcatgc cttagccttt ctttggtttc ccaagagcta cgtatttaat agcaaagaat 1081 tgactggggc aataacgttt aagccaaaac aatgtttaaa ccttttttgg gggagtgaca 1141 ggatggggta tggattctaa aatgcctttt cccaaaatgt tgaaatatga tgttaaaaaa 1201 ataagaagaa tgcttaatca gatagatatt cctattgtgc aatgtaaata ttttaaagaa 1261 ttgtgtcaga ctgtttagta gcagtgattg tcttaatatt gtgggtgtta atttttgata 1321 ctaagcattg aatgactatg tttttaatgt atagtaaatc acgctttttg aaaaagcgaa 1381 aaaatcaggt ggcttttgcg gttcaggaaa attgaatgca aaccatagca caggctaatt 1441 ttttgttgtt tcttaaataa gaaacttttt tatttaaaaa actaaaaact agaggtgaga 1501 aatttaaact ataagcaaga aggcaaaaat agtttggata tgtaaaacat ttattttgac 1561 ataaagttga taaagatatt ttttaataat ttagacttca agcatggcta ttttatatta 1621 cactacacac tgtgtactgc agttggtatg acccctctaa ggagtgtagc aactacagtc 1681 taaagctggt ttaatgtttt ggccaatgca cctaaagaaa aacaaactcg ttttttacaa 1741 agccctttta tacctcccca gactccttca acaattctaa aatgattgta gtaatctgca 1801 ttattggaat ataattgttt tatctgaatt tttaaacaag tatttgttaa tttagaaaac 1861 tttaaagcgt ttgcacagat c // LOCUS HUMMHDQAR1 426 bp ds-DNA PRI 21-JUL-1990 DEFINITION Human MHC class II HLA-DQ-alpha chain gene, exon 1. ACCESSION M29613 KEYWORDS cell surface protein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Human (haplotype HLA-D/DR) adult male DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 426) AUTHORS Jonsson,A.-K., Hyldig-Nielsen,J.-J., Servenius,B., Larhammar,D., Andersson,G., Joergensen,F., Peterson,P.A. and Rask,L. TITLE Class II genes of the human major histocompatibility complex: Comparisons of the DQ and DX alpha and beta genes JOURNAL J. Biol. Chem. 262, 8767-8777 (1987) STANDARD simple staff_review FEATURES from to/span description pept 173 + 254 MHC HLA-DQ-alpha chain precursor, exon 1 /nomgen="HLA-DQA1" /map="6p21.3" /hgml_locus_uid="LY0151Z" sigp 173 241 MHC HLA-DQ-alpha chain signal peptide matp 242 + 254 MHC HLA-DQ-alpha chain IVS 255 > 426 MHC HLA-DQ-alpha, intron A BASE COUNT 120 a 90 c 108 g 108 t ORIGIN 1 atacaaactc ttcagctagt aactgagatg tcaccatggg ggatttttct aattggccaa 61 aacctgacct ggcagggttt ggtttgggtg tcttcagatt tccttgtctt gaggccctca 121 caattgctct acagctcaga acagcaactg ctgaggctgc cttgggaaga ggatgatcct 181 aaacaaagct ctgatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg 241 tgaagacatt gtgggtgagt gcatgagtga ggaatgttct ctggagctga aaaacagtaa 301 attgaaggaa aagagagaaa gcgatttgca gagaaattgt agagattcca agaccccttc 361 agtattaaga gaattaaaaa ttatagctgt tcctccttca ggaaaccaga gccccaacca 421 ctcttt // LOCUS SOYP34A 1287 bp ss-mRNA PLN 21-JUL-1990 DEFINITION Soybean oil bodies-associated protein P34 mRNA, complete cds. ACCESSION J05560 KEYWORDS oil bodies-associated protein. SOURCE Soybean (strain Century) seed, cDNA to mRNA, clone P34/1350. ORGANISM Glycine max Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 1287) AUTHORS Kalinski,A.J., Weiseman,J.M., Matthews,B.F. and Herman,E.M. TITLE Molecular cloning of a protein associated with soybean oil bodies which is similar to thiol proteases of the papain family JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.Herman, 18-MAY-1990. FEATURES from to/span description pept 3 1142 oil bodies-associated protein P34 precursor sigp 3 368 oil bodies-associated protein P34 signal peptide matp 369 1139 protein P34 mRNA < 1 1287 P34 protein mRNA BASE COUNT 407 a 240 c 297 g 343 t ORIGIN 1 ttatgggttt ccttgtgttg cttcttttct ccctcttagg tctctcttct agttccagca 61 tatcaactca tcgttccata ttggaccttg acctaaccaa gtttaccaca cagaaacagg 121 tgtcttcact gttccaacta tggaagagtg agcatggacg tgtctaccat aaccacgaag 181 aagaggcaaa gagacttgag attttcaaga ataactcgaa ctatatcagg gacatgaatg 241 caaacagaaa atcaccccat tctcatcgtt taggattgaa caagtttgct gacatcactc 301 ctcaagagtt cagcaaaaag tacttgcaag ctcccaagga tgtgtcgcag caaatcaaaa 361 tggccaacaa gaaaatgaag aaggaacaat attcttgtga ccatccacct gcatcatggg 421 attggaggaa aaaaggtgtc atcacccaag taaagtacca agggggctgt ggaaggggtt 481 gggcgttttc tgccacggga gccatagaac cagcacatgc aatagcaaca ggagaccttg 541 ttagcctttc tgaacaagaa ctcgtagact gtgtggaaga aagcgaaggt tgttacaatg 601 gatggcacta tcaatcgttc gaatgggttt tagaacatgg tggtattgcc actgatgatg 661 attatcctta cagagctaaa gagggtagat gcaaagccaa taagatacaa gacaaggtta 721 caattgacgg atatgaaact gtaataatgt cagatgagag tacagaatca gagacagagc 781 aagcgttctt aagcgccatc cttgagcaac caattagtgt ctcaattgat gcaaaagatt 841 ttcatttata caccggggga atttatgatg gagaaaactg tacaagtccg tatgggatta 901 atcactttgt tttacttgtg ggttatggtt cagcggatgg tgtagattac tggatagcga 961 aaaattcatg gggagaagat tggggagaag atggttacat ttggatccaa agaaacacgg 1021 gtaatttatt aggagtgtgt gggatgaatt atttcgcttc atacccaacc aaagaggaat 1081 cagaaacact ggtgtctgct cgcgttaaag gtcatcgaag agttgatcac tctcctcttt 1141 gaagccgtaa aggttcaata caacgagtgc ttgttttctt agggacaagc attgtactta 1201 tgtatgattc tgtgtaacca tgagtctcca cgttgtacta atgtgaaggg caaaaataaa 1261 acacacaaca agttcgtttt tctcaat // LOCUS XELRPS22A 531 bp ss-mRNA VRT 21-JUL-1990 DEFINITION X.laevis ribosomal protein S22, 40S subunit mRNA, complete cds. ACCESSION M34706 KEYWORDS nucleolar protein; ribosomal protein; ribosomal protein S22. SOURCE X.laevis stage 17 embryo, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 531) AUTHORS Keiper,B.D. and Wormington,M. TITLE Nucleotide sequence and 40S subunit assembly of Xenopus laevis ribosomal protein S22 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Wormington, 18-MAY-1990. Author Address [1]: M.Wormington University of Virginia Department of Biology Gilmer Hall Charlottesville, VA 22901 FEATURES from to/span description pept 131 490 ribosomal protein S22, 40S subunit mRNA < 1 531 ribosomal protein S22 mRNA BASE COUNT 158 a 117 c 124 g 132 t ORIGIN 1 tgtttaataa aagaaagtaa tagtaacttc aacctctaca cgtggctgtt cctgactcac 61 cgctgttcgc tcggcaagca tcggatcgca ccgtcggacg aataagtcgg tcagggaagt 121 gataagaaaa atggctttta aagacccagg caaagcccct gtagaccagg aagtggccat 181 ccatcgtatc aggattactt taacaagtcg taatgtgaag tctctggaaa aagtgtgtgc 241 tgatctgatc cgtggtgcca aagagaagaa cctgaaggtt aagggcccag tccgtatgcc 301 taccaagact cttcgtatca caaccagaaa aacaccttgc ggtgagggtt ccaaaacctg 361 ggatcgtttc cagatgcgca tccacaagcg cctgattgac ctgcacagtc cttccgagat 421 tgttaagcag atcacttcca tcagtatcga acctggtgta gaagttgaag ttactattgc 481 tgatgcataa atgacacttc tgtttaataa aagaaagtaa tgtggattgt t // LOCUS HUMSSBLA1 81 bp ss-mRNA PRI 21-JUL-1990 DEFINITION Human SS-B/La protein mRNA, partial cds. ACCESSION M35261 KEYWORDS SS-B/La antigen; SS-B/La protein; autoantigen; small nuclear ribonucleoprotein. SEGMENT 1 of 3 SOURCE Human fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 81) AUTHORS Kohsaka,H., Yamamoto,K., Fujii,H., Miura,H., Miyasaka,N., Nishioka,K. and Miyamoto,T. TITLE Fine epitope mapping the human SS-B/La protein: Identification of a distinct autoepitope homologous to a viral gag polyprotein JOURNAL J. Clin. Invest. 85, 1566-1574 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 81 SS-B/La protein (AA at 1) BASE COUNT 36 a 14 c 15 g 16 t ORIGIN 1 gaaatcagtg aagataaaac taaaatcaga aggtctccaa gcaaacccct acctgaagtg 61 actgatgagt ataaaaatga t // LOCUS HUMSSBLA2 155 bp ss-mRNA PRI 21-JUL-1990 DEFINITION Human SS-B/La protein mRNA, partial cds. ACCESSION M35263 KEYWORDS SS-B/La antigen; SS-B/La protein; autoantigen; small nuclear ribonucleoprotein. SEGMENT 2 of 3 SOURCE Human fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 155) AUTHORS Kohsaka,H., Yamamoto,K., Fujii,H., Miura,H., Miyasaka,N., Nishioka,K. and Miyamoto,T. TITLE Fine epitope mapping the human SS-B/La protein: Identification of a distinct autoepitope homologous to a viral gag polyprotein JOURNAL J. Clin. Invest. 85, 1566-1574 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 155 SS-B/La protein (AA at 3) BASE COUNT 81 a 19 c 30 g 25 t ORIGIN About 200 base pairs after segment 1. 1 agaagtacaa agaaacagac ctgctaatac ttttcaagga cgattacttt gccaaaaaaa 61 atgaagaaag aaaacaaaat aaagtggaag ctaaattaag agctaaacag gagcaagaag 121 caaaacaaaa gttagaagaa gatgctgaaa tgaaa // LOCUS HUMSSBLA3 192 bp ss-mRNA PRI 21-JUL-1990 DEFINITION Human SS-B/La protein mRNA, partial cds. ACCESSION M35262 KEYWORDS SS-B/La antigen; SS-B/La protein; autoantigen; small nuclear ribonucleoprotein. SEGMENT 3 of 3 SOURCE Human fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 192) AUTHORS Kohsaka,H., Yamamoto,K., Fujii,H., Miura,H., Miyasaka,N., Nishioka,K. and Miyamoto,T. TITLE Fine epitope mapping the human SS-B/La protein: Identification of a distinct autoepitope homologous to a viral gag polyprotein JOURNAL J. Clin. Invest. 85, 1566-1574 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 192 SS-B/La protein (AA at 1) BASE COUNT 90 a 23 c 50 g 29 t ORIGIN About 170 base pairs after segment 2. 1 gccaaggaag cattgggtaa agccaaagat gcaaataatg gtaacctaca attaaggaac 61 aaagaagtga cttgggaagt actagaagga gaggtggaaa aagaagcact gaagaaaata 121 atagaagacc aacaagaatc cctaaacaaa tggaagtcaa aaggtcgtag atttaaagga 181 aaaggaaagg gt // LOCUS RATPPTA1 1757 bp ds-DNA ROD 21-JUL-1990 DEFINITION Rat tachykinin (PPT) gene, exons 1 and 2. ACCESSION M34159 KEYWORDS alternative splicing; neurokinin A; neuropeptide K; neuropeptide gamma; substance P; tachykinin. SEGMENT 1 of 4 SOURCE Rat (strain Sprague-Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1757) AUTHORS Carter,M.S. and Krause,J.E. TITLE Structure, expression and some regulatory mechanisms of the rat preprotachykinin gene encoding substance P, neurokinin A, neuropeptide K, and neuropeptide gamma JOURNAL J. Neurosci. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Krause, 10-MAY-1990. FEATURES from to/span description pept 1400 + 1522 beta-preprotachykinin (PPT), exon 2 (first expressed exon) sigp 1400 1456 beta-tachykinin signal peptide (1456 could be 1471) pep$ 1400 + 1522 gamma-preprotachykinin (PPT), exon 2 (first expressed exon) sigp 1400 1456 gamma-tachykinin signal peptide (1456 could be 1471) pep$ 1400 + 1522 alpha-preprotachykinin (PPT), exon 2 (first expressed exon) sigp 1400 1456 alpha-tachykinin signal peptide (1456 could be 1471) pre-msg 866 > 1757 PPT mRNA and introns IVS 966 1390 PPT intron A IVS 1523 > 1757 PPT intron B BASE COUNT 411 a 443 c 470 g 433 t ORIGIN 1 ctgcagagct ccaaaggtaa gcatccagcc tttctagtcc cccaacaagg ctaaagggga 61 gagaggcaca attatcctct tcccacccct tctgccttca gggtgtgcct gggaagaagc 121 tgtaggggaa caaaagatgc cttagaatgg ctgatgggta agttctacat gagaaaggag 181 gtttaaattc ctctttcccc taaatgtaaa acaaacctgc cttcatcctc tgaagcggga 241 gaccggaaac acttttgcag tgctagagaa atgagaatat tctgactgat ttggtgggga 301 ggggggttgg gggggtgtgt tccagcccta gatataacac ctcataaacc ttaagacaca 361 taaagtagaa atgaaaggaa aaccccgctt gcttcatccc tctgaagtgc ttgctggtgt 421 cttagtatta ttcacaaggt tttgctgctc aagttatttg gctgtcctca aagcgcaata 481 ttccctgatg cctcttgaga gaaaagttcc ctaagtccga agcatgagtc acttcgctca 541 gttttgatga gtaatctcag gtgtcactga accttgttcg gaagaagagg ggaggggggc 601 gtcagatttg cagacggaag aaaacaggtc tctctggatt ggatggcgag acctcgactt 661 ccctaaaatt gcgtcatttc gaacccaatt tggtccagat gttatggact ccgacgggtt 721 accgtctcgg aaactctatc acgcaagcaa aaggcgaggg ggcggctaat taaatattga 781 gcagaaagtc gcgtggggag agtgtcacgt ggctctccag gctcatcacg cctgagataa 841 ataaggcgaa gcaggagcag ggactagagc gcactcggac cagctccact ccagcaccgc 901 ggcggaggag agcgaggagc gcccagcaag tgcgcacctg cggagcatca ccgggtccga 961 ccgcagtgag taccactccc gcctgccatc tgccgctgcc tggggcccgt cgtgccttgg 1021 tcagaggtgg cgtccgcccg gggtttcacc tgcagcagcc acagacccgg agagaaactg 1081 tgcaaatacc aacacctctc tttgtctgcc ttcaggcttc ggagtgtggg tcagtgggta 1141 gggggctggg acgttgagag gcaaagagag gaggacttga ggctctttgg cacgtcagta 1201 gccttcttaa agggtttgga gaaatgttcg acctggtggg tttttgtttt tgttttaatc 1261 ttggtttagc gagacctctt ctttcttcct tgtggcattg aaattcagct gctctaacgt 1321 cttgggtggc cccccgccta catatgtata acgattcttt ctctccttct ctgtctccta 1381 ctctctccag aaatccaaca tgaaaatcct cgtggcggtg gcggtctttt ttctcgtttc 1441 cactcaactg tttgcagagg aaatcggtgc caacgatgat ctaaattatt ggtccgactg 1501 gtccgacagt gaccaaatca aggtgagacc aaccgggcgt gggagctgtt gccgtcccct 1561 tgtggtagca ccctgactag cctagctgga gcctgagagg ctcattggat tgtaagggcg 1621 tcatacccgg acaaagcaca aggcttattg cccagaacta agtttctccc tgagggagga 1681 aaatgtcctg ggttcagtgt ctccccaaca gaaaaggagt tcttctttcc ttcctccctc 1741 cctccctccc tccctcc // LOCUS RATPPTA2 875 bp ds-DNA ROD 21-JUL-1990 DEFINITION Rat tachykinin (PPT) gene, exons 3 and 4. ACCESSION M34160 KEYWORDS neurokinin A; neuropeptide K; neuropeptide gamma; substance P; tachykinin. SEGMENT 2 of 4 SOURCE Rat (strain Sprague-Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 875) AUTHORS Carter,M.S. and Krause,J.E. TITLE Structure, expression and some regulatory mechanisms of the rat preprotachykinin gene encoding substance P, neurokinin A, neuropeptide K, and neuropeptide gamma JOURNAL J. Neurosci. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Krause, 10-MAY-1990. FEATURES from to/span description pept + 187 283 beta-preprotachykinin (PPT), exon 3 722 + 766 beta-preprotachykinin, exon 4 matp 235 267 substance P pep$ + 187 + 283 gamma-preprotachykinin (PPT), exon 3 matp 235 267 substance P pep$ + 187 283 alpha-preprotachykinin (PPT), exon 3 722 + 766 alpha-preprotachykinin, exon 4 matp 235 267 substance P pre-msg < 1 > 875 PPT mRNA and introns IVS < 1 186 PPT intron B IVS 284 721 PPT intron C IVS 767 > 875 PPT intron D BASE COUNT 213 a 191 c 219 g 252 t ORIGIN 1 ccctgataga ctttggggga gaaggaaatc tcaggcgggt ccgggttggg gaaagtgctg 61 catagattct agtatggccc ggtctcatgc tagcggctgc ttctgtggct ctgaggaaag 121 ctcaggcaca ccgtgaatgt ggaccttgtt cacttaattt gtctactcgc tggtttgtcc 181 ctccaggagg caatgccgga gccctttgag catcttcttc agagaatcgc ccgaagaccc 241 aagcctcagc agttctttgg attaatgggc aaacgggatg ctggtgagat gagcaatagt 301 cactaggtct atcaggctac ccggtctgca tgcctgctcc ttcctggagc acccaagtgt 361 ctcttgtacc gcgagtataa ttttacatgc ctaaagactt ttgattccct gaagtcttcg 421 atccactaac acttgcacta gctcaaagtg gaggggaggg gggatccatt tcccttgctt 481 cacagcacca aggtgaccca aattggaaac taaccttagc taaacactcc cctgactaaa 541 gatcccaatt acaagggtcc tcaggtatgc taatggtcca ggctcaggtg aaagatgttt 601 tcttggattg ggttagattt ggctatacct cgctacagtg ataggaattt tcaattgcct 661 tgagttgatg gcacagtcag tggagcttgt gaagagttaa tgtcaattca tctcttgtca 721 gattcctcaa ttgaaaaaca agtggccctg ttaaaggctc tttatggtaa acatccctat 781 aatcttattt cctatatgat gatcatgtaa aagtatttga tggtataatc agacttgcag 841 tgtgcctata gagagtgtgt gtgtgtgtgt gtgtg // LOCUS RATPPTA3 2503 bp ds-DNA ROD 21-JUL-1990 DEFINITION Rat tachykinin (PPT) gene, exons 5 and 6. ACCESSION M34161 KEYWORDS neurokinin A; neuropeptide K; neuropeptide gamma; substance P; tachykinin. SEGMENT 3 of 4 SOURCE Rat (strain Sprague-Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2503) AUTHORS Carter,M.S. and Krause,J.E. TITLE Structure, expression and some regulatory mechanisms of the rat preprotachykinin gene encoding substance P, neurokinin A, neuropeptide K, and neuropeptide gamma JOURNAL J. Neurosci. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Krause, 10-MAY-1990. FEATURES from to/span description pept + 140 163 beta-preprotachykinin (PPT), exon 5 1493 + 1546 beta-preprotachykinin, exon 6 matp 1495 1524 neurokinin pep$ + 140 163 gamma-preprotachykinin (PPT), exon 5 1493 + 1546 gamma-preprotachykinin, exon 6 matp 1495 1524 neurokinin pep$ + 140 + 163 alpha-preprotachykinin (PPT), exon 5 pre-msg < 1 > 2503 PPT mRNA and introns IVS < 1 139 PPT intron D IVS 164 1492 PPT intron E IVS 1547 > 2053 PPT intron F BASE COUNT 861 a 454 c 428 g 760 t ORIGIN 1 catctacttt agtcctcaga acagcacata aataagcgtt aaatatttcg aaaatcaact 61 ccttaagttc tctggaaaag ataaaatagc tttaaaaaag aaacccatgt atctgctaac 121 tggctctctt ttcttctagg gcatggtcag atctctcaca aaagtaagtt cacaatcatt 181 tgacatttat caaatgctaa tgttaatgat agtgaatatt gcatcctgtg ttaccttttg 241 cagcacaaca tagagcccaa tggttttgat tgagataatg tagggaaaga aagacagaat 301 gtgatttgct gaatgatcac tgactagact ctttatgtag acagattacg taccaaatgc 361 acattacatt gagagaaaag atacactata gtttttagtt tatattctct ctagcgtaat 421 cacatgagca ctttccagcc tgtccagaag acgagagaga gagagagaga gagagagaga 481 gagagagaga gagagaacac ttatatacac cttctaattc atattcatga actacctttt 541 ataatggcag gaaggggatt aagtagactg ggtgaatatt tgagtagaaa atgcactcga 601 aatgtgatag agatacaggt ttattttccc ccttgatccc ataattgaat gggaacctgg 661 gagagaagag gtgcagattg cagaactgga gggtccttag agctatttcc tagttaattg 721 gatccagagg agacaggaag tcccttgggc ccctggaaat gttgcctgct tctatgaggg 781 tcatgaattg gactaaccct ggcagggtac accccaaacc attaacactg tggtctcaac 841 taacacaatg gaaactaaga aaaatcctcc ccgttgcagt atgataataa ccttccccaa 901 agccttcatc tttggggaaa aaaaatgaca tttactaagg ggttaaattt taaagaaatc 961 ttatacagca aaaaaagtaa taaaatgaat gtatcttatt aatgttaaaa aacaaattgc 1021 tttcaggaac agtattttta agtctaaatc tactaaatgt gaccccttag aatattccaa 1081 tactaaggat ctgacttaaa gttcccttgt tgagaaatgc aacatatttg gtaataaaga 1141 cataaacgtg gatgtctttt cccacacaaa aacatataat taaagtcccc caaaattcaa 1201 agctactcca gtggcaacca tatttaaaac ttcataatag ataaaacata atacagataa 1261 aagattctgc taatgtctag ataaagaact gagaaaagta ctcacaatat gttgttgcag 1321 tgtggtttgc aaggcttata taagtctttg tgggttcacc agtgggtgac atagaaacat 1381 tcatttattt attctacagg catatattca gaagcttgtt aactttgtac tagttatgag 1441 tttatttctt caaaaacata catactaaaa tacccctaaa tgtattttcc agggcataaa 1501 acagattcct ttgttggact aatgggcaaa agagctttaa attctggtat gtataaaatc 1561 atgtctgaaa atatgtaaat caacttctgt aatttttacg gaaagtaaca attttcaaca 1621 tactaaaaaa aaaaaaagga gccaatagac ttaacatgag ccatttctgt agacaaattt 1681 tcataggagc gtttgtagcc aagagtgggc agaaagctga ttaagctgat accaccacaa 1741 tgtaagtata tttctctcaa agtgccattc tgtccagcat ctgccccact tagtcactac 1801 atatcctgaa tctttggtct agtctgacac tctttttcac agatattctt aataaaatgt 1861 cctggaccct tagagagaga tgagagcttt tttacttaca aactgaagtg atgtggtact 1921 ttcatcagac taaccattta gtgattttct actagaaaaa taaatagata atgacaagca 1981 aaatgtttct aaaattactg tctctctctg aagttatgga agctcaaaat ccctgtctta 2041 atgggtggca aagaggctga gtgtagtggt gtccattcta acgccttcaa accattcttc 2101 ctgttccaga tttagaccta catttaaaag taccacacgc cccaaaccca ttttaaacct 2161 aagcgggatt ttccttggcc cagctgcata atctctcatt tatcctgctt gaactttcta 2221 cataaatgcc acagcatatc tctaaagccc ctcactgttc acctcattta atcaattata 2281 ttcaatgtta aataatccca ttattgtttt catgaaaatt ggctatcagt gaaaatgggt 2341 taaaatgata aacacctgat tagaaattaa ccagcatctg tttatgatat ttagttcaca 2401 gaaaatttat aagatgtgct ggcttaattt ttaaatatct gactcatccc caaaataggt 2461 cactctatag gtatcttaat caatgtagac atattttata ttt // LOCUS RATPPTA4 779 bp ds-DNA ROD 21-JUL-1990 DEFINITION Rat tachykinin (PPT) gene, exon 7. ACCESSION M34162 KEYWORDS neurokinin A; neuropeptide K; neuropeptide gamma; substance P; tachykinin. SEGMENT 4 of 4 SOURCE Rat (strain Sprague-Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 779) AUTHORS Carter,M.S. and Krause,J.E. TITLE Structure, expression and some regulatory mechanisms of the rat preprotachykinin gene encoding substance P, neurokinin A, neuropeptide K, and neuropeptide gamma JOURNAL J. Neurosci. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Krause, 10-MAY-1990. FEATURES from to/span description pept + 7 56 beta-preprotachykinin (PPT), exon 7 pep$ + 7 56 gamma-preprotachykinin (PPT), exon 7 pep$ + 7 56 alpha-preprotachykinin (PPT), exon 7 pre-msg < 1 701 PPT mRNA and introns IVS < 1 6 PPT intron F BASE COUNT 228 a 150 c 159 g 242 t ORIGIN 1 ctgcagtggc ttatgaaaga agcgcaatgc agaactacga aagaaggcgt aaataaaccc 61 tgtaacgcac tatctattca tctccatctg tgtccgcgag cagtgagcgg taaaataaaa 121 atgtgcgcta tgaggaatga ttatttattt aatatcaaat gttgttatga gtgaaaaact 181 caaaaaagtg tttatttttt catattgtgc caataagcat tgtaattcta atgtggtgac 241 ctcctcagac agaagtagaa attagttgta acttcagcaa agcacagtgt tgatggagtt 301 gtacaagttt gccagcgatg caagtctcca aagacagaaa ggctgctgtg aggcagtgca 361 ggcggctgct gctggaggca gagaaactcc tgtgtgtctt gcgcttccct tggttgcttt 421 tatcctaatg atgtactgag agtttggtat ctgactctat ttgtatccta gcagcatgtt 481 tcctgtgttg tgactatata gagatgtttt taaaagtttc aatgtacttc tctggtcttc 541 agtcattgta tgatgtgttg tgatagctac cattttaaat aaaagaatgt atcttcagga 601 attttgtcac attctttcat taatatttat gctaaccaca ccatctctat taggatgagt 661 gtgatcaagc tgtttgggta aacacttcta tctctctcag gcacagtgca aattgagtgt 721 gcgcgcaaca cacacacaca cacacacaca cacacacaca tacctccaca cacattttt // LOCUS MXAFRZE 2493 bp ds-DNA BCT 21-JUL-1990 DEFINITION M.xanthus gliding motility regulatory protein (frzE) gene, complete cds. ACCESSION M35192 KEYWORDS gliding motility regulatory protein. SOURCE M.xanthus (strain DZF1) DNA. ORGANISM Myxococcus xanthus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; Myxococcaceae. REFERENCE 1 (bases 1 to 2493) AUTHORS McCleary,W.R. and Zusman,D.R. TITLE FrzE of Myxococcus xanthus is homologous to both CheA and CheY of Salmonella typhimurium JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.R.McCleary, 05-JUN-1990. FEATURES from to/span description pept 64 2397 gliding motility regulatory protein (frzE) BASE COUNT 375 a 848 c 926 g 344 t ORIGIN 1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg 61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc 121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag 181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt 241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag 301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc 361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc 421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg 481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc 541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc 601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc 661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc 721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc 781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg 841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg 901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc 961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc 1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag 1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg 1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg 1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc 1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc 1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg 1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc 1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag 1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc 1561 accatcaccc tgcgcctgcc gcagtcgctg gcgttgatga aggtgctgct ggtgcgcctg 1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg 1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg 1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc 1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac 1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc 1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc 1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc 2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag 2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag 2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg 2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg 2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc 2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga 2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg 2461 caaggggctg cgtgcgctcg cggcccgggg cct // LOCUS RATPCBB 463 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Rat PCB-binding protein mRNA, complete cds.. ACCESSION J05536 KEYWORDS PCB binding protein; binding protein. SOURCE Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone pPCB6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 463) AUTHORS Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J., Gillner,M., Gustafsson,J.-A. and Lund,J. TITLE Cloning, structure, and expression of a rat binding protein for polychlorinated biphenyls: Homology to the hormonally regulated progesterone-binding protein uteroglobin JOURNAL J. Biol. Chem. 265, 1-4 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.Lund, 15-JUN-1990. FEATURES from to/span description pept 58 348 PCB binding protein precursor sigp 58 114 PCB binding protein signal peptide matp 115 345 PCB binding protein BASE COUNT 117 a 156 c 84 g 106 t ORIGIN 1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg 61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg 121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat 181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag 241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg 301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat 361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta 421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc // LOCUS YSBCPR 2990 bp ds-DNA PLN 21-JUL-1990 DEFINITION C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds. ACCESSION M35199 KEYWORDS NADPH-cytochrome P450 reductase. SOURCE C.tropicalis (ATCC 750) DNA, clone pTS1. ORGANISM Candida tropicalis Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 2990) AUTHORS Sutter,T.R., Sanglard,D. and Loper,J.C. TITLE Isolation and characterization of the alkane-inducible NADPH- cytochrome P450 oxidoreductase gene from Candida tropicalis: Identification of invariant residues within similar amino acid sequences of divergent flavoproteins JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by T.R.Sutter, 15-JUN-1990. Author name and address T.R.Sutter CIIT, P.O. Box 12137 Research Triangle Park, NC 27709 FEATURES from to/span description pept 554 2596 NADPH-cytochrome P450 reductase signal 198 204 TATA box BASE COUNT 988 a 442 c 558 g 1002 t ORIGIN 1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa 61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg 121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag 181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag 241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag 301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa 361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata 421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta 481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc 541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt 601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt 661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa 721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa 781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc 841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt 901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac 961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac 1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg 1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt 1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa 1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga 1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt 1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga 1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa 1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat 1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt gatactgtta ttgaattgaa 1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat 1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg 1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga 1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc 1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt 1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt 1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac 1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca 2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag 2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg 2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg 2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt 2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa 2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa 2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc 2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga 2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca 2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta 2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga 2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat 2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc 2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta 2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt 2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta // LOCUS HUMCYPIIF 1825 bp ss-mRNA PRI 21-JUL-1990 DEFINITION Human cytochrome P450IIF1 protein (CYP2F) mRNA, complete cds. ACCESSION J02906 KEYWORDS cytochrome P450. SOURCE Human lung, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1825) AUTHORS Nhamburo,P.T., Kimura,S., McBride,O.W., Kozak,C.A., Gelboin,H.V. and Gonzalez,F.J. TITLE The human CYP2F gene subfamily: Identification of a cDNA encoding a new cytochrome P450, cDNA-directed expression and chromosome mapping JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.T.Nhamburo, 18-MAY-1990. FEATURES from to/span description pept 56 1531 cytochrome P450IIF1 /nomgen="CYP2F" /map="19q13.1-q13.2" /hgml_locus_uid="LV0170W" mRNA < 1 1813 CYP2F mRNA BASE COUNT 356 a 596 c 470 g 403 t ORIGIN 1 gcaggctcag cgcatcccag ccagtgtctc ctgcagctca gcagctgcct tcaccatgga 61 cagcataagc acagccatct tactcctgct cctggctctc gtctgtctgc tcctgaccct 121 aagctcaaga gataagggaa agctgcctcc gggacccaga cccctctcaa tcctgggaaa 181 cctgctgctg ctttgctccc aagacatgct gacttctctc actaagctga gcaaggagta 241 tggctccatg tacacagtgc acctgggacc caggcgggtg gtggtcctca gcgggtacca 301 agctgtgaag gaggccctgg tggaccaggg agaggagttt agtggccgcg gtgactaccc 361 tgcctttttc aactttacca agggcaatgg catcgccttc tccagtgggg atcgatggaa 421 ggtcctgaga cagttctcta tccagattct acggaatttc gggatgggga agagaagcat 481 tgaggagcga atcctagagg agggcagctt cctgctggcg gacgtgcgga aaactgaagg 541 cgagcccttt gaccccacgt ttgtgctgag tcgctcagtg tccaacatta tctgttccgt 601 gctcttcggc agccgcttcg actatgatga tgagcgtctg ctcaccatta tccgccttat 661 caatgacaac ttccaaatca tgagcagccc ctggggcgag ttgtacgaca tcctagaccc 721 cagattcccg agcctcctgg actgggtgcc tgggccgcac caacgcatct tccagaactt 781 caagtgcctg agagacctca tcgcccacag cgtccacgac caccaggcct cgtctccccg 841 ggacttcatc cagtgcttcc tcaccaagat ggcagaggag aaggaggacc cactgagcca 901 cttccacatg gataccctgc tgatgaccac acataacctg ctctttggcg gcaccaagac 961 ggtgagcacc acgctgcacc acgccttcct ggcactcatg aagtacccaa aagttcaagc 1021 ccgcgtgcag gaggagatcg acctcgtggt gggacgcgcg cggctgccgg cgctgaagga 1081 ccgcgcggcc atgccttaca cagacgcggt gatccacgag gtgcagcgct ttgcagacat 1141 catccccatg aacttgccgc accgcgtcac tagggacacg gcctttcgcg gcttcctgat 1201 acccaagggc accgatgtca tcaccctcct taacaccgtc cactacgacc ccagccagtt 1261 cctgacgccc caggagttca accccgagca ttttttggat gccaatcagt ccttcaagaa 1321 gagtccagcc ttcatgccct tctcagctgg gcgccgtctg tgcctgggag agctgctggc 1381 gcgcatggag ctctttctgt acctcaccgc catcctgcag agcttttcgc tgcagccgct 1441 gggtgcgccc gaggacatcg acctgacccc actcagctca ggtcttggca atttgccgcg 1501 gcctttccag ctgtgcctgc gcccgcgcta acgccccggc ccttccagat tcgcctgtga 1561 gcgatgaggc ccacccatgt gggttgctac gtccccttct tggtccacag tctgccctca 1621 tccctctggc agtcacgctg tcttccctgc atgctgtgcc tgccgcgtgc ccttccccca 1681 tccctccaat ctgtgccccg tctgcagggc agaggcagat gtggcatgtc tttttgtacc 1741 cacagagctt gttctatggc acgccctttt ctaggctttt tgtatcattt cttagtacat 1801 tgtaatagat tcaaaccagt cttgg // LOCUS MUSADA3A 891 bp ds-DNA ROD 21-JUL-1990 DEFINITION Mouse adenosine deaminase (ada) gene, 3' end. ACCESSION J05523 KEYWORDS adenosine deaminase. SOURCE Mouse fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 891) AUTHORS Maa,M.-C., Chinsky,J.M., Ramamurthy,V., Martin,B.D. and Kellems,R.E. TITLE Identification of transcription stop sites at the 5' and 3' ends of the murine adenosine deaminase gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.Ramamurthy, 18-MAY-1990. BASE COUNT 241 a 207 c 262 g 181 t ORIGIN 1 atagctggcc accttgcctt gattctatca aaggcattcc agtcccagca tcaagcacat 61 tctgtcctta tgcctccctt tcagacatca gtcatggtgg ctagcaagca tcatgtactg 121 ggaggaccca aatacacaaa gactctcagt aaggcaaacg gtgttgtaaa ggcctacatg 181 aggggactag tgatggctaa ggatgaaggc aggcagcaaa caaagctgat gagagctgag 241 cagagcacag gagccttgaa gagccccttg gcactgtgcc cggtgtgcca gctgggtcca 301 ctgctcattc ggttggacag tgagcagccg ttgagtaggt acccgtcacg tgccaaacac 361 tgttagtcac agagggcatg agacagcctg tgctggaaca gctcagttca tagggccatg 421 gagatgggga gaaaggggcg cctctgtcag agacaagctg tggtctggga aggccttagc 481 actaaaagca ccacaatgag aagcaaccgc cagaagcagg gcccgcaggc ctttgttcca 541 gctgcaaaga gaaaggaaaa agtggggaat aagagttggg gctgcggagg gggtggggag 601 cattgtgcag gttccgtact tgaacagaaa gcagggacca acacaaggaa ggccaaaggg 661 ctcagcctgg gaaactttgt gaggctatgg aggtaagggg cagggtattc tcccgaggca 721 tagagactgt cataccaatg atggtcattg gctgggcgag cctaagaaga ctccagtttc 781 ccttgacctc agttaaatct attacatgag aaactgggtc cctgtcccca acattgtccc 841 tgttccgtgt tcctaagtca tttctgggca ataaggactg agaactctag a // LOCUS YSCG6PD 3782 bp ds-DNA PLN 21-JUL-1990 DEFINITION S.cerevisiae glucose-6-phosphate dehydrogenase (ZWF1) gene and ORF, complete cds. ACCESSION M34709 KEYWORDS glucose-6-phosphate dehydrogenase. SOURCE S.cerevisiae (strain S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3782) AUTHORS Nogae,I. and Johnston,M. TITLE Isolation and characterization of the ZWF1 gene of Saccharomyces cerevisiae, encoding glucose-6-phosphate dehydrogenase JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Johnston, 18-MAY-1990. FEATURES from to/span description pept 1585 3102 glucose-6-phosphate dehydrogenase (ZWF1) (EC 1.1.1.49) pept 246 1037 pot. ORF mRNA 1472 > 3102 ZWF1 mRNA BASE COUNT 1066 a 899 c 887 g 930 t ORIGIN 1 atcgattagc aaaaccgtag agaagatcat tgcgcataag aaacaaaaag aaaataccgg 61 tgcggaccgc aagcctctat tatctgctgt gtgtccggga tttcttatat acacggaaaa 121 gaccaaaccg cagttggtgc ctatgctgct caatgtcaag tcgcctcagc aaattacggg 181 ctcgttgatc agagctacgt tcgagagtct ctatcgcgcg tgagtccttt taccatctgt 241 cgctgatgcc atgttttgac aagaaactag aagcgtcaag gccagaatcg cttgacgatg 301 ggattgactg tgtcattacg ccacgtgaaa tagtgacaat gttacaagag ttgaatttag 361 acttcaagtc gtttcttact gaggacacaa gtttatatgg cgctatcacc gccgggtggg 421 acccgcgagt ccactgggcg tcgaatctgg gaggcacttg tggtggctat gcctaccagt 481 acgtaacggc tgtccaacga ctacacccag gaagtcagat gatagttctg gagggtagaa 541 acagcgacat tgtcgagtac cgattgctgc acgatgatcg cataatagcc gccgccagcg 601 agctttccgg gtttagaaac atccaaaatc tggtgcgtaa actgacctcg ggctccgggt 661 cagagcgcaa gaggaacatc agctctgcgg aagagagacg aacaggtcca aaggcgaact 721 cacgcgagat ggcgccagcc gccacggccg atccctacca ctcggactac atcgaggtaa 781 acgcgtgtcc gggtgcatgc atgaacggtg gtggcctact caatggcgag caaaacagtc 841 tcaagcggaa gcagttggtt cagacactga ataagcgtca cggagaggag ctagcaatgg 901 tagatcctct aaccctggga cccaagctag aagaggcggc accgcccgct ttcgctagag 961 tacgtcttcg cgcccgtcaa gcaggccgtc gaaaaggatc tcgtctctgt tgggagcacc 1021 tggtaagtaa ggtgtagttt tgcacccgtg tacataagcg tgaaatcacc acaaactgtg 1081 tgtatcaagt acatagtgac atttaaataa tagcaagaac aacaataata gtagcgctac 1141 tggaagcacc acgtaatagt ggaaaagaac tggaaaaacc gctataagat gcatactccg 1201 gcggtcttac gcggagatac aagcttccaa cggtgctaaa agcccggttt cggctcggcc 1261 ggaggaggaa gagagacgaa aaaaaaaaaa atgactaaaa aaaaaatgga atattattaa 1321 tgtgggattt ttggctcaag gtgtggtggc cccttttcta agggtggcga attcttcaat 1381 gtacggaaaa ctcgccaagg ctatcccata tataagcaaa ctgtgggttc atctatatac 1441 cgacacataa cacctaaagt ggcttcctcc tgcccctctc tcccttttct ccactcaccc 1501 ctccttctcc cccttccccc tctccaattg gctgtataga cagaaagagt aaatccaata 1561 gaatagaaaa ccacataagg caagatgagt gaaggccccg tcaaattcga aaaaaatacc 1621 gtcatatctg tctttggtgc gtcaggtgat ctggcaaaga agaagacttt tcccgcctta 1681 tttgggcttt tcagagaagg ttaccttgat ccatctacca agatcttcgg ttatgcccgg 1741 tccaaattgt ccatggagga ggacctgaag tcccgtgtcc taccccactt gaaaaaacct 1801 cacggtgaag ccgatgactc taaggtcgaa cagttcttca agatggtcag ctacatttcg 1861 ggaaattacg acacagatga aggcttcgac gaattaagaa cgcagatcga gaaattcgag 1921 aaaagtgcca acgtcgatgt cccacaccgt ctcttctatc tggccttgcc gccaagcgtt 1981 tttttgacgg tggccaagca gatcaagagt cgtgtgtacg cagagaatgg catcacccgt 2041 gtaatcgtag agaaaccttt cggccacgac ctggcctctg ccagggagct gcaaaaaaac 2101 ctgggggccc tctttaaaga agaagagttg tacagaattg accattactt gggtaaagag 2161 ttggtcaaga atcttttagt cttgaggttc ggtaaccagt ttttgaatgc ctcgtggaat 2221 agagacaaca ttcaaagcgt tcagatttcg tttaaagaga ggttcggcac cgaaggccgt 2281 ggcggctatt tcgactctat aggcataatc agagacgtga tgcagaacca tctgttacaa 2341 atcatgactc tcttgactat ggaaagaccg gtgtcttttg acccggaatc tattcgtgac 2401 gaaaaggtta aggttctaaa ggccgtggcc cccatcgaca cggacgacgt cctcttgggc 2461 cagtacggta aatctgagga cgggtctaag cccgcctacg tggatgatga cactgtagac 2521 aaggactcta aatgtgtcac ttttgcagca atgactttca acatcgaaaa cgagcgttgg 2581 gagggcgtcc ccatcatgat gcgtgccggt aaggctttga atgagtccaa ggtggagatc 2641 agactgcagt acaaagcggt cgcatcgggt gtcttcaaag acattccaaa taacgaactg 2701 gtcatcagag tgcagcccga tgccgctgtg tacctaaagt ttaatgctaa gacccctggt 2761 ctgtcaaatg ctacccaagt cacagatctg aatctaactt acgcaagcag gtaccaagac 2821 ttttggattc cagaggctta cgaggtgttg ataagagacg ccctactggg tgaccattcc 2881 aactttgtca gagatgacga attggatatc agttggggca tattcacccc attactcaag 2941 cacatagagc gtccggacgg tccaacaccg gaaatttacc cctacggatc aagaggtcca 3001 aagggattga aggaatatat gcaaaaacac aagtatgtta tgcccgaaaa gcacccttac 3061 gcttggcccg tgactaagcc agaagatacg aaggataatt agaaaaatgc aagcacattc 3121 atttatcggc taagtcactg aaattttttt tttcgagtga ttttatcttg caagctcact 3181 ctctttgttc ttttctcctc tataatggca tctccccccc accaactcct ctctacttgc 3241 gtgtgtattt gtttgtatac atgtgtaaat atatatacat aaaagaatgt cgtctcatgt 3301 attttttaac ttttagccgc ggtcagtgac attttggctt tcccaccatt ccacgtctga 3361 aaaaaaaaaa aaaaaaagag ataaaatatg aattgaatat atatcaaaaa tgtctgcaaa 3421 aatttttacg aatcagtccg attggacttg tacttatcat tgatttgctc gttatgcgtc 3481 ttatccactt gattagctat tccctgtaaa gtcttagaca ctgcctcagt ggccccaatt 3541 aacggccgaa taattgccac aggtgcagct ctagcaacat aaactgctgc ggcgctcggc 3601 cctcccggtt tatcatcctt catctgtccc tttgccctcc ataccgcatc gtacgcaata 3661 tgcatatgct tttctaatga actatacgcc tccttcaaac ccgtgggcag gtcgaggggc 3721 tgatctgcgt acagactcac tattgtgggt tcgttgtgat ctcccggtga agacatatcg 3781 at // LOCUS HLTLYS 642 bp ss-mRNA INV 21-JUL-1990 DEFINITION H.rufescens sperm lysin mRNA, complete cds. ACCESSION M34388 KEYWORDS sperm lysin. SOURCE H.rufescens testis, cDNA to mRNA, clones r13 and r14. ORGANISM Haliotis rufescens Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae. REFERENCE 1 (bases 1 to 642) AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D. TITLE Species specific sequences of abalone sperm lysin: The sperm protein that dissolves a hole in the egg envelope JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.D.Vacquier, 22-MAY-1990. FEATURES from to/span description pept 16 480 sperm lysin precursor sigp 16 69 sperm lysin signal peptide matp 70 477 sperm lysin mRNA < 1 642 sperm lysin mRNA signal 625 630 poly-A signal BASE COUNT 202 a 124 c 151 g 165 t ORIGIN 1 cgaacagatt acaagatgaa gctgttggtg ctttgcattt ttgcgatgat ggcgacattg 61 gcaatgtcac gcagctggca ttatgttgaa cccaagtttt taaacaaggc ttttgaagtt 121 gcactgaagg tacagatcat tgctggattc gatagagggc tggtcaaatg gcttagagtt 181 catggcagaa cattgtctac cgttcagaag aaggcgctgt acttcgtcaa tagacgttac 241 atgcagaccc actgggcgaa ttacatgctc tggataaata agaagatcga tgcactgggc 301 agaactccag ttgttggtga ctacacacgg ttaggtgccg aaattggaag acgtattgat 361 atggcttact tttacgactt cctgaaagat aagaacatga tcccaaaata tctgccatac 421 atggaagaaa ttaatagaat gcgcccggca gatgtcccag ttaaatacat gggcaagtga 481 aacccatgaa gaagacagct tcatgcaccg tgcatgtcaa cagatacaga aaacatcgtc 541 aaacacgggg agctatgaca ccatcctaac atgggttggc tatttgttgc acgattttta 601 aaacggaaag attgtacatt ccagaataaa tagatttact ac // LOCUS HLTLYS2 650 bp ss-mRNA INV 21-JUL-1990 DEFINITION H.corrugata sperm lysin mRNA, complete cds. ACCESSION M34389 KEYWORDS sperm lysin. SOURCE H.corrugata testis, cDNA to mRNA, clones p1 and p3. ORGANISM Haliotis corrugata Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae. REFERENCE 1 (bases 1 to 650) AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D. TITLE Species specific sequences of abalone sperm lysin: The sperm protein that dissolves a hole in the egg envelope JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.D.Vacquier, 22-MAY-1990. FEATURES from to/span description pept 16 483 sperm lysin precursor sigp 16 69 sperm lysin signal peptide matp 70 480 sperm lysin mRNA 1 650 sperm lysin mRNA signal 635 640 poly-A signal BASE COUNT 194 a 134 c 162 g 160 t ORIGIN 1 ggaacagatt acaagatgaa gctgttggtg ctttgccttt ttgcgatgat ggcgacattg 61 gctgtgtcgc gccaccgctt ccgttttatt ccacataagt atatacgcaa ggagtttgaa 121 gttgcactga aggtagagat cattgccgga ttcgatagaa cgctggtcaa atggcttaga 181 gttcatggcg gacgtttgag tacagttcag aagaaggcgc tgtacttcgt caatagacgt 241 tacatgcaga cccactggca gaattacatg ctctggatag tgaggaagac cgatgcactg 301 ggcagacctc cagttgttgc tgactacagt cggttaggtg ccgaaattgg aagacgtatt 361 gatatggcat atttttacaa cttcctgaac gggaggaaca tgatcccaaa atatctgcca 421 tacatggagg aaattaatag aatgcgcccg gcagatgtcc cagttgcaaa cagaggcaag 481 tgaaaccaac gaagaagaaa gctgcatgca ccctgcatgt caacagatac agaaaacatc 541 gtccgtcaaa cacgaggagc tgtgacagca tccatcccaa catgggttgg ctatttgttg 601 cacgatttta acacagaaag attgtacatt ccagaataaa tagatttact // LOCUS HUMROSSA 1843 bp ss-mRNA PRI 21-JUL-1990 DEFINITION Human 52 kD ribonucleoprotein Ro/SSA, complete cds. ACCESSION M34551 KEYWORDS ribonucleoprotein. SOURCE Human thymocyte, cDNA to mRNA, clone FI18.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1843) AUTHORS Itoh,K., Itoh,Y. and Frank,M.B. TITLE Heterogeneity in human Ro/SSA ribonucleoproteins: The 52 and 60 kd Ro/SSA proteins are encoded by separate genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.B.Frank, 18-MAY-1990. Author address: M.B.Frank Oklahoma Medical Res Foundation Arthritis and Immunology program 825 WE 13th St, Oklahoma City, OK 73104 FEATURES from to/span description pept 39 1466 52 kD Ro/SSA ribonucleoprotein signal 1827 1832 poly-A signal BASE COUNT 474 a 458 c 488 g 423 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgggc acactgctgt ttaacggcac acttgacaat ggcttcagca gcacgcttga 61 caatgatgtg ggaggaggtc acatgcccta tctgcctgga ccccttcgtg gagcctgtga 121 gcatcgagtg tggccacagc ttctgccagg aatgcatctc tcaggttggg aaaggtgggg 181 gcagcgtctg tcctgtgtgc cggcagcgct ttctgctcaa gaatctccgg cccaatcgac 241 agctagccaa catggtgaac aaccttaaag aaatcagcca ggaggccaga gagggcacac 301 agggggaacg gtgtgcagtg catggagaga gacttcacct gttctgtgag aaagatggga 361 aggccctttg ctgggtatgt gcccagtctc ggaaacaccg tgaccacgcc atggtccctc 421 ttgaggaggc tgcacaggag taccaggaga agctccaggt ggcattaggg gaactgagaa 481 gaaagcagga gttggctgag aagttggaag tggaaattgc aataaagaga gcagactgga 541 agaaaacagt ggaaacacag aaatctagga ttcacgcaga gtttgtgcag caaaaaaact 601 tcctggttga agaagaacag aggcagctgc aggagctgga gaaggatgag agggagcagc 661 tgagaatcct gggggagaaa gaggccaagc tggcccagca gagccaggcc ctacaggagc 721 tcatctcaga gctagatcga aggtgccaca gctcagcact ggaactgctg caggaggtga 781 taattgtcct ggaaaggagt gagtcctgga acctgaagga cctggatatt acctctccag 841 aactcaggag tgtgtgccat gtgccagggc tgaagaagat gctgaggaca tgtgcagtcc 901 acatcactct ggatccagac acagccaatc cgtggctgat actttcagaa gatcggagac 961 aagtgaggct tggagacacc cagcagagca tacctggaaa tgaagagaga tttgatagtt 1021 atcctatggt cctgggtgcc cagcactttc actctggaaa acattactgg gaggtagatg 1081 tgacaggaaa ggaggcctgg gacctgggtg tctgcagaga ctctgtgcgc aggaaggggc 1141 actttttgct tagttccaag agtggcttct ggacaatttg gttgtggaac aaacaaaaat 1201 atgaggctgg cacctacccc cagactcccc tccaccttca ggtgcctcca tgccaagttg 1261 ggattttcct ggactatgag gctggcatgg tctccttcta caacatcact gaccatggct 1321 ccctcatcta ctccttctct gaatgtgcct ttacaggacc tctgcggccc ttcttcagtc 1381 ctggtttcaa tgatggagga aaaaacacag cccctctaac cctctgtcca ctgaatattg 1441 gatcacaagg atccactgac tattgatggc tttctctgga cactgccact ctccccattg 1501 gcaccgcttc tcagccacaa accctgcctc ttttccccat gaactctgaa ccacctttgt 1561 ctctgcagag gcatccggat cccagcaagc gagctttagc agggaagtca cttcaccatc 1621 aacattcctg ccccagatgg ctttgtgatt ccctccagtg aagcagcctc cttatatttg 1681 gcccaaactc atcttgatca accaaaaaca tgtttctgcc ttctttatgg gacttaagtt 1741 ttttttttct cctctccatc tctaggatgt cgtctttggt gagatctcta ttatatcttg 1801 tatggtttgc aaaagggctt cctaaaaata aaaacccgaa ttc // LOCUS TRBBS12 4004 bp ds-DNA INV 21-JUL-1990 DEFINITION T.brucei expression site-associated protein (BS1.1 and BS1.2) genes, 5' end and complete cds. ACCESSION M34520 KEYWORDS expression site-associated protein. SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA, clone BS1.1-2. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 4004) AUTHORS Hobbs,M.R. and Boothroyd,J.C. TITLE An expression-site-associated gene family of trypanosomes is expressed in vivo and shows homology to a VSG gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Hobbs, 22-MAY-1990. FEATURES from to/span description pept < 1 129 expression site-associated protein (BS1.1) pept 1318 2151 expression site-associated protein (BS1.2) site 3083 3604 RIME sequence BASE COUNT 1181 a 650 c 1052 g 1121 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcaact atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg 61 gcagcagtac atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga 121 gtgctgtgaa ggaaggatgc gacagaacgt gcgctgctta gtgtgaaaga ttatggtaat 181 ggagggttgt gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg 241 ggcaaaagta acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa 301 ggaaagatgt gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg 361 agatgtaagg ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa 421 tttagttaca acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt 481 ctactgttta atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc 541 gaatttcact tctccagaat ctttatttta ggtgcgctat agatgatagt ccaaagagaa 601 ggaattacaa ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga 661 ttcttttcta tgtcacttgt gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg 721 aaatggggcg tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt 781 atgatgcata gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta 841 ggtggaagcg gctaatgatt gctagcagca ccaacagagg aagttaattg ttgacgtagc 901 cgtttgtgaa tgagatagaa ggttattggg tggaacatta tcttggaaag agtttttctt 961 ccactaaaaa tgtgtagagt ggtggtttcg agaaaatggt ttagatattg aggaagaaac 1021 aaggatttga ggtgcaagtt gatgcgctgt atgatttgtc ttgccgttgc tgttacttat 1081 tgttgagttg gtatttgagg aaaatgtttt ttcagtgtgc gttgttatat acggctgtga 1141 gttggtgatg tttatttgaa tgatagcttg ataggataag atgaaataat actaagcatg 1201 tattcttctg gctgttcgcc gaaatggtaa tctatgttac ccatagcacg aaaaaagttt 1261 ttttactcag taactgaagc ggaaagtttt aaaaagcata atttaaaaaa gttgataatg 1321 atgaagtttt ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa 1381 aataaaagga atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgaactat 1441 cttaaaggaa tagcgcacag agtaaacagc gagagtgccg tggttacgaa aaaactatca 1501 gatttgaaaa tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct 1561 ggcgagcagg attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt 1621 acattccagc aggagttgga ggaaatgagg aagccatccg cgttagcagc agctgcagct 1681 gggatagcag ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg 1741 tcatcacagt tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa 1801 gaatgttttg atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta 1861 aaagagtcgg cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat 1921 agcttgggtg cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga 1981 acagatactg acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt 2041 ggtggagtga tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa 2101 tacgtgatgc tacagagtac ggtgcagtac ggtggaccga ggatccagta aagtgagcat 2161 atttaaggat gtcatccgcc tgtttgcgcg cttccaagaa gcaaaaaatg aagtgatgaa 2221 taaaataaaa actactgtgg atgaattggc gaaatgcata gggcagaaag aggttgagct 2281 cactgatgat cagctttacg aggaatttat ctgggagacc atacacagat tggagctgtc 2341 aaagagagtg agtgaacaac tctctttagg tgaagaagag gagacgatcc tgaaatctaa 2401 ctatactgca gaaccggtcc gtggactttc acggtagcgg ggtccaacgc ggcagcagta 2461 catttgagtg tttctaccgc tgcactttgt ttttcagttt tattgttggg agtgctgtga 2521 aggaaggatg cgacagaagc tgcgctgctt agtgtgaaag attatggtaa tggagggttg 2581 tgaaagattg ggggaacaaa aacctatttc ttttatttgg gggaacaaat gggcaaaagt 2641 aacgtaagtt tccagtggga gtggtatgtg tgtgtgtatg gggctggcta aggaaagatg 2701 tgagttcggc atgtggtatg tacaagctac gaaaacgtgt gaaacaaaac gagatgtaag 2761 gggaaaatgt aacaaccaac tatgttaaat ttcaggagac tatttttcaa atttagttac 2821 aacaaagtaa atgtcaaata atgccaacta taaaggaaaa ctagactaat tctactgttt 2881 aatgtgacga gaattgtggc gtgaaaccta gtaactgctt atcatggcgt cgaatttcac 2941 ttctccagaa tctttatttt agtgcgctat agatgatagt ccaaagagaa ggaattacaa 3001 ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga ttcttttcta 3061 tgtctcttgt gaaggtcgat ctttttgcta ccccctggcg atgccggcca cctcaacgtg 3121 gtgccagggt ccagtgaccc cgtatcattg ggggaagcca agagccagca gcgttccttt 3181 catggggaac actgctgtag ctccggctac gcatcataca gcacagggat cagcaggcgt 3241 cttgctggga caccgttttt catttgtcgg tccctgggca cgtgccagcg tcgatcagca 3301 gtatcatccg cgctaagatg ctgctgtccg gtgatgtgga cctccaaaaa aggattgcca 3361 ttttggcatc ttttggagag tccagggtgg gaggcttctc gccccatctg ctgtattccg 3421 ttcaactgcg gacgtacaac aaaaaattat agagggtgtg ttaggatgaa tgaaaaaggg 3481 agactctgcc acagtcgcca gaccgatagc actctcaggg ctctacggtg ttggctgatg 3541 gccgcgcaga ggggggaaac tctcacgaag gcacgaagaa aattcaaaaa aaaatttttg 3601 ctaccacagg ctgtaaggat aacgctggag atggagccga ctgtggtttt tcttgtgaat 3661 gcgtatcttc gtaaccggtt gttaagaagg gatcggacga aggaatttgt ctgagtatga 3721 ggtaatgtcg aggttttctt tcattggatg tagtaataat ctccaaattt gttggaacat 3781 ataatattat tgcaataaat gcttgaagag tggaattatt gatggattga cttgagtatc 3841 atgattccaa tgaacactct tataatcctt cgaatagcaa acgtatgaac aaattaacgt 3901 agcgtcatgt agacatggac atttttgtaa tgatacgact gctattgcac atctgaaaat 3961 ttctatgacg gaccagtgcc gttaatcacc gtacaattga attc // LOCUS TRBBS134 2396 bp ds-DNA INV 21-JUL-1990 DEFINITION T.brucei expression site-associated protein (BS1.3 and BS1.4) genes, 3' and 5' ends. ACCESSION M34521 KEYWORDS expression site-associated protein. SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA, clone BS1.3-4. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 2396) AUTHORS Hobbs,M.R. and Boothroyd,J.C. TITLE An expression-site-associated gene family of trypanosomes is expressed in vivo and shows homology to a VSG gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Hobbs, 22-MAY-1990. FEATURES from to/span description pept 1836 > 2396 expression site-associated protein (BS1.4) ORF < 1 1206 expression site-associated protein (BS1.3) BASE COUNT 744 a 357 c 642 g 653 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttgcaa ctccatgaag tgttatcatc catttccttc ggtagcttgg gtgtgaaaaa 61 tattcgagga ggaaatggga gggatggatg caatttggtg agaacagaca ctaacggtat 121 attaaatggg ggttcaccaa cacggcacaa cttaacgtgg ggtggtggag tgatgaattt 181 tggatcctat caaaacggaa gtatgtacgt tgagggtggt gaatacggtg atgctacaga 241 gtacggtgcg gtacggtgga ctgaagatcc cagtaaagtg agcatatttg aggatgtcat 301 tcgcctgttt gcgcgcttcc aagaagcaaa aaatgcagtg atgaggagaa taaaaactac 361 tgtggatgaa ttgacgaaat gtatagggca gaaggaggct gaactcacta atgatcagat 421 ttacgaggaa tttatctggg agaccataca cagattggag ctgtcaaaga gagtgagtga 481 acaaccctct ttaggtgaag aagaggagac aatcctgaaa tctaactata ctgcagaacc 541 tgtccgtgga cctttcacgg gagcggggtc caacacagta gcagtacatt tgagtgtttc 601 taccgctgca ctctgttgtt tagctttatt gttgggagtg ctgtgaggga aggatgcgac 661 agaaactgcg ctgcttagcg tgaaagatta tggtaatgga gggttgggaa agattggggg 721 aacaaaaacc tatttctttt atttggggga acaaatgggc aaaagtaacg tgagtttcca 781 gcgggagtgg tgtgtgtgtg catggggctg actaaggaaa gatgtaagtt cggaatgtgg 841 tatgtacaag ctacgaaaac gtgtgaaaca aaacgatatg taaggggaaa atgtaacaac 901 caactatgtt aaatttcaag agactatttt tcaaatttag ttacaacaaa gtaaatgtca 961 aataatgcca actataaagg aaaactagac taattctact gtttaatgtg acgagaattg 1021 tggcgtgaaa cctagtaacg accacgtcac gatgtggaat ttcacttctc cagaatcttt 1081 gttttagtgc gctatagatg atagtccaaa gagaaggaat tgcagttatt ataacccatc 1141 caaagataga tcaaagactc ctcttctccg agtgattctt ttctatgtca cttgtggaag 1201 gtcgatcttg gaacaatatc gaacgtgtga agcctgaaat gaggcgttat tattgcttct 1261 ctttttggtg gttgttagag ttatataccg ctatggtatg atgcacagag gtaacgctaa 1321 aaataagtga caaggcgttg tagtagggcg tatggtagtt ggaagcggct aatgattgct 1381 ggcctcacca acagaggaag ttaattgttg acgtagccgt ttgtgaatga gatagaaggt 1441 tatcgggtgg aacattatct tggaaagagt ttttcatcac ctaaaaatgt gtagagtggt 1501 ggtttggaga aaatggttta gatattgagg aggaaacaag gatttgaggt gcaagttgat 1561 gcgctgtaag atttgtatta ccgttggtgt tacttattgt tgagtgagta tttgagggaa 1621 atgttatggg aggtgcttcg ttgttacata cggctctgag tcagtggtgt tcatttgaat 1681 gaaatcttga taggataaga tgaaataata ctaagcatgt attcttctgt gctgccgccg 1741 aaatggtaat ctatcttacc tacagcacga aaaatttatt ttttttactc agtaacaaag 1801 cggaaagttt taaaagcata atttaaaaaa gtgatatgat gaggttttgg tttgtgttgt 1861 tggccctttt gggaaaaaaa acacatgcat attatgaaaa tgaaaggaat gcattaaacg 1921 caaccgccgc taataaagtt tgtgggctat cgaactatct taaaggagta gcgcacagag 1981 taaacagcga gagtgctgtg gttacggaaa aactatcaga tttgaaaatg agaagcatcc 2041 agttgcagct atcagtaatg cggaacagag tgccttctgg cgagaaggat tgtaaagaca 2101 taaggacact cttgaaaaca gtattgagga atgagtttac attccagcag gagttggagg 2161 aaatgaggaa cgcatccgcg ttagcagcag ctgcagctgg gttggcagct ggaagactgg 2221 aagagtggat ttttgtattt gcacaggcag cagacaggtc atcacagttt tgcataagtg 2281 tggggaagca tatcgcggct gagcacggca acttgcaaga atgttttgat ggaacaattg 2341 gacctgaaac cctttacaaa attgaggatt cacgtgtaaa agagtcggcg cagaaa // LOCUS TRBBS15 714 bp ss-mRNA INV 21-JUL-1990 DEFINITION T.brucei expression site-associated protein (BS1.5) gene, partial cds. ACCESSION M34522 KEYWORDS expression site-associated protein. SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.4, cDNA to mRNA, clone BS1.5. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 714) AUTHORS Hobbs,M.R. and Boothroyd,J.C. TITLE An expression-site-associated gene family of trypanosomes is expressed in vivo and shows homology to a VSG gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Hobbs, 22-MAY-1990. FEATURES from to/span description pept < 1 > 714 expression site-associated protein (BS1.5) (AA at 1) BASE COUNT 226 a 116 c 203 g 169 t ORIGIN 1 agcgagagtg ccgtggttac ggaaaaacta tcagatttga aaatgagaag catccagttg 61 cagttatcag taatgcggaa cagagtgcct tctggcgaga aggattgtaa agacatcagg 121 acactcttga aaacagtatt gaggaatgag tttacattcc agcaggagtt ggaggaaatg 181 aggaacgcat ccgcgttagc agcagctgca gctgggttgg cagctggaag actggaagag 241 tggatttttg tatttgcaca ggcagcagac aggtcatcac agttttgcat aagtgtgggg 301 aaaactatcc caccggaaca aaacaacttg caagaatgtt ttgatggaac aattggacct 361 gaaacccttt acaaaattga ggattcacgt gtaaaagagt ctgcaaagaa aagcttgcaa 421 ctccatgaag cgttatcatc catttccttc aatagtttgg gtgcggaaag tatccgagga 481 ggaaatggga aggatggatg taatttggtg agaacagaca ctgacggtat attaaatggg 541 ggttcaccaa cacggcacaa tttaacgtgg ggtggtggag tgatgaattt tggatcctat 601 caaaacggaa gtatgtacgt tgagggggga gaatacggtg atgctacaga gtacggtgca 661 gtacggtgga ctgaagatcc cagtaaagta agcatattta aggatgtcat tcgc // LOCUS HUMCALCAT 1308 bp ss-mRNA PRI 21-JUL-1990 DEFINITION Human calcineurin catalytic subunit mRNA, complete cds. ACCESSION J05480 KEYWORDS calcineurin. SOURCE Human adult cerebral cortex and 7-year-old cerebellum, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1308) AUTHORS Kincaid,R.L., Giri,P.R., Higuchi,S., Tamura,J., Dixon,S.C., Marietta,C.A., Amorese,D.A. and Martin,B.M. TITLE Cloning and characterization of molecular isoforms of the catalytic subunit of calcineurin using nonisotopic methods JOURNAL J. Biol. Chem. 265, 11312-11319 (1990) STANDARD full staff_review COMMENT Authorin submission for [1] kindly submitted by R.Kincaid, 08-MAY-1990, for release after publication. Two cDNA isoforms were found that either contained (class 1) or lacked (class 2) base pairs 535 to 564, but otherwise appeared identical; this region presumably represents an alternatively spliced exon. FEATURES from to/span description pept < 1 762 calcineurin catalytic subunit signal 921 926 poly-A signal signal 927 932 poly-A signal BASE COUNT 401 a 251 c 283 g 373 t ORIGIN 1 ttacagcaca ataacttgtt atctatactc cgagcccacg aagcccaaga tgcagggtac 61 cgcatgtaca ggaaaagcca aacaacaggc ttcccttctc taattacaat tttttcagca 121 ccaaattact tagatgtata caataacaaa gctgcagtat tgaagtatga gaacaatgtt 181 atgaatatca ggcaattcaa ctgttctcct catccatact ggctcccaaa tttcatggat 241 gtttttactt ggtcccttcc atttgttggg gaaaaagtga ctgagatgct ggtaaatgtc 301 ctcaacatct gctcagatga tgaactaggg tcagaagaag atggatttga tggtgcaaca 361 gctgcagccc ggaaagaggt gataaggaac aagatccgag caataggcaa aatggccaga 421 gtgttctcag tgctcagaga agagagtgag agtgtgctga cgctgaaagg cttgacccca 481 actggcatgc tccccagcgg agtactttct ggagggaagc aaaccctgca aagcgctact 541 gttgaggcta tcgaggctga tgaagctatc aaaggatttt caccacaaca taagatcact 601 agcttcgagg aagctaaggg cttagaccga attaatgaga ggatgccgcc tcgcagagat 661 gccatgccct ctgacgccaa ccttaactcc atcaacaagg ctctcacctc agagactaac 721 ggcacggaca gcaatggcag taatagcagc aatattcagt gaccacttac tgttcacttt 781 tttttttttt tttttttttt tttttgagct gcggggcatg atgggattgc tgcatatcag 841 cagttggatg ttcttgcctc tgacagtagc ttatttgctc tgggggccag gaattggatt 901 cagtttacac tatcattaaa aataaaaata aaaaaagggg gagagagata ataaactata 961 ttttggtggg gctggtgatt aaacacctct tttgggtatg ccttttaaaa atgcttatag 1021 agaaaaaaaa ttttaaaaag aaagctaatg ctagatatac tgcaatgtta ggggaatgaa 1081 catgttttcc tactgaattg gggacttcta gataggttaa ggttttattc tgttactgga 1141 catcaaaact ttgtctaatt tcttactcta ttgtacgttt acagtcgcag cactaaaatg 1201 gatgtcatca aacattttta acatgatgta caaactaagg actatttatt gataatgttt 1261 tgctactctt gtcagacaat ggctataaac tgaattaggc agtcttaa // LOCUS MUSCALCAT 2194 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Mouse calcineurin catalytic subunit mRNA, complete cds. ACCESSION J05479 KEYWORDS calcineurin. SOURCE Mouse (strain NIH Swiss) 18-day-old adult brain, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2194) AUTHORS Kincaid,R.L., Giri,P.R., Higuchi,S., Tamura,J., Dixon,S.C., Marietta,C.A., Amorese,D.A. and Martin,B.M. TITLE Cloning and characterization of molecular isoforms of the catalytic subunit of calcineurin using nonisotopic methods JOURNAL J. Biol. Chem. 265, 11312-11319 (1990) STANDARD full staff_review COMMENT Authorin submission for [1] kindly submitted by R.Kincaid, 08-MAY-1990, for release after publication. Two cDNA isoforms were found that either contained (class 1) or lacked (class 2) base pairs 1414 to 1443, but otherwise appeared identical; this region presumably represents an alternatively spliced exon. FEATURES from to/span description pept 76 1641 calcineurin catalytic subunit mRNA < 1 2194 calcineurin catalytic subunit mRNA signal 1786 1791 poly-A signal signal 1792 1797 poly-A signal BASE COUNT 624 a 488 c 529 g 553 t ORIGIN 1 gcgccggtgc ggtcggggtg tgcagtcgga cgggacgagc agcgcgtcgc tgtccccccc 61 tcccggtggc tggagatgtc cgagcccaag gcgattgatc ccaagttgtc gacgaccgac 121 agggtggtga aagccgttcc atttccacca agtcaccggc tgacagcaaa ggaagtgttt 181 gataatgatg ggaaacctcg tgtggatatc ttaaaagcac atctcatgaa ggagggcagg 241 ctggaagaaa gtgttgcatt gagaataata acagagggtg cttcgattct ccgacaggaa 301 aaaaacttgc tggatatcga cgcaccagtc acagtttgtg gggacatcca tggacaattc 361 tttgacttga tgaagctctt tgaagtggga ggatctcctg ccaacactcg ctacctcttc 421 ttaggggact atgttgacag agggtacttc agtatcgaat gtgtgctgta tttgtgggcc 481 ttgaaaattc tttaccccaa aacactgttt ttacttcgcg gaaaccatga atgtaggcac 541 ctcacagagt atttcacgtt taaacaagaa tgtaaaataa agtattcaga acgcgtttat 601 gacgcctgta tggatgcctt cgactgcctt cccctggctg cgctaatgaa ccagcagttc 661 ctgtgtgtac acggtggttt gtctccagag attaacactc tagatgacat cagaaaatta 721 gaccgattca aagaaccacc tgcttatggg cccatgtgtg acatcctatg gtcagacccc 781 ctggaggact ttggaaatga gaagactcag gaacatttca ctcacaacac agtcagaggc 841 tgttcgtact tctacagtta cccagctgtg tgtgacttcc tgcagcacaa taatttgttg 901 tccatactcc gcgcccacga agcccaggat gcagggtacc gcatgtacag gaaaagccaa 961 acaacaggct tcccgtctct aattacaatc ttctcggcac caaattactt agatgtgtac 1021 aataacaaag ctgcagtgtt gaagtacgag aacaatgtga tgaacatcag gcagttcaac 1081 tgctccccgc atccgtactg gctcccaaat ttcatggatg ttttcacctg gtcgctgcca 1141 tttgttgggg agaaagtgac tgagatgctg gtcaatgttc tcaacatctg ctccgacgat 1201 gaactggggt cagaagaaga tggatttgac ggagccacgg ccgcagcccg gaaggaagtc 1261 atcagaaaca agatccgagc aataggcaaa atggccagag tgttctcagt tctcagagaa 1321 gagagtgaga gtgtcctgac actgaagggc ctgaccccaa ctggcatgct ccccagcgga 1381 gtgctctctg gcgggaaaca gactctgcaa agcgctactg ttgaggctat tgaggctgat 1441 gaagccatca aaggattttc accacaacat aagatcacta gcttcgagga ggccaagggc 1501 ttagaccgaa ttaacgagag gatgccacct cgcagagacg ccatgccctc tgacgccaac 1561 cttaactcca tcaacaaggc tctcgcctca gagactaacg gcacggacag caatggcagt 1621 aatagcagca atatccagtg accacttcct gttcactttt tttttttttt gagctgcagg 1681 gcatgatggg attgctgcat ctcagcagtt ggatgttctt gcctctgaag gtagcttgtt 1741 tgctctgggg gccaggaatt ggattcagtt tacactatca tgaaaaataa aaataaaaaa 1801 agagggagag agataataaa ctatattttg gtgagggtgg tgattaaaca cctcttttgg 1861 gtatgccttt aaaaaatgct tctagggcaa aaaagtttta aaaagaaagc taatgctagc 1921 tatactgcaa tgttagggga atgaacgcgt tttcctactg cactggggac ttttagatag 1981 gttaatgaaa ggcctttatt ctgttactgg acacgaaaac tttgtctaat ttcttatact 2041 ctattgtacc tttacagtcg cagcactaaa atggaagaca tcaaacattt ttaacagaaa 2101 aaaaaaaaga tgtaaaaact aactaaggac tatttattaa tgatgttttg ctactcctgt 2161 cagacaatgg ctataaactg aattaggcag tctt // LOCUS RATGGLUT 1060 bp ds-DNA ROD 21-JUL-1990 DEFINITION Rat gamma-glutamyltransferase gene, 5' promoter region. ACCESSION J05515 KEYWORDS gamma-glutamyltransferase. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1060) AUTHORS Rajagopalan,S., Park,J.H., Patel,P., Lebovitz,R.M. and Lieberman,M.W. TITLE Cloning and analysis of the rat gamma glutamyltransferase gene JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Rajagopalan, 25-MAY-1990. BASE COUNT 235 a 277 c 275 g 273 t ORIGIN 1 cagctgcctt ctggaggacc aaactgttca ggggaaggac aggaagaaat gagcctgtgc 61 cttcaggtca gagtcatgcc tagatctggg cgggagagct acaagggata ctgaccagga 121 gatagggtgt tgtcccctcc cccctggggt ttggtatcct cctctgcctt aagagttgca 181 aatcgacttt cccacataac aggcaccaaa tccagttagg accaacccca ccttccaatc 241 caggggagag gaatgtcagc aatgcgtggg cgtgtccttc taatgtgttt tccttgagtg 301 ttgtatgtgg accatctgca tgctcggtac ccagaggcca tcaggtctct tggaacagga 361 attgttgatg tgaaatgcca tgtggttgct gggataggaa ctcaggactc cggaagaacc 421 ttctcttctc cagtccccct ctgttgtttt tttttttttt ttttttgaga tacgatctca 481 cactgtagca caggctaatc cagaactcac taggtaggtc agactgggct caaatcacag 541 cgattctgct tctgcttcct gagtgccagg gtttgcaggt gttagctatc atgcccagtc 601 ttaacatttc acacacgcca gtccaagtta ttaaaaaaca acccggcagt tgagggcagg 661 gccctcaagt cccacaactg gtgcgtgcgt accaagtcca atgcgggaaa ggcctggacc 721 cttgaaccct ttgggcggtt cacttgttag ctcttactac caaatcctgg gcttacacat 781 gaatgccagc ccctccctgc ccagttctgt gacccccttc cccgggcagc tcttgggaga 841 agtcatgcat acatggaggc ggtgccagcc tctttgactc cagagttcag cgggagacag 901 agggagctca tcacatcagg caccccagaa gagttctggg cctgcttcac gtttaacttt 961 gtgattttca ggagtaccag cctgctctaa cggtttcagg gaagattggc tgtgggtttc 1021 cgcagagtgt gggggagttc ctgcttatcc atacagctga // LOCUS YSCRGI 130 bp ds-DNA PLN 21-JUL-1990 DEFINITION Yeast 35S ribosomal RNA gene, 5' end. ACCESSION M35269 KEYWORDS 35S ribosomal RNA. SOURCE Yeast DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 130) AUTHORS Riggs,D.L. and Nomura,M. TITLE Specific transcription of Saccharomyces cerevisiae 35 S rDNA by RNA polymerase I in vitro JOURNAL J. Biol. Chem. 265, 7596-7603 (1990) STANDARD simple staff_entry FEATURES from to/span description rRNA 101 > 130 35S ribosomal RNA BASE COUNT 48 a 9 c 43 g 30 t ORIGIN 1 agagggcaaa agaaaataaa agtaagattt tagtttgtaa tgggaggggg ggtttagtca 61 tggagtacaa gtgtgaggaa aagtagttgg gaggtacttc atgcgaaagc agttgaagac 121 aagttcgaaa // LOCUS ACNRRDA 1486 bp ss-rRNA RNA 21-JUL-1990 DEFINITION A.lignieresii 16S ribosomal RNA. ACCESSION M35017 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE A.lignieresii (strain ATCC 19393) rRNA. ORGANISM Actinobacillus lignieresii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 1486) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1486 16S ribosomal RNA BASE COUNT 384 a 303 c 448 g 321 t 30 others ORIGIN 1 attnannant ttnatcangg ctcagattna ncgctggcgg caggcttaac acatgcaagt 61 cgaacggtaa cgggaagggn ncttncnttc tttgccgacg agtggcggac gggtgagtaa 121 tgcttgggaa tntggcttat ggagggggat aactacggga aactgtagct aataccgcgt 181 aatatcttag gattaaaggg tgggactttc gggccacctg ccataagatg agcccaagtg 241 ggattaggta gttggttagg taaaggctga ccaagccgac gatctctagc tngtctgaga 301 ggatgaccag ccacactnga actgagacac ggtccagact cctacgggag gcagcagtng 361 ggaatattgc acaatggggg gaaccctnat gcagccatgc cgcgtgaatg aagaaggcct 421 tcgggttgta aagttctttc ggtagcgagg aaggtatcaa atttnataga tttngtaatt 481 gacgttaact acagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag 541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggtt nattaagtga 601 gatgtgaaag ccccgggctt aacctgggna ttgcatttca tactggtcaa ctagagtact 661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc 721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcgaaagc gtggggagca 781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct 841 ntnagtctgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag 901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg 961 atgnaacgcg aagaacctta cctactcttg acatccatgg aatcttgtag agatacgaga 1021 gtgccttcgg gaaccatgag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat 1081 gttgggttna gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attnggtcgg 1141 gaactcnaag gagactgccg gtgatnaacc ggaggaaggt ggggatgacg tcaagtcatc 1201 atggccctta cgagtagggc tacacacgtg ctacaatggc gtatacagag ggaagcaaga 1261 tggcgacatg gagcaaatct cacaaagtac gtctaagtcc ggattggagt ctgcaactcg 1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc 1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc 1441 ttaaccgcaa ggggggcgtt taccacggta tgattcatga ctnggg // LOCUS BNORRDA 1532 bp ss-rRNA RNA 21-JUL-1990 DEFINITION D.nodosus 16S ribosomal RNA. ACCESSION M35016 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE D.nodosus (strain 198A; ATCC 27521) rRNA. ORGANISM Bacteroides nodosus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Anaerobic gram-negative straight, curved and helical rods; Bacteroidaceae. REFERENCE 1 (bases 1 to 1532) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1532 16S ribosomal RNA BASE COUNT 410 a 330 c 465 g 324 t 3 others ORIGIN 1 aactgaagag tttgattctg gctcagattg aacgctggcg gcatgcttaa cacatgcaag 61 tcgaacgggg ttatgtagct tgctatgtaa cctagtggcg gacgggtgag taatatatag 121 gaatctgcct tatggtgggg gacaacgtat ggaaacgtac gctaataccg cataagattg 181 aagaatgaaa gcgggggctc gaaagacctc gcgccgtaag atgagcctat atcggattag 241 ctagttggtg gggtaagagc ctaccaaggc gacgatccgt agctggtttg agagaatgat 301 cagccacatc gggactgaga cacggcccna actcctacgg gaggcagcag tggggaatat 361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt 421 gtaaagcact tttattagtg aagaacggtg catggttaat acccatgcaa ttgacattag 481 ctaaggaaaa agcaccggct nactccgtgc cagcagccgc ggtaatacgg agggtgcaag 541 cgttattcgg aatgactggg cgtaaagcgc acgcaggtgg ttttataagt caggtgtgaa 601 atccctgggc tcaacctagg aattgcattt gaaaccgtaa gactagagta tgtcagagga 661 aggcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga 721 aggcagcctt ctggggcaat actgacactc atgtgcgaaa gcgtgggtag caaacaggat 781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggtgttgggt agaaatactc 841 ggtaccgaag ctaacgcaat aagttgaccg cctggggagt acggccgcaa ggttgaaact 901 caaagaaatt gacggggacc cgcacaagcg gtggagcatg tggtttaatt cgatgcaacg 961 cgaagaacct taccaggctt tgacatccag agaattctac agagatgtgg gagtgccttc 1021 gggaactctg agacaggtgt tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt 1081 aagtcccgca acgagcgcaa cccttatcct tacttgccag cacgtaaagg tgggaactat 1141 aaggagactg ccggtgacaa accggaggaa ggtggggatg acgtcaagtc atcatggccc 1201 ttacagcctg ggctacacac gtgctacaat gggcggtaca gacggtagcc aacccgcgag 1261 ggggagccaa tctgagaaaa ccgttcgtag tccggattgc agtctgcaac tcgactgcat 1321 gaagtcggaa tcgctagtaa tcgcgaatca gcacgtcgcg gtgaatacgt tcccgggtct 1381 tgtacacact gcccgtcaca ccatgggagt ttgttgcacc agaagtaggt agcttaaccg 1441 taaggagggc gcttaccacg gtgtggccga tgactggggt gaagtcgtaa caaggtaacc 1501 gtaggggaac ctgcggttgg atcacctcct tt // LOCUS CAJSAPA 3974 bp ds-DNA BCT 21-JUL-1990 DEFINITION C.fetus surface array protein (sapA) gene, complete cds. ACCESSION J05577 KEYWORDS surface array protein. SOURCE C.fetus fetus DNA. ORGANISM Campylobacter fetus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 3974) AUTHORS Blaser,M.J. and Gotschlich,E.C. TITLE Surface array protein of Campylobacter fetus: Cloning and gene structure JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.J.Blaser, 01-JUN-1990. FEATURES from to/span description pept 24 2825 surface array protein (sapA) binding 12 15 ribosomal binding site signal 2846 2870 transcription termination signal BASE COUNT 1293 a 641 c 785 g 1255 t ORIGIN 1 ctattttatt aaggagtcct taaatgttaa acaaaacaga tgtttcaatg ctttatatca 61 ctattatggg tatggcaagt gagggtgatg gtaataagta ttggttagat tatgccaata 121 ataatagttt aggagtttca agtttagcta atattatgct tgatagtcca ggggcggcta 181 aattctttgg tgattctctt ttagcaggta atgaaaaaga ttttgttact aagatatata 241 gtatagcttt aggtaatact agtgatgttg atggcattaa ttattggact aaggcaataa 301 ctggcggtgg agaatttact gatagtaagg gtaatgttat tagtgttgct agtttaagca 361 agggtgattt aataggtgct atgattaact ctatggttaa tggcggtagt gctgagtcta 421 aggctatatt tgaggctaag gcagctgcta gtgattactt tgccgatgct actttggtaa 481 gggatattag tggattagat gagggtacta cttctaagtt aattagcgag attaatagtg 541 ctagtgatct tgataaggtt aagagtgaga ttgatgcttt gaagagtgag ctacctaatc 601 cgggtagtac ttatgatctt acagagggta atgataattt aaagggtact gatttagacg 661 atacttttaa tgggactaca tatgtaggta atggtactaa taagagtact cttagtgcat 721 ttgataagac tagatggtcg gtgcttggga gagatacgtt gaatgcgata tttactgcaa 781 taacacgcgc tgcgctacta actgatcaag ctgaattgat aatcactaaa aggcgtacaa 841 acgtagaaaa tatcaatata atttcagatc tagaaacaag tggcgatttc gttttcaacg 901 gttatgaaaa agtaggattt aacgtacttg gcgatatagt tagctttgct accgacgcat 961 ctaaaagcgt aaatgtagaa acaacaggaa cgataactgc tttcaccgca gccggaacag 1021 gcaaagtcga tgttgtcgcc ggtaaaatct ctgcccttac ggccgattcg cgaacaagcg 1081 taaatttaac tgctacaaac gacactatca cattaaccag tgcaaacgct gctactagtg 1141 tgaatttaaa acagcggcag gccaaagacg ctacaataac atccgcaatg cagcaaaaat 1201 ataacaatag acgcaacagg attgcaacta taacttcagc tacggctgta gagaatttga 1261 cagttaaaca tgcaactaac gtagcgctaa atggtggcat ggataaactt gcaacagtta 1321 ctcttgacaa tgctgcttta actgctgcaa tagatataaa atctgcaagc acactaaatt 1381 taataaattc aagtgttaac ggaccaaaac atctctacag cagcaaaaga cgttactgta 1441 aatttaagcg agctgctgct aaggttaaat taaacacaac tgctgctacg gatcaaactg 1501 tgactcttaa agctaatgcc acagacaata gtcttgaatt tgatagcgca actgcaaaaa 1561 ctacgtcagt cactgctagt ggtagtggaa aaactttggt tatcaaaggt gctgaagtag 1621 aaactctagt taatatagac actactgcgt tcaatgcgct gcagagtgtt agcttcggca 1681 aaactggtca aggcggcaaa tttagcgtaa aaacaggtac aggcgatgat aaaatagagt 1741 ttgtaggcac aactttaact gaaggttcag taatcgatgc gccgggtaat gatactatag 1801 ctatgaaatc agctgctttg actagtgcaa atttcactat gatcaaaaac atagagaatg 1861 ttgctattag cgacgcggtg gctacggctg accttagctc ttctgcgttt aaaaactcag 1921 tcatcataac tactaaagag gctgccgata cgactttaac tataaataaa gaccaagtaa 1981 ttaattttac tgcggcagat gcaggctcag ttaagcttat tacagtgaaa ctaaatgatg 2041 taacagcgct aatgattgtt gttaaaatag tattagatgc tgccgctaaa gatactaata 2101 tagctttggg cacagcagca gccgataaag ctcttgtaat tgatacgggt atagaaactc 2161 ttaatatcac ttcgttagtt aaagctacaa gccctgaaac tacagcaaat accgtaaatg 2221 ctaaattgac agatgtaact tctatcatta tagatgggat gcaaattact ttaggtcatg 2281 ccggcactgc tggaactgac tatagtaaag tttctatgat agatgcaagt gctcttaaag 2341 ctggtcttac atttgatgca agtgctataa cattaggagc taatgctact ataaaaggtg 2401 gctctggagc tgatagtata actgttaaag gcggaaatat cgttgtagat ttagttgctg 2461 gcggagacga tactataact cttaaaaaag gtgctgagaa aacagatatt acaaccgtta 2521 ataactttaa tgctggcgat aagatagata tagctgatgc caaaaatggt actttcacat 2581 ttaataaaat tacaatgaat agcgatgcta atttagatga ttatatcact aaagcagtag 2641 caggagacgg atcaacaaac tctgctgtaa gctactcaca taatggatat acttatgtag 2701 tagttgatgg aacagctgga gctactttca ctaaagctac tgatacaatt atcaaacttt 2761 caggtacgct agatcttaaa cttagcggag ataatgttgt agttgatgat ggaagcgtaa 2821 tttaagcaat agtaattgct tgataaaaaa gccttaattt ttaaggcttt taatttgtct 2881 aaatttatct tagcttaggc gtatttattt gaaatctcgt cagacggatc gtcttaaaat 2941 aaggttttat agagacaaag tatattttag accaaacctt aataaatagt agatagtttt 3001 ttgctaagag gagctaggag aactagctcc ttgaatattt ttaaatttat ttgagttgta 3061 ttttaaattc aacatatctg ttttttgatg ttttgatcga aatttgtcta ttgtatccat 3121 cctaaaaatt tcagctcttc aactcttttt gtgtgaaatt gttctaataa ggttgataaa 3181 ttttcatatt tgtcacttgg aacaagaaaa ctctcgtctc cgactctttt caccaatttt 3241 tgtttataaa actttaaatt taaaggtttt ataagatcta tatatgaact aaactcatct 3301 ttatagctat tagctatact aaaagcgatc tgcttgaact ctttgaaatc aacgtttaaa 3361 attttacaaa gcggcttaaa atcgtttagc ctaatgtttt gacgcttacc gtttataggt 3421 aaatagctct caaatccttt tgccgtacta taaaatcccg ctgctcatca cgtgcataca 3481 taggcgcatc agcactatat ctccgcttgt tataagagaa agatttttca gcatgcagat 3541 cttccgtatt ttatgataac gctataaaca tatatcttca aaagctcctt cttttcagta 3601 tctgatagta gctctttgct tatgcgctta aaaagctttt cgctggttgt gtcatattta 3661 tttatactag tagtcctaga tacgaggcaa actcgacgac gttataatga tgattttcta 3721 gtttatcaaa tcttttaatt acgtagtgat actcgtcgtc atctcctgcc actacagcac 3781 ttcacggaac tctgattttc aacgtatttt tagcaaaact caaatgaaga tgctcattta 3841 tcagcgaggt gcggaaaata acttttgaac tggttggatt tgatctgatt ttcgaatatg 3901 gttttaaaat atagtttgaa gcagcgtctt gatggattat ctttcgttca aaatcgatat 3961 ttactaattt tttg // LOCUS CDBRRDA 1470 bp ss-rRNA RNA 21-JUL-1990 DEFINITION C.hominis 16S ribosomal RNA. ACCESSION M35014 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE C.hominis (strain ATCC 16826) rRNA. ORGANISM Cardiobacterium hominis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods. REFERENCE 1 (bases 1 to 1470) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1470 16S ribosomal RNA BASE COUNT 373 a 331 c 466 g 296 t 4 others ORIGIN 1 actgaagagt ttgattctgg ctcagattga acgctggcgg catgcttaac acatgcaagt 61 cgaacggaaa cgatggagct tgctccaggc gtcgagtggc gaacgggtga gtaacgcatg 121 ggaatctgcc ttttgctggg ggataacgta gggaaactta cgctaatacc gcataacacc 181 taagggtgaa agcgggggac cgaaagcctc gcggcaagag atgagcccat gttggattag 241 ctagttggtg gggtaaaggc ctaccaaggc gacgatccat agctggtctg agaggatgat 301 cagccacact gggactgaga cacggcccag actccnacgg gaggcagcag tggggaatat 361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt 421 gtaaagcact ttcagtaggg aggaaaggtg cgtagttaat acctgcgcaa ttgacgttac 481 ctacagaaga agcaccggct aactccgtgc cagcagccgc ggtaatacgg agggtgcgag 541 cgttattcgg aattactggg cgtaaagcgc acgcaggcgg ttgcccaagt cagatgtgaa 601 agccccgggc ttaacctggg aactgcattt gaaactgggc gactagagta tgaaagagga 661 aagcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga 721 aggcagcttt ctgggtcgat actgacgctc atgtgcgaaa gcgtggggag caaacaggat 781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggcgtcgggt tgttaaagac 841 tcggtgcngg agctaacgca ttaagttgac cgcctgggga gtacggccgc aaggttgaaa 901 ctcaaagaaa ttgacgggga cccgcncaag cggtggagca tgtggtttaa ttcgatgcaa 961 cgcgaagaac cttaccaggc cttgacatcc taggaacttg gcagagatgc cttggtgcct 1021 tcgggaacct agagacaggt gttgcatggc tgtcgtcagc tcgtgtcgtg agatgttggg 1081 ttaagtcccg caacgagcgc aacccttatc cttacttgcc agcactttgg gtgggaactg 1141 taaggagact gccggtgaca aaccggagga aggtggggat gacgtcaagt catcatggcc 1201 cttacggcct gggctacaca cgtgctacaa tggtcggtac agacggtagc gaagccgcga 1261 ggtggagcca atctgagaaa gccgatcgta gtccggattg cagtctgcaa ctcgactgca 1321 tgaagtcgga atcgctagta atcgcgaatc agcatgtcgc ggtgaatacg ttcccgggtc 1381 ttgtacacac tgcccgtcac accatgggag tttgttgcac cagaagcagg tagcttaact 1441 taggagggcg cttgccacgg tgtggccgat // LOCUS GLARGBA 763 bp ds-DNA INV 21-JUL-1990 DEFINITION G.lamblia 5.8S rRNA gene. ACCESSION M35013 KEYWORDS 5.8S ribosomal RNA; large subunit rRNA; small subunit rRNA. SOURCE G.lamblia DNA, clones B4 and C4. ORGANISM Giardia lamblia Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae. REFERENCE 1 (bases 1 to 763) AUTHORS Edlind,T.D., Sharetzsky,C. and Cha,M.E. TITLE Ribosomal RNA of the primitive eukaryote Giardia lamblia: Large subunit domain I and potential processing signals JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.D.Edlind, 04-JUN-1990. FEATURES from to/span description rRNA 177 321 5.8S ribosomal RNA rRNA < 1 146 small subunit ribosomal RNA rRNA 372 > 763 large subunit ribosomal RNA BASE COUNT 121 a 293 c 293 g 56 t ORIGIN 1 gtacacaccg cccgtcgctc ctaccgactg ggcgcggcgg cgagcgcccc ggacgcgcga 61 agggccgcga gcccccgcgc ctggaggaag gagaagtcgt aacaaggtat ccgtaggtga 121 acctgcggat ggatccctcg cgcgccccgc gcgtcgcccc cgcggcccgg tcggcccccg 181 aacgccccgc cggcggatgc ctcggcccgg gcggcgacga agagcgcggc ggagcgcgag 241 acgcggtgcg gacccgcccg ccccgagaag caccgaccct cgaacgcagc gcgccccggc 301 gccgccgcct cggcgcccgc cgcgtgccgc gcgccgcgcc gcagagagcg ccccggggcg 361 gtcccgccgg gctgcgcggc ccgaggcggc gggggcgacg ggcggaactt aagcatatca 421 gtacgccccg gaggagaaac caaccgggat tccccgtagc ggcgagcgac gcgggaggag 481 cccgccccga aggcgcgctg tggggcgcag gcgcaggccc gccgcgaggg ggcccgaggg 541 ccccgcccga gagggtgcaa gccccgtacg gcggccggcg cctgcgcgcg agtagcgctg 601 cttgagcgtg cagcgcggaa gggaggcgcg gcccttccaa ggctaaatac gccccgggac 661 cgatagcgga ccaagtagcg cgagcgaacg gtgaaaagga cgcctgcggc cgctcaaaag 721 acctgaaccg ccgccgcgcc gccggccccg tctcgaaaca cgg // LOCUS HEIRRDA 1472 bp ss-rRNA RNA 21-JUL-1990 DEFINITION H.influenzae 16S ribosomal RNA. ACCESSION M35019 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE H.influenzae (strain ATCC 33391) rRNA. ORGANISM Haemophilus influenzae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 1472) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1472 16S ribosomal RNA BASE COUNT 376 a 300 c 459 g 311 t 26 others ORIGIN 1 attgaagagt ttgatcatgg ctcagattga acgctggcgg caggcttaac acatgcaagt 61 cgaacggtag caggagaaag cttgctttct tgctgacgag tggcggacgg gtgagtaatg 121 cttgggaatc tggcttatgg agggggataa cgacgggaaa ctgtcgctaa taccgcgtat 181 tatcggaaga tgaaagtgcg ggactgagag gccgcatgcc ataggangag cccaagtggg 241 attaggtagt tggtggggta aatgcctacc aagcctncga tctctagctg gtctgagang 301 atgaccagcc acactggaac tgagacacgg tccagactcc tacgggaggc agcagtgggg 361 aatattgcgc natgggggga accctgacgc agccatgccg cgtgaatgaa gaaggccttc 421 gggttgtaaa gttctttcgg tattgaggaa ggttgatgtg ttaatagcac atcaaattga 481 cgttaaatac agaagaagca ccggctnact ccgtgccagc agccgcggta atacggaggg 541 tgcgagcgtt aatcggaata actgggcgta aagggcacgc aggcggttat ttaagtgagg 601 tgtgaaagcc ccgggcttaa cctgggnatt gcatttcaga ctgggtaact agagtacttt 661 agggaggggt agaattccac gtgtagcggt gaaatgcgta gagatgtgga ggaataccga 721 aggcgaaggc agccccttgg gaatgtactg acgctcatgt gcgaaagcgt ggggagcaaa 781 caggattaga taccctggta gtccacgctg taaacgctgt cgatttnggg gttggggttn 841 aactctggca cccgtagcta acgtgataaa tcgaccgcct ggggagtacg gccgcaaggt 901 taaaactcaa atgaattgac gggggccngc ncaannggtg gagcatgtgg tttaattcga 961 tgnaacgcga agaaccttac ctactcttga catcctaaga agagctcaga gatgagcttg 1021 tgccttcggg aacttagaga caggtgctgc atggctgtcg tcagctcgtg ttgtgaaatg 1081 ttgggttnag tcccgcaacg agcgcaaccc ttatcctttg ttgccagcga cttggtcggg 1141 aactcaaagg agactgccag tgataaactg gaggaaggtn gggatgacgt caagtcatca 1201 tggcccttac gagtagggct acacacgtgc tacaatggcg tatacagagg gaagcgaagc 1261 tgcgaggtgg agcgaatctc ataaagtacg tctaagtccg gattggagtc tgcaactcga 1321 ctccatgaag tcggaatcgc tagtaatcgc gaatcagaat gtcgcggtga atacgttccc 1381 gggccttgta cacaccgccc gtcacaccnt gggagtgggt tgtaccagaa gtagatagct 1441 naacctnnnn gagggcgtnn ncnacggtan ga // LOCUS HUMIBSUB 3110 bp ss-mRNA PRI 21-JUL-1990 DEFINITION Human integrin beta-5 subunit mRNA, complete cds. ACCESSION M35011 KEYWORDS integrin. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3110) AUTHORS Suzuki,S., Huang Z,-S. and Tanihara,H. TITLE Cloning of a new integrin beta subunit exhibiting high homology with integrin beta-3 subunit JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Suzuki, 05-JUN-1990. FEATURES from to/span description pept 30 2420 integrin beta-5 subunit BASE COUNT 726 a 809 c 887 g 688 t ORIGIN 1 cgcgccgccg ctgagggagg cgccccacca tgccgcgggc cccggcgccg ctgtacgcct 61 gcctcctggg gctctgcgcg ctcctgcccc ggctcgcagg tctcaacata tgcactagtg 121 gaagtgccac ctcatgtgaa gaatgtctgc taatccaccc aaaatgtgcc tggtgctcca 181 aagaggactt cggaagccca cggtccatca cctctcggtg tgatctgagg gcaaaccttg 241 tcaaaaatgg ctgtggaggt gagatagaga gcccagccag cagcttccat gtcctgagga 301 gcctgcccct cagcagcaag ggttcgggct ctgcaggctg ggacgtcatt cagatgacac 361 cacaggagat tgccgtgaac ctccggcccg gtgacaagac caccttccag ctacaggttc 421 gccaggtgga ggactatcct gtggacctgt actacctgat ggacctctcc ctgtccatga 481 aggatgactt ggacaatatc cggagcctgg gcaccaaact cgcggaggag atgaggaagc 541 tcaccagcaa cttccggttg ggatttgggt cttttgttga taaggacatc tctcctttct 601 cctacgcggc accgaggtac cagaccaatc cgtgcattgg ttacaagttg tttccaaatt 661 gcgtcccctc ctttgggttc cgccatctgc tgcctctcac agacagagtg gacagcttca 721 atgaggaagt tcggaaacag agggtgtccc ggaaccgaga tgcccctgag gggggctttg 781 atgcagtact ccaggcagcc gtctgcaagg agaagattgg ctggcgaaag gatgcactgc 841 atttgctggt gttcacaaca gatgatgtgc cccacatcgc attggatgga aaattgggag 901 gcctggtgca gccacacgat ggccagtgcc acctgaacga ggccaacgag tacactgcat 961 ccaaccagat ggactatcca tcccttgcct tgcttggaga gaaattggca gagaacaaca 1021 tcaacctcat ctttgcagtg acaaaaaacc attatatgct gtacaagaat tttacagccc 1081 tgatacctgg aacaacggtg gagattttag atggagactc caaaaatatt attcaactga 1141 ttattaatgc atacaatagt atccggtcta aagtggagtt gtcagtctgg gatcagcctg 1201 aggatcttaa tctcttcttt actgctacct gccaagatgg ggtatcctat cctggtcaga 1261 ggaagtgtga gggtctgaag attggggaca cggcatcttt tgaagtatca ttggaggccc 1321 gaagctgtcc cagcagacac acggagcatg tgtttgccct gcggccggtg ggattccggg 1381 acagcctgga ggtgggggtc acctacaact gcacgtgcgg ctgcagcgtg gggctggaac 1441 ccaacagcgc caggtgcaac gggagcggga cctatgtctg cggcctgtgt gagtgcagcc 1501 ccggctacct gggcaccagg tgcgagtgcc aggatgggga gaaccagagc gtgtaccaga 1561 acctgtgccg ggaggcagag ggcaagccac tgtgcagcgg gcgtggggac tgcagctgca 1621 accagtgctc ctgcttcgag agcgagtttg gcaagatcta tgggcctttc tgtgagtgcg 1681 acaacttctc ctgtgccagg aacaagggag tcctctgctc aggccatggc gagtgtcact 1741 gcggggaatg caagtgccat gcaggttaca tcggggacaa ctgtaactgc tcgacagaca 1801 tcagcacatg ccggggcaga gatggccaga tctgcagcga gcgtgggcac tgtctctgtg 1861 ggcagtgcca atgcacggag ccgggggcct ttggggagat gtgtgagaag tgccccacct 1921 gcccggatgc atgcagcacc aagagagatt gcgtcgagtg cctgctgctc cactctggga 1981 aacctgacaa ccagacctgc cacagcctat gcagggatga ggtgatcaca tgggtggaca 2041 ccatcgtgaa agatgaccag gaggctgtgc tatgtttcta caaaaccgcc aaggactgcg 2101 tcatgatgtt cacctatgtg gagctcccca gtgggaagtc caacctgacc gtcctcaggg 2161 agccagagtg tggaaacacc cccaacgcca tgaccatcct cctggctgtg gtcggtagca 2221 tcctccttgt tgggcttgca ctcctggcta tctggaagct gcttgtcacc atccacgacc 2281 ggagggagtt tgcaaagttt cagagcgagc gatccagggc ccgctatgaa atggcttcaa 2341 atccattata cagaaagcct atctccacgc acactgtgga cttcaccttc aacaaatcct 2401 acaatggcac tgtggactga tgtttccttc tccgaggggc tggagcgggg atctgatgaa 2461 aaggatcaga ctgaaacgcc ttgcacggct gctcggcttg atcacagctc cctaggtagg 2521 caccacagag aagaccttct agtgagcctg ggccaggagc ccacagtgcc tgtacaggaa 2581 ggtgcctggc catgtcacct ggctgctagg ccagagccat gccaggctgc gtccctccga 2641 gcttgggata aagcaagggg accttggcgc tctcagcttt ccctgccaca tccagcttgt 2701 tgtcccaatg aaatactgag atgctgggct gtctctccct tccaggaatg ctgggccccc 2761 agcctggcca gacaagaaga ctgtcaggaa gggtcggagt ctgtaaaacc agcatacagt 2821 ttggcttttt tcacattgat catttttata tgaaataaaa agatcctgca tttatggtgt 2881 agttctgagt cctgagactt ttctgcgtga tggctatgcc ttgcacacag gtgttggtga 2941 tggggctgtt gagatgcctg ttgaaggtac atcgtttgca aatgtgagtt tcctctcctg 3001 tccgtgtttg tttagtactt ttataatgaa aagaaacaag attgtttggg attggaagta 3061 aagattaaaa ccaaaagaat ttgtgtttgt ctgataaaaa aaaaaaaaaa // LOCUS KINRRDAA 1474 bp ss-rRNA RNA 21-JUL-1990 DEFINITION K.indologenes 16S ribosomal RNA. ACCESSION M35015 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE K.indologenes (strain ATCC 25869) rRNA. ORGANISM Kingella indologenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1474) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1474 16S ribosomal RNA BASE COUNT 375 a 322 c 451 g 315 t 11 others ORIGIN 1 ttgaactgaa gagtttgatt ctggctcaga ttgaacgctg gcggcatgct taacacatgc 61 aagtcgaacg agggaagcag cttgctgcng acctngtggc ggacgggtga gtaacgtata 121 ggaatctacc ttggggtggg ggataacgta tggaaacgta cgctaatacc acataacact 181 tacgagtcaa agcgggggat cttcggacct cgcgccctta gatgagccta tattggatta 241 gctagttggt rgggtaaagg cctaccaagg cgacgatcca tagctggttt gagaggatga 301 tcagccacat cgggactgag acacggcccn nactcctacg ggaggcagca gtggggaata 361 ttggncnatg ggggcaaccc tgatccagca atgccgcgtg tgtgaagaag gccttngggt 421 tgtaaagcac tttcgttagt gaagaaaggt ttatggttaa tacctgtgaa tttgacatta 481 gctaaagaag aagcaccggc taactccgtg ccagcagccg cggtaatacg gagggtgcna 541 gcgttattcg gaatgactgg gcgtaaagcg cacgcaggcg gttatttaag tcagatgtga 601 aagccccggg cttaacctgg gnattgcatt tgaaactggg taactagagt atgtgagagg 661 aaggcggaat ttccagtgta gcagtgaaat gcgtagagat tggaaggaac accgatggcg 721 aaggcagcct tctggggcaa tactgacgct catgtgcgaa agcgtgggta gcaaacagga 781 ttagataccc tggtagtcca cgccctaaac gatgtcaact aggtgttggg cacttaagtg 841 ctcggtaccg cagctaacgc attaagttga ccgcctgggg agtacggccg caaggttgaa 901 actcaaagaa attgacgggg acccgcacaa gcggtggagc atgtggttta attcgatgca 961 acgcgaagaa ccttaccagg ccttgacatc cagagaagtt accagagatg gtttcgtgcc 1021 ttcgggaact ctgagacagg tgttgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg 1081 gttaagtccc gcaacgagcg caacccttat ccttacttgc cagcacttcg ggtgggaact 1141 ataaggagac tgccggtgac aaaccggagg aaggtgggga tgacgtcaag tcatcatggc 1201 ccttacggcc tgggctacac acgtgctaca atggtcggta cagacggttg ccaacccgcg 1261 agggggagct aatctgagaa agccgatcgt agtccggatt gcactctgca actcgagtgc 1321 atgaagtcgg aatcgctagt aatcgcgaat cagaacgtcg cggtgaatac gttcccgggt 1381 cttgtacaca ctgcccgtca caccatggga gtgggttgca ccagaagtag ctagcttatc 1441 agggcggtta ccacggtgtg attcangact aggg // LOCUS MUSTNFBG 3219 bp ds-DNA ROD 21-JUL-1990 DEFINITION Mouse tumor necrosis factor-beta (lymphotoxin) gene. ACCESSION Y00137 KEYWORDS lymphotoxin; signal peptide; tumor necrosis factor. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3219) AUTHORS Gray,P.W., Chen,E., Tang,W.-L. and Ruddle,N. TITLE The murine tumor necrosis factor-beta (lymphotoxin) gene sequence JOURNAL Nucleic Acids Res. 15, 3937-3937 (1987) STANDARD simple automatic REFERENCE 2 (sites for [1]) AUTHORS Fashena,S.J., Tang,W.-L., Sarr,T. and Ruddle,N.H. TITLE The murine lymphotoxin gene promoter: Characterization and negative regulation JOURNAL J. Immunol. 145, 177-183 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1632 1727 lymphotoxin precursor, exon 2 (first expressed exon) 1811 1910 lymphotoxin precursor, exon 3 2135 2547 lymphotoxin precursor, exon 4 sigp 1632 1727 lymphotoxin signal peptide 1811 1813 lymphotoxin signal peptide matp 1814 1910 lymphotoxin 2135 2547 lymphotoxin pre-msg 685 3121 lymphotoxin mRNA and introns (minor alt.) pre-msg 857 3121 lymphotoxin mRNA and introns (minor alt.) pre-msg 865 3121 lymphotoxin mRNA and introns (minor alt.) pre-msg 1114 3121 lymphotoxin mRNA and introns (minor alt.) pre-msg 1179 3121 lymphotoxin mRNA and introns (major alt.) IVS 1286 1622 put. intron I IVS 1728 1810 intron II IVS 1911 2134 intron III signal 1153 1158 put. TATA-box BASE COUNT 724 a 971 c 744 g 780 t ORIGIN 1 tgaaagctcc ctctgtacag agcattggaa gcctggggtg tacatttggg gttacatgat 61 cttggggttc taagagaata cccccaaatc atcttccaga cctggaacat tctaggacag 121 ggttctcaac cttcctaact ccatgaccct ttaatacagt tcctcatgtt gtggtgaccc 181 caaccataca attattttcg ttgctatttc ataactgtaa tttcgctgct attatgaaca 241 taatgtaaat atttgtttta aatagaggtt tgccaaagag accttgccac aggttgagac 301 tgccgctcca gagagtaagg gacacattaa aattgttaca caccagatcc cccaaatttg 361 gggagagggc actgtaatgg aacttcttga cattaaactg gcagataaac tggcagaaaa 421 aaaaaaaaaa aagctgggca gtggtggcac acacctttaa tcccagcact tgggaggcag 481 aggcaggcgg atttctgagt tctaggccag cctggtcgac agagtgagtt tcaggacagc 541 cagggctaca cagagaaacc ctgtctcgaa aaaagcaaaa aaaaaaaaaa aaaactggca 601 gatgaccaga aaatacagat atattggaat aactgtgact tgaaccccca aagacaagag 661 aggaaatagg cctgaagggg cggcaggcat gtcaagcatc cagagccctg ggttcgaacc 721 tgaaaaaaca aaggtgccgc taaccacatg tggcttcgga gccctccaga catgaccatg 781 atcgacagag agggaaatgt gcagagaagc ctgtgagcag tcaagggtgc agaagtgata 841 taaaccatca ctcttcaggg aaccaggctt ccagtcacag cccagctgta ccctctccac 901 gaattgctcg gccgttcact ggaactcctg ggcctgaccc agctccctgc tagtccctgc 961 ggcccacagt tccccggacc cgactccctt tcccagaacg cagtagtcta agcccttagc 1021 ctgcggttct ctcctaggcc ccagcctttc ctgccttcga ctgaaacagc agcatcttct 1081 aagcctgggg cttccccaag ccccagcccc gacctagaac ccgcccgctg cctgccacac 1141 tgccgcttcc tctataaagg gacccgacgc cagcgcccag gaccccgcac agcaggtgag 1201 cctctcctac cctgtctcct tgggcttacc ctggtatcag gcatccctca ggatccccag 1261 ccttaatggg tctggtcctc ctgtcgtggc tttgattttt ggtctgttcc tgtggcggcc 1321 ttatcagtct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct 1381 ctctctctct ctctttctct ctctctgcct ctgttagcca ttgtctgttt ctatggtgga 1441 gctttcctct tcccctctgt ctctccttat ccctgctcac ttcagggttc ccctgcctgt 1501 ccccttttct gtctgtcgcc ctgtctctca gggtggctgt ctcagctggg aggtaaggtc 1561 tgtcttcctc tgtgtgcccc gcctccgcta cacacacaca ctctctctct ctctctcagc 1621 aggttctcca catgacactg ctcggccgtc tccacctctt gagggtgctt ggcacccctc 1681 ctgtcttcct cctggggctg ctgctggccc tgcctctagg ggcccaggtg aggcagcaag 1741 agattggggg tgctggggtg gcctagctaa ctcagagtcc tagagtcctc tccactctct 1801 tctgtcccag ggactctctg gtgtccgctt ctccgctgcc aggacagccc atccactccc 1861 tcagaagcac ttgacccatg gcatcctgaa acctgctgct caccttgttg gtaaacttct 1921 gcctccagag gagaggtcca gtccctgcct tttgtcctac ttgcccaggg gcccaggcga 1981 tcttcccatc tccccacacc aacttttctt accctaaggg caggcacccc actcccaatc 2041 tccctaccaa ccatcccact tgtccagtgc ctgctcctca gggatgggga cctctgatct 2101 tgatagcccc ccaatgtctt gtgcctcttc ccagggtacc ccagcaagca gaactcactg 2161 ctctggagag caagcacgga tcgtgccttt ctccgacatg gcttctcttt gagcaacaac 2221 tccctcctga tccccaccag tggcctctac tttgtctact cccaggtggt tttctctgga 2281 gaaagctgct cccccagggc cattcccact cccatctacc tggcacacga ggtccagctc 2341 ttttcctccc aatacccctt ccatgtgcct ctcctcagtg cgcagaagtc tgtgtatccg 2401 ggacttcaag gaccgtgggt gcgctcaatg taccaggggg ctgtgttcct gctcagtaag 2461 ggagaccagc tgtccaccca caccgacggc atctcccatc tacacttcag ccccagcagt 2521 gtattctttg gagcctttgc actgtagatt ctaaagaaac ccaagaattg gattccaggc 2581 ctccatcctg accgttgttt caagggtcac atccccacag tctccagcct tccccactaa 2641 aataacctgg agctctcacg ggagtctgag acacttcagg ggactacatc ttccccaggg 2701 ccactccaga tgctcagggg acgactcaag cctacctaga agttcctgca cagagcaggg 2761 tttttgtggg tctaggtcgg acagagacct ggacatgaag gagggacaga catgggagag 2821 gtggctggga acaggggaag gttgactatt tatggagaga aaagttaagt tatttattta 2881 tagagaatag aaagagggga aaaatagaaa gccgtcagat gacaactagg tcccagacac 2941 aaaggtgtct cacctcagac aggacccatc taagagagag atggcgagag aattagatgt 3001 gggtgaccaa ggggttctag aagaaagcac gaagctctaa aagccagcca ctgcttggct 3061 agacatccac agggaccccc tgcaccatct gtgaaaccca ataaacctct tttctctgag 3121 attctgtctg cttgtgtctg tcttgcgttg ggggagaaac ttcctggtct ctttaaggag 3181 tggagcaggg gacagaggcc tcagttggcc atgggatcc // LOCUS NGORRDAA 1478 bp ss-rRNA RNA 21-JUL-1990 DEFINITION N.denitrificans 16S ribosomal RNA. ACCESSION M35020 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE N.denitrificans (strain ATCC 14686) rRNA. ORGANISM Neisseria denitrificans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1478) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1478 16S ribosomal RNA BASE COUNT 380 a 331 c 448 g 308 t 11 others ORIGIN 1 aacataagag tttgatcctg gctcagattg aacgctggcg gcatgcttta cacatgcaag 61 tcgaacggca gcggagaagt gcttgcactt ctgccggcga gtggcgaacg ggtgagtaac 121 atattggaac gtaccgagta atgggggata actaatcgaa agattagcta ataccgcata 181 cgtcctgagg gagaaagcgg gggatcttcg gacctcgcgt tattcgagcg gccaatatct 241 gattagctag ttggtggggt aaaggcctac caaggcgacg atcagtagcg ggtctgagag 301 gatgatccgc cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg 361 gaattttgga cgatgggcgc aagcctgatc cagccatgcc gcgtgtatga agaaggcctt 421 cgggttgtaa agtacttttg tnngggaaga aaagggattt gctaataccg aatcctcatg 481 acggtacctn aagaataagc accggctaac tacgtgccag nanccgcggn natacgtagg 541 gtgcgagcgt taatcggaat tactgggcgt aaagcgagcg cagacggtta cttaagcagg 601 atgtgaaatc cccgagctca acttgggaac tgcgttctga actgggtagc tagagtatgt 661 cngagggggg tagaattcca cgtgtagcag tgaaatgcgt agagatgtgg aggaataccg 721 atggcgaagg cagccccctg ggataatact gacgttcatg ctcgaaagcg tgggtagcaa 781 acaggattag ataccctggt agtccacgcc ctaaacgatg tcgattagct gttggggtac 841 ttgatacctt agtagcgtag ctaacgcgtg aaattgaccg cctggggagt acggtcgcaa 901 gattaaaact caaaggaatt gacggggacc cgcacaagcg gtggatgatg tggattaatt 961 cgatgcaacg cgaagaacct tacctggtct tgacatgtac ggaatctccc agagacggga 1021 gngtgccttc gggagccgta acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag 1081 atgttgggtt aagtcccgca acgagcgcaa cccttgtcat tagttgccat cattcagttg 1141 ggcactctaa tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtcaagtcct 1201 catggccctt atgaccaggg cttcacacgt catacaatgg tcggtacaga gggtagccaa 1261 gccgcgaggt ggagccaatc tcacaaaacc gatcgtagtc cggattgcac tctgcaactc 1321 gagtgcatga agtcggaatc gctagtaatc gcaggtcagc atactgcggt gaatacgttc 1381 ccgggtcttg tacacaccgc ccgtcacacc ntgggagtgg gggataccag aagtaggtag 1441 ggtaaccgca aggagcccnc ttaccacggt atacttca // LOCUS PASRRDA 1481 bp ss-rRNA RNA 21-JUL-1990 DEFINITION P.multocida 16S ribosomal RNA. ACCESSION M35018 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE P.multocida (strain NCTC 10322) rRNA. ORGANISM Pasteurella multocida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 1481) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1481 16S ribosomal RNA BASE COUNT 374 a 305 c 459 g 321 t 22 others ORIGIN 1 attgnagagt ttgancangg ctcagattga ncgctggcgg caggcttaac acatgcaagt 61 cgaacggtag caggaagaaa gcttnctttc tttgctgacg agtggcggac gggtgagtaa 121 tgcttgggna tctggcttat ggagggggat aactgtggga aactgcagct aataccgcgt 181 attctcttag gaggaaaggg tgggaccttn gggccacctg ccataagatg agcccaagtg 241 ggattaggta gttggtgggg taaaggccta ccaagcctnc gatctctagc tngtctgaga 301 ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg 361 ggaatattgc gcaatggggg gaaccctgac gcagccatgc cgcgtgaatg aagaaggcct 421 tcgggttgta aagttctttc ggtnatgagg aagggatgtt nttaaataga tagcatcatt 481 gacgttaatt ncagaagaag caccggctna ctccgtgcca gcagccgcgg taatacggag 541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggac ttttaagtga 601 gatgtgaaat ccccgagctt aacttgggaa ctgcatttca gactgggagt ctagagtnct 661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc 721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcnaaagc gtggggagca 781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct 841 ntntgcttgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag 901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg 961 atgnaacgcg aagaacctta cctactcttg acatcctaag aagagctcag agatgagctt 1021 gtgccttcgg gaacttagag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat 1081 gttgggttaa gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attcggtcgg 1141 gaactcaaag gagactgcca gtgannaact ggaggaaggt ggggatgacg tcaagtcatc 1201 atggccctta cgagtagggc tacacacgtg ctacaatggt gcatacagag ggcagcgaga 1261 gtgcgagctt gagcgaatct cagaaagtgc atctaagtcc ggattggagt ctgcaactcg 1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc 1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc 1441 ttaaccntcg ggggggcgtt taccacggta tgattcatga c // LOCUS RATSUBB 1124 bp ss-mRNA ROD 21-JUL-1990 DEFINITION Rat F-0-ATPase subunit b mRNA, complete cds. ACCESSION M35052 KEYWORDS F-0-ATPase subunit b. SOURCE Rat (strain Wistar) liver, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1124) AUTHORS Tsurumi,C., Yoshihara,Y., Osaka,F., Yamada,F., Tani,I., Higuti,T., Shimizu,M., Oeda,K., Ohkawa,H., Toda,H., Kakuno,T., Sakiyama,F., Kumatori,A., Tanaka,K. and Ichihara,A. TITLE cDNA cloning and sequencing for the import precursor of subunit b in H+-ATP synthase from rat mitochondria JOURNAL Biochem. Biophys. Res. Commun. 169, 136-142 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Higuti, 04-JUN-1990. FEATURES from to/span description pept 60 830 F-0-ATPase subunit b precursor sigp 60 185 F-0-ATPase subunit b signal peptide matp 186 827 F-0-ATPase subunit b signal 1106 1111 poly-A signal BASE COUNT 323 a 241 c 266 g 294 t ORIGIN 1 gagtgtctcg ggtcggggtc acagggacat cggtaccgct gtctggatct tcgcagacaa 61 tgctgtcccg ggtggtactt tctgccgccg ccacagcggc cccgtgtctg aagaacgcgg 121 ccgtcctggg tccaggggtt ttacaggcaa caagggtctt tcacacagga cagcctcgcc 181 ttgcccctct accacctctt cctgaatatg gaggaaaagt acgtcttggg ctgattcctg 241 aggaattttt ccagttcctt taccctaaga ctggtgtaac aggaccctac gtgcttggaa 301 ctgggcttag cttgtacttt ctatccaaag aaatatatgt gattacccca gagaccttct 361 ctaccatatc agtagtaggg ttgattgtct atgtgattaa gaaatatggc gcctctattg 421 gagaatttat tgacaaactt aatgaggaaa aaattgctca actagaagaa ataaagcagt 481 caagcatgaa gcaaatccag gatgcaatca acagggagaa ggcacagcag gcactggttc 541 agaagcgcca ctacctcttc gatgttcaga ggaataacat tgccctggcc ttggaggtca 601 cttaccggga acgactacat aaagcatata aggaggtaaa gaatcgcctg gactaccata 661 tttctgtaca ggacatgatg cgtcgcaagg agggagagca catgataaac tgggtggaga 721 agcatgtgat acagagcatt tctgcacagc aggaaaagga gaccattgcc aagtgcatcg 781 gagatctaaa gatgcttgca aagaaggctc aagctcagcc aattatgtga atgtgtctat 841 ctcagttgtg atagccagag agggttaact taaatgagaa ctactctact tgaagaactc 901 tttctgtatt gctgtctaat gaaattgtgg tgtacctttc ctaagaaatg atcaaggtcc 961 atttagtggc ttaaccctat ttctgatcct tgctctgtat tcggagttgt ctgatcacat 1021 ttgagtaagc aatttgcagc aactcgctgc ccaacagaaa ttaccaagtt atagtttaaa 1081 cttgtaatta gttaaaacat cttacaataa aatgtttgaa acag //
GenBank-Updates@genbank.bio.net (07/22/90)
LOCUS BORFLAG 684 bp ds-DNA BCT 22-JUL-1990 DEFINITION B.burgdorferei flagellin gene, 5' end. ACCESSION M34710 KEYWORDS flagellin. SOURCE B.burgdorferei (strain B31; isolate ATCC #35210) DNA. ORGANISM Borrelia burgdorferei Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; Spirochaetales; Spirochaetaceae. REFERENCE 1 (bases 201 to 684) AUTHORS Picken,R.N. TITLE Nucleotide sequence of the flagellin gene of Borrelia hermsii and identification of oligonucleotides for the specific amplification and detection of Borrelia burgdorferi flagellin sequence by the polymerase chain reaction JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 200) AUTHORS Picken,R.N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by R.N.Picken, 18-MAY-1990. Author address:R.N.Picken: Baxter Healthcare Corp., Pandex Division 909 Orchard Street Mundelein, IL 60060 FEATURES from to/span description pept 493 > 684 flagellin binding 480 486 ribosome binding site BASE COUNT 249 a 74 c 128 g 233 t ORIGIN 1 tctagatatt taagacttga tgagaaaaag tttgatgaat caattcgtaa taatattgat 61 aatgttagag agcttttttt atatgatctt aatggtgaca gagtgtatga taatggaatt 121 gctaaaatgc taggagattg tctgtcgcct cttgtggctt ccggaggagt tatttataat 181 aaaataaaga attacgactt gaaaattttt aatcaaaaaa ataaagttga agattataaa 241 aagaagtacg aagatagaga gagaaaagtg gaaggtgaac ttaatacctt ggattttacc 301 gttaagcgca tgaaagatca agaaaataca ttaaaggctt ttgattttaa tcaaagaaat 361 aaataataat aattattttt aatgctattg ctatttgcgt ttcttttttt ttaatttttg 421 tgctattctt tttaacaggc aaaaggattt gccaaagtca gaaatttaaa ttttatcatg 481 gaggaatgat atatgattat caatcataat acatcagcta ttaatgcttc aagaaataat 541 ggcattaacg ctgctaatct tagtaaaact caagaaaagc tttctagtgg gtacagaatt 601 aatcgagctt ctgatgatgc tgctggcatg ggagtttctg gtaagattaa tgctcaaata 661 agaggtttgt cacaagcttc taga // LOCUS BORGFLAGA 620 bp ds-DNA BCT 22-JUL-1990 DEFINITION B.burgdorferei DNA. ACCESSION M34711 KEYWORDS . SOURCE B.burgdorferei (strain B31; isolate ATCC #35210) DNA. ORGANISM Borrelia burgdorferei Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; Spirochaetales; Spirochaetaceae. REFERENCE 1 (bases 1 to 620) AUTHORS Picken,R.N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.N.Picken, 18-MAY-1990. Author address:R.N.Picken: Baxter Healthcare Corp., Pandex Division 909 Orchard Street Mundelein, IL 60060 BASE COUNT 220 a 113 c 125 g 162 t ORIGIN 1 tctagaaata cttcaaaggc tattaatttt attcagacaa cagaagggaa tttaaatgaa 61 gtagaaaaag tcttagtaag aatgaaggaa ttggcagttc aatcaggtaa cggcacatat 121 tcagatgcag acagaggttc tatacaaatt gaaatagagc aacttacaga cgaaattaat 181 agaattgctg atcaagctca atataaccaa atgcacatgt tatcaaacaa atctgcttct 241 caaaatgtaa gaacagctga agagcttgga atgcagcctg caaaaattaa cacaccagca 301 tcactttcag ggtctcaagc gtcttggact ttaagagttc atgttggagc aaaccaagat 361 gaacctattg ctgtaaatat ttatgcagct aatgttgcaa atcttttctc tggtgaggga 421 gctcaaactg ctcaggctgc accggttcaa gagggtgttc aacaggaagg agctcaacag 481 ccagcacctg ctacagcacc ttctcaaggc ggagttaatt ctcctgttaa tgttacaact 541 acagttgatg ctaatacatc acttgctaaa attgaaaatg ctattagaat gataagtgat 601 caaagggcaa atttaggtgc // LOCUS SSOBETA 2346 bp ds-DNA BCT 22-JUL-1990 DEFINITION S.solfataricus beta-D-galactosidase (lacS) gene, complete cds. ACCESSION M34696 KEYWORDS beta-D-galactosidase. SOURCE S.solfataricus (strain MT-4) DNA, clone C1. ORGANISM Sulfolobus solfataricus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 2346) AUTHORS Cubellis,M.V., Rozzo,C., Montecucchi,P. and Rossi,M. TITLE Isolation and sequencing of a new beta-galactosidase archaebacterial gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.V.Cubellis, 18-MAY-1990. Author address:M.V.Cubellis Dipt. Chimica Organica e Biologica Universita di Napoli Via Mezzocannone, 16 80134 Napoli, Italia FEATURES from to/span description pept 229 1698 beta-D-galactosidase (lacS) (EC 3.2.1.23) BASE COUNT 770 a 402 c 487 g 687 t ORIGIN 1 aaggagaaac ttggcagttt ataacttgac agtaggttgt ggagtgatga ctggatcaat 61 actaggagga gtagcatata attacgttac acaattttat aacccaatat attcaataga 121 ccttatgctt atcctatcct ctattctaag attctcggta tctcccctat tcttgaccat 181 aaaagatact cgctcaaagc ttaaataata ttaatcataa ataaagtcat gtactcattt 241 ccaaatagct ttaggtttgg ttggtcccag gccggatttc aatcagaaat gggaacacca 301 gggtcagaag atccaaatac tgactggtat aaatgggttc atgatccaga aaacatggca 361 gcgggattag taagtggaga tctaccagaa aatgggccag gctactgggg aaactataag 421 acatttcacg ataatgcaca aaaaatggga ttaaaaatag ctagactaaa tgtggaatgg 481 tctaggatat ttcctaatcc attaccaagg ccacaaaact ttgatgaatc aaaacaagat 541 gtgacagagg ttgagataaa cgaaaacgag ttaaagagac ttgacgagta cgctaataaa 601 gacgcattaa accattacag ggaaatattc aaggatctta aaagtagagg actttacttt 661 atactaaaca tgtatcattg gccattacct ctatggttac acgacccaat aagagtaaga 721 agaggagatt ttactggacc aagtggttgg ctaagtacta gaacagttta cgaattcgct 781 agattctcag cttatatagc ttggaaattc gatgatctag tggatgagta ctcaacaatg 841 aatgaaccta acgttgttgg aggtttagga tacgttggtg ttaagtccgg ttttccccca 901 ggatacctaa gctttgaact ttcccgtagg catatgtata acatcattca agctcacgca 961 agagcgtatg atgggataaa gagtgtttct aaaaaaccag ttggaattat ttacgctaat 1021 agctcattcc agccgttaac ggataaagat atggaagcgg tagagatggc tgaaaatgat 1081 aatagatggt ggttctttga tgctataata agaggtgaga tcaccagagg aaacgagaag 1141 attgtaagag atgacctaaa gggtagattg gattggattg gagttaatta ttacactagg 1201 actgttgtga agaggactga aaagggatac gttagcttag gaggttacgg tcacggatgt 1261 gagaggaatt ctgtaagttt agcgggatta ccaaccagcg acttcggctg ggagttcttc 1321 ccagaaggtt tatatgacgt tttgacgaaa tactggaata gatatcatct ctatatgtac 1381 gttactgaaa atggtattgc ggatgatgcc gattatcaaa ggccctatta tttagtatct 1441 cacgtttatc aagttcatag agcaataaat agtggtgcag atgttagagg gtatttacat 1501 tggtctctag ctgataatta cgaatgggct tcaggattct ctatgaggtt tggtctgtta 1561 aaggtcgatt acaacactaa gagactatac tggagaccct cagcactagt atatagggaa 1621 atcgccacaa atggcgcaat aactgatgaa atagagcact taaatagcgt acctccagta 1681 aagccattaa ggcactaaac tttctcaagt ctcactatac caaatgagtt ttcttttaat 1741 cttattctaa tctcattttc attagattgc aatactttca taccttctat attatttatt 1801 ttgtaccttt tgggatctac acttaatgtt agcctaattg gaaagtcatt tagatttaat 1861 actgttacca gtccatccct tttaattatt aatgaaaata agaagggata agtagcgata 1921 gcccttattc cgatatggtc tccaacaata tcccttatta tctgccttgc aacactaggg 1981 tagaactctg aaatcagata tggtaggtaa gttgtaagtg ataggacgta aactttagag 2041 ttagagtaag tgttctgaaa gactactggg tgcaattcga caccgttata ggcgtaaagg 2101 attggcgtag ctccgtttaa tgaaaatata ggtcctacag ggaaattggc ttgcctcttg 2161 taatatgacc aatagaacgt tttcccatcc ctggttaacg cattgacact aacactatcg 2221 taaatcaagt taccgacacc aagaattttc agtgcagtat cccccaagac ttcaataagc 2281 tttttagctg cacttgctgt aaacattaag ttaactcccc tattaagtaa atccacaata 2341 tctaga //
GenBank-Updates@genbank.bio.net (07/24/90)
LOCUS HUMHPRTB 56736 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete cds. ACCESSION M26434 KEYWORDS Alu repetitive element; LINE repetitive element; hypoxanthine phosphoribosyltransferase. SOURCE Human cell line 4X,Y (GM1202), fetal liver, and beta thalassemia patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 56736) AUTHORS Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J., Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W. TITLE Automated DNA sequencing of the human HPRT locus JOURNAL Genomics 6, 593-608 (1990) STANDARD full staff_entry REFERENCE 2 (sites; mutations causing gout) AUTHORS Argos,P., Hanei,M., Wilson,J.M. and Kelley,W.N. TITLE A possible nucleotide-binding domain in the tertiary fold of phosphoribosyltransferase JOURNAL J. Biol. Chem. 258, 6450-6457 (1983) STANDARD full staff_entry REFERENCE 3 (sites; mutations causing gout) AUTHORS Cariello,N.F., Scott,J.K., Kat,A.G., Thilly,W.G. and Keohavong,P. TITLE Resolution of a missense mutant in human genomic DNA by denaturing gradient gel electrophoresis and direct sequencing using in vitro DNA amplification: HPRT-Munich JOURNAL Am. J. Hum. Genet. 42, 726-734 (1988) STANDARD full staff_entry REFERENCE 4 (sites; mutations causing gout) AUTHORS Davidson,B.L., Chin,S.J., Wilson,J.M., Kelley,W.N. and Palella,T.D. TITLE Hypoxanthine-guanine phosphoribosyltransferase. Genetic evidence for identical mutations in two partially deficient subjects JOURNAL J. Clin. Invest. 82, 2164-2167 (1988) STANDARD full staff_entry REFERENCE 5 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Davidson,B.L., Palella,T.D. and Kelly,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase: A single nucleotide substitution in cDNA clones isolated from a patient with Lesch-Nyhan syndrome (HPRT-Midland) JOURNAL Gene 68, 85-91 (1988) STANDARD full staff_entry REFERENCE 6 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D. TITLE Genetic basis of hypoxanthine guanine phosphoribosyltransferase deficiency in a patient with Lesch-Nyhan syndrome (HPRT-Flint) JOURNAL Gene 63, 331-336 (1988) STANDARD full staff_entry REFERENCE 7 (sites; mutations causing Lesch-Nyhan syndrome and gout) AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D. TITLE Human hypoxanthine-guanine phosphoribosyltransferase deficiency. The molecular defect in a patient with gout (HPRT-Ashville) JOURNAL J. Biol. Chem. 264, 520-525 (1989) STANDARD full staff_entry REFERENCE 8 (sites; mutations causing gout) AUTHORS Davidson,B.L., Tarle,S.A., Palella,T.D. and Kelley,W.N. TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase deficiency in ten subjects determined by direct sequencing of amplified transcripts JOURNAL J. Clin. Invest. 84, 342-346 (1989) STANDARD full staff_entry REFERENCE 9 (sites; mutations causing gout) AUTHORS Dush,M.K., Sikela,J.M., Khan,S.A., Tischfield,J.A. and Stambrook,P.J. TITLE Nucleotide sequence and organization of the mouse adenine phosphoribosyltransferase gene: Presence of a coding region common to animal and bacterial phosphoribosyltransferases that has a variable intron/exon arrangement JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2731-2735 (1985) STANDARD full staff_entry REFERENCE 10 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Fujimori,S., Davidson,B.L., Kelley,W.N. and Palella,T.D. TITLE Identification of a single nucleotide change in the hypoxanthine-guanine phosphoribosyltransferase gene (HPRT-Yale) responsible for Lesch-Nyhan syndrome JOURNAL J. Clin. Invest. 83, 11-13 (1989) STANDARD full staff_entry REFERENCE 11 (sites; mutations causing gout) AUTHORS Fujimori,S., Hidaka,Y., Davidson,B.L., Palella,T.D. and Kelley,W.N. TITLE Identification of a single nucleotide change in a mutant gene for hypoxanthene-guanine phosphoribosyltransferase (HPRT-Ann Arbor) JOURNAL Hum. Genet. 79, 39-43 (1988) STANDARD full staff_entry REFERENCE 12 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Gibbs,R.A., Nguyen,P.N., Edwards,A., Civitello,A.B. and Caskey,C.T. TITLE Multiplex DNA deletion detection and exon sequencing of the hypoxanthine phosphoribosyltransferase gene in Lesch-Nyhan families JOURNAL Genomics 7, 235-244 (1990) STANDARD full staff_entry REFERENCE 13 (sites; mutations causing Lesch-Nyhan syndrome and gout) AUTHORS Gibbs,R.A., Nguyen,P.N., McBride,L.J., Koepf,S.M. and Caskey,C.T. TITLE Identification of mutations leading to the Lesch-Nyhan syndrome by automated direct DNA sequencing of in vitro amplified cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1919-1923 (1989) STANDARD full staff_entry REFERENCE 14 (sites; mutations causing gout) AUTHORS Hershey,H.V. and Taylor,M.W. TITLE Nucleotide sequence and deduced amino acid sequence of Escherichia coli adenine phosphoribosyltransferase and comparison with other analogous enzymes JOURNAL Gene 43, 287-293 (1986) STANDARD full staff_entry REFERENCE 15 (sites; conserved domain) AUTHORS King,A. and Melton,D.W. TITLE Characterisation of cDNA clones for hypoxanthine-guanine phosphoribosyltransferase from the human malarial parasite, Plasmodium falciparum: Comparisons to the mammalian gene and protein JOURNAL Nucleic Acids Res. 15, 10469-10481 (1987) STANDARD full staff_entry REFERENCE 16 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Ogasawara,N., Stout,J.T., Goto,H., Sonta,S.I., Matsumoto,A. and Caskey,C.T. TITLE Molecular analysis of a female Lesch-Nyhan patient JOURNAL J. Clin. Invest. 84, 1024-1027 (1989) STANDARD full staff_entry REFERENCE 17 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Stout,J.T. and Caskey,C.T. TITLE HPRT: Gene structure, expression, and mutation JOURNAL Annu. Rev. Genet. 19, 127-148 (1985) STANDARD full staff_entry REFERENCE 18 (sites; AUTHORS Wilson,J.M. and Kelley,W.N. TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase deficiency in a patient with the Lesch-Nyhan syndrome JOURNAL J. Clin. Invest. 71, 1331-1335 (1983) STANDARD full staff_entry REFERENCE 19 (sites; mutations causing Lesch-Nyhan syndrome and gout) AUTHORS Wilson,J.M. and Kelley,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Structural alteration in a dysfunctional enzyme variant (HPRT-Munich) isolated from a patient with gout JOURNAL J. Biol. Chem. 259, 27-30 (1984) STANDARD full staff_entry REFERENCE 20 (sites; mutations causing gout) AUTHORS Wilson,J.M., Kobayashi,R., Fox,I.H. and Kelley,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Molecular abnormality in a mutant form of the enzyme (HPRT-Toronto) JOURNAL J. Biol. Chem. 258, 6458-6460 (1983) STANDARD full staff_entry REFERENCE 21 (sites; mutations causing gout) AUTHORS Wilson,J.M., Tarr,G.E. and Kelley,W.N. TITLE Human hypoxanthine (guanine) phosphoribosyltransferase: An amino acid substitution in a mutant form of the enzyme isolated from a patient with gout JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 870-873 (1983) STANDARD full staff_entry REFERENCE 22 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Yang,T.P., Patel,P.I., Chinault,A.C., Stout,J.T., Jackson,L.G., Hildebrand,B.M. and Caskey,C.T. TITLE Molecular evidence for new mutation at the hprt locus in Lesch-Nyhan patients JOURNAL Nature 310, 412-414 (1984) STANDARD full staff_entry REFERENCE 23 (sites; mutant reversion) AUTHORS Yang,T.P., Stout,J.T., Konecki,D.S., Patel,P.I., Alford,R.L. and Caskey,C.T. TITLE Spontaneous reversion of novel Lesch-Nyhan mutation by HPRT gene rearrangement JOURNAL Somat. Cell Mol. Genet. 14, 293-303 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Edwards, 26-JUL-1989. Mutant Description RJK 1780 missing intron 1 (partial and exon 2 [12] RJK 849 missing intron 3 (partial) and exons 4-9 (no mRNA) [22] RJK 984 missing intron 5 (partial) and exons 6-9 (no mRNA) [17] [22] GM 3467 missing intron 8 (partial) and exon 9 (no mRNA) [12] [22] RJK 853 complete gene deletion [12] [22] [16] GM 2227 inversion of exons 6-9 (no mRNA) [23] GM 1662 and GM 6804 duplication of exons 2 and 3 and elongated mRNA [22] [23] Connersville missing intron 7 (partial and exon 8 [7] FEATURES from to/span description pept 1677 1703 hypoxanthine phosphoribosyltransferase, exon 1 /nomgen="HPRT" /map="Xq26" /hgml_locus_uid="LE0031K" 14780 14886 hypoxanthine phosphoribosyltransferase, exon 2 16603 16786 hypoxanthine phosphoribosyltransferase, exon 3 27891 27956 hypoxanthine phosphoribosyltransferase, exon 4 31617 31634 hypoxanthine phosphoribosyltransferase, exon 5 34937 35019 hypoxanthine phosphoribosyltransferase, exon 6 39815 39861 hypoxanthine phosphoribosyltransferase, exon 7 40033 40109 hypoxanthine phosphoribosyltransferase, exon 8 41454 41501 hypoxanthine phosphoribosyltransferase, exon 9 IVS 1704 14779 HPRT intron A IVS 14887 16602 HPRT intron B IVS 16787 27890 HPRT intron C IVS 27957 31616 HPRT intron D IVS 31635 34936 HPRT intron E IVS 35020 39814 HPRT intron F IVS 39862 40032 HPRT intron G IVS 40110 41453 HPRT intron H rpt 258 41 (c) Alu repeat copy A rpt 1012 694 (c) Alu repeat copy B rpt 3532 3342 (c) Alu repeat copy C rpt 3982 3343 (c) Alu repeat copy D rpt 6232 5937 (c) Alu repeat copy E rpt 6524 6288 (c) Alu repeat copy F rpt 7504 7807 Alu repeat copy G rpt 8713 8408 (c) Alu repeat copy H rpt 9477 9169 (c) Alu repeat copy I rpt 9699 10142 Alu repeat copy J rpt 10407 10293 (c) Alu repeat copy K rpt 12154 12310 Alu repeat copy L rpt 13302 12998 (c) Alu repeat copy M rpt 13638 13314 (c) Alu repeat copy N rpt 14238 13923 (c) Alu repeat copy O rpt 14610 14239 (c) Alu repeat copy P rpt 16090 15843 (c) Alu repeat copy Q rpt 16518 16293 (c) Alu repeat copy R rpt 18783 18439 (c) Alu repeat copy S rpt 18962 19084 Alu repeat copy T rpt 19397 19104 (c) Alu repeat copy U rpt 20064 19778 (c) Alu repeat copy V rpt 20486 20636 Alu repeat copy W rpt 20990 20696 (c) Alu repeat copy X rpt 21293 20996 (c) Alu repeat copy Y rpt 22171 21878 (c) Alu repeat copy Z rpt 23772 23460 (c) Alu repeat copy AA rpt 25310 25426 Alu repeat copy AB rpt 26388 26279 (c) Alu repeat copy AC rpt 29863 29560 (c) Alu repeat copy AD rpt 33618 33148 (c) Alu repeat copy AE rpt 36575 36645 Alu repeat copy AF rpt 38613 39039 Alu repeat copy AG rpt 39091 39388 Alu repeat copy AH rpt 40622 40925 Alu repeat copy AI rpt 40952 41262 Alu repeat copy AJ rpt 42640 42936 Alu repeat copy AK rpt 44776 44486 (c) Alu repeat copy AL rpt 46627 46305 (c) Alu repeat copy AM rpt 47520 47763 Alu repeat copy AN rpt 49257 49551 Alu repeat copy AO rpt 49705 49993 Alu repeat copy AP rpt 50002 50312 Alu repeat copy AQ rpt 51066 50774 (c) Alu repeat copy AR rpt 52276 51978 (c) Alu repeat copy AS rpt 52407 52674 Alu repeat copy AT rpt 54653 54551 (c) Alu repeat copy AU rpt 55465 55164 (c) Alu repeat copy AV rpt 56523 56689 Alu repeat copy AW rpt 11312 11254 (c) LINE repeat copy A (partial) rpt 18284 18242 (c) LINE repeat copy B (partial) rpt 27091 26843 (c) LINE repeat copy C (partial) rpt 37558 37300 (c) LINE repeat copy D (partial) rpt 39451 39427 (c) LINE repeat copy E (partial) rpt 51317 51100 (c) LINE repeat copy F (partial) rpt 261 321 short interspersed repeat copy A rpt 1013 1054 short interspersed repeat copy B rpt 2902 2869 (c) short interspersed repeat copy C rpt 3078 3107 short interspersed repeat copy D rpt 4228 4254 short interspersed repeat copy E rpt 4463 4494 short interspersed repeat copy F rpt 21582 21635 short interspersed repeat copy G rpt 28430 28394 (c) short interspersed repeat copy H rpt 28923 28981 short interspersed repeat copy I rpt 35562 35604 short interspersed repeat copy J rpt 44042 44108 short interspersed repeat copy K rpt 46628 46688 short interspersed repeat copy L rpt 51715 51687 (c) short interspersed repeat copy M unsure 17479 17479 polymorphism or cloning error unsure 24903 24903 polymorphism or cloning error mut 1664 1678 ggccggctccgttat in wt; gt in gout RJK 951 [13] mut 14778 14778 a in wt; t in Lesch-Nyhan RJK 1760, no exon 2 [12] mut 14808 14809 tt in wt; ttt in Lesch-Nyhan Chicago [7] mut 14874 14874 t in wt; c in Lesch-Nyhan Detroit Leu->Pro [7] mut 14877 14879 tta in wt; ta in Lesch-Nyhan RJK 1939 [12] mut 14886 14886 g in wt; a in Lesch-Nyhan RJK 2163 Arg->Lys [12] mut 16619 16619 c in wt; g in gout Toronto Arg->Gly [20] mut 16623 16664 atgtgatgaaggagatgggaggccatcacattgtagccctct in wt; at in Lesch-Nyhan RJK 2108 [12] mut 16675 16676 ag in wt; agg in Lesch-Nyhan RJK 866 [13] mut 16677 16677 g in wt; a in Lesch-Nyhan New Haven Gly->Glu [7] mut 16679 16679 g in wt; c in Lesch-Nyhan Yale Gly->Arg [10] mut 16690 16690 c in wt; a in Lesch-Nyhan Flint RJK 892 Phe->Leu [5] [13] mut 16707 16707 a in wt; t in gout Arlingen Asp->Val [7] mut 16756 16759 tgta in wt; ta in Lesch-Nyhan RJK 1332 [13] mut 16780 16780 c in wt; a in gout Munich Ser->Arg [3] [19] mut 16784 16787 tgtg in wt; tg in Lesch-Nyhan RJK 1747 [12] mut 27897 27897 c in wt; t in Lesch-Nyhan RJK 1930 Gln->TAG [12] mut 27911 27911 c in wt; t in gout London Ser->Leu [2] [4] [12] [14] mut 31621 31621 t in wt; a in Lesch-Nyhan Midland RJK 896 Val->Asp [6] [13] mut 31622 31624 ctt in wt; ct in Lesch-Nyhan RJK 2019 [12] mut 31624 31624 t in wt; c in Lesch-Nyhan RJK 1784 Leu->Ser [13] mut 31628 31628 t in wt; g in gout Ann Arbor Ile->Met [11] mut 34962 34970 tgcag in wt; agcaaagcaa in Lesch-Nyhan RJK 1210 Met->Lys [13] mut 35015 35015 g in wt; t in gout Milwaukee RJK 949 Ala->Ser [7] [13] mut 39837 39837 c in wt; t in Lesch-Nyhan RJK 974 Arg->TGA [13] mut 39842 39846 tgttg in wt; tg in Lesch-Nyhan RJK 1894 [12] mut 39858 39858 g in wt; t in Lesch-Nyhan RJK 2185 Asp->Tyr [12] mut 39866 39866 g in wt; a in Lesch-Nyhan RJK 1934. no exon 7 [12] mut 40034 40038 tgttg in wt; tg in Lesch-Nyhan Michigan RJK 855 [7] [13] mut 40080 40080 g in wt; a in Lesch-Nyhan Kingston RJK 2188 Asp->Asn [12] [19] mut 40095 40095 t in wt; g in Lesch-Nyhan New Briton RJK 950 Phe->Val [7] [13] mut 40102 40102 a in wt; g in gout Ashville Asp->Gly [8] mut 40114 40114 g in wt; a in Lesch-Nyhan RJK 888, GM 7092, no exon 8 [12] mut 41486 41509 aaaatacaaagcctaagatgagag in wt; ag in Lesch-Nyhan Evansville RJK 894 [7] [12] mut 41450 41452 ata in wt; ttt in Lesch-Nyhan RJK 906, GM 1899, downstream cryptic splice site used mut 41454 41454 c in wt; g in Lesch-Nyhan RJK 1874, RJK 2079, His->Asp [12] [13] mut 41461 41461 g in wt; a in Lesch-Nyhan RJK 1727 Cys->Tyr [13] BASE COUNT 15689 a 11281 c 11599 g 18167 t ORIGIN 1 bp upstream of EcoRI site; chromosome Xq26. 1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt 61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc 121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc 181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt 241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga 301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc 361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag 421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc 481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt 541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct 601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga 661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt 721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat 781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga 841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag 901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct 961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc 1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag 1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac 1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc 1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg 1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt 1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg 1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg 1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg 1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc 1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc 1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg 1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg 1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg 1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc 1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga 1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca 1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc 2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat 2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg 2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag 2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg 2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct 2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt 2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg 2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca 2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg 2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt 2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg 2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa 2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt 2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg 2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc 2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc 3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt 3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt 3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct 3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat 3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt 3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag 3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc 3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc 3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta 3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac 3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc 3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt 3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc 3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc 3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc 3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag 3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc 4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca 4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc 4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc 4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa 4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca 4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga 4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac 4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg 4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt 4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac 4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg 4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag 4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg 4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca 4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg 4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat 4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg 5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc 5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta 5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc 5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact 5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct 5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg 5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg 5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc 5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa 5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat 5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa 5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg 5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc 5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa 5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt 5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt 6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg 6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga 6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg 6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc 6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt 6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc 6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg 6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt 6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga 6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac 6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct 6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag 6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc 6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt 6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg 6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc 6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca 7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc 7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga 7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt 7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc 7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg 7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag 7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag 7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca 7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt 7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac 7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga 7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt 7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa 7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct 7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact 7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt 7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc 8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac 8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct 8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg 8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat 8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag 8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat 8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg 8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg 8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc 8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac 8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg 8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc 8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat 8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga 8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta 8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg 9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt 9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag 9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt 9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc 9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag 9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt 9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca 9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc 9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc 9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat 9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg 9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc 9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga 9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca 9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat 9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta 9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc 10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc 10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa 10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa 10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat 10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt 10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct 10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt 10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga 10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca 10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct 10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg 10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg 10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa 10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta 10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa 10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca 10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct 11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa 11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc 11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat 11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc 11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag 11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag 11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga 11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg 11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg 11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt 11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac 11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa 11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa 11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa 11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt 11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt 12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag 12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat 12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat 12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc 12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt 12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt 12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct 12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat 12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa 12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag 12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt 12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac 12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca 12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct 12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt 12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc 12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta 13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc 13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg 13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc 13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct 13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt 13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg 13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat 13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca 13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga 13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt 13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac 13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta 13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg 13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc 13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat 13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg 13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc 14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt 14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga 14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt 14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc 14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg 14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc 14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc 14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca 14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg 14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata 14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg 14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac 14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg 14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat 14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac 14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa 15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt 15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg 15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg 15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg 15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt 15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat 15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt 15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata 15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat 15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag 15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg 15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt 15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga 15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc 15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca 15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag 15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt 16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag 16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt 16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa 16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa 16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt 16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact 16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac 16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac 16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa 16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg 16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg 16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag 16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa 16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa 16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta 16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta 16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg 17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt 17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt 17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt 17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc 17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg 17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc 17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt 17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa 17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc 17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat 17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc 17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat 17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact 17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa 17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt 17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa 18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat 18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat 18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt 18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat 18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc 18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg 18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt 18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt 18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca 18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc 18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt 18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg 18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg 18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat 18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa 18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat 18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg 19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa 19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag 19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga 19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc 19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa 19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg 19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc 19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa 19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg 19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa 19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa 19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg 19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag 19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga 19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag 19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt 19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct 20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt 20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac 20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct 20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga 20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat 20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg 20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat 20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt 20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc 20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag 20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt 20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca 20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta 20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg 20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc 20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt 21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc 21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg 21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc 21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc 21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa 21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat 21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa 21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat 21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat 21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt 21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt 21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta 21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt 21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt 21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct 21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct 21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat 22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc 22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg 22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca 22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag 22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc 22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat 22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg 22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct 22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca 22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt 22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc 22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta 22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct 22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta 22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct 22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa 22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa 23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag 23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc 23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa 23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc 23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac 23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga 23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt 23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc 23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc 23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat 23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc 23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc 23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc 23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga 23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga 23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa 24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg 24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat 24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat 24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct 24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt 24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga 24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg 24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt 24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc 24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt 24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc 24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg 24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat 24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa 24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt 24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg 24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc 25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag 25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc 25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact 25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta 25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg 25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc 25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag 25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc 25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca 25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag 25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag 25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt 25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag 25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat 25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa 25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata 25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg 26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt 26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat 26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt 26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag 26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc 26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact 26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg 26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat 26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag 26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt 26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt 26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata 26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta 26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat 26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg 26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata 27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata 27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg 27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc 27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat 27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag 27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga 27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt 27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac 27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa 27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg 27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt 27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat 27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct 27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata 27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt 27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat 27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct 28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg 28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca 28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat 28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa 28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca 28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc 28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg 28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa 28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt 28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt 28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt 28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat 28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa 28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac 28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac 28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca 28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct 29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac 29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa 29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc 29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga 29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt 29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca 29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca 29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt 29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt 29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca 29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg 29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt 29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct 29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg 29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag 29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct 30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga 30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact 30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt 30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa 30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt 30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg 30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc 30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc 30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag 30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga 30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta 30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata 30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct 30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata 30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa 30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca 30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga 31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg 31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac 31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat 31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga 31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg 31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt 31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt 31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa 31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat 31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg 31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac 31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat 31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg 31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt 31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt 31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac 31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta 32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa 32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag 32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta 32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt 32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt 32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact 32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc 32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat 32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa 32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta 32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca 32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg 32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc 32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt 32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt 32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt 33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc 33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac 33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta 33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac 33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta 33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta 33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata 33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata 33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc 33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat 33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa 33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct 33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc 33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt 33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga 33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct 33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag 34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc 34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat 34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac 34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt 34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga 34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt 34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc 34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt 34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca 34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct 34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac 34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa 34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc 34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc 34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg 34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt 34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac 35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt 35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat 35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta 35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag 35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga 35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc 35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc 35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt 35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag 35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat 35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta 35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat 35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta 35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg 35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt 35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa 36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt 36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag 36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat 36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga 36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa 36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt 36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga 36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct 36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg 36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg 36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga 36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag 36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga 36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta 36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt 36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag 36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta 37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa 37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc 37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc 37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg 37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc 37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg 37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac 37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag 37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg 37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc 37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt 37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc 37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt 37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc 37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc 37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag 37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa 38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat 38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc 38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat 38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga 38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata 38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa 38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag 38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt 38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg 38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc 38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg 38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa 38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg 38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa 38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt 38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg 39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa 39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt 39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg 39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc 39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga 39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga 39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc 39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca 39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac 39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa 39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc 39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg 39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct 39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa 39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct 39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga 39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat 40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg 40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt 40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa 40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct 40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag 40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag 40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc 40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact 40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg 40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg 40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag 40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac 40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag 40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg 40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca 40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca 40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct 41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg 41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc 41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac 41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg 41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct 41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc 41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt 41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag 41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg 41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta 41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac 41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc 41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt 41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata 41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga 41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg 42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt 42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc 42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt 42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt 42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta 42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg 42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct 42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt 42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta 42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag 42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta 42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca 42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa 42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg 42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag 42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac 42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa 43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt 43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc 43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct 43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat 43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat 43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc 43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt 43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc 43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt 43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt 43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc 43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca 43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag 43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg 43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa 43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg 43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt 44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca 44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc 44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta 44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact 44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc 44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca 44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg 44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg 44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc 44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac 44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg 44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta 44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca 44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc 44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact 44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt 45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga 45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt 45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg 45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc 45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta 45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg 45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc 45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac 45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct 45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca 45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa 45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa 45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa 45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat 45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg 45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt 45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt 46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca 46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc 46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca 46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg 46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt 46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag 46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct 46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt 46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc 46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc 46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc 46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa 46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc 46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat 46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg 46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc 46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac 47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac 47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca 47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg 47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca 47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc 47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga 47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc 47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa 47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact 47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag 47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg 47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc 47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa 47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga 47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg 47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga 48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc 48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct 48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc 48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg 48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg 48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt 48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg 48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa 48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat 48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc 48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg 48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga 48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat 48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg 48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc 48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt 48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt 49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg 49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag 49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc 49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct 49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat 49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa 49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc 49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc 49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc 49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc 49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta 49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg 49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa 49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc 49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag 49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct 49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt 50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat 50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc 50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt 50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct 50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga 50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca 50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt 50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca 50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt 50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg 50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca 50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa 50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga 50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct 50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt 50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac 51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca 51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca 51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct 51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca 51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac 51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc 51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac 51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa 51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc 51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct 51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc 51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc 51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga 51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt 51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac 51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg 51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa 52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga 52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag 52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa 52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg 52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat 52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg 52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca 52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa 52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag 52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca 52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt 52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta 52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa 52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa 52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac 52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc 52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca 53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata 53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt 53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag 53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc 53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact 53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc 53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat 53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag 53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca 53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag 53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc 53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc 53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc 53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt 53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta 53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct 54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg 54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac 54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt 54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg 54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc 54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc 54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg 54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga 54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag 54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc 54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta 54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta 54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg 54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta 54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga 54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa 54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca 55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc 55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca 55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc 55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc 55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc 55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta 55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg 55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg 55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt 55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt 55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg 55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat 55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac 55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt 55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa 55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa 55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta 56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg 56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag 56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca 56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt 56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca 56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt 56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct 56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg 56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg 56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa 56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc 56701 gccgcccctg gtgatggacc agcggggctt cgacga // LOCUS CHTMOMPCT 1407 bp ds-DNA BCT 24-JUL-1990 DEFINITION C.trachomatis major outer membrane protein gene, complete cds. ACCESSION J03813 KEYWORDS major outer membrane protein. SOURCE C.trachomatis (serotype A) DNA. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (sites; amino acid sequences) AUTHORS Baehr,W., Zhang,Y.-X., Joseph,T., Su,H., Nano,F.E., Everett,K.D.E. and Caldwell,H.D. TITLE Mapping antigenic domains expressed by Chlamydia trachomatis major outer membrane protein genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 4000-4004 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1407) AUTHORS Baehr,W. JOURNAL Unpublished (1988) Rocky Mountain laboratories, Hamilton, MT 59840 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly provided by W.Baehr, 04-MAY-1988 FEATURES from to/span description pept 169 1359 major outer membrane protein BASE COUNT 403 a 283 c 311 g 410 t ORIGIN 1 bp upstream of XhoI site. 1 ctcgagcatt gaacgacatg ttctcgatta aggctgcttt tacttgcaag acattcctca 61 ggccattaat tgctacagga catcttgtct ggctttaact aggacacagt gccgccagaa 121 aaagatagcg agcacaaaga gagctaatta tacaatttag aggtaagaat gaaaaaactc 181 ttgaaatcgg tattagtatt tgccgctttg agttctgctt cctccttgca agctctgcct 241 gtggggaatc ctgctgaacc aagccttatg atcgacggaa ttctgtggga aggtttcggc 301 ggagatcctt gcgatccttg caccacttgg tgtgacgcta tcagcatgcg tatgggttac 361 tatggtgact ttgttttcga ccgtgttttg aaaacagatg tgaataaaga atttcagatg 421 ggagcggcgc ctactaccag cgatgtagca ggcttagaaa aggatccagt agcaaatgtt 481 gctcgcccaa atcccgctta tggcaaacac atgcaagatg ctgaaatgtt tacgaacgct 541 gcttacatgg cattaaatat ctgggatcgt tttgatgtat tttgtacatt gggagcaact 601 accggttatt taaaaggaaa ctccgcttcc ttcaacttag ttggattatt cggaacaaaa 661 acacaatctt ctggctttga tacagcgaat attgttccta acactgcttt gaatcaagct 721 gtggttgagc tttatacaga cactaccttt gcttggagcg taggtgctcg tgcagctctc 781 tgggaatgtg ggtgtgcaac gttaggagct tctttccaat atgctcaatc taaacctaaa 841 gtagaagagt tgaatgttct ttgtaatgca tccgaattta ctattaataa gccgaaagga 901 tatgttgggg cggaatttcc acttgatatt accgcaggaa cagaagctgc gacaggaact 961 aaggatgcct ctattgacta ccatgagtgg caagcaagtt tagccctttc ttacagatta 1021 aatatgttca ctccttacat tggagttaaa tggtctagag taagttttga tgccgacacg 1081 atccgtatcg ctcagcctaa attggctaaa ccagtcttgg ataccactac tctaaacccg 1141 accatcgctg gtaaaggaac tgtggtctct tccgcagaaa acgaactggc tgatacaatg 1201 caaatcgttt ccttgcagtt gaacaagatg aaatctagaa aatcttgcgg tattgcagta 1261 ggaacaactg ttgtagatgc agataaatac gcagttacaa ttgagactcg cttgatcgat 1321 gagagagcag ctcacgtaaa tgcacaattc cgcttctaat attctaataa ttgtataatt 1381 atattagaat cttttcttag taagctt // LOCUS DROD1P 1734 bp ss-mRNA INV 24-JUL-1990 DEFINITION D.melanogaster chromosomal protein D1 mRNA, complete cds. ACCESSION J04725 KEYWORDS chromosomal protein. SOURCE D.melanogaster (strain Oregon R) embryo, cDNA to mRNA, (library of T.-S.Hsieh), clones D1-[9,9.14,15]. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 1734) AUTHORS Ashley,C.T., Pendleton,C.G., Jennings,W.W., Saxena,A. and Glover,C.V.C. TITLE Isolation and sequencing of cDNA clones encoding Drosophila chromosomal protein D1: A repeating motif in proteins which recognize AT DNA JOURNAL J. Biol. Chem. 264, 8394-8401 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by C.V.C.Glover, 29-MAR-1989. FEATURES from to/span description pept 228 1295 chromosomal protein D1 mRNA < 1 1734 D1 mRNA signal 1599 1604 polyadenylation signal signal 1717 1722 polyadenylation signal BASE COUNT 558 a 410 c 462 g 304 t ORIGIN Map position 85D1-2; polytene band. 1 acatcactac tttgaactgt actagcgtgc gcgtatttta caattacaaa ttgagaaggc 61 aaacttgatt ttcggcaaac aaaaaggtga agaagaagag cggagccaag cgaaaagagt 121 gagcgtgtgt tcgtgagaga gagcatttgt gcgcgtgtgt gtgtgagggg acgcgaagaa 181 acaaggattt tttttattcg cttgctaatt gaatcccgta cagagaaatg gaggaagttg 241 cggtaaagaa gcgcggccga ccgtcaaagg catccgtcgg tggtaaatct tcaacagcgg 301 cggttgcagc aatctcgcca ggcatcaaaa agcgcggtcg tcccgccaag aacaagggct 361 ccagtggcgg tggtggacag cggggtcgcc caccgaaggc gtcaaaaatc caaaatgacg 421 aagatcccga agacgagggg gaggaggacg gcgatggcga tggctccggc gctgagcttg 481 caaacaactc atcaccctcg ccgacgaagg gcaggggacg acccaagagc agtggtggcg 541 ccggatcagg atccggggat tcggtgaaga cgcccggttc cgccaagaag cgcaaggccg 601 gcaggcccaa gaagcaccag cccagcgata gcgagaatga agacgatcag gatgaggacg 661 acgacggcaa cagcagcatc gaagaacgtc gcccagtggg ccgaccgtcg gctggatccg 721 tcaacctgaa catatcgcgc acgggacgcg gcctgggaag acccaagaag cgggctgtcg 781 agtccaatgg cgatggggag ccacaagtgc ccaagaaacg cggtcgtcca ccacagaaca 841 agtcgggtag cggtggcagt actggctacg tacccaccgg tcgtccgcgc ggtcgtccaa 901 aggccaatgc tgcacccgtc gagaagcacg aggacaatga cgacgatcag gacgatgaga 961 actccgggga ggaggaacac agctcgcctg agaagacggt agtggcgccg aagaaacgtg 1021 gacgtccctc tcttgccgct ggcaaggtgt ccaaagagga aacaacaaag ccccgtagtc 1081 gcccagcgaa aaatattgat gacgatgcag atgatgctga ttccgccgat cagggccaac 1141 acaactccaa gaaggagtcg aacgacgagg accgtgccgt ggatgggaca ccaaccaaag 1201 gagatggcct caaatggaac tccgatggcg aaaatgatgc caacgacggt tatgtttcgg 1261 ataactacaa cgattccgaa tcggtagctg cctaaaacat acaatttata aacatttcca 1321 cacacccctc cccaaaaatc aacacaaaaa aaaaaagaat gttgtaaaag caaaaaaaaa 1381 aaaaacgaaa aaaaatctgc aagtaagaga agaaaacaca aaaatggata ggacctactt 1441 ggactttaga gaaccttccc ctcctaaaac aacacacttt gtctacatat atattccgat 1501 ttattgctga gaaatcggac aaaccgttta caactgcact gcactgtaat gacgacacat 1561 agctcggaaa acgatttcaa cttggattga aattaccaaa taaataacac ataaataaca 1621 tatttgggga agatttcaag ttccgaagta gaactgaact ttattgtaac aacaaatttg 1681 atttgattgc agaaatttca atcgtataca agtttaaata aacctactat acct // LOCUS ECOESTA2A 549 bp ds-DNA BCT 24-JUL-1990 DEFINITION E.coli estA2 gene encoding heat-stable enterotoxin A2 (STa2) protein, complete cds. ACCESSION M18345 KEYWORDS STa2 protein; estA2 gene; heat-stable enterotoxin A2. SOURCE E.coli (strain CRL 25090; isolated from human in Bangladesh) HB101 pGK22 cell line DNA, clones 2,12,21, and 27. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 549) AUTHORS Guzman-Verduzio,L.M. and Kupersztoch,Y.M. TITLE Rectification of two Escherichia coli heat-stable enterotoxin allele sequences and lack of biological effect of the change of the carboxyterminal tyrosine for histidine JOURNAL Infect. Immun. 57, 645-648 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.M.Kupersztoch, 08-NOV-1988. FEATURES from to/span description pept 149 367 heat-stable enterotoxin A2 precursor sigp 149 205 heat-stable enterotoxin A2 signal peptide matp 308 364 heat-stable enterotoxin A2 mRNA 132 549 heat-stable enterotoxin A2 mRNA BASE COUNT 182 a 93 c 115 g 159 t ORIGIN Unreported. 1 gaattccccg ccctaaaaca taatattatt atgctcttcg tagcggagag tatagtatga 61 tacacatcac aaaaaaaaaa taaaaaagtt tgcgcaatcg ttctgatttt gatttaaata 121 ttcgtggacg acgtgtttcg gaggtaatat gaaaaaatca atattattta tttttctttc 181 tgtattatct ttttcacctt tcgctcagga tgctaaacca gcagggtctt caaaagaaaa 241 aattacacta gaatcgaaaa aatgtaacat tgtaaaaaaa aataatgaaa gtagtcctga 301 aagcatgaat agtagcaatt actgctgtga attgtgttgt aatcctgctt gtaccgggtg 361 ctattaataa tataaaggga actaaacagt tccctttata tttgtgtgcg ccgtggctgg 421 cgctgttctt caactgtgga ggctgaagaa cgactaagag gtgaaagtcc tccacacacc 481 cggtgagggg aagtgttagc ggaaggcaag gtgatcctac ccacgtaata tggacacagg 541 ggaattcgt // LOCUS ECOESTA3A 403 bp ds-DNA BCT 24-JUL-1990 DEFINITION E.coli estA3 gene encoding heat-stable enterotoxin A3 (STa3) protein, complete cds. ACCESSION M18346 KEYWORDS STa3 protein; estA3 gene; heat-stable enterotoxin A3. SOURCE E.coli (strain 153837-2; isolated from human in Bangladesh) HB101 pYK117 cell line DNA, clones 5 and 7. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 403) AUTHORS Guzman-Verduzio,L.M. and Kupersztoch,Y.M. TITLE Rectification of two Escherichia coli heat-stable enterotoxin allele sequences and lack of biological effect of the change of the carboxyterminal tyrosine for histidine JOURNAL Infect. Immun. 57, 645-658 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.M.Kupersztoch, 08-NOV-1988. FEATURES from to/span description pept 147 365 heat-stable enterotoxin A3 precursor sigp 147 203 heat-stable enterotoxin A3 signal peptide matp 204 362 heat-stable enterotoxin A3 mRNA 130 > 403 heat-stable enterotoxin A3 mRNA BASE COUNT 139 a 61 c 72 g 131 t ORIGIN Unreported. 1 ttttcggtcg ccgaaaaaga taatattact atgctcttcg tagcggagag tatagtatga 61 tgttcatcac aaaaaaaata aaaaagtttg cgcaaccgtt ctgattttga ttcaaatgtt 121 cgtggatgcc atgttccgga ggtaatatga agaaatcaat attatttatt tttctttctg 181 tattgtcttt ttcacctttc gctcaggatg ctaaaccagt agagtcttca aaagaaaaaa 241 tcacactaga atcaaaaaaa tgtaacattg caaaaaaaag taataaaagt ggtcctgaaa 301 gcatgaatag tagcaattac tgctgtgaat tgtgttgtaa tcctgcttgt accgggtgct 361 attaataata taaagggaac taaacagttc cctttatatt tgt // LOCUS HIV2ISY 9636 bp ss-RNA VRL 24-JUL-1990 DEFINITION Human immunodeficiency virus type 2, isolate SBLISY, complete genome. ACCESSION J04498 KEYWORDS . SOURCE Human immunodeficiency virus type 2 (HIV-2) proviral DNA, isolate SBLISY. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9636) AUTHORS Franchini,G., Fargnoli,K.A., Giombini,F., Jagodzinski,L., De Rossi,A., Bosch,M., Biberfeld,G., Fenyo,A.M., Albert,J., Gallo,R.C. and Wong-Staal,F. TITLE Molecular and biological characterization of a replication competent human immunodeficiency type 2 (HIV-2) proviral clone JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 2433-2437 (1988) STANDARD full staff_review COMMENT Clean copy of sequence [1] kindly provided by G. Franchini, 30-JAN-1989. FEATURES from to/span description pept 547 2109 gag polyprotein pept < 1827 4934 pol polyprotein (NH2 terminus uncertain; AA at 1827) pept 4864 5511 vif protein pept 5339 5677 vpx protein pept 5677 5994 vpr protein pept 5840 6135 tat, exon 2 (first expressed exon) 8281 8377 tat, exon 3 (AA at 8282) pept 6066 6135 rev, exon 2 (first expressed exon) 8281 8717 rev, exon 3 (AA at 8283) pept 6145 8685 envelope polyprotein pept 8519 9289 nef protein pre-msg 1 9636 genomic mRNA pre-msg 1 9636 tat, rev, nef subgenomic mRNA IVS 6136 8280 tat cds intron 2 IVS 6136 8280 rev cds intron 2 IVS 6136 8280 tat, rev, nef subgenomic mRNA intron 2 LTR < 1 300 5' LTR LTR 8904 > 9636 3' LTR rpt 1 174 R repeat 5' copy rpt 9457 9636 R repeat 3' copy binding 304 320 primer (Lys-tRNA) binding site signal 9609 9614 mRNA polyadenylation signal BASE COUNT 3291 a 1940 c 2396 g 2009 t ORIGIN 1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctggactct caccagtgct tggccggcgc tgggcagacg 121 gctccacgct tgcttgctta aaagacctct taataaagct gccagttaga agcaagttaa 181 gtgtgtgttc ccatctctcc tagtcgccgc ctggtcattc ggtgttctcc tgagtaacaa 241 gaccctggtc tgttaggacc cttcttgctt tgggaaaccg aggcaggaaa atccctagca 301 ggttggcgcc cgaacaggga cttgaggaag actgagaagc cttggaacac ggctgagtga 361 aggcagtaag ggcggcagga acaaaccacg acggagtgct cctagaaagg cgcaggccaa 421 ggtaccaaag gcggcgtgtg gagcgggagt caagaggcct ccgggtgaag gtaagtacct 481 acaccaaaaa ctgtagccgg aaaaggcttg ttatcctacc tttagacagg tagaagattg 541 tgggagatgg gcgcgaaaaa ctccgtcttg agagggaaaa aggcagatga attagaaaaa 601 attaggttac ggcccggcgg gaagaaaaaa tacagattaa aacatattgt gtgggcagcg 661 aatgaattgg acagattcgg attaacagag agcctgttgg agtcaaaaga aggttgccaa 721 aaaattattt cagttttaga accattagta ccaacagggt cagaaaattt aaaaagcctt 781 tataatacta cctgcgtcat ttggtgcttg cacgcagaag agaaagtgaa agatactgaa 841 gaagcaaaaa gaatagtagg gagacatcta gtggcagaaa cagaaactgc agagaaaatg 901 ccaaatataa gtagaccaac agcaccacct agcgggaaag ggggaaactt ccccgtgcaa 961 caaataggcg gcaactatgt ccatctgccg ctgagtcccc gaaccctaaa tgcttgggta 1021 aagttagtag aggagaaaaa gttcggggca gaagtagtgc cgggatttca ggcactctca 1081 gaaggctgca cgccctatga tattaatcaa atgcttaatt gtgtgggcga ccatcaagca 1141 gcgatgcaaa taatcagaga aattattaat gaagaagcag cagactggga tgtacaacat 1201 ccaataccag gccccttacc agcggggcag ctcagagatc cacgaggatc tgacatagca 1261 gggacaacaa gcacagtaga ggaacagatc gaatggatgt ataggcaaga aaatcctgta 1321 ccagtaggaa acatctatag gagatggatc cagataggac tgcagaagtg tgtcagaatg 1381 tacaatccaa ccaacattct agacataaaa caaggaccaa aagagtcgtt ccaaagctat 1441 gtggatagat tctacaaaag cttaagggca gaacagacag atgcagcagt gaagaattgg 1501 atgacccaga cgctgctagt gcaatcgaac ccagactgta agttagtact aaagggacta 1561 gggatgaatc ctaccttaga agagatgcta accgcctgtc aagggatagg tggaccaggc 1621 cagaaggcca gactaatggc agaagcctta aaagaggcca tgcgaccagc ccctatccca 1681 tttgcagcag cccaacagaa aagggcaatt aagtgttgga attgtggaaa ggaagggcac 1741 tcggcaagac aatgccgagc gcctagaaga cagggctgct ggaaatgtgg caagtcagga 1801 cacatcatgg caaactgccc agatagacag gctggttttt tagggcttgg accatgggga 1861 aagaagcccc gcaacttccc cgtggtccca agttcgcagg ggctaacacc aacagcaccc 1921 ccaatggatc cagcagtgga cctactggag aagtacatgc agcaagggag aaaacagaga 1981 gagcagagac aaagaccata caaagaagtg acagaggact tgctgcatct cgagcaagga 2041 gagacaccac acagagagac gacagaggac ttgctgcacc tcaattctct ctttggaaac 2101 gaccagtagt cacagcatac attgaggatc agccagtaga agttttacta gacacagggg 2161 ctgacgactc aatagtagca ggaatagagt tagggagcaa ttatagtcca aaaatagtag 2221 ggggaatagg aggattcata aataccaaag aatataaaga tgtagaaata agagtgctaa 2281 ataaaaaggt aagagccacc ataatgacag gtgatacccc aatcaacatt tttggcagaa 2341 atatcctgac agccttgggc atgtcattaa atttaccagt cgccaaaata gaaccagtaa 2401 aagtaacatt aaagccagga aaagatgggc caaaacaaag acaatggccc ttaacaagag 2461 aaaaaataga agcactaaga gaaatctgtg aaaaaatgga aagagaaggt cagctagaag 2521 aagcgcctcc aactaatccc tataataccc ctacatttgc aattaagaaa aaggacaaaa 2581 acaaatggag gatgctgata gattttagag aactaaacaa ggtaactcaa gatttcacag 2641 aggttcagtt aggaattcca cacccagcag gattagccaa gaaaagaaga attactgtgt 2701 tagatgtagg agatgcctac ttttccatac ccctatatga ggattttaga cagtatactg 2761 catttactct gccatcagta aacaatgcag aaccaggaaa aagatatata tacaaagtct 2821 taccacaggg atggaagggg tcaccagcaa tttttcaata cacaatgagg caagtcttag 2881 aaccattcag aaaagcaaac ccagatgtca ttatcgttca gtacatggat gatatcttaa 2941 tagctagtga caggacagat ttggaacatg acaaagtagt cctacagcta aaggaacttc 3001 taaatggcct gggattttcc accccagacg aaaagttcca aaaggaccct ccataccaat 3061 ggatgggcta tgaactgtgg ccaaccaaat ggaaattgca aaaaatacaa ttgccccaaa 3121 aggaagtatg gacagttaat gacatccaga aactagtggg tgtcctaaac tgggcggcac 3181 aaatctaccc aggaataaag accaaacact tatgtaagct aattagagga aagatgacac 3241 ccacggaaga agtacagtgg acagaattag cagaagcaga gctggaggaa aacaaaatta 3301 tcttaagcca ggaacaggag ggacactatt accaagagga aaaagagtta gaagcaacag 3361 ttcaaaagga tcaagacaat cagtggacat ataaagtaca ccagggagaa aaaattctaa 3421 aagtaggaaa atatgcaaag ataaaaaata cccataccaa cggggtcagg ttgttggcac 3481 aggtagttca gaaaatagga aaagaagcac tagtcatttg gggacgaata ccaaaatttc 3541 acctaccagt agaaagagag acctgggaac agtggtggga taactattgg caagtgacat 3601 ggatcccaga ctgggacttc gtatccaccc caccgttggt caggttagca tttaacctgg 3661 taaaagatcc tataccaggc gcagagacct tctacacgga tggatcttgc aataggcaat 3721 caaaagaggg aaaagcagga tatataacag atagaggaaa agacaaagta aggatattag 3781 agcaaactac caaccaacaa gcagaattag aagcctttgc aatggcagta acagactcag 3841 gtccaaaagt caatattgta gtagactcac agtatgtaat gggaatagta acaggccaac 3901 cggctgaatc agagagtaga atagtaaata aaattataga agagatgata aaaaaggaag 3961 caatctatgt tgcatgggtc ccggcccaca aaggcatagg aggaaatcaa gaaattgacc 4021 acttagtaag tcagggcatc agacaagtat tattcctaga gagaatagag cccgctcagg 4081 aagaacatgg aaaatatcat agcaatgtaa aagaactagc ccataagttt ggattaccca 4141 acctggtggc aagacaaata gtaaacacat gtgcccagtg ccaacaaaaa ggggaagcta 4201 tacatgggca agtaaatgca gaactaggca cctggcaaat ggactgcaca cacttagaag 4261 gaaaaatcat tatagtagca gtacatgttg caagtggatt tatagaagca gaagtcatcc 4321 cacaggaatc aggaaggcaa acagcactct tcctattaaa actggccagt aggtggccaa 4381 taacacactt gcacacagat aatggtgcca acttcacttc acaggaggta aagatggtag 4441 catggtgggt aggcatagaa caatcctttg gagtacctta caatccacaa agccagggag 4501 tagtagaagc aatgaatcac cacctgaaaa atcagataga aagaattaga gagcaggcaa 4561 atacaatgga aacaatagta ctaatggcag ttcattgcat gaattttaaa agaaggggag 4621 gaatagggga tatgacccca gtagaaagac tagtcaatat gatcaccaca gaacaagaaa 4681 tacaattcct ccaagcaaaa aattcaaaat taaaaaattt tcgggtctat ttcagagaag 4741 gcagaaatca actgtggcaa ggacctgggg agctactgtg gaaaggggac ggagcagtca 4801 tagtcaaggt agggacagat ataaaagtaa taccaagaag aaaggccaag atcatcagag 4861 actatggacc aaggcaagag atggatagcg gttcccacct ggagggtgcc agggaggatg 4921 gagaaatggc atagccttat caagtatcta aaatacagaa caaaagatct agaacaggtg 4981 cgctatgttc cccaccataa ggtggggtgg gcatggtgga cttgcagcag ggtaatattc 5041 ccattaaaag gaaacagtca tctagagata caggcatatt ggaacctaac accagaaaaa 5101 ggatggctct cctcttattc agtaagaatg acttggtact cagaagggtt ctggacagat 5161 gttaccccag actgtgcaga caccctaata cacagcactt atttctcttg ctttacggca 5221 ggtgaagtaa gaagagccat caggggagaa aagtcattgt cctgctgcaa ttatccccaa 5281 gcccataagt ccaaggtacc gtcactccaa tttctggcct tagtagtagt acagcaaaat 5341 gacaaacccc agagagacaa taccaccagg aaacagtggc gaagaaacta tcgaagaggc 5401 cttcgattgg ctagacagga cggtagaagc cataaacaga gaggcagtga accacctgcc 5461 cagggagctt attttccagg tgtggcaaag gtcctggaga tactggcatg atgagcaagg 5521 gatgtcacga agctacacaa agtatagata tttgtgctta atgcagaaag ctgtgttcat 5581 gcatttcaag aaagggtgca cttgccgggg ggaaggacat gggccaggag ggtggagatc 5641 aggacctccc cctcctcctc ccccaggttt agtctaatga ctgaagcacc agcagagttt 5701 cccccggagg atgggacccc accgagggaa ccaggggatg agtgggtaat agaaattctg 5761 agagaaataa aggaagaagc tttaaagcat tttgaccctc gcttgctaac tgctcttggc 5821 tactatatct atactagaca tggagacacc cttgaaggcg ccagagagct cattagggtc 5881 ctacaacgag ccctcttcac gcacttcaga gcaggatgtg gccactcaag aattggccaa 5941 ccaaggggaa gaaatcctct ctcagctata ccgaccccta gaaacatgca ataacaaatg 6001 cttttgtaag gggtgctgct tccattgcca gctgtgtttt ttaaacaagg ggctcgggat 6061 atgttatgac cgaaagggca gacgaagaag gagtccgaag aaaactaagg ctcattcgtc 6121 tcctgcatca gacaagtgag tacaatgagt ggtaaaattc agctgcttgt tgcctttctg 6181 ctaactagtg cttgcttaat atattgcacc aaatatgtga ctgttttcta tggagtaccc 6241 gtgtggaaaa atgcatccat tcccctcttt tgtgcaacta aaaatagaga tacttgggga 6301 accatacagt gcttgccaga caatgatgat tatcaagaga tacctttgaa tgtaacagag 6361 gcttttgacg catgggataa tatagtaaca gaacaagcag tagaagatgt ctggaatcta 6421 tttgagacat caataaaacc atgtgtcaaa ttaacgcctt tatgtgtaac aatgaactgt 6481 aacgcaagta cagagagcgc agttgcaact acaagcccat ctggacctga tatgataaat 6541 gatactgatc catgcataca attgaacaat tgctcaggac tgagggagga agacatggtc 6601 gagtgtcagt tcaatatgac aggactagag ttagataaga aaaaacagta tagtgaaacc 6661 tggtactcaa aagatgtggt ttgtgaatca gataacagca cagaccgaaa aagatgttac 6721 atgaaccatt gcaacacatc agtcatcaca gagtcatgtg acaagcacta ttgggatgct 6781 atgagattta gatactgtgc accaccgggt tttgtcttgc taaggtgcaa tgataccaat 6841 tactcaggct ttgagcccaa ttgctctaaa gtagtagctt ctacatgtac aagaatgatg 6901 gaaacgcaac cttctacttg gcttggcttt aatggcacta gggcagaaaa tagaacatat 6961 atctattggc atggtaggga taacagaact attatcagct taaacaaata ttataatctc 7021 accatacttt gtaggagacc agaaaataaa acagttgtac caataacact catgtcaggc 7081 cgcagatttc actcccagaa gatcatcaat aaaaaaccca ggcaagcatg gtgccggttc 7141 aaaggcgagt ggagggaagc catgcaggag gtgaaacaaa cccttgtaaa acatcccagg 7201 tataaaggaa ccaatgacac aaataaaatt aactttacag caccagaaaa agactcagac 7261 ccagaagtag catatatgtg gactaactgc agaggagaat tcctctattg caacatgact 7321 tggttcctta attgggtaga aaacaagacg ggtcaacagc ataactatgt gccgtgccat 7381 atagagcaaa taattaatac ctggcataag gtagggaaaa atgtatattt gcctcctagg 7441 gaaggagagt tgtcctgcga atcaacagtg accagtatca ttgctaacat tgatgttgat 7501 ggagataacc ggacaaatat tacctttagt gcagaggtgg cagaactata ccgattggaa 7561 ttgggggatt ataaattagt agaagtaaca ccaattggct tcgcccctac agcagaaaaa 7621 agatactcct ctgctccagg gagacataag agaggtgtgc ttgtgctagg gttcctaggt 7681 tttctcacga cagcaggtgc tgcaatgggg gcggcgtctc tgacgctgtc ggctcagtct 7741 cggactttat tccgtgggat agtgcagcaa cagcaacagc tgttggacgt ggtcaagaga 7801 caacaagaaa tgttgcgact gaccgtctgg ggaactaaaa acctccaggc aagagtcact 7861 gctattgaga agtacctagc agaccaggcg cgactaaatt catggggatg tgcgtttaga 7921 caagtctgcc acactactgt accatgggta aatgacacct taacacctga gtggaacaac 7981 atgacatggc aagaatggga acacaaaatc cgcttcctag aggcaaatat cagtgagagt 8041 ttagaacagg cacaaatcca gcaagaaaag aatatgtatg agctgcaaaa gctaaatagc 8101 tgggatgttt ttggcaattg gtttgactta acctcctgga tcaagtatat tcaatatgga 8161 gtcatgatag tagtaggaat agtagctctc agaatagtaa tatatgtagt acaaatgcta 8221 agtagactta gaaagggcta taggcctgtt ttctcttccc cccccggtta tattcaacag 8281 atccatatcc acaaggactg ggaacagcca gacagagaag aaacagaaga agacgttggg 8341 aacgacgttg gaagcagatc ctggccttgg ccgatagaat atatacattt cctgatccgc 8401 ctgctgatcc gcctcttgac cagactatac aacagctgca gggacttact atccagactc 8461 tacctgatcc tccaaccact cagagactgg ctgagactca aggcagccta cctgcagtat 8521 gggtgcgagt ggatccaaga agcgttccag gccctcgcga gggttacaag agagactctt 8581 acgagcgcgg ggaggagctt gtggggggct ctgggacgaa tcggaagggg gatactcgca 8641 gttccacgaa ggatcaggca gggagcagaa attgccctcc tgtgagggac agagatatca 8701 gcaaggagac tttatgaata ccccatggag aaccccagca acagaaaagg aaaaagaatc 8761 gtacaggcaa caaaatatgg atgatgtaga ttcagatgat gatgacctag taggggtctc 8821 tgacacatca agagtaccat tgagagcaat gacatataga atggcagtag acatgtcaga 8881 tttaataaaa gataaggggg gactggaagg gatgtattac agtgagagaa gacatagaat 8941 cctagacata tacttagaaa aggaagaagg gataattcca gattggcaga actatactca 9001 tgggctagga gtaaggtacc caatgttctt tgggtggcta tggaagctag taccagtaac 9061 tgtcccacaa gaaggggagg acactgagac tctctgctta atgcactcag cacaagtaag 9121 cagatttgat gacccgcatg gggagacact agtctggaag tttgacccca tgctggctca 9181 tgagtacacg acctttattc tatacccaga ggaatttggg cacaagtcag gaatggaaga 9241 agatgactgg aaggcaaaac tgaaagcaag agggatacca tttagttaaa aacaggaaca 9301 accatacttg gtcaggacag gaagtagcta ctgaaaacag ctgagactgc agggactttc 9361 cagaaggggc tgtaaccagg ggagggacat gggaggagct ggtggggaac gccctcatac 9421 tttctgtata aatgtacccg ctgctcgcat tgtattcagt cgctctgcgg agaggctggc 9481 agattgagcc ctgggaggtt ctctccagca ctagcaggta gagcctgggt gttccctgct 9541 ggactctcac cagtgcttgg ccggcgctgg gcagacggct ccacgcttgc ttgcttaaaa 9601 gacctcttaa taaagctgcc agttagaagc aagtta // LOCUS MUSIGHAAL 411 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig productively rearranged H-chain V-region VDJ mRNA from hybridoma HY2-2C8. ACCESSION M22438 KEYWORDS immunoglobulin; immunoglobulin heavy chain; joining exon; variable region. SOURCE Mouse (strain SM/J) hybridoma HY2-2C8 cell, cDNA to mRNA, clone 1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 411) AUTHORS Hardy,R.R., Carmack,C.E., Shinton,S.A., Riblet,R.J. and Hayakawa,K. TITLE A single V-H gene is utilized predominantly in anti-BrMRBC hybridomas derived from purified Ly-1 B cells: Definition of the V-H11 family JOURNAL J. Immunol. 142, 3643-3651 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.E.Carmack 03-FEB-1989. FEATURES from to/span description pept 1 / 411 Ig heavy-chain (V-J-region) precursor sigp 1 57 Ig heavy-chain (V-J-region) signal peptide matp 58 / 411 Ig heavy-chain (V-J-region) recomb 147 148 framework 1 end/CDR 1 start recomb 162 163 CDR 1 end/framework 2 start recomb 204 205 framework 2 end/CDR 2 start recomb 255 256 CDR 2 end/framework 3 start recomb 351 352 framework 3 end/CDR 3 or D-region start recomb 360 361 CDR 3 or D-region end/J-region start BASE COUNT 100 a 86 c 118 g 107 t ORIGIN Chromosome 12. 1 atggagtggg aactgagctt aattttcatt tttgctcttt taaaagatgt ccagtgtgaa 61 gtgcagctgt tggagactgg aggaggcttg gtgcaacctg gggggtcacg gggactctct 121 tgtgaaggct cagggtttac ttttagtggc ttctggatga gctgggttcg acagacacct 181 gggaagaccc tggagtggat tggagacatt aattctgatg gcagtgcaat aaactacgca 241 ccatccataa aggatcgatt cactatcttc agagacaatg acaagagcac cctgtacctg 301 cagatgagca atgtgcgatc tgaggacaca gccacgtatt tctgtatgag atatgggccc 361 tactggtact tcgatgtctg gggcacaggg accacggtca ccgtctcctc a // LOCUS MUSIGHAAM 411 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig productively rearranged H-chain V-region VDJ mRNA from hybridoma HY1-1H2. ACCESSION M22439 KEYWORDS immunoglobulin; immunoglobulin heavy chain; joining exon; variable region. SOURCE Mouse (strain SM/J) hybridoma HY1-1H2 cell, cDNA to mRNA, clone 1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 411) AUTHORS Hardy,R.R., Carmack,C.E., Shinton,S.A., Riblet,R.J. and Hayakawa,K. TITLE A single V-H gene is utilized predominantly in anti-BrMRBC hybridomas derived from purified Ly-1 B cells: Definition of the V-H11 family JOURNAL J. Immunol. 142, 3643-3651 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.E.Carmack 03-FEB-1989. FEATURES from to/span description pept 1 / 411 Ig heavy-chain (V-J-region) precursor sigp 1 54 Ig heavy-chain (V-J-region) signal peptide matp 55 / 411 Ig heavy-chain (V-J-region) recomb 141 142 framework 1 end/CDR 1 start recomb 159 160 CDR 1 end/framework 2 start recomb 204 205 framework 2 end/CDR 2 start recomb 252 253 CDR 2 end/framework 3 start recomb 348 349 framework 3 end/CDR 3 or D-region start recomb 360 361 CDR 3 or D-region end/J-region start BASE COUNT 94 a 115 c 96 g 106 t ORIGIN Chromosome 12. 1 atgagagtgc tgggattttt gtgcctggtg acagtccttc ctggtagcct gtctcagatg 61 cagcttcagg agtcaggacc tggcctggtg aaaccctcac agtcactctt cctcgcctgc 121 tctattactg gtttccccat caccagtggt tactactgga tctggatccg tcagtcacct 181 gggaaacccc tagaatggat ggggtacatc actcatagtg gggaaacttt ctacaaccca 241 tccctccaga gccccatctc cattactaga gaaacatcca agaaccagtt ctttctgcaa 301 ttgaactctg tgaccacaga ggacacagcc atgtattact gtgcaggaga ctatgatggt 361 tactggtact tcgatgtctg gggcacaggg accacggtca ccgtctcctc a // LOCUS RATCKBR 4360 bp ds-DNA ROD 24-JUL-1990 DEFINITION Rat creatine kinase gene, complete cds. ACCESSION M18668 M26669 KEYWORDS creatine kinase. SOURCE Rat (Sprague Dawley) brain DNA, clone lambda ckb-2. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4360) AUTHORS Benfield,P.A., Graf,D., Korolkoff,P.N., Hobson,G. and Pearson,M.L. TITLE Isolation of four rat creatine kinase genes and identification of multiple potential promoter sequences within the rat brain creatine kinase promoter region JOURNAL Gene 63, 227-243 (1988) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.A.Benfield, 23-MAR-1988. FEATURES from to/span description pept 1276 1468 creatine kinase, exon 2 (first expressed exon) 1656 1810 creatine kinase, exon 3 1888 2020 creatine kinase, exon 4 2197 2368 creatine kinase, exon 5 2941 3064 creatine kinase, exon 6 3174 3363 creatine kinase, exon 7 3443 3621 creatine kinase, exon 8 pre-msg 941 3818 creatine kinase mRNA and introns IVS 1015 1254 creatine kinase intron A IVS 1469 1655 creatine kinase intron B IVS 1811 1887 creatine kinase intron C IVS 2021 2196 creatine kinase intron D IVS 2369 2940 creatine kinase intron E IVS 3065 3173 creatine kinase intron F IVS 3364 3442 creatine kinase intron G BASE COUNT 801 a 1314 c 1278 g 967 t ORIGIN 1 aagtactgct aaacctactt tccagagaag atgaaactgc tggcagctgg aggcagggtg 61 tgaacgcctg tgttcacaca ttccaattca cagccggcac tggctgtggt caaattccag 121 gtttccccgt acctgcgtgt gtgggacacg gggtgggggt caaagagact ctttcaatct 181 ggtggatcta ttaatagtta tttttgactg gctgcttcct ccacccccag tttgccctgc 241 cgtaggtact gtctttgtag gtggggtggt agaaaccggt ccctcaggaa agaacctggg 301 gatttgaaga ttcaaaaaca gtctctagga gctcagtgtc ttaatttttc agggtccggg 361 tccaagggtc ttggctaggt tccttagggc ccgcccaagg tcagaacacc tggtcttccg 421 gcggacgagc cagatgcgct ctgtgtggac caagaacacc caggagatgc ccgcaggcac 481 cttgaggaag gtcagctccc ctgtaccgtt ttgtaccgcc tggatgcagt ggatggaagc 541 cgtaccccat gctaaaaacc tggtatctct tgccatgacc ccctcaacac tgctgtcggc 601 ctgacaaggc aggaagcagg gaaggcagga acctcaagtg ctaactggct ttccagactc 661 agtctctcgc cccaggcatg tgtgtgaagc aggtctaggt atctacggtc ctgggatttc 721 gtcttcgaga tcctgagcga gcgccacggg gttgaccaaa ctgcgcagcg gggtcgagac 781 ttggggacca tggggtgggc cgctgggggt gtcgagggtt gttgcctcgg acaaagcggc 841 ggcaccaccc caaagcgcgg gccaatggaa tgaatgggct ataaatagcc gccaatggga 901 ggccggcgac gcgcccctta agagctcagg gagcagcgag cggccgtcgt tcttctgcgt 961 gcgcccggag ctgcaagcac aagcagtctg cgttcctgct ccgtcggaat cccggtgagc 1021 gggtcggagg gtgaggggct agtcctgagg gcgtacgcgg ttgggtgcat cacgcgcagc 1081 tcgttgcccg gagcgcgtcc ggaccgccgg ggaacttggg atgcgccgga ctcaaaagat 1141 gcagacctcg ctgaccttgg tgacgtccga ggaaagcttc gggggtccgg attggtctgc 1201 agggtctctg gagaccggtg tgtaaagctc ctctgacccc gctcttcccc gcagcctgcc 1261 gccgccgccg ccgccatgcc cttctccaac agccacaaca cgcagaagct gcgcttcccg 1321 gccgaggatg aattccctga tctgagcagc cacaacaacc atatggccaa ggtgctgacc 1381 cccgagctgt acgcggagct ccgtgccaag tgcacgccga gcggctttac gttggacgac 1441 gccatccaga ctggcgtaga caatccgggt atgcacaccc ttgtagcgtc aggctttcgt 1501 ctccccaaca agccccccgg gcaaggatcc cactgctctt ccctgaacct cggcggggtg 1561 gggtctcctg gtcctctctg cgcttagcca taagggcctc agtgccttgc tctagcttgg 1621 cgctgacgtc actgtcgccg cgccccatcc tccaggccac ccgtacatca tgacagtggg 1681 tgcagtggcg ggcgacgagg agagttacga cgtattcaag gaccttttcg accccatcat 1741 cgaggaccgg cacggcggct accagcccag tgatgagcac aagactgacc tcaacccaga 1801 caacctgcag gtgcggggcg tcagggtccg ggcgtgctgg ggagaggggg tctcggcgct 1861 cactccggcc actgcttgca tccctagggc ggcgatgacc tggaccccaa ctacgtgctg 1921 agctcgcggg tgcgcacagg ccgaagcatc cgcggcttct gcctcccccc tcactgcagc 1981 cgtggggagc gccgcgccat cgagaagctg gcagtagaag gtagggtccg gtatcagcca 2041 ccagagctgc tgcgttcttg tctgggcgcg cagcgttccc ccagggtctg agggtggaag 2101 acccaactaa gacagcgggg acactgccgc gggcgggtgg cgagctttct agctcgccca 2161 cccacccacc ggctccttga actctgtact ccgcagccct gtccagccta gatggcgacc 2221 tgtctggcag gtactatgcg ctcaagagca tgaccgaggc ggacgagcag cagctcattg 2281 acgaccactt cctcttcgac aagcctgtgt cgcctctgct gctggcctcc gcgatggccc 2341 gcgactggcc ggatgctcgc ggcatttggt acgagccctc ttccctcaca gtccccggaa 2401 ggtggggcac ggcctgaatt cgtagattgt gcgttaccac actgtgcgca ggtgaccttg 2461 gttctgcgga tgactgaggt ccctgcgccg cgggaggccg cagtctccag ggattcaagg 2521 gtggtgacca gtttctttgg cgtctgttct ccgccctcct cctgggagcc ggcgcttctt 2581 gttttctctc ctccttctca cccccttatt ccgccgggat tctgccaggt gccagtgacg 2641 caaaagcctc cgcacccgtc cggcagggct cctacccctg cagactgcgc gggcgagcgt 2701 gcttcctctg gtgggctgct ctggctgctc cagccccttg cgggccacgc agcacatgac 2761 tggtgattga ggtgccctgg tctccctgcc ttcgagtgtt gctgggtggg cgtcttagag 2821 acagcccaca ctcctgcact tggaaagcgc tttggaggtc acccacctag ggcgtcctgc 2881 tgggtgtgac ctaactaagg ctgtagactt tgactttgtc tgcccgaccc cctcctccag 2941 gcacaatgac aataagacgt tcctggtgtg gatcaacgag gaggaccacc tgcgggttat 3001 ctccatgcag aaagggggca acatgaagga agttttcacg cgattctgca ctggcctcac 3061 tcaggtctgg cctggtgcta aggagggagg gggtcggggg gatttgagtc cacatttccc 3121 agggtgcttg gggttccagc tacattttcc cttaactttc ccttcacctt cagattgaaa 3181 ctctcttcaa gtctaagaac tatgagttca tgtggaaccc tcacctgggc tacatcctca 3241 cgtgcccatc caacctgggc actgggcttc gggcaggcgt gcacatcaag ctgccccacc 3301 tgggaaagca cgagaagttc tcggaggtgc tcaagcgact gcggcttcag aagcgaggca 3361 caggtgaggg gcagacgact caggtgaagc agcaaacgcg gctttcccca gccctgctga 3421 cctgcctgtc tcccatcccc aggtggtgtg gacaccgctg ctgtgggtgg agtttttgat 3481 gtctccaacg ctgaccgcct gggcttctcg gaggtggagc tggtgcagat ggtggtggac 3541 ggagtgaagc tactcattga gatggagcag cggcttgagc agggtcagcc cattgacgac 3601 ctcatgcctg cccagaagtg aagcctggcc ctagccacca ccaggctgcc gcttcctaac 3661 ttattacccg ggcagtgccc gccatgcatc cttgatgttt gccgcctggc gtgagccctt 3721 agcctcgctg tagagacttc tgtcgccctg ggtagagttt attttttctg atggctaagc 3781 tgttgcagac actgaaataa attagggttt ggcctgccct atgtctgagt gttgcttctc 3841 ctttcttaga gacagtcgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 3901 tgtgtgtgtc catccgtcct tcctggcctg tcacttcacc tagcagatga aacatgagtc 3961 ttaagataca gggcagaggg agggagactt gagtgcagcc tgaacatcta aggacatttg 4021 tacttgcagg gtggagccta gtatttcctt agctccagcc agctaaaggg agtcagcaac 4081 ttgacatctt ccccagtttc tttctgccgc attcctggca gcctgaatat ctaaagttct 4141 atctcttcac ttccttcccc tccaggcttc aacacttact ccagttctta aataattctc 4201 tccctacttc ctccagcttc caaccccagg ttgagtgcct cctcctgatc cctcagccca 4261 gctctccagc cccatccaga caagagcttt tcagaaccac ccgtccctct ctcttgcagc 4321 ttcctctctg gggccctcct ccttgggagg agacaagctt // LOCUS RATTHY 797 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Rat prothymosin-alpha mRNA, complete cds. ACCESSION M20035 KEYWORDS prothymosin. SOURCE Rat (strain Charles River CD, 5 week old) spleen, cDNA to mRNA, clone pRSpro-187. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 797) AUTHORS Frangou-Lazaridis,M., Clinton,M., Goodall,G.J. and Horecker,B.L. TITLE Prothymosin alpha and parathymosin: Amino acid sequences deduced from the cloned rat spleen cDNAs JOURNAL Arch. Biochem. Biophys. 263, 305-310 (1988) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by B.L.Horecker, 26-APR-1988. FEATURES from to/span description pept 174 512 prothymosin precursor matp 177 509 prothymosin BASE COUNT 248 a 187 c 219 g 143 t ORIGIN Unreported. 1 gccatctttg catttgttcc cgggtcgtgc tccgcgctca catgcaagcc aacctatcgg 61 ccgacccacc ggtcatcctc caaacgcgga cttccggcta gcttatctcg gccaagagtc 121 ctcgaaactc gacatattct tacgcgtaga ccagaccacc ggcgtgcccc accatgtcag 181 acgcggcagt ggacaccagc tccgagatca ccaccaagga cttgaaggag aagaaggaag 241 ttgtggagga ggcagagaat ggaagagacg cacctgccaa tgggaacgct caaaatgagg 301 aaaatgggga gcaggaggct gacaatgagg tagatgaaga agaggaagaa ggtggggagg 361 aagaggagga ggaggaagaa ggtgatggtg aggaagaaga tggagatgaa gatgaggaag 421 ctgaggctcc tacgggcaag cgggtagctg aggatgatga ggatgatgat gttgagacca 481 agaagcagaa gaagactgat gaggatgact agacagcaaa aggaaaagct aaccttacgc 541 accgtgacct attcaccctc cacttcccgt ctcagaattt aaacgtggtc accttcgagt 601 agagaagcag gcccccgccg cccactccag gtgcacccac acatgacatg cgctcttcac 661 acccacaaaa cacaacatga ttgcacatgg aggagaaaga acagaactcc aggcctcttt 721 ttttcttaaa aatctttaaa ggaattgttg attttattaa ctaagcaaat tgaggcagca 781 ttaataaatt cgacaca // LOCUS RATTHYP 378 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Rat spleen parathymosin-alpha mRNA, 3' end. ACCESSION M20616 KEYWORDS parathymosin. SOURCE Rat (strain Charles River CD, 5 week old) spleen, cDNA to mRNA, clone pRSpara-5187. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 378) AUTHORS Frangou-Lazaridis,M., Clinton,M., Goodall,G.J. and Horecker,B.L. TITLE Prothymosin-alpha and parathymosin: Amino acid sequences deduced from the cloned rat spleen cDNAs-1 JOURNAL Arch. Biochem. Biophys. 263, 305-310 (1988) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by B.L.Horecker, 26-APR-1988. FEATURES from to/span description pept < 1 245 parathymosin precursor (AA at 3) BASE COUNT 111 a 69 c 144 g 54 t ORIGIN Unreported. 1 acaaggtgga ggagaaggct ggccggaaag aacggaagaa agaagtagtg gaggaggagg 61 agaatggagc tgaggaggag gaagaagaaa ctgctgagga tggagaggat gatgatgaag 121 gagacgaaga agatgaggag gaagaggagg aggaggatga aggccccgtg cggaagagaa 181 ctgctgaaga ggaggatgaa gcggatccca agaggcagaa gacagaaaac ggggcgtcgg 241 cttgagcccc tgcccgtggc ttggggatgg gaggcccctc aggtctggag gtgggcagga 301 acacacaatc agccccttct ctgctcctgc tctgcctgcc agagcttgac tgcttgacag 361 cttattcatt tcagcatc // LOCUS STYNUSAINF 228 bp ds-DNA BCT 24-JUL-1990 DEFINITION S.typhimurium nusA-infB operon promoter region. ACCESSION M28868 KEYWORDS . SOURCE S.typhimurium (strain LT2) DNA, clone 40. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 228) AUTHORS Granston,A.E., Tompson,D.L. and Friedman,D.I. TITLE Identification of a second promoter for the metY-nusA-infB operon of Escherichia coli JOURNAL J. Bacteriol. 172, 2336-2342 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.E.Granston, 27-SEP-1989. FEATURES from to/span description signal 159 164 -35 region promoter P1 signal 183 188 -10 region promoter P1 BASE COUNT 69 a 39 c 54 g 66 t ORIGIN 1 aaaataaatg agttttcatt taatcatctt ttagcagaga caggaagcgt ttagtgaagg 61 tttactgaaa taacgttgtg acatgcgtgt ataagccgga gaggtgggat gagtataacg 121 aagagtgata ttcattcatc tgcatattgg gaatagggtt gacataactc atgccaggtc 181 actatactgt gcgcagattt tacgtcccgt cctcggtacc aaatccca // LOCUS SYNAA113M 9186 bp ds-DNA SYN 24-JUL-1990 DEFINITION Sequencing vector pAA113M DNA. ACCESSION M20189 KEYWORDS cloning vector; synthetic DNA. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 9186) AUTHORS Ahmed,A. TITLE A vector for sequencing long (40-kb) DNA fragments JOURNAL Gene 75, 315-321 (1989) STANDARD full staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by A.Ahmed, 28-SEP-1988. pAA113M is a cosmid vector specially developed to sequence long (40kb) DNA fragments. It makes use of the cos site to clone long segments, IS1 to subdivide the cloned fragment by deletions, galKT to select such deletions, and some unique restriction sites to further shorten the fragment for supercoil or M13 sequencing. It contains a normal plasmid ori of replication and also the M13 ori of replication to obtain DNA in the double-stranded or single- stranded form. FEATURES from to/span description pept 3787 4977 tet protein ORF 7433 8293 amp protein BASE COUNT 2051 a 2446 c 2453 g 2236 t ORIGIN Unreported. 1 caattactgc aatgccctcg taattaagtg aatttacaat atcgtcctgt tcggagggaa 61 gaacgcggga tgttcattct tcatcacttt taattgatgt atatgctctc ttttctgacg 121 ttagtctccg acggcaggct tcaatgaccc aggctgagaa attcccggac cctttttgct 181 caagagcgat gttaatttgt tcaatcattt ggttaggaaa gcggatgttg cgggttgttg 241 ttctgcgggt tctgttcttc gttgacatga ggttgccccg tattcagtgt cgctgatttg 301 tattgtctga agttgttttt acgttaagtt gatgcagatc aattaatacg atacctgcgt 361 cataattgat tatttgacgt ggtttgatgg cctccacgca cgttgtgata tgtagatgat 421 aatcattatc actttacggg tcctttccgg tgatccgaca ggttacgggg cggcgacctc 481 gcgggttttc gctatttatg aaaattttcc ggtttaaggc gtttccgttc ttcttcgtca 541 taacttaatg tttttattta aaataccctc tgaaaagaaa ggaaacgaca ggtgctgaaa 601 gcgaggcttt ttggcctctg tcgtttcctt tctctgtttt tgtccgtgga atgaacaatg 661 gaagtcaaca aaaagcagct ggctgacatt ttcggtgcga gtatccgtac cattcagaac 721 tggcaggaac agggaatgcc cgttctgcga ggcggtggca agggtaatga ggtgctttat 781 gactctgccg ccgtcataaa atggtatgcc gaaagggatg ctgaaattga gaacgaaaag 841 ctgcgccggg aggttgaaga actgcggcag gccagcgagg cagatcaaca gtcggtacgg 901 ctgaccatcg ggtgccagtg cgggagtttc gttcagcact gtcctgctcc ttgtgatggt 961 ttacaaacgt aaaaagtctc tttaatacct gtttttgctt catattgttc agcgacacgt 1021 tgctgtacgg caggcaccag ctcttccggg atcagcgcga cgatacagcc gccaaatccg 1081 ccgccggtca tgcgtacgcc acctttgtcg ccaatcacag ctttgacgat ttctaccaga 1141 gtgtcaattt gcggcacggt gatttcgaaa tcatcgcgca tagaggcatg agactccgcc 1201 atcaactcgc ccatacgttt caggtcgcct tgctccagcg cgctggcagc ttcaacggtg 1261 cgggcgtttt cagtcagtat atgacgcacg cgttttgcca cgatcgggtc cagttcatgc 1321 gcaacagcgt tgaactcttc aatggtgaca tcacgcaggg ctggctgctg gaagaaacgc 1381 gcaccggttt cgcactgttc acgacgggtg ttgtattcgc tgccaaccag ggtacgtttg 1441 aagttactgt tgatgatgac gacagccaca cctttgggca tggaaactgc tttggtcccc 1501 agtgagcggc aatcgatcag caaggcatga tctttcttgc cgagcgcgga aattagctga 1561 tccatgatcc cgcagttaca gcctacaaac tggttttctg cttcctgacc gttaagcgcg 1621 atttgtgcgc cgtccagcgg cagatgataa agctgctgca atacggttcc gaccgcgact 1681 tccagtgaag cggaagaact taacccggca ccctgcggca cattgccgtg atcaaccatg 1741 tccacgccgc cgaagctgtt gttacgcagt tgcagatgtt tcaccacgcc acgaacgtag 1801 ttagcccatt gatagttttc atgtgcgaca atgggcgcat cgagggaaaa ctcgtcgagc 1861 tgattttcat aatcggctgc catcacgcga actttacggt catcgcgtgg tgcacaactg 1921 atcacggttt gataatcaat cgcgcagggc agaacgaaac cgtcgttgta gtcggtgtgt 1981 tcaccaatca aattcacgcg gccaggcgcc tgaatggtgt gagtggcagg gtagccaaat 2041 gcgttggcaa acagagattg tgttttttct ttcagactca tttcttacac tccggattcg 2101 cgaaaatgga tatcgctgac tgcgcgcaaa cgctctgctg cctgttctgc ggtcaggtct 2161 cgctgggtct ctgccagcat ttcataacca accataaatt tacgtacggt ggcggacggc 2221 agcagaggcg gataaaagtg cgcgtgcagc tgccagtgtt gattctcttc gccattaaat 2281 ggcgcgccgt gccagcccat agagtagggg aaggagcact ggaagaggtt gtcataacga 2341 ctggtcagct ttttcaacgc cagcgccaga tcgctgcgct gggcgtcggt caaatcggtg 2401 atccgtaaaa cgtgggcttt gggcagcagt agcgtttcga acggccaggc agcccagtaa 2461 ggcacgacgg ctaaccagtg ttcggtttcg acaacggtac ggctaccgtc tgccagctcg 2521 cgctgaacat aatccaccag cattggtgat ttctgttcgg caaaatattc tttttgcagg 2581 cggtcttcgc gctcagcttc gttaggcagg aagctatttg cccaaatctg accgtgcgga 2641 tgcgggttag agcagcccat cgccgcgcct ttgttttcaa aaacctgcac ccatgggtac 2701 gttttcccca gttctgcggt ttgctcctgc caggttttga cgatttccgt caatgctgca 2761 acgctgagct ctggcagcgt tttactgtga tccggtgaaa agcagatcac ccggctggtg 2821 ccgcgcgcgc tctggcaacg catcagcgga tcgtgacttt ctggcgcatc tggcgtgtca 2881 gacatcaaag ccgcaaagtc attagtgaaa acgtaagtcc cggtgtaatc ggggttttta 2941 tcgcctgtca cccgcacatt acctgcgcag aggaagcaat ctggatcgtg cgcaggtaac 3001 acctgtttgg ctggcgtttc ctgcgccccc tgccagggcg ttagccgggt gcggtgaaac 3061 cagaatccat tgcccggtga gcggttgtag cggcgatgtg gatgatcaac gggattaaat 3121 tgcgtcatgg tcgttcctta atcgggatat ccctgtggat ggcgtgactg ccagtgccag 3181 gtgtcctgcg ccatttcatc gagtgtgcgc gttacgcagt tcagttcacg gtcggctttg 3241 ctggcgtccg ccagtaggcc ggaaggtcgc cctcgcgagc ggtgcaaaat gataattaac 3301 cggtttgccg cacgtttgct gaaggcatta accacgtcca gcacgctgtt gcctacgcca 3361 gcgccgaggt tgtagatgtg tacgcctggc ttgttcgcca gtttttccat cgccacgacg 3421 tgaccgtccg ccagatccat tacgtggatg taatcgcgta cgccagtacc atcttcggtc 3481 ggataatcgt taccaaaaat cgccagcgag tcgcgacggc ctacagcaac ctgggcgatg 3541 tatggcatca ggttattcgg aatgccttgc ggatcttcgc ccatatcgcc cgacggatgc 3601 gcgccaaccg ggttgaagta gcgcacgagg gcaatgctcc agtccggctg ggctttttgc 3661 agatcggtga ggatctgttc caccatcagc ttgcttttgc cgtaagggct ttgcggtgtg 3721 ccggtcggga agctataatg cggtagttta tcacagttaa attgctaacg cagtcaggca 3781 ccgtgtatga aatctaacaa tgcgctcatc gtcatcctcg gcaccgtcac cctggatgct 3841 gtaggcatag gcttggttat gccggtactg ccgggcctct tgcgggatat cgtccattcc 3901 gacagcatcg ccagtcacta tggcgtgctg ctagcgctat atgcgttgat gcaatttcta 3961 tgcgcacccg ttctcggagc actgtccgac cgctttggcc gccgcccagt cctgctcgct 4021 tcgctacttg gagccactat cgactacgcg atcatggcga ccacacccgt cctgtggatc 4081 ctctacgccg gacgcatcgt ggccggcatc accggcgcca caggtgcggt tgctggcgcc 4141 tatatcgccg acatcaccga tggggaagat cgggctcgcc acttcgggct catgagcgct 4201 tgtttcggcg tgggtatggt ggcaggcccc gtggccgggg gactgttggg cgccatctcc 4261 ttgcatgcac cattccttgc ggcggcggtg ctcaacggcc tcaacctact actgggctgc 4321 ttcctaatgc aggagtcgca taagggagag cgtcgaccga tgcccttgag agccttcaac 4381 ccagtcagct ccttccggtg ggcgcggggc atgactatcg tcgccgcact tatgactgtc 4441 ttctttatca tgcaactcgt aggacaggtg ccggcagcgc tctgggtcat tttcggcgag 4501 gaccgctttc gctggagcgc gacgatgatc ggcctgtcgc ttgcggtatt cggaatcttg 4561 cacgccctcg ctcaagcctt cgtcactggt cccgccacca aacgtttcgg cgagaagcag 4621 gccattatcg ccggcatggc ggccgacgcg ctgggctacg tcttgctggc gttcgcgacg 4681 cgaggctgga tggccttccc cattatgatt cttctcgctt ccggcggcat cgggatgccc 4741 gcgttgcagg ccatgctgtc caggcaggta gatgacgacc atcagggaca gcttcaagga 4801 tcgctcgcgg ctcttaccag cctaacttcg atcattggac cgctgatcgt cacggcgatt 4861 tatgccgcct cggcgagcac atggaacggg ttggcatgga ttgtaggcgc cgccctatac 4921 cttgtctgcc tccccgcgtt gcgtcgcggt gcatggagcc gggccacctc gacctgaatg 4981 gaagccggcg gcacctcgct aacggattca ccactccaag aattggagcc aatcaattct 5041 tgcggagaac tgtgaatgcg caaaccaacc cttggcagaa catatccatc gcgtccgcca 5101 tctccagcag ccgcacgcgg cgcatctcgg gcagcgttgg gtcctggcca cgggtgcgca 5161 tgatcgtgct cctgtcgttg aggacccggc taggctggcg gggttgcctt actggttagc 5221 agaatgaatc accgatacgc gagcgaacgt gaagcgactg ctgctgcaaa acgtctgcga 5281 cctgagcaac aacatgaatg gtcttcggtt tccgtgtttc gtaaagtctg gaaacgcgga 5341 agtcagcgcc ctgcaccatt atgttccgga tctgcatcgc aggatgctgc tggctaccct 5401 gtggaacacc tacatctgta ttaacgaagc gctggcattg accctgagtg atttttctct 5461 ggtcccgccg catccatacc gccagttgtt taccctcaca acgttccagt aaccgggcat 5521 gttcatcatc agtaacccgt atcgtgagca tcctctctcg tttcatcggt atcattaccc 5581 ccatgaacag aaattccccc ttacacggag gcatcaagtg accaaacagg aaaaaaccgc 5641 ccttaacatg gcccgcttta tcagaagcca gacattaacg cttctggaga aactcaacga 5701 gctggacgcg gatgaacagg cagacatctg tgaatcgctt cacgaccacg ctgatgagct 5761 ttaccgcagc tgcctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct 5821 cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 5881 cgcgtcagcg ggtgttggcg ggtgtcgggg cgcagccatg acccagtcac gtagcgatag 5941 cggagtgtat actggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcaccat 6001 atgcggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcgctcttcc 6061 gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 6121 cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 6181 tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 6241 cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 6301 aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 6361 cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 6421 gcgctttctc aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 6481 ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 6541 cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 6601 aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 6661 tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 6721 ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 6781 tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 6841 ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 6901 agattatcaa aaaggatctt cacctagatc cttttaaatt gtaaacgtta atattttgtt 6961 aaaattcgcg ttaaattttt gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg 7021 caaaatccct tataaatcaa aagaatagcc cgagataggg ttgagtgttg ttccagtttg 7081 gaacaagagt ccactattaa agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta 7141 tcagggcgat ggcccactac gtgaaccatc acccaaatca agttttttgg ggtcgaggtg 7201 ccgtaaagca ctaaatcgga accctaaagg gagcccccga tttagagctt gacggggaaa 7261 gccggcgaac gtggcgagaa aggaagggaa gaaagcgaaa ggagcgggcg ctagggcgct 7321 ggcaagtgta gcggtcacgc tgcgcgtaac caccacaccc gccgcgctta atgcgccgct 7381 acagggcgcg taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat 7441 gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct 7501 gactccccgt cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 7561 caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag 7621 ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta 7681 attgttgccg ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg 7741 ccattgctgc aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg 7801 gttcccaacg atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 7861 ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta 7921 tggcagcact gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg 7981 gtgagtactc aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc 8041 cggcgtcaac acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg 8101 gaaaacgttc ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 8161 tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg 8221 ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat 8281 gttgaatact catactcttc ctttttcaat attattgaag catttatcag ggttattgtc 8341 tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca 8401 catttccccg aaaagtgcgg taatgactcc aacttattga tagtgtttta tgttcagata 8461 atgcccgatg actttgtcat gcagctccac cgattttgag aacgacagcg acttccgtcc 8521 cagccgtgcc aggtgctgcc tcagattcag gttatgccgc tcaattcgct gcgtatatcg 8581 cttgctgatt acgtgcagct ttcccttcag gcgggattca tacagcggcc agccatccgt 8641 catccatatc accacgtcaa agggtgacag caggctcata agacgcccca gcgtcgccat 8701 agtgcgttca ccgaatacgt gcgcaacaac cgtcttccgg agactgtcat acgcgtaaaa 8761 cagccagcgc tggcgcgatt tagccccgac atagccccac tgttcgtcca tttccgcgca 8821 gacgatgacg tcactgcccg gctgtatgcg cgaggttacc gactgcggcc tgagtttttt 8881 aagtgacgta aaatcgtgtt gaggccaacg cccataatgc gggctgttgc ccggcatcca 8941 acgccattca tggccatatc aatgattttc tggtgcgtac cgggttgaga agcggtgtaa 9001 gtgaactgca gttgccatgt tttacggcag tgagagcaga gatagcgctg atgtccggcg 9061 gtgcttttgc cgttacgcac caccccgtca gtagctgaac aggagggaca gctgatagaa 9121 acagaagcca ctggagcacc tcaaaaacac catcatacac taaatcagta agttggcagc 9181 atcacc // LOCUS SYNHUMUBA 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Thr14->Cys), complete cds. ACCESSION M21581 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 81 a 51 c 54 g 60 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctt cgtcaagacg ttaaccggta aaaccatatg tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBB 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Phe4->Cys), complete cds. ACCESSION M22214 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 82 a 51 c 54 g 59 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccataac tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBC 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Phe4->Cys, Thr14->Cys), complete cds. ACCESSION M22215 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 81 a 50 c 55 g 60 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccatatg tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgacctta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBD 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Phe4->Cys, Thr14->Cys, Thr66->Cys), complete cds. ACCESSION M22216 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 80 a 48 c 56 g 62 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccatatg tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBE 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Phe4->Cys, Thr66->Cys), complete cds. ACCESSION M22217 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 81 a 49 c 55 g 61 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctg tgtcaagacg ttaaccggta aaaccataac tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS SYNHUMUBF 246 bp ds-DNA SYN 24-JUL-1990 DEFINITION Synthetic human ubiquitin gene (Thr66->Cys), complete cds. ACCESSION M22218 KEYWORDS ubiquitin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (sites) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. TITLE Ubiquitin function studied by disulfide engineering JOURNAL J. Biol. Chem. 264, 1887-1893 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 246; for [1]) AUTHORS Ecker,D.J., Butt,T.R., Marsh,J., Sternberg,E., Shatzman,A., Dixon,J.S., Weber,P.L. and Crooke,S.T. JOURNAL Unpublished (1989) See comment for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.Ecker, 01-DEC-1988. Author address [1] Smith Kline and French Laboratories P.O. Box 1539 King of Prussia, PA 19046 FEATURES from to/span description pept 10 240 synthetic ubiquitin BASE COUNT 81 a 50 c 54 g 61 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcatta tgcagatctt cgtcaagacg ttaaccggta aaaccataac tctagaagtt 61 gaaccatccg ataccatcga aaacgttaag gctaaaattc aagacaagga aggcattcca 121 cctgatcaac aaagattgat ctttgccggt aagcagctcg aggacggtag aacgctgtct 181 gattacaaca ttcagaagga gtcgtgttta catcttgtct taagactaag aggtggttga 241 ggtacc // LOCUS XELC3 1087 bp ss-mRNA VRT 24-JUL-1990 DEFINITION X.laevis complement component 3 (C3) mRNA, 3' end. ACCESSION M21790 J04493 KEYWORDS complement component 3; serum protein. SOURCE X.laevis (adult) liver, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1087) AUTHORS Grossberger,D., Marcuz,A., Du Pasquier,L. and Lambris,J.D. TITLE Conservation of structural and functional domains in complement component C3 of Xenopus laevis and mammals JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1323-1327 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.Grossberger, 14-DEC-1988. FEATURES from to/span description pept < 1 969 complement component 3 (AA at 1) BASE COUNT 327 a 211 c 258 g 291 t ORIGIN Unreported. 1 gggagggtgg tgaccgtgta ccagtctctt gtcatagaga aagaaagaaa gtgccttaac 61 tttgaactgt cttgtgaatg taaaggaagt acaacttgca agacctcccg aagtgcaaag 121 gcaacagtgt ctatagaagc ctgtgcaagg catctcaaga acgttgatgc caccatgtcc 181 attattgata tttccatgat gactggcttt tctcctgata ccgattccct ggataggcta 241 atgaagggag tggacaaata catctctaaa tatgaagtca acaaaggagc aaatgacaag 301 ggaacactta ttctctactt ggacaaagtc tcccacatag acgaagaatg tgtgaagttt 361 tatgctcatc aatactttga agtgggtttc atccagccag cttctgtaac tgtgtatgac 421 tattatactc cagataatcg ttgcactaaa ttttaccatg tggaagaagg cagtgccttg 481 ctggcaggga tttgccaagg ggatatatgc cgatgtgcag aagagaactg tttcatgcag 541 cagcaaattg agggtaaaat cactgctgac atgagagtca acatggcttg gctcccggag 601 tggaattttg tgtacaaggc tactctcacg gagttgcagc ccagcgacaa ctatgacaac 661 tatgtaatga caattaagaa ggtcattaag caaggcacag atgaggatcc tgaggacaag 721 acacgcaatt ttatcagcca tatcaaatgc cgaaaagctt taaatatgca gctgaaccga 781 gattatctga tttggggggt aactggtgac ctctggcgcc acgatggata ttcctacatc 841 attgggaagg acacatggat ggagtggtgg cccaatgaga gggagtgtca acaacgtgag 901 aaccaggatc tctgcgatga ctttgagaca gtttctgaca acctggagat tgttggttgt 961 cccaactgag gaacctaaat tcagaccaaa atgtttccat cttcccaagc tcatctgtga 1021 tgtttaattt tgtaaatgtt gttcagtgtt ttatatacag ttccaataaa gcatttattt 1081 gcttaac // LOCUS YSCKEX2A 4106 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae prohormone processing enzyme (KEX2) gene, complete cds. ACCESSION M24201 KEYWORDS Ca-dependent serine protease; furin-related protein; processing enzyme; protease; subtilism-related protease. SOURCE S.cerevisiae (strain DBY939) DNA, clone YEp24-pJ2B. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 4106) AUTHORS Fuller,R.S., Brake,A. and Thorner,J. TITLE Yeast prohormone processing enzyme (KEX2 gene product) is a Ca2+ -dependent serine protease JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1434-1438 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Brake, 09-MAY-1989 FEATURES from to/span description pept 1329 3773 prohormone processing enzyme (KEX2) BASE COUNT 1327 a 803 c 827 g 1149 t ORIGIN Chromosome 14, left arm. 1 gaattctctg ttgactacta aactgagaga atttgccgag actctaagaa cagctttgaa 61 agagcgttct gccgatgatt ccataattgt cactctgaga gagcaaatgc aaagagaaat 121 cttcaggttg atgtcgttgt tcatggacat acctccagtg caaccaaacg agcaattcac 181 ttgggaatac gttgacaaag acaagaaaat ccacactatc aaatcgactc cgttagaatt 241 tgcctccaaa tacgcaaaat tggacccttc cacgccagtc tcattgatca atgatccaag 301 acaccatatg gtaaattaat taagatcgat cgtttaggaa acgtccttgg cggagatgcc 361 gtgatttact taaatgttga caatgaaaca ctatctaaat tggttgttaa gagattacaa 421 aataacaaag ctgtcttttt tggatctcac actccaaagt tcatggacaa gaaaactggt 481 gtcatggata ttgaattgtg gaactatcct gccatggcta taatttacct cagcaaaagg 541 catccggtat tagataccat gaaagtttga tgactcatgc tatgttggat cactggctgc 601 cacgtcgatg aaacgtctaa attaccactt cgctaccgtc tgaaaattcc tggggtaaag 661 actccggtaa agacggatta tacgtgatga ctcaaaagta cttcgaggag tactgctttc 721 aaattgtggt cgatatcaat gaattgccaa aagagctggc ttcaaaattc acctcaggta 781 aggaagagcc gattgtcttg cccatctgga cccaatggtg ctttggccaa ataaatagtt 841 tcagcagctc tgatgtagat acacgtatct cgacatgttt tatttttact atacatacat 901 aaaagaaata aaaaatgata acgtgtatat tattattcat ataatcaatg agggtcattt 961 tctgaaacgc aaaaaacggt aaatggaaaa aaaataaaga tagaaaaaga aaacaaacaa 1021 aggaaaggtt agcatattaa ataactgagc tgatacttca acagcatcgc tgaagagaac 1081 agtattgaaa ccgaaacatt ttctaaaggc aaacaaggta ctccatattt gctggacgtg 1141 ttctttctct cgtttcatat gcataattct gtcataagcc tgttcttttt cctggcttaa 1201 acatcccgtt ttgtaaaaga gaaatctatt ccacatattt cattcattcg gctaccatac 1261 taaggataaa ctaatcccgt tgttttttgg cctcgtcaca taattataaa ctactaaccc 1321 attatcagat gaaagtgagg aaatatatta ctttatgctt ttggtgggcc ttttcaacat 1381 ccgctcttgt atcatcacaa caaattccat tgaaggacca tacgtcacga cagtattttg 1441 ctgtagaaag caatgaaaca ttatcccgct tggaggaaat gcatccaaat tggaaatatg 1501 aacatgatgt tcgagggcta ccaaaccatt atgttttttc aaaagagttg ctaaaattgg 1561 gcaaaagatc atcattagaa gagttacagg gggataacaa cgaccacata ttatctgtcc 1621 atgatttatt cccgcgtaac gacctattta agagactacc ggtgcctgct ccaccaatgg 1681 actcaagctt gttaccggta aaagaagctg aggataaact cagcataaat gatccgcttt 1741 ttgagaggca gtggcacttg gtcaatccaa gttttcctgg cagtgatata aatgttcttg 1801 atctgtggta caataatatt acaggcgcag gggtcgtggc tgccattgtt gatgatggcc 1861 ttgactacga aaatgaagac ttgaaggata atttttgcgc tgaaggttct tgggatttca 1921 acgacaatac caatttacct aaaccaagat tatctgatga ctaccatggt acgagatgtg 1981 caggtgaaat agctgccaaa aaaggtaaca atttttgcgg tgtcggggta ggttacaacg 2041 ctaaaatctc aggcataaga atcttatccg gtgatatcac tacggaagat gaagctgcgt 2101 ccttgattta tggtctagac gtaaacgata tatattcatg ctcatggggt cccgctgatg 2161 acggaagaca tttacaaggc cctagtgacc tggtgaaaaa ggctttagta aaaggtgtta 2221 ctgagggaag agattccaaa ggagcgattt acgtttttgc cagtggaaat ggtggaactc 2281 gtggtgataa ttgcaattac gacggctata ctaattccat atattctatt actattgggg 2341 ctattgatca caaagatcta catcctcctt attccgaagg ttgttccgcc gtcatggcag 2401 tcacgtattc ttcaggttca ggcgaatata ttcattcgag tgatatcaac ggcagatgca 2461 gtaatagcca cggtggaacg tctgcggctg ctccattagc tgccggtgtt tacactttgt 2521 tactagaagc caacccaaac ctaacttgga gagacgtaca gtatttatca atcttgtctg 2581 cggtagggtt agaaaagaac gctgacggag attggagaga tagcgccatg gggaagaaat 2641 actctcatcg ctatggcttt ggtaaaatcg atgcccataa gttaattgaa atgtccaaga 2701 cctgggagaa tgttaacgca caaacctggt tttacctgcc aacattgtat gtttcccagt 2761 ccacaaactc cacggaagag acattagaat ccgtcataac catatcagaa aaaagtcttc 2821 aagatgctaa cttcaagaga attgagcacg tcacggtaac tgtagatatt gatacagaaa 2881 ttaggggaac tacgactgtc gatttaatat caccagcggg gataatttca aaccttggcg 2941 ttgtaagacc aagagatgtt tcatcagagg gattcaaaga ctggacattc atgtctgtag 3001 cacattgggg tgagaacggc gtaggtgatt ggaaaatcaa ggttaagaca acagaaaatg 3061 gacacaggat tgacttccac agttggaggc tgaagctctt tggggaatcc attgattcat 3121 ctaaaacaga aactttcgtc tttggaaacg ataaagagga ggttgaacca gctgctacag 3181 aaagtaccgt atcacaatat tctgccagtt caacttctat ttccatcagc gctacttcta 3241 catcttctat ctcaattggt gtggaaacgt cggccattcc ccaaacgact actgcgagta 3301 ccgatcctga ttctgatcca aacactccta aaaaactttc ctctcctagg caagccatgc 3361 attatttttt aacaatattt ttgattggcg ccacattttt ggtgttatac ttcatgtttt 3421 ttatgaaatc aaggagaagg atcagaaggt caagagcgga aacgtatgaa ttcgatatca 3481 ttgatacaga ctctgagtac gattctactt tggacaatgg aacttccgga attactgagc 3541 ccgaagaggt tgaggacttc gattttgatt tgtccgatga agaccatctt gcaagtttgt 3601 cttcatcaga aaacggtgat gctgaacata caattgatag tgtactaaca aacgaaaatc 3661 catttagtga ccctataaag caaaagttcc caaatgacgc caacgcagaa tctgcttcca 3721 ataaattaca agaattacag cctgatgttc ctccatcttc cggacgatcg tgattcgata 3781 tgtacagaaa gcttcaaatt acaaaatagc atttttttct tatagattat aatactctct 3841 catacgtata cgtatatgtg tatatgatat ataaacaaac attaatatcc tattccttcc 3901 gtttgaaatc cctatgatgt actttgcatt gtttgcaccc gcgaataaaa tgaaaactcc 3961 gaaccgatat atcaagcaca taaaagggga gggtccaatt aatgcatatt taagaccaca 4021 gctgaataac tttaaaacgg cagacaaaac aaaaaatagg tcgaataaac cttacctgcc 4081 tagaaggaat gacagcagct aataag // LOCUS YSCTASPT 2641 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae mitochondrial aspartyl-tRNA synthetase (MSD1) gene, complete cds. ACCESSION M26020 KEYWORDS aspartyl-tRNA synthetase. SOURCE S.cerevisiae (strain D273-10B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2641) AUTHORS Tzagoloff,A. JOURNAL Unpublished (1989) Columbia U. Dept. Biol. Sci. NY NY 10027 STANDARD full staff_review REFERENCE 2 (sites; for [1]) AUTHORS Gampel,A. and Tzagoloff,A. TITLE Homology of aspartyl-and lysyl-tRNA synthetases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6023-6027 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Tzagoloff, 06-JUL-1989. FEATURES from to/span description pept 601 2577 aspartyl-tRNA synthetase BASE COUNT 878 a 466 c 551 g 746 t ORIGIN 1 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaacctg taacaatagc aataccccaa 61 atacctaatg tagttccagc aagcaagcta aaaagtaaag caacaacata actcacccct 121 gcatctgcag cttttgcccg ggcagcctgc tctgcctgtg ttttctttaa ttgagcagta 181 gaccatttag cagttgcatg aatagctgca gcgtcacatc ggataataat gatggcagcc 241 attgtagaag tgccttttgc atttctagtc tctttctcgg tctagctagt tttactacat 301 cgcgaagata gaatcttaga tcacactgcc aatcttgtgc aagctgtggc aaaatcgacc 361 ttctccatgc gtccatggta aatgaaatgt tttgctttct ttttcgtgct tgcaaagttt 421 ggtagatgaa atttcagaaa gctactgccg taccaaagta caataataag gtgaaaatgg 481 gagtaggaag taatagatgg cattgaatgc ctcatttatt cattattcat gttttcttaa 541 cattttgatt cagttgagtg catgcttttt tgaaaagact aaacattaca agttttaatc 601 atgttggccc gttccagggt gtgcttgcag acaatcacta gacggttggc agactttcca 661 gaagccaatg ctattaagaa aaaatttctc tttaggaagg acacctcgac catcaaacag 721 ttaaaaggac tgtctagcgg ccagaaaata gtactcaatg gatggataga gcagaagcca 781 aaaagagttg ggaaaaattt gatcttcgga cttttaaggg actctaacgg tgacattatc 841 cagttggttg ataacaaatc gttgttgaaa ggctttactt tagaggatgt ggttcaagcc 901 gtaggtatac tctctttgaa gaggaagcta tcaaatgagg acgcagatga gtacgaagtg 961 caattggagg atattactgt gttaaatgca tctaataaaa aaccagcgca aatgcaggat 1021 tttaaattgt cagccatata cccacctgag ttccgctatt tgcaattgag aaatcccaaa 1081 tatcaagatt ttttaaagaa gaggtcatct atctctaagg aaataagaaa ctccttcaac 1141 aattttgatt ttacggaggt cgagacccca atgttattta aagctacccc agaaggcgca 1201 agagagtttc tggttccaac aaggacaaag agatccgatg gtaaaccatc gttttatgct 1261 ctagatcaga gccctcaaca atacaagcaa ctcttaatgg ctagtggtgt caacaaatat 1321 tatcaaatgg caaggtgctt tagagatgaa gatttaagag cagacaggca gcctgagttt 1381 acacaggttg atatggaaat ggcctttgct aattctgaag atgtcatgaa aatcatagaa 1441 aagacagttt ctggggtatg gagtaaattt tccaaaaaac gaggattatt gactttagac 1501 agtaagggta cattagtgcc tgcgaaaaag gaaaacggca cagtatctat ctttcgtatg 1561 acctacgaac aagccatgac ctcatatggt attgacaagc cagatttgag agcgccagat 1621 ttgaagatta tcaatttagg cgagttcaat gcctttagtc atttgaacaa aaaatttccc 1681 gtttttgaag taattattct aagaagtgcc ttttcaaata tggaagagta caaagaacga 1741 tggtcgtttc tgacaaataa cagtaattac aattatagag ttccaatagt gctaccaatt 1801 gaaaatgacg aacaagctaa ttcaaattgg tttgagaatt ttcatgcaat tgccacgttt 1861 gaaaacccac atctaataac caaatttctg aaactgaaaa aaggtgacat tgtatgcggt 1921 tgtacgagag agccaaacca ttccattttc gagaatccta ctcccctggg aagattgaga 1981 cagttggtgc tacaaagtga gcatgggaaa aatatctatc atgctgtcaa taaggatgtt 2041 gcctcatgga ttgtggattt cccgttattt tctcccgtta taattgaaga taagtctggt 2101 aaaaaagaaa agcttgcata tccggagtac gaaaaggata gactatgttc cacgcatcat 2161 ccttttacta tggtgaagct taaagactac gaaaaattag aaaagacgcc ggaaaagtgc 2221 ttgggtcggc attatgacct cgtagttaac ggtgtggaac ttggtggtgg ctcaacaaga 2281 attcacgatc caagattaca agactatatt ttcgaagata tcctcaaaat agataatgca 2341 tatgaactat ttggccattt actgaatgct tttgatatgg gaacaccgcc acacgctgga 2401 tttgctattg gttttgatcg tatgtgcgct atgatctgtg aaactgagag tataagggac 2461 gtaatcgcct tcccaaaaag tattaccggg gctgatttgg ttgtcaagag tccaagtgtg 2521 atacctgaaa gcattctgga accttacaat atcaagtata gtaattcaaa aaaatgagca 2581 gaagttgatg ttacctagat gtgagagaat cttagtgtgg ctgcagtatg taggtttttc 2641 t // LOCUS YSCTRMSD 2604 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae MSD gene encoding mitochondrial aspartyl-tRNA synthetase, complete cds. ACCESSION M24418 KEYWORDS aspartyl-tRNA synthetase. SOURCE S.cerevisiae (strain D273-10B) DNA, clone pG94/T1. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2604) AUTHORS Gampel,A. and Tzagoloff,A. TITLE Homology of aspartyl- and lysyl-tRNA synthetases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6023-6027 (1989) STANDARD full staff_review COMMENT Draft entry of sequence [1] kindly submitted by A.Tzagoloff, 01-MAY-1989. FEATURES from to/span description pept 565 2541 mitochondrial aspartyl-tRNA synthetase BASE COUNT 842 a 466 c 551 g 745 t ORIGIN 1 cctgtaacaa tagcaatacc ccaaatacct aatgtagttc cagcaagcaa gctaaaaagt 61 aaagcaacaa cataactcac ccctgcatct gcagcttttg cccgggcagc ctgctctgcc 121 tgtgttttct ttaattgagc agtagaccat ttagcagttg catgaatagc tgcagcgtca 181 catcggataa taatgatggc agccattgta gaagtgcctt ttgcatttct agtctctttc 241 tcggtctagc tagttttact acatcgcgaa gatagaatct tagatcacac tgccaatctt 301 gtgcaagctg tggcaaaatc gaccttctcc atgcgtccat ggtaaatgaa atgttttgct 361 ttctttttcg tgcttgcaaa gtttggtaga tgaaatttca gaaagctact gccgtaccaa 421 agtacaataa taaggtgaaa atgggagtag gaagtaatag atggcattga atgcctcatt 481 tattcattat tcatgttttc ttaacatttt gattcagttg agtgcatgct tttttgaaaa 541 gactaaacat tacaagtttt aatcatgttg gcccgttcca gggtgtgctt gcagacaatc 601 actagacggt tggcagactt tccagaagcc aatgctatta agaaaaaatt tctctttagg 661 aaggacacct cgaccatcaa acagttaaaa ggactgtcta gcggccagaa aatagtactc 721 aatggatgga tagagcagaa gccaaaaaga gttgggaaaa atttgatctt cggactttta 781 agggactcta acggtgacat tatccagttg gttgataaca aatcgttgtt gaaaggcttt 841 actttagagg atgtggttca agccgtaggt atactctctt tgaagaggaa gctatcaaat 901 gaggacgcag atgagtacga agtgcaattg gaggatatta ctgtgttaaa tgcatctaat 961 aaaaaaccag cgcaaatgca ggattttaaa ttgtcagcca tatacccacc tgagttccgc 1021 tatttgcaat tgagaaatcc caaatatcaa gattttttaa agaagaggtc atctatctct 1081 aaggaaataa gaaactcctt caacaatttt gattttacgg aggtcgagac cccaatgtta 1141 tttaaagcta ccccagaagg cgcaagagag tttctggttc caacaaggac aaagagatcc 1201 gatggtaaac catcgtttta tgctctagat cagagccctc aacaatacaa gcaactctta 1261 atggctagtg gtgtcaacaa atattatcaa atggcaaggt gctttagaga tgaagattta 1321 agagcagaca ggcagcctga gtttacacag gttgatatgg aaatggcctt tgctaattct 1381 gaagatgtca tgaaaatcat agaaaagaca gtttctgggg tatggagtaa attttccaaa 1441 aaacgaggat tattgacttt agacagtaag ggtacattag tgcctgcgaa aaaggaaaac 1501 ggcacagtat ctatctttcg tatgacctac gaacaagcca tgacctcata tggtattgac 1561 aagccagatt tgagagcgcc agatttgaag attatcaatt taggcgagtt caatgccttt 1621 agtcatttga acaaaaaatt tcccgttttt gaagtaatta ttctaagaag tgccttttca 1681 aatatggaag agtacaaaga acgatggtcg tttctgacaa ataacagtaa ttacaattat 1741 agagttccaa tagtgctacc aattgaaaat gacgaacaag ctaattcaaa ttggtttgag 1801 aattttcatg caattgccac gtttgaaaac ccacatctaa taaccaaatt tctgaaactg 1861 aaaaaaggtg acattgtatg cggttgtacg agagagccaa accattccat tttcgagaat 1921 cctactcccc tgggaagatt gagacagttg gtgctacaaa gtgagcatgg gaaaaatatc 1981 tatcatgctg tcaataagga tgttgcctca tggattgtgg atttcccgtt attttctccc 2041 gttataattg aagataagtc tggtaaaaaa gaaaagcttg catatccgga gtacgaaaag 2101 gatagactat gttccacgca tcatcctttt actatggtga agcttaaaga ctacgaaaaa 2161 ttagaaaaga cgccggaaaa gtgcttgggt cggcattatg acctcgtagt taacggtgtg 2221 gaacttggtg gtggctcaac aagaattcac gatccaagat tacaagacta tattttcgaa 2281 gatatcctca aaatagataa tgcatatgaa ctatttggcc atttactgaa tgcttttgat 2341 atgggaacac cgccacacgc tggatttgct attggttttg atcgtatgtg cgctatgatc 2401 tgtgaaactg agagtataag ggacgtaatc gccttcccaa aaagtattac cggggctgat 2461 ttggttgtca agagtccaag tgtgatacct gaaagcattc tggaacctta caatatcaag 2521 tatagtaatt caaaaaaatg agcagaagtt gatgttacct agatgtgaga gaatcttagt 2581 gtggctgcag tatgtaggtt tttc // LOCUS MUSCD14A 1873 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse monocyte/granulocyte cell surface glycoprotein homologous (CD14) gene, complete cds. ACCESSION M34510 KEYWORDS CD14 receptor protein; monocyte/granulocyte cell surface glycoprotein. SOURCE Mouse (strain BALB/c) adult liver macrophage cell line J774A.1 DNA, clone pMCD14/5, and cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1873) AUTHORS Ferrero,E., Hsieh,C.-L., Francke,U. and Goyert,S.M. TITLE CD14 is a member of the family of leucine-rich proteins and is encoded by a gene syntenic with multiple receptor genes JOURNAL J. Immunol. 145, 331-336 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by E.M.Ferrero, 25-MAY-1990. FEATURES from to/span description pept 173 175 murine CD14 protein (MCD14), exon 1 273 1370 murine CD14 protein, exon 2 sigp 173 175 murine CD14 protein signal peptide 273 314 murine CD14 protein signal peptide matp 315 1367 murine CD14 protein pre-msg 62 1602 murine CD14 mRNA and introns IVS 176 272 murine CD14 protein intron A signal 1433 1438 poly-A signal signal 1583 1588 poly-A signal BASE COUNT 427 a 500 c 478 g 468 t ORIGIN Chromosome 18. 1 aggaagggac agggtgacac cccaggatta cataaattta caggggctgc cgaattggtc 61 gaacaagccc gtggaacctg gaagccagag aacaccatcg ctgtaaagga aagaaactga 121 agcttttctc ggagcctatc tgggctgctc aaactttcag aatctaccga ccatggtgag 181 tcagacagac tgtcttgggg tggaactgga gccaacctga ggaatctcag ggtcctggca 241 ggagtctccc tgtcccccta ctttctcctc aggagcgtgt gcttggcttg ttgctgttgc 301 ttctggtgca cgcctctccc gccccaccag agccctgcga gctagacgag gaaagttgct 361 cctgcaactt ctcagatccg aagccagatt ggtccagcgc tttcaattgt ttgggggcgg 421 cagatgtgga attgtacggc ggcggccgca gcctggaata ccttctaaag cgtgtggaca 481 cggaagcaga tctggggcag ttcactgata ttatcaagtc tctgtcctta aagcggctta 541 cggtgcgggc cgcgcggatt cctagtcgga ttctattcgg agccctgcgt gtgctcggga 601 tttccggcct ccaggaactg actcttgaaa atctcgaggt aaccggcacc gcgccgccac 661 cgcttctgga agccaccgga cccgatctca acatcttgaa cctccgcaac gtgtcgtggg 721 caacaaggga tgcctggctc gcagaactgc agcagtggct aaagcctgga ctcaaggtac 781 tgagtattgc ccaagcacac tcactcaact tttcctgcga acaggtccgc gtcttccctg 841 ccctctccac cttagacctg tctgacaatc ctgaattggg cgagagagga ctgatctcag 901 ccctctgtcc cctcaagttc ccgaccctcc aagttttagc gctgcgtaac gcggggatgg 961 agacgcccag cggcgtgtgc tctgcgctgg ccgcagcaag ggtacagctg caaggactag 1021 accttagtca caattcactg cgggatgctg caggcgctcc gagttgtgac tggcccagtc 1081 agctaaactc gctcaatctg tctttcactg ggctgaagca ggtacctaaa gggctgccag 1141 ccaagctcag cgtgctggat ctcagttaca acaggctgga taggaaccct agcccagatg 1201 agctgcccca agtggggaac ctgtcactta aaggaaatcc ctttttggac tctgaatccc 1261 actcggagaa gtttaactct ggcgtagtca ccgccggagc tccatcatcc caagcagtgg 1321 ccttgtcagg aactctggct ttgctcctag gagatcgcct ctttgtttaa ggaacatttg 1381 catcctcctg gtttctgagg gtcctcgtca acgaatcctc tgctttaaat ttattaaaat 1441 cttaatccac gatgtaagga aagaaaggca gtcaagatgg ttcagtgggt aaaagccagc 1501 aaacttgacc cctgatttta accctcagga tccacacgga aggggaaaac tcactcctga 1561 aagttgtcca tctgtgctca caaataaata ttttttaaaa taacaatgtg tttgttggtt 1621 ttgtttttgt ttgggttttg ttgtggtttt gtttgttttg ttttgttttt gagacagtct 1681 ggctatgtat ccttggctgg cctcaaactc ataaagatca agatcggcct gcctctacct 1741 ccaaatgctc tggttaaagg gatgtgcctc catgcccagt tgaagtcatc ctgaaccacg 1801 agtccaggcc actcactctt tactaagatc tttactaagt atctgccata ggtcaggctc 1861 agtgctaaat gag // LOCUS YSTAKYAMT 563 bp ss-mRNA PLN 24-JUL-1990 DEFINITION Yeast sp. mitochondrial adenylate kinase (AKY2) gene, 5' end. ACCESSION M35511 KEYWORDS adenylate kinase. SOURCE Yeast sp. (wild type strain D273-10B, ATCC 25657), cDNA to mRNA. ORGANISM Yeast sp. Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina. REFERENCE 1 (bases 1 to 563) AUTHORS Oechsner,U., Magdolen,V., Zoglowek,C., Haecker,U. and Bandlow,W. TITLE Yeast adenylate kinase is transcribed constitutively from a promoter in the short intergenic region to the histone H2A-1 gene JOURNAL FEBS Lett. 242, 187-193 (1988) STANDARD simple staff_review FEATURES from to/span description pept 561 > 563 adenylate kinase (AKY2) BASE COUNT 153 a 100 c 98 g 212 t ORIGIN 1 taagatcggt tctggtattt taaagaaggc ggaaggaact aaactgcttc tccattgtaa 61 ctatattaag tgtctattag acctttctgt ttttcctttt tatttttcta tatataattt 121 attgggtata taattaacgg taacatatgt catgcatgat ataaattttt tgttgtgtct 181 tcctgttttc tctgttgctc tgtttaggtt cattgggcac tgttgcttac ttgaacatga 241 ttgagtagca aacacattct gtgctttgtt cactttgata gtgtgacgaa attttcacaa 301 ccaaaggaaa aaagctcacg attgcgcgat cctgtcacgg agaactgtcc gcagcagccc 361 gcggcttctg tccttattcg cccatttttt tttgattttc gactttttca ctctggctag 421 ttttattacg catatttgta tatatatata cgcataaatt tctgaaatgg attacctggt 481 ttatcagaaa cgagagcttt aataaagaaa cattaacgtt tctctggtaa agtcaccaca 541 cagcatcaaa tataacagta atg // LOCUS MLMPOLA 1680 bp ss-RNA VRL 24-JUL-1990 DEFINITION Moloney murine leukemia virus pol gene, partial cds. ACCESSION M32803 KEYWORDS . SOURCE Moloney murine leukemia virus RNA, clone 3-1R. ORGANISM Moloney murine leukemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine leukemia viruses. REFERENCE 1 (bases 1 to 1680) AUTHORS Martinelli,S.C. and Goff,S.P. TITLE Rapid reversion of a deletion mutation in Moloney murine leukemia virus by recombination with a closely related endogenous provirus JOURNAL Virology 174, 135-144 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 684 pol protein (AA at 1) 688 > 1680 pol protein site 685 687 pol protein in-frame stop codon BASE COUNT 469 a 467 c 435 g 309 t ORIGIN 1 gcttttcccc tcgagcgccc agactgggat tacaccaccc aggcaggtag gaaccaccta 61 gtccactatc gccagttgct cctagcgggt ctccaaaacg cgggcagaag ccccaccaat 121 ttggccaagg taaaagggat aacacaggga cctaatgagt ctccctcagc ctttttagag 181 agactcaagg aggcctatcg caggtacact ccttatgacc ctgaggaccc agggcaagaa 241 accaatgtgt ctatgtcatt catctggcag tctgccccgg atatcgggcg aaagttaggg 301 cggttagaag atttaaagag caagacctta ggagacttag ttagggaggc tgaaaagatc 361 tttaataaac gtgaaacccc ggaagaaaga gaggaacgta tcaggagaga aacagaggaa 421 aaagaagaac gccgtaggac agtggatgag cagaaagaga aagaaaggga ccgcagaaga 481 catagagaga tgagcaagct cttggccact gtagttattg gtcaggaaca ggatagacag 541 gagggagagc ggaagaggcc ccaacttgat aaggatcaat gcgcctactg caaagaaaag 601 gggcactggg ctaaagactg tcccaagaag ccacgagggc cccgaggacc caggccccag 661 acctccctcc tgaccttagg tgactaggga ggtcagggtc aggacccccc ccctgaaccc 721 aggataaccc tcaaagtcgg ggggcaaccc gtcaccttcc tggtagatac tggggcccaa 781 cactccgtgc tgacccaaaa tcctggaccc ctaagtgaca agtctgcctg ggtccaaggg 841 gctactggag gaaagcggta tcgctggacc acggatcgca aagtgcatct agctaccggt 901 aaggtcaccc actctttcct ccatgtacca gactgcccct atcctctgct aggaagagat 961 ttgctgacta aactaaaagc ccaaatccac tttgagggat caggagctca ggttgtggga 1021 ccaatgggac agcccctgca agtgctgacc ctaaacatag aagatgagta tcggctacat 1081 gagacctcaa aagagccgga tgtttctcta gggttcacat ggctttctga ttttcctcag 1141 gcctgggcgg aatccggggg catgggactg gcagttcgcc aagctcctct gatcatacct 1201 ctgaaggcaa cctctacccc cgtgtccata aaacaatacc ccatgtcaca agaagccaga 1261 ctggggatca agccccacat acagagactg ttggaccagg gaatactggt accctgccag 1321 tccccctgga acacgcccct gctacccgtt aagaaaccag ggactaatga ttacaggcct 1381 gtccaagatc tgagagaagt caacaagcgg gtggaagaca tccaccccac cgtgcccaac 1441 ccttacaacc tcttgagcgg gctcccaccg tcccaccagt ggtacactgt gcttgactta 1501 aaggatgcct ttttctgcct gagactccac cccaccagtc agcctctctt cgcctttgag 1561 tggagagacc cagagatggg aatctcagga caattaacct ggaccagact cccacagggt 1621 ttcaaaaaca gtcccaccct gtttgatgag gcactgcaca gagacctagc agacttccgg // LOCUS RHIISRA 1350 bp ds-DNA BCT 24-JUL-1990 DEFINITION Plasmid RP4 of Rhizobium transposon Tn1 transposable element ISR1 open reading frames A, A1, A2 and A3. ACCESSION X06616 KEYWORDS transposable element. SOURCE Plasmid RP4 transposon Tn 1 of Rhizobium sp. DNA. ORGANISM Rhizobium sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 1350) AUTHORS Priefer,U.B., Kalinowski,J., Rueger,B., Heumann,W. and Puehler,A. TITLE ISR1, a transposable DNA sequence resident in Rhizobium class IV strains, shows structural characteristics of classical insertion elements JOURNAL Plasmid 21, 120-128 (1989) STANDARD simple staff_review FEATURES from to/span description ORF 94 360 ORF A3 ORF 433 714 ORF A1 ORF 787 1272 ORF A2 ORF 900 64 (c) ORF A BASE COUNT 258 a 415 c 470 g 207 t ORIGIN 1 acggaaattg ctcatcagct cagtattgcc tgacgtgacc ccctgaaact cctccaggaa 61 tagctagagt ccgcccaacg aaggacggac agaatgaagc gatcgaggtt cacggaagag 121 cagatcatcg ggatcctgcg ggagcaggag gccggtgtag cgacggcgga ggtgtgccga 181 cgccacgggg tcagctcggc gaccttttac aaatggaagg ccaagtttgg cgggctcgac 241 gtgtcggagg ctcggcggct caaggcgctc gaggatgaga acgcccggct gaagcggatg 301 ctggcggacg cgatgctgga caacgtggcc ctgaaggacc tgctgggaaa aaagtggtga 361 cgcccgccgg ctatcgcgag gcggctggcc atctgcaggc cgcctacgag atgagcgaaa 421 ggcgggcgtg ccgtgttctg ggcgtcgatc ggacgagcgt gcgctatcag gcgacgcgcc 481 cggacgacgc gctctgcgcg accggctgaa ggccctggcc aggaacgtcg acggttcggc 541 tatcgtcgcc tgcacgtgct gctgcggcgc gagggccatg cggtcaacag gaaacgggtc 601 cagcggatct atcgcgacga gcaactgacg gtgcgccggc gcgcgccgca agcgagcgat 661 gggcacgcgg cgaccgtgac ctgccgctgg cgcccaacca gcgctggtcg ctgacttcgt 721 cgcagaccag ctgacggacg ggcggcgctt ccgcatcctg acggtgatcg ataactgtac 781 gcgcgagtgc ttggcgctgg tggccgacac ctcgctgtcg ggcgcgcggg tcgtgcggga 841 gttggacgcc gtcatccggc agcgagggcg gcccgacacc atcgtcagcg acaacggcac 901 ggagtacacc tcgaacgcgg tcctggcctg ggcgacgaca ccggcgtcgg ctggcactac 961 atcgcgccgg gcaagcccca gcagaacggc ttcaacgaaa gcttcaacgg gcggctgcgt 1021 gacgagctgc tgaacgagac gctgttccgc tcgctgccgc cgctcgcgcc gtgcggaggc 1081 gtggcgacgc gattacaacg agcagcgtcc gcactcgaag ctcgggtggc tgacgccgca 1141 ggcttatgca gaagcgctca ccgcacacaa cggccggcct gctgcgtggt tgatggctgc 1201 gcagaccggc ctcttgccaa cccagccgat cacagctcag atcaacctag gactctcgtt 1261 atggctggat gagaaacggg ggtcacgtca tgcccgctcc acggtttata aaattcttga 1321 agacgaaagg gcctcgtgat acgcctattt // LOCUS HAMACAPT 1837 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA, partial cds. ACCESSION J05590 M22755 KEYWORDS N-acetylglucosamine-1-phosphate transferase; glycosyl transferase; tunicamycin-resistance. SOURCE Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone TRG-[4,22]. ORGANISM Cricetulus longicaudatus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 889 to 960) AUTHORS Lehrman,M.A., Zhu,X. and Khounlo,S. TITLE Amplification and molecular cloning of the hamster tunicamycin- sensitive N-acetylglucosamine-1-phosphate transferase gene: The hamster and yeast enzymes share a common peptide sequence JOURNAL J. Biol. Chem. 263, 19796-19803 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 1837) AUTHORS Zhu,X. and Lehrman,M.A. TITLE Cloning, sequence and expression of a cDNA encoding hamster UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate transferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and sequence for [1] kindly submitted by M. Lehrman, 04-APR-1989. Draft entry and computer-readable sequence for [1] kindly submitted by M.Lehrman, 12-JUN-1990. FEATURES from to/span description pept 148 1374 N-acetylglucosamine-1-phosphate transferase (GPT) binding 67 79 dolichol binding site binding 222 234 dolichol binding site BASE COUNT 363 a 515 c 418 g 541 t ORIGIN 1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg 61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg 121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg 181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc 241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg 301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc 361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc 421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg 481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct 541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta 601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc 661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat 721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta 781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt 841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga 901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc 961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct 1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg 1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc 1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag 1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt 1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc 1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc 1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca 1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt 1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt 1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc 1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat 1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag 1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc 1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt // LOCUS HS4NA3ABC 10089 bp ds-DNA VRL 24-JUL-1990 DEFINITION Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C genes, complete cds. ACCESSION M34440 KEYWORDS nuclear antigen. SOURCE Epstein-Barr virus type 2 (isolate AG876) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 10089) AUTHORS Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E., Rickinson,A. and Kieff,E. TITLE Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their EBNA-3A, EBNA-3B and EBNA-3C genes JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.T.Sample, 23-MAY-1990. Kieff Laboratory Harvard Medical School 75 Francis Street Boston, MA 02115 FEATURES from to/span description pept 189 524 nuclear antigen EBNA-3A, exon 1 610 3051 nuclear antigen EBNA-3A, exon 2 pept 3237 3593 nuclear antigen EBNA-3B, exon 1 3672 6155 nuclear antigen EBNA-3B, exon 2 pept 6277 6636 nuclear antigen EBNA-3C, exon 1 6711 9560 nuclear antigen EBNA-3C, exon 2 IVS 525 609 EBNA-3A intron IVS 3594 3671 EBNA-3B intron IVS 6637 6710 EBNA-3C intron BASE COUNT 2495 a 2837 c 2619 g 2138 t ORIGIN 1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct 61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag 121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg 181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag 241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca 301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc 361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac 421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc 481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg 541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt 601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat 661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg 721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg 781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact 841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg 901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg 961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc 1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct 1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac 1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat 1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg 1261 tacgggcaag acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca 1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg 1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg 1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct 1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa 1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac 1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc 1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc 1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg 1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg 1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa 1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca 1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca 2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag 2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc 2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac 2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg 2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa 2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg 2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg 2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc 2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg 2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg 2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa 2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg 2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct 2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat 2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt 2941 ggccagaatg tcacagacca tgaacctaga agggtggtgg tgtcagctat tgttcacatg 3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg 3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat 3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc 3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga 3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg 3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac 3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag 3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca 3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc 3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag 3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc 3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga 3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac 3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct 3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata 3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa 3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag 4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt 4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga 4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata 4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag 4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct 4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg 4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa 4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg 4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca 4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag 4621 ccagaacaga gcaaccaaga gccaagcctg attcccctgc ccccacagtg gtccttcggc 4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg 4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca 4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca 4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga 4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt 4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac 5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc 5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac 5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct 5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc 5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc 5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc 5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc 5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc 5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc 5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag 5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta 5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg 5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac 5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac 5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc 5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct 6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt 6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt 6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg 6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta 6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg 6301 gactctatac agtcacctga caatgcgcgg ggagatgatg tacagaatac tggtgagcat 6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac 6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc 6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta 6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca 6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc 6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc 6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct 6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg 6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat 6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca 6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg 7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag 7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga 7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc 7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt 7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta 7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc 7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg 7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag 7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc 7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac 7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag 7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg 7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag 7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt 7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga 7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta 7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca agtaagccac 8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc 8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc 8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc 8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc 8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc 8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg 8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac 8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg 8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg 8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata 8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc 8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact 8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc 8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt 8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag 8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc 9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac 9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc 9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc 9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac 9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg 9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag 9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa 9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg 9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca 9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga 9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta 9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca 9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg 9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg 9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg 9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag 9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc 10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt 10081 ggaggatcc // LOCUS HUMPAPB 3061 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human prostatic acid phosphatase mRNA, complete cds. ACCESSION M34840 KEYWORDS glycoprotein; prostatic acid phosphatase; secretory protein. SOURCE Human prostate, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1199) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3061) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.L.Van Etten, 08-JUN-1990. Chemistry Department Purdue University West Lafayette, IN 47907 FEATURES from to/span description pept 15 1175 prostatic acid phosphatase precursor (EC 3.1.3.2) /hgml_locus_uid="LA0168F" /nomgen="ACPP" /map="3q21-qter" sigp 15 110 prostatic acid phosphatase signal peptide matp 111 1172 prostatic acid phosphatase BASE COUNT 917 a 642 c 681 g 821 t ORIGIN Chromosome 3q21-qter. 1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc 61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga 121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg 181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg 241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt 301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg 361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc 421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc 481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg 541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag 601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg 661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga 721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat 781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc 841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc 901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga 961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc 1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt 1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa 1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag 1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc 1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca 1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt 1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa 1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt 1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt 1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga 1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc 1681 aaggagaggc aaagaaagga gatacagtgg agacatctgg aaagttttct ccactggaaa 1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt 1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat 1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata 1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga 1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag 2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag 2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt 2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta 2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca 2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa 2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc 2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac 2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa 2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat 2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac 2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc 2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc 2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat 2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc 2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat 2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta 3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa 3061 a // LOCUS LDV3ECPA 1064 bp ss-RNA VRL 24-JUL-1990 DEFINITION Lactate dehydrogenase-elevating virus capsid protein gene, complete cds. ACCESSION M34856 KEYWORDS viral capsid protein. SOURCE Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones A16, B24, B63, B90, B103, C44, dt4. ORGANISM Lactate dehydrogenase-elevating virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae. REFERENCE 1 (bases 1 to 1064) AUTHORS Godeny,E.K., Speicher,D.W. and Brinton,M.A. TITLE Map location of lactate dehydrogenase-elevating virus (LDV) capsid protein (Vp1) gene JOURNAL Virology (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.K.Godeny, 06-JUN-1990. FEATURES from to/span description pept 637 984 capsid protein pept 135 650 unidentified ORF BASE COUNT 267 a 223 c 265 g 309 t ORIGIN 3' terminus region. 1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc 61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc 121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt 181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg 241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt 301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg 361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc 421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg 481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta 541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga 601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc 661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat 721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc 781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga 841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt 901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat 961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg 1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt // LOCUS MBIFRHADGB 3903 bp ds-DNA BCT 24-JUL-1990 DEFINITION M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase (frhADGB) gene, complete cds. ACCESSION J02914 KEYWORDS 8-hydroxy-5-deazaflavin-reducing hydrogenase. SOURCE M.thermoautotrophicum (strain delta-H) DNA. ORGANISM Methanobacterium thermoautotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 3903) AUTHORS Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T. TITLE Cloning, sequence determination, and expression of the genes encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin reducing hydrogenase from Methanobacterium thermoautotrophicum delta-H JOURNAL Biochemistry (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.Alex, 25-MAY-1990. FEATURES from to/span description pept 298 1515 F420 hydrogenase alpha-subunit (frhA) pept 1524 2000 F420 hydrogenase delta-subunit (frhD) pept 2114 2824 F420 hydrogenase gamma-subuint (frhG) pept 2837 3682 F420 hydrogenase beta-subunit (frhB) binding 284 293 ribosomal binding site (put.) binding 1514 1520 ribosomal binding site (put.) binding 2100 2106 ribosomal binding site (put.) binding 2823 2832 ribosomal binding site (put.) BASE COUNT 1201 a 878 c 965 g 859 t ORIGIN 1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta 61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac 121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt 181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt 241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg 301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg 361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg 421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg 481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc 541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac 601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac 661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt 721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac 781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa 841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc 901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa 961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata 1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca 1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt 1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg 1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta 1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag 1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca 1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc 1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa 1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg 1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt 1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg actggcggtc 1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg 1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg 1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa 1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa 1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg 1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc 2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga 2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag 2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact 2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga 2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga 2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac 2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt 2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc 2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc 2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt 2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct 2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg 2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct 2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac 2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata 2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg 3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa 3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga 3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta 3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat 3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat 3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga 3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg 3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg 3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga 3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga 3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat 3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt 3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa 3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt 3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat 3901 cga // LOCUS MUSLRCNA 1751 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse loricrin gene, complete cds. ACCESSION M34398 KEYWORDS cell envelope protein; loricrin. SOURCE Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1751) AUTHORS Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D., Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M., Yuspa,S.H. and Roop,D.R. TITLE Identification of a major keratinocyte cell envelope protein, loricrin JOURNAL Cell 61, 1103-1112 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Rothnagel, 22-MAY-1990. FEATURES from to/span description pept 71 1516 loricrin signal 1740 1745 polyA signal BASE COUNT 223 a 493 c 706 g 329 t ORIGIN 1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc 61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg 121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg 181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg 241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg 301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg 361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg 421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg 481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg 541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg 601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg 661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc 721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg 781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg 841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg 901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg 961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg 1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg 1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc 1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg 1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag 1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg 1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg 1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg 1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac 1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc 1561 cgtgggcgcc gatgggctta actttctcat gaatttgcct gaggtttcca aacccttcac 1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct 1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca 1741 ataaatttgc t // LOCUS PFAGIPA 1906 bp ds-DNA INV 24-JUL-1990 DEFINITION P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete cds. ACCESSION J05544 KEYWORDS glucosephosphate isomerase. SOURCE P.falciparum (strain 7G8) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1906) AUTHORS Kaslow,D.C. and Hill,S. TITLE Cloning metabolic pathway genes by complementation in E.coli: Isolation and expression of Plasmodium falciparum glucosephosphate isomerase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.C.Kaslow, 04-JUN-1990. FEATURES from to/span description pept 28 1803 glucosephosphate isomerase (EC 5.3.1.9) BASE COUNT 782 a 174 c 289 g 661 t ORIGIN Chromosome 14. 1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat 61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat 121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct 181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa 241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc 301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata 361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat 421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg 481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa 541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat 601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat 661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct 721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt 781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg 841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta 901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga 961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat 1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat 1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca 1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa 1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat 1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa 1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg 1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa 1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt 1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta 1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt 1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat 1681 gacacaagaa atcaaaagaa atcagataat acctataatt ttaatgaatc tacaaaaaat 1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag 1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt 1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt // LOCUS RATGADA 3198 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Rat glutamic acid decarboxylase (GAD) mRNA, complete cds. ACCESSION M34445 KEYWORDS glutamic acid decarboxylase. SOURCE Rat brain and hypothalamus, cDNA to mRNA, (libraries of D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and pCR22HP. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3198) AUTHORS Wyborski,R.J., Bond,R.W. and Gottlieb,D.I. TITLE Characterization of a cDNA coding for rat glutamic acid decarboxylase JOURNAL Mol. Brain Res. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.I.Gottlieb, 23-MAY-1990. FEATURES from to/span description pept 184 1965 glutamic acid decarboxylase mRNA < 1 3198 glutamic acid decarboxylase mRNA signal 3174 3179 polyA signal BASE COUNT 869 a 742 c 767 g 820 t ORIGIN 1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca 61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc 121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag 181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat 241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc 301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag 361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa 421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct 481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg 541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac 601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac 661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt 721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta 781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc 841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca 901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc 961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc 1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg 1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt 1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat 1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca 1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg 1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc 1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa 1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag 1501 cagtatgacg tctcctacga tacaggggac aaggcgattc agtgtggccg ccatgtggac 1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc 1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt 1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa 1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc 1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg 1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc 1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc 1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca 2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg 2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa 2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat 2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc 2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca 2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag 2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta 2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca 2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac 2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag 2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt 2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta 2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt 2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt 2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc 2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa 3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta 3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt 3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa 3181 taaaaaatat tctccatg // LOCUS RATTSHRA 5270 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Rat thyroid stimulating hormone receptor mRNA, complete cds. ACCESSION M34842 KEYWORDS TSH receptor; thyroid stimulating hormone receptor; transmembrane protein. SOURCE Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5270) AUTHORS Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C., McBride,O.W. and Kohn,L.D. TITLE Cloning, chromosomal assignment and regulation of the rat thyrotropin receptor by thyrotropin, agents which increase cAMP levels and thyroid autoantibodies JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.D.Kohn, 05-JUN-1990. FEATURES from to/span description pept 55 2349 thyroid stimulating hormone receptor precursor sigp 55 117 thyroid stimulating hormone receptor signal peptide matp 118 2346 thyroid stimulating hormone receptor signal 5256 5261 poly-A signal BASE COUNT 1415 a 1343 c 1176 g 1336 t ORIGIN 1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg 61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga 121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag 181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact 241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat 301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa 361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta 421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct 481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct 541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg 601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg 661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga 721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct 781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag 841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac 901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat 961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc 1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag 1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag 1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac 1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca 1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg 1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc 1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc 1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac 1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt 1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc 1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg 1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc 1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac 1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg 1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt 1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc 1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg 2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc 2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag 2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa 2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt 2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca 2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa 2401 catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct 2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc 2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac 2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg 2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag 2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga 2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat 2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc 2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat 2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt 3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta 3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat 3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac 3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga 3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt 3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa 3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct 3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt 3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt 3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata 3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt 3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac 3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat 3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga 3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat 3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag 3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc 4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc 4081 tgtgaagtag ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg 4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa 4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga 4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag 4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc 4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg 4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc 4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct 4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg 4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca 4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa 4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta 4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga 4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg 4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc 4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg 5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca 5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt 5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg 5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg // LOCUS STMBLIPA 840 bp ds-DNA BCT 24-JUL-1990 DEFINITION S.clavuligerus beta-lactamase inhibitory protein gene, complete cds. ACCESSION M34538 KEYWORDS beta-lactamase inhibitory protein. SOURCE S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP. ORGANISM Streptomyces clavuligerus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 840) AUTHORS Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E. TITLE Isolation and characterization of a beta-lactamase inhibitory protein from Streptomyces clavuligerus and cloning and analysis of corresponding gene JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.E.Jensen, 25-MAY-1990. Dept. of Microbiology University of Alberta Edmonton, Alberta Canada T6G 2E9 FEATURES from to/span description pept 67 672 beta-lactamase inhibitory protein precursor (bliP) sigp 67 174 beta-lactamase inhibitory protein signal peptide matp 175 669 beta-lactamase inhibitory protein binding 52 63 ribosomal binding site signal 686 724 transcription termination signal BASE COUNT 140 a 248 c 320 g 132 t ORIGIN 1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga 61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc 121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg 181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc 241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc 301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc 361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg 421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg 481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg 541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc 601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg 661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc 721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg 781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc // LOCUS TRBPARPA 763 bp ss-mRNA INV 24-JUL-1990 DEFINITION T.brucei procyclin (pPS-1) mRNA, complete cds. ACCESSION M34790 KEYWORDS procyclic acidic repetitive protein; procyclin. SOURCE T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone pPS-1. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 763) AUTHORS Dorn,P.L., Aman,R.A. and Boothroyd,J.C. TITLE Inhibition of protein synthesis results in super-induction of procyclin RNA JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Boothroyd, 04-JUN-1990. Stanford University School of Medicine Dept of Microbiology and Immunology Sherman Fairchild Science Building Stanford, CA 94305-5402 USA FEATURES from to/span description pept 29 466 procyclin mRNA < 1 763 procyclin mRNA rpt 215 393 12 bp degenerate repeats BASE COUNT 192 a 215 c 156 g 200 t ORIGIN 1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct 61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc 121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg 181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga 241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga 301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga 361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc 421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt 481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc 541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt 601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc 661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag 721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt // LOCUS W10PPIBCA 4761 bp ds-DNA BCT 24-JUL-1990 DEFINITION Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes, complete cds, and ippD gene, 3' end. ACCESSION M34849 KEYWORDS antigen. SOURCE Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA, clones HB101/pHS[4108,5357]. ORGANISM Plasmid pWR100 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. TITLE Nucleotide sequence of the invasion plasmid antigen B and C genes (ipaB and ipaC) of Shigella flexneri JOURNAL Microb. Pathog. 4, 345-357 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 538 and 3987 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. JOURNAL Unpublished (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Baudry, 05-JUN-1990. Center for Vaccine Development University of Maryland at Baltimore 10 South Pine Street Baltimore, MD 21201 FEATURES from to/span description pept 606 1073 polypeptide I (ippI) pept 1079 2821 polypeptide B (ipaB) pept 2841 3932 polypeptide C (ipaC) pept 3983 > 4761 ipaD protein pept < 1 549 unidentified ORF binding 595 600 ippI ribosomal binding site binding 1065 1070 ipaB ribosomal binding site binding 2829 2834 ipaC ribosomal binding site binding 3971 3977 ipaD ribosomal binding site signal 556 561 ippI -10 region signal 2723 2728 ipaC -35 region signal 2745 2750 ipaC -10 region BASE COUNT 1769 a 952 c 805 g 1235 t ORIGIN 1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat 61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat 121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct 181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca 241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat 301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc 361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg 421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca 481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat 541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag 601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa 661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca 721 tttattcata tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct 781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta 841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat 901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag 961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat 1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat 1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga 1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac 1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa 1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct 1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa 1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga 1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat 1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat 1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga 1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca 1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat 1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa 1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg 1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc 1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga 1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt 2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt 2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca 2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc 2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc 2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac 2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac 2401 cctcacagac cttataccaa agtttctcaa gaatttttct tctcaactgg acgatttaat 2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc 2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc 2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa 2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga 2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc 2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg 2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata 2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc 2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat 3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg 3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag 3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag 3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga 3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc 3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa 3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc 3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa 3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc 3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt 3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc 3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca 3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca 3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc 3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag 3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg 3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc 4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa 4081 aacaacgacc agttctcatc ctgtaagttc ccttactatg ctcaacgaca cccttcataa 4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac 4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc 4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt 4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga 4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata 4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag 4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa 4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc 4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac 4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat 4741 agacaatatg ttaaaaagct t // LOCUS YEPOMPHA 824 bp ss-mRNA BCT 24-JUL-1990 DEFINITION Y.enterocolitica cationic outer membrane protein (ompH) gene, complete cds. ACCESSION M34854 KEYWORDS cationic outer membrane protein; enterobacterial outer membrane protein; lipopolysaccharide binding protein; secretory protein; structural membrane protein. SOURCE Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 824) AUTHORS Hirvas,L., Koski,P. and Vaara,M. TITLE The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and comparison to known ompH sequences JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.H.Hirvas, 06-JUN-1990. University of Helsinki FEATURES from to/span description pept 324 818 ompH protein precursor (gtg start codon) sigp 324 392 ompH protein signal peptide matp 393 815 ompH protein binding 308 317 ompH ribosomal binding site signal 120 125 ompH -35 region signal 143 148 ompH -10 region BASE COUNT 257 a 147 c 198 g 221 t 1 others ORIGIN 1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg 61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg 121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt 181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt 241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa 301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag 361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa 421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag 481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc 541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga 601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc 661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta 721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta 781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac // LOCUS YSCEXG1A 2056 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase (EXG1) gene, complete cds. ACCESSION M34341 KEYWORDS 1,3-glucan glucanohydrolase; exo-1,3-B-glucanase. SOURCE S.cerevisiae (strain AB320) DNA, clone YEp13. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2056) AUTHORS Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A., Nebreda,A.R., Mendez,E. and del Rey,F. TITLE Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1, in the yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.del Rey, 17-MAY-1990. Instituto de Microbiologia bioquimica Facultad de Biologia CSIC/Universidad de Salamanca 37008 Salamanca, SPAIN FEATURES from to/span description pept 363 1709 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase precursor (EXG1, EC 3.2.1.58) sigp 363 482 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase signal peptide matp 483 1706 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase mRNA 265 > 1709 EXG1 mRNA signal 153 159 TATA box BASE COUNT 610 a 395 c 396 g 655 t ORIGIN Chromosome XII-right arm. 1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg 61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac 121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa 181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat 241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt 301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta 361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta 421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa 481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg 541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg 601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg 661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg 721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc 781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca 841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt 901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca 961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat 1021 acttggacac tgttattggt atcgaattga ttaatgagcc attgggtcct gttctagaca 1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca 1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact 1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct 1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg 1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga 1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg 1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt 1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct 1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg 1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc 1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat 1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt 1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa 1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta 1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat 1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt 2041 gagcttttcc gccggc // LOCUS HUMPSG6A 1409 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human pregnancy-specific glycoprotein-6 (PSG6) mRNA, complete cds. ACCESSION M33666 KEYWORDS pregnancy-specific glycoprotein. SOURCE Human fetal liver first trimester, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1409) AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A. TITLE cDNA cloning demonstrates the expression of pregnancy-specific glycoprotein genes, a subgroup of the carcinoembryonic antigen gene family, in fetal liver JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989) STANDARD simple staff_review FEATURES from to/span description pept 53 1360 pregnancy-specific glycoprotein-6 BASE COUNT 395 a 403 c 294 g 317 t ORIGIN 1 gggcgggcct aggctcatct ccacagggga gaacacacag acagcagaga ccatgggacc 61 cctctcagcc cctccctgca ctcagcacat cacctggaag gggctcctgc tcacagcatc 121 acttttaaac ttctggaacc tgcccaccac tgcccaagta ataattgaag ccaagccacc 181 caaagtttcc gaggggaagg atgttcttct acttgtccac aatttgcccc agaatcttac 241 tggctacatc tggtacaaag ggcaaatgac ggacctctac cattacatta catcatatgt 301 agtacacggt caaattatat atgggcctgc ctacagtgga cgagaaacag tatattccaa 361 tgcatccctg ctgatccaga atgtcacaca ggaggatgca ggatcctaca ccttacacat 421 cataaagcga ggcgatggga ctggaggagt aactggatat ttcactgtca ccttatactc 481 ggagactccc aagccctcca tctccagcag caacttaaac cccagggagg tcatggaggc 541 tgtgcgctta atctgtgatc ctgagactcc ggatgcaagc tacctgtggt tgctgaatgg 601 tcagaacctc cctatgactc acaggttgca gctgtccaaa accaacagga ccctctatct 661 atttggtgtc acaaagtata ttgcaggacc ctatgaatgt gaaatacgga acccagtgag 721 tgccagccgc agtgacccag tcaccctgaa tctcctcccg aagctgccca tgccttacat 781 caccatcaac aacttaaacc ccagggagaa gaaggatgtg ttagccttca cctgtgaacc 841 taagagtcgg aactacacct acatttggtg gctaaatggt cagagcctcc cggtcagtcc 901 gagggtaaag cgacccattg aaaacaggat actcattcta cccagtgtca cgagaaatga 961 aacaggaccc tatcaatgtg aaatacggga ccgatatggt ggcatccgca gtaacccagt 1021 caccctgaat gtcctctatg gtccagacct ccccagaatt tacccttcat tcacctatta 1081 ccgttcagga gaaaacctcg acttgtcctg ctttgcggac tctaacccac cggcagagta 1141 ttcttggaca attaatggga agtttcagct atcaggacaa aagctcttta tcccccaaat 1201 tactacaaat catagcgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga 1261 aatctccaaa tccatgatag tcaaagtctc tgagacagca tctccccagg ttacctatgc 1321 tggtccaaac acctggtttc aagaaatcct tctgctgtga cctcccaaag tgctaggatt 1381 aaaacatgac ccaccatgaa acccgccca // LOCUS HUMPSGA 1930 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human pregnancy-specific glycoprotein-1a (PSG-1a) mRNA, complete cds. ACCESSION M33664 KEYWORDS pregnancy-specific glycoprotein. SOURCE Human fetal liver first trimester, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1930) AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A. TITLE cDNA cloning demonstrates the expression of pregnancy-specific glycoprotein genes, a subgroup of the carcinoembryonic antigen gene family, in fetal liver JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989) STANDARD simple staff_review FEATURES from to/span description pept 1 1260 pregnancy-specific glycoprotein-1a BASE COUNT 560 a 520 c 390 g 460 t ORIGIN 1 atgggaaccc tctcagcccc tccctgcaca cagcgcatca aatggaaggg gctcctgctc 61 acagcatcac ttttaaactt ctggaacctg cccaccactg cccaagtcac gattgaagcc 121 cagccaccca aagtttccga ggggaaggat gttcttctac ttgtccacaa tttgccccag 181 aatcttaccg gctacatctg gtacaaaggg caaatgaggg acctctacca ttacattaca 241 tcatatgtag tagacggtga aataattata tatgggcctg catatagtgg acgagaaaca 301 gcatattcca atgcatccct gctgatccag aatgtcaccc gggaggacgc aggatcctac 361 accttacaca tcataaaggg agatgatggg actagaggag taactggacg tttcaccttc 421 accttacacc tggagactcc taagccctcc atctccagca gcaacttaaa tcccagggag 481 accatggagg ctgtgagctt aacctgtgac cctgagactc cagacgcaag ctacctgtgg 541 tggatgaatg gtcagagcct ccctatgact cacagcttga agctgtccga aaccaacagg 601 accctctttc tattgggtgt cacaaagtat actgcaggac cctatgaatg tgaaatacgg 661 aacccagtga gtgccagccg cagtgaccca gtcaccctga atctcctccc gaagctgccc 721 aagccctaca tcaccatcaa caacttaaac cccagggaga ataaggatgt cttaaacttc 781 acctgtgaac ctaagagtga gaactacacc tacatttggt ggctaaatgg tcagagcctc 841 ccggtcagtc ccagggtaaa gcgacccatt gaaaacagga tcctcattct acccagtgtc 901 acgagaaatg aaacaggacc ctatcaatgt gaaatacggg accgatatgg tggcatccgc 961 agtgacccag tcaccctgaa tgtcctctat ggtccagacc tccccagaat ttacccttca 1021 ttcacctatt accgttcagg agaagtcctc tacttgtcct gttctgcgga ctctaaccca 1081 ccggcacagt attcttggac aattaatgaa aagtttcagc taccaggaca aaagctcttt 1141 atccgccata ttactacaaa gcatagcggg ctctatgttt gctctgttcg taactcagcc 1201 actggcaagg aaagctccaa atccatgaca gtcgaagtct ctgactggac agttccctga 1261 attctactag tacctccaat tccattttct cccatggaat cactaagagc aagacccact 1321 ctgttccaga agccctataa gctggaggtg gacaactcaa tgtaaatttc atgggaaaac 1381 ccttgtacct gaagcatgag ccactcagaa ctcaccaaaa tattcgacac cataacaaca 1441 gatgctcaaa ctgtaaacca ggacaacaag tggatgactt cacactgtgg acagtttttc 1501 ccaagatgtc agaagaagac tccccatcat gatgaggctc tcccccctct taactgtcct 1561 tgctcatgcc tgcctctttc acttggcagg ataatgcagt cattagaatt tcacatgtag 1621 tagcttctga gagtaacaac agagtgtcag atatgtcatc tcaacctcaa acttttatgt 1681 aacatctcag ggggaaatgt ggctctctcc accttgcata cagggctccc aatagaaatc 1741 aacacagaga tattgcctgt gtgttttcag agaagatggt ttgtatgaag acgtaggaaa 1801 gctgaaatta taatagagtc ccctttaaat ccacattgtg tggatggctc ttgccgtttc 1861 ctaagagata cattgtaaaa cgtgacagta agacattcta gcagaataaa acatgtacca 1921 cctttgcccc // LOCUS HUMPSGC 1307 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human pregnancy-specific glycoprotein-4 (PSG4) mRNA, 3' end. ACCESSION M33665 KEYWORDS pregnancy-specific glycoprotein. SOURCE Human fetal liver first trimester, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1307) AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A. TITLE cDNA cloning demonstrates the expression of pregnancy-specific glycoprotein genes, a subgroup of the carcinoembryonic antigen gene family, in fetal liver JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 639 pregnancy-specific glycoprotein-4 (AA at 1) BASE COUNT 394 a 339 c 249 g 325 t ORIGIN 1 gcggggtata ttgcaggacc ctatgaatgt gaaatacgga acccagtgag tgccagccgc 61 agtgacccag tcaccctgaa tctcctccca aagctgtcca agccctacat cacaatcaac 121 aacttaaacc ccagagagaa taaggatgtc ttaaccttca cctgtgaacc taagagtgag 181 aactacacct acatttggtg gctaaatggt cagagcctcc ctgtcagtcc cagggtaaag 241 cgacccattg aaaacaggat cctcattcta cccaatgtca cgagaaatga aacaggacct 301 tatcaatgtg aaatacggga ccgatatggt ggcatccgca gtgacccagt caccctgaat 361 gtcctctatg gtccagacct ccccagcatt tacccttcat tcacctatta ccgttcagga 421 gaaaacctct acttgtcctg cttcgccgag tctaacccac gggcacaata ttcttggaca 481 attaatggga agtttcagct atcaggacaa aagctctcta tcccccaaat aactacaaag 541 catagtgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga aagctccaaa 601 tccatcacag tcaaagtctc tgactggata ttaccctgaa ttctactagt tcctccaatt 661 ccattttctc ccatggaatc acgaagagca agacccactc tgttccagaa gccctataag 721 ctggaggtgg acaactcgat gtaaatttca tgggaaaacc cttgtacctg acatgtgagc 781 cactcagaac tcaccaaaat gttcgacacc ataacaacag ctactcaaac tgtaaaccag 841 gataagaagt tgatgacttc acactgtgga cagtttttcc aaagatgtca gaacaagact 901 ccccatcatg ataaggctcc cacccctctt aactgtcctt gctcatgcct gcctctttca 961 cttggcagga taatgcagtc attagaattt cacatgtagt agcttctgag ggtaacaaca 1021 gagtgtcaga tatgtcatct caacctcaaa cttttacgta acatctcagg cgaaatgtgg 1081 ctctctccat cttgcataca gggctcccaa tagaaatgaa cacagagata ttgcctgtgt 1141 gtttgcagag aagatggttt ctataaagag taggaaagct gaaattatag tagagtctcc 1201 tttaaatgca cattgtgtgg atggctctca ccatttccta agagatacag tgtaaaacgt 1261 gacagtaata ctgattctag cagaataaaa catgtaccac atttccc // LOCUS HUMPSGD 1699 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human pregnancy-specific glycoprotein-1d (PSG-1d) mRNA, complete cds. ACCESSION M33663 KEYWORDS pregnancy-specific glycoprotein. SOURCE Human fetal liver first trimester, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1699) AUTHORS Zimmermann,W., Weiss,M. and Thompson,J.A. TITLE cDNA cloning demonstrates the expression of pregnancy-specific glycoprotein genes, a subgroup of the carcinoembryonic antigen gene family, in fetal liver JOURNAL Biochem. Biophys. Res. Commun. 163, 1197-1209 (1989) STANDARD simple staff_review FEATURES from to/span description pept 64 1344 pregnancy-specific glycoprotein-1d BASE COUNT 493 a 442 c 343 g 421 t ORIGIN 1 ctcagagagt ttctggatcc taggcttatc tccacagagg agaacacaca agcagcagag 61 accatgggaa ccctctcagc ccctccctgc acacagcgca tcaaatggaa ggggctcctg 121 ctcacagcat cacttttaaa cttctggaac ctgcccacca ctgcccaagt cacgattgaa 181 gcccagccac ccaaagtttc cgaggggaag gatgttcttc tacttgtcca caatttgccc 241 cagaatctta ccggctacat ctggtacaaa gggcaaatga gggacctcta ccattacatt 301 acatcatatg tagtagacgg tgaaataatt atatatgggc ctgcatatag tggacgagaa 361 acagcatatt ccaatgcatc cctgctgatc cagaatgtca cccgggagga cgcaggatcc 421 tacaccttac acatcataaa gggagatgat gggactagag gagtaactgg acgtttcacc 481 ttcaccttac acctggagac tcctaagccc tccatctcca gcagcaactt aaatcccagg 541 gagaccatgg aggctgtgag cttaacctgt gaccctgaga ctccagacgc aagctacctg 601 tggtggatga atggtcagag cctccctatg actcacagct tgaagctgtc cgaaaccaac 661 aggaccctct ttctattggg tgtcacaaag tatactgcag gaccctatga atgtgaaata 721 cggaacccag tgagtgccag ccgcagtgac ccagtcaccc tgaatctcct cccgaagctg 781 cccaagccct acatcaccat caacaactta aaccccaggg agaataagga tgtcttaaac 841 ttcacctgtg aacctaagag tgagaactac acctacattt ggtggctaaa tggtcagagc 901 ctcccggtca gtcccagggt aaagcgaccc attgaaaaca ggatcctcat tctacccagt 961 gtcacgagaa atgaaacagg accctatcaa tgtgaaatac gggaccgata tggtggcatc 1021 cgcagtgacc cagtcaccct gaatgtcctc tatggtccag acctccccag aatttaccct 1081 tcattcacct attaccgttc aggagaagtc ctctacttgt cctgttctgc ggactctaac 1141 ccaccggcac agtattcttg gacaattaat gaaaagtttc agctaccagg acaaaagctc 1201 tttatccgcc atattactac aaagcatagc gggctctatg tttgctctgt tcgtaactca 1261 gccactggca aggaaagctc caaatccatg acagtcgaag tctctggtaa gtggatccca 1321 gcatcgttgg caatagggtt ttaggtggag tctatctggc attcagagaa gagtcaggaa 1381 aacaattgta ttcccagcct gtgtccctag ggcacaagca aatcccaaat tctcctcctg 1441 aaccctccaa atttgtctaa gaacttcgaa aactttaaca aacaggctga tatcttcata 1501 atattcccag cctagaccaa gcaggaagaa cattgatttc attgaaataa ttgataataa 1561 tgaagataat gtttttatga tttttatttg aaaatttgct aattctttaa atggtttgtt 1621 ttctacattg atggaatttt tctcttttaa tctatctaca gcttatagca gttcaataaa 1681 ctatacttct gggaaccgt // LOCUS RHMDCTAB 764 bp ds-DNA BCT 24-JUL-1990 DEFINITION R.meliloti dctA and dctB genes, 5' end. ACCESSION M33555 KEYWORDS regulatory protein. SOURCE R.meliloti DNA. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 764) AUTHORS Wang,Y.-P., Birkenhead,K., Boesten,B., Manian,S. and O'Gara,F. TITLE Genetic analysis and regulation of the Rhizobium meliloti genes controlling C-4-dicarboxylic acid transport JOURNAL Gene 85, 135-144 (1989) STANDARD simple staff_review FEATURES from to/span description pept 726 > 764 dctA protein BASE COUNT 149 a 239 c 252 g 124 t ORIGIN 1 cccgggccgc ttgctgaccg tgcccatggc gaaatgttcg gccatgccgt cggcggacgg 61 cgagccggaa ataatcgcgg aaggcatagt cgttgccgac gaagctcgtc ggctcctgcc 121 agttgctggc cgcgacggcg acgccgctgc ggtcgatcag ataaatgacc gcggcttcgg 181 cgcttgtcgc cagggcctcg agcttacggt tgatgcggtc gagcgacggc ctgtccggcg 241 aaagcaatgc gccacgaatg gctgcgtcgt cggcgaggac aagcggcagg gcgcgctgcc 301 gttccacgac tgctcgaaga agcgaggctt tcaggctggc gtcgatccgg ctctgaccgg 361 caaggccggc gagcgcctgc gaccggccgt agtctcgcgc aagaagaagg ccggctgcca 421 gaaggaccag cgcgactgcc gcgaaaacga gccaggaccg tcgggcccgg ctgcgaacgc 481 atgcgatcgc tcgcttctgc aggaagtttg accatgcgaa cattgtgcat gttttcgccc 541 aggacgccag cacttctgtg cggaaatccg cacatatcca cgaacggcaa gcgagcaccg 601 ctcccaaaaa tgtcatggat tcaatcgcat aggccttccg cggcaaactg gcacgcatgt 661 tgctgacaag ctccacaagg cagccactgc tgtcgatctt cggaaagcgg cccgggaggc 721 ccggcatgtt gccggactgg gcctgccacg tggaggatat catg // LOCUS MNKSVAA 589 bp ds-DNA PRI 24-JUL-1990 DEFINITION Monkey sp. DNA fragment found also in SV40 genome. ACCESSION M33902 KEYWORDS . SOURCE Monkey sp. DNA. ORGANISM Monkey sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Cercopithecidae. REFERENCE 1 (bases 1 to 589) AUTHORS Szymanski,P. and Woodworth,M. TITLE A 69-base-pair monkey DNA sequence enhances simian virus 40 replication and transcription through multiple motifs JOURNAL J. Virol. 64, 1360-1365 (1990) STANDARD simple staff_review FEATURES from to/span description binding 221 230 AP-4 binding site binding 243 249 AP-1 binding site BASE COUNT 162 a 161 c 118 g 148 t ORIGIN 1 aagcttcgta gaatctcaga aagtcataaa gctacaaacg agttcacaaa caagtgatgc 61 atagaccaca cccagcccac agatgagttc tgtgtggctc acacttattt tttaaaacag 121 agtttgctgc caacacttca aaacccaatg attttgcata aaaatccaga tttattagaa 181 aatctgtgga cttttctgcc tggcaacaac catctggaac tcagcagcgg ttgcccctga 241 gatgagtcac agttcccacc ccactcaccc atgtcaccta tatggcttgg aaggcatgtg 301 agtttgcaac ctgctcaaag ccacacccca gaaacggcct gctaaaatgg gattccagaa 361 tccacttgca cctttcgtct ttctcagtct cccctgactc aacacagcac actggcattg 421 ttttttttcc tctcagaaaa ctgagcatgc acttttgaga atcagagaga aatccgcctc 481 tcaaatgtca gaaaacagct gatgctgcct cttacttggg ggcctcttga cttcccacgt 541 gtcttactgc cacttgagag ggaggggtgc aggcaggcag gctccctgg // LOCUS MNKSVAB 69 bp ds-DNA PRI 24-JUL-1990 DEFINITION Monkey sp. DNA fragment also found in SV40 genome. ACCESSION M33903 KEYWORDS . SOURCE Monkey sp. DNA. ORGANISM Monkey sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Cercopithecidae. REFERENCE 1 (bases 1 to 69) AUTHORS Szymanski,P. and Woodworth,M. TITLE A 69-base-pair monkey DNA sequence enhances simian virus 40 replication and transcription through multiple motifs JOURNAL J. Virol. 64, 1360-1365 (1990) STANDARD simple staff_review FEATURES from to/span description binding 1 7 AP-1 binding site binding 41 50 AP-4 binding site binding 63 69 AP-1 binding site BASE COUNT 16 a 21 c 15 g 17 t ORIGIN 1 tgactcatca cttttctgcc tggcaacaac catctggaac tcagcagcgg ttgcccctga 61 gatgagtca // LOCUS RSSFBPPRK 2099 bp ds-DNA BCT 24-JUL-1990 DEFINITION R.sphaeroides fructose 1,6-bisphosphatase (fbpB) and phosphoribulokinase (prkB) genes, complete cds. ACCESSION J02922 KEYWORDS 1,6-bisphosphatase; phosphoribulokinase. SOURCE R.sphaeroides (strain HR) DNA. ORGANISM Rhodobacter sphaeroides Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 2099) AUTHORS Gibson,J.L., Chen,J.-H., Tower,P.A. and Tabita,F.R. TITLE The form II fructose 1,6-bisphosphatase and phosphoribulokinase genes form part of a large operon in Rhodobacter sphaeroides: Primary structure and insertional mutagenesis analysis JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Gibson, 15-JUN-1990. FEATURES from to/span description pept 44 1039 fructose 1,6-bisphosphatase (fbpB) pept 1043 1921 phosphoribulokinase (prkB) BASE COUNT 361 a 694 c 720 g 324 t ORIGIN 1 ggatcctgac gcaacgtccg ccgcgacaga ggcaggagga gccatggcca tcgagctgga 61 ggacctgggg ctgagccccg atgtggcgga cgtgatgcag cgtctggcgc gcgtgggggc 121 aggcatcgcc cgcatcatct cgcgcaacgg gctcgagcgc gatctgggcg cgggcgtcgg 181 caccaatgcc ggaggagacg ggcagaaggc gctcgacgtg atcgcggacg acgcgttccg 241 cgcggcgctc gaaggctctg cggtggctta ttacgcctcc gaggagcagg acgaagtggt 301 gacgctgggc gagggaagcc tcgcgctcgc catcgacccg ctggacggct cgtccaacat 361 cgatgtgaac gtgtcgatcg ggacgatctt ctcgatcttc ccggcggcgg ctggccccga 421 ggccagcttc ctgcgcccgg gcaccgagca gattgccggc ggctacatca tctacgggcc 481 gcaatgcgcg ctggtctgca gcttcgggca gggcgtgcag cactgggtgc tcgacctcga 541 tgcgggcatc ttccggcgga tgcccgacat ccgcccgctg ccggccgaga cgtccgagtt 601 tgcgatcaac gcctcgaact accgccactg gccgcagccg atccgcgcct tcgtcgacga 661 tctggtcgcc ggggccgagg ggccgcgcgg caagaacttc aacatgcgct ggatcgcctc 721 gctggtggcc gagacgcacc gcatcctgat gcggggcggg gtgtttctct atcccggcga 781 cgagcgcaag ggctacgagc ggggccggct gcgccatgtc tacgaatgcg cgcccatcgc 841 cttcctgatc gcgaatgtcg gggggggcgc caccgacggc tgcgccgaca tcctgaccgc 901 gctgcccgac cggctgcacg cccgcacccc cttcgtcttc ggctgcgcga gcaaggtcgc 961 ccgcgtcgcc gcctatcacg atctggcctg cgaagagacg tccgctctct tcggcagccg 1021 gggcctgttc cggagttaaa gagtgtcgaa gaaatatccc atcatttccg tggtcggctc 1081 gtccggcgcg ggcacctcga cggtcaagaa cacgttcgag cagatcttcc gccgcgaggg 1141 ggtcaagtcc gtctcgatcg agggcgacgc cttccaccgc ttcaaccggg ccgacatgaa 1201 ggccgaactc gagcggcgct atgcggcggg cgatgcgacc ttctcgcatt tctcctacga 1261 ggcgaacgaa ctgaaggagc tggagcgcgt cttccgcgaa tatggcgaga cggggcgcgg 1321 ccgcacccgc acctatgtcc atgacgatgc cgaagccgcc cggacgggcg tggcccccgg 1381 caatttcacc caatgggcgc cgttcgagga caacagcgac ctgcttttct acgaggggct 1441 gcacggctgc gtggtcaatg acgaggtgaa cctcgtccgc catgccgatc tgaagctcgg 1501 cgtggcgccg gtcatcaacc ttgaatggat ccagaagatc caccgcgacc gggcgcagcg 1561 cggctatacg accgaagccg tcaccgacgt gatcctgcgc cggatgtatg cctacgtcca 1621 ctgtatcgtc ccgcaattct ccgagacgga catcaacttc cagcgcgtgc cggtggtgga 1681 cacctcgaac ccgttcatcg cgcgctggat ccccacgccg gacgagagcc tgatcgtgat 1741 ccggttcaag aacccgcgcg ggatcgactg cccctatctc acctcgatga tcgcgggctc 1801 gtggatgagc cgggcgaatt ccatcgtggt gccgggcaac aagcaggatc tggcgatgca 1861 gctgatcctg acgccgctca tcgagcggat ggtgcgcgag gcgcgccgcg cgcgggcctg 1921 aggcgcgaca gacagacgga gagggggccg cggatcgagc cgcggcccgg caaccgggac 1981 gcacagcgaa ggatgagagc catgaaggac attggagccg cgcaggagac gcggatggcg 2041 aacgcatccg ggccctcgcg atggatgccg tgaagaaggc caagtcgggc catcccggg // LOCUS YSCACH1A 2466 bp ss-mRNA PLN 24-JUL-1990 DEFINITION S.cerevisiae acetyl-CoA hydrolase (ACH1) mRNA, complete cds. ACCESSION M31036 KEYWORDS acetyl-CoA hydrolase. SOURCE S.cerevisiae (strain T3A), cDNA to mRNA, clones pBA[4,5]. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2466) AUTHORS Lee,F.-J.S., Lin,L.-W. and Smith,J.A. TITLE A glucose-repressible gene encodes acetyl-CoA hydrolase from Saccharomyces cerevisiae JOURNAL J. Biol. Chem. 265, 7413-7418 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.-W.Lin, 27-DEC-1989. FEATURES from to/span description pept 614 2194 acetyl-CoA hydrolase (ACH1) (EC 3.1.2.1) BASE COUNT 730 a 529 c 558 g 649 t ORIGIN Chromosome II. 1 gttagccatc attatcatta aaatatcaac ccgaagaaca atgtatacat atacatatac 61 gtacacatat acatatgtac atatgacata cgtattagcc gctgaggacg cggacgtata 121 aaaggacaat acttatatgg agctaagggg agcagttacg caactccgtg atcgcgcgcc 181 acgggccgtc ggcggctgtt aattgaagaa aaaaaaaatg aagaaccaca aggggtgatc 241 catataggtg actagcatca tcccctgcga cgcgcggccc gccgggcaaa ggcgggcaat 301 gcgcgctgct gattggcctc gaggacaacg ccctcaacca catccgcaac agccaatccc 361 atcggagcgt caaactacca aagtagtgat tgtatggatc accactgtat tgtggacggt 421 aagcgcttgc tggagcaaat gtgtaatcaa gttgctgtgt atatatagac gttagatgtg 481 ttctacccct tcttttgtct tgtgcccacc gggcttacat tagcacacaa agcagcaaga 541 gaccgtctta ctagacaata gcggcaaaac aaacaacaca tttctttttt tctttttcac 601 atattgcact aaaatgacaa tttctaattt gttaaagcag agagttaggt atgctcccta 661 tctgaaaaaa gttaaggaag ctcacgagct tattccattg ttcaagaatg gtcagtacct 721 tgggtggtcc ggttttacag gagtgggtac tcccaaggca gtgccggagg cactgataga 781 tcacgtggag aagaacaatt tacaagggaa gttgagattc aacctttttg ttggagcttc 841 tgctggtcca gaggaaaacc gttgggctga acacgacatg atcattaaga gagcccctca 901 tcaagtaggg aaacccattg caaaggcaat taaccagggt agaattgagt tctttgataa 961 acatctgtcc atgttccctc aggatctgac atacgggttc tacaccaggg aaagaaaaga 1021 caacaaaatc cttgattata ctataatcga ggcaacggcc attaaagagg acgggtctat 1081 cgtcccaggt ccctctgtcg gtggttctcc agaattcatt acagtcagtg ataaagtgat 1141 tattgaggtt aacacggcta cgccttcgtt cgagggtatt cacgatatag acatgcccgt 1201 gaacccacct ttcaggaaac catacccata tctgaaagtg gacgacaagt gtggtgttga 1261 ctccatcccg gttgatcctg aaaaggttgt tgcgattgtg gagtccacca tgagggacca 1321 ggtcccacca aatacgccct ctgacgacat gtccagggct attgcaggtc atttggtcga 1381 gtttttcaga aacgaggtaa aacatggtag gctacctgaa aacctgctgc ctttacaaag 1441 tggtataggt aacattgcta acgctgtcat tgaagggctt gctggcgccc aattcaagca 1501 cttgactgta tggacggaag tgctgcagga ctcgttattg gatcttttcg agaacggatc 1561 tttggactac tccactgcta cttccgtgag attgactgaa aagggtttcg acagagcctt 1621 tgcaaactgg gaaaatttca aacacagatt gtgtttgaga tctcaagttg tctcgaacaa 1681 tccggaaatg atccgtagat tccctgtcat cgccatgaat accccagtag aagttgacat 1741 ttacgcgcac gccaattcta caaatgtgaa tggttcccgt atgttgaacg ggttgggtgg 1801 atctgctgat ttcttgagaa atgcaaagtt gtccatcatg catgccccct ctgcaagacc 1861 aactaaagta gaccctaccg gtatctctac cattgttcct atggcctctc atgtagatca 1921 aactgagcat gacctggaca tcttggtcac tgaccaaggt ttggcggatc taagaggtct 1981 atcgcctaag gaaagagccc gtgaaatcat caacaagtgt gctcatcccg attatcaagc 2041 tttgttgacc gattacttgg acagagcaga gcattacgct aaaaagcaca attgcttgca 2101 tgaaccacac atgctaaaga atgctttcaa gttccacacc aacttagctg aaaagggtac 2161 aatgaaggtc gacagctggg aaccagttga ctagtgtttg tgcgcaaacc gagagatgag 2221 tatttaacaa aaaaaagaaa ggaaatgata tgattatgat tttatgttta taaagctttt 2281 atccaatgcg ttgttttttc ttgcatattt ataccttttg cgctcatgga gggagttaat 2341 caatacgcat gacgtctagt taattcacag gtagtactgt atatttatat gtttacacaa 2401 taattatgta ttaagtagtg attagtaaaa aaaactaaga ggttgaaagt catcaaccct 2461 tatatt // LOCUS YSCYEF3B 3969 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae elongation factor 3 (EF-3) gene, complete cds. ACCESSION J05583 KEYWORDS elongation factor. SOURCE S.cerevisiae DNA, clone YEpEF3. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3969) AUTHORS Sandbaken,M.G., Lupisella,J.A., DiDomenico,B. and Chakraburtty,K. TITLE Protein synthesis in yeast: Structural and functional analysis of the gene encoding elongation factor three (EF-3) JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Chakraburtty, 15-JUN-1990. FEATURES from to/span description pept 489 3623 elongation factor 3 signal 202 212 regulatory UAS-RPG binding 1875 1898 nucleotide binding consensus GX4GK(S/T) binding 2589 2612 nucleotide binding consensus GX4GK(S/T) binding 463 470 nucleotide binding consensus GX4GK(S/T) binding 701 708 nucleotide binding consensus GX4GK(S/T) binding 2117 2213 nucleotide binding consensus binding 3176 3272 nucleotide binding consensus binding 544 575 nucleotide binding consensus binding 897 928 nucleotide binding consensus BASE COUNT 1203 a 758 c 809 g 1199 t ORIGIN 1 caacttacaa tcattgttcg ccccttccat acttactgcc actcgcaaaa gggcccaacc 61 agggcaatta cgtatcaaaa aatcatgaca ggctgggtaa taaatattcg tgaagaaaga 121 agaaattaaa aaaagaaacg aagaagcaaa aaaaagaaaa gactccgttt aatcactttc 181 aaccgcggtt tatccggccc cacccatgca taaccctaaa ttattagatc acttagcacg 241 tgaaaaagaa acgtttttaa tgtttttttt ttttttcttt ttcttttttt gcgttggtga 301 aaattttttc gcttcctcga gtataattat ctcatctcat ctttcatata agataagaag 361 ttttataaaa accttttgca tcaaaatttt gttgaataac tctttttctt acgctctctt 421 tctttcctta attgttttct aaagaaccgt gtatttttct agttcgaatc catcgataac 481 attaaaagat gtctgattcc cagcaatcca ttaaggttct agaagaacta ttccagaagt 541 tatctgttgc cactgctgac aacagacacg aaatcgcttc tgaagtcgct tctttcttga 601 atggtaacat cattgaacat gatgttccag aacacttctt cggtgaattg gccaagggta 661 tcaaggacaa gaagaccgct gctaacgcca tgcaagctgt tgctcacatt gctaaccaat 721 ctaacttgtc tccatctgtt gaaccataca tcgtccaatt ggttccagct atctgtacca 781 acgcaggtaa caaggacaaa gaaattcaat ctgttgcttc cgaaactttg atttccatcg 841 ttaacgctgt taacccagtt gccatcaaag ctttgttgcc acatttgact aacgctattg 901 tagaaactaa caaatggcaa gaaaagattg ctattttggc agctttctct gctatggtcg 961 atgctgctaa ggatcaagtt gccctaagaa tgccagaatt gattccagtc ttgtctgaaa 1021 ccatgtggga caccaagaag gaagtcaagg ctgctgctac tgccgccatg accaaggcta 1081 ccgaaactgt tgacaacaag gatattgaac gtttcattcc aagtttgatt caatgtattg 1141 ctgacccaac tgaagttcca gaaaccgttc atttgctagg tgctactact ttcgttgctg 1201 aagttactcc agctactttg tccatcatgg tcccattgtt gtccagaggt ttgaacgaaa 1261 gagaaaccgg tatcaagcgt aagtctgctg ttattattga caacatgtgt aagttggtcg 1321 aagacccaca agttattgct cctttcttgg gtaaattgtt gccaggtttg aagagtaact 1381 ttgctaccat tgctgaccca gaagccagag aagttacttt gagagctttg aagactttga 1441 gaagagttgg taacgttggt gaagacgatg ctattccaga actttctcac gctggtgacg 1501 tttctactac tttgcaagtc gttaacgaat tgttgaagga cgaaaccgtt gctccaagat 1561 ttaagattgt cgtcgagtac attgccgcca ttggtgctga tttgatcgat gaaagaatca 1621 ttgaccaaca agcttggttc acccacatca ccccatacat gactatcttc ttgcacgaaa 1681 agaaggccaa ggacatcttg gacgaattca gaaagagagc tgtcgacaac attccagttg 1741 gtccaaactt cgacgacgaa gaagacgaag gtgaagactt atgtaactgt gaattttctt 1801 tggcttatgg tgctaaaatc ttgttgaaca agacccaatt aagattgaag agagccagaa 1861 gatatggtat ctgtggtcca aacggttgtg gtaagtccac tttaatgaga gctattgcca 1921 acggtcaagt tgatggtttc ccaacccaag aagaatgtag aaccgtctac gtcgaacacg 1981 acattgatgg tactcactct gacacttccg tcttggattt cgttttcgaa tctggtgttg 2041 gtactaaaga agctatcaag gacaaattga ttgaattcgg tttcaccgat gaaatgattg 2101 ctatgccaat ctctgcttta tctggtggtt ggaagatgaa gttggctcta gctagagctg 2161 tgttgagaaa tgctgatatc ttgttgttag atgaaccaac taaccatttg gataccgtca 2221 acgttgcttg gttagttaac tacttgaaca cctgtggtat cacttctatc actatttctc 2281 acgactccgt tttcttagat aacgtctgtg aatatattat taactacgaa ggtttgaagt 2341 tgagaaagta caagggtaac tttaccgaat tcgttaagaa gtgtccagct gctaaggctt 2401 acgaagaatt atccaacact gatttggaat tcaagttccc agaaccaggt tacttggaag 2461 gtgttaagac taagcaaaag gctattgtca aggttaccaa catggaattc caatatccag 2521 gtacctctaa gccacaaatc actgacatta acttccaatg ttctttgtct tccagaattg 2581 ctgtcattgg tccaaatggt gctggtaagt ctactttgat taacgtcttg actggtgaac 2641 tattaccaac ctctggtgaa gtctacaccc acgaaaattg tcgtatcgct tacattaagc 2701 aacacgcttt tgctcatatc gaatctcatt tggacaagac tccatctgaa tatatccaat 2761 ggagattcca aaccggtgaa gatagagaaa ccatggacag agctaacaga caaatcaacg 2821 aaaacgatgc tgaagctatg aacaagatct tcaagattga aggtacccct agaagaattg 2881 ccggtatcca ctccagaaga aagttcaaga acacttacga atatgaatgt tctttcttat 2941 tgggtgaaaa cattggtatg aaatctgaaa gatgggttcc aatgatgtcc gtcgacaacg 3001 cttggattcc aagaggtgaa ttggttgaat ctcactctaa gatggttgct gaagttgata 3061 tgaaggaagc tttggcttct ggtcaattcc gtccattaac cagaaaagaa attgaagaac 3121 attgttccat gttgggtttg gacccagaaa ttgtttctca ctccagaatt agaggtttgt 3181 ctggtggtca aaaggttaag ttggtcttag ctgccggtac atggcaaaga cctcacttga 3241 ttgtcttaga tgaacctacc aactatctgg acagagattc tttgggtgct ttgtctaagg 3301 ctttgaagga atttgaaggt ggtgttatta tcattactca ctctgctgaa ttcacaaaga 3361 acttgactga agaagtctgg gccgtcaagg acggtagaat gactccatct ggtcacaact 3421 gggttagtgg tcaaggtgct ggtccaagaa tcgaaaagaa ggaagacgaa gaagataaat 3481 tcgatgctat gggtaacaag attgccggtg gtaagaagaa gaagaagttg tcttctgcgg 3541 aattgagaaa gaagaagaag gaaagaatga agaagaagaa ggaattgggt gatgcttacg 3601 tttcttctga cgaagaattc taatcttttt gatcactgct ttcacagttt tctttaagat 3661 ttttattgat caataattta tgtatatttt aatttctatg tttttgtaat attgtttatt 3721 ttggtaaaat atagacgcaa cttccttatt ataaagaaag gcattattta aaagaaaaag 3781 cgttccatta gtcagacatc tttttttttt tcatacattc ttaagcctca ggcaaattga 3841 gcattgcctc ataccctttt cggtaagagg gtaacgaaaa tatttttttg gaagaataaa 3901 aataggtgac ggatcataga ctaggaagct ttaaaacatg attgagcgta atattatatt 3961 ccttctaga // LOCUS HUMKSAA 1504 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human adenocarcinoma-associated antigen (KSA) mRNA, complete cds. ACCESSION M32325 KEYWORDS adenocarcinoma-associated antigen. SOURCE Human cell line UCLA-P3, cDNA to mRNA, clone AG[1,1338,933]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1504) AUTHORS Strnad,J., Hamilton,A.E., Beavers,L.S., Gamboa,G.C., Apelgren,L.D., Taber,L.D., Sportsman,J.R., Bumol,T.F., Sharp,J.D. and Gadski,R.A. TITLE Molecular cloning and characterization of a human adenocarcinoma/epithelial cell surface antigen complementary DNA JOURNAL Cancer Res. 49, 314-317 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D. Sharp, 22-EB-1990. FEATURES from to/span description pept 155 1099 adenocarcinoma-associated antigen precursor (KSA) sigp 155 223 adenocarcinoma-associated antigen signal peptide matp 398 1096 adenocarcinoma-associated antigen mRNA < 1 1504 adenocarcinoma-associated antigen mRNA site 1486 1491 polyadenylation site BASE COUNT 442 a 302 c 356 g 404 t ORIGIN 1 gagcgagcac cttcgacgcg gtccggggac cccctcgtcg ctgtcctccc gacgcggacc 61 cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt gtcccactcc cggcgcacgc 121 cctcccgcgc ccctcttctc ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt 181 cgggcttctg cttgccgcgg cgacggcgac ttttgccgca gctcaggaag aatgtgtctg 241 tgaaaactac aagctggccg taaactgctt tgtgaataat aatcgtcaat gccagtgtac 301 ttcagttggt gcacaaaata ctgtcatttg ctcaaagctg gctgccaaat gtttggtgat 361 gaaggcagaa atgaatggct caaaacttgg gagaagagca aaacctgaag gggccctcca 421 gaacaatgat gggctttatg atcctgactg cgatgagagc gggctcttta aggccaagca 481 gtgcaacggc acctccacgt gctggtgtgt gaacactgct ggggtcagaa gaacagacaa 541 ggacactgaa ataacctgct ctgagcgagt gagaacctac tggatcatca ttgaactaaa 601 acacaaagca agagaaaaac cttatgatag taaaagtttg cggactgcac ttcagaagga 661 gatcacaacg cgttatcaac tggatccaaa atttatcacg agtattttgt atgagaataa 721 tgttatcact attgatctgg ttcaaaattc ttctcaaaaa actcagaatg atgtggacat 781 agctgatgtg gcttattatt ttgaaaaaga tgttaaaggt gaatccttgt ttcattctaa 841 gaaaatggac ctgacagtaa atggggaaca actggatctg gatcctggtc aaactttaat 901 ttattatgtt gatgaaaaag cacctgaatt ctcaatgcag ggtctaaaag ctggtgttat 961 tgctgttatt gtggttgtgg tgatggcagt tgttgctgga attgttgtgc tggttatttc 1021 cagaaagaag agaatggcaa agtatgagaa ggctgagata aaggagatgg gtgagatgca 1081 tagggaactc aatgcataac tatataattt gaagattata gaagaaggga aatagcaaat 1141 ggacacaaat tacaaatgtg tgtgcgtggg acgaagacat ctttgaaggt catgagtttg 1201 ttagtttaac atcatatatt tgtaatagtg aaacctgtac tcaaaatata agcagcttga 1261 aactggcttt accaatcttg aaatttgacc acaagtgtct tatatatgca gatctaatgt 1321 aaaatccaga acttggactc catcgttaaa attatttatg tgtaacattc aaatgtgtgc 1381 attaaatatg cttccacagt aaaatctgaa aaactgattt gtgattgaaa gctgcctttc 1441 tatttacttg agtcttgtac atacatactt ttttatgagc tatgaaataa aacattttaa 1501 actg // LOCUS HAMHITAO1 1581 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Hamster G-o protein alpha subunit 1 (GNAO I) mRNA, complete cds. ACCESSION M33661 J05476 KEYWORDS G-o protein alpha subunit. SOURCE Hamster insulinoma cell line HIT T-125 M2.2.2, cDNA to mRNA. ORGANISM Cricetulus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1581) AUTHORS Hsu,W.H., Rudolph,U., Sanford,J., Bertrand,P., Olate,J., Nelson,C., Moss,L.G., Boyd,A.E.III., Codina,J. and Birnbaumer,L. TITLE Molecular cloning of a novel splice variant of the alpha subunit of the mammalian G-o protein JOURNAL J. Biol. Chem. 265, 11220-11226 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Birnbaumer, 10-APR-1990, for release after publication. FEATURES from to/span description pept 458 1522 G-o protein alpha subunit 1 BASE COUNT 357 a 472 c 405 g 347 t ORIGIN 1 gcctccgcct cctcctcctc cgcctccggc agccgcggca gaaggaccca ccctgccccc 61 caccccaccc tccgccggct ccggctgcgg atccagcctc tactctcatt ttatttattt 121 tgggtcgtgc actagcctca gtgcctgcag cccgcgcctc ccgggcccgc gggcacctcc 181 tccctcggct ccggaggccc agaccccggc caccctcacc tcgacaccct cagaccccgg 241 ccagtcgccg ctagtctccg ccgctggaat cttgatagag gctgtcattt tttgggggtt 301 ctggcctttt cgacaatttt gttcccagcc aaggagagga tatcgtgatt ttctcccctt 361 tgagcccagg ctctgctctg tgggggggtg gggggcgcgc cgacccgggg agtcgtgcca 421 gccgagtcgt gcgggctgtg gcagggaagg ggccaccatg ggatgtactc tgagcgcaga 481 ggagagagcc gccctcgagc ggagcaaggc gattgagaaa aacctcaaag aagatggcat 541 cagcgccgcc aaagatgtga aattactcct gctgggggct ggagaatcag gaaaaagcac 601 cattgtgaag cagatgaaga tcatccatga agatggcttc tctggagagg acgtaaagca 661 gtacaagcct gttgtctata gcaacaccat ccagtctctg gcagccatcg tccgggccat 721 ggatactttg ggtgtggagt atggtgacaa ggagagaaag gcggactcca agatggtgtg 781 tgatgtggtg agccgtatgg aagacactga acccttctct gcagagctgc tttctgccat 841 gatgcgactc tggggtgact cagggatcca ggagtgcttc aaccgatctc gggagtatca 901 gctcaatgat tctgccaaat actacctgga cagcctggat cggattggag ccgctgacta 961 ccagcccact gagcaggaca tcctccgaac cagggtcaaa accaccggca tcgtagaaac 1021 ccacttcaca ttcaagaacc tccacttcag gctgtttgat gttgggggcc agcgatctga 1081 acgcaagaag tggatccact gctttgagga tgtcacggcc atcatcttct gtgtcgccct 1141 cagcggctat gaccaggtgc tccacgagga cgaaaccacg aaccgcatgc acgagtctct 1201 catgctcttc gactccatct gtaacaacaa gtttttcatc gatacctcca tcattctctt 1261 cctcaacaag aaagacctct ttggcgagaa gattaagaag tcacccttga ccatctgctt 1321 ccctgaatac ccaggctcca acacctatga agatgcagct gcctacatcc aaacacagtt 1381 tgaaagcaaa aaccgctcac ctaacaaaga aatctattgt cacatgactt gtgccacaga 1441 cacgaataat atccaggtgg tattcgacgc tgtcaccgac atcatcattg ccaacaatct 1501 ccggggctgc ggcttgtact gacctcttgt cctgtatagc aacctatttg actgcttcat 1561 ggactctttg ctgttgatgt t // LOCUS HAMHITAO2 1678 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Hamster G-o protein alpha subunit 2 mRNA, complete cds. ACCESSION M33662 J05476 KEYWORDS G-o protein alpha subunit. SOURCE Hamster insulinoma cell line HIT T-125 M2.2.2, cDNA to mRNA. ORGANISM Cricetulus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1678) AUTHORS Hsu,W.H., Rudolph,U., Sanford,J., Bertrand,P., Olate,J., Nelson,C., Moss,L.G., Boyd,A.E.III., Codina,J. and Birnbaumer,L. TITLE Molecular cloning of a novel splice variant of the alpha subunit of the mammalian G-o protein JOURNAL J. Biol. Chem. 265, 11220-11226 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Birnbaumer, 10-APR-1990, for release after publication. FEATURES from to/span description pept 458 1522 G-o protein alpha subunit 2 BASE COUNT 374 a 514 c 448 g 342 t ORIGIN 1 gcctccgcct cctcctcctc cgcctccggc agccgcggca gaaggaccca ccctgccccc 61 caccccaccc tccgccggct ccggctgcgg atccagcctc tactctcatt ttatttattt 121 tgggtcgtgc actagcctca gtgcctgcag cccgcgcctc ccgggcccgc gggcacctcc 181 tccctcggct ccggaggccc agaccccggc caccctcacc tcgacaccct cagaccccgg 241 ccagtcgccg ctagtctccg ccgctggaat cttgatagag gctgtcattt tttgggggtt 301 ctggcctttt cgacaatttt gttcccagcc aaggagagga tatcgtgatt ttctcccctt 361 tgagcccagg ctctgctctg tgggggggtg gggggcgcgc cgacccgggg agtcgtgcca 421 gccgagtcgt gcgggctgtg gcagggaagg ggccaccatg ggatgtactc tgagcgcaga 481 ggagagagcc gccctcgagc ggagcaaggc gattgagaaa aacctcaaag aagatggcat 541 cagcgccgcc aaagatgtga aattactcct gctgggggct ggagaatcag gaaaaagcac 601 cattgtgaag cagatgaaga tcatccatga agatggcttc tctggagagg acgtaaagca 661 gtacaagcct gttgtctata gcaacaccat ccagtctctg gcagccatcg tccgggccat 721 ggatactttg ggtgtggagt atggtgacaa ggagagaaag gcggactcca agatggtgtg 781 tgatgtggtg agccgtatgg aagacactga acccttctct gcagagctgc tttctgccat 841 gatgcgactc tggggtgact cagggatcca ggagtgcttc aaccgatctc gggagtatca 901 gctcaatgat tctgccaaat actacctgga cagcctggat cggattggag ccgctgacta 961 ccagcccact gagcaggaca tcctccgaac cagggtcaaa accaccggca tcgtagaaac 1021 ccacttcaca ttcaagaacc tccacttcag gctgtttgat gttgggggcc agcgatctga 1081 acgcaagaag tggatccact gctttgagga tgtcacggcc atcatcttct gtgtcgccct 1141 cagcggctat gaccaggtgc tccacgagga cgaaaccacg aaccgaatgc atgagtccct 1201 gaagcttttt gacagcatct gcaacaacaa gtggttcaca gacacgtcta tcatcctgtt 1261 tctcaacaag aaggacatat ttgaggagaa gatcaccagg tccccactga ccatctgctt 1321 tcctgaatac acaggcccca gtgccttcac agaagctgtg gctcacatcc aggggcagta 1381 tgagagtaag aacaaatcag ctcacaagga aatctacacc catttcacct gtgccaccga 1441 caccaacaac atccagtttg tctttgatgc tgtgacggac gtcatcatcg ccaaaaatct 1501 gcggggctgt ggactctact gagccctggc ctcctgccca gcctgctgct cacttctccc 1561 ctggacccag agctctgcta ccactcgatg ctctgtgcac tgagggaaaa cccagaggct 1621 ggccctgggg gaggaggaag caccctctga gcatccccac cctaccccac tccagagg // LOCUS HUMPEPD1 610 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human peptidase D gene, exon 1. ACCESSION M36549 J05513 KEYWORDS peptidase D; prolidase. SEGMENT 1 of 2 SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 610) AUTHORS Tanoue,A., Endo,F. and Matsuda,I. TITLE Structural organization of the gene for human prolidase (peptidase D) and demonstration of a partial gene deletion in a patient with prolidase deficiency JOURNAL J. Biol. Chem. 265, 11306-11311 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by F.Endo, 23-MAY-1990. FEATURES from to/span description pept 372 / 388 peptidase D, exon 1 (EC 3.4.13.9) /hgml_locus_uid="LF0121K" /nomgen="PEPD" /map="19q12-q13.2" pre-msg 241 > 610 PEPD mRNA and introns IVS 389 > 610 PEPD intron A BASE COUNT 96 a 225 c 185 g 104 t ORIGIN 1 gtggtgtacg tgtacgtaag atcggggact gaacggagaa tcaagatgtc ctctgcttca 61 ggacctccct cccaccacga aaaggctggc agatgcagat gctgcccgct aacccgacct 121 actgtctgca gccggcagcc cgctcattta tcctggccga gctcttgggc ccccaatctc 181 caggagcccc gccccatcgg ctccgccctt ccccaggtac cgccccttcc cgcccgtgga 241 atttggctct ggaggaggtg ggtcaaggac tccatgcccc gccccctgcc ccgcccttac 301 cccgccccat gcattggcac cggaggggct catggacgcg cgcacttcag tgacgccggt 361 gccgggcgaa catggcggcg gccaccgggt gagtgctgcg cctccctcgc tctttcccgc 421 tggcgtggag aggggcatgg gtgtcccggc cggaccgcgg gtgagagacc gtcaccgggt 481 cgcgtgaatg ccgcgggtag gcgcctcttc ccgttggccc ggaagtccca gcgccggtca 541 ggcgagcctg cacgcgtccg agtcaaaggt caacccgagc aggacccgac tacctctcct 601 aacctctcca // LOCUS HUMPEPD2 530 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human peptidase D gene, 3' end. ACCESSION M35496 J05513 KEYWORDS peptidase D; prolidase. SEGMENT 2 of 2 SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 530) AUTHORS Tanoue,A., Endo,F. and Matsuda,I. TITLE Structural organization of the gene for human prolidase (peptidase D) and demonstration of a partial gene deletion in a patient with prolidase deficiency JOURNAL J. Biol. Chem. 265, 11306-11311 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by F.Endo, 23-MAY-1990. FEATURES from to/span description pept / 1 3 peptidase D, exon 15 (EC 3.4.13.9) /hgml_locus_uid="LF0121K" /nomgen="PEPD" /map="19q12-q13.2" pre-msg < 1 394 PEPD mRNA and introns signal 373 378 poly-A signal BASE COUNT 133 a 132 c 114 g 151 t ORIGIN About 110 kb after segment 1. 1 tagagccagc cagaaatccc agcgcacctg ggggcctggc cttgcaacct cttttcgtga 61 tgggcagcct gctggtcagc actccagtag cgagagacgg cacccagaat cagatcccag 121 cttcggcatt tgatcagacc aaacagtgct gtttcccggg gaggaaacac ttttttaatt 181 acccttttgc aggcaccacc tttaatctgt tttatacctt gcttattaaa tgagcgactt 241 aaaatgattg aaaataatgc tgttctttag tagcaactaa aatgtgtctt gctgtcattt 301 atattccttt tcccaggaaa gaagcatttc tgatactttc tgtcaaaaat caatatgcag 361 aatggcattt gcaataaaag gtttcctaaa atggtccttt gtgcgtcggc ttcccctttc 421 attcctttgc agatgagcat ttgggacccc gtccgacccc tgaaaagatc ggctcacctg 481 gtgtcagtat cagtcattcc aggaagatgg ctcgtggcag cacaggccct // LOCUS SYNPMAMA 4354 bp ds-DNA SYN 24-JUL-1990 DEFINITION Artificial cloning vector pMAMPF. ACCESSION M33637 KEYWORDS . SOURCE Synthetic DNA from E.coli cell line WK6-lambd-pMAMPF3, from pMA58. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 4354) AUTHORS Szardenings,M. and Collins,J. TITLE A plasmid optimized for protein design projects: pMAMPF JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Collins, 09-APR-1990. FEATURES from to/span description signal 4121 4183 OmpA signal sequence site 4181 4186 NaeI cloning site site 4184 4308 Stuffer (pBR325:5900 to 29) BASE COUNT 1152 a 1054 c 1012 g 1136 t ORIGIN 1 aattcacctc gaaagcaagc tgataaaccg atacaattaa aggctccttt tggagccttt 61 ttttttggag attttcaacg tgaaaaaatt attattcgca attccaagct aattcacctc 121 gaaagcaagc tgataaaccg atacaattaa aggctccttt tggagccttt ttttttggag 181 attttcaacg tgaaaaaatt attattcgca attccaagct ctgcctcgcg cgtttcggtg 241 atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag 301 cggatgcaga tcacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc 361 gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt 421 cctttctcgc cacgttcgcc agctttcccc gtcaagctct aaatcggggg ctccctttag 481 ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag ggtgatggtt 541 cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt 601 tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc tcggtctatt 661 cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt 721 aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt tacaatttga tctgcgctcg 781 gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca 841 gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac 901 cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac 961 aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg 1021 tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac 1081 ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat 1141 ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag 1201 cccgaccgct gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac 1261 ttatcgccac tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt 1321 gctacagagt tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt 1381 atctgcgctc tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc 1441 aaacaaacca ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga 1501 aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac 1561 gaaaactcac gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc 1621 cttttaaatt aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct 1681 gacagttacc aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca 1741 tccatagttg cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct 1801 ggccccagtg ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca 1861 ataaaccagc cagccggaag ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc 1921 atccagtcta ttaattgttg ccgggaagct agagtaagta gttcgccagt taatagtttg 1981 cgcaacgttg ttgccattgc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct 2041 tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 2101 aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 2161 tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 2221 ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 2281 agttgctctt gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa 2341 gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 2401 agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 2461 accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 2521 gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcagacag 2581 ttttattgtt catgatgata tatttttatc ttgtgcaatg taacatcaga gattttgaga 2641 cacaacgtgg ctttgttgaa taaatcgaac ttttgctgag ttgactcccc gcgcgcgatg 2701 ggtcgaattt gctttcgaaa aaaaagcccg ctcattaggc gggctaaaaa aaagcccgct 2761 cattaggcgg gctcgaattt ctgccattca tccgcttatt atcacttatt caggcgtagc 2821 aaccaggcgt ttaagggcac caataactgc cttaaaaaaa ttacgccccg ccctgccact 2881 catcgcagta ctgttgtaat tcattaagca ttctgccgac atggaagcca tcacagacgg 2941 catgatgaac ctgaatcgcc agcggcatca gcaccttgtc gccttgcgta taatatttgc 3001 ccatagtgaa aacgggggcg aagaagttgt ccatattcgc cacgtttaaa tcaaaactgg 3061 tgaaactcac ccagggattg gctgagacga aaaacatatt ctcaataaac cctttaggga 3121 aataggccag gttttcaccg taacacgcca catcttgcga atatatgtgt agaaactgcc 3181 ggaaatcgtc gtggtattca ctccagagcg atgaaaacgt ttcagtttgc tcatggaaaa 3241 cggtgtaaca agggtgaaca ctatcccata tcaccagctc accgtctttc attgccatac 3301 gaaattccgg atgagcattc atcaggcggg caagaatgtg aataaaggcc ggataaaact 3361 tgtgcttatt tttctttacg gtctttaaaa aggccgtaat atccagctga acggtctggt 3421 tataggtaca ttgagcaact gactgaaatg cctcaaaatg ttctttacga tgccattggg 3481 atatatcaac ggtggtatat ccagtgattt ttttctccat tttagcttcc ttagctcctg 3541 aaaatctcga taactcaaaa aatacgcccg gtagtgatct tatttcatta tggtgaaagt 3601 tggaacctct tacgtgccga tcaacgtctc attttcgcca aaagttggcc cagggcttcc 3661 cggtatcaac agggacacca ggatttattt attctgcgaa gtgatcttcc gtcacaggta 3721 tttattcgaa gacgaaaggg catcgcgcgc ggggaattgg ccacgatgcg tccggcgtag 3781 aggatctctc acctaccaaa caatgccccc ctgcaaaaaa taaattcata taaaaaacat 3841 atagataacc atctgcggtg ataaattatc tctggcggtg ttgacataaa taccactggc 3901 ggtgatactg agcacatcag caggacgcac tgaccaccat gaaggtgacg ctcttaaaat 3961 taagccctga agaagggcag cattcaaagc agaaggcttt ggggtgtgtg atacgaaacg 4021 aagcattgga attctacaac ttgcttggat tcctacaaag aagcagcaat tttcagtgtc 4081 agaagtcgac caaggaggtc tagataacga ggcgcaaaaa atgaaaaaga cagctatcgc 4141 gattgcagtg gcactggctg gtttcgctac cgtagcgcag gccggcgata atggcctgct 4201 tctcgccgaa acgtttggtg gcgggaccag tgacgaaggc ttgagcgagg gcgtgcaaga 4261 ttccgaatac cgcaagcgac aggccatgtt tgacagctta tcatcgataa gcttggatcc 4321 gtcgacctgc agccaagctt ggtctagagg tcga // LOCUS DSBRR1610 1376 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.autotrophicum 16S ribosomal rRNA. ACCESSION M34409 KEYWORDS ribosomal 16S rRNA. SOURCE D.autotrophicum (strain DSM 3382) rRNA. ORGANISM Desulfobacterium autotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1376) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1376 16S rRNA BASE COUNT 340 a 287 c 399 g 293 t 57 others ORIGIN 1 aactggagag tttgattctg gctcagaatg aacgctggcg gcgtgcttaa cacatgcaag 61 tcgaacgaga aagggattgc ttgcaatcct gagtagagtg gcgcacgggt gagtaacacg 121 tggataatct acctttgaat ccgggataac tgttcgaaag aatcgctaat accggatgaa 181 gtcagtnttt ccttggaagg attgatgaaa gacagcctct tcttgaaagc tnttgtttga 241 agatgagtcc gcgcaccatt agttngttgg tggggtaacg gcctaccaag acnttnatgg 301 ttagctggtc tgagaggatg atcagccacn ctggaactga cacacggtcc ngactcctnc 361 gggaggcagc agtgaggaat tttgcgcaat gggngcnacc ctnacgcagc aacgccgcgt 421 gagtgaagaa ggcctntggg tcgtaaagct ctgtcaatgg ggaagaagtt rcttcatatt 481 aatrgtatgt ngtattgacg gtacccatnn nngaagcacc ggcnnactcc gtgccagcan 541 nnncggtaat anggagngtg cnagcgtnat tcggaatnat tnggngtnaa gggcacgcag 601 gcggtnttgc cagtcagatg tgaaagcccg gggctcaacc ctggaagtgc atttgaaact 661 acaaggcttg agtacggtng aggaaagggg nattcctngt gtagaggtga aattcgtaga 721 tatcaggagg aacaccggtg gcgaaggcgc cnntctggac csnnactgac gctnaggtnc 781 naaggcgtgg gnagcgaaca ggattagata ccctggtagt ccacgcagta aacgttgtat 841 actaggtgta gcgggntatt gacccntgct gtgccggagc taacgcatta agtataccgc 901 ctgggaagta cggtcgcaag actaaaactc aaagaaattg acgggggccc gcacaagcgg 961 tggagcatgt ggtttaattc ganncaacgc gcagaacctt acccagactt gacatcctga 1021 gaatctgtta gagatattag agtgccttcg ggaactcaga gacaggtgct gcatggctgt 1081 cgtcagctcg tgtcgtgaga tgttcggtta agtccggcaa cgagcgcaac ccctatcttc 1141 agttgccagc acatcatggt gggaactctg aagagactgc cccggttaac ggggaggaag 1201 gnggggatga cgtcaagtcc tcatggccct tatgtctggg gctacacacg tgntacaatg 1261 gtatgtacna cgggcagcga acctgcnnag gtgngcgaat ccctnaaagc atatcccagt 1321 ccggattgga gtctgcaact cgactccatg aagttggaat cgctagtaat cgtgga // LOCUS DSBRR1677 1375 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.niacini 16S ribosomal rRNA. ACCESSION M34406 KEYWORDS ribosomal 16S rRNA. SOURCE D.niacini (strain DSM 2650) rRNA. ORGANISM Desulfobacterium niacini Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1375) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1375 16S rRNA BASE COUNT 292 a 252 c 340 g 256 t 235 others ORIGIN 1 taactggaga gtttgattct ggctcagaat gaacgctggc ggcgtgctta acacatgcaa 61 gtcgaacgag aaagtgcctg cttgcaggaa cgagtagagt ggcgcacggg tgagtaacac 121 gtggataatc taccttcaaa tcgggaataa ccattcgaaa ggatggctaa tgccggataa 181 agtcagatrt tctttggagt atttgatgaa agggtgcctc tgattgaaag catctgtttg 241 aagatgagtc cgcgcaccat tagtttgttg gtggggtaac ggcctaccaa gactncgatg 301 gtnagctngt ctgagaggat gatcagccac actggaactg acacacggtc crgrctccta 361 cgggaggcag cagtgaggaa ttttgcgcaa tgggggcaac cctnacgcag caacgcngcg 421 tgagtgaaga aggccnnngg gtcgtaaagc tntgtccatg gggaagaaat tatttagtgt 481 tratrgcatn ntttattgac nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttcctgg tgtagaggtg aaattcgtag 721 atatcaggag gaacaccggt ggcgaaggcg ccngtctgga ccgatactga cgctsaggtg 781 cgaaggcgtg ggnagcgaac gggattagat rccccggtag tccacgcagt aaacgttgta 841 tgctaggtgt agcgggtatt gcccntgctn cgccgcagtt aacacattaa gcataccgcc 901 tnnnnngtac ggtcgcaaga ctaaaactca aagaaattga cgggggcccg cacaagcggt 961 ggagcatgtg gtttrattcg acnnaacgcg aagaacctta cccagacttg acatcttgag 1021 aatcctttcg aaagttagga gtgccttcgg nngctcaaag acagnngctg catggctgtc 1081 gtcagctcgt gtcgtgagat gtttggttaa gtccagcaac gagcgcaacc cttgtcttca 1141 gttgccagcg cgtaatggcg gaactctgaa gagactgccc cggtnaacgg ggaggaaggt 1201 ggggatgacg tcaagtcctc atggcccnna tgtctgggct acrcrcgtgc tacaatggta 1261 tgtacaacgg gcagcaangc agcgatgcca agcaaatccc ttaaagcata tcccrgtccg 1321 gattggagtc tgcractcga ctccatgaag ttggaatcgc tagtaatcgt ggatc // LOCUS DSBRR169 1383 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.vacuolatum 16S ribosomal rRNA. ACCESSION M34408 KEYWORDS ribosomal 16S rRNA. SOURCE D.vacuolatum (strain DSM 3385) rRNA. ORGANISM Desulfobacterium vacuolatum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1383) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1383 16S rRNA BASE COUNT 345 a 295 c 411 g 294 t 38 others ORIGIN 1 ttctggctca gratnaacgc tggcggngtg crnaacacat gcaagtcgaa cgrgraagtt 61 sctgcttgca ggaacgagga gagtggcgct cgggngagta actcgtggat aatctacctt 121 caaatcggga ataaccattc gaaaggatgg ctaatgccgg ataaagtcag atrttctttg 181 grrtntttga tgaaagggag cctctgnata taagctnttg tttgaagatg agtccgcgca 241 ccattagttt gttggtgggg taacggccta ccaagactgt gatggtnagc tggtctgaga 301 ggatgatcag ccacactgga actgacacac ggtccagact cctacgggag gcagcagtga 361 ggaattttgc gcaatggggg aaaccctgac gcagcaacgc cgcgtgagtg aagaaggccc 421 ttgggtcgta aagctctgtc acagggaaga aattattatg ctttaataga gtgtrgtatt 481 gacggtaccc tgtgaagaag caccggctaa ctccgtgcca gcagccgcgg taatacgggg 541 ggtgcaagcg ttattcggaa ttattnggcg taaagggcac gcaggcggtc ctgtcagtca 601 gatgtgaaag ccggggctca accccggacg tgcatttgaa acagcaggnc ttgagtacgg 661 nngaggaaag ggnnnttcct ggtgtagagg tgaaattcgt agatatcagg aggaacaccg 721 gtggcgaagg cgccnntctr gaccgatact gacgctnagg tacnaaggcg tgggtagcga 781 acgggattag ataccccggt agtccacgca gtaaacgttg tatgctaggt gtagcrgggt 841 attgacccct gctgtgccgc agttaacaca ttaagcatac cgcctgggga gtacggtcgc 901 aagactnaaa ctnaaagaaa ttgacggggg cccgcacaag cggtggagca tgtggtttaa 961 ttcgacgnaa cgcgaagaac cttacccaga cttgacatct tgagaatcct ntagaaattt 1021 aggagtgcct tcgggagctc aaagacaggt gctccatggc tgtcgtcagc tcgtgtcgtg 1081 agatgtttgg ttaagtccag caacgagcgc aacccttgtc ttcagttgcc agcacataat 1141 ggtgggaact ctgaagagac tgccccggtt aacggggagg aaggtggggn tgacgtcaag 1201 tcctcatggc cctnatgtct ggggctacac acgtgctaca atggtatgta caacgggcrg 1261 caaggcagcg atgccaagca aatcccttaa agcatatccc agtccggatt ggagtctgcg 1321 actcgactcc atgaagttgg aatcgctagt aatcgtggat cagcatgccn cggtgaatac 1381 gtt // LOCUS DSFRR166 1372 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.multivorans 16S ribosomal rRNA. ACCESSION M34405 KEYWORDS ribosomal 16S rRNA. SOURCE D.multivorans (ATCC 33890) rRNA. ORGANISM Desulfococcus multivorans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1372) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1372 16S rRNA BASE COUNT 329 a 295 c 429 g 291 t 28 others ORIGIN 1 taattgacga gtttgatcct ggctcagaat gaacgctggc ggcgtgctta acacatgcaa 61 gtcgcacgag aacgcttctg cttgcagggg ctagtaagtg gcgcacgggt gagtaacgcg 121 tgggtaatct acccttggat tcgggataac ccttcgaaag gagtgctaat accggatgaa 181 gtcatttagg ctgtggttta gatgatgaaa ggtggcctct ccgaggaagc taccgtttgg 241 ggatgagccc gcgtaccatt agcttgttgg tggggtaacg gcctaccaag gcgacgatgg 301 ttagctngtc tgagaggatg atcagccaca ctggaactga cacacggtcn ngnctcctac 361 gggaggcagc agtgaggaat tttgcnnnnn gggggnaacc ntnacgcagc aacgccgcgt 421 gagtgaagaa ggccttcggg tcgtaaagct ctgtcagtgg gaagaacctt cgtatgttga 481 atatgcatac ggactgacgg taccacagaa ggaagcaccg gctaactccg tgccagcagc 541 cgcggtaata cggagggtgc aagcgttatt cggaattatt gggcgtaaag agcgcgtagg 601 cggctttgca agtcaggtgt gaaatccggg gctnaacccc ggaagagcat ttgatactgt 661 ggagcttgag tatgggagag gnnagtggna ttcctggtgt agcggtgaaa tgcgtagata 721 tcaggaggaa caccggtggc gaaggcggyt tcctggacca atactgacgc tgaggcgcga 781 aggcgtgggg agcaaacagg attagatacc ctngtagtcc acgcagtaaa cgttgatcac 841 taggtgtagc gggtattgac ccctgctgtg ccggagttaa cgcattaagt gatccgcctg 901 gggagtacga tcgcaagatt aaaactcaaa ggaattgacg ggggcccgca caagcggtgg 961 agcatgtggt ttaattcgan nnaacgcgaa gaaccttacc tggatttgac atccgcggrr 1021 ttcttatgaa aatagggagt gccttcgggg agccgcgaga caggtgctgc atggctgtcg 1081 tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ctrtcttcag 1141 ttaccatcat taagttgggg actctgaaga tactgccccg gttaacggga ggaaggtggg 1201 gatgacgtca agtcctcatg gcctttatgt ccagggctac acacgtgcta caatgggctg 1261 tacnnagggn agcgatctcg cgagagcaag ccaatcccaa aaagcagtcc ttagttcggr 1321 ttggagtctg caactcgact ccatgaagct ggaatcgcta gtaatcgcgg at // LOCUS DSLRR165 1381 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.pigra 16S ribosomal rRNA. ACCESSION M34404 KEYWORDS ribosomal 16S rRNA. SOURCE D.pigra (ATCC 29098) rRNA. ORGANISM Desulfomonas pigra Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1381) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1381 16S rRNA BASE COUNT 322 a 294 c 388 g 250 t 127 others ORIGIN 1 gaactggaga gtttgattct ggctcagatt gaacgctggc ggcgtgctta acacatgcaa 61 gtcgtacgcg aaangggact tcggtcccga gtaaagtggc gcgacgggtg agtaacgacg 121 tggataatct gcctctatga tgggnataac rgttggaaac gactgctaat accgaatacg 181 ctcatgatga ncttgtggaa aggtggcctn tgcttgcaag ctatcgcata gagatgagtc 241 cgcgtcccat tagctagttg gtggggtaac ggcctaccaa ggcaacgatg ggtagccgat 301 ctgagaggat gatcggccac actggaactg aaacacggtc cagactccta cgggaggcag 361 cagtggggaa tattgcgcaa tgggcgaaag cctgacgcag cgacgccgcg tgagggatga 421 aggtcttcgg atcgtaaacc tctgtcagaa gggaagaaac tagggtgctc taatcatcat 481 cctaactgac ggtaccttca aaggaagcac cggctaactc cgtgccagca gccgcggtaa 541 tacggagggt gcaagcgtta atcggaatca ctgggcgtaa agcgcacgta ggctnttatg 601 taagtcaggg gtgaaagccc acggctnaac cgtggaactg cccttgatac tgcacggctn 661 gaatnnnnnn nnnnggtncg gnattccagg tgtaggagtg aaatccgtag atatctggag 721 gaacatcagt ggcgaaggcg gcnacctnga cctngaccgn nattgacgct naggtgcnna 781 agcgtgggga gcaaacagga ttagataccc tngtngtcca cgccgtaaac gatggatgct 841 agatgtcggg atgtannnct cggtgtcgta gtnaacgcgt naagcatccg gcctggggag 901 tacggnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn nnnnnnnaac gcgaagaacc ttacctaggt ttgacatctg gggaaccctc 1021 ccgaarrtga ggrrtrrcct tcggggagcc ccaagacagg tgctgcatgg ctgtcgtcag 1081 ctnnnnnnnt gagatgttgg gttaagtccc gcaacgagcg caacccctat gcatagttgc 1141 cagcaagtaa agttgggcac tctatgcaga ctgnncgggt tracagggga ggaaggtggg 1201 gacgacgtca agtcatnatg gcccttacgc ctagggctac anacgtacta cnatggcacg 1261 cacaaagggc agcgataccg tgagatggag ccaatcccaa aaaacgtgtc ccagtccgga 1321 ttgcagtctg caactcgact gcatgaagtc ggaatcgcta gtaattcgag gtcagcatnc 1381 c // LOCUS DSORR1618 1402 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfotomaculum orientis 16S ribosomal rRNA. ACCESSION M34417 KEYWORDS ribosomal 16S rRNA. SOURCE D.orientis (strain Singapore I, NCIMB 8382) rRNA. ORGANISM Desulfotomaculum orientis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1402) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1402 16S rRNA BASE COUNT 301 a 253 c 386 g 255 t 207 others ORIGIN 1 ttttttggag agtttgrtcc tggctcagga ngaacgctgg cggcgtgcct aacacatgca 61 agtcgaacgg agaattgaat aagcttgctt agacaattst tagtggcgga cgggtgagta 121 acgcgtgggt aacctgccca taaakccggg acaacccttg gaaacgaggg ctaataccgg 181 atatcttaga tyytggcatc aggayttaag gaaagatggc ctctgaacat gctatcgatt 241 atggatggac ccgcgtctga ttagctagtt ggtggggtaa aggcctacca aggcgacgat 301 cagtagccgg cctgagaggg tgaacggcca cactgggact gagacacggc cnngactcct 361 acgggaggca gcagtgggga atcttccgca atggacgaaa gtctgacgga gcaacgcngc 421 gtgtatgatg aaggtcttcg gattgtaaag tactgtcagt gtggggacgg tgctntttga 481 aaatattgag gagacatgac ggtacccaag gaggaagccc cggctaannn nnnnnnnnnn 541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncggaatta ctgggcgtta agcgcgcgtt 601 ggcggatgct taagtcckgt gtgaaagatc agggctcaac cctnagagtg cattcggtaa 661 tgggtattgn nnnnnnnnga gaggaaagtg gaattccacg tgtagcggtg aaatgcgtag 721 atatgtggag gaacaccagc ggcgaaggcg actttctgga ctgtaactga cgctgaggcg 781 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgag 841 tgctaggtgt agagggtatc nncacnttct gtgccgcagt taacgcgata agcactccgc 901 ctggggagta cggccgcaag gttnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1021 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngctg tgagacaggt ggtgcatggt 1081 tgtcgtcagc ttgtgtcgtg agatgttggg ttaagtcccg caacgagcgc aacccctgta 1141 tttagttgct acgagtaagg ttgagcnctc tagatagact gccggtgaca aaccggagga 1201 aggtggggat gacgtcaaat catnatgccc nttatgtcct gggctacaca cgtgctacaa 1261 tggccggtac agacggaagc gaagccgcga ggtgaagcca atccgagaaa gccggtctcr 1321 gttcggattg caggctgcaa ctcgcctgca tgaagtcgga atcgctagta atcgcaggtc 1381 agcatactgc ggtgaatacg tt // LOCUS DSORR1619 1368 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.ruminis 16S ribosomal rRNA. ACCESSION M34418 KEYWORDS ribosomal 16S rRNA. SOURCE D.ruminis (strain DL NCIMB 8452) rRNA. ORGANISM Desulfotomaculum ruminis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1368) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1368 16S rRNA BASE COUNT 341 a 284 c 420 g 267 t 56 others ORIGIN 1 ttatggagag tttrgtcctg gctcaggacg aacgctggcg gcgtgcttaa cacatgcaag 61 tcgaacggag tttaaaatga aacctngtga tttttraact tagtggcgga cgggtgagta 121 acgcgtggat aacctggcgg atagaccggg ataacagttg gaaacgactg ctaataccgg 181 atacgctcct aggaacgcat gttctaagga ggaaagattt atcgctaacc gatggatccg 241 cgtcccatta gctagttgga ggtgtaacgg acccctaagg cgacgatggg tagccggcct 301 gagagggcga acggccacac tggaactgag acacggcctn gactcctacg ggaggcagca 361 gtggggaata ttccgcaatg ggcgaaagcc tgagggagca acgcngcgtg agggaagaag 421 gctttcgggt tgtaaacctc tgtcttaagg gaagaacaaa atgacggtac cttgagagga 481 agccccggct annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnag cgttgtccgg 541 aattactggg cgtaaagggc gcgtaggtgg tgcattaagt tagaggtgaa agtgccgggc 601 nnaaccccgt gaggcctctg atactggtgg acttgagtgc aggagagggg agcagaattc 661 ccagtgtagc ggtgaaatgc gtagatattg ggaggaatac cagtggcgaa ggcggcnntc 721 tngactgtaa ctgacactga ggcgcgaaac cgtggggagc gaacaggatt agataccctg 781 gtagtccacg ccgtaaacga tgagtgctag gtgttggggg tatcgaccac ccccagtgcc 841 gtagttaaca caataagcac tccgcctggg gagtacggtc gcaagactga aactcaaagg 901 aattgacggg ggcccgcaca agcggtggag tatgtggttt aattcgacgn aacgcgaaga 961 accttaccag ggcttgacat ccgtctnaaa agtataggga tatactcttg ccttcggggn 1021 nacagagaga caggtggtgc atngttgtcg tcagctcgtg tcgtgagatg ttgggttaag 1081 tcccgcaacg agcgcaaccc ctracattcg ttgccagcga gtaatgtcgg gaactcgaat 1141 gtgactgccg ttgacaaaac ggaggaaggt ggggatgacg tcaaatcatc atgcccctta 1201 tgtcctgggc tacrcgcgta ctacnatggc cggtacagac ggaagcgaag ctgtgaagtg 1261 aagcaaatct gagaaagccg gtcccagttc ggattgtagt ctgcaactcg actacatggg 1321 gtcggaatcg ctagtaatcg caggtcagca tactgcggtn aatacgtt // LOCUS DSPRR1613 1390 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.hydrogenophilus 16S ribosomal rRNA. ACCESSION M34412 KEYWORDS ribosomal 16S rRNA. SOURCE D.hydrogenophilus (DSM 3380) rRNA. ORGANISM Desulfobacter hydrogenophilus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1390) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1390 16S rRNA BASE COUNT 354 a 297 c 416 g 292 t 31 others ORIGIN 1 actgagagtt tggacctggc tcagaatgaa cgctggcggc gtgcttaaca catgcaagtc 61 gaacgagaaa argattgctt gcaatcctga gtagagtggc gcrcgggtga gtaacrcgta 121 gataatctgc cttcaagcct ggaataacta ttcgaaaggg tagctaatac cggataaagt 181 cgatttacac nagtrgattg atgaaagatt gcctcttctt gaaagcaatt gtttggggat 241 gagtctgcgg accattagct tgttggtggg gtaaaggcct accaaggcag cgatggttag 301 ctggtctgag aggatgatca gccacactgg aactggaaca cggtccagac tcctacggga 361 ggcagcagtg aggaattttg cgcaatgggg gcaaccctga cgcagcaacg ccgcgtgagt 421 gaagaaggcc tttgggtcgt aaagctctgt caactgggaa gaaattagct tctattaata 481 gtagttgcta ttgacggtac cagtngacga agcgccggcn aactcngtgc cagcagcngc 541 ggtaacacgg ggggcgcnag cgtnattcgg aattattggg cgtaaagggc gcgcaggcgg 601 tnttgtccgt caggtgtgaa agctcggggc tcaaccccgg aagtgcactt gaaacagcaa 661 gacttgaata cgggagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc 721 aggaggaaca ccgatggcga aggcatctct ctrgaccgat attgacgctn aggcgcnaag 781 gcgtgggtag cgaacgggat tagatacccc ggtagtccac gcagtaaacg ttgtacactc 841 ggtgtngcgg atattnaaat ctgctgtgcc taagctaacg cattaagtgt accgcctggn 901 ragtacggtc gcaagactaa aactcaaagg aattgacggg ggcccgcaca agcggtggag 961 catgtggttt aattcgacgn aacgcgaaga accttacctg ggtttgacat cctgtgaata 1021 ttgtgtaatt gcaatagtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtca 1081 gctcgtntcg tgagatgttt ggttaagtcc agcaacgagc gcaaccctta tcgtcagttg 1141 ccagcacntr atggtgggaa ctctggcgag actgccccrg nnnacgggra ggaaggtggg 1201 gatgacgtca agtcctcatg gcccttatac ccagggctac acacgtgcta caatggtagg 1261 tacaaagggc agcgactctg crragggaag cgaatcccaa aagcctatct cagtccggat 1321 tggggtctgc aactcgaccc catgaagttg gaatcgctag taatcgcgga tcagcatgcc 1381 gcggtnaata // LOCUS DSPRR1614 1396 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.curvatus 16S ribosomal rRNA. ACCESSION M34413 KEYWORDS ribosomal 16S rRNA. SOURCE D.curvatus (DSM 3379) rRNA. ORGANISM Desulfobacter curvatus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1396) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1396 16S rRNA BASE COUNT 339 a 292 c 432 g 285 t 48 others ORIGIN 1 actggagagt ttgatcctgg ctcagaatga acgctggcgg cgtgctnaac acatgcaagt 61 cgaacgagaa agggattgct tgcaatcccg agtagagtgg cgcacgggtg agnnacacgt 121 rgatratctg ccttcaagcc tgggataact rttcgaaagg gtagctaata ccggataaag 181 tcggnncacn nnagtggatt gatgaaagat tgcctcttct tgaaagcaat tgtttgggga 241 tgagtttgcg taccattagc ttgttggtgg ggtaacggcc taccaaggca acgatggtna 301 gctggtctga gaggatgaca ggccacactg gaactgagac acggtcgact cgtgaacggg 361 aggcagcagt gaggaatttt gcgcaatggg ggaaaccctg acgcagcgac gccgcgtgag 421 tgaagaaggc ctttgggtcg taaagctctg tcgactggga agaagttcca atgttttaat 481 agaaggtggt attgagggta ccaggggagg aacgaccggc nnactccgtg ccagcagcag 541 cggtaacacn nngggggcna gcgttattcg gaanattggg cntaaagggc gcgcaggcgg 601 tcttgtccgt caggtgtgaa agctcggggc tcaaccccgg aagtgcactt gaaacagcaa 661 gacttgaata ccgtagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc 721 aggaggaaca ccgatggcga aggcatctct ctggaccgat attgacgctn aggcgnnaag 781 gcgtgggtag cgaacgggat tagatacccc ggtagtccac gcagtnaacg ttgtacnctc 841 ggtgtngcgg atatnnaaat ctgctgtgcc naagctaacg cattaagtgt accgcctggg 901 gagtacggtc gcaagactaa aactcnaagg aattgacggg ggcccgcaca gtcggtggag 961 catgtggttt aattcgacgn aacgcgaaga accttacctg ggtttgacat cctgtgaata 1021 yygrgtaatn gnrrtngtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtca 1081 gctcgtgtcg tgagatgttt ggttaagtcc agcaacgagc gcaaccctta tcgttagttg 1141 ccagcacgtt atgtgggaac tctaacgaga ctgcccgggt nnaccgggag gaaggcgggg 1201 atgacgtcaa gtcctcatgg cccttatatc cagggctacg cgcgtgctac natggtaggt 1261 acaaagggca gcgactttgc rraaagaagc gaatgcccaa aagcctatcg cngtccggat 1321 tggggtctgc nactcgaccc catgaagttg gaatcgctag taatcgcgga tcngcatgcc 1381 gcggtnaata tgttcc // LOCUS DSPRR1615 1373 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfobacter latus 16S ribosomal rRNA. ACCESSION M34414 KEYWORDS ribosomal 16S rRNA. SOURCE D.latus (DSM 3381) rRNA. ORGANISM Desulfobacter latus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1373) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1373 16S rRNA BASE COUNT 322 a 268 c 393 g 267 t 123 others ORIGIN 1 ctggagagtt tgatcctggc tcagaatgaa cgctggcggc gtgcttaaca catgcaagtc 61 gaacgagaaa gggattgctt gcaatcctga gtagagtggc gcacgggtga gtaacacgta 121 gataatctgc cttcaagcct gggataacta ttcgaaaggg tagctaatac cggataaagt 181 cgattyacay nagtnnattg atgaaagatt gcctcttctt gaaagcaatt gtttggggat 241 gagtttgcgt accattagct agttggtggg gtnaaggcct accaaggcnn cgatggnnag 301 ctggtctgag aggatgatca gccacactgg aactggaaca cggtccngnc tcctacggga 361 ggcagcagtg aggaattttg cgcratgggg gcaacccnna cgcagcaatg ccgcgtgagt 421 gaagaaggcc tttgggtcgt aaagctctgt caacaaggaa gaaattagga attattaata 481 gtngtttcta ttgacggtac ttgtngagga agcgccggct nactccgtgc cagcagccgc 541 ggtaacacgg ggggcgcnag cgttattcgg aattattggg cgtaaagggc gcgcaggcgg 601 tnttgtccgt naggtgtgaa agcccggggc tcaaccccgg aagagcactt gaaacagcaa 661 gacttgaata cgggagagga gagaggaatt cctggtgtag aggtgaaatt cgtagatatc 721 aggaggaaca ccgatggcga aggcatctct ctngaccgrt attgacgctn aggcgnnaag 781 gcgtggggag cgaacgggat tagatacccc ggtagtccac gcagtaaacg ttgtacgctc 841 ggtgtngcgg atattnaaat ctgctgtgcc naagctaacg cattaagtgt accgcctggg 901 nagtacggtc gcaagactaa aactcraagg aattgrcggg ggcccgcrca agcggtggag 961 catgtggttt aattcgatnn aacgcgaaga accttacctg ggtntgacat cctgtgaata 1021 tcccgtnatt gggatngtgc cttcgggagc acagagacag gtgctgcatg gctgtcgtcn 1081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1141 nnnnnnnnnn nnnntgggaa ctctggcgag actgccccgg tnnacgggga ggaaggtggg 1201 gatgacgtca agtcctsrtg gccctnatat ccagggctac acacgtgcta caatggtagg 1261 tacaaagggc agcgactttg crrraagaag cgaatcccaa aangcctntc ccagtccgaa 1321 ttggggtctg caactcgacc ccatgaagtt ggaatcgcta gtaatcgcgg atc // LOCUS DSPRR1616 1397 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfobacter sp. Vulgaris 16S ribosomal rRNA. ACCESSION M34415 KEYWORDS ribosomal 16S rRNA. SOURCE Desulfobacter sp. (strain 3ac10 DSM 2035) rRNA. ORGANISM Desulfobacter sp. Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1397) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1388 16S rRNA BASE COUNT 363 a 290 c 404 g 296 t 44 others ORIGIN 1 actggagagt ttgrtcctgg ctcagaatga acgctggcgg cgtgcttaac acatgcaagt 61 cgaacgagaa agggattgct tgcratcctg agtagagtgg cgcacgggtg agtaacgcgt 121 agataatctg ccttcaagcc tgggataact attcgaaagg gtagctaata ccggataaag 181 tcgattcaca taagtaaatt gatgaaagat tgcctcttct tgaaagcaat tgtttgggga 241 tgagtttgcg taccattagc ttgttggtgg ggtaaaggcc taccaaggca acgatggtna 301 gctggtctga gaggatgatc agccacactg gaactggaac acggtccaga ctcctacggg 361 aggcagcagt gaggaatttt gcrgcrratg ggggcaaccc tgacgcagca acgccgcgtg 421 agtgaagaag gcctttgggt cgtaaagctc tgtcaacaag gaagaaatta caattattta 481 atagataatt gtattgacgg tacttgtaga ggaagcgccg gcnaactcnn nnnnnnnnnn 541 nnnggtaaca cggggggcgc aagcgttaat tggaattatt gggcgtaaag ggcgcgcagg 601 cggtcttgtc cgtcaggtgt gaaagctcgg ggctcaaccc cggaagtgca cttgaaacag 661 caagacttga atacgggaga ggagagagga attcctggtg tagaggtgaa attcgtagat 721 atcaggagga acaccgatgg cgaaggcatc tctctggacc gatattgacg ctnaggcgct 781 aaggcgtggg tagcgaacgg gattagatac cccggtagtc cacgcagtaa acgttgtaca 841 ctcggtgtgg cggatattta aatctgctgt gcccaagcta acgcattaag tgtaccgcct 901 gggaagtacg gtcgcaagac taaaactcaa aggaattgac gggggcccgc acaagcggtg 961 gagcatgtgg tttaattcga cgnaacgcga agaaccttac ctgggtttga catcctgtga 1021 atatcccgta attgggatag tgccttcggg agcacagaga caggtgctgc atggctgtcg 1081 tcagctcgtg tngtgagatg tttggttaag tccagcaacg agcgcaaccc ttacgtcagt 1141 tgccagcaca taatggtggg aactctggcg aactgcccgc tanannnnnn nnnnnnnnnn 1201 nntgacgtca agtcctcatg gcccttatat ccagggctac acacgtgcta caatggtagg 1261 tacaaagggc accgactctg caaagggaag cgaaatccca aaagcctatc tcagtccgga 1321 ttggggtctg caactcgacc ccatgaagtt ggaatcgcta gtaatcgcgg atcagcatgc 1381 cgcggagaat atgttcc // LOCUS DSPRR1617 1472 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfobacter sp. 16S ribosomal rRNA. ACCESSION M34416 KEYWORDS ribosomal 16S rRNA. SOURCE Desulfobacter sp. (strain 4ac11 DSM 2057) rRNA. ORGANISM Desulfobacter sp. Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1472) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1472 16S rRNA BASE COUNT 368 a 312 c 442 g 313 t 37 others ORIGIN 1 aactggagag tttgrtcctg gctcagaatg aacgctggcg gcgtgcttaa cacatgcaag 61 tcgaacgaga aagggattgc ttgcaatcct gagtagagtg gcgcrcgggt gagtaacrcg 121 tagataatct gccttcaagc ctgggataac tatccgaaag gatagctaat accggataaa 181 gtcgatttac acaagtagac cgatgaaaga ttgcctcttc ttgaaagcaa ttgtttgggg 241 atgagtttgc gtaccattag ctaggttggt ggggtaaagg cctaccaagg cngngatggt 301 nagctggtct gagaggatga tcagtcacac tggaactgga acgcggtcca gnctcctacg 361 ggaggcagca gtgaggaatt ttgcgcaatg ggggcaaccc tgacgcagcg acgccgcgtg 421 agtgaagaag gccnntgggt cgtaaagctc tgtcwacwag gaagaaatta ggatctatta 481 atagtgggtt ctattgacgg tacttttgga ggaagcgccg gctnactckk tgccagcagc 541 ngcggtaaca cggggggcgc nagcgttatt cggaattatt gggcgtnaag ggngcgcagg 601 cggtcttgtc tgttaggtgt gaaagcgggg gctcaacccc ggaagtgcac ttgaaacagc 661 aagacttgaa tacgggagag gagagaggaa ttcctggtgt agaggtgaaa ttcgtagata 721 tcaggaggaa caccgatggc gaaggcatct ctctggaccg atattgacgc tnaggcgnna 781 aggcgtgggt agcgaacggg attagatacc ccggtagtcc acgcagtnaa cgttgtacac 841 tcggtgtagc ggatattnaa atctgctgtg ccnaagctaa cgcattaagt gtaccgcctn 901 gnnngtacgg tcgcaagact aaaactcaaa ggaattgacg ggggcccgca caagcggtgg 961 agcatgtggt ttaattcgac nnaacgcgaa gaaccttacc tgggtttgac atcctgtgaa 1021 tatcccgtaa ttgggrtrgt gccttcggga gcacagagac aggtgctgca tggctgtcgt 1081 cagctcgtgt cgtgagatgt ttggttaagt ccagcaacga gcgcaaccct tatcgtcagt 1141 tgccagcacg taatggtggg aactctggcg agactgcccc ggtcaacggg gaggaaggtg 1201 gggatgacgt caagtcctca tggcccttat atccagggct acacacgtgc tacaatggta 1261 ggtacaaagg gcagcgactt tgcaaaaagg agcgaatccc naaaagccta tctcagtccg 1321 aattggggtc tgcaactnga ccgnatgaag ttggaatcgc tagtaatcgc ggatcagcat 1381 gccgcggtna atatgttccc gggccttgta cacaccgccc gtcacaccat ggaagttgat 1441 tatacccgac gtcrctgggc taactattta ga // LOCUS DSURR1611 1371 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.propionicus 16S ribosomal rRNA. ACCESSION M34410 KEYWORDS ribosomal 16S rRNA. SOURCE D.propionicus (ATCC 33891) rRNA. ORGANISM Desulfobulbus propionicus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1371) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1371 16S rRNA BASE COUNT 333 a 271 c 397 g 276 t 94 others ORIGIN 1 naactggaga gtttnatcct ggctcagaac gaacgctggc ggcgtgctta acacatgcaa 61 gtcgaacgcg aaagggactt cggtcctgag taaagtggcg cacgggtgag taacgcgtag 121 ataacctnnc ttcatgtctg gaataatacg ccgaaagggg trctaatacc ggatattntn 181 gctnttcata agttatgcaa gcraaaggtg gcctctgcat atgctactgc rtgaagaggg 241 gtctgcgtac cattagctag taggtggggt aatggcctac ctaggcgacg atggtnagcg 301 ggtctgagag gatgatccgc nacactggca ctggaacacg ggcnngnctc ctacgggagg 361 cagcagtgag gaatattgcg caatgggggn aacccnnacg cagcgacgcn gcgtgagtga 421 ggaaggcctt cgggtcgtaa agctctgtca agaggaaaga agtgtacaat ggctnatacc 481 tnttgtnttt gacggtacct ctnaaggaag caccggctna ctccgtncca gcagcnncgg 541 taatacggag ggtgcaagcg tnnttcggaa tcactnggng tnaagggcgc gtaggcggnt 601 tggtaagtca gatgtgaaag cccacggctn aactgtggaa gtgcatttga aactgtcagg 661 cntgagtacc agaggggaaa gtggaattcc cggtgtagag gtnaaattcg tagatatcgg 721 gaggaatacc ggtggcgaag gcgactttct ggctngatac tnacgctnag gcgcnaaagc 781 gtggggagca aacaggatta gataccctng tagtccacgc tgtaaacgat gtgaactaga 841 tgcagggggt gtnnatccnn nctgtgtngc agctaacgca ttnagttnnn nnnnnnnnna 901 gtacggnngc aagattaaaa ctcaaaggaa ttgacggggn ncngcacaag cggtggagta 961 tgtggtttaa ttcgatnnaa cgcgaagaac ctnacctngt nttgacatcc cggnaatctn 1021 ttggaaacaa gagagtgctt ccgnnagnat ctggagacag nnnctgcatn gctntcgtna 1081 nnnnntgttg tgagatgttg ggttaagtcc cnnaacgagc gcaacccttg cctttagttg 1141 ccagcagttc ggctggrcac tctaaaggga ctgccggtgt taaaccggag gaaggtgggg 1201 atgacgtcaa gtcctnatgg ccnntatgac cagggctaca cacgtcctac aatggccgat 1261 acaaagggca gcgacactgc gaggtggagc taatcccata aaatcggnct cagtccggat 1321 tggagtctgc aactcgactc catgaagttg gaatcgctag taatcgngga t // LOCUS DSURR1612 1365 bp ss-rRNA BCT 24-JUL-1990 DEFINITION Desulfobulbus sp. 16S ribosomal rRNA. ACCESSION M34411 KEYWORDS ribosomal 16S rRNA. SOURCE Desulfobulbus sp. (strain 3pr10, DSM 2058) rRNA. ORGANISM Desulfobulbus sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1365) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1365 16S rRNA BASE COUNT 299 a 267 c 389 g 267 t 143 others ORIGIN 1 gaagagtttn attctggctc agaacgaacg ctggcggcgt gcntnncaca tgcaagtcga 61 accgnnnnnn nnnnnnnntc ctagtagtgg cgcacgggtg agtracgcgt agatracctg 121 ncctnatgtc tgggatrccc caccgaaagg tgggctaata ccggatgtnc tnntntttca 181 tnagcgtaat aaagaaaagg tggcctctga tattaagctn ctgcrtgagr aggggtctgc 241 gtaccattag ctagttggtg gggtnagngc ctaccaaggc atcgatggtn agcgggtctg 301 agngnatgat ccgtcacact ggcactgaaa cncgggccng rctccnacgg gaggcngcng 361 tgaggaatnt tgcgcnatgg gggcnaccct nacgcagcga cgccgcgtga gtgagnaagg 421 ctttcgggtc gtaaagctct gtnnaaggga agaaatgcat aatnattnat ncttnttatg 481 tttgacggta cctttaaagg aagcaccggc taactccgtg ccagnagccg cggtaatacg 541 gagggtgcaa gcgttgtccg gaattactgg gcgtnaaggg cacgcaggcg gcctgataag 601 tcagatgtga aagcccacgg ctnnaccgtg gaagtgcatt tgaaactatt aggnnngagt 661 atcagagggg aaagtggaat tcctggtgta gaggtgaaat tcgtagatat caggaggaat 721 rccagtggcg aaggcgactt tctggctgaa tactgacgct gaggtgcgaa agcgtgggga 781 gsaaacagga ttagataccc tggtagtcca cgccgtaaac gatgtcaact aggtgtaggg 841 ggggttnatc csstctgtgc cgcagctrac gcattragtt gacngcctgg ggagtacggt 901 cgcaagatta aaactcnaag naattgrcgg gggcccgcrc nagcggtgga gtatgtggtt 961 taattcgatg nnacgcgaag aaccttacct ggtcttgaca tcccaagaat cttctagaaa 1021 tagaagagtg cttttcggag aacttggtga caggtgctgc atggctgtcg tcagcnnnnn 1081 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ttgccagtag 1141 ttcggctgag cactctaaag ggactgccgg tgtnnaaccg gaggaaggtg gggatgacgt 1201 caagtcctca tggcsnntat gaccagggct acacacgtac tacnatggcc ggtacnaagg 1261 gcagcgacag ggcgrcctna agcgaatccc gtraagccgg nctcngtccg gattggagtc 1321 tgcnactcga ctccatgaag tcggaatcgc tagtaatcgt ggatc // LOCUS DSVRR168 1527 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.variabilis 16S ribosomal rRNA. ACCESSION M34407 KEYWORDS ribosomal 16S rRNA. SOURCE D.variabilis (strain DSM 2060) rRNA. ORGANISM Desulfosarcina variabilis Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 1527) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1527 16S rRNA BASE COUNT 380 a 338 c 458 g 328 t 23 others ORIGIN 1 ttaattggag agtttgatcc tggctcagaa tgaacgctgg cggcgtgctt aacacatgca 61 agtcgtacga gaacgctcta gcttgctaga gcaagtaaag tggcgcacgg gtgagtaacg 121 cgtgggtaac ctaccttcga attggggata acgttgcgaa agcgacgcta ataccgaata 181 atatccaaat atcttcggat ttttggatca agggtggcct cttcttgaaa gctactgttt 241 gaagatgggc ccgcgtacca ttagcttgtt ggtggggtaa tggcctacca aggcaacgat 301 ggttagctgg tctgagagga tgatcagcca cactggaact gacacacggt ccagactcct 361 acgggaggca gcagtgagga attttgcgca atgggcgaaa gcctgacgca gcaacgccgc 421 gtgagtgatg aaggctttcg ggtcgtaaag ctctgtcaag tgggaagaac ctatataatg 481 ttaatatcat tatatactga cggtaccact gaaggaagca ccggctaact ccgtgccagc 541 agccgcggta atacgggggg tgcaagcgtt attcggaatc attgggcgta aagagcacgt 601 aggcggcctg gtaagtcaga tgtgaaagcc cggggcttaa ccccggaagt gcatttgata 661 ctgtcaggct tgagtatggg agagggaagt ggaattcctg gtgtagaggt gaaattcgta 721 gatatcagga ggaacaccgg tggcgaaggc gacttcctgg accaatactg acgctgaggt 781 gcgaaggcgt gggtagcaaa caggattaga taccctggta gtccacgcag taaacgttga 841 tcactaggtg tagcrgggta ttgacccctg ctgtgccgca gttaacgcat taagtgatcc 901 gcctggggag tacggtcgca agattaaaac tcaaaggaat tgacgggggc ccgcacaagc 961 ggtggagcat gtggtttaat tcgannnaac gcgaagaacc ttacctggat ttgacatccc 1021 gggaatccta tcgaaagact ggagtgccct tcggggagcc cggwgacagg tgctgcatgg 1081 ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caacccttgt 1141 ctttagttac catcatttag ttggggactc taaagatact gccccggtca acggggagga 1201 aggtggggat gacgtcaagt cctcatggcc tttatgtcca gggctacaca cgtgctacaa 1261 tgggcggtac aaagggtagc gaacccgcga gggtaagcca atcccaaaaa gccgtcccaa 1321 gttcggattg gagtctgcaa ctcgactcca tgaagctgga atcgctagta atcgcggatc 1381 agcatgccgc ggtgaatacg ttcccgggcn ttgtacacac cgcccgtcac accatgagag 1441 ttggttgtac ccgaagtcgt cgggctaacc cttcggggat gcangcgcct aaggtatggc 1501 tgannnnnnn nnnnnnnnng taacaag // LOCUS DVURR16 1388 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.vulgaris 16S ribosomal rRNA. ACCESSION M34399 KEYWORDS ribosomal 16S rRNA. SOURCE D.vulgaris (strain Hildenborough DSM 644) rRNA. ORGANISM Desulfovibrio vulgaris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1388) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1388 16S rRNA BASE COUNT 306 a 315 c 418 g 248 t 101 others ORIGIN 1 taactggaga gtttgattct ggctcagatt gaacgctggc ggcgtgctta acacatgcaa 61 gtcgngcgtg aaaggacttc ggyccgagta aagcggcgcr cgggtgagta acgcgtggat 121 gatctaccca tgagttggga ataacggctg gaaacggtcg ctaataccga atacgctccg 181 atttcracnt cgggggaaag gtggcctctg cttgcaagct actgctcatg gatgagtccg 241 cgtcccatta gctagttggt ggggtaacgg cccacnaagg cgacratggg nagccggtct 301 gagaggatga ccggccacac tgggactgga acacggccna gactcctacg ggaggcagca 361 gtggggaata ttgcgcaatg ggcgaaagcc tnacgcagcg acgccgcgtr agggatgaag 421 gtcctcggat cgtnaacctc tgtcnggagg gaagaaccgc cacggtgctn atcagccgtn 481 gtctgacggt acctccagag gaagcaccgg ctaactccgt gccagcagcc gcggtaatac 541 ggagggtgca agcgttaatc ggaatcactg ggcgtaaagc gcacgtaggc tgcttggtaa 601 gtcaggggtg aaagccggnn nctcaaccgc ggaattgcct ttgatactgc caagcnagag 661 tccgggagag ggtagtggaa ttccaggtgt aggagtgaaa tcmgtagaga tctggaggaa 721 catcagtggc gaaggcgact acctggaccg gnactgacgc tnaggtgcnn aagcgtgggg 781 agcaaacagg attagatacc ctngtagtcc acgccgtaaa cgatggatgc taggtrtcgg 841 ggccttgarr cttcggtgcc gtagttaacg cgttaagcaa tccgcctngg gagtacggtc 901 gcnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn aacgcgaaga accttaccta ggtttgacat ccggaagacc ttcccgaaaa 1021 ggaaggntgc cttcggggaa ttccgagaca ggtgctgcat ggctgtcgtc agctcgtgcc 1081 gtgaggtgtt gggttaagtc ccgcaacgag cgcaacccct attgccagtt gctaccaggt 1141 aatgctgggc tctctggtga gactgccccg gttaacgggg aggaaggtgg ggacgacgtc 1201 aagtcatgat ggcccttacg cctagggcta cacacgtact acaatggcgc atacaaaggg 1261 cagcgatacc rcraggtgga gccaatccca aaaagtgcgt cccagtccgg attgcagtct 1321 gcaactcgac tgcatgaagt tggaatcgct agtaattcga gatcagcatg cncgggtgaa 1381 tgcgttcc // LOCUS DVURR161 1370 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.gigas 16S ribosomal rRNA. ACCESSION M34400 KEYWORDS ribosomal 16S rRNA. SOURCE D.gigas (ATCC 19364) rRNA. ORGANISM Desulfovibrio gigas Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1370) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1370 16S rRNA BASE COUNT 310 a 303 c 393 g 243 t 121 others ORIGIN 1 taactggaga gtttgattct ggctcagatt gaacgctrgc ggcgtgctta acacatgcaa 61 gtcgagcgag anngggtctt tcgggacctg agtacagcgg cgcacgggtg agtaacgcgt 121 ggatgatcta cctcattgrc cgggataacg gtgggaaacc gctgctaata ctggatacgc 181 ttcatggaaa catgagggaa aggtggcctc tgatttaagc trtcacaatg agatgagtcc 241 gcgtctcatt agcttgttgg tagggtaatg gcctaccaag gctacgatga gtagcggccc 301 tgagagggtg gccrgccaca ctggaactga aacacggtcc agactcctac gggaggcagc 361 artggggaat attgcgcaat gggcgaaagc ctnacgcagc gacgccgcgt gagggaagaa 421 ggccttcggg tcgtaaacct ctgtctggag gaaagaaacc cataggtgtg aatagggctt 481 atggctgacg gtacctccga aggaagcacc ggctaactcc gtgccagcag ccgcggtaat 541 acggagggtg caagcgttaa tcggaatcac tgggcgtaaa gcgcacgtag gctgtcttgc 601 aagtcaggtg tgaaagccct cggctcaacc ggggaactgc acttnaaact gcatgacttg 661 agtacaggag aggatggcag aattccgggt gtaggggtna aatccgtaga tatccggagg 721 aataccagtg gcgaaggcgg ccatctggac tgtcactgac gctgaggtgc rnaagcgtgg 781 ggagcaaaca ggattagata ccctngtagt ccacgctgta aacgatggac gctnggtrtc 841 grggggtcgc ccctcggcgc cgaactaacg cgttnannnn nnnnnnnnnn nnnnnnnnnn 901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn aacgcgaaga accttaccta ggcttgacat cccgggaacc ctcccgaaaa 1021 ggaggnntgc tcttcggaga gcccggagac aggtgctgca tggctrtcgt cagctcgtgc 1081 cgtgaggtgt tgggttaagt cccgcaacga gcgcaacccc tgtccatagt tgccnnnngg 1141 taaagctngn cactctatgg agactgcctt ggcaacaagg aggaaggtgg ggatgacgtc 1201 aagtcatcat ggcccttacg cctagggcta cacacgtact acaatggcga gtacaaaggg 1261 ttgcgaaacc gcgaggttga gccaatccca aaaagctmgt cccagtccgg attgcagtct 1321 gcaactcgac tgcatgaagt cggaatcgct agtaatcccg gatcagcatg // LOCUS DVURR162 1400 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.salexigens 16S ribosomal rRNA. ACCESSION M34401 KEYWORDS ribosomal 16S rRNA. SOURCE D.salexigens (ATCC 14822) rRNA. ORGANISM Desulfovibrio salexigens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1400) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1400 16S rRNA BASE COUNT 320 a 306 c 393 g 270 t 111 others ORIGIN 1 actggagagt ttgattctgg ctcagattga acgctggtgg cgtgcttaac acatgcaagt 61 cgtgcgagaa cagttccttc gggaacctag tagagcggcg cacgggtgag taacgcgtgg 121 ataatctacc cagaagactg ggataacagt tggaaacgac tgctaatacc ggatacgttt 181 catatttaac tnnatnagag aaaggtggcc tctntttcaa gctatcactt ttggatgagt 241 ccgcgtttca ttagctagtt ggtaaggtaa cggcttacca aggcgacgat gaatagctgg 301 tctgagagga tgaccagcca cactgggact ggaacacggc cnagactcct acgggaggca 361 gcagtgggnn atattgcgca atgggggaaa ccctgacgca gcgacgccat gtgagggacg 421 aaggctttcg ggtcgtaaac ctctgtcrrg rrggaagaaa ctgtttgarg ctaataccct 481 ntttcacttg acggtaccct cagaggaagc accggctaac tccgtgccag cagccgcggt 541 aatacggagg gtgcgagcgt taatcggaat cactgggcgt aaagcgcgcg taggtggcgc 601 gataagtcag gcgtgaaagc cctcggctca accggggaat tgcrcttgat actgtcgtgc 661 ttgagtctcg gagagggtgg cggaattcca ggtgtaggag tgaaatccgt agatatctgg 721 aggaacacca gtggcgaagg cggccacctg gacgagtact gacgctgagg tgcgaaagcg 781 tggggagcaa acaggattag ataccctngt agtccacgcc gtaaacgatg gatgctagat 841 gtcgggcctt aaccaggttc ggtgtcgaag ttaacgcgat aagcatccnn nnnnnnnnnn 901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 961 nnnnnnnnnn nnnnnnaacg cgaagnacct tacctggact tgacatcctg agaatcctct 1021 agaaatagag gagtgccttc gggnnattca gtgacaggtn ctgcatggct ntcgtcagct 1081 cgtgccgtga ggtgttgggt taagtcccgc aacgagcgca acccctattg ctagttgcca 1141 tcacataatg gtgggcactc tagtgagact gcccgggtca accgggagga aggtgggnac 1201 gacgtcaagt catcatggcc cttacgtcca gggctacaca cgtactacaa tggtggatac 1261 aaagggttgc caagccgcga ggccgagcca atcccaaaaa gtccrtccca gtccggatcg 1321 cagtctgcaa ctcgactgtg tgaagttgga atcgctagta atcccggatc agcatgccgg 1381 ggtnaatacg ttcccgggcc // LOCUS DVURR163 1395 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.sapovorans 16S ribosomal rRNA. ACCESSION M34402 KEYWORDS ribosomal 16S rRNA. SOURCE D.sapovorans (strain 1pa3 ATCC 33892) rRNA. ORGANISM Desulfovibrio sapovorans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1395) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1395 16S rRNA BASE COUNT 334 a 291 c 427 g 286 t 57 others ORIGIN 1 gaacggagag tttgatcctg gctcagaatg aacgctggcg gcgtgctnaa cacatgcaag 61 tcgaacgaga aagttccttc gggggataag taaagtggcg cacgggtgag taatgcgtgg 121 ataatctacc rttgaattcg gaataacatc gcgaaagcgg tgctaatacc ggatratgtt 181 gtrttttctt cggggaatgc aaccaaagaa tgcctcttca tgaaagcatt tgtttgagga 241 tgagtccgcg tcccattagc tagttggcgg ggtaacggcc caccaaggca gtgatgggna 301 gctngtctga gaggatgatc agccacactg ggactgacac acggccnnga ctcctacggg 361 aggcagcagt gaggaatttt gcgcaatggg ggcaaccctg acgcagcaac gccgcgtgag 421 tgaagaaggc ccttgggtcg taaagctctg tcnatgggga agaagttgtg tggttcaaac 481 agggccatgc attgacggta cccatnaagg aagcaccggc tnactccgtg ccagcagccg 541 cggtnatacg gagggtgcaa gcgttattcg gaattattgg gcgtnaaggg ngcgtaggcg 601 gcctgtnaag tcatctgtga aatttcgggg ctnaaccccg gagctgcatg tgatactggc 661 aggnttgagt atggcagagg aaagcggaat tcctggtgta gcggtgaaat gcgtagatat 721 caggaggaac accrgtggcg aaggcggctn tctngaccna tactgacgct gatgcgcgaa 781 ggcgtgggta gcaaacagga ttagataccc tngtagtcca cgcagtaaac gttgatcact 841 aggtgttgcg ggtattgacc cctgagtgcn gaagctaacg cattaagtga tcngcctggg 901 gagtacgatc gcaagattaa aactcaaagg aattgacggg ggnccgcana agcggtngag 961 crtgtggttt aattcgannn aacgcgcann accttacctg ggcttgaaat ctgtggaact 1021 tgtatgaaag tatggggnnn cttcggggaa ccgcaagaca ggtgctgcat ggctgtngtc 1081 agctcgtgtc gtgagatgtt gggtnaagtc ccgcaacgag cgcaaccctc gtcttcagtt 1141 gccagcacgt aaaggtggga actctgaaga tactgccccg gtnnncgggg nggaaggtgg 1201 ggntgacgtc aagtcctnat ngcnnntntg cccagggctn cacacgtgct ncaatggtat 1261 atacaaaggg nnncgatncc gcgaggtgga nccaatccca tnaagtatgc cacagttcgg 1321 attggagtct gcaactcgac tccatgaagt tggaatcgct agtaatcgcg gatcagcatg 1381 ccgcggtgaa tacgt // LOCUS DVURR164 1395 bp ss-rRNA BCT 24-JUL-1990 DEFINITION D.baarsi 16S ribosomal rRNA. ACCESSION M34403 KEYWORDS ribosomal 16S rRNA. SOURCE D.baarsi (strain 2st14 DSM 2075) rRNA. ORGANISM Desulfovibrio baarsi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1395) AUTHORS Devereux,R., Delaney,M., Widdel,F. and Stahl,D.A. TITLE Natural relationships among sulfate-reducing eubacteria JOURNAL J. Bacteriol. 171, 6689-6695 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.A.Stahl, 22-MAY-1990. FEATURES from to/span description rRNA < 1 1395 16S rRNA BASE COUNT 334 a 291 c 427 g 286 t 57 others ORIGIN 1 gaacggagag tttgatcctg gctcagaatg aacgctggcg gcgtgctnaa cacatgcaag 61 tcgaacgaga aagttccttc gggggataag taaagtggcg cacgggtgag taatgcgtgg 121 ataatctacc rttgaattcg gaataacatc gcgaaagcgg tgctaatacc ggatratgtt 181 gtrttttctt cggggaatgc aaccaaagaa tgcctcttca tgaaagcatt tgtttgagga 241 tgagtccgcg tcccattagc tagttggcgg ggtaacggcc caccaaggca gtgatgggna 301 gctngtctga gaggatgatc agccacactg ggactgacac acggccnnga ctcctacggg 361 aggcagcagt gaggaatttt gcgcaatggg ggcaaccctg acgcagcaac gccgcgtgag 421 tgaagaaggc ccttgggtcg taaagctctg tcnatgggga agaagttgtg tggttcaaac 481 agggccatgc attgacggta cccatnaagg aagcaccggc tnactccgtg ccagcagccg 541 cggtnatacg gagggtgcaa gcgttattcg gaattattgg gcgtnaaggg ngcgtaggcg 601 gcctgtnaag tcatctgtga aatttcgggg ctnaaccccg gagctgcatg tgatactggc 661 aggnttgagt atggcagagg aaagcggaat tcctggtgta gcggtgaaat gcgtagatat 721 caggaggaac accrgtggcg aaggcggctn tctngaccna tactgacgct gatgcgcgaa 781 ggcgtgggta gcaaacagga ttagataccc tngtagtcca cgcagtaaac gttgatcact 841 aggtgttgcg ggtattgacc cctgagtgcn gaagctaacg cattaagtga tcngcctggg 901 gagtacgatc gcaagattaa aactcaaagg aattgacggg ggnccgcana agcggtngag 961 crtgtggttt aattcgannn aacgcgcann accttacctg ggcttgaaat ctgtggaact 1021 tgtatgaaag tatggggnnn cttcggggaa ccgcaagaca ggtgctgcat ggctgtngtc 1081 agctcgtgtc gtgagatgtt gggtnaagtc ccgcaacgag cgcaaccctc gtcttcagtt 1141 gccagcacgt aaaggtggga actctgaaga tactgccccg gtnnncgggg nggaaggtgg 1201 ggntgacgtc aagtcctnat ngcnnntntg cccagggctn cacacgtgct ncaatggtat 1261 atacaaaggg nnncgatncc gcgaggtgga nccaatccca tnaagtatgc cacagttcgg 1321 attggagtct gcaactcgac tccatgaagt tggaatcgct agtaatcgcg gatcagcatg 1381 ccgcggtgaa tacgt // LOCUS DRORNP70K 3569 bp ds-DNA INV 24-JUL-1990 DEFINITION D.melanogaster U1 70K small nuclear ribonucleoprotein gene, complete cds. ACCESSION M31162 KEYWORDS ribonucleoprotein; small nuclear ribonucleoprotein. SOURCE D.melanogaster (strain Oregon R) DNA, clones lambda D331.[1,2]; and adult head, cDNA to mRNA, clones pBS9.[1,3] and lambda R3-2.1. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 3569) AUTHORS Mancebo,R., Lo,P.C.H. and Mount,S.M. TITLE Structure and expression of the Drosophila melanogaster gene for the U1 small nuclear ribonucleoprotein particle 70K protein JOURNAL Mol. Cell. Biol. 10, 2492-2502 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Mancebo, 09-JAN-1990. FEATURES from to/span description pept 614 1960 U1 70K small nuclear ribonucleoprotein, exon 2 (first expressed exon) pre-msg < 1 2259 snRNP mRNA and intron (alt.) pre-msg < 1 3367 snRNP mRNA and intron (alt.) IVS 488 566 U1 70K snRNP, intron A site 103 182 RNA recognition motif BASE COUNT 972 a 839 c 919 g 839 t ORIGIN Chromosome II; map position 27D1,2. 1 gcatgaagga aaatattcta caaaaaactt caattttata aaattcattt aaaatacaaa 61 attgtacgta aacttaacgt aaccgttact cagttatgga atgtgtgagc gagatggtga 121 agcagcagca agtgatgtag caaattgcaa ttgaacgcga gtgggaaaag gggcaactat 181 aaaaccgaga aacttgcttt tagcatggat tcgaacccct tatttatagt actctggatg 241 tccgagacac acacctattt gtggtattta tattttataa cgtaagtagt atatttaatt 301 actaatcagt atttcatgcg gaattcttcc gcttaattca tagaccgcgc gggggtcaca 361 cttgctactc aagccaggcg aaaaactaaa gaaaatcggg aaaatacttg gtctgcaccg 421 aattatattg ctggtactta aacgaagtac cctagattta ttcttgccaa gcggatggct 481 gtttaaggtg agttgcgcca gcgcttacta tcccttgtgg agtaaacaaa ctccaaccta 541 acctcaaact gaccgttttt ttgcagacga ggaacttcag gaaaaggtaa aacaaaacaa 601 aaaagcccac aaaatgaccc aatatctgcc gccgaatctg ctggcgctgt tcgcggcacg 661 ggagcccatc ccgttcatgc cgccggtgga caagctgccg cacgagaaga agtctcgcgg 721 ctacctggga gtggccaagt tcatggccga tttcgaggat cccaaggaca cgccgctgcc 781 gaaaacggtg gaaacgcgtc aggagcggct ggagcgacgc cggcgcgaga aggccgagca 841 ggtggcctac aagctggagc gtgagatagc gctgtgggac cccacagaga tcaaaaatgc 901 cacggaggac ccgtttcgca cgctgttcat tgcacgcatc aactacgaca cgtccgagtc 961 gaagctgcgg cgtgagttcg agttctacgg gcccatcaag aagatcgtcc tgatccacga 1021 ccaggaatca ggtaaaccca agggctacgc cttcatcgag tacgagcacg agcgggacat 1081 gcatgccgcc tacaagcacg ccgatggtaa gaagatcgac agcaagcgcg tcctggtgga 1141 cgtggagcgg gctcgcacgg tcaagggctg gctgcctcga cgcctgggcg gcggtctggg 1201 tggaacgcgc cgcggcggca acgatgtcaa cattaagcac tccggccgcg aggacaacga 1261 gagggaacgc gagcgctacc ggctggagcg ggagcgtgag gatcgcgagg gtcctggacg 1321 cggcggcggc tccaatggcc tggatgcccg gcccggacgc ggtttcggtg cggaacgtcg 1381 acgttcccgc tccagggaac gccgcgaccg tgaacgagat cgcggacggg gcgctgtggc 1441 tagcagcggt cgctcgcgca gccgttctcg cgagcgcaga aaacgacgag cgggcagccg 1501 ggagcggtac gacgagttcg accgccgcga tcggcgggac agggagcgcg agcgtgatcg 1561 cgatcgcgag cgtgagaaga aaaagaagcg ctccaagtct cgcgaacgcg aatcctccag 1621 ggagcgtcgc gaacggaagc gagagagaag ggaccgtgaa cgcggcaccg gatccggcgg 1681 cgatgtcaag gagcgcaagc ccgatttccg tgatatggat gtcatcaaga tcaaggagga 1741 gcccgtcgac gatggctatc ccacatttga ctaccagaac gcgaccatca agcgtgagat 1801 cgacgatgag gatgaggaga agtaccggcc gccgcctgcg catcacaata tgttcagtgt 1861 gccgccgccg cccattttgg ggcgtggaaa tgccagcacg aatcccaatc ccgacaatgg 1921 ccagcagagc tccggcgacc cgagttggtg gcgtcagtag agtcttggag cgatgtgtca 1981 aggttaatat ttctagaagt cagacgtctg tgggtcgcaa ctatttattc attcgactcc 2041 gaggcacccc aatcctggag agcctaaccg tagctatcca ttttacacag aaaattttaa 2101 ggtaccactt ggaggagggc caagtagaac aagattatag aaaacccgac cgctaaacgc 2161 agaatccgct aatgtgtgcg tagcttaaat cacttaaatt tataagtaac tcttaacaaa 2221 tgaatatgaa aacagtaagt aaaataaagc tagccctcat gtgtttgttt ccccaccttt 2281 ggtaaggggg ttaaagggaa tacggagagt caggagctgg aacgctttcg gtggcgcata 2341 caccgtacta tatggttact ccatcccatg gtggttcctg ggattttcta actcacctaa 2401 cataataagc tgaacaatac aaacccttgc actaactcgt gccttttatt ttctctgttt 2461 ttttgcagtt ttcaatcaat tgaaaatctg actctgacta gtgtgaaagc aaaagcataa 2521 gtatttaatc aaacaaacag taatccaaaa acggaaatta gttccgccag tattcgtagc 2581 ccatgcccaa gtctaaattc caagcccaca tcaggtaatt tggtctacgc acaaacctca 2641 ctaatccatg cgtctaccgt tctaggacag ctctagaatc aagacagcta ccgcaatact 2701 tttccaatct cctccgctct gggttgcctg tgttgtgtgg tggtgtggcg tgtaagttga 2761 ttccgggcta agaaattttg taaaccaaaa ccttttccgt aagtttgccc cggtaagatt 2821 acgatatcct cgcctaaccg ccgtggatcg gatggatgag tgagttagtg taagggagct 2881 ttcctctgtt tggtacacat tgcgaactgc tccgatggcc tgtggcaatc tactccattc 2941 atccattatg tctgtaacca atttaccatt tcgatctttt catgtacgtt gagctgattg 3001 ttatcgtact gaagacgaat ccgcggcgga aactcgcata gaacaaacag aaaactgcgc 3061 aaaggtttgc ttttggtaac tggtaacgat tggttgtggt tggtcaggtc aggggtcttt 3121 cgagctggtt agcctccact ttggattgcg gacggtgaca aaattctggg ggtcttttat 3181 ttttgtattg tattgtttgg cagacaggat ccatgcaatt gccttttcgt accgtgcgac 3241 atacgggaaa caaactatcg tgtagcaggg catttttcct attacaccat tattaagaaa 3301 agcggacaga gatcaatgtc acatttagat ttgatcaaat aaagaaaaga attcctcctt 3361 acacgaaaaa gtacttgttt catttctaag tatttatcaa gatgacttga attttttaca 3421 attttcgctt gttaaactgg tctatatgtc ctggataaga tttgcagtgg cgtccttgga 3481 atctctaaaa atgtatacat ttgttgcaaa taaagcattg tgaaatctat ggacattaaa 3541 tgcgacctct attggaaaac attcacata // LOCUS FSBAFP05A 946 bp ds-DNA VRT 24-JUL-1990 DEFINITION Macrozoarces americanus antifreeze protein OP5A gene, complete cds. ACCESSION J03924 KEYWORDS antifreeze protein. SOURCE Ocean pout adult testis DNA, clone lambda-OP5. ORGANISM Macrozoarces americanus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Gadiformes; Zoarcoidei; Zoarcidae. REFERENCE 1 (bases 1 to 946) AUTHORS Hew,C.L., Wang,N.-C., Joshi,S., Fletcher,G.L., Scott,G.K., Hayes,P.H., Buettner,B. and Davies,P.L. TITLE Multiple genes provide the basis for antifreeze protein diversity and dosage in the ocean pout, Macrozoarces americanus JOURNAL J. Biol. Chem. 263, 12049-12055 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.H.Hayes, 13-JUN-1988. FEATURES from to/span description pept 270 338 antifreeze protein, exon 1 521 715 antifreeze protein, exon 2 IVS 339 520 antifreeze protein, intron A BASE COUNT 283 a 209 c 186 g 268 t ORIGIN 211 bp upstream of Sau3AI site. 1 aagcttgtga tagtttggac aaaaacaagt tatactttac ttataagaat ataaaatttc 61 cattgcaatt ggcataagga ggtgtgacac agtgacctac tttcaggcca ataggaaacg 121 ggatatgccg gttaagtcct cccacatact gtatattaga tgcagcacat ggacctgtcc 181 tgtcagaagt ctcagctaca gctttcactt cgatctccga taattaatta attaattaat 241 tattaattaa ttaagtctca gccacagcca tgaagtccgt tattttaacc ggtttgcttt 301 tcgtcctcct ttgtgtcgac cacatgacag ccagccaggt gagatattct tgctccacta 361 aaaaatattc aaaaatgtga gctacagtaa aattcaacag tgttctgttt agaaagacag 421 agaacctttt aagtaaacat ttttagaatt ttctttttca actgtgccat gagaacaata 481 ataacgtctg accttttatc ttccattctt caacggtcag tccgtggtgg ccacccagct 541 gatccccata aatactgccc tgactccggt gatgatggag gggaaggtga ccaacccaat 601 aggcatcccg ttcgcagaga tgtcccaaat agtggggaag caagtgaaca cgccagtggc 661 taagggccaa accctcatgc caaacatggt gaaaacgtac gccgcgggaa agtagttctg 721 agggtgccaa ggagcttctt cccaaaacca aaagaagaaa tgccccctct cacaattaac 781 cctgtttttg tcacaaaccc aagtctgtta actgaacatg tcaaaacctg tggagactgt 841 tgagatttga tgttctgaaa agataaagcc tataaataaa atgttgccca aatttcctgc 901 ctgatgtttt tctttgtcgt tgctacatgg ctttgctgct cggatc // LOCUS HUMCRPX 78 bp ss-mRNA PRI 24-JUL-1990 DEFINITION Human C-reactive protein (CRP) mRNA, 5' end. ACCESSION M35163 KEYWORDS C-reactive protein. SOURCE Human adult liver, cDNA to mRNA, clone pCRP5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 78) AUTHORS Tucci,A., Goldberger,G., Whitehead,A.S., Kay,R.M., Woods,D.E. and Colten,H.R. TITLE Biosynthesis and postsynthetic processing of human C-reactive protein JOURNAL J. Immunol. 131, 2416-2419 (1983) STANDARD simple staff_review FEATURES from to/span description pept 1 > 78 C-reactive protein precursor /hgml_locus_uid="LR0093W" /nomgen="CRP" /map="1q21-q23" sigp 1 54 C-reactive protein signal peptide matp 55 > 78 C-reactive protein BASE COUNT 15 a 17 c 22 g 24 t 0 others ORIGIN 1 atggagaagc tgttgtgttt cttggtcttg accagcctct ctcatgcttt tggccagaca 61 gacatgtcga ggaaggct // LOCUS PT3TAUF 117 bp ds-DNA PHG 24-JUL-1990 DEFINITION Bacteriophage T3 early terminator (T3Te) region. ACCESSION M35171 KEYWORDS early terminator region. SOURCE Bacteriophage T3 DNA. ORGANISM Bacteriophage T3 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 117) AUTHORS Briat,J.-F., Bollag,G., Kearney,C.A., Molineux,I. and Chamberlin,M.J. TITLE Tau factor from Escherichia coli mediates accurate and efficient termination of transcription at the bacteriophage T3 early termination site in vitro JOURNAL J. Mol. Biol. 198, 43-49 (1987) STANDARD simple staff_review FEATURES from to/span description signal 85 106 transcription termination signal BASE COUNT 28 a 37 c 28 g 24 t ORIGIN 1 cggacggctc cctgcgacac cctagcttcg agaagttccg aggaactgag gacaaccctc 61 aagagaaaat gtaaccaact cactggctca ccttcacggg tgggcctttc ttcgttc // LOCUS VIBRHP 1489 bp ds-DNA BCT 24-JUL-1990 DEFINITION V.harveyi Arg-tRNA, two Pro-tRNA and two His-tRNA genes. ACCESSION M29762 M31832 KEYWORDS transfer RNA-Arg; transfer RNA-His; transfer RNA-Pro. SOURCE V.harveyi (strain ATCC 14126) DNA. ORGANISM Vibrio harveyi Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1489) AUTHORS Giroux,S. and Cedergren,R. TITLE Evolution of a tRNA operon in gamma purple bacteria JOURNAL J. Bacteriol. 171, 6446-6454 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.J.Cedergren, 06-NOV-1989. FEATURES from to/span description tRNA 60 136 Arg-tRNA tRNA 171 247 His-tRNA tRNA 324 400 Pro-tRNA tRNA 450 523 His-tRNA tRNA 588 664 Pro-tRNA anticdn 94 96 Arg-tRNA anticodon ccg anticdn 205 207 His-tRNA anticodon gtg anticdn 358 360 Pro-tRNA anticodon tgg anticdn 481 483 His-tRNA anticodon gtg anticdn 617 619 Pro-tRNA anticodon tgg signal 11 16 -35 region signal 34 39 -10 region BASE COUNT 351 a 331 c 360 g 447 t ORIGIN 1 aaaaaagtac tagacggcat agtgtgatat ccgtattatt cctctccgcc gatgggcatg 61 cgcccgtagc tcagttggat agagcgttgg cctccggagc caaaggtcga aggttcgaat 121 cctttcgggc gtgccattcc ggaatacaaa ttaaaggcac gaacaatagt ggtggctata 181 gctcagttgg tagagccctg gattgtgatt ccggtggtcg cgagttcgaa tctcgttagc 241 caccccatta tttcggtaac tcattaaatt agtgaagttt ccagttttga ttcaaatatc 301 ccaagagtcg aaacaaaatt cgtcggtgaa tagcgcagct tggtagcgca tctggtttgg 361 gaccagaggg tcgggggttc gaatccctct tcaccgacca ctatttagtt tatggctata 421 atagagcggt aaacggcaaa attgatggtg gctatagctc agttggtaga gccctggatt 481 gtgattccgg tggtcgcgag ttcgaatctc gttagccacc ccattaattt aggtgacttc 541 ttagttgaag tatcctgtct gatttgagag aatcagcaaa tacagttcgg tgaatagcgc 601 agcttggtag cgcatctggt ttgggaccag agggtcgggg gttcgaatcc ctcttcaccg 661 accactatta gaaagcctgc tttttagcag gctttcgtcg ttttagagct tatgatctcg 721 cttggactag atgagtcggc aatatttcaa tggatttgaa cccatatggg gttcgcctga 781 agttagaata tggctcttca tcgaccaata tttaaagcct cagtcgaaag actagggctt 841 ttttgtattt ctcaccctaa aaaatctgca ttccatttca atatcagtaa acatgtacag 901 ctctggaagc ttttgaccac ctatgatcta aatgcgattc gagctgataa aagaaacccc 961 agcacgttgg ctggggtttt gtgtttcttg aattaagcgc tttactgagc aagcgttttt 1021 aggatttcat tcagctcaac cgtctctcgc acttgatagt tgggtttgat atcggttgta 1081 tttccgtacc tgttgcgttt acccacacgc tcagaatacc catattgttg gcacccgcaa 1141 tgtctgctgg tagagagtcg ccaatatgaa tcacctcttc tggctttgca tccacaggtt 1201 gagtgctttg cggaaaatgt tggctgctgg cttctcttct ggctcttccc caccgacgat 1261 aatgtgatca acccattcgc tcatttgtgt tgctttgagt tttgggtgct gtgagaagat 1321 tggaccgtta gtgatcacca ctaatttgta gtgttgacgt agctcggtca gcatctcttt 1381 tacgccaggg aagaagtcaa aagcccccat gcgagccgag tcaaaatagc tctgcgcctg 1441 ttgagccttt tctgtatcaa tctctatact ttgctctgcc aagatggcc // LOCUS YSCPOX1 2915 bp ds-DNA PLN 24-JUL-1990 DEFINITION Yeast (S.cerevisiae) acyl-coenzyme A oxidase (POX1) gene, complete cds. ACCESSION M27515 KEYWORDS acyl-coenzyme A oxidase. SOURCE Yeast (S. cerevisiae) DNA, clone PAD17. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2915) AUTHORS Dmochowska,A., Dignard,D., Maleszka,R. and Thomas,D.Y. TITLE Structure and transcriptional control of the Saccharomyces cerevisiae POX1 gene encoding acyl-coenzyme A oxidase JOURNAL Gene 88, 247-252 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Dignard, 01-SEP-1989. FEATURES from to/span description pept 455 2701 acyl-coenzyme A oxidase mRNA 364 2843 POX1 mRNA (minor alt.) mRNA 365 2843 POX1 mRNA (minor alt.) mRNA 369 2843 POX1 mRNA (minor alt.) mRNA 386 2843 POX1 mRNA (major alt.) mRNA 400 2843 POX1 mRNA (major alt.) mRNA 406 2843 POX1 mRNA (minor alt.) mRNA 413 2843 POX1 mRNA (minor alt.) BASE COUNT 895 a 532 c 600 g 888 t ORIGIN 1 bp upstream of BglII site; chromosome VII. 1 agatctcgac caaaaaaagg gggtagggta ataaaattaa ccctattatt ttttaacttt 61 aaaacctata atgtgctaat attttattat aaacctcctt tttttgcgtt caaaccctga 121 cacattttaa gccctatatt tacggtatta gttgattaaa ctccgaagcg aaaggaattc 181 ggtcattagc ggctaatagc cgttggggta aatcacctac aagcaagtac acaagagaac 241 gttggcgttg ttaagtcaaa gcactaatac attggggctt taagagtgtt tataaaggtc 301 taacctgtaa aaattattta aacaacttga acaggcctta aagttttcct cattccgctc 361 atcatcacta atattgctct ccgtttttga atacacactt gacactaata agtatcacag 421 aaaaaaagaa aatataataa attagtattg cgatatgacg agacgtacta ctattaatcc 481 cgattcggtg gttctgaatc ctcaaaaatt tatccagaaa gaaagggcgg attcgaaaat 541 caaagttgac caagttaaca catttttaga gtcatccccg gagaggagaa ctctgacgca 601 cgccttaata gaccaaatag tgaatgatcc tatattgaaa actgatacgg actattacga 661 tgctaaaaaa atgcaagaga gagaaattac tgccaaaaaa atagctaggc ttgctagtta 721 tatggagcac gatatcaaaa cagtgcgcaa acactttcgc gacactgacc tgatgaaaga 781 gttgcaagca aatgatccag acaaagcttc gcctttaaca aacaaagacc tttttatatt 841 cgataagaga ttgtcacttg tagcaaatat tgatcctcaa ttgggtacgc gcgtgggtgt 901 acacttgggg ctatttggta attgtatcaa gggcaatggt actgatgagc aaatccggta 961 ttggttgcag gagagaggtg ccactttgat gaaaggtata tatggctgtt ttgcaatgac 1021 tgagttagga catggttcca atgttgccca gctgcagact agggctgtgt acgataagca 1081 aaatgatact tttgtaattg atacacctga tctaactgcc accaaatggt ggattggtgg 1141 ggctgcccat tctgccacgc acgctgccgt gtacgccaga ttgatcgttg aaggtaaaga 1201 ctacggtgta aaaacattcg ttgttcctct gagagaccct tcgactttcc aactgttagc 1261 tggtgtttcc ataggggata ttggagcgaa gatgggtcgt gacggtattg ataatggctg 1321 gatccagttc agaaacgtag ttatccctag agaatttatg ctaagtagat ttaccaaagt 1381 tgtccgttct ccagatggtt cagtcaccgt caaaactgag ccacaattgg atcaaatttc 1441 tggttatagt gcattgttaa gtggtagagt taacatggtc atggattcat ttaggtttgg 1501 ctccaaattt gctactattg ctgtacgtta cgcggttggt cgtcagcaat tcgcacctag 1561 aaagggattg tctgaaacac aattaatcga ctatcccctt caccaatatc gtgttttacc 1621 acaattgtgt gttccatatt tggtgtcacc tgtagctttt aagttaatgg acaactatta 1681 ttccactttg gacgagttat acaacgcttc ctcatctgca tacaaagctg ctctggttac 1741 cgtgagtaaa aagttgaaga atttatttat tgatagcgcc acgttgaaag ccaccaatac 1801 ttggttaatt gctacactga ttgatgagtt gagacagact tgcggaggac atgggtattc 1861 acagtataac ggatttggta aaggctatga cgactgggtg gttcagtgca catgggaggg 1921 tgataataat gttttatctt taacttcagc aaaatcaata ttgaaaaaat ttatcgattc 1981 agccacaaag ggtagatttg acaacacact ggatgtggac tcattctctt acttaaaacc 2041 tcagtacata ggatctgtgg tttctggaga aataaagagt ggtttaaagg agttgggtga 2101 ttatactgaa atttggtcta tcaccttaat caaattactg gcacatattg gtactttagt 2161 tgaaaaatca agaagtattg atagcgtttc taagctttta gtcttagtat ccaaatttca 2221 tgccttgcgc tgcatgttga aaacctatta cgacaagtta aactctcgtg attcacatat 2281 ttccgatgaa attacaaagg aatctatgtg gaatgtttat aagttatttt ccttgtattt 2341 tattgacaag cattccggag aattccaaca attcaagatc ttcactcctg atcagatctc 2401 taaagttgtg cagccacaac tattggctct tttgccaatt gtgaggaaag actgtatagg 2461 tctgacagac tcctttgaat tacctgacgc gatgttaaat tctcctatag gttactttga 2521 tggcgatatc tatcacaatt acttcaatga agtttgccgc aataatccag tggaggcaga 2581 tggggcaggg aagccttctt atcatgcgct gttgagcagc atgctcggta gaggtttcga 2641 atttgaccaa aagttaggtg gtgcagctaa tgcggaaatt ttatcgaaaa taaacaagtg 2701 agtagaggtt tcctgttttc cttcgaaccc tctgttttgc gacttttgtt tcaattcaac 2761 tagtgtcgcc aagttttaac aaaaagttac aaaatcctag tgagaggcca tcttatgtgc 2821 ataacggtac tctctatcta tttacatatc taatactatt cacataacta tgacgaatca 2881 atgacatgac tacatttacc aatgtatagt agtaa // LOCUS YSPUGU1 507 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.pombe U1 small nuclear RNA gene (snu1), complete cds. ACCESSION M29062 KEYWORDS U1 small nuclear RNA; small nuclear RNA. SOURCE S.pombe (sub-species lindner; strain h-972) DNA, clone Bb. ORGANISM Schizosaccharomyces pombe Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 507) AUTHORS Porter,G., Brennwald,P. and Wise,J.A. TITLE U1 small nuclear RNA from Schizosaccharomyces pombe has unique and conserved features and is encoded by an essential single-copy gene JOURNAL Mol. Cell. Biol. 10, 2874-2881 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Porter, 13-OCT-1989. FEATURES from to/span description uRNA 240 387 U1 small nuclear RNA BASE COUNT 148 a 82 c 89 g 188 t ORIGIN 3 bp upstream of HincII site. 1 gttaactatt catggcttga aagcaacgaa atcatttcct attatgaaat tttggagatt 61 tatataatta caaatcgctg aacgatttca ctaatcagca aaatttaacg atttaaatat 121 cgtttttgtt aaataattat actgtacttc tttattgcat gtaagaatgt gactactttc 181 gtgactgctt cgttccgcgt cgttactata aatatgggtg cttacactta cggttcttta 241 cttacctggc atgagtttct gcagcacaag aattgtggag actcagttat ttgtcttggc 301 attgcactga gccctgacga ataactgtgg actggctaag gtcagctccg gatgcatcat 361 ttttgagttc gtccctcatt tggggcaatt ttactaatta aggcgatatg gttgccttaa 421 atagtaaagt tatccgaaat ggattatgta aagtaacatt ttctttcatc atatataatt 481 gtaaattatt gtatcataat taatatt // LOCUS MUSIGHAAZ 149 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma C16-11D11, partial cds. ACCESSION M33399 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C16-11D11, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 149) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 149 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 102 103 D-region end/JH4-region start BASE COUNT 39 a 45 c 35 g 30 t ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacactgccg tctattactg tgctagagct ggtaagggag gctatgctat ggactactgg 121 ggtcaaggaa cctcagcacc gtctcctca // LOCUS MUSIGHABB 136 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma C20-7D6, partial cds. ACCESSION M33401 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C20-7D6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 136) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 135 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 105 106 D-region end/JH4-region start BASE COUNT 36 a 37 c 32 g 26 t 5 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gncatctgag 61 gacactncng tctattactg tgctagaggg nataggtacg acggtnctat ggactactgg 121 ggtcaaggaa cctcag // LOCUS MUSIGHABC 131 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 29-1G5, partial cds. ACCESSION M33402 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 131) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 131 Ig H-chain V-D-JH4-region (AA at 1) recomb 84 85 V-region end/D-region start recomb 87 88 D-region end/JH4-region start BASE COUNT 34 a 40 c 27 g 28 t 2 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacactgcng tctattactg tgctccgtac tatgctatgg actactgggg tcaaggaacc 121 tcagtnaccg t // LOCUS MUSIGHABD 123 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 23-9D10, partial cds. ACCESSION M33403 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-9D10, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 123) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 123 Ig H-chain V-D-JH4-region (AA at 1) recomb 84 85 V-region end/D-region start recomb 87 88 D-region end/JH4-region start BASE COUNT 28 a 37 c 29 g 27 t 2 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacactgcng tctattactg tgctggggct tactggggcc aagggactct ggtnactgtc 121 tct // LOCUS MUSIGHABE 129 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 41-4F5, partial cds. ACCESSION M33404 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 41-4F5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 129) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 129 Ig H-chain V-D-JH4-region (AA at 1) recomb 84 85 V-region end/D-region start recomb 90 91 D-region end/JH4-region start BASE COUNT 34 a 42 c 24 g 25 t 4 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag 61 gacactgcng tctattactg tgctaactgg gactactggg gnnaaggcac cactctcaca 121 gtctcctca // LOCUS MUSIGHABF 132 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 49-1G5, partial cds. ACCESSION M33405 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1G5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 132) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 132 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 90 91 D-region end/JH4-region start BASE COUNT 34 a 44 c 25 g 27 t 2 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacacngcng tctattactg tgctaggtac tactttgact actggggcca aggcaccact 121 ctcacagtct cc // LOCUS MUSIGHABG 141 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 23-10E7, partial cds. ACCESSION M33406 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-10E7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 141 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 99 100 D-region end/JH4-region start BASE COUNT 39 a 43 c 30 g 27 t 2 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct aacatctgag 61 gacactgcng tctattactg tgctagaagg ggnaactacg ctatggacta ctggggtcaa 121 ggaacctcag tcaccgtctc c // LOCUS MUSIGHABH 151 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 44-2F5, partial cds. ACCESSION M33407 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 44-2F5, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 151) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 151 Ig H-chain V-D-JH4-region (AA at 1) recomb 84 85 V-region end/D-region start recomb 105 106 D-region end/JH4-region start BASE COUNT 39 a 48 c 27 g 34 t 3 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag 61 gacactgcng tctattactg tgcnctcatc tactatggta acacctatgc tatggactac 121 tggggtcaag gaacctcagt caccgtctcc t // LOCUS MUSIGHABI 160 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 25-5E10, partial cds. ACCESSION M33408 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 25-5E10, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 160) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 160 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 117 118 D-region end/JH4-region start BASE COUNT 44 a 44 c 36 g 33 t 3 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct nacatctgag 61 gacactgcng tctattactg tgctagaaaa gactattact acggtagtag cgggggagct 121 atggactact ggggtcaagg aacctcagtn accgtctcct // LOCUS MUSIGHABJ 125 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 49-1E7, partial cds. ACCESSION M33409 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1E7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 125) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 125 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 90 91 D-region end/JH4-region start BASE COUNT 32 a 42 c 26 g 24 t 1 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tcagcagcct gacatctgag 61 gacactgcng tctattactg tgctagctgg gactactggg gcaaggcacc actctcacag 121 tctcc // LOCUS MUSIGHABK 91 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 50-2G3, partial cds. ACCESSION M33410 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 50-2G3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 91) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 91 Ig H-chain V-D-JH4-region (AA at 3) recomb 29 30 V-region end/D-region start recomb 47 48 D-region end/JH4-region start BASE COUNT 21 a 18 c 23 g 24 t 5 others ORIGIN Chromosome 12. 1 gagacactnc ngtctattac tgtgctagag gtactacggt agtagtctat gctatggact 61 actggggtna aggaacctna gtnaccgtct c // LOCUS MUSIGHABL 107 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma 50-6D8, partial cds. ACCESSION M33411 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 50-6D8, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 107) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 107 Ig H-chain V-D-JH4-region (AA at 1) recomb 48 49 V-region end/D-region start recomb 57 58 D-region end/JH4-region start BASE COUNT 26 a 28 c 24 g 28 t 1 others ORIGIN Chromosome 12. 1 ctcagcagcc taacatctga ggacactgcn gtctattact gtgctagtag gtacgattac 61 tatgctatgg actactgggg tcaaggaacc tcagtcaccg tctctca // LOCUS MUSIGHADE 404 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain gene V-region, Id+ hybridoma C16-15F6, partial cds. ACCESSION M33390 KEYWORDS immunoglobulin heavy chain; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C16-15F6 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 404) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 20 Ig H-chain V-region, exon 1 (AA at 2) 100 > 404 Ig H-chain V-region, exon 2 IVS 21 99 Ig H-chain V-region intron A BASE COUNT 105 a 100 c 106 g 91 t 2 others ORIGIN Chromosome 12. 1 cctgatggca gtggttacag gtaaggggct ccnaagtccc aaacttgagg gtccataaac 61 tctgtgacag tggcaatcac tttgcctttc tttctacagg ggtcaattca gaggttcagc 121 tgcagcagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg tcctgcacag 181 cttctggctt caacattaaa gacacctata tgcactgggt gaagcagagg cctgaacagg 241 gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat gacccgaagt 301 tccagggnaa ggccactata acagcagaca catcctccaa cacagcctac ctgcagctca 361 gcagcctgac atctgaggac actgccgtct attactgtgc taga // LOCUS MUSIGHADF 141 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma J7-11H3, partial cds. ACCESSION M33400 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma J7-11H3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 141 Ig H-chain V-D-JH4-region (AA at 1) recomb 87 88 V-region end/D-region start recomb 96 97 D-region end/JH4-region start BASE COUNT 36 a 43 c 28 g 29 t 5 others ORIGIN Chromosome 12. 1 ataacagcag acacatcctc caacacagcc tacctgcagc tnagcagcct gacatctgag 61 gacactgcng tctattactg tgctaggagt agctacgact actttgacta ctggggnnna 121 ggcaccactc tcacagtctc c // LOCUS MUSIGHZD 234 bp ss-mRNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-JH4-region, Id+ hybridoma C16-15F6, partial cds. ACCESSION M17008 M33398 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain Balb.b) adult Id+ hybridoma C16-15F6, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 234) AUTHORS Busto,P., Gerstein,R., Dupre,L., Giorgetti,C.A., Selsing,E. and Press,J.L. TITLE Molecular analysis of heavy and light chains used by primary and secondary anti-(T,G)-A--L antibodies produced by normal and Xid mice JOURNAL J. Immunol. 139, 608-618 (1987) STANDARD simple staff_review REFERENCE 2 (bases 82 to 234) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 234 Ig H-chain V-D-JH4-region (AA at 1) recomb 168 169 V-region end/D-region start recomb 186 187 D-region end/JH4-region start BASE COUNT 65 a 61 c 58 g 47 t 3 others ORIGIN Chromosome 12. 1 cagggccnnn agtggattgg aaggattgat cctgcgaatg gtaatactaa atatgacccg 61 aagttccagg gcaaggccac tataacagca gacacatcct ccaacacagc ctacctgcag 121 ctcagcagcc tgacatctga ggacactgcc gtctattact gtgctagaag agggggcaat 181 ggtaactatg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctca // LOCUS MUSIGHZL 291 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 29-1G5, partial cds. ACCESSION M33391 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig H-chain V-region (AA at 1) BASE COUNT 77 a 69 c 70 g 57 t 18 others ORIGIN Chromosome 12. 1 gaggntcagc tncagnagtc tggggnagag ctngtnaagc caggngcctc agtcaagttg 61 tcctgcacag cttctnnnnn naacattaaa gacacctata tgcactgggt naagcagagg 121 cctnaacagg gcctngagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggnaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctgac atctgaggac actgcngtct attactgtgc t // LOCUS MUSIGHZM 291 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 23-9D10, partial cds. ACCESSION M33392 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-9D10 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig H-chain V-region (AA at 1) BASE COUNT 77 a 71 c 75 g 60 t 8 others ORIGIN Chromosome 12. 1 gaggttcagc tgnagnagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctgnntt caacattaaa gacacctata tgcactgggt gaagcagagg 121 cctnaacagg gcctngagtg gattggaagg atngatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctgac atctgaggac actgcngtct attactgtgc t // LOCUS MUSIGHZN 291 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 41-4F5, partial cds. ACCESSION M33393 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 41-4F5 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig H-chain V-region (AA at 1) BASE COUNT 77 a 71 c 76 g 60 t 7 others ORIGIN Chromosome 12. 1 gaggntcagc tgcagnagtc tggggnagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggntt caacattaaa gacacctata tgcactgggt naagcagagg 121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc t // LOCUS MUSIGHZO 294 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V1A-J2-region, Id+ hybridoma 49-1G5, partial cds. ACCESSION M33394 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 49-1G5 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig H-chain V-region (AA at 1) BASE COUNT 78 a 69 c 76 g 59 t 12 others ORIGIN Chromosome 12. 1 gaggttcagc tgcagcagtc tggggcagan cttgtnaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggnnt naacattaaa gacacctata tgcactgggt gaagcagagg 121 cctnaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacnngaagt tcnagngcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctgac atctgaggac acngcngtct attactgtgc tagg // LOCUS MUSIGHZP 294 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 23-10E7, partial cds. ACCESSION M33395 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 23-10E7 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig H-chain V-region (AA at 1) BASE COUNT 80 a 72 c 78 g 60 t 4 others ORIGIN Chromosome 12. 1 gaggttcagc tgcagnagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggnnt caacattaaa gacacctata tgcactgggt gaagcagagg 121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctaac atctgaggac actgcngtct attactgtgc taga // LOCUS MUSIGHZQ 291 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 44-2F5, partial cds. ACCESSION M33396 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 44-2F5 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 291) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 291 Ig H-chain V-region (AA at 1) BASE COUNT 77 a 72 c 76 g 59 t 7 others ORIGIN Chromosome 12. 1 gaggttcagc tgcagcagtc tggggnagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggnnt caacattaaa gacacctata tgcactgggt gaagcagagg 121 cctnaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tccagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc n // LOCUS MUSIGHZT 294 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig germline H-chain gene V-region, Id+ hybridoma 25-5E10, partial cds. ACCESSION M33397 KEYWORDS germline; immunoglobulin heavy chain; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 25-5E10 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 294 Ig H-chain V-region (AA at 1) BASE COUNT 79 a 73 c 78 g 60 t 4 others ORIGIN Chromosome 12. 1 gaggttcagc tgcagcagtc tggggcagag cttgtgaagc caggggcctc agtcaagttg 61 tcctgcacag cttctggcnt caacattaaa gacacctata tgcactgggt gaagcagagg 121 cctgaacagg gcctggagtg gattggaagg attgatcctg cgaatggtaa tactaaatat 181 gacccgaagt tcnagggcaa ggccactata acagcagaca catcctccaa cacagcctac 241 ctgcagctca gcagcctnac atctgaggac actgcngtct attactgtgc taga // LOCUS MUSIGKABO 339 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain gene V1A-J2-region, Id+ hybridoma 29-1G5, partial cds. ACCESSION M33389 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb.b) neonatal Id+ hybridoma 29-1G5 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 339) AUTHORS Borriero,L., Giorgetti,C.A., Smith,G., Landry,D., Selsing,E., Zhukovsky,E. and Press,J.L. TITLE Neonatal and adult primary B cells use the same germ-line V-H and V-kappa genes in their (T,G)-A-L-specific repertoire JOURNAL J. Immunol. 144, 583-592 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 339 Ig kappa-chain V1A-J2-region (AA at 1) recomb 300 301 V1A-region end/J2-region start BASE COUNT 88 a 84 c 83 g 82 t 2 others ORIGIN Chromosome 6. 1 gatgttgtga tgacccaaac tccnctctcc ctgcctgtca gtcttggaga tcaagcctcc 61 atctcttgca gatctagtca gagccttgta cacagtaatg gaaacaccta tttacattgg 121 tacctgcaga agccaggcca gtctccaaag ctcctgatct acaaagtttc caaccgattt 181 tctggggtcc cagacaggtt cagtggcagt ggatcaggga cagatttcac actcaagatc 241 agcagagngg aggctgagga tctgggagtt tatttctgct ctcaaagtac acatgttccg 301 tacacgttcg gaggggggac caagctggaa ataaaacgg // LOCUS M36836 2651 bp - UNA 24-JUL-1990 DEFINITION Figure 1. Nucleotide sequence of the characterized locust mtDNA segment. ACCESSION M36836 REFERENCE 1 (bases 1 to 2651) AUTHORS Haucke,H.-R. and Gellissen,G. TITLE different mitochondrial gene orders amino insects: exchanged trna gene positions in the coii/coiii region between an orthopteran and a dipteran species JOURNAL Curr. Genet. 14, 471-476 (1988) STANDARD unannotated staff_entry BASE COUNT 1034 a 419 c 303 g 895 t ORIGIN 1 cacagatact cagaactacc actaattaat ttatagattc taatatggca gattaatgca 61 ttagatttaa gctctaaaaa taaagttttg accttttatt agaaaaatat ggcaacatga 121 tcaaacttat cattacaaga tggagcttca ccattaatag aacaattatc attctttcat 181 gatcatacaa taattgatct attattaatc acaataattg taggatattc gcttagatat 241 atattattaa caaaatatac aaaccgaaat atacttcatg gtcatttaat tgaaactatc 301 tgaacagcac taccagcaat tacattaatc tttattgcat taccatcatt acgattacta 361 tatttacttg atgattcatc agatgctata attacaatta aaacaattgg acgacaatga 421 tactgaagtt atgaatattc agatttcatt aatgtagaat ttgatacata tataactcct 481 gaaaatgaat taaatacaga tgaattccga ctcttagaag ttgataatcg aacaacatta 541 cctataaata cagaagtacg agtattaact agagcatctg atgtactcca ctcatgagca 601 gtaccagctt tagtattaaa aattgacgca acacccggac gactaaacca aggtatattt 661 ataattaacc gcccaggtct atttttcggt caatgctcag aaatctgtgg agctaatcat 721 agatttatac caattgtaat tgaaagaaca tcaattaaac ttttcattaa atgattatct 781 aacataatat aaggagttag ttaaaatata acattagaat gtcaatctaa aataactaaa 841 aatagtacac cttgaaccat cagatgactg aaagtaagta atggtctctt aaaccaaaaa 901 atagtaaatt aacgtttact tctgatgagg taaattaaaa acttaatccc tcaaatatca 961 ccaataatat gattttcact attcattata ttctcaataa caataatatt atttaatcaa 1021 ttaaacttct tctcttataa accaaataaa attataagaa gtaataataa aattaaaaaa 1081 aaaaacatta attgaatatg ataacaaatc tattctcaac attcgaccca tcaactaatt 1141 tatttaattt atcattaaat tgaactagaa cattccttgg actattatta attccatcaa 1201 tattttgatt aataccttct cgaattaata ttttatgaaa taaaataaat ctaaacttac 1261 ataatgaatt taaaacatta ttaggaaaaa actcatttca aggatcaaca ctaattctaa 1321 tttcaatttt tatcataata ttatttaaca attttatagg attattccca tacattttta 1381 caagaactag tcatataaca ttaacattct caattgcatt accaatatga ataagattta 1441 tattatttgg atgaattaat aatactaatc atatatttac acatcttgtc ccacaaggaa 1501 caccaaatgc acttatatca tttatggttc taattgaaac aattagaaat gttattcggc 1561 cagggaccct agctgtacga ttagctgcaa atatgattgc aggacatctt ttattaacat 1621 tattaggaaa tacaggaccg tcactaacaa caagaattat attatttcta attattggac 1681 aaatattact tttaattctt gaatcagccg tagctataat tcaagcatac gtattctcaa 1741 ttctaagaac attatattca agagaagttt actaaactta tgttaacaaa taacaacaat 1801 cacccatttc atatagtaga ttatagacca tgaccattaa ctggagcaat cggagcaata 1861 attctaacct caggaataac taaatgattt catacattca atataaattt attaataatt 1921 ggaataactg tcattgtatt aacaataatt caatggtgac gagatgtagt tcgagaaggt 1981 acatttcaag gtctacacac taaattagta tcaaaaggac tgcgatgagg aataatttta 2041 tttattgcat cagaagtttt atttttcgcc tcattcttct gagcattctt taatagtaga 2101 ttagcaccaa caattgaact aggaataaaa tgacctccaa taggaattca accatttaat 2161 ccaatccaaa ttccactact taatactgca attttactag cgtcaggagt tacaattaca 2221 tgagcacatc acagaatcat agaatgtaat cattcacaag cattacaagg attattcttt 2281 acagtaatat taggatttta ttttacttta ttacaaatat acgaatattg agaagcacct 2341 tttaccattg cagacgcagt atatggatct acattcttcg tagcaacagg attccatggg 2401 ctacatgtaa tcattggtac aacattctta ttaacatgcc taattcgaca tataataaac 2461 caattctcat ctaatcacca ctttggattt gaagctgcag catgatactg acactttgtt 2521 gatgtagtat gattattttt atatttatca atttactgat gaggtagata attaattttt 2581 ctagtataat tagtacattt gacttccaat caaaagattg aaatatcaag aaaaataatt 2641 ataatattaa t // LOCUS ECOFPP 1950 bp ds-DNA BCT 24-JUL-1990 DEFINITION E.coli dipeptide transport protein (fpp) gene, complete cds. ACCESSION M35045 KEYWORDS fpp gene; periplasmic protein. SOURCE E.coli (strain K12; isolate E1222) DNA, clone lambda-10. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1950) AUTHORS Olson,E.R., Dunyak,D.S., Jurss,L.M. and Poorman,R.A. TITLE Identification and characterization of fpp, an E.coli gene involved in dipeptide transport JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.R.Olson, 04-JUN-1990. Author address: E.R.Olson Molecular Biology Research, 7242-209-7 The UpJohn Company Kalamazoo, MI 49001 FEATURES from to/span description pept 147 1754 dipeptide transport protein (fpp) precursor sigp 147 230 dipeptide transport protein signal peptide matp 231 1751 dipeptide transport protein site 1792 1888 REP sequence BASE COUNT 548 a 499 c 491 g 412 t ORIGIN 79.5 min on the K12 map. 1 ggatccgcac tgttacactg atgttaatta gtacggcatc cccacctcat aacgttgacc 61 cgaccgggca aaaaacaaaa aaggtcaggc agcgacaacc cactgcaaag ggttaaaaca 121 acaaacatca caattggagc agaataatgc gtatttcctt gaaaaagtca gggatgctga 181 agcttggtct cagcctggtg gctatgaccg tcgcagcaag tgttcaggct aaaactctgg 241 tttattgctc agaaggatct ccggaagggt ttaacccgca gctgtttacc tccggcacca 301 cctatgacgc ctcttccgtc ccgctttata accgtctggt tgaatttaaa atcggcacca 361 ccgaagtgat cccgggcctc gctgaaaagt gggaagtcag cgaagacggt aaaacctata 421 ccttccatct gcgtaaaggt gtgaagtggc acgacaataa agaattcaaa ccgacgcgtg 481 aactgaacgc cgatgatgtg gtgttctcgt tcgatcgtca gaaaaacgcg caaaacccgt 541 accataaagt ttctggcggc agctacgaat acttcgaagg catgggcttg ccagagctga 601 tcagtgaagt gaaaaaggtg gacgacaaca ccgttcagtt tgtgctgact cgcccggaag 661 cgccgttcct cgctgacctg gcaatggact tcgcctctat tctgtcaaaa gaatatgctg 721 atgcgatgat gaaagccggt acaccggaaa aactggacct caacccaatc ggaaccggtc 781 cgttccagtt acagcagtat caaaaagatt cccgtatccg ctacaaagcg tttgatggct 841 actggggcac caaaccgcag atcgatacgc tggttttctc tattacccct gacgcttccg 901 tgcgttacgc gaaattgcag aagaatgaat gccaggtgat gccgtacccg aacccggcag 961 atatcgctcg catgaagcag gataaatcca tcaatctgat ggaaatgccg gggctgaacg 1021 tcggttatct ctcgtataac gtgcagaaaa aaccactcga tgacgtgaaa gttcgccagg 1081 ctctgaccta cgcggtgaac aaagacgcga tcatcaaagc ggtttatcag ggcgcgggcg 1141 tatcagcgaa aaacctgatc ccgccaacca tgtggggcta taacgacgac gttcaggact 1201 acacctacga tcctgaaaaa gcgaaagcct tgctgaaaga agcgggtctg gaaaaaggtt 1261 tctccatcga cctgtgggcg atgccggtac aacgtccgta taacccgaac gctcgccgca 1321 tggcggagat gattcaggca gactgggcga aagtcggcgt gcaggccaaa attgtcacct 1381 acgaatgggg tgagtacctc aagcgtgcga aagatggcga gcaccagacg gtaatgatgg 1441 gctggactgg cgataacggg gatccggata acttcttcgc caccctgttc agctgcgccg 1501 cctctgaaca aggctccaac tactcaaaat ggtgctacaa accgtttgaa gatctgattc 1561 aaccggcgcg tgctaccgac gaccacaata aacgcgttga actgtacaaa caagcgcagg 1621 tggtgatgca cgatcaggct ccggcactga tcatcgctca ctccaccgtg tttgaaccgg 1681 tacgtaaaga agttaaaggc tatgtggttg atccattagg caaacatcac ttcgaaaacg 1741 tctctatcga ataattaaaa gccatacaag actgatggca aaggcaaaaa tgcctgatgc 1801 gctccgctta tcaggcctac gaaaattctg caatgtattg aatttgcacg attttgtagg 1861 ccggataagg cgttaacgcg catccggcat aaacaaagcg cactttgtca acaatctgta 1921 tacccggtgg cgctgtgcct ctgcgtgcag // LOCUS ECOSHL 1820 bp ds-DNA BCT 24-JUL-1990 DEFINITION E.coli shl gene, complete cds, and acetolactate synthase II (ilvH) gene, 3' end. ACCESSION M35034 KEYWORDS acetolactate synthase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1820) AUTHORS Leclerc,G., Noel,G. and Drapeau,G. TITLE Molecular cloning, nucleotide sequence and expression of shl JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Drapeau, 01-JUN-1990. FEATURES from to/span description pept < 1 105 acetolactate synthase (ilvH) (AA at 1) pept 117 203 leader protein pept 285 1289 shl protein signal 66 73 -35 region signal 89 95 -10 region BASE COUNT 446 a 433 c 463 g 478 t ORIGIN 1 bp upstream HindIII site. 1 aagcttgatg catttttagc atcgattcgc gatgtggcga aaattgtgga ggttgctcgc 61 tctggtgtgg tcggactttc gcgcggcgat aaaataatgc gttgagaatg atctcaatgc 121 gcaatttaca gcccaacatg tcacgttggg ctttttttgc gaaatcagtg ggaacctgga 181 ataaaagcag ttgccgcagt taattttctg cgcttagatg ttaatgaatt taacccatac 241 cagtacaatg gctatggttt ttacatttta cgcaaggggc aattgtgaaa ctggatgaaa 301 tcgctcggct ggcgggagtg tcgcggacca ctgcaagcta tgttattaac ggcaaagcga 361 agcaataccg tgtgagcgac aaaaccgttg aaaaagtcat ggctgtggtg cgtgagcaca 421 attaccaccc gaacgccgtg gcagctgggc ttcgtgctgg acgcacacgt tctattggtc 481 ttgtgatccc cgatctggag aacaccagct atacccgcat cgctaactat cttgaacgcc 541 aggcgcggca acggggttat caactgctga ttgcctgctc agaagatcag ccagacaacg 601 aaatgcggtg cattgagcac cttttacagc gtcaggttga tgccattatt gtttcgacgt 661 cgttgcctcc tgagcatcct ttttatcaac gctgggctaa cgacccgttc ccgattgtcg 721 cgctggaccg cgccctcgat cgtgaacact tcaccagcgt ggttggtgcc gatcaggatg 781 atgccgaaat gctggcggaa gagttacgta agtttcccgc cgagacggtg ctttatcttg 841 gtgcgctacc ggagctttct gtcagcttcc tgcgtgaaca aggtttccgt actgcctgga 901 aagatgatcc gcgcgaagtg catttcctgt atgccaacag ctatgagcgg gaggcggctg 961 cccagttatt cgaaaaatgg ctggaaacgc atccgatgcc gcaggcgctg ttcacaacgt 1021 cgtttgcgtt gttgcaagga gtgatggatg tcacgctgcg tcgcgacggc aaactgcctt 1081 ctgacctggc aattgccacc tttggcgata acgaactgct cgacttctta cagtgtccgg 1141 tgctggcagt ggctcaacgt caccgcgatg tcgcagagcg tgtgctggag attgtcctgg 1201 caagcctgga cgaaccgcgt aagccaaaac ctggtttaac gcgcattaaa cgtaatctct 1261 atcgccgcgg cgtgctcagc cgtagctaag ccgcgaacaa aaatacgcgc caggtgaatt 1321 tccctctggc gcgtagagta cgggactgga catcaatatg cttaaagtaa ataagactat 1381 tcctgactat tattgataaa tgcttttaaa cccgcccgtt aattaactca ccagctgaaa 1441 ttcacaataa ttaagtgata tcgacagcgc gtttttgcat tattttgtta catgcggcga 1501 tgaattgccg atttaacaaa cacttttctt tgcttttgcg caaacccgct ggcatcaagc 1561 gccacacaga cgtaacaagg actgttaacc ggggaagata tgtcctaaaa tgccgctcgc 1621 gtcgcaaact gacactttat atttgctgtg gaaaatagtg agtcatttta aaacggtgat 1681 gacgatgagg gattttttct tacagctatt cataacgtta atttgcttcg cacgttggac 1741 gtaaaataaa caacgctgat attagccgta aacatcgggt tttttacctc ggtatgcctt 1801 gtgactggct tgacaagctt // LOCUS HUMATCT4A 1742 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human T4 surface glycoprotein CD4 gene, complete cds. ACCESSION M35160 KEYWORDS glycoprotein; immunoglobulin supergene family; recognition antigen; surface antigen. SOURCE Human (haplotype OKT4-negative) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1742) AUTHORS Hodge,T.W., Sasso,D.R., Steve,J. and McDougal,S. TITLE Humans with OKT4-epitope deficiency have a single base change in the CD4 gene resulting in substitutions of TRP-240 for ARG-240 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.W.Hodge, 12-JUN-1990. A cytosine to thymidine transition at nucleotide position 868 results in substitution of TRP-240 for ARG-240. Author address: T.W.Hodge; A-25 Bldg 1 Rm 1226 Centers for Disease Control 1600 Clifton Rd. Atlanta, GA 30333 FEATURES from to/span description pept 76 1452 T4 surface glycoprotein precursor sigp 76 150 T4 surface glycoprotein signal peptide matp 151 1449 T4 surface glycoprotein mature peptide mut 792 792 g in OKT4 epitope deficiency; c in wt BASE COUNT 405 a 491 c 466 g 380 t ORIGIN 1 caagcccaga gccctgccat ttctgtgggc tcaggtccct actgctcagc cccttcctcc 61 ctcggcaagg ccacaatgaa ccggggagtc ccttttaggc acttgcttct ggtgctgcaa 121 ctggcgctcc tcccagcagc cactcaggga aagaaagtgg tgctgggcaa aaaaggggat 181 acagtggaac tgacctgtac agcttcccag aagaagagca tacaattcca ctggaaaaac 241 tccaaccaga taaagattct gggaaatcag ggctccttct taactaaagg tccatccaag 301 ctgaatgatc gcgctgactc aagaagaagc ctttgggacc aaggaaactt ccccctgatc 361 atcaagaatc ttaagataga agactcagat acttacatct gtgaagtgga ggaccagaag 421 gaggaggtgc aattgctagt gttcggattg actgccaact ctgacaccca cctgcttcag 481 gggcagagcc tgaccctgac cttggagagc ccccctggta gtagcccctc agtgcaatgt 541 aggagtccaa ggggtaaaaa catacagggg gggaagaccc tctccgtgtc tcagctggag 601 ctccaggata gtggcacctg gacatgcact gtcttgcaga accagaagaa ggtggagttc 661 aaaatagaca tcgtggtgct agctttccag aaggcctcca gcatagtcta taagaaagag 721 ggggaacagg tggagttctc cttcccactc gcctttacag ttgaaaagct gacgggcagt 781 ggcgagctgt ggtggcaggc ggagagggct tcctcctcca agtcttggat cacctttgac 841 ctgaagaaca aggaagtgtc tgtaaaatgg gttacccagg accctaagct ccagatgggc 901 aagaagctcc cgctccacct caccctgccc caggccttgc ctcagtatgc tggctctgga 961 aacctcaccc tggcccttga agcgaaaaca ggaaagttgc atcaggaagt gaacctggtg 1021 gtgatgagag ccactcagct ccagaaaaat ttgacctgtg aggtgtgggg acccacctcc 1081 cctaagctga tgctgagctt gaaactggag aacaaggagg caaaggtctc gaagcgggag 1141 aaggcggtgt gggtgctgaa ccctgaggcg gggatgtggc agtgtctgct gagtgactcg 1201 ggacaggtcc tgctggaatc caacatcaag gttctgccca catggtccac cccggtgcag 1261 ccaatggccc tgattgtgct ggggggcgtc gccggcctcc tgcttttcat tgggctaggc 1321 atcttcttct gtgtcaggtg ccggcaccga aggcgccaag cagagcggat gtctcagatc 1381 aagagactcc tcagtgagaa gaagacctgc cagtgccctc accggtttca gaagacatgt 1441 agccccattt gaggcacgag gccaggcaga tcccacttgc agcctcccca ggtgtctgcc 1501 ccgcgtttcc tgcctgcgga ccagatgaat gtagcagatc ccacgctctg gcctcctgtt 1561 cgtcctccct acaatttgcc attgtttctc ctgggttagg ccccggcttc actggttgag 1621 tgttgctctc tagtttccag aggcttaatc acaccgtcct ccacgccatt tccttttcct 1681 tcaagcctag cccttctctc attatttctc tctgaccctc tccccactgc tcatttggat 1741 cc // LOCUS HUMFUSED 555 bp ds-DNA SYN 24-JUL-1990 DEFINITION Human interleukin-1beta/growth hormone fusion protein gene, complete cds. ACCESSION M35049 KEYWORDS growth hormone; interleukin-1-beta. SOURCE Human synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 555) AUTHORS Pecceu,R., Dousset,P., Shire,D., Cavrois,E., Marchese,E., Ferrara,P., Kaghad,M., Dumont,X. and Lupker,J. TITLE Human interleukin-1-beta fused to the human growth hormone signal peptide is N-glycosylated and secreted by Chinese hamster ovary cells JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Pecceu, 08-JUN-1990. Author address: F.Pecceu SANOFI ELF BIO-RECHERCHES LABEGE INNOPOLE B.P. 137 31328 LABEGE CEDEX FRANCE FEATURES from to/span description pept 11 550 human growth hormone/interleukin--1beta fusion protein sigp 11 88 human growth hormone signal peptide matp 89 547 interleukin-1-beta BASE COUNT 154 a 142 c 135 g 124 t ORIGIN 1 agcttccgcc atggctacag gctcccggac gtccctgctc ctggcttttg gcctgctctg 61 cctgccctgg cttcaagagg gcagtgccgc acctgtaaga tctctgaact gcacgctccg 121 ggactcacag caaaagagct tggtgatgtc tggtccatat gaactgaaag ctctccacct 181 gcagggacag gatatggagc aacaagtggt gttctccatg tcctttgtcc aaggagaaga 241 aagtaatgac aaaatacctg tggcgttggg cctcaaggaa aagaatctgt acttaagctg 301 cgtgttgaaa gatgataagc ccactctaca gctggagagt gtagacccca aaaattaccc 361 aaagaagaag atggaaaagc gatttgtctt caacaagata gaaatcaata acaagctgga 421 atttgagtct gcccagttcc ccaactggta catcagcacc tctcaagcag aaaacatgcc 481 cgtcttcctg ggtggtacca aaggcggcca ggatataact gacttcacca tgcaatttgt 541 gagctcctag gatcc // LOCUS HUMGLYA 1200 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human Miltenberger V-like glycophorin gene, exons 3 and 4. ACCESSION J05563 KEYWORDS glycophorin A; transmembrane protein. SOURCE Human leukocyte DNA, clone Mi51. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1200) AUTHORS Kudo,S., Chagnovich,D., Rearden,A., Mattei,M.-G. and Fukuda,M. TITLE Molecular analysis of a hybrid gene encoding human glycophorin variant Miltenberger V-like molecule JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Kudo and M.Fukuda, 04-JUN-1990. FEATURES from to/span description pept / 81 176 Miltenberger V-like glycophorin, exon 3 (AA at 81) 1110 / 1148 Miltenberger V-like glycophorin, exon 4 IVS < 1 80 Miltenberger V-like glycophorin intron B IVS 177 1109 Miltenberger V-like glycophorin intron C site 1079 1116 crossing-over site BASE COUNT 387 a 236 c 212 g 365 t ORIGIN 1 agatttgtct ttcatgatac gctctatgtc cacgcagtca cctcattctt gacccctttc 61 tcaacttctc ttatatgcag atacgcacaa acgggacaca tatgcagcca ctcctagagc 121 tcatgaagtt tcagaaattt ctgttagaac tgtttaccct ccagaagagg aaaccggtat 181 gttcttagtt ttaaatagtt gctctggagt cattgttgtg attgaactct gtttacacga 241 gctgtaactc atgacagttc tcaagctttc gtgacagaaa acccatctct tttactccaa 301 agcccatata gcacccacaa ctattaactg tgaccaagaa agagaaggca agccccaatt 361 aacctttgta cgtaaagcct aaagaatgaa aaaatatacc tgaatcctca atcatcaaac 421 agcatagtat atactaagta atttgtaata attaaactct agaaaattgt gtggcttcgg 481 tagtaagaga gcttcatgac ataaaatggc aagtggagac agagacaaaa gtaggatgtg 541 gactgagagg gaaggttagc acaggtggaa cagtaaggca accatactat caattgctac 601 tgacatagaa tccagagaga ctattggcaa aagctcaaat gagacacagt aacagtttag 661 attcagacag tggctgtggc ataaatcaga aaattgatag ccgcatgatc cctctttgca 721 tgggactggc atctgtgtgg agtaatggtt ccatatgcct cctttcttct ccttattttt 781 aaatttttta aaaatgcatt gcttcttgtg gaagtcaata agtgattctt ccaatacttt 841 ctcattcctt ccccctcagt tatgagacaa tttgcttatt tctcatccat gaatacttgt 901 tgggtcatta aaagtagata ctgaaattac taatggtacg actgacatat tacctcataa 961 atgttactag ctagatgttg aaagttgacc aacaactctc aaaatatgat taagaaaagg 1021 aaacccacag aacagtttga ttccaaaatg atttttttct ttgcacatgc cttacttatt 1081 tggacttaca ttgaaatttt gctttatagg agaaatggga caacttgtcc atcgtttcac 1141 tgtaccaggt atgttaatat ttgacaaaga ataaaagtca ttccatttta aactatccat // LOCUS HUMMHDRB 1172 bp ds-DNA PRI 24-JUL-1990 DEFINITION Human MHC class II HLA-DR-beta DR2 gene, complete cds. ACCESSION M35159 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (black with haplotype DR2) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1172) AUTHORS Demopulos,J.T., Hodge,T.W., Wooten,V. and Acton,R.T. TITLE A novel DRB1 allele in DR2-positive American blacks JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.W.Hodge, 12-JUN-1990. A thymidine to cytosine transition at nucleotide position 88 results in an amino acid change from tyrosine to histidine at residue 30. Author address: T.W.Hodge; A-25 Bldg 1 Rm 1226 Centers for Disease Control 1600 Clifton Rd. Atlanta, GA 30333 FEATURES from to/span description pept 6 806 MHC HLA-DR-beta chain precursor /hgml_locus_uid="LV0063D" /nomgen="HLA-DRB1" /map="6p21.3" sigp 6 92 MHC HLA-DR-beta chain signal peptide matp 93 803 MHC-HLA-DR-beta chain BASE COUNT 272 a 315 c 324 g 261 t ORIGIN 1 ccagcatggt gtgtctgaag ctccctggag gttcctacat ggcaaagctg acagtgacac 61 tgatggtgct gagctcccca ctggcttcgg ctggggacac ccgaccacgt ttcttgcagc 121 aggataagta tgagtgtcat ttcttcaacg ggacggagcg ggtgcggttc ctgcacagag 181 acatctataa ccaagaggag gacttgcgct tcgacagcga cgtgggggag taccgggcgg 241 tgacggagct ggggcggcct gacgctgagt actggaacag ccagaaggac ttcctggaag 301 acaggcgcgc cgcggtggac acctactgca gacacaacta cggggttggt gagagcttca 361 cagtgcagcg gcgagttgag cctaaggtga ctgtgtatcc tgcaaggacc cagaccctgc 421 agcaccacaa cctcctggtc tgctctgtga atggtttcta tccaggcagc attgaagtca 481 ggtggttccg gaacagccag gaagagaagg ctggggtggt gtccacaggc ctgattcaga 541 atggagactg gaccttccag accctggtga tgctggaaac agttcctcga agtggagagg 601 tttacacctg ccaagtggag cacccaagcg tgacgagccc tctcacagtg gaatggagag 661 cacagtctga atctgcacag agcaagatgc tgagtggagt cgggggcttt gtgctgggcc 721 tgctcttcct tggggccggg ctattcatct acttcaagaa tcagaaaggg cactctggac 781 ttcacccaac aggactcgtg agctgaagtg cagatgacca cattcaaggg ggaaccttct 841 gccccagctt tgcatgatga aaagctttcc tgcttggctc ttattcttcc acaagagagg 901 actttctcag gccctggttg ctaccggttc agcaactctg cagaaaatgt ccatccttgt 961 ggcttcctca gctcctgccc cttggcctga agtcccagca ttgatggcag tgcctcatct 1021 tcaactttag tgctcccctt tacctaaccc tacggcctcc catgcatctg tactccccct 1081 gtgtgccaca aatgcactac gttattaaat ttttctgaag cccagagtta aaaatcatct 1141 gtccacctgg ctccaaagac aaaaaataaa aa // LOCUS MUM3TER 55 bp ss-RNA VRL 24-JUL-1990 DEFINITION Mumps virus genome 3' terminus mRNA. ACCESSION M35284 KEYWORDS . SOURCE Mumps virus (strain SBL-1), cDNA to mRNA, passed in Vero cells. ORGANISM Mumps virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Paramyxoviridae; Paramyxovirus. REFERENCE 1 (bases 1 to 55) AUTHORS Elango,N., Varsanyi,T.M., Koevamees,J. and Norrby,E. TITLE Molecular cloning and characterization of six genes, determination of gene order and intergenic sequences and leader sequence of mumps virus JOURNAL J. Gen. Virol. 69, 2893-2900 (1988) STANDARD simple staff_review BASE COUNT 9 a 17 c 4 g 24 t 1 others ORIGIN 1 nggttcccct ctttcttcta ccctataacc atcttgttta tcacattctt tgtca // LOCUS MUSIGCR 2210 bp ds-DNA ROD 24-JUL-1990 DEFINITION Murine Ig germline membrane-bound gamma 2a-chain gene C region, exon 2. ACCESSION M35032 KEYWORDS constant region; gamma-immunoglobulin; germline; immunoglobulin heavy-chain. SOURCE Murine (strain BALB/c) myeloma cell line RPC 5 DNA, clone CH.M.IgG2a-11. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2210) AUTHORS Hall,B. and Milcarek,C. TITLE Sequence and polyadenylation site determination of the murine immunoglobulin gamma 2a membrane 3' UT region JOURNAL Mol. Immunol. 26, 819-826 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.L.Hall, 08-JUN-1990. FEATURES from to/span description pept / 37 120 IgG-2a chain, exon 2 pre-msg < 1 1682 IgG-2a chain mRNA (alt.) pre-msg < 1 1759 IgG-2a chain mRNA (alt.) IVS < 1 36 IgG-2a chain intron A (no splice consensus) signal 1506 1511 major Poly-A signal signal 1585 1590 minor Poly-A signal rpt 1270 1305 (GA)17 repeats rpt 1395 1440 (CT)23 repeats site 1536 1543 McLauchlan consensus sequence site 1552 1559 McLauchlan consensus sequence site 1625 1633 McLauchlan consensus sequence site 1536 1540 Berget consensus sequence BASE COUNT 596 a 543 c 557 g 514 t ORIGIN 1 aagagaagcg ccaagagcac agcgccgact cctaccgtaa agtggatctt ctcctctgtg 61 gtggagctga agcagacgat ctcccctgac tacagaaaca tgattgggca gggagcctag 121 gccacttcct ctgggatcag aagagcttcc taggccctgc agaagcccat ccatcctact 181 gtgcagccta acagggaggc cacactctag ccctatgact ctctgatcag aactcccatg 241 gtctcctctt tggaggacca cgtgcagtgc aggctttgcc cagacctaaa cacttccaca 301 gcagtcgcca gatatctaac tactccggac cagaagaacc atctccttcc aaaccagcac 361 tagggatctg agatctcaga atgtttgcct aagaagagct ggaaatccag gcttcctgtg 421 ttctgctaca aggacatcag cctggatttg acctggacca cacattttca tctaaatgag 481 ttttccacaa aggacacgtt tcagatcctt gaatgagacc tctacatgga agaccagagt 541 cactataccc aaaggtcact ctgtatcctt gcaccagcta tactggacag cttccttcct 601 ggtacttcag tgaccctggc tgaggaaagg atctgtgacc tcaactgttt ggagagcctc 661 tggaagatgt agtcttctct tcctgctacc accaacatgc tggatctcag atgcagaatc 721 caatccacag acaccactga ccacacaacc tgaagacaag gccattgcca cctccacaga 781 gatgccatcc acactctgtg gagaaataag gagtgctttg tgcagcctct gcaaagctct 841 ggcagggatt agagtataca cactgagtac tgactaggtg accaggcaga aaaacctcca 901 ggagaaggaa caatggggga gagatgtgaa cagatagtta gaaaaagcat ggtgtcacag 961 gtctgctctg tggactgatt tccagattgg accacctaca gcagaaacca tcggttgcag 1021 tggcaatcta ggaggaccaa cctggaatag gagggctgct gtggtcaatg gagagtagac 1081 ctgtatctat ttctccactg cctcttatga ccaataagaa gccagagtct ccagacagaa 1141 agaaagaaag aaagaaagaa agaaagaaag aaagaaagag agagagagag agagagagag 1201 agagaggaag gaaggaagga aggaaggaag gaaggaagga aggaggagga ggaggaggag 1261 gaggaggagg agagagagag agagagagag agagagagag agagagcacc agcttttctg 1321 tgactggaag gaaatgctta gagagcttgg atctttaaag cttctttttt ctagagacca 1381 tgaatgtctt tgttctctct ctctctctct ctctctctct ctctctctct ctctctctct 1441 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gcgtgcatgc acgctattgt tttggcattt 1501 gaaacaataa aacattcttt taatattctg tatctcatgg ttccccttct gtgtggatca 1561 gccctaacac ccaggaacag ggacaataaa cagaccacag ccatgtacag ccttctacct 1621 cccttctggt tctgacctcc cagaggtccc tcagtgggcc cctcacagct gggtttcttc 1681 cctggcagtg ccaccaagag ctcaggcacc tctgagctgg aggctgtcct gatgccatag 1741 gcaggctatg gagcagagat gatgaccacg gtgactcagg tgagccaggc aaagcctccc 1801 atgccagaag agaagcgtgt ggtactcact ggcctcgggc tgctacggat tcagcaaaga 1861 gcatggatcc agctacctcc atagtggatg cttattgtct caagctgtgt tcagagaaat 1921 ttcagatcag gtattaatag ggtgggttct ccttacctgg taggtgcctt ctcccatggc 1981 acctcacagg ctctccatct gtgtgtctgg gtcctgatct cttctcataa gtacacagac 2041 aggctggaat ggtacaacct ggcctcatta taacttacca gttatgatcc tatctgcaag 2101 gcaggctcag tctgaggtct ggcggcttaa atgacaattg gtgtgtcttc tagaacagaa 2161 tttagtccac attagcgcgc ctgtggacca cagtatatct ctctggcctc // LOCUS YSCARF2 1012 bp ds-DNA PLN 24-JUL-1990 DEFINITION S.cerevisiae ADP-ribosylation factor 2 (ARF2) gene, complete cds. ACCESSION M35158 KEYWORDS ADP-ribosylation factor 2. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1012) AUTHORS Stearns,T., Hoyt,M.A., Botstein,D. and Kahn,R.A. TITLE ADP-ribosylation factor is an essential protein in yeast and is encoded by two genes JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.A.Kahn, 05-JUN-1990. Author address: R.A.Kahn Lab. of Biol. Chem., DCT Bldg. 37; Room 5D-02 National Cancer Institute Bethesda, MD 20892 FEATURES from to/span description pept 266 811 ADP-ribosylation factor 2 (ARF2) binding 24 30 consensus GTP-binding site binding 67 71 consensus GTP-binding site binding 126 129 consensus GTP-binding site BASE COUNT 301 a 180 c 222 g 309 t ORIGIN 1 gcctgcaggt acgccccttt atttgatcag gaagccgtat tgattatcta atagggccta 61 gttatcctaa ttgtggggag tcgagcagta cggctctgat gtttttcgaa cgaagataag 121 gagttgacat acaaagtcaa cagaagttct tcttgttagc gtctctgtgc tcaatatctc 181 tctttttttc tttaagtagt aattactaac atcagccaac caatagagat aaaaaaaaaa 241 ggaattaaga tttcatagag aaaagatggg tctatacgct tctaagttat tcagcaatct 301 ttttggcaac aaagagatgc gtatacttat ggttggtcta gatggtgccg gtaagaccac 361 cgttttgtac aagttgaagt tgggcgaagt tatcactacc attccaacca ttggtttcaa 421 cgttgagact gtccaatata agaacatttc cttcactgtc tgggacgtcg gtggacaaga 481 caggattaga tctttatgga gacactacta cagaaacacc gaaggtgtta tttttgtcat 541 cgattccaac gatagatcgc gtattggtga agccagagaa gtcatgcaga gaatgctgaa 601 tgaagatgaa ttgagaaatg ctgtctggtt agtcttcgct aacaaacaag atttgccaga 661 agccatgtct gctgctgaaa tcaccgaaaa attaggttta cattctatta gaaaccgtcc 721 atggtttatc cagtctactt gtgcaacctc gggtgaaggt ctgtacgaag gtctggagtg 781 gttaagcaac aacttgaaga atcaatccta atctaaatct gtatagaacg tttagtcatg 841 cgcaccttgt gtgttttgtt tctagattgt tttattttta tgattgttga agatataaac 901 cactgtatag ttgtataaga taggataatg atggtgcact gaaaataaac ttactagctc 961 tttaatattg caacggcttg taacgggcga ctctagagga tccccgggta cc // LOCUS CODCPRRKA 94 bp ss-RNA RNA 24-JUL-1990 DEFINITION Codium fragile chloroplast 4.5S RNA, complete cds. ACCESSION M35276 M15192 KEYWORDS 4.5S RNA. SOURCE C.fragile chloroplast RNA. ORGANISM Chloroplast Codium fragile Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Brypsidales; Codiaceae; Codium fragile. REFERENCE 1 (bases 1 to 94) AUTHORS Francis,M.A., Balint,R.F. and Dudock,B.S. TITLE A novel variety of 4.5 S RNA from Codium fragile chloroplasts JOURNAL J. Biol. Chem. 262, 1848-1854 (1987) STANDARD simple staff_review FEATURES from to/span description RNA 1 94 4.5 S RNA BASE COUNT 35 a 11 c 14 g 34 t ORIGIN 1 aagtcctagt tgctataaat tcttaaatca aattatgtca gatttttaat aaaaagcagc 61 atttgtattt gaaaattgtt taggaactag gcac // LOCUS HAMAPBRBD 2339 bp ds-DNA ROD 24-JUL-1990 DEFINITION Hamster apolipoprotein (apoB) gene, partial cds (LDL receptor-binding domain). ACCESSION M35187 KEYWORDS apolipoprotein B. SOURCE Hamster DNA. ORGANISM Mesocricetus auratus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2339) AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N. TITLE Sequence of the putative low-density lipoprotein receptor-binding regions of apolipoprotein B in mouse and hamster JOURNAL Gene 87, 309-310 (1990) STANDARD simple staff_review COMMENT Phone call to T.J.Smith on 26-JUN-1990 made sequence clarifications on line 4 and line 17 of printed sequence. The hamster sequence should be 1 bp to the left on these lines. FEATURES from to/span description pept < 1 > 2339 apolipoprotein (apoB) (LDL receptor-binding domain) (AA at 1) BASE COUNT 725 a 519 c 450 g 645 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccagc ttcctcgcct ctcacacaca attgagatac ctgcttttgg cagacttcat 61 ggaatcctga aaatccagtc tcccctcttt atattagatg caaatgccaa catacagaat 121 gtaactactt tagagaacaa agcagagatt gtggcctcca tcgctgctac aggagagtcc 181 gaaattgaag ctctcaattt tgattttcaa gcacaagctc aattcttgga gctaaaccct 241 aatcctctga tcctgaagga atccatgaac ttctccagca agcatgcgag aatggagcat 301 gagggtgaga tactattttc tggaaagttc attgagggaa aattggacac ggtcgcaagt 361 ttacagacag agaaaaatat ggtggagttt aataatggta tgattgtcaa gataaacaat 421 ccaatcatcc ttgacagtca cacaaagtat tttcacaagt tgagtatccc caggctggac 481 ttctccagta aggcttcctt taacaatgaa atcaagatgc tattagaagc tggacatgta 541 gcatggactt cttcagggac tgggtcatgg aattgggcct gtcccaactt ctcagatgag 601 ggcacacatt cgtccaaaat tagcttcact gtagaaggac ccattgcttt ttttggcttg 661 tctaataaca tcaacggcaa acacctgagg gttatccaga aattggctta tgaatctggc 721 ttcctcaact attccatgtt ggaagttgag tcaaaagttg aatctcagca tgtgggttcc 781 agcattctaa ctggcaaggg aacggtactg ctcagggagg caaaggcaga aatgactggc 841 gagcacaatg ctgacttgaa tggaaaagtt attgggactt tgaaaaactc tctttccttt 901 tcagcacaac catttatgat tactgcatcc acaaataatg atgggaattt gaaagttagt 961 tttccactaa agttgactgg gaaaatagac ttcctgaata actatgcact atttttgagt 1021 cctcatgccc agcaagcaag ctggcaagtg agtgctaggt tcaatcagta caaatataat 1081 caaaattttt ctgctataaa caatgaacat aacatagaag cccatgtagg aatgaatgga 1141 gatgccaacc tggatttctt aaccatacct ctaacaattc ctgaagtgaa actaccttac 1201 atagggctca cgactccctt gctgaaggat ttctccatat gggaagaaac aggcttgaaa 1261 gaatttttga agacaacaaa gcaatcgttt gatttaagtg taaaagctca atataaaaag 1321 aacagagaca ggcattccat tgcgattcct ctgaatgggt tttatgagtt tattctcaac 1381 aatgtcgact ccgggatagg gaagattggg aaagtcagag acagcgcatt agactatctt 1441 atttcatcct ataatgaagc aaaaaacaag tttgaaaatt cccttattca gccctccagg 1501 acctttcaaa agcgtggata cactatccca tttgtcaaca ttgaagtgac tccattcact 1561 gtagagacac tggcctccag ccatgtgatc ccaaaagcaa taaatacccc cagtgttcac 1621 attctgggcc ctaatgtcat tgtgccttca tacaggttag tgctgccctc cctggagctg 1681 ccagtccttc gtgtccccag gaatctactc aagttttccc tcccagattt caaggaattg 1741 agaacaattg acaatattta tattccagct cttggcaatt ttacctatga tttttccttt 1801 aaatcaagtg tcatcacgct gaataccaac gttggacttt ataaccggtc agacatcgtt 1861 gctcatttcc tttcttcctc ttcatttgtc acggatgccc tgcagtacaa attagagggt 1921 acttcacgtc tgactcggaa aagaggattg aagctagcca cagccgactc tctcactaac 1981 aaatttgtaa agggcaatca tgatagcacc tttagcttaa ccaagaaaaa catggaagca 2041 tcagtgaaaa caactgcaaa cctccatgct cccattttaa caatgaactt caagcaggaa 2101 cttaatggaa atgccaagtc aaagcccatt gtctcatcat ccattgaact aaactatgac 2161 ttcaattcct caaagctgta ctctactgct aaaggaggtg ttgaccacaa gtttagctta 2221 gaaagtctca cttcctactt ttccattgag tcatccacca aaggaaatat caagggatct 2281 gtcctttccc aggaatattc aggaagtgtt gccagtgagg ccaacacata cctgaattc // LOCUS MUSAPBRBD 2354 bp ds-DNA ROD 24-JUL-1990 DEFINITION Mouse apolipoprotein (apoB) gene, partial cds (LDL receptor-binding domain). ACCESSION M35186 KEYWORDS apolipoprotein B. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2354) AUTHORS Smith,T.J., Hautamaa,D. and Maeda,N. TITLE Sequence of the putative low-density lipoprotein receptor-binding regions of apolipoprotein B in mouse and hamster JOURNAL Gene 87, 309-310 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 2354 apolipoprotein (apoB) (LDL receptor-binding domain) (AA at 1) BASE COUNT 757 a 525 c 423 g 649 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccaac ttcctcacct ctcacataca attgaaatac ctgcttttgg caaactgcat 61 agcatcctta agatccaatc tcctctcttt atattagatg ctaatgccaa catacagaat 121 gtaacaactt cagggaacaa agcagagatt gtggcttctg tcactgctaa aggagagtcc 181 caatttgaag ctctcaattt tgattttcaa gcacaagctc aattcctgga gttaaatcct 241 catcctccag tcctgaagga atccatgaac ttctccagta agcatgtgag aatggagcat 301 gagggtgaga tagtatttga tggaaaggcc attgagggga aatcagacac agtcgcaagt 361 ttacacacag agaaaaatga agtagagttt aataatggta tgactgtcaa agtaaacaat 421 cagctcaccc ttgacagtca cacaaagtac ttccacaagt tgagtgttcc taggctggac 481 ttctccagta aggcttctct taataatgaa atcaagacac tattagaagc tggacatgtg 541 gcattgacat cttcagggac agggtcatgg aactgggcct gtcccaactt ctcggatgaa 601 ggcatacatt cgtcccaaat tagctttact gtggatggtc ccattgcttt tgttggacta 661 tccaataaca taaatggcaa acacttacgg gtcatccaaa aactgactta tgaatctggc 721 ttcctcaact attctaagtt tgaagttgag tcaaaagttg aatctcagca cgtgggctcc 781 agcattctaa cagccaatgg tcgggcactg ctcaaggacg caaaggcaga aatgactggt 841 gagcacaatg ccaacttaaa tggaaaagtt attggaactt tgaaaaattc tctcttcttt 901 tcagcacaac catttgagat tactgcatcc acaaataatg aaggaaattt gaaagtgggt 961 tttccactaa agctgactgg gaaaatagac ttcctgaata actatgcatt gtttctgagt 1021 ccccgtgccc aacaagcaag ctggcaagcg agtaccagat tcaatcagta caaatacaat 1081 caaaactttt ctgctataaa caatgaacac aacatagaag ccagtatagg aatgaatgga 1141 gatgccaacc tggatttctt aaacatacct ttaacaattc ctgaaattaa cttgccttac 1201 acggagttca aaactccctt actgaaggat ttctccatat gggaagaaac aggcttgaaa 1261 gaatttttga agacaacaaa gcaatcattt gatttgagtg taaaggctca atataaaaag 1321 aacagtgaca agcattccat tgttgtccct ctgggtatgt tttatgaatt tattctcaac 1381 aatgtcaatt cgtgggacag aaaatttgag aaagtcagaa acaatgcttt acattttctt 1441 accacctcct ataatgaagc aaaaattaag gttgataagt acaaaactga aaattccctt 1501 aatcagccct ctgggacctt tcaaaatcat ggctacacta tcccagttgt caacattgaa 1561 gtatctccat ttgctgtaga gacactggct tccaggcatg tgatccccac agcaataagc 1621 accccaagtg tcacaatccc tggtcctaac atcatggtgc cttcatacaa gttagtgctg 1681 ccacccctgg agttgccagt tttccatggt cctgggaatc tattcaagtt tttcctccca 1741 gatttcaagg gattcaacac tattgacaat atttatattc cagccatggg caactttacc 1801 tatgactttt cttttaaatc aagtgtcatc acactgaata ccaatgctgg actttataac 1861 caatcagata tcgttgccca tttcctttct tcctcttcat ttgtcactga cgccctgcag 1921 tacaaattag agggaacatc acgtctgatg cgaaaaaggg gattgaaact agccacagct 1981 gtctctctaa ctaacaaatt tgtaaagggc agtcatgaca gcaccattag tttaaccaag 2041 aaaaacatgg aagcatcagt gagaacaact gccaacctcc atgctcccat attctcaatg 2101 aacttcaagc aggaacttaa tggaaatacc aagtcaaaac ccactgtttc atcatccatt 2161 gaactaaact atgacttcaa ttcctcaaag ctgcactcta ctgcaacagg aggcattgat 2221 cacaagttca gcttagaaag tctcacttcc tacttttcca ttgagtcatt caccaaagga 2281 aatatcaaga gttccttcct ttctcaggaa tattcaggaa gtgttgccaa tgaagccaat 2341 gtatatctga attc // LOCUS DDINDK 518 bp ss-mRNA INV 24-JUL-1990 DEFINITION D.discoideum nucleoside diphosphate kinase Gip17 mRNA, complete cds. ACCESSION J05457 KEYWORDS nucleoside diphosphate kinase. SOURCE D.discoideum (strain AX3) 3 hour starvation, cDNA to mRNA, clones G[821,951]. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 518) AUTHORS Lacombe,M.L., Wallet,V., Troll,H. and Veron,M. TITLE Functional cloning of a nucleoside diphosphate kinase from Dictyostelium discoideum JOURNAL J. Biol. Chem. 265, 10012-10018 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.L.Lacombe, 16-MAY-1990, for release after publication. FEATURES from to/span description pept 19 486 nucleoside diphosphate kinase Gip17 (EC 2.7.4.6) mRNA < 1 518 Gip17 mRNA signal 485 490 poly-A signal BASE COUNT 164 a 97 c 97 g 160 t ORIGIN 1 taacctaata agattaaaat gtccacaaat aaagtaaaca aagaaagaac tttccttgct 61 gttaaaccag acggtgttgc tcgtggttta gttggtgaaa tcatcgccag atacgaaaag 121 aaaggtttcg ttttagttgg tttaaaacaa ttagttccaa ccaaagactt agctgaatct 181 cactatgctg aacacaaaga aagaccattc ttcggtggtt tagtctcatt cattacctct 241 ggtccagtcg ttgctatggt cttcgaaggt aaaggtgttg ttgcctctgc ccgtttaatg 301 atcggtgtta ccaacccatt agcctcagcc ccaggttcaa ttcgtggtga tttcggtgtt 361 gatgttggta gaaacatcat ccacggttct gattcagttg aatctgccaa cagagaaatt 421 gctttatggt tcaaaccaga agaattatta actgaagtta aaccaaaccc aaatttatac 481 gaataaataa tttagctaaa ttctaaatca tttaatgt // LOCUS DDINDKA 184 bp ss-mRNA INV 24-JUL-1990 DEFINITION D.discoideum nucleoside diphosphate kinase Gip17 mRNA, 5' end. ACCESSION M36679 KEYWORDS nucleoside diphosphate kinase. SOURCE D.discoideum (strain AX3) 3 hour starvation, cDNA to mRNA, clone G821. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 184) AUTHORS Lacombe,M.L., Wallet,V., Troll,H. and Veron,M. TITLE Functional cloning of a nucleoside diphosphate kinase from Dictyostelium discoideum JOURNAL J. Biol. Chem. 265, 10012-10018 (1990) STANDARD full staff_review FEATURES from to/span description pept 137 > 184 nucleoside diphosphate kinase Gip17 (EC 2.7.4.6) BASE COUNT 77 a 21 c 14 g 72 t ORIGIN 1 atttttgatt ttctttattc ttatttgtaa ttgtatataa tagtaaaaaa aaaaaattct 61 cagatttata atttattatt atttttcaaa tttcttcaat tctttgtatt aaatattact 121 caaacataaa ataaaaatgg cctcaaccaa aagaagatta aacagagaag aatcaatgtc 181 caca //
GenBank-Updates@genbank.bio.net (07/25/90)
LOCUS DOGRAB2A 656 bp ss-mRNA MAM 25-JUL-1990 DEFINITION C.familiaris GTP-binding protein (rab2) mRNA, complete cds. ACCESSION M35521 KEYWORDS GTP-binding protein. SOURCE C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to mRNA, clone II. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 656) AUTHORS Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M. TITLE Localization of low-molecular weight GTP binding proteins to exocytic and endocytic compartments JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Chavrier, 22-JUN-1990. Base-pairs 508 to 564 form a synthetic peptide used to raise antibodies. FEATURES from to/span description pept 7 645 GTP-binding protein (rab2) BASE COUNT 209 a 124 c 158 g 165 t ORIGIN 1 gcggccatgg cgtacgctta tctcttcaag tacatcatca tcggcgacac aggtgttggt 61 aaatcatgct tattgctaca gtttacagac aagaggtttc agccagtgca tgacctgact 121 atcggtgtag agtttggtgc tcgaatgata actattgatg ggaaacagat aaaacttcag 181 atatgggata cggcagggca agagtccttt cgttccatca caaggtcata ttacagaggt 241 gcagcagggg ctttactagt gtatgatatt acaaggagag atacattcaa ccacttgaca 301 acctggttag aagatgcccg ccagcattcc aattccaaca tggtcattat gcttattgga 361 aataaaagtg atttagaatc aagaagagaa gtaaaaaaag aagaaggtga agcttttgca 421 cgagaacatg gacttatctt catggaaact tctgctaaga ctgcttccaa tgtagaagag 481 gcatttatta atacagcaaa agaaatttat gagaaaatcc aagaaggagt ctttgacatt 541 aataatgagg caaacggcat taaaattggc cctcagcacg ctgctactaa tgccacacac 601 gcgggcaatc agggaggaca gcaggccggg ggaggctgct gttgagtccg tttttt // LOCUS DOGRAB5A 796 bp ss-mRNA MAM 25-JUL-1990 DEFINITION C.familiaris GTP-binding protein (rab5) mRNA, complete cds. ACCESSION M35520 KEYWORDS GTP-binding protein. SOURCE C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to mRNA, clone II. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 796) AUTHORS Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M. TITLE Localization of low-molecular weight GTP binding proteins to exocytic and endocytic compartments JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Chavrier, 22-JUN-1990. Base-pairs 664 to 711 form a synthetic peptide used to raise antibodies. FEATURES from to/span description pept 121 768 GTP-binding protein (rab5) BASE COUNT 267 a 163 c 174 g 192 t ORIGIN 1 ccgcggctcc tcgtgctgcg gcctcaggtt tctgtatatc cagaaagaaa aaatttgaca 61 ccttgcatcc tggaagttca tttaagagac tgaaattagg gacttctttc aaatttggac 121 atggctaatc gaggagcaac aagacccaac gggccaaata ctggaaataa aatatgccag 181 ttcaaactag tacttctggg agagtctgct gttggcaaat caagcctagt gcttcgtttt 241 gtgaagggcc aatttcatga atttcaagag agtaccatag gggctgcttt tctaacccaa 301 actgtgtgtc ttgatgatac aacagtaaag tttgaaatat gggatacagc tggtcaagaa 361 cgataccata gcttagcacc aatgtactac agaggagcac aagcagccat agttgtatat 421 gatatcacaa atgaggagtc ctttgccaga gccaaaaact gggttaaaga acttcagagg 481 caagccagtc ctaacattgt aatagcttta tcaggaaaca aggctgatct tgcaaataaa 541 agagctgtcg atttccagga agcacagtcc tatgcagatg acaacagttt attattcatg 601 gagacatcag ctaaaacatc gatgaacgta aatgaaatat tcatggcaat agctaaaaag 661 ttgccaaaga acgaaccaca gaatccagga gcaaattctg ccagaggaag aggagtagac 721 cttactgaac ccacgcagcc aaccaggagt cagtgttgta gtaactaaac ctccagtttg 781 aacttcctgg aatatc // LOCUS DOGRAB7A 811 bp ss-mRNA MAM 25-JUL-1990 DEFINITION C.familiaris GTP-binding protein (rab7) mRNA, complete cds. ACCESSION M35522 KEYWORDS GTP-binding protein. SOURCE C.familiaris (strain Madin-Darby; Cocker spaniel) kidney, cDNA to mRNA, clone II. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 811) AUTHORS Chavrier,P., Parton,R.G., Hauri,H.P., Simons,K. and Zerial,M. TITLE Localization of low-molecular weight GTP binding proteins to exocytic and endocytic compartments JOURNAL Cell (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Chavrier, 22-JUN-1990. Base-pairs 542 to 592 form a synthetic peptide used to raise antibodies. FEATURES from to/span description pept 20 643 GTP-binding protein (rab7) BASE COUNT 251 a 206 c 194 g 160 t ORIGIN 1 gagcggctgc gtttgaagga tgacctctag gaagaaagtg ttgctgaagg ttatcatcct 61 gggagattct ggagttggta agacatcact catgaaccag tatgtgaaca agaaattcag 121 taatcagtac aaagctacaa taggagcaga ctttctgaca aaggaggtga tggtggatga 181 cagactagtt acaatgcaga tctgggacac agcaggccag gaacggttcc agtcccttgg 241 tgtggccttc tacagaggtg cagactgctg cgttctggta tttgacgtta ctgcccccaa 301 cacattcaaa accctcgata gctggagaga tgagtttctc atccaggcca gtccccggga 361 tcctgaaaac ttccctttcg ttgtgttggg aaacaagatt gacctcgaaa acagacaagt 421 ggccacaaag cgggcacagg cctggtgcta cagcaaaaac aacattccct acttcgagac 481 cagtgccaag gaggccatca atgtggagca ggcgttccag acgattgcaa ggaatgcact 541 taaacaggaa acagaggtgg agctgtacaa tgaattccct gaacccatca aactggacaa 601 gaacgaccgg gccaagacct cagcggaaag ctgcagttgc tgaaggggca gtgagagcag 661 agcacagagt ccttcacaaa caaagaacac acttaggcct tccaacacga gcccccttct 721 tctcttccaa acaaaacata aagtcatctc tcgaatccag ctgccaaaag accctaccaa 781 acacttcacc ctgacacaca catacacaca c // LOCUS HUMU7AA 649 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human U7 small nuclear RNA pseudogene, fragment 32sm. ACCESSION M35537 KEYWORDS U7 small nuclear RNA; pseudogene. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 649) AUTHORS Soldati,D. and Schimperli,D. TITLE Structures of four human pseudogenes for U7 small nuclear RNA JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schimperli, 22-JUN-1990. FEATURES from to/span description uRNA.ps 293 352 pseudo-U7 uRNA BASE COUNT 217 a 107 c 115 g 199 t 11 others ORIGIN 1 attatggcag agtacatgta acatatagtt tgctattcna actgattttt gacaaagata 61 caacagcana tcaatggagg aacaatagcn tttttaacaa atggtgttgg cacaactgga 121 caactgtaag nnaaagaaaa tgaanttcaa tctanatctc anaccgtatt aaaaaaaact 181 caaagtgggc cacagactta gatataaaat gtaaaactat aacactttta gaaaanatat 241 aggagaanat ctatgggatt tagggcaaaa gcatgattca aaaaaggaaa gtcagtgtta 301 cagccctttt agaatttgtc tagcaggttt tctggttttc cagaaaacct ccacataaaa 361 aggaaaaaga aaaaaaggaa aaagtaataa attagtatga attgagcatt ttaatgattc 421 tattttattg cctttgttgg cttattaaat ataactctct gttttgttat tttagtggtt 481 gctttaggtt ttatagtaat acatctttaa cctgttacag tccaccttct ttttgtttgt 541 ttgttttgga agcagggtct cactctgtca ccaaggctag agtgcagtgg cactatcacg 601 gctcactgca acctcaacct cccaggctcc agngttcctc ctgctgcag // LOCUS HUMU7AB 521 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human U7 small nuclear RNA pseudogene, fragment 32BG. ACCESSION M35538 KEYWORDS U7 small nuclear RNA; pseudogene. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 521) AUTHORS Soldati,D. and Schimperli,D. TITLE Structures of four human pseudogenes for U7 small nuclear RNA JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schimperli, 22-JUN-1990. FEATURES from to/span description uRNA.ps 295 344 pseudo-U7 uRNA BASE COUNT 186 a 102 c 89 g 144 t ORIGIN 1 tttcttcttt ttccacctct tgtctattca ggccctcagt gaattggatc atgctcaccc 61 acatcagggc aggcaatcta cttattgagt tcactgattc aaatgataac ctcacctgga 121 aaaatcctca cagacccaga aataatgttt aatctaagca cccatggcca gtcaagttga 181 gacataaaat tagccatcac agtacaggca tacctgggaa atgacgcagg ttcagttcca 241 gaccatcaca ataaagcaaa tattgcaata aagtgagtca caaaaagaaa aagtcagtgt 301 tacagctttt agaatttgtc tagcaggttt tctggaaaac cttcacaaaa aaaggagaaa 361 gagtgcatat aaaatgctta tgttgatacc atactgtagt ctattaagtg tgcaatagca 421 ttatgtctat aaaacaatgt acatacttta aaaatatttt attgttaaaa catgctatca 481 cagagacaca aagtgagcac atgctgttgg aaaaatggta c // LOCUS HUMU7AC 513 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human U7 small nuclear RNA pseudogene. ACCESSION M35539 KEYWORDS U7 small nuclear RNA; pseudogene. SOURCE Human liver DNA, clone 25H. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 513) AUTHORS Soldati,D. and Schimperli,D. TITLE Structures of four human pseudogenes for U7 small nuclear RNA JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schimperli, 22-JUN-1990. FEATURES from to/span description uRNA.ps 204 264 pseudo-U7 uRNA BASE COUNT 127 a 83 c 81 g 222 t ORIGIN 1 aattgtctgt ctttcatatt tttgtcattc tcgtgagtgt gaagtggtat ctcattgtgg 61 ttttgatttg catttcccta atgactaatg gtgttgaata tcttttcata tgcttataag 121 ccatttatat gtctttggag aaattctttt caaatctctt gctcatttta aaattaggtt 181 gtcattttat tacggagttg cattagtgtt acagctcttt tagaatttgt ctagcaggtt 241 ttctgatttt tacccggaac ccctccccag ccaaaagtaa aagaaaaaaa aagctgcaat 301 agttctttat atagtttaga tacaaggccc ttatcagata tttgattttc aaatattgtc 361 tcccattctg tgagttgttt tttcactctc ttgatggtgt catatgaagc acaaattttt 421 ttttttattt tgataatgtc ccatttatct atgtattttt tcttttcatt tgtgcttttg 481 gtgtcgtacc taagaaactg ctgcttaact caa // LOCUS HUMU7AD 418 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human U7 small nuclear RNA pseudogene, fragment 36h. ACCESSION M35540 KEYWORDS U7 small nuclear RNA; pseudogene. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 418) AUTHORS Soldati,D. and Schimperli,D. TITLE Structures of four human pseudogenes for U7 small nuclear RNA JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.Schimperli, 22-JUN-1990. FEATURES from to/span description uRNA.ps 229 286 pseudo-U7 uRNA BASE COUNT 128 a 58 c 106 g 122 t 4 others ORIGIN 1 agaggcacat gtcaagatga agctctggtg aagaattgat caaaaatagt ggcggagtga 61 gatggagatt taaatccaag ggctgattta tgaaggcttc aaagattttt tttttttaaa 121 gaaagaacat agattagttg tttctgaggg ctggagggga cagagataga ggcggcgacg 181 gaaggatcct tcaggtttct tcttgaggtg attaaacgtt ctgaaatcgc gtgttacagc 241 tcttttggaa tttgtctagc aggttttctg gttttcactg caaaacccca cagtnnnaaa 301 acagaaaaaa aaawttatcc taaaattggg ctgtggtaat ggttgcgcat atgctgtgaa 361 taggcttcca aatattgaaa tgtccacttc aaacgagtga actgtatggt atgtgaat // LOCUS SCMPMYA1 3156 bp ss-mRNA INV 25-JUL-1990 DEFINITION S.mansoni paramyosin mRNA, complete cds. ACCESSION M35499 KEYWORDS paramyosin. SEGMENT 1 of 2 SOURCE S.mansoni (strain Puerto Rican) adult worm, cDNA to mRNA, clones Pmy[1,8,11,15]. ORGANISM Schistosoma mansoni Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda; Digenea; Strigeidida; Strigeata; Schistosomatoidea; Schistosomatidae. REFERENCE 1 (bases 1 to 3156) AUTHORS Laclette,J.P., Landa,A., Arcos,L., Willms,K., Davis,A.E. and Shoemaker,C.B. TITLE Paramyosin is the Schistosoma mansoni (trematoda) homologue of antigen B from Taenia solium (cestoda) JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.P.Laclette, 22-JUN-1990. Author address: J.P.Laclette Department of Tropical Public Health Harvard School of Public Health 665 Huntington Avenue Boston, MA 02115 Email: zehm%hscvax%harvunxwxw.edu FEATURES from to/span description pept 47 2647 paramyosin mRNA < 1 > 3156 paramyosin mRNA BASE COUNT 1279 a 435 c 516 g 923 t 3 others ORIGIN 1 tctttcacta atattaaaaa gaaaaattta aaaaaaaaga ggaaaaatga tgaatcatga 61 tacagaatct catgtgaaaa tatcaagaac tatttatcga ggagtatcac caagtacaac 121 aagacttgag agtcgagtac gggaattaga agatcttttg gatttagaac gtgatgcaag 181 agttcgagct gaacgacatg ctgctgattt aggttttcaa gtggatgcat tatcagaacg 241 tttagatgaa gctggaggtt ctacaacaca aactcaagaa ttattaaaac gtcgtgaaat 301 ggaaatcaat aaactacgta aagatttaga aaatgctaat gcatcacttg aactagctga 361 aacatcaatg agacgtcgac atcaaacagc attgaatgaa ttagctttgg aagttgaaaa 421 tttacaaaaa caaaaaggaa aggctgaaaa agacaaaagt catttgatta tggaagtgga 481 taatgttcta ggacaattag atggtgcatt aaaagctaag caatcagctg aatcaaaatt 541 agaaggatta gatagtcaat taaatcgttt aaaatcatta accgacgatt tacaaagaca 601 attaactgaa ttaaataatg ctaaatcaag attaacatca gaaaattttg aattattaca 661 tataaatcaa gattatgaag cacaaatatt aaattattct aaagctaaat catcacttga 721 aagtcaagta gatgatttaa aaagatcatt agatgatgaa gctaaaaatc gttttaatct 781 tcaagctcaa cttacatcac ttcaaatgga ttatgataat ttacaagcta aatatgatga 841 agaaagtgaa gaagctagta atttacgtag tcaagtatct aaatttaacg ctgatattgc 901 tgcattaaaa tcgaaatttg aacgtgaact tatgagtaaa acagaagaat tcgaagaaat 961 gaagaggaaa ttcactatga gaattaccga acttgaagat actgctgaaa gagaacgatt 1021 aaaagcggta tcattagaaa aacttaaaac aaaattaaca ttagaaatta aagatttaca 1081 atctgaaata gaaagtcttt cattagaaaa tagtgaatta attcgtcgtg ctaaagctgc 1141 tgaatcatta gcttctgatt tacaacgtcg tgttgatgaa ttaacaattg aagtgaatac 1201 attaacatca caaaatagtc aattagaaag tgaaaatcta cgtttaaaaa gtttagttaa 1261 tgatttaacg gataaaaata atttattaga acgtgaaaat cgtcaaatga atgatcaagt 1321 caaagaatta aaaagttcac ttcgtgatgc taatcgtcgt cttactgatt tagaagcatt 1381 aagatcgcaa ttagaggctg aaagagataa tcttgcatca gctttacatg atgctgaaga 1441 agcattacat gatatggatc aaaagtatca agcatcacaa gctgcattaa atcatttgaa 1501 atctgaaatg gaacaaaggc ttagagaaag agatgaagaa ttagaaagtt taagaaaaag 1561 tactactaga acaattgaag aattaactgt tacaataact gaaatggaag ttaaatataa 1621 atcagaatta tcacgtttaa aaaaacgtta tgaatcaaat attgctgatt tagaaattca 1681 acttgataca gctaataaag ctaatgcaaa tcttatgaaa gagaataaaa atttatcaca 1741 acgtgttaaa gatttagaaa catttttaga tgaagaacgt cgtcttcgtg aagcagctga 1801 aaataattta caaattactg aacataaacg tttacaatta gcaaatgaaa ttgaagaaat 1861 acgtagtaca ttagaaaatt tagaacgttt acgtaaacat gctgaaacag aacttgaaga 1921 agctcaatca cgtgttagtg aattaactat tcaagttaat acattaacta atgataaacg 1981 tcgtcttgaa ggtgatattg gtgtaatgca ggctgatatg gatgatgcta ttaatgctaa 2041 acaagcttct gaagatcgag caattagatt aaataatgaa gtattacgtt tagctgatga 2101 attacgtcaa gaacaaggaa attataaaca tgctgaagca ttaagaaaac aattagaaat 2161 tgaaatacgt gaaattacag ttaaattaga agaagctgaa gcatctgcta cacgtgaagg 2221 tcgtcgtatg gtacaaaaat tacaggctcg tgtacgtgaa cttgaatcag aattcgatgg 2281 tgaatcaaga agatgtaaag atgcattagc tcaagcacgt aaatttgaac gtcaatataa 2341 agaattacaa acacaagctg aagatgatcg tcgtatggta ttagaacttc aagatttatt 2401 agataaaact caaatgaaaa tgaaagccta taaacgtcaa ttggaagaaa tggaagaagt 2461 atctcaaatt acaatgaata aatatcgtaa agcccaacaa caaattgaag aagctgaaca 2521 tcgtgcagat atggctgaac gtacagtcac tgtacgtcgt gttggtccag gtggacgtgc 2581 tgtttctgta gcacgtgaat tatctgtcac atcaaataga ggaatgagag caacaagtat 2641 gatgtaaagc acttaaataa taataataat agtgatacta tacacatata caaacgccta 2701 tatctttctt tctctctttg tttcgttttc ctcatcttcg ctttttttta gtcatgatat 2761 tcatctaaat gaggaaatta tcaataatga cctattatta ttcaatgtgc tttactttac 2821 ttcccaccct aaatctcctc ggtatatcgt ttcccttttt ttttcttttt ttttctaaaa 2881 acaaaaaatt ctaaaagtga aagacgaaaa aaaaaaannn cagaaatttg tttcctcctc 2941 tcatattttc tctttgttct ttttattcat ttcatttatt gtattattaa tattgctatt 3001 attattattg ttattactac ctaaccgatg gtttcaacga cagcaatctc ccatatttct 3061 acacacacac acacacaaca cacacaacac acaaaagtat ctgtgcaatc gtaatagata 3121 atctttattt attgattaaa aaaaaaaaaa aaaaaa // LOCUS SCMPMYA2 217 bp ss-mRNA INV 25-JUL-1990 DEFINITION S.mansoni paramyosin mRNA, 3' flank. ACCESSION M36871 KEYWORDS paramyosin. SEGMENT 2 of 2 SOURCE S.mansoni (strain Puerto Rican) adult worm, cDNA to mRNA, clones Pmy[1,8,11,15]. ORGANISM Schistosoma mansoni Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Trematoda; Digenea; Strigeidida; Strigeata; Schistosomatoidea; Schistosomatidae. REFERENCE 1 (bases 1 to 217) AUTHORS Laclette,J.P., Landa,A., Arcos,L., Willms,K., Davis,A.E. and Shoemaker,C.B. TITLE Paramyosin is the Schistosoma mansoni (trematoda) homologue of antigen B from Taenia solium (cestoda) JOURNAL Unpublished (1990) Harvard 665 Huntington Avenue, Boston, MA 02115 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.P.Laclette, 22-JUN-1990. Author address: J.P.Laclette Department of Tropical Public Health Harvard School of Public Health 665 Huntington Avenue Boston, MA 02115 Email: zehm%hscvax%harvunxwxw.edu FEATURES from to/span description mRNA < 1 217 paramyosin mRNA BASE COUNT 66 a 46 c 17 g 88 t ORIGIN About 1 kb after segment 1. 1 cagaaatttg tttcctcctc tcatattttc tctttgttct ttttattcat ttcatttatt 61 gtattattaa tattgctatt attattattg ttattactac ctaaccgatg gtttcaacga 121 cagcaatctc ccatatttct acacacacac acacacaaca cacacaacac acaaaagtat 181 ctgtgcaatc gtaatagata atctttattt attgatt // LOCUS ECAPNL 420 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.carotovora pectin lyase (PNL) gene, 5' end. ACCESSION M35271 KEYWORDS pectin lyase. SOURCE E.carotovora DNA, clone pTN2159. ORGANISM Erwinia carotovora Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 420) AUTHORS Nishida,T., Suzuki,T., Ito,K., Kamio,Y. and Izaki,K. TITLE Cloning and expression of pectin lyase gene from Erwinia carotovora in Escherichia coli JOURNAL Biochem. Biophys. Res. Commun. 168, 801-808 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 284 > 420 pectin lyase (EC 4.2.2.10) BASE COUNT 127 a 66 c 94 g 133 t ORIGIN 1 cctatcagtc tgatgaagtt gaacaggctg cgaaccgtat ttttaatggc ggcgggtaaa 61 aggctggtga tgataatcgt agcgctgcca ttttactaaa agatggcggc gtattaattg 121 ggtattgaat tattcgcaag gttgtttttt tattaaactc gattaataag cgtaatgaaa 181 tcctttctat acaattttta attgtcggag gcgtattatt tagtctcaat taaataatac 241 gctggaagac attattattc actcattgta aaaaggaaaa cttatggctt atccaacaac 301 aaatcttact gggcttattg gttttgcaaa agcggcaaaa gttaccggag gaacgggcgg 361 taaagtcgtt acggtaaatt ctttggccga ttttaaatca gcggtgacgg ttccgcaaaa // LOCUS ECOUXEX 318 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli uxaCT-exuT intercistronic region. ACCESSION M35280 KEYWORDS catabolite receptor protein. SOURCE E.coli (strain K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 318) AUTHORS Blanco,C. and Mata-Gilsinger,M. TITLE Identification of cyclic AMP-CRP binding sites in the intercistronic regulatory uxaCA-exuT region of Escherichia coli JOURNAL FEMS Microbiol. Lett. 33, 205-209 (1986) STANDARD simple staff_entry FEATURES from to/span description site 46 71 catabolite receptor protein binding site 1 site 165 193 catabolite receptor protein binding site 2 BASE COUNT 98 a 62 c 70 g 88 t ORIGIN 1 gtcgacttat gatttgcgac ggcagaaaga taacttgtca tacaacttta aaaggtgaga 61 gccatcacaa atgtgggaat atttgtaggg acattacctg acgacagcaa ggccagtact 121 ggcgcggcct gcagcgagat ttaccacttt gagagtaatt tttttaacta cgtttattga 181 tctaactcac gaaaatatct tcggactctg gaaattggtg tgataacttt gtcagcatcg 241 caccataagc aagctagctc actcgttcga agaggaagac gaaaataact ccgtttatga 301 ctgaagatta tcctgtta // LOCUS HUMSYNIFA 144 bp ds-DNA SYN 25-JUL-1990 DEFINITION Human synthetic interferon alpha-2 gene, 3' end. ACCESSION M35281 KEYWORDS interferon. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 144) AUTHORS Rossi,J.J., Kierzek,R., Huang,T., Walker,P.A. and Itakura,K. TITLE An alternate method for synthesis of double-stranded DNA segments JOURNAL J. Biol. Chem. 257, 9226-9229 (1982) STANDARD simple staff_entry FEATURES from to/span description pept < 13 135 interferon alpha-2 BASE COUNT 37 a 32 c 32 g 43 t ORIGIN 1 caagaattca tgatcactct gtacctgaag gaaaagaaat actctccgtg tgcttgggaa 61 gttgtacgtg ctgaaatcat gcgttctttc tccctgtcta ctaaccttca ggagtctctg 121 cgttctaaag aatagctgca gtgg // LOCUS RATMAL5 1104 bp ds-DNA ROD 25-JUL-1990 DEFINITION Rat malic enzyme (ME) gene, 5' end. ACCESSION M35258 M21619 KEYWORDS malic enzyme. SOURCE Rat (Sprague-Dawley, female) liver, clone lambda-g-ME-29. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1104) AUTHORS Morioka,H., Tennyson,G.E. and Nikodem,V.M. TITLE Structural and functional analysis of the rat malic enzyme gene promoter JOURNAL Mol. Cell. Biol. 8, 3542-3545 (1988) STANDARD simple staff_review REFERENCE 2 (bases 427 to 925; revises [1]) AUTHORS Petty,K.J., Desvergne,B., Mitsuhashi,T. and Nikodem,V.M. TITLE Identification of a thyroid hormone response element in the malic enzyme gene JOURNAL J. Biol. Chem. 265, 7395-7400 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 918 > 1104 malic enzyme (EC 1.1.1.40) mRNA 883 > 1104 malic enzyme mRNA rpt 814 823 direct repeat rpt 827 836 direct repeat BASE COUNT 220 a 376 c 290 g 218 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgcat agcccagaag ctatagctgt actgatgggc tcaagtaaaa taattagaaa 61 ttatttctca ggtatctagg caatatttaa cccccaaatt gttccgcagt gtctagatga 121 acaccataga atttggccgt gcgacttaac tgaaaagaaa gggctttgtt gtctgaaggc 181 tgcttggctg tattgttttg ttttaatcag acatccttgg gagacatagg atttatttct 241 ccagtccttg gatcttcaag tataaatatc aataatacaa ccactgggtt tcagtactgg 301 aagacctgtt attctgaccc tctgtcatca gagaagaaac catacatcat cttgcaaaaa 361 ttaacatctt ggtttccaga acgctcagga aaattgttct taagctcaat aggactggcc 421 actggacctg tgccctctaa cacctttttc ttaccacgtt cgaacacaat tccctcagat 481 actattcaga aacaggcgag gagtcgcccg ccctatcgcc cagtgccatc gaggcctggg 541 cattctgggt caaagttgat cccctcctgc atcaggcccc tggggcatgg ctggcatcca 601 ggacgttggg gttaggggag gacagtggac gagcggagga agcgaggcgg cccgcccctc 661 acccgtcggt gcccaggtcg cacgctcggc gctcaccagc ttggccggcg ccccgccccc 721 gcctcctcgc acggcggctc ggccgatgcc gccgtgactc agcgcttctc gcgggccgcc 781 cgcgcggccg cggctaggcc gggctcctcc cgcctcgcca ccccctctcg ccacccacgc 841 ccgcccccgg ccgcggggcc ttccgtcgca cggccgccgc ccgccgcact cccgtccgcc 901 ccgccacggt gctggccatg gatccccgag ccccccgccg ccgacacacc caccagcgcg 961 gctacctgct gacgcgggac ccgcatctca acaaggtgag ccccgccccg agagccgccc 1021 tgggcccgcc gctgggctcg ggcacccgcg tcccaccgag gggacggtcc cacccgggag 1081 gccactgcgg agccggcgcc aacg // LOCUS RATSPA 1595 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Rat serine pyruvate aminotransferase mRNA, complete cds. ACCESSION M35270 X06357 KEYWORDS serine pyruvate aminotransferase. SOURCE Rat (strain Wistar) liver, clones pRspt910,321]. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 63 to 1595) AUTHORS Oda,T., Miyajima,H., Suzuki,Y. and Ichiyama,A. TITLE Nucleotide sequence of the cDNA encoding the precursor for mitochondrial serine:pyruvate aminotransferase of rat liver JOURNAL Eur. J. Biochem. 168, 537-542 (1987) STANDARD simple automatic REFERENCE 2 (bases 1 to 198) AUTHORS Oda,T., Funai,T. and Ichiyama,A. TITLE Generation from a single gene of two mRNAs that encode the mitochondrial and peroxisomal serine:pyruvate aminotransferase of rat liver JOURNAL J. Biol. Chem. 265, 7513-7519 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 109 1353 peroxisomal serine:pyruvate aminotransferase precursor (EC 2.6.1.51; pSPT) sigp 109 118 serine:pyruvate aminotransferase signal peptide matp 119 1350 serine:pyruvate aminotransferase pept 175 1353 mitochondrial serine:pyruvate aminotransferase (mSPT) mRNA 61 > 1520 pSPT mRNA (alt.) mRNA 62 > 1520 pSPT mRNA (alt.) mRNA 127 > 1520 mSPT mRNA (alt.) mRNA 129 > 1520 mSPT mRNA (alt.) mRNA 130 > 1520 mSPT mRNA (alt.) signal 1515 1520 polyA signal BASE COUNT 376 a 437 c 455 g 327 t ORIGIN 1 aggacaaaca tcgatcaggg tcaaattgac aataaaaggg ctggagcaag caacagggac 61 tcaccaacca ggcctcgcct ctgagttcag cccagagcta gctgggaaat gttccggatg 121 ttggccaagg ccagtgtgac gctgggctcc agggcagcaa gttgggtacg gaacatgggc 181 tcgcaccagc tgctggtgcc acccccagag gccctgagca agcccctgtc aattcctaag 241 aggctcctgt tgggtccggg accctccaac ctggctcctc gtgtgctagc agctggaagt 301 ctgaggatga ttggccacat gcaaaaagag atgtttcaga tcatggatga gatcaagcag 361 ggcatccagt atgtgttcca gaccaggaac cccctcacac tggttgtcag cggctcagga 421 cattgtgcca tggagactgc cctgttcaac ctcctggagc ctggggactc ctttcttgtg 481 ggaaccaatg gcatctgggg gatacgggct gcagagatcg ctgagcggat tggagcccgt 541 gtgcaccaga tgatcaagaa gcctggagaa cattacacac tgcaggaggt ggaggagggc 601 ctggctcagc ataaaccagt gttgctgttc ctgacccacg gggagtcatc cactggtgtg 661 ctgcagcccc tggatggttt cggggagctc tgccacaggt atcagtgcct actcctggtg 721 gactcggtgg catcattggg cggagtccct atctacatgg accaacaagg catcgacatc 781 ttgtactctg gctctcagaa ggtcctgaat gccccaccag ggatctccct catctccttc 841 aacgacaagg ccaaatccaa agtctactcc cggaagacaa agccagtctc cttctacaca 901 gacatcactt atttgtccaa gttgtggggc tgtgagggca agaccagagt aattcatcat 961 acgttgcctg tcatcagctt atactgcctg agggagagcc tagcactcat ttcagagcag 1021 ggcctggaga attcctggcg gcgtcacagg gaggctacag cacatctgca caagtgcctg 1081 cgggagttgg gcttaaagtt ctttgtgaag gacccggaaa tccggctacc tacaatcacc 1141 accgtgaccg tgcctgccgg ctacaactgg agggacatcg tcagctacgt gctggaccac 1201 ttcaacattg aaatctctgg tggtcttggg ccctctgagg ataaggtgct gcggattggc 1261 ctcctgggct acaacgccac cacagagaat gcggaccgtg tagcggaggc cctgagggag 1321 gccctgcaac attgtcctaa gaataaattg tgagcatcgt ctcaccagac tgtgccctcc 1381 tggaggggct gggaatatag caggaacgag aagactgtgc aagccctcca gccagcaaag 1441 gctgccgatg taaccaggcg ggaagggtca gggcgaagct gcccctctcc ccacagatgg 1501 agccctgtgg tcacatgatg ctaatcacct tccgatgaag ctgcattctg caggccactg 1561 gacttcggga atattcaata aagtacttgc cagac // LOCUS YSCCOX9A 180 bp ds-DNA PLN 25-JUL-1990 DEFINITION S.cerevisiae cytochrome c oxidase subunit VIIa (COX9) gene, complete cds. ACCESSION M35260 KEYWORDS cytochrome c oxidase. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Duhl,D.M., Powell,T. and Poyton,R.O. TITLE Mitochondrial import of cytochrome c oxidase subunit VIIa in Saccharomyces cerevisiae: Identification of sequences required for mitochondrial localization in vivo JOURNAL J. Biol. Chem. 265, 7273-7277 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 180 cytochrome c oxidase subunit VIIa BASE COUNT 53 a 35 c 51 g 41 t ORIGIN 1 atgactattg ctccaattac tggtacgatc aagagaagag tcatcatgga catcgtcctc 61 gggttctccc tcgggggtgt catggcctct tactggtggt ggggattcca catggataag 121 attaacaaga gagagaagtt ctacgcagag ctagctgaga ggaaaaagca agagaactga // LOCUS DROTNCOPIA 276 bp ds-DNA INV 25-JUL-1990 DEFINITION D.melanogaster transposable element copia DNA in omega-aLTR1. ACCESSION M35053 KEYWORDS copia transposon; transposable element. SOURCE D.melanogaster (strain w-a-1A) DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 276) AUTHORS Zachar,Z., Davison,D., Garza,D. and Bingham,P.M. TITLE A detailed developmental and structural study of the transcriptional effects of insertion of the copia transposon into the white locus of Drosophila melanogaster JOURNAL Genetics 111, 495-515 (1985) STANDARD simple staff_entry BASE COUNT 100 a 42 c 34 g 100 t ORIGIN 1 tgttggaata tactattcaa cctacaaaaa taacgttaaa caacactact ttatatttga 61 tatgaatggc cacacctttt atgccataaa acatattgta agagaatacc actcttttta 121 ttccttcttt ccttcttgta cgttttttgc tgtgagtagg tcgtggtgct ggtgttgcag 181 ttgaaataac ttaaaatata aatcataaaa ctcaaacata aacttgacta tttatttatt 241 tattaagaaa ggaaatataa attataaatt acaaca // LOCUS ECOMETBJI 82 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli metB-metJ intercistronic DNA region. ACCESSION M34899 KEYWORDS . SOURCE E.coli (strain K12) DNA, clone pAA110. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 82) AUTHORS Smith,A.A., Greene,R.C., Kirby,T.W. and Hindenach,B.R. TITLE Isolation and characterization of the product of the methionine- regulatory gene metJ of Escherichia coli K-12 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6104-6108 (1985) STANDARD simple staff_entry BASE COUNT 25 a 13 c 15 g 29 t ORIGIN 1 tataatttta acggctattt gggatttgct catctatacg caaagaagtt tagatgtcca 61 gatgtattga cgtccattaa ca // LOCUS MNICPRRKA 103 bp ds-DNA RNA 25-JUL-1990 DEFINITION M.rugicum 4.5S ribosomal RNA. ACCESSION M35056 KEYWORDS 4.5S ribosomal RNA. SOURCE M.rugicum chloroplast DNA. ORGANISM Chloroplast Mnium rugicum Eukaryota; Plantae; Embryobionta; Bryophyta; Bryopsida; Bryidae; Bryales; Mniaceae; Mnium rugicum. REFERENCE 1 (bases 1 to 103) AUTHORS Troitsky,A.V., Bobrova,V.K., Ponomarev,A.G. and Antonov,A.S. TITLE The nucleotide sequence of chloroplast 4.5 S rRNA from Mnium rugicum (Bryophyta): Mosses also posses this type of RNA JOURNAL FEBS Lett. 176, 105-109 (1984) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 103 4.5S ribosomal RNA BASE COUNT 33 a 20 c 28 g 22 t ORIGIN 1 taaggtgacg gcaagactag ccgtttatca tcacgatagg tgccaagtgg aagtgcagta 61 atgtatgcag ctgaggcatc ctaacagacc gagagattta aac // LOCUS MUSCABLA 125 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse tyrosine kinase (c-abl) mRNA, 3' terminus. ACCESSION M34905 KEYWORDS tyrosine kinase. SOURCE Mouse (strain NIH Swiss) testis, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 125) AUTHORS Meijer,D., Hermans,A., von Lindern,M., van Agthoven,T., de Klein,A., Mackenbach,P., Grootegoed,A., Talarico,D., Valle,G.D. and Grosveld,G. TITLE Molecular characterization of the testis specific c-abl mRNA in mouse JOURNAL EMBO J. 6, 4041-4048 (1987) STANDARD simple staff_entry FEATURES from to/span description mRNA < 1 44 tyrosine kinase (c-abl) mRNA (alt.) mRNA < 1 125 tyrosine kinase (c-abl) mRNA (alt.) BASE COUNT 26 a 35 c 26 g 38 t ORIGIN 1 gcttactgta cctgcacctt tgatgcttac aaactgtccc cgagagcctg tgctcactgt 61 gttttcattg gaaggaagct gcttactgta cctgcacctt tgatgcttac aaactgtccc 121 cgaga // LOCUS SOPMPDNA 111 bp ds-DNA SYN 25-JUL-1990 DEFINITION Synthetic ovalbumin pre-message selfprimer DNA. ACCESSION M35058 KEYWORDS ovalbumin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 111) AUTHORS Oyama,F., Kikuchi,R. and Uchida,T. TITLE A synthetic, partial pre-mRNA for ovalbumin primes its own complementary DNA with reverse transcriptase JOURNAL J. Biochem. 104, 403-408 (1988) STANDARD simple staff_entry FEATURES from to/span description site 3 3 cDNA start with primer site 36 36 cDNA start without primer site 69 84 primer-independent cDNA BASE COUNT 44 a 15 c 20 g 32 t ORIGIN 1 atcctggaag tttatcaaag cgaacaacct gtaattgaaa ataatagtag ctgaaataat 61 ggttatgaca aaaagaagtt atgcaatcca gtttcaagat ttctagctag t // LOCUS XELRRAA 121 bp ss-RNA RNA 25-JUL-1990 DEFINITION X.laevis 5S RNA. ACCESSION M35055 KEYWORDS 5S ribosomal RNA. SOURCE X.laevis kidney ribosomal RNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 121) AUTHORS Brownlee,G.G., Cartwright,E., McShane,T. and Williamson,R. TITLE The nucleotide sequence of somatic 5 S RNA from Xenopus laevis JOURNAL FEBS Lett. 25, 8-12 (1972) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 121 5S ribosomal RNA BASE COUNT 24 a 34 c 38 g 25 t ORIGIN 1 gcctacggcc acaccaccct gaaagtgccc gatctcgtct gatctcggaa gccaagcagg 61 gtcgggcctg gttagtactt ggatgggaga ccgcctggga ataccaggtg tcgtaggctt 121 t // LOCUS YSCTRR2 76 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae, Brewer's) Arg-tRNA-II. ACCESSION K00157 M34900 KEYWORDS transfer RNA; transfer RNA-Arg. SOURCE Yeast (Saccharomyces cerevisiae, Brewer's) tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Weissenbach,J., Martin,R. and Dirheimer,G. TITLE Nucleotide sequence of tRNA-Arg-II from Brewer's yeast JOURNAL FEBS Lett. 28, 353-355 (1972) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 76) AUTHORS Weissenbach,J., Martin,R. and Dirheimer,G. TITLE The primary structure of Arg-tRNA-II from brewer's yeast: Partial digestion with ribonuclease T-1 and derivation of the complete sequence JOURNAL Eur. J. Biochem. 56, 527-532 (1975) STANDARD full staff_review COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their entry 0130 in Nucleic Acids Res. 11, r1-r54 (1983). FEATURES from to/span description tRNA 1 76 Arg-tRNA-II (NAR: 0130) anticdn 34 36 Arg-tRNA-II anticodon gcg modified 1 1 f = pseudouridine modified 9 9 m1g = 1-methylguanosine modified 10 10 m2g = 2-methylguanosine modified 16 16 d = dihydrouridine modified 19 19 d = dihydrouridine modified 26 26 m22g = 2,2-dimethylguanosine modified 27 27 f = pseudouridine modified 34 34 i = inosine modified 47 47 d = dihydrouridine modified 49 49 m5c = 5-methylcytidine modified 54 54 t = 5-methyluridine modified 55 55 f = pseudouridine modified 58 58 m1a = 1-methyladenosine BASE COUNT 15 a 22 c 24 g 15 t ORIGIN 5' end of mature tRNA. 1 ttcctcgtgg cccaatggtc acggcgtctg gctgcgaacc agaagattcc aggttcaagt 61 cctggcgggg aagcca // LOCUS YSCTRT1A 76 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae, brewer's) Thr-tRNA-1a. ACCESSION K00278 M34898 KEYWORDS transfer RNA; transfer RNA-Thr. SOURCE Yeast (Saccharomyces cerevisiae, brewer's) tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Weissenbach,J., Kirarly,I. and Dirheimer,G. TITLE The nucleotide sequences of two threonine tRNAs from Brewer's yeast JOURNAL FEBS Lett. 71, 6-8 (1976) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 76) AUTHORS Weissenbach,J., Kiraly,I. and Dirheimer,G. TITLE Structure primaire des Thr-tRNA-1a-et-b de levure de biere JOURNAL Biochimie 59, 381-391 (1977) STANDARD full staff_review COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their entry 1760 in Nucleic Acids Res. 11, r1-r54 (1983). Brewer's yeast Thr-tRNA-1 is 50% Thr-tRNA-1a and 50% Thr-tRNA-1b [1]. FEATURES from to/span description tRNA 1 76 Thr-tRNA-1a (NAR: 1760) anticdn 34 36 Thr-tRNA-1a anticodon ggt modified 10 10 m2g modified 16 16 d modified 17 17 d modified 20 20 d modified 26 26 m22g modified 32 32 m3c modified 34 34 i modified 37 37 t6a modified 39 39 f modified 47 47 d modified 48 48 m5c modified 54 54 t modified 55 55 f modified 58 58 m1a BASE COUNT 20 a 17 c 21 g 18 t ORIGIN 5' end of mature tRNA. 1 gcttctatgg ccaagttggt aaggcgccac actggtaatg tggagatcat cggttcaaat 61 ccgattggaa gcacca // LOCUS YSCTRT1B 76 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae, brewer's) Thr-tRNA-1b. ACCESSION K00279 M34898 KEYWORDS transfer RNA; transfer RNA-Thr. SOURCE Yeast (Saccharomyces cerevisiae, brewer's) tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Weissenbach,J., Kirarly,I. and Dirheimer,G. TITLE The nucleotide sequences of two threonine tRNAs from Brewer's yeast JOURNAL FEBS Lett. 71, 6-8 (1976) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 76) AUTHORS Weissenbach,J., Kiraly,I. and Dirheimer,G. TITLE Structure primaire des Thr-tRNA-1a-et-b de levure de biere JOURNAL Biochimie 59, 381-391 (1977) STANDARD full staff_review COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their entry 1760 in Nucleic Acids Res. 11, r1-r54 (1983). Brewer's yeast Thr-tRNA-1 is 50% Thr-tRNA-1b and 50% Thr-tRNA-1a [1]. FEATURES from to/span description tRNA 1 76 Thr-tRNA-1b (NAR: 1760) modified 10 10 m2g modified 16 16 d modified 17 17 d modified 20 20 d modified 26 26 m22g modified 32 32 m3c modified 34 34 i anticdn 34 36 Thr-tRNA-1b anticodon ggt modified 37 37 t6a modified 39 39 f modified 47 47 d modified 48 48 m5c modified 54 54 t modified 55 55 f modified 58 58 m1a BASE COUNT 19 a 18 c 22 g 17 t ORIGIN 5' end of mature tRNA. 1 gcttctatgg ccaagttggt aaggcgccac actggtaatg tggagatcgt cggttcaaat 61 ccgactggaa gcacca // LOCUS BSTGLGBA 2735 bp ds-DNA BCT 25-JUL-1990 DEFINITION B.stearothermophilus branching enzyme (glgB) gene, complete cds. ACCESSION M35089 KEYWORDS branching enzyme. SOURCE B.stearothermophilus (strain 1503-4R, variant 4) DNA, clone pKVS1. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2735) AUTHORS Kiel,J.A.K.W., Boels,J.M., Beldman,G. and Venema,G. TITLE Molecular cloning and nucleotide sequence of the branching enzyme gene (glgB) from Bacillus stearothermophilus, expression in E.coli and B.subtilis JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.K.W.Kiel, 12-JUN-1990. Dept of Genetics Center of Biological Sciences Kerklaan 30, NL 9751 NN Haren, THE NETHERLANDS FEATURES from to/span description pept 522 2441 branching enzyme (glgB) (EC 2.4.1.18) pept 325 < 1 (c) unidentified ORF2 binding 337 330 (c) ORF2 ribosomal binding site (put.) binding 504 516 glgB ribosomal binding site (put.) signal 370 365 (c) ORF2 -10 region (put.) signal 394 389 (c) ORF2 -35 region (put.) signal 446 458 glgB -35 region (put.) signal 469 479 glgB -10 region (put.) BASE COUNT 835 a 492 c 634 g 774 t ORIGIN 1 gaattccaat ggaaataatg gctaacgtaa ggccgtttaa aaaggacgta ataatttcaa 61 agcgcaaata accgaatgta aatcgatgat ttggcggacg catggcaaga taaagagcga 121 tcatgctaag cccaagcgcc aatacgtcag atgccatatg ggcagagtcg gaaagcaaag 181 ctaaggaatt ggataatagc cccccaacaa tttccacaat cgtaaaaaac aatgttaaaa 241 cgagagtgat ccaaagcgtt tttttcgatt gattttgcgt ttttacatga ggaagatggt 301 gataatcgta ttgaattggt gacatgacac acctcttatt tagaattatt tttaatttat 361 atacattata atatagtttt ttataattgt gcaaaaaaat tttttgttta tttatcgaaa 421 aatgtaaaaa aaatacaatt tttttatcaa ggaatttatg gaatcgctgt ggaatataag 481 taacaacggt aagaaacttt aaggaaagga tgcgatacag attgatcgcc gtcggtccca 541 ctgatttaga aatctattta tttcatgaag gcagcttata taaaagttat gaattgtttg 601 gtgcacatgt gataaagaaa aatggcatgg tcggaacccg gttttgtgta tgggcacccc 661 atgcgcggga agtgcgatta gtcggcagtt ttaatgaatg gaacggaact aattttaacc 721 ttatgaaagt aagtaatcaa ggcgtatgga tgatttttat tcctgaaaac ttagaagggc 781 atttatataa atacgaaatt acgacgaacg atgggaatgt tctgttaaaa tcggatccat 841 acgcgtttta ctccgagttg cgtccccata ctgcttccat tgtctacaac ataaaaggat 901 atcaatggaa tgaccagaca tggcgacgga agaaacagcg aaagcgaatt tatgaccagc 961 ctttgttcat ttatgaactt cactttggtt cgtggaaaaa gaaagaggac ggcagttttt 1021 atacatatca agagatggca gaggagctaa tcccttatgt tctcgaacat gggtttactc 1081 atattgagct gctcccactc gtcgagcatc cgttcgatcg ttcttgggga tatcagggaa 1141 taggttatta ttcagcaaca agccgctacg gaacaccgca tgatttgatg tattttattg 1201 accgctgtca ccaagctgga ataggcgtca ttctcgattg ggttcctggc cacttttgta 1261 aagattccca tgggttatat atgtttgatg gcgcaccggc atatgaatat gccaacatgc 1321 aagaccggga aaattacgta tggggaacgg caaactttga ccttggcaag ccggaagtcc 1381 gcagcttttt gatttccaat gcgttatttt ggatggaata tttccatgtg gacgggtttc 1441 gtgtagatgc tgttgccaat atgttatatt ggccaaacag cgacgtacta tacaaaaata 1501 cgtatgccgt ggagttcttg caaaaattaa atgaaacggt attcgcctat gatccgaaca 1561 tattaatgat tgccgaagat tcgacagact ggccgcgcgt cactgctcca acatacgacg 1621 gaggattagg atttaactat aaatggaaca tgggatggat gaacgatatt ttaacttata 1681 tggaaacgcc gcctgaacat cgaaaatacg tgcacaataa agtaacattt tccctcttgt 1741 atgcgtattc ggaaaatttc attttacctt tttcccatga cgaggtcgta catggaaaaa 1801 aatcgctgtt aagtaaaatg ccggggacat atgaggaaaa gtttgcgcaa ttaaggttgc 1861 tgtatggata tttgttgacg catcctggta agaaattatt gtttatgggc ggcgaatttg 1921 gccagtttga tgaatggaaa gatttagagc agctggattg gatgcttttt gattttgata 1981 tgcatcggaa tatgaatatg tatgtgaaag aattgttgaa atgttataag cgctataaac 2041 cgctttatga gttagaccac tctccagatg gattcgagtg gattgatgtt cataacgccg 2101 aacaaagtat tttctcattc attcgcagag gaaaaaaaga ggatgatttg cttattgttg 2161 tgtgtaattt cacaaataaa gtataccacg gttataaagt tggtgttccg ttatttacaa 2221 gatatcggga agtaatcaat agcgatgcaa tccaattcgg cggctttggg aatatcaatc 2281 caaaaccgat tgcggcgatg gaagggccgt ttcacggaaa gccatatcat attcagatga 2341 cgatcccgcc gtttggcatt tctattttaa gaccagtaaa aaaaggtagc gtcaaaagtt 2401 ttatgaaaac tccacatccg ccatcccatg gagcatcgta aggcatcctt ggagccggat 2461 tcgcccttga ccaacacccg ccaaaggtgt gaaagggacg tcaagggcga cggggacaaa 2521 aaagagggca taggaaagcc gcccttgccc ttaccgaatt ttacctttga cgaggttcgg 2581 ttggtcaagg gttcgcttcg ccgaatccgg ctgttcttct gatccatggg ctccggcgga 2641 caaaaaagtt aggctgcctc ttgttggagg aaatcttgag ccatggcgat cagcttcgtc 2701 caccgggccg gcatatgggg cagatcggcg agctc // LOCUS HUMETMAGA 3343 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human secreted epithelial tumor mucin antigen (H23Ag) gene, complete cds. ACCESSION M35093 KEYWORDS cell surface antigen; tumor mucin antigen. SOURCE Human breast tumor cell line MCF7 DNA, clone lambda-gtWES. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3343) AUTHORS Tsarfaty,I., Hareuveni,M., Horev,J., Zaretsky,J., Weiss,M., Jeltsch,J.M., Garnier,J.M., Lathe,R., Keydar,I. and Wreschner,D.H. TITLE Isolation and characterization of an expressed hypervariable gene coding for a breast cancer associated antigen JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Tsarfaty, 12-JUN-1990. FEATURES from to/span description pept 785 842 secreted epithelial tumor mucin antigen precursor, exon 1 (H23Ag) 1342 2207 secreted epithelial tumor mucin antigen precursor, exon 2 (H23Ag) sigp 785 805 secreted epithelial tumor mucin antigen signal peptide matp 806 842 secreted epithelial tumor mucin antigen 1342 2207 secreted epithelial tumor mucin antigen pre-msg 777 > 842 H23Ag mRNA and introns IVS 843 1341 H23Ag intron A signal 384 397 H23Ag ERE signal 633 644 H23Ag CACCT motifs signal 689 692 H23Ag TATA box site 1063 1090 put. enhancer rpt 1670 1729 repeat unit BASE COUNT 679 a 986 c 981 g 697 t ORIGIN Chromosome 1q21-q24. 1 gagctcctgg ccagtggtgg agagtggcaa ggaaggaccc tagggttcat cggagcccag 61 gtttactccc ttaagtggaa atttcttccc ccactcccct ccttggcttt ctccaaggag 121 ggaaccccag gctgctggaa agtccggctg gggcggggac tgtgggtttc agggtagaac 181 tgcgtgtgga acgggacagg gagcggttag aagggtgggg ctattccggg aagtggtggt 241 ggggggaggg agcccaaaac tagcacctag tccactcatt atccagccct cttatttctc 301 ggccgcctct gcttcagtgg acccggggag ggcggggaag tggagtggga gacctagggg 361 tgggcttccc gaccttgctg tacaggacct cgacctagct ggctttgttc cccatcccca 421 gttagttgtt gccctgaggc taaaactaga gcccaggggc cccaagttcc agactgcccc 481 tcccccctcc cccggagcca gggagtggtt ggtgaaaggg ggaggccagc tggagaagaa 541 acgggtagtc aggggttgca gcattagagc ccttgtagcc ctagcccagg aatggttgga 601 gagagaagag tagagtaggg aggggggttt gtcacctgtc acctgctcgg ctgtgcctag 661 ggcgggcggg ggggagtggg gggaccggta taaagcggta ggcgcctgtg cccgctccac 721 ctctcaagca gccagcgcct gcctgaatct gttctgcccc ctccccaccc atttcaccac 781 caccatgaca ccgggcaccc agtctccttt cttcctgctg ctgctcctca cagtgcttac 841 aggtgagggg cacgaggtgg ggagtgggct gccctgctta ggtggtcttc gtggtctttc 901 tgtgggtttt gctccctggc agatggcacc agaagttaag gtaagaattg cagacagagg 961 ctgccctgtc tgtgccagaa ggagggagag gctaaggaca ggctgagaag agttgccccc 1021 aaccctgaga gtgggtacca ggggcaagca aatgtcctgt agagaagtct agggggaaga 1081 gagtagggag agggaaggct taagagggga agaaatgcag gggccatgag ccaaggccta 1141 tgggcagaga gaaggaggct gctgcaggaa ggaggcggcc aacccagggg ttactgaggc 1201 tgcccactcc ccagtcctcc tggtattatt tctctggtgg ccaggcttat attttcttct 1261 tgctcttatt tttccttcat aaagacccaa ccctatgact ttaacttctt acagctacca 1321 cagcccctgg gcccgcaaca gttgttacag gttctggtca tgcaagctct accccaggtg 1381 gagaaaagga gacttcggct acccagagaa gttcagtgcc cagctctact gagaagaatg 1441 ctgtgagtat gaccagcagc gtactctcca gccacagccc cggttcaggc tcctccacca 1501 ctcagggaca ggatgtcact ctggccccgg ccacggaacc agcttcaggt tcagctgcca 1561 cctggggaca ggatgtcacc tcggtcccag tcaccaggcc agccctgggc tccaccaccc 1621 cgccagccca cgatgtcacc tcagccccgg acaacaagcc agccccgggc tccaccgccc 1681 ccccagccca gggtgtcacc tcggccccgg agaccaggcc gcccccgggc tccaccgccc 1741 ccccagccca tggtgtcacc tcggcgccgg acaacaggcc cgccttggcg tccaccgccc 1801 ctccagtcca caatgtcacc tcggcctcag gctctgcatc aggctcagct tctactctgg 1861 tgcacaacgg cacctctgcc agggctacca caaccccagc cagcaagagc actccattct 1921 caattcccag ccaccactct gatactccta ccacccttgc cagccatagc accaagactg 1981 atgccagtag cactcaccat agcacggtac ctcctctcac ctcctccaat cacagcactt 2041 ctccccagtt gtctactggg gtctctttct ttttcctgtc ttttcacatt tcaaacctcc 2101 agtttaattc ctctctggaa gatcccagca ccgactacta ccaagagctg cagagagaca 2161 tttctgaaat ggtgagtatc ggcctttcct tccccatgct cccctgaagc agccatcaga 2221 actgtccaca ccctttgcat caagcctgag tcctttccct ctcaccccag tttttgcaga 2281 tttataaaca agggggtttt ctgggcctct ccaatattaa gttcaggtac agttctgggt 2341 gtggacccag tgtggtggtt ggaggggtgg gtggtggtca tgagccgtag ggagggactg 2401 gtgcacttaa ggttggggga agagtgctga gccagagctg ggacccgtgg ctgaagtgcc 2461 catttccctg tgaccaggcc aggatctgtg gtggtacaat tgactctggc cttccgagaa 2521 ggtaccatca atgtccacga cgtggagaca cagttcaatc agtataaaac ggaagcagcc 2581 tctcgatata acctgacgat ctcaagacgt cagcggtgag gctacttccc tgctgcagcc 2641 agcaccatgc cggggcccct ctccttccag tgtctgggtc cccgctcttt ccttagtgct 2701 ggcagcggga ggggcgcctc ctctgggaga ctgccctgac cactgctttt ccttttagtg 2761 agtgatgtgc catttccttt ctctgaccag tctggggctg gggtgccagg ctggggcatc 2821 gcgctgctgg tgctggtctg tgttctggtt gcgctggcca ttgtctatct cattgccttg 2881 gtgagtgcag tccctggccc tgatcagagc cccccggtag aaggcactcc atggcctgcc 2941 ataacctcct atctccccag gctgtctgtc agtgccgccg aaagaactac gggcagctgg 3001 acatctttcc agcccgggat acctaccatc ctatgagcga gtaccccacc taccacaccc 3061 atgggcgcta tgtgccccta gcagtaccga tcgtagcccc tatgagaagg tgagattggg 3121 ccccacaggc aggggaagca gagggtttgg ctgggcaagg attctgaagg gggtacttgg 3181 aaaacccaaa gagcttggaa gaggtgagaa gtggcgtgaa gtgagcaggg gagggctggc 3241 aaggatgagg ggcagaggtc agaggagttt tgggggacag gcctgggagg agactatgga 3301 agaaaggggc ccctcaaaag ggagtgcccc actgccagaa ttc // LOCUS MPMVPIA 1155 bp ds-DNA VRL 25-JUL-1990 DEFINITION Mouse polyomavirus major structural protein (VP1) gene, complete cds. ACCESSION M34958 KEYWORDS major structural protein. SOURCE Mouse polyomavirus (strain RA) DNA. ORGANISM Mouse polyomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 1155) AUTHORS Freund,R., Garcea,R.L., Sahli,R. and Benjamin,T.L. TITLE A specific amino acid substitution in polyoma virus VP1 correlates with plaque size and hemagglutination behavior JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Freund, 08-JUN-1990. Author address: R.Freund Bldg C2 RM 129A Dept of Pathology Harvard Medical School 200 Longwood Avenue Boston, MA 02115 FEATURES from to/span description pept 1 1155 VPI protein (VPI) BASE COUNT 367 a 274 c 285 g 229 t ORIGIN 1 atggccccca aaagaaaaag cggcgtctct aaatgcgaga caaaatgtac aaaggcctgt 61 ccaagacccg cacccgttcc caaactgctt attaaagggg gtatggaggt gctggacctt 121 gtgacagggc cagacagtgt gacagaaata gaagcttttc tgaaccccag aatggggcag 181 ccacccaccc ctgaaagcct aacagaggga gggcaatact atggttggag cagagggatt 241 aatttggcta catcagatac agaggattcc ccaggaaata atacacttcc cacatggagt 301 atggcaaagc tccagcttcc catgctcaat gaggacctca cctgtgacac cctacaaatg 361 tgggaggcag tctcagtgaa aaccgaggtg gtgggctctg gctcactgtt agatgtgcat 421 gggttcaaca aacccacaga tacagtaaac acaaaaggaa tttccactcc agtggaaggc 481 agccaatatc atgtgtttgc tgtgggcggg gaaccgcttg acctccaggg acttgtgaca 541 gatgccagaa caaaatacaa ggaagaaggg gtagtaacaa tcaaaacaat cacaaagaag 601 gacatggtca acaaagacca agtcctgaat ccaattagca aggccaagct ggataaggac 661 ggaatgtatc cagttgaaat ctggcatcca gatccagcaa aaaatgagaa cacaaggtac 721 tttggcaatt acactggagg cacaacaact ccacccgtcc tgcagttcac aaacaccctg 781 acaactgtgc tcctagatga aaatggagtt gggcccctct gtaaaggaga gggcctatac 841 ctctcctgtg tagatataat gggctggaga gttacaagaa actatgatgt ccatcactgg 901 agagggcttc ccagatattt caaaatcacc ctgagaaaaa gatgggtcaa aaatccctat 961 cccatggcct ccctcataag ttcccttttc aacaacatgc tcccccaagt gcagggccaa 1021 cccatggaag gggagaacac ccaggtagag gaggttagag tgtatgatgg gactgaacct 1081 gtaccggggg accctgatat gacgcgctat gttgaccgct ttggaaaaac aaagactgta 1141 tttcctggaa attaa // LOCUS MYCP115A 3082 bp ss-mRNA BCT 25-JUL-1990 DEFINITION M.hyorhinis 115 kDa protein (p115) gene, complete cds. ACCESSION M34956 KEYWORDS . SOURCE M.hyorhinis (strain GDL) DNA, clone MhrG27. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 3082) AUTHORS Notarnicola,S.M., McIntoch,M.A. and Wise,K.S. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.S.Wise, 08-JUN-1990. University of Missouri-Columbia Dept of Mol Microbiol and Immunol School of Medicine-M653 Columbia, MO 65212 FEATURES from to/span description pept 70 3009 115 kDa protein BASE COUNT 1324 a 410 c 443 g 905 t ORIGIN 1 gaattctttt ttaataattt ttttacttta aaattctagt taaaactcta caaaaaaaca 61 aggacaaata tgttaaagct tattaaaatt gaaatcgaag gttttaaatc gttcgccgat 121 ccgatcagca taaatttcga tggttctgtt gtaggaatag ttggaccaaa tggttcagga 181 aaatctaata ttaatgacgc aattagatga gtattaggtg aacaatcagc aaaacaactt 241 cgtggactaa atatggatga tgttatcttt gcaggttcca aaactgtcaa acctcaagaa 301 aaagcaatgg taaaattaac cttcaaaaat gaagatgcaa ttgaagaaac gaaacaaatt 361 tttactattt ctcgtttact taaaagaggt caaggaacta atgaatattt ttacaatgat 421 caacctgtta gatataaaga tattaaaaat ttagctgttg aatctggaat ttctaaatct 481 tcacttgcaa ttatttccca aggtactata tctgaaattg cagaagcaac gcctgaacaa 541 agaaaagcag ttattgaaga agctgctgga acttcaaaat acaaattaga caaagaagaa 601 gcacaaaaga aacttattag aacaaatgat gcaattgata aattacaagg tgcaatcaaa 661 gagttagaac gtcaagtaaa ctcgcttgat aaacaagctt ctaaagcaaa aatttattta 721 gaaaaaagta aagctcttga atcagttgaa gtaggtttaa ttgttaatga tctaaacttt 781 ttcaatgaaa aattaaataa tttaaatact tcactattag aagtagaaca acaaagaaat 841 gatcttgaac tcaacattca aacttatgaa tccagtattt cacaaactgt tcattttaaa 901 acagaagttg aatcttcaat ccaagaaatt acttcaaaat tagacaattt aaaaaacgca 961 ctttccgaaa tcaaccttca agaagctaga attgaagaac gtagaaaatt aattatcagt 1021 ggtgaaattg tagttgatca aaaaacaaaa attgaagaaa ttaaaaaaca agttgaatca 1081 ctcaaaatac aaataaatgc ttcaaaacaa agagaaattg aactagacca acaacttaca 1141 agactaaatg caaaagctaa ttctttaaaa ttgcaagaaa atgatattaa taaagaaatt 1201 ggtgtattac ttgaaaaaaa atcagctgct gcagcaaata ttaatatatt aaaacaacaa 1261 tttgaaaata aaagttttct ttctaaagga attaaaacta ttaaagataa ctcattttta 1321 tttgatggtt acattggatt agcttctgaa ttatttaaag tagaatccga atttagttta 1381 gcaattgaaa ctgttttagg tgctgcttta aatcaaatag taatgaaaac atctgaagat 1441 gtacttcaag ctattgactt tttaaagaaa aatctttcag gtaaagcaac ttttattcct 1501 ttaacatcta ttaaagaaag agaagtaaga gaagatcatt tacttgtttt aaaaggacaa 1561 aaaggatttt taggtgttgc aaaagaacta attgaatttg atactcaatt taacaaactc 1621 tttggatttt tacttggaaa catcttagtg gttgataatg tagacaatgc aaatagaata 1681 gctaaaatat tagatcataa atacactata gtttctttag aaggtgattt attcagacca 1741 ggcggaacca ttactggagg ttcaaaacta gaaagaactt ctattttaaa ttacgatatc 1801 aaaataaaag aacacacaaa tacacttaaa tttgctgaag atcaaattca tgatttaaaa 1861 attaaacagc aaacaatata taacgaaatt gaaacagtca attcaacaat ccaacaagta 1921 aaaattgaag ctaattcaat aaattcaaaa cttaatatct taaacgaaga attaaataac 1981 ttaaaactaa acgcaagcga aattttcaaa gaacaacaag aagaccaaga gagtttaaat 2041 ttaagttttg attctgaaaa attgaacata gaaaaacaaa tttctactct aacaattgaa 2101 ttaaattcta aaaaagatcg actaacaaat ttaattagtg agcaaggaaa aggagaaacc 2161 aagaaacaag aattagatgc caaactaaga aaattaaaca ctcaacactc agatagtatc 2221 actgaacaaa acagagcaaa attcttggta gagcaaaatc aaaaaagact ttctgagcac 2281 tacaaattaa ctttagaagc tgctagtgaa caatattctt tagatttaga cattgaacaa 2341 gcaagacatt ttgttgatag ccttaaaaaa gagttaaaag aattaggaaa cgttaattta 2401 gaagcaatta ctgaatttga agaagtaaat caacgttacc aagagaaaaa acaatacatc 2461 gaagaactaa ccactgctaa atccaaaatt gaagaagcaa tttctgattt agataaaatt 2521 attatcaata aaacaacaga aattgttaac ttagtaaata atgaatttaa tatggtattt 2581 caaaaaatgt ttggtggtgg aaaagcagaa attcacttca cagacaaaaa tgatatttta 2641 aattctggtg ttgaaatatc tgcacaacca cctggtaaaa caattaaaaa cttacgactt 2701 ttttcaggtg gagaaaaagc tattattgca atttcacttc tttttgctat tttaaaagca 2761 agaccaattc cattgtgtat tttagacgaa gttgaagctg cacttgatga atctaatgtt 2821 attcgttatg tagaattttt aaaattacta aaagaaaata ctcaattctt aattattact 2881 caccgttcag gaacaatgtc aagagtagat cagttacttg gagttactat gcaaaaacgt 2941 ggagttactt ccattttctc agttgaacta agcaaagcaa aagagatgct aaaagacgaa 3001 ttaaaataat acaaataaaa ataaaaaaaa cagaagtttg aagtgaggtg ataccctttt 3061 cttgaaaaaa ttttttgagt gt // LOCUS PPHVLCRA 314 bp ds-DNA VRL 25-JUL-1990 DEFINITION Human papillomavirus type 6 long control region DNA. ACCESSION M35091 KEYWORDS . SOURCE Human papillomavirus type 6 (patient specimen X020) DNA. ORGANISM Human papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 314) AUTHORS Hrisomalos,T.F., Boggs,D.L. and Fife,K.H. TITLE The human papillomavirus type 6 long control region and human cellular DNA contain related sequences JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.H.Fife, 12-JUN-1990. AUTHOR address: K.H.Fife Emerson Hall 435 Indiana University School of Medicine 545 Barnhill Dr. Indianapolis, IN 46202-5124 FEATURES from to/span description pept < 1 21 L1 open reading frame (AA at 1) signal 231 236 polyA signal site 81 175 insert (as compared to prototype sequence) site 245 259 insert (as compared to prototype sequence) BASE COUNT 75 a 29 c 72 g 138 t ORIGIN Mapped between nucleotides 7271 to 7476. 1 cgcgccaaaa ccaaaaggta atatatgtgt atatgtactg ttatatatat gtgtgtatgt 61 actgttatgt atatgtgttt atgtactgtt atatgtatgt gtgttgtata tatgtgtgta 121 tatatgtgta tgtgtgtata tgtatatgta tgtgttgtgt atatatatgt gtgtgtgtgt 181 tatgtgtgta atgtaattta tttgtgtaat gtgtatgtgt gtttatgtgc aataaacaat 241 taactacatt attgtatatc ttgttacacc ctgtgactca gtggctgttg cacgcgtttt 301 ggtttgcacg cgcc // LOCUS PPHVLCRB 300 bp ds-DNA VRL 25-JUL-1990 DEFINITION Human papillomavirus type 6 long control region DNA. ACCESSION M35092 KEYWORDS . SOURCE Human papillomavirus type 6 (patient specimen X019) DNA. ORGANISM Human papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 300) AUTHORS Hrisomalos,T.F., Boggs,D.L. and Fife,K.H. TITLE The human papillomavirus type 6 long control region and human cellular DNA contain related sequences JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.H.Fife, 12-JUN-1990. Emerson Hall 435 Indiana Univ School of Medicine 545 Barnhill Dr. Indianapolis, IN 46202-5124 FEATURES from to/span description pept < 1 21 L1 open reading frame (AA at 1) signal 231 236 polyA signal site 81 175 insert (as compared to prototype sequence) BASE COUNT 68 a 29 c 72 g 131 t ORIGIN Mapped between nucleotides 7271 to 7476. 1 cgcgccaaaa ctaaaaggta atatatgtgt atatgtactg ttatatatat gtgtgtatgt 61 actgttatgt atatgtgtgt atgtactgtt atatgtatgt gtgttgtata tatgtgtgta 121 tatatgtgta tgtgtgtata tgtatatgta tgtgttgtgt atatatatgt gtgtgtgtgt 181 tctgtgtgta atgtaattta tttgtgtaat gtgtatgtgt gtttatgtgc aataaacaat 241 tacctcttgt tacaccctgt gactcagtgg ctgttgcacg cgttttggtt tgcacgcgcc // LOCUS TRHTCSA 1010 bp ss-mRNA PLN 25-JUL-1990 DEFINITION T.kirilowii trichosanthin (TCS) mRNA, complete cds. ACCESSION M34858 KEYWORDS ribosome inactivating protein; trichosanthin. SOURCE T.kirilowii maximowicz, cDNA to mRNA. ORGANISM Trichosanthes kirilowii Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Violales; Cucurbitaceae. REFERENCE 1 (bases 1 to 1010) AUTHORS Shaw,P.-C., Yung,M.-H., Zhu,R.-H., Ho,W.K.-K., Ng,T.-B. and Yeung,H.-W. TITLE Molecular cloning of trichosanthin cDNA and its expression in Escherichia coli JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.-C.Shaw, 06-JUN-1990. Author address: P.-C.Shaw Department of Biochemistry Chinese University of Hong Kong Shatin, NT, HONG KONG FEATURES from to/span description pept 10 879 trichosanthin precursor sigp 10 78 trichosanthin signal peptide matp 79 819 trichosanthin variant 196 196 t in wild type; a in allele variant 197 197 c in wild type; g in allele variant 468 468 t in wild type; a in allele BASE COUNT 290 a 218 c 205 g 297 t ORIGIN 1 gtcaaaaaga tgatcagatt cttagtcctc tctttgctaa ttctcaccct cttcctaaca 61 actcctgctg tggagggcga tgttagcttc cgtttatcag gtgcaacaag cagttcctat 121 ggagttttca tttcaaatct gagaaaagct cttccaaatg aaaggaaact gtacgatatc 181 cctctgttac gttcctctct tccaggttct caacgctacg cattgatcca tctcacaaat 241 tacgccgatg aaaccatttc agtggccata gacgtaacga acgtctatat tatgggatat 301 cgcgctggcg atacatccta ttttttcaac gaggcttctg caacagaagc tgcaaaatat 361 gtattcaaag acgctatgcg aaaagttacg cttccatatt ctggcaatta cgaaaggctt 421 caaactgctg caggcaaaat aagggaaaat attccgcttg gactccctgc tttggacagt 481 gccattacca ctttgtttta ctacaacgcc aattctgctg cgtcggcact tatggtactc 541 attcagtcga cgtctgaggc tgcgaggtat aaatttattg agcaacaaat tgggaagcgt 601 gttgacaaaa ccttcctacc aagtttagca attataagtt tggaaaatag ttggtctgct 661 ctctccaagc aaattcagat agcgagtact aataatggac agtttgaaag tcctgttgtg 721 cttataaatg ctcaaaacca acgagtcacg ataaccaatg ttgatgctgg agttgtaacc 781 tccaacatcg cgttgctgct gaatagaaac aatatggcag ccatggatga cgatgttcct 841 atgacacaga gctttggatg tggaagttat gctatttagt gtaacttcaa gctacgtacg 901 agtacaaact cccacttgaa gaatctatta tcgtttgaga gtttaatcta cttgtagaaa 961 taataaagca tgttcgtgtg accgacctac gtggatgctc tgtatgtgtg // LOCUS CIBABI 1989 bp ds-DNA BCT 25-JUL-1990 DEFINITION Plasmid ColIB abortive infection protein (abi) gene, complete cds. ACCESSION J03314 KEYWORDS abortive infection protein. SOURCE Plasmid ColIB DNA, clone pTP64, isolated from E.coli K-12 strain W3110. ORGANISM Plasmid Colicin Ib Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1989) AUTHORS Gupta,S.K. and McCorquodale,D.J. TITLE Nucleotide sequence of a DNA fragment that contains the Abi gene of the ColIb plasmid JOURNAL Plasmid 20, 194-206 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.J.McCorquodale, 21-NOV-1988. FEATURES from to/span description pept 1306 1575 abortive infection protein binding 1028 1050 LexA binding site binding 1100 1121 LexA binding site binding 1215 1234 LexA binding site binding 1232 1252 LexA binding site BASE COUNT 476 a 533 c 531 g 449 t ORIGIN 5bp upstream of PstI site. 1 ctgcaggtcc gtgccgacca ggtgcttaag gggtggaaaa atatcccgcg cgggatctcc 61 ctgaccttct ccctgtttgc cgagatcgcc ggccgggaca gggaaaccat cgaccaggcc 121 tggaaaaata tcttctactc gcaactgagg gaaaaaaaac accgctttta ccaaagatat 181 cgaggccatc cgcgccctga aaaaactgcc tgccctcacc ggcgacagct ggcgcgggat 241 ggcatcacgg tgcgtatcta ccgcccggaa aattacgccc gcggcgatgg cggcttacac 301 tgagcctccc ggaaaattac gccacccaga tgtggaacat cccgttcccg gagcttgaat 361 accgcctctt taccgccgat ccgggctaca gcgccctgat cagcgccgaa cccgacaggt 421 gggacaaggc cttccgtttt gtggacgggg tgtgcgagct tcacctttac accaacggtg 481 tggaaggaag atcacaatcc caccccgctc ggggatgtcg ctcaggcgct gatcaacgtg 541 gtggaagaaa acctgctgta acggacccgg atgctgcggg cacaactgca tcatcaggag 601 gatgcaatga aaggacgaca gagccgctat gttaccggcg gagagagttt cgcggagatt 661 gcccgtctcc cttcaggggc ggtggtgagg ctctgtctga acaccggtct tgaggatgcg 721 ctgcgggagg cctccaaatc gctcaagtca gccttcaccc gttccgggcg aaaatgccgg 781 ctgtcagcgg gtacggcgca ggggccgttt accggacgcc ggcaggcgtg gccacacatc 841 tcttcgtctc ggtactctga gggggcaggg ggcaaaaaaa gtaaaaatgt attcgccagg 901 ttgcccggag gtgaaggaaa atagacatac agcagaacga cggatagcac tttttgctaa 961 atggacatca gtattactat gctatagttg ctttaatgga taagtgcgcc ttgacaaagg 1021 cggtgatttc tgttaacatt actctcatag tattgttccg tcccgctcca ccccaacaag 1081 atccgtttat ttcccgccag actggttatc accattcagg cccggatttt tttggatttt 1141 tttccgggga gcccccggac gagcttaaaa tcggtatgac aaacaggagg atgcgaatga 1201 acacatcata acagagctga aagataaaac attctgtacg gcattaacag cgttcacgtg 1261 tgtgaggcgc cgggtgcctt ttgacttaaa aacgaggtta ttgagatgac caaaatcaag 1321 acagttactt ttgtaaatac ttacccggga gggtctatga aaaacttgtt agacaccgag 1381 ggaacggttc tattcccatt ccagactgaa atccatttta tttggacgat tttctccacc 1441 gttaaacgcc tggttatcgg aaccagggac catatttgcc agaagcaata ctggagcgcc 1501 tgtctctgta ttttgcttct tatggcctat gtgggtctct gtgctgcggt ggtctggttt 1561 gtagtgccct gctgaaggcc tttatagtgt cgaaatttgc ggtttcggca ctatgggtca 1621 cgccagtaaa gcgcggacta ctctggggta tcggtaaagt ggttaccgcc acttgccgaa 1681 gatttactct gctaaagtaa gtagccgcaa cgctacacga actgatggtg aatgtcaaca 1741 gatactcacc atctccttac ggcggtggtc cctgtgacca ctggcctttc gcgtgggtgc 1801 aacacggcaa aactcctctg tacaacaggc tcccgccgtc attttccggc acaggtgagg 1861 ccggaattcg gactaaaacg taaaccgcgg gccagtccgg tagcgttcac tatcggccag 1921 cattctctca accagagaga aatccttttc accgcagaac acgtacgtct ccgcgaactc 1981 cacctgcag // LOCUS HUMET3 2223 bp ss-mRNA PRI 25-JUL-1990 DEFINITION Human endothelin 3 (EDN3) mRNA, complete cds. ACCESSION J05081 KEYWORDS endothelin. SOURCE Human adult hypothalamus, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2223) AUTHORS Bloch,K.D., Eddy,R.L., Shows,T.B. and Quertermous,T. TITLE cDNA cloning and chromosomal assignment of the gene encoding endothelin 3 JOURNAL J. Biol. Chem. 264, 18156-18161 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.D.Bloch, 06-OCT-1989. FEATURES from to/span description pept 194 910 endothelin 3 precursor /hgml_locus_uid="LU0066V" /map="unassigned" /nomgen="EDN1" sigp 194 268 endothelin 3 signal peptide (put.) matp 484 544 endothelin 3 matp 670 712 endothelin-like protein BASE COUNT 575 a 535 c 583 g 530 t ORIGIN 1 cgggtagcgc gctctgaaag tttatgaccg ccgcagccaa ctcctggccg gagctggaga 61 cgcagcgagc gatcggccgg cctcgaaccc ccacagctgg agggcgaggc cagctgtacc 121 cggccccagt gccctttcgc ggccacaagc ggccgtcctc ctggtccggt gctccggcgc 181 ctgatctagg ttcatggagc cggggctgtg gctccttttc gggctcacag tgacctccgc 241 cgcaggattc gtgccttgct cccagtctgg ggatgctggc aggcgcggcg tgtcccaggc 301 ccccactgca gccagatctg agggggactg tgaagagact gtggctggcc ctggcgagga 361 gactgtggct ggccctggcg aggggactgt ggccccgaca gcactgcagg gtccaagccc 421 tggaagccct gggcaggagc aggcggccga gggggcccct gagcaccacc gatccaggcg 481 ctgcacgtgc ttcacctaca aggacaagga gtgtgtctac tattgccacc tggacatcat 541 ttggatcaac actcccgaac agacggtgcc ctatggactg tccaactaca gaggaagctt 601 ccggggcaag aggtctgcgg ggccacttcc agggaatctg cagctctcac atcggccaca 661 cttgcgctgc gcttgtgtgg ggagatatga caaggcctgc ctgcactttt gcacccaaac 721 tctggacgtc agcagtaatt caaggacggc agaaaaaaca gacaaagaag aggaagggaa 781 ggttgaagtc aaggaccaac aaagcaagca ggctttagac ctccaccatc caaagctcat 841 gcccggcagt ggactcgccc tcgctccatc tacctgcccc cgctgcctct ttcaggaagg 901 agccccttag gaggacaggc ctgcagctcc aatttcatgc aggaaattgg ttttggagag 961 ttttggcaag ttggaaagcc acttactggc ttttgacatg acttctcttg gagaataagt 1021 ggactccaag ctaactcttt gcaaatgtaa acacatgtcc atcttgttaa taaatgcaaa 1081 atgcccgtgc agcagaagca tgcgactttc atatccttgc ctagaatagg ctgcatggtg 1141 tatgtcagtg agggccacga ggcgtcggct ttagacacag atcatagctc tacaggagtt 1201 tatgaatttg aagcttatgg gattttggca gagaaatttt cagctgtgct tgatacccac 1261 caaaagaatg tatctcgaaa gaatgaagga agaagaaaaa aggatccttg atgtttgtga 1321 caagaaaatg agaaagttag tatctgcaat acagagcttg ttcctgttca gtgactgacc 1381 ctctgtattc tgtatagaca ccaggccgat acacagtgga gttcccaggc cttgtttgca 1441 ggaagccgac tgtaaagaca gccccagctc aaggctatta ggttgaatat ttgctttcat 1501 gagtaaatgt ggatctttgg ggaatggctt caaaataagt cacgaacaca aattctttgt 1561 aaattatgta aattcctgtt tatataaatt ggcaacaact tataccgtct gacagttcaa 1621 aatctctttc agctgcgctc ttcccaccga gccgagctta ctgtgagtgt ggagatgtta 1681 tcccaccatg taaagtcgcc tgcgcagggg agggctgccc atctccccaa cccagtcaca 1741 gagagatagg aaacggcatt tgagtgggtg tccagggccc cgtagagaga catttaagat 1801 ggtgtatgac agagcattgg ccttgaccaa atgttaaatc ctctgtgtgt atttcataag 1861 ttattacagg tataaaagtg atgacctatc atgaggaaat gaaagtggct gatttgctgg 1921 taggattttg tacagtttag agaagcgatt atttattgtg aaactgttct ccactccaac 1981 tcctttatgt ggatctgttc aaagtagtca ctgtatatac gtatagagag gtagataggt 2041 aggtagattt taaattgcat tctgaataca aactcatact ccttagagct tgaattacat 2101 ttttaaaatg catatgtgct gtttggcacc gtggcaagat ggtatcagag agaaacccat 2161 caattgctca aatactcaga aagtactgtc aaaagcctaa taaaaaacct aaagtttgct 2221 ctg // LOCUS HUMSATAA 293 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human alpha satellite DNA, clone pC1.8. ACCESSION M26918 J04744 KEYWORDS alpha satellite DNA; satellite DNA. SOURCE Human (cell line HHW423) DNA, clone pC1.8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 293) AUTHORS Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A. TITLE A human alphoid DNA clone from the EcoRI dimeric family: Genomic and internal organization and chromosomal assignment JOURNAL Genomics 5, 822-828 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by A.Baldini, 08-AUG-1989. FEATURES from to/span description rpt 1 293 alpha-satellite BASE COUNT 84 a 55 c 63 g 91 t ORIGIN Chromosomes 1, 5, and 19; centromere. 1 gatcctttac acagagcaga cttgaaacac tctttttgtg gaatttgcag tggagatttc 61 aagcgctttg aggccaatgg cagaaaagga aatacttcga tataaaaact agacagaatc 121 attctcagaa actgctctgc gatgtgtcgg ttcaactctc agagtttaac ttttcttttc 181 attcagcagt ttggaaacac tctgtttgta aagtctgcaa cgtggatatt tgaccactta 241 gaggccttcg ttggaaacgg gtttttttcc tgtaaggcta gacagaagaa ttc // LOCUS HUMSATAB 344 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human alpha satellite DNA, clone pC1.8. ACCESSION M26919 J04744 KEYWORDS alpha satellite DNA; satellite DNA. SOURCE Human (cell line HHW423) DNA, pC1.8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 344) AUTHORS Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A. TITLE A human alphoid DNA clone from the EcoRI dimeric family: Genomic and internal organization and chromosomal assignment JOURNAL Genomics 5, 822-828 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by A.Baldini, 08-AUG-1989. FEATURES from to/span description rpt 1 344 alpha-satellite BASE COUNT 98 a 67 c 68 g 111 t ORIGIN Chromosomes 1, 5 and 9; centromere. 1 gaattcccag tagcttcctt gtgttgtgaa cattcaactc acagagttga acgttccctt 61 agacagagca gatttgaaca ctctttttgt gcaattggca agtggagatt tcaagcgctt 121 taaggtcaat ggcagaaaag gaaatatctt cgtttcaaaa ctagacagaa tcattcccac 181 aaactgcgtt gtgatgtgtt cattcaactc acacagttta acctttcttt tcatagagca 241 gttaggaaac agtctgtttg taaattctct aagtggatat tctgacatct tgtggccttc 301 gttggaaacg ggatttcttc atattctgct agacagaaga attc // LOCUS HUMSATAC 1049 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human alpha satellite DNA, clone pC1.8. ACCESSION M26920 J04744 KEYWORDS alpha satellite DNA; satellite DNA. SOURCE Human (cell line HHW423) DNA, clone pC1.8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1049) AUTHORS Baldini,A., Smith,D.I., Rocchi,M., Miller,O.J. and Miller,D.A. TITLE A human alphoid DNA clone from the EcoRI dimeric family: Genomic and internal organization and chromosomal assignment JOURNAL Genomics 5, 822-828 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by A.Baldini 08-AUG-1989. FEATURES from to/span description rpt 1 1049 alpha-satellite BASE COUNT 295 a 198 c 221 g 335 t ORIGIN Chromosomes 1, 5 and 19; centromere. 1 aaattttctt ttcatacagc agagtttgga aacactctgt ttgtaaagtc tgcacgtgga 61 taagttgtcc acttagaggc attcgttgga aacgggtttt tttcatgtaa ggctacacag 121 aagaattccc agtaacttcc ttgtgttgtg tgtatcaact caaagagttg aacgatcctt 181 tacacagagc agacttctaa cactcttttt gtggaatttg caagtggaga tttcagccgc 241 tttgaagtca aaggtagaaa aggaaatatc ttcctataaa aactagacag aatgattctc 301 agaaactcct ttgtgatgtg tgcgttcaac tcacagagtt taacctttct tttcatagag 361 cagttaggaa acactctgtt tgtaaagtct gcaagtggat attcagacct ctttgaggcc 421 ttcgtggaac gggttttcat ataaggctag gcagagaatt cccagtaact tccttgtgtt 481 gtgtgtgtca actcacagag ttgactttca tttacacaga gcagacttga aacactcttt 541 ttgtaattgc aagtggagat ttcaagcgct ttgagcaagg ccgaaaagga aatatcttcg 601 tataaaaact agacagaatc attctcagaa actgctctgc gatgtgtgcg ttcaactctc 661 agagtttaac ttttcttttc atcagcagtt tggaaacact ctgtttgtaa agtctgcacg 721 tggatatttt gaccacttag aggccttcgt tggaaacggg tttttttcct gtaaggctag 781 acagaagaat tccctgtagc ttccttgtgt tgtgtacatt caacgcacag agttgaacgt 841 tcccttagac agagcagatt tgaaacactc tttttgtgca attggcaagt ggagatttca 901 ggcgctttaa ggtcaatggc agaaaaggaa atatcttcgt ttcaaaacta gacagaatca 961 ttcccacaaa ctgcgtggtg atgtgttcgt tcaactcaca gagtttaacc tttcctttca 1021 tagagcagtt aggaaacagt ctgtttttt // LOCUS PVYCPA 1122 bp ss-RNA VRL 25-JUL-1990 DEFINITION Potato virus Y coat protein gene, 3' end. ACCESSION M22470 KEYWORDS coat protein. SOURCE Potato virus Y (necrotic strain; isolate New Zealand; N-PVY), passed in Nicotiana tabacum cv. Burley 21, cDNA to viral RNA, clone PVYN 27. ORGANISM Potato virus Y Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Potyvirus. REFERENCE 1 (bases 1 to 1122) AUTHORS Hay,J.M., Fellowes,A.P. and Timmerman,G.M. TITLE Nucleotide sequence of the coat protein gene of a necrotic strain of potato virus Y from New Zealand JOURNAL Arch. Virol. 107, 111-122 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.M.Hay, 09-FEB-1989. FEATURES from to/span description pept < 1 796 coat protein (AA at 2) BASE COUNT 351 a 196 c 263 g 312 t ORIGIN 6 bp upstream of TaqI site. 1 cacaatcgat gcaggaggaa gcactaaaaa ggatgcaaaa caagagcaag gtagcattca 61 accaaatttc aacaaggaaa aggaaaagga cgtgaatgtt ggaacatctg gaactcatac 121 tgtgccacga attaaagcta tcacgtccaa aatgagaatg cccaagagta aaggtgcaat 181 tgcattaaat ttggaacact tactcgagta tgctccacag caaattgaca tctcaaatac 241 tcgagcaact caatcacagt ttgatacgtg gtatgaagca gtacaacttg catacgacat 301 aggagaaact gaaatgccaa ctgtgatgaa tgggcttatg gtttggtgca ttgaaaatgg 361 aacctcgcca aacatcaacg gagtttgggt tatgatggat ggagatgaac aagtcgaata 421 cccactaaaa ccaatcgttg agaatgcaaa accaacactt aggcaaatca tggcacattt 481 ctcagatgtt gcagaagcgt atatagaaat gcgcaacaaa aaggaaccat atatgccacg 541 atatggttta gttcgtaatc tgcgcgatgg aagtttggct cgctatgctt ttgactttta 601 tgaagttaca tcacggacac cagtgagggc tagagaggca cacattcaaa tgaaggccgc 661 agctttaaaa tcagctcaat ctcgactttt cggattggat ggtggcatta gtacacaaga 721 ggaaaacaca gagaggcaca ccaccgagga tgtttctcca agtatgcata ctctacttgg 781 agtgaagaac atgtgattgt agtgtctttc cggacgatat atagatattt atgtttgcag 841 taagtatttt ggcttttcct gtactacttt tatcgaaatt aataatcgtt tgaatattac 901 tggcagatag gggtggtata gcgattccgt cgttgtagtg accttagctg tcgtttctgt 961 attattatgt ttgtataaaa gtgccgggtt gttgttgttg tggctgatct atcgattagt 1021 tgatgttgcg atttgtcgta gcagtgacta tgtctggatt tagttagttg ggtgatgctg 1081 tgattctgtc atagcagtga ctgtaaactt caatcaggag ac // LOCUS SRAAFPG 2420 bp ds-DNA VRT 25-JUL-1990 DEFINITION Sea raven (H.americanus) antifreeze protein type II gene, complete cds. ACCESSION J05100 KEYWORDS antifreeze protein. SOURCE Sea raven (adult) testes DNA, clone lambda SR7. ORGANISM Hemitripterus americanus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Scorpaeniformes; Cottoidei; Cottidae. REFERENCE 1 (bases 1 to 2420) AUTHORS Hayes,P.H., Scott,G.K., Ng,N.F.L., Hew,C.L. and Davies,P.L. TITLE Cystine-rich type II antifreeze protein precursor is initiated from the third AUG codon of its mRNA JOURNAL J. Biol. Chem. 264, 18761-18767 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.L.Davies, 19-OCT-1989. FEATURES from to/span description pept 434 494 antifreeze protein, exon 2 (first expressed exon) 1246 1382 antifreeze protein, exon 3 1488 1604 antifreeze protein, exon 4 1697 1805 antifreeze protein, exon 5 2045 2112 antifreeze protein, exon 6 pre-msg 226 2112 antifreeze protein mRNA and introns IVS 284 389 antifreeze protein intron A IVS 495 1245 antifreeze protein intron B IVS 1383 1487 antifreeze protein intron C IVS 1605 1696 antifreeze protein intron D IVS 1806 2044 antifreeze protein intron E rpt 74 94 repeat copy A rpt 95 114 repeat copy B rpt 115 135 repeat copy C signal 74 135 antifreeze protein regulatory sequence (put.) site 161 164 antifreeze protein CAAT box site 195 198 antifreeze protein TATA box signal 2368 2373 antifreeze protein polyA signal BASE COUNT 684 a 475 c 496 g 765 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttcaga aattcactcc tttttctaat attaacttta aagccacagt gtgcgatttg 61 gagccctttg atttgttgtt ttcaaagttc aaactgttgt ttcaaaattc aaactgttgt 121 tttcaaagtt caaactgatg ccagtgtcca taataaaaat caatgtatga ataatattgt 181 gaaatgtaat tgactatata agagctggtc tttctctagt tcagcacatg aatgcagagg 241 caacaggctg acactgaaac aagagaagat atttctacag caggtttgct ctcagcctct 301 tcttcgtcct gccgagcccc acaggcactg tgctgccctg ctgtctttgt aattcattgc 361 aactcttgtg tttttctctt ctgatgcagg gctatcaatc atcttcatcg tctgcaccat 421 ctctaccacg aggatgctga ctgtgtctct actggtttgt gccatgatgg ctctgactca 481 agctaatgat gacagtgagt ctcagtctta cattctgtgt gtaggatact atactgtctg 541 taaatatatt caattgtaga cctattaaga tgctgtgaat attaatatta ggtaatattt 601 agtttattta tatatgtata tatatttgac agtaataaca aaaaactagg atagattgca 661 atccgacttt ttgttatctt tattgttaac aatattaaag acataattcc atagaattat 721 ataatttaca tagaaacagc aaatacaact gtcagagaaa gacttgacag ctaaagcagg 781 agagatcaag tgtagaaggg agatttgatc tcgtctcaac tgaagctaga actgaatgta 841 ctaacttatt tttggtgaaa caaccgaata attaattcat ttttccccca caaaactaaa 901 cgagacgcag accaagctaa gtgtgtgcta acagtaatca gcattcgttt agcaaagtat 961 tagtaactgc catcacagct tttgactcta gtggaattca tgaaatttgg cagaacaaag 1021 gagacctgtg cacatctgat tccaatgaga atacaatgtg cttcacagaa aagcacttca 1081 ccaatcctgt acacattcat aaagccacag aaaaaaagag agctgattaa tcgtcgttcc 1141 ctctgctctg acaataaaag gattataaac tccagatttc tgataaacag actcggtggc 1201 ttacctgtga tcagacatgt tacccactct tctgtttgtc ctcagaaata ctcaaaggca 1261 cggctacaga ggctggaccg gtctctcaga gagccggacc aaactgtccc gctggttggc 1321 aacctcttgg tgaccgctgt atctattatg agacaacagc gatgacttgg gctctggctg 1381 aggtagtcag gatatgatta tgattcagat tgcttctaaa ctggtctggt ggtattgcct 1441 tacatgctcg gttaattgag catgagcttg actcatttcc actgcagaca aactgtatga 1501 aattgggtgg acaccttgca tccatccaca gccaggagga gcatagtttc attcagacct 1561 tgaatgctgg tgttgtatgg atcggaggct ccgcttgcct ccaggtaaaa cattgcatta 1621 caatggtggc agaaagaaag gatttttatt acatgctatc ttactatacg tatattcttt 1681 cctttctgtt ttctaggcag gtgcttggac ctggtctgat ggtacaccta tgaattttcg 1741 ttcctggtgt tctaccaaac ctgatgatgt actggccgcg tgctgtatgc agatgactgc 1801 tgcaggtaaa tcacaacaca ttagagcata gtattaaatg actgaaggca gtagtgttgt 1861 ttagtacatt tggttcatct tgagatcaat actctcagaa tttcactttt gaatcacttt 1921 tgttcttcag ttcatgtgta gctttggcct cgttatccgt gtctttgtct gtctagtgat 1981 gaagacagtt tcaggttagg ttggtatggc gctgactcac ttcttgtgtt tttgatgttt 2041 acagctgacc aatgctggga tgacttgcct tgtccggcgt cccacaaatc agtctgcgcc 2101 atgacattct aagctaacac agaggccatc catcacacaa acactttagt gggtgtttga 2161 ttgtgtgtgt tcgcatactc atctgtgttc gtgtcaacag cctcatgctg aacctgaagg 2221 ttcaaaatct catatgacat ctttaattct ttgctattgt tggagctgcc tgaaaggatg 2281 agacgacaag agctggaaag catctgaggg attttaggaa gaaagtgaat ggttatgaaa 2341 atgatggtct ttttatgtat tatgtcaaat taaaaggctg acacgttgaa acaaactctt 2401 ctgtgagttt ggcagaattc // LOCUS YSCTFIIDA 2439 bp ds-DNA PLN 25-JUL-1990 DEFINITION Yeast (S.cerevisiae) TATA-binding protein (TFIID) gene, complete cds. ACCESSION M27135 KEYWORDS DNA binding protein; TATA-binding protein; transcription factor. SOURCE Yeast (S.cerevisiae, strain S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2439) AUTHORS Hahn,S., Buratowski,S., Sharp,P.A. and Guarente,L. TITLE Isolation of the gene encoding the yeast TATA binding protein TFIID: A gene identical to the SPT15 suppressor of Ty element insertions JOURNAL Cell 58, 1173-1181 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Hahn, 10-AUG-1989. FEATURES from to/span description pept 1237 1959 TATA-binding protein BASE COUNT 737 a 481 c 472 g 749 t ORIGIN 1 bp upstream of EcoRI site; chromosome 5 right arm after TRP1. 1 gaattcgttc aagtggtccg taatattccc gtctttacaa agctggatta ccatctctaa 61 tgccaacttc catgcatata gctcaggccc caccgtgtgc agctccgtgc ttcgcagctc 121 ctgcagagca tcctcgggga ttgggaacct ctcatttagc aagtaattca cataacacag 181 atttagaaac catttccatt gtgacttttc ccgacattgc gagagtagcc catgaaaact 241 cgtcttcacc ctgcggtgct gtttcagctt aatgcaaagc atcacgccga catactggaa 301 tacggatgcc caattttgat acaactcatc ctgcaaattt accatgtact ggactaattc 361 attgcaattt cttagtgcaa tcttatagtg gaacttactg tctctcataa gtggcaagtc 421 atgtaacagc agaaactcgc aacgcatgat ctcttctacc aaatctgtgt cgctctggtg 481 cgtttgtaac cgttctttca aactggaaat gtaaagctct gctaggtcaa aattatacgt 541 ctcctgtatc aataactcca ccatctcaaa cgtgacctta ctatcctcca gaactgaaag 601 cgtacatttc gttttcaata gctgaaacat ctggatagac atgttcatga ggccataata 661 ctgcttcaac ccttcctcag aaccgatttt attcgcaatt gatatgcatg gtctctgtat 721 tcctgtgcta agtggtatac ttgtgaaata ctaagtttgt cgccaagatt ttccatgaat 781 ttgtacttct ttcgaaatcg ttcaatttct accaatactg attcccctct gatagctgag 841 atgtcgggat tccctttgct gatagatcta actcatctct ttacgtattt taattgtgaa 901 gccgtaaata gttatcttcc aagtttctct tacgcgagct ttttgggaaa agaaaaaaat 961 ttgaagatct acatataaaa catggcttca aaggattact aatgactttt tttaccttga 1021 taggtattct tgatggtaag agtaaacaag ggacgtgaaa attacagtag ttactgtttt 1081 ttttggacta taagatcggg ggaaagataa cacataagaa ataaaacgac tactagttag 1141 actgctctgc ggaagaagca aggaagtaaa ggctgcattt tatttttctt ttctagtcca 1201 acataaacag gtgtatcaag agaaactttt ttaattatgg ccgatgagga acgtttaaag 1261 gagtttaaag aggcaaacaa gatagtgttt gatccaaata ccagacaagt atgggaaaac 1321 cagaatcgag atggtacaaa accagcaact actttccaga gtgaagagga cataaaaaga 1381 gctgccccag aatctgaaaa agacacctcc gccacatcag gtattgttcc aacactacaa 1441 aacattgtgg caactgtgac tttggggtgc aggttagatc tgaaaacagt tgcgctacat 1501 gcccgtaatg cagaatataa ccccaagcgt tttgctgctg tcatcatgcg tattagagag 1561 ccaaaaacta cagctttaat ttttgcctca gggaaaatgg ttgttaccgg tgcaaaaagt 1621 gaggatgact caaagctggc cagtagaaaa tatgcaagaa ttatccaaaa aatcgggttt 1681 gctgctaaat tcacagactt caaaatacaa aatattgtcg gttcgtgtga cgttaaattc 1741 cctatacgtc tagaagggtt agcattcagt catggtactt tctcctccta tgagccagaa 1801 ttgtttcctg gtttgatcta tagaatggtg aagccgaaaa ttgtgttgtt aatttttgtt 1861 tcaggaaaga ttgttcttac tggtgcaaag caaagggaag aaatttacca agcttttgaa 1921 gctatatacc ctgtgctaag tgaatttaga aaaatgtgat ggggaaggag tagacgaaaa 1981 gaaaaaaagg ttttctattt gttccatttt ctcaattatt aatggtcctc aaagaaataa 2041 aagaaaagga agaagaagta attgtaatat caaacggttt tttatagtat attcttctta 2101 ttctatattt atatatcaat gttttataat aagatgttta ttcatagcat atctggtgga 2161 tcgtctctat taagcgccag cgaggtgttt gcctctgcat ttttcagcaa agcaagctcc 2221 ctttccagct tgaatctatg ttcacgctca tccgacaatt ctttttcata ctttctttgt 2281 gtactcgtaa gcactttttt aaactcactt gtcattattg aaagtgaacg tgatccagaa 2341 ccgcttgtgg ggcttcctac agaggaaggt gaacttggat cccaagtcac tggcgaactc 2401 gctggtgatg acatgccgaa attatgtctg cttgaattc // LOCUS ECOPUTC 730 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli putC region encoding proline uptake protein (putP) and proline oxidase (putA) genes, 5'ends. ACCESSION M35174 KEYWORDS putA protein; putC region; putP protein. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 730) AUTHORS Nakao,T., Yamato,I. and Anraku,Y. TITLE Nucleotide sequence of putC, the regulatory region for the put regulon of Escherichia coli K 12 JOURNAL Mol. Gen. Genet. 210, 364-368 (1987) STANDARD simple staff_review FEATURES from to/span description pept 129 < 1 (c) proline uptake protein (putP) pept 549 > 730 proline oxidase (putA) pept 209 544 ORF mRNA 266 < 1 (c) putP mRNA (alt.) mRNA 255 < 1 (c) putP mRNA (alt.) mRNA 249 < 1 (c) putP mRNA (alt.) mRNA 224 < 1 (c) putP mRNA (alt.) mRNA 142 < 1 (c) putP mRNA (alt.) mRNA 506 > 730 putA mRNA BASE COUNT 205 a 173 c 144 g 208 t ORIGIN 1 cccaagacta cgaccgccca gaatatagtc gtcaaagttt ttcgttgatc gccaggcgat 61 aaacccaatc aatatcatgc caaagatata gacacaaaat gtcaccaaca tcggtgtgct 121 aatagccatc taaagtctcc aaaaaattat tatcggcaat gtcgaaactt gccgttatat 181 ctgccaccgg aacggggtaa cagagtttat gttttaccag ggcgaccgta tcctgccgga 241 agcgctggtt attcacaatc gatttaacac accatttaca ttaaatttta gtgctcagcg 301 acactatttt tcatcaggtt gcactctctc acattttttg cggttgcacc tttcaaaaat 361 gttaactgcc gcagagaaaa agtctgagtt atttttttcc ctgtcatatc gatttctttt 421 attaacattt cattcatttt taagcttgct acgcatgtca catttaacat ggttgcacaa 481 agttgcaaca tcatggatat ttcacgataa cgttaagttg cacctttctg aacaacagga 541 gtaatggcat gggaaccacc accatggggg ttaagctgga cgacgcgacg cgtgagcgta 601 ttaagttcgc cgcgacacgt atcgatcgca caccacactg gttaattaag caggcgattt 661 tttcttatgc tcgaacaact ggaaaacagc gatactctgc cggagctacc tgcgctgctt 721 tctggcgcgg // LOCUS FIBGLUC 1426 bp ds-DNA BCT 25-JUL-1990 DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene, complete cds. ACCESSION M33676 M33311 KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; beta-glucanase. SOURCE F.succinogenes (strain S85) DNA, clone PJI5. ORGANISM Fibrobacter succinogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1426) AUTHORS Teather,R.M. and Erfle,J.D. TITLE DNA sequence of a Fibrobacter succinogenes mixed linkage beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene JOURNAL J. Bacteriol. 172, 3837-3841 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.M.Teather, 11-APR-1990. FEATURES from to/span description pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase precursor (EC 3.2.1.73) sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase signal peptide matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase binding 132 137 ribosome binding site signal 62 66 -35 region signal 85 90 -10 region BASE COUNT 371 a 346 c 335 g 374 t ORIGIN 1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt 61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt 121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc 181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt 241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg 301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc 361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc 421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct 481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac 541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac 601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt 661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat 721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat 781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt 841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc 901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa 961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct 1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac 1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt 1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg 1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt 1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa 1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt 1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa // LOCUS BFRRCRRA 89 bp ss-RNA PHG 25-JUL-1990 DEFINITION Bacteriophage fr coat protein replicase cistron (R region) RNA. ACCESSION M35063 KEYWORDS coat protein. SOURCE Bacteriophage fr RNA. ORGANISM Bacteriophage fr Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Leviviridae. REFERENCE 1 (bases 1 to 89) AUTHORS Cielens,I.E., Jansone,I.V., Gribanov,V.A., Vishnevskii,Y.I., Berzin,V.M. and Gren,E.J. TITLE Regulator region of phage fr replicase cistron: II. Isolation and structure of specific fr RNA fragments JOURNAL Mol. Biol. 16, 886-892 (1982) STANDARD simple staff_entry FEATURES from to/span description pept 55 > 89 coat protein (R region) pept < 1 20 undefined ORF (AA at 3) BASE COUNT 34 a 23 c 13 g 19 t ORIGIN 1 ccaactcggg aatctactaa gaaacccgtg ccattccaac aatgaggaat acccatgtca 61 aaatcaacaa agaagttcaa ctctttatg // LOCUS CHKAGLBB 71 bp ss-mRNA VRT 25-JUL-1990 DEFINITION Chicken alpha-globin gene, partial cds. ACCESSION M35068 KEYWORDS alpha-globin. SOURCE Chicken (strain white Leghorn) 2-3 week old, cDNA to mRNA, clone pHb1003. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 71) AUTHORS Cummings,I.W., Liu,A.Y. and Salser,W.A. TITLE Identification of a new chicken alpha-globin structural gene by complementary DNA cloning JOURNAL Nature 276, 418-419 (1978) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 71 alpha-globin (AA at 1) BASE COUNT 17 a 22 c 18 g 14 t ORIGIN 1 aagaaggtag tggctgcctt gatcgaggct gccaaccaca ttgatgacat cgccggcacc 61 ctctccaagc t // LOCUS ECOTGLPA 141 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli suppressor tRNA-Leu (leuX) precursor gene. ACCESSION M35064 KEYWORDS leuX gene; suppressor transfer RNA-Leu. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 141) AUTHORS Nomura,T. and Ishihama,A. TITLE A novel function of RNase P from Escherichia coli: Processing of a suppressor tRNA precursor JOURNAL EMBO J. 7, 3539-3545 (1988) STANDARD simple staff_entry FEATURES from to/span description tRNA 23 114 Leu-tRNA anticdn 57 59 Leu-tRNA anticodon caa site 35 36 self-cleavage site BASE COUNT 38 a 34 c 33 g 36 t ORIGIN 1 gttttccgca tacctcttca gtgccgaagt ggcgaaatcg gtagacgcag ttgattcaaa 61 atcaaccgta gaaatacgtg ccggttcgag tccggccttc ggcaccaaaa gtatgtaaat 121 agacctcaac tgaggtcttt t // LOCUS HUMFBPC 66 bp ss-mRNA PRI 25-JUL-1990 DEFINITION Human folate binding protein mRNA, partial cds. ACCESSION M35069 KEYWORDS folate binding protein. SOURCE Human epidermoid carcinoma cell line KB, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 66) AUTHORS Sadasivan,E. and Rothenberg,S.P. TITLE Molecular cloning of the complementary DNA for a human folate binding protein JOURNAL Proc. Soc. Exp. Biol. Med. 189, 240-244 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 66 folate binding protein (AA at 1) BASE COUNT 21 a 17 c 17 g 11 t ORIGIN 1 acaaggattg catgggccag gactgagctt ctcaatgtct gcatgaacgc caagcaccac 61 aaggaa // LOCUS HUMMETONA 90 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human met oncogene, middle exon. ACCESSION M35073 KEYWORDS met oncogene; tyrosine kinase. SOURCE Human cell line MNNG-HOS DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 90) AUTHORS Dean,M., Park,M., Le Beau,M.M., Robins,T.S., Diaz,M.O., Rowley,J.D., Blair,D.G. and Vande Woude,G.F. TITLE The human met oncogene is related to the tyrosine kinase oncogenes JOURNAL Nature 318, 385-388 (1985) STANDARD simple staff_entry FEATURES from to/span description pept / 22 / 90 met oncogene (AA at 24) /hgml_locus_uid="LN0032R" /nomgen="MET" /map="7q31" IVS < 1 21 met oncogene intron BASE COUNT 28 a 16 c 21 g 25 t ORIGIN Chromosome 7q31. 1 ttggctttgg tcttcaagta gccaaagcga tgaaatatct tgcaagcaaa aagtttgtcc 61 acagagactt ggctgcaaga aactgtatgt // LOCUS HUMMETONB 375 bp ss-mRNA PRI 25-JUL-1990 DEFINITION Human met oncogene mRNA, 3' end. ACCESSION M35074 KEYWORDS met oncogene; tyrosine kinase. SOURCE Human cell line MNNG-HOS, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 375) AUTHORS Dean,M., Park,M., Le Beau,M.M., Robins,T.S., Diaz,M.O., Rowley,J.D., Blair,D.G. and Vande Woude,G.F. TITLE The human met oncogene is related to the tyrosine kinase oncogenes JOURNAL Nature 318, 385-388 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 375 met oncogene /hgml_locus_uid="LN0032R" /nomgen="MET" /map="7q31" BASE COUNT 94 a 97 c 83 g 101 t ORIGIN Chromosome 7q31. 1 tggtcctttg gcgtcgtcct ctgggagctg atgacaagag gagccccacc ttatcctgac 61 gtaaacacct ttgatataac tgtttacttg ttgcaaggga gaagactcct acaacccgaa 121 tactgcccag accccttata tgaagtaatg ctaaaatgct ggcaccctaa agccgaaatg 181 cgcccatcct tttctgaact ggtgtcccgg atatcagcga tcttctctac tttcattggg 241 gagcactatg tccatgtgaa cgctacttat gtgaacgtaa aatgtgtcgc tccgtatcct 301 tctctgttgt catcagaaga taacgctgat gatgaggtgg acacacgacc agcctccttc 361 tgggagacat catag // LOCUS MS23ENDA 105 bp ss-RNA PHG 25-JUL-1990 DEFINITION Bacteriophage MS2 3' terminal fragment. ACCESSION M35059 KEYWORDS . SOURCE Bacteriophage MS2 RNA. ORGANISM Bacteriophage MS2 Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Leviviridae. REFERENCE 1 (bases 1 to 105) AUTHORS Contreras,R., Vandenberghe,A., Jou,W.M., De Wachter,R. and Fiers,W. TITLE Studies on the Bacteriophage MS2 nucleotide sequence of a 3' terminal fragment (n=104) JOURNAL FEBS Lett. 18, 141-144 (1971) STANDARD simple staff_entry BASE COUNT 21 a 34 c 30 g 20 t ORIGIN 1 gctccaccga aaggtgggcg ggcttcggcc cagggacccc tccctaaaga gaggacccgg 61 gattctcccg atttggtaac tagctgcttg gctagttacc accca // LOCUS PEAPCATE 1004 bp ds-DNA SYN 25-JUL-1990 DEFINITION Chimaeric gene with P.sativum ribulose 1,5-bisphosphate carboxylase 5' flank/A.tumefaciens chloramphenicol acetyltransferase gene, 5' end. ACCESSION M35072 KEYWORDS . SOURCE Recombined Pisum sativum and Agribacterium tumefaciens DNA inserted in decapitated tobacco seedlings. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 1004) AUTHORS Herrera-Estrella,L., Van den Broeck,G., Maenhaut,R., Van Montagu,M., Schell,J., Timko,M. and Cashmore,A. TITLE Light-inducible and chloroplast-associated expression of a chimaeric gene introduced into Nicotiana tabacum using a Ti plasmid vector JOURNAL Nature 310, 115-120 (1984) STANDARD simple staff_entry FEATURES from to/span description pept 1002 > 1004 chloramphenicol acetyltransferase (CAT) recomb 965 966 P.sativum DNA end/CAT DNA start signal 943 946 TATA box BASE COUNT 309 a 176 c 146 g 373 t ORIGIN 1 gaattcaaca ttggctatta ctggttttac aaagtcagac taaggagcat gtccaaccac 61 tataaggtct ataataggat ttaccttttt ccttagaagc actttaatca actagaaatc 121 aaagaagcaa aatgtagtgt ctagatcttc atcagaagta aagtatagag ctttagcaaa 181 cacatcatgt gagacacgat ggtttctata cttgcttcag gatctctgca tttcccatac 241 ctcgttcatg acaattgcaa accaacctcg tacatttgat gcccataatt tctgaaaacc 301 aagttgcata cctcttcacc aaaactcttc atcttggtct cttctcctct tttgttcaca 361 aactaggaat tattaacttt cattctaatt tataggggct gctacaactt aatatatttt 421 taattatttt tattctctta atttcctttt tttctatttg tttgtcaggt agttgagata 481 tttgggctaa tctattagag atagtttctc taacaaactt gtaactttgg gtctatatta 541 gctaatgatt catcttatat tttttcaaat gaatcattaa taaaactttc ctcttttatt 601 taattttttc aattcagttt catcatcaaa gcaaatgttt ccctgccatc tgtttgtcaa 661 cactaacatc taatgtactt atctcattag tttaattatt gtttgatcat gtttaatcct 721 tctagtgttg ttagtttttt cagttagctt aatgggcatc ttacacgtgg cattatccta 781 ttggtggcaa atgataaggt taggacacac aacttttcaa tcttgtgtgg ttaatatggc 841 tgcaaagttt atcatttcac aatctaacaa gattggtact aggcagtagc taattaccac 901 aatattaaga ccataatatt ggaaatagat aaataaaaac attatatata gcaagtttta 961 gcagaagctt ggcgagattt tcaggagcta aggaagctaa aatg // LOCUS TRFMTTGVA 149 bp ds-DNA ORG 25-JUL-1990 DEFINITION C.oncopelti mitochondrion Val-tRNA gene. ACCESSION M35071 KEYWORDS transfer RNA-Val. SOURCE C.oncopelti mitochondrial DNA, clone pCo150. ORGANISM Mitochondrion Crithidia oncopelti Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Crithidia oncopelti. REFERENCE 1 (bases 1 to 149) AUTHORS Entelis,N.S., Maslov,D.A., Bol'shakova,E.V. and Zaitseva,G.N. TITLE Primary structure of an unusual valine tRNA gene from mitochondria of Crithidia oncopelti JOURNAL Dokl. Biochem. 297, 435-438 (1987) STANDARD simple staff_entry FEATURES from to/span description tRNA 18 89 Val-tRNA anticdn 45 47 Val-tRNA anticodon tac BASE COUNT 44 a 39 c 16 g 50 t ORIGIN 1 gatctaaaat ccctgttaga cacttgtttt tgcaaacgta taattacgtt ttctacacca 61 aaacccttta aatccctgtt aggaccccat ttcttcaaat gtataatcac gttttctgcg 121 tccaaacccc ttaaaaccca gatttcgat // LOCUS YSCTRV2A 75 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae, Baker's) Val-tRNA-2a. ACCESSION M35070 K01066 KEYWORDS transfer RNA-Val. SOURCE Yeast (S.cerevisiae, Baker's) tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 75) AUTHORS Aksel'rod,V.D., Kryukov,V.M., Isaenko,S.N. and Baev,A.A. TITLE Nucleotide sequence in Val-tRNA-2a from Baker's yeast JOURNAL FEBS Lett. 45, 333-336 (1974) STANDARD full staff_review REFERENCE 2 (bases 1 to 75) AUTHORS Aksel'rod,V.D., Kryukov,V.M., Isaenko,S.N.. and Baev,A.A. TITLE Primary structure of Val-tRNA-2a from Baker's yeast JOURNAL Mol. Biol. 9, 42-48 (1975) STANDARD simple staff_entry COMMENT Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss; from their entry 2050 in Nucleic Acids Res. 11, r1-r54 (1983). [1] compared given sequence with that of baker's yeast Val-tRNA-1. FEATURES from to/span description tRNA 1 75 Val-tRNA-2a (NAR: 2050) anticdn 35 37 Val-tRNA-2a anticodon tac modified 10 10 m2g modified 16 16 d modified 19 19 d modified 20 20 d modified 27 27 m22g modified 28 28 p modified 33 33 p modified 35 35 unidentified uridine derivative modified 46 46 d modified 48 48 m5c modified 53 53 t modified 54 54 p modified 57 57 m1a BASE COUNT 16 a 22 c 20 g 17 t ORIGIN 5' end of mature tRNA. 1 ggtccaatgg tccagtggtt caagacgtcg cctttacacg gcgaatcccg agttcgaacc 61 tcggttggat cacca // LOCUS YSCTRW 75 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (S.cerevisiae) Trp-tRNA-cca. ACCESSION M35060 X02698 KEYWORDS transfer RNA-Trp. SOURCE Yeast tRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 75) AUTHORS Keith,G., Roy,A., Ebel,J.P. and Dirheimer,G. TITLE The nucleotide sequences of two tryptophane-tRNAs from Brewer's yeast JOURNAL FEBS Lett. 17, 306-308 (1971) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 75) AUTHORS Keith,G., Roy,A., Ebel,J.-P. and Dirheimer,G. TITLE The primary structure of tryptophan transfer ribonucleic acid from Brewer's yeast: II. Partial digestion with pancreatic ribonuclease and derivation of complete sequence JOURNAL Biochimie 54, 1417-1426 (1972) STANDARD full staff_review FEATURES from to/span description tRNA 1 75 transfer RNA-Trp anticdn 33 35 Trp-tRNA anticodon cca modified 9 9 1-methylguanosine modified 10 10 2-methylguanosine modified 16 16 dihydrouridine modified 17 17 2'-O-methylguanosine modified 19 19 dihydrouridine modified 25 25 pseudouridine modified 26 26 pseudouridine modified 27 27 pseudouridine modified 31 31 2'O-methylcytidine modified 33 33 2'O-methylcytidine modified 38 38 pseudouridine modified 45 45 7-methylguanosine modified 47 47 dihydrouridine modified 53 53 5-methyluridine (ribosylthymine) modified 54 54 pseudouridine modified 57 57 1-methyladenosine modified 64 64 pot. pseudouridine BASE COUNT 17 a 18 c 20 g 20 t ORIGIN 1 gaagcggtgg ctcaatggta gagctttcga ctccaaatcg aagggttgca ggttcaattc 61 ctgtccgttt cacca // LOCUS YSUTRAI 76 bp ss-tRNA RNA 25-JUL-1990 DEFINITION Yeast (T.utilis) Ala-tRNA-I. ACCESSION M35061 K00143 KEYWORDS transfer RNA-Ala. SOURCE Yeast (T.utilis) tRNA. ORGANISM Candida utilis Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 76) AUTHORS Takemura,S., Ogawa,K. and Nakazawa,K. TITLE Nucleotide sequence of alanine tRNA I from Torulopsis utilis JOURNAL FEBS Lett. 25, 29-32 (1972) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 76) AUTHORS Takemura,S. and Ogawa,K. TITLE The primary structure of alanine transfer ribonucleic acid 1 from Torulopsis utilis: II. Partial digestion with ribonuclease T-1 and derivation of the complete sequence JOURNAL J. Biochem. 74, 323-333 (1973) STANDARD full staff_review COMMENT Contributed on tape April 1983 by M.Sprinzl and D.H.Gauss; from their entry 0020 in Nucleic Acids Res. 11, r1-r54 (1983). [1]: The cloverleaf model for the secondary structure was compared with that of Saccharomyces Ala-tRNA, especially with respect to the aminoacyl-tRNA synthetase recognition sites. FEATURES from to/span description tRNA 1 76 Ala-tRNA-I (NAR: 0020) anticdn 34 36 Ala-tRNA-I anticodon ggc modified 9 9 m1g = 1-methylguanosine modified 16 16 d = dihydrouridine modified 17 17 d = dihydrouridine modified 20 20 d = dihydrouridine modified 26 26 m22g = 2,2-dimethylguanosine modified 27 27 f = pseudouridine modified 34 34 i = inosine modified 37 37 m1i = 1-methylinosine modified 38 38 f = pseudouridine modified 47 47 d = dihydrouridine modified 54 54 t = 5-methyluridine modified 55 55 f = pseudouridine modified 58 58 m1a = 1-methyladenosine BASE COUNT 9 a 21 c 28 g 18 t ORIGIN 5' end of mature tRNA 1 gggcgtgtgg cgtagttggt agcgcgttcg cttggcgtgc gaaaggtctc cggttcgact 61 ccggactcgt ccacca // LOCUS MUSPTKA 211 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD15. ACCESSION M33421 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 211) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 211 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 47 a 55 c 64 g 45 t ORIGIN 1 ggatccacag ggacctggct gctcggaact gcctggtgac agagaagaat gtcctgaaga 61 tcagcgactt tgggatgtcc cgcgaagaag ctgatgggat ctatgccgcc tgcagcggcc 121 tcagacaagt ccctgttaag tggactgccc ctgaggccct taactacgga cgctactcct 181 cagagagtga tgtgtggagc tttggaattc c // LOCUS MUSPTKB 211 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD16. ACCESSION M33422 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD16. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 211) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 211 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 50 a 50 c 59 g 52 t ORIGIN 1 ggatccacag agaccttgct gctaggaact gcatggatgc cgaagatttc acagtaaaaa 61 ttggagattt cggtatgaca cgagacatct acgagacgga ctactaccgg aaaggcggga 121 aggggttgct gcctgtgcgc tggatgtctc tcgagtccct caaggatggt gtcttcacta 181 ctcattctga cgtctggtcc ttcggaattc c // LOCUS MUSPTKC 214 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD17. ACCESSION M33423 M22448 J04523 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD17. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 214) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review REFERENCE 2 (sites) AUTHORS Wilks,A.F. TITLE Two putative protein-tyrosine kinases identified by application of the polymerase chain reaction JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1603-1607 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [2] kindly submitted by A.Wilks, 08-FEB-1989, for release after publication. FEATURES from to/span description pept < 1 > 214 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 69 a 42 c 58 g 45 t ORIGIN 1 ggatccacag ggacctggca acaaggaaca tattggtgga aaatgagaac agggttaaaa 61 taggagactt cggattaacc aaagtcttgc cgcaggacaa agaatactac aaagtaaagg 121 agccagggga aagaccgata ttctggtacg cacctgaatc cttgacggag agcaagtttt 181 ctgtggcctc agatgtctgg tcctttggaa ttcc // LOCUS MUSPTKD 217 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD19. ACCESSION M33424 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD19. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 217) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 217 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 64 a 44 c 57 g 52 t ORIGIN 1 ggatccacag agacttagct gcaagaaact gcatgttgga tgaaaaattc actgtcaagg 61 ttgctgattt cggtcttgcc agagacatgt acgataaaga gtactatagt gtccacaaca 121 agacgggtgc caagctacca gtgaagtgga tggctttaga gagtctgcaa aggcagaagt 181 tcaccaccac gtcagatgtg tggtcctttg gaattcc // LOCUS MUSPTKE 214 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD22. ACCESSION M33425 M22447 J04523 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD22. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 214) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review REFERENCE 2 (sites) AUTHORS Wilks,A.F. TITLE Two putative protein-tyrosine kinases identified by application of the polymerase chain reaction JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1603-1607 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [2] kindly submitted by A.Wilks, 08-FEB-1989, for release after publication. FEATURES from to/span description pept < 1 > 214 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 59 a 46 c 55 g 54 t ORIGIN 1 ggatccaccg ggacttagca gcaagaaatg tccttgttga gagtgagcat caagtgaaga 61 tcggagactt tggtttaacc aaagcaattg aaaccgataa ggagtactac acagtcaagg 121 acgaccggga cagcccagtg ttctggtacg ctccggagtg tttaatccag tgtaaatttt 181 atatcgcctc tgacgtctgg tcctttggaa ttcc // LOCUS MUSPTKF 208 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone FD175. ACCESSION M33426 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line FDC-P1, cDNA to mRNA, clone FD175. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 208) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 208 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 49 a 53 c 55 g 51 t ORIGIN 1 ggatccaccg tgatctgcga gctgctaacg tcctggtctc tgagtcactc atgtgcaaga 61 ttgcagactt tggcctcgcg agagtcatcg aagataacga gtacacagca agggaaggtg 121 cgaagttccc tatcaagtgg acagctccag aggcgttcaa cttcggctgc ttcactatca 181 aatctgacgt gtggtccttt ggaattcc // LOCUS MUSPTKG 208 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse protein-tyrosine kinase (PTK) mRNA, partial cds, clone W3.13. ACCESSION M33427 KEYWORDS protein-tyrosine kinase. SOURCE Mouse haemopoietic cell line WEH1-3B D+, cDNA to mRNA, clone W3.13. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 208) AUTHORS Wilks,A.F., Kurban,R.R., Hovens,C.M. and Ralph,S.J. TITLE The application of the polymerase chain reaction to cloning members of the protein tyrosine kinase family JOURNAL Gene 85, 67-74 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 208 protein-tyrosine kinase (AA at 3) (EC 2.7.1.112) BASE COUNT 53 a 44 c 56 g 55 t ORIGIN 1 ggatccacag agacctggct gccagaaatt gtctagtgaa tgaagcagga gttgtcaaag 61 tatctgattt tggaatggcc aggtacgttc tggatgatca gtacacaagt tcttctggcg 121 ccaagttccc tgtgaagtgg tgtcccccag aagagtttaa ttacagccgc tttagcagca 181 agtcagacgt gtggtcctat ggaattcc // LOCUS RATCROS1A 7839 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Rat lung-derived c-ros-1 proto-oncogene mRNA, complete cds. ACCESSION M35104 KEYWORDS c-ros-1 proto-oncogene; tyrosine kinase. SOURCE Rat (strain Fischer) lung, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 7839) AUTHORS Matsushime,H. and Shibuya,M. TITLE Tissue-specific expression of rat c-ros-1 gene and partial structural similarity of its predicted products with sev protein of Drosophila melanogaster JOURNAL J. Virol. 64, 2117-2125 (1990) STANDARD simple staff_review FEATURES from to/span description pept 402 7355 c-ros-1 tyrosine kinase (put.) mRNA < 1 7839 c-ros-1 mRNA BASE COUNT 2159 a 1760 c 1887 g 2033 t ORIGIN 1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag 61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg 121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag 181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc 241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc 301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac 361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc 421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg 481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca 541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta 601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg 661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca 721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac 781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac 841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc 901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc 961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc 1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac 1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa 1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca 1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta 1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa 1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt 1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt 1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg 1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt 1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct 1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt 1681 cctacaatgg ggacaccaaa gctgtgcgta ttgtggagag tggcacatta aaggacttcg 1741 cagtaaagcc gcagtccaag cgaatcattt acttcaatgg caccatgcaa gtcttcatgt 1801 cgacatttct ggatggctcg gcattccaca gggttctgcc gtgggtcccc cttgcggatg 1861 tgaagagctt tgcttgtgaa aacaatgact tcctcatcac agatggcaag gccattttcc 1921 aacaggactc tctgtctttc aatgagttca tcgtgggatg tgacctgagt cacatagaag 1981 aatttgggtt tggtaacttg gtcatctttg gctcctccgt ccagtcgtac cctctgccag 2041 gccatccaca ggaggtctcg gtgctgtttg gttctcgaga ggcccttatt cagtggaagc 2101 ctccgattct cgccatagga gccagtcctt ccgcctggca gaactggact tatgaggtca 2161 aagtttcctc ccaggacatt ctggaaacca ctcaagtttt cttgaacata agcaggactg 2221 tgctgaatgt acccaagctg caaagttcta caaagtacat ggtgtctgtg cgagcaagtt 2281 ctcctaaagg cccaggccca tggtcagaac cctcagtggg tactaccttg gtaccagcca 2341 ctgagccacc gttcatcatg gctgtgaaag aagatgggct ttggagcaaa ccactcagta 2401 gttttggccc aggagagttc ctatcctctg acgtaggaaa cgtgtcagat atggattggt 2461 ataacaacag cctctactac agtgacacaa aaggcaatgt gtatgtgcgg cctctgaatg 2521 ggatggatat ctcggagaat taccacatat ccagcattgc aggagcttgt gccttggcct 2581 ttgaatggct gggtcacttt ctctactggg ctgggaagac atatgtgatt caaaggcagt 2641 ctgtgttaac gggacacaca gacattgtga ctcacgtgaa gctgttggtg aatgacatgg 2701 ccgtggatcc agttggtggc tatctgtact ggacgacgct ctactcggtt gaaagcacca 2761 gactcaatgg agaaagttct cttgtactac aggctcagcc ctggctctct ggaaaaaagg 2821 ttattgctct aacattagac ctcagcgatg ggctcctgta ctggctggtg caggacaatc 2881 agtgtattca cctgtacacg gctgttctcc ggggatggag tggtgcggat gctaccatca 2941 ccgagtttgc agcctggagt acttctgaaa tttcccagaa tgcactgatg tactacagcg 3001 gtagactctt ctggatcaat ggctttagga tcatcacagc acaggaaata ggtcagagaa 3061 ccagcgtgtc tgtttctgag ccagggaaat tcaatcagtt tacgatcata cagacatccc 3121 tcaagcctct gccagggaac ttttcctcta ctcccacggt tatcccagat tctgttcagg 3181 agtcctcatt tcgaattgaa ggacacactt caagtttccg aatcctgtgg aatgagcccc 3241 ctgcggtgga ctggggcata gttttctaca gtgtggaatt tagtgctcat tctaagttcc 3301 tggctattga acaacagtct ttacctgttt ttactgtgga aggactggag ccctatgcct 3361 tatttaatct ttctgtcact ccttatacct attggggaaa aggtcaaaaa acatctctat 3421 catttcgagc gcctgaatca gttccgtcag caccagagaa ccccagaata tttatattgt 3481 cacttggaag atacaccagg aagaatgaag tcgtggtaga gtttaggtgg aataaaccta 3541 agcatgaaaa tggagtgcta accaaatctg aaatcttcta ccacatatct aaacaaagtg 3601 gcacaaataa atcaacggaa gactgggtat ctgtcagcgt tacaccgccg gtgatgtctt 3661 ttcaacttga agccatgagt cctgggtata ttgtttcctt ccaggttcga gtcttcacct 3721 ccaaagggcc aggaccattt tctgatatag tgatgtctaa aacatcagaa atcaagccat 3781 gtccatatct catatctctt cttggcaata agattgagtt cttagacatg gaccaaaatc 3841 aagttgtgtg gacattttcc ctggagggag ccgtcagcac agtggggtac acagcggatg 3901 atgaaatggg gtatttcgct caaggagatg cactcttcct tctgaatttg cacaatcatt 3961 ccagctccaa gcttttccag gacgtgctgg cttctgacat tgcggttatt gctgttgact 4021 ggatcgcaag gcacctctac tttgctctga aagcatcgca agatggaaca cagatattcg 4081 atgttgacct tgaacacaag gtgaaatccc ccagggaggt gaagatttgc aaaagccata 4141 cagcaataat ttctttctct atgtatcccc tcttaagtcg cctgtattgg acagaagttt 4201 cagatctggg ctaccagatg ttctactgca atattagcag tcacaccttg catcacgttc 4261 tacaacccaa ggcctcaaac cagcatggaa ggagacagtg ttcttgtaat gtgacagaat 4321 ccgagttaag tggggcaatg actgtggaca cgtctgatcc agacagacct tggatatact 4381 ttaccaaaca gcaagagatc tgggccatgg atctggaagg atgtcagtgt tggaaagtca 4441 tcatggtacc tgctacccct ggaaaaagaa tcattagttt aacagtggat ggggagttta 4501 tatattggat cacaacaatg aaggacgaca cagaaattta tcaagcaaag aagggaagtg 4561 gggccatcct ctcccaggtg aaggccccca ggagtaagca tatcttggct tacagttcag 4621 ctctgcaacc ttttccagat aaagcatatc tgtctgtagc ttccaatatg gtagaagcaa 4681 gtatattgaa tgccaccaac accagcctca ttctcaagtt acctccagtc aagacaaacc 4741 tcacgtggca tggaattacc actcccacgt caacatacct ggtttactat atggaggcta 4801 atagggcaaa cagctctgac aggaaacaca acatgttgga atcacaggag aatgtagccc 4861 ggattgaagg tctgcagcca ttttcaacat acgtgattca gatagctgtg aagaactatt 4921 attctgatcc tttagaacat ctctctctgg gaaaagagat tcaaggaaaa actaaaagtg 4981 gagtgcccgg ggcagtttgt catatcaatg caactgtgct gtcggacacc agtcttcttg 5041 tattctggac agaatcgcat aaaccaaacg gacccaaaga gttagtccgc tatcagttgg 5101 ttatgtcata cctggctccg attcctgaga ctcctctaag acaggacgaa tttccaagcg 5161 ccaggctttc tctacttgtc actaaactct ctggtggaca acaatatgtg ctgaagatcc 5221 ttgcctgcca ctcagaggaa atgtggtgta ctgagagtca tcctgtcagt gtcaacatgt 5281 ttgacacacc ggagaaacct tctgccttgg ttccagagaa cactagtctg ctgttggatt 5341 ggaaggctcc gtctaacgct aacctcacca gattttggtt tgaactccag aagtggaagt 5401 atagtgagtt ttaccatgtc aaggcttcat gcagccaagg tccagtttat gtctgtaaca 5461 tcgcaaatct gcagccttac actccttata acatccgagt ggtggtggtc tatacgacag 5521 gagaaaatag ctcctcgatt cccgagagct tcaagacaaa agctggagtc ccaagcaaac 5581 cagggattcc taagttacta gaagggagta aaaattcaat ccagtgggaa aaagccgaag 5641 ataacgggaa cagattgatg tactacaccc tggaggtcag aaaaagcatt tcaaatgact 5701 cacgggacca gagtttaagg tggacggcgg tgtttaatgg gtcctgcagt agcatttgca 5761 catggaggtc aaaaaaccta aaaggaactt tccagttcag agcagtagcg tcaaatgcta 5821 ttggatttgg agaatacagt gaaatcagtg aagatattac attagtggaa gatggttttt 5881 ggataacaga aacaagtttt atacttacta tcatagttgg gatatttctg gttgccacag 5941 tcccactgac ctttgtctgg catagaagct tgaaaaacca caaagctacc aaggaaggcc 6001 tctcagttct caacgacaat gaccaagagt tggctgagct tcgaggactg gcggctggag 6061 tgggcctggc caatgcctgc tatgcagtac atactcttcc aacccaagag gagattgaaa 6121 gtcttcccgc cttccctcgg gagaagctga gcctgcgcct tctgttggga agtggagctt 6181 ttggagaagt gtacgagggc acagctgtag acatcctagg acggggaagt ggagaaatca 6241 aggtggccgt gaagaccctg aagaaaggtt cgacagacca ggagaagatc gagttcctga 6301 aggaggcaca cctgatgagc aagtttaatc accccaacat tctgaagcag ctgggagtct 6361 gtctgctgag tgaaccccag tacattatcc tggaactgat ggaaggggga gaccttctaa 6421 gctatctgcg caaagcccga gggacaacgt tgtctggccc tttactcaca ttggctgacc 6481 tggtagagct gtgtgtagat atttcaaaag gctgcgtcta cttggagcag atgcacttca 6541 ttcacaggga tctggcagct cggaattgcc ttgtgtctgt gaaagactat accagtcctc 6601 gggtagtcaa gatcggtgac tttggtttgg caagggaaat ctataagcat gattattata 6661 gaaagagagg ggaaggcctg cttcctgtcc ggtggatggc tcctgaaaac ttgatggatg 6721 gaatcttcac ttcccagtct gatgtatggt cttttggaat tttggtttgg gagattttaa 6781 ctcttggtca tcaaccttat ccagcgcatt ccaaccttga tgttttaaac tatgtgcaag 6841 caggagggag actggagcca ccgagaaact gtcctgatga tctgtggaat ttaatgttcc 6901 gatgttgggc ccaagaacct gaccaaagac ccactttcta taacattcaa gaccagcttc 6961 agttattcag aaatgtttcc ttaaacaatg tttctcactg tggacaagca gctcctgctg 7021 gtggagtcat caacaaaggc tttgaaggtg aagacaatga aatggccact ttgaattcag 7081 atgacacgat gccagttgcc ttgatggaaa ccaggaacca agaaggatta aattatatgg 7141 tacttgccac aaagtgtagc caaagtgagg atcgttatga gggtcctcta ggctctaagg 7201 aatctgggtt gcatgatctg aagaaagacg agaggcaacc agcagacaaa gatttctgcc 7261 agcaaccaca ggtggcttat ggctctcctg gccactctga aggcctgaac tatgcctgtc 7321 ttgctcacag tggacatgga gatgtgtctg aataatagta tctcatagga aacatagcac 7381 tgagatgaac actgtattaa gttaaaaaga agaaaggtgg ggtggcagtc tagactctga 7441 actgacacag ccaagttcca aagttctgat cttggttcca agagccatta tgtttcattc 7501 agcattctct ttaccagtga cgtaaccttc agtggattat cagaggaacc tgtgtgtgtg 7561 cggaaatccc aggacaaatc ctaagtctgg gaagaaaaca tcactgtctc tctcctctga 7621 agccctttac ttcagagcat tgcctgccct ggcaatctta ctaggttcat gcaaggatgt 7681 gagtggggga ggggccggag tctgctgagg accacctgaa ctacagatta ccttaagagg 7741 atgcaggaaa caattactca cacaggagga agcagcctgt ggaccatgag gaatcatctg 7801 gcacgctatt attccaataa aatattccct ttaatcatc // LOCUS RATCROS1B 8010 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Rat lung-derived L01 c-ros-1 proto-oncogene mRNA, complete cds. ACCESSION M35105 KEYWORDS c-ros-1 proto-oncogene; tyrosine kinase. SOURCE Rat (strain Fischer) lung, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 8010) AUTHORS Matsushime,H. and Shibuya,M. TITLE Tissue-specific expression of rat c-ros-1 gene and partial structural similarity of its predicted products with sev protein of Drosophila melanogaster JOURNAL J. Virol. 64, 2117-2125 (1990) STANDARD simple staff_review FEATURES from to/span description pept 402 5966 c-ros-1 unknown protein mRNA < 1 8010 c-ros-1 mRNA BASE COUNT 2197 a 1812 c 1930 g 2071 t ORIGIN 1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag 61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg 121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag 181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc 241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc 301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac 361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc 421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg 481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca 541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta 601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg 661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca 721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac 781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac 841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc 901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc 961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc 1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac 1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa 1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca 1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta 1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa 1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt 1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt 1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg 1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt 1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct 1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt 1681 cctacaatgg ggacaccaaa gctgtgcgta ttgtggagag tggcacatta aaggacttcg 1741 cagtaaagcc gcagtccaag cgaatcattt acttcaatgg caccatgcaa gtcttcatgt 1801 cgacatttct ggatggctcg gcattccaca gggttctgcc gtgggtcccc cttgcggatg 1861 tgaagagctt tgcttgtgaa aacaatgact tcctcatcac agatggcaag gccattttcc 1921 aacaggactc tctgtctttc aatgagttca tcgtgggatg tgacctgagt cacatagaag 1981 aatttgggtt tggtaacttg gtcatctttg gctcctccgt ccagtcgtac cctctgccag 2041 gccatccaca ggaggtctcg gtgctgtttg gttctcgaga ggcccttatt cagtggaagc 2101 ctccgattct cgccatagga gccagtcctt ccgcctggca gaactggact tatgaggtca 2161 aagtttcctc ccaggacatt ctggaaacca ctcaagtttt cttgaacata agcaggactg 2221 tgctgaatgt acccaagctg caaagttcta caaagtacat ggtgtctgtg cgagcaagtt 2281 ctcctaaagg cccaggccca tggtcagaac cctcagtggg tactaccttg gtaccagcca 2341 ctgagccacc gttcatcatg gctgtgaaag aagatgggct ttggagcaaa ccactcagta 2401 gttttggccc aggagagttc ctatcctctg acgtaggaaa cgtgtcagat atggattggt 2461 ataacaacag cctctactac agtgacacaa aaggcaatgt gtatgtgcgg cctctgaatg 2521 ggatggatat ctcggagaat taccacatat ccagcattgc aggagcttgt gccttggcct 2581 ttgaatggct gggtcacttt ctctactggg ctgggaagac atatgtgatt caaaggcagt 2641 ctgtgttaac gggacacaca gacattgtga ctcacgtgaa gctgttggtg aatgacatgg 2701 ccgtggatcc agttggtggc tatctgtact ggacgacgct ctactcggtt gaaagcacca 2761 gactcaatgg agaaagttct cttgtactac aggctcagcc ctggctctct ggaaaaaagg 2821 ttattgctct aacattagac ctcagcgatg ggctcctgta ctggctggtg caggacaatc 2881 agtgtattca cctgtacacg gctgttctcc ggggatggag tggtgcggat gctaccatca 2941 ccgagtttgc agcctggagt acttctgaaa tttcccagaa tgcactgatg tactacagcg 3001 gtagactctt ctggatcaat ggctttagga tcatcacagc acaggaaata ggtcagagaa 3061 ccagcgtgtc tgtttctgag ccagggaaat tcaatcagtt tacgatcata cagacatccc 3121 tcaagcctct gccagggaac ttttcctcta ctcccacggt tatcccagat tctgttcagg 3181 agtcctcatt tcgaattgaa ggacacactt caagtttccg aatcctgtgg aatgagcccc 3241 ctgcggtgga ctggggcata gttttctaca gtgtggaatt tagtgctcat tctaagttcc 3301 tggctattga acaacagtct ttacctgttt ttactgtgga aggactggag ccctatgcct 3361 tatttaatct ttctgtcact ccttatacct attggggaaa aggtcaaaaa acatctctat 3421 catttcgagc gcctgaatca gttccgtcag caccagagaa ccccagaata tttatattgt 3481 cacttggaag atacaccagg aagaatgaag tcgtggtaga gtttaggtgg aataaaccta 3541 agcatgaaaa tggagtgcta accaaatctg aaatcttcta ccacatatct aaacaaagtg 3601 gcacaaataa atcaacggaa gactgggtat ctgtcagcgt tacaccgccg gtgatgtctt 3661 ttcaacttga agccatgagt cctgggtata ttgtttcctt ccaggttcga gtcttcacct 3721 ccaaagggcc aggaccattt tctgatatag tgatgtctaa aacatcagaa atcaagccat 3781 gtccatatct catatctctt cttggcaata agattgagtt cttagacatg gaccaaaatc 3841 aagttgtgtg gacattttcc ctggagggag ccgtcagcac agtggggtac acagcggatg 3901 atgaaatggg gtatttcgct caaggagatg cactcttcct tctgaatttg cacaatcatt 3961 ccagctccaa gcttttccag gacgtgctgg cttctgacat tgcggttatt gctgttgact 4021 ggatcgcaag gcacctctac tttgctctga aagcatcgca agatggaaca cagatattcg 4081 atgttgacct tgaacacaag gtgaaatccc ccagggaggt gaagatttgc aaaagccata 4141 cagcaataat ttctttctct atgtatcccc tcttaagtcg cctgtattgg acagaagttt 4201 cagatctggg ctaccagatg ttctactgca atattagcag tcacaccttg catcacgttc 4261 tacaacccaa ggcctcaaac cagcatggaa ggagacagtg ttcttgtaat gtgacagaat 4321 ccgagttaag tggggcaatg actgtggaca cgtctgatcc agacagacct tggatatact 4381 ttaccaaaca gcaagagatc tgggccatgg atctggaagg atgtcagtgt tggaaagtca 4441 tcatggtacc tgctacccct ggaaaaagaa tcattagttt aacagtggat ggggagttta 4501 tatattggat cacaacaatg aaggacgaca cagaaattta tcaagcaaag aagggaagtg 4561 gggccatcct ctcccaggtg aaggccccca ggagtaagca tatcttggct tacagttcag 4621 ctctgcaacc ttttccagat aaagcatatc tgtctgtagc ttccaatatg gtagaagcaa 4681 gtatattgaa tgccaccaac accagcctca ttctcaagtt acctccagtc aagacaaacc 4741 tcacgtggca tggaattacc actcccacgt caacatacct ggtttactat atggaggcta 4801 atagggcaaa cagctctgac aggaaacaca acatgttgga atcacaggag aatgtagccc 4861 ggattgaagg tctgcagcca ttttcaacat acgtgattca gatagctgtg aagaactatt 4921 attctgatcc tttagaacat ctctctctgg gaaaagagat tcaaggaaaa actaaaagtg 4981 gagtgcccgg ggcagtttgt catatcaatg caactgtgct gtcggacacc agtcttcttg 5041 tattctggac agaatcgcat aaaccaaacg gacccaaaga gttagtccgc tatcagttgg 5101 ttatgtcata cctggctccg attcctgaga ctcctctaag acaggacgaa tttccaagcg 5161 ccaggctttc tctacttgtc actaaactct ctggtggaca acaatatgtg ctgaagatcc 5221 ttgcctgcca ctcagaggaa atgtggtgta ctgagagtca tcctgtcagt gtcaacatgt 5281 ttgacacacc ggagaaacct tctgccttgg ttccagagaa cactagtctg ctgttggatt 5341 ggaaggctcc gtctaacgct aacctcacca gattttggtt tgaactccag aagtggaagt 5401 atagtgagtt ttaccatgtc aaggcttcat gcagccaagg tccagtttat gtctgtaaca 5461 tcgcaaatct gcagccttac actccttata acatccgagt ggtggtggtc tatacgacag 5521 gagaaaatag ctcctcgatt cccgagagct tcaagacaaa agctggagtc ccaagcaaac 5581 cagggattcc taagttacta gaagggagta aaaattcaat ccagtgggaa aaagccgaag 5641 ataacgggaa cagattgatg tactacaccc tggaggtcag aaaaagcatt tcaaatgact 5701 cacgggacca gagtttaagg tggacggcgg tgtttaatgg gtcctgcagt agcatttgca 5761 catggaggtc aaaaaaccta aaaggaactt tccagttcag agcagtagcg tcaaatgcta 5821 ttggatttgg agaatacagt gaaatcagtg aagatattac attagtggaa gatggttttt 5881 ggataacaga aacaagtttt atacttacta tcatagttgg gatatttctg gttgccacag 5941 tcccactgac ctttgcctgt cactgaagct ggggctcaca gatcagctag gccggctggc 6001 caacagatcc ccgagatctg cctgcctctg acctctacct ccaacactgg ggctacagat 6061 gtgtgctaca ttctcagtat ttaactgggt gctgaggaac caagcacagg tcctcatgct 6121 cgtaagtctg gcatagaagc ttgaaaaacc acaaagctac caaggaaggc ctctcagttc 6181 tcaacgacaa tgaccaagag ttggctgagc ttcgaggact ggcggctgga gtgggcctgg 6241 ccaatgcctg ctatgcagta catactcttc caacccaaga ggagattgaa agtcttcccg 6301 ccttccctcg ggagaagctg agcctgcgcc ttctgttggg aagtggagct tttggagaag 6361 tgtacgaggg cacagctgta gacatcctag gacggggaag tggagaaatc aaggtggccg 6421 tgaagaccct gaagaaaggt tcgacagacc aggagaagat cgagttcctg aaggaggcac 6481 acctgatgag caagtttaat caccccaaca ttctgaagca gctgggagtc tgtctgctga 6541 gtgaacccca gtacattatc ctggaactga tggaaggggg agaccttcta agctatctgc 6601 gcaaagcccg agggacaacg ttgtctggcc ctttactcac attggctgac ctggtagagc 6661 tgtgtgtaga tatttcaaaa ggctgcgtct acttggagca gatgcacttc attcacaggg 6721 atctggcagc tcggaattgc cttgtgtctg tgaaagacta taccagtcct cgggtagtca 6781 agatcggtga ctttggtttg gcaagggaaa tctataagca tgattattat agaaagagag 6841 gggaaggcct gcttcctgtc cggtggatgg ctcctgaaaa cttgatggat ggaatcttca 6901 cttcccagtc tgatgtatgg tcttttggaa ttttggtttg ggagatttta actcttggtc 6961 atcaacctta tccagcgcat tccaaccttg atgttttaaa ctatgtgcaa gcaggaggga 7021 gactggagcc accgagaaac tgtcctgatg atctgtggaa tttaatgttc cgatgttggg 7081 cccaagaacc tgaccaaaga cccactttct ataacattca agaccagctt cagttattca 7141 gaaatgtttc cttaaacaat gtttctcact gtggacaagc agctcctgct ggtggagtca 7201 tcaacaaagg ctttgaaggt gaagacaatg aaatggccac tttgaattca gatgacacga 7261 tgccagttgc cttgatggaa accaggaacc aagaaggatt aaattatatg gtacttgcca 7321 caaagtgtag ccaaagtgag gatcgttatg agggtcctct aggctctaag gaatctgggt 7381 tgcatgatct gaagaaagac gagaggcaac cagcagacaa agatttctgc cagcaaccac 7441 aggtggctta tggctctcct ggccactctg aaggcctgaa ctatgcctgt cttgctcaca 7501 gtggacatgg agatgtgtct gaataatagt atctcatagg aaacatagca ctgagatgaa 7561 cactgtatta agttaaaaag aagaaaggtg gggtggcagt ctagactctg aactgacaca 7621 gccaagttcc aaagttctga tcttggttcc aagagccatt atgtttcatt cagcattctc 7681 tttaccagtg acgtaacctt cagtggatta tcagaggaac ctgtgtgtgt gcggaaatcc 7741 caggacaaat cctaagtctg ggaagaaaac atcactgtct ctctcctctg aagcccttta 7801 cttcagagca ttgcctgccc tggcaatctt actaggttca tgcaaggatg tgagtggggg 7861 aggggccgga gtctgctgag gaccacctga actacagatt accttaagag gatgcaggaa 7921 acaattactc acacaggagg aagcagcctg tggaccatga ggaatcatct ggcacgctat 7981 tattccaata aaatattccc tttaatcatc // LOCUS RATCROS1C 7902 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Rat heart-derived c-ros-1 proto-oncogene mRNA, complete cds. ACCESSION M35106 KEYWORDS c-ros-1 proto-oncogene; tyrosine kinase. SOURCE Rat (strain Fischer) heart, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 7902) AUTHORS Matsushime,H. and Shibuya,M. TITLE Tissue-specific expression of rat c-ros-1 gene and partial structural similarity of its predicted products with sev protein of Drosophila melanogaster JOURNAL J. Virol. 64, 2117-2125 (1990) STANDARD simple staff_review FEATURES from to/span description pept 402 7418 c-ros-1 tyrosine kinase (put.) mRNA < 1 7902 c-ros-1 mRNA BASE COUNT 2171 a 1775 c 1899 g 2057 t ORIGIN 1 catagctcag ccaacctcaa agaagtgcgg tggctggccg acctgagtgt tctgcgtcag 61 gactgtgtgg actggctcgc tggaaagcaa tctaagttcc tactgcttat tttgcatgtg 121 gagagctctt ccacgatcta gcctttagcc agggaacgtc tttcattatg ggagtaaaag 181 gaagctaaac tataaaatag tcttgctgcg atgttctggg ctatctgaga tccaaaggtc 241 taaaccggtt tcaataagag agtacgatat tctaacatcg caaaagaaaa cagataaccc 301 accaagctca cttgcaaccg aagtatgaag cctaaagaat tgttaaagca acatggagac 361 atgaggacgc cagccgtgta ggaagctggc cttcctgagg gatgaagagg atccgctggc 421 tcaccccaaa acctgcgacc tttgtggtcc ttgggtgcgt atggatttcc gtggcgcagg 481 gtaccattct gagcagctgc ctaacgtcct gtgtaactaa cttgggcagg cagcttgaca 541 gcggcacccg gtacaatctg agtgaggcat gcatccaagg atgtcagttt tggaactcta 601 tagatcagga gaagtgtgct ttgaagtgta atgatacata tgtcaccatt tgtgagaggg 661 agtcctgtga ggtcggctgc agcaacgcgg agggtagcta cgaagaggaa gtgctggaca 721 acacagagct tcctacagca cccttcgcat cttccattgg aagtaacggg gtgacattac 781 gatggaaccc tgccaacatc tctggagtaa aatacatcat tcagtggaaa tatgcccaac 841 ttccgggaag ctgggcttac acagaaactg tgtctaagct ctcatacatg gtggaacccc 901 tgcatccatt tactgaatat atttttcgag tggtttggat tttcacagcc cagctgcacc 961 tttattcccc gccaagtccc agttacagga ctcatcctta tggagttcca gaaactgcgc 1021 ctttcatcac gaacatcgaa agctcgagcc ctgacactgt ggaggtcagc tgggctccac 1081 cctatttccc aggtggacct attttgggtt ataatttaag gctgatcagt aaaactcaaa 1141 aattagattc agggacacag agaaccagtt tccagtttta ttctactctt ccaaacacca 1201 cttacaggtt ttctatcgca gcagtcaatg aagtcggtga ggggccagaa gcagaatcta 1261 tgattaccac tccatcccca gcagttcaag aagaagaaca atggctcttt ttatccagaa 1321 aaacttctct aagaaagagg tctttgaagt acttagtaga cgaagcacat tgcctttggt 1381 cagatgctat acgtcataat attacaggaa tatcagtcaa cactcagcag gaagtggttt 1441 atttctcaga aggaaccatc atatggatga agggggctgc taacatgtct gatgtgtctg 1501 acctgaggat cttttatcga ggctcagctc tagtctcttc tatctctgta gactggcttt 1561 accaaaggat gtatttcatc atggataatc gggtgcatgt ctgtgactta aagcattgct 1621 caaatcttga ggaaatcact ccattctcta ttgttgcacc tcaaaaagtt gtggttgatt 1681 cctacaatgg gtatgtcttt tatctcctaa gagacggcat ttatagagtc catcttcctt 1741 tgccgtctgt cagggacacc aaagctgtgc gtattgtgga gagtggcaca ttaaaggact 1801 tcgcagtaaa gccgcagtcc aagcgaatca tttacttcaa tggcaccatg caagtcttca 1861 tgtcgacatt tctggatggc tcggcattcc acagggttct gccgtgggtc ccccttgcgg 1921 atgtgaagag ctttgcttgt gaaaacaatg acttcctcat cacagatggc aaggccattt 1981 tccaacagga ctctctgtct ttcaatgagt tcatcgtggg atgtgacctg agtcacatag 2041 aagaatttgg gtttggtaac ttggtcatct ttggctcctc cgtccagtcg taccctctgc 2101 caggccatcc acaggaggtc tcggtgctgt ttggttctcg agaggccctt attcagtgga 2161 agcctccgat tctcgccata ggagccagtc cttccgcctg gcagaactgg acttatgagg 2221 tcaaagtttc ctcccaggac attctggaaa ccactcaagt tttcttgaac ataagcagga 2281 ctgtgctgaa tgtacccaag ctgcaaagtt ctacaaagta catggtgtct gtgcgagcaa 2341 gttctcctaa aggcccaggc ccatggtcag aaccctcagt gggtactacc ttggtaccag 2401 ccactgagcc accgttcatc atggctgtga aagaagatgg gctttggagc aaaccactca 2461 gtagttttgg cccaggagag ttcctatcct ctgacgtagg aaacgtgtca gatatggatt 2521 ggtataacaa cagcctctac tacagtgaca caaaaggcaa tgtgtatgtg cggcctctga 2581 atgggatgga tatctcggag aattaccaca tatccagcat tgcaggagct tgtgccttgg 2641 cctttgaatg gctgggtcac tttctctact gggctgggaa gacatatgtg attcaaaggc 2701 agtctgtgtt aacgggacac acagacattg tgactcacgt gaagctgttg gtgaatgaca 2761 tggccgtgga tccagttggt ggctatctgt actggacgac gctctactcg gttgaaagca 2821 ccagactcaa tggagaaagt tctcttgtac tacaggctca gccctggctc tctggaaaaa 2881 aggttattgc tctaacatta gacctcagcg atgggctcct gtactggctg gtgcaggaca 2941 atcagtgtat tcacctgtac acggctgttc tccggggatg gagtggtgcg gatgctacca 3001 tcaccgagtt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactaca 3061 gcggtagact cttctggatc aatggcttta ggatcatcac agcacaggaa ataggtcaga 3121 gaaccagcgt gtctgtttct gagccaggga aattcaatca gtttacgatc atacagacat 3181 ccctcaagcc tctgccaggg aacttttcct ctactcccac ggttatccca gattctgttc 3241 aggagtcctc atttcgaatt gaaggacaca cttcaagttt ccgaatcctg tggaatgagc 3301 cccctgcggt ggactggggc atagttttct acagtgtgga atttagtgct cattctaagt 3361 tcctggctat tgaacaacag tctttacctg tttttactgt ggaaggactg gagccctatg 3421 ccttatttaa tctttctgtc actccttata cctattgggg aaaaggtcaa aaaacatctc 3481 tatcatttcg agcgcctgaa tcagttccgt cagcaccaga gaaccccaga atatttatat 3541 tgtcacttgg aagatacacc aggaagaatg aagtcgtggt agagtttagg tggaataaac 3601 ctaagcatga aaatggagtg ctaaccaaat ctgaaatctt ctaccacata tctaaacaaa 3661 gtggcacaaa taaatcaacg gaagactggg tatctgtcag cgttacaccg ccggtgatgt 3721 cttttcaact tgaagccatg agtcctgggt atattgtttc cttccaggtt cgagtcttca 3781 cctccaaagg gccaggacca ttttctgata tagtgatgtc taaaacatca gaaatcaagc 3841 catgtccata tctcatatct cttcttggca ataagattga gttcttagac atggaccaaa 3901 atcaagttgt gtggacattt tccctggagg gagccgtcag cacagtgggg tacacagcgg 3961 atgatgaaat ggggtatttc gctcaaggag atgcactctt ccttctgaat ttgcacaatc 4021 attccagctc caagcttttc caggacgtgc tggcttctga cattgcggtt attgctgttg 4081 actggatcgc aaggcacctc tactttgctc tgaaagcatc gcaagatgga acacagatat 4141 tcgatgttga ccttgaacac aaggtgaaat cccccaggga ggtgaagatt tgcaaaagcc 4201 atacagcaat aatttctttc tctatgtatc ccctcttaag tcgcctgtat tggacagaag 4261 tttcagatct gggctaccag atgttctact gcaatattag cagtcacacc ttgcatcacg 4321 ttctacaacc caaggcctca aaccagcatg gaaggagaca gtgttcttgt aatgtgacag 4381 aatccgagtt aagtggggca atgactgtgg acacgtctga tccagacaga ccttggatat 4441 actttaccaa acagcaagag atctgggcca tggatctgga aggatgtcag tgttggaaag 4501 tcatcatggt acctgctacc cctggaaaaa gaatcattag tttaacagtg gatggggagt 4561 ttatatattg gatcacaaca atgaaggacg acacagaaat ttatcaagca aagaagggaa 4621 gtggggccat cctctcccag gtgaaggccc ccaggagtaa gcatatcttg gcttacagtt 4681 cagctctgca accttttcca gataaagcat atctgtctgt agcttccaat atggtagaag 4741 caagtatatt gaatgccacc aacaccagcc tcattctcaa gttacctcca gtcaagacaa 4801 acctcacgtg gcatggaatt accactccca cgtcaacata cctggtttac tatatggagg 4861 ctaatagggc aaacagctct gacaggaaac acaacatgtt ggaatcacag gagaatgtag 4921 cccggattga aggtctgcag ccattttcaa catacgtgat tcagatagct gtgaagaact 4981 attattctga tcctttagaa catctctctc tgggaaaaga gattcaagga aaaactaaaa 5041 gtggagtgcc cggggcagtt tgtcatatca atgcaactgt gctgtcggac accagtcttc 5101 ttgtattctg gacagaatcg cataaaccaa acggacccaa agagttagtc cgctatcagt 5161 tggttatgtc atacctggct ccgattcctg agactcctct aagacaggac gaatttccaa 5221 gcgccaggct ttctctactt gtcactaaac tctctggtgg acaacaatat gtgctgaaga 5281 tccttgcctg ccactcagag gaaatgtggt gtactgagag tcatcctgtc agtgtcaaca 5341 tgtttgacac accggagaaa ccttctgcct tggttccaga gaacactagt ctgctgttgg 5401 attggaaggc tccgtctaac gctaacctca ccagattttg gtttgaactc cagaagtgga 5461 agtatagtga gttttaccat gtcaaggctt catgcagcca aggtccagtt tatgtctgta 5521 acatcgcaaa tctgcagcct tacactcctt ataacatccg agtggtggtg gtctatacga 5581 caggagaaaa tagctcctcg attcccgaga gcttcaagac aaaagctgga gtcccaagca 5641 aaccagggat tcctaagtta ctagaaggga gtaaaaattc aatccagtgg gaaaaagccg 5701 aagataacgg gaacagattg atgtactaca ccctggaggt cagaaaaagc atttcaaatg 5761 actcacggga ccagagttta aggtggacgg cggtgtttaa tgggtcctgc agtagcattt 5821 gcacatggag gtcaaaaaac ctaaaaggaa ctttccagtt cagagcagta gcgtcaaatg 5881 ctattggatt tggagaatac agtgaaatca gtgaagatat tacattagtg gaagatggtt 5941 tttggataac agaaacaagt tttatactta ctatcatagt tgggatattt ctggttgcca 6001 cagtcccact gacctttgtc tggcatagaa gcttgaaaaa ccacaaagct accaaggaag 6061 gcctctcagt tctcaacgac aatgaccaag agttggctga gcttcgagga ctggcggctg 6121 gagtgggcct ggccaatgcc tgctatgcag tacatactct tccaacccaa gaggagattg 6181 aaagtcttcc cgccttccct cgggagaagc tgagcctgcg ccttctgttg ggaagtggag 6241 cttttggaga agtgtacgag ggcacagctg tagacatcct aggacgggga agtggagaaa 6301 tcaaggtggc cgtgaagacc ctgaagaaag gttcgacaga ccaggagaag atcgagttcc 6361 tgaaggaggc acacctgatg agcaagttta atcaccccaa cattctgaag cagctgggag 6421 tctgtctgct gagtgaaccc cagtacatta tcctggaact gatggaaggg ggagaccttc 6481 taagctatct gcgcaaagcc cgagggacaa cgttgtctgg ccctttactc acattggctg 6541 acctggtaga gctgtgtgta gatatttcaa aaggctgcgt ctacttggag cagatgcact 6601 tcattcacag ggatctggca gctcggaatt gccttgtgtc tgtgaaagac tataccagtc 6661 ctcgggtagt caagatcggt gactttggtt tggcaaggga aatctataag catgattatt 6721 atagaaagag aggggaaggc ctgcttcctg tccggtggat ggctcctgaa aacttgatgg 6781 atggaatctt cacttcccag tctgatgtat ggtcttttgg aattttggtt tgggagattt 6841 taactcttgg tcatcaacct tatccagcgc attccaacct tgatgtttta aactatgtgc 6901 aagcaggagg gagactggag ccaccgagaa actgtcctga tgatctgtgg aatttaatgt 6961 tccgatgttg ggcccaagaa cctgaccaaa gacccacttt ctataacatt caagaccagc 7021 ttcagttatt cagaaatgtt tccttaaaca atgtttctca ctgtggacaa gcagctcctg 7081 ctggtggagt catcaacaaa ggctttgaag gtgaagacaa tgaaatggcc actttgaatt 7141 cagatgacac gatgccagtt gccttgatgg aaaccaggaa ccaagaagga ttaaattata 7201 tggtacttgc cacaaagtgt agccaaagtg aggatcgtta tgagggtcct ctaggctcta 7261 aggaatctgg gttgcatgat ctgaagaaag acgagaggca accagcagac aaagatttct 7321 gccagcaacc acaggtggct tatggctctc ctggccactc tgaaggcctg aactatgcct 7381 gtcttgctca cagtggacat ggagatgtgt ctgaataata gtatctcata ggaaacatag 7441 cactgagatg aacactgtat taagttaaaa agaagaaagg tggggtggca gtctagactc 7501 tgaactgaca cagccaagtt ccaaagttct gatcttggtt ccaagagcca ttatgtttca 7561 ttcagcattc tctttaccag tgacgtaacc ttcagtggat tatcagagga acctgtgtgt 7621 gtgcggaaat cccaggacaa atcctaagtc tgggaagaaa acatcactgt ctctctcctc 7681 tgaagccctt tacttcagag cattgcctgc cctggcaatc ttactaggtt catgcaagga 7741 tgtgagtggg ggaggggccg gagtctgctg aggaccacct gaactacaga ttaccttaag 7801 aggatgcagg aaacaattac tcacacagga ggaagcagcc tgtggaccat gaggaatcat 7861 ctggcacgct attattccaa taaaatattc cctttaatca tc // LOCUS HUMFVIIIM 65 bp ds-DNA PRI 25-JUL-1990 DEFINITION Human mutant coagulation factor VIII exon 13 duplication region. ACCESSION M34731 KEYWORDS coagulation factor VIII. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 65) AUTHORS Murru,S., Casula,L., Pecorara,M., Mori,P., Cao,A. and Pirastu,M. TITLE Illegitimate recombination produced a duplication within the FVIII gene in a patient with mild hemophilia A JOURNAL Genomics 7, 115-118 (1990) STANDARD simple staff_review COMMENT As a result of illegitimate recombination of two misaligned chromosomes, exon 13 of the factor VIII is duplicated in its entirety. The exon undergoes normal splicing and its incorporation into the mRNA generates an unstable protein. FEATURES from to/span description recomb 25 26 chromosome DNA end; misaligned chromosome DNA start BASE COUNT 26 a 3 c 11 g 25 t ORIGIN 1 aagttttagg ggtacatgtg cacaattagt ttgaaataat ttaattagtt tgaaataatt 61 taaaa // LOCUS EUBBAIA3 2596 bp ds-DNA BCT 25-JUL-1990 DEFINITION Eubacterium sp. baiA3 protein gene, complete cds. ACCESSION M34658 KEYWORDS . SOURCE Eubacterium sp. (strain VPI 12708) DNA. ORGANISM Eubacterium sp. Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 135 to 2242) AUTHORS Gopal-Srivastava,R., Mallonee,D.H., White,W.B. and Hylemon,P.B. TITLE Multiple copies of a bile acid-inducible gene in Eubacterium sp. strain VPI 12708 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 134; 2243 to 2596) AUTHORS Gopal-Srivastava,R., Mallonee,D.H., White,W.B. and Hylemon,P.B. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by D.H.Mallonee, 24-MAY-1990. FEATURES from to/span description pept 1165 1914 baiA3 protein gene BASE COUNT 778 a 521 c 633 g 664 t ORIGIN 1 tccctgtgct ttcttctgca gttcataaaa tccgccgcca caaatccaag aatccacaat 61 agactcagaa gcaaggcgta ttccagcgca tccattggga tattatacaa atagaatagc 121 aaggcaaata tggccatgat cccggcgtac atgcctattc ccctgatatg atccctgata 181 tatcttcctg tcaatctcat gcctgcacca tatatcctat tccttttttc gttacgatcc 241 atttgcattt atcctgtctg atgcggatta tgcatcgtat accgctggcc ttacagagga 301 taacaaagag gatatggttt tctttaacgt gaaggatgtg atggatactt atccattcgc 361 caaagaactg gaagaagaat atatcgcgca tgccacagat atctcggacc attattttct 421 ttatgatgcc cgcgaagaag aacttgcaaa aaaagcaggg gaaccctaca catattcagg 481 cagggtaggg atgacggcgg acaatccgga acttcttcag gactggaaat atgcgcctgc 541 cttcaaagtt cttacaaaag gggaggttat gcagatgatt gcggtattcg tgatgcttag 601 cgcctacatt gcgataattg ccctggcggc aatcggggtt atgacttatg taagaagcgt 661 taccattgct gtcgataaca ggcagctgtt cgaggatatg aagaagctgg gggccagccg 721 ggattatgag acgcgggtgg taaaagtaca gcttcgcaag atcttcttat atcccggtat 781 cgcaggatgc gggatatccc tggtctttac ggtcctgatg ctctttttta acaatatgcg 841 cctggaaatt gaagaaatca ggctgatcgg aatcgagagc attatgattg gggcatccgc 901 catcttcctg tacgtactgt accggatctc ttttcggaag atgagaagca tgctggatct 961 atagggaaac aaaatagtga tagtgtttgc aaactttttg tccatggact gcttatattt 1021 tgcaattaaa aaagaacttt acaagttgta agatgccgtg tgattttcca atgtcgcgtc 1081 ctgtaaaatg ttaaagttgt atcaatcgat acgatacttt ggcagatatg ataagccaaa 1141 ggaaaagaaa ggaaggaaaa gttcatgaaa cttgtacagg acaaaattac aattatcaca 1201 ggcggaaccc gtggaatcgg attcgcagca gcaaaactct ttattgagaa tggagcaaaa 1261 gtctccatat ttggcgagac ccaggaagag gtagacacag cgctggctca gttaaaggaa 1321 ctctatccgg aggaagaggt attaggattc gctccagacc ttacatcaag agatgctgtt 1381 atggcagcag ttggaacggt tgcacagaag tacggaagac tggatgtcat gatcaacaac 1441 gcaggcatta caatgaattc tgtattctcc agggtatcag aagaggattt caaaaatata 1501 atggacatca atgttaacgg cgtattcaat ggcgcatggt ctgcttatca gtgcatgaaa 1561 gatgcaaagc agggcgttat catcaatacg gcatctgtaa ccggaatcta tggttcctta 1621 tcaggaatcg gatatcctac cagcaaggcg ggcgtaatcg gcctgactca tggtcttgga 1681 agagagatta tccgtaagaa catccgtgta gttggcgttg cacctggcgt tgtagataca 1741 gatatgacga aggggcttcc accggagatc ctggaggact acttgaagac actgccaatg 1801 aagagaatgc ttaagccgga agagatcgcg aatgtatatc tgttccttgc atccgacctg 1861 gctagcggca tcacggctac gacgatcagc gtagatgggg cttacaggcc atagaaaaga 1921 catactgcta ttaattccat agttcatact ccaagaacag gcaggcaaga ggcatttgcg 1981 ttttagcgcg gatgcccggg cctgcctgat ttaattcagc tggtatatca tgaaattcag 2041 atatgcggcg aacaggcacc atataaggta ggggatctgt agataggcgg caacaggact 2101 tatcttgtga aactgatata tcatcagggc tatgaggatg ataagcacga gaagccataa 2161 aaatgcaaag aggtacatgg aaaagccgaa aaagaatatg ctccagagga agttgaagaa 2221 cagctggata aaatatagtc gaagcgcctt attcttttca ggagtttcgg attcatagat 2281 tatataagaa gatatcccca ttaatatata taatatggtc cagacgatgg gaaataggaa 2341 ggacggagga ctaagaggcg gcttattcaa tgccaaatag gccgccgaat tgccgcttaa 2401 gagagcagac aaggatcctg ccgcaagagg aataaggata aaaataatga gagcgctttt 2461 gtttttgatg ttcatatata ccggctccag gcatgacttt caatattata tgaaaaatct 2521 ccgggaaata tgaacggtat ctccggcttt acttgccgct ctttgacttg cccgccgtct 2581 ctttgagcag ttccag // LOCUS ECOTRAU 1080 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli F plasmid transfer operon: traU gene, complete cds; traW gene, 3' end; and trbC gene, 5' end. ACCESSION M34695 KEYWORDS periplasmic protein; transfer operon. SOURCE E.coli F Plasmid (strain K12; isolate Flac plasmid FLO) DNA, clones pKI[182;282;175]. ORGANISM Plasmid F Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1080) AUTHORS Moore,D., Maneewannakul,K., Maneewannakul,S., Wu,J.H., Ippen-Ihler,K.A. and Bradley,D.E. TITLE Characterization of the F plasmid conjugative transfer gene traU JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.A.Ippen-Ihler, 25-MAY-1990. FEATURES from to/span description pept 49 1041 traU protein precursor sigp 1 66 traU protein signal peptide matp 67 1038 traU protein pept < 1 52 traW protein pept 1050 > 1080 trbC protein BASE COUNT 242 a 281 c 311 g 246 t ORIGIN Map position 77.9-78.9 units on the genome. 1 cgatcgcttc ctgaaggtgg aatttattcc ggcagaggag ggcagaaaat gaagcgaagg 61 ctgtggctgc tgatgttatt ccttttcgcc ggtcatgtcc ctgcggcgtc tgcggattct 121 gcctgtgagg ggcgttttgt aaacccgatc acagatatct gctggagctg tattttcccg 181 ctctcgctgg gcagtatcaa agtcagtcag ggcaaggtcc ccgacacggc gaacccgtcg 241 atgcccattc agatttgtcc ggcaccgccg ccgctgttca ggcgtatcgg gctggccatt 301 ggttactggg agccgatggc gttgacggac gtcacccggt caccgggatg catggtgaac 361 ctgggcttca gcctgccggc ttttggtaaa acggcacagg gaacggcgaa aaaggatgag 421 aagcaggtaa atggggcgtt ctatcacgtt cactggtaca aatacccgct gacgtactgg 481 ctgaacatca tcacatcgct gggctgtctg gaaggtggtg acatggatat cgcttatctt 541 tctgaaatcg accccacctg gacggacagc agcctgacca ccattctcaa tccggaagct 601 gtcatctttg ccaatccgat agcacaggga gcctgcgcag cagatgcgat tgccagcgcc 661 tttaatatgc ctctcgatgt tctgttctgg tgtgccggtt cgcagggaag tatgtacccg 721 ttcaatggct gggtgagtaa tgagtccagt ccgttgcagt cctccctgct ggtcagtgaa 781 cgcatggcgt tcaagctgca ccgtcagggc atgattatgg aaaccatcgg gaaaaataac 841 gccgtctgta atgaatatcc gtccccaatc ctgcccaaag aacgctggcg ttaccagatg 901 gtgaatatgt atccggacag cgggcagtgc cacccgttcg ggcgcagcgt gacccgctgg 961 gaaaccggga aaaatccgcc caacacaaag aaaaacttcg gctacctgat gtggcgtaaa 1021 cgtaactgtg tcttcctgtg aggtgaatga tgaagctgag tatgaaatct ctggcagcac // LOCUS MUSSMRNAA 74 bp ss-RNA RNA 25-JUL-1990 DEFINITION Mouse brain-specific small RNA, clone pABr-4. ACCESSION M35067 KEYWORDS small RNA. SOURCE Mouse 17-day fetus, cDNA to RNA, clone pABr-4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 74) AUTHORS Anzai,K., Kobayashi,S., Suehiro,Y. and Goto,S. TITLE Conservation of the ID sequence and its expression as small RNA in rodent brains: Analysis with cDNA for mouse brain-specific small RNA JOURNAL Mol. Brain Res. 2, 43-49 (1987) STANDARD simple staff_review FEATURES from to/span description RNA < 1 > 74 brain-specific small RNA BASE COUNT 12 a 16 c 28 g 18 t ORIGIN 1 ggggttgggg atttagctca gtggtagagc gcttgcctag caagcaaggc cctgggttcg 61 gtcctaagct ctgg // LOCUS MUSSMRNAB 74 bp ss-RNA RNA 25-JUL-1990 DEFINITION Mouse brain-specific small RNA, clone pABr-9. ACCESSION M36619 KEYWORDS small RNA. SOURCE Mouse 17-day fetus, cDNA to RNA, clone pABr-9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 74) AUTHORS Anzai,K., Kobayashi,S., Suehiro,Y. and Goto,S. TITLE Conservation of the ID sequence and its expression as small RNA in rodent brains: Analysis with cDNA for mouse brain-specific small RNA JOURNAL Mol. Brain Res. 2, 43-49 (1987) STANDARD simple staff_review FEATURES from to/span description RNA < 1 > 74 brain-specific small RNA BASE COUNT 11 a 15 c 28 g 20 t ORIGIN 1 ggggttgggg atttagctta gtggtagagc ttgcctagca agcgcaaggc cctgggttcg 61 gtccttagct ctgg // LOCUS BOVPRLB 1214 bp ds-DNA MAM 25-JUL-1990 DEFINITION Bovine prolactin gene, exon 5. ACCESSION M34535 KEYWORDS prolactin. SOURCE Bovine pituitary DNA, and cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 628) AUTHORS Carroll,S.M., Narayan,P. and Rottman,F.M. TITLE N-6-methyladenosine resides in an intron-specific region of bovine prolactin pre-mRNA JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 629 to 1214) AUTHORS Carroll,S.M., Narayan,P. and Rottman,F.M. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Narayan, 16-MAY-1990. The cDNA sequence which corresponds to this gene is found in J.B.C. 257: 678-681 (1982), accession number M25007. Author address: P.Narayan Dept. of Molecular Biology and Microbiology School of Medicine Case Western University Cleveland, OH 44106 FEATURES from to/span description pept / 629 820 prolactin, exon 5 (AA at 629) pre-msg < 1 971 prolactin mRNA and intron IVS < 1 628 prolactin intron D BASE COUNT 400 a 229 c 197 g 388 t ORIGIN 1 gtgagcttca tgaaagcttc cttgctattt tcatgaatga gagaggtgat ttctgtaatg 61 aggaatgagt tttgaactat ctcactgtac aagaacacaa ttcaggcctt ctttttctag 121 accggtgtta cataaagcaa gaacctgttc attcatagtg atagattcta ttgtaagtga 181 attagaattc caccagcaat ttttcacaga ggtatagtct ttcttgaatt gtacagttac 241 accaaaatct tgcctcttcc tgggtacaga tggctgaaat attttcaagg ataagagaat 301 tagagaatac aatttgcaag ataaatgttt tcttcaaaat atcccaagat atcctctact 361 gaaattcagc ttgtattctt tctctattct cctcaaacca caggatgaga atgagaagaa 421 agaaaagaga agatcaaaac caaatacttg agttctgctt tagtttttat taataaatta 481 ctaacatata tctgatacac tggctccaaa atccaagtgt agagactttc atgtatcttc 541 cctaattttt aatttgataa atagaaagaa caaagatgag ctaatactac taaaactcat 601 aataactcat tatcttttgg atgtttaggt tattcctgga gccaaagaga ctgagcccta 661 ccctgtgtgg tcaggactcc cgtccctgca aactaaggat gaagatgcac gttattctgc 721 tttttataac ctgctccact gcctgcgcag ggattcaagc aagattgaca cttaccttaa 781 gctcctgaat tgcagaatca tctacaacaa caactgctaa gcccacattc catcctatcc 841 atttctgaga tggttcttaa tgatccattc cctggcaaac ttctctgagc tttatagctt 901 tgtaatgcat gcttggctct aatgggtttc atcttaaata aaaacagact ctgtagcgat 961 gtcaaaatct aagactgcaa ttttgtcaat gtttcttatc ttcatttaat agacaatcaa 1021 atgaaaatcc ttccttatga ttgagagaaa gaacttctga ttaaaatttg tcacaaatag 1081 cagaaactga cattacaaag accgttaata acttacttta gaatcacagc aaattattct 1141 ggggtcaagt tattagaatt aaaaattaga taaacattca ttgtgttggt catgctacca 1201 agaagactga attc // LOCUS RHMCYA 1560 bp ds-DNA BCT 25-JUL-1990 DEFINITION R.meliloti adenylate cyclase (cyaA) gene, complete cds. ACCESSION M35096 KEYWORDS adenylate cyclase. SOURCE R.meliloti DNA. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 1560) AUTHORS Beuve,A., Boesten,B., Crasnier,M., Danchin,A. and O'Gara,F. TITLE Rhizobium meliloti adenylate cyclase is related to eucaryotic adenylate and guanylate cyclases JOURNAL J. Bacteriol. 172, 2614-2621 (1990) STANDARD simple staff_review FEATURES from to/span description pept 885 1466 adenylate cyclase (cyaA) binding 871 879 ribosome binding site BASE COUNT 292 a 512 c 487 g 269 t ORIGIN 1 ggatcctgtt cctggacgcg agcggcctgc agtttgccga acgtcacgct gcctccaacg 61 gcttcgatcc gaggacgcgg ccctggtacc gcgcggccgt caacggcaag gcgccggtgg 121 ccatcggtcc ctatgagatg gccaccacag gcaatctcgg gatgaccata tcgcaagcgc 181 accgcggcaa cccccaaatc gtcatcggcg ccgatgtcgt tctcgatacg atcacggatt 241 ttctgtcccg cgagcggctg accgacgact cggtttcctt cgtgctcgat gcggtgggac 301 gaccgatcat ccactccgac tccaccatga tgcggcgcat catggcatcg aagggccggg 361 accggccggt ggccacgccg caggaggatg gactgatcga gagcatccgg cgcaacccgc 421 caccggccgg aaaggcaact ctcgtcgaag tcggaaaccg cacctatctc gtcacggtgg 481 cgccgctcga atcggcattg cttctgtccg ggcaccgggt ggtcgtcgcc gcccctctcg 541 acgagctgct ggcggccgca aacgagacgc tcgttcaggg acttgccgtc tcgggcgccg 601 tggtggtggt cgccgttctc ctggccctcg tgcttgcgca tctgatcacg aagtcgctca 661 accagctcac cgacagcgcc aaccgcctgc aggacctgga tttcgccact cctatcgacg 721 tttcgtcgca tgtggcggaa atctcgacgc tcaacggcgc aatgaacagg gctcgcgacg 781 cgatcttcac cttcgcgctc tatgttccga aggagctggt gcgcaagggc atcgaatccg 841 gccatttcgg cggccgcgcc gcatggcggc aggaggtgac ggcgatgttc accgacatct 901 acgacttcac caccatcagc gagggccggt cgccggaaga agtggtcgcg atgctctcgg 961 agtatttcga cctgttcagc gaggtcgtcg ccgcccacga cggaaccatc atccaattcc 1021 atggagactc ggtctttgcc atgtggaacg cgccggtcgc cgataccagg catgccgagc 1081 atgcctgtcg atgcgcactc gcggtcgagg agaggctcga ggccttcaat tctgcgcaac 1141 gcgccagcgg attgccggag ttccgcaccc gcttcggcat ccacaccgga acggccgtcg 1201 tcggcagcgt cggcgccaag gaacggctgc aatatacggc gatgggcgac acggtgaacg 1261 tcgcctcgcg gctcgagggc atgaacaagg attacggcac gagcgttctt gcaagcggcg 1321 cggtggtcgc ccaatgcaaa gacatggtga agttccgccc gctcggcacc gccaaggcaa 1381 agggccgttc gacggcgctc gacatttacg aagtcgtggg cgtcgtccgc gcggtgaaca 1441 ctaccgaagc cggaacggcc gcctgaggaa aggcagatgc cgcggcgaac ggcggccccg 1501 ctgaattcgc ttcgaaactc tgaaagcaaa aaagcccgga aacccgggct ttttttgact // LOCUS ECOCYSD 492 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli sulfate adenylate transferase (cysD) gene, 5' end. ACCESSION M35098 KEYWORDS sulfate adenylate transferase. SOURCE E.coli (strain K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 492) AUTHORS Malo,M.S. and Loughlin,R.E. TITLE Promoter elements and regulation of expression of the cysD gene of Escherichia coli K-12 JOURNAL Gene 87, 127-131 (1990) STANDARD simple staff_review FEATURES from to/span description pept 412 > 492 sulfate adenylate transferase (cysD) mRNA 373 > 492 cysD mRNA BASE COUNT 141 a 110 c 118 g 123 t ORIGIN 1 ctgcaggagt tccggtcatg cgtcccggaa agaaagtagc aatatgtcgt gcctgagtat 61 tagcaaaatc gccaggttta ggtgacgagg cgtgtacggg gagaataaag catacgccga 121 gcgccagggc agcggtacgg tggcgcaatg cggaaaacat agtgagtcct taaataccat 181 gcaaattttt ttaccgccat agtatgaaac tgccgctgcg ctaaaacaat ttcaaatctt 241 cctaaacgcc cgaaatccgg tgccttaagc actttttgat attagctttg ccaaatcgtt 301 attccgttaa ggaactactc attctaattg gtaatttcat tcgttctctt acgctcccta 361 tagtcgaaac atctgatggc aagaaaatag cggtattgca aaggaacggt tatggatcaa 421 atacgactta ctcacctgcg gcaactggag gcggaaagca tccacattat tcgcgaggtg 481 gcggcagaat tc // LOCUS CHPCOX41A 956 bp ds-DNA PRI 25-JUL-1990 DEFINITION Chimpanzee cytochrome c oxidase subunit IV (COX4P1) processed pseudogene, complete cds. ACCESSION M34599 KEYWORDS cytochrome c oxidase subunit IV; pseudogene. SOURCE Chimpanzee DNA, clone lambda-Ch1. ORGANISM Pan troglodytes Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Pongidae. REFERENCE 1 (bases 1 to 956) AUTHORS Lomax,M.I., Welch,M.D., Darras,B.T., Francke,U. and Grossman,L.I. TITLE Novel use of a chimpanzee pseudogene for chromosomal mapping of human cytochrome c oxidase subunit IV JOURNAL Gene 86, 209-216 (1990) STANDARD simple staff_review FEATURES from to/span description pept 216 665 cytochrome c oxidase subunit IV (COXIV) pseudogene (E.C. 1.9.3.1) signal 881 888 poly-A signal BASE COUNT 268 a 222 c 277 g 189 t ORIGIN Chromosome 14q21-qter. 1 ggtacctcca atcccagcta ctcgggaggc tgaggcagga gaatcacttg aactcgggag 61 gcggaggttg cagtgagctg agatcacgcc tctgcgctac agcctgggca acaagagcaa 121 aactccgtct cggaaaagaa aaaaacaaaa aagaactact ggggtcgcgg gacaccgggc 181 atagagggcg gcggtggtgg ggcagctgcg gcagaatgtt ggctaccagg gtagttagcc 241 tagttggcaa gcgagcaatt tccaccttgg tgtctgtacg agcacacgga aatgttgtga 301 agagcgatga ctatgcgctc ccagcttatg tggatcgacg tgactatccc gtacccgatg 361 tggcccatgt caagcacctg tctgccagac agaaagcctt gaagaagaag gagaaggcct 421 cctggagcaa ccgctccacg gatgggaaag tcgagttgta tcacattcag ttcaaggaga 481 gctttgctga gatgaacagg ggcgtgaacg agtggaagat ggttgtgggc gctgccatgt 541 tcttccttgg cttcacggcg ttcattatca tctgggagaa gcgctgtgtg tacggcccca 601 tcccgcacac ctttgacaaa gagtgggtgc ccatgcagac caagaggatg ctggacatga 661 ggtgaacccc tgcagggctt cgccagccaa gtgggactat gacaagaacg agtggaagaa 721 gtgaacccct gcagggcttc gccagccaag tgggactatg acaagaacga gtggaagaag 781 tgagagatgc tgtcctgctt ttgagccttg ctctgtcacc tccatactat aactccatgc 841 ctatttactg gaaacctgtt atgccaaaca gtaccactgc taataaatga ccagtttacc 901 tgaaagaaaa aaaaaaaaag aactactgaa gtgaaagaaa aatctggaga aagtac // LOCUS CHTMOMPA 682 bp ss-mRNA BCT 25-JUL-1990 DEFINITION C.trachomatis outer membrane protein (ompl) gene, 5' end. ACCESSION M35099 KEYWORDS outer membrane protein. SOURCE C.trachomatis (serovar L2/434/Bu), cDNA to mRNA. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 682) AUTHORS Kaul,R., Duncan,M.J.J., Guest,J. and Wenman,W.M. TITLE Expression of the Chlamydia trachomatis major outer membrane protein-encoding gene in Escherichia coli: Role of the 3' end in mRNA stability JOURNAL Gene 87, 97-103 (1990) STANDARD simple staff_review FEATURES from to/span description pept 568 > 682 outer membrane protein (ompl) precursor sigp 568 633 outer membrane protein (ompl) signal peptide matp 634 > 682 outer membrane protein (ompl) mRNA 1 > 682 ompl mRNA BASE COUNT 218 a 144 c 111 g 209 t ORIGIN 1 aaaaacactt tctttgtagt aataaaaacg atttctatca aaacaaattc ttagattttc 61 ttacaaaaat ctcctctttt cttttagcca aacccccatc ttcgagctat tccaaacaca 121 aaaatcttag gttttggaaa ttaacaactc ataaaaattg aactgttttg taattaactc 181 aaaaccctct cattctcaac aatcaacata ttgccaacat ggcttttgct ctcggtttca 241 cagcgatttt tttcgcaaaa accaagaaca taaaacataa aaagatatac aaaaatggct 301 ctctgcttta tcgctaaatc aggaggcgct taagggcttc ttcctgggac gaacgttttt 361 cttatcaact ttacgagaat aagaaaattt tgttatggtc tcgagcattg aacgacatgt 421 tctcgattaa ggctgctttt acttgcaaga cattcctcag gccattaatt gctacaggac 481 atcttgtctg gctttaacta ggacgcagtg ccgccagaaa aagatagcga gcacaaagag 541 agctaattat acaatttaga ggtaagaatg aaaaaactct tgaaatcggt attagtgttt 601 gccgctttga gttctgcttc ctccttgcaa gctctgcctg tggggaatcc tgctgaacca 661 agccttatga tcgacggaat tc // LOCUS ECOK99FIM 740 bp ds-DNA BCT 25-JUL-1990 DEFINITION E.coli K99 fimbrial subunit gene, complete cds. ACCESSION M35282 KEYWORDS K99 fimbrial subunit. SOURCE E.coli (strain K-12 C600) DNA, clones 1, 2, 3, 4 and 5. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 740) AUTHORS Roosendaal,B., Gaastra,W. and de Graaf,F.K. TITLE The nucleotide sequence of the gene encoding the K99 subunit of enterotoxigenic Escherichia coli JOURNAL FEMS Microbiol. Lett. 22, 253-258 (1984) STANDARD simple staff_review FEATURES from to/span description pept 70 615 K99 fimbrial subunit precursor sigp 70 135 K99 fimbrial subunit signal peptide matp 136 612 K99 fimbrial subunit pept 648 > 740 ORF1 BASE COUNT 235 a 133 c 146 g 226 t ORIGIN 1 tagggaatgg ctatgttttc tggtgattcc acggaactaa aaaataatat cgaacaatgg 61 agaatctaga tgaaaaaaac actgctagct attatcttag gtggtatggc ttttgcgact 121 accaatgctt ctgcgaatac aggtactatt aacttcaatg gcaaaataac gagtgctact 181 tgtacaattg accctgaggt caatggtaat cgtacatcaa ctatagatct tgggcaggct 241 gctattagtg gtcatggcac tgtagtggat tttaaactaa aaccagcgcc cggcagtaat 301 gactgcctag cgaaaacaaa tgctcgtatt gactggtctg gttctatgaa cagtttaggt 361 tttaataata cagcttcagg aaatactgct gctaaaggat accatatgac tttgcgcgca 421 acaaacgttg gaaatgggtc tggtggtgct aatattaata cttcattcac tacggctgaa 481 tacactcaca cttctgcaat tcagtcattt aactattcag cccagctgaa aaaagatgac 541 cgcgctccgt ctaatggtgg atataaagct ggcgtattta ctacttcagc atccttctta 601 gtcacttata tgtaatattt aaagtatttt acattgcggg catatctatg attgcccgca 661 atattactga tggatattat atgaatagaa aaaaacatca gattttaaaa attttattgt 721 tgtgtctaat aagcagtaaa // LOCUS ECORRDAA 72 bp ss-rRNA RNA 25-JUL-1990 DEFINITION E.coli 16S rRNA fragment. ACCESSION M35308 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE E.coli (MRE 600) ribosomal RNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 72) AUTHORS Ehresmann,C., Fellner,P. and Ebel,J.P. TITLE Nucleotide sequences of sections of 16S ribosomal RNA JOURNAL Nature 227, 1321-1323 (1970) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 72 16S rRNA BASE COUNT 17 a 16 c 20 g 19 t ORIGIN 1 ggcttggttt gcaagtgtca gatactgtta agcatctgaa atccccgggc taaccctggg 61 aactgatgac tg // LOCUS ECORRDAB 174 bp ss-rRNA RNA 25-JUL-1990 DEFINITION E.coli 16S rRNA fragment. ACCESSION M35309 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE E.coli (MRE 600) ribosomal RNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 174) AUTHORS Ehresmann,C., Fellner,P. and Ebel,J.P. TITLE Nucleotide sequences of sections of 16S ribosomal RNA JOURNAL Nature 227, 1321-1323 (1970) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 158 16S rRNA BASE COUNT 42 a 40 c 55 g 37 t ORIGIN 1 ggcatgaaga cacactgcta actccgaata cgcacaagcc cgtaatggag cgacggtggg 61 ccttgttccc gtgccccgat gtggggtgga ggtgactgtg ggttgtgata ttcggggagg 121 caaaagaagt agcgagtcta accttgctta ccactttgcc taatacggga aacg // LOCUS HPTRRA 117 bp ss-rRNA RNA 25-JUL-1990 DEFINITION H.aurantiacus 5S rRNA gene. ACCESSION M35310 KEYWORDS 5S ribosomal RNA; ribosomal RNA. SOURCE H.aurantiacus (strain Sengas Wie 2) ribosomal RNA. ORGANISM Herpetosiphon aurantiacus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Cytophagaceae. REFERENCE 1 (bases 1 to 117) AUTHORS Van den Eynde,H., Stackebrandt,E. and De Wachter,R. TITLE The structure of the 5S ribosomal RNA of a member of the phylum of green non-sulfur bacteria and relatives JOURNAL FEBS Lett. 213, 301-303 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 117 5S rRNA BASE COUNT 22 a 37 c 40 g 18 t ORIGIN 1 tccggtggca atgtcggagg ggtcccaccc gttcccatcc cgaacacgga agttaagccc 61 tccagagccg atggtactcc gcggggaacc gcgcgggaga gtaggtcgct gccggat // LOCUS HUMCOX4AA 634 bp ss-mRNA PRI 25-JUL-1990 DEFINITION Human cytochrome c oxidase subunit IV (COX4) mRNA, complete cds. ACCESSION M34600 KEYWORDS cytochrome c oxidase; cytochrome c oxidase subunit IV. SOURCE Human liver, cDNA to mRNA, clones pCOX4.-[111 and 4.2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 634) AUTHORS Lomax,M.I., Welch,M.D., Darras,B.T., Francke,U. and Grossman,L.I. TITLE Novel use of a chimpanzee pseudogene for chromosomal mapping of human cytochrome c oxidase subunit IV JOURNAL Gene 86, 209-216 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1 510 cytochrome c oxidase subunit IV (COX4) /hgml_locus_uid="LS0022W" /nomgen="COX4L2" /map="16q22-q24" mRNA < 1 634 COX4 mRNA BASE COUNT 156 a 157 c 180 g 141 t ORIGIN 1 atgttggcta ccagggtatt tagcctagtt ggcaagcgag caatttccac ctctgtgtgt 61 gtacgagctc atgaaagtgt tgtgaagagc gaagactttt cgctcccagc ttatatggat 121 cggcgtgacc accccttgcc ggaggtggcc catgtcaagc acctgtctgc cagccagaag 181 gcactgaagg agaaggagaa ggcctcctgg agcagcctct ccatggatga gaaagtcgag 241 ttgtatcgca ttaagttcaa ggagagcttt gctgagatga acaggggctc gaacgagtgg 301 aagacggttg tgggcggtgc catgttcttc atcggtttca ccgcgctcgt tatcatgtgg 361 cagaagcact atgtgtacgg ccccctcccg caaagctttg acaaagagtg ggtggccaag 421 cagaccaaga ggatgctgga catgaaggtg aaccccatcc agggcttagc ctccaagtgg 481 gactacgaaa agaacgagtg gaagaagtga gagatgctgc ctgcgcctgc acctgcgcct 541 ggctctgtca ccgccatgca actccatgcc tatttactgg aaacctgtta tgccaaacag 601 ttgtaccact gctaataaat gaccagttta cctg // LOCUS MRGRBMII 2574 bp ds-DNA VRT 25-JUL-1990 DEFINITION M.serrator retropseudogene-like repetitive element I (RBMI). ACCESSION M35143 KEYWORDS repetitive sequence. SOURCE M.serrator blood DNA. ORGANISM Mergus serrator Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Anseriformes; Anatidae. REFERENCE 1 (bases 1 to 2574) AUTHORS McHugh,K.P., Madsen,C.S. and de Kloet,S.R. TITLE A highly repeated retropseudogene-like sequence in DNA of the redbreasted merganser (Mergus serrator) JOURNAL Gene 87, 193-197 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 1 2574 retropseudogene-like repetitive element pept 1403 909 (c) ORF1 pept 2118 1426 (c) ORF2 BASE COUNT 691 a 600 c 484 g 790 t 9 others ORIGIN 1 gaattcctca aacacgctgc ggctgcttac ctttaataca cccgttgcat gcgatggagc 61 tgtatttctt gcttttncct gcactggaag gcttcccttc cttgtcaggt tgtttactgc 121 cctcactctt ctgcattgct cacatgaaga gccatctgga ggatgggttt cttccttctt 181 ctcccgggtt atcttctgga aacgaggacc taagtattcc aaggagcctt tcactttcct 241 ggtgtttctc cttttttttc tttttcttct cctttttctt ctttttctta tgcttgtgat 301 tggcattgtc aaagtggagc gcacagaaac acaaatcgtg aagtctgaaa gaaacatgca 361 agttaaaaag agaaaaaaag atgtggcact tgttgcctat atgaaacttt atttttttta 421 ccacaggtga tgatttgcag catgtcagct attttgtggt gctttgtgca cacgcaactt 481 acttacttta gatgcagcaa acttaagccc tcagattgaa ggaccatagg ctggtttgta 541 cacagatcat taaccatggt tagctctgga atacgtgcaa gcagaaaaaa acttttaacc 601 taatccggaa tggtgtacag atgtgattcg aactatgtgg tctaacgcta gtgctctgac 661 acaattcagc aatagctttc ctatcttcac tgaacaccta cacacagacc cagccagctg 721 atgctatcta aataacttag aaactaccag aaaaaaaaaa aaaaaaaaaa gaagaaaaaa 781 cgagaataaa aaaaaaaagt agaaaaaaaa aaaaaaagga agacatgaga agcacccaga 841 aatgaattag gataaaaaat tcggagtatg ctggaatcct tgcttacttg gaatccttct 901 ctgcatgttt aatccttaga cttctttttt cttctagaac ttgttgatat ttttgcattt 961 ttttcaccac ctaaaagctc cttttctatc tttctgtctt tcctttctat ttcactttca 1021 ctaccttctg cacgggtata ttttcttttt ctgtttcttt ctgtttcatt tttctggcga 1081 cagttctcca aatgagctga cacgggtgga agcgcatgtc tttcacgaga atgtcttctg 1141 gaatgttgct gatgtaccga gcaacgatgc aagtctgctc ggggtgtgct aaagcgacgt 1201 acatcttcct ctctcaagag ggaactgtga ggccatccgc ttttgtaatg ataactctta 1261 tgtgacctgc tgtagtaagt tgcagtcgat ttgtcaaagg ctgcatcgcc gtgagacaac 1321 tttctctctc tactgtctcc tgtcgcatga ggtgaatagt aatcattgta atagctacat 1381 ctttcccatc tccgagcntt catcctcgat agtatctntc tctgctcaac ttctttgccc 1441 tttggatcgg taatatctat tgctacctcg ttctgatctt cctccgcttg ccagatctgt 1501 actttgaata tttgacngct cttctgccat tctcagggct gtttctttca nnnnggaaag 1561 atctgcacct gcttcccccc cagtgctcct gcttgtgacg cttttgctca acaacttcca 1621 cgctctgaga acacctcctc ttgctggaag gacctgcttt ttgactctcc ttctcttcag 1681 taggagcatg ttcctcttgc tttggtaatg ctctttgtca gtgtttttag tctcgncttg 1741 tatcttggca tctctctgta atagctgagg aggaaaggtt tttagagcta cattcagtgt 1801 cagacttgag agaggaagct tgccgcaatt tctcaccagg ctcagaagac tctttgccgg 1861 acaaaacgtt ttcttttgaa atgaggtcac gttcttttca tcttcttgct ttctccttat 1921 ctccaccgtc atcacattgg tactgtgcga ggtattcatc attccagtag attttcgagg 1981 gtccgcaaca ctgcacaaaa taaaagcaca tttctcagtt ctgctgaagg acgtgaatat 2041 taagaggaaa accttccaaa agtcgaacaa acaaacaaaa acctccggac tacaggaaca 2101 ctctccaaga tatgccattt agaaacctct cctgtcatta ggacaccttc ttcagctcca 2161 cagaaagggg ttttgccctc ttgcttctga agccattgca ctaaaaagca aacgcagtgc 2221 tgtctccctc cacatgctgc tctgaataag agccagaata ttcaaaacca ctctctttgt 2281 tctcccacat agccgaaaaa acaccggttg aaacagagtt ttctacctct cgcccaacaa 2341 tttacattca catagcctat gactgaaaaa ataaaaggcg gggctgagga ggaacagcca 2401 gtgttggaaa tgaaaagaag cagcccgttc cttcatagtc ttaagcctat gctactagga 2461 aaacaaaaca aaacaaaaca aaacaagagg agaggagaac aacagcggga aattttcctg 2521 ttctccaggt gttaaattgc aaagcctcct ctggaggatc acagctgtga attc // LOCUS MUSLAMB 2789 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse lamin B mRNA, complete cds. ACCESSION M35153 KEYWORDS lamin B. SOURCE Mouse liver, cDNA to mRNA, clone FML11-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2789) AUTHORS Hoeger,T.H., Krohne,G. and Franke,W.W. TITLE Amino acid sequence and molecular characterization of murine lamin B as deduced from cDNA clones JOURNAL Eur J Cell Biol 47, 283-290 (1988) STANDARD simple staff_review FEATURES from to/span description pept 256 2019 lamin B mRNA < 1 2789 lamin B mRNA site 2768 2773 poly-A signal BASE COUNT 696 a 657 c 812 g 624 t ORIGIN 1 aataatctta agctcttaca aagagctgcg ggcgggagac tcgcgtccgg cgcacagccg 61 tctgcgtctc ccggctgccc tggcctcttc ccgcgcgcgc gtgcagtgtg cgtgtacact 121 cacaaagggc gtctggcggg cgatccgcgg ccctcccgct tcgctctttg tgcggtagcc 181 ccgccgccac cgccagccca ggtccgctcg atcctcaccg gcctgtggtt tgtaccttcg 241 gtcccgccgc ccgccatggc gaccgcgacc cccgtgcagc agcagcgggc gggcagccgc 301 gccagcgccc ccgccacgcc gctcagcccc acgcgcctgt cgcgcctgca ggagaaagag 361 gagctgcggg agctcaacga ccgcctggct gtgtacatcg ataaggtccg cagcctggag 421 acggagaaca gcgcgctgca gctgcaggtg accgagcggg aggaggtgcg cggccgcgag 481 ctcaccggcc tcaaggctct ctacgagacc gagctggccg acgcacgccg cgctctggac 541 gacacggccc gcgagcgcgc caagcttcag atcgagctgg gcaagttcaa ggccgagcac 601 gaccagctgc tgctcaatta tgccaagaag gaatctgatc tcagtggagc ccagatcaag 661 cttcgagagt atgaggcggc actaaactct aaggatgcgg cgctggcaac tgccctaggg 721 gacaaaaaga gtttagaggg agacttggag gatctgaaag atcagattgc ccagctagaa 781 gcatccttat ctgccgccaa aaagcagtta gcagatgaaa ctttacttaa agtggatttg 841 gagaatcgct gtcagagcct tactgaggac ttggagtttc gtaaaaatat gtatgaagag 901 gagatcaatg agacaaggag gaagcatgag acccgcttgg tggaagtgga ctctgggcgt 961 cagattgagt atgagtacaa gctggctcaa gccctgcatg agatgcggga gcagcacgac 1021 gcgcaggtga ggctgtacaa ggaagagctg gagcagacct accacgccaa gcttgagaat 1081 gccagactct cctcagagat gaacacttcc actgtcaaca gtgcccggga agagctgatg 1141 gagagccgga tgaggatcga gagcctctcc tcacagctct ctaacctgca gaaagagtct 1201 agagcgtgtt tggaaaggat ccaggaattg gaggacatgc ttgctaagga gagagacaac 1261 tcgcgccgca tgctgtctga cagagagaga gagatggcgg agatcaggga ccagatgcag 1321 cagcagctga gtgattatga gcagctgctg gacgtgaagc tggccctgga catggagatc 1381 agcgcctaca ggaagctcct ggaaggcgaa gaagagcggt taaagctctc tccaagccct 1441 tcttcccggg tgaccgtgtc cagagcgtcc tccagtcgca gtgtgcgcac caccagagga 1501 aagcggaaga gagttgatgt ggaggagtcg gaggcgagca gcagtgttag catttcccac 1561 tctgcctcag ccacggggaa cgtgtgcatt gaagagatag atgttgatgg gaagtttatt 1621 cgcttgaaga acacttctga gcaggatcaa ccaatgggag gctgggagat gatcagaaaa 1681 attggagaca catcagtcag ttacaaatat acctcaagat atgtgctgaa ggctggccag 1741 actgtcacag tgtgggctgc aaatgctggc gtcacagcca gccctccaac tgacctcatc 1801 tggaagaacc agaactcttg gggtactggt gaagatgtga aggttatgct taagaattct 1861 cagggagagg aggttgctca gagaagctct gtcttcaaga ccaccatacc cgaggaggag 1921 gaggaggagg aggagcccat cggagtggct gtggaggagg agcgtttcca ccagcaggga 1981 gccccaagag catggaataa aagctgtgcc attatgtgaa cttatcaaga catggtcgat 2041 cttcctcaag ctagaagcat ggagtcctgt atacagtgca gagccttctc agaagcacat 2101 gatatttttg tatttccttt atgtgaattt ttaagctgcg aatctgatgg ccttaatttc 2161 ctttttgaca ctgaaagttt tgtcaaaaga aatcctatcc atacacgttg taagatgtga 2221 attattgaca ctgagctaac tgtactgttt ggaaaggggc cctcaagttt ttggcatttt 2281 ttctttcctt tttgtatgtg tgtatgtaat tttttttttt taagttcttt taagagggga 2341 caaggagggt aagaaaacca ctgcgtgtcc gggcattaat tgaagcttgc tctccctaga 2401 tgggcggtct gctctcggtc cttctctgct ctctataaaa tggtgctgtc ggggagggag 2461 gggggaagtt tttcaatata tgaacttttg tatggaattt tttgtaataa gtgatcaggt 2521 tacaattttt ttaaatagaa aagagaagaa aaacgttgta agaacggaat attaatctag 2581 tcacccatgt acgcactctg gatggaggtt ctacagagct gttgattggt caactacttc 2641 tcttacattg ttgactcatg aggggagcgg gcaggcgggt gagggtgggg gaaggctttc 2701 tcttcaaatt cgctagttga gtttttaaga tagtgtacat gcttacattt cttatccgac 2761 attaacaaat aaaacgctgt tttcctatt // LOCUS MUSMSTA1 1651 bp ds-DNA ROD 25-JUL-1990 DEFINITION Mouse metastatic cell protein (mts1) gene, exons 1 and 2. ACCESSION M36578 M35147 KEYWORDS mts1 protein. SEGMENT 1 of 2 SOURCE Mouse metastatic cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1651) AUTHORS Tulchinsky,E.M., Grigorian,M.S., Ebralidze,A.K., Milshina,N.I. and Lukanidin,E.M. TITLE Structure of gene mts1, transcribed in metastatic mouse tumor cells JOURNAL Gene 87, 219-223 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1498 / 1638 mts1 protein, exon 2 (first expressed exon) pre-msg 255 > 1076 mts1 mRNA and introns IVS 293 1482 mts1 intron A IVS 1639 > 1651 mts1 intron B signal 225 231 TATA box BASE COUNT 380 a 358 c 460 g 437 t 16 others ORIGIN 1 ttctggctga gctgtggctg cttggtggtg tccaccccat ccaagcctct gccgtgccca 61 ctggagctca ctcactactt gattgtgcct gctggggagg gagcaggaag cctagatccc 121 agactgggct ggtcgagggt gctatgacat ttactacatc aaccaacagc aagagcacag 181 tatccatgtt cccccatcct ctgcatgggc agggcctagc agggtataaa taggtcagat 241 tgttgggctc tccccaaacc tctctattca gcacttcctc tctcttggtc tggtgagttg 301 tgttggtctg atagcactgc tagcggcatt agaggctgag gctagggtag aagaaagggg 361 ggctgctgtg ggggaacaga tgtctttaat aaatccagat gagagattct gatgtggagg 421 ttcatgtatg tgtgtgtgtg tgtgtgtttt cacgagaatg aaaaccaaaa aaaaaaaaaa 481 aaaaaaaaaa agtgtataaa tggctacatc tgagctcccg aaggttttga gatactgagg 541 ctggcttgca tgttgctata gtgtatattg gtggtgcttg ggagtcactg tcatgcatag 601 gatgctgact cgtgttgctg ggtaatacaa gacagtgtgt ggacactcgg gtacaggaag 661 caaagcgaag gcatcagtag gcctttttgt tttacagtat ttaaattaca gtttttattt 721 gtgtgtatga gcgtatgggt tgggctggag caaatgccaa ggcgacattg tgggagccaa 781 aggacaattt gtgtgggagt caactcgttc cttctagcat gtgggctgtg gggatcaaac 841 tcaggccttg gagcttggtg gcaagcacct ctacccattg agctatctct ccagcaccct 901 cctgcagnnn nnnnnnnnnn nntttgtagt gtcttgtttt taattgccct atgaacatat 961 agcacctagg ccaagaaagc ctagcttccc caccctctcc tcttgcatcc ctacctctgc 1021 cacttcatct tactcctatt aggcagctgg ggtttttcca cttttttttt gtctgcctct 1081 gggcaggcag ccagcagccg cgcccaacgc tgggagggag aagaatgggc caggcctgtg 1141 cttgtggttg agctgtggga gtgagtaagc tgatggaaaa ctgctgttgt tgaggccata 1201 gctgagaggc acagaaaggt gctggcatag gtctccagag tttgaggggt agctttgcag 1261 gtttcagagc ccagagcaca tgtgaccttc ttgccaccaa tgggtcccat tcctctgatc 1321 cccnaggggg tgaggtccat ctcttagaga gttgtgggat agagcactta aaatgggaac 1381 agaatgagtg tgatttgggt catgctcagc aacacatatc cagttctcaa cacactgttg 1441 gcgtgggttg gagaatgtta cttttgtgtc tcctgccctt aggtctcaac ggttaccatg 1501 gcaagaccct tggaggaggc cttggatgta attgtgtcca ccttccacaa atactcaggc 1561 aaagagggtg acaagttcaa gctgaacaag acagagctca aggagctact gaccagggag 1621 ctgcctagct tcctgggggt aagtgggtcc t // LOCUS MUSMSTA2 545 bp ds-DNA ROD 25-JUL-1990 DEFINITION Mouse mts1 protein gene, exon 2. ACCESSION M36579 M35147 KEYWORDS mts1 protein. SEGMENT 2 of 2 SOURCE Mouse metastatic cell line NIH3T3 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 545) AUTHORS Tulchinsky,E.M., Grigorian,M.S., Ebralidze,A.K., Milshina,N.I. and Lukanidin,E.M. TITLE Structure of gene mts1, transcribed in metastatic mouse tumor cells JOURNAL Gene 87, 219-223 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 14 178 mts1 protein, exon 2 pre-msg < 1 315 mts1 mRNA and introns IVS < 1 13 mts1 intron B signal 297 302 poly-A signal BASE COUNT 126 a 97 c 170 g 152 t ORIGIN 1 cttcaacggc cagaaaagga cagatgaagc tgcattccag aaggtgatga gcaacttgga 61 cagcaacagg gacaatgaag ttgacttcca ggagtactgt gtcttcctgt cctgcattgc 121 catgatgtgc aatgaattct ttgagggctg cccagataag gagccccgga agaagtgaag 181 actcctcaga tgaagtgttg gggtgtagtt tgccagtggg ggatcttccc tgttggctgt 241 gagcatagtg ccttactctg gcttcttcgc acatgtgcac agtgctgagc aaattcaata 301 aaaggttttg aaactattag ctgttgtctg agagactgga gctatgggct gagggctgtg 361 gtagagactg ctggaagttg acctgagctt tgtggggcca aactaaaaaa aggtcgggga 421 gggggtgggt ggcttatttt gagtacattg caagtatgta tttgtgtgtg tcggcttagt 481 catgcgtgca tgtgtgcgtg cgtgtgtgtt tgtgtgtgtt tacgtgctcc tatatagcaa 541 ccgag // LOCUS MUSNFH 3959 bp ss-mRNA ROD 25-JUL-1990 DEFINITION Mouse neurofilament component (NF-H) mRNA, complete cds. ACCESSION M35131 KEYWORDS neurofilament protein. SOURCE Mouse (strain Swiss-Webster) brain, cDNA to mRNA, clones pMuH1, pMuH5, and DNA, clone lambda-5A. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3959) AUTHORS Shneidman,P.S., Carden,M.J., Lees,J.F. and Lazzarini,R.A. TITLE The structure of the largest murine neurofilament protein (NF-H) as revealed by cDNA and genomic sequences JOURNAL Mol. Brain Res. 4, 217-231 (1988) STANDARD simple staff_review COMMENT Nucleotides 1-955 are derived from genomic DNA. FEATURES from to/span description pept 154 3372 neurofilament component (NF-H) signal 59 65 TATA box signal 3936 3942 NF-H mRNA BASE COUNT 1104 a 1122 c 1197 g 536 t ORIGIN 1 ggggccgcgg gggaggaggt ggagcccact gccgaggggc cggaccgggc caccgcgata 61 taaaagagcc ggagtcccag agctgccgca gtgctgcctg ccccgtccca gccccgcact 121 cccgctccgc tggcggccgc acctgctccg gccatgatga gcttcggcag cgccgatgcg 181 ctgctgggcg ccccgttcgc gccgctgcac ggaggcggca gcctgcacta ctcgctgagc 241 cgcaaggcag gcccgggcgg cacgcgctcc gcggccggct cctccagcgg cttccactcg 301 tgggcgcgga cgtccgtgag ctccgtgtcc gcctcaccca gccgcttccg cggcgccgcc 361 tcgagcaccg actcgctaga caccctaagc aacggcccag agggctgcgt ggtggcggcg 421 gtggcggcgc gcagcgagaa ggagcagctg caggctctga acgaccgctt cgcgggctac 481 atcgacaagg tgaggcagct cgaggcgcac aaccgcagcc tggagggcga ggcggcggcg 541 ctgcggcagc aacaagccgg ccgcgccgcc atgggcgagc tgtacgagcg cgaggtgcgc 601 gagatgcgcg gcgccgtgct gcgcctcggg gcggcgcgcg ggcagctgcg cctggagcag 661 gagcacctgc tggaggacat cgctcacgtc cgccagcggc tggacgagga ggcccggcag 721 cgtgaggagg cggaggcggc ggcgcgcgcc ctggcgcgct tcgcgcagga ggcggaagcg 781 gcgcgcgtgg agctgcagaa gaaggcgcag gcgctgcagg aggagtgcgg ctacctgcgg 841 cgccaccacc aggaggaggt gggcgagctg ctcggtcaga tccagggctg cggggccgcg 901 caggcgcagg ctcaggccga ggctcgcgac gccctcaagt gcgacgtgac gtcggcgctg 961 cgggagatcc gcgcgcagct cgaaggccac gcggtgcaga gcacgctgca gtccgaggag 1021 tggttccgag tgaggttgga ccgactctca gaggcagcca aagtgaacac agatgctatg 1081 cgctcggccc aagaggagat aactgagtac cggcggcagc tgcaagccag gaccacagag 1141 ttggaggccc tgaaaagcac caaggagtca ctggagaggc agcgctctga gctagaggac 1201 cgtcatcagg cagacattgc ctcctaccag gacgctattc agcagctgga cagtgagctg 1261 agaaacacca agtgggagat ggctgcacag ctccgagagt accaggacct gctcaacgtc 1321 aagatggccc tggacattga gattgccgct tacagaaagc tcctggaagg cgaagagtgt 1381 cggattggct ttggtccgag tcccttctct cttactgaag gactcccaaa aattccctcc 1441 atatccacgc acataaaagt caaaagcgaa gagatgataa aggtagtaga gaaatccgag 1501 aaggaaactg tgattgtaga aggacagaca gaagagatcc gggtgacgga aggagtgaca 1561 gaagaggagg acaaagaggc ccaaggtcag gaaggagaag aagcagaaga gggagaagaa 1621 aaagaagaag aggaaggagc agcagctaca tctccccctg cagaagaggc tgcatctcca 1681 gaaaaagaaa ccaagtctcg tgtgaaagaa gaggccaagt ccccaggtga ggccaagtcc 1741 ccaggtgagg ccaagtcccc aggtgaggcc aagtccccag ctgaggccaa gtccccaggt 1801 gaggccaagt ccccacgtga ggccaagtcc ccaggtgagg ccaagtctcc agctgagccc 1861 aagtctccag ctgagcccaa gtctccagct gaggccaagt caccagctga gcccaagtct 1921 ccagctacag tgaagtctcc aggtgaggcc aagtcaccat ctgaggccaa atctccagct 1981 gaagccaaat ctccagctga ggccaaatct ccagctgagg ccaaatctcc agctgaggcc 2041 aagtcaccag ctgaagccaa gtcaccagct gaagccaaat ctccagctac agtgaagtct 2101 ccaggtgagg ccaagtcacc atctgaggcc aaatctccag ctgaagccaa atctccagct 2161 gaggccaaat ctccagctga ggccaaatct ccagctgagg tcaagtcacc aggtgaggcc 2221 aagtctccag ctgagcccaa gtcaccagct gaggccaaat ctccagctgc agtgaagtca 2281 ccagctgagg ccaagtctcc agctgcagtc aagtccccag gtgaggccaa gtccccaggt 2341 gaggccaagt caccagctga ggccaaatct ccagctgagg ccaagtcacc aattgaggta 2401 aaatctccag agaaggccaa gacccccgtc aaggaaggag caaaatctcc agctgaggcc 2461 aagtctcctg agaaggccaa gtcccccgtg aaggaagata tcaagccccc agctgaggcg 2521 aaatcccctg agaaggccaa gagccccatg aaggaaggag caaagcctcc tgagaaggcc 2581 aagcctctag atgtgaagtc tccggaagcc cagactccag tacaggagga agcgaacgac 2641 cccacagaca tcagaccccc tgagcaggtg aaaagtcctg ccaaggagaa ggccaagtcc 2701 cctgagaagg aagaagccaa gacttctgaa aaggtggctc ccaagaagga agaggtgaag 2761 tcccctgtga aggaggaggt aaaagccaaa gaacccccaa agaaggtaga agaagagaag 2821 acactgccta caccaaagac agaggcgaag gagagtaaga aagacgaagc tcccaaggag 2881 gccccgaagc ccaaggtgga ggagaagaag gaaactccca cggaaaagcc caaggactct 2941 acagcagaag ccaagaagga agaggctgga gagaagaaga aagccgtggc ctcagaggag 3001 gagactcctg ccaagttggg tgtgaaggaa gaagctaaac ccaaagagaa gacagagaca 3061 accaagacag aagcagaaga caccaaggcc aaagaaccta gcaaacccac agagacggaa 3121 aagccaaaga aagaggagat gccagcggca ccagagaaga aagacaccaa ggaggagaag 3181 accacagagt ccaggaagcc tgaggagaag cccaaaatgg aggccaaggt caaggaggat 3241 gacaagagcc tttccaaaga gcctagcaaa cccaagacag aaaaggctga aaaatcctct 3301 agcacagacc agaaagaaag ccagccccca gagaagacca cagaggacaa ggccaccaag 3361 ggagagaagt aagagaacaa gagaaacacc cagaatagcc aaagaaactc aggacggtcc 3421 cagtactcag gggtcggcgt aataaatttt atttcttcct ttccctccgt aagaagaaac 3481 actgcttaga tggtgggcct gccctcacca aacaggaatt tctattaaga ttaagttagc 3541 aagagaagat aaccctgagc cttgtccccc acgccgaaaa ccctccccag gtgatggaca 3601 attatgatag cttcttgtag ccgaacgtga tgtatgctga acgctacgcg taaaacacgc 3661 gtctaaaaac tgccccctcc tttccaagta agtgcattta tttcctgtat gtccaactga 3721 cagatgaccg caataatgaa tgagcagtta gaaacgcatt atgcttgaaa tgttgtaacc 3781 tattcctgaa tgccttcttg ttttccaaag gagtggtcag gcccttgccc agtacacgct 3841 cctggaagag ctgcagcagg tgaggcaggg cgctggccac tgaaccacgc cagggtgtac 3901 tctccactga agtccacttt caattgcttc catgcaataa aaccaagtgc ttctgaaat // LOCUS MUSRGCA 350 bp ds-DNA ROD 25-JUL-1990 DEFINITION Mouse 18S rRNA gene. ACCESSION M35283 KEYWORDS 18S ribosomal RNA; processing factor; ribosomal RNA. SOURCE Mouse (strain S100) ribosomal DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 350) AUTHORS Mishima,Y., Katayama,M. and Ogata,K. TITLE Identification of a protein factor and the nucleotide sequence required for processing of mouse precursor rRNA JOURNAL J. Biochem. 104, 515-520 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA 325 > 350 18S rRNA site 220 220 processing site BASE COUNT 29 a 128 c 117 g 76 t ORIGIN 1 tcgacgttcc ggctctcccg atgccgaggg gttcgggatt tgtgccgggg acggagggga 61 gagcgggtaa gagaggtgtc ggagagctgt cccggggcga cgctcgggtt ggctttgccg 121 cgtgcgtgtg ctcgcggcgg gttttgtcgg accccgacgg ggtcggtccg gccgcatgca 181 ctctcccgtt ccgcgcgagc gccgcccggc tcacccccgg tttgtcctcc cgcgaggctc 241 tccgccgccg cctcctcctc ctctctcgcg ctctctgttc cgcctggtcc tgtcccaccc 301 ccgacggctt cgctcgcgct tccttacctg gttgatcctg ccagtagcat // LOCUS MYCRDNAA 190 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyorhinis A-repeat sequence DNA. ACCESSION M35303 KEYWORDS A-repeat. SOURCE M.hyorhinis DNA, clone pG102.1. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 190) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 94 a 15 c 31 g 50 t ORIGIN 1 gaattcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt 61 atttttgaag aaaataaagc aagatatgaa aaaaggagaa tcaaagctga acttaataat 121 agaggctata aaattggact taaaaaagtt cacagattat tggaaaaatt caatcttaaa 181 gcaatttgtt // LOCUS MYCRDNAB 190 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyorhinis A-repeat sequence DNA. ACCESSION M35304 KEYWORDS A-repeat. SOURCE M.hyorhinis DNA, clone pG102.3. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 190) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 87 a 16 c 38 g 49 t ORIGIN 1 gaattcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt 61 atttttgaag aaaataaagc aagatatgaa aaaaggagaa tcaaagctga acttaataat 121 agaggctata aaattggact tagatagggt tgagtgttgt tccagtttgg acaagaagtc 181 cactattaaa // LOCUS MYCRDNAC 191 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyorhinis A-repeat sequence DNA. ACCESSION M35305 KEYWORDS A-repeat. SOURCE M.hyorhinis DNA, clone pG101. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 191) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 97 a 18 c 29 g 47 t ORIGIN 1 gaactcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt 61 atttttgaaa gaaaataaag caagatatga aaaaaagaga atcaaagctg aactcaataa 121 tagaggctat aaaattggac ttaaaaaagt tcacagatta ttgaaaaaat tcaatctcaa 181 agcaatttgt t // LOCUS MYCRDNAD 191 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyopneumoniae A-repeat sequence DNA. ACCESSION M35306 KEYWORDS A-repeat. SOURCE M.hyopneumoniae DNA, clone pJ125. ORGANISM Mycoplasma hyopneumoniae Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 191) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 94 a 17 c 31 g 49 t ORIGIN 1 gaactcaaaa aagaagattt tgacaagaaa aatgaagaaa ttataagtca aatgaagctt 61 atttttgaaa gaaaataaag caagatatgc aaaaaagaga ataaaagctg atcttaataa 121 tagaggctat aaaattggac ttaaaaaagt tcgcagatta ttggaaaaat tcaatctcaa 181 agcaatttgt t // LOCUS MYCRDNAE 210 bp ds-DNA BCT 25-JUL-1990 DEFINITION M.hyorhinis B-repeat sequence DNA. ACCESSION M35307 KEYWORDS B-repeat. SOURCE M.hyorhinis DNA, clones pG102.[1,3]. ORGANISM Mycoplasma hyorhinis Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 210) AUTHORS Taylor,M.A., Ferrell,R.V., Wise,K.S. and McIntosh,M.A. TITLE Reiterated DNA sequences defining genomic diversity within the species Mycoplasma hyorhinis JOURNAL Mol. Microbiol. 2, 665-672 (1988) STANDARD simple staff_review BASE COUNT 84 a 29 c 28 g 69 t ORIGIN 1 gaattcttta aatttagtag aaatcaaaaa aactcaacaa ggcaactgag ttcgttataa 61 aaaagtttat caatatgcta aattcgatgc aagaactaaa caatttatct tagttgaaaa 121 aggcgttcct tttactaata tgattattgc taatcaaaac aatctacatt tgaatatttt 181 gactgacagg ttctaaagaa tgcagcattt // LOCUS STRLACZ 209 bp ds-DNA BCT 25-JUL-1990 DEFINITION S.bovis lactose catabolic protein (lacZ) gene, 5' end. ACCESSION M35285 KEYWORDS catabolic protein. SOURCE S.bovis (strain H/3) DNA. ORGANISM Streptococcus bovis Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 209) AUTHORS Gilbert,H.J. and Hall,J. TITLE Molecular cloning of Streptococcus bovis lactose catabolic genes JOURNAL J. Gen. Microbiol. 133, 2285-2293 (1987) STANDARD simple staff_review FEATURES from to/span description pept 184 > 209 lactose catabolic protein (lacZ) BASE COUNT 59 a 55 c 43 g 52 t ORIGIN 1 tcgattagcc cttggaccct gctagtcttg acctgcctag gtttcccagg tcaagttccc 61 agttaccgac tacccgtaaa tcgatactac gccattgtta gatcggatct gaacccgtaa 121 ctttatagtt gggtatcgtg agcagatcac aatatcccac aataaaagga ggataacatc 181 caaatgatca cggacacagt ggccatcga // LOCUS STYSSCA 1551 bp ds-DNA BCT 25-JUL-1990 DEFINITION S.typhimurium Ssc protein (ssc) gene, complete cds. ACCESSION M35193 KEYWORDS Ssc protein. SOURCE S.typhimurium (strain SH5014, isolate LT2) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1551) AUTHORS Hirvas,L., Koski,P. and Vaara,M. TITLE Characterization of a new protein encoding region between ompH and lipid A biosynthesis genes of Salmonella typhimurium JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.H.Hirvas, 15-JUN-1990. Author address: L.H.Hirvas University of Helsinki Dept of Bacteriology and Immunology Haartmaninkatu 3 00290 Helsinki FINLAND FEATURES from to/span description pept 19 1044 Ssc protein BASE COUNT 362 a 354 c 438 g 397 t ORIGIN 1 aaacaggtta aataagtaat gccttcaatt cgactggctg acttagcaga acagttggat 61 gcagaattac acggtgatgg cgatatcgtc atcaccggcg ttgcgtccat gcaatctgca 121 acaacaggcc acattacgtt tatggtgaat cctaagtacc gtgaacactt aggtttatgc 181 caggcttctg cggttgtcat gacgcaggac gatcttcctt ttgctaagag tgcggcgctg 241 gtagttaaaa atccctacct gacctacgcg cgcatggcgc aaattttaga tactacgccg 301 cagcccgcgc agaatatcgc gccaagcgcc gtgattgatg cgacggcaac gctgggtagc 361 aatgtttcag tcggcgcgaa tgcggtgatt gaatctggcg tacaactggg cgataacgtg 421 gttatcggcg caggctgttt cgtcggaaaa aatagcaaaa tcggggcggg ttcacgcttg 481 tgggcgaacg taacgattta ccacgacatt cagatcggtg agaattgcct gatccagtcc 541 agtacggtga tcggcgcgga cggttttggc tacgctaacg atcgtggcaa ctgggtgaag 601 atcccacaac tgggccgggt cattattggc gatcgtgtcg agatcggcgc ttgtaccacc 661 attgaccgtg gcgcgttgga tgatactgtt attggcaatg gcgtgattat tgataatcag 721 tgccagattg cacataacgt cgtgattggc gacaatacgg cagttgccgg tggcgtcatt 781 atggcgggta gcctgaagat tggccgttac tgcatgattg gcggcgccag cgtgatcaat 841 gggcatatgg aaatatgcga caaagtcacg gtaactggca tgggtatggt gatgcgtccc 901 atcacggaac cgggcgtcta ctcctcaggc attccgctgc aacccaacaa agtatggcgt 961 aaaactgctg cactggtgat gaacattgat gatatgagca agcgtctcaa agcgattgag 1021 cgcaaggtta atcaacaaga ctaacgttcc gccttgtagt tgccattctt ttccggcctg 1081 tcacattcat acgattgcgg caggccgtgt tattattgcc tttttgtata tttggacagg 1141 aagagtattt tgactactaa cactcatact ctgcagattg aagagatttt agagcttctg 1201 ccgcaccgtt ttccgttttt actggtcgat cgcgtgctgg actttgaaga aggtcgtttt 1261 ctgcgtgcgg tgaaaaatgt ctccgtcaac gagccgtttt tccaggggca tttcccgggc 1321 aaaccgattt tgccaggcgt gctgattctg gaagcgatgg cgcaggcaac cggtattctg 1381 gcgtttaaaa gcgttggtaa actggaacct ggcgaactgt attatttcgc gggtattgat 1441 gaagcgcgct ttaagcgtcc ggtggtgcca ggcgatcaga tgatcatgga agtcactttc 1501 gagaaaacgc gccgtggcct gacccgcttt aaaggggttg cgctggtcga c // LOCUS TOBRUBPA 979 bp ds-DNA PLN 25-JUL-1990 DEFINITION Tobacco ribulose-1,5-bisphosphate carboxylase small subunit gene, exons 1 and 2. ACCESSION M32419 KEYWORDS ribulose-1,5-bisphosphate carboxylase. SOURCE Tobacco DNA, clone TSSU3-8. ORGANISM Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 979) AUTHORS O'Neal,J.K., Pokalsky,A.R., Kiehne,K.L. and Shewmaker,C.K. TITLE Isolation of tobacco SSU genes: Characterization of a transcription- ally active pseudogene JOURNAL Nucleic Acids Res. 15, 8661-8676 (1987) STANDARD simple staff_review FEATURES from to/span description pept 584 760 ribulose-1,5-bisphosphate carboxylase small subunit precursor, exon 1 854 > 979 ribulose-1,5-bisphosphate carboxylase small subunit precursor, exon 2 sigp 584 757 ribulose-1,5-bisphosphate carboxylase small subunit signal peptide matp 758 760 ribulose-1,5-bisphosphate carboxylase small subunit 854 > 979 ribulose-1,5-bisphosphate carboxylase small subunit IVS 761 853 ribulose-1,5-bisphosphate carboxylase small subunit intron A BASE COUNT 320 a 175 c 185 g 299 t ORIGIN 1 ttaattatgt ctttgtttgc ttctcatgtg ataaagaatc gaagccttga tgaacataat 61 ttgcatttga gtagtgaata gctgctttca caaagagtac tctagctatt aagtttagtt 121 tgaatatttt gaaacacaaa aatatatgta tacatacaaa aacaaatacc gcaatagtcc 181 aagcaaaagg gactttaaaa aaaaaaacca acctcaatta cacattcata tcctcttcct 241 accccatcta ggatgagata agattactga ggttgtttac acgtggcacc tccattgtgg 301 tgaattaaat gatcaatggc ttagctcaaa atataatttt ccaacctttc atgtgtggat 361 attaagtttt gtgtagtgaa tcaagaacca cataatccaa tggttagctt tactccaaga 421 tgagggggtt gttgattttt gtccgttaga tatgggaaat atgtaaaacc ttatcattat 481 atatagagtg gtgggcaact atgcaatgac catcttggaa gtttaaagga aaaaaaagga 541 aagggagaaa gagaaatctt tctgtcttaa agtgtaatta acaatggctt cctcagttct 601 ttcctctgca gcagttgcca cccgcagcaa tgttgctcaa gctaacatgg ttgcaccttt 661 cactggcctt aagtcagctg cctcattccc tgtttcaagg aagcaaaacc ttgacatcac 721 ttccattgcc agcaacggcg gaagagtgca atgcatgcag gtaatttata tacaatgaca 781 gtgcaaaaaa ttttgataca attaatgcat cttaacatgt catagctaaa aattctattt 841 tggtggaata taggtgtggc caccaattaa caagaagaag tacgagactc tctcatacct 901 tcctgatttg agccaggagc aattgcttag tgaagttgag taccttttga aaaatggatg 961 ggttccttgc ttggaattc // LOCUS TOBRUBPB 1337 bp ds-DNA PLN 25-JUL-1990 DEFINITION Tobacco ribulose-1,5-bisphosphate carboxylase small subunit pseudogene, complete cds. ACCESSION M32420 KEYWORDS pseudogene; ribulose-1,5-bisphosphate carboxylase. SOURCE Tobacco DNA, clone TSSU3-2. ORGANISM Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 1337) AUTHORS O'Neal,J.K., Pokalsky,A.R., Kiehne,K.L. and Shewmaker,C.K. TITLE Isolation of tobacco SSU genes: Characterization of a transcription- ally active pseudogene JOURNAL Nucleic Acids Res. 15, 8661-8676 (1987) STANDARD simple staff_review FEATURES from to/span description pept.ps 272 451 ribulose-1,5-bisphosphate carboxylase small subunit, exon 1 620 754 ribulose-1,5-bisphosphate carboxylase small subunit, exon 2 912 1100 ribulose-1,5-bisphosphate carboxylase small subunit, exon 3 IVS 452 619 ribulose-1,5-bisphosphate carboxylase small subunit intron A IVS 755 911 ribulose-1,5-bisphosphate carboxylase small subunit intron B BASE COUNT 379 a 249 c 256 g 453 t ORIGIN 1 gttttagaaa atatttccca ttcacaaatt aagtttggga actttgagat aaggacgact 61 gagtgtaatc aatgtcaggg gttcaaattt atgtgcccgt caatttttca atccacggct 121 acgattcctc taagatgagg tcattgcttg cttgtgtccg ttagatgaga aaaagacgtg 181 aaaccttatc actatatata gcactcatca cacccttgaa agcaaaggtc aagggaagca 241 atagctttaa gctaaacaat tactttcaac aatggcttcg tctgtgattt cctcagccgc 301 tgccgttgcc accggcgcta atgcggctca agccagtatg gttgcacctt tcactggcct 361 caaatccgcc tactccttcc ctgtttccag aaaacaaaac cttgacatta cttccattgc 421 tagcaatggt ggaagagttt aatgcatgca ggtttgtagc atatattatt gtagttagct 481 tatataaact gatagagtaa agaaatttta cgttatatat tgatatattt taacctggta 541 atttgattta tttttcatat tattaatccc acttttttat tgtacttatg aagtttattt 601 taattcttta tatatatagg tgtggccacc aattaacaag aagaagtacg agacactctc 661 ataccttcct gatttgagcg aggagcaatt gcttagggaa gttgaatacc ttttgaaaaa 721 tggatgggtt ccttgcttgg aattcgagac tgaggtcaaa catctattct aaatcatgct 781 actattatca agcataacta acatgaataa ctcaatccta actagtttgg gattagacat 841 atatagttga ttaagtgaaa gaggagtatt atctcatgtt aatgttttgt ttatcttgtg 901 gatatgcgca gcacggattc gtctaccgtg agaataacaa gtcaccaggt tactacgatg 961 gaagggccac tcaggtcttg gctgaggtcg aggaggcaaa gaaggcttac ccacaagcct 1021 ggatcagaat cattggattc gacaacgtcc gtcaagtgca atgcatcagt ttcatcgcct 1081 acaagcccgc aggctactaa aatctccatt tttaagacaa cttaccgtat gtattcaggg 1141 gaagtttgtt tgaattctcc ttgtgttttt ccccggagaa actgttttgg ttttcctttg 1201 ttttaattcc ttctttctat tcggtgtata tttttgaatt ccaatcaagt ttatgagaac 1261 taataatgtc atttgtttct ttcgtaattt gctttgtggt gtacatcggt tttaattatc 1321 cgagtaatat ctgcttt // LOCUS ZYMCPA 1374 bp ss-RNA VRL 25-JUL-1990 DEFINITION Zucchini yellow mosaic virus coat protein (cp) mRNA, 3' end. ACCESSION M35095 KEYWORDS capsid protein; coat protein. SOURCE Zucchini yellow mosaic virus, cDNA to viral RNA, clone ZYKS-22cp. ORGANISM Zucchini yellow mosaic virus Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Potyvirus. REFERENCE 1 (bases 1 to 1374) AUTHORS Gal-On,A., Antignus,Y., Rosner,A. and Raccah,B. TITLE Nucleotide sequence of the zucchini yellow mosaic virus capsid- encoding gene and its expression in Escherichia coli JOURNAL Gene 87, 273-277 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 1164 coat protein (cp) (AA at 1) mRNA < 1 1374 cp mRNA BASE COUNT 423 a 279 c 342 g 330 t ORIGIN 1 tcgacgaagg agagattgtt tccaatttta gagtgggata gaagcaaaga aattatgcac 61 cgaacagagg ctatttgcgc tgcgatgatt gaggcatggg gacacaccga gcttttacaa 121 gagatcagaa agttttatct atggttcgtt gaaaaggaag aagtgcgaga attagccgcc 181 ctcggaaaag ctccatacat agctgagaca gcacttcgta agctatacac tgacaaggga 241 gcggatacaa gtgaactggc acgttatcta caagccctcc accaagacat cttctttgaa 301 caaggagaca ctgtaatgct ccaatcaggc actcagccaa ctgtggcaga cactggagcc 361 acaaagaaag acaaagaaga tgacaaaggg aaaaacaagg atgttacagg ctccggctca 421 agtgagaaaa cagtggcagc tgtcacgaag gacaaggatg taaatgctgg ttctcatggg 481 aaaattgtgc cgcgtctttc gaagataaca aagaagatgt cactgccacg cgtgaaagga 541 aatgtgatac tcgacattga tcacttgctg gagtataagc cggatcaaat tgagttatac 601 aacacacgag cgtctcatca gcaattcgcc tcttggttca accaagttaa aacagaatat 661 gatctgaatg agcaacagat gggagttgta atgaatggtt tcatggtttg gtgcatcgaa 721 aatggcacgt cacccgacat taacggagta tgggttatga tggacggtaa tgagcaggtt 781 gaatatcctt tgaaaccaat agttgaaaat gcaaagccaa cgctgcgaca aataatgcat 841 cacttttcag atgcagcgga ggcatatata gagatgagaa atgcagaggc accatacatg 901 ccgaggtatg gtttgcttcg aaacttacgg gataggagtt tggcacgata tgctttcgac 961 ttctacgaag tcaattccaa aactccggaa agagcccgcg aagctgttgc gcagatgaaa 1021 gcagcagccc ttagcaatgt ttcttcaagg ttgtttggcc ttgatggaaa tgttgccacc 1081 actagcgaag acactgaacg gcacactgca cgtgatgtta ataggaacat gcacaccttg 1141 ctaggtgtga atacaatgca gtaaagggta ggtcgcctac ctaggttatc gtttcgctcc 1201 gacgtaattc taatatttac cgctttatgt gatgtcttta catttctaga gtgggcctcc 1261 cacctttaaa gcgtaaagtt tatgttagtt gtccaggagt gccgtagtcc tgtcggaagc 1321 tttagtgtga gcctctcacg aataagctcg agattagact ccgtttgcaa gcct //
GenBank-Updates@genbank.bio.net (07/26/90)
LOCUS ATUNPSS 6425 bp ds-DNA SYN 26-JUL-1990 DEFINITION A.tumefaciens neomycin phosphotransferase and streptomycin/spectinomycin adenyltransferase, complete cds. ACCESSION M35007 KEYWORDS neomycin phosphotransferase; streptomycin/spectinomycin adenyltransferase. SOURCE N.tabacum T-DNA inserts in A.tumefaciens DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6425) AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and Depicker,A. TITLE Cloning and sequence analysis of truncated T-DNA inserts from Nicotiana tabacum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.R.Gheysen, 01-JUN-1990. FEATURES from to/span description pept 2782 1985 (c) neomycin phosphotransferase pept 4983 4042 (c) streptomycin/spectinomycin adenyltransferase recomb 24 25 T-DNA end/plant DNA start recomb 40 41 plant DNA end/T-DNA start recomb 1094 1095 plant DNA end/T-DNA start recomb 2786 2787 T-DNA end/plant DNA start recomb 3044 3045 T-DNA end/plant DNA start recomb 3354 3355 plant DNA end/T-DNA start recomb 5585 5586 T-DNA end/plant DNA start recomb 6389 6390 T-DNA end/plant DNA start site 1 24 left T-DNA border site 372 395 24 bp border-like sequence site 1569 1592 24 bp border-like sequence site 1669 1692 24 bp border-like sequence site 1779 1756 (c) 24 bp border-like sequence site 2128 2105 (c) 24 bp border-like sequence site 2449 2472 24 bp border-like sequence site 2485 2462 (c) 24 bp border-like sequence site 3660 3683 24 bp border-like sequence site 3875 3898 24 bp border-like sequence site 4359 4336 (c) 24 bp border-like sequence site 5868 5891 24 bp border-like sequence BASE COUNT 1509 a 1754 c 1710 g 1452 t ORIGIN 1 cggcaggata tattcaattg taaatggctt catgtccggg aaatctacat ggatcagcaa 61 tgagtatgat ggtcaatatg gagaaaaaga aagagtaatt accaattttt tttcaattca 121 aaaatgtaga tgtccgcagc gttattataa aatgaaagta cattttgata aaacgacaaa 181 ttacgatccg tcgtatttat aggcgaaagc aataaacaaa ttattctaat tcggaaatct 241 ttatttcgac gtgtctacat tcacgtccaa atgggggctt agatgagaaa cttcacgatc 301 gatgccttga tttcgccatt cccagatacc catttcatct tcagattggt ctgagattat 361 gcgaaaatat acactcatat acataaatac tgacagtttg agctaccaat tcagtgtagc 421 ccattacctc acataattca ctcaaatgct aggcagtctg tcaactcggc gtcaatttgt 481 cggccactat acgatagttg cgcaaatttt caaagtcctg gcctaacatc acacctctgt 541 cggcggcggg tcccatttgt gataaatcca ccatcacaat agatagtcta atggacgaaa 601 aaggcgaata tttcgatgct gagattcgac gcaattaatt cgagaaaaat cccgtgattg 661 atgctgttga gttaccaata atatgggcag cgaaggccat ttaattataa gatcctgcaa 721 gcctcgtcgt cctggccgga ccacgctatc tgtgcaaggt ccccggcccc ggacgcgcgc 781 tccatgagca gagcgcccgc cgccgaggcg aagagtcggg cggcgccctg cccgtcccac 841 caggtcaaca ggcggtaacc ggcctcttca tcgggaatgc gcgcgacctt cagcatcgcc 901 ggcatgtccc cctggcggac gggaagtatc cagctcgacc aaagcggcca tcgtgcctcc 961 ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc 1021 gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa 1081 ccaactcgcg aggggatcga gcccctgctg agcctcgaca tgttgtcgca aaattcgccc 1141 tggacccgcc caacgatttg tcgtcactgt caaggtttga cctgcacttc atttggggcc 1201 cacatacacc aaaaaaatgc tgcataattc tcggggcagc aagtcggtta cccggccgcc 1261 gtgctggacc gggttgaatg gtgcccgtaa ctttcggtag agcggacggc caatactcaa 1321 cttcaaggaa tctcacccat gcgcgccggc ggggaaccgg agttcccttc agtgaacgtt 1381 attagttcgc cgctcggtgt gtcgtagata ctagcccctg gggccttttg aaatttgaat 1441 aagatttatg taatcagtct tttaggtttg accggttctg ccgctttttt taaaattgga 1501 tttgtaataa taaaacgcaa ttgtttgtta ttgtggcgct ctatcataga tgtcgctata 1561 aacctattca gcacaatata ttgttttcat tttaatattg tacatataag tagtagggta 1621 caatcagtaa attgaacgga gaatattatt cataaaaata cgatagtaac gggtgatata 1681 ttcattagaa tgaaccgaaa ccggcggtaa ggatctgagc tacacatgct caggtttttt 1741 acaacgtgca caacagaatt gaaagcaaat atcatgcgat cataggcgtc tcgcatatct 1801 cattaaagca gggggtgggc gaagaactcc agcatgagat ccccgcgctg gaggatcatc 1861 cagccggcgt cccggaaaac gattccgaag cccaaccttt catagaaggc ggcggtggaa 1921 tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc 1981 ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg 2041 cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat 2101 cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg ccacagtcga 2161 tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca tcgccatggg 2221 tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac agttcggctg 2281 gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg gcttccatcc 2341 gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat 2401 caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg gcaggagcaa 2461 ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag tcccttcccg 2521 cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc agccacgata 2581 gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc ttgacaaaaa 2641 gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct 2701 gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa cctgcgtgca 2761 atccatcttg ttcaatccac atgatcagat ctctaggcgc gtgggtgcgg acgtagtcag 2821 cgccattgcc gatcgcgtga agttccgccg caaggccgct ggacccagat cctttacagg 2881 aaggccaacg gtggcgccca agaaggattt ccgcgacacc gagaccaata gcggaagccc 2941 caacgccgac ttcagctttt gaaggttcga cagcacgtgc agcgatgttt ccggtgcggg 3001 gctcaagaaa aatcccatcc ccggatcgag gatgagccgg tcggcagcga ccccgctccg 3061 tcgcaaggcg gaaacccgcg cctcgaagaa ccgcacaatc tcgtcgagcg cgtcttcggg 3121 tcgaaggtga ccggtgcggg tggcgatgcc atcccctgcg ctgagtgcat aaccaccagc 3181 ctgcagtccg cctcagcaat atcgggatag agcgcagggt caggaaatcc ttggatatcg 3241 ttcaggtagc ccacgccgcg cttgagcgct agcgcgggtt tccggttgga agctgtcgat 3301 tgaaacacgg tgcatctgat cggacagggc gtctaagagc ggcgcaatac gtctgatctc 3361 atcggccggc gatacaggcc tcgcgtccgg atggctggcg gccggtccga catccacgac 3421 gtctgatccg actcgcagca tttcgaccgc cgcggtgaca gcgttggtgg ggtctagcag 3481 tacgtcaatc gaagaaggag tcctcggtga gattcagaat gccgaacacc gtcaccatgg 3541 cgtcggcctc cgcagcgact tccacgatgg ggatcgggcg agcaaaaagg cagcaattat 3601 gagccccata cctacaaagc cccacgcatc aagcttttga ccctgaagca actaggcaat 3661 ggctgtaatt atgacgacgc cgagtcccga accagactgc ataagcaaca accgacagaa 3721 tggatttcga aaccagagaa agaaaataaa tgcgatgcca taaccgatta tgaacaacgg 3781 cggaaggggc aagcttagta aatgcctcgc tagattttaa tgcggatgtt gcgattactt 3841 cgccaactat tgcgataaca agaaaaagcc agcctttcat gatatatctc ccaatttgtg 3901 tagggcttat tatgcacgct taaaaataat aaaagcagac ttgacctgat agtttggctg 3961 tgagcaatta tgtgcttagt gcatctaatc gcttgagtta acgccggcga agcggcgtcg 4021 gcttgaacga attgttagac attatttgcc gactaccttg gtgatctcgc ctttcacgta 4081 gtggacaaat tcttccaact gatctgcgcg cgaggccaag cgatcttctt cttgtccaag 4141 ataagcctgt ctagcttcaa gtatgacggg ctgatactgg gccggcaggc gctccattgc 4201 ccagtcggca gcgacatcct tcggcgcgat tttgccggtt actgcgctgt accaaatgcg 4261 ggacaacgta agcactacat ttcgctcatc gccagcccag tcgggcggcg agttccatag 4321 cgttaaggtt tcatttagcg cctcaaatag atcctgttca ggaaccggat caaagagttc 4381 ctccgccgct ggacctacca aggcaacgct atgttctctt gcttttgtca gcaagatagc 4441 cagatcaatg tcgatcgtgg ctggctcgaa gatacctgca agaatgtcat tgcgctgcca 4501 ttctccaaat tgcagttcgc gcttagctgg ataacgccac ggaatgatgt cgtcgtgcac 4561 aacaatggtg acttctacag cgcggagaat ctcgctctct ccaggggaag ccgaagtttc 4621 caaaaggtcg ttgatcaaag ctcgccgcgt tgtttcatca agccttacgg tcaccgtaac 4681 cagcaaatca atatcactgt gtggcttcag gccgccatcc actgcggagc cgtacaaatg 4741 tacggccagc aacgtcggtt cgagatggcg ctcgatgacg ccaactacct ctgatagttg 4801 agtcgatact tcggcgatca ccgcttccct catgatgttt aactttgttt tagggcgact 4861 gccctgctgc gtaacatcgt tgctgctcca taacatcaaa catcgaccca cggcgtaacg 4921 cgcttgctgc ttggatgccc gaggcataga ctgtacccca aaaaaacagt cataacaagc 4981 catgaaaacc gccactgcgc cgttaccacc gctgcgttcg gtcaaggttc tggaccagtt 5041 gcgtgaggcc atacgctact tgcattacag cttacgaacc gaacaggctt atgtccactg 5101 ggttcgtgcc ttcatccgtt tccacggtgt gcgtcacccg gcaaccttgg gcagcagcga 5161 agtcgaggca tttctgtcct ggctggcgaa cgagcgcaag gtttcggtct ccacgcatcg 5221 tcaggcattg gcggccttgc tgttcttcta cggcaagtgc tgtgcacgga tctgccctgg 5281 cttcaggaga tcggaagacc tcggccgtcc gggcgcttgc cggtggtgct gaccccggat 5341 gaagtggttc gcatcctcgg ttttctggaa ggcgagcatc gtttgttcgc ccagcttctg 5401 tatggaacgg gcatgcggat cagtgagggt ttgcaactgc gggtcaagga ctggatttcg 5461 atcacggcac gatcatcgtg cgggagggca agggctccaa ggatcgggcc ttgatgttac 5521 ccgagagctt ggcacccagc ctgcgcgagc agctgtctcg tgcacgggca tggtggctga 5581 aggactaggc cgagggccgc agcggcgttg cgcttcccga cgcccttgag cggaagtatc 5641 cgcgcgccgg gcattcctgg ccgtggttct gggtttttgc gcagcacacg cattcgaccg 5701 atccacggag cggtgtcgtg cgtcgccatc acatgtatga ccagaccttt cagcgcgcct 5761 tcaaacgtgc cgtagaacaa gcaggcatca cgaagcccgc cacaccgcac accctccgcc 5821 actcgttcgc gacggccttg ctccgcagcg gttacgacat tcgaaccgtg caggatctgc 5881 tcggccattc cgacgtctct acgacgatga tttacacgca tgtgctgaaa gttggcggtg 5941 ccggagtgcg ctcaccgctt gatgcctgcc gcccctcact gtgagaggta gggcagcgca 6001 agtcaatcct agcggattca ctacccctgc gcgaaggcca tcggtgccgc atcgaacggc 6061 cggttgcgga aagtcctccc tgcgtccgct gatggccggc agcagcccgt cgttgaagga 6121 tccctgaaag cgacgttgga tgttaacatc tacaaattgc cttttcttac gaccatgtac 6181 gtaagcgctt acgtttttgg tggacccttg aggaaactgg tagctgttgt gggcctgtgg 6241 tctcaagatg gatcattaat ttccaccttc acctacgatg gggggcatcg caccggtgag 6301 taatattgta cggctaagag cgaatttggc ctgtagacct caattgcgag ctttctaatt 6361 tcaaactatt cgggcctaac ttttggtgtg atgatgctga ctggcaggat atataccgtt 6421 gtaat // LOCUS TOBNPTII 200 bp ds-DNA PLN 26-JUL-1990 DEFINITION N.tabacum nptII gene, complete cds. ACCESSION M34757 KEYWORDS nptII protein. SOURCE N.tabacum (strain SR1) DNA. ORGANISM Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 200) AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and Depicker,A. TITLE Cloning and sequence analysis of truncated T-DNA inserts from Nicotiana tabacum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.R.Gheysen, 01-JUN-1990. FEATURES from to/span description pept 198 > 200 nptII protein mRNA 133 > 200 nptII protein mRNA (5' end +/- 2 bp) recomb 193 194 T-DNA end/plant DNA start signal 25 30 CAAT box signal 47 55 CAAT box signal 101 107 TATA box BASE COUNT 64 a 53 c 32 g 51 t ORIGIN 1 caagcctcgc tagtcaaaag tgtaccaaac aacgctttac agcaagaacg gaaatgcgcg 61 tgacgctcgc ggtgacgcca tttcgccttt tcagaaatgg ataaatagcc ttgcttccta 121 ttatatcttc ccaaattacc aatacattac actagcatct gaatttcata accaatctcg 181 atacaccaaa tcggatcatg // LOCUS BOVANDRE 2461 bp ss-mRNA MAM 26-JUL-1990 DEFINITION Cow alpha-1C-adrenergic receptor mRNA, complete cds. ACCESSION J05426 KEYWORDS alpha-1C-adrenergic receptor. SOURCE Cow adult brain cortex, cDNA to mRNA, clone B12. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 2461) AUTHORS Schwinn,D.A., Lomasney,J.W., Lorenz,W., Szklut,P.J., Fremeau,R.T.Jr., Yang-Feng,T.L., Caron,M.G., Lefkowitz,R.J. and Cotecchia,S. TITLE Molecular cloning and expression of the cDNA for a novel alpha-1-adrenergic receptor subtype JOURNAL J. Biol. Chem. 265, 8183-8189 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by D.Schwinn, 29-MAY-1990. FEATURES from to/span description pept 97 1497 alpha-1C-adrenergic receptor BASE COUNT 551 a 667 c 647 g 596 t ORIGIN 1 tgactccccg ctccctcgct cccctcctcc tcacccgccg aggggtggcc ctcaagagcc 61 ggactttgcc ggccccggcc ccggggggct gggaccatgg tgtttctctc cggaaatgcc 121 tccgacagct ccaactgcac ccacccgccg ccaccggtga acatttccaa ggccattctg 181 ctcggggtga tcttgggggg cctcatcctt ttcggggtac tggggaacat cctcgtgatc 241 ctttccgtgg cctgccaccg gcacctgcac tcggtcacac actactacat cgtcaacctg 301 gcggtggccg accttctcct cacttccacg gtgctgccct tctccgctat cttcgagatc 361 ttgggctact gggccttcgg cagggtcttc tgcaatgtct gggcggcggt ggacgtcctg 421 tgctgcacgg cttccatcat gggactctgc atcatctcca tcgaccgcta catcggcgtg 481 agctatcctc tgcgctaccc caccatcgtc acccagaaga ggggcctcat ggccctgctc 541 tgcgtctggg cgctctcttt ggtcatctcc atcgggcccc tcttcggctg gaggcagccg 601 gccccggagg acgagaccat ctgccagatc aacgaggagc cgggctacgt gctcttctcg 661 gctctgggct ccttctacgt gccgctgacc atcatcctgg tcatgtactg ccgggtctac 721 gtcgtggcca agagggagag ccggggcctc aagtcgggcc ttaagaccga caagtcagac 781 tcggagcagg tgacgctccg catccatcgc aaaaacgccc aggtaggagg cagcggggtg 841 accagcgcca agaacaagac gcacttctcc gtgagactgc tcaaattttc ccgcgagaag 901 aaagcggcca aaacgctggg catcgtggtc ggctgcttcg tcctctgctg gctgcctttt 961 ttcttagtga tgcccattgg gtctttcttt cctgatttca ggccctcaga aaccgttttt 1021 aaaatagcat tttggctcgg ttacctaaac agctgcatca accccattat atacccatgc 1081 tccagtcaag agtttaaaaa ggcctttcag aatgtcttga gaatccagtg tctgcgacga 1141 aagcagtcct ccaaacacac cctgggctac acgctgcacg cacccagcca cgtcctggag 1201 ggacagcaca aggacctggt tcgcattccg gtgggatctg cagagacctt ctataagatc 1261 tccaagacgg atggggtctg tgaatggaaa attttctctt ccctaccccg cggatctgcc 1321 aggatggcgg tggccagaga cccatcagcc tgcaccactg cccgggtgag aagtaaaagc 1381 tttttgcaag tgtgctgttg cctggggccc tcgaccccca gtcatggaga gaatcatcag 1441 attccgacca ttaagatcca caccatctcc ctcagtgaaa atggggagga agtctaaagg 1501 acaggaaagg tcagaaggat gggagggtga tcttaggtac ccactctcca cttccttctg 1561 ggaaggccag ttcacgttcc gtggatgctg agacacagcc agtaaaccag ggaccatctg 1621 ggaatgggct ggggaggaga gctgactctg gggcagaggt agggcttaga gacgagagag 1681 gatgtcctac caccatccag ttcactatga tgagaaacag catttccttg aggctaatgc 1741 tctctgggtc attctctgag cctgctttct acgcctgtcc ctttcaacga caaacaccat 1801 gggaaacaga atttcataca caatccaaaa gacgataaat ataggattat gatttcatca 1861 tgaatatttt gagcatgcac tctaagtttg gagctatttc ttgatggagt gaggggattt 1921 tatttccagg ctaaacttgc tgaaagccac gttggatttt tatggagaga aggcctggag 1981 aggaagagcc ttaagatggt ggccaatatc cagacgcatt atttttagag caagttttac 2041 agtccaccct ttctcagttt gggtgaaact tgacagtgag attttattta ccttttgctg 2101 ctgcttgaca ggatactgct cccaattccc taaggatgag ggtgaggggt actcattatg 2161 ccaatggtca tctgcacttg ggtatagaga gtgttgaaag aaccagttgg gaaaaggatg 2221 gcttttcctg gtggaagaca gtaaggatga gagtcagttc ttcaaattct atggacagaa 2281 ttccattaag tggttccaag atcaggtgga ggaaggcttc ttgtgtaaca tatttaaaga 2341 tcaagagttt ggggtggggt gggtgctact ttcaagctaa gatagaggct gcaaaattac 2401 tccacagcct tttcaacatg gcatagaaag gcttttcttg gcaaatcact taccttttcc 2461 a // LOCUS CHKANCC2A 1229 bp ss-mRNA VRT 26-JUL-1990 DEFINITION Chicken anchorin CII mRNA, 3' end. ACCESSION M30971 J03194 KEYWORDS anchorin; collagen-binding protein. SOURCE Chicken cartilage (sternum) and bone, cDNA to mRNA, clones A[1,4,6,7,14,15,22,23]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1229) AUTHORS Fernandez,M.P., Selmin,O., Martin,G.R., Yamada,Y., Pfaeffle,M., Deutzmann,R., Mollenhauer,J. and von der Mark,K. TITLE The structure of anchorin CII, a collagen binding protein isolated from chondrocyte membrane JOURNAL J. Biol. Chem. 263, 5921-5925 (1988) STANDARD simple staff_entry REFERENCE 2 (bases 373 to 504) AUTHORS Fernandez,M.P., Selmin,O., Martin,G.R., Yamada,Y., Pfaeffle,M., Deutzmann,R., Mollenhauer,J. and von der Mark,K. TITLE The structure of anchorin CII, a collagen binding protein isolated from chondrocyte membrane JOURNAL J. Biol. Chem. 265, 8344-8344 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 19 984 anchorin CII BASE COUNT 353 a 230 c 310 g 336 t ORIGIN 28 bp upstream of AccI site. 1 cccggcgaac cggggaagat ggcgaagtat acaagaggca ccgtgacagc attctctcct 61 tttgatgcca gagctgatgc agaagccctt cgcaaggcca tgaagggaat ggggactgat 121 gaagagacaa ttctgaagat ccttaccagc agaaataatg ctcaacgtca agaaattgca 181 tctgctttta aaacactgtt tggcagggat cttgtggatg acctgaaatc agaacttact 241 ggcaagtttg aaacactgat ggtatctttg atgagaccag cacgtatttt tgatgcgcat 301 gcactgaagc atgcaatcaa gggagcagga accaatgaga aagtgttgac tgaaattctt 361 gcctccagaa cacctgctga agtgcagaat attaaacagg tttatatgca agagtatgag 421 gccaacttgg aggataagat cacaggagag acatcaggcc attttcagag actgctggtg 481 gtcctgctgc aggcaaatag agatcctgat ggcagagttg acgaggctct tgttgagaag 541 gatgctcagg tcttgtttag agctggggag ctaaaatggg gaacagatga agaaacattc 601 atcaccatct tgggaactcg aagtgtttct catttgagga gggtgtttga caaatacatg 661 actatttctg gctttcaaat tgaagaaacc attgaccgtg aaacctctgg tgatttggag 721 aagttgcttt tggcagttgt gaagtgcatc cgaagtgtgc ctgcttattt tgctgaaact 781 ttgtattatt ctatgaaagg ggctggcact gatgatgata ccctgatcag agtcatggtt 841 tcaagaagtg aaatcgacct gttggatatt agacatgaat tcagaaagaa ttttgcgaaa 901 tcgttgtatc agatgattca gaaagataca tctggggact acaggaaggc actcctgctc 961 ctctgtggtg gagatgatga gtaatggtgg cagcgacgtg aaggatttct tgtaatccag 1021 ctttgcagcc cttcagttag catgcctagc taagattttg catcttaatg ctttatggct 1081 gttcgaattt atattcatat cacacttatt aaacacaaac atgttactac tagctgataa 1141 acagtccctc ctcctcagac gtcctgactc tgggaatttc agtgccttct gagtgtatgc 1201 aaagtctctc atggagtaga gtagtatcg // LOCUS ECOHLYCA 633 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli hly plasmid hemolysin (hlyC) gene, complete cds. ACCESSION M35668 KEYWORDS hemolysin. SOURCE E.coli hly plasmid pHly152 DNA, clone pANN202-419. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 633) AUTHORS Goebel,W., Hacker,J., Knapp,S., Then,I., Wagner,W., Hughes,C. and Juarez,A. TITLE Structure, function, and regulation of the plasmid-encoded hemolysin determinant of Escherichia coli JOURNAL Basic Life Sci. 30, 791-805 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 121 633 hemolysin (hlyC) BASE COUNT 219 a 99 c 115 g 200 t ORIGIN 1 tagtcacgca ataaaacgtt ctttaatatt aatgcagtta tgacattaaa ggcaagaaac 61 ataaaggcat atttttgcca caatatttaa tcatataatt taagttgtag tgagtttatt 121 atgaatataa acaaaccatt agagattctt gggcatgtat cctggctatg ggccagttct 181 ccactacaca gaaactggcc agtatctttg tttgcaataa atgtattacc cgcaatacag 241 gctaaccaat atgttttatt aacccgggat gattaccctg tcgcgtattg tagttgggct 301 aatttaagtt tagaaaatga aattaaatat cttaatgatg ttacctcatt agttgcagaa 361 gactggactt caggtgatcg taaatggttc attgactgga ttgctccttt cggggataac 421 ggtgccctgt acaaatatat gcgaaaaaaa ttccctgatg aactattcag agccatcagg 481 gtggatccca aaactcatgt tggtaaagta tcagaatttc atggaggtaa aattgataaa 541 cagttagcga ataaaatttt taaacaatat caccacgagt taataactga agtaaaaaga 601 aagtcagatt ttaatttttc attaactggt taa // LOCUS ECOTRMX4 77 bp ss-tRNA RNA 26-JUL-1990 DEFINITION E.coli f-Met-tRNA. ACCESSION M35184 KEYWORDS transfer RNA-f-Met. SOURCE E.coli tRNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 77) AUTHORS Dube,S.K., Marcker,K.A., Clark,B.F.C. and Cory,S. TITLE Nucleotide sequence of N-formyl-methionyl-transfer RNA JOURNAL Nature 218, 232-233 (1968) STANDARD simple staff_review FEATURES from to/span description tRNA 1 77 f-Met-tRNA anticdn 35 37 f-Met-tRNA anticodon cat modified 21 21 d modified 33 33 2'Ome modified 47 47 m7g modified 56 56 p BASE COUNT 14 a 26 c 25 g 12 t ORIGIN 1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggtcg tcggttcaaa 61 tccggccccc gcaacca // LOCUS HECDA8 1435 bp ss-rRNA BCT 26-JUL-1990 DEFINITION H.mustelae 16S ribosomal RNA. ACCESSION M35048 KEYWORDS 16S ribosomal RNA. SOURCE H.mustelae (strain ATCC 43772) ribosomal RNA. ORGANISM Helicobacter mustelae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 9 to 1435) AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and Ferrero,R. TITLE The phylogeny of Helicobacter felis sp. nov., a spiral-shaped bacterium isolated from the gastric mucosa of the cat, Helicobacter mustelae, and related bacteria JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1435) AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and Ferrero,R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by B.J.Paster, 06-JUN-1990. Author address:B.J.Paster Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 > 1435 16S ribosomal RNA BASE COUNT 375 a 301 c 412 g 326 t 21 others ORIGIN 1 attatggaga gtttnatcct ggctcagagt gaacgctggc ggcgtgccta atacatgcaa 61 gtcgaacgat gaagcttcta gcttgctaga agtggattag tggcgcacgg gtgagtaacg 121 cataggttat gtgccccata gtctgggata gccactggaa acggtgatta atactggata 181 ctcctacggg ggnaaagntn ttcgctatgg gatcagccta tgtcctatca gcttgttggt 241 gaggtaatgg ctcacnnagg ctatgacggg tatccggcct nagagggtga tcggacacac 301 tggaactgag acacggtcca gactcctacg ggaggcagca gtagggaata ttgctcaatg 361 ggcgaaagcc tgaagcagca acgccgcgtg gaggatgaag gttttaggat tgtaaactcc 421 ttttctaaga gaagataatg acggtatctt aggaataagc accggcnnac tccgtgccag 481 cagccgcggn antacggagg gtgcnagcgt tactcggaat cactgggcgt naagagcgcg 541 taggcggagt aataagtcag atgtgaaatc ctgtagctta actacagaac tgcatttgaa 601 actgttattc tagagtgtgg gagaggtagg tggaattctt ggtgtagggg tnaaatccgt 661 agagatcaag aggaatactc attgcgaagg cgacctactg gaacattact gacgctgatg 721 cgcgaaagcg tggggagcaa acaggattag ataccctggt agtccacgcc ctaaacgatg 781 aatgctagtt gttggggtgc ttgtcactcc agtaatgcag ttaacacatt aagcattccg 841 cctggggagt acggtcgcaa gattaaaact caaaggaata gacggggacc cgcacaagcg 901 gtggagcatg tggtttaatt cgannntacg cgaagaacct tacctaggct tgacattgat 961 agaatctgct agaaatagcg gagtgtctag tttactagac cttgaaaaca ggtgctgcac 1021 ggctgtcgtc agctcgtgtc gtgagatgtt gggttaagtc ccgcaacgag cgcaaccctc 1081 gttcttagtt gctagcagtt cggctgagca ctctaagaag actgccttcg tnaggaggag 1141 gaaggtgagg acgacgttaa gtcatcatgg cccttacgcc tagggctaca cacgtgctac 1201 aatggggtgc acaaagagac gcaataccgc gaggtggagc aaatctcaaa aacatctctc 1261 agttcggatt gtagtctgca actcgactac atgaagctgg aatcgctagt aatcgtgaat 1321 cagccatgtc acggtgaata cgttcccggg tcttgtactc accgnccgtc acaccatggg 1381 agttgtattc gccttaagcc gggatgctaa attggctacc gtccanggcg gatnc // LOCUS HECRDA 1446 bp ss-rRNA BCT 26-JUL-1990 DEFINITION H.felis 16S ribosomal RNA. ACCESSION M35047 KEYWORDS 16S ribosomal RNA. SOURCE H.felis (ATCC 49179) ribosomal RNA. ORGANISM Helicobacter felis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 9 to 1446) AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and Ferrero,R. TITLE The phylogeny of Helicobacter felis sp. nov., a spiral-shaped bacterium isolated from the gastric mucosa of the cat, Helicobacter mustelae, and related bacteria JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1446) AUTHORS Paster,B.J., Lee,A., Dewhirst,F.E., Fox,J.G., Tordoff,L.A. and Ferrero,R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by B.J.Paster, 06-JUN-1990. Author address:B.J.Paster Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 > 1446 16S ribosomal RNA BASE COUNT 354 a 299 c 402 g 310 t 81 others ORIGIN 1 tttatggaga gtttgatcct ggctcagagt gaacgctggc ggcgtgccta atacatgcaa 61 gtcgaacgat gaagcctagc ttgctaggcg gattagtggc gcacgggtga gtaacgcata 121 gatgacatgc cctttagttt gggatagcca ctagaaatgg tgattaatac caaatactac 181 ctacggggga aagatttatc gctaaaggat tggtctatgt cctatcagct tgttggtgag 241 gtaaaggctc acnnaggcta tgacgggtat ccggcctgag agggtgaacg gacacactgg 301 aactgagaca cggtccagac tccnncggga ggcagcagta gggaatattg ctcaatgggc 361 gcaagcctga agcagcaacg ccgcgtggag gatgaaggtt ttaggattgt aaactccttt 421 tgtcagagaa gataatgacg gtatctgacg aataagcacc ggctanctcc gtgccagcag 481 ccgcggtaat acggagggtg cnagcgttac tcggaatcnc tgggcgtaaa gagtgcgtag 541 gcggggttgt aagtcagatg tgaaatccta tggcttaacc atagaactgc atttgaaact 601 acaactctgg agtgtgggag aggtaggtgg aattcttggt gtaggggtaa aatccgtaga 661 gatcaagagg aatactcatt gcgaaggcga cctgctggaa caatactgac gctgattgcn 721 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccct aaacgatgga 781 tgctagttgt tggggggctt tgtcctccca gtaatgcagc taacgcctta agcatcccgc 841 ctggggagta cggtcgcaag annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc gaagaacctt acctaggctt gacattgaan 961 gaatctgcta gaaatatgtg agtgtctagc ttgctagacc ctgaaaacag gtgctgcacg 1021 gctgtcgtca gctcgtgtcg tgagatgttg ggttaagtcc cgcaacgagc gcaaccctct 1081 ttcttagttg ctaacaggta gtgctgagct ctctaagaat actgcctgcg taagcaggag 1141 gaaggtgagg acgacgtcaa gtcatcatgg cccttacgcc tagggctaca cacgtgctac 1201 aatggggtgc acaaagagat gcaatgccgc gaggttgagc caatcttaaa aacnnctctc 1261 agttcggatt gcaggctgca actcgcctgc atgaagctgg aatcgctagt aatcgcaaat 1321 cagccatgtt gcggtgaata cgttcccggg tcttgtactc accgnncgtc acaccatggg 1381 agttgtgttt gccttaagtc aggatgctaa ggtagctact gcccacggca cacacagcga 1441 ctgggg // LOCUS HUMHPBS 821 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human peripheral benzodiazepine receptor (hpbs) mRNA, complete cds. ACCESSION M36035 KEYWORDS peripheral benzodiazepine receptor. SOURCE Human hystiocytic lymphoma monocyte-like cell line U937, cDNA to mRNA, clone p-hPBS11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 821) AUTHORS Riond,J., Mattei,M.G., Kaghad,M., Dumont,X., Guillemot,J.C., Le Fur,G., Caput,D. and Ferrara,P. TITLE Molecular cloning and chromosomal localization of a human peripheral-type benzodiazepine receptor JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Riond, 27-JUN-1990. Author address:J.Riond SANOFI ELF BIO RECHERCHES BP137 31328 LABEGE CEDEX FRANCE FEATURES from to/span description pept 62 571 peripheral benzodiazepine receptor mRNA < 1 811 peripheral benzodiazepine receptor mRNA site 800 805 polyadenylation site BASE COUNT 118 a 271 c 260 g 171 t 1 others ORIGIN Chromosome 22, map position q13.3. 1 agtgcccttc ccggagcgtg ccctcgccgc tgagctcccc tgaacagcag ctgcagcagc 61 catggccccg ccctgggtgc ccgccatggg cttcacgctg gcgcccagcc tggggtgctt 121 cgtgggctcc cgctttgtcc acggcgaggg tctccgctgg tacgccggcc tgcagaagcc 181 ctcgtggcac ccgccccact gggtgctggg ccctgtctgg ggcacgctct actcagccat 241 ggggtacggc tcctacctgg tctggaaaga gctgggaggc ttcacagaga aggctgtggt 301 tcccctgggc ctctacactg ggcagctggc cctgaactgg gcatggcccc ccatcttctt 361 tggtgcccga caaatgggct gggccttggt ggatctcctg ctggtcagtg gggcggcggc 421 ngccactacc gtggcctggt accaggtgag cccgctggcc gcccgcctgc tctaccccta 481 cctggcctgg ctggccttcg cgaccacact caactactgc gtatggcggg acaaccatgg 541 ctggcatggg ggacggcggc tgccagagtg agtgcccggc ccaccaggga ctgcagctgc 601 accagcaggt gccatcacgc ttgtgatgtg gtggccgtca cgctttcatg accactgggc 661 ctgctagtct gtcagggcct tggcccaggg gtcagcagag cttcagaggt tgccccacct 721 gagcccccac ccgggagcag tgtcctgtgc tttctgcatg cttagagcat gttcttggaa 781 catggaattt tataagctga ataaagtttt tgacttcctt t // LOCUS XELAAA 121 bp ss-rRNA VRT 26-JUL-1990 DEFINITION X.laevis 5S ribosomal RNA. ACCESSION M35175 KEYWORDS 5S ribosomal RNA. SOURCE X.laevis somatic cell ribosomal RNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 121) AUTHORS Wegnez,M. and Denis,H. TITLE Expression des genes ribosomiques 5 S chez le Xenope JOURNAL Arch. Int. Physiol. Biochim. 81, 211-213 (1973) STANDARD simple staff_review FEATURES from to/span description rRNA 1 121 5S ribosomal RNA BASE COUNT 24 a 34 c 38 g 25 t ORIGIN 1 gcctacggcc acaccaccct gaaagtgccc gatctcgtct gatctcggaa gccaagcagg 61 gtcgggcctg gttagtactt ggatgggaga ccgcctggga ataccaggtg tcgtaggctt 121 t // LOCUS XELAAB 121 bp ss-rRNA VRT 26-JUL-1990 DEFINITION X.laevis 5S ribosomal RNA. ACCESSION M35176 KEYWORDS 5S ribosomal RNA. SOURCE X.laevis oocyte ribosomal RNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 121) AUTHORS Wegnez,M. and Denis,H. TITLE Expression des genes ribosomiques 5 S chez le Xenope JOURNAL Arch. Int. Physiol. Biochim. 81, 211-213 (1973) STANDARD simple staff_review FEATURES from to/span description rRNA 1 121 5S ribosomal RNA BASE COUNT 25 a 33 c 37 g 26 t ORIGIN 1 gcctacggcc acaccaccct gaaagtgcct gatctcgtct gatctcagaa gcgatacagg 61 gtcgggcctg gttagtactc ggatgggaga ccgcctggga ataccaggtg tcgtaggctt 121 t // LOCUS ECORR50L1A 165 bp ds-DNA RNA 26-JUL-1990 DEFINITION E.coli 50S rRNA protein L1-associated RNA. ACCESSION M24864 KEYWORDS 50S ribosomal RNA. SOURCE E.coli 50S ribosomal RNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 165) AUTHORS Branlant,C., Krol,A., Sriwidada,J. and Brimacombe,R. TITLE RNA sequences associated with proteins L1, L9, and L5, L18, L25, in ribonucleoprotein fragments isolated from the 50-S subunit of Escherichia coli ribosomes JOURNAL Eur. J. Biochem. 70, 483-492 (1976) STANDARD simple staff_entry FEATURES from to/span description modified 13 13 7-methyluridine unsure 61 61 u could be a unsure 141 141 c could be g BASE COUNT 36 a 35 c 47 g 47 t ORIGIN 1 taacctttac tatggcgaca ctgaacattg agccttgatg tgtaggatag gtgggagctt 61 tgaagtggac gtgccagtct gcatggagcc gaccttgaaa taccctttac aatgtttgat 121 gttctaacgt ggacccgctt cgggttgcat cgcggacagt gtctg // LOCUS BMOSP1 1512 bp ds-DNA INV 26-JUL-1990 DEFINITION Silkworm (B.mori) storage protein 2 (SP2) gene, exon 1. ACCESSION M24371 J04829 KEYWORDS arylphorin-type storage protein; storage protein; storage protein 2. SEGMENT 1 of 2 SOURCE Silkworm (strain Tokai x Asahi; 5th larval instar) DNA and cDNA to mRNA. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 983) AUTHORS Fujii Tomino,S. TITLE Structure of the gene for the arylphorin-type storage protein, Sp 2 of Bombyx mori JOURNAL J. Biol. Chem. 264, 11020-11025 (1989) STANDARD simple staff_review REFERENCE 2 (bases 984 to 1512) AUTHORS Fujii,T., Sakurai,H., Izumi,S. and Tomino,S. JOURNAL Unpublished (1989) 2-1-1 Fukazawa, Setagaya-ku, Tokyo 158, Japan STANDARD simple staff_review COMMENT Draft entry and sequence for [1],[2] kindly submitted by S.Tomino, 28-APR-1989. FEATURES from to/span description pept 796 + 883 storage protein SP2, exon 1 pre-msg 771 > 1512 SP2 mRNA and intron IVS 884 > 1512 SP2 intron A binding 436 441 glucocorticoid-receptor binding site site 447 457 SV40 enhancer core conflict 199 199 c in [1]; g in [2] BASE COUNT 475 a 266 c 247 g 524 t ORIGIN Unreported. 1 aagcttttta aaaaaagaac tttatttaat tttaataatt aaaacatttg aaattaacaa 61 ttgaaattaa ttggcgcaag tgtcaccggg agcgcggtta gaattgaact gcgtgatcta 121 tcggtaacct aactaagctg cattacgtcg tgcaccttac attgcacatt tatgtacatt 181 aaaaatatat aacagtaccc aataaaaaag cattatttcg tcttgtaaca gtcggttgaa 241 aaattgaaag taattaacga catgcttaga gtttcgatcg tagtaaaagc tacgttttgt 301 ctatcatatt agaaagatat agtaacttct tttgtctctc tttattcttt aaattttact 361 taatcaggtg aatagctttt actactttac tcaatgtttt catcatactc ctggctaagt 421 cttcgctagc ccgcctgtcc tagtaagccg tggaaaggct ccgggacacc agcaaacctt 481 caatcataaa aaaaattgct ttcatcatgt tttcgtttac agttttacaa atatttcata 541 attttccatt cctttttttt gaattatata ataataacaa gaaaaaaact ttatatctat 601 ttgtttatca tcatcgttga aatttatatt cagtaattca aattatgaga ccggtgaaaa 661 ggtcagtaga ttacgttgat aatgaaagca taacacttgt tgctaatgag tgcatgtttc 721 gggagaagat aaagtgtggg tataaatatt cgaaaacgga ttgcagaagc acagtttgct 781 tctaggctgg aaaccatgaa gtctgtcttg attctggctg ggcttgtagc cgtcgcgctc 841 agcagtgcag taccaaaacc gagcaccata aagtcaaaaa atggtaagcg ttaaatagta 901 gtgctctatt ttaatacgct tttattatta ttattattat tattaattct ttatttcagt 961 tttgtttttt aaaaccataa cattttgtta gtagtaatta cttatatcta tgttagtgac 1021 ttaaaaaatc taacacataa ctctcattat atatatacat tttataccat tacatttttt 1081 attttatttt tttctccttc caagtgccta ctgcaaaggc tattgatcag cagtccctcg 1141 atcttgctcg atatgattct caaaagactg ttgccactgt cacgaactcg acgcaataac 1201 gatgcacttc tcttccgcat tattgcaaag aagtcatcgg tgtgagatgt cgcaaacatt 1261 gtggatgcac tacaaaagcg cggcagtgac aacatcatcc taaacgcatt attatattga 1321 acgcgtaggg cattgtaagc tctccgcgtg tatgtggtcc acagactact ggcgtaaaaa 1381 ttctggcaat aagctttaaa aattgtaatt tgacatacta tcgcaaccag taaatctgcg 1441 ggccagcata ttgcatctta ctatcaatta ttattattat tttttttatt gcttagatgt 1501 gtggacgagc tc // LOCUS BMOSP2 3876 bp ds-DNA INV 26-JUL-1990 DEFINITION Silkworm (B.mori) storage protein 2 (SP2) gene, exons 2,3,4, and 5. ACCESSION M24370 J04829 KEYWORDS arylphorin-type storage protein; storage protein; storage protein 2. SEGMENT 2 of 2 SOURCE Silkworm (strain Tokai x Asahi; 5th larval instar) DNA and cDNA to mRNA. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1288 to 3876) AUTHORS Fujii,T., Sakurai,H., Izumi,S. and Tomino,S. TITLE Structure of the gene for the arylphorin-type storage protein, Sp 2 of Bombyx mori JOURNAL J. Biol. Chem. 264, 11020-11025 (1989) STANDARD simple staff_review REFERENCE 2 (bases 1 to 1287) AUTHORS Tomino,S. JOURNAL Unpublished (1989) 2-1-1 Fukazawa, Setagaya-ku, Tokyo 158, Japan STANDARD simple staff_review COMMENT Draft entry and sequence for [1],[2] kindly submitted by S.Tomino, 28-APR-1989. FEATURES from to/span description pept + 1388 1521 storage protein SP2, exon 2 1614 2444 storage protein SP2, exon 3 2526 2692 storage protein SP2, exon 4 2850 3744 storage protein SP2, exon 5 pre-msg < 1 3846 SP2 mRNA and introns IVS < 1 1387 SP2 intron A IVS 1522 1613 SP2 intron B IVS 2445 2525 SP2 intron C IVS 2693 2849 SP2 intron D BASE COUNT 1204 a 734 c 696 g 1242 t ORIGIN Unknown number of bp after segment 1. 1 ttgctagccc ttcttcttta tgttttggag aaggttctca attcaaaatg tacgttttca 61 ttatagcctt attacgaaag cttatacgaa cgttatatct ttaactatgc atacagccgt 121 ctattgaatc attgttgtta taaattgttt tacaattgct ataggctcac atctctttct 181 gaggcgtgat ttagaaaagg atgcacgatg cgtgatccaa tttggaattt gatagctcgg 241 cctcatctcc tgcctcatag caaggccgat tttgtgaggc ctcctatcta aactaaaaag 301 aacaaaaccg cacttacccc gcagcggccg actaggttgc actgttgcta taccatcatt 361 tgtatgttgg tatattatta ccgctgtaat gtataggtac attaccgcca gtattgcata 421 tgttgcacga tgaacatgtt caatatatgt aaaatttaca atttaaatac gtcaccgttt 481 caacacaaaa ctatttgcaa atggattcat cattcatcat ctaaactcgt cgtggcctaa 541 aggataagac gtccggtgca ttcgtgttga gcgatgcacc ggtgctcgaa tcccaagcgg 601 gtaccaattt ttctaatgga atacgtactc aacaaatgtt catgattgac ttccacggta 661 aaggaataac atctatacta atattataaa gaggaaagat ttgtttgttt gtttgtttcg 721 aataggctcc gaaactactg gaccgatttg aaaaattctt tttccattag aagccaacat 781 tgtccctgat gaacataggc tacatttttt aatttttttt tttttttttg tttcatgtgt 841 gttttaatgt ttccgaagcg aagcgagggc gggtcgctag tcgtgtaata aaaatcaaag 901 ccgcaaaaat tataatttgc gtaattacta gtggtaggac ctcttgtgac gcaagggtag 961 gtacttgaga ccttagaatt tatatctcaa ggtgggtggt gcatatacgt tgtaaatgtc 1021 tatggggtct agtaaccgct taacaccagg tggactagtt cagccaccta agcaataaaa 1081 ataaaaatca tcaaaataga aaatcaacca ttgtaggttt ataccgtatt gactaagtaa 1141 taaagaaaag caggtttttt ttacaaacaa ccaaattatg taataaaagt aaatatagta 1201 agctatgaac gaccgattag tggtaacata tcggcgctga aagttcctaa tgtgctttga 1261 tgccaatatt tatctcagaa ttgaagttat tcaatacttt ccagataatg atgacatcta 1321 agtgatatcg cttattcgta aatacttctt tataaaatat ttacatatat ttttttactt 1381 tattcagtgg atgccgtatt tgttgaaaag caaaagaaaa ttctgtcctt cttccaagat 1441 gtgagccaac taaacactga tgatgaatat tataaaattg gcaaagacta tgatatcgaa 1501 atgaatatgg acaactacac tgtaagtact aataattaat atcaatttaa atttaacgtg 1561 aatttgtttg tttctttctt tctttattga aaaccatgtt tcatatttaa cagaacaaga 1621 aagctgttga agaatttctg aagatgtaca ggactggttt tatgcctaag aatttagagt 1681 tctccgtttt ttatgacaag atgagggatg aagctattgc tctattggat ttattctatt 1741 acgctaagga ctttgaaacg ttctacaaga gtgcctgttt tgcgcgtgtg catctcaatc 1801 aaggtcaatt cttgtatgcc ttctacatcg ctgttatcca gcgccctgat tgccacggtt 1861 tcgttgttcc tgctccgtat gaagtatacc ctaaaatgtt tatgaatatg gaagtgctgc 1921 aaaaaattta cgtaacaaag atgcaacatg gcctcattaa tcctgaagcc gcagctaagt 1981 atggcattca caaggaaaac gactacttcg tttacaaagc caattattct aacgccgttt 2041 tatacaataa tgaagaacaa aggctgacat acttcactga ggatattggc atgaacgctt 2101 actactacta cttccactct catttaccgt tctggtggac atcagaaaaa tacggagccc 2161 ttaaagagcg tcgtggagag gtttacttct acttctacca gcaattattg gctcgttact 2221 actttgagcg tcttaccaat ggacttggta agattcccga attctcatgg tactctccga 2281 taaagactgg atactatcca ttgatgctaa ctaagtttac acccttcgca caaagacctg 2341 actactacaa cttgcacacc gaagaaaact atgaaagagt aagattcctt gacacttatg 2401 agaagacatt cgttcagttc ctccaaaagg accactttga agccgtaagt tcgaacatta 2461 agtgtctaat cttattggtt tatttctaaa aatgtatgaa tttaatagat tttttcattt 2521 tatagttcgg acaaaaaatt gattttcacg acccgaaagc cattaacttc gtcggcaact 2581 actggcaaga taatgcagat ctgtatggag aagaagtcac aaaagattac caacgttctt 2641 acgaagtatt tgcgcgccgt gtgctaggtg ctgcgcctat gccattcgac aagtacgttt 2701 aaaaaatatt ttcaaaactt aatttttact aagcaatgac gacaactctt ttctacgtta 2761 tatccaagtc aaccgtaatc cggatttgtc tttgtacgtt tgcaaaaaaa ttaatagtaa 2821 tacatagttt cttcatgcta ctttttcagg tacactttca tgcctagtgc aatggacttt 2881 taccagactt ctcttcgtga tcctgctttc tatcagctct acaacagaat tgtggaatac 2941 atcgttgagt tcaagcaata cttgaagcct tacactcaag acaaacttta ctttgatggt 3001 gtcaagataa ctgatgttaa agtcgacaaa ttgacaacat tcttcgagaa ctttgaattc 3061 gacgccagca acagcgtgta ctttagtaag gaggagatta agaacaatca cgtccatgag 3121 ttaaggtgcg ccacacgatt gaaccacagc cccttcaacg ttaacattga ggttgattct 3181 aatgtcgcca gtgacgctgt tgtcaaaatg ttgctggccc ccaaatacga tgacaacgga 3241 atacctctca cattagagga caactggatg aaattcttcg agttggactg gttcacaact 3301 aaactcaccg ctggtcagaa caagattatc cgcaattcga atgaatttgt catatttaaa 3361 gaagactccg tgccaatgac tgaaattatg aagatgctcg acgaaggaaa agtacctttt 3421 gatatgtcgg aagagttctg ttacatgcct aaaagactca tgctgcctag aggtactgaa 3481 ggtggattcc cattccagct ctttgttttc gtctatccat tcgacaacaa aggcaaggac 3541 ttggctcctt tcgaatcttt tgttcttgac aataacctct tggcttccct ctggatcgcc 3601 ccgttgttga tgcattattc aaggttccta acatgtattt caaggatatt ttcatttacc 3661 acgagggtga acggttccct tacaaattca atcttccttc gtatgacaca catgataatg 3721 ttgttccaaa aaattaaatt ttaataaact gatgaatttt gcatccgtaa tatccaaaga 3781 aaatgtaaaa actttaagta gaactgttat gatttagaaa aaataaaatc aagtaggtaa 3841 aattataatt atgtattttt attgcatgca ttttta // LOCUS HUMG6PA 1464 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human glucose-6-phosphate dehydrogenase, complete cds. ACCESSION M24470 M27958 KEYWORDS glucose-6-phosphate dehydrogenase. SOURCE Human, cDNA to mRNA, clone NG6PD 1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1464) AUTHORS Kanno,H., Huang,I.-Y., Kan,Y.W. and Yoshida,A. TITLE Two structural genes on different chromosomes are required for encoding the major subunit of human red cell glucose-6-phosphate dehydrogenase JOURNAL Cell 58, 595-606 (1989) STANDARD simple staff_review COMMENT Draft entry and sequence for [1] kindly submitted by A.Yoshida, 02-MAY-1989. FEATURES from to/span description pept 72 1109 glucose-6-phosphate dehydrogenase /nomgen="G6PD" /map="Xq28" /hgml_locus_uid="LH0033J" mRNA < 1 1464 glucose-6-phosphate dehydrogenase mRNA BASE COUNT 331 a 404 c 389 g 340 t ORIGIN 1 ctccccgcgc cgccccgcgc aggcgccccc gccccgccgt cgccgccgcc gcagccagga 61 gccgctgcac catgccccgc atagatgcgg acctcaagct cgacttcaag gacgtcctgc 121 tccgacctaa gcggagcagc ctcaagagcc gagccgaggt ggatcttgaa cgcaccttca 181 cgtttcgaaa ttcaaagcag acctactcag ggattcccat catcgtggcc aacatggaca 241 ctgtgggcac gtttgagatg gcagccgtga tgtcacagca ctccatgttt acagcaattc 301 ataagcatta ctccctggat gactggaagc tctttgccac aaatcaccca gaatgcctgc 361 agaatgtagc cgtgagttca ggcagtgggc agaatgatct ggaaaagatg accagcatcc 421 tggaagctgt gccacaggtt aagtttattt gcctggatgt ggccaatggg tattcagaac 481 attttgtgga attcgtgaaa cttgtccgtg ccaaatttcc tgaacacacc attatggcag 541 ggaacgtggt gacaggagaa atggtagaag agcttattct ttccggagca gatatcatca 601 aagtgggagt tggaccaggt tctgtgtgca ccacccgcac caagacggga gtggggtacc 661 cccagctgag tgccgtcatt gagtgtgccg actctgccca cggcctgaag ggccacatca 721 tctctgatgg aggctgtacg tgtccagggg atgtcgccaa agcctttgga actggagcag 781 attttgtcat gctgggagga atgttttcgg gtcatacgga gtgtgctgga gaagtgattg 841 agaggaacgg acggaagctc aagctcttct acgggatgag ctctgacacc gccatgaaca 901 agcacgcagg aggagttgct gagtacagag cctctgaggg taagactgtg gaagttcctt 961 acaaaggaga tgtggaaaac actatcctgg atattctcgg gggactgagg tccacgtgca 1021 cctacgtggg ggccgccaaa ctcaaggagc tcagcaggag ggcaacattc atccgggtga 1081 cccagcagca caacaccgtg ttcagctaac cctggggaca aagcagcgtc tggctcgatg 1141 gaagcgtcca aacctgcttt tcccatctcc ccccaagtct gttccgtcag agcttctggc 1201 tgctcctgaa tggtggaatg cctgtgtcct ctcttctgtc tcctgccgcc tggaggcttc 1261 ggggctctcc cgcctgcctt ctcggggccc agacgcaagg caccgattgg gccaacatca 1321 gagccctgct gcccagaact cataacctca ttgttcaaac caacacttgc acctttctct 1381 ttttctcttt ctctctccct ttctttgttt ttctttcttt tttaaaagaa gatggtttca 1441 gctttaatat aatgctatta tctt // LOCUS MUSGT1A 2544 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse glucose transporter 1 mRNA, complete cds. ACCESSION M23384 J04557 KEYWORDS glucose transporter 1. SOURCE Mouse adipocyte cell line 3T3-L1, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites) AUTHORS Kaestner,K.H., Christy,R.J., McLenithan,J.C., Braiterman,L.T., Cornelius,P., Pekala,P.H. and Lane,M.D. TITLE Sequence, tissue distribution, and differential expression of mRNA for a putative insulin-responsive glucose transporter in mouse 3T3-L1 adipocytes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3150-3154 (1989) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 2544; for [1]) AUTHORS Kaestner,K.H., Christy,R.J., McLenithan,J.C., Braiterman,L.T., Cornelius,P., Pekala,P.H. and Lane,M.D. JOURNAL Unpublished (1989) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly provided by M.Lane, 28-MAR-1989. FEATURES from to/span description pept 190 1668 glucose transporter 1 BASE COUNT 514 a 719 c 679 g 632 t ORIGIN 1 ttggtcctat aaaaaggcag ctccgcgcgc tctcccccaa gagcagaggc ttgcttgtag 61 agtgacgatc tgagctacgg ggtcttaagt gcgtcagggc gtggaggtct ggcgggagac 121 gcatagttac agcgcgtccg ttctccgtct cgcagccggc acagctagag cttcgagcgc 181 agcgcggcca tggatcccag cagcaagaag gtgacgggcc gcctcatgtt ggctgtggga 241 ggagcagtgc tcggatcact gcagttcggc tataacactg gtgtcatcaa cgccccccag 301 aaggttattg aggagttcta caatcaaaca tggaaccacc gcatcggaga gcccatccca 361 tccaccacac tcaccacgct ttggtctctc tccgtggcca tcttctctgt cgggggcatg 421 attggttcct tctctgtcgg cctctttgtt aatcgctttg gcaggcggaa ctccatgctg 481 atgatgaacc tgttggcctt tgtggctgct gtgcttatgg gcttctccaa actgggcaag 541 tcctttgaga tgctgatcct gggccgcttc atcatcggtg tgtactgcgg cctgactact 601 ggctttgtgc ccatgtatgt gggagaggtg tcacctacag ctctacgtgg agccctaggc 661 acactgcacc agctgggaat cgtcgttggc atccttattg cccaggtgtt tggcttagac 721 tccatcatgg gcaatgcaga cttgtggcct ctgctgctca gtgtcgtctt cgtcccagcc 781 ctgctacagt gtatcctgtt gcccttctgc cccgagagcc cccgcttcct gctcatcaat 841 cgtaacgagg agaaccgggc caagagtgtg ctgaagaagc ttcgagggac agccgatgtg 901 acccgagacc tgcaggagat gaaagaagag ggtcggcaga tgatgcggga gaagaaggtc 961 accatcttgg agctgttccg ctcacccgcc taccgccagc ccatcctcat cgctgtggtg 1021 ctgcagctgt cccagcagct gtcgggtatc aatgctgtgt tctactactc aacgagcatc 1081 ttcgagaagg caggtgtgca gcagcctgtg tacgccacca tcggctccgg tatcgtcaac 1141 acggccttca ctgtggtgtc gctgtttgtt gtagagcgag ctggacgacg gaccctgcac 1201 ctcattggcc tggctggcat ggcaggctgt gctgtgctca tgaccatcgc cctggccttg 1261 ctggaacggc tgccttggat gtcctatctg agcatcgtgg ccatctttgg ctttgtggcc 1321 ttctttgaag taggccctgg tcctattcca tggttcattg tggccgagct gttcagccag 1381 gggccccgtc ctgctcgtat tgctgtggct ggcttctcca actggacctc aaacttcatt 1441 gtgggcatgt gcttccagta tgtggagcaa ctgtgcggcc cctacgtctt catcatcttc 1501 acggtgctcc tcgtgctctt cttcatcttc acctacttca aagtccctga gaccaaaggc 1561 cgaaccttcg atgagatcgc ttccggcttc cggcaggggg gtgccagcca aagtgacaag 1621 acacccgagg agctcttcca ccctctgggg gcggactccc aagtgtgagg agccccacac 1681 ccagcccggc ctgctccctg cagcccaagg atctctctgg agcacaggca gctagatgag 1741 acctcttccg aaccgacaga tctcgggcaa gccgggcctg ggcgcctttc ctcagccagc 1801 agtgaagtcc aggaggatat tcaggacttt gatggctcca gaatttttaa tgaaagcaag 1861 actgctgctc agatctattc agataagcag caggttttat aattttttta ttactgattt 1921 tgttattttt tttttttatc agccactctc ctatctccac actgtagtct tcaccttgat 1981 tggcccagtg cctgagggtg gggaccacgc cctgtccaga cacttgcctt ctttgccaag 2041 ctaatctgta gggctggacc tatggccaag gacacactaa taccgaactc tgagctagga 2101 ggctttacgc tggaggcggt agctgccacc cacttccgca ggcctggacc tcggcaccat 2161 aggggtccgg actccatttt aggattcgcc cattcctgtc tcttcctacc caaccactca 2221 attaatcttt ccttgcctga gaccagttgg aagcactgga gtgcagggag gagagggaag 2281 ggccaggctg ggctgccagg ttctagtctc ctgtgcactg agggccacac aaacaccatg 2341 agaaggacct cggaggctga gaacttaact gctgaagaca cggacactcc tgccctgctg 2401 tgtatagatg gaagatattt atatactggt tgtcaatatt aaatacagac actaagttat 2461 agtatatctg gacaaaccca cttgtaaata caccaacaaa ctcctgtaac tttacctaag 2521 cagatataaa tggctggttt ttag // LOCUS MUSMS6HM 321 bp ds-DNA ROD 26-JUL-1990 DEFINITION M.musculus Ms6-hm locus, repeat elements. ACCESSION J04743 KEYWORDS dispersed repetitive element; minisatellite sequence. SOURCE M.musculus (strain C57BL/6J) DNA, clone pMm3-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Kelly,R., Bulfield,G., Collick,A., Gibbs,M. and Jeffreys,A.J. TITLE Characterization of a highly unstable mouse minisatellite locus: Evidence for somatic mutation during early development JOURNAL Genomics 5, 844-856 (1989) STANDARD full staff_entry COMMENT Printed sequence for [1] kindly submitted by R.Kelly, 08-AUG-1989. FEATURES from to/span description rpt 37 168 MT dispersed repetitive element rpt 168 263 tandem repeated element rpt 263 > 321 MT dispersed repetitive element BASE COUNT 84 a 66 c 109 g 62 t ORIGIN 1 gatccccagt gatgtaaacc agactatatg gctaactgtt ttagttagag tttctagttg 61 ctgtgaccaa caccatgacc aaaaagcaag ttggggagga aaggatttat ttgacttaca 121 cttccatata actgttcatc atcaaaagaa atcaggacag aaacccgggg gcagggcagg 181 gcagggcagg gcagggcagg gcagggcagg gcagggcagg gcagggcagg gcagggcagg 241 gcagggcagg gcagggcagg gcagggctga tgtagcgtca ctgaggagtc ctgcttccta 301 ctttgcttcc atgggtggat c // LOCUS RABCYP4A6 1790 bp ss-mRNA MAM 26-JUL-1990 DEFINITION Rabbit cytochrome P450IVA6 (CYP4A6) mRNA, complete cds. ACCESSION M28656 KEYWORDS cytochrome P450; lauric acid omega-hydroxylase. SOURCE Rabbit (strain New Zealand White, adult) kidney, cDNA to mRNA, clone KdA6. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 1790) AUTHORS Johnson,E.F., Walker,D.L., Griffin,K.J., Clark,J.E., Okita,R.T., Muerhoff,A.S. and Masters,B.S. TITLE Cloning and expression of three rabbit kidney cDNAs encoding lauric acid omega-hydroxylases JOURNAL Biochemistry 29, 873-879 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.F.Johnson, 04-OCT-1989. FEATURES from to/span description pept 14 1546 lauric acid omega-hydroxylase BASE COUNT 341 a 610 c 479 g 360 t ORIGIN 1 gggccgctgc accatgagcg tgtctgcact gaaccccacc cggctcccgg gcagcctctc 61 cgggctcctc caagtggcgg gcctgctggg cctgctcctg ctgctgctca aggcagctca 121 gctctacctg caccgccagt ggctgctcag agccctccag cagttcccgt gcccaccctt 181 ccactggctc ctggggcaca gccgagagtt ccaaaatggc catgagttac aagtgatgct 241 gaaatgggtg gagaaattcc caagtgcttg tcctcgctgg ctatggggga gcagagccca 301 cctcctgatc tatgaccctg actacatgaa ggtgattctg gggagatcag acccaaaagc 361 tcaaggttcc tacagattcc tggctccctg gattgggtat ggtttgctcc tgctgaatgg 421 gcagacgtgg ttccagcacc ggcgcatgct caccccagcc ttccactacg acatcctgaa 481 gccctacgtg gggctcatgg cggactccgt ccaaatcatg ctggacaaat gggagcagct 541 ggtcagccag gactcctccc tggaggtctt ccaagacatc tccctgatga ccctggacac 601 catcatgaag tgtgccttca gccaccaggg cagcgtccag ttggacagga attcccagtc 661 ctacatccag gctgttgggg acctgaacaa cctgttcttt tcccgagtga ggaacgtctt 721 tcatcagagt gacaccatct acaggctgag ccctgaaggc cgcttgtccc accgtgcctg 781 ccagctcgcc cacgagcaca cagaccgagt gatccagcag aggaaggctc agctgcagca 841 ggagggggag ctggagaagg tcaggaggaa gaggcgcttg gacttcctgg acgtcctcct 901 ctttgccaag atggagaacg ggagcagcct gtccgaccag gacctccgcg ccgaggtgga 961 cacgttcatg ttcgagggcc acgacaccac ggccagcggc atctcctgga tcttctatgc 1021 cctggccacg caccccgagc atcagcaccg gtgccgcgag gagatccagg gcctcctggg 1081 ggacggagcc tccatcacct gggagcacct ggaccagatg ccctacacca ccatgtgcat 1141 caaggaggcg ctgagactct acccaccagt gccaggtgtc ggcagacagc tcagctcacc 1201 tgtcaccttc cctgatggac gctccctccc caagggtgtc atagtcacgc tctccatcta 1261 cgcccttcac cacaacccga aggtgtggcc aaacccagag gtgtttgacc ctttcccgtt 1321 cgcaccgggt tctgctcgcc acagccacgc tttcctgccc ttctcaggag gaccacggaa 1381 ctgcatcggg aagcaatttg ccatgaatga gctgaaggtg gccgtggccc tgaccctcgt 1441 gcgcttcgag ctgctgccag atcccaaaag agtcccggac caaaaaccac gtcttgtgct 1501 gaagtccagc aacgggatcc acctgcgtct gaggaagctc cgctaaccct ggtggggaca 1561 agagcaggct ctggggcctt ctgccaggcg tcctggcttc ctgtcacctg cccatgcccc 1621 ctgcctgtct gcccacatcc tgctttctat ccaccagcac ttcttccacc tgtctgcctt 1681 gctgcctctt ggcctccagg ctgtctgtcc tctcgcacct tcctctgggc cactgacctg 1741 tctgtctact gtccgcttcc tgccagcatc tctgaccgtg cacctaaccc // LOCUS RABCYP4A7 1694 bp ss-mRNA MAM 26-JUL-1990 DEFINITION Rabbit cytochrome P450IVA7 (CYP4A7) mRNA, complete cds. ACCESSION M28657 KEYWORDS cytochrome P450; lauric acid omega-hydroxylase. SOURCE Rabbit (strain New Zealand White, adult) kidney, cDNA to mRNA, clone KdB18. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 1694) AUTHORS Johnson,E.F., Walker,D.L., Griffin,K.J., Clark,J.E., Okita,R.T., Muerhoff,A.S. and Masters,B.S. TITLE Cloning and expression of three rabbit kidney cDNAs encoding lauric acid omega-hydroxylases JOURNAL Biochemistry 29, 873-879 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.F.Johnson, 04-OCT-1989. FEATURES from to/span description pept 25 1560 lauric acid omega-hydroxylase BASE COUNT 330 a 564 c 463 g 337 t ORIGIN 1 ggcagatcca gaagctgctg caccatgagc gtgtctgcgc tgagctccac ccggctcccg 61 ggcagcttct ccgggttcct ccaagcggcg gccctgctgg gcctactcct gctgctgctc 121 aaggcagctc agctctacct gcgccgccag tggctgctca gagccctcca gcagttcccg 181 tgcccaccct cccactggct cctggggcac agccgagagt ttccaataga ctcggagctg 241 cagcaggtgc tgaagcgagt ggagaaattc ccaagcgcct gtcctcgctg gctgtggggg 301 agtgagctgt ttctcatttg ctacgaccct gactacatga agacgattct ggggcgatca 361 gacccaaagg ctcgtgtttc ctacagcttc ctggctccct ggattgggta tggcttgctg 421 cttttggaag ggcagacgtg gttccagcac cggcgcatgc tcaccccagc cttccactac 481 gacatcctga agccctacgt ggggctcatg gtggactccg tccaagtgat gctggacaaa 541 ctggagaagc tcgcccgcaa ggacgcgcct ctggagatat acgaacacgt ctccctgatg 601 accctggaaa ccatcatgaa gtgcgccttc agccaccagg gcagcgtcca gctggaaagc 661 aggacctcca aatcctacat ccaggctgtc agggagctca gcgacttggc attgcagcgg 721 gtgaggaacg tctttcacca gagcgacttc ctctacaggc tgagccctga gggccgcttg 781 tcccaccgtg cctgccagct cgcccacgag cacacagacc gagtgatcca gcagaggaag 841 gctcagctgc agcaggaggg ggagctggag aaggtcagga ggaagaggcg cttggacttc 901 ctggacgtcc tcctctttgc caagatggag aacgggagca gcctgtccga ccaggacctc 961 cgcgccgagg tggacacgtt catgttcgag ggccacgaca ccacggccag cggcatctcc 1021 tggatcttct atgccctggc cacgcacccc gagcatcagc accggtgccg cgaggagatc 1081 cagggtctcc tgggggacgg agcctccatc acctgggagc acctggacaa gatgccctac 1141 accaccatgt gcatcaagga ggcgctgaga ctctacccac cggtgccagg tgtcggcagc 1201 aagctcagct cacctgtcac cttccctgat ggacgctccc tccccaaggg catcataatc 1261 acactctcca tctatggcct gcatcacaac ccgaaggtgt ggccaaaccc agaggtgttt 1321 gacccttccc gcttcgcacc gggttctgct cgccacagcc acgctttcct gcccttctca 1381 ggaggatcga ggaactgcat cgggaaacaa tttgccatga acgagctgaa ggtggccgtg 1441 gccctgaccc tcgtgcgctt cgagctgctg ccggatccca ccagagtccc catccccata 1501 acaagacttg tgctgaagtc taagaatggg attcacctac gtctcaggaa gctccactaa 1561 ccctgctgga aacaagaatg gtctgccagg cgtcctctct tcctgtcacc tgcccgtgtc 1621 ccgcactctg tctgtatctt gctttctctc tacctacctg cccttcttcc acctgcctcc 1681 gattcggcct tttg // LOCUS HUMRGIT 1095 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human rRNA gene internal transcribed spacer 1 (ITS1). ACCESSION M36624 KEYWORDS internal transcribed spacer. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1095) AUTHORS Gonzalez,I.L., Sylvester,J.E., Smith,T.F., Stambolian,D. and Schmickel,R.D. TITLE Ribosomal RNA gene sequences and hominoid phylogeny JOURNAL Mol. Biol. Evol. 7, 203-219 (1990) STANDARD simple staff_entry FEATURES from to/span description BASE COUNT 83 a 429 c 441 g 139 t 3 others ORIGIN 1 acggagcccg gagggcgagg cccgcggcgg cgccgccgcc gccgcgcgct tccctccgca 61 cacccacccc cccaccgcga cgcggcgcgt gcgcgggcgg ggcccgcgtg cccgttcgtt 121 cgctcgctcg ttcgttcgcc gcccggcccc gccgccgcga gagccgagaa ctcgggaggg 181 agacgggggg gagagagaga gagagagaga gagagagaga gagagagaga gaaagaaggg 241 cgtgtcgttg gtgtgcgcgt gtcgtggggc cggcgggcgg cggggagcgg tccccggccg 301 cggccccgac grcgtgggtg tcggcgggcg cgggggcggt tctcggcggc gtcgcggcgg 361 gtctgggggg gtctcggtgc cctcctcccc gccggggccc gtcgtccggc cccgccgcgc 421 cggctccccg tcttcggggc cggccggatt cccgtcgcct ccgccgcgcc gctccgcgcc 481 gccgggcacg gccccgctcg ctctccccgg ccttcccgct agggcgtctc gagggtcggg 541 ggccggacgc cggtcccctc ccccgcctcc tcgtccgccc ccccgccgtc caggtaccta 601 gcgcgttccg gcgcggaggt ttaaagaccc cttgggggga tcgcccgtcc gcccgtgggt 661 cgggggcggt ggtgggcccg cgggggagtc ccgtcgggag gggcccggcc cctcccgcgc 721 ctccaccgcg gactccgctc cccggccggg gccgcgccgc cgccgmcgcc gcggcggccg 781 tcgggtgggg gctttacccg gcggccgtcg cgcgcctgcc gcgcgtgtgg cgtgcgcccc 841 gcgccgtggg ggcgggaacc cccgggcgcc tgtggggtgg tgtccgcgct cgcccccgcg 901 tgggcggcgc gcgcctcccc gtggtgtgaa accttccgac ccctctccgg agtccggtcc 961 cgtttgctgt ctcgtctggc cggcctgagg caaccccctc tcctcttggg cggggggggs 1021 ggggggacgt gccgcgccag gaagggcctc ctcccggtgc gtcgtcggga gcgccctcgc 1081 caaatcgacc tcgta // LOCUS MHV1NP 1670 bp ss-RNA VRL 26-JUL-1990 DEFINITION Mouse hepatitis virus nucleocapsid (N-MHV1) RNA, complete cds. ACCESSION M35253 KEYWORDS N protein; RNA binding viral structural protein; nucleocapsid protein. SOURCE Mouse hepatitis virus (strain 1), cDNA to viral RNA. ORGANISM Mouse hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1670) AUTHORS Parker,M.M. and Masters,P.S. TITLE Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Masters, 19-JUN-1990. FEATURES from to/span description pept 1 1368 hepatitis virus nucleocapsid (N-MHV1) ORF 1 pept 65 688 hepatitis virus nucleocapsid (N-MHV1) ORF 2 BASE COUNT 497 a 364 c 426 g 383 t ORIGIN 1 atgtcttttg ttcctgggca agaaaatgcc ggtagcagaa gctcctctgt aaaccgcgct 61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat 121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caattccggg 181 agtgtggttc cccattactc ttggttttcg ggcattaccc aatttcagaa gggaaaagag 241 tttcagtttg cacaaggaca gggagtgcct attgccaacg gaatcccagc ttcagagcaa 301 aagggatatt ggtatagaca caaccgacgg tcttttaaaa cacctgatgg ccagcagaag 361 cagctactgc ccagatggta tttttactat cttggaacag ggccccatgc tggcgcagag 421 tatggcgacg atatcgacgg agttgtctgg gtcgcaagcc aacaggccga cactaagacc 481 actgccgata ttgttgaaag ggacccaagt agccatgagg ctattcctac taggtttgcg 541 cccggtacgg tattgcctca aggtttttat gttgaaggct caggaaggtc tgcacctgct 601 agtcgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagctcc 661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt 721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacaaa gcaaagcgcc 781 aaagaagtca ggcagaaaat tttaaacaag cctcgtcaaa agaggactcc aaacaagcag 841 tgccctgtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa 901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggccccaaca 961 cctagtgcct tcttctttgg atctaaatta gaattggtca aaaagaactc tggtggtgct 1021 gatgacccca ccaaagatgt gtatgaattg cagtattcag gtgcaattag atttgatagt 1081 actctcccag gatttgagac tatcatgaaa gtgttgaatg agaatttgga tgcctaccag 1141 gatcaagctg gtggtgcaga tgtagtgagc ccaaagcccc aaagaaagag agggacaaaa 1201 caaaaggctc tgaaaggtga agtagataat gtaagcgttg caaagcccaa aagctctgtg 1261 cagcgaaatg taagtagaga attaacccct gaggatcgta gtctgttggc tcagatcctt 1321 gatgatggcg ttgtgcctga tgggttagaa gatgactcta atgtgtaaag agaatgaatc 1381 ctatgtcggc actcggtggt aacccctcgc gagaaagtcg ggataggaca ctctctatca 1441 gaatggatgt cttgctgtca taacagatag agaaggttgt ggcagaccct gtatcaatta 1501 gttgaaagag attgcaaaat agagaatgtg tgagagaagt tagcaaggtc ctacgtctaa 1561 ccataagaac ggcgataggc gcccccctgg gaagagctca catcagggta ctattcctgc 1621 aatgccctag taaatgaatg aagttgatca tggccaattg gaagaatcac // LOCUS MHV3NP 1666 bp ss-RNA VRL 26-JUL-1990 DEFINITION Mouse hepatitis virus nucleocapsid (N-MHV3) RNA, complete cds. ACCESSION M35254 KEYWORDS N protein; RNA binding viral structural protein; nucleocapsid protein. SOURCE Mouse hepatitis virus (strain 3), cDNA to viral RNA. ORGANISM Mouse hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1666) AUTHORS Parker,M.M. and Masters,P.S. TITLE Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Masters, 19-JUN-1990. FEATURES from to/span description pept 1 1365 hepatitis virus nucleocapsid (N-MHV3) ORF 1 pept 65 688 hepatitis virus nucleocapsid (N-MHV3) ORF 2 BASE COUNT 494 a 358 c 432 g 382 t ORIGIN 1 atgtcttttg ttcctgggca agaaaatgcc ggtggcagaa gctcctctgg aaaccgcgct 61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat 121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caactccggg 181 agtgtggttc cccattactc ctggttttct ggcattaccc agttccaaaa gggaaaggag 241 tttcagtttg cagaaggaca aggagtgcct attgccaatg gaatccccgc ttcagagcaa 301 aagggatatt ggtatagaca caaccgccgt tcttttaaaa cacctgatgg gcagcagaag 361 caattactgc ccagatggta tttttactat cttggcacag ggccccatgc tggagccagt 421 tatggagaca gcattgaagg agtcttctgg gttgcaaaca gccaagcgga caccaatacc 481 cgctctgata ttgtcgaaag ggacccaagc agtcatgagg ctattcctac taggtttgcg 541 cccggcacgg tattgcctca gggcttttat gttgaaggct ctggaaggtc tgcacctgct 601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagttcc 661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt 721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacgaa gcaaagtgcc 781 aaagaagtca ggcagaaaat tttaaacaag cctcgccaaa agaggactcc aaacaagcag 841 tgcccagtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa 901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggctccaaca 961 gttggtgcct tcttctttgg atctaaatta gaattggtca aaaagaattc tggtggtgct 1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt 1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag 1141 aaggatggtg gtgcagatgt ggtgagccca aagccccaaa gaaaagggcg tagacaggct 1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag 1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatccttgat 1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta 1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa 1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt 1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca 1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcctgcaatg 1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcgc // LOCUS MHVA59NP 1666 bp ss-RNA VRL 26-JUL-1990 DEFINITION Mouse hepatitis virus nucleocapsid (N-MHVA59) RNA, complete cds. ACCESSION M35256 KEYWORDS N protein; RNA binding viral structural protein; nucleocapsid protein. SOURCE Mouse hepatitis virus (strain A59), cDNA to viral RNA. ORGANISM Mouse hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1666) AUTHORS Parker,M.M. and Masters,P.S. TITLE Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Masters, 19-JUN-1990. FEATURES from to/span description pept 1 1365 hepatitis virus nucleocapsid (N-MHVA59) ORF 1 pept 65 688 hepatitis virus nucleocapsid (N-MHVA59) ORF 2 BASE COUNT 497 a 355 c 433 g 381 t ORIGIN 1 atgtcttttg ttcctgggca agaaaatgcc ggtagcagaa gctcctctgg aagccgctct 61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgcgc tggaaataat 121 ggaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caattccggg 181 agtgtggttc cccattactc ttggttttcg ggcattaccc aattccagaa gggaaaagag 241 tttcagtttg tacaaggaca gggagtgcct attgccaatg gaatcccagc ttcagagcaa 301 aagggatatt ggtatagaca caaccgacgt tcttttaaaa cacctgatgg ccagcagaag 361 cagctactgc ccagatggta tttttactat ctcggaacag ggccccatgc tggcgcagag 421 tatggcgacg atatcgaagg agttgtctgg gtcgcaagcc aacaggccga cactaagacc 481 actgccgata ttgttgaaag ggacccaagt agccatgagg ctattcctac taggtttgcg 541 cccggtacgg ttttgcctca gggtttttat gttgaaggct caggaaggtc tgcacctgct 601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagctcc 661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt 721 gttttggcta agctcggtaa agatgccggt cagcccaagc aagtaacaaa gcaaagtgcc 781 aaagaagtca ggcagaaaat tttaaacaag cctcgtcaaa agaggactcc aaacaagcag 841 tgcccagtgc agcaatgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa 901 atgcttaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggccccaaca 961 gctggtgcct tcttctttgg atctaaatta gaattggtca aaaagaactc tggtggtgct 1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt 1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag 1141 aaggatggtg gtgcagatgt agtgagccca aagccccaaa gaaaagggcg tagacaggct 1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag 1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatcctagat 1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta 1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa 1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt 1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca 1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcttgcaatg 1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcac // LOCUS MHVSHV 1666 bp ss-RNA VRL 26-JUL-1990 DEFINITION Mouse hepatitis virus nucleocapsid (N-MHVS) RNA, complete cds. ACCESSION M35255 KEYWORDS N protein; RNA binding viral structural protein; nucleocapsid protein. SOURCE Mouse hepatitis virus (strain S), cDNA to viral RNA. ORGANISM Mouse hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1666) AUTHORS Parker,M.M. and Masters,P.S. TITLE Sequence comparison of the N genes of five strains of the coronavirus mouse hepatitis virus suggests a three domain structure for the nucleocapsid protein JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Masters, 19-JUN-1990. FEATURES from to/span description pept 1 1365 hepatitis virus nucleocapsid (N-MHVS) ORF 1 pept 65 688 hepatitis virus nucleocapsid (N-MHVS) ORF 2 BASE COUNT 494 a 357 c 430 g 385 t ORIGIN 1 atgtcttttg ttcctgggca agaaaatgcc ggtggcagaa gctcctctgt aaaccgcgct 61 ggtaatggaa tcctcaagaa gaccacttgg gctgaccaaa ccgagcgtgg accaaataat 121 caaaatagag gcagaaggaa tcagccaaag cagactgcaa ctactcaacc caactccggg 181 agtgtggttc cccattactc ctggttttct ggcattaccc agttccaaaa gggaaaggag 241 tttcagtttg cagaaggaca aggagtgcct attgccaatg gaatccccgc ttcagagcaa 301 aagggatatt ggtatagaca caaccgccgt tcttttaaaa cacctgatgg gcagcagaag 361 caattactgc ccagatggta tttttactat cttggcacag ggccccatgc tggagccagt 421 tatggagaca gcattgaagg tgtcttctgg gttgcaaaca gccaagcgga caccaatacc 481 cgctctgata ttgtcgaaag ggacccaagc agtcatgagg ctattcctac taggtttgcg 541 cccggcacgg tattgcctca gggcttttat gttgaaggct ctggaaggtc tgcacctgct 601 agccgatctg gttcgcggtc acaatcccgt gggccaaata atcgcgctag aagcagttcc 661 aaccagcgcc agcctgcctc tactgtaaaa cctgatatgg ccgaagaaat tgctgctctt 721 gttttggcta agctcggtaa agatgccggc cagcccaagc aagtaacgaa gcaaagtgcc 781 aaagaagtca ggcagaaaat tttaaacaag cctcgccaaa agaggactcc aaacaagcag 841 tgcccagtgc agcagtgttt tggaaagaga ggccccaatc agaattttgg aggctctgaa 901 atgttaaaac ttggaactag tgatccacag ttccccattc ttgcagagtt ggctccaaca 961 gttggtgcct tcttctttgg atctaaatta gaattggtca aaaagaattc tggtggtgct 1021 gatgaaccca ccaaagatgt gtatgagctg caatattcag gtgcagttag atttgatagt 1081 actctacctg gttttgagac tatcatgaaa gtgttgaatg agaatttgaa tgcctaccag 1141 aaggatggtg gtgcagatgt ggtgagccca aagccccaaa gaaaagggcg tagacaggct 1201 caggaaaaga aagatgaagt agataatgta agcgttgcaa agcccaaaag ctctgtgcag 1261 cgaaatgtaa gtagagaatt aaccccagag gatagaagtc tgttggctca gatccttgat 1321 gatggcgtag tgccagatgg gttagaagat gactctaatg tgtaaagaga atgaatccta 1381 tgtcggcgct cggtggtaac ccctcgcgag aaagtcggga taggacactc tctatcagaa 1441 tggatgtctt gctgtcataa cagatagaga aggttgtggc agaccctgta tcaattagtt 1501 gaaagagatt gcaaaataga gaatgtgtga gagaagttag caaggtccta cgtctaacca 1561 taagaacggc gataggcgcc ccctgggaag agctcacatc agggtactat tcttgcaatg 1621 ccctagtaaa tgaatgaagt tgatcatggc caattggaag aatcac // LOCUS MXAFRZGF 2999 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.xanthus frzG and frzF genes, complete cds. ACCESSION M35200 KEYWORDS FrzF protein; FrzG protein; methylesterase; methyltransferase. SOURCE M.xanthus (strain DZF1) DNA. ORGANISM Myxococcus xanthus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; Myxococcaceae. REFERENCE 1 (bases 1 to 2999) AUTHORS McCleary,W.R., McBride,M.J. and Zusman,D.R. TITLE Developmental sensory transduction in Myxococcus xanthus involves methylation and demethylation of FrzCD JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.R.McCleary, 15-JUN-1990. FEATURES from to/span description pept 120 1124 FrzG protein pept / 1117 2899 FrzF protein (AA at 2) BASE COUNT 383 a 990 c 1134 g 492 t ORIGIN 1 ggatgccggc gcggacgcgt acctcgtcaa gggcgagctg ggcgtggagg ttctcgcgca 61 ggccatcgac cggctgacct gaggagccag gcttgggcgg tggcgcggta gtcgcaggaa 121 tggcgtttcg ggtgctcatg gtgggcaagg ggctgcgtgc gctcgcggcc cggggcctgt 181 tcgatgggga atccctggtg cccgtggggc cggcggaggt ggacttcgcc ggcgccctgg 241 tggccgtgca gcggcacttc ccggacgtgg tgctggtgga cctgagcgcg ctggacgcgc 301 tgcccgccat cgagcacgtc atggtggagc ggcccgtgcc ggtgctggcg ttgcaccccg 361 gcgtgttgtc cggccaggag gccttccagg cgatggtggc gggcgcgctg gacgtgctgg 421 agcgtccggc gaaccccggg cccgagttct ggacgcacgt gtcgcgcaag ctggtgctgc 481 tggcgcaggt gaaggcggtg cggcaggtgc agacgcggcc gccaccgcaa gcggcgcgtg 541 aggcgaagcc gcctcctccg tatccgctgg tggccatcgc cgcgtccctg ggtggcccca 601 aggcggtggc gcaggtgctg cggatgattc cgcgcgcctt cccggcgccc atcgcctact 661 gccagcacat cagcgacggt ttcacggaag ggctggcgca ctggttgtcc aatgaaacgg 721 cgctgcgcgt gctggaggcc gagcatgacg tgctcatggc gccgggcacg gtgtacatcg 781 ctccgtcggg cagtcacctc ttggtccgac ccgagggcag gttggagctg gacgcgggcc 841 ccgcgcttcg cggtttccgg ccgtcctgtg acatgctgct gacttcagcg ggtgagtcgt 901 tcggcccgcg ctgcatcggg gtcatcctga cgggcatggg gcgcgacggg gcgcgagggt 961 tgaaggagat tcgagagcgc ggcggtcgga ccattgccca ggacgaagcg tcgagcgtcg 1021 tctggggcat gccgcgcgag gcggtgttga tgggcgcggc gcacgaggtg ctgccactga 1081 gccggattgg cgcggcgctg atgcagtggg tggatgtgtg ttgacggcga gccagaaagt 1141 cttgcaacaa ctcgcggcgc tgctgctgga gcgcgcgggg ctgaaaatca cgccggatgg 1201 cttccacagc ctccgactgg cgctgtccgc gcggatgccc gtgctggggc tggaagagcc 1261 cgagcactac atccagcgac tgacgggcgc cggtggcgaa gaggagctgc gctcgctgtt 1321 gccgctggtg acggtggggc acacggagtt cttccgcgac gcgaagcagt tccgcgcgct 1381 ggagaagagc gtgctgccgg acctggtgtc ccgttcgcgg cgcgagatgc gcaaggtgtc 1441 catctggtcc gcgggctgcg cgacggggga ggagccctac agcctggcca tggtgctggc 1501 ggagctgggc gcgctgtcgc tggaggtgga cctgtgggcc accgacctca acctggccgc 1561 ggtggaggcc gcgaagcagg ggcgcttcac ctcgcggcgg gccatcagca tcaaccaggc 1621 gcggctgacg cgcttcttca agcccgtgga agagggctat gaggcgctgc ccgcgctgcg 1681 tgagtacatc cgcttcgatg gacagaacct ggcggttccc gtcttcgaca aggtggccct 1741 gtcgtcgctg gacctcatcc tctgccgcaa cgtcatcatc tacttcgacc tgcccaccat 1801 ccgcgggttg atggaccgct tcctcgccgc gctgcggccg ggcgggctgt tgttcctggg 1861 gtactcggag agcctcttca aggtctacga ccgcttcgag atgatcgaag tcgatggggc 1921 gttcgtgtac cgccgcccgc tgaacgacaa gagcatgcgg gcgccgccgc tgcgcatcac 1981 cccgtatcct ggcgagcccg atgtcgccgc gcgcaggccc gtgcctgcgg acgcgttcac 2041 cgcggacctg cgcaagcgga tgctgcccga ggacgtcccg ttgacgacgc ggctgcccgc 2101 ggtgtcagcg tcgtcggtgg cggcgcctgg ctcgcccagc gtgacgctgc cggcgctggg 2161 ggcctcttcg agtccgcgtt ccgtggtgcc ggggcggctg cccgcggtgt cgcctcactc 2221 gccgctgccg gccatcgccg cgcgctcgcg tgtcaccgcg gagttgccca cggtgggaag 2281 cgtggactcc gcccgtccgc gcatcaccac cgagctgccg gccgtggcca ccacgccgcg 2341 cgcgcccacc gtggaggtgc ccgcctggcc cacgctgctg cctccggcgg agcggctggc 2401 catggcggtg cggaagatgg cgcaggggga tttctcggcg gccatcgctg gcgtgcagcg 2461 gctgctcgcg gacgagccca gtgacttgga tgggctgctg acgctgggca acctgttctc 2521 gctcaccggc cgcatccccg aggcgcgcga ggccttcgcg caggccattc agcgcgagcc 2581 gctgtgcgtg gaggcgcggg tgttcggcgg ggtcgccgcg ttgcaagcag gggagttgag 2641 cgaggcgcgc tccgagctga gcaaggccct gttcctggag cccacgctgg ccattggcca 2701 ctacctgctg gcgcaggtgc acgagcgcac gcaggaccat gaggcggccc gccgcagcta 2761 ccgcaacgcc attgcccagc ttcgcttccc gcagcgtccc ctcgcggggc actacccgga 2821 gatgccggac tcggcggatg ccatctctcg cgcggcgcgt tacgccctgg ccgcgctgga 2881 ggagcagccc ctgcgctgag gcaggggccg cgtcccaggc ttcacgtcag tccaggctgc 2941 tcttcacctg gtccaggctc ttgctcgggt cgagcacgga gccgaacttc ttctgcagg // LOCUS ECOSFIM 762 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli S-fimbrial protein (sfaA) gene, complete cds. ACCESSION M35273 KEYWORDS S-fimbrial protein. SOURCE E.coli (strain 536) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 762) AUTHORS Schmoll,T., Hacker,J. and Goebel,W. TITLE Nucleotide sequence of the sfaA gene coding for the S-fimbrial protein subunit of Escherichia coli JOURNAL FEMS Microbiol. Lett. 41, 229-235 (1987) STANDARD simple staff_review FEATURES from to/span description pept 166 708 S-fimbrial protein precursor sigp 166 237 S-fimbrial protein signal peptide matp 238 705 S-fimbrial protein BASE COUNT 218 a 140 c 172 g 232 t ORIGIN 1 gaaaatatta tcggagataa tgtcataaat gctgcctgag tgtatttctc acattgcatt 61 tatgaagttc tcctgaaaaa agattcccgt cgttcgggat attgattgtg tctgttgtga 121 tgacagatac ggtgtgcgta gttcaattaa aaacaggaat taaatatgaa gttaaaattc 181 atctccatgg ctgtattttc agccctgacc ttgggtgttg cgacaaatgc gtctgctgtc 241 accacggtta atggtggtac agttcatttt aagggggaag ttgttgatgc tgcatgtgct 301 gtaaacacta attcagcaaa tcaaacgttt tctgggcaag ttcgttcagc taagttggcg 361 aatgatggag agaagagttc ccctgttgga tttagtattg aacttaatga ctgtagttct 421 gcaactgccg ggcatgcatc aattatcttt gcaggaaatg ttattgctac acacaatgat 481 gtgctgtctc tacagaatag tgctgcaggt agtgcaacaa atgtaggtat tcagatattg 541 gatcatacag gtactgcagt tcaatttgac ggagtgactg catctacaca atttacatta 601 acagatggca ccaataaaat tcctttccag gcagtttatt atgcaacagg taagtcaacg 661 cctggtattg ccaacgccga cgccaccttt aaagttcagt accagtaata tcagaacagt 721 gtaacgatat atacccggcc aggagggctg tttttatcat gc // LOCUS ECOSRNB 655 bp ds-DNA BCT 26-JUL-1990 DEFINITION F plasmid (from E.coli) stable RNA degradation promoter (srnB) gene, complete cds. ACCESSION M35279 KEYWORDS . SOURCE F plasmid (from E.coli) DNA. ORGANISM Plasmid F Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 655) AUTHORS Akimoto,S., Ono,K., Ono,T. and Ohnishi,Y. TITLE Nucleotide sequence of the F plasmid gene srnB that promotes degradation of stable RNA in Escherichia coli JOURNAL FEMS Microbiol. Lett. 33, 241-245 (1986) STANDARD simple staff_review FEATURES from to/span description pept 251 457 stable RNA degradation promoter (srnB) signal 127 132 -35 region signal 150 155 -10 region signal 466 485 transcription termination signal (put.) binding 242 245 ribosome binding site BASE COUNT 163 a 156 c 173 g 163 t ORIGIN 1 aattcccatt ctggaccagc gggagcatac gaacaataat ttacggtttc gcgctatagc 61 tggctcaagt taggttggac cctgaatctc cagacaacca atatctgatc gcgccagtgg 121 tggcagttat taagcaacag ggaatgtggt attatcgcgg cgggtgtctg agcctttctg 181 gttcaggcaa gacgcaggta ccagaaatgc gaagacccca cttgttaatc cattaactcg 241 tgaggtctgc atgaagtacc ttaacactac tgattgtagc ctcttccttg cagagaggtc 301 aaagtttatg acgaaatatg cccttatcgg gttgctcgcc gtgtgcgcta cggtgttgtg 361 tttttcactg atattcaggg aacggttatg tgagctgaat attcacaggg gaaatacagt 421 ggtgcaggta actctggcct acgaagcacg gaagtaagct gccgggcggg gacggaagtc 481 cccgctttcc ggaagtgtga ggtatttcag gggcagacac ccgacatgcc agaaacagcc 541 ggtcccgccc ggggccggca cccaggttca ggcatttcct gcttttcagt catttcatta 601 tcaaaatcac attaaacggt cgtaatcaga catgatttgt gcgccaacac agatc // LOCUS HUMTIMP2 1062 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human metalloproteinase-2 inhibitor (TIMP-2) mRNA, complete cds. ACCESSION J05593 KEYWORDS metalloproteinase-2 inhibitor. SOURCE Human melanoma cell line A2058, cDNA to mRNA, clone pT2-M01. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1062) AUTHORS Stetler-Stevenson,W.G., Brown,P.D., Onisto,M., Levy,A.T. and Liotta,L.A. TITLE Tissue inhibitor of metalloproteinases-2 (TIMP-2) mRNA expression in tumor cell lines and human tumor tissues JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by W.G.Stetler-Stevenson, 19-JUN-1990. FEATURES from to/span description pept 271 933 metalloproteinase-2 inhibitor precursor sigp 271 348 metalloproteinase-2 inhibitor signal peptide matp 349 930 metalloproteinase-2 inhibitor BASE COUNT 222 a 370 c 316 g 154 t ORIGIN 1 ggggccgccg agagccgcag cgccgctcgc ccgccgcccc ccaccccgcc gccccgcccg 61 gcgaattgcg ccccgcgccc tcccctcgcg cccccgagac aaagaggaga gaaagtttgc 121 gcggccgagc gggcaggtga ggagggtgag ccgcgcggag gggcccgcct cggccccggc 181 tcagcccccg cccgcgcccc cagcccgccg ccgcgagcag cgcccggacc ccccagcggc 241 ggccccgccc gcccagcccc ccggcccgcc atgggcgccg cggcccgcac cctgcggctg 301 gcgctcggcc tcctgctgct ggcgacgctg cttcgcccgg ccgacgcctg cagctgctcc 361 ccggtgcacc cgcaacaggc gttttgcaat gcagatgtag tgatcagggc caaagcggtc 421 agtgagaagg aagtggactc tggaaacgac atttatggca accctatcaa gaggatccag 481 tatgagatca agcagataaa gatgttcaaa gggcctgaga aggatataga gtttatctac 541 acggccccct cctcggcagt gtgtggggtc tcgctggacg ttggaggaaa gaaggaatat 601 ctcattgcag gaaaggccga gggggacggc aagatgcaca tcaccctctg tgacttcatc 661 gtgccctggg acaccctgag caccacccag aagaagagcc tgaaccacag gtaccagatg 721 ggctgcgagt gcaagatcac gcgctgcccc atgatcccgt gctacatctc ctccccggac 781 gagtgcctct ggatggactg ggtcacagag aagaacatca acgggcacca ggccaagttc 841 ttcgcctgca tcaagagaag tgacggctcc tgtgcgtggt accgcggcgc ggcgcccccc 901 aagcaggagt ttctcgacat cgaggaccca taagcaggcc tccaacgccc ctgtggccaa 961 ctgcaaaaaa agcctccaag ggtttcgact ggtccagctc tgacatccct tcctggaaac 1021 agcatgaata aaacactcat cccatgggtc caaattaata tg // LOCUS ALREV1 717 bp ss-RNA VRL 26-JUL-1990 DEFINITION Rous sarcoma defective endogenous virus ev-1 locus gag polyprotein RNA, 5' end. ACCESSION M30517 KEYWORDS gag polyprotein. SOURCE Rous sarcoma defective endogenous virus (strain Prague C), cDNA to viral RNA, clone pGD27. ORGANISM Rous sarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian sarcoma viruses. REFERENCE 1 (bases 1 to 717) AUTHORS Vogt,V.M., Pepinsky,R.B. and Southard,L.E. TITLE Primary structure of p19 species of avian sarcoma and leukemia viruses JOURNAL J. Virol. 56, 31-39 (1990) STANDARD full staff_review FEATURES from to/span description pept 1 > 717 gag polyprotein matp 1 465 p19 protein matp 466 531 p2 protein matp 532 717 pp10 protein BASE COUNT 161 a 173 c 246 g 137 t ORIGIN 1 atggaagccg tcataaaggt gatttcgtcc gcgtgtaaaa cctattgcgg gaaaacctct 61 ccttctaaga aggaaatagg ggccatgttg tccctgttac aaaaggaagg gttgcttatg 121 tctccctcag acttatattc cccggggtcc tgggatccca ttaccgcggc gctctcccag 181 cgggcaatgg tacttgggaa atcgggagag ttaaaaacct ggggattggt tttgggggca 241 ttgaaggcgg ctcgagagga acaggttaca tctgagcaag caaagttttg gttgggatta 301 gggggaggga gggtctctcc cccaggtccg gagtgcatcg agaaaccagc aacggagcgg 361 cgaatcgaca aaggggagga agtgggagaa acaactgcgc agcgagatgc gaagatggcg 421 ccggagaaaa tggccacacc taaaaccgtt ggcacatcct gctatcagtg cggaacagct 481 actggctgta attgcgccac agcctcggcc cctcctcctc cttatgtggg gagtggtttg 541 tatccttccc tggcgggggt gggagagcag cagggccagg ggggtgacac accttggggg 601 gcggaacagc caagggcgga gccagggcac gcgggtctgg cccctgggcc ggccctgact 661 gactgggcaa ggatcaggga ggagcttgcg agtactggtc cgcccgtggt ggccatg // LOCUS ALREV2 564 bp ss-RNA VRL 26-JUL-1990 DEFINITION Rous sarcoma endogenous virus ev-2 locus gag polyprotein RNA, partial cds. ACCESSION M30518 KEYWORDS gag polyprotein. SOURCE Rous sarcoma endogenous virus (strain Prague C), cDNA to viral RNA, clone pAS2. ORGANISM Rous sarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian sarcoma viruses. REFERENCE 1 (bases 1 to 564) AUTHORS Vogt,V.M., Pepinsky,R.B. and Southard,L.E. TITLE Primary structure of p19 species of avian sarcoma and leukemia viruses JOURNAL J. Virol. 56, 31-39 (1990) STANDARD full staff_review FEATURES from to/span description pept < 1 > 564 gag polyprotein (AA at 1) matp < 1 312 p19 protein matp 313 378 p2 protein matp 379 564 pp10 protein BASE COUNT 123 a 138 c 207 g 96 t ORIGIN 1 gatcccatta ccgcggcgct ctcccagcgg gcaatggtac ttgggaaatc gggagagtta 61 aaaacctggg gattggtttt gggggcattg aaggcggctc gagaggaaca ggttacatct 121 gagcaagcaa agttttggtt gggattaggg ggagggaggg tctctccccc aggtccggag 181 tgcatcgaga aaccagcaac ggagcggcga atcgacaaag gggaggaagt gggagaaaca 241 actgtgcagc gagatgcgaa gatggcgccg gaggaaacgg ccacacctaa aaccgttggc 301 acatcctgct atcattgcgg aacagctatt ggctgtaatt gcgccacagc ctcggcccct 361 cctcctcctt atgtggggag tggtttgtat ccttccctgg cgggggtggg agagcagcag 421 ggccaggggg gtgacacacc tcggggggcg gaacagccaa gggcggagcc agggcacgcg 481 ggtctggccc ctgggccggc cctgactgac tgggcaagga tcagggagga gcttgcgagt 541 acaggtccgc ccgtggtggc catg // LOCUS HAMCHO1 1953 bp ss-mRNA ROD 26-JUL-1990 DEFINITION C.griseus intracisternal A-particle retrovirus like sequences. ACCESSION M34949 KEYWORDS p27 protein; pseudogene. SOURCE C.griseus adult ovary, cDNA to mRNA, clone CHIAP.SW2. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1953) AUTHORS Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H., Nguyen,T.P. and Wurm,F.M. TITLE Presence and transcription of intracisternal A-particle-related sequences in CHO cells JOURNAL J. Virol. 64, 2021-2032 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps 2 277 IAP p27 homologue 277 564 IAP p27 homologue mRNA < 1 1953 p27 (pot.) mRNA BASE COUNT 485 a 473 c 509 g 486 t ORIGIN 1 ctttactctt acacaattgg atagacttgc cctaaatgcc ttgacgccat ctgactggca 61 gatggtcaca aaagctgcgc ttgtcagcat gggccaatac atggagtgga aagcactctg 121 gcatgaggcc gcccaagagc aggccagagc taacgcgacg gccttaactc ctgagcaaca 181 actatggaca ttcgacctgt taacgggcca gggtcgtttt gcagctgatc aaacaaatta 241 tcattggggc gcttatccac aaatcgacaa cgcggcatta gggcctgaaa ggtgctctcc 301 aagaaaggag gggttgacaa tcagcttact aaaatcattc aaggaaccca ggagactttc 361 tccgattttg tagcaaggat gacagaggca gcgggatgga tctttggcga tcctgagcag 421 gccgcacctc ttgttgagca acttatcttt gaacaggcct cccaagaatg tcgcgcagct 481 atagccccga gaaaaaacaa aggattacaa gattggctta gggtctgtag agaacttggg 541 ggacccctta ctaatgcagg gttagctact gccatcctac agtctcaaaa gcgccccctt 601 aaggggccag ataaaagaac ttgctttaga tgtggaacaa ttggacatat tatggcagat 661 ggcccaacta ggctgtgagc agaagctccc cggcctatat gtcacctcca tccaatatga 721 aaattttacc aaagcagcta atttgtctaa aagcctttct cagttcatgt tacagaattg 781 gacctccaaa tttgagcaaa cgcttcggga gttgagagcc gctattatcc agattaactc 841 cacgcgcctt gacctgtcct tgacggaggg attgtcatca tggatcgctt cgactgtctc 901 ctattttaag gaatgggtgg gggtgggatt gtttggtgca gccgtttgct gcggattggt 961 gttgcttcta tggctggtct gtaggctcag ggctcaaact aagagagaca aggtggttat 1021 cgcccaagcg cttgtagctt tggaacaagg ggcttccact gacatttggt taacaatact 1081 taagcaatag gcgctggcca gacagctctt gcacacccgg agcctaggct cattgcacag 1141 ggtagagtgt ctggcttgag cagcccatga gggaatgtgg agcaaggcat cgcacagaag 1201 agttgcccag tatgcaggct tctctgggag gcatgttgtc ctgcataagg gttgcctgcc 1261 ctagtctccc tttcccagaa aacggcagag gacaggtcga gagcgcttcg ggtcaagcta 1321 acagcctaat ggcgactctc gtacacagtc ttaatgtttg attgggaagg tacaacctct 1381 gcctctatcc ctcaacatat gggtgaccta tttgcttgta aaaatatgta agccttatca 1441 ttaattaata aaaaagggga gatgtaggga gccgtccctg cattctctat tacaagatgg 1501 cgcctgcatc cggcaggcac cgaatggtaa acaagttaat gcgcaggtgc tgggtaactt 1561 tccatccctt ggtctctgcc tctcccgtgg cgtcatatgg tccgatgagc tgcagccagt 1621 cagggggtga cacgtccgag gcggtggttg ccagcctata taagggatgg gtttttggga 1681 gttcggggtc tctgctctgt aagcttatgc tctccctctc aagatgcatt aaagctttac 1741 tacagaagga tcctgaatgt cctgcgtcat tcttgctggc gagacggtag cgcgggacag 1801 atggtgacag ccggtgcaga aagtgtcaac ctcagcttcc ttctccagga agacttcagc 1861 ctgggactgc tcctctacag agccccctac caagattatc taacctgcct gccttcttgt 1921 tgagctgtgt gtaataaact cattgagttt ccc // LOCUS HAMCHO2 1570 bp ss-mRNA ROD 26-JUL-1990 DEFINITION C.griseus intracisternal A-particle retrovirus like sequences. ACCESSION M34950 KEYWORDS p27 protein; pseudogene. SOURCE C.griseus adult ovary, cDNA to mRNA, clone CHIAP.LY6. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1570) AUTHORS Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H., Nguyen,T.P. and Wurm,F.M. TITLE Presence and transcription of intracisternal A-particle-related sequences in CHO cells JOURNAL J. Virol. 64, 2021-2032 (1990) STANDARD full staff_review FEATURES from to/span description pept.ps 26 694 IAP p27 homologue BASE COUNT 418 a 368 c 406 g 378 t ORIGIN 1 aaaaagaaag ctgggcctcg cttttcccat ctttgagggc attgagggag agtgtatgca 61 tgcacccatg gagtataatc agataaaaga attggcagaa tcagtcagga aatatggagt 121 cacagccaac tttactctta cacaattgga tagacttgcc ctaaatgcct tgacgccatc 181 tgactggcag atggtcacaa aagctgcgct tgtcagcatg ggccaataca tggagtggaa 241 agcactctgg catgaggccg cccaagagca ggccagagct aacgcgacgg ccttaactcc 301 tgagcaacaa ctatggacat tcgacctgtt aacgggccag ggtcgttttg cagctgatca 361 aacaaattat cattggggcg cttatccaca aatcgacaac gcggccatta gggcctgaaa 421 ggtgctctcc aagaaaggag gggttgacaa tcagcttact aaaatcattc aaggaaccca 481 ggagactttc tccgattttg tagcaaggat gacagaggca gcgggatgga tctttggcga 541 tcctgagcag gccgcacctc ttgttgagca acttatcttt gaacaggcct cccaagaatg 601 tcgcgcagct atagccccga gaaaaaacaa aggattacaa gattggctta gggtctgtag 661 agaacttggg ggacccctta ctaatgcagg gttagctact gccatcctac agtctcaaaa 721 gcgccccctt aaggggccag ataaaagaac ttgctttaga tgtggaacaa ttggacatat 781 tatggcagat ggcccaacta ggctgtgagc agaagctccc cggcctatat gtcacctcca 841 tccaatatga aaattttacc aaagcagcta atttgtctaa aagcctttct cagttcatgt 901 tacagaattg gacctccaaa tttgagcaaa cgcttcggga gttgagagcc gctattatcc 961 agattaactc cacgcgcctt gacctgtcct tgacggaggg attgtcatca tggatcgctt 1021 cagctgtctc ctattttaag gaatgggtgg gggtgggatt gtttggtgca gccgtttgct 1081 gcggattggt gttgcttcta tggctggtct gtaggctcag ggctcaaact aagagagaca 1141 aggtggttat cgcccaagcg cttgtagctt tggaacaagg ggcttccact gacatttggt 1201 taacaatact taagcaatag gccgctggcc agacagctct tgcacacccg gagcctaggc 1261 tcattgcaca gggtagagtg tctggcttga gcagcccatg agggatgtgg agcaaggcat 1321 cgcacagaag agttgcccag tatgcaggct tctctgggag gcatgttgtc ctgcataagg 1381 gttgcctgcc ctagtctccc tttcccagaa aaacggcaga ggacaggtcg agagcgcttc 1441 gggtcaagct aacagcctaa tggcgactct cgtacacagt cttaatgttt gattgggaag 1501 gtacaacctc tgcctctatc cctcaacata tgggtgacct atttgcttgt aaaaatatga 1561 agccttatca // LOCUS HAMCHO3 2186 bp ss-mRNA ROD 26-JUL-1990 DEFINITION C.griseus intracisternal A-particle retrovirus like sequences. ACCESSION M34951 KEYWORDS protease; pseudogene. SOURCE C.griseus adult ovary, cDNA to mRNA, clone CHIAP.YL[7,9]. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2186) AUTHORS Anderson,K.P., Lie,Y.S., Low,M.-A.L., Williams,S.R., Fennie,E.H., Nguyen,T.P. and Wurm,F.M. TITLE Presence and transcription of intracisternal A-particle-related sequences in CHO cells JOURNAL J. Virol. 64, 2021-2032 (1990) STANDARD simple staff_review FEATURES from to/span description pept.ps 553 1281 IAP protease homologue BASE COUNT 649 a 431 c 518 g 588 t ORIGIN 1 gcaataactc catataaggg taaaggcctt gaagtctgga tgaaagtctg tagggagtta 61 gggggtccgc tgactaatgc tggactagca gctgctgtgt tgcaattaac taagaaaggt 121 ggaggttcag gagcttgctt taaatgcggc aagcaagggc atttgaaaaa gcaatgcccc 181 gagggaggaa acactaaagt caataaactt tgctccgcgc cctaagcaac ctggcttatg 241 tcctagatgt agaaaaggaa atcattgggc taaggattgt agatcagtaa aagacatcag 301 tggacagcct cttgttcagg ggtatggagg agcccgttca aaaaacggac gacggggccc 361 acgaccccag ggcccacaaa tatatggggc catggaggat cagaaccagg agcagagtcc 421 cgaaacctgg ccctctcttc gtcatccgag ggaccgagga gagccactac aggctccgcg 481 gggctggact tacgctccac caccagactc gtattaactc ccagaatggg ggtccagctt 541 gttgacaccg attttaaggg accccttgag cctggcacag taggtttgct tataggaaga 601 tcatctgcag cattgaaagg tttacgagta catcctggag ttatagatcc tgattacatg 661 ggtgtagtaa agatcatggt agaatctcct agagggatta cggccatttc tcctggagac 721 aggatagcac agttactgct tttgccaagc ttgcatgaca agtttccagc acaagccaga 781 gagagaggag agggaaactt tggctccact ggatcaaact taactttcct agctttagac 841 cttgatcaac gtccaaccct tgagttaata gtgaatggta agaaaatctt aggcttacta 901 gattctggag ctgataagag catcatagcc actaaagatt ggccctctgg ctggcctata 961 caggtttctt ctcaaagttt acaaggttta ggctatgcta aggctcctga tatgagtgct 1021 agacaattgc cttggaaaga tcaggaaggg cattcaggga ccatgcaacc ttatgtgtta 1081 gacttaccaa tttcattatg gggaagagat ttgttaaagg atatgggttt taaactcaca 1141 aatgaatact cagaaacatc tcaaggtatc atgaaacgaa tgggatacag tcccaggcca 1201 ggcctcggga aacatctgca gggtcgtacc agtcctatta attccacaat tgagaccaaa 1261 gaatctaggt ctgggttttt cctagggcca ctgaggaggt attcctatta cctggaaaac 1321 agaggagccg gtatgggttc ctcagtggcc actttcctct gagaaactgg aagctgctaa 1381 gactctagtg cgggagcagc tggatctggg gcatataaaa tcctctgtat ctccatggaa 1441 tactcctatt tttgtcatta agaaaaaatc tggtaaatgg agactgcttc acgatcttag 1501 agctattaat caacagatgc aaattatggg ccctgtacaa cgtggtcttc cacttttaac 1561 ttctttacct gcatcatggc ctatcatctc tatagatatt aaagattgct tcttttccat 1621 acctttgtgt gccaaggatt cagggcgttt tgcgtttacg ctgccctctt gtaatcatga 1681 acaacctgat ttaaggtatg aatgggatag tgttggccac aggggatggc caatagtcct 1741 actatgtgtc agttgtttgt agcagaagca attgctcctt ttgagagtgg actttcccaa 1801 agattagatg tgttcattat atggatgata ttttattggc tgccaaagat gataaaacgc 1861 ttaataaggc atatacaaaa ttggtaaaat tgcttgagat gcataattta gtcatagcct 1921 cagaaaaggt acaaaaggac actgttgtta actatctagg ggctaagatt ctccctcata 1981 caattattcc acaaaagata gagattagaa aagataattt aaaaactctt aatgattttc 2041 aaaagttgtt gggagatata aattggataa gatgttattt aaaattacca aattatgagt 2101 tgaagccatt gtataatatt ctcaatggtg attcagcatt agattcacct aggcagttaa 2161 ctgctgaagc cagagaagct ttaaag // LOCUS HUMCHRM 2098 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human muscarinic acetylcholine receptor gene, complete cds. ACCESSION M35128 Y00508 KEYWORDS muscarinic acetylcholine receptor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2098) AUTHORS Allard,W.J., Sigal,I.S. and Dixon,R.A.F. TITLE Sequence of the gene encoding the human M1 muscarinic acetylcholine receptor JOURNAL Nucleic Acids Res. 15, 10604-10604 (1987) STANDARD simple staff_review FEATURES from to/span description pept 449 1831 muscarinic acetylcholine receptor BASE COUNT 458 a 662 c 570 g 408 t ORIGIN 1 agtatagctt ataagtggat gaatgcttga gaagttgcag attatacaaa gtagttccca 61 actcctgcaa cccagtatgt aagatagaat tgtagttaat ttcccagtaa gaaaatgagc 121 ctgagtctga aaggtaaaac tgaatgaagt attcaaaccc tggatcccaa agccactcca 181 cgctgctggc aaatccactt atggctggga aagtgccact gcataaatga ccatgagtgg 241 gcaccggtaa gggagggtga tgctatctgg tctgaagctc tgaagggcaa gaattacatc 301 ccatgcatct tccaataagg tctatcagaa atgtccagtg gcccaaccaa agcccatgtc 361 ctctctttta ggtgatgact ttcccctgag gaagccctgt agcgtgcctg gaggaagggg 421 tctccaaccc cagccccacc tagccaccat gaacacttca gccccacctg ctgtcagccc 481 caacatcacc gtcctggcac caggaaaggg gccctggcaa gtggccttca ttgggatcac 541 cacgggcctc ctgtcgctag ccacagtgac aggcaacctg ctggtactca tctccttcaa 601 ggtcaacacg gagctcaaga cagtcaataa ctacttcctg ctgagcctgg cctgtgctga 661 cctcatcatc ggtaccttct ccatgaacct ctataccacg tacctgctca tgggccactg 721 ggctctgggc acgctggctt gtgacctctg gctggccctg gactatgtgg ccagcaatgc 781 ctccgtcatg aatctgctgc tcatcagctt tgaccgctac ttctccgtga ctcggcccct 841 gagctaccgt gccaagcgca caccccgccg ggcagctctg atgatcggcc tggcctggct 901 ggtttccttt gtgctctggg ccccagccat cctcttctgg cagtacctgg taggggagcg 961 gacagtgcta gctgggcagt gctacatcca gttcctctcc cagcccatca tcacctttgg 1021 cacagccatg gctgccttct acctccctgt cacagtcatg tgcacgctct actggcgcat 1081 ctaccgggag acagagaacc gagcacggga gctggcagcc cttcagggct ccgagacgcc 1141 aggcaaaggg ggtggcagca gcagcagctc agagaggtct cagccagggg ctgagggctc 1201 accagagact cctccaggcc gctgctgccg ctgctgccgg gcccccaggc tgctgcaggc 1261 ctacagctgg aaggaagaag aggaagagga cgaaggctcc atggagtccc tcacatcctc 1321 agagggagag gagcctggct ccgaagtggt gatcaagatg ccaatggtgg accccgaggc 1381 acaggccccc accaagcagc ccccacggag ctccccaaat acagtcaaga ggccgactaa 1441 gaaagggcgt gatcgagctg gcaagggcca gaagccccgt ggaaaggagc agctggccaa 1501 gcggaagacc ttctcgctgg tcaaggagaa gaaggcggct cggaccctga gtgccatcct 1561 cctggccttc atcctcacct ggacaccgta caacatcatg gtgctggtgt ccacgttctg 1621 caaggactgt gttcccgaga ccctgtggga gctgggctac tggctgtgct acgtcaacag 1681 caccatcaac cccatgtgct acgcactctg caacaaagcc ttccgggaca cctttcgcct 1741 gctgctgctt tgccgctggg acaagagacg ctggcgcaag atccccaagc gccctggctc 1801 cgtgcaccgc actccctccc gccaatgctg atagtcccct ctcctgcatc cctccacccc 1861 agtccccggg aaaaggccgg tcggaagagg gcaggggctg catcctcagc cccagggccc 1921 tgctcaggcc tcacctggct tcccaggacc ctgggtcacc ttcctgggca gcccagagag 1981 acctgccaac tttccagact tcgctattcc caggcaggga gggaaacccg gggaactggt 2041 ttttctgttc cctgctgggt gggaatgcgc tcttcacagg aagaaggccc gggaggag // LOCUS MVOTRPBA 2874 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.voltae tryptophan synthase operon (trp) genes, complete cds. ACCESSION M35130 KEYWORDS tryptophan synthase. SOURCE M.voltae (PS DSM 1537) DNA. ORGANISM Methanococcus voltae Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanococcales; Methanococcaceae. REFERENCE 1 (bases 1 to 2874) AUTHORS Sibold,L. and Henriquet,M. TITLE Cloning of the trp genes from the archaebacterium Methanococcus voltae: Nucleotide sequence of the trpBA genes JOURNAL Mol. Gen. Genet. 214, 439-450 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 206 tryptophan synthase F (AA at 3) pept 304 1533 tryptophan synthase B pept 1571 2425 tryptophan synthase A pept 2460 2600 ORF 46 pept > 2874 2666 (c) ORF 68 (AA at 2872) BASE COUNT 1046 a 381 c 569 g 878 t ORIGIN 1 gggttgcgga aactcatgac catagagtta gcgaaattat ctccaaaaaa tttgatgtcg 61 tacttgcagg cggtataact tttgaaaacg tgagaaaaat tgtaaattcc gtaaaacccg 121 ttggaattga tgtttctagt ggcgttgagt taaacaacag aaaaaacgaa ttattaataa 181 aaaagatttg tcataatttg atttaattag aattaattag aattaatcga attttaacta 241 attaaaaatt ataggttatt aaattatgac taaatacagt atatgtaaaa ttaaggtgaa 301 attatgaaat gtaatacaaa atgtgacaaa aatggatatt ttggggaatt tgggggtcaa 361 tatatacctg aagttttaaa accggctgtt gaagagctta aagaagccta taaagagtta 421 aaagatgacg aagactttca aaatgagctt gcatactatt taaaacatta tgcaggacgt 481 gaaactcccc tatattatgc aaaaaacttg actgaaaaac ttggtggtgc caaaatctac 541 ctaaaaagag aggacttatt gcatggtggt gcccataaaa ccaataacac tattggtcaa 601 gcacttcttg ctaaaaaaat gggtaaaaca agaataattg ctgagacggg tgcgggtcaa 661 catggtgttg gcacgtctat ggcaggagca ctttttggtc tcgaaacaga gatttttatg 721 ggtagggtag atacagaacg acaacaacct aacgtagcac gtatgaaatt attgggtgca 781 aaagttacgc cagtcgatac aggttccaaa gttttaaaag acgctgtaaa tgaagctatg 841 agaaattgga ctgctacttt tgaaaatact cactatttac ttggcactgt gatgggtcca 901 cacccattcc caactatggt gagagatttt cagtcagtaa ttgggaaaga agttaaaaaa 961 caaataatgg agcaggaaga aagacttcct gattatttag ttgcctgtat tggagggggt 1021 agcaatgcaa tgggtttatt tcatccattt ttaagtaata atatcagtac tggcaatgat 1081 gatgccaaaa atgttaaaat gataggaata gaggctgcag gtaaggggct taacactagc 1141 cttcacggtg catccataac taaaggtgaa aaaggggtac ttcacggtat gctttcgtat 1201 ttcttacaag acgaggatgg acaaatagaa gaagcttata gtatttctgc cggattggat 1261 tacccaggga taggtccaga gcatgcttat ttacataacc ttgggcgtgt gcagtatgct 1321 tcagcaactg ataaacaggc cttaaaagca tttatggaac ttacgagaac cgaaggaatt 1381 atcccggctc tagaatcgtc tcacgcgatt gcttatgcca ttgaaaatgc aggaaatatg 1441 gataaggacg atataatggt aataaacctt tcaggacgtg gggataaaga tttaaacaca 1501 gtaataaatg cagtacataa attgggttgt taaaattaat taaaattaat taaaatatcg 1561 aggaatttaa atgaaaaact tagaaaattt agaaaaagat ttgaaaaatg acttaaaaaa 1621 agatttgaaa aaagaaaaac caattttagt tagtttttta gtatcagggg acccaaatat 1681 tgaagctaca ctaaaattta tgaatgcact agacgaatat tgcggagtta tagaactagg 1741 tataccattt agtgacccga tagcagatgg ttcaactatt caagaggcaa atgtacgttc 1801 cttatcaaat ggttataaaa tacatcaatc ttttgacgta ttacgggaat ttaggaaatt 1861 ttcagatacg ccagttgtac ttatgacgta ttacaatcca atatataaaa gaggtattga 1921 aaattttgta attcaagcaa aagaagcagg ggcaaatggg cttataattg tagatttacc 1981 cctagatgaa gcagaacagt atagggcaat atgtaaaaag catgatatgg gaacagtatt 2041 ccttgtagcc ccaaatacac ctgatgagag gttgatgtat tctgatgagg ctagtacact 2101 gtttttatac gtaatttcga catttggtat tactggagct aggggttcat ttgaaaagat 2161 gacttttgaa tttatagctc gtgcaaaaaa tctttgcgat aaaaataagc tgtatgtagg 2221 ttttggaatt tcaaacggtg aacatgctga aaaaataatt gaaaatggtg ctgacggtgt 2281 tattgtaggg agtgcttttg tagatattat taaggaatac ggggattcta atgaaactat 2341 ttataaatta aaagaattag ctcgggaatt aagcgaaggg attcataaag gttatgttaa 2401 atacaatgaa aagaataaat attaaataat ataatttatt ttaaattttg ggtggagata 2461 tgaatttaaa agataatata ctttataaat caatcaaatg gttttttgcg gttaaatcgg 2521 agaaacctaa aaattacgat actgaagtaa aacctatatt gtatgagcaa gagcgacgtg 2581 gtagacgccg tatattataa taaattctaa tttaaaaaaa taaaaaaaga aattatatta 2641 ttgtagtatt taattaatta ttcatttaat tcttttttga attcaaaaag tttttggcaa 2701 tgtccattat attttcagat attatatatt ttgaattttc ttttaaaacg ctatttgcaa 2761 cgtcaagtga tttataaact tgtgcttcac ctttaaagta catttgtgcc gcttctgcaa 2821 ctgtttttat tgctttagcc tgcccctcag cttcaattct aatactttct gcag // LOCUS MYCSD1XX 425 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.pneumoniae SDC1 repetitive sequence. ACCESSION M35024 KEYWORDS . SOURCE M.pneumoniae (strain M-129) DNA. ORGANISM Mycoplasma pneumoniae Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 425) AUTHORS Colman,S.D., Hu,P.-c. and Bott,K.F. TITLE Prevalence of novel repeat sequence in and around the P1 operon in the genome of Mycoplasma pneumoniae JOURNAL Gene 87, 91-96 (1990) STANDARD simple staff_review BASE COUNT 100 a 118 c 120 g 87 t ORIGIN 1 aattcgaatt tgaaggccca aggcctcacc caacccgcct acctcatcgc cggtcttgac 61 gttgtggccg accacctcgt ctttgcggcc tttaaagcgg gcgcggtggg gtatgatatg 121 acgactgatt cgagcgcttc gacctacaac caagcactcg cctggtcgac cacggccggg 181 ttggacagtg atggggggta caaggccttg gtggaaaaca cggccgggct caacggcccg 241 attaatggct tgtttaccct gctcgacacc tttgcgtatg tgacccccgt gagtgggatg 301 aaagggggga gtcagaataa tgaagaagtg caaacgactt acccggtcaa gtccgaccaa 361 aaggccaccg ccaaaattgc ctccttaatt aatgccagcc cactcaacag ttatggggat 421 gatgg // LOCUS MYCSDC1 425 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.pneumoniae SDC1 repetitive sequence. ACCESSION M35022 KEYWORDS . SOURCE M.pneumoniae (strain M-129) DNA, clone MP135. ORGANISM Mycoplasma pneumoniae Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 425) AUTHORS Colman,S.D., Hu,P.-c. and Bott,K.F. TITLE Prevalence of novel repeat sequences in and around the P1 operon in the genome of Mycoplasma pneumoniae JOURNAL Gene 87, 91-96 (1990) STANDARD simple staff_review BASE COUNT 102 a 110 c 123 g 90 t ORIGIN 1 aattcgaatt tgaaggctca aggcctcacc caacccgcct acctcatcgc cggtcttgac 61 gttgtggccg accacctcgt ctttgcggcc tttaaagcgg gcgcggtggg gtatgatatg 121 agcacggaaa acagtgctgc caccaaagac caagcactcg cctggtcgac cacggccggg 181 ttggacagtg ctggggggta caaggccttg gtggaaaaca cggccgggct caacggtccg 241 attaatggct tgtttaccct gctcgacagc tttgcctatg tgaccccggt gagtggcatg 301 aaagggggta gtcagaataa cgaagaagtg cagaccaagt atcccgttaa ggatgatagt 361 aaggcttccg ccaaaattgc gtccttaatt aatgccagcc cactcaacag ttatggggat 421 gatgg // LOCUS MYCSDC1X 425 bp ds-DNA BCT 26-JUL-1990 DEFINITION M.pneumoniae SDC1 repetitive sequence. ACCESSION M35023 KEYWORDS . SOURCE M.pneumoniae (strain M-129) DNA, clone MP46.. ORGANISM Mycoplasma pneumoniae Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 425) AUTHORS Colman,S.D., Hu,P.-c. and Bott,K.F. TITLE Prevalence of novel repeat sequences in and around the P1 operon in the genome of Mycoplasma pneumoniae JOURNAL Gene 87, 91-96 (1990) STANDARD simple staff_review BASE COUNT 105 a 114 c 117 g 89 t ORIGIN 1 aattcgaatt tgaagaccca aggcctcacc caacccgcct acctcatcgc cggtcttgac 61 gttgtggccg accacctcgt ctttgcggca tttaaagcgg gcgcggtggg gtatgatatg 121 acgactgatt cgaacgcttc gacctacaac caagcactcg tctggtcgac cacggccggg 181 ttggacagtg atggggggac aaggctttgg tagaaaacac aggccgggct caacggcccg 241 attaatggtt tgtttaccct gctcgacacc tttgcgtatg tgacccccgt gagtgggatg 301 aaagggggga gtcagaataa tgaagaagtg caaacgactt acccggtcaa gtccgaccaa 361 aaggccaccg ccaaaattgc ctccttaatt aatgccagcc cactcaacag ttatggggat 421 gatgg // LOCUS MZECAT1 2065 bp ss-mRNA PLN 26-JUL-1990 DEFINITION Z.mays catalase isozyme 1 (CAT-1) mRNA, complete cds. ACCESSION M33104 KEYWORDS catalase isozyme 1. SOURCE Z.mays seedling, cDNA to mRNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 2065) AUTHORS Redinbaugh,M.G., Wadsworth,G.J. and Scandalios,J.G. TITLE Characterization of catalase transcripts and their differential expression in maize JOURNAL Biochim. Biophys. Acta 951, 104-116 (1988) STANDARD simple staff_review FEATURES from to/span description pept 169 1647 catalase isozyme 1 (EC 1.11.1.6) mRNA 1 2065 catalase isozyme 1 mRNA BASE COUNT 502 a 558 c 487 g 518 t ORIGIN 1 gaaaaaaaag gggaaatcgg cttcctactc cccgtcctta tcgccagccg aaccgacatg 61 ttttctcccc ccttctcgcc ttctccttct ccccctagtc tagaggcgtt tgctccccaa 121 ctccttcggc ccgtccgccc gcccactcga ctgatcccac cggcagccat ggatccatac 181 aagcaccgcc cgtctagtgg gagcaactcc agcttctgga ccaccaactc cggcgccccc 241 gtctggaaca acaactctgc cctcaccgtc ggacagcgag gtccaatcct ccttgaggat 301 tatcatctaa tcgaaaagct tgctcagttc gacagagaac gtatccctga acgtgttgtg 361 catgcacggg gagccagtgc caagggtttc tttgaggtca ctcatgatgt ctctcacctt 421 acatgtgctg attttctccg tgctcctggg gtccagacac ctgttattgt ccgtttctct 481 acagttgtgc atgagcgtgg aagccctgag accttgaggg atccacgtgg ttttgctgtc 541 aagttctaca ccagagaggg taactttgac ctcgtgggta acaacatgcc tgtgtttttc 601 atacgagatg ggatgaaatt ccctgacatg gtccacgctt tcaagccgaa tccaaagacc 661 aatttgcagg agaactggag aatagtagat ttcttctctc accacccaga gagcctacac 721 atgttcacct tcctctttga cgatgttggc atcccactca actacaggca catggagggc 781 tttggtgtca atacctactc cttgatcaac agggatggaa agcctcacct tgtgaaattc 841 cattggaagc ctacttgtgg tgtgaaatgc ttgctcgaca atgaagctgt gactgttgga 901 ggcacctgcc acagccatgc gacgaaggat ctatatgatt ccatcgcagc tgggaattac 961 cctgaatgga agctctacat ccagactatt gatcttgacc atgaggataa gtttgacttt 1021 gacccgctcg atgtcaccaa gacctggcct gaggatatca tcccgctgca gcccgttgga 1081 cggatggtcc tgaacaagaa cgtcgacaac ttctttgcag agaatgaaca gattgctttc 1141 tgcccagcga ttagtgttcc tgcaattcac tattctgatg ataagctgct ccagacgaga 1201 atcttctcct atgctgatac ccagaggcac cgccttggtc caaactatct gatgcttcct 1261 gtgaatgcac caaaatgtgc ccaccacaat aaccaccatg atgggttcat gaacttcatg 1321 cacagggacg aagaggtgaa ctacttccct tcgaggtttg atcccgcccg tcacgcggag 1381 aaggtcccca ttcctccccg tgttctaaca cgctgtcgtg agaagtgcat cattcagaag 1441 gagaacaact tcaagcaggc tggcgagaga tatcgttcct tcgaccctgc aaggcaagac 1501 cggttcatcc agcgatgggt tgacgcactg acacaccctc gcgtgaccca tgaacaccgt 1561 accatttgga tctcctactg gtcccagtgc gacgccgctc ttggccagaa gctgccttct 1621 aggctgaacc tgaagccgag catgtaagga tcgacgagga agaaagcagg caccggtggc 1681 caaggatgca acgcaacatg gagcgtgtga tgtttacacc aatataattg aataaacagg 1741 ggatgtgcgc gttgtcgtac ttatgctgat gctgatggtc ggtggtcgat tatatatact 1801 ggaacttctg gtgtatgctc ttctcttctg gggagacgta atctaacgaa gaagaatgtg 1861 tgtcattgtg gcctgtgcta caaaccctgc tgtatgggcc tgtctataag aaaacacgga 1921 tggagttgtg acgttatgtt ctgacagttt atttactaat gagcacatac tttgatctaa 1981 ctagaacgaa gagaagttca cggaactgtc ggacacatgc agcaaggatc ctcattataa 2041 tacgaatcac tcttcgtttg cattc // LOCUS MZECAT3 1790 bp ss-mRNA PLN 26-JUL-1990 DEFINITION Z.mays catalase isozyme 3 (CAT-3) mRNA, complete cds. ACCESSION M33103 KEYWORDS catalase isozyme 3. SOURCE Z.mays seedling, cDNA to mRNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1790) AUTHORS Redinbaugh,M.G., Wadsworth,G.J. and Scandalios,J.G. TITLE Characterization of catalase transcripts and their differential expression in maize JOURNAL Biochim. Biophys. Acta 951, 104-116 (1988) STANDARD simple staff_review FEATURES from to/span description pept 22 1509 catalase isozyme 3 (EC 1.11.1.6) mRNA 1 1790 catalase isozyme 3 mRNA BASE COUNT 386 a 550 c 550 g 304 t ORIGIN 1 cgtgggtagc tagctaggtg aatgacaatg gatcctacca agttccgtcc gtccagcagc 61 cacgacacga cggtgacgac gacgaacgct ggcgctcctg tgtggaacga caacgaggcg 121 ctgactgtgg ggcctcgcgg tcccatcctg ctggaggact accacctgat cgagaaggtg 181 gcgcacttcg accgcgagcg catcccggag agggtggtgc acgcgcgtgg cgcgtccgcc 241 aagggcttct tcgagtcgac ccacgacgtg acgtcgctga cgtgcgccga cttcctgcgc 301 gcgcccggcg tgcggacgcc cgtgatcgtg cgcttctcgc aggtgatccc agagccgggg 361 tccggacgga cgatccgaga cgcgcgcggg ttcgccgtga agttctacac ccgcgagggc 421 aactgggacc tgctgggcaa caacttcccc gtcttcttca tccgcgacgg catcaagttc 481 cccgacgtga tccacgcgtt caagcccaac ccgcggtcgc acgtgcagga gtactggcgg 541 gtgttcgact tcctgtcgca cctccccgag agcctgcaca ccttcttctt cctcttcgac 601 cacgtgggcg tgccgtccga ctaccgccac atggaagggt tcggcgtgaa cacgtacacg 661 ttcgtgagcg cggcggggaa ggcgcagtac gtgaagttcc actggaagcc gacgtgcggc 721 gagcggtcca tcctgacgga cgaggaggcg cgcgtcgggg gacggaacca cagccacacg 781 caggacctgt acgactccat cgcggcggag gggagcttcc cggagtggac gctgtacgtg 841 caggtgatgg acccggcaca gcaggagcag tacgacttcg acccgctgga cgacaccaag 901 acgtggccgg aggacctgtt gccgctccgc cccgtgggga ggctggtgct ggacaggaac 961 gtggacaact tcttgaacga gaacgagcag ctggcgttcg ggccggggct ggtggtgcca 1021 gggatctact actcggacga caagatgctg cagtgccggg tgttcgccta cgccgacacg 1081 cagcgctaca ggctgggtcc caactacctg atgctgcccg tcaacgcgcc gcgctgcggc 1141 acccacaaca accactacga cggcgccatg aacttgatgc accgcgacga ggaggtggac 1201 tactacccgt ccaggcacgc gcgccgctgc ggcagggcgg cgcccacgcc actgccgccc 1261 aggccggtcg cggggaggag ggagaaggca accatacgca agcccaacga cttcaagcag 1321 ccaggggaga ggtaccgctc ctgggacgcc gaccgacagg accgattcgt gaaggcgatt 1381 cgccgactcg ctcggacacc caaacgtcag ccagagctca ggtccatctg gatagacctc 1441 ctcgccaagg tcgacgcgtc gctggggatg aagattgcca cccggctcaa catgaaggca 1501 aacatgtgat gcttgtgctg aatagaataa taatgaagac gcatgcatgt cgtcgccagg 1561 aacaagagaa ataataacaa gaccaccacg catgggcata ctccatatat atatgtatag 1621 cccgtgcccg tgtccgcctt tgtaccaata caagccaaga ctagtggatg tattattatt 1681 attattattg cgctatcaca tacatgtacc cctgctacct gaagatggat attgtatcca 1741 gttatcaaat taagacacct gcagcaaaaa aactatatat gttgcataag // LOCUS BRVRNASA 197 bp ss-mRNA VRL 26-JUL-1990 DEFINITION Berne virus ORF5 mRNA, 5'end. ACCESSION M33503 M33501 KEYWORDS core protein. SOURCE Berne virus (strain P138/72) viral RNA. ORGANISM Berne virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Toroviridae. REFERENCE 1 (bases 1 to 197) AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M. TITLE A 3'-coterminal nested set of independently transcribed mRNAs is generated during Berne virus replication JOURNAL J. Virol. 64, 331-338 (1990) STANDARD simple staff_review FEATURES from to/span description pept 137 > 197 ORF5 mRNA 113 > 197 RNA5 BASE COUNT 50 a 27 c 38 g 82 t ORIGIN 1 ttatttcttc ttcctacttt gtggctactt gggttttgtt ggtggtggtt attattttag 61 tatttataat tataagtttt tgtattagta attaagtagg ttagtgagag acactatctt 121 tagagaaaga gccaagatga attctatgct taatccaaat gctgtgccat ttcaaccatc 181 acctcaggtt gttgcat // LOCUS BRVRNASB 179 bp ss-RNA VRL 26-JUL-1990 DEFINITION Berne virus ORF3 mRNA, 5' end. ACCESSION M33502 KEYWORDS core protein. SOURCE Berne virus (strain P138/72) viral RNA. ORGANISM Berne virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Toroviridae. REFERENCE 1 (bases 1 to 179) AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M. TITLE A 3'-coterminal nested set of independently transcribed mRNAs is generated during Berne virus replication JOURNAL J. Virol. 64, 331-338 (1990) STANDARD simple staff_review FEATURES from to/span description pept 153 > 179 ORF3 BASE COUNT 52 a 17 c 34 g 76 t ORIGIN 1 ttataatctt cttcctactt ggattacatg gcttacttta ggttttagtt tgtttagtat 61 agtaataagt ggtattaata ttattttgtt ttttgaaatg aatggtaagg tgaagaaaag 121 ttagtcactt tctttagaag aaggttgcca aaatgtttga gaccaattat tggccattt // LOCUS CHKGLOBA 1204 bp ds-DNA VRT 26-JUL-1990 DEFINITION Chicken pie-alpha-globin gene, fragment H3/H4. ACCESSION M30485 KEYWORDS pie-alpha-globin. SOURCE Chicken AEV transformed erythroblast DNA, fragment H3/H4. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1204) AUTHORS Broders,F., Zahraoui,A. and Scherrer,K. TITLE The chicken alpha-globin gene domain is transcribed into a 17-kilobase polycistronic RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 91 > 1204 pie-alpha-globin mRNA fragment H3/H4 (put.) BASE COUNT 282 a 252 c 263 g 407 t ORIGIN 1 ggatctatct agttgctgca gtcgtttgta tgaaggttgg atccatcctg ttttgtactg 61 gatgactgcc ttcaattcac tggcaatcta ggatcaaatg tgtcctagag aacattcaat 121 atcgcttttt ttctaagctg ttgcaagcca gaatggttac ttttgagctg atctcggtgg 181 agcagttgag ttgttgtaag ttatttctta atggctccag aaaattacat catttaggtg 241 ctataactct ccatttccat cttgtatgcg taattgcatt tcttgaatac ttcagacatt 301 aatttcccgt cctacctgca ggttactggt gtgtattggc tatacagatt acttttccac 361 agatgtaacc ctaggtcttt tgaatataga tcccatctat tgtctgctta gagaccccga 421 taaccctccc gataaatcag agtccatgtt ttttgacagt atatcggtgt gaacatctgg 481 attttagtgc aatatgctag tagcaatctg agtccccgtt tctaagacag agtcatttag 541 tccgagaatg gctgtttaag actccaaatg gcagtcttga gtcttttagt gactgtactc 601 gttcctctac tgagggcagt cttgagtgtt ttagtgactg taccctgtct cttaacttga 661 ccggtctgat agatcttaaa tgacagtcgt ggccgcaatt tcaaatggaa gagctaggag 721 tctcaggaac cgtcgccctt gtttactctt atgtttaccc gttaagccgt catgaaaagg 781 atttttctgt agagaacggt tatatgagtt gtattccatc tagggtcacg gcccctagac 841 caaccaacga cgagtcgatt tgttgtctgg cactttctgt gacttcaagt tttgtggctt 901 tctctattaa ctttccccac aacgtaactg tctaacttag atgttggcgc gagaactaca 961 gtctgaggga cttgtcaaga gctggcacac tcgcctttat gttaaagtgt gtcctttgtc 1021 gatactggta ctaatgctta agctcgagcg ggcccctaga ccaacgacga gtcgatttgt 1081 tgtctggctc tttctgtgac ttcaagtttt gtggctttct ctattaactt tcccacaacg 1141 taactgtcta attagatgtt ggcgcgagaa tacagtctga gggattgtca agagtggact 1201 ggtt // LOCUS CHKGLOBB 582 bp ds-DNA VRT 26-JUL-1990 DEFINITION Chicken pie-alpha-globin gene, fragment H10. ACCESSION M30486 KEYWORDS pie-alpha-globin. SOURCE Chicken AEV transformed erythroblast DNA, fragment H10. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 582) AUTHORS Broders,F., Zahraoui,A. and Scherrer,K. TITLE The chicken alpha-globin gene domain is transcribed into a 17-kilobase polycistronic RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 241 > 582 pie-alpha-globin fragment mRNA H10 BASE COUNT 171 a 128 c 108 g 175 t ORIGIN 1 tccaaaaaac ttactctgct tgtaaatgtc gtctcctttt tcggagacaa aaacttgata 61 ccttcttgcc ttgtccgaag tcactttatc ggttatagga cccaagtttt gggccttgct 121 agaaggatac aattccctat gaccgccgta ttttggggta ctcgcattcg cccgacatcg 181 agtggacctc ctttttttct cttgtcgttc gtagaggtta tcgaggtccc cccatatata 241 ataaccctat cgtgagttta gacttcctac aaaaacttct gtcgtttaat gttttcgtac 301 cgtcacggtg actgtccagt aatcaaagtt gtcactgtct aaaaagattc gacaacttcg 361 tcttaccaat gcgaaaactc gactagagac actcgtcaac tcacacattc aataaagaat 421 taccgaggtc ttttaatgta gtgaaatcac gatattgaga ggtaaaggta gaaacatacg 481 cattaaccta aagaacttat gaagtctgta attaaaggac cacaagcaat acgaaagaca 541 atgtatttct tctaacgtcg gataagtatt aggatggacg tc // LOCUS ECOPHOAA 600 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli alkaline phosphatase (phoA) gene, 5' end. ACCESSION M33536 KEYWORDS alkaline phosphatase. SOURCE E.coli (strain K-12) cell line BW7710 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 600) AUTHORS Agrawal,D.K. and Wanner,B.L. TITLE A phoA structural gene mutation that conditionally affects formation of the enzyme bacterial alkaline phosphatase JOURNAL J. Bacteriol. 172, 3180-3190 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.K.Agrawal, 03-APR-1990. The phoA503 mutation does not interfere with export of active enzyme but does interfere with assembly. FEATURES from to/span description pept 283 > 600 alkaline phosphatase precursor (phoA) (EC 3.1.3.1) sigp 283 345 alkaline phosphatase signal peptide matp 346 > 600 alkaline phosphatase variant 413 413 c in wild type; t in phoA503 mutation BASE COUNT 159 a 130 c 151 g 160 t ORIGIN Map position 8.7 minutes; 1 bp upstream of HindIII site. 1 aagctttgga gattatcgtc actgcaatgc ttcgcaatat ggcgcaaaat gaccaacagc 61 ggttgattga tcaggtagag ggggcgctgt acgaggtaaa gcccgatgcc agcattcctg 121 acgacgatac ggagctgctg cgcgattacg taaagaagtt attgaagcat cctcgtcagt 181 aaaaagttaa tcttttcaac agctgtcata aagttgtcac ggccgagact tatagtcgct 241 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt 301 gcactggcac tcttaccgtt actgtttacc cctgtgacaa aagcccggac accagaaatg 361 cctgttctgg aaaaccgggc tgctcagggc gatattactg cacccggcgg tgctcgccgt 421 ttaacgggtg atcagactgc cgctctgcgt gattctctta gcgataaacc tgcaaaaaat 481 attattttgc tgattggcga tgggatgggg gactcggaaa ttactgccgc acgtaattat 541 gccgaaggtg cgggcggctt ttttaaaggt atagatgcct taccgcttac cgggcaatac // LOCUS GCOEARA 1771 bp ds-DNA PLN 26-JUL-1990 DEFINITION G.tikvahiae McLachlan 18S ribosomal RNA gene. ACCESSION M33640 KEYWORDS 18S ribosomal RNA. SOURCE G.tikvahiae McLachlan (isolate Pomquet Harbour-Nova Scotia) DNA. ORGANISM Gracilaria tikvahiae McLachlan Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; Florideophycideae; Gigartinales; Gracilariaceae. REFERENCE 1 (bases 1 to 1771) AUTHORS Liu,Q.-Y., Bird,C.J., Rice,E.L., Murphy,C.A. and Ragan,M.A. TITLE Nucleotide sequence of the 18S ribosomal RNA gene from the red alga Gracilaria tikvahiae mclachlan JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ragan 08-APR-1990. Atlantic Research Lab, National Research Council of Canada, 1411 Oxford Street, Halifax, Nova Scotia CANADA B3H 3Z1 FEATURES from to/span description rRNA 1 1771 18S ribosomal RNA BASE COUNT 445 a 371 c 501 g 454 t ORIGIN 1 ccacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc 61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct 121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccata 181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt 241 tttttctgat cgcactcgtt gcgacgcacc gttcaaattt ctgacctatc aactttggat 301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg 361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc 421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta 481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc 541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc 601 tcgtagtcgg attttggcgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg 661 cctttgtgga tgggagtcag gtggtgcttc actggatcgc ttggctgccg ccaccgttta 721 ctgtgaaaaa attagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata 781 atagaatagg acccggtcct attttgttgg tttgtttgaa tcgggtaatg attaagaggg 841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg 901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga 961 tcgaagacga tcagataccg tcgtagtctt tac tataaac gatgaggact ggagatcgga 1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg 1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg 1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg 1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt 1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag 1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc 1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc 1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat 1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa 1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct 1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat 1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt 1741 ttccgtaggt gaacctgcag aaggatcaag c // LOCUS HS6MCP 4440 bp ds-DNA VRL 26-JUL-1990 DEFINITION Human herpesvirus type 6 major capsid protein (MCP) gene, complete cds. ACCESSION M33515 KEYWORDS major capsid protein. SOURCE Human herpesvirus type 6 DNA. ORGANISM Human herpesvirus type 6 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 4440) AUTHORS Littler,E., Lawrence,G., Liu,M.-Y., Barrell,B.G. and Arrand,J.R. TITLE Identification, cloning, and expression of the major capsid protein gene of human herpesvirus 6 JOURNAL J. Virol. 64, 714-722 (1990) STANDARD simple staff_review FEATURES from to/span description pept 235 4272 major capsid protein (MCP) BASE COUNT 1422 a 1169 c 785 g 1064 t ORIGIN 1 tatcgtgaac gatatttggc ccggacgttt gaaaaatttt ctctatgatt gactcgatct 61 tttccagaac tacaggcatg gatcgcgcta aacgagtttc ctcgtcgcga gacacttcag 121 cggtcagatc acacgaatct ataaaaactg gaatcgaccg tgcacaagtg gaaccaaaac 181 atgaattaac tattaaagtt tcacaattac cggtgtgctg cataacgccg aaacatggaa 241 aattggcagg cgaccgaaat tttacctaag atcgaagcac ctctaaatat tttcaatgac 301 attaaaacat acacagccga acaacttttt gacaatttgc gaatttattt cggtgacgat 361 ccgagccgtt acaacatcag ttttgaagcc ttactcggaa tctactgcaa caaaatagaa 421 tggattaact ttttcaccac gccgatcgcc gttgcagcga acgtaatccg cttcaatgat 481 gtgagtcgaa tgaccctcgg gaaggttctc ttctttattc aattacctag agtcgctaca 541 ggaaacgacg taactgcttc aaaagaaacc accatcatgg tagccaaaca ctcagaaaaa 601 caccccataa acatatcgtt cgatttgagc gctgcctgtc tggaacatct ggaaaacaca 661 tttaaaaaca cagtcatcga tcagatttta aacatcaatg cgttacatac agtcttaaga 721 tctttaaaga attcagccga ttcgctcgag cgaggtttga ttcacgcatt catgcaaacc 781 ttattgagaa aatctccccc gcaatttatc gtcctgacca tgaatgagaa caaagtacat 841 aataaacaag ctctgagccg agtacagcgc agcaacatgt ttcagagcct gaagaacaga 901 ttgttaacgt cattattttt tttgaacagg aataataata tttcatatat ctatagaatt 961 ctaaacgaca tgatggaatc ggtcacggaa agcattctaa atgatacgaa caactacact 1021 tccaaagaaa acgtccccct agatggtgtt ttattaggac cgatcggctc tatccaaaaa 1081 ctcaccagca tactctccca gtacatctcc acacaagtcg tctccgcccc aatctcatat 1141 ggtcacttta ttatgggcaa agaaaacgca gtgactgcga ttgcataccg tgcaatcatg 1201 gccgatttta ctcaattcac cgtgaacgcc gggacagaac aacaagacac taacaacaaa 1261 tcagaaatct tcgacaaaag ccgcgcgtac gccgacctaa agctgaacac gttgaaattg 1321 ggagataaat tagtcgcatt cgaccaccta cacaaagttt acaaaaacac agacgtcaac 1381 gatccgctag aacagagctt acaactaaca ttctttttcc ctttgggtat ctacataccg 1441 agcgagaccg gtttcagtac aatggaaaca cgtgtgaaat taaacgacac catggaaaac 1501 aacctaccca ccagcgtttt tttccacaat aaagaccaag tcgtgcagcg aattgatttt 1561 gccgacatat taccgtcggt ttgccatccc attgtccacg actcgaccat cgtcgaacga 1621 ctcatgaaaa gcgaaccatt gcctaccggc caccgctttt cccaactatg tcaactaaaa 1681 attacccgag aaaacccagc caggatctta cagaccttat acaacttata cgaaagtcga 1741 caagaagtac ccaaaaacac caacgtctta aaaaacgaat taaacattga agatttttac 1801 aaaccggaca atccaacact gccgaccgaa agacacccct tcttcgatct cacgtatatc 1861 cagaaaaacc gagccacaga agtactctgc acaccaagaa taatgatagg caacatacct 1921 ttaccgttag ctccagtctc tttccacgaa gcccgtacaa atcaaatact ggaacatgca 1981 aagacgaact gccaaaagta cgacttcacc ctcaaaattg tcaccgaaag cttgacgagt 2041 ggctcgtacc cagaattggc ttacgttatc gagaccttag tgcatggaaa caagcatgct 2101 tttatgatcc taaaacaagt aattagccag tgtatttctt attggtttaa catgaaacat 2161 atacttcttt tttgcaacag cttcgagatg atcatgctaa tctctaacca catgggcgac 2221 gaactgatcc cgggagcagc tttcgctcac tacagaaatc ttgtgtcgct aattcgccta 2281 gtgaagagaa caatctctat ctccaacctc aacgagcaac tttgcggcga acctctggtg 2341 aatttcgcca acgcgttgtt cgacggacgt ctgttctgcc cgttcgtcca taccatgccc 2401 agaaacgaca cgaatgcaaa aataacagcg gatgatacac cactgacaca gaacaccgta 2461 agagttagaa attacgaaat atccgatgtg caaagaatga atctaataga ttcaagcgtc 2521 gtctttaccg acaatgacag accatcgaac gaaaccacca tcctgagcga gatattttac 2581 ttctgcgtac tcccggcact atcaaataac aaggcctgtg gcgctggcgt caacgtaaag 2641 gaactagttc tagacttatt ctacacggaa ccgttcatca gtccagatga ttatttccag 2701 gagaatccga ttaccagcga cgttctaatg tctctgatcc gagaaggtat gggccctggc 2761 tacaccgtag ccaacacatc ctgtatcgca aaacagttgt ttaaatcgct aatctacatt 2821 aatgaaaata cgaaaatatt ggaagtggaa gtctccttag atcccgcgca gcgacacggc 2881 aactccgttc attttcaatc actacaacac attctataca acgggctttg cctgatctca 2941 ccgatcacca ccctaagacg gtactatcaa ccaatcccat ttcatcgatt cttctccgac 3001 ccgggaatct gcggcaccat gaatgctgat atccaagttt tcctaaatac atttcctcac 3061 tgtcaaagaa acgacggcgg ttttcctctc ccgcccccat tagcattaga attttataat 3121 tggcaacgaa caccgttttc cgtgtactca gccttctgcc ccaattccct gttgagcatt 3181 atgacgcttg ccgccatgca ctcaaaattg tctcccgttg ccatagcgat ccaaagcaaa 3241 aacaaaatcc atccgggctt tgcggccaca ctagtccgga cggataattt cgacgtcgag 3301 tgcctattat acagttccag agcagccaca tctataattt tagacgatcc cacggtcacc 3361 gcggaagcta aagatatcgc aaccacttac aacttcaccc agcacctaag ttttgtagat 3421 atgggcttag gttttagctc taccaccgcc actgccaatc ttaagcgaat taaatcagat 3481 atggggagca agatacaaaa ccttttctcc gccttcccga tacacgcgtt taccaacgcg 3541 gacataaata cgtggattcg acatcacgtc gggatagaaa aacctaatcc ctccgagagc 3601 gaagcactaa acatcataac gttcggcgga attaacaaaa acccaccctc catactactg 3661 catggtcaac aagctatctg cgaagttata ctgaccccgg ttacgacaaa cattaacttt 3721 ttcaaatcgc cccacaaccc aagaggcagg gaatcatgta tgatgggaac ggacccgcac 3781 aacgaagagg cggctagaaa agcattgtac gaccacaccc aaacagacag cgatacattc 3841 gccgcaacca caaacccttg ggcatctcta ccaggctcct taggcgatat tctatacaac 3901 acggcacaca gagaacaact atgttacaac cccaagacat acagtcccaa cgctcaattt 3961 tttaccgaat ctgacatctt aaaaacaaac aagatgatgt acaaagtgat aagcgaatac 4021 tgcatgaaat cgaactcgtg tttaaacagc gatagcgaaa tacaatactc gtgctctgag 4081 ggcacggata gcttcgtaag cagaccatgc cagttcttac aaaacgctct gcctcttcac 4141 tgttcatcca accaagctct attagagagt cggtctaaaa ccggcaatac gcagatcagc 4201 gaaacccatt attgtaatta cgccatagga gaaaccatac ctttccaact cattatcgaa 4261 tcatccatat aaaatggaaa ccgtctactg cactttcgat cacaaactgt cactttccga 4321 tatcagcacc ctatgcaagc tcatgaacat cgtcataccg atcccagctc accaccatct 4381 aataggtagc ggcaatttag gtctttatcc catcgtctcc tccaacaaag attacgtcca // LOCUS HUMSEXREPB 916 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human sex chromosome repeat, clone pDP330. ACCESSION M33524 KEYWORDS sex chromosome repeat. SOURCE Human cell line OXENII DNA, clone pDP320. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 916) AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and Page,D.C. TITLE Human sex-chromosome-specific repeats within a region of pseudoautosomal/Yq homology JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.C.Fisher, 03-APR-1990. FEATURES from to/span description site 1 388 low copy flanking sequence rpt 389 916 sex chromosome repeat BASE COUNT 206 a 228 c 229 g 253 t ORIGIN Chromosome Yp. 1 gaattcaggc ctcagtgtat gtctgtaaca caacagacag ggtctgcagg ggtcgaagta 61 ttttgtcatc aaagaggaag gaatgatcat tcatcataaa aggcaagaca tctttggtgc 121 aaggaaaact caagaaaaat accgcagacc atgcaatgag gcactggtcg atggagtgtt 181 gtaaacccgt cttcccagag tggcatgcac atggatccct cagcacatgg gtgacacaca 241 gactatgctt cagcaggtct gtctgggccc aagacacatt gtttctcatc agctcccagg 301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct 361 tcccagagtt gttacaaaga aatcccaaag actgggcccc ttaaacaaca accttgattc 421 tcacagtcct tgaggctaga agtctgagat caagctatgg ccagggctgg ttcctcctga 481 ggcctctctc cttgggttgt agatgctgtc ttctccctgt gtcctcacag ggttgtccct 541 ctgtgtgtgt ctgtgtcctc atctcctctt cttatgaggt gtcttagtcc atttcaggct 601 gctgtcacag catgccgtag actgggtggc ttatcagcaa cagacattga ttctcccaca 661 gtcctggaag ctggacgtct gagatcaggg tatgggcagg gctgcttcct cctgaggcct 721 ctgtcctggg cttgtagatg ctgtcttctc catgtgtccc catgtggtca tccctctgtg 781 ggtgtgtctg tttcctcatc tgctcttcta atgagatgtc ttagtccatt gcaggctgct 841 atcacagaat accataggct gggtggctta taaaccacag agttttattc ttccacagtc 901 ctggaggctg gaattc // LOCUS HUMSEXRPA 918 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human sex chromosome repeat, clone pDP316. ACCESSION M33523 KEYWORDS sex chromosome repeat. SOURCE Human cell line OXENII DNA, clone pDP316. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 918) AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and Page,D.C. TITLE Human sex-chromosome-specific repeats within a region of pseudoautosomal/Yq homology JOURNAL Genomics (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.C.Fisher, 03-APR-1990. FEATURES from to/span description site 1 388 low copy flanking sequence rpt 389 918 sex chromosome repeat BASE COUNT 207 a 242 c 226 g 243 t ORIGIN Chromosome Yp. 1 gaattcaggc ctcagtgtct gtctgtaacc caacagacgg tgtctgcaga gatcgaagta 61 ttttgtcgtc gaagaggaag gaatgatcat tcatcacaaa aagcaagaca tctttggtgc 121 aaggaaaact cgaggaaaat accgcagacc atgcaatgag gcactggttg acggtgtgtt 181 ataaacccgt cttcccagag tggcatgcac acggatccct caggacatgg gtgacacaca 241 gactatgctt cagcaggtct gtctgggccc aagacacagt gtttctcatc agctcccagg 301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct 361 tcccagagtt gttacaatgc aatcccaaag actgggcagc ttaaacaaca accttgattc 421 tcccacagtc ctggaagctg gaagtctgag atcaaggtgt gggcagggcg gttcctcctg 481 agtcctctct cctgggcttg tagatgccgt cttctccctg agtccccacg tggtcatccc 541 tctgtgtgcg tctgtgtcct catctcctct tcttatgagg tgtcttagtc catttcaggc 601 tgctgtcaca gcataccata gactgggtgg cttataagca acagacattg attctcccac 661 agccctggag gctggacgtc ttgagatcag gatatgggca aggctgtttc ctcctgaggc 721 ctctgtcctg ggcttgtaga caccatcttc tccctgtgtc cccacgtggt catccctcta 781 tgtgcatgtc tgtgtcctca tctgctcttc ttatgagatg tcttagtcca ttgcaggctg 841 ctatcacaga ataccatagg ctgggtggct tacaaaccac agacttttat tctcccacag 901 tcctggaggc tggaattc // LOCUS IRICAP 2461 bp ds-DNA VRL 26-JUL-1990 DEFINITION Iridescent virus type 1 capsid protein gene, complete cds. ACCESSION M33542 KEYWORDS capsid protein. SOURCE Iridescent virus type 1 DNA. ORGANISM Iridescent virus type 1 Viridae; ds-DNA nonenveloped viruses; Iridoviridae. REFERENCE 1 (bases 1 to 2461) AUTHORS Tajbakhsh,S., Lee,P.E., Watson,D.C. and Seligy,V.L. TITLE Molecular cloning, characterization, and expression of the Tipula iridescent virus capsid gene JOURNAL J. Virol. 64, 125-136 (1990) STANDARD simple staff_review FEATURES from to/span description pept 601 1995 capsid protein mRNA 587 > 2461 capsid protein mRNA ( 5' end +/- 5 bp) BASE COUNT 717 a 462 c 443 g 839 t ORIGIN 1 gaaggtgttg aaagatctac tgaaataggc ttcattagca tttttatttt gtccacaaat 61 tcattatttt taataggctg ttcttcacct ttattcgcat attcaaagta atcgattaaa 121 tttttttgaa tatggacgat atcatccatg aacataaacc aaacttcata atatatagta 181 tggagtaacg ggttaattaa accattgatt ccttttaatt gttttggatt aatgaggttt 241 aaatcatcat aaattttttc tatttttttt aaattttttc gagcaatttt taaatttgat 301 ttaaccaaac aaacttcctc tactttaatt gttacggttg gtacttttaa accattaatt 361 ttatttttag aggaagaaca acgctttatt aaagcgttgg aatccattaa tcgcttgttt 421 tatcataggt tattttttaa ctataaaaaa ataactaaat tactacagtt accaatatgt 481 cggcattagt tctccttcat attttcgtat tttataccct taaatttaac ctaatcaatt 541 tctacattta tttttgggtt caaaattttt agccgaaata ttgctactaa taaattaaac 601 atgtctatgt cctcatcgaa tataacctca gggtttatcg atatcgccac ttttgacgaa 661 atcgaaaaat atatgtatgg cggcccaaca gcaacagcat actttgttag agaaattaga 721 aagtcgactt ggttcactca agtaccagtt ccactatcta gaaatactgg taatgcggct 781 tttggacaag aatggtcggt atctatatca cgtgctggag attatttgtt gcagacctgg 841 ttacgagtca atatcccacc agttactctt agtggtctac ttggtaacac ttactcttta 901 agatggacca aaaatttaat gcataacttg attcgtgaag ccaccattac ctttaatgat 961 ttggttgcag ctcgatttga taactatcat ttggatttct ggtctgcttt caccgtacct 1021 gccagcaaac gcaatgggta tgataacatg attggtaatg tctcttcttt aattaatcca 1081 gttgctccgg gtggtacttt gggtagcgta ggtggtatta accttaatct tccacttcca 1141 tttttcttct ctcgagatac tggtgtagca ctaccaacag ctgctctacc ttacaatgag 1201 atgcaaatca actttaattt cagagattgg catgagcttt tgattttgac taacagtgct 1261 ctagtaccac cagcaagtcc atatgttcca attgttgtag gtactcatat ttcagctgct 1321 ccagttttag gaccagttca agtatgggct aactatgcca tcgtctccaa cgaagaacgt 1381 cgtagaatgg gttgtgccat tcgagacatt ttgattgaac aggttcaaac ggcaccacgt 1441 caaaattatg tacctttgac caatgctagt ccaacatttg atattcgttt ctctcatgca 1501 atcaaagcat tattctttgc tgtacgaaat aaaacatctg cagcagaatg gtcaaattat 1561 gctacttctt ctccagttgt tactggtgca acggttaact acgaaccaac aggttctttt 1621 gaccctattg ccaatacaac attgatttat gagaacacta atcgtttggg tgccatggga 1681 tcagattact tctctttgat taatccattc tatcatgctc caactattcc atcattcatt 1741 ggatatcatt tgtactcata ttctcttcac ttttatgact tggatccgat gggttctacc 1801 aattacggta aactcactaa tgtgtctgtt gtaccccaag ctagtccggc agcaattgcg 1861 gcagcaggag gtactggtgg tcaagcaggt tcagattacc ctcaaaatta tgaatttgtc 1921 atattagctg tcaataataa tattgtcaga atatcaggtg gagaaacacc acaaaattac 1981 atagcagttt gttaaggtaa tttgtaacgc tccacaacag gcggaagtgg tctcgtgaga 2041 gaccgatatt gaggttttat caaccttaat ttgaatcatg aattaacatg atactttggt 2101 accgtctagt cggcttatat gtcgggctaa tggtcttttt tgatcatcaa gtggctataa 2161 gtggtacgtc gacgacagtc gacacctagt ggtttaataa aggtttttta cccaaattaa 2221 actggaacag gcaaggttga tgaaaacggt caaaattcag atagtctcgg gggctatttt 2281 ggacaagacc gtcggtgcag ctaatgcgta agcatcagtg atatcgctat cgactgggtc 2341 atcaatcggt tgtcctatct gactttttaa agtctcagga tggctcaatg tacagtcagc 2401 ccgcagtaag gtgtattccg agctgtcttt gaggataaaa gtaaacttga aaaagaagct 2461 t // LOCUS MUSIGHAAR 363 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-J region, partial cds. ACCESSION M33679 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J. TITLE Clustered heavy chain somatic mutations shared by anti p azophenylarsonate antibodies confer enhanced affinity and ablate the cross-reactive idiotype JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital, Jackson 1402, Blossom Street Receiving, Boston, MA 02114 FEATURES from to/span description pept < 1 > 363 Ig heavy chain V-D-J region (AA at 1) BASE COUNT 98 a 83 c 89 g 93 t ORIGIN 1 gaggttcagc ttcagcagtc tggagctgag ttgatgaggc ctgggtcctc agtgacgatg 61 tcctgcaagg cttccggata tgcaatcaca agctacggtt taaactgggt gaaacagagg 121 cctggacagg gcctggaatg ggttggatat attcatcctg gaaaaggtta tattcactac 181 aatgaaaaat tcaagggcaa gaccacactg actgtagaca aatcctccaa tacagcctac 241 atgcaggtca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcgttt 301 tttgacattt acatgtatta ctttgactac tggggccagg gcaccactct cacagtctcc 361 tca // LOCUS MUSIGKABF 324 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse Ig rearranged L-chain mRNA V-J region, partial cds. ACCESSION M33678 KEYWORDS immunoglobulin light chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J. TITLE Clustered heavy chain somatic mutations shared by anti p azophenylarsonate antibodies confer enhanced affinity and ablate the cross-reactive idiotype JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital, Jackson 1402, Blossom Street Receiving, Boston, MA 02114 FEATURES from to/span description pept < 1 > 324 Ig light-chain V-J region (AA at 1) BASE COUNT 96 a 77 c 73 g 77 t 1 others ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagntgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaaaat taaagtcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagtga cctggagcat 241 gaagacattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga 301 ggcaccaagt tggaaatcaa acgg // LOCUS MUSTCVYAN 2567 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse T cell receptor rearranged beta-chain gene, V-2 region, 5' end. ACCESSION M33500 KEYWORDS T cell receptor; beta-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, hybridoma B.1.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2567) AUTHORS Ratanavongsiri,J., Igarashi,S., Mangal,S., Kilgannon,P., Fu,A. and Fotedar,A. TITLE Transcription of the T cell receptor beta-chain gene is controlled by multiple regulatory elements JOURNAL J. Immunol. 144, 1111-1119 (1990) STANDARD simple staff_review FEATURES from to/span description pept 2544 > 2567 T cell receptor beta-chain V-2 region precursor sigp 2544 > 2567 T cell receptor beta-chain signal peptide mRNA 2478 > 2567 T cell receptor beta-chain mRNA BASE COUNT 708 a 560 c 583 g 716 t ORIGIN 1 ctaaagttct tggctactgt tgtgtgcact ttgagtaatg attaagatgc attgggacag 61 ggggtggaga aatgtcccaa ggaggtagcc atgacctcca acactggtcc tgtggaggcc 121 ccgaggagct agctagccat ctgatctgga aacaagaggc ttaacctggc tcagtactga 181 aagctggtca agataagagg gggcaggcag atacctggag gcactgacct tgggaggcag 241 gaaggttagc aagggagata actggagtgt gagagacatt ctgatcccaa tcttgttaga 301 ggattaggct gaagagggtt cagtgtgaag ctcagtaaac tgagaagggc ctaggtttcc 361 ttctcctgga gtctgcttgg ctggacagag cacactgtcc ttagaaaagc aacagagctc 421 tcctggagga gctaggagcc actgacttca gacccaggga atatcttctc taccctcttc 481 cttctggctc ttaaggaggc tcacagggag cttatttagc tttttaagga gatttataga 541 ggctggagga acttgttttt tcaaaagtaa atgctctaga aaaatgaagg ttgaaggtgt 601 tatcaaactt gtgggtcaaa gctaaatgaa aaaaaaaatc aaaagaagga catgtctatt 661 cccaacataa gcagaagact tttattataa atatggtggg agaccatagt cagagacaga 721 gacagctggg aaaggccagc atgaacttga ccctgagcct ggacatctga ggacttgggg 781 gagcaggtgg gaagaaagaa gagagaaaag agagaagagg ggagaccagg agagtaaaga 841 gtagacaaaa ggacagcata gcaaaaatag ctggatttat aggggaaggt agctggggaa 901 aaggcagccc atcccctggg ctggagaagt ttagattaga gggtctgtat tctggccata 961 tcatatacta ggtaggacta aggaatgctg agtgaagctg gcatccaggt ccacaatgac 1021 atgttaaata agaacttcag ttagccattt gctttgggat tgaggcataa taaacgccag 1081 taccccaagc cagctctgtc cacttgtcct cagtaagtga acttaaacag ccaaaccagt 1141 aatctaaata actaactaac taactaacta aatcaatcaa tcaatcaatc aataaaagta 1201 gaaaagattt tttcagtgta aacacattgg taacatggaa aaagatccag agatccagta 1261 aactccctgt gtcagtcttg gggacctgca ggcaagatgg aagtttagag ggccaaggat 1321 aagcaatcta gctcaaagta tggtcctgcc ctgcattgac ccattgccta ggcttgttaa 1381 agctgtgtga aatctctttc caggagatac attcccactc tcgctggtgc ctttcctttc 1441 ttccatgttt tcctggggaa atttctcttt ctttggggtc acttttatca atagcctgct 1501 gttcagattg aaagactgtc tctttagaat gtctttattt ctgccaggtc agttatagaa 1561 agtggcatgt tttcctttat tcaggacaaa actcccattt tgattttctg cttgcattcc 1621 tggagtcaga cagatgagta ttcactgcat acagcctcgt ataaccctgc aaccacctcc 1681 acatgttcac ttaaatggag acattttact ctcttgcaag agcttgaaac tcaaactcag 1741 atctgtgaaa ctataaatcc agtttccttc catccctgct cctggagtga tgaccctgag 1801 actaattatc aataaatgcc tagagcataa gctccagcta gttctctgac ttgctctcaa 1861 cttattatgc cttttattct aacccagctt tagctacatg gctggtttcc tctccttgtc 1921 ttcttacttc agtctcctca gcattacagc tcgaatctct gttctatttc tcaagttcct 1981 ctacctgctg gattatgtcc ttttcctcag tgttccaggc aatctctact tttattctat 2041 cttgagtgac tagttacttc tgctcagctc ccatgattct gacctcctgt gttttgcagg 2101 caaatcttcc atgccctctc ctactatttc ccagaattct ctctattcct gctggatgtc 2161 ccacctactt cctgcatcag ctcattggcc ataagctttt ttattgacag gtgatactta 2221 acacatatca cttccaggaa tatctgttca ccactgagaa gatgcagggg cccagtcact 2281 gcactcagtt ctgtagtgag tgtacaatgt gcatgagtgt ggatgagaga gcattgctca 2341 gaccacagga aagggtgcaa accttcagtt tgaggttttc actttagagg aaagcttagt 2401 cagtttcctg aggaagtcac accctttgga acctcagccc caagacttaa gtttctcgtt 2461 accaccttac tggtttggat tctcttctct tgcctgatgc cctgcatgcc ccacagagat 2521 agagagaacc tgaggtctca gagatgtggc agttttgcat tctgtgc // LOCUS R751TRA 578 bp ds-DNA BCT 26-JUL-1990 DEFINITION Plasmid R751 traJ and traK genes, 5'end. ACCESSION M25422 KEYWORDS inverted repeat; transfer origin region. SOURCE Plasmid R751 (strain HB101, Inc P-beta) DNA. ORGANISM Plasmid R751 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 578) AUTHORS Lanka,E. and Euerste,J.P. TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of plasmid-encoded products with the transfer origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.Lanka, 17-JUL-1989. FEATURES from to/span description pept 48 < 1 (c) traJ protein pept 403 > 578 traK protein signal 243 211 promoter PL signal 266 294 promoter PR rpt 49 64 inverted repeat rpt 118 157 inverted repeat rpt 296 331 inverted repeat BASE COUNT 141 a 168 c 163 g 106 t ORIGIN 1 cggccgtgtt ccttttcgtc gttctccatg cctcgcctcg tctctcatgc cggcggtagc 61 cggctgcctc gcagagcagg atgacccgtt gagcgccccc ggcgcgaata agggacagtg 121 aagatagata accggctcgc cggttagcta acttcacaca tcctgcccgc cttacggcgt 181 taataacacc aaggaaagtc tacaccagcc attacgattt atccgcaact atcgcgctat 241 caggccgcaa aagcagcaac ggatatagcg aaacccgcca caatggccca taatgccgct 301 atcgaagcgt gccaatgcac gccgatagcg gactttttgc gtttccgtag cgccgcttag 361 tagcgttaca tttgcgatga gaggattaga tggacgaaca cgatgccaaa gacctacccc 421 gaagagctgg ctgaatgggt gaagggacgg gaagccaaga agccgcgcca ggacaagcac 481 gtggtcgcgt tcctggccgt caagagcgac gttcaagcgg cgctcgatgc gggctatgcg 541 atgaaaacga tctgggagca catgaaggaa accggccg // LOCUS RP4TRAB 571 bp ds-DNA BCT 26-JUL-1990 DEFINITION Plasmid RP4 traJ and traK genes, 5' end. ACCESSION M25423 KEYWORDS inerted repeat; transfer origin region. SOURCE Plasmid RP4 (strain HB101, IncP-alpha) DNA. ORGANISM Plasmid RP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 571) AUTHORS Lanka,E. and Euerste,J.P. TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of plasmid-encoded products with the transfer origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.Lanka, 17-JUL-1989. FEATURES from to/span description pept 26 < 1 (c) traJ protein pept 394 > 571 traK protein rpt 48 63 inverted repeat rpt 118 157 inverted repeat signal 219 192 promoter PL rpt 281 318 inverted repeat signal 246 272 inverted repeat BASE COUNT 135 a 148 c 181 g 107 t ORIGIN 1 ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc ggcggtagcc 61 ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata agggacagtg 121 aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg ctgacgccgt 181 tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata tcgtgcgaaa 241 aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta tgcagcggaa 301 aagcgctgct tccctgctgt tttgtggaat atctaccgac tggaaacagg caaatgcagg 361 aaattactga actgagggga caggcgagag acgatgccaa agagctacac cgacgagctg 421 gccgagtggg ttgaatcccg cgcggccaag aagcgccggc gtgatgaggc tgcggttgcg 481 ttcctggcgg tgagggcgga tgtcgaggcg gcgttagcgt ccggctatgc gctcgtcacc 541 atttgggagc acatgcggga aacggggaag g // LOCUS STAREPEBR 2389 bp ds-DNA BCT 26-JUL-1990 DEFINITION S.aureus ethidium resistance (ebr) and replication protein (repA) genes, complete cds. ACCESSION M33479 KEYWORDS ethidium resistance protein; replication protein. SOURCE S.aureus plasmid DNA. ORGANISM Staphylococcus aureus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 2389) AUTHORS Liao,J., C,-H., Moghazeh,S.L. and Projan,S.J. TITLE Genetic mapping and nucleotide sequence of pWBG32, an ethidium bromide resistance plasmid naturally occurring in Staphylococcus aureus JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.J.Projan, 30-MAR-1990. Public Health Res Inst, 455 First Avenue, RM 1166, New York, NY 10016 FEATURES from to/span description pept 1153 1476 ethidium resistance protein (ebr) BASE COUNT 796 a 403 c 290 g 900 t ORIGIN 1 ggtcaatatc tttaagataa tctaaatcgc cattttttaa tttatttctt gcgtctttaa 61 ataatccaga ataaacaaga atttgtttcc ctttaagaga tttataaaat gcgtcgaaca 121 ctttctgatt aattaaatag tcactatcct taccagaata tttagccatt tcatataatt 181 ctttattgct attttgctta attttttgaa catgaacttg cgtaatttca gaaattcctg 241 ttacatctcg ccataaattt aaccattctt tttgactaat ataagctttt gtatctttaa 301 aatatgattt attaacggcc atcaaaacat gaaaatgcgg attataatca tcacgctttg 361 agttatacgt tatctctaat tttcttacat aacctttagt gatcgcattt acttttttgc 421 gtttaaacat cttttgaaag gcatgattat aattcttaat ttcactttct aaatgctcat 481 ctgtaacgtt tggtgtcgta agtgtcaaaa agataaattg cttatcttct tcttgcttaa 541 tatattgcat cattaacgat aatcctaatg catcttttct tgctttacgc cacgcacata 601 ccggacaaaa tcgattctta caaggattcg atttatataa tttctttttt tcaaattttt 661 tatccgtcac aaaagacaaa aatgtattac aatttttaac caaatccatt tgatctcccc 721 gatatgacgt tcaataaaat ttttaaatac ttgatttctt tgctttttct cagtatactt 781 ttccatacga taatacacaa aaacaactta gttttctcaa aaactatgca taaaaaagtt 841 gcttttttct ccttttcttt ttttttcgtt tggattagac acctaaaacg atacaatagt 901 atgctagaaa aagcaacttt ttttgtgctt caaaccagtt ataccaatga attgaaaggg 961 ttatacatcg ccgggaatag ttacccttat tatcaagaca agaagaaact cgttttcaac 1021 tcgtttcaaa aacctttcaa aaaccatcaa tccacaaaaa taccacgcga atgacactca 1081 aaatacaaga ctacaattaa aaaatactta gaataaaatt aaataaaata cgaaaattaa 1141 aaggagttaa aaatgcctta tatttattta ataatagcca taagtactga agttattgga 1201 agtgcatttc ttaaatcttc agaaggcttt tcaaaattta taccatcctt aggaacaata 1261 atttcatttg gaatttgttt ctatttttta agtaaaacaa tgcaacacct accactaaat 1321 ataacttatg caacttgggc gggactaggt ttagtcttaa caaccgtagt ctcaataatt 1381 attttcaaag aacaaataaa tctaataact atagtatcta tagttttaat catagtcggc 1441 gtagtttcgt taaacatttt cggaacatcg cattaattgc tttattccaa ttgctttatt 1501 gacgttgagc ctcggaaccc ttaacaatcc caaaacttgt cgaatggtcg gcttaatagc 1561 tcacgctatg ccgacattcg tctgcaagtt tagttaaggg ttcttctcaa catcaataaa 1621 ttttctcggc ataaatgcca tgctataata gatacacgtc ttctcttagc gtttcatagt 1681 attatcctcg tttattatac ttataattat aggggaaggc ttagagctat cattttgata 1741 gctctttatt tttgttcaaa catttattca aaatcagaat gcctttattt tttaatttta 1801 aggggtattt tgaagaatta agggttattt atatagtttt atacctaaaa acttatatcg 1861 gctcttaaaa cgcaaataag agccgaataa aaataattgc ttttcacaaa caaaaatttg 1921 agcaaaacca gtgttgaatt ttttagacac tgcccatcta catgcaaatt taaaaattgg 1981 cataaaaaat gggcaaccat gctggttgaa cgctatagtt cctgcagggg caaaaaagca 2041 taaaaaaacg ctagctttga tgagctaacg ttagttataa aattcagtaa tatgcttttg 2101 taattcaata gattctcttt cttttttagc ttgtcttttt ttaaaacctt ctgaatttct 2161 agaagcctta tatatatcca ttattttttt ataatcaatg tcgtaaccat atttttgtaa 2221 ctcttctaca aaaaacttat cgcaatttaa tatcattttt cttcctcgat ttcgtttatc 2281 atttgatgat ttattttttc tttttcttgt tcagttaaat cataaatttc acttgctaag 2341 tattcttttt gattccaaat ataaaaaatt tgataaatat attcagtcg // LOCUS XANAVR 2100 bp ds-DNA BCT 26-JUL-1990 DEFINITION X.campestris avirulence protein (avrBs1) gene, complete cds. ACCESSION M32142 J03672 KEYWORDS avirulence protein. SOURCE X.campestris (strain E3, race 2, pv. vesicatoria) DNA. ORGANISM Xanthomonas campestris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2100) AUTHORS Ronald,P.C. and Staskawicz,B.J. TITLE The avirulence gene avrBs-1 from Xanthomonas campestris pv. vesicatoria encodes a 50-kD protein JOURNAL Mol. Plant Microb. Interact. 1, 191-198 (1988) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by P.Ronald, 15-FEB-1990. FEATURES from to/span description pept 308 622 ORF1 pept 713 2050 ORF2 BASE COUNT 656 a 423 c 505 g 516 t ORIGIN 1 ccattgtcgg cggttatccg ggtacttggc gtacaccaaa caactggggc aatgctggca 61 aatcacgtga cgaagccttg gcagacgagc aacagaggat tcaagcgctt aaatcgcaag 121 agacggtaca tatcttccat cgcaaagatg tcaagagcga acccgcaacc cacgcggggc 181 gacgttaagt aagccactga tttttagcga agaagagctt gtgagagctg cgggcgccaa 241 atatgtacgt ttgacagtga cagatcatct ttcaccacgg gcggacgata ttgatgcgtt 301 tattgcaatg gagcgggaga tggcccatga tgagagactg catgtacatt gtggtatggg 361 cctaggccgt acgacaatat ttattgtcat gcatgacata ctaagaaatg ctgcaatgtt 421 atcgtttgat gatatcatcg aacggcaacg taaatttaat ccagggcgaa gcttggataa 481 taataaagac gtttctgaca aggggcgctc agaatttcgt aatgaacggt cagagttcct 541 tcctctattc tacgagtacg ccaagcaaaa tccaaagggc cagccattgt tatggtccga 601 atggctcgac cacaatgcat aaatcgcaag tacattttcg gctatgacgg acttgtgctc 661 gatgcgctgg cggctttctc gataaatatc aattaatata aatatcgaac taatgtccga 721 catgaaagtt aatttctctt caaaaataat agattcaaca cccagtgaag aggaggtcgc 781 cactcagcaa gatagttata cgaaatctgg actggtggcg ccatcgctcg attcacaagc 841 cttgaaaaaa gcacctagaa aaagagtaat aaaagaaaat atagctgctt tgcacacctc 901 atcgttagag cgagttcatc aaaagaaggt attagttcag aatttagcgc agttgcagag 961 agggttggct aagataaatg gtagagtcga actcgaagag ctaattgatg gattttcagt 1021 caaggaattg ctaataaaaa gaaatccaaa gattgctgaa gagtatggag aaggaaatcc 1081 tttaatgatt cgatctctaa gattttcaaa cccccaagag gtgactagta agcttggggc 1141 ggaaggaaaa acgccagcca aaagagaggt tgatacgatt tgcaataaat ccacgctgca 1201 tgacattgtc atgacgcccg cctcccttgt aaaaaaggaa gtgcggatga acctgatatc 1261 tgaagtccca agggcgaagg ataaacaaaa atacagaggt cttccttcag tcgtatatgg 1321 ccaaagcagc cgccgtagtg aatcagacta tctaacgtct cgaaatggtt tcggcgacgt 1381 gcactctttg aaatccaata acgcatttaa ttccgactac gaaaaaatat gtgggtcgct 1441 tagccatgcc gaaaagttgg ggttaattga aaggaatctt actcccttta taaggcatga 1501 tccagataga atctccaccg actttgttca ctctattgaa gaattggctg aacaccagat 1561 gctattgcaa tcaagaaaac ctgccagtgc tttgcggcat aatgaatatt gcaccaagct 1621 tgaactgtgg gatgctaaag ctatagcagt tggtgaatct cgtgccttgg cggtcgctac 1681 cctgattgaa tttaatttgg agatgttgtc gatagcacaa gagatagatg atgatgggca 1741 caagagtaaa atggtcgccg attttatcga gcgccaacta tcatggcttg gcccacaaac 1801 cgcacttgac agcaagtcaa cgcttgaaag ggtttcagcg gtgaccatac aagaaaggga 1861 atttatcgct aatgagatta gccgatcgtt gcgtcaaggt gtttcacttt gcacttacga 1921 taaagatgaa gcaggaagtc atatccgtga aatgagtttg ttggatttta gggttgaaga 1981 aatcatagag gggataagta tttttatttc ctccaagctt ttacatgtta caaatgcagg 2041 agaagcgtaa gagaagaagt atccgccaca atcgtgcgac ggaccgacgt cctaacgccc // LOCUS YSCSCD25 5055 bp ds-DNA PLN 26-JUL-1990 DEFINITION S.cerevisiae SCD25 gene, complete cds. ACCESSION M26647 M31771 KEYWORDS Ras protein; SCD25 gene; cell division cycle. SOURCE S.cerevisiae (strain OL136) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 2129 to 5055) AUTHORS Boy-Marcotte,E., Damak,F., Camonis,J., Garreau,H. and Jacquet,M. TITLE The C-terminal part of a gene partially homologous to CDC25 gene suppresses the CDC25-5 mutation in Saccharomyces cerevisiae JOURNAL Gene 77, 21-30 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 3880) AUTHORS Damak,F., Boy-Marcotte,E., Le-Roscouet,D., Guilbaud,R. and Jacquet,M. TITLE SCD25, a CDC25 like gene, which contains a RAS activating domain is a dispensable gene of Saccharomyces cerevisiae JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by E.Boy-Marcotte, 02-AUG-1989, for [2] by F.Damak, 01-FEB-1990. Laboratoire IGD, Groupe des laboratoires de biologie cellulaire, Centre universitaire d'Orsay, 91405-Orsay Cedex FEATURES from to/span description pept 128 3880 SCD25 protein pept 4319 > 5055 ORF X BASE COUNT 1638 a 973 c 900 g 1544 t ORIGIN 1 ctgcaggctc gcaaaattta aggttccctt ctacaatagt agtcaaaatt gcttttttgc 61 atataacaaa gtgaaaaaaa aaaatatgag agacatatct aaaagacata tataatctgc 121 caccataatg agttgcactg cgtcatatgc cggcatgaca actccggtga aagataagga 181 aggccacggg attccatgct tacaacctat cgatgtagtg gaatgtacct atcaatattt 241 tacaaaatca cggaataaac tgtctttaag ggtaggcgat ttgatttacg tactcactaa 301 aggttctaat ggctggtggg atggtgttct tatcagacac agcgctaata ataataataa 361 taattcgttg atactagaca gaggttggtt ccccccttct tttacacggt ccattctaaa 421 cgaactacac ggggtgcctg acatcggtaa tgaattggaa atatttcaag cgggtcttaa 481 tcttaaactg gaattatcaa gcaacccagt gatcttatca ttggaagact ttttagactg 541 ctgtcgcgat attgaattca aggaacaact ggcttggtca cctactcccg tccacgaaag 601 gaaaggctgc tgtgagctgc tgtactataa ccaggattta gatgtttatt gtcgcacgtt 661 accatattta ccacaaaatc aagttgaaac cgtgaacgac tattcgtctt ttcctgcaat 721 atcgaagatt gctggtaaaa agatgcctat aacgtcaagc cccgatctgt tctatctcaa 781 tgattgtgat gtcgtctatt ggtatgacct cactcgctta gtgtgtcatt atgttaattt 841 aacagagcgc gacctattgg caaatgaacg ggaaaagttt ctaacttcct tggatttatt 901 aacagctcaa ataacctatg tttatatgct tttcaggaat ctccgtttag ttgaagatag 961 tttcaaaaaa accctcaaaa aactaattta caccttgtct aggttttcaa taaatgcaaa 1021 tatttggttt cattccacat cgtttgaaga aagagaagcc atagcctccc agaaggatcc 1081 agaaagaaga tcccctcttc tacagtcaat cctaggaacc ttccaaaaat ttcattttct 1141 actgcgtcta ctacatttcc tctcaaatcc taacgaactt acaatactgc ctcaattgac 1201 tcctcgattt ttcaaggatt ctttcaatac aatttcatgg aataacccgt ttttgcgtac 1261 agtcttcaac cagcatatgt ccatgacctt accgagacag atgattaaag ccgttgctgg 1321 cgcttcagga attgttgcgg aaaatattga tgaaattcca gcttccaaac agggcacttt 1381 catctcgtca gaaacgtctc accattcacc atcagccccg tttcaaagaa ggagaagagg 1441 taccattttc tctaatgtgt caggaagttc cgatgagtct gacaccatat ggtccaaaag 1501 gaaaaaacca tacccgctaa atgaagaaac tctaagcctt gtaagggcca ggaagaagca 1561 gcttgatggt aaactaaaac aaatgatcaa aagtgctaat gaatatctca gtaacacggc 1621 taatttcaaa atgttgaatt ttgaaatgaa cttcaaaacc tacgaagaag taagcggaac 1681 aattcctata attgatattc tggaaaacct agatttaact atttttctaa acttgagaga 1741 gttgggagat gagaatagag tttttgacga agatgtcttt gacgaagatg tcgctattgg 1801 tgatgaagat aaagagtttt tgaaacactc tttatcatcc ctatcgtata tcttatccga 1861 ctattttaat atgaagcaat attttcatga attgtcgccc acgcatttga cattagagga 1921 tcctttcgtt ttctcgccaa tgcaaaacga cttgcctacc ggttattatg aaccaatgaa 1981 accttcatcc ttgaatttag ataatgccaa ggataagaag aatgggagcc aaaatactga 2041 tatccaagag gaggaagatg aatatgagcc agacccggat agtcttattc tcttccacaa 2101 cctcatcaat caagattctg atttcaatga tctaaagttt tttaatctcg cccacgtttt 2161 taaaaaatcc tgtgatgact attttgatgt gcttaaacta gccattgagt tcgtgaatca 2221 attaattcta gaaagagaga atttgttaaa ttatgctgct agaatgatga aaaacaatat 2281 cacggaattg ctattgcgcg gggaagaagg ctatgggtcc tatgacggcg gtgaaactga 2341 aaaaagtgac acgaatgctg tttatgcaga ttcagatact aaagacaatg acgaatggcg 2401 tgacagccaa gtcaaattac cgaggtattt gcagcgcgag tatgacagtg aactgatttg 2461 gggctctaac aataggatta aaggtggttc taaacacgca ctgatctctt acttgacaga 2521 taatgaaaag aaggacctat ttttcaatat tactttttta atcactttca gaagcatctt 2581 tactacaacg gagtttttaa gctacttgat ctcgcaatat aatttggatc caccagagga 2641 tttgtgcttt gaagaataca atgaatgggt gacgaaaaag cttataccgg ttaaatgtag 2701 ggtggttgag attatgacaa cctttttcaa gcaatattgg ttcccgggct atgatgagcc 2761 cgatcttgcg accctaaatc tggattattt tgcgcaagta gcaatcaagg aaaatataac 2821 aggatctgtg gaattactaa aggaggtcaa tcagaagttt aaactaggta atatacaaga 2881 agcgactgca ccaatgaaaa cgttagatca acagatctgc caggaccatt actcgggcac 2941 tttatactct accacggaat ccattttggc cgtcgatcca gttttatttg ccactcaatt 3001 aacgatacta gagcatgaaa tttattgtga gataaccact tttgattgtt tgcaaaaaat 3061 ttggaagaac aagtatacaa aatcgtatgg ggcttcaccg ggtttgaacg agtttatcag 3121 ttttgccaat aaactgacaa atttcatatc ctactctgtt gtaaaggagg ctgataaaag 3181 taagcgcgcc aagctactct ctcattttat ttttatcgca gaatattgta ggaaattcaa 3241 taacttttct tccatgactg acatcatttc agcattatat tcttcaccaa tttatcgttt 3301 agagaaaacc tggcaggcag ttattcctca aacgagagat ctattgcagt cactgaacaa 3361 gttgatggat cccaagaaaa atttcataaa ttacagaaac gagctgaagt ctttacatag 3421 cgctccctgc gtaccgtttt tcggcgttta tttatctgat ctaaccttta ctgattccgg 3481 aaatccggat tatcttgtct tggaacatgg tttaaagggt gtccatgatg agaagaaata 3541 tataaacttc aacaaaagga gcagacttgt tgatatctta caagagatca tatatttcaa 3601 gaaaacacat tatgatttca ctaaagatcg gacggtaatt gaatgtatat caaattcatt 3661 ggaaaacatc ccccatattg agaaacaata ccaattatca ttaattattg aaccaaaacc 3721 aagaaagaaa gtcgttccga attccaattc gaataataaa tcacaagaaa aatccaggga 3781 tgaccaaacc gatgaaggaa aaacatccac taagaaagac agatttccaa aatttcaatt 3841 acataagaca aagaaaaaag ctcccaaggt ttctaagtaa cggcgccgta tgttcgattt 3901 ccttctctcg gtggattaat tattttgttt gttttctcct gttatattat ttattgatca 3961 ctatagtaaa ctatgtccgt catcaagccc gacggctgct atcccacaat gttgatcgta 4021 ttgtttgcct agtttattat atatttgctt atttatagca taccataata tttaaatgcc 4081 ctcaaatttt tggccgtagc gacatcgcga taattccaat tccctttaaa aaattgcgcc 4141 tgagtataag ttaattcagc cagttctcca aattaaaatc gcatactcct gaacctatca 4201 acagattgtc ctcgcatact tttctatacc aaggtctctt ctgaacatat attagcagtg 4261 gttaatttta aagagatcat aaagaaaatt ttgtctaaaa aagattaata taaagacaat 4321 gtcttcacta gaagtggtag atgggtgccc ctatggatac cgaccatatc cagatagtgg 4381 cacaaatgca ttaaatccat gttttatatc agtaatatcc gcctggcaag ccgtcttttt 4441 cctattgatt ggtagctatc aattgtggaa actttataag aacaataaag taccacccag 4501 atttaagaac tttcctacat taccaagtaa aatcaacagt cgacatctaa cgcatttgac 4561 caatgtttgc tttcagtcca cgcttataat ttgtgaactg gccttggtat cccaatctag 4621 cgatagggtt tatccattta tactaaagaa ggctctgtac ttgaatctcc ttttcaattt 4681 gggtatttct ctccctactc aatacttagc ttattttaaa agtacatttt caatgggcaa 4741 ccagcttttc tattacatgt ttcaaattct tctacagctc ttcttgatat tgcagaggta 4801 ctatcatggt tctagtaacg aaaggcttac tgttattagc ggacaaactg ctatgatttt 4861 agaagtgctc cttcttttca attctgtggc aatttttatt tatgatctat gcatttttga 4921 gccaattaac gaattatctg aatactacaa gaaaaatggg tggtatcccc ccgttcatgt 4981 actatcctat attacattta tctggatgaa caaactgatt gtggaaactt accgtaacaa 5041 gaaaatcaaa gatct // LOCUS ADBMLPA 101 bp ds-DNA VRL 26-JUL-1990 DEFINITION Mastadenovirus 2 R1, R2 and R3 binding sites. ACCESSION M33540 KEYWORDS . SOURCE Mastadenovirus 2 viral DNA. ORGANISM Mastadenovirus 2 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 101) AUTHORS Leong,K., Lee,W. and Berk,A.J. TITLE High-level transcription from the adenovirus major late promoter requires downstream binding sites for late-phase-specific factors JOURNAL J. Virol. 64, 51-60 (1990) STANDARD simple staff_review COMMENT Sequence-specific binding proteins are induced during the late phase of infection. These proteins interact with three regions in the first intron of the major late promoter (MLP). BASE COUNT 24 a 25 c 26 g 26 t ORIGIN 1 ccagctgttg gggtgagtac tccctctcaa aagcgggcat gacttctgcg ctaagattgt 61 cagtttccaa aaacgaggag gatttgatat tcacctggcc c // LOCUS LB3HDCBA 804 bp ds-DNA BCT 26-JUL-1990 DEFINITION Lactobacillus 30a histidine decarboxylase-B (hdcB) gene, complete cds. ACCESSION X13099 KEYWORDS histidine decarboxylase. SOURCE Lactobacillus 30a DNA. ORGANISM Lactobacillus 30a Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 804) AUTHORS Copeland,W.C., Domena,J.D. and Robertus,J.D. TITLE The molecular cloning, sequence and expression of the hdcB gene from Lactobacillus 30a JOURNAL Gene 85, 259-265 (1989) STANDARD simple staff_review FEATURES from to/span description pept 85 609 histidine decarboxylase-B (hdcB) BASE COUNT 277 a 140 c 152 g 235 t ORIGIN 1 actaatccac aggacatagt ttgaggaaga gatggtgttt actacctctt cctttaatat 61 tttgtaagtt aaggattgat tgcaatgagc aacagtaact accaagttag tttagaacga 121 attaaaaaag ttgtccctga agaactctta accaatgcat tgttagcagc tattgacaat 181 tctggtgaaa ggatgtcaca aataatagtc gataaaaaag ataacggcaa cgactattac 241 ctcaccatcc atagattctt cgtttatagc aacgaagaat tcaccgcttt tgataaagaa 301 gatgttgcag atgtcgaatt cgttaatggt acgccagatg gtgaagtaat cattacttta 361 aaggacggca aagtgttgca cccgtctcac atttgttacg gccgagcttt tgactttatc 421 caagatgtca agccaaaagt aattacaatg gcgggatatg acagcacaat tcgaggcgaa 481 tttccacaat tattagatcc agatcatgcg gaagagattg atcgattacg tcgctggatg 541 caagatggaa atattagcca ttacgaatac gatgatgcaa atccagctta tccaaaagca 601 ggaaaataaa aaaacatatt gacatatcat cagatatagg ttatgttaca atcaagcatc 661 ttaataggta atgcgcaatt tatatctttg aatatagttc cattatttat ttataaatag 721 ttactccgaa aaggactacg tacctactat acttttaaat aaatatattt cgtgatgggg 781 agcgttatta ccccggctgt cgac // LOCUS LBPREPA 3547 bp ds-DNA BCT 26-JUL-1990 DEFINITION L.plantarum repA, repB and repC genes, complete cds. ACCESSION M33531 KEYWORDS rep protein. SOURCE L.plantarum DNA. ORGANISM Lactobacillus plantarum Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 3547) AUTHORS Bates,E.E.M. and Gilbert,H.J. TITLE Characterization of a cryptic plasmid from Lactobacillus plantarum JOURNAL Gene 85, 253-258 (1989) STANDARD simple staff_review FEATURES from to/span description pept 2191 2349 repA protein pept 2406 3062 repB protein pept 570 1655 repC protein BASE COUNT 1189 a 589 c 758 g 1011 t ORIGIN 1 gatatctggt taactttgat cacattagtg atcaaattca tttctttagc cccatcaaac 61 gatcagtttg ctttatgaaa gtgaccgctt gatggggctt tttcgtttac cttttgtcaa 121 aggtaaggtg tgacgggctt gactttgggt ggcgttgtgc ggaagcgcaa tcgacacgat 181 tttgactttg aggggagtta agaggggaag cgtagcgccc cttcttacaa gtgtaaagtg 241 tggacaagag agcgtagcga tattgtctac actttacccc aattgtcatg cgactttaaa 301 tagaattatt gattaataaa agccccctga caaaagtcga agggggactt ttattttagt 361 ttgaggtttg catacctact taaaaaagta gggcagcaaa acgtcaaaca ggtatcagct 421 aatcatccga tagggtgcgc tgatacggtc ctcaaaagag agccgacaga gccgtctgca 481 agacccctcg gcggaggccc acctttacga agtaagatat agtgggttat actttacttg 541 gaagataact ccgaaatgag gtgcatacaa tgagttttgc agtggctaga atgacgaaat 601 taaaagctga taatttagtc ggcattggca atcatgacca acggaaaacg actaatcaca 661 gcaacgaaga tattgatgtt tcccgctctc acctgaatta tgatttagtg gctgggcgca 721 ctgataactt taaaacggat attgaagcct atatcaacga aaacaaagcg agtaagcggg 781 cagttcgcaa agacgctgtt ttagtcaatg agtggattat aaccagtgac aaagactttt 841 ttgagcaatt agacgaagcc gaaacccgta aatattttga aacagccaaa caatattttg 901 cagataacta tggtgacgaa aatattcgct atgcagttgt tcatatggac gagaagaccc 961 ctcacatgca tatgggcatt gtgccctttg atgatgataa aaagctctca gctaagcgta 1021 tattcaatcg tgaagcctta cagcacattc aagaggaatt accacagtac ctcaaagaaa 1081 atggctttga tgttcaacgt ggtaacaaaa ataaagagcg taagaattta tcagtacccg 1141 aatacaaagc tatgcgggaa gaattgaaaa aaatagagac cgaaaaacaa gagacacaag 1201 caaagcttgc agatacaaaa aaacagcttg atgagatcaa accacgggat accaagaaaa 1261 ttgctagtaa acccaccttg atgaataaaa ataaagtcac ggttgataaa tctgatctcg 1321 ctgatttgga acaaagggcg gtgactagcg acgcttataa ctttgaaaaa attcatctgg 1381 aagtaggaaa tcatagttta cgtaatgatt tgagtgaagc caagggccgc aactatgaac 1441 tgagaaaaga aaatgagcga ttgcaaaaac tagtaggaac gcttcaaggc attatacgaa 1501 atgttgatga gtttctacac aaaaaactag gtattaattt acctgaaaag tggctagagc 1561 gtgcaggact aaaagaaccg tctaaaaaag cccctgaaag ctcacaggaa ctcgacagac 1621 ataaatctga tgaattaggc ggtccacatc tttaaatcgc ttatacgagc ttaaaatggc 1681 gtttaagagc ttaatttacc atctcgctag attgaacgta gttaactttg tgtccgtcaa 1741 cggtaaatcg acgtaggcgt tttatagccg ctgggctatt agacgcccta ggaggcttta 1801 aggagttgat agactagcgg ataaaacact tttgcacatg caaagaaaag cacccctgct 1861 ttttttgcct gccccacggc gagtgcgggg tgagtttagc gggtgctccc gtcatttatg 1921 gggtcaagct gacacagctt gcgggtttgg gcagagccca tattttggtt tggtttgagt 1981 gggataaaaa aattgggcga aaaacatggg ggtactacga caccccccca tgtgtccatt 2041 gtccattaaa cagaacactt ttttcaagaa accttttagg ttaggggttt tcgggggggt 2101 ttgagatttt ataaaaaatg ttgtatttct aacgtatgta taatataatg atggaataga 2161 gataaaaata gtaagaaaga aggttttttg atggttgaag ttgaaaagaa aaaaattact 2221 ttgtctatac ctgttgaaac taatggaaag ctggaagaat tggcccagaa atatggcatg 2281 actaaatctg gattggttaa ttttttggtt aatcaggttg cagaagctgg aactatttat 2341 aggcaataaa aaaagcgccc tgtgcatagg acgcaatcta aaagtctgtg aggtaattat 2401 aacatatgaa aagtgaatct aaaatcgatt ggacggtacc tcgtccaaat aaaaatccca 2461 aaacaaaaca gccttataaa cgtggtcgta attggggtat tgttgtttat cctgaaagtc 2521 ttcctgaaaa ttggaaagat attatcaggc aagagcctat tgctgtcagt cccttacatg 2581 ataaagatgt taaccctgat ggagaaaaga aaaaatctca ctatcatctt gttttgaact 2641 ataaagggaa caaatctttt gaacaaattg atgaaattgc taggtcttta agggcgcctg 2701 ctcctcaaag aattagtagt ttaactggcg ctgttaggta cttgacacat atggataatc 2761 ctgaaaaata tcagtatgat aatgctgata ttgagacctt tggaggcttt gatttagaga 2821 gttgcttagc tctttctact ggcgataagc gccaagcctt acgtgacatg ttggctttta 2881 tttctgaaaa tgaaattatg catttaaaag actttgcaga ttattgcatg tctgaggaag 2941 caccagctgg ctggttcgaa cttctaactg aaaggaatac gctttttatt aaagaatata 3001 tcaagtcaaa ttggcagaaa caacagtatg ctagtaaaaa catcaataaa atgtcggatt 3061 aaaattttat tgatgttgtt gctatattat tagtgaaagg atggtttact ttatgccaac 3121 aagaaaaaat attttagatg atattcaaga acatattgac aatgaagaac gtgttttggt 3181 tactaattca agcaaaatta actagcacca cgcgtataga gtgatttaaa ataactaaca 3241 tcgtttttat ttgaatttag aagggaagag atttttatta aaaatatagg ttttaactca 3301 aattatttta aaacctggta tttttggcta ggcatattaa cggtagtggg attaatcggt 3361 gatcccattt tacactatca ttcttcaact agtccgtggt tacaaatact tattgctatt 3421 ttattatttg tagcagcatt taccaaaaaa ataaataata actgacttaa atcgcaattc 3481 actctaaact tttaacaaat ttgttatcat aattgggtaa ggtgtttgca agttaagtat 3541 ttttccc // LOCUS RATUD2A01 1088 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 1. ACCESSION M35202 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 1 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1088) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP-glucuronosyltransferase, UDPGTr-2, gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pept 349 + 1072 UDP glucuronosyltransferase-2 (UDPGTr-2) pre-msg 315 > 1088 UDP glucuronosyltransferase-2 mRNA and intron IVS 1073 > 1088 UDP glucuronosyltransferase-2 intron A signal 288 293 CAT box BASE COUNT 329 a 194 c 216 g 349 t ORIGIN 1 ctgcagtcaa cggatcttca ctgctatgta agaacattta agaaataaga gctttcatct 61 gtgattttta catgactcta acacgttata atcaacagat gatgtttgca catgagaagt 121 gattcaattt tggctgaata gaatcaggga caaaaaagac aaataaactc tgttaacctt 181 gagctcatgt tccatgcttg tatttacaca tggcgtaaca tcattgcact catctaatcg 241 gtgatggttt aaaagttata tattaatttc ttgggtgact gaactttcat aaaaaacatg 301 aatatctaca atgaacgaca gatatcaaaa gcattccatt tctgcaagat gtctatgaaa 361 cagacttcag tgtttctgtt gatacagctc atatgctact ttagacctgg agcctgtgga 421 aaagtgctag tgtggcccac agaatacagc cactggatta atataaagat aattctgaat 481 gaacttgccc agagaggtca tgaagtcacg gttcttgtat cttcggcttc cattctcatt 541 gagcctacca aggaatcttc tattaatttt gagatttact ctgtaccttt gagtaaaagt 601 gatcttgaat atagttttgc aaaatggata gatgaatgga cacgtgattt tgaaacactc 661 tcgatttgga catattattc aaaaatgcaa aaagtcttca atgaatattc tgatgtcgtt 721 gaaaatttat gcaaagcact catttggaac aagagtctta tgaaaaaact ccaaggatct 781 caatttgatg tcattctcgc agatgctgtg ggtccctgtg gtgagctgct agcagaactg 841 cttaagacac ctttagtgta cagtctccgc ttctgtcctg gatacagatg tgaaaagttc 901 agtgggggac ttccactgcc tccttcctat gtgcctgttg ttctttcaga attaagtgac 961 cgcatgacat ttgtggaaag agtgaagaat atgttgcaga tgctgtattt tgacttttgg 1021 tttcaaccat ttaaagagaa gtcctggagt cagttttaca gtgatgttct aggtaaactg 1081 tgcctttc // LOCUS RATUD2A02 373 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron A. ACCESSION M35078 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 2 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 373) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 373 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 373 UDP glucuronosyltransferase-2 intron A BASE COUNT 137 a 75 c 56 g 105 t ORIGIN About 0.1 kb after segment 1. 1 aaaatgctat agagtaactg agcagaacac tccaaaaatt actatccatg taaactgaga 61 caaagatttc tcttagtaat cactagatct actctaagtt tgtcttagta aaagaaactc 121 caagtttctc gaatgcttta atgactgtag atgcgaacac taaagagtca ttatatacca 181 ccacaactat ctgtgtagca cagaaggaaa catgttccct tatacaaatt actcacttgc 241 aaatgatgaa aaaactccaa ggagctaagt ttgatgttat cacctagaat atcacgacag 301 gttttctcac aattaaatca tatcactaga accagaaaca gtcaaggcat cttagtttct 361 tcgagttcag ctg // LOCUS RATUD2A03 380 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron A. ACCESSION M35079 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 3 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 380) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 380 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 380 UDP glucuronosyltransferase-2 intron A BASE COUNT 154 a 49 c 55 g 122 t ORIGIN About 0.1 kb after segment 2. 1 tcaaataaaa tagtacctaa attaatagga gaaagaattt aaaggttaac tatttgtgga 61 aatatccagg tgtaactttg acatatacaa ctaagttagt attacttgtc tcttctaata 121 ggcacagcac agtagtgata aaaagaaact tagtcataaa ctgcagatta tcacagtgca 181 tttcaagaat cagaaatcaa aagaatagct actaaaatgt ataaagtaga tgaaatattc 241 tacaaaagtt gatttttcta aggcattttc aagctttttt gcaaggaaca aatgttccaa 301 attcattggt gtaactttag aaaacatgta attgacaaca ttgatattat gttatacatt 361 atatcataat caaatgactt // LOCUS RATUD2A04 1435 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 2. ACCESSION M35080 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 4 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1435) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 1435 UDP glucuronosyltransferase-2 mRNA and intron pept + 51 + 199 UDP glucuronosyltransferase-2, exon 2 IVS < 1 50 UDP glucuronosyltransferase-2 intron A IVS 200 > 1435 UDP glucuronosyltransferase-2 intron B BASE COUNT 375 a 301 c 268 g 485 t 6 others ORIGIN About 0.2 kb after segment 3. 1 ccacaaaacc tcttttcacc attgagtatt tttatctgtt ttggatgcag gtagacccac 61 aacattaact gagatgatgg ggaaggcaga tatatggctc attcgaacct tctgggactt 121 ggaatttcca cacccattct tacctaattt tgactttgtt ggaggactac attgcaaacc 181 agccaaacca ctgcctaggg taacattgga ttgttttcct tgataaactg ttcgttcctt 241 tatcattctt tatttgtttt tacaaagagg atagtttatt ttaattatta atatttatct 301 ttaatctttt tttacagtcc agtaattatc cccttctgga ccaccctcgt tccatcctcc 361 tcctcccttg ctccaagagt atgtatgcca ggagcctcct gcgatggaga ggatagtgtc 421 aggggtgcag gagggaacaa agtaagactc tggtgtggct ttaaagctga cggtctcctg 481 acattctaac tctctacctg ttcagaaaca ctgatgataa cttctagaaa atcatacaaa 541 ctttcttgct ctttctcatg ataaaaggct gctggcttgg gaatcagtac ctgtaactta 601 acaacagagg attgagcaat gtggccttgg tcctatatag taggaactgt gtggctctaa 661 ctttcagcct gctagtcaga anngcagaag ggatctttcc acatgatgtc tcctccttct 721 tcttcttgta gtcctcctct actctcctgg attctcaact gggatcagac gccctgccct 781 cttctcttct gcccagctga tcgattcttt attaactaat caaggatgat ctaaattatt 841 ttatacataa cattgagacc agtgatgctt gactgtgcca aattttggac tgcaaccaga 901 tatctgggca taaaaattag cacatgaata cacagtgtaa aaaaaaaacc gtcccctaac 961 actcacctat tgttttctgc atgtgggtga gtctacatgt gtctgatggg aggcctgtgc 1021 atgtttcttt ttacaactag gtcccttnnc tg gtatataa gtttcattac taggaagtgt 1081 tagcatttaa tggtaatttt gttagatgga tgggattgtg aatttaaaac ttgccttgaa 1141 gtagattttg agtgacatag cacattttta aattttattt tgtgtttttt taaagaggac 1201 atctctctat agcttanntg tccttaacct catagcagtc cttctgcctc agtctcccat 1261 gtgctgagat tagaccagtc ttaatacctc ttctgaaaca tgatgtgtaa tatcagtgat 1321 ggagatctta ctgtgcacag ctttagatca tgatgtttag cagattgtaa cttccattca 1381 tgagaagaaa ctgcacaaac catctcattc ctgtcttact ttattgattg gaagc // LOCUS RATUD2A05 769 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron B. ACCESSION M35081 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 5 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 769) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 769 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 769 UDP glucuronosyltransferase-2 intron B BASE COUNT 293 a 116 c 117 g 243 t ORIGIN About 0.1 kb after segment 4. 1 aaatgctact tcatttgatc ttgaaggtgt gtgagctgtc attatttaat tggtacggta 61 tttctttcaa ataaacaatt aaaatagtgt tcttttcttt aaaaaaataa agaaaaaaga 121 gatcataaag aaaaaaagaa gttgcagaaa gaaaagggga caccttgaaa agtgattata 181 gcacttatta ctaagttgta aaaggtttcc tatgaaaact atctaagaag ataagtagaa 241 aagtcctaat gagggaaagg aaaaaaaaat tcttctcctt ctcatcattt tgtcctcagt 301 acttacacat cttttcagaa tacatgacca caagttaaaa gtcataacaa aaaattaaat 361 aataaattta agtagaagtt tacaagaaaa aaatgcttac atgcatatcc attaggagta 421 atttctggct aaacaccatt cacatggctc cacaggttca tagaaggttg aaaaccataa 481 ttaaaattat tagtgaagtt ttgtattgat gaacccagtc catattttat cttctgtctt 541 agcacctata ataaatttta gttccctttt tacgaccttt agttaagtgt tttacaacct 601 cttggattgt gctctgagaa gaagaaagtc tggttgctat ctaagaacaa ttaactggtg 661 acacatagga gactgataca gttctcattg cacttttcac tatcagaaaa ggaactaaaa 721 taattccact ataaaagagc ttaataatca ctgatatact tagatctct // LOCUS RATUD2A06 359 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 3. ACCESSION M35082 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 6 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 359) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 359 UDP glucuronosyltransferase-2 mRNA and intron pept + 175 + 306 UDP glucuronosyltransferase-2, exon 3 IVS < 1 174 UDP glucuronosyltransferase-2 intron B IVS 307 > 359 UDP glucuronosyltransferase-2 intron C BASE COUNT 114 a 69 c 75 g 101 t ORIGIN About 4.0 kb after segment 5. 1 gtagtatagt acaaatgcac acttaatgaa cactgggtac cgaggcaatg gatacactgg 61 tctcccaaaa taattccagg aattacataa tttcctctgg taagtttgtc tcggtagttg 121 agacaatgct tcccatgcaa ccattcatct gtgatgtcat aaccatcttc ataggaaatg 181 gaagaatttg ttcagagctc tggagaacat ggtgtagtgg tgttttctct gggatcaatg 241 gttaaaaacc tgactgaaga aaaagccaat gtagttgctt ctgctcttgc ccaaattcca 301 cagaaggtaa gataaaatgt ccacagagat ggcaaatgta ttataagtca tctgaaccc // LOCUS RATUD2A07 609 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exons 4 and 5. ACCESSION M35083 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 7 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 609) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pept + 69 156 UDP glucuronosyltransferase-2, exon 4 292 + 511 UDP glucuronosyltransferase-2, exon 5 pre-msg < 1 > 609 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 68 UDP glucuronosyltransferase-2 intron C IVS 157 291 UDP glucuronosyltransferase-2 intron D IVS 512 > 609 UDP glucuronosyltransferase-2 intron E BASE COUNT 170 a 119 c 115 g 205 t ORIGIN About 0.6 kb after segment 6. 1 ccaggaacaa attttaccaa agccttggaa tttctgtaat taaataaggc attgtctgtg 61 tgtaacaggt tgtatggaga tttgatggta agaaaccaga taccttagga tctaacactc 121 ggctgtacaa gtggatcccc cagaatgacc ttcttggtaa ggcaaagttt aactacaagt 181 ttgtggctat agtaacacac tttcttgaga atagcacact tctgagtctt catattttcc 241 tctcttaaat attattcggt caataattat gtcaacttct tctcattgca ggtcatccaa 301 aaaccaaagc ttttgtagct catggtggaa caaatggcat ctatgaggca atctaccatg 361 gcattcctat tgttggtatt cccttgtttg cagatcaacc ggataacatt aatcacatgg 421 tagccaaagg agctgctgtt agagttgact tcagcatact gtcaactaca ggccttctca 481 ctgccttgaa gattgtcatg aatgaccctt cgtgagtctg tttgtttgtt gaagttgttt 541 tttccaagga aggctgtttc tttttctttt ttgaaacata atttttacta tataactaca 601 agagctgcc // LOCUS RATUD2A08 316 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron E. ACCESSION M35084 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 8 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 316) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 316 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 316 UDP glucuronosyltransferase-2 intron E BASE COUNT 112 a 39 c 42 g 123 t ORIGIN About 0.1 kb after segment 7. 1 ctcatagata tttgcttgct tcagcctcct gggtgctggg attagaaata tctgaattta 61 tatttgctgt gaataactat tattttaaaa atattgacag attcagatga tcatcagatt 121 gattttatcc tatttgaagg agggagaata atttcgaaaa attatgtttt tgcatatctg 181 aaatatgtgc ttttttaaca ataaagttac tctaaatttc taattgaatc aattagacat 241 gattattctc aaactattct atataaagaa ataatattac aaatatttat ctattataac 301 aaaggacaca ttttct // LOCUS RATUD2A09 487 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, intron E. ACCESSION M35085 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 9 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 487) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pre-msg < 1 > 487 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 > 487 UDP glucuronosyltransferase-2 intron E BASE COUNT 196 a 89 c 80 g 122 t ORIGIN About 0.05 kb after segment 8. 1 gaataagaga cagtattaaa ttcatacaaa tacctggaga acactattgt aatttcaagg 61 tttgctagaa gacaaatgta cctaatgaga aggtcctgag tcaaaaataa ctggagaaag 121 tgctgttcgt tcctacatac acagtcttct agtccaggaa cagaattaaa ttgttttcat 181 tgtggtgaat tcttgtggaa ctgttgtaca aagaagagtc ataaacaaca aagtgttttt 241 agaagaagaa cctagttata aacagataca taggagagga aaaaaaacta gagaggagat 301 atcgaacatg acatatgacc tggaaaaagt tctatggcta cttcccttct tggtcttata 361 tcatgagtta catgttacac aaaaacacac acacacaaac aaacacacac aaacatacac 421 acacacaaac atacacacac acaaacatac aaacacatac acacaagttt gtgtgtctta 481 ctagttt // LOCUS RATUD2A10 895 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-2 (UDPGTr-2) gene, exon 6. ACCESSION M35086 J05482 KEYWORDS UDP glucuronosyltransferase-2. SEGMENT 10 of 10 SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pUDPGTr-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 895) AUTHORS Mackenzie,P.I. and Rodbourn,L. TITLE Organization of the rat UDP glucuronosyltransferase, UDPGTr-2 gene and characterization of its promoter JOURNAL J. Biol. Chem. 265, 11328-11332 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pept + 212 488 UDP glucuronosyltransferase-2, exon 6 pre-msg < 1 836 UDP glucuronosyltransferase-2 mRNA and intron IVS < 1 211 UDP glucuronosyltransferase-2 intron E signal 808 812 poly-A signal BASE COUNT 248 a 206 c 163 g 278 t ORIGIN About 0.15 kb after segment 9. 1 gttcataatt accctgtgct aaacaagact gtttcactgt ctttcctgtc actcaactct 61 cctctgccac cacctgaaac aaaacacttg agtgggaagt atacatgatt tattttaagt 121 tgcttgtgag acttttccct aaaacaacaa atgttgttaa gtcatcaaat tgcctcctct 181 ttaatcttag ttgtatacat tgtcccttca gctataagga gaatgccatg agattatcca 241 gaatccacca tgatcagcca gtgaagcccc tggaccgagc cgtcttctgg atcgagtatg 301 tcatgcgtca caaaggagcc aagcacctcc gctcaactct gcatgacctt agctggttcc 361 agtaccactc tctggatgtc attgggttcc tattgctctg tgtggtaggt gtggtattca 421 tcatcacaaa attctgcctc ttttgttgcc gtaagactgc taacatggga aagaagaaga 481 aagagtagca tcataaaggc tgaagcagag ccctgagaga tgagcctctg ccagctgctt 541 ccagcaggaa cctgttgtca tgccagtgcc ttccctctaa aagaagacag cgttgggacc 601 tcattgaaca tggctccaat gaattcacta tgttctgaag acatgcaaga tttcatgcca 661 aatatatatt cagtgctaaa aaaacaaaat cctgtgttca gtttagaatg ttttgatgta 721 gctgagaagc tttgcccaac aacaataact gaagctactg tagttcataa agttcacatg 781 gctttatagc ctttgcaaaa catatctata aatcaattac tttttgaaaa tacccagcct 841 gctttgtctt catttagtag actatttttc tctccttctt tcttttttct tcttt // LOCUS RATUDPA 1858 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-5 (UDPGTr-5) mRNA, complete cds. ACCESSION M33746 J05440 KEYWORDS UDP glucuronosyltransferase-5. SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone UDPGTr-5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1858) AUTHORS Mackenzie,P.I. TITLE The cDNA sequence and expression of a variant 17B-hydroxysteroid UDP glucuronosyltransferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pept 43 1635 UDP glucuronosyltransferase-5 precursor (EC 2.4.1.17) sigp 43 111 UDP glucuronosyltransferase-5 signal peptide (put.) matp 112 1632 UDP glucuronosyltransferase-5 mRNA < 1 1858 UDP glucuronosyltransferase-5 mRNA BASE COUNT 559 a 365 c 387 g 547 t ORIGIN 1 agaaaggaac acagtgtgaa cagaaggatt ttgattttca aaatgccagg aaaatggatt 61 tttgctctgc tcttgctgca gataagtttc tgcctcagat ctgcgaaatg tgggaaggtg 121 ttggtgtggc cgatggaatt cagtcactgg atgaatataa aaacaatact ggatgaactt 181 gtacagaggg gccatgaagt cactgttctg aaaccttcag cttactatgt tcttgatcca 241 aaaaaatcgc cagaccttaa gtttgaaact tttcctacat ctgtcagtaa agatgaactg 301 gaaaaatatt tcataaaact tgctgatgcg tggacttatg agttgcaaag agatacatgt 361 ttgtcttttt ctcctttact acaaaatatg atggatgaat tttctgatta ttatctaagt 421 gtttgtaaag acgccgtttc aaacaagcag ctcatggcaa aactacagga atccaagttt 481 gatgttcttt tgtcagatcc tgtggctgcc tgtggggagc tgatagccga agtgctccac 541 attccttttc tgtacagtct tcgtgcctct ccaggccata aaattgaaaa gtccagtgga 601 agatttatac tacctccctc ttatgtgcct gtaattttgt caggattggg tggccaaatg 661 acattcatag acagggttaa aaatatgata tgtatgcttt attttgactt ttggttccat 721 atgtttaatg ccaagaattg ggatccattt tatactgaga ttttgggaag gcccaccacc 781 ttagctgaga caatgggcaa agcagaaatg tggctcatta gatcctactg ggatttggag 841 tttccccacc caacattacc aaatgttgac tacattggag gactccaatg caaacctgct 901 aaacccttgc ccaaggatat agaagacttt gtccagagct ctggagagca tggtgtggtg 961 gtgttttctc tggggtcaat ggtcagcagc atgacagaag aaaaggccaa cgcaattgca 1021 tgggcccttg cccagattcc acaaaaggtt ctttggaaat ttgatggcaa aatcccagca 1081 actttaggac ccaataccag agtctacaag tggcttcccc agaatgacct ccttggtcat 1141 ccaaaaacca aagcctttgt aactcatggt ggagccaatg gtgtctatga ggccatctat 1201 catggaatcc ctatgattgg cattcctatg tttggagaac aacatgataa cattgcccac 1261 atggtggcca aaggagcagc tgttacactg aatatcagga caatgtcaaa gtcagatttg 1321 ttcaatgcac ttaaggaagt aataaacaat cctttctata aaaaaaatgc tatgtggctg 1381 tcaaccattc accatgacca acctatgaaa cccctggaca aggctatctt ctggattgag 1441 tatgtcatgc gccacaaaag agccaagcac ctgagaccac ttggacataa ccttccctgg 1501 taccagtacc actctctgga tgtgattgga ttcctgctag cctgtttggc agtcattgca 1561 gcccttgctg taaaatgctt cttgttcatt taccgattct ttgcaaagaa gcaaaagaaa 1621 atgaagaatg agtagagctc gttgacaatg cactacagga atgaaattta agcctcattc 1681 taatttatga atcactttct taacacttcc tgattttttt ttgtggaggc agatcatcat 1741 tgtaagaaga catatagctc tgtgaatatt gatatgttat caaaatttta aaatcactta 1801 atgtaaaaaa gttgcattgt agaaaaattg aggaaaataa agtttacttg atagtctt // LOCUS RATUDPB 2216 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Rat UDP glucuronosyltransferase-21 (UDPGTr-21) mRNA, 3' end. ACCESSION M33747 J05440 KEYWORDS UDP glucuronosyltransferase-21. SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone UDPGTr-21. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2216) AUTHORS Mackenzie,P.I. TITLE The cDNA sequence and expression of a variant 17B-hydroxysteroid UDP glucuronosyltransferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.I.Mackenzie, 13-APR-1990. FEATURES from to/span description pept < 1 463 UDP glucuronosyltransferase-21 (AA at 2) (EC 2.4.1.17) mRNA < 1 2216 UDP glucuronosyltransferase-21 nRNA BASE COUNT 738 a 386 c 386 g 706 t ORIGIN 1 agccaatggt gtctatgagg ccatctatca tggaatccct atgattggca ttcctatgtt 61 gggagaacaa catgataaca ttgcccacat ggtggccaaa ggagcagctg ttacactgaa 121 tatcaggaca atgtcaaagt cagatttgtt caatgcactt aaggaagtaa taaacaatcc 181 tttctataaa aaaaatgcta cgtggctgtc aaccattcac catgaccaac ctatgaaacc 241 cctggacaag gctatcttct ggattgagta tgtcatgcgc cacaaaagag ccaagcacct 301 gagaccactt ggacataacc ttccctggta ccagtaccac tctctggatg tgattggatt 361 cctgctagcc tgtttggcag tcattgcagc ccttgctgta aaatgcttct tgttcattta 421 ccgattcttt gcaaagaagc aaaagaaaat gaagaatgag tagagctcgt tgacaatgca 481 ctacaggaat gaaatttaag cctcattcta atttatgaat cactttctta acatttcctg 541 attttttttt gtggaggcag atcatcattg taagaagaca tatagctctg tgaatattga 601 tatgttatca aaattttaaa atcacttaat gtaaaaaagt tgcattgtag aaaaattgag 661 gaaaataaag tttacttgat agtcttaaaa atcacagtat taaccttaca atatttgaat 721 attgtccatt gacctctttc tctgagactg aatctgtagc tttcatacaa ataagtagct 781 aacttgtata ctataaatat ggacatataa atagtttttt ctgtaatagt cttaattatt 841 tgtagtcggg gataaagtgt ggtttggttt ggatattcat ttcaaagggt aggaatctgt 901 tggctatttt gttcctgtaa caaaatgtgc tgaccaaaag catctccagg gaaaagcaga 961 gcagtttatt ttgagttgtg cttacagatc ctgagaacgc aggatagata ggaaggcagg 1021 gcagcagtca gccagatgac aaaactctct cattacatct taaccacaca tagaaagcac 1081 aaagagtgag caaaaagtgt gactatggtg tgaactttca aagcttgctc cagtgatata 1141 tttcctccaa aaagatttaa cccctttaaa taatattcct gtacccctgg agttgggagt 1201 ttagctcagt ggtagagcat ttgcctacca aacacaaggc tctgtgttca gtcctcagct 1261 ccgggggaaa aaaaaaagaa agattccata acctcaaaca gcattacaaa ttttggaaaa 1321 tgtgctaaaa ttcatcagcc tatctgaaac attttacatt gaatccataa caggaaataa 1381 acctgtttct taattcttat tttttagcat accattctaa tactccaagt tctaacacag 1441 cacttgtacc tcttcaatgt aatttaacta tgatcatgag gcataatgtt cattggaaat 1501 gaagcatatg aacaggaaac aaataaaagt cctaactaaa gtaaacttag ctttgagatt 1561 ggctattaca agtctggttg taattccact aatgctgcca tatgctgtga ggaatgttat 1621 aaaagagcta tgtaactatt atgacagttg tagcttttag cattgaaata catagatatt 1681 aatataaaag taagtgtata atatgatgct taaatgtgta acctaatatt ttagaataaa 1741 tttaattagt ggaaacattc tagacaggaa cagtaaatat atccaacatc attattcttt 1801 gatttaaaaa atgcaatttg gaggttcttc cctgcaaaag actatctctt tcccactctt 1861 aacattactt aggtgcttat tacagtttta tgttgagttg gggaaagggt aaaattgacc 1921 cctttccata ttagcatgaa tattggtatc atctttattg agatcttgtt taggaaccca 1981 ttatgagact tcaggagtat aactttcata atgtttgtaa tagatgcaac tttacagcag 2041 acaacttgat ccttctggcc tcttaaatct ttccatcctc tattatgtaa tgttttgttg 2101 atagttactt cagtatttga cacaagattc aataatttta tgcctatggg ttccatcaaa 2161 catcatgact ctatatatat gtaaatccaa aataagaaat aaaaaatagt gtatct // LOCUS BCEHEMOL 280 bp ds-DNA BCT 26-JUL-1990 DEFINITION B.cereus hemolysin gene, partial cds. ACCESSION M35411 KEYWORDS hemolysin. SOURCE B.cereus DNA. ORGANISM Bacillus cereus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 280) AUTHORS Gilmore,M.S., Gilmore,K.S. and Goebel,W. TITLE A new strategy for ordered DNA sequencing based on a novel method for the rapid purification of near-milligram quantities of a cloned restriction fragment JOURNAL Gene Anal. Tech. 2, 108-114 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 280 hemolysin (AA at 2) BASE COUNT 99 a 43 c 51 g 87 t ORIGIN 1 gaattctcat ttatggattg taaaccgtgc aattgatatt atgtctcgta atacaacact 61 tgtaaaacaa gatcgagttg cactattaaa tgaatggcgt actgagttag agaacggtat 121 ttatgctgct gactatgaaa atccttatta tgataatagc acatttgctt cacatttcta 181 tgaccctgac aatgggaaaa cttatattcc gtatgcaaag caggcaaagg aaactggagc 241 taaatatttt aaattagctg gtgagtctta caaaaataaa // LOCUS BPEFHAA 164 bp ds-DNA BCT 26-JUL-1990 DEFINITION B.pertussis filamentous hemagglutinin antigen gene, partial cds. ACCESSION M35274 KEYWORDS filamentous hemagglutinin antigen. SOURCE B.pertussis DNA, clone lambda-FHA15. ORGANISM Bordetella pertussis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 164) AUTHORS Mattei,D., Pichot,F., Bellalou,J., Mercereau-Puijalon,O. and Ullmann,A. TITLE Molecular cloning of a coding sequence of Bordetella pertussis filamentous hemagglutinin gene JOURNAL FEMS Microbiol. Lett. 37, 73-77 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 164 filamentous hemagglutinin antigen (AA at 1) BASE COUNT 38 a 45 c 60 g 21 t ORIGIN 1 gaattccaca tgcacctgga tgcgccgcgc atcgagaaca ccgcgaaact gacgcggcga 61 ggtgcaacgc aaaggcgtgc aggacgtcgg gggaggcgag cacggccgct ggacgtatcg 121 gctatgtcaa ctactggttg cgcgcgcatg gaagaaggcg ggca // LOCUS BPEFHAB 165 bp ds-DNA BCT 26-JUL-1990 DEFINITION B.pertussis filamentous hemagglutinin antigen gene, partial cds. ACCESSION M35275 KEYWORDS filamentous hemagglutinin antigen. SOURCE B.pertussis DNA, clone lambda-FHA15. ORGANISM Bordetella pertussis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 165) AUTHORS Mattei,D., Pichot,F., Bellalou,J., Mercereau-Puijalon,O. and Ullmann,A. TITLE Molecular cloning of a coding sequence of Bordetella pertussis filamentous hemagglutinin gene JOURNAL FEMS Microbiol. Lett. 37, 73-77 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 165 filamentous hemagglutinin antigen (AA at 1) BASE COUNT 30 a 54 c 55 g 26 t ORIGIN 1 gaattcggac cagcctggcc cgagcgctgc atgccgcgcg ggaaggccca cacagttggt 61 cccgacactg ccactttccg agtcccatcg caacgggcgg tgatccactc gtcgttggcg 121 cgtgatagac agcgcgtgca tgcgagagcg catgcagcag gctgg // LOCUS CRECYCA 662 bp ss-mRNA PLN 26-JUL-1990 DEFINITION C.reinhardtii mitochondrial apocytochrome c (cyc) mRNA, complete cds. ACCESSION M35173 KEYWORDS apocytochrome c; cytochrome c apoprotein. SOURCE C.reinhardtii, cDNA to mRNA, clone C321. ORGANISM Chlamydomonas reinhardtii Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae. REFERENCE 1 (bases 1 to 662) AUTHORS Amati,B.B., Goldschmidt-Clermont,M., Wallace,C.J.A. and Rochaix,J.-D. TITLE cDNA and deduced amino acid sequences of cytochrome c from Chlamydomonas reinhardtii: Unexpected functional and phylogenetic implications JOURNAL J. Mol. Evol. 28, 151-160 (1988) STANDARD simple staff_review FEATURES from to/span description pept 42 380 apocytochrome c (cyc) BASE COUNT 147 a 176 c 200 g 139 t ORIGIN 1 ccgaaccaaa acctttcctg tgacccttct atctgcttaa aatgtcgacc ttcgctgagg 61 cccccgctgg cgaccttgct cgcggcgaga agattttcaa gaccaagtgc gcgcaatgcc 121 acgttgctga gaagggcggc ggccacaagc agggccccaa cctgggcggt ctgttcggcc 181 gtgtctcggg cactgctgcc ggcttcgcat actcgaaggc gaacaaggag gctgccgtga 241 cctggggcga gagcactctc tacgagtacc tgctgaaccc caagaagtac atgcctggca 301 acaagatggt gttcgctggc ctgaagaagc ccgaggagcg cgccgatctg attgcctacc 361 tgaagcaggc gactgcttaa actgcgcgcg gcttagcaag cggcttcatt cattaggcag 421 aagcgggtct caagagcggg atagggttgc atctgggcgc ggcgtgtgtt cgcttcagaa 481 cgtcccacca gatgcaacag gcggatgtgt tacgagtgtc gagtgtgtac tgatgatggt 541 gtgcatgtgt aacggcgaca tacggatgga atagacatat cgtcttgaag actgtctcat 601 aggcagagac atctgctcac aggcaactta ttatgtctgc catgggcggt cgtaaagaat 661 tc // LOCUS ECOABC 1993 bp ds-DNA SYN 26-JUL-1990 DEFINITION Synthetic plasmid (for E.coli) DNA. ACCESSION M34519 KEYWORDS b-galactosidase; b-lactamase; bla gene; lacZ gene; promoter. SOURCE Synthetic DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 1993) AUTHORS Hayden,M.A., Shallcross,M.A., Stotland,E. and Mandecki,W. TITLE A totally synthetic plasmid for general cloning, gene expression and mutagenesis in Escherichia coli JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Mandecki, 22-MAY-1990. Author address:W.Mandecki Abbott Laboratories Corporate Molecular Biology D93D Abbott Park, IL 60064 FEATURES from to/span description pept 119 301 b-galactosidase pept 438 1298 b-lactamase site 81 327 lacZ fragment mRNA site 136 137 SmaI site for introduction of multicloning signal 301 336 trpA terminator signal 337 402 bla promoter P3 mRNA 403 1329 bla mRNA signal 1299 1342 phage fd terminator signal 1343 1408 RNAII promoter mRNA 1409 1962 RNAII mRNA signal 1415 1440 RNAI terminator signal 1518 1555 RNAI promoter mRNA 1422 1517 RNAI mRNA site 1961 1962 RNaseH cleavage site BASE COUNT 524 a 475 c 500 g 494 t ORIGIN 1 gaattgatta atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctg 61 ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat 121 gactatgatt acgcccgggc ttgccgtcgt tttacagcga cgagactggg aaaatcctgg 181 cgttacccaa cttaatcgcc ttgccgcaca cccccctttc gccagttggc gtaatagcga 241 agaagcccgc accgaccgcc cttcccaaca gttgcgtagt ctgaatggcg aatggcgtta 301 aactagtagc ccgcctaatg agcgggcttt tttttaattc ccctatttgt ttatttttct 361 aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat 421 attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg 481 cggcattttg ccttcctgtt tttgctcacc cagaaacgct cgtgaaagta aaagacgcag 541 aggaccaatt gggggcacga gtgggataca tagaactgga cttgaatagc ggtaaaatcc 601 ttgagagttt tcgccctgaa gagcgttttc caatgatgag cactttcaaa gttctgctat 661 gtggagcagt attatcccgt gtagatgcgg ggcaagagca actcggacga cgaatacact 721 attcgcagaa tgacttggtt gaatactccc cagtgacaga aaagcacctt acggacggaa 781 tgacggtaag agaattatgt agtgccgcca taacgatgag tgataacact gcggcgaact 841 tacttctgac aaccatcggt ggaccgaagg aattaaccgc ttttttgcac aatatgggag 901 accatgtaac tcgccttgac cgttgggaac cagaactgaa tgaagccata ccaaacgacg 961 agcgagacac cacaatgcct gcggcaatgg caacaacatt acgcaaacta ttaactggcg 1021 aactacttac tctggcttca cggcaacaat taatagactg gcttgaagcg gataaagttg 1081 caggaccact actgcgttcg gcacttcctg ctggctggtt tattgctgat aaatctgggg 1141 caggagagcg tggttcacgg ggtatcattg ccgcacttgg accagatggt aagccttccc 1201 gtatcgtagt tatctacacg acgggtagtc aggcaactat ggacgaacga aatagacaga 1261 ttgctgaaat aggggcttca ctgattaagc attggtaaac cgatacaatt aaaggctcct 1321 tttggagcct ttttttttgg acggaccgag tagaaaagat caaaggatct tcttgagatc 1381 ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg 1441 tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag 1501 cgcagatacc aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact 1561 ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg 1621 gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc 1681 ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg 1741 aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggacaaagg 1801 cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag 1861 ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc 1921 gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct 1981 ttttacggtt cct // LOCUS HS1PROM 591 bp ds-DNA VRL 26-JUL-1990 DEFINITION Herpes simplex virus type 1 joint promoter. ACCESSION M34532 KEYWORDS promoter. SOURCE Herpes simplex virus type 1 (strain KOS) DNA, clone pRAB6. ORGANISM Herpes simplex virus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 591) AUTHORS Bohenzky,R.A., Papavassiliou,A.P., Gelman,I.H. and Silverstein,S. TITLE Identification of novel transcripts mapping to the joint region of Herpes simplex virus type 1 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Silverstein, 23-MAY-1990. Author address:S.Silverstein Dept. of Microbiology Columbia University 701 W. 168th ST. N.Y., N.Y. 10032 FEATURES from to/span description binding 134 139 CTF binding site binding 290 295 Sp1 binding site binding 508 513 Sp1 binding site binding 387 392 Sp1 binding site binding 480 485 Sp1 binding site binding 492 497 Sp1 binding site site 298 306 Oct1/a-TIF site site 327 330 a4 enhancer site 366 372 E4TF1 site signal 528 533 TATA box BASE COUNT 103 a 193 c 204 g 91 t ORIGIN Map position 0.794-0.798. 1 gcatgcccct cccgccgacg caacaggggc ttggcctgcg tcggtgcccc ggggcttccc 61 gccttcccga agaaactcat taccataccc ggaaccccag gggaccaatg cgggttcatt 121 gagcgacccg cgggccaatg cgcgaggggc cgtgtgttcc gccaaaaaag caattaacat 181 aacccggaac cccaggggag tggttacgcg cggcgcggga ggcggggaat accggggttg 241 cccattaagg gccgcgggaa ttgccggaag cgggtaatgt cggccggggc cgcccattaa 301 tgagtttcta attaccatac cgggaagcgg aacaaggcct ctgcaagttt ttaattacca 361 taccgggaag tgggcgcccg cccagtgggc gggagttacc gcccagtggg ccggcccgac 421 gactcggcgg acgctggttg gccgggcccc gccgcgctgg cggccgccga ttggccagtc 481 ccgccctccg agggcggccc gcctcggggg cgggccggct ccaagcgtat atatgcgcgg 541 ctcctgccat cgtctctccg gagagcggct tggtgcggac ctgcagccaa g // LOCUS MZEHETRO 184 bp ds-DNA PLN 26-JUL-1990 DEFINITION Corn heterochromatin repetitive DNA. ACCESSION M35408 KEYWORDS . SOURCE Corn knob heterochromatin DNA, clone pZm4.25. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 184) AUTHORS Peacock,W.J., Dennis,E.S., Rhoades,M.M. and Pryor,A.J. TITLE Highly repeated DNA sequence limited to knob heterochromatin in maize JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 4490-4494 (1981) STANDARD simple staff_review BASE COUNT 58 a 41 c 42 g 43 t ORIGIN 1 ggccacacaa cccccatttt tgtcgaaaat agccatgaac gaccattttc aataataccg 61 aaggctaaca cctacggatt tttgaccaag aaatggtctc caccagaaat ccaagaatgt 121 gatctatggc aaggaaacat atgtggggtg aggtgtatga gcgtctggtc gatgatcaat 181 ggcc // LOCUS RATRSB1 170 bp ds-DNA ROD 26-JUL-1990 DEFINITION Rat B1 repetitive sequence. ACCESSION M35409 KEYWORDS B1 repetitive sequence. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 170) AUTHORS Blin,N., Weber,T. and Alonso,A. TITLE Cross-reaction of snRNA and an Alu I-like sequence from rat with DNAs from different eucaryotic species JOURNAL Nucleic Acids Res. 11, 1375-1388 (1983) STANDARD simple staff_entry BASE COUNT 57 a 29 c 44 g 40 t ORIGIN 1 aaaaaaaagc aaatgacagc tgtgtgtggt ttcatatgtg tttaatccag cactcaggag 61 gcagaggtaa atggatctct gtgagttcga gtccagtctg gctacaaagc aagttctaga 121 gcagccaggg ctgttacaca gagaaactct gtcttggaag ataaaaaaga // LOCUS SHFINV 261 bp ds-DNA BCT 26-JUL-1990 DEFINITION Plasmid pINV (from S.flexneri) RepA gene, 5' end. ACCESSION M35403 KEYWORDS . SOURCE Plasmid pWR110 (from S.flexneri 5) DNA. ORGANISM Shigella flexneri Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 261) AUTHORS Silva,R.M., Saadi,S. and Maas,W.K. TITLE A basic replicon of virulence-associated plasmids of Shigella spp. and enteroinvasive Escherichia coli is homologous with a basic replicon in plasmids of IncF groups JOURNAL Infect. Immun. 56, 836-842 (1988) STANDARD simple staff_review FEATURES from to/span description pept 256 > 261 repA protein mRNA 171 82 (c) inc mRNA BASE COUNT 70 a 54 c 68 g 69 t ORIGIN 1 gatcgtttaa ggaattttat ggctggccac gccttaaggt ggcagggaac tggttctgat 61 gtggatgtac aggagccaga aaagcaaaaa ccccgataat cttctttaac tttggcgagt 121 cagaaagatt accggggccc acttaaaccg tatagccaac aatcaagcta tgcggggagt 181 atagttatat gcccggaaaa gttcaagact tctttctgtg ctcgctcctt ctgcgcattg 241 taagtgcagg atggtgtgac t // LOCUS YSYPSKLA 598 bp ds-DNA PLN 26-JUL-1990 DEFINITION S.kluyveri plasmid pSKL left-end inverted terminal repeat. ACCESSION M35319 KEYWORDS . SOURCE S.kluyveri plasmid pSKL DNA. ORGANISM Saccharomyces kluyveri Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 598) AUTHORS Kitada,K. and Hishinuma,F. TITLE A new linear DNA plasmid isolated from the yeast Saccharomyces kluyveri JOURNAL Mol. Gen. Genet. 206, 377-381 (1987) STANDARD simple staff_review FEATURES from to/span description BASE COUNT 247 a 18 c 206 g 127 t ORIGIN 1 aaaaggtata gatatagata tattttttat gggtttggaa gggggaagtg gaagaatgta 61 tcgtgtaaaa aaagagcaaa aaaaaaatta gatgagagaa ggggaaaaga ggggagtgta 121 tcatgtgaaa aaacgcgtca aaatgaagag aagggaaaaa ggggagagtg tatcgtgggg 181 aaagtgaatt ttgaagaaga gaaggggaaa agaggggagt gtatcgtcta agaagggggt 241 attataagag aaggggatat tggtagagtg tattgaatgt ggcttagcaa aaatagaaaa 301 agggtaaaaa atgggggata aaaaaaagaa aaaaacggta ttaaggggag aaggggaaaa 361 gggtagagtg tatcgtgcaa aaagtgagtt caaaatgaag agaaggggaa aagggtagag 421 tgtatcgtgg gggaaagtga gtttaaatga agagaagggg aaaagggtag agtgtatcgt 481 gggggaaagt gagtttaaat gaagagaagg gaaaaagggg gagtgtatcg tataaaaagt 541 gaatatattt tatttgatgg gattaagtat tgaaaatgga aatggatgat aggttgtt // LOCUS YSYPSKLB 117 bp ds-DNA PLN 26-JUL-1990 DEFINITION S.kluyveri plasmid pSKL right-end DNA. ACCESSION M35320 KEYWORDS . SOURCE S.kluyveri plasmid pSKL DNA. ORGANISM Saccharomyces kluyveri Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 117) AUTHORS Kitada,K. and Hishinuma,F. TITLE A new linear DNA plasmid isolated from the yeast Saccharomyces kluyveri JOURNAL Mol. Gen. Genet. 206, 377-381 (1987) STANDARD simple staff_review BASE COUNT 35 a 5 c 18 g 59 t ORIGIN 1 caaaaagtga gattaggggg agaatatatt tattatgtta aatataaggt agttttttta 61 taatttattt aatttatttt gtttgtattt tagcttcttt aattagtctg tattctt // LOCUS XELTRH 1442 bp ss-mRNA VRT 26-JUL-1990 DEFINITION X.laevis thyrotropin releasing hormone (TRH) mRNA, complete cds. ACCESSION M34699 K00931 J05514 KEYWORDS thyrotropin releasing hormone. SOURCE X.laevis skin, cDNA to mRNA, clone L4 and 8/136. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 478) AUTHORS Richter,K., Kawashima,E., Egger,R. and Kreil,G. TITLE Biosynthesis of thyrotropin releasing hormone in the skin of Xenopus laevis: Partial sequence of the precursor deduced from cloned cDNA JOURNAL EMBO J. 3, 617-621 (1984) STANDARD full staff_review REFERENCE 2 (bases 15 to 1442) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. TITLE Two precursors of thyrotropin releasing hormone from skin of Xenopus laevis: Each contains seven copies of end-product JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.Kreil, 18-MAY-1990. FEATURES from to/span description pept 110 793 thyrotropin releasing hormone precursor matp 332 340 thyrotropin releasing hormone copy 1 matp 374 382 thyrotropin releasing hormone copy 2 matp 428 436 thyrotropin releasing hormone copy 3 matp 470 478 thyrotropin releasing hormone copy 4 mRNA < 1 1442 TRH mRNA conflict 139 139 t in [2]; c in [1] conflict 214 216 tct in [2]; ctc in [1] conflict 319 319 g in [2]; t in [1] BASE COUNT 460 a 286 c 334 g 362 t ORIGIN 1 agcacagagc agcacaagga cacactctgc atattgtgct gccggacaag gaggtgacag 61 ccagtcaggc tgagacaaag gaacttccag acctctgaca gcaggaaaga tggtgtctgt 121 ctggtggttg ctgcttcttg gtacaaccgt atctcacatg gtgcacacac aagagcagcc 181 tttactggag gaggacacag caccattaga tgatctggat gttcttgaga aagccaaagg 241 tatcctgatc cgcagtatcc tggagggatt tcaagaaggg caacaaaaca atagagatct 301 accagatgca atggaaatga tatctaagcg ccagcaccca gggaaacgat tccaggagga 361 gatagaaaag agacaacacc ctggaaagag ggatctggaa gatctgaatc tagagctttc 421 caaaaggcaa caccccggaa gaagatttgt ggatgatgta gagaagaggc aacatccagg 481 aaagagagaa gagggtgact ggagtaggag gtatctgaca gatgactcac gttatttgga 541 cctcctttct gatgtttcca ggagacagca cccaggcaaa agagttccag ccccattgtt 601 tacaaaacgt caacacccag gtaagagagt gacagaagaa gagggtgata ctgaatttga 661 aaactcgaag gaagtgggga agcgccagca tccaggaaag agatatgacc cttgtgaagg 721 ccctaatgcc tacaactgta actcaggaaa cattctaccg gattctgtag aagaattgag 781 ttttgggctt taagctgccc agccccttta ttagttccat ctgaccctaa atgattccca 841 atgaacacaa ctttctataa ttgttaaata acattgtatt aagtatcata catttctgga 901 aagcaagcag ctcttagaac acttcttcgc tttaaaaggc acctggggca taagagtatt 961 aagcttcaga cagtaacctg cccaccacag ggagggattc aacaatcaca attggctgag 1021 tgttcctttc ccttgtttgg cagtgagatc agataataaa tataagatgg ccaggaaagt 1081 ggactctttc ttttctgaaa atttgcaagt aacaccaaaa tataataatt tgcacactca 1141 gtagtattaa cgtgaagatc tcaagaaggt tataaattct tggtgatctg ctcaaagcat 1201 ttaattcata gttgcttcca tggtttgatg gggaatgcac attctaaatt gcttattgct 1261 aattagcgct tgccacacag ttctggtggt agatcttgat gaggcatatt caataaaagt 1321 agagcccata gtaaaatttg tgccccgtca gctttaagga tcctctgtaa gcaatatgtg 1381 ttgtgagggc cacttgtttc taaagtaata ttttcatttt aataaatatg tctactcaaa 1441 tg // LOCUS XELTRHA 2955 bp ss-mRNA VRT 26-JUL-1990 DEFINITION X.laevis thyrotropin releasing hormone mRNA. ACCESSION M34698 J05514 KEYWORDS thyrotropin releasing hormone. SOURCE X.laevis, cDNA to mRNA, clone C6. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2955) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. TITLE Two precursors of thyrotropin releasing hormone from skin of Xenopus laevis: Each contains seven copies of end-product JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Kreil, 18-MAY-1990. FEATURES from to/span description pept 157 831 thyrotropin releasing hormone BASE COUNT 927 a 597 c 604 g 827 t ORIGIN 1 catgcagttt attagatata cagtacaatg aagtcagtta tgagaaatag caattgcagc 61 acaaggacac actctgcata ttgtgctgcc ggacaaggag gtgacagcca gtcaggctga 121 gacaaaggaa cttccagacc tctgacagca ggaaagatgg tgtctgtctg gtggttgctg 181 cttcttggta caaccgtatc tcacatggtg cacacacaag agcagccttt actggaggag 241 gacacagcac cattagatga tctggatgtt cttgagaaag ccaaaggtat cctgatccgc 301 agtatcctgg agggatttca agaagggcaa caaaacaata gagatctacc agatgcaatg 361 gaaatgatat ctaagcgcca gcacccaggg aaacgattcc aggaggagat agaaaagaga 421 caacaccctg gaaagaggga tctggaagat ctgaatctag agctttccaa aaggcaacac 481 cccggaagaa gatttgtgga tgatgtagag aagaggcaac atccaggaaa gagagaagag 541 ggtgactgga gtaggaggta tctgacagat gactcacgtt atttggacct cctttctgat 601 gtttccagga gacagcaccc aggcaaaaga gttccagccc cattgtttac aaaacgtcaa 661 cacccaggta agagagtgac agaagaagag ggtgatactg aatttgaaaa ctcgaaggaa 721 gtggggaagc gccagcatcc aggaaagaga tatgaccctt gtgaaggccc taatgcctac 781 aactgtaact caggaaacat tctaccggaa gaattgagtt ttgggcttta agctgcccag 841 cccctttatt agttccatct gaccctaaat gattcccaat gaacacaact ttctataatt 901 gttaaataac attgtattaa gtatcataca tttctggaaa gcaagcagct cttagaacac 961 ttcttcgctt taaaaggcac ctggggcata agagtattaa gcttcagaca gtaacctgcc 1021 caccacaggg agggattcaa caatcacaat tggctgagtg ttcctttccc ttgtttggca 1081 gtgagatcag ataaataaat ataagatggc caggaaagtg gactctttct tttctgaaaa 1141 tttgcaagta acaccaaaat ataataattt tgcactctgc agtgtattaa cgtgaagatc 1201 tcaagaaggt tataaattag gttataaatt cttggtgatc tgctcaaagc atttaattca 1261 tagttgcttc catggtttga tggggaatgc acattctaaa ttgcttattg ctaattagcg 1321 cttgccacac agttctggtg gtagatcttg atgaggcata ttcaataaaa gtagagccca 1381 tagtaaaatt tgtgccccgt cagctttaag gatcctctgt aagcaatatg tgttgtgagg 1441 gccacttgtt tctaaagtaa tattttcatt ttaataaata tgtctactca aatgacaaaa 1501 acattcatta tttcactaca ttatactcct tcccacagca attatgtacc tatgaatcct 1561 gatagaagac tgcagttttc ctcttatatc ctccatgttg gattcaccat aagtcaccaa 1621 aatatatcta tagggaagca cactatacac aatagcagtg acccccatcc agtggcttgt 1681 gggcaacaag ctactcacca acccccttgg ctgttgctcc cagtggccct aaagtaaggt 1741 gcataaaaaa accagatgaa cttgtcaaaa agagcctccc ttagactgcc ttgttccaca 1801 tagaggctac catatagcca atcacagccc ttatttggca cccccgggaa cttttttcat 1861 gcttgagttg ctccccaaat ctttttacag ttgaatatgt ctcatggcta aaaaaacgtg 1921 aggaccccgg cgtaatatag tataatatac acacactcac tttggaaaac tctatggaga 1981 tcaataagca cttttgggtt aaactatttt tttgatacaa tttgagcact ttatatatgg 2041 attttaaaga tattccgctt tagtagtctg tggtgcgctg ccccataaat atattggtga 2101 attattcacc acctactctt aacaattctg ctcaattcat ctagatgtta acataataca 2161 tcaccagtat cacaatggca gcgggaagca aagacattct gtagtgtcct gagaccagct 2221 aaagcctaga ggtggaccat aaataatgtc tattgcaggg tcagtacaaa caaaaacacc 2281 aaggctgctt tatacaaggc atatctaatt tgcaggtatt ttgctgaact attactccac 2341 acacaaagct tgagggacac agactaataa tctgctgaag gtttgcagga tggacagttg 2401 gacactgctt tgcttcaact ttattctagg cttgtgctct gatgtatgca gcgtcaaata 2461 ccagctgttg tttgactaca actcccagaa gcctcagcat actgagggtg gtatgcttga 2521 atgcttgaat gcttgaatac cgaaggctgt ctgtcctcca acacctcccg ttgatctccc 2581 gctccagctc ttattgtcat tccattgtat attttgtttt taaatgtata aagaaataaa 2641 aaaaaagtat gatatattca cccttcttct tctgagtata aaaagattta aatgaatgtg 2701 aaaataatat ttttatagac aacaatcttt gtgcagtgtt ggtaaataca tgtttattct 2761 gtatatagct attttaatat gcatactgaa agaatatata tatataataa gaagcatgaa 2821 catctcattg cctgggtatg aaacaataaa gattgcatct gataatgaag caaattcgct 2881 ctgtggcgca gtattatgtt gacctgatga tgaagttagg tctggtgcgc ttctcaatgt 2941 tcgtggcgct ggccc // LOCUS MUSIGCS 302 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse Ig heavy-chain gene enhancer region. ACCESSION M35179 KEYWORDS constant region; germline; immunoglobulin heavy-chain. SOURCE Mouse (strain BXXB:SB/Le) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 302) AUTHORS Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and Dixon,F.J. TITLE Molecular aspects of murine systemic lupus erythematosus JOURNAL Springer Semin. Immunopathol. 9, 121-142 (1986) STANDARD simple staff_review BASE COUNT 90 a 59 c 68 g 85 t ORIGIN 1 ctgcagcagc tggcaggaag caggtcatgt ggcaaggcta tttggggaag ggaaaataaa 61 accactaggt aaacttgtag ctgtggtttg aagaagtggt tttgaaacac tctgtccagc 121 cccaccaaac cgaaagtcca ggctgagcaa aacaccacct gggtaatttg catttctaaa 181 ataagttgag gattcagccg aaactggaga ggtcctcttt taacttattg agttcaacct 241 tttaatttta gcttgagtag ttctagtttc cccaaactta agtttatcga cttctaaaat 301 gt // LOCUS MUSIGCT 313 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse Ig heavy-chain gene enhancer region. ACCESSION M35180 KEYWORDS constant region; germline; immunoglobulin heavy-chain. SOURCE Mouse (lupus erythematosus strain MRL/I) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 313) AUTHORS Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and Dixon,F.J. TITLE Molecular aspects of murine systemic lupus erythematosus JOURNAL Springer Semin. Immunopathol. 9, 121-142 (1986) STANDARD simple staff_review BASE COUNT 94 a 59 c 69 g 91 t ORIGIN 1 ctgcagcagc tggcaggaag caggtcatgt ggcaaggcta tttggggaag ggaaaataaa 61 accactaggt aaacttgtag ctgtggtttg aagaagtggt tttgaaacac tctgtccagc 121 cccaccaaac cgaaagtcta ggctgagcaa aacaccacct gggtaatttg catttctaaa 181 ataagttgag gattcagccg aaactggaga ggtcctcttt taacttattg agttcaacct 241 tttaatttta gcttgagtag ttctagtttc cccaaactta agtttatcga cttctaaaat 301 gtatttagaa ttc // LOCUS MUSTCBYBB 459 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse T-cell receptor C beta-1/2 recombinant chain, exon 1. ACCESSION M35181 KEYWORDS T-cell receptor beta chain; constant region; germline. SOURCE Mouse (strain NZW) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 459) AUTHORS Theofilopoulos,A.N., Kofler,R., Noonan,D., Singer,P. and Dixon,F.J. TITLE Molecular aspects of murine systemic lupus erythematosus JOURNAL Springer Semin. Immunopathol. 9, 121-142 (1986) STANDARD simple staff_review FEATURES from to/span description pre-msg < 1 > 459 TCR C-beta-1/2 recombinant chain, exon 1 IVS 436 > 459 TCR C-beta-1/2 intron A (no splice consensus) BASE COUNT 117 a 124 c 126 g 92 t ORIGIN 1 ttacaagatc aaggcagatc cagatagctc tcagaccatt cgtactctct ttactttcca 61 gaggatctga gaaatgtgac tccacccaag gtctccttgt ttgagccatc aaaagcagag 121 attgcaaaca aacaaaaggc taccctcgtg tgcttggcca ggggcttctt ccctgaccac 181 gtggagctga gctggtgggt gaatggcagg gaggtccaca gtggggtcag cacggaccct 241 caggcctaca aggagagcaa ttatagctac tgcctgagca gccggctgag ggtctctgct 301 accttctggc acaatcctcg aaaccacttc cgctgccaag tgcagttcca tgggctttca 361 gaggaggaca agtggccaga gggctcaccc aaacctgtca cacagaacat cagtgcagag 421 gcctggggcc gagcaggtaa gtgcggacct catgaggaa // LOCUS HAMSCARPB 537 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Hamster alpha-crystallin B chain mRNA, 5' end. ACCESSION J03849 KEYWORDS alpha-crystallin B chain. SOURCE Hamster scrapie infected brain, cDNA to mRNA. ORGANISM Mesocricetus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 537) AUTHORS Duguid,J.R., Rohwer,R.G. and Seed,B. TITLE Isolation of cDNAs of scrapie-modulated RNAs by subtractive hybridization of a cDNA library JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by J.Duguid, 25-OCT-1990. FEATURES from to/span description pept 21 > 537 alpha-crystallin B chain BASE COUNT 113 a 177 c 127 g 120 t ORIGIN 1 catacattca cctagccacc atggacatcg ccatccacca cccctggatc cgccgtccct 61 ttttcccttt ccactccccc agccgcctct ttgaccagtt cttcggagag cacctgttgg 121 agtctgacct cttctcaact gccacttctc tgagtccctt ctacctgcgg ccaccttcct 181 tccttcgggc acccagctgg attgacactg gactctcaga gatgcggatg gagaaggaca 241 gattctccgt caacctggat gtgaagcact tctccccgga agagctgaaa gtcaaggtgc 301 tgggggacgt ggttgaagtg catggcaagc acgaagagcg ccaggacgaa cacggcttca 361 tctctaggga gttccatagg aagtaccgga tcccagctga tgtggatcct ctgaccatta 421 cttcatccct gtcatctgac ggcgtcctca ctgtgaatgg accaaggaaa caggcctctg 481 gccccgagcg taccattccc atcacccgtg aagagaagcc tgctgtcact gcagccc // LOCUS HAMSCRAP 282 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Hamster glial fibrillary acidic protein mRNA, partial cds. ACCESSION J03847 KEYWORDS glial fibrillary acidic protein. SOURCE Hamster scrapie infected brain, cDNA to mRNA. ORGANISM Mesocricetus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 282) AUTHORS Duguid,J.R., Rohwer,R.G. and Seed,B. TITLE Isolation of cDNAs of scrapie-modulated RNAs by subtractive hybridization of a cDNA library JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by J.Duguid, 25-OCT-1990. FEATURES from to/span description pept < 1 > 282 glial fibrillary acidic protein (AA at 1) BASE COUNT 90 a 69 c 76 g 47 t ORIGIN 1 gagggccaaa gcctcaagga ggagatggct cgccacctgc aggagtatca agatctactc 61 aatgtcaagc tagccctgga catcgagatt gccacctata ggaaattgct agaaggcgag 121 gaaaaccgca tcaccatccc tgtacaaact ttctccaacc tgcaaatccg agaaaccagc 181 ctggacacca agtccgtgtc agaaggacac ctcaagagga acatcgtggt aaagacagtg 241 gagatgaggg atggtgaggt cattaaggag tccaagcagg ag // LOCUS HAMSCRAPA 327 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Hamster metallothionein II mRNA, complete cds. ACCESSION J03848 KEYWORDS metallothionein II. SOURCE Hamster scrapie infected brain, cDNA to mRNA. ORGANISM Mesocricetus sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 327) AUTHORS Duguid,J.R., Rohwer,R.G. and Seed,B. TITLE Isolation of cDNAs of scrapie-modulated RNAs by subtractive hybridization of a cDNA library JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5738-5742 (1988) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by J.Duguid, 25-OCT-1990. FEATURES from to/span description pept 62 247 metallothionein II BASE COUNT 65 a 100 c 81 g 81 t ORIGIN 1 cactcaagtt tcgacttttc ctcggtcctc agccggtctt caaccgccgc cttcactcgc 61 catggacccc aactgctcct gtgccacaga tggatcctgc tcctgctctg ggtcttgcaa 121 atgcaaagag tgcaaatgca ccacgtgcaa gaaaagctgc tgctcctgct gcccggtggg 181 ctgtgcgaag tgctcccagg gctgcgtctg caaagaggct tcggagaagt gcagctgctg 241 cgcctgaagc ggattcccct cagctgtctg taaatagagc aatgtgtaga aacgtattgg 301 tttttttaca accccgtcct attctcc // LOCUS ASOTAAG1 2935 bp ds-DNA PLN 26-JUL-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G1) gene, complete cds. ACCESSION M33218 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 2935) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 607 772 Taka-amylase A (Taa-G1) precursor, exon 1 828 868 Taka-amylase A precursor, exon 2 955 1070 Taka-amylase A precursor, exon 3 1140 1248 Taka-amylase A precursor, exon 4 1317 1545 Taka-amylase A precursor, exon 5 1603 1765 Taka-amylase A precursor, exon 6 1830 1976 Taka-amylase A precursor, exon 7 2041 2281 Taka-amylase A precursor, exon 8 2360 2647 Taka-amylase A precursor, exon 9 sigp 607 669 Taka-amylase A signal peptide matp 670 772 Taka-amylase A 828 868 Taka-amylase A 955 1070 Taka-amylase A 1140 1248 Taka-amylase A 1317 1545 Taka-amylase A 1603 1765 Taka-amylase A 1830 1976 Taka-amylase A 2041 2281 Taka-amylase A 2360 2644 Taka-amylase A pre-msg 543 > 2789 Taa-G1 mRNA and introns IVS 773 827 Taa-G1 intron A (no splice consensus) IVS 869 954 Taa-G1 intron B IVS 1071 1139 Taa-G1 intron C IVS 1249 1316 Taa-G1 intron D IVS 1546 1602 Taa-G1 intron E IVS 1766 1829 Taa-G1 intron F IVS 1977 2040 Taa-G1 intron G IVS 2282 2359 Taa-G1 intron H signal 2784 2789 poly-A signal BASE COUNT 818 a 752 c 657 g 708 t ORIGIN 1 ccagtgaatt catggtgttt tgatcatttt aaatttttat atggcgggtg gtgggcaact 61 cgcttaccga ttacgttagg gctgatattt acgtaaaaat cgtcaaggga tcgaagacca 121 aagtagtaaa accccggagt caacagcatc caagcccaag tccttcacgg agaaacccca 181 gcgtccacat cacgagcgaa ggaccacctc tacgcatcgg acgcaccatc caaatagaag 241 cagcaaagcg aaacagccca agaaaaaggt cggcccgtcg gccttttctg caacgctgat 301 cacgggcagc gatccaacca acaccctcca gagtgactag gggcggaaat ttaaagggat 361 taatttccac tcaaccacaa atcacagtcg tccccggcta ttgtcctgca gaatgcaatt 421 gaaactcttc tgcgaatcgc ttgattcccc gcccctggcc gtagagctta aagtatgtcc 481 cttgtcgatg cgatgtatca caaccatata aatactagca agggatgcca tgcttggagg 541 atagcaaccg acaacatcac atcaagctct cccttctctg aacaataaac cccacagaag 601 gcatttatga tggtcgcgtg gtggtctcta tttctgtacg gccttcaggt cgcggcacct 661 gctttggctg caacgcctgc ggactggcga tcgcaatcca tttatttcct tctcacggat 721 cgatttgcaa ggacggatgg gtcgacgact gcgacttgta atactgcgga tcgggtgtgt 781 tgttacctac tagctttcag aaagaggaat gtaaactgac ttgatataga aatactgtgg 841 tggaacatgg cagggcatca tcgacaaggt aaattgcccc tttatcaaaa aaaaagaagg 901 aaaagcagaa gaaaaaataa aataaaaaga actctagtcc taaccatcac atagttggac 961 tatatccagg gaatgggctt cacagccatc tggatcaccc ccgttacagc ccagctgccc 1021 cagaccaccg catatggaga tgcctaccat ggctactggc agcaggatat gtaagtcgat 1081 ttctttaaat atctacctgt catcttttac atcaatatga actaacttga tggttttaga 1141 tactctctga acgaaaacta cggcactgca gatgacttga aggcgctctc ttcggccctt 1201 catgagaggg ggatgtatct tatggtcgat gtggttgcta accatatggt tcgtggtcct 1261 ttgcaactga cttcgcggat atggttcatt tcagtactga caatgagtaa tatcagggct 1321 atgatggagc gggtagctca gtcgattaca gtgtgtttaa accgttcagt tcccaagact 1381 acttccaccc gttctgtctc attcaaaact atgaagatca gactcaggtt gaggattgct 1441 ggctaggaga taacactgtc tccttgcctg atctcgatac caccaaggat gtggtcaaga 1501 atgaatggta cgactgggtg ggatcattgg tatcgaacta ctccagtaag atatttctcc 1561 ctcattctac aacttggctg atcgatgatc ttacgaaatc agttgacggc ctccgtatcg 1621 acacagtaaa acacgtccag aaggacttct ggcccgggta caacaaagcc gcaggcgtgt 1681 actgtatcgg cgaggtgctc gacggtgatc cggcctacac ttgtccctac cagaacgtca 1741 tggacggcgt actgaactat cccatgtatg gttcctccaa ccatgagcct tcttgcaagt 1801 ctcatctcct aacgaaacgc taaaaccagt tactatccac tcctcaacgc cttcaagtca 1861 acctccggca gcatgcacga cctctacaac atgatcaaca ccgtcaaatc cgactgtcca 1921 gactcaacac tcctgggcac attcgtcgag aaccacgaca acccacggtt cgcttcgtaa 1981 gtcttccctt ttattttcgt tcccaatttc cacacagaac cccacctaac aagagcaaag 2041 ttacaccaac gacatagccc tcgccaagaa cgtcgcagca ttcatcatcc tcaacgacgg 2101 aatccccatc atctacgccg gccaagaaca gcactacgcc ggcggaaacg accccgcgaa 2161 ccgcgaagca acctgggctt cgggctaccc gaccgacagc gagctgtaca agttaattgc 2221 ctccgcgaac gcaatccgga actatgccat tagcaaagat acaggattcg tgacctacaa 2281 ggtaagcaca acctctaagc ataccctaat ggcctatcct tcagagtatc tgacacaaga 2341 ctaatcactg gcaatacaga actggcccat ctacaaagac gacacaacga tcgccatgcg 2401 caagggcaca gatgggtcgc agatcgtgac tatcttgtcc aacaagggtg cttcgggtga 2461 ttcgtatacc ctctccttga gtggtgcggg ttacacagcc ggccagcaat tgacggaggt 2521 cattggctgc acgaccgtga cggttggttc ggatggaaat gtgcctgttc ctatggcagg 2581 tgggctacct agggtattgt atccgactga gaagttggca ggtagcaaga tctgtagtag 2641 ctcgtgaagg gtggagagta tatgatggta ctgctattca atctggcatt ggacagtgag 2701 tttgagtttg atgtaacttg tctattctat gatgtatggt ctttttgttc tatagttgga 2761 aatcggaatg atctcaaatc ttgaataaat ataaaaagga taatactcac atccatcaca 2821 accttacaag gttaattccg agctatattc caccgacaca caaataggca gattcttctc 2881 tcgccaggaa tcgcgatatt attggcatgc aaataacgat aactgtctca gaagg // LOCUS ASOTAAG2A1 197 bp ds-DNA PLN 26-JUL-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 5' end. ACCESSION M33220 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 197) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 195 > 197 Taka-amylase A (Taa-G2) precursor pre-msg 131 > 197 Taa-G2 mRNA and introns signal 2 11 CAAT box signal 95 100 TATA box BASE COUNT 59 a 52 c 36 g 50 t ORIGIN 1 aatgcaattt aaactcttct gcgaatcgct tgattccccg cccttggccg tagagcttaa 61 agtatgtccc ttgtcgatgc gatgtatcac aacatataaa tactagcaag ggatgccatg 121 cttggaggat agcaaccgac aacatcacat caagctctcc cttctctgaa caataaaccc 181 cacagaaggc atttatg // LOCUS ASOTAAG2A2 198 bp ds-DNA PLN 26-JUL-1990 DEFINITION A.oryzae Taka-amylase A (Taa-G2) gene, 3' end. ACCESSION M33222 KEYWORDS Taka-amylase A. SOURCE A.oryzae (strain JCM02239) DNA. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 198) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 Taka-amylase A (Taa-G2) (AA at 1) pre-msg < 1 151 Taa-G2 mRNA and introns (alt.) pre-msg < 1 156 Taa-G2 mRNA and introns (alt.) BASE COUNT 45 a 48 c 53 g 52 t ORIGIN About 2.1 kb after segment 1. 1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg 61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac 121 ggcaccgggc cccgtctgag agcccgtctc gaatccctgt tggtcatctt ccatcgcttc 181 gtcctccaga ggcgagga // LOCUS ASOTTAM1 191 bp ss-mRNA PLN 26-JUL-1990 DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 3' end. ACCESSION M33219 KEYWORDS Taka-amylase A. SEGMENT 1 of 2 SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4]. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 191) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 189 > 191 Taka-amylase A (Taa) mRNA 125 > 191 Taa mRNA BASE COUNT 56 a 52 c 32 g 51 t ORIGIN 1 ttccggccat ataaatggtt cattgttcat tactctataa tgctaatgtt tagattagca 61 caactatgac tgggcaaatg ccgccggcca tagatagatc atctcctctc ggacgcttgt 121 ccgaagcaac cgacaacatc acatcaagct ctcccttctc tgaacaataa accccacaga 181 aggcatttat g // LOCUS ASOTTAM2 156 bp ss-mRNA PLN 26-JUL-1990 DEFINITION A.oryzae Taka-amylase A (Taa) mRNA, 5' end. ACCESSION M33221 KEYWORDS Taka-amylase A. SEGMENT 2 of 2 SOURCE A.oryzae (strain JCM02239), cDNA to mRNA, clones lambda-T[1-4]. ORGANISM Aspergillus oryzae Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 156) AUTHORS Tsukagoshi,N., Furukawa,M., Nagaba,H., Kirita,N., Tsuboi,A. and Udaka,S. TITLE Isolation of a cDNA encoding Aspergillus oryzae Taka-amylase A: Evidence for multiple related genes JOURNAL Gene 84, 319-327 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 9 Taka-amylase A (Taa) (AA at 1) mRNA < 1 151 Taa mRNA (alt.) mRNA < 1 156 Taa mRNA (alt.) BASE COUNT 40 a 35 c 42 g 39 t ORIGIN About 1.5 kb after segment 1. 1 agctcgtgaa gggtggagag tatatgatgg tactgctatt caatctggca ttggacagtg 61 agtttgagtt tgatgtacag tataaatcta gtgtactttg cacccaccac gcaatgaaac 121 ggcaccgggc cccgactgag agcccgtctc gaatcc // LOCUS CIPNADGAPD 1354 bp ss-mRNA PLN 26-JUL-1990 DEFINITION M.crystallinum glyceraldehyde-3-phosphate dehydrogenase (NAD-GAPDH) mRNA, complete cds. ACCESSION J05223 KEYWORDS D-glyceraldehyde-3-phosphate:NAD+ oxidoreductase; glyceraldehyde-3-phosphate dehydrogenase. SOURCE M.crystallinum, cDNA to mRNA. ORGANISM Mesembryanthemum crystallinum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Aizoaceae. REFERENCE 1 (bases 1 to 1354) AUTHORS Ostrem,J.A., Vernon,D.M. and Bohnert,H.J. TITLE Increased expression of a gene coding for NAD:glyceraldehyde-3- phosphate dehydrogenase during the transition from C-3 photosynthesis to crassulacean acid metabolism in Mesembryanthemum crystallinum JOURNAL J. Biol. Chem. 265, 3497-3502 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 55 1068 glyceraldehyde-3-phosphate dehydrogenase (EC 1.2.1.12) BASE COUNT 315 a 308 c 345 g 386 t ORIGIN 1 tctcacttct ctcttcttcc cctcgatctc tcaatctctc tctctcttcc tacaatggct 61 aaggttaagg tcggaatcaa cggttttgga aggatcgggc gtttggtcgc cagagtgatc 121 ctccagaggg atgactgtga gctcgtcgct gtcaacgacc ccttcatctc caccgattac 181 atgacataca tgttcaagta cgacagtgtc cacggtcagt gcaagagcca tgagatcaag 241 ttgaaggacg agaagaccct tctcttcggt gagaccccgg tcgccgtctt cggatgcagg 301 aacccagagg aaatcccatg gggtcaggct ggagccgact tcgttgtcga atccaccgga 361 gtcttcaccg acaaggacaa ggctgctgct catttgaagg gtggtgctaa gaaggtcgtt 421 atctcagctc ctagcaagga tgctcctatg tttgttgttg gtgttaacga gcacgagtac 481 aagtcagacc tcaacatcgt ttctaatgcc agttgtacca caaactgtct tgctcccttg 541 gccaaggtta tcaacgacag gtttggcatc gttgagggtc ttatgacaac tgtccacgcc 601 atgactgcta cccaaaagac cgttgatggt ccatcaatga aggactggag aggtggaagg 661 gctgcttcat tcaacatcat ccctagcagc actggagcag ctaaggctgt cggcaaggtt 721 ttgcctgctt tgaacgggaa attgacagga atggctttcc gtgttccaac ttgtgatgtg 781 tccgtggttg acctcacagt cagaattgag aaggctgcta gctacgagca gatcaaggct 841 gccatcaagg aggaatctga gggcaagctg aagggtattt tgggatacac cgaggatgat 901 cttgtttcca ccgactttat tggtgacaac aggtcaagca tctttgatgc caaggccgga 961 atctcattga acgacaactt cgtcaagctt gtctcgtggt acgacaacga atggggttac 1021 agtacccgtg ttgttgactt gatcatgcac atctcaaagt gccagtaagc tatttgctga 1081 aggttggctg agtgtgcgtt gatgcagtgt ttttcccttg tctatcatga gatggctatc 1141 gtcatcatca tttgaataaa gcgggatttt gagaaaaacc ggagctttgt ctttccgttt 1201 agtttcctag gtttggtata taggggtgat tgtttctccc ccctttgtgt tttgttatta 1261 tttagtgaaa gaacttgcag tctatatcgg agttatttga ctttccggtg gcacttatcc 1321 agcatttatg aaacattgct gtgagctttt gagt // LOCUS ECOPRIA 2658 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli primosomal protein n' (priA) gene, complete cds, and cytR gene, 5' end. ACCESSION M33293 KEYWORDS cytR gene; priA gene; primosomal protein. SOURCE E.coli (strain W3110) DNA, clone pEL042. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2658) AUTHORS Lee,E.H., Masai,H., Allen,G.C.Jr. and Kornberg,A. TITLE The priA gene encoding the primosomal replicative n' protein of Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4620-4624 (1990) STANDARD full automatic COMMENT Authorin sequence for [1] kindly submitted by G.C.Allen Jr., 26-MAR-1990. FEATURES from to/span description pept 64 2262 primosomal protein n' pept 2418 > 2658 cytR gene product signal 27 32 -10 region signal 5 10 -35 region binding 53 59 ribosome binding site BASE COUNT 578 a 738 c 756 g 586 t ORIGIN 1 gatccgcact cttctgcgac aatgtgtata ctaacccacc gaatttcaag tcaggatgat 61 gctatgcccg ttgcccacgt tgccttgccc gttccgcttc ctcgtacctt tgactatctg 121 ctgccagaag gcatgacggt taaagctggg tgtcgcgtgc gcgtgccgtt tggcaaacag 181 caggagcgca tcgggattgt ggtatcagtt agcgatgcca gcgaactgcc gctcaatgag 241 ctaaaagcgg tagtcgaagt gctggatagt gagccggtgt ttactcactc cgtctggcga 301 ttgctgctat gggcggcaga ttactatcat catccgattg gcgatgtgct gtttcatgcc 361 ttgccgattt tactacgcca ggggcggcct gcggcgaacg cgccgatgtg gtactggttt 421 gccactgaac aaggccaggc ggtggatctg aacagcctga aacgctcccc caagcaacaa 481 caggcgctgg cggcgttacg gcaaggcaaa atctggcgcg accaggtccg cacgctcgaa 541 tttaatgatg ccgcgttgca ggcgctacgc aaaaaaggtc tgtgtgattt agcaagtgaa 601 acaccagagt ttagcgactg gcgaacgaac tatgccgttt ctggtgagcg gttgcgattg 661 aataccgaac aggccaccgc cgttggcgca attcatagcg cggcagatac tttttctgcc 721 tggctgctgg cgggcgttac cggttccggt aaaacggagg tttatctcag cgtactggaa 781 aacgtgctcg ctcagggcaa acaggcgctg gtgatggtgc cggaaatcgg cctgacaccg 841 caaactatcg cccgttttcg tgaacgtttt aatgcccccg tggaagttct gcattccggc 901 ctgaacgaca gcgagcgtct ttcggcgtgg ctgaaagcga aaaatggtga ggcggcgatt 961 gtgatcggca cccgctccgc gctgtttacg ccgtttaaaa atctcggcgt gattgtcatt 1021 gatgaagagc acgacagctc ctacaagcag caggaaggct ggcgctatca tgcccgcgac 1081 ctggcggtgt atcgtgcgca cagcgagcaa atcccgatta ttcttggctc cgcaacgccc 1141 gcgctggaaa cgttatgcaa cgtccagcag aaaaaatacc gcctgctgcg cctgacccgt 1201 cgggcaggga atgcgcgtcc ggcaattcaa catgtgctgg atttaaaagg tcagaaggtg 1261 caggcaggtc tggctccggc gttaatcact cgtatgcgcc agcatttaca ggctgataac 1321 caggtcattc tctttcttaa ccgccgtggc tttgcgcctg cactgctgtg ccacgactgt 1381 ggctggattg ccgaatgccc acgttgcgat cactactaca cgctgcatca ggcgcagcac 1441 catctgcgct gccaccactg tgacagtcag cgtccggtgc cgcgccagtg cccttcctgc 1501 ggttccacgc acctggtccc cgtggggctg ggcaccgaac agcttgaaca gacgctcgcg 1561 ccgttgttcc ccggcgtgcc catttctcgt atcgaccgcg ataccaccag ccgcaaaggg 1621 gcgctggaac agcaactggc agaagtacat cgcggcggcg cgcggatttt gattggtaca 1681 caaatgctgg cgaaaggtca ccatttcccg gatgtgacgc tggttgcatt actggacgtg 1741 gacggcgcgc tgttttctgc cgattttcgc tcggcagagc gtttcgctca gctttacacc 1801 caggtcgccg gtcgtgccgg gcgtgcgggt aaacagggcg aagtggtgct gcaaacgcac 1861 catccggaac atcctctgtt gcaaacgttg ctctataaag gctacgacgc ctttgccgaa 1921 cagcggctgg ctgagcggcg aatgatgcag ctaccgccgt ggaccagcca tgtgattgtg 1981 cgtgcggaag atcataacaa tcagcacgcg ccattgttcc tgcaacaact gcgtaatctg 2041 atcctctcca gcccactggc agacgagaaa ctgtgggttc tcggtccggt tccggctctg 2101 gcacctaaac gtggcggtcg ctggcgctgg cagatattgt tgcagcaccc ttcccgcgtg 2161 cgcttgcaac acatcattaa cggtacgctg gcgctcatca atacaatacc ggattcccgt 2221 aaggtgaaat gggtgctgga tgttgatccg attgagggtt aaaccgctca cgatgcgagg 2281 cggatcgaaa aattcaatat tcatcacact tttcatgaaa attctgtaac cgttttcacg 2341 cgctatctgc taaaaatgtt gccgatgtga agtaaacatg gatgtagtac gcctgacgtg 2401 ccaggcgagg agtgagtgtg aaagcgaaga agcaggaaac tgccgcgacc atgaaagacg 2461 ttgccctcaa ggcaaaagtc tctacagcga ccgtctcccg agcattaatg aatcccgata 2521 aagtctccca ggccacccgt aatcgggttg aaaaagcggc ccgggaagtg ggttatttac 2581 cgcagcctat ggggcgcaac gtcaagcgta atgaatcccg caccattctg gtgattgtcc 2641 cggatatctg cgatcccc // LOCUS EWCTELRNA 657 bp ds-DNA INV 26-JUL-1990 DEFINITION E.crassus telomerase RNA component gene, complete cds. ACCESSION M33461 KEYWORDS telomerase RNA. SOURCE E.crassus DNA. ORGANISM Euplotes crassus Eukaryota; Animalia; Metazoa; Ciliophora; Polyhymenophora; Spirotricha; Heterotrichida; Clevelandellina. REFERENCE 1 (bases 1 to 657) AUTHORS Shippen-Lentz,D. and Blackburn,E.H. TITLE Functional evidence for an RNA template in telomerase JOURNAL Science 247, 546-552 (1990) STANDARD simple staff_entry FEATURES from to/span description RNA 152 342 telomerase RNA component site 186 197 functional telomeric template BASE COUNT 202 a 122 c 111 g 222 t ORIGIN 1 aaaaccccaa aaccccaaaa ccccaaatct gataaaatta ttacgaatag aattttaaga 61 cctgcttatt gttttcgcgt aatttttgac ccataataat taacagaagt aatgactagt 121 tgtttataac ctaataggag gatatagggt agttctccat tgactaatcc gtcaaatctg 181 tcaaacaaaa ccccaaaacc gatcaatagg tgcgtttagc ttgattacac ctcttaaatg 241 aaatcttgca attctggaga gcttgagagg tgaaaccccc acagttaggt caaacatagt 301 ttgagatttg tatctcatat gctctagctg tcctctcatc tttttgacat tagctagacg 361 agacagctcc tcttgctatt tacttgcctt agtccgatca ctccgctaat atttttgatt 421 tttaaatttg gcggaatttc ttgttcacta atcttgaaat ttttacagaa attgttagat 481 ttaataagct aataatctat gtcagagcct ttagccaatt agaggctttc ctaagtacga 541 aagaggtata tatcattaca ttttgaatcc ctgacctcca tttttaagga atagagatac 601 cctccattat attcaatttg ggaaggattg aaaggggttt tggggttttg gggtttt // LOCUS HS4DWXJ 160 bp ds-DNA VRL 26-JUL-1990 DEFINITION Epstein-Barr virus defective WZhet junction. ACCESSION M33474 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, clinical sample 9) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 160) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 87 90 WZhet junction BASE COUNT 41 a 45 c 41 g 33 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cactcctggc gctctgatgc 61 gaccagaaat agctgcagga ccactttata ccaggggcag tggtccccct ccctagaact 121 gacaattggc tgctgtctgg cttacgtaaa cgcgctggac // LOCUS HS4WXJ 181 bp ds-DNA VRL 26-JUL-1990 DEFINITION Epstein-Barr virus WZhet junction, HR-1 clone 5. ACCESSION M33473 KEYWORDS . SOURCE Epstein-Barr virus (strain HR-1, het+ allotype) DNA, clone 5. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 181) AUTHORS Patton,D.F., Shirley,P., Raab-Traub,N., Resnick,L. and Sixbey,J.W. TITLE Defective viral DNA in Epstein-Barr virus-associated oral hairy leukoplakia JOURNAL J. Virol. 64, 397-400 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 108 111 WZhet junction BASE COUNT 50 a 42 c 46 g 43 t ORIGIN 1 aatagacagc ccagttgaaa tatgcatggc atgcagcaga cattcatcat ttagaaatgt 61 atccaagatt tcattaagtt cgggggtcag gggggagtcc agattcaaat accaggggca 121 gtggtccccc tccctagaac tgacaattgc ctgctgtctg gcttacgtaa acgcgctgga 181 c // LOCUS HUMREGA 4251 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human regenerating protein (reg) gene, complete cds. ACCESSION J05412 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4251) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues: The reg protein, pancreatic stone protein, and pancreatic thread protein are one and the same product of the gene JOURNAL J. Biol. Chem. 265, 7432-7439 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. FEATURES from to/span description pept 1571 1634 regenerating protein (reg), exon 2 (first expressed exon) 2270 2388 regenerating protein, exon 3 2696 2833 regenerating protein, exon 4 3549 3660 regenerating protein, exon 5 3856 3923 regenerating protein, exon 6 pre-msg 1196 4116 reg mRNA and introns IVS 1224 1524 reg intron A IVS 1635 2269 reg intron B IVS 2389 2695 reg intron C IVS 2834 3548 reg intron D IVS 3661 3855 reg intron E signal 1169 1174 TATA box BASE COUNT 1161 a 927 c 869 g 1294 t ORIGIN 1 gaattcctgg gctcaagtga tcctctcatg tcagtctccc aaagtgctgg gatgacaggc 61 ttgagccacc acaccaggcc catcatcagt ttttatataa agaaaaaaaa accttaaaat 121 tgttaggcaa atactatgac aaattgtaat atatattctt acatttcaga tttttatttt 181 ttaaactgta taagaattga ttaataaata aaatttagta ttaatctgtc ttttaaaacc 241 atatataaag tttatcaaat agcttataac ttcttgcaac tgaatttttg tattcaatgt 301 tatggctttg atactagtcc aagttgaaat atagatatct actttattcg atttaaattc 361 tgtttagtat tttattatat tttgttaatc catttgtccc aattcatata cttatctctc 421 tttctgtgaa tattcaggtt agttttttct tcctaatttt gcattctgat tggcttttat 481 tccctgaatt ataaatgact attctatgat gattctggta aatactcaat ttcaccacac 541 aatctttgac ttcatactaa caaacagttg acttcaaatg gacaatttca atgaaggctg 601 acttcatatt tagctccttt aagcttcctt aggcatcagc tctctacaat tctcacattg 661 agaatatgtg tattttgtta gctcaaacct tgttagacat gttaaatgtt tagaaatata 721 aatttaacct accccttgag gtaggtcttg agaggtttgt gagcctaaaa agacatggag 781 gaaccactta ttgccacaag cacattgttc taaattattt ggaatcagtt aattcttccc 841 catctcctac ccatgcctga caccaaagag gagcctctaa atttacaggg aatacaagga 901 agtctactgt tctctgctcc tctctgggtt attagggcac atgggagccc tcagttgttt 961 tctgctgagc aagagcaaag tccaccttgg acttagacag cttgccaaat tttttgccag 1021 aaggggacct gagttgtgac cactcccagt gtgtgccggg aaaaggctca tactggtgcc 1081 agaatctctt actgtcaatg ctcccaaaac tcaccgcttg cccccacccc ttttgcttaa 1141 atgacgtggt tcttatctca gatcctgata taaagctcct acagctacct ggcctgagaa 1201 gccaactcag actcagccaa caggtaagtg ggcattacag gagaagggcg tctctaacat 1261 gcactgtaga tctaaaatct tcgggaagat acagcatgag tttctgtcca agaggtttta 1321 gctgtaagga agcctcagtg ggatccaaag ttgtttttca gttactgagt ctgtataatc 1381 cccactctca agagaaacat ttgaaggtgt gggtgtctca gaggaccttc ctggtctcag 1441 aaattctgag aggaggtttt aaggaaggta ataggtgctt tgctctccat ctctcagaac 1501 ccccttctct gtgttctcct atagagattg ttgatttgcc tcttaagcaa gagattcatt 1561 gcagctcagc atggctcaga ccagctcata cttcatgctg atctcctgcc tgatgtttct 1621 gtctcagagc caaggtaaga tctcttttcc accaaccaac tctttctagc cctgaagact 1681 tcactctatc cccaagcata cgggtctact tgaaaaaaaa aaaaaagcag agtcactgtt 1741 aagggttgtt ttgtggtgtt tagtgatctt tattgcttat ctcttcacat ttatatacat 1801 ccacacctca ttaaggagtt ggagctagaa tttaaaatga ccccttataa gcaactgctg 1861 cagctggcat gagtttatct gattaaattt atacgtgatg gtggatttgg ggatgtctgt 1921 gtgtagacag tcactaatgg ggtggagaac tgaagagagc cttgtgttca gggaaaccaa 1981 gtcaggcttg agaaagtaga aggctgagtc cttcaaggta gaagagcctg agctccagac 2041 ataaaaggga aactggagac ttgtttcttt ggcctattca ttctgttttt tttcccctga 2101 tcaaagaaac caaagacaga agatgtagga tgcaggagca atagtgagca gtcatcccat 2161 aatagactgg attcttctgt ttctataaag gaacctcaga agctcttacc tcaccttcaa 2221 gccttttcct taccctgaga gcctccttta attgtctctt ctttttcagg ccaagaggcc 2281 cagacagagt tgccccaggc ccggatcagc tgcccagaag gcaccaatgc ctatcgctcc 2341 tactgctact actttaatga agaccgcgag acctgggttg atgcagatgt gagtgaggag 2401 agcagtgtgg gaagggagac tcatgaaggg aggggaagct gccactctcc agtgtgttca 2461 gtggctgcaa tgagatgaga ctgaacccct tgctatacta tcatcagccc caaactttcc 2521 aatctacttt atcccattat tcagcacatt cccagcacaa agaacctggt ggtcagtgac 2581 agcatcatca cggacattac tctgctgtcc tttttctgac ccgtcctctt ggaggactca 2641 gtatatccgt cacaacttcc tcctccactg agtgctccat tttcttctgc aacagctcta 2701 ttgccagaac atgaattcgg gcaacctggt gtctgtgctc acccaggccg agggtgcctt 2761 tgtggcctca ctgattaagg agagtggcac tgatgacttc aatgtctgga ttgccctcca 2821 tgaccccaaa aaggtaggct gcagccttct ttatctccta atgatcaggt ttgagaagta 2881 agaaggaggt tcaagttctg gtctcttaag taccagcttt tatcgctttc cagaaatcag 2941 gctgtttaca gatcctctaa tgtcctgtgt agcaaggtgc actgtagatg attggagata 3001 taagtggaag gctgaatttc ctaggtgttc ttgtcattca tgaataaact tattctgttt 3061 tcagtcaaca aagcatcttt atgcaccaac ttcttaccta ttttgttact gtcagagtca 3121 caagagagac tagattgccg actatataag aaaggagact tgtggtaaaa atctgctgct 3181 gtactgctgg catttgggaa cctggtagta tactaaataa tataatatat caacaactaa 3241 tggtcagcca atgctatgct ggatatgagg gtcctgggcc acaaagacaa aaaatcagga 3301 accacttttt aagtgagata ctttgggtct ctgtcaaatt cataacactt atttcttggt 3361 ggaatacagt taatgagttg gacagttcag gaaagaagtt tagagcaata gcaaaggaaa 3421 ggaaacaata tttagcaagg tttattcttc ctttgtgtct tagcatgttt ctgagtgtgc 3481 acacaggccc agtgattcca tgtatttttg agtgaccact gcctctgttc tggcccttcc 3541 ccatctagaa ccgccgctgg cactggagca gtgggtccct ggtctcctac aagtcctggg 3601 gcattggagc cccaagcagt gttaatcctg gctactgtgt gagcctgacc tcaagcacag 3661 gtgagaggca gagaatccat ccacctgttt ctgttctctc ctgcttagct ccagggatgg 3721 aactgggact gggatagagg aaaggtgaac tcctcattaa ggaaatggat gtttggtttt 3781 tgtcctgagt cctaaagcca ggagggtcat actctttcgg gtctcccagt tgtaactctt 3841 ctcattgact tataggattc cagaaatgga aggatgtgcc ttgtgaagac aagttctcct 3901 ttgtctgcaa gttcaaaaac tagaggcagc tggaaaatac atgtctagaa ctgatccagc 3961 aattacaacg gagtcaaaaa ttaaaccgga ccatctctcc aactcaactc aacctggaca 4021 ctctcttctc tgctgagttt gccttgttaa tcttcaatag ttttacctac cccagtcttt 4081 ggaaccctaa ataataaaaa taaacatgtt tccactattg tgctgtctta ctgtgtctgc 4141 tatttccaca gctgatgcct gggtggttga gatgagagtg attacaacaa agcttgctct 4201 ggcctatcca cttcttaaaa gtccatccgc ataccatgca tattggaatt c // LOCUS HUMREGRELA 1524 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human reg-related sequence, complete cds. ACCESSION J05413 KEYWORDS pancreatic stone protein; pancreatic thread protein; regenerating protein. SOURCE Human esophageal mucosa DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1524) AUTHORS Watanabe,T., Yonekura,H., Terazono,K., Yamamoto,H. and Okamoto,H. TITLE Complete nucleotide sequence of the human reg gene and its expression in normal and tumoral tissues: The reg protein, pancreatic stone protein, and pancreatic thread protein are one and the same product of the gene JOURNAL J. Biol. Chem. 265, 7432-7439 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by H.Okamoto, 23-FEB-1990. BASE COUNT 382 a 368 c 345 g 429 t ORIGIN 1 atctcagagg accttcctgc tgtcaggaat tcagaggagg aaataaggaa ggtaataggt 61 gctctgctct cattctctca aaccctcttc cctgtgtttt cctatagaga ttgctgattt 121 gctccttaag caagagattc actgctgctc agcatggctc agaccaactc atgcttcatg 181 ctgatctcct gcctgatgtt cctgtctctg agccaaggtg agattgtttt ccccacacat 241 acctcccaca accccagccc tgaagccctc actctatcct catgcatatg agttcacttg 301 agaaaaagca gagtcaagtt caggggttgt tttgtgttgt tcagtgatat ttattgctga 361 tctcatccca ttcaaaaaca tcctgacctc cctaaggagt tagagatgga acttagcata 421 accctttatc agtgaccact gcagttggca ttggtttgtc atattaacac tactcatgat 481 gggggtgttg aggatgtctg tttgtagaca gtcattagtg gaatggggaa ctgaggggag 541 ctttgtgtgt agagaaactg gacaggcttg agaaagaagc ctcagtcctt caaggaagaa 601 aaagccataa gtaaaaggga caatggggac acttttcatg agcctattca ttgtgtgctc 661 ttgtcttgag caaagacatc ttgagagcct ataggtaaga tgcagaaggg cagaagtgac 721 caatcgcttc gtgacctata ggatccttct attcctataa agaatcctca gaagctccta 781 cctcatattt tagcctttac cttgccctga gggtctttct taattgtctc tcttttccca 841 ggacaggagg cccatgctga gttgcccaag gcccagatca gctgcccaga aggcaccagt 901 gcctaaggct cccactgcta ctactttaat gaagagcatg agacctgggt ttatgcagat 961 gtgagtgagg agagcagtgt gggaagggag gctcacgaag ggaggggaag ctgccactct 1021 ccagtgtgtt cagtggctga tatgagatga gactaatccc ctccctatcc aatcatcagc 1081 ccaaaacttt ccaatctact ttatcccatc attcagcaca gagatgctgg tggtcagtga 1141 cagcatcatc agggacattt ctgtgctgtc ctttttctgt tacatcctct gggagggctc 1201 aatatgtctc ccacactttc ctccttcact gagtgctcca ttttcttctc caacagctct 1261 actgccagaa catgaattca ggtaacctgg tgtctgtgct cacccaggct gagggtgcct 1321 ttgtggcttc gctgattaaa gagagtggca ccaaggatag caatgtctgg attggcctcc 1381 atgaccccca ccggatcagt ctgctgcatc ttctacctcc tgattatcag gttccagagg 1441 gtctgatgtc tggcacctca agcatcagtt tttactatat tatgataaaa gcaacctctc 1501 tataaatcat ataatgtaaa ggat // LOCUS MDPCGA 4801 bp ss-RNA VRL 26-JUL-1990 DEFINITION Aleutian mink disease parvovirus complete genome. ACCESSION M20036 KEYWORDS complete genome. SOURCE Aleutian mink disease parvovirus (strain ADV-G), clone pXVB-4. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (bases 1 to 4592) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 4593 to 4801) AUTHORS Bloom,M.E., Alexandersen,S., Garon,C.F., Mori,S., Wei,W., Perryman,S. and Wolfinbarger,J.B. TITLE Nucleotide sequence of the 5'-terminal palindrome of Aleutian mink disease parvovirus (ADV) and construction of an infectious molecular clone JOURNAL J. Virol. 64, 3551-3556 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by M.E.Bloom, 18-MAR-1990. Sequence reported below is (+) strand. FEATURES from to/span description ORF 116 1978 Left ORF ORF 1983 2207 Middle ORF2 ORF 1993 2211 Middle ORF1 ORF 2241 4399 Right ORF BASE COUNT 1740 a 912 c 943 g 1206 t ORIGIN 1 attaattctc aaccaatatt cgttagcaac caacaccagc tcgcttcgct cgcgcacctt 61 cggcgctggt gttgggcgct tcgcgcttgc taacttcata ttggttgaga attaatccgt 121 gtctttcctg tggaatgagg aagtagtgtg gtatataagc agaggttgct tggagcaaag 181 cacagaccgg ttacagcaaa gtaacatggc tcaggctcaa attgatgagc agaggagact 241 gcaggacctg tatgtgcagt tgaagaagga gattaacgac ggtgaaggag ttgcctggtt 301 gttccaacaa aagacctaca ccgacaagga caacaaacca accaaagcaa caccgccact 361 gaggacaacc tcttctgacc taaggttagc ttttgactct attgaagaga atttaacagc 421 ttctaatgaa cacttaacta acaatgagat aaacttttgt aaactaacct tggggaagac 481 gttgctgtta attgataagc atgtaaaaag ccacagatgg gatagtaaca aagttaactt 541 aatttggcaa atagaaaaag gaaaaactca gcaatttcat attcactgtt gcttaggtta 601 ctttgataag aatgaagatc ctaaggatgt tcaaaaatcc ttaggttggt ttatgaaaag 661 actaaataaa gacctagcag ttatctatag taaccatcat tgtgacatac aagatattaa 721 ggatcctgaa gatagagcta agaacctaaa agtgtggatt gaagatggac ctactaagcc 781 ttacaaatat tttaacaaac aaaccaaaca agactacaat aaaccagttc acttgagaga 841 ctatacattc atatacctgt ttaacaaaga taagataaat acagatagta tggatggtta 901 ctttgctgct ggtaacggtg gcattgttga caacctaact aacaaagaac gaaaaacttt 961 aagaaaaatg tacttagatg agcagagttc agatataatg gatgctaata tagactggga 1021 agatggccaa gacgcgccaa aagtaactga ccaaactgac tcagcaacca caaaaacagg 1081 aactagtttg atttggaaat catgtgctac taaagtaacc tcaaaaaaag aagttgctaa 1141 tccagttcag caaccttcta aaaaactgta ctcagctcaa agtactttag atgcattgtt 1201 taacgttggt tgctttactc cagaagatat gattataaag caaagtgaca aataccttga 1261 actatcttta gaaccaaacg ggcctcaaaa aattaacact ttacttcaca tgaaccaagt 1321 aaagacatca accatgatta ctgcttttga ttgtattata aaatttaatg aagaggaaga 1381 tgacaaacct ttgctagcaa ctataaaaga catgggactt aatgaacaat accttaagaa 1441 ggtactatgt accatcctaa ccaagcaagg tggaaagaga ggttgtattt ggttctatgg 1501 accggggggc actggaaaaa ccttgctagc atctttaata tgtaaagcaa cagtaaacta 1561 tggtatggtt actacaagca atccaaactt tccatggact gactgtggca atagaaacat 1621 catttgggct gaagagtgtg gtaactttgg taactgggtt gaagacttta aagccattac 1681 tggaggtggt gatgtaaaag tagacaccaa gaacaagcaa cctcaatcta ttaaaggctg 1741 tgtgattgta acaagcaaca ccaacataac caaagtaact gttggatgtg tggaaacaaa 1801 cgctcacgca gagccactta aacagaggat gattaagata cgttgcatga aaaccatcaa 1861 ccctaaaact aaaataacac caggcatgtt aaaaagatgg ctaaatacct gggatagaca 1921 accaattcaa ctaagccatg agatgcctga actgtactta ggtaagtgcc gttggtaagt 1981 aacacatttt aaatgccaac tttaaaccaa catcaattta tgaggttact ttactttaca 2041 gagactactg gaccaaactc gagtgccaca actgccacga agaatactgg caactcacaa 2101 cctactactg caaagagtgc agaaagtgtg aacacggaaa actgcgacac accaaaaagg 2161 agtgcgagca gtgtgcctgc aaagcagcac aagagacctc ggcatgagta aaagtaaata 2221 acctacttaa agtaacctaa caccataaca ctttactttc cttgtactta tgttacttta 2281 ctttagttcc tcagcactat cctgggaaaa agagaagtgc tccaagacac gtgtttattc 2341 agcaagcaaa aaagaagaag caaactaacc ctgcggtcta ccacggagag gacaccatag 2401 aggaaatgga ttctactgaa gctgaacaaa tggacactga gcaagcaact aaccaaactg 2461 ctgaagctgg tggtgggggg ggtgggggtg gtgggggtgg tggtggtggt ggtggggttg 2521 gtaacagcac tggcggcttt aataacacaa cagaattcaa agtaataaac aatgaagtgt 2581 atattacttg tcacgctact agaatggtac acattaacca agctgacaca gacgaatact 2641 tgatatttaa tgctggtaga actactgata ccaaaacaca tcagcaaaaa ctaaacttag 2701 aattttttgt atatgatgat tttcaccaac aagtaatgac accttggtat atagtagata 2761 gcaacgcttg gggtgtatgg atgagtccta aagactttca acaaatgaaa acactgtgta 2821 gtgaaattag tttggttact ttggaacaag aaatagacaa tgtaaccata aaaactgtaa 2881 cagaaaccaa ccaaggtaac gcatctacca agcaattcaa caatgactta actgcgtcgt 2941 tacaggttgc tttagatact aacaacatac tgccatatac tccagctgcg ccgttggggg 3001 aaacactggg ctttgttcct tggagagcaa ccaaaccaac ccaatatagg tattatcatc 3061 catgttacat ttacaacaga tatcctaaca ttcaaaaagt tgcaacagaa acactaacct 3121 gggatgcagt acaagatgat taccttagtg tggatgaaca gtactttaac tttattacta 3181 tagagaacaa catacctatt aacattctca gaacgggaga taactttcat acaggcttgt 3241 atgagtttaa cagtaaacca tgtaaactaa ccttaagcta tcaaagtaca cgttgcttgg 3301 ggctacctcc tctctgcaaa ccaaagacag atacaacaca caaagtaacc tcaaaagaaa 3361 acggagctga cctaatttac atacaaggac aagataatac cagactaggt cacttttggg 3421 gtgaggaaag aggtaagaaa aacgcagaga tgaacagaat tagaccttac aacataggtt 3481 accaatatcc tgaatggata ataccagcag ggttacaggg tagttacttt gctggaggac 3541 caagacagtg gagtgacaca accaaaggtg caggtacaca cagtcaacac ttacaacaga 3601 actttagtac taggtacatc tatgacagaa accacggtgg agacaacgag gtagacctat 3661 tagatggaat acccattcat gaaagaagta actactactc agacaatgag atagagcaac 3721 atacagcaaa gcaaccaaag ttacgtacac cacccattca ccactcaaaa atagactcgt 3781 gggaagaaga aggttggcct gctgcttcag gcacacactt tgaagatgag gttatatacc 3841 tagactactt taactttagt ggtgaacagg agctaaactt tccacatgaa gtattagatg 3901 atgctgctca gatgaaaaag ctacttaact cataccaacc aacagttgct caagacaacg 3961 ttggtcctgt atacccgtgg ggacagatat gggacaagaa acctcatatg gatcacaaac 4021 ctagcatgaa caacaacgct ccatttgtat gtaaaaacaa ccctccaggt caactctttg 4081 ttaaactaac agaaaacctc actgatacat ttaactatga tgaaaatcca gacagaataa 4141 aaacctatgg ttactttact tggagaggca agcttgtact aaaaggcaaa ctaagccaag 4201 taacatgctg gaatcctgtt aagagagaac tcataggaga acctggtgta tttactaaag 4261 acaagtatca caaacagata ccaaacaaca aaggtaactt tgaaataggg ttacaatatg 4321 gaagaagtac tatcaaatat atctactaaa gtaacctgtg tactatgtta ctatgttact 4381 atgataatat ctcaataaaa gttacatgaa tagtgaacaa cctaaatact gtgtacttcc 4441 ttattttacc agaaagtggc ggattaaaat aaacctacat tctatactat ctatatacta 4501 ctaactaacc tataggttac tttgctttga tatactgatg taggaataca ggatactaac 4561 atttatatat atactaacat ctatactact aacctaacta tggcctaatg tatgcagtgt 4621 cggcgtcgcc gacaactaca ttatattatt aggcatagtt aggttagtag tatagatgtt 4681 agtatatata taaatgttag tatcctgtgt tcctacttca gtatataaag aaagtttcct 4741 ataggtgggt ttgcggtcta tctagagttg tggtccgtat tggtttctgt aaaggacctg 4801 a // LOCUS MDPUPS 3454 bp ss-RNA VRL 26-JUL-1990 DEFINITION Aleutian mink disease parvovirus (ADV-Utah 1 strain) RNA, partial sequence. ACCESSION M32981 KEYWORDS . SOURCE Aleutian mink disease parvovirus (strain ADV-Utah 1) RNA. ORGANISM Aleutian mink disease parvovirus Viridae; ss-DNA nonenveloped viruses; Parvoviridae; Parvovirus. REFERENCE 1 (sites) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. TITLE Nucleotide sequence and genomic organization of Aleutian mink disease parvovirus (ADV): Sequence comparisons between a nonpathogenic and pathogenic strain of ADV JOURNAL J. Virol. 62, 2903-2915 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3454, for [1]) AUTHORS Bloom,M.E., Alexandersen,S., Perryman,S., Lechner,D. and Wolfinbarger,J.B. JOURNAL Unpublished (1990) Rocky Mountain Labs, Hamilton, MT 59840 STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by M.E.Bloom 18-MAR-1990. Sequence reported below is (+) strand. BASE COUNT 1268 a 673 c 698 g 815 t ORIGIN 1 ggatcctgaa gatagagcta agaacctaaa agtgtgggtt gaagatggac ctactaagcc 61 ttacaaatat tttaacaaac aaaccaacaa gactacaaca aaccagttca cttgagagac 121 tatacattca tatacctgtt taacaaagat aagataaata cagatagtat ggatggttac 181 tttgctgctg gtaacggtgg cattgttgac aacctaacta acaaagaacg aaaaacttta 241 agaaaaatgt acttagatga gcagagttca gatataatgg atgctaatat agactgggaa 301 gatggccaag acgcgccaaa agtaactgac caaactgact cagcaaccac aaaaacagga 361 actagtttga tttggaaatc atgtgctact aaagtaacct caaaaaaaga agttgctaat 421 ccagttcagc aaccttctaa aaaactgtac tcagctcaaa atactttaga tgcattgttt 481 aacgttggtt gctttactcc agaagatatg attataaagc aaagtgacaa ataccttgaa 541 ctatctttag aaccaaacgg gcctcaaaaa attaacactt tacttcacat gaaccaagta 601 aagacatcaa ccatgatgac tgcttttgat tgtattataa aatttaatga agaggaagat 661 gacaaacctt tgctagcaac tataaaagac atgggactta atgaacaata ccttaagaag 721 gtactatgta ccatcctaac caagcaaggt ggaaagagag gttgtatttg gttctatgga 781 ccggggggca ctggaaaaac cttgctagca tctttaatat gtaaagcaac agtaaactat 841 ggtatggtta ctacaagcaa tccaaacttt ccatggactg actgtggcaa tagaaacatc 901 atttgggctg aagagtgtgg taaccttggt aactgggttg aagactttaa agccattact 961 ggaggtggtg atgtaaaagt agataccaag aacaagcaac ctcaatctat taaaggctgt 1021 gtgattgtaa caagcaacac caacataacc aaagtaactg ttggatgtgt ggaaacaaac 1081 gctcacgcag agccacttaa acagaggatg attaagatac gttgcatgaa aaccatcaac 1141 cctaaaacta aaataacacc aggcatgtta aaaagatggc taaatacctg ggatagacaa 1201 ccaattcaac taagccatga gatgcctgaa ctgtacttag gtaagtgccg ttggtaagta 1261 acacatttta aatgccaact ttaaaccaac atcaatttat gaggttactt tactttacag 1321 agactactgg accaaactcg agtgccacaa ctgccacgaa gaatactggc aactcacaac 1381 ctactactgc aaagagtgca gaaagtgtga acacggaaaa ctgcgacaca ccaaaaaggg 1441 gtgcgagcag tgtgcctccg aagcagcaca agagacctcg gcatgagtag aagtaagtaa 1501 cctacttaaa gtaacctaac accatgacac tttactttac ttgtacttat gttactttac 1561 tttagttcct cagcactatc ctgggaaaaa gagaagtgct ccaagacacg tatttattca 1621 gcaagcaaaa aagaagaagc aaactaaccc tgcggtgtac cacggagaag acacaataga 1681 ggaaatggat tctgctgaac ctgaacagat ggacactgag caagcaacta accaaactgc 1741 tgaagctggg ggtggagggg gtgggagtgg gggtggtggt ggtgggggtg gtggggttgg 1801 taacagcact ggcggcttta ataacacaac agaattcaaa gtaataaaca atgaagtgta 1861 tattacttgt cacgctacta gaatggtgca catcaaccaa gctgacacag atgaatactt 1921 gatatttaat gctgatagaa ctactgatac caaaacagct caaaaaaaac taaacttaga 1981 attttttgta tatgatgatt ttcaccaaca agtaatgaca ccttggttta tagtagatag 2041 caacgcttgg ggtgtgtgga tgagtcctaa agactttcaa caaatgaaaa cactgtgtag 2101 tgagattagt ttggttactt tggaacaaga gatagacaat gtaaccataa agactgtaac 2161 agaaaccaac caaggtaacg catccaccaa gcaattcaac aatgacttaa ctgcgtcgtt 2221 acaggttgct ttagatacta acaacatact gccatatact ccagctgcgc cgttggggga 2281 aacactgggc tttgttcctt ggagagcaac caaaccaacc caatataggt attatcatcc 2341 atgttacatt tacaacagat atcctaacat tcaaaagctg gggcaggagc aattagaatg 2401 gactggtaca caagatgatt acctgagtgt ggatgagcag tactttaact ttatcactat 2461 agagaacaac atacctatta acattctcag aacgggagat aactttcata caggcttgta 2521 tgagtttaac agtaaaccat gtaaactaac cttaagctat caaagtacac gttgcttggg 2581 gctacctcct ctctgcaaac caaagacaga tacaacacac aaagtaacct caaaagaaaa 2641 cggagctgac ctaatttaca tacaaggaca agataatacc agactaggtc acttttgggg 2701 tgaggaaaga ggtaagaaaa acgcagagat gaacagagtt agaccttaca acataggtta 2761 ccaatatcct gaatggataa taccagcagg gttacagggt agttactttg ctggaggacc 2821 aagacagtgg agtgacacaa ccaaaggtgc aggtacacac agtcaacagt tacaacagaa 2881 ctttagtact aggtacatct atgacagaaa ccacggtgga gacaacgagg tagacctatt 2941 agatggaata cccattcatg aaagaagtaa ctactactca gaccatgaga tagagcaaca 3001 tacagcaaag caaccaaagt tacgtacacc acccattcac cactcaaaaa tagactcgtg 3061 ggaagaagaa ggttggcctg ctgcttcagg cacacacttt gaagatgagg ttatatacct 3121 agactacttt aactttagtg gtgaacaaga attagagttt ccacatgaag tattagatga 3181 tgctgctcaa atgaaaaagc tacttaactc ataccaacca acagttgctc aagacaacgt 3241 tggtcctgta tacccatggg gacagatatg ggacaagaaa cctgatatgg atcacaaacc 3301 tagcatgaac aacaacgctc catttgtatg taaaaacaac cctccaggtc aactctttgt 3361 taaactaaca gaaaacctca ctgatacatt taactatgat gaaaatccag acagaataaa 3421 aacctatggt tactttactt ggagaggcaa gctt // LOCUS MUSAA2DEL 300 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse dilute prenatal lethal Aa2 deletion breakpoint fusion fragment. ACCESSION M33468 KEYWORDS deletion mutant. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 130 133 deletion breakpoint BASE COUNT 82 a 45 c 76 g 97 t ORIGIN 1 agaggctgca cagcgcagac atgttggtag gtaacgtgat agtttagaat tggagtcact 61 gggaatgtga ttatgaaggc ccaagggtac ctgttatctg tagagtaccc agtgtggtgt 121 ggtaagactt ctgcaccttg atagggacgg cttctgagtc agaaaatgtt cttcaaaagt 181 tatgttttac tctctttgct gatatgacta acaatgctgt tgatgattaa ttgataaata 241 tgtggaataa tactgactga tcagtgtaca gattctttgc ttctgagtga ttgccttaaa // LOCUS MUSSL20JA 300 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse dilute lethal-20J (d-l20J) deletion breakpoint fusion fragment. ACCESSION M33467 KEYWORDS deletion mutant. SOURCE Mouse (C57BL/6J-d-l120J/d-v-se allotype) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Strobel,M.C., Seperack,P.K., Copeland,N.G. and Jenkins,N.A. TITLE Molecular analysis of two mouse dilute locus deletion mutations: Spontaneous dilute lethal-20J and radiation-induced dilute prenatal lethal Aa2 alleles JOURNAL Mol. Cell. Biol. 10, 501-509 (1990) STANDARD simple staff_entry FEATURES from to/span description pept.ps / 42 122 dilute gene, exon 3 179 > 300 dilute gene, exon 5 recomb 129 130 deletion breakpoint BASE COUNT 86 a 51 c 61 g 102 t ORIGIN 1 gtgtcctttt gtgttttgca ttgtgtttct ttacacggaa gatcatctac tatggattac 61 caggagttga atgaggatgg agagctctgg atggtttatg aagggttaaa acaagccaac 121 aggttatatc ttgctcaaag acacaagaaa caaatatcca ttgtacctgt tttttcagta 181 ttttgaggaa ttatatgcag atgaccctaa gaagtatcaa tcctatcgga tttcacttta 241 caaaaggatg attgtatgta aaacacagtg cttttctgtt gtcctctgct acttctagcc // LOCUS PRVVP4 2359 bp ds-RNA VRL 26-JUL-1990 DEFINITION Porcine rotavirus capsid protein VP4 gene, complete cds. ACCESSION M33516 KEYWORDS capsid protein VP4. SOURCE Porcine rotavirus (Gottfried strain; serotype 4) DNA. ORGANISM Porcine rotavirus Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 2359) AUTHORS Gorziglia,M., Nishikawa,K., Hoshino,Y. and Taniguchi,K. TITLE Similarity of the outer capsid protein VP4 of the Gottfried strain of porcine rotavirus to that of asymptomatic human rotavirus strains JOURNAL J. Virol. 64, 414-418 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 10 2337 capsid protein VP4 BASE COUNT 812 a 406 c 442 g 699 t ORIGIN 1 ggctataaaa tggcttcgct catttataga cagctgctca ctaattcata cacagttgaa 61 ttatctgatg aaattaaaac aattggatca gaaaagagtc agaatgtaac aattaatccg 121 ggtccgtttg ctcaaacgac ctatgcacca gtcacttgga gacatggaga agtaaacgat 181 tctacaacgg tagaaccagt acttgacggt ccatatcagc caacgagttt caaaccgcca 241 aatgactatt ggatattgtt aaacccgatt aataagggag ttgtattcaa gggtactaac 301 aggactgatg tttgggttgc aatactactc attgaacaac gcgtacctag tcaagatcga 361 caatatacat tatttggaga agtgaagcaa atcactgtag agaatagttc cgacaaatgg 421 aaattctttg aaatgtttag aaacaacgct aacattgatt ttcagcttca acgtccttta 481 acatcagata caaaattagc tggctttcta acacatggtg gacgtgtttg gacatttaat 541 ggtgaaacgc cgcatgctac aactgattac tcaacaactt caaacttacc tgatgtagaa 601 gtagtaatac atactgaatt ctacataata ccaagatctc aagaatctaa atgcaatgag 661 tatattaata ctgggttacc accaatgcaa aacacaagga atgtggttcc agtagcatta 721 tcatctagat ctataactta tcaacgtgca caagttaacg aagatatcat tatatcaaag 781 acttcattgt ggaaagaaat gcaatacaat agagacatta caataagatt taaattcggt 841 aatagcatag taaagcttgg tggattaggt tataaatggt cagaagtctc attcaaagca 901 gcaaattatc agtataatta tttaagggat ggagaacagg tgacagccca cactacttgt 961 tcagttaacg gagtaaataa ttttagttat aatggaggat cactgccaac tgattttagc 1021 gtatctagat atgaattaat aaaagagaat tcatatgttt atatcgatta ctgggatgac 1081 tcacaagcat tcaaaaacat ggtatatgtt agatcacttg cagcaaattt aaattcagtg 1141 aaatgtagtg gaggtaacta taactttaaa attccagttg gtgcatggcc agtaatgagt 1201 ggtggtgcag tatctctaca tttcgcggga gttacattat ctactcaatt tactaatttc 1261 gtatcactca attcactaag attcagattc agtttaactg ttgaggaacc atccttttca 1321 attttgcgta cacgtgtatc aggattgtac ggattaccag cagctaatcc gaataatgga 1381 aatgaatact atgaaatagc gggaagattt tctctcattt tattggtacc atctaatgac 1441 gactatcaaa ctccaattat gaattcagtc accgtacgac aagatttaga acgccaattg 1501 ggcgatttga gagaagaatt taattcactg tcacaagaaa tagctatgac tcaattaata 1561 gacttggctt tattgccgtt agatatgttt tccatgttct caggtattaa aagtacaatt 1621 gatgtggcta aatcaatggc cacaaatgtt atgaaaaagt ttaaaaagtc aggactagct 1681 acatctatat cagaactgac tggatcattg ccgagtgctg catcgtcagt ttcaaggagc 1741 tcttctatta gatctaacat ttcatctatt tcagtgtgga cggatgtttc tgaacaaata 1801 gcagatgcat caaattctgt tagaagtatt tcaacgcaga cgtcagctat tagtaaaaga 1861 cttagattac gtgagatcac tactcagact gaagggatga attttgacga tatttccgct 1921 gctgttctca aaacgcccct agataagtca acacatataa gccctgatac gctgccagat 1981 ataataactg aatcgtctga aaaatttata ccaaaacgcg cttatagagt tttaaagaat 2041 gatgaagtta tggaggctga tgtagatggg aaatttttcg catacagagt tgatactttc 2101 gaagaagtgc catttgatgt ggataaattt gttaatctgg ccactgcttc ccctgtgata 2161 tcagctataa ttgattttaa aacactgaaa aacctgaatg acaactatgg tataacacgc 2221 tctcaagcgc tagatttgat tagatctgat cccagggttc tacgtgattt tatcaatcaa 2281 aacaatccaa ttattaaaaa tagaatagaa caattaatac tgcaatgtag attgtgagag 2341 ctctatagag gatgtgacc // LOCUS RATSTAA 1000 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Rat hydroxysteroid sulfotransferase a (STa) mRNA, complete cds. ACCESSION M33329 KEYWORDS hydroxysteroid sulfotransferase a. SOURCE Rat (strain Sprague-Dawley) female liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1000) AUTHORS Ogura,K., Kajita,J., Narihata,H., Watabe,T., Ozawa,S., Nagata,K., Yamazoe,Y. and Kato,R. TITLE cDNA cloning of the hydroxysteroid sulfotransferase STa sharing a strong homology in amino acid sequence with the senescence marker protein SMP-2 in rat livers JOURNAL Biochem. Biophys. Res. Commun. 166, 1494-1500 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 38 892 hydroxysteroid sulfotransferase a (STa) mRNA < 1 1000 STa mRNA signal 977 982 poly-A signal BASE COUNT 299 a 187 c 211 g 303 t ORIGIN 1 ctggaatcct aacaggacct acacagagct atttataatg ccagactata cttggtttga 61 aggaatacct tttcctgcct ttgggattcc aaaagaaact ttgcaaaatg tttgtaataa 121 gtttgtggtg aaagaagaag atttgatctt attgacttat cccaagtcag gaacaaactg 181 gctgattgaa attgtctgct tgattcagac caagggagat cccaagtgga tccaatctgt 241 gaccatctgg gatcgctcac cctggataga gactgattta ggatatgata tgttaatcaa 301 aaagaaagga ccacgactca taacctccca tcttcccatg catcttttct ccaagtctct 361 cttcagttcc aaggccaagg tgatctatct catcagaaat cccagagatg ttcttgtttc 421 tggttattat ttctggggta agacaactct tgcgaagaag ccagactcac tgggaacgta 481 tgttgaatgg ttcctcaaag gatatgttcc gtatggatca tggtttgagc acatccgtgc 541 ctggctgtct atgcgagaat tagacaactt cttgttactg tactatgaag acatgaaaaa 601 ggatacaatg ggaaccataa agaagatatg tgacttccta gggaaaaaat tagagccaga 661 tgagctggat ttggtcctca agtacagttc cttccaagtc atgaaagaaa acaacatgtc 721 caattataat ctcatggaga aggaactgat tcttcctggt tttactttca tgagaaacgg 781 cactactggg gactggaaga atcacttcac tgtagcccaa gctgaagcct ttgataaagt 841 gtttcaggag aaaatggccg gtttccctcc agggatgttc ccatgggatt aaaatttcaa 901 aagttttaaa tattttatga acattgattt ttatgtttct gttgttctat gtctgaataa 961 gtgaatgtgg tcattgaata aattctattc tggcattgtg // LOCUS SMFPOLENV 3534 bp ss-RNA VRL 26-JUL-1990 DEFINITION Simian foamy virus type 1 polymerase (pol) gene, 3' end; and envelope (env) gene, complete cds. ACCESSION M33561 KEYWORDS envelope protein; polymerase. SOURCE Simian foamy virus type 1, cDNA to viral RNA. ORGANISM Simian foamy virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Spumavirinae. REFERENCE 1 (bases 1 to 3534) AUTHORS Mergia,A., Shaw,K.E.S., Lackner,J.E. and Luciw,P.A. TITLE Relationship of the env genes and the endonuclease domain of the pol genes of simian foamy virus type 1 and human foamy virus JOURNAL J. Virol. 64, 406-410 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 582 polymerase (AA at 1) pept 524 3481 envelope protein BASE COUNT 1164 a 648 c 674 g 1048 t ORIGIN 1 gaattcagta ctccttacca cccccaaagt agtggtaaag tggaaaggaa aaatagtgac 61 attaaacgac ttttaactaa actgctaatt gggagacctg ctaagtggta tgatctacta 121 cctgttgtac aattggcctt aaataattct tatagtccct cttctaaata tactcctcat 181 caactcttgt ttggtgtaga ttccaacaca ccgtttgcaa attctgatac acttgactta 241 tccagagaag aggaactgtc tcttttacag gaaattagat cttctctaca ccagccaacc 301 tcccctcctg cctcctctcg ttcctggtct ccttctgttg gccaactagt ccaggagagg 361 gtagctcgcc ctgcttcact tcgaccacgc tggcataagc ctacagctat tttggaggtc 421 gtgaatcctc ggacagtgat aattttggac catcttggca acagacgtac tgtaagtgtt 481 gacaacctta agttaacagc ttatcaggat aatggcacct ccaatgactc tggaacaatg 541 gctcttatgg aagaagatga gtcaagcaca tcaagcactt gaaaatgtaa ccaccttgac 601 tgaggaacag aagcaacaag ttataataga cattcagcat gaagatgttg ttcctactag 661 gatggacaaa ttgaaatatc tggcctattc atgctgcgct actagcacac gtgtattgtg 721 ctggatagtg ttagtttgcg tcttgctatt agttgtattt atatcctgct ttgtgacaat 781 gtccaggata caatggaata aggatattgc tgtttttggt ccagtcattg actggaatgt 841 tagccaacaa gctgtgattc aacaaataag agctaaaaga ttagcaagat caattagggt 901 ggaacatgct actgagacat atgtagaggt caatatgacc agtatacctc aaggggtgtt 961 atatgtgcct catccagaac caataattct caaggagagg gttcttggtt tatctcaggt 1021 cataatgata aactctgaaa atattgctaa tactgctaac cttactcaag aaactaaggt 1081 actgttagca gacatgatta atgaagagat gaatgattta gctaatcaaa tgatagattt 1141 tgaaatccca ttaggagatc ccagagatca aaaacaatac cagcatcaaa aatgttttca 1201 agaatttgca cattgttatt tagtaaaata taaaactact aaaggatggc ctagttctac 1261 tgttatagca gatcaatgcc ctttgcctgg taaccatcct acagtacaat atgcacatca 1321 aaatatatgg gattattatg tcccctttga acaaattcgg ccagaaggat ggaactcaaa 1381 aagttattat gaagatgcta gaataggagg gttttatata ccaaaatggt tacgaaataa 1441 ttcctatacc catgtcttat tttgttctga tcaaatttat ggaaaatggt ataatattga 1501 tctcacagcc caggagaggg aaaatttatt agtccaaaaa ttaattaatt tagctaaagg 1561 aaattcatca caattaaagg atagagctat gccagctgaa tgggataaac aaggaaaagc 1621 tgatctattt agacaaatta atactttaga tgtttgtaat agaccagaaa tggtattttt 1681 gttaaattcc tcatattatg aattttccct atgggaagga gattgtggtt ttaccagaca 1741 gaatgttaca caggctaatt ccttatgtaa agatttctat aataactcaa aatggcaaaa 1801 attacatcca tattcgtgta gattttggag atataaacaa gagaaagaag aaactaaatg 1861 tagtaatggt gaaaagaaaa aatgtcttta ttacccacaa tgggatactc ctgaagcttt 1921 atatgacttt gggttcctag catatttaaa ttcttttcct tctccaatct gtataaaaaa 1981 tcagactata agggaacctg agtatgaaat ctcttcttta tacctagaat gcatgaatgc 2041 ttcagacaga catggtatag atagtgcttt attagctttg aagacatttt taaactttac 2101 tggtcagtct gtaaacgaaa tgccattagc tagagccttt gtaggcctta ctgaccctaa 2161 atttccacca acatatccca acattacaag ggaatcttct ggttgtaata ataacaaaag 2221 aaaaaggaga agtgttaata attatgaaag acttagatct atgggatatg ctttaactgg 2281 agctgttcaa actttatctc aaatatctga tattaatgat gagaggctgc aacacggagt 2341 atatttactc cgggatcatg tggtaaccct gatggaagct gcccttcatg atgtttcgat 2401 tatggaagga atgttagcaa ttcaacatgt gcatactcat ctcaatcatc tcaagaccat 2461 acttttgatg agaaagattg attggacatt catcagaagt gactggattc aacagcaatt 2521 acagaagaca gatgatgaaa tgaaattgat acgaagaact gcacgaagtc tagtctacta 2581 tgtcacacaa acctccagtt ctcctacagc tacttcctgg gagattggaa tatattatga 2641 aatagtaatt cctaaacata tatatttaaa taattggcaa gtaatcaatg taggtcattt 2701 attggagtca gctggtcatc tgactcatgt aaaggttaag catccttatg aaataattaa 2761 taaggaatgt agtgacactc aatatttaca tcttgaggaa tgcattagag aggattatgt 2821 gatttgtgac atagtacaaa tagttcaacc atgtggaaat gcaacagaat tgagtgattg 2881 tccagtagca gcattaaagg tgaagactcc atatattcaa gtgtctcccc tgaagaatgg 2941 aagttattta gttttatcta gtactaagga ttgttctata cctgcatatg tacctagtgt 3001 ggtcacagtc aatgaaacag ttaagtgctt tggagtagag tttcacaaac cactttatgc 3061 tgaaacaaaa accagctatg aaccacaagt tccgcatttg aagcttcgtt taccccactt 3121 gactgggatt attgccagct tgcaatcact ggaaatagaa gttacttcta cacaagagaa 3181 tataaaagac cagatcgaaa gggccaaagc acagcttctc cggctggaca ttcacgaagg 3241 agactttcct gactggctga aacaagtcgc ctctgcaacc agggacgttt ggcctgctgc 3301 agcttccttt atacaaggag taggtaactt cttatctaat actgcccagg ggatattcgg 3361 ctcagcggta agcctcctat cctatgcaaa acctattttg attggaatag gagttatact 3421 gcttattgcc cttcttttta agataatatc atggcttcct gggaagctca agaagaattg 3481 agagaacttc tacatcatct accagaggac gatccaccag cagatctaac tcat // LOCUS MUSC5DPROA 5401 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse complement component C5D (pro-C5D) mRNA, complete cds. ACCESSION M35526 J05234 KEYWORDS complement component C5D. SOURCE Mouse (strain B10.D2/oSnJ) liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5401) AUTHORS Wetsel,R.A., Fleischer,D.T. and Haviland,D.L. TITLE Deficiency of the murine fifth complement component (C5): A 2- base pair gene deletion in a 5'-exon JOURNAL J. Biol. Chem. 265, 2435-2440 (1990) STANDARD full staff_review COMMENT This coding sequence is translated into a truncated protein of the fifth complement component C5, for the C5S sequence see accession # M35525. FEATURES from to/span description pept 14 664 complement component C5D BASE COUNT 1636 a 1212 c 1221 g 1332 t ORIGIN Chromosome 2. 1 gccgctacca gccatgggtc tttggggaat actttgtctt ttaattttcc tggacaaaac 61 ttggggacag gaacaaacct acgtcatttc agcacccaaa atcctccggg tcggctcgtc 121 tgaaaatgtg gtaattcaag tccatggcta cactgaagca tttgatgcaa ctctttctct 181 aaaaagctat cctgacaaaa aagtcacctt ctcttcaggc tatgttaatt tgtccccgga 241 aaacaaattc caaaacgcgg cactgttgac actacagccc aatcaagttc ctagagaaga 301 aagcccagtc tctcacgtgt atctggaagt tgtgtcaaaa cacttttcaa aatcaaagaa 361 aataccaatt acctataaca atggaattct cttcatccat acagacaaac ctgtttacac 421 gccggaccag tcagtaaaga tcagagtcta ttctctgggt gacgacttga agccagccaa 481 acgggagact gtcttaactt tcatagaccc cgaaggatca gaagttgaca ttgtagaaga 541 aaatgattac accggaatta tctcttttcc tgacttcaag attccatcta atcccaagta 601 tggtgtttgg acaattaaag ctaactataa gaaggatttt acaacaactg gaactgcact 661 ttgaaattaa agaatatgtc ttgccacgat tctctgtttc aatagaacta gaaagaacct 721 tcattggcta taaaaacttt aagaactttg aaatcactgt gaaagcaaga tatttttata 781 ataaagtggt acctgatgct gaagtgtatg ccttttttgg attgagagag gacataaaag 841 atgaggagaa gcagatgatg cacaaagcca cacaagccgc aaagttggtt gacggagttg 901 ctcagatctc ttttgattct gaaacagcag ttaaagagct gtcctacaac agtctagaag 961 acttaaacaa caagtacctt tatattgcag taacagtcac agaatcttca ggtggatttt 1021 cagaagaggc agaaatccct ggagtcaaat atgtcctctc tccctacaca ctgaatttgg 1081 tcgctactcc tcttttcgtg aagcccggga ttccattttc catcaaggca caggttaaag 1141 attcactcga gcaggcggta ggaggggtcc cagtaactct gatggcacaa acagtcgatg 1201 tgaatcaaga gacatctgac ttggaaacaa agaggagcat cactcacgac actgatggag 1261 tagctgtgtt tgtgctgaac ctcccatcaa acgtgacggt gctaaagttt gagatcagaa 1321 ctgatgaccc agaacttccc gaagaaaatc aagccagcaa agagtacgaa gcagttgcgt 1381 actcgtctct cagccaaagt tacatttaca tcgcttggac tgaaaactac aagcccatgc 1441 ttgtgggaga atacctgaat attatggtta cccccaagag cccatatatc gacaaaataa 1501 ctcactataa ttacttgatt ttatccaaag gcaaaattgt acagtacggc acaagagaga 1561 aacttttctc ctcaacttat caaaatataa atattccagt gacacagaac atggttcctt 1621 cagcacgact cctggtctat tacatagtca caggggagca aacagcagaa ttagtggctg 1681 acgcagtctg gataaatatt gaggagaagt gtggcaacca gctccaggtc catctgtctc 1741 cagatgaata tgtgtattct ccaggccaaa ctgtgtccct tgacatggtg actgaagcag 1801 actcatgggt agcactatca gcagtggaca gagctgtgta taaagtccag ggaaacgcca 1861 aaagggccat gcaaagagtc tttcgagctt tggatgaaaa gagtgacctg ggctgtgggg 1921 caggtggtgg ccatgacaat gcagatgtat tccatctagc tgggctcacc ttcctcacca 1981 acgcaaacgc agatgactcc cattatcgtg atgactcttg taaagaaatt ctcaggtcaa 2041 agagaaatct gcatctccta aggcagaaaa tagaagaaca agctgctaag tacaaacata 2101 gtgtgctaaa gaaatgctgc tatgacggag cccgagtgaa cttctatgaa acctgtgagg 2161 agcgagtggc ccgggttacc ataggccctc tctgcatcag ggccttcaac gagtgctgta 2221 ctattgcgaa caagatccga aaagaaagcc cccataaacc tgtccaactg ggaaggatcc 2281 acattaagac cctgttacca gtgatgaagg cagatatccg aagctacttt ccagagagct 2341 ggctatggga aattcatcgc gttcccaaaa gaaaacagct gcaggtcacg ctgcctgact 2401 cactaacgac ttgggaaatt caaggcattg gcatttcaga caatggtata tgtgttgctg 2461 atacactcaa ggcaaaggtg ttcaaagaag tcttcctgga gatgaacata ccatattctg 2521 ttgtgcgagg agaacagatc caattgaaag gaactgttta caactatatg acctcaggga 2581 caaagttctg tgttaaaatg tctgctgtgg agggaatctg cacttcggga agctcagctg 2641 ctagccttca cacctccagg ccctccagat gtgtgttcca gaggatagag ggctcgtcca 2701 gtcacttggt gaccttcacc ctgcttcctc tggaaattgg ccttcactcc ataaacttct 2761 cactagagac ctcatttggg aaagacatct tagtaaagac attacgggta gtgccagaag 2821 gagtcaagag ggaaagctat gccggcgtga ttctggaccc taagggaatt cgtggtattg 2881 ttaacagacg aaaggaattc ccatacagga tcccattaga tttggtcccc aagaccaaag 2941 ttgaaaggat tttgagtgtc aaaggactgc ttgtagggga gttcttgtcc acggttctga 3001 gtaaggaagg catcgacatc ctaacccacc tccccaaggg cagtgcagag gcagagctca 3061 tgagcatagc tccggtgttc tatgttttcc actacctgga agcaggaaac cattggaata 3121 ttttctatcc tgatacactg agtaaaagac agagcctgga gaaaaaaata aaacaagggg 3181 tggtgagcgt catgtcctac agaaacgctg actattccta cagcatgtgg aagggggcga 3241 gcgctagtac ctggctgaca gcttttgctc tgagagtgct tggacaggtg gccaagtatg 3301 taaaacagga tgaaaactca atttgtaact ctttgctatg gctggttgag aagtgtcagc 3361 tggaaaacgg ctctttcaag gaaaattccc aatatctacc aataaaatta cagggtactt 3421 tgcctgctga agcccaagag aaaactttgt atcttacagc cttttctgtg attggaatta 3481 gaaaggcagt tgacatatgc cccaccatga aaatccacac agcgctagat aaagccgact 3541 ccttcctgct tgaaaacacc ctgccatcca agagcacctt cacactggcc attgtagcct 3601 atgctctttc cctaggagac agaacccacc cgaggtttcg tctaattgtg tcggccctga 3661 ggaaggaagc ttttgttaaa ggtgatccgc ccatttaccg ttactggaga gataccctca 3721 aacgtccaga cagctctgtg cccagcagcg gcacagcagg tatggttgaa accacagcct 3781 atgctttgct cgccagcctg aaactgaagg atatgaatta cgccaacccc atcatcaagt 3841 ggctatctga agagcagagg tatggaggcg gcttttattc cacccaggat acgattaatg 3901 ccatcgaggg cctgacagaa tattcactcc tgttaaaaca aattcatttg gatatggaca 3961 tcaatgtcgc ctacaaacac gaaggtgact tccacaagta taaggtgaca gagaagcatt 4021 tcctggggag gccagtggag gtatctctca atgatgacct tgttgtcagc acaggctaca 4081 gcagtggctt ggccacagta tatgtaaaaa ctgtggttca caaaattagt gtctctgagg 4141 aattttgcag cttttacttg aaaattgata cccaagatat tgaagcatcc agccacttca 4201 ggctcagtga ctctggattc aagcgcataa tagcatgtgc cagctacaag cccagcaagg 4261 aggagtcaac atccgggtcc tcccatgcag taatggatat atcactgccg actggaatcg 4321 gagcaaacga ggaagattta cgggctcttg tggaaggagt ggatcaacta ctaactgatt 4381 accagatcaa agatggccat gtcattctgc aactgaattc gatcccctcc agagatttcc 4441 tctgtgtccg gttccggata tttgaacttt tccaagttgg gtttctgaat cctgctacct 4501 tcacggtgta cgagtatcac agaccagata agcagtgcac catgatttat agcatttctg 4561 acaccaggct tcagaaagtc tgtgaaggag cagcttgcac atgtgtggaa gctgactgtg 4621 cgcaactgca ggcagaagtg gacctagcca tctctgcaga ctccagaaaa gagaaagcct 4681 gtaaaccaga gactgcatat gcttataaag tcaggatcac atcagccact gaagaaaatg 4741 tttttgtcaa gtacactgcg actcttctgg tcacttacaa aacaggggaa gctgctgatg 4801 agaattcgga ggtcaccttc attaaaaaga tgagctgtac caatgccaac ctggtgaaag 4861 ggaagcagta tttaatcatg ggcaaagagg ttctgcagat caaacacaat ttcagtttca 4921 agtatatata ccctctagat tcctccacct ggattgaata ttggcccaca gacacaacgt 4981 gtccatcctg tcaagcattt gtagagaatt tgaataactt tgctgaagac ctctttttaa 5041 acagctgtga atgaaaagtt ctgctgcacg aagattcctc ctgcggcggg gggatttctc 5101 ctcctctggc ttggaaacct agcctagaat cagatacact ttctttagag taaagcacaa 5161 gctgatgagt tacgactttg tgaaatggat agccttgagg ggaggcgaaa acaggtcccc 5221 caaggctatc agacgtcagt gccaatagac tgaaacaagt ctgtaaagtt agcagtcagg 5281 ggtgttggtt ggggccggaa gaagagaccc actgaaactg tagcccctta tcaaaacata 5341 tccttgcttg aaagaaaaat accaaggaca gaaaatgcca taaaatcttg actttgcact 5401 c // LOCUS MUSC5PRO 5403 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse complement component C5S (pro-C5) mRNA, complete cds. ACCESSION M35525 M15079 J05234 KEYWORDS clotting factor; complement component C5; complement protein. SOURCE Mouse (strain B10.D2/nSnJ) liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 141 to 5403) AUTHORS Wetsel,R.A., Ogata,R.T. and Tack,B.F. TITLE Primary structure of the fifth component of murine complement JOURNAL Biochemistry 26, 737-743 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 5403) AUTHORS Wetsel,R.A., Fleischer,D.T. and Haviland,D.L. TITLE Deficiency of the murine fifth complement component (C5): A 2- base pair gene deletion in a 5'-exon JOURNAL J. Biol. Chem. 265, 2435-2440 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Noack, 02-APR-1987; for [2] by R.A.Wetsel, 01-DEC-1989. For the C5D sequence see accession # M35526. FEATURES from to/span description pept 14 5056 complement component C5S precursor sigp 14 67 complement component C5S signal peptide matp 68 2035 complement component pro-C5S beta-chain matp 2048 5053 complement component pro-C5S alpha-chain mut 659 660 2 bp deletion in C5D BASE COUNT 1640 a 1212 c 1219 g 1332 t ORIGIN Chromosome 2. 1 gccgctacca gccatgggtc tttggggaat actttgtctt ttaattttcc tggacaaaac 61 ttggggacag gaacaaacct acgtcatttc agcacccaaa atcctccggg tcggctcgtc 121 tgaaaatgtg gtaattcaag tccatggcta cactgaagca tttgatgcaa ctctttctct 181 aaaaagctat cctgacaaaa aagtcacctt ctcttcaggc tatgttaatt tgtccccgga 241 aaacaaattc caaaacgcgg cactgttgac actacagccc aatcaagttc ctagagaaga 301 aagcccagtc tctcacgtgt atctggaagt tgtgtcaaaa cacttttcaa aatcaaagaa 361 aataccaatt acctataaca atggaattct cttcatccat acagacaaac ctgtttacac 421 gccggaccag tcagtaaaga tcagagtcta ttctctgggt gacgacttga agccagccaa 481 acgggagact gtcttaactt tcatagaccc cgaaggatca gaagttgaca ttgtagaaga 541 aaatgattac accggaatta tctcttttcc tgacttcaag attccatcta atcccaagta 601 tggtgtttgg acaattaaag ctaactataa gaaggatttt acaacaactg gaactgcata 661 ctttgaaatt aaagaatatg tcttgccacg attctctgtt tcaatagaac tagaaagaac 721 cttcattggc tataaaaact ttaagaactt tgaaatcact gtgaaagcaa gatattttta 781 taataaagtg gtacctgatg ctgaagtgta tgcctttttt ggattgagag aggacataaa 841 agatgaggag aagcagatga tgcacaaagc cacacaagcc gcaaagttgg ttgacggagt 901 tgctcagatc tcttttgatt ctgaaacagc agttaaagag ctgtcctaca acagtctaga 961 agacttaaac aacaagtacc tttatattgc agtaacagtc acagaatctt caggtggatt 1021 ttcagaagag gcagaaatcc ctggagtcaa atatgtcctc tctccctaca cactgaattt 1081 ggtcgctact cctcttttcg tgaagcccgg gattccattt tccatcaagg cacaggttaa 1141 agattcactc gagcaggcgg taggaggggt cccagtaact ctgatggcac aaacagtcga 1201 tgtgaatcaa gagacatctg acttggaaac aaagaggagc atcactcatg acactgatgg 1261 agtagctgtg tttgtgctga acctcccatc aaatgtgacg gtgctaaagt ttgagatcag 1321 aactgatgac ccagaacttc ccgaagaaaa tcaagccagc aaagagtacg aagcagttgc 1381 gtactcgtct ctcagccaaa gttacattta catcgcttgg actgaaaact acaagcccat 1441 gcttgtggga gaatacctga atattatggt tacccccaag agcccatata tcgacaaaat 1501 aactcactat aattacttga ttttatccaa aggcaaaatt gtacagtacg gcacaagaga 1561 gaaacttttc tcctcaactt atcaaaatat aaatattcca gtgacacaga acatggttcc 1621 ttcagcacga ctcctggtct attacatagt cacaggggag caaacagcag aattagtggc 1681 tgacgcagtc tggataaata ttgaggagaa gtgtggcaac cagctccagg tccatctgtc 1741 tccagatgaa tatgtgtatt ctccaggcca aactgtgtcc cttgacatgg tgactgaagc 1801 agactcatgg gtagcactat cagcagtgga cagagctgtg tataaagtcc agggaaacgc 1861 caaaagggcc atgcaaagag tctttcaagc tttggatgaa aagagtgacc tgggctgtgg 1921 ggcaggtggt ggccatgaca atgcagatgt attccatcta gctgggctca ccttcctcac 1981 caacgcaaac gcagatgact cccattatcg tgatgactct tgtaaagaaa ttctcaggtc 2041 aaagagaaac ctgcatctcc taaggcagaa aatagaagaa caagctgcta agtacaaaca 2101 tagtgtgcca aagaaatgct gctatgacgg agcccgagtg aacttctacg aaacctgtga 2161 ggagcgagtg gcccgggtta ccataggccc tctctgcatc agggccttca acgagtgctg 2221 tactattgcg aacaagatcc gaaaagaaag cccccataaa cctgtccaac tgggaaggat 2281 ccacattaag accctgttac cagtgatgaa ggcagatatc cgaagctact ttccagagag 2341 ctggctatgg gaaattcatc gcgttcccaa aagaaaacag ctgcaggtca cgctgcctga 2401 ctcactaacg acttgggaaa ttcaaggcat tggcatttca gacaatggta tatgtgttgc 2461 tgatacactc aaggcaaagg tgttcaaaga agtcttcctg gagatgaaca taccatattc 2521 tgttgtgcga ggagaacaga tccaattgaa aggaactgtt tacaactata tgacctcagg 2581 gacaaagttc tgtgttaaaa tgtctgctgt ggaggggatc tgcacttcag gaagctcagc 2641 tgctagcctt cacacctcca ggccctccag atgtgtgttc cagaggatag agggctcgtc 2701 cagtcacttg gtgaccttca ccctgcttcc tctggaaatt ggccttcact ccataaactt 2761 ctcactagag acctcatttg ggaaagacat cttagtaaag acattacggg tagtgccaga 2821 aggagtcaag agggaaagct atgccggcgt gattctggac cctaagggaa ttcgtggtat 2881 tgttaacaga cgaaaggaat tcccatacag gatcccatta gatttggtcc ccaagaccaa 2941 agttgaaagg attttgagtg tcaaaggact gcttgtaggg gagttcttgt ccacggttct 3001 gagtaaggaa ggcatcaaca tcctaaccca cctccccaag ggcagtgcag aggcagagct 3061 catgagcata gctccggtgt tctatgtttt ccactacctg gaagcaggaa accattggaa 3121 tattttctat cctgatacac tgagtaaaag acagagcctg gagaaaaaaa taaaacaagg 3181 ggtggtgagc gtcatgtcct acagaaacgc tgactattcc tacagcatgt ggaagggggc 3241 gagcgctagt acctggctga cagcttttgc tctgagagtg cttggacagg tggccaagta 3301 tgtaaaacag gatgaaaact caatttgtaa ctctttgcta tggctggttg agaagtgtca 3361 gctggaaaac ggctctttca aggaaaattc ccaatatcta ccaataaaat tacagggtac 3421 tttgcctgct gaagcccaag agaaaacttt gtatcttaca gccttttctg tgattggaat 3481 tagaaaggca gttgacatat gccccaccat gaaaatccac acagcgctag ataaagccga 3541 ctccttcctg cttgaaaaca ccctgccatc caagagcacc ttcacactgg ccattgtagc 3601 ctatgctctt tccctaggag acagaaccca cccgaggttt cgtctaattg tgtcggccct 3661 gaggaaggaa gcttttgtta aaggtgatcc gcccatttac cgttactgga gagataccct 3721 caaacgtcca gacagctctg tgcccagcag cggcacagca ggtatggttg aaaccacagc 3781 ctatgctttg ctcgccagcc tgaaactgaa ggatatgaat tacgccaacc ccatcatcaa 3841 gtggctatct gaagagcaga ggtatggagg cggcttttat tccacccagg atacgattaa 3901 tgccatcgag ggcctgacag aatattcact cctgttaaaa caaattcatt tggatatgga 3961 catcaatgtc gcctacaaac acgaaggtga cttccacaag tataaggtga cagagaagca 4021 tttcctgggg aggccagtgg aggtatctct caatgatgac cttgttgtca gcacaggcta 4081 cagcagtggc ttggccacag tatatgtaaa aactgtggtt cacaaaatta gtgtctctga 4141 ggaattttgc agcttttact tgaaaattga tacccaagat attgaagcat ccagccactt 4201 caggctcagt gactctggat tcaagcgcat aatagcatgt gccagctaca agcccagcaa 4261 ggaggagtca acatccgggt cctcccatgc agtaatggat atatcactgc cgactggaat 4321 cggagcaaac gaggaagatt tacgggctct tgtggaagga gtggatcaac tactaactga 4381 ttaccagatc aaagatggcc atgtcattct gcaactgaat tcgatcccct ccagagattt 4441 cctctgtgtc cggttccgga tatttgaact tttccaagtt gggtttctga atcctgctac 4501 cttcacggtg tacgagtatc acagaccaga taagcagtgc accatgattt atagcatttc 4561 tgacaccagg cttcagaaag tctgtgaagg agcagcttgc acatgtgtgg aagctgactg 4621 tgcgcaactg caggcagaag tagacctagc catctctgca gactccagaa aagagaaagc 4681 ctgtaaacca gagactgcat atgcttataa agtcaggatc acatcagcca ctgaagaaaa 4741 tgtttttgtc aagtacactg cgactcttct ggtcacttac aaaacagggg aagctgctga 4801 tgagaattcg gaggtcacct tcattaaaaa gatgagctgt accaatgcca acctggtgaa 4861 agggaagcag tatttaatca tgggcaaaga ggttctgcag atcaaacaca atttcagttt 4921 caagtatata taccctctag attcctccac ctggattgaa tattggccca cagacacaac 4981 gtgtccatcc tgtcaagcat ttgtagagaa tttgaataac tttgctgaag acctcttttt 5041 aaacagctgt gaatgaaaag ttctgctgca cgaagattcc tcctgcggcg gggggattgc 5101 tcctcctctg gcttggaaac ctagcctaga atcagataca ctttctttag agtaaagcac 5161 aagctgatga gttacgactt tgtgaaatgg atagccttga ggggaggcga aaacaggtcc 5221 cccaaggcta tcagatgtca gtgccaatag actgaaacaa gtctgtaaag ttagcagtca 5281 ggggtgttgg ttggggccgg aagaagagac ccactgaaac tgtagcccct tatcaaaaca 5341 tatccttgct tgaaagaaaa ataccaagga cagaaaatgc cataaaatct tgactttgca 5401 ctc // LOCUS HUMENN 1592 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human endonexin II mRNA, complete cds. ACCESSION J03745 KEYWORDS Ca2+ -dependent phospholipid binding protein; endonexin. SOURCE Human placenta, cDNA to mRNA, (library of Clonetech Laboratories Inc.). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1592) AUTHORS Kaplan,R., Jaye,M., Burgess,W.H., Schlaepfer,D.D. and Haigler,H.T. TITLE Cloning and expression of cDNA for human endonexin II, a Ca2+ and phospholipid binding protein JOURNAL J. Biol. Chem. 263, 8037-8043 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.T.Haigler, 06-APR-1988 FEATURES from to/span description pept 160 1122 endonexin II /hgml_locus_uid="LS0217S" /nomgen="ENX2" /map="4q28-q32" mRNA < 1 1592 endonexin II mRNA BASE COUNT 434 a 337 c 366 g 455 t ORIGIN 284 bp upstream of HincII site. 1 ttggatcagt ctaggtgcag ctgccggatc cttcagcgtc tgcatctcgg cgtcgcccgc 61 gtaccgtcgc ccggctctcc gccgctctcc cggggtttcg gggcacttgg gtcccacagt 121 ctggtcctgc ttcaccttcc cctgacctga gtagtcgcca tggcacaggt tctcagaggc 181 actgtgactg acttccctgg atttgatgag cgggctgatg cagaaactct tcggaaggct 241 atgaaaggct tgggcacaga tgaggagagc atcctgactc tgttgacatc ccgaagtaat 301 gctcagcgcc aggaaatctc tgcagctttt aagactctgt ttggcaggga tcttctggat 361 gacctgaaat cagaactaac tggaaaattt gaaaaattaa ttgtggctct gatgaaaccc 421 tctcggcttt atgatgctta tgaactgaaa catgccttga agggagctgg aacaaatgaa 481 aaagtactga cagaaattat tgcttcaagg acacctgaag aactgagagc catcaaacaa 541 gtttatgaag aagaatatgg ctcaagcctg gaagatgacg tggtggggga cacttcaggg 601 tactaccagc ggatgttggt ggttctcctt caggctaaca gagaccctga tgctggaatt 661 gatgaagctc aagttgaaca agatgctcag gctttatttc aggctggaga acttaaatgg 721 gggacagatg aagaaaagtt tatcaccatc tttggaacac gaagtgtgtc tcatttgaga 781 aaggtgtttg acaagtacat gactatatca ggatttcaaa ttgaggaaac cattgaccgc 841 gagacttctg gcaatttaga gcaactactc cttgctgttg tgaaatctat tcgaagtata 901 cctgcctacc ttgcagagac cctctattat gctatgaagg gagctgggac agatgatcat 961 accctcatca gagtcatggt ttccaggagt gagattgatc tgtttaacat caggaaggag 1021 tttaggaaga attttgccac ctctctttat tccatgatta agggagatac atctggggac 1081 tataagaaag ctcttctgct gctctgtgga gaagatgact aacgtgtcac ggggaagagc 1141 tccctgctgt gtgcctgcac caccccactg ccttccttca gcacctttag ctgcatttgt 1201 atgccagtgc ttaacacatt gccttattca tactagcatg ctcatgacca acacatacac 1261 gtcatagaat gaaaatagtg gtgcttcttt ctgatctcta gtggagatct ctttgactgc 1321 tgtagtacta aagtgtactt aatgttacta agtttaatgc ctggccattt tccatttata 1381 tatatttttt aagaggctag agtgctttta gcctttttta aaaactccat ttatattaca 1441 tttgtaacca tgatacttta atcagaagct tagccttgaa attgtgaact cttggaaatg 1501 ttattagtga agttcgcaac taaactaaac ctgtaaaatt atgatgattg tattcaaaag 1561 attaatgaaa aataaacatt tctgtccccc tg // LOCUS CPAFPRFA 1277 bp ds-DNA ORG 26-JUL-1990 DEFINITION C.paradoxa cyanelle ferredoxin (petF) and ribosomal protein S10 (rps10; rpsJ) genes, complete cds, and elongation factor Tu (tufA) gene, 5' end. ACCESSION M35206 KEYWORDS elongation factor Tu; ferredoxin; ribosomal protein S10. SOURCE C.paradoxa (isolate UTEX LB555) cyanelle DNA, clone pCpcGP1.3. ORGANISM Cyanelle Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae; Cyanophora paradoxa. REFERENCE 1 (bases 1 to 1277) AUTHORS Bryant,D.A., Schluchter,W.M. and Stirewalt,V.L. TITLE Ferredoxin and ribosomal protein S10 are encoded on the cyanelle genome of Cyanophora paradoxa JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.A.Bryant, 14-JUN-1990. Author address: D.A.Bryant s-101 Frear Bldg. Dept. of Mol. and Cell Biol. Pennsylvania State University University Park, PA 16802 email: DAB14@PSUVM FEATURES from to/span description pept 338 637 ferredoxin (petF) pept 1027 710 (c) ribosomal protein S10 (rps10; rpsJ) pept > 1277 1114 (c) elongation factor Tu (tufA; AA at 1275) binding 313 318 ribosome binding site rpt 643 694 inverted repeat rpt 1065 1097 inverted repeat BASE COUNT 477 a 170 c 163 g 467 t ORIGIN 1 agatcttatc taagatatgt aaataaataa aaatatatat ctatatttat agtatatatt 61 aatttttttt aaaaatcgat actaaattta aattttcctt ttttttcttt ataaaaattt 121 aattttaaat agaaaaaatt aagtttttcg aaaaaagcaa ttaaaacata ttaaaaaaaa 181 attaataaac atggtaaact ttaaatataa atttataatt aactgaaaaa ataataaaaa 241 taaatttata tatatatata ttttagatta aaataattta aattaaatta ttaaaagttc 301 taccttgtaa ctataattat ttaggagata gtattttatg gcagtatata aagttcgtct 361 tatttgtgaa gaacaaggtt tagataccac tattgaatgt ccagatgatg agtacattct 421 tgatgcagca gaagaacaag gtattgattt accatactcc tgtcgtgcag gtgcatgttc 481 tacttgtgca ggtaaagtgg tagaaggaac tgtagatcaa tctgatcaat ctttcttaga 541 tgacgctcaa ttagcagctg gttatgtatt aacttgtgta gcatacccat cttctgactg 601 tacagttaaa actcaccaag aagaatctct ttactaaaaa ataaaaaatc taaataataa 661 aatagaaatc tctattttat tatttagatt ttcttaattc aaaaaaaaac taaagtttaa 721 cttccacatc aacacctgct ggtaaatcta aacgagttaa agtatcaatt gttttggaag 781 atggtaaata taaatcaatt attctgcgat gaactctaat ttcgaaatgt tctcgtgaat 841 ctttatctac atgtggggaa cgtaaaacgc aataaatttt cttttttgtt ggtaaaggaa 901 taggtcctac tgcggtagca tcagttcgtt ttgcagcttc aataatttgt tcacatgagt 961 tttctaataa tgaagagtca taagaacgta gttgaatacg aatttttaat tgttgattac 1021 tggccataat ttttaatttt taatttttat tttttaaatt aaaaagagag aaataaatac 1081 attttctatt tctctctaaa atttagattt taattatttt aaaatcttag aaactacacc 1141 tgcaccaatt gtacgaccac cttcacgaat cgcgaaacgc ataccttgtt caatcgcaat 1201 tggatgtact aaacttactg tcattttaat acgatctcct ggcataacca tttctgcatt 1261 actaccatca tctgcag // LOCUS MUSIGHZSA 333 bp ds-DNA ROD 26-JUL-1990 DEFINITION Mouse Ig germline H-chain gene, D region. ACCESSION M35332 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Mouse liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 333) AUTHORS Landolfi,N.F., Capra,J.D. and Tucker,P.W. TITLE Germ-line sequence of the D-H segment employed in Ars-A antibodies: Implications for the generation of junctional diversity JOURNAL J. Immunol. 137, 362-365 (1986) STANDARD simple staff_review FEATURES from to/span description pept / 107 / 129 Ig heavy chain D region (AA at 107; 107 could be 109) iDNA < 1 106 V-D intervening DNA iDNA 130 > 333 D-J intervening DNA BASE COUNT 95 a 98 c 62 g 78 t ORIGIN 1 tgacaactga aactcaaccg tgctgcctgg cccccaatgc tctctacacc tgcaaaacca 61 gagaccatac tggccagtgc tttttgtgaa gggatctact actgtgttta ttactatggt 121 ggtagctacc acagtgctat atccatcagc aaaaacccat tgtgcccagc agactcttga 181 gctcgaaaaa ctgagtctag aaaagctggc atcacggggt ttatatcccg agtcttgacc 241 actgacccat taatactatc caacacagag ctctccgtct gcccacaaag aaatccaacc 301 accctaaagt cagatcctct agagtcgacc tgc // LOCUS WHTREPTA 295 bp ds-DNA PLN 26-JUL-1990 DEFINITION T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD1. ACCESSION M35329 KEYWORDS repetitive DNA. SOURCE T.monococcum aegilopoides leaf DNA, clone pTbUCD1. ORGANISM Triticum monococcum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 295) AUTHORS Dvorak,J., McGuire,P.E. and Cassidy,B. TITLE Apparent sources of the A genomes of wheats inferred from polymorphism in abundance and restriction fragment length of repeated nucleotide sequences JOURNAL Genome 30, 680-689 (1988) STANDARD simple staff_review BASE COUNT 72 a 58 c 66 g 99 t ORIGIN 1 tccagacttg ggtaacaggg tgtgccttag aatcccagtt gatagtgggc agtcctgaca 61 gaagatagtg cactgagcca aacttgaatg tgtcaagtgc ttcattcgga atctccttgt 121 acatgttgaa catagagttg tggtccatct ttttcttggc ataaatgtcc aagtcatctg 181 cttgctcctc tggggcattg atcattataa gtaatagtct tttcttcaac tttaataggt 241 gcagctactt ttacttctat gggaggatga tatttaaacc acttctcctt gggga // LOCUS WHTREPTB 273 bp ds-DNA PLN 26-JUL-1990 DEFINITION T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD2. ACCESSION M35330 KEYWORDS repetitive DNA. SOURCE T.monococcum aegilopoides leaf DNA, clone pTbUCD2. ORGANISM Triticum monococcum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 273) AUTHORS Dvorak,J., McGuire,P.E. and Cassidy,B. TITLE Apparent sources of the A genomes of wheats inferred from polymorphism in abundance and restriction fragment length of repeated nucleotide sequences JOURNAL Genome 30, 680-689 (1988) STANDARD simple staff_review BASE COUNT 58 a 44 c 74 g 97 t ORIGIN 1 ctggccatgg agggcctatg tagatagaca ggcttcgaga agcttctttc tttctagtgt 61 ctgtactcag accggttgct tccgcatgtg cttgtatgag tgtatgactt gagtgtcggg 121 tcatgtgacc cctatctgta tgaacatgtt atgtatggct ctctagagcc tttaaataaa 181 gtacttgagt tgtagagtat tgttgtgatg ccatgttgta tgtactcata tcgggcatat 241 tgtgtgtatg attgaaatgc ttggtatgag tgg // LOCUS WHTREPTC 229 bp ds-DNA PLN 26-JUL-1990 DEFINITION T.monococcum aegilopoides repetitive DNA sequence, clone pTbUCD3. ACCESSION M35331 KEYWORDS repetitive DNA. SOURCE T.monococcum aegilopoides leaf DNA, clone pTbUCD3. ORGANISM Triticum monococcum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 229) AUTHORS Dvorak,J., McGuire,P.E. and Cassidy,B. TITLE Apparent sources of the A genomes of wheats inferred from polymorphism in abundance and restriction fragment length of repeated nucleotide sequences JOURNAL Genome 30, 680-689 (1988) STANDARD simple staff_review BASE COUNT 85 a 55 c 37 g 52 t ORIGIN 1 caaattagct actccagtat gtaaaaacct gtttgtccaa cacttagcag atttcactct 61 tgatagatca ctagcaatag ctcccgcaaa atcgcaaaag agttcatgat ctgcccaaaa 121 caacaactat gcaaaagttg agctcgattg agtcaaccta gggtgctcca acataacaag 181 taaagacatg gatggattaa gcacaacaag catgacaaac cactcttac // LOCUS RATMTXXX 169 bp ds-DNA ORG 26-JUL-1990 DEFINITION Rat mitochondrial HindIII fragment. ACCESSION M35251 KEYWORDS . SOURCE Rat mitochondrial DNA. ORGANISM Mitochondrion Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; Rattus norvegicus. REFERENCE 1 (bases 1 to 169) AUTHORS Brown,G.G., Castora,F.J., Frantz,S.C. and Simpson,M.V. TITLE Mitochondrial DNA polymorphism: Evolutionary studies on the genus Rattus JOURNAL Ann. N.Y. Acad. Sci. 361, 135-153 (1981) STANDARD simple staff_review FEATURES from to/span description ORF < 1 > 169 ORF allele 56 56 a in type A; g in type B allele 80 80 a in type A; g in type B allele 122 122 t in type A; g in type B BASE COUNT 44 a 24 c 43 g 58 t ORIGIN 1 agcttgctaa tagtcatcat gttgctatca atggaaagat tatttgtaat cctcgagcta 61 taattatagt tcggctgtga attcgttcgt agttggtgtt tgctaggcag aataagagtg 121 atgaggttaa gccgtgggcg attattagta ttgtagctcc catgaagct // LOCUS MUSCRABP 868 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse cellular retinoic acid-binding protein (CRABP-II) mRNA, complete cds. ACCESSION M35523 KEYWORDS cellular retinoic acid-binding protein. SOURCE Mouse 12.5 day old embryo, cDNA to mRNA, clone lambda-mE2.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 868) AUTHORS Giguere,V., Lyn,S., Yip,P., Siu,C.-H. and Amin,S. TITLE Molecular cloning of a novel cellular retinoic acid-binding protein expressed during mouse embryogenesis and in adult skin JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.Giguere, 22-JUN-1990. FEATURES from to/span description pept 116 532 cellular retinoic acid-binding protein BASE COUNT 226 a 218 c 220 g 204 t ORIGIN Chromosome 2. 1 gaattccggg gaggatctgt tctgcaaagg agacagcaaa gtatctttag cctaaaggac 61 tcagcgtcca gtgttctagt tgaagatcta aagagaaagc caccttgctg ccactatgcc 121 taacttttct ggcaactgga agatcatccg atcggaaaac tttgaggaaa tgctaaaagc 181 tctgggggtg aacatgatga tgaggaagat cgctgtggct gcagcctcca agccagcagt 241 cgagatcaaa caggagaatg acactttcta catcaaaacc tccaccactg tgcgaaccac 301 ggagattaac ttcaagatcg gggaggaatt tgaggagcag accgtggatg ggagaccctg 361 taagagtttg gtgaaatggg agagtggaaa caaaatggtg tgcgagcaga ggcttctgaa 421 gggggagggc cccaagacct cctggagccg agaactgacc aatgatggag agctgatcct 481 gacaatgaca gcagatgacg ttgtgtgcac cagggtctac gtccgagagt gagtgcctac 541 gggtccaaga actgcctgag acgacttctg tgcccgctac aggacacaaa cctccctccc 601 acgtccatct tacaaactag ctctcccctt actcctgagg gttactgctt cctccaaggc 661 cttttgttct ttgccttctc tacgccagag aggggcagaa gctcagaacc ctcccaccgc 721 catttgcccc tcccaggtca gcagtcccag ctccatacca gggtccttcc tggaagagac 781 tgtctctctg gcctctactc cttatccttg tagtctgtgt gatttagaat atttattggt 841 taattttatt aaaatgtttc cggaattc // LOCUS BTHCRYIA 4320 bp ds-DNA BCT 26-JUL-1990 DEFINITION B.thuringiensis delta-endotoxin gene, complete cds. ACCESSION M35524 KEYWORDS delta-endotoxin. SOURCE B.thuringiensis kenyae (strain HD588) DNA. ORGANISM Bacillus thuringiensis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 4320) AUTHORS Von Tersch,M.A., Loidl,R.H., Jany,C.S. and Johnson,T.B. TITLE Insecticidal toxin genes from Bacillus thuringiensis variety kenyae: Cloning characterization and comparative studies JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.A.Von Tersch, 19-JUN-1990. Author address: M.A.Von Tersch Ecogen Inc. 2005 Cabot Blvd. West Loanghorne PA 19047 FEATURES from to/span description pept 239 3772 delta-endotoxin binding 228 232 ribosome binding site BASE COUNT 1392 a 724 c 909 g 1295 t ORIGIN 1 gttaacggaa tacaaaccct taatgcattg gttaaacatt gtaaagtcta aagcatggat 61 aatgggcgag aagtaagtag attgttaaca ccctgggtca aaaattgata tttagtaaaa 121 ttagttgcac tttgtgcatt ttttcataag atgagtcata tgttttaaat tgtagtaatg 181 aaaaacagta ttatatcata atgaattggt atcttaataa aagagatgga ggtaacttat 241 ggataacaat ccgaacatca atgaatgcat tccttataat tgtttaagta accctgaagt 301 agaagtatta ggtggagaaa gaatagaaac tggttacacc ccaatcgata tttccttgtc 361 gctaacgcaa tttcttttga gtgaatttgt tcccggtgct ggatttgtgt taggactagt 421 tgatataata tggggaattt ttggtccctc tcaatgggac gcatttcttg tacaaattga 481 acagttaatt aaccaaagaa tagaagaatt cgctaggaac caagccattt ctagattaga 541 aggactaagc aatctttatc aaatttacgc agaatctttt agagagtggg aagcagatcc 601 tactaatcca gcattaagag aagagatgcg tattcaattc aatgacatga acagtgccct 661 tacaaccgct attcctcttt tggcagttca aaattatcaa gttcctcttt tatcagtata 721 tgttcaagct gcaaatttac atttatcagt tttgagagat gtttcagtgt ttggacaaag 781 gtggggattt gatgccgcga ctatcaatag tcgttataat gatttaacta ggcttattgg 841 caactataca gatcatgctg tacgctggta caatacggga ttagaacgtg tatggggacc 901 ggattctaga gattgggtaa ggtataatca atttagaaga gaattaacac taactgtatt 961 agatatcgtt gctctgttcc cgaattatga tagtagaaga tatccaattc gaacagtttc 1021 ccaattaaca agagaaattt atacaaaccc agtattagaa aattttgatg gtagttttcg 1081 aggctcggct cagggcatag aaagaagtat taggagtcca catttgatgg atatacttaa 1141 cagtataacc atctatacgg atgctcatag gggttattat tattggtcag ggcatcaaat 1201 aatggcttct cctgtcggtt tttcggggcc agaattcacg tttccgctat atggaaccat 1261 gggaaatgca gctccacaac aacgtattgt tgctcaacta ggtcagggcg tgtatagaac 1321 attatcctct actttttata gaagaccttt taatataggg ataaataatc aacaactatc 1381 tgttcttgac gggacagaat ttgcttatgg aacctcctca aatttgccat ccgctgtata 1441 cagaaaaagc ggaacggtag attcgctgga tgaaatacca ccacagaata acaacgtgcc 1501 acctaggcaa ggatttagtc atcgattaag ccatgtttca atgtttcgtt caggctctag 1561 tagtagtgta agtataataa gagctcctat gttctcttgg atacatcgta gtgctgaatt 1621 taataatata attgcatcgg atagtattac tcaaatccct gcagtgaagg gaaactttct 1681 ttttaatggt tctgtaattt caggaccagg atttactggt ggggacttag ttagattaaa 1741 tagtagtgga aataacattc agaatagagg gtatattgaa gttccaattc acttcccatc 1801 gacatctacc agatatcgag ttcgtgtacg gtatgcttct gtaaccccga ttcacctcaa 1861 cgttaattgg ggtaattcat ccattttttc caatacagta ccagctacag ctacgtcatt 1921 agataatcta caatcaagtg attttggtta ttttgaaagt gccaatgctt ttacatcttc 1981 attaggtaat atagtaggtg ttagaaattt tagtgggact gcaggagtga taatagacag 2041 atttgaattt attccagtta ctgcaacact cgaggctgaa tataatctgg aaagagcgca 2101 gaaggcggtg aatgcgctgt ttacgtctac aaaccaacta gggctaaaaa caaatgtaac 2161 ggattatcat attgatcaag tgtccaattt agttacgtgt ttatcggatg aattttgtct 2221 ggatgaaaag cgagaattgt ccgagaaagt caaacatgcg aagcgactca gtgatgaacg 2281 caatttactc caagattcaa atttcaaaga cattaatagg caaccagaac gtgggtgggg 2341 cggaagtaca gggattacca tccaaggagg ggatgacgta tttaaagaaa attacgtcac 2401 actatcaggt acctttgatg agtgctatcc aacatatttg tatcaaaaaa tcgatgaatc 2461 aaaattaaaa gcctttaccc gttatcaatt aagagggtat atcgaagata gtcaagactt 2521 agaaatctat ttaattcgct acaatgcaaa acatgaaaca gtaaatgtgc caggtacggg 2581 ttccttatgg ccgctttcag cccaaagtcc aatcggaaag tgtggagagc cgaatcgatt 2641 cgcgccacac cttgaatgga atcctgactt agattgttcg tgtagggatg gagaaaagtg 2701 tgcccatcat tcgcatcatt tctccttaga cattgatgta ggatgtacag acttaaatga 2761 ggacctaggt gtatgggtga tctttaagat taagacgcaa gatgggcacg caagactagg 2821 gaatctagag tttctcgaag agaaaccatt agtaggagaa gcgctagctc gtgtgaaaag 2881 agcggagaaa aaatggagag acaaacgtga aaaattggaa tgggaaacaa atatcgttta 2941 taaagaggca aaagaatctg tagatgcttt atttgtaaac tctcaatatg atcaattaca 3001 agcggatacg aatattgcca tgattcatgc ggcagataaa cgtgttcata gcattcgaga 3061 agcttatctg cctgagctgt ctgtgattcc gggtgtcaat gcggctattt ttgaagaatt 3121 agaagggcgt attttcactg cattctccct atatgatgcg agaaatgtca ttaaaaatgg 3181 tgattttaat aatggcttat cctgctggaa cgtgaaaggg catgtagatg tagaagaaca 3241 aaacaaccaa cgttcggtcc ttgttgttcc ggaatgggaa gcagaagtgt cacaagaagt 3301 tcgtgtctgt ccgggtcgtg gctatatcct tcgtgtcaca gcgtacaagg agggatatgg 3361 agaaggttgc gtaaccattc atgagatcga gaacaataca gacgaactga agtttagcaa 3421 ctgcgtagaa gaggaaatct atccaaataa cacggtaacg tgtaatgatt atactgtaaa 3481 tcaagaagaa tacggaggtg cgtacacttc tcgtaatcga ggatataacg aagctccttc 3541 cgtaccagct gattatgcgt cagtctatga agaaaaatcg tatacagatg gacgaagaga 3601 gaatccttgt gaatttaaca gagggtatag ggattacacg ccactaccag ttggttatgt 3661 gacaaaagaa ttagaatact tcccagaaac cgataaggta tggattgaga ttggagaaac 3721 ggaaggaaca tttatcgtgg acagcgtgga attactcctt atggaggaat agtctcatgc 3781 aaactcaggt ttaaatatcg ttttcaaatc aattgtccaa gagcagcatt acaaatagat 3841 aagtaatttg ttgtaatgaa aaacggacat cacctccatt gaaacggagt gatgtccgtt 3901 ttactatgtt attttctagt aatacatatg tatagagcaa cttaatcaag cagagatatt 3961 ttcacctatc gatgaaaata tctctgcttt ttcttttttt atttggtata tgctttactt 4021 gtaatcgaaa ataaagcact aatagggtgt ttttgcccat cccttcggga aatcaagact 4081 aaaatgaaaa ataaacagaa aatataaggc tcttactttg tggatatgac cacaaagtaa 4141 gagccttatt tcattaaatt tgttcataca tttttccttg tagtcttttg ttttcatcct 4201 ttaatcgcct attctcgtac tctacttcct tgattcgatc ccgtaataat tgaatcattg 4261 catctttatt ttcatcactc attttccgtt tttcgaattt tggagataca gctcgttgct // LOCUS HUMHBLOD 3373 bp ss-mRNA PRI 26-JUL-1990 DEFINITION Human GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase mRNA, complete cds. ACCESSION M35531 KEYWORDS GDP-L-fucose:beta-D-galactoside 2-alpha-l-fucosyltransferase. SOURCE Human epidermal carcinoma cell line A431, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3373) AUTHORS Larsen,R.D., Ernst,L.K., Nair,R.P. and Lowe,J.B. TITLE Molecular cloning, sequence and expression of a human GDP-L-fucose: Beta-D-galactoside 2-alpha-l-fucosyltransferase cDNA that can be from the H blood group antigen JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD ull staff_review staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by J.B.Lowe, 22-JUN-1990. FEATURES from to/span description pept 104 1201 GDP-L-fucose:beta-D-galactoside 2-alpha-L-fucosyltransferase site 1744 2385 Alu sequence homologue BASE COUNT 687 a 925 c 905 g 856 t ORIGIN 1 gcctggcgtt ccaggggcgg ccggatgtgg cctgcctttg cggagggtgc gctccggcca 61 cgaaaagcgg actgtggatc tgccacctgc aagcagctcg gccatgtggc tccggagcca 121 tcgtcagctc tgcctggcct tcctgctagt ctgtgtcctc tctgtaatct tcttcctcca 181 tatccatcaa gacagctttc cacatggcct aggcctgtcg atcctgtgtc cagaccgccg 241 cctggtgaca cccccagtgg ccatcttctg cctgccgggt actgcgatgg gccccaacgc 301 ctcctcttcc tgtccccagc accctgcttc cctctccggc acctggactg tctaccccaa 361 tggccggttt ggtaatcaga tgggacagta tgccacgctg ctggctctgg cccagctcaa 421 cggccgccgg gcctttatcc tgcctgccat gcatgccgcc ctggccccgg tattccgcat 481 caccctgccc gtgctggccc cagaagtgga cagccgcacg ccgtggcggg agctgcagct 541 tcacgactgg atgtcggagg agtacgcgga cttgagagat cctttcctga agctctctgg 601 cttcccctgc tcttggactt tcttccacca tctccgggaa cagatccgca gagagttcac 661 cctgcacgac caccttcggg aagaggcgca gagtgtgctg ggtcagctcc gcctgggccg 721 cacaggggac cgcccgcgca cctttgtcgg cgtccacgtg cgccgtgggg actatctgca 781 ggttatgcct cagcgctgga agggtgtggt gggcgacagc gcctacctcc ggcaggccat 841 ggactggttc cgggcacggc acgaagcccc cgttttcgtg gtcaccagca acggcatgga 901 gtggtgtaaa gaaaacatcg acacctccca gggcgatgtg acgtttgctg gcgatggaca 961 ggaggctaca ccgtggaaag actttgccct gctcacacag tgcaaccaca ccattatgac 1021 cattggcacc ttcggcttct gggctgccta cctggctggc ggagacactg tctacctggc 1081 caacttcacc ctgccagact ctgagttcct gaagatcttt aagccggagg cggccttcct 1141 gcccgagtgg gtgggcatta atgcagactt gtctccactc tggacattgg ctaagccttg 1201 agagccaggg agactttctg aagtagcctg atctttctag agccagcagt acgtggcttc 1261 agaggcctgg catcttctgg agaagcttgt ggtgttcctg aagcaaatgg gtgcccgtat 1321 ccagagtgat tctagttggg agagttggag agaaggggga cgtttctgga actgtctgaa 1381 tattctagaa ctagcaaaac atcttttcct gatggctggc aggcagttct agaagccaca 1441 gtgcccacct gctcttccca gcccatatct acagtacttc cagatggctg cccccaggaa 1501 tggggaactc tccctctggt ctactctaga agaggggtta cttctcccct gggtcctcca 1561 aagactgaag gagcatatga ttgctccaga gcaagcattc accaagtccc cttctgtgtt 1621 tctggagtga ttctagaggg agacttgttc tagagaggac caggtttgat gcctgtgaag 1681 aaccctgcag ggcccttatg gacaggatgg ggttctggaa atccagataa ctaaggtgaa 1741 gaatcttttt agtttttttt tttttttttt ggagacaggg tctcgctctg ttgcccaggc 1801 tggagtgcag tggcgtgatc ttggctcact gcaacttccg cctcctgtgt tcaagcgatt 1861 ctcctgtctc agcctcctga gtagatggga ctacaggcac aggccattat gcctggctaa 1921 tttttgtatt tttagtagag acagggtttc accatgttgg ccgggatggt ctcgatctcc 1981 tgaccttgtc atccacctgt cttggcctcc caaagtgctg ggattactgg catgagccac 2041 tgtgcccagc ccggatattt ttttttaatt atttatttat ttatttattt attgagacgg 2101 agtcttgctc tgtagcccag gccagagtgc agtggcgcga tctcagctca ctgcaagctc 2161 tgcctcccgg gttcatgcca ttctgcctca gcctcctgag tagctgggac tacaggcgcc 2221 cgccaccacg cccggctaat tttttttgta tttttagtag agacggggtt tcatcgtgtt 2281 aaccaggatg gtctcgatct cctgacctcg tgatctgccc acctcggcct cccacagtgc 2341 tgggattacc ggcgtgagcc accatgcctg gcccggataa ttttttttaa tttttgtaga 2401 gacgaggtct tgtgatattg cccaggctgt tcttcaactc ctgggctcaa gcagtcctcc 2461 caccttggcc tcccagaatg ctgggtttat agatgtgagc cagcacaccg ggccaagtga 2521 agaatctaat gaatgtgcaa cctaattgta gcatctaatg aatgttccac cattgctgga 2581 aaaattgaga tggaaaacaa accatctcta gttggccagc gtcttgctct gttcacagtc 2641 tctggaaaag ctggggtagt tggtgagcag agcgggactc tgtccaacaa gccccacagc 2701 ccctcaaaga cttttttttg tttgttttga gcagacaggc taaaatgtga acgtggggtg 2761 agggatcact gccaaaatgg tacagcttct ggagcagaac tttccaggga tccagggaca 2821 ctttttttta aagctcataa actgccaaga gctccatata ttgggtgtga gttcaggttg 2881 cctctcacaa tgaaggaagt tggtctttgt ctgcaggtgg gctgctgagg gtctgggatc 2941 tgttttctgg aagtgtgcag gtataaacac accctctgtg cttgtgacaa actggcaggt 3001 accgtgctca ttgctaacca ctgtctgtcc ctgaactccc agaaccacta catctggctt 3061 tgggcaggtc tgagataaaa cgatctaaag gtaggcagac cctggaccca gcctcagatc 3121 caggcaggag cacgaggtct ggccaaggtg gacggggttg tcgagatctc aggagcccct 3181 tgctgttttt tggagggtga aagaagaaac cttaaacata gtcagctctg atcacatccc 3241 ctgtctactc atccagaccc catgcctgta ggcttatcag ggagttacag ttacaattgt 3301 tacagtactg ttcccaactc agctgccacg ggtgagagag caggaggtat gaattaaaag 3361 tctacagcac taa // LOCUS MUSCRABPA 868 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse cellular retinoic acid-binding protein (CRABP-II) mRNA, complete cds. ACCESSION M35523 KEYWORDS cellular retinoic acid-binding protein. SOURCE Mouse 12.5 day old embryo, cDNA to mRNA, clone lambda-mE2.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 868) AUTHORS Giguere,V., Lyn,S., Yip,P., Siu,C.-H. and Amin,S. TITLE Molecular cloning of a novel cellular retinoic acid-binding protein expressed during mouse embryogenesis and in adult skin JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.Giguere, 22-JUN-1990. FEATURES from to/span description pept 116 532 cellular retinoic acid-binding protein BASE COUNT 226 a 218 c 220 g 204 t ORIGIN Chromosome 2. 1 gaattccggg gaggatctgt tctgcaaagg agacagcaaa gtatctttag cctaaaggac 61 tcagcgtcca gtgttctagt tgaagatcta aagagaaagc caccttgctg ccactatgcc 121 taacttttct ggcaactgga agatcatccg atcggaaaac tttgaggaaa tgctaaaagc 181 tctgggggtg aacatgatga tgaggaagat cgctgtggct gcagcctcca agccagcagt 241 cgagatcaaa caggagaatg acactttcta catcaaaacc tccaccactg tgcgaaccac 301 ggagattaac ttcaagatcg gggaggaatt tgaggagcag accgtggatg ggagaccctg 361 taagagtttg gtgaaatggg agagtggaaa caaaatggtg tgcgagcaga ggcttctgaa 421 gggggagggc cccaagacct cctggagccg agaactgacc aatgatggag agctgatcct 481 gacaatgaca gcagatgacg ttgtgtgcac cagggtctac gtccgagagt gagtgcctac 541 gggtccaaga actgcctgag acgacttctg tgcccgctac aggacacaaa cctccctccc 601 acgtccatct tacaaactag ctctcccctt actcctgagg gttactgctt cctccaaggc 661 cttttgttct ttgccttctc tacgccagag aggggcagaa gctcagaacc ctcccaccgc 721 catttgcccc tcccaggtca gcagtcccag ctccatacca gggtccttcc tggaagagac 781 tgtctctctg gcctctactc cttatccttg tagtctgtgt gatttagaat atttattggt 841 taattttatt aaaatgtttc cggaattc // LOCUS YSCGLN3 3021 bp ds-DNA PLN 26-JUL-1990 DEFINITION S.cerevisiae nitrogen regulatory protein (GLN3) gene, complete cds. ACCESSION M35267 KEYWORDS nitrogen regulatory protein. SOURCE S.cerevisiae (strain S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3021) AUTHORS Minehart,P.L. and Magasanik,B. TITLE Sequence and expression of GLN3, a positive nitrogen regulatory gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by P.Minehart, 19-JUN-1990. Author address: P.Minehart MIT, 56-428 MIT 77 Mass Ave. Cambridge, MA 02139 FEATURES from to/span description pept 730 2922 GLN3 protein signal 509 514 TATA box site 1 140 acidic activation region BASE COUNT 1020 a 704 c 530 g 767 t ORIGIN Chromosome VL, map position 43cm distal to GCN4. 1 gacgtcaact ccatagaagt gacttttccg ccaaagaaga ggacctcgcc ataagcaatg 61 agaatgatcg tcagattctt gaaaattgtg tagatgggca cggcaaggta ttgtaagctc 121 tttgacgacg tataaatcat caatacgagc agcaaagaaa ttggaaacca gttttttaca 181 tctgtcctgt tcaaagatca aaaattagca acgcctacaa ttcgtaggat acatagcgtc 241 acagtgcaca ccagtgattg tacaaacaac atcacaaagt tcatgttaaa gttgtccagg 301 ttaaccacga atttgttcgt tactgtcatc aaaatcgagg acgcgcagta agataagatt 361 gaagccggcc cagagttggc cactgattcc gtccattcat gcttatgctt gctcataatt 421 accacacctt cttgatctct ttacagcttt tcaaccttcc attcttgtac tctatctcta 481 cctggccctt taaacattct taatatgata tattcacatt ttttgctcta ttacccggcg 541 gacaggttcc cgaaagaaag tgacatggca atgctgagag agtggaaaga gtcatcttgc 601 aagacagaga aagatgttca agagtggtaa gctaatgtca gcgcagtagc ccatcccaca 661 ataacagagt gtgtaagaaa gagagacgag agagagcaca gggccccctt ttcccccacc 721 aacaaacaaa tgcaagacga ccccgaaaat tcgaagctgt acgacctgct gaatagtcat 781 ctggacgtgc atggtcgaag taatgaagag ccgagacaaa ctggtgacag taggagccag 841 agtagtggca acaccggtga aaacgaggag gatatagcat ttgccagtgg attaaacggc 901 ggcacattcg actcaatgct ggaggcactg cccgatgatt tatattttac ggacttcgtg 961 tctcctttta cagcagctgc cacgaccagc gtgactacta agacggtcaa ggacaccaca 1021 ccagctacca atcatatgga tgatgatatt gcgatgtttg attcacttgc cacaactcag 1081 cccatcgaca tagccgcatc caaccaacaa aatggtgaaa ttgcacaact ttgggacttt 1141 aacgtggacc aattcaacat gacgcccagc aactcgagcg gttcagctac tattagtgct 1201 cctaacagct ttacttccga cataccgcaa tacaaccacg gttccctcgg caacagcgtc 1261 tccaaatcct cactgttccc gtataattcc agcacgtcca acagcaacat caaccagcca 1321 tctatcaata acaactcaaa tactaatgcg cagtcccacc attccttcaa catctacaaa 1381 ctacaaaaca acaactcatc ttcatccgct atgaacatta ccaataataa taatagcaac 1441 aatagtaata tccagcatcc ttttctgaag aagagcgatt cgataggatt atcttcatcc 1501 aacacaacaa attctgtaag aaaaaactca cttatcaagc caatgtcgtc cacgtccctg 1561 gccaatttca aaagagctgc ctcagtatct tccagtatat ccaatatgga accatcagga 1621 caaaataaaa aacctctgat acaatgtttc aattgtaaaa ctttcaagac accgctttgg 1681 aggagaagcc cagaggggaa tactctttgc aatgcctgcg gtcttttcca gaaattacat 1741 ggtaccatga ggccattatc cttaaaatcg gacgttatca aaaagaggat ttcaaagaag 1801 agagccaaac aaacggaccc aaacattgca caaaatactc caagtgcacc tgcaactgcc 1861 tcaacttcag taaccactac aaatgctaaa cccatacgat cgaggaaaaa atcactacaa 1921 caaaactctt tatctagagt gatacctgaa gaaatcatta gagacaacat cggtaatact 1981 aataatatcc ttaatgtaaa taggggaggc tataacttca actcagtccc ctccccggtc 2041 ctcatgaaca gccaatcgta taatagtagt aacgcaaatt ttaatggagc aagcaatgca 2101 aatttgaatt ctaataactt aatgcgtcac aattcgaaca ctgttactgg taattttaga 2161 aggtcttcaa gacgaagtag tacttcatcg aacacctcaa gttccagtaa atcttcatcc 2221 agatctgttg ttccgatatt accaaaacct tcacctaata gcgctaattc acagcagttc 2281 aacatgaaca tgaacctaat gaacacaaca aataatgtaa gtgcaggaaa tagtgtcgca 2341 tcctcaccaa gaattatatc gtccgcaaac tttaactcaa atagtcctct acagcagaat 2401 ctattatcaa attctttcca acgtcaagga atgaatatac caagaagaaa gatgtcgcgc 2461 aatgcatcgt actcctcatc gtttatggct gcgtctttgc aacaactgca cgaacagcaa 2521 caagtggacg tgaattccaa cacaaacacg aattcgaata gacagaattg gaattcaagc 2581 aatagcgttt caacaaattc aagatcatca aattttgtct ctcaaaagcc aaattttgat 2641 atttttaata ctcctgtaga ttcaccgagt gtctcaagac cttcttcaag aaaatcacat 2701 acctcattgt tatcacaaca attgcagaac tcggagtcga attcgtttat ctcaaatcac 2761 aaatttaaca atagattatc aagtgactct acttcaccta taaaatatga agcagatgtg 2821 agtgcaggcg gaaagatcag tgaggataat tccacaaaag gatcttctaa agaaagttca 2881 gcaattgctg acgaattgga ttggttaaaa tttggtatat gaccgcgtat tatcattatc 2941 attattctta ttatgttaat aattactgaa cggttgcatt gatagatttt cattacctct 3001 gaccacaatc ctgagcattg g // LOCUS BLYHISH3PA 505 bp ss-mRNA PLN 26-JUL-1990 DEFINITION Barley histone H3 mRNA, 3' end. ACCESSION M34928 KEYWORDS histone H3 protein. SOURCE Barley (strain Nudinka) seed scutella 2 days after germination, cDNA to mRNA. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 505) AUTHORS Chojecki,J. TITLE Identification and characterization of a cDNA clone for histone H3 in barley JOURNAL Carlsberg Res. Commun. 51, 211-217 (1986) STANDARD simple staff_entry FEATURES from to/span description pept < 1 243 histone H3 protein mRNA < 1 505 histone H3 mRNA BASE COUNT 95 a 138 c 146 g 126 t ORIGIN 1 aagagcaccg agctgctgat ccgcaagctc ccgttccagc gcctggtgag ggagatcgcg 61 caggacttca agaccgacct caggttccag tcccacgccg tgctggccct ccaggaggcc 121 gccgaggcgt acctcgtcgg gctgttcgag gacaccaacc tgtgcgccat ccacgccaag 181 cgcgtcacca tcatgcccaa ggacatccag ctcgcccgcc gcatccgcgg ggagcgcgcc 241 taagccaccc agagcgctgc attcgggagc gatgacaccg ttcgccagca ttagtgtagt 301 tgattggctt tccttgtcca gatatgcgtc ttgtggttcg ttgtagaaac cctggttggt 361 tggttcccgt agttacagag acttttctgc ttaagtggtt ttggtttgcg gtgttgcaaa 421 ccgatgctta ctgtgatgca aattgttggt taatgtagtg ttgattgaca attatcgatg 481 gatgaacttg tggtgttgcg tagtt // LOCUS BMOFIBA 324 bp ss-mRNA INV 26-JUL-1990 DEFINITION B.mori silk fibroin mRNA, partial cds. ACCESSION M35378 KEYWORDS fibroin. SOURCE B.mori (Kinryu x Showa) posterior silk gland, cDNA to mRNA. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 324) AUTHORS Mita,K., Ichimura,S., Zama,M. and James,T.C. TITLE Specific codon usage pattern and its implications on the secondary structure of silk fibroin mRNA JOURNAL J. Mol. Biol. 203, 917-925 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 324 silk fibroin (AA at 1) BASE COUNT 35 a 60 c 144 g 85 t ORIGIN 1 ggatacggag caggagctgg aagcggagct gcctctggtg ccggtgccgg ttcaggtgct 61 ggtgctggtt caggagctgg tgctggttca ggtgctggtg ctggttcagg tgctggtgct 121 ggttcaggtg ctggtgctgg ttcaggagct ggtgctggtt caggtgctgg tgctggttca 181 ggagctggtg ctggatacgg agcaggagct ggcgttggat acggagcagg agctgggagc 241 ggagctgcct ctggtgctgg tgctggttca ggtgctggtg ctggttcagg tgctggtgct 301 ggttcaggtg ctggtgctgg ttca // LOCUS DROMETA 338 bp ss-mRNA INV 26-JUL-1990 DEFINITION D.melanogaster metallothionein (MT) mRNA, complete cds. ACCESSION M35390 KEYWORDS metallothionein. SOURCE D.melanogaster larva, cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 338) AUTHORS Maroni,G., Lastowski-Perry,D., Otto,E. and Watson,D. TITLE Effects of heavy metals on Drosophila larvae and a metallothionein cDNA JOURNAL Environ. Health Perspect. 65, 107-116 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 124 246 metallothionein mRNA < 1 338 metallothionein mRNA signal 308 313 polyA signal BASE COUNT 101 a 88 c 77 g 72 t ORIGIN 1 gatcagttgt ggtcagcagc aaaatcaagt gaatcatctc agtgcaacta aaggcctaaa 61 tagcccatac ctaccttttt tgtaaacaag tgaacaagtt cgaggaaata caactcaatc 121 aagatgcctt gcccatgcgg aagcggatgc aaatgcgcca gccaggccac caagggatcc 181 tgcaactgcg gatctgactg caagtgcggc ggcgacaaga aatccgcctg cggctgctcc 241 gagtgagctt tcccccaaaa aagatctgga gtagaggcgc tgcatcttgt ctctctacac 301 accctgcaat aaatgtccaa ttaaagtaat tgatgcct // LOCUS HUMVPREBA 503 bp ds-DNA PRI 26-JUL-1990 DEFINITION Human pre-B lymphocyte VpreB gene, 5' end. ACCESSION M34927 KEYWORDS . SOURCE Human myeloid cell line U937 DNA, clone pHVPB-6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 503) AUTHORS Bauer,S.R., Kudo,A. and Melchers,F. TITLE Structure and pre-B lymphocyte restricted expression of the VpreB gene in humans and conservation of its structure in other mammalian species JOURNAL EMBO J. 7, 111-116 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 1 46 VpreB protein precursor, exon 1 133 > 503 VpreB protein precursor, exon 2 sigp 1 46 VpreB protein signal peptide 133 143 VpreB protein signal peptide matp 144 > 503 VpreB protein IVS 47 132 VpreB intron A BASE COUNT 104 a 160 c 140 g 99 t ORIGIN 1 atgtcctggg ctcctgtcct gctcatgcac tttgtctact gcacaggtga gggaaccccc 61 agatcccaaa gactcctgcc ccttccttca tcctgccctg cccccacggg ccacatgcat 121 ctgtgtcacc aggttgtggt cctcagccgg tgctacatca gccgccggcc atgtcctcgg 181 cccttggaac cacaatccgc ctcacctgca ccctgaggaa cgaccatgac atcggtgtgt 241 acagcgtcta ctggtaccag cagaggccgg gccaccctcc caggttcctg ctgagatatt 301 tctcacaatc agacaagagc cagggccccc aggtcccccc tcgcttctct ggatccaaag 361 atgtggccag gaacaggggg tatttgagca tctctgagct gcagcctgag gacgaggcta 421 tgtattactg tgctatgggg gcccgcagct cggagaagga ggagagggag agggagtggg 481 aggaagaaat ggaacccact gca // LOCUS MUSNGF 1176 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Mouse nerve growth factor (NGF) precursor mRNA, complete cds. ACCESSION M35075 J00608 KEYWORDS nerve growth factor. SOURCE Mouse male submaxillary gland, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1176) AUTHORS Scott,J., Selby,M., Urdea,M., Quiroga,M., Bell,G.I. and Rutter,W.J. TITLE Isolation and nucleotide sequence of a cDNA encoding the precursor of mouse nerve growth factor JOURNAL Nature 302, 538-540 (1983) STANDARD simple staff_review REFERENCE 2 (bases 3 to 226) AUTHORS Edwards,R.H., Selby,M.J. and Rutter,W.J. TITLE Differential RNA splicing predicts two distinct nerve growth factor precursors JOURNAL Nature 319, 784-787 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 96 1019 nerve growth factor precursor sigp 96 656 nerve growth factor signal peptide matp 657 1010 nerve growth factor mRNA 1 1176 NGF mRNA BASE COUNT 283 a 330 c 295 g 268 t ORIGIN 1 gagcgcctgg agccggaggg gagcgcatcg agtgactttg gagctggcct tatatttgga 61 tctcccgggc agctttttgg aaactcctag tgaacatgct gtgcctcaag ccagtgaaat 121 taggctccct ggaggtggga cacgggcagc atggtggagt tttggcctgt ggtcgtgcag 181 tccagggggc tggatggcat gctggaccca agctcacctc agtgtctggg cccaataaag 241 gttttgccaa ggacgcagct ttctatactg gccgcagtga ggtgcatagc gtaatgtcca 301 tgttgttcta cactctgatc actgcgtttt tgatcggcgt acaggcagaa ccgtacacag 361 atagcaatgt cccagaagga gactctgtcc ctgaagccca ctggactaaa cttcagcatt 421 cccttgacac agccctccgc agagcccgca gtgcccctac tgcaccaata gctgcccgag 481 tgacagggca gacccgcaac atcactgtag accccagact gtttaagaaa cggagactcc 541 actcaccccg tgtgctgttc agcacccagc ctccacccac ctcttcagac actctggatc 601 tagacttcca ggcccatggt acaatccctt tcaacaggac tcaccggagc aagcgctcat 661 ccacccaccc agtcttccac atgggggagt tctcagtgtg tgacagtgtc agtgtgtggg 721 ttggagataa gaccacagcc acagacatca agggcaagga ggtgacagtg ctggccgagg 781 tgaacattaa caacagtgta ttcagacagt acttttttga gaccaagtgc cgagcctcca 841 atcctgttga gagtgggtgc cggggcatcg actccaaaca ctggaactca tactgcacca 901 cgactcacac cttcgtcaag gcgttgacaa cagatgagaa gcaggctgcc tggaggttca 961 tccggataga cacagcctgt gtgtgtgtgc tcagcaggaa ggctacaaga agaggctgac 1021 ttgcctgcag cccccttccc cacctgcccc ctccacactc tcttgggccc ctccctacct 1081 cagcctgtaa attattttaa attataagga ctgcatgata atttatcgtt tatacaattt 1141 taaagacatt atttattaaa ttttcaaagc atcctg // LOCUS RATXDHA 4162 bp ss-mRNA ROD 26-JUL-1990 DEFINITION Rat xanthine dehydrogenase mRNA, complete cds. ACCESSION J05579 KEYWORDS xanthine dehydrogenase. SOURCE Rat (strain Wistar) liver, cDNA to mRNA, clones lambda-RXD[7,32,42,51]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4162) AUTHORS Amaya,Y., Yamazaki K,-i., Sato,M., Noda,K., Nishino,T. and Nishino,T. TITLE Proteolytic conversion of xanthine dehydrogenase from the NAD- dependent type to the oxygen-dependent type: Amino acid sequence of rat liver xanthine dehydrogenase and identification of the cleavage sites of the enzyme protein during irreversible conversion by trypsin JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by Y.Amaya, 22-JUN-1990. FEATURES from to/span description pept 27 3986 xanthine dehydrogenase (EC 1.1.1.204) signal 4125 4130 polyA signal (put.) signal 4146 4151 polyA signal (put.) BASE COUNT 1055 a 1046 c 1121 g 940 t ORIGIN 1 agagctcagt gactccagca gccacgatga ctgcggatga gttggtcttc tttgtgaatg 61 gcaaaaaggt ggtggagaaa aatgcggacc ctgaaacaac acttctggtc tacctgagaa 121 gaaagttggg gctatgtggg accaagcttg gctgtggaga aggtggctgt ggggcatgca 181 ccgtgatgat ctccaagtat gaccgtcttc agaacaagat tgttcatttt tctgtcaatg 241 cctgcttggc tcccatctgc tccttgcacc atgttgctgt gaccaccgtg gaaggcatag 301 gaaacaccca gaagctgcat cctgtacagg agagaattgc cagaagccat ggttcccagt 361 gtgggttctg cactcctggc attgtcatga gtatgtacac actgctccgg aaccagcctg 421 agcctactgt tgaggagatc gagaatgcct tccaaggaaa cctctgtcgc tgtacaggct 481 acagacccat cctccaggga ttccggacct ttgccaagga tggtgggtgc tgtggaggga 541 gtggaaacaa cccaaactgc tgtatgaacc agacgaaaga ccaaacggtt tctctctcac 601 cttctttatt caacccagag gatttcaaac ctttagatcc cacgcaagag cccatcttcc 661 ccccagagtt gctgaggctg aaagacactc cccagaagaa gctgcgtttt gaaggggaac 721 gtgtgacctg gatccaggct tcaactatgg aggagctgct tgacctgaaa gctcagcacc 781 ctgatgccaa gctggtggtg ggaaacacag agataggcat tgaaatgaaa tttaagaata 841 tgctatttcc tctgatcgtc tgcccagcct ggatccctga actgaattca gtggtgcatg 901 ggcctgaggg aatctccttc ggagcttctt gcccccttag cttggtggaa agtgtcctgg 961 cggaggagat tgctaaactt ccagagcaaa agacagaggt gttcagaggc gtgatggagc 1021 agctgcgctg gtttgccggc aagcaggtca agtccgtggc gtccatcgga gggaacatca 1081 tcactgccag ccccatctct gacctcaacc ctgtgttcat ggccagtgga gccaagctga 1141 ctctggtgtc tagaggtacc aggagaactg ttcggatgga tcataccttc ttccctggct 1201 acagaaagac tctgctcaga ccagaggaga tattgctgtc catcgagatc ccctatagca 1261 aggagggaga gtttttctca gccttcaagc aggcctccag gagggaagat gacattgcca 1321 aggtgactag tggcatgaga gtcctgttca aaccggggac cattgaagtg caggaactgt 1381 ccctttgctt cggagggatg gccgacagaa ctatctcagc cctcaagacc actccgaagc 1441 agctatcgaa gtcctggaat gaggagctgc agctggcccc cgatgcccct ggtggtatgg 1501 tggaattccg gcgcaccctc accctcagct tcttcttcaa gttctacctg acagtgctcc 1561 agaagctggg cagagcggac cttgaggata tgtgtggtaa actggacccc acctttgcca 1621 gtgccaccct gctctttcag aaggaccctc cagctaatgt ccagcttttc caagaggtgc 1681 caaaggatca gtctgaggag gacatggtgg gccggcccct gcctcacctg gcggcaaaca 1741 tgcaggcatc gggagaggcc gtgtactgtg atgacattcc ccgctatgag aatgagctct 1801 ctctcaggct ggtcaccagc acccgggcgc atgctaaaat cacgtccatc gacacttcag 1861 aagccaagaa ggtgccaggg tttgtttgct tcctcaccgc agaggatgtc cctaatagta 1921 atgcaaccgg ccttttcaat gatgaaactg tctttgcgaa ggatgaggtt acttgtgttg 1981 ggcacatcat tggtgctgtg gtcgctgaca ccccagaaca cgcacagaga gctgcgagag 2041 gggtgaaaat cacctatgaa gatcttccag ccattatcac aatccaggat gctataaaca 2101 acaactcctt ttatggctct gagataaaaa ttgagaaagg agatctcaag aaaggctttt 2161 cagaagctga caatgttgtc tcaggagagt tgtatatcgg tggccaggag cacttctacc 2221 tggagaccaa ctgcaccatt gccgtgccaa aaggcgaggc aggcgagatg gagctgttcg 2281 tgagcacaca gaacaccatg aaaacccaga gctttgttgc aaaaatgttg ggcgttccgg 2341 acaacagaat cgtagtccga gtgaagagga tgggtggagg ctttggaggg aaggagaccc 2401 ggagcactgt ggtgtccaca gcactggcct tggctgcaca caagactggc cggcccgtac 2461 gttgcatgtt ggaccgagat gaggacatgc tgataactgg tggcagacat cccttcctgg 2521 ctaaatacaa ggttggcttc atgaagactg ggactgtagt ggctctcgag gtggctcact 2581 tcagcaatgg tggtaacact gaggatctct ctcggagtat aatggaacga gctttgttcc 2641 acatggataa cgcctataag atccccaaca ttcgaggcac tgggaggatt tgcaagacta 2701 atctgccctc caacacagcc ttcagaggtt ttgggggtcc tcaggggatg ctaatcgcag 2761 aatactggat gagcgaggtc gccataacct gtgggctgcc tgcagaggag gtacggagga 2821 aaaacatgta caaagaaggg gacctgactc acttcaacca gaagctggag gggttcacct 2881 tgcccaggtg ctgggatgaa tgcatcgcca gctctcagta tcttgctcgc aagagggaag 2941 tggagaaatt caacagggag aattgttgga aaaagagagg gctgtgtata atcccaacta 3001 agtttggaat aagctttaca cttccttttc tgaaccaggg aggcgctctg gttcacgtgt 3061 acactgatgg ttcggtgctg ttgacccatg gagggactga gatgggccaa ggccttcaca 3121 ccaagatggt tcaggtggcc agcagagctc tgaaaatccc cacctccaag attcatataa 3181 gtgagacaag cactaacacc gtccccaaca cttctcccac agctgcctct gccagtgctg 3241 acctcaatgg acagggtgtt tatgaagcat gccagaccat actgaaaagg ctggaacctt 3301 tcaagaagaa gaaacccacc ggcccctggg aggcatgggt gatggacgcc tatacgagcg 3361 cagtgagttt gtccgcaact ggattttata agacacccaa ccttggctac agctttgaga 3421 caaactccgg aaatcccttc cactatttca gttatggggt ggcttgctct gaagtagaaa 3481 ttgactgctt aacaggggat cataagaatc tccgtacgga tatcgtcatg gatgttggtt 3541 ccagcttgaa tcctgccatt gatattggac aagtagaggg ggcatttgtc cagggccttg 3601 gtctcttcac tatggaggag ctgcactact cccctgaggg gagcctgcat actcgtggcc 3661 ccagtaccta caaaatccct gcatttggta gcatccccat tgagttcaga gtatccctac 3721 tccgggactg ccccaacaag agggccatct atgcatccaa ggctgttggg gagccacctc 3781 ttttcctggc ttcctctatc ttctttgcca tcaaagatgc cattcgtgca gctcgagctc 3841 agcacggaga taacgcaaaa caacttttcc agctagacag ccctgccact ccggagaaga 3901 tccgaaacgc ctgtgtggac cagttcacca ccctgtgtgt cactggagta ccagaaaact 3961 gtaaatcctg gtctgtgagg atctgaagag aaggtctcca ccattggttt gtaccgcacc 4021 aggattcctt ggagccacaa gcacatcctg tagtatccag atttccgcat gccgcgtggg 4081 actcagcagg atgacatttt caggaagatg gacattttga cccaaataag agctgcaaac 4141 aaaccaataa gcaaatgggg ag // LOCUS RICHISH2AA 321 bp ds-DNA PLN 26-JUL-1990 DEFINITION Rice histone H2A gene, 5' end. ACCESSION M35379 KEYWORDS histone. SOURCE Rice DNA, clone pIR22. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 321) AUTHORS Thomas,G. and Padayatty,J.D. TITLE Restriction map and partial sequence of a rice DNA fragment carrying histone genes H2A, H2B and H4 JOURNAL Indian J Biochem Biophys 21, 1-6 (1984) STANDARD simple staff_entry FEATURES from to/span description pept 260 > 321 histone H2A protein mRNA 186 > 321 histone H2A mRNA signal 36 40 CAAT box signal 74 77 GATCC motif signal 138 145 TATA box BASE COUNT 74 a 70 c 60 g 73 t 44 others ORIGIN 1 caaaggacnt gttcccgctg atgtgagcaa ttgtcacaat gccctcccaa acngttttca 61 gatngtngat gtggatcnnn antttnttgc gnntnnanac ctggctctcg ttttttcgca 121 angtcccgaa cnnnnngtat aaatagcgtg tggacccgta ncgtgagaac tcgtgatctn 181 atttcatctg gaacgactcn nggaatnttc cgaaaannnn nnnnnnnnng ccgaaagcct 241 tttggaactt ttcnnccaaa tgcacaccaa aggcctcngg aagnnttttc ancgcaaaaa 301 gatatcaccc gcagggatca c // LOCUS TEYMT14SRR 169 bp ds-DNA ORG 26-JUL-1990 DEFINITION T.pyriformis mitochondrial 14S rRNA. ACCESSION M35376 KEYWORDS 14S ribosomal RNA. SOURCE T.pyriformis (strain ST) linear mitochondrial DNA. ORGANISM Mitochondrion Tetrahymena pyriformis Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae; Tetrahymena pyriformis. REFERENCE 1 (bases 1 to 169) AUTHORS Suyama,Y., Fukuhara,H. and Sor,F. TITLE A fine restriction map of the linear mitochondrial DNA of Tetrahyemena pyriformis: Genome size, map locations of rRNA and tRNA genes, terminal inversion repeat, and restriction site polymorphism JOURNAL Curr. Genet. 9, 479-493 (1985) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 169 14S rRNA site 31 144 conserved U5 region BASE COUNT 54 a 26 c 37 g 52 t ORIGIN 1 gaattcagaa tagctaacgc aaagtattct gcttggggag tattatcgca agattaaaac 61 ttaactgaat tggcgggaat ttgttcgaac ggtggaacat gtggtttaat gcgataatcc 121 acgcaaaatc ttaccaacgt tttaggcttt atctgataat atggttaac // LOCUS YSCPET122 2862 bp ds-DNA PLN 26-JUL-1990 DEFINITION Yeast PET122 encoded protein gene, complete cds. ACCESSION X07558 KEYWORDS PET122 encoded protein. SOURCE Yeast (S.cerevisiae, strain AB320) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2862) AUTHORS Ohmen,J.D., Burke,K.A. and McEwen,J.E. TITLE Divergent overlapping transcripts at the PET122 locus in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 3027-3035 (1990) STANDARD simple staff_entry REFERENCE 2 (bases 953 to 2862) AUTHORS Ohmen,J.D., Kloeckener-Gruissem,B. and McEwen,J.E. TITLE Molecular cloning and nucleotide sequence of the nuclear PET122 gene required for expression of the mitochondrial COX3 gene in S.cerevisiae JOURNAL Nucleic Acids Res. 16, 10783-10862 (1988) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Ohmen, 11-JUN-1990. Draft entry and computer-readable sequence for [1] submitted to EMBL by J.D.Ohmen, 09-JUN-1989. EMBL features not translated to GenBank features: key from to description SITE 746 1096 similarity to E.coli alanyl tRNA-synthetase (AA 116-232) [1] Author address: McEwan J.E. Department of Microbiology College of Letters and Science University of California 405 Hilgard Avenue Los Angeles, CA 90024-1489 FEATURES from to/span description pept 1139 < 1 (c) ORF3 pept 1354 2118 PET122 protein pept 2171 > 2862 ORF2 mRNA 1144 < 1 (c) ORF3 mRNA mRNA 1152 < 1 (c) ORF3 mRNA mRNA 1157 < 1 (c) ORF3 mRNA mRNA 1159 < 1 (c) ORF3 mRNA mRNA 1870 < 1 (c) ORF3 mRNA mRNA 1875 < 1 (c) ORF3 mRNA mRNA 1882 < 1 (c) ORF3 mRNA mRNA 1883 < 1 (c) ORF3 mRNA mRNA 1887 < 1 (c) ORF3 mRNA mRNA 1895 < 1 (c) ORF3 mRNA mRNA 1907 < 1 (c) ORF3 mRNA mRNA 1343 > 2119 PET122 mRNA mRNA 1348 > 2119 PET122 mRNA mRNA 1354 > 2119 PET122 mRNA mRNA 2140 > 2862 ORF2 mRNA mRNA 2143 > 2862 ORF2 mRNA mRNA 2147 > 2862 ORF2 mRNA mRNA 2150 > 2862 ORF2 mRNA BASE COUNT 748 a 652 c 795 g 667 t ORIGIN 1 aagctttctt gtaacttctt ctcattatct tgcatcaatt gccttctttc cgcctgatct 61 cttgcctttt gaatgttatg ttttaatgat tggaagatgc ccatgttctc tgtgggggaa 121 gcgccagcga taggagtcct tggtttagct acttctgtta tcttcagttt cgaacgaacc 181 catttgtttc tcaaaatcat tgtctgtagg acggagaagg caccattaaa ggcaaagtag 241 aggaccacag cggacgataa gttcattgtg gccggtatag aaatgatcgg tagaatagtg 301 aaaagacgct tcatgggaga actgaattgt tgagcaccag tctcaccccc cagccttgta 361 aatgagatga acacagcggc agtgattact tgcaaaccta agtaagggtc tgcttgagtc 421 aagtctgtaa accaagcgac accttgatta gcgaacccat ctactgggta gttagccatg 481 tgtctcaatg cgttgaaaaa cccaagggcg attggaattt gtagcatggg tgcggccagc 541 catctgttct taatgccgtg cgaggagagc agttttttcc tttgcatggc gactagctga 601 ccttgttgca aatctgtagt ggacattagc ttattattca aggcgtccag ctcgggcttg 661 atatgggaat ttctagcaac agtatcagag gacttgacat agaggggaaa catcaggcat 721 cgaatgagga tggtggtggc cgcgatagtt ccccaccaag gcaacccaga gtaaacatga 781 acggcctcca agacgtgttg gataatgtcc gagggccagt accaggtttg ggccaggcca 841 atgctattta agtaccctat atgggaggac aactcgccca ctgtttgggt cgtgttagcg 901 ataaggtccg aagtagaagc ggaaagagaa ggagctgaag aggttaattc atcgatggaa 961 ggcaactggg tttggatttc cgagacatcg ttggcatttg ggcccgtcga attaaatctt 1021 ttggcctgaa aagagatcca tgacggatgg ggccggggca atactatggt tcgagcggtg 1081 gccagtctgg aagaggcagc aaaccttgac gtgacgagtc gagaggtgag tttgaacatc 1141 gtcggggagg ttattctgtg gctccgcttg tacgtgaaca gatacgtata gagggcgagc 1201 cactggttaa atttttcatg gctcggatta cttccgtact gctggctaaa atcgaaatct 1261 cggcctgctg agagtgtttt gagcaatcaa gggaacatct gaacgtggaa gagcagacga 1321 ggcattagct cgaacataag aacggaacac gtcatgttga ctatcacgaa aagactggtg 1381 accaccgatg tgcggtcgcg aatactgtta agcagtttaa acgggaaaat gtccgatgca 1441 ctggcgctgc tgcgtcagca gcagcagacc agcgtggatg tggagctgct gcacacgatg 1501 ctagcgcgag ccgctgcgct tgcccatgcc gacactatag catacatgtg gtatcagcat 1561 gtgatgccac gccggttgcc agtagagggc cgcctgctat gtgaaatggc tggcgtagca 1621 ttgtaccagg acaggctctt cttacccgcg cagttcctcc agcactacca ggcgatgaat 1681 cgcgatcgtc gcaccagccc agaagatgaa ctgattgagt atgagcttag acggattaaa 1741 gtcgaagcgt ttgcgcgtgg cacaatgcac tccacggcgc tcagggaaaa gtggaaggta 1801 ttcttgcagg agatggatac gctaccaggg cagccgccat taaggctgcg cgacttcccg 1861 caaatgacca aggctatggg catagcattg atgcagcaag atgagcaagc agctgccctg 1921 gcgttgtttg gacgacagcc cctagtgata aagaacgaat ggtcactacc gctactactg 1981 gctggtgtcc tttggcatgt tcccggccca gcgcaggcgc gacgtgtgct ggcggagttc 2041 cgtcaaagtt atcgcgggct gccgctgctg gatgccgaac tagtgataaa gagaagagga 2101 tttgaaatca acacataaat ctgggtggag catcgctgta acaaggaaca acgcgtgcta 2161 gcaagcggta atgaaataca aggaaatcaa tttcttcaag ggccatccga gctcgaggtt 2221 gctgcctcga gaagcagtaa ttcaagcgac tgcggctata ttggggcccg agaccaggga 2281 gtacgataac gacccctata acaggcatcc gctgacgtac ggttcggacg aaggtgccct 2341 gtgggtgcga gagcagattt gtacgtttct gaatgatcag ctgtttaagt tcgaaaatgg 2401 ggctcggagc aggacacggg cagactattt gaatctgaat agcggcgctt cgtatggcat 2461 gctgaacatc cttctgcaaa caaccttgcc acataacggg tataccaggc aggcgttcat 2521 catcacgcca acatatttct tgatcaacaa ttgcttcaca gatgcgggat tcaaggggaa 2581 aatgaccgcc atcaacgagc agggccacga ctcgattgat ttcgagtcgt tgatttctgc 2641 ccttgagcag cacgaggcgg agccgcagcc ccatagtacc acagagatga ttcaggggcc 2701 aaagttgacc aagaaggtct acaggtacgt tatgtactgc atcccgacgt ttgcaaaccc 2761 atcgggaaac acatactcgc ttgagaccag acgcagactt atcgacatcg ctcggaagta 2821 cgacatgctg ataatcactg atgacgtgta cgatattcta ga // LOCUS ECO987P 954 bp ds-DNA BCT 26-JUL-1990 DEFINITION E.coli fimbriae 987P subunit gene, complete cds. ACCESSION M35257 KEYWORDS fimbriae. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 954) AUTHORS De Graaf,F.K. and Klaasen,P. TITLE Nucleotide sequence of the gene encoding the 987P fimbrial subunit of Escherichia coli JOURNAL FEMS Microbiol. Lett. 42, 253-258 (1987) STANDARD simple staff_review FEATURES from to/span description pept 259 843 fimbriae 987P subunit precursor sigp 259 328 fimbriae 987P subunit signal peptide matp 329 840 fimbriae 987P subunit BASE COUNT 309 a 170 c 180 g 295 t ORIGIN 1 aaatttagaa aagtgcatta tgcttatcac tagataagaa aataaaacac gaaatatagc 61 gagccatata gcctgttgtg tttgtaatag ataaaaaaca cgcaattgat tatttatgta 121 tctttttgtt tgtatttttt tattaaaaaa agcacacaat tactgcgtgc atcgaaatga 181 gttgaagtgg atgcatatat gcatgaaatg cttttaactt gaaagtctta atgtttctat 241 taattaagat aaggtaatat gagaatgaaa aaatccgcat taacattagc agtgctttcc 301 tctctgttca gtggttactc gctcgcagcg cccgctgaaa acaacaccag ccaggcaaat 361 ttagacttta ctggtaaagt tactgccagt ctatgccaag tggatacttc taatctgtcg 421 caaaccatag atcttggaga gttgtctact tctgctctta aagctactgg caaggggcct 481 gccaagtcat ttgcagttaa tcttatcaac tgcgatacaa cattgaattc tattaaatac 541 actattgctg gtaataataa tacaggaagt gatactaaat atttagttcc agcctccaat 601 gatactagtg catcaggagt tggcgtatac attcaggaca acaacgccca ggctgtggaa 661 attggtactg aaaaaactgt acctgtggta tcaaatggcg gattagctct ttcagaccaa 721 agtattccac tgcaagcata catcggaacc accacaggga atcctgatac aaacggtgga 781 gttacggccg gtactgtcac tgctagtgca gtaatgacta ttcgttcagc aggtacaccg 841 taattagata acaattttta tacaacaaaa caggaaggat tttgaactaa tccttcctgt 901 tattggagat tgaaatgtct aagtttgtaa tatttcttgt gtttttgttt atat //
GenBank-Updates@genbank.bio.net (07/27/90)
LOCUS HUMHPRTB 56736 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human hypoxanthine phosphoribosyltransferase (HPRT) gene, complete cds. ACCESSION M26434 KEYWORDS Alu repetitive element; LINE repetitive element; hypoxanthine phosphoribosyltransferase. SOURCE Human cell line 4X,Y (GM1202), fetal liver, and beta thalassemia patient DNA, clones pAE[23,28], Hu-lambda-[2,3,14] and AE29. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 56736) AUTHORS Edwards,A., Voss,H., Rice,P., Civitello,A., Stegemann,J., Schwager,C., Zimmermann,J., Erfle,H., Caskey,C.T. and Ansorge,W. TITLE Automated DNA sequencing of the human HPRT locus JOURNAL Genomics 6, 593-608 (1990) STANDARD full staff_entry REFERENCE 2 (sites; mutations causing gout) AUTHORS Argos,P., Hanei,M., Wilson,J.M. and Kelley,W.N. TITLE A possible nucleotide-binding domain in the tertiary fold of phosphoribosyltransferase JOURNAL J. Biol. Chem. 258, 6450-6457 (1983) STANDARD full staff_entry REFERENCE 3 (sites; mutations causing gout) AUTHORS Cariello,N.F., Scott,J.K., Kat,A.G., Thilly,W.G. and Keohavong,P. TITLE Resolution of a missense mutant in human genomic DNA by denaturing gradient gel electrophoresis and direct sequencing using in vitro DNA amplification: HPRT-Munich JOURNAL Am. J. Hum. Genet. 42, 726-734 (1988) STANDARD full staff_entry REFERENCE 4 (sites; mutations causing gout) AUTHORS Davidson,B.L., Chin,S.J., Wilson,J.M., Kelley,W.N. and Palella,T.D. TITLE Hypoxanthine-guanine phosphoribosyltransferase. Genetic evidence for identical mutations in two partially deficient subjects JOURNAL J. Clin. Invest. 82, 2164-2167 (1988) STANDARD full staff_entry REFERENCE 5 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Davidson,B.L., Palella,T.D. and Kelly,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase: A single nucleotide substitution in cDNA clones isolated from a patient with Lesch-Nyhan syndrome (HPRT-Midland) JOURNAL Gene 68, 85-91 (1988) STANDARD full staff_entry REFERENCE 6 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D. TITLE Genetic basis of hypoxanthine guanine phosphoribosyltransferase deficiency in a patient with Lesch-Nyhan syndrome (HPRT-Flint) JOURNAL Gene 63, 331-336 (1988) STANDARD full staff_entry REFERENCE 7 (sites; mutations causing Lesch-Nyhan syndrome and gout) AUTHORS Davidson,B.L., Pashmforoush,M., Kelly,W.N. and Palella,T.D. TITLE Human hypoxanthine-guanine phosphoribosyltransferase deficiency. The molecular defect in a patient with gout (HPRT-Ashville) JOURNAL J. Biol. Chem. 264, 520-525 (1989) STANDARD full staff_entry REFERENCE 8 (sites; mutations causing gout) AUTHORS Davidson,B.L., Tarle,S.A., Palella,T.D. and Kelley,W.N. TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase deficiency in ten subjects determined by direct sequencing of amplified transcripts JOURNAL J. Clin. Invest. 84, 342-346 (1989) STANDARD full staff_entry REFERENCE 9 (sites; mutations causing gout) AUTHORS Dush,M.K., Sikela,J.M., Khan,S.A., Tischfield,J.A. and Stambrook,P.J. TITLE Nucleotide sequence and organization of the mouse adenine phosphoribosyltransferase gene: Presence of a coding region common to animal and bacterial phosphoribosyltransferases that has a variable intron/exon arrangement JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2731-2735 (1985) STANDARD full staff_entry REFERENCE 10 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Fujimori,S., Davidson,B.L., Kelley,W.N. and Palella,T.D. TITLE Identification of a single nucleotide change in the hypoxanthine-guanine phosphoribosyltransferase gene (HPRT-Yale) responsible for Lesch-Nyhan syndrome JOURNAL J. Clin. Invest. 83, 11-13 (1989) STANDARD full staff_entry REFERENCE 11 (sites; mutations causing gout) AUTHORS Fujimori,S., Hidaka,Y., Davidson,B.L., Palella,T.D. and Kelley,W.N. TITLE Identification of a single nucleotide change in a mutant gene for hypoxanthene-guanine phosphoribosyltransferase (HPRT-Ann Arbor) JOURNAL Hum. Genet. 79, 39-43 (1988) STANDARD full staff_entry REFERENCE 12 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Gibbs,R.A., Nguyen,P.N., Edwards,A., Civitello,A.B. and Caskey,C.T. TITLE Multiplex DNA deletion detection and exon sequencing of the hypoxanthine phosphoribosyltransferase gene in Lesch-Nyhan families JOURNAL Genomics 7, 235-244 (1990) STANDARD full staff_entry REFERENCE 13 (sites; mutations causing Lesch-Nyhan syndrome and gout) AUTHORS Gibbs,R.A., Nguyen,P.N., McBride,L.J., Koepf,S.M. and Caskey,C.T. TITLE Identification of mutations leading to the Lesch-Nyhan syndrome by automated direct DNA sequencing of in vitro amplified cDNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1919-1923 (1989) STANDARD full staff_entry REFERENCE 14 (sites; mutations causing gout) AUTHORS Hershey,H.V. and Taylor,M.W. TITLE Nucleotide sequence and deduced amino acid sequence of Escherichia coli adenine phosphoribosyltransferase and comparison with other analogous enzymes JOURNAL Gene 43, 287-293 (1986) STANDARD full staff_entry REFERENCE 15 (sites; conserved domain) AUTHORS King,A. and Melton,D.W. TITLE Characterisation of cDNA clones for hypoxanthine-guanine phosphoribosyltransferase from the human malarial parasite, Plasmodium falciparum: Comparisons to the mammalian gene and protein JOURNAL Nucleic Acids Res. 15, 10469-10481 (1987) STANDARD full staff_entry REFERENCE 16 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Ogasawara,N., Stout,J.T., Goto,H., Sonta,S.I., Matsumoto,A. and Caskey,C.T. TITLE Molecular analysis of a female Lesch-Nyhan patient JOURNAL J. Clin. Invest. 84, 1024-1027 (1989) STANDARD full staff_entry REFERENCE 17 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Stout,J.T. and Caskey,C.T. TITLE HPRT: Gene structure, expression, and mutation JOURNAL Annu. Rev. Genet. 19, 127-148 (1985) STANDARD full staff_entry REFERENCE 18 (sites; Lesch-Nyhan mutation) AUTHORS Wilson,J.M. and Kelley,W.N. TITLE Molecular basis of hypoxanthine-guanine phosphoribosyltransferase deficiency in a patient with the Lesch-Nyhan syndrome JOURNAL J. Clin. Invest. 71, 1331-1335 (1983) STANDARD full staff_entry REFERENCE 19 (sites; mutations causing Lesch-Nyhan syndrome and gout) AUTHORS Wilson,J.M. and Kelley,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Structural alteration in a dysfunctional enzyme variant (HPRT-Munich) isolated from a patient with gout JOURNAL J. Biol. Chem. 259, 27-30 (1984) STANDARD full staff_entry REFERENCE 20 (sites; mutations causing gout) AUTHORS Wilson,J.M., Kobayashi,R., Fox,I.H. and Kelley,W.N. TITLE Human hypoxanthine-guanine phosphoribosyltransferase. Molecular abnormality in a mutant form of the enzyme (HPRT-Toronto) JOURNAL J. Biol. Chem. 258, 6458-6460 (1983) STANDARD full staff_entry REFERENCE 21 (sites; mutations causing gout) AUTHORS Wilson,J.M., Tarr,G.E. and Kelley,W.N. TITLE Human hypoxanthine (guanine) phosphoribosyltransferase: An amino acid substitution in a mutant form of the enzyme isolated from a patient with gout JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 870-873 (1983) STANDARD full staff_entry REFERENCE 22 (sites; mutations causing Lesch-Nyhan syndrome) AUTHORS Yang,T.P., Patel,P.I., Chinault,A.C., Stout,J.T., Jackson,L.G., Hildebrand,B.M. and Caskey,C.T. TITLE Molecular evidence for new mutation at the hprt locus in Lesch-Nyhan patients JOURNAL Nature 310, 412-414 (1984) STANDARD full staff_entry REFERENCE 23 (sites; mutant reversion) AUTHORS Yang,T.P., Stout,J.T., Konecki,D.S., Patel,P.I., Alford,R.L. and Caskey,C.T. TITLE Spontaneous reversion of novel Lesch-Nyhan mutation by HPRT gene rearrangement JOURNAL Somat. Cell Mol. Genet. 14, 293-303 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Edwards, 26-JUL-1989. Mutant Description RJK 1780 missing intron 1 (partial and exon 2 [12] RJK 849 missing intron 3 (partial) and exons 4-9 (no mRNA) [22] RJK 984 missing intron 5 (partial) and exons 6-9 (no mRNA) [17] [22] GM 3467 missing intron 8 (partial) and exon 9 (no mRNA) [12] [22] RJK 853 complete gene deletion [12] [22] [16] GM 2227 inversion of exons 6-9 (no mRNA) [23] GM 1662 and GM 6804 duplication of exons 2 and 3 and elongated mRNA [22] [23] Connersville missing intron 7 (partial and exon 8 [7] FEATURES from to/span description pept 1677 1703 hypoxanthine phosphoribosyltransferase, exon 1 /nomgen="HPRT" /map="Xq26" /hgml_locus_uid="LE0031K" 14780 14886 hypoxanthine phosphoribosyltransferase, exon 2 16603 16786 hypoxanthine phosphoribosyltransferase, exon 3 27891 27956 hypoxanthine phosphoribosyltransferase, exon 4 31617 31634 hypoxanthine phosphoribosyltransferase, exon 5 34937 35019 hypoxanthine phosphoribosyltransferase, exon 6 39815 39861 hypoxanthine phosphoribosyltransferase, exon 7 40033 40109 hypoxanthine phosphoribosyltransferase, exon 8 41454 41501 hypoxanthine phosphoribosyltransferase, exon 9 IVS 1704 14779 HPRT intron A IVS 14887 16602 HPRT intron B IVS 16787 27890 HPRT intron C IVS 27957 31616 HPRT intron D IVS 31635 34936 HPRT intron E IVS 35020 39814 HPRT intron F IVS 39862 40032 HPRT intron G IVS 40110 41453 HPRT intron H rpt 258 41 (c) Alu repeat copy A rpt 1012 694 (c) Alu repeat copy B rpt 3532 3342 (c) Alu repeat copy C rpt 3982 3343 (c) Alu repeat copy D rpt 6232 5937 (c) Alu repeat copy E rpt 6524 6288 (c) Alu repeat copy F rpt 7504 7807 Alu repeat copy G rpt 8713 8408 (c) Alu repeat copy H rpt 9477 9169 (c) Alu repeat copy I rpt 9699 10142 Alu repeat copy J rpt 10407 10293 (c) Alu repeat copy K rpt 12154 12310 Alu repeat copy L rpt 13302 12998 (c) Alu repeat copy M rpt 13638 13314 (c) Alu repeat copy N rpt 14238 13923 (c) Alu repeat copy O rpt 14610 14239 (c) Alu repeat copy P rpt 16090 15843 (c) Alu repeat copy Q rpt 16518 16293 (c) Alu repeat copy R rpt 18783 18439 (c) Alu repeat copy S rpt 18962 19084 Alu repeat copy T rpt 19397 19104 (c) Alu repeat copy U rpt 20064 19778 (c) Alu repeat copy V rpt 20486 20636 Alu repeat copy W rpt 20990 20696 (c) Alu repeat copy X rpt 21293 20996 (c) Alu repeat copy Y rpt 22171 21878 (c) Alu repeat copy Z rpt 23772 23460 (c) Alu repeat copy AA rpt 25310 25426 Alu repeat copy AB rpt 26388 26279 (c) Alu repeat copy AC rpt 29863 29560 (c) Alu repeat copy AD rpt 33618 33148 (c) Alu repeat copy AE rpt 36575 36645 Alu repeat copy AF rpt 38613 39039 Alu repeat copy AG rpt 39091 39388 Alu repeat copy AH rpt 40622 40925 Alu repeat copy AI rpt 40952 41262 Alu repeat copy AJ rpt 42640 42936 Alu repeat copy AK rpt 44776 44486 (c) Alu repeat copy AL rpt 46627 46305 (c) Alu repeat copy AM rpt 47520 47763 Alu repeat copy AN rpt 49257 49551 Alu repeat copy AO rpt 49705 49993 Alu repeat copy AP rpt 50002 50312 Alu repeat copy AQ rpt 51066 50774 (c) Alu repeat copy AR rpt 52276 51978 (c) Alu repeat copy AS rpt 52407 52674 Alu repeat copy AT rpt 54653 54551 (c) Alu repeat copy AU rpt 55465 55164 (c) Alu repeat copy AV rpt 56523 56689 Alu repeat copy AW rpt 11312 11254 (c) LINE repeat copy A (partial) rpt 18284 18242 (c) LINE repeat copy B (partial) rpt 27091 26843 (c) LINE repeat copy C (partial) rpt 37558 37300 (c) LINE repeat copy D (partial) rpt 39451 39427 (c) LINE repeat copy E (partial) rpt 51317 51100 (c) LINE repeat copy F (partial) rpt 261 321 short interspersed repeat copy A rpt 1013 1054 short interspersed repeat copy B rpt 2902 2869 (c) short interspersed repeat copy C rpt 3078 3107 short interspersed repeat copy D rpt 4228 4254 short interspersed repeat copy E rpt 4463 4494 short interspersed repeat copy F rpt 21582 21635 short interspersed repeat copy G rpt 28430 28394 (c) short interspersed repeat copy H rpt 28923 28981 short interspersed repeat copy I rpt 35562 35604 short interspersed repeat copy J rpt 44042 44108 short interspersed repeat copy K rpt 46628 46688 short interspersed repeat copy L rpt 51715 51687 (c) short interspersed repeat copy M unsure 17479 17479 polymorphism or cloning error unsure 24903 24903 polymorphism or cloning error mut 1664 1678 ggccggctccgttat in wt; gt in gout RJK 951 [13] mut 14778 14778 a in wt; t in Lesch-Nyhan RJK 1760, no exon 2 [12] mut 14808 14809 tt in wt; ttt in Lesch-Nyhan Chicago [7] mut 14874 14874 t in wt; c in Lesch-Nyhan Detroit Leu->Pro [7] mut 14877 14879 tta in wt; ta in Lesch-Nyhan RJK 1939 [12] mut 14886 14886 g in wt; a in Lesch-Nyhan RJK 2163 Arg->Lys [12] mut 16619 16619 c in wt; g in gout Toronto Arg->Gly [20] mut 16623 16664 atgtgatgaaggagatgggaggccatcacattgtagccctct in wt; at in Lesch-Nyhan RJK 2108 [12] mut 16675 16676 ag in wt; agg in Lesch-Nyhan RJK 866 [13] mut 16677 16677 g in wt; a in Lesch-Nyhan New Haven Gly->Glu [7] mut 16679 16679 g in wt; c in Lesch-Nyhan Yale Gly->Arg [10] mut 16690 16690 c in wt; a in Lesch-Nyhan Flint RJK 892 Phe->Leu [5] [13] mut 16707 16707 a in wt; t in gout Arlingen Asp->Val [7] mut 16756 16759 tgta in wt; ta in Lesch-Nyhan RJK 1332 [13] mut 16780 16780 c in wt; a in gout Munich Ser->Arg [3] [19] mut 16784 16787 tgtg in wt; tg in Lesch-Nyhan RJK 1747 [12] mut 27897 27897 c in wt; t in Lesch-Nyhan RJK 1930 Gln->TAG [12] mut 27911 27911 c in wt; t in gout London Ser->Leu [2] [4] [12] [14] mut 31621 31621 t in wt; a in Lesch-Nyhan Midland RJK 896 Val->Asp [6] [13] mut 31622 31624 ctt in wt; ct in Lesch-Nyhan RJK 2019 [12] mut 31624 31624 t in wt; c in Lesch-Nyhan RJK 1784 Leu->Ser [13] mut 31628 31628 t in wt; g in gout Ann Arbor Ile->Met [11] mut 34962 34970 tgcag in wt; agcaaagcaa in Lesch-Nyhan RJK 1210 Met->Lys [13] mut 35015 35015 g in wt; t in gout Milwaukee RJK 949 Ala->Ser [7] [13] mut 39837 39837 c in wt; t in Lesch-Nyhan RJK 974 Arg->TGA [13] mut 39842 39846 tgttg in wt; tg in Lesch-Nyhan RJK 1894 [12] mut 39858 39858 g in wt; t in Lesch-Nyhan RJK 2185 Asp->Tyr [12] mut 39866 39866 g in wt; a in Lesch-Nyhan RJK 1934. no exon 7 [12] mut 40034 40038 tgttg in wt; tg in Lesch-Nyhan Michigan RJK 855 [7] [13] mut 40080 40080 g in wt; a in Lesch-Nyhan Kingston RJK 2188 Asp->Asn [12] [19] mut 40095 40095 t in wt; g in Lesch-Nyhan New Briton RJK 950 Phe->Val [7] [13] mut 40102 40102 a in wt; g in gout Ashville Asp->Gly [8] mut 40114 40114 g in wt; a in Lesch-Nyhan RJK 888, GM 7092, no exon 8 [12] mut 41486 41509 aaaatacaaagcctaagatgagag in wt; ag in Lesch-Nyhan Evansville RJK 894 [7] [12] mut 41450 41452 ata in wt; ttt in Lesch-Nyhan RJK 906, GM 1899, downstream cryptic splice site used mut 41454 41454 c in wt; g in Lesch-Nyhan RJK 1874, RJK 2079, His->Asp [12] [13] mut 41461 41461 g in wt; a in Lesch-Nyhan RJK 1727 Cys->Tyr [13] BASE COUNT 15689 a 11281 c 11599 g 18167 t ORIGIN 1 bp upstream of EcoRI site; chromosome Xq26. 1 gaattctcgt aaaactcttc atggcagtag ttattattct ctctctctct ctttttcttt 61 tttcttgaga caggatattt ctctgttgcc caggctggag tgcagtggca cagtcttggc 121 tcactgcagc ctggacctcc tgggctcaag ccatcctccc acctcagcct cccaagtagc 181 tggggctaca ggcacatggc caccaggcca gataattttt catttttgta gagactgagt 241 ctcaccatgt tacccaggtt tattattctc attttttaga tgaagagact gaggtccaga 301 gaagctcaat gacttgccta gttttacaaa tctcctgcca tcacataccc ctcagcgtcc 361 ttaataagag ggaggccacc aactatgtgc tgggcactgt ggtggatgct ggagctatag 421 ggttgagtat ataagaaatg gtgttgctgg agcaactgtt gcttgcttac ctgacctatc 481 tgagaattaa ttagcagggg aacatatttt tgttttcaga ttcaatataa gaacttgtgt 541 gggcaaaaat aaagatcagt agtaataaca gtagttccca tttgctgact gtactgtcct 601 aagtgcatat atatatacat acacacacgc atacctatac tcctctaata ctcaaaatga 661 tcctgtttat gtattgttaa tatgctcatt ttatttttaa atttttattt atttttattt 721 ttatttattt ttgagacgga gtctcattct gtcgcggagg ctgaagtgca gtggtgcgat 781 ctcagctcag tgcgacctcc gcctcccggg ttcaagtgat tctcctgcct cagctccgga 841 ctagctggga ttacaggcgc ccgcctccac gcccagctaa tttttgtatt tttagtagag 901 atggggtttc gccatgttgg ccaggctggt ctcgtactcc tgaccttgag tgatccacct 961 gcctcggcct cccaaagtgc tgggattaca ggcatgagcc accgcgccgg gctaatatgc 1021 tcattttagt gaggcaaaaa tagaggctca gagtctgatt tgtacaaaac tacagagcag 1081 ttaagtgtcc tctcagatgt gtaccctgat ctgggtgact ctaggactct aggtctcaac 1141 tgttacaacc agttaagggt ttggggaagc actgggccaa gagtcaggaa aatggaagcc 1201 acaggtagtg caaggtcttg ggaatgggac gtctggtcca aggattcacg cgatgactgg 1261 aacccgaaga gccggggccc ggtttacggc cgccatgaag caacgcgcgc cggtaggttt 1321 gggaatcagg gagccctctg aataggagac tgagttggga gggaaagggg cttcgctggg 1381 ggagcctcgg cttcttctgg gagaaaattc ccacggctac ctagtgagcc tgcaaactgg 1441 taggcgccgg cgtaggcgcg cgggcggggc cgggggcggg gcctgcgggg cgtggcgggg 1501 cgggcagagg gcggggcctg cttctcctca gcttcaggcg gctgcgacga gccctcaggc 1561 gaacctctcg gctttcccgc gcggcgccgc ctcttgctgc gcctccgcct cctcctctgc 1621 tccgccaccg gcttcctcct cctgagcagt cagcccgcgc gccggccggc tccgttatgg 1681 cgacccgcag ccctggcgtc gtggtgagca gctcggcctg ccggccctgg ccggttcagg 1741 cccacgcggc aggtggcggc cgggccctga ggcgcgggat ccgcagtgcg ggctcgggcg 1801 gccgggccca gggaaccccg caggcggggg cggccagttt cccgggttcg gctttacgtc 1861 acgcgagggc ggcagggagg acggaatggc ggggtttggg gtgggtccct cctcggggga 1921 gccctgggaa aagaggactg cgtgtgggaa gagaaggtgg aaatggcgtt ttggttgaca 1981 tgtgccgcct gcgagcgtgc tgcggggagg ggccgagggc agattcggga atgatggcgc 2041 ggggtggggg cgtgggggct ttctcgggag aggcccttcc ctggaagttt ggggtgcgat 2101 ggtgaggttc tcggggcacc tctggagggg cctcggcacg gaaagcgacc acctgggagg 2161 gcgtgtgggg accaggtttt gcctttagtt ttgcacacac tgtagttcat ctttatggag 2221 atgctcatgg cctcattgaa gccccactac agctctggta gcggtaacca tgcgtatttg 2281 acacacgaag gaactaggga aaaggcatta ggtcatttca agccgaaatt cacatgtgct 2341 agaatccaga ttccatgctg accgatgccc caggatatag aaaatgagaa tctggtcctt 2401 accttcaaga acattcttaa ccgtaatcag cctctggtat cttagctcca ccctcactgg 2461 ttttttcttg tttgttgaac cggccaagct gctggcctcc ctcctcaacc gttctgatca 2521 tgcttgctaa aatagtcaaa accccggcca gttaaatatg ctttagcctg ctttattatg 2581 attatttttg ttgttttggc aatgacctgg ttacctgttg tttctcccac taaaactttt 2641 taagggcagg aatcaccgcc gtaactctag cacttagcac agtacttggc ttgtaagagg 2701 tcctcgatga tggtttgttg aatgaataca ttaaataatt aaccacttga accctaagaa 2761 agaagcgatt ctatttcata ttaggcattg taatgactta aggtaaagag cagtgctatt 2821 aacggagtct aactgggaat ccagcttgtt tgggctattt actagttgtg tggctgtggg 2881 caacttactt cacctctctg ggcttaagtc attttatgta tatctgaggt gctggctacc 2941 tcttggagtt attgagagga ttataagaca gtctatgtga atcagcaacc cttgcatggc 3001 ccctggcggg gaacagtaat aatagccatc atcatgttta cttacatagt cctaattagt 3061 cttcaaaaca gccctgtagc aatggtatga ttattaccat tttacagatg aggaaccttt 3121 gaagcctcag agaggctaac agacataccc taggtcatac agttattaag agaaggagct 3181 ctgtctcgaa cctagctctc tctctctcga gtaataccag ttaaaaaata ggctacaaat 3241 aggtactcaa aaaaatggta gtggctgttg tttttattca gttgctgagg aaaaaatgtt 3301 gatttttcat ctctaaacat caacttactt aattctgcca atttcttttt tttgagacag 3361 ggtctcactc tgtcacctag gatggagtgc agtggcacaa tcactgctca ctgcagcctc 3421 gacttcccgg gctcgggtga ttctccccag gctcagggga ttctcccact tcagcctccc 3481 aagtagctgg gactacaggt gcgcaccacc atccctggct aatatttgta ctttatttta 3541 tttatttatt tatttatttt ttgagatgga gtttcgctct tgttgcccgg gctggagtac 3601 agtggcatga tctcggctca gtgcaacctc tgcctcccgg gttcaagcga ttctcctacc 3661 tcatccccct gagtagctgg gattacaggc gcctgccacc atgcctggct aattttttgt 3721 atttttaata gagacgaggt ttcaccatgt tggccaggct actctcgaac tcctgatctc 3781 aggtgatcca cccgccttgg cctcccaaag tgctgggatt acaggcgtga gccactgcgc 3841 ccggcctaat atttgtattt tttgtagaga tggtgttttg ccatgttgtc caggctggtc 3901 ttgaactcct gagctcaagc gatctgcccg cctctgcttc ccaaagtgct gggattacag 3961 gcatgagcca ccgtgcctgg cctaggtaga cgcttttagc tttggggtgt gatgcctgcc 4021 ccagtatata gtgaatttaa ttattgctag agctggctgt ttgttagttt tctttgaaca 4081 taagatactc attgttttta gtttgcaaat ccctcttcct ttttaaaaaa tttctttccc 4141 ttaaattgtt tgcatgttag caataacaaa tgcttaaatg gtgctatgtg ctagatactc 4201 ttctaagccc tgttatgtat attaactaat tttttaaatt acacaaatca gagaggttaa 4261 gtaacttgcc caagattacc caacaatact aggatttgaa cctaagtttg tctcacccca 4321 gattctgctc ttaatctcta aacttttaag ttagtagtga caatagtagg tatttattga 4381 atacttaact atgttttagg cgttgaagta aatattttgc aggcattatc taatgtaaac 4441 accctaaagt tacataacag gtacccttta ggtaaataaa cactagtatg accttggagg 4501 cacagatagt tgaagtaact tgcccaatat cacttacatg aaattggccc tcaaatgtgt 4561 ctgatacaac ccatgctgct tgtaactatc gttttaaact gccagggtaa acttggacac 4621 acttgagcta agaaaaagct tttagatttt tgcaaattaa tgtgaaagat atgctttatg 4681 tggatataat atcttctaaa tttcggggat ggtagtccta gaaatgtaat cctgccctag 4741 ccgagcttac cctgccaata attttttaca gaattggtaa aacggagcac cttttttttg 4801 tccttggcca cactgttatc aacagggtgt agattgacat caatctgtag gtgtaaacca 4861 gaattactct ttgtgaccac caggaaatag agcagttcag ttcaggggtt tctttctgtg 4921 aatttagcac tgtgacctgc atactacaag tctactttgt tttctatcca ttgtttgtat 4981 ctgggtattg caaaaggtag gaaaaggacc aaccagatca gcagagaaga gttgccttgg 5041 agttttcttt tagttttctg cagttcatta gatagtaact aggccatgtc attttactcc 5101 cttgtagtga agatatgttg aagttgtact ggtatactct tctacctttc tgtaatttta 5161 tattgtgtag acttgataaa atttatgtgt caatcaccac cattaatatc aatattgagc 5221 ctcaattctt atttttctgc ccagtggctg ccaaattact aacatttaca ataattcact 5281 actactaaga taatctacta gttcgatcac atacttcaaa ttgttatgga actactgtct 5341 tcagcattgt gcttctgata actgataagt ataatttttt ttttgtccag agtgaacatg 5401 tctattcttc cactgtacac actaataaaa ggaaaaattg taatattggg taaattcatg 5461 tccttacaca tgtagtagtt atgagcccat gtccctagaa tgagtaataa tttatccctc 5521 ccttggttga atagtcaaga atgctgattt taattcttct aacagcttta tccctcagaa 5581 gggaaggcaa gcaagttata tatgtagttt atttgtaaga ctgatatgaa attggaagat 5641 gaatctacta ttagctttaa ttatttttac atttaggaat attgcatcag taactcataa 5701 ttttggtttt ctgttatcct gagttaacac aaattatcca aggagatggc ggatcatctg 5761 ctttgaggtg tttttttttg agaattttaa tgtatctgaa tataaaaggt aaaaatatgc 5821 caactagcaa tttctgccca ttccagaagt ttggaaatat tactcattac taggaattaa 5881 ataaaatatg gtttatctat tgttatacct cttttaattc acatagctca tttttatctt 5941 ttatttttgt ttgttttttt tgagatggag tcttgctctg tcaccaggca ggagtgcagt 6001 gatgcaaatc tcggctcact ctagccaccg actccctggt tcaagcgatt ctcctgcctg 6061 agccttctga gtagctggga ttacaggcag gcaccaccac gcccagctaa tttttgtaga 6121 gacaggattt caccgtgttg gccaggatgg tctccatctc ctgacctcat gatctgcctg 6181 cttcggcctc ccaaagtgct gggattacag gtgggagcca ctacgcctgg cccacatagc 6241 tcatttttag actcacttcc attaagtctt gtttggaccc acgaacattg tctttttttt 6301 tttaagatgg agtttcactt ttgttgccca gactgtagtg caatggtgca atctcagctc 6361 actgcaatct ctgcctcctg ggttctagca attctcctgc ctcagcctcc cgagtagctg 6421 gaattacagg cgcccgccac cacgcccagc taatttttgt gtttttagta gagacggggt 6481 ttcaccatgt tgggcaggcc aggggtgatc cgcccacctc agcctcccaa agtgctggga 6541 ttacaggtgt gagccaccgc atctggccaa catgtctttt tttttttttt cctttttaac 6601 cacaaagaga cttaagcagt ccttgtcaca gatgatgaat tgatgttgca agtattgtct 6661 tagcttggat taattttctt gcttactgta attttagata atatagcttt gtaattagag 6721 attttatgtg taaaccacaa aaatgtttac atgaaggcca ttattacaga tgtgacgtgc 6781 ataattatta gtaatttgta tgtttacatg ggtcagtctg gcaaaaaatt atgaagtttt 6841 aaaaattaaa aaaaattata atgccagttt tactggaaag taaaattatt tcagtaatcg 6901 attatagcaa aagtattgat tttcattcca gacaaaagtc agaatgaaag gtaatttctc 6961 aatactcttt cagattaata aaagtacctg tagcgatttt tatcattcac aagtatatca 7021 caagtaagtt agaatttgag aactgtgttc tagatctctg aggagatgca gtcagatttc 7081 tgaactgtct cagcaaatgg taagtaactt agagctagta attaataacc tgtcctttga 7141 tttctgattc agccaagaat ggccatattt gggaaaggca gatctggaga gtaaccacgt 7201 tttcattcat ttaccacttc taggcccctc cagagctctc agatattttg gggttgagcc 7261 cttccccaaa gccatacagg accttttttt tgtgatctgt tctagccatt tttatgttgg 7321 gtgcttgtta tggactgagc atttatgtcc tcccacaccc cccccatacc ttttttgaag 7381 tcctaacccc cagtgtgatg gtatttggag acagggcctt tggaaggtaa ttacagttag 7441 aagaagtcgg gagggttggg cccaggtctg attggattag tgcccttata tgaaaagaca 7501 ccaggacggg cgcagtggct cacacctgta atcccagcac tttgggaggc caaggtgggt 7561 ggatcacgag gtcaggagtt tgagaccagc ctggccaatg tagtgaaaca ccatctctac 7621 taaaaataca aaaattagct gggtgtggta gcgggctcct gtcatccaag ctactcggga 7681 gggtgaggca tgagaatcac ttgaacccgg gagttggagg ttgcagtgag cccagattgt 7741 gccactgtac tccagcctgg gtgacagagt gagactctgt ctcaaaaaag aaaaaaaaaa 7801 aaaaagagac accagagagc ttgttagaag aggtcatgtg agcacacagt tagaagacct 7861 tcaagccaaa gaagaggcct gagattgaaa cctaccttgc aggtacctta attttggact 7921 tcccagcctc caaaactgtg agaaataagt ttctgttaag tcactcagtc tgtggtattt 7981 tgttatggca gcctgagcag gtagttgttc tttcagaagg tgttgataat aaccacatgc 8041 aacaccaagt cacaaataat aaaacagatg taacttatat tcatacagaa agttgggcac 8101 tgccattgcc ttgttggttt acacggctgt gctagttcag tagcagaaag gtgctggtct 8161 cctttactca gtttacaatc taggcagtag aatgtaatca ctgctttaaa cttgatactg 8221 cttagggaga gaatcattgg tgctgggtaa ctttgggttc taggtttact ttttgtgtat 8281 atataactgt ttttggtaaa tcacaagttt ctgggcttgt cgaattagat tttgttacag 8341 attatgagct ttattatgct atacagttag ttgtatgtat atatgccttt cccactagat 8401 tttaagcttt tttttttttt ttttttttgt gacggagtct tgctcttgtc gcccaggctg 8461 aagtggagtg cagtggcaca atctcggctc actgcagcct ccacctccta ggttcaagcg 8521 attctcctgc ctcggcctcc caagtaactg ggactacagg cacgtgccac cacacccggc 8581 taatttttgt attttttgta gagacagggt ttcgccatgt tggctaggct ggtcttgaac 8641 ttctggcctc aggtgatcca cccgcctcag cctcccaaag tgctgggatt tacaggcatg 8701 agccaccacg cccagctata gctctttaag ggttgtaaat ttataatcat tcttttactc 8761 tcctgcaaat tctgttgcac actgccttaa tcaaggtaga tgctgaatgc atttttgtat 8821 aattgaatat gttgcaatcc ccaactctct ccaactgttc ctgtcaaagc agccactgga 8881 ttgttaacta atccatatta gatggggtta attaatatca gatgggacaa gtaagggcta 8941 ataagattat aggccaccaa gtagatttct gtctagctct tatagagatt gagtttattg 9001 gacctgtttg ataggaagtt ttggtgtttg ggatgattaa aactgaagtt cctatttatt 9061 gaattatacc tatttatatt atttcatatc agtggtccac atgcaagtga ggcttctgag 9121 acagagtttg agttctctct tcaactacca taacacttaa cctgtatctt tttttttttt 9181 ttttttttta gacaggagtc tcgctctgtc actcaggctg gagtgtagtg gtatgatctc 9241 ggctcactgt aacctctgcc tcctggattc aagcagttct ccatgtctca gcctccctag 9301 tagctgggat tacaggcctg tgccaccatg cctggctaat tttttttttg tatttttagt 9361 agagacgggg ttttaccacg ttggccaggc tggtctcgaa ctcttgacct cgagcgatca 9421 acttgccttg gcctcccaaa gtgctgggat tacaggcatg agccacagcg cccagccgtc 9481 ttttttttta aatagcaatt taacactgtt cacagttact catgtacatg tcatgccatc 9541 tattacactg taagttctgt gagggtagct gtatcaaatt tatctaactc tctctagtat 9601 gcatgacata gtaagtattc aataaatatt tgcatattag tgataaggat acaggttctg 9661 aatagtgggt ccttaccatt taagaattag tatttgatgg ccgggcgggg tggctcacgc 9721 ctgtaatccc agcactttgg gaggctgagg cgggcggatc atgagatcag gagatcgaga 9781 ccatcctggc taacatggtg aaatcccgtc tttacaaaaa aaatacaaaa gaattaacca 9841 agtgtggtgg tgggtgcctg tagtcccagc tactgctttg tgaggctgag gcaggcagat 9901 cacctgaggt gggaaattca agaccagcct gaccaacatg gagaaacccc atctctacta 9961 aaaatacaaa attagccggg cgtggtggcg catgtctgta atcccagcta ctcgggaggc 10021 tgaggcagga gaatggcgtg aacccgggag gcggagcttg cagtgagcca ggatcgcgcc 10081 actgcactcc agcctgggcg acagagcgag actccgtctc aaaaaaaaaa aaaaaaaaaa 10141 aattagtatt tgatatttga tcattaaata tgaattaaga ggacttagac tttttgttaa 10201 atgtcaagct gggaaaagtt gtcatttaaa tgaattgcct cttatttaat ttcgtctgat 10261 gatacatttt gtttttattt tgtaaaaaat tatttttttt ctttttggag acagggtctt 10321 gctctgttgc ccaggctggt cacaaactcc tgacctcaag caatcctcct gccttagcct 10381 cccaaaatgc tgggattaca ggcgtgacga cctcgcccgg ccttgtatta tgatacattt 10441 tgaacaacta caagtagact tggtataatg aacctgcacg tacccattgc caagttctga 10501 caactgtctg tctatagcca attatgcatt tcttaaatta gaaccccccc aatataccca 10561 aatatatata tatgtgtgca tatatatagt aagttgtaac aaagttgtga attcatacct 10621 gaagtatctc aagtgatgca agttttatga atttttgttt atgccttttg ggaagagttg 10681 tattgacaaa ttttttatgc ttaaagtaaa ccataaatca aaaaaataaa atctaggatg 10741 caataaaaca aaacaacttc ttgacataag tatggtatgt aaatctgttt tgattggaaa 10801 tcaatttgtt atattgccag aattcctgtt ttagaataca tctctgctga tctgtctgta 10861 ttcttagact gcatatctgg gatgaactct gggcagaatt cacatgggct tcctttgaaa 10921 taaacaagac ttttcaaatt cttagtcgat ctgcagaacc tgtagccagg cactgaacca 10981 ttttgataga tgcagtaatc gttgcaagtg tatatttcaa gggagttctg gctgggtcct 11041 agtttatgct tgtggcagaa gcagtgagta actgggagga agttggtgag taagcttcaa 11101 ggaagaagtc atttttagta ctctggatct tcctgatttt aaagcactac aaaatggtgc 11161 attttcattc ttgtcaagtg ataacagata tattctgatg agcctgaaat gaatatatat 11221 tgtatcattt ttataatatc tagcaaggtt tgtattttcc tagaacttga actaaatttc 11281 agttcataaa atttataaaa tacttagttg ttgtaaaata tttttggaat gttcacatag 11341 gtgacacaca aatgtcccat tttcattctt tctatagtaa atatgttctg atatgtgaag 11401 gtttagcaga tgcatcagca tttaatccta gaggatctgg cataatcttt tcccccaaga 11461 atagaaattt tttctgctta tgaaagtagt acatgtttct ttaaaaacaa atcaatattg 11521 acttctgcct gctgtatagc actatgcctc cacctggcca tgaccagggg catgtcctgg 11581 tccacctacc tgaaaatgtt tgcaaccagc ctcctggcca tgtgcacagg ggctgaagtt 11641 gtcccacagg tattacgggc caacctgaca atacatgaag ttccaccaaa gtctgagaac 11701 tcagaactga gctttgggga ctgaaagaca gcacaaacct caaatttctc agcactggaa 11761 acctcaaaat ataactgaat tccataaata agattttaag tcttaaatat gtatttttaa 11821 atgtattaaa agtcaagctg cttgtattta agcacctaat acaatgctta ggttgtaaaa 11881 ggagatgctc aataggtact aactgatata ttgagattta attatggttt gaccaatatt 11941 tattggaaac cgccaaagct taaatcatca gcttcttgaa tgtgatttga aaggtaattt 12001 agtattgaat agcatgtgag ctagagtatt tcattctttc tggtttattt cttcaaatag 12061 actttgaata taatggtgaa tgggtattat aaattaacta ataaaaatga cattgaaaat 12121 gaaaaaatat atatattaaa gtgtagaaag tgaccaggcg tggtggctca cacctgtaat 12181 ccaagcacct tgggaggctg aggcaggagg atctcttgat cccaggagtt caagaccagc 12241 ctgggcaaca tagcgagact tcgtctctaa aaaaaaaaaa gagagagaaa aaaatttttt 12301 ttatttaaaa aaagtgtaga aagtgtcaag accccacttc ttaccattat ttggtatatt 12361 tctctatacc cacccaccct tcctccttac tccctccctc ccttcccaat ctttttatct 12421 ttttgtattc tgattttttg tttgtatatt ttgctttaat ttaatgtatc ctttaaaaat 12481 ttcccataca ttttatatgt atatataaaa acgcatgctg ccaaagataa tttataagaa 12541 agaccattga atttttttaa aagtgatata tattcattga aaaaaattta gaatatatag 12601 caaagcaata aagaactaaa taaaattgct gtaactcctc tttcaaagat aagtgctttt 12661 atgattttgt tgtatttttt tctgtatata ggtacatata tagtatttat aaagctgtac 12721 tcatagtaca ttttcacatc acaggtacca tatcagtgtt attaaatatt ttgtatgcca 12781 ggggctagac ataccaagac aaccaatatg tggttctact taaataatat tagagtatct 12841 tttatgatga cacttcatga gttgactata ataatcttag acttctaaga gtttgggttt 12901 tcaaaagatc acttagcttt tttgggtgat ttttccccct tactgtgaga tgagagaggc 12961 tgtttggatt tgggattggg gtagcgggga cagcaacttt tcttttcttt ttctttttta 13021 ttttgaggta gggtattgct gtgtcaccca ggctggagtg cagtggtgtg atctcggctc 13081 actgcaacct ccacctcccg ggctcaggtg atcctcctgc ttcagcctcc cagtaactgg 13141 gactacaggc gcgtgccaca tgcctggcta attttgtatt tttagtagag atggggtttc 13201 accatgttgg ccaggctggt ctctaactcc tgacctcagg tgatacgccc acctgggcct 13261 cccaaaatac tgggattaca ggcatgagcc gctgcatcag ccagcagttt ttcttgtggt 13321 tttttttgtt tgttttgttt tgttttgttt ttgagatagg gtcttactct gttgtccacg 13381 ctggagtgct gtggtatgat cgtagctcac tgcagcctca aactcctggg ctcaagtgat 13441 tccttctgcc tccgcctccc gagtagctgg gactacaggt atgcaccacc atacctggca 13501 aatttttaca aagttttttg tagggacggg gtcttgctac attccccatg tcggtcttga 13561 actcctggcc tcaagcaact ctcctgtctc agcctcccaa agcactggga ttacaagtgt 13621 gagccaccac accatgccag tttttcctgt tcagtgtgat attttatctt gttagactac 13681 agtgtgttaa aacttgtttt actaaatttt caaacatact caaaagtgga gagaatagta 13741 taatgaatac ccgtatgttc atcacccatg tttagaatat tattaaatat aaagattttg 13801 ctgcgtttgt cttagctctt taaaattttt ctttttctct ttgtgaccta aaggaaattc 13861 catatcttat cactttactt ctacattctt gactaagatg actaagacat atagttacat 13921 ggttttttgt tttgtttttg ttttttaaag acgaaatctc gctcttgtcc cccaggctgg 13981 agtgcaatgg tgccatctca gctcagtgca acctctgcct tctgggtaca agcgattctc 14041 ctgcctcagc ctcccaagta gctgggatta caggctcctg ccaccacgcc tggctaattt 14101 ttgtattttt agtagagacg gcggggggag gtttcaccat gttgacaagg ctggtctgga 14161 actcctgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctggga ttacaggcgt 14221 gagccaccgc gcccagcctg tttttttgtt tgtgtgtttt gttttttttg agacagagtc 14281 ttgctctgtt tcccaggctg gagtgaagtg gtgccatctc agctcagaga cagagtcttg 14341 ctctgtttcc caggctggag tgaagtggtg ccatcttggc tcactgcaac cttcacctcc 14401 caggttcaag tgattctcct gcctcagcct cccaagtagc tgggactaca ggcatgtgtc 14461 accacacccg gctaattttt ttgtattttt agtagagacg ggatttcacc gtgttgccca 14521 ggctggtctc gaactcctga gctcaggcag tctgcctgcc tcagcctccc aaagtgctgg 14581 gattacacgt gtgaaccaac ccgcccggcc tgttgttttc ttacataatt cattatcata 14641 cctacaaagt taacagttac taatatcatc ttacacctaa atttctctga tagactaagg 14701 ttatttttta acatcttaat ccaatcaaat gtttgtatcc tgtaatgctc tcattgaaac 14761 agctatattt ctttttcaga ttagtgatga tgaaccaggt tatgaccttg atttattttg 14821 catacctaat cattatgctg aggatttgga aagggtgttt attcctcatg gactaattat 14881 ggacaggtaa gtaagatctt aaaatgaggt tttttacttt ttcttgtgtt aatttcaaac 14941 atcagcagct gttctgagta cttgctattt gaacataaac taggccaact tattaaataa 15001 ctgatgcttt ctaaaatctt ctttattaaa aataaaagag gagggcctta ctaattactt 15061 agtatcagtt gtggtatagt gggactctgt agggaccaga acaaagtaaa cattgaaggg 15121 agatggaaga aggaactcta gccagagtct tgcatttctc agtcctaaac agggtaatgg 15181 actggggctg aatcacatga aggcaaggtc agatttttat tattatgcac atctagcttg 15241 aaaattttct gttaagtcaa ttacagtgaa aaaccttacc tggtattgaa tgcttgcatt 15301 gtatgtctgg ctattctgtg tttttatttt aaaattataa tatcaaaata tttgtgttat 15361 aaaatattct aactatggag gccataaaca agaagactaa agttctctcc tttcagcctt 15421 ctgtacacat ttcttctcaa gcactggcct atgcatgtat actatatgca aaagtacata 15481 tatacattta tattttaacg tatgagtata gttttaaatg ttattggaca cttttaatat 15541 tagtgtgtct agagctatct aatatatttt aaaggttgca tagcattctg tcttatggag 15601 ataccataac tgatttaacc agtccactat tgatagacac tattttgttc ttaccgactg 15661 tactagaaga aacattcttt tacatgtttg gtacttgttc agctttattc aagtggaatt 15721 tctgggtcaa ggggaaagag tttattgaat attttggtat tgccaaattt tcctctaaga 15781 agttgaatca ttttatactc ctgatgttat atgagagtac ctttctcttc acaatttgtc 15841 tctttttttt ttttttttga gacaaggtct ctgttgccca ggctggggtg cagtgcagca 15901 gaatgatcac agttcactgc agtctcaacc tcctgggttc aagcgatcct tccacctcag 15961 cctcctgagt agctgggact ataggtgtgc gccaccactc ccagctaata tttttatttt 16021 gtagaaacag ggttcgccat gttacccagc ctcccaaagt gctgggatta caggcatgag 16081 ccactggccc agtttctaca gtctctctta atattgtata ttatccagaa aatttcattt 16141 aatcagaacc tgccagtctg ataggtgaaa atggtatctt gtttttattt gcatttaaaa 16201 aaaattatga tagtggtatg cttggttttt ttgaaggtat caaatttttt accttatgaa 16261 acatgagggc aaaggatgtg atacgtggaa gatttaaaaa aaatttttaa tgcatttttt 16321 tgagacaagg tcttgctcta ttgtccaggc tggagtgcag tggcacaatc acagttcact 16381 ccagcctcaa catcctgcac taaagtgatt ttcccacctc acctctcaag tagctgggac 16441 tacaggtaca tgctaccatg cctggctaat tttttttttt ttgcaggcat ggggtctcac 16501 tatattgccc aggttggtgt ggaagtttaa tgactaagag gtgtttgtta taaagtttaa 16561 tgtatgaaac tttctattaa attcctgatt ttatttctgt aggactgaac gtcttgctcg 16621 agatgtgatg aaggagatgg gaggccatca cattgtagcc ctctgtgtgc tcaagggggg 16681 ctataaattc tttgctgacc tgctggatta catcaaagca ctgaatagaa atagtgatag 16741 atccattcct atgactgtag attttatcag actgaagagc tattgtgtga gtatatttaa 16801 tatatgattc tttttagtgg caacagtagg ttttcttata ttttctttga atctctgcaa 16861 accatacttg ctttcatttc acttggttac agtgagattt ttctaacata ttcactagta 16921 ctttacatca aagccaatac tgttttttta aaactagtca ccttggagga tatatactta 16981 ttttacaggt gtgtgtggtt ttttaaataa actcctttta ggaattgctg ttgggacttg 17041 ggatactttt ttcactatac atactggtga cagataccct ctcttgagct acatcggttt 17101 gtggggagtc aaaagtcctt tggagctagg tttgacaaat aaggtgggtt aacacttgtt 17161 tcctagaaag cacatggaga gctagagtat tggcgaattg aagaaatccc cctttttttt 17221 taacacactt aagaaagggg actgcaggta tactcaagag agtaagtcgc accagaaacc 17281 acttttgatc cacagtctgc ctgtgtcaca caattgaaat gcatcacaac attgacactg 17341 tggatgaaac aaaatcagtg tgaattttag tagtgaattt cattcataat ttgatcgtgc 17401 aaacgtttga tttttattac tttagactat tgtttctgat tttatgttgg gttggtattt 17461 cctgtgagtt actgttttac ctttaaaata ggaatttttc atactcttca aagattagaa 17521 caaatgtcca gtttttgctg tttcatgaat gagtcctgtc catctttgta gaaactcgcc 17581 ttatgttcac atttttattg agaataagac cacttatcta catttaacta tcaacctcat 17641 cctctccatt aatcatctat tttagtgacc caagtttttg accttttcca tgtttacatc 17701 aatcctgtag gtgattgggc agccatttaa gtattattat agacattttc actatcccat 17761 taaaaccctt tatgcccata catcataaca ctacttccta cccataagct ccttttaact 17821 tgttaaagtc ttgcttgaat taaagacttg tttaaacaca aaatttagac ttttactcaa 17881 caaaagtgat tgattgattg attgattgat tgatggttta cagtaggact tcattctagt 17941 cattatagct gctggcagta taactggcca gcctttaata cattgctgct tagagtcaaa 18001 gcatgtactt tagagttggt atgatttatc tttttggtct tctatagcct ccttccccat 18061 ccccatcagt cttaatcagt cttgttacgt tatgactaat ctttggggat tgtgcagaat 18121 gttattttag ataagcaaaa acgagcaaaa taggggagtt taactttaat attttctttt 18181 aaaaagcatt tcatgttata agatcaattc tgagtggtag aaaatgcttt gacattttat 18241 ttccattttc tacttttagt ttttttccta tttgtttaag atcttagagg attattaagc 18301 tgaactcctc aactgataaa aagcatgaca tcttaaacat aagcaaagca tatttttagg 18361 ttaattttca catagaaaac agtttatttt atgtgaaatt ctatgtagat atactatttt 18421 tttggtattt attgatatgt ttattttatt ttattttatt ttattttatt ttattttatt 18481 ttatttattt attttttttt ttgagacaga gtctcactct gttgcccagg ctggagtgca 18541 gtggcatgat cgtagctcac tgcaacctcc actcccgggt tcaagcaatt cttctgtctc 18601 agcctcccga gtagctggga ctacaggtgc ctgccactat gcccggctaa tttttgtgtt 18661 tttagtagag atggggtttc accttgttgg tcaggctggt ctcgaacccc tgacctcagg 18721 tgatccaccc acctcagcct cccaaagtgc tgggattata ggcatgagcc acgtgcccgg 18781 ccgacatgtt aattttttaa aaaaggcttt actggggtat attttatata atataataat 18841 cacatgtttt aactatacaa ttccaagctt tttagtatat ttatagggct atgcaaggaa 18901 gatatactgt taaacagtag aaattgagaa agctcttctg ataatatctc ttgatttgat 18961 gatggctcat gcctgtaatc tcagtgcttt ggaaggccaa gacagcagaa tcacttgagg 19021 ccaggggttc gagaccagcc tgggcaacac agcaataccc tatctttaca aataataaaa 19081 atatctgttg atttgaagta aagttttttt ttaaagacaa ggtctcattc tgtcacccag 19141 gctggaatgc agtagcaaga tcacagctca ctgtggcctt gaccttctgg gctcaagtga 19201 ttctcccact tcggcctccc gagtagctgg gactaacagg tgtgcaccac catggctggc 19261 taattttttt ttatgtttgt agagattggg tcttactgtg ttgcccaggc tgatcccgaa 19321 ctcctgggct caagcagtct tcctgcctca gcctctaaaa ttgctgggat tacaggcttg 19381 agtcaccatg cccagcctga agtagcattt ctaccctgtt taataattca gcagcttgtc 19441 atgtaagata ttcatatatg catataaaca ttaggcagct taatttggta aaactgtaaa 19501 atggaaattt taaattgttt gcagcatcaa taacattgat gtcagtatga tttttacatg 19561 ctgatcttga ccaatttgaa acagtgagtt aaaatctggc tgatccgtac taatcctaaa 19621 gaaatattct atgaactatt aaatgtttcc agaatatata aagaaacatt atgatgtcaa 19681 cacacccatc tatttttttt tggaaataaa aactccattt ttcttattaa agaaaacatg 19741 cttattagaa aacatacggc tgggtgcagt ggcacacatg taattccagt gctttgggag 19801 atcgaggtgg gagaatcact tgaggccagg agtttgagac cagcctagac aacataatga 19861 gaccccctct ctacacaaaa agaattagtt gtgcatggtg gcgtgcacct gtagtcccag 19921 ctacttggga ggcagaggca ggagcatccc ttgagcctag gagtttgaga ctgcaggagt 19981 tcgagactga gtggaatgca gtggaactgc attccagcct gagtgacaga gggagaccct 20041 gtcttaaaaa aataagaaag aaaacacaac tgcagaaaat tataaaggat ttaagtcatt 20101 ccaaatatca ctgccacttt ttatttagaa tattctaaag aattctctct ctgtgtacac 20161 acacacatat gcgtactctt aatccaagta gcttggtagg attttattta cctagtgcct 20221 agatgggaaa ttgcctgggg attccaaata cctatttcat taaattaaag atgtcactga 20281 ttttaagact taacactatt tttcatactg ccaagaaaga aaacactacc agttataaat 20341 gtaaattgcc atcaattgta atacatcaat tttagagcta ttattaataa aatgtgaatg 20401 tgcatcttag agcaatgaaa tatagtacta tatatttgat gaccttttct gccctgtgat 20461 attcagaaag tgaaagttaa atatgggctg agcatggtgg ctcacacctg taatcccagt 20521 actttgggaa gtcaagacgg gaggctggct tgaacccagg agttcaagac cagcctaggc 20581 aatgtagcga gacgccatct caaaatatta aaaataagta aataagtaaa taaaaagaag 20641 gttaagtata caaatgtatt tcctttgttg tgaatttatt tcaattttat agtgattttt 20701 tttttttgag acgaagtctc actcttgtcc cccaggctgg agtgcgatgg cgtgatctca 20761 gctcactgca acctctgcct cccaggttca agctatactc ctgccttggc cccccgagta 20821 gctgggatta caggcgcctg ctaccatgcc tggctaattt ttgtattttt agttgagatg 20881 gggtttcacc atgttggcca ggctggtcta gaactcttga cctctggtga tccacccgcc 20941 tcggactccc aaaatgctgg gattacaggc gtgagccacc gtgcctggcc agtggttttt 21001 tgttgttgtt gttgttgttt tgttttgttt ttgtttttgt ttttgttttg agacaggatc 21061 ttgctctgtc acccaggctg gagtgcagtg gtgccatctt ggttcactgc aacctctgcg 21121 tgggctcaag caatcctccc acctcccttt ccagagtagc ggggaccaca ggtgtgtgcc 21181 accacacctg actaattttt gcattttttt ttgtagaaac agggttttgc catgttgccc 21241 aggttggtct gaaactcctg agctcaaaca atccaactgc cttggcttcc ctaagtgaaa 21301 ttacaggcat gggccactgt acccagtcta gtgatttttt tatttttatt tttattttat 21361 tttattttat ttttttacca aaaaaacaac aaagcctcag gaggaaaagt tgatacacaa 21421 gtaaatttta ttggaaatgt ttttgtgtgg accttaagca gagggaaaat tagtctgcat 21481 tatggtgtat ccagactaaa tgactgatat taaaatgaaa ttattcttag gatttgcaat 21541 cttagagaaa actttttcat ttttattttt ttgagttaca aattatcttc atttacattt 21601 gagaacagtg agtcacagag ggattaagta acttactcaa gatcatacaa gtctttgatt 21661 tgaacccaat cttttaactc tgcagaactc agagtcactc ttatttggaa aaacttttta 21721 actgatgtgg atcctctaat atgggcttcc tattattcat tctctattag tcagaagttt 21781 tgcaagcaga cagaattcat tttgccaatt acgggatttt ccctcagttg cagtcaaggt 21841 tcataaaact ataactcttt atctttaatt agaaatgttt ttttttttga gacaaggtct 21901 tgctctgttg cccagactgg aatgcagtgg catagtggcc cattgcagct ttgaactcct 21961 gggctcaagg gatcctctgc ctcagcctcc caagtatctg agactacaag tgcgtgccat 22021 cacccatggc tattttaaaa aaaaaaaaaa ttgtagagat agggtcttgc tgtgttgccc 22081 aggctggtct caaactcctg gtctcaagca atccttctgc cttggtctcc caaagtgctg 22141 agattacagg tgtcagccgt tgcacctggc caaaacgata acttaaaata cacacacaca 22201 cacacacaca caaacacata tgtgtattgt gtgtgtgtgt gtgtgtgtgt gtctcaaaag 22261 gtatcaaaag agaatagcta taactttagt gttgatcttg atagtgactt gattaggctc 22321 tgtttaacat caaagatgca aattaatact ttctttgaac atattaaaaa tgcagaaaat 22381 attggagtat tttattttaa ataaattgta ttctgtatat ttaaggtata caacatgatg 22441 ttatgggata catataggtg gttaaaagat tactgcagtg aagcaaatta acgtatccct 22501 caactcacat agttacccat tttttttttg ttttggtggc aagaggagct taaaatctca 22561 tttagtgtga atcccaaata cagcacaatt ttattaccta tatacttcat gttgtacatt 22621 atatttctag acttgttcat cctacatatc tgctactttg tatcctctga gctacatctc 22681 cccattttct cacttgcccc ccaagtagtt tcttaaagtg tctcatgtaa gagggcagta 22741 gctttcagct taaacttttt ctctgtatgt agtcgatttc tttgaggtat acttttctct 22801 ccagaatagt tagatgtagg tataccactt tgatgttgac actagtttac ctagaactta 22861 tcttctgtaa atctgtctct atttccatct ctgtctccat ctttgtctct atctctatct 22921 gtctatctct atctatctat ctatctatct atctatctat ctatctatct atctatctaa 22981 agcaaattca tgcccttctc ctatttattg aatcgagacc atagacaggg gtgagagaaa 23041 gaatttggca ggaatgggga tgtgtattat ctgtggcata aggaaacttt acagaactag 23101 gttcaaaagt atactttcta gttctttccc atggcttttc actttgatgt agtccttatc 23161 aggtaactga ggttttatat aagtcccctg attcttagaa catgaaggtg tagtagtcaa 23221 ggttggtccc ttgaaaccac aaattttgtg aaaaaaaatt aagaaaattt gaataatttc 23281 ctcagcaaat acatattgat catctgttat acagccatga gaagtggttc tgttgcacac 23341 gtttatttta tcagatccta atcccaaacc aggcataaaa tggaaaccat gaagatagga 23401 tgaaataact tctgaatgtt tgaatgtttg aaaatagtgt acttaaaaat accaggtggt 23461 ttttgtttgt tttttgtttt tttctttttt tgagacaggg tctcactctg tcacccaggc 23521 tggagtgtag tggtgcaatc tcatctcatt gcagtcttga cctcccaggc tcaggttatc 23581 tcccacctca gcctcccaag tagctgggac tacaggcaca tgccaccacg cccagctaat 23641 tttttgtatt ttttgtagag acggggtttc accctgttgc ccaggctggt ctagaactcc 23701 tgggcttaag cgatcctccc acctcagcct cccaaagtgc taggattaca ggcatgagcc 23761 accatgcctg gcagaaaata ccaggttttt aagtatcagc acttactctt caatcttttc 23821 tattactatg ttgtgctaaa tggtattttt tatttaatta gagcaatgct gttcaataga 23881 actttctttg aggatggaaa tcttttatgt ttctgctatg tggtacagag ccactagtga 23941 catgtggctt ttgagcgctt gacacatctt gtgcaacaca ggaactgaat ttttaagtaa 24001 tttatattgc cacatgtggc taccgtatgg gacagtgtag tactagatga tctgtaaggg 24061 ctgtgcttca tcagtgtcgt tttttaactg acaaaaacct ttagtttttt ttttagtaat 24121 gtgtttattt aaaagaattc ataaaataca agtaaacaaa ttaacttgtt acctgagcat 24181 atgtcctttc atacttattt tttctgcata catattttgg aaaatggaat atctgcccct 24241 ttttttttat ctgagataca gtctacctct aaaaatacat gattctaaca ttctcacttt 24301 ttgttggcat ttgatcaggg tatagaaaaa cagttaaaag gacagagaat ggttgagaga 24361 ttatgatatg aagagaaaat gtgattgagt gtggtagact tggggcctgc ttgaatgttg 24421 agagaatgac tgttttccga taaaaaaaaa aagtccattc taggatccta aaagaagggt 24481 ctgaagttca ctgcagaaag caagctacat agtactaagc cactaagggg acatggagcc 24541 cttagtaatt cctaccttag taatagtctc atcatgccct cttgggaacc cagccttgtt 24601 gattagcctc tctgctttct ctccttatag ttcaacctcc ctgtttgttc caagcagttc 24661 ttttcctgcc catttattat gcatttctat acagctttcc tcctcttttt ctataccatg 24721 ctgcagttct tattgctacc tagaggtttt caaaattcct aggggcggat aagtaggcat 24781 aaacaaagtt cttccctatt atccttccta ttttttcacc tagactgaag aggtagacaa 24841 aatagaaata aagacattaa gggtatgtgt ttgtagtccc aaagagcttc tctggcaatt 24901 ttgatgtagt tgacagtgac gctctgagtt caggacagat tggactcctt ggctgagagg 24961 agtgaggaga taggacggta gaggagaggg tagagcaact ctggaggaag ctttcccctc 25021 acctttgcca gtcctgttat cctagactta accataatta aagatgaggg aggcactcag 25081 taaagggatc tagtgggaag cttgttccag acagccaagg agggaggttc gcgcagttcc 25141 tttggccacc caggtggggt aattgatcca tgtatgccat tcatgtacaa tgtaggcact 25201 tatacctgta ttccaatgta gtgaactata ccattactct taaattaata ttctttatta 25261 gcttccatgg tggctatagg ccaggcaaga gagttaagaa aaaataaata gccaggtatg 25321 gtgactcaag cctgtaatct cggcacttta ggaggccgag gcaggaggat agcttgagtc 25381 caggagttca agaccagcct gagcaaaata gtgagatcct gtctctattt tttaaaaaag 25441 ccttggggca aacaggagta tggaggtttg gatgctaata gaacagcagt gtcttactgc 25501 ttggagttct cttgtttctt gtcctatcac cgtagccttt ggatcacagc aatttttcca 25561 tgactccata cttttcagtt cttgaatatt ttttccttta ttcctcttgt ctctgtaaag 25621 acatcaactg gagttggact gtaataccag gtatctccag aagatggcac tatttaacag 25681 attttataaa taatttgatg tgagtcactg tcatctgaag cttgttgcct tttctttctt 25741 tcttctttct tttttttccc catcaattct gtatgtttga aatgctggga tttaagttag 25801 ttagaataag ggatgtctgt aatttcccta aattgagaag taatatgcaa aggttgatat 25861 cagaagtcat atgctcacct tgcaacacca aataatactg gcccatttgt gatttttgaa 25921 agtaacactc cataataaat ggatgtatat atagaagcat aacaaaaata gaagcacata 25981 aaagtgaaaa gtctcataaa cgccattgtc actactcatg taattgctgt tacaaatttg 26041 tttaaatgtt gaataaaaat ggtgtcatag gcaacacagt gttccactac ttggtgtttt 26101 taatagcatt attctgtctc agtgtgcttt ggattatcag gtgcttttta atagttgcat 26161 ggtattacat tgtgtagatg aacttgatta atttaaatgg ttccctgtta atggacatgt 26221 tggtttgttt ttgtgaacaa ctgatacagt gaacatttat tttttaaata aaaaaaagag 26281 agacagggtc ttgctgtgtt tctcgggctg gccttgaact cctggggtca agcgatcgtc 26341 ttgcctctgc ctccctggga ttacaggcat gaagccaccg cacccggccc agtgaacact 26401 cttgaatgta tctttgtata cttgtcaagt gtttttgtag caattgattc ccagaagtgg 26461 gaattacatg gaattaagtg acatgcatgt ttgcaatttt aacaggtatt gctatgtcat 26521 tttcaaaaga agctatgcca attaatactc tcaccaacaa gagtgcttat ttcccctcag 26581 catattatca ggcttaagtt ttgccagtat gggtgggaga acagtagaat cacattgttt 26641 tagtgtttgt ttctcagata gatataattt tacaccttat aaccttctct tctataaatt 26701 gtctatttgt gttcattctc cattttccta tgggttctta ttgttggagc ccaatatata 26761 aaagggggta tttgttacag aacctcttca gttttggttc atgtcatgcc tgggttttta 26821 ccctttctac ggatgttaaa aaaaattctc tattttcttc cagtccactt atggctttat 26881 tttttacatt tagattttaa tccgtctgga atttattttt gtgtatgctg tgaggtaggg 26941 accatacttt tattttttcc caaatgggtt actagttggc caaacatcat ttattgaata 27001 attcatcttt tccctactga ctcgaaatac catctttatt gtatactaaa tcctcatata 27061 gttctgggtc tgtttctggg ctctactttg ttcatttact gtgctggtac tgcaccgttg 27121 taattgctgt ggctttgtgg tatggtatgg cttgctctct gctagggcaa gtcgaagctc 27181 ttttgttcac ctgctctttc acccaaattt tctgtcctga atccagcaca gccaaattat 27241 ggtcattgtc accaccaact acagtgggtg ttgagcattt cccattgaat ctcctgtaag 27301 ggttttattg gattctgtga tagcagtaaa atgggagcct aagaggtatt ccttaaagga 27361 ctactaatca gacctggttt cccagatgat gctgaagatg acggggcctg ggctagactt 27421 ttgagggaca tatccttggg gttgggtgtg atatagacca gcccttacaa tttgcttgac 27481 tcatgggaat cgtacagggc cagaaccaga cacctgtcat gctaataact tccctcacaa 27541 ttcagaaatc actgtgattg aagatgggtg gctgttataa tactacccac ttaaaaatgg 27601 atgtaaccca ttttttagga ctcttaaaaa catcaaatca gtaatggccg attaggactt 27661 tttaattttt actaatctct acttgaaagt tttctagtca ttcatttcag gaaacctaat 27721 tcttataatt catatcattt agaatatcat aatgctatgg atattagcta gctaacttct 27781 caaatcttct agttctcatt taatttgaag tttgtgtgtg tacataagga tatacatata 27841 catatgtgtg tgtagatata tatatatata gttttttttt ttttaactag aatgaccagt 27901 caacagggga cataaaagta attggtggag atgatctctc aactttaact ggaaaggtat 27961 gtatcttgaa agggaagaaa aaaaagcact tcataccgag tcaattagta acagtgtgct 28021 ttcaatcaat cactaagaga taatttacat agtataacta aatgggttat ttaacccttg 28081 gaagcagtct aggttaatta tcgttcccta ggtcatgtag taaaaagaca gtagaatcca 28141 acattaacct taaatgtcca tattgtcaag tactgctgtc tgcctctgtg ggactctaat 28201 ttgggatcct tcaaaaaaca ttgatggggg aaaagatagc ctttaaaaaa aaaaaaaaaa 28261 caaacctatg tgagtctatg tgaggtagac tcacatagtt tcctaaaaga tagcaaagca 28321 gtattatgta gtggctgaaa gtgtgagttc cggagcctga caactgattc aaagcatggc 28381 ttagtacttc ctaactctga ccttgggcaa gttacttaac ctctctgtgt cccatatgtg 28441 attagggtga ggttgataat agcagccata gagttaagag gattaagtgc tataatgcaa 28501 gtagagctct tacaacagtt tctggtaaat cactcaataa attcagacat actattattt 28561 taagaaatct caaagagttt tcttgtacct taaaattctc ctagtgtgaa ccattggttt 28621 tggtatattg tgcttccatg tagtttaata tcaagatgtt tttagatttc ccttttaatt 28681 tatttgttga cccattggtt gttcaggagc atgctgttta cctgaaaata atggagatat 28741 taaggtattt gaatatttat cttctagtac attgaaaaac tttttgagag taaccaataa 28801 taaatgatgg aatgctactg cttttttttt ttgaagctgc cagttattgt ttacttacac 28861 tatgccaaat ataaaggcat taatctcata aaagtttcac aacaatcctg tgagggagac 28921 gatatcccca ttttacaaat caggaaatta agacttaata aggttaaaag acttgcccca 28981 aagtcacaga accagtaagt ggtagagctt gaatttgaat acagacctga ctctaaagct 29041 cttttctttc tttagatttt agtgttcatt gcttacttga atgagtatct ataagaaaac 29101 tttaacatgt aaaacttctg tgaaattatc ttgtcccata tcagggtcat gtcaaactaa 29161 tgtcctcctc agcatctttg gaaaacttca gaggagaaat gagctttgcc cctcctgttc 29221 atttcatata ccactgttag acctgtcctt ccctttcagc atgctttgtc catatttaga 29281 agctgttgaa gccattactt gtctggtcag tttttagtgc tggaatggac ctagcctttt 29341 aggccttctg agatttagtt tgatctcgtc tttcccacct aatggctctg ttctactaca 29401 tagatttgat ctgaaacagt tctctgtttc taaaataact ttcttttcat gatagtcaca 29461 gtaaagtaca tttattatgg aaaaatcaat aagtataacg agtgaaagtt atttcttggt 29521 ggtaagatta tgggattatt tgaactttct gtttcattgt attttattta tttatttatt 29581 tttgtgatgg agtctcactc tgctgcccag gctggagtgc agtagtacga tcttggctca 29641 ctgcaacctc cccttcccag ttcaagtgat tctcctgcct cagactccca agtagctggg 29701 attacaggcg cacgccacca tgcctggcta atttttttat ctttagtaga gacagggttt 29761 caccatgttg accaggctga tctccaactc ctgatctcag gtatccacct gcctcagcct 29821 cccaaagtac cgggattacg ggtgtgagcc accctgcctg gcctcatttt gtcttttggg 29881 ggtatttttg tgtgcagata tatatgtata taaatatttt tccctctttt ccccagttag 29941 tatttgagca gatgaacttt ggacccgaat acctgtattc aagtctctaa taccacttct 30001 tggctatttt cattttatca aatggcctct tatcctcgtt tttctcattt attaagtaga 30061 gatgtaacta cttgatataa ttcaaaaact caataatggc attcttttgt tttttagact 30121 ctagtgtctg tactccttgt accatgctgg gattcatttg aacaattgca tggctttttt 30181 agtgtattat taaatttgca gtttacttag aatttactgg gacctcatac aaatgggaaa 30241 aaaacataac tgtgttactc atttgctgtg tgcctttgga ttgaccctat tttttgtatt 30301 cattttctcc ccatgtcctg agttccactt tgaataaaaa agtaattttt ttcctgcctg 30361 taaaataggc taccaatagg ctgcagttgt ctatagtagc tgcttcactg aggagagctc 30421 agcatgagag aaatagtatg aattgcttgc cacaagttat gggctagcct tacttcattc 30481 tgtacttgga cctgtttagg cttctaagag atcttacctc caacaataaa ctgctttgag 30541 acatgaaaag gtggaagctt tacttggtta taactttact tttaatacct agaacagtga 30601 gtcttcaaac ttgtatttgc atgcccaatt tataaaaagt ttcctgagca tttaccccta 30661 atatatgcat tttaaattat atatgattta tggtaataat aatatatatg ttacaaaata 30721 catacaaaaa tatagattaa acaaggtgag gttaaaaaat ttaaaagttc taatctttct 30781 tgcaaaccag tggatctttt gtgccttact ctggtaaaca ctgtcttaga agaatatata 30841 gaacattaaa atcttaatgc tatagttata tgacagagta tgatgagagc tacagataaa 30901 caacacatca tgaatcttct tgtggcagtg tttataacca ttatgtgaaa tgctgcctca 30961 ttcttataac tagcataaga acagatagga ctttctcgat tttgaggggt aattattaga 31021 tggtattttc tgttaaggac tcttccagct ataaaattct taaatgtaga aagcgaagtg 31081 agggtttatg gtgagaggaa gcattggtat catgttttag tgtagtccaa gaatatggac 31141 acatccagaa aatgcagatc aagtttagcc taatgagaaa atatattttg gagtccatat 31201 ggtaaattaa attatgtgat ttttgagtta ttgtacaaat ataattctta gaatgttaga 31261 gtcaggagac tataagagac caactgcttc aagtttcatt taacacatgg gaaactaagg 31321 cgagagaaat ttcaagactt gcccaagatt agacctcttg ttaagtaatg aaagtgtttt 31381 aaaaacaggt gggtcaaatt ctgtttttaa aatttccatt atgatgaaaa tttcagtatt 31441 acaggcttcc aaatcccagc agatgggcca cttgtttaaa ggagagtttg atataataaa 31501 gcatctaaaa acaagagttt ggataattcc ttagggttgt tatgatgtga tttgacttat 31561 aattggaaat accgttttat tcattgtact gattttcatt tctctttttc ttctagaatg 31621 tcttgattgt ggaagtaagt tcacatttac ttttaatata acatttatga cttttctaac 31681 ttagtatgca ccatcctaaa ggtaagccag ggagagaaat tcctctgcat cagttttaat 31741 ggtgggcttg tgttctaaag gagtgagatt ggttttttgt aaagactact tagtaatttg 31801 tttttaccaa taatggaatg gtatacttcc tacctctctt tttttagttt gaagtatttt 31861 ctttctaaac ataactctct ctctctattt atctatatat aatatataca tatatatctt 31921 atattttatg tatatatata tatatcttgc ttagattttg tcttatgtaa tatttggtac 31981 ataaaaaata atatttataa tttatagact attttccatg tgttattatg tgctaaagta 32041 ttttgtatct tagcaccgag aggctaagca gtttcctagg gttaccagct agtaaactaa 32101 gggaaacctt tacttccttt agctcagtgg ttctcaaaat gtggttccct agaccaaaag 32161 tattaatatc agacaagaac ctaccgaatc aaaatatctg tgatgaggcc cagcaagcta 32221 tgctttaaca agtttccgag tgattctgat gcatgctaag gtttaggatc ccttgttttt 32281 actcataagt cactttctca ttaaggcctt ccctggccat cctatataaa atctcatgtt 32341 ttcacaccgt caacttcgta ttcctcctca atacttttat tttcctgatc acttatcact 32401 aacagcctct ctctctctct ctctctctct ctatgtatat atatatatat atcacttatc 32461 actgtctaac agcctctctt tatatatata taatctatag attatatata tatgcagcat 32521 tgtgcaatca ttatcacgct caattttaaa acattttcat ttccccacaa agaaacccaa 32581 tccccttagc catcactccc aattttccct tcccccagca cctagcaaac tgatcatcta 32641 cctacttgct gtctataaga tttgcctatt ctggacattt tgtataaata gaatcataca 32701 atatgtggcc ttttgtatct ggcttctctc acttaatgtt ttcaaggttc attcatgttg 32761 tggagtatat ctgcactcat ttccttttta ttgccaaatt gtatggatag acaggtgttc 32821 ctcaactgtg tcctgataaa cccatctgaa gttgaaaata tcataagttg aaaatggatt 32881 tactactttg ataaatctat cctaaagtca gaaaaatctc atgttggaac catcgtaagt 32941 tggataccat ctgaattaca tttttgttat ccattcactg gttgacagac gttaggttgt 33001 ttccactgat gctccttatt tctcgtacct gaaatgtcct tattccctcc cttcttatcc 33061 catgtttaag tcatttaaga cccagctcaa acgtcacctc cacaaaacct tccttgatac 33121 ccctttcctc ttcaattcac ttggaccttt tgcatttaat tttaattttt atttttttta 33181 agacagagtc tcactctgtc accaggctgg agtgcagtgg tatgatctca gctcactaac 33241 tactctgcct cccaggttca agcaattctc atgtctcagc ctcccaagta gctgggacta 33301 caggtgtgcg ccaccatgcc tggctaattg tgtgtgtgtg tgtgtgtatg tatgtatgta 33361 tatatgtgtg tgtgtgtata tatatatata cacaaacata tataaatata tatacatata 33421 tatatataca cacatatata aatatatata catatatata tatacacaca cacacacata 33481 tatatatata tagttttttt ttttttaagt agagatgggg ttttgccatg ttggccaggc 33541 tggtctggcc tcaagccatc ctcccacctc ggcctcgcaa agtgctggta ttataggcat 33601 gagccactgt gcctggcctg catttcattt taattataaa atattttgaa ctcagaaaaa 33661 agggtatgct gaatacctac gtacccacaa aagtattaac attttgccat atttgcttct 33721 gatcttattt tttttgagaa attaaagatc ataatacaac taaagcccca tttctttccc 33781 ttcattccca gaagtatgac aattatcctt aaagttgata tatatcattc ccatgcatgt 33841 tttttatact tccctagtac aagttagctg tatcctctgc tcaggggctc atcaagctga 33901 atcaagggac tcatgatcct cttcaaagtt ccttcaggtt gttggcagaa tttagttcct 33961 tgtgattgta ggactgaggg cccgttttct cactggctgc tggccagggg ttgctcccag 34021 atatttaaag gctcatgccc tagcccatga cagtctcaca acatggcagc tgacttcttc 34081 aaaaccagca ggagaatctt gctctagtct accacataac ctaatcacag gagcggctat 34141 cccgttattt tcacagatcc tggtcacatt caaggggagg gaacccttct gtgtgtgtac 34201 accaggaggc aggaattttt tttttctttt tcttttttgt taaaaagtct taaagtcttt 34261 tatccctaaa ggaggcagga attttgagag ccatcagaat tctgcctacc acagcccaga 34321 aatctgcatt tttcacaagt ctccagccat gatgtttctg atggctcaca ctgctttatt 34381 ccatttttaa agagtatttt tattgaaaag cattagggtt atggtttaaa aaatattttc 34441 cctaacaaag atgggtttgt ttagagtcct acttttgact aaatagctga gattcacttt 34501 tatgtaaagt tcattttata gcgttattaa tttgggtgcc tttaaaaata gtataaagca 34561 tgtttctcga gtgtagtctg ttagccacct atattggaga gttgggagga gagagtctct 34621 atcttgaatt tatgggaaaa attctaaaat actttttata atgaaggaca acatcataac 34681 tccctaataa aatgtgcatg tatatattca aatttgctgt cattgatcct gcacctacaa 34741 aatccagtcc tgggggctgg cattcttact gcttgctgag ggccagatga tatagattcc 34801 agaatatctc catgtagatt ttggtgagaa ttactgtgct gaaaagaatg acagtattgc 34861 agttatacat gggggttttg gtactttata ttgtgactct gaatttaaag ctatgcaatg 34921 tcttcttttt tgaaaggata taattgacac tggcaaaaca atgcagactt tgctttcctt 34981 ggtcaggcag tataatccaa agatggtcaa ggtcgcaagg tatgtatgac attttgacac 35041 agaatatttt cctcatttga agggggatta agtgattgct tctttttaag gataaatgtt 35101 ttcaactgtc attttatctt cgaaaagtaa tgtaatctca tataagactt aagatataat 35161 ccttttaaat aattttgtca tgtgttaata aagctcataa ttacagtcac ttccttgcta 35221 atattaacat ttggttttca gcatgctaat tatatcagtt tgtcctgaat agcatggcag 35281 aggattttgg gcccccttgc aaaattaaga ataaggattc caaagcgggt gaggaagtga 35341 taggaagggg tgggccctga agatctggac ctcctggaat tgagtgatga atgctgcatc 35401 ttctttgtgt ctgtagtgaa attttataat gcctgcttcc ttttttatta agtcggcctc 35461 acctcctcac cttacctatg ctgttttact tttgctttta tagttctacc tgtgtttatt 35521 tctcattttc gtttcatctc tcaacaactc tggggtggca ttattattcc cacttttcag 35581 ataaggttac tgaggcatag ggaattgtcc aaaggtacag agctagtccg ctatagagat 35641 gagatttgaa cccagggaac ctggctcaca gtttatgctt ttgcctacct taagttttta 35701 atagagtgac atcaaacaaa catttaagaa tatgtttttc ttttcctttt ataatttcat 35761 taaaaacatt aagtctctga tcagtctgca gtttttatgt aggggtcagg taatgttcta 35821 acttctgctt tttcctaagt gattaacagg tttttataag cccttttgaa aaaatcacgg 35881 tatctgtcga gcatctttga atcagagtaa gccttctagt gagtcatatg tcagcagttt 35941 gactgtatgg gcttttctaa tatccagttc aagtgtttat cagtgagttt ttcttttaaa 36001 tagatttggg acaggtacta tgagagtata taagtgatac gttataggac actaactagt 36061 atcctatgaa atggcaaaaa ctgcaatcac ttttgcacca accaaataga aactaatcag 36121 tgcacttgct tatttttcta catgctcttt agggttttaa atgtcaacct actgtggcat 36181 agactttaat cctctgggta ttcttttgtt gttctttcct ggtatatgct gtggaattga 36241 gatagactgg ttcgtgagcg agagattttg tgttgccaca ggtaggacat gctcaaacaa 36301 tacttgggtc atttcttgac ccaagtcatc tattcaccat agttttgtag caccgatctt 36361 gcatacattt catgtatctt ctttgaaccc cacgtcagtg ctgcttatat gatactcaga 36421 aattaaacac taaggaataa gattttcagg taggattgag ttttggaggg tcacaaatct 36481 tgtaatgtct aatatttcca ctctccctgc tgagaattag ttttggcttc cttggaggtg 36541 atatcgcctc tgttgagtat aagtggccta ctgtgatcac accactgcac tccagcctgg 36601 gtgacagagt gagaccctgt ctcagaaaaa aaaaaaaaaa aaaagaatgc atggcctaga 36661 tgacttctaa ggtttttccc acccagttcc agttttcatg ttctaggcag agcagtaaag 36721 tgagaaacac atggacttgg gagtttagtc tcgcatttca ctgccactta atctgagcga 36781 ctattccata tttaatctct ctgaatgtat ttactcatct ttaaagggga atgattatta 36841 acatcttttt ctcagggaaa ctatatgagt caaggagata atatatttga aaatcttttt 36901 aactgcaaag cgctgtttca ctgttggtta taatgtgatt gatctcattg tagtgagcag 36961 ctgcttaatt gcgttttaga atgtagggaa gatagtaata tttttcacat tatatatgta 37021 gctggttctg gaactgtaaa catactcctt ttttatggag atctgagtca cgtaccataa 37081 aattcactct tttaaagttg tacaatccag tggtttttga tatattcaga gttgtgcatc 37141 tgctaccact atttcatttt ggaacccaaa gaaaccttgt acccattagc agtcattctc 37201 ccttctccca gcccctggca actactaatc tactttctac agaaagtccg tacagatttg 37261 tgtattatgg acattccata taaatggact catgcaatat cctgtcttct ttcacttagc 37321 atagtgtttt caaggttcat ctaggttggg gcatgtatca gtacttcatc ccttgttttg 37381 gctgaataat atttcattgt acaaatatat cacattttgc ttatccatct gttggtgaac 37441 atttgagttt ctacctgttg gcttttatga ataatgttga tttgaatgtt tgtgtacaag 37501 tatgaatacc tgttttcagg tctcttgagt atatagttgc taggtcatat agtaactctg 37561 tgtttaacat tttgaggaat tgcccgacta tttaacaagg tatatgtact gttttacacc 37621 agtaacatat gagggttcca atatctccac atccttgaca acacttgtta ctgtcctttt 37681 tattgtagcc atcctagtgg ctatgatgtg gtatctcatt gtggttttga tttgtgtttc 37741 tctgatgctg atgatgttga acatgttttc atctgcttat tggccattta catatatctt 37801 cttaagaacg gttacccatt tacagtatgg aaaatgcttc agatgcaact ctagtcatgc 37861 cttagagatg gagctttatt aaacattcag atctctaggc atatgaagtg ctgagttctc 37921 ttgaactcct aatacagatt gcactgagtt tagtgatacc ttttctggag cattcctgag 37981 ttcaggtagg gagaagggtt tttgctgtga ttggcttgtt atgttctttc taaatggaaa 38041 tagaattgaa gtgtctcctc tctccattta ttggaagagt catgagggac ataattagat 38101 gatcccttgg agtctccggc ttaggtcagt ggttatctac ttaggctgca cattggaatc 38161 acctgagagt taaaaaacca ggataacctc tgcctgtgtc tcatctccag caattctgat 38221 gtaattggtc aggctgtggc ccgagtaggt gagttctggt tttttaaagc tcccaggtga 38281 ttctgatgtg caatccaggt tgagatcact ttgggccctt tccagctctt taaacatata 38341 tatttatcta ggaaggtatg aaagcataag ttttcttgag actgccttta acatctgtaa 38401 aggctttcaa agcagcttct gtagtttttt ttaaatggct gaatattttt caacaggcag 38461 catttgggtt ataaaattag cttttggtag agttgactta taccacctcc agcttttgtt 38521 ccaaaaataa atactggttc ttttggcaca ctagttgttt taccctaaag ttcctctttg 38581 taagccagtt attaaaagtt gtgatgcagc cagggcgaag tggtacacat ctgtagtccc 38641 agctactcgg aaggctgagg ggggaggatc gctagagccc aagaagtcaa ggctgcagtg 38701 aactgtgatt acaccactgc actgcagcct gggccacaga gcgagactca tctctttaaa 38761 aaaagaatgt tgtgaggccg ggcgcagtgc tcacgcctgt gatcccagca ctttgggagg 38821 ccgaggtgga cggatcacct gaggttggga gttcgagacc agcctgacca acatggagaa 38881 accctgtctc tactaaaaaa aatacaaaat tagccgggcg tggtggcaca tgcctgtagt 38941 cccagctact cggcaggctg aggcaggaga atcgcttgaa cctgggaggc agaggttgtg 39001 gtgagttggg cgagccattg cactccagcc tgggcaacaa gagcaaaact ccatctcaaa 39061 aaaaagaaaa gaaaagaaaa gaatgttgtg gccaggcgcg gtggcttacg cctgtaattt 39121 cagcactttg ggagaccgag gtgggcggat cacgaggtca ggagatcaag accatcctgg 39181 ctaacacagt aaaaccccat ctctactaaa tacaaaaaaa aattagccgg gagtgctggc 39241 gggtgcctgt agtcccagct actcaggagg ctgaggcggg agaatggcgt gaacccagga 39301 ggcagagctt gcagtgagcg gagatcgcgc cactgcactc cagcctgggc aacagagcga 39361 gattccgtct aaaaaaaaaa aaaaagaatg ttgtgataaa aggtgatgct cacctctccc 39421 acaccctttt atagtttagg gattgtattt ccaaggtttc tagactgaga gcccttttca 39481 tctttgctca ttgacactct gtacccatta atcctcctta ttagctcccc ttcaatggac 39541 acatgggtag tcagggtgca ggtctcagaa ctgtccttca ggttccaggt gatcaaccaa 39601 gtgccttgtc tgtagtgtca actcattgct gccccttcct agtaatcccc ataatttagc 39661 tctccatttc atagtctttc cttgggtgtg ttaaaagtga ccatggtaca ctcagcacgg 39721 atgaaatgaa acagtgttta gaaacgtcag tcttctcttt tgtaatgccc tgtagtctct 39781 ctgtatgtta tatgtcacat tttgtaatta acagcttgct ggtgaaaagg accccacgaa 39841 gtgttggata taagccagac tgtaagtgaa ttactttttt tgtcaatcat ttaaccatct 39901 ttaacctaaa agagttttat gtgaaatggc ttataattgc ttagagaata tttgtagaga 39961 ggcacatttg ccagtattag atttaaaagt gatgttttct ttatctaaat gatgaattat 40021 gattcttttt agttgttgga tttgaaattc cagacaagtt tgttgtagga tatgcccttg 40081 actataatga atacttcagg gatttgaatg taagtaattg cttctttttc tcactcattt 40141 ttcaaaacac gcataaaaat ttaggaaaga gaattgtttt ctccttccag cacctcataa 40201 tttgaacaga ctgatggttc ccattagtca cataaagctg tagtctagta cagacgtcct 40261 tagaactgga acctggccag gctagggtga cacttcttgt tggctgaaat agttgaacag 40321 ctttaatata caataattgt tgcattatta tttcagatga taaatgtggt cataagtaag 40381 aaataaatga tcgagtttag tcttttaatt cactgtcctt tgaatacctg cctcttactc 40441 tggaggcaga agtcccatgg atgtgtttat gaacatggtt gaggaagatt taggaagact 40501 gcaacagtac actacctaaa gcaggttttt tactccatct ttttttgcca cgtacactgg 40561 cctcccactt tgatatgctt gaaattatct ccttgatttg tctttcaaaa ctacatattg 40621 aggctggttg cggtggctca cacctgtaat cctagcactt tgggaggcca agccggacag 40681 atcacttgag gtcaggagtt cgagaccagc ctggcaaaca tgatgaaacc ccacctttac 40741 taaaaataca aaaattagcc aggcgtagtg gtgtgtgcct gtaacccagc tacctgggag 40801 gctgaggcag gagaatcact ggaacccggg aggcagaggc tacagtgagc caacatcacg 40861 ccactgcact ccagcctggg tgacagagca agactctgtc tcaaaacaaa acaaaaaaca 40921 aaaaactacg tattaagaca agaaacagac tgggcgcggt ggctcacgcc tgtaatccca 40981 gcactttggg aggctgaggc gggcggatca caaggtcagg agatcgagac catcctggct 41041 aacacggtga aaccccgtct ctactaaaaa atagaaaaaa ttagctgggg tggtggcggg 41101 cgcctatagt ctcagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc 41161 agagcttgca gtgagcagag atcgtgccac tgcactccag tctgggtgac agagcaagac 41221 tccgtctcaa aaaaaaaaac aaaaacaaga aacaaattaa actaatgtga tagactactg 41281 ctttgttttc aaaagataca ctccccaaaa gttactgatc taaatacagt agtactatct 41341 ctgtttagta agaaccctga caactaatag tgttcttata tgtaaaatgc tattcttgcc 41401 tttcatttca gaatatactt tttaaatgtg aatttctgga ttttttttta tagcatgttt 41461 gtgtcattag tgaaactgga aaagcaaaat acaaagccta agatgagagt tcaagttgag 41521 tttggaaaca tctggagtcc tattgacatc gccagtaaaa ttatcaatgt tctagttctg 41581 tggccatctg cttagtagag ctttttgcat gtatcttcta agaattttat ctgttttgta 41641 ctttagaaat gtcagttgct gcattcctaa actgtttatt tgcactatga gcctatagac 41701 tatcagttcc ctttgggcgg attgttgttt aacttgtaaa tgaaaaaatt ctcttaaacc 41761 acagcactat tgagtgaaac attgaactca tatctgtaag aaataaagag aagatatatt 41821 agttttttaa ttggtatttt aatttttata tatgcaggaa agaatagaag tgattgaata 41881 ttgttaatta taccaccgtg tgttagaaaa gtaagaagca gtcaattttc acatcaaaga 41941 cagcatctaa gaagttttgt tctgtcctgg aattatttta gtagtgtttc agtaatgttg 42001 actgtatttt ccaacttgtt caaattatta ccagtgaatc tttgtcagca gttccctttt 42061 aaatgcaaat caataaattc ccaaaaattt aactgcttta tgaattcaat ttaaaaatcc 42121 ttaaaataag tcctgtctct ttaaaaaaac ctatgcatag ttatcatttc tctacaaatt 42181 aacctagttt agttttctgt tggttccatt ttccttgttt gttaagtttt agtagctagt 42241 ttaattgtaa tctcaatgat tatgtggtag aatgggttgg cggacgtaca aaaattccta 42301 gctacttcag agacattaaa tttcagacac atggtacact ttatattaca ttttactatg 42361 ctaaaataac acggctttct tttggaattc tgttcagttt ttcagattgt aatctcagct 42421 acatctcaac agattgttct cagatatgtc ctattacctt ctttgtgtag atagtgcttt 42481 attgactaag aacaatgaca acaacacctt ttgttttctg ggaataggag aaaagtttta 42541 agccaaaact cttaattgct tatctgctcc acgtgaggta tgaactatca aacttaggag 42601 ccatctagct tacacgtgtt ccttaaaaag tttgctgtag gccgggcaca gtggctcgta 42661 cctgtagtcc cagcactttt gggagcccag ggtgggggat cacttgagct caggagttca 42721 agaccagcct gggcaacatg gcaaaacgcc atctctacaa aaatacaaaa aaaaaaaaaa 42781 acgctgggtg tggtggcgca caactgtagt cccagctact tgggaggctg aggtgggagg 42841 attgcttgag cttgggaggt gaaggctgca gtgagccttg acagtgccac tacactccag 42901 cctggatgac agagtgagac cctgtctcaa aaaaaagagt ttgctgtaat tcccagcaac 42961 aaagtaggag actcaaacta aataattttc tatagtccta gaacttctta gtttacaaaa 43021 catttttact tctgttatct catttgatct tcatacccat gtaagggttg aggtagatgt 43081 taccacatgt gagtgcaata tccagaactc tgaatccctt cttcccctaa aatgtcagcc 43141 cgctgaggtc cacttggcta ccctcttgaa tactgcatcc agcttcccac tgctgaacct 43201 ctttactctt tttttttcag ttgcacttac cgccttctag taagttgaac catatgaaat 43261 taccattttt gcaggtaaaa aatggccggt gataggcagt ttggcgtcgt ataacccaat 43321 aacatgttat ataatttacc cacaagtggt gggttgctat gtcctggagg agtcagcttc 43381 agactctagc taaatgattg tataaccttg cagctctccc ctaagtgagg aggcaatgtt 43441 gaaagtccca tgtcttatca gaaccaggga ggcagatgag aaactgcctt atggcagctc 43501 ccacaacata gggaggtggg tgacaaatgg ccttgggaca gcttcttccc aagactggtt 43561 atgttacagt gttcctggga ggatcacatg gcattcctcc aagatgggtc agactgctgt 43621 tggccttgtc tgtgtggcgt atgtgaagac attcatggcc agagctgttc ccttagaagc 43681 atctactaaa ttgatctttt cctttcttac ttactgtctg tctcccttag taggctgtca 43741 gctccgtgag tgcaggacct tgccagtcct ggtcactgct atatccccag cacctacaag 43801 agtgcctgga aaattgtagt gctcaataaa tatttgttgg ataaatgata gaatgatagg 43861 aagttaaaaa gcaattaaaa tacttgaaaa gaagcaaaac atttttcatg ttaagcaaaa 43921 aaaaaaaaaa aacttattaa ggatagctaa catgtattga attctatatg caatggaatg 43981 atacttagcg cctttgaatc cttatgataa ccctataagg taggttgttt gggttttttt 44041 aattgtccca actttacaga tgaagaagtg caggtccaga gaggtcacat aatttgccca 44101 ggatcacaca gctagtaagt agcagatgag gaatttgaac ccaggcagtt gtattccacc 44161 atctgccctc ttagttcatt gccacttaac ctataatgcc cagctcttgt gtagaaatta 44221 atacactgat aacatagagg aaaacattaa gctcattgaa tgtaataagt ccagatgact 44281 tgtacattaa acacagcttt ttgaggtcac agctgatctc taagaatgta aactgatttc 44341 ctctggcact aaaaagcatt ttcaaagact gttaagagag tttctccaac attctcttca 44401 gatttttctg ctggcttatt ttatgattct gtggacagct tcagacaaaa taactttctg 44461 gtatgaagga ttgtgtttac tctgcttttt tttttgttgt tttttgggtt ttttgttttg 44521 ttttgttttg tttttgagac agtgccttgc tctgctgctg cccaggctga gtgcaatggc 44581 atgatctcgg ctcactgcaa cctctgcctc cctgggctca ggccaggtgt atgctaccac 44641 tctcagctaa tttttaattt atttttttag agatacggtc ccactctgtt tcccaggctg 44701 gtctcagaac tcctgggctc agacagtccc ccgccatggc ctcccacagt gctgggatta 44761 caggcatcag ccaccatgct cagcttgttc tgccattttc aaatgtgaat tttatagaca 44821 ctttaaacca cttgaaagag tgatgatgtt ttaatgattt tcattattat ttgcaacttc 44881 aagcattaaa cactgccaaa ttaagtttca agttttctct ttacacaata tggatgtact 44941 tcataatgga cttcctcatc atgattaatg agtgaagtga cattcaaact tggtagcttt 45001 tcagtagaac ttcctttccc aacatttttt ctgttccttt aattatggca atatctgaga 45061 gctctgaaca taagtcaaag gtttgattat ttttcatgtg gcttcctctg cttggaactt 45121 tctgccccgc atcttcccgt tgccccctgt gtcctcttgt catgccccta cccttttttg 45181 agtgtgtcta ttttctggca ctacaagaca taacaggctc atcttgtgtt ttccctaccc 45241 tgacccagaa tcagccatta cttcaaggag ccctggttcc attattggag aatactatta 45301 gaaaccagga tctggtgcta ggcatgctca tttctattgg agtgtcatac aaacaatttg 45361 taaattgttt gtaggtcctc ccagtggata ggattaggaa ataaaacatg catactaacc 45421 atgcatacac acacatctac gtctatttct gtatctgtct gtatacatat taaaataaac 45481 atgggttgat aactaatgtt tctgctgtaa tccacagcct tcatcctagc ctgccactct 45541 tcttcttttt agctttttca acagtgggaa atgtggctct tgttatgtac actttattca 45601 cttatttgtt tgaccctagt atcataaagt agttccgtat gcctgtaaca gatcgactaa 45661 ctagagtcca ttatttgcgg aaagatcttt ttgtccgaac gttaccgcag gggtgtccaa 45721 tcttttggct tccctgggcc acactagaaa aagaagaatt gtcttgggcc acacgtaaaa 45781 tacactaaca ctaacgatag ctgataagct aaaaaaaaaa aatcaaaaaa atttcatgat 45841 gttttaagaa agtttactaa tttgtgttgg gccacgttta aagccatccc gggtcgcagg 45901 ttggacaagc ttgccttaca gtatccagtc aaaataatgt tttccaaaat tacttctttt 45961 ctttttcatc cctttcagtg tggccgttat ttataatgca gtttggttca ttagtgtttt 46021 tattacaaat acaccctcag ccttcatatc ctagttttaa tgaattatta cggtgaaaca 46081 taataagagt cagagctata cagaaaggtc tactcagagg tgctttgttc cctcctattc 46141 tgttcccact actcctactt tccactgacc ctgtaagcat catatttatt tttaatggca 46201 gttacatttt taccaagtgc ttactatctg taggcacttg gtgtgtattg cttcttttgg 46261 tgttcacagc aacctcttga ggtaagcact attattatcc cccctttttc ttttttcttt 46321 ctttctttct tttttttttt tttttttttt gacagtctta ctctgttgcc caggcaggag 46381 tgcagtggcg caatctcggc tcactgcaac ctctgcctcc caggttcaag tgattctcct 46441 gcctcagcct cctgagtagc tgcgagtaca ggcacaagcc accacgcccg gctaattttt 46501 gtatttttag tagggatgag gttttgccat gttggccagg gtggtctcga actcctgacc 46561 tcaggtgatc tgcccgcctc gacctcccaa agtgctggga ttacaggcac gaaccactgc 46621 acccggctat tatccccatt ttttagatga gaaagctgaa tcccagagag cataagaagc 46681 ttgtccagag tgacatctct gatgcataac cagtactcaa acctattttt ctgacaccaa 46741 ggcctgtgtg taaactgtaa aggggctgct tggcacctac tttcctaaag ttgtcctatc 46801 ccttctctgt ctgggtcttc ctgaagcttg gcacttctga agtcacctct ctgaaaacat 46861 tctggtaact gttaaatccc ttgttctagc tattcatgtg ttctgtgtgg ttaaacaagg 46921 ttcacaatgg ccacctggcc tttggaactt gggtgaagag gctgccttca gttgatcctc 46981 cccactccca ttttcaaaac atgggtttac atgagttatt tgtgaattag gaaacataac 47041 catgttttga gccttcatag aaaacaaacg tctggggtca tacaggttaa aaggagtaac 47101 caaattcggc actatcattg ttctattcag tagacaattc tggggccttt ctgtgtctca 47161 ggttctgtac tagttgtttc aggactttgg gataaataca aactatccct gccctcaggg 47221 ggcttaaggt caggtgtaca agtgactcta atgtgaggca aggctggatt cagtgctgca 47281 tatctaatgc tatgggaatt caaagaggaa gtgatcagaa tgagaaggga gggatggatc 47341 attccaggag aagcttcagg gaaaagcaac atttaaaatg agacttttga gagtgaggga 47401 aatttggaca ggtggatata gaggatgcaa ggctagagga aaggttttag ccagaaagtc 47461 tgcttgggca aatgcctggg taaaaaaaga aaatccactt tgggaggaca aggcgggcaa 47521 tcgcctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact 47581 aaaaatacaa aaattagctg ggcgtggtgg tgggtgcctg taatcccagc tacttggaag 47641 gctgaggcag gagaatcact tgaacccagg aggcagaggt ttcagtgagc cgagattgcg 47701 ccactgcact ccagcctggg caacaagagt gaaacatcta aaaaaaaaaa aaagaaaatc 47761 acagggcagt gtggggaatg gtgagtattc taatttggtt gtggcagaga ggatgtagaa 47821 ggaagtgata agagagaaag ccggatagga gggcctttgt gccagttagg atgttctaga 47881 cttccagcca ggttgcccag ctcaaactgg cttaaacaat gagggggttt attggctatg 47941 taattgggaa gtgcagaggt agctcaggcc agatcagttt gatccactgc tccattatga 48001 tgtcaaagac ccatgcgatt tccacctcat tattctgctg tccatagagc caacttcatc 48061 ctaaggccag tccttgtggt cagacaaggg ctgccaatag taatctgggt gcaagtttct 48121 ttgagaaaat ctttctgtgt caactctctt aaaaggggtg aaaaatctct ccttaagtcc 48181 cactggccag aatgggccca tgcacccatt tcttaaccag tcactggcaa ctgggggtgg 48241 gattgccgtt tgcccaatca ggtccatttc tggagctaag attaaactcc atttcccttg 48301 ggacacattg aacagaatca gaattcgatg aagaaggaag aagcggagaa ttggtttggt 48361 gttgggtagg caaccaaaaa taacctctgt tgcctcaagt gccaagaaag tggtgttttg 48421 tgcttgttag ggtaaaaatg gggatcatgg aaaatatttt aagtttcata gaccaaaaaa 48481 tattccagtg tttcatcaaa tctaagaggc tatcaattat aagatatacc attattttat 48541 gtaccaccaa ggaagaaaaa atgctgccag tgaagttagg atgtattgca ggttgggttc 48601 tctgggaagc aggctgaaaa ggaggtgaga atgcaggaca tttatgggag aacacccttg 48661 ggattaatac tggaggagga gaaccaagca gggttggtgg ggcacaggga gaagttggga 48721 tgccatgcag tcacaacaaa ggcctcagcc aaccccacgg ggagctcgag aagctgagat 48781 ggcccttcag tgttgccctg ccttgtggtg agtgaattgg gtcttcatat ccccatgttg 48841 actggtcatt ggatgtgggc tcccttagga atgggcatct cttcagcaga ggtagctttc 48901 ttcaaaagag gtgattccaa agagtcaccc actcactgag ggctgtctgc tggcagcatt 48961 ctcagccact actcaaagat gacctgtcca ggaaggggaa cctaggtggc atgacacatt 49021 gtctattaca acatgctact gattataaga gccgggaggt ggggggcaac acaatgtctg 49081 agatattaaa atggaagtct cttagaagaa atggataatt ctataattat agttaatcag 49141 aaaggggaag aagtggggaa atggaccaag ggcctgagag agaaaacaga cgcaacaggc 49201 cactagaaag ataggacact ggagggtggg aagccctagc agtttcttcc agggtgggct 49261 gggcacggtg gctcattcct gtaatcccag aactttggga ggccgaggcg ggcagatcat 49321 ttgaagtcag gagttggaga ccagcctggc caactcctgt ttcaccctgt ctctgccaaa 49381 aatataaaaa attagccggg tgtggttgca tgcgcctgta atcccagcta cttgggaagc 49441 tgaggcagga gaatcgcttg aacccaggag gcagaggttg cagcgaggaa aaatcgtgcc 49501 actgcacttg agcctgggtg acagagtgag actgtctcaa aaaaaaaaaa agtttcttcc 49561 agggtggctt ctgtgccaga gtcaggtgcc ccagctacct ctaatttatg gtcctcctgc 49621 actgggaaac agattttcta cttttggttt catgataaat aacatttccc cctgatttta 49681 aaagttatgg atttggctgg gcatggtggc tcatgcctgt aatcctagca ctttgggagg 49741 tcaaggcagg cagatcactt aaggtcagga gttccagacc agactgggca acatggtgaa 49801 aacccgtgtc taccaaaaaa aaaaaaaaaa aaaaaattag ccaagtgtgg tggtacatgc 49861 cagtagccct agctactcag gagactgagg tgggaggatt acctgagccc aggagatcag 49921 gcctgcagtg agctgtgatt gtgccatttt actccagcct gggtgacaga gtaagaccct 49981 gtctcaaaaa taatagtaat aggctgggcg cggtggctca agcctgtaat cccaacactt 50041 tgggaggcca aggcgggcga tcaattgagg tcaggaactc aagaacagcc ttgccaaaat 50101 ggtgaaactc cgtctctact aaaaatacaa aaatgagccg ggtgtggtgg cgcatgctgc 50161 attcccagct actcaggagg ctgaggcagg agaatcgctt gaactcggga ggcagaggtt 50221 gcagtgagcc gagattgcac cactgcactc cagcctgggt gacagagtga gactccatct 50281 taataataat aaaataataa aaattttaaa aagttatgga tctggatgga gggaaatgga 50341 atgtataaaa gaagtaaaca tacacaagaa gatacaaata cagaataaaa gtaaaatgca 50401 accatcatcc cactaccccg ataccagggt atccgttttt acatcttttc tttcattctt 50461 tctgtcttta tataattgta taaatgctgc ataaacctcc tcttgcctgc tgcctcctca 50521 aagacctccc tccctccttc actgcccttc tgctcctgga gagccaccct ctctccattt 50581 atccttccta tcagcttcag gttcttacca tgttaacaaa aagaaaatct tataagcctg 50641 tcactctcta catacgccgc acctcctttc attcatagcc tttaaaacat atatatagca 50701 gttattgtgg ttatttttct gttcacaaaa taaaaaaaca ctctttctag aaaactggaa 50761 tatagaggca agcttttttt ttttttcaga cggagtttcg ttctgtcgcc ccaggctgga 50821 gtgcagtaac gaaattacag cttactgtaa cctctgcctc ctgggttcaa gatattctct 50881 tgcctcagcc tcctgagtag ctgggattat aggtgcctgc caccacaccc ggctaatttt 50941 tgtattttta gtggaaatgg ggtttcgcca tgttggtcag gctggtctcg aactcctgac 51001 cttgtgatct gcccatctcg gccttccaaa gtgctgggat tacaggtgtg agccactgca 51061 ccctgccgag gcaagatttt tttttttttt tttaagaaaa cccagttatt ccattaccca 51121 atgaaactct aaacatgttg atgtacatcc ttccaaaatt tctttttatg acaacatgct 51181 ttttattttt aattattttt attttatttt aaggtccggg gtacatgtga aggatgtgca 51241 ggtttgttac ataggtaaac gtgtgccttg gtggtttgct gcaccctgtc aacccatcac 51301 ctacgtatta agccccacat gcattagcta ttgatcctga tgctctctct ccctgctggc 51361 tccccagcag gccccggtgt gtgttgttcc cctccctgtt tatgagaaca ctttcttgac 51421 ataaagattt catttattcc catggaattc taaaggcttt tcatacttgt gaaggaataa 51481 tagtttagaa ataaactgaa ctttaaaaga taccattttg aaaaataata tacagccatc 51541 aaaaattata tttatgggaa ctatgcaata atattaaact ctatcatctg ttgactgcct 51601 cctatattcc agaaacttta catacaccaa ttctaatcct tacaagaacg ctgtgtaggc 51661 tttagcatta gatggaccag gtttcaccaa ctgtatggtc ttggataagt acccaacctc 51721 ctgtccctaa gtttcctcac ctgtgaaaac acggtttcta ccagctttca aataagatga 51781 tcaatataag gcacttggaa cagaacctga cacatcataa gcactctata aatgtctatt 51841 atcaccaaat aattccaggt gccttgaaaa tttaaatgaa aaacaaaatc aaaccatgac 51901 aatactagaa gcaaatttag gtgaacactt ttctaatccg ggggtgggcg ggggctgggg 51961 ggaggcaggg agaagacctt ttttttttct ttttgagatg gagtcttgct ctgtccccaa 52021 gctggagtgc agaggcgtga tctcagctca ctgcaacctc tgcctcctgg attcaagtga 52081 ttctcctgcc tcagcctccc gagtagctgg gactatacag gtgcacacca ccacggccag 52141 ctaatttttg tatttttagt agagatgggg tttacaccct gttagccagg atggtctcaa 52201 tttcttgacc tcgtgatccc atccgcgttg gccttccaaa gtgctgggat taccagcatg 52261 agccaccgtg cccggctggg agaagacctt tctaagcatg ataccaaagg cagagacaat 52321 aaaggcaaag aattgacaga attcactatc cgataaaaat cacttctgtg gccgggcgcg 52381 gtggctcaca cctgtaatcc cagcactggg aagccgaggt gggcggattg cttgaggcca 52441 ggagttcaag accagcctgg ccaacatggc aaacctcctg tctctactaa aaatacaaaa 52501 aattagctag gcatggtggc atgcctgtag tcccagctac tcaggaagct gaggcatgag 52561 aatcacttga acctgggagg tagaggttgc agtgagccaa gatcatgcca ctgcactcca 52621 acctgggtga caaagtgaga ctctgtctca aaaaaaataa caattaaaat aaaatcactt 52681 ctgaatggtg gaaagcacca caaagttaga ggtcaagcaa taatttggag aaaagaatta 52741 gtaatttgtt ggacagacaa aagacttttt taatataaca aaaactttaa aaattaaaaa 52801 aatacacatt cgaggacatt ttcctaaaaa cacaggcaaa ggacataaac agcaaagcaa 52861 gaagacagct tgatgtggcc attttatcca gggggacatt ttggtgagcc ctatggacac 52921 agctgccatg atgccaacaa tgtgacagct gtccccttca aaatgcgtta gccccagctc 52981 ttcctctccc ccaacctcca gtccaaagga cttgcacttt ctactttact cctttctgca 53041 ttgtttaatt ttcttttaca aatatgttac ttgtcatcag aaaaaataaa gaaataaata 53101 aactgttaga gtgttagccc cttaaagggg agcaagaatc acctttctaa aagaaagttt 53161 atgttaaata taatattagc atatgtgaat cctgagagaa aagttaacag tttagttgag 53221 ttatttcctc tgtagtctgg agctaaaaat agggaatctt attctgtcct aaatcttttc 53281 cttcctccac ccagtgtctg tctggatcga attcattcat tcactcagta ggcactcact 53341 cagccaggca tggtgctagg cctcaggacc tcgctgtgaa ccagaaactg tccctacccc 53401 catggtgcag gcattctgct tgggagttgg aggaggaaca ggtaaaaaat aattaaatat 53461 tcaggttaac gatatattgt caggtttgag gattgaggaa agggcgcaga gagtggcaag 53521 ggctgctgtt tagatacagt ggccaggagg ctccgatgag gtgacctttg aggagagaca 53581 tgcaggagat gaggggacag tgaagaggat ttctaagaac actccaggca gacagaacag 53641 cgacagccaa ggccctgaag tgggtagggg cctggtgtgt gtgaggaacc tcaggattgc 53701 catcatggct ggagcagaga catgaagcaa gaaggccatg gagatgaggg cagggagatc 53761 ccggagtggg gagatcagat ggggctctgt gtatcatgca aaggactttg cattctgttc 53821 caagagctgg gaaggttgac ataattagga aaaaagccca gaaaagcaga ggtatccatt 53881 tttcatggta aagatgataa tttcaattaa aacacgattc ctggatatat gtaatttgta 53941 ggccaaatgg tgcccaatcc ctacctccct caccccctca cttccctatc cctaaaacct 54001 gtacctcaac tcccgttcgt aagtgatggg agttaggaat agagaaatct cccggttggg 54061 ttttctgagc aaagaggtaa catagcagct ctgttatttc tttcacgtct ccaagggaac 54121 catgactcac ccttagctat cccccgggaa tgtggccctc agagtgttct tttactgatt 54181 cgtgattttg ttatgtacac ctggagtgat ggaacatacc ataccagctt gtcagggttg 54241 ctttgtgcaa agatcgatga cgtgtgtgaa cccggatcca tgcttggggt cctgagtttc 54301 aggtgccatg gccagttgct agcaggttgt atgtgtgtga ccagccccta tgtgagtctc 54361 tcagaccctg aaactccaaa caggcttccc tgggcagaga cattctgtcc atgctctgtg 54421 gcttgctgct cgagagggat agatcacatc ctgtgtggct tcttcttaaa tgaagaagga 54481 cattggaagc ctgtgctggg cttctctgga ccccccgatg tatatgtatg tatattaaag 54541 agagaccagg gtctcactct gttggccagg ctggtcttga actgctagcc tcaagaaatc 54601 ctcccgcttt ggcctcccaa agtgctggga ttacaggcat gagtcaccat gcctgatgta 54661 tatatttttc cagctccctt cttttctgta tcatttgcta ttactacctc ttagctatta 54721 gtataaactg atcttgagtt gtgtaaatct ttctggtgat tcactgtgat gggatgattg 54781 tgtcctctca aaattcctat gttggagtcc tgacccatgg tacctcagaa agtgactgta 54841 tttgaagata ggtctttaaa gaggtcattg taaattaatt aataaggtca ttagggtgga 54901 ctctaatccg atatgactgg tatccttata agaaaaggaa attagcacac agacacacaa 54961 tcagagggag aagacagcca gtcatctaca agccgaggag acagacctca gaagaaacca 55021 accctgcctg caccttgatc ttggacttct agtcgccaga actgtgagaa aacaaatctc 55081 atgtttaagc cagaacctag cacgtggtac ttgttaaggc atccctagaa aactaataca 55141 ctcactgaat gaggcaggta gctgtttctt ttattttttg agacagagtc tcactttgtc 55201 tccaaggcca gagtgcagtg gagcgatcac agctcactgc agcccctgcc ttccaggctc 55261 aagccatcct cccacctcag cttctcaagt agctgggact acaggcatgc accaccacgc 55321 ccagctaatt tttgtatttt tttttttttt ttgtagagac ggggttcacc gtgttgccta 55381 ggctggtctc aaacccctga gctcaagcaa tctgccctcc ttggcctccc aaagtgttgg 55441 atttacaggc gtgagccact gtgcctggat atggtaactt tttcatatgc tatttgcttg 55501 atgattattt ttctgtttct gatataatgc tttttattag agagttatct gtttgttttt 55561 attttttaat gtttgaattt aaaaaattag tataatttgc ataattgaaa aattatattt 55621 gaataattga aatatatttg tataacctta aatttaaaaa ctatgatagc gtatacagtg 55681 aaattttcct ctcatccctt ttttccattt aaccagtgca cttcccaaca gccaacagat 55741 aattttagtt tcctcactcc ctgagctatt ttatgtatat gcaagtagat atgtacatac 55801 atatttctgc cttgtaacac aaatagtagc atactataca actgctctgc ttcttccttt 55861 ttttagctaa gaatattaaa agagtgaaaa agatgtacgc taacaaaaat caaaagaaaa 55921 ctagagtgac attataagaa ctgatgatgt agatttcaga gcaatgatta ctgctaggaa 55981 aaaagggtca ttttacattg atcaaagagg tcaactcatc aggaagacat aataatccta 56041 aacacttatg tacttaacag agcatcaaaa tacatgaagc ataaatgaaa gaaccgtggg 56101 agaaagtaga caaattaatg actgtagttg aagatttcag tatccctcta tgaaaatcag 56161 ggtagtacaa gtacacagaa aattggtaaa gatatatgac ttgaacaaca ttatcaacca 56221 aattgacctc atttacattt gtggaatgtt ccaactaaga acgtcagaaa acatactctt 56281 ttcaagtgca catggaacat ttaccaagat agacaatatt ttgggtcacc gcaagtctca 56341 acacattgaa aggattcaga tcatataaag tatgctccat gaccatgatg gaattgaatt 56401 agaaaccaat aatgtatctc tggaaaatac acaaatattt ggaaattaat atgcccttct 56461 aaaaaattta tgcatcaaga agaaatcaaa aagggatatt tgaaaagtac tatgaaactg 56521 atggccaggc atggtgctca tcgcctgtaa tcccagcact ttgggaggcc gagaaagatg 56581 gatgaagtca ggagttcaag accagcctgg gcaacatggc agaaccccgt ctctactaaa 56641 aatacaaaaa attagccggg cgtggtggtg ggcgcctgta atcccagcag tccacgtgtc 56701 gccgcccctg gtgatggacc agcggggctt cgacga // LOCUS VSVGLYPA 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 84-PN-H) glycoprotein gene, complete cds. ACCESSION M35207 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-PN-H, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 482 a 364 c 364 g 447 t ORIGIN 1 aacagagatc aatctgtttc ctcgacacca tgaagtgcct tttgtactta gctctgttat 61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggtaattgga 121 aaaatgttcc ttccagttac cattactgcc catcaagctc agatttgaat tggcataatg 181 gcctaatagg cacagcctta caggtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac 301 cgaaatatat aacacattcc atccggtcct tcacaccatc cgtggagcaa tgcaaggaaa 361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactccc catcatgtac 481 ttgtcgatga gtacacggga gaatgggttg attcccaatt catcaacggg aaatgcagca 541 atgacatctg ccctactgtg cataactcca caacctggca ttccgattac aaagttaaag 601 ggctgtgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggggagc 661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgct tacgaaactg 721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg 781 tttggttcga aatggcagac aaagatctgt ttgctgcagc aaaattccca gaatgtccag 841 aagggtcaag catatctgct ccatctcaga catcagtgga cgtgagtctc atccaggatg 901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc 961 ttcccatctc tccggtagac ctaagctatc ttgctcctaa aaatccagga accggcccag 1021 ctttcacaat catcaatggt actctgaaat attttgagac caggtacatc agagtcgata 1081 ttgctgctcc aattctccca agaatggttg gaacaatcag tggaactacc acagaaagag 1141 aactgtggga agactgggca ccatatgaag atgtggaaat cgggcctaat ggagttctga 1201 aaaccagttc aggatataaa tttcctttat acatgattgg acatggcatg ctggactctg 1261 atctccatct cagctcgaag gctcaagtgt ttgaacaccc tcacattcag gatgctgcat 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttgtcc aaaaatccaa 1381 tcgagctcgt agaagggtgg ttcagcggtt ggaagagctc cattgcttct tttttcttta 1441 tcataggttt gatcattgga ttattcttgg tcctccgagt tggtatctat ctttgcatca 1501 agttaaaaca gaccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaaccaga atcggacacg acagagtaat catatttaaa ccaaatcgac ttgtgatatc 1621 atgctccaag aggctttaat tatgtttcaa ttttatg // LOCUS VSVGLYPB 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-CR-B1) glycoprotein gene, complete cds. ACCESSION M35208 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-CR-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 487 a 358 c 366 g 446 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagtcttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPC 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-CR-B2) glycoprotein gene, complete cds. ACCESSION M35209 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-CR-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 489 a 358 c 364 g 446 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPD 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 84-CR-B) glycoprotein gene, complete cds. ACCESSION M35210 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-CR-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 488 a 357 c 365 g 447 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 gattgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPE 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-OA-B) glycoprotein gene, complete cds. ACCESSION M35211 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-OA-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 476 a 364 c 358 g 467 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga 1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcgtct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta 1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta 1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg // LOCUS VSVGLYPF 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-VC-B) glycoprotein gene, complete cds. ACCESSION M35212 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-VC-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 476 a 362 c 358 g 469 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccttg caagttaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga 1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcgtct tagctcaaag gctcaggtat ttgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta 1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta 1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg // LOCUS VSVGLYPG 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 84-PN-B1) glycoprotein gene, complete cds. ACCESSION M35213 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-PN-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 488 a 356 c 366 g 447 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttctggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcggg tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccga 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctatgcatca 1501 acttacggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgtttgaa ttgtatg // LOCUS VSVGLYPH 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain Mudd-Summers) glycoprotein gene, complete cds. ACCESSION M35214 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain Mudd-Summers, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 477 a 366 c 353 g 469 t ORIGIN 1 aacagagatc gatctgtttc cttgacacca tgaagtgcct tttgtactta gcttttttat 61 tcatcggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttagtagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtacggac 301 cgaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag cgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttccgactat aaggtcaaag 601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaagaag ggcacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg 1021 tctttaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg 1141 tactgtggga tgactgggct ccatatgaag acgtgggaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt ttgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatggtgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagtttgt agaaggttgg ttcagtagtt ggaagagctc tattgcctct tttttcttta 1441 ccatagggtt aatcattgga ctattcttgg ttctccgagt tggtatttat ctttgcatta 1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 cgtaactcaa atcctgcaca acagattctt catgtttgaa ccaaatcaac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttaattt ttatg // LOCUS VSVGLYPI 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 86-DF-P) glycoprotein gene, complete cds. ACCESSION M35215 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-DF-P, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 477 a 363 c 357 g 468 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga 1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctct tttttcttta 1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta 1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg // LOCUS VSVGLYPJ 1788 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 82-HD-B) glycoprotein gene, complete cds. ACCESSION M35216 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 82-HD-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1788) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 508 a 362 c 344 g 574 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat 1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg // LOCUS VSVGLYPK 1788 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 83-HD-B1) glycoprotein gene, complete cds. ACCESSION M35217 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 83-HD-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1788) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 508 a 362 c 344 g 574 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat 1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg // LOCUS VSVGLYPL 1815 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 83-HD-B2) glycoprotein gene, complete cds. ACCESSION M35218 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 83-HD-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1815) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 512 a 358 c 346 g 599 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gattgctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatt agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcactat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggcctaatt catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taattttatt tttttatttt tattttttat tttttatttt ttatttttta 1741 ttttttattt tttattttta ttttttatct ttttattttt tattttttat ttttattttt 1801 atttttattt ttatg // LOCUS VSVGLYPM 1665 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain San Juan 56-NM-B) glycoprotein gene, complete cds. ACCESSION M35219 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain San Juan 56-NM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 484 a 364 c 355 g 462 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttctaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccata caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 cgaagtatat aacacagtcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gccgaagcag tgattgtcca ggtgactcct caccatgtgc 481 tggttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca 541 attacatatg ccccactgtc cataactcta caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct gggaaaggag ggcacagggt tcagaagtaa ctactttgct tatgaaactg 721 gaggcaaggc ctgcaaaatg caatactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtcta attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttccaatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg 1021 ctttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat acatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcaa gacgctgctt 1321 cgcaacttcc tgatgatgag agtttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtagtt ggaaaagctc tattgcctct tttttcttta 1441 tcatagggtt aatcattgga ctattcttgg ttctccgagt tggtatccat ctttgcatta 1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgga ccaaatcaac ttgtgatacc 1621 atgctcaaag aggcctcaat tatatttgag tttttaattt ttatg // LOCUS VSVGLYPN 1812 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-ES-B1) glycoprotein gene, complete cds. ACCESSION M35220 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-ES-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1812) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 512 a 362 c 347 g 591 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt 1741 tttatttttt attttttatt tttatttttt atctttttat ttcttatttt tatttttatt 1801 tttattttta tg // LOCUS VSVGLYPO 1861 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 86-ES-B2) glycoprotein gene, complete cds. ACCESSION M35221 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-ES-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1861) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 520 a 363 c 347 g 631 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt 1741 ttttattttt tattttttta ttttttattt tttatttttt atttttattt tttatttttt 1801 tattttttta ttttttattt ttatttttat ttttattttt atttttattt ttatttttat 1861 g // LOCUS VSVGLYPP 1802 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain Mudd-Summers 84-GM-B) glycoprotein gene, complete cds. ACCESSION M35222 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain Mudd-Summers 84-GM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1802) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 506 a 362 c 348 g 586 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat 61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta 1741 ttttttattt ttatttttta tctctttatt ttttattttt atttttattt ttatttttta 1801 tg // LOCUS VSVGLYPQ 1868 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-ES-B) glycoprotein gene, complete cds. ACCESSION M35223 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-ES-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1868) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 520 a 362 c 347 g 639 t ORIGIN 1 aacagagatc aatctgtttt cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat tacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct catcatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catatttcaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt 1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt 1801 atttttttat ttttttattt tttattttta tttttatttt tatttttatt tttattttta 1861 tttttatg // LOCUS VSVGLYPR 1860 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 86-ES-B1) glycoprotein gene, complete cds. ACCESSION M35224 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-ES-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1860) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 520 a 362 c 347 g 631 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcta caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt 1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt 1801 atttttttat tttttatttt tatttttatt tttattttta tttttatttt tatttttatg // LOCUS VSVGLYPS 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 69-PN-L) glycoprotein gene, complete cds. ACCESSION M35225 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 69-PN-L, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 487 a 358 c 364 g 448 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcattggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattattgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgccacgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 ctaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca agtgactcct catcatgtat 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggagaac 661 tctcatccct agggaaggaa ggcacgggat tcagaagcaa ttactttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgac aaagatctat ttgctgcagc aaaattcccg gagtgtcccg 841 aaggatcgag tatctccgct ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt ggagcaggtc 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctaaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccacct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg ttcagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttgttcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca ataccacaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPT 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 87-CR-B1) glycoprotein gene, complete cds. ACCESSION M35226 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-CR-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 490 a 358 c 363 g 446 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg aatcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPU 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-CR-B2) glycoprotein gene, complete cds. ACCESSION M35227 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-CR-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 486 a 357 c 367 g 447 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttgga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPV 1796 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 86-GM-B) glycoprotein gene, complete cds. ACCESSION M35228 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-GM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1796) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 503 a 362 c 349 g 582 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat 61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttcccattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat agtcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta 1741 ttttttattt tttattttta ttttttatct ttttattttt atttttattt tttatg // LOCUS VSVGLYPW 1800 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-GM-B) glycoprotein gene, complete cds. ACCESSION M35229 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-GM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1800) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 506 a 362 c 348 g 584 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat 61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttta ttatttttta ttttttattt 1741 ttatttttta tttttatttt ttcatctttt tattttttat ttttattttt attttttatg // LOCUS VSVGLYPX 1652 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 59-PN-L) glycoprotein gene, complete cds. ACCESSION M35230 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 59-PN-L, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1652) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 487 a 344 c 367 g 453 t 1 others ORIGIN 1 aacagagatc aatctgtctt cctagcatca tgaaatgnat tttgtacttc gcttttctcc 61 tggtaggagt gaactgtaaa tttacaatag ttttcccaca taatcaaaaa gggaactgga 121 aaaatgtgcc atccagttac cattattgtc catctagttc cgatcttaat tggcacaatg 181 gcctgattgg tacagcatta caagtcaaga tgcccaaaag tcacaaagca attcaggcag 241 acggctggat gtgtcatgca tccaaatggg tcaccacatg tgacttcagg tggtacggac 301 ccaaatatat tacgcattct attcggtcat tcacaccctc tgtggaacag tgcagagaaa 361 gcattgaaca gaccaaacaa ggaatctgga taaatcctgg ctttcctcct cagagttgtg 421 gatatgcaac tgtgacggac gcggaagcag taattgtcca ggtcacccca catcatgtgc 481 tcgtagatga gtacacaggc gaatgggtag attcccaatt tttgaacgga aagggcaaag 541 acgacatatg ccccacagtc cataattcga ccacatggca ttcagactac aaagtcaaag 601 ggctttgtga tgctaaccta atctcaatgg acatcacttt cttctcagaa gatggtgagc 661 tgtcctcctt gggcaaagat gggactggat tcagaagcaa ttactttgct tatgaaaatg 721 gagataaggc ttgccgcatg cagtattgca aacattgggg agtcagactc ccttctggag 781 tctggttcga gatggcagac aaagatttgt ttgcaggagc caagttcccc gagtgtccag 841 acggaaccag tatctctgct ccttctcaga catccgtaga tgtaagctta attcaggacg 901 tcgaaagaat cttggactat tccttgtgtc aagagacttg gagtaaaatc aaggcaggcc 961 ttccgatttc tccagttgat ctaagctatc tcgctccaaa aaatccaggg acaggtcccg 1021 ctttcaccat catcaatggg accttgaagt actttgagac cagatacatc agagtggaca 1081 ttgctgctcc aatcataccc agaatggtgg gaactatcag tggaacatct acagaaagag 1141 aattgtggga tgactggtac tcttatgaag atgtggagat tgggcccaat ggagtattga 1201 aaacaacctc aggatataaa tttcctcttt atatgattgg acacggaatg ctggactctg 1261 atcttcacct gacatcgaag gcccaggttt ttgaacatcc ccacattcag gacgctgcat 1321 cacagcttcc tgacggcgag actttatttt ttggtgatac cgggctctca aaaaacccca 1381 ttgaattagt agagggatgg ttcagtgggt ggaaaagctc tattgcttcg ttcttctttg 1441 tcattggatt gatcattgga ttatatttgg ttttaagggt gggaatatat ctttgtctta 1501 aactaaagca tacaaaaaag agaaaaattt acacagacat tgagatgaac agacttggaa 1561 aataaccaaa attgtgtgat ggatttgctt ttttgccaaa gaagcctgtg atatccttct 1621 cagagaagct ttaatctttt attgacaata tg // LOCUS VSVGLYPY 1657 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 84-PN-B2) glycoprotein gene, complete cds. ACCESSION M35231 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-PN-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 489 a 367 c 357 g 444 t ORIGIN 1 aacagagatc catctgtttc ctcgacacca tgaagtgcct tttgtactta gctcttttat 61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggaaattgga 121 aaaatgttcc ttccagttac cattactgcc cgtcaagttc agatttgaac tggcataacg 181 gcctaatagg cacagcccta caggtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac 301 cgaaatatat aacacattcc atccggtcct tcacaccatc tgtggagcaa tgcaaggaaa 361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtaactccc catcatgtac 481 ttgtcgatga gtacacggga gagtgggtag attcccaatt catcaacggg aaatgcagca 541 atgacatatg ccctactgtc cataactcca caacctggca ttccgactac aaagtcaaag 601 ggctgtgtga ttctaacctc atctccatgg acatcacctt cttctcagaa gatggggagc 661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgca tacgaaacag 721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg 781 tctggtttga aatggccgac aaagatctgt ttgctgcagc aaaattccca gaatgtccag 841 aagggtcaag catctctgct ccatctcaga catcagtgga cgtgagtctt attcaggacg 901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc 961 ttcccatctc tccggtagat ctaagctatc ttgctccaaa aaacccagga accgggccag 1021 cctttacaat aatcaatggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattctccca agaatggtcg gaacaatcag tggaactacc accgagagag 1141 aattgtggga cgactgggca ccatatgaag atgttgaaat cggaccaaat ggagttctga 1201 aaaccagttc aggatataaa ttccctttat atatgattgg acatggcatg ctggactctg 1261 atctccatct cagctcaaag gctcaggtat ttgaacaccc tcacattcag gatgctgcat 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaatccaa 1381 tcgagctcgt agaagggtgg ttcagtggtt ggaaaagctc cattgcttct tttttcttta 1441 tcataggttt gatcattgga ttattcttgg ttctccgagt tggtatctat ctctgcatca 1501 agttaaaaca taccaaaaaa agaaaaatct atacagacat tgagatgaat cgactcggaa 1561 agtaactgga atcagacacg acagaataat catatttaaa ccaaatcgac ttgtgatatc 1621 atgctctaag aggctttaat tatgttttaa ttttatg // LOCUS VSVGLYPZ 1812 bp ss-RNA VRL 27-JUL-1990 DEFINITION Vesicular stomatitis virus (strain 85-ES-B2) glycoprotein gene, complete cds. ACCESSION M35232 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-ES-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1812) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 513 a 360 c 346 g 593 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacttggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat aaagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt 1741 tttatttttt attttttatt tttatttttt atctttttat tttttatttt tatttttatt 1801 tttattttta tg // LOCUS CAVXDHB 8062 bp ds-DNA INV 27-JUL-1990 DEFINITION C.vicina xanthine dehydrogenase (Xdh) gene allele 1, exons 2-4. ACCESSION M30316 KEYWORDS xanthine dehydrogenase. SOURCE C.vicina third instar larvae fat body DNA (library of Schenkel et al.), clone Lcv1. ORGANISM Calliphora vicina Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Muscoidea; Calliphoridae. REFERENCE 1 (bases 1 to 8062) AUTHORS Houde,M., Tiveron,M.-C. and Bregegere,F. TITLE Divergence of the nucleotide sequences encoding xanthine dehydrogenase in Calliphora vicina and Drosophila melanogaster JOURNAL Gene 85, 391-402 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Bregegere, 01-DEC-1989. FEATURES from to/span description pept / 2538 6299 xanthine dehydrogenase, exon 2 (AA at 2538) (EC 1.1.1.204) 6367 6534 xanthine dehydrogenase, exon 3 7233 7283 xanthine dehydrogenase, exon 4 pre-msg < 1 7400 xanthine dehydrogenase mRNA and introns IVS < 1 2537 xanthine dehydrogenase intron A IVS 6300 6366 xanthine dehydrogenase intron B IVS 6535 7232 xanthine dehydrogenase intron C signal 7370 7375 poly-A signal signal 7391 7396 poly-A signal BASE COUNT 2516 a 1358 c 1473 g 2715 t ORIGIN 1 aagcttcaac ctgtaaaata aatttcaata aaatgttata aaatcttata ttttggtaaa 61 aattccattt gaataaaaca aatctagacg aaattataca tacaatcaaa tttatgcact 121 tcagatgttg aaaacagatt ggaaaattac ttattatacg tttcttagtg ttcgatttat 181 ttcgcttagc aataattgca aaatatttga ttgatttgta accataataa catcgttttt 241 aaatatgagt acttacttgt cttatcaact ttgttgttta tatatcaata tatatgtata 301 tttcattaaa aaacaataac aattttaaaa tggcaacttt ttgtgtaacc ttaatgaaaa 361 aaaaaaaata acacataaaa catcttatca atgactgctg tgagaccccc ctgtgacgta 421 cagataataa caatcttaaa actgttatac tgcacttttg taggactaat gaattagtaa 481 acaaataaat aactctgtat aaatatagaa cattttatta tccacatatt agttaaattt 541 gtcttggagg tctaatagct agtttctggg ataataaaat ctacattctt cgtttaaacc 601 tagattaagt aaactacaaa ttttatttct cactaatggc gtaaagagta tatgtaatct 661 aggtttaact gagcattatt gacgagttaa acctatgaaa aaaattattc caattgattt 721 tgtttcttca gattccattt gatttcgaca gttcccaaaa tattttataa ttctgaattt 781 ccaaatttaa tttggttttt ggtaaccccc tgtttatact tgacaaattt tgatcataat 841 aattaatgac gtttgttgcc aagacaaagt tgtcctgcaa tagcactaac aattgtgtca 901 taacgaagca ataatactaa taaatgaaga cattacctga aatttttttt atcttgacaa 961 caattttgac ttttatcatg ataaaaattt atcaggtata gataattatt taaaaatatc 1021 tgaaaaacaa ataattacaa actaataaca attcgacata taagacattc acacgatgaa 1081 ttttcagtca tgcaattgta gcaattctct tttgaatttt cttttactcc acccctcttg 1141 ccaacaaact cataaaatca cttagagtga catgacaaaa caagacatcg taaaaattcc 1201 ttattaaaaa aaaacaagtt agagagctat attcggcagt gccgaatctt atataccctt 1261 caccaaatta tacttcaaaa taaaaatttt aaatattttt aagtaaacaa attttttttt 1321 ccaaagttgt ttttttaagt ttttaggaaa tttttttttt cgaattgtta ttttaaaatt 1381 tttaaacagc tgttacgaac aaaatcaact atgtattgtg aatgaaaagt tcatgagaat 1441 attacgatag caattttccc tttgagggaa cataaatttc acgtattatt gccgataggg 1501 tctattgact agcatgtatt ttgttgttgc aagagttagg ttttggacaa cagacttagg 1561 tttttgactc gtctcgtctt ttattacaca gttttatttt atatttatta ttacagaagt 1621 ttaaaagttt ccattgtttt gccaatgtta agcattatat ccacatatac atgtattttt 1681 gttaattatt ccctatattt taatagaaaa cctggtgaaa ggtcatattt tttaaaactg 1741 cacagagaaa atgattcggt tgcagacata gacattttcg gttctatcaa cagaaaatca 1801 gttgataaag aagaatttcc gttgaagcaa ccaaactttt gttacctctt ctaaaatgtt 1861 gtagccataa ctgtaaaatt cggtcactaa gaatcattcg attggcaaca aattcggttg 1921 ctatcacgaa tctgttttct ctgtgtgaaa gtatggaata ttgttttaaa aaagtttcca 1981 ttttttcgct agtaggtctg ggttaattaa taactcgcgg cttcgtataa cgattcctgt 2041 tctacgtcgt ttacaccaac cgcatgtatt taatttgaac aaatcctggt ggagaccata 2101 aaaacccttc gatactggtg gagtatcatt tcaaactcaa atcgtaaatt ggtagagaac 2161 catataaaac tcgacggatc aactggagca gtatcattat cactcttgat tttaataaca 2221 aggtttaagc attctcactt ctatgaagac atatgtaggt catttaggct tacttaaata 2281 gacatgggat aacctggaaa atttgtgaga cgttatactc aaccaacaaa ctttaaaaca 2341 attttattgt cactctctgt ctaaagcatc gtaatctata catcacttta cttcacaaat 2401 gttacaatac tctccgtaat taatcttaaa tatcatcaat tacacatttt gtttaagtaa 2461 tttaaaaata ttaagttctg cattcttaaa atacagcttt gattcaagaa atataactat 2521 ttgtcttatt ttcctaggta attgatacca atcccgatcc agaatgtacc cttctaacat 2581 atttacgtga aaaactaaga ttatgtggca caaaattggg ctgtggcgag ggtggttgtg 2641 gtgcctgcac cgtcatgatt tctcgcattg atacgctaac gaatcggatt aaacacatag 2701 cggtgaatgc ctgcctgact ccggtatgtg ccatgcatgg tagtgccgtg actaccgtgg 2761 aaggtattgg cagtacgcga actcgattgc atccagtaca agagcgtctg gctaaagccc 2821 acggcagtca gtgtggtttt tgtacacccg gtattgtcat gtctatgtat gcactgttga 2881 gaaatttatc ccagccctcg atgaaagatt tggaaattgc tttccaggga aatttgtgca 2941 gatgcacggg ctatagaccc attttagagg gttataaaac atttactaaa gaattcggct 3001 gtgctatggg tgataaatgt tgtaaggtga atggcaataa atgtggagag ggtatggaaa 3061 atggtggtga catggtcgac gataaactat ttgagaaaag tgaatttgta cctttcgatc 3121 cctcacaaga acccatattt cctccggagc tgcagctgaa taaagattgg gactctcaaa 3181 cgttagtgta taagggcgaa cgtgctactt ggtatagacc tggcaattta gaagatttac 3241 tgaagattaa ggcccaattt ccggaagcca agttggttgt gggcaacact gaaataggcg 3301 tggaggttaa atttaaacat tttctatatc cagtcttggt aaatcccaca aaagttaagg 3361 aaatgattga tgtccaggaa ctcgaggata gtatttattt tggtgcttcc gttagtttaa 3421 tggatataga taggatttta agatccagca tagaaaagct acccgaacat caaacaagat 3481 ttttccaatg tgccgtgaat atgttgcatt attttgctgg taaacaaatt cgtaatgtgg 3541 cgtcattggg tggcaatatt atgacgggta gtccaatatc cgatatgaat ccggtgttga 3601 tggcgggtgc tgttaagttg aaagtggcca agtatgttga gggtcaaata aaatacagag 3661 aagtttgcat ggcatctggc ttctttacgg gttatcgcaa gaatgttata gaacccacag 3721 aaattttggt gggtctgtat ttccccaaga ctttggaaca tcaatatgtg gtggctttca 3781 agcaggccaa acgtagagat gatgatattg ctattgtaaa tgcagccata aatgtattca 3841 ttgatcccag gtccataact gtggataagg tgtatatggc ttttggtggt atggctccta 3901 ccactgtcct ggccacacgc actgccgaca taatggttaa acaacaatgg aataaggtgc 3961 ttatggagcg tgtggtggaa aatctgtgtg cagagttgcc tttggctccc tcagctcctg 4021 gtggcatgat tgcctatcgt cgttccctgg tggtgagctt atttttcaag gcctacttaa 4081 caataaccca acaattaatt aaatctggca ttttgcccca agattcttta ccccaagaag 4141 aactaagcgg ttctgatgtc ttccatactc ctgccttaaa aagtgctcaa ctgtttgaaa 4201 aagtttcaaa caaacaatcg gaatgtgatc ctattggcag accaaagata catgcctccg 4261 cattgaagca ggctaccgga gaagccatct actgcgatga tatgcctcgc atggaaaatg 4321 aattatattt ggcgctggtg ttaagtacaa aagctcatgc gaaaatccta agcattgatg 4381 cttcggaagc tctggcaatg cccggtgttc atgctttctt ctccagcaaa gacataaccc 4441 aacatgaaaa tgaagttggt ccagtgtttc atgatgaaga ggttttcgcc tcagatatgg 4501 tttactgtca aggtcaagtt attggagcta tagcagccga taatccaaac tttagctcaa 4561 agacggctcg taaggttaca atcgaatatg aagatataaa gccagttatt ataactattg 4621 aacaggctat cgagcataaa tcctacttcc ctgattatcc ccgttttacg gaaataggtg 4681 atgtagagaa ggctttcagt gaggccgatc atgtctatga aggttcttgt cgcatgggtg 4741 gccaggaaca tttctatttg gaaacccatg cctctttagc cgtgccccga gattcagatg 4801 aaattgaaat cttctgttca acacaacatc cctctgaggt acaaaaacta gtggcccatg 4861 tattgtcaac ttcagcacat cgagtagtgt gtcgtgccaa acgtttggga ggaggttttg 4921 gtggcaaaga gtctcgtgct atagctgttg ccttgcccgt ggctttggcc tgccatcgtc 4981 ttcgtagacc aatacgttgt atgctcgatc gtgatgaaga tatgatgatc accggtactc 5041 gtcatccctt tttatttaaa tacaaaattg cttttacctc tgaaggtcgt ttaaccggtt 5101 gctatattga gtgctacaat aatgccggtt ggtctatgga tctttccttc tccgtattgg 5161 aaagagccat gtttcatttt gaaaattgct ataaaatacc caacattaag gtgggtggtt 5221 gggtttgtaa aacaaactta ccatcgaaca cggcattccg tggctttggt ggtcctcagg 5281 gcatgtttgc gggagagcat ataatacgtg atgtggcccg catattgggc aaggactatt 5341 tggaaattat gaaacaaaat ttctataaag aaggcgatat aacgcactat caacaaaagt 5401 tggataactt tcccattgag aaatgtttct acgattgcct gcaacagtcg aactattacc 5461 agaaaaggaa agaaattgaa gagtttaatc gtaaccaccg ttggcgcaag cgtggcatat 5521 cattagttcc cacaaaatat ggcatagcct ttggtgtttc acatctcaat caagctggtg 5581 ctttaataaa tatttatgct gatggctcgg tgctgctttc gcatggtggt gtggaaattg 5641 gccaaggttt acataccaaa atgatccagt gttgtgctcg tgctttacaa atacccatag 5701 aattcattca catttccgag actgccactg ataaagtgcc aaatacatca cctacggctg 5761 ctagttcggg ttccgatcta aacggcatgg ccgttctgga tgcttgtgaa aagttaaata 5821 aacgtctggc tccgattaag gaggccaatc cgaatggctc ttggacagaa tggattaata 5881 aagcctattt cgaacgtgtc agcctgtcgg ctaccggctt ctatagaatg cctgatatcg 5941 gctatgatcc tgttcaaaat ccaaacgctc taatgtacaa ctattttacc aatggtgttg 6001 gttcgtcgat agttgaaatc gattgtctaa ctggagatca tcaggtccta agcacagata 6061 ttgttatgga tattggttcg agtttgaatc cggccattga tattggtcaa attgagggag 6121 ccttcatgca aggctacggc ctattcacat tggaagaaat gatttactcg cctcagggtg 6181 tactctattc tcgcggcccc ggcatgtaca aattacctgg ttttgctgat ataccgggag 6241 aatttaatgt gacaattttg acaggagctg ctaatccccg ggcggtttac tcctccaagg 6301 taagagataa tttgtgtttt aattgaagtt gtatgtttta ttatattgtg tttatgttta 6361 tttcaggctg ttggcgaacc tcctttgttt attggctgtt ctgtgttctt tgccattaaa 6421 gaagccatta cgtcggctcg actaatgaat ggcttgagcg aggactttaa attagaatca 6481 ccagctactt cggctcgtat acgtatggct tgtcaggatg agtttacgaa tttggtaaga 6541 aaatattaac aatttgaaat ggctcttatt aaatctttat atattgtttt aatgaagatc 6601 tagttagttt tctataggaa aatttcaatt gattcggcat tatattaaaa acgttggata 6661 caactagttt tctttaggaa atttttggat tcaacttgtt tttattagaa aattatttga 6721 ttcgattaga ttttagactt cttttatatg aaaatttgca ttcgactagt tttttacagc 6781 aagtattctt ttttataccc ttcaccttcg ggagcagggg atatataagt ttgtcattcc 6841 gtttgtaatt tccacaatat aattttccga ccctataaag tcgattaagc catgtccgtc 6901 tgtctatctg tcggcttgcc tgtctgttga aattttctga atacatttga ttaattaatt 6961 cgacatgttt tctgtggcaa acttgagatt cgactaggtt tatttcaaaa acttttttat 7021 tcaattagtt ttcattataa aatttaccat ccaattgttt ttattagaaa atttccgttt 7081 aattataagt tttcattaaa aactattcga ttcgagtagt tgtctgaaga aactgtttgt 7141 tttcctaaag aaaaccttcg attagactag ttttgattaa aaaacctttc gattcgaaca 7201 tttttaatgc caaatttttt ttatttttac agattgagca gccccctgct ggaagttatg 7261 tgccatggaa tattgtacct taaaattaat atttttttta atttattagc aaaaatttag 7321 ctagaaataa acgcaataga catttttttt gtatacaaat aaactgtgaa attaaatgaa 7381 atatttagaa aattaaatat aatactcgta tttgcattta ttacaatgaa atataaaact 7441 gattaaatgg aaaatgaaat gatgtgcaat taaatgttaa tggcagagtc tgtattaaca 7501 gaattgacct caattaaatg atcattactt tgttcattta tcacattggc attatcatct 7561 ccggacaaag tttcatacgg ttgattgtgg ctgaaaagga aatggaaaat tttaacagta 7621 tttcttcaca ccttaaagta tgctttaaaa tgtagataat aaataatttt attaaaagca 7681 ctgggagaaa acgtttaact gttaactact atctacttta ccttggtttt ttttcaattt 7741 aaacaatttt tatttgtttt ttttttaatt caacacatca actactttta tttgtttaat 7801 ttataaaatt ttttcataca aaatttgagc atgcatgcga tacttgatga cttttgtcaa 7861 cttttggcgt tgaattgtat ttaacgatat ttgagttttg aacattcgaa cagctacaaa 7921 cactttttct aaatagtttg ctaaatttag ggcaatgaca ggtaaatgaa ggacgtcttc 7981 taacgctaaa gtaagtcaaa tttcaagtaa ttaaaatatt aatacaaaaa aaaaatttga 8041 gtgtggttta aaagtaagcg ct // LOCUS CAVXDHC 3346 bp ds-DNA INV 27-JUL-1990 DEFINITION C.vicina xanthine dehydrogenase (Xdh) gene allele 2, exon 1. ACCESSION M30488 KEYWORDS xanthine dehydrogenase. SOURCE C.vicina third instar larvae fat body DNA (library of Schenkel et al.), clone Lcv2. ORGANISM Calliphora vicina Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Muscoidea; Calliphoridae. REFERENCE 1 (bases 1 to 3346) AUTHORS Houde,M., Theron,M.-C. and Bregegere,F. TITLE Divergence of the nucleotide sequences encoding xanthine dehydrogenase in Calliphora vicina and Drosophila melanogaster JOURNAL Gene 85, 391-402 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Bregegere, 01-DEC-1989. FEATURES from to/span description pept 1529 / 1609 xanthine dehydrogenase, exon 1 (AA at 1529) (EC 1.1.1.204) pre-msg 1305 > 3346 xanthine dehydrogenase mRNA and introns IVS < 1610 3346 xanthine dehydrogenase intron A BASE COUNT 1213 a 444 c 419 g 1270 t ORIGIN 1 aactgttcga ataatttgaa attgccgatt ttcaaataac aaataaaccg attaatttgt 61 gtcaattaac cgattaaccg aaattccttt tttttgcgct ttaaaatttt tttttgtatt 121 tatttttccg tttcaattca aatacaaatt aaaatttgat attttttgaa catccaataa 181 acatgattaa tgagtatgta taacaactga catgttttga aactttgttt gtatttacat 241 gtatagacta aacttttttg aattgagtct tttatcataa ctaaatgaat ctattaaatt 301 aatccaaatc taaaacaatc caaaaaagaa tattctttat catgtttttg atttctccaa 361 catataaaat cagtgagaga gttttttcca agaaaagttt tattaaatct gaagaaaaaa 421 atcgtttaaa ttatattctt tttataaaag attatttcag aagatctctt gctgcaacaa 481 cgtaataatt aaaaaaaagt cgtttccaaa aaatgtcttt gaatatttta tgacatttta 541 ctatttctta tataaatttt caacaaatca tgcgatttca gaaatataaa tagtagatgt 601 taatatcttt ctaatctgta tttaacccaa atttttactt atgaaatata cgagaaaaca 661 tgaattttaa ttttgatgtt tgcaataata aaaaacactc tcacacaaaa aataattgac 721 ttttttgaaa actatatgaa agattataga tcagcatatt ttgtattact cagttcataa 781 aacacggatt ttgagttatg acgttgttgc agcaaataca aaaagattac aaaactgatc 841 tcaaatacct tatttgctgt tttttatgtt tttgtacaca aaattcgttg ttgtattttc 901 aatttaaaat gaaaatagaa attattcggt taatcgaata attttaaatt aaccgattat 961 taaccgaata aatctaaacc tcgattaatt atttgctcga ttaaccgatt aaaccagaga 1021 cccgattaat tgaataccct acccccatct ttttatgcaa tttttgccct gtaaatgtca 1081 atgcgacaac actaccacta ctgtatattt cttattgtag cgccctcatg agtattaaga 1141 catttactct cgccctcgat tttatttgta ttattctctg gttatcatca cacttgttga 1201 aaatttatac acaaatatct tagcgactct ctaccgtagc cagccggttg tacaattaat 1261 tattaatacc agtataaata aaatttttat attcaaataa ataaataatt tgtattcgac 1321 tctcagttct ccagtttgca acgtgtgatc attgtgtcga gacgtatcgt ttaaaagtgc 1381 ttataatata cgtacaatat ttcatattgc gtttattatc gattgtttga ttaaaaaaat 1441 tgaattgaaa cgaaaataaa gacatcattg caattagtta aataaaaatt gataaaacaa 1501 gtggaattca gacagctaaa cgtttaatat gacgcaagaa cataatgcag cagtcctaga 1561 tttaaatcca acattttcta cattgatatt ttttgttaat ggcaaaaagg taattaatgt 1621 aatagtattt aaaaataatt tatgtatgaa aacagttagt ataatatttg gttaaaacat 1681 aactatacat tgtgattgag ggggggttta gggaacaaat tttattaaca cacatattta 1741 aatacacgaa aattttataa ataaaagata tttgttgatt atttattgag tcaattccct 1801 aatatttgtt gtaatcgtac tcgcattgtt attgtctgtc atacataaaa acttctgtct 1861 gtctgtctca acttctcatc aaacgagtaa tcatcatgaa cacttatacc tggtttgacg 1921 gcataaataa taaaatagag ttgcacagtg aggtcgatcg taaaacatgc taataaatct 1981 gaatcttctg acctatttcg attacaaaat gatgtataaa aaactaactt tgaaaaaata 2041 aattttagta taatttgtat gcaattaaaa tctgctcccc tgagcttgtt ttaggacggg 2101 tgattaatgt ttaaatgtca gattaatatt ctaattaata tgtttattat aatttgtaaa 2161 aaaaatttaa gaactagatg tgacacttag aatttttttt tttttaaatc actattttct 2221 tcctttgcct gaagtagggt acatatcgat tttaaaatct tatcgaaaat gaaaaaatat 2281 tacaattaag ctacattttt gtcaaaatgc atccattttt atatccttca ccatgagtgg 2341 cataagtctg tcattccgat tgtaaaatct acatttttca ttttcgaccc cacaaagaaa 2401 atatattcta gatcgttata catcaatata tcgggaattc ttccggctcg gaaactattt 2461 aaaaccgaaa aaatcggctg agatataagg aataaaccgg gacaacctcg actcttggcc 2521 tatttttgat ttatattttt ttgtcataaa ttaaaaaaag aattaaaaat aaaatttgga 2581 aaaaacttct ttaaacaaaa ttaaaaaaca agttagagag atacatatat tcggccgtgc 2641 atcttatata cccttcacca aattatactt caaaataaaa attttaaata gttttaggta 2701 aacaaaattt attttttttc caaagtagtt tttttttaat tttttggaaa aaaaaatttt 2761 cgaattgttt tttaaaattt taattttttt tttgtttttt taatatttag cgaaaaaaaa 2821 tttttttgtg aaaaaaaaat taacaaaaaa ctatttacat gtcattataa attttttgaa 2881 ataagatttt gacattctcg taaaatcagc ctaaggtgtt ccaacatttt gattaaataa 2941 tatctagttg atttttttgt tgttgttgct tttgtctatc attcaggttc agttgtaaac 3001 attaacgtgt tttgtttgcc ttatcacttg tttgcatttt tagaaaatgt gttttaatta 3061 aattagttta gaaaagtttt ttaaatggaa attattttaa gagcattgtc tggacattta 3121 tagatatata ttattgttaa ttgtattagt taaattttga ataaattgtg gaaaaatatc 3181 agttattttt tggttaaatt aaggaattag tacataaaaa cgaaagatag ctgagaattt 3241 ccatattagc gaatgcttct tggcttccga gtaaaaactc gcccgttagc gaattccatc 3301 atagcgttgt caatgattct tacagaaaca tcaagtatta aatgtt // LOCUS RATCGM1A 1104 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen (rnCGM1) gene, partial cds. ACCESSION M22226 J04626 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1104) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of the N-termianl domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 723 / 1082 carcinoembryonic antigen rnCGM1, exon x (AA at 725) pre-msg < 1 > 1104 carcinoembryonic antigen rnCGM1, mRNA and introns IVS < 1 722 carcinoembryonic antigen rnCGM1, intron y IVS 1083 > 1104 carcinoembryonic antigen rnCGM1, intron y+1 BASE COUNT 314 a 262 c 232 g 296 t ORIGIN Unreported. 1 gaattcactc ctcagctctc acagcataga tggacataca gactcctgaa ggctcttctc 61 ttccctccac actggtgtgt gtcacgtacc tgtagtgtgc acactgggac atgtaccttc 121 ccaaaccctc acgaacaata cagaaatatt aaattacact tgaatataat tatttttatg 181 tgctataaac atggaaatta tgtagacaaa cccagagata tcttttcttc cttccttcct 241 tccttcttcc ttccttcctt ccttcctctt tttccatact agtttctgag attttttgag 301 gaactgaacc ttccaaaaag accataccaa tccctgtcct caaaaagcct tttttattct 361 aatggactgg aaatcattgt atccagagga gaaagtcaat gatttagtgg aaccataaat 421 agaacagaaa acattcagga agtgaggatt gtatggagga ggaaaaagag gaggaggagg 481 aggaagagga ggaggaggag gaggaggacc gagagccggt tctccactca ccagacactt 541 tatggaaaga gtgatatggg gacacctgag tagaggattc cacagagagg aaatgacacc 601 ctttgaggtt ctgagggcat ggaggtcatg ctgctcacct ccattaaggg tgcatcctac 661 ctacaggctg agggatgctc acacctgctc aggattgtca acttttctct cttcccttct 721 agcctccctc ttaacctgct ggctcctgcc caccactgcc caagtctcca ttgaatcctt 781 accaccccag gtggttgaag gagaaaatgt tcttctacgt gttgacaatt tgccagagaa 841 tctcatagcc tttgtctggt acaaagggct gacaaacatg agcctcggag ttgcactgta 901 ttcactaacc tataacgtaa ctgtgacggg acctgtgcac agtggtagag agacattgta 961 cagcaatggg tccctgtgga tccaaaatgt cacccagaag gacacaggat tctacaccct 1021 acgaaccata agtaatcatg gagaaattgt atcaaataca tccctgcacc ttcatgtgta 1081 ctgtaagtaa ttctttgtga attc // LOCUS RATCGM1B 654 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen rnCGM2 gene, partial cds. ACCESSION M22227 J04626 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM2-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 654) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of the N-termianl domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 249 / 608 carcinoembryonic antigen rnCGM2, exon x (AA at 251) pre-msg < 1 > 654 carcinoembryonic antigen rnCGM2, mRNA and introns IVS < 1 248 carcinoembryonic antigen rnCGM2, intron y IVS 609 > 654 carcinoembryonic antigen rnCGM2, intron y+1 BASE COUNT 168 a 169 c 171 g 146 t ORIGIN Unreported. 1 aatgtaattc ttgttggaga gtgagtgggg agccatgcag acacgggagg agagagaccc 61 gtacaaaagg tcactccagc ttcgggggac tgggaacata gatgatgaag tttccctgca 121 ccaatgagag cgacgccctc accccacacc tcggcagaag atgaacacac ctacctgttc 181 cggacttggg cctcctctca gcgatcacta agcttctgac actgatggag ttttttcctt 241 ctccctagct tccttcttaa cctgctggaa tgcacccgcc gctgccgagc tcactattga 301 attagtgcca cccatggttg ctgaaggcgg aaactccgtt ttgtttgtgc atgaaatgcc 361 attgaatgtc caggcgtttt actggtacaa acagagagat ccgacgaaga gctatgaagt 421 cgcgcggtac ttaacaccca ccaacgaaag ttcgaagatg cctcagcaca gcggccggaa 481 aaccgtattc tacagtggat ccctgctgat cagaaacgtc acccaggccg acagtggagt 541 ctacacctta ctaacattta acacagaaat gcaaagcgaa ttaacacatg tgcatctgga 601 agtacgcggt aggtggttgc gggatctctg ggtgctaggg gtcggggtga gctc // LOCUS RATCGM1C 1185 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen rnCGM3 gene, partial cds. ACCESSION M22228 J04626 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM3-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1185) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of the N-terminal domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 729 / 1088 carcinoembryonic antigen rnCGM3, exon x (AA at 731) pre-msg < 1 > 1185 carcinoembryonic antigan rnCGM3, mRNA and introns IVS < 1 728 carcinoembryonic antigen rnCGM3, intron y IVS 1089 > 1185 carcinoembryonic antigen rnCGM3, intron y+1 BASE COUNT 312 a 279 c 277 g 317 t ORIGIN Unreported. 1 ctgcagtgtg cacagcaaga cattgtgctt tcccgaaccc acacgaacac actgaattat 61 taaatcacac ttgaatatat tgatttccct ttgctctgag cctgggcact atgtagataa 121 gtccatggaa atattaatct ttccttcctt ccttccttcc ttccttcctt ccttccttcc 181 ttccttcctt tcttccttct tctagttctt ttcacgtttt cccttttctt tttctctcca 241 atttgtttct aatctatttt caggaactga accttccaaa aagatgattc cagtccctgt 301 cctcacaaag cccttttctt gtggactgga agtcagagta tccagagaaa ggcaatggtt 361 taatggaacc tcaaacagaa cagaaaacaa ttctgagagt gagcattgca tgaggaagag 421 gaggaacggg aagaggagga ggaagaggag gaggaagagg aggaggaaga ggaatgggaa 481 gaggaggagg aagaggagga agaggaggaa gaggtcagac agctgcttca cctctcacca 541 gacactctat gggaagaatg atatggggac acctgagtag aggattcctg gagaggaaat 601 gacagctttt gagtctttga gggcatggag gtcatgctgc tcacctccat taagggtgca 661 tcctacctac aggctgaggg atgctcacac ctgctcagga tcggtgactt ttttctcttc 721 ccttctagcc tcacttttaa cctgctggct cctgcccacc actgcccacg tcaccctcaa 781 gtcctcaccg ccccaggtgg ttgaaggaga aaacgttctt ctaagtgctg acaatctgcc 841 agagaacatt atagctttcg cctggtacaa aggggagacc gacatgaacc gtggaattgc 901 actgtattca ctgaggtata ctgtaagttt gacggggcct gtgcacagtg gtcgagagac 961 attgtacagc gacgggtccc tgtggatcaa aaatgtcacc caggaggaca caggatttta 1021 tacctttcga atcataaata atcatggaaa aattcaatca aatacaaccc tgttccttca 1081 cgtgaaatgt aagtaactct ttgtgaactg tgggttttgg gtggtgtcct tccactagac 1141 acatagaagt atcaggccag ggctgtgtct cccttccccc tgcag // LOCUS RATCGM1D 748 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen rnCGM4 gene, partial cds. ACCESSION M22229 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM4-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 748) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of the N-terminal domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 90 / 434 carcinoembryonic antigen rnCGM4, exon x (AA at 92) pre-msg < 1 > 748 carcinoembryonic antigen rnCGM4, mRNA and introns IVS < 1 89 carcinoembryonic antigen rnCGM4, intron y IVS 435 > 748 carcinoembryonic antigen rnCGM4, intron y+1 BASE COUNT 197 a 158 c 183 g 210 t ORIGIN Unreported. 1 gagctctggg aaggcagaag tgtgattttt taaaaaacca acagatttca cctgctcaat 61 atcgatggtt gctctgtctt cccttttagc ctcccttcta acctgttggc tcctgactac 121 tgcccaggtc aacattgaat cggtgccatt caatgtggtt gaaggggaaa acgtccttct 181 tcttgtccac aatctgccag agaatctcat agcctttgcc tggtatagag ggctgaggaa 241 aattggagta tacatactga acactgaagt aagtgtgacg gggccaatgt acagcggtag 301 agagacagtg tacagcaatg gttccctgtg tatccgcaat gtcacccaga aggacacagg 361 attctacact ctacgaacag tcaacacacg tggagaaact gtatcaacaa catccttgta 421 cctctatgtg tacagtaagt gatactttgt gaactctggg tgttgtgtgg ggttcattcc 481 gtagacacac acagaagagg caggcctacc taccctttgc attgtgtctc cttattgagg 541 tgtgaacatt taactcaggc taaggagagt aatgccaatt gaatagaatc cttcttttga 601 ctttaccttg tagtcagctg gatgtgtggt taactcagtg aaggacatca gcccttgtct 661 agacttctgg ggttcttagc agtaatgtgt ccttgggaaa gaccttgagg gaaggagatt 721 gggtttgaat gagatagcca taggatcc // LOCUS RATCGM1E 750 bp ds-DNA ROD 27-JUL-1990 DEFINITION Rat carcinoembryonic antigen rnCGM5 gene, partial cds. ACCESSION M22230 KEYWORDS carcinoembryonic antigen. SOURCE Rat (strain Wistar) adult male DNA, clone lambda-rnCGM5-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 750) AUTHORS Kodelja,V., Lucas,K., Barnert,S., von Kleist,S., Thompson,J.A. and Zimmermann,W. TITLE Identification of a carcinoembryonic antigen gene family in the rat: Analysis of th N-terminal domains reveals immunoglobulin-like, hypervariable regions JOURNAL J. Biol. Chem. 264, 6906-6912 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann 12-JAN-1989. FEATURES from to/span description pept / 118 / 477 carcinoembryonic antigen rnCGM5, exon x (AA at 120) pre-msg < 1 > 750 carcnioembryonic antigen rnCGM5, mRNA and introns IVS < 1 117 carcinoembryonic antigen rnCGM5, intron y IVS 478 > 750 carcinoembryonic antigen rnCGM5, intron y+1 BASE COUNT 180 a 207 c 178 g 185 t ORIGIN Unreported. 1 agcgtaggca ggagactcca cacctcagct gaccactgga cacagctgct cggactcagg 61 caccatctta gccaaatact aaagtcctga tgttgacgga tctctcttcc cttctagcct 121 ctcttttcat ctgtgggcgt ccttttaacc ctgccaagct cactattgaa tcagtgccgc 181 ccagtgttgc tgaaggggga agcgttcttc tcctcgttca caatctccag gacgagcttc 241 gagggttttt ctggtacaaa ggggcgtcta tgtctagcaa ccatgagata gcccgataca 301 gaacagcaaa gaattcaagt gtgccaggcc ctgcccacag tggtagagag acggtgtaca 361 gcaatggatc cctcctgctc cagaatgtca cccggaatga cactgggttc tacaccctac 421 gcactctgaa aagacatcag aaaatggaat tggcacacgt gcaacttcag gtggacagta 481 agtgattttc cgtgatcgtt cagtgctggg tgggtctttg acacacagga ctgtcacccc 541 tggcatgtgg ctacctcctc tctgcctttt tatccccatg ttgtggttaa ccactatgtg 601 caggacacat gtgatggaaa gaaatgccca tgggtcagac ttatcatctg actctcccct 661 gtatcaagga cagtaactca accctaggtg ctagactctg cccagtcatc tggggcatct 721 tgccatgcaa cgtgaggaaa ccatggatcc // LOCUS RATGSTAA 1116 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Rat glutathione S-transferase mRNA, complete cds. ACCESSION M28241 KEYWORDS glutathione S-transferase. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1116) AUTHORS Hsieh,J.-C., Liu,L.-F., Chen,W.-L. and Tam,M.F. TITLE Expression of Yb1 glutathione S-transferase using a Baculovirus expression system JOURNAL Biochem. Biophys. Res. Commun. 162, 1147-1154 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by J.-C.Hsieh, 21-SEP-1989. FEATURES from to/span description pept 103 753 glutathione S-transferase (EC 2.5.1.18) BASE COUNT 268 a 334 c 246 g 268 t ORIGIN 1 tttgtccggc ccacgtttct ctggtaatct gtataaatcg caacacacag gtcaattctg 61 ctgaagccaa attgagaaga ccacagcgcc agaaccatgc ctatgatact gggatactgg 121 aacgtccgcg ggctgacaca cccgatccgc ctgctcctgg aatacacaga ctcaagctat 181 gaggagaaga gatacgccat gggcgacgct cccgactatg acagaagcca gtggctgaat 241 gagaagttca aactgggcct ggacttcccc aatctgccct acttaattga tggatcgcgc 301 aagattaccc agagcaatgc cataatgcgc taccttgccc gcaagcacca cctgtgtgga 361 gagacagagg aggagcggat tcgtgcagac attgtggaga accaggtcat ggacaaccgc 421 atgcagctca tcatgctttg ttacaacccc gactttgaga agcagaagcc agagttcttg 481 aagaccatcc ctgagaagat gaagctctac tctgagttcc tgggcaagcg accatggttt 541 gcaggggaca aggtcaccta tgtggatttc cttgcttatg acattcttga ccagtaccac 601 atttttgagc ccaagtgcct ggacgccttc ccaaacctga aggacttcct ggcccgcttc 661 gagggcctga agaagatctc tgcctacatg aagagcagcc gctacctctc aacacctata 721 ttttcgaagt tggcccaatg gagtaacaag taggcccttg ctacactggc actcacagag 781 aggacctgtc cacattggat cctgcaggca ccctggcctt ctgcactgtg gttctctctc 841 cttcctgctc ccttctccag ctttgctagc cccatctcct caacctcacc ccagtcatgc 901 ccacatagtc ttcattctcc ccactttctt tcatagtggt ccccttcttt attgacacct 961 taacacaacc tcacagtcct tttctgtgat ttgaggtctg ccctgaactc agtctcccta 1021 gacttacccc aaatgtaaca ctgtctcagt gccagcctgt tcctggtggg ggagctgccc 1081 caggcctgtc tcatctttaa taaagcctga aacacc // LOCUS MUSTCGXI 354 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor gamma-chain mRNA V4-J1 region, 3' end. ACCESSION M30171 KEYWORDS T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 342 T-cell receptor beta-chain V4-J1 region (AA at 1) recomb 293 294 V-region DNA end/J-region DNA start BASE COUNT 118 a 72 c 69 g 95 t ORIGIN 1 aagttggagc aacctgaaat atcaatttcc agaccaagag atgagactgc acaaatatcc 61 tgtaaagttt tcatcgaaag ctttaggagt gtaaccatac actggtaccg gcagaaacca 121 aaccaaggtt tagagtttct attatatgtc cttgcaaccc ctacccatgt tttcttagat 181 aaggagtaca agaaaatgga ggcaagtaaa aatcctagtg cttctacatc gatattgaca 241 atatattcct tggaggaaga agacgaagct atctactact gttcctacgg ccaagtagct 301 caggttttca caaggtattt gcagaaggaa ctaagctcat agtaattccc tctg // LOCUS MUSTCGXJ 353 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor beta-chain mRNA V2-J2 region, 3' end. ACCESSION M30172 KEYWORDS T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 353) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 297 T-cell receptor beta-chain V2-J2 region (AA at 1) recomb 294 295 V-region DNA end/J-region DNA start BASE COUNT 131 a 71 c 65 g 86 t ORIGIN 1 cagctggagc aaactgaatt atcggtcacc agagagacag atgagaatgt gcaaatatcc 61 tgtatagttt atcttccata tttctccaac acagctatac attggtaccg gcaaaaaaca 121 aatcaacagt ttgagtatct aatatatgtc gcaacaaact acaatcaacg acccttagga 181 gggaagcaca aaaaaattga agcaagtaaa gattttaaaa gttctacctc aaccttggaa 241 ataaattact tgaagaaaga agatgaagcc acctactact gtgcagtctg gatatagctc 301 gggctttcac aaggtatttg cagaaggaac aaagctcata gtaattccct ccg // LOCUS MUSTCGXK 76 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor beta-chain mRNA V5-J1 region, 3' end. ACCESSION M30173 KEYWORDS T-cell receptor beta-chain; joining exon; variable region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 76) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 21 T-cell receptor beta-chain V5-J1 region (AA at 1) recomb 18 19 V-region DNA end/J-region DNA start BASE COUNT 19 a 18 c 16 g 23 t ORIGIN 1 acgtactact gtgcctgcta gctcaggttt tcacaaggta tttgcagaag gaactagctc 61 atagtaattc cctctg // LOCUS MUSTCGXL 926 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor beta-chain mRNA C1 region, 3' end. ACCESSION M30174 KEYWORDS T-cell receptor beta-chain; constant region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 926) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 518 T-cell receptor beta-chain C1 region (AA at 3) BASE COUNT 267 a 231 c 169 g 259 t ORIGIN 1 acaaaagact tgatgcagac atttccccca agcccactat tttccttcct tctgttgctg 61 aaacaaatct ccataagact gggacatacc tttgtctcct tgaaaagttc tttccagatg 121 ttataagggt gtattggaaa gaaaaggatg gcaatactat cctggactcc caggaagggg 181 atacgctgaa gactaacgac acatacatga agtttagctg gcttacagtg cccgaaaggg 241 caatggggaa agagcacaga tgtattgtca aacatgagaa caacaaagga ggagcagatc 301 aagagatttt ctttccttca ataaagaaag ttgctgtgag taccaagcct acaacttgct 361 ggcaagataa aaatgatgtg ctgcagctcc agttcacgat cacctctgcc tactacacct 421 accttctcct gctcctcaag agtgtgatct acttggccat catcagcttc tctctgctta 481 gaagaacatc tgtctgtggc aatgagaaga aatcctaaag aaacgagtgg tggtacagca 541 agtcagctgg atttcatcct cactgccata aaggtgcctt aacgaggtcc agctgccttc 601 acttgttggc tttcacttct ataaagtccc tcactcatgt tgcataaaca ttttctgaac 661 gtttgtgtgc aatttctgca acttctctct aactcacatc acctcattcc tgattccatc 721 cactgcagaa gtcccctccc ccagaagtct gaagcatcaa aattctagta cccatagcct 781 acagctttta cccatagccc ttgttgctct tgaaccagct aactccatga agacccccac 841 tctagacaac ctgaagagcc ttctttactg tttgattctg tgccttgtat ttcacattat 901 ggacaatata ttttttaaaa attaaa // LOCUS MUSTCGXM 913 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse T-cell receptor beta-chain mRNA C2 region, 3' end. ACCESSION M30175 KEYWORDS T-cell receptor beta-chain; constant region. SOURCE Mouse (strain BALB/cByJ) fetal thymus, cDNA to mRNA, clone FT12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 913) AUTHORS Heilig,J.S. and Tonegawa,S. TITLE Diversity of murine gamma genes and expression in fetal and adult T lymphocytes JOURNAL Nature 322, 836-840 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 503 T-cell receptor beta-chain C2 region (AA at 3) BASE COUNT 263 a 223 c 168 g 259 t ORIGIN 1 acaaaaggct tgatgcagac atttccccca agcccactat ttttcttcct tctgttgctg 61 aaacaaatct ccataagact gggacatacc tttgtctcct tgaaaagttc tttcccgatg 121 tcataagggt gtattggaaa gaaaagaatg gcaatactat cctggactcc caggaagggg 181 atacgctgaa gactaagggc acatacatga agtttagctg gcttactgtg cccgaaaggg 241 caatggggaa agagcacagt tgtattgtca aacatgagaa caacaaagga ggagcagatc 301 aagagatttt cttcccttca ataaagaaag ttgctacaac ttgctggcaa gataaaaatg 361 atgtgctgca gttccagttc acgagcacct ctgcctacta cacctacctc ctcctgctcc 421 tcaagagtgt gatctacttg gccatcatca gcttctctct gcttagaaga acatctgtct 481 gtggcaatga gaagaagtcc taaagaaacg agtggtggta cagcaagtca gctggatttc 541 atcctcactg ccataaaggt gccttaacgg ggaaccagat gccttctctt gttggctttc 601 acttctataa agtccctcac tcatgttgca taaacatttt ctgaactttt gtatgcaatt 661 tcagcaactt tttttaaact gaactcacct tcttcctgat tccatccact ccagaagtcc 721 cctcccccag aagcctgaaa cattaaaatt ctagtaccca tagcctacag cttttaccca 781 tggcccttgt tgctcttgaa ccagctaact ccatgaagac ccccactcta gacaacctga 841 agagccttct ttactgtttg attctgtgcc ttgtatttca gattattgac aataaatttt 901 ttaaaaaatt aaa // LOCUS DVUHYDAB 3180 bp ds-DNA BCT 27-JUL-1990 DEFINITION D.fructosovorans periplasmic hydrogenase (hydA and hydB) gene, complete cds. ACCESSION M35333 KEYWORDS periplasmic hydrogenase. SOURCE D.fructosovarans (strain DSM 3604) DNA, clone DHS-alpha-[pHH7]. ORGANISM Desulfovibrio fructosovorans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 3180) AUTHORS Rousset,M., Dermoun,Z., Matchikian,C.E. and Belaich,J.P. TITLE Cloning and sequencing of the locus encoding the large and small subunit genes of the periplasmic [Ni,Fe] hydrogenase Desulfovibrio fructosovorans JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Rousset, 15-JUN-1990. FEATURES from to/span description pept 301 1242 periplasmic hydrogenase-alpha subunit precursor (hydA) (EC 1.18.99,1) sigp 301 450 periplasmic hydrogenase-alpha subunit signal peptide matp 452 1239 periplasmic hydrogenase pept 1314 3008 periplasmic hydrogenase-beta subunit (hydB) mRNA 129 3060 periplasmic hydrogenase mRNA signal 97 102 -35 region signal 120 125 -10 region binding 289 293 ribosome binding site for hydA binding 1303 1307 ribosome binding site for hydB signal 3025 3060 termination signal BASE COUNT 660 a 1150 c 866 g 504 t ORIGIN 1 cgtgctgcat acgatttctt tattcatatc ctttacttcc ccggcctcat caccccggta 61 acacccttga atcacgtgaa aaatttgcca aaccccattg acgttcaaac aaactcgtga 121 ttaactgcaa aagggaaatc gaacccggcg acgcaggaac gcacgcgtcc gtcacataga 181 ggcggcggcg gccggttcgc ctgcgacggc cgcgtgccga acgggtcaac ggatgctacg 241 tggcctgggg atcatgggac gccccggacc atgcctctcg taaaacaaag gaggacgttt 301 atgaactttt ccgtgggtct tggcaggatg aatgcggaaa aacggcttgt gcaaaacggc 361 gtctcccgcc gcgacttcat gaaattttgc gccaccgtgg ccgcggccat gggcatgggc 421 ccggcgttcg cgcccaaggt cgccgaagca ttgacggcca aacaccgtcc gtcggtggtc 481 tggctgcaca acgccgagtg caccggctgc accgaagcgg cgatccggac gatcaaacct 541 tatatagacg cgctcattct cgacaccatc tccctggatt accaggagac catcatggcc 601 gcggccggcg agacgtccga ggcggccctg caccaggccc tcgaaggcaa ggacggctac 661 tacctcgtgg tcgagggcgg cctgcccacc atcgacggcg gccagtgggg catggttgcc 721 ggccatccca tgatcgagac caccaagaag gccgcggcca aggccaaggg catcatctgc 781 atccggcacc tgcctcacgg cggcgtccag aaggccaaac ccaatcccag ccaggccaag 841 ggcgtgtccg aagccctcgg cgtcaagacc atcaacatcc ccggctgccc gcccaacccc 901 atcaacttcg tgggcgccgt ggtccatgtc ctgaccaagg gcatcccgga tctcgacgag 961 aacggccgtc cgaagctctt ctacggcgag ctggtccacg acaactgtcc gcgcctgccc 1021 cacttcgagg cctccgaatt cgcgccctcc ttcgattccg aagaggccaa gaaaggcttc 1081 tgcctctacg aactcggctg caagggcccc gttacctaca acaactgccc caaggtgctg 1141 ttcaaccagg tcaactggcc cgtccaggcc ggccacccct gcctcggctg cagcgagccg 1201 gacttctggg acaccatgac gccgttctac gagcagggct aaccccctcc tttgtagcgg 1261 ccagtacccg gacaccttca acagcaccga acgtctcgtg acggaggaag catatggctg 1321 agagcaaacc cacgccacaa tccaccttca ccggccccat cgtggtcgac cccattaccc 1381 ggatcgaagg tcacttgcgg atcatggtcg aggtggaaaa cggtaaggtc aaggacgcct 1441 ggagctcctc gcagctcttc cgcggcctgg aaatcatcct caaaggccgc gatccccgcg 1501 acgcccagca cttcacccag cgcgcctgcg gcgtgtgcac gtacgtccac gccctggcct 1561 ccagccgctg cgtcgatgac gccgtcaagg tcagcatccc ggccaacgcc cgcatgatgc 1621 gcaacctggt catggcctcc cagtatctcc atgaccacct cgtccacttc tatcacctgc 1681 acgccctcga ctgggtcgac gtgaccgccg ccctcaaggc cgatcccaac aaggccgcca 1741 aactggcagc ctccatcgac acggctcgga ccggcaactc ggaaaaggcc ctcaaggcag 1801 tccaggacaa gctgaaagcc ttcgtcgagt ccggacagct cggcatcttc accaacgcct 1861 acttcctcgg cggccacaaa gcctactacc tgccgcccga ggtcaacctc atcgccaccg 1921 cccactacct ggaagccctg cacatgcagg tcaaggcggc cagcgccatg gccatcctcg 1981 gcggcaagaa cccccacacc cagttcaccg tcgtgggcgg ctgctccaac taccagggcc 2041 tgaccaagga cccgctggcc aactacctgg ccctgagcaa ggaagtctgc cagttcgtca 2101 acgagtgcta catccctgac ctgctggccg tggccggctt ctacaaggac tggggcggca 2161 tcggcggcac cagcaactac ctggccttcg gcgagttcgc caccgacgac agctcccccg 2221 agaaacacct ggccacctcg cagtttccgt ccggcgtcat caccggccgc gacctcggca 2281 aggtggataa cgtggacctc ggcgccatct acgaagacgt caagtactcc tggtacgccc 2341 ccggcggcga cggcaagcac ccctacgact gcgtcaccga tcccaagtac accaagctcg 2401 acgacaagga ccactactcc tggatgaagg ccccccgcta caagggcaag gccatggaag 2461 tcggtccctt ggcccgcacc ttcatcgcct acgccaaggg gcagcccgac ttcaaaaagg 2521 tcgtggacat ggtcctcggc aaactctccg tcccggccac ggccctgcat tcgaccctcg 2581 gacgcaccgc cgcccgcggc atcgagaccg ccatcgtctg cgccaacatg gagaagtgga 2641 tcaaggaaat ggccgacagc ggcgccaagg acaacaccct gtgcgccaag tgggagatgc 2701 ccgaggagtc caagggcgtc ggcctggccg atgctccccg cggctccctg tcccactgga 2761 tccgcatcaa gggcaagaag atcgacaact tccagctggt tgtcccctcg acctggaacc 2821 tcggtccccg ggggccccag ggcgacaaga gcccggtgga agaggccctt atcggcacgc 2881 ccatcgccga tcccaaacgc ccggtcgaaa tcctgcgcac ggtccacgcc ttcgacccct 2941 gcatcgcctg cggcgtgcac gtcatcgagc ccgagaccaa cgagatcctc aagttcaagg 3001 tttgctaaga gcggcaaccc tctaatacga aggacccgct tcgggccggg ttcttcgttt 3061 ctcaaggagg tcgcatgtcc gataccccgc gcaaaatcct catcctcggc ctcggcaaca 3121 tcctctacac cgacgagggc gtcggcgtgc gggccgtgga gcgccttctc gaaacccacg // LOCUS MTFRECA 1349 bp ds-DNA BCT 27-JUL-1990 DEFINITION M.flagellatum recA gene, complete cds. ACCESSION M35325 KEYWORDS co-protease; recombinase. SOURCE M.flagellatum (strain MFK1) DNA. ORGANISM Methylobacillus flagellatum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Methylococcaceae. REFERENCE 1 (bases 1 to 1349) AUTHORS Gomelsky,M., Bolstin,A. and Tsygankov,Y.D. TITLE The nucleotide sequence and regulation of recA gene from Methylobacillus flagellatum JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.V.Gomelsky, 19-JUN-1990. Author address:M.V.Gomelsky Institute of Genetics and Section of Industrial Microorganisms 1st Dorozhry pr-d,1 Moscow 113545 U.S.S.R. FEATURES from to/span description pept 293 1327 RecA protein site 1 292 LexA repressor binding site BASE COUNT 320 a 413 c 386 g 230 t ORIGIN 1 aggagatggc cctgggcgcg ctggcctaca gcaccgccga cattagtgct tcctacacgg 61 gcatcgctgg gccgatggcg gaagcgagca aaaaccggtc ggtaccgtat gctttggctg 121 ggcccatcgc gatggccgga tcatcaccca aacccaccac ttcctgggcg agagacgccg 181 tgcgcagaca gtctgtcaaa accgcattaa ccggcttgtt gcagcttact ttacccactg 241 atttatgaaa taattgcagg ctttagcagg caactcgaaa aggtattcac tgatggatga 301 aaacagaagc aaagcgctag ccgccgccct ctcgcaaatc gagaaacaat ttggcaaagg 361 ctctatcatg cgcatgggcg acaccgatgt cgctgctgac atccaggcag tttctaccgg 421 ctccctgggc ctggacatcg ccctgggtat tggcggcttg ccgcgcggcc gcatcgtcga 481 gatttacggc ccggaatcct ccggcaagac cacgctgaca ttgtctgtga tcgcgcagat 541 gcaaaagctg ggcggcacgg cagcattcat cgacgcagaa cacgcgctcg acccggtcta 601 cgcgcagaag ctcggtgtca atgtttccga tctgctcatc tcccagcccg acaccggcga 661 acaggcgctc gagatcgccg acatgctggt gcgttccggc tcggtagatg tcgtggtggt 721 cgactccgtc gctgcgctga cgcccaaggc cgaaatcgaa ggcgaaatgg gcgactcgca 781 catgggcctg caggcccgcc tgatgtcgca ggcactgcgc aagctcaccg ccaacatcaa 841 gcgcaccaac accttggtca tcttcatcaa ccagatccgc atgaagatcg gcctgatgtt 901 cggcaacccc gagaccacca ccggcggcaa tgcactcaag ttctacgcct ccgtgcgcct 961 ggacatccgc cgcaccggcg cgatcaagaa gggggacgag gtcacgggat cagaaacccg 1021 ggtcaaggta gtgaagaaca acgtggcccc tccgttcaag ctagcggaat tcgacatcct 1081 ctatggcgaa ggtatctcgc gcgagggcga aatcatcgaa ctcggcgtca accttaaatt 1141 gatcgagaag gccggtgcct ggtacagcta caagggcgag aagatcggcc agggcaagga 1201 caacgcgcgc gagttcctca gggagcaccc ggaaatcgcc aatgagatcg acgccaagat 1261 tcgcgagcat tccaatctgg ccaacgccgc gatgacgacg gcaccggatg aagaaagcga 1321 cgaataagcc taccaccatg aatgcgaga // LOCUS CHKIGLAO 329 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34804 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 2. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 329) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 329 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 329 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 65 a 107 c 82 g 75 t ORIGIN 1 ccgttttctc ccctctttcc tcctcctctc caggttccct ggtgcaggca gcactgactc 61 agccggcctc ggtgtcagca aacctgggag aaaccgtcga gatcacctgc tccgggggta 121 gtggcagcta cggctggttc cagcagaagt ctcctggcag tgcccctgtc actctgatct 181 atgctaacac caacagaccc tcagacatcc cttcacgatt ctccggttcc aaatccggct 241 ccacgggcac attaaccatc actggggtcc aagccgagga cgaggctgtc tatttctgtg 301 ggagcaggga cagcaactat gttggtata // LOCUS CHKIGLAP 332 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34805 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 36. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 332) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 332 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 332 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 70 a 110 c 79 g 73 t ORIGIN 1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc 61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccgggagta 121 gtggcagcta ctatggctgg taccagcaga aggcacctgg cagtgcccct gtcactgtga 181 tctatgacaa caccaacaga ccctcggaca tcccttcacg attctccggt tccaaatccg 241 gctccacggc cacattaacc atcactgggg tccaagccga cgacgaggct gtctatttct 301 gtgggagtac agacagcagc tatattggta ta // LOCUS CHKIGLAQ 335 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34806 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 24. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 335) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 335 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 335 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 71 a 110 c 77 g 77 t ORIGIN 1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc 61 agccggcctc ggtgtcagca aacctgggag aaaccgtcaa gatcacctgc tcagggagta 121 gcaatactta ttactattat ggctggttcc agcagaagtc tcctggcagt gcccctgtca 181 ctgtgatcta taacaacgac aagagaccct cggacatccc ttcacgattc tccggtgccc 241 tatccggctc cacagccaca ttaaccatca ctggggtcca agccgacgac gaggctgtct 301 atttctgtgg gagcaacgac agcagtactg ctgca // LOCUS CHKIGLAR 330 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34807 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 330) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 29 > 330 Ig lambda-chain V1-J region, exon N precursor (AA at 31) sigp / 29 45 Ig lambda-chain V1-J region, exon N signal peptide (AA at 31) matp 46 > 330 Ig lambda-chain V1-J region, exon N IVS < 1 28 Ig lambda-chain V1-J region intron X BASE COUNT 71 a 101 c 80 g 78 t ORIGIN 1 ccccattttt tctcctctcc ttctccaggt tccctggtgc aggcagcgct gactcagccg 61 gcctcggtgt cagcaaacct gggagaaacc gtcgagatca cctgctccgg gggtagcagc 121 gactatggct ggtaccagca gaagtcacct ggcagtgccc ctctcactgt gatctataat 181 ggcaataaca gaccctcaaa catcccttca cgattctccg gttccctatc cggctccaca 241 aacacattaa ccatcactgg ggtccaagtc gaggacgagg ctgtctattt ctgtgggagt 301 gcagacagca gtgctggtta tgctgatata // LOCUS CHKIGLAS 332 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34808 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 126. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 332) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 332 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 332 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 70 a 106 c 79 g 77 t ORIGIN 1 ccgttttctc ccctctctcc tctccctctc caggttccct ggtgcaggca gcgctgactc 61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccgggggtt 121 acagcgacta tggctggttc cagcagaagt ctcctggcag tgcccctgtc actctgatct 181 atgctaacac caacagaccc tcgaacatcc cttcacgatt ctccggttcc aaatctggct 241 ccacaaacac attaaccatc actggggtcc aagccgagga cgaggctgtc tattactgtg 301 ggagctggga tagcagtggt aatagtggta ta // LOCUS CHKIGLAT 108 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged lambda-chain gene V1-J region, partial cds. ACCESSION M34809 KEYWORDS immunoglobulin light-chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 34. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 108) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 108 Ig lambda-chain V1-J region, exon N (AA at 1) BASE COUNT 22 a 27 c 35 g 24 t ORIGIN 1 ggatccggct ccacgggcac attaaccatc actggggtcc gagccgagga cgaggctgtc 61 tattactgtg gtggctacga cggcagcagt gcttatgctg gtgatata // LOCUS CHKIGLAU 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds. ACCESSION M34811 KEYWORDS immunoglobulin light-chain; joining exon; processed gene. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone 2. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 94 a 123 c 155 g 157 t ORIGIN 1 ggtatatttg gggccgggac aaccttgacc gtcctaggtg agtcgctgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLAV 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds. ACCESSION M34812 KEYWORDS immunoglobulin light-chain; joining exon; processed gene. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 95 a 124 c 154 g 156 t ORIGIN 1 gatatatttg gggccgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLAW 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds. ACCESSION M34813 KEYWORDS immunoglobulin light-chain; joining exon; processed gene. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 126. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 94 a 123 c 155 g 155 t 2 others ORIGIN 1 ggtanntttg gggacgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLAX 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig rearranged light-chain gene J region, partial cds. ACCESSION M34814 KEYWORDS immunoglobulin light-chain; joining exon; processed gene. SOURCE Chicken (inbred strain B19) 3 month old spleen DNA, clone 34. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 97 a 126 c 152 g 154 t ORIGIN 1 gatatattag gggccgggac aaccctgacc gtcctaggtg agtcactgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat tcttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattacttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLJA 529 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig germline light-chain gene J region, partial cds. ACCESSION M34810 KEYWORDS germline; immunoglobulin light-chain; joining exon. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone B19-Eco. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 529) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 / 37 Ig light-chain J region, exon N (AA at 3) IVS 38 > 529 IgL intron X BASE COUNT 94 a 124 c 155 g 156 t ORIGIN 1 ggtatatttg gggccgggac aaccctgacc gtcctaggtg agtcgctgac ctcgtctcgg 61 tctttcttcc cccatcgtga aattgtgaca ttttgtcgat ttttggtgat ttgggggttt 121 ttcttggact tggcggcagg ctggggtctg ccacggcgca gggccgggca ctcagcgcgg 181 cagcctgggc tgagtcttgt ccccaccgag ccggagggct ccggtgtgcg ccatggagga 241 cttagggtta ttttgtcaat ggaaagttct taaaatttga ccagaaaatg tgcccgaggt 301 ctgtctctgc cacacaattt cagaaattgt gtctaggtcg atgagaagac agtttttgtc 361 tttgtcagga aattagttgt gagttgttag tccttccctc ttagtcctaa ggactaagac 421 ctttgtcccc ggtctggtct ctcactgggg actcttggct ccagtgccat ggggagccca 481 agtgtcactg acacagtgtc cttgggggtg aaattcgagt ttttcagct // LOCUS CHKIGLVE 335 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken Ig germline lambda-chain gene V1-J region, partial cds. ACCESSION M34803 KEYWORDS germline; immunoglobulin light-chain; joining exon; lambda-immunoglobulin; variable region. SOURCE Chicken (inbred strain B19) 3 month old bursa DNA, clone B19/g1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 335) AUTHORS Parvari,R., Ziv,E., Lantner,F., Heller,D. and Schechter,I. TITLE Somatic diversification of chicken immunoglobulin light chains by point mutations JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3072-3076 (1990) STANDARD simple staff_review FEATURES from to/span description pept / 34 > 335 Ig lambda-chain V1-J region, exon N precursor (AA at 36) sigp / 34 50 Ig lambda-chain V1-J region, exon N signal peptide (AA at 36) matp 51 > 335 Ig lambda-chain V1-J region, exon N IVS < 1 33 Ig lambda-chain V1-J region intron X BASE COUNT 70 a 111 c 82 g 72 t ORIGIN 1 ccgttttctc ccctctctcc tcctcctctc caggttccct ggtgcaggca gcgctgactc 61 agccgtcctc ggtgtcagca aacccgggag aaaccgtcaa gatcacctgc tccggggata 121 ggagctacta tggctggtac cagcagaagg cacctggcag tgcccctgtc actgtgatct 181 atgctaacac caacagaccc tcggacatcc cttcacgatt ctccggttcc aaatccggct 241 ccacagccac attaaccatc actggggtcc aagccgacga cgaggctgtc tattactgtg 301 ggagtgcaga cagcagcagt actgctggtg gtata // LOCUS CH1L1CG 7502 bp ds-DNA circular BCT 27-JUL-1990 DEFINITION Plasmid pCHL1 (from C.trachomatis) virulence proteins pGP1-D, pGP2-D, pGP3-D, pGP4-D, pGP5-D, pGP6-D, pGP7-D, and pGP8-D, complete cds. ACCESSION J03321 KEYWORDS complete genome; virulence protein. SOURCE Plasmid pCHL1 (from Chlamydia trachomatis trachoma biovar, serotype D) DNA. ORGANISM Plasmid pCHL1 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 7502) AUTHORS Comanducci,M., Ricci,S., Cevenini,R. and Ratti,G. TITLE Diversity of the Chlamydia trachomatis common plasmid in biovars with different pathogenicity JOURNAL Plasmid 23, 149-154 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Ratti, 28-MAR-1990. FEATURES from to/span description pept 1480 488 (c) pGP8-D pept 1579 2934 pGP1-D pept 2928 3992 pGP2-D pept 4054 4848 pGP3-D pept 4918 5226 pGP4-D pept 5317 6048 pGP5-D (gtg start codon) pept 6045 6788 pGP6-D (gtg start codon) pept 7022 7502 pGP7-D 1 437 pGP7-D rpt 6857 6945 four tandem 22bp repeats BASE COUNT 2460 a 1285 c 1433 g 2324 t ORIGIN 1 ggatccgtaa gttagacgaa attttgtctt tgcgcacaga cgatctattt tttgcatcca 61 atcagatttc ctttcgcatt aaaaaaagac agaataaaga aaccaaaatt ctaatcacat 121 ttcctatcag cttaatggaa gagttgcaaa aatacacttg tgggagaaat gggagagtat 181 ttgtttctaa aatagggatt cctgtaacaa caagtcaggt tgcgcataat tttaggcttg 241 cagagttcca tagtgctatg aaaataaaaa ttactcccag agtacttcgt gcaagcgctt 301 tgattcattt aaagcaaata ggattaaaag atgaggaaat catgcgtatt tcctgtcttt 361 catcgagaca aagtgtgtgt tcttattgtt ctggggaaga ggtaattcct ctagtacaaa 421 cacccacaat attgtgatat aattaaaatt atattcatat tctgttgcca gaaaaaacac 481 ctttaggcta tattagagcc atcttctttg aagcgttgtc ttctcgagaa gatttatcgt 541 acgcaaatat catctttgcg gttgcgtgtc ctgtgacctt cattatgtcg gagtctgagc 601 accctaggcg tttgtactcc gtcacagcgg ttgctcgaag cacgtgcggg gttattttaa 661 aagggattgc agcttgtagt cctgcttgag agaacgtgcg ggcgatttgc cttaacccca 721 ccatttttcc ggagcgagtt acgaagacaa aacctcttcg ttgaccgatg tactcttgta 781 gaaagtgcat aaacttctga ggataagtta taataatcct cttttctgtc tgacggttct 841 taagctggga gaaagaaatg gtagcttgtt ggaaacaaat ctgactaatc tccaagctta 901 agacttcaga ggagcgttta cctccttgga gcattgtctg ggcgatcaac caatcccggg 961 cattgatttt ttttagctct tttaggaagg atgctgtttg caaactgttc atcgcatccg 1021 tttttactat ttccctggtt ttaaaaaatg ttcgactatt ttcttgttta gaaggttgcg 1081 ctatagcgac tattccttga gtcatcctgt ttaggaatct tgttaaggaa atatagcttg 1141 ctgctcgaac ttgtttagta ccttcggtcc aagaagtctt ggcagaggaa acttttttaa 1201 tcgcatctag gattagatta tgatttaaaa gggaaaactc ttgcagattc atatccaagg 1261 acaatagacc aatcttttct aaagacaaaa aagatcctcg atatgatcta caagtatgtt 1321 tgttgagtga tgcggtccaa tgcataataa cttcgaataa ggagaagctt ttcatgcgtt 1381 tccaatagga ttcttggcga atttttaaaa cttcctgata agacttttca ctatattcta 1441 acgacatttc ttgctgcaaa gataaaatcc ctttacccat gaaatccctc gtgatataac 1501 ctatccgtaa aatgtcctga ttagtgaaat aatcaggttg ttaacaggat agcacgctcg 1561 gtattttttt atataaacat gaaaactcgt tccgaaatag aaaatcgcat gcaagatatc 1621 gagtatgcgt tgttaggtaa agctctgata tttgaagact ctactgagta tattctgagg 1681 cagcttgcta attatgagtt taagtgttct catcataaaa acatattcat agtatttaaa 1741 cacttaaaag acaatggatt acctataact gtagactcgg cttgggaaga gcttttgcgg 1801 cgtcgtatca aagatatgga caaatcgtat ctcgggttaa tgttgcatga tgctttatca 1861 aatgacaagc ttagatccgt ttctcatacg gttttcctcg atgatttgag cgtgtgtagc 1921 gctgaagaaa atttgagtaa tttcattttc cgctcgttta atgagtacaa tgaaaatcca 1981 ttgcgtagat ctccgtttct attgcttgag cgtataaagg gaaggcttga tagtgctata 2041 gcaaagactt tttctattcg cagcgctaga ggccggtcta tttatgatat attctcacag 2101 tcagaaattg gagtgctggc tcgtataaaa aaaagacgag tagcgttctc tgagaatcaa 2161 aattctttct ttgatggctt cccaacagga tacaaggata ttgatgataa aggagttatc 2221 ttagctaaag gtaatttcgt gattatagca gctagaccat ctatagggaa aacagcttta 2281 gctatagaca tggcgataaa tcttgcggtt actcaacagc gtagagttgg tttcctatct 2341 ctagaaatga gcgcaggtca aattgttgag cggattattg ctaatttaac aggaatatct 2401 ggtgaaaaat tacaaagagg ggatctctct aaagaagaat tattccgagt agaagaagct 2461 ggagaaacgg ttagagaatc acatttttat atctgcagtg atagtcagta taagcttaac 2521 ttaatcgcga atcagatccg gttgctgaga aaagaagatc gagtagacgt aatatttatc 2581 gattacttgc agttgatcaa ctcatcggtt ggagaaaatc gtcaaaatga aatagcagat 2641 atatctagaa ccttaagagg tttagcctca gagctaaaca ttcctatagt ttgtttatcc 2701 caactatcta gaaaagttga ggatagagca aataaagttc ccatgctttc agatttgcga 2761 gacagcggtc aaatagagca agacgcagat gtgattttgt ttatcaatag gaaggaatcg 2821 tcttctaatt gtgagataac tgttgggaaa aatagacatg gatcggtttt ctcttcggta 2881 ttacatttcg atccaaaaat tagtaaattc tccgctatta aaaaagtatg gtaaattata 2941 gtaactgcca cttcatcaaa agtcctatcc accttgaaaa tcagaagttt ggaagaagac 3001 ctggtcaatc tattaagata tctcccaaat tggctcaaaa tgggatggta gaagttatag 3061 gtcttgattt tctttcatct cattaccatg cattagcagc tatccaaaga ttactgaccg 3121 caacgaatta caaggggaac acaaaagggg ttgttttatc cagagaatca aatagttttc 3181 aatttgaagg atggatacca agaatccgtt ttacaaaaac tgaattctta gaggcttatg 3241 gagttaagcg gtataaaaca tccagaaata agtatgagtt tagtggaaaa gaagctgaaa 3301 ctgctttaga agccttatac catttaggac atcaaccgtt tttaatagtg gcaactagaa 3361 ctcgatggac taatggaaca caaatagtag accgttacca aactctttct ccgatcatta 3421 ggatttacga aggatgggaa ggtttaactg acgaagaaaa tatagatata gacttaacac 3481 cttttaattc accacctaca cggaaacata aagggttcgt tgtagagcca tgtcctatct 3541 tggtagatca aatagaatcc tactttgtaa tcaagcctgc aaatgtatac caagaaataa 3601 aaatgcgttt cccaaatgca tcaaagtatg cttacacatt tatcgactgg gtgattacag 3661 cagctgcgaa aaagagacga aaattaacta aggataattc ttggccagaa aacttgttat 3721 taaacgttaa cgttaaaagt cttgcatata ttttaaggat gaatcggtac atctgtacaa 3781 ggaactggaa aaaaatcgag ttagctatcg ataaatgtat agaaatcgcc attcagcttg 3841 gctggttatc tagaagaaaa cgcattgaat ttctggattc ttctaaactc tctaaaaaag 3901 aaattctata tctaaataaa gagcgctttg aagaaataac taagaaatct aaagaacaaa 3961 tggaacaatt agaacaagaa tctattaatt aatagcaagc ttgaaactaa aaacctaatt 4021 tatttaaagc tcaaaataaa aaagagtttt aaaatgggaa attctggttt ttatttgtat 4081 aacactgaaa actgcgtctt tgctgataat atcaaagttg ggcaaatgac agagccgctc 4141 aaggaccagc aaataatcct tgggacaaca tcaacacctg tcgcagccaa aatgacagct 4201 tctgatggaa tatctttaac agtctccaat aattcatcaa ccaatgcttc tattacaatt 4261 ggtttggatg cggaaaaagc ttaccagctt attctagaaa agttgggaga tcaaattctt 4321 gatggaattg ctgatactat tgttgatagt acagtccaag atattttaga caaaatcaaa 4381 acagaccctt ctctaggttt gttgaaagct tttaacaact ttccaatcac taataaaatt 4441 caatgcaacg ggttattcac tcccagtaac attgaaactt tattaggagg aactgaaata 4501 ggaaaattca cagtcacacc caaaagctct gggagcatgt tcttagtctc agcagatatt 4561 attgcatcaa gaatggaagg cggcgttgtt ctagctttgg tacgagaagg tgattctaag 4621 ccctgcgcga ttagttatgg atactcatca ggcattccta atttatgtag tctaagaacc 4681 agtattacta atacaggatt gactccgaca acgtattcat tacgtgtagg cggtttagaa 4741 agcggtgtgg tatgggttaa tgccctttct aatggcaatg atattttagg aataacaaat 4801 acttctaatg tatctttttt agaggtaata cctcaaacaa acgcttaaac aatttttatt 4861 ggatttttct tataggtttt atatttagag aaaacagttc gaattacggg gtttgttatg 4921 caaaataaaa gaaaagtgag ggacgatttt attaaaattg ttaaagatgt gaaaaaagat 4981 ttccccgaat tagacctaaa aatacgagta aacaaggaaa aagtaacttt cttaaattct 5041 cccttagaac tctaccataa aagtgtctca ctaattctag gactgcttca acaaatagaa 5101 aactctttag gattattccc agactctcct gttcttgaaa aattagagga taacagttta 5161 aagctaaaaa aggctttgat tatgcttatc ttgtctagaa aagacatgtt ttccaaggct 5221 gaatagacaa cttactctaa cgttggagtt gatttgcaca ccttagtttt ttgctctttt 5281 aagggaggaa ctggaaaaac aacactttct ctaaacgtgg gatgcaactt ggcccaattt 5341 ttagggaaaa aagtgttact tgctgaccta gacccgcaat ccaatttatc ttctggattg 5401 ggggctagtg tcagaagtga ccaaaaaggc ttgcacgaca tagtatacac atcaaacgat 5461 ttaaaatcaa tcatttgcga aacaaaaaaa gatagtgtgg acctaattcc tgcatcattt 5521 tcatccgaac agtttagaga attggatatt catagaggac ctagtaacaa cttaaagtta 5581 tttctgaatg agtactgcgc tcctttttat gacatctgca taatagacac tccacctagc 5641 ctaggagggt taacgaaaga agcttttgtt gcaggagaca aattaattgc ttgtttaact 5701 ccagaacctt tttctattct agggttacaa aagatacgtg aattcttaag ttcggtcgga 5761 aaacctgaag aagaacacat tcttggaata gctttgtctt tttgggatga tcgtaactcg 5821 actaaccaaa tgtatataga cattatcgag tctatttaca aaaacaagct tttttcaaca 5881 aaaattcgtc gagatatttc tctcagccgt tctcttctta aagaagattc tgtagctaat 5941 gtctatccaa attctagggc cgcagaagat attctgaagt taacgcatga aatagcaaat 6001 attttgcata tcgaatatga acgagattac tctcagagga caacgtgaac aaactaaaaa 6061 aagaagcgga tgtctttttt aaaaaaaatc aaactgccgc ttctctagat tttaagaaga 6121 cgcttccctc cattgaacta ttctcagcaa ctttgaattc tgaggaaagt cagagtttgg 6181 atcgattatt tttatcagag tcccaaaact attcggatga agaattttat caagaagaca 6241 tcctagcggt aaaactgctt actggtcaga taaaatccat acagaagcaa cacgtacttc 6301 ttttaggaga aaaaatctat aatgctagaa aaatcctgag taaggatcac ttctcctcaa 6361 caactttttc atcttggata gagttagttt ttagaactaa gtcttctgct tacaatgctc 6421 ttgcatatta cgagcttttt ataaacctcc ccaaccaaac tctacaaaaa gagtttcaat 6481 cgatccccta taaatccgca tatattttgg ccgctagaaa aggcgattta aaaaccaagg 6541 tcgatgtgat agggaaagta tgtggaatgt cgaactcatc ggcgataagg gtgttggatc 6601 aatttcttcc ttcatctaga aacaaagacg ttagagaaac gatagataag tctgattcag 6661 agaagaatcg ccaattatct gatttcttaa tagagatact tcgcatcatg tgttccggag 6721 tttctttgtc ctcctataac gaaaatcttc tacaacagct ttttgaactt tttaagcaaa 6781 agagctgatc ctccgtcagc tcatatatat atatctatta tatatatata tttagggatt 6841 tgatttcacg agagagattt gcaactcttg gtggtagact ttgcaactct tggtggtaga 6901 ctttgcaact cttggtggta gactttgcaa ctcttggtgg tagacttggt cataatggac 6961 ttttgttaaa aaatttatta aaatcttaga gctccgattt tgaatagctt tggttaagaa 7021 aatgggctcg atggctttcc ataaaagtag attgttttta acttttgggg acgcgtcgga 7081 aatttggtta tctactttat cttatctaac tagaaaaaat tatgcgtctg ggattaactt 7141 tcttgtttct ttagagattc tggatttatc ggaaaccttg ataaaggcta tttctcttga 7201 ccacagcgaa tctttgttta aaatcaagtc tctagatgtt tttaatggaa aagttgtttc 7261 agaggcatct aaacaggcta gagcggcatg ctacatatct ttcacaaagt ttttgtatag 7321 attgaccaag ggatatatta aacccgctat tccattgaaa gattttggaa acactacatt 7381 ttttaaaatc cgagacaaaa tcaaaacaga atcgatttct aagcaggaat ggacagtttt 7441 ttttgaagcg ctccggatag tgaattatag agactattta atcggtaaat tgattgtaca 7501 ag // LOCUS MUSMIP1X 638 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse macrophage inflammatory protein 1-beta (MIP-1) mRNA, complete cds. ACCESSION M35590 KEYWORDS macrophage inflammatory protein 1-beta. SOURCE Mouse macrophage cell line RAW 264.7, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 638) AUTHORS Sherry,B., Tekamp-Olson,P., Gallegos,C., Bauer,D., Davatelis,G., Wolpe,S.D., Masiarz,F., Coit,D. and Cerami,A. TITLE Resolution of the two components of macrophage inflammatory protein 1, and cloning and characterization of one of those components, macrophage inflammatory protein 1-beta JOURNAL J. Exp. Med. 168, 2251-2259 (1988) STANDARD simple staff_review FEATURES from to/span description pept 63 341 macrophage inflammatory protein 1-beta (MIP-1) precursor sigp 63 131 macrophage inflammatory protein 1-beta signal peptide matp 132 338 macrophage inflammatory protein 1-beta mRNA < 1 638 macrophage inflammatory protein 1 mRNA BASE COUNT 125 a 175 c 139 g 199 t ORIGIN 1 gcttctgaag cttctgggcc ctgcagtccc agctctgtgc aaacctaacc ccgagcaaca 61 ccatgaagct ctgcgtgtct gccctctctc tcctcttgct cgtggctgcc ttctgtgctc 121 cagggttctc agcaccaatg ggctctgacc ctcccacttc ctgctgtttc tcttacacct 181 cccggcagct tcacagaagc tttgtgatgg attactatga gaccagcagt ctttgctcca 241 agccagctgt ggtattcctg accaaaagag gcagacagat ctgtcctaac cccagtcagc 301 cctgggtcac tgagtacatg agtcacttgg agttgaactg agcagctcca gcggcagggc 361 aggaggagcc acttcaggag aggcctcctc agccctgatg cttctcactg agaagcgtcc 421 ttgctcctca cgttcagatt tcctgcccct cttcttaatt taaatctctg tgtagacttt 481 gttttgtttt tttgggggag tattatttct attatttatg ttttagttat aggacgcgtc 541 tctcccatgg agatggtcca ccattgctgt ttctctgcta ttgtggatat gactgtgaaa 601 ttgatttcat gcattttcat aataaatctt tctttaag // LOCUS DENAAA 699 bp ss-RNA VRL 27-JUL-1990 DEFINITION Dengue virus type 2 RNA. ACCESSION M36806 KEYWORDS . SOURCE Dengue virus type 2, cDNA to viral RNA. ORGANISM Dengue virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Flaviviridae; Flavivirus (arbovirus group B). REFERENCE 1 (bases 1 to 699) AUTHORS Hahn,C.S., Hahn,Y.S., Rice,C.M., Lee,E., Dalgarno,L., Strauss,E.G. and Strauss,J.H. TITLE Conserved elements in the 3' untranslated region of flavivirus RNAs and potential cyclization sequences JOURNAL J. Mol. Biol. 198, 33-41 (1987) STANDARD simple staff_review BASE COUNT 170 a 160 c 183 g 186 t ORIGIN 1 cctccatcag agcatggctg tcttagggct actcttctgc ctagtgacat tcccaagctg 61 tgagtgtttc agggtttcag aagagggact aggacatgtc agataggaga tgtgtgactt 121 atgttgatgt tgcttgtccc caggtgtcct atcccaggtg cagctgaagc agtcaggacc 181 tggcctggtg cagccctcac agagcctgtc catcacctgc acagtctctg gtttctcatt 241 aactacctat ggtgtaaact gggttcgcca gtctccagga aagggtctgg agtggctggg 301 agtgatatgg agtggtggaa gcacagagta taatgcagct ttcatatcca gactgaacat 361 caccaaggac aattccaaga ccaagttttc tttaaaatga acagtctgca atctaatgac 421 acagccatat attactgtgc cagaagggac tacgttagta ggtacgaccc gtttgcttac 481 tggggccaag ggactctggt cactgtctct gcaggtgagt cctaacttct cccattctag 541 atgcatcttg gggggattct gagccttcag gaccaagatt ctctgcaaac gaatcaagat 601 tcaacccctt tgtaccaaaa ttgagacagg ggtctgggct agggactctc tgcctgctgg 661 tctgtggtgg cattagaaat gaagtatgat gaaggatct // LOCUS MVEAAA 558 bp ss-RNA VRL 27-JUL-1990 DEFINITION Murray Valley encephalitis virus RNA. ACCESSION M35172 KEYWORDS repetitive DNA. SOURCE Murray Valley encephalitis virus, cDNA to viral RNA. ORGANISM Murray Valley encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Flaviviridae; Flavivirus (arbovirus group B). REFERENCE 1 (bases 1 to 558) AUTHORS Hahn,C.S., Hahn,Y.S., Rice,C.M., Lee,E., Dalgarno,L., Strauss,E.G. and Strauss,J.H. TITLE Conserved elements in the 3' untranslated region of flavivirus RNAs and potential cyclization sequences JOURNAL J. Mol. Biol. 198, 33-41 (1987) STANDARD simple staff_review FEATURES from to/span description ORF < 1 30 open reading frame rpt 184 211 repeat rpt 333 359 repeat BASE COUNT 170 a 122 c 162 g 104 t ORIGIN 1 actcatgtga gtgaagatag ggtcttgtaa ataacattga tagaaaattt tgtaaatatt 61 taatgtaata tagtataggt aaaatttttt gaaattaagt aaaattaagt agcaagactt 121 gatagtcagg ccagccggtt aggctgcacc cgaaggttgg tagacggtgc tgcctgcgac 181 caaccccagg aggactgggt taccaaagct gattctccac ggttggaaag cctcccagaa 241 ccgtctcgga agaggagtcc ctgccaacaa tggagatgaa gcccgtgtca gagccgaaag 301 cgccacttcg ccgaggagtg caatctgtga ggccccagga ggactgggta aacaaagccg 361 taaggccccc gcagcccggg ccgggaggag gtgatgcaaa ccccggcgaa ggactagagg 421 ttagaggaga ccctgcggaa gaaatgagtg gcccaagctc gccgaagctg taaggcgggt 481 ggacggacta gaggttagag gagaccccac tctcaaaagc atcaaacaac agcatattga 541 cacctgggaa aagactag // LOCUS ECOSTII 806 bp ds-DNA BCT 27-JUL-1990 DEFINITION E.coli heat-stable toxin II (STII) gene, complete cds. ACCESSION M35586 KEYWORDS heat-stable toxin II. SOURCE E.coli (strain K12/711) DNA, pCHL6. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 806) AUTHORS Lee,C.H., Moseley,S.L., Moon,H.W., Whipp,S.C., Gyles,C.L. and So,M. TITLE Characterization of the gene encoding heat-stable toxin II and preliminary molecular epidemiological studies of enterotoxigenic Escherichia coli heat-stable toxin II producers JOURNAL Infect. Immun. 42, 264-268 (1983) STANDARD simple staff_review FEATURES from to/span description pept 189 40 (c) ORF 1 pept 443 658 heat-stable toxin II (put.) BASE COUNT 287 a 141 c 135 g 243 t ORIGIN 1 aagcttagta ttgttgacat gaacagcaaa tcgacttaat caaaaaacta tcagagaacg 61 ctgcattatt gattttagga ctttcaatgc aaatatcatc gaggaacgta catcattgcc 121 cacagataca acggatagtt gtatgaaagc ataaagcaaa atcaagattg caaacacaga 181 agggggcatt cctccccctc tcttttgcac ttctttccta tatttttaca agaaaagaaa 241 atgattttat atgttgaata ataaaacaat caatatgtaa atacctacaa cgggtgattg 301 acactacact cattaactat actgcaagta gcattaaaaa tcttaataaa ggagagcttc 361 gtcacatttt tttgacttga ctcatataaa ggcccactgg tataagtttt attgcttata 421 gcaataaggt tgaggtgatt ttatgaaaaa gaatatcgca tttcttcttg catctatgtt 481 cgttttttct attgctacaa atgcctatgc atctacacaa tcaaataaaa aagatctgtg 541 tgaacattat agacaaatag ccaaggaaag ttgtaaaaaa ggttttttag gggttagaga 601 tggtactgct ggagcatgct ttggcgccca aataatggtt gcagcaaaag gatgctaata 661 tatttatcaa tagcattcag caccatatac acaaaaataa tttttcataa aaagaactct 721 ataaaataaa tattttttgt gacaatgtcc taacgcaaga cggacattgt ccatttctca 781 ctgcaggctg cggctgagcg ataccc // LOCUS TRNTETE 830 bp ds-DNA BCT 27-JUL-1990 DEFINITION Transposon Tn10 (from E.coli) tetracycline resistance E (tetE) gene, complete cds. ACCESSION M34933 KEYWORDS tetracycline resistance. SOURCE Transposon Tn10 (from E.coli) DNA, clone pSL1503. ORGANISM Transposon Tn10 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 830) AUTHORS Tovar,K., Ernst,A. and Hillen,W. TITLE Identification and nucleotide sequence of the class E tet regulatory elements and operator and inducer binding of the encoded purified Tet repressor JOURNAL Mol. Gen. Genet. 215, 76-80 (1988) STANDARD simple staff_review FEATURES from to/span description pept 152 787 tetracycline resistance determinant E BASE COUNT 244 a 179 c 187 g 220 t ORIGIN 1 agttaacgtt ctctatcgat gatagggttt gaaaaataac tctatcagtg atagattgtc 61 aacagcaagt atcaattgca agagatagct actatcccaa actttttatt gagatagtca 121 ctatctaaca gttgtccaaa aggagatagt gatggcacga ctaagcttgg acgacgtaat 181 ttcaatggcg ctcaccctgc tggacagcga agggctagag ggcttgacta cgcgtaagct 241 ggcgcagtcc ctaaaaattg agcaaccgac tctgtattgg cacctgcgca acaagcagac 301 tcttatgaac atgctttcag aggcaatact ggcgaagcat cacacccgtt cagcaccgtt 361 accgactgag agttggcagc agtttctcca ggaaaatgct ctgagtttcc gtaaagcatt 421 actggtccat cgtgatggag cccgattgca tatagggacc tctcctacgc ccccccagtt 481 tgaacaagca gaggcgcaac tacgctgtct atgcgatgca gggttttcgg tcgaggaggc 541 tcttttcatt ctgcaatcta tcagccattt tacgttgggt gcagtattag aggagcaagc 601 aacaaaccag atagaaaata atcatgtgat agacgctgca ccaccattat tacaagaggc 661 atttaatatt caggcgagaa cctctgctga aatggccttc catttcgggc tgaaatcatt 721 aatatttgga ttttctgcac agttagatga aaaaaagcat acacccattg aggatggtaa 781 taaatgatgc tatctatgtg tcaactctaa tttatagtta tggatagtgt // LOCUS CLONIFC 1191 bp ds-DNA BCT 27-JUL-1990 DEFINITION C.pasteurianum nitrogenase C (nifC) gene, complete cds. ACCESSION M34365 KEYWORDS nitrogen-fixation protein; nitrogenase C. SOURCE C.pasteurianum (strain W5) DNA. ORGANISM Clostridium pasteurianum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1191) AUTHORS Wang,S.-Z., Chen,J.-S. and Johnson,J.L. TITLE A nitrogen-fixation gene (nifC) in Clostridium pasteurianum with sequence similarity to chlJ of Escherichia coli JOURNAL Biochem. Biophys. Res. Commun. 169, 1122-1128 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-Z.Wang, 18-MAY-1990. FEATURES from to/span description pept 331 1191 nitrogenase C (nifC) BASE COUNT 403 a 132 c 178 g 478 t ORIGIN 1 tttagtacat ctgaaatctg taattaagga gataataagt caaagtttaa tatatttgat 61 ttgttatttc cttaattttt tataaataac tgctatagta taactgtata aaatatggta 121 taatgtcgta gttgataata attatatttt tttatatatc attctgtgcc tataataatg 181 tatgaactaa tatttattga tacaataaat tataaaatta tataagaata tatagaatat 241 tatattgttt tatataagtt atagtatttt atattaaaca acttaattct aaatattttt 301 ttatgtattt aacaggaggt aatataatac atggaaaata ataaaaaaat attagagagt 361 tcaaaaaaac tttcttccta tggagatgga gagagcaggt tttcttttct tgaaaaaata 421 ttagctcctt tatttttagc tctaacggct atatattttg taatgcttat atttccaata 481 atttcaatga ttcggtattc aggtggttct cacattattc aaacccttta tgatcaagat 541 aatataaaga ctataatttt aagctttgta acttcactta tagcacttat atttactttt 601 atcataggaa ctccaacagc tttttgtatc aattttgtta gaaataaagt tttatctaaa 661 atattagata tatttgtaga aattcctgta gttttgcctc cagcagtagc tggtattgct 721 ttgctactag cttttggtaa aaatggtgta gtgggtaatt ttttatctaa tcatgggata 781 aatgttatat ttacctctac agctgtaata attgcacaat tttttgtatc ttctgctctt 841 tatgtaagag ttcttagaga ttctgttaaa tcagtaccta ttgaactttt tgaagtgagt 901 tatgtgctag gggcaggaaa aattgaaaca ataattaaaa ttatgattcc aatgcttaaa 961 aagtcaatcg tgtctggact tattttggct tggataagat ctcttggaga gtttggagcc 1021 actttgatgt ttgcaggaaa cattatagga aaaacaagaa ccataccact tcaaatctat 1081 acttatatgc aggatgatat taaaatggct actgcttttg ctacgatttt gtatataatg 1141 acttttgtat tgttattatt agttagactt agtattaggg atgatgatta g // LOCUS LACISTRAN 808 bp ds-DNA BCT 27-JUL-1990 DEFINITION L.lactis insertion sequence IS946 transposase gene, complete cds. ACCESSION M33868 KEYWORDS transposase. SOURCE L.lactis DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 808) AUTHORS Romero,D.A. and Klaenhammer,T.R. TITLE Characterization of gram-positive insertion sequence IS946, an Iso-ISS1 element, isolated from the conjugative Lactococcal plasmid pTR2030 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.A.Romero, 26-APR-1990. Author address: D.A.Romero North Carolina State University Dept. of Food Science Box 7624, Raleigh, NC 27695 FEATURES from to/span description pept 74 754 transposase (put.) rpt 1 18 inverted repeat rpt 791 808 inverted repeat signal 6 35 promoter region binding 62 65 ribosomal binding site BASE COUNT 281 a 137 c 158 g 232 t ORIGIN 1 ggttctgttg caaagttttc tgataagtct attttagtgt aaaatgaata aaaatgacag 61 cgaggatata tcaatgaact attttaaagg taaacaattt caaaaagatg tgattattgt 121 cgctgttggt tactacctgc gttacaatct aagctatcgt gaagttcagg aattgttata 181 tgatcgtgga ataaatgttt gtcatactac gatttatcgt tgggtgcaag aatacagtaa 241 agtcctctat catctctgga aaaagaaaaa tagacagtcc ttctattcgt ggaaaatgga 301 cgaaacctat atcaaaatca aaggtcgttg gcattatctc tatcgtgcaa ttgatgcgga 361 tggcttaacc ttagatatct ggttacgaaa gaaacgggat acgcaagcag cctatgcttt 421 cttaaaacga ctccataaac agtttggtca accaagagta attgtcacgg ataaagcgcc 481 ctctattggt tctgcattta gaaagttaca gagtaacggt ttatatacta agacagagca 541 tcgaaccgtg aagtatctca ataacctcat tgagcaagac catcgaccaa tcaaacgacg 601 caataaattt tatcgaagtc tacgaactgc ctcaaccacg attaagggca tggaaacaat 661 tcgaggaata tacaaaaaga accgaagaaa tggaacgctc ttcggatttt cggtatctac 721 tgagattaag gtcttaatgg gaatattagc ttaagaacaa gaaggattat aaaccttgta 781 tttgattttt aaactttgca acagaacc // LOCUS MUSMB1 5629 bp ds-DNA ROD 27-JUL-1990 DEFINITION Murine B cell 1 (mb-1) gene, complete cds. ACCESSION M31773 KEYWORDS murine B cell 1. SOURCE Mouse (strain Balb/c) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5629) AUTHORS Kashiwamura,S.-I., Koyama,T., Matsuo,T., Steinmetz,M., Kimoto,M. and Sakaguchi,N. TITLE Structure of the murine mb-1 gene encoding a putative sIgM-associated molecule JOURNAL J. Immunol. 145, 337-343 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Sakaguchi, 01-FEB-1990. FEATURES from to/span description pept 277 343 B cell 1, exon 1 1823 2116 B cell 1, exon 2 2308 2426 B cell 1, exon 3 4450 4518 B cell 1, exon 4 4646 4759 B cell 1, exon 5 pre-msg 268 5072 B cell 1 mRNA and introns IVS 344 1822 mb-1 intron A IVS 2117 2307 mb-1 intron B IVS 2427 4449 mb-1 intron C IVS 4519 4645 mb-1 intron D BASE COUNT 1332 a 1515 c 1400 g 1382 t ORIGIN 1 gatcctttct cagggatcca tggtgatgaa ccaggccttg aaccaccctc tccccgaccc 61 cacgcactag agagagactc aagggaattg tggccaccca ggtgcagggc agttcctctc 121 cactcagagc ccacacatat ggcaaataaa gggccaggag taagggccac tccagcccat 181 ctccggcacg gctgaacagg aagtgaggcg gagtttggag tggaatctgg tacctcaagc 241 tacggccacc caaactaacc gactcactgg gagacgatgc cagggggtct agaagccctc 301 agagccctgc ctctcctcct cttcttgtca tacgcctgtt tgggtaatgt ggccaaaggg 361 aaggtgcggg tggggcgggg gtgttgcggc tgacaggaga ggtgcagagt ggagtcagaa 421 atggccaggc agggtgatgc ataacgtggc atctcttaaa gtaacgccag aactctagga 481 aactgaggag ggaggaacat gcatttgagg ccagctggcg tacataatga cttctatcta 541 ggccaacctc gcaagttact cattaaaact ctctcaaaaa gaaaaggagt gtcgggaggg 601 aatcagggaa agcctctctc agctagctca gacagctgag taagtccact ttaggaccaa 661 cactaccttg cccacgacac acgaccttag ctcacagtca gaatgagaac tgagtaagtg 721 tgactgaaca cacaccagtg agtgtagcaa tggtgtcggt gagggaacgg gttttattgt 781 tgttgttgtt gttgttgttg ttttttcctc tgccttacat ctggaggcag gcttgccatg 841 tgaccacata gacccaggtt tgcagtggag gtcttacgag actggggccc tggtctctgt 901 gtcttaagcc tgtgtgacag gaactggctg gggctctttc ccctcccttt ctttcttcct 961 cacttccgac cttcttcttt tctgtcatgt ccacaaaagg gacctacaag ccagtgtgga 1021 gctggggagg ctgagaacag ggtggggggt ggggggggcg ggtcaagata aatacaagag 1081 cagacaccat agccaccagt ccaccccgtc tcttagcctc ctccgatgcc ccagacagaa 1141 ctcagtccca gtgcctaggt taatactggt ttctctgttt tgtttttcga gacagggatt 1201 ctgtgtgcaa ctggaacgta ctctgttaga ccagcaggcc tcaaactccc agagatctgc 1261 ctgacttctc tctcctgagt gctgggatta aaggcatgta ccagtactgc ctggcagtcg 1321 ccacatttaa attaaattac agagatgaga aacagcctag gacaaaaaaa caaaaaaaca 1381 aacaaacaaa caaacaaaaa aaacaaaaac aaaaaaaaaa acaaggccaa gcgcagtaaa 1441 ggagagattc aggaaggtgg gaattgggct ctactatctt ttccaatagc ctggttactg 1501 ctgagctgag ttttaccgct gcatctaagg ttctgcccca ccttcacccc ctccccgtgg 1561 ggttcctaag agttgtaagg aatggggaac aaatgggatg gttttcatga aagaatccct 1621 ggggtttttg gggaacatca aaatacaaag gggaaaggga agaaagaggc catagtgtgc 1681 ccccccccaa gtctcccatc atctttttct agtacattac cattggtacc ggctccaatt 1741 ctgatgtgaa ggggtctggg ctagagcaat catctccatt ctccaggcta tgacccatct 1801 gtctcctctc ctctctccac aggtcccgga tgccaggccc tgcgggtaga agggggtcca 1861 ccatccctga cggtgaactt gggcgaggag gcccgcctca cctgtgaaaa caatggcagg 1921 aaccctaata tcacatggtg gttcagcctt cagtctaaca tcacatggcc cccagtgcca 1981 ctgggtcctg gccagggtac cacaggccag ctgttcttcc ccgaagtaaa caagaacaca 2041 ggggcttgta ctgggtgcca agtgatagaa aacaacatat taaaacgctc ctgtggtact 2101 tacctccgcg tgcgcagtga gtagggaggg cgctggcctc ttgcgttccc tgctccctct 2161 ttcttccaaa acattaggag cagacgtagc tcctccctcc tggacctgcc acccagccac 2221 agagactggt tgcgagggct ccctgacctc gcaagggtct aggcgagaag aagggacaca 2281 gaatgctgag cagcacctgt cttcaagatc cagtccctag gcccttcctg gacatggggg 2341 aaggtaccaa gaaccgcatc atcacagcag aagggatcat cttgctgttc tgtgcagtgg 2401 tgccagggac gctgctgcta ttcagggtga gtctccccca agtgacaccc ttcaagtcat 2461 cttggcttac ctgggcctgg actgtgactc ccagtgtcca gctggtcctc tgaattcgag 2521 ctcgcccgga gatcttgtac tgatgccaac tgatagaaaa caacatatta aaacgctcct 2581 tgtacttacc atcccgtgcg cagtgagtag gaggcgcctg gcctccttgc tttccctgct 2641 ccctctttct tcaaaacatt agagcaagct agctcctcct cctgacctcg acagcacaga 2701 gatggtggtt caggcgcgtc cgaccctcgc aagggtctgg gctgccagaa gaagggacac 2761 cagaatgctg agcagcaccc tgtcttcaca gatccagtcc ctaggccctt cctggacatg 2821 ggggaaggta ccaagaaccg catcatcaca gcagaaggga tcatcttgct gttctgtgca 2881 gtggtgccag ggacgctgct gctattcagg gtgagtctcc cccaagtgac acccctcaag 2941 tcatcttggc ttccctgggc ctggactgtg actcccagtg tccagctggt ccttgggtct 3001 tccgaattcc aaggtttgtc cttccaattt gggaggctgg gaataaccac caaaaggatt 3061 ccaggggggt cagctgtccc cacggccctc atagaaaccc tctcacttct ggccgccaac 3121 gggagtccct gcaaatgcag ccaagacaag ggaagaaggt gtgtcctccc agagcccctg 3181 cgtctcctta ggccagaaga caacattttt caccacattg gggcttcatt attgtacgcc 3241 tggctggcct tgaactcaca gagatctgcc tatttctgcc tcccaagtgc tgctgggatt 3301 aaagggccac catgcctgcc cttcttcctt tctttaactt tttcttttcc ccaccaccta 3361 accccccaag actgggtctc tctgtgtagc cctggtgtcc tagatataag gctggctcag 3421 actcacagag atccacctgc ctctgcctcc tgagtgctgg gattacagtt gtgcacccct 3481 atcgccctgc actttattta tgtcttagtg tgtacgtggg catgttgagt gtggaggtcg 3541 aagaacaact ttcaggagtt gattctttct ccactgtgga ttcgcacaca aattcatatc 3601 ctcaggtttg caggcataca ttaccagttg agctatcatc tgacttcttt gcctgagcta 3661 tcatctgact tctttgcttt catttttatt ttagtttatt ttattttatt tttttaaagc 3721 tagatattag tctagctagc ctcaaacttg ctacatcaca tggctggcct gattcctctt 3781 taccttccaa gcactgaaat tacagggcag tggcaccacc agttttattt tgtttgtttt 3841 aatttttgtt ttgtgttttg ttttgttttt gatggggagg gttagacagc attttgctat 3901 gtaattcagg ctgggctaga actcactcta tagatcatat tggtctggaa ttctcctgcc 3961 tcagtttgcc aagcactgga attacagatc tccagcacca ctcattccct gctcgccgtt 4021 ttgcttgtca ttgaggagca gtagagagtc agcatggtgg tgcacagctt tgaccctagc 4081 agcactcagg aggcagaggc aggcggatct ctcgggagcc aaggccagcc tggtccacgt 4141 actgagttcc caggtagttc ttgctagata gagatcctca gagcatcagt ggagtggcag 4201 ggaggtgcag cagttccact tgagacaggg agattgctta aagtttgagg cccataaaaa 4261 ggaaagagag gagaggggca gagactagaa tcttgacaag atacctgatg gtgggaatct 4321 agaccttgat ggtggggagc caagtatccc tcactctttc tgataaagca cttgctagta 4381 ggagacagtc taaaaaccct tcacctccct ccttgctaaa cgtcgcttct acttcatccc 4441 cttctgtaga aacggtggca aaatgagaag tttggggtgg acatgccaga tgactatgaa 4501 gatgaaaatc tctatgaggt aagcactgtt tgcagcaggc gtcgggtgca tggggtgtgg 4561 tgtctgcctt gagctctcac cctggagttg actggactgg ggctccagct gtcagggtac 4621 tgatgtctcc tctctgaccc tgcagggcct gaaccttgat gactgttcta tgtatgagga 4681 catctccagg ggactccagg gcacctacca ggatgtgggc aacctccaca ttggagatgc 4741 ccagctggaa aagccatgac tgacatgtcc cacccttccc tgcctgccat atgtctgact 4801 ccagcatccc tgcctcccca ctcttcccct ggagtccccc cttcccagtg cttgctccag 4861 ccactcgagt gcgctaccct tgcctccagg ctgccccacc ctattctccc acctaatctt 4921 gtccttcccc aagtccagct ccctctctag tgggtaatga gcccttaatc gctgcctcta 4981 ggggagctga ttacagcatc ctcgttagtg tcaccccccc tcctcccaga tctgtcatgg 5041 ccacttaagt gataataaat ccttcccaaa gcagccttgg caggagtcat ggatctcatg 5101 tagactgctc ctcaacactg ccagggaacc aggagctaga aaacccaagt cccctccccc 5161 ctcagaccca ggattctagg ttcaagttct cacctgacca cttctagatc tacctgacaa 5221 ctcttcgaac ggtacagata gactctgcgt atgtcttcga cttcccctca acccctcccc 5281 ccgaaaataa cggtcaaaga gctctcttgg ctaatcccca ccccaattgt aaatgtctgg 5341 tttatctctt atgggcctca ccctcctctg tcccaaaaca aggggaacca gcccggtgtt 5401 aacaaatttt tacctatatt taaactctga gaacacaaga tattattttc tggagtgggg 5461 cctaattcct catgggttca cttgcataag gcagatttgg agagtggctg tcaaaaaccg 5521 gggggagggg gatgacagag agagctagag tggtgtgaca gtcattggga tgggaagagc 5581 agactcaggt tgttaaaaga caggaggcca agaggagact cagtgggga // LOCUS FRNTUL4 1233 bp ds-DNA BCT 27-JUL-1990 DEFINITION F.tularensis 17 kDa major membrane protein (TUL4) and 13-kDa protein genes, complete cds. ACCESSION M32059 KEYWORDS 13-kDa protein; 17 kDa major membrane protein. SOURCE F.tularensis (strain live vaccine) DNA. ORGANISM Francisella tularensis Prokaryota; Bacteria; Gracilicutes; Scotobacteria. REFERENCE 1 (bases 1 to 1233) AUTHORS Sjoestedt,A., Sandstroem,G., Taernvik,A. and Jaurin,B. TITLE Nucleotide sequence and T cell epitopes of a membrane protein of Francisella tularensis JOURNAL J. Immunol. 145, 311-317 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.Sjoestedt, 12-FEB-1990. FEATURES from to/span description pept 99 470 13-kDa protein pept 551 1000 17 kDa major membrane protein (TUL4) precursor sigp 551 607 17 kDa major membrane protein (TUL4) signal peptide matp 608 997 17 kDa major membrane protein (TUL4) binding 85 97 ribosome binding site binding 539 548 ribosome binding site BASE COUNT 440 a 171 c 204 g 418 t ORIGIN 1 gaattcatag tcgcctcaaa atatgtttag atatgttgtt aataagttat aatcttatat 61 atagttaact atgttattat aatcaaggtt tttttgctat gatgacttta caagaagaaa 121 aaatacaagc gccagtattt tttaaagaat atgtcaaagg taggtttata cttaatatag 181 gtgagtataa tcatccattg attttatcag caactcaagt gcttgagtat caagataaaa 241 ttgatgatat ccaaagtatc aaaaaaagtc atcttgatct tatcttagcg actaatcctg 301 aaataatact cataggtacc ggtgaaaaac aacttttgcc tccacttgag ataattaatc 361 aaatcgcaaa agctggtaaa agtgttgatt ttatggcgag tgatactgct tgtaagacat 421 ataatttgct tgttaatgaa aatcgtaatg ttagctgtat catcatttaa taaactgctg 481 tttattttat tttaattaat gttataatcg atttgagtat atgtgaatat ttaaaaatag 541 gagtatctat atgaaaaaaa taattaagct tagtctttta tctttatcaa tcgcaggttt 601 agcgagctgt tctactctag ggttaggtgg ctctgatgat gcaaaagctt cagctaaaga 661 tactgctgct gctcagacag ctactactga gcaagctgct gctgtatcta agccaactgc 721 aaaagtaagt ttaaataaac ttggtcagga taaaataaaa gcaactgtat atacagcata 781 caataataac ccacaaggaa gtgtaagatt acaatggcag gctccagaag gttctaagtg 841 ccatgataca agcttcccaa ttactaagta tgctgagaag aacgataaaa cttgggcaac 901 tgtaacagtt aagcaaggta ataacttctg tagcggtaag tggacagcta atgtagttta 961 tgacaaagaa gtaatcgctt ctgattcaat aaatatttaa tttcttcaaa tatttataat 1021 tttctagttt ttgcttttct caaaaaaaaa taacggtgtg attaattttg cttagctaga 1081 ttagttgtac ttttaacttt ttactaaaaa gatttgctat aattgcttaa atataacaaa 1141 taaaaaagta atagtttaat gaaaaacatt atcttgttat taatgatatt ctcgatgtta 1201 gttgtactta gccatgtgca ccaaccacag ctg // LOCUS TIPREGU 29802 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid Ti (from A.tumefaciens) virulence regulon (vir) genes, complete cds. ACCESSION J03320 KEYWORDS virulence regulon. SOURCE Plasmid Ti (strain C58) (from A.tumefaciens) DNA. ORGANISM Plasmid Ti Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 29802) AUTHORS Rogowsky,P.M., Powell,B.S., Shirasu,K., Lin,T.-S., Morel,P., Zyprian,E.M., Steck,T.R. and Kado,C.I. TITLE Molecular characterization of the vir regulon of Agrobacterium tumefaciens: Complete nucleotide sequence and gene organization of the 28.63-kbp regulon cloned as a single unit JOURNAL Plasmid 23, 85-106 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Kado, 02-MAR-1990. FEATURES from to/span description pept 1983 4484 virA protein pept 5029 5766 virB1 protein pept 5766 6131 virB2 protein pept 6131 6457 virB3 protein pept 6457 8823 virB4 protein pept 8840 9502 virB5 protein pept 9603 10487 virB6 protein pept 10523 10690 virB7 protein pept 10677 11390 virB8 protein pept 11387 12268 virB9 protein pept 12265 13398 virB10 protein pept 13439 14473 virB11 protein pept 14728 15453 virG protein (ttg start codon) pept 17174 17617 virD1 protein pept 17651 18994 virD2 protein pept 19009 21030 virD3 protein pept 21104 23101 virD4 protein pept 23098 25734 ORF 5 pept 26802 27053 virE1 protein pept 27169 27360 virE2 pept 27374 29044 virE3 29044 site 1983 29044 vir regulon region BASE COUNT 7085 a 7819 c 8213 g 6685 t ORIGIN 1 taccgacgat atgtgcagtc catcagcagt ataatgatta caggagacca aaaattggtt 61 tcaaatgtaa ccgaataagc ctgctgtacg taatggcaat cggtcgatat cagtcggcac 121 agccactcgt cttgctgaga tgcaatgtcg acgtgtaacc aaaatgactt tgatggaagt 181 cgactttgaa gcctgaccga caccggctaa tcgaaatctg caactgctgg aactcggaag 241 cggccaaatc cctcgaggac cgagcgccgg ccgaggtgct tcgaaagaaa tcacttactc 301 aaataagatg tgcgaatagc tttggcgcgc cgcatgtcgc gcttggaggt gacgccgacc 361 gatgctgcgc cgccgatggc catccaacac gcaggtcaaa tgatgaagcg gcgaatcacc 421 gaattcgcgt cagcgtgacc gggcaaacag gctgtcctgc gccatcttct ggccactgag 481 gaaaatcgcg ctcctgcgag agcgcatgtt caagatattc attagctatg gcctctatga 541 gctctacgtg ccgccctgca tcaatatttg gcaactggct gattgcgaga tcgtgtttgc 601 gcgcaaaacg aattgcgcag cgatatccgt cgatatcttc caaaatcggt cgagcggcag 661 ggtagttcca gagttccgcc aactcctcca acaacgaggg gcgatcttcc cggatggcaa 721 acatttccgc aacgcgttgc ttggctcggg tgagaaaggc gtccgaatcc ccaagacgta 781 gccgcttgac atgccattga aaacccgatc tccaaaacgg acttttagcc atgcaattga 841 gaagcgaaat cgacccgccc tcgagaataa gaccctcttc ggacttccgc caatccactt 901 cgaatatgag tcgacgatgg gcactctcag cgtcaaggat gccctcggtg agggggcggg 961 aatccaaata tattctccgc gttgattgca attccgattc caaaggtctt ccgctacctg 1021 tcgcgatttg aggacagcat tgcacacgat caagggcaac caccggccac ccggtttctt 1081 gtgcgatttg gatcgccatg tccgttttgc cgctgcaagt cggtccgtag atgagatgga 1141 gtaacatgtg cgcctcatat aaatttaatg ggatatccca tctgttccaa accagatttt 1201 tcaattcgga ttatgatgtt ttactgcatg atttttacca gctttacgca atctgctgaa 1261 ctaatatggc gctacatttg taatcgtaca tttttgacgt tgccggctca ttcggccatg 1321 agagtttcca tcccgttgtc ctcctgaaca attgaggagt ttggccccga cacctgctta 1381 tggtgaatgc agcggacccc gttaacgcaa cgtcgaaaag cgataggcga tttgagtagg 1441 aagccacatc ggccgcgcgg tatgctccgt cagcacctga cagttcgtgt tgcagctttg 1501 cttggtcacc tgaaacgaaa aatgccgctg aagaatattt aacggatatg cacacgaaaa 1561 agccggttgg gtctcatatg ccgctgacgt cctggcttcg tttcgaagcc gacagcctct 1621 aaggtggtac gaacacagac ctaatatccg ctagtgtccg cggcagccgc cacggcagtt 1681 ctcgctgcgc atagcatagc aacttgccct gtcgacgcat gatcagccat attgttaatg 1741 tcgacgattt catttttctc ggtgcaaacg gcgacatcct gaccatatcg atcctaccag 1801 ccaccgcgcg gagtcaaaga acacctcggc aaccggaaac atccattgga atgccgaggc 1861 tgtgtttcat ttgaaacaca ctgagtcgac gtttgttgct tcaaacccat ttacaaaccc 1921 tactgtgcgg cctaagggcc aacggggtgg gactgagcgc tggtacgagg acgtaagtgc 1981 ggatgaatgg aaggtattca ccgtctcggc aagatttcaa gacaggcgcc aagccttggt 2041 ctatcctggc cttggtagtt gctgcaatga tttttgcctt gatggcgatt acgtcttggc 2101 aggacaatga aaccaatcgg gcgatcctga cccaattgcg agctattaac atcgacagtg 2161 cttcgctgca gcgggatgta ctctccgcgg aagcgggtgt ggtggcgaac taccggccca 2221 ttatctccag gttgggagct ttgcggaaga acctggaaaa tttgaagcga ctatttaaac 2281 aatctcatct tgtgatcggc aatgatttct ctcaactgct cgacaagcta aaggtgtctg 2341 tggatacgac cgacgcggcc gttgcagcct tcggagcgca aaacgtgctc ctgcaagatt 2401 cgcttgccag cttcactcgc gcgcttagta ttcttcccaa aatgtcgtcg acggatcaga 2461 cggtcgaaaa ttcgaacgaa ttgggcagcc tgatgctgcg gtttgtgcgt cagccaagcc 2521 cagcactctc gttggagatc agtcacgaac tcgacatgct ccaaaaagct agcggtgggg 2581 ctgaagttcc tatccgtata cttgcacgcg agggtcgcgt catcttgtcg attttgcccc 2641 gtgtgaacga tgccgtaaac atgattcaga cctccgacac cgctgaaatt gccgaaagat 2701 tggagcgcaa gtgtttggag gcctatagct tgcaaagcgt gagggagcag cgggcacgga 2761 tcttcttggg ttccgtttcg gtgggccttt gcatctacat catctcactg gtctataggc 2821 tgcgtcggaa aacggcttgg ttaacgcggc gtttggatta cgaagaggta atcaaagaga 2881 ttggggtttg tttcgagggg ggaggggcca cagcgtcgtc cctcaattcg tccgcgcaag 2941 ctgccgttgg aattattcaa cgcttcttta atgcggaatc gtgtgcacta gcattggtgg 3001 accatggtga caggtgggct gtcgaaagtt tcgctgcgaa gctgcctgag cccgtctggg 3061 aggacctcgc gctacgcgag atggtttctc ttgccagagc ggatgagcgt gcgtcagtat 3121 tccgcatcat gtcgacgcga aaggtcagct gccttcctcc ggagactccg ggcgtttcta 3181 tgctgctggc acacaaatct acggatcaac tgatagcgat ttgttccctc ggttaccagg 3241 gctatcgtct gaaatcttgt ccaggcgaag ttcagcttct tgaactcgcc accgcctgcc 3301 tctgccacta tatcgatgtc cggcgtaagc agaccgaatg cgattttctg gagaggcgat 3361 tagagcatgc ggaacgtctt caggcagttg gtacacttgc tggtggaata gcgcatgagt 3421 tcaacaacat tttgggagca atcctcgggt acgctgaaat ggcgcaaaac atgctgcgtc 3481 ggtcatctgt cacccgaaga cacattgacc aaattatttc gtcgggtgac agagccaggc 3541 tcattatcga tcagatcttg actctaagtc gaaaactaga acgcgtgaca aagccgttca 3601 gtgtctccga actcgtaatg gaaattgctc ccttattgcg tgttgcattg cagcgcaaca 3661 tcgagctgaa gttcaagttt gacgacaaga agagcgtagt cgaaggaagc ccgctagagg 3721 ttcagcagat gctgatgaat ctttgcaaga acgcttccca ggcgtttacc gccgatggtc 3781 aaatcgacat catcgttagc cgaatttttg tatctcgaca gaaagtactg gcgcatggtg 3841 ttatgccagc tggcgactat gttcttcttt ctgtcagcga cgacggtgaa ggcattgccg 3901 aaactgtgct cccccacatt tttgagcctt tctttacaac acgctcttgc agcggcggca 3961 cgggtctagg tcttgctgcc gtgcacggtc atgtcagcgc gcttgcagga tatattgacg 4021 ttacttcagc cgtagggcga gggacgcgct tcgacattta tctacctcct tcctcgaaga 4081 agcccgtcag cccggacgcg ttttttgggc cctgtaaaac accgcgtggc aacggagaaa 4141 ttgtggcatt gattgagcca gatcctgtct tgcgggaggt gtacgaagac aagatcgccg 4201 ctctgggcta tgagccggtg ggctttaaga catgtgcaga cctttgcaat tggatatcaa 4261 aaggcaagca agccgatctg gttctagttg accaatcgtc tcttcccgag aatcagagtg 4321 ctactgcttt gcacgcagcc ttcaagacgg cgtccatcat cattggagga agtgatctta 4381 aaatgtcact ttccagcgat gacatgacgt ccgcgctttt tctgcccaaa cctatatcct 4441 ccaggacgat ggcctacgcg attcgtacca agatcaaagc ctagagttga ggatgttttt 4501 caggagactc gaccagggta atcagcacgt ctgttcggcg cgcctatagc tatggtttct 4561 cgcaatcgct ttgggcagtc ggttgtggcg gtgcgtcttc tgagacccga aggttggcga 4621 ttatctgctg gaggtcttcg ataccgtctt cgaagaacaa gacatatgtt cgctgagctc 4681 ggacatagga tagtcggcat aggcggatac cctcgccacg caaatggcgc gatcgacacc 4741 ctgatcggag cctgcgaagt gccacagttt attggaacca tcccccgttc attccgccct 4801 ggaatttcac ttcgacggca ttctagatcc tcgccgttgt cttagagagc catctttcgt 4861 cggcgcaccc ttaagtgcat ggaaagccgt tttcgcttca aatgaaatcg aaaagaagaa 4921 aacaaaaatc ctagagtaac cgaccctccc gataatcgtg aacatcagat cacagcattt 4981 cttccgaccg aagtggctgt gttggttatg agcttgggga gattgggaat gttgaaggca 5041 acagggccgc tgtcgattat cttactggcc tccacgtgca cgtcgagtgg tgctgcccca 5101 ctttcatttg ctgagttcaa taattttgca cgcgaatgcg ctccatccgt tgctccatct 5161 acgcttgcag cgatcgctca ggtcgaaagt cgctttgatc cgcttgctgt gcatgacaat 5221 accaccggcg aaacgcttca ctggcagaac caggctcaag caacgcaagt cgtgatggac 5281 ggtctcgaag cacggcattc gctggatgtt gggctcatgc agatcaattc ccggaatttt 5341 tccgtgctcg gtctgacacc tgacggagcc cttcaacctt gcacgtcatt atctgtcgcc 5401 gcaaacttgc ttgggagccg ctacgctggc ggcaacacgg ctgacgacga gcaattgtcg 5461 cttcgtcggg caatctccgc ctataacacc ggtgatttca cgcacggctt cgcgaacggc 5521 tacgtgcgaa aagttgaaac ggccgctcaa cagctcgtcc ccccgttaac cgcgcgtcca 5581 aaagatgatc gtgagaagcc gggatccgag gaaacatggg atgtttgggg ggcatataag 5641 cgtcgttccc cggagggcgg agctggcggg tcatccggtc cgcctccgcc gccagacgag 5701 gacaaccgca aatccgaaga cgacgatcaa ctcttgttcg acttaaatca aggaggtccg 5761 caataatgcg atgctttgaa agataccgtg tacatctgaa tcgcctctcg ctctcgaacg 5821 cggtgatgcg catggtatcc ggctatgcgc cgagcgtggt cggtgcaatg gggtggagca 5881 ttttctcctc tgggccggcc gcggcccaat ctgcaggtgg cggcactgac ccagccacaa 5941 tggttaacaa catatgcacg tttatccttg gtccgttcgg ccagtcactc gctgttcttg 6001 gcatcgtggc catcggaatc tcctggatgt tcggtcgcgc ttcactcggt ctcgttgccg 6061 gcgtcgtcgg cggcattgtc atcatgtttg gagccagctt cctgggcaaa acgctgactg 6121 gaggtggcta atgaatgatc gtctggaaga agcaaccctt tacttggcgg cgacacggcc 6181 cgcattgttt cttggcgtgc cgctgacgtt ggcggggcta ctcgtgatgt ttgccggctt 6241 tgtcatcgtc atcgttcaga acccgctgta cgaagtcgtt ctcgtgccct tgtggttcgg 6301 agcgcggctt gtggtggaac gagactataa cgcggccagc gtcgttctac tttttttgca 6361 gacggcggga aggagcgttg atggtctgat ttggggcggc gcaagcgtta gcccaaatcc 6421 aatcaaggtt cccgcgcgag ggagaggaat ggcgtaatgc tcggagcaag tggcacaacc 6481 gaaaggtctg gcgaggtcta tctaccctac gtcgggcacg tcagcgacca tattgtcctt 6541 ctagaagatg gatcgatcat gacgatggcg cacgtaagtg gcatggcctt cgaactcgaa 6601 gatgccgaaa tgcgcaatcg acgttgccgt gcatttaata cgctcttgcg caatatcgct 6661 gatgatcatg tgtcaatata tgctcacctc gtacgtcatg acgatgtgcc gccgtcaccc 6721 gcgcgacatt tccgcagcgc tttttccgcc agtctgagcg aagcttttga ggagcgcgtt 6781 ctctccggca aactccttcg caatgaccac ttccttacgc tgatcgtgtc tccccgggcc 6841 gcgcttggca aagtgaggag aaggttcacc aaacgctaca gacaaaaaga aaacgatctc 6901 acagctcaaa ccaggaacct ggaagatctc tggcatcttg tcgctggcgc tctcgaagcg 6961 tacggcctgc gtcgtcttgg tattcgtgag aagcaagatg tgctttttac ggaggttgga 7021 gaagctctgc ggctgataat gactggtcga ttcacgccgg ttcccgtcgt tagcggttcg 7081 ctcggcgcct cgatctatac cgaccgagtt atttgcggca agcggggact cgagatccga 7141 acaccaaaag atagttatgt gggatctatt tactcgtttc gcgaataccc cgcaacgacg 7201 ggaccgggta tgctcaacgt gctactgtct ctcgattttc cgcttgttct gacgcagagc 7261 ttctcgtttc tgactcgctc gcaagcccac tcgaagctca gcctcaagtc cagccaaatg 7321 ttgagttctg gcgacaaagc cgtcacccaa atcagcaagt tatccgaggc ggaggacgca 7381 ctagcgagca acgaattcgt attgggggcg catcatgtga gtctttgcat atatgcaaat 7441 gatctcaata atcttgcaga tagaggtgcc cgcgcccgga cgcgattggc ggatgcggga 7501 gctgttgttg tccaagaggg catcggcatg gaggcggctt attggtcgca gctgccaggc 7561 aactataagt ggcgcacgcg tccgggagcg atcacatcgc gcaacttcgc tggtttagtc 7621 tcattcgaga attttcccga gggatccggc tcaggtcact ggggcaacgc gattgcgcgc 7681 tttcgtacca atggtggaac ccctttcgac tacatcccgc acgagcacga tgtcggcatg 7741 acggcgatat tcggtcccat cgggaggggt aaaacgacgc tcatgacctt tatcctcgct 7801 atgctcgagc agagcatggt cgaccgcgcg ggtgcggttg tcctcttcga caaggaccgc 7861 ggcagtgagc tgctggttcg cgccaccggg ggaacatatt tggcgctccg tagaggagga 7921 agcggattgg cgccattgcg tggcctggaa aatacagcgg cttcacatga ttttctgcgc 7981 gaatggatcg tggcgctcat tgagagcgat ggccgtggag gaatatcccc ccaggaaaat 8041 cgccgtctgg tgcggggtat ccatcggcag ctctcgtttg atccccacat gcgctcaatc 8101 gcggggttac gcgaattttt gttgcatgga cccgccgaag gggcgggagc gagactccaa 8161 cgctggtgcc gtggcaatgc actaggctgg gcgttcgacg gcgagctcga cgaagtaaag 8221 ttggatcctt cgattactgg tttcgacatg acgcatcttc tcgaatatga ggaagtatgc 8281 gctgccgcgc gagcatatct tctgcaccgc attggagcca tggttgacgg ccgtcggttt 8341 gtgatgagtt gcgatgagtt tcgcgcctat ttgctaaatc ctaaatttgc ggcggtcgtc 8401 gacaagttcc tgcttactgt ccgcaaaaac aatgggatgc tgatactggc aacgcagcaa 8461 cctgagcatg tcctggaatc gcagctaggc gccagtctcg tcgcgcaatg tatgacgaag 8521 attttctatc cttcacccac ggcagatcga tcggcttaca tcgatggact gaaatgtact 8581 gaaaaggaat ttcaggcgat ccgtgaagac atggcagtag gtagccggaa gtttctgctt 8641 aaacgagaaa gcggaagcgt cgtctgcgaa ttcgatctgc gggaaatgcg cgaatatgtc 8701 gccgtacttt cggggcgcgc caacacggtg cgcttcgcgg atcagcttcg caaagtacag 8761 ggggacaacc catcagcctg gctcagcgaa tttatggctc gttaccacga ggcaaaagat 8821 tgatcaaagg tggggaacta tgaagatcat gcaacttgtt gctgcggcca tggccgtcag 8881 ccttctttcg gtcgggcccg cgcgggcgca gttcgttgtc agcgatccgg cgacggaagc 8941 tgagacgctg gcgacggcgc tcgagactgc ggcaaatctc gaacagacca taacgatggt 9001 ggcgatgtta acctcggctt atggcgtcac cggcctacta acttcgctca accaaaaaaa 9061 tcagtatccc tcgaccaggg acttggacac ggaaatgttt tcgccgcgaa tgccaatgtc 9121 gaccacggca cgtgcgatca ccaccgatac agatcgcgcc gtagttggtg gcgacgctga 9181 agcggatctg ttgcgatcgc agatcaccgg ttccgcaaat agcgcgggca ttgcggctga 9241 caacctggag acgatggaca aacgcttaac agcgaatgcc gagacctcga cacagctttc 9301 tcgctctcgc aatatcatgc aggcaaccgt taccaacggt ttgcttctca agcagatcca 9361 tgacgcaatg attcaaaatg tacaggcgac cagcttgtta acgatgacca ccgcgcaggc 9421 tggccttcat gaggcggaag aggcggccgc tcaacgtaag gagcatcaaa agactgcggt 9481 catctttggg gccgtcccct gagactgggc gacttattcg tgccgcccat ctgcacattt 9541 caggatgcga gttcattgta tccgacattc tgcgacaagc cagtcaaggt caggtccaaa 9601 cgatgaattt cacgatcccg gcgccgttta cggccattca tacgatcttc gatctagcct 9661 ttacgacaag cctggacaca atgcttggga cgatccaaga ggcggtgagc gcgccattgg 9721 tcgcctgcgt cactctttgg attatcgttc agggtattct ggtcatgcgt ggcgaaatcg 9781 acacgcgcgg cggtatcact cgggtgatca cggtcaccgt cgttgttgcc cttgtcgtcg 9841 ggcaggccaa ctaccacgac tatgtggttt cagtctttga agagacgatt ccaaacttta 9901 ttcagcagtt tagtggcagc ggcctgcctc tgcagaccat tcccgctcag ctcgatacaa 9961 tgttcgccct aacccaggct gcatttcaga gaattgcatc tgaaatcggc ccgatgaatg 10021 accaggacat ccttgctttc cagggggctc agtgggtctt ttacggcacg ctctggtctg 10081 ccttcggaat ctacgacgcc gtcggaattc tcacgaaagt tcttctggcg atcggccctt 10141 tgatcctcac gggatatatc tttgatcgca cgcgcgacat cgcggcaaag tggatcgggc 10201 aacttatcac ctatggtctc ctacttctcc tgctaaacct cgtggcgaca atagtcatcc 10261 taaccgaagc gactgcactc acgcttatgc ttggtgtaat cacccttgcc ggtacgaccg 10321 cggccaagat cattggtctt tacgaactcg acatgttctt tctgactggt gacgcactca 10381 ttgtcgcttt gccggcaatc gccaacatcg gaggcagtta ttggagcggc gcgacccaat 10441 ctgccaacag cttgtaccgt cgcttcgctc aggtcgatcg tcgttaggtc ccgcaaaaaa 10501 attcacgtcg gtggagaact ctatgaaata ttgcctgctg tgcctggctc tcgctttaag 10561 cggctgccag acaaacgata aattggcgag ctgcaaaggc ccaatattcc cgctgaatgt 10621 ggggcgatgg cagcctacgc cgtcagatct tcagctcagc aacgtaggtg gtcgccatga 10681 aggggtctga atacgccttg ctagtagcgc gggaaaccct ggctgagcac tacaaggaag 10741 tggaagcttt ccaaactgcg cgtgcgaaat cagcgcggcg cctctccaag gtcattgcag 10801 ctgtcgcaac catcgcggtt ttggggaatg ttgcgcaagc cttcacaatt gccaccatgg 10861 tgccgctgat caggcttgtg ccggtatatc tttggatacg gccggatggc accgttgaca 10921 gcgaggtgtc cgtctcccga ttgcctgcaa ctcaagagga ggccgtcgtt aacgcctcac 10981 tgtgggagta tgttcggctg cgcgagagct atgatgccga taccgcccag tatgcctatg 11041 acctggtctc gaacttcacg cgcccaatgg tgcgccaaaa ttatcagcaa ttcttcaatt 11101 atcccaatcc aacttcgcct caagtcatcc tcggcaaaca cggcaggcta gaggtcgaac 11161 acatcgcttc gaatgatgtt actccgggtg tgcagcaaat tcgctataag cgaaccctca 11221 tcgttgacgg caaaatgccg atggcgagca cttggactgc tacggttcgt tatgaaaagg 11281 tgaccagctt gcccggcaga ttgagactga ccaacccggg aggcttggtt gtcacctcct 11341 accagacatc ggaagatacc gtttcgaacg caggccacag cgaaccatga cgaaaaaagc 11401 atttctcact ctggcatgtt tactttttgc ggcgattggc gcgagggctg aagacacgcc 11461 aacggcgggc agacttgatc cgcgcatgcg ttatctcgct tacaatcccg atcaagtggt 11521 gcgcctttca acggcggttg gagccacttt ggttgttact ttcggggcta acgaaacggt 11581 gacagctgtt gccgtttcca atagcaaaga tctcgcggcc cttccacgcg gaaattatct 11641 tttcttcaag gctagcaagg ttctcccacc ccagccagtg gtcgtgctaa ctgcgagtga 11701 cgccggtatg cgacgctacg ttttcagcat ctcttccaag acgctgccgc acctcgataa 11761 agagcaggcc gatctctact atagcgtaca attcgcttac cctgccgatg acgcagcggc 11821 tccgcagaag gcggcacaag agaaggctgt tgcagaccgt atacgtgcgg aagcgcaata 11881 tcaacagaga gcagagggtt tattggagca gcctgccacg accgttggtg ccgaggacaa 11941 gaattggcac tatgtcgctc agggcgatcg ttcgctgttg ccgctcgaag tcttcgatga 12001 tggatttacg acggtatttc acttcccagg taatgtacgc ataccctcca tctacacgat 12061 aaatccggat ggaaaggaag ctgtcgctaa ctattcagtc aaagggagct atgtcgagat 12121 ttcttcggtt tcccgtggtt ggcgtctgag ggatggccac acggtattat gcatttggaa 12181 taccgcctac gatcccgtcg gccgcaggcc ggagacgggc actgtgaggc ccgatgtgaa 12241 gcgcgtccta aaggaggtga gaggatgaac gacgacaatc agcaatcggc gcatgatgtc 12301 gatgcgtcgg ggtccctggt ctccgacaca catcaccggc gcctttcggg ggctcaaaag 12361 ttgatcgtag gaggtgtagt tctcgcgcta tcacttagcc tcatttggct tggcgggcgt 12421 gaaaagaagg aaaacgggga cgcacccccg tcaaccatga tcgccacgaa caccaagcca 12481 tttcatccgg ctccgattga cgttacactt gatcctccgg ctgcccagga agctgttcag 12541 ccgactgctc ctccgccagc acgaagtgag ccggaacggc atgagccgcg gccggaagaa 12601 acaccgattt ttgcgtacac cagtggtgat caagggacca gcaagcgcgt tcaacaaggc 12661 gaaacggacc gaagacgcga aggcaatggg gaagacagtc ctttgccgaa ggtcgaagtg 12721 tccgccgaga atgatctctc gatacgcatg aagcccaccg agctgcagcc caccagggct 12781 acgctcttgc ctcatcccga cttcatggtg acggagggga cgattattcc atgtatcttg 12841 caaacggcaa tcgacaccag tctggcaggc tatgtaaaat gcgtgttacc ctgggatgtt 12901 cgtggaacaa cgaacaacgt tgtgcttctt gatcgcggca ccaccgttgt tggcgagatc 12961 cagcgcggtt tgcaacaggg agatgcgcgt gtttttgtgc tctgggatcg ggcggagaca 13021 cccgaccatg ccatgatttc gcttgcgtca ccaagcgctg acgaactcgg tcgctcggga 13081 ttgccgggca ccgtcgacaa tcacttctgg cagcgcttta gcggggccat gctcttgagt 13141 gtcgtccaag gtccgttcca ggcagcgagc acctacgctg gcagctcggg tggagggacg 13201 agcttcaaca gcgtccagaa taacggtgaa caaacggcag acacagccct caaggccacg 13261 atcaacatac cgccaaccct gaagaagaat cagggcgaca cggtctccat ttttgtcgca 13321 cgggatctcg atttctcagg catataccag cttcgtatgg ctggtcgcgc ggcgcggggg 13381 cgggatcgcc gtccataacg aattcaactt atcacttaca gatggagata cgatacaaat 13441 ggaggtggat ccgcaattac gaatccttct caagccgatt ttggaatggc tcgatgaccc 13501 gcggaccgaa gaagttgcga taaatcgacc tggggaggca tttgtgcgcc aggccggcgc 13561 cttcctcaag ttccctttgc ctgtctccta tgacgatctc gaagatatcg ctattttagc 13621 aggcgcgctg agaaaacagg acgttggacc acgcaaccca ctttgcgcaa ctgaacttcc 13681 agacggcgag cggctgcaga tctgtttgcc gccgacggta ccatcgggca ccgtcagctt 13741 gacgattcga cggccaagtt cccgtgtttc tagtctcaaa gaagtctcgt cccgttacga 13801 tgctccgagg tggaatcagt ggaaggaacg aaaaaaacgg catgctcagc atgatgaagc 13861 tatccttcgg tactatgaca acggggatct ggaggcgttt ctgcacgcat gtgtcgttgg 13921 tcggttgacg atgctgcttt gcggacccac cgggagtggc aagacaacga tgagcaagac 13981 cttgatcaac gctatcccgc cgcaggaaag gctgattacc atcgaagata cgctcgaact 14041 cgtcattcca cacgagaacc acgtaaggct gctttattct aagaatgggg ctgggctggg 14101 cgcagttacc gctgagcacc tgctacaggc tagcctgcgc atgcgaccgg accgaatact 14161 gctcggcgag atacgcgacg atgccgcgtg ggcttatctg agtgaagtcg tctcagggca 14221 tccgggatcg atttccacaa tacatggtgc caatcccgtc caaggtttca aaaagctatt 14281 ttcgctcgtg aaaagcagcg ctcagggggc tagcttggaa gatcgcaccc tgattgacat 14341 gctcgcaacc gcagttgatg tcatcgtacc cttccgtgcc cacggtgaca tttacgaggt 14401 gggcgaaatc tggctcgctg ccgatgcgcg tcggcgcggt gagacaatag gcgatcttct 14461 taaccagcag tagttgtgat ccatgtttct aaatgccgca tggcgcgttg tagaattacg 14521 tttgtagcaa tgctcagcaa tctttgtcat caaacggaga catctagttt gcatttctgt 14581 cgtgcgcggt ttggtcgaaa tcttgccgaa atgcccgtgt agtgagagaa aattaaagag 14641 tggagtctag caaatacaac ctttacgtgt ataaattctg ttgagctgca aatggctggc 14701 caggatccta gattgagagg tgaaccgttg aaacacgttc ttgtcatcga tgacgatgtc 14761 gctatgcggc atcttatagt cgagtatctt acgatccatg cctttaaggt gactgcggta 14821 gccgacagca agcagttcaa tcgtgtactc tgctccgaga cggtcgatgt cgtggtcgtc 14881 gatcttaatt tgggtcgcga agatgggctt gaaattgttc gtagtctggc cacgaagtcc 14941 gatgttccaa tcataattat tagcggcgct cgcctcgaag aggcggacaa agttattgcg 15001 ctcgagttgg gagcaaccga ttttattgcc aagccttttg ggacgcggga atttctggcg 15061 cgcatccgtg ttgcgttacg cgtgcggccc agtgtcgcgc gaaccaaaga tcgacgctca 15121 tttagtttcg ctgactggac acttaatctc aggcgacgcc gcttgatttc ggaagagggc 15181 agtgaggtga aactcacggc aggtgagttt aatctcctgg ttgctttcct ggagaagccg 15241 cgcgacgtcc tatcccggga gcagcttctg atcgccagtc gggtacgcga ggaggaggtg 15301 tatgacagaa gtattgatgt cctcattttg cggctgcgcc ggaagcttga gggggatccg 15361 acgacccctc agttgatcaa gactgcaaga ggtgctggct atttctttga cgctgacgtg 15421 gatgtttcgt acgggggtgt gatggcggcc tgaggtagag gtgcatttcg cctttagcaa 15481 tctgttccca acgtgagcag attgctatgc ggcttggcaa agctgccttt cctcggtact 15541 atccgaaaaa ctcagcactg cggagtgatt ggatgggtcc tatcttttga gagatcagct 15601 gttcgttgcc ttctcccgag caaagaaaca tgcaagcgct gcggtagcca gcttgtggcc 15661 gaaagcccgg gcggtctcca atcccaatgg atcaaagtga ttgcgagcgg cttctattag 15721 cgagaccggg aacatgcggg aggtctgaac gatgattgat ttttcgaaag ctgtgtgagg 15781 gatcggataa ctcttcggag ccgcacgaaa cgatccatcc gccagcatgt tttcaaaatc 15841 gccaagcgca cggcgcaaga tcatttgtag cgacttggaa ggactgtatt gcaggatcag 15901 gttgtcatat atcttcgata cttcaggcgc gggcgggcgc gctgaaagga agacctggat 15961 cttttctggc gctgtcgtcg aactcaaagc atccacggtc agcatcgatt gctgatcaca 16021 gctgtgacaa cgcttggcgg tggctggggc aggtcgtcga tcttcctcgt cgagattttc 16081 aggcggctgc ggcagggtcg agttttgggt ggcaacaggc aaagaaggat ggacgatttc 16141 gggtcgagcg gcggcaagcc gcctggcctc cccgacagac aaagcgggtt tgcgaattcc 16201 catcttcacc cctccaaggc ttcgctaacc aatttggaga tagtgacgag ttcctccatg 16261 gcgattctga gattccgttc gaggaggcgc attgtcggat cggttctcat attcagcaat 16321 gtgagatgca acatgccacg ttccttcatc gcggcaaatg cgtctctctc gtgcatggga 16381 gactgtacaa ctggaaggct tgcgagcatg tccgacatcg cgcgctgcga tgtggtcaat 16441 cgaccaaccg gcacgcgttg gcgcaatacg gctgtcggaa ttgccaagtt ctcgctcagc 16501 agcagttcaa tgacatagcg gtaggtcgac aatgcttcat cgatatcgag cggagttaac 16561 atggtcggga tcagaagcag gtttgagctg gcaatgatcg tgttgttgag ttcgctcgaa 16621 ccaccatgcg tatcggccag cgcataatca aatccctgga gttcggcgtc ctcataggcc 16681 gcctcaagga gtgccatttc ctcggcggcg tagacttcgc agaaggagcc ccaggtattg 16741 ctgcgaaggg cgttttcttt ccatcgcgtt agtggtcggt tttcatcagc gtcgaagaga 16801 gccaatcgtt tgccgtcact tgcaaaggcg gcgcaaagac ccatgagtgc cgtggttttg 16861 ccggctcctc ccttgaagga gcaaaatgtc agaagtttca tgtccttatc ctgtcgattt 16921 tgtgaagcgg aagtgtgtct gtacttttat ttgtgtgtat gattttgcga taattcataa 16981 gtaatgtagt aattacctga ttttatattt caattttatt gtaatataat ttcaattgta 17041 ataatataaa aataaatatc ccttatgtgt tcttgatttc gttttgtata tggctagatt 17101 cccatctgcc acgacgagga aatgctacgg cggggcaagt tccaagatct ttccgtcttc 17161 tatggaggaa gctatgtcgc aaggcagtag gcccacctca agtgacattg ccgtcaacca 17221 gcgcgaatgc gtgaaggttg aaggcttcaa ggtcgtcagt acccgattaa gatcggccga 17281 atatgagagt ttttctcatc aggcacgctt gctgggcctc tccgacagca tggccatacg 17341 ggttgcggtg cgccgcattg gtggctttct tgaaatcgac gcagagactc gtcataggat 17401 ggaggccata ctacaatcca taggaacact ctcaagcaac attgccgcgc tgctatctgc 17461 ctatgccgaa aatccgacaa tggatttgga ggctttgcga gctgaacgta tcgccttcgg 17521 taaatctttc gctgacctcg acggcttgct ccgttccatt ttgtccgtat cacggcggcg 17581 gatcgacggt tgctcgctgc tgaaagacgc cttgtagcac tgacgtagca cttggcgggg 17641 aacatattcg atgcccgatc gagctcaagt tatcattcgc attatgccgg gaggtggcac 17701 caagaccctt caacaaatta tcaatcagtt ggagtatcta tcccggaagg gcaggctgga 17761 gctgcagcgt tcagcccgac atctcgatat tcccctgcca ccggatcaaa tccacgaact 17821 tgcccgaagc tgggttcaag agactggaac ttatgacgaa agtcagccag acgaggaaag 17881 gcaacaggag ttgaccaccc atattattgt aagcttcccc gccggtacaa gccaggtagc 17941 ggcttatgcg gcgagccggg agtgggcagc cgagatgttt gggtcaggcg cagggggggg 18001 ccgatacaac tatcttacgg ccttccacat cgatcgcgac cacccacatc tgcatgtcgt 18061 cgtcaatcgg cgcgaacttt taggacacgg ctggctgaag atatctcggc gccatcccca 18121 actgaattac gacgccctgc gcataaagat ggccgagatt tcacttcgtc atggcattgc 18181 cctcgatgcg agccgacgag cagaacgtgg catcaccgag cggccgatca cttatgccca 18241 atatcggcgc cttgagcggg agcaggctcg ccaaatccgt ttcgaagacg cggatttgga 18301 acagtcgtcg ccgcaaggag atcatccaga attcagccaa cctttcgata catccccatt 18361 tgaagcatcc gcgggcggac cggaggacat gcctcggccc aacaatcggc agaatgagtc 18421 gcaagttcat ctccaggagc cagctggtgt cagcaacgaa gccggtgtcc ttgtgcgggt 18481 tgcattggag acggagcgcc ttgctcaacc attcgtttcc gaaaccattc tcgcggacga 18541 catagggagc ggctcttcgc gtgttgccga gggccgtgtg gagagcgcaa accgcactcc 18601 cgatattcct cgcgcagcaa ctgaagctgc cacgcacacg acacacgacc ggcagcggcg 18661 tgcaaagcgt cctcatgatg acgacggagg gccgagtgga gcaaaacgtg tgacattgga 18721 aggcatcgcg gttgggcccc aggcgaacgc cggcgaacag gatggcagta gtggcccctt 18781 agtacggcaa gctggaacgt ctcggccatc tccaccgacg gccacgacgc gggccagcac 18841 cgcaaccgat tcattgtctg ctacagccca cctccagcaa cggagaggtg tcctttcaaa 18901 gcgtccgcgt gaagatgatg atggagaacc gagtgaacgc aaacgcgaga gagatgagcg 18961 cagcaaggac gggcgtgggg gaaataggag atagagaagg accgagtgat ggcaaatgaa 19021 gagttcacca gacactatgc gtggcccgtt cctgtggctt cgaatgatga agggcgtggg 19081 accgcgcgga tccccatcca ggcacaatca atcgttgctg gagaagacgg tcgggacact 19141 tcggtcccaa cggctttgtc gcgaccgcca attgaagata tgccgcacgg cgtccaagaa 19201 acatcggcga gtggcggacg actgggagcg gcccgtctgc gggattccgt aatcccgcca 19261 ggaatatccg aagcccgcac ggacctatcc gcaattttgc ggaaaaaaag cggttctttc 19321 cgcaccggta tgcagtatct gcgtgggctt gaacgggaaa attttgataa acaagacagg 19381 gaagccagtg cgttgccaga tttaagtgca aggggcataa agcgaccgcg cgaaattgag 19441 tatcccggca atgcaagcgg attaaccata aagaggcaag acggcttagg catagagatc 19501 aatactatct cggcatcttc gcccgtgaac cgggccgcgc attcgtcgaa ctggcaaggc 19561 gcgccggaac cgggcgtgta caatgttcag ccatcggcag atagagcaca gaactctgcg 19621 caggaaagtt cgacgtttcc cgatggcact tctgtttccg ctctctattc aggcccactc 19681 gcagaatggt tcgagaggga tactggcagc gagacgacca ggaattcagg caacaccatt 19741 tcgtcaccgc tccgagggct ggaggagttc ggcgattccg cggacagccg gtatcttggg 19801 cgcgaagctc agagtctttc agttaccgta acaacgccta attcgaatgc tgaggcaagt 19861 tctcatagcg cacacactga aactctcgac gatgtcagca gtgaccgctc gagcgaacag 19921 gggaggggtc cccttggcgc cgcaatcctt ggatcccatc atgacctttc accgcgtgcg 19981 cagaagttat cacaaacaaa ccgcgattcc cctgaactga ccgacgccga cctggcaaaa 20041 gtcgatgcgg tgttcgaatc tctctccaag gggcccccgg gggcggagag cgccgcaccg 20101 gactttcgtg agcgcggacc aggcagcgct ttccagaaag aaggcgtgtc cgacagggcg 20161 aacggggtgc ccactaactg ggaggtacct tttggtcgcg gtggcgggca ttcaccgcaa 20221 gcgctgaggt cctcaggagt ggagctcgat gacttccctg atttcaccga ggccgaactg 20281 gcaaaaatcg acgcactagt cgaatctcac tccaacagat ctttatcagt gcgaaatata 20341 gtaccagatt tgcgcggagc gggagcagac aacgtcttcc ggaaagaagg cgttgtcgaa 20401 cgcgcggaga agatgccgat cgatagtgtg agtcttacca gactgaatgg cgagcgttca 20461 cgttcaccga agacatcgca agcaagcctt gaggatttcc ccgatctaac cgatgccgat 20521 ctggcccata tcgaagagtc ggagcggatc gcaagaactg ctgttgaaaa ggggaagcaa 20581 aaaatctcga ccgaagccga tacgcggttt gatttgggca actcttctgc accccgggtt 20641 agccctcgat cggttacccc gttggttccc aatgccaatc agccgattac atcgtggttc 20701 tacgaggcac aaaagacttg tgacaaattg gtagagaaca cttacgtgaa gccggccgtt 20761 gacagttcta gggcgcgaaa tgacgtcgaa aacaccgctg ccaggcttgg tgaccctgct 20821 cctgctttgg ggcatgacaa tcttggtaga actcgcgcgt taacgcccgt tcgcgacgtg 20881 atgtctagac ccagcgcgga ccgacaactg gctagtcatg cggcagaaca ttcggcaata 20941 gacgacattt ggaaacggga tgaccgtgat cgcaggacac acccttatag aggattggat 21001 tcgcgttcac gtgaaggttg cgggcggtaa aggctgtgtt gttcgcagca acacccaaat 21061 caaaagccca atttgcgatc gatcgaagaa tctcggtgaa gtgatgaatt cgagcaagac 21121 tacgccccag cgtttagctg tcagcatcgt atgttcgctg gcagccggtt tttgcgcggc 21181 aagtctctat gtaacatttc gccatggttt caatggcgaa gcgatgatga cgtttagcgt 21241 cttcgccttt tggtacgaga ccccgctcta tatgggtcat gcgacccccg tcttctattg 21301 cggtttagcc attgtcgtct cgacgtctat tgttgtgctg ttaagccaac ttatcatatc 21361 gtttcgcaat cacgagcatc atggcacggc tcgttgggcg ggatttggcg aaatgcggca 21421 cgccggttac ctgcagcgct ataatcgtat caaggggcca atctttggca agacgtgtgg 21481 ccctcgttgg ttcggcagct atctgaccaa tggcgaacag cctcacagtc ttgttgtcgc 21541 gccaacgcgc gccggcaaag gcgttggcgt cgttattccg acgctcctaa ccttcaaggg 21601 ttcggtgata gccctcgacg tcaaaggcga gctttttgag ctgacttcca gagcacgcaa 21661 agcgggccgc gacgccgttt tcaagttctc gcctctggat ccagagcgac ggactcattg 21721 ttacaatccc gtcctcgata ttgcagcttt accgcccgag cggcagttta ccgagacgcg 21781 ccgcctcgcc gcaaacctca tcacggccaa aggcaaggga gcggaaggct ttatcgacgg 21841 cgcgcgggac cttttcgttg cgggcatcct tacctgtatt gaccgcggta cgccaacaat 21901 tggtgcggtc tacgacttat ttgctcaacc tggagagaag tacaaacttt ttgcgcacct 21961 cgcggaagaa agccgaaata aagaggctca gcgcattttc gacaatatgg cgggtaacga 22021 cacgaaaatt ttgacttctt atacatcagt gctcggcgac ggcggactta acctgtgggc 22081 tgatccattg gttaaagcag cgacaagtcg atcagatttt tccgtctatg atctgcgtcg 22141 gaagaggacc tgcgtctatc tttgcgtcag tcccaacgac ctcgaggtcg tggcgccatt 22201 gatgcgcctc ctttttcagc aggtcgtgtc aattctgcag cgatcactgc caggtaaaga 22261 tgagcggcat gaagttttgt ttctccttga cgaatttaaa cacctgggga agcttgaggc 22321 aatcgagacc gcgattacaa ccatcgccgg ttacaaaggc cgcttcatgt ttattattca 22381 gagtctctcg gccttgacgg gcatctacga tgacgcgggc aaacaaaact ttctcagtaa 22441 cactggcgtg caagtattta tggccacggc cgacgacgaa acaccgacct atatctcaaa 22501 agctatcggc gactatacgt tcaaggcgcg ttcgacctca tacagtcaag cccgaatgtt 22561 cgatcataac atccagattt ccgatcaggg ggcaccgctt ttgcgccccg aacaagtgcg 22621 cttgcttgac gataacaatg aaatcgtcct tattaaaggg catccgcctc tcaaactacg 22681 aaaggtgcga tattattccg atcgtatgct gaggcgcctt ttcgaatgcc aaattggcgc 22741 cctccctgag cccgcatctt tgatgctttc ggaaggtgtc catcgggatg ggcaagacct 22801 cagtcaacaa gccgccgtca cggaggcagc tagtactcga tcgataccta ataatatgga 22861 agccgctaca ccgcaaaaca gtgaaatgga tgacgagcag gacagcctcc caactggaat 22921 tgacgtcccc cagggcctta ttgaaagcga tgaggtgaaa gaagacgccg gtggcgtggt 22981 gccagatttc ggtgtttcag ctgaaatggc tccagctatg attgcacaac agcagctgct 23041 ggagcagatc attgcgcttc agcaacgata tggacccgcg tcctcgcact cggtgaaatg 23101 acttcaagct gcctttcaca ttggaatcat tcgatgcggg gcgtcgtcag atatccggtt 23161 gtattcgaat gcggactaga accccgaggt tgccgattaa cagatatcgc catgagacct 23221 tcaggaaacc cgaacgtcga tcttagcggt tcgactgcat cgcttgccga agttcccgcc 23281 ggagctaccc ctgtccttaa tctaatcgag cccaggaacc gtccggctga cgactcgctt 23341 gagggccaaa ccgatcgcgg cgagcatcca tctgcatcat ttgactatga tggcatgaag 23401 cttggcgccg cggagcgtga agcatacgag aactggtgtc catcgaaccg gcctacatgg 23461 aaagatctgg tactcagggc gcgccttgat gcaatcgaca gttccgcttg gctccccgat 23521 ttgggcgagg agtcgccttt gatcttcaga tatgaaggga ttccgctggg tgagggggaa 23581 cggcaagcct acaaagaatg gcaagaggag gctcagccca catgggaaga cctcgttgtc 23641 aacgcacgaa tggcggtacc tgatccttgt gctgacgttg cagacgagca caatcccctc 23701 aaagaaggcg aggagtttcg gtctgaagcg tcgaaacgca agcggaaaaa accgatcgac 23761 caggacgaga attctcctac atcgttttac tatgacggga tgaggctcgg agaacccgag 23821 cgcgaggcat atgataactg gggcaacgcg gagccgccca cgtggaaaga cctggtactt 23881 aaggcgcgcc ttgatgcaat tgacagctcc gcctggctct ttgcttcaga agggtcttcc 23941 tcgacttttg agtatgaggg aattccactg ggtgaggggg aacggcaagc ctacaaagaa 24001 tggcaagagg acgctcagcc cacgtgggag gacctcgtca ttaatgcacg catggcagaa 24061 ctcgaccatc cttcttggat tacagacgag cacaattccc ttgaagaaaa cttagagttt 24121 cggcccgatg caagacaggc cagcctgaag gactcgaccg accagcggaa gagttcttcc 24181 gcgtcattta tctatgatgg aatgaagctc ggggaacccg agactgctgc atacgagaac 24241 tggagcaaac cggaacgacc gtcatgggaa gccctcatcc tagatgggcg ccaggcttcc 24301 atagcaagct cttcggtttc gaattcgtta cttgcaaaga catcctcgcc agtctttcta 24361 tacgagggaa tgtcgctagg ggatgcggaa cgtcaatcgt atccaaggtg gaggcagcct 24421 gcccaaccgc gatggcaaaa tcttgtggtg aacgcgcgcc ttgcggatct tgatccgttg 24481 gcctggattc ccgatgaaca tgatccgttt gcggaagccg aggcgcttag ccccacttcg 24541 caatcgagtg gcgccaacaa gtccaaccgc gctttgggtc aatcagattc aggccggccc 24601 gccttcgcat atctggcagc acaagaggcg agtcacctgc aatcaccggc atgctcacaa 24661 ttggaaacga ggcgtgcatt aaatttcggg tcgcctggac cggatgcaaa tccaacggaa 24721 agcatcgcca aatgtaatcg tttggatggc gtcagtaaaa ttaagcgatt aggcaccaaa 24781 ggccgccggg cagtaaacgc gactattcac ggcggcaagt tcggtgcgca aggactgttg 24841 tccgaagatt gcggacaagc ggctgagccc tctccatcgg aacaaactac tcggccgcga 24901 accgataata ttggtaccta tgcaagccgg aaaaacgaac gagctcggct ggctacagag 24961 accgggaaat acgaatcgga gcacattttc ggatttaagg tcgtccacga tactttgcgg 25021 gcgaccaaag aaggccgtcg tctcgaaagg ccaatgccgg catacctcga atacaaggag 25081 cttcatcggc aacacgtagg gacaggaaga ggacggactg gactggtcgg gcgcggatgg 25141 ccagatgatg cgagctatcg cgcagatcaa agggcaaccc tgtcggaccc tgttgcgtcc 25201 gcggaaggtg caacagcttc aaatgggtac cagttgaacc agctgggcta cgcgcaccag 25261 ctcgccaccc atggtctcca aagtgaaacg cccgatgggg ttaccatgcc aattcaagtt 25321 gcgacaatta gctacaacta tacagtgagt cgtgatccgg tcctttcacc acccagcaaa 25381 gaacaagcgc ccccattgct acacctgggc cttcgtgggc aaacggaagc tgtgcttgcc 25441 cgcgagaccg cattgaccgg aaaatggcca actctcgagc gtgagtgtca agtgtatcag 25501 cggtttttgg ccctgtacga cgtaaagaag gatctggacg ccaaacaact tggtgtacgg 25561 cagaaaaagc aggcactcgt tgcggcgttg aaccggaccg ccggcttaat tggcgcgtca 25621 cctttggaag cccaatcgtc gaccgctgag gttgaatata ctaccgacga gcccgatgaa 25681 cgacgggttt atgatccgcg cgatcgaggc agagacaaag cgttcaatcg ctgacaaagt 25741 cgcctagaca atgattgtta cggacttcgc aatcgatttc ttgacagcag tgccattcaa 25801 cttttgatcc accatcggaa aataatcggc ggagaaacgc acgacagctt ctattgccaa 25861 gccccgtcga tcgatctcat ggcatattcc cgttatggtc gatttttcga ggggccgcgc 25921 aacctctctc agcaaagatg aagttctcgc gggtccgcca tgggtctcca ggaaaatcct 25981 gggcgtcacg gtcggtaaag aattgaagac tttaagcaaa cgcgaacttg acgttcaaag 26041 attacgcgcc gcggaacgcg ccctcgataa acaagtctgg cagaatcctc ccgttaaccc 26101 ctcccgtcga aacgaggaga tgccaatagt catcttcgaa acgaacgcga tgagcggcta 26161 gagtcggaga tgacgatcgc gcatcaagcc gcttccgtct ccgttggcga ggatcgccaa 26221 gctcttgatg ccgctcgatc gaccttgaat gaactgcaca actcgccatc gtcggatgat 26281 agggcaaact tgagtttgcc gagagccgaa atttgcgacc gtacaacgta tccgccgtct 26341 ccatcccatt gaatacggat atcctgagca ttaggtccgt gccaatacgt ctccggcaag 26401 agctgcagat ttggcatggc ggatgatgag ggatttgctg ggcgctttga gattgtcgag 26461 gcgcgtcgag gaatactatc cccctcgaca agcgaagagt ctgatccgtg gggtcgaggc 26521 ctgcagaaca ccgtaaacgc taacggcggg atcccgaatg accagtcatt gtggcattcc 26581 gaaagaatta cattcacacg gcaccggtta ctcgatcact atctcctgcg gccttattcg 26641 ataaagtcgt gctaactccg tgacttggca agaattacat atgaaacgcg atattcgttt 26701 aatgacggag gctcggatgg tagattcacc gaaggctcaa ttctattagg agtcttgatt 26761 gtgtccgaag agtgactgcc gctgcagcag cctagaaagc aatgcacggt gatgatgttg 26821 atcggacggc tttcgtcccg cgcaggctgc cggctgaagc cgcttacgtc gatagtcaag 26881 ttgacctggc ggcaaaccga caacctcact gtgtgacttg ttttgagcga cttttggttc 26941 aagagaaaga tgaaaaccgt cgacgattgt gggcttccgg caaagcatgc gccacgtctg 27001 aagagagttg tgcaggaggc cctacactcg caccgggcca ttcggggcca tgaagatgga 27061 gcaaacctta atttgcaccg ataattttcc gttcgatgga tttgggctgg gcaggatcgg 27121 agcggaaggc actaggaaaa tcgtaatttc tgtaggagaa acaatgatat ggtgatcatc 27181 aagctaaatg cgaataaaaa catgcctgtc ttggcggttg agaagccgca agaaattcac 27241 aaagaggagt tgagcgacca tcaccagtca aatggcttta cgagtttgga tctcgaaatg 27301 attgaactgg agaattttgt ccttcactgc ccgctccccg aagaaaacct agccggctaa 27361 aaggagttag acgatggatc cgaaggccga aggcaatggt gaaaatatca ctgagaccgc 27421 agcaggcaat gtcgaaactt ctgatttcgt gaatttgaag cgccagaaga gggagggcgt 27481 aaattccacc gggatgtccg aaattgatat gacgggtagc caagaaactc ccgaacacaa 27541 catgcacgga agcccgactc acacggatga tctcggcccg cggttggatg cggacatgct 27601 cgattctcag tcaagtcatg tttctagcag cgctcaaggc aatcggtctg aggttgaaaa 27661 tgagctatcc aacttattcg cgaagatggc tttaccaggc catgatcggc gtaccgacga 27721 gtatattctt gtgcggcaaa ccggacaaga caagttcgca ggtactacta aatgtaacct 27781 cgatcatctg cccaccaagg cggaattcaa tgcgagctgc cggctctata gggatggagt 27841 cggcaactac tatcccccgc ccctcgcatt cgagaggatc gatatcccgg agcaattggc 27901 tgcacaattg cataacctgg agccaagaga acagagtaaa cagtgttttc agtacaagtt 27961 ggaagtctgg aatcgcgctc acgcagagat gggcatcact ggcaccgaca tcttctatca 28021 aacagacaag aatattaagc tcgaccgaaa ttataaattg aggcctgagg atagatatat 28081 acaaacagag aaatacgggc gcagagaaat tcaaaaacgc tatgagcacc agtttcaagc 28141 tggttcactg ctgccggata tcttaatcaa gaccccgcaa aatgatatac atttctcgta 28201 caggtttgcg ggcgaccgtt acgctaacaa gcgatttgag gaattcgaac gcgcaatcaa 28261 aactaaatac ggtagcgata ccgagatcaa gctcaaatcc aaatctggga ttatgcatga 28321 ctccaaatat ttggaatcat gggagcgggg cagtgcggat atccgtttcg cagagttcgc 28381 cggcgagaat cgagctcaca acaagcagtt tccggctgcg actgtgaata tgggaaggca 28441 gccagatggc cagggaggga tgactcgcga tcgccatgta agcgttgact acctattgca 28501 aaacctaccc aactcccctt ggacgcaagc cttgaaagag ggaaagttgt gggatcgagt 28561 tcaggtcctt gctcgcgacg gaaaccgtta catgtcacct tcaagactgg aatattccga 28621 ccccgaacac tttacccaac tgatggatca agttggtctg cccgtgtcga tgggtcggca 28681 aagtcatgcg aatagtgtca agtttgagca gtttgacaga caggcagcgg ttattgttgc 28741 ggatggcccg aacttacgtg aggttccaga tttgtccccg gaaaagttgc aacaactgtc 28801 tcaaaaagat gtcctgatag cggatcgcaa tgaaaagggg caaagaaccg gcacttacac 28861 taatgttgtg gaatatgagc gcctgatgat gaaattaccg agcgacgcag cgcagcttct 28921 cgctgaaccg tccgatagat attcacgtgc ttttgtccgg ccggagccag cattgccccc 28981 catcagtgac agccggcgga cttatgaaag ccgaccgcgc ggcccaaccg taaacagtct 29041 gtagatttcc tgataccgcg tcagtgacga agccctttag ttcaaaatat gcatttgaaa 29101 ggattcaaca atggtcgaca cgacgaagaa gagtgtcgcg aagtcgctta cggctgacat 29161 gcgccgttct gctaagcggc tttctaagca aatgcgtaaa gcctcgctta ctgaagagga 29221 ggcaacaagg aatctagccc ggctcgaaac gccggaccag aagcgaaaat atgtcgccga 29281 tatgcagata atcgacaagc tggaagacgg ctttcgaggc gaaataagct ataaaatgct 29341 gggaaataaa cagcttcggg tcgacagccc aaaagaatta acgcgcgagc atggtataat 29401 aagaaaaaca agaaaggttc tgaagcgtaa cgcagagact ggcaatgttt acttgggtct 29461 ccacgaaaag aagacctgga ggagcgttag cagccatctt tatgccgagg acggtacact 29521 tcgcgcgaag catgtgaaat acaaagacgg acgctttgaa gaaaaatggg aacgagacga 29581 aaatggcctg ctgttccgca cgcagtttgt caaccgaaat cggctatttc aacctatttc 29641 cgagaaggtc agcacaccct accggagcgg accggaaaac cggctctttc gtgaactaac 29701 ccgtcgaaaa ggttccaagc aggaaacttt tgagcgggac gaaaaaggca acctcgagct 29761 catcggcagc aaacgtctcg gcttttccaa gaattcgacg aa // LOCUS HUMPCOLLA2 700 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human procollagen alpha2(I) chain gene, exon 6. ACCESSION M35391 KEYWORDS procollagen alpha2(I). SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 700) AUTHORS Weil,D., D'Alessio,M., Ramirez,F. and Eyre,D. TITLE Structural and functional characterization of a splicing mutation in the pro-alpha2(I) collagen gene of an Ehlers-Danlos type VII patient JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by F.Ramirez, 19-JUN-1990. FEATURES from to/span description pept / 6 / 59 procollagen alpha2(I) chain, exon 6 (AA at 6) /hgml_locus_uid="LP0002V" /nomgen="COL1A2" /map="7q21.3-q22.1" IVS < 1 5 procollagen alpha2(I) chain intron E IVS 60 > 700 procollagen alpha2(I) chain intron F BASE COUNT 235 a 120 c 101 g 244 t ORIGIN 1 tctagaactt tgctgctcag tatgatggaa aaggagttgg acttggccct ggaccaatgg 61 tatgcttatc tgtttatctt agccaaaaaa attgctaaat aaatcattca ttttatgtca 121 cattttacca cgccatttat ttagctacct aagttaacac tcaatactta gattatataa 181 aaaacaactc tttttgtttt caaatttatg aaaacataag ttaaggagtt cacttttctt 241 tacaaaagaa agattaattg atcttttatg attatatgat ctttttgatt atatgatcct 301 cattaagata gatcatatac ttatgtccga agaaataatc tttcggcaat atgccaaatc 361 aattcgggaa atcaatttaa tttcaaaagt aatccatctg attaattttt catatcttca 421 atcggtagat ttagcaacac tagaggcata gaactattta ttaagttctc tgaacttgtt 481 ggaaaggatc aacaagttct atctagtcca gctaactcat ttttaaaatg ggagagttta 541 agcccttttc tcaaagtcat ccaggtaact aatgacataa ctagaactag atgccaggca 601 agatgtctaa tatttgctta catcatggtt tatgtaccta gtccttgaat aaaccactca 661 tttagtcaac agatattaat cagatgcctt caatgggccc // LOCUS MUSC4P 1739 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse complement C4 5' flank. ACCESSION D90052 KEYWORDS C4; complement C4; promoter. SOURCE Mouse (strain B10.BR) genomic DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1739) AUTHORS Nakayame K Pattanakitsakul,S.-N., Yokoyama,S., Kimura,H., Nosaka,M. and Takahashi,M. TITLE Post-transcriptional regulation of the complement C4 gene in the low C4-producing strain of mouse JOURNAL Immunogenetics 31, 361-367 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Masaru Nonaka Cancer Research Institute, Kanazawa University 13-1 Takaramachi Kanazawa 920 Japan Phone: 0762-62-8151 FEATURES from to/span description site 1691 1691 transcription initiation site BASE COUNT 376 a 464 c 476 g 423 t ORIGIN chromosome 17. 1 gtcaacagtt attgcacatg cttaatgtaa acctgtagcc tctacatgct tgtgtgcctg 61 cccagacagt atatgacacc cacacccaca cccacaccca cacccacccc accccacttc 121 acacccaggc agcagtagtg gacccaacag ggtaagtggg tccctgcttg ggttgcctgt 181 gtttctggaa gctcaaggtc catgcatcag cagtagggga agggagggtt ctcagcccag 241 gctgcatcat catggccgcc taccacctgc aggcacctgg tgaacgctgg ggtcctcact 301 gtccgagatg ctggaagctg gtggctggct gtgcctggag ctgggagatt catcaagtgc 361 tttgttaaag gtatggtgcc tgaggagcaa cacccatggg gggtctttag gtctccaaat 421 ttgactcact tcttcctttt cctggacctt ctcctccagg gcgccaggct gtactgagca 481 tggtgtggaa ggccaagtac tgggagcttg ccctgtcaga gctcctgggc cgcagggccc 541 ctttggcagt gcggctaggt cttgcctacc atgtgcacga cctcattgga gcccagctgg 601 tggactggtg agtctagtcc tgagacttgg gaaacgtgca gggacccagg gttggtgtcc 661 ccctaagtgt tgcttctgtc atcagtgaag ccaagattct ctcattcccg gactcgggag 721 aagactggac taaattctgg atttgcacac tttgggtcta ggcagctggc tccagtcttg 781 tctttctgtt acacagtgtc cccaccactt ctggaaccct ccttcgcctg ccagatacat 841 gaggagccaa tgcttggatt ctgcaactca ctgagtgagg ttcctggaag tgccacccca 901 gggtggctga gcaagtcacc gcagtgggtg ccaggctcta ctgctgcaag ctgggcttct 961 acctgagccg ggctgtgggc attgcagctc ttgcttctgt gcgtgtggag tcaggagccg 1021 tgccaagggg atgagaaggt gggattgcta gagacactgg agcagggagt agaaaactct 1081 gcccttcacg tcaggctgaa attgccaaat aaaatacttg tgcctgtact tctttctgtg 1141 tcctttgttc agtgtggtgt atgtacggct gtgtgaagcc caacttgggg ctgagcattt 1201 catcattgtt tagagagaga acaaggctta ctcaggccaa tttccctgtc tggggtgaga 1261 gtgtacctca tcttctgagc tataggcttt ctgttagagc cataccaagc agaaggctgc 1321 cttgaatatg aggccatcga ctggggaacc cattctagtt ttgttaggac ctttacctgt 1381 cttatctatg tgtcatgttg gagctgaata cacgcacaca tggacaagca catgtattcg 1441 tacacacaca cacacacacg cacacgcaca caaacacaca cacaagggga agtccctggt 1501 actgggaata aacaagcacc agatcaattt tggggccaag atttctacag agaccctggt 1561 gtttgggttg cctccccagc tctgggccta gggccagttc tgtttcttgg ccatcacgtg 1621 gtttcccggc tcatggggca aagggagggg caaggtccag atcaaactct gccccaaccc 1681 taggttgatc agaagggagc agacagtcag accagacagg tctgaccttt cctggatcc // LOCUS BSURPOF 3078 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB), complete cds. ACCESSION M34995 KEYWORDS RNA polymerase; minor sigma-37 factor. SOURCE B.subtilis (strains 168 and Marburg), clones lambda-[21,gt11,1,11,21,31]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 368 to 2297) AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W. TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus subtilis RNA polymerase: Isolation, nucleotide sequence, chromosomal locus, and cryptic function JOURNAL J. Bacteriol. 169, 771-778 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 3078) AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W. TITLE Similar organization of the sigB and spoIIA operons encoding alternate sigma factors of Bacillus subtilis RNA polymerase JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.W.Price, 18-FEB-1987. Draft entry and computer-readable sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990. FEATURES from to/span description pept 200 529 ORF V pept 526 1008 ORF W pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg start codon) pept 1762 2361 ORF X mRNA 131 2368 sigB operon mRNA signal 131 168 sigB operon promoter signal 2368 2401 sigB transcription termination signal binding 188 194 ORF V ribosomal binding site binding 511 519 ORF W ribosomal binding site binding 954 961 sigB ribosomal binding site binding 1750 1753 ORF X ribosomal binding site BASE COUNT 999 a 575 c 755 g 749 t ORIGIN 40 degrees on B.subtilis map. 1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt 61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt 121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg 181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat 241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct 301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat 361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg 421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt 481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct 541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg 601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc 661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg 721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa 781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac 841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat 901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg 961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga 1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg 1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca 1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc 1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg 1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc 1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga 1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg 1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac 1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca 1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct 1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat 1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga 1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta 1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct 1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac 1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc 1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg 2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt 2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc 2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata 2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta 2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat 2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt 2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg 2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac 2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga 2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac 2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca 2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag 2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa 2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt 2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt 2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc 3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg 3061 caaactgctg cgaagctt // LOCUS HUMERPR 1624 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human erythropoietin receptor mRNA, complete cds. ACCESSION M34986 KEYWORDS erythropoietin receptor. SOURCE Human cell line OCI-M1, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1624) AUTHORS Ehrenman,K. and St John,T. TITLE Sequence of the human erythropoietin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.St.John, 07-JUN-1990. Author address: T.St.John Hutchinson Cancer Res Center 1124 Columbia St. Seattle, WA Email: TSTJOHN@FHCRCVAX FEATURES from to/span description pept 1 1527 erythropoietin receptor precursor sigp 1 72 erythropoietin receptor signal peptide matp 73 1524 erythropoietin receptor BASE COUNT 276 a 537 c 498 g 313 t ORIGIN 1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct 61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc 121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg 181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc 241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt 301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta 361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat 421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc 481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac 541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag 601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc 661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg 721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg 781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag 841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac 901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc 961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg 1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc 1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg 1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc 1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct 1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca 1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct 1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta 1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg 1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata 1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc 1621 agga // LOCUS HUMGFP40H 4379 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete cds. ACCESSION M30135 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor; megakeryoblastic leukemia cell growth factor. SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4379) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 12-DEC-1989. FEATURES from to/span description pept 796 909 P40 T-cell and mast cell growth factor precursor, exon 1 1022 1057 P40 T-cell and mast cell growth factor precursor, exon 2 1140 1172 P40 T-cell and mast cell growth factor precursor, exon 3 2461 2592 P40 T-cell and mast cell growth factor precursor, exon 4 4096 4215 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 796 849 P40 T-cell and mast cell growth factor signal peptide matp 850 909 P40 T-cell and mast cell growth factor 1022 1057 P40 T-cell and mast cell growth factor 1140 1172 P40 T-cell and mast cell growth factor 2461 2592 P40 T-cell and mast cell growth factor 4096 4212 P40 T-cell and mast cell growth factor IVS 910 1021 hP40 intron A IVS 1058 1139 hP40 intron B IVS 1173 2460 hP40 intron C IVS 2593 4095 hP40 intron D signal 742 747 TATA box signal 4354 4359 polyA signal rpt 2965 3265 Alu repeat binding 370 376 AP-2 binding site binding 624 629 IRF-1 binding site binding 627 633 AP-1 binding site BASE COUNT 1208 a 921 c 904 g 1346 t ORIGIN 1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt 61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc 121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca 181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct 241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact 301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc 361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag 421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa 481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt 541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt 601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa 661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt 721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc 781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc 841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac 901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat 961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta 1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac 1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg 1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg 1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat 1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt 1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt 1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag 1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa 1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc 1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg 1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc 1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc 1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat 1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac 1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc 1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg 1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt 2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc 2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga 2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc 2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa 2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa 2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt 2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag 2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg 2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac 2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt 2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc 2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc 2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct 2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag 2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa 2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg 3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg 3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat 3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca 3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc 3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt 3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac 3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg 3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc 3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt 3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca 3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca 3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact 3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga 3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt 3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc 3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc 3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg 4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa 4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac 4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg 4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc 4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga 4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa // LOCUS MEFDNABP 1710 bp ds-DNA BCT 27-JUL-1990 DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds. ACCESSION M34778 KEYWORDS DNA binding protein HMfB. SOURCE M.fervidus DNA. ORGANISM Methanothermus fervidus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (sites; for [2]) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. TITLE DNA binding protein HMf, isolated from the hypothermophilic archaeal Methanothermus fervidus is most closely related to histones JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1710) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Sandman, 01-JUN-1990. Author address: K.Sandman Department of Microbiology Ohio State University 484 W. 12th Ave. Columbus, Ohio 43210 E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU FEATURES from to/span description pept 578 787 DNA binding protein HMfB pept 1662 805 (c) ORF285 signal 450 457 promoter signal 523 530 promoter signal 790 814 transcription termination signal BASE COUNT 619 a 263 c 265 g 561 t 2 others ORIGIN 1 bp upstream of HindIII site. 1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa 61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag 121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag 181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg 241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat 301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat 361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa 421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt 481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt 541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat 601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct 661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag 721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa 781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat 841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc 901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt 961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat 1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa 1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg 1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata 1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc 1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc 1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc 1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat 1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc 1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt 1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt 1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa 1681 gttttgtttt acccattgct tctaaatatt // LOCUS MUSINT1A 5607 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds. ACCESSION K02593 M34750 KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene. SOURCE Mouse (BALB/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1098 to 5606) AUTHORS Van Ooyen,A. and Nusse,R. TITLE Structure and nucleotide sequence of the putative mammary oncogene int-1; Proviral insertions leave the protein-encoding domain intact JOURNAL Cell 39, 233-240 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 5607) AUTHORS Nusse,R., Theunisseen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A., Otte,A., Schuuring,E. and Van Ooyen,A. TITLE The promoter of the Wnt-1 (int-1) oncogene and its mechanism of activation by insertion of proviral DNA of the mouse mammary tumor virus JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [2] kindly submitted by R.Nusse, 31-MAY-1990. FEATURES from to/span description pept 1779 1882 Wnt-1 protein (exon 1) 2452 2705 Wnt-1 protein (exon 2) 3279 3544 Wnt-1 protein (exon 3) 4002 4490 Wnt-1 protein (exon 4) IVS 1883 2451 Wnt-1 cds intron A IVS 2706 3278 Wnt-1 cds intron B IVS 3545 4001 Wnt-1 cds intron C site 1133 1134 MMTV insertion site in tumor 17 [1] site 1394 1395 MMTV insertion site in tumor 35 [1] site 1631 1632 MMTV insertion site in tumor 102 [1] site 4495 4496 MMTV insertion site in tumor 53 [1] mRNA 1428 5398 Wnt-1 mRNA (alt.) mRNA 1595 5398 Wnt-1 mRNA (alt.) signal 1562 1566 TATA box signal 1573 1577 TATA box BASE COUNT 1188 a 1681 c 1519 g 1219 t ORIGIN Chromosome 15. 1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc 61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac 121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac 181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa 241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg 301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg 361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac 421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct 481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact 541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac 601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag 661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg 721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga 781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca 841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact 901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc 961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca 1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg 1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga 1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg 1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg 1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc 1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc 1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg 1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt 1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa 1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag 1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact 1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt 1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc 1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg 1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg 1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag 1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct 2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc 2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct 2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt 2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg 2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg 2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg 2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt 2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga 2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat 2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg 2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa 2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc 2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag 2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg 2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg 2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc 3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag 3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag 3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact 3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac 3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat 3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc 3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg 3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc 3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg 3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg 3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag 3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag 3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt 3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca 3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac 3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt 3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc 4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc 4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc 4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg 4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat 4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt 4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc 4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg 4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc 4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg 4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc 4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga 4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc 4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac 4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac 4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc 4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa 4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc 5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc 5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac 5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga 5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac 5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct 5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat 5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca 5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga 5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat 5581 gaccctaaat gagaccaact agggatc // LOCUS MUSP40M 3808 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete cds. ACCESSION M30136 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor. SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210 DNA, clone lambda-S40.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3808) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 21-NOV-1989. FEATURES from to/span description pept 644 757 P40 T-cell and mast cell growth factor precursor, exon 1 846 881 P40 T-cell and mast cell growth factor precursor, exon 2 965 997 P40 T-cell and mast cell growth factor precursor, exon 3 2184 2315 P40 T-cell and mast cell growth factor precursor, exon 4 3393 3512 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 644 697 P40 T-cell and mast cell growth factor signal peptide matp 698 757 P40 T-cell and mast cell growth factor 846 881 P40 T-cell and mast cell growth factor 965 997 P40 T-cell and mast cell growth factor 2184 2315 P40 T-cell and mast cell growth factor 3393 3509 P40 T-cell and mast cell growth factor IVS 758 845 mP40 intron A IVS 882 964 mP40 intron B IVS 998 2183 mP40 intron C IVS 2316 3392 mP40 intron D signal 589 594 TATA box binding 227 233 AP-2 binding site binding 475 480 IRF-1 binding site binding 478 484 AP-1 binding site BASE COUNT 1012 a 903 c 869 g 1024 t ORIGIN Chromosome 13. 1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca 61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt 121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag 181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc 241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga 301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt 361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt 421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga 481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg 541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta 601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat 661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg 721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga 781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac 841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag 901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt 961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc 1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct 1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat 1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga 1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa 1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac 1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac 1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc 1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca 1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta 1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac 1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc 1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct 1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta 1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat 1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg 1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc 1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg 2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa 2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt 2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt 2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa 2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca 2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc 2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag 2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg 2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct 2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag 2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg 2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag 2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat 2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc 2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct 2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc 3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga 3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg 3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga 3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa 3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt 3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc 3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc 3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga 3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg 3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag 3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga 3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac 3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat 3781 aggtaggatg gagcaaagac ttggtaat // LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 27-JUL-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete cds. ACCESSION M29938 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone ZQ30-1. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1105) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 13 777 MHC SLA-DQ alpha chain precursor sigp 13 81 MHC SLA-DQ alpha chain signal peptide matp 82 774 MHC SLA-DQ alpha chain mRNA < 1 1105 MHC SLA-DQ mRNA BASE COUNT 254 a 290 c 255 g 306 t ORIGIN 1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc 61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta 121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag 181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa 241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg 301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact 361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac 421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt 481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc 541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat 601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag 661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc 721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc 781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa 841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc 901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg 961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc 1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1081 aaacaataaa tcacttcatg aggtt // LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 27-JUL-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete cds. ACCESSION M29939 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 69 836 MHC SLA-DQ alpha chain precursor sigp 69 137 MHC SLA-DQ alpha chain signal peptide matp 138 833 MHC SLA-DQ alpha chain BASE COUNT 273 a 313 c 266 g 322 t ORIGIN 1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg 61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga 121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg 181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat 241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta 301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca 361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt 421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca 481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt 541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca 601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata 661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga 721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct 781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca 841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac 901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct 961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga 1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc 1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1141 aaacaataaa tcacttcatg aggtttttgt actg // LOCUS PIGMHDQBA 212 bp ds-DNA MAM 27-JUL-1990 DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end. ACCESSION M29944 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 212) AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F., Gustafsson,K. and LeGuern,C. TITLE Class II genes of miniature swine: II. Molecular identification and characterization of B (beta) genes from SLA-c haplotype JOURNAL Immunogenetics 31, 1-6 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept < 1 12 MHC SLA-DO beta chain BASE COUNT 38 a 58 c 52 g 64 t ORIGIN 1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg 61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc 121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct 181 cttacccttc tgcctttctt cctcctactt gt // LOCUS PP2LEND 266 bp ds-DNA PHG 27-JUL-1990 DEFINITION Bacteriophage P2 left end DNA. ACCESSION M34756 KEYWORDS . SOURCE Bacteriophage P2 DNA. ORGANISM Bacteriophage P2 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 266) AUTHORS Ziermann,R. and Calendar,R. TITLE Characterization of the cos sites of Bacteriophages P2 and P4 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Ziermann, 05-JUN-1990. Author address: R.Ziermann University of California Dept. Mol. Cell Biology Berkeley, CA 94720 Email: calendar@garnet.berkeley.edu FEATURES from to/span description site 1 19 cohesive end of phage DNA site 20 266 left end of phage DNA BASE COUNT 41 a 58 c 99 g 68 t ORIGIN Map position 0.0-0.8%. 1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct 61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt 121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg 181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc 241 gagccagccg tttatctcgc ggatcc // LOCUS RCABCHH 314 bp ds-DNA BCT 27-JUL-1990 DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and F108) genes, 3' end, and 5' end respectively. ACCESSION M34843 KEYWORDS . SOURCE R.capsulata DNA. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 314) AUTHORS Yang,Z. and Bauer,C.E. TITLE Rhodobacter capsulatus genes involved in early steps of the bacteriochlorophyll biosynthetic pathway JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.E.Bauer, 04-JUN-1990. FEATURES from to/span description pept < 1 79 bacteriochlorophyll biosynthesis protein (bchH) (AA at 2) pept 108 > 314 bacteriochlorophyll biosynthesis protein (F108) BASE COUNT 59 a 87 c 119 g 49 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga 61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg 121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg 181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg 241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca 301 agcgcgtgct gcag // LOCUS TREANTAT 999 bp ds-DNA ORG 27-JUL-1990 DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle. ACCESSION M34848 KEYWORDS . SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA. ORGANISM Kinetoplast Trypanosoma evansi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma evansi. REFERENCE 1 (bases 1 to 999) AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S., Steinert,M. and Hamers,R. TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Bajyana Songa, 05-JUN-1990. FEATURES from to/span description site 367 516 kinetoplast minicircle common region (ORI) BASE COUNT 391 a 78 c 203 g 327 t ORIGIN 1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt 61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt 121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg 181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt 241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt 301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat 361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg 421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt 481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag 541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag 601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca 661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat 721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac 781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa 841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg 901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat 961 aaaataatgg ataaaagagt ttataggtga agttgaagt // LOCUS VIBSCRRE 1524 bp ds-DNA BCT 27-JUL-1990 DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds, and sucrose transport (scrA) protein, 5' end. ACCESSION M35009 KEYWORDS Scr repressor protein; sucrose transport protein. SOURCE V.alginolyticus DNA. ORGANISM Vibrio alginolyticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1524) AUTHORS Blatch,G.L. and Woods,D.R. TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr repressor-encoding gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Blatch, 10-JUN-1990. Author address: G.L.Blatch University of Cape Town Dept. Microbiology Private Bag Rondebosch 7700 SOUTH AFRICA Email: bltgre01.uctvax@f4.n494.z5.fidonet.org FEATURES from to/span description pept 36 < 1 (c) sucrose transport protein (scrA) pept 271 555 Scr repressor protein (scrR) pept 791 1090 ORF5 binding 264 267 scrR ribosomal binding site binding 780 785 ORF5 ribosomal binding site signal 178 183 scrA -10 region signal 203 208 scrA -35 region signal 190 195 scrR -10 region signal 162 167 scrR -10 region signal 762 767 ORF5 -35 region signal 742 747 ORF5 -35 region site 78 91 repressor operator site (put.) site 229 242 repressor operator site (put.) site 12 71 ScrR helix-turn-helix motif BASE COUNT 466 a 344 c 321 g 393 t ORIGIN 1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc 61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg 121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt 181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc 241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg 301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa 361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc 421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg 481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt 541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga 601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt 661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac 721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag 781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt 841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga 901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg 961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga 1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta 1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc 1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt 1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt 1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat 1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct 1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca 1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa 1501 gccaattgtg atattgaaga agct // LOCUS XELNRP1A 2431 bp ss-mRNA VRT 27-JUL-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA, complete cds. ACCESSION M34894 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2431) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 74 1117 pot. RNA-binding protein (nrp-1B) mRNA < 1 2431 nrp-1B mRNA signal 2407 2418 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence site 921 941 alternative exon (put.) BASE COUNT 663 a 522 c 543 g 703 t ORIGIN 1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt 61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc 121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact 181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt 241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa 301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt 361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt 421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga 481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac 541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa 601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt 661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat 721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc 781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc 841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc 901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag 961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag 1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc 1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct 1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct 1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga 1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat 1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg 1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat 1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta 1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa 1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat 1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac 1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg 1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc 1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat 1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc 1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga 1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct 2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg 2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc 2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat 2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg 2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca 2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata 2401 gttcaaaata aactatgaat tttattcatt t // LOCUS XELNRP1B 2324 bp ss-mRNA VRT 27-JUL-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA, complete cds. ACCESSION M34895 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2324) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 29 1051 pot. RNA-binding protein (nrp-1B) mRNA < 1 2324 nrp-1B mRNA signal 2306 2311 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence BASE COUNT 647 a 467 c 520 g 690 t ORIGIN 1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc 61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac 121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt 181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca 241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga 301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat 361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca 421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg 481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt 541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc 601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat 661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag 721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac 781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg 841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc 901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat 961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat 1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg 1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct 1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat 1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta 1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa 1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa 1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct 1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa 1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc 1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact 1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg 1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt 1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct 1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag 1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag 1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt 1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta 2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac 2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt 2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat 2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa 2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc // LOCUS YSJSACDHY 2569 bp ds-DNA PLN 27-JUL-1990 DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds. ACCESSION M34929 KEYWORDS saccharopine dehydrogenase. SOURCE Y.lipolytica (strain W29) DNA. ORGANISM Yarrowia lipolytica Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2569) AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and Gaillardin,C. TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia lipolytica JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.M.Fournier, 06-JUN-1990. ORF1 is probably not translated, since a lacZ fusion/expression experiment did not produce results. FEATURES from to/span description pept 649 1929 ORF1 pept 1764 655 (c) saccharopine dehydrogenase mRNA 582 2060 ORF1 mRNA (alt.) mRNA 598 2060 ORF1 mRNA (alt.) mRNA 599 2060 ORF1 mRNA (alt.) mRNA 602 2060 ORF1 mRNA (alt.) mRNA 606 2060 ORF1 mRNA (alt.) mRNA 609 2060 ORF1 mRNA (alt.) mRNA 1807 639 (c) ORF2 mRNA (alt.) mRNA 1809 639 (c) ORF2 mRNA (alt.) mRNA 1811 639 (c) ORF2 mRNA (alt.) mRNA 1812 639 (c) ORF2 mRNA (alt.) mRNA 1814 639 (c) ORF2 mRNA (alt.) signal 521 528 TATA box signal 565 575 TATA box BASE COUNT 668 a 573 c 666 g 662 t ORIGIN 1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag 61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt 121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca 181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa 241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag 301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc 361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc 421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt 481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca 541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt 601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc 661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt 721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc 781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc 841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac 901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt 961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca 1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt 1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc 1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt 1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata 1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc 1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc 1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct 1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg 1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg 1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc 1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt 1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag 1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg 1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag 1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg 1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga 1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa 2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca 2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg 2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc 2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct 2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag 2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga 2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat 2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat 2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc // LOCUS HUMMHDQ1A 1515 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human MHC class II HLA-DQ-alpha gene, regulatory region. ACCESSION M33765 KEYWORDS cell surface antigen; cell surface glycoprotein; class II gene; major histocompatibility complex. SOURCE Human (DR4, DR4) DNA, clone H11A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1515) AUTHORS Ombra,M.N., Del Pozzo,G., Perfetto,C., Maffe,A. and Guardiola,J. TITLE Effect of the AIR-I locus on the activation of an enhancerless HLA-DQA1 promoter JOURNAL Immunogenetics 31, 368-376 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Guardiola, 18-APR-1990. FEATURES from to/span description pre-msg 655 > 1515 HLA-DQA1 mRNA and introns /hgml_locus_uid="LY0151Z" /nomgen="HLA-DQA1" /map="6p21.3" IVS 785 > 1515 HLA-DQA1 intron A BASE COUNT 441 a 322 c 340 g 412 t ORIGIN Chromosome 6p21.3. 1 atataactga tgacctcatc tatccatgag gctgctcacg aaatgccctc ccctggccag 61 gcacggtggc tcatgcctgt aatccaagca ctttgggagg ctgaggcagg cagatcatga 121 ggtcaggagt tcaagaccag cgtggccaac atagtgaaac tctgtctcta ctaaaaatac 181 agaattagcc gggcatggtg gcagcgctgc agtcaccagc tactcgggag ggtgaggcag 241 gagaatcgct tgaacctggg aggcaaaggt tgcagtgagc cgagaccatg ccattgtacc 301 tcagcctggg tgacagagtg agactacgtc tcgaaaagaa aagaaaaaga aaagaaaaag 361 aaaaaaaaag aaaaatgccc atccctcttg cgaatggcag acatgcacac accagagaag 421 attccaattt agtgtcctcc ctctgttcat agaacaattc ctcaagtcca ctctgagtag 481 aggctgcatc acaacaaggg gattgccctg tctccttcca gggctcttaa tacaaactct 541 tcagctagta actgagatgt caccatgggg gatttttcta attggccaaa acctgacctg 601 gcagggtttg gtttgggtgt cttcagattt ccttgtcttg aggccctcac aattgctcta 661 cagctcagaa cagcaactgc tgaggctgcc ttgggaagag gatgatccta aacaaagctc 721 tggatgctgg gggccctcgc cctgaccacc gtgatgagcc cttgtggagg tgaagacatt 781 gtgggtgagt gcatgagtga ggaaatgttc tctggagctg aaaaacagta aattgaagga 841 aaagagagaa agcgatttgc agagaaattg tagagatttc ctaagaccct ttcactatta 901 agagaaatta aaaattatag ctgttcctcc ttcaggaaac cagagaccca atccactctt 961 tttgttatgt atgcttttgt gttcactaag gatgctattc tgtttatatt atattcagtg 1021 acaacagcct ggaggtctct atgtcgttcc gtcatgattg cctcaaaaat tagtgaagtt 1081 tccatcagtg gaataatttt ttattattaa aaatgtatga agtgtcattc tcaaatttcc 1141 ctgaacaact tttgaagctt ttcgtatgtc tcctgtagta gatcttgggg tcgttccatc 1201 aattatatac tctatagata ttaaaaaagt tgcccgtttc tttctctcag acttactcac 1261 atttccacat gggaactggc acaggtgggg agtgggtaaa ggagtccagc aggctgaatg 1321 ccttcaacaa tcattttacc acatggtcct cacttactct cagctgcctc atatgtgtca 1381 cctcacaaaa taatcaaaat aaaaatgggc atgtagctaa gctttgtaaa tagtgaaaac 1441 atggaatgtc aattgttttt acatatttct attacaggta tagcttcaca tttttcttta 1501 gcaaaataag ggatc // LOCUS BSUAMYEA 727 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.subtilis alpha-amylase (amyE) gene, 5' end. ACCESSION M35517 KEYWORDS alpha-amylase. SOURCE B.subtilis (strain 2633) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 727) AUTHORS Emori,M., Tojo,T. and Maruo,B. TITLE Molecular cloning and expression of an alpha-amylase gene from an alpha-amylase extrahyper producing Bacullis subtilis JOURNAL Agric. Biol. Chem. 52, 399-406 (1988) STANDARD simple staff_review FEATURES from to/span description pept 599 > 727 alpha-amylase structural protein (amyE) BASE COUNT 207 a 125 c 179 g 216 t ORIGIN 1 gatcatccgc ggcattatgt ttgaatttcc gtttaaagaa tgggctgcaa gcttgttcat 61 cattatctta tattactgca tcagggctgc ggcatccgga atgctcatgc cgagaataga 121 caccaaagaa gaactgcaaa aacgggtgaa gcagcagcga atagaatcag ttgcggtcgc 181 ctttgcggta gtggtgctta cgatgtacga cagggggatt ccccatacat tcttcgcttg 241 gctgaaaatg attcttcttt ttatcgtctg cggcggcgtt ctgtttctgc ttcggtatgt 301 gattgtgaag ctggcttaca gaagagcggt aaaagaagaa ataaaaaaga aatcatcttt 361 tttgtttgga aagcgaggga agcgtacaca gtctcgggca gtttttttat aggaacattg 421 atttgtattc actctgccaa gttgttttga tagagtgatt gtgataattt aaaatgtaag 481 tgttaacaaa attctccagt cttcacatca gtttgaaagg aggaagcgga agaatgaagt 541 aagagggatt tttgactccg aagtaagtct tcaaaaaatc aaataaggag tgtcaagaat 601 gtttgcaaaa cgattcaaaa cctctttact gccgttattc gctggatttt tattgctgtt 661 ttatttggtt ctggcaggac cggcggctgc gagtgctgaa acggcgaaca aatcgaatga 721 gcttaca // LOCUS CHKCKMB 274 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken b-creatine kinase gene, 3' end. ACCESSION M35381 KEYWORDS B-creatine kinase. SOURCE Chicken brain DNA, clone CK-B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 274) AUTHORS Kwiatkowski,R.W., Ehrismann,R., Schweinfest,C.W. and Dottin,R.P. TITLE Accumulation of creatine kinase mRNA during myogenesis: Molecular cloning of a b-creatine kinase cDNA JOURNAL Dev. Biol. 112, 84-88 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 51 M-creatine kinase (AA at 1) 97 243 M-creatine kinase BASE COUNT 60 a 81 c 75 g 58 t ORIGIN 1 ctgcagaaac gaggcacagg tggtgtggac acagctgcag ccgaagcctc ttgaggagcc 61 cccccccccc cccccccccc tgcagtcgga ggggtgtttg atgtatccaa tgctgatcgt 121 cttggcttct ctgaggtaga gctggtgcag atggtggtag atggtgtgaa gctgctcatt 181 gaaatggaaa aacgccttga gaaaggccag tccattgatg acctcatgcc agctcagaaa 241 taaagcactt tattctcccc ccgcccccct gcag // LOCUS CHKCMYBA1 172 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken c-myb gene, exon 1. ACCESSION M35506 KEYWORDS c-myb protein. SEGMENT 1 of 4 SOURCE Chicken hematopoietic cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 172) AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B. TITLE Organization of 5'-proximal c-myb exons in chicken DNA: Implications for c-myb tissue-specific transcription JOURNAL FEBS Lett. 232, 227-234 (1988) STANDARD simple staff_review FEATURES from to/span description pept 140 + 162 thymic c-myb protein, exon 1 IVS 163 > 172 thymic c-myb protein intron A BASE COUNT 16 a 73 c 62 g 21 t ORIGIN 1 aacctgtttc ctcctcctcc tcccgcggct cctcggcact tcgctgcgcg gcggcggagc 61 gcctgggcgc agcccggcag cagggcgcgg cggcgggcgc gggctccccc gtcctcttcc 121 cccagcggcc gccgcgagga tggcccggag accccggcac aggtaacggg gc // LOCUS CHKCMYBA2 290 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken c-myb gene, exon 2. ACCESSION M35507 KEYWORDS c-myb protein. SEGMENT 2 of 4 SOURCE Chicken hematopoietic cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 290) AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B. TITLE Organization of 5'-proximal c-myb exons in chicken DNA: Implications for c-myb tissue-specific transcription JOURNAL FEBS Lett. 232, 227-234 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 163 + 280 thymic c-myb protein, exon 2 pep$ 200 + 280 yolk sac c-myb protein, exon 1 IVS < 1 162 thymic c-myb protein intron A IVS 281 > 290 thymic c-myb protein intron B IVS 281 > 290 yolk sac c-myb protein intron A BASE COUNT 78 a 59 c 71 g 82 t ORIGIN 1 aatactagaa caacagaatg cagaacaaaa tcttgttgtg caagttttaa aacttttatc 61 tgtgtaacct ttgtaaagat tgtgtgtttt ctgtgtgctg tgtaattaag tagctccagt 121 cctcacgttc cgctctgtgt ctcgctgtgt ttcctgctgc agcatataca gcagcgatga 181 cgatgaagaa gatgttgaga tgtacgacca cgattacgac ggcctgcttc ctaaggctgg 241 gaaacgtcac ctagggaaaa ccaggtggac ccgtgaagag gtaattgccg // LOCUS CHKCMYBA3 92 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken c-myb gene, exon 3. ACCESSION M35508 KEYWORDS c-myb protein. SEGMENT 3 of 4 SOURCE Chicken hematopoietic cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 92) AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B. TITLE Organization of 5'-proximal c-myb exons in chicken DNA: Implications for c-myb tissue-specific transcription JOURNAL FEBS Lett. 232, 227-234 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 82 thymic c-myb protein, exon 3 pep$ + 11 + 82 yolk sac c-myb protein, exon 2 IVS < 1 10 thymic c-myb protein intron B IVS 83 > 92 thymic c-myb protein intron C IVS < 1 10 yolk sac c-myb protein intron A IVS 83 > 92 yolk sac c-myb protein intron B BASE COUNT 32 a 16 c 24 g 20 t ORIGIN 1 aatgatccag gatgagaaac tgaagaaact tgtggaacag aatggcacag aagactggaa 61 agtcattgcc agtttccttc ctgtaggtag ac // LOCUS CHKCMYBA4 34 bp ds-DNA VRT 27-JUL-1990 DEFINITION Chicken c-myb gene, exon 4. ACCESSION M35509 KEYWORDS c-myb protein. SEGMENT 4 of 4 SOURCE Chicken hematopoietic cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 34) AUTHORS Soret,J., Vellard,M., Martinerie,C. and Perbal,B. TITLE Organization of 5'-proximal c-myb exons in chicken DNA: Implications for c-myb tissue-specific transcription JOURNAL FEBS Lett. 232, 227-234 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 > 34 thymic c-myb protein, exon 4 pep$ + 11 > 34 yolk sac c-myb protein, exon 3 IVS < 1 10 thymic c-myb protein intron C IVS < 1 10 yolk sac c-myb protein intron B BASE COUNT 10 a 6 c 8 g 10 t ORIGIN 1 atttcattag aatcggacag atgttcagtg ccag // LOCUS MUSFOS 3673 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse c-fos gene; cellular homolog to viral oncogene. ACCESSION J00370 KEYWORDS fos oncogene; oncogene. SOURCE Mouse DNA, clones pc-fos(mouse)-1 and lambda-c-fos(mouse)-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3548) AUTHORS Van Beveren,C., van Straaten,F., Curran,T., Mueller,R. and Verma,I.M. TITLE Analysis of FBJ-MuSV provirus and c-fos (mouse) gene reveals that viral and cellular fos gene products have different carboxy termini JOURNAL Cell 32, 1241-1255 (1983) STANDARD full staff_review REFERENCE 2 (bases 2799 to 3673) AUTHORS Meijlink,F., Curran,T., Miller,A.D. and Verma,I.M. TITLE Removal of a 67-base-pair sequence in the noncoding region of protooncogene fos converts it to a transforming gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 4987-4991 (1985) STANDARD simple staff_review COMMENT A TATA box was found at bp 101-106 and a poly-A signal at 3493-3498. [1] also reported the proviral FBJ-MuSV sequence, which is found in the viral section of the data base. FEATURES from to/span description pept 284 424 c-fos protein exon 1 1179 1430 c-fos protein exon 2 1836 1943 c-fos protein exon 3 2061 2702 c-fos protein exon 4 pre-msg 133 > 2702 c-fos protein mRNA IVS 425 1178 c-fos protein intron A IVS 1431 1835 c-fos protein intron B IVS 1944 2060 c-fos protein intron C variant 2856 2856 g in [1]; c in [2] variant 3135 3136 gt in [1]; gtt in [2] BASE COUNT 817 a 987 c 974 g 895 t ORIGIN 997 bp upstream of EcoRI site. 1 gagttgacga cagagcgccc gcagagggcc ttggggcgcg cttccccccc cttccagttc 61 cgcccagtga cgtaggaagt ccatccattc acagcgcttc tataaaggcg ccagctgagg 121 cgcctactac tccaaccgcg actgcagcga gcaactgaga agactggata gagccggcgg 181 ttccgcgaac gagcagtgac cgcgctccca cccagctctg ctctgcagct cccaccagtg 241 tctacccctg gaccccttgc cgggctttcc ccaaacttcg accatgatgt tctcgggttt 301 caacgccgac tacgaggcgt catcctcccg ctgcagtagc gcctccccgg ccggggacag 361 cctttcctac taccattccc cagccgactc cttctccagc atgggctctc ctgtcaacac 421 acaggtgagt ttggctttgt gtagccgcca ggtccgcgct gagggtcgcc gtggaggaga 481 cactggggtg tgactcgcag gggcgggggg gtcttccttt ttcgctctgg agggagactg 541 gcgcggtcag agcagcctta gcctgggaac ccaggacttg tctgagcgcg tgcacacttg 601 tcatagtaag acttagtgac cccttcccgc gcggcaggtt tattctgagt ggcctgcctg 661 cattcttctc tcggccgact tgtttctgag atcagccggg gccaacaagt ctcgagcaaa 721 gagtcgctaa ctagagtttg ggaggcggca aaccgcggca atcccccctc ccggggcagc 781 ctggagcagg gaggagggag gagggaggag ggtgctgcgg gcgggtgtgt aaggcagttt 841 cattgataaa aagcgagttc attctggaga ctccggagca gcgcctgcgt cagcgcagac 901 gtcagggata tttataacaa accccctttc gagcgagtga tgccgaaggg ataacgggaa 961 cgcagcagta ggatggagga gaaaggctgc gctgcggaat tcaagggagg atattgggag 1021 agcttttatc tccgatgagg tgcatacagg aagacataag cagtctctga ccggaatgct 1081 tctctctccc tgcttcatgc gacactaggg ccacttgctc cacctgtgtc tggaacctcc 1141 tcgctcacct ccgctttcct ctttttgttt tgtttcagga cttttgcgca gatctgtccg 1201 tctctagtgc caactttatc cccacggtga cagccatctc caccagccca gacctgcagt 1261 ggctggtgca gcccactctg gtctcctccg tggccccatc gcagaccaga gcgccccatc 1321 cttacggact ccccacccag tctgctgggg cttacgccag agcgggaatg gtgaagaccg 1381 tgtcaggagg cagagcgcag agcatcggca gaaggggcaa agtagagcag gtgagcagcg 1441 attctggacc tttgtgggct gggggggggg gggggggcgg agactgacgc acagaccaca 1501 caacagagaa gggacgctac tgactgcact tcctgaccag gagctgtggc tgctagccct 1561 ttccctccct tgtcagattt tgacagttgg acccaagaca aactctagac agtttccctg 1621 acagcttcct acttcattct ctagccgggg agcttctttg ttcccctgct aaagatctca 1681 ctttaaatgc aaatcacact ctgcctgcca actgcaggtt agaaaaactg cttcaccgag 1741 aggtgcgggt gctgtaggag ccagtttcac tggggtgact gaatggaggt gacactagac 1801 aaccttaact gaatgttggt ccttttcttc tatagctatc tcctgaagag gaagagaaac 1861 ggagaatccg aagggaacgg aataagatgg ctgcagccaa gtgccggaat cggaggaggg 1921 agctgacaga tacactccaa gcggtaggtt gaaccagctg ctgctcctga aactttatta 1981 aagttggagc ttgggactat gggcgcaggg tccttgagca tgcccgtgtc ttatgctttc 2041 ttatatctct ccctatgcag gagacagatc aacttgaaga tgagaagtct gcgttgcaga 2101 ctgagattgc caatctgctg aaagagaagg aaaaactgga gtttattttg gcagcccacc 2161 gacctgcctg caagatcccc gatgaccttg gcttcccaga ggagatgtct gtggcctccc 2221 tggatttgac tggaggtctg cctgaggctt ccaccccaga gtctgaggag gccttcaccc 2281 tgccccttct caacgaccct gagcccaagc catccttgga gccagtcaag agcatcagca 2341 acgtggagct gaaggcagaa ccctttgatg acttcttgtt tccggcatca tctaggccca 2401 gtggctcaga gacctcccgc tctgtgccag atgtggacct gtccggttcc ttctatgcag 2461 cagactggga gcctctgcac agcaattcct tggggatggg gcccatggtc acagagctgg 2521 agcccctgtg tactcccgtg gtcacctgta ctccgggctg cactacttac acgtcttcct 2581 ttgtcttcac ctaccctgaa gctgactcct tcccaagctg tgccgctgcc caccgaaagg 2641 gcagcagcag caacgagccc tcctccgact ccctgagctc acccacgctg ctggccctgt 2701 gagcagtcag agaaggcaag gcagccggca tccagacgtg ccactgcccg agctggtgca 2761 ttacagagag gagaaacacg tcttccctcg aaggttcccg tcgacctagg gaggacctta 2821 cctgttcgtg aaacacacca ggctgtgggc ctcaaggact tgcaagcatc cacatctggc 2881 ctccagtcct cacctcttcc agagatgtag caaaaacaaa acaaaacaaa acaaaaaacc 2941 gcatggagtg tgttgttcct agtgacacct gagagctggt agttagtaga gcatgtgagt 3001 caaggcctgg tctgtgtctc ttttctcttt ctccttagtt ttctcatagc actaactaat 3061 ctgttgggtt cattattgga attaacctgg tgctggattg tatctagtgc agctgatttt 3121 aacaatacct actgtgttcc tggcaatagc gtgttccaat tagaaacgac caatattaaa 3181 ctaagaaaag ataggacttt attttccagt agatagaaat caatagctat atccatgtac 3241 tgtagtcctt cagcgtcaat gttcattgtc atgttactga tcatgcattg tcgaggtggt 3301 ctgaatgttc tgacattaac agttttccat gaaaacgttt ttattgtgtt ttcaatttat 3361 ttattaagat ggattctcag atatttatat ttttatttta tttttttcta ccctgaggtc 3421 tttcgacatg tggaaagtga atttgaatga aaaattttaa gcattgtttg cttattgttc 3481 caggacattg tcaataaaag catttaagtt gaatgcgacc accttcttgc tctctttatt 3541 ctcagttttg tatggtttca ggaaggcctc tgaggagacc agtttgtcaa gatgggtggg 3601 tcctggaggg gaacacgccc tctgtcccct tgtcactcag aggacacgta gttcagggta 3661 ttttacagat gtg // LOCUS MUSIGHACX 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW. ACCESSION M34880 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone C10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 83 a 77 c 72 g 80 t ORIGIN Chromosome 6. 1 gacattgtga tgacacagtc tccatcctcc ctggctatgt cagtaggaca gaaggtcact 61 atgaggtgca agtccagtca gagcctttta aatagtagca gtcaaaagaa ctatttggcc 121 tggtaccagc agaaaccagg acagtctcct aaacttctgg tatactttgc atccactagg 181 gaatctgggg tccctgatcg cttcataggc agtggatctg ggacagattt cactcttacc 241 atcagcagtg tgcaggctga agacctggca gattacttct gtcagcaaca ttatagcact 301 ccgtacacgt tc // LOCUS MUSIGHACY 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW. ACCESSION M34883 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 80 a 78 c 68 g 83 t 3 others ORIGIN Chromosome 6. 1 gatgttttga tgacccaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc 61 atctcttgca gatctagtca gaacattgta catactnnna atggaaacac ctatttagaa 121 tggtacctgc agaaaccagg ccagtctcca aagctcctga tctacaaagt ttccaaccga 181 ttttctgggg tcccagacag gttcagtggc agtggatcag ggacagattt cacactcaag 241 atcagcagag tggaggctga ggatctggga gtttattact gctttcaagg ttcacatgtt 301 cctccgacgt tc // LOCUS MUSIGHACZ 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW. ACCESSION M34884 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 78 a 76 c 72 g 83 t 3 others ORIGIN Chromosome 6. 1 gatgttttga tgacccaaac tccactctcc ctgcctgtca gtcttggaga tcaagcctcc 61 atctcttgca gattcagtca gagcattgta catactnnna atggaaacac ctatttagaa 121 tggtacctgc agaaaccagg ccagtctcca aagctcctga tctacagagt ttccaaccga 181 ttttctgggg tcccagacag gttcagtggc agtggatcag ggacagattt cacactcaag 241 atcagcagag tggaggctga ggatctggga gtttattact gctttcaagg ttcacatgtt 301 ccgtggacgt tc // LOCUS MUSIGHADA 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma C3H.SW. ACCESSION M34885 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CR1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 69 a 75 c 78 g 87 t 3 others ORIGIN Chromosome 6. 1 gatattgtga tgactcaggc tgcaccctct gtacctgtca ctcctggaga gtcagtatcc 61 atctcctgca ggtctagtaa gagtctcctg catagtnnna atggcaacac ttacttgtat 121 tggttcctgc agaggccagg ccagtctcct cagctcctga tatatcggat gtccaacctt 181 gcctcaggcg tcccagacag gttcagtggc agtgggtcag gaactgcttt cacactgaga 241 atcagtagag tggaggctga ggatgtgggt gtttattact gtatgcaaca tctagaatat 301 cctttgacgt tc // LOCUS MUSIGHADB 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma A/J. ACCESSION M34886 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma A/J, cDNA to mRNA, clone AF1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 86 a 73 c 62 g 73 t 18 others ORIGIN Chromosome 6. 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gggcaagtca gnnnnnnnnn nnnnnnnnng acattagcaa ttatttaaac 121 tggtatcagc agaaaccaga tggaactgtt aaactcctga tctactacac atcaagatta 181 cactcaggag tcccatcaag gttcagtggc agtgggtctg gaacagatta ttctctcacc 241 attagcaacc tggagcaaga agatattgcc acttactttt gccaacaggg taatacgctt 301 ccgtggacgt tc // LOCUS MUSIGHADC 312 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig rearranged kappa-chain mRNA V-J-region hybridoma A/J. ACCESSION M34887 KEYWORDS immunoglobulin light chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse hybridoma A/J, cDNA to mRNA, clone AF2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 312 Ig kappa-chain V-J-region (AA at 1) recomb 303 304 V-region end/J-region start BASE COUNT 87 a 79 c 70 g 76 t ORIGIN Chromosome 6. 1 gacattgtga tgacacagtc tccatcctcc ctgactgtca cagcaggaga gaaggtcact 61 atgagttgca agtccagtca gagtctgtta aacagtggaa atcaaaacaa ctacttgacc 121 tggtaccagc agaaaccagg gcagcctcct aaactgttaa tctactgggc atccactaga 181 gaatctgggg tccctgatcg cttcacaggc agtggatctg gaacagattt cactctcacc 241 atcagcagtg tgcaggctga agacctggca gtttattact gtcagaatga ttatcgttat 301 ccgctcacat tc // LOCUS MUSIGHADG 327 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW. ACCESSION M34888 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone C10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 327) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 327 Ig H-chain V-D-J-region (AA at 1) recomb 294 295 V-region end/D-region start recomb 297 298 D-region end/J-region start BASE COUNT 87 a 78 c 87 g 75 t 0 others ORIGIN Chromosome 12. 1 gaggtccagc tgcaacagtc tggacctgag ctggtgaagc ctggggattc agtgaagatg 61 tcctgcaagg cttctggcta cacattcact gactcctaca tggactgggt gaagcagagc 121 cctggaaaga gccttgagtg gattggatat atttatccta acaatggtgg tactaactac 181 aatcagaagt tcaagggcaa ggccacattg actgtagaca agtcctccag cacagcctac 241 atggaactcc acagcctgac atctgaggac tctgcagtct attactgtgc aagatggact 301 tactggggcc aagggactct ggtcact // LOCUS MUSIGHADH 345 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW. ACCESSION M34889 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 345) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 345 Ig H-chain V-D-J-region (AA at 1) recomb 293 294 V-region end/D-region start recomb 309 310 D-region end/J-region start BASE COUNT 81 a 93 c 86 g 85 t 0 others ORIGIN Chromosome 12. 1 cagcctgggg ctgagcttgt gaagcctggg acttctgtga agttgtcctg caggccttct 61 ggcttcacct acaccgactt ctggattcac tgggtgaaac agaggcctgg acaaggcctt 121 gagtggattg gagagattaa tcctagcaat ggtggtacta gctacaatga gaagttcaag 181 accaaggcca cactgactgc agacaaatcc tccagcacag tccttacatg caactcaacg 241 agcctgacat ctgacgactc tgcggtctat ttctgtacaa gaccccctct ctcctatggt 301 aacttcgcct ggtttactta ctggggccaa gggactctgg tcact // LOCUS MUSIGHADI 264 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW. ACCESSION M34890 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CM4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 264) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 264 Ig H-chain V-D-J-region (AA at 1) recomb 223 224 V-region end/D-region start recomb 230 231 D-region end/J-region start BASE COUNT 70 a 68 c 67 g 59 t 0 others ORIGIN Chromosome 12. 1 ttcactacct actggatgca ctgggtgaag cagaggcctg gacaaggcct tgagtggatc 61 ggagagattg atccttctga tacttatact taccacaatc aaaacttcaa gggcaaggcc 121 acattgactg tagacaaatc ctccagcaca gcctacatgc aactcagcag cctgacatct 181 gaggactctg cggtctatta ctgtgcaaga aggtcgatgg gccactacgg gcttgcatac 241 tggggccaag ggactctggt cact // LOCUS MUSIGHADJ 329 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma C3H.SW. ACCESSION M34891 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone CR1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 329) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 329 Ig H-chain V-D-J-region (AA at 1) recomb 273 274 V-region end/D-region start recomb 294 295 D-region end/J-region start BASE COUNT 80 a 84 c 82 g 83 t ORIGIN Chromosome 12. 1 ttagtgaagc ttggagggtc cctgcaactc tcctgtgcag cctctggatt cactttcagt 61 agctattaca tgtcttgggt tcgccagact cctgagaaga ggctggagtg ggtcgcaacc 121 attagtaatg gtggtgttag gcacctacta tccagacagt gtgaaggacc gcttcacatc 181 tccagagaca atgccaggaa caccctctac ctgcgaatga gcagtctgaa ttctgaggac 241 acagccgtgt attactgtgc aagagatgga tcttatggta actaccacgt accttacttt 301 gactactggg gccaaggcac cactctcac // LOCUS MUSIGHADK 271 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma A/J. ACCESSION M34892 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone AF1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 271) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 271 Ig H-chain V-D-J-region (AA at 2) recomb 226 227 V-region end/D-region start recomb 238 239 D-region end/J-region start BASE COUNT 69 a 63 c 73 g 66 t 0 others ORIGIN Chromosome 12. 1 attcactttc agtagctatg ccatgtcttg ggttcgccag actccagaga agaggctgga 61 gtgggtcgca tccattaata gtggtggtag tacctactat ccagacagtg tgaagggccg 121 attcaccatc tccagagata atggcaggaa caccctgtac ctgcaaatga gcagtctgag 181 gtctgaggac acggccatgt attattgtgg aagaggcagt tacccctatt actatggtct 241 ggactactgg ggtcaaggaa cctcagtcac c // LOCUS MUSIGHADL 348 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig H-chain mRNA V-D-J-region hybridoma A/J. ACCESSION M34893 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma C3H.SW, cDNA to mRNA, clone AF2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Andria,M.L., Levy,S. and Benjamini,E. TITLE Diverse V-H and V-L genes are used to produce antibodies against a defined protein epitope JOURNAL J. Immunol. 144, 2614-2619 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 348 Ig H-chain V-D-J-region (AA at 1) recomb 303 304 V-region end/D-region start recomb 315 316 D-region end/J-region start BASE COUNT 86 a 80 c 99 g 83 t 0 others ORIGIN Chromosome 12. 1 gaagtgaagc tggtggagtc tgggggaggc ttagtgacgc ctggagggtc cctgaaactc 61 tcctgtgcag cctctggatt cactttcagt agctatgcca tgtcttgggt tcgccagact 121 ccagagaaga ggctggaatg ggtcgcatcc attaataatg gtggtaatac ctactatcca 181 gacagtgtga agggccgatt caccatctcc agagataatg gcaggaacac cctgtacctg 241 caaatgagca gtctgaggtc tgaggacacg gccatgtatt actgtggaag aggcagttat 301 ccctattact atggtctaga ctactggggt caaggaacct cagtcacc // LOCUS NEUMTNDCO 839 bp ds-DNA ORG 27-JUL-1990 DEFINITION N.crassa mitochondrial pseudo-NADH dehydrogenase (ND5) gene, 3' end, and cytochrome oxidase subunit 2 (co2) gene, 5' end. ACCESSION M35513 KEYWORDS NADH dehydrogenase; cytochrome oxidase subunit 2; pseudogene. SOURCE N.crassa (strain RL3-8A) stopper mutant ER-3 mitochondrial DNA. ORGANISM Mitochondrion Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae; Neurospora crassa. REFERENCE 1 (bases 1 to 839) AUTHORS Almasan,A. and Mishra,N.C. TITLE Molecular characterization of the mitochondrial DNA of a new stopper mutant ER-3 of Neurospora crassa JOURNAL Genetics 120, 935-945 (1988) STANDARD simple staff_review FEATURES from to/span description pept.ps > 1 270 pseudo-NADH dehydrogenase (ND5; AA at 1) pept 645 > 839 cytochrome oxidase subunit 2 (co2) BASE COUNT 313 a 124 c 119 g 283 t ORIGIN 1 tgtatgaaat atttagctga aatattagga ggtaaaaaac atttattaaa aagttacgaa 61 ggttacaatg taacagttaa taccactaaa ttatctccta ttgtacaata ttttaatctt 121 tatcctttaa aaactaaaaa gtatattaca tattttaact gaataaaaat atataagtta 181 gtaatagata aaaaacataa tgatcctgaa aatttattat tgattacgaa atataaaaat 241 aacattaata aatctgatta taataaataa atcaactttc ggcggacccc gccccttcgg 301 ggcgggggcc cgtttttagt aataactaaa aattaaacac agctacataa atgaagatat 361 agtccgatca gttaagtaat taactgcgta ttccacccct cccttccccc gccccgaagg 421 cgcccccgaa aaaacaaagg acgctagcgt agctaacctt atgagcagca ggctataaaa 481 acccattagg cacagggtat acttcgttag tggataagaa ttagcctaag ccttaatata 541 caatacagtt taatgttata tttaaattct atatttatga aaagatataa tatatttatt 601 attatggttt aatattcaat atagaatatt ctatttcttt ttttatggga ttattattta 661 ataatttaat tatgaatttt gatgctccaa gcccatgagg tatctatttt caagatagtg 721 ctactcctca gatggaaggt ttagttgaat tacatgacaa tattatgtac tatctggtgg 781 taatactatt tgttgtagga tgaatcttat tatctataat aagaaattat atcagtaca // LOCUS PUMCHSA1 90 bp ds-DNA PLN 27-JUL-1990 DEFINITION P.crispum chalcone synthase gene, 5' flank. ACCESSION M35515 KEYWORDS chalcone synthasse. SEGMENT 1 of 2 SOURCE P.crispum (strain Mooskrause) DNA, clone PcCHS-1. ORGANISM Petroselinum crispum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Apiales; Apiaceae. REFERENCE 1 (bases 1 to 90) AUTHORS Herrmann,A., Schulz,W. and Hahlbrock,K. TITLE Two alleles of the single-copy chalcone synthase gene in parsley differ by a transposon-like element JOURNAL Mol. Gen. Genet. 212, 93-98 (1988) STANDARD simple staff_review FEATURES from to/span description rpt 36 43 5' insertion target sequence rpt 44 58 5' inverted terminal repeat trns 44 > 90 transposon-like sequence BASE COUNT 29 a 14 c 23 g 24 t ORIGIN 1 tgaaaatgca tgttacaaac gttccctaat aatatagtag aagtagggtg taaatgagtc 61 gatacgctca cgaacagttc ggtgttcggg // LOCUS PUMCHSA2 732 bp ds-DNA PLN 27-JUL-1990 DEFINITION P.crispum chalcone synthase gene, exon 1. ACCESSION M35516 KEYWORDS chalcone synthasse. SEGMENT 2 of 2 SOURCE P.crispum (strain Mooskrause) DNA, clone PcCHS-1. ORGANISM Petroselinum crispum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Apiales; Apiaceae. REFERENCE 1 (bases 1 to 732) AUTHORS Herrmann,A., Schulz,W. and Hahlbrock,K. TITLE Two alleles of the single-copy chalcone synthase gene in parsley differ by a transposon-like element JOURNAL Mol. Gen. Genet. 212, 93-98 (1988) STANDARD simple staff_review FEATURES from to/span description pept 725 > 732 chalcone synthase, exon 1 mRNA 591 > 732 chalcone synthase mRNA (alt.) mRNA 639 > 732 chalcone synthase mRNA (alt.) trns < 1 53 transposon-like sequence rpt 38 53 3' inverted terminal repeat rpt 54 61 3' insertion target sequence BASE COUNT 245 a 139 c 126 g 222 t ORIGIN About 810 bp after segment 1. 1 ctgaaacgtg aacacttaag aattcggctc ggttcggctc atttacagcc ctaagtagaa 61 gtgatccata gcattttata ataactaaca ttacatatat attacccttt ttctacattc 121 gtgtagagag aaaatgactc aataattcaa ttagtattat agaatttctt gttggtaact 181 cttaatttta tttattaata gaataacgga aacgaaacga gatgggattg gtagatggga 241 aacagaaata ggcacgtcac ggggtgtatg tatttattaa aaacaatggc atataggtaa 301 gttgaggata acaattaaat cgtggtttca ttacaatagt tctgaaagaa gaggtagaat 361 cgattgtcct tcacttgatg tatctactca ctttggatgt acgtggaggc ctaaatcccc 421 accatggttg caagtagcag ttacatccac ataactcctc ccttattcca cgtggccatc 481 cggtggccgt ccctccaacc taacctccct tgaatttcta tcagtaaatt ttcaaccctc 541 tctcttctct gtcgtcacta tataatttcc ctacctacat ctctcttgat atctctttct 601 ctctataaat atttaacaag gagaaaggaa aaataacaac acaaatcaga cagatcagat 661 catcccttta cataaagata aggggtaagg gggggtttat agagagattt ataaaaaaaa 721 aaaaatggca aa // LOCUS R11ORIA 589 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid R1162 origin of replication region DNA. ACCESSION M35512 KEYWORDS origin of replication. SOURCE Plasmid R1162 DNA. ORGANISM Plasmid R1162 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 589) AUTHORS Meyer,R.J., Lin,L.-S., Kim,K. and Brasch,M.A. TITLE Broad host-range plasmid R1162: Replication, incompatibility, and copy-number control JOURNAL Basic Life Sci. 30, 173-188 (1985) STANDARD simple staff_review BASE COUNT 114 a 156 c 199 g 120 t ORIGIN 1 ccgggctgaa tgatcgaccg agacaggccc tgcggggctg cacacgcgcc cccacccttc 61 gggtaggggg aaaggccgct aaagcggcta aaagcgctcc agcgtatttc tgcggggttt 121 ggtgtggggt ttagcgggct ttgcccgcct ttccccctgc cgcgcagcgg tggggcggtg 181 tgtagcctag cgcagcgaat agaccagcta tccggcctct gggcatattg ggcagggcag 241 cagcgcccca cagggcgtga ctaaccgcgc ctagtggatt attcttagat aatcatggat 301 ggatttttcc aacaccccgc cagcccccgc ccctgctggg tttgcaggtt tgggggcgtg 361 acagttattg caggggttcg tgacagttat tgcagggggg cgtgacagtt attgcagggg 421 ttcgtgacag ttagtacggg atgacgggca ctggctggca atgtctagca acggcaggca 481 tgtcggctga cggtaaaaca actttccgct aagcgataga ctgtatgtga aacacagtat 541 tgcaaggacg cggaacatgc ctcatgtggc ggccaggacg gccagccgg // LOCUS STRSPEC 936 bp ds-DNA BCT 27-JUL-1990 DEFINITION S.pyogenes streptococcal pyrogenic exotoxin type C (speC) gene, complete cds. ACCESSION M35514 KEYWORDS pyrogenic exotoxin. SOURCE S.pyogenes DNA. ORGANISM Streptococcus pyogenes Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Streptococcaceae. REFERENCE 1 (bases 1 to 936) AUTHORS Goshorn,S.C. and Schlievert,P.M. TITLE Nucleotide sequence of Streptococcal pyrogenic exotoxin type C JOURNAL Infect. Immun. 56, 2518-2520 (1988) STANDARD simple staff_review FEATURES from to/span description pept 154 861 streptococcal pyrogenic exotoxin type C (speC) precursor sigp 154 234 streptococcal pyrogenic exotoxin type C signal peptide (put.) matp 235 858 streptococcal pyrogenic exotoxin type C BASE COUNT 364 a 144 c 127 g 301 t ORIGIN 1 caaccttgac tatttaaatg gaactgccac tcctaaaaac taaaatataa atacatttat 61 aaaatttcta aataaacaga aatctgattt ttaactactt actgctattt catgtattct 121 cgtacgagta atacatttaa ttaaggagaa aaaatgaaaa agattaacat catcaaaata 181 gttttcataa ttacagtcat actgatttct acttatttca cctatcatca aagtgactct 241 aagaaagaca tttcgaatgt taaaagtgat ttactttatg catacactat aactccttat 301 gattataaag attgcagggt aaatttttca acgacacaca cattaaacat tgatactcaa 361 aaatatagag ggaaagacta ttatattagt tccgaaatgt cttatgaggc ctctcaaaaa 421 tttaaacgag atgatcatgt agatgttttt ggattatttt atattcttaa ttctcacacc 481 ggtgagtaca tctatggagg aattacgcct gctcaaaata ataaagtaaa tcataaatta 541 ttgggaaatc tatttatttc gggagaatct caacagaact taaataacaa gattattcta 601 gaaaaggata tcgtaacttt ccaggaaatt gactttaaaa tcagaaaata ccttatggat 661 aattataaaa tttatgacgc tacttctcct tatgtaagcg gcagaatcga aattggcaca 721 aaagatggga aacatgagca aatagactta tttgactcac caaatgaagg gactagatca 781 gatatttttg caaaatataa agataataga attatcaata tgaagaactt tagtcatttc 841 gatatttatc ttgaaaaata attcatcata cacaaaaaac cgcccagaat aatctgagcg 901 gttttgtctt atctcggagc tttacctcct aattta // LOCUS HUMTROPI 816 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human slow-twitch skeletal troponin I (TNN1) mRNA, complete cds. ACCESSION J04760 KEYWORDS slow-twitch skeletal troponin I; troponin I. SOURCE Human adult slow-twitch skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 816) AUTHORS Wade,R., Eddy,R., Shows,T.B. and Kedes,L. TITLE cDNA sequence, tissue-specific expression, and chromosomal mapping of the human slow-twitch skeletal muscle isoform of troponin I JOURNAL Genomics 7, 346-357 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Wade, 15-MAR-1990. FEATURES from to/span description pept 77 640 slow-twitch skeletal troponin I (TNN1) mRNA < 1 816 TNN1 mRNA BASE COUNT 180 a 247 c 244 g 145 t ORIGIN Chromosome 1q12-qter. 1 tagtctgcag tctacggcga ggcacaggcc agcccagctc cacgaggact gaacaaggtg 61 ctgtctcact gccaccatgc cggaagtcga gagaaaaccc aagatcactg cctcccgcaa 121 actcttgctg aagagcctga tgctggccaa ggccaaggaa tgctgggagc aggagcacga 181 ggagcgcgag gctgagaagg tgcgctacct ggcagagcgc atccccacgc tgcagacccg 241 tggcctgtcc ctcagtgccc tgcaggacct gtgccgggag ctgcacgcca aggtggaggt 301 ggtggatgag gagcgatacg acattgaggc caaatgcctc cacaacacca gggagattaa 361 ggacctgaag ctgaaggtga tggacctccg tgggaagttc aagcgcccgc ccctgcgtcg 421 agtccgtgtc tcggctgacg ccatgctccg ggccctgctg ggctccaagc acaaggtgtc 481 catggatctg cgggccaacc tcaagtctgt gaagaaggaa gacacagaga aggagcggcc 541 tgtggaggtg ggtgactgga ggaagaacgt ggaggccatg tctggcatgg aaggccggaa 601 gaagatgttt gatgccgcca atgctccgac ctcacaatag aggccagctt gctgtgctgc 661 gctctgagct cctgcttcat gcttcttctc caacccagct cactcacctc tctgcctgtg 721 tctggagcat cccttcccac ctctccccca cttcttccct ccagcctgca atgccctcct 781 ctggaactgg gattaaacag atacccaaga ggcagg // LOCUS PPB4TSP 349 bp ds-DNA VRL 27-JUL-1990 DEFINITION Bovine papillomavirus major transcript, 5' flank. ACCESSION M35265 KEYWORDS major transcript. SOURCE Bovine papillomavirus (type 4) DNA. ORGANISM Bovine papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 349) AUTHORS Stamps,A.C. and Campo,M.S. TITLE Mapping of two novel transcripts of Bovine papillomavirus type 4 JOURNAL J. Gen. Virol. 69, 3033-3045 (1988) STANDARD simple staff_review FEATURES from to/span description pre-msg 320 > 349 BPV-4 major transcript BASE COUNT 109 a 54 c 88 g 98 t ORIGIN 1 ctttgcagat gaattagaac aggaggaaca gtaaatagaa tattgctgtg cttttacatt 61 tgagaaggat tgtagtgtgg gcatttaaaa gctgaccttc cagtcttaat tgcagtaggc 121 gcctaagagg gtggtggtgg tataagttca agtttttcaa gaaaatgaag gggcagaacg 181 tgacattaca ggacattgca atagaattag aggatacaat tagtccaatt aacttgcatt 241 gtgaagagga gattgaaact gaggaggtgg acacccctaa cccttttgca ataacagcaa 301 cttgttatgc ttgcgagcaa gtccttcgtt tagctgttgt aacgtcaac // LOCUS PPBBPV4X 69 bp ds-DNA VRL 27-JUL-1990 DEFINITION Bovine papillomavirus X protein gene, partial cds. ACCESSION M35264 KEYWORDS . SOURCE Bovine papillomavirus (type 4) DNA. ORGANISM Bovine papillomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 69) AUTHORS Stamps,A.C. and Campo,M.S. TITLE Mapping of two novel transcripts of Bovine papillomavirus type 4 JOURNAL J. Gen. Virol. 69, 3033-3045 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 69 X protein (AA at 3) BASE COUNT 18 a 27 c 19 g 5 t ORIGIN 1 acaacgcggg acccaaaccg gggaccacgc ccgaggacgt agcagaccgt ccccccgatc 61 ttccagaga // LOCUS BLIPGECR 314 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.licheniformis regulatory extracellular protease gene, complete cds. ACCESSION M35503 KEYWORDS extracellular protease. SOURCE B.licheniformis DNA, clone pAN2. ORGANISM Bacillus licheniformis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 314) AUTHORS Watanabe,K., Sato,N., Asano,K., Hatanaka,Y., Okada,J., Murata,K. and Kimura,A. TITLE Nucleotide sequence of the gene increasing the extracellular proteolytic activities of Bacillus licheniformis; Comparison with similar phenotypic genes from other Bacillus sp JOURNAL Agric. Biol. Chem. 51, 2807-2809 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 27 149 regulatory extracellular protease binding 12 16 ribosomal binding site (put.) binding 20 23 ribosomal binding site (put.) BASE COUNT 118 a 53 c 55 g 88 t ORIGIN 1 gatcttccaa aaggagtgtg gaaccggtgg aaaagcaaca aattgaagaa ttaaaacaac 61 tgctttggcg gctagagaat gaaatcagag aaacaaagga ctccttgcgc aagattaaca 121 aaagcattga tctgttaata caagtttaaa gggatttggt ttctttttca ccgaaaagtt 181 gcatataaat tcatgcgtta caccgcaaac tttaaataga ccataacacc cattttacac 241 aacttgcgat tagttctcaa atcctaatga aacataaaat ttttgtttgt ttttcaagga 301 gtcaaataaa gatc // LOCUS DOGPLBA 832 bp ss-mRNA MAM 27-JUL-1990 DEFINITION Dog cardiac phospholamban mRNA, complete cds. ACCESSION M35393 KEYWORDS phospholamban. SOURCE Dog heart, cDNA to mRNA, clone pPLB1. ORGANISM Canis sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 832) AUTHORS Uyeda,A., Kitano,K., Fujii,J., Kadoma,M., Tada,M. and Tanaka,S. TITLE Characterization of recombinant cDNA clones for canine cardiac phospholamban JOURNAL Nucleic Acids Symp Ser 17, 121-124 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 181 339 cardiac phospholamban (EC 2.7..1.37) BASE COUNT 277 a 161 c 125 g 269 t ORIGIN 1 agaaaacttt ctaactaaac accgataaga cttcatacaa ctcacaatac tttatattgt 61 aatcatcaca agagccaagg ctacctaaaa gaagagagtg gttgagctca catttggccg 121 ccagcttttt acctttctct tcaccattta aaacttgaga cttcctgctt tcctggggtc 181 atggataaag tccaatacct cactcgctct gctattagaa gagcttcaac cattgaaatg 241 cctcaacaag cacgtcaaaa tcttcagaac ctatttataa atttctgtct cattttaata 301 tgtctcttgt tgatctgcat cattgtgatg cttctctgaa gttctgctgc aatctccagt 361 gatgcaactt gtcaccatca acttaatatc tgccatccca tgaagagggg aaaataatac 421 tatataacag accacttcta agtagaagat tttacttgtg aaaaggtcaa gattcagaac 481 aaaagaaatt attaacaaat gtcttcatct gtgggatttt gtaaacatga aaagagcttt 541 attttcaaaa attaacttca aaatgactat aggtgcgcat aatgtaattg ctgaattcct 601 caacaaagct tgtaaaagtt tctatgccaa attttttctg agggtaaagt aggagtttag 661 ttttaaaact gctctgctaa ccagttcact tcacatataa agcattagct tcactatttg 721 agctaaatat ttatattgta ctgtaaatgc ctatgtaatg tttattaaga tttttcaagt 781 ctccgctaag tacgaaaata atcatccaaa tgaagtcatc atttgaaata gc // LOCUS DVUFDXA 707 bp ds-DNA BCT 27-JUL-1990 DEFINITION D.salexigens flavodoxin gene, complete cds. ACCESSION M35475 KEYWORDS flavodoxin. SOURCE D.salexigens (ATCC 14822) DNA. ORGANISM Desulfovibrio salexigens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 707) AUTHORS Helms,L.R., Krey,G.D. and Swenson,R.P. TITLE Identification, sequence determination, and expression of the flavodoxin gene form Desulfovibrio salexigens JOURNAL Biochem. Biophys. Res. Commun. 168, 809-817 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 238 678 flavodoxin binding 227 232 ribosome binding site (put.) signal 129 135 flavodoxin -35 region signal 152 157 flavodoxin -10 region BASE COUNT 223 a 152 c 149 g 183 t ORIGIN 1 agactaacaa taaaaatttt gttttctcac aaacaacatt ttgaacgcgc atatatcggt 61 acgtcttaag gagtgcataa caccactcag gcgcggcgta aacactcaat taccgattgg 121 ctgacatttt cacattttct taatattttt ataaattgat attgactttg aatttcactc 181 tcactacaaa ctaacacaac gttatcaaac cgattacaga tttataagga ggacactatg 241 tccaaatcac tgatcgttta cggctctact accggaaata ctgaaacagc cgccgaatac 301 gtggctgaag cgtttgaaaa caaagaaatt gatgtggaac ttaaaaatgt tactgatgtc 361 agtgttgccg atctcggcaa cggatacgac atcgtgctat tcggctgctc tacctggggc 421 gaagaagaaa ttgaattgca ggacgacttc atccccctct acgattccct cgaaaacgca 481 gacctgaagg gcaagaaagt atctgtcttc ggatgcggtg actccgatta cacttatttt 541 tgcggtgcag tagatgccat cgaagaaaaa ctcgaaaaaa tgggggccgt cgtcataggt 601 gacagcctca agatcgatgg cgacccggaa cgcgatgaga ttgtaagctg gggttcagga 661 atagcggaca aaatttaaat taatacagct cagctatcct tacgaag // LOCUS HUMCKRASA 450 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human PR310 c-K-ras protein mRNA, 5' end. ACCESSION M35504 KEYWORDS c-K-ras protein; oncogene. SOURCE Human (patient PR310) lung carcinoma, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 450) AUTHORS Yamamoto,F., Nakano,H., Neville,C. and Perucho,M. TITLE Structure and mechanisms of activation of c-K-ras oncogenes in human lung cancer JOURNAL Prog. Med. Virol. 32, 101-114 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 450 PR310 c-K-ras oncogene BASE COUNT 155 a 71 c 106 g 118 t ORIGIN 1 atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 61 atacagctaa ttgacaatca ttttgtggac gaatatgatc caacaataga ggattcctac 121 aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt 181 catgaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt 241 gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt 301 aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg 361 ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct 421 tttattcaaa catcagcaaa gacaagacag // LOCUS HUMCKRASB 450 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human PR371 c-K-ras oncogene mRNA, 5' end. ACCESSION M35505 KEYWORDS c-K-ras oncogene. SOURCE Human (patient PR371) lung carcinoma, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 450) AUTHORS Yamamoto,F., Nakano,H., Neville,C. and Perucho,M. TITLE Structure and mechanisms of activation of c-K-ras oncogenes in human lung cancer JOURNAL Prog. Med. Virol. 32, 101-114 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 450 PR371 c-K-ras oncogene BASE COUNT 156 a 71 c 105 g 118 t ORIGIN 1 atgactgaat ataaacttgt ggtagttgga gcttgtggcg taggcaagag tgccttgacg 61 atacagctaa ttgacaatca ttttgtggac gaatatgatc caacaataga ggattcctac 121 aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt 181 caagaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt 241 gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt 301 aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg 361 ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct 421 tttattcaaa catcagcaaa gacaagacag // LOCUS HUMTCRAER 237 bp ds-DNA PRI 27-JUL-1990 DEFINITION Human T cell receptor-alpha enhancer region DNA. ACCESSION M35477 KEYWORDS T-cell receptor. SOURCE Human T-lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 237) AUTHORS Winto,A. and Baltimore,D. TITLE Developmental regulation of the TCR-alpha-delta locus JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 87-92 (1989) STANDARD simple staff_entry FEATURES from to/span description binding 28 60 NF-alpha-1 protein binding site binding 128 139 NF-alpha-2 protein binding site binding 159 172 NF-alpha-3 protein binding site binding 175 189 NF-alpha-5 protein binding site binding 192 203 NF-alpha-4 protein binding site mut 180 187 gcagaagc in NF-alpha-5; tactccta in Mut-alpha-5 mut 192 197 tcctct in NF-alpha-4; gaagag in Mut-alpha-4 BASE COUNT 73 a 58 c 57 g 49 t ORIGIN 1 agacagctgc accctgaaat gggtaagctg gtcagatagt gaatcaatag ccagaagtag 61 aacaggaaat ggaaaaagtt tcccacttcc ctccaggtgt ttgggtctga acagcctccc 121 acttccatga cgtcacggct gctgacatgg gcaaacaggt ccccctttga agctctcccg 181 cagaagccac atcctctgga aagaggagtt aaaaatacag agttagagat aagatct // LOCUS HUMTGIGMU 352 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human Ig mu-chain transgenic mRNA, partial cds. ACCESSION M35478 KEYWORDS immunoglobulin heavy-chain; mu-immunoglobulin; transgenic gene. SOURCE Human leukemic patient mononuclear cell line, cDNA to mRNA, clones pTMMG1-[4,5,12]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 352) AUTHORS Shimizu,A., Kinashi,T., Nussenzweig,M.C., Mizuta,T.-R., Leder,P. and Honjo,T. TITLE Molecular mechanism for immunoglobulin double-isotype expression JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 175-181 (1989) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 352 mu-immunoglobulin transgene protein (AA at 2) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" recomb 226 227 V region end/D region start recomb 274 275 D region end/J6 region start recomb 332 333 J6 region end/C1 region start BASE COUNT 84 a 99 c 104 g 65 t ORIGIN 1 ggcttctgga ggcaccttca gcagctatgc tatcagctgg gtgcgacagg cccctggaca 61 agggcttgag tggatgggag ggatcatccc tatctttggt acagcaaact acgcacagaa 121 gttccagggc agagtcacga ttaccgcgga cgaatccacg agcacagcct acatggagct 181 gagcagcctg agatctgagg acacggccgt gtattactgt gcgaaaaccg ggatcctggg 241 gcggtatagc agtggctggt acccgaactc ggactactac tactacggta tggacgtctg 301 gggccaaggg accacggtca ccgtctcctc agccaaaacg acacccccat ct // LOCUS MUSIGHADD 411 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Ig mu-chain mRNA VDJ region, 5' end. ACCESSION M35502 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse hybridoma CP8 B3-D3, cDNA to mRNA, clone pCP12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 411) AUTHORS Reininger,L., Kaushik,A., Izui,S. and Jaton,J.-C. TITLE A member of a new V-H gene family encodes anti-bromelinized mouse red blood cell autoantibodies JOURNAL Eur. J. Immunol. 18, 1521-1526 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 411 Ig mu-chain VDJ region precursor sigp 1 57 Ig mu-chain signal peptide matp 58 > 411 Ig mu-chain recomb 351 352 V region end/D region start recomb 360 361 D region end/J1 region start BASE COUNT 101 a 84 c 118 g 108 t ORIGIN 1 atggagtggg aactgagctt aattttcatt tttgctcttt taaaagatgt ccagtgtgaa 61 gtgcagctgt tggagactgg aggaggcttg gtgcaacctg gggggtcacg gggactctct 121 tgtgaaggct cagggtttac ttttagtggc ttctggatga gctgggttcg acagacacct 181 gggaagaccc tggagtggat tggagacatt aattctgatg gcagtgcaat aaactacgca 241 ccatccataa aggatcgatt cactatcttc agagacaatg acaagagcac cctgtacctg 301 cagatgagca atgtgcgatc tgaggacaca gccacgtatt tctgtatgag atatggtaac 361 tactggtact tcgatgtctg gggcgcaggg accacggtca ccgtctcctc a // LOCUS PRW1707PPS 66 bp ds-DNA SYN 27-JUL-1990 DEFINITION Plasmid pRW1707 DNA with polypurine-polypyrimidine sequences. ACCESSION M35455 KEYWORDS . SOURCE Synthetic DNA, clone pRW1707. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D. TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine sequences in plasmids JOURNAL Biochemistry 29, 4704-4713 (1990) STANDARD simple staff_entry BASE COUNT 36 a 5 c 21 g 4 t ORIGIN 1 ggatccgaag aagaagaaga agaagaagaa gaattcgaag aagaagaaga agaagaagaa 61 ggatcc // LOCUS PRW1718PPS 72 bp ds-DNA SYN 27-JUL-1990 DEFINITION Plasmid pRW1718 DNA with polypurine-polypyrimidine sequences. ACCESSION M35460 KEYWORDS . SOURCE Synthetic DNA, clone pRW1718. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 72) AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D. TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine sequences in plasmids JOURNAL Biochemistry 29, 4704-4713 (1990) STANDARD simple staff_entry BASE COUNT 38 a 6 c 22 g 6 t ORIGIN 1 ggatccgaag aagaagaaga agaagaagaa gaattaattc gcgaagaaga agaagaagaa 61 gaagaaggat cc // LOCUS PRW1724PPS 66 bp ds-DNA SYN 27-JUL-1990 DEFINITION Plasmid pRW1724 DNA with polypurine-polypyrimidine sequences. ACCESSION M35458 KEYWORDS . SOURCE Synthetic DNA, clone pRW1724. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D. TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine sequences in plasmids JOURNAL Biochemistry 29, 4704-4713 (1990) STANDARD simple staff_entry BASE COUNT 19 a 6 c 38 g 3 t ORIGIN 1 ggatccggag gaggaggagg aggaggagga ggatccggag gaggaggagg aggaggagga 61 ggatcc // LOCUS PRW1725PPS 66 bp ds-DNA SYN 27-JUL-1990 DEFINITION Plasmid pRW1725 DNA with polypurine-polypyrimidine sequences. ACCESSION M35459 KEYWORDS . SOURCE Synthetic DNA, clone pRW1725. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 66) AUTHORS Shimizu,M., Hanvey,J.C. and Wells,R.D. TITLE Multiple non-B-DNA conformations of polypurine-polypyrimidine sequences in plasmids JOURNAL Biochemistry 29, 4704-4713 (1990) STANDARD simple staff_entry BASE COUNT 27 a 6 c 30 g 3 t ORIGIN 1 ggatccggag gaggaggagg aggaggagga ggatccgaag aagaagaaga agaagaagaa 61 ggatcc // LOCUS PT1INC3AA 330 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid pT181 (from S.aureus) inc3A region DNA with repC gene, 5' end. ACCESSION M35476 KEYWORDS . SOURCE Plasmid pT181 (from Staphylococcus aureus) DNA. ORGANISM Plasmid pT181 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 330) AUTHORS Highlander,S.K. and Novick,R.P. TITLE Mutational and physiological analyses of plasmid pT181 functions expressing incompatibility JOURNAL Plasmid 23, 1-15 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 11 < 1 (c) RepC protein mRNA 191 < 1 (c) RepC mRNA III mRNA 291 < 1 (c) RepC mRNA IV RNA 106 190 RNA I RNA 106 250 RNA II binding 27 22 (c) RepC ribosomal binding site (put.) site 151 176 inverted repeat terminator structure site 218 249 inverted repeat terminator structure BASE COUNT 113 a 51 c 37 g 129 t ORIGIN 1 tttttataca taaaaaatcg actccttaat ctcaatttcg tttaaggaat cgctcaccca 61 aatatatatc ttgatgtata tttaaatatc gtttaatatc taaatataca agattataaa 121 aacaactcag tgtttttttc tttgaatgat gtcgttcaca aactttggtc agggcgtgag 181 cgactccttt ttattttgtt attaatataa cactatcaaa agatttggtc taatcagatc 241 aagtcttttt ttatttaagc atttgtatta tctggtaaac agttaaaact actaaaacac 301 caagtacata cttacttgtt ataaaattct // LOCUS RICRGC1 145 bp ds-DNA PLN 27-JUL-1990 DEFINITION Rice 17S rRNA gene. ACCESSION M35382 M35383 KEYWORDS 17S ribosomal RNA. SEGMENT 1 of 4 SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 145) AUTHORS Hariharan,N. and Padayatty,J.D. TITLE Ribosomal RNA gene repeat units in rice embryos JOURNAL Indian J Biochem Biophys 24, 157-164 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 > 145 17S rRNA variant 105 106 cg in IR20; cgcggccg in Mangetsumochi variant 112 113 gt in IR20; gcgt in Mangetsumochi BASE COUNT 30 a 40 c 43 g 32 t ORIGIN 1 gatccattgg agggcaagtc tggtgccagc agccgcggta attccagctc caatagcgta 61 tatttaagtt gttgcagtta aaaagctcgt agttggacct tgggcgggcc ggtccgcctc 121 acggcaggca ccgacctgct cgacc // LOCUS RICRGC2 260 bp ds-DNA PLN 27-JUL-1990 DEFINITION Rice 5.8S rRNA gene. ACCESSION M35384 KEYWORDS 5.8S ribosomal RNA. SEGMENT 2 of 4 SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 260) AUTHORS Hariharan,N. and Padayatty,J.D. TITLE Ribosomal RNA gene repeat units in rice embryos JOURNAL Indian J Biochem Biophys 24, 157-164 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA 85 247 5.8S rRNA BASE COUNT 61 a 85 c 76 g 38 t ORIGIN About 1.0 kbp after segment 1. 1 cagaacccac ggcgccgacg gcgtcaagga acacagcgag gccctggccg tccggcggcg 61 cgcgcgatac cacgagttaa atccacacga ctctcggcaa cggatatctc ggctctcgca 121 tcgatgaaga acgtagcgaa atgcgatacc tggtgtgaat tgcagaatcc cgtgaaccat 181 cgagtctttg aacgcaagtt gcgcccgagg ccatccggcc gagggcacgc ctgcctgggc 241 gtcacgccaa aagacgctcc // LOCUS RICRGC3 91 bp ds-DNA PLN 27-JUL-1990 DEFINITION Rice 25S rRNA gene. ACCESSION M35385 KEYWORDS 25S ribosomal RNA. SEGMENT 3 of 4 SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 91) AUTHORS Hariharan,N. and Padayatty,J.D. TITLE Ribosomal RNA gene repeat units in rice embryos JOURNAL Indian J Biochem Biophys 24, 157-164 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 91 25S rRNA BASE COUNT 30 a 12 c 29 g 20 t ORIGIN About 1.6 kbp after segment 2. 1 gatcttggtg gtagtagcaa atattcaaat gagaacttga agggcgaaga ggagaaaggt 61 tccatgtgaa cggcacttgc acatgggtaa g // LOCUS RICRGC4 161 bp ds-DNA PLN 27-JUL-1990 DEFINITION Rice 25S rRNA gene. ACCESSION M35386 KEYWORDS 25S ribosomal RNA. SEGMENT 4 of 4 SOURCE Rice (variety IR20) 48 hr-germinated embryo DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 161) AUTHORS Hariharan,N. and Padayatty,J.D. TITLE Ribosomal RNA gene repeat units in rice embryos JOURNAL Indian J Biochem Biophys 24, 157-164 (1987) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 161 25S rRNA variant 24 25 cc in IR20; ccgc in Mangetsumochi variant 63 68 gccctt in IR20; tccacg in Mangetsumochi variant 128 128 c in IR20; g in Mangetsumochi BASE COUNT 36 a 52 c 48 g 25 t ORIGIN About 250 bp after segment 3. 1 cgcacgtcgc gcggtgtccg gtgcccccgg cggcccttga aaatccggag gaccgagtac 61 cggcccttcc cggtcgtact cataaccgca tcaggtctcc aaggtgaacg acctctggcc 121 aatggaacaa tgtaggcaag ggaagtcggc aaaacggatc c // LOCUS STRRRA 5100 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid pAM-beta-1 (from S.faecalis) replication region DNA. ACCESSION X17092 KEYWORDS . SOURCE plasmid pAM-beta-1 (from Streptococcus faecalis) DNA. ORGANISM Plasmid pAM-beta-1 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 5100) AUTHORS Swinfield,T.-J., Oultram,J.D., Thompson,D.E., Brehm,J.K. and Minton,N.P. TITLE Physical characterization of the replication region of the Streptococcus faecalis plasmid pAM-beta-1 JOURNAL Gene 87, 79-90 (1990) STANDARD simple staff_entry FEATURES from to/span description pept < 1 476 ORFA (AA at 3) pept 505 873 ORFB pept 933 1799 ORFC pept 2121 2405 ORFF pept 2772 3071 ORFD pept 3114 4604 ORFE pept 4954 5091 ORFG binding 488 496 ORFB ribosomal binding site (put.) binding 919 927 ORFC ribosomal binding site (put.) binding 2759 2765 ORFD ribosomal binding site (put.) binding 3100 3106 ORFE ribosomal binding site (put.) binding 4942 4948 ORFG ribosomal binding site (put.) BASE COUNT 1953 a 806 c 944 g 1397 t ORIGIN 1 gaattcattt gttcatatct attttgactc tttttattgt attattgata ggaataagtg 61 ttattcaaca agtacaactt caatcagtca aaaaagagag cgaattattg gaagaacaaa 121 ttgaacgtgt caaagaaacg gatatttctc aatcgaaaat agatactttt ggaagatact 181 ttttgactta ctatttttca caggaaaaaa atcaagaaaa ttatcagtca agtttaagaa 241 catatgtatc cgaaaaggta gatatatccg attggaaagc gttagggaaa acgcttaaga 301 gtgtcaatta ctatggtagt gaacaaacaa aaaaagggta tagcgttgaa taccttctaa 361 atgtttcagt agacaatcgt agtaaaatgc aaaaaatcac ttttgaggtt gaaccaacta 421 aaaacggctt tcttgtgaca actcaaccaa aacttacaga cttctctttt aactgaaata 481 cgctaaaaag agagtgctta aacaatggga aaaattaatt taaatcaaat ttacacagca 541 aaagaaatga gtgaaaggat aggtaagaat agaaattacc tatcccaagc ttatcgtaac 601 aacaaacatg aaatacttaa aaattttaat tatcgaaaaa ttggtggaac aattattttc 661 tctgataatc ccaataatga tttatcgcaa ttgattacag caaaagaagc aagccagttg 721 ttaggaaaga atgatgaata ctttgcccac atttataaac ggtttcctca tagactagag 781 ggaatcgatc acatttatac aggtaaaaca ttattcttaa cgaaagaatc gctagaagtt 841 tttaaaaaaa agatgaacaa aaatgtcaga taaagcatat ttatgatata ctatttatgt 901 acacaaatat taatttttaa ggagagaaga aaatgaaaaa aattgccctt ttaacacttt 961 ttagtacagc gctattggta tcagccccac ttgttagttt tgcggacgaa gcaacaggta 1021 gttcaggtgt caatgttcta gctgatgatg tagttgatcc agtagaacca accgatccaa 1081 ctactccggt tgatccagtt gatccagttg atccagttga tccagttgat ccagttgatc 1141 cagttgatcc agtcgatcca gttgatccag tcgatccaac agagccaacg gaaccaacag 1201 agccaacgga accaacagag ccaacggaac caacagagcc aacagagcca acggaaccaa 1261 cggaaccaac ggaaccaacg gaaccaacag agccaacgga accaacggaa ccaacagagc 1321 caacggaacc aacagagcca acggaaccaa cggaaccaac agagccaaca gagccaacgg 1381 aaccaacgga accaacagag ccaacggaac caacagagcc aacagagcca acagagccaa 1441 cggaaccaac ggagccaacg gaaccaacgg aaccaagcaa gccgacagag ccaacagaac 1501 caagtaaacc agtagacaca aacccaatcg aaaacccagt taatacagat acaggcgttg 1561 tgattgttgc agttgaagat agtaaaccaa tcattcaatt agcagacgga acaacaaaaa 1621 aagttgaagc taaagaaatt ggcgcagacg ttcaaaaaga cggaacagta acggttaaag 1681 gttcagacgg aaaaatgaaa gttttaccaa aaacaggaga aacagaaaat attgcattgt 1741 ctgtcctagg ttcattaatg gtattaggat cagcattcat ttttaaaaag agaatctaat 1801 taagtatttt taaaagcatg gtcgcaagtt tcactagcag ccatgctttt tttgaatctt 1861 tgtaggacgt tttaaagaga attacccact ttaaaattaa acatgcgctt agaatcgctt 1921 taggaaacac gatccagtcc aataatcgtc gataaaaact tttgaaaaag gttggtgaaa 1981 ttacctactt ttggaataat cacaaatcac aagtgattaa tcacaaatca caagtgatta 2041 atcacttgtt tattaagata ttaaaagcta taatttaaat aaagcgtgaa ttttattaca 2101 caaaaagagg ggggagaaac ttggaactag catttagaga aagcttaaaa aagatgagag 2161 gtaccaaatc aaaagaaaaa ttctcccaag aattagaaat gagtagatca aattattcac 2221 gaatagaatc aggaaaatca gatccaacca taaaaacact agaacaaatt gcaaagttaa 2281 ctaactcaac gctagtagtg gatttaatcc caaatgagcc aacagaacca gaaccagaaa 2341 cagaatcaga acaagtaaca ttggatttag aaatggaaga agaaaaaagc aatgacttcg 2401 tgtgaataat gcacgaaatc gttgcttatt tttttttaaa agcggtatac tagatataac 2461 gaaacaacga actgaataga aacgaaaaaa gagccatgac acatttataa aatgtttgac 2521 gacattttat aaatgcatag cccgataaga ttgccaaacc aacgcttatc agttagtcag 2581 atgaactctt ccctcgtaag aagttattta attaactttg tttgaagacg gtatataacc 2641 gtactatcat tatataggga aatcagagag ttttcaagta tctaagctac tgaatttaag 2701 aattgttaag caatcaatcg gaaatcgttt gattgctttt tttgtattca tttatagaag 2761 gtggagtttg tatgaatcat gatgaatgta aaacttatat aaaaaatagt ttattggaga 2821 taagaaaatt agcaaatatc tatacactag aaacgtttaa gaaagagtta gaaaagagaa 2881 atatctactt agaaacaaaa tcagataagt atttttcttc ggagggggaa gattatatat 2941 ataagttaat agaaaataac aaaataattt attcgattag tggaaaaaaa ttgacttata 3001 aaggaaaaaa atctttttca aaacatgcaa tattgaaaca gttgaatgaa aaagcaaacc 3061 aagttaatta aacaacctat tttataggat ttataggaaa ggagaacagc tgaatgaata 3121 tcccttttgt tgtagaaact gtgcttcatg acggcttgtt aaagtacaaa tttaaaaata 3181 gtaaaattcg ctcaatcact accaagccag gtaaaagcaa aggggctatt tttgcgtatc 3241 gctcaaaatc aagcatgatt ggcggtcgtg gtgttgttct gacttccgag gaagcgattc 3301 aagaaaatca agatacattt acacattgga cacccaacgt ttatcgttat ggaacgtatg 3361 cagacgaaaa ccgttcatac acgaaaggac attctgaaaa caatttaaga caaatcaata 3421 ccttctttat tgattttgat attcacacgg caaaagaaac tatttcagca agcgatattt 3481 taacaaccgc tattgattta ggttttatgc ctactatgat tatcaaatct gataaaggtt 3541 atcaagcata ttttgtttta gaaacgccag tctatgtgac ttcaaaatca gaatttaaat 3601 ctgtcaaagc agccaaaata atttcgcaaa atatccgaga atattttgga aagtctttgc 3661 cagttgatct aacgtgtaat cattttggta ttgctcgcat accaagaacg gacaatgtag 3721 aattttttga tcctaattac cgttattctt tcaaagaatg gcaagattgg tctttcaaac 3781 aaacagataa taagggcttt actcgttcaa gtctaacggt tttaagcggt acagaaggca 3841 aaaaacaagt agatgaaccc tggtttaatc tcttattgca cgaaacgaaa ttttcaggag 3901 aaaagggttt aatagggcgt aataacgtca tgtttaccct ctctttagcc tactttagtt 3961 caggctattc aatcgaaacg tgcgaatata atatgtttga gtttaataat cgattagatc 4021 aacccttaga agaaaaagaa gtaatcaaaa ttgttagaag tgcctattca gaaaactatc 4081 aaggggctaa tagggaatac attaccattc tttgcaaagc ttgggtatca agtgatttaa 4141 ccagtaaaga tttatttgtc cgtcaagggt ggtttaaatt caagaaaaaa agaagcgaac 4201 gtcaacgtgt tcatttgtca gaatggaaag aagatttaat ggcttatatt agcgaaaaaa 4261 gcgatgtata caagccttat ttagtgacga ccaaaaaaga gattagagaa gtgctaggca 4321 ttcctgaacg gacattagat aaattgctga aggtactgaa ggcgaatcag gaaattttct 4381 ttaagattaa accaggaaga aatggtggca ttcaacttgc tagtgttaaa tcattgttgc 4441 tatcgatcat taaagtaaaa aaagaagaaa aagaaagcta tataaaggcg ctgacaaatt 4501 cttttgactt agagcataca ttcattcaag agactttaaa caagctagca gaacgcccta 4561 aaacggacac acaactcgat ttgtttagct atgatacagg ctgaaaataa aacccgcact 4621 atgccattac atttatatct atgatacgtg tttgtttttt ctttgctgtt tagcgaatga 4681 ttagcagaaa tatacagagt aagattttaa ttaattatta gggggagaag gagagagtag 4741 cccgaaaact tttagttggc ttggactgaa cgaagtgagg gaaaggctac taaaacgtcg 4801 aggggcagtg agagcgaagc gaacacttga ttttttaatt ttctatcttt tataggtcat 4861 tagagtatac ttatttgtcc tataaactat ttagcagcat aatagattta ttgaataggt 4921 catttaagtt gagcatatta gaggaggaaa atcttggaga aatatttgaa gaacccgatt 4981 acatggattg gattagttct tgtggttacg tggtttttaa ctaaaagtag tgaatttttg 5041 atttttggtg tgtgtgtctt gttgttagta tttgctagtc aaagtgatta aatagaattc // LOCUS SUSISDNA 241 bp ds-DNA INV 27-JUL-1990 DEFINITION Sea urchin integration site DNA. ACCESSION M34912 KEYWORDS . SOURCE Sea urchin DNA, clone 1. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 241) AUTHORS Flytzanis,C.N., McMahon,A.P., Hough-Evans,B.R., Katula,K.S., Britten,R.J. and Davidson,E.H. TITLE Persistence and integration of cloned DNA in postembryonic sea urchins JOURNAL Dev. Biol. 108, 431-442 (1985) STANDARD simple staff_entry BASE COUNT 70 a 42 c 33 g 96 t ORIGIN 1 gaaaagtaac gatattcgtt tttataaaag gtcattttca cgtagagatg aatttacctg 61 gtactaataa ctgatgcaga gtcgttttca aacagtgtca aatgcaaacc tttttgtctg 121 actggctctt ttttaaattt gtattaatgc cataacactc actttaattt gcgacatggc 181 tctttccatg tcactttttt attcttcaca tgataaatat tctaaaacta tctgttcttt 241 t // LOCUS MSQD7AA 3819 bp ds-DNA INV 27-JUL-1990 DEFINITION A.aegypti D7 gene, exons 1-5. ACCESSION M33156 KEYWORDS . SOURCE A.aigypti (strain Rockefeller) female salivary gland DNA, clone AEGD7. ORGANISM Aedes aegypti Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae. REFERENCE 1 (bases 1 to 3819) AUTHORS James,A.A., Blackmer,K., Marinotti,O., Ghosn,C. and Racioppi,J. TITLE Molecular analysis of a gene expressed in the female salivary glands of the vector mosquito, Aedes aegypti JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.A.James, 22-MAR-1990. Author address:A.A.James: U of C, Irvine, Dept Biochem Irvine, CA 92717 FEATURES from to/span description pept 2377 2421 D7 protein, exon 1 2481 2696 D7 protein, exon 2 2757 2975 D7 protein, exon 3 3038 3232 D7 protein, exon 4 3288 3578 D7 protein, exon 5 pre-msg 2350 > 3819 D7 protein mRNA and introns IVS 2422 2480 D7 protein intron A IVS 2697 2756 D7 protein intron B IVS 2976 3037 D7 protein intron C IVS 3233 3287 D7 protein intron D signal 2321 2326 TATA box signal 3634 3639 poly-A signal BASE COUNT 1202 a 721 c 744 g 1152 t ORIGIN 1 gaattcaagg aacctttaaa tgaatctctt gaaatatccc tggaaacatt actcgtgaag 61 tcctggagaa ttttcagtta taaatctggg aagaatcctt gcatgaatcc ctggggtaat 121 ttatgaagaa atcctttgag gtattcttgc aagaatctct tgacgttgaa tgcatttggt 181 ttggcttgaa atattttaag gttttggtga aattagaata cttggagaaa tctgtgataa 241 tattttgaat atattcttga aataatttta gaaagagtta ccgcattaat tggaactgaa 301 atcgctggcg gaatttttgt gaaaatcttc aagaaatttc aaaatgtctc tgaaaaaacc 361 ctgaaggaaa ttactaaagg ataatttgac agattttttt tattattgac tctttattat 421 ggagaaattc agcccttagc tagttcatct ccttgacaga attactgaaa aacgactctg 481 ttttttgtaa tctactaatt cagagtccaa cgctcgggtt ttgaatttcc catatttcct 541 acacaaatcc atgaaagaat aatagtcgag aactgtcaaa aagttatgaa aaattattga 601 aaacggaatc gagactagca ctgtaaaaga gtcatctatt gtcatttcag ttacaagggc 661 caatcaattg acacaaacac aagcttcacc aactcttcca gaaattttaa agcatctact 721 gatattggct tagataattt atacgtattc ctcctagatt tggcccagtt gttagttaca 781 tgaaaggcca aatgttaaag gttaaatatc cgcagagtcc attgtcaatc agtgaagtat 841 tttaagccag catgaaaggg tattatcaac attttttcca aaaccctccg ggttcattta 901 tgctgagaag gaaaaaccga agtgattttt ttgcgctctc gtgctctgca taataacctt 961 accgtccttc aattaaactt tccgtcaaag tcttcatagt ccaagcaata aaaaaaaaga 1021 tgaaaatttc ttcaataagg tatcaccaaa gacacatacc ataccgaggg accaaatgca 1081 gtactagaag tggtaccaaa tcagagcccg agtgggacgg acctggtatc actcgacata 1141 ttcactgttt actgatgtcg tacaaacagc tattcgtcgt aattcttcgt ctgccacttc 1201 aaatgaaaac ctgctaggtg aaacggacga acaatccaaa ctttccttca gcaaatcttg 1261 tcctacccat cagaaagtgt tatttatgag gactcctgag cggtactgca atgctctcat 1321 atgagcgctg gacggcactg acgttcacct tccggatcct cgtggagtgc ttgctgtttc 1381 gacgccatct tcgattgaac tgtcagttat tactctcaac agaagcagta agcttcacct 1441 taaggctaag taacccgtca ttcgttttgg caacaatgat gacttttcag cttgcatttc 1501 aaaatgataa aacgcagtct tgatagttta tactgacttg aaaaagtatc actgtacgca 1561 tttacatgca taaagtatgc tgatactttt tcagctgtgt cagtgcaaaa gcaactgatt 1621 ttctttgatt cgaaatcatg agatgaatta gcaacaacca tcaacgacgc gtacaaattt 1681 taatgacggc ctacttcgcc ttaacagcct tgtcgtgatg acgatccttg cagttgcgac 1741 aggaatactt cgacaaatcc tagctagatt tcccgatctg aagcagtttg ttaaaaaaaa 1801 atattttcag ttgagaaatg attttcctca cctcagagag ttaccggatc taaccgtatt 1861 aaaggttgtc aatcaggata tgcgcatcca gtaccgattc cttggataac gatggcaaat 1921 cgattaaaac ttgtgcctac gcaatttaat agttctagtt gtctctaagc aatctatcag 1981 gtcttttggt tctcgcgaaa tctgattaat tcatcgacat taccgtcgag agcctggagc 2041 ttgatttgcg acaaacatat tgatcgattg cttgaataca taagcttttg ctaagcctat 2101 taatccaata atagatctct tcagagacga atggaagcac accatagtac tttataatat 2161 tgaacatttg ttagtactgt gcacgtgtgc acacattcat gtacatatgg atgatttgat 2221 cagctttact caattctttc ttcacactct gctggttgtt tatatttgcg taaatctatt 2281 atctgttatc agctgactgc taaatcacat tcaacttgca tataaaacgc attaggcctg 2341 tctaacattc atcccgatca tacttacaat actagaatga agctacttct attactcgca 2401 atttttacaa ccttttccgt ggtaagatgt tcctagtgca tccaactctt catttcttaa 2461 caaatactcc accttttaag gttgcctcaa tgggaccatt tgatccggag gagatgctgt 2521 tcatctttac gcgctgcatg gaagacaatt tggaagatgg agcgaatcga cttccaatgt 2581 tagcgaaatg gaaagaatgg attaacgaac cggtagacag ccccgcaact cagtgtttcg 2641 gcaaatgcgt cctggtaaga acaggtctgt acgatccggt agcccaaaag ttcgatgtaa 2701 gttggtggtt gatatccggg tggaattgga taatttcatc gcttcaattg ttgtaggcgt 2761 cggtgatcca ggagcaattt aaggcttatc cgtccttggg ggaaaagagc aaagttgaag 2821 catatgctaa cgcagttaaa cagttgcctt ccacaaataa cgactgtgcc gctgttttca 2881 aagcgtacga tcctgttcat aaggcgcata aggacaccag caagaacttg ttccatggaa 2941 acaaggagtt gaccaagggc ctctatgaga agttggtaag tgtttccgtg ttattcgatt 3001 gaaacaattg tgtgtatgaa catattaatt gcttcaggga aaagacattc gccagaagaa 3061 gcaatcctac ttcgagtttt gcgagaacaa gtactaccca gctgggtcag ataagcgtca 3121 gcaactttgt cagataaggc aatacactgt cttagatgat gcgctgttca aggagcacac 3181 tgattgcgtg atgaagggta ttcgctacat tacgaaggat aatcaactgg atgtaagaaa 3241 ataataaata tcatgcaaac atcatagtaa tcgacatttc ctttcaggtg gaagaggtga 3301 agcgggactt caagctagtg aataaagata cgaaggcact cgaaaaggtt ttgaatgact 3361 gtaagtctaa ggaaccaagc aacgcaaaag agaaatcatg gcactactac aaatgtttgg 3421 tggaatcttc ggttaaggat gatttcaaag aggcttttga ctatcgtgag gtacggtcac 3481 agatttatgc cttcaatttg cccaagaaac aggcttacag caaaccagca gtgcaatctc 3541 aagtgatgga aatcgacggt aaacagtgtc cacaatagaa tagtgcattg aaaatgtctt 3601 ccaattcaat ggtttcatcg aagtagaaag ttaaataaag atataataat catcacgcag 3661 aggaactttg ttttaactta tattgtttat ttggtaattg ttgtagttat ttttaaatta 3721 ttttaatttg ggaagccgaa aaaccgttaa taggaaatta aagtagtttt tgaatcattg 3781 ttattataaa atatcgcata gttctgtact gaagaattc // LOCUS MSQD7AB 1040 bp ss-mRNA INV 27-JUL-1990 DEFINITION A.aegypti D7 mRNA, complete cds. ACCESSION M33157 KEYWORDS . SOURCE A.aigypti (strain Rockefeller) female salivary gland, cDNA to mRNA, clone AEGD7. ORGANISM Aedes aegypti Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae. REFERENCE 1 (bases 1 to 1040) AUTHORS James,A.A., Blackmer,K., Marinotti,O., Ghosn,C. and Racioppi,J. TITLE Molecular analysis of a gene expressed in the female salivary glands of the vector mosquito, Aedes aegypti JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.A.James, 22-MAR-1990. Author address:A.A.James U of C, Irvine, Dept Biochem Irvine, CA 92717 FEATURES from to/span description pept 13 978 D7 protein signal 1016 1021 poly-A signal BASE COUNT 335 a 208 c 251 g 246 t ORIGIN 1 gaattcccta gaatgaagct gcctctatta ctcgcaatag ttacaacctt ttccgtggtt 61 gcttcaacgg gaccatttga tccggaggag atgctgttca cctttacgcg ctgcatggaa 121 gacaatttgg aagatggacc gaatcgactt ccaatgttag cgaaatggaa agaatggatt 181 aacgaaccgg tagacagccc cgcaactcag tgtttcggca aatgcgtcct ggtaagaaca 241 ggtctgtacg atccggtagc ccaaaagttc gatgcgtcgg tgatccagga gcaatttaag 301 gcttatccgt ccttggggga aaagagcaaa gttgaagcat atgctaacgc agttcaacag 361 ttgccttcca caaataacga ctgtgccgct gttttcaaag cgtacgatcc tgttcataag 421 gcgcataagg acaccagcaa gaacttgttc catggaaaca aggagttgac caagggcctc 481 tatgagaagt tgggaaaaga cattcgccag aagaagaaat cctacttcga gttttgcgag 541 aacaagtact acccagctgg atcagataag cgccagcaac tttgtaagat aaggcaatac 601 actgtcttag atgatgcgct gttcaaggag cacactgatt gcgtgatgaa gggtattcgc 661 tacataacga agaataatga actggatgct gaagaggtga aacgggactt caagctagtg 721 aataaagata cgaaggcact tgaaaaggtt ttgaatgact gtaagtctaa ggaaccaagc 781 aacgcaaaag agaaatcatg gcactactac aaatgtttgg tggaatcttc ggttaaggat 841 gatttcaaag aggcttttga ctatcgtgag gtacggtcac agatttatgc cttcaatttg 901 cccaagaagc aggcttacag caaaccagca gtgcaatctc aagtgatgga aatcgacggt 961 aaacagtgtc cacaatagaa tagtgcattg aaaatgtctt ccaattcaat ggttaaataa 1021 agaaataatg atcatcacgc // LOCUS ADGLS 1477 bp ds-DNA VRL 27-JUL-1990 DEFINITION Adenovirus type 7 (simian) left end of genome: 0.0-4.2 map units. ACCESSION K03325 KEYWORDS alternate splicing; terminal repeat. SOURCE Adenovirus type 7 (simian) DNA. ORGANISM Mastadenovirus s7 Unclassified. REFERENCE 1 (bases 1 to 1477) AUTHORS Kimelman,D., Miller,J.S., Porter,D. and Roberts,B.E. TITLE E1a regions of the human adenoviruses and of the highly oncogenic simian adenovirus 7 are closely related JOURNAL J. Virol. 53, 399-409 (1985) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Kimelman (15-AUG-1985). The mRNA transcripts (mRNAs I and II) were determined by mRNA-DNA hybridization and S1 nuclease digestion. The peptide coding regions were deduced by analogy to other adenoviruses and following the most likely mode of translation. The intron of the mRNA III transcribed from the E1a of other adenoviruses has not yet been determined by S1 mapping. Enhancer sequences are located at positions 165-173 and 265-270. A TATA box is located at position 408-413. A polyadenylation signal is located at posityion 1447-1452. FEATURES from to/span description pept 486 837 E1a 28 K protein from mRNA I, exon 1 1178 1407 E1a 28 K protein from mRNA I, exon 2 pept 486 1056 E1a 24 K protein from mRNA II, exon 1 1178 1407 E1a 24 K protein from mRNA II, exon 2 pre-msg 439 1477 E1a mRNA IVS 838 1177 E1a 24K pept intron A IVS 1057 1177 E1a 28K pept intron A BASE COUNT 331 a 334 c 442 g 370 t ORIGIN 5' end of the l strand of the genome. 1 tctatataat ataccttatt tgggaacggt gccaatatgc taatgaggtg ggcggagttt 61 ggtgacgtat gcggaaatgg gcggagttag gggcggggtt tggcggtagg cgtggctggg 121 ggagtgtccg ggcgtgggaa cggaagtgac gtagggggcg cgccggaggt gacgtcgtgt 181 ggggagtttt aaaccggaag caaggtattt taaacgcttg caagcgcaat tttgtcggtt 241 ttggcgcgaa aactgataaa aagcggaagt tcggttaatc attaattttt acgataggga 301 ggaatattta ccgagggccg gtgaactttg agcggtgacg cggtggtttc gttacgtggc 361 accaccacgc gactgctcaa agtccccgtt tattgtctag gtgagggtat ttaaaccggc 421 tcagaacgtc aagaggccac tcttgagtgc ccgcgagaag agctttctcc tctttcgctg 481 cgaaaatgag acacttggcg ttggaaatga tttctgaact gctggattta ggactggata 541 ccattgatgg ctggctgcac accgaatttc ggccggtacc ggcgggggtg agtcataaca 601 tgtcgctgca cgaaatgtac gacctggacg ttaccggcca ggaggatgag aacgaagagg 661 cggtagatgg tgttttttcc gatgcgatgc tcctggccgc ggaggaggga atagaaatgc 721 ctaatcttta ttctccggga cctctggttg ggggaggtga aatgcctgaa cttcagcctg 781 aggaggaaga tcttttctgc tacgaagatg gcttccctcc cagtgactct gaggaaggtg 841 agcattcgca ggtggagaca gaacgtaaaa tggcggaggc ggcggcagca ggtgcggcgg 901 cggccgcgcg gcgggagcaa gatgactttc gcttagactg tcctagcgta cctggccatg 961 gctgtagctc ctgtgactac catcgcaaaa ctagcggctg tcctgaaatt ctgtgctcgc 1021 tgtgctatct gagggctaac agcatgttta tttatagtaa gtaaattttt tctactaact 1081 ttctcgttgt gtgtttgctc gctcgctcgc tcgcaactgc tggggtgctt gctgttggga 1141 ctgagcttac aggtattttc tctgtaattt tccataggtc cagtttctga ctctgagcca 1201 gacgagcccg actccacaac agctgattca aatcatggca gcccgccaac ccttcgctgc 1261 accccaccca gggacttgcc gcgacctgtg ccagtgaagg cctctcctgg caagcgccca 1321 gcggtgaaca gcttgcatga cctcatagag gaggttgaac aaacagtacc tttggacctg 1381 tccctaaagc gctctaggag caattagggt tataaaaccc ctccccttcc ccttaagtta 1441 tcaggaaata aaaagattaa ctggattctt tgtgcct // LOCUS ADGRS 116 bp ds-DNA VRL 27-JUL-1990 DEFINITION adenovirus type 7 (simian) right end of genome. ACCESSION V00035 J01991 KEYWORDS terminal repeat. SOURCE adenovirus type 7 (simian) dna. ORGANISM Mastadenovirus s7 Unclassified. REFERENCE 1 (bases 1 to 116) AUTHORS Broker,T.R. TITLE terminal nucleotide sequences and restriction endonuclease cleavage maps of simian and murine adenoviruses JOURNAL (in) Tooze,J. (Ed.); DNA Tumor Viruses, Second Edition Revised: 1037-1046; Cold Spring Harbor Laboratory, Cold Spring Harbor (1981). STANDARD simple staff_review REFERENCE 2 (bases 1 to 116) AUTHORS Tolun,A., Alestroem,P. and Pettersson,U. TITLE sequence of inverted terminal repetitions from different adenoviruses: demonstration of conserved sequences and homology between sa7 termini and sv40 dna JOURNAL Cell 17, 705-713 (1979) STANDARD simple staff_review COMMENT L-strand is given, from 5' to 3'. BASE COUNT 25 a 15 c 46 g 30 t ORIGIN 1 atcaataata taccttattt gggaacggtg ccaatatgct aatgaggtgg gcggagtttg 61 gtgacgtatg cggaaatggg cggagttagg ggcggggttt ggcggtaggc gtggct // LOCUS ADGSA7PE1 2338 bp ds-DNA VRL 27-JUL-1990 DEFINITION Simian adenovirus SA7P, leftmost 2338 bp. ACCESSION X01027 KEYWORDS E1A protein; E1b protein; inverted terminal repeat; overlapping genes; transforming gene. SOURCE Simian adenovirus SA 7P. ORGANISM Mastadenovirus s7 Unclassified. REFERENCE 1 (bases 1 to 2338; enum. 1 to 2338) AUTHORS Dekker,B.M.M., Konings,D.A.M., Denisova,T.S., Gibadulin,R.A. and Van Ormondt,H. TITLE The nucleotide sequence of the leftmost XhoI fragment (6%) of simian adenovirus SA7P JOURNAL J. Gen. Virol. 65, 1699-1708 (1984) STANDARD simple automatic COMMENT **map: leftmost XhoI fragment (6%); EMBL features not translated to GenBank features: key from to description SITE 1 183 inverted terminal repeat (ITR) of SA7P PRM 409 416 TATA E1A region CAP 440 440 cap site E1A RNAs MSG 440 838 "12S" E1A RNA MSG 440 838 1st exon "12S" E1A RNA IVS 839 1175 intron "12S" E1A RNA MSG 1176 >1445 2nd exon "12S" E1A RNA MSG 440 1057 1st exon "13S" E1A RNA IVS 1058 1175 intron "13S" E1A RNA MSG 1176 >1445 2nd exon "13S" E1A RNA SITE 1445 1450 AATAAA E1A PRM 1508 1514 TATA E1B CAP 1538 1538 cap E1B RNA's This virus, an independent isolate from Moscow, USSR, is highly related to simian adenovirus 7 (SA7). The E1A sequences are 95% homologous to those of SA7 see $<K03322>$ J. VVrol. 5:399-409(1985) FEATURES from to/span description pept 487 838 E1A 21K protein (part 1) 1176 / 1401 E1A 21K protein (part 2) pept 487 1057 E1A 29K protein (part 1) 1176 / 1401 E1A 29K protein (part 2) pept 1569 2111 pot. 21K E1B protein pept 1874 > 2338 pot. 55K E1B protein BASE COUNT 514 a 535 c 725 g 564 t ORIGIN 1 catcatcaat aatatacctt atttgggaac ggtgccaata tgctaatgag gtgggcggag 61 tttggtgacg tatgcggaag tgggcggagc aaggggcggg gcgagaggcg gggcttttgg 121 taggcgtggc cgggcgtggg aacggaagtg acgtcggggg cgcgccggac gtgacgtgtt 181 ttgaggcgtt ttaaaccgga agcaaggtat tttaaacgct tgcagcgcaa ttttgccggt 241 tttggcgcga aaactgataa aaagcggaag ttcggttaat cattaatttt tacgataggg 301 aggaatattt accgagggcc ggtgaacttt gagcgatgac gcggtggttt cgttacgtgg 361 caccaccacg cgactgctca aagtccccgt ttattgtcta ggtgagggta tttaaaccgg 421 ctcagaccgt caagaggcca ctcttgagtg cccgcgagaa gagctttctc ctctttcgct 481 gcgaaaatga gacacttggc gttggagatg atgtctgaac tgctggattt aggactggat 541 accatcgata gctggctgca caccgaattc gcgccggtac cgacgggggt gagtcataac 601 atgtcgctgc acgaaatgta cgacctggac gttaccggcc aggaggatga gaacgaagag 661 gcggtagacg gtgttttttc cgatgcgatg ctcctggccg ctgaagaggg agtagaaatg 721 cctagtcttt attctccggg acctctggtt gggggaggtg agatgcctga gctacagcct 781 gaggaggtag atcttttctg ctacgaagat ggcttccctc ccagtgactc tgaggaaggt 841 gagcattcgc aggtggagac ggaacgtaaa atggcggagg tggcggcggc aggtgcggcg 901 gcggccgtcc gcggggagca agatgacttt cgcttagact gtcctagcgt acctggccat 961 ggctgtagct cctgtgacta ccatcgcaaa aatagcggct gtcctgaaat tctgtgctcg 1021 ctgtgctatc tgagggctaa cagcatgttt atttatagta agtgaatttt tctactaact 1081 ttctcgctgt gtgtttgctc gctcgttcgc taactgctgg ggtgcttggt gttgggactg 1141 agcttacagg tatttttctc tgtaattttc cacaggtcca gtttctgact ctgagccaga 1201 cgagcctgac tccacaacag ctgattcaaa tcatggcagc ccgccaaccc ttcgctgcac 1261 cccacccagg gacttgccgc gacctgtgcc agtgaaggcc tctcatggca agcgcccagc 1321 ggtgaacagc ttgcatgacc tcatagagga ggttgaacaa acagtacctt tggacctgtc 1381 cctaaagcgc tctaggagca attagggtca taaaacccct ccccttcccc ttaagttata 1441 aggaaataaa aagattaact ggattctttg tgcctgcttt gtttgttcgt gggcggtcct 1501 tggggattat aaaaggggtg agtcaggtta taagggtact taggcaacgc tcaggtaaca 1561 tcgcctccat ggatctccga acggcgcttc agacttttga gagcacccgc cgcttgctgg 1621 agctctgttc caatagaacc tcttttttgt ggaggtggtt atttggaact ccgctcagcc 1681 ggctggttag gcaggtgaaa ttagaatacg agaaggattt tgaaagaatt ttagatcagt 1741 gtcccggggt gtttgagtcc ctggagctgg gctatcataa ggtttttgag gagaagattg 1801 taaaggagtt ggatttttct tctcccggtc gggcggtcgc ggctgtggcc tttgcttcct 1861 acctgctgga tagatggaac acccggaccc acctgtcccc ggggtaccag atggattaca 1921 tcagcctgaa cctgtggaag ttttggttgc gccggcgggt ttacaattac tcgcgggggc 1981 tgcctcagct agggccggcg gcgccgctgg cgaggcaggg gtcgcagcag gaggagcagc 2041 agcagcggca ggaggaggag caggtgcagg aggagatgag gtccggcctg gaccctccaa 2101 cggagaactg atggcgcagg tggccggcgc ggcggatgcc agcgggccgc cttccaagcg 2161 ccctaagaat gaggcagagc agatgcagga gggtttaact cagttgaccc tgagtctcat 2221 taacaagtat cggcctgaga ccgtgtactg gaatgaactg gagcaggaat ttcagcatgg 2281 ggagatgcac ttgcagttcc actttagctt tgagcaatta aaaactcact ggctcgag // LOCUS ADGVARNA 850 bp ds-DNA VRL 27-JUL-1990 DEFINITION Simian adenovirus type 7 (SA7) virus-associated (VA) RNA gene. ACCESSION M14082 KEYWORDS virus-associated RNA. SOURCE Simian adenovirus type 7 DNA. ORGANISM Mastadenovirus s7 Unclassified. REFERENCE 1 (bases 1 to 850) AUTHORS Larsson,S., Svensson,C. and Akusjaervi,G. TITLE Characterization of a low-molecular-weight virus-associated (VA) RNA encoded by simian adenovirus type 7 which functionally can substitute for adenovirus type 5 VA RNA-1 JOURNAL J. Virol. 60, 635-644 (1986) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Larsson, 15-JAN-1987. Simian adenovirus type 7 VA RNA stimulates protein synthesis in a transient expression assay and can functionally substitute for VA RNA-I during lytic growth of human Ad5. The SA7 genome encodes only one VA RNA species. FEATURES from to/span description RNA 181 352 virus-associated RNA (5' and 3' ends +/-2bp) ORF 105 < 1 (c) ORF A cds ORF 366 > 850 ORF B cds BASE COUNT 140 a 260 c 338 g 112 t ORIGIN 150 bp upstream of XbaI site; 30 mu on viral map. 1 gtagatgtag cgcgacatcc aggtgaggcc ggcggcggtg gtggcggcgc gggtgaactc 61 gcggacgcgg ttccagaggt tgcgcagggg ggcgaagcgt tgcatggtgg tcacgctctg 121 gccggtgaga cgggcgcagt cctgcacgct ctagacggaa acagagagag gcgtcaccga 181 ctcctctccg tagcttgggg gttaggtcgc aagggtgcgg tggcggggaa ccccggttcg 241 aaaccggccg gatccgccac tcccgacgcg gcggccccgc gtccacgacc ccaccagagg 301 ccgagaccca gccgcgacgc gcgcacccca aatacggagg ggagtctttt ggtgcttttt 361 tgtagatgca tcccgtgctg cggcagatgc gccccacgcg gggggcctcc tcgcgggaag 421 aagagctgga ggcgcaggag gaggcgcgcg gtggcgccgc gggccccgcc gagctggagc 481 tggaggaggg cgaaggcttg gcgcggctgg gcgccggagt gccggagcgc cacccgcggg 541 tgcaactggc tcgcgaccag cgccaggcct acgtgccgcc gcagaacctg tttagggacc 601 ggagcgggga ggaggcggag gaaatgcgtg actgccgttt tcgggccggg cgggagctgc 661 gggcgggctt ggagcgcgag cggctgctgc ggcccgagga ctttgagccc gaggagcgca 721 acggtatcag tccggcgcgc gcgcacgtct cggccgccaa cctggtgacg gcctacgagc 781 agacggtgaa cgaggagcgc aactttcaaa agagcttcaa caaccacgtg cgcaccctcg 841 tggccccggg // LOCUS ADZITRL 150 bp ds-DNA VRL 27-JUL-1990 DEFINITION adenovirus 31 inverted terminal repeat & repl ori left end. ACCESSION J01958 KEYWORDS origin of replication; terminal repeat. SOURCE ad31 (adenovirus 31). ORGANISM Mastadenovirus 31 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A. TITLE conserved sequences at the origin of adenovirus dna replication JOURNAL J. Virol. 44, 530-537 (1982) STANDARD simple staff_review BASE COUNT 38 a 29 c 48 g 35 t ORIGIN 1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag 61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt 121 gtgggagccg ggcgcgccgg atgtgacgcg // LOCUS ADZITRR 150 bp ds-DNA VRL 27-JUL-1990 DEFINITION adenovirus 31 inverted terminal repeat & repl ori right end. ACCESSION J01959 KEYWORDS origin of replication; terminal repeat. SOURCE ad31 (adenovirus 31). ORGANISM Mastadenovirus 31 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 150) AUTHORS Stillman,B.W., Topp,W.C. and Engler,J.A. TITLE conserved sequences at the origin of adenovirus dna replication JOURNAL J. Virol. 44, 530-537 (1982) STANDARD simple staff_review BASE COUNT 40 a 28 c 47 g 35 t ORIGIN 1 catcatcaat aatatacctt acactggact tgagccaata ttaaaatgaa gtgggcggag 61 tgaatagtta attgaccgta ggcgtggttt gcaagtttgc cgaagccgga tgtgacgcgt 121 gtgggagccg ggcgcgccgg atgtgacgaa // LOCUS HSKGAORA 3726 bp ds-DNA VRL 27-JUL-1990 DEFINITION Herpesvirus of turkey glycoprotein A (gA) gene and ORF of unknown function. ACCESSION M33303 KEYWORDS glycoprotein A. SOURCE Herpesvirus of turkey DNA. ORGANISM Gallid herpesvirus type 2 Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 3726) AUTHORS Kato,A., Sato,I., Ihara,T., Ueda,S., Ishihama,A. and Hirai,K. TITLE Homologies between herpesvirus of turkey and Marek's disease virus type-1 DNAs within two co-linearly arranged open reading frames, one encoding glycoprotein A JOURNAL Gene 84, 399-405 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 207 1676 glycoprotein A pept 2118 2756 ORF2 BASE COUNT 1016 a 884 c 815 g 1011 t ORIGIN 1 tcgtgcgcat cgcccatggc tggcggaacg ctctttcgaa ccgtgaataa aactttgtat 61 ctactaaaca ataactttgt gttttattga gcggtcgaaa acaatgagga gctgcaattt 121 aaagctaacc gcatacgccg ggcgggtaaa gaccatttta taccatatta cgcatctatc 181 gaaacttgtt cgagaaccgc aagtatatgg tttccaacat gcgcgttcta cgcgtactgc 241 gcctgacggg atgggtgggc atatttctag ttctgtcttt acagcaaacc tcttgtgccg 301 gattgcccca taacgtcgat acccatcata tcctaacttt caacccttct cccatttcgg 361 ccgatggcgt tcctttgtca gaggtgccca attcgcctac gaccgaatta tctacaactg 421 tcgccaccaa gacagctgta ccgacgactg aaagcactag ttcctccgaa gcgcaccgca 481 actcttctca caaaatacct gatataatct gcgaccgaga agaagtattc gtattcctta 541 acaatacagg aagaattttg tgtgacctta tagtcgaccc cccttcagac gatgaatggt 601 ccaacttcgc tcttgacgtc acgttcaatc caatcgaata ccacgccaac gaaaagaatg 661 tagaggttgc ccgagtggcc ggtctatacg gagtaccggg gtcggattat gcatacccta 721 ggaaatcgga attaatatcc tccattcgac gggatcccca gggttctttc tggactagtc 781 ctacaccccg tggaaataaa tatttcatat ggattaataa aacaatgcac accatgggcg 841 tggaagttag aaatgtcgac tacaaagaca acggctactt tcaagtgata ctgcgtgata 901 gatttaatcg cccattggta gaaaaacata tttacatgcg tgtgtgccaa cgacccgcat 961 ccgtggatgt attggcccct ccagttctca gcggagaaaa ctacaaagca tcttgcatcg 1021 ttagacattt ttatcccccg ggatctgtct acgtatcttg gagacgtaac ggaaacattg 1081 ccacaccccg caaggaccgt gacgggagtt tttggtggtt cgaatctggc cgcggggcca 1141 cactagtatc cacaataacc ctcggaaact ctggactcga atctcctcca aaggtttcct 1201 gcttggtagc gtggaggcaa ggcgatatga taagcacatc gaatgctaca gctgtaccga 1261 cggtatatta tcacccccgt atctctctgg catttaaaga tgggtatgca atatgtacta 1321 tagaatgtgt tccctctggg attactgtga ggtggttagt tcatgatgaa ccccagccta 1381 acacaactta tgatactgtg gttacaggtc tctgcaggac catcgatcgt tatagaaatc 1441 tcgccagtcg gattccagtc caggacaact gggcgaaaac gaagtatacg tgcagactaa 1501 ttggatatcc gttcgacgtg gatagatttc aaaattccga atattatgat gcaacgccgt 1561 cggcaagagg aatgccgatg attgtaacaa ttacggccgt tctaggactg gccttgtttt 1621 taggtattgg tatcattatc acagccctat gcttttacct accggggcgg aattaagatt 1681 aaccatcgta tgtgatataa aaattattaa gtgttataac cgatcgcatt cttctgtttc 1741 gattcacaat aaataaaatg gtattgtaat cagcaccatc gcattgtttc gtagatgact 1801 catgttcagt ccgcgtgatg tcaaaaatac gtatttttgg tatcacgcag cggccaaaat 1861 gcccattatg ttatttttac tccaaacgcg gtatttaaaa catcgggacg tacatcatgt 1921 ggcgcacgtt aatcgtatac ggtgccgcta cattaaaaat cgcaagtctc cgaatatcaa 1981 gctcacggcc aaaacgtcgg taataatctt acgcatcgaa tgtgatacgg ataccgtaca 2041 atcgctgagt agatttccta tatagttact cagtagtgat acacaatcac aaaatcgctg 2101 gggtatatca tataagaatg atgtcgccca cccctgaaga tgatcgcgat ctcgttgtgg 2161 ttcgtggacg tctccgaatg atggatagcg gcacggaaac agatagagag caacgacatc 2221 cacgtacgac ttggcgatcg atctgttgtg ggtgtacgat aggaatggta tttaccatat 2281 tcgttctcgt agcggcagta ttgttgggat cactattcac tgtttcatac atggccatgg 2341 aatcgggaac atgtcccgat gaatggattg gtttgggtta tagttgcatg cgcgtggccg 2401 ggaaaaatgc aactgatctt gaggcgttgg atacatgtgc tcggcataac agcaaactta 2461 ttgacttcgc aaacgccaaa gttctggttg aagctatcgc cccattcggt gtgccaaatg 2521 cagcatatgg ggaagtcttc cggttaaggg acagcaaaac cacgtgtata cgacctacca 2581 tgggaggacc cgtgtcggca gactgtcctg taacatgtac cgttatatgt cagcgaccca 2641 ggcctctaag taccatgtct tccatcatta gagatgcccg cgtgtatctt catttagaac 2701 gacgcgatta ttatgaagtc tacgcctctg tcctctctaa tgcgatgagt aaataaaaac 2761 gcacctctaa cggttactgt gttttattta tccaatcaca ccatagacat tataaacata 2821 atatggatct ttatttcata taatgtatta tacagaccgc gcatcgcgtg taaaatatta 2881 ttcgctggac actttcagcc tacgtaaaat atgttctacc ttggacactg gtatatctcc 2941 acgcattgtt ttttcatgaa agtatttggt tgtcggagta gacaccctgc cagattccga 3001 aactggaggg tcattgctcg ccaaagaagg ccccccagac ccatcggtct tacgaattgc 3061 accagtcaga ccgtttccga tgttcgcaca ttgagaactc gaccccgccc ccgcaaattc 3121 agtcaaacca ggaaccgtat cgccacagcc acattcgaca cgatcgccta attgatgtgt 3181 gggcgtactt cccgacttgt tcgagttttt atcttgcttc cgaatagatt tcataagccg 3241 tgcattaaca tcgcttattc gctttggcag taattttatc cccagtagag ccagggcgtt 3301 cttagacagt ttaacatcat cggttgcagc gcggccgaac caagtattta tgctgcactc 3361 gagagccccc catgacacgg gcaacatgct cggaaacaaa ttcccaagct gatgttcaaa 3421 ttggtcttgc tgcaacaggg cctcttttaa tatagtattt tccgtattac cgtccaacca 3481 tatcacatat cccctctgag ttaaattaat taggtactga caatgatggt tgagaagtga 3541 aaccaacgaa accactgcta gtatggcacg tgagttccga tcaccatcaa agtccaccag 3601 gcgggtttgc gaccataata cgctggtttg tagtaagggg gcgatgacat tgcctataat 3661 tcgacaaagc aatgatgtcg aactccattt gtctgatggt aaggaggaaa tttcttccat 3721 gaattc // LOCUS HSKTK 2039 bp ds-DNA VRL 27-JUL-1990 DEFINITION Herpesvirus of turkey thymidine kinase gene, complete cds. ACCESSION M26659 KEYWORDS thymidine kinase. SOURCE Herpesvirus of turkey (strain FC126) DNA, passed in chicken embryo fibroblasts. ORGANISM Gallid herpesvirus type 2 Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 2039) AUTHORS Martin,S.L., Aparisio,D.I. and Bandyopadhyay,P.K. TITLE Genetic and biochemical characterization of the thymidine kinase gene from herpesvirus of turkeys JOURNAL J. Virol. 63, 2847-2852 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.L.Martin, 03-AUG-1989. FEATURES from to/span description pept 608 1540 thymidine kinase (EC 2.7.1.21) BASE COUNT 570 a 458 c 437 g 574 t ORIGIN 649 bp upstream of XbaI site. 1 atcgatgaaa gtctgtacat atagccttct tttttaatgg tgtcgaacgt ataactacaa 61 gcccgcgtct cgcgacaaac actaaaacgg ggcaaatgat agtatgatca gatcccctgg 121 gggccatgcg ctcgatcaac gttttagact ccagcaactg cttagtccct gttgcgcgtt 181 gctcattctt gctggccgtc ttgaggttgc ggtaaaatct gcaagttttc agttcgatta 241 tgacacagac attgttacag ccttccacca ataacccgcc agataaggtg aacatgcaaa 301 tgcaatctgg ccttctgcgc cccaggtcta cttcaaaggc caatcgaaac gccgacgccc 361 ctttaagtag ttcagaaggc aggatcttgc ttaatatctt ctgtaatgtt ctaccaactg 421 tattccccgc cgacgactgg tcaaagtttt cagcagcaag ctttttgtaa aaacgattat 481 gaccacggac acccgctttt agcaatcctg ccataaggtg gtttcgccgc gtgcttgcct 541 cgaagacaat tgccagctaa tccagcatta ccatatttcc ttggcttgca tttggatctg 601 cgcgtcgatg gcattgccga gaagaccgcc cacgttaacg cgagtttatc tagacggacc 661 gtttggtata ggcaaaacgt ctatactaaa cgctatgccc gaccacacgc ccgatggggc 721 tcctatattg aaagtgtacg aaccaatgaa atattggaga tgccagtcta ccgatttggt 781 ggtagctgcc aacgaaacgc cagaacgtag gcgtggtgga gctttatcac gattccaatc 841 tgacatgatc atggcatcta tacaagccag atttgccgat ccatatttgc tttttcacga 901 acggttatca tctaaatgta gaggaaaaat agaaatatgc gatactccag caattatatt 961 aatgctggat aggcaccctg tggcggcgat attatgtttc ccaatcactc gctatttact 1021 tggagaatat tctttggaaa tgttgattag ctctataata agacttccgt tggaatcccc 1081 cggatgcaac ctgacagtca caatccttcc cgacgaaaag gaacacgtta ataggatttg 1141 ttcaagagat agaccgggtg aaacggcaga tagaaatatg ctcagaacac tcaatgccgt 1201 atacgcatct ttggtggaca cggttaaata cgcaaatcta acatgccctt acgagaaaga 1261 aagctgggaa atggaatggt tgggacttcc ctggtttgaa gagtcattac ttgaagaatt 1321 catctcgcgt ccccgccctg ttatttgttc gagaactcga atgccgctgg accgaactct 1381 cctggccatt tttaaacgga aagagctgtg tagcgaaaat ggggagctgt taactcagta 1441 ttcttggata ttgtggggat tactgactaa actacacacc attaatgtcg aattatttga 1501 cattagcggt atgtcacgtc gagaatgcgc cacgctataa tgcatactat gccggagaga 1561 ttgtctactc tcgctagctg gaatgattta tgcgagcttg aagatgatgt aatttcctat 1621 aataagggaa tgtgtaacga ggttggagcg tctcgataat tcttcttaat ctgctggtat 1681 tggttactgc cataacttaa tattggtcca tgctagaata gtcatacgct acgatctgtt 1741 gctatatatg actatcgcca aactgttaaa ccgcgaagaa tatatttcat ataaacctaa 1801 gggcccctca gtctgatttt ttgtgaaaac gtgtatacat gaagttttac tgcataatcc 1861 gtttcatgat catagcgaat ctttattcat cttaccaaat atcgcttcca ggcacatatc 1921 catcgcaaat attgcttgac atgaagaact cgccgctcgt acgctttaat atatcgacgc 1981 gtgattataa agacgagaca ctctggatac ggaaaaattc gacatttgtt tatatcgat // LOCUS LACHSP 3095 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid pTR2030 (from L.lactis) aborts phage infection (hsp) gene, complete cds. ACCESSION M30192 KEYWORDS . SOURCE Plasmid pTR2030 (from Lactococcus lactis lactis (strain ME2)) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 3095) AUTHORS Hill,C., Miller,L.A. and Klaenhammer,T.R. TITLE Nucleotide sequence and distribution of the pTR2030 resistance determinant (hsp) which aborts Bacteriophage infection in Lactococci JOURNAL Appl. Environ. Microbiol. 56, 2255-2258 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Hill, 29-NOV-1989. FEATURES from to/span description pept 1105 2991 hsp gene (aborts phage infection) BASE COUNT 1129 a 439 c 518 g 1009 t ORIGIN 1 ataagagtca gaatgtgacc gtcataatga ttacgaggat agctgcaacg cttggtttcg 61 agttaattgt cgtgcaacga ttgacggaga attgaaaaat ttcaaagttc acgatgttga 121 gacttacgat aaaaagaaaa atagatttca tcggagattg tctgatgcct tggtcccaat 181 aatatcttca gaagatgttg aatttgaagc cgagcaattt ttgaaattgt atttcccagt 241 agctatggaa attcctcagc gtattgaccc gcttttgata gctgaaaaaa tgggacttac 301 ggttgagtac catgaaattt cagaagatgg gaacattttt ggtcaaattt attttcatga 361 tgctcttctt gacgggaaag aaatcaaggc aaaaactatc ctgattgatc ctcgtgtcat 421 cgaaagccgt gggattggtg gacttaataa cacaataatg cacgagtgtg tccattggca 481 taaacatagg cttgcatttg aattggttag gcttttccaa ccagagcttt cgaatattac 541 cacaactaaa gaagagttcg atgggctcat tgagaaaaat atgactccaa cagattggct 601 ggagattcaa gctagaaaga ttacgcctaa aatcttgatg cctaagaaga tgtttaaaca 661 agaagttgaa acgtttatgc gtcctgatgg tggaagtgga atagtggacc agctcctcat 721 aatcgaggga actatttcag agcttgctag ttttttcacc gtatcaaaat tatctgcaaa 781 aattcggatg gttgagctgg gatatgagat tgcgataggt gcttcaaact atgttgatgg 841 gcatcctgta cccccacact cttggaagca gggagctgta tctgctaatc agacttattc 901 aattggtttt gttgatgcaa ctatagagac attaaagaac ccaagattac tcgtcgcaat 961 aaaaaaagaa cttaacctca tatttcatag ggattgaagt tcttgttttt tatttttggt 1021 agctaaaatg gtatccaatt aataattagc tattttattt tggtaaaata gaataaatag 1081 atttttagaa cagggagtag gtaaatgata actttgcaac accaagattg ggaacgtgca 1141 gtcaatatga ttaaaaacat tccaccaagt gccaaaaata aatactttca aacttttcct 1201 ttctttttgt taagtgaaac cagctgggaa gagttacttt ctgaaaattt tttctactca 1261 tatataaagt caggagagtt tctgacttat caagaaaacc tttcattcta cgataggaca 1321 attcaaaaga gccatggagc ataccgacaa actcgaattg tatctccaat catctatata 1381 tttttaatag caattgcttc tcaagtggaa agaatatatg ttgaaaagcg tactaatgat 1441 atgtcagtat atttttcagg ttcttttgaa aaagaaaaaa acactgcaca ttataaacaa 1501 agttataaca cttatatgac tgaacttaat gcctgtcaag aagagtttga ttattatttt 1561 caaacagatt tttcaacatt ttttcatcta gtagatacag acaacttatt caataaaata 1621 gaccgattag atccgaaaag tgcattagtc tactcttccc taataaaaat gattggacaa 1681 ggaagaatgc ccatagttga tggaaattca ggtctttctt ttttaaatac tgtagtttat 1741 ctagatgact ttgataaaga aattattgat tccttaaaaa cgatagttga aatcgaaagt 1801 tttaaactag ttcgatatgt agatgactta catattttca taaaatgtgc aaataaagat 1861 ttagattttt taaattataa agtatataat ttgttatgcg aaaaagcaac gaaacatcat 1921 cttgagataa attcttcaaa aacaaaatca tttactccca caagtgaact aagcactaaa 1981 atgaatacag atttgtataa tttctttgtg tataacgaag atgttgattt tgagcaatat 2041 ttttcaaaaa ataccttaat agaatttcta gataaattaa ataatatgtc tgttaatgct 2101 gatttttcag agtacgaaaa agaagtcctg tatacattgg aaaaccctga aatagtatct 2161 gacggaagtt atatattaaa cgctattgtt tacaataaaa gtacgtggtc acaggattat 2221 gatataaaaa ataaaattag tttattagta aatagtaatt atagaaaact aagatattct 2281 gcaaaagcac tcataactct tgtcttgaat acaagagatg gtgatatcat aaaaggacta 2341 cttaataatt tatttactac attcaaaaat ggtacaaatg atattattga tgaaattata 2401 ttaattgaat acttggtaca aagaaagttt aatcataaag atttaatgac tatcttgaaa 2461 gctgacgacc atggtattaa ggaatatatt aaggcatatc aaacttctga tttcataaaa 2521 agtctagaaa aaaataaagt tattttttat actaatcaga aagaagttta tcctttaata 2581 agtaaggata aaatactcaa tttcatatat tttagagcta agtattttga atccttagac 2641 ctagttcttg aatcctttgc atattataaa aactattttg ataggtttgt tgctcacgct 2701 atgttctgta caggaattga ctctggaaga aaacctaatt ataagctata ttatacagaa 2761 ggaaagttaa tagatgggtt aaaacaacta aattttttat cttctgatga gattacaaaa 2821 ataattaatg aagcacataa aattagaaat agtaaccccg taagtcactc tagtgctggc 2881 ttacttcaaa acgaagattt tagtagatat agagttaagt caagtcttaa tgacttgaaa 2941 attatcatag aacaactttc aacattgcta caaaataaaa acagattata aaatattatt 3001 atctgtttta tttttttcaa aagaatcaaa gtttattaga tttttcaact gtcctgttat 3061 ctcctcatcc gcatttggat aaagatgact ataaa // LOCUS PH15RER 2016 bp ds-DNA PHG 27-JUL-1990 DEFINITION Bacteriophage phi-15 right early region containing genes 17, 16.5-16.9, complete cds. ACCESSION M28830 KEYWORDS DNA replication protein. SOURCE Bacteriophage phi-15 DNA. ORGANISM Bacteriophage phi-15 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 2016) AUTHORS Benes,V., Arnold,L., Smrt,J. and Paces,V. TITLE Nucleotide sequence of the right early region of Bacillus phage phi-15 and comparison with related phages: Reorganization of gene 17 during evolution JOURNAL Gene 75, 341-347 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 203 664 gene 17 peptide pept 666 992 gene 16.9 peptide (put.) pept 989 1306 gene 16.8 peptide (put.) pept 1308 1700 gene 16.7 peptide (put.) pept 1697 1861 gene 16.6 peptide (put.) pept 1858 1971 gene 16.5 peptide (put.) BASE COUNT 744 a 374 c 382 g 516 t ORIGIN 1 aaagtagggt acagcgacaa cataccccat ttccccattg accgactatc ttcgacaaca 61 atctaacaac taaaacacaa ctatatacct atactaaata acaacatcaa tttgtcgaaa 121 agggtagaca aactactgtt taacatggta tactataata gaagtaagga agtaagacaa 181 ccaatcatag gaggaattac acatgaacaa ctaccaatta actattaacg aggtaatcga 241 gatcatcaac cgcaatacag acattaataa gttagtcgct aaaaaggaca acttatatcc 301 aacagatttg tatgacttag acaaacaaca attaatcgct atcattctca acagcgattt 361 cgcattatct agcatcaaaa gagcattatt agaggtaact gtcgaagagc taagagaaca 421 ggacaatgat gacgatttag atgaaataga taacgagtta tacgagggcg cagaggcggc 481 tgatgttcca cgtgaaacaa tagtgaaagt attcgaggca gacaaatcaa tagtaacgtt 541 caacggtgag aaactaaaac attatgttaa tgttgatgac aatagctcat ctgttgatga 601 agtaaagaag atcgccaagg aaatatcaga acatgacttt aacgaaaaca aggagaacaa 661 gtaacatgag cgtacaactt aatgcattta cattcatctt agaacgccgt ggatggcgca 721 tggtatgcta cgaacaatta acaacgaacg ggacacgcat attacatttc tatctcaaag 781 ataacccaac gttctttgct acatactcat cacaattctt atcagacacc aaaatgataa 841 gacgctttgc atcatggagc gggcaattac ttgaaggttc aaactctgtg ttctggacta 901 acataacacc atttgagcca attgatgaag aaacagcaga agacatcaag aatctcgaca 961 aagtcgttga aggaatgaat tttacgttat gattgacatt attgtaaaag aggacaagcg 1021 cttaataact gttcaaacac cagagggaga cgaagtattt tacactttgt ctttctcaga 1081 cgagcataag ttactaaaac gatcaagtgc cagactaaga aacaacatat atgccattgg 1141 tgtagcaaat atcagatggg tgttagttga catggataac atgatactta gtgagtatat 1201 gcatcacgtt gacatcctaa aagacataga caggaaaatg agacaattag ggtatatagt 1261 catttcagaa tggcaacacg caaataaaaa agggacgagg agataatatg gaagctattt 1321 tgatgatcgg tgtaatcaca ttgtgcgtta tattcctttt atcaggacgc aataataaaa 1381 agaaacaaga aataagggag ctagaagatt accttgaaga cctcaatcaa agaatcgtgc 1441 aacgaacaca gatacttagt gagcttaacg aagtcatcac aaacagaagt gttgataagt 1501 cagtcaatat gtcagcttgt gaaatcgccg tccttgacct ctatgagcag tcaaatatcc 1561 gcattcctag tgatatcatc gaagatatgg ttaatcaacg tttacaatca gaacaagatg 1621 tgttaaacta tatagagaca cagcggacat actggaaact ggagaatcag aaaaaactat 1681 atcgggggtc attgaaatga agttgctcac tcacacgtgt cattactgta gcttctcttt 1741 cttcactcga aagtttgatg tatttggtgc aataaccaag aaagatactc ctgttgtttt 1801 ctgccccgca tgtgggaatc aatccctttc agtatcacac attgaggagg agatcatatg 1861 aatcagaaag aattccaagc cgttttagac tggatgttgt cacacactat tatacaattc 1921 cacgaataca actatatgct acaaaagagc ctaccgttcc tcagacgata ggcttttctt 1981 ttgattgttt cacgtggaac atcactgtat acgcat // LOCUS STLENV 4122 bp ss-RNA VRL 27-JUL-1990 DEFINITION Simian T-cell leukemia virus, pol-env-pX-3' LTR region. ACCESSION M11373 M11374 KEYWORDS env gene; pol gene. SOURCE Simian T-cell leukaemia virus genomic RNA. ORGANISM Simian T-cell lymphotropic virus type I Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 4122) AUTHORS Watanabe,T., Seiki,M., Tsujimoto,H., Miyoshi,I., Hayami,M. and Yoshida,M. TITLE Sequence homology of the simian retrovirus genome with human T-cell leukemia virus type I JOURNAL Virology 144, 59-65 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 311 1777 env protein BASE COUNT 905 a 1456 c 741 g 1020 t ORIGIN 1 catttaaacg tattaaccca ttgccacaaa acccgatggc aacttcacca ctccctccgt 61 ctgcaaccta tcccagagac acactccccc atcagtaaac aaacccattg gtattatttc 121 aagcttcctg gtcttaatag ccgccagtgg aaaggaccac aggaagctct ccaagaagcc 181 gccggcgctg ctctcattcc ggtaagcgct aattctgccc agtggatccc gtggaggctc 241 ctcaagcaag ctgcatgccc aagacccgtc gggggccccg ccgatcccaa agaaaaagac 301 caccaacacc atgggtaagt ttcttaccac cttactttta ttcctccaac tccgcccccc 361 cattgtctgt gatcacagtc ccagctgctg cactctcaca gtcggagtct cctcatatca 421 ttctaaaccc tgcaatcccg cccaaccagt ctgctcatgg accctcgacc tgctagctct 481 ttcagcagac caagccctac aacccccctg ccctaatctg gtgggttact ccaactacca 541 tgctacctat tccctatatc tattccctca ttggattaaa aagccaaacc gaaatggcgg 601 aggctattat tcggcctctt attcagaccc ttgttcccta aagtgccctt acctaggatg 661 ccaatcatgg acctgtccct acacaggagc cgtctccagc ccctattgga aatatcagca 721 agatgtcaat ttcactcaag aagtctcacg cctcagcctt aatctccact tttcaaaatg 781 cggtttctcc ttctcccttc tggtcgatgc actaggatac gaccccatct ggttccttaa 841 caccgaacct agccaactgc cccccactgc ccttcctcta ctcccccatt ccaacttgga 901 ccacatcctc gagccttcca taccatggaa gtcaaaactc ctaactcttg tccaattaac 961 tctacaaagc actaactaca cttgtattgt ctgcatagat cgtgccagct tatctacttg 1021 gcacgtctta tactctccca acatctctgt tccatcctcc ccctctactc ccctccttta 1081 cccatcgtta gcgcttccag ccccccatct gacattacca ttcaactgga ctcactactt 1141 tgaccctcag attcaagcca tagtctcctc cccctgtcac aactccctca tcttgccccc 1201 cttttctctg tcacctgttc ccgccccagg gtcccgttcc cgccgggcag taccggtggc 1261 ggtctggctc gtctctgccc tggccatggg aaccggagtt gctggcggga ttaccggctc 1321 catgtccctc gcctcaggaa aaagtctcct acatgaagta gacaaagata tttcccaatt 1381 gacccaagca attgtcaaaa accacaaaaa tctactcaag attgcacaat atgctgccca 1441 aaacagacga ggccttgatc tcctgttttg ggagcaagga ggactatgta aagcactgca 1501 agaacaatgt tgtttcctaa atattactaa ttcccatgtc tcaatacttc aagaaagacc 1561 cccccttgaa aatcgagttc taactggttg gggcctcaac tgggaccttg gcctctcgca 1621 atgggcccga gaggccttac aaactggaat cacccttgtc gcattactcc ttctggtcat 1681 ccttgtaggg ccatgcatcc ttcgccaact acgacaactc ccctcgcgtg tcagataccc 1741 ccactactct cttataaacc ccgagtcctc cctataaacc tagcagttac tgcgaccaca 1801 ttgcttgcag cctcccctgc caatagttaa cctcccccat caaatcctcc ttttcctgca 1861 acaacctctt ccgttcggcc tccaaggagt ctacctcact ctccagctgt ctagcataac 1921 catcaactcc caactccttt attttttctt tcctagcatc acgctgtctc gccctcttag 1981 ccccttgttc tgccagtctc cgcttgcgct caagccgctt ctcctcctcc cgcctcctgc 2041 gccgagcctt cttctcttcc ttccttttca ggaactcggc aatctgttcc tcctcctctt 2101 tctcccgctc tctttttcgc ttcctcttct cctcagcccg tcgctgccga tcacgatgcg 2161 tccccccgcg aggtggcgtc tcctccctta gagggacccg ttgctgctgg ccattacatt 2221 cctcttcgag ggatagtaaa ccgtcaagca cagactcctc ctccgtcctt tgtttgttta 2281 agtcctcttc tagggataat agtccgtcca ccaagtcttc caccagcagg tcctccgggc 2341 agggaacagg taaacatcga aacggcccta cacatacaaa attaatcata cttattatca 2401 gcccacttcc caggttttgg acagagcctt ctttatggat acccagtcta cgtatttgga 2461 aactgtgtgc aaggcgactg gtgccccatc tctgggggac tatgctcggc ccgcctgcac 2521 cgtcacgccc tactggccac ctgtccagag catcagatca cctgggaccc caccgatgaa 2581 cgcgttatcg gctcagctct gcagttcctt atccctcgac tcccctccct ccccacccaa 2641 agaacctcta agaccctcaa ggtccttacc ccgccggcta ctcatacaac ccccaacatt 2701 ccaccctcct tcttccaggc cgtacgaaaa tactccccct tccgaaacgg atacatggag 2761 cccacccttg ggcaacaact cccaaccctg tcctttcccg accccggcct ccggccccaa 2821 aacttgtata ccctctgggg aaactccgtt gtctgcatgt acctctacca actttccccc 2881 cccatcacct ggcccctcct gccccatgta attttctgcc acccaggcca acttggggcc 2941 tttctcacca acgttcccta caaacgaatg gaagaacttc tctataaaat tttccttaat 3001 acaggggcca caatcattct acccgaggac tgcttgccca ccaccctatt ccgacccact 3061 agggcacccg cctcgctgac agccaggcaa aatggcctcc ttccgttcca gtcaaccctt 3121 actacgccag gccttatttg gacatttacc gatggcacgc ccatggtttc cgggccctgc 3181 cctagggacg gtcagccatc tttggtgttg cagtcctcct cctttatatt tcacaaattt 3241 caaaccaagg cctaccatcc ctctttccta ctctcacatg gcctcataca gtactcgtcc 3301 tttcataact tacatctcct ctttgaagag tacaccaata ttcccatttc tttacttttt 3361 aacaaagaag aggcaaatga tactgaccat gagccccaaa tgctccccgg gggcctaaaa 3421 cctcccaatg gaaaacattt ccgcgaaaca gatgtctgaa aaaggtcagg gcccagacta 3481 gggctctgac gtctcccccc ggagggcagc tcaacacaag gccctgacgt gtccccctgg 3541 ggacaaatca tgagcccaga cctccgggaa gccaccgaga accacccatt tcctccccat 3601 gtttgtcaag ccgtccccag gcgttgacga caacccctcg cctcaaaaac ttttcatggc 3661 acgcatacag ctcaataaaa caacaggagt ctataaaagc gtggagacag ttcaggaggg 3721 ggctagcttc tgctctacac gcgcccgccg ccctacctga ggccgccatc cacaccggtt 3781 gagttgcgtt ctgccgcctc ccgcctgtgg tgcctcctga actgcgaccg ccgtctgggt 3841 aagttcggag ctcgggtcga gaccgggcct ttgtccggcg ctcccttgga gcctacctag 3901 attcggccgg ctctccacgc tttgcctgac cctgcttgct taactccaca tctttgtttc 3961 gttttctgtt ccgcgccgct gcagatcgaa agttccaccc ctttcccttt cattcacgtc 4021 tgactgccgg cttggcccac ggccaagcac cggcatcctc actggctcgg agccagtagc 4081 agcccatttt acaactctct ccagagagaa gtttagtaca ca // LOCUS HUMIGHDA 368 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human Ig H-chain mRNA V-D-J region, partial cds. ACCESSION M33858 KEYWORDS diversity region; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region; variable region V-H4. SOURCE Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA, clone C. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 368) AUTHORS Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J. TITLE Intraclonal diversity in the V-H genes expressed by CD5- chronic lymphocytic leukemia-producing pathologic IgM rheumatoid factor JOURNAL J. Immunol. 144, 1526-1530 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 368 Ig H-chain V4-D-J region (AA at 1) BASE COUNT 73 a 110 c 112 g 73 t ORIGIN 1 caggtgcatc tacagcagtg gggcgcagga ctgttgaagc gttcggagac cctgtccctc 61 acctgcgctg tctatggtgg gcccttcagt ggttactatt ggagctgggt ccgccagccc 121 ccagggaagg gaccggagtg gattggggaa atcaatcata gtggacgcac cacctacaac 181 ccgtccctca cgagtcgagt caccatgtca ctggacacgt ccaagaacca gttctccctg 241 aagctgacct ctgtgaccgc cgcggacacg gctgtttatt actgtgcgag aggcccttgc 301 gaggcatact gtactgatga tgccccgcaa ggcggtgcgc ggtacttcca gcactggggc 361 cagggcac // LOCUS HUMIGHDB 260 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human Ig H-chain mRNA V-D-J region, partial cds. ACCESSION M33859 KEYWORDS diversity region; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region; variable region V-H4. SOURCE Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA, clone A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 260) AUTHORS Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J. TITLE Intraclonal diversity in the V-H genes expressed by CD5- chronic lymphocytic leukemia-producing pathologic IgM rheumatoid factor JOURNAL J. Immunol. 144, 1526-1530 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 260 Ig H-chain V4-D-J region (AA at 1) BASE COUNT 57 a 79 c 75 g 49 t ORIGIN 1 atccgccagc ccccagggaa gggactggag tggattgggg aaatcaatca tagtggacgc 61 accacctaca acccgtccct cacgagtcga gtcaccatat cagtggacac gtccaagaac 121 cagttctccc tgaagttgac ctctgtgacc gccgcggaca cggctgttta tttctgtgcg 181 agaggccctt gcgaggcata ctgtactgat gatgccccgc aaggcggtgc gcggtacttc 241 cagcactggg gccagggcac // LOCUS HUMIGHDC 260 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human Ig H-chain mRNA V-D-J region, partial cds. ACCESSION M33860 KEYWORDS diversity region; immunoglobulin heavy chain; joining exon; mu-immunoglobulin; processed gene; variable region; variable region V-H4. SOURCE Human peripheral blood lymphocyte (isolate LES), cDNA to mRNA, clone B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 260) AUTHORS Roudier,J., Silverman,G.J., Chen,P.P., Carson,D.A. and Kipps,T.J. TITLE Intraclonal diversity in the V-H genes expressed by CD5- chronic lymphocytic leukemia-producing pathologic IgM rheumatoid factor JOURNAL J. Immunol. 144, 1526-1530 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 260 Ig H-chain V4-D-J region (AA at 1) BASE COUNT 58 a 82 c 74 g 46 t ORIGIN 1 atccgccagc ccccagggaa gggactggag tggattgggg aaatcaatca tagtggacgc 61 accacctaca acccgtccct cacgagtcga gtcaccatat cagtggacac gtccaagaac 121 cagttctccc tgaagctgac ctctgcgacc gccgcggaca cggctgttta ttactgtgcg 181 agaggccctt gcgaggcata ctgtactgat gatgccccgc aagccggtgc gcggtacttc 241 cagcactggg gccagggcac // LOCUS HUMHLADQA 1468 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC class II HLA-DQA1 mRNA, complete cds. ACCESSION M33906 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human lymphoblastoid cell cell line SPL, cDNA to mRNA, clone pII-alpha-7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1468) AUTHORS Jonsson,A.-K., Andersson,L. and Rask,L. TITLE Complete sequences of DQA1 and DQB1 cDNA clones corresponding to the DQw4 specificity JOURNAL Immunogenetics 30, 232-234 (1989) STANDARD simple staff_review FEATURES from to/span description pept 44 811 MHC HLA-DQA1-alpha protein /hgml_locus_uid="LY0151Z" /nomgen="HLA-DQA1" /map="6p21.3" signal 1119 1124 poly-A signal (pot.) BASE COUNT 356 a 381 c 309 g 422 t ORIGIN 1 cacagctcag agcagcaact gctgaggctg ccttgggaag aggatgatcc taaacaaagc 61 tctgctgctg ggggcccttg ccctgaccac cgtgatgagc ccctgtggag gtgaagacat 121 tgtggctgac catgttgcct cttatggtgt aaacttgtac cagtcttacg gtccctctgg 181 ccagtacacc catgaatttg atggagacga gcagttctac gtggacctgg ggaggaagga 241 gactgtctgg tgtttgcctg ttctcagaca atttagattt gacccgcaat ttgcactgac 301 aaacatcgct gtgacaaaac acaacttgaa catcctgatt aaacgctcca actctactgc 361 tgctaccaat gaggttcctg aggtcacagt gttttccaag tctcccgtga cgctgggtca 421 gcccaacacc ctcatctgtc ttgtggacaa catctttcct cctgtggtca acatcacatg 481 gctgagcaat gggcactcag tcacagaagg tgtttctgag accagcttcc tctccaagag 541 tgatcattcc ttcttcaaga tcagttacct caccttcctc ccttctgctg atgagattta 601 tgactgcaag gtggagcact ggggcctgga cgagcctctt ctgaaacact gggagcctga 661 gattccagcc cctatgtcag agctcacaga gactgtggtc tgcgccctgg gattgtctgt 721 gggcctcgtg ggcattgtgg tgggcactgt cttcatcatc atccgaggcc tgcgttcagt 781 tggtgcttcc agacaccaag ggcccttgtg aatcccatcc tggaagggaa ggtgcatcgc 841 catctacagg agcagaagag tggacttgct acatgaccta gcattatttt ctggccccat 901 ttatcatatc ccttttctcc tccaaatgtt tctcctctca cctcttctgt gggacttaaa 961 ttgctatatc tgctcagagc tcacaaatgc ctttgaatta tttccctgac ttcctgattt 1021 ttttcttttc tcaagtgtta cctactaaga gatgcctgga gtaagccacc cagctaccta 1081 attcctcagt aacctccatc tataatctcc atggaagcaa caaattccct ttatgagata 1141 tatgtcaaat ttttccatct ttcatccagg gctgactgaa accgtggcta agaattggga 1201 gactctcttg tttcaagcca atttaacatc atttaccaga tcatttgtca tgtccagtaa 1261 cacagaagca accaactaca gtatagcctg ataacatgtt gatttcttag ctgacgttaa 1321 tatttctttc ttccttgtgt tcccaccctt ggcattgcca cccacccctc aattaaggca 1381 acaatgaagt taatggatac cctctgcctt tggctcagaa atgttatagc aaaaatttta 1441 aaataaaaaa gtaagtctgt attaattg // LOCUS HUMHLADQB 896 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC class II HLA-DQB1 mRNA, complete cds. ACCESSION M33907 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human lymphoblastoid cell cell line SPL, cDNA to mRNA, clone pII-beta-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 896) AUTHORS Jonsson,A.-K., Andersson,L. and Rask,L. TITLE Complete sequences of DQA1 and DQB1 cDNA clones corresponding to the DQw4 specificity JOURNAL Immunogenetics 30, 232-234 (1989) STANDARD simple staff_review FEATURES from to/span description pept 46 831 MHC HLA-DQB1-beta protein /hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1" /map="6p21.3" BASE COUNT 185 a 252 c 263 g 196 t ORIGIN 1 gtccaagctg tgttgactac cactactttt cccttcgtct caattatgtc ttggaagaag 61 gctttgcgga tccctggagg ccttcgggta gcaactgtga ccttgatgct ggcgatgctg 121 agcaccccgg tggctgaggg cagagactct cccgaggatt tcgtgttcca gtttaagggc 181 atgtgctact tcaccaacgg gaccgagcgc gtgcggggtg tgaccagata catctataac 241 cgagaggagt acgcgcgctt cgacagcgac gtgggggtgt atcgggcggt gacgccgctg 301 gggcggcttg acgccgagta ctggaatagc cagaaggaca tcctggagga ggaccgggcg 361 tcggtggaca ccgtatgcag acacaactac cagttggagc tccgcacgac cttgcagcgg 421 cgagtggagc ccacagtgac catctcccca tccaggacag aggccctcaa ccaccacaac 481 ctgctggtct gctcagtgac agatttctat ccagcccaga tcaaagtccg gtggtttcgg 541 aatgaccagg aggagacaac tggcgttgtg tccacccccc ttattaggaa cggtgactgg 601 accttccaga tcctggtgat gctggaaatg actccccagc gtggagacgt ctacacctgc 661 cacgtggagc accccagcct ccagaacccc atcatcgtgg agtggcgggc tcagtctgaa 721 tctgcccaga gcaagatgct gagtggcatt ggaggcttcg tgctggggct gatcttcctc 781 gggctgggcc ttattatcca tcacaggagt cagaaagggc tcctgcactg actcctgaga 841 ctattttaac tgggattggt tatcactttt ctgtaacgcc tgcttgtccc tgccca // LOCUS HAMRPS14 5300 bp ds-DNA ROD 27-JUL-1990 DEFINITION C.griseus ribosomal protein S14 (RPS14) gene, complete cds. ACCESSION M35008 KEYWORDS ribosomal protein S14. SOURCE C.griseus (haplotype EmtB wild type) ovary transformed fibroblast cell line CHO DNA. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (base 1 to 5300) AUTHORS Rhoads,D. and Roufa,D. JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (sites for [1]) AUTHORS Diaz,J.-J., Rhoads,D. and Roufa,D.J. TITLE Genetic analysis of a vital mammalian housekeeping locus using CHO cells that express a transfected mutant allele JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by D.J.Roufa, 08-JUN-1990. Author address: D.J.Roufa Kansas State University Division of Biology Ackert Hall Manhattan, Kansas 66506 FEATURES from to/span description pept 1977 2125 ribosomal protein S14, exon 2 (first expressed exon) 2576 2737 ribosomal protein S14, exon 3 4074 4150 ribosomal protein S14, exon 4 4724 4791 ribosomal protein S14, exon 5 pre-msg 183 4841 ribosomal protein S14 mRNA and introns (5' end +/- 4 bp) IVS 220 1974 ribosomal protein S14 intron A IVS 2126 2575 ribosomal protein S14 intron B IVS 2738 4073 ribosomal protein S14 intron C IVS 4151 4723 ribosomal protein S14 intron D signal 4819 4824 poly-A signal mut 4783 4783 c in wt; g in mutant Emr-75 [ref. 2] null Arg-150 -> Gly with ThaI rflp. BASE COUNT 1315 a 1121 c 1415 g 1449 t ORIGIN 1 agatctggtc acagcgacac gcagcacccg gagtccttaa taagccatct tggtcgccgg 61 caggcttagc ctccttggaa accgcaatag aggcttcttc cgggagccgg gattccctcc 121 gtgacctcag aggtacactt ccggaatacg gaagtgaccc gtctgcccgc cccttccttc 181 tctctctttc cggtggagaa gtctggagac gacgttcagg tagggaggcc cggcggccca 241 caatcgtgcg gcatcctctg gccaggggcg tctgcgggct tggggtcgcg gtttgtgcgc 301 ggtaaccgag ccaggtgcgt gatggtgacg ggcggctgtc cgctcgccgc gatccacccg 361 gcctggggct gcgccgtgtc cgggaggcgg ccttctggcc tgagatcgga aggtctgagg 421 tggaagtgcg gggcctgcct gggccttgcc gtcacgccac agtcgtcgcc cgagcgttcg 481 gggagcgtgt ccctagtcgc gggtgggccg tcaccggcac cccattagat tcttaggggc 541 cgtcgacgac ctgcacctaa cctcatgccc ctcatgtgga aaatacgagc agcgttcgta 601 tctgcccctt ctgtaaaccg aagcgtaaaa tgggatgtca gaaaacctgg tcacgcagga 661 ttagagtgct tctaacccat ggtggttacc cctccgagtc ggacacattg cctgttgggg 721 tgccccctag tagagtagga ctgatgccat atgtttacat ttctctgaat agccctctcg 781 tctccactga ggacggcaaa gctacactga acagatgtca tgtcttatgg aatcgtgttt 841 gttagcctaa tttgagggta tgagcaggtg actacatcct ggacctgccc ttgtgctttt 901 ggcaaataag tcccaaatag tgtgtgaggg aggcactaca gagatcccag gtgtggttga 961 aacttttaag gcctaggagg aactttgtgg agtaggagca tcttggttct aggaggagta 1021 taagacaggc tttggtgcag tattgtcctg gacagactta gcctcgggtt ggtcaaaggc 1081 atgcctgcag ctttggcaaa aaaaaaaaaa gcctctttgc ttgtgtattt tccctgggtc 1141 ctgtgactgt acttacatta ttgactaggc aggaaggggt tacatgtcag gtgtatttgc 1201 agttcataac aatagcaaaa ttagagttat gaagtagtag caatgacttt ttgtgtgggg 1261 ggcaccacaa tatgagaatt gtactaaagg gtcacagcgt taggttgaga accaaagggt 1321 agactcactt gatgctgaac actgaatttt aggtatggat atctttacca ttaaacttcc 1381 tggaaggcat accctgacca tcctgggact tgctacgtag accataacta cgatatcctc 1441 ctgcctgtga ctcccaggtg ctggggctaa agttatacac aagcacacat tgctgtggca 1501 gtgcttttta atgtttatta ctagtaaagg gtgaaagatt tggttctaaa cagtgagagg 1561 ctgtcagatt aagaattgtg tgtggacagg gagtccgtta atctgcagtt ggcttcagtt 1621 gtgtcatgtt aatcaaataa acttcccagc ttaaggagag ggagagggga gttgtgaatg 1681 gggctttgcc ccaacacctg agttcagcat cctctatttt ggataactac ttgtgaaagt 1741 ttgaggtgca tattgttcat tgctggaatc ttaggctcat cctgaaaatg cttggagctt 1801 ttgtttagaa attgtctcct ctcttggcgc ctccctttta gttacaactc agtaattccc 1861 caagtacaaa tatgttggat attagagagc caaaggagct tgggttttcc tttagtaggc 1921 taaagtattt ggggatgttt tgtctgttaa catggaggtt ctccttccac tcagaaatgg 1981 cacctcgcaa gggaaaggaa aagaaggaag aacaggtcat cagcctcgga cctcaagtgg 2041 ctgaaggaga gaatgtattt ggtgtctgcc acatttttgc atctttcaac gacacctttg 2101 tccatgttac tgatctttct ggcaagtgag tacttgggga catgaagaca taaaaccaag 2161 cctcacaggg actgaggtgg gagaaaaagg cataggagtt tatgggattc cctctaaaga 2221 attactgatt ttcttatttt cccataaatt ataagagaaa acagttgaat gtagctgagg 2281 attcatccac agaaaatcac cgcaaatagg aagtggaagc agaggatcag aaactcaatt 2341 atcctcagct acacgcacat gtaaaatata tgcattatgg aagtgattat ggggataaaa 2401 tgcaaaacta attgttgtat taacagggag tttgtctaga tagggtcctt ggacccctgt 2461 aattatacac aactcctgtc ttgttcagaa ccaggatgtg tgcttgttgg ggttcagcaa 2521 gtgtttcctg tgtccttgtt ggaaaatgaa ggcatggatc tgcttttgtc tctagggaaa 2581 ctatctgccg ggtgactggt ggaatgaagg taaaggctga cagagatgag tcttctcctt 2641 atgcagccat gttggctgcc caggatgttg cccagaggtg caaggagctg ggcatcactg 2701 ccctgcatat caaactccga gctacgggag gaaacaggta tggaagtcta gggctggaca 2761 tttgtaagtt cactgggctt gggggtggat ctgagatttg agtctctgct gtgtctgaga 2821 gcaagccgtt ttaggaatgt atgtatgttc ttcgtcaggc atggtagctg atacctgtaa 2881 tcccagcact ccggaaccaa tataggagga ggatctaaag tttggtgcta acctgggcta 2941 cacagtgagc ctcaaaaata agatagtttg ttcctttatg tttgtccctg tagagtgctg 3001 cttttatttt ttttttttta aagcatacat gagactgact cttccccagc accaggtaga 3061 ctgggtgtgg tagtatacac ctgttgtctc agaaagtgga ggtgaggaga gctagagttt 3121 caagttggtt attggttgca tattgagtag gtggccagct tgagctgaac agtgatgctt 3181 aaatgatgaa ctgggggtgc atagtccaga tagagacaaa acttcaagag aagcatcttg 3241 gggtgagaaa gaagttgggc tttacaggga tgattggtgt aagatatgat ctaagaagca 3301 gggatggagc ctctggggtc ttaagtttta caaaacagag caagggcttg ggcatgaaaa 3361 ctagaatagg tggctactga gtggaaagat actgcatagg aagccctaag gaagttacat 3421 actggatgta ggagctggga agtaggcatc acagcaaggg gaggacagcc acaggccagg 3481 tgactatccc ctcccccccc cacacacaca catgtacact tttaaaggat ttgtaggcat 3541 ggtggctcac acctttaatc ccagcatctc ttggggggcg tccccccaca aaagtgggaa 3601 aaattacatg cgtatgttgc ctgtgtgcat gcatgcgtgc attatcggtt ccctgtggag 3661 gacagaagag gaccatgaat ctcttggaac tggtgttgct gagccacctc tccaggcctc 3721 agttagcagt tttgaaaggg tttggttttg ttttgttttt tcgagacagg gtttcttctg 3781 tgtagctttg gagcctatcc tggcactggc tttggagacc aggctggcct caaactcaca 3841 gagatccgac tgcctcccga gtgctgggat taaaggtgtg caccaacgca ggatttggtt 3901 ttgttgtgag attgtataat atatgaaaac cgtagcagcc tacaaagtct agaattttgt 3961 gtaatcccct cacagtgtgc agatctttaa ctatagcccc accaccattc agaagtgaat 4021 taaaaccatg tgaacattcc cactgacttg accaacaatc cttcattttt taggaccaag 4081 acccctggac ctggagccca gtcagccctc agagctcttg ctcgttcagg gatgaagatt 4141 gggcggattg gtaagtgacc cttctagcta acattgggtt tattttgaag cattgacccc 4201 ttccaaacta tgtgatgtgc atattggttc aagcttgtct ggtgactggt gcagttaact 4261 tttgttaacc tgtaaaagac attgagagcg ttgatttctg taaatgcccc ttaactgtta 4321 tattttccca tttgcaattg ggaagactta aaaaacactt gtgaatctgt cctagcagcc 4381 tttttccctg ctagctaaat gttaagggta tgcccttggg acagtccaag tgaataattg 4441 cacacacctg gcctaaagag caaacagcca caggtgctca gtgtaggggc tctgagttag 4501 aaacttgaag gagtccttcc cctctgatgt ttgtgtgcct tctcagagcc tgtgttgtcc 4561 tcgtccaatg tgggttgaga tgaggaaatg acaggtgagg tgggcccttg tttattgacc 4621 actggtgata catttgggga agaataaagt ctgctagtta gtctaatggc tagggaggct 4681 gtgtttatgt agctatgggc aatgacattt tttcttcttg cagaggatgt cacccccatc 4741 ccctctgaca gcacccgaag gaagggtggt cgtcgtggtc gccgtctgtg aacaggactt 4801 ctcaaattat tttctgttaa taaattgctt tgtataagct attttggttc tgatgtttgt 4861 ttgttttgag gcaggatctc tctccatgtc tatgtagaag agtctggcct ccaactcaga 4921 tccacatacc tgtctctgcc tcccaagtac tggggataaa ggaatgcacc accacacccc 4981 ggtgacctta ggggccaccc agggcaacac caaggcagtg ttttgagggg atgttaagat 5041 tgcatgtagg agcttctaat gtaggttggg ggttggcaaa cctgggtttg caggctggct 5101 ttctgagaat gggaacctga gagaggagat gctgaggctg ccactctggc attaatttgc 5161 cagccagtat tttcaaagtg gaactgttag tagcccatgc tgtcatggga cagctaaaca 5221 ccacacacac acacacccca agcagggtag tagagatcac aagttggaag ccagtttaat 5281 atggtctact tggggaattc // LOCUS AFATFDCD 7956 bp ds-DNA BCT 27-JUL-1990 DEFINITION A.eutrophus plasmid pJP4 chlorocatechol oxidative (tfdCDEF) operon, complete cds; and 2,4-dichlorophenol hydroxylase (tfdB) gene, complete cds. ACCESSION M35097 X07754 KEYWORDS 2,4-dichlorophenol hydroxylase; chlorocatechol 1,2-dioxygenase; chloromuconate cycloisomerase; tdfF gene; tfdE gene. SOURCE A.eutrophus (strain JMP134) plasmid pJP4 DNA. ORGANISM Plasmid pJP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1598; enum. 1 to 1598) AUTHORS Perkins,E.J. JOURNAL Unpublished (1988) Dept of Biochem,Univ of Wash.,Seattle,WA 98195 STANDARD simple automatic REFERENCE 2 (bases 1 to 339) AUTHORS Perkins,E.J., Bolton,G.W., Gordon,M.P. and Lurquin,P.F. TITLE Partial nucleotide sequence of the chlorocatechol degradative operon tfdCDEF of pJP4 and similarity to promoters of the chlorinated aromatic degradative operons tfdA and clcABD JOURNAL Nucleic Acids Res. 16, 7200 (1988) STANDARD simple automatic REFERENCE 3 (bases 1 to 7956) AUTHORS Perkins,E.J., Gordon,M.P., Caceres,O. and Lurquin,P.F. TITLE Organization and sequence analysis of the 2,4-dichlorophenol hydroxylase and dichlorocatechol oxidative operons of plasmid pJP4 JOURNAL J. Bacteriol. 172, 2351-2359 (1990) STANDARD simple staff_review FEATURES from to/span description pept 337 1104 chlorocatechol 1,2-dioxygenase (tfdC) pept 1101 2213 chloromuconate cycloisomerase (tfdD) pept 2288 2992 tfdE product pept 2989 4053 tfdF product pept 4398 6194 2,4-dichlorophenol hydroxylase (tfdB) conflict 681 682 gg in [3]; cc in [1] BASE COUNT 1716 a 2209 c 2403 g 1628 t ORIGIN 1 aagctttgcc ccaatatcgc gttccaaggc ctggatctga cgagtaagcg gtggctgcga 61 tatgtgcatc ctctgtgccg ctgttccgaa acctcccgcc tccgcgaccg cgacgaagta 121 tttcaactgt cttatttcca tgttccgtcc cgttgcgctg cctgttacaa ggcgattgtg 181 agcaagcacc tgcggtctcg caactaggga aagacatacc gaaaccgtat ggtctaggcg 241 gagaaaaggt attggacggt atgggattgc cgtctctatt atgactgtgc ggggcaattg 301 tcgcccacgt gttcatgttt tcatgacgga ggcaaagtga acaaaagagt caaggatgtt 361 gtcgatgcga tcgtcgcggc ggtacagcgg gtgctcgacc agaaggaggt tactgaggcg 421 gaataccgaa ctgcggttca ttacctcatg caagtagccg aacagcgtga aaccgctctt 481 ctttgcgatg tttttttcaa tagcaccgtg gctgcgacga aggctcgtat tagcgaaggt 541 tcaacgcctg ctatcgaggg accctactat cgcgacgacg cccctctggt cgatgaccgg 601 ctcaagactt acgacacgga cgaccacaag cctttgctta tccagggaac ggtcaaagcg 661 gtcgacggga gcgttgtcga ggacgtgacg attgatgtct ggcattcgac gcccgatggg 721 aagtacagcg gtttccatga cgacatcccg actgattttt atcgagggaa gctcagggtg 781 ggcaccgatg gcagcttccg cgtgcgcaca acgatgccgg tgccgtatca gatcccggat 841 cagggtccca cgggcgcatt gctcgaaacc atgggtggtc actcgtggcg tcccgctcat 901 gtacatttca aggtgaaggc gccgggatat gaaacgttga ccacgcagta ctacttcgaa 961 ggtggagatt ggatcacgga cgactgttgc aacggcgtcc agtctagtct gattactccc 1021 gatattgtgg aagagggtgt ccgcctgatg aacatcaact tcgttatcga gcccgcgcgt 1081 gcgcaggcgg gggcaaaccc gtgaagattg acgcgattga agcagtgatc gtggatgtgc 1141 cgaccaagcg gccgatccag atgtcgatca ctaccgtgca ccagcagagc tacgttatcg 1201 tccgggtgta ttcggagggg ctcgttggtg tcggcgaggg tggaagcgtt ggtggtcccg 1261 tctggagcgc agagtgtgcg gagacgatca agatcatcgt ggaacggtat ctcgcgcccc 1321 acctcctcgg aactgatgcg ttcaacgttt caggtgcact gcaaaccatg gcgcgtgccg 1381 tcaccggaaa cgcctctgca aaggctgcgg tcgagatggc gttactggat ctcaaagctc 1441 gagcgttagg cgtatcgatc gccgagttac ttggcgggcc gttgcgcagt gcgattccga 1501 ttgcctggac attggcgagc ggagatacga aacgcgatct cgattctgcc gtcgagatga 1561 ttgaaagacg acgacacaat cgcttcaaag tcaagcttgg cttccggtcg ccccaagacg 1621 atctcatcca tatggaggct ttgtcaaata gtctcggatc gaaggcctac cttcgcgttg 1681 acgtgaatca ggcttgggac gagcaagtgg cgtccgtcta cattcctgaa ctggaggcgc 1741 ttggcgtgga actcatcgaa cagccggtcg gccgcgaaaa tacgcaagcg ttgaggcggc 1801 tctccgacaa caaccgcgtg gccatcatgg ccgatgagag cctgagtacg ttggcctcgg 1861 cattcgatct cgcgcgcgac cgaagtgtgg atgtcttttc gctgaagctt tgcaacatgg 1921 gaggggtctc ggcgacgcaa aagatagcag cggtcgcgga agcaagcggg attgcatcgt 1981 atggcggaac aatgcttgac tcgacgatcg gcacatcggt tgcacttcag ctctattcta 2041 cggttccatc gcttccgttc ggttgcgaac tgatcggtcc cttcgtgttg gcagacacgc 2101 tgagccacga gccactcgag atccgggatt acgaactgca ggttcccact ggcgtaggtc 2161 acggcatgac gcttgacgag gacaaggtgc gccaatacgc acgcgtcagc taggcaaaac 2221 gcgcagcgca ggccgcggct gtggatcgcc tcgaatgcaa ctgcgcatgt gtaactggag 2281 actaaatatg ttatcagacg gcgttgagat cacgtcgcgc tcgggtggtc gctttggtgc 2341 ctacctcgga aagccgacga cggactccgc acccatcgtc gtgatcgcgc aggagatttt 2401 cgggatcacg cctttcataa gagaaaccgt ggaatggctc gttggtgcgg gctttggttg 2461 cgtgtgtccg gatctgtact ggagacaggc gccgaatatc gagcttgatg caaacgtacc 2521 atcggaacgg gaacaggcgc ttgcgttgtt tcgcgacttt gacatggagg caggagtcaa 2581 tgacctttca tgcgccattg aatacgctcg tgcgctccct ttctcgaacg gtcgtgtcgc 2641 tgtagtcggg tactgcctgg gaggtgcgct ggcgtttgat gtggcggcgc gatcattggc 2701 cgattgctca atcggctatt acggcgtggg acttgaaaaa aaggtctcac tggtgccagc 2761 tattacacga ccggcgatgt tccacatggg taccaaagat cactatgtca cagaagaggc 2821 ccgtagcatt ctcgaagagc atttcggtcg aaacaaaaat ctgagtctgc actggtatcc 2881 agtggggcat tcatttgcac ggtcatccag cccaaatttc gatcaggcgg cgacaactgt 2941 ggccaatgct cggacgctcg aactgcttgc aatgttgaag gacccttcat gaagaagttc 3001 acgcttgact acctgagccc gagggtcgtc ttcggggcgg gcactgcttc tgcattgcca 3061 gatgaaatag gacgccttgg cgcacgccgg cccttggtat taagcagccc ggaacaacgc 3121 gagttagcga aggatatcgt ccgtccgata ggtgacaggg tagctggata tttcgatggc 3181 gcgacgatgc atgttcccgt cgacgtcatc cagaaagccg agcgggcttt taacgatact 3241 gacgccgact caatcatcgc gatcggggga ggatcgacca ccggactcgc aaaaatcctt 3301 tcgatgaacc ttgacgtccc aagtctggtt ataccaacga cctatgccgg tagtgaaatg 3361 actaccattt ggggtgtcac ggaaggcgga atgaagagga ccggccgcga ccccaaggtg 3421 ctaccgaaga ccgtgattta tgatccattg ctcacggtcg atttgccgct tgctatctcg 3481 gtgacgagcg ccttgaatgc gatcgctcac gccgcagaag gtctgtactc ggccgacctc 3541 aatcccgttc tcgagaccat gtgtaagcag ggcatatgcg ccttgttcga tgcaatcccg 3601 cgcctggtgg caaagccgac tgacgccgaa gcgcgtacgg atgccctttt tggggcatgg 3661 atgtgtggca ctgcactgtg ccacttgggc atggggctac atcacaaact ctgccacacg 3721 cttgggggaa cccttaatct tccccacgcg gagacacatg caatcgtact accacacgca 3781 ctggcataca atctgccgta cgccgcgcca gctgagcgac tgcttcagga agtcgccggc 3841 agtagtgacg tcccgagcgc gctatatgat ctcgccagaa atgctggagc accactcagt 3901 ctcgccgaaa tcggtatgcg gcctgaagat attccgaggg tacgcgacct cgcgctaagg 3961 gaccaatatc cgaatccgcg tccgctggaa tcggacgcat tggaaacatt gttagtcaat 4021 gcgtttcgtg ggcgaagacc ggatttcaaa taatgtgacc tgcactccgc gtttagtacg 4081 gtagcggtga agagcccgtt ccaaagtcga catcgggtct tcaagctaaa gcgaagcaca 4141 tgaatttgca tccactgttc cttgtggagc atcgccagcc ttcgcgcgaa aaaggccgac 4201 tgtactggag gtggctcgga ttttctttgc gcgaagtggc tcatttttac tttgcgcgca 4261 acaaaggctc ttcagcctta ctattcatcg cgcgttaaag acggtgaagt ttaaaaaatc 4321 gacttggccg aacgcgagat gaacaatccc tgtttcttct cacaaagaga gagacataac 4381 aattgacgga ggaagacatg gcattgacga tcgaaaccga cgtgctggtc gttggcaccg 4441 ggccggcggg cgcttcggcg ggggcactgc ttgcgcgata cggtgtcagg acgatgctca 4501 ttaacaagta caactggacg gctccgacgc cgcgtgccca catcaccaac caacgcacaa 4561 tggagatcct gcgtgatctc ggtctcgaag ctgaagcccg cctctacgcc gcgccgaatg 4621 acctaatggg cgagaacacc atctgcgcct cgttggccgg cgaggagttc ggtcggattc 4681 gcacctgggg gacggatgtt cgccggcgcg ccgattacga cgaatgttcg ccgacctcca 4741 tgtgcgacct gccacagaac tatctcgagc cgattctcgt caagtcggcc gcactcgacg 4801 gctgcaaggt gcgcttcgac acagagtatc ttggacacga acaggatgct gacggcgtgt 4861 cgtcgcgctt gcgcgatcgt ctgaatggcg aagagtttac cgtccggtcg aaatatctga 4921 tcggcgccga cggcgccaac agccgcgtcg tgtcagacct ggacctgccg ctcgaaggta 4981 ccatgggcaa gtccggttcc atcaacttgc tgttcgaggc tgatctcgac cgctacgtcg 5041 cgcatcgtcc atcggtcctt tactgggtca tccagcccgg ctccgatatc ggcggtctcg 5101 gcatcggcgt ggtgcgcatg gtccggccgt ggaacaaatg gcttgcgatc tgggggtacg 5161 acgtcgagca gggcccgccg gagatctcgg aatccttcgc ccgccgcatc gtccacaatc 5221 tgatcggcga tgacagcgtg ccgctgaaaa ttgagggcat ctcgacctgg acggtcaacg 5281 acatgtacgc gacccggttg cagcaaggac gggtcttctg cgccggcgac gcggtccacc 5341 ggcatccgcc gaccaacgga ctcggctcaa acacctccat ccaggacagc ttcaatctcg 5401 cctggaagat cgccatggtg ctgaacggca cggccgacga gagcctgctg gacacctaca 5461 cgatcgagcg ggcgccgatc gcgaagcagg tggtctgcag ggcgaacaag tcgcttgagg 5521 atttcccgcc gatcgcgatg gctctggggt tgccgcaggc caagagcgcc gacgagatga 5581 agagcaacat ggctcgccgc aaggaacctg ggcccgaggc gcaagcgcag cgcacgcggt 5641 tacgcgaggc catcgccggg accaattacg tttacaacgc ccacggcgtc gaaatgaatc 5701 agcgttacga cagtcccgcg atcgtggcag acaattcgcc ggacgaagtc ttccgcgacg 5761 tcgagctcta tcaccaggcg tcgacgcggc ctggagcgcc catgccgcat gtctgggtct 5821 atgcgagtgg tgatggacac cgtatctcaa caaaggatct ctgcgggaag ggcaacttca 5881 ctctctttac cgggattggc ggcgcagcct ggcaggatgc cgcggcggcc gtgagccggc 5941 aactcggggt ggcagtgacg gtgcgcatca ttgggcctgg ccaggcctac gaggatcatt 6001 acggcgactt cgcgagaatt tccgagatca tcgacaccgg cgccatcctg gtgcggccgg 6061 atttccacgt cgcctatcgg gcgacgtcct tgccggccga tgcggcaggc gacctcgtgt 6121 ccgccatgag gcggatcctc ggacgccagt cagagaggtc gagcgcgctg cgcgttacgt 6181 ctcgagctat ataaggagtg gcgtcgcgat ctattcgagg tgagacggaa gcgctcacct 6241 cgggttgcgg gaggggaagc gtgatcttgc gcagcgtgag atggcccgtg gcgacgcggg 6301 aaggttgtcg cgcgattaag gcaaggcggt gcgcaacgac gtgacgtgtt ttctggaccg 6361 gacgtaactg cagatcatct aatttgggaa gttcgtaccg gtgtcgaccg ccttgctgat 6421 gatgctatta agcacgtgag cagacgacgg gccggtgcag ccaacgttga aaaaagagcg 6481 cttgattgca atgtcgacga cacgatctac attactgtca tttgaacgac cgcttgctcg 6541 tgcggattca accggtggat gcaacagttt gctgaaatcg ttcggcgggc gttccgtggt 6601 tcagggtttt cctcggtcgc ccatccagcc gcctcgccac ggtattgagt agccaattgg 6661 tacaccgaaa tgtaggtgcc gttggggatg tctcgatagc ccgttcgcat tttcattcga 6721 cccgcgctgc caaggatgct gggggttcag taactgcagg ctgaactgcg ccaggtgaac 6781 caaacgctga tcgtcaagca aaatgaaata acccagttga acagagaaag cgcgcggctc 6841 gtcgccgaag ccggcgtcgc gacaaggcat ctacgcgagc tgcaagcgca cagcaagtag 6901 ttacagaata cgttgaacca gtcgctcgcc gaccacgcac gcaccaaggc cgagcacgat 6961 gcgctgctaa caccgcgagg actgaggcga ctgaactgac gcggctgcgc gaagcacgtg 7021 aatcgctgac cgccgatctg acgaagctca cagcgcagtt cgacgagcag cagttgctcg 7081 gggactatcg gatgcgtctg ggcctcacgg acgcggcggg atgaaagcgc cagattccgc 7141 atcaaaagga tggggatcta cgccggaacc ccagaaaatt ccgtcgttcc acattgtgaa 7201 cattaatcgg cttattccag ccgaagaatg ttggagcgac gtccggtgag tgacatgacg 7261 gtctcggtca cacggaccat caggaccgtt acgcggggcc cagttcgatt acattgtggg 7321 acaggtcgag acaatcttcg cagtcccggc aaacccagtg cagcatcggc ccgaccggat 7381 ctggatggtc gctgaacgtt gtcagcgccg ggtcgtacta gcatcggacg tgctcgagcg 7441 cagtatccgc tcgagattag actcgatgcg gtccaggctg cattcaccga tgccggactt 7501 gaacaccgtc acaccgcctg acgggttgac cgcgaagaag aaaaacgcct tcgtgtcgac 7561 catgtgcgac aggacttgtc ctggcgatga agttgctcgg attgatccag cgcattgaag 7621 atgcggtagc cgtaaaactc taagccgaag tgtacgacct cgaagtcgtc ggtgccgcgg 7681 atcgagtggc caaagttgaa accgctgcgt gttacgtcgt tcggaatgtc atccaccggc 7741 cgcagtgcca ggcaagcata gtgcgtctgg ccctcggcca cgagtcccgt cgcaaagcag 7801 ggcgcgcgct cggccagcaa ggcctcgaca aaattccggt tcatgcggag cgttggcagc 7861 atcgggcatc cttcatcgtg aatcaaatca acatttgccg acctggctgt gcggtgcgtc 7921 gacgtcgaac atcaaaggct atacatcgtc gagctc // LOCUS ECOCLPAA 3380 bp ds-DNA BCT 27-JUL-1990 DEFINITION E.coli ATP-dependent C1p protease (clpA) gene, complete cds. ACCESSION M31045 KEYWORDS C1p protease; clpA gene. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3380) AUTHORS Gottesman,S., Clark,W.P. and Maurizi,M.R. TITLE The ATP-dependent Clp protease of Escherichia coli: Sequence of ClpA and identification of a Clp-specific substrate JOURNAL J. Biol. Chem. 265, 7886-7893 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Gottesman, 03-JAN-1990. FEATURES from to/span description pept 1000 3276 ClpA protein signal 779 784 -35 region signal 805 810 -10 region binding 990 994 ribosome binding site binding 214 221 ATP binding site binding 495 502 ATP binding site BASE COUNT 877 a 759 c 943 g 801 t ORIGIN 1 bp upstream of BamHI site; 19 min. on K12 map. 1 ggatccagta gatgctctgg cacgagagta atttttgaga taaaaatcga gccgatcggg 61 cggcattttg cctttaggat gtacacaatg agacagaaga gctatgcgac tgccgcttct 121 acttcgacgg gcacaataac actggcgtga ttgccttttg gcccctggtg gacatcaaac 181 tgaacggatt gtccagcttt tagcgttctg taaccatcca tctgaatggt ggaataatga 241 gcgaaaatat cttcgccgcc gccttcaggg cagatgaaac caaacccttt ggcattgttg 301 aaccacttaa cagtaccctt ttccatgctt cgacatcctt cgcaaatctt atacaagtaa 361 gatggaataa accggggtca gagagggggc tgttcaaaac ctcgccaact ctagaaatac 421 aatttagaga attagggcga gccgtcaagc atttgacagg ggacaagggg caggtatgaa 481 tcaaaaattt gaagcagtta acgctattga caggaatgtg acagatgtcg ctgatgccaa 541 cgatagatga tagttatcta tcatgtggag tagattggtc aggcaaataa gctcttgtca 601 gcggcagggc gttctgccga taaccgtaac cgaagatgat aactgacaat gggtaaaacg 661 aacgactggc tggactttga tcaactggcg gaagaaaaag ttcgcgacgc gctaaaaccg 721 ccatctatgt ataaagtgat attagtcaat gatgattaca ctccgatgga gtttgttatt 781 gacgtgttac aaaaattctt ttcttatgat gtagaacgtg caacgcaatt gatgctcgct 841 gttcactacc aggggaaggc catttgcgga gtctttaccg ccgaggttgc agaaaccaaa 901 gtggcgatgg tgaacaagta cgcgagggag aatgagcatc cattgctgtg tacgctagaa 961 aaagcctgaa tgcaggcata aaaattgggg gaggtgccta tgctcaatca agaactggaa 1021 ctcagtttaa atatggcttt cgccagagcg cgcgagcacc gtcatgagtt tatgaccgtc 1081 gagcacttgt tactggcgct gctcagtaac ccatctgccc gggaggcgct ggaagcgtgt 1141 tctgtggatt tggttgcgct ccgtcaggaa ctggaagcct ttattgaaca aaccacaccc 1201 gttctgcctg ccagtgaaga ggagcgcgac acacagccga cgctgagttt tcagcgtgta 1261 ctgcaacgtg cggtcttcca tgtccagtcc tccggtcgca atgaggttac cggtgcaaac 1321 gttctggtcg ctatctttag cgaacaggag tcgcaggcgg catatctgtt gcgtaaacac 1381 gaagtcagcc gtctcgatgt ggtgaacttt atctctcatg gcacgcgtaa agacgagccg 1441 acacagtctt ctgatcctgg cagccagcca aacagcgaag aacaagctgg tggggaggaa 1501 cgtatggaga atttcacgac gaacctgaat cagcttgcgc gcgtgggcgg aatcgaccca 1561 ctgattggtc gtgagaagga gctggagcgt gctattcagg ttctctgccg tcgccgtaaa 1621 aacaacccgc tgctggtggg ggaatctggt gtcggtaaaa ccgcgattgc ggaaggtctt 1681 gcctggcgaa ttgttcaggg cgatgtgccg gaagtgatgg ctgactgtac gatttactct 1741 ctcgatatcg gttctctgtt agcgggcaca aaatatcgcg gcgactttga aaaacgtttt 1801 aaagcgttgc tcaagcagct ggagcaggac actaacagca tcctgtttat tgatgagatc 1861 cacaccatta tcggtgcggg tgcagcgtct ggtggtcagg tcgatgcggc taacctaatc 1921 aaaccgttgc tctccagcgg taaaattcgt gtaattggtt cgacaaccta tcaggagttc 1981 agcaacattt tcgagaaaga ccgtgctctg gcgcgtcgct tccagaaaat tgatattact 2041 gaaccgtcga tcgaagaaac tgttcaaatc atcaatggcc tgaaaccgaa gtatgaaggc 2101 caccacgacg tgcgttatac cgcaaaagcg gtgcgtgcgg cggtagagct ggcggtgaaa 2161 tacattaacg atcgtcatct gccggataaa gccattgatg ttatcgacga agcgggcgct 2221 cgcgcacgcg tgatgccggt aagcaaacgc aagaaaaccg ttaatgtggc ggatattgag 2281 tccgtggtgg cccgtattgc acgcattcca gagaagagtg tttctcagag tgatcgtgat 2341 accctgaaaa acctcggcga tcgcttgaaa atgctggtct tcggtcagga taaagccatt 2401 gaggcgctga ctgaagccat taagatggcg cgtgcaggtt taggtcacga acataaaccg 2461 gttggttcgt tcctgtttgc cggccctacc ggggtcggga aaacagaggt gacggtacag 2521 ctttcgaaag ctttgggcat tgagcttctg cgctttgata tgtccgagta tatggaacgc 2581 cataccgtca gccgtgttat tggtgcgcct ccgggatacg ttggttttga tcagggcggt 2641 ttgctgactg atgcggtcat caagcatcca catgcggtgc tgctgctgga cgaaatcgag 2701 aaagcgcacc cggacgtgtt caatattctg ttgcaggtga tggataacgg tacgctgacc 2761 gataacaacg gacgcaaagc agacttccgt aacgtggtgc tggtgatgac caccaacgcc 2821 ggggtacggg aaactgagcg caaatccatt ggtcttatcc accaggataa cagcaccgat 2881 gcgatggaag agatcaagaa gatctttaca ccggaattcc gtaaccgtct cgacaacatt 2941 atctggtttg atcatctgtc aaccgacgtg atccatcagg tggtggataa attcatcgtc 3001 gagttgcagg ttcagctgga tcagaaaggt gtttctctgg aagtgagcca ggaagcgcgt 3061 aactggctgg ccgagaaagg ttacgaccgg gcaatgggcg ctcgtccgat ggcgcgtgtc 3121 atccaggaca acctgaaaaa accgctcgcc aacgaactgc tgtttggttc gctggtggac 3181 ggcggtcagg tcaccgtcgc gctggataaa gagaaaaatg agctgactta cggattccag 3241 agtgcacaaa agcacaaggc ggaagcagcg cattaatctg attgttaggt aggttggtca 3301 agtccgtaat ctcgaaagag gttacggact ttttgtttat ggggtggagg agttcagacc 3361 ctttttttat gatgatggta // LOCUS HUMCSAE 1372 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human epithelial glycoprotein (EGP) mRNA, complete cds. ACCESSION M32306 KEYWORDS cell surface antigen; cell surface glycoprotein; epithelial glycoprotein; transmembrane protein; tumor-associated antigen. SOURCE Human colon carcinoma cell line HT-29, cDNA to mRNA, (library HT-29/cdm8). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1372) AUTHORS Simon,B., Podolsky,D.K., Moldenhauer,G., Isselbacher,K.J., Gattoni-Celli,S. and Brand,S.J. TITLE Epithelial glycoprotein is a member of a family of epithelial cell surface antigens homologous to nidogen, a matrix adhesion protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2755-2759 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by B.Simon, 23-FEB-1990. FEATURES from to/span description pept 22 966 epithelial glycoprotein (EGP) precursor sigp 22 84 epithelial glycoprotein signal peptide matp 85 963 epithelial glycoprotein mRNA < 1 1372 EGP mRNA BASE COUNT 436 a 233 c 317 g 386 t ORIGIN 1 gctctaaagg cgcgcgcagg catggcgccc ccgcaggtcc tcgcgttcgg gcttctgctt 61 gccgcggcga cggcgacttt tgccgcagct caggaagaat gtgtctgtga aaactacaag 121 ctggccgtaa actgctttgt gaataataat cgtcaatgcc agtgtacttc agttggtgca 181 caaaatactg tcatttgctc aaagctggct gccaaatgtt tggtgatgaa ggcagaaatg 241 aatggctcaa aacttgggag aagagcaaaa cctgaagggg ccctccagaa caatgatggg 301 ctttatgatc ctgactgcga tgagagcggg ctctttaagg ccaagcagtg caacggcacc 361 tccatgtgct ggtgtgtgaa cactgctggg gtcagaagaa cagacaagga cactgaaata 421 acctgctctg agcgagtgag aacctactgg atcatcattg aactaaaaca caaagcaaga 481 gaaaaacctt atgatagtaa aagtttgcgg actgcacttc agaaggagat cacaacgcgt 541 tatcaactgg atccaaaatt tatcacgagt attttgtatg agaataatgt tatcactatt 601 gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc tgatgtggct 661 tattattttg aaaaagatgt taaaggtgaa tccttgtttc attctaagaa aatggacctg 721 acagtaaatg gggaacaact ggatctggat cctggtcaaa ctttaattta ttatgttgat 781 gaaaaagcac ctgaattctc aatgcagggt ctaaaagctg gtgttattgc tgttattgtg 841 gttgtggtga tagcagttgt tgctggaatt gttgtgctgg ttatttccag aaagaagaga 901 atggcaaagt atgagaaggc tgagataaag gagatgggtg agatgcatag ggaactcaat 961 gcataactat ataatttgaa gattatagaa gaagggaaat agcaaatgga cacaaattac 1021 aaatgtgtgt gcgtgggacg aagacatctt tgaaggtcat gagtttgtta gtttaacatc 1081 atatatttgt aatagtaaag cctgtactca aaatataagc agcttgaaac tggctttacc 1141 aatcttgaaa tttgaccaca agtgtcttat atatgcagat ctaatgtaaa atccagaact 1201 tggactccat cgttaaaatt atttatgtgt aacattcaaa tgtgtgcatt aaatatgctt 1261 ccacagtaaa atctgaaaaa ctgatttgtg attgaaagct gcctttctat ttacttgagt 1321 cttgtacata catacttttt tatgagctat gaaataaaac cattttaaac tg // LOCUS HUMMHCAD 1093 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele B7, complete cds. ACCESSION M32317 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line JY cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1093) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1089 HLA protein allele B7 sigp 1 72 HLA protein signal peptide matp 73 1086 HLA protein mature peptide BASE COUNT 221 a 335 c 364 g 173 t ORIGIN 1 atgctggtca tggcgccccg aaccgtcctc ctgctgctct cggcggccct ggccctgacc 61 gagacctggg ccggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gagagaggag ccgcgggcgc cgtggataga gcaggagggg 241 ccggagtatt gggaccggaa cacacagatc tacaaggccc aggcacagac tgaccgagag 301 agcctgcgga acctgcgcgg ctactacaac cagagcgagg ccgggtctca caccctccag 361 agcatgtacg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtac 421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcctg gaccgccgcg 481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagcgg 541 agagcctacc tggagggcga gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gacaagctgg agcgcgctga ccccccaaag acacacgtga cccaccaccc catctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggtttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagccg 901 tcttcccagt ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagttc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttgaa aag // LOCUS HUMMHCAE 1089 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele B57, complete cds. ACCESSION M32318 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line WIN and cell line MOC cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1089) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1089 HLA protein allele B57 sigp 1 72 HLA protein signal peptide matp 73 1086 HLA protein mature peptide BASE COUNT 217 a 331 c 366 g 175 t ORIGIN 1 atgcgggtca cggcaccccg aaccgtcctc ctgctgctct ggggggcagt ggccctgacc 61 gagacctggg ccggctccca ctccatgagg tatttctaca ccgccatgtc ccggcccggc 121 cgcggggagc cccgcttcat cgcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gaggatggcg ccccgggcgc catggataga gcaggagggg 241 ccggagtatt gggacgggga gacacggaac atgaaggcct ccgcgcagac ttaccgagag 301 aacctgcgga tcgcgctccg ctactacaac cagagcgagg ccgggtctca catcatccag 361 gtgatgtatg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtcc 421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg 481 gacacggcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgtggc ggagcagctg 541 agagcctacc tggagggcct gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gagacgctgc agcgcgcgga ccccccaaag acacatgtga cccaccaccc catctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacaccgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccaaagcccc tcaccctgag atgggagcca 901 tcttcccaat ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagctc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttga // LOCUS HUMMHCAF 1093 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele B51, complete cds. ACCESSION M32319 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line BM92 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1093) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1089 HLA protein allele B51 sigp 1 72 HLA protein signal peptide matp 73 1086 HLA protein mature peptide BASE COUNT 228 a 334 c 358 g 173 t ORIGIN 1 atgcgggtca cggcgccccg aaccgtcctc ctgctgctct ggggggcagt ggccctgacc 61 gagacctggg ccggctccca ctccatgagg tatttctaca ccgccatgtc ccggcccggc 121 cgcggggagc cccgcttcat tgcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gaggacggag ccccgggcgc catggataga gcaggagggg 241 ccggagtatt gggaccggaa cacacagatc ttcaagacca acacacagac ttaccgagag 301 aacctgcgga tcgcgctccg ctactacaac cagagcgagg ccgggtctca cacttggcag 361 acgatgtatg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca taaccagtac 421 gcctacgacg gcaaagatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg 481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagctg 541 agagcctacc tggagggcct gtgcgtggag tggctccgca gacacctgga gaacgggaag 601 gagacgctgc agcgcgcgga ccccccaaag acacacgtga cccaccaccc cgtctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagcca 901 tcttcccagt ccaccatccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctactgtg atgtgtagga ggaagagctc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtccagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttgaa aag // LOCUS HUMMHCAG 1093 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele B37, complete cds. ACCESSION M32320 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line KASO and cell line MG cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1093) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1089 HLA protein allele B37 sigp 1 72 HLA protein signal peptide matp 73 1086 HLA protein mature peptide BASE COUNT 224 a 337 c 358 g 174 t ORIGIN 1 atgcgggtca cggcgccccg aaccctcctc ctgctgctct ggggggcagt ggccctgacc 61 gagacctggg ctggctccca ctccatgagg tatttccaca cctccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gaggacggag ccccgggcgc cgtggataga gcaggagggg 241 ccggagtatt gggaccggga gacacagatc tccaagacca acacacagac ttaccgagag 301 gacctgcgga ccctgctccg ctactacaac cagagcgagg ccgggtctca caccatccag 361 aggatgtctg gctgcgacgt ggggccggac gggcgcctcc tccgcgggta taaccagttc 421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgagctcctg gaccgcggcg 481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgtggc ggagcaggac 541 agagcctacc tggagggcac gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gagacgctgc agcgcgcgga ccccccaaag acacatgtga cccaccaccc catctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacaccgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagcca 901 tcttcccagt ccaccatccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctactgtg atgtgtagga ggaagagctc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtccagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttgaa aag // LOCUS HUMMHCAH 1098 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele A25, complete cds. ACCESSION M32321 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line BM92 cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1098) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1098 HLA protein allele A25 sigp 1 72 HLA protein signal peptide matp 73 1095 HLA protein mature peptide BASE COUNT 216 a 329 c 372 g 181 t ORIGIN 1 atggccgtca tggcgccccg aaccctcgtc ctgctactct cgggggccct ggccctgacc 61 cagacctggg cgggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat cgccgtgggc tacgtggacg acacgcagtt cgtgcggttc 181 gacagcgacg ccgcgagcca gaggatggag ccgcgggcgc cgtggataga gcaggagggg 241 ccggagtatt gggaccggaa cacacggaat gtgaaggccc actcacagac tgaccgagag 301 agcctgcgga tcgcgctccg ctactacaac cagagcgagg acggttctca caccatccag 361 aggatgtatg gctgcgacgt ggggccggac gggcgcttcc tccgcgggta ccagcaggac 421 gcttacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcttg gaccgcggcg 481 gacatggcgg ctcagatcac ccagcgcaag tgggagacgg cccatgaggc ggagcagtgg 541 agagcctacc tggagggccg gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gagacgctgc agcgcacgga cgcccccaag acgcatatga ctcaccacgc tgtctctgac 661 catgaggcca ccctgaggtg ctgggccctg agcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggggagga ccagacccag gacacggagc tcgtggagac caggcctgca 781 ggggatggga ccttccagaa gtgggcgtct gtggtggtgc cttctggaca ggagcagaga 841 tacacctgcc atgtgcagca tgagggtctg cccaagcccc tcaccctgag atgggagccg 901 tcttcccagc ccaccatccc catcgtgggc atcattgctg gcctggttct ctttggagct 961 gtgatcgctg gagctgtggt cgctgctgtg atgtggagga ggaagagctc agatagaaaa 1021 ggagggagct actctcaggc tgcaagcagt gacagtgccc agggctctga tatgtctctc 1081 acagcttgta aagtgtga // LOCUS HUMMHCAI 1098 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human MHC HLA protein, allele A2, complete cds. ACCESSION M32322 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human blood B-cell cell line JY cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1098) AUTHORS Ennis,P.D., Zemmour,J., Salter,R.D. and Parham,P. TITLE Rapid cloning of HLA-A,B cDNA by using the polymerase chain reaction: Frequency and nature of errors produced in amplification JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2833-2837 (1990) STANDARD full staff_entry COMMENT Authorin copy of sequence [1] kindly submitted by E.Warren, 22-FEB-1990. FEATURES from to/span description pept 1 1098 HLA protein allele A2 sigp 1 72 HLA protein signal peptide matp 73 1095 HLA protein mature peptide BASE COUNT 220 a 323 c 368 g 187 t ORIGIN 1 atggccgtca tggcgccccg aaccctcgtc ctgctactct cgggggctct ggccctgacc 61 cagacctggg cgggctctca ctccatgagg tatttcttca catccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat cgcagtgggc tacgtggacg acacgcagtt cgtgcggttc 181 gacagcgacg ccgcgagcca gaggatggag ccgcgggcgc cgtggataga gcaggagggt 241 ccggagtatt gggacgggga gacacggaaa gtgaaggccc actcacagac tcaccgagtg 301 gacctgggga ccctgcgcgg ctactacaac cagagcgagg ccggttctca caccgtccag 361 aggatgtatg gctgcgacgt ggggtcggac tggcgcttcc tccgcgggta ccaccagtac 421 gcctacgacg gcaaggatta catcgccctg aaagaggacc tgcgctcttg gaccgcggcg 481 gacatggcag ctcagaccac caagcacaag tgggaggcgg cccatgtggc ggagcagttg 541 agagcctacc tggagggcac gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gagacgctgc agcgcacgga cgcccccaaa acgcatatga ctcaccacgc tgtctctgac 661 catgaagcca ccctgaggtg ctgggccctg agcttctacc ctgcggagat cacactgacc 721 tggcagcggg atggggagga ccagacccag gacacggagc tcgtggagac caggcctgca 781 ggggatggaa ccttccagaa gtgggcggct gtggtggtgc cttctggaca ggagcagaga 841 tacacctgcc atgtgcagca tgagggtttg cccaagcccc tcaccctgag atgggagccg 901 tcttcccagc ccaccatccc catcgtgggc atcattgctg gcctggttct ctttggagct 961 gtgatcactg gagctgtggt cgctgctgtg atgtggagga ggaagagctc agatagaaaa 1021 ggagggagct actctcaggc tgcaagcagt gacagtgccc agggctctga tgtgtctctc 1081 acagcttgta aagtgtga // LOCUS HUMNOXF 2206 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human neutrophil oxidase factor (p67-phox) mRNA, complete cds. ACCESSION M32011 KEYWORDS neutrophil oxidase factor. SOURCE Human promyelocytic leukemia myeloid cell line HL60, cDNA to mRNA, clone 10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2206) AUTHORS Leto,T.L., Lomax,K.J., Volpp,B.D., Nunoi,H., Sechler,J.M.G., Nauseef,W.M., Clark,R.A., Gallin,J.I. and Malech,H.L. TITLE Cloning of a 67kD neutrophil oxidase factor with similarity to a noncatalytic region of P60-c-src JOURNAL Science 248, 727-730 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by H.L.Malech, 08-FEB-1990. FEATURES from to/span description pept 68 1648 neutrophil oxidase factor (p67-phox) mRNA < 1 2206 p67-phox signal 2030 2035 poly-A signal signal 2181 2186 poly-A signal BASE COUNT 624 a 499 c 573 g 510 t ORIGIN 1 ctagtctttc agccttcagg ctgtttttgg cttgaagctc tcttggcctc ctagtttcta 61 cctaatcatg tccctggtgg aggccatcag cctctggaat gaaggggtgc tggcagcgga 121 caagaaggac tggaagggag ccctggatgc cttcagtgcc gtccaggacc cccactcccg 181 gatttgcttc aacattggct gcatgtacac tatcctgaag aacatgactg aagcagagaa 241 ggcctttacc agaagcatta accgagacaa gcacttggca gtggcttact tccaacgagg 301 gatgctctac taccagacag agaaatatga tttggctatc aaagacctta aagaagcctt 361 gattcagctt cgagggaacc agctgataga ctataagatc ctggggctcc agttcaagct 421 gtttgcctgt gaggtgttat ataacattgc tttcatgtat gccaagaagg aggaatggaa 481 aaaagctgaa gaacagttag cattggccac gagcatgaag tctgagccca gacattccaa 541 aatcgacaag gcgatggagt gtgtctggaa gcagaagcta tatgagccag tggtgatccc 601 tgtgggcaag ctgtttcgac caaatgagag acaagtggct cagctggcca agaaggatta 661 cctaggcaag gcgacggtcg tggcatctgt ggtggatcaa gacagtttct ctgggtttgc 721 ccctctgcaa ccacaggcag ctgagcctcc acccagaccg aaaaccccag agatcttcag 781 ggctctggaa ggggaggctc accgtgtgct atttgggttt gtgcctgaga caaaagaaga 841 gctccaggtc atgccaggga acattgtctt tgtcttgaag aagggcaatg ataactgggc 901 cacggtcatg ttcaacgggc agaaggggct tgttccctgc aactaccttg aaccagttga 961 gttgcggatc caccctcagc agcagcccca ggaggaaagc tctccgcagt ccgacatccc 1021 agctcctcct agttccaaag cccctggaaa accccagctg tcaccaggcc agaaacaaaa 1081 agaagagcct aaggaagtga agctcagtgt tcccatgccc tacacactca aggtgcacta 1141 caagtacacg gtagtcatga agactcagcc cgggctcccc tacagccagg tccgggacat 1201 ggtgtctaag aaactggagc tccggctgga acacactaag ctgagctatc ggcctcggga 1261 cagcaatgag ctggtgcccc tttcagaaga cagcatgaag gatgcctggg gccaggtgaa 1321 aaactactgc ctgactctgt ggtgtgagaa cacagtgggt gaccaaggct ttccagatga 1381 acccaaggaa agtgaaaaag ctgatgctaa taaccagaca acagaacctc agcttaagaa 1441 aggcagccaa gtggaggcac tcttcagtta tgaggctacc caaccagagg acctggagtt 1501 tcaggaaggg gatataatcc tggtgttatc aaaggtgaat gaagaatggc tggaagggga 1561 gtgcaaaggg aaggtgggca ttttccccaa agtttttgtt gaagactgcg caactacaga 1621 tttggaaagc actcggagag aagtctagga tgtttcacaa actacaaagc tgaagaaaat 1681 gaagccctat tacttgtttg taagatttag cacccttctg ctgtatactg tactgagaca 1741 ttacagtttg gaagtgttaa ctatttattc cctgttaaaa tttaacctac tagacaatga 1801 tgtgagtacc caggatgatt tcctggggca cagtgggtga ggagatgggg acaggtgaat 1861 ggaggagtta ggggagagga aaagtggatg gaagtgtctg gaaagggcac gagagagtct 1921 tccaggtact gatcctgttt cttgctctga gtgctagcta gccagctgtg ttcacactgt 1981 aaacattcat caagctgtac atttggtgca cttttctgtg tcataccaca ataaaaaaaa 2041 acctatcatc atcttacaaa aacaagacac ccaagtccag gcccaaggag taagtacaaa 2101 tattcctgtt tctgaaccat tactgtaatt ggctcttaag gcttgaagta accttatagg 2161 ttactcataa ggcatataca aataaacttg tttgttttct tttttc // LOCUS HUMPTPBX 3247 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human phosphotyrosyl-protein phosphatase (PTP-1B) mRNA, complete cds. ACCESSION M31724 KEYWORDS phosphotyrosyl-protein phosphatase. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3247) AUTHORS Chernoff,J., Schievella,A.R., Jost,C.A., Erikson,R.L. and Neel,B.G. TITLE Cloning of a cDNA for a major human protein-tyrosine-phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2735-2739 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Chernoff,J., 13-MAR-1990. FEATURES from to/span description pept 91 1398 phosphotyrosyl-protein phosphatase (EC 3.1.3.48) mRNA < 1 3247 phosphotyrosyl-protein phosphatase mRNA BASE COUNT 836 a 834 c 806 g 771 t ORIGIN 1 gggcgggcct cggggctaag agcgcgacgc ctagagcggc agacggcgca gtgggccgag 61 aaggaggcgc agcagccgcc ctggcccgtc atggagatgg aaaaggagtt cgagcagatc 121 gacaagtccg ggagctgggc ggccatttac caggatatcc gacatgaagc cagtgacttc 181 ccatgtagag tggccaagct tcctaagaac aaaaaccgaa ataggtacag agacgtcagt 241 ccctttgacc atagtcggat taaactacat caagaagata atgactatat caacgctagt 301 ttgataaaaa tggaagaagc ccaaaggagt tacattctta cccagggccc tttgcctaac 361 acatgcggtc acttttggga gatggtgtgg gagcagaaaa gcaggggtgt cgtcatgctc 421 aacagagtga tggagaaagg ttcgttaaaa tgcgcacaat actggccaca aaaagaagaa 481 aaagagatga tctttgaaga cacaaatttg aaattaacat tgatctctga agatatcaag 541 tcatattata cagtgcgaca gctagaattg gaaaacctta caacccaaga aactcgagag 601 atcttacatt tccactatac cacatggcct gactttggag tccctgaatc accagcctca 661 ttcttgaact ttcttttcaa agtccgagag tcagggtcac tcagcccgga gcacgggccc 721 gttgtggtgc actgcagtgc aggcatcggc aggtctggaa ccttctgtct ggctgatacc 781 tgcctcctgc tgatggacaa gaggaaagac ccttcttccg ttgatatcaa gaaagtgctg 841 ttagaaatga ggaagtttcg gatggggttg atccagacag ccgaccagct gcgcttctcc 901 tacctggctg tgatcgaagg tgccaaattc atcatggggg actcttccgt gcaggatcag 961 tggaaggagc tttcccacga ggacctggag cccccacccg agcatatccc cccacctccc 1021 cggccaccca aacgaatcct ggagccacac aatgggaaat gcagggagtt cttcccaaat 1081 caccagtggg tgaaggaaga gacccaggag gataaagact gccccatcaa ggaagaaaaa 1141 ggaagcccct taaatgccgc accctacggc atcgaaagca tgagtcaaga cactgaagtt 1201 agaagtcggg tcgtgggggg aagtcttcga ggtgcccagg ctgcctcccc agccaaaggg 1261 gagccgtcac tgcccgagaa ggacgaggac catgcactga gttactggaa gcccttcctg 1321 gtcaacatgt gcgtggctac ggtcctcacg gccggcgctt acctctgcta caggttcctg 1381 ttcaacagca acacatagcc tgaccctcct ccactccacc tccacccact gtccgcctct 1441 gcccgcagag cccacgcccg actagcaggc atgccgcggt aggtaagggc cgccggaccg 1501 cgtagagagc cgggccccgg acggacgttg gttctgcact aaaacccatc ttccccggat 1561 gtgtgtctca cccctcatcc ttttactttt tgccccttcc actttgagta ccaaatccac 1621 aagccatttt ttgaggagag tgaaagagag taccatgctg gcggcgcaga gggaaggggc 1681 ctacacccgt cttggggctc gccccaccca gggctccctc ctggagcatc ccaggcggcg 1741 cacgccaaca gcccccccct tgaatctgca gggagcaact ctccactcca tatttattta 1801 aacaattttt tccccaaagg catccatagt gcactagcat tttcttgaac caataatgta 1861 ttaaaatttt ttgatgtcag ccttgcatca agggctttat caaaaagtac aataataaat 1921 cctcaggtag tactgggaat ggaaggcttt gccatgggcc tgctgcgtca gaccagtact 1981 gggaaggagg acggttgtaa gcagttgtta tttagtgata ttgtgggtaa cgtgagaaga 2041 tagaacaatg ctataatata taatgaacac gtgggtattt aataagaaac atgatgtgag 2101 attactttgt cccgcttatt ctcctccctg ttatctgcta gatctagttc tcaatcactg 2161 ctcccccgtg tgtattagaa tgcatgtaag gtcttcttgt gtcctgatga aaaatatgtg 2221 cttgaaatga gaaactttga tctctgctta ctaatgtgcc ccatgtccaa gtccaacctg 2281 cctgtgcatg acctgatcat tacatggctg tggttcctaa gcctgttgct gaagtcattg 2341 tcgctcagca atagggtgca gttttccagg aataggcatt tgctaattcc tggcatgaca 2401 ctctagtgac ttcctggtga ggcccagcct gtcctggtac agcagggtct tgctgtaact 2461 cagacattcc aagggtatgg gaagccatat tcacacctca cgctctggac atgatttagg 2521 gaagcaggga caccccccgc cccccacctt tgggatcagc ctccgccatt ccaagtcaac 2581 actcttcttg agcagaccgt gatttggaag agaggcacct gctggaaacc acacttcttg 2641 aaacagcctg ggtgacggtc ctttaggcag cctgccgccg tctctgtccc ggttcacctt 2701 gccgagagag gcgcgtctgc cccaccctca aaccctgtgg ggcctgatgg tgctcacgac 2761 tcttcctgca aagggaactg aagacctcca cattaagtgg ctttttaaca tgaaaaacac 2821 ggcagctgta gctcccgagc tactctcttg ccagcatttt cacattttgc ctttctcgtg 2881 gtagaagcca gtacagagaa attctgtggt gggaacattc gaggtgtcac cctgcagagc 2941 tatggtgagg tgtggataag gcttaggtgc caggctgtaa gcattctgag ctggcttgtt 3001 gtttttaagt cctgtatatg tatgtagtag tttgggtgtg tatatatagt agcatttcaa 3061 aatggacgta ctggtttaac ctcctatcct tggagagcag ctggctctcc accttgttac 3121 acattatgtt agagaggtag cgagctgctc tgctatatgc cttaagccaa tatttactca 3181 tcaggtcatt attttttaca atggccatgg aataaaccat ttttacaaaa ataaaaacaa 3241 aaaaagc // LOCUS HUMUDPGTA 1855 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human 3,4-catechol estrogen UDP-glucuronosyltransferase mRNA, complete cds. ACCESSION J05428 KEYWORDS 3,4-catechol estrogen UDP-glucuronosyltransferase. SOURCE Human liver, cDNA to mRNA, clone 63-11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1855) AUTHORS Ritter,J.K., Sheen,Y.Y. and Owens,I.S. TITLE Cloning and expression of human liver UDP-glucuronosyltransferase in COS-1 cells: 3,4-catechol estrogens and estriol as primary substrates JOURNAL J. Biol. Chem. 265, 7900-7906 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.S.Owens, 22-FEB-1990. FEATURES from to/span description pept 15 1604 UDP-glucuronosyltransferase (EC 2.4.1.17) BASE COUNT 571 a 337 c 394 g 553 t ORIGIN 1 tgcattgcac caggatgtct gtgaaatgga cttcagtaat tttgctaata caactgagct 61 tttgctttag ctctgggaat tgtggaaagg tgctggtgtg ggcagcagaa tacagccatt 121 ggatgaatat aaagacaatc ctggatgagc ttattcagag aggtcatgag gtgactgtac 181 tggcatcttc agcttccatt ctttttgatc ccaacaactc atccgctctt aaaattgaaa 241 tttatcccac atctttaact aaaactgagt tggagaattt catcatgcaa cagattaaga 301 gatggtcaga ccttccaaaa gatacatttt ggttatattt ttcacaagta caggaaatca 361 tgtcaatatt tggtgacata actagaaagt tctgtaaaga tgtagtttca aataagaaat 421 ttatgaaaaa agtacaagag tcaagatttg acgtcatttt tgcagatgct atttttccct 481 gtagtgagct gctggctgag ctatttaaca taccctttgt gtacagtctc agcttctctc 541 ctggctacac ttttgaaaag catagtggag gatttatttt ccctccttcc tacgtacctg 601 ttgttatgtc agaattaact gatcaaatga ctttcatgga gagggtaaaa aatatgatct 661 atgtgcttta ctttgacttt tggttcgaaa tatttgacat gaagaagtgg gatcagtttt 721 atagtgaagt tctaggaaga cccactacgt tatctgagac aatggggaaa gctgacgtat 781 ggcttattcg aaactcctgg aattttcagt ttcctcatcc actcttacca aatgttgatt 841 ttgttggagg actccactgc aaacctgcca aacccctgcc taaggaaatg gaagactttg 901 tacagagctc tggagaaaat ggtgttgtgg tgttttctct ggggtcaatg gtcagtaaca 961 tgacagaaga aagggccaac gtaattgcat cagccctggc ccagatccca caaaaggttc 1021 tgtggagatt tgatgggaat aaaccagata ccttaggtct caatactcgg ctgtataagt 1081 ggatacccca gaatgacctt ctaggtcatc caaagaccag agcttttata actcatggtg 1141 gagccaatgg catctacgag gcaatctacc atgggatccc tatggtgggg attccattgt 1201 ttgccgatca acctgataac attgctcaca tgaaggccag gggagcagct gttagagtgg 1261 acttcaacac aatgtcgagt acagacttgc tgaatgcatt gaagagagta attaatgatc 1321 cttcatataa agagaatgtt atgaaattat caagaattca acatgatcaa ccagtgaagc 1381 ccctggatcg agcagtcttc tggattgaat ttgtcatgcg ccacaaagga gctaaacacc 1441 ttcgggttgc agcccacgac ctcacctggt tccagtacca ctctttggat gtgattgggt 1501 tcctgctggt ctgtgtggca actgtgatat ttatcgtcac aaaatgttgt ctgttttgtt 1561 tctggaagtt tgctagaaaa gcaaagaagg gaaaaaatga ttagttatat ctgagatttg 1621 aagctggaaa acctgatagg tgagactact tcagtttatt ccagcaagaa agattgtgat 1681 gcaagatttc tttcttcctg agacaaaaaa aaaaaaaaga aaaaaaaatc ttttcaaaat 1741 ttactttgtc aaataaaaat ttgtttttca gagatttacc acccagttca tggttagaaa 1801 tattttgtgg caatgaagaa aacactacgg aaaataaaaa ataagataaa gcctt // LOCUS RATGFGDVA 630 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Rat glioma-derived vascular endothelial cell growth factor mRNA, complete cds. ACCESSION M32167 KEYWORDS endothelial cell growth factor protein. SOURCE Rat cell line GS-9L, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 630) AUTHORS Conn,G., Bayne,M.L., Soderman,D.D., Kwok,P.W., Sullivan,K.A., Palisi,T.M., Hope,D.A. and Thomas,K.A. TITLE Amino acid and cDNA sequences of a vascular endothelial cell mitogen that is homologous to platelet-derived growth factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2628-2632 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.L.Bayne, 16-FEB-1900. FEATURES from to/span description pept 5 577 endothelial cell growth factor protein precursor sigp 5 82 endothelial cell growth factor protein signal peptide matp 83 574 endothelial cell growth factor protein BASE COUNT 169 a 163 c 172 g 126 t ORIGIN 1 aaccatgaac tttctgctct cttgggtgca ctggaccctg gctttactgc tgtacctcca 61 ccatgccaag tggtcccagg ctgcacccac gacagaaggg gagcagaaag cccatgaagt 121 ggtgaagttc atggacgtct accagcgcag ctattgccgt ccgattgaga ccctggtgga 181 catcttccag gagtaccccg atgagataga gtatatcttc aagccgtcct gtgtgcccct 241 aatgcggtgt gcgggctgct gcaatgatga agccctggag tgcgtgccca cgtcggagag 301 caacgtcact atgcagatca tgcggatcaa acctcaccaa agccagcaca taggagagat 361 gagcttcctg cagcatagca gatgtgaatg cagaccaaag aaagatagaa caaagccaga 421 aaatcactgt gagccttgtt cagagcggag aaagcatttg tttgtccaag atccgcagac 481 gtgtaaatgt tcctgcaaaa acacagactc gcgttgcaag gcgaggcagc ttgagttaaa 541 cgaacgtact tgcagatgtg acaagccaag gcggtgagcc aggctgcagg aaggagcctc 601 ctcagggttt cgggaactag acctctcacc // LOCUS RATYWKII 1821 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Rat sperm membrane protein (YWK-II) mRNA, 3' end. ACCESSION M31322 KEYWORDS sperm membrane protein. SOURCE Rat testis cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1821) AUTHORS Yan,Y.C., Bai,Y., Wang,L.F. and Miao,S.Koide.S.S. TITLE Characterization of cDNA encoding a rat sperm membrane protein related to A4 amyloid protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2405-2408 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.S.Koide 17-JAN-1990. FEATURES from to/span description pept < 1 576 sperm membrane protein (AA at 1) mRNA < 1 1821 YWK-II mRNA BASE COUNT 447 a 427 c 416 g 531 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgtta cctcctccat ctcagagaac cctgtggatg tccgggtgag ctctgaggaa 61 agtgaggaga tcccgccgtt ccaccctttc catcccttcc catccttgtc tgagaacgaa 121 gacactcagc cggagttgta ccacccaatg aaaaaaggat ctggaatggc agagcaagat 181 gggggcctga ttggtgcaga agaaaaggtg atcaacagta agaataaaat ggatgaaaat 241 atggtcattg acgagactct ggatgttaag gaaatgattt tcaatgctga gagagttggg 301 ggtctggagg aagagccgga ctctgtgggg cctctacggg aggacttcag tttgagcagc 361 agtgccctta ttggcttgct ggtcatcgcg gtggccattg ctacagtcat cgtcatcagc 421 ctggtgatgc tgaggaagag gcaatacggc accatcagcc atggcattgt ggaggttcat 481 ccaatgctca ccccagaaga gcgtcacttg aacaagatgc agaaccacgg ctatgaaaac 541 ccaacctaca aatacctgga gcagatgcag atttaaggac agcagcgtgt gcgacaccct 601 ggctgaggct gctgcaggtg ggctggaaga gcctcagcgt ttgtgcttga ctgctgacca 661 ccagcggtgc cagaggcctc atcctacatc ctgctctcct ggattgttaa gactataaag 721 tactactgta ggattgcaat ttccattctt ttaaatgggt ttaaaagatg ttaatataac 781 aatatatgat atataaacct taagtgaaaa aaagatctat tgcagatatc tgatggatgt 841 agttttcttt ttttaaatta gaaatgccac ttctattgta ttgtctcaca catgctctat 901 ataaatggaa aatgttgatt tttcaatgat agactatata cacaggctgt tcccgttatg 961 taagtctgtt ctttaggctc gtttgctggg ctggttttgt cgtcatttgt tttaatgtat 1021 aaaggcagta ttcccctttt caggttgctg agaaatgtaa gtggaactga agtacattgt 1081 atgcagttac tgactgtttt aggcatagtc tccttggaag cctagagctt ccagtgccgg 1141 gtgtccagtg cctgtcacca aagcaagggc taagtcacct tgagctagct ggatgcaaac 1201 tagatccact gtgctttcct tcaaatccag ttcttccaca gcaaccagcc catagttgtt 1261 ctgtgttctt ccacagctgt ttacggtagc ctcctagcca ctctcctcag caagtgcatc 1321 caagagtgca ccaccccctt ctttggacgt ctccgtccca tgcactgacc ctctgcttgc 1381 cttcgtacct cacttcctcc accgctcttc agcccctttg atgtcccctc agagaatacc 1441 gatatacaca tggctaagga cccaggagac ttcacgggag gcctcattag gtgaaaggac 1501 gatgttctgg gctgtacatg aaattggatc tgtagacact gtgtttcctt cactgacttg 1561 taatgtcacg cagctggagt tgatgccaca acccttagtg ctttgttgct gttttgtttt 1621 tcagggttct ggtaacctgc tactgttttt gttttggttt tggtttggtt ttttttgtat 1681 ttttctgtga tttccctccc cttccccccc atgcctcttc ccactatgca cagatggaaa 1741 ctttacctac aaactccttc gtatgatctg tggagaatgt acagaactta ttacatcaat 1801 aaaacacttt aacttccccc g // LOCUS RHBNODVW 4227 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.japonicum nodulation protein (nodV, nodW) genes, complete cds. ACCESSION M31765 KEYWORDS nodulation protein; regulatory protein. SOURCE B.japonicum (strain USDA3I1b110spc4) DNA, clone pRJ141. ORGANISM Bradyrhizobium japonicum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 4227) AUTHORS Goettfert,M., Grob,P. and Hennecke,H. TITLE Proposed regulatory pathway encoded by the nodV and nodW genes, determinants of host specificity in Bradyrhizobium japonicum JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2680-2684 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Goettfert, 01-FEB-1990. FEATURES from to/span description pept 648 3317 nodulation protein (nodV) pept 3314 3997 nodulation protein (nodW; gtg start) BASE COUNT 919 a 1147 c 1228 g 933 t ORIGIN 1 ggatcctgcg gacgcagctt gtcgttctcg ttacgatgcg cgctgcgtta gcagagcatg 61 gtgggtgtga tcagcgtgaa tggcaccgac aaagccgtgg cgagtcatac gaggctgccc 121 tgcgttctgc gacgccgctc gcttccagcg gaaccagttt ttcgcagcta aagcgcgata 181 agattggaac gaatcatcat cgcgcgtagg actcgcgcga ctccggcttc tgtgtcggca 241 ggcgaacttg cgctttcttc gctttagcgg ctgagtcggg gagacgagtc attgggcagc 301 ttttttctga tccttgccgg tccggaggcg cgccttcgaa cctatcggct gaactgaaac 361 cactgtcctc attccgcggc ctaatggggc gcattggccg taggctcgag cgatgcgcaa 421 aagcggacgt ttcagcttcg actaaagtcg cgtcagcgac acctactgaa ggtttaggac 481 aaatatacgt atgtttataa tgcgtttttt ccccgcgcgt gctgagttcg gcagccgatg 541 atcgatcttg cacacgctat catgcggacc caagtgttca agattgcctc atttttgcct 601 aaaatgctat agctctcccg ccggataaca ggccggacgg tctcaaaatg tataggatca 661 gcagtttcaa acgttcagcc cctcagttga ccttgggcag catagcgcta gccgcggtta 721 cactgacttg cgtgtacttt caggcgcatt tcgccgccgc ggcgttcgcc tatttgttag 781 tagtcttact attttcgttg atgggcagct tcattgcttc atcagcgctt tgcatcgtcg 841 caatcgctgc tctcgcatac tactttgcgc cgccggcgtt tagtttacga atcgatgatc 901 cccgcgatgt tcctgtggtt gttgcatttc ttattgtctc tgttgtcgga acgtacctga 961 ttggaaaact ccgccaggaa agagaggctg cacgtgtggc tgcggccaag cttcagcgga 1021 gtgcctcgga tttggaggat cgtgaaaaac ggtggcgcgc aattttcgag cacaatccag 1081 ccatgtactt catggtcgat gaggccggca ttgtcctcaa cgtcaatacg ttgggcgcga 1141 cacaactggg atttgcttgt gcagaactat tgggccaatc cgtgctcgac gtatttctgg 1201 aggaggatcg cgcattcgtc cgcaaatgca ttcagacgtg tcttgaggat gttggacaat 1261 cgcgcacttg ggacgtccgg aaagtcagga aggacggttc tgtattgtgg gtgcgtgaaa 1321 acgccaaggc catgctttgg gccggcgacc gccccgtcat cctcatggcg tgcgaagata 1381 ttacggagcg caagcagacc gagcttgcgc tgcagcggag cgaagcgcat ttggctcacg 1441 cgcaggagtt gagtcataca ggcagcttca gctggaacgc ctctaccggc gaggccttct 1501 ggtctaagga gacatttcgg attttccaaa tcgatcttca gacgacaccg gcgccacaac 1561 tcgtcattga gcgcacgcac ccagatgata gggcttctgt caaagagatt atcgatgaag 1621 cgatgcgaga cctgagggat ttcgagcacg agtaccggct gctgctacct gacggctccg 1681 tgaagcacat ccatgcgcag gcacgagtca cgcgaaccgc ctctggtgaa attgagtttg 1741 ttggggcagc caccgatatt acggcagcta ggcgagcaga acagcagttg cgccgaagcg 1801 aggcctatct ggccgaggct cagcatctca ctcacacagg cagctggtcc tgggacgtcc 1861 acacacgaga tttcgtttat cgctccgctg aggtcgaccg cctgtttggc tttaacccac 1921 aagagccggt ttcgctagag actattcgat cgcgcatcca tccggaagac ttgccagggt 1981 tgcaggaggt gcagcgtcag gcgattgacc aggagcacga acggttcgaa tatgatttcc 2041 gtgttattct gccagatggc gggataaggc gcatacactc cgttgcacac gttgtcgtcg 2101 gcagcgatgg taatgtcagc gagctgatcg gaacacatat ggatgttacc gagcaacacg 2161 cagctaggga acgcttggaa aacacgcttg tagcgctgcg cgagagcgaa cagcgctttc 2221 gcgactatgc cgagactgct tccgactggc tctgggaaac cgggccagat catcgggtca 2281 ctcacttgtc cgagcacacc agcgctgcgg gaattttggc gacagggtta acgggcctgc 2341 ttcgctggga catcgcgtgc gacatggaag aagaaccgga gaagtggcgt cagcatcggg 2401 cgacgttgca ggcacacctg ccgttccggg atctcatcta ccgtaccgta aataggatgg 2461 gatctccgat ctacgtccgc actagtggca agcccttctt cgacggaaac ggaaattttt 2521 tgggctatcg cggcgtcagc actgacatca ccgctaccat tcgcgctgat caggccgaac 2581 aagagctgcg aaaggcacag gcggagcttg cacatgtgac gcgtgtaacg accttgggag 2641 aaatgacaac ttctatcgcc cacgagataa cccaaccact cgccgctatc ctcagcaacg 2701 ccgatgcgtg cctcggctgg atggctcgcg atgttcccaa tcttgcagcc gcgcgctctt 2761 cagtcgaatg gatcatagaa gatgcaatcc gggcaagcga ggtgatccgt agtattcgcg 2821 cactcgcgaa aaagggcgag atcgagatgg tgccgctcga cattaatcag gtggttaggg 2881 acgtcagcgc gctggtaaca cgagagctgg tgagccacca agtgacgttg cgaagcgagt 2941 tggcgtctgc gctgcctagg gtcctcggtg atcgaattca gctacaacaa gtgatcatca 3001 atctggtgat gaacggaatc gaggccatgg acgcagttac agaccggccg cgtgaacttc 3061 tgattcaatc atctacggac gatctggggt acgtgcagct ttccgtgacc gattgcggcg 3121 tcgggatcgc cgagaatgac gcggaccgcg tcttggaccc cttcttcacc accaaatcga 3181 gcggcctagg aatgggcctt tcgatctgcc ggtcgatcgt ggaagttcac ggaggacgaa 3241 tttcagtggt tcagaaaaat ggaccgggcg cgacgttcca gtttgccctt ccgctgcata 3301 aggaggccat ctcgtgacag gacgatttga ctggagaggc caaggcggac ataccgaggc 3361 ttcgacgaag gcaatcgtct ttgtcgtcga ggatgacatc tctatgcgtc gctcgcttac 3421 gaaccttttt cgatcggtag gcttggaggt cgtggcgttc ggatcggccc gtgaaatgct 3481 gcagagcaca atgccggacg tcacaagctg tctagttctt gatgtccggc tgccgggcct 3541 gagcggcctt gactaccaga ccgagctcgc caggttgaac atacacatcc caatcatctt 3601 cattaccggc catggcgaca ttcccatgac cgtcagggcc atgaagggag gcgcggtcga 3661 tttcctcagc aagcccttcc gcgatcagga actgcttgat gccgtcgttg cggcgaccga 3721 acgcgatcgc aaaagacgag aagctcagcg aaccgtggcg aacctgaaat ctctatttga 3781 gaccctaagc ccgcgagaac aggcagtgat gaaactggtc gcgacggggc tgatgaacaa 3841 gcaggtagcc gccgaacttg ggctcgccga gatcaccgtc aagatctacc ggggacacgt 3901 aatgaaaaag atgcgtgccc gctcgctggc tgacttgatc agaatgagcg agacgctcgg 3961 aattagcgcc aatcacactg aacaaaccca agtatgattt tacaattcca tcacttaagc 4021 ccactttcgc gaaagtggct gacggtttgg cagccgctgt actgcgtcgg gagggctcat 4081 cttgtccacg cctttgattt ccgtcgttga cgacgacccc tcggtccgtg cggcgacaga 4141 aaaccttttg aaatcgcgtg gctacgtcgt gcagatattt gcctcggccg aggcgctcct 4201 gcggtcgccg cggttgaacg agatatc // LOCUS ANAPETF1A 1682 bp ds-DNA BCT 27-JUL-1990 DEFINITION A.variabilis ferredoxin (petF1) gene, complete cds. ACCESSION M35126 KEYWORDS ferredoxin. SOURCE A.variabilis (ATCC 29413; PCC 7937) DNA, clone pVA1. ORGANISM Anabaena variabilis Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Nostocaceae. REFERENCE 1 (bases 1 to 1682) AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M., Van Arkel,G. and Weisbeek,P. TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and Synechococcus sp. PCC 7942: Structure and regulation JOURNAL Photosyn. Res. 18, 179-204 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 794 1093 ferredoxin (petF1) binding 782 785 ribosomal binding site (put.) BASE COUNT 480 a 353 c 322 g 527 t ORIGIN 1 aagcttgatg taagccacct aaaagttgaa tgtgttcgac gtggagtaat ttttgctgtg 61 ctttggctaa ggcaatcagt tcttctgctt cagttaaatc tacagataaa ggatattcta 121 caattacgtg tttgccggca gttagagcag cacggcgatc gcaccatgat cacgattgat 181 ggtacaaata accactaaat ctatatcttc tcgttctact aactgttgcc agccagttat 241 cgcttcagcc tggtaatctt gagccaaagc ctgggttcgc tccagtgtac tacccgctat 301 ggcgactagg tgcgatcgct tatcctccaa aaaggcttct gcccgcagtt ttgctgcata 361 ccctgtccaa ctatgccaat acgtattgtt gcttgtgcca aagctacctc cgactcctat 421 ataatcctca ccagtcctgc taccaagcac agtaataaat ttttattttt caaaaaaagc 481 taggttaacg cagtaccagc ctaactgaag gttatccctg ttccagaact tttttatcat 541 attcttgatt caagccataa atttttctaa taactgtggg tttaacgtct ttaaagtata 601 aaaaaaattt attgttatct gctaactaaa tttcattact aatcgtggtg gatttcaagt 661 tatatacttt gatttttccc gtagtatcag aattgaacta aatttaattc tgtggctaat 721 cccctgagaa tagccgctaa gttctgcttt agcataactt atactgccga ttacaaaaga 781 gaggattacg gaaatggcaa cttttaaagt tacattgatc aacgaagcag aaggaaccag 841 caacacaatt gacgttcctg atgatgagta tattttagac gctgccgaag aacagggtta 901 tgacctaccc ttttcctgtc gtgcaggtgc ttgctccacc tgcgccggta aactagtatc 961 cggtactgtt gaccagtctg accaatcatt cttggatgac gatcaaatcg aagctggata 1021 tgtattgacc tgtgttgctt atccaacctc tgatgtaacc atccaaaccc acaaagaaga 1081 agacctctac taagagtcaa gttagcttta aaaatagagg ctagaggcta gcagctacag 1141 actagtccct agccttttta ttttttttca ttagggaaca gtctattggg actgggtgta 1201 agggtgtaag ggtatagggg tgtaatagtt tcaaacattt atactctttt caacccttga 1261 tatttcgttt tcatccgtaa gtcccacaaa aaataaatta tccaaaattg atggtttggt 1321 agggtgcgtc aatagaaatc atttctgagt gtatttaggc tctatcgcac tgacggacac 1381 tacattttgg atattttttg atctgaaagt cccttaaaat ttgatttctt cattcaggct 1441 gtgaagttgt gtaccaggat aataaaattg gagaattttg ctgttagacc aacctagttt 1501 agctaaagtt tgagcgccag tttgacttaa gcccactcca tgtcctaaac caccaccaat 1561 aaaagcgtat cccacaactc ttctttgcct ttgttgagag gtagtatgta aaaaagcgtg 1621 cttcttggcg cagcaaaggc actacgcact tcatctttgt gtagggtaaa aatgcccata 1681 tc // LOCUS ANIPETF1A 1074 bp ds-DNA BCT 27-JUL-1990 DEFINITION A.nidulans ferredoxin (petF1) gene, complete cds. ACCESSION M35127 KEYWORDS ferredoxin. SOURCE A.nidulans (strain R2, PCC7942) DNA, clone pA612. ORGANISM Anacystis nidulans Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. REFERENCE 1 (bases 1 to 1074) AUTHORS Van Der Plas,J., De Groot,R., Woortman,M., Cremers,F., Borrias,M., Van Arkel,G. and Weisbeek,P. TITLE Genes encoding ferredoxins from Anabaena sp. PCC 7937 and Synechococcus sp. PCC 7942: Structure and regulation JOURNAL Photosyn. Res. 18, 179-204 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 504 803 ferredoxin (petF1) binding 493 496 ribosomal binding site (put.) BASE COUNT 234 a 298 c 281 g 261 t ORIGIN 1 ctgcagccgc tgcatgcgca gggcgttcga tttgagggct ttttgtggaa catcttggcg 61 cacatcatcg aagctctaac accgaggctg tctgagttgg ctagtcctgg cagctgggca 121 atctttagcg gcttgctaac cagtcaagcc gacactgtca gcgtcacttt ggaagagtac 181 ttttgggtga tccgcgatcg cgccagtcag ggagattggt gtcgtttggt cgcggatttt 241 cggccggaac gataaatctc actaatgctt agcttagagg gcttactggg agcgggccga 301 gtttgagccg tgattacccc tacgaacttt ccggccacgc tccattgctt agacataaaa 361 ttcccttatg tctagactgg cgattgatag catttctcgc ggcgcagttc gccctttggc 421 aacccatagt atcaatggga aaggtacggg caggctgtca atcgatgagc tctgccaccc 481 caaaagcgat agaggacacg ctcatggcaa cctacaaggt tacgctcgtc aatgctgccg 541 aaggcttgaa caccacgatc gacgtggctg acgataccta catcttggac gccgctgaag 601 agcaaggcat tgacctgcct tactcctgcc gtgctggtgc ttgctcgacc tgtgctggca 661 aagtcgtctc tggtaccgtc gaccaatcgg atcaatcctt cttggatgac gaccaaattg 721 cagcaggctt tgtcctgacc tgcgtcgcct atccgacctc cgatgtgacg atcgaaaccc 781 acaaagaaga agacctctac taagtcttgc ttcgactgct tcaatcctta gaattcaaat 841 caaattgcgg cttccaaatt gggagtcgct tttttatcgc cgtagtcagc agcgatcgcc 901 tcactaccgt ccagaaacag cctgcgatcg ccctgactta acgcccctcc cctagcaatc 961 ctgactcggt agagttcagg ccaaggactc aaggttgagg cgatcgttca gcaacgtctt 1021 gctctaggcg tccggcgtga ctgttgtggg gaagaggcga tcgccttcct gcag // LOCUS ASAASPPA 1764 bp ds-DNA PLN 27-JUL-1990 DEFINITION A.awamori aspergillopepsin A (pepA) gene, complete cds. ACCESSION M34454 KEYWORDS aspartic proteinase; aspergillopepsin. SOURCE A.awamori (NRRL3112 mutant strain UVK143f) DNA. ORGANISM Aspergillus awamori Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae. REFERENCE 1 (bases 1 to 1764) AUTHORS Berka,R.M., Ward,M., Wilson,L.J., Hayenga,K.J., Kodama,K.H., Carlomagno,L.P. and Thompson,S.A. TITLE Molecular cloning and deletion of the gene encoding aspergillopepsin A from Aspergillus awamori JOURNAL Gene 86, 153-162 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 177 496 aspergillopepsin (pepA), exon 1 548 825 aspergillopepsin, exon 2 878 1126 aspergillopepsin, exon 3 1186 1523 aspergillopepsin, exon 4 sigp 177 308 aspergillopepsin signal peptide matp 455 496 aspergillopepsin 548 825 aspergillopepsin 878 1126 aspergillopepsin 1186 1523 aspergillopepsin pre-msg 124 1764 pepA mRNA and introns (put.) pre-msg 125 1764 pepA mRNA and introns (put.) pre-msg 127 1764 pepA mRNA and introns (put.) IVS 497 547 pepA intron A IVS 826 877 pepA intron B IVS 1127 1185 pepA intron C signal 54 58 TATA box BASE COUNT 387 a 515 c 445 g 417 t ORIGIN 1 gtcgacttgg atgatggaga gttgatcgag gtcaatgagg agaggcttgc aagtataaga 61 agagactgct cgaccagcag aatggatctt cttgttcatc aaccaagagt ccaaggcttc 121 tttgtctggt tctatctctt ctccgaactc tcttgcttga cattctcgtg gtcaaaatgg 181 tcgtcttcag caaaaccgct gccctcgttc tgggtctgtc ctccgccgtc tctgcggcgc 241 cggctcctac tcgcaagggc ttcaccatca accagattgc ccggcctgcc aacaagaccc 301 gcaccatcaa cctgccaggc atgtacgccc gttccctggc caagtttggc ggtacggtgc 361 cccagagcgt gaaggaggct gccagcaagg gtagtgccgt gaccacgccc cagaacaatg 421 acgaggagta cctgactccc gtcactgtcg gaaagtccac cctccatctg gactttgaca 481 ccggatctgc agatctgtaa gcttccctgc tcgggtgttc gggcaaatcg tgactaacct 541 ggactagctg ggtcttctcg gacgagctcc cttcctcgga gcagaccggt cacgatctgt 601 acacgcctag ctccagcgcg accaagctga gcggctacac ttgggacatc tcctacggtg 661 acggcagctc ggccagcgga gacgtgtacc gggatactgt cactgtcggc ggtgtcacca 721 ccaacaagca ggctgttgaa gcagccagca agatcagctc cgagttcgtt cagaacacgg 781 ccaatgacgg ccttttggga ctggccttta gctccatcaa cactggtgag tcaatcctac 841 atcagccggg ttgacctacc tgctgaccga tagacagtcc agccaaaggc gcagacaacc 901 ttcttcgaca ccgtcaagtc ccagctggac tctccccttt tcgccgtgca gctgaagcac 961 gacgcccccg gtgtttacga ctttggctac atcgatgact ccaagtacac cggttctatc 1021 acctacacgg atgccgatag ctcccagggt tactggggct tcagcaccga cggctacagt 1081 atcggtgacg gcagctccag ctccagcggc ttcagcgcca ttgctggtaa gaaccgcctt 1141 catttaacac acaacttgtc cacctcttta ctaactagtg tatagacacc ggtaccaccc 1201 tcatcctcct cgatgacgaa atcgtctccg cctactacga gcaggtttct ggcgcctcag 1261 gagagacgga agccggtggc tacgttttct cttgctcgac caacccccct gacttcactg 1321 tcgtgattgg cgactacaag gccgttgttc cgggcaagta catcaactac gctcccatct 1381 cgactggcag ctccacctgc tttggcggta tccagagcaa cagcggtctg ggactgtcca 1441 tcctgggtga tgttttcttg aagagccagt acgtggtctt caactctgag ggccctaagc 1501 tgggattcgc cgctcaggct tagattatcc actgaagtgg agtctatgat ctgctgattg 1561 atccctcgac gatgaactac atgtggaaat gcatagcaga cgagggtgat ggtgatgatg 1621 ttgatttgat gatgacccgt acatacttga tgaagctcgg tacatatgca atgtgactgt 1681 atctatgtga tgatatatgt atcatctcat aacttttggc tatgagtgca ggataacact 1741 gaaccagtag tagtactttc cact // LOCUS BSUPSIF2A 2758 bp ds-DNA BCT 27-JUL-1990 DEFINITION B.subtilis protein synthesis initiation factor 2 (infB) gene, complete cds. ACCESSION M34836 KEYWORDS protein synthesis initiation factor 2. SOURCE B.subtilis (strain RS410) DNA, clones lambda-JET[1,2],pUK, and pJET1. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2758) AUTHORS Shazand,K., Tucker,J., Chiang,R., Stansmore,K., Sperling-Petersen,H.U., Grunberg-Manago,M., Rabinowitz,J.C. and Leighton,T. TITLE Isolation and molecular genetic characterization of the Bacillus subtilis gene (infB) encoding protein synthesis initiation factor 2 JOURNAL J. Bacteriol. 172, 2675-2687 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 381 2531 protein synthesis initiation factor 2 (infB) binding 369 375 ribosomal binding site (put.) BASE COUNT 917 a 535 c 662 g 644 t ORIGIN 1 atcgatgacc agattttcga tgaattgctg gaactggcgg aaaaggtgaa aaaataaaat 61 gtctggaatg gaatggtttc ccttgctggg tctggccaat cgagctcgta aggtcgtgtc 121 aggcgaagac ttggtaataa aagaaatcag gaatgcgcgt gcaaagcttg tcctgcttac 181 agaggatgca tcatctaaca cagcaaaaaa agtaactgac aagtgcaatt attataaagt 241 cccttataaa aaagtcgaga gtcgcgcggt tcttggacgc tctatcggta aagaagcccg 301 tgtcgttgtc gccgtcactg accaaggttt tgcgaataag ctgatcagct tgctcgatta 361 atatttttgg gggtgaacga atggctaaaa tgagagtata cgaatatgca aaagcgttaa 421 atgtttcaag taaggaaatt ttgaccgcac tgaagaacat ggatttagaa gtgaataatc 481 acatggccat gcttgaagaa aaggccatta aaaagctaga tgccaaatat aaaaaaggcg 541 gcgcacgtgc taaatctcaa aagccagcag aaacgaacaa aaacaaacag ccacaagggg 601 ttaatcagca atcagctgga aatcaaccaa ataaaattcg agacggaaag aagaatgacg 661 tgcagaataa tcaatttaac aaaaacaaga agaataacaa caacaaaaaa aataaacgca 721 acaacaacaa taataaaaac caacatcagc aaaagcctgt aaagccgaaa aaagagcttc 781 ctgagaaaat tacattctct ggcactttaa cagttggcgc acttgctgaa gagcttggca 841 aagagccttc agaactcatt aaaaagctga tgcttcttgg cgtaatggca accattaacc 901 aagagcttga taaagacaca atcgaactca ttgcatcaga atatggtgtt gaaacagaag 961 aggtcattgt gcttgaagaa acagagctgg aaaaatacga agagcctgat aatgaagagg 1021 atcttgaaat tcgtcctcct gtcgtgacaa tcatgggcca cgttgaccat gggaaaacaa 1081 cgcttcttga cagcatccgt aaaacaaagg ttgttgaagg ggaagcaggc ggaatcactc 1141 agcatatcgg cgcttatcaa attgaagaga acggcaagaa aatcacgttc ttggatacac 1201 cgggccacgc cgcatttaca acaatgcgcg cacgcggtgc agaagtaact gatattacga 1261 ttcttgtcgt agctgccgat gacggtgtca tgccgcaaac agttgaagcg attaaccatg 1321 caaaagcagc agaggttcca atcatcgttg ctgtgaataa aatagataaa gaatccgcaa 1381 accctgaccg tgtaatgcaa gaactgacgg aatacggact tgtccctgaa gcttggggcg 1441 gagaaaccat ttttgtgccg ctttccgctt taactggaaa aggcattgat gagctcgttg 1501 aaatgatttt gcttgtcagt gaggtagagg aactgaaagc gaatccgaat cgtcaggcaa 1561 aaggaacggt tattgaagct gaactcgata aaggcagagg gtcagtcgcg acgttgctcg 1621 tacagactgg aacactgcat gtcggtgatc cgatcgtagt cggcaataca tttggccgtg 1681 tccgtgcaat ggtcaacgac attggccgcc gtgtgaaaac tgccggcccg tcaactccgg 1741 ttgaaattac cggtttgaat gatgtccctc aagcgggaga ccaattcctt gtctttaagg 1801 atgaaaaaac agctcgttct gtcggtgaag cccgtgcttc gaaacagctt gaagagcagc 1861 gcagcgataa agcgaagctc agtcttgatg atttatttga gcaaattaag caaggtgatg 1921 taaaagacat caacctcatc gtaaaagctg acgttcaagg atctgctgaa gctttaacgg 1981 ctgcacttca aaaaattgaa gtagaaggcg ttaaagtgaa aatcatccat acaggcgttg 2041 gtgcgattac tgaatcagac attatcttgg catctgcttc caatgcaatt gttatcgggt 2101 ttaatgtgag accggacgga aatgctaaga gtacggctga agctgaaaat gtagatattc 2161 gacttcaccg tatcatttac aaagtaatcg acgagattga agctgccatg aaaggtatgc 2221 ttgatcctga atatgaagaa aaagtaattg gtcaagtaga agtacgccaa acattcaaag 2281 tatctaaaat cggtacaatt gccggcggat atgttactga aggaaccatt acgcgcgaca 2341 gcggcctccg tttaattcgt gacggcgtcg tcatctttga aggcgaagta gatgttctga 2401 aacgctttaa agacgatgtg aaagaagttt cacaaggcta tgaatgtggt attacaatta 2461 agaaatacaa tgacattcgt gaaggtgaca tccttgaagc gtttgtcatg caagaaattg 2521 aaagaacgtg atcggatttg cggagtgtga atgcatcatt tatgatgcag gatcgctaaa 2581 agaaaagcgt gccgttctga agcggatttt aaccagggtt caaaacaagt tcaatgtttc 2641 gatttcggag attggctatc aggacacctg gcaaagaacc agcttcggaa tcgccgctgt 2701 ttcttcctct cgcgttcaaa cagaaaaaga actgcagcgc gtcctagcgt ttatcgat // LOCUS CIBRCRA 1845 bp ds-DNA BCT 27-JUL-1990 DEFINITION Plasmid Colicin Ib-P9 replication control region replicon initiator protein (repZ) gene, complete cds. ACCESSION M34837 KEYWORDS replicon initiator protein. SOURCE Plasmid Colicin Ib-P9 (from E.coli) DNA, clone pCH10-delta-1. ORGANISM Plasmid Colicin Ib-P9 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1845) AUTHORS Hama,C., Takizawa,T., Moriwaki,H., Urasaki,Y. and Mizobuchi,K. TITLE Organization of the replication control region of plasmid Co1Ib-P9 JOURNAL J. Bacteriol. 172, 1983-1991 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 455 1486 repZ protein mRNA 157 1762 repZ protein mRNA (alt.) mRNA 158 1762 repZ protein mRNA (alt.) mRNA 363 < 1 (c) inc RNA binding 442 448 ribosomal binding site (put.) signal 377 372 (c) inc -10 region signal 400 395 (c) inc -35 region mut 353 353 c in wild type; t in inc3 mut 374 374 t in wild type; c in inc2 mut 400 400 a in wild type; c in inc1 BASE COUNT 505 a 467 c 490 g 383 t ORIGIN 1 gatctaaagc agaaaaatct gcttttaaaa atagatttta tttttttgac gcaggtcaag 61 attaacttat tagaggtatc agtgaggagg cactggaaga gaagagatcg ttgtaatgct 121 tttcaaatta acgtaaagcg ggtatatttc ggctgttatt agctgtgcag agggtggcac 181 tctgtggagc aaagcggcga aagccggacg gcagaatgcg ccataaggca ttcaggagag 241 atggcatgta cgggcagtaa gtcagaagac tgaagatgtt ccggaagcca taaaaggaaa 301 acccccacta tctttcttac gaacttggcg gaaagacgaa agatagtggg ggcctcacag 361 aatacgggta aagtataatg aaaccgtacc agagattcaa ccctgtgcag tgtataaata 421 cacggcacaa tcgctccgcc ataagcgaca gcttgtggca ggtctgaaga atactccata 481 taacgcagta cactggagtc agttagcacc cgaagagcag atccgtttct gggaagacta 541 tgaagcggga agggcgacca ctttcctggt tgaaccggaa aggaagcgca cgaagcgtcg 601 tcgcggtgag cactccacca aacccaaatg cgaaaatccg tcctggtatc gtcctgagcg 661 ctataaagcg ctgaaagggc agctcgggca cgcctataac cgtctggtga aaaaggaccc 721 ggtgaccggc gagcagagcc tgcgcatgcg catgtcgcgg catccttttt atgtacagaa 781 acggacgttc gtcggtcgca aatacgcttt ccggccggaa aaacaacgtc tcctcgatgc 841 catctggccg gttctggtca gcttcagtga tgccggcaca cacaccgtgg ggatgagcgt 901 cactcggctg gctgaagaaa tcagcccgaa agacagcgag ggacatgtta ttccggaact 961 ggaagtgacg gtctcccgtc tttcccgtct gctggccgaa caggtgcgct ttggtgtgct 1021 gggtgtgtca gaggaaacaa tgtgggaccg tgaacatcgt cagcgtctgc cacgttacgt 1081 ctggataaca ccggcaggct ggcagatgct gggcgtcgac atggtaaaac ttcacgaaca 1141 gcagcagaaa cgactgcgtg aaagtgaaat ccgccagcag ctcattcggg aaggtgttct 1201 gcgtgaggat gaagatatct ccgttcatgc ggccagaaaa cgctggtatc tgcagcgcag 1261 ccaggatgcc ctgaagaagc gccgagaaaa agccgcagcc agcaagcgtg ccaaccgcct 1321 gaagaaatta cctgttgacc agcagattta tgagatggct gagtatctcc ggaagcgtct 1381 gccgccggat gaagcctatt tctgttccga tgaccatctg aagcgactgg ccatcaggga 1441 gttgcgtcag cttgaactga cgctggctgc cccgccaccg cactagacag catcattccc 1501 tcagcactga atcatcgcca gcccctccgg ggctttcggc gcagattccg ctcagcccaa 1561 aatccttagt agtcacctta aatcccctca gaggggcata tctgcccata aaaccacgca 1621 tcagtcatca gaacgtggcc acgttgtttc agttatccac ataaatccgc aaacaaagaa 1681 ttttaagaag ctgcaaacct gaaacagcaa acctgcaata tattcttaac cccatcattt 1741 aatcccctgc gttgcttcgc cgcagggaaa gtctttatct ctgaaaccac tgtgaacaaa 1801 tacaaaaagg ccttcgcttg cagcgggcaa ggccgcgccg gaatt // LOCUS ECOHEMBA 1085 bp ds-DNA BCT 27-JUL-1990 DEFINITION E.coli porphobilinogen synthase (hemB) gene, complete cds. ACCESSION M35121 KEYWORDS 5-aminolevulinic acid dehydratase; hemB gene; porphobilinogen synthase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1085) AUTHORS Echelard,Y., Dymetryszyn,J., Drolet,M. and Sasarman,A. TITLE Nucleotide sequence of the hemB gene of Escherichia coli K12 JOURNAL Mol. Gen. Genet. 214, 503-508 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 80 1054 porphobilinogen synthase (hemB) signal 15 20 hemB -35 region signal 41 46 hemB -10 region BASE COUNT 254 a 277 c 301 g 253 t ORIGIN 1 tcgcgacaac tttcgtaaaa catccctacc ctgcttcagg tatactatgc ccctcgattc 61 cacaaacatc aggcagacca tgacagactt aatccaacgc cctcgtcgcc tgcgcaaatc 121 tcctgcgctg cgcgtatgtt tgaagagaca acacttagcc ttaacgacct ggtgttgccg 181 atctttgttg aagaagaaat tgacggacta caaagccgtt gaagccatgc caggcgtgat 241 gcgcattcca gagaaacatc tggcacgcga aattgaacgc atcgccaacg ccggtattcg 301 ttccgtgatg acttttggca tctctcacca taccgatgaa accggcagcg atgcctggcg 361 ggaagatgga ctggtggcgc gtatgtcgcg catctgcaag cagaccgtgc cagaaatgat 421 cgttatgtca gacacctgct tctgtgaata cacttctcac ggtcactgcg gtgtgctgtg 481 cgagcatggc gtcgacaacg acgcgactct ggaaaattta ggcaagcaag ccgtggttgc 541 agctgctgca ggtgcagact tcatcgcccc ttccgccgcg atggacggcc aggtacaggc 601 gattcgtcag gcgctggacg ctgcgggatt taaagatacg gcgattatgt cgtattcgac 661 caagttcgcc tcctcctttt atggcccgtt ccgtgaagct gccggaagcg cattaaaagg 721 cgaccgcaaa agctatcaga tgaacccaat gaaccctcgt gaggcgattc gtgaatcact 781 gctggatgaa gcccagggcg cagactgcct gatggttaaa cctgctggag cgtacctcga 841 catcgtgcgt gagctgcgtg aacgtactga attgccgatt ggcgcgtatc aggtgagcgg 901 tgagtatgcg atgattaagt tcgccgcgct ggcgggtgct atagatgaag agaaagtcgt 961 gctcgaaagc ttaggttcga ttaagcgtgc gggtgcggat ctgattttca gctactttgc 1021 gctggatttg gctgagaaga agattctgcg ttaattttgt ttcctctcag aaagggggag 1081 ttaac // LOCUS ECOSTMA 1418 bp ds-DNA BCT 27-JUL-1990 DEFINITION E.coli F factor (stm) gene, complete cds. ACCESSION M35124 KEYWORDS F factor; stm gene. SOURCE E.coli DNA, clones pEG[157,159,175,176,220]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1418) AUTHORS Golub,E.I. and Panzer,H.A. TITLE The F factor of Escherichia coli carries a locus of stable plasmid inheritance stm, similar to the parB locus of plasmid RI JOURNAL Mol. Gen. Genet. 214, 353-357 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 830 1009 F factor (stm) signal 1013 1039 pot. transcription termination signal signal 1074 1105 pot. transcription termination signal BASE COUNT 279 a 364 c 429 g 345 t 1 others ORIGIN 1 atcgctgtct gacctgcgat ttttcatgcc gtccctgacc gcagaagaac tgcacggcaa 61 ccgtcttcag tggctgtatg gcatcgatgt gctgattgaa acacagggcg aggtctgcct 121 ccttcccctt cccggtgatg cagcagagcg gtcgtttccg tcggtgcggt ttcgtgtcag 181 ggagcgcagt cgccataaat ccgcgctggt catgcagaag tacagccgcc agcaggcacg 241 ggaggcagag cagaaagccc gtgcgtacca ggcgctggtc gctcaggctg aaatcgaact 301 ggcattccat tcgccggaaa cagtcggaag ctggcatgca cgctggagtg accgggtggc 361 agagcatgac ctggaaaccc tgttctggca gtggggcgag cgttttccgt cactggccgg 421 aatggaacgc tggcagtggc aggatatgcc gttcgtggca ggttattgcg gaacgcagtc 481 ttgcagcaag agaggccggt cacgctgtcc gtgagatgag cgtgatgtgc cgaacaaact 541 ccggaggcag cgtgatgcgg gcaacaatca cacagattac ccgtaaacag cctgaatgag 601 cgggttattt tcaggaaaag tatgtgtggt cagcgtgcag ggatatggct atgatgtgcc 661 cggcgcttga ggctttctgc ctcatgacgt gaaggtggtt tgttgccgtg ttgtgtggca 721 gaaggacaaa agccccatag ttaatttttc attaacccac gaggcccctg catgcttaga 781 caacatcagg atagcctctt actgcgcttt cgacgaggag aagaaggcca tgaaactacc 841 acgcagctct cttgtctggt gtgtgttgat cgtgtgtctc acactgttga tattcactta 901 tctgacacga aaatcgctgt gcgagattcg ttacagagac ggatacaggg aggtngcggc 961 cggatacagg gaggtggcgg ctttcatggc ttacgaatcc ggtaagtagc aacctggagg 1021 cgggcgcagg cttttcaggg ctgatgctgg tctgactgca ctgatgcctt tataaagggg 1081 ctgctggttc gccggtggcc cctttctcct tgctggcatt gtacgggcat gaacaaccag 1141 actgccagga aggccgtttc cctctgggaa cggggaagaa cgtgcgcctg cggccgttct 1201 tttcccccgc cttctctgtt tatgacctgt cagaaatcac atcaccttcc tttgattatt 1261 ctctttttct ctgttctgac tctgattact gctgccctga ttcctgtctg ttttgttgtt 1321 ctcctggcag tgtcttttcc gggcgtcctg gcggggcggc tggcgtgaac ggagccagcg 1381 aagctgtctc tggcgggctt ccatccctga cgggaaac // LOCUS HUMGPIIBA 3333 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human platelet glycoprotein IIb (GPIIb) mRNA, complete cds. ACCESSION M34480 KEYWORDS platelet glycoprotein IIb. SOURCE Human megakaryocytes, cDNA to mRNA, clone IIb[3,4]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3333) AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and Marguerie,G. TITLE GPIIb and GPIIIa amino acid sequences deduced from human megakaryocyte cDNAs JOURNAL Mol. Biol. Rep. 14, 27-33 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 33 3152 platelet glycoprotein IIb (GPIIb) mRNA 1 3333 GPIIb mRNA BASE COUNT 626 a 998 c 1040 g 669 t ORIGIN 1 attcctgcct gggaggttgt ggaagaagga agatggccag agctttgtgt ccactgcaag 61 ccctctggct tctggagtgg gtgctgctgc tcttgggagc ttgtgctgcc cctccagcct 121 gggccttgaa cctggaccca gtgcagctca ccttctatgc aggccccaat ggcagccagt 181 ttggattttc actggacttc cacaaggaca gccatgggag agtggccatc gtggtgggcg 241 ccccgcggac cctgggcccc agccaggagg agacgggcgg cgtgttcctg tgcccctgga 301 gggccgaggg cggccagtgc ccctcgctgc tctttgacct ccgtgatgag acccgaaatg 361 taggctccca aactttacaa accttcaagg cccgccaagg actgggggcg tcggtcgtca 421 gctggagcga cgtcattgtg gcctgcgccc cctggcagca ctggaacgtc ctagaaaaga 481 ctgaggaggc tgagaagacg cccgtaggta gctgcttttt ggctcagcca gagagcggcc 541 gccgcgccga gtactccccc tgtcgcggga acaccctgag ccgcatttac gtggaaaatg 601 attttagctg ggacaagcgt tactgtgaag cgggcttcag ctcggtggtc actcaggccg 661 gagagctggt gcttggggct cctggcggct attatttctt aggtctcctg gcccaggctc 721 cagttgcgga tattttctcg agttaccgcc caggcatcct tttgtggcac gtgtcctccc 781 agagcctctc ctttgactcc agcaacccag agtacttcga cggctactgg gggtactcgg 841 tggccgtggg cgagttcgac ggggatctca acactacaga atatgtcgtc ggtgccccca 901 cttggagctg gaccctggga gcggtggaaa ttttggattc ctactaccag aggctgcatc 961 ggctgcgcgc agagcagatg gcgtcgtatt ttgggcattc agtcgctgtc actgacgtca 1021 acggggatgg gaggcatgat ctgctggtgg gcgctccact gtatatggac agccgggcag 1081 accgaaaact ggccgaagtg gggcgtgtgt atttgttcct gcagccgcga ggcccccacg 1141 cgctgggtgc ccccagcctc ctgctgactg gcacacagct ctatgggcga ttcggctctg 1201 ccatcgcacc cctgggcgac ctcgaccggg atggctacaa tgacattgca gtggctgccc 1261 cctacggggg tcccagtggc cggggccaag tgctggtgtt cctgggtcag agtgaggggc 1321 tgaggtcacg tccctcccag gtcctggaca gccccttccc cacaggctct gcctttggct 1381 tctcccttcg aggtgccgta gacatcgatg acaacggata cccagacctg atcgtgggag 1441 cttacggggc caaccaggtg gctgtgtaca gagctcagcc agtggtgaag gcctctgtcc 1501 agctactggt gcaagattca ctgaatcctg ctgtgaagag ctgtgtccta cctcagacca 1561 agacacccgt gagctgcttc aacatccaga tgtgtgttgg agccactggg cacaacattc 1621 ctcagaagct atccctaaat gccgagctgc agctggaccg gcagaagccc cgccagggcc 1681 ggcgggtgct gctgctgggc tctcaacagg caggcaccac cctggacctg gatctgggcg 1741 gaaagcacag ccccatctgc cacaccacca tggccttcct tcgagatgag gcagacttcc 1801 gggacaagct gagccccatt gtgctcagcc tcaatgtgtc cctaccgccc acggaggctg 1861 gaatggcccc tgctgtcgtg ctgcatggag acacccatgt gcaggagcag acacgaatcg 1921 tcctggactg tggggaagat gacgtatgtg tgccccagct tcagctcact gccagcgtga 1981 cgggctcccc gctcctagtt ggggcagata atgtcctgga gctgcagatg gacgcagcca 2041 acgagggcga gggggcctat gaagcagagc tggcggtgca cctgccccag ggcgcccact 2101 acatgcgggc cctaagcaat gtcgagggct ttgagagact catctgtaat cagaagaagg 2161 agaatgagac cagggtggtg ctgtgtgagc tgggcaaccc catgaagaag aacgcccaga 2221 taggaatcgc gatgttggtg agcgtgggga atctggaaga ggctggggag tctgtgtcct 2281 tccagctgca gatacggagc aagaacagcc agaatccaaa cagcaagatt gtgctgctgg 2341 acgtgccggt ccgggcagag gcccaagtgg agctgcgagg gaactccttt ccagcctccc 2401 tggtggtggc agcagaagaa ggtgagaggg agcagaacag cttggacagc tggggaccca 2461 aagtggagca cacctatgag ctccacaaca atggccctgg gactgtgaat ggtcttcacc 2521 tcagcatcca ccttccggga cagtcccagc cctccgacct gctctacatc ctggatatac 2581 agccccaggg gggccttcag tgcttcccac agcctcctgt caaccctctc aaggtggact 2641 gggggctgcc catccccagc ccctccccca ttcacccggc ccatcacaag cgggatcgca 2701 gacagatctt cctgccagag cccgagcagc cctcgaggct tcaggatcca gttctcgtaa 2761 gctgcgactc ggcgccctgt actgtggtgc agtgtgacct gcaggagatg gcgcgcgggc 2821 agcgggccat ggtcacggtg ctggccttcc tgtggctgcc cagcctctac cagaggcctc 2881 tggatcagtt tgtgctgcag tcgcacgcat ggttcaacgt gtcctccctc ccctatgcgg 2941 tgcccccgct cagcctgccc cgaggggaag ctcaggtgtg gacacagctg ctccgggcct 3001 tggaggagag ggccattcca atctggtggg tgctggtggg tgtgctgggt ggcctgctgc 3061 tgctcaccat cctggtcctg gccatgtgga aggtcggctt cttcaagcgg aaccggcaca 3121 ccctggaaga agatgatgaa gagggggagt gatggtgcag cctacactat tctagcagga 3181 gggttgggcg tgctacctgc accgcccctt ctccaacaag ttgcctccaa gctttgggtt 3241 ggagctgttc cattgggtcc tcttggtgtc gtttccctcc caacagagct gggctacccc 3301 ccctcctgct gcctaataaa gagactgagc cct // LOCUS HUMGPIIIAA 4487 bp ss-mRNA PRI 27-JUL-1990 DEFINITION Human platelet glycoprotein IIIa (GPIIIa) mRNA, complete cds. ACCESSION M35999 KEYWORDS platelet glycoprotein IIIa. SOURCE Human megakaryocytes, cDNA to mRNA, clone IIIa[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4487) AUTHORS Frachet,P., Uzan,G., Thevenon,D., Denarier,E., Prandini,M.H. and Marguerie,G. TITLE GPIIb and GPIIIa amino acid sequences deduced from human megakaryocyte cDNAs JOURNAL Mol. Biol. Rep. 14, 27-33 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 19 2385 plate glycoprotein IIIa (GPIIIa) mRNA < 1 2603 GPIIIa mRNA BASE COUNT 1046 a 1090 c 1186 g 1165 t ORIGIN 1 ccgcgggagg cggacgagat gcgagcgcgg ccgcggcccc ggccgctctg ggtgactgtg 61 ctggcgctgg gggcgctggc gggcgttggc gtaggagggc ccaacatctg taccacgcga 121 ggtgtgagct cctgccagca gtgcctggct gtgagcccca tgtgtgcctg gtgctctgat 181 gaggccctgc ctctgggctc acctcgctgt gacctgaagg agaatctgct gaaggataac 241 tgtgccccag aatccatcga gttcccagtg agtgaggccc gagtactaga ggacaggccc 301 ctcagcgaca agggctctgg agacagctcc caggtcactc aagtcagtcc ccagaggatt 361 gcactccggc tccggccaga tgattcgaag aatttctcca tccaagtgcg gcaggtggag 421 gattaccctg tggacatcta ctacttgatg gacctgtctt actccatgaa ggatgatctg 481 tggagcatcc agaacctggg taccaagctg gccacccaga tgcgaaagct caccagtaac 541 ctgcggattg gcttcggggc atttgtggac aagcctgtgt caccatacat gtatatctcc 601 ccaccagagg ccctcgaaaa cccctgctat gatatgaaga ccacctgctt gcccatgttt 661 ggctacaaac acgtgctgac gctaactgac caggtgaccc gcttcaatga ggaagtgaag 721 aagcagagtg tgtcacggaa ccgagatgcc ccagagggtg gctttgatgc catcatgcag 781 gctacagtct gtgatgaaaa gattggctgg aggaatgatg catcccactt gctggtgttt 841 accactgatg ccaagactca tatagcattg gacggaaggc tggcaggcat tgtccagcct 901 aatgacgggc agtgtcatgt tggtagtgac aatcattact ctgcctccac taccatggat 961 tatccctctt tggggctgat gactgagaag ctatcccaga aaaacatcaa tttgatcttt 1021 gcagtgactg aaaatgtagt caatctctat cagaactata gtgagctcat cccagggacc 1081 acagttgggg ttctgtccat ggattccagc aatgtcctcc agctcattgt tgatgcttat 1141 gggaaaatcc gttctaaagt cgagctggaa gtgcgtgacc tccctgaaga gttgtctcta 1201 tccttcaatg ccacctgcct caacaatgag gtcatccctg gcctcaagtc ttgtatggga 1261 ctcaagattg gagacacggt gagcttcagc attgaggcca aggtgcgagg ctgtccccag 1321 gagaaggaga agtcctttac cataaagccc gtgggcttca aggacagcct gatcgtccag 1381 gtcacctttg attgtgactg tgcctgccag gcccaagctg aacctaatag ccatcgctgc 1441 aacaatggca atgggacctt tgagtgtggg gtatgccgtt gtgggcctgg ctggctggga 1501 tcccagtgtg agtgctcaga ggaggactat cgcccttccc agcaggacga gtgcagcccc 1561 cgggagggtc agcccgtctg cagccagcgg ggcgagtgcc tctgtggtca atgtgtctgc 1621 cacagcagtg actttggcaa gatcacgggc aagtactgcg agtgtgacga cttctcctgt 1681 gtccgctaca agggggagat gtgctcaggc catggccagt gcagctgtgg ggactgcctg 1741 tgtgactccg actggaccgg ctactactgc aactgtacca cgcgtactga cacctgcatg 1801 tccagcaatg ggctgctgtg cagcggccgc ggcaagtgtg aatgtggcag ctgtgtctgt 1861 atccagccgg gctcctatgg ggacacctgt gagaagtgcc ccacctgccc agatgcctgc 1921 acctttaaga aagaatgtgt ggagtgtaag aagtttgacc ggggagccct acatgacgaa 1981 aatacctgca accgttactg ccgtgacgag attgagtcag tgaaagagct taaggacact 2041 ggcaaggatg cagtgaattg tacctataag aatgaggatg actgtgtcgt cagattccag 2101 tactatgaag attctagtgg aaagtccatc ctgtatgtgg tagaagagcc agagtgtccc 2161 aagggccctg acatcctggt ggtcctgctc tcagtgatgg gggccattct gctcattggc 2221 cttgccgccc tgctcatctg gaaactcctc atcaccatcc acgaccgaaa agaattcgct 2281 aaatttgagg aagaacgcgc cagagcaaaa tgggacacag ccaacaaccc actgtataaa 2341 gaggccacgt ctaccttcac caatatcacg taccggggca cttaatgata agcagtcatc 2401 ctcagatcat tatcagcctg tgccaggatt gcaggagtcc ctgccatcat gtttacagag 2461 gacagtattt gtggggaggg atttcggggc tcagagtggg gtaggttggg agaatgtcag 2521 tatgtggaag tgtgggtctg tgtgtgtgta tgtgggggtc tgtgtgttta tgtgtgtgtg 2581 ttgtgtgtgg gagtgtgtaa tttaaaattg tgatgtgtcc tgataagctg agctccttag 2641 cctttgtccc agaatgcctc ctgcagggat tcttcctgct tagcttgagg gtgactatgg 2701 agctgagcag gtgttcttca ttacctcagt gagaagccag ctttcctcat caggccattg 2761 tccctgaaga gaagggcagg gctgaggcct ctcattccag aggaagggac accaagcctt 2821 ggctctaccc tgagttcata aatttatggt tctcaggcct gactctcagc agctatggta 2881 ggaactgctg gcttggcagc ccgggtcatc tgtacctctg cctcctttcc cctccctcag 2941 gccgaaggag gagtcaggga gagctgaact attagagctg cctgtgcctt ttgccatccc 3001 ctcaacccag ctatggttct ctcgcaaggg aagtccttgc aagctaattc tttgacctgt 3061 tgggagtgag gatgtctggg ccactcaggg gtcattcatg gcctggggga tgtaccagca 3121 tctcccagtt cataatcaca acccttcaaa gatttgcctt attggcagct ctactctgga 3181 ggtttgttta gaagaagtgt gtcaccctta ggccagcacc atctctttac ctcctaattc 3241 cacaccctca ctgctgtaga catttgctat gacctgggga tgtctctcat gaccaaatgc 3301 ttttcctcaa agggagagag tgctattgta gagccagagg tctggcccta tgcttccggc 3361 ctcctgtccc tcatccatag cacctccaca tacctggccc tgagccttgg tgtgctgtat 3421 ccatccatgg ggctgattgt atttaccttc tacctcttgg ctgccttgtg aaggaattat 3481 tcccatgagt tggctgggaa taagtgccag gatggaatga tgggtcagtt gtatcagcac 3541 gtgtggcctg ttcttctatg ggttacaacc tcatttaact cagtctttaa tctgagaggc 3601 cacagtgcaa ttttatttta tttttctcat gatgaggttt tcttaactta aaagaacatg 3661 tatataaaca tgcttgcatt atatttgtaa atttatgtgt atggcaaaga aggagagcat 3721 aggaaaccac acagacttgg gcagggtaca gacactccca cttggcatca ttcacagcaa 3781 gtcactggcc agtggctgga tctgtgaggg gctctctcat gatagaaggc tatggggata 3841 gatgtgtgga cacattggac ctttcctgag gaagagggac tgttcttttg tcccagaaaa 3901 gcagtggctc cattggtgtt gacatacatc caacattaaa agccaccccc aaatgcccaa 3961 gaaaaaaaga aagacttatc aacatttgtt ccatgagcag aaaactggag ctctggcctc 4021 agtgttacag ctaaataatc tttaattaag gcaagtcact ttcttcttct taaagctgtt 4081 tctagtttga gaaatgatgg gattttagca gccagtcttg aaggtctctt tcagtatcaa 4141 cattctaaga tgctgggact tactgtgtca tcaaatgtgc ggttaagatt ctctgggata 4201 ttgatactgt ttgtgttttt agttgggaga tctgagagac ctggctttgg caagagcaga 4261 tgtcattcca tatcaccttt ctcaatgaaa gtctcattct atcctctctc caaacccgtt 4321 ttccaacatt tgttaatagt tacgtctctc ctgatgtagc acttaagctt catttagtta 4381 ttatttcttt cttcactttg cacacatttg catccacata ttagggaagg aataagtagc 4441 tgcaaactat ctattcctgt attattgtgt taacattgag ataaacc // LOCUS MUSCDX1A 1597 bp ss-mRNA ROD 27-JUL-1990 DEFINITION Mouse Cdx-1 homeobox protein mRNA, complete cds. ACCESSION M37163 M35125 KEYWORDS Cdx-1 homeobox protein. SOURCE Mouse 8.5 day embryo, cDNA to mRNA, clone BH8. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1597) AUTHORS Duprey,P., Chowdhury,K., Dressler,G.R., Balling,R., Simon,D., Guenet,J.-L. and Gruss,P. TITLE A mouse gene homologous to the Drosophila gene caudal is expressed in epithelial cells from the embryonic intestine JOURNAL Genes Dev. 2, 1647-1654 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 158 736 Cdx-1 homeobox protein site 389 571 Cdx-1 homeobox BASE COUNT 326 a 523 c 444 g 304 t ORIGIN 1 gccgcgcccg ctgggccctc cgacctatgc gccccctggc ccggcgcccg cgcccgcagt 61 accccgactt cgcgggttac acgcacgtgg agccggcgcc cgcgccctcc gacctgggct 121 gcgcttccct gcgcccaagg acgactgggc agctgctatg gcccgggccc cacggcctca 181 gcccgcagcc cggccccgct ggccttcggg ccctccggac tttagcccgg tgcccgcgcc 241 tcccgggcct ggtcccggca tcctagcgca gtccctcggg gctccgggcg caccgtcctc 301 gccaggagcg ccaaggcgga cgccctacga atggatgcgg cgcagcgtgg cggctgcagg 361 cggctgtggc agcggtaaga cccgaaccaa ggacaagtca cgtgtggtct acacagacca 421 ccaacgccta gagctggaaa aggagtttca ctacagccgg tacatcacta tccggcgcaa 481 gtccgagctg gctgctaacc tggggctcac agagcggcag gtaaagatct ggttccagaa 541 ccgccgggcc aaggagcgca aagtaaacaa gaagaaacag cagcagcagc agcccctgcc 601 tcccacacag ctgcccctgc ccctggatgg cactcccaca ccatcagggc cacccctagg 661 aagtctatgc cctactaatg ctggccttct gggcaccccc tccccagtgc ccgtcaagga 721 ggagtttcta ccctagcccc ttgcagcctg gggtctaggg atctagggac ttgaatgctg 781 ggcagctggc cttttcgggg cccaaggacg tgttctgagt gcagccctgc cttggccctt 841 tggggtcact gtggacaaac tgcctaccta ggacaagtta gcttgccctc ttcctgcctt 901 ccgttggctg ggctgtgtgg tgagcctgtt ggaccaagga gctcttccag ctcttgtgtc 961 ctagggcaca cagggtgatg gggaagccca attagaagtc tcaatcacct ggacatttcc 1021 caaagattca gaggctcaac gagctgtcaa gacaagattg aggcttcagc gtctcctcca 1081 agggttccag agtgaggtgg gaggctggta tctggaccag actggctctg gagagtaggc 1141 ggccattgaa acaaggtggt acaccacagt ctagactccc atgtgaaggg aagatgctat 1201 ctgcccacct gtctcctctt ccagcctcac tttcacctgc cctagcaccc catgctctgg 1261 ccctccccag cctggagcta caaagccatc aggactgaca tgaggtagag gctgggcaat 1321 tggccactgc tcttgagaga gaactcgccc ccaccttagt gtccagggag gcctgcggat 1381 ggagagaagg cactatgagt ctacctacct gcacatgccc accttcccac aggctgccca 1441 tcacacccca tttagccccc acaccatggg ggactcgcag ctgtacaggg cttatgtagg 1501 agtgtaatct cttggatgca gcttcagaaa taagttttcc cccttttaaa caatttataa 1561 aaatcataca aggcattaaa gaatttttaa aaagttt // LOCUS MUSHSP70B 3518 bp ds-DNA ROD 27-JUL-1990 DEFINITION Mouse heat shock protein 70.1 (hsp70.1) gene, complete cds. ACCESSION M35021 KEYWORDS heat shock protein 70.1. SOURCE Mouse (strain AJ) kidney DNA, clone pM[1.2,2.3]. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3518) AUTHORS Hunt,C. and Calderwood,S. TITLE Characterization and sequence of a mouse hsp70 gene and its expression in mouse cell lines JOURNAL Gene 87, 199-204 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 806 2734 heat shock protein 70.1 (hsp70.1) signal 507 512 inverted CCAAT box signal 547 552 TATA box BASE COUNT 827 a 993 c 1014 g 684 t ORIGIN 1 gatctcttct atttccctat tcaaacctaa aatgaagagg gagggggaga catggacaag 61 caagcattcc acaggcgccc ctgcccaacg ctgtcactca aaccaggacc caatcacaga 121 ctttttagcc aagccttatc ccgcctctct tgagaaactt tctgcgtccg ccatcctgta 181 ggaaggattt gtacacttta aactccctcc ctggtctgag tcccacactc tcaccaccca 241 gcaccttcag gagctgaccc ttaacagctt cacccacagg gaccccgaag ttgcgtcgcc 301 tccgcaacag tgtcaatagc agcaccagca cttccccaca ccctccccct caggaatccg 361 tactctctag cgaaccccag aaacctctgg agagttctgg acaagggcgg aacccacaac 421 tccgattact caagggaggc ggggaagctc caccagacgc gaaactgctg gaagattcct 481 ggccccaagg cctcctccgg ctcgctgatt ggcccagcgg agagtgggcg gggccggtga 541 agactcctta aaggcgcagg gcggcgagca gggcaccaga cgctgacagc tactcagaat 601 caaatctggt tccatccaga gacaagcgaa gacaagagaa gcagagcgag cggcgcgttc 661 ccgatcctcg gccaggacca gccttcccca gagcatccac gccgcggagc gcaaccttcc 721 caggagcatc cctgccgcgg agcgcaactt tccccggagc atccacgccg cggagcgcag 781 ccttccagaa gcagagcgcg gcgccatggc caagaacacg gcgatcggca tcgacctggg 841 caccacctac tcgtgcgtgg gcgtgttcca gcacggcaag gtggagatca tcgccaacga 901 ccagggcaac cgcacgaccc ccagctacgt ggccttcacc gacaccgagc gcctcatcgg 961 ggacgccgcc aagaaccagg tggcgctgaa cccgcagaac accgtgttcg acgcgaagcg 1021 gctgatcggc cgcaagttcg gcgatgcggt ggtgcagtcc gacatgaagc actggccctt 1081 ccaggtggtg aacgacggcg acaagcccaa ggtgcaggtg aactacaagg gcgagagccg 1141 gtcgttcttc ccggaggaga tctcgtccat ggtgctgacg aagatgaagg agatcgctga 1201 ggcgtacctg ggccacccgg tgaccaacgc ggtgatcacg gtgcccgcct acttcaacga 1261 ctctcagcgg caggccacca aggacgcggg cgtgatcgcc ggtctaaacg tgctgcggat 1321 catcaacgag cccacggcgg ccgccatcgc ctacgggctg gaccggaccg gcaagggcga 1381 gcgcaacgtg ctcatcttcg acctgggggg cggcacgttc gacgtgtcca tcctgacgat 1441 cgacgacggc atcttcgagg tgaaggccac ggcgggcgac acgcacctgg gaggggagga 1501 cttcgacaac cggctggtga gccacttcgt ggaggagttc aagaggaagc acaagaagga 1561 catcagccag aacaagcgcg cggtgcggcg gctgcgcacg gcgtgtgaga gggccaagag 1621 gacgctgtcg tccagcaccc aggccagcct ggagatcgac tctctgttcg agggcatcga 1681 cttctacaca tccatcacgc gggcgcggtt cgaagagctg tgctcggacc tgttccgcgg 1741 cacgctggag cccgtggaga aggccctgcg cgacgccaag atggacaagg cgcagatcca 1801 cgacctggtg ctggtgggcg gctcgacgcg catccccaag gtgcagaagc tgctgcagga 1861 cttcttcaac gggcgcgacc tgaacaagag catcaacccg gacgaggcgg tggcctacgg 1921 ggcggcggtg caggcggcca tcctgatggg ggacaagtcg gagaacgtgc aggacctgct 1981 gctgctggac gtggcgccgc tgtcgctggg cctggagact gcgggcggcg tgatgacggc 2041 gctcatcaag cgcaactcca ccatccccac caagcagacg cagaccttca ccacctactc 2101 ggacaaccag cccggggtgc tgatccaggt gtacgagggc gagagggcca tgacgcgcga 2161 caacaacctg ctggggcgct tcgagctgag cggcatcccg ccggcgccca ggggcgtgcc 2221 gcagatcgag gtgaccttcg acatcgacgc caacggcatc ctgaacgtca cggccaccga 2281 caagagcacc ggcaaggcca acaagatcac catcaccaac gacaagggcc gcctgagcaa 2341 ggaggagatc gagcgcatgg tgcaggaggc cgagcgctac aaggccgagg acgaggtgca 2401 gcgcgacagg gtggccgcca agaacgcgct cgagtcctat gccttcaaca tgaagagcgc 2461 cgtggaggac gagggtctca agggcaagct cagcgaggct gacaagaaga aggtgctgga 2521 caagtgccag gaggtcatct cctggctgga ctccaacacg ctggccgaca aggaggagtt 2581 cgtgcacaag cgggaggagc tggagcgggt gtgcagcccc atcatcagtg ggctgtacca 2641 gggtgcgggt gctcctgggg ctgggggctt cggggcccag gcgccgccga aaggagcctc 2701 tggctcagga cccaccatcg aggaggtgga ttagaggcct ctgctggctc tcccggtgtg 2761 gtctagaaaa cagactcttt gcacttgata gctgcttggg caccgattac tgtcaaggtt 2821 atttaaagtc ttcttcatgg ttcagtttaa agttacagtc tttcttaagg taattgcgtt 2881 gactgttaaa ttttgtatgc atatatatat atatatatat atatatatat atattcaaat 2941 atattcaaag taatgttggg agcagcactg tgcactgtac caggggatta tgttttatag 3001 ctaatgatgt gtaaagtcta aagatttttt tgtaattttt atatcagtgt tccagtagcc 3061 tgggaagaca tatagtctag ctgcccagtt ccctggagat ggtcatctct aagacaaagt 3121 gtcttaaaca aacgtcttgg cactgtgtac tacataactt tactcttttg tacttaaaac 3181 tttatctgct tgtccatgtt aaggttttgt ggtataacca gtatgttctt tgcatttaat 3241 ctaagtaggt taaagatggt gtatccttcc tgcatacatg tctacactgc caccctgtgt 3301 acattttttt ctttgcatca ctacaaacta atgaaaaaaa cttttatgac ttaaatattc 3361 aaaataaaag gttacaagta tattttgtct gtttgtatgt tggaagggct aatggattct 3421 gggcttctgt ggatttctta agtttttttt aagatttatt attatatgtg aacacattgt 3481 agctatcttc agacacacca gaaaagggca tcagatct // LOCUS RDVS10 1319 bp ds-RNA VRL 27-JUL-1990 DEFINITION Rice dwarf virus segment S10 mRNA, encoding an ORF, complete cds. ACCESSION D00241 KEYWORDS S10 RNA. SOURCE Rice dwarf virus, cDNA to viral RNA. ORGANISM Rice dwarf virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus. REFERENCE 1 (bases 1 to 1319) AUTHORS Omura,T., Minobe,Y. and Tsuchizaki,T. TITLE Nucleotide sequence of segment S10 of the rice dwarf virus genome JOURNAL J. Gen. Virol. 69, 227-231 (1988) STANDARD simple automatic COMMENT Three contiguous termination codons were detected at the third triplet downstream from the first stop codon. Corresponds to DDBJ (rel3.0) entry <RDVS1O>(D00241) dated 20-JUN-1988 FEATURES from to/span description pept 27 1088 ORF BASE COUNT 354 a 279 c 284 g 402 t ORIGIN 5' end of the plus-sense strand of segment S10. 1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc 61 atcatgagct cctttgtgct cacgaaggac caagtattat ttccaagttt gatgcaatta 121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag 181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact 241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg 301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt 361 ttatctttat ccctgagact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc 421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt 481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc 541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag 601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat 661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac 721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt 781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc 841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc 901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca 961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta 1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca 1081 gttcctgatc gcactgatga tagtgctttt catttatcaa aatattacgg tggggtcatc 1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac 1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccataggtg ggttagagtc 1261 gtgtgcgcac gcacttagtt ctactaatgc gtgcccccac actgcgcagg gattctgat // LOCUS RDVS10A 1320 bp ds-RNA VRL 27-JUL-1990 DEFINITION Rice dwarf virus segment 10 mRNA, encoding an ORF, complete cds. ACCESSION M35118 KEYWORDS . SOURCE Rice dwarf virus, cDNA to viral RNA, clones pRD[545,112,367]. ORGANISM Rice dwarf virus Viridae; ds-RNA nonenveloped viruses; Reoviridae; Fijivirus. REFERENCE 1 (bases 1 to 1320) AUTHORS Uyeda,I., Matsumura,T., Sano,T., Ohshima,K. and Shikata,E. TITLE Nucleotide sequence of rice dwarf virus genome segment 10 JOURNAL Proc. Jpn. Acad. 63, 227-230 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 27 1088 ORF BASE COUNT 357 a 280 c 282 g 401 t ORIGIN 1 ggtaaacttg cgcctttctg acgaacatgg aagtagacac tgctacgttt gttcggcttc 61 atcatgagct cctttctgct cacgaaggac caagtattat ttccaagttt gatgcaatta 121 aaaaagtcaa gcttggtaca cttgctaatc aatctggtgg tgctaataac atcaccgaag 181 ctttcttggc taagcttcga aattttgaaa gaaagtccga agcctatttg gcatcagact 241 tagctgaacg tgaattgact agggatacgc acaaggccat tgtgtttgta accaagtctg 301 tattgttagg agggaaaagt ttgaaagact tgctacctta tggagtgata gtgtgtgctt 361 ttatctttat ccctgaaact gcttctgtct tggacaacgt ccctgtcatg atcgggaacc 421 aaaaacgacc attgacagta gctttgataa aatacatcgc taaatctttg aattgcgatt 481 tagttggtga ctcatatgat actttttatt actgcaattc gtctgcttat ggtaagaatc 541 tgatttcagt atcagacaat gatttttcta atccccaaag ggctctcctc tcagtgggag 601 atttatgtta tcaagcagcg cgttcactcc acgttgcagc agctaattat ataaggatat 661 ttgaccgcat gcctcctggc tttcagccgt caaaacacct tttccgtatc ataggtgtac 721 tagatatgga aactttgaag actatggtga catcaaacat tgctcgtgag ccaggcatgt 781 tttgtcatga taatgttaaa gatgtactgc accgtatagg tgtttactct ccgaaccacc 841 acttttccgc agtcatcttg tggagggggt gggcttccac gtatgcatat atgtttaacc 901 aagaacaatt aaatatgcta tcaggaacat ctggtttagc tggagacttt ggcaagtaca 961 aattaacata tggatccact ttcgatgaag gtgttattca tgtgcaatat caatttgtta 1021 ctccagaagt cgtccgcaag cgaaatatat acccggatct gtctgccctt aaaggcggca 1081 gttcctaatc gcactgatga tagtgcttat catttatcaa aatattacgg tggggtcatc 1141 accatcgtgt aatacgtcga cggattgggt gaattatccg tcttgcaact gctgaacaac 1201 ccaaccagtt gatccccatg ttcaaaaact gcccgagctt cccatagggt gggttagagt 1261 cgtgtgcgca cgccacttag ttctactaat gcgtgccccc acatgcgcag ggattctgat // LOCUS SRVAGSM 3651 bp ss-RNA VRL 27-JUL-1990 DEFINITION Sapporo rat virus mRNA M segment, encoding an ORF, complete cds. ACCESSION M34882 KEYWORDS . SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero E6 cell line C1008. ORGANISM Sapporo rat virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Hantavirus. REFERENCE 1 (bases 1 to 3651) AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and Schmaljohn,C.S. TITLE Coding properties of the S and the M genome segments of Sapporo rat virus: Comparison to other causative agents of hemorrhagic fever with renal syndrome JOURNAL Virology 176, 114-125 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 47 3448 M segment ORF BASE COUNT 1119 a 692 c 755 g 1085 t ORIGIN 1 tagtagtaga ctccgcaaga aacagcagtt aaagaacaat aggatcatgt ggagtttgct 61 attactggcc gctttagttg gccaaggctt tgcattaaaa aatgtatttg acatgagaat 121 tcagttgccc cactcagtca actttgggga aacaagtgtg tcaggctata cagaatttcc 181 cccactctca ttacaggagg cagaacagct agtgccagag agctcatgca acatggacaa 241 ccaccagtca ctctcaacaa taaataaatt aaccaaggtc atatggcgga aaaaagcaaa 301 tcaggaatca gcaaaccaga attcatttga agttgtggaa agtgaagtca gctttaaagg 361 gttgtgtatg ttaaagcata gaatggttga agaatcatat agaaatagga gatcagtaat 421 ctgttatgat ctagcctgta atagtacatt ctgtaaacca actgtttata tgattgttcc 481 tatacatgct tgcaacatga tgaaaagctg tttgattggc cttggcccct acagaatcca 541 ggttgtctat gaaaggacat actgcactac gggtatattg acagaaggaa aatgctttgt 601 ccctgacaag gctgttgtca gtgcattgaa aagaggcatg tatgctatag caagcataga 661 gacaatctgc ttttttattc atcagaaagg gaatacatat aagatagtga ctgccattac 721 atcagcaatg ggctccaaat gtaataatac agatactaaa gttcaaggat attatatctg 781 tattattggt ggaaactccg cccctgtata tgcccctgct ggtgaagact tcagagcaat 841 ggaggttttt tctgggatta ttacatcacc acatggagaa gaccatgacc tacccggcga 901 agaaatcgca acgtaccaga tttcagggca gatagaggca aaaatccctc atacagtgag 961 ctccaaaaac ttaaaattga ctgcttttgc aggtattcca tcatactcat caactagtat 1021 attggctgct tcagaagatg gtcgtttcat atttagtcct ggtttatttc ctaacctaaa 1081 tcagtcagtc tgtgacaaca atgcactccc tttaatctgg aggggcctaa ttgatttaac 1141 gggatactat gaggcagtcc acccttgcaa tgtgttctgt gtcttatcag gaccaggtgc 1201 ttcatgtgag gccttttcag aaggaggtat tttcaatatt acttctccaa tgtgtctggt 1261 gtctaagcaa aatagattta gagcagctga gcagcagatt agctttgtct gccaaagagt 1321 tgatatggat attatagtgt actgtaatgg tcagaaaaaa acaatcctaa caaaaacatt 1381 agttataggc caatgtattt atactattac aagtctcttt tcactgttac caggggttgc 1441 ccattctatt gctattgagt tgtgtgttcc agggtttcat ggctgggcca cagctgcact 1501 tttgattaca ttctgcttcg gctgggtatt gattcctgca tgtacattag ctattctttt 1561 agtccttaag ttctttgcaa atatccttca tacaagcaat caagagaacc gattcaaagc 1621 cattctacgg aaaataaagg aggagtttga aaaaacaaag ggttccatgg tttgtgagat 1681 ctgtaagtat gagtgtgaaa cattaaagga attgaaggca cataacctat catgtgttca 1741 aggagagtgc ccatattgct ttacccactg tgaaccgaca gaaactgcaa ttcaggcaca 1801 ttacaaagtt tgtcaagcca cccaccgatt cagagaagat ttaaaaaaga ctgtaactcc 1861 tcaaaatatt gggccaggct gttaccgaac actaaatctt tttaggtata aaagtaggtg 1921 ttatattctg acaatgtgga ctcttcttct cattattgaa tccatcctct gggcagcaag 1981 tgcagcagaa atcccccttg tccctctctg gacagataat gctcatggcg ttgggagtgt 2041 tcctatgcat acggatcttg aattagactt ctctttgcca tccagttcta agtacacata 2101 caaaagacat ctcacaaacc cagttaatga ccaacagagt gtctcattgc atatagaaat 2161 tgaaagtcaa ggcattggtg ctgctgttca tcatcttgga cattggtatg atgcaagatt 2221 gaatctaaaa acctcatttc attgttatgg tgcctgcaca aaatatcaat acccatggca 2281 cactgcaaaa tgccattttg agaaagatta tgagtatgaa aatagctggg cttgcaaccc 2341 cccagattgc ccaggggttg gtacaggttg tactgcttgt ggattatatc tagatcaatt 2401 gaagccggta ggaacagcct ttaaaattat aagtgtaaga tacagtagaa aagtgtgcgt 2461 gcagtttggt gaagaacacc tttgtaaaac aattgatatg aatgattgct ttgtgactag 2521 gcatgccaaa atatgtataa ttgggactgt atctaagttt tctcaaggtg acactctact 2581 atttctgggg cccatggaag gaggtggtat aatctttaaa cactggtgta catctacctg 2641 tcactttgga gaccctggtg atgtcatggg tccaaaagat aaaccattta tttgccctga 2701 attcccaggg caatttagga aaaaatgtaa ctttgccaca actccagttt gtgaatatga 2761 tggaaacatt atatcaggct ataagaaagt acttgcaaca attgattctt tccaatcatt 2821 taacacaagc aatatacact tcactgatga gagaattgaa tggagagacc ctgatggcat 2881 gcttcgggat catattaata ttgttatttc taaagatatt gattttgaaa atttggctga 2941 gaatccttgt aaagtagggc tccaggcagc aaacatagaa ggtgcctggg gttcaggtgt 3001 cgggtttaca ctcacatgca aggtgtctct cacagaatgc ccaacatttc ttacatcaat 3061 aaaggcctgt gacatggcaa tttgttatgg tgcagaaagt gtgacactct cacgaggaca 3121 aaatactgtc aaaattaccg ggaaaggtgg ccatagtggt tcttcattca aatgctgtca 3181 tgggaaagaa tgttcatcaa ctggcctcca agccagtgca ccacatctgg ataaggtaaa 3241 tggtatctct gagttagaaa acgagaaagt ttatgatgac ggtgcacctg aatgtggcat 3301 tacttgttgg tttaaaaaat caggtgaatg ggttatgggt ataatcaatg ggaactgggt 3361 tgtcctaatt gtcttgtgtg tactgctgct cttttctctt atcctgttga gcatcttgtg 3421 tcctgttaga aagcataaaa aatcataaat cccacctaac aatcttcaca tcatgtatcg 3481 attttcaaac actttatcat ttagaactta acttggcact actatctgat aactgacttt 3541 catttttatt tttatatgga ttaattacta aaaaaaatac tctcttctat ctcccaatct 3601 tttattgatt caccggggtg ctgtcttgac atctggcggc gtctactact a // LOCUS SRVAGSS 1769 bp ss-RNA VRL 27-JUL-1990 DEFINITION Sapporo rat virus mRNA S segment, encoding an ORF, complete cds. ACCESSION M34881 KEYWORDS . SOURCE Sapporo rat virus (strain SR-11), cDNA to viral RNA, passed in Vero E6 cell line C1008. ORGANISM Sapporo rat virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Bunyaviridae; Hantavirus. REFERENCE 1 (bases 1 to 1769) AUTHORS Arikawa,J., Lapenotiere,H.F., Iacono-Connors,L., Wang,M. and Schmaljohn,C.S. TITLE Coding properties of the S and the M genome segments of Sapporo rat virus: Comparison to other causative agents of hemorrhagic fever with renal syndrome JOURNAL Virology 176, 114-125 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 43 1332 S segment ORF BASE COUNT 569 a 331 c 410 g 459 t ORIGIN 1 tagtagtaga ctccctaaag agctactaca ctaacaagaa aaatggcaac tatggaagaa 61 atccagagag aaatcagtgc tcacgagggg cagcttgtga tagcacgcca gaaggtcaag 121 gatgcagaaa agcagtatga gaaggatcct gatgacttaa acaagagggc actgcatgat 181 cgggagagtg tcgcagcttc aatacaatca aaaattgatg aattgaagcg ccaacttgcc 241 gacagattgc agcagggaag aacatccggg caggaccggg atcctacagg ggtagagcca 301 ggtgatcatc ttaaggaaag atcagcacta agctacggga atacactgga cctgaatagt 361 cttgacattg atgaacctac aggacagaca gctgattggc tgaccataat tgtctatctg 421 acatcattcg tggtcccgat catcttgaag gcactgtaca tgttaacaac acgaggtagg 481 cagacttcaa aggacaacaa ggggatgagg atcagattca aggatgacag ctcatatgag 541 gatgtcaatg gaatcagaaa gcccaaacat ctgtatgtgt caatgccaaa cgcccaatcc 601 agcatgaagg ctgaagagat aacaccagga agattccgca ctgcagtatg tggactatat 661 cctgcacaga taaaggcaag gaatatggta agccctgtca tgagtgtagt tgggttcttg 721 gcactggcaa aagactggac atcgagaatt gaagaatggc tcggtgcacc ctgcaaattc 781 atggcggagt ctcttattgc cgggagttta tctgggaatc ctgtgaatcg tgactatatc 841 agacagagac aaggtgcact tgcagggatg gagccaaagg aatttcaagc cctcaggcaa 901 cattcaaagg atgctggatg tacactagtt gaacatattg agtcaccatc atcaatatgg 961 gtgtttgctg gggcccctga taggtgtcca ccaacatgct tgtttgtcgg agggatggct 1021 gaattaggtg ccttcttttc tatacttcag gatatgagga acacaatcat ggcttcaaaa 1081 actgtgggca cagctgatga aaagcttcga aagaaatcat cattctatca atcatacctc 1141 agacgcacac aatcaatggg aatacaactg gaccagagga taattgttat gtttatggtt 1201 gcctggggaa aggaggcagt ggacaacttt catctcggtg atgacatgga tccagagctt 1261 cgtagcctgg ctcagatctt gattgaccag aaagtgaagg aaatctcaaa ccaggaacct 1321 atgaaattat aagtacataa atatataatc aatactaact ataggttaag aaatactaat 1381 cattagttaa taagaatata gatttattga ataatcatat taaataatta ggtaagttaa 1441 ctagtattta gttaagttag ctaattgatt tatatgattg tcacaattaa atgtaatcat 1501 aagcacaatc actgccatgt ataatcacgg gtatacgggt ggttttcata tggggaacag 1561 ggtgggctta gggccaggtc accttaagtg accttttttt gtatatatgg atgtagattt 1621 caattgatcg aatactaatc ctactgtcct cttttctttt cctttctcct tctttactaa 1681 caacaacaaa ctacctcaca ccttaatata tactacttta ttaagttgtt aagttgtgtc 1741 tttttgggga gtaagggagt ctactacta // LOCUS TRNSTIIA 1315 bp ds-DNA BCT 27-JUL-1990 DEFINITION Transposon Tn4521 (from E.coli) heat-stable toxin II gene, complete cds. ACCESSION M35123 KEYWORDS heat-stable toxin II; insertion sequence; transposase. SOURCE Transposon Tn4521 (from Escherichia coli) DNA. ORGANISM Transposon Tn4521 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1315) AUTHORS Hu,S.T. and Lee,C.H. TITLE Characterization of the transposon carrying the STII gene of enterotoxigenic Escherichia coli JOURNAL Mol. Gen. Genet. 214, 490-495 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 879 400 (c) heat-stable toxin II (STII) trns 88 607 insertion sequence IS2 (bases 17-536) trns 608 970 insertion sequence IS2 (bases 969-1327) BASE COUNT 313 a 349 c 320 g 333 t ORIGIN 1 gccgccggca gatggagctg gatattctaa aaaagacccg ggcatcagta tcagtcacct 61 gaacaacaga gaaaaaacaa agcttgcttt ccagacacct gttatcactt aacccattac 121 tggcccgctg ccgtagatat tcccgtggcg agcgataacc cagtgcacta tgcggatgcc 181 attcgttata atgctcgaac gcctctgcaa ggttctttgc tgccgttaac ccgtctggtt 241 tgggcatgat actgatgtaa tcacgcttta tcgttttcac gaagctctct gctatgccgt 301 tactctccgg actccgcacc gccgtgttct tcggttcaag tcccaacatc cgggcgaact 361 ggcgtgtttc attagcccgg tagcatgaac cattatccgt cagccattcc actggcgata 421 tcggaagctc gctgccgaag gggcgtacca ccgctcccag catgacgtcc tgtactgttt 481 cactgtcgaa gccgcccgta gtaaccgccc agtgcagtgc ctcacggtca cagcagtcca 541 gcgcgaacgt gactcgcagt ttttctccgt tatcacagcg gaactcgaac ccgtcagagc 601 accatcgttt tttttgccga gcaggcgctg gagttcttta atctgcttca tggcagcagc 661 aagttcagag gcaggaacga cctgttctcc tgcggccaca gcagtaagac ttcccttctg 721 gtattgcttg cgccagagaa ataactggct ggctgccgca ccgtgttgcc gggtaacaag 781 ggagcccgtc attcccggtt caaagctctg ctgaacaata gcgatctttt cctgtgtggt 841 acgccgtctg cgtttctccg gccctaagac atcactcatc tgctctccaa tgactagtct 901 aaaaactagt attaagacta tcacttaaat aagtgatatt ggttgtctgg agattcaggg 961 ggccagtcta cggcctatgt cggtctggtg tcaaaacagt caggctccgg agataaagtt 1021 cgactactgg gaataagcaa gtgtggtgat acttatctcc gaacattatt tattcatgga 1081 gcaagggcgg cagcattgct ggtcaaagaa ctagggccat ggatacacaa cttaagatac 1141 aacgactgtc tggtgtttca gggggccagc tcaaacattt ggcaaagata tacgtcttct 1201 ttacaatccg gaggtgttaa cggctctact tgtatcagct acgttctggt tatacgtaat 1261 agcataagaa tggtcgtgcg catacacgct ctggaaagca gaactagctg gacga //
GenBank-Updates@genbank.bio.net (07/28/90)
LOCUS MUSMDR1A 4924 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Mouse P-glycoprotein (mdr1a) mRNA, complete cds. ACCESSION M33581 KEYWORDS P-glycoprotein. SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line J774.2-vinblastine resistant subline J7.V1-1, cDNA to mRNA, library pUC18-cDNA and pGEM-zf, clones pV1.PRC2, pV1.3, pV1.20, and pV1.10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4924) AUTHORS Hsu,S., Cohen,D., Lothstein,L., Kirschner,L.S., Hartstein,M. and Horwitz,S.B. TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter reveals the basis for differential transcript heterogeneity in multidrug-resistant J774.2 cells JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.S.Kirschner, 05-APR-1990. Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461 FEATURES from to/span description pept 137 3967 P-glycoprotein (mdr1a) signal 4315 4320 poly-A signal signal 4898 4903 poly-A signal BASE COUNT 1450 a 1021 c 1210 g 1243 t ORIGIN Chromosome 5. 1 acagtggaac agcggtttcc aggagctgct ggtcccatct tccaaggctc tgctcaactc 61 agagccgctt cttccaaagt ctacatcttg gtggactttg cagaggaaac cgggagtaga 121 gacacgtgag gccgtgatgg aacttgaaga ggaccttaag ggaagagcag acaagaactt 181 ctcaaagatg ggcaaaaaga gtaaaaagga gaagaaagaa aagaaaccag cagtcagtgt 241 gcttacaatg tttcgttatg caggttggct agacaggttg tacatgctgg tgggaactct 301 ggctgctatt atccatggag tggcgctccc acttatgatg ctgatctttg gtgacatgac 361 agatagcttt gcaagtgtag gaaacgtctc taaaaacagt actaatatga gtgaggccga 421 taaaagagcc atgtttgcca aactggagga agaaatgacc acgtacgcct actattacac 481 cgggattggt gctggtgtgc tcatagttgc ctacatccag gtttcatttt ggtgcctggc 541 agctggaaga cagatacaca agatcaggca gaagtttttt catgctataa tgaatcagga 601 gataggctgg tttgatgtgc atgacgttgg ggagctcaac acccggctca cagatgatgt 661 ttccaaaatt aatgaaggaa ttggtgacaa aatcggaatg ttcttccagg caatggcaac 721 attttttggt ggttttataa taggatttac ccgtggctgg aagctaaccc ttgtgatttt 781 ggccatcagc cctgttcttg gactgtcagc tggtatttgg gcaaagatat tgtcttcatt 841 tactgataag gaactccatg cttatgcaaa agctggagca gttgctgaag aagtcttagc 901 agccatcaga actgtgattg cgtttggagg acaaaagaag gaacttgaaa ggtacaataa 961 caacttggaa gaagctaaaa ggctggggat aaagaaagct atcacggcca acatctccat 1021 gggtgcagct tttctcctta tctatgcatc atatgctctg gcattctggt atgggacttc 1081 cttggtcatc tccaaagaat actctattgg acaagtgctc actgtcttct tttccgtgtt 1141 aattggagca ttcagtgttg gacaggcatc tccaaatatt gaagccttcg ccaatgcacg 1201 aggagcagct tatgaagtct tcaaaataat tgataataag cccagtatag acagcttctc 1261 aaagagtggg cacaaaccag acaacataca aggaaatctg gaatttaaga atattcactt 1321 cagttaccca tctcgaaaag aagttcagat cttgaagggc ctcaatctga aggtgaagag 1381 cggacagacg gtggccctgg ttggcaacag tggctgtgga aaaagcacaa ctgtccagct 1441 gatgcaaagg ctctacgacc ccctagatgg catggtcagt atcgacggac aggacatcag 1501 aaccatcaat gtgaggtatc tgagggagat cattggtgtg gtgagtcagg aacctgtgct 1561 gtttgccacc acgatcgccg agaacattcg ctatggccga gaagatgtca ccatggatga 1621 gattgagaaa gctgtcaagg aagccaatgc ctatgacttc atcatgaaac tgccccacca 1681 atttgacacc ctggttggtg agagaggggc gcacgtgagt gggggacaga aacagagaat 1741 cgccattgcc cgggccctgg tccgcaatcc caagatcctt ttgttggacg aggccacctc 1801 agccctggat acagaaagtg aagctgtggt tcaggccgca ctggataagg ctagagaagg 1861 ccggaccacc attgtgatag ctcatcgctt gtctaccgtt cgtaatgctg acgtcattgc 1921 tggttttgat ggtggtgtca ttgtggagca aggaaatcat gatgagctca tgagagaaaa 1981 gggcatttac ttcaaacttg tcatgacaca gacagcagga aatgaaattg aattaggaaa 2041 tgaagcttgt aaatctaagg atgaaattga taatttagac atgtcttcaa aagattcagg 2101 atccagtcta ataagaagaa gatcaactcg caaaagcatc tgtggaccac atgaccaaga 2161 caggaagctt agtaccaaag aggccctgga tgaagatgta cctccagctt ccttttggcg 2221 gatcctgaag ttgaattcaa ctgaatggcc ttattttgtg gttggtatat tctgtgccat 2281 aataaatgga ggcttacagc cagcattctc cgtaatattt tcaaaagttg taggggtttt 2341 tacaaatggt ggcccccctg aaacccagcg gcagaacagc aacttgtttt ccttgttgtt 2401 tctgatcctt gggatcattt ctttcattac attttttctt cagggcttca catttggcaa 2461 agctggagag atcctcacca agcgactccg atacatggtt ttcaaatcca tgctgagaca 2521 ggatgtgagc tggtttgatg accctaaaaa caccaccgga gcactgacca ccaggctcgc 2581 caacgatgct gctcaagtga aaggggctac agggtctagg cttgctgtga ttttccagaa 2641 catagcaaat cttgggacag gaatcatcat atccctaatc tatggctggc aactaacact 2701 tttactctta gcaattgtac ccatcattgc gatagctgga gtggttgaaa tgaaaatgtt 2761 gtctggacaa gcactgaaag ataagaagga actagaaggt tctggaaaga ttgctacgga 2821 agcaattgaa aacttccgca ctgttgtctc tttgactcgg gagcagaagt ttgaaaccat 2881 gtatgcccag agcttgcaga taccatacag aaatgcgatg aagaaagcac acgtgtttgg 2941 gatcacgttc tccttcaccc aggccatgat gtatttttct tatgctgctt gtttccggtt 3001 cggtgcctac ttggtgacac aacaactcat gacttttgaa aatgttctgt tagtattctc 3061 agctattgtc tttggtgcca tggcagtggg gcaggtcagt tcattcgctc ctgactatgc 3121 gaaagcaaca gtgtcagcat cccacatcat caggatcatt gagaaaaccc ccgagattga 3181 cagctacagc acgcaaggcc taaagccgaa tatgttggaa ggaaatgtgc aatttagtgg 3241 agtcgtgttc aactatccca cccgacccag catcccagtg cttcaggggc tgagccttga 3301 ggtgaagaag ggccagacgc tggccctggt gggcagcagt ggctgcggga agagcacagt 3361 ggtccagctg ctcgagcgct tctacgaccc catggctgga tcagtgtttc tagatggcaa 3421 agaaataaag caactgaatg tccagtggct ccgagcacag ctgggcattg tgtcccaaga 3481 gcccattctc tttgactgca gcatcgcaga gaacattgcc tacggagaca acagccgggt 3541 cgtgtcttat gaggagattg tgagggcagc caaggaggcc aacatccacc agttcatcga 3601 ctcgctacct gataaataca acaccagagt aggagacaaa ggcactcagc tgtcgggtgg 3661 gcagaagcag cgcatcgcca tcgcacgcgc cctcgtcaga cagcctcaca ttttacttct 3721 ggacgaagca acatcagctc tggatacaga aagtgaaaag gttgtccagg aagcgctgga 3781 caaagccagg gaaggccgca cctgcattgt gatcgctcac cgcctgtcca ccatccagaa 3841 cgcggacttg atcgtggtga ttcagaacgg caaggtcaag gagcacggca cccaccagca 3901 gctgctggcg cagaagggca tctacttctc aatggtcagt gtgcaggctg gagcaaagcg 3961 ctcatgaact gtgaccatgt aagatgttaa gtatttttat tgtttgtatt catatatggt 4021 gtttaatcca agtcaaaagg aaaacactta ctaaaatagc cagttatcta ttttctgcca 4081 cagtggaaag catttagttt ggtttagagt cttcagaggc tttgtaatta aaaaaacaaa 4141 aatagataca gcatcaaatg gagattaatg ctttaaaatg cactataaaa tttataaaag 4201 ggttaaaagt gaatgtttga taatatatac ttttatttat actttctcat ttgtaactat 4261 aactgatttc tgcttaacaa attatgtatg tatcaaaaat tactgaaatg tttgtataaa 4321 gtatatatag tgaaactgag cattcatatt tttgagttat tttgctcaaa tgcatgcgaa 4381 attatatatt gtcccaactg ggatattgta cataatttta gcctttaaaa aacagtccat 4441 tactgggggg agggggcatc actctatggg caaagtgtta ctcagacatg ggcacctgag 4501 ttcagatccc taccacctaa gtaagcagac aaggtgtggt gtttttgtaa tgccagtgct 4561 agaggcagaa aagacagatc ctgcaggctc agtggctggc caaacagcct agccaacata 4621 gcgcgttcca ggttcagtga gaaaacttgt ctcaaaaatc agagggaaaa gcaaatgagg 4681 tgtcagccat gtgcactcat gcaaatgcca tacatgcaga agtatgtgca cacacacgca 4741 cacattaacc aacgactagc aaggaaaatg aaggtggata agaggggtgg gactgggaca 4801 aaggagggta cctggatgaa tatgactgaa ggacgttatg tacacatatg aaaacgtcgt 4861 actgaaactc actacaatgt atacttaata tattgctaat aaaatatttt taaaagaaaa 4921 aaat // LOCUS MUSMDRXX 2873 bp ds-DNA ROD 28-JUL-1990 DEFINITION Mouse P-glycoprotein (mdr1a) gene, exons 1 and 2. ACCESSION M33580 KEYWORDS P-glycoprotein. SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line J774.2-vinblastine resistant subline J7.V1-1 DNA, clone pV1.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2873) AUTHORS Hsu,S., Cohen,D., Lothstein,L., Kirschner,L.S., Hartstein,M. and Horwitz,S.B. TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter reveals the basis for differential transcript heterogeneity in multidrug-resistant J774.2 cells JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.S.Kirschner, 05-APR-1990. Albert Einstein C., 1300 Morris Pk, Bronx, 10461. E-mail: kirschne@aecom.yu.edu. FEATURES from to/span description pre-msg 1992 > 2873 P-glycoprotein mRNA and introns IVS 2120 2606 P-glycoprotein intron A IVS 2678 > 2873 P -glycoprotein intron B signal 1904 1912 CAAT box signal 1956 1963 TATA box site 1880 1887 SP-1 site site 1921 1927 SP-1 site site 1937 1944 SP-1 site site 1869 1875 AP-1 site rpt 1 1300 L1Md repetitive element BASE COUNT 860 a 621 c 714 g 678 t ORIGIN Chromosome 5. 1 gaattctcac ctgaggaata ccgaatccag agaaacacct gaaaaaatgt tcaacatcct 61 taatcatcag ggaaatgcaa atcaaaacaa ccctgagatt ccacctcaca ccagtcagaa 121 tggctaagat caaaaattca ggtgacagca gatgctggcg aggatgtgga gaaagaggaa 181 cactcctcca ttgttggtgg gagtgcaggc ttgtacaacc actctggaaa tcagtctggc 241 ggttcctcag aaaactggac atagtactct cggaggatcc agcaatacct ctcctgggca 301 tatatccaga agatgcccca acaggtaaga aggacacatg ctccactatg ttcatagcag 361 ccttatttat aatagccaga agctggaaag aacctagatg cccctcaaca gaggaatgga 421 tacagaaaat gtggtacatc tacacaatgg agtactactc agctattaaa aagaatgaat 481 ttatgaaatt cctagccaaa tggatggacc tggggggcat catcctgagt gaggtaacac 541 attcacaaag aaactcacac aatatgtatt cactgataag tggatattag ccccaaacct 601 aggataccca agatataaga tataatttgc taaacacatg aaactcaagg agaatgaaga 661 ctgaagtgtg gacactatgc ccctccttag atttgggaac aaaacaccca tggaaggagt 721 tacagagacg gagtttggag ctgagatgaa aggatggacc atgtagagac tgccatagcc 781 agggatccac cccataatca gcatccaaac gctgacacca ttgcatacac tagcaagatt 841 ttattgaaag gacgcagatg tagctgtctc ttgtgagact atgccggggc cagcaaacac 901 agaagtggat gctcacagtc agctaatgga tggatcatag ggctcccaat ggaggagcta 961 gagaaagtag ccaaggagct aaagggatct gcaaccctat aggtggaaca acattatgag 1021 ctaaccagta ccccggagct cttgactcta gctgcatata tatcaaaaga tggcctagtc 1081 ggccatcact ggaaagagag gcccattgga cttgcaaact ttatatgccc cagtacaggg 1141 gaataccagg gccaaaaagg gggagtgggt gggcagggga gtgggggtgg gtggatatgg 1201 gggacttttg gtatagcatt ggaaatgtaa atgagttaaa tacctaataa aaaatggaaa 1261 aaaaaataaa ataaaaataa gatgaaactg gaaaaaaaaa gttatgttta ataattccaa 1321 ttgaactgta agaatttcag atgccctgga aaaacatgga cattggttta gtacctaaaa 1381 gttcaaaata ttatatattt ttaaatacca ttttacactg aaatactcca tttatatact 1441 ggggactgtc ctctttctgg tttgctttgt tttgtttaat aaaagaaata aaccaatcta 1501 cctgaggaac tgtgaactat attgaagaaa agcctgcacg ggggttctct taccttttca 1561 agagtgcttc aaagaaggga aatttactga caggcaaggt ctgtacccat tgtttaattg 1621 tctgttagat gttatgcata gaatacgtct tttaacttag ccaaatgcag aaggccaagt 1681 gcactatcta caaacacata actctatata tagacatgtg catggccgtg tagagatgag 1741 actctgcaag tgtgtctcta atgattcggg ggatatgagt ttgtctaatt gacctttgag 1801 agggaaacca gactgcacat ttcatctaca aatccaacct gtttcgcaat ttctccagca 1861 ataatacttg agtcaagctg ggccgggagc tggttaacct ccaggtcaaa ctcactggct 1921 gggcgggact gcgcctgggc gtagattgag catgctaaat ttactctcct gtccacagaa 1981 agcccaggca cagtggaaca gcggtttcca ggagctgctg gtcccatctt ccaaggctct 2041 gctcaactca gagccgcttc ttccaaagtc tacatcttgg tggactttgc agaggaaacc 2101 gggagtagag acacgtgagg taagcatttc ctaggaaggg tcgggtgttc cggataccag 2161 agcctggtcc gggtgtcagc gtaatcgtga gtctgtgggg accaagtggc gacacaagag 2221 tcgctccagg agcacccgca gcatcagctt tcaggacggt gttttccgcg ccaccctgtg 2281 ctgtggatct cgctgcccag ctcgcagcca ggggtggtgg aggagcgcgc cagggcgagg 2341 ggacccagca ggcgggtggc ggacctagag ccgagcaccc ggtccacgca ggtgacacag 2401 cttcccggga ttccccagtg agttacctcc aggccctctc cggcagcatc agggcggggc 2461 tcctcctcac cactgggctc tgcggggcag tgagctttgc ataaactctg gtcccgtgtt 2521 tggctaatga actgtggttt ctccccaggt cgtgatggaa cttgaagagg accttaaggg 2581 aagagcagac aagaacttct ccccaggtcg tgatggaact tgaagaggac cttaagggaa 2641 gagcagacaa gaacttctca aagatgggca aaaagaggta gccagattgt ttcactttcg 2701 tactttactt gtcttgtaca ttcgggcaat tagtttgtag cctccagcac tgtacttgat 2761 tagtgggtgt tatttcagac ttcagaaatg taaaccagcc cttggaagga actcctcgct 2821 tggagcagtc cttcaaatgt gtgtgacaga tcaatcaatg attctgtgaa ttc // LOCUS HUMKSAA 1504 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human adenocarcinoma-associated antigen (KSA) mRNA, complete cds. ACCESSION M32325 KEYWORDS adenocarcinoma-associated antigen. SOURCE Human cell line UCLA-P3, cDNA to mRNA, clone AG[1,1338,933]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1504) AUTHORS Strnad,J., Hamilton,A.E., Beavers,L.S., Gamboa,G.C., Apelgren,L.D., Taber,L.D., Sportsman,J.R., Bumol,T.F., Sharp,J.D. and Gadski,R.A. TITLE Molecular cloning and characterization of a human adenocarcinoma/epithelial cell surface antigen complementary DNA JOURNAL Cancer Res. 49, 314-317 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D. Sharp, 22-FEB-1990. There are a few base differences between the sequence presented here and that which appears in entry M26481. The difference occurs mostly in the Poly-A signal. FEATURES from to/span description pept 155 1099 adenocarcinoma-associated antigen precursor (KSA) sigp 155 223 adenocarcinoma-associated antigen signal peptide matp 398 1096 adenocarcinoma-associated antigen mRNA < 1 1504 adenocarcinoma-associated antigen mRNA site 1486 1491 polyadenylation site BASE COUNT 442 a 302 c 356 g 404 t ORIGIN 1 gagcgagcac cttcgacgcg gtccggggac cccctcgtcg ctgtcctccc gacgcggacc 61 cgcgtgcccc aggcctcgcg ctgcccggcc ggctcctcgt gtcccactcc cggcgcacgc 121 cctcccgcgc ccctcttctc ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt 181 cgggcttctg cttgccgcgg cgacggcgac ttttgccgca gctcaggaag aatgtgtctg 241 tgaaaactac aagctggccg taaactgctt tgtgaataat aatcgtcaat gccagtgtac 301 ttcagttggt gcacaaaata ctgtcatttg ctcaaagctg gctgccaaat gtttggtgat 361 gaaggcagaa atgaatggct caaaacttgg gagaagagca aaacctgaag gggccctcca 421 gaacaatgat gggctttatg atcctgactg cgatgagagc gggctcttta aggccaagca 481 gtgcaacggc acctccacgt gctggtgtgt gaacactgct ggggtcagaa gaacagacaa 541 ggacactgaa ataacctgct ctgagcgagt gagaacctac tggatcatca ttgaactaaa 601 acacaaagca agagaaaaac cttatgatag taaaagtttg cggactgcac ttcagaagga 661 gatcacaacg cgttatcaac tggatccaaa atttatcacg agtattttgt atgagaataa 721 tgttatcact attgatctgg ttcaaaattc ttctcaaaaa actcagaatg atgtggacat 781 agctgatgtg gcttattatt ttgaaaaaga tgttaaaggt gaatccttgt ttcattctaa 841 gaaaatggac ctgacagtaa atggggaaca actggatctg gatcctggtc aaactttaat 901 ttattatgtt gatgaaaaag cacctgaatt ctcaatgcag ggtctaaaag ctggtgttat 961 tgctgttatt gtggttgtgg tgatggcagt tgttgctgga attgttgtgc tggttatttc 1021 cagaaagaag agaatggcaa agtatgagaa ggctgagata aaggagatgg gtgagatgca 1081 tagggaactc aatgcataac tatataattt gaagattata gaagaaggga aatagcaaat 1141 ggacacaaat tacaaatgtg tgtgcgtggg acgaagacat ctttgaaggt catgagtttg 1201 ttagtttaac atcatatatt tgtaatagtg aaacctgtac tcaaaatata agcagcttga 1261 aactggcttt accaatcttg aaatttgacc acaagtgtct tatatatgca gatctaatgt 1321 aaaatccaga acttggactc catcgttaaa attatttatg tgtaacattc aaatgtgtgc 1381 attaaatatg cttccacagt aaaatctgaa aaactgattt gtgattgaaa gctgcctttc 1441 tatttacttg agtcttgtac atacatactt ttttatgagc tatgaaataa aacattttaa 1501 actg // LOCUS HUMMHDNDRW 1066 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human MHC class II DN alpha mRNA, complete cds. ACCESSION M26039 M27046 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (haplotype DRw8,Dw8.2/DRw8,Dw8.2) cell line SPL, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1066) AUTHORS Jonsson,A.-K. and Rask,L. TITLE Human class II DNA and DOB genes display low sequence variability JOURNAL Immunogenetics 29, 411-413 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.-K.Jonsson, 07-JUL-1989. FEATURES from to/span description pept 26 778 MHC DN alpha protein precursor /nomgen="HLA-DNA" /map="6p21.3" /hgml_locus_uid="LV0117X" sigp 26 106 MHC DN alpha protein signal peptide matp 107 775 MHC DN alpha protein mRNA < 1 1066 MHCDNA-a mRNA BASE COUNT 199 a 337 c 283 g 247 t ORIGIN Chromosome 6p21.3. 1 catttgatta aagcaccaga gtgtaatggc cctcagagca gggctggtcc tggggttcca 61 caccctgatg accctcctga gcccgcagga ggcaggggcc accaaggctg accacatggg 121 ctcctacgga cccgccttct accagtctta cggcgcctcg ggccagttca cccatgaatt 181 tgatgaggaa cagctgttct ctgtggacct gaagaaaagc gaggccgtgt ggcgtctgcc 241 tgagtttggt gactttgccc gctttgaccc gcagggcggg ctggccggca tcgccgcaat 301 caaagcccat ctggacatcc tggtggagcg ctccaaccgc agcagagcca tcaacgtgcc 361 tccacgggtg accgtgctcc ccaagtctcg ggtggagctg ggccagccca acatcctcat 421 ctgcatcgtg gacaacatct tcccccctgt gatcaatatc acctggctgc gcaacggcca 481 aactgtcact gagggagtgg cccagaccag cttctattcc cagcctgacc atttgttccg 541 caagttccac tacctgccct tcgtgccctc agccgaggac gtctatgact gccaggtgga 601 gcactggggc ctggatgcgc cactcctcag gcattgggag ctccaggtgc ctattccacc 661 accagatgcc atggagaccc tggtctgtgc cctgggcctg gccatcggcc tggtgggctt 721 cctcgtgggc accgtcctca tcatcatggg cacatatgtg tccagtgtcc ccaggtaatg 781 atccttctga gagaaatgac ttgtgggaga caccctgcag atcctcatgg gtttgtgaca 841 gaccctgcgt gctcagtgcc ctttaagtgc atcccgctgt gctgactttg agtgggatca 901 acatctgtcc tacgggtccc ctcttttttg gccccagtat tcatggcagg gtttgttgga 961 cacctactag cttcccttcc cattcaacac acacacacat tcttgctcta cccaaagctc 1021 tggctggcag cactaaatgc tttggtggtg tttgcactgt gtcctt // LOCUS HUMMHDOBDR 1293 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human MHC class II DO beta mRNA, complete cds. ACCESSION M26040 M27047 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (haplotype DRw8,Dw8.2/DRw8,Dw8.2) cell line SPL, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1293) AUTHORS Jonsson,A.-K. and Rask,L. TITLE Human class II DNA and DOB genes display low sequence variability JOURNAL Immunogenetics 29, 411-413 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.-K.Jonsson, 07-JUL-1989. FEATURES from to/span description pept 36 857 MHC DO beta protein precursor /nomgen="HLA-DOB" /map="6p21.3" /hgml_locus_uid="LM0050S" sigp 36 125 MHC DO beta protein signal peptide matp 126 854 MHC DO beta protein BASE COUNT 293 a 315 c 330 g 355 t ORIGIN Chromosome 6p21.3. 1 cgattttact gtctcatttt tttcctttct ccagaatggg ttctgggtgg gtcccctggg 61 tggtggctct gctagtgaat ctgacccgac tggattcctc catgactcaa ggcacagact 121 ctccagaaga ttttgtgatt caggcaaagg ctgactgtta cttcaccaac gggacagaaa 181 aggtgcagtt tgtggtcaga ttcatcttta acttggagga gtatgtacgt ttcgacagtg 241 atgtggggat gtttgtggca ttgaccaagc tggggcagcc agatgctgag cagtggaaca 301 gccggctgga tctcttggag aggagcagac aggccgtgga tggggtctgt agacacaact 361 acaggctggg cgcacccttc actgtgggga gaaaagtgca accagaggtg acagtgtacc 421 cagagaggac cccactcctg caccagcata atctgctgca ctgctctgtg acaggcttct 481 atccagggga tatcaagatc aagtggttcc tgaatgggca ggaggagaga gctggggtca 541 tgtccactgg ccctatcagg aatggagact ggacctttca gactgtggtg atgctagaaa 601 tgactcctga acttggacat gtctacacct gccttgtcga tcactccagc ctgctgagcc 661 ctgtttctgt ggagtggaga gctcagtctg aatattcttg gagaaagatg ctgagtggca 721 ttgcagcctt cctacttggg ctaatcttcc ttctggtggg aatcgtcatc cagctaaggg 781 ctcagaaagg atatgtgagg acgcagatgt ctggtaatga ggtctcaaga gctgttctgc 841 tccctcagtc atgctaaggt cctcactgaa gcttctctct ctggagcctg aagtagtgat 901 gagtagtctg ggccctgggt gaggtaaagg acattcatga ggtcaatgtt ctgggaataa 961 ctctcttccc tgatccttgg aggagcccga actgattctg gagctctgtg ttctgagatc 1021 atgcatctcc cacccatctg cccttctccc ttctacgtgt acatcattaa tccccattgc 1081 caagggcatt gtccagaaac tcccctgaga ccttactcct tccagcccca aatcatttac 1141 ttttctgtgg tccagcccta ctcctataag tcatgatctc caaagctttc tgtcttccaa 1201 ctgcagtctc cacagtcttc agaagacaaa tgctcaggta gtcactgttt ccttttcact 1261 gtttttaaaa accttttatt gtcaaataaa atg // LOCUS TRPFLAA 966 bp ds-DNA BCT 28-JUL-1990 DEFINITION T.pallidum endoflagellar sheath protein (flaA) gene, 3' end. ACCESSION M26525 KEYWORDS endoflagellar sheath protein. SOURCE T.pallidum (strain Nichols) DNA. ORGANISM Treponema pallidum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Spirochetes; Spirochaetales; Spirochaetaceae. REFERENCE 1 (bases 1 to 966) AUTHORS Isaacs,R.D., Hanke,J.H., Guzman-Verduzco,L.-M., Newport,G., Agabian,N., Norgard,M.V., Lukehart,S.A. and Radolf,J.D. TITLE Molecular cloning and DNA sequence analysis of the 37-kilodalton endoflagellar sheath protein of Treponema pallidum JOURNAL Infect. Immun. 57, 3403-3411 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.D.Radolf, 26-JUL-1989. FEATURES from to/span description pept < 1 966 endoflagellar sheath protein (AA at 1) BASE COUNT 199 a 196 c 325 g 246 t ORIGIN Unreported 1 aagctgaacg ctgatatcat ggcggataag agtggaggta tgacgcataa tcggcgtacc 61 gttctggact atgcttctct ggcggatacc tcgtacactg acgagcagaa ggcattgatg 121 agatcttctc ttgcggttgc acagtgggag gttgtgctga attcttccgc gcgtaatcct 181 gtcgcccatg ctgcctctcg cgttattgag gctccggtaa gtgagggagc gaagagtttt 241 gctggtgagc gtgtccttgg tgtgcgcgtg ttgttcccca cgtgggacag taacgcaaac 301 gcaatgataa agccggcgtt cgtaattcct gcgtacgagg tgatggctca ggtggacgat 361 cagggtaatg tacaggcccc cacagaggag gagaaggctt ctggaaaggg gcgttttgaa 421 gatgggtacg gagtggtaaa gaatgtgggt gttcttaagt ccatcgcggt gaacacttac 481 gggatgaatt atcctcatgg tttgtacgtg atgatgcggg atcaggatgg tgaggtgcat 541 cgctacttca tggggtatct cctgttcgac tcctggaagg agttggtgtg gaacaatcct 601 tcgtatatct ctgatgttcg gtcgcgggag gtgcgcttgt atcccgtgta tcccgcgtcg 661 acgccccacg tcgtgtttga aggctttatg gttactaggg acgcggctca tgccggaggg 721 gactatgttg gttatttcaa ggacgtcaag attatctatg ataaggcggt gctgagtacg 781 gtgcgcgatt ttgcggacga ggacctgtgg ggtatccagg cgcggcgtga ggctgagcgt 841 aagagagttg aggttgcgcg tttcgggcag cagcaggtgc tgcgttatat agagcaagag 901 aagcttgcta cagaggttgg ttttacaccc tctgggggtg ctcagcggca ggaagagcag 961 cagtag // LOCUS DROMPP1 3376 bp ds-DNA INV 28-JUL-1990 DEFINITION D.melanogaster membrane protein (patched) gene complete cds. ACCESSION M28418 KEYWORDS transmembrane protein. SEGMENT 1 of 2 SOURCE D.melanogaster (embryo), DNA and cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 3376) AUTHORS Hooper,J.E. and Scott,M.P. TITLE The Drosophila patched gene encodes a putative membrane protein required for segmental patterning JOURNAL Cell 59, 751-765 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.E.Hooper, 27-SEP-1989. FEATURES from to/span description pept 3161 + 3289 membrane protein exon 1 pre-msg 2389 > 3376 MPR mRNA and intron IVS 3290 > 3376 MPR intron A BASE COUNT 1059 a 731 c 764 g 822 t ORIGIN Chromosome 2, map position 44D-E. 1 gatcctgaat tgagaaatat agattgaaac agaattcatt accatttaag caatcattat 61 ttatgggggc gtaatgcgcc tccgagtagg caatgctttt cttgacattg ttactaagaa 121 ttgtgaatga tatttgggcg tggatcaacg ccgattaaaa gctgcttttg cttccaggcg 181 gccagagaag agatccaaac ttcaactcca gccataaaag caacaacatt tccgtctccc 241 ccttgtagct ccccttcctc cggctcttcc actctccacg aaacggcaaa tgaagctctc 301 aaagcgaact gtgcttcgct ggtggtccat tggcagctgc cgccacacag gcgctgcttt 361 tgtgtgtgtg tgtaatatca atcttgctct ccctctcttt ttatctctct tgggaattgg 421 agctgcatgc gaattgagcg acagcaaaac gaactgcaag tcattgagag gagagcaaaa 481 actcgagcgc aagccaaaga tagcgcaatc tggggagagc gaaataaagc taaaatatgc 541 atgttggaga aaaaatgccg cccatgtcgc caaaatgcgc cacacgcaga gtgagcgggc 601 ggaggtggga gtaatggaaa gggcgatgag ggaacgatta gcttgaagag agagaacaac 661 aaatgaatgt gctgcaacgt tagttcaggt gagcgcgtta gagagagagt tgttgttttt 721 tgattgtaat agctcgcttg gtggtgggtc cacattcaca tctccctctc ccactctttc 781 tccccgaaag agagagcggg agcgaagggg cacgagggga gcacgatgac tatgcagttg 841 cattcaattt gaatttccat ggtgctgatg attcgagcgc caattttttc gaagagttct 901 tatttgttta cttcgttgtt gttgcctcaa ttggaaaggg aaaatgtgga atgcggagaa 961 acaccagaag caaatgcatt tccattcata aatccaaaga agttttaaag ataacatgtc 1021 atttggctta agttcgtggt gcacaaaaaa gatcggtttg cggttgtcgc atgaaatgag 1081 tttattccat tggtatatta ttattcagaa attaaaaaaa aacttgttta gtctattttt 1141 tttttttaaa taaaaaaaaa aaattctttt ataagtcgat tttagagtaa atatttaaag 1201 actacgtcta ataaacatat aatttgttct gtgttttaat ttgccggcaa aaacaaacct 1261 acttgtgtgg tcctcgcaca ctcataaccc ctcgcatatt tgagattcat ggggcaagag 1321 gctgcaaaaa caatggaaag ggaaaagcag aaacatcctg ccgctcataa tttagcatcg 1381 gaacatgcaa aaacagacat catcgcatgg ggcagcagca acagccataa aaccaacacg 1441 agcaatgtaa agctaacaaa tttgccaaca gttcgcggca cggctacaca cacacacatg 1501 catgcgcagc ctgccacgca cgcgcttccc ccaaacaaat acacacacac acactgagac 1561 gaaagctcca ttgggcagcg ctgccgacgc tgaaggccga catcggcaga gctgaacgtt 1621 tgggtagggg accacccaca tcgcttggcg gtttcagttt aatgaaggca gaaacaaatt 1681 tatttttggg tggtccacac tgcagcgaaa ataaactaca gtggcaacaa caaaccagca 1741 gccaaggcac tttgggtggt ccatgcaaaa aaaaaacaaa ttacggcatg cgaataacaa 1801 tagaaattag cgctctcgtg gcggagctat ttgggtatat tagagctaca tattttattt 1861 gtttataaaa agtataaatg taaacaatga gttccaagca ttaagtccgt atgctcaaca 1921 attacattat cattattatt atcacttaaa tatttacaaa ggatatttaa acagtaatag 1981 atatatattt tatttcttaa tttctgttaa catatgtatt tacattggta gttattcttt 2041 attttgcaac aagcattcat aaattttata taacaaactt ggtattttct cggaaaaact 2101 cctgaatcac ccctcggtat tttgtgcgtt gagctatcgt taaagcagcc ctcgcagaga 2161 gcgttctcaa accaaaatgg ccgcacacga aacaagagag cgagtgagag tagggagagc 2221 gtctgtgttg tgtgttgagt gtcgcccacg cacacaggcg caaaacagtg cacacagacg 2281 cccgctgggc aagagagagt gagagagaga aacagcggcg cgcgctcgcc taatgaagtt 2341 gttggcctgg ctggcgtgcc gcatccacga gatacagata catctctcag actgcgtgcg 2401 atcctcgaac gaaacggttg taagtgcgga gcgcgacgac ttgttattcg tatttccgac 2461 tactggcact ctctgtgtgt ggtatactaa caagatagat atcacagaac tcgtggaaaa 2521 gctaagatat tgtacctcac ggatgcgagg cgaagttcat ggattaaatg ccaggcaaca 2581 acaaaagcca gccaaccagc cagtgtttgt gtgtgtgcgt cgccaagtgc aaagtaaagt 2641 aaaggtaaaa gagcgaaagg cgagagagaa aaccgaatac gtgagtcgtc cgactgccgc 2701 ttttccatgt gtaaaagatc tgtgaaaatt ctgtcaaatt cccctgagaa attgtgccca 2761 agataaaacc cgaaaaccgc gttttaatcg tcgaaaaaac ccagcaaaag cgaagccagc 2821 aatcacaaca aaacaacata acgagagctc agatacacag cgtgctcagt gagtgagcga 2881 gagagcgcgg gagagagcgt ctcttgattt aaaatacaaa ataattaaaa ataaaaatgc 2941 ggaatgcagt gcaaaatgca gccaaacaaa atacgagatt ccaataacaa ttaatcgaac 3001 cgaaagtcca cgaacaatcc gcacactgtc tcccaagtct cagttctcag gacgcagacg 3061 aacggcaggc actgtagaaa gaccgattcc gcagcacact cccatctgca catctccgcc 3121 acgcgattcc gtccggaatc tggctataaa cataaccata atggaccgcg acagcctccc 3181 acgcgttccg gacacacacg gcgatgtggt cgatgagaaa ttattctcgg atctttacat 3241 acgcaccagc tgggtggacg cccaagtggc gctcgatcag atagataagg tgagtgccca 3301 actacagtga actttcactg tgaaggatag ccatgtgttg aattcaataa tattcttgat 3361 cgtattcgga ggatcc // LOCUS DROMPP2 5665 bp ds-DNA INV 28-JUL-1990 DEFINITION D.melanogaster membrane protein (patched) gene, complete cds. ACCESSION M28999 KEYWORDS transmembrane protein. SEGMENT 2 of 2 SOURCE D.melanogaster (embryo), DNA and cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 5665) AUTHORS Hooper,J.E. and Scott,M.P. TITLE The Drosophila patched gene encodes a putative membrane protein required for segmental patterning JOURNAL Cell 59, 751-765 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.E.Hooper, 27-SEP-1989. Mak called J.E.Hooper and requested copy of cds be sent showing introns and exons, 9-OCT-1989. Copy was received and corrections made, 18-OCT-1989. FEATURES from to/span description pept + 95 648 membrane protein exon 2 986 2912 membrane protein exon 3 3051 3258 membrane protein exon 4 3322 3593 membrane protein exon 5 3789 4559 membrane protein exon 6 pre-msg < 1 > 5448 MPR mRNA and introns IVS < 1 94 MPR intron A IVS 649 985 MPR intron B IVS 2913 3050 MPR intron C IVS 3259 3321 MPR intron D IVS 3594 3788 MPR intron E BASE COUNT 1397 a 1537 c 1350 g 1381 t ORIGIN About 9.2kb after segment 1, Chromosome 2, map position 44D-E. 1 aattttaatg cgtattttat ggcagtggag caaggcgggg gaatctaaaa aaaaaactaa 61 acgctaaatt ccgtattttt gttgcatttt tcagggcaaa gcgcgtggca gccgcacggc 121 gatctatctg cgatcagtat tccagtccca cctcgaaacc ctcggcagct ccgtgcaaaa 181 gcacgcgggc aaggtgctat tcgtggctat cctggtgctg agcaccttct gcgtcggcct 241 gaagagcgcc cagatccact ccaaggtgca ccagctgtgg atccaggagg gcggccggct 301 ggaggcggaa ctggcctaca cacagaagac gatcggcgag gacgagtcgg ccacgcatca 361 gctgctcatt cagacgaccc acgacccgaa cgcctccgtc ctgcatccgc aggcgctgct 421 tgcccacctg gaggtcctgg tcaaggccac cgccgtcaag gtgcacctct acgacaccga 481 atgggggctg cgcgacatgt gcaacatgcc gagcacgccc tccttcgagg gcatctacta 541 catcgagcag atcctgcgcc acctcattcc gtgctcgatc atcacgccgc tggactgttt 601 ctgggaggga agccagctgt tgggtccgga atcagcggtc gttataccgt aagtagttaa 661 tatgtagtta atagccacat cttatagatt ctaaagtgaa cgtatccctt atgaccatat 721 ccttttgcat gatctacttt aacccacagt acttctctat tcatattaag gaattaataa 781 agtacttact ttgcgcttac ctttattaaa tacgatagct tatctttata aacttgctat 841 caagtcgaaa gataaacgtg acaagagtat ctttgtactt atcccagttg cttaccatcg 901 taaataatct tcttattaat aaatattcgt aaataaatat tcttaactca acaaatccat 961 ctttattatt gttactcctc tacagaggcc tcaaccaacg actcctgtgg accaccctga 1021 atcccgcctc tgtgatgcag tatatgaaac aaaagatgtc cgaggaaaag atcagcttcg 1081 acttcgagac cgtggagcag tacatgaagc gtgcggccat tggcagtggc tacatggaga 1141 agccctgcct gaacccactg aatcccaatt gcccggacac ggcaccgaac aagaacagca 1201 cccagccgcc ggatgtggga gccatcctgt ccggaggctg ctacggttat gccgcgaagc 1261 acatgcactg gccggaggag ctgattgtgg gcggacggaa gaggaaccgc agcggacact 1321 tgaggaaggc ccaggccctg cagtcggtgg tgcagctgat gaccgagaag gaaatgtacg 1381 accagtggca ggacaactac aaggtgcacc atcttggatg gacgcaggag aaggcagcgg 1441 aggttttgaa cgcctggcag cgcaactttt cgcgggaggt ggaacagctg ctacgtaaac 1501 agtcgagaat tgccaccaac tacgatatct acgtgttcag ctcggctgca ctggatgaca 1561 tcctggccaa gttctcccat cccagcgcct tgtccattgt catcggcgtg gccgtcaccg 1621 ttttgtatgc cttttgcacg ctcctccgct ggagggaccc cgtccgtggc cagagcagtg 1681 tgggcgtggc cggagttctg ctcatgtgct tcagtaccgc cgccggattg ggattgtcag 1741 ccctgctcgg tatcgttttc aatgcgctga ccgctgccta tgcggagagc aatcggcggg 1801 agcagaccaa gctgattctc aagaacgcca gcacccaggt ggttccgttt ttggcccttg 1861 gtctgggcgt cgatcacatc ttcatagtgg gaccgagcat cctgttcagt gcctgcagca 1921 ccgcaggatc cttctttgcg gccgccttta ttccggtgcc ggctttgaag gtattctgtc 1981 tgcaggctgc catcgtaatg tgctccaatt tggcagcggc tctattggtt tttccggcca 2041 tgatttcgtt ggatctacgg agacgtaccg ccggcagggc ggacatcttc tgctgctgtt 2101 ttccggtgtg gaaggaacag ccgaaggtgg cacctccggt gctgccgctg aacaacaaca 2161 acgggcgcgg ggcccggcat ccgaagagct gcaacaacaa cagggtgccg ctgcccgccc 2221 agaatcctct gctggaacag agggcagaca tccctgggag cagtcactca ctggcgtcct 2281 tctccctggc aaccttcgcc tttcagcact acactccctt cctcatgcgc agctgggtga 2341 agttcctgac cgttatgggt ttcctggcgg ccctcatatc cagcttgtat gcctccacgc 2401 gccttcagga tggcctggac attattgatc tggtgcccaa ggacagcaac gagcacaagt 2461 tcctggatgc tcaaactcgg ctctttggct tctacagcat gtatgcggtt acccagggca 2521 actttgaata tcccacccag cagcagttgc tcagggacta ccatgattcc tttgtgcggg 2581 tgccacatgt gatcaagaat gataacggtg gactgccgga cttctggctg ctgctcttca 2641 gcgagtggct gggtaatctg caaaagatat tcgacgagga ataccgcgac ggacggctga 2701 ccaaggagtg ctggttccca aacgccagca gcgatgccat cctggcctac aagctaatcg 2761 tgcaaaccgg ccatgtggac aaccccgtgg acaaggaact ggtgctcacc aatcgcctgg 2821 tcaacagcga tggcatcatc aaccaacgcg ccttctacaa ctatctgtcg gcatgggcca 2881 ccaacgacgt cttcgcctac ggagcttctc aggtgggtct tcttattaaa ttaaattaaa 2941 ttaaattaaa ttagatcgcc ttagttctcc tcatatgtac atacatatta taacttatcg 3001 cactccaaag ttaaagatta ctaaatgtgt gtgtatcttt attcttacag ggcaaattgt 3061 atccggaacc gcgccagtat tttcaccaac ccaacgagta cgatcttaag atacccaaga 3121 gtctgccatt ggtctacgct cagatgccct tttacctcca cggactaaca gatacctcgc 3181 agatcaagac cctgataggt catattcgcg acctgagcgt caagtacgag ggcttcggcc 3241 tgcccaacta tccatcgggt gagtcggaaa tgagtacttc atacatgggg cccaactaac 3301 agtcgattta tttatcgcca ggcattccct tcatcttctg ggagcagtac atgaccctgc 3361 gctcctcact ggccatgatc ctggcctgcg tgctactcgc cgccctggtg ctggtctccc 3421 tgctcctgct ctccgtttgg gccgccgttc tcgtgatcct cagcgttctg gcctcgctgg 3481 cccagatctt tggggccatg actctgctgg gcatcaaact ctcggccatt ccggcagtca 3541 tactcatcct cagcgtgggc atgatgctgt gcttcaatgt gctgatatca ctggtgagtc 3601 ttcatttctg gctggaccat taagagcttc ggagtgagtc ttcatttctg gctggaccat 3661 taagagcttc ggagtgagtc ttcatttctg gctggaccat taagagcttc ggattttcca 3721 gagatatccc aagacttttc attggatcct cttcagcaca cattaattgc ttatctttcc 3781 gattctaggg cttcatgaca tccgttggca accgacagcg ccgcgtccag ctgagcatgc 3841 agatgtccct gggaccactt gtccacggca tgctgacctc cggagtggcc gtgttcatgc 3901 tctccacgtc gccctttgag tttgtgatcc ggcacttctg ctggcttctg ctggtggtct 3961 tatgcgttgg cgcctgcaac agccttttgg tgttccccat cctactgagc atggtgggac 4021 cggaggcgga gctggtgccg ctggagcatc cagaccgcat atccacgccc tctccgctgc 4081 ccgtgcgcag cagcaagaga tcgggcaaat cctatgtggt gcagggatcg cgatcctcgc 4141 gaggcagctg ccagaagtcg catcaccacc accacaaaga ccttaatgat ccatcgctga 4201 cgacgatcac cgaggagccg cagtcgtgga agtccagcaa ctcgtccatc cagatgccca 4261 atgattggac ctaccagccg cgggaacagc gacccgcctc ctacgcggcc ccgccccccg 4321 cctatcacaa ggccgccgcc cagcagcacc accagcatca gggcccgccc acaacgcccc 4381 cgcctccctt cccgacggcc tatccgccgg agctgcagag catcgtggtg cagccggagg 4441 tgacggtgga gacgacgcac tcggacagca acaccaccaa ggtgacggcc acggccaaca 4501 tcaaggtgga gctggccatg cccggcaggg cggtgcgcag ctataacttt acgagttagc 4561 actagcacta gttcctgtag ctattaggac gtatctttag actctagcct aagccgtaac 4621 cctatttgta tctgtaaaat cgatttgtcc agcgggtctg ctgaggattt cgttctcatg 4681 gattctcatg gattctcatg gatgcttaaa tggcatggta attggcaaaa tatcaatttt 4741 tgtgtctcaa aaagatgcat tagcttatgg tttcaagata catttttaaa gagtccgcca 4801 gatatttata taaaaaaaat ccaaaatcga cgtatccatg aaaattgaaa agctaagcag 4861 acccgtatgt atgtatatgt gtatgcatgt tagttaattt cccgaagtcc ggtatttata 4921 gcagctgcct tccgcgcccc ccttcccttg aaatgaacac ccttccagcc acgccccacc 4981 gcccctctgc gtagcagctt tgtatgtatg tagtatgcta gcacctaagg aatacttaaa 5041 cttagagata tttattgtaa cacacgcaaa acacacacaa tgtacttaca tataattcaa 5101 tgcgagattc acccacacaa aaaggaaaca caacaaacta gtaattgtag ctcgtaattt 5161 agtttaaata tgttacataa aacacaagga cttgaaccaa aatagtatcg cttaaacgga 5221 aacgagagaa acgagaaaaa ataactatta cttaatcaac tacaagagag atatccctcc 5281 tcccctaacc gtacttacaa ccaaaataaa acaagagtat aagcataaaa atggaaaacg 5341 aagcgaggaa cgattgtaaa cgcggtcatt tatcctgtac atttgttgcc cgaagactga 5401 ctgtcttttt tttaataaaa atatatatta tacagttttt taaaagcgaa attcatgact 5461 tttttttaac agtgagcaga gaacaaaaga aacggaagtt ttcgctgtat caataaaaag 5521 attccatttt tttaataaat tgtaaaaatc ctaaaaaaaa gaagactaca aaagtttaaa 5581 tttttatacg ttattgataa acttttatac acgaaaatac ttgtacttag ctatgatcaa 5641 ctccttggct taagtctcgg gtaag // LOCUS BLYGEH 1250 bp ss-mRNA PLN 28-JUL-1990 DEFINITION Barley (1->3)-beta-glucan endohydrolase mRNA, complete cds. ACCESSION M23548 X15205 KEYWORDS glucan endohydrolase. SOURCE Barley (2 days into germination) scutellum, cDNA to mRNA, clone lambda-3. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1250) AUTHORS Hoej,P.B., Hartman,D.J., Morrice,N.A., Doan,D.N.P. and Fincher,G.B. TITLE Purification of (1->3)-beta-glucan endohydrolase isoenzyme II from germinated barley and determination of its primary structure from a cDNA clone JOURNAL Plant Mol. Biol. 13, 31-42 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by G.W.Fincher, 02-MAY-1989. FEATURES from to/span description pept 48 1052 glucan endohydrolase precursor sigp 48 131 glucan endohydrolase signal peptide matp 132 1049 glucan endohydrolase mRNA < 1 1250 GEH mRNA BASE COUNT 262 a 404 c 362 g 222 t ORIGIN 1 ccagcattgc atagcatttg agcaccagat actccatgtg tgcagcaatg gctagaaaag 61 atgttgcctc catgtttgca gctgctctct tcattggagc gttcgctgct gttcctacga 121 gtgtgcagtc catcggcgtg tgctacggcg tgatcggcaa caacctcccc tcccggagcg 181 acgtggtgca gctctacagg tccaagggca tcaacggcat gcgcatctac ttcgccgacg 241 ggcaggccct ctcggcgctc cgcaactccg gcatcggcct catcctcgac atcggcaacg 301 accagctcgc caacatcgcc gccagcacct ccaacgcggc gtcctgggtc cagaacaacg 361 tgcggcccta ctaccctgcc gtgaacatca agtacatcgc cgccggcaac gaggtgcagg 421 gcggcgccac gcagagcatc ctgccggcca tgcgcaacct caacgcggcc ctctccgcgg 481 cggggctcgg cgccatcaag gtgtccacct ccatccggtt cgacgaggtg gccaactcct 541 tcccgccctc cgccggcgtg ttcaagaacg cctacatgac ggacgtggcc cggctcctcg 601 cgagcaccgg cgcgccgctg ctcgccaacg tctaccccta cttcgcgtac cgtgacaacc 661 ccgggagcat cagcctgaac tacgcgacgt tccagccggg caccaccgtg cgtgaccaga 721 acaacgggct gacctacacg tccctgttcg acgcgatggt ggacgccgtg tacgcggcgc 781 tggagaaggc cggcgcgccg gcggtgaagg tggtggtgtc ggagagcggg tggccgtcgg 841 cgggcgggtt tgcggcgtcg gccggcaatg cgcggacgta caaccagggg ctgatcaacc 901 acgtcggcgg gggcacgccc aagaagcggg aggcgctgga gacgtacatc ttcgccatgt 961 tcaacgagaa ccagaagacc ggggacgcca cggagaggag cttcgggctc ttcaacccgg 1021 acaagtcgcc ggcatacaac atccagttct agtgtagcta cctagctcac atacctacat 1081 ccccagccta aataaataag ctgctcgtac gtacgtaatg cggcatccaa gtgtaacgta 1141 gacacgtaca ttcatccatg gaagagtgca accaagcatg cgttaacttc ctggtgatga 1201 tacatcatca tggtatgaat aaaagatatg gaagatgtta tgaatttgtg // LOCUS ECOPOLBDA 4666 bp ds-DNA BCT 28-JUL-1990 DEFINITION E.coli DNA polymerase (polB) gene, 5' flank. ACCESSION M35371 KEYWORDS DNA polymerase; polB gene. SOURCE E.coli (strain W3110) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 4666) AUTHORS Chen,H., Sun,Y., Stark,T., Beattie,W. and Moses,R. TITLE Nucleotide sequence and deletion analysis of the polB gene of E.coli JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Chen, 20-JUN-1990. Author address: H.Chen Baylor College of Medicine Cell Biology and IMG One Baylor Plaza Houston, TX 77030 email: hchen@vulture.bcm.tmc.edu FEATURES from to/span description signal 1209 1214 -35 region signal 1229 1234 -10 region binding 1250 1255 ribosome binding site site 1193 1208 lexA box BASE COUNT 1100 a 1278 c 1257 g 1031 t ORIGIN 1 aagcttgcat gcctgcaggt cgactctaga ggatcctcgc tggtggcgcg caccataccg 61 tcttcagcca tgcactgaac ctcaacgata tgcgccaatt cgccgagatg cacgacattg 121 aaatcacggt gattgataac gacacacgcc tgccagcgtt taaagacgcg ctgcgctgga 181 acgaagtgta ttacgggttt cgtcgctaag tagccgcatc cggtatgtaa cgcctgatgc 241 gacgctgacg cgtcttatct ggcctacacg ctgcgatttt gtaggccgga taagcaaagc 301 gcatccggca ttcaacgcct gatgcgacgc tggcgcgtct tatcaggcct acgcgctgcg 361 attttgtagg ccggataagc aaagcgcatc cggcattcaa cgcctgatgc gacgctggcg 421 cgtcttatca ggcctacacg ctgcgatttt gtaggccgga taagcaaagc gcatccggca 481 cgaaggagtc aacatgttag aagatctcaa acgccaggta ttagaagcca acctggcgct 541 gccaaaacac aacctggtca cgctcacatg gggcaacgtc agcgccgttg atcgcgagcg 601 cggcgtcttt gtgatcaaac cttccggcgt cgattacagc gtcatgaccg ctgacgatat 661 ggtcgtggtt agcatcgaaa ccggtgaagt ggttgaaggt acgaaaaagc cctcctccga 721 cacgccaact caccggctgc tctatcaggc attcccctcc attggcggca ttgtgcatac 781 gcactcgcgc cacgccacca tctgggcgca ggcgggtcag tcgattccag caaccggcac 841 cacccacgcc gactatttct acggcaccat tccctgcacc cgcaaaatga ccgacgcaga 901 aatcaacggc gaatatgagt gggaaaccgg taacgtcatc gtagaaacct ttgaaaaaca 961 gggtatcgat gcagcgcaaa tgcccggcgt tctggtccat tcccacggcc cgtttgcatg 1021 gggcaaaaat gccgaagatg cggtgcataa cgccatcgtg ctggaagagg tcgcttatat 1081 ggggatattc tgccgtcagt tagcgccgca gttaccggat atgcagcaaa cgctgctgga 1141 taaacactat ctgcgtaagc atggcgcgaa ggcatattac gggcagtaat gactgtataa 1201 aaccacagcc aatcaaacga aaccaggcta tactcaagcc tggttttttg atggattttc 1261 agcgtggcgc aggcaggttt tatcttaacc cgacactggc gggacacccc gcaagggaca 1321 gaagtctcct tctggctggc gacggacaac gggccgttgc aggttacgct tgcaccgcaa 1381 gagtccgtgg cgtttattcc cgccgatcag gttccccgcg ctcagcatat tttgcagggt 1441 gaacaaggct ttcgcctgac accgctggcg ttaaaggatt ttcaccgcca gccggtgtat 1501 ggcctttact gtcgcgccca tcgccaattg atgaattacg aaaagcgcct gcgtgaaggt 1561 ggcgttaccg tctacgaggc cgatgtgcgt ccgccagaac gctatctgat ggagcggttt 1621 atcacctcac cggtgtgggt cgagggtgat atgcacaatg gcactatcgt taatgcccgt 1681 ctgaaaccgc atcccgacta tcgtccgccg ctcaagtggg tttctataga tattgaaacc 1741 acccgccacg gtgagctgta ctgcatcggc ctggaagcgt gcgggcagcg catcgtttat 1801 atgctggggc cggagaatgg cgacgcctcc tcgcttgatt tcgaactgga atacgtcgcc 1861 agccgcccgc agttgctgga aaaactcaac gcctggtttg ccaactacga tcctgatgtg 1921 atcatcggtt ggaacgtggt gcagttcgat ctgcgaatgc tgcaaaaaca tgccgagcgt 1981 taccgtcttc cgctgcgtct tgggcgcgat aatagcgagc tggagtggcg cgagcacggc 2041 tttaaaaacg gcgtcttttt tgcccaggct aaaggtcggc taattatcga cggtatcgag 2101 gcgctgaaat ccgcgttctg gaatttctct tcattctcgc tggaaactgt cgctcaggag 2161 ctattaggcg aaggaaaatc tatcgataac ccgtgggatc gaatggacga aattgaccgc 2221 cgtttcgccg aagataaacc tgcgctggca acttataacc tgaaagattg cgagctggtg 2281 acgcagatct tccacaaaac tgaaatcatg ccatttttac tcgaacgggc aacggtgaac 2341 ggcctgccgg tggaccgaca cggcggttcg gtggcggcat ttggtcatct ctattttccg 2401 cgaatgcatc gcgctggtta tgtcgcgcct aatctcggcg aagtgccgcc gcacgccagc 2461 cctggcggct acgtgatgga ttcacggcca gggctttatg attcagtgct ggtgctggac 2521 tataaaagcc tgtacccgtc gatcatccgc acctttctga ttgatcccgt cgggctggtg 2581 gaaggcatgg cgcagcctga tccagagcac agtaccgaag gttttctcga tgcctggttc 2641 tcgcgagaaa aacattgcct gccggagatt gtgactaaca tctggcacgg gcgcgatgaa 2701 gccaaacgcc agggtaacaa accgctgtcg caggcgctga aaatcatcat gaatgccttt 2761 tatggcgtgc tcggcaccac cgcctgccgc ttcttcgatc cgcggctggc atcgtcgatc 2821 accatgcgtg gtcatcagat catgcggcaa accaaagcgt tgattgaagc acagggctac 2881 gacgttatct acggcgatac cgactcaacg tttgtctggc tgaaaggcgc acattcggaa 2941 gaagaagcgg cgaaaatcgg tcgtgcactg gtgcagcacg ttaacgcctg gtgggcggaa 3001 acgctgcaaa aacaacggct gaccagcgca ttagaactgg agtatgaaac ccatttctgc 3061 cgttttctga tgccaaccat tcgcggagcc gataccggca gtaaaaagcg ttatgccgga 3121 ctgattcagg agggcgacaa gcagcggatg gtgtttaaag ggctggaaac cgtgcgcacc 3181 gactggacgc cgctggccca gcagtttcag caggagctat acctgcgcat cttccgcaac 3241 gagccatatc aggaatatgt acgcgaaacc atcgacaaac tgatggcggg tgaactggat 3301 gcgcgactgg tttaccgtaa acgccttcgc cgtccgctga gcgagtatca gcgtaatgtg 3361 ccgcctcatg tacgcgccgc tcgccttgcc gatgaagaaa accaaaagcg tggtcgcccc 3421 ttgcaatatc agaatcgcgg caccattaag tacgtatgga ccaccacagg cccggagccg 3481 cctggactac caacgttcac cactggatta cgaacactat ctgacccgcc agctacaacc 3541 cgtggcggag ggaatactcc cttttattga ggataatttt gctacactta tgaccgggca 3601 acttgggcta ttttgagcaa aaaaaagagt tcgccagata ccattttgat gcgtgacgaa 3661 tgctttgcca tccagtacca tagcgccctt tccattcctg gacctgaata acaccactac 3721 ctcataagca cggtagcggg tggttattgc ctgcaattaa agatatagag ccgaacacat 3781 atgcctttta cacttggtca acgctggatc agcgatacag aaagcgaatt gggacttgga 3841 accgttgtcg cggtggatgc gcgaactgtc actttacttt tcccatctac tggtgaaaac 3901 cgtctgtacg cacgcagtga ttcccccgtg acccgcgtga tgttcaaccc tggtgatacc 3961 attaccagcc atgacggctg gcagatgcaa gtcgaagaag taaaagaaga aaatggcttg 4021 ctgacctata tcggtactcg cctggatact gaagaggtcc ggcgtagccc tgcgtgaagt 4081 tttccttgat agcaaactgg tgttcagcaa accgcaggca ccgtctgttt gccgggcaga 4141 ttgaccgtat ggaccgcttt gcgctgcgtt atcgcgcgcg taaatattcc agcgaacagt 4201 tccgtatgcc gtacagcggc ctgcgcggtc agcgtaccag cctgatccgc atcagctcaa 4261 catcgctcat gatgttggtc gccgccacgc gccgcgcgtc ctgctggctg acgaagtggg 4321 tttagggaaa accattgaag ccgggatgat cctgcatcag caactgctct ctggcgctgc 4381 tgaacgtgtg ctaattatcg tcccggaaac cttacagcat cagtggctgg tagaaatgct 4441 gcgccgtttc aacctgcgct ttgcgctatt tgatgatgag cgttatgccg aagctcagca 4501 cgatgcttac aacccgtttg acaccgtgaa gcggcgcacg aaaaacgcga aagcgtttca 4561 cgataaatgc gaaaacttta gctttcgcgc ttcaaatgaa acagatgtat taattactgc 4621 tttttattca ttacatgggg atccccgggt accgagctcg aattcc // LOCUS HUMBIGFII 1387 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human insulin-like growth factor binding protein 2 (IGFBP2) mRNA, complete cds. ACCESSION M35410 KEYWORDS insulin-like growth factor binding protein 2. SOURCE Human 67-year old retina, cDNA to mRNA, clone AS200. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1387) AUTHORS Agarwal,N., Hsieh,C.-L., Sills,D., Swaroop,M., Desai,B., Francke,U. and Swaroop,A. TITLE Sequence analysis, expression and chromosomal localization of a gene, isolated from a subtracted human retina cDNA library, that encodes an insulin-like growth factor binding protein (IGFBP2) JOURNAL Exp. Eye Res. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Swaroop, 20-JUN-1990. FEATURES from to/span description pept 64 1050 insulin-like growth factor binding protein 2 (IGFBP2) precursor /hgml_locus_uid="LN0220S" /nomgen="IBP1" /map="7p13-p12" sigp 64 180 insulin-like growth factor binding protein 2 signal peptide matp 181 1047 insulin-like growth factor binding protein 2 mRNA < 1 1387 IGFBP2 mRNA signal 1362 1367 poly-A signal binding 175 197 ATP binding site BASE COUNT 232 a 455 c 477 g 223 t ORIGIN 1 gtgccacctg cccgcccgcc cgctcgctcg ctcgcccgcc gcgccgcgct gccgaccgcc 61 agcatgctgc cgagagtggg ctgccccgcg ctgccgctgc cgccgccgcc gctgctgccg 121 ctgctgccgc tgctgctgct gctactgggc gcgagtggcg gcggcggcgg ggcgcgcgcg 181 gaggtgctgt tccgctgccc gccctgcaca cccgagcgcc tggccgcctg cgggcccccg 241 ccggttgcgc cgcccgccgc ggtggccgca gtggccggag gcgcccgcat gccatgcgcg 301 gagctcgtcc gggagccggg ctgcggctgc tgctcggtgt gcgcccggct ggagggcgag 361 gcgtgcggcg tctacacccc gcgctgcggc caggggctgc gctgctatcc ccacccgggc 421 tccgagctgc ccctgcaggc gctggtcatg ggcgagggca cttgtgagaa gcgccgggac 481 gccgagtatg gcgccagccc ggagcaggtt gcagacaatg gcgatgacca ctcagaagga 541 ggcctggtgg agaaccacgt ggacagcacc atgaacatgt tgggcggggg aggcagtgct 601 ggccggaagc ccctcaagtc gggtatgaag gagctggccg tgttccggga gaaggtcact 661 gagcagcacc ggcagatggg caagggtggc aagcatcacc ttggcctgga ggagcccaag 721 aagctgcgac caccccctgc caggactccc tgccaacagg aactggacca ggtcctggag 781 cggatctcca ccatgcgcct tccggatgag cggggccctc tggagcacct ctactccctg 841 cacatcccca actgtgacaa gcatggcctg tacaacctca aacagtgcaa gatgtctctg 901 aacgggcagc gtggggagtg ctggtgtgtg aaccccaaca ccgggaagct gatccaggga 961 gcccccacca tccgggggga ccccgagtgt catctcttct acaatgagca gcaggaggct 1021 cgcggggtgc acacccagcg gatgcagtag accgcagcca gccggtgcct ggcgcccctg 1081 ccccccgccc ctctccaaac accggcagaa aacggagagt gcttgggtgg tgggtgctgg 1141 aggattttcc agttctgaca cacgtattta tatatggaaa gagaccagca ccgagctcgg 1201 cacctccccg gcctctctct tcccagctgc agatgccaca cctgctcctt cttgctttcc 1261 ccgggggagg aagggggttg tggtcgggga gctggggtac aggtttgggg agggggaaga 1321 gaaattttta tttttgaacc cctgtgtccc ttttgcataa gattaaagga aggaaaagta 1381 aagtgtg // LOCUS HUMLBPA 1431 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human lipopolysaccharide binding protein (LBP) mRNA, complete cds. ACCESSION M35533 KEYWORDS lipopolysaccharide binding protein. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites; for [2]) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. TITLE Structure and function of lipopolysaccharide binding protein JOURNAL Science (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1431) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Tobias, 21-JUN-1990. Author address: P.S.Tobias Department of Immunology, IMM-12 10466 N. Torrey Pines Rd La Jolla, CA 92037 FEATURES from to/span description pept 1 > 1431 lipopolysaccharide binding protein (LBP) precursor sigp 1 75 lipopolysaccharide binding protein signal peptide matp 76 1431 lipopolysaccharide binding protein BASE COUNT 319 a 417 c 359 g 336 t ORIGIN 1 atgggggcct tggcaagagc cctgccgtcc atactgctgg cattgctgct tacgtccacc 61 ccagaggctc tgggtgccaa ccccggcttg gtcgccagga tcaccgacaa gggactgcag 121 tatgcggccc aggaggggct attggctctg cagagtgagc tgctcaggat cacgctgcct 181 gacttcaccg gggacttgag gatcccccac gtcggccgtg ggcgctatga gttccacagc 241 ctgaacatcc acagctgtga gctgcttcac tctgcgctga ggcctgtccc cggccagggc 301 ctgagtctca gcatctccga ctcctccatc cgggtccagg gcaggtggaa ggtgcgcaag 361 tcattcttca aactacaggg ctcctttgat gtcagtgtca agggcatcag catttcggtc 421 aacctcctgt tgggcagcga gtcctccggg aggcccacag gttactgcct cagctgcagc 481 agtgacatcg ctgacgtgga ggtggacatg tcgggagatt cggggtggct cttgaacctc 541 ttccacaacc agattgagtc caagttccag aaagtactgg agagcaggat ttgcgaaatg 601 atccagaaat cagtgtcctc cgatctacag ccttatctcc aaactctgcc agttacaaca 661 gagattgaca gtttcgccga cattgattat agcttagtgg aagcccctcg ggcaacagcc 721 cagatgctgg aggtgatgtt taagggtgaa atctttcatc gtaaccaccg ttctccagtt 781 accctccttg ctgcagctga ggaacacaac aaaatggtct actttgccat ctcggattat 841 gtcttcaaca cggccagcct ggtttatcat gaggaaggat atctgaactt ctccatcaca 901 gatgacatga taccgcctga ctctaatatc cgactgacca ccaagtcctt ccgacccttc 961 gtcccacggt tagccaggct ctaccccaac atgaacctgg aactccaggg atcagtgccc 1021 tctgctccgc tcctgaactt cagccctggg aatctgtctg tggaccccta tatggagata 1081 gatgcctttg tgctcctgcc cagctccagc aaggagcctg tcttccggct cagtgtggcc 1141 actaatgtgt ccgccacctt gaccttcaat accagcaaga tcactgggtt cctgaagcca 1201 ggaaaggtaa aagtggaact gaaagaatcc aaagttggac tattcaatgc agagctgttg 1261 gaagcgctcc tcaactatta catccttaac accctctacc ccaagttcaa tgataagttg 1321 gccgaaggct tcccccttcc tctgctgaag cgtgttcagc tctacgacct tgggctgcag 1381 atccataagg acttcctgtt cttgggtgcc aatgtccaat acatgagagt t // LOCUS HUMPEC12L 2344 bp ds-DNA PRI 28-JUL-1990 DEFINITION Human cell 12-lipoxygenase gene, complete cds. ACCESSION M35418 KEYWORDS lipoxygenase. SOURCE Human platelet/erythroleukemia cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2344) AUTHORS Funk,C.D., Furci,L. and FitzGerald,G.A. TITLE Molecular cloning, primary structure and expression of the human platelet/erythroleukemia cell 12-lipoxygenase JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.D.Funk, 20-JUN-1990. FEATURES from to/span description pept 42 2033 12-lipoxygenase signal 2324 2329 poly-A signal mRNA < 1 2344 12-lipoxygenase mRNA BASE COUNT 514 a 692 c 642 g 496 t ORIGIN 1 ggaggacccg gctcccctcg cctaagctgc tggggggcgc catgggccgc taccgcatcc 61 gcgtggccac cggggcctgg ctcttctccg ggtcgtacaa ccgcgtgcag ctttggctgg 121 tcgggacgcg cggggaggcg gagctggagc tgcagctgcg gccggcgcgg ggcgaggagg 181 aggagtttga tcatgacgtt gcagaggact tggggctcct gcagttcgtg aggctgcgca 241 agcaccactg gctggtggac gacgcgtggt tctgcgaccg catcacggtg cagggccctg 301 gagcctgcgc ggaggtggcc ttcccgtgct accgctgggt gcagggcgag gacatcctga 361 gcctgcccga gggcaccgcc cgcctgccag gagacaatgc tttggacatg ttccagaagc 421 atcgagagaa ggaactgaaa gacagacagc agatctactg ctgggccacc tggaaggaag 481 ggttacccct gaccatcgct gcagaccgta aggatgatct acctccaaat atgagattcc 541 atgaggagaa gaggctggac tttgaatgga cactgaaggc aggggctctg gagatggccc 601 tcaaacgtgt ttacaccctc ctgagctcct ggaactgcct agaagacttt gatcagatct 661 tctggggcca gaagagtgcc ctggctgaga aggttcgcca gtgctggcag gatgatgagt 721 tgttcagcta ccagttcctc aatggtgcca accccatgct gttgagacgc tcgacctctc 781 tgccctccag gctagtgctg ccctcgggga tggaagagct tcaggctcaa ctggagaaag 841 aacttcagaa tggttccctg tttgaagctg acttcatcct tctggatgga attccagcca 901 acgtgatccg aggagagaag caatacctgg ctgcccccct cgttatgctg aagatggagc 961 ccaatgggaa gctgcagccc atggtcatcc agattcagcc tcccagcccc agctctccaa 1021 ccccaacact gttcctgccc tcagaccccc cacttgcctg gctcctggca aagtcctggg 1081 tccgaaattc agatttccaa ctgcacgaga tccagtatca cttgctgaac actcacctgg 1141 tggctgaggt catcgctgtc gccaccatgc ggtgcctccc aggactgcac cccatcttca 1201 agttcccgat cccccatatc cgctacacca tggaaatcaa cacccgggcc cggacccaac 1261 tcatctcaga tggaggaatt tttgataagg cagtgagcac aggtggaggg ggccatgtac 1321 agttgctccg tcgggcggca gctcagctga cctactgctc cctctgtcct cctgacgacc 1381 tggctgaccg gggcctgctg ggactcccag gtgctctcta tgcccatgat gctttacggc 1441 tctgggagat cattgccagg tatgtggagg ggatcgtcca cctcttctac caaagggatg 1501 acatagtgaa gggggaccct gagctgcagg cctggtgtcg ggagatcacg gaggtggggc 1561 tgtgccaggc ccaggaccga ggtttccctg tctccttcca gtcccagagt caactctgcc 1621 atttcctcac catgtgcgtc ttcacgtgca ctgcccagca tgccgccatc aaccagggcc 1681 agctggactg gtatgcctgg gtccctaatg ctccatgcac aatgcggatg cccccaccca 1741 ccaccaagga agatgtgacg atggccacag tgatggggtc actacctgat gtccggcagg 1801 cctgtcttca aatggccatc tcatggcatc tgagtcgccg ccagccagac atggtgcctc 1861 tggggcacca caaagaaaaa tatttctcag gccccaagcc caaagctgtg ctaaaccaat 1921 tccgaacaga tttggaaaag ctagaaaagg agattacagc ccggaatgag caacttgact 1981 ggccctatga atatctgaag cccagctgca tagagaacag tgtcaccatc tgagccctag 2041 agtgactcta cctgcaagat ttcacatcag ctttaggact gacatttcta tcttgaattt 2101 catgctttcc taaagtctct gctgctaagg ctctatttcc tcccccagtt aaacccctac 2161 attagtatcc cactagccca ggggagcagt aaactttctc tgcaaagact agatcctttt 2221 ttacgctttg cagaccgcat agtcactgtc tcaactactc agctctcctg ctgcagcatg 2281 aaggcagcca cagacaacat ggaaatgagt gtgactatgt tccaataaaa ctttatggac 2341 actg // LOCUS HUMRALBA 1327 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human GTP-binding protein (RALB) mRNA, complete cds. ACCESSION M35416 KEYWORDS GTP-binding protein. SOURCE Human retina, cDNA to mRNA, clone AS181. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1327) AUTHORS Hsieh,C.-L., Swaroop,A. and Francke,U. TITLE Chromosomal localization and cDNA sequence of human RALB, a GTP binding protein JOURNAL Somat. Cell Mol. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Swaroop, 20-JUN-1990. FEATURES from to/span description pept 171 791 GTP-binding protein (RALB) mRNA < 1 1327 RALB mRNA signal 1303 1308 poly-A signal BASE COUNT 368 a 273 c 373 g 313 t ORIGIN Chromosome cen-q13. 1 gagcccggca gctcaatgac aaatcggtgg aggacggctg gggtccggcc ccgggagggc 61 ccggggcgcg tttaagagct gcgggccggg tgcggacggc ggaggcggcg ggactggtcc 121 ctgctcttca gtgggtcatc tgtgtgtcac agcctcagaa gaccagcgag atggctgcca 181 acaagagtaa gggccagagc tccttggccc tccacaaggt gatcatggtt ggcagcggag 241 gcgttggcaa gtcagccctg acgcttcagt tcatgtatga cgagtttgta gaagactatg 301 aacctaccaa agctgacagt tatagaaaga aagtggttct tgatggggaa gaagttcaga 361 tagatattct ggacaccgct gggcaagagg actacgcagc cattcgagat aactactttc 421 ggagtgggga agggtttctt cttgtgttct caatcacaga acatgaatcc tttacagcaa 481 ctgccgaatt cagggaacag attctccgtg tgaaggctga agaagataaa attccactgc 541 tcgtcgtggg aaacaagtct gacctagagg agcggaggca ggtgcctgtg gaggaggcca 601 ggagtaaagc cgaagagtgg ggcgtgcagt acgtggagac gtcagcgaag acccgggcca 661 acgtggacaa ggtgttcttt gacctaatga gagaaatcag aacaaagaag atgtcagaaa 721 acaaagacaa gaatggcaag aaaagcagca agaacaagaa aagttttaaa gaaagatgtt 781 gcttactatg agtgtcaagg tgacggatga agccagctgc tcctaaggac acagggctgg 841 gttggtaaag agaaggctat ggttgacttc ttgcttgtgc ttcccactct ccccgacttc 901 attcactcaa acttctttaa atggggaaaa atatttgtga ctctgtggct ggcagaagaa 961 ataagcccat gcaagtggaa gggctgcttt gtcaggaggt tgtggaattt ctttcttctc 1021 cccttcttcc ctcccaaaag cttagctatg tataaagtgc cacagatagg aaacagctgt 1081 taattacaaa gagaaagaat tgtcatagca tcttattttg ttcctagttt tataacatta 1141 ccatccttcg ttttgaacta cagatgttgt agtgggtttt ggaggaggga gtggagtaag 1201 atgccctccc acttttatca gtttagtagt agtactgaga aaaatccctt cagctctaag 1261 aacactgaaa aatccaccga ttttttgggt aagcttcttg gcaataccct gtggatctga 1321 aacagct // LOCUS LACLACR 1332 bp ds-DNA BCT 28-JUL-1990 DEFINITION L.lactis lactose phosphotransferase system repressor (lacR) gene, complete cds. ACCESSION M35375 KEYWORDS lactose phosphotransferase system repressor; lactose repressor. SOURCE L.lactis (strain MG1820) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1332) AUTHORS Van Rooijen,R.J. and Devos,W.M. TITLE Molecular cloning, transcriptional analysis, and nucleotide sequence of LACR, a gene encoding the repressor of the lactose phosphotransferase system of Lactococcus lactis JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.J.Van Rooijen, 20-JUN-1990. Author address: R.J.Van Rooijen Netherlands Inst for Dairy Research Kernhemseweg 2 P.O. Box 20 6710 BA EDE THE NETHERLANDS FEATURES from to/span description pept 370 1155 lactose repressor (lacR; alt.) pept 388 1155 lactose repressor (lacR; alt.) mRNA 79 1245 lactose repressor mRNA signal 1215 > 1155 transcription termination signal binding 353 357 ribosomal binding site signal 45 51 -35 region signal 68 74 -10 region BASE COUNT 469 a 207 c 198 g 458 t ORIGIN 1 gatatcaaac attcaaacaa aacgcaacta tttttgttaa ttttttgttt ttttttattt 61 gtttttttaa aaaatagata acaccgttaa attattgttc atttttgttc atttaatcca 121 tcacaaaatg gacgtgaaat atctattcag gtattacaaa agtcttttac tttctataac 181 ttactgatta agaggtccta ctttattttc gtcttataca aaatctgacc taagctaata 241 tacgtcaatc ctctgttctt atttcatcat ctaacgtttg tttttgtttg aaattgtttg 301 ttttaccttg aaaatattat cttttatgat acaattaaaa gagaattatc tttggaaaaa 361 aattacttta tgaaagaaag tcttcatatg aacaaaaaac gacgattaga aaaaatttta 421 gatatgttaa agattgatgg gaccataacc ataaaagaaa taatagatga actagatatt 481 tccgatatga cagcccgtag agaccttgat gctctagaag ctgatggact tttaacacgt 541 actcatggtg gtgcacaatt gctttcctct aaaaagccac ttgaaaagac acatatcgag 601 aagaaaagtc taaatacaaa agaaaaaatt gacattgcta aaaaagcctg ctctttaatc 661 aaagatggcg atactatttt tattggaccc ggaactacac ttgtacaact ggcattagaa 721 ttgaaaggtc gtaaaggtta taaaattcgt gtcattacaa atagtctccc tgtgttcttg 781 attctaaatg atagcgaaac cattgattta ttgcttcttg gcggtgaata tagagaaata 841 actggagctt ttgtaggttc aatggcttcg acaaatttaa aagcaatgag atttgccaaa 901 gcttttgttc gtgcaaatgc tgttacccat aattctattg ctacatatag tgacaaggaa 961 ggtgtgattc aacaacttgc cctaaacaat gctgtagaaa aattcttatt agtagacagt 1021 actaaattcg atcgatacga tttctttaac ttctacaatc tagatcaact cgataccatc 1081 attacagata accagattag ccctcaacac ttagaggaat ttagccagta cactactatt 1141 ttaaaagcgg actagaatta tgacttataa aaatattgga ctactcttaa ataattagac 1201 ataaaaaaag caccgtatga atcaaacaat tctacggtgt ttttttgtta tttctaatgt 1261 atggtttgtc gaaaatatgt acacattatt taactttcca aaaaattgga gttttcttga 1321 taattggata tc // LOCUS MARCMYCA 1391 bp ds-DNA ROD 28-JUL-1990 DEFINITION Woodchuck c-myc protein gene, exon 1. ACCESSION M35498 KEYWORDS c-myc protein. SOURCE Woodchuck (Marmota monax) DNA. ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 1391) AUTHORS Wei,Y., Hsu,T.Y., Tiollais,P., Buendia,M.A. and Etiemble,J. TITLE Evolutionary conservation of target sequences for cis-acting regulation in c-myc exon 1 and its upstream sequences JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Buendia, 21-JUN-1990. FEATURES from to/span description pre-msg 818 > 1391 c-myc mRNA and introns IVS 1378 > 1391 c-myc intron 1 site 159 220 P0 promoter site 790 794 TATA box site 964 968 TATA box BASE COUNT 317 a 385 c 383 g 306 t ORIGIN 1 ctcagcgatt agtgcgtctt gcgggaatag ccgcttccca cacccggccg ggtggaagtc 61 tgagcctgct gggcaaaacg agcgatatct gctgttttgg cagcaaacta ggggattcat 121 tctgggtggg aagtgcccaa tctagatagc tgtgcataca taatgcataa tgaattacac 181 tcacacaacc tcaagaaatg taataggtat gtattcataa cactctccaa gtatatgtgg 241 caaggcattg ctgcgttatt ttaattattc cagaaatcat tttcctccct acctcctctg 301 tcatttatcc ctaacactcc atatactgaa tgcgcactca taaatattcc ttctgcccgc 361 ctgtcttcat aagacttatt ttcaaaatgc tgctctttcc ccagccttag ggaggcgccc 421 ggccgcccgg gacgtgcgtg cgcggccgtg ggtacatggt gtattctcag tgttgagggt 481 gagggcagct gttccacctg tgttaattgg aacacgcagg acgagaatgc agtttgtcag 541 agtactgcgc cagaggagca gcagagaaag ggaaaggatt taaacaggag caaaagaaaa 601 tggtaggcgc gcgcagttaa ttcttgctgc gcccttatac tgtttacatc cgatagctgg 661 agtgccgggc tgcggggctg agtctcctcc ccttccctca ctcggcagtg cccctcccag 721 gttcccaaag ccgagggcgg ggagaaagaa aaaaaaaaga ttccgtggaa tccccgccca 781 ccagcccttt ataatacgag ggtctgcgcg cccgaggacc cctgagctgc gcttctcgtg 841 gccgccaaca tcgccgcgcc ccggcggccg ctcttggctc ccctcctgcc tagagaaggg 901 cagggcttct tagaggcttg gcgggaaaaa gaagcgaggg ggagggatcg cgcgtaacag 961 cagtataaaa gtcgttttcg gggctttatc tcactcgctg tagtaattcc agcgagagac 1021 agagggagtg agcgggcgaa cccgtgaggt ggaagaaccg agcagagctg ctccgggcgt 1081 cctgggaagg gaaacccgga gtgaaaggag acttagtctt ctgaccagcg cccccacccc 1141 agccctcccg cggagcccct ccagggtccg caaccgcgaa actttgccct ttgctgcggg 1201 cggacacttt gcactggaac ttaaaatacc cgatcgagga cgcgactctc cggagcgggg 1261 aggctatact gcctatttgg ggacactttt ccccgccttt acccaggacc cgctcctctg 1321 aaagcgctcc tggctgccgt ttgaaggctg gatttccttc gggtagttga aaacccggta 1381 agcaccagat c // LOCUS ONGOSTLE 214 bp ds-DNA INV 28-JUL-1990 DEFINITION O.volvulus recombinant antigen gene, 3' end. ACCESSION M35370 KEYWORDS T-cell epitope; recombinant antigen. SOURCE O.volvulus DNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 214) AUTHORS Colina,K.F., Perler,F.B., Matsumura,I., Meda,M. and Nutman,T.B. TITLE The identification of an Onchocerca-specific recombinant antigen containing a T cell epitope JOURNAL Unpublished (19900 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.B.Nutman, 19-JUN-1990. Author address: T.B.Nutman Inst. Lab. of Parasitic Diseases NIAID National Institutes of Health Bldg 4, Rm 126 9000 Rockville Pike Bethesda, MD 20892 email: tbn@helix.nih.gov FEATURES from to/span description pept < 1 93 recombinant antigen (AA at 1) site 91 93 nematode splice junction BASE COUNT 81 a 37 c 31 g 65 t ORIGIN 1 gaattcagtg taagaagcag cagaacattt caatcattac gaagatatat atacaacatt 61 tctttcttct tcattcttga gttgcatatg taaattcaaa aataattacg atttaatgaa 121 ttgagcaagc ataacttttc ccagcaagta taacaaagtt ttgcgaggaa cgaactcaga 181 aaactttcac ttatgtaaaa ttgcgcacaa gacc // LOCUS PSEIAAL 2766 bp ds-DNA BCT 28-JUL-1990 DEFINITION P.syringae IAA-lysine synthetase (iaaL) gene, complete cds. ACCESSION M35373 KEYWORDS IAA-lysine synthetase. SOURCE P.syringae savastanoi DNA. ORGANISM Pseudomonas syringae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2766) AUTHORS Roberto,F., Klee,H., White,F., Nordeen,R. and Kosuge,T. TITLE Expression and fine structure of the gene encoding IAA-lysine synthetase from Pseudomonas savastanoi JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.J.Klee, 20-JUN-1990. FEATURES from to/span description pept 1100 2287 IAA-lysine synthetase (iaaL) pept 95 1003 ORF1 BASE COUNT 637 a 760 c 754 g 615 t ORIGIN 1 gaattccata gcgtgcgggg cttggaggag cgccgcggcc tgagtatctg tggctaaccc 61 ttgcggcttc ggtgctggtc gctgtcgagc agctatgcgt gcagatcctg cgcagtgcag 121 gcttcggcaa acaggcgatg tggaccctgc tgacggggac ggccgccgtt gccatcgcag 181 atcccctgct tattgtggcg ttcgatctgg gcctggtggg tgccggcatc gctacctgat 241 atcgagcctg gtatcggcct gtctggggtt ttactacgtt caccgagtcg cccatctgac 301 ctgtcgggtc agcctgaaga acctgtcagg tgacatcaga aatatcgggc gaaccgcctt 361 gccagcggtg attggcaacc tggcaactcc agtgggcatg gcctacgtaa tggctgcgat 421 ggcgccgttc ggatctcagg cgctggcgac tatcggggtg atcgacaggg tcattcaggt 481 tgctttttgt gtcgtgttcg ccttgcccgg tgcgctgatc ccgatactgg ggcaaaacct 541 gggcgcaatg aacactgctc gcgtgtctca agccataaag atgacgtacg gattgttgat 601 cggctacggc tcagtgacct cgctgttact cattctgctc gctgagccat tagccagctt 661 gtttcatctc gccgctgaac gccaagtcgt gttcttcgcg ttctgccgat ggggcggcgc 721 tctggacgct catcgggctg caattcattg ccacctcagt cttcctcagt atggggcgac 781 cggcgtacgt cacactgttc ggctggttcc gcgccacctg ggaaccatgc cgttcgtgtg 841 gtatggggca cataaatttg gcagcgtcgg ggtaatgctc gggcagttgc tgggtaacac 901 catagtggcc ttttgtgcct gcgtggctcg cgcatctgct catgaaaaag atgttggaca 961 tcgagatcca ttcaataggg aaccgatccc tccacaggag taactgataa tccacgtttt 1021 gcccaccctt ggctgtcgtc aggtgggcag gatgtccagg atgtccagga aatcaaaaaa 1081 cggactatag aggactcgca tgactgccta cgatatggaa aaggaatgga gtagaatttc 1141 cattactgcc gctaaaatcc accagaacaa cgattttgaa ggattcactt atcaggactt 1201 cagaacccac gtaccgatca tggacaaaga cggcttcgcg gcacagactg aacgctgtct 1261 agagcgcaat gagcgaaact gcctgatcgg ctttaccagt ggcaccagcg gcaacatcaa 1321 acgctgttat tactactacg actgcgaagt cgatgaagac agctccctct ccaacgtctt 1381 ccgcagcaac ggctttattc tgcccggtga tcgctgcgcc aacctgttca cgatcaacct 1441 gttttctgct ctgaacaaca cgattaccat gatggccggt aactgcggtg ctcacgtcgt 1501 gtccgtaggt gacatcaccc tggtgaccaa gagccatttc gaagcgctta actcgatcaa 1561 gctcaacgta ctgctcggcg tgccatccac tatcttgcag ttcatcaatg ccatgcaaca 1621 taacggtgtg cacatcaata tcgagaaggt tgtcttcacc ggcgagagcc tgaaaacttt 1681 ccagaagaaa atcatcaggc aagcctttgg cgaacaagtc tccatcgtcg gtgtgtatgg 1741 cagttccgag ggcggcattc tcggtttcac caacagccct tgccacactg aatacgagtt 1801 tctgtccgac aagtatttca tcgaaaaaga aggcgacagc atcctcatca cctcgctgac 1861 ccgagaaaac tttacgccgc tgctgaggta tcgcctagga gacaccgcaa ccctttcgat 1921 gaaaggcgac aagctctacc tgacagacat ccagcgggag gacatgagct tcaacttcat 1981 gggcaacctc atcgggctgg gcatcattca gcaaacgatt aaacagacac tgggccgatc 2041 gctggaaatc caggttcacc tgtcagtgac cgaagagcgc aaggaactgg tgaccgtttt 2101 cgttcaggcc tctgaagtcg atgaagacga acgcgtcaga atcgaaacag ccatcgccga 2161 tatccccgac atcaaagagg cgtatcagaa aaaccaaggc accgtgtcgg tcctgcgcaa 2221 ggatgccaga gactacgcgg tctcggagcg aggcaaaatg ctctacatca tcgaccgccg 2281 aaactgaatg gctgatgtga acgagtgagt agctgcaccg acggggcctt tggcggtgtc 2341 ggtgcagttt tttagaggat tcggaagcgc cagaggtcag agtccacgaa actggaacga 2401 actgggcagc ctgcggctgc aaattgtggg attttgaaat cggttatcat agccgaaatc 2461 gagtcgatcc ctcctcagca caggcttaca catggcgtca gagaccaaaa aacgtaaacg 2521 ggcgagccgg gcaaaagcca aggcaaagca gacccgtctc caacgcgccg ggcatactac 2581 cttcgtgccc gataccgact tttccttcga tatcgatcct ttcggtgatg tcgatctttg 2641 tagttgctgc cagacaacgt atctgaacga catgtttccc gacgcttctt gcgtaaggct 2701 ttagatgaga gaagggccag gcggattcgc atcaccgccg tcattcacca cgatgaggag 2761 ccgcct // LOCUS RABLPBA 1446 bp ss-mRNA MAM 28-JUL-1990 DEFINITION Rabbit lipopolysaccharide binding protein (LBP) protein mRNA, complete cds. ACCESSION M35534 KEYWORDS lipopolysaccharide binding protein. SOURCE Rabbit liver, cDNA to mRNA. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (ases 1 to 1446ites; for [2] AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. TITLE Structure and function of lipopolysaccharide binding protein JOURNAL Science (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1446) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Tobias, 21-JUN-1990. Author address: P.S.Tobias Department of Immunology, IMM-12 10466 N. Torrey Pines Rd La Jolla, CA 92037 FEATURES from to/span description pept 1 > 1446 lipopolysaccharide binding protein (LBP) precursor sigp 1 78 lipopolysaccharide binding protein signal peptide matp 79 1446 lipopolysaccharide binding protein BASE COUNT 316 a 454 c 391 g 285 t ORIGIN 1 atggggacct gggccagggc cctgctgggg tccaccctgc tgagcctgct gctcgcagct 61 gccccgggag ctctgggcac caaccccggc ctcatcacca ggatcaccga caaaggcctg 121 gagtacgcgg ccagggaggg gctgctggct ctgcagagaa agctcctgga agtcacgctg 181 ccggattccg atggggactt caggatcaaa catttcgggc gtgcacagta caagttctac 241 agtctgaaaa tccccagatt cgagctgctc cgtggcaccc tgaggcccct ccccggccag 301 ggcctgagtc tcgacatctc cgacgcctac atccacgtgc ggggcagctg gaaggtgcgc 361 aaggcgttcc tgagactgaa gaactccttt gacctgtatg tcaagggcct caccatttcc 421 gtccacctcg tgttgggcag cgagtcctcc gggaggccca cggtcaccac ctccagctgc 481 agcagcgaca tccagaacgt ggagttggac atagaggggg acctggagga gctgctgaac 541 ctcctccaaa gccagatcga tgccaggctg cgcgaagtgc tggagagcaa gatttgcagg 601 cagattgagg aagccgtgac ggcccacctg cagccttatc tacagacact gccagtcaca 661 acgcagatcg acagctttgc cggcattgac tacagcttga tggaggcccc ccgggcaaca 721 gctgggatgt tggatgtgat gtttaagggt gaaattttcc ctctggatca ccgcagccca 781 gtggacttcc ttgctccagc catgaacctc cccgaggctc acagccgaat ggtctacttt 841 tccatctccg attacgtctt caacaccgcc agcctggcct accacaagtc agggtactgg 901 aacttctcca tcacagacgc catggttccg gccgacctca acatccggcg gaccaccaag 961 tccttccgac ccttcgttcc cctgcttgcc aatctctacc ccaacatgaa cttggagctc 1021 caagggacag tgaactcgga acaactggtg aacctcagca ccgagaatct gttagaggaa 1081 cccgagatgg atattgaggc cttggtggtc ctgcccagct ctgccaggga gcctgtcttc 1141 cggctgggtg tggccactaa tgtgtctgcc acactgacct tgaacaccag gaagatcact 1201 gggttcctga agccgggaag gctacaggtg gaactgaaag aatccaaagt cggaggattc 1261 aatgtggagc tgttggaagc tctcctcaac tactacattc tcaacaacct ctaccccaag 1321 gtcaatgaga agttggccca ccgcttcccg ctccctctgc tgaggcacat tcagctctac 1381 gacctgcttc tccagaccca cgagaacttc ctgctcgtgg gcgccaacat ccagtacagg 1441 agagtt // LOCUS RATUKATPA 1484 bp ss-mRNA ROD 28-JUL-1990 DEFINITION R.norvegicus gastric (H+,K+)-ATPase beta-subunit mRNA, complete cds. ACCESSION M35535 KEYWORDS (H+,K+)-ATPase beta-subunit. SOURCE R.norvegicus gastric mucosa oxyntic cell, cDNA to mRNA, clone RG4. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1484) AUTHORS Canfield,V.A., Okamoto,C.T., Chow,D., Dorfman,J., Gros,P., Forte,J.G. and Levenson,R. TITLE Cloning of the H,K-ATPase beta subunit: Tissue-specific expression, chromosomal assignment, and relationship to Na,K-ATPase beta subunits JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.A.Canfield, 21-JUN-1990. Author address: V.A.Canfield Yale University School of Medicine Dept. of Cell Biology P.O. Box 3333 New Haven, CT 06510 email: levenson@YALEMED FEATURES from to/span description pept 176 1060 gastric (H+,K+)-ATPase beta-subunit (E.C. 3.6.1.3) mRNA 1 1484 gastric (H+,K+)-ATPase beta-subunit BASE COUNT 363 a 457 c 362 g 302 t ORIGIN 1 ctgacttctg ggacagtgga ggacagatag cacgcaagcc ccagccctcc cttatgttta 61 tagaggcgat agcggagaac tgatagctgg ttctgatgcc tttggcctca cacagaggag 121 actataagcc ccagaggacg ctccctgggc ccagtccagg caagcaggag aggacatggc 181 agccctgcag gagaagaagt catgcagcca gcgcatggcc gaattccggc aatactgttg 241 gaacccggac actgggcaga tgctgggccg caccccagcc cggtgggtgt ggatcagcct 301 gtactatgca gctttctacg tggtcatgac tgggctcttt gccttgtgca tctatgtgct 361 gatgcagacc attgatccct acacccccga ctaccaggac cagttaaagt caccgggggt 421 aaccttgaga ccggatgtgt atggggaaag agggctgcag atttcctaca acatctctga 481 aaacagctcc tgggctggcc tcacacacac cctccacagc ttcttagcgg gctacacccc 541 agcatcccag caggacagca tcaactgttc gtctgaaaag tacttcttcc aggagacctt 601 ttctgctccg aaccatacca agttctcctg caagttcacg gcggacatgc tacagaattg 661 ctcaggcctg gtggacccca gtttcggctt tgaggaggga aagccctgct tcattattaa 721 aatgaacagg attgtcaagt tcctgcccag caacaacacg gctccccgag tggactgcac 781 cttccaggat gacccccaaa agccccggaa ggacattgaa cccctgcagg tccagtacta 841 tccccccaat ggtaccttca gtctccacta cttcccctac tacggcaaga aagcacagcc 901 ccactacagc aaccctctgg tggcggcaaa gttcctcaac gtccccaaaa acacgcaagt 961 cctcatcgtg tgcaagatca tggcggacca cgtgaccttc gacaaccccc acgaccccta 1021 tgaagggaag gtggagttca agctcacaat acagaagtaa ggagtaggcg tggctgtcca 1081 ccccagagcc tggtggaccc tgagggacca ctcttcctga ctgacatcat cggctggcca 1141 gcatgcacgg ccacttcatg gttcagagct gacaccactg cccatctgcc gacagcagga 1201 agtgctcctt cccagcactc cctgagcacc accagctttg aactgaaacc cgacgtgcgc 1261 acgcacgttt gcaatcccgt gcggttaaca caggaaccca gagtccggct accactaagg 1321 gacaacccat ctgtagggca tttctatcct gtgaccattt gtctgtcctg cactttgata 1381 tgaactatgg gtccacatca gtgtaacact ggtcaccccg gcctccagtt tgtgcttctg 1441 gggccacagc ccctaggtca ttaaaacaaa ctatagtaaa gtta // LOCUS YSCMYO2A 5675 bp ds-DNA PLN 28-JUL-1990 DEFINITION S.cerevisiae myosin-1 isoform (MYO2) gene, complete cds. ACCESSION M35532 KEYWORDS myosin-1. SOURCE S.cerevisiae (strain GRF88) DNA, clone 10-2B. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 5675) AUTHORS Johnston,G.C., Prendergast,J.A. and Singer,R.A. TITLE The S.cerevisiae MYO2 gene encodes an essential myosin for vectorial transport of vesicles JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.C.Johnston, 21-JUN-1990. Author address: G.C.Johnston Dalhousie University 7E Tupper Medical Bldg. Halifax, N.S. CANADA B3H 4H7 email: JOHNSTON@AC.DAL.CA FEATURES from to/span description pept 581 5305 myosin 1 isoform (MYO2) binding 138 171 ATP-binding site binding 443 523 actin-binding site rpt 926 981 heptad repeat rpt 1010 1086 heptad repeat BASE COUNT 1994 a 986 c 1110 g 1585 t ORIGIN Chromosome 15 right arm. 1 gatcaataaa taaataggct cgaagacgcc tcagaactcc ggtcactggt ttgtcttgtt 61 gatatacgat gtgccaagcg ccgtttctcg atgcttatct ggtttagttt acgctgttaa 121 aaccaaaacc ccaacagatt ttcgacccta acgtatgtag ggctaaaata gatattgagt 181 aggttacaat taattattgg caattgcacc tagtgacaca tttacgaaaa cgtagggcaa 241 aaactattac ccgacccagg gctattttgt gattttttcc ttttttttgt ttatgatcgc 301 gcttctcgaa aagccaaata tcagaaatcc caaacacgcc ttcatttgat acgattcgta 361 gcctgcgttt cagagatcta tcaactttgc aaggccaatc agagaacaaa aaagtctcgc 421 aaagtcattt cacttttctc gcttgaaatt attcgttcga tttctggctg cttgcttgtt 481 ttttgttttc taaggtacta ttcgacacca ttccattgga cagcgatact tataccattg 541 tacatatagg acataaaaac agcagatatt acagcgtata atgtcttttg aagtgggtac 601 acgatgctgg tatccccata aagaattggg ctggattggg gcggaagtaa tcaaaaatga 661 gttcaacgac ggcaagtacc acctggagtt acaattggaa gacgatgaaa tcgtgtccgt 721 ggacacaaaa gacttgaata acgataagga ccaatctcta ccgcttctta gaaaccctcc 781 cattttggaa gcaacggaag atttgacctc tttatcttac ttgaatgagc cagctgtttt 841 acatgccatc aaacagcgct attctcaatt gaatatctac acatactcgg gtattgttct 901 gattgctaca aacccttttg atcgtgtcga ccagctttat acacaagaca tgatccaagc 961 atatgcggga aagcgcagag gtgaactgga acctcacttg tttgccattg ccgaagaagc 1021 gtataggttg atgaaaaatg acaaacaaaa tcaaaccatt gtggtaagtg gtgaatctgg 1081 tgctggaaaa acggtttctg ccaagtatat tatgcgttat tttgcttctg tagaagagga 1141 aaattccgct actgtacaac atcaagtgga aatgtcggaa acagaacaaa agattctagc 1201 tacaaaccct atcatggaag catttggtaa tgctaagact accagaaatg acaattcttc 1261 cagatttggt aagtatctag aaattttatt cgataaggac acatctatta ttggagcaag 1321 gatccgcaca tacttgttgg aacggtccag attagtttac cagccgccaa ttgagagaaa 1381 ctaccacata ttttatcaat taatggctgg attaccagct caaaccaagg aggaattgca 1441 tcttaccgat gcctcagatt acttctacat gaaccaaggc ggtgacacca agatcaacgg 1501 tattgatgat gccaaagaat acaaaattac agtagatgca ttgacattag tcggaatcac 1561 caaggaaact caacaccaaa tatttaagat cttggccgca cttctgcata tcggtaacat 1621 agaaattaaa aaaactagaa atgatgcatc actatcagct gatgagccaa acctgaaact 1681 ggcgtgcgaa ttgctgggaa ttgatgccta caactttgcc aaatgggtca ccaaaaagca 1741 gatcattaca aggtcagaga aaattgtttc gaatctaaat tatagtcaag ctctggttgc 1801 caaagattcc gtggctaagt ttatttattc cgcccttttc gattggcttg tggaaaatat 1861 caacaccgtg ttatgcaacc cggctgtgaa cgaccaaatt agctcattta ttggtgttct 1921 ggatatttat gggtttgaac attttgaaaa aaattcattt gaacaatttt gtattaacta 1981 tgccaacgaa aaactacaac aagagttcaa ccaacatgtt ttcaaattag agcaagaaga 2041 atacgttaaa gaagaaattg aatggtcttt tatagagttt aatgataatc aaccttgtat 2101 tgatctgatt gaaaacaagt tgggtatttt atcactgctt gacgaagaaa gtaggttacc 2161 tgctggttcc gacgaatctt ggacccaaaa actttatcaa actttggata aatctcctac 2221 gaacaaagta ttttctaaac caagattcgg gcaaactaaa tttatcgtga gccattatgc 2281 tctagatgtc gcttatgatg tggaaggatt tattgaaaaa aatagagaca ccgtatctga 2341 cggacatttg gaagtgttga aggcttctac caacgagaca ctaataaata tcttagaggg 2401 attagaaaaa gctgccaaaa aactggaaga agcgaaaaag cttgaattag agcaggctgg 2461 cagtaaaaag ccaggtccga taagaacggt taacaggaaa cccactttag gttccatgtt 2521 taagcaatct ttgattgaac taatgaatac catcaactca actaatgttc attatattcg 2581 ttgtataaag cctaatgcag ataaagaagc ttggcaattt gataatttga tggtgttgtc 2641 tcaactcaga gcctgtggtg ttttggaaac tattagaata tcttgtgctg ggtttccttc 2701 taggtggact tttgaagaat ttgtattaag atattacatc ttgataccac atgagcagtg 2761 ggacctaatc ttcaaaaaaa aggaaactac agaagaagat atcatatcag tggttaaaat 2821 gatcctagat gctactgtaa aggacaaatc caagtaccag attggtaata caaaaatttt 2881 cttcaaagca ggtatgcttg catatctgga aaaacttaga agcaataaga tgcataattc 2941 aattgttatg atccagaaga aaattagagc taaatattac cgtaagcagt atttgcaaat 3001 atctcaggcc atcaagtatt tgcagaacaa catcaaaggt ttcatcattc gtcaacgcgt 3061 taatgatgaa atgaaagtta actgtgcaac tttattacag gccgcttaca ggggtcattc 3121 catccgtgcc aatgtgttca gcgtattgag aacaattaca aatttgcaaa agaaaattag 3181 aaaggaacta aaacaaagac aactgaaaca agaacatgaa tataatgctg cggtaactat 3241 tcaaagtaaa gttaggacct ttgagccgag atcgagattt ttacgcacta aaaaagacac 3301 tgttgttgtc caatctttga tcagaagaag agctgctcaa aggaaattga aacaattgaa 3361 ggcagacgct aaatcagtta atcatctgaa agaagtgagc tataaattag agaataaagt 3421 gattgaactg acgcagaatc tagcatccaa ggtcaaagaa aataaagaaa tgacagaaag 3481 aattaaagaa ctacaggttc aagtggaaga aagtgccaag ttacaagaga cattagaaaa 3541 tatgaaaaaa gagcacttaa tagatattga taatcagaaa tctaaggata tggaattaca 3601 aaaaactatt gagaacaatt tgcaatccac tgaacaaact ctaaaggacg ctcaattaga 3661 gttggaggac atggttaaac aacatgatga attgaaagaa gaatctaaaa agcaacttga 3721 agaattagag caaacaaaga aaacattggt tgaataccag acattaaacg gagacttgca 3781 aaacgaagtt aaatctttaa aggaagaaat tgctaggtta caaactgcca tgtcgctggg 3841 caccgttact actagtgtac tacctcaaac accattaaag gatgtaatgg gaggcggtgc 3901 ttcaaatttc aacaatatga tgcttgagaa ttccgactta tctcctaatg atttgaatct 3961 aaagtctaga tctactccat cgtccggaaa caaccacatt gattcattga gtgtcgatcg 4021 cgaaaatggt gtcaatgcta cacaaatcaa tgaagagtta tacaggttat tggaggacac 4081 tgaaattttg aatcaagaaa tcacggaagg cctgttaaag ggattcgaag taccggatgc 4141 tggtgtagct attcaactaa gtaaaagaga cgttgtttat ccggctagaa tactgattat 4201 agttttaagt gaaatgtgga gatttgggct gaccaagcaa agtgaaagct ttcttgccca 4261 agtattgact acaattcaaa aagttgtcac tcaattgaag ggtaacgatt taattccaag 4321 cggtgtattc tggttagcaa acgttagaga gttatactca tttgtggtgt ttgctctaaa 4381 ctctatttta accgaagaaa cgttcaaaaa cggcatgacc gatgaggagt ataaggagta 4441 tgtttcattg gtcacagaac taaaggatga tttcgaagct ctaagttata atatatataa 4501 catttggctg aagaaattgc agaagcaatt gcaaaaaaag gccatcaatg ctgtggtcat 4561 ctccgaatca ttaccaggtt tcagcgcggg agaaaccagc gggtttttga acaaaatttt 4621 tgctaacact gaagaatata caatggacga cattttgacc tttttcaaca gcatatactg 4681 gtgcatgaaa tcttttcata ttgagaatga agtgttccat gctgtagtca caaccttatt 4741 gaattatgtg gatgcaattt gttttaacga attaatcatg aaacgtaatt tcttgtcgtg 4801 gaaaaggggt cttcaattga actacaacgt tactagatta gaggaatggt gcaagacgca 4861 tggcttgaca gatggtactg agtgcttaca acatttgatt cagaccgcta agctactgca 4921 agtccgtaag tatactatcg aagacattga tatcttaaga ggaatttgtt attcgctaac 4981 acctgcacaa ttgcaaaaat tgatttcaca ataccaggtg gcagactatg agtctccaat 5041 tccacaggaa atcttaagat acgttgctga tatagttaag aaagaagctg cgttatcttc 5101 atcaggtaat gattctaagg gtcacgagca tagcagcagt atatttatca ctccagaaac 5161 aggtccattt actgacccat tcagtttgat aaagacaaga aaatttgacc aagtagaagc 5221 ctatatacca gcgtggttat ccttgccctc aactaagaga atagttgacc ttgttgccca 5281 acaagtcgtt caagacggcc actaaaactg atggcgcgag aaacaaaatt gtacatgaat 5341 gctaaaaaaa gaaatgacaa aaaaagagaa aaaaaaaaat gaaactacat agttaattaa 5401 taatagaagt atttgtcaat agtatgataa tgaaatcgat attatggaag atattaaccg 5461 cgcgccgtat tagtgtacac tatattaaac tacattttgc ttcttactga atttataaat 5521 tatgattata ttattattac tattatgact actgtatata tttttagaat tagatcggga 5581 accgatgagc gttagctgaa atggacgacg ataaggaacg ataattacca ctagtaaaat 5641 aataacaact aagaataaac acattctcat tttta // LOCUS PTUB256 205 bp ds-DNA SYN 28-JUL-1990 DEFINITION Synthetic pTUB256 alpha-amylase gene promoter region. ACCESSION M36663 KEYWORDS alpha-amylase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 205) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 164 > 205 synthetic alpha-amylase BASE COUNT 73 a 29 c 44 g 59 t ORIGIN 1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaagcgtga acaaaattct 61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga 121 ctccgaagta agtcttcaaa aaatcaaata aggagtgtca agaatgtttg caaaacgatt 181 caaaacctct ttactgccgt tattc // LOCUS PTUB261 232 bp ds-DNA SYN 28-JUL-1990 DEFINITION Synthetic pTUB261 alpha-amylase gene promoter region. ACCESSION M36664 KEYWORDS alpha-amylase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 232) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 191 > 232 synthetic alpha-amylase BASE COUNT 82 a 28 c 52 g 70 t ORIGIN 1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaagcgtga acaaaattct 61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga 121 ctcggggttg ttattatttt atcgatatgt aaaatataat ttctagaaga aaagaaggtg 181 gagaggaaac atgatccaaa aacgattcaa aacctcttta ctgccgttat tc // LOCUS PTUB263 232 bp ds-DNA SYN 28-JUL-1990 DEFINITION Synthetic pTUB263 alpha-amylase gene promoter region. ACCESSION M36665 KEYWORDS alpha-amylase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 232) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 191 > 232 synthetic alpha-amylase BASE COUNT 78 a 40 c 49 g 65 t ORIGIN 1 aagcactccc gcgatcgcct atttggcttt tccccaaaat gtaagcgtga acaaaattct 61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga 121 ctcggggttg ttattatttt atcgatatgt aaaatataat ttctagaaga aaagaaggtg 181 gagaggaaac atgatccaaa aacgattcaa aacctcttta ctgccgttat tc // LOCUS PTUB265 214 bp ds-DNA SYN 28-JUL-1990 DEFINITION Synthetic pTUB265 alpha-amylase gene promoter region. ACCESSION M36666 KEYWORDS alpha-amylase. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 214) AUTHORS Furusato,T., Takano,J.-i., Jigami,Y., Tanaka,H. and Yamane,K. TITLE Two tandemly located promoters, artificially constructed, are active in a Bacillus subtilis alpha-amylase secretion vector JOURNAL J. Biochem. 99, 1181-1190 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 173 > 214 synthetic alpha-amylase BASE COUNT 73 a 28 c 50 g 63 t ORIGIN 1 gccaagttgt tttgatagag tgattgtgat aatttaaaat gtaatcgtga acaaaattct 61 ccagtcttca catcagtttg aaaggaggaa gcggaagaat gaagtaagag ggatttttga 121 ctcggggttg ttattatttt atcgctagaa gaaaagaagg tggagaggaa acatgatcca 181 aaaacgattc aaaacctctt tactgccgtt attc // LOCUS RABMEPHA 1653 bp ss-mRNA MAM 28-JUL-1990 DEFINITION Rabbit microsomal epoxide hydrolase. ACCESSION M21496 KEYWORDS microsomal epoxide hydrolase. SOURCE Rabbit (New Zealand White) adult liver cDNA to mRNA, clone pEH. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 1653) AUTHORS Hassett,C., Turnblom,S.M., DeAngeles,A. and Omiecinski,C.J. TITLE Rabbit microsomal epoxide hydrolase: Isolation and characterization of the xenobiotic metabolizing enzyme cDNA JOURNAL Arch. Biochem. Biophys. 271, 380-389 (1989) STANDARD full staff_review COMMENT Draft entry and computer readable copy for sequence [1] kindly submitted by C.Hassett 12-JAN-1989. FEATURES from to/span description pept 148 1515 microsomal epoxide hydrolase (EC 3.3.2.3) BASE COUNT 351 a 505 c 465 g 332 t ORIGIN 1 cggcatccgc aaggacctgt acgccaacac ggtgctgtct cgcctctccc gcagctctgc 61 agtgtcgccg tgcgcagagt tccacagctc tgcttcccaa gcaggtgagc agaggctgac 121 aacacagcgc ccttgtggac aggagccatg ttgctggaac tccttctcgc ctcggtgctg 181 ggcttcgtca tctactggtt cgtctctgga gacaaggagg agagtctgcc actggaggat 241 gggtggtggg gcccggggtc gaggcccgta ggcctggagg acgagagcat ccggcccttc 301 aaggtggaga cgtcggacga agagatcaac gacttacacc agaggatcga caggatccgc 361 ttgaccccac ctttggagaa cagccgcttc cactacggct tcaactccaa ctacctgaag 421 aagatcctct cctactggag gcacgaattc gactggaaga agcaagtgga gattctgaac 481 tcataccctc acttcaagac caagatcgaa gggctggaca tccacttcat ccacgtgaag 541 cccccgcagg tgccccctgg ccgcacccca aagcccttgc tgatggtgca tggctggccc 601 ggctccttct tcgagttcta caaaatcatc ccgctgctga ctgaccccaa gagccacggc 661 ctgagcgatg agcacatctt tgaagtcatc tgcccttcca ttccaggcta tggcttctca 721 caggcatctt ccaagaaggg cttcaactcg gtgagcaccg ccaggatctt ctacaagctg 781 atgctgcggc tgggcttcca ggagttctac atccagggcg gggactgggg ggccctggtc 841 tgcacgaaca tggcccagct ggtgcccagc cacgtgaaag gtctgcactt gaacatggct 901 ttgattttaa gaaatcacta cactctgacc ctcctgctgg gacggcgcat cgggggactt 961 cttggctaca ctgagaggga catggagctg ctgtacccct tcaaggagaa ggtgttctac 1021 agtctgatga gggagagcgg ctacatgcac atccgggcca ccaagcccga cactgtgggc 1081 tgtgctctga atgactctcc tgtgggactg gctgcataca ttctagagaa attttccacc 1141 tggaccaact cagaattccg agacctggag gacggaggcc tggagaggaa gttctccctg 1201 caggacctgc tgaccaacat catgatctac tggaccactg gctccatcgt ctcctcccag 1261 cgctactaca aggagaacct gggccagggc ttcatggccc acaagcatga gcggctgaag 1321 gtccacgtgc ccacgggctt cgcagccttc ccgtgtgaga taatgcatgt gccagagaag 1381 tgggtgagga ccaagtaccc gcagctcatc tcctactcct acatgccccg cgggggccac 1441 ttcgccgcct tcgaggagcc ggagctgctg gcccgggaca tctgcaagtt cgtggggctg 1501 gtggagcggc agtgatgctc ccagccttgc ctggggtgag gggtcggctt gcctcctccc 1561 ctggcctgct ggaacccacc tcaggcctcc atactcactg tctcaccccc atggcgtggc 1621 tgataaatga tttgactccc aaaaaaaaaa aaa // LOCUS XELBETA 1138 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A1 mRNA, complete cds. ACCESSION M35359 KEYWORDS thyroid hormone receptor beta A1 protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1138) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 1 1110 thyroid hormone receptor beta A1 protein BASE COUNT 351 a 228 c 283 g 276 t ORIGIN 1 atggaagggt atatacccag ctacttggat aaagatgagc tatgcgtggt gtgtggagac 61 aaggctacag ggtatcatta tagatgtatc acctgcgagg gctgcaaggg cttttttaga 121 agaactattc agaagaacct ccacccaagc tattcttgta aatatgaagg aaaatgtgtt 181 atagacaaag taacaagaaa ccagtgccaa gaatgtcgct tcaaaaagtg catcgctgtt 241 ggaatggcaa cagacttggt tttggatgac aacaaacgtt tggcaaaaag aaagctcata 301 gaagaaaaca gagaaaaaag acggaaagat gagattcaga aatcacttgt tcagaaacct 361 gaacccacac aagaagaatg ggagttgata caagttgtca ctgaagcaca tgtggccacc 421 aacgcacagg gaagccactg gaaacagaaa agaaaatttt tgccagagga cattggacaa 481 gctcccatag ttaatgcgcc cgagggtgga aaagtggact tagaagcctt cagccagttt 541 acaaaaataa tcaccccagc aattacaaga gttgttgatt ttgccaaaaa gctacctatg 601 ttttgtgagc tgccatgtga agaccagatc atccttctta aaggctgttg tatggagatc 661 atgtcgctcc gagcagcagt gcgttatgac cccgaaagtg aaactctaac gttaaatggt 721 gagatggcag tgacaagggg gcagctaaaa aatggaggac ttggagtggt ttcagatgcc 781 atctttgact taggggtatc gctttcttca ttcagtcttg atgataccga agtcgccttg 841 ttgcaggctg tgctgcttat gtcatcagat cggcctggtc ttgctagcgt ggagagaata 901 gaaaagtgcc aggaaggttt cctcttggct tttgaacact acattaatta caggaaacat 961 aacattgcac acttttggcc aaaactgctg atgaaagtca ccgacctccg catgattgga 1021 gcgtgccacg ccagccggtt cctgcacatg aaggtggagt gccccactga actgtttccc 1081 ccactgttct tggaagtgtt tgaggactag aacagactgt gcttctggat tctcagca // LOCUS XELBETA1 259 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon A. ACCESSION M35345 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 1 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 259) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 63 a 55 c 88 g 53 t ORIGIN 1 aaattgggat ctatcctggg agagaatgga aatagacgac agcgctttat cctgactgaa 61 ctgaggcagg ggtaacgctg ggagtgactg gcatagcagg ggctgcgggg aggcacttca 121 gtccgtgcca agtccaacat tgtagctagt gacgagaatc gtactacagt gcgggctctc 181 actaagtgac gctcgaattc gggaagaacg acgcggcagc tgttgcatta tggtgcgtct 241 gtaggtcgga gagccggcg // LOCUS XELBETA2 97 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon B. ACCESSION M35346 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 2 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 97) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 14 a 32 c 24 g 27 t ORIGIN 1 atttcaggac agcccagcgc cctggtgcac gatcagctgt agatctccct gtctgtgtcg 61 ctgctgccgc tgctacttca gttcctctga ctgtcag // LOCUS XELBETA3 44 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon C. ACCESSION M35347 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 3 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 44) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 17 a 5 c 13 g 9 t ORIGIN 1 atgttgaaga ctgattgggg ttaagcaggc acatacaaga aaag // LOCUS XELBETA4 79 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon D. ACCESSION M35348 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 4 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 79) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 32 a 12 c 21 g 14 t ORIGIN 1 acagaagccg tgaaccaatg cagaattaca ggaaaggacg aggattgaaa catctgtaca 61 tgagaaggaa tttctgaag // LOCUS XELBETA5 72 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon E. ACCESSION M35349 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 5 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 72) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 19 a 15 c 20 g 18 t ORIGIN 1 ttaaagttga agtatttctg gtcaggtgat ctctgaggca gcgcacaggc cctcacaaaa 61 tggtggctca ag // LOCUS XELBETA6 46 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon F. ACCESSION M35350 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 6 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 46) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 17 a 11 c 8 g 10 t ORIGIN 1 gttcctctca agcccaggaa caaaaaccgg aaatttttca aatgag // LOCUS XELBETA7 64 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon G. ACCESSION M35351 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 7 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 64) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 46 > 64 thyroid hormone receptor beta A protein, exon G (first expressed exon) (alt.) BASE COUNT 20 a 13 c 14 g 17 t ORIGIN 1 gctatatgtg attcttagaa gaatgagcgg accttccaat ccataatgcc aagcagtatg 61 tcag // LOCUS XELBETA8 191 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A gene, exon H. ACCESSION M35352 KEYWORDS thyroid hormone receptor beta A protein. SEGMENT 8 of 8 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 191) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. A unique procedure for translation determination reveals that alternate translation initiation occurs at exons G and H of the beta A thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 185 > 191 thyroid hormone receptor beta A protein, exon H (first expressed exon) (alt.) BASE COUNT 57 a 31 c 37 g 66 t ORIGIN 1 gcagagtata tggtttagaa gaactaacac agaagttttt tgttggacac tactctccat 61 aatgacaatg agatttccat tgtaacatcc taattgtaac cagtaatcag agatgctgct 121 tggacagtgc ttacagcttt tttaaagaga ttttttattt ttgctttgca tcgaaccgtg 181 tactatggaa g // LOCUS XELBETAB 1150 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta A5 mRNA, complete cds. ACCESSION M35360 KEYWORDS thyroid hormone receptor beta A5 protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1150) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 1 1122 thyroid hormone receptor beta A5 protein BASE COUNT 354 a 232 c 285 g 279 t ORIGIN 1 atgccaagca gtatgtcagg gtatataccc agctacttgg ataaagatga gctatgcgtg 61 gtgtgtggag acaaggctac agggtatcat tatagatgta tcacctgcga gggctgcaag 121 ggctttttta gaagaactat tcagaagaac ctccacccaa gctattcttg taaatatgaa 181 ggaaaatgtg ttatagacaa agtaacaaga aaccagtgcc aagaatgtcg cttcaaaaag 241 tgcatcgctg ttggaatggc aacagacttg gttttggatg acaacaaacg tttggcaaaa 301 agaaagctca tagaagaaaa cagagaaaaa agacggaaag atgagattca gaaatcactt 361 gttcagaaac ctgaacccac acaagaagaa tgggagttga tacaagttgt cactgaagca 421 catgtggcca ccaacgcaca gggaagccac tggaaacaga aaagaaaatt tttgccagag 481 gacattggac aagctcccat agttaatgcg cccgagggtg gaaaagtgga cttagaagcc 541 ttcagccagt ttacaaaaat aatcacccca gcaattacaa gagttgttga ttttgccaaa 601 aagctaccta tgttttgtga gctgccatgt gaagaccaga tcatccttct taaaggctgt 661 tgtatggaga tcatgtcgct ccgagcagca gtgcgttatg accccgaaag tgaaactcta 721 acgttaaatg gtgagatggc agtgacaagg gggcagctaa aaaatggagg acttggagtg 781 gtttcagatg ccatctttga cttaggggta tcgctttctt cattcagtct tgatgatacc 841 gaagtcgcct tgttgcaggc tgtgctgctt atgtcatcag atcggcctgg tcttgctagc 901 gtggagagaa tagaaaagtg ccaggaaggt ttcctcttgg cttttgaaca ctacattaat 961 tacaggaaac ataacattgc acacttttgg ccaaaactgc tgatgaaagt caccgacctc 1021 cgcatgattg gagcgtgcca cgccagccgg ttcctgcaca tgaaggtgga gtgccccact 1081 gaactgtttc ccccactgtt cttggaagtg tttgaggact agaacagact gtgcttctgg 1141 attctcagca // LOCUS XELBETAC 1132 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B1 mRNA, complete cds. ACCESSION M35361 KEYWORDS thyroid hormone receptor beta B1 protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1132) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 1 1122 thyroid hormone receptor beta B1 protein BASE COUNT 352 a 233 c 284 g 263 t ORIGIN 1 atgccaagca gtatgtcagg gtacataccc agctacttgg ataaagatga gctatgtgtg 61 gtatgtggag acaaagctac agggtatcac tatagatgta tcacctgcga gggctgcaag 121 ggctttttta gaagaactat tcagaagaac ctccacccaa gctattcctg taaatatgaa 181 ggaaaatgtg ttatagacaa agtaacaagg aaccagtgcc aagaatgtcg cttcaaaaag 241 tgcaaaactg ttggaatggc aacagacttg gttttggatg acagcaaacg tttggcgaaa 301 agaaagctca tagaagaaaa cagagaaaaa agacggaaag acgagataca gaaatcaatt 361 gttcagagac cggaaccaac acaagaagaa tgggagttga tacaagttgt cactgaagca 421 catgtggcca ccaacgcaca gggaagccac tggaaacaga aaagaaaatt tttgccagag 481 gacattggac aagctcccat agttaatgcg cctgaaggtg gaaaagtgga cttagaagcc 541 ttcagccagt ttacaaaaat aatcacccca gcaattacaa gagtggttga ttttgccaaa 601 aagctaccta tgttttgtga gctgccatgt gaagaccaga tcatccttct taaaggctgt 661 tgtatggaga tcatgtccct ccgagcagcc gtgcggtatg accccgaaag tgaaactcta 721 acgctgaatg gggagatggc agtgacaagg gggcagctaa aaaatggagg actcggtgtg 781 gtctcagatg ccatctttga cttgggggtg tcgctttctt cattcagtct tgatgatacc 841 gaagtcgcct tgttgcaggc tgtgctgctt atgtcatcag atcgtcctgg tctctctagt 901 gtggagagaa tagaaaagtg ccaggaaggt ttcctcttgg cttttgaaca ctacattaat 961 tacaggaaac acaacattgc acacttttgg ccaaaactgc tgatgaaagt caccgacctc 1021 cgcatgatcg gagcatgcca cgccagccgg ttcctgcaca tgaaggtgga gtgccccact 1081 gaactgtttc ccccactgtt cttggaagtg tttgaggact agaacagact gt // LOCUS XELBETAD 1255 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B2 mRNA, complete cds. ACCESSION M35362 KEYWORDS thyroid hormone receptor beta B2 protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1255) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 1 1245 thyroid hormone receptor beta B2 protein BASE COUNT 383 a 258 c 323 g 291 t ORIGIN 1 atgccaagca gtatgtcagt tcggcttttc actgcatctg ccgcacaaag aaagaagata 61 caggaagggg attgctgtgt ggtgctcgct ggaaaaaccc agggccggtt tatattgata 121 ggagcagtgg cccgggtatc agggtacata cccagctact tggataaaga tgagctatgt 181 gtggtatgtg gagacaaagc tacagggtat cactatagat gtatcacctg cgagggctgc 241 aagggctttt ttagaagaac tattcagaag aacctccacc caagctattc ctgtaaatat 301 gaaggaaaat gtgttataga caaagtaaca aggaaccagt gccaagaatg tcgcttcaaa 361 aagtgcaaaa ctgttggaat ggcaacagac ttggttttgg atgacagcaa acgtttggcg 421 aaaagaaagc tcatagaaga aaacagagaa aaaagacgga aagacgagat acagaaatca 481 attgttcaga gaccggaacc aacacaagaa gaatgggagt tgatacaagt tgtcactgaa 541 gcacatgtgg ccaccaacgc acagggaagc cactggaaac agaaaagaaa atttttgcca 601 gaggacattg gacaagctcc catagttaat gcgcctgaag gtggaaaagt ggacttagaa 661 gccttcagcc agtttacaaa aataatcacc ccagcaatta caagagtggt tgattttgcc 721 aaaaagctac ctatgttttg tgagctgcca tgtgaagacc agatcatcct tcttaaaggc 781 tgttgtatgg agatcatgtc cctccgagca gccgtgcggt atgaccccga aagtgaaact 841 ctaacgctga atggggagat ggcagtgaca agggggcagc taaaaaatgg aggactcggt 901 gtggtctcag atgccatctt tgacttgggg gtgtcgcttt cttcattcag tcttgatgat 961 accgaagtcg ccttgttgca ggctgtgctg cttatgtcat cagatcgtcc tggtctctct 1021 agtgtggaga gaatagaaaa gtgccaggaa ggtttcctct tggcttttga acactacatt 1081 aattacagga aacacaacat tgcacacttt tggccaaaac tgctgatgaa agtcaccgac 1141 ctccgcatga tcggagcatg ccacgccagc cggttcctgc acatgaaggt ggagtgcccc 1201 actgaactgt ttcccccact gttcttggaa gtgtttgagg actagaacag actgt // LOCUS XELBETB1 226 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon A. ACCESSION M35353 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 1 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 226) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 57 a 50 c 73 g 46 t ORIGIN 1 agcttcatta tcctgactga acacaagcag ggataacgct gggagtgact ggcatagcag 61 gggctgcagg gaggcacttc ataatccgtg ccaaatccaa cgttgtagcg agtgacgaga 121 atcgtagagt gcgcggaaca gtctcacgga cgctggggtt tgggaaggac gacgcggcag 181 ctgttgcact acgttacgtc taactctata ggttggagag ctgacg // LOCUS XELBETB2 65 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon B. ACCESSION M35354 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 2 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 65) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 7 a 19 c 16 g 23 t ORIGIN 1 agctgtagat ctcctgtctg tgttgctgcc actgctgttg ctgctccagt tcctctgact 61 gtcag // LOCUS XELBETB3 50 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon C. ACCESSION M35355 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 3 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 50) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 19 a 5 c 15 g 11 t ORIGIN 1 atgttgaaga gtgattgggg ttaagcaggc acatactgta caagaaaaag // LOCUS XELBETB4 67 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon F. ACCESSION M35356 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 4 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 67) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 27 a 14 c 12 g 14 t ORIGIN 1 ctacaggttt ccctcaagca ccaagaacga aaaccagaaa gaatttgcag agaatttttc 61 aaatgag // LOCUS XELBETB5 64 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon G. ACCESSION M35357 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 5 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 64) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 46 > 64 thyroid hormone receptor beta B gene BASE COUNT 21 a 12 c 14 g 17 t ORIGIN 1 gttatatgtg atgcttagaa gaatgagcag accttccaat ccataatgcc aagcagtatg 61 tcag // LOCUS XELBETB6 123 bp ds-DNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor beta B gene, exon H. ACCESSION M35358 KEYWORDS thyroid hormone receptor beta B protein. SEGMENT 6 of 6 SOURCE X.laevis DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 123) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Exons D, E and H do not exist for the beta B thyroid hormone receptor protein. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 BASE COUNT 31 a 25 c 39 g 28 t ORIGIN 1 ttcggctttt cactgcatct gccgcacaaa gaaagaagat acaggaaggg gattgctgtg 61 tggtgctcgc tggaaaaacc cagggccggt ttatattgat aggagcagtg gcccgggtat 121 cag // LOCUS XELTHYA 1406 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor alpha A mRNA, complete cds. ACCESSION M35343 KEYWORDS thyroid hormone receptor protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1406) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 122 1378 thyroid hormone receptor protein BASE COUNT 346 a 350 c 406 g 304 t ORIGIN 1 gtcgacctgt gagaggcgtc cgcccgcctc catgtgaacg ctacgcccca tgatcctcgg 61 ggagctgggg gcggagcccg ccttggtctc ttcggattgg ttctggatgg aattacgttg 121 aatggaccag aatctcagcg ggctggactg cttgtcagag ccagatgaaa aaaggtggcc 181 ggatgggaag cgaaaaagaa agaacagcca atgtatggga aaaagcggca tgtccggtga 241 cagcttggtg tctctgccct ctgcagggta catccccagc tatctggaca aagatgagcc 301 atgcgtggtg tgcagtgata aggccacggg gtaccactac cgctgtatca cttgcgaggg 361 gtgtaagggt ttctttcgcc gcaccatcca gaagaacctg cacccctcct actcgtgcaa 421 gtacgatggc tgctgcatta tcgacaagat cacccgaaat cagtgccagc tctgccgctt 481 caagaaatgc attgccgttg gcatggcaat ggatcttgtc ctggatgatg gcaagcgggt 541 agccaagcga aaactgattg aagagaatcg acagcggcgg cggaaggagg agatgatcaa 601 gactctgcaa cagcgtcccg agccaagcag cgaggagtgg gagttgattc gcattgtaac 661 agaagctcac aggagtacca atgctcaggg cagccactgg aaacagcgta ggaagtttct 721 gccggaagat atcgggcagt ctcccatggc ttccatgccg gatggggata aagttgacct 781 ggaagctttc agtgagttca ccaagataat caccccggca attaccagag tggtggactt 841 tgccaagaag ctgcccatgt tctctgagct gacttgtgaa gaccagatca tcctgttgaa 901 aggatgttgt atggagatca tgtctctccg tgctgctgta cgctacgatc cagacagcga 961 gaccctaacg ctgagcggag agatggctgt gaaacgggag cagcttaaga acggaggtct 1021 gggtgttgtc tctgatgcca tctttgacct cgggaggtcg cttgctgcgt ttaaccttga 1081 cgatacggaa gtggcgctgc tgcaggctgt tttgctaatg tcatcagacc gaactggttt 1141 aatctgcacg gacaagatag agaaatgtca agagacctac cttctcgcct ttgaacacta 1201 catcaaccat cgcaaacaca acattcccca cttctggccc aaactcctaa tgaaggtgac 1261 ggacctgcgc atgatagggg catgccatgc cagccgcttt ctgcacatga aggtcgagtg 1321 ccccaccgag ctctttccac cgctcttcct tgaggtcttt gaggaccagg aagtttgagg 1381 gacagtgcat gtcggtagag aggaaa // LOCUS XELTHYB 1406 bp ss-mRNA VRT 28-JUL-1990 DEFINITION X.laevis thyroid hormone receptor alpha B mRNA, complete cds. ACCESSION M35344 KEYWORDS thyroid hormone receptor protein. SOURCE X.laevis, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1406) AUTHORS Yaoita,Y., Shi,Y.-B. and Brown,D.D. TITLE The Xenopus laevis alpha and beta thyroid hormone receptors JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Yaoita, 15-JUN-1990. Author address:Yoshio Yaoita Carnegie Inst of Washington 115 West University Parkway Baltimore, MD 21210 FEATURES from to/span description pept 122 1378 thyroid hormone receptor protein BASE COUNT 350 a 347 c 402 g 307 t ORIGIN 1 gtcgacctgt gagaggcgcc cgcccgcctc catgtgaaag ccacgcccca tgagccttgg 61 gcagctgggg gcggagccca ccttggtctc ttcggattgg ttctggatgg aattacgttg 121 aatggaccag aatctcagcg ggctggactg cttgtcagag ccagatgaaa aaaggtggcc 181 ggatgggaag cgaaaaagaa agaacagcca atgtatggga aaaagcggca tgtccggtga 241 cagcttggtg tctctgcccc ctgcagggta catccccagc tatctggaca aagatgagcc 301 atgcgtggtg tgcagtgata aggccacggg gtaccactac cgctgtatca cttgcgaggg 361 gtgcaagggt ttcttccgcc gcaccatcca gaagaacctg cacccctcct attcttgcaa 421 gtacgatggc tgctgcatta tcgacaaaat cacccgtaat cagtgccagc tctgccgctt 481 caagaaatgc attgccgttg gcatggcaat ggatcttgac ctggatgata gcaagcgggt 541 agccaagcga aaactgattg aagaggatcg agtgcggcgg cggaaggagg agatgatcaa 601 gactctgcaa cagtgtcccg agccaagcag cgaggagtgg gagttgattc gcattgtaac 661 agaagctcac aggagtacca atgcccaggg cagccattgg aaacagcgta ggaagtttct 721 gccagaagac atcggacagt ctcctatggc ttccatgcca gatggggata aagttgacct 781 ggaagctttc agtgagttca ccaaaataat caccccggca attaccagag tggtggactt 841 tgcgaagaag ctgcccatgt tctctgagct gacttgtgaa gaccagatca tcctgttgaa 901 aggatgttgt atggagatca tgtctcttcg tgctgctgtg cgctacgatc cagacagcga 961 gaccctaacg ctgagcggcg agatggcggt gaaacgggag cagcttaaga acggaggtct 1021 gggtgttgtc tctgatgcca tctttgacct tgggaggtcg cttgctgcgt tcaaccttga 1081 tgatacggaa gtggcactgt tgcaggctgt tttgctaatg tcatcagacc gtactggttt 1141 aatctgcaca gacaagatag agaaatgtca agagacctac cttctcgcct ttgaacacta 1201 catcaaccat cgcaaacaca acattcccca cttctggccc aagctcctaa tgaaggtgac 1261 ggacctgcgc atgatagggg catgccatgc cagctgcttt ctgcacatga aggtcgagtg 1321 ccccaccgag ctctttccac cgctcttcct tgaggtcttt gaggaccagg aagtttgagg 1381 gacagtgcat gtcggtagag aggaaa // LOCUS RATSIMPA1 205 bp ds-DNA ROD 28-JUL-1990 DEFINITION Rat simple sequence DNA, clone 5. ACCESSION M36626 KEYWORDS simple sequence DNA. SEGMENT 1 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 205) AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L. TITLE Simple sequences of the rat genome detected by hybridization with adenovirus DNA JOURNAL Dokl. Biochem. 276, 189-193 (1984) STANDARD simple staff_entry BASE COUNT 52 a 53 c 79 g 21 t ORIGIN 1 cagctctgtc ctgttgtcgc ccttgggcag agttgtgcct cctgctcttc tttccctaag 61 gaggggcagc agcagcagca gcaggaggag caggaggagc agcagcagga gcagcaggag 121 cagcagcagc agcaggagga gcaggagcag cagcaggagc agcagcagca ggagcagcag 181 cagcagcagg agcaggagga gcagc // LOCUS RATSIMPA2 146 bp ds-DNA ROD 28-JUL-1990 DEFINITION Rat simple sequence DNA, clone 5. ACCESSION M32514 KEYWORDS simple sequence DNA. SEGMENT 2 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 146) AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L. TITLE Simple sequences of the rat genome detected by hybridization with adenovirus DNA JOURNAL Dokl. Biochem. 276, 189-193 (1984) STANDARD simple staff_entry BASE COUNT 46 a 36 c 61 g 3 t ORIGIN About 500 base pairs after segment 1. 1 agcagcagca gcaggagcag caggaggagc agcaggagca ggagcagcag gagcagcagc 61 aggagcagga gcaggagcag caggagcagc aggagcagca gcaggagcag cagcagcagc 121 agcagcagca gcggtgcagc tccatg // LOCUS RATSIMPB 380 bp ds-DNA ROD 28-JUL-1990 DEFINITION Rat simple sequence DNA, clone 8. ACCESSION M32515 KEYWORDS simple sequence DNA. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 380) AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L. TITLE Simple sequences of the rat genome detected by hybridization with adenovirus DNA JOURNAL Dokl. Biochem. 276, 189-193 (1984) STANDARD simple staff_entry BASE COUNT 131 a 96 c 138 g 15 t ORIGIN 1 tgatcattgc tgcaatccca cagcaggagc agcagcagga gcagcagcag cagcaggagc 61 aggagcagcc acaggaggag cagcaacaag aggcagcagc agcagcagga gcagcagcag 121 caggaggagc agcaacagga gcagcagcaa caggagcagc agcaggaaca gaacaggagc 181 agcagcagca ggaacaagga gtagcagcag cagcagcagg aacaggagaa gcagcagcag 241 cagcagcagc aggagcagga gcagcaggag cagcagcagc agcagtagga gcagcagcag 301 cagcaggagc agcagcagca gcagcaggag gagcagcagc agcagcagca cagcagcagg 361 gtacttggtg atcccttgac // LOCUS RATSIMPC 542 bp ds-DNA ROD 28-JUL-1990 DEFINITION Rat simple sequence DNA, clone 16. ACCESSION M36627 KEYWORDS simple sequence DNA. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 542) AUTHORS Ivanova,M.N., Frolova,E.I. and Georgiev,G.L. TITLE Simple sequences of the rat genome detected by hybridization with adenovirus DNA JOURNAL Dokl. Biochem. 276, 189-193 (1984) STANDARD simple staff_entry BASE COUNT 76 a 255 c 142 g 69 t ORIGIN 1 ggatccaccg cctgagtagc cgccgccaca gctagagccg cctccacctc caccgccgtc 61 ggagtagccg cctccgcagc tggagccacc gccgccgccg ccgccggagt acttgccccc 121 ttcggaccgc cgccgcgacc accgggctgc cgctccagag gagcctccgc agtaggagcc 181 gccgcctcct gattcgtctc ctatagttgg agcctccgcc tccgtcggag tcgccgccgc 241 cgccgtagcc ggagccgccg ccgccgccgc ccgcctccgg agtaccttga cgccgccgcc 301 gccgccgccg gagtacttcg cccctccgga ccgccgccgc gaccagagaa ctgacgcccc 361 ctccggagcc gcctccgccg ccgcagctgg aaccacctcc ataggaacca ccgcctccgc 421 ctccgcctcc gcagccagag cctcctccag atgagccacc tccgcagctg ggagcctcca 481 ccgctaccac caccgctata gtaaccgcca ccgccgcctc ctcctccacc agaggtcttt 541 tc // LOCUS RATPSTIAA 2382 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Rat pancreatic secretory trypsin inhibitor-like protein (PSTI) mRNA, complete cds. ACCESSION M35299 KEYWORDS monitor protein; pancreatic secretory trypsin inhibitor-like protein. SOURCE Rat (strain Wistar) adult pancreas, cDNA to mRNA, clone MP2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 427; 594 to 1693; 1954 to 2338) AUTHORS Fukuoka,S.-I. and Scheele,G. TITLE Rapid and selective cloning of monitor peptide, a novel CCK-releasing peptide, using minimal amino acid sequence and the polymerase chain reaction (PCR) JOURNAL Pancreas 4, 1-7 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2382) AUTHORS Fukuoka,S.-I. and Scheele,G. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.-I.Fukuoka, 19-JUN-1990. FEATURES from to/span description pept 10 249 pancreatic secretory trypsin inhibitor-like protein precursor sigp 10 63 pancreatic secretory trypsin inhibitor-like protein signal peptide matp 64 246 pancreatic secretory trypsin inhibitor-like protein signal 1665 1670 poly-A signal signal 2064 2069 poly-A signal signal 2303 2308 poly-A signal BASE COUNT 770 a 423 c 517 g 672 t ORIGIN 1 tctacaacca tgaaggtagc aattatcttt cttctcagtg ctttggccct gctcagttta 61 gcaggtaacc ctccagctga ggtgaatgga aaaacgccta attgccctaa gcaaattatg 121 ggatgtccca ggatttatga ccctgtgtgt gggactaacg gaattactta ccccagtgaa 181 tgcagtctgt gctttgaaaa caggaaattc ggaacatcta tccacattca gaggagaggg 241 acttgctgaa tgtcctgatt ttgaaatctt ttagggctac cataatgttt agcaagaagg 301 tttgctgaat aaatgcatct gaacatattt tgttcttccc aaagcttttg ctcaaaggca 361 tatatgagta tattgagaat agggatctga gaagaaaacc agagtagagc aagctttacc 421 acttagttct tcatgctcat acttcaaaaa ttgcagatga tgacaacaca tagttgagca 481 tgaacatgtg taatgaatag agtttgggtt aggatgaaga aggtagccta tctgtgcaca 541 agaaagaagt agactgactt ggatctttct taggggagtt taccaaagga aagactgcct 601 tgtatatcta cagtgtttca cttgtgagac accacaactc tgcagattta ctcttgttct 661 gtgaggaaac ttagaagagt caaattgttt gactaatagt ccaacataca tgatgccagg 721 gtgttctttt agatcaagct gacctcttcc ttcatccata tgagcactcc ttcttttaac 781 cacaatcttc tcttgtggat catgccttga ctttcttcaa tgggaatcct agataatatt 841 ccctactgta agatcttgca tgtctatatt cagtgataga atatagacgt gatataatag 901 gatataacca aatgaattag aaacaaggaa atattctcaa aagggaaagt atcaacaact 961 acttttaaaa aaggaatcat tttaagatcc tgagtttcta aagaaaatct tagtctaaga 1021 tggaaagaga gtaaagagct aacacaggtg agtctgggca aggaacccta gtacagtggg 1081 gttgggtcag cacctttgcc agaaataacc aagctattca gaaatacact aggaaaggag 1141 agttgcctag taacccactt ctggtcatat tcagtattca tgccttgaac tgaactcttg 1201 ctcctagagg atgctataac taacaaaccg agcaacttaa acagcctgac agctctcacc 1261 aaataccttg ctatctcaag ttatggatgc aagatggctc ccagtgtcta tctgtgattc 1321 tagaggacac ttgaagggca ccaacactta acaaattctg tgggggtaaa tttattttaa 1381 tcactggatg ctggaagaca cacacagaga cacaaacaca caaagagaga cagagagaga 1441 gaaagagaga gagagaggta gagagagaga gagagagaga gagggagaga gagggagaga 1501 gagagtgttt tgggttttgt tgttgttgtt gttgttgatt tggaattata tcaagatata 1561 agataatctc aaatgtatct ttagtagttc tgctccctgg acccatgaga agacaggaat 1621 gaggattctg tgcatgtggt acttacattt caaaaggagt atctaataaa ctggaaactg 1681 cttaaaagaa tgagactatc agcactgata agaatataaa gcttcaagct atgaagagtg 1741 attcaaagaa ggaaaagaat tccctcagaa ctgggaggac cttttaaaaa attctgagtc 1801 cccgtttcta aagtttcacc ttcctaactt catgtatttt ttaatagctc aaagagtcca 1861 attactgctg ctcatatact catgagtgtg acaccatgca ctgttactgc caatatatga 1921 aaggccatac ccctaaagaa aattgactta agaactcctt gtttagggtt gggtacttct 1981 gtgaccctcc cacattcatg ctggaatgtt gactggcttc atttttataa ggcaaaagat 2041 cttcccactc tcttctgaga gagaataaat cagttttgct caatggagtg attctgagta 2101 tactaatcac gatcccagga caggccccat tctcacaagc agttagctaa cacaaataga 2161 actccatatt ttatagcagt ttttatcttt tgttcttggt tttagttctt attttcaaga 2221 cagagaaaaa cacatgaagt tggaagggta gaagtggggg ggggcgtggg tctgggagga 2281 gttgggggat agagaaaaat ataataaaaa tatatgaaat tctcgagaat gaataaatgg 2341 aattcgatat caagcttatc gataccgtcg acctcgaggg gg // LOCUS RATPSTIBA 300 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Rat pancreatic secretory trypsin inhibitor-like protein (PSTI) mRNA, 3' end. ACCESSION M35300 KEYWORDS monitor protein; pancreatic secretory trypsin inhibitor-like protein. SOURCE Rat (strain Wistar) adult pancreas, cDNA to mRNA, clone MP3. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Fukuoka,S.-I. and Scheele,G. TITLE Rapid and selective cloning of monitor peptide, a novel CCK-releasing peptide, using minimal amino acid sequence and the polymerase chain reaction (PCR) JOURNAL Pancreas 4, 1-7 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.-I.Fukuoka, 19-JUN-1990. FEATURES from to/span description pept < 1 116 pancreatic secretory trypsin inhibitor-like protein precursor (AA at 3) matp < 1 113 pancreatic secretory trypsin inhibitor-like protein signal 285 290 poly-A signal BASE COUNT 98 a 57 c 65 g 80 t ORIGIN 1 gtcccaggat ttatgaccct gtgtgtggga ctaacggaat tacttacccc agtgaatgca 61 gtctgtgctt tgaaaacagg aaattcggaa catctatcca cattcagagg agatagagcg 121 tctgcaaaaa cagatcgaac ggcataagaa gaagattaat acctaaagaa tagtgaggca 181 ttgagtgcac acagtcagtc tctcacatag tggcagtatc attcccactc ttatagagat 241 tgttttgaat gattgatgtt tgaccatgtg tgctactaac agataataaa ttatcaccag // LOCUS SYNTRPA 3763 bp ds-DNA circular SYN 28-JUL-1990 DEFINITION Cloning vector pATH3 propagated in E.coli. ACCESSION M33622 KEYWORDS trpE' protein. SOURCE Synthetic DNA, clone pATH3. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3763) AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A. TITLE High-expression vectors with multiple cloning sites for construction of trpE-fusion genes: pATH vectors JOURNAL Meth. Enzymol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hill, 06-APR-1990. Nucleotides 1-147 are provided as a personal communication from R.P.Gunsalus at the Dept. of Microbiology at UCLA. Construction of pATH3: 1. PvuII-HindIII fragment from the 5' end of the trp operon (through nt 1999 of ECOTGP, which is in trpD cds) was ligated to the HindIII-PvuII fragment of pBR322 containing the bla (= Amp-resistance) gene and origin of replication, but not the rop gene, which encodes a negative regulator of ColE1 replication. In addition, the EcoRI site in the pBR322 backbone was eliminated. This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141 (1984)) 2. The BglII-HindIII fragment (nt 1392 of trpE to the end of the trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI fragment and an EcoRI-HindIII fragment, both from the MCS of M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more details). 3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and the remaining plasmid religated. This produced plasmid pATH2 (GenBank acc M33624) 4. An EcoRI linker was inserted at the remaining SmaI site of pATH2 replacing the SmaI site and changing the reading frames of the other sites in the MCS. This plasmid is pATH3. FEATURES from to/span description pept 423 > 1391 trpE' fusion protein BASE COUNT 926 a 942 c 946 g 949 t ORIGIN 1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg 61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt 121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg 181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc 241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt 301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag 361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa 421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca 481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat 541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc 601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc 661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact 721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct 781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag 841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag 901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa 961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc 1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga 1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc 1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag 1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg 1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg 1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc 1381 agattgagat ccccccgaat tcggggggat cctctagagt cgacctgcag cccaagctta 1441 tcgatgataa gctgtcaaac atgagaatta attcttgaag acgaaagggc ctcgtgatac 1501 gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt 1561 ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt 1621 atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta 1681 tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg 1741 tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac 1801 gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg 1861 aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc 1921 gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg 1981 ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat 2041 gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg 2101 gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg 2161 atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc 2221 ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt 2281 cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct 2341 cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc 2401 gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca 2461 cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct 2521 cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt 2581 taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 2641 ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 2701 aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 2761 caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 2821 taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 2881 gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 2941 cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 3001 taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 3061 agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc 3121 ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 3181 gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 3241 acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 3301 acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 3361 tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 3421 ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag 3481 agcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt 3541 gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc 3601 gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg 3661 acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg 3721 catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg agg // LOCUS SYNTRPB 3771 bp ds-DNA circular SYN 28-JUL-1990 DEFINITION Cloning vector pATH10, propagated in E.coli. ACCESSION M33623 KEYWORDS beta-lactamase; trpE' protein. SOURCE Synthetic DNA, clone pATH10. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3771) AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A. TITLE High-expression vectors with multiple cloning sites for construction of trpE-fusion genes path vectors JOURNAL Meth. Enzymol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hill 06-APR-1990. Nucleotides 1-147 are provided as a personal communication from R.P.Gunsalus at the Dept. of Microbiology at UCLA. Construction of pATH10: 1. PvuII-HindIII fragment from the 5' end of the trp operon (through nt 1999 of ECOTGP, which is in the trpD cds) was ligated to the HindIII-PvuII fragment of pBR322 containing the bla (= Amp-resistance) gene and origin of replication, but not the rop gene, which encodes a negative regulator of ColE1 replication. In addition, the EcoRI site in the pBR322 backbone was eliminated. This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141 (1984)) 2. The BglII-HindIII fragment (nt 423 of trpE to the end of the trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI fragment and an EcoRI-HindIII fragment, both from the MCS of M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more details). 3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and the remaining plasmid religated producing plasmid pATH2 (GenBank acc M33624). 4. An interim vector was constructed by inserting an EcoRI linker at the remaining SmaI site of pATH2. 5. The EcoRI-HindIII fragment of MCS in this interim vector was replaced with the EcoRI-HindIII fragment containing the MCS of M13mp12. 6. Tha AvaII-AvaII fragment that spanned the PstI site in the bla gene of this interim vector was replaced with the corresponding AvaII fragment from pUC8, eliminating this PstI site, making the PstI site in the MCS unique. This is plasmid pATH10. FEATURES from to/span description pept 423 1472 trpE' protein pept 1688 2548 beta-lactamase BASE COUNT 927 a 945 c 948 g 951 t ORIGIN 1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg 61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt 121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg 181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc 241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt 301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag 361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa 421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca 481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat 541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc 601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc 661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact 721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct 781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag 841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag 901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa 961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc 1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga 1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc 1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag 1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg 1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg 1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc 1381 agattgagat cccccggaat tcgagctcgc ccggggatcc tctagagtcg acctgcagcc 1441 caagcttatc gatgataagc tgtcaaacat gagaattaat tcttgaagac gaaagggcct 1501 cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg 1561 tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc 1621 aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag 1681 gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg 1741 ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt 1801 gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt 1861 tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt 1921 attatcccgt gttgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa 1981 tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag 2041 agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac 2101 aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac 2161 tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac 2221 cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac 2281 tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact 2341 tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg 2401 tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt 2461 tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat 2521 aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta 2581 gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa 2641 tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga 2701 aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 2761 aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt 2821 tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc 2881 gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat 2941 cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag 3001 acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc 3061 cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc attgagaaag 3121 cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac 3181 aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg 3241 gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct 3301 atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc 3361 tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga 3421 gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga 3481 agcggaagag cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg 3541 catatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agtatacact 3601 ccgctatcgc tacgtgactg ggtcatggct gcgccccgac acccgccaac acccgctgac 3661 gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc 3721 gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag g // LOCUS SYNTRPC 3753 bp ds-DNA SYN 28-JUL-1990 DEFINITION Cloning vector pATH2, propagated in E.coli. ACCESSION M33624 KEYWORDS beta-lactamase; trpE' protein. SOURCE Synthetic DNA, clone pATH2. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3753) AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A. TITLE High-expression vectors with multiple cloning sites for construction of trpe-fusion genes path vectors JOURNAL Meth. Enzymol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hill 06-APR-1990. Nucleotides 1-147 are provided as a personal communication from R.P.Gunsalus at the Dept. of Microbiology at UCLA. Construction of pATH2: 1. PvuII-HindIII fragment from the 5' end of the trp operon (through nt 1999 of ECOTGP, which is in trpD cds) was ligated to the HindIII-PvuII fragment of pBR322 containing the bla (= Amp-resistance) gene and origin of replication, but not the rop gene, which encodes a negative regulator of ColE1 replication. In addition, the EcoRI site in the pBR322 backbone was eliminated. This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141 (1984)) 2. The BglII-HindIII fragment (nt 1392 of trpE to the end of the trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI fragment and an EcoRI-HindIII fragment, both from the MCS of M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more details). 3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and the remaining plasmid religated. This produced plasmid pATH2 FEATURES from to/span description pept 423 1454 trpE' protein pept 1670 2530 beta-lactamase BASE COUNT 924 a 939 c 943 g 947 t ORIGIN 1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg 61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt 121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg 181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc 241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt 301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag 361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa 421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca 481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat 541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc 601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc 661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact 721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct 781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag 841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag 901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa 961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc 1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga 1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc 1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag 1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg 1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg 1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc 1381 agattgagat ccccggggat cctctagagt cgacctgcag cccaagctta tcgatgataa 1441 gctgtcaaac atgagaatta attcttgaag acgaaagggc ctcgtgatac gcctattttt 1501 ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa 1561 tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat 1621 gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca 1681 acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca 1741 cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta 1801 catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt 1861 tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtgttgacgc 1921 cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc 1981 accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc 2041 cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa 2101 ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga 2161 accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgcagcaat 2221 ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca 2281 attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc 2341 ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat 2401 tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag 2461 tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa 2521 gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca 2581 tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc 2641 ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc 2701 ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc 2761 agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt 2821 cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt 2881 caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc 2941 tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa 3001 ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac 3061 ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc ttcccgaagg 3121 gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga 3181 gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact 3241 tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa 3301 cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc 3361 gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg 3421 ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcctgat 3481 gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag 3541 tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc gctacgtgac 3601 tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg acgggcttgt 3661 ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag 3721 aggttttcac cgtcatcacc gaaacgcgcg agg // LOCUS SYNTRPD 3772 bp ds-DNA SYN 28-JUL-1990 DEFINITION Cloning vector pATH11, propagated in E.coli. ACCESSION M33625 KEYWORDS beta-lactamase; trpE' protein. SOURCE Synthetic DNA, clone pATH11. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3772) AUTHORS Koerner,T.J., Hill,J.E., Myers,A.M. and Tzagoloff,A. TITLE High-expression vectors with multiple cloning sites for construction of trpe-fusion genes path vectors JOURNAL Meth. Enzymol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.E.Hill 06-APR-1990. Nucleotides 1-144 are provided as a personal communication from R.P.Gunsalus at the Dept. of Microbiology at UCLA. Construction of pATH11: 1. PvuII-HindIII fragment from the 5' end of the trp operon (through nt 1999 of ECOTGP, which is in the trpD cds) was ligated to the HindIII-PvuII fragment of pBR322 containing the bla (= Amp-resistance) gene and origin of replication, but not the rop gene, which encodes a negative regulator of ColE1 replication. In addition, the EcoRI site in the pBR322 backbone was eliminated. This plasmid is pKRS101. (Spindler et al. M. Virol. 49, 132-141 (1984)) 2. The BglII-HindIII fragment (nt 423 of trpE to the end of the trpD sequence present in pKRS101) was replaced with a BamHI-EcoRI fragment and an EcoRI-HindIII fragment, both from the MCS of M13mp12. This plasmid is pATH1 (see GenBank acc M32985 for more details). 3. The SmaI-SmaI fragment from the MCS of pATH1 was deleted and the remaining plasmid religated. This produced plasmid pATH2 (GenBank acc M33624). 4. An interim vector was constructed by inserting an EcoRI linker at the remaining SmaI site of pATH2. 5. The EcoRI-HindIII fragment of MCS in this interim vector was replaced with the EcoRI-HindIII fragment containing the MCS of M13mp12. 6. Tha AvaII-AvaII fragment that spanned the PstI site in the bla gene of this interim vector was replaced with the corresponding AvaII fragment from pUC8, eliminating this PstI site, making the PstI site in the MCS unique. This is plasmid pATH11. FEATURES from to/span description pept 423 1487 trpE' protein pept 1689 2549 beta-lactamase BASE COUNT 927 a 946 c 948 g 951 t ORIGIN 1 cagctgtggt gtcatggtcg gtgatcgcta gggtgccgag cgcatctcga ctgcacggtg 61 caccaatgct tctggcgtca ggtagttatt ggaaagctgt ggtatggctg tgcaggtcgt 121 aaatcactgc ataactcgct gctgcctaag gcgcactccc gttctggata atgttttttg 181 cgccgacatc ataacggttc tggcaaatat tctgaaatga gctgttgaca attaatcatc 241 gaactagtta actagtacgc aagttcacgt aaaaagggta tcgacaatga aagcaatttt 301 cgtactgaaa ggttggtggc gcacttcctg aaacgggcag tgtattcacc atgcgtaaag 361 caatcagata cccagcccgc ctaatgagcg ggcttttttt tgaacaaaat tagagaataa 421 caatgcaaac acaaaaaccg actctcgaac tgctaacctg cgaaggcgct tatcgcgaca 481 atcccaccgc gctttttcac cagttgtgtg gggatcgtcc ggcaacgctg ctgctggaat 541 ccgcagatat cgacagcaaa gatgatttaa aaagcctgct gctggtagac agtgcgctgc 601 gcattacagc tttaggtgac actgtcacaa tccaggcact ttccggcaac ggcgaagccc 661 tcctggcact actggataac gccctgcctg cgggtgtgga aagtgaacaa tcaccaaact 721 gccgtgtgct gcgcttcccc cctgtcagtc cactgctgga tgaagacgcc cgcttatgct 781 ccctttcggt ttttgacgct ttccgtttat tgcagaatct gttgaatgta ccgaaggaag 841 aacgagaagc catgttcttc agcggcctgt tctcttatga ccttgtggcg ggatttgaag 901 atttaccgca actgtcagcg gaaaataact gccctgattt ctgtttttat ctcgctgaaa 961 cgctgatggt gattgaccat cagaaaaaaa gcacccgtat tcaggccagc ctgtttgctc 1021 cgaatgaaga agaaaaacaa cgtctcactg ctcgcctgaa cgaactacgt cagcaactga 1081 ccgaagccgc gccgccgctg ccagtggttt ccgtgccgca tatgcgttgt gaatgtaatc 1141 agagcgatga agagttcggt ggcgtagtgc gtttgttgca aaaagcgatt cgcgctggag 1201 aaattttcca ggtggtgcca tctcgccgtt tctctctgcc ctgcccgtca ccgctggcgg 1261 cctattacgt gctgaaaaag agtaatccca gcccgtacat gttttttatg caggataatg 1321 atttcaccct atttggcgcg tcgccggaaa gctcgctcaa gtatgatgcc accagccgcc 1381 agattgagat ccccccggaa ttcgagctcg cccggggatc ctctagagtc gacctgcagc 1441 ccaagcttat cgatgataag ctgtcaaaca tgagaattaa ttcttgaaga cgaaagggcc 1501 tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag 1561 gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt 1621 caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa 1681 ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt 1741 gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt 1801 tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt 1861 ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg 1921 tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga 1981 atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa 2041 gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga 2101 caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa 2161 ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca 2221 ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta 2281 ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac 2341 ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc 2401 gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag 2461 ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga 2521 taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt 2581 agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata 2641 atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag 2701 aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa 2761 caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt 2821 ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc 2881 cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa 2941 tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa 3001 gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc 3061 ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag cattgagaaa 3121 gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa 3181 caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg 3241 ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc 3301 tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg 3361 ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg 3421 agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg 3481 aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc 3541 gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagtatacac 3601 tccgctatcg ctacgtgact gggtcatggc tgcgccccga cacccgccaa cacccgctga 3661 cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 3721 cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gg // LOCUS ECOTRMF 77 bp ss-tRNA RNA 28-JUL-1990 DEFINITION E. coli initiator Met-tRNA-f. ACCESSION K00305 M25117 KEYWORDS transfer RNA; transfer RNA-Met. SOURCE E. coli (strain CA265) tRNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 77) AUTHORS Dube,S.K. and Marcker,K.A. TITLE The nucleotide sequence of N-formyl-methionyl-transfer RNA: Partial digestion with pancreatic and T-1 ribonuclease and derivation of the total primary structure JOURNAL Eur. J. Biochem. 8, 256-262 (1969) STANDARD full staff_review REFERENCE 2 (bases 1 to 77) AUTHORS Uemura,H., Imai,M., Ohtsuka,E., Ikehara,M. and Soell,D. TITLE E. coli initiator tRNA analogs with different nucleotides in the discriminator base position JOURNAL Nucleic Acids Res. 10, 6531-6539 (1982) STANDARD full staff_review REFERENCE 3 (sites) AUTHORS Dahlberg,J.E., Kintner,C. and Lund,E. TITLE Specific binding of tRNA-Met-f to 23S rRNA of Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 1071-1075 (1978) STANDARD simple staff_entry COMMENT [1] Contributed on tape April 1983 by M.Sprinzl & D.H.Gauss from their entry 1310 in Nucleic Acids Res. 11, r1-r54 (1983). [1] notes that there may be either another minor Met-tRNA-f or a modification of this sequence, because there is a small amount of an oligonucleotide which shows that base 47 is adenosine instead of m7g. [2] generated all possible substitutions at the fourth base up from the 3' end (position 74); all four variants accepted methionine in in-vitro aminoacylation reactions, implying that the "discriminator hypothesis" is incorrect. FEATURES from to/span description tRNA 1 77 Met-tRNA (NAR: 1310) modified 8 8 s4u modified 21 21 d modified 33 33 cm anticdn 35 37 Met-tRNA-f anticodon cat modified 47 47 m7g modified 55 55 t modified 56 56 f BASE COUNT 14 a 26 c 25 g 12 t ORIGIN 5' end of mature Met-tRNA-f. 1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggtcg tcggttcaaa 61 tccggccccc gcaacca // LOCUS MCPRNA3A 80 bp ss-RNA VRL 28-JUL-1990 DEFINITION Cowpea mosaic virus M RNA 3' terminal sequence. ACCESSION M25438 KEYWORDS . SOURCE Cowpea mosaic virus RNA. ORGANISM Cowpea mosaic virus Viridae; ss-RNA nonenveloped viruses; Comoviridae. REFERENCE 1 (bases 1 to 80) AUTHORS Davies,J.W., Stanley,J. and Van Kammen,A. TITLE Sequence homology adjacent to the 3' terminal poly(A) of cowpea mosaic virus RNAs JOURNAL Nucleic Acids Res. 7, 493-500 (1979) STANDARD simple staff_entry BASE COUNT 22 a 13 c 13 g 32 t ORIGIN 1 tatgaattta atttcttttg tgagctcctg tttagcaggt cgtcccttca gcaaggacac 61 aaaaagattt taattttatt // LOCUS MCPRNA3B 80 bp ss-RNA VRL 28-JUL-1990 DEFINITION Cowpea mosaic virus B RNA 3' terminal sequence. ACCESSION M25439 KEYWORDS . SOURCE Cowpea mosaic virus RNA. ORGANISM Cowpea mosaic virus Viridae; ss-RNA nonenveloped viruses; Comoviridae. REFERENCE 1 (bases 1 to 80) AUTHORS Davies,J.W., Stanley,J. and Van Kammen,A. TITLE Sequence homology adjacent to the 3' terminal poly(A) of cowpea mosaic virus RNAs JOURNAL Nucleic Acids Res. 7, 493-500 (1979) STANDARD simple staff_entry BASE COUNT 22 a 12 c 13 g 33 t ORIGIN 1 taaataatgc ttatgttttt gtttgctcct gtttagcagg tcgttccttc agcaagaaca 61 acaaaaatat gtgttttatt // LOCUS PPCCGAAA 5306 bp ds-DNA VRL 28-JUL-1990 DEFINITION Hamster papovavirus complete genome. ACCESSION M26281 KEYWORDS complete genome. SOURCE Hamster papovavirus DNA. ORGANISM Hamster papovavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 5306) AUTHORS Delmas,V., Bastien,C., Scherneck,S. and Feunteun,J. TITLE A new member of the polyomavirus family: The hamster papovavirus. Complete nucleotide sequence and transformation properties JOURNAL EMBO J. 4, 1279-1286 (1985) STANDARD simple staff_entry FEATURES from to/span description ORF 192 839 early proteins polyprotein (T antigens) ORF 5083 4046 (c) VP2 ORF 4711 4046 (c) VP2 ORF 4045 2927 (c) VP1 BASE COUNT 1595 a 1124 c 1080 g 1507 t ORIGIN 1 ccccttgcct ccttagctct caagtagaaa aggaagagag gcttttgggg ctttttggct 61 ttaagcctca ttttatgagc aggaggagct tgttgcaact tgagaggcgt tttgaggctt 121 ccaggcagag aatactcaca gaccccacac agtctagacg ctcagaagca tctctagctg 181 caacaagcaa gatggataga attcttacta aagaagaaaa gcaagcctta ataagtttac 241 tagatttgga gccacaatat tggggagact atggacgaat gcagaaatgc tacaagaaaa 301 agtgtcttca actgcatcct gataaaggtg gcaatgaaga gctcatgcaa cagcttaata 361 ccctgtggac caaactaaaa gatggtcttt acagagttag gctgttactt gggcctagtc 421 aggtaagaag acttggaaaa gatcagtgga atttatcttt acagcaaaca ttttctggta 481 cctactttag gaggctctgc agactcccca ttacctgcct aagaaacaag ggaattagta 541 cctgcaattg catactttgt ttgctcagaa aacagcattt tctgctaaag aagtcctgga 601 gagtaccttg cctggtgtta ggagaatgct actgcataga ctgctttgcc ttatggtttg 661 gcctgccagt taccaatatg ctggttccat tatatgcaca atttcttgct ccaatacctg 721 tggattggct tgatctgaat gttcatgagg tctacaatcc ggcctcaggt atgtatgaat 781 atggggggct tatagttgta actgtacaag tttaaaatgt gcttttttca ggaccctaat 841 gcttccacct ccaccagcag acccggagag ttctacaatc ctgacacagg aggatactgg 901 tcctactctt atgggtcagc aggatactct gaccagcaga agaaatactg ggaagagttt 961 ttctctaagt gggatgttaa tgaggacctc acctgccaag aagagttatc atcatcagaa 1021 gatgaattca ccccctggca tcccaatccc cccccctccc ctgtttctat ttccagtgac 1081 agctccagtt cctcctgtga cgaggaatac ccaagaaact caagcagaaa gagaaaacga 1141 gtacatgcca atggctcccc aaatacacct atacagccaa ataagagagc ccacacacca 1201 ggaggaggaa gaaccacaat acgaggagat accgatatac ctagaactcc tgccagagaa 1261 tcccaatcaa catttggctc ttacttcaac agcacggagg agcttgagga ggaaatatca 1321 caaacacaac agtcacatca taacacaacg ccaaagaaac cgcctccgac ggttagtcct 1381 gatgattttc ctactatcct tagggggttt ctttctcacg ctattttttc taataaaacg 1441 caaaatgcat ttataatcta cagtactaag gaaaaatgtg aagtacttta tgaacaaata 1501 gacaaatata atccagacta taaaggtatc ttcattatga aacaaacaga agcatttgta 1561 atgtttatga ctcctggaaa acatagagta gctgcagtta aaagttactg ttgtaaattt 1621 tgtaccgtta gcttcctgct atgcaaagct gttacaaaac cgttagagtt gtataactgt 1681 gtggctaaat gtgatgactt tcaaatttta aaagaaaata agcctggtct atatcatttt 1741 gaattctgtg atgaaaaaaa agaggtgaag caaatagact ggaatttcct aacatctttt 1801 gcagttgaaa atgagttaga tgatcctctt gtaattatgg gacattatct agaatttagt 1861 cagtgtgaaa gctcttgcaa aaagtgtgca gaagctttac caaggatgaa agtccactgg 1921 gctaaccaca gtcagcactt agagaatgct gagcttttct tacactgcaa acaacagaaa 1981 agtatctgtc agcaagcagc agataatgtt ctggcaagga gaagattaaa ggtccttgaa 2041 tcaacaagac aagaattgtt ggcagagaga ctgaacaaac tgttagacca attaaaagat 2101 ttatctcctg tagataagca tttatatctt gctggagtag cctggtacca atgtatgttt 2161 cctgattttg agatgatgtt attagatatt ttaaaattgt ttactgaaaa tgttccaaaa 2221 aaaagaaatg tactttttag aggtcctgta aattcaggga aaactagcct tgctgcagct 2281 atcatgaatc ttgtaggagg agttgccctc aatgttaatt gtcctgcaga taagctcaac 2341 tttgaacttg gtgttgctat agataaattt gcagtagtct ttgaagatgt caaaggacaa 2401 accggagata agagacacct acagtctgga cttggaatta ataaccttga taacctgaga 2461 gattaccttg atggaagtgt aaaggttaat ttagaaaaga agcatgtaaa taagaggtcc 2521 cagatatttc ctccttgtat tgttactgct aatgaatatt tttttcctca aacactctat 2581 gccagattcc ataaagttta taactttgaa gtgaaggatt ttcttgccaa gagccttgag 2641 gaaaacagtt acatggggag acatagagtc tgtcaaagtc cacttacaat gctgatagca 2701 ttgctttgga atgtacccac tgaaaatttt gataagtctc tcaaagagaa ggtggaaaca 2761 gaaaagaagg ttttgtctga tatgtgtaac tttactacat ttgcagaaat gtgtctcaat 2821 attcagaggg gtgctgatcc ccttgaggca ttgtaattga ggaggaaaca ataattgatg 2881 aataaagcat ttattagaag ctctgtgtac agtcattttt caagcattag tttgctggtt 2941 ttgcaggggg tttagtatgc tgttggccat acttgtcaat gaacctattc acatctgggt 3001 caccaggaac agcctctgta ccctcataaa tcctgacttc ttctacctga gcagcttctc 3061 cttccatggg ctggccttca attgttggaa gcatattgtt gtacaaagaa gctagcaagc 3121 ttgtaactgg gtaaggattt ttcacccatc tttttctcaa ggtcacatta aaatatctag 3181 gcagccccct ccaatgccag cctgcactgt tgtattctat gtaccagccc ataacatctg 3241 ctgcactgag ataaagccca tctcctttgc aaagaggccc aaccccattt tcatccagaa 3301 gcacagtagt caaggtatta gtaaactgca tcactggtgg agtaccagta ccacctgtga 3361 ggtacctacc atccttgtcc aattttgctt ttgcagtagg gtccagcacc tggtttgtgg 3421 aagtcattgc tttgccagta acagttttga tactaacaat agctgcctca taatttgcat 3481 tatagttctg cactaggcct tgcaaatcta atggttctcc tcccactgca aacatgtggt 3541 aagttgtacc ctcaactggt ttggaaattc caatatcctt tgtctcactt ctggagccat 3601 atccatgcac atttagaagg gatcccactc caacaacttc agtttttaca gatacagcct 3661 cccacatttg aagggtatca caggtcaaat cttcattcag tgttggaagc tgtattttag 3721 ccatactgta atatggcagt tgattagcct tcacttcatc agcagtaagg gagctattta 3781 ctttaatact ctgggagaac ccataatact ggccatcagt tcctgtgcca ggcttgttct 3841 gacccattct aggattaagg taggcctcaa tttgtgtgat actgtcttct cctgttacaa 3901 gatcaagcac acccacacca ccccgcataa taagcttggg aacattagca ggctttggac 3961 agggtttcca caggggtttg cacatctact ggaagcgccg ctttttcttt ttggggccat 4021 actcaacctc atcaatgtat gtctgccaag taggactaat gtctccgtac aatcctagaa 4081 ttaaaggaag catccaatca ggtgtcactc tttggtgggc tccaccagga gcaaaatacc 4141 tcatgatatt tgcccctgat tcaaaccaac tagaagagtc ttcctgctgt tgacttcttc 4201 tctggacatc aggtctccct agttcagctt ctaatacttg tctgctattg gcatcttcaa 4261 tagaaggtct attactgtat tctaaagctc tttctatttg tcttctttga gctggattaa 4321 ttcctggaag ttctgcatag tagttttgta ggccaccata tattctacta taggcctctc 4381 taggtaaatt agtaacaacc catctactat tttccatcat tctggcaatg gcatctaaaa 4441 attgatgggt ggtctgcaaa cttaagtccc tcacagcaga ttctacagct ccttgagttt 4501 cccttctcaa agtatcccag atatactctc ctacagattg aaataatgag tggccccagc 4561 catgaataac atccaaggca tgggtaaatg actgtacacc agggaataat atatcatagt 4621 agtcagctgg tctcctggga ataagtgcca tgtttctatt cacaatcggt acttcgtgag 4681 caagatagcc gtgtagactt cccaaagaga aggctgctga acctgcaaca gtttgaaaaa 4741 taaatgctgt ctggactgat tctctcacaa actcagtcat tactgttgat gttaactcag 4801 gggcagcttg cataaatata aacatgtctt cacttaggcc aattgaagac aaagctgtct 4861 cagctcctaa aaacccctcc atagttatta atgaagtaac ttgggcatct atcgcggcaa 4921 aggcttctcc actaagtatg gcctctactg aaattccagt aactgatgaa atttcggaga 4981 ggtagctgat catctcaata atcactgaaa tggcagatcc catgttgact tacttgaaca 5041 gtttgaaaat cttctgaact gtttcaggca ggtttttagg ccgaattcta aagaaacaga 5101 aagcaaacac tcagcgccga agagcaggaa atggctgacc actgcacttg ggcgacacga 5161 cacgcctagc gataaggaag tcaccatggc aacataaccg cagcactgct gttgtcacag 5221 ttgcctagca aatgacagac tcagcaacca caggagagga aatgataggg ctagcatttt 5281 ttcaaatgta aaccagaggc tagggg // LOCUS RATGST2YB 500 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Rat liver glutathione S-transferase Ya subunit mRNA. ACCESSION M26874 KEYWORDS S-transferase; glutathione S-transferase; ligandin; transferase. SOURCE Rat liver cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 500) AUTHORS Daniel,V., Sarid,S., Bar-Nun,S. and Litwack,G. TITLE Rat ligandin mRNA molecular cloning and sequencing JOURNAL Arch. Biochem. Biophys. 227, 266-271 (1983) STANDARD simple staff_entry FEATURES from to/span description pept < 1 402 glutathione S-transferase Ya subunit (AA at 1) BASE COUNT 153 a 112 c 109 g 126 t ORIGIN 1 gccctgattg acatgtattc agagggtatt ttagatctga ctgaaatgat tatccaattg 61 gtaatatgtc ccccagacca aagagaagcc aagaccgcct tggcaaaaga caggaccaaa 121 aaccggtact tgcctgcctt tgaaaaggtg ttgaagagcc atggccaaga ctaccttgta 181 ggtaacaggc tgacccgggt agacatccac ctgctggaac ttctcctcta tgttgaagag 241 tttgatgcca gccttctgac ctctttccct ctgctgaagg ccttcaagag cagaatcagc 301 agcctcccca atgtgaagaa gttcctgcag cctggcagtc agagaaagct tcccgtggat 361 gcaaaacaaa tcgaagaagc aaggaagatt ttcaagtttt agcggagctg cactatccaa 421 tttctttatg ttttgcaaaa aatgagaagc aattgttgat cctaggtatt tttgaaataa 481 taaacacgaa aaaatactct // LOCUS CPARBCSL 528 bp ds-DNA PLN 28-JUL-1990 DEFINITION C.paradoxa ribulose-1,5-bisphosphate carboxylase/oxygenase large (rbcL) and small (rbcS) subunits, 3' end and complete cds. ACCESSION M35728 KEYWORDS ribulose-1,5-bisphosphate carboxylase/oxygenase. SOURCE C.paradoxa DNA. ORGANISM Cyanophora paradoxa Eukaryota; Plantae; Thallobionta; Chromophycota; Cryptophyceae; Cryptomonadales; Kathablepharidaceae. REFERENCE 1 (bases 1 to 528) AUTHORS Starnes,S.M., Lambert,D.H., Maxwell,E.S., Stevens,S.E.Jr., Porter,R.D. and Shively,J.M. TITLE Cotranscription of the large and small subunit genes of ribulose- 1,5-bisphosphate carboxylase/oxygenase in Cyanophora paradoxa JOURNAL FEMS Microbiol. Lett. 28, 165-169 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 18 ribulose-1,5-bisphosphate carboxylase/oxygenase large subunit (rbcL) pept 124 444 ribulose-1,5-bisphosphate (AA at 1) carboxylase/oxygenase small subunit (rbcS) BASE COUNT 189 a 85 c 60 g 194 t ORIGIN 1 actattgata ctatctaata tcatttaatt tatttaatta tttagagttt aaaactctaa 61 ataattaatc aaaatgatat tacttcaatc tatttttacc ttaaaattcg gaattataaa 121 taaatgcaac ttagagtaga acgtaagttc gaaacttttt cttatttacc accattaaac 181 gaccaacaga ttgcgcgtca attacaatac gcactttcca atggttatag cccagcaatc 241 gaattcagtt ttacaggtaa agctgaagac ttagtatgga ctttatggaa attaccttta 301 tttggtgcac aatctcctga agaagtactt agcgaaattc aagcttgtaa acaacagttc 361 cctaatgctt acattcgtgt tgtagcattt gactctatca gacaagttca aactttaatg 421 ttcttagttt acaaaccatt atagtttaat tgatatctac tctaattgat agatatcaat 481 ttttaattaa tctacaaaac aaaattatct aattattatt aatacttt // LOCUS HUMCFIX 873 bp ss-mRNA PRI 28-JUL-1990 DEFINITION Human coagulation factor IX mRNA, partial cds. ACCESSION M35672 KEYWORDS coagulation factor IX; serine protease. SOURCE Human adult liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 873) AUTHORS Jagadeeswaran,P., Lavelle,D.E., Kaul,R., Mohandas,T. and Warren,S.T. TITLE Isolation and characteriztion of human factor IX cDNA: Identification of Taq I polymorphism and regional assignment JOURNAL Somat. Cell Mol. Genet. 10, 465-473 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 873 coagulation factor IX (AA at 1) BASE COUNT 279 a 146 c 205 g 243 t ORIGIN 1 aacgccaaca aaattctgaa tcggccaaag aggtataatt caggtaaatt ggaagagttt 61 gttcaaggga accttgagag agaatgtatg gaagaaaagt gtagttttga agaagcacga 121 gaagtttttg aaaacactga aagaacaact gaattttgga agcagtatgt tgatggagat 181 cagtgtgagt ccaatccatg tttaaatggc ggcagttgca aggatgacat taattcctat 241 gaatgttggt gtccctttgg atttgaagga aagaactgtg aattagatgt aacatgtaac 301 attaagaatg gcagatgcga gcagttttgt aaaaatagtg ctgataacaa ggtggtttgc 361 tcctgtactg agggatatcg acttgcagaa aaccagaagt cctgtgaacc agcagtgcca 421 tttccatgtg gaagagtttc tgtttcacaa acttctaagc tcacccgtgc tgagactgtt 481 tttcctgatg tggactatgt aaattctact gaagctgaaa ccattttgga taacatcact 541 caaagcaccc aatcatttaa tgacttcact cgggttgttg gtggagaaga tgccaaacca 601 ggtcaattcc cttggcaggt tgttttgaat ggtaaagttg atgcattctg tggaggctct 661 atcgttaatg aaaaatggat tgtaactgct gcccactgtg ttgaaactgg tgttaaaatt 721 acagttgtcg caggtgaaca taatattgag gagacagaac atacagagca aaagcgaaat 781 gtgattcgaa ttattcctca ccacaactac aatgcagcta ttaataagta caaccatgac 841 attgcccttc tggaactgga cgaaccctta gtg // LOCUS HUMMHDRBPV 292 bp ds-DNA PRI 28-JUL-1990 DEFINITION Human MHC class II HLA-DR-beta-I allele gene, partial cds. ACCESSION M35651 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human (Pemphigus vulgaris patient, haplotype DR4 Dw10) blood DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 292) AUTHORS Scharf,S.J., Long,C.M. and Erlich,H.A. TITLE Sequence analysis of the HLA-Dr-beta and HLA-DQ-beta loci from three Pemphigus vulgaris patients JOURNAL Hum. Immunol. 22, 61-69 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 26 > 292 HLA-DR-beta, exon 2 (AA at 26) BASE COUNT 64 a 74 c 101 g 53 t ORIGIN 1 ccggatcctt cgtgtcccca gaccacgttt cttggagcag gttaaacatg agtgtcattt 61 cttcaacggg acggagcggg tgcggttcct ggacagatac ttctatcacc aagaggagta 121 cgtgcgcttc gacagcgacg tgggggagta ccgggcggtg acggagctgg ggcggcctga 181 tgccgagtac tggaacagcc agaaggacat cctggaagac gagcgggccg cggtggacac 241 ctactgcaga cacaactacg gggttgtgga gagcttcaca gtgcagcggc ga // LOCUS MUSC3B 647 bp ss-mRNA ROD 28-JUL-1990 DEFINITION Mouse complement component 3 (C3) mRNA, partial cds. ACCESSION M35659 KEYWORDS complement component 3. SOURCE Mouse liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 647) AUTHORS Fey,G., Domdey,H., Wiebauer,K., Whitehead,A.S. and Odink,K. TITLE Structure and expression of the C3 gene JOURNAL Springer Semin. Immunopathol. 6, 119-147 (1983) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 647 complement 3 (AA at 1) BASE COUNT 170 a 171 c 168 g 138 t ORIGIN 1 atccccatgt attccatcat tactcccaat gtcctacggc tggagagcga agagaccatc 61 gtactggagg cccacgatgc tcagggtgac atcccagtca cagtcactgt gcaagacttc 121 ctaaagaggc aagtgctgac cagtgagaag acagtgttga caggagccag tggacatctg 181 agaagcgtct ccatcaagat tccagccagt aaggaattca actcagataa ggaggggcac 241 aagtacgtga cagtggtggc aaacttcggg gaaacggtgg tggagaaagc agtgatggta 301 agcttccaga gtgggtacct cttcatccag acagaccaga ccatctacac ccccggctcc 361 actgtcttat atcggatctt cactgtggac aacaacctac tgcccgtggg caagacagtc 421 gtcatcctca ttgagacccc cgatggcatt cctgtcaaga gagacattct gtcttccaac 481 aaccaacacg gcatcttgcc tttgtcttgg aacattcctg aactggtcaa catggggcag 541 tggaagatcc gagcctttta cgaacatgcg ccgaagcaga tcttctccgc agagtttgag 601 gtgaaggaat acgtgctgcc cagttttgag gtccgggtgg agcccac // LOCUS P30LTA 777 bp ds-DNA BCT 28-JUL-1990 DEFINITION Plasmid P307 (from E.coli) heat-labile enterotoxin subunit A (LTA) gene, complete cds. ACCESSION M35581 KEYWORDS enterotoxin. SOURCE Plasmid P307 (from Escherichia coli) DNA, clone pAT153. ORGANISM Plasmid P307 Unclassified. REFERENCE 1 (bases 1 to 777) AUTHORS Dykes,C.W., Halliday,I.J., Hobden,A.N., Read,M.J. and Harford,S. TITLE A comparison of the nucleotide sequence of the A subunit of heat- labile enterotoxin and cholera toxin JOURNAL FEMS Microbiol. Lett. 26, 171-174 (1985) STANDARD simple staff_review FEATURES from to/span description pept 1 777 heat-labile enterotoxin subunit A (LTA) BASE COUNT 255 a 136 c 164 g 222 t ORIGIN 1 atgaaaaata taactttcat tttttttatt ttattagcat cgccattata tgcaaatggc 61 gacagattat accgtgctga ctctagaccc ccagatgaaa taaaacgttc cggaggtctt 121 atgcccagag ggcataatga gtacttcgat agaggaactc aaatgaatat taatctttat 181 gatcacgcga gaggaacaca aaccggcttt gtcagatatg atgacggata tgtttccact 241 tctcttagtt tgagaagtgc tcacttagca ggacagtcta tattatcagg atattccact 301 tactatatat atgttatagc gacagcacca aatatgttta atgttaatga tgtattaggc 361 gtatacagcc ctcacccata tgaacaggag gtttctgcgt taggtggaat accatattct 421 cagatatatg gatggtatcg tgttaatttt ggtgtgattg atgaacgatt acatcgtaac 481 agggaatata gagaccggta ttacagaaat ctgaatatag ctccggcaga ggatggttac 541 agattagcag gtttcccacc ggatcaccaa gcttggagag aagaaccctg gattcatcat 601 gcaccacaag gttgtggaaa ttcatcaaga acaatcacag gtgatacttg taatgaggag 661 acccagaatc tgagcacaat atatctcagg gaatatcaat caaaagttaa gaggcagata 721 ttttcagact atcagtcaga ggttgacata tataacagaa ttcgggatga attatga // LOCUS PIGFSHB 929 bp ss-mRNA MAM 28-JUL-1990 DEFINITION Pig follicle stimulating hormone (FSH) beta-subunit mRNA, 3" end. ACCESSION M35676 KEYWORDS follicle stimulating hormone. SOURCE Pig anterior pituitary, cDNA to mRNA. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 929) AUTHORS Kato,Y. TITLE Cloning and DNA sequence analysis of the cDNA for the precursor of porcine follicle stimulating hormone (FSH) beta-subunit JOURNAL Mol. Cell Endocrinol. 55, 107-112 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 348 follicle stimulating hormone beta-subunit (AA at 1) BASE COUNT 256 a 230 c 207 g 236 t ORIGIN 1 gccatctgct gcaatagctg tgagctgacc aacatcacca tcacagtgga gaaagaggag 61 tgtaacttct gcataagcat caacaccacg tggtgtgctg gctattgcta cacccgggac 121 ctggtataca aggacccagc caggcccaac atccagaaaa catgtacctt caaggagctg 181 gtgtacgaga ccgtgaaagt acctggctgt gctcaccatg cagactccct gtatacgtat 241 ccagtagcca ctgaatgtca ctgtggcaag tgtgacagtg acagtactga ctgcaccgtg 301 agaggcctgg ggcccagcta ctgctccttc agtgaaatga aagaataaag agcagtggac 361 atttcatgct tcctaccctt gtctgaagga ccaagacgtc caagaagttt gtgtgtacat 421 gtgcccaggc tgcaaaccac tatgagagac cccactgatc cctgctgtcc tgtggaggag 481 gagctccagg aatgcagagt gctagggcct cagtcccatc accactcaac cctgtatttt 541 gggtctggtt ccataagttt tattcggtct ttttttttaa attactcaat gaattttatt 601 acatttataa ttgtacaatg atcatcacaa cccaatttta taggatttcc atcccaaacc 661 cccagcatag acccccatct cccaatctgt ctcatttgga aaccataagt ttttcaaagt 721 ccgtgagtca gtatctactc agtcttatta ccttaaagac atgtgggtgt tttctgttta 781 ataatcttag aaatcctctc aagacaggga tatggaccca gaggaaggaa atgggctaag 841 aatgggtgaa aggactaaat gcagcattct cccactagac acagcagcct acaagagcag 901 ggccagtctc tttgtcatga gtgtggccg //
GenBank-Updates@genbank.bio.net (07/31/90)
LOCUS CHKCYTA 4400 bp ds-DNA VRT 31-JUL-1990 DEFINITION Chicken cytotactin gene, 5' flank. ACCESSION M35369 KEYWORDS cytotactin. SOURCE Chicken DNA, (library of Clontech), clone CTG[1,2]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 4400) AUTHORS Jones,F.S., Crossin,K.L., Cunningham,B.A. and Edelman,G.M. TITLE Identification and characterization of the promoter for the cytotactin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.S.Jones, 19-JUN-1990. Author address: F.S.Jones Laboratory of Developmental and Molecular Biology The Rockefeller University 1230 York Avenue N.Y., N.Y. 10021 FEATURES from to/span description signal 3305 3301 (c) GAGA box signal 3996 4001 TATA box BASE COUNT 1156 a 937 c 1031 g 1276 t ORIGIN 1 gtcgacctgc aggtcaacgg atctgagcat cacttgccaa cttgtaccct cagatgctgt 61 gtctatatct gcttaggtca gacttccatt ggagtagggg atatatttct gcttttccta 121 atatcaaaac aatgcccttc agcatgggtt gcaattatga gaatcaatcc agcctcaaag 181 caatgtcatt gtcaatgctg tatgttgaat taaatgtagt catactgctg tttacactgg 241 aaaaggatgg tctatgaata actatataaa atagtattcc catttaaata tatgcactga 301 acaaatagtc aagagtttgc agtgtttttg aaaactcgta ttatacttga gagccattag 361 ttcaggaaag gaggatatta gtacacgggc agcatgaatt gttgagaata tcagatactt 421 aaagcaaaag tttagctatt tagatgctat ctgaaaaaga agttaagcta tcttcttgcg 481 tattagcact gagtgattcc aaacagcaaa tgtagaggat cgaccctgtg atatgatgtc 541 tctctttaaa acctactctt tcatctttct cttaacatta cataagataa actatagttg 601 gtgggttctg tttctcaggc tacctattct gaatctggtg tggctgtttc tgctgtgttc 661 ttgtgcgctg ctgttgagcc tggtactgct ctaacatccc tttttttgca agaaatgtgc 721 ttccttccag ctacaggcac tgaatacttc ccctggctcc tgaaattgtc caagaatgtt 781 tccctgcccc aaggcatgat catctgtcca tttgttttgt aagttatatt ccagtggctc 841 agtgcattcc catctgcaca ctattgaaat cagcttcgct acgtgcatgg caagatgcat 901 ctcacgctgt gctcatctct gtgccttcag ggcagctttt acactgtctg atatgccaat 961 acaaaataaa ttattcctgg gactgggtag attaagtttt caagctgaac taagactaat 1021 tacctgtgtt gcaacaattt gtcccaaggt aacgatgagt atgtagtgtg aaatccacta 1081 ttttccagaa tgatgagttg ggatctcttt caggctttca tgctgttctt gatttcatgc 1141 tatgtccaac tgttcctaga tttgtctgta atgattatcc ttacatgtgt gaattgcatc 1201 tgcagtgtga gcttgtgtgt aaaaagacaa cgataaactc agctgcatat acaaaattta 1261 atgctcgttt actttctcct tttcattaaa atagaaatca actgatattg acagtcatct 1321 agtgaaagag aatggcacag ggcaaatgat tcagttgtaa tccttcatgt ttgacatcac 1381 aacctaaagc tatcctgctg attaaaatac attttccctg ttctttcatg ttatcatttg 1441 cttctgactc aatgtatcac tcagtgtttt ccacagagtg catttaaaga tgtgtgagcc 1501 agaaaagata aaaccaggac aggcctggac tcttacctga ctcttattca ccgcatcacg 1561 gcacgttgtt tgaagtcagg taagtgggtg gaggaagata ggagaaaaaa ttggtttaaa 1621 ttttgattca atagctgctg gcagaagacc tggtgttagc aaacactaca aaaacaaatc 1681 ctgcagattg cttcccgttg tttctgtctt gcttagactt gtggatgtgg acagagagat 1741 ggagtgtttc tcaaatggat ctgaatggac ttctggagta agagaaagca catggtgcat 1801 tctatgggga ctgtctgctt ctcagtcccc atcagatctt attgggaaaa tctcacattg 1861 ggtttttgcc attgagttta attgaatttc acaggactgt atgactgggg gagaattaag 1921 ctatgctggc accatgatca ccttgagaat attctgggga gaacagcaga caaacattga 1981 gtatttattg ggggctgcaa acggtaacct tattgccttt ggaatcaatg gaaacgtcca 2041 gaacaagttc tacttttaag aggtgttcca gaaccgtgga gatgtggcac tgcaggacgt 2101 ggtcagtggg catggtggga tgggttgggg ttggacttgg gggatctggg agatcttttg 2161 caagcctaat gattctgtga ttctaagtca gcagaaagaa tccaatgcct ttactgggga 2221 gcagtcagca gttctgacac tctttccagt aggatcttgt ttctgctctt atgacgtgac 2281 tgctatggaa cacagcgagc cagaaaatta aagcaaaaat cagaaatgag tgctcataaa 2341 tggctgatag ggatgcatcc ctgattgtac tgaagcgttt gatctcggtg tttcctgaat 2401 acttctgtag ttgctcggag aaatgggatc tgtttgttgt acaaatcatt tgtaaaaacg 2461 tcaccaagtt ttgcttgctt ttttgtcagc acctgtttag aaatgattga gagttgtgct 2521 gtgcattgaa gaagggaatg agctcagtaa tattgttaca catcaggaaa taaacacaag 2581 ggtcactggt aaataggaaa tccccaccaa agaaaatctg acccgtgtta ccaaaagaac 2641 tctgttttct gtaatgatga taataataat gattctaaaa tggccatcag aattttgggg 2701 ccaggaattc cccatccctt cccgaaggtg gagaatgggc caagttttcc tgttagtgct 2761 gtgcctgtga agcaggaaaa ggaaaaaatc ctagaagtga tgcgagtcca tggaggctgg 2821 ggaaggccag ctgggaggag gcacactcag aaagggagaa tgctcagaag ggcctggaca 2881 gtccctctta attcttgtga gcacccttgt tgctgaatag gaccccactg ctttcttact 2941 gggcagtccc acagcctcct gaggtaggtg ggtaatgctt aaagggattt ggatcagtcc 3001 tgcttctgaa atggtgaagc aatttcatcc cttctcctag aacacagaca tgacagagct 3061 taactctcta agagcttaac cctgcagctc ccctccctct aactaccagc ccccccccac 3121 ccccgaaaga aaggtagatt cagatagaag ggagctgact gctccctgag tgctaactgt 3181 gatcacaatt ctttgccatg aggctgagat ttgagggtaa gcaggagacc ctccctctga 3241 gcagcactgt gcattgcccg caggtccggg actgaccaag ggactcacgg aggggttgga 3301 ggatctttcc aacacccaaa tgcacagtct ctctttaaaa agaaggaagc cctcctgtgc 3361 gcttctgtgg gcttttatct ctctcttttt aagagaatca gcttaggatg ctgctcagcc 3421 ctgctgtccc ccgcagctga gggagagcag aagcagcgtg ggatgcagag atcacaagcc 3481 tggaaagacc gatgtgtgcc agatcagcag cttgtgtaca tccgagctgc tctgctcatc 3541 cttgcagagc agtgccagcc ccactcgctg cccatccgcc ccagcacact ccgagggact 3601 ttttggaatg gaaaccctgt gtttgtgatc cccctgattc cagtttgctg ctctttcgca 3661 gtgttaaaat aaaggggaaa ggggtttaaa attcctgata atgcactctg ctgatgctgt 3721 aaggaaaata ttcctctttc tgagtcattt gccctgagct gacttttttt tttttgtttg 3781 aagggttcag gggtttcctt ctttagccat aaactgcatg caaattcttt gtgtgtgtgt 3841 gtatgacagt gtgtgtctgc gagtgcgtgt gtgtttcttc agaagctaag ccaaagagag 3901 aggctgttcc tttaaagcaa aactgaaaaa aaaaatgcct tcaggcatcc tattacagtg 3961 gaatgagcca aggatcagga tttgcagctc agctatataa acccggcaga atctggctga 4021 tctgaccagt gtgccgcact gtcaaaccct cctttcacac acgcgcgcac caaatgagac 4081 ggcacaactt ctctgagttt tgacaggacg gcgaggaatc cgggagccga cagctggctg 4141 cagtacctct gcttcgtgga ggtcgcccgt ggcaggatct gatccgtcag cccacacggt 4201 aagcctcctt ccttcctctg ttcaaatatc aggagctctt ttgccaccca agtctgcgac 4261 acggttccat tgcgaccgtg gcagctgtga ctttgttcag tcggagcggg caagggtttt 4321 taggagggag tgtttctgct ttcgttcggc tgtgctccct gatggcataa gaacgtagga 4381 aaggtttcag cggcatcgat // LOCUS HUMIFNB 507 bp ds-DNA SYN 31-JUL-1990 DEFINITION Human fibroblast interferon (IFN-beta) gene, complete cds. ACCESSION M35591 KEYWORDS interferon. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 507) AUTHORS Nagase,Y., Nakamura,N., Tohyama,J., Watanabe,S., Ogino,H., Horikoshi,K., Nii,A., Soma,M., Nobuhara,M., Suzuki,Y. and Mochida,E. TITLE Chemical synthesis of a human fibroblast interferon gene and its expression in Escherichia coli JOURNAL Nucleic Acids Symp Ser 12, 83-86 (1983) STANDARD simple staff_review FEATURES from to/span description pept 3 503 fibroblast interferon (IFN-beta) BASE COUNT 140 a 126 c 106 g 135 t ORIGIN 1 cgatgtcgta caacctgctt ggattcctgc aacgctctag caacttccaa tgtcagaaac 61 tgctctggca gcttaatggt cgtctggaat actgcctgaa agaccgtatg aacttcgaca 121 tcccggaaga gatcaagcaa ctccagcaat tccagaaaga agacgcagct ctgactatct 181 acgaaatgct tcagaacatc ttcgcaatct ttcgtcagga cagctcctct actggttgga 241 acgagactat cgttgagaac ctgctcgcta acgtatatca tcagatcaac cacctgaaga 301 ctgttcttga agagaagctg gagaaagaag atttcactcg aggtaaactg atgtcttccc 361 ttcatctgaa gcgttattac ggtcgtatcc tgcattacct gaaagctaag gaatactctc 421 attgcgcttg gactatcgtt cgtgtagaaa tcctgcgtaa tttctacttt atcaaccgtc 481 tgactgggta ccttcgcaac tgatagg // LOCUS HUMMAC2 908 bp ss-mRNA PRI 31-JUL-1990 DEFINITION Human macrophage galactose-specific lectin (hMAC-2) mRNA, complete cds. ACCESSION M35368 KEYWORDS macrophage galactose-specific lectin. SOURCE Human colon epithelial carcinoma cell line HT29, cDNA to mRNA, clone hMac2.2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 908) AUTHORS Cherayil,B., Chaitovitz,S., Wong,C. and Pillai,S. TITLE Molecular cloning of a macrophage galactose-specific lectin JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.J.Cherayil, 22-JUN-1990. FEATURES from to/span description pept 23 769 macrophage galactose-specific lectin (hMAC-2) mRNA < 1 908 macrophage galactose-specific lectin mRNA site 137 248 lectin domain BASE COUNT 251 a 233 c 217 g 207 t ORIGIN 1 ggagccagcc aacgagcgga aaatggcaga caatttttcg ctccatgatg cgttatctgg 61 gtctggaaac ccaaaccctc aaggatggcc tggcgcatgg gggaaccagc ctgctgggca 121 ggggctaccc aggggcttcc tatcctgggg cctaccccgg gcaggcaccc ccagggctta 181 tcctggacag gcacctccag gcgcctaccc tggagcacct ggagcttatc ccggagcacc 241 tgcacctgga gtctacccag ggccacccag cggccctggc tacccatctt ctggacagcc 301 aagtgccccc ggagcctacc ctgccactgg cccctatggc gcccctgctg ggccactgat 361 tgtgccttat aacctgcctt tgcctggggg agtggtgcct cgcatgctga taacaattct 421 gggcacggtg aagcccaatg caaacagaat tgctttagat ttccaaagag ggaatgatgt 481 tgccttccac tttaacccac gcttcaatga gaacaacagg agagtcattg tttgcaatac 541 aaagctggat aataactggg gaagggaaga aagacagtcg gttttcccat ttgaaagtgg 601 gaaaccattc aaaatacaag tactggttga acctgaccac ttcaaggttg cagtgaatga 661 tgctcacttg ttgcagtaca atcatcgggt taaaaaactc aatgaaatca gcaaactggg 721 aatttctggt gacatagacc tcaccagtgc ttcatatacc atgatataat ctgaaagggg 781 cagattaaaa aaaaaaaaaa agaatctaaa ccttacatgt gtaaaggttt catgttcact 841 gtgagtgaaa atttttacat tcatcaatat ccctcttgta agtcatctac ttaataaata 901 ttacagag // LOCUS MUSIGLCD 794 bp ds-DNA ROD 31-JUL-1990 DEFINITION Mouse Ig germline lambda-chain gene C5 region. ACCESSION M35582 KEYWORDS constant region; germline; immunoglobulin light-chain; lambda-immunoglobulin. SOURCE Mouse (strain SPE) DNA. ORGANISM Mus spretus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 794) AUTHORS Mami,F., Cazenave,P.-A. and Kindt,T.J. TITLE Conservation of the immunoglobulin C-lambda-5 gene in the Mus genus JOURNAL EMBO J. 7, 117-122 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 294 613 Ig lambda-chain C5 region BASE COUNT 195 a 188 c 195 g 216 t ORIGIN 1 ggttccagct tcaaggggct agagaattca gctggtctta gtctcagcgg gggaactgag 61 attgcaaggg tctgggtctg ggtcatttta tctggaagag gaacatgttc taatggatgc 121 taggctgtct gctctccaag ggactcaagt ggtcagagga gaagaaggaa gcatccctgg 181 atggaagact gatgctgtag tgaatggcca cagagctcct gataagagaa ggacgcttcc 241 ttatcacatg ggctctccta tgctaactct tatccccttc tctatctgcg caggtcagcc 301 caagtctgat cccttggtca ctctgttcct gccttcctta aagaatcttc aggccaataa 361 ggtcacactg gtgtgtttgg tgagcgaatt ctacccaggt actttggtgg tggactggaa 421 ggtagatggg gtccctgtca ctcagggtgt agagacaacc caaccctcca aacagaccaa 481 caacaaatac atggtcagca gctacctgac attgatatct gaccagtgga tgcctcacag 541 tagatacagc tgccgggtca ctcatgaagg aaacactgtg gagaagagtg tgtcacctgc 601 tgagtgttct tagaccacaa tcctccctga agccttaggg gcctggatct gaagtgccag 661 aaaaagttgt tttttgtttt tttttttttt tccccattaa ccatctcact gtctttcctg 721 tgcctaatac tcaataaata tcttaccacc aaccagagag tcttgtcgat ctcatcattt 781 gcatgccagg gatg // LOCUS PSELSP 2396 bp ds-DNA BCT 31-JUL-1990 DEFINITION P.fluorescens signal peptidase II (lsp) gene, complete cds and isoleucyl-tRNA synthetase (ileS) gene, 3' end. ACCESSION M35366 KEYWORDS isoleucyl-tRNA synthetase; signal peptidase II. SOURCE P.fluorescens (strain NCIB 10586) DNA. ORGANISM Pseudomonas fluorescens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2396) AUTHORS Isaki,L., Beers,R. and Wu,H. TITLE Nucleotide sequence of the Pseudomonas fluorescens signal peptidase II (lsp) and flanking genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Wu, 18-JUN-1990. Author address:H.Wu Uniformed Services University of the Health Sciences 4301 Jones Bridge Rd. Bethesda, MD 20814 FEATURES from to/span description pept < 1 729 isoleucyl-tRNA synthetase (ileS) (AA at 1) pept 722 1234 signal peptidase II (lsp) pept 1227 1679 ORF149 pept 1857 > 2396 ORF316 rpt 768 810 inverted repeat rpt 1239 1268 inverted repeat rpt 1640 1674 inverted repeat rpt 1697 1889 inverted repeat BASE COUNT 498 a 706 c 712 g 478 t 2 others ORIGIN 1 ctgcagcgcg agttgcagga acactacggc gaataccgct tctggaacgt ctactccaag 61 atccacaact tctgcgtgca ggagctgggt ggtttctacc tcgacatcat caaggaccgc 121 cagtacacca ctggcgccaa cagcaaggcg cgccgttcgg cagaccgcgc tgtaccacat 181 cagcgaaggc tggtgcgctg gatcgcaccg atcctggcat tcaccgctga cgaactgtgg 241 gagtacctgc cgggcgagcg taacgagtcc gtcatgctca acacctggta cgaaggcctg 301 accgaactgc cggctgactt cgaactgggc cgcgagtact gggagggcgt gatggccgtc 361 aaggttgcgg tgaacaagga actggaagtc cagcgtgcgg ccaaggccgt gggtggcaac 421 ctgcaagccg aagtcaccct gtttgccgaa gacggcctga ccgccgacct ggccaagctg 481 agcaacgagc tgcgcttcgt gctgatcacc tctactgcga gcctggcgcc gtttacccag 541 gctccggcag atgcggtggc taccgaagtg cctggcctta agctcaaagt ggtcaagtcg 601 gccttcccca agtgcgctcg ttgctggcac tgccgtgaag acgtcggcgt gaaccctgag 661 cacccggaaa tctgcggtcg ttgtgtcgac aacatcagcg gtgaaggcga ggttcgccac 721 tatgcctaat gcagacagtc gtttcggacg tctgggctgg ctcgtactga gcctgctggt 781 cctggtcatt gaccaggtca gcaaggctca cttcgagggc tccctggaaa tgttccagca 841 aatcgtggtg atcccggatt atttcagctg gaccctggcc tacaacactg gcgccgcctt 901 cagcttcctc gctgacggcg gtggctggca gcgctggctg tttgctgtga tcgccgtggt 961 ggtaagtgcc gtactggtgg tgtggctcaa gcgcctgggc cgcgacgaca cctggctggc 1021 catcgcgctg gccctagtgt tgggtggcgc gctgggcaac ctgtatgacc gcatcgccct 1081 gggccatgtg atcgacttta tcctggtgca ttggcagaac cgccactact tcccggcgtt 1141 caactttgcc gacagtgcta tcaccgtcgg cgcaatcatg ctggcgctgg atatgttcaa 1201 gagcaagaaa accggagaaa ccgtcaatga ctgatcaggt attggctgag caacgcatcg 1261 gccagaacac ggaagtcact ttgcatttcg cactgcgcct ggagaatggc gacacggtcg 1321 acagcacgtt cgacaaggcc cccgcgacct tcaaggtcgg cgacggcaac ctgctgccgg 1381 gtttcgaagc ggcactgttc ggtttcaagg ccggtgacaa gcgcaccctg caaatcctgc 1441 cggaaaacgc ctttggccag cccaacccgc aaaacgtgca gatcatcccc cgttcgcagt 1501 tccagaacat ggacctgtcg gaaggcttgc tggtgatctt caatgatgcg gcgaacactg 1561 aattgcctgg cgtggtgaaa gcatttgatg acgcgcaagt gaccatcgac ttcaatcacc 1621 cgttggccgg caagacgttg acctttgacg tcgagattat cgacgtcaaa gcgctgtaac 1681 cgaccgtacg cggtctaaaa tgtgggaggg ggcttgctcc cgatagcggt atatcagcta 1741 aatgtgcaga ctgatatacc tatatcggga gcaagccccc tcccaccttt gatctccatt 1801 gtcggttgag tcagtgtcaa attgactcaa tatggctgca atacacgagg cacagcatgc 1861 aaatcaaact cgccaacccc cgtggcttct gcgccggcgt ggaccgggcg atcgaaatcg 1921 tcaatcgcgc cctggaagtc ttcgggccgc cgatttatgt gcgccatgaa gtcgtccata 1981 acaaatttgt ggtcgaagac ttgcgtgcgc gcggggcgat ctttgtcgaa gaactcgatc 2041 aggtgaaaga cgacgtgatc gtcatcttca gtgcccacgg tgtttcccag gctgtacgta 2101 ccgaagcggc gggccgtggc ctgaaggtgt tcgatgccac ctgcccactg gtgaccaagg 2161 tgcatatcga ggtggcgcgc tacagccgcg acggtcgtga gtgcatcctg atcggccacg 2221 ccggtcaccc ggangtagaa ggcaccatgg gccaatacga cgccagcaac ggcggtgcta 2281 tctacctggt cgaggacgag aaagacgtcg ccaacttgca ggtgcacaat ccggaacgcc 2341 tggccttcgt gacccagacc accttgtcca tggacgacac cagccgngta atcgat // LOCUS PSEXILES 1128 bp ds-DNA BCT 31-JUL-1990 DEFINITION P.fluorescens X and isoleucyl-tRNA synthetase (ileS) genes, complete cds. ACCESSION M35367 KEYWORDS isoleucyl-tRNA synthetase. SOURCE P.fluorescens (strain NCIB 10586) DNA. ORGANISM Pseudomonas fluorescens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1128) AUTHORS Isaki,L., Beers,R. and Wu,H. TITLE Nucleotide sequence of the Pseudomonas fluorescens signal peptidase II (lsp) and flanking genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Wu, 18-JUN-1990. Author address:H.Wu Uniformed Services University of the Health Sciences 4301 Jones Bridge Rd. Bethesda, MD 20814 FEATURES from to/span description pept 90 1025 protein X pept 1040 > 1128 isoleucyl-tRNA synthetase (ileS) signal 56 61 TATA box BASE COUNT 198 a 349 c 349 g 231 t 1 others ORIGIN 1 tcgattcgac ccatttggcc tgcgctgttg cctgtcgtcc ggggctgggt gtggttataa 61 tcgaccactt tatgagcaag aagcgcgtta tgcagctggt tcgaggtctc cacaacctgc 121 gccccgagca tcggggctgc gtcgccacta ttggcaactt tgacggtgtt caccgtggcc 181 accaggctat cctggcaagg ctgcgcgagc gtgcggtcga gttgggtgtg cccagctgcg 241 tggtgatttt cgagccacaa ccgcgggagt tctttacccc ggaaacagcg ccggcccgct 301 tggcgcttgc gcgacaagct gcaactgctg gcggagaggg cgtggaccgc gtcctctgcc 361 tggctttcaa ccagcgtttg cgcagcctca gcgccgccga gttcgtcgac cgcattctgg 421 tcgatggcct gggtgtacaa cacctggagg ttggtgacga cttccatttc ggttgcgatc 481 gggtcgggga tttcgatttc ctgcaacatg ccggcgtcaa ccagggcttt accgttgaag 541 ccgcccaaac cgtcgaactg gacggcctgc gtgtgagcag cacccaggtg cgtaacgcct 601 tggctgccgc cgacttcgac ctggccgagc gtttgctcgg tcgcccgttc cgcattgccg 661 ggcgggtact gcacggccag aagctggcgc gccaattggg cacgccaact gccaacgtgc 721 aactcaagcg ccgccgagtg ccgctgactg gggtttacct ggtgagcgtc gacatcgacg 781 gccaatcgtg gccgggagtc gccaatatag gcgtcaggcc cacggttgca ggtgatggca 841 aggcccacct ggaagttcac cttttggatt ttgccggtga tttatacgac cggcgtttga 901 cggtggtttt ccaccagaag ctgcgtgaag agcagcgttt cgcctccctg gaggcgttga 961 aaacggcgat caatgcggat gtcgccgccg cccgtgcact agccgcacct agcgcccatc 1021 gctaaccgaa gagccttaaa tgaccgacta taaagccacg ntaaaccttc cggacaccgc 1081 cttcccaatg aaggccggcc tgccacagcg cgaaccgcag tcctgcag // LOCUS BLY656 672 bp ss-mRNA PLN 31-JUL-1990 DEFINITION H.vulgare thaumatin II mRNA, complete cds. ACCESSION M35656 KEYWORDS thaumatin II. SOURCE H.vulgare (cv. Bomi) endosperm, cDNA to mRNA, clone pUP-23. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 672) AUTHORS Lazaro,A., Rodriguez-Palenzuela,P., Marana,C., Carbonero,P. and Garcia-Olmedo,F. TITLE Signal peptide homology between the sweet protein thaumatin II and unrelated cereal alpha-amylase/trypsin inhibitors JOURNAL FEBS Lett. 239, 147-150 (1988) STANDARD simple staff_review FEATURES from to/span description pept 39 482 thaumatin II precursor sigp 39 101 thaumatin II signal peptide matp 102 479 thaumatin II BASE COUNT 155 a 175 c 209 g 133 t ORIGIN 1 aagagattga accaacgacc aataaactag tatcaacaat ggcatccgac catcgtcgct 61 tcgtcctctc cggcgccgtc ttgctctcgg tcctcgccgt cgccgccgcc accttggaga 121 gcgtcaagga cgagtgccaa ctaggggtgg acttcccgca taacccgtta gccacctgcc 181 acacctacgt gataaaacgg gtctgcggcc gcggtcccag ccggcccatg ctggtgaagg 241 agcggtgctg ccgggagctg gcggccgtcc cggatcactg ccggtgcgag gcgctgcgca 301 tcctcatgga cggggtgcgc acgccggagg gccgcgtggt tgagggacgg ctcggtgaca 361 ggcgtgactg cccgagggag gagcagaggg cgttcgccgc cacgcttgtc acggcggcgg 421 agtgcaacct atcgtccgtc caggcgccgg gagtacgctt ggtgctactg gcagatggat 481 gacgatgcaa atgcgccaag gtaatgaagc ggagtactgt atacagaata aaagtactcg 541 agtgaaaaca aactcataaa taaaccttgt gagatgtatg cgtatgatct atggtgtgga 601 cagttaaatt gtggccgatt gatgaataaa aaaggttgga acaaattaaa ttgttgtggg 661 ttcatatact at // LOCUS SPIFERRI 632 bp ss-mRNA PLN 31-JUL-1990 DEFINITION Spinach ferredoxin I mRNA, complete cds. ACCESSION M35660 KEYWORDS ferredoxin I. SOURCE Spinach (var. Monatol) seedlings, cDNA to mRNA, p6SocFd/1. ORGANISM Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae. REFERENCE 1 (bases 1 to 632) AUTHORS Wedel,N., Bartling,D. and Herrmann,R.G. TITLE Analysis of cDNA clones encoding the entire ferredoxin I precursor polypeptide from spinach JOURNAL Botanica Acta 101, 295-300 (1988) STANDARD simple staff_review FEATURES from to/span description pept 17 460 ferredoxin I precursor sigp 17 166 ferredoxin I signal peptide matp 167 457 ferredoxin I mRNA < 1 632 ferredoxin I mRNA BASE COUNT 176 a 131 c 139 g 186 t ORIGIN 1 ttacaaaatt gtaaaaatgg cagcaaccac cacaacaatg atgggcatgg ccaccacctt 61 tgtcccaaaa ccccaagcac caccaatgat ggcggcgctt ccatccaaca ccggccgctc 121 tttgttcgga ctcaagaccg gtagccgtgg cggaaggatg acaatggctg cctacaaggt 181 aaccttggta acacccaccg gtaacgtaga gtttcaatgc ccagacgatg tttacatctt 241 ggatgctgct gaagaagaag gcattgactt gccttactca tgcagagctg ggtcgtgctc 301 ttcatgcgcc ggaaagctta agacaggtag tcttaaccaa gatgatcaga gttttttgga 361 tgacgatcag atcgatgaag gatgggttct tacctgtgct gcttaccctg ttagtgatgt 421 tactattgag acccacaagg aagaggagct tactgcctaa ttcatttttt tttatttttt 481 attattttgt tctcatttga gagggaattg aaagactaaa aaagagtaaa tgcagcgagg 541 agtttttact tcttcgatct gtggtttgta attgtgtatt atcatgttaa tcaattgctc 601 ttataagctt aattactacg taatatatat at // LOCUS GEM5SRRNA 108 bp ss-rRNA RNA 31-JUL-1990 DEFINITION G.obscuriglobus 5S ribosomal RNA. ACCESSION M35170 KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA. SOURCE G.obscuriglobus (strain UQM 2246) ribosomal RNA. ORGANISM Gemmata obscuriglobus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Planctomycetales. REFERENCE 1 (bases 1 to 108) AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E. TITLE A unique type of eubacterial 5S rRNA in members of the order Planctomycetales JOURNAL J. Mol. Evol. 27, 121-125 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA 1 > 108 5S rRNA BASE COUNT 25 a 33 c 31 g 19 t ORIGIN 1 ttccggtgac catacccaaa cggaaacacc cgttcccatt ccgaacacgg ccgtgaaagt 61 ttgaggccga tggtagtgcg tccagcgcga gagtaggtat cgccggcc // LOCUS ISO5SRRNA 111 bp ss-rRNA RNA 31-JUL-1990 DEFINITION I.pallida 5S ribosomal RNA. ACCESSION M35169 KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA. SOURCE I.pallida (strain IS 1B) ribosomal RNA. ORGANISM Isosphaera pallida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Planctomycetales. REFERENCE 1 (bases 1 to 111) AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E. TITLE A unique type of eubacterial 5S rRNA in members of the order Planctomycetales JOURNAL J. Mol. Evol. 27, 121-125 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA 1 > 111 5S rRNA BASE COUNT 19 a 39 c 32 g 21 t ORIGIN 1 ctccggtgac cataccgtcg gggtcctacc cgttcccatt ccgaacacgg ccgtcaagcc 61 cgccgggcct atgatagtgc gtcagcgcga aagtcggtga tcgccggatc t // LOCUS MUSINTZ1 271 bp ds-DNA ROD 31-JUL-1990 DEFINITION Mouse integration site for foreign DNA, 5' flank. ACCESSION M35630 KEYWORDS integration site. SEGMENT 1 of 3 SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 271) AUTHORS Makarova,I.V., Tarantul,V.Z. and Gazaryan,K.G. TITLE Structural features of the integration site of foreign DNA in the genome of transgenic mice JOURNAL Mol. Biol. 22, 1235-1242 (1988) STANDARD simple staff_review FEATURES from to/span description rpt 9 50 region of dyad symmetry BASE COUNT 108 a 48 c 55 g 60 t ORIGIN 1 gaaaaatcta cgttaataaa acgaactaac ggaacaacat tattacaggt agaaagattc 61 atcagttgag atttaggaat accacatcaa ctaatgcaga tacataacgc caaaaggaat 121 tacgaggcat agtaagagca acactatcat aaccctcgtt taccagacga cgataaaaac 181 caaaatagcg agaggctttt gcaaagaagt tttgccagag ggggtaatag taaatgttta 241 gactggatag cgtccaatat gcggtcgtca a // LOCUS MUSINTZ2 300 bp ds-DNA ROD 31-JUL-1990 DEFINITION Mouse integration site for foreign DNA, 3' flank. ACCESSION M35631 KEYWORDS integration site. SEGMENT 2 of 3 SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Makarova,I.V., Tarantul,V.Z. and Gazaryan,K.G. TITLE Structural features of the integration site of foreign DNA in the genome of transgenic mice JOURNAL Mol. Biol. 22, 1235-1242 (1988) STANDARD simple staff_review BASE COUNT 56 a 94 c 81 g 69 t ORIGIN About 1.5 kb after segment 1. 1 ccagctggtg ctccatgtgc agccaggtcc tggccccact atgtccatcc cagctccagc 61 ccctgcttgc tggcagcagc ctggcagccc caggcacttc tcccagcacg ctgccatgtc 121 actggtcagg tcgggtgctg cctgcctgct gccagcaggc tgtccccggt gtacccactg 181 tttccccact gctaggtcat ctgtgcactg atgcatgcag gtgattctcg gtaatatggc 241 atacgtgttc gggaatgatc gtaatcaaaa aggcgagata agctggtcag agatatatag // LOCUS MUSINTZ3 99 bp ds-DNA ROD 31-JUL-1990 DEFINITION Mouse repetitive sequence DNA. ACCESSION M35632 KEYWORDS . SEGMENT 3 of 3 SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 99) AUTHORS Makarova,I.V., Tarantul,V.Z. and Gazaryan,K.G. TITLE Structural features of the integration site of foreign DNA in the genome of transgenic mice JOURNAL Mol. Biol. 22, 1235-1242 (1988) STANDARD simple staff_review BASE COUNT 12 a 26 c 41 g 20 t ORIGIN About 1.5 kb after segment 3. 1 agctgggaat cggggggggg ggggggggcg ggtggtggtt tggccaggga tttgggcacc 61 ttcctaagcc ccaaccctaa tgccggtttc cttcccccg // LOCUS NPOCAPA 3555 bp ds-DNA VRL 31-JUL-1990 DEFINITION O.pseudotsugata polyhedrosis virus capsid protein p87 gene, complete cds. ACCESSION M35445 D00514 KEYWORDS capsid protein; structural protein. SOURCE O.pseudotsugata polyhedrosis virus DNA. ORGANISM Orgyia pseudotsugata polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 3555) AUTHORS Mueller,R., Pearson,M.N., Russell,R.L.Q. and Rohrmann,G.F. TITLE A capsid-associated protein of the multicapsid nuclear polyhedrosis virus of Orgyia pseudotsugata: Genetic location, sequence, transcriptional mapping, and immunocytochemical characterization JOURNAL Virology 176, 133-144 (1990) STANDARD simple staff_review FEATURES from to/span description pept 216 2090 capsid protein p87 mRNA 182 2282 capsid protein p87 mRNA (alt.) mRNA 209 2282 capsid protein p87 mRNA (alt.) signal 2194 2199 polyA site BASE COUNT 946 a 1003 c 880 g 726 t ORIGIN 1 gaattctgta aaaaaggtta ggccctttat gtcaatcaaa ctgtgctggt caaagtactt 61 ggccgtcaaa aaggttagcg agtcgatttc gcgttggtgc atttgcgctt caaagcgcac 121 gtgttcaaag cggtgctcca gaccgtgcac cgcgttgaac ctcaaattgt acaacaattt 181 gtaagtgtgc atgttgcatt cgatatagcc ttataatgga cgagcagcat tcattacgca 241 ttgccgcgct ggctggcgag atcttgacgc gcgaccgcgc acaggtgaac acaatcatac 301 acagccccga gcgcgcgctg ggccaaaaat tggacgccat tacggcgcta gtcgattcaa 361 tgcagcccgg cacgccgcgc gacgctgctg tgaacgaagc cgcaaacgtt accgcgcaaa 421 gcccaatgtc ggaaactcaa gacccgcagc gcgccaacga caacgtgtca gataccgttg 481 caaatgaaaa cgcgcaaaac ttgttgttgg agggccaaga ccgcgttttg cgacatcgcg 541 ttttgcaaat cgcggtaacg tttttgcagc gcaacaagcg agtaaaagcg aacgcgacca 601 cgttggccca gattgaggaa gcgctccgaa actatgaaac tgcaaaaaac agcggcgcaa 661 gcgacagtgt aatcgacggg tttcttgaac gtgccgaatc tttgtttaac accctaaaaa 721 acatatcgct atccgagctg ttggaccgcg aaagcgcagt gtttgcggac acggaaagcg 781 ctccgcggac gcaaaccgcg gacaactcgc cgccacctgt gagcgaacaa gactttgacc 841 ggttggacat tagccagctc accgattaca tcgagaacaa ttacagggat cagtttgatt 901 tcgacaaaca caacagtgtg gaagacgtgc gcaactttgc aaaaaatctt tggcgcaaca 961 aaacgcgtgt cacgtccacg cctctgcagg aataccagat gccaccgcaa acgcccgcgc 1021 ctctgcaaga ccaaatgcca ccgcaaacgc ccgcgtacgc gacgcccgcg cagcagccaa 1081 gccagccaac gcccgcgcaa acgcccgcgc agcagccaag ccagccaacg cccgcgtacg 1141 ttacgtctgc gcaaacgccc gcgcagcagc caagccagcc aacgcccgtt tccaattaca 1201 gttgggaaag gcgcgtcgcg tcaatgtttg cgaatacaga tttaccacaa aacgtgccgc 1261 tgcccgacag ctacgacacc tcgtccgtca ttgggcaaaa acggcgcaag cggcgcgcgc 1321 cgccgctgcc gccgtactcg tcagacgaag aagacgccgc cccgccacgt tcgccaaaac 1381 gcaaagagtc gctttcctcg tctgaagaag acgaatttga ctacgagcgc gaacaaaaac 1441 gccggcgcga ggaggataaa aactttttgc gcctaaaggc gttggaactg tccaagtacg 1501 ccggcgttaa cgagcgcatg gaaaagattg tgcgcgttac caaggcgatg caacaaacgt 1561 acgactactg caattgcaaa aacacaatca gcggcacgcc cgccgcggcg tcgtttatta 1621 acttgttgcg gcggctgaac acctacaacc tcagccacgt ggaaatgacg gtcaactttt 1681 acgagctctt gtaccccctc acgctgtaca acgacgagtc caaccgcatt gttggctaca 1741 tattcgcggc gaccaactat tttcaaaatt gcgccaaaaa ctttgggcgc atgcgcgccg 1801 agtttaacga gcacggcccg tttgcgcaaa ttgattcgct cgtcatgttt gtcatcaaat 1861 tcaacttttt gtgcgacttg caaacgtttt ttggaaaaat tgacggcttg ccaatgttgg 1921 cccagcccaa catcaagacg cacaccgtgc tggtgatgcg cgacaagatc gtcaagctgg 1981 cctttggcgc gctgcaatac gacaccagtc tgaaaacgga caatcggcgg gaccccaaac 2041 acctgcaacg actcatccag ttgatgaacg ccgattttaa cataatgtaa acaaatgtgt 2101 tagcaattta tttattacaa acagagttac gatattcggt gtacatggtt ttataatctt 2161 tgtgatcaaa gttaaaacgt tttctataca aataataaag cttgttaaac agctgttttc 2221 gttttttttt gtgtcgcaaa caaatttttt gtggcaatgg cgtagcacat ctagaaaatt 2281 aaatttttcg ccgaccgcac actcttcatt aagctcgtta cggctggccg cctatgcgat 2341 aactgcacgt gacacacagg gtgtagtggc cgcatggcgc cgccaccatg cgcttcggct 2401 cgctcatgca aacaatgcac tcgtcgttgg cttcgtcggt aaacacgcgc tcaacagtaa 2461 ttttacaaac ggtaacggcg tctggtttta tgtaataata gcatttggag cggcaaatta 2521 aggaactgtc tttgggcgcc acggcaaaag tggcgccctc gcgccgcaac cgcttgggaa 2581 aggtgcagtt ggcgtcgcgc cgataggcgg gatcgtctat acgcacgcgc agcctgccgt 2641 tcacctttac acaaatcaac ttaacgtgcg ccattgcacg gtcgaatgta ggaccagtgt 2701 caattattta agacaccgtt agcataaccg ctggcccaat ggacgccaaa tacaaggtgg 2761 tcgacgtgga cacctttgcc cgccagctaa tcacggacaa gtgcagcgag ctgattgaaa 2821 ccgaaaatct attgcctgca aacattttgc atgttgttaa acaagcaaga gacaaatact 2881 ttgaagaccc ctcggtcaaa aactacgaat atgtcaaaaa cctgtttttg cgcacaaaat 2941 acatggacga ctcaattgac tacaaaaatt ttaatcggcg cgtgttgctg attgtgttca 3001 aatttgcgct caaccgaggc tccggctact ttccctcgta cagggagctc attgaagtgg 3061 ccgtcaagcg cctcaacaaa atcaaccccg acctcaagag ctcgccgcgc gccatgttgc 3121 agcactacaa cgagtgcctg gaaaacctgg acaacccggt gaccgacgag caccacctgt 3181 taacgtttgg aaaagaagtg gccaccaaaa tgtttatcga ggcgttcgag tttagctacg 3241 ccagcaacaa cgaaatcaac ttaactacca acaagcgcgg ctcggacctg ttcgacccga 3301 taccgatgcc cgcgcccgcg cccgcgcctt cggcgtcttt gttggacaac gtcatgaacg 3361 agcgcaagcg caagttgcag gcgtcggtca cgacaacgcc gcccaaacga tgcaaattgg 3421 ccgaccggcc ggcgcagacc acacaagaca cgccgcgcgc gccgcaaccg gcgcctgtgc 3481 gcgcgcagcg gcccctgttt acgctgtaga cgcgcgcgca gaggcgtcgg tgggcggcgc 3541 gtaatttcgc tgcag // LOCUS PLL5SRRNA 110 bp ss-rRNA RNA 31-JUL-1990 DEFINITION P.marina 5S ribosomal RNA. ACCESSION M35164 KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA. SOURCE P.marina (strain IFAM 1313) ribosomal RNA. ORGANISM Pirella marina Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Planctomycetales. REFERENCE 1 (bases 1 to 110) AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E. TITLE A unique type of eubacterial 5S rRNA in members of the order Planctomycetales JOURNAL J. Mol. Evol. 27, 121-125 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA 1 > 110 5S rRNA BASE COUNT 30 a 29 c 25 g 26 t ORIGIN 1 ttccggtgac catatcgaaa aggtcatacc tgttcccatt ccgaacacag ccgtcaagct 61 tttcgagccg atgatagtac cacaagtgtg aaagtaggta tcgccggatc // LOCUS PLLS5RRNAA 111 bp ss-rRNA RNA 31-JUL-1990 DEFINITION Pirella sp. 5S ribosomal RNA. ACCESSION M35165 KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA. SOURCE Pirella sp. (strain IFAM 1310) ribosomal RNA. ORGANISM Pirella sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Planctomycetales. REFERENCE 1 (bases 1 to 111) AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E. TITLE A unique type of eubacterial 5S rRNA in members of the order Planctomycetales JOURNAL J. Mol. Evol. 27, 121-125 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA 1 > 111 5S rRNA BASE COUNT 29 a 29 c 25 g 28 t ORIGIN 1 ttccggcgat catatcttaa aggttatacc tgttcccatt ccgaacacag cagtcaagct 61 ttaagagccg atgatagtgc ccaccagcgt gaaagtaggt cttgccggat c // LOCUS PLLS5RRNAB 111 bp ss-rRNA RNA 31-JUL-1990 DEFINITION Pirella sp. 5S ribosomal RNA. ACCESSION M35166 KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA. SOURCE Pirella sp. (strain IFAM 1441) ribosomal RNA. ORGANISM Pirella sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Planctomycetales. REFERENCE 1 (bases 1 to 111) AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E. TITLE A unique type of eubacterial 5S rRNA in members of the order Planctomycetales JOURNAL J. Mol. Evol. 27, 121-125 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA 1 > 111 5S rRNA BASE COUNT 29 a 31 c 27 g 24 t ORIGIN 1 ttccggtgac catatggttg tggaaacacc tgttcccttt ccgaacacag cagttaagca 61 caaccagccg atgatagtgc ccaccagtgc gaaagtaggt atcgccggat c // LOCUS PLT5SRRNAA 111 bp ss-rRNA RNA 31-JUL-1990 DEFINITION P.limnophilus 5S ribosomal RNA. ACCESSION M35167 KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA. SOURCE P.limnophilus (strain IFAM 1008) ribosomal RNA. ORGANISM Planctomyces limnophilus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Planctomycetales. REFERENCE 1 (bases 1 to 111) AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E. TITLE A unique type of eubacterial 5S rRNA in members of the order Planctomycetales JOURNAL J. Mol. Evol. 27, 121-125 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA 1 > 111 5S rRNA BASE COUNT 26 a 33 c 29 g 23 t ORIGIN 1 ttccggtgac tttacgcgtg aggaaacact cgttcccatt ccgaacacga cagttaagct 61 cccgcggccg atgatagtgc ccaccagcgt gaaagtaggt tatcgccgga c // LOCUS PLT5SRRNAB 110 bp ss-rRNA RNA 31-JUL-1990 DEFINITION P.brasiliensis 5S ribosomal RNA. ACCESSION M35168 KEYWORDS 5S ribosomal RNA; small subunit ribosomal RNA. SOURCE P.brasiliensis (strain IFAM 1448) ribosomal RNA. ORGANISM Planctomyces brasiliensis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Planctomycetales. REFERENCE 1 (bases 1 to 110) AUTHORS Bomar,D., Giovannoni,S. and Stackebrandt,E. TITLE A unique type of eubacterial 5S rRNA in members of the order Planctomycetales JOURNAL J. Mol. Evol. 27, 121-125 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA 1 > 110 5S rRNA BASE COUNT 28 a 26 c 29 g 27 t ORIGIN 1 ttctggtgac tttacgtctg gggaaacact cgttcccatt ccgaacacga cagttaagcc 61 ctgacggccg atgatagtac tgcaagtgtg aaagtaggtg attgccagac // LOCUS MZEPPDK1F1 1272 bp ds-DNA PLN 31-JUL-1990 DEFINITION Z.mays pyruvate,orthophosphate dikinase (PPDK1) gene, 5' end. ACCESSION X14927 KEYWORDS pyruvate,orthophosphate dikinase. SEGMENT 1 of 2 SOURCE Z.mays mays (strain B73) DNA, clone H2-lambda-21; and cDNA to mRNA, clone pPPDK2. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1272) AUTHORS Glackin,C.A. and Grula,J.W. TITLE Organ-specific transcripts of different size and abundance derive from the same pyruvate,orthophosphate dikinase gene in maize JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.A.Glackin, 31-MAR-1989, for release after publication. FEATURES from to/span description pept 1017 / 1241 pyruvate,orthophosphate dikinase precursor (PPDK1) (EC 2.7.9.1) sigp 1017 1241 pyruvate,orthophosphate dikinase signal peptide pre-msg 805 > 1272 PPDK1 mRNA and intron IVS 1242 > 1272 PPDK1 intron A BASE COUNT 306 a 324 c 327 g 315 t ORIGIN 1 agtactgaaa gcaatctaaa gacatggagg tggaaggcct gacgtagata gagaagatgc 61 tcttagcttt cattgtcttt cttttgtagt catctgattt acctctctcg tttatacact 121 ggttttttaa acactcctta acttttcaaa ttgtctcttt ctttacccta gactagataa 181 ttttaatggt gattttgcta atgtggcgcc atgttagata gaggtaaaat gaactagtta 241 aaagctcaga gtgataaatc aggctctcaa aaattcataa actgtttttt aaatatccaa 301 atatttttac atggaaaata ataaaattta gtttagtatt aaaaaattca gttgaatata 361 gttttgtctt caaaaattat gaaactgatc ttaattattt ttccttaaaa ccgtgctcta 421 tctttgatgt ctagtttgag acgattatat aatttttttt gtgcttaact acgacgagct 481 gaagtacgta gaaatactag tggagtcgtg ccgcgtgtgc ctgtagccac tcgtacgcta 541 cagcccaagc gctagagccc aagaggccgg aggtggaagg cgtcgcggca ctatagccac 601 tcgccgcaag agcccaagag gccggagctg aaggatgagg gtctgggtgt tcacgaattg 661 cctggaggca ggaggctcgt cgtccggagc cacaggcgtg gagacgtccg ggataaggtg 721 agcagccgct gcgatagggg cgcgtgtgaa ccccgtcgcg ccccacggat ggtataagaa 781 taaaggcatt ccgcgtgcag gattcacccg ttcgcctctc accttttcgc tgtactcact 841 cgccacacac accccctctc cagctccgtt ggagctccgg acagcagcag gcgcggggcg 901 gtcacgtagt aagcagctct cggctccctc tccccttgct ccatatgatc gtgcaaccca 961 tcgagctacg cgcgtggact gccttccctg ggtcggcgca ggaggggatc ggaaggatgg 1021 cggcgtcggt ttccagggcc atctgcgttc agaagccggg ctcaaaatgc accagggaca 1081 gggaagcgac ctccttcgcc cgccgatcgg tcgcagcgcc gaggcccccg cacgccaaag 1141 ccgccggcgt catccgctcc gactccggcg cgggacgggg ccagcattgc tcgccgctga 1201 gggccgtcgt tgacgccgcg ccgatacaga cgaccaaaaa ggtatccctt gcagctctta 1261 gaaactgaat tc // LOCUS MZEPPDK1F2 265 bp ds-DNA PLN 31-JUL-1990 DEFINITION Z.mays pyruvate,orthophosphate dikinase (PPDK1) gene, 3' end. ACCESSION X14929 KEYWORDS pyruvate,orthophosphate dikinase. SEGMENT 2 of 2 SOURCE Z.mays mays (strain B73) DNA, clones H2-lambda-[21,13]. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 265) AUTHORS Glackin,C.A. and Grula,J.W. TITLE Organ-specific transcripts of different size and abundance derive from the same pyruvate,orthophosphate dikinase gene in maize JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.A.Glackin, 31-MAR-1989, for release after publication. FEATURES from to/span description pept < 1 23 pyruvate,orthophosphate dikinase (PPDK1) (EC 2.7.9.1) (AA at 3) BASE COUNT 62 a 62 c 50 g 91 t ORIGIN Undetermined number of base pairs after segment 1. 1 ctgcagctca ggtgcttgtc tgaggctgcc tcctcgttgg caaccggatt gcctgctgct 61 ggtggatgtg gtgatcaaca gtattattac agagccatgc tatgtgaaca ttactagtag 121 cagtgctcat aaaagctaca atcccatctc cctttttttt ccagtcatgt aaaacttcca 181 aactgctcca tggttcaaaa ctctgttctt caatacatca tcaattatcg attatatatg 241 tggcagaggt tttttctttt ctttc // LOCUS MZEPPDK2F 322 bp ds-DNA PLN 31-JUL-1990 DEFINITION Z.mays pyruvate,orthophosphate dikinase (PPDK2) gene, 3' end. ACCESSION M32081 KEYWORDS pyruvate,orthophosphate dikinase. SOURCE Z.mays mays (strain B73) DNA, clone H2-lambda-23. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 258) AUTHORS Glackin,C.A. and Grula,J.W. TITLE Organ-specific transcripts of different size and abundance derive from the same pyruvate,orthophosphate dikinase gene in maize JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 322) AUTHORS Glackin,C.A. and Grula,J.W. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1],[2] kindly submitted by C.A.Glackin, 13-FEB-1990. Author address: C.A.Glackin Phytogen 101 Waverly Dr. Pasadena, CA 91105 FEATURES from to/span description pept < 1 23 pyruvate,orthophosphate dikinase (PPDK2) (EC 2.7.9.1) (AA at 3) mRNA < 1 322 PPDK2 mRNA BASE COUNT 75 a 74 c 74 g 99 t ORIGIN 1 ctgcagctca ggtgcttgtc tgaggggctg cctcctcgtt ggcagcctgc tgttggtgca 61 tgctggtgat taataatact actatgacag agccatatgc tctgtgaaga gtattagtag 121 cagcgctcat aaaagctaca gttccatcta ttttctcagc tatgtaaaac ttccaaactg 181 ttcatgctta aaactgaggg ttttcgtggt gtgagatgtg catgtcgttg ttgaggccat 241 tgctgcacat tccacctatt gaggccctcc tcaaattaag cctcgaacaa gctgatcatc 301 ttttctgaga actctagact cg // LOCUS MZEPPDKLR 246 bp ss-mRNA PLN 31-JUL-1990 DEFINITION Z.mays pyruvate,orthophosphate dikinase (PPDK1) mRNA, 3' end. ACCESSION X14928 KEYWORDS pyruvate,orthophosphate dikinase. SOURCE Z.mays mays (strain B73) leaf and root, cDNA to mRNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 246) AUTHORS Glackin,C.A. and Grula,J.W. TITLE Organ-specific transcripts of different size and abundance derive from the same pyruvate,orthophosphate dikinase gene in maize JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3004-3008 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.A.Glackin, 31-MAR-1989, for release after publication. FEATURES from to/span description pept < 1 23 pyruvate,orthophosphate dikinase (PPDK1) (EC 2.7.9.1) (AA at 3) BASE COUNT 61 a 59 c 48 g 78 t ORIGIN 1 ctgcagctca ggtgcttgtc tgaggctgcc tcctcgttgg caaccggatt gcctgctgct 61 ggtggatgtg gtgatcaaca gtattattac agagccatgc tatgtgaaca ttactagtag 121 cagtgctcat aaaagctaca atcccatctc cctttttttt ccagtcatgt aaaacttcca 181 aactgctcca tggttcaaaa ctctgttctt caatacatca tcaattatcg attatatatg 241 tggcag // LOCUS BABAPOAI 860 bp ss-mRNA PRI 31-JUL-1990 DEFINITION Baboon apolipoprotein A-I (ApoA-I) mRNA, complete cds. ACCESSION M35634 KEYWORDS apolipoprotein A-I. SOURCE Baboon liver, cDNA to mRNA. ORGANISM Papio hamadryas Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Cercopithecidae; Cercopithecinae. REFERENCE 1 (bases 1 to 860) AUTHORS Hixson,J.E., Borenstein,S., Cox,L.A., Rainwater,D.L. and VandeBerg,J.L. TITLE The baboon gene for apolipoprotein A-I: Characterization of a cDNA clone and identification of DNA polymorphisms for genetic studies of cholesterol metabolism JOURNAL Gene 74, 483-490 (1988) STANDARD simple staff_review FEATURES from to/span description pept 3 806 apolipoprotein A-I (ApoA-I) mRNA < 1 860 ApoA-I mRNA BASE COUNT 191 a 265 c 286 g 118 t ORIGIN 1 ggatgaaagc tacggtgctg accttggccg tgctcttcct gacggggagc caggctcggc 61 atttctggca gcaagatgaa cccccacaga ccccctggga tcgagtgaag gacctggtca 121 ctgtgtacgt ggaagcactc aaagacagcg gcaaagacta tgtgtcccag tttgaaggct 181 ccgccttggg aaaacagcta aacctaaagc tcctggacaa ctgggacagc gtgacctcca 241 ccgtcagcaa gctgcgcgaa cagctcggcc cggtgaccca ggagttctgg gataacctgg 301 aaaaggagac agagggcctg aggcaggaga tgagcaagga tctggaggag gtgaaggcca 361 aggtgcagcc ctacctggac gacttccaga agaagtggca ggaggagatg gagctctacc 421 gccagaaggt ggagccgctg cgcgcggagc tccacgaggg cacgcgccag aagctgcacg 481 agctgcacga gaaactgagc ccgctggggg aggaggtgcg cgaccgcgcg cgcgcccacg 541 tggacgcgct gcgcacgcat ctggccccct acagcgatga gctgcgccag cgcctggccg 601 cgcgccttga ggcgctcaag gagaacggcg gcgcccgcct ggcggagtac cacgccaagg 661 ccagcgagca tctgagcacg ctcagcgaga aggccaagcc cgcgctcgag gacctccgcc 721 agggcctgct gcccgtgctg gagagcttca aggtcagctt cctgagcgct ctcgaggagt 781 acactaagaa gctcagcacc cagtgaggcg cccgccgccg ccccccttcc cggtgctcag 841 aataaacgtt tccgaagtgg // LOCUS DROARSDA 721 bp ds-DNA INV 31-JUL-1990 DEFINITION D.melanogaster autonomically replicating sequence (ARS). ACCESSION M35633 KEYWORDS autonomically replicating sequence. SOURCE D.melanogaster (strain Dm665) DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 721) AUTHORS Gragerov,A.G., Danilevskaya,O.N., Didichenko,S.A. and Kaverina,E.N. TITLE Structure of ARS elements from Drosophila melanogaster telomeres JOURNAL Genetika 24, 407-414 (1988) STANDARD simple staff_review FEATURES from to/span description site 574 584 core sequence BASE COUNT 237 a 168 c 110 g 206 t ORIGIN 1 cctcgctata tcaacgtctt cgtgccgcga tccaactgga cggagccatg ttcgggacca 61 ccttcccgca catgttcttg acatgttagc tgccgaactt tagaccccag ccgctcctct 121 ctgctatgca ggagtatatg taagccgagt atacatgggt acatactttc caatacgcca 181 cctcaaaagt tacattgctg tacttttatt acttgcatgt ccttctgggg aataattttg 241 atccagcatt cgctgcgagg tgtgggaaac caggtgggaa acccagtaaa acccaaccag 301 atcaattgaa catattattt acataaattc agtttgcaaa caattttaac taaaatcaat 361 attattcaac tttgtagaac ctcaaaatta tcaataaact taattaaatt taatttaatg 421 atactttata tcacgacccc cagttcagtc tgtttcttca tttcaatcta agcatatatt 481 gttttaattc attctgagag tccaagccca cttctcaatg taatgaatct tgcagatggt 541 aacggcttag ttagtatatc agcaagttgt aaataaatat aaataaatat aacaatcaaa 601 taacgcactt acctccttga ccacagccaa tccaacgcaa ccgaaaacag gagacgggcc 661 ccgcaaacgc aaattaaaat tgccaacatt accgattata actacaaaaa actaacaagc 721 t // LOCUS DROARSDB 275 bp ds-DNA INV 31-JUL-1990 DEFINITION D.melanogaster autonomously replicating sequence (ARS). ACCESSION M36876 KEYWORDS autonomously replicating sequence. SOURCE D.melanogaster (strain Dm665) deletion mutant pSL41 DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 275) AUTHORS Gragerov,A.G., Danilevskaya,O.N., Didichenko,S.A. and Kaverina,E.N. TITLE Structure of ARS elements from Drosophila melanogaster telomeres JOURNAL Genetika 24, 407-414 (1988) STANDARD simple staff_review FEATURES from to/span description site 146 156 core sequence BASE COUNT 118 a 56 c 34 g 67 t ORIGIN 1 tttttgcata ataatttcaa ttgtaaaatg taaacaaacc acgcaatatg ataatgttac 61 cagtccagat tactgtcaaa aacctagtac gaaaaaaata ctaactagga ccaaactatt 121 atttaagctt tataatattt ataaaataat ataaaagctg gaattgccgc ttgaccacag 181 ccaatccaac gcaaccgaaa acaggagacg ggccccgcaa acgcaaatta aaattgccaa 241 cattaccgat tataactaca aaaaactaac aagct // LOCUS DROARSDC 290 bp ds-DNA INV 31-JUL-1990 DEFINITION D.melanogaster autonomously replicating sequence (ARS). ACCESSION M36877 KEYWORDS ARS element. SOURCE D.melanogaster (strain Dm665) deletion mutant pSL52 DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 290) AUTHORS Gragerov,A.G., Danilevskaya,O.N., Didichenko,S.A. and Kaverina,E.N. TITLE Structure of ARS elements from Drosophila melanogaster telomeres JOURNAL Genetika 24, 407-414 (1988) STANDARD simple staff_review FEATURES from to/span description site 81 90 core sequence BASE COUNT 103 a 58 c 52 g 77 t ORIGIN 1 agattactgt caaaaaccta gtacgaaaaa aatactaact aggaccaaac tattatttaa 61 gctttataat atttataaaa taatataaaa gctggaattg ccgctttgat gcgtttgagg 121 tctttgtaaa gggcatcgag tcctcggacc attaagggtt gctgccctgt cgaaactctg 181 acgtttgatt tgtttttggc ccaacgcaac cgaaaacagg agacgggccc cgcaaacgca 241 aattaaaatt gccaacatta ccgattataa ctacaaaaaa ctaacaagct // LOCUS GIBINVOL 1569 bp ds-DNA PRI 31-JUL-1990 DEFINITION H.lar involucrin gene, complete cds. ACCESSION M35447 KEYWORDS involucrin. SOURCE H.lar keratinocyte DNA, clones 1 and 2. ORGANISM Hylobates lar Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hylobatidae. REFERENCE 1 (bases 1 to 1569) AUTHORS Djian,P. and Green,H. TITLE The involucrin gene of the gibbon: The middle region shared by the hominoids JOURNAL Mol. Biol. Evol. 7, 220-227 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1 1569 involucrin BASE COUNT 472 a 389 c 550 g 158 t ORIGIN 1 atgtcccagc aacacacact gccagtgacc ctctcccctg ccctcagtca ggagctcctc 61 aagactgttc ctcctccagt caatacccag caggagcaaa tgaaacagcc aactccactg 121 cctcccccat gccagaaggt gcttggtgag ctcccagtgg aggtcccatc aaagcaagag 181 gaaaagcaca tgactactgt aaaggggctg cctgagcaag aatgtgagca acagcagcag 241 gagccacagg agcaggagct gcagcaacag cactgggaac agcatgagga acatcagaaa 301 gcagaaaacc cagagcagca gcttaagcag gagaaagcac aaagggatca gcagctaaac 361 gaacatctgg aagaagagaa gaagctctta gaccagcaac tgaatcaaga gctaatcaag 421 agagatgagc aactgggaat aaagaaagag caactcttgg agctcacaga gcagcaggag 481 gggcagttgg agcacctaga acagcaggag gggcagctgg agctcccaga gcagcaggag 541 gggcagctgg agcacctgga gcagcaggag gggcagctga agcacctgga tcagcagggg 601 aagcagccag agctcccaga gcagcaggtg gcacagctga agcacctgga gcagcaggag 661 gggcagctga agcatctgga gcaccagaag ggggagctgc aggtcccaga ggagcaggtg 721 gggcagctga agtacctgga acagcaggag gggcagctga agcacctgga tcagcaggag 781 aagcagccag agctcccaga gcagcaggtg gggcagctga agcacctgga gcagcaggag 841 gggcagctgg agcacatgga gcaccaggaa gggcagctgg ggctcccaga gcagcaggtg 901 gggcagctga agcagctgga ggagcaggag ggacagccaa agcacctgga ggaggaggag 961 gggcagctga agcacctggt gcagcaggag gggcagctgg agcatctggt gcagcaggag 1021 aggcagctgg agcagcagga ggggaaggtg cagcacctgg agcagcaggt ggagcagctg 1081 aagcacctag aggagcagga ggggcagctg aagcatctgg agcagcagca ggggcagctg 1141 gaggtctcag agcagcaggt ggggcagcca aagcacctgg agcaggaggg gaagcagctg 1201 gagctcccag agcagcaaga gggccagctg aagcacctgg agaagcagga ggcacagctg 1261 gagctcccag agcagcaggt aggacagcca aagcacccgg agcaacagga gaagcagcta 1321 gagcacccag agcagcagga gggacaacta aaacatctgg agcagcagga ggggcagctg 1381 aaagacctgg agcagcagaa ggggcagctg gagcagcagc aggggcagct ggagcagcct 1441 gtgtttgccc cagctccagg ccaggtccaa gacattcaac cagtcctgcc cacaaaggga 1501 gaagcattgc ttcctgtaga gcagcagcag cagaagcagg aggtgcagtg gccacccaaa 1561 cataaataa // LOCUS HUMHEMBP 822 bp ss-mRNA PRI 31-JUL-1990 DEFINITION Human eosinophil major basic protein mRNA, complete cds. ACCESSION M35670 KEYWORDS major basic protein. SOURCE Human eosinophil, cDNA to mRNA, clone MBP-lambda-9. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 822) AUTHORS McGrogan,M., Simonsen,C., Scott,R., Giffith,J., Ellis,N., Kennedy,J., Campanelli,D., Nathan,C. and Gabay,J. TITLE Isolation of a complementary DNA clone encoding a precursor to human eosinophil major basic protein JOURNAL J. Exp. Med. 168, 2295-2308 (1988) STANDARD simple staff_review FEATURES from to/span description pept 49 717 eosinophil major basic protein precursor sigp 49 94 eosinophil major basic protein signal peptide matp 95 714 eosinophil major basic protein mRNA < 1 822 eosinophil major basic protein mRNA BASE COUNT 173 a 219 c 236 g 194 t ORIGIN 1 gggaagatct aaagacccag gaaggtctct gggtgggata aagccaagat gaaactcccc 61 ttacttctgg ctcttctatt tggggcagtt tctgctcttc atctaaggtc tgagacttcc 121 acctttgaga cccctttggg tgctaagacg ctgcctgagg atgaggagac accagagcag 181 gagatggagg agaccccttg cagggagctg gaggaagagg aggagtgggg ctctggaagt 241 gaagatgcct ccaagaaaga tggggctgtt gagtctatct cagtgccaga tatggtggac 301 aaaaacctta cgtgtcctga ggaagaggac acagtaaaag tggtgggcat ccctgggtgc 361 cagacctgcc gctacctcct ggtgagaagt cttcagacgt ttagtcaagc ttggtttact 421 tgccggaggt gctacagggg caacctggtt tccatccaca acttcaatat taattatcga 481 atccagtgtt ctgtcagcgc gctcaaccag ggtcaagtct ggattggagg caggatcaca 541 ggctcgggtc gctgcagacg ctttcagtgg gttgacggca gccgctggaa ctttgcgtac 601 tgggctgctc accagccctg gtcccgcggt ggtcactgcg tggccctgtg tacccgagga 661 ggctactggc gtcgagccca ctgcctcaga agacttcctt tcatctgttc ctactgagct 721 ggtcccagcc gacagttcag agctgccctc tcctgggcat ggcctcccct cctctgcttg 781 ccatccctcc ctccacctcc ctgcaataaa atgggtttta ct // LOCUS HUMIGHVX 529 bp ds-DNA PRI 31-JUL-1990 DEFINITION Human Ig germline H-chain gene V-III region subgroup, exons 1 and 2. ACCESSION M35415 KEYWORDS germline; immunoglobulin heavy chain; variable region; variable region subgroup VH-III. SOURCE Human fetal liver DNA, clone lambda-VH-26. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 529) AUTHORS Matthyssens,G. and Rabbitts,T.H. TITLE Structure and arrangement of human heavy chain variable region genes JOURNAL immune system 1, 132-138 (1981) STANDARD simple staff_review FEATURES from to/span description pept 29 74 Ig heavy chain V-region precursor, exon 1 /hgml_locus_uid="LK0080M" /nomgen="IGHV" /map="14q32.33" 179 / 483 Ig H-chain V-region precursor, exon 2 sigp 29 74 Ig H-chain signal peptide 179 189 Ig H-chain signal peptide matp 190 / 483 Ig H-chain IVS 75 178 Ig H-chain V-region intron A iDNA 484 > 529 Ig iDNA BASE COUNT 131 a 106 c 165 g 127 t ORIGIN 1 gatcaggact gaacagagag aactcaccat ggagtttggg ctgagctggc tttttcttgt 61 ggctatttta aaaggtaatt catggagaaa tagaaaaatt gagtgtggaa tggataagag 121 tgagagaaac agtggatacg tgtggcagtt tctgaccagg gtttcttttt gtttgcaggt 181 gtccagtgtg aggtgcagct gttggagtct gggggaggct tggtacagcc tggggggtcc 241 ctgagactct cctgtgcagc ctctggattc acctttagca gctatgccat gagctgggtc 301 cgccaggctc cagggaaggg gctggagtgg gtctcagcta ttagtggtag tggtggtagc 361 acatactacg gagactccgt gaagggccgg ttcaccatct caagagacaa ttccaagaac 421 acgctgtatc tgcaaatgaa cagcctgaga gccgaggaca cggccgtata ttactgtgcg 481 aaagacacag tgagggaagt cattgtgagc ccagacacaa acctccctg // LOCUS HUMPRAPP 256 bp ss-mRNA PRI 31-JUL-1990 DEFINITION Human amyloid beta precursor protein (ABPP) mRNA, 5' end. ACCESSION M35675 KEYWORDS amyloid beta. SOURCE Human fetal muscle, cDNA to mRNA, clone lambda-HAP2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 256) AUTHORS Mita,S., Sadlock,J., Herbert,J. and Schon,E.A. TITLE A cDNA specifying the human amyloid beta precursor protein (ABPP) encodes a 95-kDa polypeptide JOURNAL Nucleic Acids Res. 16, 9351-9351 (1988) STANDARD simple staff_review FEATURES from to/span description pept 8 > 256 amyloid beta precursor protein (ABPP) /hgml_locus_uid="LG0136J" /nomgen="APP" /map="21q21.2" BASE COUNT 42 a 77 c 93 g 44 t ORIGIN 1 ggaggatatg aagtccatct tcacttcaga gatctcctcc gtcttgatat ttgtcaaccc 61 agaacctggt cgagtggtca gtttcctcgg cagcggtagg cgagagcacg cggaggagcg 121 tgcgcggggg ccccgggaga cggcggcggt ggcggcgcgg gcagagcaag gacgcggcgg 181 atcccactcg cacagcagcg cactcggtgc cccgcgcagg gtcgcgatgc tgcccggttt 241 ggcactgctc ctgctg // LOCUS MUSRP32A 901 bp ds-DNA ROD 31-JUL-1990 DEFINITION Mouse ribosomal protein L32' (L32') gene, complete cds. ACCESSION M35397 KEYWORDS ribosomal protein L32'. SOURCE Mouse cell line L1210 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 901) AUTHORS Jacks,C.M., Biltz,R.E. and Hackett,P.B. TITLE Analysis of potential expression of highly related members of the ribosomal protein L32 gene family JOURNAL Nucleic Acids Res. 16, 10751-10764 (1988) STANDARD simple staff_review FEATURES from to/span description pept 255 662 ribosomal protein L32' mRNA 200 > 695 L32' mRNA (alt.) mRNA 203 > 695 L32' mRNA (alt.) mRNA 204 > 695 L32' mRNA (alt.) mRNA 207 > 695 L32' mRNA (alt.) mRNA 246 > 695 L32' mRNA (alt.) mRNA 247 > 695 L32' mRNA (alt.) BASE COUNT 290 a 203 c 210 g 198 t ORIGIN 1 gtccaacaag tatttccttg gccaggctta gcgttcttcg agtttatctt ctagggggaa 61 aaaatcaact agcaacatat aaaaaggtct tatttgtcca ggaaaaaaaa aggaagcaac 121 attatagatt gactcttgca tagtatgtat tttggaatcc gagtaatgca acagtataat 181 gtggttaaga acaaagactc tggctccctc ctcggcactg cctacgaggt ggctaccttc 241 tgttatgcag catcatggct gccctccggc ctctggtgaa gcccaagatc gtcaaaaaga 301 ggaccaaaaa gttcatcaga caccagtcag accgatatgt gaaaattaag tggaactgga 361 ggaaacccag aggcatcgac aacagggtgc ggagaaggtt caagggccag atcctgatgc 421 ccaacatcgg ttacaggagc aacaagaaaa ccaagcacac gctgtctagt ggcttccgca 481 agttcctggt ccacaatatc aaggagccgg aggtgctgtt gatgtgcaac aaatcttacc 541 gtgctgagat tgctcacaac gtgtcctcta agaaccgaaa agccattgta gaaagagcag 601 cacagctggc catcagagtc accaatccca acaccaggct acacagcgaa gaaaatgaat 661 agatggcttg tgtacatgtt ttgtgtttaa ataaaatcac aaaaactgga aaaaaaaaag 721 aataaagact ctgaggctgg aagagggtgt caaatgcccc tggaactgga gttacagaca 781 attgtgagcc accccccccc catgggagct ggtgatggaa cttgggtccc ctggaagaga 841 ggcaagcact attaattatg gagccatttc ttcagccaaa catcccatat tttaatccga 901 a // LOCUS SPIB559AC 597 bp ds-DNA ORG 31-JUL-1990 DEFINITION S.oleracea chloroplast apocytochrome b-559 gene, complete cds. ACCESSION M35673 KEYWORDS apocytochrome b-559. SOURCE S.oleracea chloroplast DNA. ORGANISM Chloroplast Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae; Spinacia oleracea. REFERENCE 1 (bases 1 to 597) AUTHORS Herrmann,R.G., Alt,J., Schiller,B., Widger,W.R. and Cramer,W.A. TITLE Nucleotide sequence of the gene for apocytochrome b-559 on the spinach plastid chromosome: Implications for the structure of the membrane protein JOURNAL FEBS Lett. 176, 239-244 (1984) STANDARD simple staff_review FEATURES from to/span description pept 152 403 apocytochrome b-559 pept 412 531 ORF 39 BASE COUNT 173 a 120 c 117 g 187 t ORIGIN 1 attcggtata ctctaaagac acctttggta ctatattggc aatcctacaa gtattttacg 61 tttttacgtg gagtaaaaag taaatagaaa ttgactgatt tcatctttta cggatttgcc 121 tttgactgta caagaatatg tggagctcag tatgtctgga agcacaggag aacgttcttt 181 tgctgatatt attaccagta ttcgatactg ggttattcat agcattacta taccttccct 241 attcattgcg ggttggttat tcgtcagcac aggtttagct tacgatgtgt ttggaagccc 301 tcgtccaaac gaatatttca cagagagccg acaaggaatt ccattaataa ctggccgttt 361 tgactctttg gaacaacttg atgaatttag tagatccttt taggaggccc aatgaccata 421 gatcgaacct atccaatttt tacagtgcgc tggctggcta ttcacggact ggctgtacct 481 accgtttctt ttttggggtc aatatccgca atgcagttca tccaacgata aacaaaaccg 541 aatccgaatt atagagctac gacacaatca aacccgaacg aacaaaatgt tgaattg // LOCUS SYNQBRNA 672 bp ss-RNA SYN 31-JUL-1990 DEFINITION Synthetic lambda-N protein RNA. ACCESSION M35652 KEYWORDS lambda-N protein. SOURCE Synthetic RNA, clone pDM1001. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 672) AUTHORS Mills,D.R. TITLE Engineered recombinant messenger RNA can be replicated and expressed inside bacterial cells by an RNA bacteriophage replicase JOURNAL J. Mol. Biol. 200, 489-500 (1988) STANDARD simple staff_review FEATURES from to/span description RNA 157 516 lambda N-protein complementary RNA BASE COUNT 122 a 190 c 183 g 177 t ORIGIN 1 gggaggagag agggcaaagc agatccccct ctcactcgta agagtaattg tgataattta 61 ttcacaatta ggcgccatat tggcgccctc ccccaggtta cgcctcgtgt agagacgcaa 121 ccttcgtgcc ctggaagacc ccgtagggcc tcgaggccaa tatctaagta actagataag 181 aggaatcgat tttcccttaa ttttctggcg tccactgcat gttatgccgc gttcgccagg 241 cttgctgtac catgtgcgct gattcttgcg ctcaatacgt tgcaggttgc tttcaatctg 301 tttgtggtat tcagccagca ctgtaaggtc tatcggattt agtgcgcttt ctactcgtga 361 tttcggtttg cgattcagcg agagaatagg gcggttaact ggttttgcgc ttaccccaac 421 caacagggga tttgctgctt tccattgagc ctgtttctct gcgcgacgtt cgcggcggcg 481 tgtttgtgca tccatctgga ttctcctgtc agttagcctc gaggcgcttt cgcgctctcc 541 caggtgacgc ctcgtgaaga ggcgcgacct tcgtgcgttt cggcgacgca cgagaaccgc 601 cacgctgctt cgcagcgtgg ccccttcgcg cagcccgctg cgcgaggtga ccccccgaag 661 ggggcttccc ca // LOCUS SYNT4ENDV 424 bp ds-DNA SYN 31-JUL-1990 DEFINITION Synthetic T4 endonuclease V (T4endV) gene, complete cds. ACCESSION M35392 KEYWORDS T4 endonuclease V. SOURCE Synthetic gene, expressed in E.coli. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 424) AUTHORS Inaoka,T., Miura,K. and Ohtsuka,E. TITLE Chemical synthesis of the T4 endonuclease V gene and its expression in Escherichia coli JOURNAL Nucleic Acids Symp Ser 17, 105-108 (1986) STANDARD simple staff_review FEATURES from to/span description pept 5 421 T4 endonuclease V (T4endV) BASE COUNT 120 a 90 c 86 g 128 t ORIGIN 1 cgatatgacg cgtatcaacc ttactttagt atccgagtta gctgaccaac acttaatggc 61 tgaataccgt gaattgccgc gtgtttttgg tgcagttcgt aagcacgtag caaacggtaa 121 acgtgttcgt gacttcaaaa tcagtcctac ttttatcctt ggcgcaggtc atgttacatt 181 cttctacgat aagctcgagt tcttacgcaa gcgtcaaatt gagcttatag ctgaatgttt 241 gaaacgtggc ttcaatatca aggatactac agtccaggac atcagtgaca ttcctcaaga 301 attccgtggt gattatattc cccatgaagc ttctattgct atatcacaag ctcgtttaga 361 tgaaaaaatt gcacaacgtc ctacttggta caaatactac ggtaaggcga tttatgcatg 421 atag // LOCUS TOMACS 1819 bp ss-mRNA PLN 31-JUL-1990 DEFINITION Tomato 1-aminocyclopropane-1-carboxylate synthase mRNA, complete cds. ACCESSION M34289 KEYWORDS 1-aminocyclopropane-1-carboxylate synthase. SOURCE Tomato (strain Mill., cv. Orlando), fruit, cDNA to mRNA. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 1819) AUTHORS Van Der Straeten,D., Van Wiemeersch,L., Goodman,H.M. and Van Montagu,M. TITLE Cloning and sequence of two different cDNAs encoding 1-aminocyclopropane-1-carboxylate synthase in tomato JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4859-4863 (1990) STANDARD unannotated staff_entry FEATURES from to/span description pept 154 1611 1-aminocyclopropane-1-carboxylate synthase mRNA < 1 1819 1-aminocyclopropane-1-carboxylate synthase mRNA BASE COUNT 610 a 285 c 365 g 559 t ORIGIN 1 ccaaacacat aatactttta atacaattag ttatttatta gaagtattta aagtaaagca 61 cttgtgagtt gtgtacattt tattaatctt catcttctta attctcttca gtttttaatt 121 tcttcacttc taaactcatt tagtaaaaaa aaaatgggat ttgagattgc aaagaccaac 181 tcaatcttat caaaattggc tactaatgaa gagcatggcg aaaactcgcc atattttgat 241 gggtggaaag catacgatag tgatcctttc caccctctaa aaaaccccaa cggagttatc 301 caaatgggtc ttgctgaaaa tcagctttgt ttagacttga tagaagattg gattaagaga 361 aacccaaaag gttcaatttg ttctgaagga atcaaatcat tcaaggccat tgccaacttt 421 caagattatc atggcttgcc tgaattcaga aaagcgattg cgaaatttat ggagaaaaca 481 agaggaggaa gagttagatt tgatccagaa agagttgtta tggctggtgg tgccactgga 541 gctaatgaga caattatatt ttgtttggct gatcctggcg atgcattttt agtaccttca 601 ccatactacc cagcatttaa cagagattta agatggagaa ctggagtaca acttattcca 661 attcactgtg agagctccaa taatttcaaa attacttcaa aagcagtaaa agaagcatat 721 gaaaatgcac aaaaatcaaa catcaaagta aaaggtttga ttttgaccaa tccatcaaat 781 ccattgggca ccactttgga caaagacaca ctgaaaagtg tcttgagttt caccaaccaa 841 cacaacatcc accttgtttg tgacgaaatc tacgcagcca ctgtctttga cacgcctcaa 901 ttcgtcagta tagctgaaat cctcgatgaa caggaaatga cttactgcaa caaagattta 961 gttcacatcg tctacagtct ttcaaaagac atggggttac caggatttag agtcggaatc 1021 atatattctt ttaacgacga tgtcgttaat tgtgctagaa aaatgtcgag tttcggttta 1081 gtatctacac aaacgcaata ttttttagcg gcaatgccat cggacgaaaa attcgtcgat 1141 aattttctaa gagaaagcgc gatgaggtta ggtaaaaggc acaaacattt tactaatgga 1201 cttgaagtag tgggaattaa atgcttgaaa aataatgcgg ggcttttttg ttggatggat 1261 ttgcgtccac ttttaaggga atcgactttc gatagcgaaa tgtcgttatg gagagttatt 1321 ataaacgatg ttaagcttaa cgtctcgctt ggatcttcgt ttgaatgtca agagccaggg 1381 tggttccgag tttgttttgc aaatatggat gatggaacgg ttgatattgc gctcgcgagg 1441 attcggaggt tcgtaggtgt tgagaaaagt ggagataaat cgagttcgat ggaaaagaag 1501 caacaatgga agaagaataa tttgagactt agtttttcga aaagaatgta tgatgaaagt 1561 gttttgtcac cactttcgtc acctattcct ccctcaccat tagttcgtta agacttaatt 1621 aaaagggaag aatttaattt atgttttttt atattttgaa aaaaatttgt aagaataaga 1681 ttataatagg aaaagaaaat aagtatgtag gatgaggagt attttcagaa atagttgtta 1741 gcgtatgtat tgacaactgg tctatgtact tagacatcat aatttgtctt agctaattaa 1801 tgaatgcaaa agtgaagtt // LOCUS TTHRGG 873 bp ds-DNA BCT 31-JUL-1990 DEFINITION T.thermophilus 23S ribosomal RNA gene. ACCESSION M35674 KEYWORDS 23S ribosomal RNA. SOURCE T.thermophilus (strain HB8; ATCC 27634) DNA. ORGANISM Thermus thermophilus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 873) AUTHORS Hartmann,R.K., Vogel,D.W., Kroeger,B., Ulbrich,N. and Erdmann,V.A. TITLE Studies on rDNA from the extreme thermophilic eubacterium Thermus thermophilus HB8 JOURNAL FEBS Lett. 218, 215-221 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 873 23S RNA gene BASE COUNT 167 a 248 c 321 g 137 t ORIGIN 1 ctggccgtga agatgcggcc tacccgtggc aggacgaaaa gaccccgtgg agctttactg 61 cagcctggtg ttggctcttg gtcgcgcctg cgtaggatag gtgggagcct gtgaaccccc 121 gcctccgggt gggggggagg cgccggtgaa ataccaccct ggcgcggctg ggggcctaac 181 cctcggatgg ggggacagcg cttggcgggc agtttgactg gggcggtcgc ctcctaaaag 241 gtaacggagg cgcccaaagg tcccctcagg cgggacggaa atccgccgga gagcgcaagg 301 gtagaagggg gcctgactgc gaggcctgca agccgagcag gggcgaaagc cgggcctagt 361 gaaccggtgg tcccgtgtgg aagggccatc gatcaacgga taaaagttac cccggggata 421 acaggctgat ctcccccgag cgtccacagc ggcggggagg tttggcacct cgatgtcggc 481 tcgtcgcatc ctggggctga agaaggtccc aagggttggg ctgttcgccc attaaagcgg 541 cacgcgagct gggttcagaa cgtcgtgaga cagttcggtc tctatccgcc acgggcgcag 601 gaggcttgag gggggctctt cctagtacga gaggaccgga agggacgcac ctctggtttc 661 ccagctgtcc ctccaggggc ataagctggg tagccatgtg cggaagggat aaccgctgaa 721 agcatctaag cgggaagccc gccccaagat gaggcctccc acggcgtcaa gccggtaagg 781 acccgggaag accacccggt ggatgggccg ggggtgtaag cgccgcgagg cgttgagccg 841 accggtccca atcgtccgag gtcttgaccc ctc // LOCUS WHTCPRS2A 780 bp ds-DNA ORG 31-JUL-1990 DEFINITION T.aestivum chloroplast ribosomal protein S2 gene, complete cds. ACCESSION M35396 KEYWORDS ribosomal protein S2. SOURCE T.aestivum (cv. Mardler) chloroplast DNA. ORGANISM Chloroplast Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Triticum aestivum. REFERENCE 1 (bases 1 to 780) AUTHORS Hoeglund,A.-S. and Gray,J.C. TITLE Nucleotide sequence of the gene for ribosomal protein S2 in wheat chloroplast DNA JOURNAL Nucleic Acids Res. 15, 10590-10590 (1987) STANDARD simple staff_review FEATURES from to/span description pept 46 756 ribosomal protein S2 BASE COUNT 266 a 118 c 164 g 232 t ORIGIN 1 ccgtaatgga atggtaggat taaaaaaaaa ggtaatgtgg aaaaaatgac aagaagatat 61 tggaacatta atttgaaaga gatgatagaa gcaggagttc attttggtca tggtattaag 121 aaatggaatc ctaaaatggc cccttacatt tcggcaaagc gtaaaggtac tcatattata 181 aatctcgcta gaacggctcg ttttttatca gaagcttgtg atttagtttt tgatgcagca 241 agtcagggaa caagtttctt aattgttggt accaaaaaaa gagcaacaga tttagtagca 301 tcagctgcaa taagggctcg ttgtcattat gttaataaaa agtggttcag tggtatgtta 361 acgaattggt cgattacgaa aactagactt tctcaattta gagacttaag agcagaagaa 421 aaaatgggaa aattccacca tctcccaaaa agagatgtgg caatcttgaa gagaaaatta 481 tctaccttgc aaaggtatct cggcgggatc aaatatatga cgagattgcc agacattgtg 541 atcgtccttg atcagcaaaa agagtatata gctcttcggg aatgtgccat tttggggatt 601 cctactattt ctttagtcga tacaaattgt gacccggatc tcgcgaatat atcgattcca 661 gccaacgatg acactatgac ttcaattcga ttgattctta acaaattagt attttcaatt 721 tgtgagggcc gttctctcta tataagaaat cgttgattaa gaatatatag tgaattcttg // LOCUS YSCVDACA 852 bp ss-mRNA PLN 31-JUL-1990 DEFINITION Yeast voltage-dependent anion-selective channel (VDAC) protein mRNA, complete cds. ACCESSION M34907 KEYWORDS voltage-dependent anion-selective channel protein. SOURCE Yeast, cDNA to mRNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 852) AUTHORS Forte,M., Guy,H.R. and Mannella,C.A. TITLE Molecular genetics of the VDAC ion channel: Structural model and sequence analysis JOURNAL J Bioenerg Biomembr 19, 341-350 (1987) STANDARD simple staff_review FEATURES from to/span description pept 1 852 voltage-dependent anion-selective channel (VDAC) protein BASE COUNT 224 a 213 c 172 g 243 t ORIGIN 1 atgtctcctc cagtttacag cgatatctcc agaaatatca atgacctatt gaacaaggat 61 ttctatcatg ctaccccagc tgcctttgat gtgcaaacaa caaccgccaa tggcattaag 121 ttctcattga aggctaaaca gcctgtcaaa gacggtccac tgtctactaa cgtggaagca 181 aagttgaatg acaagcaaac cggcttgggt ctaactcaag gctggtctaa cacaaacaac 241 ttgcaaacca aattagagtt tgccaacttg acccctggtc taaagaacga attgatcact 301 tctttgactc caggcgtcgc caagtccgcc gtcttaaaca ctacgttcac agaacctttc 361 ttcaccgcaa gaggtgcctt tgacttgtgt ttgaagtcac caacatttgt tggtgactta 421 actatggccc acgaaggtat tgttggtggc gcagagtttg gttacgatat cagcgccggt 481 tccatttctc gttatgccat ggctttaagt tatttcgcca aagactactc cttgggcgct 541 acattgaaca acgagcaaat aactaccgtt gacttcttcc aaaacgtcaa cgccttttta 601 caggtcgcta aggctacaat gatgaactgc aaactaccta actccaatgt caacatcgaa 661 ttcgccacta gatatttgcc tgatgcatct tcccaagtta aggctaaggt gtccgattcc 721 ggtattgtca ctttggctta caagcaattg ttaagacctg gcgtcactct gggtgtcggt 781 tcctctttcg atgctttgaa gttgtctgaa cctgttcaca agctaggttg gtctttgtcc 841 ttcgacgctt ga // LOCUS ATHHISH3A 411 bp ds-DNA PLN 31-JUL-1990 DEFINITION A.thaliana histone H3 gene, complete cds. ACCESSION M35387 KEYWORDS histone H3. SOURCE A.thaliana (strain Strasbourg) adult leaf DNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 411) AUTHORS Chaubet,N., Chaboute,M.-E., Philipps,G. and Gigot,C. TITLE Histone genes in higher plants: Organization and expression JOURNAL Dev. Genet. 8, 461-473 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 1 411 histone H3 BASE COUNT 119 a 111 c 109 g 72 t ORIGIN 1 atggctcgta ccaagcagac ggctaggaaa tcaaccggag gaaaggctcc aaggaagcag 61 ctggcgacaa aagcggcgag gaaatctgct ccggccaccg gaggagtgaa gaagccacac 121 agattcagac ccggaactgt tgctttaaga gaaatcagga agtaccagaa gagcactgag 181 cttcttatcc gtaaactccc tttccagagg cttgtcagag agatcgctca ggacttcaaa 241 accgatcttc gtttccagag cagtgccgtc gccgctcttc aggaagctgc tgaagcttac 301 ctcgtcggac tcttcgaaga caccaatctc tgcgccattc acgccaagag ggtcacgatc 361 atgcccaagg acatccagct cgcgagaaga atcagaggag aacgagctta g // LOCUS BLYCHORDA 391 bp ss-mRNA PLN 31-JUL-1990 DEFINITION Barley C-hordein storage protein, 3' end. ACCESSION M35610 KEYWORDS C-hordein; storage protein. SOURCE Barley 20-day old endosperm, cDNA to mRNA, clone pc hor1-3. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 391) AUTHORS Rasmussen,S.K. and Brandt,A. TITLE Nucleotide sequences of cDNA clones for C-hordein polypeptides JOURNAL Carlsberg Res. Commun. 51, 371-379 (1986) STANDARD simple staff_entry FEATURES from to/span description pept < 1 219 C-hordein storage protein mRNA < 1 391 C-hordein storage protein mRNA signal 310 315 polyA signal (put.) signal 365 370 polyA signal (put.) BASE COUNT 134 a 120 c 57 g 80 t ORIGIN 1 tttccccagc cccaggaacc atttccccag caaccacaac aaccattccc actgcaaccg 61 caacaaccat tcccccaaca accacaacaa ccatttcccc agccccaaca accattccgc 121 caacaagcag aactaataat tccccagcaa cctcaacaac cattccctct gcagccacac 181 caaccatata cacaacaaac catctggagt atggtctagt catcaggggc ctatgaagcg 241 acaagttgta atactaaatg ggtggatcat catcctttag tcaacggagt gtttaatgta 301 atgatgataa ataaagtgat gtggaccatc atgtgtaacc ccggcctata ctagttcaaa 361 tgagaataaa agaccaaaga aagttcttgt c // LOCUS BLYCHORDB 255 bp ss-mRNA PLN 31-JUL-1990 DEFINITION Barley C-hordein storage protein, 3' end. ACCESSION M35611 KEYWORDS C-hordein; storage protein. SOURCE Barley 20-day old endosperm, cDNA to mRNA, clone pc-919. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 255) AUTHORS Rasmussen,S.K. and Brandt,A. TITLE Nucleotide sequences of cDNA clones for C-hordein polypeptides JOURNAL Carlsberg Res. Commun. 51, 371-379 (1986) STANDARD simple staff_entry FEATURES from to/span description pept < 1 207 C-hordein storage protein BASE COUNT 97 a 96 c 24 g 38 t ORIGIN 1 tacccacaac aaccacaacc atttccacaa cagcccatcc cacaacaacc acaaccatac 61 ccacaacaac cacaaccatt ttcacaacag cccatcccac aacaaccaca accataccca 121 caacaaccac aaccatttcc acaacaaccc atccctctgc agccacacca accatataca 181 caacaaacca tctggagtat ggtctagtca tcaggggcct atgaagcgac aagttgtaat 241 actaaatggg tggat // LOCUS BOVFGFAA 668 bp ss-mRNA MAM 31-JUL-1990 DEFINITION Bovine acidic eye-derived fibroblast growth factor (EDGF II) mRNA, complete cds. ACCESSION M35608 KEYWORDS acidic eye-derived fibroblast growth factor. SOURCE Bovine retina, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 668) AUTHORS Alterio,J., Halley,C., Brou,C., Soussi,T., Courtois,Y. and Laurent,M. TITLE Characterization of a bovine acidic FGF cDNA clone and its expression in brain and retina JOURNAL FEBS Lett. 242, 41-46 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 91 558 acidic eye-derived fibroblast growth factor BASE COUNT 173 a 184 c 170 g 141 t ORIGIN 1 ggatcctctt tcccttctac tggagaggaa aagccctcag cctgcaagct gttcagcctt 61 gaaacagcca caaccagcag ctgctgagcc atggctgaag gagaaaccac gaccttcacg 121 gccctgactg agaagtttaa cctgcctcta ggcaattaca agaagcccaa gctcctctac 181 tgcagcaacg ggggctactt cctgagaatc ctcccagatg gcacagtgga tgggacgaag 241 gacaggagcg accagcacat tcagctgcag ctctgtgcgg aaagcatagg ggaggtgtat 301 attaagagta cggagactgg ccagttcttg gccatggaca ccgacgggct tttgtacggc 361 tcacagacac ccaatgagga atgtttgttc ctggaaaggt tggaggaaaa ccattacaac 421 acctacatat ccaagaagca tgcagagaag cattggttcg ttggtctcaa gaagaacgga 481 aggtctaaac tcggtcctcg gactcacttc ggccagaaag ccatcttgtt tctccccctg 541 ccagtctcct ctgattaaag aaatctgttg tgggtgctga gccactccag aggaatctga 601 aggggtcctc acctggctga ccccagattg taccctttac cattggccgt gctaacccct 661 ggcccaca // LOCUS CHKGHA 778 bp ss-mRNA VRT 31-JUL-1990 DEFINITION Chicken growth hormone mRNA, complete cds. ACCESSION M35609 KEYWORDS growth hormone; somatotropin. SOURCE Chicken hypophysis, cDNA to mRNA, clone pcGH1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 778) AUTHORS Zhvirblis,G.S., Gorbulev,V.G., Rubtsov,P.M., Karapetyan,R.V., Zhuravlev,I.V., Fisinin,V.I., Skryabin,K.G. and Baev,A.A. TITLE Genetic engineering of peptide hormones: I. Cloning and primary structure of cDNA of chicken growth hormone JOURNAL Mol. Biol. 21, 1324-1328 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 36 686 growth hormone mRNA < 1 778 GH mRNA BASE COUNT 178 a 257 c 196 g 147 t ORIGIN 1 gttcaagcaa cacctgagca actctcccgg caggaatggc tccaggctcg tggttttctc 61 ctctcctcat cgctgtggtc acgctgggac tgccgcagga agctgctgcc accttccctg 121 ccatgcccct ctccaacctg tttgccaacg ctgtgctgag ggctcagcac ctccacctcc 181 tggctgccga gacatataaa gagttcgaac gcacctatat tccggaggac cagaggtaca 241 ccaacaaaaa ctcccaggct gcgttttgtt actcagaaac catcccagct cccacgggga 301 aggatgacgc ccagcagaag tcagacatgg agctgcttcg gttttcactg gttctcatcc 361 agtcctggct gactcccgtg caatacctaa gcaaggtgtt cacgaacaac ttggtttttg 421 gcacctcaga cagagtgttt gagaaactaa aggacctgga agaagggatc caagccctga 481 tgagggagct ggaggaccgc agcccgcggg gcccgcagct cctcagaccc acctacgaca 541 agttcgacat ccacctgcgc aacgaggacg ccctgctgaa gaactacggc ctgctgtcct 601 gcttcaagaa ggatctgcac aaggtggaga cctacctgaa ggtgatgaag tgccggcgct 661 tcggagagag caactgcacc atctgaggcc ccgtgcctgc gccatggctg atggccctgt 721 cccccccccc cccttcctcc ccgtcaccaa aaacacgagg aataaacccc acagcgcc // LOCUS ECOTRYSU3A 85 bp ss-tRNA RNA 31-JUL-1990 DEFINITION E.coli su+III tRNA-Tyr suppressor. ACCESSION M35560 KEYWORDS transfer tRNA-Tyr. SOURCE E.coli (strain MB100) RNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 85) AUTHORS Abelson,J.N., Gefter,M.L., Barnett,L., Landy,A., Russell,R.L. and Smith,J.D. TITLE Mutant tyrosine transfer ribonucleic acids JOURNAL J. Mol. Biol. 47, 15-28 (1970) STANDARD simple staff_entry FEATURES from to/span description tRNA 1 85 suppressor Tyr-tRNA su-III anticdn 35 37 suppressor Tyr-tRNA su-III anticodon cta modified 8 8 s4u modified 17 17 gm modified 38 38 ms2i6a modified 40 40 p modified 54 54 p mut 15 15 g in wild-type; a in su-15 mut 17 17 gm in wild-type; a in su-17 mut 31 31 g in wild-type; a in su-12 BASE COUNT 19 a 28 c 22 g 16 t ORIGIN 1 ggtggggttc ccgagcggcc aaagggagca gactctaaat ctgccgtcat cgacttcgaa 61 ggttcgaatc cttcccccac cacca // LOCUS HUMG6PDC 608 bp ss-mRNA PRI 31-JUL-1990 DEFINITION Human glucose-6-phosphate dehydrogenase (G6PD) mRNA, 3' end. ACCESSION M35604 KEYWORDS glucose-6-phosphate dehydrogenase. SOURCE Human fibroblast, cDNA to mRNA, clone pGD6405. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 608) AUTHORS Toniolo,D., Persico,M.G., Battistuzzi,G. and Luzzatto,L. TITLE Partial purification and characterization of the messenger RNA for human glucose-6-phosphate dehydrogenase JOURNAL Mol. Biol. Med. 2, 89-103 (1984) STANDARD simple staff_entry FEATURES from to/span description pept < 1 103 glucose-6-phosphate dehydrogenase (G6PD) /hgml_locus_uid="LH0033J" /nomgen="G6PD" /map="Xq28" mRNA < 1 608 G6PD mRNA signal 587 592 polyA signal BASE COUNT 143 a 143 c 183 g 138 t 1 others ORIGIN Chromosome Xq28. 1 ggagaagaac cttggggtgc cagggtgggg gcagaagcat gggctgggtt ccggttcatc 61 ctcctccacc ctgccgtgtg tgtgggcaca agaggacatc taaccacctg ctccttggac 121 gaggccccca ggggtggtag aggctggaag gaagccacat caggaggacg ccactccggc 181 ccttcaccct tgccaagtga gctgctcaca gtgtggtcag ggctgcgcgt gctggaggcc 241 ctcctgcctg ggccttgtgg ggcaaatatt gggtccccag gctggaaaga tggacagagg 301 cccaatgggt gaaggctttg aagagcacac agaagcccct ggccccccac gagagctgga 361 gagccatgta tatggcttca aagccaccta cggcagggac acactcgtga gcatgtgtgg 421 cctgcagttc aggtgataca tttaccagtg ttcttgtttg tgtggtgcca ggaaattgat 481 tttggaaaaa gtgaaataac attaaaggtg aatgtgaggc ttctactttt atccaaaagg 541 agctatatta gctaggctgt ttctgatayc caatcattgg tttaacaata aaggcaattt 601 gtttaatc // LOCUS MSGMPB57A 553 bp ds-DNA BCT 31-JUL-1990 DEFINITION M.bovis immunogenic protein MPB57 gene, complete cds. ACCESSION M35389 KEYWORDS immunogenic protein MPB57. SOURCE M.bovis (strain BCG) DNA. ORGANISM Mycobacterium bovis Prokaryota; Bacteria; Firmicutes; Mycobacteria; Mycobacteriaceae. REFERENCE 1 (bases 1 to 553) AUTHORS Yamaguchi,R., Matsuo,K., Yamazaki,A., Nagai,S., Terasaka,K. and Yamada,T. TITLE Immunogenic protein MPB57 from Mycobacterium bovis BCG: Molecular cloning, nucleotide sequence and expression JOURNAL FEBS Lett. 240, 115-117 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 223 525 immunogenic protein MPB57 binding 209 213 MPB57 ribosomal binding site signal 29 34 MPB57 -35 region signal 50 55 MPB57 -10 region BASE COUNT 113 a 170 c 184 g 86 t ORIGIN 1 ttggcaacca ggaagcaagg ggccgccctt gagtgctagc actctcatgt atagagtgct 61 agatggcaat cggctaaccc ctgcgtcggc acccgcgacg acggcgcagg gcgcggacgt 121 cacctggtaa ttcggacggt tcgggcacgc cccggaccga ccgccaactc cggtccgggc 181 gagcgtcccg ggctctgatc caaatagtgg agggctccaa tcgtggcgaa ggtgaacatc 241 aagccactcg aggacaagat tctcgtgcag gccaacgagg ccgagaccac gaccgcgtcc 301 ggtctggtca ttcctgacac cgccaaggag aagccgcagg agggcaccgt cgttgccgtc 361 ggccctggcc ggtgggacga ggacggcgag aagcggatcc cgctggacgt tgcggagggt 421 gacaccgtca tctacagcaa gtacggcggc accgagatca agtacaacgg cgaggaatac 481 ctgatcctgt cggcacgcga cgtcgttggc cgtcgtttca agtagtagag cgtgttcccc 541 ggcgatcccg tcg // LOCUS MUSMTMTAAA 78 bp ds-DNA ORG 31-JUL-1990 DEFINITION Mouse mitochondrial maternally transmitted antigen-alpha (Mta-alpha) mRNA, 5' end. ACCESSION M35494 KEYWORDS maternally transmitted antigen. SOURCE Mouse mitochondrial DNA. ORGANISM Mitochondrion Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; Mus musculus. REFERENCE 1 (bases 1 to 78) AUTHORS Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and Yonekawa,H. TITLE Molecular definition of a mitochondrially encoded mouse minor histocompatibility antigen JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 78 maternally transmitted antigen-alpha (Mta-alpha) (gtg start codon) BASE COUNT 22 a 23 c 8 g 25 t ORIGIN 1 gtgttcttta ttaatatcct aacactcctc gtccccattc taatcgccat agccttccta 61 acattagtag aacgcaaa // LOCUS MUSMTMTABA 78 bp ds-DNA ORG 31-JUL-1990 DEFINITION Mouse mitochondrial maternally transmitted antigen-beta (Mta-beta) mRNA, 5' end. ACCESSION M36861 KEYWORDS maternally transmitted antigen. SOURCE Mouse mitochondrial DNA. ORGANISM Mitochondrion Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; Mus musculus. REFERENCE 1 (bases 1 to 78) AUTHORS Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and Yonekawa,H. TITLE Molecular definition of a mitochondrially encoded mouse minor histocompatibility antigen JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 78 maternally transmitted antigen-beta (Mta-beta) (gtg start codon) BASE COUNT 21 a 24 c 9 g 24 t ORIGIN 1 gtgttcttta ttaatgccct aacactcctc gtccccattc taatcgctat agccttccta 61 acactagtag aacgcaaa // LOCUS MUSMTMTACA 78 bp ds-DNA ORG 31-JUL-1990 DEFINITION Mouse mitochondrial maternally transmitted antigen-gamma (Mta-gamma) mRNA, 5' end. ACCESSION M36862 KEYWORDS maternally transmitted antigen. SOURCE Mouse mitochondrial DNA. ORGANISM Mitochondrion Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; Mus musculus. REFERENCE 1 (bases 1 to 78) AUTHORS Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and Yonekawa,H. TITLE Molecular definition of a mitochondrially encoded mouse minor histocompatibility antigen JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 78 maternally transmitted antigen-gamma (Mta-gamma) (gtg start codon) BASE COUNT 21 a 23 c 9 g 25 t ORIGIN 1 gtgttcttta ttaatgtcct aacactcctc gtccccattc taatcgctat agccttccta 61 acactagtag aacgcaaa // LOCUS MUSMTMTADA 78 bp ds-DNA ORG 31-JUL-1990 DEFINITION Mouse mitochondrial maternally transmitted antigen-delta (Mta-delta) mRNA, 5' end. ACCESSION M36863 KEYWORDS maternally transmitted antigen. SOURCE Mouse mitochondrial DNA. ORGANISM Mitochondrion Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; Mus musculus. REFERENCE 1 (bases 1 to 78) AUTHORS Lindahl,K.F., Hermel,E., Loveland,B.E., Richards,S., Wang,C.-R. and Yonekawa,H. TITLE Molecular definition of a mitochondrially encoded mouse minor histocompatibility antigen JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 563-569 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 78 maternally transmitted antigen-delta (Mta-delta) (gtg start codon) BASE COUNT 22 a 23 c 8 g 25 t ORIGIN 1 gtgttcttta ttaataccct aacactcctc gtccctattc taatcgctat agccttccta 61 acactagtag aacgcaaa // LOCUS MZEHISH3A 411 bp ds-DNA PLN 31-JUL-1990 DEFINITION Z.mays histone H3 gene, complete cds. ACCESSION M35388 KEYWORDS histone H3. SOURCE Z.mays (L. cv. INRA 258) 5-6 day plantlet DNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 411) AUTHORS Chaubet,N., Chaboute,M.-E., Philipps,G. and Gigot,C. TITLE Histone genes in higher plants: Organization and expression JOURNAL Dev. Genet. 8, 461-473 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 1 411 histone H3 BASE COUNT 77 a 154 c 128 g 52 t ORIGIN 1 atggcccgca cgaagcagac ggcgcgcaag tcgacgggcg gcaaggcgcc ccgcaagcag 61 ctggccacca aggcggcgcg caagtcggcg ccggcaaccg gtggcgtgaa gaagcctcac 121 cgcttccgcc ccggcaccgt cgcgctccgg gagattcgca agtaccagaa gagcacggag 181 ctgctcatcc gcaagctgcc cttccagcgc ctcgtccgtg agatcgcgca ggatttcaag 241 accgacctcc gcttccagtc ctccgctgtc gccgcgctgc aggaggccgc cgaggcctac 301 ctcgtggggc tcttcgagga caccaacctc tgcgccatcc acgccaagcg cgtcaccatc 361 atgcccaagg acatccagct cgcgcgccgc atcaggggcg agagggcttg a // LOCUS RATFIBAA 511 bp ss-mRNA ROD 31-JUL-1990 DEFINITION Rat alpha-fibrinogen mRNA, 3' end. ACCESSION M35601 KEYWORDS alpha-fibrinogen. SOURCE Rat (strain Wistar) male 16-hr regenerating liver, cDNA to mRNA, clone lambda-7A. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 511) AUTHORS Sobczak,J., Lotti,A.-M., Taroux,P. and Duguet,M. TITLE Molecular cloning of mRNA sequences transiently induced during rat liver regeneration JOURNAL Exp. Cell Res. 169, 47-56 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 282 alpha-fibrinogen BASE COUNT 145 a 124 c 100 g 141 t 1 others ORIGIN 1 cctgaacttg gttcctttta tgacagccgc tttggttcac tcacaaagaa cttcaaagaa 61 tttggcagta agacctctga ttctgacatc ttcacagaca tcgagaaccc tagctcccat 121 gtacctgagt tttcttncag tagtaaaacc tcaactgtca ggaaacaagt aaccaagagc 181 tataaaatgg cagatgaggc agcaagtgaa gctcaccaag aaggagacac tcgaaccacc 241 aagaggggcc gagctcgcac aatgagaggt atccacgctt aactctggga agttgccctg 301 accccctaga ctaagttaac catttctgca aagtgcttac caggcgcgct ctttcttaac 361 ctcttctagt gctttggtgg aatctcattt tttttcatgc tagactgtac gttccttggg 421 ggcagggact ttgccatgtg tctatttctc taattcccca atgcataaca gtgcagtcat 481 ttctcaataa atatatttta aataaatgaa c // LOCUS RATFIBBA 261 bp ss-mRNA ROD 31-JUL-1990 DEFINITION Rat beta-fibrinogen mRNA, 3' end. ACCESSION M35602 KEYWORDS beta-fibrinogen. SOURCE Rat (strain Wistar) male 16-hr regenerating liver, cDNA to mRNA, clone lambda-7B. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 261) AUTHORS Sobczak,J., Lotti,A.-M., Taroux,P. and Duguet,M. TITLE Molecular cloning of mRNA sequences transiently induced during rat liver regeneration JOURNAL Exp. Cell Res. 169, 47-56 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 168 beta-fibrinogen BASE COUNT 73 a 49 c 61 g 78 t ORIGIN 1 tgtcatgcag ccaatccaaa cggcagatac tactggggtg gacagtacac ctgggacatg 61 gcaaagcatg gcacagatga tggtgtagta tggatgaatt ggaaggggtc atggtactca 121 atgaggaaga tgagtatgaa gatcaggccc ttcttcccac agcaatagtc cccaatagcg 181 tagatttttg ctcttctgta tgtgacaaca tttttgtaca ttatgttatt ggaattttct 241 ttcatacatt atattcctac t // LOCUS RATNKBA 727 bp ss-mRNA ROD 31-JUL-1990 DEFINITION Rat neurokinin B mRNA, complete cds. ACCESSION M35607 KEYWORDS neurokinin B. SOURCE Rat cerebral cortex, cDNA to mRNA, clone cRNKp[1,6]. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 727) AUTHORS Bonner,T.I., Affolter,H.-U., Young,A.C. and Young,W.S.III. TITLE A cDNA encoding the precursor of the rat neuropeptide neurokinin B JOURNAL Mol. Brain Res. 2, 243-249 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 115 465 neurokinin B precursor sigp 115 162 neurokinin B signal peptide matp 358 387 neurokinin B mRNA < 1 727 neurokinin B mRNA BASE COUNT 155 a 228 c 179 g 165 t ORIGIN 1 agctgggact ggttcctgcc acctccccga gcccctgcac tcttgtctct ctctgtctcc 61 acagcagcgt caccggctcc cggatcctgc cagccgtcca gctctacagg caccatgagg 121 agcgccatgc tgttcgcggc tgtcctcgcc ctcagcttgg catggacctt cggggctgcg 181 tgtgaggaac ctcaggagca gggtgggagg ctcagtaagg actctgatct ctctctgcta 241 cctccacccc tgcttcggag actctacgac agccgctcca tctctctgga aggattgctg 301 aaagtgctga gcaaggctag cgtgggaccg aaggagacat cacttccaca gaaacgtgac 361 atgcacgact tctttgtggg acttatgggc aagaggaaca gccaaccaga cactcccgct 421 gatgtggttg aagagaacac ccccagcttt ggcgtcctca aataggccag cagtgcagaa 481 aagcactcca ctctcagacc ctggactgca tcataaagac agggttcctg tggcggtccc 541 agtgcctgcg ctcctgcttc cctgcctgca aggtcctcct gttggctccc ttccctactc 601 tgcacagatg ctgcatatga acagcctcta cccccatatc aattatggtt tctgtagtgt 661 cctgcattaa aaataccatg tctcctcctc aacaataaag ggtttttaca atggagtgac 721 tgaaaag // LOCUS RABIGHCPS 577 bp ss-mRNA MAM 31-JUL-1990 DEFINITION Rabbit nonproductive VHa1 mRNA for Ig(mu) heavy chain constant region. ACCESSION X13699 KEYWORDS constant region; immunoglobulin; immunoglobulin heavy chain; mu-immunoglobulin. SOURCE rabbit (Oryctolagus cuniculus). ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 577) AUTHORS Mage,R.G. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 577) AUTHORS Mage,R.G., Newman,B.A., Harindranath,N., Bernstein,K.E., Becker,R.S. and Knight,K.L. TITLE Evolutionary conservation of splice sites in sterile C-mu transcripts and of immunoglobulin heavy chain (IgH) enhancer sequences JOURNAL Mol. Immunol. 26, 1007-1010 (1989) STANDARD simple automatic COMMENT *source: strain=allotype-defined NIH VHal Ckb9 Rabbit No CW247-4; *source: library=pBR322; clone=pB3; The sequence overlaps with that reported by Bernstein et. al. in J. Immunol. 132:490-495(1984) J00666 (F-I haplotype). The cDNA pB3 sequence starts at a position corresponding to pos. 70 in a germline DNA from JH-C-mu intron 80 bp downstream of the Ig(mu) enhancer. See x13700. EMBL features not translated to GenBank features: key from to description SITE 549 >577 CH1 domain of Ig(mu)H chain [1] Author address Mage R.G., Lab of Immunology, NIAID , NIH, Building 10,11-N311, NIH, Bethesda, MD 20892. Submitted (05-DEC-1988) on tape to the EMBL data library BASE COUNT 145 a 120 c 132 g 180 t ORIGIN 1 tagaaagctc aagcttagct tatcaatttc taaattgtat tcatttttaa agtaagatat 61 tttagaaatt aaacttacag atggtttatt aatgcgccca tgaaactttt taaaattact 121 ttgttatttc ttcatacggt aattaacctc aaacttcaga gacctcaaag cattattttt 181 gcaacgcctg cgaccgtcat ttttaagagt ctgttttctc tatgattaga agagacgttt 241 tgaggcttca tttcagtatt tttcaagact gcttttcaag tattcatttt tgcaccagtt 301 ctagctgagg tggacctaag tgtcctgagt tattgggtta aacgtgtgac ttggtgaatt 361 ggagagccat cagacagtgc agagacagga gtcgggagct ctgcagaagc cgggccaccc 421 acgccttgag gtttcccaag accgccgggt ctgggggtgc agccgtggct tctggttagt 481 caggaattcc tgtttggagt ggacttggag gacccgcagg acgtgccagt gaccagccac 541 tgcacatgtg agcctgtcat ctccaactct gtacccg // LOCUS RABIGHENH 1257 bp ds-DNA MAM 31-JUL-1990 DEFINITION Rabbit IgH gene JH-C(mu) intron with enhancer region yielding a nonproductive C(mu) transcript. ACCESSION X13700 KEYWORDS enhancer; germ line; immunoglobulin; immunoglobulin heavy chain; mu-immunoglobulin. SOURCE rabbit (Oryctolagus cuniculus). ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 1257) AUTHORS Mage,R.G. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 1257) AUTHORS Mage,R.G., Newman,B.A., Harindranath,N., Bernstein,K.E., Becker,R.S. and Knight,K.L. TITLE Evolutionary conservation of splice sites in sterile C-mu transcripts and of immunoglobulin heavy chain (IgH) enhancer sequences JOURNAL Mol. Immunol. 26, 1007-1010 (1989) STANDARD simple automatic COMMENT *source: library=cosmid; clone=COS 3.3; tissue=sperm; *source: strain=allotype-defined a3n80f71g75 rabbit No X314-6. see x13699 for corresponding transcript seq EMBL features not translated to GenBank features: key from to description IVS 1 1257 JH-C(mu) intron SITE 591 598 conserved enhancer octamer TRANSCR 670 1227 homolog. to pB3 cDNA (X13699) IVS 1228 >1257 intron (to form pB3 cDNA) [1] Author address Mage R.G., Lab of Immunology, NIAID , NIH\, Building 10,11-N311, NIH, Bethesda, MD 20892. Submitted (05-DEC-1988) on tape to the EMBL data library BASE COUNT 328 a 248 c 320 g 361 t ORIGIN 1 gttagaaaga gaacggttgt cagcagtaca aggaaggagg gccgtggagc cccaaaggcg 61 ctgcctctga cagggaccta gcaaggcctg ggcagcccaa ccgtgctggg acggcgccca 121 ccaggtgtgc agggacatgg tgtccgaagt ggcttctgag aaaagtggct gagaaaattg 181 agaatgaaga ctttttgttc atgtatattc agctttttgt gtttttttaa tgtaaaggac 241 ttttagatta ttattgttca gtatctgtct gggattggca caagacagca gagaaacagt 301 ggagtggcac cagcaggccc caatacagga ctgagttttg acaaaaatta tttaagttta 361 atttttaatt aattgagcaa tgccagatta gtgcatgagt gacagccaag atggccgcgc 421 atccggccgg acacctgcag caggtggcag gaagcatgtc gcactgagag tctattttag 481 gaagccaaaa agcaccctgt gtgaatttat agctcctgtt tttgaagtgg tggttttcaa 541 gtggttttcc cggatccgaa agtgttgcag agcaaaacaa cacctgggta gtttgcatat 601 ctaaaatagg acaggggact gccagggttc ctcttgtcac tatttgaatt tattttttca 661 atctgaactt agcaacttct agtttgctca agcttagctt atcaatttct aaattgtatt 721 catttttaaa gtaagatatt ttagaaatta aacttacaga tggtttatta atgcgcccat 781 gaaacttttt aaaattactt tgttatttct tcatacggta attaacctca aacttcagag 841 acctcaaagc attatttttg caacgcctgc gaccgtcatt tttaagagtc tattttctct 901 atgattagaa gagacgtttt gaggcttcat ttcagtattt ttcaagactg cttttcaagt 961 attcattttc acaccagttc tagctgaggt ggacctaagt gtcctgagtt attgggttaa 1021 acgtgtgact tggtgaattg gagagccatc agacagtgca gagacaggag tcgggagctc 1081 tgcagaagcc gggccaccca cgccttgagg tttcccagga ccgccgggtc tgggggtgca 1141 gccgtggctt ctggttagtc aggaattcct gtttggagtg gacttggagg acccgcagga 1201 cgtgccggtg accagccact gcacatggtg agaggggcag ctgctctgcc acagatc // LOCUS ECOTN5011 60 bp ds-DNA BCT 31-JUL-1990 DEFINITION E.coli mercuric ion resistance transposon (Tn501), 5' end. ACCESSION M25453 KEYWORDS transposon. SEGMENT 1 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 60) AUTHORS Brown,N.L., Choi,C.-L., Richmond,M.H. and Grinsted,J. TITLE Nucleotide sequences at the ends of the mercury resistance transposon, Tn501 JOURNAL Nucleic Acids Res. 8, 1933-1945 (1980) STANDARD simple staff_review COMMENT Transposon Tn501 was originally isolated in Pseudomonas aeruginosa. FEATURES from to/span description recomb 12 13 E.coli DNA end/Tn501 DNA start BASE COUNT 21 a 11 c 17 g 11 t ORIGIN 1 ctgaaactat gagggggaac cgcagaattc ggaaaaaatc gtacgctaag ctaacggtgt // LOCUS ECOTN5012 60 bp ds-DNA BCT 31-JUL-1990 DEFINITION E.coli mercuric ion resistance transposon (Tn501), 3' end. ACCESSION M25454 KEYWORDS transposon. SEGMENT 2 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 60) AUTHORS Brown,N.L., Choi,C.-L., Grinsted,J., Richmond,M.H. and Whitehead,P.R. TITLE Nucleotide sequences at the ends of the mercury resistance transposon, Tn501 JOURNAL Nucleic Acids Res. 8, 1933-1945 (1980) STANDARD simple staff_review COMMENT Transposon Tn501 was originally isolated in Pseudomonas aeruginosa. FEATURES from to/span description recomb 48 49 Tn501 DNA end/E.coli DNA start BASE COUNT 15 a 17 c 13 g 15 t ORIGIN About 8.2 kb after segment 1 1 cccggaaaac cttagcgtac gattttttcc gaattctgcg agccccccta tgagatgaga // LOCUS COXCBBEA 1749 bp ds-DNA BCT 31-JUL-1990 DEFINITION Plasmid QpRS (from Coxiella burnetii) surface protein (cbbE') gene, complete cds. ACCESSION M29982 KEYWORDS surface protein. SOURCE Plasmid QpRS (from Coxiella burnetti, strain Biotzere, isolate priscilla Q177) DNA. ORGANISM Plasmid QpRS Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1749) AUTHORS Minnick,M.F., Heinzen,R.A., Frazier,M.E. and Mallavia,L.P. TITLE Characterization and expression of the cbbE' gene of Coxiella burnetii JOURNAL J. Gen. Microbiol. 136, 1099-1107 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.A.Heinze, 20-NOV-1989. FEATURES from to/span description pept 196 1683 E' protein signal 157 162 cbbE' -35 region (pot.) signal 178 183 cbbE' -10 region (pot.) signal 1691 1713 rho-independent transcription terminator (pot.) binding 187 193 ribosome binding site (pot.) BASE COUNT 579 a 335 c 343 g 492 t ORIGIN 1 gtcgacgtct gcctttgccg ctatagttca ccttcaaata aacaacaata tttatttata 61 aattttcgga gtccatcatg ccgctgagaa aggaaacttt taaagtaagt tacacattag 121 gtttttttgt gaaaataaat cgactagctt agcttattta atcctttctt aaaattttat 181 aataacggag agactatgcc taaaaaactc gtacccaaag actatgaata tatccatctg 241 gatcttacca ccggtgaaat aaactttacg tcgtttaatt cgcttgaaga attgcaagcc 301 tctttaaaag aaggtcagat ttttttccac aaaagtgtca tctttgaaga aaaaccagaa 361 agtggggaaa tttactcccc taaactgata agccagatat atcgaaaaga acaggaactc 421 tttgaaataa gggaaaaaag taagggacat ccactaccgg ttactaagaa actgcttaag 481 agagggcagg gaacgatagt gtgttgcggt atttacacaa aagaactctt gaaaaacgta 541 gcggaaaagg gacagtacga tacccaatgt gacgatctaa atttgggaat ttttcacgta 601 cgcgcccata aacctttagg catcgcgcaa aggcttgtgc atcttccgct tcccgaggat 661 gcttcttccg ctgcagtagc cactgaaaat ttattcggct taatacgatt tatactcgtc 721 aatgatcccg ctaagaaaaa aatttactta cctatctctt gttttgcaat tgagaagcgt 781 atagaacaag agcatataat tggatattct cagaaagata gcctggcgct ctctcagcga 841 gcttattatg aatataagaa ggacggaacg cttatcgggc tagtcgcatt gatcggtgtt 901 gacgtaaaga tagatggtaa gctaggtttt ttatatcatc cggtgtggcg tgagaaacaa 961 tgggcattaa aattcaatga aaaaatgttt tattgtgctg tttcccgtgc agaaaaagaa 1021 aaagtcttta agccgcctta ttatttggag ccgactgcga taattgtcga tgtaactgaa 1081 acgcccgtta agcgcttaaa gaatacaagt gaggactatt tatggttgga ggtttcccaa 1141 atttcggcaa aattctctct tttttgtgca caaaacaatt tgaaattgga gaaggctgat 1201 tcaaaaaata aatctccttt cgtggcccta tcaatggaat cgattagtga attaacaggc 1261 gaacagaaaa gagcttttgt caaaatcttg aatattcccg gaatcatttt ctcttcctca 1321 accctagcta aagcgagact agagagcaaa cttcaatata ttggaccagc attaattgaa 1381 gccgctgcag atggaaattt caccgatgtg gttgatataa taaatcgaat agaaccactc 1441 tatgattaca aagagatttt aaaagaggca ttaaaaacac aacgtttggg aacgggcaat 1501 actcctttac aggaggctat caaaggacag catacaagcc tagttaagta cttcagttcg 1561 ctatccgctt cgttgaaagt cataaatcat aaaaatcatc aaggattaac agcactcaat 1621 ttcgctacag ccattggatc atcccctgcg atcgtacaag agcttgaatg gtgctcccaa 1681 taaagttagg aatggtccgc cgcgcaaatc attgctaata cgttatctga aaaggtgtca 1741 tccccttcc // LOCUS DROOPA 240 bp ds-DNA INV 31-JUL-1990 DEFINITION D.melanogaster neurogenic locus mastermind opa repeat sequence. ACCESSION M35653 KEYWORDS mastermind locus. SOURCE D.melanogaster DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 240) AUTHORS Yedvobnick,B., Smoller,D., Young,P. and Mills,D. TITLE Molecular analysis of the neurogenic locus mastermind of Drosophila melanogaster JOURNAL Genetics 118, 483-497 (1988) STANDARD simple staff_review BASE COUNT 68 a 81 c 67 g 24 t ORIGIN 1 ccgccgcacg gcatgaatcc gcagcaacag cagcagcagc aacaacagca gcagcagcaa 61 cagcaacagg cccagcagca acatggtcaa atgatgggac aaggacagcc gggtcgctat 121 aacgactacg gcggcggctt tcccaatgac tttggcctgg gacccaatgg tccgcagcag 181 cagcagcagc aggcgcagca acagcagccg cagcagcaac acctgccgcc gcagttccat // LOCUS DRORS1 60 bp ds-DNA INV 31-JUL-1990 DEFINITION D.melanogaster neurogenic locus mastermind RS repeat sequence. ACCESSION M35654 KEYWORDS mastermind locus. SOURCE D.melanogaster DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 60) AUTHORS Yedvobnick,B., Smoller,D., Young,P. and Mills,D. TITLE Molecular analysis of the neurogenic locus mastermind of Drosophila melanogaster JOURNAL Genetics 118, 483-497 (1988) STANDARD simple staff_review BASE COUNT 29 a 29 c 1 g 1 t ORIGIN 1 gcacacacac acacacacac acacacacac acacacacac acacacacac acacacacat // LOCUS DRORS2 60 bp ds-DNA INV 31-JUL-1990 DEFINITION D.melanogaster neurogenic locus mastermind RS repeat sequence. ACCESSION M35655 KEYWORDS mastermind locus. SOURCE D.melanogaster DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 60) AUTHORS Yedvobnick,B., Smoller,D., Young,P. and Mills,D. TITLE Molecular analysis of the neurogenic locus mastermind of Drosophila melanogaster JOURNAL Genetics 118, 483-497 (1988) STANDARD simple staff_review BASE COUNT 20 a 27 c 7 g 6 t ORIGIN 1 agttgccctg ctcacacaca cgcacacaca cacacacaca cacacacatg catgcacgcc // LOCUS ECOCS3 612 bp ds-DNA BCT 31-JUL-1990 DEFINITION E.coli major fimbrial subunit of CS3 gene, complete cds. ACCESSION M35657 KEYWORDS fimbrial subunit. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 612) AUTHORS Boylan,M., Smyth,C.J. and Scott,J.R. TITLE Nucleotide sequence of the gene encoding the major subunit of CS3 fimbriae of enterotoxigenic Escherichia coli JOURNAL Infect. Immun. 56, 3297-3300 (1988) STANDARD simple staff_review FEATURES from to/span description pept 82 588 fimbrial subunit precursor of CS3 sigp 82 126 CS3 fimbrial subunit signal peptide matp 127 585 CS3 fimbrial subunit BASE COUNT 197 a 114 c 110 g 191 t ORIGIN 1 agcagtacag ttccaggtac gtatactgtt ggtcttaacg taaccagtaa tgttatttaa 61 agtgaatgta tgagggattc gatgttaaaa ataaaatact tattaatagg tctttcactg 121 tcagctatga gttcatactc actagctgca gcggggccca ctctaaccaa agaactggca 181 ttaaatgtgc tttctcctgc agctctggat gcaacttggg ctcctcagga taatttaaca 241 ttatccaata ctggcgtttc taatactttg gtgggtgttt tgactctttc aaataccagt 301 attgatacag ttagcattgc gagtacaagt gtttctgata catctaagaa tggtacagta 361 acttttgcac atgagacaaa taactctgct agctttgcca ccaccatttc aacagataat 421 gccaacatta cgttggataa aaatgctgga aatacgattg ttaaaactac aaatgggagt 481 cagttgccaa ctaatttacc acttaagttt attaccactg aaggtaacga acatttagtt 541 tcaggtaatt accgtgcaaa tataacaatt acttcgacaa ttaaataatt atataataga 601 cgtagccttc ga // LOCUS HUMCRYAX1 189 bp ds-DNA PRI 31-JUL-1990 DEFINITION Human alpha-A-crystallin gene, exon 1. ACCESSION M35629 KEYWORDS alpha-A-crystallin. SEGMENT 1 of 2 SOURCE Human spleen DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 189) AUTHORS McDevitt,D.S., Hawkins,J.W., Jaworski,C.J. and Piatigorsky,J. TITLE Isolation and partial characterization of the human alpha-A- crystallin gene JOURNAL Exp. Eye Res. 43, 285-291 (1986) STANDARD simple staff_review FEATURES from to/span description pept 1 / 189 alpha-A-crystallin, exon 1 BASE COUNT 27 a 72 c 45 g 45 t ORIGIN 1 atggacgtga ccatccagca cccctggttc aagcggaccc tggggccctt ctaccccagc 61 cggctgttcg accagttttt cggcgagggg ctttttgagt atgacctgct gcccttcctg 121 tcgtccacca tcacccccta ctaccgccag tccctcttcc gcaccgtgct ggactccggc 181 atctctgag // LOCUS HUMCRYAX2 51 bp ds-DNA PRI 31-JUL-1990 DEFINITION Human alpha-A-crystallin gene, exon 3 (partial). ACCESSION M35628 KEYWORDS alpha-A-crystallin. SEGMENT 2 of 2 SOURCE Human spleen DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 51) AUTHORS McDevitt,D.S., Hawkins,J.W., Jaworski,C.J. and Piatigorsky,J. TITLE Isolation and partial characterization of the human alpha-A- crystallin gene JOURNAL Exp. Eye Res. 43, 285-291 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 27 alpha-A-crystallin, exon 3 (AA at 1) BASE COUNT 7 a 23 c 11 g 10 t ORIGIN About 1.7 kb after segment 1 1 aagcccacct cggctccctc gtcctaagca gcattgcctc ggctggctcc c // LOCUS HUMTBP1 1341 bp ss-mRNA PRI 31-JUL-1990 DEFINITION Human tat binding protein-1 (tbp-1) mRNA, complete cds. ACCESSION M34079 KEYWORDS Tat binding protein-1. SOURCE Human Jurkat T-cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites for [2]) AUTHORS Nelbock,P., Dillion,P.J., Perkins,A. and Rosen,C.A. TITLE A cDNA for a protein that interacts with the human immunodeficiency virus tat transactivator JOURNAL Science 248, 1650-1653 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1341) AUTHORS Nelbock,P., Dillion,P.J., Perkins,A. and Rosen,C.A. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.A.Rosen, 08-MAY-1990. FEATURES from to/span description pept 63 1277 tat binding protein-1 (tbp-1) BASE COUNT 350 a 337 c 412 g 242 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccggc gaccgtgtgg gatgaggccg agcaagatgg aattggggag gaggtgctca 61 agatgtccac ggaggagatc atccagcgca cacggctgct ggacagtgag atcaagatca 121 tgaagagtga agtgttgaga gtcacccatg agctccaagc catgaaggac aagataaaag 181 agaacagtga gaaaatcaaa gtgaacaaga ccctgccgta ccttgtctcc aacgtcatcg 241 agctcctgga tgttgatcct aatgaccaag aggaggatgg tgccaatatt gacctggact 301 cccagaggaa gggcaagtgt gctgtgatca aaacctctac acgacagacg tacttccttc 361 ctgtgattgg gttggtggat gctgaaaagc taaagccagg agacctggtg ggtgtgaaca 421 aagactccta tctgatcctg gagacgctgc ccacagagta tgactcgcgg gtgaaggcca 481 tggaggtaga cgagaggccc acggagcaat acagtgacat tgggggtttg gacaagcaga 541 tccaggagct ggtggaggcc attgtcttgc caatgaacca caaggagaag tttgagaact 601 tggggatcca acctccaaaa ggggtgctga tgtatgggcc cccagggacg gggaagaccc 661 tcctggcccg ggcctgtgcc gcacagacta aggccacctt cctaaagctg gctggccccc 721 agctggtgca gatgttcatt ggagatggtg ccaagctagt ccgggatgcc tttgccctgg 781 ccaaggagaa agcgccctct atcatcttca ttgatgagtt ggatgccatc ggcaccaagc 841 gctttgacag tgagaaggct ggggaccggg aggtgcagag gacaatgctg gagcttctga 901 accagctgga tggcttccag cccaacaccc aagttaaggt aattgcagcc acaaacaggg 961 tggacatcct ggaccccgcc ctcctccgct cgggccgcct tgaccgcaag atagagttcc 1021 cgatgcccaa tgaggaggcc cgggccagaa tcatgcagat ccactcccga aagatgaatg 1081 tcagtcctga cgtgaactac gaggagctgg cccgctgcac agatgacttc aatggggccc 1141 agtgcaaggc tgtgtgtgtg gaggcgggca tgatcgcact ggccaggggt gccacggagc 1201 tcacccacga ggactacatg gaaggcatcc tggaggtgca ggccaagaag aaagccaacc 1261 tacaatacta cgcctaggca cacaggccag ccccagtctc acggctgaag tgcgcaataa 1321 aagatggttt agggggaatt c // LOCUS MUSMETMO 280 bp ds-DNA SYN 31-JUL-1990 DEFINITION MoMLV/mouse metallothionein promoter recombinant DNA. ACCESSION M35969 KEYWORDS . SOURCE Mouse and mouse moloney leukemia virus DNA. REFERENCE 1 (bases 1 to 280) AUTHORS Mclvor,R.S. TITLE Deletion in a recombinant retroviral vector resulting from a cryptic splice donor signal in the moloney leukemia virus encvelope gene JOURNAL Virology 176, 652-655 (1990) STANDARD simple staff_entry FEATURES from to/span description recomb 127 128 MoMLV DNA end/mouse metallothionein promoter DNA start recomb 241 242 metallothionein promoter region deletion BASE COUNT 64 a 78 c 77 g 61 t ORIGIN 1 gggtccagac tccttctact gtgcctattg gggctgtgag acaaccggta gagcttactg 61 gaagccctcc tcatcatggg atttcatcac agtaaacaac aatctcacct ctgaccaggc 121 tgtccagctt cgatccagag agagacctgg gcggagctgg tcgctgctag gaactccagg 181 aaaggagaag ctgaggttac cacgctgcga atgggtttac ggagatagct ggctttccgg 241 gctccagctt caccagatcc gtcgacctgc agccaagctt // LOCUS PLYREGU 203 bp ds-DNA VRL 31-JUL-1990 DEFINITION Polyomavirus JC HindIII C fragment regulatory sequence. ACCESSION M34921 KEYWORDS . SOURCE Polyomavirus JC DNA. ORGANISM Polyomavirus JC Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 203) AUTHORS Frisque,R.J. TITLE Regulatory sequences and virus-cell interactions of JC virus JOURNAL Prog. Clin. Biol. Res. 105, 41-59 (1983) STANDARD simple staff_review FEATURES from to/span description pept 99 < 1 (c) T- antigen BASE COUNT 62 a 55 c 31 g 55 t ORIGIN 1 aagctttctc atgacaggaa tgttccccca tgcagaccta tcaaggccta ataaatccat 61 aagctccatg gattcctccc tattcagcac tttgtccatt ttagcttttt gcagcaaaaa 121 attactgcaa aaaagggaaa aacaagggaa tttccctggc ctcctaaaaa gcctccacgc 181 ccttactact tctgagtaag ctt // LOCUS PLYREGUX 292 bp ds-DNA VRL 31-JUL-1990 DEFINITION Polyomavirus JC regulatory sequence. ACCESSION M34922 KEYWORDS . SOURCE Polyomavirus JC DNA, clones Mad1-[TC,Br]. ORGANISM Polyomavirus JC Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 292) AUTHORS Frisque,R.J. TITLE Regulatory sequences and virus-cell interactions of JC virus JOURNAL Prog. Clin. Biol. Res. 105, 41-59 (1983) STANDARD simple staff_review FEATURES from to/span description binding 3 19 pot. T-antigen binding site rpt 20 117 tandem repeat copy A rpt 118 215 tandem repeat copy B BASE COUNT 96 a 68 c 81 g 47 t ORIGIN 0.67 - 0.73 mu. 1 cggaggcggc ctcggcctcc tgtatatata aaaaaaaggg aagggatggc tgccagccaa 61 gcatgagctc atacctaggg agccaaccag ctaacagcca gtaaacaaag cacaaggctg 121 tatatataaa aaaaagggaa gggatggctg ccagccaagc atgagctcat acctagggag 181 ccaaccagct aacagccagt aaacaaagca caaggggaag tggaaagcag ccaagggaac 241 atgttttgcg agccagagct gttttggctt gtcaccagct ggccatggtt ct // LOCUS PT7RNA 203 bp ds-DNA PHG 31-JUL-1990 DEFINITION Bacteriophage T7 DNA sequence. ACCESSION M35614 KEYWORDS . SOURCE Bacteriophage T7 (strain delta-D111) DNA RNA. ORGANISM Bacteriophage T7 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 203) AUTHORS Aivazashvili,V.A., Bibilashvili,R.S., Vartikyan,R.M. and Kutateladze,T.V. TITLE Factors affecting the discontinuous in vitro elongation of RNA by RNA polymerase from Escherichia coli JOURNAL Mol. Biol. 15, 510-522 (1981) STANDARD simple staff_review BASE COUNT 63 a 54 c 49 g 35 t 2 others ORIGIN 1 catcgagagg gacacggcga atagccatcc caatccacac gtccaacggg gcaaccgtat 61 gtacacctga tgggttcgca atgaacacna cgaatcgaac gccttaagcg tgaactccgc 121 attaaccgca agattacnag ataggtcggc tatgacgaac gcactgatgg catagaggta 181 atgcatgcat gcacacatat acg // LOCUS PT7RNAX 209 bp ds-DNA PHG 31-JUL-1990 DEFINITION Bacteriophage T7 DNA sequence. ACCESSION M35615 KEYWORDS . SOURCE Bacteriophage T7 DNA. ORGANISM Bacteriophage T7 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 209) AUTHORS Aivazashvili,V.A., Bibilashvili,R.S., Vartikyan,R.M. and Kutateladze,T.V. TITLE Factors affecting the discontinuous in vitro elongation of RNA by RNA polymerase from Escherichia coli JOURNAL Mol. Biol. 15, 510-522 (1981) STANDARD simple staff_review BASE COUNT 71 a 52 c 54 g 32 t 0 others ORIGIN 1 catcgagagg gacacggcga atagccatcc caatcgacac cggggtcaac cggataagta 61 gacagcctga taagtcgcac gaaaaacagg tattgacaac atgaagtaac atgcagtaag 121 atacaaatcg ctaggtaaca ctagcagcgt caaccgggcg cacagtgcct tctaggtgac 181 ttaagcgcac cacggcacat aaggtgaag // LOCUS SEQPGH 879 bp ss-mRNA VRT 31-JUL-1990 DEFINITION S.quinqueradiata growth hormone (GH) mRNA, complete cds. ACCESSION M35627 KEYWORDS growth hormone. SOURCE S.quinqueradiata (pituitary glands), cDNA to mRNA. ORGANISM Seriola quinqueradiata Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Perciformes; Percoidei; Carangidae. REFERENCE 1 (bases 1 to 879) AUTHORS Watahiki,M., Tanaka,M., Masuda,N., Yamakawa,M., Yoneda,Y. and Nakashima,K. TITLE cDNA cloning and primary structure of yellow tail (Seriola quinqueradiata) pregrowth hormone JOURNAL Gen. Comp. Endocrinol. 70, 401-406 (1988) STANDARD simple staff_review FEATURES from to/span description pept 97 711 growth hormone (GH) precursor sigp 97 147 growth hormone signal peptide (pot.) (other potential cleavage site on bp 150) matp 150 708 growth hormone (pot.) (other matp starts on bp 153) BASE COUNT 226 a 239 c 192 g 222 t ORIGIN 1 acaacacact gaacaactga actcagctca gattaaacag aacctgaacc tgaagcagac 61 cctgaaccag aacctgaact tgaaccctga tcagccatgg acagagttgt ccttctgctg 121 tcagtcctgt ctctgggcgt ctcctctcag ccaatcacag acagccagca tctgttctcc 181 atcgctgtca gcagaatcca aaacctccac ctgctcgctc agagactctt ctccaacttc 241 gagagtactc tgcagacgga ggaccagcgt caactcaaca aaatcttcct acaggatttc 301 tgtaactctg attacatcat cagtcccatt gacaagcatg agacacaacg cagctctgtt 361 ctgaagctgt tatcgatctc ctatcgattg gtggagtctt gggagttctc cagtcgcttt 421 ctgtctggag gttctgctct gaggaaccag atttcaccca gactgtctga actcaagaca 481 ggaatccaac tgctgatcac agccaatcag gacggagcag agatgttctc tgacgtctcg 541 gccctccagc tcgctccata tggaaacttc tatcagagtc tgggaggcga agagttgctg 601 agacgaaact acgaattgct ggcctgtttc aagaaggaca tgcacaaggt ggagacgtac 661 ctgacggtgg ctaaatgtcg gctctctcca gaagctaact gcaccctgta gccccgtctc 721 tccacagtga agcctgtcgt tgatgatgta atcctgtatg ttctcaagct ccgcccctat 781 gttagcatta gtgtttacct ctgtcaggtg ctgaagtcca aactgatgat gtcatagtga 841 tgtcatacat tcagcttgtg aaataaagtg tgttaattc // LOCUS VIBRR5S 118 bp ss-rRNA RNA 31-JUL-1990 DEFINITION V.marinus 5S ribosomal RNA. ACCESSION M35658 KEYWORDS 5S ribosomal RNA. SOURCE V.marinus (strain MP-1, ATCC 15381) RNA. ORGANISM Vibrio marinus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 118) AUTHORS MacDonell,M.T. and Colwell,R.R. TITLE The nucleotide sequence of 5S ribosomal RNA from Vibrio marinus JOURNAL Microbiol. Sci. 1, 229-231 (1984) STANDARD simple staff_review FEATURES from to/span description rRNA 1 118 5S rRNA BASE COUNT 27 a 25 c 34 g 32 t ORIGIN 1 tgtttggtga ccatagtgtt gtggtaccac ctgactccat tccgaactca gtagtgaaac 61 gtaataacgc cgatggtagt gtggggtttc cccatgtgag agtaggtatc gccaggca //
GenBank-Updates@genbank.bio.net (08/01/90)
LOCUS RATGGLUT 1060 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat gamma-glutamyltransferase gene, 5' promoter region. ACCESSION J05515 KEYWORDS gamma-glutamyltransferase. SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1060) AUTHORS Rajagopalan,S., Park,J.-H., Patel,P.D., Lebovitz,R.M. and Lieberman,M.W. TITLE Cloning and analysis of the rat gamma-glutamyltransferase gene JOURNAL J. Biol. Chem. 265, 11721-11725 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Rajagopalan, 25-MAY-1990. FEATURES from to/span description mRNA 834 > 1060 gamm-glutamyltransferase mRNA BASE COUNT 235 a 277 c 275 g 273 t ORIGIN 1 cagctgcctt ctggaggacc aaactgttca ggggaaggac aggaagaaat gagcctgtgc 61 cttcaggtca gagtcatgcc tagatctggg cgggagagct acaagggata ctgaccagga 121 gatagggtgt tgtcccctcc cccctggggt ttggtatcct cctctgcctt aagagttgca 181 aatcgacttt cccacataac aggcaccaaa tccagttagg accaacccca ccttccaatc 241 caggggagag gaatgtcagc aatgcgtggg cgtgtccttc taatgtgttt tccttgagtg 301 ttgtatgtgg accatctgca tgctcggtac ccagaggcca tcaggtctct tggaacagga 361 attgttgatg tgaaatgcca tgtggttgct gggataggaa ctcaggactc cggaagaacc 421 ttctcttctc cagtccccct ctgttgtttt tttttttttt ttttttgaga tacgatctca 481 cactgtagca caggctaatc cagaactcac taggtaggtc agactgggct caaatcacag 541 cgattctgct tctgcttcct gagtgccagg gtttgcaggt gttagctatc atgcccagtc 601 ttaacatttc acacacgcca gtccaagtta ttaaaaaaca acccggcagt tgagggcagg 661 gccctcaagt cccacaactg gtgcgtgcgt accaagtcca atgcgggaaa ggcctggacc 721 cttgaaccct ttgggcggtt cacttgttag ctcttactac caaatcctgg gcttacacat 781 gaatgccagc ccctccctgc ccagttctgt gacccccttc cccgggcagc tcttgggaga 841 agtcatgcat acatggaggc ggtgccagcc tctttgactc cagagttcag cgggagacag 901 agggagctca tcacatcagg caccccagaa gagttctggg cctgcttcac gtttaacttt 961 gtgattttca ggagtaccag cctgctctaa cggtttcagg gaagattggc tgtgggtttc 1021 cgcagagtgt gggggagttc ctgcttatcc atacagctga // LOCUS ACMGAG 167 bp ss-RNA VRL 01-AUG-1990 DEFINITION Avian myelocytomatosis retrovirus gag gene, partial cds. ACCESSION M35626 KEYWORDS gag protein; oncogene. SOURCE Avian myelocytomatosis retrovirus (mutant MC29-10H) RNA. ORGANISM Avian myelocytomatosis retrovirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian myelocytomatosis viruses. REFERENCE 1 (bases 1 to 167) AUTHORS Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and Patschinsky,T. TITLE Structure of mutant and wild-type MC29 v-myc alleles and biochemical properties of their protein products JOURNAL Oncogene 1, 97-109 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 167 gag protein (AA at 1) BASE COUNT 38 a 47 c 52 g 30 t ORIGIN 1 ggggaggagc ttgcgagtac aggtccgccc gtggtggcca tgcctgtagt gattaacaca 61 gagggacccg cctggacccc tctggagcca aaattgatca caagactggc tgatacggtc 121 aggaccaagg gcttacgatc cccgattact atagcggcgg ccactcg // LOCUS ACMVMYC 333 bp ss-RNA VRL 01-AUG-1990 DEFINITION Avian myelocytomatosis retrovirus v-myc gene, partial cds. ACCESSION M35624 KEYWORDS oncogene; v-myc protein. SOURCE Avian myelocytomatosis retrovirus (mutant MC29-10A) RNA. ORGANISM Avian myelocytomatosis retrovirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian myelocytomatosis viruses. REFERENCE 1 (bases 1 to 333) AUTHORS Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and Patschinsky,T. TITLE Structure of mutant and wild-type MC29 v-myc alleles and biochemical properties of their protein products JOURNAL Oncogene 1, 97-109 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 333 v-myc protein (AA at 1) BASE COUNT 72 a 130 c 90 g 41 t ORIGIN 1 ggcctctacc tgcacgacct gggagccgcg gccgccgact gcatcgaccc ctcggtggtc 61 ttcccctacc cgctcagcga gcgcgccccg cgggccgccc cgcccggcgc caaccccgcg 121 gctctgctgg gggtcgacac gccgcccacg atccaccaac acaactacgc tgctcctccc 181 tccaccaagg tggaataccc agccgccaag aggctaaagt tggacagtgg cagggtcctc 241 aaacagatca gcaacaaccg aaaatgctcc agtccccgca cgttagactc agaggagaac 301 gacaagaggc gaacgcacaa cgtcttggag cgc // LOCUS ACMVMYCA 202 bp ss-RNA VRL 01-AUG-1990 DEFINITION Avian myelocytomatosis retrovirus v-myc gene, partial cds. ACCESSION M35625 KEYWORDS oncogene; v-myc protein. SOURCE Avian myelocytomatosis retrovirus (mutants MC29-10C and 10H) RNA. ORGANISM Avian myelocytomatosis retrovirus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Avian myelocytomatosis viruses. REFERENCE 1 (bases 1 to 202) AUTHORS Bister,K., Trachmann,C., Jansen,H.W., Schroeer,B. and Patschinsky,T. TITLE Structure of mutant and wild-type MC29 v-myc alleles and biochemical properties of their protein products JOURNAL Oncogene 1, 97-109 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 202 v-myc protein (AA at 1) BASE COUNT 34 a 77 c 62 g 29 t ORIGIN 1 ggcctctacc tgcacgacct gggagccgcg gccgccgact gcatcgaccc ctcggtcgtc 61 ttcccctacc cgctcagcga gcgcgccccg cgggccgccc cgcccgacga caagaggcga 121 acgcacaacg tcttggagcg ccagcgaagg aatgagctga agctgcgttt ctttgccctg 181 cgtgaccaga tacccgaggt gg // LOCUS HUM3BHSD 1565 bp ss-mRNA PRI 01-AUG-1990 DEFINITION Human placental 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase mRNA, complete cds. ACCESSION M35493 KEYWORDS 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase. SOURCE Human placenta, cDNA to mRNA, clone H3-beta-hp6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1565) AUTHORS Lorence,M.C., Murry,B.A., Trant,J.M. and Mason,J.I. TITLE Human 3-beta-hydroxysteroid dehydrogenase/delta-5->4isomerase from placenta: Expression in nonsteroidogenic cells of a protein that catalyzes the dehydrogenation/isomerization of C21 and C19 steroids JOURNAL Endocrinology 126, 2493-2498 (1990) STANDARD simple staff_review FEATURES from to/span description pept 31 1152 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase mRNA < 1 1565 3-beta-hydroxysteroid dehydrogenase/5-4-isomerase BASE COUNT 417 a 381 c 376 g 391 t ORIGIN 1 gcggagtgat tcctgctact ttggatggcc atgacgggct ggagctgcct tgtgacagga 61 gcaggagggt ttctgggaca gaggatcatc cgcctcttgg tgaaggagaa ggagctgaag 121 gagatcaggg tcttggacaa ggccttcgga ccagaattga gagaggaatt ttctaaactc 181 cagaacaaga ccaagctgac agtgctggaa ggagacattc tggatgagcc attcctgaag 241 agagcctgcc aggacgtctc ggtcatcatc cacaccgcct gtatcattga tgtcttcggt 301 gtcactcaca gagagtctat catgaatgtc aatgtgaaag gtacccagct cctgttagag 361 gcctgtgtcc aagctagtgt gccagtcttc atctacacca gtagcataga ggtagccggg 421 cccaactcct acaaggaaat catccagaat ggccatgaag aagagcctct ggaaaacaca 481 tggcccgctc catacccaca cagcaaaaag cttgctgaga aggctgtact ggcggctaac 541 gggtggaatc tgaaaaacgg cggcaccctg tacacttgtg ccttacgacc catgtatatc 601 tatggggaag gaagccgatt cctttctgct agtataaacg aggccctgaa caacaatggg 661 atcctgtcaa gtgttggaaa gttctccact gttaacccag tctatgttgg caatgtggcc 721 tgggcccaca ttctggcctt gagggccctg caggacccca agaaggcccc aagcatccga 781 ggacagttct actatatctc agatgacacg cctcaccaaa gctatgataa ccttaattac 841 accctgagca aagagttcgg cctccgcctt gattccagat ggagctttcc tttatccctg 901 atgtattgga ttggcttcct gctggaaata gtgagcttcc tactcaggcc aatttacacc 961 tatcgaccgc ccttcaaccg ccacatagtc acattgtcaa atagcgtatt caccttctct 1021 tataagaagg ctcagcgaga tctggcgtat aagccactct acagctggga ggaagccaag 1081 cagaaaacgg tggagtgggt tggttccctt gtggaccggc acaaggagac cctgaagtcc 1141 aagactcagt gatttaagga tgacagagat gtgcatgtgg gtattgttag gagatgtcat 1201 caagctccac cctcctggcc tcatacagaa agtgacaagg gcacaagctc aggtcctgct 1261 gcctcccttt catacaatgg ccaacttatt gtattcctca tgtcatcaaa acctgcgcag 1321 tcattggccc aacaagaagg tttctgtcct aatcatatac cagaggaaag accatgtggt 1381 ttgctgttac caaatctcag tagctgattc tgaacaattt agggactctt ttaacttgag 1441 ggtcgttttg actactagag ctccatttct actcttaaat gagaaaggat ttcctttctt 1501 tttaatcttc cattccttca catagtttga taaaaagatc aataaatgtt tgaatgttta 1561 atgtg // LOCUS HUMMHB7B 1089 bp ss-mRNA PRI 01-AUG-1990 DEFINITION Human class I HLA-B7 mRNA, complete cds. ACCESSION M35444 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Human, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1089) AUTHORS Parham,P., Benjamin,R.J., Chen,B.P., Clayberger,C., Ennis,P.D., Krensky,A.M., Lawlor,D.A., Littman,D.R., Norment,A.M., Orr,H.T., Salter,R.D. and Zemmour,J. TITLE Diversity of class I HLA molecules: Functional and evolutionary interactions with T cells JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 529-543 (1989) STANDARD simple staff_review FEATURES from to/span description pept 1 1089 MHC HLA-B7 /hgml_locus_uid="LX0031C" /nomgen="HLA-A" /map="6p21.3" BASE COUNT 218 a 335 c 363 g 173 t ORIGIN 1 atgctggtca tggcgccccg aaccgtcctc ctgctgctct cggcggccct ggccctgacc 61 gagacctggg ccggctccca ctccatgagg tatttctaca cctccgtgtc ccggcccggc 121 cgcggggagc cccgcttcat ctcagtgggc tacgtggacg acacccagtt cgtgaggttc 181 gacagcgacg ccgcgagtcc gagagaggag ccgcgggcgc cgtggataga gcaggagggg 241 ccggagtatt gggaccggaa cacacagatc tacaaggccc aggcacagac tgaccgagag 301 agcctgcgga acctgcgcgg ctactacaac cagagcgagg ccgggtctca caccctccag 361 agcatgtacg gctgcgacgt ggggccggac gggcgcctcc tccgcgggca tgaccagtac 421 gcctacgacg gcaaggatta catcgccctg aacgaggacc tgcgctcctg gaccgccgcg 481 gacaccgcgg ctcagatcac ccagcgcaag tgggaggcgg cccgtgaggc ggagcagcgg 541 agagcctacc tggagggcga gtgcgtggag tggctccgca gatacctgga gaacgggaag 601 gacaagctgg agcgcgctga ccccccaaag acacacgtga cccaccaccc catctctgac 661 catgaggcca ccctgaggtg ctgggccctg ggtttctacc ctgcggagat cacactgacc 721 tggcagcggg atggcgagga ccaaactcag gacactgagc ttgtggagac cagaccagca 781 ggagatagaa ccttccagaa gtgggcagct gtggtggtgc cttctggaga agagcagaga 841 tacacatgcc atgtacagca tgaggggctg ccgaagcccc tcaccctgag atgggagccg 901 tcttcccagt ccaccgtccc catcgtgggc attgttgctg gcctggctgt cctagcagtt 961 gtggtcatcg gagctgtggt cgctgctgtg atgtgtagga ggaagagttc aggtggaaaa 1021 ggagggagct actctcaggc tgcgtgcagc gacagtgccc agggctctga tgtgtctctc 1081 acagcttga // LOCUS MUSMUPE 872 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse major urinary protein mRNA, complete cds. ACCESSION M28649 KEYWORDS major urinary protein. SOURCE Mouse liver, cDNA to mRNA, clones 8-1 and 13-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 872) AUTHORS Bennett,A.L., Paulson,K.E., Miller,R.E. and Darnell,J.E.Jr. TITLE Aquisition of antigens characteristic of adult pericentral hepatocytes by differentiating fetal hepatoblasts in vitro JOURNAL J. Cell Biol. 105, 1073-1085 (1987) STANDARD simple staff_review FEATURES from to/span description pept 65 601 major urinary protein mRNA 43 872 major urinary protein BASE COUNT 266 a 188 c 170 g 248 t ORIGIN 1 gccacgatca caagaaagat gtggtcctga cagacagaca atcctattcc ctaccaaaat 61 gaagatgctg ctgctgctgt gtttgggact gaccctagtc tgtgtccatg cagaagaagc 121 tagttctacg ggaaggaact ttaatgtaga aaagattaat ggggaatggc atactattat 181 cctggccttt gacaaaagag aaaagataga agataatggc aactttagac tttttctgga 241 gcaaatccat gtcttggaga attccttagt tcttaaattc catactgtaa gagatgaaga 301 gtgctcggaa ttatctatgg ttgctgacaa aacagaaaag gctggtgaat attctgtgac 361 gtatgatgga ttcaatacat ttactatacc taagacagac tatgataact ttcttatggc 421 tcatctcatt aacgaaaatg atggggaaac cttccagctg atggggctct atggccgaga 481 accagatttg agttcagaca tcaaggaaag gtttgcacaa ctatgtgaga agcatggaat 541 ccttagagaa aatatcattg acctatccaa tgccaatcgc tgcctccagg cccgagaatg 601 aagaatggcc tgagcctcca gtgttgagtg gagacttctc accaggactc caccatcatc 661 ccttcctatc catacagcat ccccagtata aattctgtga tctgcattcc atcctgtctc 721 actgagaagt ccaattccag tctatccaca tgttacctag gatacctcat caagaatcaa 781 agacttcttt aaatttttct ttgatatacc catgacaatt tttcatgaat ttcttcctct 841 tcctgttcaa taaatgatta cccttgcact ta // LOCUS RATMHREC 1552 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Rat MHC class I IgG Fc region receptor large subunit p51 (FcRn) mRNA, complete cds. ACCESSION M35495 KEYWORDS IgG Fc region receptor large subunit p51; cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Rat 11 day old epithelium, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1552) AUTHORS Simister,N.E. and Mostov,K.E. TITLE Cloning and expression of the neonatal rat intestinal FC receptor, a major histocompatibilty complex class I antigen homolog JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 571-580 (1989) STANDARD simple staff_review FEATURES from to/span description pept 205 1305 IgG Fc region receptor large subunit p51 (FcRn) precursor sigp 205 270 IgG Fc region receptor large subunit p51 signal peptide matp 271 1302 IgG Fc region receptor large subunit p51 mRNA < 1 1552 FcRn mRNA BASE COUNT 312 a 420 c 443 g 377 t ORIGIN 1 tcagttctgt aattaattaa ctaacgtgga tcaaatgaga aggtgaaagt tcacacagga 61 gcactcctgt cgtcttggac tgggtctcca tcccaccatc cagtgccctg gtctacgaag 121 agtccacagg gaccttgtga agaatcaaca aggcggggtc cagaggagtc acgtgtgcct 181 tccactccgg gtcgccctgt caggatgggg atgtcccagc ccggggtcct cctcagcctc 241 ttattggtcc tcctgcctca gacctgggga gcggagcccc gtctcccact gatgtatcat 301 cttgcagctg tgtctgactt atcaacgggg cttccctctt tctgggccac gggctggctg 361 ggtgctcagc aatatctgac ctacaacaac ctgcggcagg aggctgaccc ctgtggggcc 421 tggatatggg aaaaccaggt gtcttggtat tgggagaagg agaccacgga tctgaaaagc 481 aaagaacagc tcttcttgga ggccatcagg accctggaga accaaataaa tgggaccttc 541 acactgcagg gcctgctggg ctgtgaactg gcccctgata attcttcatt gcccacggct 601 gtgtttgccc tcaatggtga ggagttcatg cggttcaacc caagaacggg caactggagt 661 ggggagtggc cggagacaga tatcgttggt aatctgtgga tgaagcaacc tgaggcggcc 721 aggaaggaga gcgagttcct gctaacttct tgtcctgagc ggctgctagg ccacctggag 781 aggggccgtc agaacctgga gtggaaggag ccgccatcta tgcgcctgaa ggcccgtcct 841 ggcaactctg gctcctcagt actgacctgt gctgctttct ccttctaccc gccggagctc 901 aagtttcgat tcctgcgcaa tgggctagcc tcaggctctg ggaattgcag cactggtccc 961 aatggtgatg gatctttcca tgcatggtca ttgctagagg tcaaacgtgg agatgaacac 1021 cattaccaat gtcaagtgga gcatgagggg ctggcccagc ctctcactgt ggacctagat 1081 tcgcccgcca gatcttctgt gcctgtggtc ggaatcattc ttggtttatt gctggtggta 1141 gtggccatcg cagggggtgt gctgctatgg aacaggatgc gaagtgggct gccagcccca 1201 tggctttctc tcagtggtga tgactctggc gacctattgc ctggtgggaa cttgcccccg 1261 gaggctgaac ctcaaggtgt aaatgccttt ccggccactt cctgatgcca acccaggccc 1321 catacccatt gcagcctgtg gggctgtgtg acctcctgaa ctgtctctga gcctcccgag 1381 ggagccctgg gctggatgtc ctcctcgtgg atcccttctt ttgtggcctg cttcagtttc 1441 ccctcttaat gtcaatggct atttccatct ccacataaat ttgggcccaa atctgtgtgt 1501 gcatcgttat tctcaggttt caggcagccg gaataaattg aacaagtttg ag // LOCUS YSCATP10 2343 bp ds-DNA PLN 01-AUG-1990 DEFINITION S.cerevisiae ATP10 (essential for mitochondrial ATPase complex assembly) gene, complete cds. ACCESSION J05463 KEYWORDS . SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2343) AUTHORS Ackerman,S.H. and Tzagoloff,A. TITLE ATP10, a yeast nuclear gene required for the assembly of the mitochondrial F1-Fo complex JOURNAL J. Biol. Chem. 265, 9952-9959 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Tzagoloff, 19-JUN-1990. The gene sequence submitted codes for a protein that is essential for the biosynthesis of the F1-F0 ATPase complex of the mitochondrial inner membrane. Bases 1 to 977 are shown on the complementary strand as shown in Fig. 6. FEATURES from to/span description pept 629 < 1 (c) ORF pept 976 680 (c) ORF pept 1444 2238 ATP10 protein BASE COUNT 692 a 447 c 490 g 714 t ORIGIN 1 agatcttttg gctcaggtat aaattcgaac gtctcgattt cccttatcag tttatggaat 61 ggcttaaacc aagatgaaga tttccctagc tctaattgaa cttgcaccat atatataact 121 tttccaaaag agtaaaaata caaatccaga ttatcgattt tattaaattc ttgccaatga 181 ctattgaacg taggtgggag tcgggcatta cttcttgtca caaacgctac tgtctttgcc 241 gtatgatttt tcagacattc aggctttctg ggaaacttgt caaattgaaa gctataatta 301 tatgaacctg gtttaacttt gaacggcttg gaggagccat caagagcatt ccatacatta 361 tctggaggga aaactctctg ttcgaatttc attaatgtat gaaaggattt gttgtcttgg 421 cccggcatca tcatgccatt ctgttgaaac atgtactctt gatcaatttt tgttaaggtc 481 tctgagaatc cttttagaat gacggaaatt ttccttatag atagcgcttt tgttaactga 541 agactaacta tccctgacat ttgatcatta gagctataaa actccccgtt gtacggtggg 601 tttaaggata ttgaaatttt tggagccatg gtttgacaaa ctgtatggtt ctcaaccttc 661 tctaatcaaa agcagaatct taaatataaa cactcacaga atatccgttg gtcaatgaag 721 taattctcct ttgtactggc tgctttttct cctctagttt atgtaattct acttttggat 781 gggtgcgact gcttttaatt gattgagtgg cggtgttaga agggctgtag agtcgaaggc 841 ttgtttctct cttacgcacc tcttgtgaaa agggcgtgca ccttccccag gaccctctct 901 caccctcaac ccgcattttg ctgagaattt tcaccaaggc cctaggtgat attagattcc 961 acctgactaa ttgcattaca gccgacccaa ggcaatatca gtttaataaa atatcatgta 1021 tctcaccctc ttcttggtat tagtaaagag acgcctgatc ttgtaacagt ggtgaagatt 1081 gtactagagc agaatcaaga atttaaaagt gtaaggcagg cagaggcgat gtacataaac 1141 ttcgaagtaa gaaatattta atagttctcg ccacatcact atgcagctat ataaaaacta 1201 ctataaacgt ttgttttgtt ccttacgcac aatatccttg cctagaaatc gtttttgaaa 1261 tttaaatttt tattaccatt tatttgattc gccttcagaa aaatatggaa gagtgcatat 1321 ttaaaaagga ctatttcagc atatagtaaa agtcaggtta tttgtttatt tgcgatatca 1381 gagtaactta aactaactat gcagggcact tttaaaaggt tttaccatcc cacgcttacg 1441 cggatgtcct tcttggataa attcctcaag cctatgatgg caacggcttc cccaaaggaa 1501 taccagatca aacaactggt caagccaata ggcttaacac aagcaccaag gaaaagcacc 1561 aaatactccc aggggaactc tttgagggat atgtttgatt cggaaaagac aaaccacaga 1621 gttaaagagt tggccgttga attcagcaaa tctggacttt atgacgtgca agtcttccaa 1681 aagacaaagg ggaaattgtt tatagctcca gtttcatatt ggaaagaaga taaagctttg 1741 ttttttcctc atttgatagg aacggcaatg gatggtacga aacaacagaa tatcgaggat 1801 atgttaaggg gtaaaaccag tatagtgagg ttatttagta cagcatctgg cgataagttg 1861 agtagttcat acttccaagg aatcgtagac gataacaaaa aaactgacta cttgactgaa 1921 gctgatgcgc gtttaagttt aaatgacagt aacgtccaaa tcatcgaggt caatcttgta 1981 gaaaacgctg tgaaaagtgc tctagtgaaa acgcttgctc gttgggccaa tcgcgttcca 2041 tcctggcgcc agccatttta tttcgaatat tctagaggcc aatggccatt ttccgtcagg 2101 gaagagctct tttgcaataa tgtcttttct ggatacgtct ttcttgtgga ccagcagtta 2161 aaaattaggt gggcagcttg cggggaggct actccatctg aaaaggaagc attgtggaag 2221 tttgccaaac gtctgtgaag ttgacgcttt gtgcggcggc caacaaggga tgggcggcta 2281 tttggcgatc cacaggacgg gtgtggtcgc catgatcgcg tagtcgatag tggctccaag 2341 tag // LOCUS DDIDPYK1A 1090 bp ss-mRNA INV 01-AUG-1990 DEFINITION D.discoideum protein-tyrosine kinase-1 (DPYK1) mRNA, complete cds. ACCESSION M33785 KEYWORDS protein-tyrosine kinase-1. SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1090) AUTHORS Tan,J.L. and Spudich,J.A. TITLE Developmentally regulated protein-tyrosine kinase genes in Dictyostelium discoideum JOURNAL Mol. Cell. Biol. 10, 3578-3583 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Tan, 20-APR-1990. FEATURES from to/span description pept < 1 1014 protein-tyrosine kinase-1 (DPYK1; AA at 1) BASE COUNT 371 a 198 c 186 g 335 t ORIGIN 1 cgcccatttg gtggttggga aactcaatca tcattatcac atccaccatc acgtccacca 61 ccacctccac caccaccacc acaactacca gttagatcag aatacgagat tgatttcaat 121 gaattagaat ttggtcaaac cattggtaaa ggtttctttg gtgaagtaaa gagaggttat 181 tggagagaga ctgatgttgc cataaaaatc atctatcgtg atcaattcaa aaccaaatca 241 tcattggtta tgtttcaaaa tgaagttgga atactaagta aattaagaca tccaaatgta 301 gttcaatttt tgggtgcatg tactgcagga ggtgaagatc atcattgtat agtaacagaa 361 tggatgggtg gaggtagttt aagacagttc ttgactgatc atttcaattt actcgaacaa 421 aatccacata ttcgtttgaa gttggctttg gatattgcaa aaggaatgaa ttatctacat 481 ggttggactc cacccattct tcatcgtgac ttatcctcaa gaaacatttt attggatcac 541 aacatcgatc caaagaatcc gttagtttcc tcaagacaag atattaaatg taagatctct 601 gattttggtc taagtagatt aaagaaggaa caagcctctc aaatgactca atcggttggt 661 tgtattccct acatggcacc agaggttttc aaaggcgata gtaatagtga aaagagtgat 721 gtttactcct atggcatggt tttgtttgaa ctattaacct ctgatgaacc tcaacaagat 781 atgaaaccaa tgaaaatggc tcacttggct gcttatgaat cttatcgtcc tccaattcca 841 ttaactacct cttccaagtg gaaagaaatt ctaactcaat gttgggattc taatcctgat 901 agtcgtccaa cctttaaaca aatcattgtt catctcaaag aaatggaaga tcaaggtgta 961 tcttcttttg catctgtacc tgttcaaact attgatactg gtgtttatgc ttaatttttt 1021 ttttataatt aaaaaaaaaa aaaacaaaac aaaaaaaaaa aataataata aatataatca 1081 cttcaactcg // LOCUS DDIDPYK2A 1291 bp ss-mRNA INV 01-AUG-1990 DEFINITION D.discoideum protein-tyrosine kinase-2 (DPYK2) mRNA, complete cds. ACCESSION M33784 KEYWORDS protein-tyrosine kinase-2. SOURCE D.discoideum (strain AX-3) 4-hour, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1291) AUTHORS Tan,J.L. and Spudich,J.A. TITLE Developmentally regulated protein-tyrosine kinase genes in Dictyostelium discoideum JOURNAL Mol. Cell. Biol. 10, 3578-3583 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.L.Tan, 20-APR-1990. FEATURES from to/span description pept < 1 1233 protein-tyrosine kinase-2 (DPYK2; AA at 1) BASE COUNT 491 a 207 c 203 g 390 t ORIGIN 1 cgattctaca atacaacaaa ctctactaaa gatatcacat ttttagtttg tgataatcct 61 gattcaacta aagaaaagag taacgtttca aatacttcat caataatttc cgcttcaaat 121 ttaaatagac atataacacc aaattctcat atgagaccta gaggtagatc aatttctgaa 181 tctttaatta tgtcaccaat taataaagaa tctttaaatg atattcaaag agcaattgaa 241 agtgaaaaaa taaagaaaac taaatttgaa gaattaaaat caatattggg cgaaagagaa 301 tatataattg atataaatga tattcaattt atacaaaaag ttggagaagg tgcattcagt 361 gaagtttggg aaggttggtg gaaaggtatt catgttgcca taaaaaagtt aaagattata 421 ggagatgaag aacaattcaa agagagattc attagagagg ttcaaaattt gaaaaaagga 481 aatcatcaaa acattgtcat gtttattggt gcatgttata aaccagcatg tatcataaca 541 gagtatatgg caggtggtag tctttacaat atacttcata atccaaatag ttccactcca 601 aaagttaaat attctttccc attggttttg aaaatggcaa ccgacatggc attgggctta 661 ttacatcttc attccatcac cattgtgcat cgtgatttaa ccagtcaaaa cattctattg 721 gatgaattgg gtaatataaa gatctctgat tttggtttat ctgctgaaaa gagtagagaa 781 ggttcaatga caatgacaaa tggtggcatt tgcaatccaa gatggagacc acccgaattg 841 acaaagaatt taggtcacta ctcggaaaag gttgatgtct attgtttctc tctagtagtt 901 tgggaaattt taactggcga aattcctttc tctgatttag atggatctca acgatccgct 961 caagtagctt atgctggttt aagaccacca ataccagagt attgcgatcc tgaattaaaa 1021 ttactcttaa ctcaatgttg ggaggctgat ccaaatgata gacctccctt tacctatata 1081 gtaaacaaat taaaagaaat ctcttggaat aatccaattg gtttcgtctc tgatcaattc 1141 tatcaatata gcgaaccttc aactccaaga ttagcattat caaatcaatc ttcaaattca 1201 agtagtattt ctttatcacc aactaaatta taaaaaaaaa aaaaaaaaaa aacaaatttc 1261 aaacaccaaa caccaccact catcaaaatc g // LOCUS HUMSPTB 6765 bp ss-mRNA PRI 01-AUG-1990 DEFINITION Human beta-spectrin (SPTB) mRNA, complete cds. ACCESSION J05500 KEYWORDS beta-spectrin; spectrin. SOURCE Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and V252. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 6765) AUTHORS Winkelmann,J.C., Chang,J.-G., Tse,W.T., Scarpa,A.L., Marchesi,V.T. and Forget,B.G. TITLE Full length sequence of the cDNA for human erythroid beta-spectrin JOURNAL J. Biol. Chem. 265, 11827-11832 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Winkelmann, 08-MAY-1990. FEATURES from to/span description pept 96 6509 beta-spectrin /nomgen="SPTB" /map="14" /hgml_locus_uid="LS0033T" mRNA < 1 6765 SPTB mRNA signal 6716 6722 poly-A signal BASE COUNT 1626 a 1822 c 2146 g 1171 t ORIGIN Chromosome 14q23-q24. 1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag 61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa 121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg 181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg 241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg 301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca 361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc 421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac 481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc 541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag 601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag 661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta 721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca 781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc 841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct 901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc 961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg 1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca 1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca 1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca 1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag 1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg 1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc 1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata 1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga 1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg 1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc 1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg 1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg 1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga 1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg 1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc 1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg 1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg 2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc 2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc 2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt 2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga 2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc 2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg 2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc 2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat 2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgcaggcc ctgcgggagc 2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg 2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga 2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca 2761 ggttcgacat cctggaccag gagatgaaga ccttgatgac tcagattgat ggtgtgaacc 2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc 2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg 2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt 3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag 3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg 3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga 3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat 3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg 3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca 3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg 3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc 3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg 3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg 3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat 3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga 3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg 3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg 3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc 3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct 3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc 4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc 4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc 4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa 4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg 4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcggtcagac gacccgggca 4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga 4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg 4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa 4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg 4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta 4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg 4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg 4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc 4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt 4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct 4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc 4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg 5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg 5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca 5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa 5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc 5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca 5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg 5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc 5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga 5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca 5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg 5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg 5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc 5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct 5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc 5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca 5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc 6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt 6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc 6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg 6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca 6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg 6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga 6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg 6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac 6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg 6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg 6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag 6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa 6721 attttattgt ataaatatat caccatttac atgaggggaa acact // LOCUS STYEUTBC 2526 bp ds-DNA BCT 01-AUG-1990 DEFINITION S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes, complete cds. ACCESSION J05518 KEYWORDS ethanolamine ammonia-lyase. SOURCE S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2526) AUTHORS Faust,L.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M. TITLE Cloning, sequencing and expression of the genes encoding the alcohol-dependent ethanolamine ammonia-lyase of Salmonella typhimurium JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.M.Babior, 08-MAY-1990. FEATURES from to/span description pept 141 1499 ethanolamine ammonia-lyase (eutB) pept 1518 2378 ethanolamine ammonia-lyase (eutC) binding 130 133 ribosome binding site binding 1507 1510 ribosome binding site BASE COUNT 563 a 687 c 779 g 497 t ORIGIN 1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga 61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt 121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt 181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg 241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa 301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc 361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca 421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc 481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc 541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc 601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg 661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc 721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga 781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc 841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct 901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt 961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct 1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg 1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc 1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga 1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc 1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc 1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg 1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg 1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat 1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg 1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc 1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg 1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc 1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg 1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg 1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc 1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg 1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga 2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact 2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca 2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg 2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc 2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg 2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga 2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg 2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt 2521 gcctcc // LOCUS XELPCNA 1018 bp ss-mRNA VRT 01-AUG-1990 DEFINITION X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete cds. ACCESSION M34080 KEYWORDS nuclear protein; proliferating cell nuclear antigen. SOURCE X.laevis oocyte, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1018) AUTHORS Leibovici,M., Gusse,M., Bravo,R. and Mechali,M. TITLE Characterization and developmental expression of Xenopus proliferating cell nuclear antigen (PCNA) JOURNAL Dev. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Leibovici, 08-MAY-1990. FEATURES from to/span description pept 28 813 proliferating cell nuclear antigen (PCNA) mRNA < 1 1018 PCNA mRNA BASE COUNT 284 a 223 c 237 g 274 t ORIGIN 1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc 61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca 121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact 181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag 241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg 301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt 361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa 421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc 481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca 541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa 601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat 661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca 721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg 781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc 841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa 901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa 961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac // LOCUS YSCAAC2A 1333 bp ds-DNA PLN 01-AUG-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete cds. ACCESSION M34076 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1333) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator in yeast JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. FEATURES from to/span description pept 235 1158 ADP/ATP-translocator protein (AAC2) BASE COUNT 388 a 209 c 301 g 435 t ORIGIN 1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc 61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg 121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa 181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt 241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt 301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat 361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt 421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat 481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg 541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct 601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc 661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg 721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc 781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct 841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg 901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga 961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa 1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg 1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc 1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc 1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag 1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc 1321 agccactgtt gtt // LOCUS YSCAAC3 1308 bp ds-DNA PLN 01-AUG-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete cds. ACCESSION M34075 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1308) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator in yeast JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. FEATURES from to/span description pept 78 1034 ADP/ATP-translocator protein (AAC3) BASE COUNT 353 a 228 c 263 g 464 t ORIGIN 1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata 61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc 121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg 181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag 241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca 301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta 361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt 421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg 481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat 541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg 601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac 661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga 721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt 781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa 841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga 901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag 961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta 1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc 1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata 1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa 1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc 1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt // LOCUS HUMMTVA1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 13) mitochondrial DNA sequences, 5' end. ACCESSION M28909 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 121 a 123 c 42 g 80 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat cacccccctc 361 agatagg // LOCUS HUMMTVA2 361 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 13) mitochondrial DNA sequences, 3' end. ACCESSION M28910 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 361) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 107 a 102 c 54 g 93 t 5 others ORIGIN 1 tttggtattt tcgtctgggg ggtgtgcacg cgatagcatt gcgagacgct ggagccggag 61 caccctatgt cgcagtatct gtctttgatt cctgccccat cctattattt atcgcaccta 121 cgttcaatat tacaggcgaa catacnctac taaagtgtgt taattaatta atgcttgtag 181 gacataataa taacaattaa atgtctgcac agccactttc cacacagaca tcataacaaa 241 aaatttncca ccaaaccccc ccnnntcccc ccgcttctgg ccacagcact taaacacatc 301 tctgccaaac cccaaaaaca aagaacccta acaccagcct aaccagattt caaattttat 361 c // LOCUS HUMMTVB1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 14) mitochondrial DNA sequences, 5' end. ACCESSION M28911 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 121 a 120 c 43 g 82 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acatatcaac cgtaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagg // LOCUS HUMMTVB2 356 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 14) mitochondrial DNA sequences, 3' end. ACCESSION M28912 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 356) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 105 a 104 c 52 g 90 t 5 others ORIGIN 1 ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc tggagccgga gcaccctatg 61 tcgcagtatc tgtctttgat tcctgcccca tcccattatt tatcgcacct acgttcaata 121 ttacaggcga acatacncta ctaaagtgtg ttaattaatt aatgcttgta ggacataata 181 ataacaattn aatgtctgca cagccacttt ccacacagac atcataacaa aaaatttncc 241 accaaacccc ccccnntccc cccgcttctg gccacagcac ttaaacacat ctctgccaaa 301 ccccaaaaac aaagaaccct aacaccagcc taaccagatt tcaaatttta tctttt // LOCUS HUMMTVC1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 11) mitochondrial DNA sequences, 5' end. ACCESSION M28905 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 119 a 123 c 45 g 79 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtacg gcaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagg // LOCUS HUMMTVC2 371 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 11) mitochondrial DNA sequences, 3' end. ACCESSION M28906 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 371) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 109 a 107 c 55 g 94 t 6 others ORIGIN 1 tctnccatgc atttggtatt ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc 61 tggagccgga gcaccctatg tcgcagcacc tgtctttgat tcctgcccca ttccattatt 121 tatcgcacct acgttcaata ttacaggcga acatacncta ctaaagtgtg ttaattaatt 181 aatgcttgta ggacataata ataacaatta aatgtctgca cagccacttt ccacacagac 241 atcataacaa aaaatttncc accaaacccc cccnnntccc cccgcttctg gccacagcac 301 ttaaacacat ctctgccaaa ccccaaaaac aaagaaccct aacaccagcc taaccagatt 361 tcaaatttta t // LOCUS HUMMTVD1 368 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 12) mitochondrial DNA sequences, 5' end. ACCESSION M28907 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 368) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 119 a 122 c 46 g 80 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacggtac cataaatact 121 tgaccacctg tagtacataa aaacccanac ccacatcaaa accctccccc catgcttaca 181 agcaagcaca gcaatcaacc ttcaactgtc acacatcaac tgcaactcca aagccacccc 241 tcacccacta ggatatcaac aaacctactc acccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agataggg // LOCUS HUMMTVD2 375 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 12) mitochondrial DNA sequences, 3' end. ACCESSION M28908 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 375) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 109 a 104 c 59 g 97 t 6 others ORIGIN 1 ggaggctctn ccatgcattt ggtattttcg tctggggggt gtgcacgcga tagcattgcg 61 agacgctgga gccggagcac cctatgtgca gtatctgtct ttgattcctg ccccattcca 121 ttatttatcg cacctacgtt caatattaca ggcgagcata cnctattaaa gtgtattaat 181 taattaatgc ttgtaggaca taataataac aattaaatgt ctgcacagcc actttccaca 241 cagatcataa caaaaaattt nccaccaaac ccccccnnnt ccccccgctt ctggccacag 301 cacttaaaca catctctgcc aaaccccaaa aacaaagaac cctaacacca gcctaaccag 361 atttcaaatt ttatc // LOCUS HUMMTVE1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 1-4) mitochondrial DNA sequences, 5' end. ACCESSION M28893 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 120 a 121 c 44 g 81 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acacattaac cgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagg // LOCUS HUMMTVE2 362 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 1-4) mitochondrial DNA sequences, 3' end. ACCESSION M28894 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 362) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 108 a 102 c 54 g 93 t 5 others ORIGIN 1 catttggtat tttcgtctgg ggggtgtgca cgcgatagca ttgcgagacg ctggagccgg 61 agcaccctat gtcgcagtat ctgtctttga ttcctgcccc atcctattat ttatcgcacc 121 tacgttcaat attacaggcg aacatacnct actaaagtgt gttaattaat taatgcttgt 181 aggacataat aataacaatt aaatgtctgc acagccactt tccacacaga catcataaca 241 aaaaatttnc caccaaaccc ccccnnntcc ccccgcttct ggccacagca cttaaacaca 301 tctctgccaa accccaaaaa caaagaaccc taacaccagc ctaaccagat ttcaaatttt 361 at // LOCUS HUMMTVF1 369 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 7) mitochondrial DNA sequences, 5' end. ACCESSION M28899 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 369) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 120 a 122 c 47 g 79 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatc tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ctcaactgtc atacatcaac cgcaactcca aagccactcc 241 tcagccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagggg // LOCUS HUMMTVF2 371 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 7) mitochondrial DNA sequences, 3' end. ACCESSION M28900 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 371) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 109 a 107 c 55 g 94 t 6 others ORIGIN 1 tctnccatgc atttggtatt ttcgtctggg gggtgtgcac gcgatagcat tgcgagacgc 61 tggagccgga gcaccctatg tcgcagtatc tgtctttgat tcctgcccca tcccattatt 121 tatcgcacct acgttcaata ttacaggcga acatacncta ccaaagtgtg ttaattaatt 181 aatgcttgta ggacataata ataacaatta aatgtctgca cagccacttt ccacacagac 241 atcataacaa aaaatttncc accaaacccc cccnnntccc cccgcttctg gccacagcac 301 ttaaacacat ctctgccaaa ccccaaaaac aaagaaccct aacaccagcc taaccagatt 361 tcaaatttta t // LOCUS HUMMTVG1 340 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 5) mitochondrial DNA sequences, 5' end. ACCESSION M28895 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 340) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 114 a 111 c 39 g 75 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatc tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gtaatcaacc ctcaactgtc atacatcaac cgcaactcca aagccacccc 241 tcagccacta ggataccaac aaacctaccc acccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt // LOCUS HUMMTVG2 349 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 5) mitochondrial DNA sequences, 3' end. ACCESSION M28896 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 349) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 106 a 103 c 52 g 83 t 5 others ORIGIN 1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc 61 tatgtcgcag tatctgtctt tgattcctgc cccatcccat tatttatcgc acctacgttc 121 aatattacag gcgaacatac nctaccaaag tgtgttaatt aattaatgct tgtaggacat 181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt 241 tnccaccaaa cccccccnnn tccccccgct tctggccaca gcacttaaac acatctctgc 301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaa // LOCUS HUMMTVH1 348 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 8) mitochondrial DNA sequences, 5' end. ACCESSION M28901 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 348) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 115 a 115 c 38 g 79 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acacattaac tgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatcc // LOCUS HUMMTVH2 355 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 8) mitochondrial DNA sequences, 3' end. ACCESSION M28902 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 355) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 107 a 101 c 52 g 90 t 5 others ORIGIN 1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc 61 tatgtcgcag tatctgtctt tgattcctgc cccatcctat tatttatcgc acctacgttc 121 aatattacag gcgaacatac nctactaaag tgtgttaatt aattaatgct tgtaggacat 181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt 241 tnccaccaaa cccccccnnn tccccccgct tctggccaca gcacttaaac acatctctgc 301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaat tttat // LOCUS HUMMTVI1 367 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 6) mitochondrial DNA sequences, 5' end. ACCESSION M28897 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 367) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 121 a 121 c 43 g 81 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccaccta tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gtaatcaacc ttcaactgtc acacatcaac cgcaactcca aagccacccc 241 tcacccacta ggataccaac aaacctaccc atccttaaca gtacatagca cataaagcca 301 tttaccgtac atagcacatt acagtcaaat cccttctcgt ccccatggat gacccccctc 361 agatagg // LOCUS HUMMTVI2 358 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 6) mitochondrial DNA sequences, 3' end. ACCESSION M28898 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 358) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 106 a 102 c 54 g 91 t 5 others ORIGIN 1 ttggtatttt cgtctggggg gtgtgcacgc gatagcattg cgagacgctg gagccggagc 61 accctatgtc gcagtatctg tctttgattc ctgccccatc ccattattta tcgcacctac 121 gttcaatatt acaggcgaac atacnctact aaagtgtgtt aattaattaa tgcttgtagg 181 acataataat aacaattaaa tgtctgcaca gccactttcc acacagacat cataacaaaa 241 aatttnccac caaacccccc cnnntccccc cgcttctggc cacagcactt aaacacatct 301 ctgccaaacc ccaaaaacaa agaaccctaa caccagccta accagatttc aaattttt // LOCUS HUMMTVJ1 365 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 9,10) mitochondrial DNA sequences, 5' end. ACCESSION M28903 KEYWORDS mitochondrial DNA. SEGMENT 1 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 365) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 120 a 122 c 43 g 79 t 1 others ORIGIN 1 ttctttcatg gggaagcaga tttgggtacc acccaagtat tgactcaccc atcaacaacc 61 gctatgtatt tcgtacatta ctgccagcca ccatgaatat tgtacagtac cataaatact 121 tgaccacctg tagtacataa aaacccanat ccacatcaaa accctccccc catgcttaca 181 agcaagtaca gcaatcaacc ttcaactgtc acaatcaacc gcaactccaa agccacccct 241 cacccactag gataccaaca aacctaccca cccttaacag tacatagcac ataaagccat 301 ttaccgtaca tagcacatta cagtcaaatc ccttctcgtc cccatggatg acccccctca 361 gatag // LOCUS HUMMTVJ2 355 bp ds-DNA ORG 01-AUG-1990 DEFINITION Human (!Kung 9,10) mitochondrial DNA sequences, 3' end. ACCESSION M28904 KEYWORDS mitochondrial DNA. SEGMENT 2 of 2 SOURCE Human mitochondrial hair root DNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 355) AUTHORS Vigilant,L., Pennington,R., Harpending,H., Kocher,T.D. and Wilson,A.C. TITLE Mitochondrial DNA sequences in single hairs from a southern African population JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9350-9354 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Vigilant 06-OCT-1989. BASE COUNT 107 a 102 c 52 g 90 t 4 others ORIGIN 1 tattttcgtc tggggggtgt gcacgcgata gcattgcgag acgctggagc cggagcaccc 61 tatgtcgcag tatctgtctt tgattcctgc cccatcccat tatttatcgc acctacgttc 121 aatattacag gcgaacatac nctattaaag tgtgttaatt aattaatgct tgtaggacat 181 aataataaca attaaatgtc tgcacagcca ctttccacac agacatcata acaaaaaatt 241 tnccaccaaa ccccccccnn tccccccgct tctggccaca gcacttaaac acatctctgc 301 caaaccccaa aaacaaagaa ccctaacacc agcctaacca gatttcaaat tttat // LOCUS HUMLD78A 3176 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human cytokine LD78 alpha gene, complete cds. ACCESSION D90144 KEYWORDS LD78; LD78 alpha; cytokine; inducible gene family; secreted peptide. SOURCE Human blood lymphocyte DNA , clone Lm LD-3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3176) AUTHORS Nakao,M., Nomiyama,H. and Shimada,K. TITLE Structures of human genes coding for cytokine LD78 and their expression JOURNAL Mol. Cell. Biol. 10, 3646-3658 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hisayuki Nomiyama Department of Biochemistry Kumamoto University Medical School 2-2-1 Honjo, Kumamoto 860 Japan Phone: 096-344-2111 Fax: 096-372-6140 FEATURES from to/span description pept 1155 1227 cytokine LD78 alpha precursor, exon 1 1916 2030 cytokine LD78 alpha precursor, exon 2 2451 2541 cytokine LD78 alpha precursor, exon 3 sigp 1155 1219 cytokine LD78 alpha signal peptide matp 1220 1227 cytokine LD78 alpha mature peptide 1916 2030 cytokine LD78 alpha mature peptide 2451 2538 cytokine LD78 alpha mature peptide pre-msg 1069 2957 cytokine LD78 alpha mRNA and introns IVS 1228 1915 cytokine LD78 alpha intron A IVS 2031 2450 cytokine LD78 alpha intron B signal 1041 1045 TATA box BASE COUNT 833 a 741 c 752 g 850 t ORIGIN 1 acccagggac ctatcacaca aatataagaa ctattcattc tttaaggcat gtatttccaa 61 gcctttgtat ttttttccat gcttagggtt ggcaaggaat atatatatat ttgtacaaat 121 atatatgtgt atatgtacaa atacatgtat atatagtaca aatatatata tatatttgta 181 caattcttca gactttgtag aatttgtata atgtcgtatc ttgctttttt taaccactga 241 tgttataagc atatttatgc cacttcattc attttagaga cttaataata aatgatctag 301 tggataattt atcattccct gatggagaaa aatttagctt tgtttatttt agagttataa 361 acgatgctgg gtcaggtatc tttatgtttg aagatggctc catatttggg ttgtttccac 421 agaactcttt cctagaaatg ctttttctag gttaatggct acagatattt ctaggcacct 481 gacatattga cacccacctc taaagtattt ttatgatcca caactagcgt ttaacacagc 541 gccctagtca ctacatgact aataaataga caaatgactg aaacatgacc tcatgctttc 601 tattcctcca gctttcattc agttctttgc ctctgggagg aggaagggtt gtgcagccct 661 ccacagcatc agcccatcaa ccctatccct gtggttatag cagctgagga agcagaattg 721 cagctctgtg ggaaggaatg gggctggaga gttcatgcac agaccagttc ttatgagaag 781 ggactgacta agaatagcct tgggttgaca tatacccctc ttcacactca caggagaaac 841 catttcccta tgaaactata acaagtcatg agttgagagc tgagagttag agaatagctc 901 aaagatgcta ttcttggata tcctgagccc ctgtggtcac cagggaccct gagttgtgca 961 acttagcatg acagcatcac tacgcttaaa aatttccctc ctcaccccca gattccattt 1021 ccccatccgc cagggctgcc tataaagagg agagctggtt tcagacttca gaaggacacg 1081 ggcagcagac agtggtcagt cctttcttgg ctctgctgac actcgagccc acattccgtc 1141 acctgctcag aatcatgcag gtctccactg ctgcccttgc tgtcctcctc tgcaccatgg 1201 ctctctgcaa ccagttctct gcatcacgtg agtctgagtt tcgttgtggg tatcaccact 1261 ctctggccat ggttagacca catcaatctt ttcttgtggc ctaaaagccc ccaagagaaa 1321 agagaacttc ttaaagggct gccaaacatc ttggtctttc tctttaagac ttttattttt 1381 atctctagaa ggggtcttag ccccctagtc tccaggtatg agaatctagg caggggcagg 1441 ggagttacag tcccttttac agatagaaaa acagggttcg aaacgaatca gttagcaaga 1501 ggcagaatcc agggctgctt acttcccagt ggggtatgtt gttcactctc cagctcactc 1561 taggtctccc aggagctctg tcccttggat gtcttatgag agatgtccaa ggcttctctt 1621 gggttggggt atgacttctt gaaccagaca aaattccctg aagagaactg agataagaga 1681 acagtccgtt caggtatctg gatcacacag agaaacagag aacccactat gaagagtcaa 1741 ggagaaagaa ggatacagac agaaacaaag agacatttct cagcaaaaat gcccaaatgc 1801 cttccagtca cttggtctga gcaagcctgc cttcctcaac tgctcgggga tcagaagctg 1861 cctggccttt tcttctgagc tgtgactcgg gctcattctc ttcctttctc cacagttgct 1921 gctgacacgc cgaccgcctg ctgcttcagc tacacctccc ggcagattcc acagaatttc 1981 atagctgact actttgagac gagcagccag tgctccaagc ccggtgtcat gtaagtgcca 2041 gtcttcctgc tcacctctat ggaggtaggg agggtcaggg ttggggcaga gacaggccag 2101 aaggctatcc tggaaaggcc cagccttcag gagcctatcg gggatacagg acgcagggct 2161 ccgaggtgtg acctgacttg gagctggagt gaggcatgtg ttacagagtc aggaagggct 2221 gccccagccc agaggaaagg gacaggaaga aggaggcagc gggacactct gagggccacc 2281 cctactgagt cactgagaga agctctctag acagagatag gcagggggcc cctgaaagag 2341 gagcaagccc tgagctgccc aggacagaga gcagaatggt ggggccatgg tgggcccagg 2401 attcccctgc tggattcccc agtgcttaac tcttcctccc ttctccacag cttcctaacc 2461 aagcgaagcc ggcaggtctg tgctgacccc agtgaggagt gggtccagaa atatgtcagc 2521 gacctggagc tgagtgcctg aggggtccag aagcttcgag gcccagcgac ctcggtgggc 2581 ccagtgggga ggagcaggag cctgagcctt gggaacatgc gtgtgacctc cacagctacc 2641 tcttctatgg actggttgtt gccaaacagc cacactgtgg gactcttctt aacttaaatt 2701 ttaatttatt tatactattt agtttttgta atttattttc gatttcacag tgtgtttgtg 2761 attgtttgct ctgagagttc ccctgtcccc tcccccttcc ctcacaccgc gtctggtgac 2821 aaccgagtgg ctgtcatcag cctgtgtagg cagtcatggc accaaagcca ccagactgac 2881 aaatgtgtat cggatgcttt tgttcagggc tgtgatcggc ctggggaaat aataaagatg 2941 ctcttttaaa aggtaaacca gtattgagtt tggttttgtt tttctggcaa atcaaaatca 3001 ctggttaaga ggaatcatag gcaaagatta ggaagaggtg aaatggaggg aaattgggag 3061 agatggggag ggctaccaca gagttatcca ctttacaacg gagacacagt tctggaacat 3121 tgaaactacg aatatgttat aactcaaatc ataacatgca tgctctagga gaattc // LOCUS HUMLD78B 3112 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human cytokine LD78 beta gene. ACCESSION D90145 KEYWORDS LD78; LD78 beta; cytokine; inducible gene family; secreted peptide. SOURCE Human placenta DNA, clone Lm LD-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3112) AUTHORS Nakao,M., Nomiyama,H. and Shimada,K. TITLE Structures of human genes coding for cytokine LD78 and their expression JOURNAL Mol. Cell. Biol. 10, 3646-3658 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hisayuki Nomiyama Department of Biochemistry Kumamoto University Medical School 2-2-1 Honjo, Kumamoto 860 Japan Phone: 096-344-2111 Fax: 096-372-6140 FEATURES from to/span description pept 1192 1267 cytokine LD78 beta precursor, exon 1 1953 2067 cytokine LD78 beta precursor, exon 2 2488 2578 cytokine LD78 beta precursor, exon 3 sigp 1192 1259 cytokine LD78 beta signal peptide matp 1260 1267 cytokine LD78 beta mature peptide 1953 2067 cytokine LD78 beta mature peptide 2488 2575 cytokine LD78 beta mature peptide pre-msg 1106 2995 cytokine LD78 beta mRNA and introns IVS 1268 1952 cytokine LD78 intron A IVS 2068 2487 cytokine LD78 intron B rpt 498 797 Alu repeat signal 1078 1082 TATA box BASE COUNT 756 a 775 c 780 g 801 t ORIGIN 1 ttagagactt aataataaag gatcttgtgg ataatttatc attccctgat agagaaaaat 61 ttagctttgc ttattttaga gttataaatg atgctgggtc aggtatcttt atgtttgaag 121 atggctccat atttgggttg tttccacaga actctttccc agaaatgctt tttctaggtt 181 aatggctaca catatttcta ggcacctgac atactgacac ccacctctaa agtattttta 241 tgatccacaa ctagcgttta acacagcgcc ccagtcactc cgagactaat aaatagacaa 301 atgactgaaa cgtgacctca tgctttctat tcctccagct ttcattgagt tcctttcctc 361 tgggaggact gggggttgtc tagccctcca cagcatcagc ccattgaccc tatccttgtg 421 gttatagcag ctgaggaagc agaattacag ctctgtggga aggaatgggg ctggagagtt 481 catgcataga ccaattcttt tttttttttt tttttgagat ggagtttcac ttttgttgcc 541 caggctggag tgcaatggca tgatctcagc tcaccacagc ccccacctcc tgggttcaag 601 cgattctcct gccctcagcc tcccgagtag ctgggattac aggcatgtgc caccacgcct 661 gactactttt gtatttttag tagagatgga gtttctcttt cttggtcagg ttggtctcaa 721 actcctgacc tcaggtgatc cgcagcctcg gcctcccaaa gtgttgggat tacaggtgtg 781 agcgaccatg cctggctgca tagaccagtt cttatgagaa gggatcaact aagaatagcc 841 ttgggttgac acacacccct cttcacactc acaggagaaa ccccatgaag ctagaaccag 901 tcatgagttg agagctgaga gttagagagt agctcagaga tgctattctt ggatatcctg 961 agcccctgtg gtcaccaggg accctgagtt gtgcaacact cagcatgaca gcatcactac 1021 acttaaaaat ttccctcctc acccccagat tccatttccc catccgccag ggctgcctat 1081 aaagaggaga gatggcttca gacatcagaa ggacgcaggc agcaaagagt agtcagtccc 1141 ttcttggctc tgctgacact cgagcccaca ttccatcacc tgctcccaat catgcaggtc 1201 tccactgctg cccttgccgt cctcctctgc accatggctc tctgcaacca ggtcctctct 1261 gcaccacgtg agtccatgtt gttgttgtgg gtatcaccac tctctggcca tggttagacc 1321 acatcagtct ttttttgcgg cctgagagcc ccgaagagaa aagaaggaag ttcttaaagc 1381 gctgccaaac accttggtct ttttcttcac aacttttatt tttatctcta gaaggggtct 1441 tagccctcct agtctccagg tatgagaatc taggcagggg caggggagtt acagtccctt 1501 gtacagatag aaaaacaggg ttcaaaacga atcagtttgc aagaggcaga atccagggct 1561 gcttacttcc cagtggggtc tgttgttcac tctccagctc accctaggtc tcccaggagc 1621 cctgtccctt ggatgtctta tgagagatgt ccagggcttc tcttgggctg gggtatgact 1681 tcttgaaccg acaaaattcc atgaagagag ctaagagaac agtccattca ggtatctgga 1741 tcacatagag aaacagagaa cccactatga agagtcaagg ggaaagagga atatagacag 1801 aaacaaagag acatttctct gcaaaacccc ccaaatgcct tgcagtcact tggtctgagc 1861 aagcctgccc tcctcaacca ctcagggatc agaagctgcc tggccttttc ttctgagctg 1921 tgactcgggc ttattctctc ctttctccgc agttgctgct gacacgccga ccgcctgctg 1981 cttcagctac acctcccgac agattccaca gaatttcata gctgactact ttgagacgag 2041 cagccagtgc tccaagccca gtgtcatgta agtgccagtc ttcctgctca cctctaggga 2101 ggtagggagt gtcagggtgg gggcagaaac aggccagaag gccatcctgg aaaggcccag 2161 ccttcaggag cctatcgggg atacaggacg cagggcactg aggtgtgacc tgacttgggg 2221 ctggagtgag gtgggtgtta cagagtcagg aagggctgcc ccaggccaga ggaaaggaac 2281 aggaagaagg aggcagcagg acactctgag ggcccccttg cctggagtca ctgagagaag 2341 ctctctagac ggagataggc agggggcccc tgagagagga gcaggccttg agctgcccag 2401 gacagagagc aggatgtcag gccatggtgg gcccaggatt ccccggctgg attccccagt 2461 gcttaactct tcctcccttc tccacagctt cctaaccaag agaggccggc aggtctgtgc 2521 tgaccccagt gaggagtggg tccagaaata cgtcagtgac ctggagctga gtgcctgagg 2581 ggtccagaag cttcgaggcc cagcgacctc agtgggccca gtggggagga gcaggagcct 2641 gagccttggg aacatgcgtg tgacctctac agctacctct tctatggact ggttattgcc 2701 aaacagccac actgtgggac tcttcttaac ttaaatttta atttatttat actatttagt 2761 ttttataatt tatttttgat ttcacagtgt gtttgtgatt gtttgctctg agagttcccc 2821 ctgtcccctc caccttccct cacagtgtgt ctggtgacga ccgagtggct gtcatcggcc 2881 tgtgtaggca gtcatggcac caaagccacc agactgacaa atgtgtatca gatgcttttg 2941 ttcagggctg tgatcggcct ggggaaataa taaagatgtt cttttaaacg gtaaaccagt 3001 attgagtttg gttttgtttt tctggcaaat caaaatcact agttaagagg aatcataggc 3061 aaagattagg aagaggtgaa atggagggaa actgggagag atggggagcg ct // LOCUS XELTRH 1442 bp ss-mRNA VRT 01-AUG-1990 DEFINITION X.laevis thyrotropin releasing hormone (TRH) mRNA, complete cds. ACCESSION M34699 K00931 J05514 KEYWORDS thyrotropin releasing hormone. SOURCE X.laevis skin, cDNA to mRNA, clone L4 and 8/136. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 478) AUTHORS Richter,K., Kawashima,E., Egger,R. and Kreil,G. TITLE Biosynthesis of thyrotropin releasing hormone in the skin of Xenopus laevis: Partial sequence of the precursor deduced from cloned cDNA JOURNAL EMBO J. 3, 617-621 (1984) STANDARD full staff_review REFERENCE 2 (bases 15 to 1442) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. TITLE Two precursors of thyrotropin releasing hormone from skin of Xenopus laevis: Each contains seven copies of the end product JOURNAL J. Biol. Chem. 265, 11731-11733 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.Kreil, 18-MAY-1990. FEATURES from to/span description pept 110 793 thyrotropin releasing hormone precursor matp 332 340 thyrotropin releasing hormone copy 1 matp 374 382 thyrotropin releasing hormone copy 2 matp 428 436 thyrotropin releasing hormone copy 3 matp 470 478 thyrotropin releasing hormone copy 4 matp 566 574 thyrotropin releasing hormone copy 5 matp 611 619 thyrotropin releasing hormone copy 6 matp 686 694 thyrotropin releasing hormone copy 7 mRNA < 1 1442 TRH mRNA conflict 139 139 t in [2]; c in [1] conflict 214 216 tct in [2]; ctc in [1] conflict 319 319 g in [2]; t in [1] BASE COUNT 460 a 286 c 334 g 362 t ORIGIN 1 agcacagagc agcacaagga cacactctgc atattgtgct gccggacaag gaggtgacag 61 ccagtcaggc tgagacaaag gaacttccag acctctgaca gcaggaaaga tggtgtctgt 121 ctggtggttg ctgcttcttg gtacaaccgt atctcacatg gtgcacacac aagagcagcc 181 tttactggag gaggacacag caccattaga tgatctggat gttcttgaga aagccaaagg 241 tatcctgatc cgcagtatcc tggagggatt tcaagaaggg caacaaaaca atagagatct 301 accagatgca atggaaatga tatctaagcg ccagcaccca gggaaacgat tccaggagga 361 gatagaaaag agacaacacc ctggaaagag ggatctggaa gatctgaatc tagagctttc 421 caaaaggcaa caccccggaa gaagatttgt ggatgatgta gagaagaggc aacatccagg 481 aaagagagaa gagggtgact ggagtaggag gtatctgaca gatgactcac gttatttgga 541 cctcctttct gatgtttcca ggagacagca cccaggcaaa agagttccag ccccattgtt 601 tacaaaacgt caacacccag gtaagagagt gacagaagaa gagggtgata ctgaatttga 661 aaactcgaag gaagtgggga agcgccagca tccaggaaag agatatgacc cttgtgaagg 721 ccctaatgcc tacaactgta actcaggaaa cattctaccg gattctgtag aagaattgag 781 ttttgggctt taagctgccc agccccttta ttagttccat ctgaccctaa atgattccca 841 atgaacacaa ctttctataa ttgttaaata acattgtatt aagtatcata catttctgga 901 aagcaagcag ctcttagaac acttcttcgc tttaaaaggc acctggggca taagagtatt 961 aagcttcaga cagtaacctg cccaccacag ggagggattc aacaatcaca attggctgag 1021 tgttcctttc ccttgtttgg cagtgagatc agataataaa tataagatgg ccaggaaagt 1081 ggactctttc ttttctgaaa atttgcaagt aacaccaaaa tataataatt tgcacactca 1141 gtagtattaa cgtgaagatc tcaagaaggt tataaattct tggtgatctg ctcaaagcat 1201 ttaattcata gttgcttcca tggtttgatg gggaatgcac attctaaatt gcttattgct 1261 aattagcgct tgccacacag ttctggtggt agatcttgat gaggcatatt caataaaagt 1321 agagcccata gtaaaatttg tgccccgtca gctttaagga tcctctgtaa gcaatatgtg 1381 ttgtgagggc cacttgtttc taaagtaata ttttcatttt aataaatatg tctactcaaa 1441 tg // LOCUS XELTRHA 2955 bp ss-mRNA VRT 01-AUG-1990 DEFINITION X.laevis thyrotropin releasing hormone mRNA, complete cds. ACCESSION M34698 J05514 KEYWORDS thyrotropin releasing hormone. SOURCE X.laevis, cDNA to mRNA, clone C6. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (sites) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. TITLE Two precursors of thyrotropin releasing hormone from skin of Xenopus laevis: Each contains seven copies of the end product JOURNAL J. Biol. Chem. 265, 11731-11733 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2955; for [1]) AUTHORS Kuchler,K., Richter,K., Trnovsky,J., Egger,R. and Kreil,G. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by G.Kreil, 18-MAY-1990. FEATURES from to/span description pept 157 831 thyrotropin releasing hormone matp 379 387 thyrotropin releasing hormone copy 1 matp 421 429 thyrotropin releasing hormone copy 2 matp 475 483 thyrotropin releasing hormone copy 3 matp 517 525 thyrotropin releasing hormone copy 4 matp 613 621 thyrotropin releasing hormone copy 5 matp 658 666 thyrotropin releasing hormone copy 6 matp 733 741 thyrotropin releasing hormone copy 7 BASE COUNT 927 a 597 c 604 g 827 t ORIGIN 1 catgcagttt attagatata cagtacaatg aagtcagtta tgagaaatag caattgcagc 61 acaaggacac actctgcata ttgtgctgcc ggacaaggag gtgacagcca gtcaggctga 121 gacaaaggaa cttccagacc tctgacagca ggaaagatgg tgtctgtctg gtggttgctg 181 cttcttggta caaccgtatc tcacatggtg cacacacaag agcagccttt actggaggag 241 gacacagcac cattagatga tctggatgtt cttgagaaag ccaaaggtat cctgatccgc 301 agtatcctgg agggatttca agaagggcaa caaaacaata gagatctacc agatgcaatg 361 gaaatgatat ctaagcgcca gcacccaggg aaacgattcc aggaggagat agaaaagaga 421 caacaccctg gaaagaggga tctggaagat ctgaatctag agctttccaa aaggcaacac 481 cccggaagaa gatttgtgga tgatgtagag aagaggcaac atccaggaaa gagagaagag 541 ggtgactgga gtaggaggta tctgacagat gactcacgtt atttggacct cctttctgat 601 gtttccagga gacagcaccc aggcaaaaga gttccagccc cattgtttac aaaacgtcaa 661 cacccaggta agagagtgac agaagaagag ggtgatactg aatttgaaaa ctcgaaggaa 721 gtggggaagc gccagcatcc aggaaagaga tatgaccctt gtgaaggccc taatgcctac 781 aactgtaact caggaaacat tctaccggaa gaattgagtt ttgggcttta agctgcccag 841 cccctttatt agttccatct gaccctaaat gattcccaat gaacacaact ttctataatt 901 gttaaataac attgtattaa gtatcataca tttctggaaa gcaagcagct cttagaacac 961 ttcttcgctt taaaaggcac ctggggcata agagtattaa gcttcagaca gtaacctgcc 1021 caccacaggg agggattcaa caatcacaat tggctgagtg ttcctttccc ttgtttggca 1081 gtgagatcag ataaataaat ataagatggc caggaaagtg gactctttct tttctgaaaa 1141 tttgcaagta acaccaaaat ataataattt tgcactctgc agtgtattaa cgtgaagatc 1201 tcaagaaggt tataaattag gttataaatt cttggtgatc tgctcaaagc atttaattca 1261 tagttgcttc catggtttga tggggaatgc acattctaaa ttgcttattg ctaattagcg 1321 cttgccacac agttctggtg gtagatcttg atgaggcata ttcaataaaa gtagagccca 1381 tagtaaaatt tgtgccccgt cagctttaag gatcctctgt aagcaatatg tgttgtgagg 1441 gccacttgtt tctaaagtaa tattttcatt ttaataaata tgtctactca aatgacaaaa 1501 acattcatta tttcactaca ttatactcct tcccacagca attatgtacc tatgaatcct 1561 gatagaagac tgcagttttc ctcttatatc ctccatgttg gattcaccat aagtcaccaa 1621 aatatatcta tagggaagca cactatacac aatagcagtg acccccatcc agtggcttgt 1681 gggcaacaag ctactcacca acccccttgg ctgttgctcc cagtggccct aaagtaaggt 1741 gcataaaaaa accagatgaa cttgtcaaaa agagcctccc ttagactgcc ttgttccaca 1801 tagaggctac catatagcca atcacagccc ttatttggca cccccgggaa cttttttcat 1861 gcttgagttg ctccccaaat ctttttacag ttgaatatgt ctcatggcta aaaaaacgtg 1921 aggaccccgg cgtaatatag tataatatac acacactcac tttggaaaac tctatggaga 1981 tcaataagca cttttgggtt aaactatttt tttgatacaa tttgagcact ttatatatgg 2041 attttaaaga tattccgctt tagtagtctg tggtgcgctg ccccataaat atattggtga 2101 attattcacc acctactctt aacaattctg ctcaattcat ctagatgtta acataataca 2161 tcaccagtat cacaatggca gcgggaagca aagacattct gtagtgtcct gagaccagct 2221 aaagcctaga ggtggaccat aaataatgtc tattgcaggg tcagtacaaa caaaaacacc 2281 aaggctgctt tatacaaggc atatctaatt tgcaggtatt ttgctgaact attactccac 2341 acacaaagct tgagggacac agactaataa tctgctgaag gtttgcagga tggacagttg 2401 gacactgctt tgcttcaact ttattctagg cttgtgctct gatgtatgca gcgtcaaata 2461 ccagctgttg tttgactaca actcccagaa gcctcagcat actgagggtg gtatgcttga 2521 atgcttgaat gcttgaatac cgaaggctgt ctgtcctcca acacctcccg ttgatctccc 2581 gctccagctc ttattgtcat tccattgtat attttgtttt taaatgtata aagaaataaa 2641 aaaaaagtat gatatattca cccttcttct tctgagtata aaaagattta aatgaatgtg 2701 aaaataatat ttttatagac aacaatcttt gtgcagtgtt ggtaaataca tgtttattct 2761 gtatatagct attttaatat gcatactgaa agaatatata tatataataa gaagcatgaa 2821 catctcattg cctgggtatg aaacaataaa gattgcatct gataatgaag caaattcgct 2881 ctgtggcgca gtattatgtt gacctgatga tgaagttagg tctggtgcgc ttctcaatgt 2941 tcgtggcgct ggccc // LOCUS PVICSD 1107 bp ds-DNA INV 01-AUG-1990 DEFINITION P.vivax circumsporozoite protein gene, complete cds. ACCESSION M34697 KEYWORDS circumsporozoite protein. SOURCE P.vivax (strain Thai; isolate NYU Thai) sporozoite DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 286 to 798) AUTHORS Arnot,D.E., Stewart,M.J. and Barnwell,J.W. TITLE Antigenic diversity in Thai Plasmodium vivax circumsporozoite proteins JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 285; 799 to 1107) AUTHORS Arnot,D.E., Stewart,M.J. and Barnwell,J.W. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Arnot, 18-MAY-1990. The bases in reference [2] are identical to bases 140 to 442 and 995 to 1294 of the sequence of the North Korean strain published in Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106; accession number M20671. Author address: D.E.Arnot Dept. of Genetics University of Edinburgh West Mains Rd., Edingburgh, EM93JM Scotland FEATURES from to/span description pept 1 1107 circumsporozoite protein BASE COUNT 382 a 212 c 338 g 175 t ORIGIN 1 atgaagaact tcattctctt ggctgtttct tccatcctgt tggtggactt gttccccacg 61 cactgcgggc acaatgtaga tctgtccaag gccataaatt taaatggagt aaacttcaat 121 aatgtagacg ccagttcact tggcgcggca cacgtaggac aaagtgctag ccgaggcaga 181 ggacttggtg agaacccaga tgacgaggaa ggagatgcta aaaaaaaaaa ggatggaaag 241 aaagcagaac caaaaaatcc acgtgaaaat aagctgaaac aaccaggaga cagagcagat 301 ggacagccag caggagacag agcagatgga cagccagcag gtgatagagc agatggacaa 361 ccagcaggtg atagagctgg acagccagca ggagatagag cagatggaca gccagcagga 421 gacagagcag atggacagcc agcaggagac agagcagatg gacagccagc aggagacaga 481 gcagatggac agccagcagg tgacagagct ggacaaccag caggtgatag agctggacag 541 ccagcaggcg atagagcaga tggacagcca gcaggagata gagctggaca gccagcaggc 601 gatagagcag atggacagcc agcaggagat agagctggac aaccagcagg agatagagca 661 gatggacaac cagcaggaga tagagctgga cagccagcag gagatagagc tggacagcca 721 gcaggagata gagctggaca gccagcagga gatagagctg gacagccagc aggaaatggt 781 gcaggtggac aggcagcagg aggaaacgca ggaggacagg gacaaaataa tgaaggtgcg 841 aatgccccaa atgaaaagtc tgtgaaagaa tacctagata aagttagagc taccgttggc 901 accgaatgga ctccatgcag tgtaacctgt ggagtgggtg taagagtcag aagaagagtt 961 aatgcagcta acaaaaaacc agaggatctt actttgaatg accttgagac tgatgtttgt 1021 acaatggata agtgtgctgg catatttaac gttgtgagta attcattagg gctagtcata 1081 ttgttagtcc tagcattatt caattaa // LOCUS ATTRRA 119 bp ss-RNA RNA 01-AUG-1990 DEFINITION A.solani 5S rRNA. ACCESSION M35573 KEYWORDS 5S ribosomal RNA. SOURCE A.solani (strain CBS 277-32) 5S rRNA. ORGANISM Atractiella solani Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Auriculariales; Auriculariaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 119 5S rRNA BASE COUNT 29 a 30 c 34 g 26 t ORIGIN 1 aggtgcgacc ataccgtgtt gaaaattctg catcccgtcc gatctgcaaa gacaagcaac 61 acagggccca gtcagtagtg cggtgggtga ccacgtgcga atactgtggt gttgcactt // LOCUS CETRRA 118 bp ss-RNA RNA 01-AUG-1990 DEFINITION C.cornigerum 5S rRNA. ACCESSION M35577 KEYWORDS 5S ribosomal RNA. SOURCE C.cornigerum (strain FO 29225) 5S rRNA. ORGANISM Ceratobasidium cornigerum Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Metabasidiomycetidae; Metatremellales; Ceratobasidiaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 118 5S rRNA BASE COUNT 23 a 35 c 37 g 23 t ORIGIN 1 atccacggcc ataggacttc gaaagcaccg catcccgtcc gatctgcgca gttaaccgga 61 gtgccgccta gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt // LOCUS GRARRA 118 bp ss-RNA RNA 01-AUG-1990 DEFINITION G.phoenicis 5S rRNA. ACCESSION M35575 KEYWORDS 5S ribosomal RNA. SOURCE G.phoenicis (strain PB 4349) 5S rRNA. ORGANISM Graphiola phoenicis Eukaryota; Plantae; Thallobionta; Basidiomycotina; Teliomycetes; Ustilaginales; Graphiolaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 118 5S rRNA BASE COUNT 26 a 33 c 36 g 23 t ORIGIN 1 atctgcggcc atagaaccgt gaaaataccg catcccgtcc gatctgcgaa gtcaagcacg 61 gtatcgccta gtcagtactg cggtggggga ccacgcggga atcctgggtg ctgcagtt // LOCUS PLARRA 119 bp ss-RNA RNA 01-AUG-1990 DEFINITION P.peniophorae 5S rRNA. ACCESSION M35571 KEYWORDS 5S ribosomal RNA. SOURCE P.peniophorae (strain FO 22315) 5S rRNA. ORGANISM Platygloea peniophorae Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Auriculariales; Auriculariaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 119 5S rRNA BASE COUNT 27 a 35 c 36 g 21 t ORIGIN 1 atctgcggcc ataccgtgat gaacattccg cgtcccgtcc gatccgcgca gacaagcatc 61 acaggggcca gagagtattg acgtgggtga ccagtcgaga acactgtgct gccgcaggt // LOCUS PLERRA 119 bp ss-RNA RNA 01-AUG-1990 DEFINITION P.faginea 5S rRNA. ACCESSION M35574 KEYWORDS 5S ribosomal RNA. SOURCE P.faginea (strain FO 22315) 5S rRNA. ORGANISM Phleogena faginea Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Eutremellales; Phleogenaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 119 5S rRNA BASE COUNT 28 a 30 c 34 g 27 t ORIGIN 1 atgtgcgacc ataccaagct gaaaatactg catcccgtct gatctgcaca gtcaagcagc 61 ttagggccca gtcagtagtg cggtggggga ccatgcgcga acattgtggt gttgcactt // LOCUS SEPRRA 119 bp ss-RNA RNA 01-AUG-1990 DEFINITION S.carestianum 5S rRNA. ACCESSION M35572 KEYWORDS 5S ribosomal RNA. SOURCE S.carestianum (strain FO 25109) 5S rRNA. ORGANISM Septobasidium carestianum Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Septobasidiales; Septobasidiaceae. REFERENCE 1 (bases 1 to 119) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 119 5S rRNA BASE COUNT 25 a 37 c 36 g 21 t ORIGIN 1 atctggggcc ataccacagt gaacacaccg catcccgtcc gatctgcgca gttaaccact 61 gtagggccga gtcagtagtg cggtggggga ccacgcgcga atactctggt gccccaggt // LOCUS TULRRA 118 bp ss-RNA RNA 01-AUG-1990 DEFINITION T.violea 5S rRNA. ACCESSION M35576 KEYWORDS 5S ribosomal RNA. SOURCE T.violea (strain FO 29326) 5S rRNA. ORGANISM Tulasnella violea Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Tremellales; Tulasnellaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 118 5S rRNA BASE COUNT 30 a 29 c 31 g 28 t ORIGIN 1 atcttcggcc ataggacaga gaaaataccg catcccgtcc gatctgcgca gtcaagctct 61 gtaccgctta gttagtacca tagtggggga ccatatggga atcctgagtg ctgaagtt // LOCUS UTHRRA 118 bp ss-RNA RNA 01-AUG-1990 DEFINITION U.fusisporum 5S rRNA. ACCESSION M35578 KEYWORDS 5S ribosomal RNA. SOURCE U.fusisporum (strain FO 25106) 5S rRNA. ORGANISM Uthatobasidium fusisporum Eukaryota; Plantae; Thallobionta; Basidiomycotina; Phragmobasidiomycetes; Heterobasidiomycetidae; Tremellales; Tulasnellaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Blanz,P.A. and Gottschalk,M. TITLE Systematic position of Septobasidium, Graphiola and other basidiomycetes as deduced on the basis of their 5S ribosomal RNA nucleotide sequences JOURNAL Syst. Appl. Microbiol. 8, 121-127 (1986) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 > 118 5S rRNA BASE COUNT 23 a 35 c 37 g 23 t ORIGIN 1 atccacggcc ataggacttc gaaagcaccg catcccgtcc gatctgcgca gttaaccgga 61 gtgccgccta gttagtacca cggtggggga ccacgcggga atcctgggtg ctgtggtt // LOCUS C11CMIA 2149 bp ds-DNA BCT 01-AUG-1990 DEFINITION Plasmid pColBM-C1139 colicin lysis protein (cmi) gene, 5' end. ACCESSION M35683 KEYWORDS colicin lysis protein. SOURCE Plasmid pColBM-C1139 DNA. ORGANISM Plasmid pColBM-C1139 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 2149) AUTHORS Thumm,G., Oelschlaeger,T. and Braun,V. TITLE Plasmid pColBM-C1139 does not encode a colicin lysis protein but contains sequences highly homologous to the D protein (resolvase) and the oriV region of the miniF plasmid JOURNAL Plasmid 20, 75-82 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1108 1890 ORF pept 1991 > 2149 colicin lysis protein (cmi) BASE COUNT 512 a 510 c 578 g 549 t ORIGIN 1 gaattcatct tttggccgtt tacgtctgtt ccgttatcct gatgatacga tgttctgcac 61 gttctgccgg gaagatgcag atgattcgct taaaagtatt atgacccatc tctgggagct 121 ggatgcagag atgacagatc ctgtcatagc tatgtttaat cacgtctgag tgccgtgagt 181 gatttctgtc ttttatgcaa cagtgccaag atattgtaat caaaaaaaag cattaatgca 241 ttttggacag taatctattt taattgatga catagaggca ttaatctttc tttttcttca 301 ggaagatccg aaaactcctg gtcacggatc ttcctctccc ccacacaacg ccacctcctg 361 taagcacaac atgtggtgcc ggattcagct gctgatgaca ctatatgttg tgtcatctcc 421 ctgacctgtg atgcgtcgcg caggggcgga aaacagcgat atgatgattt cctcggcgtg 481 gtacacttcc ggaaagttgt gatattccgg aaagtcggat ctgacggaaa cggctctccg 541 gtaatttaac ggcgtggtta tatggatgct tgttatcatg gtgatgatga taacggcatg 601 atgttatcag acggcgtgac ggtaagggca gtgatgatgg atgacgttat cgcatgaccg 661 tccctgcccg gaaaagaaaa aaggagtcac ccatgttttt tattgagaat gaaggtcagg 721 ctgtcgccgg aacggattac tggcagtctg tacaggcgca ggccggatat gtctacctca 781 gctggaatgc cggcgcagcc aggctgcttg tcccggatgc ggcaaaacat ttactcaggg 841 agatgcgggg ggctgagtac gtcatcatca gtaagggagc actgcatggc cgcgatgcgc 901 tggaactggt atttgaagac ggcagcgatg cgccgtttgt gatccacatg ctgagtgagc 961 agtgcgatcg cctgctcccc gaaaacaacc agggaggggg ttttgttgtc accgtctgga 1021 cgcgtggcgg taaccagctc cgttatccgg gaaagtaccg ggttgtggaa aacctgcccg 1081 acgtttcccc gtggagtgaa cactgatatg cagcacctgc cggcaccgat ccaccatgcc 1141 cgggatgctg ttcagcttcc tgttgccatc gattatccgg cagcgctggc actccgccag 1201 atgtcgatgg ttcatgatga actgcccaaa tacctgctgg cccctgaagt gagcgccctg 1261 ctccattacg tcccggatct gcgccgcaag atgctgctgg ccacactgtg gaacaccggt 1321 gcgcgcatta atgaagcact ggcgctgacg cggggggatt tttcgctcac gcctccgtat 1381 ccgtttgtgc agctggccac tctgaagcag cggacagaaa aagccgccag gacggcagga 1441 agaatgcccg ccggtcagca gactcaccgg ctggttccgc tctccgactc ctggtacgtc 1501 agccagctgc agacgatggt agccacactg aaaatcccca tggaacggcg taataaacga 1561 acaggcagga cagagaaagc gcggatctgg gaagtgacgg acagaacggt caggacctgg 1621 attggggagg cggttgccgc cgctgccgct gatggtgtga cgttctctgt cccggtcacg 1681 ccacatacgt tccgccattc ctatgcgatg cacatgctgt atgccggtat accgcttaag 1741 gttctgcaga gtctgatggg gcataagtcc atcagctcaa cagaggtcta cacgaaggtg 1801 tttgcactgg atgtggctgc acggcaccgg gtgcagtttt cgatgcctga gtccgatgcg 1861 gtcacaatgc tgaaaaacag acatgcataa taagtcacaa ttatgaattg tgatttcttc 1921 tataaaaaag agaccactgc aatatgtgat ctcttgtatt atttcataat tgttaaagcc 1981 acttcacagt atgctcacat tgtacggata tattcgtaat gtttttttat atcgaatgaa 2041 cgacagaagt tgtggagatt ttatgaaagt aattagcatg aaatttattt ttattttaac 2101 gattattgct cttgctgctg tttttttctg gtctgaagat aaaggtccg // LOCUS DOGPPPP 427 bp ss-mRNA MAM 01-AUG-1990 DEFINITION Canine pancreatic polypeptide mRNA, complete cds. ACCESSION M35596 KEYWORDS pancreatic polypeptide. SOURCE Canine pancreas, cDNA to mRNA. ORGANISM Canis lupus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 427) AUTHORS Toothman,P. and Paquette,T.L. TITLE Canine pancreatic polypeptide complementary deoxyribonucleic acid sequence: Pancreatic polypeptide and insulin messenger ribonucleic acid distribution in the lobes of the pancreas JOURNAL Mol. Endocrinol. 1, 413-419 (1987) STANDARD simple staff_review FEATURES from to/span description pept 21 302 pancreatic polypeptide precursor sigp 21 107 pancreatic polypeptide signal peptide matp 108 215 pancreatic polypeptide matp 225 284 icosapeptide mRNA 1 427 pancreatic polypeptide mRNA BASE COUNT 88 a 149 c 115 g 75 t ORIGIN 1 tccgcccctt aggactcggg atgcctgccg cctgccgctg cctcttcctg ctgctcctgt 61 cagcctgtgt ggctctgttg ctgcagccgc cactgggtac ccggggggcc ccgctggagc 121 cagtgtatcc gggggacgat gccacaccag agcagatggc ccagtacgcg gctgagctcc 181 gcagatacat caacatgctg accaggccca ggtatgggaa aagagacaga ggagaaatgc 241 gggacatcct ggaatggggc tccccccatg cagccgcccc cagggagctg atggacgagt 301 aatgccacct ccaagtaatg ccacctctgc ctctcaggcc aatgccagcc tacctctccc 361 ctctgcaccc ctggccaaag cttgctccct gctctcacac acagactaaa taaagcaagt 421 caaagtc // LOCUS GVICG 296 bp ss-RNA circular VRL 01-AUG-1990 DEFINITION Grapevine viroid grapevine isolate (SHV-g(GV)) complete genome. ACCESSION M35717 KEYWORDS complete genome. SOURCE Grapevine viroid RNA. ORGANISM Grapevine viroid Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 296) AUTHORS Sano,T., Ohshima,K., Hataya,T., Uyeda,I., Shikata,E., Chou,T.-G., Meshi,T. and Okada,Y. TITLE A viroid resembling hop stunt viroid in grapevines from Europe, the United States and Japan JOURNAL J. Gen. Virol. 67, 1673-1678 (1986) STANDARD simple staff_review BASE COUNT 60 a 87 c 80 g 69 t ORIGIN 1 ctggggaatt ctcgagttgc cgcatcaggc aagcaaagaa aaaacaaggc agggaggtac 61 ttacctgaga aaggagcccc ggggcaactc ttctcagaat ccagcgagag gcgtggagag 121 agggccgcgg tgctctggag tagaggctct gcttcagaac accatcgatc gtcccttctt 181 ctttaccttc ttctggctct tccgatgaga cgcgaccggt ggcatcacct ctcggttcgt 241 cccaacctgc tttttgtcta tctgagcctc tgccgcggat cctctcttga gcccct // LOCUS HUMTCAJK 94 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor germline J-alpha RP DNA, partial cds. ACCESSION M35619 KEYWORDS T-cell receptor alpha-chain; antigen receptor; germline; joining exon. SOURCE Human T-cell line RPMI 8402 DNA, clone lambda-R15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 94) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 32 / 92 T-cell receptor germline J-alpha RP region (AA at 32) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" IVS 93 > 94 TCR intron signal 2 10 nonamer recombination signal signal 23 29 heptamer recombination signal BASE COUNT 25 a 23 c 22 g 24 t ORIGIN 1 aggtttctgt tatgaagcat ctcacagtgt aaataccggc actgccagta aactcacctt 61 tgggactgga acaagacttc aggtcacgct cggt // LOCUS HUMTCAJM 80 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor unproductively rearranged J-alpha AA/J-alpha AB DNA pseudogene, partial cds. ACCESSION M35621 KEYWORDS T-cell receptor alpha-chain; antigen receptor; joining exon; processed gene; pseudogene. SOURCE Human cell line AT5-B1 tumor DNA, clone lambda-A30. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 80) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept.ps / 30 > 78 T-cell receptor unproductively rearranged J-alpha AA/J-alpha AB region (AA at 30) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" recomb 26 27 J-alpha AA end/J-alpha AB start signal 7 15 nonamer recombination signal BASE COUNT 23 a 15 c 18 g 24 t ORIGIN 1 tatgttggtt tatgtagaga cacatataga ccgacaagct catctttggg actgggacca 61 gattacaagt ctttccaagt // LOCUS HUMTCAZI 520 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor productively rearranged V-alpha-J-alpha DNA, exons 1 and 2. ACCESSION M35617 KEYWORDS T-cell receptor alpha-chain; antigen receptor; joining exon; processed gene; variable region. SOURCE Human T-cell line RPMI 8402 DNA, clone lambda-R10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 520) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept 11 56 T-cell receptor V-alpha-J-alpha region, exon 1 /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" 166 / 513 T-cell receptor V-alpha-J-alpha region, exon 2 IVS 57 165 T-cell receptor intron A IVS 514 > 520 T-cell receptor intron B BASE COUNT 115 a 129 c 122 g 154 t ORIGIN 1 ttgctcagcc atgctcctgg agcttatccc actgctgggg atacattttg tcctgagtga 61 gtaaaaattt ctttatggtc tctagttcca caggttctga ctagaaatgc ttgcttttta 121 tactgagtct gcactgcttt cactgatagt acgttgtttt tccaggaact gccagagccc 181 agtcagtgac ccagcctgac atccacatca ctgtctctga aggagcctca ctggagttga 241 gatgtaacta ttcctatggg gcaacacctt atctcttctg gtatgtccag tcccccggcc 301 aaggcctcca gctgctcctg aagtactttt caggagacac tctggttcaa ggcattaaag 361 gctttgaggc tgaatttaag aggagtcaat cttccttcaa cctgaggaaa ccctctgtgc 421 attggagtga tgctgctgag tacttctgtg ctgtggttgg cactgccagt aaactcacct 481 ttgggactgg aacaagactt caggtcacgc tcggtaggta // LOCUS HUMTCAZJ 130 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor unproductively rearranged J-alpha RX/J-alpha RP DNA, partial cds. ACCESSION M35618 KEYWORDS T-cell receptor alpha-chain; antigen receptor; joining exon; processed gene. SOURCE Human T-cell line RPMI 8402 DNA, clone lambda-R15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 130) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept.ps / 71 / 127 T-cell receptor J-alpha RP region (AA at 71) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" pept.ps / 68 / 9 (c) T-cell receptor J-alpha RX (AA at 68) recomb 69 70 J-alpha RX end/J-alpha RP start BASE COUNT 29 a 42 c 22 g 37 t ORIGIN 1 tttaaagata gcttcactct cacttgcgtc cccattccaa atgtaaattt cctgtttccc 61 cccctccgtt accggcactg ccagtaaact cacctttggg actggaacaa gacttcaggt 121 cacgctcggt // LOCUS HUMTCAZL 97 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor germline J-alpha AA DNA, partial cds. ACCESSION M35620 KEYWORDS T-cell receptor alpha-chain; antigen receptor; germline; joining exon. SOURCE Human cell line AT5-B1 tumor DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 97) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 35 / 95 T-cell receptor germline J-alpha RP region (AA at 35) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" IVS 96 > 97 TCR intron signal 7 15 nonamer recombination signal signal 28 34 heptamer recombination signal BASE COUNT 31 a 19 c 21 g 26 t ORIGIN 1 tatgttggtt tatgtagaga cacataacac tgtgactacc tcaggaacct acaaatacat 61 ctttggaaca ggcaccaggc tgaaggtttt agcaagt // LOCUS HUMTCAZN 89 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human T-cell receptor germline J-alpha AB DNA, partial cds. ACCESSION M35622 KEYWORDS T-cell receptor alpha-chain; antigen receptor; germline; joining exon. SOURCE Human cell line AT5-B1 tumor DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 89) AUTHORS Baer,R., Boehm,T., Yssel,H., Spits,H. and Rabbitts,T.H. TITLE Complex rearrangements within the human J-delta-C-delta/J-alpha-C- alpha locus and aberrant recombination between J-alpha segments JOURNAL EMBO J. 7, 1661-1668 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 30 > 87 T-cell receptor germline J-alpha RP region (AA at 30) /hgml_locus_uid="LX0123X" /nomgen="TCRA" /map="14q11.2" IVS 88 > 89 TCR intron signal 2 10 nonamer recombination signal signal 23 29 heptamer recombination signal BASE COUNT 23 a 19 c 18 g 29 t ORIGIN 1 aggtttttgt agatctcagt atcactgtgt cttataacac cgacaagctc atctttggga 61 ctgggaccag attacaagtc tttccaagt // LOCUS MUSBMTA 141 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end. ACCESSION M35719 KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit. SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C. TITLE Expression of the beta-subunit gene of Murine thyrotropin results in multiple messenger ribonucleic acid species which are generated by alternative exon splicing JOURNAL Mol. Endocrinol. 1, 875-883 (1987) STANDARD simple staff_review FEATURES from to/span description pept 118 > 141 thyrotropin beta-subunit BASE COUNT 40 a 30 c 39 g 32 t ORIGIN 1 agcagtaact cactcatgca aagtaagatc ctgcagtagt gggtggagaa gactgagcgc 61 atacgagtgg agagaaaaat attctgcttc agtcaagagc tggggttgtt caaaagcatg 121 agtgctgccg tcctcctctc c // LOCUS MUSBMTB 99 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end. ACCESSION M35720 KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit. SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 99) AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C. TITLE Expression of the beta-subunit gene of Murine thyrotropin results in multiple messenger ribonucleic acid species which are generated by alternative exon splicing JOURNAL Mol. Endocrinol. 1, 875-883 (1987) STANDARD simple staff_review FEATURES from to/span description pept 76 > 99 thyrotropin beta-subunit BASE COUNT 26 a 24 c 29 g 20 t ORIGIN 1 agcagtaact cactcatgca aagtaagatc ctgcagtagt gggtggagaa gagtgaccgc 61 atacgagtgg agagcatgag tgctgccgtc ctcctctcc // LOCUS MUSBMTC 93 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end. ACCESSION M35721 KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit. SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 93) AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C. TITLE Expression of the beta-subunit gene of Murine thyrotropin results in multiple messenger ribonucleic acid species which are generated by alternative exon splicing JOURNAL Mol. Endocrinol. 1, 875-883 (1987) STANDARD simple staff_review FEATURES from to/span description pept 70 > 93 thyrotropin beta-subunit BASE COUNT 26 a 22 c 21 g 24 t ORIGIN 1 agcagtaact cactcatgca aagtaagaaa aatattctgc ttcagtgaag agctggggtt 61 gttcaaagca tgagtgctgc cgtcctcctc tcc // LOCUS MUSBMTD 52 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse thyrotropin beta-subunit mRNA, 5' end. ACCESSION M35723 KEYWORDS thyroid stimulating hormone; thyrotropin beta-subunit. SOURCE Mouse (strain LAF-1) male tumor TtT97, cDNA to mRNA, clone 25-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 52) AUTHORS Wood,W.M., Gordon,D.F. and Ridgway,E.C. TITLE Expression of the beta-subunit gene of Murine thyrotropin results in multiple messenger ribonucleic acid species which are generated by alternative exon splicing JOURNAL Mol. Endocrinol. 1, 875-883 (1987) STANDARD simple staff_review FEATURES from to/span description pept 29 > 52 thyrotropin beta-subunit BASE COUNT 13 a 17 c 10 g 12 t ORIGIN 1 agcagtaact cactcatgca aagtaagcat gagtgctgcc gtcctcctct cc // LOCUS MUSIGKCSU 444 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse Ig aberrantly rearranged kappa-chain mRNA V-J2-C-region, complete cds. ACCESSION M35669 KEYWORDS constant region; immunoglobulin light chain; joining exon; kappa-immunoglobulin; variable region. SOURCE Mouse myeloma MOPC-21, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 444) AUTHORS Carroll,W.L., Mendel,E. and Levy,S. TITLE Hybridoma fusion cell lines contain an aberrant kappa transcript JOURNAL Mol. Immunol. 25, 991-995 (1988) STANDARD simple staff_review FEATURES from to/span description pept 28 414 Ig kappa-chain V-J2-C-region precursor sigp 28 87 Ig kappa-chain V-J2-C-region signal peptide matp 88 411 Ig kappa-chain V-J2-C-region recomb 380 381 V-region end/J2-region start recomb 411 412 J2-region end/C-region start BASE COUNT 108 a 122 c 111 g 103 t ORIGIN Chromosome 6. 1 cagcatcctc tcttccagct ctcagagatg gagacagaca cactcctgtt atgggtactg 61 ctgctctggg ttccaggttc cactggtgac attgtgctga cacagtctcc tgcttcctta 121 gctgtatctc tggggcagag ggccaccatc tcatacaggg ccagcaaaag tgtcagtaca 181 tctggctata gttatatgca ctggaaccaa cagaaaccag gacagccacc cagactcctc 241 atctatcttg tatccaacct agaatctggg gtccctgcca ggttcagtgg cagtgggtct 301 gggacagact tcaccctcaa catccatcct gtggaggagg aggatgctgc aacctattac 361 tgtcagcaca ttagggagct tacacgttcg gaggggggac caagctggaa ataaaacggg 421 ctgatgctgc accaactgta tcca // LOCUS MUSLACPI 844 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse placental lactogen I (mPL-I) mRNA, complete cds. ACCESSION M35662 KEYWORDS placental lactogen I. SOURCE Mouse (strain Swiss-Webster) day 10 placenta, cDNA to mRNA, clone 1.5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 844) AUTHORS Colosi,P., Talamantes,F. and Linzer,D.I.H. TITLE Molecular cloning and expression of mouse placental lactogen I complementary deoxyribonucleic acid JOURNAL Mol. Endocrinol. 1, 767-776 (1987) STANDARD simple staff_review FEATURES from to/span description pept 42 716 placental lactogen I (mPL-I) precursor sigp 42 131 placental lactogen I (mPL-I) signal peptide matp 132 713 placental lactogen I (mPL-I) mRNA < 1 844 mPL-I mRNA signal 821 831 mPL-I poly-A signal BASE COUNT 243 a 188 c 176 g 237 t ORIGIN 1 ttcctcactt ggagcctaca ttgtggtgga tcttctcaga aatgcagctg actttgaatc 61 tttcaggctc cgcaggaatg caattgttgc tgctggtgtc aagcctactc ctttgggaga 121 atgtgtcctc caaaccaact gccatggtgc ccactgaaga cctgtatact cgtttggctg 181 aactgctcca taatacattt atcttggccg cagatgtgta tagggaattt gatttggatt 241 ttttcgataa aacttggata acagacagaa cacttcccct gtgtcatact gcttccatcc 301 atactccaga gaatcgagag gaagtccacg aaactaaaac tgaagacctt ctgaaagcaa 361 tgatcaatgt ttcaatttcc tggaaagaac ctctgaaaca cctggtgtct gcactgacgg 421 ctctcccagg agcttctgag agtatgggga aaaaagctgc tgacattaag ggcagaaacc 481 ttgtaattct ggagggactt cagacaatat acaacaggtc tcaggctaac attgaagaaa 541 atgaaaattt tgactaccct gcttggtctg gactcgaaga actgcagtca cctaacgaag 601 acactcatct ttttgccgtt tataatctat gccgctgcat taaaagggac atccataaga 661 tagacagcta tatcaaagtc ttgaggtgcc gagttgtctt tcagaacgaa tgttgagtgc 721 ccacccagcg aagccctgcc cacatggtct ttgttgaacc agacttgtaa tgctttcccc 781 tcctcagtta tgatgagcta taatggaatt attgtcataa aataaaataa aattatttag 841 attc // LOCUS BLYGSA 1621 bp ss-mRNA PLN 01-AUG-1990 DEFINITION Barley glutamate 1-semialdehyde aminotransferase (GSA) mRNA, complete cds. ACCESSION M31545 KEYWORDS glutamate 1-semialdehyde aminotransferase. SOURCE Barley (cv. Bonus) 5 day old dark grown seedling, cDNA to mRNA. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1621) AUTHORS Grimm,B. TITLE Primary structure of a key enzyme in plant tetrapyrrole synthesis: Glutamate 1-semialdehyde aminotransferase JOURNAL Unpublished (1990) Carlsberg Laboratory, Dept. of Physiology, Gamle STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grimm, 22-JAN-1990. FEATURES from to/span description pept 20 1429 glutamate 1-semialdehyde aminotransferase (GSA) precursor (EC 5.4.3.8) sigp 20 121 glutamate 1-semialdehyde aminotransferase signal peptide matp 122 1426 glutamate 1-semialdehyde aminotransferase signal 1598 1603 polyA signal mRNA < 1 1621 GSA mRNA BASE COUNT 362 a 363 c 459 g 437 t ORIGIN 1 ggagaaggaa ggcagcatca tggccggagc agcagccgcc gtggcctccg gcatatcgat 61 caggcctgta gccgcgccta agatctcgcg cgcgccccgc tctcggtcgg tggtgagggc 121 ggccgtctcc atagacgaga aggcttacac ggttcagaaa tccgaggaga tcttcaacgc 181 cgccaaggaa ttgatgcctg gtggtgttaa ttcaccagtc cgtgccttca aatcagtcgg 241 cgggcagccc atagtttttg attctgtgaa gggctctcat atgtgggatg tcgatggaaa 301 tgaatatatt gattatgttg gttcctgggg tcctgcaatc attggtcatg cagatgacaa 361 ggtgaatgct gcacttattg aaactctgaa gaagggtact agctttggtg ctccatgtgc 421 gttggagaat gtgttggctc aaatggtcat ctccgctgtg ccgagtatcg aaatggttcg 481 ttttgtaaat tcaggaacag aagcttgcat gggagcactc cgccttgtgc gtgcattcac 541 tgggagggaa aagattctca agtttgaagg ctgttaccat ggccatgcag attccttcct 601 tgttaaagca ggcagtggtg ttgccaccct cggcctccca gactcccctg gagtgcctaa 661 gggagccacc gttgggactc taacagcacc ttataatgat gctgatgcgg ttaaaaagct 721 gtttgaggat aacaaagggg agattgctgc agtcttcctt gagccggttg ttggcaatgc 781 tggcttcatt cctccgcagc ctgctttcct aaatgctctc cgtgaggtga ccaaacaaga 841 cggcgcactt ctggtgtttg atgaagtgat gactcctttc cgtttagctt atggtggggc 901 acaagagtac tttggaatca cccctgatgt gacaaccttg ggccaaatta ttggcggtgg 961 tcttccggtt ggtgcttacg gtggacggaa ggatatcatg gagatggttg ctccagcagg 1021 gccaatgtac caggcaggaa ccctcagtgg aaaccctcta gctatgactg ctggaatcca 1081 cactctcaag cgtctgatgg agcctggcac ctatgaatac ttagacaagg tcactggtga 1141 acttgtccgg ggcatattgg atgtgggcgc taaaacaggg cacgagatgt gtggaggaca 1201 catcagaggc atgttcggat tcttcttcgc aggtggccca gtgcacaact ttgatgatgc 1261 caagaagagt gacacagcga agtttgggag gttccaccgt ggaatgctgg gcgaaggcgt 1321 gtatctggca ccatcccagt tcgaggcagg ttttacaagc ttggcacaca ccacccaaga 1381 cattgagaaa accgtggagg ctgccgagaa ggttcttcga tggatataga tgatttggat 1441 tgcaaacctt ttgaagcttt tccttctgtt gtattctgtt agtttgtacg tggctgaagt 1501 ttagttttgt attgtatttt gttgtgcagc agcagtatct tgtctctagc ccatttttct 1561 tcttctgagt tagcatttgg ggtgattttg tcttggcaat aaaactttgg ctacgacctc 1621 c // LOCUS MUSSVSIV 541 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse seminal vesicle secretory protein IV (SVS IV) mRNA, 3' end. ACCESSION M35732 KEYWORDS seminal vesicle secretory protein IV. SOURCE Mouse adult seminal vesicle, cDNA to mRNA, clone p2A2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 541) AUTHORS Chen,Y.H., Pentecostt,B.T., McLachlan,J.A. and Teng,C.T. TITLE The androgen-dependent mouse seminal vesicle secretory protein IV: Characterization and complementary deoxyribonucleic acid cloning JOURNAL Mol. Endocrinol. 1, 707-716 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 329 seminal vesicle secretory protein IV (SVS IV) precursor (AA at 3) sigp < 1 50 seminal vesicle secretory protein IV (SVS IV) signal peptide matp 51 326 seminal vesicle secretory protein IV (SVS IV) mRNA < 1 541 SVS IV mRNA BASE COUNT 154 a 107 c 131 g 149 t ORIGIN 1 gtttgttcct cttttctctg cttctccttc tggtgacagg agccattggg aagaaaacta 61 aggaaaaatt cttgcagtcg gaagaaactg tcagagagag cttctcgacg ggaagcagag 121 gccatatgtc aagaagttct gagccagagg tatttgttag gccacaggac tccatcggtg 181 acgaagcttc tgaggaaatg agtagtagta gtagtagtag aagaagaagt aagattatct 241 ctagcagttc tgatggttct aatatggaag gtgagagttc atattcaaag agaaagaaga 301 gccggttttc tcaagatgca ctcgagtgat actgcattga ccagctgaac atctggacca 361 atatgctgga gccatatcgc cagaacagag cccatgatgt cttcagcata cagctcccat 421 gtggtctcag aggcagtccc tggatggcat ttacttccca tgcttgtttg tcttgaggtt 481 cttaaaccta acatttactc tggagctttc tttccaataa agagataaca attgcatcat 541 t // LOCUS NEMRPT 677 bp ds-DNA INV 01-AUG-1990 DEFINITION A.lumbricoides BamHI repetitive DNA. ACCESSION M35399 KEYWORDS BamHI repetitive sequence. SOURCE A.lumbricoides DNA, clone AL700-1. ORGANISM Ascaris lumbricoides Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; Ascaridida; Ascaridina; Ascaridoidea; Ascarididae. REFERENCE 1 (bases 1 to 677) AUTHORS Warren,T. and Pasternak,J.J. TITLE A related moderately repetitive DNA family in the nematodes Ascaris lumbricoides and Panagrellus silusiae JOURNAL Nucleic Acids Res. 16, 10833-10847 (1988) STANDARD simple staff_review FEATURES from to/span description rpt 1 677 BamHI repeat BASE COUNT 186 a 158 c 161 g 172 t ORIGIN 1 ggatccgagt aagtgtgcaa aaacagcatt atttatgtaa acgaagctca attacatttc 61 taagtgcaat tacggctgta tcacgggttg gcaactccat attccacgga aatccaccca 121 ttcaacgggt gcaattcccg tgagtatcgt aaaataggag agtgaaagct cagaatgcgg 181 ctagaatgtg tcatcttgtt gccaaatcgg agatatgtat cgtgtgaatt gacatgtatc 241 atgccaaggt aggtcggaaa ggccaaagaa aagcggaaac cagacggtcg gaaagtacag 301 aactcgattc ttgcgattgt gcatcttcga gttctggtaa gtgtaaatgc gagtccggtg 361 tctgatcgga tctgatcggc cagtgccgag gcttacacgt gactatcaca tagtctcact 421 ctttcactct tcccttttcg cgatttccga ttcagtgcta acaactcgac gtagacaccc 481 cactctttct cctgcgcatt cctatgccgg tcaccgattg ggtcgcaaaa tgccaaagga 541 cagggcatgt aagcccgcat cttaattgtt aagattcacc gatgaatcgt caaaaatttt 601 gcaaaagcta gtggaaaacg gggttttgag gcccgttcca ccggcaaacc gtcatcgtgc 661 gccgatcaga tggatcc // LOCUS PNGRPT 682 bp ds-DNA INV 01-AUG-1990 DEFINITION P.silusiae BamHI repetitive DNA. ACCESSION M35398 KEYWORDS BamHI repetitive sequence. SOURCE P.silusiae DNA, clone PS700-1. ORGANISM Panagrellus silusiae Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; Rhabditida; Rhabditina; Rhabditoidea; Cephalobidae. REFERENCE 1 (bases 1 to 682) AUTHORS Warren,T. and Pasternak,J.J. TITLE A related moderately repetitive DNA family in the nematodes Ascaris lumbricoides and Panagrellus silusiae JOURNAL Nucleic Acids Res. 16, 10833-10847 (1988) STANDARD simple staff_review FEATURES from to/span description rpt 1 682 BamHI repeat BASE COUNT 201 a 154 c 155 g 172 t ORIGIN 1 ggatccgcag cgaattgtgt aaaacagcat taattatgta aaagaagctc aattaacctt 61 tctaagtgca attgaggctg tatcacgggt tggcaacctc gtattccacg gaaatccacc 121 cattcaacgg gtgcgatttc gtgtttttcg taaaaatcgg attctgaagg ctagaatccg 181 gccagaatgt gtcatcttgt tccaaatgag agttatttga catctgaatc acatttgaaa 241 tgcaaagaca ggtcggaaag gccaaacaag agcgaaaacc cgcgggtcgc caaaagtacc 301 agaactcgat tcttgcgatt tttcgcattt tcgagttctg gtaagtgcaa aaagtttcga 361 tttcggatct gcatcggaat ctgattgccc acgtgccaga aggcttaaaa acgtgcacaa 421 accacatggt taccctttac cttgttttcg aaatttaaca aaaagtgcaa aaaccgggta 481 aaaacccatc tttggcctgc gcattgccaa tggcggtcat cgatgggtcg cgaagtgcca 541 aagggaccaa ggtgtaagcc cgcatcatat ctgttaagat tcatcgatga atcggccaat 601 attttgaaaa gctagtggaa aaacgcgttt tgacgcccgt ttccaccggc aaaccgtcat 661 cgtgcgccga tcagacggat cc // LOCUS TETTRGA 75 bp ss-tRNA RNA 01-AUG-1990 DEFINITION T.thermophila Gln-tRNA-UUG. ACCESSION M35400 KEYWORDS glutamine tRNA. SOURCE T.thermophila tRNA. ORGANISM Tetrahymena thermophila Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H. TITLE Dramatic events in ciliate evolution: Alteration of UAA and UAG termination codons to glutamine codons due to anticodon mutations in two Tetrahymena tRNAs-Gln JOURNAL EMBO J. 5, 1307-1311 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 75 Gln-tRNA modified 9 9 m1g modified 10 10 m2g modified 13 13 p modified 19 19 d modified 20 20 d modified 34 34 um anticdn 34 36 Gln-tRNA anticodon ttg modified 39 39 p modified 48 48 m5c modified 54 54 p modified 57 57 m1a BASE COUNT 15 a 19 c 21 g 18 t 2 others ORIGIN 1 ggttgtatgg tgtagcggaa agcaccgagg actttgaatc ctctgacctg ggttcgaatc 61 ccagtacgac ctcca // LOCUS TETTRGB 75 bp ss-tRNA RNA 01-AUG-1990 DEFINITION T.thermophila Gln-tRNA-CUA. ACCESSION M35401 KEYWORDS transfer RNA-Gln. SOURCE T.thermophila tRNA. ORGANISM Tetrahymena thermophila Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H. TITLE Dramatic events in ciliate evolution: Alteration of UAA and UAG termination codons to glutamine codons due to anticodon mutations in two Tetrahymena tRNAs-Gln JOURNAL EMBO J. 5, 1307-1311 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 75 Gln-tRNA modified 10 10 m2g modified 13 13 p modified 19 19 d modified 20 20 d anticdn 34 36 Gln-tRNA anticodon cta modified 37 37 t6a modified 39 39 p modified 48 48 m5c modified 54 54 p modified 57 57 m1a BASE COUNT 19 a 18 c 18 g 19 t 1 others ORIGIN 1 ggttctatag tatagcgcaa agtactgggg antctaaatc ccttgacctg ggttcgaatc 61 ccagtaggac ctcca // LOCUS TETTRGC 75 bp ss-tRNA RNA 01-AUG-1990 DEFINITION T.thermophila Gln-tRNA-UUA. ACCESSION M35402 KEYWORDS transfer RNA-Gln. SOURCE T.thermophila tRNA. ORGANISM Tetrahymena thermophila Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Hanyu,N., Kuchino,Y., Nishimura,S. and Beier,H. TITLE Dramatic events in ciliate evolution: Alteration of UAA and UAG termination codons to glutamine codons due to anticodon mutations in two Tetrahymena tRNAs-Gln JOURNAL EMBO J. 5, 1307-1311 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 75 Gln-tRNA modified 10 10 m2g modified 13 13 p modified 16 16 d modified 19 19 d modified 20 20 d modified 32 32 cm modified 34 34 um anticdn 34 36 Gln-tRNA anticodon tta modified 37 37 t6a modified 39 39 p modified 48 48 m5c modified 54 54 p modified 57 57 m1a BASE COUNT 16 a 17 c 20 g 19 t 3 others ORIGIN 1 ggttccatag tatagdggdd agtactgggg actttaaatc ccttgacctg ggttcgaatc 61 ccagtgggac ctcca // LOCUS BEGRR5S 120 bp ss-rRNA RNA 01-AUG-1990 DEFINITION B.alba 5S ribosomal RNA. ACCESSION M35565 KEYWORDS 5S ribosomal RNA. SOURCE B.alba (strain B18LD) rRNA. ORGANISM Beggiatoa alba Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Beggiatoaceae. REFERENCE 1 (bases 1 to 120) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 120 5S ribosomal RNA BASE COUNT 32 a 32 c 29 g 27 t ORIGIN 1 ttcttggcga ccatagcaaa taggaaccac ccgaccccat cccgaactcg gtagtgaaac 61 tgttctgcgc cgatgatagt gtggatactc tccatgtgaa agtaggttat cgccaagagc // LOCUS ECOHEMC 2092 bp ds-DNA BCT 01-AUG-1990 DEFINITION E.coli porphobilinogen deaminase (hemC) and uroporphyrinogen III synthase (hemD) genes, complete cds. ACCESSION X04242 M35827 KEYWORDS deaminase; hemC gene; hemD gene; porphobilinogen deaminase; uroporphyrinogen III synthase. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1957) AUTHORS Thomas,S.D. and Jordan,P.M. TITLE Nucleotide sequence of the hemC locus encoding porphobilinogen deaminase of Escherichia coli K12 JOURNAL Nucleic Acids Res. 14, 6215-6226 (1986) STANDARD simple automatic REFERENCE 2 (bases 1290 to 2092) AUTHORS Jordan,P.M., Mgbeje,B.I.A., Thomas,S.D. and Alwan,A.F. TITLE Nucleotide sequence for the hemD gene of Escherichia coli encoding uroporphyrinogen III synthase and initial evidence for a hem operon JOURNAL Biochem. J. 249, 613-616 (1988) STANDARD simple staff_review COMMENT Data kindly reviewed (11-SEP-1986) by P. Jordan FEATURES from to/span description pept 390 1331 porphobilinogen deaminase (hemC) pept 1328 2068 uroporphyrinogen III synthase (hemD) signal 330 335 put. -35 region rpt 63 67 inverted repeat A rpt 78 82 direct repeat 1 rpt 349 853 inverted repat A' signal 354 359 put. -10 region rpt 356 360 direct repeat 1 rpt 367 371 direct repeat 1 binding 377 381 put. ribosome binding site signal 1508 1522 pot. transcription termination signal BASE COUNT 495 a 540 c 566 g 491 t ORIGIN 1 caagacgtat cgcctgattt gctacccgtc atgactgtga ttccgccaac atcaacggta 61 acacgcggca ttcgggatat ttcgtatgtc aaaggtaacc gttaccactt ttcgcgcctg 121 gtttttttag tttcacgacg aaaaaatggt ctaaaacgtg atcaatttaa caccttgctg 181 attgaccgta aagaaagatg cgctacatac aagtgtagca ccgtttattc tctgtaaatt 241 ccttattaca acggcgtgaa acgcctgtca ggatccactg ccagacctca ttttacggtt 301 tgcgcaggcg tctacgtttc accacaacac tgacatcact ctggcaagga tgttaggatg 361 gaccacggat gataatgacg gtaacaagca tgttagacaa tgttttaaga attgccacac 421 gccaaagccc acttgcactc tggcaggcac actatgtcaa agacaagttg atggcgagcc 481 atccgggcct ggtcgttgaa ctggtaccga tggtgacgcg cggcgatgtg attcttgata 541 cgccgctggc gaaagtaggc ggaaaaggct tatttgtaaa agagctggaa gtcgcgctcc 601 tcgaaaatcg cgccgatatc gccgtacact caatgaaaga tgtgccggtt gaattcccgc 661 aaggtctggg actggtcact atttgtgagc gtgaagatcc tcgcgatgcc tttgtgtcca 721 ataactatga cagtctggat gcgttaccgg caggcagtat cgtcgggacg tccagtttac 781 gtcgccagtg ccaactggct gaacgccgtc cggatctgat tatccgctcc ctgcgcggca 841 acgtcggcac tcgcctgagc aaactggata acggcgaata cgatgccatc attcttgccg 901 tagccggact aaaacgttta ggtctggagt cacgtattcg cgccgcgttg ccacccgaga 961 tttctcttcc ggcggtagga caaggtgcgg tgggtattga atgccgcctt gatgattcac 1021 gcactcgcga gctgcttgcc gcgctgaatc accacgaaac tgcactgcgc gttaccgcag 1081 aacgcgccat gaatacccgt ctcgaaggcg catgtcaggt gccaattggt agctacgccg 1141 agcttattga tggcgaaatc tggctgcgtg ggctggtcgg cgcgccggac ggttcgcaga 1201 ttattcgcgg tgaacgccgc ggtgcgccgc aagatgccga acaaatgggg atttcgctgg 1261 cagaagagct actgaataac ggcgcgcgcg agatcctcgc tgaagtctat aacggagacg 1321 ccccggcatg agtatccttg tcacccgccc gtctcccgct ggagaagagt tagtgagccg 1381 tctgcgcaca ctggggcagg tggcctggca ttttccgctg attgagtttt ctccgggtca 1441 acaattaccg caacttgctg atcaactggc agcgctgggg gagagcgatc tgttgtttgc 1501 cctctcgcaa cacgcggttg cttttgccca atcacagctg catcagcaag atcgtaaatg 1561 gccccgacta cctgattatt tcgccattgg acgcaccacc gcactggcac tacataccgt 1621 aagtggacag aagattctct acccgcagga tcgggaaatc agcgaagtct tgctacaatt 1681 acctgaatta caaaatattg cgggcaaacg tgcgctgata ttacgtggca atggtggtcg 1741 tgagctaatt ggggataccc tgacggcgcg cggtgctgag gtcacttttt gtgaatgtta 1801 tcaacgatgc gcaatccatt acgatggtgc agaagaagcg atgcgctggc aagcccgcga 1861 ggtgacgatg gtcgttgtta ccagcggtga aatgttgcag caactctggt cactgatccc 1921 acaatggtat cgtgagcact ggttactaca ctgtcgacta ttggtcgtca gtgagcgttt 1981 ggcgaaactc gcccgggaac tgggctggca agacattaag gtcgccgata acgctgacaa 2041 cgatgcgctt ttacgggcat tacaataact ctcataacag gaagccataa tg // LOCUS LTTRR5S 117 bp ss-rRNA RNA 01-AUG-1990 DEFINITION L.discophora 5S ribosomal RNA. ACCESSION M35569 KEYWORDS 5S ribosomal RNA. SOURCE L.discophora (strain Stokes) rRNA. ORGANISM Leptothrix discophora Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Budding and/or appendaged bacteria; Prosthecate bacteria. REFERENCE 1 (bases 1 to 117) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 27 a 35 c 32 g 23 t ORIGIN 1 atgcctgacg accatagcga ggtggtccca ctccttccca tcccgaacag gacagtgaaa 61 cgcctcagcg ccgatgatag tgcgcattcg cgtgtgaaag taggtcatcg tcaggct // LOCUS TBSACG 4776 bp ss-RNA VRL 01-AUG-1990 DEFINITION Tomato bushy stunt virus complete genome. ACCESSION M21958 M31019 KEYWORDS capsid protein; coat protein; complete genome; p19 protein; p22 protein; p33 protein; p41 protein; p92 protein. SOURCE Tomato bushy stunt virus (strain cherry), cDNA to viral RNA. ORGANISM Tomato bushy stunt virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Tombusvirus. REFERENCE 1 (bases 2621 to 4776) AUTHORS Hillman,B.I., Hearne,P., Rochon,D. and Morris,T.J. TITLE Organization of tomato bushy stunt virus genome: Characterization of the coat protein gene and the 3' terminus JOURNAL Virology 169, 42-50 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 2620) AUTHORS Hearne,P.Q., Knorr,D.A., Hillman,B.I. and Morris,T.J. TITLE The complete genome structure and synthesis of infectious RNA from clones of tomato bushy stunt virus JOURNAL Virology 177, 141-151 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Q.Hearne, 16-DEC-1988. Draft entry and computer-readable sequence for [2] kindly submitted by D.Knorr, 21-DEC-1989. The 5' terminal nucleotide was not determined. However, in the infectious constructs, two 5' terminal "g" residues are added, one of which is removed during subsequent replication in host plants. FEATURES from to/span description pept 166 1056 p33 protein pept 166 2622 p92 protein (read-through of p33) pept 2652 3818 p41 capsid protein pept 3888 4406 p19 protein pept 3856 4425 p22 protein mRNA 2621 4776 2.2kb subgenomic mRNA mRNA 3841 4776 0.9kb subgenomic mRNA BASE COUNT 1257 a 983 c 1315 g 1220 t 1 others ORIGIN 1 naaattctcc aggatttctc gacctagttc gtttatctgg tgacttgcgc taccgttgct 61 ttgcgtagag aatttctctc cataattatt atctttagtt gtggggtttg aaggttgggt 121 ctacctttcg gggggataaa ttgtaacttc caacaaacaa gcgacatgga gaccatcaag 181 agaatgattt ggcctaagaa agagattttt gtgggtgatt tcgcaaccgg agtgaatagg 241 acagttccgg tgaacatctt tcaattggtg tgtcgtgtgg ttctgagata catgaggaca 301 gggaaaatag agtgtgattc tgacagcatg actaagttta tagttgaatt actcaaaact 361 gattgtgctg ccaaatggga atggttcatg aagagacggc agaggggtga ttacattgtc 421 cctctatcta tagcctccat accaatcata ccgctgttga gttatgccac tagggtacgc 481 gcagtctcag tcaaggcttt tggcaatgaa ctatcgttca atgtcagggt gcctagacca 541 tctgtaccta agaaaggatt gctcctcaga ctggcggcag gtctagcgtt agctcctata 601 tgcgcgctgg ccgtgtacgc taccctacct agggaaaaac tgtcggtatt taagctgaga 661 actgaggcac gagcacacat ggaggatgag agagaagcga cagattgtct ggtggttgag 721 ccggcaaggg aacttaaggg taaagatggt gaggatctcc tcactggtag tagattgact 781 aaggtgatcg cgtccactgg gcgccctcgt cgaagacctt atgcggcaaa gatcgcacag 841 gtggcgagag caaaggtggg ttaccttaag aacagtccag agaatagact aatctaccag 901 agggtgatga tcgagatcat ggacaaagac tgcgtcaggt atgttgacag ggatgtcata 961 ttgcctttgg ctattggatg ctgttttgtc tatccggatg gagtggagga gtcggcggca 1021 ctatggggct cacaggagtc cctgggtgtc aaatagggag gcctagtacg tctacctggg 1081 gttgtaacac agatcaatcg agatatccca tctgatgtgt tacttcctca ggaggtgcta 1141 gaggttcgta caggacctcc caatgctaag gaccgtaata tatttatggt tgcaggttgc 1201 ccatcacagg cacggttctt agtacataat cactgcctga aaaaccttaa aaggggtctt 1261 gtggagagag tcttctgcgt agagagaaac gggaagctcg ctcgcactcc acaacctacc 1321 aaaggagcct ttggacgtct ttccccgttc aggaaagcgg tttgtgagaa ggttggggta 1381 gcccaccgac ttgggtatga tgggtttctg tcatactaca gcggtgcgaa actccgtact 1441 tacacacgag ccgtggagag tctgcatatc acacctgtct ccgagaggga tagtcacttg 1501 actaccttcg taaaagcaga gaagatatcg acgtctaagg gtgacccagc acctcgggtg 1561 attcagcctc gaaacccgag gtacaatgtg gaacttggaa gatatctacg gcatatggaa 1621 tccaagctga tgaaagctgt tgatggcgtg ttcggagaga cgacatgcat caaaggatac 1681 acagctgatg aggtaggtgc aattttccgg gctaaatggg acaggtttga taagcctgtc 1741 gccatagggc tcgatgcatc taggtttgat caacactgtt ccgttgaagc attgcaatat 1801 gagcatagct tctacagggc catgtaccct ggcaacaagc tcttgggcaa gttgttggaa 1861 tggcagctcc ataataaagg taaaggttat gttccagatg gaactataac ctatcgcaag 1921 gagggctgtc gcatgagtgg ggatataaac acctcgttgg gcaactatct actgatgtgt 1981 gcaatggtac atgggtacat gcgtcatctg gggattaatg agtttagtct ggcaaactgt 2041 ggggatgatt gcgtcctaat tgtcgaacgc aggaatctta agcagataca gagaacttta 2101 ccggagtatt tcctcaatct gggatatact atgaaggtgg agcaacctgt atttcaactg 2161 gaagaggttg aattttgcca ggcacaccca gtacagtttc aaggcggttg gaagatggtt 2221 cgaaacgtcc gtactgctat gagcaaggat gtgcactgtg tcaacaatat acgcgatttg 2281 gcgacgagga gagcttggag taatgctcaa catcatgggg gtctagcgct tagtgctggt 2341 attccagttg tggagacgtt ttactctagg tttaagcttt atgatgtacc tcgtaaacat 2401 caacgtattg acacggtcac aaatgtgcac aagtggcgtg gatccggtgg gagttatgtt 2461 gtgacccctg aatctagggc tagcttttgg gctgcctttg gactcacggg ggatgagcaa 2521 ctggctctgg aggaccgtct ggaaagatgg gagatggatc tgtttggaga ggagggtgtt 2581 gacgctcatg agcccagcat cctcgactcc gccgtagctt gaccaagaat acacacacgc 2641 aggatagaca catggcaatg gtaaagagaa acaacaacac gggaatgatc ccggtgagta 2701 caaagcaatt actggcattg ggtgcggccg ctggggccac agccttgcag ggatttgtca 2761 agaataatgg gatggccatc gttgaggggg ctgtcgatct gactaaaaga gcgtacaaag 2821 cagtgcggag aagaggaggt aagaaacagc agatgattaa tcatgtaggt ggtacaggtg 2881 gtgctataat ggcgccggta gcagtgacta gacaacttgt cggtagtaag cctaagttta 2941 ctggcaggac gtctggctct gtcacagtta cccaccgtga gtatctgtca caagtgaata 3001 attccacggg tttccaagtt aatgggggaa ttgtcggcaa tttgttacag cttaacccgt 3061 tgaatggtac attgttctct tggttgccag cgatagcatc caattttgat cagtacacat 3121 tcaacagcgt tgtgctacat tatgtgcccc tatgttcaac tactgaggta gggagagtgg 3181 ctatttactt tgataaggac tcagaagatc cagaacctgc tgatagagtt gagttggcga 3241 attacagcgt gcttaaagag acagcccctt gggctgaagc gatgcttagg gtacccaccg 3301 ataagattaa gagattttgt gatgacagtt ccacatctga tcacaaactt atcgacttgg 3361 gtcaattggg cattgctaca tatggtggcg ctgggactaa tgctgtgggg gatatcttta 3421 tctcgtacag tgttacgtta tatttccctc aacctacgaa cacactcctt agtaccagaa 3481 ggctcgacct tgctggcgct cttgtcacag catctggccc tggatacctc ctggtgtcta 3541 ggactgccac tgtattgaca atgacattcc gtgctacagg cacgtttgtc atatccggga 3601 cgtatcggtg cctcacggca acaacgttag gcttggctgg cggagtgaat gtcaatagta 3661 tcacagttgt agataacata ggtacagaca gtgcgttttt cataaattgt actgtctcta 3721 acctaccatc tgtggtgaca ttcacatcta ccggtatcac atctgccaca gtacattgcg 3781 tgcgcgcgac acgacagaat gatgtttctc taatttagtg tgtcctgcga ggggcctctt 3841 gaacaagacc agttcatgga tactgaatac gaacaagtca ataaaccatg gaacgagcta 3901 tacaaggaaa cgacgctagg gaacaagcta acagtgaacg ttgggatgga ggatcaggag 3961 gtaccacttc tcccttcaaa cttcctgacg aaagtccgag ttggactgag tggcggctac 4021 ataacgatga gacgaattcg aatcaagata atccccttgg tttcaaggaa agctggggtt 4081 tcgggaaagt tgtatttaag agatatctca gatacgacag gacggaagct tcactgcaca 4141 gagtccttgg atcttggacg ggagattcgg ttaactatgc agcatctcga tttttcggtt 4201 tcgaccagat cggatgtacc tatagtattc ggtttcgagg agttagtatc accgtttctg 4261 gagggtcgcg aactcttcag catctctgtg agatggcaat tcggtctaag caagaactgc 4321 tacagcttgc cccaatcgaa gtggaaagta atgtatcaag aggatgccct gaaggtactg 4381 agaccttcga aaaagaaagc gagtaagaca gactcttcag tctgagtttg tggagatgag 4441 tgtaaatctg gcatagcata caggttactc ttgttgggtt ctggatgtta ggatgacgag 4501 tcgactcggg ctccgcacta ggtttggtcg cctaggggat ggagatatgg aaagggtctc 4561 gtgtggtatc agtcggtcga aagacgcgct tccaacatgg gcctatggtc ggataagtct 4621 tagcaatacc agccagcatg aattggattc ctgtttacga aagttaggtg tcacttgtgg 4681 aagcggaccc agacacggtt gatctcaccc ttcggggggc tatagagatc gctggaagca 4741 ctaccggaca accggaacat tgcagaaatg cagccc // LOCUS THTRR5S 122 bp ss-rRNA RNA 01-AUG-1990 DEFINITION T.nivea 5S ribosomal RNA. ACCESSION M35563 KEYWORDS 5S ribosomal RNA. SOURCE T.nivea (strain JP2) rRNA. ORGANISM Thiothrix nivea Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Leucotrichaceae. REFERENCE 1 (bases 1 to 122) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 122 5S ribosomal RNA BASE COUNT 27 a 35 c 36 g 24 t ORIGIN 1 tttgcctggt gtccatagag cactggaacc acctgatccc atcccgaact cagaagtgaa 61 acggtgcatc gccgatggta gtgtggggcc tccccatgtg agagtaggtc aacgccaggc 121 gc // LOCUS THVRR5S 123 bp ss-rRNA RNA 01-AUG-1990 DEFINITION Thiovulum sp. 5S ribosomal RNA. ACCESSION M35570 KEYWORDS 5S ribosomal RNA. SOURCE Thiovulum sp. rRNA. ORGANISM Thiovulum sp. Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Colorless sulfur bacteria. REFERENCE 1 (bases 1 to 123) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 123 5S ribosomal RNA BASE COUNT 30 a 28 c 30 g 35 t ORIGIN 1 tttggttggt gattacagag aaaaggtcac actcagctcc atttcgaacc tgaaagttaa 61 gcttttcttc gtcgataata ctgcccccta cgggggtggg acggtagatc gttgccaacc 121 att // LOCUS VITRR5S 118 bp ss-rRNA RNA 01-AUG-1990 DEFINITION V.beggiatoides 5S ribosomal RNA. ACCESSION M35566 KEYWORDS 5S ribosomal RNA. SOURCE V.beggiatoides (strain B23SS) rRNA. ORGANISM Vitreoscilla beggiatoides Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Beggiatoaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 30 a 35 c 31 g 22 t ORIGIN 1 cgcctgacga ccacagcgac tgtgaaccac ccgaccccat ctcgaactcg gtagtgaaac 61 cagtcagcgc cgatgatagt gtggcatatg ccatgtgaaa gtaggtcatc gtcaggct // LOCUS VITRR5SX 118 bp ss-rRNA RNA 01-AUG-1990 DEFINITION V.stercoraria 5S ribosomal RNA. ACCESSION M35567 KEYWORDS 5S ribosomal RNA. SOURCE V.stercoraria (strain VT1) rRNA. ORGANISM Vitreoscilla stercoraria Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Beggiatoaceae. REFERENCE 1 (bases 1 to 118) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 118 5S ribosomal RNA BASE COUNT 30 a 32 c 30 g 26 t ORIGIN 1 tgtttgacga ccatagcgag ttggtcccac gccttcccat cccgaacagg accgtgaaac 61 gacttagcgc cgatgatagt gtggattacc catgtgaaag taggtcatcg tcaaacgc // LOCUS VITRR5SXX 116 bp ss-rRNA RNA 01-AUG-1990 DEFINITION V.filiformis 5S ribosomal RNA. ACCESSION M35568 KEYWORDS 5S ribosomal RNA. SOURCE V.filiformis (strain ATCC 15551) rRNA. ORGANISM Vitreoscilla filiformis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Nonphotosynthetic, nonfruiting gliding bacteria; Cytophagales; Beggiatoaceae. REFERENCE 1 (bases 1 to 116) AUTHORS Stahl,D.A., Lane,D.J., Olsen,G.J., Heller,D.J., Schmidt,T.M. and Pace,N.R. TITLE Phylogenetic analysis of certain sulfide-oxidizing and related morphologically conspicuous bacteria by 5S ribosomal ribonucleic acid sequences JOURNAL Int. J. Syst. Bacteriol. 37, 116-122 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 116 5S ribosomal RNA BASE COUNT 27 a 34 c 31 g 24 t ORIGIN 1 gcctgatgac catagcaagg tggtcccact ccttcccatc ccgaacagga cagtgaaacg 61 ccttagcgcc gatgatagtg cggttctccc gtgtgaaagt aggacatcgt caggct // LOCUS PVICSA 1895 bp ds-DNA INV 01-AUG-1990 DEFINITION Plasmodium vivax circumsporozoite protein gene, complete cds. ACCESSION M11926 M20671 J04090 KEYWORDS circumsporozoite protein. SOURCE P.vivax (strain Belem) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1529) AUTHORS Arnot,D.E., Barnwell,J.W., Tam,J.P., Nussenzweig,V., Nussenzweig,R.S. and Enea,V. TITLE Circumsporozoite protein of Plasmodium vivax: Gene cloning and characterization of the immunodominant epitope JOURNAL Science 230, 815-818 (1985) STANDARD simple staff_review REFERENCE 2 (bases 158 to 1294; revises [1]) AUTHORS Arnot,D.E., Barnwell,J.W. and Stewart,M.J. TITLE Does biased gene conversion influence polymorphism in the circumsporozoite protein-encoding gene of Plasmodium vivax? JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106 (1988) STANDARD full staff_entry REFERENCE 3 (bases 1 to 157; 1295 to 1895; revises [1]) AUTHORS Arnot,D.E. JOURNAL Unpublished (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2],[3] kindly submitted by D.E.Arnot, 14-SEP-1988. FEATURES from to/span description pept 158 1294 circumsporozoite protein BASE COUNT 674 a 347 c 471 g 403 t ORIGIN 1 ctgcataagg caaactcaca aacatccaaa aaaatataca tatatatatt tatatacacg 61 tgtatatatt attaagcggc ttaagttaag caagcaaaac agccaaaggc ctacaagtgt 121 aaacagcttc ctgcacacac gtatatacca gaacaagatg aagaacttca ttctcttggc 181 tgtttcttcc atcctgttgg tggacttgtt ccccacgcac tgcgggcaca atgtagatct 241 gtccaaggcc ataaatttaa atggagtaaa cttcaataat gtagacgcca gttcacttgg 301 cgcggcacac gtaggacaaa gtgctagccg aggcagagga cttggtgaga acccagatga 361 cgaggaagga gatgctaaaa aaaaaaagga tggaaagaaa gcagaaccaa aaaatccacg 421 tgaaaataag ctgaaacaac caggagacag agcagatgga cagccagcag gagacagagc 481 agatggacag ccagcaggtg atagagcaga tggacaacca gcaggagata gagcagctgg 541 acaaccagca ggagatagag cagatggaca gccagcagga gacagagcag atggacagcc 601 agcaggagac agagcagatg gacaaccagc aggagacaga gcagatggac aaccagcagg 661 tgatagagca gctggacaac cagcaggtga tagagcagct ggacaaccag caggagatag 721 agcagatgga cagccagcag gagatagagc agctggacag ccagcaggag atagagcaga 781 tggacagcca gcaggagata gagcagctgg acagccagca ggagatagag cagatggaca 841 gccagcagga gatagagcag ctggacagcc agcaggagat agagcagctg gacagccagc 901 aggagataga gcagctggac agccagcagg agatagagca gctggacagc cagcaggaaa 961 tggtgcaggt ggacaggcag caggaggaaa cgcaggagga ggacagggac aaaataatga 1021 aggtgcgaat gccccaaatg aaaagtctgt gaaagaatac ctagataaag ttagagctac 1081 cgttggcacc gaatggactc catgcagtgt aacctgtgga gtgggtgtaa gagtcagaag 1141 aagagttaat gcagctaaca aaaaaccaga ggatcttact ttgaatgacc ttgagactga 1201 tgtttgtaca atggataagt gtgctggcat atttaacgtt gtgagtaatt cattagggct 1261 agtcatattg ttagtcctag cattattcaa ttaagtagct gacatccatt attttcggcg 1321 tcctccacgg tgcatattaa gtgttttgtg ttttgtacat gcacataaat acttgcccgt 1381 agggacatga tttttttccc tttcttatga atgttccctg ctgtttgcac gtaactgtat 1441 gtacgtgcgc gtaaggcata gtaagtaaca cctcttacac attatgcgct tacgcacaat 1501 cagttgtgca attctagaaa acacgatatg agtattttta aacacttatc gtccaaaaaa 1561 acaaaaaaaa cagaaaaaac agaaaaaaca gaaaaaacaa aaaaaaacaa aaaaaaacaa 1621 aaaaaaacaa aaaaaacaca tttatattaa cttttccttt ttgattgacc cttttttgac 1681 gtatattttt tttttttttt cgtatgtatt atatatactg cttaacgtag agaacttaaa 1741 ttttgagaat gtattttttt ttaacaagtt aaaaaaagaa ctggtatttt tgggaattca 1801 aaaaatttgc aaattcaaaa gaggcgagtt aaaatttgcg ccgtggcaaa cggggtgcgt 1861 gcgggagtcg tgcaaatgtg gcttatatcc ggggg // LOCUS PVICSC 1375 bp ds-DNA INV 01-AUG-1990 DEFINITION Plasmodium vivax circumsporozoite protein gene, 3' end. ACCESSION M20670 J04090 KEYWORDS circumsporozoite protein. SOURCE P.vivax (strain North Korean) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1105) AUTHORS Arnot,D.E., Barnwell,J.W. and Stewart,M.J. TITLE Does biased gene conversion influence polymorphism in the circumsporozoite protein-encoding gene of Plasmodium vivax? JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 8102-8106 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1106 to 1375) AUTHORS Arnot,D.E. JOURNAL Unpublished (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Arnot, 14-SEP-1988. FEATURES from to/span description pept < 1 1105 circumsporozoite protein (AA at 2) BASE COUNT 464 a 260 c 407 g 244 t ORIGIN Sau3AI site. 1 agatctgtcc aaggccataa atttaaatgg agtaaacttc aataatgtag acgccagttc 61 acttggcgcg gcacacgtag gacaaagtgc tagccgaggc agaggacttg gtgagaaccc 121 agatgacgag gaaggagatg ctaaaaaaaa aaaggatgga aagaaagcag aaccaaaaaa 181 tccacgtgaa aataagctga aacaaccagg agacagagca gatggacagc cagcaggaga 241 cagagcagat ggacagccag caggagacag agcagatgga caggcagcag gaaatggtgc 301 aggtggacag ccagcaggtg atagagcagc tggacaacca gcaggcgatg gagcagctgg 361 acagccagca ggcgatagag cagatggaca gccagcagga gatagagcag ctggacagcc 421 agcaggcgat agagcagatg gacagccagc aggagataga gcagctggac agccagcagg 481 cgatagagca gatggacagc cagcaggaga tagagcagct ggacaggcag caggaaatgg 541 tgcaggtgga caggcagcag gaaatggtgc aggtggacaa ccagcaggag atagagcagc 601 tggacagcca gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca 661 gccagcagga gatagagcag ctggacagcc agcaggagat agagcagctg gacaggcagc 721 aggaaatggt gcaggtggac aggcagcagg aggaaatgcg gcaaacaaga aggcagaaga 781 cgcaggagga aacgcaggag gaaacgcagg aggacaggga caaaataatg aaggtgcgaa 841 tgccccaaat gaaaagtctg tgaaagaata cctagataaa gttagagcta ccgttggcac 901 cgaatggact ccatgcagtg taacctgtgg agtgggtgta agagtcagaa gaagagttaa 961 tgcagctaac aaaaaaccag aggatcttac tttgaatgac cttgagactg atgtttgtac 1021 aatggataag tgtgctggca tatttaacgt tgtgagtaat tcattagggc tagtcatatt 1081 gttagtccta gcattattca attaagtagc tgacatccat tattttcggc gtcctccacg 1141 gtgcatatta agtgttttgt gttttgtaca tgcacataaa tacttgcccg tagggacatg 1201 atttttttcc ctttcttatg aatgttccct gctgtttgca cgtaactgta tgtacgtgcg 1261 cgtaaggcat agtaagtaac acctcttaca cattatgcgt tacgcacaat cagttgtgca 1321 attctagaaa acacgatatg agtattttta aacacttatc gtgaccaaaa aaaca // LOCUS ECOHSEST 360 bp ds-DNA BCT 01-AUG-1990 DEFINITION E.coli heat-stable enterotoxin gene, complete cds. ACCESSION M34916 KEYWORDS heat-stable enterotoxin. SOURCE E.coli (strain 153837-2) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 360) AUTHORS Moseley,S.L., Hardy,J.W., Huq,M.I., Echeverria,P. and Falkow,S. TITLE Isolation and nucleotide sequence determination of a gene encoding a heat-stable enterotoxin of Escherichia coli JOURNAL Infect. Immun. 39, 1167-1174 (1983) STANDARD simple staff_review FEATURES from to/span description pept 48 266 heat-stable enterotoxin signal 268 301 pot. transcription termination signal BASE COUNT 115 a 54 c 65 g 126 t ORIGIN 1 ttctggtttt gattcaaatg ttcgtggatg ccatgtccgg aggtaatatg aagaaatcaa 61 tattatttat ttttctttct gtattgtctt tttcaccttt ccctcaggat gctaaaccag 121 tagagtcttc aaaagaaaaa atcacactag aatcaaaaaa atgtaacatt gcaaaaaaaa 181 gtaataaaag tggtcctgaa agcatgaata gtagcaatta ctgctgtgaa ttgtgttgta 241 atcctgcttg taccgggtgc tattaataat ataaagggaa ctaaacagtt ccctttatat 301 ttgttctgat tctgatgatg tctgtaacgt atgtacctgt tgctttgttg aataaatcga // LOCUS HUMRENA1 826 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exon 1. ACCESSION M10030 M34914 KEYWORDS aspartyl protease; renin. SEGMENT 1 of 5 SOURCE Human fetal liver DNA (library of Lawn et al.), clone lambda-III. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 826) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review REFERENCE 2 (bases 276 to 583) AUTHORS Shine,J., Hardman,J.A., Hort,Y.J., Tellam,J.T., Catanzaro,D.F., Morris,B.J. and Baxter,J.D. TITLE Structure of the human renin gene JOURNAL Trans Assoc Am Physicians 97, 63-69 (1984) STANDARD simple staff_review COMMENT There is only a single renin gene in the human haploid genome [1]. It is comprised of 10 exons encoding 406 amino acids. The first intron separates the 5' untranslated region and the signal peptide coding region from the remainder of the gene. Exon 2 comprises most of the sequence coding for the pro portion of the enzyme. Precise boundaries were not indicated by in figure 2 of [1], but were taken from the text and from other human renin entries. FEATURES from to/span description pept 626 + 723 preprorenin /hgml_locus_uid="LW0050B" /nomgen="REN" /map="1q32" sigp 626 685 renin signal peptide pre-msg 584 > 826 renin mRNA [1] IVS 724 > 826 renin intron A site 520 521 ga in [1]; gagca in [2] BASE COUNT 190 a 226 c 222 g 188 t ORIGIN Chromosome 1q32; 437 bp upstream of KpnI site. 1 gatctaccca ccttggcctc ccaaagtgct gggacaggtg tgagccacca tgcctggccc 61 ctctactctt ataattaaac cagctgttgc ttttcctgcc aagaaaccag tcatgaagat 121 tcacccatgt tctagatggg aaaactgggc tgtagctggg agaggccagt cagggacaaa 181 gccaaagtta atatagagaa tggagcttcc agggtatagg ggttgggtct gggctaggga 241 gctggaaacc taggttttac gcttgtccca gttttgatgt tagccctgac agtgctgttt 301 ctcatcagcc tctgcctgct ccaggggtca cagggccaag ccagatagag ggctgctagc 361 gtcactggac acaagattgc tttcccacag ctgtccttcc tccagcccct ctgctcccca 421 tccggaaacc tgggtaccct tcacccacct agctctgtcc cgcagtgaga tttattgctg 481 actgccctgc catctacccc agggtaataa atcagggcag agcagaattg caatcacccc 541 atgcatggag tgtataaaag gggaagggct aagggagcca cagaacctca gtggatctca 601 gagagagccc cagactgagg gaagcatgga tggatggaga aggatgcctc gctggggact 661 gctgctgctg ctctggggct cctgtacctt tggtctcccg acagacacca ccacctttaa 721 acggtaattg gtaactcagg cagagaaggg gtgggcaggg gtgtaggttc ccaccttccc 781 aacaccctgg cttttccaca tgcggtgtca ttcagtcctt acgatc // LOCUS HUMRENA2 373 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exon 2. ACCESSION M10128 KEYWORDS renin. SEGMENT 2 of 5 SOURCE Human fetal liver DNA, clones lambda-[III,V]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 373) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review FEATURES from to/span description pept + 105 + 255 preprorenin, exon 2 /nomgen="REN" /map="1q32" /hgml_locus_uid="LW0050B" matp 205 + 255 renin pre-msg < 1 > 373 renin mRNA IVS < 1 104 renin intron A IVS 256 > 373 renin intron B BASE COUNT 79 a 107 c 96 g 91 t ORIGIN Chromosome 1q32; about 4.8 kb after segment 1. 1 aacgttaaag gtggttgtac taaagagagg ggtttggcct cagggactca catgtggtgg 61 aggtacagca cttttctatt tttgcttcct ccaccctggg ccaggatctt cctcaagaga 121 atgccctcaa tccgagaaag cctgaaggaa cgaggtgtgg acatggccag gcttggtccc 181 gagtggagcc aacccatgaa gaggctgaca cttggcaaca ccacctcctc cgtgatcctc 241 accaactaca tggacgtgag tgcttggctc agcccctcgc tccctccctg tctcctttcc 301 ctcatggacc tagggctttc tttgctgcaa gactcaccct ttccaagctg tgtttgacga 361 aggcgctgag tag // LOCUS HUMRENA3 2480 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exons 3, 4 and 5. ACCESSION M10150 KEYWORDS renin. SEGMENT 3 of 5 SOURCE Human fetal liver DNA, clones lambda-[III,V]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2480) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review FEATURES from to/span description pept + 140 263 preprorenin, exon 3 /nomgen="REN" /map="1q32" /hgml_locus_uid="LW0050B" 879 997 preprorenin, exon 4 1949 + 2145 preprorenin, exon 5 matp + 140 263 renin 879 997 renin 1949 + 2145 renin pre-msg < 1 > 2480 renin mRNA IVS < 1 139 renin intron B IVS 264 878 renin intron C IVS 998 1948 renin intron D IVS 2146 > 2480 renin intron E BASE COUNT 568 a 710 c 622 g 579 t 1 others ORIGIN Chromosome 1q32; about 0.4 kb after segment 2. 1 ctgcaggaaa atggaaaccc cgacaggtat aggacctcgc ctggggcaag tctacacccg 61 agagccaaga gtgaagccag gcaagacccc aagcccaagg tcccctgagc ccctccagcc 121 ctctcttttt accccacaga cccagtacta tggcgagatt gggatcggga ccccacccca 181 aaccttcaaa gtcgtctttg acactggttc gtccaatgtt tgggtgccct cctccaagtg 241 cagccgtctc tacactgcct gtggtgagac ctaagaccca cagtgcctct cctccatccc 301 cctgccctac tgtgcatgag caatcctgcc caacacccag ctcccatccc tcttgccacc 361 aagggagtgg cttcctctct gcctctgtgc ccactgacat gtaggggaga ggggaagatg 421 tctcccgttt ttctgataca gccaccaagg ttaaaaacaa aaaaaggtcc aagaacccct 481 gagnacccag gaggaccagt tcccagtcgt cctgagattg agacaggact gaattctcaa 541 acccatccca ggcactcgga actcttccat ccctagtctt aatcaacaac ctcttactag 601 cacttactct gtgcctggca tacttctctg gtgttatcag tggttagtga ttactttaaa 661 ttccttcatt taggacaaaa ttctcgatgt atgggacact taggagagcc caagaaaccc 721 agtccttgat tgatgaagca catattccaa gccccctgac cctagggcca ctcatccctg 781 cacctaagct aaccagccat acccacaatg caccctgcct ctgagtcccc ctgtctgggc 841 cactcttgga caaacctgag cctctgtccc cctgccagtg tatcacaagc tcttcgatgc 901 ttcggattcc tccagctaca agcacaatgg aacagaactc accctccgct attcaacagg 961 gacagtcagt ggctttctca gccaggacat catcaccgta agttgggccg ccctaggtca 1021 tctgccccgg accccttctg tccccaggcc tctcctgacc caccagggcc cacacctgcg 1081 gggaggtaca ctgcagccca cttggagcct ggggagctga ggaacaccct actctgccac 1141 atctggtgtt gaaagcagca gtacctatgg gggagcaagc ctgggctacg ggctcaccgt 1201 tgggtggttt gtggatgttt ttgcatctaa cttgcatgta gggctgtcct gagccccgtg 1261 gctgcagtca agtaactcgt cccaagttca ccagctctga ctggggctac taccctagac 1321 tgaaatcctg ggtcagagtc aggctatttt agggtcaggc atagttttaa ggtcacatta 1381 gttgactctg ggactcaggt caaggctctc ttttcttttc catgtggccc atgtctgacc 1441 gtttcctcat cctggagttt ctcaggccct gctccatcag agttagggga ggggcacacg 1501 tggcacctga gaggaaatca gggtgattcc tgcctccctt cctttttctg ttgaactctg 1561 atataaagga ggaagaaggg caagcttgtc tgtgctaaag aaacccttcg cccatgataa 1621 gggtggggcc aagacccagt cctgccaggc acgaaagtct ggccactggg gaggggagga 1681 gctcttggac ttttcttttg cgcttggcag gaccaccctc tcagcctctg ctctccgatc 1741 cctggtcaac tctagctctc tctgggctcc gcagcagaga tgtgtattgg cacagagtgt 1801 gtgcgtgcag ggttgaggca atactcttac cccgatttct gtaccctgga gcatgtgtgc 1861 ccctgggatc cctagtgtgg atgcccagac cagactccaa ccaaggaggg gcagtgggct 1921 tggtctccta tggtccttcc tcccacaggt gggtggaatc acggtgacac agatgtttgg 1981 agaggtcacg gagatgcccg ccttaccctt catgctggcc gagtttgatg gggttgtggg 2041 catgggcttc attgaacagg ccattggcag ggtcacccct atcttcgaca acatcatctc 2101 ccaaggggtg ctaaaagagg acgtcttctc tttctactac aacaggtggg gactgggact 2161 ccaagggctg aggtgggggg caggagggga gaagagatgg ggagtggaag gagagtctgg 2221 gccagaattg taaagtgttt gtaacttagg tgacagccaa tcaatatcta gagctgtact 2281 agccaatatg gaaggcacta ttgcaaattt aaacttaact taaatacagc ttaagcatca 2341 attaagcatt caactggctg gcctcttagt tgtactagcc acagctcaat gcctggcagc 2401 cacggtggct agtaactaca gtctagtaca gtgcagatag agatatccag catgacagga 2461 catctataga cagcgccact // LOCUS HUMRENA4 3057 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exons 6, 7, 8, and 9. ACCESSION M10151 KEYWORDS renin. SEGMENT 4 of 5 SOURCE Human fetal liver DNA, clones lambda-[III,V]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3057) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review FEATURES from to/span description pept + 998 1006 preprorenin, exon 6 /nomgen="REN" /map="1q32" /hgml_locus_uid="LW0050B" 1572 1691 preprorenin, exon 7 2061 2202 preprorenin, exon 8 2466 + 2564 preprorenin, exon 9 matp + 998 1006 renin 1572 1691 renin 2061 2202 renin 2466 + 2564 renin pre-msg < 1 > 3057 renin mRNA IVS < 1 997 renin intron E IVS 1007 1571 renin intron F IVS 1692 2060 renin intron G IVS 2203 2465 renin intron H IVS 2565 > 3057 renin intron I BASE COUNT 750 a 799 c 825 g 683 t ORIGIN Chromosome 1q32; about 0.6 kb after segment 3. 1 aaaagaatag aggaggatca gagttcagag aaatctcaca gtaaaatgga gaggagtctc 61 cggtttggtg atagaaagtg aggccttgag aaaaggccaa ttggcggctc tgcattcagg 121 ggtggtcttt agaagaactg ttttagagga ggtgggggca aggccagatg gcaagaagtt 181 aagaggtgga cgacgtgggt gtcaggaagt ggaggtcatg agatgtacgc tgccctggga 241 cattcaacag ggaagggaat ggggggtggc gtgggggggt gagatccaga agcagaagag 301 gaagggtggg tgtttttaaa tgctagagga tgctcgagtg atcgcctgta ggtggaggaa 361 gaacccaata gaaagaaaga gattaaaaat gtggaaagaa gaggagctaa atgggggcac 421 tggagtttag aggccttgaa agagatgagg aaccagcaga taggaagaag ccaggtttta 481 cagaggagag ggctggcctc ttcttttatc ttgggatggg aaggagggaa catccagaga 541 gatactgaag tgttgagaga caggcaggag ggaatttgtg ctagcatata cacatacgag 601 ttccgaattt ataaaaacac aagtagtttg cagttgcaca aaataacata tgcacaccta 661 cacacccatg cacacatgtg catgtgaatt ctggaaaaac acatcacaca cacaggcatg 721 ccctggagac taggcctaca gtagtccctg agccaagtgc agtgaggagg aaaggaaggt 781 gaggggaatc atctccagac ggggcaccag gagcctggct ccagtccccc acttgttcac 841 tcatggactg ggtaacttca ggcaagtgac ttcgcctctt ggtgactcca ttgcctgaag 901 ggcaaagaga gtacataaca cccaccctgc caaacagcag ggtgatgagg ctggcatgaa 961 atgaagcttc ctttctgctg tctctctttc tctgcagaga ttccgagtaa ggagacaaaa 1021 cccccacatg gctgtgacct tccagtattc cccgagcacc tgacctagaa ttacacacgc 1081 caccggccca aaactcacat cagcaagtcc cagcctccgc tagatgccga agttctctgt 1141 ctctccttcc tgctctctcc atgccacctg cccaccccat acccaatagc ctccccaggg 1201 tcccctccca tgcacctgct caatcagcag caacccaaga gtgaggggtg tccatttgtg 1261 tcttgttcac atccactcac tgtccttgta cctgctcctt ttctgtgacc tctctgggga 1321 tgctttttgg gggaacagct ggactaccct ggaacaacct ctggttggtc ttggggaggg 1381 gaagaaaggc agagaagcag tatgttctgc atgcttccca acgacagctc cgagcctggc 1441 tgtctgtccc acattcctct gctctagagc cctctgtcct cccctcgacc cttgtgcaac 1501 cttccccaat tgcctgagtt gctgggtcct ggaggttatg ggtttccaag agcttctgat 1561 ctttccttta ggaattccca atcgctggga ggacagattg tgctgggagg cagcgacccc 1621 cagcattacg aagggaattt ccactatatc aacctcatca agactggtgt ctggcagatt 1681 caaatgaagg ggtcagaaat cctcagaccc tccccgggct ccaaaaaatg ctgccgtcac 1741 tggggttggg gagggcgggc gcggactgca ttaccatcct gccctctttc caaatgcagc 1801 cacttcttaa gcacagccac catttgctct ctgcctggct ctggtccagg ctggggcaga 1861 gagaagggag gggcctgggc cggagtggtg gaggccgaga gtaccttccc tcctctactc 1921 actgcctcaa cagccagcca gcgtggcgct ccacccaccc acccaccact caggaaggac 1981 atgcagcctg gcgtgcccat cagccttctg tctgtctgtc tgtctgtctg tctctctgtc 2041 tgactgtggc gctcccccag ggtgtctgtg gggtcatcca ccttgctctg tgaagacggc 2101 tgcctggcat tggtagacac cggtgcatcc tacatctcag gttctaccag ctccatagag 2161 aagctcatgg aggccttggg agccaagaag aggctgtttg atgtaagaag ccaaagaggg 2221 aaggtgctgt gggtgtgggg agcggccacc tggtatcggc tcacaaatcc cccaggcaaa 2281 tgaggccatc tcaggccttc gcttgttcac ctcacactct ccacacatgt ggctggtcac 2341 ccatggggcg gggcactgtc cccagccctc tccagcagag agacccaggg ccaccagcgc 2401 aggactcctt gtctgctgag acgtcgttcc atactcaaga aggctctctt tgccccccac 2461 cccagtatgt cgtgaagtgt aacgagggcc ctacactccc cgacatctct ttccacctgg 2521 gaggcaaaga atacacgctc accagcgcgg actatgtatt tcaggtgagg ttcgagtcgg 2581 ccccctcggt ggcagggaga aaggctggac agagaccctc aagagtgaca gattacaatg 2641 cacagatcat gttagaactg tagttctcaa acttggctgt gcatgtcacc tggagagctt 2701 tggaaaaatc caggtacctg ggccacatcc catacctatt aaatcagaac ctctagaagt 2761 gggacctggg gttcagtttc cccagatgat tccaatgtgt ggccatgttt gggcatcact 2821 atgcctgttc cctcatctcc attttctcat caaatactcc caataatcct atgctcctat 2881 attcttaccc tcttttcata atcaataggc ttagagaatt tgaataactt gtctaggatc 2941 agaagctaag gcaaactgta agctcctgaa ggaagcacgt tgcctgatgc cctgtttgcc 3001 tgggatctag cacaggggct aaacatagga atggtgcagt ccacgatggg gcaaaat // LOCUS HUMRENA5 763 bp ds-DNA PRI 01-AUG-1990 DEFINITION Human renin gene, exon 10. ACCESSION M10152 KEYWORDS renin. SEGMENT 5 of 5 SOURCE Human fetal liver DNA, clone lambda-V. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 763) AUTHORS Hardman,J.A., Hort,Y.J., Catanzaro,D.F., Tellam,J.T., Baxter,J.D., Morris,B.J. and Shine,J. TITLE Primary structure of the human renin gene JOURNAL DNA 3, 457-468 (1984) STANDARD full staff_review COMMENT A poly-adenylation signal is located at positions 357-362. FEATURES from to/span description pept + 22 183 preprorenin, exon 10 /nomgen="REN" /map="1q32" /hgml_locus_uid="LW0050B" matp + 22 180 renin pre-msg < 1 > 183 renin mRNA IVS < 1 21 renin intron I BASE COUNT 170 a 236 c 194 g 163 t ORIGIN Chromosome 1q32; about 0.6 kb after segment 4. 1 aaaactctcc ccctctgcca ggaatcctac agtagtaaaa agctgtgcac actggccatc 61 cacgccatgg atatcccgcc acccactgga cccacctggg ccctgggggc caccttcatc 121 cgaaagttct acacagagtt tgatcggcgt aacaaccgca ttggcttcgc cttggcccgc 181 tgaggccctc tgccacccag gcaggccctg ccttcagccc tggcccagag ctggaacact 241 ctctgagatg cccctctgcc tgccttatgc cctcagatgg agacattgga tgtggagctc 301 ctgctggatg cgtgccctga cccctcacag cccttccctg ctttgaggac aaagagaata 361 aagacttcat gttcacagcc tgttgcatct gggttcacta gggtttagaa cagagggagg 421 ggctgcgtga tcatgtgtgg acaggaatgt gacacagaca agctacacat tagcctaggc 481 cacaggttct tgcgtgcagg gatgatgcca tccatctgcc atcaacggga ctcaggtgga 541 gctgttacac aacctcaggt gggaagtctg aaaagagccg gaaccaagct ccctgctatc 601 gactcaggga ccaaggcgta atgctgtggc gagtagactg gggtcagaaa gttgtcccag 661 ctcacagaag ccagctctga gttcagactc tgctctgctg agctagtcag ccctgtctct 721 tgtccctgca aaactcccct cacctgtcct tatccacctg cag // LOCUS SYNT1RNAA 324 bp ds-DNA SYN 01-AUG-1990 DEFINITION Synthetic ribonuclease T1 gene, 3' end. ACCESSION M37098 M35733 M35736 KEYWORDS ribonuclease T1. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 324) AUTHORS Ikehara,M., Ohtsuka,E., Uesugi,S., Kikyodani,T., Aoyama,Y., Tokunaga,T. and Fujimoto,K. TITLE Synthesis and expression of RNase T1 gene JOURNAL Nucleic Acids Symp Ser 15, 197-200 (1984) STANDARD simple staff_review REFERENCE 2 (bases 1 to 324) AUTHORS Nishikawa,S., Morioka,H., Tokunaga,T., Aoyama,Y., Kikyotani,S., Fujimoto,K., Yanase,K., Tanaka,T., Uesugi,S., Ohtsuka,E. and Ikehara,M. TITLE Synthesis and expression of the native RNase T1 gene and several mutant genes JOURNAL Nucleic Acids Symp Ser 16, 287-290 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 321 ribonuclease T1 precursor (AA at 1) sigp < 1 6 ribonuclease T1 signal peptide matp 7 318 ribonuclease T1 BASE COUNT 73 a 98 c 75 g 78 t ORIGIN 1 ttcatggctt gcgactacac ctgcggcagc aactgctact ctagctctga cgtttctacc 61 gctcaggctg ctggctacca gctgcacgag gacggcgaaa ccgttggctc taactcttac 121 ccgcacaaat acaacaacta tgagggcttc gactttagcg tttcttctcc gtactacgaa 181 tggccgatcc tgtctagcgg cgacgtttac tccggtccag gtagcggtgc tgaccgtgta 241 gtattcaacg aaaacaacca gctcgctggc gttatcaccc acaccggcgc ttctggcaac 301 aactttgtag aatgcaccta atag // LOCUS TIPCDREG 209 bp ds-DNA BCT 01-AUG-1990 DEFINITION Plasmid pTiC58 promoter-active fragment CD25 DNA. ACCESSION M35735 KEYWORDS . SOURCE Plasmid pTiC58 DNA. ORGANISM Plasmid pTiC58 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 209) AUTHORS Tait,R.C. and Kado,C.I. TITLE Regulation of the virC and virD promoters of pTiC58 by the ros chromosomal mutation of Agrobacterium tumefaciens JOURNAL Mol. Microbiol. 2, 385-392 (1988) STANDARD simple staff_review BASE COUNT 60 a 37 c 41 g 71 t ORIGIN 1 gtcgacccgg gatccgcggc gataattcat aagtaatgta gtaattacct gattttatat 61 ttcaatttta ttgtaatata atttcaattg taataatata aaaataaata tcccttatgt 121 gttcttgatt tcgttttgta tatggctaga ttcccatctg ccacgacgag gaaatgctac 181 ggcggggcaa gttcagatcc cgggtcgac // LOCUS FIBGLUC 1426 bp ds-DNA BCT 01-AUG-1990 DEFINITION F.succinogenes 1,3-1,4-beta-D-glucan 4-glucanohydrolase gene, complete cds. ACCESSION M33676 M33311 KEYWORDS 1,3-1,4-beta-D-glucan 4-glucanohydrolase; mised-linkage beta-glucanase. SOURCE F.succinogenes (strain S85) DNA, clone PJI5. ORGANISM Fibrobacter succinogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 1426) AUTHORS Teather,R.M. and Erfle,J.D. TITLE DNA sequence of a Fibrobacter succinogenes mixed-linkage beta-glucanase (1,3-1,4-beta-D-glucan 4-glucanohydrolase) gene JOURNAL J. Bacteriol. 172, 3837-3841 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.M.Teather, 11-APR-1990. FEATURES from to/span description pept 145 1194 1,3-1,4-beta-D-glucan 4-glucanohydrolase precursor (EC 3.2.1.73) sigp 145 225 1,3-1,4-beta-D-glucan 4-glucanohydrolase signal peptide matp 226 1191 1,3-1,4-beta-D-glucan 4-glucanohydrolase binding 132 137 ribosome binding site signal 62 66 -35 region signal 85 90 -10 region BASE COUNT 371 a 346 c 335 g 374 t ORIGIN 1 ttttcagcac agcacactgc cacaattgat acagttaatc ttttaaatac attctatttt 61 attggttatt taatttcgct aacttatctt tatctttggt taaatgggat tctgttttgt 121 acagaaactt catggagaaa aaatatgaac atcaagaaaa ctgcagtcaa gagcgctctc 181 gccgtagcag ccgcagcagc agccctcacc accaatgtta gcgcaaagga ttttagcggt 241 gccgaactct acacgttaga agaagttcag tacggtaagt ttgaagcccg tatgaagatg 301 gcagccgcat cgggaacagt cagttccatg ttcctctacc agaatggttc cgaaatcgcc 361 gatggaaggc cctgggtaga agtggatatt gaagttctcg gcaagaatcc gggcagtttc 421 cagtccaaca tcattaccgg taaggccggc gcacaaaaga ctagcgaaaa gcaccatgct 481 gttagccccg ccgccgatca ggctttccac acctacggtc tcgaatggac tccgaattac 541 gtccgctgga ctgttgacgg tcaggaagtc cgcaagacgg aaggtggcca ggtttccaac 601 ttgacaggta cacagggact ccgttttaac ctttggtcgt ctgagagtgc ggcttgggtt 661 ggccagttcg atgaatcaaa gcttccgctt ttccagttca tcaactgggt caaggtttat 721 aagtatacgc cgggccaggg cgaaggcggc agcgacttta cgcttgactg gaccgacaat 781 tttgacacgt ttgatggctc ccgctggggc aagggtgact ggacatttga cggtaaccgt 841 gtcgacctca ccgacaagaa catctactcc agagatggca tgttgatcct cgccctcacc 901 cgcaaaggtc aggaaagctt caacggccag gttccgagag atgacgaacc tgctccgcaa 961 tcttctagca gcgctccggc atcttctagc agtgttccgg caagctcctc tagcgtccct 1021 gcctcctcga gcagcgcatt tgttccgccg agctcctcga gcgccacaaa cgcaatccac 1081 ggaatgcgca caactccggc agttgcaaag gaacaccgca atctcgtgaa cgccaagggt 1141 gccaaggtga acccgaatgg ccacaagcgt tatcgcgtga actttgaaca ctaatcgtgg 1201 ctgattctct ttataattct ctttatcgca aagaccatgt ggtttactcc acatggtttt 1261 tcgttaagtc cactaaaatt aggggatttt cgctattttt tttgaatttt gacactaaaa 1321 tgtcaaatga gtttttgtat ttttgatttc gaaattttta aaaattaaaa taggatagtt 1381 atatggctta tttgaataag gttatgctca tcggtaatat cggtaa // LOCUS BOVRS157A 824 bp ss-mRNA MAM 01-AUG-1990 DEFINITION Bovine retina-specific 15.7 kDa protein mRNA, complete cds. ACCESSION M34915 KEYWORDS . SOURCE Bovine retina, cDNA to mRNA, clone pCR18. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 824) AUTHORS Nakagawa,Y., Kuo,C.-H., Ishii,K., Shiosaka,S., Tohyama,M. and Miki,N. TITLE Cloning and characterization of a cDNA specific for bovine retina JOURNAL Neurosci. Res. 3, 300-310 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 138 581 retina-specific 15.7 kDa protein mRNA < 1 824 retina-specific mRNA signal 800 805 polyA signal BASE COUNT 178 a 219 c 224 g 203 t ORIGIN 1 tttagcctca gccgtgaccg gccccgtccc gcggcgccgg gagttcgtgt gaacgggtag 61 gtgtaccgac ttcgcccgtc cgtgaatccc gtggtcgcaa aggcccgcgc ggcgggccgg 121 gttctgccga taccttaatg ggctgtgcgc gaggagagcc tcaattgcaa gttggtcgag 181 gagatcgcca cgctggtgca gagctggcct cactagttgc ggctagtgta ggacgttgta 241 ctccgacatt ccgcaagccc ttccacacgg acagtcctag catccagggt cagtggcacc 301 ccttcaccaa caaaccgaca gcactggggt gctcctcgag aggtccagaa tcctgccccg 361 acccagcggc cagcacaatg aagaccaact ccatacccac agtttggact tttactccag 421 cagagggtgg ttcctgctcc tggtttgctt cacgggagac agatgaagcc accaatgggg 481 tacttcttgc ttgggataaa gaagagctgc ctgtctcttt tgatgtccac cgtgaggcag 541 ggactgtgag tctcctcatt cttagccagt tgacatcctg aaaccctgag aatcttcaga 601 gatttgactt ggtcttcatt tcttaaatcc aaatcaataa tagtgatctc aaatcaagtg 661 agggctttca aggctggctt ctgaagaatt ccttttggcc tgtttctgta gccagtgacc 721 aagagagtct gctgtgagct ggcattgggc taggccttgt atctatgtga tgtttgtgtg 781 cagttagaaa actgaagtta ataaatttgc caaggtcaca cttg // LOCUS CHKFRA2A1 360 bp ds-DNA VRT 01-AUG-1990 DEFINITION Chicken fra-2 oncogene gene, exon 1. ACCESSION D90104 KEYWORDS fos-related gene; fra-2 gene; oncogene. SEGMENT 1 of 4 SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 360) AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H. TITLE Isolation and characterization of fra-2, and additional member of the fos gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 241 + 339 fra-2 protein, exon 1 pre-msg < 1 > 360 fra-2 mRNA and introns IVS 340 > 360 fra-2 intron A BASE COUNT 39 a 123 c 113 g 85 t ORIGIN 1 tgtttttttg gttgtttttt ttttttgtcg gctttccgct ttttcttttt ttcttttttt 61 tccctttttc tatttttccc ccccttcttc ttctcccgct gcggactctc ccccggctgc 121 gggaggcgcg aggcagagcc cgagaggtcg gcacggagca gggggcgggg agacggcgag 181 ggagcggcgg ccgcggcgcg ggaaggcggg gacgcggctc ccccgggccg gcctcggacc 241 atgtaccagg actatcccgg gagcttcgac acctcctcca gaggcagcag cggctccccg 301 ggacaccccg agccctactc cgccggcgca gcccagcagg tagggccgcc tccgccccgt // LOCUS CHKFRA2A2 297 bp ds-DNA VRT 01-AUG-1990 DEFINITION Chicken fra-2 oncogene gene, exon 2. ACCESSION D90105 KEYWORDS fos-related gene; fra-2 gene; oncogene. SEGMENT 2 of 4 SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 297) AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H. TITLE Isolation and characterization of fra-2, and additional member of the fos gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 22 + 276 fra-2 protein, exon 2 pre-msg < 1 > 297 fra-2 mRNA and introns IVS < 1 21 fra-2 intron A IVS 277 > 297 fra-2 intron B BASE COUNT 68 a 103 c 74 g 52 t ORIGIN About 5 kbp after segment 1. 1 ctcccccacc tttcctccta gaaattccga gtagatatgc caggatcagg cagtgctttt 61 attcccacga tcaacgccat cacaaccagc caagacctgc agtggatggt gcagcccacc 121 gtcatcacct ccatgtccag cccgtactct cgctcgcacc cctacagcca cccactgccg 181 ccgctgtcct cggtggctgg acacacggcc cttcagcgac cgggcgtgat caaaaccatc 241 ggcaccacag tgggacggag acgaagggat gagcaggtaa ctgtgtgagc aggagga // LOCUS CHKFRA2A3 149 bp ds-DNA VRT 01-AUG-1990 DEFINITION Chicken fra-2 oncogene gene, exon 3. ACCESSION D90106 KEYWORDS fos-related gene; fra-2 gene; oncogene. SEGMENT 3 of 4 SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 149) AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H. TITLE Isolation and characterization of fra-2, and additional member of the fos gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 22 + 129 fra-2 protein, exon 3 pre-msg < 1 > 149 fra-2 mRNA and introns IVS < 1 21 fra-2 intron B IVS 130 > 149 fra-2 intron C BASE COUNT 42 a 34 c 46 g 27 t ORIGIN About 4 kbp after segment 2. 1 tttcttggca cttgcccata gctgtcgcct gaggaagaag agaagcgaag gatccggaga 61 gagaggaaca agctggcagc tgctaaatgt cgtaacaggc gccgagagct aacagagaaa 121 ctccaggcgg tacgtgctct gcatgcatt // LOCUS CHKFRA2A4 744 bp ds-DNA VRT 01-AUG-1990 DEFINITION Chicken fra-2 oncogene gene, exon 4. ACCESSION D90107 KEYWORDS fos-related gene; fra-2 gene; oncogene. SEGMENT 4 of 4 SOURCE Chicken embryo fibroblasts DNA, clones lambda-OO[1,2,3,4]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 744) AUTHORS Nishina,H., Sato,H., Suzuki,T. and Iba,H. TITLE Isolation and characterization of fra-2, and additional member of the fos gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3619-3623 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 22 531 fra-2 protein, exon 4 pre-msg < 1 > 744 fra-2 mRNA and introns IVS < 1 21 fra-2 intron C BASE COUNT 180 a 206 c 209 g 149 t ORIGIN About 2 kbp after segment 3. 1 ttattccctt tttgtctgca ggaaactgag gtgctggagg aggaaaagtc agtgcttcaa 61 aaagagattg ctgagctcca gaaggagaag gagaaactag agttcatgct ggttgctcac 121 agccctgtgt gtaaaatcag ccctgaggaa cgtcggagcc caccaaccag cagcctccag 181 agcgttcgga ctggagcgag cggagcagtg gtggtgaagc aggagcctgt ggaggaagag 241 atcccatctt cctctttggt ccttgacaaa gctcagaggt ctgtcattaa gcccatcagc 301 attgctggag gttattatgg ggaggaggca ctcaacactc ccatcgtggt gacctcgaca 361 ccagccatca ctcctggttc ctccaacttg gtgttcacct accccaatgt cttggatcag 421 gagtctcctc tctccccgtc cgagtcctgc tccaaagctc accggaggag cagcagcagc 481 ggcgaccagt cctcggattc cttgaactct cccaccttgc tggcattgta atcccctgag 541 gcccccccat tgccagtgtg ttacatcccc cgcccggctc catggggaga cccctccatg 601 ggattagaga caggcacagg atcgttcaag cacaagggca gcaagaacaa gaatggggaa 661 atgctgcagc tccaggaaag agagtgagga ccaatgccag ctccctggag gcaggaaatg 721 gcaagggtgg gactgatgca ccag // LOCUS ECOTGP 7335 bp ds-DNA BCT 01-AUG-1990 DEFINITION E.coli tryptophan operon: entire DNA sequence. ACCESSION J01714 M12471 M12472 M25593 KEYWORDS anthranilate isomerase; anthranilate synthetase; attenuator; glutamine amidotransferase; isomerase; leader peptide; phosphoribosyl anthranilate synthetase; synthetase; transferase; trp operon; trpA gene; trpB gene; trpC gene; trpD gene; trpE gene; tryptophan synthetase. SOURCE Escherichia coli RNA and DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 5917 to 6133) AUTHORS Platt,T. and Yanofsky,C. TITLE An intercistronic region and ribosome-binding site in bacterial messenger RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 72, 2399-2403 (1975) STANDARD full staff_review REFERENCE 2 (bases 84 to 141) AUTHORS Bennett,G.N., Schweingruber,M.E., Brown,K.D., Squires,C. and Yanofsky,C. TITLE Nucleotide sequence of region preceding trp mRNA initiation site and its role in promoter and operator function JOURNAL Proc. Natl. Acad. Sci. U.S.A. 73, 2351-2355 (1976) STANDARD full staff_review REFERENCE 3 (bases 117 to 310) AUTHORS Squires,C., Lee,F., Bertrand,K., Squires,C.L., Bronson,M.J. and Yanofsky,C. TITLE Nucleotide sequence of the 5' end of tryptophan messenger RNA of Escherichia coli JOURNAL J. Mol. Biol. 103, 351-381 (1976) STANDARD full staff_review REFERENCE 4 (bases 230 to 272) AUTHORS Bertrand,K., Korn,L.J., Lee,F. and Yanofsky,C. TITLE The attenuator of the tryptophan operon of Escherichia coli: heterogeneous 3'-OH termini in vivo and deletion mapping of functions JOURNAL J. Mol. Biol. 117, 227-247 (1977) STANDARD full staff_review REFERENCE 5 (bases 230 to 272) AUTHORS Stauffer,G.V., Zurawski,G. and Yanofsky,C. TITLE Single base-pair alterations in the Escherichia coli trp operon leader region that relieve transcription termination at the trp attenuator JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 4833-4837 (1978) STANDARD full staff_review REFERENCE 6 (bases 6707 to 6863) AUTHORS Wu,A.M. and Platt,T. TITLE Transcription termination: nucleotide sequence at 3' end of tryptophan operon in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 5442-5446 (1978) STANDARD full staff_review REFERENCE 7 (bases 1 to 140) AUTHORS Bennett,G.N., Schweingruber,M.E., Brown,K.D., Squires,C. and Yanofsky,C. TITLE Nucleotide sequence of the promoter-operator region of the tryptophan operon of Escherichia coli JOURNAL J. Mol. Biol. 121, 113-137 (1978) STANDARD full staff_review REFERENCE 8 (bases 2351 to 2503) AUTHORS Miozzari,G.F. and Yanofsky,C. TITLE Gene fusion during the evolution of the tryptophan operon in enterobacteriaceae JOURNAL Nature 277, 486-489 (1979) STANDARD full staff_review REFERENCE 9 (bases 5932 to 6809) AUTHORS Nichols,B.P. and Yanofsky,C. TITLE Nucleotide sequences of trpA of Salmonella typhimurium JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 5244-5248 (1979) STANDARD full staff_review REFERENCE 10 (bases 117 to 256) AUTHORS Oxender,D.L., Zurawski,G. and Yanofsky,C. TITLE Attenuation in the Escherichia coli tryptophan operon: role of RNA secondary structure involving the tryptophan codon region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 5524-5528 (1979) STANDARD full staff_review REFERENCE 11 (bases 3422 to 4824) AUTHORS Christie,G.E. and Platt,T. TITLE Gene structure in the tryptophan operon of Escherichia coli: nucleotide sequence of trpC and the flanking intercistronic regions JOURNAL J. Mol. Biol. 142, 519-530 (1980) STANDARD full staff_review REFERENCE 12 (bases 230 to 296) AUTHORS Farnham,P.J. and Platt,T. TITLE A model for transcription termination suggested by studies on the trp attenuator in vitro using base analogs JOURNAL Cell 20, 739-748 (1980) STANDARD full staff_review REFERENCE 13 (bases 4810 to 6003) AUTHORS Crawford,I.P., Nichols,B.P. and Yanofsky,C. TITLE Nucleotide sequence of the trpB gene in Escherichia coli and Salmonella typhimurium JOURNAL J. Mol. Biol. 142, 489-502 (1980) STANDARD full staff_review REFERENCE 14 (bases 1761 to 2443) AUTHORS Nichols,B.P., Miozzari,G.F., van Cleemput,M., Bennett,G.N. and Yanofsky,C. TITLE Nucleotide sequences of the trpG regions of Escherichia coli, Shigella dysenteriae, Salmonella typhimurium and Serratia marcescens JOURNAL J. Mol. Biol. 142, 503-517 (1980) STANDARD full staff_review REFERENCE 15 (bases 6707 to 7335) AUTHORS Wu,A.M., Chapman,A.B., Platt,T., Guarente,L.P. and Beckwith,J. TITLE Deletions of distal sequence affect termination of transcription at the end of the tryptophan operon in E. coli JOURNAL Cell 19, 829-836 (1980) STANDARD full staff_review REFERENCE 16 (bases 279 to 1843) AUTHORS Nichols,B.P., van Cleemput,M. and Yanofsky,C. TITLE Nucleotide sequence of Escherichia coli trpE: anthranilate synthetase component I contains no tryptophan residues JOURNAL J. Mol. Biol. 146, 45-54 (1981) STANDARD full staff_review REFERENCE 17 (bases 5932 to 6809) AUTHORS Schneider,W.P., Nichols,B.P. and Yanofsky,C. TITLE Procedure for production of hybrid genes and proteins and its use in assessing significance of amino acid differences in homologous tryptophan synthetase alpha polypeptides JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 2169-2173 (1981) STANDARD full staff_review REFERENCE 18 (bases 6807 to 6856; 7057 to 7119) AUTHORS Wu,A.M., Christie,G.E. and Platt,T. TITLE Tandem termination sites in the tryptophan operon of Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 2913-2917 (1981) STANDARD full staff_review REFERENCE 19 (review; bases 77 to 6809; compiled) AUTHORS Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E., Horowitz,H., van Cleemput,M. and Wu,A.M. TITLE The complete nucleotide sequence of the tryptophan operon of Escherichia coli JOURNAL Nucleic Acids Res. 9, 6647-6668 (1981) STANDARD full staff_review REFERENCE 20 (bases 2504 to 3436) AUTHORS Horowitz,H., Christie,G.E. and Platt,T. TITLE Nucleotide sequence of the trpD gene, encoding anthranilate synthetase component II of Escherichia coli JOURNAL J. Mol. Biol. 156, 245-256 (1982) STANDARD full staff_review REFERENCE 21 (bases 57 to 137) AUTHORS Windass,J.D., Newton,C.R., De Maeyer-Guignard,J., Moore,V.E., Markham,A.F. and Edge,M.D. TITLE The construction of a synthetic Escherichia coli trp promoter and its use in the expression of a synthetic interferon gene JOURNAL Nucleic Acids Res. 10, 6639-6657 (1982) STANDARD full staff_review REFERENCE 22 (sites; mutational analysis of the regulatory region) AUTHORS Kolter,R. and Yanofsky,C. TITLE Genetic analysis of the tryptophan operon regulatory region using site-directed mutagenesis JOURNAL J. Mol. Biol. 175, 299-312 (1984) STANDARD full staff_entry REFERENCE 23 (bases 36 to 136) AUTHORS Brown,K.D., Bennet,G.N., Lee,F., Schweingruber,M.E. and Yanofsky,C. TITLE RNA polymerase interaction at the promoter-operator region of the tryptophan operon of Escherichia coli and Salmonella typhimurium JOURNAL J. Mol. Biol. 121, 153-177 (1978) STANDARD simple staff_entry COMMENT The tryptophan operon of E.coli consists of a repressor(trpR), a promoter(trpP), an operator(trpO), an attenuator which is part of a leader peptide region(trpL) and five structural genes: trpE(anthranilate synthetase), trpD(glutamine amido transferase and anthranilate 5-phosphoribosylpyrophosphate phosphoribosyl- transferase), trpC(phosphoribosyl anthranilate isomerase-indole glycerol phosphate synthetase), trpB(tryptophan synthetase beta) and trpA(tryptophan synthetase alpha). The promoter region covers approximately 40 bases upstream from the mRNA initiation site(75-116); the operator approximately 20 bases upstream with two-fold axes of symmetry around 104-105 and 109-110([2],[7],[20]). The attenuator region is the first 140 nucleotides(117-256) of the mRNA leader, a G-C rich region with a two-fold axis of symmetry around base 240 and an A-T rich region with its axis about bases 259-260; it provides a second site for control of transcription ([4],[5],[10],[12]). Two mRNA termination regions are reported: trpT (bases 6807-6856) and trpT' (bases 7057-7119), the first of which bears some similarity to the attenuator region ([18]). A chi site for recombination is localized between bases 2492 and 2501 and the trp-P2 promoter is located between bases 3240 and 3280 ([20]). The trpE gene is unusual in that it codes for no tryptophan residues([16]). The two enzymatic functions coded by trpG and trpD genes in S.marcescens are coded by the single trpD gene in E.coli and other enterobacteriaceae. This appears to have occurred via base changes at sites 2420 and 2438. The intercistronic regions for the structural genes show little superfluity: the trpE-trpD and trpB-trpA boundaries consist of 'tgatg'; the trpD-trpC boundary is 'taaatgatg' and the trpC-trpB boundary is 'taaggaaaggaacaatg'. All the cistrons show a high degree of homology with their correlates among the enterobacteriaceae. Sequence discrepancies in early work([3]) are corrected in later work from the same laboratory([10],[19]). [17] also sequenced S.typhimurium trpA region. [19] compiles sequences from [7],[8],[9],[11],[13],[14],[16],[20]. FEATURES from to/span description pept 143 187 trp operon leader peptide (putative) pept 279 1841 anthranilate synthetase component I /nomgen="trpE" pept 1841 3436 anthranilate synthetase component II: glutamine amidotransferase and phosphoribosyl anthranilate synthetase /nomgen="trpD" pept 3440 4798 anthranilate isomerase /nomgen="trpC" pept 4810 6003 tryptophan synthetase beta subunit /nomgen="trpB" pept 6003 6809 tryptophan synthetase alpha subunit /nomgen="trpA" mRNA 117 257 trp mRNA (alt.) [2],[3],[7],[10],[21] mRNA 117 6842 trp mRNA (alt.) [2],[3],[6],[7],[10],[18],[21] used revision 1787 1787 c in [16]; t in [14] revision 1793 1793 t in [16]; c in [14] conflict 3526 3530 gg in [19]; gaatg in [11] conflict 4289 4293 gc in [19]; gttgc in [11] conflict 5949 5949 c in [1]; a in [17] BASE COUNT 1740 a 1926 c 1960 g 1705 t 4 others ORIGIN 9 bp upstream from HhaI site [7]. 1 ctcaaggcgc actcccgttc tggataatgt tttttgcgcc gacatcataa cggttctggc 61 aaatattctg aaatgagctg ttgacaatta atcatcgaac tagttaacta gtacgcaagt 121 tcacgtaaaa agggtatcga caatgaaagc aattttcgta ctgaaaggtt ggtggcgcac 181 ttcctgaaac gggcagtgta ttcaccatgc gtaaagcaat cagataccca gcccgcctaa 241 tgagcgggct tttttttgaa caaaattaga gaataacaat gcaaacacaa aaaccgactc 301 tcgaactgct aacctgcgaa ggcgcttatc gcgacaatcc caccgcgctt tttcaccagt 361 tgtgtgggga tcgtccggca acgctgctgc tggaatccgc agatatcgac agcaaagatg 421 atttaaaaag cctgctgctg gtagacagtg cgctgcgcat tacagcttta ggtgacactg 481 tcacaatcca ggcactttcc ggcaacggcg aagccctcct ggcactactg gataacgccc 541 tgcctgcggg tgtggaaagt gaacaatcac caaactgccg tgtgctgcgc ttcccccctg 601 tcagtccact gctggatgaa gacgcccgct tatgctccct ttcggttttt gacgctttcc 661 gtttattgca gaatctgttg aatgtaccga aggaagaacg agaagccatg ttcttcagcg 721 gcctgttctc ttatgacctt gtggcgggat ttgaagattt accgcaactg tcagcggaaa 781 ataactgccc tgatttctgt ttttatctcg ctgaaacgct gatggtgatt gaccatcaga 841 aaaaaagcac ccgtattcag gccagcctgt ttgctccgaa tgaagaagaa aaacaacgtc 901 tcactgctcg cctgaacgaa ctacgtcagc aactgaccga agccgcgccg ccgctgccag 961 tggtttccgt gccgcatatg cgttgtgaat gtaatcagag cgatgaagag ttcggtggcg 1021 tagtgcgttt gttgcaaaaa gcgattcgcg ctggagaaat tttccaggtg gtgccatctc 1081 gccgtttctc tctgccctgc ccgtcaccgc tggcggccta ttacgtgctg aaaaagagta 1141 atcccagccc gtacatgttt tttatgcagg ataatgattt caccctattt ggcgcgtcgc 1201 cggaaagctc gctcaagtat gatgccacca gccgccagat tgagatctac ccgattgccg 1261 gaacacgccc acgcggtcgt cgcgccgatg gttcactgga cagagatctc gacagccgta 1321 ttgaactgga aatgcgtacc gatcataaag agctgtctga acatctgatg ctggttgatc 1381 tcgcccgtaa tgatctggca cgcatttgca cccccggcag ccgctacgtc gccgatctca 1441 ccaaagttga ccgttattcc tatgtgatgc acctcgtctc tcgcgtagtc ggcgaactgc 1501 gtcacgatct tgacgccctg cacgcttatc gcgcctgtat gaatatgggg acgttaagcg 1561 gtgcgccgaa agtacgcgct atgcagttaa ttgccgaggc ggaaggtcgt cgccgcggca 1621 gctacggcgg cgcggtaggt tatttcaccg cgcatggcga tctcgacacc tgcattgtga 1681 tccgctcggc gctggtggaa aacggtatcg ccaccgtgca agcgggtgct ggtgtagtcc 1741 ttgattctgt tccgcagtcg gaagccgacg aaacccgtaa caaagcccgc gctgtactgc 1801 gcgctattgc caccgcgcat catgcacagg agactttctg atggctgaca ttctgctgct 1861 cgataatatc gactctttta cgtacaacct ggcagatcag ttgcgcagca atgggcataa 1921 cgtggtgatt taccgcaacc atataccggc gcaaacctta attgaacgct tggcgaccat 1981 gagtaatccg gtgctgatgc tttctcctgg ccccggtgtg ccgagcgaag ccggttgtat 2041 gccggaactc ctcacccgct tgcgtggcaa gctgcccatt attggcattt gcctcggaca 2101 tcaggcgatt gtcgaagctt acgggggcta tgtcggtcag gcgggcgaaa ttctccacgg 2161 taaagcctcc agcattgaac atgacggtca ggcgatgttt gccggattaa caaacccgct 2221 gccggtggcg cgttatcact cgctggttgg cagtaacatt ccggccggtt taaccatcaa 2281 cgcccatttt aatggcatgg tgatggcagt acgtcacgat gcggatcgcg tttgtggatt 2341 ccagttccat ccggaatcca ttctcaccac ccagggcgct cgcctgctgg aacaaacgct 2401 ggcctgggcg cagcataaac tagagccagc caacacgctg caaccgattc tggaaaaact 2461 gtatcaggcg cagacgctta gccaacaaga aagccaccag ctgttttcag cggtggtgcg 2521 tggcgagctg aagccggaac aactggcggc ggcgctggtg agcatgaaaa ttcgcggtga 2581 gcacccgaac gagatcgccg gggcagcaac cgcgctactg gaaaacgcag cgccgttccc 2641 gcgcccggat tatctgtttg ctgatatcgt cggtactggc ggtgacggca gcaacagtat 2701 caatatttct accgccagtg cgtttgtcgc cgcggcctgt gggctgaaag tggcgaaaca 2761 cggcaaccgt agcgtctcca gtaaatctgg ttcgtccgat ctgctggcgg cgttcggtat 2821 taatcttgat atgaacgccg ataaatcgcg ccaggcgctg gatgagttag gtgtatgttt 2881 cctctttgcg ccgaagtatc acaccggatt ccgccacgcg atgccggttc gccagcaact 2941 gaaaacccgc accctgttca atgtgctggg gccattgatt aacccggcgc atccgccgct 3001 ggcgttaatt ggtgtttata gtccggaact ggtgctgccg attgccgaaa ccttgcgcgt 3061 gctggggtat caacgcgcgg cggtggtgca cagcggcggg atggatgaag tttcattaca 3121 cgcgccgaca atcgttgccg aactgcatga cggcgaaatt aaaagctatc agctcaccgc 3181 agaagacttt ggcctgacac cctaccacca ggagcaactg gcaggcggaa caccggaaga 3241 aaaccgtgac attttaacac gtttgttaca aggtaaaggc gacgccgccc atgaagcagc 3301 cgtcgctgcg aacgtcgcca tgttaatgcg cctgcatggc catgaagatc tgcaagccaa 3361 tgcgcaaacc gttcttgagg tactgcgcag tggttccgct tacgacagag tcaccgcact 3421 ggcggcacga gggtaaatga tgcaaaccgt tttagcgaaa atcgtcgcag acaaggcgat 3481 ttgggtagaa gcccgcaaac agcagcaacc gctggccagt tttcagaatg aggttcagcc 3541 gagcacgcga catttttatg atgcgctaca gggtgcgcgc acggcgttta ttctggagtg 3601 caagaaagcg tcgccgtcaa aaggcgtgat ccgtgatgat ttcgatccag cacgcattgc 3661 cgccatttat aaacattacg cttcggcaat ttcggtgctg actgatgaga aatatttcag 3721 gggtagcttt aatttcctcc ccatcgtcag ccaaatcgcc ccgcagccga ttttatgtaa 3781 agacttcatt atcgaccctt accagatcta tctggcgcgc tattaccagg ccgatgcctg 3841 cttattaatg ctttcagtac tggatgacga ccaatatcgc cagcttgccg ccgtcgctca 3901 cagtctggag atgggggtgc tgaccgaagt cagtaatgaa gaggaacagg agcgcgccat 3961 tgcattggga gcaaaggtcg ttggcatcaa caaccgcgat ctgcgtgatt tgtcgattga 4021 tctcaaccgt acccgcgagc ttgcgccgaa actggggcac aacgtgacgg taatcagcga 4081 atccggcatc aatacttacg ctcaggtgcg cgagttaagc cacttcgcta acggttttct 4141 gattggttcg gcgttgatgg cccatgacga tttgcacgcc gccgtgcgcc gggtgttgct 4201 gggtgagaat aaagtatgtg gcctgacgcg tgggcaagat gctaaagcag cttatgacgc 4261 gggcgcgatt tacggtgggt tgatttttgt tgcgacatca ccgcgttgcg tcaacgttga 4321 acaggcgcag gaagtgatgg ctgcggcacc gttgcagtat gttggcgtgt tccgcaatca 4381 cgatattgcc gatgtggtgg acaaagctaa ggtgttatcg ctggtggcag tgcaactgca 4441 tggtaatgaa gaacagctgt atatcgatac gctgcgtgaa gctctgccag cacatgttgc 4501 catctggaaa gcattaagcg tcggtgaaac cctgcccgcc cgcgagtttc agcacgttga 4561 taaatatgtt ttagacaacg gccagggtgg aagcgggcaa cgttttgact ggtcactatt 4621 aaatggtcaa acgcttggca acgttctgct ggcggggggc ttaggcgcag ataactgcgt 4681 ggaagcggca caaaccggct gcgccggact tgattttaat tctgctgtag agtcgcaacc 4741 gggcatcaaa gacgcacgtc ttttggcctc ggttttccag acgctgcgcg catattaagg 4801 aaaggaacaa tgacaacatt acttaacccc tattttggtg agtttggcgg catgtacgtg 4861 ccacaaatcc tgatgcctgc tctgcgccag ctggaagaag cttttgtcag tgcgcaaaaa 4921 gatcctgaat ttcaggctca gttcaacgac ctgctgaaaa actatgccgg gcgtccaacc 4981 gcgctgacca aatgccagaa cattacagcc gggacgaaca ccacgctgta tctcaagcgt 5041 gaagatttgc tgcacggcgg cgcgcataaa actaaccagg tgctggggca ggcgttgctg 5101 gcgaagcgga tgggtaaaac cgaaatcatc gccgaaaccg gtgccggtca gcatggcgtg 5161 gcgtcggccc tggccagcgc cctgctcggc ctgaaatgcc gtatttatat gggtgccaaa 5221 gacgttgaac gccagtcgcc taacgttttt cgtatgcgct taatgggtgc ggaagtgatc 5281 ccggtgcata gcggttccgc gacgctgaaa gatgcctgta acgaggcgct gcgcgactgg 5341 tccggtagtt acgaaaccgc gcactatatg ctgggcaccg cagctggccc gcatccttat 5401 ccgaccattg tgcgtgagtt tcagcggatg attggcgaag aaaccaaagc gcagattctg 5461 gaaagagaag gtcgcctgcc ggatgccgtt atcgcctgtg ttggcggcgg ttcgaatgcc 5521 atcggcatgt ttgctgattt catcaatgaa accaacgtcg gcctgattgg tgtggagcca 5581 ggtggtcacg gtatcgaaac tggcgagcac ggcgcaccgc taaaacatgg tcgcgtgggt 5641 atctatttcg gtatgaaagc gccgatgatg caaaccgaag acgggcagat tgaagaatct 5701 tactccatct ccgccggact ggatttcccg tctgtcggcc cacaacacgc gtatcttaac 5761 agcactggac gcgctgatta cgtgtctatt accgatgatg aagcccttga agccttcaaa 5821 acgctgtgcc tgcacgaagg gatcatcccg gcgctggaat cctcccacgc cttggcccat 5881 gcgttgaaaa tgatgcgcga aaacccggat aaagagcagc tactggtggt taacctttcc 5941 ggtcgcggcg ataaagacat cttcaccgtt cacgatattt tgaaagcacg aggggaaatc 6001 tgatggaacg ctacgaatct ctgtttgccc agttgaagga gcgcaaagaa ggcgcattcg 6061 ttcctttcgt cacgctcggt gatccgggca ttgagcagtc attgaaaatt atcgatacgc 6121 taattgaagc cggtgctgac gcgctggagt taggtatccc cttctccgac ccactggcgg 6181 atggcccgac gattcaaaac gccactctgc gcgcctttgc ggcaggtgtg actccggcac 6241 aatgttttga aatgctggca ctgattcgcc agaaacaccc gaccattccc attggcctgt 6301 tgatgtatgc caatctggtg tttaacaaag gcattgatga gttttatgcc cagtgcgaaa 6361 aagtcggcgt cgattcggtg ctggttgccg atgtgccagt tgaagagtcc gcgcccttcc 6421 gccaggccgc gttgcgtcac aacgtcgcac ctatcttcat ctgcccgcca aatgccgatg 6481 acgacctgct gcgccagata gcctcttacg gtcgtggtta cacctatttg ctgtcacgag 6541 caggcgtgac cggcgcagaa aaccgcgccg cgttacccct caatcatctg gttgcgaagc 6601 tgaaagagta caacgctgca cctccattgc agggatttgg tatttccgcc ccggatcagg 6661 taaaagcagc gattgatgca ggagctgcgg gcgcgatttc tggttcggcc attgttaaaa 6721 tcatcgagca acatattaat gagccagaga aaatgctggc ggcactgaaa gtttttgtac 6781 aaccgatgaa agcggcgacg cgcagttaat cccacagccg ccagttccgc tggcggcatt 6841 ttaactttct ttaatgaagc cggaaaaatc ctaaattcat ttaatattta tctttttacc 6901 gtttcgctta ccccggtcga tcgtyractt acgtcatttt tccgcccaac agtaatataa 6961 acaaacaaat taaacccgca acataacacc agtaaaatca ataattttct ctaagtcact 7021 tattcctcag gtaattctta atatatccag aatgttcctc aaaatatatt ttccctctat 7081 cttctcgttg cgcttaattt gactaattct cattagcgac taattttaat gagtgtcgac 7141 acacaacact catattaatg aaacaatgca acgcaacggg agaaataaca tggccgaaca 7201 tcgtggtggt tcaggaaatt tcgccgaaga ccgtgagaag gcatccgacg cagccgtaaa 7261 ggcggtcagc atagcggcgg taattttaaa aatgatcgca acgcgcatct gaagcgggta 7321 aaaaaggcgg tyrac // LOCUS HUMGSTH 808 bp ss-mRNA PRI 01-AUG-1990 DEFINITION Human glutathione S-transferase (GST) a-subunit mRNA, complete cds. ACCESSION M14777 KEYWORDS GSH S-transferase; glutathione S-transferase. SOURCE Human liver, cDNA to mRNA, clone pGTH1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 808) AUTHORS Tu,C.-P.D. and Qian,B. TITLE Human liver glutathione S-transferases: Complete primary sequence of an H-a subunit cDNA JOURNAL Biochem. Biophys. Res. Commun. 141, 229-237 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 67 735 glutathione S-transferase (GST, EC 2.5.1.18) /hgml_locus_uid="LL0130R" /nomgen="GST2" /map="6p12.2" mRNA < 1 808 GST mRNA signal 792 797 polyA signal BASE COUNT 252 a 175 c 192 g 189 t ORIGIN Chromosome 6p12.2. 1 agttgtcgag ccaggacggt gacagcgttt aacaaagctt agagaaacct ccaggagact 61 gctatcatgg cagagaagcc caagctccac tacttcaatg cacggggcag aatggagtcc 121 acccggtggc tcctggctgc agctggagta gagtttgaag agaaatttat aaaatctgca 181 gaagatttgg acaagttaag aaatgatgga tatttgatgt tccagcaagt gccaatggtt 241 gagattgatg ggatgaagct ggtgcagacc agagccattc tcaactacat tgccagcaaa 301 tacaacctct atgggaaaga cataaaggag agagccctga ttgatatgta tatagaaggt 361 atagcagatt tgggtgaaat gatcctcctt ctgcccgtat gtccacctga ggaaaaagat 421 gccaagcttg ccttgatcaa ggagaaaata aaaaatcgct acttccctgc ctttgaaaaa 481 gtcttaaaga gccatggaca agactacctt gttggcaaca agctgagccg ggctgacatt 541 catctggtgg aacttctcta ctacgtcgag gagcttgact ccagtcttat ctccagcttc 601 cctctgctga aggccctgaa aaccagaatc agcaacctgc ccacagtgaa gaagtttcta 661 cagcctggca gcccaaggaa gcctcccatg gatgagaaat ctttagaaga agcaaggaag 721 attttcaggt tttaataacg cagtcatgga ggccaagaac ttgcaatacc aatgttctaa 781 agttttgcaa caataaagta ctttacct // LOCUS MUSIGKACY 321 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse lysozyme-binding Ig kappa chain (HyHEL-10) V23-J2 region mRNA, partial cds. ACCESSION M35667 KEYWORDS immunoglobulin light-chain; kappa-immunoglobulin; processed gene; variable region VK23. SOURCE Mouse hybridoma, cDNA to mRNA, clone 10K-106. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Mainhart,Smith-Gill-S.J., Lavoie,C., Feldman,T.B., Drohan,R.J. and Brooks,W.B.R. TITLE A three-dimensional model of an anti-lysozyme antibody JOURNAL J. Mol. Biol. 194, 713-724 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 321 lysozyme binding Ig kappa chain V23-J2 region (AA at 1) recomb 285 286 V23 region end/J2 region start BASE COUNT 88 a 80 c 75 g 78 t ORIGIN 1 gatattgtgc taactcagtc tccagccacc ctgtctgtga ctccaggaaa tagcgtcagt 61 ctttcctgca gggccagcca aagtattggc aacaacctac actggtatca acaaaaatca 121 catgagtctc caaggcttct catcaagtat gcttcccagt ccatctctgg gatcccctcc 181 aggttcagtg gcagtggatc agggacagat ttcactctca gtatcaacag tgtggagact 241 gaagattttg gaatgtattt ctgtcaacag agtaacagct ggccgtacac gttcggaggg 301 gggaccaagc tggaaataaa a // LOCUS MUSLTAGBSA 237 bp ds-DNA ROD 01-AUG-1990 DEFINITION Mouse SV40 transformed large T-antigen binding site DNA. ACCESSION M35500 KEYWORDS large T antigen. SOURCE Mouse (strain BALB/c) SV40 transformed cell line SVA31E7 DNA, clone p27. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 237) AUTHORS Lane,D.P., Simanis,V., Bartsch,R., Yewdell,J., Gannon,J. and Mole,S. TITLE Cellular targets for SV40 large T-antigen JOURNAL Proc. R. Soc. Lond., B, Biol. Sci. 226, 25-42 (1985) STANDARD simple staff_entry FEATURES from to/span description binding 84 118 large T-antigen binding site BASE COUNT 67 a 65 c 60 g 45 t ORIGIN 1 ggatccatcc cataatcagc ctctaaacgc tgacaccatt gcatacacta gcaagatttt 61 gctgaaagaa ccctgatata gctgtctctt gtgaggctat gccggggcct agcaaacaca 121 gaagtggatg ctcacagtca gctagtggat cacagggccc ccaatggagg agctagagaa 181 agtacccaag gagctaaagg gatcctctac gccggacgca tcgtggccag tcaccgc // LOCUS PEAIVSS 350 bp ds-DNA PLN 01-AUG-1990 DEFINITION Pea legumin J gene, exons 1 and 2 (partial). ACCESSION M26771 KEYWORDS legumin. SOURCE Pea DNA, clone pSP65LegJi. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 350) AUTHORS Brown,J.W.S., Feix,G. and Frendewey,D. TITLE Accurate in vitro splicing of two pre-mRNA plant introns in a HeLa cell nuclear extract JOURNAL EMBO J. 5, 2749-2758 (1986) STANDARD simple staff_entry FEATURES from to/span description pept < 1 48 legumin J, exon 1 (AA at 3) 50 144 legumin J, exon 2 pre-msg < 1 > 350 legumin J mRNA and introns IVS 49 186 legumin intron BASE COUNT 117 a 74 c 71 g 88 t ORIGIN 1 gaatacacgg aattcgagct cgcccgggga tcccattcaa ccccaagagt aagtaatagt 61 gtatccatac attacattat ctcttataaa ttgttcatac agcatgctca ttcgattata 121 actttaaaag tttctaatgt ataatttgtt atactaaatc aatcacacgt aaatatgtgt 181 atgcaggtat tttaccttgg tgggaaccca gaaacagagt tccccgaaac acaggaggaa 241 caacaaggaa ggcatcggca aaagcatagt taccctgttg gacgtaggag tggacatcac 301 caacaagaag aggaatggga tcctctagag tcgacctgca gcccaagctt // LOCUS RATCGM1AA 3190 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) mRNA, complete cds. ACCESSION M32474 J05417 KEYWORDS carcinoembryonic antigen-related protein. SOURCE R.norvegicus (strain Sprague-Dawley) placenta day 18 of gestation, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3190) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analysis imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 122 2251 carcinoembryonic antigen-related protein precursor (CGM1) sigp 122 220 carcinoembryonic antigen-related protein signal peptide matp 221 2248 CGM1 protein BASE COUNT 871 a 798 c 693 g 828 t ORIGIN 1 gggaagtgct cctccttgag aggacaccta gctcaagagg aggaaagaca ataacagtta 61 ggtgccttgc tggaacgaaa gctcctctcc taagagtgag gccattctag tgagaagaca 121 gatggagctg tcctctgtgc ttccctgcaa gaggtgtact ccctggcggg ggctcctgct 181 cacagcctcc ctcttaacct gctggctcct gcccaccact gcccaagtct ccattgaatc 241 cttaccaccc caggtggttg aaggagaaaa tgttcttcta catgttgaca atttgccaga 301 gaatctcata gcctttgtct ggtacaaagg gctgacaaac atgagcctcg gagttgcact 361 gtattcacta acctataacg taactgtgac gggacctgtg cacagtggta gagagacatt 421 gtacagcaat gggtccctgt ggatccaaaa tgtcacccag aaggacacag gattctacac 481 cctacgaacc ataagtaatc atggagaaat tgtatcaaat acatccctgc accttcatgt 541 gtacttctcc actttgacct gtggacgcgc tgccacctct gctcagctca gtattgaatc 601 agtgccgacc agcatctcta aaggagaaag cgctcttctc cttgctcaca atctcccaga 661 gaatctccga gccattttct ggtacaaggg ggcgattgtg ttcaaggacc ttgaggttgc 721 tcgatatgta ataggcacaa attcaagtgt gccggggcct gcccacagcg gcagagagac 781 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta 841 caccttaaaa actctgagta cagatctgaa aactgaaata gcctatgtgc aactccaggt 901 ggacacctgt tttatgagct atgctggccc tcccacttct gcccagctca ctgtcgaatc 961 agcgcctacc agcgttgctg aaggagcaag cgttcttctc cttgttcaca atctccctga 1021 gaatctccga gccattttct ggtataaagg ggtgattttg ttcaaggacc ttgaggttgc 1081 tcgatatgta ataggcacaa attcaagtgt gctggggcct gcccacagcg gcagagagac 1141 aatgtacagc aatggatccc tcctgcttca gaatgtcact cggaacgatg ctggattcta 1201 caccttaaga actctgagta cagatctgaa agctaaagta gtacatgtgc aactccaggt 1261 gaacacctcc tcgtgctgtg accctctcac tcctgcccta ctcacgatag acccagtgcc 1321 acggcatgcg gctaaagggg aaagtgttct tcttcaagtt cgcaatctgc cagaggatct 1381 gcgaatgttt atctggttca aatctgtgta cacctcccag atctttaaaa tagcagagta 1441 cagcagagcc attaattatg tcttcagggg ccctgcacac agcggaagag agacagtgta 1501 caccaacgga tccctgctgc tccaggatgc cactgagaaa gacacgggct tgtacacact 1561 acaaataata tacagaaatt tcaaaataga aacagcacac gttcaagtca gcgtgcacac 1621 ctgtgttcac ccttctacca ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt 1681 tgaaggggga gacgttctcc tacttgttca taatatgcca gagaaccttc aatccttttc 1741 ctggtacaaa ggcgtagcca ttgtcaacag acatgaaatc tctcggaaca taatagccag 1801 taatagaagc acgttggggc ctgctcacag tggcagagag acaatatatt ctaatggctc 1861 tcttctgctc cacaatgcca ccgaggagga caatggatta tacaccttat ggactgtaaa 1921 cagacattct gaaactcaag ggatacacgt gcacatccac atatacaagc ctgtggcaca 1981 gccctttatc cgagtcactg aatcctcagt cagagtgaag agctctgtgg tcctcacctg 2041 cctctcagct gacactggaa cctccatcca gtggctcttc aacaaccaga atctgcggct 2101 cacacagagg atgtcactgt cccagactaa gtgccaactc agcatagatc ccgtcaggag 2161 ggaggatgct ggagagtata ggtgtgaggt ctccaacccg gtcagttcga agacgagcct 2221 cccagtcagc ctggatgtga tcattgagtg accccccacc ttctctcatc ctacagcaga 2281 gtgggggaca tttctttatc aatgggtaca aaatggagca aaattatgtg gtgaaaattg 2341 tcagttgcta ctcaggtaca gtcagcatgt tgagtcatgt ctgtatccct aggataaaca 2401 tgtacaagga caagccagaa catagagact cagtttccaa aaaaaagaaa acatcaatac 2461 agtaaacagt attgtagtgg tgttaagagt taggttgtgg atcaaataca tagccaatcc 2521 tcagaatcca tgggaactaa tttcaggagc caccaatatt ctgtatgctc caagtcccct 2581 gttagcatgg tgcagtgact tcatagagat aaatgcatct tttgcatgct taagtatatt 2641 ctgtgtataa ctaattcaca tagtaccatt actgtctggg caccagttat ccatgtgaag 2701 aaaggacaag caacaggaga agggactgcc ctttcccagt ggacataact tgtgtctaaa 2761 tagtttgatc cacagttggg tgtaacattc atagcagaga cccaactctg gactctgtat 2821 atcctgacag tggcattcat aagattctta ttcctgtttt ttcttccttc cttccttcct 2881 tccttccttc cttacttctg aagggcatat atgggatttc ccattttgag tattttgaag 2941 tgggcaatta acatgaaaca cactcatatt gtcatgtgac caataaatgt tgtccattct 3001 caaagcattt tcaactcctc ccattctctc tagccccgtg taatcccatc tactggtgtt 3061 tctatgcatg tgacaaaaac aggatatcta attgcttttg gtcaatatta gtttacagag 3121 tacagctcag ctggatgtgt ttgctcacca gttccagaaa cttctgtaga ctctaggttt 3181 ttctccaaat // LOCUS RATCGM1AC1 2238 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exons 1 and 2. ACCESSION M32476 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 1 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2238) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene and analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept 704 767 carcinoembryonic antigen-related protein (CGM1) precursor, exon 1 1770 + 2129 carcinoembryonic antigen-related protein, exon 2 sigp 704 767 carcinoembryonic antigen-related protein (CGM1) signal peptide 1770 1804 carcinoembryonic antigen-related protein signal peptide matp 1805 + 2129 carcinoembryonic antigen-related protein pre-msg 512 > 2238 CGM1 mRNA and introns IVS 768 1769 CGM1 intron A IVS 2130 > 2238 CGM1 intron B BASE COUNT 615 a 532 c 539 g 552 t ORIGIN 1 ctcacccaac aacagctcag ccaacacata atattgaaag gtgctttgaa cccctccata 61 ggaagaagaa cagtctcttc caagacacac aggtcacctc ttcccaacat ccagcacatg 121 aaatttgtca cacaactgct ccaggacctc tctcctgggt cagaaacttg actggtgaca 181 ttagtgataa aggattaatc ttcatcccca ctcagtccct ttccaaccct cacagatatc 241 tgtcgccttc ctgctgggaa ataccacctt cccagaacac ggaagacaca gggcagactg 301 ggtgctcaac tgggtctctg tgtcacaggg acgcatgggt aggatggagg cttcctcttt 361 ggtgctgaca gattcaagac caggactcag cagatgtcct ggcatgagcc attgttctct 421 gagggcatgg ggatgtttgt cagcacagct cctcaaggtg ttgcctggag gagaagcaca 481 aagatagaaa agttgagacg gatgcagggt agcattgaga gtggaaggga cagagcagtg 541 ccttggacac agaccccgac caccccacaa tccacagatt ctgggaagtg ctcctccttg 601 agaggacacc tagctcaaga ggaggaaaga caataacagt taggtgcctt gctggaacga 661 aagctcctct cctaagagtg aggccattct agtgagaaga cagatggagc tgtcctctgt 721 gcttccctgc aagaggtgta ctccctggcg ggggctcctg ctcacaggta agggtgctta 781 ctccatggtt gtgtgtgggg tgggggaggc ccagagtctc ctgaaatgga cagaatcctt 841 agggaagatg tgtagtttct gtttgtaatc atgttataga aggtgcagtg agggaacagg 901 aagctctgag gcagacagga gctgaggagc agaatagaaa aggcctcagc tgcaattatt 961 caaattcagt cacagggtga atctccaaat agaaatcaaa catgggaggg cagtgagatg 1021 gctcagtgtg tggatacagg acagtctgaa ttcactcctc agctctcaca gcatagatgg 1081 acatacagac tcctgaaggc tcttctcttc cctccacact ggtgtgtgtc acgtacctgt 1141 agtgtgcaca ctgggacatg taccttccca aaccctcacg aacaatacag aaatattaaa 1201 ttacacttga atataattat ttttatgtgc tataaacatg gaaattatgt agacaaaccc 1261 agagatatct tttcttcctt ccttccttcc ttcttccttc cttccttcct tcctcttttt 1321 ccatactagt ttctgagatt ttttgaggaa ctgaaccttc caaaaagacc ataccaatcc 1381 ctgtcctcaa aaagcctttt ttattctaat ggactggaaa tcattgtatc cagaggagaa 1441 agtcaatgat ttagtggaac cataaataga acagaaaaca ttcaggaagt gaggattgta 1501 tggaggagga aaaagaggag gaggaggagg aagaggagga ggaggaggag gaggaccgag 1561 agccggttct ccactcacca gacactttat ggaaagagtg atatggggac acctgagtag 1621 aggattccac agagaggaaa tgacaccctt tgaggttctg agggcatgga ggtcatgctg 1681 ctcacctcca ttaagggtgc atcctaccta caggctgagg gatgctcaca cctgctcagg 1741 attgtcaact tttctctctt cccttctagc ctccctctta acctgctggc tcctgcccac 1801 cactgcccaa gtctccattg aatccttacc accccaggtg gttgaaggag aaaatgttct 1861 tctacgtgtt gacaatttgc cagagaatct catagccttt gtctggtaca aagggctgac 1921 aaacatgagc ctcggagttg cactgtattc actaacctat aacgtaactg tgacgggacc 1981 tgtgcacagt ggtagagaga cattgtacag caatgggtcc ctgtggatcc aaaatgtcac 2041 ccagaaggac acaggattct acaccctacg aaccataagt aatcatggag aaattgtatc 2101 aaatacatcc ctgcaccttc atgtgtactg taagtaattc tttgtgaatt ctgggttatg 2161 ggtggggtcc ttccactaga cacacagaag tgtcaggcct ggcttgtgct cccttccttc 2221 tgcattgatc tacatgtt // LOCUS RATCGM1AC2 539 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32477 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 2 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 539) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 539 carcinoembryonic antigen-related protein intron B BASE COUNT 126 a 127 c 138 g 148 t ORIGIN 1 ccctgattcc agacctctgt tacagactta tctcctcatg gccccgagaa tcatcttact 61 agggctggct ttgcctctct ctcagcagag accagtgctt ttgagtagtg aaagtatttt 121 gctatgtgta agcagacagt gcattgcaat gagagccatg ttggttaggt ctcctggatg 181 tccctagtga ctcagcaggg tgaggatagg cagcaggtgc ccagtccatc atctaactct 241 tctaatggtc ttaggaaact ttcaggaagg tcaggatccc taaagagagg gacagaggac 301 acaggtcctc ctgacaactt cttgtcttct ggggacagtt cagtgatttc tcctctgcgt 361 gcacaggctc tgctgatgtg gacaggtcct tgtgaggcaa gtggatctgt gtccccaggc 421 aaaaactgag aaggttgagt agattcagaa accctggtaa attttcatat ctgagaatgg 481 tagacctttg atctactctg gacctggttc ctgtcctgga gcatgtgacc atgacaccc // LOCUS RATCGM1AC3 828 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron B. ACCESSION M32478 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 3 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 828) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 828 carcinoembryonic antigen-related protein intron B BASE COUNT 193 a 255 c 190 g 190 t ORIGIN 1 aaaaagctgg attggctctc cctccaaccc ctgtgcctgt ctgccctgat gcactgggct 61 cactgaaggc cctcagacca gtccccactc accgagagtc ccaaaggtgt ctgaatgacc 121 aggaatttga gaaccccagc ttcagcccca gcccatgttg tttctcacct ggggccctca 181 ttttgcccca taatatagcc taatgcctcc catttcatct gcctgagctg tgttcacaaa 241 cccagttgta aggtggaaag gggatccaca attcctcaga aatgagctga agttcctata 301 agtgaccagg aggaggcagc atcaggaagt acaatgacta cttagggaag tattttctgt 361 accaggaacc caccttgtat cctggctttt atctctgttc ccatagacct ggaggtcatt 421 ggcacagctt ctcagacctc tcagctgctt cctgtatctg ctgccccacc aaggatcatg 481 ttcgcattcc tgacattcat tttctctggg aaagcaaggg tgtctatggg aagcacctag 541 acagaggttc aaggcatctc agaaaggcac gcagcacatg ggcagagcac ctcacagctc 601 aggacacaga ggaagtgtgc ccaccatctt gaatccctgc atgggacgat ggagcccaga 661 gcagtccttc caggactcag gtcacctcct cccacacact caggaagtga ggctcctgac 721 acagctgctc ctgggcccct tttctccctg agaatcctga ctggtgactg cagtgagaac 781 gcatctgtcc cctcccccac tcgtcacaca gctggcccct tgggatcc // LOCUS RATCGM1AC4 642 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 3. ACCESSION M32479 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 4 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 642) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 61 + 420 carcinoembryonic antigen-related protein (CGM1), exon 3 matp + 61 + 420 carcinoembryonic antigen-related protein pre-msg < 1 > 642 CGM1 mRNA and introns IVS < 1 60 CGM1 intron B IVS 421 > 642 CGM1 intron C BASE COUNT 151 a 160 c 154 g 177 t ORIGIN 1 ggtgccatct tagccaaata caaaagccct aatgttgatg gatctctgtc ttccttctag 61 tctccacttt gacctgtgga cgcgctgcca cctctgctca gctcagtatt gaatcagtgc 121 cgaccagcat ctctaaagga gaaagcgctc ttctccttgc tcacaatctc ccagagaatc 181 tccgagccat tttctggtat aaaggggcga ttgtgttcaa ggaccttgag gttgctcgat 241 atgtaatagg cacaaattca agtgtgccgg ggcctgccca caacggcaga gagacaatgt 301 acagcaatgg atccctcctg cttcagaatg tcactcggaa cgatgctgga ttctacacct 361 taaaaactct gagtacagat ctgaaaactg aaatagccta tgtgcaactc caggtggaca 421 gtaagtagtt ctctgtgatc attcagtgtt ggtccaggtt tagacacaca gcagtgtttt 481 cttgctctgt acctgccttc cctctgcact ttgtccccat gtaagtattt gagaactttg 541 tgcaagacac acatggtggt ttctgactcc accctcagag agtatcgtgt acgcatgcgt 601 gcgtgcgtgc gtgcgtgcgt gcgtgtgtgt gtgataggaa gg // LOCUS RATCGM1AC5 616 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 4. ACCESSION M32480 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 5 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 616) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 90 + 449 carcinoembryonic antigen-related protein (CGM1), exon 4 matp + 90 + 449 carcinoembryonic antigen-related protein pre-msg < 1 > 616 CGM1 mRNA and introns IVS < 1 89 CGM1 intron C IVS 450 616 CGM1 intron D BASE COUNT 152 a 153 c 147 g 164 t ORIGIN 1 ggaatggaga cctcagctca gggtacaggg cgccatctta gtcaaataca aacaccccaa 61 tattaatgga tctctctctt cttttctagc ctgttttatg agctatgctg gccctcccac 121 ttctgcccag ctcactgtcg aatcaggccc taccagcgtt gctgaaggag caagcgttct 181 tctccttgct cataatctcc ctgagaatct ccgagccatt ttctggtata aaggggcgat 241 tttgttcaag gaccttgagg ttgctcgata tgtaataggc acaaattcaa gtgtgccggg 301 gcctgcccac agcggcagag agacaatgca cagcaatgga tccctcctgc ttcagaatgt 361 cactcggaac gatgctggat tctacacctt aagaactctg agtacagatc tgaaagctaa 421 agtagtacat gtgcaactcc aggtgaacag taagtgaatc tctgtgatta gtctgtgctg 481 ggtggggcta gacacacagg aatgtccttt ctggcctgtg catagtgtcc ccatgttgag 541 gtttgggcgc ttagtgcaag acaaacatgg cggagacaaa ttgccataga tcagacttca 601 ttgtctgatt cccttc // LOCUS RATCGM1AC6 654 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, intron 4. ACCESSION M32481 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 6 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 654) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description IVS < 1 > 654 carcinoembryonic antigen-related protein intron D BASE COUNT 185 a 146 c 155 g 168 t ORIGIN 1 tctcgatgta tgttccccta agaaagacct caatcaggca ggacgctggt tgaggaaagg 61 atggcatcct aagagaggtg agcaccagga agaaccttga ctgcacacat ctgtatgaat 121 ctcaacaact tgtgacccaa gagaacattt tgtcagggct agactattaa ctctcagagc 181 tgacagagaa caatggtgtt ggctgtctat gtcaaaccgg ggtagatatt ttctccaaac 241 atgagtttca tatataaaat ctagaaactt tacagagccc atggaggggt gctgcttatg 301 ggcttgctcc ttgttgcttg ctcagcctgg tttcttatag cacccaggat ccccagtgga 361 ctggactctt ccctatcaat aaccaattag gaaatgtact ctgggcttgc acaggccaat 421 atggtggtga ttttacaact gaggctccct ctttcaaatc taatcgagca tgttgaagtt 481 ggcacagagc cagccagcat agttcctgat ccttttctga gacttgagcc tgccaagagt 541 atcagattgc ttccagccct cacccatctc tagacctgtg ggttggagag cacggtagca 601 agaacattta gaagtaaaaa tggagttgaa tggagccaca aaggaaactg agaa // LOCUS RATCGM1AC7 492 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 5. ACCESSION M32482 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 7 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 492) AUTHORS Rebstock,S., Lucas,K., thompson,F.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 118 + 471 carcinoembryonic antigen-related protein (CGM1), exon 5 matp + 118 + 471 carcinoembryonic antigen-related protein pre-msg < 1 > 492 CGM1 mRNA and introns IVS < 1 117 CGM1 intron D IVS 472 492 CGM1 intron E BASE COUNT 134 a 134 c 105 g 119 t ORIGIN 1 aaatgtctac acctgcatct aggctgagtg aagagtccat ctgctcagga tggaggtcgc 61 catctttcca ccaagcacag tgatcccatg tgatgacttt tctcctttcc cttccagcct 121 cctcgtgctg tgaccctctc actcctgccc cactcacgat agacccagtg ccacggcatg 181 cggctaaagg ggaaagtgtt cttcttcaag ttcgcaatct gccagaggat ctgcgaatgt 241 ttatctggtt caaatctgtg tatacctccc agatctttaa aatagcagag tacagcagag 301 ccattaatta cgtcttcagg ggccctgcac acagcggaag agagacagtg tacacgaatg 361 gatccctgct gctccaggat gccactgaga aagacacagg cttgtacaca ctacaaataa 421 tatacagaaa tttcaaaatt gaaacagcac acgttcaagt cagcgtgcac agtaagtgac 481 tctcaaggtc tc // LOCUS RATCGM1AC8 1341 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM1) gene, exon 6. ACCESSION M32483 J05417 KEYWORDS carcinoembryonic antigen-related protein. SEGMENT 8 of 8 SOURCE R.norvegicus (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM1-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1341) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept + 640 / 987 carcinoembryonic antigen-related protein (CGM1), exon 6 matp + 640 / 987 carcinoembryonic antigen-related protein pre-msg < 1 > 1341 CGM1 mRNA and introns IVS < 1 639 CGM1 intron E IVS 988 1341 CGM1 intron F BASE COUNT 357 a 329 c 321 g 334 t ORIGIN 1 ctacatacca tcccacccca tggcccacat atgcataaac taactgaagt attaaccagt 61 gtcagtagct ctgaatatga gaatttcatc aacacctgga catgcaagga cttgagacat 121 cagtctttta tccacccaca tgtatctgag tctgttcagg cactgaacct tcctaaaaga 181 tcaaactagt ctttcctatc aggactctag ctctagtcga cgtcgactgg acgacagaca 241 aggaagctca ctttgaagtg aagtcaggga ttgaatggaa ccagaaaagg actatgtcaa 301 agagagcaga aggtaaaggt cttcctctgt agaggaagag gtgatggaag gtaccctcat 361 cctccacatc tcctgagtgt gagcaggcac gtgaggacag ggagggtgga gacacgtgag 421 gacagagttt cacgggtagc agaggaagct acacacagtc aggtgcacca agggcatgga 481 ggtcgtttgc tcactccctc tgggttgtgc agacattgcc tcccacccga tgagtgatgg 541 atctaagcta ctctggtcac aggaccacat cttttcacca acggcagagg cgtcaatatt 601 gatggatttg tctctcttct tttctatctg cccttttagc ctgtgttcac ccttctacca 661 ctggccagct tgtaatcgaa tcggtgccac ccaatgttgt tgaaggggga gacgttctcc 721 tacttgttca taatatgcca gagaaccttc aatccttttc ctggtacaaa ggcgtagcca 781 ttgtcaacag acatgaaatc tctcggaaca taatagccag taatagaagc acattggggc 841 ctgctcacag tggcagagag acaatatatt ctaatggctc tcttctgctc cacaatgcca 901 ccgaggagga caatggatta tacaccttat ggactgtaaa cagacattct gaaactcaag 961 ggatacacgt gcacatccac atatacagta agtaattctc tgagatgtct tggtgctggt 1021 ggggttgaac ccatgttaca cacacaggag tgtcaggtgt gaactatgcc tttcttgctc 1081 tccatgtgtc tccatgttgg agtttgaggt gcaggcatat gcctagtaga cgtacggaaa 1141 tgggtcagaa tccctcaccg tctccacctg cagaacaggt gtggagatct cgtgtgacct 1201 gccgtgacag ctgcagtcat ctaggtcacc tgtgcacctc cttctcctga gcctcagtgg 1261 acaagtgcca gaacagaata caactttctt atgggcttag gagactcaca ggaaggtcag 1321 atccgttgcc tgacggtcga c // LOCUS RATCGM4AA 4627 bp ds-DNA ROD 01-AUG-1990 DEFINITION Rat carcinoembryonic antigen-related protein (CGM4) gene, exons 2 and 3. ACCESSION M32475 J05417 KEYWORDS carcinoembryonic antigen-related protein. SOURCE Rat (strain Sprague-Dawley) liver DNA, clone lambda-rnCGM415-1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites for [2]) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. TITLE cDNA and gene analyses imply a novel structure for a rat carcinoembryonic antigen-related protein JOURNAL J. Biol. Chem. 265, 7872-7879 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 4627) AUTHORS Rebstock,S., Lucas,K., Thompson,J.A. and Zimmermann,W. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Zimmermann, 02-MAR-1990. FEATURES from to/span description pept / 145 489 carcinoembryonic antigen-related protein (CGM4), exon 2 (AA at 147) 3379 / 3738 carcinoembryonic antigen-related protein exon 3 pre-msg < 1 > 4627 CGM4 mRNA and introns IVS < 1 144 CGM4 intron B IVS 490 3378 CGM4 intron C IVS 3739 > 4627 CGM4 intron D BASE COUNT 1213 a 1165 c 1162 g 1087 t ORIGIN 1 agatctgggt cgacctgcag gtcaacggat ctgggcctta gcaggagtgt gggcagagct 61 ctgggaaggc agaagtgtga ttttttaaaa aaccaacaga tttcacctgc tcaatatcga 121 tggttgctct gtcttccctt ttagcctccc ttctaacctg ttggctcctg actactgccc 181 aggtcaacat tgaatcggtg ccattcaatg tggttgaagg ggaaaacgtc cttcttcttg 241 tccacaatct gccagagaat ctcatagcct ttgcctggta tagagggctg aggaaaattg 301 gagtatacat actgaacact gaagtaagtg tgacggggcc aatgtacagc ggtagagaga 361 cagtgtacag caatggttcc ctgtgtatcc gcaatgtcac ccagaaggac acaggattct 421 acactctacg aacagtcaac acacgtggag aaactgtatc aacaacatcc ttgtacctct 481 atgtgtacag taagtgatac tttgtgaact ctgggtgttg tgtggggttc attccgtaga 541 cacacacaga agaggcaggc ctacctaccc tttgcattgt gtctccttat tgaggtgtga 601 acatttaact caggctaagg agagtaatgc caattgaata gaatccttct tttgacttta 661 ccttgtagtc agctggatgt gtggttaact cagtgaagga catcagccct tgtctagact 721 tctggggttc ttagcagtaa tgtgtccttg ggaaagacct tgagggaagg agattgggtt 781 tgaatgagat agccatagga tcctcatgga agtgagaacc agaaagccct ggctccagac 841 ctctgtcctg actcatctcc tgatggcccc gagaagcatt ttacaaaggc tggattctga 901 catctgttgg cagggaacag tgcttttgag gagcaaatcc ttgtgccaca tacaatcacc 961 tggtgcacgg ccatgagagc cacagttagg cgaggtctcc tggatctctc cagtgactca 1021 tcagggagag aatagaaaga cagatgtccc ggccactaag ttaactgtta tgatggcctt 1081 atgagacttc caggaaggtc atggttgcca ggaagaggga caaaggacac agatccccct 1141 gacagttgct tgtcctttgg ggtccagctc atagaagtct gtccgcaggc aaatgacacc 1201 aggctctgct gatgtggata gctccccaga tctgagctgc agttctccca gcgatcacga 1261 gggccgcctc agggaaacac aattaacacc cagaagagta tttgtctaaa ccaggaactt 1321 acctcctcct ctggctagct cccctgttcc tacagacatg ggggtcacac agccttctca 1381 gacctaccag ctgcctcctt ttctgctgcc ttgctaggga attatgtgta gtggctgctt 1441 tgtgtatttt ctttggaaaa gatagagtat cctaagggaa tcacccagac agaggttcaa 1501 ggcatctctg aaaggccagg cagcacatgg cagagccacc tcacagctca ggacccagag 1561 gaagtgtgcc caccatcttg aatccatgca tgggacgatg gagcccagag ctacgttcca 1621 ggactcaggt cacctcccac acactcaaga agtgaggctc ctgacacagc tgctcctggg 1681 ccccttttct ccctgagaat cctgactggt ggctgcagtg agaacacatc tgtcccctcc 1741 cccactcgtc acacagctgg cccttgggat cctcacacac atctctgtct ccttcctcct 1801 gagagcaaac tacctctttg acgggcactg agaacacagg gcagactggg tgcccagctg 1861 gttctgggtc acccagggag tgcagaggct cactcactgg tgctgactga gccaggaaga 1921 ggccagaaca gagggatgcc ccccgggtga gctgctgtct tcttagggca cagagatgct 1981 cagaggtttg tttgtcactg tgagctctgt ggcatgagac agaaagagcc cagaggagag 2041 gttaggtgtg taggactgag tgtgcacagg gcagagaaca gagttaccca cagcccacgg 2101 gactctggga tatgatcctg tctggcggag gctgagctca gaggatcaga gaacttggga 2161 gctgtattgg agcagatgtg ctacagactg aggacagatc tggccacaga gaccagggcg 2221 gtgctctgta ccatctgcaa acaatgcccc acctgttggt gctcctgctc acagatgagg 2281 agaccacatt ttacagtgtg tgagaggaga ggactcacct actgtctaaa gtctcttcaa 2341 ggggacaggg actggagaag agtttcaggt ttgtagggct gaaaacacta aagtataggg 2401 gctcatcatc atcatcatca ccaccgccat caccaccacc accaccacca ccaccaccac 2461 caccaccacc accaccacca ccatcatcat catcatcatg aggctcttgg taaataagaa 2521 gaagcagggg gaggaggaga ttattgtcaa cccacagttc accatcaatg agcccagtgt 2581 tctgaagact gaggttctca gctgtgatgc cccaaataag aaaccaagct ggtgttgatc 2641 agtgacatgg ctcagtggat ctgggtgttt gcttcatgtc tgacaacctg agaaccagtg 2701 aacacaagtt gtccctgacc tccacctagg gacggcgttt tgcacccaac acagacacac 2761 tgaggcatgc ccttgcacat gaactcatac accaatataa taagcaaatg cataaaaatt 2821 atagcaaatg gaagcagtca acactgtatt cccaaacata ctaatttgtt aaataaatcc 2881 atggccatgt attcattcat tcattcattc actcattcat ttactctcca agatatttga 2941 gttttctttt gcagtctttt ttttttaaaa gataatataa gacaaatccc agttctcatt 3001 attccctagc cctagactgg aagacgacca gtgaagaaag ctagaaggcg aatcagtcac 3061 taaaggacaa gaaacaaaag agtcagagtg tgacggtcgg gaggcttcac cccaacaccc 3121 atcgactgac actgagggtg agcagggatc tgaggacggt gaggcagggc catgttgaca 3181 cctgaggaga gagcagcata gagaggaaat gacaagtgag gggcgcggag tgcatggagg 3241 taatgcactg acctccacta gctagggcag ggagactccc acacctcagc tgaccactgg 3301 acacagctgc tcggactcag gcaccatctt agccaaatac taaagtcctg atgttgacgg 3361 atctctcttc ccttctagcc tctcttttca tctgtgggcg tccttttaac cctgccaagc 3421 tcactattga atcagtgccg cccagtgttg ctgaaggggg aagcgttctt ctcctcgttc 3481 acaatctcca ggacgagctt cgagggtttt tctggtacaa aggggcgtct atgtctagca 3541 accatgagat agcccgatac agaacagcaa agaattcaag tgtgccaggc cctgcccaca 3601 gtggtagaga gacggtgtac agcaatggat ccctcctgct ccagaatgtc acccggaatg 3661 acactgggtt ctacacccta cgcactctga aaagacatca gaaaatggaa ttggcacacg 3721 tgcaacttca ggtggacagt aagtgatttt ccgtgatcgt tcagtgctgg gtgggtcttt 3781 gacacacagg actgtcaccc ctggcatgtg gctacctcct ctctgccttt ttatccccat 3841 gttgtggtta accactatgt gcaggacaca tgtgatggaa agaaatgccc atgggtcaga 3901 cttatcatct gactctcccc tgtatcaagg acagtaactc aaccctaggt gctagactct 3961 gcccagtcat ctggggcatc ttgccatgca acgtgaggaa accatggatc ctcacagcgt 4021 ggtgagcacc aggaagctct gatctcagtc gtttgtccca gacttgactg caaatgtctc 4081 taggagcatt ttgtcaggag tgctgcttac tgcctctctc ctcacagcct gccatcctga 4141 tcttatagta acccaggaca ctgagcccag gggtgaaaat gctcccagtt gggctgggct 4201 ctcccacatc aatcaccaat taaaaatgta ctacaggtta gcccacaggt tattttggtg 4261 gtggcatttt aaattgaggc ccttgtttca aaaaattcta gcttgtgtta agttgacata 4321 aagccagcag cacgattcct gagccctccc caatacctat atctgccaag aagaccagac 4381 tgttcccacc catcatccgg ccttagtcct gggtgctata ggctgggacg tgagaacatg 4441 tggaatgtga agtctgagga tgaccgcagg tacaaaggag atgagaaagt cagagagtgt 4501 gtatccaggg tgtgtagaga ccaaaggtca ggggaggcat catcccaaag cacagtgtgc 4561 atgagtatgt gcaatgtctg aatgagggca gtgagggaca gccacggaga caccaaggac 4621 agagctc // LOCUS STMRGDA 2540 bp ds-DNA BCT 01-AUG-1990 DEFINITION S.coelicolor 16S rRNA gene and 23S rRNA, 5' end (rrnD) gene cluster. ACCESSION Y00411 M35377 KEYWORDS 16S ribosomal RNA; 23S ribosomal RNA. SOURCE S.coelicolor (strain 1147 A3(2)) DNA, clone RSC33. ORGANISM Streptomyces coelicolor Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 705 to 2230) AUTHORS Baylis,H.A. and Bibb,M.J. TITLE The nucleotide sequence of a 16S rRNA gene from Streptomyces coelicolor A3(2) JOURNAL Nucleic Acids Res. 15, 7176-7176 (1987) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 771 and 2196 to 2540) AUTHORS Baylis,H.A. and Bibb,M.J. TITLE Transcriptional analysis of the 16S rRNA gene of the rrnD gene set of Streptomyces coelicolor A3(2) JOURNAL Mol. Microbiol. 2, 569-579 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 144 ORF (AA at 1) rRNA 536 > 2231 16S rRNA gene rRNA 537 > 2231 16S rRNA gene rRNA 704 2231 16S rRNA rRNA 2507 > 2540 pot. 23S rRNA BASE COUNT 573 a 654 c 834 g 478 t 1 others ORIGIN 1 tgggcccgca tcaccatcgg cgtcctcgcc gagctggcct tcctggccta cgtctacgtt 61 ctgggcggcc gagccgtgcg cgacggcgag acgggtgacg tcgaggcagc cgaacgcagc 121 gccacggtgc caacagccgc ctgatgtgca tccacccctg cgagctgcta gtgtcctctt 181 cgttcccgca agagccgttg acacggagcg agcggggagg tagattcgaa cagttgcctg 241 gagacgggtt caccccagag ggcaacagtg aacatctacc agcttctccg aatcaacgaa 301 ttcgacgaag cactctcccg atgaatcgga aacgaaggcc ggtaagaccg gctcgaaagt 361 tctgataaag tcggagccgc cggaaaggga aacgcgaaag cgggaacctg gaaagcgccg 421 aggaaatcgg atcggaaaga tctgatagag tcggaaacgc aagaccgaag ggaagcgccc 481 ggaggaaagc ccgagagggt gagtacaaag gaagcgtgcc gttccttgag aactcaacag 541 cgtgccaaaa gtcaacgcca gatatgttga taccccgacc tgatcggatc tccgttcggg 601 ttgaggttcc tttgaagtaa cacaacagcg aggacgctgt gaacggtcgg attattcctc 661 cgactgttcc gctctcgtgg tgtcacccga ttacgggtat acattcacgg agagtttgat 721 cctggctcag gacgaacgct ggcggcgtgc ttaacacatg caagtcgaac gatgaaccac 781 ttcggtgggg attagtggcg aacgggtgag taacacgtgg gcaatctgcc cttcactctg 841 ggacaagccc tggaaacggg gtctaatacc ggatactgac cctcgcaggc atctgcgagg 901 ttcgaaagct ccggcggtga aggatgagcc cgcggcctat cagcttgttg gtgaggtaat 961 ggctcaccaa ggcgacgacg ggtagccggc ctgagagggc gaccggccac actgggactg 1021 agacacggcc cagactccta cgggaggcag cagtggggaa tgttgcacaa tgggcgaaag 1081 cctgatgcag cgacgccgcg tgagggatga cggccttcgg gttgtaaacc tctttcagca 1141 gggaagaagc gaaagtgacg gtacctgcag aagaagcgcc ggctaactac gtgccagcag 1201 ccgcggtaat acgtagggcg caagcgttgt ccggaattat tgggcgtaaa gagctcgtag 1261 gcggcttgtc acgtcggttg tgaaagcccg gggcttaacc ccgccactgc agtcgatacg 1321 ggcaggctag agttcggtag gggagatcgg aattcctggt gtagcggtga aatgcgcaga 1381 tatcaggagg aacaccggtg gcgaaggcgg atctctgggc cgatactgac gctgaggagc 1441 gaaagngtgg ggagcgaaca ggattagata ccctggtagt ccacgccgta aacggtgggc 1501 actaggtgtg ggcaacattc cacgttgtcc gtgccgcagc taacgcatta agtgccccgc 1561 ctggggagta cggccgcaag gctaaaactc aaaggaattg acgggggccc gcacaagcgg 1621 cggagcatgt ggcttaattc gacgcaacgc gaagaacctt accaaggctt gacatacacc 1681 ggaaagcatc agagatggtg ccccccttgt ggtcggtgta caggtggtgc atggctgtcg 1741 tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc ttgtcccgtg 1801 ttgccagcaa gccttcgggg tgttggggac tcacgggaga ccgccgggtc aactcggagg 1861 aaggtgggga cgacgtcaag tcatcatgcc ccttatgtct tgggctgcac acgtgctaca 1921 atggccggta caatgagctg cgataccgca aggtggagcg aatctcaaaa agccggtctc 1981 agttcggatt ggggtctgca actcgacccc atgaagtcgg agtcgctagt aatcgcagat 2041 cagcattgct gcggtgaata cgttcccggg ccttgtacac accgcccgtc acgtcacgaa 2101 agtcggtaac acccgaagcc ggtggcccaa ccccttgtgg gagggagctg tcgaaggtgg 2161 gactggcgat tgggacgaag tcgtaacaag gtagccgtac cggaaggtgc ggctggatca 2221 cctcctttct aaggagcaca tagccgactg cagcgaaatg tcctgcacgg ttgctcatgg 2281 gtggaacgtt gactactcgg cacggtcttc ttgatggatc actagtactg cttcggcgtg 2341 gaacgtgact tcaaagaggg gttcgtgtcg ggcacgctgt tgggtatctg agggtacggc 2401 cgtgaggtcg ccttcagttg ccggccccgg taaaaatccg cgtgagtggg ttgtgacggg 2461 tggttggtcg ttgtttgaga actgcacagt ggacgcgagc atctgtggcc aagtttttaa 2521 gggcgcacgg tggatgcctt // LOCUS SUSCYIIAA 230 bp ds-DNA INV 01-AUG-1990 DEFINITION S.purpuratus cytoskeletal actin CyIIa gene, complete cds. ACCESSION M35321 M35322 KEYWORDS cytoskeletal actin SpG11A. SOURCE S.purpuratus DNA, clone pSpG11A. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 230) AUTHORS Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M. TITLE DNA sequence analysis and structural relationships among the cytoskeletal actin genes of the sea urchin Strongylocentrotus purpuratus JOURNAL J. Mol. Evol. 28, 72-86 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 219 > 230 actin CyIIa BASE COUNT 74 a 43 c 33 g 80 t ORIGIN 1 ttcgaattgt cactcattct tcaaataaag attgtgagat cacgcgtttt ctgtacccta 61 ccctacaaat acgtaggaca cctgggtatg tagtgaacct taaagtttat aaatgatgtt 121 cttgtttgtc catcaattta accgggaaaa aaatttatct gtctaatatc attatctatt 181 ttcacacttt tagatcaaac tagattaaac aaatcatcat gtgtgacgac // LOCUS SUSCYIIBA 1972 bp ds-DNA INV 01-AUG-1990 DEFINITION S.purpuratus cytoskeletal actin CyIIb gene, complete cds. ACCESSION M35323 KEYWORDS cytoskeletal actin CyIIb. SOURCE S.purpuratus DNA, clone pSpG11A. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 1972) AUTHORS Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M. TITLE DNA sequence analysis and structural relationships among the cytoskeletal actin genes of the sea urchin Strongylocentrotus purpuratus JOURNAL J. Mol. Evol. 28, 72-86 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 251 616 cytoskeletal actin CyIIb, exon 1 841 1087 cytoskeletal actin CyIIb, exon 2 1312 1829 cytoskeletal actin CyIIb, exon 2 pre-msg 1 1972 CyIIb mRNA and introns IVS 617 840 CyIIb intron A IVS 1088 1311 CyIIb intron B BASE COUNT 529 a 492 c 408 g 543 t ORIGIN 1 tcggcagttc aagaccacgt gtgtttcccg gattggtaaa ctccttatca cgaactcctt 61 atcagtaaaa cttacgagct ttgtacactt ttaatgactt ttcgattatt ctttcaagag 121 attttccctg ccacaaaatt acttagttct tttatttctc attcctgtgc aattccaatt 181 actagcattt tatttatgat ccatttttgt gtttttattt tagagtaaat aaaacgagaa 241 atcaatcatc atgtgtgacg acgatgttgc cgctcttgtc atcgacaacg gatccggtat 301 ggtgaaggcc ggattcgccg gagacgatgc cccaagggct gtcttcccat ccatcgttgg 361 cagaccccgt caccagggtg tcatggtcgg catgggacag aaggacagct acgtcggaga 421 cgaggcccag agcaagagag gtatcctcac cctgaagtac cccatcgagc acggtatcgt 481 caccaactgg gacgatatgg agaagatctg gcatcacacc ttctacaacg aactccgtgt 541 tgccccggag gagcaccccg tcctccttac cgaggctccc ctcaacccca aggccaacag 601 ggaaaagatg acacaggtta gaaaaagcaa tatgcctatt attgaagtaa tcaaattctc 661 aaaacaaata cattctcaca tttaaacatc ttaatttaag ctgtttatta atattaatat 721 caagtgagtt tcgttgttga aataacagcg attgactaaa atgaacttgt atcaaacttg 781 ttgtgattag tgaaatgaaa tcggtgatta acaattgttt tgttttcatg tcttctgcag 841 atcatgttcg agaccttcaa ctcacccgcc atgtacgtcg ctatccaggc cgtgctttcc 901 ctctacgcct ctggtcgtac cactggtatc gttttcgact ctggtgatgg tgtttcacac 961 acagtgccca tctacgaggg ttatgccctt ccccacgcca tcctccgtct ggacttggct 1021 ggacgtgatc tcacagacta cctgatgaag atccttaccg agcgtggcta ctctttcacc 1081 accaccggta agatatcttt tttttacaat caaagagtga gtgaagctat cacctgcatc 1141 ctgtgcttaa agaatattaa aaaaagagga gggaagatat tatatatgat taatgttcat 1201 tttctttgga ctttgacaat aacattttgg ggggatagaa agtgaatgtt gcttttcgtt 1261 atacattcgt aactaactaa tttcatcttg tttttttttt ctatcttgca gctgagcgtg 1321 aaatcgttcg tgacatcaag gagaagctct gctacgttgc tcttgacttt gagcaagaga 1381 tgcagactgc tgcctcatcc tcctccctcg agaagagcta cgagcttccc gacggacagg 1441 tcatcaccat tggcaacgag cgattccgtg ccccagaggc cctcttccag ccagccttcc 1501 ttggaatgga atccgctgga atccacgaga cctgctacaa cagcatcatg aagtgcgatg 1561 ttgacatccg taaggatctg tacgccaaca ctgttctgtc tggaggctcc accatgttcc 1621 caggaatcgc cgacaggatg cagaaggaga tcaccgccct tgccccacca accatgaaga 1681 tcaagatcat tgctcctcca gaaaggaaat actccgtatg gatcggaggc tccatccttg 1741 cctctctctc caccttccaa cagatgtgga tcagcaagca ggaatacgat gagtccggcc 1801 catccatcgt ccacaggaag tgcttctaaa caactcgctt ttggtgaaca aactcttgaa 1861 catcaatatc aaggaaacga ccatgatctc aaattgcaaa gtttaagtat gacaccattg 1921 cgggcaatgc agccgaaaaa ctcgcgcttt ctcaaaactt ggaggactgc ag // LOCUS SUSCYIIIBA 2918 bp ds-DNA INV 01-AUG-1990 DEFINITION S.purpuratus cytoskeletal actin CyIIIb gene, complete cds. ACCESSION M35324 KEYWORDS cytoskeletal actin CyIIIb. SOURCE S.purpuratus DNA, clone pSpG11A. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 2918) AUTHORS Durica,D.S., Garza,D., Restrepo,M.A. and Hryniewicz,M.M. TITLE DNA sequence analysis and structural relationships among the cytoskeletal actin genes of the sea urchin Strongylocentrotus purpuratus JOURNAL J. Mol. Evol. 28, 72-86 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 371 736 cytoskeletal actin CyIIIb, exon 1 1634 1880 cytoskeletal actin CyIIIb, exon 2 2247 2764 cytoskeletal actin CyIIIb, exon 2 pre-msg 1 1972 CyIIIb mRNA and introns IVS 737 1633 CyIIIb intron A IVS 1881 2246 CyIIIb intron B BASE COUNT 850 a 668 c 559 g 841 t ORIGIN 1 acggttcggg catttaggga tagctttgat tttaagaatg ttaaaatgag aatgtcaaat 61 agcctaacgc tggtctgtgc cagtaaacat gaatcaattc caaatgttga tatattaata 121 gtcggggagt tcaaatagga caacatgttt cacggggata gaattatcag acataattat 181 aatcccactt tgtcgtgaat tttgttggtt gtatgaaagt tttttagacc gtttgaaagg 241 aaaacagacc tatgccaaat ccaccaccac gaattaacta gtctgcaaac aaagaaacta 301 aaattaatat ttctctgggt atgtttttct catattcagg acaggaaaac gaaattcaat 361 catcatgtgt atgtgtgacg atgatgttgc cgctcttgtc gtcgacaacg ggtccggaat 421 ggtgaaggcc ggattcgccg gagacgatgc cccaagggct gtctttccat ccatcgttgg 481 caggccccgt caccagggtg tcatggttgg tatgggacaa aaggacagct acgttggaga 541 cgaagcacag agcaagagag gtatcctcac cctgaagtac cctattgagc acggtatcgt 601 caccaactgg gacgatatgg agaagatctg gcatcacacc ttctacaacg agctccgtgt 661 tgccccagag gagcaccccg tccttctgac agaggccccc ctcaacccta aggccaacag 721 ggaaaagatg acacaggtaa ggatatagtg cggaattgca aaacattcct taaagatact 781 atgtctcttt tgcacccaac atcagattct gtagaacttt gcaggaacta taattatgac 841 ttgtcatgta tgtcctatct atgaaatcta aacattagca atgtcgtatt attcgaatta 901 tgcaaggaaa cccgtttatc ttctagactt cactgtcaga cttactgaca tctatttttc 961 tttattgtaa taacatacat acatttagct ttaacaggta catgagcatt tgtctacatc 1021 aataacccac tatttgtgac ggccaaaatt aaactgattg aatatttgta cagcacaaaa 1081 cgtacgacca atcggtgaaa gggtgtgaaa atgaaactat tacttaggtg atcgcaatta 1141 cttaactcga ttcgataact aatggtaaca tgtagttatt ttcccactaa aagccctttt 1201 taatcctttc gtttcgaagg aacttctaac ttagtttttt tccttcaaat gcagttggaa 1261 tttaatcttt tcattgttgg cctgcaaatg ggacatacag tagtaccttt aactgcattt 1321 tggcaggaat gaaatgaaca acggctacag atagcccacg tcaccaatag cctacataag 1381 cgaagaaaac tagtcggata cccccacacg accgacatat cgctctccct gaccaatcta 1441 aaatatcgtt tttctttttt aaagtccata aaatgctatg aaaacctttt cgtttcttta 1501 ctgcagtgaa aataaaagct gatacggact acgagtacaa aatcgcgaac attcagataa 1561 aaaagttgaa tttgcccagt ttataatccc tagagtttat tcttaattca aaaaaatatt 1621 cttcttttgt tagatcatgt ttgagacctt caactcgccc gccatgtacg tcgccatcca 1681 ggccgtgctt tccctctacg cctctggtcg taccactggt atcgttttcg actctggcga 1741 cggtgtttca cacactgtac caatctatga gggttacgcc ctcccccacg ccatcatccg 1801 tctggacttg gctggacgtg atcttaccga ttacctgatg aagatcctta ccgagcgtgg 1861 ctactctttc accaccactg gtaagacatg atatggataa tagcaatagc taatgatgat 1921 aattaaaata gggataattg ataatattag aatactaatg taaacagatg aatgtcttac 1981 caaagggcag tctgtctcgg gttttgaatt caaaaacctc acatctcgtt atctttaagc 2041 cgcagaccac aacacctgca tgttcatttt tttttttact gcttgttcaa atccttttga 2101 caaagcgaat atctgattag atcgataata attaataaca aataccctct aagtcccgga 2161 gtttcaacac atttccattg ttatcttcac attttacaat ttgtctgcaa ttgatatgtg 2221 actgcatcca ttattatctc ttacagctga gcgtgaaatc gtccgtgaca taaaggagaa 2281 gctctgctac gtagctcttg attttgagga ggagatgcaa actgctgcct catcctcctc 2341 cctcgagaag agctacgagc ttcccgacgg acaggtcatc accatcggca acgagcgatt 2401 tcgttgctca gagaccctct tacagccctc tttcattgga atggaatctg ctggaatcca 2461 tgagacctgt tataacagca tcatgaagtg cgatgttgac atccgtaagg atctatacgc 2521 caacaccgtt ctctccggag cttccaccat gttcccagga atcgctgaca ggatgcagaa 2581 agagattgtc gcccttgccc caccaaccat gaagatcaag atcatcgctc ctcctgagag 2641 gaaatactct gtatggatcg gaggctccat tcttgcctct ctctccacct tccaacagat 2701 gtggatcagc aagcaggaat acgatgagtc tggtccatcc atcgtccaca ggaagtgctt 2761 ctaaacaacc ttccaacaga tttggatcag caagcaggaa tacaatgagt ccggtccatc 2821 catcgtccaa gggaagtgct tctaaacaac ttgattttct tctacttcta atgagcaacc 2881 tgattttttt aattctgttt cactccatgt tgccacct // LOCUS WHTIVSS 310 bp ds-DNA PLN 01-AUG-1990 DEFINITION Wheat amylase gene, exons 2 and 3 (partial). ACCESSION M26770 KEYWORDS . SOURCE Wheat DNA, clone pSP64Amyi. ORGANISM Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 310) AUTHORS Brown,J.W.S., Feix,G. and Frendewey,D. TITLE Accurate in vitro splicing of two pre-mRNA plant introns in a HeLa cell nuclear extract JOURNAL EMBO J. 5, 2749-2758 (1986) STANDARD simple staff_entry FEATURES from to/span description pept < 1 109 amylase, exon 2 (AA at 1) 214 > 310 amylase, exon 3 pre-msg < 1 > 310 amylase mRNA and intron IVS 110 213 amylase intron 2 BASE COUNT 79 a 91 c 76 g 64 t ORIGIN 1 gaatacaagc ttgggctgca ggtcgacgca gaggctgtgg ccattcccct cggacaaggt 61 catgcagggc tacgcctaca tcctcacaca cccgggcata ccatgcatcg taagtagtag 121 cacactacac aacctcacca taacatttcg catcaaacgt accccacgat gtttgtgatc 181 tgaacttaca actacttggt tttgcgcgcg cagttctacg accatgtgtt cgactggaaa 241 ctgaagcagg agatcaccgc actggctacg gtcaggtcaa ggaacgggat ccccgggcga 301 gctcgaattc // LOCUS YSCMTARSA 384 bp ds-DNA ORG 01-AUG-1990 DEFINITION Yeast (S.cerevisiae) mitochondrial autonomously replicating sequence DNA. ACCESSION M35612 KEYWORDS . SOURCE S.cerevisiae (strain 992) mitochondrial DNA, clone pYmit1021. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 384) AUTHORS Mabuchi,T., Nishikawa,S. and Wakabayashi,K. TITLE The nucleotide sequence of mitochondrial ARS in Saccharomyces cerevisiae JOURNAL J. Gen. Appl. Microbiol. 30, 469-478 (1984) STANDARD simple staff_entry FEATURES from to/span description site 46 56 consensus autonomously replicating sequence site 126 136 consensus autonomously replicating sequence site 245 255 consensus autonomously replicating sequence site 290 300 consensus autonomously replicating sequence site 148 156 ori/rep GC cluster A site 187 194 ori/rep GC cluster A BASE COUNT 126 a 29 c 34 g 195 t ORIGIN 1 ccgccgcggg cggacgccgg aggagaatta tatttttata taataattta tatttctata 61 tatatatata tatattatat ataaatatta ttatatatat ttttatatat attataatta 121 tattcattaa tattttatta tagtggtggg ggtcccaatt attattttca ataataattt 181 atcatgggac ccggatatct tcttgttttt atttattatt ttttttaatt tattttaatt 241 atttatttat aatttatatt atacaattta ttatttcgtt aataccttta tttatattat 301 ataatatatt atattattat aatatattta ttgattatat taatacattt aactaatgtg 361 tgctctatat ttattgaata gttt // LOCUS YSCMTARSB 218 bp ds-DNA ORG 01-AUG-1990 DEFINITION Yeast (S.cerevisiae) mitochondrial Ser-tRNA, 3' end in and autonomously replicating sequence. ACCESSION M35613 KEYWORDS transfer RNA-Ser. SOURCE S.cerevisiae (strain 992) mitochondrial DNA, clone pYmit1S2SC-delta-11. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 218) AUTHORS Mabuchi,T., Nishikawa,S. and Wakabayashi,K. TITLE The nucleotide sequence of mitochondrial ARS in Saccharomyces cerevisiae JOURNAL J. Gen. Appl. Microbiol. 30, 469-478 (1984) STANDARD simple staff_entry FEATURES from to/span description tRNA < 1 49 Ser-tRNA site 138 148 consensus autonomously replicating sequence BASE COUNT 99 a 25 c 11 g 83 t ORIGIN 1 ctatcattag tctttattgg ctacgtaggt tcaaatccta catcatccgt aataatacat 61 atatataata ataattttaa tattattcct ataaaaataa aataaataaa taaataataa 121 taattaatta attttaataa atataaaata tataaaataa taataataat aattattatt 181 ttaataatat tatttatata atagtccggc ccgccccc // LOCUS MUSMDRXX 2873 bp ds-DNA ROD 01-AUG-1990 DEFINITION Mouse P-glycoprotein (mdr1a) gene, exons 1 and 2. ACCESSION M33580 KEYWORDS P-glycoprotein. SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line J774.2-vinblastine resistant subline J7.V1-1 DNA, clone pV1.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2873) AUTHORS Hsu,S.I.-H., Cohen,D., Kirschner,L.S., Lothstein,L., Hartstein,M. and Horwitz,S.B. TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter reveals the basis for differential transcript heterogeneity in multidrug-resistant J774.2 cells JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.S.Kirschner, 05-APR-1990. FEATURES from to/span description pept 2613 / 2677 P-glycoprotein, exon 2 (first expressed exon) pre-msg 1992 > 2873 P-glycoprotein mRNA and introns (alt.) pre-msg 1801 > 2873 P-glycoprotein mRNA and introns (alt.) IVS 2120 2606 P-glycoprotein intron A IVS 2678 > 2873 P -glycoprotein intron B signal 1904 1912 CAAT box signal 1956 1963 TATA box site 1880 1887 SP-1 site site 1921 1927 SP-1 site site 1937 1944 SP-1 site site 1869 1875 AP-1 site rpt 1 1300 L1Md repetitive element BASE COUNT 860 a 621 c 714 g 678 t ORIGIN Chromosome 5. 1 gaattctcac ctgaggaata ccgaatccag agaaacacct gaaaaaatgt tcaacatcct 61 taatcatcag ggaaatgcaa atcaaaacaa ccctgagatt ccacctcaca ccagtcagaa 121 tggctaagat caaaaattca ggtgacagca gatgctggcg aggatgtgga gaaagaggaa 181 cactcctcca ttgttggtgg gagtgcaggc ttgtacaacc actctggaaa tcagtctggc 241 ggttcctcag aaaactggac atagtactct cggaggatcc agcaatacct ctcctgggca 301 tatatccaga agatgcccca acaggtaaga aggacacatg ctccactatg ttcatagcag 361 ccttatttat aatagccaga agctggaaag aacctagatg cccctcaaca gaggaatgga 421 tacagaaaat gtggtacatc tacacaatgg agtactactc agctattaaa aagaatgaat 481 ttatgaaatt cctagccaaa tggatggacc tggggggcat catcctgagt gaggtaacac 541 attcacaaag aaactcacac aatatgtatt cactgataag tggatattag ccccaaacct 601 aggataccca agatataaga tataatttgc taaacacatg aaactcaagg agaatgaaga 661 ctgaagtgtg gacactatgc ccctccttag atttgggaac aaaacaccca tggaaggagt 721 tacagagacg gagtttggag ctgagatgaa aggatggacc atgtagagac tgccatagcc 781 agggatccac cccataatca gcatccaaac gctgacacca ttgcatacac tagcaagatt 841 ttattgaaag gacgcagatg tagctgtctc ttgtgagact atgccggggc cagcaaacac 901 agaagtggat gctcacagtc agctaatgga tggatcatag ggctcccaat ggaggagcta 961 gagaaagtag ccaaggagct aaagggatct gcaaccctat aggtggaaca acattatgag 1021 ctaaccagta ccccggagct cttgactcta gctgcatata tatcaaaaga tggcctagtc 1081 ggccatcact ggaaagagag gcccattgga cttgcaaact ttatatgccc cagtacaggg 1141 gaataccagg gccaaaaagg gggagtgggt gggcagggga gtgggggtgg gtggatatgg 1201 gggacttttg gtatagcatt ggaaatgtaa atgagttaaa tacctaataa aaaatggaaa 1261 aaaaaataaa ataaaaataa gatgaaactg gaaaaaaaaa gttatgttta ataattccaa 1321 ttgaactgta agaatttcag atgccctgga aaaacatgga cattggttta gtacctaaaa 1381 gttcaaaata ttatatattt ttaaatacca ttttacactg aaatactcca tttatatact 1441 ggggactgtc ctctttctgg tttgctttgt tttgtttaat aaaagaaata aaccaatcta 1501 cctgaggaac tgtgaactat attgaagaaa agcctgcacg ggggttctct taccttttca 1561 agagtgcttc aaagaaggga aatttactga caggcaaggt ctgtacccat tgtttaattg 1621 tctgttagat gttatgcata gaatacgtct tttaacttag ccaaatgcag aaggccaagt 1681 gcactatcta caaacacata actctatata tagacatgtg catggccgtg tagagatgag 1741 actctgcaag tgtgtctcta atgattcggg ggatatgagt ttgtctaatt gacctttgag 1801 agggaaacca gactgcacat ttcatctaca aatccaacct gtttcgcaat ttctccagca 1861 ataatacttg agtcaagctg ggccgggagc tggttaacct ccaggtcaaa ctcactggct 1921 gggcgggact gcgcctgggc gtagattgag catgctaaat ttactctcct gtccacagaa 1981 agcccaggca cagtggaaca gcggtttcca ggagctgctg gtcccatctt ccaaggctct 2041 gctcaactca gagccgcttc ttccaaagtc tacatcttgg tggactttgc agaggaaacc 2101 gggagtagag acacgtgagg taagcatttc ctaggaaggg tcgggtgttc cggataccag 2161 agcctggtcc gggtgtcagc gtaatcgtga gtctgtgggg accaagtggc gacacaagag 2221 tcgctccagg agcacccgca gcatcagctt tcaggacggt gttttccgcg ccaccctgtg 2281 ctgtggatct cgctgcccag ctcgcagcca ggggtggtgg aggagcgcgc cagggcgagg 2341 ggacccagca ggcgggtggc ggacctagag ccgagcaccc ggtccacgca ggtgacacag 2401 cttcccggga ttccccagtg agttacctcc aggccctctc cggcagcatc agggcggggc 2461 tcctcctcac cactgggctc tgcggggcag tgagctttgc ataaactctg gtcccgtgtt 2521 tggctaatga actgtggttt ctccccaggt cgtgatggaa cttgaagagg accttaaggg 2581 aagagcagac aagaacttct ccccaggtcg tgatggaact tgaagaggac cttaagggaa 2641 gagcagacaa gaacttctca aagatgggca aaaagaggta gccagattgt ttcactttcg 2701 tactttactt gtcttgtaca ttcgggcaat tagtttgtag cctccagcac tgtacttgat 2761 tagtgggtgt tatttcagac ttcagaaatg taaaccagcc cttggaagga actcctcgct 2821 tggagcagtc cttcaaatgt gtgtgacaga tcaatcaatg attctgtgaa ttc // LOCUS MUSMDR1A 4924 bp ss-mRNA ROD 01-AUG-1990 DEFINITION Mouse P-glycoprotein (mdr1a) mRNA, complete cds. ACCESSION M33581 KEYWORDS P-glycoprotein. SOURCE Mouse (strain BALB/c/NIH) macrophage-like cell line J774.2-vinblastine resistant subline J7.V1-1, cDNA to mRNA, library pUC18-cDNA and pGEM-zf, clones pV1.PRC2, pV1.3, pV1.20, and pV1.10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4924) AUTHORS Hsu,S.I.-H., Cohen,D., Kirschner,L.S., Lothstein,L., Hartstein,M. and Horwitz,S.B. TITLE Structural analysis of the mouse mdr1a (P-glycoprotein) promoter reveals the basis for differential transcript heterogeneity in multidrug-resistant J774.2 cells JOURNAL Mol. Cell. Biol. 10, 3596-3606 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.S.Kirschner, 05-APR-1990. Albert Einstein College of Medicine, 1300 Morris Park Ave, Bronx, NY 10461 FEATURES from to/span description pept 137 3967 P-glycoprotein (mdr1a) mRNA < 1 4924 P-glycoprotein mRNA signal 4315 4320 poly-A signal signal 4898 4903 poly-A signal BASE COUNT 1450 a 1021 c 1210 g 1243 t ORIGIN Chromosome 5. 1 acagtggaac agcggtttcc aggagctgct ggtcccatct tccaaggctc tgctcaactc 61 agagccgctt cttccaaagt ctacatcttg gtggactttg cagaggaaac cgggagtaga 121 gacacgtgag gccgtgatgg aacttgaaga ggaccttaag ggaagagcag acaagaactt 181 ctcaaagatg ggcaaaaaga gtaaaaagga gaagaaagaa aagaaaccag cagtcagtgt 241 gcttacaatg tttcgttatg caggttggct agacaggttg tacatgctgg tgggaactct 301 ggctgctatt atccatggag tggcgctccc acttatgatg ctgatctttg gtgacatgac 361 agatagcttt gcaagtgtag gaaacgtctc taaaaacagt actaatatga gtgaggccga 421 taaaagagcc atgtttgcca aactggagga agaaatgacc acgtacgcct actattacac 481 cgggattggt gctggtgtgc tcatagttgc ctacatccag gtttcatttt ggtgcctggc 541 agctggaaga cagatacaca agatcaggca gaagtttttt catgctataa tgaatcagga 601 gataggctgg tttgatgtgc atgacgttgg ggagctcaac acccggctca cagatgatgt 661 ttccaaaatt aatgaaggaa ttggtgacaa aatcggaatg ttcttccagg caatggcaac 721 attttttggt ggttttataa taggatttac ccgtggctgg aagctaaccc ttgtgatttt 781 ggccatcagc cctgttcttg gactgtcagc tggtatttgg gcaaagatat tgtcttcatt 841 tactgataag gaactccatg cttatgcaaa agctggagca gttgctgaag aagtcttagc 901 agccatcaga actgtgattg cgtttggagg acaaaagaag gaacttgaaa ggtacaataa 961 caacttggaa gaagctaaaa ggctggggat aaagaaagct atcacggcca acatctccat 1021 gggtgcagct tttctcctta tctatgcatc atatgctctg gcattctggt atgggacttc 1081 cttggtcatc tccaaagaat actctattgg acaagtgctc actgtcttct tttccgtgtt 1141 aattggagca ttcagtgttg gacaggcatc tccaaatatt gaagccttcg ccaatgcacg 1201 aggagcagct tatgaagtct tcaaaataat tgataataag cccagtatag acagcttctc 1261 aaagagtggg cacaaaccag acaacataca aggaaatctg gaatttaaga atattcactt 1321 cagttaccca tctcgaaaag aagttcagat cttgaagggc ctcaatctga aggtgaagag 1381 cggacagacg gtggccctgg ttggcaacag tggctgtgga aaaagcacaa ctgtccagct 1441 gatgcaaagg ctctacgacc ccctagatgg catggtcagt atcgacggac aggacatcag 1501 aaccatcaat gtgaggtatc tgagggagat cattggtgtg gtgagtcagg aacctgtgct 1561 gtttgccacc acgatcgccg agaacattcg ctatggccga gaagatgtca ccatggatga 1621 gattgagaaa gctgtcaagg aagccaatgc ctatgacttc atcatgaaac tgccccacca 1681 atttgacacc ctggttggtg agagaggggc gcacgtgagt gggggacaga aacagagaat 1741 cgccattgcc cgggccctgg tccgcaatcc caagatcctt ttgttggacg aggccacctc 1801 agccctggat acagaaagtg aagctgtggt tcaggccgca ctggataagg ctagagaagg 1861 ccggaccacc attgtgatag ctcatcgctt gtctaccgtt cgtaatgctg acgtcattgc 1921 tggttttgat ggtggtgtca ttgtggagca aggaaatcat gatgagctca tgagagaaaa 1981 gggcatttac ttcaaacttg tcatgacaca gacagcagga aatgaaattg aattaggaaa 2041 tgaagcttgt aaatctaagg atgaaattga taatttagac atgtcttcaa aagattcagg 2101 atccagtcta ataagaagaa gatcaactcg caaaagcatc tgtggaccac atgaccaaga 2161 caggaagctt agtaccaaag aggccctgga tgaagatgta cctccagctt ccttttggcg 2221 gatcctgaag ttgaattcaa ctgaatggcc ttattttgtg gttggtatat tctgtgccat 2281 aataaatgga ggcttacagc cagcattctc cgtaatattt tcaaaagttg taggggtttt 2341 tacaaatggt ggcccccctg aaacccagcg gcagaacagc aacttgtttt ccttgttgtt 2401 tctgatcctt gggatcattt ctttcattac attttttctt cagggcttca catttggcaa 2461 agctggagag atcctcacca agcgactccg atacatggtt ttcaaatcca tgctgagaca 2521 ggatgtgagc tggtttgatg accctaaaaa caccaccgga gcactgacca ccaggctcgc 2581 caacgatgct gctcaagtga aaggggctac agggtctagg cttgctgtga ttttccagaa 2641 catagcaaat cttgggacag gaatcatcat atccctaatc tatggctggc aactaacact 2701 tttactctta gcaattgtac ccatcattgc gatagctgga gtggttgaaa tgaaaatgtt 2761 gtctggacaa gcactgaaag ataagaagga actagaaggt tctggaaaga ttgctacgga 2821 agcaattgaa aacttccgca ctgttgtctc tttgactcgg gagcagaagt ttgaaaccat 2881 gtatgcccag agcttgcaga taccatacag aaatgcgatg aagaaagcac acgtgtttgg 2941 gatcacgttc tccttcaccc aggccatgat gtatttttct tatgctgctt gtttccggtt 3001 cggtgcctac ttggtgacac aacaactcat gacttttgaa aatgttctgt tagtattctc 3061 agctattgtc tttggtgcca tggcagtggg gcaggtcagt tcattcgctc ctgactatgc 3121 gaaagcaaca gtgtcagcat cccacatcat caggatcatt gagaaaaccc ccgagattga 3181 cagctacagc acgcaaggcc taaagccgaa tatgttggaa ggaaatgtgc aatttagtgg 3241 agtcgtgttc aactatccca cccgacccag catcccagtg cttcaggggc tgagccttga 3301 ggtgaagaag ggccagacgc tggccctggt gggcagcagt ggctgcggga agagcacagt 3361 ggtccagctg ctcgagcgct tctacgaccc catggctgga tcagtgtttc tagatggcaa 3421 agaaataaag caactgaatg tccagtggct ccgagcacag ctgggcattg tgtcccaaga 3481 gcccattctc tttgactgca gcatcgcaga gaacattgcc tacggagaca acagccgggt 3541 cgtgtcttat gaggagattg tgagggcagc caaggaggcc aacatccacc agttcatcga 3601 ctcgctacct gataaataca acaccagagt aggagacaaa ggcactcagc tgtcgggtgg 3661 gcagaagcag cgcatcgcca tcgcacgcgc cctcgtcaga cagcctcaca ttttacttct 3721 ggacgaagca acatcagctc tggatacaga aagtgaaaag gttgtccagg aagcgctgga 3781 caaagccagg gaaggccgca cctgcattgt gatcgctcac cgcctgtcca ccatccagaa 3841 cgcggacttg atcgtggtga ttcagaacgg caaggtcaag gagcacggca cccaccagca 3901 gctgctggcg cagaagggca tctacttctc aatggtcagt gtgcaggctg gagcaaagcg 3961 ctcatgaact gtgaccatgt aagatgttaa gtatttttat tgtttgtatt catatatggt 4021 gtttaatcca agtcaaaagg aaaacactta ctaaaatagc cagttatcta ttttctgcca 4081 cagtggaaag catttagttt ggtttagagt cttcagaggc tttgtaatta aaaaaacaaa 4141 aatagataca gcatcaaatg gagattaatg ctttaaaatg cactataaaa tttataaaag 4201 ggttaaaagt gaatgtttga taatatatac ttttatttat actttctcat ttgtaactat 4261 aactgatttc tgcttaacaa attatgtatg tatcaaaaat tactgaaatg tttgtataaa 4321 gtatatatag tgaaactgag cattcatatt tttgagttat tttgctcaaa tgcatgcgaa 4381 attatatatt gtcccaactg ggatattgta cataatttta gcctttaaaa aacagtccat 4441 tactgggggg agggggcatc actctatggg caaagtgtta ctcagacatg ggcacctgag 4501 ttcagatccc taccacctaa gtaagcagac aaggtgtggt gtttttgtaa tgccagtgct 4561 agaggcagaa aagacagatc ctgcaggctc agtggctggc caaacagcct agccaacata 4621 gcgcgttcca ggttcagtga gaaaacttgt ctcaaaaatc agagggaaaa gcaaatgagg 4681 tgtcagccat gtgcactcat gcaaatgcca tacatgcaga agtatgtgca cacacacgca 4741 cacattaacc aacgactagc aaggaaaatg aaggtggata agaggggtgg gactgggaca 4801 aaggagggta cctggatgaa tatgactgaa ggacgttatg tacacatatg aaaacgtcgt 4861 actgaaactc actacaatgt atacttaata tattgctaat aaaatatttt taaaagaaaa 4921 aaat // LOCUS RICCPCTA 2526 bp ds-DNA ORG 01-AUG-1990 DEFINITION Rice chloroplast beta and epsilon subunit (atpB and atpE) genes, complete cds. ACCESSION M31464 Y00323 KEYWORDS atpB protein; atpE protein. SOURCE Rice chloroplast DNA, clone Ct-3. ORGANISM Chloroplast Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Oryza sativa. REFERENCE 1 (bases 1 to 2526) AUTHORS Moon,E., Kao,T.-h. and Wu,R. TITLE Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters from rice JOURNAL Nucleic Acids Res. 15, 4358-4359 (1987) STANDARD simple staff_review FEATURES from to/span description pept 398 1894 atpB protein pept 1891 2304 atpE protein BASE COUNT 770 a 459 c 563 g 734 t ORIGIN 1 cccccttttc ttattttgag tccaaatacc taaatactac gaaaattctc tgttgacagc 61 aatctatgct tcacagtagt atatattttg tatatcgaag tcctagataa gaaagtagag 121 taggcacaaa tcgtttacaa aaggcaaaat gtatatgaaa aaaagattga ttgaactttc 181 cgacgggctc attccatgag taaacgattg aatgggattc gtttgggcaa cgaaatcaag 241 tgctggtccc cttttctctc ttattgaatt aactaattca tttccttttg acttttggga 301 tttttggata tttttttggt gttgatttgg cattattcaa caagaaaaaa atcaaaattt 361 cgataaattc cttttttttg aaaattatgt gataattatg agaaccaatc ctactacttc 421 tcgtcccggg gtttctacaa ttgaagaaaa aagtacaggg cgtatcgatc aaattattgg 481 acccgtgctg gatgtcactt ttcccccggg caagttacct tatatttata atgctttggt 541 agtcaagagt cgagacactg agggtaagca aattaatgta acttgtgagg tacaacaatt 601 attaggaaat aatcgagtta gagctgtagc tatgagtgct acagatgggt tgatgagagg 661 aatggaagtg attgacacgg gagctcctct cagtgttcct gtcggtggag ctactcttgg 721 acgaattttc aacgttcttg gggagcctgt tgacaatttg ggtcctgtag atactagtgc 781 aacattccct attcatagat ccgcgcccgc ctttatcgag ttagatacga aattatccat 841 ctttgaaact ggtattaagg tggtcgatct tttagctcct tatcggcgtg gaggaaaaat 901 cggactattt gggggagctg gagtaggtaa aacagtactc atcatggaat taatcaacaa 961 tattgctaaa gctcacgggg gcgtatccgt atttggcgga gtaggggaac ggactcgtga 1021 aggaaatgat ctttatatgg aaatgaagga atctggagta attaatgaaa aaaatcttga 1081 ggaatcaaag gtagctctag tctatggcca aatgaatgaa ccgccaggag ctcgtatgag 1141 agttggtttg actgccctaa ctatggcaga atatttccga gatgttatta agcaagacgt 1201 gcttctattc atcgataata tctttcgttt tgttcaagca ggatcggagg tatctgcctt 1261 attagggaga atgccctctg cagtgggtta tcaacctact cttagtacag aaatgggttc 1321 tttgcaagaa agaattactt ctactaaaaa gggatctata acttcgatcc aagcggttta 1381 tgtacctgcg gacgatttga ccgaccctgc tcctgctaca acatttgcac atttggatgc 1441 tactaccgta ctttccagag gattagcttc caaagggatt tatcctgcag tagatccttt 1501 agattcaacc tcaactatgt tacaacctcg gatcgttggc aacgaacatt atgaaactgc 1561 gcaaagagtt aagcaaactt tacaacgtta caaagaactt caggacatta tcgcaattct 1621 tgggttggat gaattatcgg aggaggatcg tttaactgta gcaagagcac gaaaaattga 1681 gcgcttctta tcacaaccgt tttttgtggc agaagttttt accggttctc caggaaagta 1741 tgttggtctt gcagaaacta ttaggggatt tcaactaatc ctttccggag aattagacgg 1801 cctacccgaa caggcttttt atttggtggg taacatcgat gaagctagca cgaaagctat 1861 aaacttagaa gaggagaaca acttgaagaa atgaaattaa atctttatgt actgactcct 1921 aagcgaatta tttgggattg tgaagtgaaa gaaatcattt tatctactaa tagtggccaa 1981 attggcgtat taccaaacca cgcccccatt aacacagctg tagatatggg tcccttgaga 2041 atacgcctcc tcaacgatca atggttaacg gcggttctgt ggagcggttt tgccagaata 2101 gttaataatg agatcatcat tttaggaaat gatgcggaac tgggtagtga cattgatccg 2161 gaagaagctc aacaggcact tgaaatagcc gaagctaacg tgagtagagc tgagggtacg 2221 aaagaattgg ttgaagcgaa ggtagctctc agacgagcta ggatacgagt cgaggctgtt 2281 aattggattc ccccatctaa ttgaagacaa cccaacggtt tagttgatac aaagaaaaag 2341 ggaagagggg tagaaaaaat tattagatag cgaagcgaag tagggccaat gctatctagt 2401 aatttttcta cctacctacc tactattgga tttgaaccaa tgactcccgc cgtatgaaag 2461 caatactcta accactgagt taagtaggca atttatcacc acaaaggaag accctttact 2521 tcgatc // LOCUS RICCPCTB 2524 bp ds-DNA ORG 01-AUG-1990 DEFINITION Rice mitochondrial beta and epsilon subunit (atpB and atpE) pseudogenes, complete cds. ACCESSION M31465 Y00323 KEYWORDS pseudogene. SOURCE Rice chloroplast DNA, clone Ct-1. ORGANISM Chloroplast Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Oryza sativa. REFERENCE 1 (bases 1 to 2524) AUTHORS Moon,E., Kao,T.-h. and Wu,R. TITLE Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters from rice JOURNAL Nucleic Acids Res. 15, 4358-4359 (1987) STANDARD simple staff_review FEATURES from to/span description pept.ps 398 1392 atpB pseudogene pept.ps 1389 2302 atpE pseudogene BASE COUNT 769 a 458 c 564 g 733 t ORIGIN 1 cccccttttc ttattttgag tccaaatacc taaatactac gaaaattctc tgttggcagc 61 aatctatgct tcacagtagt atatattttg tatatcgaag tcctagataa gaaagtagag 121 taggcacaaa tcgtttacaa aaggcaaaat gtatatgaaa aaaagattga ttgaactttc 181 cgacgggctc attccatgag taaacgattg aatgggattc gtttgggcaa cgaaatcaag 241 tgctggtccc cttttctctc ttattgaatt aactaattca tttccttttg acttttggga 301 tttttggata tttttttggt gttgatttgg cattattcaa caagaaaaaa atcaaaattt 361 cgataaattc cttttttttg aaaattatgt gataattatg agaaccaatc ctactacttc 421 tcgtcccggg gtttctacaa ttgaagaaaa aagtacaggg cgtatcgatc aaattattgg 481 acccgtgctg gatgtcactt ttcccccggg caagttacct tatatttata atgctttggt 541 agtcaagagt cgagacactg agggtaagca aattaatgta acttgtgagg tacaacaatt 601 attaggaaat aatcgagtta gagctgtagc tatgagtgct acagatgggt tgatgagagg 661 aatggaagtg attgacacgg gagctcctct cagtgttcct gtcggtggag ctactcttgg 721 acgaattttc aacgttcttg gggagcctgt tgacaatttg ggtcctgtag atactagtgc 781 aacattccct attcatagat ccgcgcccgc ctttatcgag ttagatacga aattatccat 841 ctttgaaact ggtattaagg tggtcgatct tttagctcct tatcggcgtg gaggaaaaat 901 cggactattt gggggagctg gagtaggtaa aacagtactc atcatggaat taatcaacaa 961 tattgctaaa gctcacgggg gcgtatccgt atttggcgga gtaggggaac ggactcgtga 1021 aggaaatgat ctttatatgg aaatgaagga atctggagta attaatgaaa aaaatcttga 1081 ggaatcaaag gtagctctag tctatggcca aatgaatgaa ccgccaggag ctcgtatgag 1141 agttggtttg actgccctaa ctatggcaga atatttccga gatgttatta agcaagacgt 1201 gctctattca tcgataatat ctttcgtttt gttcaagcag gatcggaggt atctgcctta 1261 ttagggagaa tgccctctgc agtgggttat caacctactc ttagtacaga aatgggttct 1321 ttgcaagaaa gaattacttc tactaaaaag ggatctataa cttcgatcca agcggtttat 1381 gtacctgcgg acgatttgac cgaccctgct cctgctacaa catttgcaca tttggatgct 1441 actaccgtac tttccagagg attagcttcc aaagggattt atctgcagta gatcctttag 1501 attcaacctc aactatgtta caacctcgga tcgttggcaa cgaacattat gaaactgcgc 1561 aaagagttaa gcaaacttta caacgttaca aagaacttca ggacattatc gcaattcttg 1621 ggttggatga attatcggag gaggatcgtt taactgtagc aagagcacga aaaattgagc 1681 gcttcttatc acaaccgttt tttgtggcag aagtttttac cggttctcca ggaaagtatg 1741 ttggtcttgc agaaactatt aggggatttc aactaatcct ttccggagaa ttagacggcc 1801 tacccgaaca ggctttttat ttggtgggta acatcgatga agctagcacg aaagctataa 1861 acttagaaga ggagaacaac ttgaagaaat gaaattaaat ctttatgtac tgactcctaa 1921 gcgaattatt tgggattgtg aagtgaaaga aatcatttta tctactaata gtggccaaat 1981 tggcgtatta ccaaaccacg cccccattaa cacagctgta gatatgggtc ccttgagaat 2041 acgcctcctc aacgatcaat ggttaacggc ggttctgtgg agcggttttg ccagaatagt 2101 taataatgag atcatcattt taggaaatga tgcggaactg ggtagtgaca ttgatccgga 2161 agaagctcaa caggcacttg aaatagccga agctaacgtg agtagagctg agggtacgaa 2221 agaattggtt gaagcgaagg tagctctcag acgagctagg atacgagtcg aggctgttaa 2281 ttggattccc ccatctaatt gaagacaacc caacggttta gttgatacaa agaaaaaggg 2341 aagaggggta gaaaaaatta ttagatagcg aagcgaagta gggccaatgc tatctagtaa 2401 tttttctacc tacctaccta ctattggatt tgaaccaatg actcccgccg tatgaaagca 2461 atactctaac cactgagtta agtaggcaat ttatcaccac aaaggaagac cctttacttc 2521 gatc // LOCUS RICMTBEA 2281 bp ds-DNA ORG 01-AUG-1990 DEFINITION Rice mitochondrial beta and epsilon subunit (atpB and atpE) pseudogene, complete cds. ACCESSION M31466 Y00323 KEYWORDS pseudogene. SOURCE Rice mitochondrion DNA, clone Mt-0. ORGANISM Mitochondrion Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Oryza sativa. REFERENCE 1 (bases 1 to 2281) AUTHORS Moon,E., Kao,T.-h. and Wu,R. TITLE Sequence of the chloroplast-encoded atpB-atpE-trnM gene clusters from rice JOURNAL Nucleic Acids Res. 15, 4358-4359 (1987) STANDARD simple staff_review FEATURES from to/span description pept.ps 384 1657 atpB pseudogene pept.ps 1654 2067 atpE pseudogene BASE COUNT 710 a 416 c 488 g 667 t ORIGIN 1 cccctttctt attttgagtc caaataccta aatactatga aaattctctg ttgacagcaa 61 tctatgcttc acagtagtat atattttgta tatcgaagtc ctagataaga aatggagtag 121 gcacagatcc ttcacaaaag gcgaaatgta tatgaaaaaa agattgattg aactttccga 181 cggactcatg gaatgagtaa acgattgaat gggattcgtt tgggcaacga aatcaagtgc 241 tggtcccctt ttctctctta ttgaattaac taattcattt ccttttgact tttgttggat 301 ttttggatat ttttttggtg ttgatttggc attattcaac aagataaaaa gaaaaatttc 361 tataaattcc ttttttttta attatgagaa ccaatcctac tacttctcat cccggggttt 421 ctacaattga agaaaaaagt acagggcgta tcgatcaaat tattggaccc gtgctggatg 481 ccacttttcc cccgggcaag ttaccttata tttataacgc tttggtagtc gagacactga 541 gggtaagcaa attaatgtga cttgtgaggt acaacaatta ttaggaaata atcgagttag 601 aacgaaatta tccatctttg aaactggtat taaggtggtc gatcttttag ctccttatcg 661 gcgtggagga aaaatcggac tatttggggg aactggagta ggtaaaacag tactcatcat 721 ggaattaatc aacaatattg ctaaagctca tagaggcgta tccgtatttg gcggagtagg 781 ggaacggact cgtgaaggaa atgatcttta tatggaaata aaggagtaat taatgaaaaa 841 aatccttgag gaatcaaagg tagctctagt ctatggccaa atgaatgaac gccaggagct 901 cgtatgagag ttggtttgac tgccctaact atggcagaat atttccgaga tgttattaag 961 caagacgtgc ttctattcat cgataatatc tttcgttttg ttcaagcagg atcgggggta 1021 tttgccttat tagggagaat gccctctgca gtgggttatc aacctactct tagtacagaa 1081 atgggttctt tgcaagaaag aattacttct actaaaaagg gatctataac ttcgatccaa 1141 gcggtttatg tacctgcgga cgatttgacc gaccctgctc ctgccacaac atttgcacat 1201 ttggatgcta ctaccgtact ttccagagga ttagcttcca agggtattta tcctagatcc 1261 tttagattca acctcaacta tgttacaacc tcggatcgtt ggcaacgaac attatgaaac 1321 tgcgcaaaga gttaagcaaa ctttacaacg ttacaaagaa cttcaggaca ttatcgcaat 1381 tcttgggttg gatgaattat cggaggagga tcgtttaact gtagcaagag cacgaaaaat 1441 tgagcgcttc ctatcacaac cgttctttgt ggcagaagtt tttaccggtt ctccaggaaa 1501 gtatgttggt cttgcagaaa caattcgggg atttcaacta atcctttccg gagaattaga 1561 cggcctaccc gaacaggctt tttatttggt gggtaacatc gatgaagcta gcacgaaagc 1621 tataaactta gaagaggaaa acaacttgaa gaaatgaaat taaatcttta tgtactgact 1681 cctaagcgaa ttatttggga ttgtgaagtg aaagaaatca ttttttctac taatagtggc 1741 caaattggcg tattaccaaa ccacgccccc attaacacag ctgtagatat gggtcccttg 1801 agaatacgcc tcctcaacga tcaatggtta acggcggttc tgtggagcgg ttttgccaga 1861 atagttaata atgagatcat cattttagga aatgatgcgg aactgggtag tgacattgat 1921 ccggaagaag ctcaacaggc acttgaaata gccgaagcta acgtgagtag agctgagggt 1981 acgaaagaat tggttgaagc gaacgtagct ctcagacgag ctgggatacg agtcgaggct 2041 gttaattgga ttcccccatc taattgaaga caatccaacg gtttagttga tacaaagaaa 2101 aagggtctaa aaagttatta gatagcgaag cgaagtaagt ccaatgctat ctagtaattt 2161 ttctacctac ctacctacta ttggatttga accaatgact cccgccgtat gaaagcaata 2221 ctctaaccac tgagttaagt aggcaattta tcaccacaaa ggaagaccct ttacttcgat 2281 c //
GenBank-Updates@genbank.bio.net (08/02/90)
LOCUS ECOALPHOA 171 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33724 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 171) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y. TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia coli JOURNAL Gene 44, 121-125 (1986) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description site 1 141 phoA coding sequence /note="upstream sequence of psiA44:Mu dI" /evidence=EXPERIMENTAL [1],[2] site 141 142 Mu dI insertion site /evidence=EXPERIMENTAL /note="psiA44:Mu dI insertion site" [1] trns 142 > 171 Mu dI-R /evidence=EXPERIMENTAL [1] BASE COUNT 49 a 45 c 51 g 26 t ORIGIN 1 caaagctccg ggcctcaccc aggcgctaaa taccaaagat ggcgcagtga tggtgatgag 61 ttacgggaac tccgaagagg attcacaaga acataccggc agtcagttgc gtattgcggc 121 gtatggcccg catgccgcca atgaagcggc gcacgaaaaa cgcgaaagcg t // LOCUS ECOALPHOB 183 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33725 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 183) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y. TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia coli JOURNAL Gene 44, 121-125 (1986) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description site 1 153 phoA coding sequence /note="upstream sequence of psiA10:Mu dI" /evidence=EXPERIMENTAL [1],[2] site 153 154 Mu dI insertion site /evidence=EXPERIMENTAL /note="psiA10::Mu dI insertion site" [1] trns 154 > 183 Mu dI-R /evidence=EXPERIMENTAL [1] BASE COUNT 55 a 39 c 39 g 50 t ORIGIN 1 ctgtcataaa gttgtcacgg ccgagactta tagtcgcttt gtttttattt tttaatgtat 61 ttgtacatgg agaaaataaa gtgaaacaaa gcactattgc actggcactc ttaccgttac 121 tgtttacccc tgtgacaaaa gcccggacac cagtgaagcg gcgcacgaaa aacgcgaaag 181 cgt // LOCUS ECOALPHOD 242 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33726 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 242) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y. TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia coli JOURNAL Gene 44, 121-125 (1986) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description site 1 212 phoA coding sequence /note="upstream sequence of psiA40" /evidence=EXPERIMENTAL [1],[2] site 212 213 Mu dI insertion site /evidence=EXPERIMENTAL /note="psiA40:Mu dI insertion site" [1] trns 213 > 242 Mu dI-R /evidence=EXPERIMENTAL [1] BASE COUNT 59 a 61 c 64 g 58 t ORIGIN 1 attgcactgg cactcttacc gttactgttt acccctgtga caaaagcccg gacaccagaa 61 atgcctgttc tggaaaaccg ggctgctcag ggcgatatta ctgcacccgg cggtgctcgc 121 cgtttaacgg gtgatcagac tgccgctctg cgttattctc ttagcgataa acctgcaaaa 181 aatattattt tgctgattgg cgatgggatg ggtgaagcgg cgcacgaaaa acgcgaaagc 241 gt // LOCUS ECOALPHOE 97 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33727 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 97) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y. TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia coli JOURNAL Gene 44, 121-125 (1986) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description site 1 67 phoA coding sequence /note="upstream sequence of psiA5::Mu dI. psiA30 upstream region is identical" /evidence=EXPERIMENTAL [1],[2] site 67 68 Mu dI insertion site /evidence=EXPERIMENTAL /note="psiA5:Mu dI insertion site; psiA30 upstream region is identical" [1] trns 68 > 97 Mu dI-R /evidence=EXPERIMENTAL [1] BASE COUNT 34 a 13 c 22 g 28 t ORIGIN 1 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt 61 gcactggtga agcggcgcac gaaaaacgcg aaagcgt // LOCUS ECOALPHOF 82 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33728 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 82) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T. TITLE Molecular biology of carbon-phosphorus bond cleavage: Cloning and sequencing of the phn (psiD) genes involved in alkylphosphonate uptake and C-P lyase activity in Escherichia coli B JOURNAL J. Biol. Chem. 265, 4461-4471 (1990) STANDARD full staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description site 1 52 phn operon sequence /note="upstream sequence of psiD33; psiD41 is identical" /evidence=EXPERIMENTAL [1],[2] site 52 53 Mu dI insertion site /evidence=EXPERIMENTAL /note="psiD33:Mu dI insertion site; psiD41 is identical" [1] trns 53 > 82 Mu dI-R /evidence=EXPERIMENTAL [1] BASE COUNT 28 a 19 c 22 g 13 t ORIGIN 1 attgaggaaa acgaatgaac gctaagataa ttgcctcgct ggccttcacc agtgaagcgg 61 cgcacgaaaa acgcgaaagc gt // LOCUS ECOALPHOG 94 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33729 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 94) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y. TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia coli JOURNAL Gene 44, 121-125 (1986) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description site 1 64 phoA coding sequence /note="upstream sequence of psiA4" /evidence=EXPERIMENTAL [1],[2] site 65 65 Mu dI insertion site /evidence=EXPERIMENTAL /note="psiA4:Mu dI insertion site" [1] trns 65 > 94 Mu dI-R /evidence=EXPERIMENTAL [1] BASE COUNT 35 a 18 c 22 g 19 t ORIGIN 1 aatgtatttg tacatggaga aaataaagtg aaacaaagca ctattgcact ggcactctta 61 ccgttgaagc ggcgcacgaa aaacgcgaaa gcgt // LOCUS ECOALPHOH 233 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33730 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 233) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Chen,C.-M., Ye,Q.-Z., Zhu,Z., Wanner,B.L. and Walsh,C.T. TITLE Molecular biology of carbon-phosphorus bond cleavage: Cloning and sequencing of the phn (psiD) genes involved in alkylphosphonate uptake and C-P lyase activity in Escherichia coli B JOURNAL J. Biol. Chem. 265, 4461-4471 (1990) STANDARD full staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description site 1 203 phn operon sequence /note="upstream sequence of psiD31" /evidence=EXPERIMENTAL [1],[2] site 203 204 Mu dI insertion site /evidence=EXPERIMENTAL /note="Mu dI insertion site into phnD gene (psiD31::Mu dI)" [1] trns 204 > 233 Mu dI-R /evidence=EXPERIMENTAL [1] variant 201 201 g in E.coli; a in E.coli B variant 174 174 c in E.coli; g in E.coli B variant 141 141 g in E.coli; a in E.coli B variant 135 135 g in E.coli; a in E.coli B variant 76 76 t in E.coli; c in E.coli B variant 51 51 g in E.coli; a in E.coli B BASE COUNT 62 a 64 c 66 g 41 t ORIGIN 1 tcgctggcct tcaccagcat gttcagcctc agcaccctgt taagcccggc gcacgccgaa 61 gagcaggaaa aggcgttgaa tttcggcatt atttcaacgg aatcacagca aaacctgaaa 121 ccgcaatgga cgccgttctt gcaggatatg gagaagaagc tgggcgtgaa ggtcaacgcc 181 ttctttgccc cggactacgc gggtgaagcg gcgcacgaaa aacgcgaaag cgt // LOCUS ECOALPHOJ 88 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33732 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y. TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia coli JOURNAL Gene 44, 121-125 (1986) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description site 1 58 phoA coding sequence /note="upstream sequence of psiA12; upstream sequence of psiA13::Mu dI is identical" /evidence=EXPERIMENTAL [1],[2] site 58 59 Mu dI insertion site /evidence=EXPERIMENTAL /note="psiA12:Mu dI insertion site; psiA13::Mu dI is identical" [1] trns 59 > 88 Mu dI-R /evidence=EXPERIMENTAL [1] BASE COUNT 33 a 13 c 21 g 21 t ORIGIN 1 ttttttaatg tatttgtaca tggagaaaat aaagtgaaac aaagcactat tgcactggtg 61 aagcggcgca cgaaaaacgc gaaagcgt // LOCUS ECOALPHOK 66 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33733 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 66) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Chang,C.N., Kuang,W.-J. and Chen,E.Y. TITLE Nucleotide sequence of the alkaline phosphatase gene of Escherichia coli JOURNAL Gene 44, 121-125 (1986) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 36 psiF (orf-106) coding sequence /pseudo /evidence=EXPERIMENTAL [1],[2] trns 37 > 66 Mu dI-R BASE COUNT 26 a 17 c 15 g 8 t ORIGIN 1 cgcgaatgca ataatcaagc aacacaacaa tctctgtgaa gcggcgcacg aaaaacgcga 61 aagcgt // LOCUS ECOALPHOL 67 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33734 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 67) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 37 psiR43 coding sequence /pseudo trns 38 > 67 Mu dI-R BASE COUNT 23 a 18 c 17 g 9 t ORIGIN 1 ctctaattag gacaacagaa tcactcggta ccggacctga agcggcgcac gaaaaacgcg 61 aaagcgt // LOCUS ECOALPHOM 106 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33735 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 106) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Overduin,P., Boos,W. and Tommassen,J. TITLE Nucleotide sequence of the ugp genes of e. coli k-12: homology to the maltose system JOURNAL Mol. Microbiol. 2, 767-775 (1988) STANDARD simple staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 76 ugpB coding sequence /pseudo /evidence=EXPERIMENTAL [1],[2] trns 77 > 106 Mu dI-R BASE COUNT 41 a 26 c 22 g 17 t ORIGIN 1 gcccaacgtt ttaacgccga aaacccggat tacaaaattg taccgaccta taaaggcaac 61 tacgaacaga atttaatgaa gcggcgcacg aaaaacgcga aagcgt // LOCUS ECOALPHON 67 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33736 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 67) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 37 psiH6 coding sequence /pseudo [1] trns 38 > 67 Mu dI-R BASE COUNT 22 a 14 c 17 g 14 t ORIGIN 1 ggcatcatga ctatcgtaag tttgatctat cacatagtga agcggcgcac gaaaaacgcg 61 aaagcgt // LOCUS ECOALPHOO 82 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33737 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 82) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Oliver,G., Gosset,G., Sanchez-Pescador,R., Lozoya,E., Ku,L.M., Flores,N., Becerril,B., Valle,F. and Bolivar,F. TITLE Determination of the nucleotide sequence for the glutamate synthase structural genes of Escherichia coli K-12 JOURNAL Gene 60, 1-11 (1987) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 52 gltD coding sequence /pseudo /evidence=EXPERIMENTAL [1],[2] trns 53 > 82 Mu dI-R BASE COUNT 26 a 21 c 28 g 7 t ORIGIN 1 gaacatgccg ggttcccgcc gcgaagtgaa aaacgcccgg gaagaaggcg tatgaagcgg 61 cgcacgaaaa acgcgaaagc gt // LOCUS ECOALPHOP 116 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33738 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 116) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 86 psiK20 coding sequence /pseudo [1] trns 87 > 116 Mu dI-R BASE COUNT 31 a 26 c 34 g 25 t ORIGIN 1 gcgactcaga gagatcagca agtcagcact accgaccgtg ggcagttttc aaaaggtcat 61 cggtagtagt ttttgcgttc tgcgtttgaa gcggcgcacg aaaaacgcga aagcgt // LOCUS ECOALPHOQ 70 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33739 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 70) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Cole,S.T., Eiglmeier,K., Ahmed,S., Honore,N., Elmes,L., Anderson,W.F. and Weiner,J.H. TITLE Nucleotide sequence and gene-polypeptide relationships of the glpABC operon encoding the anaerobic sn-glycerol-3-phosphate dehydrogenase of Escherichia coli K-12 JOURNAL J. Bacteriol. 170, 2448-2456 (1988) STANDARD simple staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 40 glpB coding sequence /pseudo [1],[2] trns 41 > 70 Mu dI-R BASE COUNT 19 a 20 c 21 g 10 t ORIGIN 1 acaacgcgtg ctcgatctcg cttgccaggc gcaggcatta tgaagcggcg cacgaaaaac 61 gcgaaagcgt // LOCUS ECOALPHOR 78 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33740 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 78) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Oliver,G., Gosset,G., Sanchez-Pescador,R., Lozoya,E., Ku,L.M., Flores,N., Becerril,B., Valle,F. and Bolivar,F. TITLE Determination of the nucleotide sequence for the glutamate synthase structural genes of Escherichia coli K-12 JOURNAL Gene 60, 1-11 (1987) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 48 gltB coding sequence /pseudo /evidence=EXPERIMENTAL [1],[2] trns 49 > 78 Mu dI-R BASE COUNT 20 a 21 c 24 g 13 t ORIGIN 1 cgaactgatg gttatcgaca cccgcagtgg gcgtattctg cactcggctg aagcggcgca 61 cgaaaaacgc gaaagcgt // LOCUS ECOALPHOS 87 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33741 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 87) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 57 paiJ22 coding sequence /pseudo [1] trns 58 > 87 Mu dI-R BASE COUNT 28 a 20 c 22 g 17 t ORIGIN 1 agactatgag acggaatgaa gaactgttca tcacattctt tattccgagc cgctacgtga 61 agcggcgcac gaaaaacgcg aaagcgt // LOCUS ECOALPHOT 81 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33742 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 81) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Davis,E.O. and Henderson,P.J.F. TITLE The cloning and DNA sequence of the gene xylE for xylose-proton symport in Escherichia coli K12 JOURNAL J. Biol. Chem. 262, 13928-13932 (1987) STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 51 psiE coding sequence /pseudo [1],[2] trns 52 > 81 Mu dI-R BASE COUNT 25 a 19 c 24 g 13 t ORIGIN 1 gtgcatctgg ctgatgtgct gttcgcgcca gaacaaacca gcaaatatga gtgaagcggc 61 gcacgaaaaa cgcgaaagcg t // LOCUS ECOALPHOU 69 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33743 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 69) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Oliver,G., Gosset,G., Sanchez-Pescador,R., Lozoya,E., Ku,L.M., Flores,N., Becerril,B., Valle,F. and Bolivar,F. TITLE Determination of the nucleotide sequence for the glutamate synthase structural genes of Escherichia coli K-12 JOURNAL Gene 60, 1-11 (1987) STANDARD simple staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 39 gltB coding sequence /evidence=EXPERIMENTAL [1],[2] trns 40 > 69 Mu dI-R [1] BASE COUNT 19 a 15 c 21 g 14 t ORIGIN 1 ctttgacttt aactccatgc atatggagcc gtgggatggt gaagcggcgc acgaaaaacg 61 cgaaagcgt // LOCUS ECOALPHOV 94 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33744 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 94) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 64 psiH16 coding sequence /pseudo [1] trns 65 > 94 Mu dI-R [1] BASE COUNT 19 a 26 c 29 g 20 t ORIGIN 1 ccatcttagc agcgtctggt ccgggtcggc ttaggggctt cctttatgca gtactgcctg 61 ccgatgaagc ggcgcacgaa aaacgcgaaa gcgt // LOCUS ECOALPHOW 59 bp ds-DNA BCT 02-AUG-1990 DEFINITION Escherichia coli Mu dI insertion site. ACCESSION M33745 KEYWORDS alkaline phosphatase. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 59) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by P.M.Steed 13-APR-1990. FEATURES from to/span description site 1 29 psiJ49 coding sequence /pseudo [1] trns 30 > 59 Mu dI-R [1] BASE COUNT 15 a 13 c 18 g 13 t ORIGIN 1 ccgattgcgg ctgtttatgg gtatctatct gaagcggcgc acgaaaaacg cgaaagcgt // LOCUS SYNFUSAAA 240 bp ds-DNA SYN 02-AUG-1990 DEFINITION Bacteriophage Mu fusion gene of the trp operon. ACCESSION M33723 KEYWORDS . SOURCE Bacteriophage Mu and E.coli lacZ DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 240) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Allet,B. TITLE Mu insertion duplicates a five base pair sequence at the host inserted site JOURNAL Cell 16, 123-129 (1979) STANDARD simple staff_entry REFERENCE 3 (sites) AUTHORS Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E., Horowitz,H., van Cleemput,M. and Wu,A.M. TITLE The complete nucleotide sequence of the tryptophan operon of Escherichia coli JOURNAL Nucleic Acids Res. 9, 6647-6668 (1981) STANDARD full staff_entry COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description signal 1 104 stem_loop, Mu dI-R [1] site 105 240 trp operon (partial) /note="trp operon sequence present in Mu dI is derived from the trpA-lacZ W209 protein fusion and contains last base of trpC, all of trpB and 187 bases of trpA" [1],[3] site 1 48 MuR derived sequence [1],[2] site 104 43 (c) MuR derived sequence [1],[2] BASE COUNT 67 a 54 c 57 g 62 t ORIGIN 1 tgaagcggcg cacgaaaaac gcgaaagcgt ttcacgataa atgcgaaacg taaagttttc 61 gcatttatcg tgaaacgctt tcgcgttttt cgtgcgccgc ttcattaagg aaaggaacaa 121 tgacaacatt acttaacccc tattttggtg agtttggcgg catgtacgtg ccacaaatcc 181 tgatgcctgc tctgcgccag ctggaagaag cttttgtcag tgcgcaaaaa gatcctgaat // LOCUS SYNFUSAAB 53 bp ds-DNA SYN 02-AUG-1990 DEFINITION Bacteriophage Mu fusion gene of the trp operon. ACCESSION M33731 KEYWORDS . SOURCE Bacteriophage Mu and E.coli lacZ DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 53) AUTHORS Metcalf,W.W., Steed,P.M. and Wanner,B.L. TITLE Identification of phosphate-starvation-inducible genes in Escherichia coli K-12 by DNA sequence analysis of psi::lacZ(Mu dI) transcriptional fusions JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Yanofsky,C., Platt,T., Crawford,I.P., Nichols,B.P., Christie,G.E., Horowitz,H., van Cleemput,M. and Wu,A.M. TITLE The complete nucleotide sequence of the tryptophan operon of Escherichia coli JOURNAL Nucleic Acids Res. 9, 6647-6668 (1981) STANDARD full staff_entry REFERENCE 3 (sites) AUTHORS Kalnins,A., Otto,K., Ruether,U. and Mueller-Hill,B. TITLE Sequence of the lacZ gene of Escherichia coli JOURNAL EMBO J. 2, 593-597 (1983) STANDARD full staff_review COMMENT Authorin copy of sequence for [1] kindly submitted by W.W.Metcalf 13-APR-1990. FEATURES from to/span description site 1 9 lacZ coding sequence /note="lacZ sequence begins with third codon of LacZ protein" [1],[3] site 53 10 (c) trpA coding sequence BASE COUNT 13 a 14 c 17 g 9 t ORIGIN 1 atccgtaatc gccagtgggt cggagaaggg gatacctaac tccagcgcgt cag // LOCUS ATH60CPNA 735 bp ss-mRNA PLN 02-AUG-1990 DEFINITION A.thaliana plastid 60-kDa chaperonin-60 alpha polypeptide (cpn-60 alpha) mRNA, partial cds. ACCESSION M35597 KEYWORDS chaperonin-60. SOURCE A.thaliana (cv. Columbia), cDNA to mRNA, clone pTZ19R. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 735) AUTHORS Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M. TITLE Unique composition of plastid chaperonin-60: Alpha and beta polypeptide-encoding genes are highly divergent JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.M.Hemmingsen, 26-JUN-1990. FEATURES from to/span description pept < 1 > 735 60-kDa chaperonin-60 alpha-polypeptide (AA at 1) BASE COUNT 187 a 139 c 196 g 213 t ORIGIN 1 aagcttgctg attgtgttgg tctcactctt ggccctagag ggaggaatgt tgtgttggat 61 gaatttggaa gtcctaaggt tgtgaatgat ggagtcacca ttgctagagc tattgagtta 121 cctaatgcca tggaaaatgc tggtgcagcg cttattcgtg aggttgcgag taagactaat 181 gactctgctg gtgatgggac aaccactgcg tctatccttg ctcgggagat aatcaaacat 241 ggattgttga gcgtcacttc tggtgcgaat cccgtttcac tcaagagggg aattgataag 301 actgttcaag gtctgatcga agagcttcag aagaaagcta ggcccgtgaa aggtcgtgta 361 gactacagag ctgtggcttc tatctctgct ggaaatgatg accttatagg gtcaatgatt 421 gctgatgcca tcgacaaagt tggacctgat ggtgttttgt ccattgaatc ttcatcttct 481 tttgagacca cggtcgaagt tgaagaaggg atggagattg acagaggtta catctcgcct 541 cagtttgtta caaaccctga gaaactacta gctgagtttg agaatgctag ggtattgatc 601 actgatcaga agatcactgc gatcaaagac atcatcccta ttttggagaa gaccactcag 661 cttcgtgctc cgttgttgat tattgcagag gatgttactg gtgaagcctt agctaccctt 721 gtcgtgaaca agctt // LOCUS ATH60CPNB 559 bp ss-mRNA PLN 02-AUG-1990 DEFINITION A.thaliana plastid 60-kDa chaperonin-60 beta-polypeptide (cpn-60 beta) mRNA, partial cds. ACCESSION M35598 KEYWORDS chaperonin-60. SOURCE A.thaliana (cv. Columbia), cDNA to mRNA, clone ptz19R. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 559) AUTHORS Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M. TITLE Unique composition of plastid chaperonin-60: Alpha and beta polypeptide-encoding genes are highly divergent JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.M.Hemmingsen, 26-JUN-1990. FEATURES from to/span description pept < 1 > 559 60-kDa chaperonin-60 beta-polypeptide (AA at 1) BASE COUNT 166 a 96 c 155 g 142 t ORIGIN 1 aagcttgtga ggcaagcggc tgccaaaacc aatgacttgg ctggtgacgg tacaacaaca 61 tctgttgttc ttgcacaagg ttttatcgct gagggtgtca aggtggtagc tgctggtgca 121 aaccctgttt tgatcaccag aggtattgag aagacagcaa aggctttggt taccgagctc 181 aagaaaatgt ctaaggaggt tgaagacagt gaacttgcag atgttgcagc tgttagtgcg 241 ggtaacaacg acgaaattgg aaatatgatt gctgaagcaa tgagcaaagt gggcaggaaa 301 ggtgtggtga cccttgagga gggtaaaagt gccgagaaca accttcacgt cgtagaagga 361 atgcaatttg accgtggtta tatctctcct tactttgtga cagacagcga gaaaatgtcc 421 gtcgagttcg acaattgcaa gttacttctc gttgacaaga aaattaccaa tgcaagggat 481 cttgttggtg ttttggagga tgcaattaga ggcggatacc caatattaat tattgctgaa 541 gacattgaac aagaagctt // LOCUS BNA60CPNA 1783 bp ss-mRNA PLN 02-AUG-1990 DEFINITION B.napus plastid 60-kDa chaperonin-60 alpha-polypeptide (cpn-60 alpha) mRNA, partial cds. ACCESSION M35599 KEYWORDS chaperonin-60 alpha-polypeptide. SOURCE B.napus (cv. Westar) plastid chaperonin-60, cDNA to mRNA. ORGANISM Brassica napus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 1783) AUTHORS Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M. TITLE Unique composition of plastid chaperonin-60: Alpha and beta polypeptide-encoding genes are highly divergent JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.M.Hemmingsen, 26-JUN-1990. FEATURES from to/span description pept < 1 1638 60-kDa chaperonin-60 alpha-polypeptide precursor (AA at 1) sigp < 1 21 60-kDa chaperonin-60 alpha-polypeptide signal peptide matp < 1 1635 60-kDa chaperonin-60 alpha-polypeptide BASE COUNT 460 a 395 c 486 g 442 t ORIGIN 1 cgccgtttca gcgttagagc taatgtaaaa gagatctctt tcgaccagag ctcaagagct 61 gctcttcaag ctggtattga caagcttgct gatgccgttg gtcttaccct tggccctaga 121 ggaagaaatg ttgtgttgga tgagtttgga agtcccaagg tcgtgaatga tggagtcacc 181 attgctaggg ccattgagtt acctgacgct atggagaatg ctggtgcagc acttatccgt 241 gaggttgcta gtaagactaa tgactcagct ggtgacggca caaccactgc ctccgtcctt 301 gctcgggaga taatcaaaca cggcttacta agcgtcactt ccggtgccaa tccagtctca 361 ctcaagagag gaatcgataa gaccgttcaa gctttgatcg aagagctcga gaagagggct 421 agacctgtca aaggcggtag tgacatcaaa gccgtggcta caatctctgc tggaaatgat 481 gagcttgtgg gaacaatgat tgctgatgcc attgataaag ttggacctga tggtgttttg 541 tccattgaat cttcttcctc ctttgagact accgtcgaag ttgaagaagg aatggagatt 601 gatagaggtt acatctcacc acagtttgtt acaaaccctg agaagctcct agtcgagttc 661 gagaacgcgc gcgtgctgat caccgatcag aagatcactg cgatcaaaga catcatcccg 721 atcctggaga aaaccactca gctccgagct ccgttgctca tcatcgcgga ggatgtcacc 781 ggcgaggccc tggcgaccct cgtcgtgaac aaactccgcg gcgtgctcaa cgtggtcgcc 841 gtcaaggctc cagggtttgg tgaaagaaga aaagctatgc ttcaggacat tgccatcttg 901 acggggccga gtacgaccct cgacatgggc cttctggtgg agaacacgac catagatcag 961 ttggggatcg ctaggaaagt cactattagc aaagactcga ccacgcttat agctgatgcg 1021 gcttccaagg agaccctgca agctcgtatc tcccagctta agaaggagct gtcagagact 1081 gactccgtgt acgactcgga gaagctcgcc gagagaatgc ccaagctgcc aggcggtgtt 1141 gccgttatta aagtcggagc ggcgactgag actgagctcg aggaccgtaa gcttcgtatc 1201 gaggacgcga agaacgctac atttgctgca atcgaagaag ggatagttcc tggcggcggt 1261 gccacgctgg tgcatctatc aactgtgatt cctgctatta aggagaagct tgaggatgct 1321 gatgagcgtt tgggagccga catagtacag aaggctttgg tggcaaggtc gcttattgca 1381 cagaacgctg gaataggagg agaagttgtt gtggagaaga ttatgttcag cgaatgggag 1441 atagggtaca acgccatgac tgatacatat gagaatctgt tggaagctgg agtgattgat 1501 ccagctaaag tgacgagatg tgcgcttcag aacgctgctt cggttgcggg gatggtgctg 1561 accactcagg ccattgttgt cgacaaaccg aaacctaagg ctcctactgc tgctcctcct 1621 cagggtctca tggtgtaatg aaatcactta tcatatggct atatttgttg tccccaatta 1681 tatctgcaat atttttctct ggggaaatga gtagtttgtt taacggtaat gagaatatca 1741 tctttttgat ttatgatttg caaaataaaa gtttatattc aat // LOCUS BNA60CPNB 1982 bp ss-mRNA PLN 02-AUG-1990 DEFINITION B.napus plastid 60-kDa chaperonin-60 beta-polypeptide (cpn-60 beta) mRNA, partial cds. ACCESSION M35600 KEYWORDS chaperonin-60 beta-polypeptide. SOURCE B.napus (cv. Westar) plastid chaperonin-60, cDNA to mRNA. ORGANISM Brassica napus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 1982) AUTHORS Martel,R., Cloney,L.P., Pelcher,L.E. and Hemmingsen,S.M. TITLE Unique composition of plastid chaperonin-60: Alpha and beta polypeptide-encoding genes are highly divergent JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.M.Hemmingsen, 26-JUN-1990. FEATURES from to/span description pept 64 1830 60-kDa beta-polypeptide of plastid chaperonin-60 precursor sigp 64 228 60-kDa beta-polypeptide of plastid chaperonin-60 signal peptide matp 229 1827 60-kDa beta-polypeptide of plastid chaperonin-60 BASE COUNT 564 a 372 c 547 g 499 t ORIGIN 1 cctgctcccc agtttcttat ccgtggtgct tctccgcctc ttcgcttcac ttctcttgaa 61 aagatggcat caaccttcac tgcaacgtct tccattggct ccatggttgc tccaaatgcc 121 cacaaaacgg ataagaagct tatgaacaag ttgtcttcaa gctctttcgg gaggaggcag 181 aacgtgttcc ccaagctcag aagatccagt cctgcagttg tatgtgcagc caaggagttg 241 catttcaaca aagacgggac taccatcagg aagcttcaag ctggtgtcaa caagcttgca 301 gacctagttg gtgtgacact tggacctaaa gggcgaaacg ttgttcttga gagcaagtat 361 ggatcaccaa gaattgtcaa tgatggtgtg actgttgcaa gggaggttga attggaagac 421 cctgttgaga acattggtgc caaacttgtt aggcaagcag ctgccaagac caatgatctg 481 gccggtgatg gtaccacaac atctgtggtt cttgcacaag gttttattgc tgagggtgtc 541 aaggtggtgg ctgctggtgc aaaccctgta ttgatcacta gaggcattga gaagacagca 601 aaggctttgg tagccgagct caagaaaatg tctaaggagg ttgaagacag tgagcttgca 661 gatgtggcag ccgttagtgc cggtaacaat gcagaaattg gaagcatgat tgctgaagca 721 atgagcagag tgggcaggaa gggtgtggtg acacttgagg agggtaaaag tgcagagaac 781 gctctctacg tggtggaagg aatgcaattt gatcgaggtt atgtctcccc ttactttgtg 841 acagacagcg agaaaatgtc agttgagttc gacaattgca agttgcttct tgttgacaag 901 aaaattacca atgcaaggga tcttgttggt gttctggagg atgcaattag aggaggatac 961 ccaattttaa taattgcgga agacattgag caggaggctt tagcgaccct tgttgttaac 1021 aagcttagag gcacactgaa gattgcagct ctcaaagctc caggatttgg agagcgcaag 1081 agccaatacc ttgatgatat tgccatcctc actggagcga ctgtgattcg tgaggaagtt 1141 ggtctttcac ttgacaaagc tggaaaagag gttcttggac atgcagcaaa agttgtcctc 1201 actaaggaga cttcgaccat tgtgggtgat gggagcacac aggacgcagt gcaaaagcgt 1261 gttacacaaa ttaagaacct tattgagcaa gcagagcaag actatgagaa ggaaaaactg 1321 aatgagagaa ttgcaaagct ctctggtgga gttgctgtga ttcaggtcgg agcacaaact 1381 gagacagaac tcaaagagaa gaaactgaga gttgaagatg ctcttaatgc tacaaaggct 1441 gctgttgagg aaggaattgt cgttggtggt ggttgtactc tgcttcgcct tgcttccaag 1501 gtggatgcca ttaaagccgc ccttgacaat gatgaagaaa aggttggagc cgacatcgtg 1561 aaaagagcac tgagttaccc tctgaaactg attgccaaga atgcaggagt caatggaagc 1621 gtagttagcg agaaggtgct ttctaacgat aatgtgaagt tcggttacaa tgctgcaacc 1681 ggcaagtacg aggatctaat ggctgcagga atcatcgatc caactaaggt tgtgagatgt 1741 tgcttggaac atgcagcttc ggttgcaaag acattcttga tgtctgactg tgtggttgtt 1801 gagatcaagg agcctgagcc agttccgtag gcaaccaatg gacaactcag gttatggata 1861 ctgaggagat gggggaatga agtaactctg tcggaaattg attcctttta gcttttgtaa 1921 tagatgagca agttgtaatg atttcgaaca aaagaatgtg ctgtgtactg aaattttgaa 1981 cc // LOCUS HS4RAJI 11835 bp ds-DNA VRL 02-AUG-1990 DEFINITION Epstein-Barr virus B95-8 deletion region. ACCESSION M35547 KEYWORDS . SOURCE Epstein-Barr virus (strain Raji) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 11835) AUTHORS Parker,B.D., Bankier,A., Satchwell,S., Barrell,B. and Farrell,P.J. TITLE Sequence and transcription of Raji Epstein-Barr virus DNA spanning the B95-8 deletion region JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Farrell, 25-JUN-1990. FEATURES from to/span description pept 3625 851 (c) LF3 gene product pept 10605 9316 (c) LF2 gene product pept > 11835 10566 (c) LF1 gene product (AA at 11834) orgrpl 3554 4609 origin of replication BASE COUNT 1948 a 3315 c 3869 g 2703 t ORIGIN 1 ggccgctgtt cacctaaagt gacgcaaggt ctgtcagccg ccagggtccg tttaccaggc 61 tttcaggtgt ggaatttaga tagagtgggt gtgtgctctt gtttaattac accaagatca 121 ccaccctcta tccatatccc acaattgata aacctccgca tgtccaacca ccacgttgaa 181 caggatgtgg caccctaaga ggacgcaggc atacaaggtt attacccagt ccttgtatgc 241 ctggtgtccc cttagtggga cgcaggccta ggtagcatca tttacactaa aagcagtgac 301 cttgttggta ctttaaggtt ggtccaatcc ataggctttt tttgtgaaaa cccggggatc 361 ggactagcct tagagtaact caaggccaag catttcacac ctgcaaatgc accatgtaac 421 cacagatcta aactgaaagt tgcagcttta gatggcaagg aaacttgggt ttcaggcata 481 gaaagcctgg ctcactatag cagcccatgt ttgttccagg gtgggggaaa ggcacgtgcc 541 cttagaaaac ttagctgcaa aaattctatt gtgttgggag agcctctata tctaaaggcc 601 tttcctcaca atacaaatgt tactaacgtc tgccctctgg agacctgcta tgtggctaga 661 cgtatggcct acccaagacg ttgggggtct cgggtaggcc atgattcttc caggcatagg 721 ttacaaccag tcactgctat caagcctact cagttcccaa cgcagcacat accccccgcc 781 tctcctgcca tgaggactta tggcagtgtt tactgttctg cttttactct tggaccaggc 841 tgtcattcta tcagaataac aggggaagca aggccccctg cttcagcggg acacgtgttt 901 ctagaatctc ggagccaata actacctgcc cctctaatct gtatgctgca tgaaaaacca 961 catacacgtg atgtaagttt agccagttta ttgttacacc aatgccccga aagtctcccc 1021 ctgtcccttt gggtctcagg acccagccct ggagctcggg ggcggccggg tggcccaccg 1081 ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg 1141 gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg 1201 ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc 1261 ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg 1321 tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc 1381 ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg 1441 tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt 1501 ccgctgcccc gctccggcgg ggggtggccg gctgcagccg ggtccggggt tccggccctg 1561 gagctcgggg ggcggccggg tggcccaccg ggtccgctgg gtccgctgcc ccgctccggc 1621 ggggggtggc cggctgcagc cgggtccggg gttccggccc tggagctcgg ggggcggccg 1681 ggtggcccac cgggtccgct gggtccgctg ccccgctccg gcggggggtg gccggctgca 1741 gccgggtccg gggttccggc cctggagctc ggggggcggc cgggtggccc accgggtccg 1801 ctgggtccgc tgccccgctc cggcgggggg tggccggctg cagccgggtc cggggttccg 1861 gccctggagc tcggggggcg gccgggtggc ccaccgggtc cgctgggtcc gctgccccgc 1921 tccggcgggg ggtggccggc tgcagccggg tccggggttc cggccctgga gctcgggggg 1981 cggccgggtg gcccaccggg tccgctgggt ccgctgcccc gctccggcgg ggggtggccg 2041 gctgcagccg ggtccggggt tccggccctg gagctcgggg ggcggccggg tggcccaccg 2101 ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg 2161 gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg 2221 ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc 2281 ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg 2341 tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc 2401 ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg 2461 tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt 2521 ccgctgcccc gctccggcgg ggggtggccg gctgcagccg ggtccggggt tccggccctg 2581 gagctcgggg ggcggccggg tggcccaccg ggtccgctgg gtccgctgcc ccgctccggc 2641 ggggggtggc cggctgcagc cgggtccggg gttccggccc tggagctcgg ggggcggccg 2701 ggtggcccac cgggtccgct gggtccgctg ccccgctccg gcggggggtg gccggctgca 2761 gccgggtccg gggttccggc cctggagctc ggggggcggc cgggtggccc accgggtccg 2821 ctgggtccgc tgccccgctc cggcgggggg tggccggctg cagccgggtc cggggttccg 2881 gccctggagc tcggggggcg gccgggtggc ccaccgggtc cgctgggtcc gctgccccgc 2941 tccggcgggg ggtggccggc tgcagccggg tccggggttc cggccctgga gctcgggggg 3001 cggccgggtg gcccaccggg tccgctgggt ccgctgcccc gctccggcgg ggggtggccg 3061 gctgcagccg ggtccggggt tccggccctg gagctcgggg ggcggccggg tggcccaccg 3121 ggtccgctgg gtccgctgcc ccgctccggc ggggggtggc cggctgcagc cgggtccggg 3181 gttccggccc tggagctcgg ggggcggccg ggtggcccac cgggtccgct gggtccgctg 3241 ccccgctccg gcggggggtg gccggctgca gccgggtccg gggttccggc cctggagctc 3301 ggggggcggc cgggtggccc accgggtccg ctgggtccgc tgccccgctc cggcgggggg 3361 tggccggctg cagccgggtc cggggttccg gccctggagc tcggggggcg gccgggtggc 3421 ccaccgggtc cgctgggtcc gctgccccgc tccggcgggg ggtggccggc tgcagccggg 3481 tccggggttc cggccctgga gctcgggggg cggccgggtg gcccaccggg tccgctgggt 3541 ccgctgcccc gctccggcgg ggatgggggt gcgctcccag gccggaccct ggtgccaggc 3601 agggaccccg cgccacccgc ttcatggggg gggaggccgc cgcaaggacg ccgggccggc 3661 tgggaggtgt gcaccccccg agcgtctgga cgacgctggc gagccgggcc agctcgcctt 3721 cttttatcct ctttttgggg tctctgtgca ataccttaag gtttgctcag gagtgggggg 3781 cttctcattg gttaattcag gtgtgtgatt ttagcccgtt gggttacatt aaggtgtgta 3841 accaggtggg tggtacctgg aggtcattct attgggataa cgagaggagg aggggctaga 3901 ggcccgcgag atttggggta ggcggagcct caggagggtc ccctccatag ggttgaacca 3961 ggagggggag gatcgggctc cgccccgata tacctagtgg gtggagccta gaggtaggta 4021 tccatagggt tccattatcc tggaggtatc ctaagctccg cccctatata ccaggtgggt 4081 ggagctaggt aggattcagc taggttccta ctggggtacc cccctaccct accttaaggt 4141 gcgccaccct tcctccttcc gttttaatgg tagaataacc tataggttat taacctagtg 4201 gtggaatagg gtattgcagc tgggtatata cctataggta tatagaacct agaggaaggg 4261 aaccctatag tgtaatccct ccccccccta cccccccctc ccttacggtt gcctgagccc 4321 atcccccacc ccagcacccc ggggtgacgt ggcaccccgc gtgccttact gacttgtcac 4381 ctttgcacat ttggtcagct gaccgatgct cgccacttcc tgggtcatga cctggcctgt 4441 gccttgtccc atggacaatg tccctccagc gtggtggctg cctttgggat gcatcacttt 4501 gagccactaa gcccccgttg ctcgccttgc ctgcctcacc atgacacact aagcccctgc 4561 taatccatga gccccgcctt taggaagcac cacgtcccgg ggacggaagc tggattttgg 4621 ccagtcttca attttgggga gtggttttgt gtgagccgga agttggcaat ggggtgaggg 4681 tggcgctggt taagctgacg acctcccaag gtctctcacc ctgggtacac aggtggggcg 4741 gcagcctcta actttggctg tggcctctat ttcctccctt tcctagccag ggccatgtgt 4801 tcctgcatgt ctacttgcct cctgtggtgg cagagcttgg ccctgggccc aacccccgcc 4861 ttgggagcct gtaggggcca acacccttgg tttgtttgtg ttcctgtttg ctggcaactt 4921 actggcagcc gagcagattc taatgggcgc ccgccttctt tctctcttgt tttattaata 4981 gaatctcagc caggacctat acctgagact tcaaagtctg gtcctgggtt ctgagacccc 5041 caagatttgt catgcacacc tgcacacctg ttggtattgg gtttctattc ttgagtgtga 5101 aagtttgtaa aaaaattcat aaaatgtcac taattcctct tacctgttta gggtattgtg 5161 caattcttca gcctgcctat tttcaatttg cctaaggtgg caatttaaga tgtggttaat 5221 taaccatttt cctgtctgac accactgcat gggcaaccgg gttccatggc acatttagag 5281 ataaacatag atgtcttgtc ttgctcatgt gcagaggagg gggtgttggt gtgcaatata 5341 gtttctggat tccaaattga gttgggggtg ctattttcac tatggaatta aattactgac 5401 attagacagt ggacaccggg ctatatgtgg ggatgtctgt ggcttgtcat ttcctcttag 5461 aaggtaatcc cccatcttaa cttcccttta aattgtgatg caagccctgg gttatttata 5521 gaatgattat ctaggtttga tagtctgaag gctgggcaga gaatgtttgt aatttttatt 5581 caccttcttt accccccacg agtatccagt tctagaagat ctcctgatat cccgggctgc 5641 cattattccc ttgagtgtta tagcttcctc ttaacttaag caagagctcc aggatgttag 5701 cttttttggt ggggctggtt gtcaggaaga ggttccagtg ttgtccttta tttttagatg 5761 ttagctttgt gttaggttag tatgggctgg gtattcacta gtgaaggcaa ctaacacagt 5821 tagacgtgct agttgtgccc actggtgttt atccggtccc aaatgtcacc acagaacaca 5881 gggggctgga tttggcagca gcacttgtgc ttttgttgat ttttacccgt gtatcagagt 5941 gggggatgct agccaattta gcttcccctc cccttaacag ggggtctcgc ggggtgccaa 6001 ttgtcgcctg ccttcccccg cttccccttg ttaacttata gcatgatagg taggtcacct 6061 aacgtggaag cctggtgggt gatccttcct cggtagggag cgcttagggc tgttgagctc 6121 aacagcccca cctgggtaaa atgtatgttc taaagagtta cccaattata acaaaactgt 6181 tgtagggtaa cgaagacctg atggaagtgg tattgttgcc gttgaaagac gggtgtcctg 6241 gctcaagttc gcacttccta tacagtgtta aagccttgta tcggaagttt gggcttcgtc 6301 ccagtgtact cgataatgtc gactgctgcg aaaggtttgg accgtcttcc agtaggtgtt 6361 gggggtccca aatcacgagg ttaggcaggt gcacttggct ctttaggagg gacccttaag 6421 ccagacaatg tagtgcccct tttttttgca aattggcctt attattaatt tcttgttaac 6481 actaattctg ttctatgacc ctgtgttttt cagatgccgt tgaacgtgtc actgagctga 6541 atttggacgc agctacttga cctttgcccc cgtgcctcca gcgctgataa gtgctgcgtc 6601 cactttgtgt tacaggtggg ccaaacctcc agaatatcaa ttggtggggc cttggtgggc 6661 tgcataaggc agtaggtttg aggtgaccta cttggaccat gtggatccag tgtcctgatc 6721 ctggaccttg actatgaaac aattctaaaa aaatgcatca tagtccagtg tccagggaca 6781 gtgcactcgg aagtctcatc atctccgttt gtgtgtttag tgtggccagt acggccaccc 6841 ctgtgccacg ccctggcatg ctgctgacat ctggccgcca atttcagcgg gcccttttcc 6901 cccttgttca ccccatagca agaagggtag gttacatggg tattttccca tcagcacctg 6961 actggccggt gcaattagag gagagggcaa caacgcaagg ctgttgtttt atttgggtta 7021 caagagctgc ggcggtcgat gggttcactg attacggttt cctagattgt acagatgaac 7081 tagaactgtc acaatctatg gggtcgtaga cagtgtgctt accagacttc catggaagat 7141 gtgaatttgc tgctagctat atgggtggtg ctatgggctc cctagggact catgtagtgg 7201 ggctttgtga tagctaatga atgtggcagc tgttgtttgt actggaccct gaattggaaa 7261 cagtaacttg gattctgtaa cacttcatgg gtcccgtagt gacaactatg ctgaatatct 7321 tgaatatggg aggagggggg ctttgggttc cattgtgtgc cctttcctgg ccaacgtgag 7381 ggtcctagtg ttatagggcg tggcagtttt cttgagggct aataacccgg gtgaggcggt 7441 tgtcacaggt gctagaccct ggagttgaac cagtaccact cggttacaaa gtcatggtct 7501 agtagttgtg accctgcaaa gctacgtggg gatgagcagc cagggacttt ggttggcaag 7561 cagacaggcg gcgcattgga accccagagg agtgtcccgg ggccacctct ttggttctgt 7621 acatattttg ttattgtaca taaccatgga gttggctgtg gtgcactcca tctggtaagg 7681 gggctggtgc ggacgcctgt gtttagtcta tgccaatgtt tacctgcctt gggttactat 7741 tccaaacgac cacacctttg aggacacctg gagccctgat cattctcggc ttttactgcc 7801 acctggcttc tgttgggtca gacagtttgg tgcgctagtt gtgtgcttag cagcaacgca 7861 caccaggctg actgccttag cagtgtggcc ctttattgtg gcatcctaag gagggattct 7921 ggagtgcctt tcgcgtgaag catgccctga gacgtactcg agttaggact taatcgctcc 7981 tgtgccgctg gatgagggag cgccaatttg tacatcctag ctctggccat agagttagcc 8041 cacccttgtg tctccctttg gcctttgcgg tgccaatttc cggtggtttc ccttttccgc 8101 ccgtttatcc aatagcatgt aagagaggtt gcctagattt ggcaactttg agggaacgtt 8161 ccgtgtagct ggtgacctaa cacccgccca tcaccaccgg acagattctg aacttgtcct 8221 gtggtgtttg gtgtggtttt ggggtacgca ggagtacgtt ggaatgcttt ggagccgaga 8281 gggatgggcc cgcttgtgcg cttatgtgtt acacggtgcc aataaccggc ccggtgcggc 8341 tgccccgtga cccgtgggcc ttaccttcct ggccatcggg ggaccctggt gctagggtcc 8401 cttgtgttgc tttctgccat aggggggaaa gcatcgcctt cagaattggc tgctccgttg 8461 gaacatttga ggcctactgt atccgtgtcc tgacaacatt ccccgcaaac atgacatggg 8521 ttaatttaaa catgttttgt ttgcttggga atgctcttag ggcctggaag cttgtcattg 8581 gattcatcgt ttcctgaact acaggcgtag ggcctattgt agcaggcatg tcttcattcc 8641 tgcgtaccga atggcatgaa ggcacagcct gttaccattg gcaccttttt tccatgtaaa 8701 cctccgtgat cctgggtcct ttggagactc aagtgtgaat ttgttttggt gttcggcgcc 8761 agggtatctc gacgttggaa tgtcaactca acttgggcac ctcgataacc ggctcgtggc 8821 tcgtacagac gattgtttgg ctctgtaact tgccagggac ggctgacgat gtgtttagtc 8881 tgccacttgc atccggcgct ttggttactc gggagactaa tggggggtgt ggtatggcac 8941 aggctggggg tgagtctggg gatgtccctg ggcgttgctg cagcccattc gccctctggg 9001 gatgagatgt tcaggggtgg ccggtaccct acgctgccga tttacataat ataaattgta 9061 aatgctgcag tagtagggat ctggacgcgc gacctgctac tcttcggaaa cgccaaccca 9121 ggagcgtcgc ctctggcccc atactcccgc catgcgactg ctcgccccct cccaggcctc 9181 cctggtgagc ccttgccgct ccccgcattc ctgctttcgg cgcccctgcg gatcccgatg 9241 acagcaggcc tttccttccc ccgttaatga aaagaatgac agtgaggttg tgacagaagg 9301 acagctttat tcagtttaca gagtgccctc ggaggctacg atattcccgt taaatgtctt 9361 gttgattctc tcaaaggtgg ggagggagga gctctccaca acaatgttcc ctggcagcgt 9421 gagcgcgcag ccctgccgtt ggatgtatct tctcatgatg gtgctgatag aggggtctcc 9481 ggcgtagatg aaaaaggcct gggccatgct ctggccggtc acgatcgtta tggggttgtt 9541 ggaaatgttc cggaccgtca gcttgagggt ctggcccggc ttccactcct gtgggtagac 9601 gtagaagacc gggttggagg agtgggacac gacaacggcc gtaatcttgg agctcagggg 9661 ggcctcgtag gtgttgttgt attccagctc cgtgatgaaa ttaggaggaa taatcacagg 9721 ggagccaaag tagcggatgt ctgtggattc cccgtcccag cgccagtggc tcttagggta 9781 ggggttgtaa cggaaggcaa taatcacatc atccaatagg gtcatgccca ccttgacgtt 9841 cagcgggccc tctcgtttca ggtccggcgt gtccacggag actcggacgt agcccttacc 9901 gcggcgtatg gcgtttaccg gacacacctt ccccgggaat gtgtgaatac gggcgtatga 9961 ctttagaaat gggggcgtgt gctgcgccag caggtaaggc aggcactcgt cctggctggt 10021 gacgggagag ccactgagga agatctgggg ctcgctggtg tttagcttgt ccccgctctg 10081 ggtgcaggag cgtgtcagct gaatgtcgct ctgcccgggc agaatctgca ggtagaggta 10141 ggggttcttg accaatctga tgggcacaat gtaccaggta aacttccctt tctctatgaa 10201 caggctgcgc ggattcagga cgcttagcac gatgtcctgg tcagagtgca taacgaagaa 10261 gggcttgagg aatacctcgt tgtcttccgc tccaaagaac aaaaacgcga ccgtaaagta 10321 gcggctgccg taggtggtcg tgttgaagga gaaagaaggt aacttgaagc tgagtatctg 10381 gcccaccgag gggcagggag gcagctcttg gcactgcgcg tccagctgca atacctgctt 10441 gttggtgacg cggacgtatg aggggaagat ctcgtacttc cacacgcctc tcatgaacga 10501 cgtgtctggt ttttcagtgg gccgcaggcg gcggaggctg ttcctgaacg acgagcgccg 10561 ggacgctagt gctgcatggg ctcctccggg gtaagcttcg gccatggccg gagctcgtcg 10621 acgggcaagg tgagagtcgg ggggcgggcg acggtgcggc cccaatacaa ctctccgctc 10681 gttagctggt agaatatccg cccggcgtct aggttgtcac ttcgctcggc cggccagaag 10741 agcgcaagtc caagtctggt gctggggccg atgtgcagcg gtttgtgccc gcagttgtag 10801 actgtcattt ttatgggcga gtgggcggtc cacacgcgcg ggcgcagcac ccattggtcg 10861 cacgccgcct cctggaatgt aaacccccag agagagggcg tgccgccctg gagatggccc 10921 tgtgccatca catgtatttc ctccttgggt ggaacaacgg cgtcgtgctc cgggtggagg 10981 gggaatagcg tccaggcatc tttcagggtc acgagaccgg ggtccatgct cagagaacag 11041 ccctcccggg cggtgggcgg cccgggctcc agcagaacgt cgcagaccca gccctcctcg 11101 gccctgtcca cctgtatgtc caggtgcacg gacccggagg ctgcgtctcg tgacatggcc 11161 aggcctggtg ccagccgacc acgtcccgtg tcccagccga ggccgcgcca gagcagagcc 11221 cgggactgac tcagggccac atcccctcgg cccgcggacg ccgcctcgcc agcccccggg 11281 ccttcatggg cccgctttct acctctctcc ggcaccccag cctggtcagc cgcagaggaa 11341 gcatgacctt ggggtgggac ggggcaggcg tgatcctggg cgcaatcttt gccgatcccc 11401 acaccttcac tccttgttag gttgatagaa tgtcggtacc acgccacggg gggcgggccc 11461 gcatagggaa aagccaggga gagcgatgtg ggcgaggatg ggctcaggcg gccccagaca 11521 cgcaatttgc ccccctgggc ggccgcagcc tgcccctcgg cggcccgtgc cccagctccg 11581 tcacgggggg cgcataggag gggtatatct aggatagccg cacctacaca aatgagacac 11641 agacacaggt cgtgaggatt taggcaacgc aggcttgtct ttatagttac aaacatggga 11701 gcgtgcacct ggaagatgca gctggggtag atctttacat ctttacaggg cgcagcggcc 11761 gccagacact gaagggcaga gttcacggcg ggcacctccc agagggagcc caccagcccg 11821 tacctggcca cggcc // LOCUS HUMG25KA 1014 bp ss-mRNA PRI 02-AUG-1990 DEFINITION Human GTP-binding protein (G25K) mRNA, complete cds. ACCESSION M35543 KEYWORDS G25K gene; GTP-binding protein G25K. SOURCE Human fetal brain, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1014) AUTHORS Munemitsu,S., Innis,M.A., Clark,R., McCormick,F., Ullrich,A. and Polakis,P. TITLE The molecular cloning and expression of G25K, the human homolog of the yeast cell cycle gene CDC42 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Munemitsu, 25-JUN-1990. Author address: S.Munemitsu Cetus Corporation 1400 53rd Street Emeryville, CA 94608 FEATURES from to/span description pept 4 579 GTP-binding protein G25K BASE COUNT 286 a 204 c 210 g 314 t ORIGIN 1 gcaatgcaga caattaagtg tgttgttgtg ggcgatggtg ctgttggtaa aacatgtctc 61 ctgatatcct acacaacaaa caaatttcca tcggaatatg taccgactgt ttttgacaac 121 tatgcagtca cagttatgat tggtggagaa ccatatactc ttggactttt tgatactgca 181 gggcaagagg attatgacag attacgaccg ctgagttatc cacaaacaga tgtatttcta 241 gtctgttttt cagtggtctc tccatcttca tttgaaaacg tgaaagaaaa gtgggtgcct 301 gagataactc accactgtcc aaagactcct ttcttgcttg ttgggactca aattgatctc 361 agagatgacc cctctactat tgagaaactt gccaagaaca aacagaagcc tatcactcca 421 gagactgctg aaaagctggc ccgtgacctg aaggctgtca agtatgtgga gtgttctgca 481 cttacacaga gaggtctgaa gaatgtgttt gatgaggcta tcctagctgc cctcgagcct 541 ccggaaactc aacccaaaag gaagtgctgt atattctaaa ctgttttctc cttcccttct 601 ttgctgctgc ttcctgtccc actactgtag aaagatcgtt taaaaacaaa ggaataaaac 661 catcctgttt gaaagcctct gcgtcttttt actcaccacc ttagagcaac ctctgtatta 721 gtttttgatc aagaattgca atatcatata aattttttgt gatcagtagt caagttggac 781 ttgttttaac gttctgctgc ttgagttgcc tgatgctcag agctttttgg tttggattac 841 tattgcaaag ggaacttggt ctggcttaga tgtcctcttg gagaaaataa caagagtttt 901 aacacttcta gatcttagtt cagatggaga aagtaacaca aacatcattt tactcttatg 961 atcaattgtt aattgtaatt gcatgacaaa ccttatggaa aaggggtgac ctgg // LOCUS HUMMUCAB 1804 bp ss-mRNA PRI 02-AUG-1990 DEFINITION Human polymorphic epithelial mucin (PEM) mRNA, complete cds. ACCESSION J05581 KEYWORDS polymorphic epithelial mucin. SOURCE Human adult mammary adenocarcinoma cell line BT20, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1804) AUTHORS Gendler,S.J., Lancaster,C.A., Taylor-Papadimitriou,J., Duhig,T., Peat,N., Burchell,J., Pemberton,L., Lalani,E.-N. and Wilson,D. TITLE Molecular cloning and expression of the human tumour-associated polymorphic epithelial mucin, PEM JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.J.Gendler, 26-JUN-1990. FEATURES from to/span description pept 73 1500 polymorphic epithelial mucin (PEM) precursor sigp 73 135 polymorphic epithelial mucin signal peptide matp 136 1497 polymorphic epithelial mucin signal 1783 1788 poly-A signal mRNA < 1 1804 PEM mRNA BASE COUNT 361 a 622 c 434 g 387 t ORIGIN 1 cgctccacct ctcaagcagc cagcgcctgc ctgaatctgt tctgccccct ccccacccat 61 ttcaccacca ccatgacacc gggcacccag tctcctttct tcctgctgct gctcctcaca 121 gtgcttacag ttgttacagg ttctggtcat gcaagctcta ccccaggtgg agaaaaggag 181 acttcggcta cccagagaag ttcagtgccc agctctactg agaagaatgc tgtgagtatg 241 accagcagcg tactctccag ccacagcccc ggttcaggct cctccaccac tcagggacag 301 gatgtcactc tggccccggc cacggaacca gcttcaggtt cagctgccac ctggggacag 361 gatgtcacct cggtcccagt caccaggcca gccctgggct ccaccacccc gccagcccac 421 gatgtcacct cagccccgga caacaagcca gccccgggct ccaccgcccc cccagcccac 481 ggtgtcacct cggccccgga caccaggccg gccccgggct ccaccgcccc cccagcccat 541 ggtgtcacct cggccccgga caacaggccc gccttgggct ccaccgcccc tccagtccac 601 aatgtcacct cggcctcagg ctctgcatca ggctcagctt ctactctggt gcacaacggc 661 acctctgcca gggctaccac aaccccagcc agcaagagca ctccattctc aattcccagc 721 caccactctg atactcctac cacccttgcc agccatagca ccaagactga tgccagtagc 781 actcaccata gcacggtacc tcctctcacc tcctccaatc acagcacttc tccccagttg 841 tctactgggg tctctttctt tttcctgtct tttcacattt caaacctcca gtttaattcc 901 tctctggaag atcccagcac cgactactac caagagctgc agagagacat ttctgaaatg 961 tttttgcaga tttataaaca agggggtttt ctgggcctct ccaatattaa gttcaggcca 1021 ggatctgtgg tggtacaatt gactctggcc ttccgagaag gtaccatcaa tgtccacgac 1081 gtggagacac agttcaatca gtataaaacg gaagcagcct ctcgatataa cctgacgatc 1141 tcagacgtca gcgtgagtga tgtgccattt cctttctctg cccagtctgg ggctggggtg 1201 ccaggctggg gcatcgcgct gctggtgctg gtctgtgttc tggttgcgct ggccattgtc 1261 tatctcattg ccttggctgt ctgtcagtgc cgccgaaaga actacgggca gctggacatc 1321 tttccagccc gggataccta ccatcctatg agcgagtacc ccacctacca cacccatggg 1381 cgctatgtgc cccctagcag taccgatcgt agcccctatg agaaggtttc tgcaggtaat 1441 ggtggcagca gcctctctta cacaaaccca gcagtggcag ccacttctgc caacttgtag 1501 gggcacgtcg cccgctgagc tgagtggcca gccagtgcca ttccactcca ctcaggttct 1561 tcagggccag agcccctgca ccctgtttgg gctggtgagc tgggagttca ggtgggctgc 1621 tcacaccgtc cttcagaggc cccaccaatt tctcggacac ttctcagtgt gtggaagctc 1681 atgtgggccc ctgaggctca tgcctgggaa gtgttgtggt gggggctccc aggaggactg 1741 gcccagagag ccctgagata gcggggatcc tgaactggac tgaataaaac gtggtctccc 1801 actg // LOCUS RABGHKAB 1367 bp ss-mRNA MAM 02-AUG-1990 DEFINITION O.cuniculus gastric H+ - K+ ATPase beta subunit mRNA, complete cds. ACCESSION M35544 KEYWORDS H+ - K+ ATPase beta subunit. SOURCE O.cuniculus (strain New Zealand White) adult stomach, cDNA to mRNA, clone RB-49. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (bases 1 to 1367) AUTHORS Reuben,M.A., Lasater,L.S. and Sachs,G. TITLE Characterization of a beta subunit of the gastric H,K ATPase JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Reuben, 25-JUN-1990. FEATURES from to/span description pept 54 929 H+/K+ ATPase beta subunit mRNA 1 1367 H+/K+ ATPase beta subunit mRNA BASE COUNT 293 a 450 c 376 g 248 t ORIGIN 1 tgtccatcta aggggtgcag ggtgggggag cggcggcttc cacagcagac accatggccg 61 ccttgcagga gaagaagtcg tgcagccagc gcatggagga gttccgccac tactgctgga 121 acccggacac ggggcagatg ctgggccgca ccctgtcccg ctgggtgtgg atcagcctct 181 actacgtggc cttctacgtg gtgatgaccg ggctgtttgc cctgtgcatc tacgtgctca 241 tgcaaaccat cgacccctac acgcctgact accaggacca gctgaagtcg ccaggcgtaa 301 ccttaaggcc agacgtgtat ggggagaagg gcctggaaat ccactacaac atctctgaca 361 accgcacctg gaccagcctc acgcacaccc tgagaagctt cctggcaggt tactcacccg 421 cagcccaggt ggacaacatc aactgcacct ccaagaccta cttcttccag gagagcttcg 481 gggctcccaa ccacaccaag ttctcctgca agttcacggc ggacatgctg gagaactgct 541 ccggcctcac cgaccccagc ttcggcttca aggaaggaaa gccctgcttc atcattaaga 601 tgaacaggat tgtcaggttc ctgcccagca acagcacacc accccgcgtg gactgcacct 661 tcctggacat gccccaccag gccctcacgc cgctgcaggt cgagtactac ccccccaacg 721 gcaccttcag cctccactac ttcccttact acggaaagaa agcgcagccg cactacagca 781 accctctggt ggccgccaag ctcctcaacg tgcccacgaa cacggaggtc gtggtcctct 841 gcaagatcct ggccgatcac gtgaccttcg acaacccgca cgacccctac gaggggaaag 901 tggagttcaa gctcaagatc cagaagtgag ctcgcggtcg tgggtctcac gggggtcccg 961 ctgggccctg tggacgcgtt tctgaggccc gcaccagtgc ccccctggga gcgccaggca 1021 agcagccgcg tgcacagcgc ttccattggc gtccgctgct tcccgtggtg tcccgtcccg 1081 tgcaggcgca tctgtagtag gatggaaaat gccggctaac ccagccatgg caaggcggcg 1141 accgcgagtg cccagagccc agaactgacc cgtcctttcc gtgcggggag caccttcttg 1201 ctaccattgg tttttctgac gtcagtgtga gcggcttccc agatgacgca ggggctgtta 1261 gtgtgctgag gtgcggtgac gtgagacgcc catggcaacc cccactcaca gcgcaatccc 1321 cgtgttgtaa aaataaaacg tgttagttac tctaggtaaa acaaaaa // LOCUS ECOTRRNA23 485 bp ss-rRNA RNA 02-AUG-1990 DEFINITION E.coli 23S small subunit ribosomal RNA, 5' end. ACCESSION M24881 KEYWORDS 23S ribosomal RNA; small subunit ribosomal RNA. SOURCE E.coli rRNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 485) AUTHORS Branlant,C., Sri Widada,J., Krol,A. and Ebel,J.-P. TITLE RNA sequences in ribonucleoprotein fragments of the complex formed from ribosomal 23-S RNA and ribosomal protein L24 of Escherichia coli JOURNAL Eur. J. Biochem. 74, 155-170 (1977) STANDARD simple staff_review FEATURES from to/span description rRNA 1 > 485 23S rRNA BASE COUNT 143 a 105 c 151 g 86 t ORIGIN 1 ggttaagcga ctaagccgga gcgtacacgg atggtgcaac gccctgggca gtcagagagc 61 gtgcgatgaa ggacgctaat ctgataagcg atatggtaag gtcgaaccgt gttataaccg 121 atttccggcg aatggggaaa cagtgtgttt cgacacacta tcattaactg aatcaatagg 181 ttaatgaacc ggggggaact gaaacatcta agtaccccga aaaggggaaa taaccgagat 241 tcccccagta gagagcgtag ggcccagagc ctgaacgaat cagtgtgtgg aagcgataca 301 gggtgaaagg gcgtgtctgt tagcgccccg acagtacaca aaaatgcaca tgctgagctc 361 gggacacgag tgatgagggg gggcgcgaaa agaaccccga aagggggcgt ctgaatatgt 421 atccctgacc atcctccaag gctaaatacc cttggggggg gactgaccga tagtaccgaa 481 gagtg // LOCUS KPNNIFBQ 2178 bp ds-DNA BCT 02-AUG-1990 DEFINITION K.pneumoniae nitrogen fixation (nifB and niQ) genes, complete cds. ACCESSION M15545 KEYWORDS nitrogen fixation protein. SOURCE K.pneumoniae DNA. ORGANISM Klebsiella pneumoniae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2178) AUTHORS Buikema,W.J., Klingensmith,J.A., Gibbons,S.L. and Ausubel,F.M. TITLE Conservation of structure and location of Rhizobium meliloti and Klebsiella pneumoniae nifB genes JOURNAL J. Bacteriol. 169, 1120-1126 (1987) STANDARD simple staff_review FEATURES from to/span description pept 199 1605 nitrogen fixation B (nifB) pept 1605 2108 nitrogen fixation Q (nifQ) BASE COUNT 402 a 670 c 692 g 414 t ORIGIN 1 tcagattatg gatatcacca tgccgcgact gtgaagcctt atgtgagatt caggacattg 61 tcgccagcgc ggcggaattg cgacaattca gggacgcggg ttgccggtta aaaagtctac 121 ttttcatgcg gttgcgaaat taacctctgg tacagcattt gcagcaggaa ggtatcgccc 181 aaccacgaag gtacgaccat gacttcctgc tcctcttttt ctggcggcaa agcctgccgc 241 ccggcggatg acagcgcatt gacgccgctt gtggccgata aagctgccgc gcacccctgc 301 tactctcgcc atgggcatca ccgtttcgcg cggatgcatc tgcccgtcgc gcccgcctgc 361 aatttgcagt gcaactactg taatcgcaaa ttcgattgca gcaacgagtc ccgccccggg 421 gtatcgtcaa cgctgctgac gcctgaacag gcggtcgtga aagtgcgtca ggtcgcgcag 481 gcgatcccgc agctttcggt ggtgggcatc gccgggcccg gcgatccgct cgccaatatc 541 gcccgcacct ttcgcaccct ggagctgatc cgcgaacagc tgccggacct gaaattatgc 601 ctgtcgacca acggactgat gctgcctgac gcggtggacc gcctgctgga tgtcggcgtt 661 gaccacgtca cggtcaccat taacaccctc gacgcggaga ttgccgcgca aatctacgcc 721 tggctatggc tggacggcga acgctacagc gggcgcgaag gcggagagat cctgattgcc 781 cgtcagcttg agggcgtacg caggctgacc gccaaaggcg tgctggtgaa aataaattcg 841 gtgctgatcc ccggtatcaa cgatagcggc atggccgacg tgagccgcgc gctgcgggcc 901 agcggcgcgt ttatccataa tattatgccg ctgatcgcca ggccggagca cggcacggtg 961 tttcggctca acggccagcc ggagccggac gccgagacgc tcgccgccac ccgcagccgg 1021 tgcggcgaag tgatgccgca gatgacccac tgccaccagt gtcgcgccga cgccattggg 1081 atgctcggcg aagaccgcag ccagcagttt acccagcttc cggcgccaga gagtctcccg 1141 gcctggctgc cgatcctcca ccagcgcgcg cagctgcacg ccagcattgc gacccgcggc 1201 gaatctgaag ccgatgacgc ctgcctggtc gccgtggcgt caagccgcgg ggacgtcatt 1261 gattgtcact ttggtcacgc cgaccggttc tacatttaca gcctctcggc cgccggtatg 1321 gtgctggtca acgagcgctt tacgcccaaa tattgtcagg ggcgcgatga ctgcgagccg 1381 caggataacg cagcccggtt tgcggcgatc ctcgaactgc tggcggacgt taaagccgta 1441 ttctgcgtgc gtatcggcca tacgccgtgg caacagctgg aacaggaagg cattgaaccc 1501 tgcgttgacg gcgcgtggcg gccggtctcc gaagtgctgc ccgcgtggtg gcaacagcgt 1561 cgggggagct ggcctgccgc gttgccgcat aagggggtcg cctgatgccg ccgctcgact 1621 ggttgcggcg cttatggctg ctgtaccacg cggggaaagg cagctttccg ctgcgcatgg 1681 ggcttagccc gcgcgattgg caggcgctgc ggcggcgcct gggcgaggtg gaaacgccgc 1741 tcgacggcga gacgctcacc cgtcgccgcc tgatggcgga gctcaacgcc acccgcgaag 1801 aggagcgcca gcagctgggc gcctggctgg cgggctggat gcagcaggat gccgggccga 1861 tggcgcagat tatcgccgag gtttcgctgg cgtttaacca tctctggcag gaatcttggt 1921 ctggcatcgc cgccgaattg cgcctgctga tgagcgactg ctttccacag ctggtggtga 1981 tgaacgaaca caatatgcgc tggaaaaagt tcttttatcg tcagcgctgt ttgctgcaac 2041 agggggaagt tatctgccgt tcgccaagct gcgacgagtg ctgggaacgc agcgcctgtt 2101 ttgagtagcc gtttcccgaa gggggcgctg caaacaaaaa agccggaggt ttccctccgg 2161 cttttcacat catcaaat // LOCUS RHMNIFB 1722 bp ds-DNA BCT 02-AUG-1990 DEFINITION R.meliloti nitrogen fixation B (nifB) gene, complete cds. ACCESSION M15544 KEYWORDS nitrogen fixation protein. SOURCE R.meliloti DNA. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 1722) AUTHORS Buikema,W.J., Klingensmith,J.A., Gibbons,S.L. and Ausubel,F.M. TITLE Conservation of structure and location of Rhizobium meliloti and Klebsiella pneumoniae nifB genes JOURNAL J. Bacteriol. 169, 1120-1126 (1987) STANDARD simple staff_review FEATURES from to/span description pept 199 1671 nitrogen fixation B protein (nifB) BASE COUNT 422 a 476 c 481 g 343 t ORIGIN 1 gcgtcgcgag cgtcgcctct tttctccgtc ctttcgaaac acgaccggat gcaattcaac 61 tttgcccttg tcggaagtcc gacaagcctg tgagcaaaga gacaggaccg aatccaaaga 121 cgccaatcca ttgacgaaaa ggatttggca tagctgttgc tggttgaatt gcaaaatcac 181 cctccactgg agttgttcat gtccacaccc atgattttgc gtgagagccg gaccagcact 241 acattctctg accagttgct ggagaacgct aaatcggttg gctgctcacc cccatcgacg 301 gcgccgggcg acatagatcc tggaacttgg gacaagatta agaatcaccc ctgtttttca 361 gaggaggcgc atcactattt cgcgcgcatg cacgtggcgg tcgcgcctgc ctgcaacatc 421 caatgtaact attgcaatcg caaatacgat tgcgccaacg aaagtcggcc cggtgttgcc 481 tcggaaaagc tcacacctga ccaggcggtg cgaaaggtga ttgccgttgc caacgaagtg 541 cctcagctgt cagtgcttgg catcgctggg cctggcgatg cctgttacga ctggaagaaa 601 acaagggcga cgttcgaacg agtggctagg gaaattcccg acataagact ctgcatctcc 661 acgaacgggc tctcgctgcc ggaccatgtc gatgagcttg ccgaaatgaa cgtcgatcac 721 gtgacgatca ccatcaacat ggtcgatccg cgtgtcggcg taaagatcta cccctggatt 781 tactatggtc agcgccgcca cactggtatc gacgctgcga gaatcctgca cgaacggcag 841 atgttgggcc tggagatgct agccgaacgc ggcatcctca ccaaggtcaa ctcggtaatg 901 atccccggcg tcaatgatga gcacctgatc gaagtcaaca aagttgtgaa aggaagaggc 961 gcgttgctgc acaacgtaat gccgctaatt tcaaaccgca tacacgggac ctattacgga 1021 ctgacagggc agcgcggccc ggaggccttc gaactgcagg cccttcagga ccgtctagaa 1081 ggaaccaaac tgatgcgtca ttgtcgacat tgccgggccg atgccatagg cttgctcggc 1141 gatgatcgtg gtcacgagtt cacgctcgct gaaatccccg acgagataac ctacgatgcc 1201 agcaagcgac aggcctatcg ccagttggtc gcgcgcgaac gcggggacca cctagtggcc 1261 aagaacgagg cgaacagaac ggtaatgtcg gtggaatatg gcggatcgct tctcattgcc 1321 gtggcgacca aaggcggggg ccggatcaac gaacattttg gacacgcgaa agaatttcac 1381 gtttataccg tctcccagag agggatcaag ctggcaggcc gccgcagggt tgagcagtat 1441 tgcctcggcg gttggggcga ggtcgccacc ctcgatcaca tcgtcgttgc gcttgaagga 1501 atagacatcc tgctctgcgt caaaatcgga gattacccaa ggaaacagct gacacaggcc 1561 gggcttcgag cgacggaagc ttatggccat gactacatcg agagtgcgct cgaaagctct 1621 acgccgccga gtttggctat cgaaccaccg gtaaagacgg cgacacgttg agctgccctc 1681 cgactgaata aggagtttaa aatggccttc aagattattg ca // LOCUS AKOMTCYTBA 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.boliviensis mitochondrial cytochrome b gene, 5' end. ACCESSION M35691 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.boliviensis (MVZ catalog # 171607) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon boliviensis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon boliviensis. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 122 a 111 c 52 g 116 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta ctcaaaattg ttaatcactc attcattgac 61 ctaccaactc catctaacat ctcatcctga tgaaattttg gatccctact aggcatatgc 121 ctaataatcc aaattttaac aggcctattc ctagccatac actacacatc agacacaacc 181 acagcattct cctcagtagc acatatctgc cgagatgtaa actacggatg attaatccgc 241 tacttgcacg ctaatggagc ctccatattc tttatctgcc tcttcatcca cgtaggccga 301 ggtatttact atggttccta taccctctca gaaacctgaa acattggcat catcttattc 361 ctcacaacta tagcaacagc atttgtagga tatgtactcc c // LOCUS AKOMTCYTBB 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.boliviensis mitochondrial cytochrome b gene, 5' end. ACCESSION M35692 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.boliviensis (MVZ catalog # 171608) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon boliviensis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon boliviensis. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 122 a 112 c 52 g 115 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta ctcaaaattg ttaatcactc attcattgac 61 ctaccaactc catctaacat ctcatcctga tgaaattttg gatccctact aggcatatgc 121 ctaataatcc aaattttaac aggcctattc ctagccatac actacacatc agacacaacc 181 acagcattct cctcagtagc acatatctgc cgagatgtaa actacggatg attaatccgc 241 tacttgcacg ctaatggagc ctccatattc tttatctgcc tcttcatcca cgtaggccga 301 ggtatttact atggttccta caccctctca gaaacctgaa acattggcat catcttattc 361 ctcacaacta tagcaacagc atttgtagga tatgtactcc c // LOCUS AKOMTCYTBC 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.puer mitochondrial cytochrome b gene, 5' end. ACCESSION M35693 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.puer (MVZ catalog # 171612) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon puer Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon puer. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 122 a 112 c 52 g 115 t ORIGIN 1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc atttattgac 61 ctaccaactc catctaacat ctcatcctgg tgaaattttg gatccctact aggcatatgc 121 ctagtaatcc aaattctaac aggcctattc ctagccatac actatacatc agacacaacc 181 acagcattct cctcagtagc ccacatctgc cgagatgtaa attacggatg attaatccgc 241 tatctacacg ccaatggagc ttctatattc tttatctgcc tcttcatcca tgtaggccga 301 ggtatttact acggctccta tgtcctctca gaaacctgaa acatcggcat catcttattc 361 ctcacaacta tagcaacagc attcgtagga tatgtactac c // LOCUS AKOMTCYTBD 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.subfuscus mitochondrial cytochrome b gene, 5' end. ACCESSION M35694 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.subfuscus (MVZ catalog # 172969 and 172970) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon subfuscus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon subfuscus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 120 a 112 c 53 g 116 t ORIGIN 1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac 61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcatatgc 121 ttagtgatcc aaattctaac aggcctgttc ctagccatac actatacatc cgacacaact 181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc 241 tacttacacg ctaatggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga 301 ggtatttact acggctccta tgtcctctca gaaacttgaa acatcggcat catcttattc 361 ctcacaacta tagcaacagc attcgtagga tatgtattac c // LOCUS AKOMTCYTBE 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.subfuscus mitochondrial cytochrome b gene, 5' end. ACCESSION M35695 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.subfuscus (MVZ catalog # 174109 and 174110) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon subfuscus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon subfuscus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 120 a 113 c 53 g 115 t ORIGIN 1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac 61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcatatgc 121 ttagtgatcc aaattctaac aggcctgttc ctagccatac actatacatc cgacacaact 181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc 241 tacttacacg ctaacggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga 301 ggtatttact acggctccta tgtcctctca gaaacttgaa acatcggcat catcttattc 361 ctcacaacta tagcaacagc attcgtagga tatgtattac c // LOCUS AKOMTCYTBF 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.subfuscus mitochondrial cytochrome b gene, 5' end. ACCESSION M35696 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.subfuscus (MVZ catalog # 174229 and 174230) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon subfuscus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon subfuscus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 122 a 116 c 51 g 112 t ORIGIN 1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac 61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcacatgc 121 ttagtaatcc aaattctaac aggcctattc ctagccatac actatacatc cgacacaact 181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc 241 tacttacacg ctaacggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga 301 ggtatttact acggctccta cgtcctctca gaaacttgaa acatcggcat catcttattc 361 ctcacaacta tagcaacagc attcgtagga tatgtactac c // LOCUS AKOMTCYTBG 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.kofordi mitochondrial cytochrome b gene, 5' end. ACCESSION M35697 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.kofordi (MVZ catalog # 171665 and 171666) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon kofordi Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon kofordi. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 122 a 116 c 51 g 112 t ORIGIN 1 atgaaaattt tacggaaaaa ccacccacta ctcaaaatta ttaatcactc attcattgac 61 ttaccaactc catctaacat ctcatcctga tgaaatttcg gatccctact aggcacatgc 121 ttagtaatcc aaattctaac aggcctattc ctagccatac actatacatc cgacacaact 181 acagcattct cctcagtagc ccatatctgc cgagatgtaa actacggatg actaatccgc 241 tacttacacg ctaacggagc ctctatattc ttcatctgcc tcttcatcca tgtaggccga 301 ggtatttact acggctccta cgtcctctca gaaacttgaa acatcggcat catcttattc 361 ctcacaacta tagcaacagc attcgtagga tatgtactac c // LOCUS AKOMTCYTBH 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.juninensis mitochondrial cytochrome b gene, 5' end. ACCESSION M35698 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.juninensis (MVZ catalog # 173038 and 173039) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon juninensis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon juninensis. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 117 a 115 c 57 g 112 t ORIGIN 1 atgaaagtct tacgaaaaaa tcacccacta ctcaaaattg ttaatcactc atttatcgat 61 ctaccaaccc catctaacat ctcatcctga tggaatttcg ggtccctact aggcacatgc 121 ctagtaatcc aaattctaac aggcctattc ctagccatac actacacgtc agatacaacc 181 acagcattct cctcagtagc ccacatctgc cgagatgtaa actacggatg attaatccgc 241 tacttacacg ctaacggagc ctctatattc tttatctgcc tcttcatcca tgtaggccga 301 gggatttact acggctccta catcctctca gaaacctgaa acattggcat catcttgttt 361 ctcacaacta tagcaacagc atttgtaggg tatgtacttc c // LOCUS AKOMTCYTBI 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.mollis mitochondrial cytochrome b gene, 5' end. ACCESSION M35699 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.mollis (MVZ catalog # 173057 and 173058) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon mollis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon mollis. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 120 a 110 c 54 g 117 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta cttaaaatta ttaatcactc attcattgac 61 ctaccaaccc catccaacat ctcatcttga tgaaactttg ggtcactgct cggtgtatgc 121 ctaataatcc aaatcctaac tggcctattc ctagccatac actacacatc agacacaacc 181 acagcattct cctcagtagc ccacatttgt cgagatgtaa actacggatg attaatccgc 241 tatctacacg ctaacggagc ttccatattc tttatttgcc ttttcatcca tgtaggccga 301 ggaatctatt acggctccta tgtcctctca gaaacctgaa acatcggtat catcctattc 361 cttacaacta tagcaacagc attcgtagga tatgtattac c // LOCUS AKOMTCYTBJ 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.torques mitochondrial cytochrome b gene, 5' end. ACCESSION M35700 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.torques (MVZ catalog # 171720) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon torques Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon torques. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 117 a 112 c 58 g 114 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg taaatcactc attcattgac 61 ttaccaaccc catccaacat ctcatcttga tgaaactttg ggtcactact cggtgtatgc 121 ctaataatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc 181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc 241 tatctacacg ctaacggagc ttccatattc tttatctgcc ttttcatcca tgtaggccga 301 ggaatctatt acggctccta tgtcctctca gaaacctgaa acgtcggtat catcctattc 361 ctcacaacta tagcaacagc attcgtaggg tacgtgttac c // LOCUS AKOMTCYTBK 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.torques mitochondrial cytochrome b gene, 5' end. ACCESSION M35701 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.torques (MVZ catalog # 171721) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon torques Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon torques. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 116 a 112 c 59 g 114 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg taaatcactc attcattgac 61 ttaccaaccc catccaacat ctcatcttga tgaaactttg ggtcactact cggtgtatgc 121 ctaataatcc aaattctgac cggcctgttc ctagccatac actacacatc agacacaacc 181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc 241 tatctacacg ctaacggagc ttccatattc tttatctgcc ttttcatcca tgtaggccga 301 ggaatctatt acggctccta tgtcctctca gaaacctgaa acgtcggtat catcctattc 361 ctcacaacta tagcaacagc attcgtaggg tacgtgttac c // LOCUS AKOMTCYTBL 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.torques mitochondrial cytochrome b gene, 5' end. ACCESSION M35702 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.torques (MVZ catalog # 174053 and 174054) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon torques Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon torques. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 118 a 112 c 58 g 113 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg taaatcactc atttattgac 61 ttaccaaccc catccaacat ctcatcttga tggaactttg ggtcactact aggcgtatgc 121 ctaataatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc 181 acagcattct cttcagtagc ccacatttgc cgagacgtaa actacggatg attaatccgc 241 tatctacacg ctaacggagc ttccatattc tttatctgcc ttttcatcca tgtaggccga 301 gggatctatt acggctccta tgccctctca gaaacctgaa acatcggtat catcttattc 361 ctcacaacca tagcaacagc attcgtaggg tatgtattac c // LOCUS AKOMTCYTBM 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end. ACCESSION M35703 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.aerosus (MVZ catalog # 171679 and 1761680) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon aerosus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon aerosus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 115 a 107 c 59 g 120 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccacta cttaaaattg ttaatcactc atttattgac 61 ctaccaaccc cacccaacat ttcatcctga tggaattttg ggtcactact tggtgtatgc 121 ctaataatcc aaattctgac tggcctattc ctagccatac actacacatc agacacaacc 181 acagcattct cttcagtagc ccatatttgc cgagatgtaa actacgggtg gctaatccgc 241 tatctacatg ctaacggagc ctccatattc tttatctgcc tttttatcca cgtgggtcga 301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat tatcttattc 361 cttacaacta tagcaacagc attcgtaggg tatgtactac c // LOCUS AKOMTCYTBN 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end. ACCESSION M35704 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.aerosus (MVZ catalog # 172818 and 172819) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon aerosus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon aerosus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 110 a 114 c 60 g 117 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcattc atttattgac 61 ctcccaaccc catctaacat ctcatcttga tggaatttcg ggtcactact cggtgtgtgc 121 ctagtaatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc 181 acagcattct cctcagttgc ccacatttgc cgagatgtaa actacggatg attaattcgc 241 taccttcacg ctaacggagc ctccatattc tttatctgcc ttttcatcca cgtaggccga 301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat catcctgttc 361 cttacaacta tagcaacagc attcgtaggg tatgttctac c // LOCUS AKOMTCYTBO 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end. ACCESSION M35705 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.aerosus (MVZ catalog # 172849) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon aerosus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon aerosus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 111 a 114 c 59 g 117 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcattc atttattgac 61 ctcccaaccc catctaacat ctcatcttga tggaatttcg ggtcactact cggtgtgtgc 121 ctagtaatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc 181 acagcattct cctcagttgc ccacatttgc cgagatgtaa actacggatg attaattcgc 241 taccttcacg ctaacggagc ctccatattc tttatctgcc ttttcatcca cgtaggccga 301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat catcctattc 361 cttacaacta tagcaacagc attcgtaggg tatgttctac c // LOCUS AKOMTCYTBP 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end. ACCESSION M35706 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.aerosus (MVZ catalog # 172850) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon aerosus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon aerosus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 110 a 114 c 60 g 117 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcattc atttattgac 61 ctcccaaccc catctaacat ctcatcttga tggaatttcg ggtcactact cggtgtgtgc 121 ctagtaatcc aaattctgac cggcctattc ctagccatac actacacatc agacacaacc 181 acagcattct cctcagttgc ccacatttgc cgagatgtaa actacggatg attaattcgc 241 taccttcacg ctaacggagc ctccatattc tttatctgcc ttttcatcca cgtaggccga 301 ggaatctact acggctccta tgtcctctca gaaacctgaa acatcggtat catcctgttc 361 cttacaacta tagcaacagc attcgtaggg tatgttctac c // LOCUS AKOMTCYTBQ 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end. ACCESSION M35707 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.aerosus (MVZ catalog # 172870) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon aerosus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon aerosus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 115 a 108 c 58 g 120 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcactc atttattgac 61 ctcccaaccc catccaacat ctcatcttga tggaattttg ggtcactact aggtgtatgc 121 ctaataatcc aaattctgac tggactattc ctagccatac actacacatc agacacaacc 181 acagcattct cttcagtcgc ccacatttgc cgagatgtaa attacgggtg attaatccgc 241 taccttcacg ctaacggagc ctccatattc tttatctgcc tttttatcca cgtaggtcga 301 ggaatctact acggctccta tgtcctttca gaaacctgaa acatcggtat catcctgttc 361 cttacaacta tagcaacagc atttgtagga tatgtactac c // LOCUS AKOMTCYTBR 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.aerosus mitochondrial cytochrome b gene, 5' end. ACCESSION M35708 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.aerosus (MVZ catalog # 172871) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon aerosus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon aerosus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 116 a 108 c 57 g 120 t ORIGIN 1 atgaaaattt tacggaaaaa tcacccgcta cttaaaattg ttaatcactc atttattgac 61 ctcccaaccc catccaacat ctcatcttga tggaattttg ggtcactact aggtgtatgc 121 ctaataatcc aaattctgac tggactattc ctagccatac actacacacc agacacaacc 181 acagcattct cttcagtcgc ccacatttgc cgagatgtaa attacgggtg attaatccgc 241 tatcttcacg ctaacggagc ctccatattc tttatctgcc tttttatcca cgtaggtcga 301 ggaatctact acggctccta tgtcctttca gaaacctgaa acatcggtat catcctattc 361 cttacaacta tagcaacagc atttgtagga tatgtactac c // LOCUS AKOMTCYTBW 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.jelskii mitochondrial cytochrome b gene, 5' end. ACCESSION M35716 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.jelskii (MVZ catalog # 173084) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon jelskii Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon jelskii. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 109 a 117 c 55 g 117 t 3 others ORIGIN 1 atggcaatta tacgaaaaac ccacccctta cttaaaatta ttaaccactc attcattgat 61 ctacctaccc cctgcaatat ttcttcttga tggaacttcg gctcactcct aggcatttgc 121 ctaattattc aaattctaac tggccttttc ctagcaatac actacacatc cgacacagcc 181 acagcattct cctccgttac ccacatctgc cgagatgtaa actacggctg actcatccgc 241 taccttcatg ctaacggagc atcyatattt tttatttgyc tattyattca cgtaggacga 301 ggaatctact atggctccta catgctctca gagacctgaa acatcggaat cattctactc 361 ctggccacaa tagccacagc attcgtagga tacgttcttc c // LOCUS AKOMTCYTBX 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.jelskii mitochondrial cytochrome b gene, 5' end. ACCESSION M35715 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.jelskii (MVZ catalog # 173083) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon jelskii Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon jelskii. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 107 a 118 c 57 g 119 t ORIGIN 1 atggcaatta tacgaaaaac ccacccctta cttaaaatta ttaaccactc attcattgat 61 ctacctaccc cctgcaatat ttcttcttga tggaacttcg gctcactcct aggcatttgc 121 ctaattattc aaatcctaac tggccttttc ctagcaatac actacacatc cgacacagcc 181 acagcattct cctccgttac ccacatctgc cgagatgtaa actacggctg actcatccgc 241 taccttcatg ctaacggggc atctatattt tttatttgtc tatttattca cgtaggacgg 301 ggaatctact atggctccta catgctctca gagacctgaa acatcggaat cattctactc 361 ctggccacaa tagccacagc attcgtagga tacgttcttc c // LOCUS AKOMTCYTBY 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.jelskii mitochondrial cytochrome b gene, 5' end. ACCESSION M35714 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.jelskii (MVZ catalog # 173073 and 173074) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon jelskii Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon jelskii. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 103 a 121 c 60 g 117 t ORIGIN 1 atggcaatta tacgaaagac ccacccctta cttaaaatca ttaaccactc gttcattgat 61 ctacctaccc cctgcaatat ttcttcttga tggaacttcg gctcactcct aggcatttgc 121 ctaattattc aaattctaac tggccttttc ctagccatac actacacatc cgacacagcc 181 acagcattct cctccgtcac ccacatctgc cgagatgtaa attacggctg actcatccgc 241 tatcttcatg ctaacggagc atccatattt tttatttgcc tattcattca cgtggggcga 301 gggatctact atggctccta catgctctca gagacctgaa acatcggaat tgttctactc 361 ctgaccacaa tagccacagc attcgtagga tacgttctcc c // LOCUS AKOMTCYTBZ 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION A.andinus mitochondrial cytochrome b gene, 5' end. ACCESSION M35713 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE A.andinus (MVZ catalog # 174062 and 174063) liver mitochondrial DNA. ORGANISM Mitochondrion Akodon andinus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Akodon andinus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 112 a 115 c 55 g 119 t ORIGIN 1 atgacaatca tacgaaaaac ccacccttta cttaaaatca ttaatcactc atttattgac 61 ctacccgccc cctccaacat ctcatcctga tggaacttcg gctcactcct tggcatttgc 121 ttgataattc aaattttaac gggccttttc ctagccatac attatacgtc cgacacagct 181 acagcattct cctccgtcac ccatatctgc cgagatgtaa attacggatg actaatccgc 241 tacttacacg ccaatggagc atctatattc tttatctgca tatttattca tgtgggacga 301 ggaatttact acggctccta catactctca gaaacctgaa acatcggaat tattctactc 361 ctgaccacaa tagccacagc atttgtaggc tatgtccttc c // LOCUS BOMMTCYTBA 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION B.amoenus mitochondrial cytochrome b gene, 5' end. ACCESSION M35711 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE B.amoenus (MVZ catalog # 172878) liver mitochondrial DNA. ORGANISM Mitochondrion Bolomys amoenus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Bolomys amoenus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 122 a 107 c 50 g 122 t ORIGIN 1 atgaaaatta ttcgaaaaaa tcacccacta cttaaaatta ttaaccattc attcattgat 61 cttccaaccc catctaacat ttcatcctga tgaaacttcg gatccttact aggcatctgc 121 ctaataatcc aaatcctaac aggcctattt ctagcaatac actatacatc agatacaacc 181 acagcattct cctcagtcgc ccatatctgc cgagacgtga actatggatg attaatccgc 241 tacctacacg ccaacggagc ctccatattc ttcatctgtc ttttcatcca tgtaggccga 301 ggaatttact atggctccta tgtcctatca gaaacctgaa atattggtat tatcctattt 361 tttacaacta tagcaacagc attcgtagga tatgtccttc c // LOCUS BOMMTCYTBB 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION B.amoenus mitochondrial cytochrome b gene, 5' end. ACCESSION M35712 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE B.amoenus (MVZ catalog # 172879) liver mitochondrial DNA. ORGANISM Mitochondrion Bolomys amoenus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Bolomys amoenus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 122 a 106 c 50 g 123 t ORIGIN 1 atgaaaatta ttcgaaaaaa tcacccacta cttaaaatta ttaaccattc attcattgat 61 cttccaaccc catctaacat ttcatcctga tgaaacttcg gatccttact aggcatctgc 121 ctaataatcc aaatcctaac aggcctattt ctagcaatac actatacatc agatacaacc 181 acagcattct cctcagtcgc ccatatctgc cgagacgtga actatggatg attaatccgc 241 tacctacacg ccaacggagc ctccatattc ttcatctgtc ttttcattca tgtaggccga 301 ggaatttact atggctccta tgtcctatca gaaacctgaa atattggtat tatcctattt 361 tttacaacta tagcaacagc attcgtagga tatgtccttc c // LOCUS MCXMTCYTBA 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION M.mimus mitochondrial cytochrome b gene, 5' end. ACCESSION M35709 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE M.mimus (MVZ catalog # 171745 and 171746) liver mitochondrial DNA. ORGANISM Mitochondrion Microxus mimus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Microxus mimus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 118 a 116 c 54 g 113 t ORIGIN 1 atgaaaatcc tacgaaaaaa ccacccgcta cttaaaattg ttaaccactc atttattgac 61 ctaccaaccc catctaacat ctcttcctga tgaaattttg gatccttact aggtgcatgc 121 ctaataatcc aaattctaac cggcctattc ctagccatac actatacatc agacacaacc 181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc 241 tatctacacg ctaacggagc ctctatattc tttatctgcc tcttcattca cgtaggccga 301 ggaatttact acggctccta tgccctttca gagacctgaa acattggcat catcctattc 361 cttacaacta tagcaacagc attcgtaggg tacgtactcc c // LOCUS MCXMTCYTBB 401 bp ds-DNA ORG 02-AUG-1990 DEFINITION M.mimus mitochondrial cytochrome b gene, 5' end. ACCESSION M35710 KEYWORDS cytochrome b; mitochondrial DNA. SOURCE M.mimus (MVZ catalog # 171752 and 171753) liver mitochondrial DNA. ORGANISM Mitochondrion Microxus mimus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Hesperomyini; Microxus mimus. REFERENCE 1 (bases 1 to 401) AUTHORS Smith,M.F. and Patton,J.L. TITLE Variation in mitochondrial cytochrome b sequence in natural populations of South American akodontine rodents (Muridae: Sigmodontinae) JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Smith, 28-JUN-1990. FEATURES from to/span description pept 1 > 401 cytochrome b BASE COUNT 116 a 116 c 56 g 113 t ORIGIN 1 atgaaaatcc tacgaaaaaa ccacccgcta cttaaaattg ttaaccactc atttattgac 61 ctaccaaccc catctaacat ctcttcctga tgaaattttg gatccttact aggtgcatgc 121 ctaataatcc aaattctaac cggcctattc ctagccatac actatacgtc agacacaacc 181 acagcattct cttcagtagc ccacatttgc cgagatgtaa actacggatg attaatccgc 241 tatctgcacg ctaacggagc ctctatattc tttatctgcc tcttcattca cgtaggccga 301 ggaatttact acggctccta tgccctttca gagacctgaa acattggcat catcctattc 361 cttacaacta tagcaacagc attcgtaggg tacgtactcc c // LOCUS CRCIGHA 482 bp ds-DNA VRT 02-AUG-1990 DEFINITION Caiman Ig germline heavy-chain gene V region, exons 1 and 2. ACCESSION M34918 KEYWORDS germline; immunoglobulin heavy-chain; variable region. SOURCE Caiman DNA. ORGANISM Caiman crocodylus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Reptilia; Archosauria; Crocodylia; Crocodylidae; Alligatorinae. REFERENCE 1 (bases 1 to 482) AUTHORS Litman,G.W., Berger,L., Murphy,K., Litman,R., Podlaski,F., Hinds,K., Jahn,C.L., Dingerkus,G. and Erickson,B.W. TITLE Phylogenetic diversification of immunoglobulin V-H genes JOURNAL Dev. Comp. Immunol. 8, 499-514 (1984) STANDARD simple staff_review FEATURES from to/span description pept 1 46 Ig H-chain V region precursor, exon 1 137 / 441 Ig H-chain V region precursor, exon 2 sigp 1 46 Ig H-chain V region signal peptide 137 147 Ig H-chain V region signal peptide matp 148 / 441 Ig H-chain V region, exon 2 IVS 47 136 Ig H-chain V region intron A signal 444 450 heptamer recombination signal signal 474 482 nonamer recombination signal BASE COUNT 104 a 139 c 131 g 108 t ORIGIN 1 atgggacttg ggctccacct gctcgtcctg gccgcagctc tgcaaggtat gtcatcttcc 61 cacaggtcgg ggagggaatc attgcactga tgctgcccta atactagatc ctcttggatc 121 ctaccttgct tttcaggtgc ttggtcccag gtgcagctgg tggagtccgg aggagatgtg 181 aggaaacctg gaaactcttt gcgcctctcc tgcaaagcct cggggttcac cttcggtggc 241 tacggcatgt tctgggtccg ccaggctcct gggaaggggc tggactgggt ggctacaatt 301 aatactgatg gatccagcca gtggtactcc ccggccgttc aggggaaatt caccatctcc 361 agaggcaact cccagaacat gctgtacctg cagatgagca gcctcacacc tgaggacaca 421 gccacgtatt actgcgccag agacacagtg actcaaaccc tattcacggc aatacaaaat 481 cc // LOCUS MUSIGKACZ 203 bp ds-DNA ROD 02-AUG-1990 DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds. ACCESSION M35640 KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, clone 1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 203) AUTHORS Nishi,M., Kataoka,T. and Honjo,T. TITLE Preferential rearrangement of the immunoglobulin kappa-chain joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985) STANDARD simple staff_review FEATURES from to/span description pept / 9 > 203 Ig kappa-chain V-J region (AA at 9) recomb 181 182 V-region end/J-region start BASE COUNT 45 a 57 c 50 g 50 t 1 others ORIGIN 1 ggtaccngca gaagtcagat gcctccccca aactatggat ttattacaca tccaacctgg 61 ctcctggagt cccagctcgc ttcagtggca gtgggtctgg gaactcttat tctctcacaa 121 tcagcagcat ggagggtgaa gatgctgcca cttattactg ccagcagttt actagttccc 181 catggacgtt cggtggaggc acc // LOCUS MUSIGKADA 204 bp ds-DNA ROD 02-AUG-1990 DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds. ACCESSION M35641 KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, clone 2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 204) AUTHORS Nishi,M., Kataoka,T. and Honjo,T. TITLE Preferential rearrangement of the immunoglobulin kappa-chain joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985) STANDARD simple staff_review FEATURES from to/span description pept / 9 > 204 Ig kappa-chain V-J region (AA at 9) recomb 181 182 V-region end/J-region start BASE COUNT 45 a 58 c 50 g 50 t 1 others ORIGIN 1 ggtaccngca gaagtcagat gcctccccca aactatggat ttattacaca tccaacctgg 61 ctcctggagt cccagctcgc ttcagtggca gtgggtctgg gaactcttat tctctcacaa 121 tcagcagcat ggagggtgaa gatgctgcca cttattactg ccagcagttt actagttccc 181 catcggacgt tcggtggagg cacc // LOCUS MUSIGKADB 203 bp ds-DNA ROD 02-AUG-1990 DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds. ACCESSION M35642 KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, clone 3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 203) AUTHORS Nishi,M., Kataoka,T. and Honjo,T. TITLE Preferential rearrangement of the immunoglobulin kappa-chain joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985) STANDARD simple staff_review FEATURES from to/span description pept / 9 > 203 Ig kappa-chain V-J region (AA at 9) recomb 181 182 V-region end/J-region start BASE COUNT 42 a 59 c 53 g 48 t 1 others ORIGIN 1 ggtaccngca gaagccagga tcctcgccca aaccctggat ttatgacaca tccaacctgg 61 cttctggatt ccctgctcgc ttcagtggca gtgggtctgg gacctcttgc tctctcataa 121 tcagcagcat ggaggctgaa gatgctgcca cttattactg ccatcagcgg agtagttacc 181 catggacgtt cggtggaggc acc // LOCUS MUSIGKADC 203 bp ds-DNA ROD 02-AUG-1990 DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds. ACCESSION M35643 KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, clone 4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 203) AUTHORS Nishi,M., Kataoka,T. and Honjo,T. TITLE Preferential rearrangement of the immunoglobulin kappa-chain joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985) STANDARD simple staff_review FEATURES from to/span description pept / 9 > 203 Ig kappa-chain V-J region (AA at 9) recomb 181 182 V-region end/J-region start BASE COUNT 47 a 54 c 52 g 50 t ORIGIN 1 ggtaccaaca gaagccaggg cagtctccta aactgctgat atactatgca tccaatcgct 61 acactggagt ccctgatcgc ttcactggca gtggatatgg gacggatttc actttcacca 121 tcagcactgt gcaggctgaa gacctggcag tttatttctg ccagcaggat tatagctctc 181 cgtggacgtt cggtggaggc acc // LOCUS MUSIGKADD 205 bp ds-DNA ROD 02-AUG-1990 DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds. ACCESSION M35644 KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, clone 5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 205) AUTHORS Nishi,M., Kataoka,T. and Honjo,T. TITLE Preferential rearrangement of the immunoglobulin kappa-chain joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985) STANDARD simple staff_review FEATURES from to/span description pept / 9 > 205 Ig kappa-chain V-J region (AA at 9) recomb 181 182 V-region end/J-region start BASE COUNT 50 a 55 c 52 g 48 t ORIGIN 1 ggtacctgca caagcctggc cagtctccac agctcctcat ctatgggatt tccaacagat 61 tttctggggt gccagaccgc ttcagtggca gtggttcagg gacagatttc acactcaaga 121 tcagcacaat aaagcctgag gacttgggaa tgtattactg cttacaaggt acacattcag 181 cctccggacg ttcggtggag gcacc // LOCUS MUSIGKADE 203 bp ds-DNA ROD 02-AUG-1990 DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds. ACCESSION M35645 KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, clone 6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 203) AUTHORS Nishi,M., Kataoka,T. and Honjo,T. TITLE Preferential rearrangement of the immunoglobulin kappa-chain joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985) STANDARD simple staff_review FEATURES from to/span description pept / 9 > 203 Ig kappa-chain V-J region (AA at 9) BASE COUNT 46 a 51 c 53 g 50 t 3 others ORIGIN 1 ggtaccnnnc gaaaccaggg cagtctccta aactgctgat ttactgggca tccactaggg 61 aatctggggt ccctgatcgc ttcacaggca gtggatctgg gactcatttc actctcacca 121 tcagcagtgt gaaggctgaa gacctggcag tttattactg tcagcaatat tatagctatc 181 cgtggacgtt cggtggaggc acc // LOCUS MUSIGKADF 205 bp ds-DNA ROD 02-AUG-1990 DEFINITION Mouse Ig rearranged kappa-chain gene V-J region, partial cds. ACCESSION M35646 KEYWORDS immunoglobulin light-chain; joining exon; kappa-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, clone 1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 205) AUTHORS Nishi,M., Kataoka,T. and Honjo,T. TITLE Preferential rearrangement of the immunoglobulin kappa-chain joining region J-kappa-1 and J-kappa-2 segments in mouse spleen DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6399-6403 (1985) STANDARD simple staff_review FEATURES from to/span description pept / 9 > 205 Ig kappa-chain V-J region (AA at 9) recomb 181 182 V-region end/J-region start BASE COUNT 46 a 60 c 51 g 42 t 6 others ORIGIN 1 ggtaccnnnn nnaaacagga cagccaccca aactcctcat ctatcttagc atccaaccta 61 gaatctgggg tccctgccag cttcagtggc agtgggtctg ggacaggctt caccctcaac 121 atccatcctg tggaggagtg aggatgctgc aacctattac ttccagcaca gtagggagct 181 tcctcggacg ttcggtggag gcacc // LOCUS MUSPROTCP 912 bp ss-mRNA ROD 02-AUG-1990 DEFINITION Mouse proteoglycan core protein mRNA, complete cds. ACCESSION M34603 KEYWORDS proteoglycan core protein. SOURCE Mouse (strain C57B1/6, haplotype H-2b) T-lymphocyte cell line EL4.E1, cDNA to mRNA, clone 154. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 912) AUTHORS Elliott,J.F., Pohajdak,B., Talbot,D., Miller,C.L., Helgason,C.D., Bleackley,R.C. and Paetkau,V.H. TITLE Cloning and characterization of an inducible proteoglycan core protein mRNA from a mouse T-lymphocyte cell line JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.H.Paetkau, 29-MAY-1990. FEATURES from to/span description pept 43 501 proteoglycan core protein precursor sigp 43 117 proteoglycan core protein signal peptide matp 265 498 proteoglycan core protein BASE COUNT 252 a 200 c 194 g 266 t ORIGIN 1 gatttgctaa tccagaggct gagtggagcc gagctggtca ggatgcaggt tcccgtcggc 61 agcaggcttg tcctggctct cgccttcgtc ctggtttggg gatcttcagt gcaaggttat 121 cctgctcgga gagccaggta ccagtgggtc cgctgcaaac cgaatggctt ttttgctaac 181 tgcatcgagg agaagggacc acagtttgac ctaatagatg aatccaataa catcggccct 241 cccatgaata atcctgtttt gatggaagga ccctcaaaag atttcatctc caattatgat 301 gactatgggt caggttcggg ctccggctct ggctccggct ctggctcggg ttccggctcc 361 ggaagtggct tcctaggtga catggaatgg gaataccagc caacagatga aagcaatatt 421 gtctatttca actataagcc ttttgacagg attctcactg agcaaaacca agaccaacca 481 gaagacgatt ttattatatg aatgtgacgg tctctgtctc cccacctcca tgtggaacaa 541 tgtattcagt atacttagtg taccacgttt aaatgaccag tctcaggata aagagtttta 601 cagaaaattt aaaatgcctg gaaaagactc ttgaatcctg ttaccccttt cctcattaac 661 tcgtaaggaa ttatgcttta atgctgttac ctatcttgtt gttctggaaa atgcctgcat 721 ttatgtgtat tgaatcaaca tttaagaaat taacacacac ccccattatt atacaataac 781 tttcaaagcc atactggttt tgaaaatttt aatttgatag caagttgatg aacatctttc 841 atacctaaag tgttcaggac ccaactcgca ttgtggatta caatatattc ctttatgtga 901 taaaagaaat cg // LOCUS PVBENH 316 bp ds-DNA VRL 02-AUG-1990 DEFINITION Papovavirus BKV enhancer/regulatory region. ACCESSION M35737 KEYWORDS . SOURCE Papovavirus BKV (strain BKV-P2, from prototype Gardner ATCCVR 837) DNA, passed in primary human embryonic kidney cells. ORGANISM Papovavirus BKV Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Papillomavirus. REFERENCE 1 (bases 1 to 316) AUTHORS Berg,D.T., Walls,J.D. and Grinnell,B.W. TITLE A variant enhancer/regulatory region from a cloned human prototype BK virus genome JOURNAL Nucleic Acids Res. 16, 9057-9057 (1988) STANDARD simple staff_review FEATURES from to/span description rpt 92 191 tandem repeat copy A rpt 192 290 tandem repeat copy B BASE COUNT 101 a 79 c 85 g 51 t ORIGIN 1 cctcagaaaa agcctccaca cccttactac ttgagagaaa gggtggaggc agaggcggcc 61 tcggcctctt atatattata aaaaaaaagg ccacagggag gagctgctta cccatggaat 121 gcagccaaac catgaccgca ggaaggaaag tgcatgactg ggcagccagc cagtggcagt 181 taataagcag ccagacagac atttgcttac ccatggaatg cagccaaacc atgacctcag 241 gaaggaaagt gcatgactgg gcagccagcc agtggcagtt aataagcagc agccagacag 301 acatgttttg cgagcc // LOCUS HUMCREB 1026 bp ss-mRNA PRI 02-AUG-1990 DEFINITION Human active transcription factor CREB mRNA, complete cds. ACCESSION M34356 KEYWORDS active transcription factor CREB; alternative splicing. SOURCE Human T-cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1026) AUTHORS Berkowitz,L.A. and Gilman,M.Z. TITLE Two distinct forms of active transcription factor CREB (cAMP response element binding protein) JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5258-5262 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Z.Gilman, 16-MAY-1990. FEATURES from to/span description pept 1 1026 active transcription factor CREB-B pept 1 261 active transcription factor CREB-A 304 1026 active transcription factor CREB-A BASE COUNT 332 a 249 c 233 g 212 t ORIGIN 1 atgaccatgg aatctggagc cgagaaccag cagagtggag atgcagctgt aacagaagct 61 gaaaaccaac aaatgacagt tcaagcccag ccacagattg ccacattagc ccaggtatct 121 atgccagcag ctcatgcaac atcatctgct cccaccgtaa ctctagtgca gctgcccaat 181 gggcagacag ttcaagtcca tggagtcatt caggcggccc agccatcagt tattcagtct 241 ccacaagtcc aaacagttca gtcttcctgt aaggacttaa aaagactttt ctccggaaca 301 cagatttcaa ctattgcaga aagtgaagat tcacaggagt cagtggatag tgtaactgat 361 tcccaaaagc gaagggaaat tctttcaagg aggccttcct acaggaaaat tttgaatgac 421 ttatcttctg atgcaccagg agtgccaagg attgaagaag agaagtctga agaggagact 481 tcagcacctg ccatcaccac tgtaacggtg ccaactccaa tttaccaaac tagcagtgga 541 cagtatattg ccattaccca gggaggagca atacagctgg ctaacaatgg taccgatggg 601 gtacagggcc tgcaaacatt aaccatgacc aatgcagcag ccactcagcc gggtactacc 661 attctacagt atgcacagac cactgatgga cagcagatct tagtgcccag caaccaagtt 721 gttgttcaag ctgcctctgg agacgtacaa acataccaga ttcgcacagc acccactagc 781 actattgccc ctggagttgt tatggcatcc tccccagcac ttcctacaca gcctgctgaa 841 gaagcagcac gaaagagaga ggtccgtcta atgaagaaca gggaagcagc tcgagagtgt 901 cgtagaaaga agaaagaata tgtgaaatgt ttagaaaaca gagtggcagt gcttgaaaat 961 caaaacaaga cattgattga ggagctaaaa gcacttaagg acctttactg ccacaaatca 1021 gattaa // LOCUS ACNRRDA 1486 bp ss-rRNA RNA 02-AUG-1990 DEFINITION A.lignieresii 16S ribosomal RNA. ACCESSION M35017 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE A.lignieresii (strain ATCC 19393) rRNA. ORGANISM Actinobacillus lignieresii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 1486) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1486 16S ribosomal RNA BASE COUNT 384 a 303 c 448 g 321 t 30 others ORIGIN 1 attnannant ttnatcangg ctcagattna ncgctggcgg caggcttaac acatgcaagt 61 cgaacggtaa cgggaagggn ncttncnttc tttgccgacg agtggcggac gggtgagtaa 121 tgcttgggaa tntggcttat ggagggggat aactacggga aactgtagct aataccgcgt 181 aatatcttag gattaaaggg tgggactttc gggccacctg ccataagatg agcccaagtg 241 ggattaggta gttggttagg taaaggctga ccaagccgac gatctctagc tngtctgaga 301 ggatgaccag ccacactnga actgagacac ggtccagact cctacgggag gcagcagtng 361 ggaatattgc acaatggggg gaaccctnat gcagccatgc cgcgtgaatg aagaaggcct 421 tcgggttgta aagttctttc ggtagcgagg aaggtatcaa atttnataga tttngtaatt 481 gacgttaact acagaagaag caccggctaa ctccgtgcca gcagccgcgg taatacggag 541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggtt nattaagtga 601 gatgtgaaag ccccgggctt aacctgggna ttgcatttca tactggtcaa ctagagtact 661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc 721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcgaaagc gtggggagca 781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct 841 ntnagtctgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag 901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg 961 atgnaacgcg aagaacctta cctactcttg acatccatgg aatcttgtag agatacgaga 1021 gtgccttcgg gaaccatgag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat 1081 gttgggttna gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attnggtcgg 1141 gaactcnaag gagactgccg gtgatnaacc ggaggaaggt ggggatgacg tcaagtcatc 1201 atggccctta cgagtagggc tacacacgtg ctacaatggc gtatacagag ggaagcaaga 1261 tggcgacatg gagcaaatct cacaaagtac gtctaagtcc ggattggagt ctgcaactcg 1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc 1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc 1441 ttaaccgcaa ggggggcgtt taccacggta tgattcatga ctnggg // LOCUS BNORRDA 1532 bp ss-rRNA RNA 02-AUG-1990 DEFINITION D.nodosus 16S ribosomal RNA. ACCESSION M35016 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE D.nodosus (strain 198A; ATCC 27521) rRNA. ORGANISM Bacteroides nodosus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Anaerobic gram-negative straight, curved and helical rods; Bacteroidaceae. REFERENCE 1 (bases 1 to 1532) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1532 16S ribosomal RNA BASE COUNT 410 a 330 c 465 g 324 t 3 others ORIGIN 1 aactgaagag tttgattctg gctcagattg aacgctggcg gcatgcttaa cacatgcaag 61 tcgaacgggg ttatgtagct tgctatgtaa cctagtggcg gacgggtgag taatatatag 121 gaatctgcct tatggtgggg gacaacgtat ggaaacgtac gctaataccg cataagattg 181 aagaatgaaa gcgggggctc gaaagacctc gcgccgtaag atgagcctat atcggattag 241 ctagttggtg gggtaagagc ctaccaaggc gacgatccgt agctggtttg agagaatgat 301 cagccacatc gggactgaga cacggcccna actcctacgg gaggcagcag tggggaatat 361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt 421 gtaaagcact tttattagtg aagaacggtg catggttaat acccatgcaa ttgacattag 481 ctaaggaaaa agcaccggct nactccgtgc cagcagccgc ggtaatacgg agggtgcaag 541 cgttattcgg aatgactggg cgtaaagcgc acgcaggtgg ttttataagt caggtgtgaa 601 atccctgggc tcaacctagg aattgcattt gaaaccgtaa gactagagta tgtcagagga 661 aggcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga 721 aggcagcctt ctggggcaat actgacactc atgtgcgaaa gcgtgggtag caaacaggat 781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggtgttgggt agaaatactc 841 ggtaccgaag ctaacgcaat aagttgaccg cctggggagt acggccgcaa ggttgaaact 901 caaagaaatt gacggggacc cgcacaagcg gtggagcatg tggtttaatt cgatgcaacg 961 cgaagaacct taccaggctt tgacatccag agaattctac agagatgtgg gagtgccttc 1021 gggaactctg agacaggtgt tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt 1081 aagtcccgca acgagcgcaa cccttatcct tacttgccag cacgtaaagg tgggaactat 1141 aaggagactg ccggtgacaa accggaggaa ggtggggatg acgtcaagtc atcatggccc 1201 ttacagcctg ggctacacac gtgctacaat gggcggtaca gacggtagcc aacccgcgag 1261 ggggagccaa tctgagaaaa ccgttcgtag tccggattgc agtctgcaac tcgactgcat 1321 gaagtcggaa tcgctagtaa tcgcgaatca gcacgtcgcg gtgaatacgt tcccgggtct 1381 tgtacacact gcccgtcaca ccatgggagt ttgttgcacc agaagtaggt agcttaaccg 1441 taaggagggc gcttaccacg gtgtggccga tgactggggt gaagtcgtaa caaggtaacc 1501 gtaggggaac ctgcggttgg atcacctcct tt // LOCUS CAJSAPA 3974 bp ds-DNA BCT 02-AUG-1990 DEFINITION C.fetus surface array protein (sapA) gene, complete cds. ACCESSION J05577 KEYWORDS surface array protein. SOURCE C.fetus fetus DNA. ORGANISM Campylobacter fetus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 3974) AUTHORS Blaser,M.J. and Gotschlich,E.C. TITLE Surface array protein of Campylobacter fetus: Cloning and gene structure JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.J.Blaser, 01-JUN-1990. FEATURES from to/span description pept 24 2825 surface array protein (sapA) binding 12 15 ribosomal binding site signal 2846 2870 transcription termination signal BASE COUNT 1293 a 641 c 785 g 1255 t ORIGIN 1 ctattttatt aaggagtcct taaatgttaa acaaaacaga tgtttcaatg ctttatatca 61 ctattatggg tatggcaagt gagggtgatg gtaataagta ttggttagat tatgccaata 121 ataatagttt aggagtttca agtttagcta atattatgct tgatagtcca ggggcggcta 181 aattctttgg tgattctctt ttagcaggta atgaaaaaga ttttgttact aagatatata 241 gtatagcttt aggtaatact agtgatgttg atggcattaa ttattggact aaggcaataa 301 ctggcggtgg agaatttact gatagtaagg gtaatgttat tagtgttgct agtttaagca 361 agggtgattt aataggtgct atgattaact ctatggttaa tggcggtagt gctgagtcta 421 aggctatatt tgaggctaag gcagctgcta gtgattactt tgccgatgct actttggtaa 481 gggatattag tggattagat gagggtacta cttctaagtt aattagcgag attaatagtg 541 ctagtgatct tgataaggtt aagagtgaga ttgatgcttt gaagagtgag ctacctaatc 601 cgggtagtac ttatgatctt acagagggta atgataattt aaagggtact gatttagacg 661 atacttttaa tgggactaca tatgtaggta atggtactaa taagagtact cttagtgcat 721 ttgataagac tagatggtcg gtgcttggga gagatacgtt gaatgcgata tttactgcaa 781 taacacgcgc tgcgctacta actgatcaag ctgaattgat aatcactaaa aggcgtacaa 841 acgtagaaaa tatcaatata atttcagatc tagaaacaag tggcgatttc gttttcaacg 901 gttatgaaaa agtaggattt aacgtacttg gcgatatagt tagctttgct accgacgcat 961 ctaaaagcgt aaatgtagaa acaacaggaa cgataactgc tttcaccgca gccggaacag 1021 gcaaagtcga tgttgtcgcc ggtaaaatct ctgcccttac ggccgattcg cgaacaagcg 1081 taaatttaac tgctacaaac gacactatca cattaaccag tgcaaacgct gctactagtg 1141 tgaatttaaa acagcggcag gccaaagacg ctacaataac atccgcaatg cagcaaaaat 1201 ataacaatag acgcaacagg attgcaacta taacttcagc tacggctgta gagaatttga 1261 cagttaaaca tgcaactaac gtagcgctaa atggtggcat ggataaactt gcaacagtta 1321 ctcttgacaa tgctgcttta actgctgcaa tagatataaa atctgcaagc acactaaatt 1381 taataaattc aagtgttaac ggaccaaaac atctctacag cagcaaaaga cgttactgta 1441 aatttaagcg agctgctgct aaggttaaat taaacacaac tgctgctacg gatcaaactg 1501 tgactcttaa agctaatgcc acagacaata gtcttgaatt tgatagcgca actgcaaaaa 1561 ctacgtcagt cactgctagt ggtagtggaa aaactttggt tatcaaaggt gctgaagtag 1621 aaactctagt taatatagac actactgcgt tcaatgcgct gcagagtgtt agcttcggca 1681 aaactggtca aggcggcaaa tttagcgtaa aaacaggtac aggcgatgat aaaatagagt 1741 ttgtaggcac aactttaact gaaggttcag taatcgatgc gccgggtaat gatactatag 1801 ctatgaaatc agctgctttg actagtgcaa atttcactat gatcaaaaac atagagaatg 1861 ttgctattag cgacgcggtg gctacggctg accttagctc ttctgcgttt aaaaactcag 1921 tcatcataac tactaaagag gctgccgata cgactttaac tataaataaa gaccaagtaa 1981 ttaattttac tgcggcagat gcaggctcag ttaagcttat tacagtgaaa ctaaatgatg 2041 taacagcgct aatgattgtt gttaaaatag tattagatgc tgccgctaaa gatactaata 2101 tagctttggg cacagcagca gccgataaag ctcttgtaat tgatacgggt atagaaactc 2161 ttaatatcac ttcgttagtt aaagctacaa gccctgaaac tacagcaaat accgtaaatg 2221 ctaaattgac agatgtaact tctatcatta tagatgggat gcaaattact ttaggtcatg 2281 ccggcactgc tggaactgac tatagtaaag tttctatgat agatgcaagt gctcttaaag 2341 ctggtcttac atttgatgca agtgctataa cattaggagc taatgctact ataaaaggtg 2401 gctctggagc tgatagtata actgttaaag gcggaaatat cgttgtagat ttagttgctg 2461 gcggagacga tactataact cttaaaaaag gtgctgagaa aacagatatt acaaccgtta 2521 ataactttaa tgctggcgat aagatagata tagctgatgc caaaaatggt actttcacat 2581 ttaataaaat tacaatgaat agcgatgcta atttagatga ttatatcact aaagcagtag 2641 caggagacgg atcaacaaac tctgctgtaa gctactcaca taatggatat acttatgtag 2701 tagttgatgg aacagctgga gctactttca ctaaagctac tgatacaatt atcaaacttt 2761 caggtacgct agatcttaaa cttagcggag ataatgttgt agttgatgat ggaagcgtaa 2821 tttaagcaat agtaattgct tgataaaaaa gccttaattt ttaaggcttt taatttgtct 2881 aaatttatct tagcttaggc gtatttattt gaaatctcgt cagacggatc gtcttaaaat 2941 aaggttttat agagacaaag tatattttag accaaacctt aataaatagt agatagtttt 3001 ttgctaagag gagctaggag aactagctcc ttgaatattt ttaaatttat ttgagttgta 3061 ttttaaattc aacatatctg ttttttgatg ttttgatcga aatttgtcta ttgtatccat 3121 cctaaaaatt tcagctcttc aactcttttt gtgtgaaatt gttctaataa ggttgataaa 3181 ttttcatatt tgtcacttgg aacaagaaaa ctctcgtctc cgactctttt caccaatttt 3241 tgtttataaa actttaaatt taaaggtttt ataagatcta tatatgaact aaactcatct 3301 ttatagctat tagctatact aaaagcgatc tgcttgaact ctttgaaatc aacgtttaaa 3361 attttacaaa gcggcttaaa atcgtttagc ctaatgtttt gacgcttacc gtttataggt 3421 aaatagctct caaatccttt tgccgtacta taaaatcccg ctgctcatca cgtgcataca 3481 taggcgcatc agcactatat ctccgcttgt tataagagaa agatttttca gcatgcagat 3541 cttccgtatt ttatgataac gctataaaca tatatcttca aaagctcctt cttttcagta 3601 tctgatagta gctctttgct tatgcgctta aaaagctttt cgctggttgt gtcatattta 3661 tttatactag tagtcctaga tacgaggcaa actcgacgac gttataatga tgattttcta 3721 gtttatcaaa tcttttaatt acgtagtgat actcgtcgtc atctcctgcc actacagcac 3781 ttcacggaac tctgattttc aacgtatttt tagcaaaact caaatgaaga tgctcattta 3841 tcagcgaggt gcggaaaata acttttgaac tggttggatt tgatctgatt ttcgaatatg 3901 gttttaaaat atagtttgaa gcagcgtctt gatggattat ctttcgttca aaatcgatat 3961 ttactaattt tttg // LOCUS CDBRRDA 1470 bp ss-rRNA RNA 02-AUG-1990 DEFINITION C.hominis 16S ribosomal RNA. ACCESSION M35014 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE C.hominis (strain ATCC 16826) rRNA. ORGANISM Cardiobacterium hominis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods. REFERENCE 1 (bases 1 to 1470) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1470 16S ribosomal RNA BASE COUNT 373 a 331 c 466 g 296 t 4 others ORIGIN 1 actgaagagt ttgattctgg ctcagattga acgctggcgg catgcttaac acatgcaagt 61 cgaacggaaa cgatggagct tgctccaggc gtcgagtggc gaacgggtga gtaacgcatg 121 ggaatctgcc ttttgctggg ggataacgta gggaaactta cgctaatacc gcataacacc 181 taagggtgaa agcgggggac cgaaagcctc gcggcaagag atgagcccat gttggattag 241 ctagttggtg gggtaaaggc ctaccaaggc gacgatccat agctggtctg agaggatgat 301 cagccacact gggactgaga cacggcccag actccnacgg gaggcagcag tggggaatat 361 tggacnatgg ggggaaccct gatccagcaa tgccgcgtgt gtgaagaagg ccttcgggtt 421 gtaaagcact ttcagtaggg aggaaaggtg cgtagttaat acctgcgcaa ttgacgttac 481 ctacagaaga agcaccggct aactccgtgc cagcagccgc ggtaatacgg agggtgcgag 541 cgttattcgg aattactggg cgtaaagcgc acgcaggcgg ttgcccaagt cagatgtgaa 601 agccccgggc ttaacctggg aactgcattt gaaactgggc gactagagta tgaaagagga 661 aagcggaatt tccagtgtag cagtgaaatg cgtagatatt ggaaggaaca ccgatggcga 721 aggcagcttt ctgggtcgat actgacgctc atgtgcgaaa gcgtggggag caaacaggat 781 tagataccct ggtagtccac gccctaaacg atgtcaacta ggcgtcgggt tgttaaagac 841 tcggtgcngg agctaacgca ttaagttgac cgcctgggga gtacggccgc aaggttgaaa 901 ctcaaagaaa ttgacgggga cccgcncaag cggtggagca tgtggtttaa ttcgatgcaa 961 cgcgaagaac cttaccaggc cttgacatcc taggaacttg gcagagatgc cttggtgcct 1021 tcgggaacct agagacaggt gttgcatggc tgtcgtcagc tcgtgtcgtg agatgttggg 1081 ttaagtcccg caacgagcgc aacccttatc cttacttgcc agcactttgg gtgggaactg 1141 taaggagact gccggtgaca aaccggagga aggtggggat gacgtcaagt catcatggcc 1201 cttacggcct gggctacaca cgtgctacaa tggtcggtac agacggtagc gaagccgcga 1261 ggtggagcca atctgagaaa gccgatcgta gtccggattg cagtctgcaa ctcgactgca 1321 tgaagtcgga atcgctagta atcgcgaatc agcatgtcgc ggtgaatacg ttcccgggtc 1381 ttgtacacac tgcccgtcac accatgggag tttgttgcac cagaagcagg tagcttaact 1441 taggagggcg cttgccacgg tgtggccgat // LOCUS GLARGBA 763 bp ds-DNA INV 02-AUG-1990 DEFINITION G.lamblia 5.8S rRNA gene. ACCESSION M35013 KEYWORDS 5.8S ribosomal RNA; large subunit rRNA; small subunit rRNA. SOURCE G.lamblia DNA, clones B4 and C4. ORGANISM Giardia lamblia Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae. REFERENCE 1 (bases 1 to 763) AUTHORS Edlind,T.D., Sharetzsky,C. and Cha,M.E. TITLE Ribosomal RNA of the primitive eukaryote Giardia lamblia: Large subunit domain I and potential processing signals JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.D.Edlind, 04-JUN-1990. FEATURES from to/span description rRNA 177 321 5.8S ribosomal RNA rRNA < 1 146 small subunit ribosomal RNA rRNA 372 > 763 large subunit ribosomal RNA BASE COUNT 121 a 293 c 293 g 56 t ORIGIN 1 gtacacaccg cccgtcgctc ctaccgactg ggcgcggcgg cgagcgcccc ggacgcgcga 61 agggccgcga gcccccgcgc ctggaggaag gagaagtcgt aacaaggtat ccgtaggtga 121 acctgcggat ggatccctcg cgcgccccgc gcgtcgcccc cgcggcccgg tcggcccccg 181 aacgccccgc cggcggatgc ctcggcccgg gcggcgacga agagcgcggc ggagcgcgag 241 acgcggtgcg gacccgcccg ccccgagaag caccgaccct cgaacgcagc gcgccccggc 301 gccgccgcct cggcgcccgc cgcgtgccgc gcgccgcgcc gcagagagcg ccccggggcg 361 gtcccgccgg gctgcgcggc ccgaggcggc gggggcgacg ggcggaactt aagcatatca 421 gtacgccccg gaggagaaac caaccgggat tccccgtagc ggcgagcgac gcgggaggag 481 cccgccccga aggcgcgctg tggggcgcag gcgcaggccc gccgcgaggg ggcccgaggg 541 ccccgcccga gagggtgcaa gccccgtacg gcggccggcg cctgcgcgcg agtagcgctg 601 cttgagcgtg cagcgcggaa gggaggcgcg gcccttccaa ggctaaatac gccccgggac 661 cgatagcgga ccaagtagcg cgagcgaacg gtgaaaagga cgcctgcggc cgctcaaaag 721 acctgaaccg ccgccgcgcc gccggccccg tctcgaaaca cgg // LOCUS HEIRRDA 1472 bp ss-rRNA RNA 02-AUG-1990 DEFINITION H.influenzae 16S ribosomal RNA. ACCESSION M35019 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE H.influenzae (strain ATCC 33391) rRNA. ORGANISM Haemophilus influenzae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 1472) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1472 16S ribosomal RNA BASE COUNT 376 a 300 c 459 g 311 t 26 others ORIGIN 1 attgaagagt ttgatcatgg ctcagattga acgctggcgg caggcttaac acatgcaagt 61 cgaacggtag caggagaaag cttgctttct tgctgacgag tggcggacgg gtgagtaatg 121 cttgggaatc tggcttatgg agggggataa cgacgggaaa ctgtcgctaa taccgcgtat 181 tatcggaaga tgaaagtgcg ggactgagag gccgcatgcc ataggangag cccaagtggg 241 attaggtagt tggtggggta aatgcctacc aagcctncga tctctagctg gtctgagang 301 atgaccagcc acactggaac tgagacacgg tccagactcc tacgggaggc agcagtgggg 361 aatattgcgc natgggggga accctgacgc agccatgccg cgtgaatgaa gaaggccttc 421 gggttgtaaa gttctttcgg tattgaggaa ggttgatgtg ttaatagcac atcaaattga 481 cgttaaatac agaagaagca ccggctnact ccgtgccagc agccgcggta atacggaggg 541 tgcgagcgtt aatcggaata actgggcgta aagggcacgc aggcggttat ttaagtgagg 601 tgtgaaagcc ccgggcttaa cctgggnatt gcatttcaga ctgggtaact agagtacttt 661 agggaggggt agaattccac gtgtagcggt gaaatgcgta gagatgtgga ggaataccga 721 aggcgaaggc agccccttgg gaatgtactg acgctcatgt gcgaaagcgt ggggagcaaa 781 caggattaga taccctggta gtccacgctg taaacgctgt cgatttnggg gttggggttn 841 aactctggca cccgtagcta acgtgataaa tcgaccgcct ggggagtacg gccgcaaggt 901 taaaactcaa atgaattgac gggggccngc ncaannggtg gagcatgtgg tttaattcga 961 tgnaacgcga agaaccttac ctactcttga catcctaaga agagctcaga gatgagcttg 1021 tgccttcggg aacttagaga caggtgctgc atggctgtcg tcagctcgtg ttgtgaaatg 1081 ttgggttnag tcccgcaacg agcgcaaccc ttatcctttg ttgccagcga cttggtcggg 1141 aactcaaagg agactgccag tgataaactg gaggaaggtn gggatgacgt caagtcatca 1201 tggcccttac gagtagggct acacacgtgc tacaatggcg tatacagagg gaagcgaagc 1261 tgcgaggtgg agcgaatctc ataaagtacg tctaagtccg gattggagtc tgcaactcga 1321 ctccatgaag tcggaatcgc tagtaatcgc gaatcagaat gtcgcggtga atacgttccc 1381 gggccttgta cacaccgccc gtcacaccnt gggagtgggt tgtaccagaa gtagatagct 1441 naacctnnnn gagggcgtnn ncnacggtan ga // LOCUS HUMIBSUB 3110 bp ss-mRNA PRI 02-AUG-1990 DEFINITION Human integrin beta-5 subunit mRNA, complete cds. ACCESSION M35011 KEYWORDS integrin. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3110) AUTHORS Suzuki,S., Huang Z,-S. and Tanihara,H. TITLE Cloning of an integrin beta subunit exhibiting high homology with integrin beta-3 subunit JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5354-5358 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Suzuki, 05-JUN-1990. FEATURES from to/span description pept 30 2420 integrin beta-5 subunit BASE COUNT 726 a 809 c 887 g 688 t ORIGIN 1 cgcgccgccg ctgagggagg cgccccacca tgccgcgggc cccggcgccg ctgtacgcct 61 gcctcctggg gctctgcgcg ctcctgcccc ggctcgcagg tctcaacata tgcactagtg 121 gaagtgccac ctcatgtgaa gaatgtctgc taatccaccc aaaatgtgcc tggtgctcca 181 aagaggactt cggaagccca cggtccatca cctctcggtg tgatctgagg gcaaaccttg 241 tcaaaaatgg ctgtggaggt gagatagaga gcccagccag cagcttccat gtcctgagga 301 gcctgcccct cagcagcaag ggttcgggct ctgcaggctg ggacgtcatt cagatgacac 361 cacaggagat tgccgtgaac ctccggcccg gtgacaagac caccttccag ctacaggttc 421 gccaggtgga ggactatcct gtggacctgt actacctgat ggacctctcc ctgtccatga 481 aggatgactt ggacaatatc cggagcctgg gcaccaaact cgcggaggag atgaggaagc 541 tcaccagcaa cttccggttg ggatttgggt cttttgttga taaggacatc tctcctttct 601 cctacgcggc accgaggtac cagaccaatc cgtgcattgg ttacaagttg tttccaaatt 661 gcgtcccctc ctttgggttc cgccatctgc tgcctctcac agacagagtg gacagcttca 721 atgaggaagt tcggaaacag agggtgtccc ggaaccgaga tgcccctgag gggggctttg 781 atgcagtact ccaggcagcc gtctgcaagg agaagattgg ctggcgaaag gatgcactgc 841 atttgctggt gttcacaaca gatgatgtgc cccacatcgc attggatgga aaattgggag 901 gcctggtgca gccacacgat ggccagtgcc acctgaacga ggccaacgag tacactgcat 961 ccaaccagat ggactatcca tcccttgcct tgcttggaga gaaattggca gagaacaaca 1021 tcaacctcat ctttgcagtg acaaaaaacc attatatgct gtacaagaat tttacagccc 1081 tgatacctgg aacaacggtg gagattttag atggagactc caaaaatatt attcaactga 1141 ttattaatgc atacaatagt atccggtcta aagtggagtt gtcagtctgg gatcagcctg 1201 aggatcttaa tctcttcttt actgctacct gccaagatgg ggtatcctat cctggtcaga 1261 ggaagtgtga gggtctgaag attggggaca cggcatcttt tgaagtatca ttggaggccc 1321 gaagctgtcc cagcagacac acggagcatg tgtttgccct gcggccggtg ggattccggg 1381 acagcctgga ggtgggggtc acctacaact gcacgtgcgg ctgcagcgtg gggctggaac 1441 ccaacagcgc caggtgcaac gggagcggga cctatgtctg cggcctgtgt gagtgcagcc 1501 ccggctacct gggcaccagg tgcgagtgcc aggatgggga gaaccagagc gtgtaccaga 1561 acctgtgccg ggaggcagag ggcaagccac tgtgcagcgg gcgtggggac tgcagctgca 1621 accagtgctc ctgcttcgag agcgagtttg gcaagatcta tgggcctttc tgtgagtgcg 1681 acaacttctc ctgtgccagg aacaagggag tcctctgctc aggccatggc gagtgtcact 1741 gcggggaatg caagtgccat gcaggttaca tcggggacaa ctgtaactgc tcgacagaca 1801 tcagcacatg ccggggcaga gatggccaga tctgcagcga gcgtgggcac tgtctctgtg 1861 ggcagtgcca atgcacggag ccgggggcct ttggggagat gtgtgagaag tgccccacct 1921 gcccggatgc atgcagcacc aagagagatt gcgtcgagtg cctgctgctc cactctggga 1981 aacctgacaa ccagacctgc cacagcctat gcagggatga ggtgatcaca tgggtggaca 2041 ccatcgtgaa agatgaccag gaggctgtgc tatgtttcta caaaaccgcc aaggactgcg 2101 tcatgatgtt cacctatgtg gagctcccca gtgggaagtc caacctgacc gtcctcaggg 2161 agccagagtg tggaaacacc cccaacgcca tgaccatcct cctggctgtg gtcggtagca 2221 tcctccttgt tgggcttgca ctcctggcta tctggaagct gcttgtcacc atccacgacc 2281 ggagggagtt tgcaaagttt cagagcgagc gatccagggc ccgctatgaa atggcttcaa 2341 atccattata cagaaagcct atctccacgc acactgtgga cttcaccttc aacaaatcct 2401 acaatggcac tgtggactga tgtttccttc tccgaggggc tggagcgggg atctgatgaa 2461 aaggatcaga ctgaaacgcc ttgcacggct gctcggcttg atcacagctc cctaggtagg 2521 caccacagag aagaccttct agtgagcctg ggccaggagc ccacagtgcc tgtacaggaa 2581 ggtgcctggc catgtcacct ggctgctagg ccagagccat gccaggctgc gtccctccga 2641 gcttgggata aagcaagggg accttggcgc tctcagcttt ccctgccaca tccagcttgt 2701 tgtcccaatg aaatactgag atgctgggct gtctctccct tccaggaatg ctgggccccc 2761 agcctggcca gacaagaaga ctgtcaggaa gggtcggagt ctgtaaaacc agcatacagt 2821 ttggcttttt tcacattgat catttttata tgaaataaaa agatcctgca tttatggtgt 2881 agttctgagt cctgagactt ttctgcgtga tggctatgcc ttgcacacag gtgttggtga 2941 tggggctgtt gagatgcctg ttgaaggtac atcgtttgca aatgtgagtt tcctctcctg 3001 tccgtgtttg tttagtactt ttataatgaa aagaaacaag attgtttggg attggaagta 3061 aagattaaaa ccaaaagaat ttgtgtttgt ctgataaaaa aaaaaaaaaa // LOCUS KINRRDAA 1474 bp ss-rRNA RNA 02-AUG-1990 DEFINITION K.indologenes 16S ribosomal RNA. ACCESSION M35015 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE K.indologenes (strain ATCC 25869) rRNA. ORGANISM Kingella indologenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1474) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1474 16S ribosomal RNA BASE COUNT 375 a 322 c 451 g 315 t 11 others ORIGIN 1 ttgaactgaa gagtttgatt ctggctcaga ttgaacgctg gcggcatgct taacacatgc 61 aagtcgaacg agggaagcag cttgctgcng acctngtggc ggacgggtga gtaacgtata 121 ggaatctacc ttggggtggg ggataacgta tggaaacgta cgctaatacc acataacact 181 tacgagtcaa agcgggggat cttcggacct cgcgccctta gatgagccta tattggatta 241 gctagttggt rgggtaaagg cctaccaagg cgacgatcca tagctggttt gagaggatga 301 tcagccacat cgggactgag acacggcccn nactcctacg ggaggcagca gtggggaata 361 ttggncnatg ggggcaaccc tgatccagca atgccgcgtg tgtgaagaag gccttngggt 421 tgtaaagcac tttcgttagt gaagaaaggt ttatggttaa tacctgtgaa tttgacatta 481 gctaaagaag aagcaccggc taactccgtg ccagcagccg cggtaatacg gagggtgcna 541 gcgttattcg gaatgactgg gcgtaaagcg cacgcaggcg gttatttaag tcagatgtga 601 aagccccggg cttaacctgg gnattgcatt tgaaactggg taactagagt atgtgagagg 661 aaggcggaat ttccagtgta gcagtgaaat gcgtagagat tggaaggaac accgatggcg 721 aaggcagcct tctggggcaa tactgacgct catgtgcgaa agcgtgggta gcaaacagga 781 ttagataccc tggtagtcca cgccctaaac gatgtcaact aggtgttggg cacttaagtg 841 ctcggtaccg cagctaacgc attaagttga ccgcctgggg agtacggccg caaggttgaa 901 actcaaagaa attgacgggg acccgcacaa gcggtggagc atgtggttta attcgatgca 961 acgcgaagaa ccttaccagg ccttgacatc cagagaagtt accagagatg gtttcgtgcc 1021 ttcgggaact ctgagacagg tgttgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg 1081 gttaagtccc gcaacgagcg caacccttat ccttacttgc cagcacttcg ggtgggaact 1141 ataaggagac tgccggtgac aaaccggagg aaggtgggga tgacgtcaag tcatcatggc 1201 ccttacggcc tgggctacac acgtgctaca atggtcggta cagacggttg ccaacccgcg 1261 agggggagct aatctgagaa agccgatcgt agtccggatt gcactctgca actcgagtgc 1321 atgaagtcgg aatcgctagt aatcgcgaat cagaacgtcg cggtgaatac gttcccgggt 1381 cttgtacaca ctgcccgtca caccatggga gtgggttgca ccagaagtag ctagcttatc 1441 agggcggtta ccacggtgtg attcangact aggg // LOCUS MUSTNFBG 3219 bp ds-DNA ROD 02-AUG-1990 DEFINITION Mouse tumor necrosis factor-beta (lymphotoxin) gene. ACCESSION Y00137 KEYWORDS lymphotoxin; signal peptide; tumor necrosis factor. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3219) AUTHORS Gray,P.W., Chen,E., Tang,W.-L. and Ruddle,N. TITLE The murine tumor necrosis factor-beta (lymphotoxin) gene sequence JOURNAL Nucleic Acids Res. 15, 3937-3937 (1987) STANDARD simple automatic REFERENCE 2 (sites for [1]) AUTHORS Fashena,S.J., Tang,W.-L., Sarr,T. and Ruddle,N.H. TITLE The murine lymphotoxin gene promoter: Characterization and negative regulation JOURNAL J. Immunol. 145, 177-183 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1632 1727 lymphotoxin precursor, exon 2 (first expressed exon) 1811 1910 lymphotoxin precursor, exon 3 2135 2547 lymphotoxin precursor, exon 4 sigp 1632 1727 lymphotoxin signal peptide 1811 1813 lymphotoxin signal peptide matp 1814 1910 lymphotoxin 2135 2547 lymphotoxin pre-msg 685 3121 lymphotoxin mRNA and introns (minor alt.) pre-msg 857 3121 lymphotoxin mRNA and introns (minor alt.) pre-msg 865 3121 lymphotoxin mRNA and introns (minor alt.) pre-msg 1114 3121 lymphotoxin mRNA and introns (minor alt.) pre-msg 1179 3121 lymphotoxin mRNA and introns (major alt.) IVS 1286 1622 put. intron I IVS 1728 1810 intron II IVS 1911 2134 intron III signal 1153 1158 put. TATA-box BASE COUNT 724 a 971 c 744 g 780 t ORIGIN 1 tgaaagctcc ctctgtacag agcattggaa gcctggggtg tacatttggg gttacatgat 61 cttggggttc taagagaata cccccaaatc atcttccaga cctggaacat tctaggacag 121 ggttctcaac cttcctaact ccatgaccct ttaatacagt tcctcatgtt gtggtgaccc 181 caaccataca attattttcg ttgctatttc ataactgtaa tttcgctgct attatgaaca 241 taatgtaaat atttgtttta aatagaggtt tgccaaagag accttgccac aggttgagac 301 tgccgctcca gagagtaagg gacacattaa aattgttaca caccagatcc cccaaatttg 361 gggagagggc actgtaatgg aacttcttga cattaaactg gcagataaac tggcagaaaa 421 aaaaaaaaaa aagctgggca gtggtggcac acacctttaa tcccagcact tgggaggcag 481 aggcaggcgg atttctgagt tctaggccag cctggtcgac agagtgagtt tcaggacagc 541 cagggctaca cagagaaacc ctgtctcgaa aaaagcaaaa aaaaaaaaaa aaaactggca 601 gatgaccaga aaatacagat atattggaat aactgtgact tgaaccccca aagacaagag 661 aggaaatagg cctgaagggg cggcaggcat gtcaagcatc cagagccctg ggttcgaacc 721 tgaaaaaaca aaggtgccgc taaccacatg tggcttcgga gccctccaga catgaccatg 781 atcgacagag agggaaatgt gcagagaagc ctgtgagcag tcaagggtgc agaagtgata 841 taaaccatca ctcttcaggg aaccaggctt ccagtcacag cccagctgta ccctctccac 901 gaattgctcg gccgttcact ggaactcctg ggcctgaccc agctccctgc tagtccctgc 961 ggcccacagt tccccggacc cgactccctt tcccagaacg cagtagtcta agcccttagc 1021 ctgcggttct ctcctaggcc ccagcctttc ctgccttcga ctgaaacagc agcatcttct 1081 aagcctgggg cttccccaag ccccagcccc gacctagaac ccgcccgctg cctgccacac 1141 tgccgcttcc tctataaagg gacccgacgc cagcgcccag gaccccgcac agcaggtgag 1201 cctctcctac cctgtctcct tgggcttacc ctggtatcag gcatccctca ggatccccag 1261 ccttaatggg tctggtcctc ctgtcgtggc tttgattttt ggtctgttcc tgtggcggcc 1321 ttatcagtct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct 1381 ctctctctct ctctttctct ctctctgcct ctgttagcca ttgtctgttt ctatggtgga 1441 gctttcctct tcccctctgt ctctccttat ccctgctcac ttcagggttc ccctgcctgt 1501 ccccttttct gtctgtcgcc ctgtctctca gggtggctgt ctcagctggg aggtaaggtc 1561 tgtcttcctc tgtgtgcccc gcctccgcta cacacacaca ctctctctct ctctctcagc 1621 aggttctcca catgacactg ctcggccgtc tccacctctt gagggtgctt ggcacccctc 1681 ctgtcttcct cctggggctg ctgctggccc tgcctctagg ggcccaggtg aggcagcaag 1741 agattggggg tgctggggtg gcctagctaa ctcagagtcc tagagtcctc tccactctct 1801 tctgtcccag ggactctctg gtgtccgctt ctccgctgcc aggacagccc atccactccc 1861 tcagaagcac ttgacccatg gcatcctgaa acctgctgct caccttgttg gtaaacttct 1921 gcctccagag gagaggtcca gtccctgcct tttgtcctac ttgcccaggg gcccaggcga 1981 tcttcccatc tccccacacc aacttttctt accctaaggg caggcacccc actcccaatc 2041 tccctaccaa ccatcccact tgtccagtgc ctgctcctca gggatgggga cctctgatct 2101 tgatagcccc ccaatgtctt gtgcctcttc ccagggtacc ccagcaagca gaactcactg 2161 ctctggagag caagcacgga tcgtgccttt ctccgacatg gcttctcttt gagcaacaac 2221 tccctcctga tccccaccag tggcctctac tttgtctact cccaggtggt tttctctgga 2281 gaaagctgct cccccagggc cattcccact cccatctacc tggcacacga ggtccagctc 2341 ttttcctccc aatacccctt ccatgtgcct ctcctcagtg cgcagaagtc tgtgtatccg 2401 ggacttcaag gaccgtgggt gcgctcaatg taccaggggg ctgtgttcct gctcagtaag 2461 ggagaccagc tgtccaccca caccgacggc atctcccatc tacacttcag ccccagcagt 2521 gtattctttg gagcctttgc actgtagatt ctaaagaaac ccaagaattg gattccaggc 2581 ctccatcctg accgttgttt caagggtcac atccccacag tctccagcct tccccactaa 2641 aataacctgg agctctcacg ggagtctgag acacttcagg ggactacatc ttccccaggg 2701 ccactccaga tgctcagggg acgactcaag cctacctaga agttcctgca cagagcaggg 2761 tttttgtggg tctaggtcgg acagagacct ggacatgaag gagggacaga catgggagag 2821 gtggctggga acaggggaag gttgactatt tatggagaga aaagttaagt tatttattta 2881 tagagaatag aaagagggga aaaatagaaa gccgtcagat gacaactagg tcccagacac 2941 aaaggtgtct cacctcagac aggacccatc taagagagag atggcgagag aattagatgt 3001 gggtgaccaa ggggttctag aagaaagcac gaagctctaa aagccagcca ctgcttggct 3061 agacatccac agggaccccc tgcaccatct gtgaaaccca ataaacctct tttctctgag 3121 attctgtctg cttgtgtctg tcttgcgttg ggggagaaac ttcctggtct ctttaaggag 3181 tggagcaggg gacagaggcc tcagttggcc atgggatcc // LOCUS NGORRDAA 1478 bp ss-rRNA RNA 02-AUG-1990 DEFINITION N.denitrificans 16S ribosomal RNA. ACCESSION M35020 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE N.denitrificans (strain ATCC 14686) rRNA. ORGANISM Neisseria denitrificans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1478) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1478 16S ribosomal RNA BASE COUNT 380 a 331 c 448 g 308 t 11 others ORIGIN 1 aacataagag tttgatcctg gctcagattg aacgctggcg gcatgcttta cacatgcaag 61 tcgaacggca gcggagaagt gcttgcactt ctgccggcga gtggcgaacg ggtgagtaac 121 atattggaac gtaccgagta atgggggata actaatcgaa agattagcta ataccgcata 181 cgtcctgagg gagaaagcgg gggatcttcg gacctcgcgt tattcgagcg gccaatatct 241 gattagctag ttggtggggt aaaggcctac caaggcgacg atcagtagcg ggtctgagag 301 gatgatccgc cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg 361 gaattttgga cgatgggcgc aagcctgatc cagccatgcc gcgtgtatga agaaggcctt 421 cgggttgtaa agtacttttg tnngggaaga aaagggattt gctaataccg aatcctcatg 481 acggtacctn aagaataagc accggctaac tacgtgccag nanccgcggn natacgtagg 541 gtgcgagcgt taatcggaat tactgggcgt aaagcgagcg cagacggtta cttaagcagg 601 atgtgaaatc cccgagctca acttgggaac tgcgttctga actgggtagc tagagtatgt 661 cngagggggg tagaattcca cgtgtagcag tgaaatgcgt agagatgtgg aggaataccg 721 atggcgaagg cagccccctg ggataatact gacgttcatg ctcgaaagcg tgggtagcaa 781 acaggattag ataccctggt agtccacgcc ctaaacgatg tcgattagct gttggggtac 841 ttgatacctt agtagcgtag ctaacgcgtg aaattgaccg cctggggagt acggtcgcaa 901 gattaaaact caaaggaatt gacggggacc cgcacaagcg gtggatgatg tggattaatt 961 cgatgcaacg cgaagaacct tacctggtct tgacatgtac ggaatctccc agagacggga 1021 gngtgccttc gggagccgta acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag 1081 atgttgggtt aagtcccgca acgagcgcaa cccttgtcat tagttgccat cattcagttg 1141 ggcactctaa tgagactgcc ggtgacaaac cggaggaagg tggggatgac gtcaagtcct 1201 catggccctt atgaccaggg cttcacacgt catacaatgg tcggtacaga gggtagccaa 1261 gccgcgaggt ggagccaatc tcacaaaacc gatcgtagtc cggattgcac tctgcaactc 1321 gagtgcatga agtcggaatc gctagtaatc gcaggtcagc atactgcggt gaatacgttc 1381 ccgggtcttg tacacaccgc ccgtcacacc ntgggagtgg gggataccag aagtaggtag 1441 ggtaaccgca aggagcccnc ttaccacggt atacttca // LOCUS PASRRDA 1481 bp ss-rRNA RNA 02-AUG-1990 DEFINITION P.multocida 16S ribosomal RNA. ACCESSION M35018 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE P.multocida (strain NCTC 10322) rRNA. ORGANISM Pasteurella multocida Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 1481) AUTHORS Dewhirst,F.E., Paster,B.J., La Fontaine,S. and Rood,J.I. TITLE Transfer of Kingella indologenes (Snell and Lapage 1976) to the genus Suttonella gen. nov. as Suttonella indologenes comb. nov.; transfer of Bacteroides nodosus (Beveridge 1941) to the genus Dichelobacter gen. nov. as Dichelobacter nodosus comb. nov.; and assignment of the genera Cardiobacterium, Dichelobacter and Suttonella to Cardiobacteriaceae fam. nov. in the gamma division of Proteobacteria based upon 16S ribosomal ribonucleic acid sequence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.E.Dewhirst, 05-JUN-1990. Author address: F.E.Dewhirst Forsyth Dental Center 140 Fenway Boston, MA 02115 FEATURES from to/span description rRNA 1 1481 16S ribosomal RNA BASE COUNT 374 a 305 c 459 g 321 t 22 others ORIGIN 1 attgnagagt ttgancangg ctcagattga ncgctggcgg caggcttaac acatgcaagt 61 cgaacggtag caggaagaaa gcttnctttc tttgctgacg agtggcggac gggtgagtaa 121 tgcttgggna tctggcttat ggagggggat aactgtggga aactgcagct aataccgcgt 181 attctcttag gaggaaaggg tgggaccttn gggccacctg ccataagatg agcccaagtg 241 ggattaggta gttggtgggg taaaggccta ccaagcctnc gatctctagc tngtctgaga 301 ggatgaccag ccacactgga actgagacac ggtccagact cctacgggag gcagcagtgg 361 ggaatattgc gcaatggggg gaaccctgac gcagccatgc cgcgtgaatg aagaaggcct 421 tcgggttgta aagttctttc ggtnatgagg aagggatgtt nttaaataga tagcatcatt 481 gacgttaatt ncagaagaag caccggctna ctccgtgcca gcagccgcgg taatacggag 541 ggtgcgagcg ttaatcggaa taactgggcg taaagggcac gcaggcggac ttttaagtga 601 gatgtgaaat ccccgagctt aacttgggaa ctgcatttca gactgggagt ctagagtnct 661 ttagggaggg ntagaattcc acgtgtagcg gtgaaatgcg tagagatgtg gaggaatacc 721 gaaggcgaag gcagcccctt gggaatgtac tgacgctcat gtgcnaaagc gtggggagca 781 aacaggatta gataccctgg tagtccacgc tgtaaacgct gtcgatttgg ggattgggct 841 ntntgcttgg tgcccgaagc taacgtgata aatcgaccgc ctggggagta cggccgcaag 901 gttaaaactc aaatgaattg acgggggccg cacaagcggt ggagcatgtg gtttaattcg 961 atgnaacgcg aagaacctta cctactcttg acatcctaag aagagctcag agatgagctt 1021 gtgccttcgg gaacttagag acaggtgctg catggctgtc gtcagctcgt gttgtgaaat 1081 gttgggttaa gtcccgcaac gagcgcaacc cttatccttt gttgccagcg attcggtcgg 1141 gaactcaaag gagactgcca gtgannaact ggaggaaggt ggggatgacg tcaagtcatc 1201 atggccctta cgagtagggc tacacacgtg ctacaatggt gcatacagag ggcagcgaga 1261 gtgcgagctt gagcgaatct cagaaagtgc atctaagtcc ggattggagt ctgcaactcg 1321 actccatgaa gtcggaatcg ctagtaatcg caaatcagaa tgttgcggtg aatacgttcc 1381 cgggccttgt acacaccgcc cgtcacacca tgggagtggg ttgtaccaga agtagatagc 1441 ttaaccntcg ggggggcgtt taccacggta tgattcatga c // LOCUS RATSUBB 1124 bp ss-mRNA ROD 02-AUG-1990 DEFINITION Rat F-0-ATPase subunit b mRNA, complete cds. ACCESSION M35052 KEYWORDS F-0-ATPase subunit b. SOURCE Rat (strain Wistar) liver, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1124) AUTHORS Tsurumi,C., Yoshihara,Y., Osaka,F., Yamada,F., Tani,I., Higuti,T., Shimizu,M., Oeda,K., Ohkawa,H., Toda,H., Kakuno,T., Sakiyama,F., Kumatori,A., Tanaka,K. and Ichihara,A. TITLE cDNA cloning and sequencing for the import precursor of subunit b in H+-ATP synthase from rat mitochondria JOURNAL Biochem. Biophys. Res. Commun. 169, 136-142 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Higuti, 04-JUN-1990. FEATURES from to/span description pept 60 830 F-0-ATPase subunit b precursor sigp 60 185 F-0-ATPase subunit b signal peptide matp 186 827 F-0-ATPase subunit b signal 1106 1111 poly-A signal BASE COUNT 323 a 241 c 266 g 294 t ORIGIN 1 gagtgtctcg ggtcggggtc acagggacat cggtaccgct gtctggatct tcgcagacaa 61 tgctgtcccg ggtggtactt tctgccgccg ccacagcggc cccgtgtctg aagaacgcgg 121 ccgtcctggg tccaggggtt ttacaggcaa caagggtctt tcacacagga cagcctcgcc 181 ttgcccctct accacctctt cctgaatatg gaggaaaagt acgtcttggg ctgattcctg 241 aggaattttt ccagttcctt taccctaaga ctggtgtaac aggaccctac gtgcttggaa 301 ctgggcttag cttgtacttt ctatccaaag aaatatatgt gattacccca gagaccttct 361 ctaccatatc agtagtaggg ttgattgtct atgtgattaa gaaatatggc gcctctattg 421 gagaatttat tgacaaactt aatgaggaaa aaattgctca actagaagaa ataaagcagt 481 caagcatgaa gcaaatccag gatgcaatca acagggagaa ggcacagcag gcactggttc 541 agaagcgcca ctacctcttc gatgttcaga ggaataacat tgccctggcc ttggaggtca 601 cttaccggga acgactacat aaagcatata aggaggtaaa gaatcgcctg gactaccata 661 tttctgtaca ggacatgatg cgtcgcaagg agggagagca catgataaac tgggtggaga 721 agcatgtgat acagagcatt tctgcacagc aggaaaagga gaccattgcc aagtgcatcg 781 gagatctaaa gatgcttgca aagaaggctc aagctcagcc aattatgtga atgtgtctat 841 ctcagttgtg atagccagag agggttaact taaatgagaa ctactctact tgaagaactc 901 tttctgtatt gctgtctaat gaaattgtgg tgtacctttc ctaagaaatg atcaaggtcc 961 atttagtggc ttaaccctat ttctgatcct tgctctgtat tcggagttgt ctgatcacat 1021 ttgagtaagc aatttgcagc aactcgctgc ccaacagaaa ttaccaagtt atagtttaaa 1081 cttgtaatta gttaaaacat cttacaataa aatgtttgaa acag // LOCUS RATHDNFNT 1020 bp ss-mRNA ROD 02-AUG-1990 DEFINITION Rat neurotrophin-3 (HDNF/NT-3) mRNA, complete cds. ACCESSION M34643 KEYWORDS neurotrophic factor; neurotrophin-3. SOURCE Rat adult (strain Sprague-Dawley) hippocampus, cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1020) AUTHORS Ernfors,P., Ibanez,C.F., Ebendal,T., Olson,L. and Persson,H. TITLE Molecular cloning and neurotrophic activities of a protein with structural similarities to nerve growth factor: Developmental and topographical expression in the brain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5454-5458 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.P.Ernfors, 18-MAY-1990. FEATURES from to/span description pept 73 849 neurotrophin-3 (HDNF/NT-3) precursor sigp 73 120 neurotrophic factor/neurotrophin-3 (HDNF/NT-3) signal peptide matp 490 846 neurotrophic factor/neurotrophin-3 BASE COUNT 320 a 229 c 226 g 245 t ORIGIN 1 gtcgacgtcc ctggaaatag tcatacggat gccatggtta cttctgccac gatcttacag 61 gtgaacaagg tgatgtccat cttgttttat gtgatatttc ttgcttatct ccgtggcatc 121 caaggcaaca acatggatca aaggagtttg ccagaagact ctctcaattc cctcattatc 181 aagttgatcc aggcggatat cttgaaaaac aagctctcca agcagatggt agatgttaag 241 gaaaattacc agagcaccct gcccaaagca gaggcaccca gagaaccaga gcagggagag 301 gccaccaggt cagaattcca gccgatgatt gcaacagaca cagaactact acggcaacag 361 agacgctaca attcaccccg ggtcctgctg agtgacagca cccctttgga gccccctccc 421 ttatatctaa tggaagatta tgtgggcaac ccggtggtaa ccaatagaac atcaccacgg 481 aggaaacgct atgcagagca taagagtcac cgaggagagt actcagtgtg tgacagtgag 541 agcctgtggg tgaccgacaa gtcctcagcc attgacattc ggggacacca ggttacagtg 601 ttgggagaga tcaaaaccgg caactctcct gtgaaacaat atttttatga aacgaggtgt 661 aaagaagcca ggccagtcaa aaacggttgc agggggattg atgacaaaca ctggaactct 721 cagtgcaaaa cgtcgcaaac ctacgtccga gcactgactt cagaaaacaa caaactcgta 781 ggctggcgct ggatacgaat agacacttcc tgtgtgtgtg ccttgtcaag aaaaatcgga 841 agaacatgaa ttggcatctg tccccacata taaattatta ctttaaatta tatgatatgc 901 atgtagcata taaatgttta tattgttttt atatattata agttgacctt tatttattaa 961 acttcagcaa cccttacagt atataagctt ttttcataat cgggctgctc aaaaaaaaaa // LOCUS NEMG12 3620 bp ds-DNA INV 02-AUG-1990 DEFINITION A.lumbricoides trans-spliced mRNA encoded protein gene 12, complete cds. ACCESSION M33757 KEYWORDS . SOURCE A.lumbricoides suum DNA, clone gene-12. ORGANISM Ascaris lumbricoides Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Rhabditia; Ascaridida; Ascaridina; Ascaridoidea; Ascarididae. REFERENCE 1 (bases 1 to 3620) AUTHORS Denker,J.A. and Nilsen,T.W. TITLE Trans splicing of nematode pre-mRNA in in vitro JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 886 to 1149) AUTHORS Hannon,G.J., Maroney,P.A., Denker,J.A. and Nilsen,T.W. TITLE Trans splicing of nematode pre-messenger RNA in vitro JOURNAL Cell 61, 1247-1255 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.W.Nilsen, 18-APR-1990. [1] Author address: T.W.Nilsen, Case Western Reserve University, Department of Molecular Biology and Microbiology School of Medicine, 2119 Abbington Rd., Cleveland OH, 44106, Ph: (216) 368-3913, FAX: 216-368-3055. FEATURES from to/span description pept 995 1190 gene-12 encoded protein, exon 1 1860 1945 gene-12 encoded protein, exon 2 2280 2432 gene-12 encoded protein, exon 3 2947 2997 gene-12 encoded protein, exon 4 pre-msg < 1 3210 gene-12 mRNA (alt.) pre-msg < 1 3114 gene-12 mRNA (alt.) IVS 1191 1859 gene-12 intron A IVS 1946 2279 gene-12 intron B IVS 2433 2946 gene-12 intron C signal 3097 3102 poly-A signal BASE COUNT 942 a 722 c 867 g 1080 t 9 others ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgagc acactctggc ctctttcagt tgtttcatgg caaggcgcat acgattttct 61 aaacgcgctt gagtttttgt atgtgcataa ctgtgccaac actgccgagg atattcgtgc 121 gctttttgct tgttcttttc cacaaacagc aagtaatgtt gtaaaatacg ccctgtcata 181 atcattggca tcttcctttg atgtcgacga actggtcacc gatggcttct ttactttgcc 241 tctgctcatt cttacccaca ttgtccactt acctgttgac cgtgtttcgg caaccagtgc 301 agtcagatgg tctctatacg cgacgcagca ctcgtcccag tcgtcttcaa gagcaacttt 361 agcgaagatt ttgaatcccg cgggagagcc gtttcctgtt cgacggtgac agagctattg 421 tagagggata tcatcgaaat ctaggtcctc ttgagtgcga atcctctgca aacaggtaac 481 tctagactcc tggcaaaaaa tgagttgatt ctaccaattt tttagttagc agcagttggt 541 tgggacgagc ttgcatgctt cgttcggttg aggaacttta ctgcagttcc gtcagccatg 601 ttgcctttcg gattctgttt ttccgacatc atggctacag gtggcactga agacgacgtt 661 atatttgaca caccttgcac gtcataatgt gtatccgcat tgtgctggtc ggatatcgct 721 gcagcaatta ttgaggaggt atttggtcat ctgggttcat ttgaaacggc gtgcattatg 781 ggaacaagtg gtacgcagtc gttcacgcgc tcctctgatt agttacagaa gtggatgcgt 841 gtccggtctt ttccctcatc ttcttaaccc ttttctttta gaatgtattc aaatgcatgg 901 ttttgtttgg tggttgtatc gaaaatttac gccatgagac actagtgcat tatttaatag 961 tagtaaaaga gatgtttttt tagtatattg aaggatgtca tacgacttgg caacggctga 1021 aagagcagcg tatgctcctt tcttcggcta tatgggagca gcttctgcac agatcttcac 1081 agtgcttggg gctgcgtacg gcactgcgaa atcggccgtt ggaatctcat caatgggagt 1141 gatgaggccg gagctgatca tgaaatccgt tattcctgtc atcatggccg gtcagtgttt 1201 cccgttattc ccactttgat accactgttc agctttacgt ggtgtgatat cggagcatcg 1261 cccagagcag catattactt tattttttga aagggagcat ggcaagaggt tgacgaaagg 1321 caaggaatat ttactgcttc taacttactg aattcatcac tgccaattct ttttgtttta 1381 tatctgactt tcttagagct gaagatgttg tttcaattaa ggtggttgtc gaaatgctgt 1441 aatcgttacc tgttgctgtc tgaaatttgg cttgtacatc cattattctg agattctttt 1501 gtataatttt ccactaactg ggatcaaaac ggacttttat aaatatttgt cggtaactaa 1561 gagtgatttc catttagagt atggaaaaca aagtagggca tctcgcatat ctcatctctt 1621 gcctctggac tgtcacatat tttctttcat ccccaaagaa aggtccggaa tacttggaga 1681 tgatcgtcaa tgggtgatta ttgcctcaag gcatatgaga agtgtgcata aatggaattg 1741 cttaatttgg tgctgtttga tggattctgt ttatgatttg acatattcgg acagatattc 1801 gggtaacata aaagtatttt tattgacaca agctggtcat gaatttacgt gcgttgcagg 1861 tattatcggc atttatgggc ttgttgttgc gatggtttta agagggaagg tgacttccgc 1921 tagcgcagga tacaccctcg acaaagtaag accatttgtt gctctattct cgcaagccca 1981 ccactgtttt cgctgttctt catctgcacg aattttcttt tggtgcataa tgtcttctgc 2041 tgcatcatct attgcgactc aaagaaagga cttcagcttt tctaaccttt cctgcgtaaa 2101 tcatggaaag tcggggggat tgggagggcg ataggtggag ggggagggag cggggaactg 2161 caaagagaga agtgggacta agtaaacatg atgcggagcg agtcgaccta agattgccta 2221 aaggtgattt tatattgagt agtgttttta gaggcggtca ttaacgaggt ccgtttcagg 2281 gatttgccca cctggctgct ggccttacat gtggtctttg cggtctcggt gccggatacg 2341 caatcggaat cgttggagat gcgggtgtca ggggaacagc acagcaaccg cgcttgttcg 2401 tcggcatgat cctcattctt atcttctccg aggtactttt ccgctacttg gttaanctgt 2461 tggaattggt cgatggaatc agcgcgtctt agaatgacta gactgattta aagtggccgg 2521 atatacctca gtgggtttgg tgctcgtctg gctgagttta gagggaggaa gacttctccg 2581 ctgttcctgg attcttattt gaacagtcta agagtttatt cccagaactt ccnnnnaaga 2641 ctaggtagnn aaggggggag agtgggannt aggaaggccg cgaccctaga atgtacaaag 2701 gaaggtcgat ccctcaatga cgattcgcac ttagaggagt gcacataaga cttacgagaa 2761 tttacgatgg caccggaaaa aaggtagaag tcataaggtt gttgaccaga tggttactac 2821 accagcattg tcgcagtcag tcccggaggt gctcatataa agagaaaaca ccagaacaaa 2881 aataacaaca ttgagatgta gtagtcttca catatttctg tcgctggtga tttgaaaatc 2941 gttcaggtac tcggtctata cggaatgatt gtcgcgctca ttcttggcac ctcatagacg 3001 acgggcataa tcggtggctg tacaacaact gtgaaatggc gccgggaggg ttgccaataa 3061 tgttccacta aatgttatac actactgctg catataaata aaagctgctc cgcgaaaatg 3121 ctgagtttaa gtggctgcaa cacattttgg tgctgagatg tcgtcttaac tgcaagtttt 3181 ttgacggaga caatgaagta ttttatgtgc aattattcat gtaacgtcga tgcatttaca 3241 gtcatatgca aatgatctgc gtgaatgtcg cactgtgaac cacaaccaca gaaaaacgaa 3301 gtggatctat catgactgca tttacgacta aaacagatcg gtgcacaaag caagtgaaac 3361 ttagccaagg taaaattaac acagttcttt ctctcaaaca aacatgaagt aaacacatca 3421 cagaaacatc caaactgcct cgaaagtggt tgtagcgcta taacagcaag agatacgaaa 3481 agaataaaag aacccgcatt tatacgataa gcgtatttaa atgcgctttc acgcaacgat 3541 gcaaaaggaa aggctgttgg ctattgtatt gccttcgaga aacaattgac ttagagcaat 3601 gccatattgg cgtctgaacg // LOCUS HUMPPPB1A 3215 bp ss-mRNA PRI 02-AUG-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) mRNA, complete cds. ACCESSION M33689 KEYWORDS protein phosphotyrosyl phosphatase. SOURCE Human placenta, cDNA to mRNA, (library of Clontech), clone lambda-16-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3215) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept 73 1380 protein phosphotyrosyl phosphatase 1B (EC 3.1.3.48) BASE COUNT 818 a 828 c 801 g 768 t ORIGIN Chromosome 20q13.1-q13.2. 1 gcgcgacgcg gcctagagcg gcagacggcg cagtgggccg agaaggaggc gcagcagccg 61 ccctggcccg tcatggagat ggaaaaggag ttcgagcaga tcgacaagtc cgggagctgg 121 gcggccattt accaggatat ccgacatgaa gccagtgact tcccatgtag agtggccaag 181 cttcctaaga acaaaaaccg aaataggtac agagacgtca gtccctttga ccatagtcgg 241 attaaactac atcaagaaga taatgactat atcaacgcta gtttgataaa aatggaagaa 301 gcccaaagga gttacattct tacccagggc cctttgccta acacatgcgg tcacttttgg 361 gagatggtgt gggagcagaa aagcaggggt gtcgtcatgc tcaacagagt gatggagaaa 421 ggttcgttaa aatgcgcaca atactggcca caaaaagaag aaaaagagat gatctttgaa 481 gacacaaatt tgaaattaac attgatctct gaagatatca agtcatatta tacagtgcga 541 cagctagaat tggaaaacct tacaacccaa gaaactcgag agatcttaca tttccactat 601 accacatggc ctgactttgg agtccctgaa tcaccagcct cattcttgaa ctttcttttc 661 aaagtccgag agtcagggtc actcagcccg gagcacgggc ccgttgtggt gcactgcagt 721 gcaggcatcg gcaggtctgg aaccttctgt ctggctgata cctgcctctt gctgatggac 781 aagaggaaag acccttcttc cgttgatatc aagaaagtgc tgttagaaat gaggaagttt 841 cggatggggc tgatccagac agccgaccag ctgcgcttct cctacctggc tgtgatcgaa 901 ggtgccaaat tcatcatggg ggactcttcc gtgcaggatc agtggaagga gctttcccac 961 gaggacctgg agcccccacc cgagcatatc cccccacctc cccggccacc caaacgaatc 1021 ctggagccac acaatgggaa atgcagggag ttcttcccaa atcaccagtg ggtgaaggaa 1081 gagacccagg aggataaaga ctgccccatc aaggaagaaa aaggaagccc cttaaatgcc 1141 gcaccctacg gcatcgaaag catgagtcaa gacactgaag ttagaagtcg ggtcgtgggg 1201 ggaagtcttc gaggtgccca ggctgcctcc ccagccaaag gggagccgtc actgcccgag 1261 aaggacgagg accatgcact gagttactgg aagcccttcc tggtcaacat gtgcgtggct 1321 acggtcctca cggccggcgc ttacctctgc tacaggttcc tgttcaacag caacacatag 1381 cctgaccctc ctccactcca cctccaccca ctgtccgcct ctgcccgcag agcccacgcc 1441 cgactagcag gcatgccgcg gtaggtaagg gccgccggac cgcgtagaga gccgggcccc 1501 ggacggacgt tggttctgca ctaaaaccca tcttccccgg atgtgtgtct cacccctcat 1561 ccttttactt tttgcccctt ccactttgag taccaaatcc acaagccatt ttttgaggag 1621 agtgaaagag agtaccatgc tggcggcgca gagggaaggg gcctacaccc gtcttggggc 1681 tcgccccacc cagggctccc tcctggagca tcccaggcgg gcggcacgcc agacagcccc 1741 ccccttgaat ctgcagggag caactctcca ctccatattt atttaaacaa ttttttcccc 1801 aaaggcatcc atagtgcact agcattttct tgaaccaata atgtattaaa attttttgat 1861 gtcagccttg catcaagggc tttatcaaaa agtacaataa taaatcctca ggtagtactg 1921 ggaatggaag gctttgccat gggcctgctg cgtcagacca gtactgggaa ggaggacggt 1981 tgtaagcagt tgttatttag tgatattgtg ggtaacgtga gaagatagaa caatgctata 2041 atatataatg aacacgtggg tatttaataa gaaacatgat gtgagattac tttgtcccgc 2101 ttattctgct ccctgttatc tgctagatct agttctcaat cactgctccc ccgtgtgtat 2161 tagaatgcat gtaaggtctt cttgtgtcct gatgaaaaat atgtgcttga aatgagaaac 2221 tttgatctct gcttactaat gtgccccatg tccaagtcca acctgcctgt gcatgacctg 2281 atcattacat ggctgtggtt cctaagcctg ttgctgaagt cattgtcgct cagcaatagg 2341 gtgcagtttt ccaggaatag gcatttgcct aattcctggc atgacactct agtgacttcc 2401 tggtgaggcc cagcctgtcc tggtacagca gggtcttgct gtaactcaga cattccaagg 2461 gtatgggaag ccatattcac acctcacgct ctggacatga tttagggaag cagggacacc 2521 ccccgccccc cacctttggg atcagcctcc gccattccaa gtcgacactc ttcttgagca 2581 gaccgtgatt tggaagagag gcacctgctg gaaaccacac ttcttgaaac agcctgggtg 2641 acggtccttt aggcagcctg ccgccgtctc tgtcccggtt caccttgccg agagaggcgc 2701 gtctgcccca ccctcaaacc ctgtggggcc tgatggtgct cacgactctt cctgcaaagg 2761 gaactgaaga cctccacatt aagtggcttt ttaacatgaa aaacacggca gctgtagctc 2821 ccgagctact ctcttgccag cattttcaca ttttgccttt ctcgtggtag aagccagtac 2881 agagaaattc tgtggtggga acattcgagg tgtcaccctg cagagctatg gtgaggtgtg 2941 gataaggctt aggtgccagg ctgtaagcat tctgagctgg cttgttgttt ttaagtcctg 3001 tatatgtatg tagtagtttg ggtgtgtata tatagtagca tttcaaaatg gacgtactgg 3061 tttaacctcc tatccttgga gagcagctgg ctctccacct tgttacacat tatgttagag 3121 aggtagcgag ctgctctgct atgtccttaa gccaatattt actcatcagg tcattatttt 3181 ttacaatggc catggaataa accattttta caaaa // LOCUS HUMPPPB1A1 276 bp ds-DNA PRI 02-AUG-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x. ACCESSION M33688 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 1 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 276) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept / 34 + 243 protein phosphotyrosyl phosphatase 1B, exon x (EC 3.1.3.48) (AA at 34) pre-msg < 1 > 276 PTP1B mRNA and introns IVS < 1 33 PTP1B intron x-1 IVS 244 > 276 PTP1B intron x BASE COUNT 56 a 77 c 69 g 74 t ORIGIN Chromosome 20q13.1-q13.2. 1 ctttagaatc tactagatga ttttctcttt cagacccaag aaactcgaga gatcttacat 61 ttccactata ccacatggcc tgactttgga gtccctgaat caccagcctc attcttgaac 121 tttcttttca aagtccgaga gtcagggtca ctcagcccgg agcacgggcc cgttgtggtg 181 cactgcagtg caggcatcgg caggtctgga accttctgtc tggctgatac ctgcctcttg 241 ctggtaagga ggcctcgcgg gtgccctggg gagctc // LOCUS HUMPPPB1A2 453 bp ds-DNA PRI 02-AUG-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+1. ACCESSION M33687 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 2 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 453) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 236 + 397 protein phosphotyrosyl phosphatase 1B, exon x+1 (EC 3.1.3.48) pre-msg < 1 > 453 PTP1B mRNA and introns IVS < 1 235 PTP1B intron x IVS 398 > 453 PTP1B intron x+1 BASE COUNT 104 a 118 c 111 g 120 t ORIGIN Chromosome 20q13.1-q13.2. 1 ggggaggtcc cagactctta accagatctc ttgtgaatgc attgcctcag ggaggcacca 61 agcctttcat gaggacctgt ccccctgacc cagacacctc ccacccagcc ccacctccaa 121 cactagggat cacatttcag catgagattg ggaggggaca gacatctaac ggtgttatta 181 acgttgccct tgagaattgg acctggctga cttatatctc ctctctggct ttcagatgga 241 caagaggaaa gacccttctt ccgttgatat caagaaagtg ctgttagaaa tgaggaagtt 301 tcggatgggg ctgatccaga cagccgacca gctgcgcttc tcctacctgg ctgtgatcga 361 aggtgccaaa ttcatcatgg gggactcttc cgtgcaggtc agcattgcct ttgtttgaat 421 ccaggtgtga ccattttaac ttttttgtct ttg // LOCUS HUMPPPB1A3 426 bp ds-DNA PRI 02-AUG-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+2. ACCESSION M33686 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 3 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 426) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 46 + 269 protein phosphotyrosyl phosphatase 1B, exon x+2 (EC 3.1.3.48) pre-msg < 1 > 426 PTP1B mRNA and introns IVS < 1 45 PTP1B intron x+1 IVS 270 > 426 PTP1B intron x+2 BASE COUNT 115 a 122 c 109 g 80 t ORIGIN Chromosome 20q13.1-q13.2. 1 gaagtgaaca ctaatagact tccttcctct tgctgctctt tcaaggatca gtggaaggag 61 ctttcccacg aggacctgga gcccccaccc gagcatatcc ccccacctcc ccggccaccc 121 aaacgaatcc tggagccaca caatgggaaa tgcagggagt tcttcccaaa tcaccagtgg 181 gtgaaggaag agacccagga ggataaagac tgccccatca aggaagaaaa aggaagcccc 241 ttaaatgccg caccctacgg catcgaaagg taatatattg ggtccagctt gttggggtga 301 ggggaaatga cttctgttct agaaacacac gctggtactg aaaccctgtg atgcagcctc 361 tgttggcaag cagcgcttcg catccttggg aacagggcgc tggaccaaca cccactccac 421 tggtgg // LOCUS HUMPPPB1A4 732 bp ds-DNA PRI 02-AUG-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+3. ACCESSION M33685 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 4 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 732) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 402 + 597 protein phosphotyrosyl phosphatase 1B, exon x+3 (EC 3.1.3.48) pre-msg < 1 > 732 PTP1B mRNA and introns IVS < 1 401 PTP1B intron x+2 IVS 598 > 732 PTP1B intron x+3 BASE COUNT 164 a 193 c 205 g 170 t ORIGIN Chromosome 20q13.1-q13.2. 1 tctgtagctc taaagaatga gatctggtgt actgatgtgg ccagacattg caattgcagt 61 acatgagaag gcaaatcata cagtagtgtg tacaccagtg agtcctccag ccagataaat 121 cctcacagtg accagtcgcc caggcacctt gtgaacccta ccctgggtgt gggtgctatc 181 tgaagtacct gggggagggg gtgacaagtg gacttcaggc tgatgtggcc ctggcctggc 241 cctccctcca agcagagggg gctggcacgc tggaaggtta acatcatcca actctgtcta 301 cacgtggctt gttttttcct agaattcctg ccacaatagc agcatccttg ccattcattt 361 tctccaaagt gagtacccat ctctgccctc tgattcctca gcatgagtca agacactgaa 421 gttagaagtc gggtcgtggg gggaagtctt cgaggtgccc aggctgcctc cccagccaaa 481 ggggagccgt cactgcccga gaaggacgag gaccatgcac tgagttactg gaagcccttc 541 ctggtcaaca tgtgcgtggc tacggtcctc acggccggcg cttacctctg ctacagggta 601 tgtttccact gacagacgcg ctgggcagat gctcgtgtgc agagagcact ggccgctagc 661 ccgatggtag gattcagttc tgtggtgcat ctgagccagt ctcagaagaa acagatcaag 721 gttttaagtc tg // LOCUS HUMPPPB1A5 365 bp ds-DNA PRI 02-AUG-1990 DEFINITION Human protein phosphotyrosyl phosphatase 1B (PTP1B) gene, exon x+4. ACCESSION M33684 KEYWORDS protein phosphotyrosyl phosphatase. SEGMENT 5 of 5 SOURCE Human DNA, (library of Clontech), clone lambda-10-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 365) AUTHORS Brown-Shimer,S., Johnson,K.A., Lawrence,J.B., Johnson,C., Bruskin,A., Green,N.R. and Hill,D.E. TITLE Molecular cloning and chromosome mapping of the human gene encoding protein phosphotyrosyl phosphatase 1B JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5148-5152 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Hill, 13-APR-1990. FEATURES from to/span description pept + 266 289 protein phosphotyrosyl phosphatase 1B, exon x+4 (EC 3.1.3.48) pre-msg < 1 > 365 PTP1B mRNA and introns IVS < 1 265 PTP1B intron x+3 BASE COUNT 80 a 101 c 88 g 96 t ORIGIN Chromosome 20q13.1-q13.2. 1 tacctcctaa gacttttacg gttttaaata ttttacctct ttccaggtgg catctgagta 61 catcagatgg ttttgcaaaa tgcaaacaat tttttccttg gggatgattt ttggggagag 121 ggggctactg taaaaaataa aaccaaaacc ccctttgctc cctcggaggt tgaagttgcc 181 ggggggtgtg gccggggtca tgcatgaggc gacagcactg caggtgcggg tctgggctca 241 tctgaactgt ttggtttcat tccagttcct gttcaacagc aacacatagc ctgaccctcc 301 tccactccac ctccacccac tgtccgcctc tgcccgcaga gcccacgccc gactagcagg 361 catgc // LOCUS YSCIRA2A 9859 bp ds-DNA PLN 02-AUG-1990 DEFINITION S.cerevisiae IRA2 protein gene, complete cds. ACCESSION M33779 KEYWORDS IRA2 protein. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Unclassified. REFERENCE 1 (bases 1 to 9859) AUTHORS Tanaka,K., Nakafuku,N.M., Tamanoi,F., Kagiro,Y., Matsumoto,K. and Toh-i,A. TITLE IRA2, a second gene of Saccharomyces cerevisiae that encodes a protein with a domain homologous to mammalian ras GTPase activating protein JOURNAL Mol. Cell. Biol. 10, 4303-4313 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Tanaka, 19-APR-1990, for release after publication. University of Chicago Dept. of Biochem and Mol Biol 920 E. 58th Street Chicago, IL 60637 FEATURES from to/span description pept 231 9470 IRA2 protein (IRA2) BASE COUNT 3163 a 1908 c 1735 g 3053 t ORIGIN 1 gagctcctgg gaaacaaaag accgaaaaag cgaaaacaaa atcagaacaa ggcttaagta 61 ctttttcacc aattgtagca aacatttaac cacattttag cacactagca tatagcattg 121 tcctctgtta ttcgttttgc ttttctcctt tagtgttact tttcccccaa cgttacacca 181 ttttttgata tcaactaaac tgtatacatt atctttcttc agggagaagc atgtcccagc 241 ccactaagaa taagaagaaa gaacacggga ccgattccaa gtcatcccgc atgactcgga 301 cgttggttaa tcatattctt tttgaaagaa ttctcccgat ccttccggtg gagtctaatc 361 taagtaccta ttcggaagtg gaagagtatt cctcattcat ttcatgcaga tctgtgctca 421 ttaacgttac cgtttcccga gatgcaaatg ctatggtgga aggcaccttg gagttgatag 481 aatcgcttct tcaagggcac gaaatcattt cagataaggg tagcagtgac gttattgaat 541 caatactgat tatactaaga ttgttaagtg atgcgctaga gtataattgg caaaatcaag 601 aaagccttca ttacaacgac atttcgactc acgtagaaca tgaccaagaa cagaagtaca 661 gaccaaagct taacagtatt ctgcccgact actcgtcgac tcattccaat ggcaacaaac 721 actttttcca ccagagcaaa cctcaggcac tgataccgga actggcatcg aaattgcttg 781 agagttgcgc gaagttgaag ttcaatacaa gaactttgca aattttacaa aatatgatca 841 gtcatgttca tggaaacatt ctaacgactt tgagttcctc gattcttccc cgccacaaat 901 cctatctgac aaggcacaac catccttctc attgtaaaat gattgactct actctaggcc 961 atattctccg atttgtagcg gcttccaatc cgtccgagta ttttgaattt atcagaaaga 1021 gtgtgcaagt gcccgtaaca cagacacaca cgcattcaca ctcccattca cactctttgc 1081 catcttccgt ttataacagc atagtgcccc actttgatct tttcagcttc atctatttaa 1141 gcaagcataa ttttaagaaa tacttggaac tcatcaaaaa cttatcggtg acgttaagga 1201 aaacgattta tcattgccta cttttgcatt acagcgccaa agcaataatg ttttggataa 1261 tggctaggcc tgcggaatat tatgaactct tcaacttatt aaaagataat aacaatgaac 1321 actcgaaatc cttaaacacg ttaaaccata cacttttcga ggagatccat tcgactttta 1381 atgtgaatag catgataacc accaatcaaa atgctcatca aggctcatct tccccttcgt 1441 cctcctcgcc atcgtcacca cctagctcat catcatcgga taacaacaat caaaacataa 1501 tagcaaaatc cttaagtcgt cagctttctc accaccagtc atacattcaa cagcagtctg 1561 aaagaaaact acattcttca tggactacaa actctcaatc ctctacttca ctgtcatctt 1621 caacgtctaa ttcaacaaca actgatttct ctactcacac tcaaccagga gaatatgacc 1681 cttccttacc agatactccc acgatgtcta acatcactat tagtgcatct tcattattat 1741 ctcaaactcc aactccaaca acacaattgc aacagcggtt gaactcagca gctgcagccg 1801 ccgccgcagc tgcttcacca tcgaattcca ccccaactgg atacacagca gagcaacaaa 1861 gtcgcgcttc atacgatgca cacaaaactg gccatactgg taaggattat gacgaacatt 1921 ttttgtctgt cactcgtttg gataatgttt tggagttata cacgcacttt gatgatactg 1981 aggtactacc acacacatcc gtactgaagt ttttaactac tttgacaatg ttcgatattg 2041 acctttttaa tgaattaaac gctacatcat tcaaatatat tcctgattgt actatgcatc 2101 gtccaaaaga aagaacaagt tctttcaata atactgcaca cgagacaggt tccgaaaaga 2161 cttcgggtat aaaacatatt acacaaggct taaagaaatt aacttcttta ccttcctcaa 2221 ccaaaaaaac tgtaaaattt gtgaagatgt tgctaagaaa tttaaatggg aatcaagctg 2281 tatcagatgt tgccctctta gatacaatga gggccttact atcattcttt acaatgactt 2341 ctgcggtctt tctcgtggat agaaacttac cctcagtact ttttgccaag agactcatcc 2401 ccataatggg gacaaattta agcgtcggtc aagactggaa ttcaaaaata aataacagtt 2461 tgatggtttg tttgaaaaaa aactccacca cgtttgttca attacaatta atattcttct 2521 cttcagctat tcaattcgat catgaattat tactggcacg tctgagcatc gatacaatgg 2581 ccaacaattt aaacatgcag aagctatgcc tttatactga aggattcagg atattcttcg 2641 acataccaag taagaaggaa ttgcggaagg caattgcggt taaaatttct aaatttttca 2701 aaacattatt ctccattata gcagatattc ttttacaaga atttccgtat tttgatgagc 2761 aaatcaccga catagttgct tccattcttg acggtacaat tatcaatgag tatggtacga 2821 agaaacattt caaggggagc tcaccctctt tatgttcgac aacccggtca agatcaggat 2881 ctacatctca aagttcaatg acaccagttt ctccgctggg actggatact gatatatgtc 2941 caatgaacac cctgtcttta gttggttcaa gtacttcaag aaattctgac aacgttaatt 3001 cattaaacag ttcaccaaag aacttgtctt ctgatccata cttgtcacat cttgtggccc 3061 caagagcgcg tcatgcttta ggtgggccat ctagtattat aaggaataaa ataccgacta 3121 cattgacttc acctccagga acggaaaaat cttcaccagt acaacgtccg caaacggaaa 3181 gcatcagtgc cacaccaatg gccataacaa attctactcc attatcgtcg gcagcattcg 3241 gaattcgatc gcctttgcag aaaataagaa cgaggcgtta ttccgatgaa agtttaggta 3301 aattcatgaa atcaacaaat aattacattc aagaacattt gataccaaaa gatttgaatg 3361 aagcaactct tcaagatgct agaagaataa tgattaatat tttcagtatt tttaagagac 3421 cgaatagtta cttcatcatt cctcacaata taaactcgaa tttacaatgg gtttcgcagg 3481 attttagaaa tattatgaaa ccgattttcg tcgccatcgt aagtccggat gtagatttac 3541 agaatactgc tcaatcattc atggatacct tattatcgaa tgttattact tatggtgaat 3601 cagatgagaa tatcagtatt gaagggtatc atcttctttg cagttacact gtaacattat 3661 ttgcaatggg ccttttcgat ttgaaaatta ataatgaaaa gcgtcaaatt ctcttggata 3721 taactgtcaa gtttatgaag gttagatcac atttagcagg gatcgcggag gcctcacacc 3781 acatggaata cataagtgat tctgaaaaac tcacctttcc gctgattatg gggactgttg 3841 gtagggccct atttgtttca ttatactcta gtcaacaaaa aattgaaaag actttaaaga 3901 ttgcttacac agagtatctt tctgcaatca attttcatga gaggaatatt gatgatgctg 3961 ataaaacttg ggttcataat attgagtttg tagaagcgat gtgtcatgac aactacacaa 4021 cttctggttc aattgctttc caaaggagga caagaaataa tattttacga tttgctacta 4081 ttcctaacgc tatcttactt gattctatga ggatgatcta taagaagtgg catacttaca 4141 cacacagtaa aagtttagaa aaacaagaac ggaacgactt cagaaatttc gcgggtattt 4201 tagcctcttt gtcgggtatc ctattcatca ataaaaagat attgcaagaa atgtatccat 4261 acctactcga caccgtttca gaattgaaaa aaaatataga ctcttttatc tcaaaacaat 4321 gccaatggtt aaactatccg gatttattaa cgagagaaaa ttcaagagat attctaagtg 4381 tagaactgca tcctttgtct tttaacttac tttttaataa tttgaggctc aagttaaaag 4441 aacttgcttg ttcagactta tcaataccag aaaatgaaag ttcctatgtt ttattagaac 4501 aaataatcaa aatgctgcgg acaatcctag gtcgtgatga tgacaattat gtaatgatgc 4561 ttttttccac agagattgta gatcttattg atttattgac agatgaaata aaaaaaatac 4621 cagcctattg tccaaaatat ctcaaggcaa ttattcaaat gaccaaaatg ttcagtgcct 4681 tgcagcactc agaggttaat ttaggtgtca aaaatcattt tcacgttaaa aataaatggt 4741 tgaggcaaat cactgattgg tttcaagtga gtattgcgag agagtacgat ttcgaaaact 4801 tgtcaaaacc tctaaaagaa atggatttgg taaaaagaga catggatatt ctatacatag 4861 atacggcaat cgaagcttca accgctattg cgtacctcac gagacatact ttcttagaga 4921 ttccacctgc cgcgtcagat cccgaactat ctcgatctag gtctgtgata tttgggtttt 4981 atttcaacat cttaatgaaa ggccttgaaa aaagtagtga tcgtgacaat tacccagtat 5041 tcttgaggca caaaatgagt gtcctcaacg acaatgtaat actttcatta acaaatcttt 5101 caaacaccaa tgttgatgcg agtttgcagt tcaccttacc gatgggctat tccggaaatc 5161 gaaacattag gaatgcattt ttggaggtct tcattaatat cgttacgaac tatcggacat 5221 acacggctaa aactgacctt ggaaaattag aggcagcaga caaatttttg cgatatacga 5281 ttgaacatcc ccagctatcg tcctttggag cagcggtttg tcccgctagc gatattgatg 5341 cttatgctgc tggcttaata aatgcatttg aaacgaggaa tgccacccac attgtagtgg 5401 cacagttgat taaaaatgaa attgaaaaat cttccagacc tacggatatc cttagaagaa 5461 atagctgtgc tacgagatca ttatctatgc tagccaggtc caagggtaac gaatatttga 5521 ttcgcacttt gcaaccatta ctaaaaaaaa ttatccagaa cagagatttt tttgaaattg 5581 agaaactaaa accggaagat tcagatgctg aacgtcaaat agagctcttc gttaaataca 5641 tgaatgaatt attggaatcc atatccaact ccgtatctta ttttccccct cctttatttt 5701 atatttgcca aaacatttat aaagttgcgt gtgaaaaatt tccggatcac gcaattatcg 5761 ccgctgggtc tttcgtgttt ttacggtttt tttgtcctgc tttagtcagc cctgattctg 5821 aaaatatcat agatatttct cacttgagcg aaaagcgtac cttcatcagc ttggctaaag 5881 ttatccaaaa tattgccaat ggctcagaaa atttctccag atggccagct ttgtgttccc 5941 aaaaggattt tcttaaggaa tgtagcgata gaattttcag attcctagct gaactttgta 6001 gaacagatcg cacgatagac atccaagtga gaacagaccc aacgccaatt gcatttgact 6061 atcaattcct tcattccttt gtttaccttt acggtcttga ggtgagaagg aatgtgctaa 6121 atgaagcaaa acatgatgat ggtgacattg atggtgacga tttctataag accacatttt 6181 tacttattga tgatgttctt ggccaattag gccaacctaa aatggaattt tccaatgaaa 6241 taccaatata cataagagaa catatggacg actatccgga actgtatgag ttcatgaata 6301 ggcacgcgtt cagaaacatt gagacttcaa cagcgtacag cccaagcgtt cacgagtcca 6361 cctcaagtga aggcattcca attattacgt taacaatgtc aaatttctca gacagacatg 6421 tggacattga tacagttgct tacaagttct tgcaaattta tgctcgaatc tggaccacca 6481 aacactgttt aataatcgac tgtacagaat ttgacgaggg agggcttgat atgaggaaat 6541 ttatttcttt ggttatggga ctattaccag aagttgcacc caaaaattgt ataggctgtt 6601 actactttaa cgtaaacgag acatttatgg ataattatgg aaaatgtttg gacaaagaca 6661 acgtatatgt ttcctcgaaa attcctcatt atttcattaa tagtaactct gatgaaggac 6721 ttatgaaatc tgtgggtata actggacaag ggttgaaggt tctgcaagat attcgtgtct 6781 ctctgcatga tatcacgctt tatgacgaaa aaagaaatag atttacgccg gtatcgttga 6841 aaataggcga tatttacttt caagtcttgc atgaaactcc taggcaatat aaaataaggg 6901 acatgggtac tttattcgac gtaaaattca atgatgtcta cgaaattagc cgaatatttg 6961 aagtacatgt ttcgtcaata actggagtgg cagctgaatt tacagtaact tttcaggacg 7021 agagaaggtt gatttttagt agtccgaaat accttgaaat tgtgaagatg ttctattacg 7081 cacagatccg gttagaaagt gaatatgaaa tggataataa ttcgagtacc tcctccccaa 7141 attcaaacaa caaggtcaaa cagcagaaag agagaacaat tctattgtgc cacctactgt 7201 tagtatctct tattggtctg tttgatgaga gtaaaaaaat gaaaaacagt tcgtataacc 7261 taatagctgc cactgaggcg tcatttggtt tgaactttgg ctcccatttt catcgctctc 7321 ccgaggtgta cgtccccgaa gatactacaa catttttagg tgttattgga aagtctcttg 7381 cagagtctaa tccagaactc acagcctata tgtttatcta tgttttggag gcattgaaga 7441 acaacgtaat tcctcacgtt tacatccctc ataccatttg cggtttgtct tattggatcc 7501 ctaatttata ccaacatgtg tatttggctg atgatgaaga aggccccgaa aacatatctc 7561 acattttccg aattcttatc aggctctctg tgagagagac tgactttaaa gccgtataca 7621 tgcaatatgt ttggttgcta cttttagatg atggccgctt aactgacatt atcgttgatg 7681 aagttattaa tcatgcgtta gaaagagact ccgaaaaccg cgattggaag aaaacaatat 7741 cgttactgac tgtcctaccc actactgagg ttgctaataa tattattcaa aaaatattgg 7801 caaaaattag atcattttta ccgtcattga agttagaagc tatgacccaa agttggtctg 7861 aactaacaat attagttaag ataagcatcc acgttttttt tgaaacttct ttgctggtac 7921 agatgtactt accagagatc ctgtttatcg tatccttatt aattgatgtt ggtccaaggg 7981 aactcagatc atcactacac cagctattaa tgaatgtatg ccattccttg gctattaact 8041 cagctttacc acaagatcat agaaataatc tagatgaaat aagtgatata tttgcacatc 8101 aaaaggtgaa gtttatgttt gggttcagcg aggacaaagg acgaatttta cagattttta 8161 gcgcttcttc ttttgcaagc aagtttaata ttctggattt cttcatcaat aatatattat 8221 tgctgatgga atattcttca acgtacgaag caaacgtgtg gaagacaaga tacaagaaat 8281 atgtcttgga atctgtgttt acaagtaatt cttttctttc ggcacgttca atcatgattg 8341 ttggtataat gggtaaatct tacataactg aagggttatg caaggctatg ttaattgaaa 8401 ccatgaaagt tatcgccgaa ccaaagatta ctgacgagca tcttttctta gccatatctc 8461 atatttttac ttattccaaa attgttgaag gtttggatcc caaccttgac ttaatgaagc 8521 acttattttg gttttcaaca ctcttccttg aatcacgtca cccgataatt tttgagggtg 8581 cccttctctt tgtgtcaaac tgtataaggc gcctatacat ggcccagttt gaaaatgaaa 8641 gcgaaacatc attgataagt actttactta aggggagaaa gtttgctcat acctttttaa 8701 gcaagataga gaatcttagt ggtattgttt ggaatgaaga taattttaca cacattctga 8761 ttttcatcat taataaagga ctatccaatc ctttcattaa gagtacggct tttgatttct 8821 tgaagatgat gtttagaaac tcctactttg agcatcaaat caatcagaaa tctgatcatt 8881 atttgtgcta tatgttccta ttgtattttg ttttaaactg caatcaattt gaggaacttt 8941 taggtgacgt tgattttgaa ggagaaatgg ttaacattga aaacaagaac accattccta 9001 aaattttgtt agagtggttg agttcggata acgaaaatgc aaacattacc ctctatcaag 9061 gtgcgatact gttcaaatgt tcagttacgg atgaaccaag tagatttagg tttgcgttga 9121 ttattaggca tctattgaca aagaaaccca tttgtgcatt gcgtttttac agtgttattc 9181 gtaacgaaat aagaaaaata tcagcatttg agcaaaattc ggattgtgtt ccacttgctt 9241 tcgatatttt aaacttatta gtgacgcatt cagagtctaa ttcgttagaa aaacttcacg 9301 aagaatccat tgaacgtcta accaaaagag gtttatcgat tgtgacttct tctggtatat 9361 ttgcgaagaa ctccgacatg atgatacctt tagatgtaaa acctgaagat atctatgaac 9421 gtaagagaat aatgacaatg attttatcaa ggatgtcatg ttctgcttag aggtgttaca 9481 taaactaatg aaagaaatat caatatctat ctgtaagcat gaatgtacat atctcatgtt 9541 agggttttct tatcgctaat ttttcgcaat ttgttacgtg ggttgctttt atacagctac 9601 aatttttata tattctatcg tgtaatgaat ggctcagtaa attcaagcgc cacatagact 9661 aatgtacata ccaatgcatt ttaattgtaa gaataaaagg ggccattcat ctaccgtctt 9721 agttgaaagt gtttctgtga attttttcaa attccgtttt ttccttttta tataatagca 9781 tggtggcacg agcatcttcg actgaagaat gctcaccttc ttgaatggaa atttttaaaa 9841 cctccctggt taatttctt // LOCUS HUMMHDQ3L 967 bp ds-DNA PRI 02-AUG-1990 DEFINITION Human MHC class II HAL-DQ-LTR3 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33841 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 967) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA-DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4927-4931 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 961 967 inverted repeat B rpt 560 810 R region signal 792 797 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 530 536 TATA box BASE COUNT 254 a 235 c 222 g 256 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttatg tactaagaga aattcttctg ccttgagatt ctgttaatct 121 ataaccttac ccccaacccc gtgctctctg aaacatgtgc tgtgtcaact cagagttgaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaaaac tgcggaaggc 301 cgcagggacc tctgcctagg aaagccaggt attgtccaag gtttctcccc atgtgagagt 361 ctgaaatatg gcctcgtggg aagggaaaga cctgaccatc ccccagcccg acacccgtaa 421 agggtctgtg ctgaggagga ttagtaaaag aggaaggaat gcctctttca gttgagacaa 481 gaggaaggca tctgtctcct gcctgtccct gggcaatgga atgtctctgt ataaaacccg 541 attgtatgct ccatctactg agatagggaa aaactgcctt agggctggag gtgggacctg 601 cgggcagcaa tactgctttg taaagcattg agatgtttat gtgtatgcat atctaaaagc 661 acagcactta atcctttaca ttgtctatga tgcaaagacc tttgttcaca tgtttgtctg 721 ctgaccctct ccccacaatt gtcttgtgac cctgacacat ccccctcttc gagaaacacc 781 cacaaatgat caataaatac taagggaact cagaggctgg cgggatcctc catatgctga 841 acgctggttc cccgggtccc cttatttctt tctctatact ttgtctctgt gtctttttct 901 ttcctaagtc tctcgttcca ccttacgaga aacacccaca ggtgtggagg ggcaacccac 961 ccctaca // LOCUS HUMMHDQ5L 960 bp ds-DNA PRI 02-AUG-1990 DEFINITION Human MHC class II HAL-DQ-LTR5 (DQ,w8) DNA fragment, long terminal repeat region. ACCESSION M33842 KEYWORDS major histocompatibility complex. SOURCE Human (pot. haplotype DQ,w8) lung carcinoma DNA, clone LC14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 960) AUTHORS Kambhu,S., Falldorf,P. and Lee,J.S. TITLE Endogenous retroviral long terminal repeats (LTR) within the HLA-DQ locus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4927-4931 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.S.Lee, 25-APR-1990. FEATURES from to/span description rpt 1 7 inverted repeat A rpt 954 960 inverted repeat B rpt 555 803 R region signal 785 790 poly-A signal site 74 79 1/2 GRE site 80 87 enhancer core site 525 531 TATA box BASE COUNT 250 a 232 c 219 g 259 t ORIGIN Chromosome 6p21.3. 1 tgtggggaaa agaaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 61 agagactcca ttttgttctg tactaagaca aattcttctg ccttgggatg ctgttaatct 121 ataaccttac ccccaaccct gtgctctctg aaacatgtgc tgtgtcaact cggggttaaa 181 tggattaagg gcggtgcaag atgtgctttg ttaaacagat gcttgaaggc agcatgctcc 241 ttaagagtca tcaccactcc ctaatctcaa gtacccaggg acacaaacag aaggccgcag 301 ggacctctgc ctaggaaagc caggtattgt ccaaggtttc tccccatgtg acagtctgaa 361 atatggcctc gtgggaaggg aaagacctga ccgtccccca gcctgacacc cgtaaagggt 421 ctgtgctgag gaggattagt ataagaggaa ggcatgcctc ttgcagttga gacaagagaa 481 aggcatctct ctcctgtccg tccctgggca atggaatgtc tcggtataaa acccgattgt 541 atgttccatc tactgagata aggaaaaccg ccttagggct ggaggtggga catgtgggca 601 acaatactgc tctgtaaggc attgagatgt ttatgtgtat gcatatctaa agcacagcac 661 ttaatccttt accttgtcta tgatgcagag agctttgttc acgtgtttat ctgctgacct 721 tctctccact attatcttat gaccctgcca catccccctc tctgagaaac acccaaaaat 781 gatcaataaa tactaaggga actcagaggc tagcgggatc ctccatatgc tgaatgctgg 841 tcccctgggc ccccttattt ctttctctat actttgtctc tgtgtctttt tcttttctaa 901 gtctctcatt ccacctaacg agaaacaccc acaggtgtgg aggggcaacc caccccttca // LOCUS NEUAMTR 5928 bp ds-DNA PLN 02-AUG-1990 DEFINITION N.crassa mating type protein gene, complete cds. ACCESSION M33876 KEYWORDS mating type protein. SOURCE N.crassa (strain 74-ORS-A) DNA. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 5928) AUTHORS Glass,N.L., Grotelueschen,J. and Metzenberg,R.L. TITLE The Neurospora crassa A mating-type region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4912-4916 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.L.Glass, 26-APR-1990. FEATURES from to/span description pept 4121 4283 mating type protein, exon 1 4343 5046 mating type protein, exon 2 IVS 4284 4342 mating type protein intron A signal 3858 3906 GC signal signal 3945 3949 CAAT box signal 5319 5326 poly-A signal BASE COUNT 1470 a 1459 c 1526 g 1473 t ORIGIN 1 cgagaccgtt gttgcttgct gtatccatgc cggtgtcaaa gtcttgtcgt cgtatgcagg 61 agtctgaaaa ccaacccgag aagggtgggg caagtgactc tgcagtgatg tcaagactga 121 ggtccagctg ctgattgaaa tggctgatag aacagacgga ccaagactga aactgatgct 181 gagctgcgct gatggaatgt gccaagagaa tgaagctgtc cactgccggc gagcacgcct 241 agtgtgctgt gatttgagga cgggactccc tactcgtagg attgacgaga gattgaacag 301 agagccatcg acttatttgt gatgtcttgg ttgttgatca actgttgccg gctctccaaa 361 tgcgaagtcg gcgagtacga gcgttagtcc gtgaatgtgg gacagcggta gtgaatgaga 421 catgatctgg atcaatgtag tcgcaagcgt gagtaaagaa tcaggacgcc tgcttgagaa 481 ggaatcgcat ggagtcgtcc tcagtcatca tgaagtccgg gtcctggcgc cacggacagg 541 tcggtcgaca tgtcgataat gtcgataatg tggttaggct cctcccactc gaagtcgggg 601 aaagcgccga cctcctcggc ttgttgaggt tgaacaacat cgttaagtgc ggcttcttcg 661 gcagccaact gccgttcgac tcttcccaga caaagtcgag catccggcaa aattcgttgc 721 cgatcttcgg gcgagattac gcaaggatat ctcatgcgag gaggaacggg atcggtggtg 781 ggaaatcgcg gctggtcggg gtaaaggtga ggattctcag ctcgatgacg atgaacctcc 841 atttccgcca gttccctgaa gcgagctttt gccgcgggtg ctcgctgttc cacaagcccg 901 caacaatttg agctgtatag atatggttaa taaatgtcat gcaggacagg cagtttgttc 961 acatcaatat tgcgagctga aagactcgga tcttcggaga acagagtatc caacagccat 1021 tggtagtaga gaacgaattg gttgcgcggg cggctagtac cattgctttg agttaccgaa 1081 ctctgttcaa aaaagttgct gggcagagtc atgtcgatgg tatgagagct ttgctcttgc 1141 tgctcaacat tagcacctcc ttggtttgag atgcccaaga gctctcgctt ggaatggtgg 1201 gagttcgcgc tggaatgtct ggttagcttg agcaatgggg gcccaatgtt tggtgaactt 1261 acaagggggc gaaactgcga gtatgtccca gtttccccat tccatcatca tgagcccaaa 1321 tgtgatcgtg cagatcgcga tgctggactc gtcgggggca accatgagca aggcctcttc 1381 gccaaacacg acactgaaag gagtcagcta tgagctataa gagaaacttt cctcgggcca 1441 acactcacac tgagttgtcc atcgcataga caagatcctc ttcagcaaat tcggccagat 1501 gggacctgaa cagcattacc tggatcctgc catagtgaat tgcagtcaca gggctgagac 1561 cgggtgcgat gtcgctgatt gaatcaacgt ctaaggcaga cattgtgata gaggggtgca 1621 gacggcgact acaggtgtgc ttggatgtgg ttatggaatg gatgggacag acgaagtgta 1681 agaagattga cgtatatgaa gatgaatgac aacgaggacc ggtagttggt ggaaaacgga 1741 attgtcgagt gttgagtttg gaggaaggaa gagggggtat ttgcgagaat ttgagccggt 1801 atttgtaggt gatacgacaa tctgctctgc gtgggttaat gtcaaggtga atgcaggaaa 1861 ggcccaatac ctcccgcagc tcgtcctcct attgttcgcg ggaaagggta cgcattttac 1921 tattgtttct gtggcttgcc agctggcgca ccttatgtga ttggtcaaat tgacgtttgc 1981 cctaaggtcg gccgggagaa caataggaag gacttgggat gaaatttggc atacgatgcc 2041 cctcaaatcg gcgagtgacc ttggctgatt ctcacaggag aacaatagga ataacttggg 2101 atgaatctca gcatgcagtg cccctcgtca agtaatctcc acctcaagtt tcacaggaga 2161 acaataggaa ggacctggat tggaaacctg ccaggcaatg tccctcgaaa gatattttgg 2221 aaccctgtgt ctttgttggt tcacttcttc gaaactccgt gtcaacaaaa cttctctcca 2281 tacttagcag tcgcatggca gctttctcaa gcgttcattg ttgaggtttc cttttcgtca 2341 gctgtcgaca tgaatcttct caacatgcaa cctaaaaggt cagagcaacc agctatgttc 2401 gaagaaaacc gtgcctctag ccaggaaggc caggatctcg aagtgatgta caaggtagca 2461 attcttctga cccggaaaca ctcgcttgct tgtcgctaat ggattggtca gaaactccat 2521 cagctacagg ctaggctttc ccgttcagtt ctttcagagg caatcaagga gttcgaagag 2581 aacttcggtg tcttttccat gaagccaagc tcttgctatg ctcaacgagt tcgaagtatc 2641 gccaaagctg gttcgggtct agcaacgagt tcggatctag cgacgagaga agaatcatca 2701 agacatcatg ctgcatcatt gagtcgacaa acacaattct taacttcctc tcatttcttg 2761 agaagaatcg aggattgcca ttcggtggag atcaaagact ccaacaagct gcctacaaag 2821 gccagcagtt tgcgttccgc ctccttcgct cacttacact tcacaaagct gctcaggagg 2881 ttccgggaaa ggactttggc ttggtctacg gaaaagatgt gtacgtactg aatggacata 2941 ttttgcacag gtcgaagcaa gagatcgtgg ggcaggcggg aggaagaaac tggcatgtcg 3001 accataccct ccatcctttg aggcgcgttc caggcacccc atggcacaag ttctttggca 3061 atcttgaagt tggcgacgac aagcaacttc gcctcttcga tgatgatgcg gccgtcgaca 3121 gttaccgagt cggtcctcag aagttctttg tggttattcc ggaaactgct gaatttattt 3181 tggacgaagt cagcagcgag catcagagag tcgctacaat tcacacagag gtaagtactt 3241 gaacgtgtct gaaaactaca aaatttgcac gactgactga aggtagaatg gacatgtcca 3301 gccgccagca ccgacatcca ttcagcaaga agtaagttct cctatctcga tttaatgtag 3361 gtaatcatca ctgacatcac ggcaggctct cctcaggaag ttggactttg ccatgacaac 3421 atcattgcct ggttatgttg tagaaggaca acctgagatt gtgtttcatc atgaacgtta 3481 cgccaggttc gtatgatcct gcttactttt cacggatgat gatgtgctaa caaccgatca 3541 acagatcccc gttgactaca gtcaggagcg cccacttagc attctctccc atgttttcac 3601 tcgacccgca ctttggggag agggtttgga gcttgctgat cacttcgacc cgcgagacgg 3661 tgtgcagcaa gaggagcaca tctattacat ttgatggata tggtagaatc cgtggctgca 3721 caaacaatgc tacttttaat ttaagaaaag tattattcga tcagagtggc tttacttttt 3781 tcttagaagt tcaacaaagc tgttatgtgt tatgtaatcc aagccctcgc tgaaagttgt 3841 gcccccaagg cagcaagccc cccccccccc cccccccccc ccccaccccc ctccctcctc 3901 tcccccgcgg tcgtcaagtg aagggagaga gaagccgctc cacccaaatt aaccaaccaa 3961 ccccatgtct cctatttaag aaagcccagt tcatcttttc caccttcacc caaacttccc 4021 accatctttc cccgaacatc aacttcgcaa ccaaaatctc ggcagcacta cctcacgtgt 4081 tcagtgctct ccaatcaata atccatccac cagaaacacg atgtcgggtg tcgatcaaat 4141 cgtcaagacg ttcgccgacc tcgctgagga cgaccgtgaa gcggcaatga gagctttctc 4201 aaggatgatg cgtagaggta ccgaacctgt tcgccgaatc cccgcggcaa agaagaaggt 4261 caacggcttc atgggtttca gatgtgagtc aaatctgaat caacattgtc gttgatccat 4321 ggctgattgc tcttcatttc agcgtactat tccccgctct tctctcagct cccgcaaaag 4381 gagagatcgc ccttcatgac tattctctgg cagcatgatc ccttccacaa tgagtgggat 4441 ttcatgtgct cggtgtattc gtcaatccgg acctaccttg agcaggagaa ggttactctg 4501 caactctgga ttcactatgc tgtcggccat ctgggagtga ttatccgcga caactacatg 4561 gcatcctttg gctggaacct cgtccgtttt cccaacggca ctcacgacct cgagcgcacg 4621 gctcttcctt tggttcagca caatctccag cccatgaacg gcttatgcct gctcaccaag 4681 tgcctcgaga gcggattgcc tcttgccaat cctcactctg tcatcgccaa gctttcagat 4741 cctagctacg acatgatctg gttcaacaag cgtcctcacc gtcagcaggg acacgccgtt 4801 caaactgatg aatctgaagt tggagtttcg gcgatgttcc ctcgcaatca cacggtcgct 4861 gcagaggtag atggcatcat caatcttcct ctctcccatt ggattcagca gggagaattc 4921 ggtaccgagt ctggatactc agctcagttt gagaccttgt tggattcaat tctcgagaat 4981 ggacacgcct ccagcaatga cccttacaac atggctctgg ctatcgatgt tcccatgatg 5041 ggttagtgga agatgaggta ccatcttgca aaactttacc cgtgtgctaa ccgattaaca 5101 ggatttaacg gaggagcata gaagcacggc gcagtcaccg ttttctttcc ttgtcacatc 5161 tggatttcgt gttacgggca tacaaagcga gggcgaaaag ggtctagtta ggtttctttg 5221 tgcatacatt gggcaatcat gagacttcag aatcgacggg gtggaatggg caattacacg 5281 gcaaggagac aggtacgcct agaaggcgaa agagtatcaa ataaaatcaa atcagcggcg 5341 tccaccatct gatccgggat ggccttcact actcgggggt tgcggttcgc ttttgtatgg 5401 ggagaggggg gaaaaagttt ggccagccaa aagcgacccg aatggaaccc tagtcaatca 5461 atacctatga acgcaagcgt ctgcggtgtc attgccggat ttgacatgtc gttgagataa 5521 agaaacaggc ccgccgctga cggcaacgct tatgcatgca accccgctgc gctgaatgct 5581 tcagccgcaa aactggggca atgcgggagc tgtggccccc gttcatgcta gtgtacaggg 5641 ttgctctgct tctaagatcc tgataagggt ccgctgatgt ttgtacatac tacatatcag 5701 tccctgtaag tttgctagtc tggttcctgc cccatatttt cttccaaggg ggtaatatgg 5761 ggactgtaag gcggactggt ctatctacga gtccgggtcc ccgcaggaac tgtacccttc 5821 agtgggtccc ggtcacgtat cctgcacgtt ccgtctcggc caggaatggc agctttcccc 5881 gttgattttc ggtttatcat cacataaagg ttttggttgc ttgtcgac // LOCUS PFARGEB 2126 bp ds-DNA INV 02-AUG-1990 DEFINITION P.falciparum 18S ribosomal RNA not in asexual parasites. ACCESSION M19173 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.falciparum DNA, clone B. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 2126) AUTHORS McCutchan,T.F., de la Cruz,V.F., Lal,A.A., Gunderson,J.H., Elwood,H.J. and Sogin,M.L. TITLE Primary sequences of two small subunit ribosomal RNA genes from Plasmodium falciparum JOURNAL Mol. Biochem. Parasitol. 28, 63-68 (1988) STANDARD simple staff_review REFERENCE 2 (bases 1 to 2126; revises [1]) AUTHORS Sogin,M.L. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Computer-readable copy of sequence for [2] kindly submitted by M.L.Sogin, 01-AUG-1990. SOGIN@EVOL1.MBL.EDU BASE COUNT 704 a 296 c 412 g 714 t ORIGIN 1 aacctggttg atcttgccag tagtcatatg cttgtctcac agattaagcc atgcaagtga 61 aagtacatat gtaagtaata tattgaaact gcgaacggct cattaaaaca gttataatct 121 acttgatgtt tttaatataa ggataactac ggaaaatctg tagctaatac ttgtgaaaat 181 accttttgat atatacatat gtatgtatca ttaaggtatg tatttgttag atataagaat 241 aaaataaata taattgaatt ataacaaaga agaaacacat aaaatatgtg tattatcaat 301 cgagtatctg acctatcagc ttttgatgtt agggtattga cctaacatgg ctatgacggg 361 taacggggaa ttagagttcg attccggaga gggagcctga gaaatagcta ccacatctaa 421 ggaaggcagc aggcgcgtaa attacccaat tctaaaaaag agaggtagtg acaagaaata 481 acaatacaat atcgaaaaat gattttgtaa ttggaatgat aggaatttac aaggttccta 541 gagaaacaat tggagggcaa gtctggtgcc agcagccgcg gtaattccag ctccaatagc 601 atatattaaa attgttgcag ttaaaacgtt cgtagttgaa tattaaagaa tccgatgttt 661 catttaaact ggtttgggaa aaccaaatat attatatatt ttgctttgtt caaaataagg 721 ttttctaata aattatgttt ttatcagata tgacagaatc ttttttaaaa tctcttcaat 781 atgcttttat tgcttttgag aggttttgtt actttgagta aaattaagtg ttcataacag 841 acgggtagtc atgattgagt tcattgtgtt tgaatactac agcatggaat aacaaatatg 901 aataagctaa ttattttttt tttcattatt ttttttgata ttcttattag cttagttacg 961 attaatagga gtagctttgg gggcattcgt attcagatgt cagaggtgaa attctaagat 1021 tttctggaga cggactactg cgaaagcatt tgcctaatct atttccatta atcaagaacg 1081 aaagttaagg gagtgaagac gatcagatac cgtcgtaatc ttaaccataa actataccga 1141 ctaggtgttg gatgaatata aaaaatatat aaatatgtag catttcttag ggaatgttga 1201 ttttatatta gaattgcttc cttcagtacc ttatgagaaa tcaaagtctt tgggttctgg 1261 ggcgagtatt cgcgcaagcg agaaagttaa aagaattgac ggaagggcac caccaggcgt 1321 ggagcttgcg gcttaatttg actcaacacg ggaaaactca ctagtttaag acaagagtag 1381 gattgacaga ttaatagctc tttcttgatt tcttggatgg tgatgcatgg ccgtttttag 1441 ttcgtgaata tgatttgtct ggttaattcc gataacgaac gagatcttaa cctgctaatt 1501 agcggtaagt acactatatt tttatttgaa attgaatata ggtaattata catgtttatt 1561 cagtgttcaa attaggatat tttttttatt aaaatattct tttccctgtt ttactaataa 1621 tttgtttttt tttactctat ttctctcttc ttttaagaat gtacttgctt gattaaataa 1681 agcttcttag aggaacagtg tgtatctaac acaaggaagt ttaaggcaac aacaggtctg 1741 tgatgtcctt agataaacta ggctgcacgc gtgctacaat gatatatata acaagttgtt 1801 aaaaatgtac ttataaataa gtgtgtacaa tttttcctgt actgaaaagt ataggtaatc 1861 tttatcagta tatatcgtaa ttgggataga ttattgcaat tattaatctt gaacgaggaa 1921 gcccgtcgct cctaccgatt gaaagatatg ataaattgtt tggatatgaa ttaaaataat 1981 agaagtcgta acaaggtttc cgtaggtgaa cctgcggaag gatcattaat atatgaaacg 2041 ccgagctcga attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc 2101 acaattccac acaacatacg agccgg // LOCUS ALCRR45S 103 bp ss-rRNA RNA 02-AUG-1990 DEFINITION A.tuberosum chloroplast 4.5S ribosomal RNA. ACCESSION M35406 KEYWORDS 4.5S ribosomal RNA. SOURCE A.tuberosum chloroplast young leaf rRNA. ORGANISM Chloroplast Allium tuberosum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Liliidae; Liliales; Liliaceae; Allium tuberosum. REFERENCE 1 (bases 1 to 103) AUTHORS Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L. TITLE The nucleotide sequences of chloroplast 4.5 S rRNAs from four species of plants, celery (Apium graveoleus), barley (Hordeum vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina communis) JOURNAL FEBS Lett. 200, 193-196 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 103 4.5S ribosomal RNA BASE COUNT 29 a 18 c 31 g 25 t ORIGIN 1 taaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtctagtgg aagtgcagtg 61 atgtatgcag ctgaggcatc ctaatagacc gagagatttg aac // LOCUS APIRR45S 103 bp ss-rRNA RNA 02-AUG-1990 DEFINITION A.graveoleus chloroplast 4.5S ribosomal RNA. ACCESSION M35404 KEYWORDS 4.5S ribosomal RNA. SOURCE A.graveoleus chloroplast young leaf rRNA. ORGANISM Chloroplast Apium graveoleus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Apiales; Apiaceae; Apium graveoleus. REFERENCE 1 (bases 1 to 103) AUTHORS Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L. TITLE The nucleotide sequences of chloroplast 4.5 S rRNAs from four species of plants, celery (Apium graveoleus), barley (Hordeum vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina communis) JOURNAL FEBS Lett. 200, 193-196 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 103 4.5S ribosomal RNA BASE COUNT 28 a 19 c 32 g 24 t ORIGIN 1 gaaggtcacg gtgagacgag ccgtttatca ttacgatagg tgtctagtgg aagtgcagtg 61 atgtatgcag ctgaggcatc ctaacagacc ggcagatttg aac // LOCUS BLYRR45S 96 bp ss-rRNA RNA 02-AUG-1990 DEFINITION H.vulgare chloroplast 4.5S ribosomal RNA. ACCESSION M35405 KEYWORDS 4.5S ribosomal RNA. SOURCE H.vulgare chloroplast young leaf rRNA. ORGANISM Chloroplast Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Hordeum vulgare. REFERENCE 1 (bases 1 to 96) AUTHORS Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L. TITLE The nucleotide sequences of chloroplast 4.5 S rRNAs from four species of plants, celery (Apium graveoleus), barley (Hordeum vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina communis) JOURNAL FEBS Lett. 200, 193-196 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 96 4.5S ribosomal RNA BASE COUNT 29 a 17 c 30 g 20 t ORIGIN 1 taaggtagcg gcgagacgag ccgtttaaat aggtgtcaag tggaagtgca gtgatgtatg 61 cagctgaggc atcctaacga accgaacgat ttgaac // LOCUS CEVCG 371 bp ds-DNA circular VRL 02-AUG-1990 DEFINITION Citrus exocortis viroid RNA sequence, complete genome. ACCESSION M34917 KEYWORDS complete genome. SOURCE Citrus exocortis viroid (strain A) RNA. ORGANISM Citrus exocortis viroid Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 371) AUTHORS Visvader,J.E., Gould,A.R., Bruening,G.E. and Symons,R.H. TITLE Citrus exocortis viroid: Nucleotide sequence and secondary structure of an Australian isolate JOURNAL FEBS Lett. 137, 288-292 (1982) STANDARD simple staff_review BASE COUNT 72 a 112 c 112 g 75 t ORIGIN 1 cgggatcttt cttgaggttc ctgtggtgct cacctgaccc tgcaggcagg aaaagaaaaa 61 agaggcggcg ggggaagaag tccttcaggg atccccgggg aaacctggag gaagtcgagg 121 tcggggggga cagctgcttc ggtcgccgcg gatcactggc gtccagcgga gaaacaggag 181 ctcgtctcct tcctttcgct gctggctcca catccgatcg tcgctgaagc gcctcgcccc 241 ctcgcccgga gcttctctct ggagactacc cggtggaaac aactgaagct tcaaccccaa 301 accgcttttc ttgtatcttc actgctctcc gggcgagggt gaaagccctc ggaaccctag 361 attgggtccc t // LOCUS COMRR45S 103 bp ss-rRNA RNA 02-AUG-1990 DEFINITION C.communis chloroplast 4.5S ribosomal RNA. ACCESSION M35407 KEYWORDS 4.5S ribosomal RNA. SOURCE C.communis chloroplast young leaf rRNA. ORGANISM Chloroplast Commelina communis Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Commelinales; Commelinaceae; Commelina communis. REFERENCE 1 (bases 1 to 103) AUTHORS Zhen-Qi,C., Hong,Z., Guo-Ya,L. and Xiao-Yang,L. TITLE The nucleotide sequences of chloroplast 4.5 S rRNAs from four species of plants, celery (Apium graveoleus), barley (Hordeum vulgare), Chinese chive (Allium tuberosum) and dayflower (Commelina communis) JOURNAL FEBS Lett. 200, 193-196 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 103 4.5S ribosomal RNA BASE COUNT 31 a 18 c 31 g 23 t 0 others ORIGIN 1 taaggtagcg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagta 61 atgtatgcag ctgaggcatc ctaacagacc gagagatttg aac // LOCUS HUMBKM 615 bp ds-DNA PRI 02-AUG-1990 DEFINITION Human banded krait minor satellite (Bkm) repeat. ACCESSION M35828 KEYWORDS banded krait minor satellite repeat. SOURCE Human DNA, clone lambda-X13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 615) AUTHORS Erickson,R.P., Ross,C.E., Gorski,J.L., Stalvey,J.R.D. and Drumm,M.M. TITLE Bkm sequences from the human X chromosome contain large clusters of GATA/GACA repeats JOURNAL Ann. Hum. Genet. 52, 167-176 (1988) STANDARD simple staff_review BASE COUNT 258 a 90 c 87 g 180 t ORIGIN X chromosome. 1 agctttggtt caatttttat ggtttttatg gaaggtacaa agttactaat gcaccacccc 61 acagtaagca ccagccccac atggtttcat agagaaatat tagaaaatct tcaaagattg 121 ggtagttgtg atgctacgaa aacttttcca aaacaaactc ctactgaagt tttatgaaat 181 actaaaatat tgatacttta acctaataga tagtacaata aaacctacat ataattgcac 241 ttatggatat agatgtacta attaaaatac tagcaaacat aacccaacac tgcattcaga 301 tataacatac catgaccaaa agaacttcat tctattaatg taaaaaaaat cacattaata 361 gaattaaggg gaaaacatat gattatctac agtgatgcta aatgaaacca acaatattta 421 aaattattct gataagaccc tcaagaactt gaattgatgg agattcaact atatatatat 481 aaatatatgt ttataaatat agatagacag atagatagat agacagatag atagatagat 541 agatagatag atagatagat agatagatat agatatagat actttaatcc taaaggcaag 601 atcttaccta ctgca // LOCUS HUMBKMX 206 bp ds-DNA PRI 02-AUG-1990 DEFINITION Human banded krait minor satellite (Bkm) repeat. ACCESSION M35829 KEYWORDS banded krait minor satellite repeat. SOURCE Human DNA, clone lambda-X81. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 206) AUTHORS Erickson,R.P., Ross,C.E., Gorski,J.L., Stalvey,J.R.D. and Drumm,M.M. TITLE Bkm sequences from the human X chromosome contain large clusters of GATA/GACA repeats JOURNAL Ann. Hum. Genet. 52, 167-176 (1988) STANDARD simple staff_review BASE COUNT 93 a 25 c 39 g 49 t ORIGIN X chromosome. 1 agtctacaga gctgcaagaa caccacatca taaatcttac gatttgaatg acatcgatag 61 atagatagat agatagatag atagatagat agatagatag atagatagat agatagatag 121 atatagatag atacatacat acatacatac acacatagaa atagagatat cctgatggaa 181 agaataaacc atgagtctgg taatag // LOCUS MENRR5S 128 bp ss-rRNA BCT 02-AUG-1990 DEFINITION M.tindarius 5S ribosomal RNA. ACCESSION M34910 KEYWORDS 5S ribosomal RNA. SOURCE M.tindarius (strain DSM 2095) rRNA. ORGANISM Methanolobus tindarius Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanomicrobiales; Methanosarcinaceae. REFERENCE 1 (bases 1 to 128) AUTHORS Willekens,P., Stetter,K.O., Vandenberghe,A., Huysmans,E. and De Wachter,R. TITLE The structure of 5S ribosomal RNA in the methanogenic archaebacteria Methanolobus tindarius and Methanococcus thermolithotrophicus JOURNAL FEBS Lett. 204, 273-278 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 128 5S ribosomal RNA BASE COUNT 32 a 33 c 34 g 29 t ORIGIN 1 ggagtttggc ggccatagcg gcggggcaat tcctgtaccc atcccgaaca cagaagataa 61 gcccgcctgc gttctatact gtactaaagt atgagagtct ttgggaaata tggaacgctg 121 ccatctcc // LOCUS MUSSPIX 980 bp ds-DNA ROD 02-AUG-1990 DEFINITION Mouse protease inhibitor gene, 5' flank. ACCESSION M35824 KEYWORDS protease inhibitor gene. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 980) AUTHORS Needham,M., Mills,J.S. and Parker,M.G. TITLE Organization and upstream DNA sequence of the mouse protease inhibitor gene JOURNAL Nucleic Acids Res. 16, 6229-6229 (1988) STANDARD simple staff_review FEATURES from to/span description mRNA 964 > 980 protease inhibitor mRNA BASE COUNT 324 a 200 c 208 g 248 t ORIGIN 1 taaacagaaa ttgtcttgtg ctgttaggaa agaacatcaa actcttttta ttaagtccct 61 agttgtgtaa gagagcagta gccccatctg agtgttgctg ggaaatatgt gcctttattt 121 ttctagagag aaaaacaaaa ctaagctaaa ctatacaaaa caaaaacaac ttacactttg 181 tgataatggg agacattgct tcattggtca atactgttta ttatacaaac accctgggaa 241 agataatctg tctgaaacaa ggtgacagag ttttttctac tgggcaagac aggcagaaac 301 atgaaagaaa ctagagaccc agggtttggg tagatactca gtacacattg gttaataaag 361 aatcaatcct ttggattatt tactaccatt attttaattt tccaatgcat aagttaggca 421 ataggaaagt gaaccaattt cctagaggca tgcagctaat ctggtcagta cacacagccc 481 cgggacacag gagacagaac tgaactctcc ttatcttgta ccccaccctg gcaccagcct 541 atcaggactt aaggaataca cagcagataa agagacaacg gggggggggg gtgaaggacc 601 aagaattgga gcagaagaat ggaagaaaag gaagaatgga aatcctgttt gagttctgca 661 tgtctgccca tccattcaga actactgaag agacaaagga gacctttgac ttcccaaaca 721 gagaagaaaa cctcacccca ggaaggcttt cctaaaagag gatgtcagga aatgcagcta 781 atttgctttt cctggaagaa tttgggtcca aagttttctg cccttcttcc tacctaggtt 841 acattttccc cagcctggag gccacacaag tcaatcaata accaaagata cctattatag 901 ggcacagtgg gtggagcctg tattaccatc tgcctatatg accactcctc agtttctgaa 961 gagaagcacc ctgtatagtt // LOCUS MVTRR5S 120 bp ss-rRNA BCT 02-AUG-1990 DEFINITION M.thermolithotrophicus 5S ribosomal RNA. ACCESSION M34911 KEYWORDS 5S ribosomal RNA. SOURCE M.thermolithotrophicus (strain DSM 2095) rRNA. ORGANISM Methanococcus thermolithotrophicus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanococcales; Methanococcaceae. REFERENCE 1 (bases 1 to 120) AUTHORS Willekens,P., Stetter,K.O., Vandenberghe,A., Huysmans,E. and De Wachter,R. TITLE The structure of 5S ribosomal RNA in the methanogenic archaebacteria Methanolobus tindarius and Methanococcus thermolithotrophicus JOURNAL FEBS Lett. 204, 273-278 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 120 5S ribosomal RNA BASE COUNT 30 a 33 c 31 g 26 t ORIGIN 1 tgatataacg gtcatagcgg aggagccaca tccgatcccg tcccgatctc ggaaattaag 61 ccctccagcg ataccctaag tactgccatt tggtgggaac agggtgacgc tgttgatcac // LOCUS SF6LYZ 996 bp ds-DNA PHG 02-AUG-1990 DEFINITION Bacteriophage SF6 fragment D lysozyme gene, complete cds. ACCESSION M35825 KEYWORDS lysozyme. SOURCE Bacteriophage SF6 DNA. ORGANISM Bacteriophage SF6 Viridae; ds-DNA viruses. REFERENCE 1 (bases 1 to 996) AUTHORS Verma,M. TITLE Molecular cloning and sequencing of lysozyme gene of bacteriophage SF6 of Bacillus subtilis JOURNAL Curr. Microbiol. 13, 299-301 (1986) STANDARD simple staff_review FEATURES from to/span description pept 13 963 lysozyme BASE COUNT 284 a 223 c 243 g 246 t ORIGIN 1 ctgcagatcg taatggatcc tcgcctacgt gaagaagtag tacggctgat aatcgcatta 61 acgagtgata atggagcatc actgtcaaaa cggcttcaat caagggtctc ggcgctcgag 121 aagacgtctc aaatacactc tgatactatc ctccggatca cccagggact cgatgatgca 181 aacaaacgaa tcatcgctct tgagcaaagt cgggatgact tggttgcatc agtcagtgat 241 gctcaacttg caatctccag attggaaagc tctatcggag ccctccaaac agttgtcaat 301 ggacttgatt cgagtgttac ccagttgggt gctcgagtgg gacaacttga gacaggactt 361 gcagacgtac gcgttgatca cgacaatctc gttgcgagag tggatactgc agaacgtaac 421 attggatcat tgaccactga gctatcaact ctgacgttac gagtaacatc catacaagcg 481 gatttcgaat ctaggatatc cacgttagag cgcacggcgg tcactagcgc gggagctccc 541 ctctcaatcc gtaataaccg tataaccatg ggattaaatg atggactcac gttgtcaggg 601 aataatctcg ccatccgatt gccaggaaat acgggtctga atattcaaaa tggtggactt 661 cagtttcgat ttaatactga tcaattccag atagttaata ataacttgac tctcaagacg 721 actgtgtttg attctatcaa ctcaaggata ggcgcaactg agcaaagtta cgtggcgtcg 781 gcagtgactc ccttgagatt aaacagtagc acgaaggtgc tggatatgct aatagacatg 841 tcaacacttg aaattaattc tagtggacag ctaactgtta gatcgacatc cccgaatttg 901 aggtatccga tagctgatgt tagcggcggt atcggaatga gtccaaatta taggtttagg 961 tgaggatcag accaccccgc ggcactgggg ctgcag // LOCUS SYESC6 1071 bp ss-RNA VRL 02-AUG-1990 DEFINITION Sonchus yellow net virus M1 protein gene, complete cds. ACCESSION M35689 KEYWORDS M1 protein. SOURCE Sonchus yellow net virus RNA, clones pGL[9,32]. ORGANISM Sonchus yellow net virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Phytorhabdovirus. REFERENCE 1 (bases 1 to 1071) AUTHORS Hillman,B.I., Heaton,L.A., Hunter,B.G., Modrell,B. and Jackson,A.O. TITLE Structure of the gene encoding the m1 protein of sonchus yellow net virus JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.I.Hillman, 28-JUN-1990. FEATURES from to/span description pept 72 932 M1 protein mRNA 1 1071 M1 mRNA BASE COUNT 403 a 197 c 235 g 236 t ORIGIN 1 aacttaaaat aataataata tcaaataatc aagcagacat aaaaacaggg ttatctctat 61 tgtatttcag aatggcaggt atatacgcag tttcaatcaa gggccatgca tcggcgatat 121 tcaaccgaca agagaaggaa atcagcacag gacgtgtttg ggaagttatg aagaagataa 181 tgtccttaaa gcctactagg gtcattatgt catattctct ccttagatca gcgttggata 241 aaagcaggca acttacccaa gaagaataca atattatgca attgatccta gacggctgtg 301 tcaaaaccct tgagcctgtt gcagccagcg ggatatgcat agatgtgaac ctgggaaaat 361 gcaccaagca cacaatccca tttggcatca caaataacga cgtaggtcac gtgtcagtag 421 tgatgacatt accatttttg gaagaaggat gttataacat aggcgcatgt tttgatggaa 481 gattgtccaa gtcaaggagc gacgcaagcc attatgctgt tgatgtaagt ctggaaattt 541 acttaaagtc attatccaga gacgaagctg aagaacaaat atccaaggga acatcagtgt 601 accctttcaa aattaatcat cctacatact ttgaagatga gactgataca tcagatggag 661 aaagtcttag tggacgagca agctcagatg atgggccaga agacggaggc cacggacatg 721 gtgataaaaa caatgagaaa aactctggga aggtggtaag gaaaaggaaa tcaaggaaag 781 agatcgatgt aggaaggttc aagatggtta aggataatat cataaacacc agatccggac 841 tgctaaaatc catgagagga acgggacatc ggaaacacag aacgcaggag atcacagaag 901 gatacaacta cggagacaag gacgcagaat gaaccaaccc accaaaagca gaaaacttgc 961 tactatataa taatatgaag agcccacaat aagatcacac aatgtcaaac agcacaactc 1021 caaatccatt aggtaatacc tgtattattc cgtgcactta tataagaaaa a // LOCUS BLYCPPSBEF 681 bp ds-DNA ORG 02-AUG-1990 DEFINITION Barley chloroplast cytochrome b-559 apoprotein (psbEF) genes, complete cds. ACCESSION M35616 KEYWORDS cytochrome b-559 apoprotein; psbE gene; psbF gene. SOURCE Barley chloroplast DNA, clones pDS6 and pDS6-RBSII. ORGANISM Chloroplast Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Hordeum vulgare. REFERENCE 1 (bases 1 to 681) AUTHORS Krupinska,K. TITLE Characterization and in vitro expression of the cytochrome b-559 genes of barley. II. in vitro transcription and translation JOURNAL Carlsberg Res. Commun. 53, 233-246 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 139 390 cytochrome b-559 9.4 kDa protein (psbE) pept 401 520 cytochrome b-559 4.5 kDa protein (psbF) binding 129 132 psbE ribosomal binding site (put.) binding 390 394 psbF ribosomal binding site (put.) BASE COUNT 202 a 131 c 123 g 225 t ORIGIN 1 cgactcaaaa tacacctttg gtacaaaatt gacaatctca caaggatgaa atatcagtaa 61 ttttctattt actggttgat cccatctttt acggaatcaa ttcctttttt gaatgtacaa 121 aaattttggg agttcagcat gtctggaagc acgggagaac gttcttttgc tgatattatt 181 accagtattc gatactgggt tattcatagc attactatac cttccctatt cattgcgggt 241 tggttatttg tcagtacggg tttagcttat gacgtgtttg gaagtcctag gccaaacgag 301 tatttcacgg aaagccgaca aggaattccg ttaataaccg accgttttga ttctttagaa 361 caactcgatg aatttagtag atccttttag gaggccctca atgaccatag atcgaaccta 421 tcctattttt acagtgcgat ggctggctat tcacggacta gctgtaccta ctgttttttt 481 cttgggatca atatcagcaa tgcagttcat ccaacgataa accaaattcc aactatagaa 541 ctatgacaca atcaaacccg aatgaacaaa atgttgaatt gaatcgtacc agtctatact 601 ggggtttatt actcattttt gtacttgctg ttttattttc caattacttc ttcaattgag 661 agaaagaaag agactaacaa g // LOCUS BOVIL1BA 944 bp ds-DNA MAM 02-AUG-1990 DEFINITION Bovine interleukin-1 beta mRNA, complete cds. ACCESSION M35589 KEYWORDS interleukin-1 beta. SOURCE Bovine alveolar macrophage, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 944) AUTHORS Leong,S.R., Flaggs,G.M., Lawman,M. and Gray,P.W. TITLE The nucleotide sequence for the cDNA of bovine interleukin-1 beta JOURNAL Nucleic Acids Res. 16, 9054-9054 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 45 845 interleukin-1 beta precursor sigp 45 383 interleukin-1 beta signal peptide matp 384 842 interleukin-1 beta BASE COUNT 270 a 234 c 228 g 212 t ORIGIN 1 attctctcca gccaaccttc attgcccagg tttctgaaac agccatggca accgtacctg 61 aacccatcaa cgaaatgatg gcttactaca gtgacgagaa tgagctgtta tttgaggctg 121 atgaccctaa acagatgaag agctgcatcc aacacctgga cctcggttcc atgggagatg 181 gaaacatcca gctgcagatt tctcaccagt tctacaacaa aagcttcagg caggtggtgt 241 cggtcatcgt ggccatggag aagctgagga acagtgccta cgcacatgtc ttccatgatg 301 atgacctgag gagcatcctt tcattcatct ttgaagaaga gcctgtcatc ttcgaaacgt 361 cctccgacga gtttctgtgt gacgcacccg ttcagtcaat aaagtgcaaa ctccaggaca 421 gagagcaaaa atccctggtg ctggctagcc catgtgtgct gaaggctctc cacctcctct 481 cacaggaaat gaaccgagaa gtggtgttct gcatgagctt tgtgcaagga gaggaaagag 541 acaacaagat tcctgtggcc ttgggtatca aggacaagaa tctatacctg tcttgtgtga 601 aaaaaggtga tacgcccacc ctgcagctgg aggaagtaga ccccaaagtc taccccaaga 661 ggaatatgga aaagcgcttt gtcttctaca agacagaaat caagaataca gttgaatttg 721 agtctgtcct gtaccctaac tggtacatca gcacttctca aatcgaagaa aggcccgtct 781 tcctgggaca ttttcgagct ggccaggata taactgactt cagaatggaa accctctctc 841 cctaaagaaa gccataccca gggagtccac gtgggctgaa taaccccgag gactggcaga 901 agggaaggga agaatgtagc tgcagcctga acttcactgt tgtc // LOCUS ECOK88ADA 744 bp ds-DNA BCT 02-AUG-1990 DEFINITION E.coli K88ad subunit protein gene, partial cds. ACCESSION M35637 KEYWORDS K88ad subunit protein. SOURCE E.coli (K12 strain 5K) DNA, clones pFM205, pBad, pBac. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 744) AUTHORS Gaastra,W., Klemm,P. and de Graaf,F.K. TITLE The nucleotide sequence of the K88ad protein subunit of porcine enterotoxigenic Escherichia coli JOURNAL FEMS Microbiol. Lett. 18, 177-183 (1983) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 744 K88ad subunit protein (AA at 1) BASE COUNT 192 a 137 c 205 g 210 t ORIGIN 1 aatggttcgg tcgatatcgg tggtagtatc actgcagatg attatcgtca gaaatgggaa 61 tggaaagttg gtacaggtct taatggattt ggtagtgtat tgaatgacct gaccaatggt 121 ggaaccgaac tgaccattac tgttactggt aataagccaa gtttgttagg ccgaaccaga 181 gaggcatttg ctacgccagt ggttggtggt gtagatggaa ttcctcatat tgcatttact 241 gactatgaag gagcttctgt agaactcaga aaccctgatg gtgaaattga aaaaggttta 301 gcatattttg ttctgccgat gaaaaatgca gagggcacta aagttggttc agtgaaagtg 361 aatgcatctt atgccggtgc gctcgggaga ggtggggtta cttctgcgga cggggagctg 421 atgtcgcttt ttgccgaggg gtcgcacgct atcttttatg gtggtttgcc gacgaatgtt 481 cagaattctg cactcccggg tgggagtgcc gcagcggcgc gcacagagtt gtttggaagt 541 ctatcaaaaa atgatattct cggacagatt caaagagtaa acgcaaatat tacttctctt 601 gttaacgtcc caggttcttt caatgaaaac atggcgtaca ctgatggatc cgttgtttcc 661 gttgcctatg cactgggtat tgcaaaccgg cagactattg aggcaacttt taatcaggct 721 gtaactacca gcactcagtg gagc // LOCUS HUMTNFTRP 1379 bp ss-mRNA SYN 02-AUG-1990 DEFINITION Synthetic human tumor necrosis factor mRNA, complete cds. ACCESSION M35592 KEYWORDS tumor necrosis factor. SOURCE Altered human leukemic B-cell line Ball-1, cDNA to mRNA, clone pM324-346. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 1379) AUTHORS Nobuhara,M., Kanamori,T., Nagase,Y., Nii,A., Morishita,H., Tohyama,J., Andoh,S. and Kurimoto,M. TITLE The expression of human tumor necrosis factor in E.coli JOURNAL Nucleic Acids Symp Ser 17, 131-134 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 178 879 synthetic tumor necrosis factor precursor sigp 178 399 synthetic tumor necrosis factor signal peptide matp 400 876 synthetic tumor necrosis factor BASE COUNT 318 a 438 c 336 g 287 t ORIGIN 1 agcagacgct ccctcagcaa ggacagcaga ggaccagcta agagggagag aagcaactac 61 agaccccccc tgaaaacaaa aatcagacgc cacatcccct gacaagctgc caggcaggtt 121 ctcttcctct cacatactga cccacggctc caccctctct cccctggaaa ggacaccatg 181 agcactgaaa gcatgatccg ggacgtggag ctggccgagg aggcgctccc caagaagaca 241 ggggggcccc agggctccag gcggtgcttg ttcctcagcc tcttctcctt cctgatcgtg 301 gcaggcgcca ccacgctctt ctgcctgctg cactttggag tgatgggccc ccagagggaa 361 gagttcccca gggacctctc tctaatcagc cctctggccc aggcagtcag atcatcttct 421 cgaaccccga gtgacaagcc tgtagcccat gttgtagcaa accctcaagc tgaggggcag 481 ctccagtggc tgaaccgccg ggccaatgcc ctcctggcca atggcgtgga gctgagagat 541 aaccagctgg tggtgccatc agagggcctg tacctcatct actcccaggt cctcttcaag 601 ggccaaggct gcccctccac ccatgtgctc ctcacccaca ccatcagccg catcgccgtc 661 tcctaccaga ccaaggtcaa cctcctctct gccatcaaga gcccctgcca gagggagacc 721 ccagaggggg ctgaggccaa gccctggtat gagcccatct atctgggagg ggtcttccag 781 ctggagaagg gtgaccgact cagcgctgag atcaatcggc ccgactatct cgactttgcc 841 gagtctgggc aggtctactt tgggatcatt gccctgtgag gaggacgaac atccaacctt 901 cccaaacgcc tcccctgtcc caatcccttt attaccccct ccttcagaca ccctcaacct 961 cttctggctc aaaaagagaa ttgggggctt agggtcggaa cccaagctta gaactttaag 1021 caacaagacc accacttcga aacctgggat tcaggaatgt gtggcctgca cagtgaaggt 1081 gctggcaacc actaagaatt caaactgggg cctccagaac tcactggggc ctacagcttt 1141 gatccctgac atctggaatc tggagaccag ggagcctttg gttctgacca gaatgctgca 1201 ggacttgaga agacctcacc tagaaattga cacaagtgga ccttaggcct tcctctctcc 1261 agatgtttcc agacttcctt gagacacgga gcccagccct ccccatggag ccagctccct 1321 ctatttatgt ttgcacttgt gattatttat tatttattta ttatttattt atttacaga // LOCUS SHREF1BA 694 bp ss-mRNA INV 02-AUG-1990 DEFINITION A.salina elongation factor 1-beta mRNA, complete cds. ACCESSION M35636 KEYWORDS elongation factor 1-beta. SOURCE A.salina cyst, cDNA to mRNA, clones B[4,27]. ORGANISM Artemia salina Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Branchiopoda; Sarsostraca; Anostraca; Artimiidae. REFERENCE 1 (bases 1 to 694) AUTHORS Maessen,G.D.F., Amons,R., Maassen,J.A. and Moeller,W. TITLE Primary structure of elongation factor 1-beta from Artemia JOURNAL FEBS Lett. 208, 77-83 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 26 649 elongation factor 1-beta signal 677 682 polyA signal BASE COUNT 240 a 117 c 150 g 187 t ORIGIN 1 ttttctgctg taaaaaaaga tcaaaatggc caatattgac ctaaaagccg aaaagggaca 61 ggaacaacta aacgagcttt tggccaacaa gagttacctt caaggatatg aaccatccca 121 agaggatgtt gctgccttca atcagctaaa taaagccccg tctgacaaat ttccttactt 181 gcttcgatgg tacaagcaca ttagttcttt ttccgatgca gaaaagaaag gatttcccgg 241 tattccaact tctgcatcca aagaagaaga tgatgatgtt gatctttttg gatcagatga 301 ggaagatgaa gaagctgaaa aaatcaaggc agaaaggatg aaggcctata gtgataagaa 361 gtcaaagaaa cctgcaattg ttgccaaatc ctctgtcatt ttggatataa agccatggga 421 cgacgaaacg gatatggcag aaatggagaa actagttcga agtgtacaaa tggatggtct 481 tgtctggggt gcagctaaac ttattcccct tgcatatgga attaagaagc tatcaatcat 541 gtgtgttgta gaagatgata aagtatccat tgacgaactt caagaaaaaa tttctgaatt 601 tgaagacttt gtccagtctg tggatattgc cgctttcaac aaagtttaat tcttaattaa 661 ttacttgtga gaagggaata aatcagggca attg // LOCUS YSCRO18SA 813 bp ds-DNA PLN 02-AUG-1990 DEFINITION Yeast (S.cerevisiae) ribosomal operon DNA with 18S rRNA, 5' end. ACCESSION M35588 KEYWORDS 18S ribosomal RNA; ribosomal operon. SOURCE Yeast (S.cerevisiae) DNA, clone pVIrA-3. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 813) AUTHORS Skryabin,K.G., Zakhar'ev,V.M., Rubtsov,P.M. and Baev,A.A. TITLE Nucleotide sequence of the presumed region of initiation of transcription of the yeast ribosomal operon JOURNAL Dokl. Biochem. 247, 288-290 (1979) STANDARD simple staff_entry FEATURES from to/span description rRNA 653 > 813 18S rRNA BASE COUNT 221 a 140 c 194 g 258 t ORIGIN 1 gaattcgagt aggcttgtcg ttcgttatgt ttttgtaaat ggcctcgtca aacggtggag 61 agagtcgcta ggtgatcgtc agatctgcct agtctctata cagcgtgttt aattgacatg 121 ggttgatgcg tattgagaga tacaatttgg gaagaaattc ccagagtgtg tttcttttgc 181 gtttaacctg aacagtctca tcgtgggcat cttgcgattc cattggtgag cagcgaagga 241 tttggtggat tactagctaa tagcaatcta tttcaaagaa ttcaaacttg ggggaatgcc 301 ttgttgaata gccggtcgca agacgtgatt cttgaagtgt aacctcctct caaatcagcg 361 atatcaaacg taccattccg tgaacaccgg ggtatctgtt tggtggaacc tgattagagg 421 aaactcaaag agtgctatgg tatggtgacg gagtgcgctg gtcaagagcg taagcttttg 481 aacagagagc atttccggca gcagagagac ccgaaaaagc aatttttctg gaatttcagc 541 tgtttccaaa ctcaataagt atcttctagc aagagggaat aggtgggaaa aaaaaaaaga 601 gatttcggtt tctttctttt ttactgcttg ttgcttcttc ttttaagata gttatctggt 661 tgatcctgcc agtagtcata tgcttgtctc aaagattaag ccatgcatgt ctaagtataa 721 gcaatttata cagtgaaact gcgaatggct cattaaatca gttatcgttt atttgatagt 781 tcctttacta catggtataa ctgtggtaat tct // LOCUS YSJLEU2A 968 bp ds-DNA PLN 02-AUG-1990 DEFINITION Y.lipolytica beta-isopropylmalate dehydrogenase (LEU2) gene, 5' end. ACCESSION M35579 KEYWORDS beta-isopropylmalate dehydrogenase. SOURCE Y.lipolytica (strain 21109-9) DNA, clone pINA92. ORGANISM Yarrowia lipolytica Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 968) AUTHORS Gaillardin,C. and Ribet,A.-M. TITLE LEU2 directed expression of beta-galactosidase activity and phleomycin resistance in Yarrowia lipolytica JOURNAL Curr. Genet. 11, 369-375 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 291 > 968 beta-isopropylmalate dehydrogenase (LEU2) signal 163 172 CAAT box signal 213 223 TATA box BASE COUNT 215 a 303 c 242 g 208 t ORIGIN 1 gaattccgtc gtcgcctgag tcatcattta tttaccagtt ggccacaaac ccttgacgat 61 ctcgtatgtc ccctccgaca tactcccggc cggctgggta cgttcgatag cgctatcggc 121 atcgacaagg tttgggtccc tagccgatac cgcactacct gagtcacaat cttcggaggt 181 ttagtcttcc acatagcacg ggcaaaagtg cgtatatata caagagcgtt tgccagccac 241 agattttcac tccacacacc acatcacaca tacaaccaca cacatccaca atggaacccg 301 aaactaagaa gaccaagact gactccaaga agattgttct tctcggcggc gacttctgtg 361 gccccgaggt gattgccgag gccgtcaagg tgctcaagtc tgttgctgag gcctccggca 421 ccgagtttgt gttcgaggac cgactcattg gaggagctgc cattgagaag gagggcgagc 481 ccatcaccga cgctactctc gacatctgcc gaaaggctga ctctattatg ctcggtgctg 541 tcggaggcgc tgccaacacc gtatggacca ctcccgacgg acgaaccgac gtgcgacccg 601 agcagggtct cctcaagctg cgaaaggacc tgaacctgta cgccaacctg cgaccctgcc 661 agctgctgtc gcccaagctc gccgatctct cccccatccg aaacgttgag ggcaccgact 721 tcatcattgt ccgagagctc gtcggaggta tctactttgg agagcgaaag gaggatgacg 781 gatctggcgt cgcttccgac accgagacct actccgttcc tgaggttgag cgaattgccc 841 gaatggccgc cttcctggcc cttcagcata acccccctct tcccgtgtgg tctcttgaca 901 aggccaacgt gctggcctcc tctcgacttt ggcgaaagac tgttactcga gtcctcaagg 961 acgaattc // LOCUS YSTARSA1 131 bp ds-DNA PLN 02-AUG-1990 DEFINITION Yeast ARS site linked to HML locus, segment 1. ACCESSION M17505 KEYWORDS autonomous replication; origin of replication. SEGMENT 1 of 2 SOURCE Yeast DNA. ORGANISM Yeast sp. Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina. REFERENCE 1 (bases 1 to 131) AUTHORS Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J., Nasmyth,K.A. and Hicks,J.B. TITLE Localization and sequence analysis of yeast origins of DNA replication JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983) STANDARD simple staff_entry FEATURES from to/span description site 92 101 ARS consensus sequence BASE COUNT 41 a 14 c 19 g 57 t ORIGIN Chromosome III. 1 atggaatgta atttcttaag tattctatat gtacttaaaa cctattaata tatggatcaa 61 cacagtatct tatgaatggg tttttgattt ttttatgttt ttttaaaaca ttaaagtttt 121 cggcacggac t // LOCUS YSTARSA2 146 bp ds-DNA PLN 02-AUG-1990 DEFINITION Yeast ARS site linked to HML locus, segment 2. ACCESSION M17506 KEYWORDS autonomous replication; origin of replication. SEGMENT 2 of 2 SOURCE Yeast DNA. ORGANISM Yeast sp. Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina. REFERENCE 1 (bases 1 to 146) AUTHORS Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J., Nasmyth,K.A. and Hicks,J.B. TITLE Localization and sequence analysis of yeast origins of DNA replication JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983) STANDARD simple staff_entry FEATURES from to/span description site 136 146 ARS consensus sequence BASE COUNT 46 a 22 c 21 g 57 t ORIGIN Chromosome III; about 2.8 kb downstream of of segment 1. 1 cttctatgtt acttcttcaa catgaaagcc cgaacgtttg caatgatatc ctctttttca 61 cccgattatt ttcatttatt aagtaagtgg ggttctgggt gaaaaaatca aaacaaaaat 121 atgaatttgt aatgattttt atattt // LOCUS YSTARSB1 147 bp ds-DNA PLN 02-AUG-1990 DEFINITION Yeast ARS site linked to HMR locus, segment 1. ACCESSION M17507 KEYWORDS autonomous replication; origin of replication. SEGMENT 1 of 2 SOURCE Yeast DNA. ORGANISM Yeast sp. Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina. REFERENCE 1 (bases 1 to 147) AUTHORS Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J., Nasmyth,K.A. and Hicks,J.B. TITLE Localization and sequence analysis of yeast origins of DNA replication JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983) STANDARD simple staff_entry FEATURES from to/span description site 112 122 ARS consensus sequence BASE COUNT 56 a 14 c 9 g 68 t ORIGIN Chromosome III. 1 aatagtatca atatatatat atatatatat atttatttgt ttactttttc tatcagtgtt 61 ttcaattttt tattaaacaa tgtttgattt tttaaatcgc aatttaatac ctaaatataa 121 aaaatgttat tatattgcaa aaaccca // LOCUS YSTARSB2 85 bp ds-DNA PLN 02-AUG-1990 DEFINITION Yeast ARS site linked to HMR locus, segment 2. ACCESSION M17508 KEYWORDS autonomous replication; origin of replication. SEGMENT 2 of 2 SOURCE Yeast DNA. ORGANISM Yeast sp. Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina. REFERENCE 1 (bases 1 to 85) AUTHORS Broach,J.R., Li,Y.-Y., Feldman,J., Jayaram,M., Abraham,J., Nasmyth,K.A. and Hicks,J.B. TITLE Localization and sequence analysis of yeast origins of DNA replication JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47, 1165-1173 (1983) STANDARD simple staff_entry FEATURES from to/span description site 63 73 ARS consensus sequence BASE COUNT 33 a 7 c 5 g 40 t ORIGIN Chromosome III; about 1.9 kb downstream of segment 1. 1 atttatcatg ttttggtatg ataatttaat ttttaaaaaa acaaatttaa ttgacctcat 61 taattaatat ttattaatac cttta // LOCUS HUMROS1 7375 bp ss-mRNA PRI 02-AUG-1990 DEFINITION Human transmembrane tyrosine-specific protein kinase (ROS1) mRNA, complete cds. ACCESSION M34353 KEYWORDS oncogene; transmembrane tyrosine-specific protein kinase. SOURCE Human glioblastoma cell line SW-1088, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 7375) AUTHORS Birchmeier,C., O'Neil,K., Riggs,M. and Wigler,M. TITLE Characterization of ROS1 cDNA from a human glioblastoma cell line JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4799-4803 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.H.Wigler, 15-MAY-1990. FEATURES from to/span description pept 207 7250 transmembrane tyrosine-specific protein kinase precursor /hgml_locus_uid="LN0124W" /nomgen="ROS1" /map="6q21-q22" sigp 207 314 transmembrane tyrosine-specific protein kinase signal peptide matp 208 7247 transmembrane tyrosine-specific protein kinase BASE COUNT 2203 a 1496 c 1605 g 2071 t ORIGIN 1 ccgcattcaa gctttcaagc attcaaaggt ctaaatgaaa aaggctaagt attatttcaa 61 aaggcaagta tatcctaata tagcaaaaca aacaaagcaa aatccatcag ctactcctcc 121 aattgaagtg atgaagccca aataattcat atagcaaaat ggagaaaatt agaccggcca 181 tctaaaaatc tgccattggt gaagtgatga agaacattta ctgtcttatt ccgaagcttg 241 tcaattttgc aactcttggc tgcctatgga tttctgtggt gcagtgtaca gttttaaata 301 gctgcctaaa gtcgtgtgta actaatctgg gccagcagct tgaccttggc acaccacata 361 atctgagtga accgtgtatc caaggatgtc acttttggaa ctctgtagat cagaaaaact 421 gtgctttaaa gtgtcgggag tcgtgtgagg ttggctgtag cagcgcggaa ggtgcatatg 481 aagaggaagt actggaaaat gcagacctac caactgctcc ctttgcttct tccattggaa 541 gccacaatat gacattacga tggaaatctg caaacttctc tggagtaaaa tacatcattc 601 agtggaaata tgcacaactt ctgggaagct ggacttatac taagactgtg tccagaccgt 661 cctatgtggt caagcccctg caccccttca ctgagtacat tttccgagtg gtttggatct 721 tcacagcgca gctgcagctc tactcccctc caagtcccag ttacaggact catcctcatg 781 gagttcctga aactgcacct ttgattagga atattgagag ctcaagtccc gacactgtgg 841 aagtcagctg ggatccacct caattcccag gtggacctat tttgggttat aacttaaggc 901 tgatcagcaa aaatcaaaaa ttagatgcag ggacacagag aaccagtttc cagttttact 961 ccactttacc aaatactatc tacaggtttt ctattgcagc agtaaatgaa gttggtgagg 1021 gtccagaagc agaatctagt attaccactt catcttcagc agttcaacaa gaggaacagt 1081 ggctcttttt atccagaaaa acttctctaa gaaagagatc tttaaaacat ttagtagatg 1141 aagcacattg ccttcggttg gatgctatat accataatat tacaggaata tctgttgatg 1201 tccaccagca aattgtttat ttctctgaag gaactctcat atgggcgaag aaggctgcca 1261 acatgtctga tgtatctgac ctgagaattt tttacagagg ttcaggatta atttcttcta 1321 tctccataga ttggctttat caaagaatgt atttcatcat ggatgaactg gtatgtgtct 1381 gtgatttaga gaactgctca aacatcgagg aaattactcc accctctatt agtgcacctc 1441 aaaaaattgt ggctgattca tacaatgggt atgtctttta cctcctgaga gatggcattt 1501 atagagcaga ccttcctgta ccatctggcc ggtgtgcaga agctgtgcgt attgtggaga 1561 gttgcacgtt aaaggacttt gcaatcaagc cacaagccaa gcgaatcatt tacttcaatg 1621 acactgccca agtcttcatg tcaacatttc tggatggctc tgcttcccat ctcatcctac 1681 ctcgcatccc ctttgctgat gtgaaaagtt ttgcttgtga aaacaatgac tttcttgtca 1741 cagatggcaa ggtcattttc caacaggatg ctttgtcttt taatgaattc atcgtgggat 1801 gtgacctgag tcacatagaa gaatttgggt ttggtaactt ggtcatcttt ggctcatcct 1861 cccagctgca ccctctgcca ggccgcccgc aggagctttc ggtgctgttt ggctctcacc 1921 aggctcttgt tcaatggaag cctcctgccc ttgccatagg agccaatgtc atcctgatca 1981 gtgatattat tgaactcttt gaattaggcc cttctgcctg gcagaactgg acctatgagg 2041 tgaaagtatc cacccaagac cctcctgaag tcactcatat tttcttgaac ataagtggaa 2101 ccatgctgaa tgtacctgag ctgcagagtg ctatgaaata caaggtttct gtgagagcaa 2161 gttctccaaa gaggccaggc ccctggtcag agccctcagt gggtactacc ctggtgccag 2221 ctagtgaacc accatttatc atggctgtga aagaagatgg gctttggagt aaaccattaa 2281 atagctttgg cccaggagag ttcttatcct ctgatatagg aaatgtgtca gacatggatt 2341 ggtataacaa cagcctctac tacagtgaca cgaaaggcga cgtttttgtg tggctgctga 2401 atgggacgga tatctcagag aattatcacc tacccagcat tgcaggagca ggggctttag 2461 cttttgagtg gctgggtcac tttctctact gggctggaaa gacatatgtg atacaaaggc 2521 agtctgtgtt gacgggacac acagacattg ttacccacgt gaagctattg gtgaatgaca 2581 tggtggtgga ttcagttggt ggatatctct actggaccac actctattca gtggaaagca 2641 ccagactaaa tggggaaagt tcccttgtac tacagacaca gccttggttt tctgggaaaa 2701 aggtaattgc tctaacttta gacctcagtg atgggctcct gtattggttg gttcaagaca 2761 gtcaatgtat tcacctgtac acagctgttc ttcggggaca gagcactggg gataccacca 2821 tcacagaatt tgcagcctgg agtacttctg aaatttccca gaatgcactg atgtactata 2881 gtggtcggct gttctggatc aatggcttta ggattatcac aactcaagaa ataggtcaga 2941 aaaccagtgt ctctgttttg gaaccagcca gatttaatca gttcacaatt attcagacat 3001 cccttaagcc cctgccaggg aacttttcct ttacccctaa ggttattcca gattctgttc 3061 aagagtcttc atttaggatt gaaggaaatg cttcaagttt tcaaatcctg tggaatggtc 3121 cccctgcggt agactggggt gtagttttct acagtgtaga atttagtgct cattctaagt 3181 tcttggctag tgaacaacac tctttacctg tatttactgt ggaaggactg gaaccttatg 3241 ccttatttaa tctttctgtc actccttata cctactgggg aaagggcccc aaaacatctc 3301 tgtcacttcg agcacctgaa acagttccat cagcaccaga gaaccccaga atatttatat 3361 taccaagtgg aaaatgctgc aacaagaatg aagttgtggt ggaatttagg tggaacaaac 3421 ctaagcatga aaatggggtg ttaacaaaat ttgaaatttt ctacaatata tccaatcaaa 3481 gtattacaaa caaaacatgt gaagactgga ttgctgtcaa tgtcactccc tcagtgatgt 3541 cttttcaact tgaaggcatg agtcccagat gctttattgc cttccaggtt agggccttta 3601 catctaaggg gccaggacca tatgctgacg ttgtaaagtc tacaacatca gaaatcaacc 3661 catttcctca cctcataact cttcttggta acaagatagt ttttttagat atggatcaaa 3721 atcaagttgt gtggacgttt tcagcagaaa gagttatcag tgccgtttgc tacacagctg 3781 ataatgagat gggatattat gctgaagggg actcactctt tcttctgcac ttgcacaatc 3841 gctctagctc tgagcttttc caagattcac tggtttttga tatcacagtt attacaattg 3901 actggatttc aaggcacctc tactttgcac tgaaagaatc acaaaatgga atgcaagtat 3961 ttgatgttga tcttgaacac aaggtgaaat atcccagaga ggtgaagatt cacaatagga 4021 attcaacaat aatttctttt tctgtatatc ctcttttaag tcgcttgtat tggacagaag 4081 tttccaattt tggctaccag atgttctact acagtattat cagtcacacc ttgcaccgaa 4141 ttctgcaacc cacagctaca aaccaacaaa acaaaaggaa tcaatgttct tgtaatgtga 4201 ctgaatttga gttaagtgga gcaatggcta ttgatacctc taacctagag aaaccattga 4261 tatactttgc caaagcacaa gagatctggg caatggatct ggaaggctgt cagtgttgga 4321 gagttatcac agtacctgct atgctcgcag gaaaaaccct tgttagctta actgtggatg 4381 gagatcttat atactggatc atcacagcaa aggacagcac acagatttat caggcaaaga 4441 aaggaaatgg ggccatcgtt tcccaggtga aggccctaag gagtaggcat atcttggctt 4501 acagttcagt tatgcagcct tttccagata aagcgtttct gtctctagct tcagacactg 4561 tggaaccaac tatacttaat gccactaaca ctagcctcac aatcagatta cctctggcca 4621 agacaaacct cacatggtat ggcatcacca gccctactcc aacatacctg gtttattatg 4681 cagaagttaa tgacaggaaa aacagctctg acttgaaata tagaattctg gaatttcagg 4741 acagtatagc tcttattgaa gatttacaac cattttcaac atacatgata cagatagctg 4801 taaaaaatta ttattcagat cctttggaac atttaccacc aggaaaagag atttggggaa 4861 aaactaaaaa tggagtacca gaggcagtgc agctcattaa tacaactgtg cggtcagaca 4921 ccagcctcat tatatcttgg agagaatctc acaagccaaa tggacctaaa gaatcagtcc 4981 gttatcagtt ggcaatctca cacctggccc taattcctga aactcctcta agacaaagtg 5041 aatttccaaa tggaaggctc actctccttg ttactagact gtctggtgga aatatttatg 5101 tgttaaaggt tcttgcctgc cactctgagg aaatgtggtg tacagagagt catcctgtca 5161 ctgtggaaat gtttaacaca ccagagaaac cttattcctt ggttccagag aacactagtt 5221 tgcaatttaa ttggaaggct ccattgaatg ttaacctcat cagattttgg gttgagctac 5281 agaagtggaa atacaatgag ttttaccatg ttaaaacttc atgcagccaa ggtcctgctt 5341 atgtctgtaa tatcacaaat ctacaacctt atacttcata taatgtcaga gtagtggtgg 5401 tttataagac gggagaaaat agcacctcac ttccagaaag ctttaagaca aaagctggag 5461 tcccaaataa accaggcatt cccaaattac tagaagggag taaaaattca atacagtggg 5521 agaaagctga agataatgga tgtagaatta catactatat ccttgagata agaaagagca 5581 cttcaaataa tttacagaac cagaatttaa ggtggaagat gacatttaat ggatcctgca 5641 gtagtgtttg cacatggaag tccaaaaacc tgaaaggaat atttcagttc agagtagtag 5701 ctgcaaataa tctagggttt ggtgaatata gtggaatcag tgagaatatt atattagttg 5761 gagatgattt ttggatacca gaaacaagtt tcatacttac tattatagtt ggaatatttc 5821 tggttgttac aatcccactg acctttgtct ggcatagaag attaaagaat caaaaaagtg 5881 ccaaggaagg ggtgacagtg cttataaacg aagacaaaga gttggctgag ctgcgaggtc 5941 tggcagccgg agtaggcctg gctaatgcct gctatgcaat acatactctt ccaacccaag 6001 aggagattga aaatcttcct gccttccctc gggaaaaact gactctgcgt ctcttgctgg 6061 gaagtggagc ctttggagaa gtgtatgaag gaacagcagt ggacatctta ggagttggaa 6121 gtggagaaat caaagtagca gtgaagactt tgaagaaggg ttccacagac caggagaaga 6181 ttgaattcct gaaggaggca catctgatga gcaaatttaa tcatcccaac attctgaagc 6241 agcttggagt ttgtctgctg aatgaacccc aatacattat cctggaactg atggagggag 6301 gagaccttct tacttatttg cgtaaagccc ggatggcaac gttttatggt cctttactca 6361 ccttggttga ccttgtagac ctgtgtgtag atatttcaaa aggctgtgtc tacttggaac 6421 ggatgcattt cattcacagg gatctggcag ctcgaaattg ccttgtttcc gtgaaagact 6481 ataccagtcc acggatagtg aagattggag actttggact cgccagagac atctataaaa 6541 atgattacta tagaaagaga ggggaaggcc tgctcccagt tcggtggatg gctccagaaa 6601 gtttgatgga tggaatcttc actactcaat ctgatgtatg gtcttttgga attctgattt 6661 gggagatttt aactcttggt catcagcctt atccagctca ttccaacctt gatgtgttaa 6721 actatgtgca aacaggaggg agactggagc caccaagaaa ttgtcctgat gatctgtgga 6781 atttaatgac ccagtgctgg gctcaagaac ccgaccaaag acctactttt catagaattc 6841 agaaccaact tcagttattc agaaattttt tcttaaatag catttatcag tgcagagatg 6901 aagcaaacaa cagtggagtc ataaatgaaa gctttgaagg tgaagatggc gatgtgattt 6961 gtttgaattc agatgacatt atgccagttg ttttaatgga aacgaagaac cgagaagggt 7021 taaactatat ggtacttgct acagaatgtg gccaaggtga agaaaagtct gagggtcctc 7081 taggctccca ggaatctgaa tcttgtggtc tgaggaaaga agagaaggaa ccacatgcag 7141 acaaagattt ctgccaagaa aaacaagtgg cttactgccc ttctggcaag cctgaaggcc 7201 tgaactatgc ctgtctcact cacagtggat atggagatgg gtctgattaa tagcgttgtt 7261 tgggaaatag agagttgaga taaacactct cattcagtag ttactgaaag aaaactctgc 7321 tagaatgata aatgtcatgg tggtctataa ctccaaataa acaatgcaac gttcc // LOCUS MBOPIVML 969 bp ds-DNA BCT 02-AUG-1990 DEFINITION M.lacunata pilin gene inverting protein (PivML) gene, complete cds. ACCESSION M34367 KEYWORDS pilin gene inverting protein. SOURCE M.lacunata (ATCC 17956) DNA, clone pMxL1. ORGANISM Moraxella lacunata Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 969) AUTHORS Marrs,C.F., Rozsa,F.W., Hackel,M., Stevens,S.P. and Glasgow,A.C. TITLE Identification, cloning and sequencing of Piv, a new gene involved in inverting the pilin gene of M.lacunata JOURNAL J. Bacteriol. 172, 4370-4377 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.F.Marrs, 17-MAY-1990. FEATURES from to/span description pept 1 969 pilin gene inverting protein (PivML) BASE COUNT 318 a 236 c 165 g 250 t ORIGIN 1 atgtctaaaa cttacattgg gattgatatc gccaaaaaca catttgatgc ctgctttatt 61 acccataaca cttggcaaaa ctgtacattc acaaataacc aacagggttt tatagagcta 121 acattgtgga ttcaagcaca tcattataac acatctacct tacatttgat tattgaggca 181 acgggggctt attgggaaaa attagctcat tgggcaatct ctcatcatca taaggtaagt 241 attgtcaacc ccttatatat tcatgcatac gccaaaagcc taggcatacg caccaagaca 301 gataagcagg atgccattct cttggcacgc tatggtgcaa aagaaaatcc acctttatgg 361 cagcctaaat cagacaacga aataaaactg actgcccttc taaaacaacg tgaacaccac 421 aaacggcaac tcatcaaaga acgcactcga caggaagctc tgtctattta tgtgaaatcg 481 tatactgatg acaacattcg tcattggagt gacagcatca cacagctcga tcatcaaatt 541 tggcaactca tcaattgcac gcccgagcta aattaccgtg ccagcctact tgccacaata 601 cctggaattg gcaaaaagac actgccacat ctactggttg tcattggaga tggttcatca 661 tttcaaagtg ccaaacatct agcaagctat gctgggcttg cccctcgtca tcatcaatca 721 ggcattagta ttcacaagca gtcttccatt ggatttagtg gacaaaaaga actacgttca 781 gcccttttta tgcccgctgt catagtcagc tttggtcgat atcccgcttt tcaaaagttc 841 gtcaaacgca tggaacaaaa aggcaaaacc aaaaaacaaa ttattatcgc catcatgcgc 901 aaattactca ccatcagcta tgccgttatt cgccaaaatc gcccttttga caaacgcatt 961 catgaataa // LOCUS STYMKAD 915 bp ds-DNA BCT 02-AUG-1990 DEFINITION Plasmid pEX102 mouse killing agent D (mkaD) gene, complete cds. ACCESSION M34355 KEYWORDS mouse killing agent D. SOURCE Plasmid pEX102 (from Salmonella typhimurium, strain TML R66) DNA. ORGANISM Plasmid pEX102 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 915) AUTHORS Taira,S. and Rhen,M. TITLE Nucleotide sequence of mkaD, a virulence associated gene of Salmonella typhimurium containing variable and constant region JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.P.Taira, 16-MAY-1990. FEATURES from to/span description pept 132 857 mouse killing agent D (mkaD) BASE COUNT 273 a 181 c 213 g 248 t ORIGIN 1 ggaccgaaat atcatttttt cgcctgataa aaaatgaggt tttctggata actaatcgtt 61 ttattaaaaa aaaactgaga atttatatct aataatatgg cgatatatcc atatcgcaaa 121 ggagatttcc catgcccata aataggccta atctaaatct aaacatccct cctttgaata 181 ttgtagctgc ttatgatggg gcggaaatac catctacaaa taagcacctg aaaaataatt 241 tcaactcctt gcacaaccaa atgcggaaga tgccggtatc ccactttaaa gaggcgctgg 301 atgtgcctga ctattcaggg atgcgccaga gtggtttctt tgctatgagc caaggttttc 361 agctgaataa ccatggttac gatgttttca tccatgctcg tcgagaatca cctcagtctc 421 agggcaaatt tgccggtgac aagttccaca tcagtgtgct cagggatatg gtgccacaag 481 catttcaagc gctgtccgga ttgctgtttt cagaggacag tccggtagat aagtggaaag 541 tgaccgatat ggagaaggtc gttcaacaag cccgtgttag cctgggcgct cagttcacgt 601 tgtatataaa accagaccag gaaaattcgc agtacagtgc gtcgtttctc cacaagacac 661 ggcaatttat agagtgtctg gaatccagac tatccgaaaa tggggttatt tcaggacagt 721 gtcctgagtc agacgttcat cctgaaaatt ggaaatatct cagttatcgt aatgaactac 781 gaagtgggcg tgatggtggc gaaatgcaga gacaggcttt acgtgaggaa ccgttttatc 841 gtttgatgac agagtaagta tgggtttggg gagcaacgga acagtaaacg ccgttaaaca 901 gctattttaa atgct // LOCUS TOBEXT 3994 bp ds-DNA PLN 02-AUG-1990 DEFINITION N.plumbaginifolia extensin (ext) gene, complete cds. ACCESSION M34371 KEYWORDS cell wall protein; extensin. SOURCE N.plumbaginifolia leaf DNA. ORGANISM Nicotiana plumbaginifolia Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 3994) AUTHORS De Loose,M.R.R., Gheyson,G., Tire,C., Gielen,J., Villaroel,R., Genetello,C., Van Montagu,M., Depicker,A. and Inze,D. TITLE The extensin signal peptide allows secretion of heterologous proteins from protoplasts JOURNAL Unpublished (1990) STANDARD unannotated staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.R.De Loose, 18-MAY-1990. Author address: M.R.R.De Loose Laboratorium voor Genetika, Wetenschappen, K.L.Ledeganckstraat, Gent, Oost-Vlaanderen, BELGIUM FEATURES from to/span description pept 1590 2840 extensin precursor sigp 1590 1652 extensin signal peptide (could end at 1667) matp 1653 2837 extensin (could start at 1668) mRNA 1560 3342 extensin mRNA rpt 1740 1824 tandem 85 bp repeats rpt 1740 1758 19 bp repeats signal 1532 1537 TATA box signal 3323 3329 poly-A signal BASE COUNT 1264 a 992 c 550 g 1188 t ORIGIN 1 cgcccgggga tcctcctgga tgccattttt cctttgcttt tggatacctt ttcgcaactt 61 tagtccattg cttgaacatg gttttttcta cctcttgtct gttcccatga tgataaacta 121 tgatagctaa cattttcagt aatattggat tgaattagat atactttgaa ttattgagct 181 acaaaattcg ttgaatattt tgccctatcc cccacccaaa agaaaataat tcataccttt 241 atttggtaat aaaagtgggt cacatgcaca gtttttctcg tcttctctag attaaactct 301 ttggaaatga cctactgaaa atactacaca taaaattccc cactcccctc caagttgggg 361 gagggtgggg gttggtgggt tttttttccc cttgaatcca ttcagccggc ttatgcctcc 421 cgaggtggtg ggaggtttaa tttgaaccct taccctatag ttaatatcga aatagataat 481 acaaaggagg ggaacataaa accaaaacct ccgacaaaac atcaaagtgt aggagttcat 541 gatatacgct tgattgaaag tgcacaatgg tctttgcata ctatcaaagt atcaattggt 601 tcttgaatta tatctcttaa tataatgtat tgtgtttaat ttctctacta tcccctattt 661 tataagacta aaagatcctg acatgcttct tgaacacatg tgaaggttag ttaactggtc 721 agaagtacac aagaattaat ttgtacacct atgtttctca ccagaatgcc cataaattac 781 atcaattctt caacatcaag agatcgaaaa acttaacaat tgttctaata gttgaactga 841 gtcgtccgat ccatatccga ttcttcacta gaagcattaa tcatatagag caatttaact 901 taatttacta tattggtaat catttacata gttaagttat aaaaccttgg agcgacacaa 961 ggatttacac taatcatgat tgaatattaa ctttactcaa tttatcaatg tctgatacaa 1021 gtgaattaat ttctctcttt gtgatttcgg tagaaaatta atgtcaagtt tcaaagtttt 1081 tctttttgaa attagtcata catgtgaata gaacattaat ttaagttaaa gaatcatata 1141 aaaaagttag ccgctttgat ttctgtaact gaaatcgtgc agagatgagg catccacact 1201 tgtttttcaa agcttcagta ctatttaaaa acaaagacag taaaaaggag actttttctt 1261 tgacattatt gcatcagaaa tagtatagct gccataatag tttattcctt tgcttagctt 1321 gcagcctcta tcaaacaaaa aaatacacca gctcaagtca atttgagccg acaacatgac 1381 aaaaccaaat caaatatgca tttttttttt tttttttttt tttttttttt tttttttttt 1441 tttacacttt ggtaggtgtt aagtaatcta gtgagacttt taccttcatt tatgaaaatc 1501 ttgaaaaggg taattgtcta attgaaagct atataaaggg gtcgaagtga agcttaagag 1561 gacaacaact tttctcattt gtttcaaaga tgggaaaaat ggcttctcta tttgccacat 1621 ttttagtggt tttagtgtca cttagcttag cttctgaaag ctcagcaaat tatcaatact 1681 catctccacc accacctaag aagccatacc acccttcacc aacaccatat tatcctgcac 1741 cagtttataa atctccacca ccaccaattc cggtttataa gtcaccacca ccacccaaga 1801 agccatacta ccctccacac accccagttt acaaatctcc accaccacca actccagttt 1861 acaagtcgcc accaccaccc aagaagccac actaccctcc acacacaccc gtttacaagt 1921 ctccaccacc acctactcca gtttacaagt caccaccatc acccaagaag ccacactatc 1981 ctccacacac tccggtttac aagtcgccac caccaccaac tcccgtttac aagtcgccac 2041 caccacccaa gaagccacac tatccaccac acaccccggt ttacaagtcg ccaccaccac 2101 caactcccgt ttacaagtcg ccaccaccac caaagaagcc acactatcct ccacacacac 2161 cagtttacaa gtcaccacca ccacccaaga agccatacta tccaccacac accccagttt 2221 acaagtcgcc accaccacca actccagttt acaagtcacc accaccacct aagaagccat 2281 accacccttc accaacaccg tatcatcctt caccagttta caagtcccct ccaccaccaa 2341 ctccagttta caagtcacca ccaccaccta agaagccata ccacccttca ccaacaccgt 2401 atcatccatc accagtttac aagtcccctc caccaccaac tccagtttac aagtcaccac 2461 caccacctaa gaagccatac cacccttcac caacaccgta tcatccatca ccagtttaca 2521 agtcccctcc accaccaact ccagtttaca agtcaccacc accacctaag aagccatacc 2581 acccttcacc aacaccgtat catcctgcac cagtttacaa gtccccacca ccaccaactc 2641 cagtttacaa gtcaccaccg ccaccagtga agccatacca tccttcacca acaccatacc 2701 atcctgcacc agtttataag tctccaccac caccaactcc tgtttacaag tcacctccac 2761 caccaactcc agtctacaag tctccaccac cacaccaccc ctatgtttac gcttctcctc 2821 ctcctcccta ccattactaa gaagtgacat cacaaagctg aaggtaatta aggcaaatta 2881 attaaaacat tttcaaaatg atatgtcagg aatgcaaaga aaattgcata actttgttta 2941 tgcaaaacat taatataata taaagaactg cgctctctgc ttattaacta tttttttagc 3001 ttcatttatt agttactagc tgttcagaat tttcagtatc ttttgatatt acaaagaacc 3061 taatcagaca atgtatattc ttatgcagga aaagcagaat gttgagctaa aagaaaggct 3121 ttttccattt tcgagaggca acgagaagaa aagaagaaga aaaaatagag taaataataa 3181 gccccacggg aggcgaagtt cttttgtagc ttcatgttgt ctaagctatt gatattgttt 3241 gtaccctata ttttatttct ctgtctttgt gtatgttttg ttcaatttcg atctccttgc 3301 aaaatgcaga gattatgagc tgaataaagt aagttaatat tattatacgt gttaattttc 3361 tactcctctc tctagccagc cttttgtttt ctctttttct tatttcattt tctttatatc 3421 aaaccatttt aggagagggc caaggagtga tccagtaaaa cacgaagatt agaagaaact 3481 tcccttcttt ttttttatct gaaaaaaatt taaagtcgag atttatctct ttttgtaatg 3541 gaatcatttc cacagttatg actattttca aattgatatc gccagaccta acaaattaaa 3601 ttactccttc cgctcatctg gccgtacact tctgctccat ctggccgtac agttctgctc 3661 catctggcag tgtacggaga atggataact taatattcag tatgaatttg gacataaaat 3721 cttctaagat tgctagtgaa ttttgtgagc gttattgtgt taaaacaaaa gacagaaaat 3781 tattagaaga attaaagaaa tatgaaaata gtatagaatc agaaatattt cgagtccaca 3841 agttttttgt gtgttcttaa ggaatgtttt aacccctcac acgttgccaa ggtaatggat 3901 taaatcctcc caagataaaa cggaataaac cttcctgcaa cagtggcaat acaaactgca 3961 ggtcgactct agggatcccc gggcgagcga attc //
GenBank-Updates@genbank.bio.net (08/03/90)
LOCUS CLLRRE 1861 bp ss-rRNA RNA 03-AUG-1990 DEFINITION C.sapidus 18S rRNA. ACCESSION M34360 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE C.sapidus rRNA. ORGANISM Callinectes sapidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Brachyura; Brachyrhyncha; Portunoidea; Portunidae. REFERENCE 1 (bases 1 to 1861) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1860 18S ribosomal RNA BASE COUNT 359 a 375 c 421 g 367 t 339 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc nngcatgtct 61 nagtacaagc cgaatnaagg cgaaaccgcg aatggctnnn taaatcagct atgattcatt 121 nnatctgtac ccncncnnac ttggataact gtggtaattc tanagctaat acatgcatta 181 cgtctctgac cgcaagggaa gagngctttt attagttcaa aaccggtcgg gcctcggtcc 241 gnnnccccac tgtgttgaat ctgaataact ttttgctgag cgcacggtct cngcncgcgc 301 ngcctctttc aagtgtctgc cttatcagct ttcgattgta ggttatacgc ctacnatggc 361 tntnacgggt nacggggaat gagggttcga ttccggagag ngagcctgag aaacggctac 421 cacntctnag gnnggcagca ggcacgcnna ttacccactc cggcncgggg aggtagtgac 481 naaaaataac gatgcgagac tcatccgngg cctcgnnatc ggaatgagtn cactttaaat 541 cctttnacga ggatctattg gagggcnagt ctggtgccng cagccncggt nattccagct 601 gcaatanngt atattaaagt tgttgcggtt annaaagctc gtagttnnat ttcagttctg 661 gactgacggt tnccgcnngg tgcacactgt cacnctccga acagccacaa caccgctggc 721 cnnnggggtg ctcttcnccn ggtgtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnncc tgaatgccta tgcantggaa taatggaata 841 ggacctcggn nctnttttgt cggttttctg aacccgaggt aatgactaat aggaacnggc 901 gggggcnttc gtattgcgac gctagaggtg aaattcttgg accgtcgcna gacgaactnc 961 tgcgaaagca tttgccnagg atgtttcntt natcnagaan gaaagttaga ggttcgaagg 1021 cgatcagata ccgcnnnnnn nnnaaccnta aacgatgctg accagcgatc cgccggnntt 1081 attnncatga cccggccncc agcttccggg aaaccaaagt ctttgggttc cgggggaagt 1141 atggttgcaa agctgaaact caaaggaatt gacggnnnnn nnnnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnac acggggaacc tcaccaggcc cagacaccgg aagganngac 1261 agattgagag ctcnntctca ttnggtgggt ggtngtgcat nncgngttct tagttggtgg 1321 agcgnnnnnn nnnnnnnnnn ncgatnacga acgagannnn nnnnnnnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnngtgt ccagttcgca gcttcttctt agagggataa cggcaattct 1441 agccgcacga gattgagcaa taacaagtct gtgatgccct tagatgttct gggcgcacgc 1501 gcgctacact gaagggatca acgtgtcctc ccnctccgag aggagcgggn nncccgttga 1561 aatccnttca tgatagggat tggggtttgc aattgtctcc catgaannng gaattcccag 1621 taagcgcaag tcatgagctt gcgntgattn ngtccctncc nnttgtacac accnnnnntc 1681 gctactaccg attgaatgat ttagtgaggc ttcggactgg cgctcttgga tgccggnccc 1741 gagnggttcn ncgccggnnc ncggcgcctc gagctgacgg aaagatgtcc aaacttgatn 1801 nnnnnnnnnn nnnnnaagtc gtaacaaggt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1861 n // LOCUS HUMNCSRC 138 bp ds-DNA PRI 03-AUG-1990 DEFINITION Human membrane-associated tyrosine protein kinase (C-SRC) gene, exons 3, 4, NI, and NII, partial cds. ACCESSION M34469 KEYWORDS membrane-associated tryosine protein kinase. SOURCE Human adult brain DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 138) AUTHORS Pyper,J.M. and Bolen,J.B. TITLE Identification of a novel neuronal C-SRC exon expressed in human brain JOURNAL Mol. Cell. Biol. 10, 2035-2040 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 29 membrane-associated tyrosine protein kinase (C-SRC), exon 3 (AA at 1) 30 47 membrane-associated tyrosine protein kinase, exon NI 48 80 membrane-associated tyrosine protein kinase, exon NII 81 > 136 membrane-associated tyrosine protein kinase, exon 4 variant 117 117 a or g variant 135 135 c or t BASE COUNT 36 a 40 c 40 g 22 t ORIGIN 1 cggctccagc tccagattgt caacaacacg aggaaggtgg atgtcagcca gacctggttc 61 acattcagat ggctgcaaag agagggagac tggtggctgg cccactcgct cagcacagga 121 cagacaggct acatcccc // LOCUS MHVAPEPA 1000 bp ss-RNA VRL 03-AUG-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34435 KEYWORDS peplomer protein. SOURCE Murine (strain JHM-DL) RNA. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 1000) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review BASE COUNT 258 a 192 c 227 g 317 t 6 others ORIGIN 1 tgagtctttg tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag 61 tgtctcagtt gataagtttg ctttcccccg aagccgtcaa attgattttc aaattggcaa 121 ctccggattt ttgcaaacgg ctaattataa gattgatatc gctgccacat catgtcagct 181 gtattacagt cttcctaaga ataatgttac cattaataac tataacccct cgtcttggaa 241 taggaggtat ggttttaatg atgctggtgt gtttggcaaa agtaaacatg atgttgccta 301 cgcccagnna tgttttnttg tgcgacctag ctattgtccg tgtgcacaac cggaaatagt 361 tagtgcttgc actagtcaga ccaaacccat gtctgcttat tgccccacag gcacaattca 421 tcgtgagtgt tctctttgga atgggcccca tttgcgctcg gcacgtgtag gttccggcac 481 gtacacgtgt gagtgcactt gtaaacccaa tccatttgat acgtatgatc tccgctgtgg 541 gcaaattaaa actattgtta atgtgggcga tcattgtgaa ggtctgggtg ttttagaaga 601 taaaggtggc aatagcgatc cacataaggg ctgttcttgt gccaatgatt cttttatcgg 661 atggtcacat gacacttgtt tagtaaatga tcgctgccca atttttgcta acatattgtt 721 aaatggcatt aatagtggga ctacgtgttc cacagattta caattgccta atactgaagt 781 ggccactggc gtttgcgtca gatatgacct ctatggtatt actggtcnag gtgtttttaa 841 agaggtcaag gcagnntatt ataatagctg gcaggcccta ttatatgatg ttaatggtaa 901 cttaaacggg ttccgtgacc ttaccactaa caagacttat acgataagga gctgttatag 961 tggccgtgtt tctgctgcat atcataaaga agcacccgaa // LOCUS MHVAPEPB 843 bp ss-RNA VRL 03-AUG-1990 DEFINITION Murine coronavirus peplomer (S) protein gene. ACCESSION M34436 KEYWORDS peplomer protein. SOURCE Murine (strain A59) RNA. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 843) AUTHORS Banner,L.R., Keck,J.G. and Lai,M.M.C. TITLE A clustering of RNA recombination sites adjacent to a hypervariable region of the peplomer gene of Murine coronavirus JOURNAL Virology 175, 548-555 (1990) STANDARD simple staff_review FEATURES from to/span description site 400 401 deletion site BASE COUNT 227 a 153 c 183 g 280 t ORIGIN 1 tgagagtttg ttttgtaata atatcgatgc ttccaaagtg tatggcaggt gctttggtag 61 tatttcagtt gataagtttg ctgtaccccg aagtaggcaa gttgatttac agcttggtaa 121 ctctggattt ctgcagactg ctaattataa gattgataca gctgccactt cgtgtcagct 181 gcattacacc ttgcctaaga ataatgtcac cataaacaac cataacccct cgtcttggaa 241 taggaggtat ggctttaatg atgctggcgt ctttggcaaa aaccaacatg acgttgttta 301 cgctcagcaa tgttttactg taagatctag ttattgcccg tgtgctcaac cggacatagt 361 tagcccttgc actactcaga ctaagcctaa gtctgctttt ttaatgtggg tgaccattgt 421 gaaggcttag gtgttttaga agataattgt ggcaatgctg atccacataa gggttgtatc 481 tgtgccaaca attcatttat tggatggtca catgatacct gccttgttaa tgatcgctgc 541 caaatttttg ctaatatatt gttaaatggc attaatagtg gtaccacatg ttccacagat 601 ttgcagttgc ctaatactga agtggttact ggcatttgtg tcaaatatga cctctacggt 661 attactggac aaggtgtttt taaagaggtt aaggcagact attataatag ctggcaaacc 721 cttctgtatg atgttaatgg taatttgaat ggttttcgtg atcttaccac taacaagact 781 tatacgataa ggagctgtta tagtggccgt gtttctgctg catttcataa agatgcaccc 841 gaa // LOCUS MHVSP 3780 bp ss-mRNA VRL 03-AUG-1990 DEFINITION Mouse hepatitis virus surface protein S mRNA, complete cds. ACCESSION X04797 M34437 KEYWORDS glycoprotein; membrane glycoprotein; surface glycoprotein; surface projection glycoprotein. SOURCE Murine hepatitis virus (strain JHM). cDNA to viral RNA, clones pJMS1010, pJS112, and pJS92. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 3780) AUTHORS Schmidt,I., Skinner,M. and Siddell,S. TITLE Nucleotide Sequence of the Gene Encoding the Surface Projection Glycoprotein of Coronavirus MHV-JHM JOURNAL J. Gen. Virol. 68, 47-56 (1987) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description SITE 24 30 put. region of homology preceeding mRNA 5' initiation codons SITE 121 129 pot. N-glycosylation site SITE 208 216 pot. N-glycosylation site SITE 430 438 pot. N-glycosylation site SITE 604 612 pot. N-glycosylation site SITE 1099 1107 pot. N-glycosylation site SITE 1333 1341 pot. N-glycosylation site SITE 1636 1644 pot. N-glycosylation site SITE 1732 1740 pot. N-glycosylation site SITE 1756 1764 pot. N-glycosylation site SITE 1825 1833 pot. N-glycosylation site SITE 1900 1914 put. proteolytic cleavage site SITE 1972 1980 pot. N-glycosylation site SITE 2023 2031 pot. N-glycosylation site SITE 2293 2301 pot. N-glycosylation site SITE 2440 2448 pot. N-glycosylation site SITE 3139 3147 pot. N-glycosylation site SITE 3301 3309 pot. N-glycosylation site SITE 3331 3339 pot. N-glycosylation site SITE 3388 3396 pot. N-glycosylation site SITE 3436 3444 pot. N-glycosylation site SITE 3499 3507 pot. N-glycosylation site SITE 3553 3654 pot. transmembrane domain SITE 3622 3675 cysteine-rich region SITE 3655 3660 charge cluster SITE 3715 3723 pot. N-glycosylation site FEATURES from to/span description pept 31 3738 surface protein S precursor sigp 31 60 surface protein S signal peptide matp 61 3736 surface protein S BASE COUNT 1030 a 718 c 800 g 1232 t ORIGIN 1 cttgtagttt aaatctaatc taatctaaac atgctgttcg tctttatttt actattaccc 61 tcttgtttag ggtatattgg tgattttaga tgtatccaga ccgtgaatta taacggcaat 121 aatgcttctg cgcctagcat tagcaccgaa gcagtcgatg tttccaaagg tcggggcact 181 tactatgttt tagatcgtgt ttacttaaat gccacgttat tgcttactgg ttattatcct 241 gtggacggtt ccaattatcg gaatctcgcg cttacaggca ctaatacctt aagccttacg 301 tggtttaaac caccctttct aagtgagttt aatgatggta tatttgctaa ggtccagaac 361 ctcaagacaa atacgccaac aggtgcaacc tcatattttc ccactatagt tataggtagt 421 ttgtttggta acacttccta taccgtagtt ttagagccat ataataatat tataatggct 481 tctgtttgta catataccat ttgtcaatta ccttacacac cctgtaagcc taataccaat 541 ggtaatcgtg ttattggatt ttggcacaca gatgtcaaac cgccgatttg tcttttaaag 601 cgtaatttta cgtttaatgt taatgcccct tggctttatt tccattttta tcagcagggt 661 ggtacttttt atgcgtacta tgcggataaa ccttccgcta ctacgttttt gtttagtgtg 721 tatattggcg acattttaac acagtatttt gtgttacctt ttatttgtac tccaacagct 781 ggtagcactt tagctccgct ctattgggtt acacctttac ttaagcgcca atatttgttt 841 aattttaatg aaaagggtgt cattactagt gctgttgatt gcgccagcag ctacattagt 901 gaaataaaat gtaagaccca aagtctctta ccgagtactg gtgtctatga tctatccggt 961 tacacggtcc aacctgttgg agttgtgtac cggcgtgttc ctaacctacc tgattgtaaa 1021 atagaggaat ggctcactgc taaatctgtg ccgtcacctc tcaattggga gcgtaggact 1081 ttccaaaatt gtaattttaa tttaagcagc ctgctacgtt atgtccaggc tgagtctttg 1141 tcgtgtaata atattgatgc gtccaaagtg tatggtatgt gctttggtag tgtctcagtt 1201 gataagtttg ctatcccccg aagccgtcaa attgatttac aaattggcaa ctccggattt 1261 ttgcaaacgg ctaattataa gattgatacc gctgccacat catgtcagct gtattacagt 1321 cttcctaaga ataatgttac cataaataac tataacccct cgtcttggaa taggaggtat 1381 ggttttaaag taaatgatcg ctgccaaatt tttgctaaca tattgttaaa tggcattaat 1441 agtgggacta cgtgttccac agatttacaa ttgcctaata ctgaagtggc cactggcgtt 1501 tgcgtcagat atgacctcta tggtattact ggtcaaggtg tttttaaaga ggtcaaggct 1561 gactattata atagctggca ggccctatta tatgatgtta atggtaactt aaacgggttc 1621 cgtgacctta ccactaacaa gacttatacg ataaggagct gttatagtgg ccgtgtttct 1681 gctgcatatc ataaagaagc acccgaaccg gctctgctct atcgtaatat aaattgtagt 1741 tatgttttta ctaataatat ttcccgtgag gaaaaccccc ttaactattt tgatagttat 1801 ttgggttgtg ttgttaatgc tgataaccgc acggatgagg cgcttcctaa ttgcaatctc 1861 cgtatgggtg ctggactatg cgtagattat tcaaagtcac gcagagcccg ccgatcagtt 1921 tctactggct atcgattaac cacattcgag ccatacatgc cgatgttagt caatgatagc 1981 gttcaatccg taggtggatt atatgagatg caaataccaa ccaattttac tattggtcat 2041 catgaggaat tcatccagat aagggctccc aaggtgacta tagattgtgc tgcatttgtt 2101 tgtggtgata acgctgcatg cagacagcag ttggttgagt atggctcttt ttgtgataat 2161 gttaatgcca ttcttaatga ggttaataac ctcttggata atatgcaatt acaagttgct 2221 agtgcattaa tgcagggtgt tactataagt tcgaggctgc cagatggcat ctccggccct 2281 atagatgaca ttaatttcag tcctctactt ggatgcatag gttcaacatg tgctgaagac 2341 ggcaatggac ctagtgcgat acgggggcgt tcagctatag aggatttatt atttgacaag 2401 gtcaaactat ctgacgttgg ctttgtcgag gcttataaca attgcactgg tggtcaagaa 2461 gttcgcgacc tcctttgcgt acagtctttt aatggcatca aagtattacc tcccgtgttg 2521 tctgagagtc aaatctctgg ctacacagcg ggtgctactg cggcagctat gttcccacct 2581 tggactgcag ctgctggtgt gccattcagt ttaaatgttc aatataggat taatggttta 2641 ggtgtcacta tgaatgttct tagtgagaac caaaagatga ttgctagtgc ttttaacaac 2701 gcgctcggtg ctattcagga agggttcgat gcaaccaatt ctgctctagg taagatccag 2761 tccgttgtta atgcaaacgc tgaagcactt aataatttat taaaccaact ttctaatagg 2821 tttggtgcta ttagtgcttc tttacaagaa attctaacgc ggcttgacgc tgtagaagca 2881 aaggcccaga tagatcgtct tattaatggc aggttaactg cacttaatgc gtatatatcc 2941 aagcaactca gtgatagtac gcttattaaa tttagtgctg ctcaggccat cgaaaaggtc 3001 aatgagtgcg ttaagagcca aactacgcgc attaatttct gtggcaatgg taatcacata 3061 ttatcacttg tccagaatgc gccttatggc ttatgtttta ttcatttcag ctacgtgcca 3121 acatccttta aaacggcaaa tgtgagtcct ggactatgca tttctggtga tagaggattg 3181 gcacctaaag ctggatattt tgttcaagat aatggagagt ggaagttcac aggcagtaat 3241 tattactacc ctgaacccat tacagataaa aatagtgttg ccatgatcag ttgcgctgtg 3301 aattacacaa aagcgcctga agttttcttg aacaactcaa taccaaatct acccgacttt 3361 aaggaggagt tagataaatg gtttaagaat cagacgtcta ttgcgcctga tttatccctc 3421 gatttcgaga agttaaatgt tactttcctg gacctgactt atgagatgaa caggattcag 3481 gatgcaatta agaagttaaa tgagagctac atcaacctca aggaagttgg cacatatgaa 3541 atgtatgtga aatggccttg gtatgtttgg ttgctaattg gtttagctgg tgtagctgtt 3601 tgtgtgttat tattctttat atgttgctgc acaggttgcg gctcatgttg ttttagaaaa 3661 tgcggaagtt gttgtgatga gtatggagga caccaggaca gtattgtgat acataatatt 3721 tcagcccatg aggattgact atcacagcct ctcctggaaa gacagaaaat ctaaacaatt // LOCUS MUSIGLAZ 713 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse Ig active lambda-chain mRNA Vx-J2-C2-region, complete cds. ACCESSION M34598 M29013 J03562 KEYWORDS constant region; immunoglobulin; immunoglobulin light chain; joining exon; lambda-immunoglobulin; processed gene; variable region. SOURCE Mouse (strain Balb/c AnPt) liver hybridoma B6, cDNA to mRNA, clone Y31. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 713) AUTHORS Sanchez,P., Marche,P.N., Le Guern,C. and Cazenave,P.-A. TITLE Structure of a third murine immunoglobulin lambda light chain variable region that is expressed in laboratory mice JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9185-9188 (1987) STANDARD simple staff_entry REFERENCE 2 (bases 2 to 450) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review REFERENCE 3 (bases 266 to 429) AUTHORS Sanchez,P. and Cazenave,P.-A. TITLE A new variable region in mouse immunoglobulin lambda light chains JOURNAL J. Exp. Med. 166, 265-270 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 12 > 713 Ig lambda chain precursor V-x,J-2,C-2 region sigp 12 68 Ig lambda chain signal peptide matp 69 > 713 Ig lambda chain recomb 379 380 V-region end/J2-region start recomb 414 415 J2-region end/C2-region start BASE COUNT 175 a 190 c 159 g 189 t ORIGIN 1 gtacctgcat tatggcctgg actcctctct tcttcttctt tgttcttcat tgctcaggtt 61 ctttctccca acttgtgctc actcagtcat cttcagcctc tttctccctg ggagcctcag 121 caaaactcac gtgcaccttg agtagtcagc acagtacgta caccattgaa tggtatcagc 181 aacagccact caagcctcct aagtatgtga tggagcttaa gaaagatgga agccacagca 241 caggtgatgg gattcctgat cgcttctctg gatccagctc tggtgctgat cgctacctta 301 gcatttccaa catccagcct gaagatgaag caatatacat ctgtggtgtg ggtgatacaa 361 ttaaggaaca atttgtgtat gttttcggcg gtggaaccaa ggtcactgtc ctaggtcagc 421 ccaagtccac tcccactctc accgtgtttc caccttcctc tgaggagctc aaggaaaaca 481 aagccacact ggtgtgtctg atttccaact tttccccgag tggtgtgaca gtggcctgga 541 aggcaaatgg tacacctatc acccagggtg tggacacttc aaatcccacc aaagagggca 601 acaagttcat ggccagcagc ttcctacatt tgacatcgga ccagtggaga tctcacaaca 661 gttttacctg tcaagttaca catgaagggg acactgtgga gaagagtctg tct // LOCUS MUSIGLVD 681 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse Ig germline lambda-chain gene Vx-J2-C2-region, complete cds. ACCESSION M34597 KEYWORDS constant region; germline; immunoglobulin light chain; joining exon; lambda-immunoglobulin; variable region. SOURCE Mouse (strain Balb/c AnPt) liver DNA, clone 30X2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 681) AUTHORS Sanchez,P., Marche,P.N., Rueff-Juy,D. and Cazenave,P.-A. TITLE Mouse V-lambda-x gene sequence generates no junctonal diversity and is conserved in mammalian species JOURNAL J. Immunol. 144, 2816-2820 (1990) STANDARD simple staff_review FEATURES from to/span description pept 59 104 Ig lambda-chain precursor Vx-J2-C2 region, exon 1 209 531 Ig lambda-chain precursor Vx-J2-C2 region, exon 2 sigp 59 104 Ig lambda-chain signal peptide 209 219 Ig lambda-chain signal peptide matp 220 528 Ig lambda-chain IVS 105 208 Ig lambda-chain Vx-J2-C2 region intron A recomb 530 531 Vx-region end/J2-region start recomb 565 566 J2-region end/C2-region start BASE COUNT 179 a 156 c 136 g 210 t ORIGIN Chromosome 16. 1 tgaaccatag agagaactac aacctgctgt ctcagcagag atcagtagta cctgcattat 61 ggcctggact cctctcttct tcttctttgt tcttcattgc tcaggtcagg agaaccattt 121 gtaccctgaa cctcagttca tctgagaggc agatacattc tatatctgtc tgtaaatgtc 181 aggaaataaa cagtttctct attttcaggt tctttctccc aacttgtgct cactcagtca 241 tcttcagcct ctttctccct gggagcctca gcaaaactca cgtgcacctt gagtagtcag 301 cacagtacgt acaccattga atggtatcag caacagccac tcaagcctcc taagtatgtg 361 atggagctta agaaagatgg aagccacagc acaggtgatg ggattcctga tcgcttctct 421 ggatccagct ctggtgctga tcgctacctt agcatttcca acatccagcc tgaagatgaa 481 gcaatataca tctgtggtgt gggtgataca attaaggaac aatttgtgta accacagtaa 541 cggagataaa ggaggaagca ggacagaaac tttttttttt ctcttcaaag gtcttttcta 601 ccagaatcat tggttttttt ttttcttttt tgcttattaa taaagtagat agtctagcaa 661 tcctcttgga cttcgtaggg c // LOCUS PAERRE 1877 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.kadiakensis 18S rRNA. ACCESSION M34359 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.kadiakensis rRNA. ORGANISM Palaemonetes kadiakensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Caridea; Palaemonoidea; Palaemonidae. REFERENCE 1 (bases 1 to 1877) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1877 18S ribosomal RNA BASE COUNT 339 a 321 c 399 g 313 t 505 others ORIGIN 1 nncctggtng atcctgccag nagtcntnng cttgtctcaa annttaagcc angcatgtgt 61 cagtacaggc cgctctaagg cgaaaccgcg aatggctnnn taaatcagtt atcattcatt 121 tnatctaaaa cnnnnnnnnn nnnnggnnaa nnnnggnaan ncnanagcnn nanacgtgac 181 ttgtnaacnc cgacnggaag ggaggagngc ttntattagt tgaaaaccaa gcgggccncg 241 gtccgnnnnn nnnnctgtga tgactctgaa tnactttgtg cagagagcac ggnctnngca 301 ccggctccgt atctttcgag tttctgcctt atcatgctgt ggattgtagg ccatgcgcct 361 ncngtngctg ttncgggtga cggagaatca ggnntcgatt ccggagaggg agcctgagna 421 acggctacca catccaaggn nggcagcagg cacnnnnatt acccaatccc agctctggga 481 ggtagtgacn aaaaataaca atgcgggact cttccgagtc tgcgtaattg gaatgagcac 541 actttaaatc ctttagcaac naccnattgg agggcaagtc tggtgccagc agccgcggtn 601 attcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtncct tgcatggaac 841 tgatggaaga ctgatctcgg ttccacnttc ttggtggtgg gagccagagg taatgatcna 901 gagggnctgt cnnnnnnntc cgtactacga cgcgagaggt gaaattcagt gaccgtcgta 961 ggacgaacca cagcgaaagc atttgccnag aatgtcttcg ttgatcnaga angaaagtta 1021 gaggatcgaa ggcgatcaga tacnnnnnan gaaagaaccn taaacgatgc tgactngcaa 1081 ttcgcngnng ttnttcccat gacgtgcgag acgcccccgg gaaacctcaa gtctttgagt 1141 tccgggggaa gtatggttgc aaaactgaaa ctcaaaggaa ttgacggnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna acacgggaaa cctcaccagg cccggacacc 1261 agaagganng acagatnnag agctctttct cgatttggtg ggtnnnnntg catggcngtt 1321 cttagttggt ggagtgannn nnnnnnnnnc gatnacgaac gagannnnnn nnnnnnnnnn 1381 nnnnnnnnnn nnnnnnnnnn nnnnccccng ttcgannnng tcttcttnga gggatgagcn 1441 gcgagtntag ctgcaggaga ttgagcaata acangtctgt gatgccctta gatgtcctgg 1501 gcgcacgcgc gctacactga atgggttagc gggttgtcct tctccgagag gagcgggnna 1561 tcgcgtgaaa accattcgtg atngggattg gggcttgcaa ttgtttcccn atgaangagg 1621 aattcccagt aagcgcaagt catcagcttg cgntgattnn gtccctnccc nttgtacaca 1681 cngnnnntcg ctactaccga ttgaatgatt agtgaggctt cggactggcg gtcctggact 1741 gggtcggcgg gtcncnccca gcnntgggnt tccgccnnct cgcctggacg ggccggaaag 1801 atgtccaaac ttgatnnnnn nnnnnnnnnn naagtcgtaa caaggtnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnnnnn // LOCUS PBESVBRA 584 bp ds-DNA INV 03-AUG-1990 DEFINITION P.berghei telomeric repeat region subfragment alpha DNA. ACCESSION M34601 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 584) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 461 541 27 bp repeats BASE COUNT 205 a 41 c 85 g 251 t 2 others ORIGIN 1 tcgacaanta caacattatc tataaaagat gttttataca tctaacattt ttagtaatac 61 ataaaaaata cactatatat atgtgtataa taaattcata aattataaat atatataatc 121 atcacttttt taatttcaat aatttacatt tatgttaaaa ttataattta tattgatata 181 aatagttctc tatatattaa tttatttact ataaaggtat aataatatat taatcactat 241 taatttataa atttgatagt tttgaggtat aaataaatta tattttaaat agttaaatat 301 aatatataat aaatgtaatg tcatattttc tataatactt ataaacaatt cgtatataaa 361 attagcgtta ttgtactaat atatataata ttgtatcaat gactaaaact gaaatatgtt 421 aatttggttt agggtttatg gttcaggttt aggtttntgg tttagggttc aggtttatgg 481 ttcagggttt agggttcagg tttatggttc agggtttagg gttcaggttt atggttcagg 541 gtttagggtt tgtggtttag ggtttatggt ctatggttgt tcga // LOCUS PBESVBRB 593 bp ds-DNA INV 03-AUG-1990 DEFINITION P.berghei telomeric repeat region subfragment a DNA. ACCESSION M34602 KEYWORDS . SOURCE P.berghei DNA, clone pTel.1. ORGANISM Plasmodium berghei Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 593) AUTHORS Dore,E., Pace,T., Ponzi,M., Picci,L. and Frontali,C. TITLE Organization of subtelomeric repeats in Plasmodium berghei JOURNAL Mol. Cell. Biol. 10, 2423-2427 (1990) STANDARD simple staff_review FEATURES from to/span description rpt 484 510 27 bp repeat motif BASE COUNT 209 a 40 c 97 g 247 t ORIGIN 1 tcgacaatac aacattatct ataaaagatg ttttatacat ctaacatttt tagtaataca 61 taaaaaatac actatatata tgtgtataat aaattcataa attataaata tatataatac 121 tcactttttt aatttcaata atttacattt atgttaaaat tataatttat attgatataa 181 atagttctct atatattaat ttatttacta taaaggtata ataatatatt aatcactatt 241 aatttataaa tttgatagtt ttgaggtata aataaattat attttaaata gttaaaatat 301 aaatatataa ataaaatgta atgtcatatt tttctataat acttataaac aattcggtat 361 ataaaattag cgttattgta ctaatatata taatattgta tcaatgacta aaactgaaat 421 atgttaattt gggtttaggg gtttatggtt cagggtttag ggtttgtggt ttagggtttg 481 tggtttaggg ttcaggttta tggttcaggg tttagggttc agggttcagg tttagggttt 541 agggtttagg gttcagggtt cagggttcag ggtttagggt ttagggttta ggg // LOCUS PEURRE 1902 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.aztecus 18S rRNA. ACCESSION M34362 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.aztecus rRNA. ORGANISM Penaeus aztecus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Dendrobranchiata; Penaeoidea; Penaeidae. REFERENCE 1 (bases 1 to 1902) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1902 18S ribosomal RNA BASE COUNT 340 a 369 c 427 g 332 t 434 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa agattaagcc nngcatgtgt 61 aagtacaggc cgacnnaagg cgaaaccgcg gacggcnnnn taaatcagat ataactcatt 121 nnatctctgc tgaacnncnt nnnnnnttgg ataactgtgg taattctaga nnnnnacatg 181 cctttgtann ctccgaccgc gagggaggag ngcttttatt agaccaaaac cctcggcagc 241 nnnntcccgc aagggncnag cagcacacat cttggtgaat cagaataact tttgccgagg 301 cacgacccct ccgtaacnng ggntgggncg gcgccgcgtc ctgcaggcgt ctgccttatc 361 agctctcgat tgtaggttaa acgcctacaa tggctatnnn gggtnacggg gaatnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggcag 481 cangcgcnnn nattacccnc tcggcncggg gagnnagtga cnaaaaatac tgttngnnga 541 ccccgngncc tcgcnattgg aatgagtnca ctttaaatcc ttgtacgagg atcgagtgga 601 gggcaannnn nnnnnnagcn gccgcgnnna ttccagctcc actagcgtat attaaagttg 661 ttgcggttga aacgctcgta gtttgacttc tgctcggacg gcggncttnn cngctactgc 721 cgnnttccga gctgtgtccc cngccggcgc acatggggnt nnnntgcctt aannncgggn 781 gtcccctnnn nnnnnnnccg ttactttgaa aaaattagag ngcnnagagc aggcnngnnn 841 nnnnnnncag cccgaatggt cgtgcatgga atgatggaac aggacctcgg ntctattttg 901 tcggtttttc ggaacccgag gnnatgattn atagaagcag acgggggnnt tcgtactgcg 961 acgctagagg tgaaattctt agaccgtcgc atgacgacct nctgcgaaag catctgccna 1021 ggatgttttc attgatcaag aangaaagtt agaggttcga aggcgatcag atacngcncn 1081 ngttctaacc ttaaacgatg ctgactagcg atccgccgca gttattnnca tgacccggcg 1141 nnnagcttcc gggaaaccaa agtctttggg ttccggggga agtatggttg caaagctgaa 1201 actcaaagga attgacggnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nnnnnnnnnn nnnnngtgca tgggtcgngt tcttagttgg tggagtgatc nnnnnnnnnn 1381 nnnnncgatn acgaacgaga nnnnnnnnnn nnnnnnnnnn nnnnnggcgc cggnaacngg 1441 cgntcntcgc ngtcttcttc ttagagggat aagcggcagc naaaaatata ctagccgcac 1501 gagagtttga gccataacan gtctgtgatg cccttagatg ttctgggcgc acgcgcgcta 1561 caatggagag ttcagcgagc tngncccnct ccgagaggag cgggnncctg cgtgaaagct 1621 gtccttaaag gggattgggg cttgcaaatg ttcccnatga nnnnggaatt cccagtagcg 1681 caattcncca gattgcgcgg atttagtccc tacccnttgt acacaccgcc nntcgctact 1741 accgattgaa tggtctagtg agggnnccgg actngcgccc ntggagccct accctcngcg 1801 ncngcgccct cgggtcgacg gaaaggtgtc caagctgggt nnnnnnnnnn nnnnnnaagt 1861 cgtaacaagg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn // LOCUS POCRRE 1874 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.ascensionis 18S rRNA. ACCESSION M34358 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.ascensionis rRNA. ORGANISM Procaris ascensionis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Caridea; Procaridoidea; Procarididae. REFERENCE 1 (bases 1 to 1874) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1874 18S ribosomal RNA BASE COUNT 298 a 291 c 331 g 266 t 688 others ORIGIN 1 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcatgtct 61 aagcacaggc cgaactaagg ctaagccgcg aatggcnnnn taaatcagtt atggttcatt 121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnac 181 ccacgctccg accgcgaggg aggagngctt ttattagttg aaaaccaacc gggccncggt 241 ccgcnaaaga canctgtggt gaagctgaat aactttgtgc cgagcgcacn gncnnnncac 301 cggcgccgat tccttcgagt gtctcgctta tcaggcngtc gattgtaggt tatgtgccnn 361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 421 nnnnnnnnnn nnnnnnnnnn nnnnnnnngg cagcaggcan nnnnattacc cactcccggc 481 ttggggaggt agtgacnaaa aataacgatg cgggactcat ccgaggccnc gcaattggaa 541 tgagtacact ttaantcctt taacgaggac ccannnnnnn nnnnnnnnnn nnnnnnnnnn 601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnagtt 781 taccttgaac aaatcagagt gctcagagca ggctaattna atggcccgct atgtttcctg 841 catggaatga tggaagatga cctcggttcc attttgtttg ttttcggaac ccgaggnnat 901 gatgaataga gacggacggg ggcatccgnn ctgcgacgtg agaggtgaaa ttcttggaat 961 gtcgnnagac gaacgacagc gaaagcattt gccaagtatg tcttcgttaa tcaagaanga 1021 aagttagagg ttcgaaggcg atcagatacc gcccnngttc taaccataaa cgatgctgac 1081 cagcgatccg ccggcgttat tcccatgacg cggcggnnag ctactccggg aaaccaaagt 1141 cnntgagttc cgggggtann nnnnnnnnnn nnnnnaaact caaaggaatt gacggnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgca 1321 tnnnngttct tagttggtgg agcgatttgt ctggttaatt ccgataacga angagactnt 1381 ggcctactaa ctagtcgacg ggtctccagc nnttggtgcc cagttcgcaa catcttctta 1441 gagggataag cggcaattct agccgcacga gattgagcaa taacaagtct gtgatgccct 1501 tagatgtcct gggcncacgc gcgctacact gaagggggca gcgggnntcc nctccgagag 1561 gagcgggnaa ccncttgaaa acctntcatg atagggactg gggcntgtaa ttgnttccca 1621 tgaacgagga anncccagta agcgcaagtg nnnnnnntgc gctgattnng tcccnnccnn 1681 ttgtacacac cnnnnntcgc tactaccgat tgaatgattt agtgaggctt cggactggcg 1741 ctcctngaac gaccccatcc ganngggnnc ccnggnnctc ctcgagtcga cgganngatg 1801 tccaaacttg annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnn // LOCUS PRARRE 1869 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.leonensis 18S rRNA. ACCESSION M34363 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE P.leonensis rRNA. ORGANISM Procambarus leonensis Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Astacidea; Astacoidea; Cambaridae. REFERENCE 1 (bases 1 to 1869) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1869 18S ribosomal RNA BASE COUNT 409 a 412 c 459 g 419 t 170 others ORIGIN 1 nncctggttg atcctgccag nagtcatnng cttgtctcaa anattaagcc nngcatgtgt 61 aagtacaagc cgagttaagg cgaaaccgcg aatggcncnn taaatcagct atgtttcatt 121 ggatctgtaa acnnncnnnn acttggataa ctgtggtaat tctagagctn atacatgcat 181 cacgtctctg accgcaaggg aagagcgctt ttattagttc aaaactggtc gggcctcggt 241 ccgttnaccc acccgtggtg aatctgaata actttttgct gagcgcacgg nctccgcacc 301 ggcgccgcat ccttcaagtg tctgccttat cagctttcga ttgtaggtta tgcgcctaca 361 atggctataa cgggtaacgg ggaatcaggn ttcnattccg gagagggagc ctgagaaacg 421 gctaccacat ctaaggcagg cagcaggcac gcnnattacc cactcccggc acggggaggt 481 agtgacnaaa aataacgatg cgagactcat ccgaggcctc gcaatcggaa tgagtacact 541 ttaaancctt taacgaggat ctattggagg gcnagtctgg tgccagcagc cgcggtaatt 601 ccagctccaa tanngtatat taaagttgtt gcggttnnaa agctcgtagt tggatctcag 661 ttccggactg acggtacacg cnnggtgctt actgtcacgc tccgaacagc taactagccc 721 cgccggccag tggggtgctc ttcatcgagt gtcccgagtg gccggnncgt ttactttgnn 781 nnnattagag tgctcagagc nggcnncnnn natggcctga atgtctatgc actggaataa 841 tggaatagga cctcggttct attttgttgg ttttcggaac ctgaggtaat gactaatagg 901 aacaggcggg ggcattcgta ttgcgacgct agaggtgaaa ttcttggacc gtcgcnagac 961 gaactactgc gaaagcattt gccaaggatg ttttcattaa tcaagaanga aagttagagg 1021 ttcgaaggcg atcagatacc gcncnngttn naaccataaa cgatgccaac tagcgatccg 1081 ccggcgttat tcccatgacc cggcngncag cttccgggaa accaaagtct ttgggttccg 1141 ggggaagtat ggttgcaaag ctgaaactca aaggaattga cggnnnnnnn nnnnnnnnnn 1201 nnnnnnnnnn nnnnnnnnnn nnnnnaacac ggggaacctc accaggccca gacaccggaa 1261 ggatngacag attgagagct ctttctcgat tcggtgggtg gtngtgcatg gccgttctta 1321 gttggtggag cgatttgtct ggttaattcc gatnnnnnnn gagactctgg cctattaact 1381 agtcgacgga tctccagcnn ttggtgtcca gttcgcaact tcttcttaga gggattacgg 1441 caattctagc cgcacgagat tgagcaataa caggtctgtg atgcccttag atgttctggg 1501 cgcacgcgcg ctacactgaa gagatcaacg tgttctcccc ctccgagagg agcgggnaac 1561 ccgttcaatc cccttcatga tagggattgg ggcttgcaat tgtttcccat gaacgaggaa 1621 ttcccagtaa gtgcaagtca tcacgttgcg ctgattnngt ccctgcccnt tgtacacacn 1681 nnnnntcgct actaccgatt gaatgattta gtgaggcttc ggactggcgc tcttggatgt 1741 tctacccctc gcgtctcggc gcaaggnnnt ctcgcctcga gctgacggaa agatgtccaa 1801 acttgatnnn nnnnnnnnnn nnnaagtcgt aacaaggtnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnn // LOCUS PVIC1RPTA 711 bp ds-DNA INV 03-AUG-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28745 M25758 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK247) sporozoite DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 711) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 711 circumsporozoite protein (AA at 1) site 1 15 region 1 site 697 711 region 2 site 502 597 post repeat variable region site 598 696 post repeat constant region BASE COUNT 261 a 151 c 226 g 73 t ORIGIN 1 aagctgaaac aaccagaaga tggggcaggc aatcaaccag gagcaaatgg agcaggcaat 61 caaccaggag caaatggggc aggcaatcaa ccaggagcaa atggggcagg caatcaacca 121 ggagcaaatg gggctggcaa tcaaccagga gcaaatgggg ctggcaatca accaggagca 181 aatggggctg gcaatcaacc aggagcaaat ggggctggca atcaaccagg agcaaatgga 241 gcaggcaatc aaccaggagc aaatggggca ggcaatcaac caggagcaaa tggggctggc 301 aatcaaccag gagcaaatgg agcaggcaat caaccaggag caaatggggc tggcaatcaa 361 ccaggagcaa atggagcagg caatcaacca ggagcaaatg gggcgggcaa tcaaccagga 421 gcaaatgggg ccggcaatca accaggagca aatggggcag gcaatcaacc aggagcaaat 481 ggggctggca atcaaccagg agcaaatggg gcaggtaatc aaccaggagc aaatggtgca 541 ggtggacagg cagcaggagg aaatgctgca aacaaaaagg caggagacgc aggagcagga 601 cagggacaaa ataatgaagg tgcgaatgcc ccaaatgaaa agtctgtgaa agaataccta 661 gataaagtta gagctaccgt tggcaccgaa tggactccat gcagtgtaac c // LOCUS PVIC1RPTB 657 bp ds-DNA INV 03-AUG-1990 DEFINITION P.vivax circumsporozoite protein gene, partial cds. ACCESSION M28746 M25759 KEYWORDS circumsporozoite protein. SOURCE P.vivax sporozoite (isolate VK210) DNA. ORGANISM Plasmodium vivax Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 657) AUTHORS Rosenberg,R., Wirtz,R.A., Lanar,D.E., Sattabongkot,J., Hall,T., Waters,A.P. and Prasittisuk,C. TITLE Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax JOURNAL Science 245, 973-976 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Lanar, 25-JUN-1989. FEATURES from to/span description pept < 1 > 657 circumsporozoite protein (AA at 1) site 1 15 region 1 site 643 657 region 2 rpt 16 477 27 bp repeats site 478 544 3' post repeat variable region site 545 642 3' post repeat constant region BASE COUNT 229 a 135 c 224 g 69 t ORIGIN 1 aagctgaaac aaccagcagg tgatagagca gatggacagc cagcaggtga tagagcagat 61 ggacagccag caggtgatag agcagatgga caaccagcag gtgatagagc agctggacaa 121 ccagcaggtg atagagcaga tggacagcca gcaggcgata gagcagctgg acaaccagca 181 ggtgatagag cagatggaca gccagcagga gatagagcag ctggacagcc agcaggcgat 241 agagcagatg gacagccagc aggtgataga gcagctggac aaccagcagg tgatagagca 301 gctggacaac cagcaggtga tagagcagat ggacagccag caggcgatag agcagctgga 361 caaccagcag gtgatagagc agatggacaa ccagcaggag atagagcagc tggacagcca 421 gcaggagata gagcagctgg acagccagca ggagatagag cagctggaca gccagcagga 481 aatggtgcag gtggacaggc cgcaggagga aacgcaggag gaaacgcagg aggaaacgca 541 ggaggacagg gacaaaataa tgaaggtgcg aatgccccaa atgaaaagtc tgtgaaagaa 601 tacctagata aagttagagc taccgttggc accgaatgga ctccatgcag tgtaacc // LOCUS SHV2A 554 bp ss-RNA VRL 03-AUG-1990 DEFINITION Simian hepatitis A virus segment 2A-encoded protein mRNA, partial cds. ACCESSION M34085 KEYWORDS . SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 554) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL Unpublished (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. Author address: E.A.Brown 547 Burnett-Womack CB# 7030 Chapel Hill, NC 27599 FEATURES from to/span description pept < 1 > 554 segment 2A-encoded protein (AA at 1) BASE COUNT 184 a 71 c 138 g 161 t ORIGIN Segment 2A; map position 3108-3662. 1 agtcatattg aaaagtggaa accttataaa gagttaagat tggaggtagg taagcaaagg 61 ctaaagtatg ctcaggaaga gttgtcaaat gaagtgttgc ctcctcctcg taaaattaag 121 ggtgtgtttt cacaagcaaa aatctcattg ttttacacag aagatcatga aattatgaaa 181 ttttcctgga aaggaattac tgctgacact agagctttga ggagatttgg cttttcattg 241 gctgctggta ggagtgtgtg gacattggaa atggatgctg gagttttgac tggcaggctg 301 gtgagggtca atgatgaaaa atggacagaa atgaaagatg acaaaatagt ttctttggtg 361 gagaaattta ctagtaataa acactggtcc aaagttaatt ttcctcatgg aatgctagat 421 ttggaagaaa ttgctgcaaa tgcaaaagaa tttccaaata tgtcagaaac tgatttgtgt 481 ttcttgttgc attggctgaa ccccaaaaag ataaacttgg cagatagaat gttgggtctg 541 tcaggaatac agga // LOCUS SHVVP1CP 2373 bp ss-RNA VRL 03-AUG-1990 DEFINITION Simian hepatitis A virus capsid protein VP1 mRNA, partial cds. ACCESSION M34084 KEYWORDS capsid protein VP1. SOURCE Simian hepatitis A virus (strain PA21), cDNA to viral RNA, passed in cwll line BS-C-1. ORGANISM Simian hepatitis A virus Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Picornaviridae. REFERENCE 1 (bases 1 to 2373) AUTHORS Brown,E.A., Jansen,R.W. and Lemon,S.M. TITLE Characterization of a Simian hepatitis A virus (HAV): Antigenic and genetic comparison with human HAV JOURNAL J. Virol. 63, 4932-4937 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.A.Brown, 04-MAY-1990. FEATURES from to/span description pept < 1 > 2373 capsid protein VP1 (AA at 1) BASE COUNT 661 a 447 c 491 g 774 t ORIGIN Segment P1; map position 735-3107. 1 atgaatatgt ccaggcaagg tattttccag actgttggga gtggccttga ccacattctg 61 tctttggcag atgtggagga ggaacaaatg attcagtctg tggatcgtac cgcagttact 121 ggggcttcat atttcacttc tgtggatcaa tcttctgttc atacagctga agttggctca 181 caccaacctg aacctttgaa aacctctgtt gacaaaccag gctctaagag gacacaagga 241 gagaaatttt tccttgttca ttctgctgac tggttgacga cacatgcttt gtttcatgaa 301 gttgcaaaat tggatgtggt caaactgttg tacaatgagc aatttgctgt tcagggtctg 361 ttgaggtatc acacttatgc aagatttgga attgagatac aagttcagat caatcctaca 421 ccattccagc aaggtggttt gatatgtgcc atggtgccag gagatcagag ctatggatct 481 atagcttctt tgacagttta tcctcatggt ttgttgaatt gtaatatcaa caatgtggtc 541 agaattaagg ttccttttat ttatacaaga ggagcttatc actttaagga ccctcaatat 601 cccgtttggg agttgactat tagagtttgg tctgagctaa acattggaac tggtacctct 661 gcttacacat cactgaatgt gctggctaga tttactgatt tggaactcca tgggctaaca 721 cccctgtcta cacagatgat gagaaatgaa tttagagtca gtacaacaga aaatgtagtt 781 aatttgtcca attatgaaga tgctagagca aaaatgtctt ttgctcttga tcaggaagat 841 tggaaatctg atgcctctca agggggagga attaaaatta cacattttac aacctggaca 901 tcaattccta ctttggctgc tcagtttcca ttcaatgcct ctgattcagt tgggcaacag 961 atcaaggtta ttccagttga tccatatttc ttccaaatga ctaacacaaa tcctgaacaa 1021 aaatgtataa ctgcattggc ttcaatatgt caaatgttct gtttttggag aggagacttg 1081 gtttttgact tccaggtttt tcctacaaaa tatcactcag ggagattatt attttgtttt 1141 gttcctggaa atgaactgat tgatgtttcc cacataacat tgaaacaagc cactactgcc 1201 ccttgtgctg tgatggatat tactggagta cagtcaactt taagatttcg tgttccttgg 1261 atttcagata ctccttatag agttaataga tataccaaat cgtcacatca gaaaggagag 1321 tatactgcca taggaaagtt gattgtttat tgttacaaca gactgacttc tccctccaat 1381 gtggcttctc atgttagagt taatgtttat ctctcagcta ttaatttgga atgttttgct 1441 ccactctatc atgctatgga tgtcacaact caggttgggg atgattctgg aggcttctct 1501 accactgttt caacaaaaca gaatgttcca gaccctcaag ttggcattac aacagtgaag 1561 gatcttaaag gtagagcaaa ccaagggaaa atggatgttt cgggtatcca agctcctgta 1621 ggagctatca ctaccattga ggatccagtt ttggcaaaga aagtgcctga gaccttccca 1681 gaattgaagc ctggagagtc aagacatact tctgatcata tgtctattta caaatttatg 1741 ggcagatctc atttcttatg tacatttaca tttaattcta ataacaaaga gtacactttt 1801 cctatcactt tgtcatcaac ttctaatcct cctcatggat tgccttcaac tctgagatgg 1861 ttttttaacc tttttcagct ttataggggt cccttggatt tgacaataat tataactggg 1921 gctactgatg ttgatggaat ggcttggttt actcccgttg ggttagcagt agatacccca 1981 tgggttgaga aggagtctgc tctttctatt gattacaaga cagctcttgg tgctgttagg 2041 tttaatacta gaagaacagg aaacattcag attaggttgc cctggtactc ctatctttat 2101 gctgtctcag gggcactgga tgggcttgga gacaaaacag attcaacttt tggacttgtc 2161 tccattcaaa ttgcaaatta caatcactca gatgaatatt tgtcttttag ttgttacttg 2221 tctgtgactg aacagtctga gttttatttt cctagagcac ctttgaatac caatgctatg 2281 atgtcatcag aaacaatgat ggatagaatt gctcttggtg atcttgaatc ctcagttgat 2341 gatcctcgaa ctgaagagga tcgtaaattt gaa // LOCUS STNRRE 1885 bp ss-rRNA RNA 03-AUG-1990 DEFINITION S.hispidus 18S rRNA. ACCESSION M34361 KEYWORDS 18S ribosomal RNA; ribosomal RNA. SOURCE S.hispidus rRNA. ORGANISM Stenopus hispidus Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Malacostraca; Eucarida; Decapoda; Pleocyemata; Stenopodidea; Stenopodidae. REFERENCE 1 (bases 1 to 1885) AUTHORS Kim,W. and Abele,L.G. TITLE Molecular phylogeny of selected decapod crustaceans based on 18S rRNA nucleotide sequences JOURNAL J. Crust. Biol. 10, 1-13 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.G.Abele, 19-MAY-1990. FEATURES from to/span description rRNA 1 1885 18S ribosomal RNA BASE COUNT 327 a 336 c 430 g 368 t 424 others ORIGIN 1 nncctggtng atcctgccag nngtcntnng cttgtctcaa annttnagcc nngcatgtgt 61 gagtacaagc ccaaggaagg tgaaaccgcg aatggcnnnn taaatcagct atggtttact 121 ggacctgtac tncnntnnnn nnnnnnnnnn nnnnggtaat tctagagctn anncnngccn 181 cgagcncnga cgcgggagcg ggaagagcgc nnnannagta cnaaaaccng ngtctgtgta 241 tcggcttagg tcgttgcata gncnnnnnnn tgtggtgact ctgaataact tttggctgag 301 cgcatggtct ccgcacctgg cgccgcatct ttcaagtgtc tgccttatca gctgtcgatt 361 gtaggttatg cgcctnnnat ggcgatnnng ggtnacgggg aatcngggtt nnnttccgga 421 ganngngcct gagnnncggc tnccnnntnt nnnnnnnnnn nnnnggcngn aggcnnnnnn 481 attacccntt ccggcncggg gaggtagtga cnaaaaataa cgatgcgaga ctcatccgag 541 gcctcgcnat cggaatgaga acactttaaa tcctttntcg aggatcgatt ggagggcaag 601 tctngtgcca gcagccncgg tnattccagc tccaatagng tatattaaag ttgctgcggn 661 tnnaaagctc gtagttnnat ctcagttcgg acggccgncn tccnnngtgc nttttgcggc 721 ttgatccgaa cactnctgtt gtgggcgcgc agggggtgct cttgatcgag tgtgcnnnnn 781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnccctg 841 aatgactttg catggaataa tggaatagga cctcggttct attttgctgg ttttgtctgg 901 aacccgaggt aatgactaat agaaacnggc gggggnnttc gtactgcgac gctagaggtg 961 aaattcttgg accgtcgcna gacgaactna tgcgaaagca tctgccnagg atgttttcnt 1021 tnatcnagaa ngaaagttag aggttcgaag gcgatcagat acnnnnnnng ttctaaccgt 1081 aaacgatgct naccagcnat ccgcccgcgt tnttcccatg accgggcnnn nngcttcggg 1141 gaaaccaaag tctttgagtt ccgggggaag tatggttgca aannngaaac tcaaaggaat 1201 tgacggnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1321 nntgcatggt nnnnnnnnnn nnnnggtgga gcgattgctg gttaattccg atnacgaacg 1381 agactcggac ctactaacta gtcgacggat cttcgtccga cggtgtccag ttcgtnaagt 1441 cttcttagag ggataacggc aagtgtagcc gcaggagatc gagcaataac angtctgtga 1501 tgcccttaga tgttctgggc gcacgcgcgc tacactgaag tgttcaacgt gttgtcccng 1561 tccgagagga tcgggnnncc cgctgaaagc ntttcttgat ngggatgggg gcttgcaatt 1621 gttcccnntg aannnggaat tcccagtaag cgcaagtcaa tagcttgcgn tgatnnngtc 1681 cctncnnntt gtncncnccn nnnntcgcta ctaccgattg aatgatttag tgaggcttcg 1741 gactggcgcc ctgggtctga tgcangttgg ccttagtgcc ttgtgtatcg cctagggncg 1801 acggaaagat gtccaaactt gatnnnnnnn nnnnnnnnna agtcgtaaca aggtnnnnnn 1861 nnnnnnnnnn nnnnnnnnnn nnnnn // LOCUS SV4CG 5243 bp ds-DNA circular VRL 03-AUG-1990 DEFINITION Simian Virus 40 complete genome. ACCESSION J02400 J02402 J02406 J02407 J02408 J02409 J02410 V01380 J02403 J04139 M24914 KEYWORDS T-antigen; agnoprotein; alternate splicing; antigen; auxiliary region; circular; complete genome; core region; early leader protein; enhancer; large T-antigen; major structural protein VP1; minor structural protein VP2; minor structural protein VP3; origin of replication; palindrome; small T-antigen; tandem repeat; topoisomerase. SOURCE Simian Virus 40 (SV40) DNA, cDNA, mRNA and RNA from wild type strains 777 and 776. (When the material is not simply DNA, such is indicated on the reference line.). ORGANISM Rhesus macaque polyomavirus Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 2463 to 2640; mRNA) AUTHORS Dhar,R., Zain,S., Weissman,S.M., Pan,J. and Subramanian,K. TITLE Nucleotide sequences of RNA transcribed in infected cells and by Escherichia coli RNA polymerase from a segment of Simian Virus 40 DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 71, 371-375 (1974) STANDARD full staff_review REFERENCE 2 (bases 2623 to 2699; mRNA) AUTHORS Dhar,R., Weissman,S.M., Zain,B.S., Pan,J. and Lewis,A.M.Jr. TITLE The nucleotide sequence preceding an RNA polymerase initiation site on SV40 DNA. part 2. the sequence of the early strand transcript JOURNAL Nucleic Acids Res. 1, 595-613 (1974) STANDARD full staff_review REFERENCE 3 (bases 2623 to 2699; mRNA) AUTHORS Dhar,R., Subramanian,K., Zain,B.S., Pan,J. and Weissman,S.M. TITLE Nucleotide sequence about the 3' terminus of SV40 DNA transcripts and the region where DNA synthesis is initiated JOURNAL Cold Spring Harb. Symp. Quant. Biol. 39, 153-160 (1975) STANDARD full staff_review REFERENCE 4 (sites; cds start for VP1) AUTHORS van de Voorde,A., Contreras,R., Rogiers,R. and Fiers,W. TITLE The initiation region of the SV40 VP1 gene JOURNAL Cell 9, 117-120 (1976) STANDARD full staff_review REFERENCE 5 (bases 470 to 524) AUTHORS Ysebaert,M., Thys,F., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the restriction fragments Hind L and Hind M of SV40 DNA JOURNAL Nucleic Acids Res. 3, 3409-3421 (1976) STANDARD full staff_review REFERENCE 6 (bases 32 to 236; direct tandem repeats) AUTHORS Subramanian,K.N., Reddy,V.B. and Weissman,S.M. TITLE Occurrence of reiterated sequences in an untranslated region of Simian Virus 40 DNA determined by nucleotide sequence analysis JOURNAL Cell 10, 497-507 (1977) STANDARD full staff_review REFERENCE 7 (bases 4592 to 4661; mRNA and DNA) AUTHORS Thimmappaya,B. and Weissman,S.M. TITLE The early region of SV40 DNA may have more than one gene JOURNAL Cell 11, 837-843 (1977) STANDARD full staff_review REFERENCE 8 (bases 1046 to 1620; overlap between VP1, VP2 and VP3 genes) AUTHORS Contreras,R., Rogiers,R., van de Voorde,A. and Fiers,W. TITLE Overlapping of the VP2-VP3 gene and the VP1 gene in the SV40 genome JOURNAL Cell 12, 529-538 (1977) STANDARD full staff_review REFERENCE 9 (bases 1 to 164; 5092 to 5243; origin of replication) AUTHORS Subramanian,K.N., Dhar,R. and Weissman,S.M. TITLE Nucleotide sequence of a fragment of SV40 DNA that contains the origin of DNA replication and specifies the 5' ends of "early" and "late" viral RNA: III. construction of the total sequence of EcorII-G fragment of SV40 DNA JOURNAL J. Biol. Chem. 252, 355-367 (1977) STANDARD full staff_review REFERENCE 10 (bases 1 to 585; 4857 to 5243) AUTHORS Dhar,R., Subramanian,K.N., Pan,J. and Weissman,S.M. TITLE Structure of a large segment of the genome of Simian Virus 40 that does not encode known proteins JOURNAL Proc. Natl. Acad. Sci. U.S.A. 74, 827-831 (1977) STANDARD full staff_review REFERENCE 11 (bases 3733 to 4007) AUTHORS Volckaert,G., Contreras,R., Soeda,E., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of Simian Virus 40 Hind H restriction fragment JOURNAL J. Mol. Biol. 110, 467-510 (1977) STANDARD full staff_review REFERENCE 12 (bases 1427 to 2593; VP1 gene) AUTHORS Pan,J., Reddy,V.B., Thimmappaya,B. and Weissman,S.M. TITLE Nucleotide sequence of the gene for the major structural protein of SV40 virus JOURNAL Nucleic Acids Res. 4, 2539-2548 (1977) STANDARD full staff_review REFERENCE 13 (bases 323 to 565; mRNA) AUTHORS Celma,M.L., Dhar,R., Pan,J. and Weissman,S.M. TITLE Comparison of the nucleotide sequence of the messenger RNA for the major structural protein of SV40 with the DNA sequence encoding the amino acids of the protein JOURNAL Nucleic Acids Res. 4, 2549-2559 (1977) STANDARD full staff_review REFERENCE 14 (bases 347 to 472) AUTHORS van Heuverswyn,H., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the SV40 DNA restriction fragment Hind C-Hap 2 JOURNAL Nucleic Acids Res. 4, 1015-1024 (1977) STANDARD full staff_review REFERENCE 15 (bases 1709 to 1786; RNA and DNA) AUTHORS Contreras,R., Volckaert,G., Thys,F., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the restriction fragment Hind F-Eco RI2 of SV40 DNA JOURNAL Nucleic Acids Res. 4, 1001-1014 (1977) STANDARD full staff_review REFERENCE 16 (bases 1 to 163; 181 to 5243) AUTHORS Reddy,V.B., Thimmappaya,B., Dhar,R., Subramanian,K.N., Zain,S., Pan,J., Ghosh,P.K., Celma,M.L. and Weissman,S.M. TITLE The genome of Simian Virus 40 JOURNAL Science 200, 494-502 (1978) STANDARD full staff_review REFERENCE 17 (bases 1 to 163; 181 to 5243) AUTHORS Fiers,W., Contreras,R., Haegeman,G., Rogiers,R., van de Voorde,A., van Heuverswyn,H., van Herreweghe,J., Volckaert,G. and Ysebaert,M. TITLE Complete nucleotide sequence of SV40 DNA JOURNAL Nature 273, 113-120 (1978) STANDARD full staff_review REFERENCE 18 (bases 2633 to 2933; RNA and DNA) AUTHORS Thimmappaya,B., Zain,B.S., Dhar,R. and Weissman,S.M. TITLE Nucleotide sequence of DNA template for the 3' ends of SV40 mRNA: II. the sequence of the DNA fragment EcorII-F and a part of EcorII-H JOURNAL J. Biol. Chem. 253, 1613-1618 (1978) STANDARD full staff_review REFERENCE 19 (bases 325 to 526; 1463 to 1540; sites-late 16S mRNA and RNA splicing) AUTHORS Ghosh,P.K., Reddy,V.B., Swinscoe,J., Choudary,P.V., Lebowitz,P. and Weissman,S.M. TITLE The 5'-terminal leader sequence of late 16 S mRNA from cells infected with Simian Virus 40 JOURNAL J. Biol. Chem. 253, 3643-3647 (1978) STANDARD full staff_review REFERENCE 20 (bases 322 to 536; 1450 to 1541; sites-late 16S mRNA and RNA splicing) AUTHORS Haegeman,G. and Fiers,W. TITLE Evidence for 'splicing' of SV40 16S mRNA JOURNAL Nature 273, 70-73 (1978) STANDARD full staff_review REFERENCE 21 (bases 4568 to 5175; small t antigen gene) AUTHORS Volckaert,G., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian Virus 40 small-t gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 2160-2164 (1978) STANDARD full staff_review REFERENCE 22 (bases 5109 to 5243; 1 to 63; RNA) AUTHORS Woodworth-Gutai,M. and Nathans,D. TITLE Evolutionary variants of Simian Virus 40: nucleotide sequence of a conserved SV40 DNA segment containing the origin of viral DNA replication as an inverted repetition JOURNAL J. Mol. Biol. 126, 259-274 (1978) STANDARD full staff_review REFERENCE 23 (bases 1 to 74; 5171 to 5243; T antigen binding sites) AUTHORS Tjian,R. TITLE The binding site on SV40 DNA for a T antigen-related protein JOURNAL Cell 13, 165-179 (1978) STANDARD full staff_review REFERENCE 24 (sites; 5' and 3' termini of 19s late mRNAs) AUTHORS Ghosh,P.K., Reddy,V.B., Swinscoe,J., Lebowitz,P. and Weissman,S.M. TITLE Heterogeneity and 5'-terminal structures of the late RNAs of Simian Virus 40 JOURNAL J. Mol. Biol. 126, 813-846 (1978) STANDARD full staff_review REFERENCE 25 (bases 562 to 1620; VP2 and VP3 genes) AUTHORS Reddy,V.B., Dhar,R. and Weissman,S.M. TITLE Nucleotide sequence of the genes for the Simian Virus 40 proteins VP2 and VP3 JOURNAL J. Biol. Chem. 253, 621-630 (1978) STANDARD full staff_review REFERENCE 26 (bases 358 to 914; sites-5' termini of late mRNAs) AUTHORS Dhar,R., Reddy,V.B. and Weissman,S.M. TITLE Nucleotide sequence of the DNA encoding the 5'-terminal sequences of Simian Virus 40 late mRNA JOURNAL J. Biol. Chem. 253, 612-620 (1978) STANDARD full staff_review REFERENCE 27 (bases 2300 to 2668; sites-mRNA 3' termini) AUTHORS Zain,B.S., Thimmappaya,B., Dhar,R. and Weissman,S.M. TITLE Nucleotide sequences of DNA encoding the 3' ends of SV40 mRNA I. the sequence of the DNA fragment HindII,III-G JOURNAL J. Biol. Chem. 253, 1606-1612 (1978) STANDARD full staff_review REFERENCE 28 (bases 2669 to 2901) AUTHORS van Heuverswyn,H., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian-Virus-40 DNA region coding for the carboxyl-terminal part of the T antigen JOURNAL Eur. J. Biochem. 86, 335-344 (1978) STANDARD full staff_review REFERENCE 29 (bases 264 to 526; 1463 to 1516; 2582 to 2602; 2663 to 2676; sites-cDNA, 16S leader segments) AUTHORS Reddy,V.B., Ghosh,P.K., Lebowitz,P. and Weissman,S.M. TITLE Gaps and duplicated sequences in the leaders of SV40 16S RNA JOURNAL Nucleic Acids Res. 5, 4195-4213 (1978) STANDARD full staff_review REFERENCE 30 (sites; cap site for 16s and 19s late mRNAs) AUTHORS Haegeman,G. and Fiers,W. TITLE Localization of the 5' terminus of late SV40 mRNA JOURNAL Nucleic Acids Res. 5, 2359-2371 (1978) STANDARD full staff_review REFERENCE 31 (sites; mutations in region of origin of replication) AUTHORS Subramanian,K.N. and Shenk,T. TITLE Definition of the boundaries of the origin of DNA replication in Simian Virus 40 JOURNAL Nucleic Acids Res. 5, 3635-3642 (1978) STANDARD full staff_review REFERENCE 32 (sites; splice sites in late mRNAs) AUTHORS Lai,C.-J., Dhar,R. and Khoury,G. TITLE Mapping the spliced and unspliced late lytic SV40 RNAs JOURNAL Cell 14, 971-982 (1978) STANDARD full staff_review REFERENCE 33 (sites; early mRNAs) AUTHORS May,E., Kress,M. and May,P. TITLE Characterization of two SV40 early mRNAs and evidence for a nuclear "prespliced" RNA species JOURNAL Nucleic Acids Res. 5, 3083-3099 (1978) STANDARD full staff_review REFERENCE 34 (sites; early mRNAs and RNA splicing) AUTHORS Berk,A.J. and Sharp,P.A. TITLE Spliced early mRNAs of Simian Virus 40 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75, 1274-1278 (1978) STANDARD full staff_review REFERENCE 35 (review; bases 1 to 163; 181 to 5243) AUTHORS Lebowitz,P. and Weissman,S.M. TITLE Organization and transcription of the Simian Virus 40 genome JOURNAL Curr. Top. Microbiol. Immunol. 87, 43-172 (1979) STANDARD full staff_review REFERENCE 36 (bases 1 to 472; 5174 to 5243) AUTHORS van Heuverswyn,H. and Fiers,W. TITLE Nucleotide sequence of the Hind-C fragment of Simian Virus 40 DNA: comparison of the 5'-untranslated region of wild-type virus and of some deletion mutants JOURNAL Eur. J. Biochem. 100, 51-60 (1979) STANDARD full staff_review REFERENCE 37 (bases 1276 to 1539; deletion mutant analysis) AUTHORS Contreras,R., Cole,C., Berg,P. and Fiers,W. TITLE Nucleotide sequence analysis of two Simian Virus 40 mutants with deletions in the late region of the genome JOURNAL J. Virol. 29, 789-793 (1979) STANDARD full staff_review REFERENCE 38 (bases 325 to 562; 1403 to 1582; cDNA) AUTHORS Bina-Stein,M., Thoren,M., Salzman,N. and Thompson,J.A. TITLE Rapid sequence determination of late Simian Virus 40 16S mRNA leader by using inhibitors of reverse transcriptase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 731-735 (1979) STANDARD full staff_review REFERENCE 39 (bases 2691 to 4571; 4638 to 5163; sites-5' termini of early mRNAs) AUTHORS Reddy,V.B., Ghosh,P.K., Lebowitz,P., Piatak,M. and Weissman,S.M. TITLE Simian Virus 40 early mRNA's: I. genomic localization of 3' and 5' termini and two major splices in mRNA from transformed and lytically infected cells JOURNAL J. Virol. 30, 279-296 (1979) STANDARD full staff_review REFERENCE 40 (sites; splice sites for large T antigen mRNA) AUTHORS Mark,D.F. and Berg,P. TITLE A third splice site in SV40 early mRNA JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 55-62 (1980) STANDARD full staff_review REFERENCE 41 (bases 4963 to 5179; deletion mutant analysis) AUTHORS Pipas,J.M., Adler,S.P., Peden,K.W.C. and Nathans,D. TITLE Deletion mutants of SV40 that affect the structure of viral tumor antigens JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44, 285-291 (1980) STANDARD full staff_review REFERENCE 42 (bases 2667 to 2840; deletion mutant analysis) AUTHORS van Heuverswyn,H., Cole,C., Berg,P. and Fiers,W. TITLE Nucleotide sequence analysis of two Simian Virus 40 mutants with deletions in the region coding for the carboxyl terminus of the T antigen JOURNAL J. Virol. 30, 936-941 (1979) STANDARD full staff_review REFERENCE 43 (bases 263 to 532; deletion mutant analysis) AUTHORS Subramanian,K.N. TITLE Segments of Simian Virus 40 DNA spanning most of the leader sequence of the major late viral messenger RNA are dispensable JOURNAL Proc. Natl. Acad. Sci. U.S.A. 76, 2556-2560 (1979) STANDARD full staff_review REFERENCE 44 (bases 4548 to 4923; deletion mutant analysis) AUTHORS Volckaert,G., Feunteun,J., Crawford,L.V., Berg,P. and Fiers,W. TITLE Nucleotide sequence deletions within the coding region for small-t antigen of Simian Virus 40 JOURNAL J. Virol. 30, 674-682 (1979) STANDARD full staff_review REFERENCE 45 (bases 250 to 304; 345 to 399; deletion mutant analysis) AUTHORS Haegeman,G., van Heuverswyn,H., Gheysen,D. and Fiers,W. TITLE Heterogeneity of the 5' terminus of late mRNA induced by a viable Simian Virus 40 deletion mutant JOURNAL J. Virol. 31, 484-493 (1979) STANDARD full staff_review REFERENCE 46 (bases 4566 to 5225; cDNA) AUTHORS Thompson,J.A., Radonovich,M.F. and Salzman,N.P. TITLE Characterization of the 5'-terminal structure of Simian Virus 40 early mRNA's JOURNAL J. Virol. 31, 437-446 (1979) STANDARD full staff_review REFERENCE 47 (sites; deletion mutant analysis over bases 4553 to 5172) AUTHORS Thimmappaya,B. and Shenk,T. TITLE Nucleotide sequence analysis of viable deletion mutants lacking segments of the Simian Virus 40 genome coding for small t antigen JOURNAL J. Virol. 30, 668-673 (1979) STANDARD full staff_review REFERENCE 48 (sites; deletion mutant analysis of major late cap site) AUTHORS Haegeman,G., Iserentant,D., Gheysen,D. and Fiers,W. TITLE Characterization of the major altered leader sequence of late mRNA induced by SV40 deletion mutant dl-1811 JOURNAL Nucleic Acids Res. 7, 1799-1814 (1979) STANDARD full staff_review REFERENCE 49 (sites; splice sites in early mRNAs) AUTHORS Khoury,G., Gruss,P., Dhar,R. and Lai,C.-J. TITLE Processing and expression of early SV40 mRNA: a role for RNA conformation in splicing JOURNAL Cell 18, 85-92 (1979) STANDARD full staff_review REFERENCE 50 (review; bases 1 to 163; 181 to 5243) AUTHORS Buchman,A.R., Burnett,L. and Berg,P. TITLE Appendix A: The SV40 nucleotide sequence JOURNAL (in) Tooze,J. (Eds.); DNA Tumor Viruses - Second Edition Revised: 799-841; Cold Spring Harbor Laboratory, Cold Spring Harbor (1981). STANDARD full staff_review REFERENCE 51 (bases 2663 to 2921; deletion mutant analysis) AUTHORS Alwine,J.C. and Khoury,G. TITLE Simian Virus 40-associated small RNA: mapping on the Simian Virus 40 genome and characterization of its synthesis JOURNAL J. Virol. 36, 701-708 (1980) STANDARD full staff_review REFERENCE 52 (bases 4007 to 4574) AUTHORS Volckaert,G., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian Virus 40 HindII+III restriction fragment A: (second part of the T antigen gene) JOURNAL Eur. J. Biochem. 106, 169-177 (1980) STANDARD full staff_review REFERENCE 53 (bases 3476 to 3735; RNA and DNA) AUTHORS van Herreweghe,J., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian Virus 40 HindII+III restriction fragment I: (fourth part of the T antigen gene) JOURNAL Eur. J. Biochem. 106, 179-192 (1980) STANDARD full staff_review REFERENCE 54 (bases 3146 to 3481) AUTHORS van Herreweghe,J., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Hind-I-proximal part of Simian Virus 40 HindII+III restriction fragment B: (fifth part of the T antigen gene JOURNAL Eur. J. Biochem. 106, 193-198 (1980) STANDARD full staff_review REFERENCE 55 (bases 2669 to 3251) AUTHORS van Heuverswyn,H., van de Voorde,A., van Herreweghe,J., Volckaert,G., de Winne,P. and Fiers,W. TITLE Nucleotide sequence of Simian Virus 40 DNA: structure of the middle segment of the HindII+III restriction fragment B: (sixth part of the T antigen gene) and codon usage JOURNAL Eur. J. Biochem. 106, 199-209 (1980) STANDARD full staff_review REFERENCE 56 (bases 4572 to 4917) AUTHORS Seif,I., Khoury,G. and Dhar,R. TITLE A rapid enzymatic DNA sequencing technique: determination of sequence alterations in early Simian Virus 40 temperature sensitive and deletion mutants JOURNAL Nucleic Acids Res. 8, 2225-2240 (1980) STANDARD full staff_review REFERENCE 57 (bases 1 to 74; 5161 to 5243; deletion and point mutants over the replication origin) AUTHORS DiMaio,D. and Nathans,D. TITLE Cold-sensitive regulatory mutants of Simian Virus 40 JOURNAL J. Mol. Biol. 140, 129-142 (1980) STANDARD full staff_review REFERENCE 58 (bases 13 to 65; 5155 to 5227; deletion mutants over the early promoter) AUTHORS Benoist,C. and Chambon,P. TITLE Deletions covering the putative promoter region of early mRNAs of Simian Virus 40 do not abolish T-antigen expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 3865-3869 (1980) STANDARD full staff_review REFERENCE 59 (sites; T antigen binding sites) AUTHORS Shalloway,D., Kleinberger,T. and Livingston,D.M. TITLE Mapping of SV40 DNA replication origin region binding sites for the SV40 T antigen by protection against exonuclease III digestion JOURNAL Cell 20, 411-422 (1980) STANDARD full staff_review REFERENCE 60 (bases 1 to 33; 212 to 236; 5161 to 5243; origin-defective deletion mutants) AUTHORS Gluzman,Y., Sambrook,J.F. and Frisque,R.J. TITLE Expression of early genes of origin-defective mutants of Simian Virus 40 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77, 3898-3902 (1980) STANDARD full staff_review REFERENCE 61 (bases 5213 to 5236; early mRNA initiation sites) AUTHORS Haegeman,G. and Fiers,W. TITLE Characterization of the 5'-terminal cap structures of early Simian Virus 40 mRNA JOURNAL J. Virol. 35, 955-961 (1980) STANDARD full staff_review REFERENCE 62 (sites; early mRNA initiation sites and repression by T antigen) AUTHORS Hansen,U., Tenen,D.G., Livingston,D.M. and Sharp,P.A. TITLE T antigen repression of SV40 early transcription from two promoters JOURNAL Cell 27, 603-612 (1981) STANDARD full staff_review REFERENCE 63 (sites; poly-A at 2662; deletion mutants over 2533-2775) AUTHORS Fitzgerald,M. and Shenk,T. TITLE The sequence 5'-AAUAAA-3' forms part of the recognition site for polyadenylation of late SV40 mRNAs JOURNAL Cell 24, 251-260 (1981) STANDARD full staff_review REFERENCE 64 (bases 306 to 428; deletion mutant analysis) AUTHORS Woodworth-Gutai,M. TITLE Recombination in SV40-infected cells: viral DNA sequences at sites of circularization of transfecting linear DNA JOURNAL Virology 109, 353-365 (1981) STANDARD full staff_review REFERENCE 65 (bases 101 to 178; 251 to 280; early promoter analysis) AUTHORS Gruss,P., Dhar,R. and Khoury,G. TITLE Simian Virus 40 tandem repeated sequences as an element of the early promoter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 943-947 (1981) STANDARD full staff_review REFERENCE 66 (bases 110 to 178; 251 to 601; deletion mutant analysis) AUTHORS Barkan,A. and Mertz,J.E. TITLE DNA sequence analysis of Simian Virus 40 mutants with deletions mapping in the leader region of the late viral mRNA's: mutants with deletions similar in size and position exhibit varied phenotypes JOURNAL J. Virol. 37, 730-737 (1981) STANDARD full staff_review REFERENCE 67 (bases 1 to 54; 5148 to 5243; cDNAs over this region) AUTHORS Ghosh,P.K., Lebowitz,P., Frisque,R.J. and Gluzman,Y. TITLE Identification of a promoter component involved in positioning the 5' termini of Simian Virus 40 early mRNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 100-104 (1981) STANDARD full staff_review REFERENCE 68 (bases 1 to 173; 182 to 280; 5216 to 5243; early promoter analysis) AUTHORS Benoist,C. and Chambon,P. TITLE In vivo sequence requirements of the SV40 early promoter region JOURNAL Nature 290, 304-310 (1981) STANDARD full staff_review REFERENCE 69 (sites; promoter for early mRNAs) AUTHORS Mathis,D.J. and Chambon,P. TITLE The SV40 early region TATA box is required for accurate in vitro initiation of transcription JOURNAL Nature 290, 310-315 (1981) STANDARD full staff_review REFERENCE 70 (bases 1 to 177; 5171 to 5243; autoregulation by T antigen) AUTHORS Myers,R.M., Rio,D.C., Robbins,A.K. and Tjian,R. TITLE SV40 gene expression is modulated by the cooperative binding of T antigen to DNA JOURNAL Cell 25, 373-384 (1981) STANDARD full staff_review REFERENCE 71 (bases 1 to 102; 5148 to 5243; upstream shift of early mRNA initiation site) AUTHORS Ghosh,P.K. and Lebowitz,P. TITLE Simian Virus 40 early mRNA's contain multiple 5'termini upstream and downstream from a Hogness-Goldberg sequence; a shift in 5' termini during the lytic cycle is mediated by large T antigen JOURNAL J. Virol. 40, 224-240 (1981) STANDARD full staff_review REFERENCE 72 (bases 2644 to 2923; deletion mutant analysis) AUTHORS Feunteun,J., Carmichael,G., Nicolas,J.C. and Kress,M. TITLE Mutant carrying deletions in the two Simian Virus 40 early genes JOURNAL J. Virol. 40, 625-634 (1981) STANDARD full staff_review REFERENCE 73 (sites; cds for agnoprotein) AUTHORS Jackson,V. and Chalkley,R. TITLE Use of whole-cell fixation to visualize replicating and maturing Simian Virus 40: identification of new viral gene product JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78, 6081-6085 (1981) STANDARD full staff_review REFERENCE 74 (bases 1 to 32; 212 to 236; 5161 to 5243; T antigen binding to a deletion mutant) AUTHORS McKay,R. and DiMaio,D. TITLE Binding of an SV40 T antigen-related protein to the DNA of SV40 regulatory mutants JOURNAL Nature 289, 810-813 (1981) STANDARD full staff_review REFERENCE 75 (sites; transcriptional enhancement by the 72 bp repeats) AUTHORS Moreau,V., Hen,R., Wasylyk,B., Everett,R., Gaub,M.P. and Chambon,P. TITLE The SV40 72 base repair repeat has a striking effect on gene expression both in SV40 and other chimeric recombinants JOURNAL Nucleic Acids Res. 9, 6047-6068 (1981) STANDARD full staff_review REFERENCE 76 (bases 325 to 622; 1463 to 1511; cDNAs; late mRNA synthesis by deletion mutants) AUTHORS Piatak,M., Subramanian,K.N., Roy,P. and Weissman,S.M. TITLE Late messenger RNA production by viable Simian Virus 40 mutants with deletions in the leader region JOURNAL J. Mol. Biol. 153, 589-618 (1981) STANDARD full staff_review REFERENCE 77 (sites; agnoprotein coding sequence) AUTHORS Jay,G., Nomura,S., Anderson,C.W. and Khoury,G. TITLE Identification of the SV40 agnogene product: a DNA binding protein JOURNAL Nature 291, 346-349 (1981) STANDARD full staff_review REFERENCE 78 (sites; mapping of the origin of replication) AUTHORS Bergsma,D.J., Olive,D.M., Hartzell,S.W. and Subramanian,K.N. TITLE Territorial limits and functional anatomy of the Simian Virus 40 replication origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 381-385 (1982) STANDARD full staff_review REFERENCE 79 (sites; topoisomerase cleavage sites) AUTHORS Edwards,K.A., Halligan,B.D., Davis,J.L., Nivera,N.L. and Liu,L.F. TITLE Recognition sites of eukaryotic DNA topoisomerase I: DNA nucleotide sequencing analysis of topo I cleavage sites on SV40 DNA JOURNAL Nucleic Acids Res. 10, 2565-2576 (1982) STANDARD full staff_review REFERENCE 80 (bases 2619 to 2920; deletion mutant analysis) AUTHORS Polvino-Bodnar,M. and Cole,C.N. TITLE Construction and characterization of viable deletion mutants of Simian Virus 40 lacking sequences near the 3' end of the early region JOURNAL J. Virol. 43, 489-502 (1982) STANDARD full staff_review REFERENCE 81 (bases 255 to 2535; only joints of deletion mutants) AUTHORS Johnson,A.D., Barkan,A. and Mertz,J.E. TITLE Nucleotide sequence analysis of the recombinant joints in 16 naturally arising deletion mutants of Simian Virus 40 JOURNAL Virology 123, 464-469 (1982) STANDARD full staff_review REFERENCE 82 (bases 260 to 422; RNA) AUTHORS Hay,N., Skolnik-David,H. and Aloni,Y. TITLE Attenuation in the control of SV40 gene expression JOURNAL Cell 29, 183-193 (1982) STANDARD full staff_review REFERENCE 83 (sites; late mRNA initiation sites) AUTHORS Alwine,J.C. TITLE Evidence for Simian Virus 40 late transcriptional control: mixed infections of wild-type Simian Virus 40 and a late leader deletion mutant exhibit trans effects on late viral RNA synthesis JOURNAL J. Virol. 42, 798-803 (1982) STANDARD full staff_review REFERENCE 84 (bases 284 to 326; late promoter analysis) AUTHORS Brady,J., Radonovich,M., Vodkin,M., Natarajan,V., Thoren,M., Das,G., Janik,J. and Salzman,N.P. TITLE Site-specific base substitution and deletion mutations that enhance or suppress transcription of the SV40 major late RNA JOURNAL Cell 31, 625-633 (1982) STANDARD full staff_review REFERENCE 85 (bases 1 to 16; 5227 to 5243; mutant analysis over origin of replication) AUTHORS Bergsma,D.J., Olive,D.M., Hartzell,S.W., Byrne,B.J. and Subramanian,K.N. TITLE Cyclization of linear chimeric plasmids in vivo by a novel end-to-end joining reaction or by intramolecular recombination: one of the products contains a 147-bp perfect palindrome stable in Escherichia coli JOURNAL Gene 20, 157-167 (1982) STANDARD full staff_review REFERENCE 86 (bases 5164 to 5243; 1 to 48; T antigen binding to a deletion mutant) AUTHORS Tenen,D.G., Haines,L.L. and Livingston,D.M. TITLE Binding of an analog of the Simian Virus 40 T antigen to wild-type and mutant viral replication origins JOURNAL J. Mol. Biol. 157, 473-492 (1982) STANDARD full staff_review REFERENCE 87 (sites; T antigen binding sites) AUTHORS DiMaio,D. and Nathans,D. TITLE Regulatory mutants of Simian Virus 40: effect of mutations at a T antigen binding site on DNA replication and expression of viral genes JOURNAL J. Mol. Biol. 156, 531-548 (1982) STANDARD full staff_review REFERENCE 88 (sites; mapping of early region promoter) AUTHORS Fromm,M. and Berg,P. TITLE Deletion mapping of DNA regions required for SV40 early promoter function in vivo JOURNAL J. Mol. Appl. Genet. 1, 457-481 (1982) STANDARD full staff_review REFERENCE 89 (sites; SV40 enhancement of beta-globin gene expression) AUTHORS Banerji,J., Rusconi,S. and Schaffner,W. TITLE Expression of a beta-globin gene is enhanced by remote SV40 DNA sequences JOURNAL Cell 27, 299-308 (1981) STANDARD full staff_review REFERENCE 90 (bases 1 to 129; 5228 to 5243; mapping of the early promoter) AUTHORS Byrne,B.J., Davis,M.S., Yamaguchi,J., Bergsma,D.J. and Subramanian,K.N. TITLE Definition of the Simian Virus 40 early promoter region and demonstration of a host range bias in the enhancement effect of the Simian Virus 40 72-base-pair repeat JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 721-725 (1983) STANDARD full staff_review REFERENCE 91 (sites; agnogene mutants) AUTHORS Mertz,J.E., Murphy,A. and Barkan,A. TITLE Mutants deleted in the agnogene of Simian Virus 40 define a new complementation group JOURNAL J. Virol. 45, 36-46 (1983) STANDARD full staff_review REFERENCE 92 (bases 335 to 394; 1706 to 1753; mutant analysis) AUTHORS Margolskee,R.F. and Nathans,D. TITLE Suppression of a VP1 mutant of Simian Virus 40 by missense mutations in serine codons of the viral agnogene JOURNAL J. Virol. 48, 405-409 (1983) STANDARD full staff_review REFERENCE 93 (bases 3512 to 3565; 4154 to 4311; mutant analysis over these regions) AUTHORS Pipas,J.M., Peden,K.W.C. and Nathans,D. TITLE Mutational analysis of Simian Virus 40 T antigen: isolation and characterization of mutants with deletions in the T-antigen gene JOURNAL Mol. Cell. Biol. 3, 203-213 (1983) STANDARD full staff_review REFERENCE 94 (bases 1709 to 1786; topoisomerase II) AUTHORS Liu,L.F., Rowe,T.C., Yang,L., Tewey,K.M. and Chen,G.L. TITLE Cleavage of DNA by mammalian DNA topoisomerase II JOURNAL J. Biol. Chem. 258, 15365-15370 (1983) STANDARD full staff_review REFERENCE 95 (bases 2773 to 4406; only joints of deletion mutants) AUTHORS Tornow,J. and Cole,C.N. TITLE Intracistronic complementation in the Simian Virus 40 A gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 6312-6316 (1983) STANDARD full staff_review REFERENCE 96 (bases 21 to 118; transcription factor binding to early promoter) AUTHORS Dynan,W.S. and Tjian,R. TITLE The promoter-specific transcription factor Sp1 binds to upstream sequences in the SV40 early promoter JOURNAL Cell 35, 79-87 (1983) STANDARD full staff_review REFERENCE 97 (sites; 7S-K RNA of transformed cells) AUTHORS Sohn,U., Szyszko,J., Coombs,D. and Krause,M. TITLE 7S-K nuclear RNA from Simian Virus 40-transformed cells has sequence homology to the viral early promoter JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 7090-7094 (1983) STANDARD full staff_review REFERENCE 98 (bases 1 to 49; 5181 to 5243; T antigen binding analysis) AUTHORS Tenen,D.G., Livingston,D.M., Wang,S.-S. and Martin,R.G. TITLE Effect of a stem-loop structure within the SV40 replication origin upon SV40 T antigen binding to origin region sequences JOURNAL Cell 34, 629-639 (1983) STANDARD full staff_review REFERENCE 99 (bases 253 to 482; cDNA; late mRNA initiation) AUTHORS Piatak,M., Ghosh,P.K., Norkin,L.C. and Weissman,S.M. TITLE Sequences locating the 5' ends of the major Simian Virus 40 late mRNA forms JOURNAL J. Virol. 48, 503-520 (1983) STANDARD full staff_review REFERENCE 100(bases 1 to 14; 5210 to 5243; autoregulation by T antigen) AUTHORS Rio,D.C. and Tjian,R. TITLE Sv40 T antigen binding site mutations that affect autoregulation JOURNAL Cell 32, 1227-1240 (1983) STANDARD full staff_review REFERENCE 101(bases 1 to 134; 5216 to 5243; early promoter analysis) AUTHORS Everett,R.D., Baty,D. and Chambon,P. TITLE The repeated GC-rich motifs upstream from the TATA box are important elements of the SV40 early promoter JOURNAL Nucleic Acids Res. 11, 2447-2464 (1983) STANDARD full staff_review REFERENCE 102(bases 5165 to 5243) AUTHORS Tenen,D.G., Taylor,T.S., Haines,L.L., Bradley,M.K., Martin,R.G. and Livingston,D.M. TITLE Binding of simian virus 40 large T antigen from virus-infected monkey cells to wild-type and mutant viral replication origins JOURNAL J. Mol. Biol. 168, 791-808 (1983) STANDARD full staff_review REFERENCE 103(bases 1 to 26; 5170 to 5243; T antigen binding sites) AUTHORS Jones,K.A. and Tjian,R. TITLE Essential contact residues within SV40 large T antigen binding sites I and II identified by alkylation-interference JOURNAL Cell 36, 155-162 (1984) STANDARD full staff_review REFERENCE 104(sites; sites-5' termini of late mRNAs) AUTHORS Lycan,D.E. and Danna,K.J. TITLE S1 mapping of purified nascent transcripts of Simian Virus 40 JOURNAL Mol. Cell. Biol. 4, 625-633 (1984) STANDARD full staff_review REFERENCE 105(bases 1 to 33; 5213 to 5243; T antigen binding analysis) AUTHORS Lewton,B.A., Delucia,A.L. and Tegtmeyer,P. TITLE Binding of Simian Virus 40 A protein to DNA with deletions at the origin of replication JOURNAL J. Virol. 49, 9-13 (1984) STANDARD full staff_review REFERENCE 106(sites; T antigen binding sites) AUTHORS Pomerantz,B.J. and Hassell,J.A. TITLE Polyomavirus and Simian Virus 40 large T antigens bind to common DNA sequences JOURNAL J. Virol. 49, 925-937 (1984) STANDARD full staff_review REFERENCE 107(bases 1 to 118; 5186 to 5243; primase initiation sites) AUTHORS Tseng,B.Y. and Ahlem,C.N. TITLE Mouse primase initiation sites in the origin region of Simian Virus 40 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 2342-2346 (1984) STANDARD full staff_review REFERENCE 108(bases 1 to 294; 297 to 379; transcriptional enhancer analysis) AUTHORS Weber,F., de Villiers,J. and Schaffner,W. TITLE An SV40 'enhancer trap' incorporates exogenous enhancers or generates enhancers from its own sequences JOURNAL Cell 36, 983-992 (1984) STANDARD full staff_review REFERENCE 109(bases 1 to 33; 5213 to 5243; mutant analysis at the replication origin) AUTHORS Cohen,G.L., Wright,P.J., Delucia,A.L., Lewton,B.A., Anderson,M.E. and Tegtmeyer,P. TITLE Critical spatial requirement within the origin of Simian Virus 40 DNA replication JOURNAL J. Virol. 51, 91-96 (1984) STANDARD full staff_review REFERENCE 110(sites; mRNA polyadenylation signal sequences) AUTHORS Sadofsky,M. and Alwine,J.C. TITLE Sequences on the 3' side of hexanucleotide AAUAAA affect efficiency of cleavage at the polyadenyation site JOURNAL Mol. Cell. Biol. 4, 1460-1468 (1984) STANDARD full staff_review REFERENCE 111(bases 1582 to 1635; 2524 to 2652; chromatin structure at origin of replication; mutational analysis) AUTHORS Innis,J.W. and Scott,W.A. TITLE DNA replication and chromatin structure of Simian Virus 40 insertion mutants JOURNAL Mol. Cell. Biol. 4, 1499-1507 (1984) STANDARD full staff_review REFERENCE 112(sites; bidirectional promoter element) AUTHORS Hartzell,S.W., Byrne,B.J. and Subramanian,K.N. TITLE Mapping of the late promoter of Simian virus 40 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 23-27 (1984) STANDARD full staff_review REFERENCE 113(sites; late promoter sequence requirements in the presence of T antigen) AUTHORS Hartzell,S.W., Byrne,B.J. and Subramanian,K.N. TITLE The simian virus 40 minimal origin and the 72-base-pair repeat are required simultaneously for efficient induction of late gene expression with large tumor antigen JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 6335-6339 (1984) STANDARD full staff_review REFERENCE 114(bases 304 to 528) AUTHORS Hay,N., Kessler,M. and Aloni,Y. TITLE SV40 deletion mutant (d1861) with agnoprotein shortened by four amino acids JOURNAL Virology 137, 160-170 (1984) STANDARD full staff_review REFERENCE 115(bases 1 to 366; 5230 to 5243) AUTHORS Swimmer,C. and Shenk,T. TITLE A viable simian virus 40 variant that carries a newly generated sequence reiteration in place of the normal duplicated enhancer element JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 6652-6656 (1984) STANDARD full staff_review REFERENCE 116(sites; Okazaki fragment sequences) AUTHORS Hay,R.T., Hendrickson,E.A. and DePamphilis,M.L. TITLE Sequence specificity for the initiation of RNA-primed simian virus 40 DNA synthesis in vivo JOURNAL J. Mol. Biol. 175, 131-157 (1984) STANDARD full staff_review REFERENCE 117(sites; tsA3900 point mutant sequence) AUTHORS Hutchinson,N.I., Chang,L.-S., Pater,M.M., Bouck,N., Shenk,T.E. and Di Mayorca,G. TITLE Characterization of a new simian virus 40 mutant, tsA3900, isolated from deletion mutant tsA1499 JOURNAL J. Virol. 53, 814-821 (1985) STANDARD full staff_review REFERENCE 118(sites; large t-antigen binding specificity) AUTHORS Scheller,A. and Prives,C. TITLE Simian virus 40 and polyomavirus large tumor antigens have different requirements for high-affinity sequence-specific DNA binding JOURNAL J. Virol. 54, 532-545 (1985) STANDARD full staff_review REFERENCE 119(bases 1499 to 2590; temperature-sensitive mutants) AUTHORS Ng,S.-C., Behm,M. and Bina,M. TITLE DNA sequence alterations responsible for the synthesis of thermosensitive VP1 in temperature-sensitive BC mutants of simian virus 40 JOURNAL J. Virol. 54, 646-649 (1985) STANDARD full staff_review REFERENCE 120(sites; recombination in poly(GT)-sequences) AUTHORS Stringer,J.R. TITLE Recombination between poly[d(GT)-d(CA)] sequences in simian virus 40-infected cultured cells JOURNAL Mol. Cell. Biol. 5, 1247-1259 (1985) STANDARD full staff_review REFERENCE 121(sites; large T-antigen binding sites) AUTHORS Barrera-Saldana,H., Takahashi,K., Vigneron,M., Wildeman,A., Davidson,I. and Chambon,P. TITLE All six GC-motifs of the SV40 early upstream element contribute to promoter activity in vivo and in vitro JOURNAL EMBO J. 4, 3839-3849 (1985) STANDARD full staff_review REFERENCE 122(bases 4556 to 4657; 4907 to 4937) AUTHORS Noble,J.C., Prives,C. and Manley,J.L. TITLE In vitro splicing of simian virus 40 early pre mRNA JOURNAL Nucleic Acids Res. 14, 1219-1235 (1986) STANDARD full staff_review REFERENCE 123(sites; SV40 early leader protein (SELP) boundaries) AUTHORS Khalili,K., Brady,J. and Khoury,G. TITLE Translational regulation of SV40 early mRNA defines a new viral protein JOURNAL Cell 48, 639-645 (1987) STANDARD full staff_review REFERENCE 124(bases 1494 to 1708) AUTHORS Rogiers,R., van de Voorde,A., Soeda,E. and Fiers,W. TITLE Nucleotide sequence of the Simian virus 40 Hind-K restriction fragment JOURNAL Eur. J. Biochem. 85, 205-224 (1978) STANDARD full staff_entry REFERENCE 125(bases 662 to 1051) AUTHORS Ysebaert,M., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian virus 40 HindII+III restriction fragment D and the total amino acid sequence of the late proteins VP2 and VP3 JOURNAL Eur. J. Biochem. 91, 431-439 (1978) STANDARD full staff_entry REFERENCE 126(bases 2060 to 2299) AUTHORS van Heuverswyn,H., van de Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the Simian virus 40 HindII+III restriction fragment J and the total amino acid sequence of the major structural protein VP1 JOURNAL Eur. J. Biochem. 91, 415-430 (1978) STANDARD full staff_entry REFERENCE 127(bases 1 to 107; 5146 to 5243) AUTHORS Hay,R.T. and DePamphilis,M.L. TITLE Initiation of SV40 DNA replication in vivo: Location and structure of 5' ends of DNA synthesized in the ori region JOURNAL Cell 28, 767-779 (1982) STANDARD full staff_entry REFERENCE 128(bases 80 to 350) AUTHORS Contreras,R., Gheysen,D., Knowland,J., van de Voorde,A. and Fiers,W. TITLE Evidence for the direct involvement of DNA replication origin in synthesis of late SV40 RNA JOURNAL Nature 300, 500-505 (1982) STANDARD full staff_entry REFERENCE 129(sites; early SV40 transcription) AUTHORS Lebowitz,P. and Ghosh,P. TITLE Initiation and regulation of Simian virus 40 early transcription in vitro JOURNAL J. Virol. 41, 449-461 (1982) STANDARD full staff_entry REFERENCE 130(sites; temperature sensitive mutants in the VP1 gene) AUTHORS Behm,M., Lowman,H., Ng,S.-C. and Bina,M. TITLE Analysis of temperature-sensitive mutations in the simian virus 40 gene encoding virion protein 1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9421-9425 (1988) STANDARD full staff_review REFERENCE 131(sites; ends of "early" and "late" mRNA) AUTHORS Subramanian,K.N., Ghosh,P.K., Dhar,R., Thimmappaya,B., Zain,S.B., Pan,J. and Weissman,S.M. TITLE The primary structure of regions of SV40 DNA encoding the ends of mRNA JOURNAL Prog. Nucleic Acid Res. Mol. Biol. 19, 157-164 (1976) STANDARD simple staff_review REFERENCE 132(bases 1 to 164; 5092 to 5243) AUTHORS Weissman,S.M., Subramanian,K.N., Dhar,R., Zain,B. and Pan,J. TITLE SV40 DNA sequences as an example of the structure of genes functioning in animal cell nuclei JOURNAL Med. Pediatr. Oncol. 2, 259-263 (1976) STANDARD full staff_entry REFERENCE 133(bases 1787 to 2059) AUTHORS Contreras,R., Van De Voorde,A. and Fiers,W. TITLE Nucleotide sequence of the restriction fragment Hind-F-EcoRI-1 of simian-virus-40 DNA (part of the VP1 gene) JOURNAL Eur. J. Biochem. 86, 317-324 (1978) STANDARD simple staff_review COMMENT Kindly reviewed by K. N. Subramanian. Draft entry and computer- readable sequence for [130] kindly submitted by M.Bina, 16-FEB-1989. The E or minus strand of SV40 strain 776 is shown as it is reported by [16] and [17] using the numbering system by [50], with exception of of the addition of 17 bases (between nucleotides 164 and 165) to their sequence [36]. Differences between the complete sequences as published by [16] and [17] and this sequence are limited to the 17 bp change, the revision at 328 below, and the numbering. All references tend to agree upon an independent coordinate system, where map units run from 0.00 to 1.00 and the origin is defined by the sole EcoRI site, which is found at base 1782 below. In keeping with [50], map units for this entry have been calculated by SV + 3461 mu = ----------- 5243 If the result is > 1, subtract 1 for the value. The origin of this sequence, then, is at 0.66. Identical units are used for Polyoma virus. DNA replication is bidirectional from an origin which has been narrowed by mutational analysis to bases 5193 to 34 [31]. Given that early mRNA transcription also arises from this region, the two processes will be, to some extent, simultaneously regulated. In addition to the 21-bp and 72-bp repeats concerned with early transcription (see below), the origin includes a 17-bp A + T-rich sequence; further demarcation of a "core" region and an "auxiliary" region makes the former comprise a 17-bp palindrome (5193-5209), a 15-bp palindrome (5213-5227), a 27-bp palindrome (5230-13) and the A + T-rich region (15-31), and the latter the three 21-bp repeats [85]. Thus the "core" overlaps the T-antigen binding sites I and II and the "auxiliary region" (which is said to enhance replication efficiency) overlaps the T-antigen binding site III [85]. Transcription of early mRNAs (18s and 19s) is leftward off the E, or minus, strand and transcription of late mRNAs (16s and 19s) is rightward off the L, or plus, strand. In the former case, the annotation will show the symbols "(c)" and "comp strand" for the transcripts and products, and for the regulatory elements which govern them. In some references, the E strand has been labeled "Late" to indicate that it has the polarity of late mRNA, and the L strand "Early" to indicate that it has the polarity of early mRNA. (Early mRNAs can be produced throughout the lytic cycle.) The characterization of the mRNAs as 18s and 19s early and 16s and 19s late is a simplification: heterogeneity with regard to 5' and 3' termini, splice sites, and abundance as a function of time in the lytic cycle and regulatory events, have all been observed [24] [35],[67],[71]. For the purposes of this entry, only the simplest major mRNA species have been entered in the Sites. The promoter for the early mRNAs is considered to include at least bases 5185 to 250 on the E strand: the T antigen binding sites (for which the minimal spans are given [62]), the "TATA" box at position 21, the 21-bp repeats, and the 72-bp repeats are all known to play a role in early transcription. One of the effects of regulation at these sites is to shift the cap site to one of at least seven positions between bases 95 and 32 [71]. Bases 55 to 97 are also of interest because they permit base-pairing homology with the small nuclear RNA 7S-K [97] and because they bind the cell factor SP-1 [96]. For late promoter induction to occur at high efficiency two domains are required simultaneously: the minimal replication origin and the 72 bp repeat [113]. The origin-proximal 22 bp portion of the 72-bp repeat is sufficient for induction, while the origin-distal portion is dispensable [113]. If T antigen is present, the 21 bp repeat is dispensable for induction of late promoter activity in vivo [113]. The regulatory elements for early transcription have been annotated for the "comp strand". The late region induction signals are to be found in the same region as the early regulatory elements [113]. The efficacy of the 72-bp repeat in promoting transcription has been demonstrated for both orientations [75]. The SAS-RNA is a small viral associated RNA whose function is unknown. It appears to arise through processing from some of the longer early transcripts; mutants in the SAS-RNA coding region remain viable [51]. Seven proteins -- the small t antigen, the large T antigen, the three structural proteins, the early leader protein [123], and the agnoprotein (which is thought to be involved with structural development [92])-- are known to be translated from the multiplicity of mRNAs. The existence of a middle T antigen, known to exist in Polyoma, has been suggested [40]. For a summary of other possible coding sequences, see [50]. Missing data project [124-129], [131]. EMBL features not translated to GenBank features: key from to description PRM 21 15 (C) TATA-box [3] MSG 120 2674 major 16S RNA FEATURES from to/span description pept 16 1 (c) early leader protein (SELP; [123]) 5243 5188 (c) early leader protein (SELP; [123]) pept 335 523 agnoprotein (lp-1, VCP) pept 562 1620 VP2 minor structural protein pept 916 1620 VP3 minor structural protein pept 1499 2593 VP1 major structural protein pept 5163 4918 (c) large T antigen exon 1 4571 2691 (c) large T antigen exon 2 pept 5163 4639 (c) small t antigen RNA 2907 2842 (c) SV40-associated small (SAS) RNA; 0.21 [51] pre-msg 264 2676 minor late 19s mRNA [29],[62],[71] pre-msg 325 2676 major late 19s mRNA [10],[19],[29],[30],[38] pre-msg 2587 31 (c) early mRNA (alt.) [3],[34],[39],[62],[71] pre-msg 2599 31 (c) early mRNA (alt.) [28],[62],[71] pre-msg 5225 2587 (c) early mRNA (alt.) [3],[34],[39],[46],[61] pre-msg 5230 2587 (c) early mRNA (alt.) [3],[34],[39],[68] pre-msg 5236 2587 (c) early mRNA (alt.) [3],[6],[34],[39],[61] pre-msg 5225 2599 (c) early mRNA (alt.) [28],[46],[61] pre-msg 5230 2599 (c) early mRNA (alt.) [28],[68] pre-msg 5236 2599 (c) early mRNA (alt.) [6],[28],[39],[61] IVS 295 434 late 19s intron [24],[29] IVS 527 1462 late 16s intron [19],[29],[32] IVS 4636 4572 (c) early 19s mRNA (small t antigen) intron [24],[29] IVS 4917 4572 (c) large T antigen (18s) intron [24],[29] rpt 60 40 (c) 21 bp non-tandem repeat I [90],[96],[97] rpt 82 62 (c) 21 bp tandem repeat II [90],[96],[97] rpt 103 83 (c) 21 bp tandem repeat III [90],[96],[97] rpt 178 107 (c) 72 bp tandem repeat enhancer sequence A [65],[68],[75],[90] rpt 250 179 (c) 72 bp tandem repeat enhancer sequence B [65],[68],[75],[90] binding 5231 13 (c) T antigen binding site II [23],[62],[74] binding 61 36 (c) T antigen binding site III 3' [23],[62],[74] binding 5209 5184 (c) T antigen binding site I [23],[62],[74] signal 27 21 (c) early mRNA promoter element; 0.66 [58],[68],[69] orgrpl 5191 31 replication origin core region; 0.67 [85] orgrpl 32 83 replication origin auxiliary region; 0.67 [85] variant 164 182 gggactttccacacctggt in strain 776 [36]; gt in strain 777; 0.69 [16],[17] revision 328 331 gtgc in [36],[50]; gc in [16],[17] signal 2613 2608 (c) early mRNA polyadenyation signal on the comp strand; 0.16 [28],[110] signal 2642 2637 (c) early mRNA polyadenyation signal on the comp strand; 0.16 [28],[110] signal 2657 2662 late mRNA polyadenyation signal; 0.17 [28],[63],[110] mut 1626 1626 g in wild-type; a in temperature-sensitive mutant tsC260 mut 1667 1667 c in wild-type; a in temperature-sensitive mutant tsB228 mut 1680 1680 c in wild-type; g in temperature-sensitive mutant tsB218 mut 1718 1718 g in wild-type; a in temperature-sensitive mutant tsB[204,211,265] mut 1719 1719 c in wild-type; t in temperature-sensitive mutant tsB8 mut 1756 1756 a in wild-type; c in temperature-sensitive mutant tsB8 mut 1995 1995 g in wild-type; c in temperature-sensitive mutant tsB221 mut 2003 2003 g in wild-type; a in temperature-sensitive mutant tsB201 mut 2084 2084 c in wild-type; t in temperature-sensitive mutant tsBC223 mut 2091 2091 c in wild-type; c in temperature-sensitive mutant tsB4 mut 2141 2141 c in wild-type; t in temperature-sensitive mutant tsC219 mut 2237 2237 c in wild-type; a in temperature-sensitive mutant tsC240 mut 2262 2262 c in wild-type; t in temperature-sensitive mutant tsC260 mut 2354 2354 c in wild-type; t in temperature-sensitive mutant tsBC[208,214,216,217,248,274] mut 2367 2367 a in wild-type; c in temperature-sensitive mutant tsBC11 BASE COUNT 1518 a 1100 c 1039 g 1586 t ORIGIN BglI site on the E (minus) strand of the genome; mu 0.66 [50] 1 gcctcggcct ctgcataaat aaaaaaaatt agtcagccat ggggcggaga atgggcggaa 61 ctgggcggag ttaggggcgg gatgggcgga gttaggggcg ggactatggt tgctgactaa 121 ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt tccacacctg 181 gttgctgact aattgagatg catgctttgc atacttctgc ctgctgggga gcctggggac 241 tttccacacc ctaactgaca cacattccac agctggttct ttccgcctca gaaggtacct 301 aaccaagttc ctctttcaga ggttatttca ggccatggtg ctgcgccggc tgtcacgcca 361 ggcctccgtt aaggttcgta ggtcatggac tgaaagtaaa aaaacagctc aacgcctttt 421 tgtgtttgtt ttagagcttt tgctgcaatt ttgtgaaggg gaagatactg ttgacgggaa 481 acgcaaaaaa ccagaaaggt taactgaaaa accagaaagt taactggtaa gtttagtctt 541 tttgtctttt atttcaggtc catgggtgct gctttaacac tgttggggga cctaattgct 601 actgtgtctg aagctgctgc tgctactgga ttttcagtag ctgaaattgc tgctggagag 661 gccgctgctg caattgaagt gcaacttgca tctgttgcta ctgttgaagg cctaacaacc 721 tctgaggcaa ttgctgctat aggcctcact ccacaggcct atgctgtgat atctggggct 781 cctgctgcta tagctggatt tgcagcttta ctgcaaactg tgactggtgt gagcgctgtt 841 gctcaagtgg ggtatagatt ttttagtgac tgggatcaca aagtttctac tgttggttta 901 tatcaacaac caggaatggc tgtagatttg tataggccag atgattacta tgatatttta 961 tttcctggag tacaaacctt tgttcacagt gttcagtatc ttgaccccag acattggggt 1021 ccaacacttt ttaatgccat ttctcaagct ttttggcgtg taatacaaaa tgacattcct 1081 aggctcacct cacaggagct tgaaagaaga acccaaagat atttaaggga cagtttggca 1141 aggtttttag aggaaactac ttggacagta attaatgctc ctgttaattg gtataactct 1201 ttacaagatt actactctac tttgtctccc attaggccta caatggtgag acaagtagcc 1261 aacagggaag ggttgcaaat atcatttggg cacacctatg ataatattga tgaagcagac 1321 agtattcagc aagtaactga gaggtgggaa gctcaaagcc aaagtcctaa tgtgcagtca 1381 ggtgaattta ttgaaaaatt tgaggctcct ggtggtgcaa atcaaagaac tgctcctcag 1441 tggatgttgc ctttacttct aggcctgtac ggaagtgtta cttctgctct aaaagcttat 1501 gaagatggcc ccaacaaaaa gaaaaggaag ttgtccaggg gcagctccca aaaaaccaaa 1561 ggaaccagtg caagtgccaa agctcgtcat aaaaggagga atagaagttc taggagttaa 1621 aactggagta gacagcttca ctgaggtgga gtgcttttta aatcctcaaa tgggcaatcc 1681 tgatgaacat caaaaaggct taagtaaaag cttagcagct gaaaaacagt ttacagatga 1741 ctctccagac aaagaacaac tgccttgcta cagtgtggct agaattcctt tgcctaattt 1801 aaatgaggac ttaacctgtg gaaatatttt gatgtgggaa gctgttactg ttaaaactga 1861 ggttattggg gtaactgcta tgttaaactt gcattcaggg acacaaaaaa ctcatgaaaa 1921 tggtgctgga aaacccattc aagggtcaaa ttttcatttt tttgctgttg gtggggaacc 1981 tttggagctg cagggtgtgt tagcaaacta caggaccaaa tatcctgctc aaactgtaac 2041 cccaaaaaat gctacagttg acagtcagca gatgaacact gaccacaagg ctgttttgga 2101 taaggataat gcttatccag tggagtgctg ggttcctgat ccaagtaaaa atgaaaacac 2161 tagatatttt ggaacctaca caggtgggga aaatgtgcct cctgttttgc acattactaa 2221 cacagcaacc acagtgcttc ttgatgagca gggtgttggg cccttgtgca aagctgacag 2281 cttgtatgtt tctgctgttg acatttgtgg gctgtttacc aacacttctg gaacacagca 2341 gtggaaggga cttcccagat attttaaaat tacccttaga aagcggtctg tgaaaaaccc 2401 ctacccaatt tcctttttgt taagtgacct aattaacagg aggacacaga gggtggatgg 2461 gcagcctatg attggaatgt cctctcaagt agaggaggtt agggtttatg aggacacaga 2521 ggagcttcct ggggatccag acatgataag atacattgat gagtttggac aaaccacaac 2581 tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt gatgctattg ctttatttgt 2641 aaccattata agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca 2701 ggttcagggg gaggtgtggg aggtttttta aagcaagtaa aacctctaca aatgtggtat 2761 ggctgattat gatcatgaac agactgtgag gactgagggg cctgaaatga gccttgggac 2821 tgtgaatcaa tgcctgtttc atgccctgag tcttccatgt tcttctcccc accatcttca 2881 tttttatcag cattttcctg gctgtcttca tcatcatcat cactgtttct tagccaatct 2941 aaaactccaa ttcccatagc cacattaaac ttcatttttt gatacactga caaactaaac 3001 tctttgtcca atctctcttt ccactccaca attctgctct gaatactttg agcaaactca 3061 gccacaggtc tgtaccaaat taacataaga agcaaagcaa tgccactttg aattattctc 3121 ttttctaaca aaaactcact gcgttccagg caatgcttta aataatcttt gggcctaaaa 3181 tctatttgtt ttacaaatct ggcctgcagt gttttaggca cactgtactc attcatggtg 3241 actattccag ggggaaatat ttgagttctt ttatttaggt gtttcttttc taagtttacc 3301 ttaacactgc catccaaata atcccttaaa ttgtccaggt tattaattcc ctgacctgaa 3361 ggcaaatctc tggactcccc tccagtgccc tttacatcct caaaaactac taaaaactgg 3421 tcaatagcta ctcctagctc aaagttcagc ctgtccaagg gcaaattaac atttaaagct 3481 ttccccccac ataattcaag caaagcagct gctaatgtag ttttaccact atcaattggt 3541 cctttaaaca gccagtatct ttttttagga atgttgtaca ccatgcattt taaaaagtca 3601 tacaccactg aatccatttt gggcaacaaa cagtgtagcc aagcaactcc agccatccat 3661 tcttctatgt cagcagagcc tgtagaacca aacattatat ccatcctatc caaaagatca 3721 ttaaatctgt ttgttaacat ttgttctcta gttaattgta ggctatcaac ccgcttttta 3781 gctaaaacag tatcaacagc ctgttggcat atggtttttt ggtttttgct gtcagcaaat 3841 atagcagcat ttgcataatg cttttcatgg tacttatagt ggctgggctg ttctttttta 3901 atacatttta aacacatttc aaaactgtac tgaaattcca agtacatccc aagcaataac 3961 aacacatcat cacattttgt ttccattgca tactctgtta caagcttcca ggacacttgt 4021 ttagtttcct ctgcttcttc tggattaaaa tcatgctcct ttaacccacc tggcaaactt 4081 tcctcaataa cagaaaatgg atctctagtc aaggcactat acatcaaata ttccttatta 4141 acccctttac aaattaaaaa gctaaaggta cacaattttt gagcatagtt attaatagca 4201 gacactctat gcctgtgtgg agtaagaaaa aacagtatgt tatgattata actgttatgc 4261 ctacttataa aggttacaga atatttttcc ataattttct tgtatagcag tgcagctttt 4321 tcctttgtgg tgtaaatagc aaagcaagca agagttctat tactaaacac agcatgactc 4381 aaaaaactta gcaattctga aggaaagtcc ttggggtctt ctacctttct cttctttttt 4441 ggaggagtag aatgttgaga gtcagcagta gcctcatcat cactagatgg catttcttct 4501 gagcaaaaca ggttttcctc attaaaggca ttccaccact gctcccattc atcagttcca 4561 taggttggaa tctaaaatac acaaacaatt agaatcagta gtttaacaca ttatacactt 4621 aaaaatttta tatttacctt agagctttaa atctctgtag gtagtttgtc caattatgtc 4681 acaccacaga agtaaggttc cttcacaaag atcaagtcca aaccacattc taaagcaatc 4741 gaagcagtag caatcaaccc acacaagtgg atctttcctg tataattttc tattttcatg 4801 cttcatcctc agtaagcaca gcaagcatat gcagttagca gacattttct ttgcacactc 4861 aggccattgt ttgcagtaca ttgcatcaac accaggattt aaggaagaag caaatacctc 4921 agttgcatcc cagaagcctc caaagtcagg ttgatgagca tattttactc catcttccat 4981 tttcttgtac agagtattca ttttcttcat tttttcttca tctcctcctt tatcaggatg 5041 aaactccttg cattttttta aatatgcctt tctcatcaga ggaatattcc cccaggcact 5101 cctttcaaga cctagaaggt ccattagctg caaagattcc tctctgttta aaactttatc 5161 catctttgca aagctttttg caaaagccta ggcctccaaa aaagcctcct cactacttct 5221 ggaatagctc agaggccgag gcg // LOCUS BOVPROB1 257 bp ss-mRNA MAM 03-AUG-1990 DEFINITION Cattle prolactin mRNA, 5' end. ACCESSION M36873 KEYWORDS prolactin. SEGMENT 1 of 2 SOURCE Cattle hypophysis, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 257) AUTHORS Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and Baev,A.A. TITLE Genetic engineering of peptide hormones: II. Possible polymorphism of preprolactin in cattle. Data of molecular cloning JOURNAL Mol. Biol. 22, 117-121 (1988) STANDARD simple staff_review FEATURES from to/span description pept 51 > 257 prolactin precursor sigp 51 143 prolactin signal peptide matp 144 > 257 prolactin BASE COUNT 51 a 76 c 66 g 64 t ORIGIN 1 ataggacgag agcttcctgg tgaagtgtgt ttcttgaaat catcaccacc atggacagca 61 aaggttcgtc gcagaaagca gggtcccgcc tgctcctgct gctggtggtg tcaaatctac 121 tcttgtgcca gggtgtggtc tccacccccg tctgtcccaa tgggcctggc aactgccagg 181 tatcccttcg agacctgttt gaccgggcag tcatggtgtc ccactacatc catgacctct 241 cctcggaaat gttcaac // LOCUS BOVPROB2 551 bp ss-mRNA MAM 03-AUG-1990 DEFINITION Cattle prolactin mRNA, 3' end. ACCESSION M36874 KEYWORDS prolactin. SEGMENT 2 of 2 SOURCE Cattle hypophysis, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 551) AUTHORS Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and Baev,A.A. TITLE Genetic engineering of peptide hormones: II. Possible polymorphism of preprolactin in cattle. Data of molecular cloning JOURNAL Mol. Biol. 22, 117-121 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 405 prolactin BASE COUNT 153 a 135 c 119 g 144 t ORIGIN 1 ccggaagata aagaacaagc ccaacagacc catcatgaag tccttatgag cttgattctt 61 gggttgctgc gctcctggaa tgaccctctc tatcacctag tcaccgaggt gcggggtatg 121 aaaggagccc cagatgctat cctatcgagg gccatagaga ttgaggaaga aaacaaacga 181 cttctggaag gcatggagat gatatttggc caggttattc ctggagccaa agagactgag 241 ccctaccctg tgtggtcagg actcccgtcc ctgcaaacta aggatgaaga tgcacgttat 301 tctgcttttt ataacctgct ccactgcctg cgcagggatt caagcaagat tgacacttac 361 cttaagctcc tgaattgcag aatcatctac aacaacaact gctaagccca cattccatcc 421 tatccatttc tgagatggtt cttaatgatc cattccctgg caaacttctc tgagctttat 481 agctttgtaa tgcatgcttg gctctaatgg gtttcatctt aaataaaaac agactctgta 541 gcgatgtcaa a // LOCUS CHKBABA1 4051 bp ds-DNA VRT 03-AUG-1990 DEFINITION G.gallus B-creatine kinase (B-CK) gene, exons 1-4. ACCESSION M33711 J05496 KEYWORDS B-creatine kinase. SEGMENT 1 of 4 SOURCE G.gallus (strain White Leghorn) adult blood cell DNA, clones T[56,62]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 4051) AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C. TITLE A unique chicken B-creatine kinase gene gives rise to two B-creatine kinase isoproteins with distinct N-termini by alternative splicing JOURNAL J. Biol. Chem. 265, 11656-11666 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Wirz, 13-APR-1990. FEATURES from to/span description pept 1201 1393 B-creatine kinase, exon 2b (EC 2.7.3.2) (first expressed exon) 2449 2603 B-creatine kinase, exon 3 3464 + 3596 B-creatine kinase, exon 4 pep$ 1800 1977 B-creatine kinase, exon 2a 2449 2603 B-creatine kinase, exon 3 3464 + 3596 B-creatine kinase, exon 4 pre-msg 411 > 4051 B-creatine kinase mRNA and introns IVS 1394 1789 B-creatine kinase intron A IVS 1978 2498 B-creatine kinase intron B (no splice consensus) IVS 2604 3463 B-creatine kinase intron C IVS 3597 > 4051 B-creatine kinase intron D signal 384 390 TATA box signal 251 256 GC box signal 310 315 GC box signal 328 332 CAT box signal 346 351 TATA box signal 358 362 GC box signal 384 389 TATA box BASE COUNT 943 a 985 c 1128 g 995 t ORIGIN 1 tttgcacacg accagaagag accttcagcg cgttacacaa taaacctatg ttcaaatctg 61 aggtaaaagg agccagctag caaggatgga agcaaaaagc cgcttgtgcc cgccttggtg 121 cctaacggct gccgtattcg ggctgccggt ggccgctggg acggatgtac gtgcacgtct 181 gcgttcacga ggtcacgacg gctaccgctc caggctcgag gcggacgggc cacgaaccca 241 aagccaccgg ccgcccaacc gccgcgcaaa atggcggcac gccttcactg gccaggagga 301 gccgctgctc cgcccctcac ggctcgacca atggaatgaa tgggctataa ataggagcca 361 atgaggcggc gggtcgcgct ctatataaaa ggggcgcggc ggctgactcg gggctccact 421 ccggcttggg cagcgggtgg gtgcggagcg gccgctgagg tccttacgtt ggctgtggag 481 ccctcggtaa gtcccggggc agccgagccg cctgcagctt aaagaaaaga agaaggaaac 541 gatttaactt ttataagaaa aatgtatttt tgatggtgtg ggttggagcc ggtccggccg 601 ggaaagcgtc ggggggagcg ggatgcggcg gggcggcgcg gggcaggtgc cgggtgcgta 661 ccgggtgcgg ccgcagcttg agccggagcc ccgctctgaa agcagctcac cccggtattt 721 agcagggtag ttacatacgg tcgccccagc cgcgtccccc gagggagccg tccgttgtgt 781 aacggctcgc tgagcactgc ccactcgccc cggtccccgg ctcgcgctgc cggtaccggc 841 accggccgcg gcgctcccca acggccgctc cgcggggctc cgccggcggc tgctccgccg 901 cctccccgcg ccctggccga gccccggagc gtccggaccg ggatgctgaa ggatggggcg 961 aggtttgcgg agctggagaa agtttccttt tttttttttc tcggtttttt tttctctttt 1021 tttttttttt tttgggccct ccctccttgc ctggccttga ggaaggtgtt gtggcaacga 1081 aggtgcggaa ggcacgggtg gaagcgaagc caaggtgacc ttgcttggcg ttggcgcctc 1141 ccccgggccc ctggtgatag gcttccctct cttccctcag tgactgctgt agggacagcc 1201 atgcccttct caaacagcca caacctcctg aagatgaagt actccgttga cgatgagtac 1261 cccgacctga gcgtgcacaa caatcacatg gccaaggtgc tgaccctgga cctgtacaag 1321 aagttgaggg acaggcagac ttccagtgga tttacgctgg atgatgtcat tcagaccggg 1381 gttgacaacc caggtaacgg cggcaccgtt aggactgcag ggctctctgc tctccctcct 1441 tcctggaagt gctccttgac agcaaagcaa accccgcacg ctccgaggga cggctgtgct 1501 cctccgtgca atctgacaca gcaattccca ggcgctccca ttatgtaact gctctgctaa 1561 atacttcaaa tcccaaccaa aaaaaaaaaa aaaaagcaaa cgctcaaatg tgttctcatt 1621 tttatgcaac cggggggaaa aaaattctct gagctgtgtg ggcttacccg gctgttctga 1681 gctcgtctgt tccaaagtgg ttatgtgatg ttctgtagca cttctatgaa agcaaggggg 1741 agcggatgcc gtatacgtgt taacattaca gtatttattt ttactctagc aatatcaaaa 1801 tggcccaact aaataatcag aggctgcctc ccgaggagga gtacccggac ctgagcaccc 1861 acaacaacca catggccaaa gtgctaaccc tggatttgta caagaaactg agagacagag 1921 tcacgcccag tggcttcacc ctggatgatg tcattcagac tggggttgat aatcccggta 1981 aaatgcgctg agatattctg tgggagccag catgaagtac ccggtgcttt tcagtatggt 2041 gacttaagct aagctcgtgg aggtggtgtt tatgagcaga ctatgaaaat gcggttctgt 2101 ctgtagcaga gcgaataaag cgatgcatgc ttgctcacag caaagcatgc taaagtgtag 2161 ttctatctac gggtttgcat tagctagtaa aattggatgt gctgtgcgaa attgccctaa 2221 accaaagctt gacaagagtg tggcttaacg gcaactgctg tggtttccca tagcactgca 2281 tttgactaac tggatgaaat gtaaaatgta gggtctccca actgctctta ggccagtgca 2341 tttctaagtg agtagatctg ctgtgaagaa acttgcctgg ttagtgctgt gctggagtac 2401 agctgagcat cacccaactt agctcttctg cctttgttac cactccaggc caccccttca 2461 taatgacagt aggatgcgta gctggcgacg aagaatccta tgaagtgttt aaggaactct 2521 ttgatccggt tattgaggac aggcacggtg gctacaaacc aactgatgag cacaagaccg 2581 acctgaatgc tgataacctg caggtatacg tttttgttga gctactgata cccactgatc 2641 tgaggcctgg ggaaagctgt gagcggctgc ctaaacggat tgtttgggct ctgtcctctc 2701 tctctcatct ctcgctgccc gctgcagtct tacttaagag cctgcagcag taggttgtca 2761 gtaactgggt agaaccaggc agctgacaag caggacttgc ttcatgtgct gctgcaatgt 2821 aagtcatacc aaacacgttg agcacgtatt tctttgggct gtgctagttg tgtgaaacaa 2881 cccagtccca gggctgttag aaatcttaac tcttcctact gtatgatgtt gtaggctaaa 2941 aatcagccag tggtctcgtg ttatagaata gccattgtta cggctggaag tagaggtttc 3001 tgcccatctt acagacttca aatgttattg cagtctctgg gaattccaac tgctatcctg 3061 gttggcctgc cataggacat ctcaaaaatg agaacgcttt gacaataatg agtaactctc 3121 atttgcagga gtaaacacct acctaagcaa ggatttgtct cgtgttgata cctctagatt 3181 gcagagtgaa aggagtaaca gaagcctggg atttgggtct gagatgggtg taaaactgat 3241 ggtatcccta tcttctagca catagttggt agtcgtagtc ttagcctcaa aacaaggagt 3301 ttgatactct catacttgtt ggaaagctgg aaaaaatggt cctgaagtac atcgtattac 3361 aggacaacta gtcagtctca aatatgaatg cacgtgacct gtgtgtattt agggtaagca 3421 tcttagcaac aaggcagtac tgactaattt tctgatcttg tagggtggtg acgacttgga 3481 tcctaattac gtgctgagct ctcgtgtgag aactggcagg agcatccgtg gattctgcct 3541 tccccctcat tgcagcaggg gagagagacg ggctattgaa aagctctctg ttgaaggtaa 3601 agtgttgaat caggcaatga ggggaaactg ctgacctcgt ctaggcaagg aggtgctctg 3661 tgcagttgga gaatggaggt ggttggcaga agctgccttc ctcaaggctt aacttgagtg 3721 tggaaccttc cctgaaaaag gcagcttgga ttatggttgc gtcacgttgc agtcggtact 3781 cagagcagga acacatgtca gggaagttgg actgtattaa gttgaccttt gctgatcttg 3841 gaagcttctt tctgtgtgtg gggaaatagg ttggttcatc acggctcaaa cctgagatgt 3901 aataggtatg gagcaaccct agggcttgag tcatggttgc agtattttgt ggcttctttt 3961 aattgacttt tattaatgtt cagcttgctt agaggtgtag tccagctctg agagagctct 4021 agttctcagt ggaactcctc agctgcagtt c // LOCUS CHKBABA2 444 bp ds-DNA VRT 03-AUG-1990 DEFINITION G.gallus B-creatine kinase (B-CK) gene, exon 5. ACCESSION M33712 J05496 KEYWORDS B-creatine kinase. SEGMENT 2 of 4 SOURCE G.gallus (strain White Leghorn) adult blood cell DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 444) AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C. TITLE A unique chicken B-creatine kinase gene gives rise to two B-creatine kinase isoproteins with distinct N-termini by alternative splicing JOURNAL J. Biol. Chem. 265, 11656-11666 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Wirz, 13-APR-1990. FEATURES from to/span description pept + 45 + 216 B-creatine kinase (B-CK), exon 5 pep$ + 45 + 216 B-creatine kinase (B-CK), exon 5 pre-msg 1 > 444 B-creatine kinase mRNA and introns IVS < 1 44 B-creatine kinase intron E IVS 217 > 444 B-creatine kinase intron F BASE COUNT 108 a 95 c 120 g 121 t ORIGIN 1 tgtttatagt ctctgctccc atcctaaaag cacctctctt tcagctctgg gcagtctggg 61 aggtgatctc aaggggaagt actatgccct aaggaacatg actgatgcag agcagcagca 121 gctgattgat gatcacttct tgtttgacaa acctgtttct cctcttctgt tggcatctgg 181 gatggcacga gattggcctg atgccagggg tatctggtga gtatatgccc agcagggtga 241 gttgtgtgaa atccctgcaa caaaaagcct tgaaattttg gttgtggctg atataattgg 301 acaaaacagc acgttgtgaa cttacgctgc tgaatctaat gaggaacttc tgcgtggtat 361 cccctgggaa gcgatatctg aaagcagaac ttgccccaag tgctagtgtg cactgcattg 421 ggctggtagg aagaagccag cttg // LOCUS CHKBABA3 1586 bp ds-DNA VRT 03-AUG-1990 DEFINITION G.gallus B-creatine kinase (B-CK) gene, exons 6 and 7. ACCESSION M33713 J05496 KEYWORDS B-creatine kinase. SEGMENT 3 of 4 SOURCE G.gallus (strain White Leghorn) adult blood cell DNA, clone T8. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1586) AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C. TITLE A unique chicken B-creatine kinase gene gives rise to two B-creatine kinase isoproteins with distinct N-termini by alternative splicing JOURNAL J. Biol. Chem. 265, 11656-11666 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Wirz, 13-APR-1990. FEATURES from to/span description pept + 308 431 B-creatine kinase, exon 6 1170 + 1359 B-creatine kinase, exon 7 pep$ + 308 431 B-creatine kinase, exon 6 1170 + 1359 B-creatine kinase, exon 7 pre-msg 1 > 1586 B-creatine kinase mRNA and introns IVS < 1 307 B-creatine kinase intron G IVS 432 1171 B-creatine kinase intron H (no splice consensus) IVS 1360 > 1586 B-creatine kinase intron I BASE COUNT 435 a 336 c 385 g 430 t ORIGIN 1 tggactgaac ataaatgagg tgagcaatga ctgaactctt aagttttgat gctgttaaca 61 ctggctaatg tcctgcaatt actttctgac atggatagat acaggccact tccaaatacc 121 tcttgactag agtagggtaa aagtacaagc ttaattttgt tagtgcttag atggagaaca 181 gggagataaa tttctttgga gaagcaatta aacgagcctc ttggaagcta cagcagtgag 241 taacttggtg ctactgaatc tgtcttgaaa acgtcaagtg agacaacagg tctgtcttgt 301 ctcctaggca caatgacaac aagaccttcc ttgtttggat caatgaggag gatcacctca 361 gggttatttc catgcagaaa ggtggcaaca tgaaggaagt gtttactcgc ttctgtacag 421 ggctaacaca ggtaaatcaa atactgcttg agctctgggt ctaaatatta atctggttgg 481 agctgaccat acaggatctt ccaggctggg agcatgtaca caggaggtga cagccttctc 541 ctttgttctg cagggtgctt ctttgtagtt tgggtttctg ggaacatcct gatgagatcc 601 tctgctttga gggctctgaa atgcctgtac tggcaaagct ttatggctgt gaaatctgga 661 ataaaacctc taatgtgaca cctttaccaa actgtactta aaggatctct gtggttctat 721 aaaatgtatg tagagaccct gggtaggagg aaaaaattgc ttaaatgtga atgggtgaca 781 ttggacagaa gtttgtaggc tgcatgcttt tctcagcttc tggtcttgat atcagcttgt 841 gctttaccag ccccttccct aatcctaaca gatgcaccag tttgacatgt actcaatcaa 901 gcttcccagt agatagcaac ttcagacttt acagtgtagc ttgaacttag gctgtctcgc 961 tgctgggagc aagagtaagg ccttcaaaag cccctcattg gtgaccctct tctacaaatg 1021 gagggggccc tatgtaactg acttaaataa tctgctcaaa ctgttgtggc tgttgctatg 1081 caactgaagc tgtaaatgag cacttagtgg atgacctaca tcgttcaggc agagaacaat 1141 ctctaaacat ccattctctc tatttataga tagaaactct cttcaagtcc aaaaactacg 1201 agttcatgtg gaacccacac ttgggctaca tcctgacctg cccatccaac cttggaacgg 1261 ggctccgtgc tggtgtgcac atcaagctac caaaccttgg gaagcatgag aagtttggag 1321 aagtcctcaa gaggcttcgg ctgcagaaac gaggcacagg tgagaggtgg tgatacagtg 1381 atgccccaag caggctgcct ctgaccagac agcagtgaga gcctggctgc agcgaagcta 1441 atcttggttc atgctgacag tgcctgacag accatccaga agggaccctt ctgaagcgtg 1501 gggggggaca accaagcatt aaaagtgtag cttagaagag gtggagatgc tggtactttg 1561 cttaagcctt attatatcca gcttct // LOCUS CHKBABA4 650 bp ds-DNA VRT 03-AUG-1990 DEFINITION G.gallus B-creatine kinase (B-CK) gene, exon 8. ACCESSION M33714 J05496 KEYWORDS B-creatine kinase. SEGMENT 4 of 4 SOURCE G.gallus (strain White Leghorn) adult blood cell DNA, clone T8. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 650) AUTHORS Wirz,T., Braendle,U., Soldati,T., Hossle,J.P. and Perriard,J.-C. TITLE A unique chicken B-creatine kinase gene gives rise to two B-creatine kinase isoproteins with distinct N-termini by alternative splicing JOURNAL J. Biol. Chem. 265, 11656-11666 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Wirz, 13-APR-1990. FEATURES from to/span description pept + 49 227 B-creatine kinase, exon 8 pep$ + 49 227 B-creatine kinase, exon 8 pre-msg 1 > 396 B-creatine kinase mRNA and introns IVS < 1 48 B-creatine kinase intron J signal 391 396 poly-A signal BASE COUNT 161 a 129 c 161 g 199 t ORIGIN 1 aactggctca ggcagcggtc agcgtactaa ttaaatctct tcgtccaggt ggtgtggaca 61 cagctgcagt cggaggggtg tttgatgtat ccaatgctga tcgtcttggc ttctctgagg 121 tagagctggt gcagatggtg gtagatggtg tgaagctgct cattgaaatg gaaaaacgcc 181 ttgagaaagg ccagtccatt gatgacctca tgccagctca gaaataaagc actttattct 241 catgcttcct aacttattgg atgaataata aaatgtcact ccaatttcaa accccttggg 301 tcagagccca cttagttaca ctgtagagaa gtcttccatc catctgtgtt agagtttatt 361 ttttgatggc tgaaatgttg ttgaaaatga aataaactgt tgttttggcc tgacctgcct 421 tagatgtgtt aactgagtgt ctttgaggtc taagtgactt gctgttcctg gtaagctgaa 481 ctgattggtt ttaggtgcta gcacatcatc ctggctgtgc tgctgtagtc tgcttggtgg 541 gcttagaaat gtgcttatag cagtgtcaaa agcaagcctc taatgtgatt gcagctgccg 601 taaacctgag aactggtgct aaacatgagt cactttaaac ctgccctacc // LOCUS HUMSATIII1 132 bp ds-DNA PRI 03-AUG-1990 DEFINITION Human satellite III DNA fragment, 5' end. ACCESSION M25431 KEYWORDS satellite repetitive sequence. SEGMENT 1 of 3 SOURCE Human DNA, clone lambda-HS3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 132) AUTHORS Cooke,H.J. and Hindley,J. TITLE Cloning of human satellite III DNA: Different components are on different chromosomes JOURNAL Nucleic Acids Res. 6, 3177-3197 (1979) STANDARD simple staff_review BASE COUNT 39 a 27 c 14 g 52 t ORIGIN 1 aattcatttg aagacaattc cattcaatac caattgatga tggttatttt tgattccatt 61 tgatgatgat tacattccat ttcatcataa ttccattcga ttccactcga gattccattc 121 gattccattc aa // LOCUS HUMSATIII2 75 bp ds-DNA PRI 03-AUG-1990 DEFINITION Human satellite III DNA fragment. ACCESSION M25432 KEYWORDS satellite repetitive sequence. SEGMENT 2 of 2 SOURCE Human satellite DNA, clone lambda-HS3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 75) AUTHORS Cooke,H.J. and Hindley,J. TITLE Cloning of human satellite III DNA: Different components are on different chromosomes JOURNAL Nucleic Acids Res. 6, 3177-3197 (1979) STANDARD simple staff_review BASE COUNT 19 a 15 c 11 g 30 t ORIGIN 1 cgaatgaatg agtccatcca tttcaatttc atgataattc cattcgtttc aattcgatgg 61 tgtttccatt cgatt // LOCUS HUMSATIII3 93 bp ds-DNA PRI 03-AUG-1990 DEFINITION Human satellite III DNA fragment, 3' end. ACCESSION M25433 KEYWORDS satellite repetitive sequence. SEGMENT 3 of 3 SOURCE Human satellite DNA, clone lambda-HS3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 93) AUTHORS Cooke,H.J. and Hindley,J. TITLE Cloning of human satellite III DNA: Different components are on different chromosomes JOURNAL Nucleic Acids Res. 6, 3177-3197 (1979) STANDARD simple staff_review BASE COUNT 21 a 20 c 16 g 36 t ORIGIN 1 ttcattcgat tcatttgatg atgattcatg cgcgattcat tagatgatga cccctttcat 61 ttccattcaa tggaggattc cattcggttc cat // LOCUS MS2AAR 218 bp ss-RNA PHG 03-AUG-1990 DEFINITION Bacteriophage MS2 A protein mRNA, 3' end, and coat protein mRNA, complete cds. ACCESSION M25187 KEYWORDS A protein; coat protein. SOURCE Bacteriophage MS2 RNA. ORGANISM Bacteriophage MS2 Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Leviviridae. REFERENCE 1 (bases 1 to 218) AUTHORS Contreras,R., Ysebaert,M., Min Jou,W. and Fiers,W. TITLE Bacteriophage MS2 RNA: Nucleotide sequence of the end of the A protein gene and the intercistronic region JOURNAL Nature New Biol. 241, 99-101 (1973) STANDARD simple staff_review FEATURES from to/span description pept < 1 137 A protein (AA at 3) pept 161 > 218 coat protein BASE COUNT 54 a 58 c 52 g 54 t ORIGIN 1 ctaaggccca aatctcagcc atgcatcggg gagtacaatc cgtatggcca acaactggcg 61 cgtacgtaaa gtctcctttc tcgatggtcc ataccttaga tgcgttagca ttaatcaggc 121 aacggctctc tagatagagc cctcaaccgg agtttgaagc atggcttcta actttactca 181 gttcgttctc gtcgacaatg gcggaactgg cgacgtga // LOCUS MUSCMDH1 567 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 1. ACCESSION M36076 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 1 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 567) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept 540 + 542 cytosolic malate dehydrogenase, exon 1 (cMDHase) (EC 1.1.1.37) pre-msg 458 > 567 cytosolic malate dehydrogenase mRNA and introns (alt.) pre-msg 459 > 567 cytosolic malate dehydrogenase mRNA and introns (alt.) IVS 543 > 567 cytosolic malate dehydrogenase intron A BASE COUNT 109 a 156 c 169 g 133 t ORIGIN 1 gttcttccgc aagcgtcaat tcctcccgcc tctgagagag tttttaaaag gtttgtttcc 61 gggtcgagcg ggctttaagc aacggaaggt ctcttagccg cccttgttta gtcttgggga 121 ggatagattc tcgtggagcg acgtgtgtgt cgctcagggg tcggtttctc ctccctcgag 181 ttaacgcctc ctctcctgcc aattgctgag cgccatcagg caggcgcctc actcaaagca 241 ccaaccctct gctcacagac gcgctccaat caccgaggct cagcccggga ctactttgca 301 gcgaggcgcg taggaagaag gggtttgggg gaattgtagt ttagcactgt caggaatgca 361 cgaaggtggg cgccagaggt cgcggaagaa ctacacttcc cagaaagggc cgtgtctcca 421 gccgaagcct ttgcgggcca gccccggttc tctcccagag tctgttccgc tgtagaggtg 481 acctgactgc tggagactgc cttttgcagg tgcagagatc ggccttgcag tttgcaataa 541 tggtgaggtg ggctctggaa ctcacac // LOCUS MUSCMDH2 149 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 2. ACCESSION M36077 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 2 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 149) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 27 + 125 cytosolic malate dehydrogenase, exon 2 (cMDHase) pre-msg < 1 > 149 cytosolic malate dehydrogenase mRNA and introns IVS < 1 26 cytosolic malate dehydrogenase intron A IVS 126 > 149 cytosolic malate dehydrogenase intron B BASE COUNT 41 a 25 c 36 g 47 t ORIGIN About 4.8 kb after segment 1. 1 taaactagtg gtctttgtca ttacagtctg aaccaatcag agtccttgtg actggagcag 61 ctggtcaaat tgcatattca ctgttgtaca gtattggaaa tggatctgtc tttgggaaag 121 accaggtagg ggcatgttct tataaatac // LOCUS MUSCMDH3 147 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 3. ACCESSION M36078 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 3 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 147) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 122 cytosolic malate dehydrogenase exon 3 (cMDHase) pre-msg < 1 > 147 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron B IVS 123 > 147 cytosolic malate dehydrogenase intron C BASE COUNT 27 a 39 c 40 g 41 t ORIGIN About 0.8 kb after segment 2. 1 gcctgctgtc cttgctcttt ggcagcccat cattcttgtg ctgttggaca tcacccccat 61 gatgggtgtt ctggacggtg tcctgatgga actgcaagac tgtgcccttc cccttctgca 121 gggtgagttg gaagtcaaag aaaacag // LOCUS MUSCMDH4 226 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 4. ACCESSION M36079 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 4 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 226) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 201 cytosolic malate dehydrogenase, exon 4 pre-msg < 1 > 226 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron C IVS 202 > 226 cytosolic malate dehydrogenase intron D BASE COUNT 69 a 45 c 62 g 50 t ORIGIN About 1.6 kb after segment 3. 1 tgtgtgttgt ttgccatgtc catagatgtc attgcaacgg acaaagaaga gattgccttc 61 aaagacctgg atgtggctgt cctagtgggc tccatgccaa gaagggaagg catggagagg 121 aaggacctac tgaaagccaa tgtgaaaatc ttcaaatccc agggcacagc cttggagaaa 181 tacgccaaga aatcagttaa ggtgactcac acagatttca tggggt // LOCUS MUSCMDH5 173 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 5. ACCESSION M36080 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 5 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 173) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 148 cytosolic malate dehydrogenase, exon 5 pre-msg < 1 > 173 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron D IVS 149 > 173 cytosolic malate dehydrogenase intron E BASE COUNT 48 a 49 c 35 g 41 t ORIGIN About 0.9 kb after segment 4. 1 tctgctctgt gcctccacca tctaggtcat tgttgtggga aacccagcca atacgaactg 61 cctgacagcc tccaagtcag cgccatcgat ccccaaggag aatttcagtt gcctgactcg 121 cttggaccac aaccgagcaa aatctcaagt aagaaaaatg gcttttagat tta // LOCUS MUSCMDH6 227 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 6. ACCESSION M36081 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 6 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 227) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 202 cytosolic malate dehydrogenase, exon 6 pre-msg < 1 > 227 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron E IVS 203 > 227 cytosolic malate dehydrogenase intron F BASE COUNT 71 a 40 c 60 g 56 t ORIGIN About 3.4 kb after segment 5. 1 tgatatgatg ttttacatga actagattgc tcttaaactc ggtgtaaccg ctgatgatgt 61 aaagaatgtc attatctggg gaaatcattc atcgacccag tatccagatg tcaatcatgc 121 caaggtgaaa ctgcaaggaa aggaagtcgg tgtgtatgaa gccctgaaag acgacagctg 181 gctgaaggga gagttcatca cggtaagaag gatgtgaacc ctctgag // LOCUS MUSCMDH7 164 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 7. ACCESSION M36082 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 7 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 164) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 139 cytosolic malate dehydrogenase, exon 7 pre-msg < 1 > 164 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron F IVS 140 > 164 cytosolic malate dehydrogenase intron G BASE COUNT 34 a 46 c 44 g 40 t ORIGIN About 0.7 kb after segment 6. 1 actgtctctc tgttgtccca cccagactgt gcaacagcgt ggtgctgctg tcatcaaggc 61 tcggaagctg tccagtgcaa tgtctgctgc gaaagccatc gcagaccaca tcagagacat 121 ctggtttgga accccagagg tgagggttct catttgtact ggcc // LOCUS MUSCMDH8 131 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 8. ACCESSION M36083 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 8 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 131) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 + 115 cytosolic malate dehydrogenase, exon 8 pre-msg < 1 > 131 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron G IVS 116 > 131 cytosolic malate dehydrogenase intron H BASE COUNT 27 a 27 c 36 g 41 t ORIGIN About 0.3 kb after segment 7. 1 gctatgataa tgtaaacttt ttcagggaga gttcgtgtcg atgggtgtta tctctgatgg 61 caactcctat ggtgtccctg atgacctgct ctactcactc cctgtcgtga tcaaggtggg 121 tacatggaga g // LOCUS MUSCMDH9 528 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse cytosolic malate dehydrogenase (cMDHase) gene, exon 9. ACCESSION M36084 KEYWORDS cytosolic malate dehydrogenase. SEGMENT 9 of 9 SOURCE Mouse (strain C3H/He) male liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 528) AUTHORS Setoyama,C., Joh,T., Tsuzuki,T. and Shimada,K. TITLE Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase gene JOURNAL J. Mol. Biol. 202, 355-364 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 26 151 cytosolic malate dehydrogenase, exon 9 (cMDHase) pre-msg < 1 > 353 cytosolic malate dehydrogenase mRNA and introns IVS < 1 25 cytosolic malate dehydrogenase intron H signal 329 334 poly-A signal signal 449 454 poly-A signal BASE COUNT 175 a 98 c 100 g 155 t ORIGIN About 1.4 kb after segment 8. 1 agctctcgcc cttgtcccct gacagaataa gacctggaag tttgttgaag gcctccccat 61 taatgacttc tcccgtgaaa agatggacct gacagcaaag gagctgaccg aggaaaagga 121 gaccgctttt gagtttctct cctctgcgtg actagacact cgttttgaca tcagcagaca 181 gccgaaggct gaggaatcaa aatgtcgtct ttgagcctag taccaaacag taataatgct 241 acattcaaat tgtgaacagc aaaatatttt aaatagtgtg tgctttatga tttgtgaaag 301 tctatcatgt tgttagtgct gcaatctaaa taaaagtata ttcaagtgaa aatctctcag 361 actctgtttc tactttatat ttagtatctt caggaaaaca agtttggccc aatagattat 421 aattttactt ttttaattga ctaaaagaaa taaagatgga aaatattatg aagtaaagca 481 ttagtctcta acataaacaa ggaagcccaa tcaatttcag agggatcc // LOCUS MUSMHHLABA 158 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class I HLA-B7 gene, 5' flank. ACCESSION M35971 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse L cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 158) AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I., Probst,H. and Mueller,C.A. TITLE Differential regulation of HLA class I genes by interferon JOURNAL Immunogenetics 31, 245-252 (1990) STANDARD simple staff_review BASE COUNT 21 a 53 c 43 g 41 t ORIGIN 1 ggtctcaggg tctcaggctc cgagggccgg gtctgcaatg gggaggcgca cgttggggat 61 tccccactcc cctgagtttc acttcttctc ccaacttgtg tcgggtcctt cttccaggat 121 actcgtgacg cgtccccact tcccactccc attgggta // LOCUS MUSMHHLABB 160 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class I HLA-Bw64 gene, 5' flank. ACCESSION M35972 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse L cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 160) AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I., Probst,H. and Mueller,C.A. TITLE Differential regulation of HLA class I genes by interferon JOURNAL Immunogenetics 31, 245-252 (1990) STANDARD simple staff_review BASE COUNT 21 a 55 c 43 g 41 t ORIGIN 1 ggctctcagg gtctcaggct ccgagggccg cgtctgcaat ggggaggcgc agcgttgggg 61 attccccact cccctgagtt tcacttcttc tcccaacttg tgtcgggtcc ttcttccagg 121 atactcgtga cgcatcccca cttcccactc ccattgggtg // LOCUS MUSMHHLABC 159 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class I HLA-B38 gene, 5' flank. ACCESSION M35973 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse L cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 159) AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I., Probst,H. and Mueller,C.A. TITLE Differential regulation of HLA class I genes by interferon JOURNAL Immunogenetics 31, 245-252 (1990) STANDARD simple staff_review BASE COUNT 21 a 52 c 44 g 42 t ORIGIN 1 ggctctcagg gtctcaggct cgagggccgc gtctgtaatg gggaggcgga gcgttgggga 61 ttccccactc ccctgagttt cacttcttct cccaacttgt gtcgggtcct tcttccagga 121 tactcgtgac gcatccccac ttcccactcc cattgggtg // LOCUS MUSMHHLABD 159 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class I HLA-B39 gene, 5' flank. ACCESSION M35974 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse L cell line DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 159) AUTHORS Schmidt,H., Gekeler,V., Haas,H., Engler-Blum,G., Steiert,I., Probst,H. and Mueller,C.A. TITLE Differential regulation of HLA class I genes by interferon JOURNAL Immunogenetics 31, 245-252 (1990) STANDARD simple staff_review BASE COUNT 21 a 52 c 44 g 42 t ORIGIN 1 ggctctcagg gtctcaggct cgagggccgc gtctgtaatg gggaggcgga gcgttgggga 61 ttccccactc ccctgagttt cacttcttct cccaacttgt gtcgggtcct tcttccagga 121 tactcgtgac gcatccccac ttcccactcc cattgggtg // LOCUS MUSTCEAEA 396 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35486 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL23.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 396) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 396 T-cell receptor alpha-chain precursor (AA at 1) sigp < 1 33 T-cell receptor alpha-chain signal peptide matp 34 > 396 T-cell receptor alpha-chain recomb 327 328 V-region end/J-region start recomb 385 386 J-region end/C-region start BASE COUNT 113 a 101 c 95 g 87 t ORIGIN 1 ctaggccttc acctagctgg ggtgagtggc cagcaggaga aacgtgacca gcagcaggtg 61 agacaaagtc cccaatctct gacagtctgg gaaggagaga ccgcaattct gaactgcagt 121 tatgagaaca gtgcttttga ctacttccca tggtaccagc agttccctgg ggaaggtccc 181 gctctcctga tatccatact ttcagtgtcc gataaaaagg aagatggacg attcacaatc 241 ttcttcaata aaagggagaa aaagctctcc ttgcacattg cagactctca gcctggagac 301 tcagccacct acttctgtgc agcaagtgca aattctggga cttaccagag gtttggaact 361 gggacaaaac tccaagtcgt tccaaacatc cagaac // LOCUS MUSTCEAEB 384 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35487 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL51.1.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 384) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 384 T-cell receptor alpha-chain precursor (AA at 1) sigp < 1 39 T-cell receptor alpha-chain signal peptide matp 40 > 384 T-cell receptor alpha-chain recomb 315 316 V-region end/J-region start recomb 373 374 J-region end/C-region start BASE COUNT 111 a 96 c 93 g 84 t ORIGIN 1 ctagctgggg tgagtggcca gcaggagaaa cgtgaccagc agcaggtgag acaaagtccc 61 caatctctga cagtctggga aggagagacc gcaattctga actgcagtta tgagaacagt 121 gcttttgact acttcccatg gtaccagcag ttccctgggg aaggtcccgc tctcctgata 181 tccatacttt cagtgtccga taaaaaggaa gatggacgat tcacaatctt cttcaataaa 241 agggagaaaa agctctcctt gcacattgca gactctcagc ctggagactc agccacctac 301 ttctgtgcag caagtgcaaa ttctgggact taccagaggt ttggaactgg gacaaaactc 361 caagtcgttc caaacatcca gaac // LOCUS MUSTCEAEC 388 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35488 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL83.12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 388) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 388 T-cell receptor alpha-chain precursor (AA at 2) sigp < 1 43 T-cell receptor alpha-chain signal peptide matp 44 > 388 T-cell receptor alpha-chain recomb 319 320 V-region end/J-region start recomb 377 378 J-region end/C-region start BASE COUNT 112 a 98 c 93 g 85 t ORIGIN 1 tcacctagct ggggtgagtg gccagcagga gaaacgtgac cagcagcagg tgagacaaag 61 tccccaatct ctgacagtct gggaaggaga gaccgcaatt ctgaactgca gttatgagaa 121 cagtgctttt gactacttcc catggtacca gcagttccct ggggaaggtc ccgctctcct 181 gatatccata ctttcagtgt ccgataaaaa ggaagatgga cgattcacaa tcttcttcaa 241 taaaagggag aaaaagctct ccttgcacat tgcagactct cagcctggag actcagccac 301 ctacttctgt gcagcaagtg caaattctgg gacttaccag aggtttggaa ctgggacaaa 361 actccaagtc gttccaaaca tccagaac // LOCUS MUSTCEAED 359 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35489 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL127.6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 359) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 359 T-cell receptor alpha-chain precursor (AA at 3) sigp < 1 14 T-cell receptor alpha-chain signal peptide matp 15 > 359 T-cell receptor alpha-chain recomb 290 291 V-region end/J-region start recomb 348 349 J-region end/C-region start BASE COUNT 107 a 91 c 81 g 80 t ORIGIN 1 agaaacgtga ccagcagcag gtgagacaaa gtccccaatc tctgacagtc tgggaaggag 61 agaccgcaat tctgaactgc agttatgaga acagtgcttt tgactacttc ccatggtacc 121 agcagttccc tggggaaggt cccgctctcc tgatatccat actttcagtg tccgataaaa 181 aggaagatgg acgattcaca atcttcttca ataaaaggga gaaaaagctc tccttgcaca 241 ttgcagactc tcagcctgga gactcagcca cctacttctg tgcagcaagt gcaaattctg 301 ggacttacca gaggtttgga actgggacaa aactccaagt cgttccaaac atccagaac // LOCUS MUSTCEAEE 396 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35490 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL172.10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 396) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 396 T-cell receptor alpha-chain precursor (AA at 1) sigp < 1 51 T-cell receptor alpha-chain signal peptide matp 52 > 396 T-cell receptor alpha-chain recomb 327 328 V-region end/J-region start recomb 385 386 J-region end/C-region start BASE COUNT 113 a 101 c 95 g 87 t ORIGIN 1 ctaggccttc acctagctgg ggtgagtggc cagcaggaga aacgtgacca gcagcaggtg 61 agacaaagtc cccaatctct gacagtctgg gaaggagaga ccgcaattct gaactgcagt 121 tatgagaaca gtgcttttga ctacttccca tggtaccagc agttccctgg ggaaggtccc 181 gctctcctga tatccatact ttcagtgtcc gataaaaagg aagatggacg attcacaatc 241 ttcttcaata aaagggagaa aaagctctcc ttgcacattg cagactctca gcctggagac 301 tcagccacct acttctgtgc agcaagtgca aattctggga cttaccagag gtttggaact 361 gggacaaaac tccaagtcgt tccaaacatc cagaac // LOCUS MUSTCEAEF 390 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35491 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL183.2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 390) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 390 T-cell receptor alpha-chain precursor (AA at 1) sigp < 1 45 T-cell receptor alpha-chain signal peptide matp 46 > 390 T-cell receptor alpha-chain recomb 321 322 V-region end/J-region start recomb 379 380 J-region end/C-region start BASE COUNT 112 a 99 c 93 g 86 t ORIGIN 1 cttcacctag ctggggtgag tggccagcag gagaaacgtg accagcagca ggtgagacaa 61 agtccccaat ctctgacagt ctgggaagga gagaccgcaa ttctgaactg cagttatgag 121 aacagtgctt ttgactactt cccatggtac cagcagttcc ctggggaagg tcccgctctc 181 ctgatatcca tactttcagt gtccgataaa aaggaagatg gacgattcac aatcttcttc 241 aataaaaggg agaaaaagct ctccttgcac attgcagact ctcagcctgg agactcagcc 301 acctacttct gtgcagcaag tgcaaattct gggacttacc agaggtttgg aactgggaca 361 aaactccaag tcgttccaaa catccagaac // LOCUS MUSTCEAEG 395 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse T-cell receptor alpha-chain rearranged mRNA V-D-J region. ACCESSION M35492 KEYWORDS T-cell receptor; constant region; diversity exon; joining exon; processed gene; variable region. SOURCE Mouse (strain SJL/J) spleen, cDNA to mRNA, hybridoma PL214.12. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 395) AUTHORS Hood,L., Kumar,V., Osman,G., Beall,S.S., Gomez,C., Funkhouser,W., Kono,D.H., Nickerson,D., Zaller,D.M. and Urban,J.L. TITLE Autoimmune disease and T-cell immunologic recognition JOURNAL Cold Spring Harb. Symp. Quant. Biol. 54, 859-874 (1989) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 395 T-cell receptor alpha-chain precursor (AA at 3) sigp < 1 50 T-cell receptor alpha-chain signal peptide matp 51 > 395 T-cell receptor alpha-chain recomb 326 327 V-region end/J-region start recomb 384 385 J-region end/C-region start BASE COUNT 108 a 101 c 98 g 88 t ORIGIN 1 ctaggccttc acatgattgt gatgctcctc atattcacaa gggcccatgg agactcagtg 61 actcagacgg aaggtcaagt ggccctctca gaagaggact ttcttacgat acactgcaac 121 tactcagcct agggtaccca gctctgttct ggtacgagca gtatcccggg gaaggtccac 181 agttcctctt tagagcctca agggacaaag agaaaggaag cagcagaggt tttgaagcta 241 catatgataa agggaccacc tccttccact tgcggaaagc ctcagtgcaa gagtcagact 301 cggctgtgta ctactgtgct ctgggtccaa attctgggac ttaccagagg tttggaactg 361 ggacaaaact ccaagtcgtt ccaaacatcc agaac // LOCUS RATIDA 198 bp ds-DNA ROD 03-AUG-1990 DEFINITION Rat ID-A element in the S-100 beta-subunit gene. ACCESSION M35983 KEYWORDS . SOURCE Rat brain DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 198) AUTHORS Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and Takahashi,Y. TITLE ID sequences in the genes of three brain-specific proteins JOURNAL Biochem. Int. 15, 809-816 (1987) STANDARD simple staff_review FEATURES from to/span description rpt 5 19 direct repeat rpt 184 198 direct repeat BASE COUNT 94 a 26 c 58 g 20 t ORIGIN 1 agctaaaaac gaactccgtg gggttgggga tttagctcag tggtagagcg cttgcctagc 61 aagagcaagg ccctgggttc ggtccccagc tccgaaaaaa agaaaaaaag aaaaaagaaa 121 aagaagaaga agaagaagaa gaagaagaag aagaagaaga agaagaagaa gaagaagaag 181 aagaaaaaag aaacgcgt // LOCUS RATIDB 141 bp ds-DNA ROD 03-AUG-1990 DEFINITION Rat ID-B element in the S-100 beta-subunit gene. ACCESSION M35984 KEYWORDS . SOURCE Rat brain DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and Takahashi,Y. TITLE ID sequences in the genes of three brain-specific proteins JOURNAL Biochem. Int. 15, 809-816 (1987) STANDARD simple staff_review FEATURES from to/span description rpt 5 19 direct repeat rpt 121 135 direct repeat BASE COUNT 52 a 26 c 43 g 20 t ORIGIN 1 gtccaaagaa tggggcagag aggctgggga tttagctcag tggtagagcg cttacctagg 61 aagcgacagg ccctgggttc ggtccccagc tccgaaagaa agaaccaaaa aaaaaaaaaa 121 aaagaatggg gcagaataga t // LOCUS RATIDC 141 bp ds-DNA ROD 03-AUG-1990 DEFINITION Rat ID-C element in the S-100 beta-subunit gene. ACCESSION M35985 KEYWORDS . SOURCE Rat brain DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 141) AUTHORS Usui,H., Kuwano,R., Maeda,T., Araki,K., Sakimura,K., Kushiya,E. and Takahashi,Y. TITLE ID sequences in the genes of three brain-specific proteins JOURNAL Biochem. Int. 15, 809-816 (1987) STANDARD simple staff_review FEATURES from to/span description rpt 7 19 direct repeat rpt 113 125 direct repeat BASE COUNT 39 a 32 c 48 g 22 t ORIGIN 1 cccttagagc ccaagggcga gggttgggga tttagctcag tggtagagcg cttgcctggc 61 aagtgcaagg ccctgggttc ggtccccagc tccggaaaaa aaaaaaaaaa aggagcccaa 121 gggtgggtca aaggcagcta g // LOCUS RSFGCA1 71 bp ds-DNA BCT 03-AUG-1990 DEFINITION Plasmid RSF1010 transposon Tn951 5' inverted terminal repeat. ACCESSION M25019 KEYWORDS inverted terminal repeat. SEGMENT 1 of 3 SOURCE Plasmid RSF1010 DNA. ORGANISM Plasmid RSF1010 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 71) AUTHORS Cornelis,G., Sommer,H. and Saedler,H. TITLE Transposon Tn951 (TnLac) is defective and related to Tn3 JOURNAL Mol. Gen. Genet. 184, 241-248 (1981) STANDARD simple staff_review FEATURES from to/span description rpt 16 56 5' inverted terminal repeat rpt 11 15 5' insertion target sequence BASE COUNT 23 a 13 c 19 g 16 t ORIGIN 1 ttgtcaaact gactaggggt ctgacgctca gtggaacgaa aactcacgtt aagcaattgt 61 ggctgagaaa a // LOCUS RSFGCA2 109 bp ds-DNA BCT 03-AUG-1990 DEFINITION Plasmid RSF1010 transposon Tn951 DNA fragment. ACCESSION M25020 KEYWORDS inverted terminal repeat. SEGMENT 2 of 3 SOURCE Plasmid RSF1010 DNA. ORGANISM Plasmid RSF1010 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 109) AUTHORS Cornelis,G., Sommer,H. and Saedler,H. TITLE Transposon Tn951 (TnLac) is defective and related to Tn3 JOURNAL Mol. Gen. Genet. 184, 241-248 (1981) STANDARD simple staff_review BASE COUNT 29 a 27 c 23 g 30 t ORIGIN 1 gaatcggcat gagtgcaacg gcataagcct gcttaaaaaa caggcttaac gtgggatatt 61 ttccgttttc caagcggacc cctatcaata tgctcggcca ttattcctt // LOCUS RSFGCA3 105 bp ds-DNA BCT 03-AUG-1990 DEFINITION Plasmid RSF1010 transposon Tn951 3' inverted terminal repeat. ACCESSION M25021 KEYWORDS inverted terminal repeat. SEGMENT 3 of 3 SOURCE Plasmid RSF1010 DNA. ORGANISM Plasmid RSF1010 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 105) AUTHORS Cornelis,G., Sommer,H. and Saedler,H. TITLE Transposon Tn951 (TnLac) is defective and related to Tn3 JOURNAL Mol. Gen. Genet. 184, 241-248 (1981) STANDARD simple staff_review FEATURES from to/span description rpt 50 90 3' inverted terminal repeat rpt 91 95 3' insertion target sequence BASE COUNT 31 a 27 c 26 g 21 t ORIGIN 1 accaaaggac atctgagacc attaaaagag gcgtcagagg tagaaaacgt tgcttaacgt 61 gagttttcgt tccactgagc gtcagacccc gactacgtcc acgcg // LOCUS SYNPROA 808 bp ss-mRNA SYN 03-AUG-1990 DEFINITION Manipulation-derived rearrangement of cattle prolactin mRNA. ACCESSION M35587 KEYWORDS prolactin. SOURCE Synthetic, cDNA to mRNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 808) AUTHORS Rubtsov,P.M., Oganesyan,R.G., Gorbulev,V.G., Skryabin,K.G. and Baev,A.A. TITLE Genetic engineering of peptide hormones: II. Possible polymorphism of preprolactin in cattle. Data of molecular cloning JOURNAL Mol. Biol. 22, 117-121 (1988) STANDARD simple staff_review BASE COUNT 217 a 201 c 195 g 195 t ORIGIN 1 ttgaacattt ccgaggagag gtcatggatg tagtgggaca ccatgactgc ccggtcaaac 61 aggtctcgaa gggatacctg gcagttgcca ggcccattgg gacagacggg ggtggagacc 121 acaccctggc acaagagtag atttgacacc accagcagca ggagcaggcg ggaccctgct 181 ttctgcgacg aacctttgct gtccatggtg gtgatgattt caagaaacac acttcaccag 241 gaagctctcg tcctatcccg gaagataaag aacaagccca acagacccat catgaagtcc 301 ttatgagctt gattcttggg ttgctgcgct cctggaatga ccctctctat cacctagtca 361 ccgaggtgcg gggtatgaaa ggagccccag atgctatcct atcgagggcc atagagattg 421 aggaagaaaa caaacgactt ctggaaggca tggagatgat atttggccag gttattcctg 481 gagccaaaga gactgagccc taccctgtgt ggtcaggact cccgtccctg caaactaagg 541 atgaagatgc acgttattct gctttttata acctgctcca ctgcctgcgc agggattcaa 601 gcaagattga cacttacctt aagctcctga attgcagaat catctacaac aacaactgct 661 aagcccacat tccatcctat ccatttctga gatggttctt aatgatccat tccctggcaa 721 acttctctga ggtttatagc tttgtaatgc atgcttggct ctaatgggtt tcatcttaaa 781 taaaaacaga ctctgtagcg atgtcaaa // LOCUS FLANSHB 856 bp ss-RNA VRL 03-AUG-1990 DEFINITION Influenza A/Houston/24269/85 (H3N2) nonstructural protein RNA (seg 8). ACCESSION M17699 KEYWORDS nonstructural protein. SOURCE Influenza A/Houston/24269/85 RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 856) AUTHORS Buonagurio,D.A., Nakada,S., Parvin,J.D., Krystal,M., Palese,P. and Fitch,W.M. TITLE Evolution of human influenza A viruses over 50 years: Rapid, uniform rate of change in NS gene JOURNAL Science 232, 980-982 (1986) STANDARD full staff_review COMMENT Computer-readable sequence for [1] kindly provided by P.Palese, 12-OCT-1987. FEATURES from to/span description BASE COUNT 280 a 173 c 198 g 205 t ORIGIN Unreported. 1 caacactgtg tcaatcttcc aggtagattg ctttctttgg catgtccaca aacaagttgt 61 agaccaagaa ctgagtgatg ccccattcct tgatcggctt cgccgagatc agaggtccct 121 aaggggaaga ggcagcaccc tcggtctaga catcaaagca gccacccatg ttggaaagca 181 gatagtagaa aagattctga aggaagaatc tgatgaggca cttaaaatga ccatggcctc 241 cacacctgct tcgcgataca taactgacat gactattgag gaattgtcaa gaaactggtt 301 catgctaatg cccaagcaga aagtggaagg acctctttgc atcagaatgg accaggcaat 361 catggagaaa aacatcatgt tgaaagcgaa tttcagtgtg atctttgacc gactagagac 421 cctagtatta ctaagggctt tcaccgaaga gggagcaatt gttggcgaaa tctcaccatt 481 gccttctttt ccaggacata ctattgagga tgtcaaaaat gcaattgggg tcctcatcgg 541 aggacttgaa tggaatgata acacagttcg agtctctaaa actctacaga gattcgcttg 601 gggaagcagt aatgaggatg ggggacctcc acttactcca aaacagaaac ggaaaatggc 661 gagaacagct aggtcaaaag tttgaagaga taagatggct gattgaagaa gtgagacaca 721 gactaaaaac aactgagaat agttttgagc aaataacatt catgcaagcc ttacagctgc 781 tgtttgaagt ggaacaggag ataagaactt tctcatttca gcttatttaa tgataaaaaa 841 cacccttgtt tctact // LOCUS VACSPI1A 3043 bp ds-DNA VRL 03-AUG-1990 DEFINITION Vaccinia virus serine protease inhibitor superfamily gene SPI-1. ACCESSION M24217 KEYWORDS serine protease inhibitor. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 3043) AUTHORS Kotwal,G.J. and Moss,B. TITLE Vaccinia Virus Encodes Two Proteins That Are Structurally Related to Members of the Plasma Serine Protease Inhibitor Superfamily JOURNAL J. Virol. 63, 600-606 (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence for [1] kindly submitted by B.Moss, 15-FEB-1990. FEATURES from to/span description pept < 1 243 unspecified ORF pept 255 659 16 kDa ORF pept 927 1988 SPI-1 (40 kDa ORF) pept 2150 2722 21.5 ORF BASE COUNT 1100 a 459 c 487 g 997 t ORIGIN 1 tcacataatc tatttagaga tcgagtcatg cacgattata taagtaatac atatattgat 61 cttgagtgtt tagatattat tagatcgttg gatggattcg atatcaatgg ttactttgaa 121 ggacgtacac cacttcattg cgctatacaa cataacttca ctcagattgc taagtactta 181 ttagatcgag gagctgatat agtcgtaccc aacacattga ttatacatca gtacatacag 241 taaatagcat agatatggag gaggatacaa atatttcaaa taaagttata aggtacaaca 301 ctgtcaataa tatatgggaa acattaccta acttctggac tggaactata aatccaggcg 361 tggtctcgca taaagatgat atatatgttg tatgcgacat caaagatgaa aaaaatgtta 421 aaacttgtat atttagatat aacacgaata cgtataacgg atgggaattg gtcacgacga 481 cagaaagcag attatcagct ctgcatacta ttctttataa caataccata atgatgttac 541 attgttatga atcgtatatg ttacaagata catttaatgt gtacactcgc gaatggaatc 601 atatgtgtca tcaacattcg aatagttata tcatgtacaa tatactaccc atctactaaa 661 tataatagaa taaaataaat gagtatgatc attttagata acgattgatt ttatcattac 721 cgcttcattc ttatattctt tgcttacgga acctatattt agaaacatct actaacgatt 781 ttttatgctt gcattattaa tggtatgtaa tatgattgat tgtgtacgca ataccaattt 841 gttaagtatg aatacggggt acaaacataa actgaagttt aacattattt atttatgata 901 tatatcgtta ttgtttggtc tataccatgg atatctttaa agaactaatc ttaaaacaca 961 cggatgaaaa tgttttgatt tctccagttt ctattttatc tactttatct attctaaatc 1021 atggagcagc tggttctaca gctgaacaac tatcaaaata tatagagaat atgaatgaga 1081 atacacccga tgacaataat gacatggacg tagatattcc gtattgtgcg acactagcta 1141 ccgcaaataa aatatacggt agcgatagta tcgagttcca cgcctccttc ctacaaaaaa 1201 taaaagacga ttttcaaact gtaaacttta ataatgctaa ccaaacaaag gaactaatca 1261 acgaatgggt taagacaatg acaaatggta aaattaattc cttattgact agtccgctat 1321 ccattaatac tcgtatgaca gttgttagcg ccgtccattt taaagcaatg tggaaatatc 1381 cattttctaa acatcttaca tatacagaca agttttatat ttctaagaat atagttacca 1441 gtgttgatat gatggtgagc actgagaata acttgcaata tgtacatatt aatgaattat 1501 tcggaggatt ctctattatc gatattccat acgagggaaa ctctagtatg gtaattatac 1561 taccggacga catagaaggt atatataaca tagaaaaaaa tataacagat gaaaaattta 1621 aaaaatggtg tggtatgtta tctactaaaa gtatagactt gtatatgcca aagtttaaag 1681 tggaaatgac agaaccgtat aatctggtac cgattttaga aaatttagga cttactaata 1741 tattcggata ttatgcagat tttagcaaga tgtgtaatga aactatcact gtagaaaaat 1801 ttctacatac gacgtttata gatgttaatg aggagtatac agaagcatcg gccgttacag 1861 gagtatttat gactaacttt tcgatggtat atcgtacgaa ggtctacata aaccatccat 1921 tcatgtacat gattaaagac aacacaggac gtatactttt tatagggaaa tactgctatc 1981 cgcaataaat ataaacaaat agacttttat cacgtttatc tatgtctaaa tattacaaat 2041 agtaatagta taaactaaag ctgataatac ttaaaaaaat aataatatca tttacaatta 2101 atagtataaa ctaaaaatta aacaaatcgt tattataagt aatatcaaaa tgatgatata 2161 cggattaata gcgtgtctta tattcgtgac ttcatccatc gctagtccac tttatattcc 2221 cgttattcca cccatttcgg aagataaatc gttcaatagt gtagaggtat tagtttcctt 2281 gtttagagat gaccaaaaag actatacggt aacttctcag ttcaataact acactatcga 2341 taccaaagac tggactatcg gcgtactatc cacacctgat ggtttggata taccattgac 2401 taatataact tattggtcac ggtttactat aggtcgtgca ttgttcaaat cagagtctga 2461 ggatattttc caaaagaaaa tgagtattct aggtgtttct atagaatgta agaagtcgtc 2521 gacattactt acttttttga ccgtgcgtaa aatgactcga gtatttaata aatttccaga 2581 tatggcttat tatcgaggag actgtttaaa agccgtttat gtaacaatga cttataaaaa 2641 tactaaaact ggagagactg attacacgta cctctctaat ggggggttgc ctgcatacta 2701 tcgtaatggg gtcgatggtt gattattgat tagtatattc cttattcttt ttattcacac 2761 aaaaagaaca tttttataaa catgaaacca ctgtctaaat gtaattatga tcttgattta 2821 tagatgaaga tcagccttta gaggatttta accagtatgt ttaatatgaa aaaaataaac 2881 ataacatatt ttgagattaa gcgctattgt gcaagattat attagaatca aattaatctt 2941 tcatacgaga aaaataacga catacgtcgt caacaaatta aactttttat ttattagtta 3001 actagcttat agaacttgct cattgttatg tttctaaaac ggg // LOCUS VACSPI2A 1475 bp ds-DNA VRL 03-AUG-1990 DEFINITION Vaccinia virus serine protease inhibitor superfamily gene SPI-2. ACCESSION M24218 KEYWORDS serine protease inhibitor. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 1475) AUTHORS Kotwal,G.J. and Moss,B. TITLE Vaccinia Virus Encodes Two Proteins That Are Structurally Related to Members of the Plasma Serine Protease Inhibitor Superfamily JOURNAL J. Virol. 63, 600-606 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1475) AUTHORS Kotwal,G.J. and Moss,B. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence for [2] kindly submitted by B.Moss, 15-FEB-1990. FEATURES from to/span description pept 295 1332 SPI-2 BASE COUNT 487 a 245 c 286 g 457 t ORIGIN 1 tccatggaaa aacgaaagta gtataaaagt aataaaacaa aaaaaagaat ataaaaaatt 61 tatagctact ttctttgagg actgttttcc tgaaggaaat gaacctctgg aattagttag 121 atatatagaa ttagtataca cgttagatta ttctcaaact cctaattatg acagactacg 181 taaactgttt atacaagatt gaaattatat tctttttttt atagagtgtg gtagtgttac 241 ggatatttaa tattagacta tctctatcgc gctacacgac caatatcgat tactatggat 301 atcttcaggg aaatcgcatc ttctatgaaa ggagagaatg tattcatttc tccagcgtca 361 atctcgtcag tattgacaat actgtattat ggagctaatg gatccactgc tgaacagcta 421 tcaaaatatg tagaaaagga ggagaacatg gataaggtta gcgctcaaaa tatctcattc 481 aaatccataa ataaagtata tgggcgatat tctgccgtgt ttaaagattc ctttttgaga 541 aaaattggcg ataagtttca aactgttgac ttcactgatt gtcgcactat agatgcaatc 601 aacaagtgtg tagatatctt tactgagggg aaaatcaatc cactattgga tgaaccattg 661 tctcctgata cctgtctcct agcaattagt gccgtatact ttaaagcaaa atggttgacg 721 ccattcgaaa aggaatttac cagtgattat cccttttacg tatctccgac ggaaatggta 781 gatgtaagta tgatgtctat gtacggcaag gcatttaatc acgcatctgt aaaggaatca 841 ttcggcaact tttcaatcat agaactgcca tatgttggag atactagtat gatggtcatt 901 cttccagaca agattgatgg attagaatcc atagaacaaa atctaacaga tacaaatttt 961 aagaaatggt gtaactctct ggaagctacg tttatcgatg ttcacattcc caagtttaag 1021 gtaacaggct cgtataatct ggtggatact ctagtaaagt caggactgac agaggtgttc 1081 ggttcaactg gagattatag caatatgtgt aattcagatg tgagtgtcga cgctatgatc 1141 cacaaaacgt atatagatgt caatgaagag tatacagaag cagctgcagc aacttgtgca 1201 ctggtgtcag actgtgcatc aacaattaca aatgagttct gtgtagatca tccgttcatc 1261 tatgtgatta ggcatgttga tggaaaaatt cttttcgttg gtagatattg ctctccgaca 1321 actaattgtt aaccattttt tttaaaaaat agaaaaaaca tgtggtatta gtgcaggtcg 1381 ttattcttcc aattgcaatt ggtaagatga cggccaactt tagtacccac gtcttttcac 1441 cacagcactg tggatgtgac agactgacca gtatt // LOCUS YSCARSX 1539 bp ds-DNA PLN 03-AUG-1990 DEFINITION Yeast (S.cerevisiae) chromosome III L terminal region DNA. ACCESSION M12922 KEYWORDS . SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1539) AUTHORS Button,L.L. and Astell,C.R. TITLE The Saccharomyces cerevisiae chromosome III left telomere has a type X, but not a type Y', ARS region JOURNAL Mol. Cell. Biol. 6, 1352-1356 (1986) STANDARD simple staff_review REFERENCE 2 (sites) AUTHORS Longtine,M.S., Wilson,N.M., Petracek,M.E. and Berman,J. TITLE A yeast telomere binding activity binds to two related telomere sequence motifs and is indistinguishable from RAP1 JOURNAL Curr. Genet. 16, 225-239 (1989) STANDARD full staff_entry FEATURES from to/span description site 232 236 an unusual yeast telomere repeat motif (CCCCA) BASE COUNT 514 a 551 c 130 g 344 t ORIGIN 1 cccacacacc acacccacac cacacccaca caccacacac accacaccca cacacccaca 61 ccacaccaca cccacaccac acccacacac ccacacccac acaccacacc cacacacacc 121 acacccacac acaccacacc cacacaccac acccacacac acaccacacc cacacacacc 181 acaccacacc cacaccacac ccacacccac acaccacacc acaccacacc cacaccacac 241 accacaccca cacacaccac accacacaca ccacaccaca cccacacacc acacccacac 301 acccacaccc acacacacca cacccacacc acacccacac ccacacaccc acaccctaac 361 actaccctaa cactacccta ttctaaccct gattttacct gtctcccaac ttactctcca 421 ttaccctacc tctccactcg ttaccctgtc tgattcaacc gtaccactcc caaccaccat 481 ccatctctct acttactacc accaacccac cgtccaccat aaccgttacc ctccaactac 541 ccgtatccaa ctccactacc gcttacccta ccatcgacca tgtcctactc actgtactgt 601 tgttcaccca ccatattgaa acgtctacaa atgatcgtaa ataatacaca tatacttatc 661 ctaccactct aatcccacta ccacatgcca tactcacctt cacttgtatt ctgatcggtc 721 atacgcacac ggatgctaca gtatatacca tctcaaactt acctactttc atattcatca 781 tcacccatct ctcaccatca gtaccaatgc actcgcatca ttatgcacgg cttgcctcag 841 cggtctatac cctgtgccat ttacgcataa cgcccatcat tatccacatt ttaatatcta 901 tatctcattc ggcgacacca aatattgtat aactgccctt aatacatacg ttataccact 961 tttacaccat atactaacca ctcaatttat acacacttat gtcaatataa ccacaaaatc 1021 accactaaaa tcacctaaac ataaaaatat tctactcttc aacaataata cataaacaca 1081 ctcaattgcg tatctatacc accatgacgt cattaacgta aaagttcctt aatattgcaa 1141 attgcttgaa cggatgccat ttcatgttga atgtggtaac ccaatagcat gatatgagta 1201 atgctttagt attgtttcag agttgtttca gtaatgtttt agacaaggag aacatatagt 1261 agcaaacctc taatccggta gtacttaaga aactacagtt tctatgtacg aaagcagtaa 1321 ctatgtaatt attacattta catgacatat aggaaggtcc aataaactta ctacattatg 1381 acctataagc tagatcgtaa ttcattacgt caacaggtta tgagccctag agcaatgctt 1441 ctgagaacat aatcaatcta tctagcccca acaattataa acagtggctg tacggtatcg 1501 agaccgctgc tgaatatgct aacgaatata tgaacgaat // LOCUS VAC35K 840 bp ds-DNA VRL 03-AUG-1990 DEFINITION Vaccinia virus gene for 35K complement control protein. ACCESSION X13166 KEYWORDS complement protein; regulatory protein; secretory protein. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 840) AUTHORS Kotwal,G.J. and Moss,B. TITLE Vaccinia virus encodes a secretory polypeptide structurally related to complement control proteins JOURNAL Nature 335, 176-178 (1988) STANDARD simple automatic COMMENT *source: library=HindIII in M13; strain=wildtype. EMBL features not translated to GenBank features: key from to description SITE 43 54 late transcription start site OPR 543 549 early transcription stop signal FEATURES from to/span description pept 52 > 840 35K preprotein sigp 52 108 signal peptide (AA -19 to -1) matp 109 > 840 mature 35K protein (AA 1-244) BASE COUNT 257 a 152 c 179 g 252 t ORIGIN 1 tttttattat ttgtacgatg tccaggataa catttttacg gataaataaa tatgaaggtg 61 gagagcgtga cgttcctgac attgttggga ataggatgcg ttctatcatg ctgtactatt 121 ccgtcacgac ccattaatat gaaatttaag aatagtgtgg agactgatgc taatgctaat 181 tacaacatag gagacactat agaatatcta tgtctacctg gatacagaaa gcaaaaaatg 241 ggacctatat atgctaaatg tacaggtact ggatggacac tctttaatca atgtattaaa 301 cggagatgcc catcgcctcg agatatcgat aatggccaac ttgatattgg tggagtagac 361 tttggctcta gtataacgta ctcttgtaat agcggatatc atttgatcgg tgaatctaaa 421 tcgtattgtg aattaggatc tactggatct atggtatgga atcccgaggc acctatttgt 481 gaatctgtta aatgccaatc ccctccatct atatccaacg gaagacataa cggatacgag 541 gatttttata ccgatgggag cgttgtaact tatagttgca atagtggata ttcgttgatt 601 ggtaactctg gtgtcctgtg ttcaggagga gaatggtccg atccacccac gtgtcagatt 661 gttaaatgtc cacatcctac aatatcaaac ggatacttgt ctagcgggtt taaaagatca 721 tactcataca acgacaatgt agactttaag tgcaagtacg gatataaact atctggttcc 781 tcatcatcta cttgctctcc aggaaataca tggaagccgg aacttccaaa atgtgtacgc // LOCUS BSUCISAB 2246 bp ds-DNA BCT 03-AUG-1990 DEFINITION B.subtilis site-specific DNA recombinase (cisA) and RNA polymerase sigma-K subunit (cisB; spoIVCB) genes, complete cds. ACCESSION M29040 KEYWORDS DNA recombinase; RNA polymerase sigma-K subunit. SOURCE B.subtilis (strain JH642) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2246) AUTHORS Sato,T., Samori,Y. and Kobayashi,Y. TITLE The cisA cistron of Bacillus subtilis sporulation gene spoIVC encodes a protein homologous to a site-specific recombinase JOURNAL J. Bacteriol. 172, 1092-1098 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Kobayashi, 14-OCT-1989. FEATURES from to/span description pept 134 1636 DNA recombinase (cisA) (gtg start codon) pept 2014 1544 (c) RNA polymerase sigma-K subunit (cisB) binding 119 127 ribosome binding site signal 1635 1660 pot. transcription termination signal BASE COUNT 776 a 405 c 475 g 590 t ORIGIN 3 bp upstream of PvuII site. 1 cagctgttac atagcattac ccaaggggtg atgcatttta tgaaagtgat aatcatcgag 61 ggaccgcaag ctgacaaatg cattaacgat tgctatcatt atttaataaa actttatagg 121 aaggagattc agggtgatag caatatatgt aagggtatcg accgaggaac aagcgatcaa 181 gggatcgagc atcgacagcc aaatcgaggc ctgtataaag aaagcaggga ctaaagatgt 241 gctgaagtat gcagatgaag gattttcagg agagctttta gaacgtccgg ctttgaatcg 301 cttgagggag gatgcaagca agggacttat aagtcaagtc atttgttacg atcctgaccg 361 tctttctcgg aaattaatga atcagctaat cattgatgac gaattgcgaa agcgaaacat 421 acctttgatt tttgtaaatg gtgaatacgc caattctcca gaaggtcaat tgtttttcgc 481 aatgcgcggg gcaatctcag aatttgaaaa agccaaaatc aaagaacgga catcaagcgg 541 ccgacttcaa aaaatgaaaa aaggcatgat cattaaagat tctaaactat atggctataa 601 atttgttaaa gagaaaagaa ctcttgagat attagaagag gaagcaaaaa tcattcggat 661 gatttttaac tatttcaccg atcataaaag cccttttttc ggcagagtaa atggtattgc 721 tctacattta actcagatgg gggttaaaac aaaaaaaggc gccaaagtat ggcacaggca 781 ggttgttcgg caaatattaa tgaactcttc ctataagggt gaacatagac agtataaata 841 tgatacagag ggttcctatg tttcaaagca ggcagggaac aaatctataa ttaaaataag 901 gcctgaagaa gaacaaatca ctgtgacaat tccagcaatt gttccagctg aacaatggga 961 ttatgctcaa gaactcttag gtcaaagtaa aagaaaacac ttgagtatca gccctcacaa 1021 ttacttgtta tcgggtttgg ttagatgcgg aaaatgcgga aataccatga cagggaagaa 1081 aagaaaatca catggtaaag actactatgt atatacttgc cggaaaaatt attctggcgc 1141 aaaggaccgc ggctgcggaa aagaaatgtc tgagaataaa ttgaaccggc atgtatgggg 1201 tgaaattttt aaattcatca caaatcctca aaagtatgtt tcttttaaag aggctgaaca 1261 atcaaatcac ctgtctgatg aattagaact tattgaaaaa gagatagaga aaacaaaaaa 1321 aggccgcaag cgtcttttaa cgctaatcag cctaagcgat gacgatgatt tagacataga 1381 tgaaatcaaa gcacaaatta ttgaactgca aaaaaagcaa aatcagctta ctgaaaagtg 1441 taacagaatc cagtcaaaaa tgaaagtcct agatgatacg agctcaagtg aaaatgctct 1501 aaaaagagcc atcgactatt ttcaatcaat cggtgcagat aacttaactc ttgaagataa 1561 aaaaacaatt gttaacttta tcgtgaaaga agttaccatt gtggattctg acaccatata 1621 tattgaaacg tattaaagag gggtgtatgc accccccttt tgtaattaca atctcatttt 1681 caatacacct cgctgcatac gtcgccacct ttgtcccttt tccagcggaa tagctttcaa 1741 ttcctttaat aagcccgatc gttccgatgg agattaagtc ctctgcatcc tcacctgtat 1801 tttcgaactt tttcacaata tgggcgacca agcgaagatt atgctcaatc agcatgtttc 1861 tggcatgttc atccccttta gccatgagct ctaagtattt tttttcttcg ctgcttgaga 1921 gcggttgtgg aaaggcattg tttttcacgt aagatactaa aaagacaagc tctttaacaa 1981 caaagccgag cgctgcgaaa acacctgtca ccatcgtcac ctccacaaaa gtatgtaggc 2041 aaaagcctat atgtaaatgt atgtgaccgg gaggctgtct gtgtctgtac cgggaaattt 2101 ctcgcggggg agtccggctt tggtaagcag catactaact tcaaatgtta aaaatattct 2161 tcatcaagcg cccatacatt gaaatgaaca aaggtatggg gggatgggga tgaaaaaatg 2221 gatggcaggc ctgtttcttg ctgcag // LOCUS BSUSIG43 5127 bp ds-DNA BCT 03-AUG-1990 DEFINITION Bacillus subtilis sigma 43 operon with P23-dnaE-rpoD genes (dnaE for DNA primase, rpoD for RNA polymerase). ACCESSION X03897 KEYWORDS DNA primase; RNA polymerase; polymerase; primase; unidentified reading frame. SOURCE Bacillus subtilis. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 5127; enum. 1 to 5127) AUTHORS Wang,L.-F. and Doi,R.H. TITLE Nucleotide sequence and organization of Bacillus subtilis RNA polymerase major sigma (sigma 43) operon JOURNAL Nucleic Acids Res. 14, 4293-4307 (1986) STANDARD simple automatic COMMENT EMBL features not translated to GenBank features: key from to description PRM 728 733 -35 region P1 PRM 750 755 -10 region P1 PRM 754 759 -35 region P2 PRM 778 783 -10 region P2 RBS 848 851 pot. rRNA binding site PRM 942 949 -35 region P3 PRM 963 971 -10 region P3 RBS 1462 1472 pot. rRNA binding site RBS 3462 3474 pot. rRNA binding site OPR 4609 4631 palindrome pot. transcription terminator FEATURES from to/span description pept 855 1445 P23 unidentified reading frame pept 1479 3290 dnaE pept 3489 4604 rpoD BASE COUNT 1677 a 904 c 1177 g 1369 t ORIGIN 1 gaattcttat gttgaagata taggcactat caatgaagtg atttcacttg caaaggcaga 61 cggcggcatt atctgtttta cactcgtggt gccggaaatc agagaatatt tgatagccga 121 agcggaaaaa gcaaatgttt tatattatga tattatcggc ccgttgattg ataaaatgga 181 aacagcctac ggtttaacag cgaaatacga accggggcgg gtgcgccagc ttgatgaaga 241 ttatttcaaa aaagtggagg ccatcgagtt tgcagttaaa tacgatgatg gacgtgatcc 301 aagagggatt ttaaaagctg atatcgtttt gatcggcgtg tcaagaacgt ctaaaacacc 361 gctgtctcaa tatctcgcac acaaacgcct gaaggttgcc aatgttccga ttgtaccgga 421 ggttgatccg ccggaagaac tctttaacgt tgatccgaaa aaatgcatcg gtttaaagat 481 tagccctgat aaactgaatc atatcagaaa agaacgttta aaatcactcg ggcttaatga 541 taaagcgatt tatgcaaata tcaacagaat caaagaggaa ctcgagtatt tcgaaaagat 601 tgtggatcgg atcggctgcc aggttgttga tgtttcaaat aaagcggttg aggaaacagc 661 aaatattatc catcatctca aaacaaaaaa catataactc aggacgctct atcctgggtt 721 tttggctgtg ccaaaaggga ataatgaaaa acaatagcat ctttgtgaag tttgtattat 781 aataaaaaat tgtgataaaa tgattaattt taggtttaag gatcgtgtga tacgaataaa 841 ctattatggg taagatgtca agaatttctc ccggaaattt ttcgacaaat tcatatacat 901 ccacaataat aaaggatgtg cgattttgct tgcttttatg caggagttta atggagggat 961 ggagaattac tcttcttaat gaacaagaaa agacgatttt tgtcgatgct gatgcttgtc 1021 cggtaaaaga tgaaatttta caaacagcat ccgagtatga agttcaagtt ctttttgtcg 1081 cttcatttga acattatcag ctttccagaa gcaatgaaga aaaatggaag tatgttgatc 1141 ctcataaaga agctgctgat ttatatatcg caaatcacgt gaaaccggga gatattgttg 1201 tgacgcagga catcggatta gcatctctgc tgttgaacag aaatgtctct gttatgtcgg 1261 aaagaggtcg tctttacaag gaagacacga ttgattttgc cctagagggc cgtcattttt 1321 ccggcaaaca aagaagaaaa ggcgtatatg ccaaagggcc taaaaaattg aataaagaag 1381 atcgagaacg atttattaca ctgctgcaaa aaatcctgtc gaacgatgaa gggattttgc 1441 actaaagcat cgaataatgt acgacggagt gttataagat gggaaatcgg ataccagatg 1501 aaattgtgga tcaggtgcaa aagtcggcag atatcgttga agtcataggt gattatgttc 1561 aattaaagaa gcaaggccga aactactttg gactctgtcc ttttcatgga gaaagcacac 1621 cttcgttttc cgtatcgccc gacaaacaga tttttcattg ctttggctgc ggagcgggcg 1681 gcaatgtttt ctctttttta aggcagatgg aaggctattc ttttgccgag tcggtttctc 1741 accttgctga caaataccaa attgattttc cagatgatat aacagtccat tccggagccc 1801 ggccagagtc ttctggagaa caaaaaatgg ctgaggcaca tgagctcctg aagaaatttt 1861 accatcattt gttaataaat acaaaagaag gtcaagaggc actggattat ctgctttcta 1921 ggggctttac gaaagagctg attaatgaat ttcagattgg ctatgctctt gattcttggg 1981 actttatcac gaaattcctt gtaaagaggg gatttagtga ggcgcaaatg gaaaaagcgg 2041 gtctcctgat cagacgcgaa gacggaagcg gatatttcga ccgcttcaga aaccgtgtca 2101 tgtttccgat ccatgatcat cacggggctg ttgttgcttt ctcaggcagg gctcttggca 2161 gccagcagcc taagtatatg aacagtcctg aaaccccgct ctttcataaa agcaaactgc 2221 tttacaattt ttataaggcc cgccttcata tcagaaagca ggaaagagca gtcttatttg 2281 aagggtttgc tgatgtctat acggccgtaa gctcggatgt aaaggaaagc atagccacga 2341 tgggaacgtc tcttacagat gatcatgtca agatcctgag aagaaacgtc gaagaaatca 2401 ttctttgcta tgactctgat aaagccggtt atgaagccac cttaaaagct tcggagcttc 2461 tgcaaaaaaa aggctgcaaa gtcagagttg caatgattcc tgacggattg gaccctgatg 2521 attacatcaa aaaattcggc ggggaaaaat ttaaaaacga cattattgac gcaagtgtca 2581 ccgtaatggc gttcaaaatg caatatttcc gaaaaggaaa gaacctgtcc gatgaaggcg 2641 accgcctagc ttacattaaa gacgtactga aagaaatcag cacgctttca gggtctctag 2701 agcaggaagt ctatgtaaag cagcttgctt cagagttttc gctttcacag gagtctttaa 2761 ctgagcagct gtctgttttc agcaagcaaa acaaacctgc tgacaatagc ggtgaaacta 2821 aaacgcggcg agcgcatctg acgacaaaag caaggcaaaa acgtttgcgt ccggcgtatg 2881 aaaatgcaga aaggctgtta ctcgctcaca tgcttcgaga tcggagcgtc atcaaaaaag 2941 tgattgaccg ggtagggttt caatttaata ttgatgagca ccgggcatta gccgcttatc 3001 tttatgcttt ttatgaagag ggagccgagc tgacgcctca gcatctgatg gccagggtga 3061 cggatgatca tataagccag ctcttgtccg atatattaat gcttcaggtt aatcaagagc 3121 ttagcgaagc cgagttatca gattatgtaa aaaaagtgtt gaatcaaaga aattggtcaa 3181 tgataaaaga aaaagaggcg gaaagagccg aagcagaaag gcaaaaagat tttttaagag 3241 ctgcttcttt ggctcaagaa atcgttacat tgaaccgatc tttaaaataa ctggagaact 3301 gatgaggagc atttattggc aatgattcct tgcggaggag caaatagatc gcttaacctc 3361 atcatgaatt gtcatttcat tattcgcaca ttgttaaagg cagttcacat agaaaacgcc 3421 tgaatggacc gaataagaat cataccgctt atagaattcg ttgcaagctt tggaaggagg 3481 gatccataat ggctgataaa caaacccacg agacagaatt aacattcgac caagtaaaag 3541 agcaattaac agagtctggt aaaaaacgtg gcgttttgac atatgaagaa attgctgagc 3601 gtatgtccag ctttgaaatt gaatcagacc aaatggatga gtattatgaa tttttaggtg 3661 aacaaggtgt tgaattaatt agtgagaatg aagaaacaga agatcctaat attcagcagc 3721 ttgccaaagc cgaagaagaa tttgacctta atgacctaag tgtaccgcct ggcgttaaaa 3781 tcaatgaccc agttcgtatg tatttaaagg aaatcggtcg ggttaacctt ctttctgcaa 3841 aagaagaaat cgcctacgct caaaagattg aagaaggtga cgaagaatct aaacgcagat 3901 tggctgaagc gaacctgcgg cttgttgtca gtatcgcaaa acggtatgtc ggacgcggta 3961 tgctgttcct tgatctgatc catgaaggaa acatgggcct gatgaaagcc gttgaaaaat 4021 ttgattatcg caaaggttat aaattcagta cgtatgctac gtggtggatc agacaggcga 4081 ttacacgcgc cattgccgat caggcgagaa cgatccggat tcccgttcat atggttgaaa 4141 ccattaataa attaatccgt gtgcagcgtc aattactgca agacttaggc agagaaccaa 4201 cacctgaaga aattgcggaa gatatggatt taacgcctga aaaagtacgc gaaatcttaa 4261 agattgctca agagccggta tctctggaaa caccgatcgg tgaagaggat gactcgcacc 4321 ttggtgattt cattgaagac caagaagcaa cttcaccttc tgaccacgcc gcatacgagc 4381 tattgaaaga gcagctggaa gatgtgcttg atacgttaac tgatcgtgaa gaaaatgtat 4441 tgcgtcttcg attcggtctt gatgacggcc gtacaagaac attagaagag gtcggcaaag 4501 tatttggagt aacgagagag cgtattcgac aaatcgaagc caaagcgttg cggaaactaa 4561 gacatcctag cagaagtaaa cgtttgaaag atttccttga ataagatgga acgggtcttg 4621 aagatccgtt cttctttttt taaaaagata tatggataat atgcctttat tttactgaaa 4681 aatgatgtca tttgcaaatg aacattgtgg tgaaaaattt caaaatctaa ttccatattt 4741 tctattgtaa gcgtatacaa tacattatac aatagaataa aaaggatatt agagatttag 4801 gcatgtttct atttcaattt tgtataaagt gtgaataaaa aacttttgta tagcaatcca 4861 tttacttttt gtaaaaataa gttagaatta gaagtgttta cataggggga aggatcaaaa 4921 agggggaagg gaaatgaaat ggaacccgct tattccattt ttgctgatcg ctgttttagg 4981 aatgggtcta actttctttt tatcagtaaa aggacttgat gactctcggg aggttgcgag 5041 cggaggagaa agcaaatctg ctgaaaagaa agatgcaaac gcttcaccag aagaaattta 5101 caaggcaaat cgcatcgcat cgcatgc // LOCUS CHKMHBFVB 4790 bp ds-DNA VRT 03-AUG-1990 DEFINITION Chicken MHC class I B-FIV-B12 alpha-chain gene, complete cds. ACCESSION M31012 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Chicken (strain CB; haplotype B12), clone cBF23. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 4790) AUTHORS Kroemer,G., Zoorob,R. and Auffray,C. TITLE Structure and expression of a chicken MHC class I gene JOURNAL Immunogenetics 31, 405-409 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.Kroemer, 21-DEC-1990. FEATURES from to/span description pept 1405 1471 MHC B12 alpha-chain, exon 1 1589 1852 MHC B12 alpha-chain, exon 2 2081 2353 MHC B12 alpha-chain, exon 3 2457 2729 MHC B12 alpha-chain, exon 4 2803 2910 MHC B12 alpha-chain, exon 5 3021 3053 MHC B12 alpha-chain, exon 6 3213 3245 MHC B12 alpha-chain, exon 7 (put.) 3400 3419 MHC B12 alpha-chain, exon 8 IVS 1472 1588 MHC B12 alpha-chain intron A IVS 1853 2080 MHC B12 alpha-chain intron B IVS 2354 2456 MHC B12 alpha-chain intron C IVS 2730 2802 MHC B12 alpha-chain intron D IVS 2911 3020 MHC B12 alpha-chain intron E IVS 3054 3212 MHC B12 alpha-chain intron F IVS 3246 3399 MHC B12 alpha-chain intron G pre-msg 1347 3587 MHC B12 alpha-chain and introns (alt.) pre-msg 1379 3587 MHC B12 alpha-chain and introns (alt.) binding 1328 1337 Sp1 binding site site 1219 1228 interferon responsive element site 1263 1270 cAMP responsive element site 1238 1244 S box (B-L-beta) site 1247 1255 X box (B-L-beta) site 1288 1294 Y box (B-L-beta) site 1313 1317 CAAT box BASE COUNT 812 a 1637 c 1460 g 881 t ORIGIN Microchromosome 16. 1 acctataagg gacatgggaa cacagcagga ggtgtttggg ttctggggac atggggggaa 61 ctctggggaa ggagatgtga ggtgacatgg gtgacatggc aggaggggac ccaagggtgc 121 acacatccct gtgtgccaca ggggtgtccc caaccccctc accccagagg acatccccca 181 tgtccccata tcccccggtg tccccatacc tcaatttgta ggagctggtt gaagttgttg 241 tggggtgtga gggcaaatgg gactgggagg gtgcaggccc agccccacgg tccccctcag 301 cccatgcagt caccgtcccg gtcaccggcc ccacagctgc cagcaacagc ccatgggggt 361 ccccaaggcc acaggggcgg ggggccacca ggacccacct gagggaacac tgcttgagcc 421 accagcccca taactgctcc ataagccctc ccagatccca taagcatcca gacacccctt 481 aggtccatag gaatcccccc agctctggcc ccataacccc acccaaccac acagaaatct 541 cccttagtcc ccccagactc catcacaccc caatcctata gcagccccca gccccaccag 601 tgccccatag tccctcatag gcactgcaga acattcatag tcccatagct accccacaaa 661 caccccatag ccccatactc agggtcctgg gagctggccc tcattaaggg agtgaggaag 721 aggaggagga agagctgaag ccagtgaggc ccatggctcc ggcagtgtgg ggcagatggg 781 agcagtggcg cagggacagc ggggttgggg tcaatcatta acttgggggg tggacattgg 841 ggtccacctg cagaccctga gggcatcgac ctcagatgga ccacgtggaa tgtggggaag 901 cggtgtgggg cggcgctgtg gggaaatggc actgatccca aaggaagccc tgggggacac 961 cactgtcacc accccctccc ccccggacac gagcctctaa ccctatggat gtgacccccc 1021 ccccaaacag ttccttgact ggaatggaga atgactcctc ctcttctcac ggcccccgtc 1081 ctcccggcag agaccctcgg ccccctcccc cggggtaagg acgcggcccc gtgcgaacac 1141 agcccgggtg cccccccgtg ctcgaagggc cgggggttcc cacaccacgc ccatcccctc 1201 ccccgctccg cctttcgctt tcgcttcaca acctgaggga gcgcattctg cctggcgccc 1261 gatgacgtca cataaaactc caactaccat tggcggagag gcgacggagg agccaatggg 1321 ggcgcggggc ggggcggagg agtaggaaaa gctgaaggag ctgcgctggg tgcggcggac 1381 ttgagagtgc agcggtgcga ggcgatgggg ccgtgcgggg cgctgggcct ggggctgctg 1441 ctcgccgccg tgtgcggggc ggcggccccc ggtgagtgcg gccggaccgg gacccctccc 1501 ccgcccgtaa ccccaccccg gggctgtgtc cgtgggatcc tgagacccac acccgcggct 1561 cacggcccgc tgtgctccgt ccccgcagag ctccataccc tgcggtacat ccaaacggcg 1621 atgacggatc ccggccccgg gcagccgtgg ttcgtgactg tggggtacgt ggacggggaa 1681 ctcttcgtgc actacaacag caccgcgcgg aggtacgtgc cccgcaccga gtggatagcg 1741 gccaaggcgg accagcagta ctgggatgga cagacgcaga tcggacaggg caatgagcag 1801 attgaccgcg agaacctggg catactgcag cggcgctaca accagaccgg cggtgagcac 1861 ggccggggcc gcggctccgt gggtgtggga tgggctccat gcgcagtgcc gcccacaccc 1921 cccaggcctg gccctgcccg gcggcaccgt cccggggctg cccgtcacag ccccaccgcg 1981 ctcggggtgc cgcgtcccgg ggggacccca acccatcccc gctgcagtgg gagccccgga 2041 gccggagggg cccctcaccc cctgcccggc tgtgtttcag ggtctcacac ggtgcagtgg 2101 atgtacggct gtgacatcct cgagggcggc cccatccggg ggtattatca gatggcctac 2161 gatgggagag acttcactgc cttcgacaaa ggcacgatga cgttcactgc ggcagttcca 2221 gaggcagttc ccaccaagag gaaatgggag gaagagagtg aacctgagag gtggaagaat 2281 tacctggagg aaacctgcgt ggagtggctg cggagatacg tggaatacgg gaaggctgag 2341 ctgggcagga gaggtgagcg gggtgggggg gggagcggct gcagtgtggg gctggacgtg 2401 gggcgggggc tcagcgtggg gagctcagcc cggccctcac tgccgcccgc ccgcagagcg 2461 gcccgaggtg cgagtgtggg ggaaggaggc cgacgggatc ctgaccttgt cctgccgcgc 2521 tcacggcttc tacccgcggc ccatcgttgt cagctggctg aaggacggcg cggtgcgggg 2581 ccaggacgcc cactcggggg gcatcgtgcc caacggcgac ggcacctacc acacctgggt 2641 caccatcgat gcgcagccgg gggacgggga caagtaccag tgccgcgtgg agcacgccag 2701 cctgccccag cccggcctct actcgtgggg tgagtgaggg gatgtggggc tggggggctg 2761 cgggctgccc cttcccctgc tgatggcccc gctctccccc agagccgcca cagcccaacc 2821 tggtgcccat cgtggcgggg gtggccgtcg ccattgtggc cattgccatc atggttggtg 2881 ttggattcat catctacaga cgccatgcag gtaaaagcag aggggtgcag gcgggcagtg 2941 gggggatctg ggtcccccct tgggagccct catgcctggc tgtgatgtga acctgtgctg 3001 aagcatctct ctgtctgcag ggaagaaggg gaagggctac aacatcgcgc ccggtgagtg 3061 atgagggcag cgctgtcccc cacctctgcc cagtgccagg gcggtcctgg ggtctgcact 3121 ttctcccagg gtacccattc ctggtgcttg gggctgctcc acgccccata gggagcacag 3181 ggctggatct cacagctgtt cctcccttat agacagggaa ggtggatcca gcagctcgag 3241 cacaggtgcg tgtggggctg tgggttggga ggggtccgtg tgctctctgt ggtactgccc 3301 agggctgggc tatgctgggg ctctgcgggg agacccccgg agcagagggt tgggatgtga 3361 acctggcccc gtgggacatc atcccttctc atcccacagg gagcaacccc gccatctgag 3421 tgctgtgctt cagcctgcaa ggagccaaca gtccacacca gcatttgggg tcagtgatgg 3481 gcacagcccc atcctcttga cctctcacat ctcattctgc ttcctatgct gactgttatg 3541 ctttgcctgc actgcttcct gtgaaataaa atgatgggcc attctgtggc tcagcttgcc 3601 tgcagtctgc actgtgctgt ggttggggat ggggtggatg aggggaccgt gtcagtttgg 3661 ctgctcaggg tgcagatgtg gccctgtgct gagtacccac tgccctcccc tcttctatct 3721 gcctgctgct cactcccccc tgtgtacccc catcccttct cacctctcct ctgtgacccc 3781 atgctggtgg ttgcttgctc cctgtcctgg cagaactctc attttccaat ggcatccctg 3841 tttgttggga tgtggtctcc ttggtcctcc ccccagcagt cactgcacat atccacccca 3901 cttccccccc aggttgctgt cccacagcac tcctatttcc ttctctgccc tccccccccg 3961 cccatccagc tgcctctgca atcctcaccc acaaccttgc ccactccacc tccctcatcc 4021 cgcccttccc ccagctgtcc tgtccctgct gggccccctc cccccacatt gtaccctaca 4081 cccaaataaa tacgtttgtt ctgctgccct ccagccgtct tctggtttat ttcccccccg 4141 atttgttgtt gtggcgtccg ctcttctcct ggggggaagg ggctctgggg gtccctcatt 4201 ctccctgctc ttcttacagc accgggactc cccgcgctga gatcccaaca cacccgggta 4261 caaacatgcg gcttttattc ccagttctgt gtcccacccc cggccctggt ggcactcagt 4321 ggcactgcag tccatgcagt ggccgttgtg tgtcgtagag cagcggtacc cgagcgcgcc 4381 ccggctcggc atccatgtgc ccacggcaca gctcttgtgg tcccttgtcg ctgccccggt 4441 gtccccacct ccaccctcag tgtccccaac gtgcagtgtc cccatcccac atcccaccac 4501 cccctgccct ccccatccat aactcccgct atccctctcc ctctgtcccc ctcccccggt 4561 gctccctgtt gtccccagtc ccgcagaagg ctgccggggg cagcacctcg tgggggggtc 4621 cctgctgcca cacctctccc ccgtccaaca ccactaccca ttgtaccagg gctgcccgcc 4681 ccgtcaccat cagcactgca catccttacc ctctggctag ctggatccaa cttgtggaca 4741 ggcttctgat gccctgtgta tgacaaggtt tctgatcctc gcgcctgcag // LOCUS DOGMHDL9A 1499 bp ss-mRNA MAM 03-AUG-1990 DEFINITION Dog MHC class I DLA-A9/A9 alpha-chain mRNA, complete cds. ACCESSION M32283 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Dog (#C624; haplotype DLA-A9/A9 homozygote) peripheral blood leukocyte, cDNA to mRNA, clone pBT-I16. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 1499) AUTHORS Sarmiento,U.M. and Storb,R. TITLE Nucleotide sequence of a dog class I cDNA clone JOURNAL Immunogenetics 31, 400-404 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by U.M. Sarmiento, 22-FEB-1990. FEATURES from to/span description pept 1 1089 MHC DLA-A9/A9 alpha-chain precursor sigp 1 72 MHC DLA-A9/A9 alpha-chain signal peptide matp 73 1086 MHC DLA-A9/A9 alpha-chain mRNA 1 1499 MHC DLA-A9/A9 alpha-chain mRNA BASE COUNT 296 a 425 c 489 g 289 t ORIGIN 1 atggaggtgg tgatgccgcg agccctcctc gtgctgctgt cggcggccct ggccctgacc 61 ccgacccggg cgggctccca ctccctgagg tatttctaca cctccgtgtc ccggcccggc 121 gcgggggacc cccgcttcat cgccgtcggc tacgtggacg acacgcagtt cgtgcggttc 181 gacagcgacg cggccactgg gaggatggag ccgcgggcgc cgtgggtgga gcaggagggg 241 ccggagtatt gggaccggca gacgcggacc atcaaggaga ccgcacggac tttccgagtg 301 gacctggaca ccctgcgcgg ctactacaac cagagcgagg ccgggtctca cacccgccag 361 accatgtacg gctgtgacct ggggcccgac gggcgccttc tccgcgggta cagtcaggac 421 gcctacgacg gcgccgatta catcgccctg aacgaggacc tgcgctcctg gaccgcggcg 481 gacacggcgg cgcagatcac ccagcgcaag tgggaagcgg caggtgtagc agagctacaa 541 tggaggaact acctggagac gacgtgcgtg gagtggctgc ggaggtacct ggagatgggg 601 aaggagacgc tgctgcgcgc agacccccca agcacacgtg tgacccacca ccccgtctct 661 gaccatgagg tcaccctgag gtgctgggcg ctgggcttct accctgcgga gatcaccctg 721 acctggcagc gggatgggga ggaccagacc caggacacag aggttgtgga cacaaggcct 781 gcaggagatg ggaccttcca gaagtgggcg gccgtggtgg tgccttctgg acaggagcag 841 agatacacgt gccacgtcca gcatgagggg ctgccggagc ctatcacgcg gagatgggag 901 ccttcccctc tgtccaccat tgtcatcgtc agcattgctg ctctggttct cctcgtggtc 961 gctggggtga ttggagctgt gatctggagg aagcagcgct caggaggaaa aggaccaggc 1021 tactctcatg ctgcacgtga tgacagtgcc cagggctctg atgtgtctct gacagctcct 1081 agagtgtgag accagctgcc tgtgggactg acggatgcaa gatgtgttca catctcacgt 1141 gatgacatca acaaccctgg cttgtctctg caaacagtgt caggatgtgc ctgtgtccct 1201 aggagcataa tgtgaggagg tggggagatt ggcccaccct gcccaccatg acctgtccct 1261 aatctgatgt gcgctctcct ctctgatgtg ctttcctgtc caggagaggc agggctggac 1321 catctccatc cctgtctttg tttcatgttg agtactaatc tcttactatc cgattgaaaa 1381 taagaatcca gatatgagtt tgtgtttcct gagtcttggg atgtggggct gatgaggtaa 1441 taaaaggaga tttgtgaagt tgagagagca aataaatgga agccctgaga accttccag // LOCUS DOGMHDRBA 1165 bp ss-mRNA MAM 03-AUG-1990 DEFINITION C.familiaris MHC class II DLA-DR-beta-1 gene, complete cds. ACCESSION M29611 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE C.familiaris (dog #C624, haplotype DLA-Dw3/Dw3) leukocyte, cDNA to mRNA, clone pBT-DR5. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 1165) AUTHORS Sarmiento,U.M. and Storb,R. TITLE Nucleotide sequence of a dog DRB cDNA clone JOURNAL Immunogenetics 31, 396-399 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by U.M.Sarmiento, 31-OCT-1989. FEATURES from to/span description pept 40 840 MHC DLA-DR-beta-1 precursor sigp 40 126 MHC DLA-DR-beta-1 signal peptide matp 127 837 MHC DLA-DR-beta-1 mRNA 1 1165 MHC DLAB1 mRNA BASE COUNT 237 a 313 c 319 g 296 t ORIGIN 1 cgactgctcc cctgcaccct gtcctttctg ttttccagca tggtgtgtct gtgttttctt 61 ggaggctcct ggatgacagc tctgatgctg atactgatgg tgctgaaccc tcccttcgct 121 tgggccaggg acaccccacc acatttcttg gaggtggcaa agtccgagtg ctatttcacc 181 aacgggacgg agcgggtgcg gttcgtggaa agatacatcc ataaccggga ggagttcgtg 241 cgcttcgaca gcgacgtggg ggagttccgg gcggtcacgg agctcgggcg gcccgtcgct 301 gagtcctgga acgggcagaa ggagatcttg gagcaggagc gggcaacggt ggacacctac 361 tgcagacaca actacggggt gattgagagc ttcacggtgc agcggcgagt cgagcctaca 421 gtgactgtgt atcctacgaa gactcagacc ttgcagcatc acaacctcct ggtctgctct 481 gtgaatggtt tctatccagg ccacattgaa gtcaggtggc tccggaatgg ccaggaagag 541 gaagctgggg ttgtgtccac aggcctaatc cgtaatggag actggacctt ccagatcctg 601 gtgatgctgg agatagttcc tcagagcgga gaggtctaca cctgccaagt ggagcaccca 661 agtttgacaa gccctgtcac cgtggaatgg agggcacagt ctgattctgc acagagcaag 721 atgctgagtg gaatcggggg ctttgtcctg ggtctgctct tccttgcagt ggggctgttc 781 atctacttca gaaatcagaa gggacactct ggacttcagc caacaggact cctgagctga 841 agtggagatg gtgaccctca agaaaaaacc ttcctttcca gcttcttctc agagtccaaa 901 ggtttccagc ttggctctta ttcttctaca aatacagcgc ctttctcagg atctggtttg 961 ccctggcttc agtgaccctg cacagcttgt gttccccaat ggctccctct gccgtgtcct 1021 tgacctggaa cccccagtgc tgactgcagt accttatctg cattctctcc ttcctcttcc 1081 tatgcagccc ttatgtttcc cttgcatggg aactcacctt ctgctcatgt ttctttataa 1141 acttttctca aataaatgtg gattg // LOCUS LACTHYSYN 957 bp ds-DNA BCT 03-AUG-1990 DEFINITION L.lactis thymidylate synthase (thyA) gene, complete cds. ACCESSION M33770 KEYWORDS thymidylate synthase. SOURCE L.lactis lactis (strain 712) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 957) AUTHORS Ross,P., O'Gara,F. and Condon,S. TITLE Cloning and characterization of the thymidylate synthase gene from Lactococcus lactis subsp. lactis JOURNAL Appl. Environ. Microbiol. 56, 2156-2163 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by P.R.Rose, 04-APR-1990. FEATURES from to/span description pept 87 926 thymidylate synthase (EC 2.1.1.45) binding 77 80 ribosome binding site site 28 33 -35 region site 51 56 TATA box BASE COUNT 329 a 139 c 192 g 297 t ORIGIN 1 tctgagaggt tattttggga aatactattg aaccatatcg aggtgtgtgg tataatgaag 61 ggaattaaaa aagataggaa aatttcatga cttacgcaga tcaagttttt aaacaaaata 121 tccaaaatat cctagataat ggtgtttttt cagaaaatgc aagaccaaag tataaggatg 181 gtcaaatggc gaatagcaaa tatgtcactg gttcattcgt tacttatgat ttgcaaaagg 241 gggagtttcc aattaccact ttgcgtccaa ttccaatcaa atctgctatt aaagaattga 301 tgtggatata ccaagaccaa acaagtgaac tttctgttct cgaagagaag tatggagtca 361 aatactgggg agaatgggga attggtgatg gtacgattgg gcaacgttat ggtgcaacag 421 tcaaaaaata taatatcatt ggtaaattat tagaaggctt ggccaaaaat ccatggaatc 481 gtcgtaatat catcaacctt tggcagtatg aagattttga ggaaacagaa ggtcttttac 541 catgtgcttt ccaaacgatg tttgatgtcc gtcgagaaaa agatggtcag atttatttgg 601 atgccacact gattcaacgt tcaaacgata tgcttgtagc ccaccatatc aatgcgatgc 661 aatatgttgc tttgcaaatg atgattgcaa aacatttttc ttggaaagtt gggaaattct 721 tttattttgt aaataattta catatttatg ataatcagtt tgagcaggca aatgaattaa 781 tgaagcgaac agcttctgaa aaagaacctc gtttggtcct taatgttcct gatggtacaa 841 actttttcga tattaaacct gaagattttg aacttgtgga ctatgagcca gtaaaacctc 901 aattgaaatt tgatttagca atttaaatta atctataagt tactgacaaa actgtca // LOCUS MUSLY48A 121 bp ss-mRNA ROD 03-AUG-1990 DEFINITION Mouse leukosialin (Ly48) mRNA, partial cds. ACCESSION M30693 KEYWORDS leukosialin. SOURCE Mouse (strain C57BL/6J) adult thymus EL-4 tumor, cDNA to mRNA, clone ML18. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 121) AUTHORS Baecher,C.M., Dorfman,K.S., Mattei,M.G. and Frelinger,J.G. TITLE cDNA cloning and localization of the mouse leukosialin gene (Ly48) to chromosome 7 JOURNAL Immunogenetics 31, 307-314 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.G.Frelinger, 14-DEC-1989. FEATURES from to/span description pept < 3 > 119 leukosialin (Ly48; AA at 3) BASE COUNT 28 a 28 c 43 g 22 t ORIGIN Chromosome 7 put. 7F3/7F4 bands. 1 gccagggctc tttagtacta gaagagctga agcctgggtc gggtcccaac ctgaaggggg 61 aggaagagcc gcttgtgggc agtgaggatg aagctgtgga aaccccaact tctgacggtc 121 c // LOCUS MUSUBILP 2912 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse housekeeping DXS253E (P3) and DXS254E (GdX) genes, 3' end, and complete cds. ACCESSION J04761 KEYWORDS housekeeping protein. SOURCE Mouse (strain Balb/c) liver DNA, clone lambda-M7P. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2912) AUTHORS Filippi,M., Tribioli,C. and Toniolo,D. TITLE Linkage and sequence conservation of the X linked genes DXS253E (P3) and DXS254E (GdX) in mouse and man JOURNAL Genomics 7, 453-457 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by D.Toniolo, 10-APR-1990. FEATURES from to/span description pept < 1 550 housekeeping protein DXS253E (P) (AA at 2) pept 1536 1583 housekeeping protein DXS254E (GdX), exon 1 1740 1845 housekeeping protein DXS254E (GdX), exon 2 2089 2297 housekeeping protein DXS254E (GdX), exon 3 2413 2523 housekeeping protein DXS254E (GdX), exon 4 pre-msg < 1 801 DXS253E mRNA pre-msg 1533 > 2912 DXS254E mRNA and introns IVS 1584 1739 DXS254E intron A IVS 1846 2088 DXS254E intron B IVS 2298 2412 DXS254E intron C BASE COUNT 649 a 822 c 765 g 676 t ORIGIN X chromosome. 1 cactggtttc ctaccactct catcagccat ctacagctac cttctcagca tccatgaaac 61 actccacgtg cccatctcca agatattggg gaccctgctg tttatcgcca tccccatagc 121 agcgggtgtg gtgatcaaat ctaagctccc caagttctct gaactactgt tacaggtcat 181 caagcccttc agctttatac ttctcctggg tggcctgttc ctggcctacc acatgggggt 241 cttcatccta gtgggagtca ggttacccat tgtactggtg ggtttcacag tgcctcttgt 301 tggcctcttg gtgggctaca gcctggccat ctgcctgaag ctgccagtgg ctcagcgacg 361 aacagtcagc attgaggtag gggtgcaaaa cagcctgcta gccttagcca tgctacagct 421 gtctctgcgc cgccttcaag cagactatgc ctctcaggcc cctttcattg tggcactgag 481 tggtacctcc gagatgttgg ctttggttat tggccagttc atctatagca tcttgtttcc 541 tgttccctga gccctctgga tcaagtttta taacttccag cccccatact cctctcttcc 601 cacagttctt gtgtaccgta ccaaaggcct ttagttctca tgcactatgc actcaagaca 661 aatccaggct tattttttta ctccttttcg ccccagcttt cagtgccaaa gaggccatgc 721 tgagttagat agttgggtat tgcccagaaa tatatttcaa taaaaaaaaa ccaaaaaaac 781 aaaagcaagc tcgggtccct tccagtctgt cccttggagc ctgtggattg gggtaagaag 841 agtgtgtctg ggcggatatg ggccagaaca gtgaaccagc ttggtgaccc tgcacagaac 901 ccactttctc ctcctcactc tgttttgcac atggtagaca tgtacctttg agctgtatcc 961 tcagatctac aaaggctttg ctctaacagc cctacctaac agggcagacc tctgcactat 1021 aaggagtgaa gagagcacta cttccgagct tcagtcagga tatttttgca gccagactag 1081 aaaggaagta gaaagccatt aagctatgac aggcaggact ggaccatttc tgtatctgag 1141 agacaccacc acgtgtttgc attgacttgg tccaccagca gggtgggaat tagcacaggc 1201 cgggcctgga gtgaggctga cattctggtc cttcctttca agccacagct cccagactaa 1261 gcaagcctgg ccccacaggt acctcacttg gatcccagtg ggcggggcat ccccacgtcc 1321 ccgcgccatt aagcccaggt ggtggactct acccaaggca cggccggggc agaagcgaag 1381 gccgggaggc gggactgaag aatgcgcagg cgggagccac gagcggcgcg cccctttcac 1441 ggcttccggc ggcggcgggc ggttccgggg cggcggcgcg cggcagggga ccgttggtgt 1501 ttgcgttggc cgtagtggac tgggccgtgg acaccatgca gctgaccgtg aaggcgctcc 1561 agggccggga atgtagccta caggtaaagt caccccgcag gctgctgcac tcactcctgc 1621 cgctggggcc gagcgcgcgt gggggaggcc gggagagctg acctcaggag ggggtgcgag 1681 ggcttcaccg cccagggaat tcgggctgag cccttgcctc tccacctcgg ccggatcagg 1741 tggcggagga cgagctagtg tctacactga agcacctggt ctcggataag ctgaatgtcc 1801 ctgtgcgcca gcaacgtctg ctgttcaagg gcaaggccct agcaggtaat cagggaaagg 1861 agccatccag ggagctttac agaaagctgg gggacgggat gtacaggcca gtgtagtgga 1921 tcggggcact ggctgccact ggacctcaga aaagtagctg agccatcacc accccaccac 1981 cccatagcca gatgctgcag tgcttcactc tggctgtctg tggcaggatt gcttcccagc 2041 cacctccaca tcagtgagga aggaaccaca ccaatctctc tttggcagat gaaaaacgac 2101 tgtcagatta caacattggg cccaattcta agctcaacct agttgttaag cctttggaga 2161 aggtgctact ggaagaaggg tctgcccaca gactggtcga ctccccagcc acccccatct 2221 ggcagctgat ctccaaagtc ctggcccgtc acttcagtgt agcagatgcc agcagggtcc 2281 tggaacaact acagagggtg agaagagtaa tgctggccct ccattcttaa tgtttctacc 2341 cgtcccctac ctaatgcagc atacgccttc cccactggcc tctctgaata ccactcggtc 2401 ttctctcctc aggattatga caggtccttg agccgcctaa cactggatga catcgaacgt 2461 ttggccagcc gctttctaca ccctgaagtg actgaggcta tggaaaaagg gttctgcaaa 2521 tagcattctg ggattgtggg gagaaatccc aggtcaggcc acagctgcat gttgcattaa 2581 atgtgttctc atgtcgcagt ttggctcata gtgataataa tagctggtat gtatccagct 2641 cttgctaggt gctaggtact caaagtacat ggcctgagtt ctcttctgcc tcccacagca 2701 aagttgaggc tgcactgtag agacacctag caatggcttg ggacagacag ggtccaagtc 2761 tagccagcct atagaagtgc ctctgtagga gaaagagtgt gctgttttca gagatgggtc 2821 ttagtattcc catttagccc actcagcata caccctcaaa agactccctc aggaaaccac 2881 tagttcctgt cacctgccct ctggtgtgtc cc // LOCUS YEPLCREA 3316 bp ds-DNA BCT 03-AUG-1990 DEFINITION Virulence plasmid pYVO3 (from Y.enterocolitica) LcrE protein mRNA, complete cds. ACCESSION M32097 KEYWORDS . SOURCE Virulence plasmid pYV03 (from Yersinia enterocolitica, strain 6471/76, serotype 0:3) DNA, clones pY[MS4146,MS4073,P6]. ORGANISM Plasmid pYV03 Prokaryota; Bacteria. REFERENCE 1 (bases 2 to 1081) AUTHORS Viitanen,A.-M., Toivanen,P. and Skurnik,M. TITLE The lcrE gene is part of an operon in the lcr region of Yersinia enterocolitica O:3 JOURNAL J. Bacteriol. 172, 3152-3162 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 3316) AUTHORS Viitanen,A.-M., Toivanen,P. and Skurnik,M. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by M.Skurnik, 14-FEB-1990. FEATURES from to/span description pept 649 1530 lcrE product ORF 1511 1789 ORF1 ORF 1776 2147 ORF2 (gtg start codon) ORF 2509 2853 ORF4 ORF 2840 > 3315 ORF5 ORF 451 119 (c) ORF6 ORF 173 < 1 (c) ORF7 mRNA 1020 < 1 (c) lcrB locus mRNA BASE COUNT 892 a 751 c 791 g 882 t ORIGIN 1 aggatcccca aagaaagtgg tgtggtaata agttttcggc tcatcggagc tggggcatcc 61 tgataaactg ggtaccaagc cgccggttca gggagatgcc ccccatcgaa aggctgccct 121 aaaccatcca acacctgccc cagcagatgt tcacccaccc caacctgatg cattgtcctg 181 tcgggctaac ttcagtatta gaagatatcc cgtacatttc accaagtgga ataagtaatg 241 cttgatgttg ggcaaaacct atgacttcag cctgtaaaga caggctgttg tctgggttac 301 gtaagtaaca taactcaccg atgcgcacac caggcactac cgcttttaat agcgttcctg 361 tcacttgagt gacacgtcct ctaatttgga ttaggcggct acctacaatg ccatgacgaa 421 tatgatgagg tatctgatct agtgagagca taaatccata atggttgaaa tattaaccac 481 tattttagtg actaaaaacg ctaaaaaatt gtagcgggag ccgcgagttt ttagaaaaat 541 agccaagcag cactaaaatt tctcggctga ttttggcatc gataagcaag aactattttt 601 ataatcgcgg taattgcaat tataaaccgt tcatcttagg gagtagttat gacgacgctt 661 cataacctat cttatggcaa taccccgctg cgtaatgagc atccagagat tgccagtagt 721 cagatcgtaa atcagactct gggtcaattt cggggagaat ctgtgcagat agtcagcggc 781 actctgcagt ctatagctga tatggcagaa gaggtaacat ttgtcttctc cgagcgtaag 841 gagctctccc tcgacaaacg caaattaagt gacagccaag ctcgagttag cgacgttgag 901 gagcaggtta atcaatacct tagcaaagtt ccagagttga aacaaaaaca gaatgtgagt 961 gagctgctca gtctgttgag taacagcccc aatataagct tgtcccagtt aaaggcttat 1021 ctggagggga aatcagaaga accgagtgag caattcaaaa tgctctgcgg cttgcgtgat 1081 gccctgaaag ggcgccctga attagcacat cttttgcatt tggttgaaca agctctggtc 1141 agcatggttg aagagcaaga agaagccatt gtattgggtg ccagaataac cccggaagcg 1201 tacagagaat cccagtcggg tgttaatcca ctgcagccgc tccgtgatac ctaccgcgat 1261 gcagtgatgg gttatcaagg gattaatgcg atctggagtg atttacaaaa acgttttccc 1321 aatggggata tagactcggt gatattattc ctgcaaaagg cgcttagtgc agatctacaa 1381 agtcaacaaa gcgggtctga acgggaaaaa ttagaaatag ttattagtga cttacagaag 1441 ctaaaggagt ttcgtagcgt gagtgaccag gttaaaggat tttggcaact tttttcagag 1501 ggtataacta atggcttacg acctttctga gtttatggga gatattgtcg cactggttga 1561 caagcgctgg gcggggattc atgacattga acatcttgcc aacgcctttt cccttcctac 1621 gcctgaaatc aaagtgcgtt tctatcaaga tttaaaaaga atgtttcgtc ttttccctct 1681 gggggtattt agcgatgagg agcaacggca aaatttattg caaatgtgtc aaaatgcgat 1741 cgatatggct attgagagtg aagaggaaga attgagtgag ttggattgaa cccatcattt 1801 cccatttctg ccaggatctg ggagtgccaa catctagccc cctttcgcct cttattcaat 1861 tagagatggc tcaatctggc acgctgcaac tggaacaaca tggtgcgaca ctgacactgt 1921 ggttagcgcg ttctctcgcc tggcaccagt gcgaagatgc tatggtcaaa gcgctaacgc 1981 tcacggcggc ccaaaagagt ggcgctttac cgctgcgagc ggggtggtta ggggaaaatc 2041 aactggtgtt atttgtctcg cttgatgagc gttccttaac cttgcccctt ttacatcaag 2101 ctttcgaaca gttactgcga ttgcagcaag aggtgcttgc gccgtgagtc gcataataac 2161 tgccccccat attggcatcg aaaaactgtc ggcgattagc ctggaagagc tatcctgtgg 2221 cttgcctgat cgttatgcct tgccgcctga tgggcatcca gtcgaaccac atttagagcg 2281 cctttaccct acagcacaaa gcaagcgtag cctatgggac tttgcttctc ccggctatac 2341 atttcatgga ttacatcgag ctcaagatta tcggcgcgaa ctggatacct tgcagtcact 2401 gctaaccacc agtcagtcct cagagctaca agctgccgcg gcgctgctga aatgccaaca 2461 agatgatgat cggttactgc aaataatcct taacctgttg cacaaagtat gaatattact 2521 ttaaccaaac gacaacagga gttcttgctg ctcaacggtt ggttacaact acaatgtggc 2581 catgcagagc gcgcatgtat tctattggac gccttgctga cgttaaatcc tgagcattta 2641 gccggtcggc gttgccgatt agtcgcgcta cttaataata accagggaga acgtgccgaa 2701 aaagaagcgc aatggctaat atcacatgac cctttacagg ctggaaattg gctctgcttg 2761 agccgtgccc aacaactgaa cggcgatctt gataaggctc gccatgctta tcaacattat 2821 ttggagttga aagatcataa tgaatcccca tgatcttgag tggctaaatc gtattggcga 2881 gcgtaaagat atcatgctgg cagtgctgct gttagctgtg gtattcatga tggtcttacc 2941 actccccccc ctcgtgttgg acattctgat tgctgttaac atgaccatct cagtggtgtt 3001 gttaatgata gcgatctata tcaactctcc tttacaattt tcagctttcc cagcggtgct 3061 actcgttacc acgttatttc gtctcgcact ttcagttagc accacccgca tgatcctgct 3121 acaagctgat gcggggcaga ttgtttacac ctttggtaat ttcgtcgttg gcggtaacct 3181 catcgtcggg attgtcatct tcctgatcat cactattgtg caatttttag tgataacgaa 3241 aggctcagaa cgtgtagcag aagttagtgc cagattctct cttgatgcga tgccgggtaa 3301 acagatgagt atcgat // LOCUS FSOFARS1A 170 bp ds-DNA PLN 03-AUG-1990 DEFINITION F.oxysporum autonomous replicating sequence (ARS). ACCESSION M29451 KEYWORDS autonomous replicating sequence. SOURCE F.oxysporum lycopersici (strain 73) ARS DNA, clone pFOLT4R4. ORGANISM Fusarium oxysporum Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Hypocreales; Hypocreaceae. REFERENCE 1 (bases 1 to 170) AUTHORS Powell,W.A. and Kistler,H.C. TITLE In vivo rearrangement of foreign DNA by Fusarium oxysporum produces linear self-replicating plasmids JOURNAL J. Bacteriol. 172, 3163-3171 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.A.Powell, 27-OCT-1989. FEATURES from to/span description site 20 28 DNA homologous to yeast autonomously replicating consensus sequence site 26 31 telomere repeat unit site 130 136 ligation point of telomeres -> ApaI site BASE COUNT 35 a 18 c 66 g 51 t ORIGIN 116 bp upstream of ApaI site. 1 catgccatcc gtaagatgct ttggtttagg gttagggtta gggttagggt tagggttagg 61 gttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg 121 gttagggtta gggccctaac cctaaaccaa agcatcttac ggatggcatg // LOCUS ARURRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION A.gigantea ribosomal RNA. ACCESSION M37170 M35661 KEYWORDS ribosomal RNA. SOURCE A.gigantea rRNA. ORGANISM Arundinaria gigantea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 18 c 12 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctacgt ggaggcatcc ctccgcagct // LOCUS ASTRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION A.sativa ribosomal RNA. ACCESSION M37171 M35661 KEYWORDS ribosomal RNA. SOURCE A.sativa rRNA. ORGANISM Avena sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS BLYRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION H.vulgare ribosomal RNA. ACCESSION M37172 M35661 KEYWORDS ribosomal RNA. SOURCE H.vulgare ribosomal RNA. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctacgt ggagccatcc ctccgcagct // LOCUS MUSELF2PS 2183 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse elongation factor 2 processed pseudogene, complete cds. ACCESSION X15335 J04749 KEYWORDS elongation factor 2; pseudogene. SOURCE Mouse cell line Ltk- DNA, clone cMER1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2183) AUTHORS Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and Uchida,T. TITLE Amplification of a long sequence that includes a processed pseudogene for elongation factor 2 in the mouse JOURNAL Genomics 6, 80-88 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Koide, 31-AUG-1989. FEATURES from to/span description pept.ps 319 2038 pseudo-elongation factor 2 rpt 319 330 repeat copy A rpt 2027 2038 repeat copy B BASE COUNT 594 a 543 c 585 g 461 t ORIGIN 1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca 61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct 121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaaatgta 181 gtttgaaaaa gggagcttgc ctgcctatca ttttagtgcc tgtgaggtgg agatgtgagg 241 atcagtaagt agttcaaggt tattcttggc tgtttagtga gtgttggcta tcctaggcta 301 cataagtccc tgtttcaaga aaacaaacca ggccactgta ggcgcgcttc cctgttcacc 361 tctgactctg agaatctgtc gccatccacc accatggtga acttcacagt agttcagatc 421 cgtgccatca cggacaggaa agccaacatt cagaacatgt cagtcatctc catgtggacc 481 acggcaagtc cacgctgaac gactcccttg tgtgcaaggc tggcatcatt gcctccgccg 541 gagccgggga gacacgtttc actgacatgc gcaaagatga gcaggagcgc tgcatcacaa 601 tcaagtccac cgccatctcc ctcttctacg agctctctaa gaacgacctg aaccgcatta 661 agcagagcaa ggatggctcg ggcttcctca tcaacctcat cgactctccc ggccatgtgg 721 acttctcttc agaggtgaca gctgccttgc gtgtcaccga tggagctctg gtggtggtgg 781 actgtgtgtc tggcgtgtgt gtgcagacag aaattgtgct gcgccaggcc atcgctgagc 841 gcatcaagcc cgtcctgatg atgaacaaga tggaccgggc ccctgctgga gctgcagctg 901 gaacccgagg agctctacca gaccttccag cgcatcgtgg aaaacgtcaa cgtcatcatc 961 tctaccgatg gcgagggtga gagtgggccc atgggcaata tcatgattga ccccgtcctg 1021 ggtgccgtag gctttggttc tggcctgcat ggctgggcct tcaccctgaa gcagtttgcg 1081 gagatgtacg tggccaagtt tgcagccaag ggcgagggcc agctgggtgc aactgagcgt 1141 gccaagaaag tagagggcat gatgaagaag ctgtggggag accagtactt tgatctgccc 1201 tcatggagcc tatctatctg gtggagatcc agtgtcctga gcaagtggtg ggtggcatct 1261 atggtgtcct gaacaggaag cgtggccata tgtcccaggt ggccggtacc cccatgtttg 1321 tggtcaaggc atacctgcct gtcaatgaat cctttggctt cactgctgat ctgcgatcca 1381 acactagcag ccaggccttc ccccagtgcg agtttgacca ctggcagatc ctgcctgggg 1441 atccttttgt caacagcagc cgccccagcc aagtggtagc tgagacctgc aagcacaagg 1501 gcctgaaaga gggctggaca acttcctgga caaactgtag gcagcctgat actgccacat 1561 gttgcacagt gcctgcccat cagaagacac cttgagactg tccccacagt gctcctctgg 1621 aggctgttgg ggccaccctg acatcactca gcactcactt ggctaccaat tctatttatt 1681 tcagaattac aaaatagcgg gaatctctct gcaggctgga ctggcaggcc gtggggtagg 1741 cggacatggc tcttaacatt ttcagaggga aacacgcaga tgtccaaaag tctaaataaa 1801 tgcattcaga ggtttatgag gtccatggcc aagtggagct cccccagagg gggaggtggg 1861 gtaagtacct ccaggaaggc aggcagcctg ccttagactt gcaacccggc tgtgcgaatc 1921 attggagtaa taaactacag tggttgatct gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1981 aaaaaagaaa agaaaagaaa agaaagaaag aaagaaagaa agaaaggaaa acaaaccaaa 2041 agaaaaagag tttccatgat gtccaggatg atgcaggtgg gaagggctag ctcaggttag 2101 aagggctagc tcaggtctct cttaccctgc tctaactcag cctgcttctg gaggagagac 2161 aagtcaaaga gcagcaggag ctt // LOCUS MUSELF2PSA 2199 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse elongation factor 2 processed pseudogene, complete cds. ACCESSION X15336 J04749 KEYWORDS elongation factor 2; pseudogene. SOURCE Mouse cell line Ltk- DNA, clone cMER9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2199) AUTHORS Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and Uchida,T. TITLE Amplification of a long sequence that includes a processed pseudogene for elongation factor 2 in the mouse JOURNAL Genomics 6, 80-88 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Koide, 31-AUG-1989. FEATURES from to/span description pept.ps 324 2016 pseudo-elongation factor 2 rpt 324 335 repeat copy A rpt 2005 2016 repeat copy B BASE COUNT 602 a 546 c 590 g 461 t ORIGIN 1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca 61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct 121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaacaaaa 181 caaaacaaaa atgtagtttg aaaaagggag cttgcctgcc tatcatttta gtgcctgtga 241 ggtggagatg tgaggatcag taagtagttc aaggttattc ttggctgttt agtgagtgtt 301 ggctatccta ggctacataa gtccctgttt caagaaaaca aaccaggcca ctgtaggcgc 361 gcttccctgt tcacctctga ctctgagaat ctgtcgccat ccaccaccat ggtgaacttc 421 acagtagttc agatccgtgc catcacggac aggaaagcca acattcagaa catgtcagtc 481 atctccatgt ggaccacggc aagtccacgc tgaacgactc ccttgtgtgc aaggctggca 541 tcattgcctc cgccggagcc ggggagacac gtttcactga catgcgcaaa gatgagcagg 601 agcgctgcat cacaatcaag tccaccgcca tctccctctt ctacgagctc tctaagaacg 661 acctgaaccg cattaagcag agcaaggatg gctcgggctt cctcatcaac ctcatcgact 721 ctcccggcca tgtggacttc tcttcagagg tgacagctgc cttgcgtgtc accgatggag 781 ctctggtggt ggtggactgt gtgtctggcg tgtgtgtgca gacagaaatt gtgctgcgcc 841 aggccatcgc tgagcgcatc aagcccgtcc tgatgatgaa caagatggac cgggcccctg 901 ctggagctgc agctggaacc cgaggagctc taccagacct tccagcgcat cgtggaaaac 961 gtcaacgtca tcatctctac cgatggcgag ggtgagagtg ggcccatggg caatatcatg 1021 attgaccccg tcctgggtgc cgtaggcttt ggttctggcc tgcatggctg ggccttcacc 1081 ctgaagcagt ttgcggagat gtacgtggcc aagtttgcag ccaagggcga gggccagctg 1141 ggtgcaactg agcgtgccaa gaaagtagag ggcatgatga agaagctgtg gggagaccag 1201 tactttgatc tgccctcatg gagcctatct atctggtgga gatccagtgt cctgagcaag 1261 tggtgggtgg catctatggt gtcctgaaca ggaagcgtgg ccatatgtcc caggtggccg 1321 gtacccccat gtttgtggtc aaggcatacc tgcctgtcaa tgaatccttt ggcttcactg 1381 ctgatctgcg atccaacact agcagccagg ccttccccca gtgcgagttt gaccactggc 1441 agatcctgcc tggggatcct tttgtcaaca gcagccgccc cagccaagtg gtagctgaga 1501 cctgcaagca caagggcctg aaagagggct ggacaacttc ctggacaaac tgtaggcagc 1561 ctgatactgc cacatgttgc acagtgcctg cccatcagaa gacaccttga gactgtcccc 1621 acagtgctcc tctggaggct gttggggcca ccctgacatc actcagcact cacttggcta 1681 ccaattctat ttatttcaga attacaaaat agcgggaatc tctctgcagg ctggactggc 1741 aggccgtggg gtaggcggac atggctctta acattttcag agggaaacac gcagatgtcc 1801 aaaagtctaa ataaatgcat tcagaggttt atgaggtcca tggccaagtg gagctccccc 1861 agagggggag gtggggtaag tacctccagg aaggcaggca gcctgcctta gacttgcaac 1921 ccggctgtgc gaatcattgg agtaataaac tacagtggtt gatctgaaaa aaaaaaaaaa 1981 gaaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa 2041 aagaaaacaa accaaaagaa aaagagtttc catgatgtcc aggatgatgc aggtgggaag 2101 ggctagctca ggttagaagg gctagctcag gtctctctta ccctgctcta actcagcctg 2161 cttctggagg agagacaagt caaagagcag caggagctt // LOCUS MUSELF2XX 2161 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse elongation factor 2 related sequence. ACCESSION X15337 J04749 KEYWORDS elongation factor 2. SOURCE Mouse cell line Ltk- DNA, clone MER7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2161) AUTHORS Koide,T., Ishiura,M., Hazumi,N., Shiroishi,T., Okada,Y. and Uchida,T. TITLE Amplification of a long sequence that includes a processed pseudogene for elongation factor 2 in the mouse JOURNAL Genomics 6, 80-88 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Koide, 31-AUG-1989. FEATURES from to/span description pept.ps 334 2054 elongation factor 2 related sequence rpt 334 345 tandem repeat rpt 2043 2054 tandem repeat BASE COUNT 575 a 544 c 581 g 461 t ORIGIN 1 aaaaaggaac caggtgtgtg ggttcatacc caccatctca gcactcagga ggtagaaaca 61 gaagataaga agttcaaggt tgtcttcaga tgcacagcag attccaggcc agcccgggct 121 acacaatacc ttgtctcata caaaacaaaa caaaacaaaa caaaacaaaa caaaacaaaa 181 atgtagtttg aaaaagggag cttgcctgcc tatcatttta gtgcctgtga ggtggagatg 241 tgaggatcag taagtagttc aaggttattc ttggctgttt agtgagtgtt ggctatccta 301 ggctacataa gtccctgttt caagaaaaca aaccaggcca ctgtaggcgc gcttccctgt 361 tcacctctgg ctctgagaat ctgtcgccat ccaccaccat ggtgaacttc acagtagttc 421 agatccgtgc catcacggac aggaaagcca acattcagaa catgtcagtc atctccatgt 481 ggaccacggc aagtccacgc tgaacgactc ccttgtgtgc aaggctggca tcattgcctc 541 cgccggagcc ggggagacac gtttcactga catgcgcaaa gatgagcagg agcgctgcat 601 cacaatcaag tccaccgcca tctccctctt ctacgagctc tctaagaacg acctgaaccg 661 cattaagcag agcaaggatg gctcgggctt cctcatcaac ctcatcgact ctcccggcca 721 tgtggacttc tcttcagagg tgacagctgc cttgcgtgtc accgatggag ctctggtggt 781 ggtggactgt gtgtctggcg tgtgtgtgca gacagaaatt gtgctgcgcc aggccatcgc 841 tgagcacatc aagcccgtcc tgatgatgaa caagatggac cgggcccctg ctggagctgc 901 agctggaacc cgaggagctc taccagacct tccagcgcat cgtggaaaac gtcaacgtca 961 tcatctctac cgatggcgag ggtgagagtg ggcccatggg caatatcatg attgaccccg 1021 tcctgggtac cgtaggcttt ggttctggcc tgcatggctg ggccttcacc ctgaagcagt 1081 ttgcggagat gtacgtggcc aagtttgcag ccaagggcga gggccagctg ggtgcaactg 1141 agcgtgccaa gaaagtagag ggcatgatga agaagctgtg gggagaccag tactttgatc 1201 tgccctcatg gagcctatct atctggtgga gatccagtgt cctgagcaag tggtgggtgg 1261 catctatggt gtcctgaaca ggaagcgtgg ccatatgtcc caggtggccg gtacccccat 1321 gtttgtggtc aaggcatacc tgcctgtcaa tgaatccttt ggcttcactg ctgatctgcg 1381 atccaacact agcagccagg ccttccccca gtgcgagttt gaccactggc agatcctgcc 1441 tggggatcct tttgtcaaca gcagccgccc cagccaagtg gtagctgaga cctgcaagca 1501 caagggcctg aaagagggct ggacaacttc ctggacaaac tgtaggcagc ctgatactgc 1561 cacatgttgc acagtgcctg cccatcagaa gacaccttga gactgtcccc acagtgctcc 1621 tctggaggct gttggggcca ccctgacatc actcagcact cacttggcta ccaattctat 1681 ttatttcaga attacaaaat agcgggaatc tctctgcagg ctggactggc aggccgtggg 1741 gtaggcggac atggctctta acattttcag agggaaacac gcagatgtcc aaaagtctaa 1801 ataaatgcat tcagaggttt atgaggtcca tggccaagtg gagctccccc agagggggag 1861 gtggggtaag tacctccagg aaggcaggca gcctgcctta gacttgcaac ccggctgtgc 1921 gaatcattgg agtaataaac tacagtggtt gatctgaaaa aaaaaaaaaa aaaagaaaag 1981 aaagaaagaa agaaagaaag aaaagaaaac aaaccaaaag aaaaagagtt tccatgatgt 2041 ccaggatgat gcaggtggga agggctagct caggttagaa gggctagctc aggtctctct 2101 taccctgctc taactcagcc tgcttctgga ggagagacaa gtcaaagagc agcaggagct 2161 t // LOCUS MUSTHBS1 4915 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse thrombospondin (THBS1) gene, exons 1-7. ACCESSION J05605 KEYWORDS thrombospondin. SEGMENT 1 of 2 SOURCE Mouse (strain BALB/C) DNA, clone phi-42. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4915) AUTHORS Bornstein,P., Alfi,D., Devarayalu,S., Framson,P. and Li,P. TITLE Characterization of the mouse thrombospondin gene and evaluation of the role of the first intron in human gene expression JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Bornstein, 28-JUN-1990. FEATURES from to/span description pept 1222 1288 thrombospondin, exon 2 (first expressed exon) 1580 2139 thrombospondin, exon 3 2947 3022 thrombospondin, exon 4 3267 3466 thrombospondin, exon 5 3561 3683 thrombospondin, exon 6 4439 + 4532 thrombospondin, exon 7 pre-msg 484 > 4915 thrombospondin mRNA and introns IVS 667 1193 thrombospondin intron A IVS 1289 1579 thrombospondin intron B IVS 2140 2946 thrombospondin intron C IVS 3023 3266 thrombospondin intron D IVS 3467 3560 thrombospondin intron E IVS 3684 4438 thrombospondin intron F IVS 4533 > 4915 thrombospondin intron G BASE COUNT 1188 a 1342 c 1100 g 1282 t 3 others ORIGIN Chromosome 2, region F. 1 gggatccccc acccaccagg attgactcag gaggcagctt cctttaaggg gtccaacctt 61 ttgccctctt ttatttttaa accagactgt ctcgatcccc aagcgtctct gtcatggaat 121 tattcaagga gatgtgcttt aatgaaagcc tccctaaagg gtcttaggtg gtccccaaag 181 aagcatcgcg tctggggact ctcatgcaga acgtctccag ttcacatggc gcaagatcct 241 aagcgctaaa ggctgagtac gccaaggctg cgtgggcgga gacctatttt ttctgacaag 301 ttccaggggc tcctgtgcgg gatcggagtc tcccccttca ctttcagccc gagagctgtg 361 cgccaagcag cagggggcgg agaaatgaac ccgtccgtcc ccgtcccccg cccccgcccc 421 cagaaccctc ccccctccct gcttcctgcc tctttaaaag gtcctgggct cctcagtcaa 481 gccagccact gcctggagtc agccagcctc atcggacttc tgcaggcaat cgcgaagctg 541 ctatccagtt ctgccacggt ctctcccggc gcaccggcag tctcagcgtc ttcaccggac 601 tcagcgtcct tgtccttcac ttcacctttg ccacctctcc gggttactga gccccggtgc 661 acacaggtaa acctccgcct ctgcggagca cctggccggc cacctcaaag ccccaggctg 721 cagctgcaga tgatcagcgc ttctcctcat ctcctactct tccccttctg gttttcctaa 781 ctcctttatc ctgccccgtc tatcccctgg aaagaacttt ccggagctgg ttctcttcag 841 ccttcgggtt cacaaactat acctccaaga caccccaaga ttttctcccc agtagtctcc 901 aggttacccc tgactgtgcc atccaaaata cattctgtaa tctgtctaaa ggggcggggg 961 ggggagggac ggacccaagt cactttatac cctttatcct ggaactccaa caatccccct 1021 gcctctggcg ctctttttat actggatacc gtttggaaag taaacccttc tttttcagca 1081 catatggggc aaaagaagaa acacacacac acacacacac acacacacac acctctgccc 1141 ttccaccttg cttcctggtc tctgactctg gcctcctgtg cttcttgcta caggctccgt 1201 gttgggcaca aaggctccac catggagctc ctgcggggac taggtgtcct gttcctgttg 1261 catatgtgtg gaagcaaccg cattccaggt gagtttgtga tatatcattt tcctccccct 1321 tctcctctct ctctctctct ctttctctct ctctctctct gctgctgctc tctctctctc 1381 tctctctctc tctctctctc tctctcgaag ctcctgtgtt tccttggagt ccttatctcc 1441 tgataaggag gcagcttccc tcgctcagct ttccataagt gccattattg tcacgagttc 1501 ccaccttggc tcctcacagc cagaacttta cccccttata cctagcctag aaagccgatt 1561 gtcttctctc tccttacaga gtctggggga gataacggtg tgtttgacat ctttgaactc 1621 attggaggtg cacgaagggg ccccggtcgc cgactggtga agggccaaga tctatccagc 1681 cccgccttcc gcattgagaa tgccaacctg atccccgctg tgccggatga caagttccaa 1741 gacctactgg acgctgtgtg ggccgacaaa ggcttcatct tcttggcttc cttgaggcag 1801 atgaagaaga cccggggcac actcctggct gtggaacgga aagacaacac tggccagatc 1861 ttcagtgtgg tctccaacgg caaagctggc accctggacc tgagcctgag cctgccaggg 1921 aagcaacaag tggtgtcagt ggaggaagct ctcctggcca ctggccagtg gaagagcatc 1981 acgctgtttg ttcaagagga ccgggctcaa ctctacatag actgtgataa gatggagagc 2041 gcggagctgg atgtacccat ccagagcatc ttcaccaggg atctggccag cgttgccagg 2101 ctccgagttg caaagggaga tgtcaatgac aattttcagg taaatcttct tacctgtcac 2161 aagggctatg tggaattaat atcggtagac taaaaatgat tctcaaatgn taaagaggaa 2221 cagacacccc cagaaggttg gttaaatcat attacagatg acccaaaact ccagggnttt 2281 cagatttgat agggacccga gaaattttnt tcctaataag aaataagttg ctaatggtgc 2341 caggacctgg gcacccatgc tttcaaagct atcaggagag ataagaaatg gttttcctat 2401 tactacagcc tgtttcagaa tggggtactc cctgtgccct taggaaagta acttgtatca 2461 aatgtaccta gcagttttaa tagggcatga aaaggatact ttgtcagtat ttgagtaaat 2521 gggcagggtg ttctgggtgc tttcagtatg gatctaaggc aagacatgaa taacagtcca 2581 gtcatgtgcc tctgggactc cttacttgcc ttaactttca gatatttttt tttcttacaa 2641 gcgtggcttt gaacgcccac attaactctt tgtcactgga gagtatcatg tcctccagac 2701 ttgggatcaa attccatttc ctgtgtaccc caagcagcct gaaaactgga ttctttgtaa 2761 aatgctccag ttggcagaag acacaattaa gattttgaag ttgttcggca atttatgtct 2821 ctgtgatagt ttggcaggat gatcttaagt tcttttaaac tttctgagcg ttcaccacaa 2881 gcaaactcag actggccgcc ccacccccac gcccagcatt aaaccacgtc tgtcctctgc 2941 ccacaggggg tgctgcagaa tgtgaggttt gtctttggaa ccaccccaga agacattctc 3001 aggaacaaag gctgctccag ctgtgagtac ccctgtgttt tttaagattt ggagaattct 3061 ggggaagtcc accgaaagca ccctgattaa tgtaagcagt agggtgcata cacaatatct 3121 cagggggagc agagttggct tgcacccact ctgtcatgcc aaaggaaatg tcttttataa 3181 cccagaaccc gaagggtagg ctggtaggat cagctgttga aaccttgaat cagctatcga 3241 atacgttact tcttatttct tagcagctac caacgtcctt cttacccttg acaacaacgt 3301 ggtgaacggt tccagccctg ctatccgcac caactacatc ggccacaaaa caaaggacct 3361 ccaagctatc tgtggcctct cctgtgatga actatccagc atggtcctgg aactgaaggg 3421 cctgcgcacc atcgtgacca ctctgcagga cagcatccga aaagtggtca gtggcctccc 3481 ctcccatttg ccagctcttg gaaatcacca caggtggtcc cacgggagcc tggacactca 3541 tgagccttct cccctttcag acggaagaga acagagagct ggtcagtgag ctgaagcggc 3601 ctcccctctg ctttcacaat ggagtccagt acaagaacaa cgaggagtgg actgtagaca 3661 gttgcacaga gtgtcactgc caggtaagag acactcacag actgcaataa gacttggaga 3721 catatgtctc gggagatacg taaatgacta cacacatttt taaatgtgct ctggcctgct 3781 accaaagtga ttttttttct ttaatatgta atgacatctg caaattcctc tttaaatatc 3841 agaaggttta tcttaataat accccaggct gagcagctaa gagaacaaat ctatctaaac 3901 gttggaatga cttacctgct ttagttcaaa gaccaagtat tgagtgtcag atacataaaa 3961 cccgcttaaa gcattcggtt ttggccagcc ctaatttgta ttcagctctg gcaggcagat 4021 tcttctaaca ctgttcggat ctctggctca ccgggaaaag tttaaatgca aatcactggt 4081 tactgaattt gagacacaac ttcaattaag ttcccgtctc ccttcaatat gtgataaaaa 4141 cccttatcta cctttgagat gcttactgat acctgcaatg tccgttgctt tcccttatgc 4201 aaatgtgttg ttctgtaact tccacccctg actgagtagc agttaaaaag aactttccca 4261 aagaatggag tgagaattag ccctggaact gtatacctct ggggactaca gatccatgag 4321 acatccctcc tataaaagcc acccctcctg tgtcctggct tataatggca atgtcacaga 4381 aagctcctct ctgcaatacg cttggtaaca tcgttcctct ccttctcttg ccttgcagaa 4441 ctcggttacc atctgcaaaa aggtgtcctg tcccatcatg ccctgctcca acgccacagt 4501 tcctgatggt gaatgctgcc cacggtgctg gcgtaagttc ctaaaatggt gtagccatcc 4561 ctcagtggct agctcagtct gggggcatcc tgacagctgg gcacaacacc agagcccagc 4621 ccctcccagc tccatcagac gttccagtca gatctgggtt atggcatagg caatggatga 4681 tcttagcaga gcttatggat tctggtttta tgctataagg agagagatgg atacagcagt 4741 agacttaagc tggttgcttt tagcaattga tctttgttga aaaaaatgat ctactggaga 4801 acggacagaa actgtgtaag cattgggctg aggcttgaaa agccacctgt tttatctgca 4861 gtgagaggag gcctaaggct tagcatctgt tcattcaggc agtaaagaaa tcagc // LOCUS MUSTHBS12 1145 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse thrombospondin (THBS1) gene, exons 8 and 9. ACCESSION J05606 KEYWORDS thrombospondin. SEGMENT 2 of 2 SOURCE Mouse (strain BALB/C) DNA, clone phi-42. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1145) AUTHORS Bornstein,P., Alfi,D., Devarayalu,S., Framson,P. and Li,P. TITLE Characterization of the mouse thrombospondin gene and evaluation of the role of the first intron in human gene expression JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by P.Bornstein, 28-JUN-1990. FEATURES from to/span description pept + 297 470 thrombospondin, exon 8 969 > 1145 thrombospondin, exon 9 pre-msg < 1 > 1145 thrombospondin mRNA and introns IVS < 1 296 thrombospondin intron G IVS 471 968 thrombospondin intron H BASE COUNT 302 a 294 c 260 g 289 t ORIGIN 820 bp after segment 1. 1 tgaccgtagt gctagattgc tgtgctgtga gagagatgtg tgatgctcca ttatcattat 61 cagtcagact cttgatagcc aagcgtctga gagtacataa atgtacacaa tctctagttg 121 ttaatcatgc ttccattaaa ctaagagtat gaaaaaagaa ggactttttt taaaaaaaga 181 aacctattgc tttaccacct ctggctataa agggccataa gcatataagt aaataaactg 241 cctaagaaca gtttatctta gacatgagat gctaacgacc tcatttctcc ccacagccag 301 cgactctgct gacgatggct ggtctccctg gtctgagtgg acctcctgct ctgccacatg 361 tggcaatgga attcagcaac gtggtcgttc ctgtgacagc ctcaacaaca gatgcgaggg 421 ctcttcggta cagacgagga cctgccacat tcaggagtgt gacaaaagat gtaagcatca 481 tggctgccaa gggtggcaat ctcccttgcc cagtgcttcc ctcacaaaga gcagctccgg 541 agcggggagc ctgagaatgc aggttctgag cctgcggctc tcagtctcaa tcacacctct 601 aacctttgca ctcctgggct tgaccttcct tccctgaggc tcctcctcat ctgtaacgag 661 agaacaataa ccattcctga taaagttact ttagccattt ctctagataa ttcatatgaa 721 acaatcagaa tggatctggg ctcctgacca cgaacctata aatattaggt tgttgacagc 781 aactcacaca cacagacaaa taccatcttc cctatagttt ctccactgaa gtgtggcagt 841 agccagtttc tacaatacac tagttgaagc tgcttttggt gggctgctgg aatgattgcc 901 tgctgagact tgctgtgtgg ccccctccaa ggcagcaccc attggtgaca atcaactttg 961 cccttcagtt aaacaggatg gtggctggag tcactggtct ccatggtcgt cctgttctgt 1021 gacctgtggt gacggtgtga tcacaaggat ccgtctctgc aactccccca gcccccagat 1081 gaacgggaag ccctgtgaag gtgaagcccg ggagaccaaa gcctgcaaga aagacgcctg 1141 cccaa // LOCUS MZERRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION Z.mays ribosomal RNA. ACCESSION M37178 M35661 KEYWORDS ribosomal RNA. SOURCE Z.mays rRNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 12 a 17 c 10 g 11 t ORIGIN 1 acgagacctc agcctgctaa ctagctatac ggagccatcc ctccgtagtt // LOCUS PNURRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION P.nudum ribosomal RNA. ACCESSION M37179 M35661 KEYWORDS ribosomal RNA. SOURCE P.nudum rRNA. ORGANISM Psilotum nudum Eukaryota; Plantae; Thallobionta; Psilotophyta; Psilotopsida; Psilotales; Psilotaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 16 c 12 g 11 t ORIGIN 1 acgagacctc agcctgctaa ctagttacgc gaaggatcct cttcgtggcc // LOCUS RATRGK5 1883 bp ds-DNA ROD 03-AUG-1990 DEFINITION Rat voltage-gated potassium channel protein (RGK5) gene, complete cds. ACCESSION M30312 KEYWORDS voltage-gated potassium channel protein. SOURCE Rat (Sprague Dawley) adult liver DNA, clone lambda RGK5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1883) AUTHORS Douglass,J., Osborne,P.B., Cai,Y.-C., Wilkinson,M., Christie,M.J. and Adelman,J.P. TITLE Characterization and functional expression of a rat genomic DNA clone encoding a lymphocyte potassium channel JOURNAL J. Immunol. 144, 4841-4850 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.O.Douglass 01-DEC-1989. FEATURES from to/span description pept 35 1612 voltage-gated potassium channel protein BASE COUNT 380 a 567 c 489 g 447 t ORIGIN 1 ctgcagagcc aggctacgcg agctgccgcc agacatgacc gtggtgcccg gggaccacct 61 gctggagcca gaagcggcgg gaggcggcgg cggggacccg cctcagggag gctgtgtcag 121 tggcggcggc tgcgaccgct acgaaccgct gccgcccgcg ctgcccgccg cgggcgagca 181 ggattgctgc ggggagcgcg tggtcatcaa catctccggg ctgcgcttcg agacacagct 241 caagaccctc tgccagttcc ctgagacgct gctaggcgac cccaagcggc gcatgcgata 301 cttcgacccg ctccgcaatg agtacttctt cgaccgcaac agacccagct tcgacgccat 361 cctctactac taccagtccg ggggccgcat ccgccggccg gtcaacgtgc ccatcgacat 421 cttctccgag gagatccgct tctaccaact gggtgaggag gctatggaga agttccgtga 481 ggacgagggc ttcctgcggg aggaggagcg acccctgccc cgccgtgact tccagcgcca 541 ggtgtggctg ctcttcgaat accccgagag ctcgcggccg gcccggggca ttgccatcgt 601 gtcagtgctg gtcattctca tctccattgt catcttctgc ttggagacac tacccgagtt 661 tcgcgacgag aaggactatc ccgcctctcc gtcgcaggac gtgtttgagg ctgccaacaa 721 cagcacgtcg ggggcctcct ctggagcctc cagcttctcg gaccccttct tcgtagtgga 781 gaccctgtgc atcatctggt tctcctttga gctgctggtg cgattctttg cttgccccag 841 taaagccacc ttctccagaa atatcatgaa cctgatagac attgtagcca tcatccctta 901 ttttattact ctgggcactg agctggctga gcgacagggt aatgggcagc aggctatgtc 961 actggccatc ctgagggtca tccgcctagt aagggtcttc cgcatcttca agctctcccg 1021 ccattctaag gggctgcaga tcctgggaca gacactgaag gcttccatgc gagagctggg 1081 gctgctcatt ttcttccttt tcattggggt catccttttc tccagtgcag tctactttgc 1141 tgaggcagac gacccttctt cgggttttaa cagtatcccg gatgccttct ggtgggccgt 1201 ggtaaccatg acaactgtcg gttatggtga tatgcaccca gtgaccatag gaggcaagat 1261 tgtgggctct ctttgtgcca tcgcaggtgt cttgaccatt gcattgccgg ttcctgtgat 1321 tctttccaac ttcaattact tctaccaccg ggagacagaa ggggaagagc aagcccagta 1381 catgcacgtg ggaagctgcc agcacctctc ctcttcagca gaggagctcc gaaaagcccg 1441 gagtaactcc actctgagta agtcggagta tatggtgatc gaagaggggg gtatgaacca 1501 cagcgccttc ccccagaccc ccttcaaaac gggcaactcc actgccactt gcaccaccaa 1561 caataatccc aactcctgtg tcaacatcaa aaagatattc actgatgtct aatagatgat 1621 acgattgcca ttctgtgccc agtattgtgt ggaacatgcc cccttggtct gtgtatgccc 1681 ttgatttata catttccaga ccattcatca aggaaagtac atgaagaagt ggaaaagcac 1741 acttcattct ccctctccct attgcttcat actgaaacag gtgcctggtt ttgcaagtgg 1801 gctcattctc tcagctcttt ttctctctcc ctctctctct ccctgtttct taattttgtg 1861 aacaacaaac ttacattaag ctt // LOCUS RICRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION O.sativa ribosomal RNA. ACCESSION M37173 M35661 KEYWORDS ribosomal RNA. SOURCE O.sativa rRNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS SAHRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION S.officinarum ribosomal RNA. ACCESSION M37174 M35661 KEYWORDS ribosomal RNA. SOURCE S.officinarum rRNA. ORGANISM Saccharum officinarum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS SRGRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION S.bicolor ribosomal RNA. ACCESSION M37175 M35661 KEYWORDS ribosomal RNA. SOURCE S.bicolor rRNA. ORGANISM Sorghum bicolor Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS TRDRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION T.dactyloides ribosomal RNA. ACCESSION M37176 M35661 KEYWORDS ribosomal RNA. SOURCE T.dactyloides rRNA. ORGANISM Tripsacum dactyloides Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 17 c 11 g 11 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgtagtt // LOCUS WHTRRUN 50 bp ss-rRNA RNA 03-AUG-1990 DEFINITION T.aestivum ribosomal RNA. ACCESSION M37177 M35661 KEYWORDS ribosomal RNA. SOURCE T.aestivum rRNA. ORGANISM Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 50) AUTHORS Hamby,R.K. and Zimmer,E.A. TITLE Ribosomal RNA sequences for inferring phylogeny within the grass family (Poaceae) JOURNAL Pl. Syst. Evol. 160, 29-37 (1988) STANDARD simple staff_review COMMENT The ribosomal RNA sequences given are either 26S or 18S, but it is not specifically delineated in the article. FEATURES from to/span description rRNA < 1 > 50 ribosomal RNA BASE COUNT 11 a 19 c 11 g 9 t ORIGIN 1 acgagacctc agcctgctaa ctagctatgc ggagccatcc ctccgcagct // LOCUS SUSFCK 3985 bp ss-mRNA INV 03-AUG-1990 DEFINITION S.purpuratus flagellar creatine kinase mRNA, complete cds. ACCESSION M33763 KEYWORDS creatine kinase; flagellar creatine kinase. SOURCE S.purpuratus sperm, cDNA to mRNA, clone WD2.2. ORGANISM Strongylocentrotus purpuratus Eukaryota; Animalia; Eumetazoa; Echinodermata; Echinozoa; Echinoidea; Echinacea; Echinoida; Strongylocentrotidae. REFERENCE 1 (bases 1 to 3985) AUTHORS Wothe,D.D., Charbonneau,H. and Shapiro,B.M. TITLE The phosphocreatine shuttle of sea urchin sperm: Flagellar creatine kinase resulted from a gene triplication JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5203-5207 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.D.Wothe, 18-APR-1990. FEATURES from to/span description pept 95 3619 flagellar creatine kinase (E.C. 2.7.3.2) signal 3961 3966 poly-A signal BASE COUNT 1184 a 923 c 910 g 968 t ORIGIN 1 ccctaaactt caatcacact tctaatcaag aaaggaacaa atttgtatta attatttcat 61 tttctacata tgcattgtaa tcagtgaccc agcaatggga tgcgcagcat cgtcacaaca 121 aacgacggcc acaggtggtc aaccagctgc tggtgaaaag gcaaacccgg cccccgctaa 181 taacaatcca aacgcagcaa acaaagcaga aacgacagga gcagctgaag aactcactaa 241 agaatctgaa ccatttgtgg aaccggatcc caattaccct gacctaagca agcacaacaa 301 ttacctggca gaatctttaa cgccttccat ctacaacaaa atctgtaacc ttcgcacact 361 ttctggttac tctgtggatg gttgcatgca gaccggcgtg gacaatccag gacatccttt 421 catcaagact gtcggattgg tggctgggga tgaagagtgt tacgatctct ttgctgacct 481 ctttgatcca actattgaca aaagacacaa cggatatccc cgcaatgcaa agcacacaac 541 tgatctcaac cctgatcatc tcaagggtgg tgatgacttt gatccaaagt atgtcctttc 601 atgcagagtt cgtacaggcc gctgcattag agggtacggc ctcccaccac attgcacgag 661 ggctgaacgt cgagacgttg agaaagtttg caaagacgca ctggcaactc tagatgggcc 721 actcaaaggc acgtattacc cactaacggg gatgacagag gaaatgcagg acaaactaat 781 tgcagatcat ttcctattcg ataagcccgt ttctcctttg cttatgtcag ctcgcatggc 841 ccgtgattgg cctgatggac gtgggatctg gcacaatgcc gataaaaact tcctggtttg 901 gataaatgag gaagatcaca ctcgggttat ttccatggaa acaagtggca acatgaagaa 961 tgtctttaaa cgattctgta atggtctcaa caaagtcgag aatgctctca aagcaaaagg 1021 gtatgaattc tcatggaatg agcatcttgg gtatgtgctc acttgtccat ccaaccttgg 1081 tacaggtgta cgtgcaggag tccacatcaa gattcctctt ttcagcaagc acgctggatt 1141 tgaatccatt ctgaagcatt atcgtctcca gaaacgtggc acaggaggcg tggatacggc 1201 atccactgac ggcacgttcg acatttcaaa cctagatcgt cttggcactt ctgaagtaca 1261 acaggttcaa tccgttgttg atggtgtcaa gaagctgatt gagttggaaa aagccttaga 1321 aaagggatca gatatcagcg gccagattcc ccgggaccca gcaatagttc gtgcagaaca 1381 agtgaaagaa ggatatccag acctatccaa acacaacaat catctggccc actgtctgac 1441 atacgacatt tggaaaagtc tcaaggacaa aaagacgcca agtggcttta ctctggatgg 1501 ctgcatccaa acgggagtca tgaaccctgg tcatcctcac atcatgaccg ttggaatggt 1561 tgcaggagat gaggaatcat atgacgtctt cgctgatatc tttgacccgg tcattgatgc 1621 acgacacgga ggttatccca aagatgccgt ccatgtcacc aacataaacc atgcagatct 1681 taaaggagga gataacctgg atcccaaata tgttttgtct tgccgtgttc gcacaggacg 1741 cagcatcatc gggtattctc tgcccccgca ttgcaccgtg gaagagagag ctgccgtaga 1801 gacgatcacg atcggagctc ttgataaatt tgatggagac ttgcaaggaa aatactaccc 1861 actggaagga atgtctgatg aaacacaaac acagctgata gatgaccatt tcctcttcga 1921 taagcctgtt tctcctcttc ttaccgccgc aagaatgcat cgtgattggc cacagggaag 1981 aggaatttgg cataacgaaa acaagaactt cttggtgtgg gtcaatgaag aagatcatat 2041 ccgtgtcatc tctatggaga aggatgggaa tatgcgagcg gtcttcaagc gattctgtga 2101 aggtttgcag aagttcgagc agatgatcaa aaaggatggc aaggagttca tgtggaacaa 2161 acatctgggt tacgtactta cttgtccgtc taacctggga accggcctac gcgctggcgt 2221 acatgtgaaa ctgcctctgc tgagcaagta tccgagattc gaccagatcc taagagccct 2281 ccgacttcag aagagaggta caggtggagt agatactgcc tcaactgatg gaaccttcga 2341 catctcgaac ctcgacaggc ttggatcatc agaagtacag caagttcagt ttgttgtcga 2401 tggcgttgaa cttctcgtcc aaatggagaa gaaacttgaa aagggcgagg acatttttga 2461 tattctacca cagcaatgcc gtcccaaacc accaataaag cctttcagtt atgactaccc 2521 cgatttctct ctgcacaaca actggatgtc taaatgtatg acagaagaaa tctacaacaa 2581 actttgcaac ttgaagacaa aaggtggagt caccctgaat gactgcattc aaactggcat 2641 tgacaatcct ggacatccat acatcatgac tgttggtttg gtagctggag atgaagagtg 2701 ctacgaggtt ttcgctcctc tcttcgaccc ggtgataagt gccaggcacg gcggctacgc 2761 tcttgatgcc aaacacccta ccaacctcaa tgcagcagaa ctcaaaggcg gagatgattt 2821 ggaccctgag ttcgtcctat cttgtcgtgt acgtactggt cgttgcatcc gcggcttggc 2881 tctcccgcca tgttgcacca gagcagaacg tgcggaagta gagaagatta caacagaagc 2941 attatctacc ctaagtggac ctctgaaggg aaagtactac ccactaaccg gtatgacaga 3001 cgaagagcaa gagaagctca tcgaagacca cttcctcttt gacaagcccg tatccccgct 3061 gttgctttgt gccaatatgg cacgtgactg gccccagggt cgtggcatat ggcacaatga 3121 tgagaagaat ttccttgtct gggtcaacga ggaagatcac actcgagtca tctccatgga 3181 gaagagcggg aacatgaagc gagtgtttga gaggttctgc gacggtctca aaaaggtcga 3241 agactccatc aagtccaagg gttaccagtt catgtggaat gagcatcttg gttatgtact 3301 gacatgtcca tccaacctcg ggacgggact gcgtgcaggc gtccatgtca aggttccttt 3361 gttaagtcaa cagaaaatct ttgactctat ccttgatcac atgcgtctgc aaaagcgagg 3421 aacaggaggt gtagacacag cctctactga cggcaccttt gatatctcca actcggatcg 3481 catcggtttc tccgaagtcc atcttgtcca acaactcgta gacggtgtca agctcctcgt 3541 caacctcgaa aaggctctca tgaagggaga agacatcaac agtctcctgc cagagaagct 3601 aagggaagat tcctcataaa gccaggatcc acagcttgca gaatagtggc cgttgtcgcg 3661 caagtgttat aaatgccacc tttagaagat gtagcctaca cattaaaaaa aaatacaatt 3721 agaattgaaa tctataagga cgatttgaat ataccataaa gatttgccca tgattttgaa 3781 tcaaaataac ttcatgctga acttgtaaat gaaaaagtaa tgttagaaaa ctctgtaaat 3841 tgggaagata taaaatttca tggccgtttt tctttctgat gaagtacact aattatatta 3901 atttctctta ttaatatgta tggaaacttt taattatgta aaaacaagga aacgaactga 3961 aatacaattc aaactacttt tcatt // LOCUS BSUSACP 3518 bp ds-DNA BCT 03-AUG-1990 DEFINITION B.subtilis sacT gene, complete cds, enzyme II (sacP) gene (of the sucrose phosphotransferase system), complete cds, and sucrase gene, 5' end. ACCESSION J03006 M33761 KEYWORDS enzyme II; sucrase; sucrose transferase system. SOURCE B.subtilis (strain Marburg 168) DNA, clone pBSG8-10. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1987 to 3518) AUTHORS Fouet,A., Arnaud,M., Klier,A. and Rapoport,G. TITLE Bacillus subtilis sucrose-specific enzyme II of the phosphotransferase system: Expression in Escherichia coli and homology to enzymes II from enteric bacteria enzymes II from enteric bacteria JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 8773-8777 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 2070) AUTHORS Debarbouille,M., Arnaud,M., Fouet,A., Klier,A. and Rapoport,G. TITLE The sacT gene regulating the sacPA operon in Bacillus subtilis shares strong homology with transcriptional antiterminators JOURNAL J. Bacteriol. 172, 3966-3973 (1990) STANDARD full staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by A.Fouet, 16-SEP-1987. Draft entry and computer-readable sequence for [2] kindly submitted by M.Debarbouille 18-APR-1990. FEATURES from to/span description pept 136 897 sacT protein pept 2008 3390 enzyme II (sacP) pept 3387 > 3518 sucrase BASE COUNT 1013 a 712 c 867 g 926 t ORIGIN 1 ctttacatgg cggtcaaaag gaggtaaaat cccagtcaaa agcaaattgg cgggagagat 61 aacctcttga aaatctataa agtattaaac aacaatgcgg ctttaataaa agaggatgat 121 caggaaaaaa tcgtgatggg accgggaatc gcttttcaaa aaaagaaaaa tgatctcatc 181 cctatgaata aagtggaaaa gatctttgtc gtgcgcgacg agaatgaaaa gttcaaacaa 241 atcctgcaaa cactgccgga ggagcatata gaaatagctg aggatatcat cagctatgcg 301 gaaggagagc tcgcggcgcc gctgagcgat cacattcata tcgcgctttc cgaccatttg 361 tcctttgcga tcgaaaggat tcaaaatggg ctgcttgtgc agaataaatt gctgcatgag 421 ataaaggcgc tctataaaaa agaatatgag atcggcctgt gggcgatcgg acatgtaaaa 481 gagacattgg gcgtgtctct gcctgaagat gaagcgggtt atattgccct tcacatccat 541 acggcgaaga tggatgcgga gagcatgtat tcagcgctga agcatacgac catgatcaaa 601 gaaatgatag agaaaataaa acaatacttt aatcgaaagg tggatgaaaa cagcatttcc 661 tatcaacgcc ttgtcacgca tttgcgatac gcggtcagcc ggttggaatc aaatgaagcg 721 ctccatcgta tggacgagga gatgctttat ttcatccaaa aaaagtattc attcgcctat 781 caatgtgcgc tggagctggc ggagtttttg aaaaatgaat atcaattaca tttgccggaa 841 tccgaggccg gctatatcac gctgcatgtc cagcgtcttc aagatctctc ggaataaccg 901 ctttgacttg cagggagtga tctctggaag tttttttatt gatcaggatt cagtgtgtag 961 tacatccagc ccatcataac tattcccgcg gtcagattgc cgagcgtaac ggggattaag 1021 tttctgactg ctcccatcag tgtcactgta tcagggtgct cgatcaaaag cgagatggcg 1081 aatgtgcaca tattggcaat gctgtgttca aagccggaaa taaagaagca gaaaacgaaa 1141 agcatcatgg taaaaagctt tgctccttcc cctttgagag acattggaat gaaaaaggcg 1201 aggcacacaa gccaattgca cagcattcct ctgaaaaaca attcggaagc cggcggctcc 1261 attttgtgct ctgccaaatg aatcaaaaag gaatgaacag aaggctcctc aaaaagtccc 1321 gtcgcgctga tcaggatggc aaacagaatg gcgccgatta aattgccggc atagcttgac 1381 atccacaagt atagggtgtc gcgccagctg atttttttcc ggagcgcggt ataggtgaaa 1441 taaaaggtgt tgccggtaaa taaatctccg cctccgtaag caatcatcag aatagcggcc 1501 ccgaaagtga cagcggctgc cggaaaggca aacggagaat cagccataaa gaaatagctg 1561 cctgttttgc ttgcggctgt aatgccaaaa ccaataaata tactggccaa aatggaccga 1621 agcacataac ggattttgct tgaagcgaat atgttttgtt ttttcaaagc atactgttca 1681 accttttgta atgcttgagt ttccattgtc ttttcctgct ttctttcatg attgagcata 1741 gtatggcacg atattgtgaa gtatttcaca attaattttt acgaattttt gtctacatgt 1801 tatatcaaac tatttaacta aaaaaatagt tgacgaaaac gctatcatga tttatgatga 1861 aagcgtattc ttaatctgaa taagcgggat tgtgactggt aaagcaggca agacctaaaa 1921 tttgcgtaaa tgaaaaagga tcgctgtgtc ctttattcgt tggcgaattt taggtctttt 1981 ttgtttaaat aaaaggggga gaaatagatg gattacaaag agactgcaaa acgcctcatt 2041 gagcttctcg gagggaaaga aaatattatc agcgcggctc attgtgcaac aagactgcgt 2101 ttagtgatga aagatgaatc aaagatagat caagcacaag tagaagagct tgatggcgtt 2161 aaaggggctt tcagcagctc tggccagtac cagatcattt tcggaacagg ccttgtgaat 2221 aaagtattcg atgccttttc taaagaagct gatatcgagc gtgaagaaca cgtcaatcat 2281 caggatgcgg caaaagaaaa gctgaatccc gctgcgagat ttgcgaaaac gctttctaat 2341 atttttgttc caatcatccc agctattgta gccagcggcc tattaatggg attgctgggt 2401 atgatcaatg cgtttcattg gatgagcaag gattctgcgc ttttgcagct gcttgatatg 2461 ttttcaagtg cagcattcat tttcttgccg attctaatcg gggtcagcgc ttcaaaagag 2521 tttggcagca atccatactt gggagcggtc atcgggggaa tcatgattca tccgaatctt 2581 ttaaatccat ggggattggc ggaagaacaa ctgattacat gcatcttttc ggatttgata 2641 tcgctcttct cggctacagg gaactgttat ccctgtcctg cttgcggtgt atgtgatgag 2701 caaggtagag aaatggacga gaaaagtggt tccacatgcg gtgatttgct tgttacacca 2761 tttgttactg tgattgtcac cgggtttgtt gcttttatcg cgattggccc tctaggcaga 2821 gcgcttggct ccggcattac ggttgcttta acttatgtat atgaccatgc cggttttgtc 2881 gcaggtctga ttttcggggg cacgtattca ctcatcgtgc tgacaggcgt tcatcacagc 2941 ttccatgcga ttgaggcagg gctaattgca gatataggca aaaactactt gctgccgatc 3001 tggtcaatgg cgaatgtcgc acagggcggg gcaggtcttg ccgtcttctt tatggcgaaa 3061 aaggcaaaga caaaagaaat cgcgcttccg gcagcatttt ccgcttttct cggcattact 3121 gagcccgtca tattcggagt caatctgcgc taccgaaaac cgtttatcgc cgcgatgatc 3181 ggaggcgcat tgggtggggc atatgtcgtc tttacacatg tagctgcaaa cgcttacggg 3241 ttaacgggaa ttccgatgat cgccattgcg gctccgtttg gcttcagcaa cttgattcac 3301 tatctgattg gcatggcgat tgcggcagtg tcagcgttca ttgcagcatt tgtcatgaag 3361 attaatgagg atgaggagag gaaaaaatga cagcacatga ccaggagctt cgtcgccggg 3421 cttatgaaga agtggagaaa aaagagccca tcgctaacag cgatccgcac cgccagcatt 3481 ttcatatcat gccgccggtt gggctgctga atgacccg // LOCUS BOVACTH 1084 bp ss-mRNA MAM 03-AUG-1990 DEFINITION Bovine corticotropin-beta-lipotropin mRNA, complete cds. ACCESSION M25587 KEYWORDS adrenocorticotropic hormone; corticotropin-beta-lipotropin; lipotropin; melanocortin. SOURCE Bovine, cDNA to mRNA, clone pSNAC20. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1084) AUTHORS Cohen,S.N., Chang,A.C.Y., Nakanishi,S., Inoue,A., Kita,T., Nakamura,M. and Numa,S. TITLE Studies of cloned DNA encoding the structure for the bovine corticotropin-beta-lipotropin precursor protein JOURNAL Ann. N.Y. Acad. Sci. 343, 415-425 (1980) STANDARD simple staff_review FEATURES from to/span description pept 114 911 corticotropin-beta-lipoprotein precursor sigp 114 191 corticotropin-beta-lipoprotein signal peptide matp 342 377 gamma-melanocortin (gamma-MSH) matp 507 548 adrenocorticotropic hormone (ACTH) matp 630 908 beta-lipotropin matp 756 809 beta-melanocortin (beta-MSH) matp 816 908 beta-endorphin mRNA < 1 1084 corticotropin-beta-lipoprotein mRNA BASE COUNT 220 a 337 c 370 g 157 t ORIGIN 1 aagagaacga agggaagaaa agtgaccgag acgccgctga acatcctcgc cccggcgcag 61 cgggagccgc ccgagccagc ttccccgtga cagagcctca gcctgcctgg aagatgccga 121 gactgtgcag cagtcgttcg gccgccctgc tgctggcctt gctgcttcag gcctccatgg 181 aagtgcgtgg ttggtgcctg gagagcagcc agtgtcagga cctcaccacg gaaagtaacc 241 tgctggcgtg catccgggcc tgcaagcccg acctctccgc cgagacgccg gtgttccccg 301 gcaacggcga tgagcagccg ctgactgaga acccccggaa gtacgtcatg ggccatttcc 361 gctgggaccg cttcggccgt cggaatggta gcagcagcag cggagttggg ggcgcggccc 421 agaagcgcga ggaggaagtg gcggtgggcg aaggccccgg gccccgcggc gatgacgccg 481 agacgggtcc gcgcgaggac aagcgttctt actccatgga acacttcccc tggggcaagc 541 cggtgggcaa gaagcggcgc ccggtgaagg tgtaccccaa cggcgccgag gacgagtcgg 601 cccaggcctt tcccctcgaa ttcaagaggg agctgaccgg ggagaggctc gagcaggcgc 661 gcggccccga ggcccaggct gagagtgcgg ccgcccggcc tgagctggag tatggcctgg 721 tggcggaggc ggaggctgag gcggccgaga agaaggactc ggggccctat aagatggaac 781 acttccgctg gggcagcccg cccaaggaca agcgctacgg cgggttcatg acctccgaga 841 agagccaaac gccccttgtc acgctgttca aaaacgccat catcaagaac gcccacaaga 901 agggccagtg agggcgcagc gggcaggggc ctctctccgc ggaaagttga ccctgaaggc 961 ctctcttctg ccctcctacc gcctcgcagc ctgggtgagg attcgcccag gcagtgatgg 1021 cgccaggtat cccgactctt aaagctgtct gtagttaaga aataaaacct ttcaagtttc 1081 acga // LOCUS ECOK88ACF 861 bp ds-DNA BCT 03-AUG-1990 DEFINITION E.coli K88ac fimbriae subunit gene, 5' end. ACCESSION M35954 KEYWORDS K88ac fimbriae subunit; antigen. SOURCE E.coli (K12, strain MC1000) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 861) AUTHORS Josephsen,J., Hansen,F., de Graaf,F.K. and Gaastra,W. TITLE The nucleotide sequence of the protein subunit of the K88ac fimbriae of porcine enterotoxigenic Escherichia coli JOURNAL FEMS Microbiol. Lett. 25, 301-306 (1984) STANDARD simple staff_review FEATURES from to/span description pept 13 > 861 K88ac fimbriae subunit precursor sigp 13 75 K88ac fimbriae subunit signal peptide matp 76 > 861 K88ac fimbriae subunit BASE COUNT 227 a 154 c 230 g 250 t ORIGIN 1 gggtttaatc ttatgaaaaa gactctgatt gcactggcaa ttgctgcatc tgctgcatct 61 ggtatggcac atgcctggat gactggtgat ttcaatggtt cggtcgatat cggtggtagt 121 atcactgcag atgattatcg tcagaaatgg gaatggaaag ttggtacagg tcttaatgga 181 tttggtaatg tattgaatga cctgaccaat ggtggaacca aactgaccat tactgttact 241 ggtaataagc caattttgtt aggccgaacc aaagaagcat ttgctacgcc agtaactggt 301 ggtgtagatg gaattcctca tattgcattt actgactatg aaggagcttc tgtagtactc 361 agaaaccctg atggtgaaac taataaaaaa ggtttagcat attttgttct gccgatgaaa 421 aatgcagagg gcactaaagt tggttcagtg aaagtgaatg catcttatgc cggtgtgtta 481 gggagaggtg gggttacttc tgcggacggg gagctgcttt cgctttttgc cgacgggttg 541 agcgctatct tttatggtgg tttgccgagg ggttctgaat cgtcggctgg gagtgccgca 601 gcggcgcgca caaagttgtt tggaagtcta tcaagaaatg atattctcgg acagattcaa 661 agagtaaacg caaatattac ttctcttgtt gacgtcgcag gttcttacag ggaaaacatg 721 gagtacactg atggaactgt tgtttctgct gcctatgcac tgggtattgc aaacggtcag 781 actattgagg caacttttaa tcaggctgta actaccagca ctcagtggag cgctccgctg 841 aacgtagcaa taacttatta c // LOCUS LBARNII 400 bp ds-DNA BCT 03-AUG-1990 DEFINITION L.acidophilus RNA polymerase (RNAI and RNAIIa) genes, 5' promoter region. ACCESSION M35739 KEYWORDS RNA polymerase. SOURCE L.acidophilus DNA, clone pRNL5. ORGANISM Lactobacillus acidophilus Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 400) AUTHORS Natori,Y., Kano,Y. and Imamoto,F. TITLE Characterization and promoter selectivity of Lactobacillus acidophilus RNA polymerase JOURNAL Biochimie 70, 1765-1774 (1988) STANDARD simple staff_review FEATURES from to/span description mRNA 143 > 400 RNAI mRNA mRNA 209 < 1 (c) RNAIIa mRNA BASE COUNT 149 a 69 c 67 g 115 t ORIGIN 1 tgcgcggggt tcgagctacg gccaaaagaa ccccgcgttg gaacatctgg cacaaacttt 61 ttctcatttt taactttttt ttaatttaat ttagacaaaa aaagaagcct taactggctt 121 cgatctgtgg tataattttt ttaaagtaaa caaaaaaagt gctaactaaa acaacaacaa 181 cagcactttt tcaatcaact tttcaacttc gaatttatta tagcataaaa ctagtaaaaa 241 taaatacaaa aaaaaattgc ctgtattatc ttcgaagctg ggaagttgtc ccactatcgg 301 aggttttttt tatgggaaaa atcgtaactt taccacaagg ataggctagt cagcctaagt 361 ctaaaaagaa aaaaagcaag aagagttcag gcagtgcaag // LOCUS LEIKPGRA 44 bp ss-RNA RNA 03-AUG-1990 DEFINITION L.tarentolae kinetoplast minicircle guide RNA (COIII gRNA-I). ACCESSION M33870 KEYWORDS guide RNA. SOURCE L.tarentolae (strain UC) kinetoplast minicircle D-12-1, guide RNA. ORGANISM Kinetoplast Leishmania tarentolae Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Leishmania tarentolae. REFERENCE 1 (bases 1 to 44) AUTHORS Sturm,N. and Simpson,L. TITLE Kinetoplast DNA minicircles encode guide RNAs for editing of cytochrome oxidase subunit III mRNA JOURNAL Cell 61, 879-884 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Simpson 13-APR-1990. BASE COUNT 17 a 7 c 9 g 11 t ORIGIN 1 ctatctttac gaggtaaaga cagagagatg aaaacactat tcgt // LOCUS LEIKPGRB 46 bp ss-RNA RNA 03-AUG-1990 DEFINITION L.tarentolae mitochondrial minicircle guide RNA (4E-2 gRNA). ACCESSION M33871 KEYWORDS guide RNA. SOURCE L.tarentolae mitochondrial minicircle Lt154, guide RNA. ORGANISM Mitochondrion Leishmania tarentolae Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Leishmania tarentolae. REFERENCE 1 (bases 1 to 46) AUTHORS Sturm,N. and Simpson,L. TITLE Kinetoplast DNA minicircles encode guide RNAs for editing of cytochrome oxidase subunit III mRNA JOURNAL Cell 61, 879-884 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Simpson 13-APR-1990. BASE COUNT 27 a 5 c 5 g 9 t ORIGIN 1 ataaaacaca acaaaaaaca tagagaaatc atagagtgtt aaatat // LOCUS MUSIFNRES 508 bp ds-DNA ROD 03-AUG-1990 DEFINITION Mouse MHC class II-associated invariant chain gene, exon 1. ACCESSION M35872 KEYWORDS MHC class II-associated invariant chain. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 508) AUTHORS Eades,A.-M., Litfin,M. and Rahmsdorf,H.J. TITLE The IFN-gamma response of the murine invariant chain gene is mediated by a complex enhancer that includes several MHC class II consensus elements JOURNAL J. Immunol. 144, 4399-4409 (1990) STANDARD simple staff_review FEATURES from to/span description pept 432 > 508 invariant chain protein, exon 1 mRNA 348 > 508 invariant chain protein mRNA site 87 158 IFN-gamma response element signal 267 272 SP1 binding site signal 255 258 CAAT box signal 315 319 TATA box BASE COUNT 133 a 131 c 140 g 104 t ORIGIN 1 gggtgggggt gggtagacaa aggcactttc tgcttcaaaa ataattcctt ccctggggaa 61 agtgcctgct tctagaaccg tccaaagagc ctgatgaatc cagaagtctg cctagaaaca 121 agtgatgata gccctggcca gccaatggga tcatgcaggc ctttctacct gtttagggaa 181 ctcccccttc atcctgccca gggaggcagc tttgagtgag tggggaattt ccagatttgt 241 ggctttcagt tccacatcta ccatgtgggc ggagtgacct gctgtgggcg aatcagattc 301 cttccagtat cagctttaag aggtgatctt ggggctcaag ggtcccagac acacagcagc 361 agcagcagca gcagcagcag caacagcagc agcagcagca gcgcctgtgg gaaaaaatag 421 aggctagagc catggatgac caacgcgacc tcatctctaa ccatgaacag ttgcccatac 481 tgggcaaccg ccctagagag ccagaaag // LOCUS SOYBCONGL 794 bp ss-mRNA PLN 03-AUG-1990 DEFINITION Soybean beta-conglycinin-alpha subunit mRNA, 3' end. ACCESSION M26128 KEYWORDS beta-conglycinin. SOURCE Soybean, cDNA to mRNA, clone pB36. ORGANISM Glycine max Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 794) AUTHORS Coates,J.B., Medeiros,J.S., Thanh,V.H. and Nielsen,N.C. TITLE Characterization of the subunits of beta-conglycinin JOURNAL Arch. Biochem. Biophys. 243, 184-194 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 659 beta-conglycinin-alpha subunit (AA at 3) mRNA < 1 794 beta-conglycinin-alpha subunit BASE COUNT 253 a 153 c 177 g 211 t ORIGIN 1 tgagcaaacg tgccaaatct agttcaagga aaaccatttc ttctgaagat aaacctttta 61 acttgggaag ccgcgacccc atctactcca agaagcttgg caagttcttt gagatcaccc 121 cagagaaaaa cccccagctt cgggacttgg atatcttcct cagtattgtg gatatgaacg 181 agggagctct tcttctacca cacttcaatt caaaggcgat agtgatactg gtaattaatg 241 aaggagatgc aaacattgaa cttgttggcc taaaagaaca acaacaggag cagcaacagg 301 aagagcaacc tttggaagtg cggaaatata gagccgaatt gtctgaacaa gatatatttg 361 taatcccagc aggttatcca gttgtggtca acgctacctc aaatctgaat ttctttgcta 421 ttggtattaa tgccgagaac aaccagagga acttcctcgc aggttcgcaa gacaatgtga 481 taagccagat acctagtcaa gtgcaggagc ttgcattccc tgggtctgca caagctgttg 541 agaagctatt aaagaaccaa agagaatcct actttgtgga tgctcagcct aatgaaaaag 601 aggagggtaa taagggaaga aagggtcctt tgtcttcaat tttgagggct ttttactgaa 661 taagtatgta ctaaaatgta tgctgtaata gctcatagtg agcgaggaaa gtatcgggct 721 atgtaactat gactagagct tcaactatga ataaataaat cgacagcata tgatgctttt 781 gttttgtgtt cttc // LOCUS SOYGLYB4 911 bp ss-mRNA PLN 03-AUG-1990 DEFINITION Soybean glycinin A3-B4 subunit mRNA, 3' end. ACCESSION M35671 KEYWORDS glycinin; glycinin-B4 subunit. SOURCE Soybean (var. Early-10) seed, cDNA to mRNA, clones pSPG[204,673]. ORGANISM Glycine max Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 911) AUTHORS Epishin,S.M., Kotenko,S.V., Zakharova,E.S., Kiselev,S.L. and Vinetskii,Y.P. TITLE Cloning and structural analysis of cDNA encoding a portion of the A-3-B-4 glycinin subunit JOURNAL Genetika 22, 810-815 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 723 glycinin precursor matp 202 720 glycinin-B4 subunit mRNA < 1 911 glycinin mRNA BASE COUNT 280 a 220 c 203 g 208 t ORIGIN 1 caagaagacg aagacgaaga tgaagacgaa gaatatgaac aaactccctc ttatcctcca 61 cgacgaccaa gccatggaaa gcatgaagat gacgaggacg aggacgaaga agaacatcaa 121 cctcgtcctg atcaccctcc acagcgacca agcaggcccg aacaacaaga accacgtgga 181 agaggatgtc agactagaaa tggggttgag gaaaatattt gcaccatgaa gcttcacgag 241 aacattgctc gcccttcacg tgctgacttc tacaacccaa aagctggtcg cattagcacc 301 ctcaacagtc tcaccctccc agccctccgc caattcggac tcagtgccca atatcttgtc 361 ctctacagga atggaattta ctctccacat tggaacttga acgcgaacag tgtgatctat 421 gtgactcgag ggaaaggaag agttagagtg gtgaactgcc aagggaatcc agtgttcgac 481 ggtgacctaa ccaggggaca attgctactg gtgccgcaga actttgtggt cgctgaccaa 541 gggggaaaac aaggattgga atacgtagtg ttcaagacac agcacaacgc cgtgagcagc 601 tacattaagg atctgtttag ggcaatccct tcggaggttc tttccaattc ttacaacctt 661 ggccagagtc aagtgcgtca gctcaagtat caaggaaact ccggcccttt gctcaaccca 721 taaataacaa caagcatata tcaaggtgtg gtgaggccat cttatatgaa taatatgaaa 781 atatattttg tgtaataata aaactatggc ctatgtattt accaccctcc gacccagcct 841 atgttaatat ctgagtggcg ttgtaccttt gaatcgcctt aataaaatgt cagtcttcaa 901 gttttgtctt t // LOCUS TLARGN 133 bp ds-DNA PLN 03-AUG-1990 DEFINITION T.lanuginosus 26S ribosomal RNA gene, partial. ACCESSION M35561 KEYWORDS 26S ribosomal RNA. SOURCE T.lanuginosus DNA. ORGANISM Thermomyces lanuginosus Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 133) AUTHORS Nazar,R.N. and Wong,W.M. TITLE Is the 5S RNA a primitive ribosomal RNA sequence? JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 5608-5611 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 133 23S rRNA BASE COUNT 39 a 23 c 39 g 32 t ORIGIN 1 gtagcaaata ttcaaatgag aactttgaag actgaagtgg ggaaaggttc cacgtcaaca 61 gcagttggac gtgggttagt cgatcctaag agatggggaa gctccgtttc aaaggcctga 121 ttttatgcag gcc // LOCUS YSCRGNA 252 bp ds-DNA PLN 03-AUG-1990 DEFINITION S.cerevisiae 26S ribosomal RNA gene, partial. ACCESSION M35562 KEYWORDS 26S ribosomal RNA. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 252) AUTHORS Nazar,R.N. and Wong,W.M. TITLE Is the 5S RNA a primitive ribosomal RNA sequence? JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 5608-5611 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 252 23S rRNA BASE COUNT 62 a 60 c 75 g 55 t ORIGIN 1 gtagcaaata ctcaaatgag aactttgagg actgaagtgg ggaaaggttc catgtgaaca 61 gcagttggac atgggttagt cgatcctaag acacagggaa gttccgtttg aaaggcgccc 121 tggtgcgccg ttgacatgca acagctggga ttcgctggtg gtcacccacc caactactaa 181 ccagccggcg tgtggcttaa gtacggctga gcggacggga agccctgttt tccacaccct 241 atggtcgcat gt // LOCUS YSCSRP 413 bp ds-DNA PLN 03-AUG-1990 DEFINITION S.cerevisiae serine rich protein (srp) gene, 5' end. ACCESSION M35738 KEYWORDS serine rich protein. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 413) AUTHORS Marguet,D. and Lauquin,G.J.-M. TITLE The yeast srp gene: Positive modulation by glucose of its transcriptional expression JOURNAL Biochem. Biophys. Res. Commun. 138, 297-303 (1986) STANDARD simple staff_review FEATURES from to/span description pept 408 > 413 serine rich protein (srp) mRNA 347 > 413 SRP mRNA (alt.) mRNA 355 > 413 SRP mRNA (alt.) mRNA 404 > 413 SRP mRNA (alt.) BASE COUNT 123 a 83 c 71 g 136 t ORIGIN 1 gatccatctt tcttcgcgcg tttattttca gcaccatgtt ttaggttttt acagcatcgt 61 ttaaggaacc caacaataca atagcgggaa gaatgcactt tctcgttcca taaagggtct 121 ctttcaccta tacggttggt acagatttcc agtgaatgcc agtcagccac ggcattacgt 181 cgtttgcttc tattttcttc gttttgaact gcgtttgtat gcaactgtcc cttgacagag 241 aaaaaagtga agcaaaatga cagacaaaga aatcttttgt ataaaaggtc ggttgaatct 301 tgttgttagc ttgaaatcag cttgcttttc tcctctaaat tacataaaaa accaagaaaa 361 tatcagactt tttcattcgc tttcaacaag tactacaata attaaaaatg gct // LOCUS AVIVNFDG 6557 bp ds-DNA BCT 03-AUG-1990 DEFINITION A.vinelandii nitrogenase (vnfHDGK) genes, complete cds. ACCESSION M32371 KEYWORDS nitrogenase 2; vanadium nitrogenase. SOURCE A.vinelandii (strain CA) DNA. ORGANISM Azotobacter vinelandii Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Azotobacteraceae. REFERENCE 1 (bases 1 to 6557) AUTHORS Joerger,R.D., Loveless,T.M., Pau,R.N., Mitchenall,L.A., Simon,B.H. and Bishop,P.E. TITLE Nucleotide sequence and mutational analysis of the structural genes for nitrogenase 2 of Azotobacter vinelandii JOURNAL J. Bacteriol. 172, 3400-3408 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.N.Pau, 28-FEB-1990. FEATURES from to/span description pept 238 1110 nitrogenase (vnfH) pept 2415 3839 nitrogenase (vnfD) pept 3839 4180 nitrogenase (vnfG) pept 4237 5664 nitrogenase (vnfK) pept 1227 1424 ferredoxin-like ORF rpt 1172 1192 inverted repeat rpt 4188 4219 inverted repeat rpt 5737 5760 inverted repeat binding 1214 1218 pot. ribosome binding site binding 2400 2406 pot. ribosome binding site binding 4222 4228 pot. ribosome binding site site 2311 2327 pot. NTRA-dependent promoter BASE COUNT 1359 a 2270 c 1874 g 1054 t ORIGIN 1 gatcgcttcc cggctgtacc tgcgggtacg tcgacggagc gcacagcatc ctggcctgga 61 tttatggagt ccaataaacc tgcaaaaatt aaaataattc actaattaaa tatgtttttt 121 tgattttata ttccaaaaaa taggcaatca tcgacttatc gatccttgtg gcaccgccct 181 tgctcaactc tcgtcggcac aaatcaaaac gccaacgaat caacggaggt tcctaagatg 241 gcattgcgtc agtgtgcaat ttacggcaag ggtggcatcg gcaagtccac caccacccag 301 aacctggtcg ccgccctcgc cgaagccggc aagaaagtga tgatcgtcgg ttgtgacccg 361 aaagccgact ccacccgcct gatcctgcac tccaaggccc agggcaccgt catggaaatg 421 gccgcgtccg ccggctcggt cgaagacctg gagctggaag acgtgctgca gatcggcttc 481 ggcggcgtca agtgcgtcga atccggtggc ccggagccgg gcgtcgcttg cgccggccgt 541 ggcgtgatca ccgcgatcaa cttcctggaa gaagaaggcg cctacagcga cgacctggac 601 ttcgtgttct atgacgtgct gggcgacgtg gtatgcggcg gcttcgccat gccgatccgc 661 gagaacaagg cccaggaaat ctacatcgtc tgctccggcg agatgatggc catgtacgcc 721 gccaacaaca tcgccaaggg catcgtgaaa tacgcccact ccggcagcgt gcgtctgggc 781 ggcctgatct gcaacagccg caagaccgac cgcgaagacg agctgatcat ggccctggcc 841 gcgaagatcg gcacccagat gatccacttc gtgccgcgcg acaacgtcgt gcaacacgcc 901 gaaatccgcc gcatgaccgt gatcgaatac gatccgaagg ccggacaggc cgacgagtac 961 cgtgccctgg ctcgcaaaat cgtcgacaac aagctgctgg tcatcccgaa cccggcctcc 1021 atggaagaac tcgaagagct gctaatggaa ttcggcatca tggaagtcga agacgagtcc 1081 gtcgtcggca aggccgccgc cgaaggctga ttcacccagc acagcgtttg cggaggagcg 1141 tgcgccgcgg gctttcggaa tggcttctcg cggccggcgg cgcacgccgc cctcccttcg 1201 aacaaccgac ctcaggagct gacaccatgg ccatggccat cgacggctac gaatgcactg 1261 tctgcggcga ctgcgagccg gtctgcccga ccggttcgat cgtcttcagg gacgatcact 1321 acgcgatcga agccgacagt tgcaacgaat gcaccgacgt gggcgagccg cgctgtctcg 1381 gcgtctgccc cgtcgacttc tgcatccagc cgctcgatga ctgaacactg aacgactccg 1441 caccccgttg ccggcggcag gacattccgc gccgtcctgc cgccggaccc agaacggcga 1501 tcgcttttcc tcaggggcga tcgccgtttt acttttcccc gctccgctag cgcccgcgga 1561 caacagccgt ccgtatccgc gccgttcgcc gtcctcgacc ggccacgatc gcggcacagc 1621 tcgccaagct acagcccgct ccacaagctg accatcggca tccagacctt cgccaagatc 1681 cgcgaagact tcttacgtcg acaagaccgg cttcgcccgg cgaccgatcg aacgggacac 1741 actacttcct ctcccggcca tgccgcttcg gcaagagccc gttcctcgat accctggccg 1801 aaccgttcgc cggcgacgaa ccttcgttcc gcaggctgca gatgcatgac cgatgggact 1861 ggcgccggcg ccatccggca ttgcggatcg gcttcggcgg cgcgccgatc cgggacggcg 1921 gataagctcg ccgagccgct gctcccaaag ccacgcccac ctccacgtcc cggcgcggaa 1981 gtggcgatcc ccagccgcac gtccccatgg tttccgggtc tggaaccgtc ccccggactt 2041 cccgtacgga cgcccggaag ccaccgcccc gacggccacg cacagccacg tacagacccc 2101 ccgctctccg tacttggccg aacgaaaaac cttttacgta acccgccggc cgtccacccg 2161 gttgtcccga acaggaaaaa aaagcccgga aaaggcttgc cccggcgctt ttccaaaacc 2221 tcgaaaatgc gcacaaataa ttgattcgaa aggattaatc tgagacagcg gcgatgccgc 2281 gaaaaattcc tgcaagcgct gcaaggacat atggcacgca tcctgccctt acctcttcgc 2341 caaccggttt ttcggtcccg gtcgagtgcc gtcaggggac tcgatccacg catagagccg 2401 aggagacttc aatcatgcca atggtattgc tggaatgtga caaggacata cccgagcgcc 2461 agaaacacat ctatctgaag gcgcccaacg aggacacccg cgagttcctg ccgatcgcca 2521 acgcggcgac catccccggc accctgtccg aacgcggctg cgccttctgc ggcgccaagc 2581 tggtgatcgg cggtgtgctc aaagacacca tccagatgat tcacggcccg ctcggctgtg 2641 cctacgacac ctggcacacc aagcgctacc cgaccgacaa cggccacttc aacatgaagt 2701 acgtctggtc gaccgacatg aaggaaagcc atgtggtctt cggcggcgag aaacgcctcg 2761 agaagagcat gcacgaagcc ttcgacgaaa tgcccgacat caagcggatg atcgtctaca 2821 cgacctgccc gaccgcgctg atcggcgacg acatcaaggc cgtggccaag aaggtgatga 2881 aggaccgtcc ggacgtggac gtcttcaccg tcgaatgccc cggcttctcc ggtgtgtccc 2941 agtccaaggg ccaccacgtc ctgaacatcg gctggatcaa cgagaaagtc gagacgatgg 3001 agaaggaaat caccagcgaa tacaccatga acttcatcgg tgacttcaat attcaaggtg 3061 atacccaact gctgcaaacc tactgggacc gcctgggcat ccaggtcgtc gcccacttca 3121 ccggcaacgg cacctacgac gacctgcgct gcatgcacca ggcccagctc aacgtggtga 3181 actgcgcccg ttcctccggc tacatcgcca acgagctgaa gaagcgctac ggcatcccgc 3241 gtctggacat cgactcctgg ggcttcaact acatggccga gggcatccgc aagatctgcg 3301 ccttcttcgg catcgaggag aagggcgagg agctgatcgc cgaggaatac gccaagtgga 3361 agccgaagct cgactggtac aaggagcgtc tgcaaggcaa gaaaatggcg atctggaccg 3421 gcggcccgcg cctgtggcac tggaccaagt cggtcgagga cgacctgggc gtgcaagtgg 3481 tggccatgtc ctccaagttc ggccatgagg aagacttcga gaaggtcatc gcccgcggca 3541 aggaaggcac ctactacatc gacgacggca acgagctgga attcttcgag atcatcgacc 3601 tggtcaagcc ggacgtgatc ttcaccggcc cgcgcgtcgg cgaactggtc aagaagctgc 3661 acatccccta cgtcaacggc cacggctacc acaacggccc gtacatgggc ttcgaaggct 3721 tcgtcaacct ggcccgcgac atgtacaacg ccgtgcacaa cccgctgcgc cacctggccg 3781 ccgtggatat ccgcgacaag tcgcagacta ccccggtcat cgtgcggggg gccgcctgat 3841 gagccagtcc catctcgacg atctgttcgc ctatgtcgag gagcgctgcc tgtggcagtt 3901 cttctcgcgc acctgggacc gcgaggaaaa catcgagggc gtgctcaatc aggtcggccg 3961 cctgctgacc ggccaggagc cgctgcgcgg caccccgcag gagcgcctgt tctacgccga 4021 cgccctggcc atggccaacg atgtccgcga gcgtttcccc tgggcttcgc aggtcaacaa 4081 ggaagagatc gagttcctgc tcgacggtct caagtcccgt ctggtcgacg tgaccatcac 4141 ccgctcgacc aaccgcgaac tcaaccacca cctctactga gtcatcggcg cggagacggc 4201 cggaacgccg tctgcgcgcg cgaaggagcc attttcatga gcaattgcga actgaccgtg 4261 ctgaagccgg cagaagtcaa gctgagcccg cgtgaccgcg agggcatcat caacccgatg 4321 tacgactgcc agccggccgg cgcccagtac gccggcatcg gcatcaagga ctgcatcccg 4381 ctggtccacg gcggccaggg ctgcacgatg ttcgtccgcc tgctgttcgc ccagcacttc 4441 aaggaaaact tcgacgtcgc ctccacctcg ctgcacgagg agtcggcggt gttcggcggc 4501 gccaagcgcg tcgaggaagg cgtgctggtc ctcgcccgcc gctacccgaa cctgcgcgtc 4561 atcccgatca tcaccacctg ctccaccgaa gtcatcggcg acgacatcga gggcagcatc 4621 cgggtctgca accgggcact cgaagccgaa ttcccggatc gcaagatcta cctggcgccg 4681 gtacacaccc cgagcttcaa gggcagccac gtcaccggct acgccgagtg cgtgaagtcg 4741 gtgttcaaga ccatcaccga cgcgcacggc aagggccagc cgagcggcaa gctcaacgtg 4801 ttcccgggct gggtcaaccc cggcgacgtg gtgctgctca agcgctactt caaggaaatg 4861 gacgtcgaag ccaacatcta catggacacc gaggacttcg actcgccgat gctgccgaac 4921 aagagcatcg agacccacgg ccgcaccacc gtcgaggaca tcgccgacag cgccaacgcc 4981 ctggccaccc tgtccctggc ccgctacgag ggcaacacca ccggcgagtt gctgcagaag 5041 accttcgcgg tgccgaatgc cctggtcaac accccctacg gcatcaagaa caccgacgac 5101 atgctgcgca agatcgccga agtcaccggc aaggagatcc ccgagtcgct ggtccgcgag 5161 cgcggcatcg ccctcgacgc cctggccgac ctggcgcaca tgttcttcgc caacaagaaa 5221 gtggcgatct tcggccaccc ggacctggtg ctcggcctgg cccagttctg catggaagtc 5281 gagctggaac ccgtactgct gctgatcggc gacgaccagg gcaacaagta caagaaggac 5341 ccgcgcatcg aggagctgaa gaacaccgcg cacttcgaca tcgagatcgt ccacaacgcc 5401 gacctctggg aactggagaa gcgcatcaac gccggcctcc agctcgacct gatcatgggt 5461 cactcgaagg gccgctacgt cgccatcgag gccaacatcc cgatggtccg cgtcggcttc 5521 ccgaccttcg accgcgccgg tctctaccgc aagccctcga tcggctacca gggcgccatg 5581 gaactgggcg agatgatcgc caacgccatg ttcgcccaca tggaatacac ccgtaacaag 5641 gagtggatcc tcaatacgtg gtgagttgag gtgccggagc ggtttccacg cactccggct 5701 gtcgagccga ccgagatgac ggcatgcacg gatctcccct cgcccatgca tgggcgaggg 5761 agttgcgagc cgtcacgccc gaggtcgttc cgacggcagg ccgatgcacc ggaaaaccgc 5821 tcccgccccc ggcggctccc ggctccggcc ggggcctccg ccccgtattc cagatccggc 5881 cgccatgccg cattcccggg agtgatgcca tgaaacagcg acaggaaatg gtcgcccact 5941 accgcgcctg cttcggcgag ctgtgcgccc gaccggaaca ccgtcctatc gaaccctaca 6001 cccgcccccg gcgcctgagc ttcgcggagc cggaaagcgc gaccgcccgt caggtgccgg 6061 cgcgcctggt actggccctg accagtgcct acgccctgct cgccgactgg caggaatccc 6121 gcgacccgtc gctggccgac ctgggcagtt ggcaacgcta cctcgccctg ccccgccgca 6181 ccccggcgga aaagctgatg gccgagatct tccgcatcct gcgcgtgttc cgcgccgcgg 6241 cgatccagca gaacggcgcc atcgaaatcc gcgacgacgg cctgatccgc gccagttgca 6301 cctacaaccg ctgcgcgctg agcctgctga tcacccaggc cggcctcgaa ctgctcaccg 6361 cctgcgtcgc ctgctacctc gaatccttcg accagcccta cagcgatgcc tacgaggaac 6421 tgctgctcgg ccagtactac gccgacatcg tcgccgagat ccgctccttc gccgacgacg 6481 accgggtgct gttccagttc cgccagaaag cctggttcaa ccgccatgtc cgcctggact 6541 gcgacaaccc gcgcctg // LOCUS ECOCYS 5755 bp ds-DNA BCT 03-AUG-1990 DEFINITION E.coli thiosulfate binding protein (cysP), sulfate permease (cysT, cysW, cysA) and o-acetylserine (thiol)-lyase-B (cysM) genes, complete cds. ACCESSION M32101 KEYWORDS cysA gene; cysM gene; cysP gene; cysT gene; cysW gene; o-acetylserine (thiol)-lyase-B; sulfate permease; thiosulfate binding protein. SOURCE E.coli K12 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1292 to 5755) AUTHORS Sirko,A., Hryniewicz,M., Hulamicka,D. and Boeck,A. TITLE Sulfate and thiosulfate transport in E.coli K12: Nucleotide sequence and expression of the cysTWAM gene cluster JOURNAL J. Bacteriol. 172, 3351-3357 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1291) AUTHORS Hryniewicz,M., Sirko,A., Palucha,A., Boeck,A. and Hulamicka,D. TITLE Sulfate and thiosulfate transport in E.coli K12: Identification of a gene encoding a novel protein involved in thiosulfate binding JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.Halanicka, 14-FEB-1990. FEATURES from to/span description pept 559 1575 thiosulfate binding protein (cysP) pept 1575 2408 sulfate permease (cysT) pept 2408 3283 sulfate permease (cysW) pept 3273 4370 sulfate permease (cysA) pept 4505 5416 o-acetylserine (thiol)-lyase-B (cysM; gtg start codon; EC 4.2.99.8) signal 516 521 -10 region site 5737 5755 right end of mu BASE COUNT 1259 a 1477 c 1682 g 1337 t ORIGIN 52 min on K12 map. 1 gttaacgcca tttgcccggg atacgtgcgc acaccaatgg cggaaagcat tgcccgccag 61 tcgaacccgg aagatccaga gtcggtgctg actgaaatgg cgaaagcaat cccgatgcgt 121 cgcctcgccg atccgctgga agtcggcgaa ctggcggcct tcctcgcatc ggatgaatcc 181 agctatttaa ccggtacaca gaatgtgatt gatggcggca gcacactgcc ggagacggtt 241 agcgtcggta tctgattcac ctctgtttcc tccctgcatt tgtggggagg atttcgtctt 301 gaactaagtt caccaggcta ttttatttgt cattttggcc ccgggcagtg ctcgaaatcc 361 tcacgtacta tgtgtacgct ccggtttctc cgcgctgttc gtgtccaaac tgactgcaac 421 aattacgcct gttgaaccaa gttcttattc ccttttcaac ttccaaatca ccaaacggta 481 tataaaaccg ttactccttt cacgtccgtt ataaatatga tggctattag aaagtcatta 541 aatttataag ggtgcgcaat ggccgttaac ttactgaaaa agaactcact cgcgctggtc 601 gcttctctgc tgctggcggg ccatgtacag gcaacggaac tgctgaacag ttcttatgac 661 gtctcccgcg agctgtttgc cgccctgaat ccgccgtttg agcaacaatg ggcaaaagat 721 aacggcggcg acaaactgac gataaaacaa tctcatgccg ggtcatcaaa acaggcgctg 781 gcgattttac agggcttaaa agccgacgtt gtcacttata accaggtgac cgacgtacaa 841 atcctgcacg ataaaggcaa gctgatcccg gccgactggc agtcgcgcct gccgaataat 901 agctcgccgt tctactccac catgggcttc ctggtgcgta agggtaaccc gaagaatatc 961 cacgattgga acgacctggt gcgctccgac gtgaagctga ttttcccgaa cccgaaaacg 1021 tcgggtaacg cgcgttatac ctatctggcg gcatggggcg cagcggataa agctgacggt 1081 ggtgacaaag gcaaaaccga acagtttatg acccagttcc tgaaaaacgt tgaagtgttc 1141 gatactggcg gtcgtggcgc gaccaccact tttgccgagc gcggcctggg cgatgtgctg 1201 attagcttcg aatcggaagt gaacaacatc cgtaaacagt atgaagcgca gggctttgaa 1261 gtggtgattc cgaaaaccaa cattctggcg gaattcccgg tggcgtgggt tgataaaaac 1321 gtgcaggcca acggtacgga aaaagccgcc aaagcctatc tgaactggct ctatagcccg 1381 caggcgcaaa ccatcatcac cgactattac taccgcgtga ataacccgga ggtgatggac 1441 aaactgaaag acaaattccc gcagaccgag ctgttccgcg tggaagacaa atttggctcc 1501 tggccggaag tgatgaaaac ccacttcacc agcggcggcg agttagacaa gctgttagcg 1561 gcggggcgta actgatgttt gctgtctcct ccagacgcgt gctgccgggc tttaccttaa 1621 gcctcggcac cagtctgctg tttgtgtgcc tgattttgct gctgccgctc tccgcgctgg 1681 tgatgcaact ggcccagatg agctgggcgc agtactggga ggtgatcacc aacccgcagg 1741 tggtcgcggc ctacaaagta acgctgctgt cggcgtttgt ggcatcgatt tttaacggcg 1801 ttttcggtct gctgatggcg tggatcctaa cccgctatcg cttcccaggc cgcacgctgc 1861 ttgatgcgct gatggattta ccctttgcgc tgccaacggc tgtcgccggt ttaacgctgg 1921 cctcgctctt ttccgtaaac ggtttttacg gtgaatggct ggcgaagttt gatatcaaag 1981 tcacctatac atggctgggg attgcggtgg ctatggcctt taccagcatt ccgtttgtgg 2041 tgcgtaccgt gcagccggtg ctggaagagt taggcccgga atatgaagaa gcggcggaaa 2101 cgcttggtgc aacgcgctgg cagagtttct gcaaagtggt gctgccggag ctttctccgg 2161 cgctggtggc gggcgtggcg ctgtcgttta cccgtagtct tggtgaattt ggcgcggtga 2221 tttttatcgc cggaaatatc gcgtggaaga cggaagtgac gtcgctgatg atttttgtgc 2281 gcttacagga gtttgattac ccggcagcga gcgcgattgc ttcggtgatc ctcgcggcat 2341 ctctgctgct gctgttctca attaacactc tgcaaagtcg ctttggtcgg cgtgtggtag 2401 gtcattaatg gcggaagtta cccaattgaa gcgttatgac gcgcgcccga ttaactgggg 2461 caaatggttt ctgattggca tcgggatgct ggtttcggcg ttcatcctgc tggtgccgat 2521 gatttacatc ttcgtgcagg cattcagcaa ggggctgatg ccggttttac agaatctggc 2581 cgatccggac atgctgcacg ccatctggct gacggtgatg atcgcgctga ttgccgtacc 2641 ggtaaacctg gtgttcggca ttctgctggc ctggctggtg acgcgcttta acttccctgg 2701 acgccagtta ctgctgacgc tactggacat tccgtttgcc gtatcgccgg tggttgccgg 2761 tctggtgtat ttgctgttct acggctctaa cggcccgctc ggcggttggc tcgacgagca 2821 taacctgcaa attatgttct cctggccggg aatggtgctg gtcaccatct tcgtgacgtg 2881 tccgtttgtg gtgcgcgaac tggtgccggt gatgttaagc cagggcagcc aggaagacga 2941 agcggcgatt ttgcttggcg cgtccggctg gcagatgttc cgtcgcgtca cattaccgaa 3001 catccgctgg gcgctgcttt atggcgtggt gttgaccaac gcccgcgcaa ttggcgagtt 3061 tggcgcggtg tcggtggttt ccggctcgat tcgcggcgaa accctgtcgc tgccgttaca 3121 gattgaattg ctggagcagg actacaacac cgtcggctcc tttaccgctg cggcgctgtt 3181 aacgctgatg gcgattatca ccctgttttt aaaaagtatg ttgcagtggc gcctggagaa 3241 tcaggaaaaa cgcgcacagc aggaggaaca tcatgagcat tgagattgcc aatattaaga 3301 agtcgtttgg tcgcacccag gtgctgaacg atatctcact ggatattcct tcaggtcaga 3361 tggtcgcgtt gctggggccg tccggttccg ggaaaaccac gctgctgcgc attatcgccg 3421 ggctggagca tcaaaccagc gggcatattc gcttccacgg caccgacgtg agccgcctgc 3481 acgcacgtga tcgtaaagtc ggtttcgtgt tccagcatta cgcgctgttc cgccatatga 3541 cggtgttcga caatatcgct tttggcctga cggtgctgcc gcgtcgcgag cgcccgaatg 3601 ccgcagccat caaagcgaaa gtgacaaaat tgctggaaat ggtccagctt gcccatctgg 3661 cggatcgtta tccggcgcac gtttccggcg gccagaaaca gcgcgtggcg ctggcgcgcg 3721 cgctggctgt ggaaccgcaa attctgctgc ttgatgaacc gtttggcgcg ctggatgcgc 3781 aggtgcgtaa agagctgcgt cgctggctgc gtcaactcca tgaagaacta aaattcacca 3841 gcgtttttgt gacccacgat caggaagaag cgaccgaagt agctgatcgt gtagttgtga 3901 tgagccaggg caatattgaa caggctgacg cgccggatca ggtatggcgc gaaccggcga 3961 cccgttttgt gctcgaattt atgggcgaag tgaaccgcct gcagggaacc attcgcggcg 4021 ggcagttcca tgttggcgcg catcgctggc cgctgggcta cacacctgcg tatcaggggc 4081 cggtggatct cttcctgcgc ccttgggaag tggatatcag ccgccgtacc agcctcgatt 4141 cgccgctgcc ggtacaggta ctggaagcca gcccgaaagg tcactacacc caattagtgg 4201 tgcagccgct ggggtggtac aacgaaccgc tgacggtcgt gatgcatggc gacgatgccc 4261 cgcagcgtgg cgagcgttta ttcgttggtc tgcaacatgc gcggctgtat aacggcgacg 4321 agcgtatcga aacccgcgat gaggaacttg ctctcgcaca aagcgcctga taggttgagt 4381 gaatgttaaa cgcccggagg cgcttcccgc gagtccgggc tttttaatgg caaggtttgt 4441 aacctgtaga cctgataaga cgcgcaagcg tcgcatcagg caacaccacg tatggataga 4501 gatcgtgagt acattagaac aaacaatagg caatacgcct ctggtgaagt tgcagcgaat 4561 ggggccggat aacggcagtg aagtgtggtt aaaactggaa ggcaataacc cggcaggttc 4621 ggtgaaagat cgtgcggcac tttcgatgat cgtcgaggcg gaaaagcgcg gggaaattaa 4681 accgggtgat gtcttaatcg aagccaccag tggtaacacc ggcattgcgc tggcaatgat 4741 tgccgcgctg aaaggctatc gcatgaaatt gctgatgccc gacaacatga gccaggaacg 4801 ccgtgcggcg atgcgtgctt atggtgcgga actgattctt gtcaccaaag agcagggcat 4861 ggaaggtgcg cgcgatctgg cgctggagat ggcgaatcgt ggcgaaggaa agctgctcga 4921 tcagttcaat aatcccgata acccttatgc gcattacacc accactgggc cggaaatctg 4981 gcagcaaacc ggcgggcgca tcactcattt tgtctccagc atggggacga ccggcactat 5041 caccggcgtc tcacgcttta tgcgcgaaca atccaaaccg gtgaccattg tcggcctgca 5101 accggaagag ggcagcagca ttcccggcat tcgccgctgg cctacggaat atctgccggg 5161 gattttcaac gcttctctgg tggatgaggt gctggatatt catcagcgcg atgcggaaaa 5221 caccatgcgc gaactggcgg tgcgggaagg aatattctgt ggcgtcagct ccggcggcgc 5281 ggttgccgga gcactgcggg tggcaaaagc taaccctgac gcggtggtgg tggcgatcat 5341 ctgcgatcgt ggcgatcgct acctttctac cggggtgttt ggggaagagc attttagcca 5401 gggggcgggg atttaaggat taatagcatc ggagactgat gacaaacgca aaactgcctg 5461 atgcgctacg cttatcaggc ctacaaggtt tctgcaatat attgaattag cacgattttg 5521 taggccggat aaggcgttta cgccgcatcc ggcataaaca aagcgcactt ttttaacagt 5581 tgttgctgcc gacaaatgca gtatttaatt ttcgtgagga aacgccgtaa ggtcattgaa 5641 gcggcgcacg aaaaaccgaa agcgtttcac gataaatgcg aaaactttac gtttcgcgct 5701 tcaaatgaaa cagatgtatt aattactact ttttattcat tacatgggga tccag // LOCUS TTHTRPBA 2586 bp ds-DNA BCT 03-AUG-1990 DEFINITION T.thermophilus tryptophan synthetase (trpAB) genes, complete cds. ACCESSION M32108 KEYWORDS tryptophan synthetase. SOURCE T.thermophilus (strain HB27) DNA. ORGANISM Thermus thermophilus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 2586) AUTHORS Koyama,Y. and Furukawa,K. TITLE Cloning and sequence analysis of tryptophan synthetase genes of an extreme thermophile, Thermus thermophilus HB27: Plasmid transfer from replica-plated E.coli recombinant colonies to competent T.thermophilus cells JOURNAL J. Bacteriol. 172, 3490-3495 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by Y.Koyama, 14-FEB-1990. FEATURES from to/span description pept 353 1567 tryptophan synthetase B (EC 4.2.1.20) pept 1564 2379 tryptophan synthetase A (gtg start codon; EC 4.2.1.20) BASE COUNT 396 a 879 c 905 g 406 t ORIGIN 1 bp upstream of BglII site. 1 agatctcgtt gacgctcgcc gtgcggtggc ggaaccactg cctcgccacg aagatggggg 61 ccttgacgtg aaacttgaag accaccatct caaaggggct cgtgtggcgg tggcgcatga 121 ggtagtcaat gagggcggcg tcctcccgca ccgtcttcgt cccttccccg taggagaccc 181 gggccgcctg gacgatggcg cggtcgtccc ccatctggtc caccaggcgg acgaagccct 241 tgtccaagac cgggatcgtg agaggccctt ccatcccccc gagtttaccg ggaggcccct 301 ccggggtagg atgggagttg tcttggcgcg aggcgccttt agggagcgaa gcatgctgac 361 cctacccgac tttcccttgc ccgacgcgag ggggcggttc ggcccctacg gggggcggta 421 cgtgcccgag accctgatcc ccgccctgga ggagttggag gccgcctacc gggaggccaa 481 gaaggatccg gccttcctgg aggagctgga ccactacctc aggcagtttg ccggcaggcc 541 cacgcccctt taccacgcca agcggctttc cgagtactgg ggtggggccc aggtcttcct 601 caagcgggag gacctcctcc acaccggggc ccacaagatc aacaacaccc tgggccaagc 661 cctcctggcg aggcgcatgg gcaagaggcg ggtcatcgcc gaaacggggg cggggcagca 721 cggggtgagc gtggccacgg tggcggccct cttcggcctg gagtgcgtgg tctacatggg 781 ggaggaggac gtcaggcgcc aggccttaaa cgtcttccgc atgaagctcc tcggggccga 841 ggtccggccc gtggccgcgg ggagccgcac cctcaaggac gccaccaacg aggccatccg 901 cgactggatc acgaacgtgc gcaccacctt ctacatcctg ggctccgtgg tgggccccca 961 cccctacccc atgatggtcc gggacttcca gagcgtcatc ggggaggagg tgaagcggca 1021 aagcctggag cttttcggcc gcctgcccga cgccctcatc gccgcggtgg ggggcggttc 1081 caacgccatc ggcctcttcg cccccttcgc ctacctgcct gagggcaggc ccaagctcat 1141 cggggtggag gccgccgggg aggggctttc cacgggcagg cacgccgcga gcatcggggc 1201 ggggaagcgg ggggtgctgc acgggagcta catgtacctc ctctacgacc acgatgggca 1261 gatcaccccg gcccactccg tctccgcagg gctggactac cccggggtgg ggccggagca 1321 cagctactac gccgacgccg gggtggcgga gtacgccagc gtcacggacg aggaggccct 1381 cgagggcttc aagctcctgg cgcggcttga gggcatcatc cccgccttgg agtcggccca 1441 cgccatcgcc tacgccgcca aggtggtgcc ggagatggac aaggaccagg tggtggtcat 1501 caacctctcc ggccgggggg acaaggacgt gaccgaggtg atgcgccttc tgggagggga 1561 gctgtgacca ccctcgaggc cttcgccaag gcccggtccg agggccgggc cgccctcatc 1621 ccttacctca ccgcgggctt ccccagccgg gagggctttt tgcaggcggt ggaggaggtc 1681 ctcccctatg ccgaccttct ggagatcggc ctcccctact ccgaccccct gggggacggc 1741 cccgtgatcc agcgggcaag cgaactggcc ctgaggaagg ggatgagcgt ccaaggcgct 1801 ttggagctcg tccgcgaggt gcgcgccctc acggagaaac ccctcttcct catgacctac 1861 ctgaaccccg tcctggcctg ggggccggag cgcttcttcg gcctcttcaa gcaggcgggg 1921 gccacggggg tgatccttcc cgaccttccc cccgacgagg acccaggcct cgtgcgcctc 1981 gcccaggaga tcggcctgga gacggtcttc cttctcgccc ccacctccac ggacgcccgc 2041 atcgccaccg tggtccgcca cgccacgggc ttctcgtacg ccgtctccgt caccggggtc 2101 acggggatgc gggaaaggct tcccgaggag gtcaaggacc tggtgcggcg catcaaggcc 2161 aggaccgccc tccccgtggc cgtgggcttc ggggtttccg gcaaggccac ggccgcgcag 2221 gcggcggtgg cggacggggt ggtggtggga agcgccttgg tccgggcctt ggaggagggg 2281 aggtcccttg cgccccttct ccaggagatc cgccagggcc tccagcgcct cgaggccaac 2341 cccggcctga aggaaagctc aaagaagccg ttgtcctgag ccttggaggg taaacccatg 2401 cggcagtggc cttttttagt tctggcaggc ggcctcctcc tcggggcctg tgggctcctt 2461 ccggggtttg gggggcctaa ggtctccgga gagttccagg gggactggtc cggcgtggcc 2521 cagggcctgc gcctggccct ggtgggcctc acgacggagg ggcaggtgaa ctacgacaac 2581 cagctg //
GenBank-Updates@genbank.bio.net (08/04/90)
LOCUS M25435 120 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 6. Possible secondary structure model of 5S RNA. ACCESSION M25435 REFERENCE 1 (bases 1 to 120) AUTHORS Rabin,D. and Crothers,D.M. TITLE analysis of rna secondary structure by photochemical reversal of psoralen crosslinks JOURNAL Nucleic Acids Res. 7, 689-703 (1979) STANDARD unannotated staff_entry BASE COUNT 23 a 36 c 41 g 20 t ORIGIN 1 tgcctggcgg ccgtagcgcg gtggtcccac ctgaccccat gccgaactca gaagtgaaac 61 gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac tgccaggcat // LOCUS M25592 55 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 4. The nucleotide sequence of the fragment is given at the top of the figure. ACCESSION M25592 REFERENCE 1 (bases 1 to 55) AUTHORS Ogata,R.T. and Gilbert,W. TITLE dna-binding site of lac repressor probed by dimethylsulfate methylation of lac operator JOURNAL J. Mol. Biol. 132, 709-728 (1979) STANDARD unannotated staff_entry BASE COUNT 16 a 9 c 16 g 14 t ORIGIN 1 cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacag // LOCUS M25594 75 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 4. (a) the cloverleaf model for the base pairing of native tRNA-Trp. ACCESSION M25594 REFERENCE 1 (bases 1 to 75) AUTHORS Jones,C.R., Kearns,D.R. and Muench,K.H. TITLE nuclear magnetic resonance of the base-pairing structure of the native and denatured conformers of escherichia coli transfer rna-trp JOURNAL J. Mol. Biol. 103, 747-764 (1976) STANDARD unannotated staff_entry BASE COUNT 13 a 21 c 24 g 14 t 3 others ORIGIN 1 aggggcgtag ttcaaddggd agagcaccgg tctccaaacc gggtgttggg agttcgagtc 61 tctccgcccc tgcca // LOCUS M25597 78 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 1: (A) The cloverleaf structure of E. coli tRNA-Met-f1. ACCESSION M25597 REFERENCE 1 (bases 1 to 78) AUTHORS Daniel,W.E.Jr. and Cohn,M. TITLE changes in tertiary structure accompanying a single base change in transfer rna. protein magnetic resonance and aminoacylation studies of escherichia coli trna-met-f1 and trna-met-f3 and their spin-labeled (s4u8) derivatives JOURNAL Biochemistry 15, 3917-3924 (1976) STANDARD unannotated staff_entry BASE COUNT 15 a 26 c 25 g 12 t ORIGIN 1 cgcggggtgg agcagcctgg tagctcgtcg ggctcataac ccgaaggatc gtcggttcaa 61 atccggcccc cgcaacca // LOCUS M26841 459 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 1. Nucleotide sequence coding for mature Il-1-beta and the corresponding amino acid sequence. ACCESSION M26841 REFERENCE 1 (bases 1 to 459) AUTHORS Gronenborn,A.M., Clore,G.M., Schmeissner,U. and Wingfield,P. TITLE a h-nmr study of human interleukin-1-beta JOURNAL Eur. J. Biochem. 161, 37-43 (1986) STANDARD unannotated staff_entry BASE COUNT 143 a 109 c 106 g 101 t ORIGIN 1 gcacctgtac gatcactgaa ctgcacgctc cgggactcgc agcaaaaaag cttggtgatg 61 tctggtccat atgaactgaa agctctccac ctccagggac aggatatgga gcaacaagtg 121 gtgttctcca tgtcctttgt acaaggagaa gaaagtaatg acaaaatacc tgtggccttg 181 ggcctcaagg aaaagaatct gtacctgtcc tgcgtgttga aagatgataa gcccactcta 241 cagctggaga gtgtagatcc caaaaattac caaaagaaga agatggaaaa gcgatttgtc 301 ttcaacaaga tagaaatcaa taacaagctg gaatttgagt ctgcccagtt ccccaactgg 361 tacatcagca cctctcaagc agaaaacatg cccgtcttcc tgggagggac caaaggcggc 421 caggatataa ctgacttcac catgcaattt gtgtcttcc // LOCUS M28421 142 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 5. The unique Xho I site of phi X174 DNA. ACCESSION M28421 REFERENCE 1 (bases 1 to 142) AUTHORS Lam,L.H. and Reynolds,R.J. TITLE dna sequence dependence of closely opposed cyclobuty1 pyrimidine dimers induced by uv radiation JOURNAL Mutat. Res. 178, 167-176 (1987) STANDARD unannotated staff_entry COMMENT BASE COUNT 43 a 27 c 28 g 44 t ORIGIN 1 gctttatcaa gataattttt cgactcatca gaaatatccg aaagtgttaa cttctgcgtc 61 atggaagcga taaaactctg caggttggat acgccaatca tttttatcga agcgcgcata 121 aatttgagca gatttgtcgt ca // LOCUS M28574 160 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 4. Sequence of tyrT DNA. ACCESSION M28574 REFERENCE 1 (bases 1 to 160) AUTHORS Drew,H.R. and Travers,A.A. TITLE dna bending and its relation to nucleosome positioning JOURNAL J. Mol. Biol. 186, 773-790 (1985) STANDARD unannotated staff_entry COMMENT BASE COUNT 40 a 42 c 38 g 40 t ORIGIN 1 ccggttacct ttaatccgtt acggatgaaa attacgcaac cagttcattt ttctcaacgt 61 aacactttac agcggcgcgt catttgatat gatgcgcccc gcttcccgat aagggagcag 121 gccagtaaaa agcattaccc cgtggtgggg gttcccgaga // LOCUS M28575 169 bp ds-DNA BAD 04-AUG-1990 DEFINITION Figure 4. Sequence of tyrT DNA. ACCESSION M28575 REFERENCE 1 (bases 1 to 169) AUTHORS Drew,H.R. and Travers,A.A. TITLE dna bending and its relation to nucleosome positioning JOURNAL J. Mol. Biol. 186, 773-790 (1985) STANDARD unannotated staff_entry COMMENT BASE COUNT 42 a 46 c 40 g 41 t ORIGIN 1 aattccggtt acctttaatc cgttacggat gaaaattacg caaccagttc atttttctca 61 acgtaacact ttacagcggc gcgtcatttg atatgaagcg ccccgcttcc cgataaggga 121 gcaggccagt aaaaagcatt accccgtggt gggggttccc gagccccgg // LOCUS RCANIFR4 2583 bp ds-DNA BCT 04-AUG-1990 DEFINITION Rhodobacter capsulatus nitrogen fixation (nifR4 and nifA) genes, 5' end. ACCESSION X12358 KEYWORDS nifA gene; nifR4 gene; nitrogen fixation. SOURCE Rhodobacter capsulatus (strain SB1003) DNA, clone pRCN200. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 2583) AUTHORS Jones,R. and Haselkorn,R. TITLE The DNA Sequence of the Rhodobacter capsulata ntrA, ntrB and ntrC gene analogues required for nitrogen fixation JOURNAL Mol. Gen. Genet. (1988) In press STANDARD simple automatic COMMENT See X07567 for overlapping sequence [1] Author address Jones R., University of Southern California, Mathematics Department DRB289 University Park Los Angeles, CA 90089-1113 USA. Submitted (15-JUL-1988) on tape to the EMBL data library by: FEATURES from to/span description pept 644 1924 nifR4 protein pept 2302 > 2583 nifA protein BASE COUNT 406 a 906 c 835 g 436 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttgctt gcggggcccg ttacggttgg cacgcttctt gctggatcat ttctgaagcc 61 tcaaccgatc aaaaaggggg atgcgatgaa ggatctcttc gacgaaagcc tgacgctgga 121 caccgcgctg ccgcgcccca ggcaccgccc gccgccgcgc cctggcgccg gcaggccgcg 181 gcgaaggccc ccgacaccgc cgccttcctt gcgaatttcg tcaggatcgg agagatcgcc 241 gccccgaaac ccccggccgc ggcctgtttc cgcccgagga agaagccgcg gtcgtcgccg 301 agctgatcgc cgagatgcgc ccgatgttcc agcgcgacgg cggcgacatc gagctgatcg 361 gcctcaccgg cgccaccgtt caggtgcggc tttcgggcag ctgcgcgggc tgcatgatgt 421 cggcgcgcac gctttccacg gtgcaacacc agctgatcga aaccctgggc cgccggtccg 481 ggtcgtcccc gagatccggc actgagatga tcgcccgccg ccccgcccaa aaacacggca 541 caacccaggg tggcgcggcg cggatttcgg cacctgaccc tttcttcacc tgcggcgccc 601 tgccgcccca cgcaatgacc cgagtttgag ataggttgtc cccatggaac tcgcgcagac 661 gctttcccaa agacagacga tgcagatggc ggggcagatg ctccattccc tcgccatcct 721 tggcatgtcg tcgcaggatt tgtccgagca tctgaccgaa caggccacat cgaacccctt 781 tctgacctac cgcgccccgc ccgccttcat cgcccgcggc ggcgaggatt tcgacgccgt 841 gggcgcggtg gccgcgcaca agcccagcct gatggcccat gtggtcgatc agatcgagat 901 ggccttcacc gagacccccg accggctgct ggcgctgcgc tttgccgagg cgctggagcc 961 gagcggctgg ctgggccaga gcctcgacag catcgcgctg gcggcgggcg tgtcgctttc 1021 gcgcgccgaa tcgatgctgg cggtcctgca gggcttcgag cctaccggcc ttttcgcccg 1081 cgacctgtcg gattgcctga tcctgcaggc gcgcgaggcc gacatcctga cctgggaagt 1141 cgaaacgctg atccgcaaca ttcggctgat cgcggaaaac cggctgtccg atctggccga 1201 tctttgcgat tgcgacatcg gcgacatccc cgaaatcatc aagcagatcc ggcacttgaa 1261 cccgaagccg gggcttgctt tcgaccatca gccgacgccc gttttcccgc ccgatctgat 1321 cgccgtgcgc ggcgccgagg gctggacggt ggaactcaac cgcgccacct cgcccacgat 1381 caccgtgcgc gaagatcgtt tcgccgatgg caccgccgat gcgaaggcgc gcgcggaacg 1441 gcgaaggcgt ggccgaggcc cgggcgctgg cgaggcgctg gagcgacgcc gcgatacgct 1501 gttgcgcacc gccgccgtgc tggtggcgcg gcaatcggcc ttcctcgaca aggggcccgc 1561 ccatctggtg ccgctgacgc tggaggatgt ggcctcggaa ctgggcctgc atgcctcgac 1621 gatttcgcgc gcggtctcgg ggcggatgat ccagacccag acccgggcgc tgccgctgcg 1681 ggcgttcttt tcccgcgcgg tctcgaccca gggcggtggc gaggcggtgt cgcgcgacag 1741 ccttgatttc gtgcagcgca cgtgggcggc gaagatccgc caaaacccgc tgtcggatga 1801 tgcgatcgtg acgctcgccg aacgcgccgg gctgcggatc gcgcggcgga cggtggcgaa 1861 ataccgctcg acgctgggcc ttgcctcctc ctacgagcgt cgccgcgccg ccgccgcgcg 1921 ctgagtctgc gaaacgcgac ctctgccaga ccggaagcgc catttttttc ggcgctccgg 1981 ccattcccgc ctcaaagttg atcttaacga agcctcaatc acccccaagt tgcgacatat 2041 tggcacctga gactgtttgc gccgtgcatc cgggcgcaag cttgaccaag tctcgacatg 2101 tccgccacgg cacccaaccg cgatcgaacg atccacgcag cgcagactgc ggcaccccat 2161 gggagggggt gatcgtcgtt tgatcccccg gcgccgggca agcgagggag aacagtgatg 2221 atcgacatcc gggaccggct cgtcccccaa ccgcagcaag acatcgctcg gcgcgggcca 2281 cggccgaccg ccttgccttg gatgcgcttt acgagatcgc gaagaccttt gcggcggccg 2341 gatccggtgg cgaggtgccg agatcttcaa cgtgctgtcg tctttcctcg atctgcgcca 2401 tggggtgctg gcgttctggc cgagccgggc gagggcgcgg gggtgaaccc ctatgtcatc 2461 gccgcgacgg cctttcagcg cagccccgag gcccccgccg cggatgtgct gcccgatgcc 2521 gtggcgcgga tcgtctttcg ctccggcgtg cctttcgtca gctttgatct ggccgccgaa 2581 ttc // LOCUS ACTCPA 634 bp ds-DNA PLN 04-AUG-1990 DEFINITION A.deliciosa actinidin gene, 5' end. ACCESSION M35795 KEYWORDS actinidin. SOURCE A.deliciosa (var. deliciosa, strain cv.Hayward) leaf DNA, clone pKIWI 37. ORGANISM Actinidia deliciosa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Theales; Actinidiaceae. REFERENCE 1 (bases 1 to 634) AUTHORS Keeling,J., Maxwell,P. and Gardner,R.C. TITLE Nucleotide sequence of the promoter region from kiwifruit actinidin genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Keeling, 05-JUL-1990. Author address: J.Keeling Centre for Gene Technology Dept of Cellular and Molecular Biology University of Auckland Private Bag Auckland NEW ZEALAND email: SYSTEM%CELVAX.AUKUNI.AC.NZ FEATURES from to/span description pept 595 > 634 actinidin mRNA 558 > 634 actinidin mRNA signal 507 514 TATA box BASE COUNT 245 a 120 c 71 g 198 t ORIGIN 1 caaaccgtag ttacaaggtg ggcctgaacc agtttgctga cctgactgat gaggagttcc 61 gataaattag ctttcttttt ttgtcttcac ctattttctt tttcaaattt gatttacatt 121 tgtacattta aaattatttt cgccaagaaa aaccgaataa tataaaaatt tgatcaaaca 181 attaaaacaa ttcattaaag gcgaaaaata taaaaaccaa atcaactaaa aaatatcaac 241 aaaaaaataa ctaatcaaaa ctgggcataa atagaaagga caatttaata attttgtata 301 ataaaaacaa aaataaagta tttaaagttg tggttggata aggatttaaa gaagaaaaaa 361 aattaaatct aaatcattga aatttaattt tatatttttt ttctcttttt tctactgaat 421 ctgcagttcc aacagaacct ttaaaaaaaa ttgtgaaaat cattttttca aatgtcgtaa 481 gacccccccc accccccacg caccctatat aaaggccact ctctccctcc acattcacac 541 acctccaatc ccaatctttt tcttctaaaa ttcaaaaaac gagagagaac aaaaatgggt 601 ttgcccaaat ccttcgtgtc aatgtctctc ctct // LOCUS BOVAPOAI 933 bp ss-mRNA MAM 04-AUG-1990 DEFINITION B.taurus apolipoprotein A-I mRNA, complete cds. ACCESSION M35870 KEYWORDS apolipoprotein A-I. SOURCE B.taurus liver, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 933) AUTHORS O'hUigin,C., Chan,L. and Li,W.-H. TITLE Cloning and sequencing of bovine apolipoprotein A-I cDNA and molecular evolution of apolipoproteins A-I and B-100 JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C. O'hUigin, 27-JUN-1990. FEATURES from to/span description pept 73 870 apolipoprotein A-I precursor sigp 73 127 apolipoprotein A-I signal peptide matp 145 867 apolipoprotein A-I signal 914 919 poly-A signal BASE COUNT 201 a 269 c 313 g 150 t ORIGIN 1 cgccaattac aggcctcgaa agagtcctgt attgttattt ttcgtcacta cctccccggg 61 tcgggattca agatgaaagc cgtggtgctg accttggctg tgctcttcct gacggggagc 121 caggctcggc atttctggca gcaagatgac ccccagtcat cctgggatcg ggtgaaggat 181 tttgccaccg tgtatgtgga agcaatcaag gatagtggca gagactatgt ggcccaattc 241 gaagcctccg ctttgggaaa acagctcaac ctgaaactcc tggacaactg ggacaccctg 301 gccagcacgt tgtccaaagt gcgtgaacag ctgggcccag tgacccagga gttctgggac 361 aacctggaaa aggagaccgc gtcgctgagg caggagatgc acaaggacct ggaggaggtg 421 aagcagaagg tgcagcccta cctggacgag ttccagaaga agtggcacga ggaggtggag 481 atctaccgcc agaaggtggc gccgctgggc gaggagtttc gcgagggcgc gcgccagaag 541 gtgcaggagc tgcaggacaa gctgagccct ctggcccagg agctccgcga ccgcgcgcgc 601 gcccacgtgg agacgctgcg gcagcacgtg gcgccctaca gcgacgacct gcgccagcgg 661 ctgaccgcgc gcctggaggc gctcaaggag ggcggcggca gcctggccga gtaccacgcc 721 aaggccagcg agcagctgaa ggcgctgggc gagaaggcca agccggtgct ggaggacctc 781 cgccagggcc tgctgcccgt gctggagagc ctcaaggtca gcatcctggc cgccatagac 841 gaggcctcca agaagctgaa cgcccagtga gatgcctcgg gctgccccgc tggccatcac 901 ttcggtttct tagaataaac atttccggag tgg // LOCUS DOGINVOLA 864 bp ds-DNA MAM 04-AUG-1990 DEFINITION Dog involucrin gene, complete cds. ACCESSION M34442 KEYWORDS involucrin. SOURCE Dog adult skin keratinocyte DNA. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 864) AUTHORS Tseng,H. and Green,H. TITLE The involucrin genes of pig and dog: Comparison of their segments of repeats with those of prosimians and higher primates JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Tseng and H.Green, 23-MAY-1990. FEATURES from to/span description pept 1 858 involucrin rpt 232 513 repeat segment BASE COUNT 286 a 226 c 249 g 103 t ORIGIN 1 atgtcccagc aacacactct gccagtgacc ctgccccctg ccctcagtca ggggcccctc 61 aagcctgagt ctcctcccat cgatactcaa caagagcagg tgaaacagcc gacttcacta 121 cctgtcctgt gccaaaaaat gccctcttct ctcccaggga aggttccctt ggggcatggg 181 gagaaacata caagtcttgt gaagggggaa cctgagcaac agtgtgagcc acaggagcag 241 gaacagcagc agaaacaaca agagtcacag gagcagaaac tgcacctgga acagtgtctg 301 gaacagcatc aggaacaaca agagtcacag gaccagaaac tgtacccgga gcagtgtctg 361 gaacagcagc aggagcaaca agagtcacag gaccagaaac tgtacccgga gcagtgtctg 421 gaacagcagc aggagcaaca agagtcacag gagaaggaac ttcatctgga acaggaacag 481 cagaaggagg agctacagca gcaggaacag cagcagggaa aggaacagtg tgagaaacat 541 caggaagcaa aaaatctgga gcagcagcta gagcagatag gagcacaaag gaagcagcag 601 caaaaggaac agctggaaca ggaaaagaag ctcgtggacc agcacctgga tcaagagcca 661 gcacagagaa ctgagcaacc ggaaaagaaa gaagagcagg tgctggagca gcaggggcag 721 caggaggggc agctggagca gcctgccttt gtcccagctc aggtccaggt ccgagaaccc 781 ctgaagggag aagtcctgcc cctcatagag cagcagcacc agaagcagga ggttcatgac 841 cctccagaac accagtgagc accc // LOCUS DRODMRA 360 bp ds-DNA INV 04-AUG-1990 DEFINITION D.melanogaster dispersed middle repetitive DNA sequence. ACCESSION M16263 KEYWORDS dispersed middle repetitive DNA sequence. SOURCE D.melanogaster DNA, clone p112-0.4RA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 360) AUTHORS Waring,G.L. and Pollack,J.C. TITLE Cloning and characterization of a dispersed, multicopy, X chromosome sequence in Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987) STANDARD simple staff_review BASE COUNT 138 a 56 c 42 g 124 t ORIGIN X chromosome. 1 ccaaatttaa atgcaaatcg atttggattt taataacaaa ctcaacgagg tatgacattc 61 catatttggt ctattatttt caatgttatg atcaaaatac caaatatttt tttcacaaaa 121 aaccaggaaa acgaaatttg gcaaaaaatt taatatttaa gtagagtttt ttgggtataa 181 cttgactaaa aatggtcaca caactaaaat aattaccatt ttgtgctcct aattaccaat 241 actaatcatc cccatcactt tttggcggac tttgaaaaat taatttttgc ccaaattttc 301 gcattttttg taaggggtaa ctcatcaaaa tttttaaaaa atgccaaaaa aataaagttc // LOCUS DRODMRB 354 bp ds-DNA INV 04-AUG-1990 DEFINITION D.melanogaster dispersed middle repetitive DNA sequence. ACCESSION M16264 KEYWORDS dispersed middle repetitive DNA sequence. SOURCE D.melanogaster DNA, clone pX-12R. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 354) AUTHORS Waring,G.L. and Pollack,J.C. TITLE Cloning and characterization of a dispersed, multicopy, X chromosome sequence in Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987) STANDARD simple staff_review BASE COUNT 130 a 57 c 47 g 120 t ORIGIN X chromosome. 1 ccaaatttga atgcaaatcg atttggattt aaaaaacaaa ctcaacgaag atgacattcc 61 atatttgggc cattattttc aatgttttga tcaaaatacc gattattttc ttctcaaaaa 121 attagggaaa cgagttccgg ccaaaaatgc atatttaaga ttggtttttt gggtatactt 181 ggctaaaatg gtcagacagc aaaaaaatta ccatttattc tcctataaca aatactaacc 241 acccctttca cttttaaacg gattttgtaa aattaatttt tggccaaatt ttcgcatttt 301 ttttaagggg taccatcata aaaatttgca aaaaattgaa aaaatctaga attc // LOCUS DRODMRC 352 bp ds-DNA INV 04-AUG-1990 DEFINITION D.melanogaster dispersed middle repetitive DNA sequence. ACCESSION M16265 KEYWORDS dispersed middle repetitive DNA sequence. SOURCE D.melanogaster DNA, clone pX-30 P-0.4R. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 352) AUTHORS Waring,G.L. and Pollack,J.C. TITLE Cloning and characterization of a dispersed, multicopy, X chromosome sequence in Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2843-2847 (1987) STANDARD simple staff_review BASE COUNT 128 a 64 c 42 g 118 t ORIGIN X chromosome. 1 ccaaacttaa atgcaaatcg attgggattc aaaaaacaaa ctcaacgagg tatgacattc 61 catatttggg ccattatttc catgttttca tcaaaatacc gaatttttat cacaaaaaat 121 ctgggaaact atttttggca aaaactcaat tttcaagttg gctcttttgg ctataactga 181 ctaaaaatgg tcacagagca aaaattagta ccattttata ctccttataa ccaatactaa 241 ccaccccttt cactttttaa ctgatttagt aaaatttaat tttggccaaa ttttcgcatt 301 ttttgtaagg gtaacatcat aaatttgcaa aaaattgaaa aatcctagat tc // LOCUS DROTMLPA 5546 bp ss-mRNA INV 04-AUG-1990 DEFINITION D.melanogaster serrate (Ser) mRNA, complete cds. ACCESSION M35759 KEYWORDS Ser gene; serrate. SOURCE D.melanogaster pupa, cDNA to mRNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 5546) AUTHORS Fleming,R.J., Scottgale,T.N., Diederich,R.J. and Artavanis-Tsakonas,S. TITLE The gene serrate encodes a putative EGF-like transmembrane protein essential for proper ectodermal development in Drosophila melanogaster JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.J.Fleming, 28-JUN-1990. Author address: R.J.Fleming Yale University Dept of Biology KBT 1104 New Haven, CT 06511 email: WHAKRIA@YALEVM FEATURES from to/span description pept 443 4657 serrate protein precursor sigp 443 679 serrate protein signal peptide matp 680 4657 serrate protein mRNA < 1 5546 BASE COUNT 1384 a 1446 c 1504 g 1212 t ORIGIN Chromosome 3 97F. 1 ccgagtcgag cgccgtgctt cgagcggtga tgagcccctt ttctgtcaac gctaaagatc 61 tacaaaacat cagcgcctat caagtggaag tgtcaagtgt gaacaaaaca aaaacgagag 121 aagcacatac taaggtccat ataaataata aataataatt gtgtgtgata acaacattat 181 ccaaacaaaa ccaaacaaaa cgaaggcaaa gtggagaaaa tgatacagca tccagagtac 241 ggccagttat tcagctatcc agagcaagtg tagtgtggca aaatagaaac aaacaaaggc 301 accaaaatct gcatacatgg gctaattaag gctgcccagc gaatttacat ttgtgtggtg 361 ccaatccaga gtgaatccga aacaaactcc atctagatcg ccaaccagca tcacgctcgc 421 aaacgccccc agaatgtaca aaatgtttag gaaacatttt cggcgaaaac cagctacgtc 481 gtcgtcgttg gagtcaacaa tagaatcagc agacagcctg ggaatgtcca agaagacggc 541 gacaaaaagg cagcgtccga ggcatcgggt acccaaaatc gcgaccctgc catcgacgat 601 ccgcgattgt cgatcattaa agtctgcctg caacttaatt gctttaattt taatactgtt 661 agtccataag atatccgcag ctggtaactt cgagctggaa atattagaaa tctcaaatac 721 caacagccat ctactcaacg gctattgctg cggcatgcca gcggaactta gggccaccaa 781 gacgataggc tgctcgccat gcacgacggc attccggctg tgcctgaagg agtaccagac 841 cacggagcag ggtgccagca tatccacggg ctgttcgttt ggcaacgcca ccaccaagat 901 actgggtggc tccagctttg tgctcagcga tccgggtgtg ggagccattg tgctgccctt 961 tacgtttcgt tggacgaagt cgtttacgct gatactgcag gcgttggata tgtacaacac 1021 atcctatcca gatgcggaga ggttaattga ggaaacatca tactcgggcg tgatactgcc 1081 gtcgccggag tggaagacgc tggaccacat cgggcggaac gcgcggatca cctaccgtgt 1141 ccgggtgcaa tgcgccgtta cctactacaa cacgacctgc acgaccttct gccgtccgcg 1201 ggacgatcag ttcggtcact acgcctgcgg ctccgagggt cagaagctct gcctgaatgg 1261 ctggcagggc gtcaactgcg aggaggccat atgcaaggcg ggctgcgacc ccgtccacgg 1321 caagtgcgat cgtccggggg aatgcgaatg cagacccggc tggcgtggtc cattgtgcaa 1381 cgagtgcatg gtctatcccg gctgcaagca tggttcctgc aacggcagcg cctggaaatg 1441 cgtgtgcgac accaactggg gtggcatatt gtgcgatcaa gatttaaatt tctgcggcac 1501 ccatgaaccc tgcaagcacg gcggcacctg cgaaaatacc gctccggaca aatatcggtg 1561 cacatgcgcc gagggcctct cgggcgagca gtgcgagatc gtggagcacc catgtgccac 1621 caggccatgc cgcaacggcg gcacatgcac actcaagacg agtaaccgaa ctcaagccca 1681 agtgtatcgc acatcacatg gcaggagcaa catgggccgg ccggtaagac gcagcagttc 1741 gatgcgcagc ctggatcacc tgcggccgga ggggcaggcg ctgaatggca gcagctcctc 1801 gggattggtg tccctaggtt cgctgcagct gcagcagcaa ctggcccccg acttcacttg 1861 cgactgcgca gccggatgga cgggaccgac atgcgaaata aatatcgacg agtgcgccgg 1921 gggtccctgc gagcatggtg gcacttgcat cgatctaatc ggtggctttc gatgtgaatg 1981 tccgccggag tggcatggcg atgtctgtca ggtggatgtg aacgagtgcg aggcgccgca 2041 ttccgccgga atcgctgcga acgcattgct gaccaccaca gccaccgcga ttattggtag 2101 taatctgagc agtactgctc ttctggccgc tctgaccagt gcagtggcat ccacatcctt 2161 ggccatcgga ccctgcatca atgccaagga gtgtcgcaat cagccgggtt cctttgcctg 2221 catctgcaag gagggctggg gcggagtgac ctgtgccgag aatctagatg actgtgtggg 2281 tcagtgccgg aatggagcca cctgcattga tctggtcaac gactataggt gcgcctgtgc 2341 ctctggattc acgggtcgcg attgcgagac ggacatagac gagtgcgcca cttccccgtg 2401 ccgaaacgga ggcgaatgtg tggacatggt gggcaaattc aattgcattt gcccacttgg 2461 ctactcgggt tctctgtgcg aggaggccaa ggagaactgc acaccgtcgc catgtttgga 2521 gggtcactgc ctcaacacgc ccgaaggata ctactgccat tgtccaccgg atcgcgccgg 2581 aaagcactgc gagcaactgc gtccgctctg ctcccagccg ccctgcaacg agggctgctt 2641 cgccaatgtc agcctagcga cgtcagcgac aacgacgacg acaaccacca caacggcgac 2701 aacgacaagg aagatggcca agccaagcgg attgccctgc agcggacacg gcagctgcga 2761 gatgagcgac gtgggcacct tctgcaaatg ccatgtgggc cacaccggca ccttctgcga 2821 gcacaatctc aacgaatgct cgccgaatcc ttgtcgaaat gggggaattt gccttgacgg 2881 cgacggcgat tttacatgcg agtgcatgtc gggctggaca ggtaaacgct gctcggagcg 2941 cgctacaggt tgttatgccg gtcagtgcca gaatggtggt acctgcatgc ctggagcccc 3001 ggacaaggct ctgcagccgc attgccgctg tgcgccaggt tggactggtc tgttttgcgc 3061 cgaggctatt gaccagtgtc gcgggcagcc gtgccacaat ggcggaacgt gcgagtcggg 3121 agcgggctgg ttccgctgcg tctgcgctca gggattctct ggtccagact gccgcatcaa 3181 tgtgaacgag tgctcgccac agccttgcca gggcggtgcc acctgcatcg acggaatcgg 3241 tggatacagc tgcatctgcc caccaggaag gcatggattg cggtgtgaaa ttttgctctc 3301 cgatcccaag tccgcctgcc agaacgcaag caacactatc tctccgtata cagctctaaa 3361 ccgaagccaa aactggctgg atattgctct aaccggaaga acagaagacg atgagaactg 3421 caatgcgtgt gtctgcgaaa acggcacctc tcggtgcacg aatctctggt gtggattgcc 3481 caattgctat aaggtggatc cgctctccaa gtcctcgaat ctgtccggtg tttgcaaaca 3541 gcacgaggtg tgcgttccgg cactgagtga gacatgcctg tcatcgcctt gtaatgttcg 3601 tggagattgc cgggcactgg aaccatcgcg tcgggttgct ccaccccgac tgccagccaa 3661 atctagctgc tggcccaatc aggccgtggt caacgagaac tgcgcccgac tcaccatcct 3721 tttggccctg gagcgagtgg gcaagggagc ttcggtggag ggtctctgct ccctggtaag 3781 ggtgctgctg gctgcccagt tgatcaagaa gccggcgagt acttttggcc aggatccggg 3841 aatgcttatg gtgctctgcg atctcaaaac gggcaccaat gataccgttg aactaactgt 3901 gtcgtccagt aaattaaatg atccccagct gccagtggcg gtgggtctgc tgggtgaact 3961 cctgagctcc aggcagttga atggcatcca gcggcgcaag gaactggagc tgcagcatgc 4021 aaaattggct gccctcacct ccattgtgga ggtcaagttg gaaacggccc gcgtggccga 4081 tggatcgggt catagtctgc tgataggagt gctatgcggt gtctttatag tcctggtggg 4141 attctcggtg ttcatcagtc tttactggaa acagcgtctg gcttatcgca ccagttcggg 4201 aatgaactta actccctccc tggatgcact gcgtcacgag gaggagaagt cgaataatct 4261 gcagaacgag gagaatctgc gaaggtatac aaatccgctg aagggcagca ccagttccct 4321 aagagcggcc accggcatgg aactaagcct caatcccgct ccggaattag ccgcctcggc 4381 ggcgagtagt tccgccttgc acagatcgca gccactattc ccgccatgcg atttcgagcg 4441 tgagctggac tccagtacgg gcctgaagca ggcgcacaag cggagctcac agattctgct 4501 gcacaaaacc caaaactcgg acatgcggaa gaacactgtg ggctcgctgg acagtccgcg 4561 taaggacttt ggcaagcggt cgatcaactg caagtccatg ccaccctctt cgggcgacga 4621 gggctccgat gtccttgcca ccactgtgat ggtttagccg tgatctcacc aaccaaccaa 4681 tcaagaaacc aaccagccgc ccacagccag ctcaaagttc caattgccac agcacgggcg 4741 ctatttccaa gtgcattagt agcgtaatta aaactaggat attgttaagg ataccaaggt 4801 aggccacaac ggagtggctc tgttgaaaac gtaaagttct aaaaatccag gtctctcaga 4861 caaagatgag gtacacaaat aaattggcta gttaatcaag catgttatgg ccacgggatg 4921 ggcaaattta tttgtatacc tgatcttatc ttaatactaa accagttttc tactattttt 4981 tttttgtgga tcaagcttaa aagttcagct aggcaggcgt tttccgcagt gccatgtcga 5041 tgtggaagcc caaaatattt aggttagata gtgtaatttc gaactcttct cttcgctaag 5101 caacatccta cacagtgtga tatttagtgt aacccaggcg cgcatttaca ttcaattaaa 5161 gacaatgata tataaatata aacgaaatca actccttggc tagcacaagc tgtatgtata 5221 tagttctcat ttaggatcgt cgcgctctat attgtgtata agctgtaaat actgtaaatt 5281 agcagttacc gttattgtat tttgtctata gttagattgg tactattaaa ctaagaacca 5341 gccgcaacgc gttagacttt aaaagttgtt tgcaattgta cgcaataata tagttttatg 5401 ctcgtagtta ggtagctgtg taaccgggta agattcaaac gattttgtac tgtattatat 5461 acctatctgt gtagtaatat ttatttatta tattaaattt gatctagacg caataaagta 5521 atatcaataa agatagtaaa agacat // LOCUS ECOLRRPA 495 bp ds-DNA BCT 04-AUG-1990 DEFINITION E.coli leucine-responsive-regulatory protein (Lrp) gene, complete cds. ACCESSION M35869 KEYWORDS Lrp gene; leucine-responsive-regulatory protein. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 495) AUTHORS Willins,D.A., Platko,J.V., Ryan,C. and Calvo,J.M. TITLE Lrp, a small, basic global regulatory protein of E.coli JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.M.Calvo, 27-JUN-1990. Author address: J.M.Calvo Cornell University Dept of Biochemistry Room 451 Biotechnology Bldg. Ithaca, NY 14853 email:DEKY@CRNLVAX5 FEATURES from to/span description pept 1 495 leucine-responsive-regulatory protein (Lrp) BASE COUNT 127 a 110 c 128 g 130 t ORIGIN 20 min. on E.coli map. 1 atggtagata gcaagaagcg ccctggcaaa gatctcgacc gtatcgatcg taacattctt 61 aatgagttgc aaaaggatgg gcgtatttct aacgtcgagc tttctaaacg tgtgggactt 121 tccccaacgc cgtgccttga gcgtgtgcgt cggctggaaa gacaagggtt tattcagggc 181 tatacggcgc tgcttaaccc ccattatctg gatgcatcac ttctggtatt cgttgagatt 241 actctgaatc gtggcgcacc ggatgtgttt gaacaattca ataccgctgt acaaaaactt 301 gaagaaattc aggagtgtca tttagtatcc ggtgatttcg actacctgtt gaaaacacgc 361 gtgccggata tgtcagccta ccgtaagttg ctgggggaaa ccctgctgcg tctgcctggc 421 gtcaatgaca cacggacata cgtcgttatg gaagaagtca agcagagtaa tcgtctggtt 481 attaagacgc gctaa // LOCUS HUMA1AT 143 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human alpha-1-antitrypsin (alpha-1-AT) mRNA, 3' end. ACCESSION M26123 KEYWORDS alpha-1-antitrypsin. SOURCE Human adult liver, cDNA to mRNA, clone PJN1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 143) AUTHORS Coutelle,C., Speer,A., Rogers,J., Kalsheker,N., Humphries,S. and Williamson,R. TITLE Construction and partial characterization of a human liver cDNA library JOURNAL Biomed. Biochim. Acta 44, 421-431 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 101 alpha-1-antitrypsin (alpha-1-AT; AA at 3) /hgml_locus_uid="LX0081X" /nomgen="PI" /map="14q32.1" BASE COUNT 37 a 53 c 19 g 34 t ORIGIN 1 ccgaggtcaa gttcaacaaa ccctttgtct tcttaatgat tgaccaaaat accaagtctc 61 ccctcttcat gggaaaagtg gtgaatccca cccaaaaata actgcctctc gctcctcaac 121 ccctcccctc catccctggc ccc // LOCUS HUMGAPDHG 5378 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene, complete cds. ACCESSION J04038 KEYWORDS glyceraldehyde-3-phosphate dehydrogenase. SOURCE Human DNA. REFERENCE 1 (bases 1 to 5378) AUTHORS Ercolani,L., Florence,B., Denaro,M. and Alexander,M. TITLE Isolation and complete sequence of a functional human glyceraldehyde-3-phosphate dehydrogenase gene JOURNAL J. Biol. Chem. 263, 15335-15341 (1988) STANDARD simple staff_entry REFERENCE 2 (bases 650 to 1157) AUTHORS Nasrin,N., Ercolani,L., Denaro,M., Kong,X.F., Kang,I. and Alexander,M. TITLE An insulin response element in the glyceraldehyde-3-phosphate dehydrogenase gene binds a nuclear protein induced by insulin in cultured cells and by nutritional manipulations in vivo JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5273-5277 (1990) STANDARD simple staff_entry COMMENT FEATURES from to/span description pept 1453 1481 glyceraldehyde-3-phosphate dehydrogenase, exon 2 (first expressed exon) (EC 1.2.2.12) 3116 3215 glyceraldehyde-3-phosphate dehydrogenase, exon 3 3306 3412 glyceraldehyde-3-phosphate dehydrogenase, exon 4 3542 3632 glyceraldehyde-3-phosphate dehydrogenase, exon 5 3723 3838 glyceraldehyde-3-phosphate dehydrogenase, exon 6 3931 4012 glyceraldehyde-3-phosphate dehydrogenase, exon 7 4206 4618 glyceraldehyde-3-phosphate dehydrogenase, exon 8 4723 4792 glyceraldehyde-3-phosphate dehydrogenase, exon 9 pre-msg 1138 4993 GAPDH mRNA and introns IVS 1190 1429 GAPDH intron A IVS 1482 3115 GAPDH intron B IVS 3216 3305 GAPDH intron C IVS 3413 3541 GAPDH intron D IVS 3633 3722 GAPDH intron E IVS 3839 3930 GAPDH intron F IVS 4013 4205 GAPDH intron G IVS 4619 4722 GAPDH intron H site 650 729 insulin response element A [2] site 729 867 insulin response element B [2] site 1049 1071 insulin response element A [2] signal 884 892 CAAT box signal 1108 1113 TATA box site 1447 1456 Kozak consensus sequence revision 877 879 ccc in [2]; cc in [1] BASE COUNT 994 a 1642 c 1627 g 1115 t ORIGIN 1 ggatcccctg ctgggagggg gcaggggacc tgttcccacc gtgtgcccaa gacctctttt 61 cccacttttt ccctcttctt gactcaccct gccctcaata tcccccggcg cagcagtgaa 121 agggagtccc tggctcctgg ctcgcctgca cgtcccaggg cggggaggga cttccgccct 181 cacgtcccgc tcttcgcccc aggctggatg gaatgaaagg cacactgtct ctctccctag 241 gcagcacagc ccacaggttt caggagtgcc tttgtgggag gcctctgggc ccccaccagc 301 catcctgtcc tccgcctggg gccccagccc ggagagagcc gctggtgcac acagggccgg 361 gattgtctgc cctaattatc aggtccaggc tacagggctg caggacatcg tgaccttccg 421 tgcagaaacc tccccctccc cctcaagccg cctcccgagc ctccttcctc tccaggcccc 481 cagtgcccag tgcccagtgc ccagcccagg cctcggtccc agagatgcca ggagccagga 541 gatggggagg gggaagtggg ggctgggaag gaaccacggg cccccgcccg agcccatggg 601 cccctcctag gcctttgcct gagcagaccg gtgtcactac cgcagagcct cgaggagaag 661 ttccccaact ttcccgcctc tcagcctttg aaagaaagaa aggggagggg gcaggccgcg 721 tgcagccgcg agcggtgctg ggctccggct ccaattcccc atctcagtcg ttcccaaagt 781 cctcctgttt catccaagcg tgtaagggtc cccgtccttg actccctagt gtcctgctgc 841 ccacagtcca gtcctgggaa ccagcaccga tcacctccca tcgggccaat ctcagtccct 901 tccccctacg tcggggccca cacgctcggt gcgtgcccag ttgaaccagg cggctgcgga 961 aaaaaaaaag cggggagaaa gtagggcccg gctactagcg gttttacggg cgcacgtagc 1021 tcaggcctca agaccttggg ctgggactgg ctgagcctgg cgggaggcgg ggtccgagtc 1081 accgcctgcc gccgcgcccc cggtttctat aaattgagcc cgcagcctcc cgcttcgctc 1141 tctgctcctc ctgttcgaca gtcagccgca tcttcttttg cgtcgccagg tgaagacggg 1201 cggagagaaa cccgggaggc tagggacggc ctgaaggcgg caggggcggg cgcaggccgg 1261 atgtgttcgc gccgctgcgg ggtgggcccg ggcggcctcc gcattgcagg ggcgggcgga 1321 ggacgtgatg cggcgcgggc tgggcatgga ggcctggtgg gggaggggag gggaggcgtg 1381 tgtgtcggcc ggggccacta ggcgctcact gttctctccc tccgcgcagc cgagccacat 1441 cgctcagaca ccatggggaa ggtgaaggtc ggagtcaacg ggtgagttcg cgggtggctg 1501 gggggccctg ggctgcgacc gcccccgaac cgcgtctacg agccttgcgg gctccgggtc 1561 tttgcagtcg tatgggggca gggtagctgt tccccgcaag gagagctcaa ggtcagcgct 1621 cggacctggc ggagccccgc acccaggctg tggcgccctg tgcagctccg cccttgcggc 1681 gccatctgcc cggagcctcc ttcccctagt ccccagaaac aggaggtccc tactcccgcc 1741 cgagatcccg acccggaccc ctaggtgggg gacgctttct ttcctttcgc gctctgcggg 1801 gtcacgtgtc gcagaggagc ccctccccca cggcctccgg caccgcaggc cccgggatgc 1861 tagtgcgcag cgggtgcatc cctgtccgga tgctgcgcct gcggtagagc ggccgccatg 1921 ttgcaaccgg gaaggaaatg aatgggcagc cgttaggaaa gcctgccggt gactaaccct 1981 gcgctcctgc ctcgatgggt ggagtcgcgt gtggcgggga agtcaggtgg agcgaggcta 2041 gctggcccga tttctcctcc gggtgatgct tttcctagat tattctctgg taaatcaaag 2101 aagtgggttt atggaggtcc tcttgtgtcc cctccccgca gaggtgtggt ggctgtggca 2161 tggtgccaag ccgggagaag ctgagtcatg ggtagttgga aaaggacatt tccaccgcaa 2221 aatggcccct ctggtggtgg ccccttcctg cagcggctca cctcacggcc ccgcccttcc 2281 cctgccagcc tagcgttgac ccgaccccaa aggccaggct gtaaatgtca ccgggaggat 2341 tgggtgtctg ggcgcctcgg ggaacctgcc cttctcccca ttccgtcttc cggaaaccag 2401 atctccaccg caccctggtc tgaggtctga ggttaaatat agctgctgac ctttctgtag 2461 ctgggggcct gggctggggc tctctcccat cccttctccc cacacacatg cacttacctg 2521 tgctcccact cctgatttct ggaaaagagc taggaaggac aggcaacttg gcaaatcaaa 2581 gccctgggac tagggggtta aaatacagct tcccctcttc ccacccgccc cagtctctgt 2641 cccttttgta ggagggactt agagaagggg tgggcttgcc ctgtccagtt aatttctgac 2701 ctttactcct gccctttgag tttgatgatg ctgagtgtac aagcgttttc tccctaaagg 2761 gtgcagctga gctaggcagc agcaagcatt cctggggtgg catagtgggg tggtgaatac 2821 catgtacaaa gcttgtgccc agactgtggg tggcagtgcc cacatggccg cttctcctgg 2881 aagggcttcg tatgactggg ggtgttgggc agccctggag ccttcagttg cagccatgcc 2941 ttaagccagg ccagcctggc agggaagctc aagggagata aaattcaacc tcttgggccc 3001 tcctgggggt aaggagatgc tgcattcgcc ctcttaatgg ggaggtggcc tagggctgct 3061 cacatattct ggaggagcct cccctcctca tgccttcttg cctcttgtct cttagatttg 3121 gtcgtattgg gcgcctggtc accagggctg cttttaactc tggtaaagtg gatattgttg 3181 ccatcaatga ccccttcatt gacctcaact acatggtgag tgctacatgg tgagccccaa 3241 agctggtgtg ggaggagcca cctggctgat gggcagcccc ttcataccct cacgtattcc 3301 cccaggttta catgttccaa tatgattcca cccatggcaa attccatggc accgtcaagg 3361 ctgagaacgg gaagcttgtc atcaatggaa atcccatcac catcttccag gagtgagtgg 3421 aagacagaat ggaagaaatg tgctttgggg aggcaactag gatggtgtgg ctcccttggg 3481 tatatggtaa ccttgtgtcc ctcaatatgg tcctgtcccc atctcccccc caccccggta 3541 ggcgagatcc ctccaaaatc aagtggggcg atgctggcgc tgagtacgtc gtggagtcca 3601 ctggcgtctt caccaccatg gagaaggctg gggtgagtgc aggagggccc gcgggagggg 3661 aagctgactc agccctgcaa aggcaggacc cgggttcata actgtctgct tctctgctgt 3721 aggctcattt gcagggggga gccaaaaggg tcatcatctc tgccccctct gctgatgccc 3781 ccatgttcgt catgggtgtg aaccatgaga agtatgacaa cagcctcaag atcatcaggt 3841 gaggaaggca gggcccgtgg agaagcggcc agcctggcac cctatggaca cgctcccctg 3901 acttgcgccc cgctccctct ttctttgcag caatgcctcc tgcaccacca actgcttagc 3961 acccctggcc aaggtcatcc atgacaactt tggtatcgtg gaaggactca tggtatgaga 4021 gctggggaat gggactgagg ctcccacctt tctcatccaa gactggctcc tccctgctgg 4081 ggctgcgtgc aaccctgggg ttgggggttc tggggactgg ctttcccata atttcctttc 4141 aaggtgggga gggaggtaga ggggtgatgt ggggagtacg ctgcagggcc tcactccttt 4201 tgcagaccac agtccatgcc atcactgcca cccagaagac tgtggatggc ccctccggga 4261 aactgtggcg tgatggccgc ggggctctcc agaacatcat ccctgcctct actggcgctg 4321 ccaaggctgt gggcaaggtc atccctgagc tgaacgggaa gctcactggc atggccttcc 4381 gtgtccccac tgccaacgtg tcagtggtgg acctgacctg ccgtctagaa aaacctgcca 4441 aatatgatga catcaagaag gtggtgaagc aggcgtcgga gggccccctc aagggcatcc 4501 tgggctacac tgagcaccag gtggtctcct ctgacttcaa cagcgacacc cactcctcca 4561 cctttgacgc tggggctggc attgccctca acgaccactt tgtcaagctc atttcctggt 4621 atgtggctgg ggccagagac tggctcttaa aaagtgcagg gtctggcgcc ctctggtggc 4681 tggctcagaa aaagggccct gacaactctt ttcatcttct aggtatgaca acgaatttgg 4741 ctacagcaac agggtggtgg acctcatggc ccacatggcc tccaaggagt aagacccctg 4801 gaccaccagc cccagcaaga gcacaagagg aagagagaga ccctcactgc tggggagtcc 4861 ctgccacact cagtccccca ccacactgaa tctcccctcc tcacagttgc catgtagacc 4921 ccttgaagag gggaggggcc tagggagccg caccttgtca tgtaccatca ataaagtacc 4981 ctgtgctcaa ccagttactt gtcctgtctt attctagggt ctggggcaga ggggagggaa 5041 gctgggcttg tgtcaaggtg agacattctt gctggggagg gacctggtat gttctcctca 5101 gactgagggt agggcctcca aacagccttg cttgcttcga gaaccatttg cttcccgctc 5161 agacgtcttg agtgctacag gaagctggca ccactacttc agagaacaag gccttttcct 5221 ctcctcgctc cagtcctagg ctatctgctg ttggccaaac atggaagaag ctattctgtg 5281 ggcagcccca gggaggctga caggtggagg aagtcagggc tcgcactggg ctctgacgct 5341 gactggttag tggagctcag cctggagctg agctgcag // LOCUS HUMVARNA 156 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human integrated adenovirus 2 VA RNA. ACCESSION M35961 KEYWORDS . SOURCE Human adenovirus 2 infected KB cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 156) AUTHORS Ohe,K. and Weissman,S.M. TITLE Nucleotide sequence of an RNA from cells infected with adenovirus 2 JOURNAL Science 167, 879-881 (1969) STANDARD simple staff_review BASE COUNT 26 a 46 c 53 g 31 t ORIGIN 1 gggcactctt ccgtggtctg gtggataaat tcgcaagggt atcatggcgt ggacgaccgg 61 ggttcgaacc ccggatccgt gatccatgcg gttaccgtcc gccgcccgtg cgtcgaaccc 121 aggtgtgcga cgtcagacaa cgggggagcg ctcctt // LOCUS MUSGFAPA 2511 bp ss-mRNA ROD 04-AUG-1990 DEFINITION Mouse glial fibrillary acidic protein mRNA, 3' end. ACCESSION M25937 KEYWORDS glial fibrillary acidic protein. SOURCE Mouse brain, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2511) AUTHORS Cowan,N.J., Lewis,S.A., Balcarek,J.M., Krek,V. and Shelanski,M. TITLE Structural implications of a cDNA clone encoding mouse glial fibrillary acidic protein JOURNAL Ann. N.Y. Acad. Sci. 455, 575-582 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 1212 glial fibrillary acidic protein (AA at 1) BASE COUNT 625 a 690 c 697 g 499 t ORIGIN 1 ctgggtacca tgccacgttt ctccttgtct cgaatgactc ctccactccc tgccagggtg 61 gacttctccc tggccggggc gctcaatgct ggcttcaagg agacacgggc gagcgagcgt 121 gcagagatga tggagctcaa tgaccgcttt gctagctaca tcgagaaggt ccgcttcctg 181 gaacagcaaa acaaggcgct ggcagctgaa ctgaaccagc ttcgagccaa ggagcccacc 241 aaactggctg atgtctacca ggcggagctt cgggagctgc ggctgcggct ggaccagctt 301 acggccaaca gtgcccggct ggaggtggag agggacaact ttgcacagga cctcggcacc 361 ctgaggcaga agctccaaga tgaaaccaac ctgaggctgg aggcagagaa caacctggct 421 gcgtatagac aggaggcaca tgaagccacc ctggctcgtg tggatttgga gagaaaggtt 481 gaatcgctgg aggaggagat ccagttctta aggaagatct atgaggagga agttcgagat 541 ctccgggagc agctggccca acagcaggtc cacgtggaga tggatgtggc caagccagac 601 ctcacagcgg ccctgagaga gattcgcact caatacgagg cagtggccac cagtaacatg 661 caagagacag aggagtggta tcggtctaag tttgcagacc tcacagacgc tgcgtcccgc 721 aacgcagagc tcctccgcca agccaaacac gaagctaacg actatcgccg ccaactgcag 781 gccttgacct gcgatctgga gtccctccgc ggcacgaacg agtccctaga gcggcaaatg 841 cgcgaacagg aagagcgcca tgcgcgggag tcggccagtt accaggaggc acttgctcgg 901 ctggaggagg agggccaaag cctcaaggag gagatggccc gccacctgca ggagtaccag 961 gatctactca acgttaagct agccctggac atcgagatcg ccacctacag gaaattgctg 1021 gagggcgaag aaaaccgcat caccattcct gtacagactt tctccaacct ccagatccga 1081 gaaaccagcc tggacaccaa atccgtgtca gaaggccacc tcaagaggaa catcgtggta 1141 aagactgtgg agatgcggga tggtgaggtc attaaggact cgaagcagga gcacaaggac 1201 gtggtgatgt gaggtgtgcc acctggtggc ccttgccatg cagtgtgagg gcccaaagct 1261 tagcctcaaa taggcctgtt tgccaggctc agttcccacc cacaccagca cttcccttcc 1321 ttccctggtt ttctgcctgt gtgctgccca aggcgtcaat caggtcataa gcttcataga 1381 tggcatagat ggcatatacc cttcaccttc aactaacagg atactcaccc caaaggcgca 1441 gtcaggaggg gagggaaccc cagctggtta gaattggaag ggaagaggaa agatgagcag 1501 agtagagaga tttaacaaat cacttccttc atccttgttg ttatggaaac cgttgccaga 1561 gctggaagtt tccacaggct gctggagcta gacaacaatt cagacagaaa gggaaagtcc 1621 ctgaggcaaa gtctctctag ccagagacct atgcatcccg aatgccacta aggcagtcct 1681 gaagggccct ccaggtgatg actccagtgt gtcagcccca ctgagcagct atgcaggttg 1741 actgcccaca ggcatgtgaa acttggttct cagcacttgg cagggatcta tggcataagt 1801 ggagagggaa ggtgtactgg acggcggaga ggagggctcc ctggccccta agtgtggatg 1861 cagagaggtg gagcccagga agggtctctg cttaggctgc agggtaccga atggcagagg 1921 cactggtaga gatcatttgg acactcggag ttgaaagtta caggcaatct gttacacttg 1981 gctctgaatc ctatcaatca aggaaataac ccgttctctg gaagacactg aaacaggaga 2041 gcaggacttc cgtccactgg gcagggtaca gatgtgtctc agttgtgaag gtctattcct 2101 ggctgcacag tccccatccg ctcaggtcat cttacccctg tgactgctct cagccctgga 2161 agaatccaca accatccttc caaggttgtc catccccaca atgactagct gttgctctcc 2221 aagctaaggg accattccct gctctatgca tatacgtaat gtcacctatt tagtatcatc 2281 ctatttgaga gtttgaggaa ctgaaacact gttctgttca agcacctggt gctatgcctt 2341 catattagag caccttctct gaggctgatt ggtgggcagg tagggaagac attgagcaga 2401 cagtgtccgc tcagttgtcc ttccctccct tccaaggtcc ctccctcttt ccaggacatc 2461 gcccccccac cccacccctc ctttccacct ccgctaacct ccagacgcac t // LOCUS MUSHOX61A 318 bp ss-mRNA ROD 04-AUG-1990 DEFINITION Mouse homeobox-6.1 protein (Hox-6.1) mRNA, 3' end. ACCESSION M35986 KEYWORDS Hox-6.1 gene; homeobox-6.1 protein. SOURCE Mouse male kidney, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Sharpe,P.T., Miller,J.R., Evans,E.P., Burtenshaw,M.D. and Gaunt,S.J. TITLE Isolation and expression of a new mouse homeobox gene JOURNAL Development 102, 397-407 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 318 homeobox-6.1 protein (AA at 1) site 30 210 homeobox BASE COUNT 94 a 80 c 100 g 44 t ORIGIN Chromosome 14E2. 1 aatcacagtg gggtcggtta cggagcggac cggaggcgcg gccgccagat ctactctcgg 61 taccagaccc tggaactgga gaaggaattt cacttcaacc gctacctaac tcggcgccgg 121 cgcatcgaga tcgccaatgc tctgtgcctg accgagcgac agatcaaaat ctggttccag 181 aacgcgcgca tgaagtggaa aaaagaatct aatctcacgt ccacactctc agggggtggc 241 ggaggggcaa ccgccgacag cctgggagga aaggaggaaa agcgagaaga gacagaagaa 301 gagaagcaga aagagtga // LOCUS PEAVICIL 190 bp ds-DNA PLN 04-AUG-1990 DEFINITION P.sativum vicilin mRNA, partial cds. ACCESSION M35978 KEYWORDS vicilin. SOURCE P.sativum, cDNA to mRNA, clone pRC 2.2.1. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 190) AUTHORS Gatehouse,J.A., Lycett,G.W., Croy,R.R.D. and Boulter,D. TITLE The post-translational proteolysis of the subunits of vicilin from pea (Pisum sativum L.) JOURNAL Biochem. J. 207, 629-632 (1982) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 190 vicilin (AA at 2) BASE COUNT 74 a 30 c 46 g 40 t ORIGIN 1 agagatagag aagattctct tagaagagca tgagaaagag acacatcaca gaagaagcct 61 tagggataag agacaacaga gccaagaaaa gaatgtaata gtcaaagtat caaagcaagt 121 gcaaagttac aaagctaagt tgactccagg tgatgttttt gtaattccgg caggtcatcc 181 tgttgccgta // LOCUS PIGINVOLA 1204 bp ds-DNA MAM 04-AUG-1990 DEFINITION Pig involucrin gene, complete cds. ACCESSION M34441 KEYWORDS involucrin. SOURCE Pig (Yorkshire) adult skin keratinocyte DNA. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1204) AUTHORS Tseng,H. and Green,H. TITLE The involucrin genes of pig and dog: Comparison of their segments of repeats with those of prosimians and higher primates JOURNAL Mol. Biol. Evol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Tseng and H. Green, 23-MAY-1990. FEATURES from to/span description pept 1 1044 involucrin rpt 247 831 repeat segment BASE COUNT 377 a 311 c 363 g 153 t ORIGIN 1 atgtcccagc aatacactct tccagtgacc ctgccccctg ccctcagtca ggagcccctc 61 aagcctgttt ctcctcccgc cgatacccag caggagcaag tgaaacaggc aactccgctg 121 cctgctccat gccagaagat gctctcggag ctcccagtgg aggtcccctt ggaggatgcg 181 gagaaacaca caactcttgt gaagggagtg cctgagcaag agtgtgagcc gcagccacag 241 gagccacagc agcaggaact gcatgtggaa cagcagcagc agcagcaaga atcacaggtg 301 caggaactgc atgtggatca gcagcagcag caacaagagt cacaggagca ggaactgcat 361 gtggatcagc agcagcagca gcaagagtca caggagcagg aactgcatgt ggatcagcag 421 cagcaacaag agtcacaggt gcaggaactg catgtgggtc atcatcagca gcagcaagag 481 tcacaggagc aggaactgca tgtggatcat catcagcagc agcaagagtc acaggagcag 541 gaactgcatg tggatcagca gcagcagcaa caagagtcac aggagcagga actgcatgtg 601 gatcagcagc agcagcagca agagtcacag gagcaggaac tgcatgtgga tcatcatcag 661 cagcagcaag agtcacaggt gcaggaactg catgtggatc atcagcagca gcagcaagag 721 tcacaggagc aggaactgca tgtggatcag caccagcagc agcaagagtc acaggagcag 781 gaactgcatg tggatcagca gcagcaggag ctccaggtgc aggaagtgca gcagcagcag 841 cagcagcagc aagaacagca ggaggaccat cagaaagcag aacacctgga gcaggaggaa 901 gcacaaaggg agcagcagct aaagggacag ctggaacagg agaagaaggg cgtgtaccag 961 cacctggatc aagagctaac aaagagagat gaacacctgg aaaagaaagg ggagcactgc 1021 tggagcagca ggagaagcct ctagagccag cagagcagca ggaggggcag ttgaaacagc 1081 ctgtgcttat tccagctcct ggccaggtcc aggagaccca ccccagtcca actgctgaag 1141 ggagaagttt tgccccctga gaagcaggac ataagtaacc acctctggac ataagtaacc 1201 accc // LOCUS FLANPB 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/Wilson-Smith/33 (H1N1) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30746 KEYWORDS nucleoprotein. SOURCE Influenza A /Wilson-Smith/33 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 507 a 318 c 409 g 331 t ORIGIN 1 agcaaaagca gggtagataa tcactcacag agtgacatcg aaatcatggc gaccaaaggc 61 accaaacgat cttacgaaca gatggagact gatggagaac gccagaatgc cactgaaatc 121 agagcatctg tcggaaaaat gattggtgga attggacgat tctacatcca aatgtgcacc 181 gaacttaaac tcagtgatta tgagggacgg ctgattcaga acagcttaac aatagagaga 241 atggtgctct ctgcttttga cgagaggagg aataaatatc tagaagaaca tcccagtgcg 301 gggaaagatc ctaagaaaac tggaggacct atatacagga gagtagatgg aaagtggatg 361 agagaactca tcctttatga caaagaagaa ataagacgaa tctggcgcca agctaataat 421 ggtgacgatg caacggctgg tctgactcac atgatgatct ggcactccaa tttgaatgat 481 gcaacttacc agaggacaag agctcttgtt cgcacaggaa tggatcccag gatgtgctca 541 ctgatgcagg gttcaaccct ccctaggagg tctggggccg caggtgctgc agtcaaagga 601 gttggaacaa tggtgatgga attgatcaga atgatcaaac gtgggatcaa tgatcggaac 661 ttctggaggg gtgagaatgg acggagaaca aggattgctt atgaaagaat gtgcaacatt 721 ctcaaaggga aatttcaaac agctgcacaa agagcaatgg tggatcaagt gagagagagc 781 cggaatccag gaaatgctga gttcgaagat ctcatctttc tagcacggtc tgcactcata 841 ttgagagggt cagttgctca caagtcctgc ctgcctgcct gtgtgtatgg acctgccgta 901 gccagtggat acgactttga aagagaggga tactctctag tcggaataga ccctttcaga 961 ctgcttcaaa acagccaagt atacagccta atcagaccaa atgagaatcc agcacacaag 1021 agtcaactgg tgtggatggc atgccattct gctgcatttg aagatctaag agtatcaagc 1081 ttcatcagag ggacgaaagt ggtcccaaga gggaagcttt ccactagagg agttcaaatt 1141 gcttccaatg aaaacatgga gactatggaa tcaagtaccc ttgaactgag aagcagatac 1201 tgggccataa ggaccagaag tggagggaac accaatcaac agagggcttc ctcgggccaa 1261 atcagcatac aacctacgtt ctcagtacag agaaatctcc cttttgacag accaaccatt 1321 atggcagcat tcactgggaa tacagagggg agaacatctg acatgagaac cgaaatcata 1381 aggctgatgg aaagtgcaag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcagc gagcccgatc gtgccctcct ttgacatgag taatgaagga 1501 tcttatttct tcggagacaa tgcagaggag tacgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPC 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/swine/Iowa/15/30 (H1N1) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30747 KEYWORDS nucleoprotein. SOURCE Influenza A/swine/Iowa/15/30 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1565; revises [1]) AUTHORS Gorman,O.T. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 509 a 318 c 411 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcg aaatcatggc gtctcaaggc 61 accaaacgat catacgaaca aatggaaact ggtggagaac gccagaatgc cacggaaatc 121 agagcatctg tcggaagaat gattggtgga attggaagat tctacatcca aatgtgcacc 181 gaactcaaac tcagtgatta tgagggacgg ctgattcaga acagcataac aatagagaga 241 atggtgctct ctgcttttga cgagaggagg aacaaatatc tggaagaaca tccaagcgct 301 gggaaagacc ctaagaaaac tggaggaccc atatacagga gaatagacgg aaagtggata 361 agagaactca tcctttatga caaagaagaa ataaggagaa tctggcgcca agccaacaat 421 ggtgaggata caacagccgg tctgactcac atgatgatct ggcattccaa tttgaatgat 481 gccacttatc agaggacaag agctcttgtt cgcaccggaa tggatcccag gatgtgctct 541 ctgatgcagg gttcaactct ccccaggagg tctggagccg caggtgctgc agtcaaagga 601 gttgggacag tggtgatgga gttgatcaga atgatcaaac gtggaatcaa tgatcggaac 661 ttctggaggg gtgaaaatgg acgaaggaca agaattgcct atgaaagaat gtgcaacatt 721 ctcaaaggga aatttcaaac agctgcacag agagcaatga tggatcaagt gagagagagc 781 cgaaacccag gaaatgctga aatcgaagat ctcatctttc tggcacgatc cgcactcata 841 ttacgaggat cagttgcaca caagtcctgt ctgcctgcct gtgtgtatgg acttgctgta 901 gccagtggac atgactttga aagagagggg tactctctgg tcggaataga ccctttcaga 961 ctgcttcaaa acagccaagt attcagccta atcagaccaa atgaaaaccc agcgcacaag 1021 agtcaattag tgtggatggc atgccattct gctgcatttg aagatttaag ggtatcaagc 1081 ttcataagag ggaaaagagt ggttccacga gggcagcttt ccaccagagg ggttcagatt 1141 gcttccaatg agaacatgga gactatggac tctagtactc ttgaactgag aagcagatac 1201 tgggccataa ggaccagaag tggaggaaac actaatcaac agagggcatc tgcgggccaa 1261 atcagcgtgc aacctacatt ctcggtgcag agaaatctcc cttttgagag agcaaccgtt 1321 atggcagcat tcactgggaa cacagaggga agaacatcag acatgagaac cgaaatcata 1381 aggataatgg aaagtgcaag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gagcccgatc gtgccttcct ttgacatgag taacgaagga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPD 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/swine/Tennessee/24/77 (H1N1) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30748 KEYWORDS nucleoprotein. SOURCE Influenza A/swine/Tennessee/24/77 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1565; revises [1]) AUTHORS Gorman,O.T. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 514 a 307 c 409 g 335 t ORIGIN 1 agcaaaagca gggtagataa tcactcaatg agtgacatcg aaatcatggc gtctcaaggc 61 accaaacgat catatgaaca aatggagact ggtggggaac gccaggatgc cacagaaatc 121 agagcatctg tcggaagaat gattggtgga atcggaagat tctacatcca aatgtgcact 181 gaactcaaac tcagtgacta tgagggacga ctgattcaaa atagcataac aatagagaga 241 atggtgctct ctgcttttga cgagagaagg aataaatacc tagaagagca tcccagtgct 301 gggaaagatc ctaagaaaac tggaggaccc atatatagaa gagtagacgg aaaatggatg 361 agggaactca tcctttatga caaagaagaa ataaggagag tttggcgcca agccaacaat 421 ggtgaagatg caacagccgg ccttacccat attatgattt ggcactccaa tctgaatgat 481 gccacctatc agagaacaag agctcttgtt cgcactggga tggatcccag aatgtgctcc 541 ctaatgcaag gttcaacact tcccagaagg tctggagccg caggtgctgc agtaaaagga 601 gttggaacaa tagcgatgga gttaatcaga atgatcaaac gtgggatcaa tgaccgaaac 661 ttctggaggg gtgaaaatgg acgaaggaca aggattgcat atgaaagaat gtgcaacatt 721 ctcaaaggga aatttcagac agctgcccag agggcaatga tggatcaagt gagagaaagt 781 cggaacccag gaaacgctga aattgaagat ctcattttcc tggcacggtc agcactcatt 841 ttaagagggt cagttgcaca taagtcttgc ctgcctgctt gtgtgtatgg gcttgcagta 901 gcgagtgggc atgactttga aagagaagga tattctctgg tcggaataga ccccttcaaa 961 ctacttcaaa acagtcaagt attcagcctg atcagaccaa atgaaaaccc agctcacaag 1021 agtcaactgg tgtggatggc atgccactct gccgcatttg aggatttaag agtatcaggc 1081 ttcataagag ggaagaaagt ggttccaaga ggaaagcttt ccacaagagg ggttcagatt 1141 gcttcaaatg agaatgtgga agctatggac tctagtaccc tggaactaag aagcaggtac 1201 tgggccataa ggaccagaag cgggggaaat accaatcaac agaaggcatc cgcaggccag 1261 atcagtgtgc aacctacatt ctcagtacaa aggaatctcc cttttgagag agcgaccgtt 1321 atggcagctt tcattgggaa caatgaggga cgaacatcag atatgcgaac tgaaatcata 1381 aggatgatgg aaagtgcaaa gccagaagat ttgtccttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgaa taatgagggg 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPE 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/swine/Netherlands/12/85 (H1N1) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30749 KEYWORDS nucleoprotein. SOURCE Influenza A/swine/Netherlands/12/85 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 503 a 307 c 419 g 336 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacattc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaacc ggaggagaac gccagaatgc tactgaaatc 121 agagcatctg ttgggggaat ggttggtgga attggaagat tctacataca gatgtgcact 181 gaactcaaac tcagtgacta tgaagggagg ctgatccaga atagcataac aatagagaga 241 atggttctct ctgcatttga tgagaggagg aacaaatacc tggaagaaca tcccagtgcg 301 gggaaggacc caaagaaaac tggaggtcca atctacaaaa agagagacgg aaaatggatg 361 agagagctga ttctgtatga caaagaggag atcaggagga tttggcgtca agcaaacaat 421 ggtgaagatg ctactgctgg tctcactcat ctgatgattt ggcattccaa cctgaatgat 481 gccacatatc agagaacaag agctctcgtg cgtactggga tggaccccag aatgtgctct 541 ctgatgcaag gatcaactct cccgaggaga tctggagctg caggtgcagc agtaaaggga 601 gttgggacga tggtaatgga actgattcgg atgataaagc gggggatcaa tgatcggaac 661 ttctggagag gcgaaaatgg gcgaagaaca agaattgcat atgagagaat gtgcaacatc 721 ctcaaaggga aatttcagac agcagcgcaa cgagcaatga tggaccaggt gcgagaaagc 781 agaaatcctg ggaatgctga gattgaagac cttatctttc tggcacgatc tgcactcatt 841 ctgagaggat cagtggctca taaatcctgc ctgcctgctt gtgtatatgg acttgttgtg 901 gcaagtggat atgactttga aagagaaggg tactctctag tcggaataga tcctttccgt 961 ctgctccaga acagccaggt gttcagcctc attagaccaa atgagaatcc agcacataag 1021 agtcagctgg tatggatggc atgccattct gcagcgtttg aagatctgag agtgtcaagt 1081 ttcatcagag ggacaaaagt ggtcccaaga ggacaactgt ccaccagagg agttcaaatt 1141 gcttcaaatg aaaacatgga aacaatggac tccagtactc ttgaactgag aagcaaatac 1201 tgggctataa gaaccaggag cggaggaaac accaaccaac agagagcatc tgcagggcaa 1261 atcagtgtac aacctacttt ctcggtacag agaaatcttc ctttcgagaa agcgaccatc 1321 atggcagctt ttacagggaa cactgaaggc agaacatctg acatgaggac tgaaattata 1381 agaatgatgg aaagtgccag accagaagat gtgtccttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPF 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/equine/London/1416/73 (H7N7) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30750 KEYWORDS nucleoprotein. SOURCE Influenza A/equine/London/1416/73 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 497 a 322 c 418 g 328 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatca aactcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc 121 agagcatctg ttggaaggat ggtgggagga atcggccggt tttatgttca aatgtgtact 181 gagctcaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagagagg 241 atggtgcttt cggcattcga cgaaagaaga aacaagtacc tggaggagca tcccagtgct 301 ggaaaagacc ccaagaaaac gggaggcccg atatacagaa ggagagatgg gaaatggatg 361 agagaactca tcctccatga taaagaagaa atcaggagga tctggcgtca ggccaacaat 421 ggtgaagatg ctactgctgg tctgactcac atgatgatct ggcactccaa tctcaatgac 481 accacctacc aaagaacaag ggctcttgtt cggactggga tggatcccag aatgtgctct 541 ctgatgcaag gatcaactct cccacggaga tctggagctg ccggtgctgc agtgaagggt 601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaac 661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaacatc 721 ctcaagggga aattccaaac agcagcacaa cgagcaatga tggaccaagt gagagagggc 781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcgcgatc agcactcatt 841 ctgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta 901 gccagtgggt atgacttcga gaaagaggga tactctctgg ttggaattga tcctttcaaa 961 ctactccaga acagccaaat tttcagtctg atcagaccga aagaaaatcc agcacacaag 1021 agccagctgg tgtggatggc atgccattct gcagcatttg aggacctgag agtttcgaat 1081 ttcattagag gaaccaaagt aatcccaaga ggacagttgg caaccagagg agtgcaaatt 1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat 1201 tgggcaataa ggaccaggag tggggggaac accagtcaac agagagcatc tgcaggacag 1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt 1321 atggctgcat tcactggaaa cactgagggg aggacttccg acatgagaac ggaaatcata 1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg 1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPG 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/equine/Kentucky/2/86 (H3N8) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30751 KEYWORDS nucleoprotein. SOURCE Influenza A/equine/Kentucky/2/86 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 502 a 318 c 411 g 334 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatca aagtcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc 121 agagcatctg tcggaaggat ggtgggagga atcggccggt tctatgttca gatgtgtact 181 gagcttaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagaaagg 241 atggtacttt cggcattcga cgaaagaaga aacaagtacc tcgaggagca tcccagtgct 301 gggaaagacc ccaagaaaac gggaggcccg atatacagaa ggaaagatgg gaaatggatg 361 agagaactca tcctccatga taaagaagaa atcatgagga tctggcgtca ggccaacaat 421 ggtgaagacg ctactgctgg tcttactcat atgatgatct ggcactccaa tctcaatgac 481 accacatacc aaagaacaag ggctcttgtt cgggctggga tggatcccag aatgtgctct 541 ctgatgcaag gatcaaccct cccacggaga tctggagctg ccggtgctgc agtaaaaggt 601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaat 661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaatatc 721 ctcaaaggga aattccaaac agcagcacaa cgggcaatga tggaccaagt gagggagggc 781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcacgatc agcactcatt 841 ttgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta 901 gccagtgggt atgactttga gaaggaagga tactctctgg ttggaattga tcctttcaaa 961 ctactccaga acagccaaat tttcagtcta atcagaccga aagaaaatcc agcacacaag 1021 agccagttgg tgtggatggc atgccattct gcagcatttg aggacctgag agttttgaat 1081 ttcattagag gaaccaaagt aatcccaaga ggacagttag caaccagagg agtgcaaatt 1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat 1201 tgggcaataa ggaccaggag tggagggaac accagtcaac agagagcatc tgcaggacag 1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt 1321 atggctgcat tcactgggaa cactgagcgg aggacttccg acatgagaac ggaaatcata 1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg 1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPH 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Massachussetts/26/80 (H13N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30752 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Massachussetts/26/80 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 492 a 317 c 429 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc 61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt 121 agggcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact 181 gaactcaagc tcagtgacaa tgagggacga ttgatccaaa acagcatcac catagagaga 241 atggtcctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc 301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg 361 agagaattag ttctgtatga caaagaagaa ctaagaagaa tctggcggca ggcaaacaat 421 ggggaagatg cgactgctgg tctcacccac ttgatgatct ggcattccaa tttgaatgac 481 gccacgtatc agagaactag agcacatgtg cgcacaggaa tggatcccag gatgtgctcc 541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtgaaggga 601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat 661 ttctggagag gtgaaaacgg acgaagaaca agaattgcct atgaaagaat gtgcaacatc 721 ctcaaaggga aatttcaaac agcagcacag cgggctatga tggatcaggt gcgggaaagc 781 cggaatcctg gaaatgctga aatagaggac cttatatttc tggctcgatc tgcacttatc 841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggcggtg 901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt 961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag 1021 agtcaattgg tgtggatggc atgccattct gctgcatttg aggatttgag agtgtcaagt 1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt 1141 gcatccaatg agaacatgga aacaatgaat tccagcactc ttgaattgag aagcaaatac 1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa 1261 gtcagtgttc aacccacttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc 1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata 1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcagacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPI 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Astrakhan/227/84 (H13N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30753 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Astrakhan/227/84 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 489 a 323 c 426 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggtgaac gccagaatgc cactgagatt 121 cgggcatctg tcggaaggat ggttggagga atcggaagat tctacataca gatgtgcact 181 gaactcaaac taagtgacaa cgaggggagg ctgatccaga acagtatcac aatagaaaga 241 atggtccttt ctgcgtttga tgagagaagg aacaggtact tggaggagca ccccagcgct 301 gggagggacc ccaagaagac tggaggacca atttacagaa ggagagaggg aaaatgggtg 361 agagaattgg tcctgtatga caaggaagaa ataagaagaa tctggcgaca agcaaacaat 421 ggagaagatg caactgccgg ccttacccac ttgatgatct ggcattctaa cttgaatgac 481 gccacatatc agaggactag agcccttgtg cgcactggga tggaccccag gatgtgctcc 541 cttatgcaag gatcgacact tccaagaagg tctggagcgg ctggtgcagc tgtaaaggga 601 gttgggacaa tggtgatgga actcatcaga atgataaaaa gaggggtcaa tgaccgtaac 661 ttctggagag gtgaaaatgg acgaagaaca agaattgcct atgaaagaat gtgcaacatt 721 ctcaaaggaa aattccaaac agctgcacaa cgagctatga tggatcaggt cagagaaagc 781 cgtagtcctg gaaatgctga aatcgaggac ctcatctttc tggccaggtc tgcacttatt 841 ctgaggggag cagtggctca caaatcatgc ctgcctgcct gtgtgcatgg acttgccgta 901 gcaagtggat acgactttga gagggaggga tactcccttg ttgggataga ccctttccgt 961 ctgctccaga acagccaggt attcagtcta atccgaccca atgaaaatcc agcacataag 1021 agtcagttgg tgtggatggc atgccattcc gctgcatttg aggatttgag agtgtcaagc 1081 ttcatcaggg gagcaagagt gttaccaaga gggcaactat ccacaagagg tgttcaaatt 1141 gcatccaatg agaacatgga gactatgagt tccagcactc ttgaattgag aagcaaatac 1201 tgggcaataa ggactagaag cggaggaaac accaaccaac aaagagcatc tgcaggacaa 1261 atcagtgtgc aacccacttt ctctgtgcaa agaaacctcc cctttgagag agcgaccatc 1321 atggctgcat tcacaggaaa tgcagaaggc agaacatctg atatgagaac tgagatcata 1381 cggatgatgg aaaatgcaag accagaggat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taaagaggga 1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPK 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Maryland/1824/78 (H13N9) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30755 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Maryland/1824/78 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 492 a 319 c 428 g 326 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc 61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt 121 agagcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact 181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga 241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc 301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg 361 agagaattag ttctgtatga caaagaagaa ctaaggagaa tctggcggca ggcaaacaat 421 ggggaagatg cgaccgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac 481 gccacgtatc agagaactag agcactagtg cgcacaggga tggatcccag gatgtgctcc 541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga 601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat 661 ttctggagag gtgaaaacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc 721 ctcaaaggga aatttcaaac agcagcacaa cgggctatga tggatcaagt gcgggaaagc 781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc 841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtgaatgg actggctgtg 901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt 961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag 1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt 1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt 1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattgag aagcaaatac 1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa 1261 gtcagtgttc aacccagttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc 1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata 1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc agggacgggg agtcttcgag 1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPL 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Maryland/1815/79 (H13N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30756 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Maryland/1815/79 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 490 a 319 c 429 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc 61 accaaacgat cttatgagca gatggaaact ggtggcgagc gccagaatgc caatgaaatt 121 agagcatctg tcgggaggat ggttggggga atcggaagat tctacataca gatgtgcact 181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga 241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc 301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg 361 agagaattag ttctgtatga caaagaagaa ctaaggagaa tctggcggca ggcaaacaat 421 ggggaagatg cgaccgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac 481 gccacgtatc agagaactag agcactagtg cgcacaggga tggatcccag gatgtgctcc 541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga 601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat 661 ttctggagag gtgaaaacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc 721 ctcaaaggga aatttcaaac agcagcacaa cgggctatga tggatcaggt gcgggaaagc 781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc 841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggctgtg 901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt 961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag 1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt 1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt 1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattgag aagcaaatac 1201 tgggcaataa ggactagaag tggaggaaac accaaccaac aaagagcatc tgcaggacaa 1261 gtcagtgttc aacccagttt ctctgtgcag agaaacctcc cctttgagag ggcgaccatc 1321 atggctgcat tcacaggaaa tccggagggc aggacatctg atatgagaac ggaaatcata 1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPN 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/turkey/Minnesota/833/80 (H4N2) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30769 KEYWORDS nucleoprotein. SOURCE Influenza A/turkey/Minnesota/833/80 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 494 a 311 c 429 g 331 t ORIGIN 1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgaaatc 121 agggcatctg ttgggagaat ggttggtgga atcgggagat tctacataca gatgtgcaca 181 gaactcaaac tcagcgacta tgaaggaagg ctgatccaaa acagcataac aatagagaga 241 atggttctct cggcatttga tgagaggaga aacaaatatc tggaagagca tcccagtgct 301 ggaaaagacc ctaagaagac tggaggtcca atatacagga ggagagatgg gaaatgggtg 361 agagaactga tcctgtatga caaagaggag atcagaagga tttggcgtca agcgaataat 421 ggagaagacg caaccgctgg tctcacccat ttgatgatct ggcattccaa tctgaatgat 481 gccacatatc agagaacaag ggcactggtg cgtactggga tggatcccag gatgtgctct 541 ctgatgcaag gatcaactct tccgaggagg tctggggctg ctggagcagc agtgaaagga 601 gttggaacaa tggtgatgga attgattcgg atgatcaagc gagggatcaa tgatcggaat 661 ttctggagag gtgaaaatgg acggagaaca agaattgcat acgaaagaat gtgcaacatt 721 ctcaagggga aattccaaac agcagcacaa cgagcaatga tggaccaggt gagggaaagc 781 cggaatccag ggaatgctga aattgaagat ctaatctttc ttgcacggtc tgctctcatt 841 ctgaggggat cagtggctca taagtcctgc ctgcctgctt gtgtgtatgg acttgctgtg 901 gccagtggat acgactttga gagggaggga tactccctag tcggaattga tcctttccgt 961 ctgctccaaa acagtcaagt cttcagtctt atcagaccaa acgaaaatcc agcacataaa 1021 agtcagctgg tatggatggc atgtcattct gcagcatttg aagatctgag agtgtcaagc 1081 ttcatcagag gaacaagagt ggtcccaaga ggacaactgt ccaccagagg agttcagatt 1141 gcttcaaatg agaacatgga gacaatggac tccagtactc ttgaattgag gagcagatac 1201 tgggctataa ggaccagaag cggagggaac actaaccagc agagagcatc tgcaggacaa 1261 atcagcgtac agcccacatt ctctgtgcag agaaacctcc cattcgagag agcaaccatt 1321 atggcggcat ttacgggaaa cactgaaggc agaacttcag acatgagaac tgagatcata 1381 aggatgatgg aaaatgccag acctgaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaagga 1501 tcttatttct tcggagacaa tgcagaggaa tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPO 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/gull/Minnesota/945/80 (H13N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30757 KEYWORDS nucleoprotein. SOURCE Influenza A/gull/Minnesota/945/80 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry REFERENCE 2 (sites) AUTHORS Gorman,O.T. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989, and [2] 18-JUL-1990. The isolate mentioned in the published paper [1] is incorrect and should be "A/Gull/Minnesota/945/80". FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 490 a 321 c 426 g 328 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtcccaaggc 61 accaaacgat cttatgagca aatggaaact ggtggcgagc gccagaatgc caatgacatc 121 agagcatctg tcgggaggat ggttggggga attggaagat tctacataca gatgtgcact 181 gaactcaagc tcagtgacaa tgagggaaga ttgatccaaa acagcatcac catagagaga 241 atggttctat ctgcatttga tgagagaagg aacaagtacc tggaagagca tcccagcacc 301 gggagagacc ccaagaaaac tggaggacca atttacagaa ggagagatgg aaaatgggtg 361 agagaattag ttctgtatga caaagaagag ctaaggagaa tctggcggca ggccaacaat 421 ggggaagatg cgactgctgg tctcacccat ttgatgatct ggcattccaa tttgaatgac 481 gccacgtatc aaagaactag agcacatgtg cgcacaggga tggatcccag gatgtgctcc 541 ctcatgcagg gctcaacact ccctagaagg tctggagctg ctggtgcagc agtaaaggga 601 gttgggacaa tggtaatgga actcatcagg atgataaaga gaggagtcaa tgaccgcaat 661 ttctggagag gtgaagacgg acgaagaacg agaattgcct atgaaagaat gtgcaacatc 721 ctcaaaggga aatttcagac agcagcacaa cgggctatga tggatcaagt gcgggaaagc 781 cggaatcctg gaaatgctga aatcgaggac cttatatttc tggctcgatc tgcacttatc 841 ctgaggggag cagtagctca caaatcatgc ctgccggcct gtgtatatgg actggctgtg 901 gcgagtggat atgactttga aagggaggga tattccctcg ttggaataga tcctttccgt 961 ctactccaaa acagccaggt attcagccta atccgaccaa atgagaatcc agcacataag 1021 agtcaattgg tgtggatggc atgccattcc gctgcatttg aggatctgag agtgtcaagt 1081 ttcatccggg gaacaagggt gctaccaagg ggacaattgt cgactagggg tgttcaaatt 1141 gcatccaatg agaacatgga aaccatgaat tccagcactc ttgaattaag aagcaaatac 1201 tgggcaataa ggactagaag tggaggaaac accaaccagc aaagagcatc tgcgggacaa 1261 gtcagtgttc aacccacttt ctctgtgcag agaaacctcc cctttgaaag ggcgaccatc 1321 atggctgcat tcacaggaaa tccggagggc agaacatctg atatgagaac ggaaatcata 1381 aggatgatgg aaaattcaag gccagaggat gtgtctttcc agggacgggg agtcttcgag 1441 ctctcagacg aaaaggccac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgctgaggag tatgacaatt gaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPP 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/equine/Tennessee/5/86 (H3N8) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30758 KEYWORDS nucleoprotein. SOURCE Influenza A/equine/Tennessee/5/86 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 502 a 318 c 411 g 334 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatca aagtcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc aactgaaatc 121 agagcatctg tcggaaggat ggtgggagga atcggccggt tctatgttca gatgtgtact 181 gagcttaaac tcaacgacca tgaagggcgg ctgattcaga acagcataac aatagaaagg 241 atggtacttt cggcattcga cgaaagaaga aacaagtacc tcgaggagca tcccagtgct 301 gggaaagacc ccaagaaaac gggaggcccg atatacagaa ggaaagatgg gaaatggatg 361 agagaactca tcctccatga taaagaagaa atcatgagga tctggcgtca ggccaacaat 421 ggtgaagacg ctactgctgg tcttactcat atgatgatct ggcactccaa tctcaatgac 481 accacatacc aaagaacaag ggctcttgtt cgggctggga tggatcccag aatgtgctct 541 ctgatgcaag gatcaaccct cccacggaga tctggagctg ccggtgctgc agtaaaaggt 601 gttggaacaa tggtaatgga actcatcagg atgatcaaac gcgggataaa tgatcgaaat 661 ttctggagag gtgaaaatgg tcgaagaacc agaattgctt atgaaagaat gtgcaatatc 721 ctcaaaggga aattccaaac agcagcacaa cgggcaatga tggaccaagt gagggagggc 781 cgcaatcctg gaaatgctga gattgaggat ctcattttct tggcacgatc agcactcatt 841 ttgagaggat cagtagccca taaatcatgc ctacctgcct gtgtttatgg ccttgcagta 901 gccagtgggt atgactttga gaaggaagga tactctctgg ttggaattga tcctttcaaa 961 ctactccaga acagccaaat tttcagtcta atcagaccga aagaaaatcc agcacacaag 1021 agccagttgg tgtggatggc atgccattct gcagcatttg aggacctgag agttttgaat 1081 ttcattagag gaaccaaagt aatcccaaga ggacagttag caaccagagg agtgcaaatt 1141 gcttcaaatg aaaacatgga gacaatagat tctagcacac tcgaactgag gagcagatat 1201 tgggcaataa ggaccaggag tggagggaac accagtcaac agagagcatc tgcaggacag 1261 ataagtgtgc aacccacttt ctcagtgcag agaaatcttc cctttgaaag agcaaccatt 1321 atggctgcat tcactgggaa cactgagcgg aggacttccg acatgagaac ggaaatcata 1381 aggatgatgg aaaatgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag caatgaaggg 1501 tcttatttct tcggagacaa tgctgaggag tttgacagtt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPR 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/duck/New Zealand/31/76 (H4N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30760 KEYWORDS nucleoprotein. SOURCE Influenza A/duck/New Zealand/31/76 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 502 a 325 c 412 g 326 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgaaca gatggagact ggtggagaac gccagaatgc tactgagatc 121 agagcatctg ttgggagaat ggttggtgga atcggaaggt tctacataca gatgtgcact 181 gaactcaaac tcagtgacta tgaaggaagg ctaatccaaa acagcataac aatagagaga 241 atggttcttt ctgcatttga tgaaaggaga aacaagtact tggaggagca ccccagtgct 301 ggaaaggatc ctaagaaaac tggaggtcca atctatagaa gaagagatgg aaaatgggtg 361 agagaactga tcctgtacga caaagaggag atcaggagga tttggcgcca agcgaataat 421 ggagaggacg caactgctgg tcttacccat ttgatgatct ggcattccaa tcttaatgat 481 gcaacatacc agagaacaag agcccttgtg cgtactggga tggaccccag gatgtgctct 541 ctgatgcaag gctcaactct cccgaggagg tctggagctg caggtgctgc agtaaaagga 601 gttggaacaa tggtgatgga gctgattcga atgataaaac gaggaatcaa tgaccgtaac 661 ttctggagag gcgagaatgg acgaaggaca agaattgctt acgagaggat gtgcaacatt 721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggatcaggt gcgggagagc 781 cgaaatcctg gaaatgctga aattgaagac ctcatcttct tggcacgatc tgcactcatt 841 ctgaggggat cagtggctca caaatcctgc ctacctgctt gtgtatatgg attagctgta 901 gccagtggat atgactttga aagagaagga tactccctgg tcggaataga ccccttccgt 961 ctgctccaga acagccaggt attcagcctc atcagaccca atgaaaatcc agcacacaag 1021 agtcaactgg tgtggatggc atgccactct gcagcatttg aagacctgag agtttcaagc 1081 ttcatcaggg gaacaagagt gatcccgaga gggcaactgt ccactagagg ggtccaaatt 1141 gcttcaaatg aaaacatgga aacaatggac tctagcactc tcgaattgag aagcagatac 1201 tgggccataa ggaccagaag tggaggaaac accaaccaac agcgggcatc tgcgggacaa 1261 atcagcgtgc aacccacttt ctctgtgcaa agaaatcttc ctttcgaaag agcaaccatc 1321 atggcagcat ttacagggaa cactgaaggc agaacatctg acatgagaac tgagatcata 1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcagacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPS 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/grey teal/Australia/2/79 (H4N4) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30761 KEYWORDS nucleoprotein. SOURCE Influenza A/grey teal/Australia/2/79 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 513 a 327 c 401 g 324 t ORIGIN 1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgaaca aatggaaact ggtggagaac gccagaatgc tactgaaatc 121 agatcttctg ttggaagaat ggttggtgga attggaagat tctacataca gatgtgcact 181 gaattcaaac tcagcgacta tgaaggaagg ctgatccaaa acagcataac aatagagaga 241 atggtccttt ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagtgct 301 ggaaaagatc ctaagaagac tggaggtcca atctacagaa ggagagatgg aaagtggatt 361 agagagctga tcctatgcga caaagaggag atcaggagaa tctggcgcca agcgaataat 421 ggggaggatg caactgctgg ccttacccat ttgatgatat ggcattccaa cctcaatgat 481 gccacttacc agagaacaag agcccttgtg cgtactggga tggatcctag aatgtgctct 541 ctgatgcaag gctcaactct cccgagaaga tctggagctg ctggcgctgc agtaaaagga 601 gttgggacaa tggtgatgga actgattcgg atgataaaac ggggaatcaa tgaccgaaat 661 ttctggagag gcgaaaatgg gcgaagaaca agaattgcct atgagagaat gtgcaacatt 721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggaccaagt gcgggaaagc 781 cgaaatcctg ggaatgctga aattgaagac cttatatttc tggcacgatc tgcactcatc 841 ctaagaggat cagtagccca caaatcctgc ctacctgctt gtgtgtatgg attggccgta 901 gccagtggat atgactttga gaaagaagga tactccctag tcggaataga tcctttccgc 961 atgctccaga acagccaggt attcagtctc atcaggccca atgaaaattc agcacacaag 1021 agtcaattgg tatggatggc atgccactct gcagcatttg aagacctgag agtatcaagc 1081 ttcatcagag gaacaaaagt ggtcccaaga gggaaactct ccaccagagg agtccaaatc 1141 gcttcaaatg agaacatgga aacaatggac tctacaactc tcgaattgag aagcaggtac 1201 tgggctataa ggaccagaag cggaggaaac acaaatcagc agcgggcgtc agctggacaa 1261 atcagcgtac aacccacttt ctctgtgcag aggaatctcc cattcgagag agcaaccatt 1321 atggcagcat ttaccgggaa cactgaaggc agaacatccg acatgagaac tgagatcata 1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aagggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPT 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/duck/Czechoslovakia/56 (H4N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30762 KEYWORDS nucleoprotein. SOURCE Influenza A/duck/Czechoslovakia/56 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 489 a 315 c 432 g 329 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgaaca gatggaaact ggtggagagc gccagaatgc cactgagatc 121 agagcatctg ttgggagaat ggtcggtgga attgggaggt tttacataca gatgtgtact 181 gaactcaagc tcagtgacta tgaagggaga ctgatccaga acagcataac aatagagaga 241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgct 301 gggaaggacc ctaagaaaac tggaggtcca atctatagaa ggagagacgg aaaatgggtg 361 agagaattga tcctgtacga caaagaggag atcaggagga tttggcgcca agcaaacaat 421 ggagaagacg caactgctgg tctcactcac ttgatgatct ggcattccaa tctgaatgat 481 gccacatatc agagaacaag agctcttgta cgtactggga tggacccaag aatgtgctct 541 ctgatgcaag gatcaactct cccgaggagg tctggagctg ctggtgcagc agtgaaggga 601 gttgggacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat 661 ttctggagag gcgaaaatgg acgaagaaca aggattgcat atgagaggat gtgcaatatc 721 ctcaaaggga agttccaaac agcagcacaa agagcaatga tggaccaggt gcgggagagc 781 cggaatcctg ggaatgctga aattgaagac cttatcttcc tggcacggtc tgcactcatt 841 ctgagaggat cagtggctca taagtcctgt ctgcctgctt gtgtgtatgg acttgctgtg 901 gccagtggat acgactttga gagagagggg tactctctgg tcggaataga tcctttccgt 961 ctgctccaaa acagccaagt attcagcctc attagaccaa atgagaatcc agcacataag 1021 agccaactgg tatggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc 1081 ttcatcagag ggacaagagt ggtcccaaga gggacaactg tcaccagagg atttcaaatt 1141 gcttcaaatg agaacatgga aacaatggac tccagcactc ttgaactgag gagcagatat 1201 tgggctataa ggaccaagag cggaggaaac accaaccaac agagagcatc tgcaggacaa 1261 atcagtgtac aacccacttt ctcggtacag agaaatctcc ctttcgagag agcgaccatc 1321 atggcggcat ttacagggaa cactgaaggc agaacgtctg acatgaggac tgagatcata 1381 agaatgatgg agagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPU 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/duck/Ukraine/2/60 (H11N8) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30763 KEYWORDS nucleoprotein. SOURCE Influenza A/duck/Ukraine/2/60 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 496 a 325 c 423 g 321 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggagatc gccagaatgc cactgagatc 121 agagcatctg ttgggaggat ggtcagcgga attggaagat tctacataca gatgtgcact 181 gaactcaaac tcagcgacta tgaagggagg ctgatccaga acagcataac aatagaaaga 241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgca 301 gggaaggatc ctaagaaaac cggaggtcca atctatagaa ggagagacgg aaaatggatg 361 agagagctga tcctgtatga caaagaggag atcaggagaa tttggcgcca agcgaacaat 421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcattccaa tctgaatgat 481 gccacatatc aaagaacaag agctctcgtg cgcactggaa tggaccccag aatgtgctcc 541 ctgatgcaag gatcaactct cccgaggaga tctggagctg ctggcgcagc agtaaaggga 601 gttgggacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat 661 ttctggagag gcgaaaatgg acgaagaaca aggattgcat atgagagaat gtgcaacatc 721 cttaaaggga aattccaaac agcagcacaa cgagcaatga tggaccaagt gcgggaaagc 781 cggaatcctg ggaatgctga aattgaagac cttatcttcc tggcacgatc tgcactcatt 841 ctgagaggat cagtggctca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg 901 gccagtggat acgacttcga gagagaggga tactctctgg tcggaataga tcctttccgt 961 ctgctccaaa acagccaggt gttcagtctc attagaccaa atgagaatcc tgcacataag 1021 agccaactgg tgtggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc 1081 ttcatcagag ggacaagagt ggtcccaaga gggcaactgt ccaccagagg agttcaaatt 1141 gcttcaaatg aaaacatgga aacaatggac tccagcactc ttgaactgag gagcagatac 1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc cgcaggacaa 1261 ataagtgtac aacccacttt ctcggtacag agaaatcttc ctttcgagag agcgaccatc 1321 atggcggcat ttacagggaa cactgaaggc agaacgtctg acatgaggac tgagatcata 1381 agaatgatgg agagtgccag atcagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPV 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/mallard/Astrakhan/244/82 (H?N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30764 KEYWORDS nucleoprotein. SOURCE Influenza A/mallard/Astrakhan/244/82 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 500 a 317 c 421 g 327 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatca acatcatggc gtctcaaggc 61 accaaacgat cttatgaaca gatggaaact ggtggagaac gccagaatgc tactgagatc 121 agagcatctg ttggaagaat ggttggtgga attgggaggt tttacataca gatgtgcact 181 gaactcaaac tcagcgacta tgaagggagg ctgatccaga acagcataac aatagagaga 241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagaaca tcccagtgcg 301 gggaaggacc cgaagaaaac tggaggtcca atctaccgaa ggagagacgg gaaatggatg 361 agagagctga ttctgtatga caaagaggag atcaggagga tttggcgtca agcgaacaat 421 ggagaagacg caactgctgg tctcactcac ctgatgatct ggcattccaa cctaaatgat 481 gccacatacc agagaacaag agctctcgtg cgtactggga tggaccccag aatgtgctct 541 ctgatgcaag gatcaactct cccgaggaga tctggagctg ctggtgcagc agtaaaggga 601 gtcgggacga tggtgatgga actaattcgg atgataaagc gagggattaa cgatcggaat 661 ttctggagag gcgaaaacgg acgaagaaca aggattgcat atgagagaat gtgcaacatc 721 ctcaaaggga aattccaaac agcagcacaa agagcaatga tggatcaggt gcgagaaagc 781 agaaatcctg ggaatgctga aattgaagat ctcatctttc tggcacggtc tgcactcatc 841 ctgagaggat cagtggccca taagtcctgc ttgcctgctt gtgtgtacgg acttgctgtg 901 gccagtggat atgactttga gagagaaggg tactctctgg tcgggataga tcctttccgt 961 ctgcttcaaa acagccaggt cttcagtctc atcagaccaa atgagaatcc agcacataag 1021 agtcaactgg tatggatggc atgccattct gcagcatttg aggacctgag agtctcaagt 1081 ttcatcagag ggacaagagt ggtcccaaga ggacaactat ccaccagagg agttcaaatt 1141 gcttcaaatg agaacatgga aacaatggac tccagtactc ttgaactgag aagcagatat 1201 tgggctataa gaaccaggag tggaggaaac accaaccaac agagagcatc tgcaggacaa 1261 atcagtgtac aacccacttt ctcggtacag agaaatcttc ctttcgaaag agcgaccatc 1321 atggcggcat ttacagggaa cactgaaggc agaacatctg acatgaggac tgaaatcata 1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaagga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPW 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/budgerigar/Hokkaido/1/77 (H4N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30765 KEYWORDS nucleoprotein. SOURCE Influenza A/budgerigar/Hokkaido/1/77 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 500 a 322 c 428 g 315 t ORIGIN 1 agcaaaagca gggtagataa tcactcacag agtgacatcc acgacatggc gtctcaaggc 61 accaaacgat cctatgagca gatggaaact ggtggagaac gccagaatgc tactgagatc 121 agagcatctg ttgggagaat ggtcggtgga attggaagat tctacataca gatgtgcact 181 gaaatcaaac tcagtgacta tgaagggagg ctgattcaga acagcataac aatagagaga 241 atggtcctct ctgcatttga ggaaaggagg aacaaatacc tggaagagca ccccagtgcg 301 ggaaaggacc cgaagaaaac tggaggtcca atctacagaa ggagagatgg aaaatgggtg 361 agagaactga tcctgtatga caaagaggag ataaggagga tctggcgcca agcgaacaat 421 ggagaagacg caactgcagg tctcactcac ctgatgatct ggcattccaa cctgaatgat 481 gccacatatc agagaacaag agccctcgtg cgtactggga tggaccccag aatgtgctct 541 ctgatgcaag gatcaactct cccgagaaga tctggggcag ctggtgcagc aataaaggga 601 gttgggacga tggtgatgga gctgattcgg atgataaaac ggggaatcaa tgatcggaat 661 ttctggagag gcgaaaatgg gcgaagaaca aggactgcat acgagagaat gtgcaacatc 721 ctaaaaggaa aattccaaac agcagcacaa cgagcaatga tggatcaggt gcgagaaagc 781 aggaatcctg gaaatgctga gattgaagac cttatatttc tggcacggtc tgcactcatt 841 ctgagaggat cagtagctca taagtcctgc ctgcctgctt gtgtatatgg gcttgcagtg 901 gccagtgggt acgactttga gagagaggga tactctctgg tcggaataga tcctttccgc 961 ctgcttcaaa acagtcaggt gttcagcctc attagaccaa atgagaatcc agcacataaa 1021 agccaactag tctggatggc ttgccattct gcagcatttg aagacctgag agtctcaagc 1081 ttcatcagag ggacaagact agtcccaaga ggacaactgt ccactagagg agttcaaatt 1141 gcttcaaatg agaacatgga agcaatggac tccaatactc ttgagctgag gagcagatac 1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc tgcaggacaa 1261 atcagcgtgc aacctacttt ctcggtacag aggaatctcc ctttcgagag agctaccatc 1321 atggcggcat ttacagggaa caatgaagga aggacttctg acatgaggac tgagatcata 1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPX 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/ruddy turnstone/New Jersey/47/85 (H4N6) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30766 KEYWORDS nucleoprotein. SOURCE Influenza A/ruddy turnstone/New Jersey/47/85 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 491 a 317 c 423 g 334 t ORIGIN 1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgaaatc 121 agagcatctg ttgggagaat ggttggtgga atcggaagat tctacataca gatgtgcact 181 gaactcaaac tcagtgacta tgaagggaga ctgatccaaa acagcataac aatagagaga 241 atggttctct cggcatttga cgagaggaga aacaaatatc tggaagagca ccccagtgct 301 gggaaagacc ctaaaaagac tggaggtcca atctacagga ggagagatgg gaaatggatg 361 agagaactca tcctatatga caaagaggag atcagaagga tttggcgtca agcgaataat 421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcattccaa tctgaatgat 481 gccacatacc agagaacaag ggcacttgtg cgtactggga tggatcccag gatgtgctct 541 ctgatgcaag gctcaactct tccgaggagg tctggggctg ctggagcagc agtgaaaggg 601 gttggaacaa tggtgttgga attgatccgg atgatcaagc gagggatcaa tgatcggaat 661 ttctggagag gcgaaaatgg aaggaggaca aggattgctt atgaaagaat gtgcaacatt 721 ctcaaaggga aattccaaac agcagcacaa cgagcaatga tggatcaggt gagggaaagc 781 cggaatcccg ggaatgctga aattgaagat ctcatctttc ttgcacggtc tgctctcatt 841 ctgagaggat cagtggccca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg 901 gccagtggat acgactttga aagggaggga tactccctag tcggaattga tcctttccgt 961 ttgcttcaaa acagtcaagt cttcagtctt atcagaccaa acgaaaatcc agcacataaa 1021 agtcagctgg tatggatggc atgccactct gcagcatttg aagatctgag agtgtcaagc 1081 ttcattagag gaacaagagt agtcccaaga ggacaactgt ccaccagagg agttcagatt 1141 gcttcaaatg agaacatgga gacaatggac tccagcactc ttgaattgag gagcagatac 1201 tgggctataa ggaccagaag cggagggaac actaaccagc agagagcatc tgcagggcaa 1261 atcagcgtac agcccacatt ctctgtgcag aggaacctcc cattcgagag agcaaccatt 1321 atggcggcat ttacaggaaa cactgaaggc agaacttcag acatgagaac tgagatcata 1381 aggatgatgg aaaatgccag tcctgaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taacgaagga 1501 tcttatttct tcggagacag tgcagaggaa tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPY 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/tern/South Africa/61 (H5N3) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30767 KEYWORDS nucleoprotein. SOURCE Influenza A/tern/South Africa/61 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 493 a 322 c 428 g 322 t ORIGIN 1 agcaaaagca gggtagataa tcactcactg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggagagc gccagaatgc cactgagatc 121 agagcatctg ttgggagaat ggtcggtgga atcgggagat tctacataca gatgtgcact 181 gaactcaaac tcagtgacta tgaagggagg ctgatccaga acagcataac aatagagaga 241 atggttctct ctgcatttga tgaaaggagg aacaaatacc tggaagagca tcccagcgcg 301 gggaaggacc cgaagaaaac tggaggtcca atctatagaa ggagagacgg aaaatggatg 361 agagaactga tcctgtatga caaagaggag atcaggagga tttggcgcca agcaaacaat 421 ggagaagacg caactgctgg tctcactcat ctgatgatct ggcattccaa cttgaatgat 481 gccacatatc agagaacaag agctcttgtg cgtactggga tggaccccag aatgtgctct 541 ctgatgcaag gatcaactct cccgaggaga tctggagccg ctggtgcagc agtgaaggga 601 gttggaacaa tggtaatgga gctaattcgg atgataaagc ggggaatcaa tgatcggaat 661 ttttggagag gcgaaaatgg acgaagaaca aggattgcat atgagagaat gtgcaacatc 721 ctcaaaggga aattccaaac agcagcacaa cgagcaatga tggaccaggt gcgagaaagc 781 cggaatcctg ggaatgctga aattgaagac cttatctttc tggcacggtc tgcactcatt 841 ctgagagggt cagtggctca taagtcctgc ctgcctgctt gtgtatatgg acttgctgtg 901 gccagtggat acgactttga gagagaggga tactctctgg tcggaataga ccctttccgt 961 ctgctccaaa acagccaggt attcagcctc attagaccaa aagagaatcc agcacataag 1021 agccaattgg tatggatggc atgccattct gcagcatttg aagacctgag ggtatcaagc 1081 ttcatcagag ggacaagagt ggtcccaaga ggacaactgt ccaccagagg atttcaaatt 1141 gcttcgaatg agaacatgga ggcaatggac tccagtactc ttgaactgag gagcagatac 1201 tgggctataa ggaccaggag cggaggaaac accaaccaac agagagcatc tgcaggacaa 1261 atcagcgtac aacccacttt ctcagtacag agaaatcttc ctttcgagag agcgaccatc 1321 atggcggcat ttacagggaa cactgaaggc agaacatctg acatgaggac tgagatcata 1381 agaatgatgg aaagtgccag accagaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcaac gaacccgatc gtgccttcct ttgacatgag taatgaggga 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS FLANPZ 1565 bp ss-RNA VRL 04-AUG-1990 DEFINITION Influenza A/chicken/Pennsylvania/1/83 (H5N2) nucleoprotein (seg 5) mRNA, complete cds. ACCESSION M30768 KEYWORDS nucleoprotein. SOURCE Influenza A/chicken/Pennsylvania/1/83 RNA, and cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1565) AUTHORS Gorman,O.T., Bean,W.J., Kawaoka,Y. and Webster,R.G. TITLE Evolution of the nucleoprotein gene of influenza A virus JOURNAL J. Virol. 64, 1487-1497 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by O.T.Gorman, 14-DEC-1989. FEATURES from to/span description pept 46 1542 nucleoprotein BASE COUNT 492 a 313 c 427 g 333 t ORIGIN 1 agcaaaagca gggtagataa tcactcaccg agtgacatcc acatcatggc gtctcaaggc 61 accaaacgat cttatgagca gatggaaact ggtggggaac gccagaatgc cactgagatc 121 agagcatctg ttgggagaat ggttggtgga atcggaaggt tctacataca gatgtgcact 181 gaactcaaac tcagcgacca tgaagggaga ctgatccaga acagcataac aatagagaga 241 atggttctct cggcatttga tgagagaaga aacagatatc tggaagagca tcccagtgct 301 gggaaagatc ctaagaagac tgggggtcca atctacagaa ggagagatgg gaaatgggtg 361 agagaattga tcctctatga caaagaggag atcagaagaa tttggcgtca agcaaataat 421 ggagaagacg caactgctgg tctcactcat ttgatgatct ggcactccaa cctgaatgat 481 gccacatatc agagaacaag ggcacttgtg cgtaccggga tggaccccag gatgtgctct 541 ctgatgcaag gctcaacact tccaaggagg tctggtgctg ctggagcagc cgtgaaaggg 601 attggaacaa tggtaatgga attgatacgg atgatcaaac gagggatcaa tgataggaat 661 ttctggagag gcgaaaatgg gcggagaaca agaattgcat acgagagaat gtgtaacatt 721 ctcaagggga aatttcaaac agctatacaa cgggcaatga tggaccaagt gagggaaagc 781 cggaatcctg ggaatgctga aattgaagat ctcatctttc ttgcgcggtc tgctctcatt 841 ctgaggggat cagtggcaca taagtcctgc ctgcctgctt gtgtgtatgg acttgctgtg 901 gccagtggat acgattttga gagggaggga tactccctag tcgggattga tcctttccgt 961 ctgctccaaa acagtcaagt tttcagtctt ataagaccaa acgaaaatcc agcacataaa 1021 agtcagctgg tatggatggc atgtcactct gcagcatttg aggatctgag agtatcaagc 1081 ttcattagag gaacaagagt ggtcccaaga gggcaacttt ccaccagagg agttcagatt 1141 gcttcaaatg agaacatgga tgcaatagac tccagtactc ttgaattgag gagcagatac 1201 tgggccataa gaaccagaag cggaggaaac acaaatcagc agagagcatc tgcaggacaa 1261 atcagcgtac agcccacatt ctcagtgcag agaaacctcc cattcgagag agcaaccatc 1321 atggcggcat tcacaggaaa cacagaaggc agaacttcag acatgagaac tgagattata 1381 aggatgatgg agaatgccag acctgaagat gtgtctttcc aggggcgggg agtcttcgag 1441 ctctcggacg aaaaggcgac gaacccgatc gtgccttcct ttgacatgag taacgagggg 1501 tcttatttct tcggagacaa tgcagaggag tatgacaatt aaagaaaaat acccttgttt 1561 ctact // LOCUS HUMCEAB 2690 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human carcinoembryonic antigen (CEA) hsCGM1 gene, exons 1 and 2. ACCESSION M22433 D51537 KEYWORDS carcinoembryonic antigen. SOURCE Human fetal liver (lambda-hsCHM1-1 library) DNA, clone hsCHM1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2690) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept 1250 1313 carcinoembryonic antigen hsCHM1, exon 1 /nomgen="CEA" /map="19q13.1-q13.3" /hgml_locus_uid="LN0164S" 2158 / 2517 carcinoembryonic antigen hsCHM1, exon 2 IVS 1314 2157 carcinoembryonic antigen hsCHM1, intron A IVS 2518 > 2690 carcinoembryonic antigen hsCHM1, intron B BASE COUNT 823 a 637 c 653 g 577 t ORIGIN Chromosome 19q31.1-q31.3. 1 gaattccaca gcaataacca cgatgacaac caccatgtac tcaacacccg cctgggcacg 61 gggctcccac agcagctcac ttattcccaa caactctgca aggaggattt taccatcctc 121 cttttacaaa tcagggaatc aaggatcata gaagccacgt gcacttgtcc aagtcaacat 181 agttaagtga cagaaccatt agctgtcccc aggtacatct ggacataaag ttcatgttta 241 tgccactgtg tcagcatttc caaaaactga ttttaggcga aacgtaagta agctttttaa 301 aaactttaat acttatgcgt ttattttaat atacattgag aaaacattta agcacacatc 361 aaatctgtaa tttcatggac aatattgcat aagacaagga tgttttgtct ccaactcctg 421 gcctcaagcc atcctcccac catagctctc gagtagctgg gatcacagac tgagccaccg 481 atccctgcta ggacaggatg ttttgtaaac taaatttatt tagaaaaaag gatgaaatat 541 ataataataa aggtggtaca agctagagag aaaatcataa agtcagccta gaaatgtctg 601 gtgtctggat gacataaagc tacagcactg tgaagcctca ttctcagtta ctcccaggaa 661 attagagtca cataatgctg cagaaagaac agctcagaat cttagatccg gctttagccc 721 tagatatatc catttgtagg accccagaca tctctgtgac ctccttgctg ggagtaaatc 781 caaccttccc agacatgtga gaacagtaag aagaccctgc acacacaaag gagtttctcc 841 gtcacagaga aaataacacc aggttcaggg accccaggga ctctgcatgg tgctgacaga 901 cccaaggcca aggcagagca gaggtccacg ctggggaggg agggtcatcc tgttatgaaa 961 cagggatcca agtaagcctt gcttctcaga gcctggtctg ggcaactcaa atgtagacag 1021 aaggccccaa ggaagaagag aaaatgaggc aaaactgaga ggggagggga cagagaggtg 1081 acctgggcag agcttcaccc atgaccctgg aaagtgctcc tgccctggga ggaggctcag 1141 catggaaaga ggaaggacag cagagcctaa gtcacagtag ccctgactac agcattcctg 1201 gagcccaggc tcttttccac agaggaggaa agagcaggca gcagagacca tggggccccc 1261 ctcagcctct ccccacagag aatgcatccc ctggcagggg cttctgctca caggtgagtg 1321 gaggattcct gggagtgggc aagaggaggg atcacagaga atggctgggg tctcctgggg 1381 aggatggggc tctgataggg gacagaaggc ttctgctgaa gcctcagggg agagaacatc 1441 agagagggac acgggtcaca acaagacaat cacattgaac tgggattgat aagagggagg 1501 aaaatccatt gatcatgttt tccaagttaa tcattactgg ccactacaat tagaaaatga 1561 taagaataag aattacatca gggtgatact ttaaataaaa atataaccag ggcactaaaa 1621 cctgtctttg ccccaaccac aagttgcaaa ataaccacca ctccttaact catccaccag 1681 tatttgcaat caaattttag gcactggcgt acaacaaata tcagacaagt ctctgtgttc 1741 aaagagctta cactcttgca gagatgaaga tagacaccca aagagatcta gaatgtgagt 1801 tcaggtgttg acaagagccc tggagggaac agagcagaaa aaggtcagaa agggacgccc 1861 cagggtctct agaggaggtg tcaggggagg gatctcccaa ggatgccctg atgtgagcag 1921 gatctgaggg cagtggggag ggagccatgc agacccctgg ggaaggggat tccacacagg 1981 aaaatgccaa ggttagaggt gctgaagaaa gaaaggtcac gttactgacc ttaaccaagt 2041 gggacacacc tacactctca aggctgaagg gagaagagac tctctcagga cccagggccc 2101 catctttcca tcccaataca tgggtaccaa tattgactga tgctttctcc ctcctagcct 2161 cacttctaaa cttctggaac ccgcccacca ctgccaagct cactattgaa tccacgccgt 2221 tcaatgtcgc agaggggaag gaggtgcttc tacttgtcca caatctgccc caacatcttt 2281 ttggctacag ctggtacaaa ggggaaagag tggatggcaa cagtctaatt gtaggatatg 2341 taataggaac tcaacaagct accccagggg ccgcatacag cggtcgagag acaatataca 2401 ccaatgcatc cctgctgatc cacaatgtca cccagaatga cataggattc tacaccctac 2461 aagtcataaa gtcagatctt gtgaatgaag aagcaactgg acagttccat gtataccgtg 2521 agtatttcca catgacctct ggaggttggg ggtcagttct acttcccaca tatgggattg 2581 tacggcctgg gctgtgcctc tggccctctc tgcattacat tctgtatcag ggtttggaca 2641 tttagtgcag gacacacacg ggggagacaa acttccacag actagaattc // LOCUS HUMCEAC 781 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human carcinoembryonic antigen (CEA) hsCGM2 gene, exon 2. ACCESSION M22434 D51537 KEYWORDS carcinoembryonic antigen. SOURCE Human fetal liver (lambda-hsCGM2-1 library) DNA, clone hsCGM2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 781) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept / 221 / 583 carcinoembryonic antigen hsCGM2, exon 2 (AA at 223) /nomgen="CEA" /map="19q13.1-q13.3" /hgml_locus_uid="LN0164S" IVS < 1 220 carcinoembryonic antigen, intron A IVS 584 > 781 carcinoembryonic antigen, intron B BASE COUNT 220 a 195 c 184 g 182 t ORIGIN Chromosome 19q31.1-q31.3. 1 ctgcagatcc ctggggaaga ggattccgaa cagggaaatg taaggtcaga ggtgctgata 61 ggggacatgc tgctgtcatt gatccagtag gacacacaca cacacacaca cttacttcaa 121 gatgggggtg ggtgaagaga cctgctcagg atccagggcc ccatctttcc accccaatac 181 ataggtccca atattgactg atgttctctc cccctcctag cctcgctttt aaccttctgg 241 aacctgccaa acagtgccca gaccaatatt gatgtcgtgc cgttcaatgt cgcagaaggg 301 aaggaggtcc ttctagtagt ccataatgag tcccagaatc tttatggcta caactggtac 361 aaaggggaaa gggtgcatgc caactatcga attataggat atgtaaaaaa tataagtcaa 421 gaaaatgccc cagggcccgc acacaacggt cgagagacaa tataccccaa tggaaccctg 481 ctgatccaga acgtcaccca caatgacgca ggattctata ccctacacgt tataaaagaa 541 aatcttgtga atgaagaagt aaccagacaa ttctacgtat tctgtgagtg atacctccat 601 gacttctggg tgctgggggc cagttctact tcatacacac ggggttgtca ggcctgggtt 661 gtgcctgtgt ccccatctac attttatcca gtgttggagt ttgggcattt agtgaaggac 721 acacatgggg gagacaaact tctacagacc agaatccctt tcctgcatcc agaccctgca 781 g // LOCUS HUMPSBGA 1042 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human pregnancy-specific B-1-glycoprotein gene, exon 2. ACCESSION M22437 D51537 KEYWORDS glycoprotein; pregnancy-specific B-1 glycoprotein. SOURCE Human fetal liver (lambda-PSBG-1 library) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1042) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept / 497 / 862 pregnancy-specific B-1 glycoprotein, exon 2 (AA at 499) /nomgen="B1G1" /map="6" /hgml_locus_uid="LG0073P" IVS < 1 496 pregnancy-specific B-1 glycoprotein, intron A IVS 863 > 1042 pregnancy-specific B-1 glycoprotein, intron B BASE COUNT 289 a 241 c 274 g 238 t ORIGIN Chromosome 6. 1 gagctcacgc tgtcatgggg gggaagacag acatgcaaag agatctagaa tgtgaggtca 61 ggtgttgaca agaaccctgg agggagcaga gcagggaaag gtcagaaagg gaagacccag 121 ggtctctgaa gcaggcatca ggaaagaagt ctaacgatgc cctgatgtaa gcaggacctg 181 agggcattgt ggagggggcc atgcggaccc ctggggaaga ggattccaaa cagaaaaatg 241 ccaacgtcag aagtgttgaa ggaatgggcg tcatgctgct gaccttgacc tagtaggaca 301 gtaggacaca cacacataca cacacacaat cacacatgcc gcttttgtgt gtgtgtgtgt 361 gtgtgtgttt gtatgtgtgt gtgtgtctgt gttgtcaagg ctgaggactg aagagacctt 421 ctcaggaccc agggccccat gttttcacac caatacatag gtctcaatat tgactgatgt 481 tctctccacc tcctagcatc acttttaaac ttctggaacc tgcccaccac tgcccaagtc 541 acgattgaag ccgagccaac caaagtttcc gaggggaagg atgttcttct acttgtccac 601 aatttgcccc agaatcttac cggctacatc tggtacaaag ggcaaatgag ggacctctac 661 cattacatta catcatatgt agtagacggt gaaataatta tatatgggcc tgcatatagt 721 ggacgagaaa cagcatattc caatgcatcc ctgctgatcc agaatgtcac ccgggaggac 781 gcaggatcct acaccttaca catcataaag ggagatgatg ggactagagg agtaactgga 841 cgtttcacct tcaccttaca ccgtaagtga ttccacatga tccctgggtg ttgggggaca 901 ggggtcactt ctacttcaca cacacaggat tgtcaggcct ggactctgcc tgtgtcactc 961 actgcgttat gtcccatgct gggatttggg catttagtgc aggacacaca cagaggagac 1021 aaatttcaac agatcagaat tc // LOCUS ASNMTATP9 225 bp ds-DNA ORG 04-AUG-1990 DEFINITION A.nidulans mitochondrion ATPase subunit 9 gene, complete cds. ACCESSION M35967 KEYWORDS ATPase. SOURCE A.nidulans mitochondrion DNA. ORGANISM Mitochondrion Aspergillus nidulans Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Plectomycetes; Eurotiales; Trichocomaceae; Aspergillus nidulans. REFERENCE 1 (bases 1 to 225) AUTHORS Brown,T.A., Ray,J.A., Waring,R.B., Scazzocchio,C. and Davies,R.W. TITLE A mitochondrial reading frame which may code for a second form of ATPase subunit 9 in Aspergillus nidulans JOURNAL Curr. Genet. 8, 489-492 (1984) STANDARD simple staff_review FEATURES from to/span description pept 1 225 ATPase subunit 9 BASE COUNT 66 a 29 c 51 g 79 t ORIGIN 1 atgttacaat cagctagaat aataggaaca ggattagcta caacaggttt aatcggagca 61 ggtgtaggta tcggtgtagt atttggtgcc ttaatcttag gagttgcaag aaaccctgca 121 ttaagaggtc aactgttttc atatgcaatt ttaggatttg catttgcgga agcaactgga 181 ttatttgctt taatgatggc ttttttatta ttatatgtag cttaa // LOCUS DOGPHA2 562 bp ss-mRNA MAM 04-AUG-1990 DEFINITION Canine phospholipase A-2 mRNA, complete cds. ACCESSION M35301 KEYWORDS phospholipase A-2. SOURCE Canine pancreas, cDNA to mRNA. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 562) AUTHORS Kerfelec,B., LaForge,K.S., Puigserver,A. and Scheele,G. TITLE Primary structures of canine pancreatic lipase and phospholipase A-2 messenger RNAs JOURNAL Pancreas 1, 430-437 (1986) STANDARD simple staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.Scheele, 06-MAR-1990. FEATURES from to/span description pept 29 469 phospholipase A-2 precursor sigp 29 73 phospholipase A-2 signal peptide matp 95 466 phospholipase A-2 mRNA < 1 562 phospholipase A-2 mRNA BASE COUNT 143 a 167 c 128 g 124 t ORIGIN 1 ttctcttgct cagcaacttg accacaaaat gaaattcctc gtactggctg ctctgctcac 61 agtggccgct gccgaggggg gcatcagccc gcgggcagtt tggcagttcc gcaacatgat 121 caagtgcacg atccccgaga gtgacccctt gaaggactac aacgactacg gctgctactg 181 tggcctgggt ggatccggca cccctgtgga tgagttggac aagtgttgcc agacgcatga 241 ccactgctac tcagaagcca agaaactgga cagctgtaaa ttcctcctgg acaaccccta 301 caccaaaatc tactcatact catgctccgg ctctgagatc acctgcagca gcaaaaacaa 361 agactgccag gcctttatct gcaactgtga ccgcagtgct gccatctgct tttcaaaggc 421 tccgtataac aaggagcaca agaacctgga cactaagaag tactgctaga attgagtctc 481 ctctctggaa agcatcattc ctacatgcct cgtggccttc accctaccct gtgtcctcca 541 ataaagcact ttgttgaaag gc // LOCUS DOGPLIP 1493 bp ss-mRNA MAM 04-AUG-1990 DEFINITION Canine lipase mRNA, complete cds. ACCESSION M35302 KEYWORDS lipase. SOURCE Canine pancreas, cDNA to mRNA. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 1493) AUTHORS Kerfelec,B., LaForge,K.S., Puigserver,A. and Scheele,G. TITLE Primary structures of canine pancreatic lipase and phospholipase A-2 messenger RNAs JOURNAL Pancreas 1, 430-437 (1986) STANDARD simple staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by G.Scheele, 06-MAR-1990. FEATURES from to/span description pept 35 1438 lipase precursor sigp 35 85 lipase signal peptide matp 86 1435 lipase mRNA < 1 1493 lipase mRNA BASE COUNT 410 a 377 c 378 g 328 t ORIGIN 1 cgaaagggag ggaaggccca gaacatccca caggatggtg agcatctgga caatcgcgct 61 gtttctgctg ggagcagcca aagcaaagga agtttgctat gaacaaatcg gatgcttttc 121 tgacgcggag ccctgggccg ggacagcaat caggcctctg aaagttctcc cctggagccc 181 cgagagaatc ggcacccgct tcctgctcta caccaacaag aacccaaaca actttcaaac 241 tctccttccc tctgatccat caacgattga ggcatcaaat tttcaaacag acaagaagac 301 ccggttcacc atccatggct tcataaacaa gggagaagag aactggctgc tggatatgtg 361 caagaacatg ttcaaggttg aggaggtgaa ctgcatctgc gtggactgga agaaaggttc 421 ccaaacctca tatacacagg ctgccaacaa cgtgcgggtg gtgggcgccc aggtggccca 481 gatgctcagc atgctctcgg caaactacag ctactcacct tcccaagtcc agctcatcgg 541 ccacagccta ggagcccacg tggctgggga ggcagggagc aggactccag gtctgggcag 601 gattacaggg ttggatcctg tagaagcaag tttccagggc actcctgaag aggttcgact 661 tgatcccact gatgctgact ttgttgatgt gattcacaca gatgcagctc ccctgatccc 721 attcctgggt tttggaacaa gccaacagat gggtcacctt gacttcttcc ccaatggtgg 781 agaggaaatg ccaggatgca agaagaatgc cctgtcacag atcgtgaacc tagatggcat 841 ttgggaagga actcgggact tcgtggcttg caatcacctg agaagttaca agtattactc 901 agagagcatc ctcaaccctg atggatttgc ttcctacccc tgtgcttcct acagggcctt 961 tgaatctaac aagtgcttcc cctgcccaga tcaagggtgc ccacagatgg gtcactatgc 1021 tgataaattt gctgtcaaga caagtgatga gacacagaaa tacttcctga acaccggaga 1081 ttccagcaat tttgctcgct ggagatacgg ggtttctata acattgtctg ggaaaagagc 1141 cactggtcag gctaaagttg ctttgtttgg aagtaaggga aatactcatc aattcaatat 1201 cttcaagggg attctcaaac caggctctac tcattccaat gagtttgatg caaagcttga 1261 tgttggaaca attgagaaag tcaagtttct ttggaataac aacgtggtaa acccaacctt 1321 tcccaaagtg ggtgcagcca agatcaccgt gcaaaaggga gaggagaaaa cagtgcacag 1381 cttctgcagc gaaagcaccg tgagggagga tgttctgctc accctcacgc cctgttagcg 1441 tccagcgcaa cccgggcacc gtgttaacag caataaaaac cactgatgca ttt // LOCUS HUMPGDZ1 1106 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human MHC class II lymphocyte antigen (HLA-DNA) gene, complete cds. ACCESSION M31525 M35125 KEYWORDS cell surface glycoprotein; class II gene; lymphocyte antigen; major histocompatibility complex. SOURCE Human B-cell cell line PGF, cDNA to mRNA, clone PGDZ1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1106) AUTHORS Young,J.A.T. and Trowsdale,J. TITLE The HLA-DNA (DZA) gene is correctly expressed as a 1.1kb mature mRNA transcript JOURNAL Immunogenetics 31, 386-388 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.T.Young, 22-JAN-1990. FEATURES from to/span description pept 77 829 MHC HLA-DNA precursor /hgml_locus_uid="LV0117X" /nomgen="HLA-DNA" /map="6p21.3" sigp 77 151 MHC HLA-DNA signal peptide matp 152 826 MHC HLA-DNA mRNA 1 1106 MHC HLA-DNA mRNA BASE COUNT 204 a 354 c 289 g 259 t ORIGIN 1 cttcttcttt acctccgcct tgttcctgtc ctcaccacac ggactgagac tgatttgatt 61 aaagcaccag agtgtaatgg ccctcagagc agggctggtc ctggggttcc acaccctgat 121 gaccctcctg agcccgcagg aggcaggggc caccaaggct gaccacatgg gctcctacgg 181 acccgccttc taccagtctt acggcgcctc gggccagttc acccatgaat ttgatgagga 241 acagctgttc tctgtggacc tgaagaaaag cgaggccgtg tggcgtctgc ctgagtttgg 301 tgactttgcc cgctttgacc cgcagggcgg gctggccggc atcgccgcaa tcaaagccca 361 tctggacatc ctggtggagc gctccaaccg cagcagagcc atcaacgtgc ctccacgggt 421 gaccgtgctc cccaagtctc gggtggagct gggccagccc aacatcctca tctgcatcgt 481 ggacaacatc ttcccccctg tgatcaatat cacctggctg cgcaacggcc aaactgtcac 541 tgagggagtg gcccagacca gcttctattc ccagcctgac catttgttcc gcaagttcca 601 ctacctgccc ttcgtgccct cagccgagga cgtctatgac tgccaggtgg agcactgggg 661 cctggatgcg ccactcctca ggcattggga gctccaggtg cctattccac caccagatgc 721 catggagacc ctggtctgtg ccctgggcct ggccatcggc ctggtgggct tcctcgtggg 781 caccgtcctc atcatcatgg gcacatatgt gtccagtgtc cccaggtaat gatccttctg 841 agagaaatga cttgtgggag acaccctgca gatcctcatg ggtttgtgac agcccctgcg 901 tgctcagtgc cctttaagtg catcccgctg tgctgacttt gagtgggatc aacatctgtc 961 ctacgggtcc cctctttttt ggccccagta ttcatggcag ggtttgttgg acacctacta 1021 gcttcccttc ccattcaaca cacacacaca ttcttgctct acccaaagct ctggctggca 1081 gcactaaatg ctttggtggt gtttgc // LOCUS LACPEPXP 3963 bp ds-DNA BCT 04-AUG-1990 DEFINITION L.lactis X-prolyl dipeptidyl aminopeptidase (pepXP) gene, complete cds. ACCESSION M35865 KEYWORDS X-prolyl dipeptidyl aminopeptidase. SOURCE L.lactis lactis (strain NCDO 763, ML3) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 3963) AUTHORS Nardi,M., Chopin,M.-C., Chopin,A., Cals,M.-M. and Gripon,J.-C. TITLE Cloning and DNA sequence analysis of an X-prolyl dipeptidyl aminopeptidase determinant from Lactococcus lactis subsp. lactis NCDO 763 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Chopin, 02-JUL-1990. Author address: A.Chopin Institut National de la Recherche Agronomique Lab de G/032n/032tique Microbplenne Institut de Biotechnologie CRJ 78352 Jouy-en-Josas Cedex FRANCE FEATURES from to/span description pept 1105 3396 X-prolyl dipeptidyl aminopeptidase (pepXP) (EC 3.4.14.5) pept 933 64 (c) ORF1 (put.) pept 3398 3883 ORF3 (put.) BASE COUNT 1261 a 748 c 733 g 1221 t ORIGIN 1 actagtcatt gcacttatgg tcaaataaaa aaattactga caactctgtc agtaattttt 61 ttattaaaga taaatcattt taaataatgc aacggctgca agtgatgcaa gaattggtgc 121 caaaactgga acccatgcat accaccattt agatgaacct ttagcttcac ccaaaactga 181 ttttggtaag agactgtgaa ccaaacgtgg accaaagtca cgcgccggat tcaaaccagg 241 acctgtaggt ccaccaagtg caactacgag tcccataaca aggaatccta ggaacaagtg 301 agcaatcatt ttagaagctg aagctcctga tgcttgaacc caaatttgat tcataacgtc 361 tgatgaagaa acatctgctc cttgaccttt caagtagttt gtcatccaag taattgattg 421 actaccaaag aagatatttg tcgcagcaac cgcaccgaag aataatacaa atgaacctaa 481 gaactcattc aagaaaccat taatggttgc ccccaaacga gttttttcac tgttatcatc 541 aacgttgtcg atagttgaga aagttccaag aatagcattt ggattttgtg ttttaagata 601 gtatggacga taaaccataa cgattaataa ttgtccaaac atcgcaccta atacttgcgc 661 aataatgtat tgagcaacgt gtgcccaagg aaaaagtcct gaggctgcaa gtccaagtgt 721 gaaagctggg ttgatttgtg aagtaatatt accaaaagca actgctggca acattacgcc 781 aagtccgtaa ccccaaccaa taatcatcca agattgtgcg tgagcttttg tacctttaag 841 ttcaacgtta gcaaccgcac cattacccat aataatgagt agggcagttc caacaaattc 901 agtgatatat ttcactgtcc atgtaacatc cattttttat ggaaccctcc aaaatttgct 961 tttaatataa taacctcagc tattatacca gaaaagcacg tttgcttcaa atattcattg 1021 aaaatagcgc gcctttagtc tgattaatca ctgaaaattt gctataatta aaagtattat 1081 catgtttatt acggaggatt taaaatgcgc tttaaccatt tttcaattgt tgacaaaaat 1141 tttgatgaac aattagcaga acttgatcag cttggttttc gttggtctgt tttttgggat 1201 gaaaagaaaa tcttaaaaga ttttctcatt caaagtccta gtgatatgac tgctctacaa 1261 gcaactgctg aattagatgt cattgaattt ttaaaatcct ctatagaatt agattgggaa 1321 attttctgga atattgcttt gcaacttctt gattttgtac caaactttga ttttgagatt 1381 ggcaaagctt ttgagtatgc aaaaaatagc aacctccctc aaatcgaagc agaaatgacc 1441 acagaaaata tcatttctgc tttctactac ctcctttgta cgcgtcgaaa aactggaatg 1501 attttagttg aacattgggt atcagaggga cttcttcctt tagataatca ttatcatttt 1561 ttcaatgaca aatcgttggc aacctttgat tcttcacttc ttgaacgtga agtcctttgg 1621 gttgaaagtc cagtcgatag tgaacaacgg ggtgaaaatg atttgataaa aattcaaatc 1681 attcgtccaa aaagtaccga aaaacttcca gtggtaatga cggcgagtcc ttatcattta 1741 gggattaatg ataaagcaaa tgatttagcc ttacacgata tgaatgtaga acttgaagaa 1801 aaaaccagtc acgaaattca tgttgaacaa aagctcccgc aaaaattatc agcgaaagca 1861 aaggaacttc cgattgttga caaagctcct tatcgtttta ctcatggttg gacttattct 1921 ttaaatgact atttcttgac tagaggattt gcctctatct atgtggccgg tgttggaacc 1981 cgttcttctg acggctttca aacctcaggc gattaccaac aaatttatag tatgacagct 2041 gtcattgact ggttgaatgg tcgagctcgt gcttatactt ctcgtaaaaa gacacatgaa 2101 atcaaagcct cttgggcaaa tggtaaagtg gctatgactg gaaaatctta tctgggaacc 2161 atggcttatg gagccgctac tactggagtt gaaggtctag agcttatctt agccgaagct 2221 ggaatttctt cttggtataa ttattaccgt gaaaatggcc ttgtccgctc tcctggcggt 2281 tttcctggtg aagatctaga tgttcttgct gcgctcactt attcacgtaa tcttgatggt 2341 gctgatttct taaaaggtaa tgctgaatac gaaaaacgat tggcagaaat gactgctgct 2401 ttagaccgca aatctggtga ttacaatcaa ttttggcatg accgaaatta tctgataaat 2461 actgacaaag ttaaggctga tgtcttaatt gttcatggac tacaagattg gaatgtgaca 2521 ccagagcaag cctataattt ctggaaagct ctgccagaag gtcatgctaa acatgccttt 2581 ttacaccgtg gggctcatat ttatatgaat tcttggcagt ccattgattt ttctgaaacg 2641 attaatgctt actttgtcgc aaaattactg gatagagatt taaatttaaa tctcccacct 2701 gttattttac aggaaaattc taaagaccaa gtatggacga tgatgaatga ttttggtgcc 2761 aatacccaaa tcaaactccc tcttggtaaa acggccgttt cttttgctca attcgacaat 2821 aattatgacg atgagacttt caaaaagtat tctaaagatt tcaatgtctt taaaaaagac 2881 ttatttgaaa acaaagccaa tgaagctgtc attgatttag aacttccgtc aatgctgaca 2941 attaatggtc cagttgagct tgaactcaga ctaaaattaa atgacacgaa aggcttctta 3001 tctgctcaaa ttcttgattt tggtcagaaa aaacgtttgg aagataaagt cagagtgaaa 3061 gattttaaag tgcttgaccg tgggcgaaat ttcatgttag acgacttagt tgaactccca 3121 cttgttgaga gcccttatca gttagtcact aaaggcttta ctaatctcca aaatcaaagt 3181 ttactgacag tcagtgattt aaaggctgac gaatggttta caataaaatt tgaactacag 3241 ccaaccattt atcatttaga aaaagctgac aagcttcggg tcattctcta tagtactgac 3301 tttgaacata cggttcgtga taatcgtaaa gtgacttacg agattgattt atctcaatct 3361 aaactcatta ttcctattga aagtgtgaaa aattaatatg gcaattattg gaattcttgg 3421 aactccttac aacacagttg aacgttctcc tttttggtgg aacaaagtaa gctatacacg 3481 tcaatctttt attgatgttt tccaggaatt aggtcacact gtgattgttc ttcctgttga 3541 caaaaccgaa aatataaaaa attatctgcc attggtcgat aaaattgtac tgactggtgg 3601 cgttgatgtc agtccttatc tctatggcga ggaacctcat gctcaattag gaacgactga 3661 cccaattcgt gaccgttttg agcttgccgc aattaaagct gctctagaag cgaataaacc 3721 tattcttggt gtatgtcgtg gcctacaatt gctcaatgtt tattttggag gaactcttta 3781 tcaagacttg agcctaactt caagtcagat aaagcacctc caaagtccaa ctccccaaga 3841 agttccgaca catcacatta gcgtcgaaaa agaagttctt tagatttttt acctgaaaat 3901 tacatggtca attctttcca ccatcaagtc attaaggatt tgggtcaggg cttgcaggcc 3961 att // LOCUS PIGA1AG 668 bp ss-mRNA MAM 04-AUG-1990 DEFINITION Swine alpha-1 acid glycoprotein mRNA, 3' end. ACCESSION M35990 KEYWORDS alpha-1 acid glycoprotein. SOURCE Swine, cDNA to mRNA. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 668) AUTHORS Stone,R.T. and Maurer,R.A. TITLE Cloning and developmental regulation of alpha-1 acid glycoprotein in swine JOURNAL Dev. Genet. 8, 295-304 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 552 alpha-1 acid glycoprotein (AA at 1) mRNA < 1 668 alpha-1 acid glycoprotein mRNA BASE COUNT 181 a 186 c 170 g 131 t ORIGIN 1 ccgctgtgcg ccaacttgac ggcagtgccc atcaccaatg ccaccttgga cttgatctcc 61 ggcaagtggt attatatcgg ctcggccttc cgaaaccccc agtacaatga gtcggccaga 121 tccatccagg cagccttctt tttctttgat cccaagcccg cggaggacaa gataaacctc 181 agagagtacc agaccatagg gaaccagtgc atctataacg acagctccct gaaggtccat 241 cgggagaatg gaagcctatc gaaacacgag atgggcagag aacacgttgc tgacctgctg 301 ttaaccaagg tccccaagac cttcatgctc atcaattcct tgcacgataa gaacaacgtg 361 gggctctcct tctacgccga caaggcggag gtgactccgg agcagatgaa agagttccat 421 gatgctattg agtgcacggg aatccacaag tcagaaatca catacaccga cgagaaaaag 481 gatctgtgtg ggccgctgga gaagcagcat gaggaggaaa ggaagaagga gaaggagaag 541 gaggggtcct agcattgggt caggtccttg ggcaccttgg aggcccatcc tcagcacctc 601 ccacccaccc tttgcacctc aattctctcc ctcagttgca tcaataaaac ttcgcatttg 661 gtacaggc // LOCUS RATMTNDI 946 bp ds-DNA ORG 04-AUG-1990 DEFINITION Rat NADH-dehydrogenase (NDI) gene, complete cds. ACCESSION M35826 KEYWORDS NADH-dehydrogenase. SOURCE Rat mitochondrion DNA. ORGANISM Mitochondrion Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae; Rattus norvegicus. REFERENCE 1 (bases 1 to 946) AUTHORS Quagliariello,C. and Spena,A. TITLE Sequenza nucleotidica del gene NDI del DNA mitocondriale di fegato di ratto JOURNAL Boll. Soc. Ital. Biol. Sper. 64, 211-218 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1 946 NADH-dehydrogenase (NDI) (att start codon) BASE COUNT 294 a 299 c 103 g 250 t ORIGIN 1 attaatatcc taacactcct aatcccaatc ttaattgcca taggccttct caccctagta 61 gaacggaaaa tcctaggcta catacaatta cgcaaaggcc ccaacaacga aggcccatat 121 ggtaaactac aaccatttgc agatgccata aaactattca taaaagaacc catacgccct 181 ctaaccacct caatatcact atttattatc gccccaaccc tctcccttac actagctcta 241 agcctatgaa ttcccttacc aataccccac ccccttatca acctcaacct aggcatacca 301 tttattctag ccacatccag cctttcggtc tactccattc tatgatcggg atgagcctca 361 aattcaaaat actccctatt cggagcccta cgagccgttg cccaaaccat ctcttacgaa 421 gtcacaatag ccttatacct cttatccgtc ctcctaataa gcggctcctt ctccctacaa 481 atacttatca ctacacaaga acatatctga ctattaatcc ccgcctgacc aatagccata 541 atatgataca tttcaaccct cgcagaaaca aatcgagctc ccttcgactt aacagaagga 601 gaatcagaat tagtctcagg ctttaacgtc gaatacgccg caggaccatt cgccctattc 661 ttcatagccg agtacaccaa cattattcta ataaacgccc taacatcaat tgtattccta 721 ggccccttat atcatatcaa ttaccctgaa ttatactcaa ccagcttcat aacagaaaca 781 ctacttctat ccacaacttt cctatgaatt cgagcatcct acccccgttt tcgatatgac 841 caactaatgc acctcctatg aaaaaatttc ctcccactaa cactagcatt ctgcatatga 901 tacatttccc tgccaatttt cctagcagga attccaccct acacat // LOCUS RICPETB 1682 bp ds-DNA PLN 04-AUG-1990 DEFINITION Rice chloroplast apocytochrome b6 (petB) gene, complete cds. ACCESSION M35995 KEYWORDS apocytochrome b6. SOURCE Rice (L. cv. Labelle) chloroplast DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1682) AUTHORS Cote,J.-C., Wu,N.-H. and Wu,R. TITLE Nucleotide sequence of the rice chloroplast apocytochrome b6 gene (petB) JOURNAL Plant Mol. Biol. 11, 873-874 (1988) STANDARD simple staff_review COMMENT Authors indicate two potential petB products for the given sequence. The first product contains an intron, and has its initiation codon at position 67. The second product has no intron, and is read straight through from position 829. FEATURES from to/span description pept 67 72 apocytochrome b6, exon 1 (alt.) 886 1527 apocytochrome b6, exon 2 (alt.) pept 829 1527 apocytochrome b6 (alt.) IVS 73 885 apocytochrome b6 intron A (put.) (no splice consensus) BASE COUNT 499 a 290 c 318 g 575 t ORIGIN 1 gctctacatt tctagacatt ctggtagttc gaccgtggaa ttttttggtt tcggtatctc 61 tggaatatga gtgtgtgact tgttagaatt tgctcctatt gataatacat agaaagcacc 121 tgttatctct atcaagatga ttctaattcg tcggatatta tttattctag tatctggaac 181 acgaaataga tagagtggat caagaaaaaa aaatgaaact atgattcata ttaactattc 241 agacctcgca accagactga aaaaaattca agtagttctt aataaaaata aaaaaagaaa 301 atttcttcct tccaattttg tttgcccaaa aaacaacttt ttttctctcg attttgtcga 361 gtcattacac cgattcaata aatgatcatc aagcggttct tattcgaaga acccttgcct 421 tttgtttagc ttgagactca atcatcgtgg ctctagtatg aatctaaggt tttaattgaa 481 ctgattcata ggatcgcaac aagataattt ctaccagaaa actactccaa tttttgcttt 541 atttatttat ctagtaaaac aagagtaaat ctgcattacg cacaaaaaaa agaaatccaa 601 aatagggaag agaaaaatca agaggcctct aatgatcaac atttgggaaa gaaagataga 661 cgagccaact tgagattttt tggcattatc atcacaaaga agaaattctg gatttttctt 721 atttcatatc ttcaaggcaa atcgacccaa cccagtggct gatgaagttt tgaacctttt 781 ttctaatatc cgttgaaaat ttgtgtgttt ctgtttgagc cgtacgagat gaaattctca 841 tatacggttc tcggaggggg gttcgggtta gttacctatc tcaataaagt atatgattgg 901 tttgaggaac gtcttgagat tcaggcaatt gcagatgata taactagtaa atatgttcct 961 cctcatgtca acatatttta ttgtttaggg ggaattacac ttacttgttt tctagtacaa 1021 gttgctaccg gttttgctat gactttttac tatcgcccaa ccgttacaga ggctttttcc 1081 tcggttcaat acataatgac cgaggccaac tttggttggt taatccgatc agttcatcga 1141 tggtcagcaa gtatgatggt tctaatgatg atcctgcacg tatttcgtgt gtatctcaca 1201 ggtgggttta aaaaaccccg cgaattaact tgggtcactg gtgtggtttt agctgtatta 1261 actgcatcgt ttggtgtaac tggttattct ttaccttggg atcaaattgg ttattgggca 1321 gtcaaaattg tgacaggtgt gcctgacgcg attccggtaa taggatcacc tttagtggag 1381 ttattacgtg gaagtgctag tgtgggtcaa tccactttga ctcgttttta tagtttacat 1441 acctttgtac ttcctctgct tactgccgta tttatgttaa tgcactttct aatgatacgt 1501 aagcaaggta tttcgggccc tttataagga aggcatctca tagagagttc taattctcat 1561 atatcatatc gggtaggttg tggtatttca ttgctacaaa catgggttat tctaaaataa 1621 gacatgtcat ttagatactt ctcttcaact ccgaactatt gtgatacaaa taatacaaat 1681 ag // LOCUS VAZTHYK 1524 bp ds-DNA VRL 04-AUG-1990 DEFINITION Varicella-zoster virus thymidine kinase gene, complete cds. ACCESSION M36160 KEYWORDS thymidine kinase. SOURCE Varicella-zoster virus (strain Oka vaccine) DNA. ORGANISM Varicella-zoster virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 1524) AUTHORS Mori,H., Shiraki,K., Kato,T., Hayakawa,Y., Yamanishi,K. and Takahashi,M. TITLE Molecular analysis of the thymidine kinase gene of thymidine kinase-deficient mutants of Varicella-zoster virus JOURNAL Intervirology 29, 301-310 (1988) STANDARD simple staff_review FEATURES from to/span description pept 420 1445 thymidine kinase BASE COUNT 434 a 354 c 327 g 408 t 1 others ORIGIN 1 ctggcgcata ccctcgcaaa actggtgata cttagtaggg gtatgtatat tagcgctaaa 61 acggcaagat tttaattcca ctataaaaca aacggtcttt ccggcaccac tggattccgt 121 ttgtataata caaacacaat cggggcgtcg gcgtcccaaa tttacttcaa acgacattga 181 tatgcgtaca gccctttgaa catccacgtg ggataacggc gacaggagtt ttgccagcct 241 cgggttgaac gcgtccgcga aacctcgacg tacgttatca atatcctttt tgagtacatc 301 gtaaaaacga gtgtggcaac gttgtcccaa acgaaaacac ttggcccgaa ttcgactagc 361 ggacatattt gaagttccgt cccagaagat aacctaagac gcgtttgtct acaataaaca 421 tgtcaacgga taaaaccgat gtaaaaatgg gcgttttgcg tatttatttg gacggggcgt 481 atggaattgg aaaaacaacc gccgccgaag aatttttaca ccactttgca ataacaccaa 541 accggatctt actcattggg gagcccctgt cgtattggcg taaccttgca ggggaggacg 601 ctatttgcgg aatttacgga acacaaactc gccgtcttaa tggagacgtt tcgcctgaag 661 acgcacaacg cctcacggct cattttcaga gcctgttctg ttctccgcat gcaattatgc 721 atgcgaaaat ctcggcattg atggacacaa gtacatcgga tctcgtacaa gtaaataagg 781 agccgtataa aattatgtta tccgaccgac acccaatcgc ctcaactata tgttttccct 841 tgtccagata cttagtggga gatatgtccc cagcggcgct tcctgggtta ttgtttacgc 901 ttcccgctga accccccggg accaacttgg tagtttgtac cgtttcactc cccagtcatt 961 tatccagagt aagcaaacgg gccagaccgg gagaaacggt taatctgccg tttgttatgg 1021 ttctgagaaa tgtatatata atgcttatta atacaattat atttcttaaa actaacaact 1081 ggcacgcggg ctggaacaca ctgtcatttt gtaatgatgt atttaaacag aaattacaaa 1141 aatccgagtg tataaaacta cgcgaagtac ctgggattga agacacgtta ttcgccgtnc 1201 ttaaacttcc ggagctttgc ggagagtttg gaaatattct gccgttatgg gcatggggaa 1261 tggagaccct ttcaaactgc ttacgaagca tgtctccgtt cgtattatcg ttagaacaga 1321 caccccagca tgcggcacaa gaactaaaaa ctctgctacc ccagatgacc ccggcaaaca 1381 tgtcctccgg tgcatggaat atattgaaag agcttgttaa tgccgttcag gacaacactt 1441 cctaaatata cctagtattt acgtatgtac cagtaaaaag atgatacaca ttgtcatact 1501 cgcgtgtacg tgtttttctt tttt // LOCUS HUMCAMA 2175 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human calmodulin mRNA, complete cds. ACCESSION J04046 KEYWORDS calmodulin. SOURCE Human teratoma cell line, cDNA to mRNA, clone lambda-ht6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2175) AUTHORS Fischer,R., Koller,M., Flura,M., Mathews,S., Strehler-Page,M.-A., Krebs,J., Penniston,J.T., Carafoli,E. and Strehler,E.E. TITLE Multiple divergent mRNAs code for a single human calmodulin JOURNAL J. Biol. Chem. 263, 17055-17062 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2175; revises [1]) AUTHORS Fischer,R., Koller,M., Flura,M., Mathews,S., Strehler-Page,M.-A., Krebs,J., Penniston,J.T., Carafoli,E. and Strehler,E.E. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.Fischer, 28-JUL-1988, and [2] 18-JUL-1990. [2] Author address: M.Koller Biochemistry Dept. Swiss Federal Inst. of Technology Universitaetsstr. 16 Zurich, CH biochemie@czheth5a FEATURES from to/span description pept 104 553 calmodulin mRNA 6 2175 CALM mRNA BASE COUNT 430 a 626 c 592 g 527 t ORIGIN 1 tgagtgtgga ggcgcggacg cgcggcggag ctggaactgc tgcagctgct gccgccgccg 61 gaggaacctt gatccccgtg ctccggacac cccgggcctc gccatggctg accagctgac 121 tgaggagcag attgcagagt tcaaggaggc cttctccctc tttgacaagg atggagatgg 181 cactatcacc accaaggagt tggggacagt gatgagatcc ctgggacaga accccactga 241 agcagagctg caggatatga tcaatgaggt ggatgcagat gggaacggga ccattgactt 301 cccggagttc ctgaccatga tggccagaaa gatgaaggac acagacagtg aggaggagat 361 ccgagaggcg ttccgtgtct ttgacaagga tgggaatggc tacatcagcg ccgcagagct 421 gcgtcacgta atgacgaacc tgggggagaa gctgaccgat gaggaggtgg atgagatgat 481 cagggaggct gacatcgatg gagatggcca ggtcaattat gaagagtttg tacagatgat 541 gactgcaaag tgaaggcccc ccgggcagct ggcgatgccc gttctcttga tctctctctt 601 ctcgcgcgcg cactctctct tcaacactcc cctgcgtacc ccggttctag caaacaccaa 661 ttgattgact gagaatctga taaagcaaca aaagatttgt cccaagctgc atgattgctc 721 tttctccttc ttccctgagt ctctctccat gcccctcatc tcttcctttt gccctcgcct 781 cttccatcca cgtcttccaa ggcctgatgc attcataagt tgaagccctc cccagatccc 841 cttggagcct ctgccctcct ccagcccgga tggctctcct tcattttggt ttgtttcctc 901 ttgtttgtca tcttattttg ggtgctgggg tggctgccag cctgtcccgg gacctgctgg 961 gagggacaag aggccctccc caggcagaag agcatgccct ttgccgttgc atgcaaccag 1021 ccctgtgatt ccacgtgcag atcccagcag cctgttgggg caggggtgcc aagagaggca 1081 ttccagaagg actgaggggg cgttgaggaa ttgtggcgtt gactggatgt ggcccaggac 1141 tgggtcgagg gggccaactc acagaagggg actgacagtg ggcaacactc acatcccact 1201 ggctgctgtt ctgaaaccat ctgattggct ttctgaggtt tggctgggtg gggactgctc 1261 atttggccac tctgcagatt ggacttgccc gcgttcctga agcgctctcg agctgttctg 1321 taaatacctg gtgctaacat cccatgccgc tccctcctca cgatgcaccc accgccctga 1381 gggcccgtcc taggaatgga tgtggggatg gtcgctttgt aatgtgctgg ttctcttttt 1441 ttttctttcc cctctatggc ccttaagact ttcattttgt tcagaaccat gctgggctag 1501 ctaaagggtg gggagaggga agatgggccc caccagctct caagagaaac gcacctgcaa 1561 taaaacagtc ttgtcggcca gctgcccagg gacggcagct acagcagcct ctgcgtcctg 1621 gtccgccagc acctcccgct tctccgtggt gacttggcgc cgcttcctca catctgtgct 1681 ccgtgccctc ttccctgcct cttccctcgc ccacctgcct gcccccatac tccccagcgg 1741 agagcatgat ccgtgccctt gcttctgact ttcgcctctg ggacaagtaa gtcaatgtgg 1801 gcagttcagt cgtctgggtt ttttcccctt ttctgttcat ttcatctggc tccccccacc 1861 acctccccac cccacccccc accccctgct tcccctcact gcccaggtcg atcaagtggc 1921 ttttcctggg acctgcccag ctttgagaat ctcttctcat ccaccctctg gcacccagcc 1981 tctgagggaa ggagggatgg ggcatagtgg gagacccagc caagagctga gggtaagggc 2041 aggtaggcgt gaggctgtgg acattttcgg aatgttttgg ttttgttttt tttaaaccgg 2101 gcaatattgt gttcagttca agctgtgaag aaaaatatat atcaatgttt tccaataaaa 2161 tacagtgact acctg // LOCUS BMEGDH1 2834 bp ds-DNA BCT 04-AUG-1990 DEFINITION B. megaterium glucose dehydrogenase gene and ORFs. ACCESSION D90043 KEYWORDS glucose dehydrogenase. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2834) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan. FEATURES from to/span description pept 1964 2749 glucose dehydrogenase (EC 1.1.1.47) ORF 181 867 ORF1 ORF 1086 1946 ORF2 signal 89 94 put. -35 region for ORF1 signal 116 121 put. -10 region for ORF1 signal 167 172 ORF2 ribosome binding site signal 893 928 termination signal signal 972 977 put. -35 region for ORF2 signal 997 1003 put. -10 region for ORF2 signal 1070 1076 ORF2 ribosome binding site signal 1949 1954 glucose dehydrogenase ribosome binding site signal 2751 2783 termination signal for glucose dehydrogenase BASE COUNT 896 a 442 c 644 g 852 t ORIGIN 1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga 61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag 121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa 181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg 241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt 301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat 361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca 421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat 481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag 541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta 601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat 661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct 721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt 781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt 841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg 901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt 961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt 1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt 1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt 1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt 1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg 1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta 1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca 1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg 1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag 1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa 1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat 1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca 1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg 1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc 1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga 1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag 1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga 1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg 2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa 2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt 2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt 2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt 2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga 2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt 2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa 2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat 2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg 2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc 2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc 2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca 2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga 2821 ctggtgttga attc // LOCUS BMEGDH2 1202 bp ds-DNA BCT 04-AUG-1990 DEFINITION B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene. ACCESSION D90044 KEYWORDS glucose dehydrogenase; isozyme. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1202) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura Department of Fermentation Technology Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan Phone: 06-877-5111 x4373 Fax: 06-876-9036 FEATURES from to/span description pept 125 910 glucose dehydrogenase signal 27 32 put. -35 region signal 49 55 put. -10 region signal 111 116 SD sequence signal 928 959 termination signal BASE COUNT 427 a 190 c 268 g 317 t ORIGIN 1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa 61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt 121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt 181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg 241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc 301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc 361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt 421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg 481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa 541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc 601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa 661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa 721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg 781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt 841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg 901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt 961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg 1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa 1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg 1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg 1201 ag // LOCUS BMOPTTHP1 1023 bp ss-mRNA INV 04-AUG-1990 DEFINITION B.mori PTTH mRNA. ACCESSION D90082 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to mRNA, clones P1, P2, C2, C9 and C19. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 1023) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept 34 708 preproPTTH matp 379 708 PTTH subunit signal 768 773 polyadenylation signal signal 826 831 polyadenylation signal signal 997 1002 polyadenylation signal signal 1004 1009 polyadenylation signal BASE COUNT 358 a 179 c 186 g 300 t ORIGIN 1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc 61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg 121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac 181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg 241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta 301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa 361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct 421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc 481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc 541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct 601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg 661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt 721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata 781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc 841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt 901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca 961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt 1021 aaa // LOCUS BMOPTTHP4 944 bp ss-mRNA INV 04-AUG-1990 DEFINITION B.mori preproPTTH mRNA. ACCESSION D90083 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to mRNA, clone P4 and C21. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 944) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept < 1 631 preproPTTH matp 302 631 PTTH subunit signal 691 696 polyadenylation signal signal 749 754 polyadenylation signal signal 918 923 polyadenylation signal signal 925 930 polyadenylation signal BASE COUNT 337 a 163 c 177 g 267 t ORIGIN 1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt 61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag 121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga 181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa 241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag 301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa 361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg 421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag 481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga 541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag 601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt 661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa 721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag 781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt 841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata 901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa // LOCUS HUMMTSDHB 958 bp ss-mRNA ORG 04-AUG-1990 DEFINITION Human mitochondrial succinate-ubiquinone oxidoreductase (EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA. ACCESSION D90047 KEYWORDS Ip; complex II; iron sulfur subunit; sdh B; succinate-ubiquinone oxidoreductase. SOURCE Human liver mitochondrion, cDNA to mRNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 958) AUTHORS Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M. TITLE Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning of iron sulfur(Ip) subunit of liver mitochondria JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kiyoshi Kita Department of Parasitology Juntendo University 2-1-1,Hongo Bunkyo-ku, Tokyo 113 Japan Phone: 03-813-3111 x3542 Fax: 03-814-9300 FEATURES from to/span description pept < 1 789 succinate-ubiquinone oxidoreductase (sdh B) (AA at 1) site 74 95 iron-sulfur binding site I site 167 179 iron-sulfur binding site II site 224 236 iron-sulfur binding site III BASE COUNT 319 a 212 c 204 g 223 t ORIGIN 1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt 61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg 121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag 181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt 241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac 301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat 361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag 421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag 481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc 541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg 601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc 661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat 721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct 781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt 841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata 901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HUMNCAW 2287 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human nonspecific cross-reacting antigen (NCA-W272) mRNA. ACCESSION D90064 KEYWORDS CEA; CEA gene family; PI-anchored membrane protein. SOURCE Human white blood cells, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2287) AUTHORS Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and Matsuoka,Y. TITLE Characterization of a cDNA clone encoding a new species of the nonspecific cross-reacting antigen (NCA), a member of the CEA gene family JOURNAL Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Fumiko Arakawa First Department of Biochemistry School of Medicine Fukuoka University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone: 092-801-1011 x2892 Fax: 092-801-3600 FEATURES from to/span description ORF 87 1136 nonspecific cross-reacting antigen ORF BASE COUNT 618 a 593 c 453 g 623 t ORIGIN 1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag 61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc 121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca 181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc 241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa 301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag 361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg 421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg 481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca 541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc 601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc 661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac 721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga 781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag 841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt 901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa 961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca 1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag 1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc 1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa 1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact 1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc 1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa 1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt 1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc 1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga 1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga 1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc 1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt 1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt 1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa 1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca 1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat 1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga 2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt 2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac 2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc 2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg 2281 gctgatc // LOCUS PIGDESTN 1666 bp ss-mRNA MAM 04-AUG-1990 DEFINITION Porcine destrin mRNA. ACCESSION D90053 J05290 KEYWORDS actin-binding protein; cofilin; destrin. SOURCE Pig adult brain, cDNA to mRNA, clone PD2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1666) AUTHORS Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S., Iida,K. and Yahara,I. TITLE Destrin, a mammalian actin-depolymerizing protein, is closely related to cofilin: Cloning and expression of porcine brain destrin cDNA JOURNAL J. Biol. Chem. 265, 5768-5773 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Moriyama Department of Biophysics and Biochemistry Faculty of Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan Phone: 03-821-2111 x4408 FEATURES from to/span description pept 54 551 destrin mRNA < 1 1666 destrin mRNA signal 1645 1660 polyadenylation signal BASE COUNT 492 a 322 c 372 g 480 t ORIGIN 1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt 61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt 121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag 181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa 241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg 301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc 361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg 421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc 481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat 541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc 601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg 661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct 721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga 781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt 841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt 901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt 961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac 1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca 1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca 1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt 1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta 1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag 1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc 1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg 1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga 1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt 1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta 1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taactt // LOCUS RATPMP70X 3324 bp ss-mRNA ROD 04-AUG-1990 DEFINITION Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA. ACCESSION D90038 J05256 KEYWORDS PMP70; peroxisomal membrane protein. SOURCE Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181, 189,201]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3324) AUTHORS Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T. TITLE The 70-kDa Peroxisomal Membrane Protein Is a Member of the Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Keiju Kamijo Department of Biochemistry Shinshu University School of Medicine Matsumoto 390 Japan Phone: 263-35-4600 x5182 Fax: 263-33-6458 FEATURES from to/span description pept 36 2015 peroxisomal membrane protein (PMP70) signal 3271 3276 polyadenylation signal (put.) BASE COUNT 921 a 658 c 773 g 972 t ORIGIN 1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga 61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc 121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg 181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac 241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta 301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac 361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact 421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg 481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag 541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc 601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta 661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg 721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa 781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta 841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa 901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct 961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg 1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca 1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc 1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg 1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta 1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg 1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa 1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg 1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat 1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc 1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa 1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg 1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg 1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac 1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca 1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc 1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa 1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt 2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta 2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca 2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat 2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg 2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga 2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct 2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt 2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat 2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt 2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt 2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg 2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac 2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg 2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg 2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca 2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc 3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg 3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa 3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt 3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat 3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag 3301 attaaaaaaa aaaaaaagga attc // LOCUS VACSANT 1525 bp ds-DNA VRL 04-AUG-1990 DEFINITION Vaccinia virus surface (S) antigen gene. ACCESSION D90076 KEYWORDS S gene; surface antigen. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 1525) AUTHORS Ueda,Y., Morikawa,S. and Matsuura,Y. TITLE Identification and nucleotide sequence of the gene encoding a surface antigen induced by vaccinia virus JOURNAL Virology (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Yoshiaki Ueda National Institute of Health Gakuen, Musashimurayama Tokyo 190-12 Japan Phone: 0425-61-0771 Fax: 0425-65-3315 FEATURES from to/span description pept 382 1437 surface antigen S BASE COUNT 568 a 229 c 258 g 470 t ORIGIN 1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata 61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat 121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa 181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat 241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac 301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata 361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca 421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc 481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt 541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt 601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa 661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta 721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca 781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc 841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga 901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat 961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta 1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat 1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa 1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact 1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga 1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc 1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt 1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata 1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt 1501 gctatagaat gaatgaggat acgcg // LOCUS YSCA1 881 bp ds-DNA PLN 04-AUG-1990 DEFINITION S. cerevisiae acidic ribosomal protein A1 (YSCA1). ACCESSION D90072 X13682 KEYWORDS acidic ribosomal protein; ribosomal protein. SOURCE S. cerevisiae (strain IFO-40028) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 315; 631 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989) STANDARD full staff_entry REFERENCE 3 (bases 277 to 742) AUTHORS Tsurugi,K. and Mitsui,K. TITLE cDNA and deduced amino acid sequence of acidic ribosomal protein A1 from Saccharomyces cerevisiae JOURNAL Nucleic Acids Res. 16, 3574-3574 (1988) STANDARD simple automatic COMMENT These data kindly submitted in computer readable form by: Kazuhiro Mitsui Department of Biochemistry Yamanashi Medical college Tamaho, Nakakoma-gun Yamanashi 409-38 Japan Phone: 0552-73-1111 x2257 FEATURES from to/span description pept 313 633 acidic ribosomal protein A1 signal 125 135 UASrpg box1 signal 182 192 UASrpg box2 signal 716 721 poly(A) signal variant 303 303 a in [1]; g in [3] variant 684 685 tt in [1]; t in [3] variant 719 719 a in [1]; t in [3] BASE COUNT 270 a 165 c 157 g 289 t ORIGIN 1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt 61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca 121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa 181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca 241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa 301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac 361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca 421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac 481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct 541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc 601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg 661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa 721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg 781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc 841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c // LOCUS YSCCDC23X 3107 bp ds-DNA PLN 04-AUG-1990 DEFINITION S. cerevisiae CDC23 protein gene. ACCESSION D90081 KEYWORDS CDC23 protein. SOURCE Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line D22) DNA, clone YX34. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3107) AUTHORS Doi,A. and Doi,K. TITLE Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces cerevisiae JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Doi The Institute of Scientific and Industrial Research Osaka University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111 Fax: 06-877-4977 FEATURES from to/span description ORF 765 2645 ORF for CDC23 site 153 161 calcium-binding site BASE COUNT 1010 a 595 c 629 g 873 t ORIGIN 1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact 61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt 121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc 181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt 241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg 301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct 361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt 421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga 481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag 541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat 601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact 661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc 721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc 781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac 841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag 901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag 961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt 1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt 1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc 1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta 1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc 1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga 1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag 1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa 1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact 1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat 1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta 1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg 1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc 1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg 1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt 1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg 1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact 1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt 2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca 2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca 2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt 2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg 2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca 2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat 2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg 2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa 2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta 2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata 2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat 2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc 2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact 2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc 2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc 2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga 3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga 3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag // LOCUS BIM127 8482 bp ss-RNA VRL 04-AUG-1990 DEFINITION Bovine immunodeficiency-like virus, complete proviral genome. ACCESSION M32690 KEYWORDS complete genome. SOURCE Bovine immunodeficiency-like virus (isolate HXB3), proviral DNA, clone 127. ORGANISM Bovine immunodeficiency-like virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 8482) AUTHORS Garvey,K.J., Oberste,M.S., Elser,J.E., Braun,M.J. and Gonda,M.A. TITLE Nucleotide sequence and genome organization of biologically active proviruses of the bovine immunodeficiency-like virus JOURNAL Virology (1990) In press STANDARD full staff_review COMMENT Kindly submitted in computer-readable form prior to publication by Dr. M.A. Gonda, Frederick Cancer Research Facility, Frederick, MD. Clones 127 and 106 (see <BIV106>, Accession number M32691) were obtained from a single virus isolation; the latter, 106, has 8391 nucleotides as the result of a sizeable deletion in env and two minor deletions in the LTR R regions. The Q and S orfs are named by analogy with similar orfs in visna; other short reading frames can be identified. FEATURES from to/span description pept 316 1746 gag polyprotein pept < 1572 4742 pol polyprotein pept 4601 5197 protein Q (put.) pept 5228 5539 protein S (put.) pept 5415 8129 env polyprotein BASE COUNT 2699 a 1798 c 2017 g 1968 t ORIGIN 1 ttcagctcgt gtagctcatt agctccgagc tccccaacct acagcctgag aggcactggc 61 tcggttgggt agccagcctt tcgggtaata aaggcttgtt ggcattcggc atctacccgt 121 gcctcctgtc ttgtcttact cgagcgaacc cacaactccg tcctgctgag ctcacagctc 181 gcggggcggt gaagaacacc caacagttgg cgcccaacgt ggggctcgag taagagagac 241 tcggctcgag taaaagaaga cccagctcga acgagaagac tccggacagg tgagtagttg 301 cgtgttttcc ccgggatgaa gagaagggag ttagaaaaga agcttcgtaa ggttagggtg 361 acaccccaac aggataaata ttatactata gggaatcttc aatgggccat tagaatgata 421 aatctaatgg ggatcaaatg tgtgtgtgac gaggagtgct cggcagcaga ggtagccctt 481 atcataaccc aattttcagc tttagactta gaaaattctc ctatcagagg taaggaggag 541 gtggccataa aaaatactct gaaggttttc tggtccctgc tggcggggta caaaccagag 601 agtacagaaa cggccctagg atattgggag gcctttacat atagagaaag ggaggccaga 661 gctgataagg aaggcgaaat taagagtatt tacccttccc taacacagaa cacacagaat 721 aagaagcaga catcgaatca gacaaacact caatcattac cagctatcac tactcaagat 781 ggtactccta ggtttgatcc tgacctcatg aagcagctta agatctggtc agacgccact 841 gaaagaaatg gggttgacct tcatgcagtg aatatattag gggtcattac agcaaaccta 901 gtacaggaag aaattaaact cctcttgaat agtacaccca agtggagatt agatgtacaa 961 cttatagaat caaaagtaag agagaaagaa aatgcccaca gaacgtggaa acagcatcat 1021 ccagaagccc caaaaacaga tgaaatcatc ggtaaggggc ttagttctgc tgaacaagcc 1081 accctgatct cagtagaatg cagagaaact ttcagacagt gggtgctgca ggcagctatg 1141 gaggtggcac aggcaaaaca tgctacccca ggtcccatca acattcatca gggacccaag 1201 gagccgtaca cagactttat aaatagatta gtggcagccc ttgaaggtat ggcggctcca 1261 gaaaccacaa aagaatactt actccaacat ctatctattg atcatgccaa tgaagactgc 1321 cagtctattc taagaccttt gggacccaac accccaatgg agaaaaaatt agaagcatgt 1381 agggtagtgg gatctcagaa atcaaagatg caatttttgg tagcagctat gaaagaaatg 1441 gggatccaat caccaattcc agcagtcttg cctcacacac cagaagcata tgcctcccaa 1501 acctcagggc ccgaggatgg taggagatgt tacggatgtg ggaagacagg acatttgaag 1561 aggaattgta aacagcaaaa atgctaccat tgtggcaaac ctggccacca agcaagaaac 1621 tgcaggtcaa aaaacgggaa gtgctcctct gccccttatg ggcagaggag ccaaccacag 1681 aacaattttc accagagcaa catgagttct gtgaccccat ctgcaccccc tcttatatta 1741 gattagacaa acagcctttt ataaaggtgt tcataggggg aagatgggta aaagggttag 1801 tagacactgg agcagatgag gtagtgctta agaacataca ttgggatagg ataaaagggt 1861 atccagggac accaattaaa caaattgggg taaatggagt aaatgtggcc aaaaggaaga 1921 cccacgtaga gtggagattt aaggataaga ctgggataat tgatgtcttg ttctcagata 1981 ctcctgtaaa cctttttggg agatctcttc tacgtagcat agtgacttgc ttcaccctac 2041 ttgttcacac agaaaaaatc gaacccctac ccgtcaaggt aaggggacca gggcctaagg 2101 taccccagtg gcccttgaca aaagaaaagt atcaggctct taaggaaatt gtgaaagatc 2161 ttttagcaga aggaaaaatt tccgaagctg cttgggataa cccatataat accccagttt 2221 ttgttataaa gaaaaaggga acgggaagat ggaggatgct aatggatttt agggaattaa 2281 ataagataac agttaaagga caagaattct ctacaggctt accttaccct ccaggaatta 2341 aggaatgtga acacttaact gcaatagata taaaagatgc ctactttact atccctttac 2401 atgaggactt tagacccttt acagccttct ctgtagtccc tgtaaatcga gaaggaccta 2461 tagagaggtt ccagtggaat gttctaccac aaggatgggt atgtagccct gccatttatc 2521 agactaccac ccagaagatt atagaaaaca ttaaaaagag tcacccagat gtcatgttgt 2581 atcaatatat ggatgatttg ttgattgggt ctaataggga tgatcataag caaatagtgc 2641 aggaaatcag ggataagtta ggatcatatg gtttcaagac tccagatgaa aaggtccagg 2701 aagagagagt gaaatggatc ggttttgagc tcacacccaa gaaatggcgt tttcagccca 2761 ggcaactaaa gataaaaaac ccactcacag taaatgaatt acagcaatta gtaggtaatt 2821 gtgtttgggt acagccagaa gtaaaaatcc ctctataccc cttaaccgat ctactgaggg 2881 ataagaccaa tctccaagaa aagatacaac taacaccaga agccatcaag tgtgtagaag 2941 aattcaatct aaaactaaaa gatccagaat ggaaagatag aataagagaa ggagcagaat 3001 tagtcataaa aatacagatg gttcctcggg gcatagtatt tgatctgttg caagatggaa 3061 atcccatatg gggaggagta aaaggactaa attatgatca ttcaaacaaa ataaaaaaga 3121 tacttagaac tatgaatgag ctgaacagaa cagtggtaat tatgacagga agagaagcta 3181 gtttcctgct tcctgggtct tctgaagatt gggaagcggc actccagaag gaagaaagtc 3241 taacacaaat attcccagta aagttttata ggcactcctg cagatggacc tccatatgtg 3301 ggccagtaag agaaaatcta accacctact atactgacgg agggaagaaa gggaaaacag 3361 ctgcagcagt atattggtgt gaaggaagga ctaagtcaaa ggtatttcca ggaaccaatc 3421 aacaggcgga attgaaggcc atatgcatgg ctctcttgga tggaccacca aaaatgaata 3481 tcataacaga tagtagatac gcctatgagg gaatgagaga agaaccagaa acgtgggcca 3541 gggaaggaat ctggctggag attgccaaga tattgccctt taagcagtac gtgggggtcg 3601 ggtgggtgcc tgcacataaa gggataggag gaaatacaga ggcagatgaa ggagttaaga 3661 aagccttaga acagatggcc ccgtgtagcc ctcctgaggc cattctatta aaaccaggag 3721 aaaaacaaaa tctggagaca gggatctaca tgcaggggct tagaccacaa agcttcctcc 3781 caagagcaga cttaccagta gccatcacag gaaccatggt agattcagag ctacagctac 3841 agctacttaa cataggaact gagcatataa gaatccaaaa agatgaggtc ttcatgacct 3901 gtttcctaga aaatatcccc tcagccactg aagatcatga gagatggcat acctcaccag 3961 acattttggt taggcagttc catctcccta agagaatagc taaagagata gtagccagat 4021 gccaagaatg taaaaggaca accactagcc cagtcagagg aacaaacccc agaggtcgat 4081 tcttatggca gatggacaat actcactgga ataaaacaat tatttgggta gcagtagaga 4141 caaattcagg attagtggaa gctcaggtga tccctgaaga aacagcacta caagtagctc 4201 tctgcatttt acagctaatc cagagatata cagttcttca cttacatagt gacaacgggc 4261 cgtgctttac tgcacacagg atagaaaatc tatgtaagta tctggggatc acaaaaacta 4321 cgggaatacc ctacaaccca caatcccagg gagttgtaga aagagcccac agagatctaa 4381 aagacagatt ggcagcttat cagggagatt gtgaaaccgt agaagcagcc cttagcctcg 4441 cattagtttc tttaaataaa aaaagagggg gaataggggg ccatacacca tatgaaatat 4501 acctagaatc agaacatacc aaataccaag accaactaga acaacaattt tcaaaacaaa 4561 aaattgaaaa gtggtgttac gtaaggaaca gaagaaagga atggaaagga ccctacaaag 4621 tgttgtggga cggagacggg gcagcagtaa tagaggaaga gggaaaaaca gccttatatc 4681 cacaccgtca tatgcgcttc atcccccccc cagattcaga tatccaagat gggagttcgt 4741 gaggcagaca gaatacagca tgaccgcgtg cgtaagaaaa gggaaattag tccttactta 4801 ccagtacgcg atctggaaaa gagtctggac gatagaaaca ggatttacag atccaagtct 4861 gtttatgacc ccagctggaa cacacaccac tgaagaaata ggtcacttag atctcttttg 4921 gcttaggtac tgttcatgtc cgcatgagat gcccccgtgg ctagacttcc ttagaggcac 4981 cctcaatcta cgcatttcct gtcgacgcgc tcttcaagcg tcagtgttga ctagcacccc 5041 tagacactcc ctccaacgct tagctgcact tcagctgtgc actaacgcat gtctctgttg 5101 gtacccgtta ggacgcatca acgacaccac cccgttgtgg ttgaactttt cgtctgggaa 5161 ggaaccaacg atccaacaac tgagtggcca cccctaactc gtcgtaacat tcatagattg 5221 tggcaatatg cccggacctt gggtggcgat gataatgttg ccacagccca aagaaagctt 5281 tggaggaaag ccaattggct ggcttttctg gaacacgtgc aaaggaccta ggcgggactg 5341 tccacattgt tgttgtccca tatgtagttg gcattgtcag ctttgctttt tgcagaaaaa 5401 tctaggaatc aactatggat caggacctag acggcgcgga acgcggggaa aggggaggag 5461 gatccgaaga actgcttcag gaggagatca acgaagggag gctgacagcc agagaagctt 5521 tacaaacatg gatcaataac ggtgagatcc acccttgggt cctggcagga atgctgtcca 5581 tgggagtagg aatgctacta ggagtatatt gtcagttacc agacacactg atttggatac 5641 taatgtttca attatgcctt tattggggtt tgggtgaaac atctagagaa ttagacaagg 5701 atagttggca gtgggtcaga agtgtattta taatagcaat attgggaact ctcactatgg 5761 caggaactgc tttggccgac gacgatcaaa gtactttaat ccccaatatc acaaaaattc 5821 ctacaaagga cacggaaccc ggttgcacct atccgtggat attaatcctc ttgattttgg 5881 ctttcatact gggaattctg ggtataatac ttgtcttgag acgcagcaac tcggaggata 5941 tattggcagc cagagatacc atagattggt ggctctcagc taatcaggaa atacctccaa 6001 agtttgcttt cccaataata ttaatatctt cccctctagc aggcataata ggatattatg 6061 tcatggaaag gcacttagag atcttcaaaa agggatgtca aatttgtggg agcctgagca 6121 gcatgtgggg aatgcttttg gaagaaattg gcaggtggct cgcacgtagg gaatggaatg 6181 ttagtagagt aatggttatc ctcttaatca gcttcagttg gggaatgtat gtcaataggg 6241 taaatgcctc agggtcacat gtagccatgg tcaccagccc tccagggtac cgcatagtga 6301 atgataccag ccaggcacct tggtattgct tctcctcggc accaatccca acgtgtagtt 6361 cctctcagtg gggagacaaa tattttgagg agaaaataaa cgagacactg gtcaaacagg 6421 tgtatgaaca ggccgcgaaa cattcgagag ccacatggat tgaacctgat ctattggagg 6481 aagcagtcta tgagctagct ctgttatcag ctaatgacag tcgtcaggtg gtggtagaaa 6541 atggtacaga cgtatgtagc tcacagaact cgagcacaaa caaaggccac ccaatgacgc 6601 ttctaaagtt gagagggcag gtgtcagaaa cttggatagg gaattcctcc ctccagtttt 6661 gtgtccagtg gccatatgtc ttggtaggtc ttaataatag tgatagtaat attagcttca 6721 attcgggaga ttggatagca accaattgta tgcacccaat tacactaaat aaaagtgcac 6781 aagatctagg aaaaaatttt ccgagactaa catttcttga cggacaactg tcccagttga 6841 agaacacact gtgcggacat aacacaaact gtttgaaatt tggaaacaag tccttcagta 6901 caaattccct aatactatgc caagacaacc ccatcggcaa cgacaccttt tatagcctaa 6961 gtcattcctt ctcaaaacag gcctctgccc ggtggattct tgtaaaggtc cccagctatg 7021 ggtttgtggt agtaaatgac acagatacac caccatccct ccgcatccga aagcctcgag 7081 cagtcggact agcaatattc ctgcttgtgc tggctatcat ggccatcaca tcctccttgg 7141 tggcagctac aacgctcgtg aaccagcaca cgacggctaa ggttgtggag agggttgtgc 7201 aaaatgtgtc atatattgct caaacccagg accaattcac ccacctgttc aggaatataa 7261 acaacagatt aaatgtccta caccatagag tttcatactt ggagtatgta gaggaaatca 7321 gacaaaaaca agtattcttt ggttgcaaac ctcatggaag gtattgccac tttgactttg 7381 gaccagagga agttggatgg aacaatagtt ggaatagcaa aacttggaat gatctacaag 7441 atgagtatga taagatagaa gaaaaaatat taaaaattcg agtggactgg ctcaatagct 7501 ccctgagtga cacacaggac acctttggcc tggagacctc tatttttgac catttagtgc 7561 aattgtttga ttggacttct tggaaagact ggataaaaat cattatagta atcattgtac 7621 tttggcttct gataaagatt ctcctaggta tgttaagaag ctgcgccaag gtcagccaga 7681 attaccaaca tctcccggcg gaggaggagg acggggacac agagccagaa agctccccgg 7741 cgagaggaga cccggcttct ggaagtctct acgagaattg gttgaacaaa ataggagaaa 7801 gcaagaacga cgcctatcgg gtctggacag aagaatacaa cagcttgagg atcttgttcg 7861 ccacatgtcg ctgggatctc ctgacccctc aactccttca gcttccgttc tttctgttaa 7921 ccctcctgct caaactcctt tgggacatct tccgccacgc tcctatttta aacttaaaag 7981 ggtggactgt ggggcagggt gggacctcag gacaacagca gcccccggac ttcccatatg 8041 tgaattggac tggatccagg gaacaaaata acccagaagg gggattagac tctggggctt 8101 ggtatgaagg cctgagaggt tctcagtaga ttgtaagtct tcggcgagac tgcatgtctg 8161 cacgtagaca ggaaatgttt atcttctcag ctgattgtgg ttaggccgat tactggaaac 8221 tagacaacct gattcattag tggttaagat tatgcataag tgctcgcaat gatgtagctg 8281 cttacgcttg cttactccgc cctgaaacgc ctaccttaac acgcaacacg cccacctgta 8341 agaatatata aaccatatct tcactctgta cttcagctcg tgtagctcat tagctccgag 8401 ctccccaacc tacagcctga gaggcactgg ctcggttggg tagccagcct ttcgggtaat 8461 aaaggcttgt tggcattcgg ca // LOCUS CEAVCG 9180 bp ss-RNA VRL 04-AUG-1990 DEFINITION Caprine arthritis encephalitis virus, complete proviral genome. ACCESSION M33677 KEYWORDS complete genome. SOURCE Caprine arthritis encephalitis virus (isolate Clements) proviral DNA, infectious molecular clone. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9180) AUTHORS Saltarelli,M., Querat,G., Konings,D.A.M., Vigne,R. and Clements,J.E. TITLE Nucleotide sequence and transcriptional analysis of an infectious clone of CAEV JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Kindly provided in computer-readable form prior to publication by Dr. Janice Clements, Johns Hopkins University, Baltimore, MD., 10 April 1990. The annotation shown below is tentative. FEATURES from to/span description pept 512 1858 gag polyprotein pept < 1717 5046 pol polyprotein (NH2-terminus uncertain) pept 5006 5695 protein Q (put.) pept 5688 5951 protein S (put.) pept 6012 8837 env polyprotein ORF 8405 8794 ORF 4 BASE COUNT 3505 a 1466 c 2301 g 1908 t ORIGIN 1 gagttctagg agagtccctc ctagtctctc ctctccgagg aggtaccgag acctcaaaat 61 aaaggagtga ttgccttact gccgagtgga gagtgattac tgagcggccg gtgtatcggg 121 agtcgtccct taatctgtgc aataccagag cggctctcgc agctggcgcc caacgtgggg 181 cccgaggaga agaaaagaaa gcggccctga gaactcggct tctgaaaaag aggaagagga 241 caagttgcta tagcaacaag agagaagaag tagagcaaag gtccagtggc tcggaaaaag 301 aggaactgaa acttcgggga cgcctgaagg agtaaggtaa gtgactctgc tgtacgcggg 361 gcgaggcaga ggtttccttc taaattgaaa gagaagtgtt gctgcgagag gtcttggtgg 421 tcgagaatcc tgtacaaaaa aaaggaggga tctcggtcag gaccaggacc cctgggagta 481 atacaacagc aacaccgtaa gaaaatccgc catggtgagt ctagatagag acatggcgag 541 gcaagtctcc ggggggaaaa gagattatcc tgagctcgaa aaatgtatca agcatgcatg 601 caagataaaa gttcgactca gaggggagca cttgacagaa ggaaattgtt tatggtgcct 661 taaaacatta gattacatgt ttgaggacca taaagaggaa ccttggacaa aagtaaaatt 721 taggacaata tggcagaagg tgaagaatct aactcctgag gagagtaaca aaaaagactt 781 tatgtctttg caggccacat tagcgggtct aatgtgttgc caaatgggga tgagacctga 841 gacattgcaa gatgcaatgg ctacagtaat catgaaagat gggttactgg aacaagagga 901 aaagaaggaa gacaaaagag aaaaggaaga gagtgtcttc ccaatagtag tgcaagcagc 961 aggagggaga agctggaaag cagtagattc tgtaatgttc cagcaactgc aaacagtagc 1021 aatgcagcat ggcctcgtgt ctgaggactt tgaaaggcag ttggcatatt atgctactac 1081 ctggacaagt aaagacatac tagaagtatt ggccatgatg cctggaaata gagctcaaaa 1141 ggagttaatt caagggaaat taaatgaaga agcagaaagg tggagaagga ataatccacc 1201 acctccagca ggaggaggat taacagtgga tcaaattatg ggggtaggac aaacaaatca 1261 agcagcagca caagctaaca tggatcaggc aaggcaaata tgcctgcaat gggtaataaa 1321 tgcattaaga gcagtaagac atatggcgca caggccaggg aatccaatgc tagtaaagca 1381 aaaaacgaat gagccatatg aagattttgc agcaagactg ctagaagcaa tagatgcaga 1441 gccagttaca cagcctataa aagattatct aaagctaaca ctatcttata caaatgcatc 1501 agcagattgt cagaagcaaa tggatagaac actaggacaa agagtacaac aagctagtgt 1561 agaagaaaaa atgcaagcat gtagagatgt gggatcagaa gggttcaaaa tgcaattgtt 1621 agcacaagca ttaaggccag gaaaaggaaa agggaatgga cagccacaaa ggtgttacaa 1681 ctgtggaaaa ccgggacatc aagcaaggca atgtagacaa ggaatcatat gtcacaactg 1741 tggaaagaga ggacatatgc aaaaagaatg cagaggaaag agagacataa ggggaaaaca 1801 gcagggaaac gggaggaggg ggatacgtgt ggtgccgtcc gctcctccta tggaataact 1861 tcagcaccac ctatggttca ggtccgcata ggttcccagc agaggaactt gttatttgat 1921 accggggcgg accgaactat agttagatgg catgagggct cgggaaaccc agccggaagg 1981 ataaaactgc aaggaatagg aggaatagta gaaggagaaa aatggaataa tgtagaatta 2041 gaatataaag gagaaacaag aaagggaaca atagtagtgt taccacaaag tccagtagaa 2101 gtattaggac gagataacat ggcccgattt ggaataaaga taataatggc aaatttagag 2161 gaaaaaagaa tcccaattac aaaagtaaaa ttgaaagagg gatgtacggg tccacatgtc 2221 ccacaatggc cattaacaga agagaaatta aaaggtctaa cagaaatcat agataaatta 2281 gtggaagaag gaaaactagg aaaggcaccc ccacattgga catgtaatac tccaatcttt 2341 tgcataaaaa agaaatcagg gaagtggaga atgttaatag atttcagaga attgaacaaa 2401 cagacagaag atttaacaga agcgcagtta ggactcccgc atccgggagg actacaaaag 2461 aaaaaacatg ttacaatatt ggacatagga gatgcatatt ttactatacc cctatatgaa 2521 ccatatcgag agtacacatg ttttactcta ttaagtccta ataatctagg accatgtaaa 2581 agatactatt ggaaagtgct gccacaaggt tggaaattga gtccatctgt atatcaattt 2641 actatgcagg agatcttaga ggattggata cagcagcatc cagaaattca atttggcata 2701 tatatggatg atatttacat aggaagtgat ttagaaatta aaaagcatag agaaatagtg 2761 aaagatttag ccaattatat tgcccaatat ggattcactc tgccagaaga gaagagacaa 2821 aagggatatc cagcaaaatg gctaggattt gaactacacc cgcagacctg gaaatttcag 2881 aagcatacat tacctgaatt aacaaaggga acaataacat taaataaatt acagaaatta 2941 gtaggagaat tagtatggag acaatccata attgggaaaa gcattcctaa cattctgaaa 3001 ttaatggaag gagatagaga attacaaagt gaaagaaaaa ttgaagaagt acatgtgaaa 3061 gaatgggaag catgtaggaa aaaattagaa gaaatggaag gaaattatta taataaagac 3121 aaagatgtct atggacaatt ggcttgggga gacaaagcta tagaatatat agtgtatcag 3181 gagaaaggga aaccattatg ggtaaatgtg gttcacaata taaagaacct aagcatcccg 3241 caacaggtta ttaaagcagc gcaaaaatta acccaagaag tcatcattag gacaggaaaa 3301 ataccatgga tattgttgcc agggaaagaa gaagattgga gactagaatt gcaattaggg 3361 aacatcacat ggatgccaaa attttggtcc tgttatcgag gacatacaag atggagaaaa 3421 agaaatataa tagaagaagt agtagaaggg cctacatatt atacagatgg aggaaaaaag 3481 aataaagtag gaagtctagg gttcatagta tcaacagggg aaaaatttag aaagcatgaa 3541 gagggcacaa accagcaact agaattaaga gccatagagg aagctctaaa acaagggcct 3601 caaacaatga atttagtaac agatagtaga tatgcatttg aatttttatt aagaaattgg 3661 gatgaagaag taataaagaa tccaattcaa gcaagaatta tggaaattgc ccacaagaaa 3721 gataggatag gagtgcattg ggtgccagga cataaaggga ttccccaaaa tgaagaaata 3781 gacaaatata tttcggaaat atttcttgca aaagaaggag aaggaattct cccaaaaaga 3841 gaagaggatg cagggtatga tttaatatgc ccagaagagg ttaccataga gccaggacaa 3901 gtgaaatgca tccccataga gctaagatta aatttaaaga aatcacaatg ggctatgatt 3961 gctacaaaaa gcagcatggc tgccaaagga gtgttcacac aaggaggaat catagactca 4021 ggatatcagg gacaaataca ggtaataatg tataatagca ataaaatagc agtagtcata 4081 ccccaaggga gaaaatttgc acaattaata ttaatggata aaaagcatgg aaaattggaa 4141 ccctgggggg aaagcagaaa aacagaaagg ggagaaaaag gatttgggtc tacaggaatg 4201 tattggatag aaaatattcc tctggcagag gaagaccaca caaaatggca tcaagatgcc 4261 cgatcattgc atctagaatt tgaaattcca agaacagcag cagaagacat agtaaatcaa 4321 tgtgaaatat gcaaagaagc gaggacacct gcagtaatta gaggcggaaa caaaaggggg 4381 gtaaatcatt ggcaagtgga ttatacccat tatgaaaata tcatactatt agtatgggta 4441 gaaacaaatt caggactaat atatgcagaa aaagtaaaag gagaatcagg gcaagaattc 4501 agaataaaag tgatgcattg gtatgcatta tttggtccag agtcattgca gtcagacaat 4561 ggacctgcat ttgcagcaga gcccacacag ctgttaatgc aatacctagg agtaaaacac 4621 acaacaggca taccttggaa tccacagtct caggctatag tagaaagggc acatcaacta 4681 ttgaaaagca ctttaaagaa gttccagcca caatttgtcg ctgtagaatc agccatagca 4741 gcagccctag tcgccataaa tataaaaaga aagggtgggc tggggacaag ccctatggat 4801 atttttatat ataataaaga acagaaaaga ataaataata aatataataa aaattctcaa 4861 aaaattcaat tctgttatta cagaataagg aaaagaggac atcaggagag tggaaaggac 4921 caacccaggt actgtggaaa ggggaaggag ccaattgtgg taaaggatat agaaagtgaa 4981 aagtatttag taatacctta caaagatgca aaattcatcc cgccaccaac aaaagaaaag 5041 gaataaaaaa cctggaccag aattaccctt agcactatgg atacatatag cagaaagcat 5101 taatggggat agctcatggt acataacaat gagactgcaa cagatgatgt ggggaaaaag 5161 aggaaataag ttacaatata agaatgaaga cagggaatat gaaaattggg aaattacatc 5221 atggggatgg aaaatgcacc taaggagagt gaaacaatgg atacaagaca acaggagagg 5281 aagcccatgg cagtacaaag taggaggaac atggaaaagt ataggagtgt ggttcctgca 5341 agcaggagat tacagaaagg tagacaggca cttctggtgg gcatggagga tactgatatg 5401 ttcctgcagg aaagaaaagt ttgatataag agaatttatg agaggaagac atagatggga 5461 tttgtgcaaa tcctgtgctc aaggagaagt agtaaagcat actagaacaa aaagtctgga 5521 aagactagta ctgctacaga tggtagaaca gcatgtgttt caagtattgc cattgtggag 5581 agccaggaga agtagtacaa cagatttccc atggtgcagg gacacaacgg gatacacgca 5641 tgcgtggtct gtccaggagt gctggttgat ggaatatctc ttagaggatg agtgaagaac 5701 tgcctcaaag aagggagaca catccagaag aacttgtaag gaacgtacgg gaaagagaaa 5761 gggatacatg gcaatggaca agcatcagag tacctgcgga aatactgcaa agatggcttg 5821 ctatgcttag gtcaggcaga aatagaaaga aagtgtatag agaaatgcaa aaatggatgt 5881 ggatacatcc caaggcgcct gtgattaggg cctgtggatg cagactatgt aacccggggt 5941 ggggaacata atcaagggaa taataaatgc aaataaatgt aactaacaag tagcaaaagt 6001 gtctgtgtta gatggatgct ggggccagat acatgcgctt aactgggaag gaaaactggg 6061 ttgaagtaac catggacgga gagaaggaaa ggaaaagaga aggtttcact gcgggacagc 6121 aaggtaagta tcaaccccag gtaagtaagc aaatagggaa cagaaatact aacccatgct 6181 ttgcctataa agggatattc ctatggagga tatcactaac aatgtggata ttgctaggga 6241 taaatatgtg tgtcagtgca gaggattaca taacactaat atcagatccc tatgggttct 6301 cacccataaa aaatgtgtct ggggtaccag tgacttgtgt aacaaaagaa ttcgcaaaat 6361 ggggatgtca accactagga gcgtaccctg atccagaaat agaatacaga aatgtgagtc 6421 aggaagtagt gaaagaagta tatcaagaga attggccatg gaatacatat cattggcctc 6481 tctggcaaat ggagaatgtt aggtactggt taaaagaaaa tatgcaagaa aatcaacaga 6541 gaaaaaataa tacaaaagag ggtatagagg aattattagc aggaactata aggggaagat 6601 tctgtgtacc atacccattt gccttgttaa aatgcacaaa gtggtgctgg tatacagcgg 6661 ccataaacaa cgagtcagga aaagcaggaa aaataaaaat aaattgcaca gaagcaagag 6721 cagtctcctg tacagaggac atgccattag cctcaataca aagagcatat tgggatgaga 6781 aagacagaga gagcatggcc tttatgaata tcaaagcatg tgatagcaac ctaaggtgtc 6841 agaaaagacc tggagggtgt atggaaggat accctatccc agtaggagca gaaataatcc 6901 ctgaaagtat gaaataccta aggggagcaa agagtcagta tgggggaata aaagataaga 6961 atggagaatt aaaattacca ttaacattaa gagtgtgggt aaaattagca aatgtgtcag 7021 aatgggtaaa tgggacaccc ccggattggc aagacagaat taacggatcc aaaggaataa 7081 atgggacgct ctggggagag cttaacagta tgcatcacct aggatttgcc cttagccaga 7141 acggcaaatg gtgtaactac accggggaaa taaaattagg gcaagaaaca ttccaatatc 7201 attacaagcc aaactggaac tgtaccggga attggacgca atatccggtg tggcaagtga 7261 ttagaaacct ggatatggtg gaacatatga caggagaatg tgtgcagaga ccacaaaggc 7321 acaatataac agtaggaaat ggaaccataa cagggaattg cagtacaaca aactgggatg 7381 gatgtaattg ctcacgatca ggaaactacc tatataacag ctctgaggga ggattgttat 7441 taattctgtg cagacaaaac agcaccctaa caaggatcct gggaacaaat acaaattgga 7501 caactatgtg gggaatatac aaaaattgtt caggatgcga gaatgcaaca ttagacaaca 7561 caggagaagg aaccttagga ggtgtagcta ataagaactg tagcttgcct cataaaaatg 7621 agagcaacaa gtggacttgt gccccaagac aaagagatgg aaaaacagat tcgctataca 7681 tagcaggagg aaaaaagttt tggacacgaa ttaaggccca attcagctgt gaaagtaaca 7741 taggacaatt agatggaatg ttgcatcagc aaatactatt gcaaaaatat caagtaatta 7801 aggtaagagc ttatacatat ggggtgatag aaatgccaga aaactatgca aaaacaagaa 7861 tcataaacag gaaaaaaaga gaactcagcc acaagaggaa gaagagaggc gttggcttgg 7921 tcattatgct agttatcatg gcaatagtag ctgccgcagg ggcttctctg ggagtcgcaa 7981 acgcgattca gcagtcttac actaaggcag ctgtccagac ccttgctaat gcaactgctg 8041 cacagcagga tgtgttagag gcaacctatg ccatggtaca gcatgtggct aaaggcgtac 8101 gaatcttgga agctcgagtg gaagctatca cagatagaat aatgctatac caagaattgg 8161 attgttggca ctatcatcaa tactgtataa cctctacaaa aacagaagta gcaaaatata 8221 tcaattggac gaggtttaag gataattgca catggcagca gtgggagaga ggattacagg 8281 ggtatgatac aaacttaaca atactgttaa aggaatcagc agcaatgaca caactagcag 8341 aagagcaagc aaggaggata ccagaagtat gggaaagttt aaaagacgtc tttgattggt 8401 caggatggtt ctcatggcta aagtatattc ctattatagt agtaggatta ttaggatgca 8461 ttctgataag agctgtgata tgtgtatgtc aacctcttgt gcagatatac agaactctaa 8521 gtaccccgac ataccaacgg gtcacagtca tcatggaaac aagagcagac gtcgcaggag 8581 aaaatcagga ttttggcgat ggcttagagg aatcagacaa cagcgaaaca agcgaaagag 8641 tgacagtaca gaaagcttgg agccgtgcct gggagctttg gcagaactca ccctggaagg 8701 agccatggaa aaggggcctg ctgaggctgc tcgtccttcc gctgacgatg ggaatctgga 8761 taaatggatg gcttggagaa caccacaaaa ataaaaaaag aaagggtgac tgtgagacat 8821 gggctaaaga ggactaataa caagctaggc caaattcctg taaatcactt ggggggttat 8881 aagaaaagca agttcactat gacaaagcaa aatgtaaagg ccaaattcct gtaaatcact 8941 tggggggtta taagaaaagc aagttcacta tgacaaagca aaatgtaacc gcaagtgctg 9001 acagatgtaa cagctgacat atcagctgat gcttgctcat gctgacactg tagctctgag 9061 ctgtatataa ggagaagctt gctgcttgca cttcagagtt ctaggagagt ccctcctagt 9121 ctctcctctc cgaggaggta ccgagacctc aaaataaagg agtgattgcc ttactgccga // LOCUS CEAVLTRM 522 bp ss-RNA VRL 04-AUG-1990 DEFINITION Caprine arthritis encephalitis virus proviral long terminal repeat. ACCESSION M14149 KEYWORDS long terminal repeat. SOURCE Caprine arthritis encephalitis virus (CAEV) unintegrated proviral DNA. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 522) AUTHORS Hess,J.L., Pyper,J.M. and Clements,J.E. TITLE Nucleotide sequence and transcriptional activity of the caprine arthritis-encephalitis virus long terminal repeat JOURNAL J. Virol. 60, 385-393 (1986) STANDARD full staff_review COMMENT [1] presents the LTR for CAEV in a fashion that includes sequence data downstream of the 5' LTR (bases 499-522) and sequence data upstream of the 3' LTR (bases 1-48). For this reason, two "pre-msg" entries are found in the FEATURES table, one for the 5'end of the unspliced transcript and the other for the 3' end of the unspliced transcript. Similarly, both the TATA box and the polyadenylation signal are annotated. The CAEV LTR and the Visna LTR are approximately 50% homologous. [1] did not find a significant level of homology of CAEV LTR with any other retroviral LTR. [1] showed that, unlike Visna and HIV, CAEV LTR is not transactivated. Plasmids constructed with CAEV LTR and CAT showed no evidence of transactivated up-regulation in four cell types (SCP, GSM, SAM and primary goat macrophages). This may explain CAEV's lack of cytotoxicity. FEATURES from to/span description pre-msg < 1 420 genomic mRNA, 3' end (see COMMENT) pre-msg 336 > 522 genomic mRNA, 5' end (see COMMENT) site 336 336 mRNA cap site LTR 49 498 long terminal repeat rpt 87 157 71 bp direct repeat, 5' copy rpt 158 228 71 bp direct repeat, 3' copy rpt 336 420 R repeat binding 499 515 Lys-tRNA primer binding site signal 306 310 TATA box signal 393 398 poly-A signal BASE COUNT 163 a 108 c 137 g 114 t ORIGIN 84 bp upstream of AluI site. 1 aaatggatgg cttggagaac accacaaaaa taaaaaaaga aagggtggac tgtgagacat 61 gggctaaaga ggactaataa caagctaggc caaattcctg taaatcactt ggggggttat 121 aagaaaagca agttcactat gacaaagcaa aatgtaaagg ccaaattcct gtaaatcact 181 tggggggtta taagaaaagc aagttcacta tgacaaagca aaatgtaacc gcaagtgctg 241 acagatgtaa cagctgacat atcagctgat gcttgctcat gctgacactg tagctctgag 301 ctgtatataa ggagaagctt gctgcttgca cttcagagtt ctaggagagt ccctcctagt 361 ctctcctctc cgaggaggta ccgagacctc aaaataaagg agtgattgcc ttactgccga 421 gtggagagtg attactgagc ggccggtgta tcgggagtcg tcccttaatc tgtgcaatac 481 cagagcggct ctcgcagctg gcgcccaacg tggggcccga gg // LOCUS CEAVPOL 651 bp ss-RNA VRL 04-AUG-1990 DEFINITION Caprine arthritis encephalitis virus proviral pol gene, partial cds. ACCESSION K03327 KEYWORDS . SOURCE Caprine arthritis encephalitis virus proviral DNA. ORGANISM Caprine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (sites; for [2]) AUTHORS Chiu,I.-M., Yaniv,A., Dahlberg,J.E., Gazit,A., Skuatz,S.F., Tronick,S.R. and Aaronson,A. TITLE Nucleotide sequence evidence for relationship of AIDS retrovirus to lentiviruses JOURNAL Nature 317, 366-368 (1985) STANDARD full staff_review REFERENCE 2 (bases 1 to 651) AUTHORS Chiu,I.-M. JOURNAL Unpublished (1985) Ohio State U. Medical Sch., Columbus, Ohio STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by I.-M.Chiu, 09-OCT-1985. FEATURES from to/span description pept < 1 > 651 pol protein (reverse transcriptase; AA at 1) BASE COUNT 269 a 102 c 122 g 158 t ORIGIN 434 bp upstream of BglII site 1 gtcccacaat ggccattgac agaagaaaaa ttaaaaggtc tatcagaaat agtagataaa 61 ttagtaaaaa aaaaaaaact aggaaaggca cccccacatt ggacatgtaa tactccaata 121 ttttgcataa aaaagaaatc aggaaaatgg aaaatgttaa tagatttcag aaaattaaac 181 aaacaaacag aggatttaac agaagcgcaa ttaggacttc cgcatccggg gggactacaa 241 aagaaaaaac atgttacaat attagacata ggagatgcat attttactat acccctatat 301 aaaccatatc gagagtacac ttgctttact ctattaagtc ctaataatct aggaccatgt 361 aaaagatatt attggaaggt tctaccacaa ggttggaaac tgagtccatc tgtatatcaa 421 tttaccatgc aggagatctt aggggaatgg atacaggagc atcctgaaat tcaatttaga 481 atatatatgg atgatatcta cataagaagt gatttagaaa taaaaaagca tagagagata 541 gtggaggaat tagccaatta tattgcccaa tatagattca ctctgccaga agagaaaagg 601 caagagagat atccagcaaa atggttaggg tatgaactac atcctcaggc c // LOCUS EAEVWU5 2580 bp ss-RNA VRL 04-AUG-1990 DEFINITION Equine arthritis encephalitis virus, envelope gene, complete cds. ACCESSION X16988 KEYWORDS envelope protein. SOURCE Equine arthritis encephalitis virus, clone pWU5. ORGANISM Equine arthritis encephalitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 2580) AUTHORS McGuire,T.C., Lacy,P.A. and O'Rourke,K. TITLE cDNA sequence of the env gene of a pathogenic equine infectious anemia lentivirus variant JOURNAL Nucleic Acids Res. 18, 196 (1990) STANDARD full staff_review COMMENT Kindly submitted in computer-readable form by K. O'Rourke, Washington State University, Pullman WA. FEATURES from to/span description pept 1 2580 env polyprotein BASE COUNT 891 a 428 c 565 g 696 t ORIGIN Start of env cds. 1 atggtcagca tcgcattcta tggggggatc ccagggggaa tctcaacccc tattacccaa 61 cagtcagaaa aatccaagta tgaggaaaac acaatgtttc aaccttattg ttataataat 121 gacagtaaga acagcatggc agaatcgaag gaagcaagag accaagaaat gaacctgaaa 181 gaagaatcta aagaagaaaa aagaagaaat gactggtgga aaataggtat gtttctgtta 241 tgcttagcag gaactactgg aggaatactt tggtggtatg aaggactccc acagcaacat 301 tatatagggt tggtagcgat agggggaaga ttaaacggat ctggccaatc aaatgctata 361 gaatgctggg gttccttccc ggggtgtaga ccatttcaaa attacttcag ttatgagacc 421 aatagaagca tgcatatgga taataatact gctacattat tagaagctta tcatagagag 481 ataacattca tttataagtc ttcttgcaca gatagtgatc attgtcaaga gtatcaatgt 541 aaaaaagtta atcttaattc ctctgactcc tctaactctg tacgtgttga ggatgtaacg 601 aacacagcgg aatattgggg atttaaatgg ctagaatgta atcaaacaga aaattttaag 661 actatattag tacctgaaaa tgaaatggta aatatcaatg atactgatac ctggatacct 721 aaggggtgta atgagacgtg ggcaagagtg aaacgttgtc ctatagatat tttatatggg 781 atacatccaa tcaggctgtg tgtacagcca ccattttttc tggtacagga gaaagggatt 841 gctgatactt ctagaattgg caattgtggg ccaacaatat ttcttggggt tttagaagat 901 aataagggag tagtacgggg ggattataca gcctgcaatg tgagtcgcct aaatataaat 961 agaaaggatt atacagggat ctatcaagta cctatatttt atacatgtac tttcactaac 1021 ataacttcct gcaataatga gccaataatc agtgttatca tgtatgaaac aaaccaggta 1081 caatatttat tgtgtaataa taataatagt aataattata attgtgtagt acaaagtttt 1141 ggagttatag gacaggcaca cttagaactg cctagaccta acaaaagaat aaggaatcaa 1201 agctttaacc aatataactg ctctataaat aacaaaacag aattagaaac atggaagtta 1261 gtaaagactt ctggcataac tcctttacct atttcttctg aagctaacac tggactaatt 1321 agacataaga gagattttgg tataagtgca atagtggcag ctattgtagc cgctactgct 1381 attgctgcta gcgctactat gtcttatgtt gctctaactg aggttaacaa aataatggaa 1441 gtacaaaatc atacttttga ggtagaaaat agtactctaa atggtatgga tttaatagaa 1501 cgacaaataa agatattata tgctatgatt cttcaaacac atgcagatgt tcaactgtta 1561 aaggaaagac aacaggtaga ggagacattt aatttaattg gatgtataga aagaacacat 1621 gtattttgtc atactggtca tccctggaat atgtcatggg gacatttaaa tgagtcaaca 1681 caatgggatg actgggtaag caaaatggaa gatttaaatc aagagatact aactacactt 1741 catggagcca ggaacaattt ggcacaatcc atgataacat tcaatacacc agatagtata 1801 gctcaatttg gaaaagacct ttggagtcat attggaaatt ggattcctgg attgggagct 1861 tccattataa aatatatagt gatgtttttg cttatttatt tgttactaac ctcttcgcct 1921 aagatcctca gggccctctg gaaagtgacc agtggtgcag ggtcctccgg cagtcgttac 1981 ctgaagaaaa aattccatca caaacatgca tcacgagaag acacctggga ccaggcccaa 2041 cacaacatac acctagcagg cgtgaccggt ggatcagggg acaaatacta caagcagaag 2101 tactccagga acgactggaa tggagaatca gaggagtaca acaggcggcc aaagagctgg 2161 gtgaagtcaa tcgaggcatt tggagagagc tatatttccg agaagaccaa aggggagatt 2221 tctcagcctg gggcggctat caacgagcac aagaacggct ctggggggaa caatcctcac 2281 caagggtcct tagacctgga gattcgaagc gaaggaggaa acatttatga ctgttgcatt 2341 aaagcccaag aaggaactct cgctatccct tgctgtggat ttcccttatg gctattttgg 2401 ggactagtaa ttatagtagg acgcatagca ggctatggat tacgtggact cgctgttata 2461 ataaggattt gtactagagg cttaaatttg atatttgaaa taatcagaaa aatgcttgat 2521 tatattggaa gagctttaaa tcctggcaca tctcatgtat caatgcctca gtatgtttag // LOCUS EIAV 8344 bp ss-RNA VRL 04-AUG-1990 DEFINITION Equine infectious anemia virus (EIAV), complete proviral genome. ACCESSION M16575 M11337 K03334 M14855 KEYWORDS complete genome. SOURCE Equine infectious anemia virus proviral DNA, clones 1369 and 409-2 [1]. ORGANISM Equine infectious anemia virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 8229) AUTHORS Kawakami,T., Sherman,L., Dahlberg,J., Gazit,A., Yaniv,A., Tronick,S.R. and Aaronson,S.A. TITLE Nucleotide sequence analysis of equine infectious anemia virus proviral DNA JOURNAL Virology 158, 300-312 (1987) STANDARD full staff_review REFERENCE 2 (bases 324 to 5122) AUTHORS Stephens,R.M., Casey,J.W. and Rice,N.R. TITLE Equine infectious anemia virus gag and pol genes: Relatedness to visna and AIDS virus JOURNAL Science 231, 589-594 (1986) STANDARD full staff_review REFERENCE 3 (bases 2300 to 2950) AUTHORS Chiu,I.-M. JOURNAL Unpublished (1985) Meloy Labs Inc, Springfield, VA 22151 STANDARD full staff_review REFERENCE 4 (sites; pept) AUTHORS Chiu,I.-M., Yaniv,A., Dahlberg,J.E., Gazit,A., Skuatz,S.F., Tronick,S.R. and Aaronson,A. TITLE Nucleotide sequence evidence for relationship of AIDS retrovirus to lentiviruses JOURNAL Nature 317, 366-368 (1985) STANDARD full staff_review REFERENCE 5 (bases 4769 to 8048; 8069 to 8344) AUTHORS Payne,S.L. JOURNAL Unpublished (1987) Dept Biochem, Louisiana St Univ, Baton Rouge, LA STANDARD full staff_review REFERENCE 6 (bases 4769 to 8048; 8069 to 8228) AUTHORS Rushlow,K., Olsen,K., Stiegler,G., Payne,S.L., Montelaro,R.C. and Issel,C.J. TITLE Lentivirus genomic organization: The complete nucleotide sequence of the env gene region of equine infectious anemia virus JOURNAL Virology 155, 309-321 (1986) STANDARD full staff_review REFERENCE 7 (sites; correction of 5348 [1] personal communication) AUTHORS Tronick,S.R. and Fuller,F. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Correction of [1] by [7] leads to agreement with [6] regarding the env cds. The sequence starting with coordinate 8230 is horse cellular DNA. FEATURES from to/span description pept 465 1925 gag polyprotein (put.) pept < 1682 5122 pol polyprotein (AA at 1682; NH2-terminus uncertain) pept 5312 7891 envelope polyprotein pept < 7234 7641 3' ORF protein (AA at 7233; NH2-terminus uncertain) pre-msg 209 8193 EIAV genomic RNA cell 8230 8344 horse cellular DNA binding 324 342 primer (Lys-tRNA) binding site variant 342 342 c in [1], t in [2] variant 2341 2341 t in [1], c in [2],[3] variant 2349 2349 t in [1], c in [2] variant 2436 2436 g in [1], a in [3] variant 2677 2677 t in [1], a in [2] variant 2865 2870 tggaag in [1], agtaaa in [2] variant 2870 2870 g in [1], a in [3] variant 3777 3777 g in [1], a in [2] variant 3858 3858 g in [1], a in [2] variant 4529 4530 ac in [1], ca in [2] variant 4573 4573 a in [1], g in [2] revision 5348 5348 gg in [6],[7]; [g] in [1] variant 6348 6348 a in [1], g in [5],[6] variant 6350 6350 g in [1], a in [5],[6] variant 7060 7060 c in [1], t in [5],[6] variant 8048 8069 gagttccgcatttgtgacgcgt in [1], gt in [5],[6] variant 8073 8073 g in [1], c in [5],[6] variant 8098 8098 a in [1], g in [5],[6] variant 8104 8104 g in [1], a in [5],[6] variant 8118 8119 ga in [1], gca in [5],[6] mRNA 210 8193 EIAV genomic mRNA BASE COUNT 2984 a 1358 c 1846 g 2156 t ORIGIN 1 tgtggggttt ttatgagggg ttttataaat gattataaga gtaaaaagaa agttgctgat 61 gctctcataa ccttgtataa cccaaaggac tagctcatgt tgctaggcaa ctaaaccgca 121 ataaccgcat ttgtgacgcg agttccgcat ttgtgacgcg ttaagttcct gtttttacag 181 tatataagta cttgtgttct gacaattggg actcagattc tgcggtctga gtcccttctc 241 tgctgggctg aaaaggcctt tgtaataaat ataattctct actcagtccc tgtctctagt 301 ttgtctgttc gagatcctac agttggcgcc cgaacaggga cctgagaggg gcgcagaccc 361 tacctgttga acctggctga tcgtaggatc cccgggacag cagaggagaa cttacagaag 421 tcttctggag gtgttcctgg ccagaacaca ggaggacagg taagatggga gaccctttga 481 catggagcaa ggcgctcaag aagttagaga aggtgacggt acaagggtct cagaaattaa 541 ctactggtaa ctgtaattgg gcgctaagtc tagtagactt atttcatgat accaactttg 601 taaaagaaaa ggactggcag ctgagggatg tcattccatt gctggaagat gtaactcaga 661 cgctgtcagg acaagaaaga gaggcctttg aaagaacatg gtgggcaatt tctgctgtaa 721 agatgggcct ccagattaat aatgtagtag atggaaaggc atcattccag ctcctaagag 781 cgaaatatga aaagaagact gctaataaaa agcagtctga gccctctgaa gaatatccaa 841 tcatgataga tggggctgga aacagaaatt ttagacctct aacacctaga ggatatacta 901 cttgggtgaa taccatacag acaaatggtc tattaaatga agctagtcaa aacttatttg 961 ggatattatc agtagactgt acttctgaag aaatgaatgc atttttggat gtggtacctg 1021 gccaggcagg acaaaagcag atattacttg atgcaattga taaaatagca gatgattggg 1081 ataatagaca tccattaccg aatgctccac tggtggcacc accacaaggg cctattccca 1141 tgacagcaag gtttattaga ggtttaggag tacctagaga aagacagatg gagcctgctt 1201 ttgatcagtt taggcagaca tatagacaat ggataataga agccatgtca gaaggcatca 1261 aagtgatgat tggaaaacct aaagctcaaa atattaggca aggagctaag gaaccttacc 1321 cagaatttgt agacagacta ttatcccaaa taaaaagtga gggacatcca caagagattt 1381 caaaattctt gactgataca ctgactattc agaacgcaaa tgaggaatgt agaaatgcta 1441 tgagacattt aagaccagag gatacattag aagagaaaat gtatgcttgc agagacattg 1501 gaactacaaa acaaaagatg atgttattgg caaaagcact tcagactggt cttgcgggcc 1561 catttaaagg tggagccttg aaaggagggc cactaaaggc agcacaaaca tgttataact 1621 gtgggaagcc aggacattta tctagtcaat gtagagcacc taaagtctgt tttaaatgta 1681 aacagcctgg acatttctca aagcaatgca gaagtgttcc aaaaaacggg aagcaagggg 1741 ctcaagggag gccccagaaa caaactttcc cgatacaaca gaagagtcag cacaacaaat 1801 ctgttgtaca agagactcct cagactcaaa atctgtaccc agatctgagc gaaataaaaa 1861 aggaatacaa tgtcaaggag aaggatcaag tagaggatct caacctggac agtttgtggg 1921 agtaacatat aatctagaga aaaggcctac tacaatagta ttaattaatg atactccctt 1981 aaatgtactg ttagacacag gagcagatac ttcagtgttg actactgcac attataatag 2041 gttaaaatat agagggagaa aatatcaagg gacgggaata ataggagtgg gaggaaatgt 2101 ggaaacattt tctacgcctg tgactataaa gaaaaagggt agacacatta agacaagaat 2161 gctagtggca gatattccag tgactatttt gggacgagat attcttcagg acttaggtgc 2221 aaaattggtt ttggcacagc tctccaagga aataaaattt agaaaaatag agttaaaaga 2281 gggcacaatg gggccaaaaa ttcctcaatg gccactcact aaggagaaac tagaaggggc 2341 taaagagata gtccaaagac tattgtcaga gggaaaaata tcagaagcta gtgacaataa 2401 tccttataat tcacccatat ttgtaataaa aaagaggtct ggcaaatgga ggttattaca 2461 agatctgaga gaattaaaca aaacagtaca agtaggaacg gaaatatcca gaggattgcc 2521 tcacccggga ggattaatta aatgtaaaca catgactgta ttagatattg gagatgcata 2581 tttcactata cccttagatc cagagtttag accatataca gctttcacta ttccctccat 2641 taatcatcaa gaaccagata aaagatatgt gtggaattgt ttaccacaag gattcgtgtt 2701 gagcccatat atatatcaga aaacattaca ggaaatttta caacctttta gggaaagata 2761 tcctgaagta caattgtatc aatatatgga tgatttgttc gtgggaagta atggttctaa 2821 aaaacaacac aaagagttaa tcatagaatt aagggcaatc ttactggaag agggttttga 2881 gacaccagat gataaattac aagaagtgcc accttatagc tggctaggtt atcaactttg 2941 tcctgaaaat tggaaagtac aaaaaatgca attagacatg gtaaagaatc caacccttaa 3001 tgatgtgcaa aaattaatgg ggaatataac atggatgagc tcaggggtcc cagggttgac 3061 agtaaaacac atagcagcta ctactaaggg atgtttagag ttgaatcaaa aagtaatttg 3121 gacggaagag gcacaaaaag agttagaaga aaataatgag aagattaaaa atgctcaagg 3181 gttacaatat tataatccag aagaagaaat gttatgtgag gttgaaatta caaaaaatta 3241 tgaggcaact tatgttataa aacaatcaca aggaatccta tgggcaggta aaaagattat 3301 gaaggctaat aagggatggt caacagtaaa aaatttaatg ttactgttgc aacatgtggc 3361 aacagaaagt attactagag taggaaaatg tccaacgttt aaggtaccat ttaccaaaga 3421 gcaagtaatg tgggaaatgc aaaaaggatg gtattattct tggctcccag aaatagtata 3481 tacacatcaa gtagttcatg atgattggag aatgaaattg gtagaagaac ctacatcagg 3541 aataacaata tacactgatg ggggaaaaca aaatggagaa ggaatagcag cttatgtgac 3601 cagtaatggg agaactaaac agaaaaggtt aggacctgtc actcatcaag ttgctgaaag 3661 aatggcaata caaatggcat tagaggatac cagagataaa caagtaaata tagtaactga 3721 tagttattat tgttggaaaa atattacaga aggattaggt ttagaaggac cacaaagtcc 3781 ttggtggcct ataatacaaa atatacgaga aaaagagata gtttattttg cttgggtacc 3841 tggtcacaaa gggatatgtg gtaatcaatt ggcagatgaa gccgcaaaaa taaaagaaga 3901 aatcatgcta gcataccaag gcacacaaat taaagagaaa agagatgaag atgcagggtt 3961 tgacttatgt gttccttatg acatcatgat acctgtatct gacacaaaaa tcatacccac 4021 agatgtaaaa attcaagttc ctcctaatag ctttggatgg gtcactggga aatcatcaat 4081 ggcaaaacag gggttattaa ttaatggagg aataattgat gaaggatata caggagaaat 4141 acaagtgata tgtactaata ttggaaaaag taatattaaa ttaatagagg gacaaaaatt 4201 tgcacaatta attatactac agcatcactc aaattccaga cagccttggg atgaaaataa 4261 aatatctcag agaggggata aaggatttgg aagtacagga gtattctggg tagaaaatat 4321 tcaggaagca caagatgaac atgagaattg gcatacatca ccaaagatat tggcaagaaa 4381 ttataagata ccattgactg tagcaaaaca gataactcaa gaatgtcctc attgcactaa 4441 gcaaggatca ggacctgcag gttgtgtcat gagatctcct aatcattggc aggcagattg 4501 cacacatttg gacaataaga taatattgac ttttgtagag tcaaattcag gatacataca 4561 tgctacatta ttatcaaaag aaaatgcatt atgtacttca ttggctattt tagaatgggc 4621 aagattgttt tcaccaaagt ccttacacac agataacggc actaattttg tggcagaacc 4681 agttgtaaat ttgttgaagt tcctaaagat agcacatacc acaggaatac catatcatcc 4741 agaaagtcag ggtattgtag aaagggcaaa taggaccttg aaagagaaga ttcaaagtca 4801 tagagacaac actcaaacac tggaggcagc tttacaactt gctctcatta cttgtaacaa 4861 agggagggaa agtatgggag gacagacacc atgggaagta tttatcacta atcaagcaca 4921 agtaatacat gagaaacttt tactacagca agcacaatcc tccaaaaaat tttgttttta 4981 caaaatccct ggtgaacatg attggaaggg acctactagg gtgctgtgga agggtgatgg 5041 tgcagtagta gttaatgatg aaggaaaggg aataattgct gtaccattaa ccaggactaa 5101 gttactaata aagccaaatt gagtattgtt gcaggaagca agacccaact accattgtca 5161 gctgtgtttc ctgaggtctc taggaattga ttacctcgat gcttcattaa ggaagaagaa 5221 taaacaaaga ctgaaggcaa tccaacaagg aagacaacct caatatttgt tataaggttt 5281 gatatatggg attatttggt aaaggggtaa catggtcagc atcgcattct atggggggat 5341 cccaggggga atctcaaccc ctattaccca acagtcagaa aaatctaagt gtgaggagaa 5401 cacaatgttt caaccttatt gttataataa tgacagtaag aacagcatgg cagaatcgaa 5461 ggaagcaaga gaccaagaaa tgaacctgaa agaagaatct aaagaagaaa aaagaagaaa 5521 tgactggtgg aaaataggta tgtttctgtt atgcttagca ggaactactg gaggaatact 5581 ttggtggtat gaaggactcc cacagcaaca ttatataggg ttggtggcga tagggggaag 5641 attaaacgga tctggccaat caaatgctat agaatgctgg ggttccttcc cggggtgtag 5701 accatttcaa aattacttca gttatgagac caatagaagc atgcatatgg ataataatac 5761 tgctacatta ttagaagctt atcatagaga gataacattc atttataagt cttcttgcac 5821 agatagtgat cattgtcaag agtatcaatg taaaaaagtt aatcttaatt cctctgactc 5881 ctctaactct gtacgtgttg aggatgtaac gaacacagcg gaatattggg gatttaaatg 5941 gctagaatgt aatcaaacag aaaattttaa gactatatta gtacctgaaa atgaaatggt 6001 aaatatcaat gatactgata cctggatacc taaggggtgt aatgagacgt gggcaagagt 6061 gaaacgttgt cctatagata ttttatatgg gatacatcca atcaggctgt gtgtacagcc 6121 accatttttt ctggtacagg agaaagggat tgctgatact tctagaattg gcaattgtgg 6181 gccaacaata tttcttgggg ttttagaaga taataaggga gtagtacggg gggattatac 6241 agcttgcaat gtgcgtcgcc taaatataaa tagaaaggat tatacaggga tctatcaagt 6301 acctatattt tatacatgta ctttcactaa cataacttcc tgcaataatg agccaataat 6361 cagtgttatc atgtatgaaa caaaccaggt acaatattta ttgtgtaata ataataatag 6421 taataattat aattgtgtag tacaaagttt tggagttata ggacaggcac acttagaact 6481 gcctagacct aacaaaagaa taaggaacca aagctttaac caatataact gctctataaa 6541 taacaaaaca gaattagaaa catggaagtt agtaaagact tctggcgtaa ctcctttacc 6601 tatttcttct gaagctaaca ctggactaat tagacataag agagattttg gtataagtgc 6661 aatagtggca gctattgtag ccgctactgc tattgctgct agcgctacta tgtcttatgt 6721 tgctctaact gaggttaaca aaataatgga agtacaaaat catacttttg aggtagaaaa 6781 tagtactcta aatggtatgg atttaataga acgacaaata aagatattat atgctatgat 6841 tcttcaaaca catgcagatg ttcaactgtt aaaggaaaga caacaggtag aggagacatt 6901 taatttaatt ggatgtatag aaagaacaca tgtattttgt catactggtc atccctggaa 6961 tatgtcatgg ggacatttaa atgagtcaac acaatgggat gactgggtaa gcaaaatgga 7021 agatttaaat caagagatac taactacact tcatggagcc aggaacaatt tggcacaatc 7081 catgataaca ttcaatacac cagatagtat agctcaattt ggaaaagacc tttggagtca 7141 tattggaaat tggattcctg gattgggagc ttccattata aaatatatag tgatgttttt 7201 gcttatttat ttgttactaa cctcttcgcc taagatcctc agggccctct ggaaagtgac 7261 cagtggtgca gggtcctccg gcagtcgtta cctgaagaaa aaattccatc acaaacatgc 7321 atcgcgagaa gacacctggg accaggccca acacaacata cacctagcag gcgtgaccgg 7381 tggatcaggg gacaaatact acaagcagaa gtactccagg aacgactgga atggagaatc 7441 agaggagtac aacaggcggc caaagagctg ggtgaagtca atcgaggcat ttggagagag 7501 ctatatttcc gagaagacca aaggggagat ttctcagcct ggggcggcta tcaacgagca 7561 caagaacggc tctgggggga acaatcctca ccaagggtcc ttagacctgg agattcgaag 7621 cgaaggagga aacatttatg actgttgcat taaagcccaa gaaggaactc tcgctatccc 7681 ttgctgtgga tttcccttat ggctattttg gggactagta attatagtag gacgcatagc 7741 aggctatgga ttacgtggac tcgctgttat aataaggatt tgtattagag gcttaaattt 7801 gatatttgaa ataatcagaa aaatgcttga ttatattgga agagctttaa atcctggcac 7861 atctcatgta tcaatgcctc agtatgttta gaaaaacaag gggggaactg tggggttttt 7921 atgaggggtt ttataaatga ttataagagt aaaaagaaag ttgctgatgc tctcataacc 7981 ttgtataacc caaaggacta gctcatgttg ctaggcaact aaaccgcaat aaccgcattt 8041 gtgacgcgag ttccgcattt gtgacgcgtt aagttcctgt ttttacagta tataagtact 8101 tgtgttctga caattgggac tcagattctg cggtctgagt cccttctctg ctgggctgaa 8161 aaggcctttg taataaatat aattctctac tcagtccctg tctctagttt gtctgttcga 8221 gatcctacac aaaggattag caatcagaat aatattagac tactcaaaag caacacataa 8281 agccagaaga cagctgagtc atgttttcaa attctgacaa aaaattattt acaacttaga 8341 attc // LOCUS HIV2BEN 10359 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2), complete proviral genome. ACCESSION M30502 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 2 (isolate BEN) proviral DNA, clones MK[2,6]. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (sites; for [2]) AUTHORS Kirchhoff,F., Jentsch,K., D Bachmann,B., Stuke,A., Laloux,C., Lueke,W., Stahl-Henning,C., Schneider,J., Nieselt,K., Eigen,M. and Hunsmann,G. TITLE A novel proviral clone of HIV-2: Biological and phylogenetic relationship to other primate immunodeficiency viruses JOURNAL Virology 177, 305-311 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 10359) AUTHORS Kirchhoff,F., Jentsch,K., D Bachmann,B., Stuke,A., Laloux,C., Lueke,W., Stahl-Henning,C., Schneider,J., Nieselt,K., Eigen,M. and Hunsmann,G. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Kindly submitted prior to publication by Dr. F. Kirchhoff, DPZ, Gottingen, 0551/3851-0. HIV2BEN was isolated from a German AIDS patient(with predominantly neurological complications) who was probably infected in Mali (Klemm et al., J Neuro 235,304,1988). The sequence was determined from two clones: bases 1903 to 5057 and 7293 to 9131 were from MK2 and bases 1 to 1902, 5058 to 7292 and 9132 to 10359 were from MK6. FEATURES from to/span description pept 1103 2668 gag polyprotein pept < 2326 5754 pol polyprotein (NH2 terminus uncertain) pept 5423 6070 vif protein pept 5898 6239 vpx protein pept 6239 6502 vpr protein pept 6402 6697 tat protein, exon 2 (first expressed exon) 8861 8957 tat protein, exon 3 (AA at 8862) pept 6628 6697 rev protein, exon 2 (first expressed exon) 8861 9102 rev protein, exon 3 pept 6704 9286 env polyprotein pept 9120 9893 nef protein LTR 1 855 5' LTR LTR 9505 10359 3' LTR binding 859 875 primer (Lys-tRNA) binding site signal 10211 10216 poly-A signal BASE COUNT 3506 a 2132 c 2598 g 2123 t ORIGIN Left end of viral genome (5'LTR) 1 tgcaagggat gttttacagt aggaggagac atagaatcct agacatatac ctagaaaaag 61 aggaagggat aataccagat tggcagaatt atactcatgg gccaggagta aggtacccaa 121 tgtacttcgg gtggctgtgg aagctagtat cagtagaact ctcacaagag gcagaggaag 181 atgaggccaa ctgcttagta cacccagcac aaacaagcag acatgatgat gagcatgggg 241 agacattagt gtggcagttt gactccatgc tggcctataa ctacaaggcc ttcactctgt 301 acccagaaga gtttgggcac aagtcaggat tgccagagaa agaatggaag gcaaaactga 361 aagcaagagg gataccatat agtgaataac aggaacaacc atacttggtc aaggcaggaa 421 gtagctacta agaaacagct gaggctgcag ggactttcca gaaggggctg taaccaaggg 481 agggacatgg gaggagctgg tggggaacgc cctcatactt actgtataaa tgtacccgct 541 tcttgcattg tattcagtcg ctctgcggag aggctggcag atcgagccct gagaggttct 601 ctccagcact agcaggtaga gcctgggtgt tccctgctgg actctcacca gtacttggcc 661 ggtactgggc agacggctcc acgcttgctt gcttaaagac ctcttcaata aagctgccag 721 ttagaagcaa gttaagtgtg tgttcccatc tctcctagtc gccgcctggt cattcggtgt 781 tcatctgagt aacaagaccc tggtctgtta ggacccttct cgctttggga atccaaggca 841 ggaaaatccc tagcaggttg gcgcccgaac agggacttga agaggactga gaagccctgg 901 aactcggctg agtgaaggca gtaagggcgg caggaacaaa ccacgacgga gtgctcctag 961 aaaggcgcgg gccgaggtac caaaggcggc gtgtggagcg ggagtgaaag aggcctccgg 1021 gtgaaggtaa gtacctacac caaaaactgt agccagaaaa ggcttgttat cctaccttta 1081 gacaggtaga agattgtggg agatgggcgc gagaaactcc gtcttgagag ggaaaaaagc 1141 agacgaatta gaaaaagtta ggttacggcc cggcggaaag aaaaagtaca ggttaaaaca 1201 tattgtgtgg gcagcgaatg aattggataa attcggattg gcagagagcc tgttggagtc 1261 aaaagaaggt tgccaaaaga ttctcagagt tttagatcca ttagtaccaa cagggtcaga 1321 aaatttaaaa agccttttta ataccgtctg cgtcatttgg tgcttgcacg cagaagagaa 1381 agtgaaagat actgaggaag caaagaaact agcacagaga catctagtgg cagaaactgg 1441 aactgcagag aaaatgccaa atacaagtag accaacagca ccacctagtg ggaaaagagg 1501 aaactacccc gtgcaacaag cgggtggcaa ctatgtccat gtgccactga gcccccgaac 1561 tctaaatgca tgggtaaaat tagtggagga aaagaagttc ggggcagaag tagtgccagg 1621 atttcaggca ctctcagaag gctgcacgcc ctatgatatt aatcaaatgc ttaattgtgt 1681 gggcgatcac caagcagcta tgcaaataat cagagagatt attaatgaag aagcagcaga 1741 ctgggattcg cagcacccaa taccaggccc cttaccagca ggacagctca gagacccaag 1801 agggtctgac atagcaggaa caacaagcac agtagatgaa cagatccagt ggatgtatag 1861 gccacaaaat cccgtaccgg tagggaacat ctacagaaga tggatccaaa tagggctgca 1921 aaagtgtgtc agaaagtaca acccaactaa catcttagac ataaaacagg gaccaaaaga 1981 accgttccaa agctatgtag acaggttcta caaaagcttg agggcagaac aaacagaccc 2041 agcagtaaaa aattggatga cccaaacgct gctaatacag aatgccaacc cagactgcaa 2101 gttagtacta aaaggactgg ggatgaatcc caccctagaa gagatgctaa ccgcctgcca 2161 gggggtaggc ggaccaggcc agaaagccag gctaatggct gaagccctaa aagaggctat 2221 gggaccaagc cctatcccat ttgcagcagc ccaacaaaga aaggcaatta ggtattggaa 2281 ctgtggaaag gagggacact cggcaagaca gtgccgagca cctagaagac agggctgctg 2341 gaagtgtggc aagccaggac acatcatggc aaactgcccg gaaagacagg caggtttttt 2401 agggttgggc ccacggggaa agaagcctcg caacttcccc gtgacccaag cccctcaggg 2461 gctgatacca acagcacctc cggcagatcc agcagcggaa ctgttggaga gatatatgca 2521 gcaagggaga aagcagaggg agcagaggga gagaccatac aaagaggtga cggaggactt 2581 gctgcacctc gagcagagag agacacctca cagagaggag acagaggact tgctgcacct 2641 caattctctc tttggaaaag accagtagtc acagcgtaca tcgaggatca gccggtagaa 2701 gtcttactag acacaggggc tgatgactca atagtagcag gaatagaatt aggggacaat 2761 tacactccaa aaatagtagg gggaataggg ggatttataa acaccaaaga atacaaaaat 2821 gtagaaataa aagtactaaa taaaagagta agagccacca taatgacagg agatacccca 2881 atcaacatct ttggcagaaa tattctgaca gccttaggca tgtcattaaa tttaccagtt 2941 gccaagatag agccaataaa agtaacattg aagccaggga aagatggacc aaggctgaaa 3001 caatggcccc taacaaaaga gaaaatagaa gcactaaaag agatctgtga aaaaatggaa 3061 aaagagggcc agctagaaga ggcacctcca actaatcctt ataatacccc cacatttgca 3121 attaagaaaa aggacaagaa caaatggagg atgctgatag attttagaga actaaataag 3181 gtgactcaag atttcacaga aattcagcta ggaattccac acccggcagg actagccaaa 3241 aagaaaagga tctctatatt agatgtaggg gatgcctatt tttccatacc actacatgaa 3301 gattttaggc agtatactgc atttacccta ccagcagtaa acaatatgga accaggaaaa 3361 agatatatat ataaagtctt gccacaagga tggaagggat caccagcaat ttttcaatac 3421 acaatgaggc aagtcttaga acctttcaga aaagcaaacc cagatgtcat tctcatccag 3481 tacatggatg atatcttaat agctagtgac aggacaggtt tagagcatga caaagtggtc 3541 ctgcagctaa aagaacttct aaatggccta gggttttcta ctccagatga gaagttccaa 3601 aaagaccctc catttcaatg gatgggctgt gaactatggc caactaaatg gaagctgcag 3661 aaactacaac tgccccagaa agacatatgg acagtcaatg acatccaaaa gctagtggga 3721 gtcttaaatt gggcggcaca aatctattca ggaataaaaa ccaaacactt atgtagacta 3781 attagaggaa aaatgacact cacagaagaa gtgcagtgga cagaactagc agaagcagag 3841 ctagaagaaa acaaaattat cttgagccag gaacaagaag gatattatta ccaagaagaa 3901 aaagaattag aggcaacaat ccaaaaaagc caaggacatc aatggacata caaaatacac 3961 caggaagaga aaatcctaaa agtaggaaag tatgcaaaga taaaaaatac ccataccaat 4021 ggggtcagat tactagcaca ggtagttcag aaaataggaa aagaggcact agtcatttgg 4081 ggacggatac caaaatttca cctgccagtg gagagagaga cctgggagca gtggtgggat 4141 aactactggc aagtgacatg gatcccagag tgggactttg tatctacccc accactggtc 4201 aggttaacat ttaacctagt aggagatcct ataccaggcg cagagacctt ctacacagat 4261 ggatcatgca atagacagtc aaaagaggga aaagcaggat atgtaacaga tagaggaaaa 4321 gacaaagtaa aagtattaga acaaactacc aatcagcagg cagaattaga agtctttcgg 4381 atggcactgg cagactcagg cccaaaggtt aatatcatag tagattcaca gtatgtaatg 4441 gggatagtag caggccagcc aacagagtca gaaaatagaa tagtgaacca gatcatagaa 4501 gaaatgataa agaaggaagc agtctatgtt gcatgggtcc cagcccataa aggcatagga 4561 ggaaaccagg aagtagacca tttagtaagt caaggcatca gacaagtatt attcctggaa 4621 aagatagagc ccgctcaaga ggaacatgaa aaatatcata gcattataaa agaactaacc 4681 cataaatttg gaatacccct tctagtagca agacagatag taaactcatg tgcccaatgc 4741 caacagaaag gagaagccat acatgggcaa gtaaatgcag aaataggcgt ttggcaaatg 4801 gactacacac acttagaagg aaaaatcatt atagtagcag tacatgttgc aagtggattc 4861 atagaagcag aagtcatccc acaggaatca ggaaggcaga cagcactctt cctattaaaa 4921 ctggccagta ggtggccaat aacgcacttg cacacagaca atggccccaa cttcacttca 4981 caggaagtga agatggtggc atggtgggta ggtatagaac aatcctttgg agtaccttac 5041 aacccacaaa gccagggagt agtagaagca atgaatcacc acctaaagaa tcagataagt 5101 agaattagag aacaggcaaa tacaatagaa acaatagtac tgatggcagt tcattgcatg 5161 aattttaaaa gaaggggagg aataggggat atgaccccag cagaaagact aatcaacatg 5221 attaccacag aacaagaaat acaattcctc caaagaaaaa attcaaattt taaaaatttc 5281 caggtctatt acagagaagg cagagatcag ctgtggaaag gacctggtga actactgtgg 5341 aagggagaag gagcagtcat agtcaaggta gggacagaca taaaagtagt accaagaagg 5401 aaggccaaga ttatcaggga ctatggagga agacaggaac tggatagtag tccccacctg 5461 gagggtgcca gggaggatgg agaaatggca tgcccttgtc aagtacctga aatacagaac 5521 aaaagaccta gaggaggtgc gctatgttcc ccaccacaag gtgggatggg catggtggac 5581 ttgcagcagg gtaatattcc cactacaagg aaaaagtcat ctagaaatac aggcatattg 5641 gaacctaaca ccagaaaaag gatggctctc ctctcatgca gtaagattaa cctggtatac 5701 agaaaagttc tggacagatg ttaccccaga ctgtgcagac atcctaatac atagcactta 5761 tttctcttgc tttacggcag gtgaagtaag aagagccatc agaggggaaa agttattgtc 5821 ctgctgcaac tatccccaag ctcataaagc acaggtacca tcacttcaat acctagccct 5881 agtagtagta caacaaaatg acagacccca gagaaagggt accgccagga aacagtggag 5941 aagagaccat tggagaggcc ttcgagtggc tagagaggac catagaagcc ttaaacaggg 6001 aggcagtgaa ccatctgccc cgagagctca ttttccaggt gtggcaaagg tcctggagat 6061 attggcatga tgaacaaggg atgtcagcaa gctacacaaa gtatagatat ttgtgcctaa 6121 tgcaaaaagc tatatttaca catttcaaga gagggtgcac ttgctggggg gaggacatgg 6181 gccgggaagg attggaagac caaggacctc cccctcctcc ccctccaggt ctagtctaat 6241 gactgaagca ccaacagagt ttcccccaga agatgggacc ccacggaggg acttagggag 6301 tgactgggta atagaaactc tgagggaaat aaaggaagaa gccttaagac attttgatcc 6361 ccgcttgcta attgctcttg gctactatat ccataataga catggagaca cccttgaagg 6421 cgccagagag ctcattaaaa ccctacaacg agccctcttc gtgcacttca gagcgggatg 6481 taaccgctca agaattggct aaacaaggag aagaactcct tgcccagctg caccgacccc 6541 tagagccatg cactaacaaa tgctattgta agcgatgcag tttccattgc cagctgtgtt 6601 tctcgaaaaa ggggctcgga atatcatatg agcgaaaggg cagacgaaga aggactccaa 6661 ggaaaactaa gactccttcg ccttctgcac cagacaagtg agtatggagc ctggtaggaa 6721 tcagctgttt gttgtcattt tactaacaag tgcttgctta gtatattgta gccagtatgt 6781 gactgttttc tatggcatac ccgcgtggaa aaatgcatct attcccttat tttgtgcaac 6841 taaaaataga gacacttggg ggaccataca gtgcttgcca gacaatgatg attatcagga 6901 aataatttta aatgtgacag aggcttttga tgcatggaat aatacagtga cagaacaagc 6961 agtagaagat gtctggcatc tatttgagac atcaataaaa ccatgtgtca agctaacacc 7021 tctatgtgtg gcaatgaatt gtagcagggt tcaagggaat accacgaccc cgaatcccag 7081 gacctcgagt tccacaacct cgagaccacc cacatccgca gcctccataa taaatgaaac 7141 ttctaactgc atagaaaaca acacatgcgc aggattaggg tatgaggaga tgatgcaatg 7201 tgagttcaat atgaaggggt tagaacaaga taagaaaagg aggtataagg acacatggta 7261 tttagaagat gtggtttgtg acaacacaac agctggcaca tgttacatga gacattgcaa 7321 cacatcaatc atcaaagagt catgtgataa gcactattgg gatgctatga ggtttagata 7381 ctgtgcacca ccgggctttg ccctattaag atgtaatgat accaactatt caggctttga 7441 acctaagtgc actaaagtag tagctgcttc atgcacaagg atgatggaaa cgcaaacttc 7501 tacttggttt ggctttaatg gcactagagc agaaaataga acatatatct attggcatgg 7561 cagagataat aggactatca ttagcttaaa caagtattat aatctcacaa tgcgttgtaa 7621 gagaccagga aataagacag ttttaccaat aacacttatg tcaggattag tgtttcactc 7681 tcagccaatc aacacaaggc ctaggcaggc atggtgccgg tttggaggca gatggaggga 7741 agccatgcag gaggtgaagc aaacccttgt acaacatccc agatacaaag gaatcaatga 7801 tacagggaaa attaacttta cgaaaccggg agcaggctca gacccggaag tggcatttat 7861 gtggactaac tgcagaggag aatttctcta ctgtaacatg acttggttcc tcaattgggt 7921 agaagacaag aaccaaacac ggcgcaacta ttgccatata aagcagataa ttaatacctg 7981 gcataaagta gggaaaaatg tatatttgcc tcctagggaa ggggagttgg cctgtgaatc 8041 aacagtaacc agcataattg ctaacattga catagataaa aatcggactc ataccaacat 8101 tacctttagt gcagaagtgg cagaactgta ccgattagaa ctgggagact acaaattaat 8161 agaaataaca ccaattggct tcgcacctac agatcagaga aggtactcct caactccagt 8221 gaggaacaaa agaggtgtgt tcgtgctagg gttcttgggt tttctcgcga cagcaggttc 8281 tgcaatgggc gcgcggtccc tgacgctgtc agcccagtcc cggactttac tggccgggat 8341 agtgcagcaa cagcaacagc tgttggacgt agtcaagaga caacaagaaa tgttgcgact 8401 gaccgtctgg ggaacgaaaa acctccaggc aagagtcact gctatcgaga agtacctaaa 8461 gcatcaggca cagctaaatt catggggatg tgcgtttaga caggtctgcc acactactgt 8521 accgtgggta aatgactctt tatcgcctga ctggaaaaat atgacatggc aggagtggga 8581 gaaacaagtc cgctacctag aggcaaatat cagtcaaagt ttagaagaag cccaaattca 8641 acaagaaaag aatatgtatg aattacaaaa attaaatagc tgggatattc ttggcaactg 8701 gtttgactta acctcctggg tcaagtatat tcaatatgga gtgcatatag tagtgggaat 8761 aatagcttta agaatagcaa tctatgtagt gcaattgtta agtagattta gaaagggcta 8821 taggcctgtt ttctcttccc cccccggtta tctccaacag atccatatcc acaaggaccg 8881 gggacagcca gccaacgaag gaacagaaga agacgtcgga ggcgacagtg gttacgactt 8941 gtggccttgg ccaataaact atgtgcagtt cctgatccac ctactgactc gcctcttgat 9001 cgggctatac aacatctgca gagacttact atccaagaac tccccgaccc gccgactgat 9061 ctcccagagt ctaacagcaa tcagggactg gctgagactt aaggcggccc aactgcaata 9121 tgggtgcgag tggatccaag aagctttcca agcattcgcg aggactacga gagagactct 9181 tgcgggcgcg tggggatggt tatgggaagc agcgcgacgc atcgggaggg gaatactcgc 9241 agttccaaga agaatcaggc agggagcaga actcgccctc ctgtgaggga cagcagtatc 9301 agcagggaga gtacatgaac agcccatgga gaaacccagc aacagaaaga cagaaagatt 9361 tgtataggca gcaaaatatg gatgatgtag attctgatga tgatgaccta ataggagttc 9421 ctgttacacc aagagtacca cggagagaaa tgacctataa attggcaata gatatgtcac 9481 attttataaa agaaaaaggg ggactgcaag ggatgtttta cagtaggagg agacatagaa 9541 tcctagacat atacctagaa aaagaggaag ggataatacc agattggcag aattatactc 9601 atgggccagg agtaaggtac ccaatgtact tcgggtggct gtggaagcta gtatcagtag 9661 aactctcaca agaggcagag gaagatgagg ccaactgctt agtacaccca gcacaaacaa 9721 gcagacatga tgatgagcat ggggagacat tagtgtggca gtttgactcc atgctggcct 9781 ataactacaa ggccttcact ctgtacccag aagagtttgg gcacaagtca ggattgccag 9841 agaaagaatg gaaggcaaaa ctgaaagcaa gagggatacc atatagtgaa taacaggaac 9901 aaccatactt ggtcaaggca ggaagtagct actaagaaac agctgaggct gcagggactt 9961 tccagaaggg gctgtaacca agggagggac atgggaggag ctggtgggga acgccctcat 10021 acttactgta taaatgtacc cgcttcttgc attgtattca gtcgctctgc ggagaggctg 10081 gcagatcgag ccctgagagg ttctctccag cactagcagg tagagcctgg gtgttccctg 10141 ctggactctc accagtactt ggccggtact gggcagacgg ctccacgctt gcttgcttaa 10201 agacctcttc aataaagctg ccagttagaa gcaagttaag tgtgtgttcc catctctcct 10261 agtcgccgcc tggtcattcg gtgttcatct gagtaacaag accctggtct gttaggaccc 10321 ttctcgcttt gggaatccaa ggcaggaaaa tccctagca // LOCUS HIV2D205 7817 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2), gag, pol, vif, vpX, vpR, partial env. ACCESSION X16109 KEYWORDS . SOURCE Human immunodeficiency virus type 2 (isolate D205) viral RNA. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 7817) AUTHORS Dietrich,U., Adamski,M., Kreutz,R., Seipp,A., Kuhnel,H. and Rubsamen-Waigmann,H. TITLE A highly divergent HIV-2-related isolate JOURNAL Nature 342, 948-950 (1990) STANDARD full staff_review COMMENT This isolate, also called HIV2-ALT by the authors, was taken from a healthy Ghanian woman. Tree analysis reveals that it likely arose prior to the branching of HIV2s and SIVs, i.e. the sooty mangabey and macaque isolates. [1] calls attention to an extension of the gag-pol overlap region resulting in a dilated (by 20 residues) pol orf. The env cds begins at 6671; tat-exon1 at 6375 and rev-exon1 at 6601, with the possible donor splice site at 6665. FEATURES from to/span description pept 1079 2644 gag polyprotein pept < 2299 5475 pol polyprotein (NH2 terminus uncertain) pept 5396 6046 vif protein pept 5877 6212 vpX protein pept 6212 6517 vpR protein pept 6375 / 6664 tat protein, exon 2 (first expressed exon) pept 6601 / 6664 rev protein, exon 2 (first expressed exon) pept 6744 > 6875 env polyprotein site 6876 6878 env polyprotein in-frame stop codon BASE COUNT 2705 a 1602 c 1930 g 1580 t ORIGIN 5'-terminus of 5'LTR 1 tggaagggat gtattatagt gagagaagac acagaatatt agacacatat tttgagaatg 61 aagaaggcat tgtgtctggc tggcaaaact atactcatgg gccagggata aggcatccca 121 aatactttgg ttggctgtgg aagctggtac cagtagaggt gccagcagcg acccgagagg 181 aggaggaaac ccattgccta atgcacccgg cacagatctc ctcatgggat gacatccatg 241 gggagactct tatctggcag tttgattccc tcctggcata tgattatgtg gctttcaata 301 ggtttccaga agagttcggg taccagtcag ggctaccaga ggaggagtgg aaggctagac 361 taaaagcaag agggatacct acagattagg caagagacag cagcataaac aggaactagc 421 tgacactgca caagaaggaa actagcagac actgcaggga ctttccaaaa gaggctgtaa 481 catgggagga gcaagggagg ggttggttgg gaacgcccat taaatctctg tataaatgta 541 cccgcttctc gcattgtatt cagtcgctct gcggagaggc tggcagattg agccctggga 601 ggttctctcc agcactagca ggtagagcct gggtgttccc tgctagactc tcaccagcac 661 taggccagtg ttgggcagac ggctccacgc ttgcttgctt aaaaccctct taataaactg 721 ccaattagaa gcaagtaagt gtgtgttccc atctctccta gtcgcgcctg gtcactcggt 781 gctccactga tagaccctgg tctgttagga cccttcttgc tttgggaaac cgaagcggga 841 aaatccctag cagattggcg cccgaacagg gacctgggaa gagtgagagt cttcaagagc 901 acagctgagt gaaggcagta agggcggcag gaatcaacca cgacggagag ctcctgtaaa 961 agcgcaggcc ggtaccaggc agcgtgagga gcgggaggag aagaggctcc gggagtaagg 1021 taagtactcc aaattttgca ttgtctactc taaagaggga gtagggcata agtgggagat 1081 gggcgcgaga ggctccgtcc tgtcagggaa gaaaacagat gaattagaaa aagttaggtt 1141 acggcccggc ggaaagaaaa agtatatgct aaaacatgta gtatgggcag tgaacgaatt 1201 agacagattt ggattagcag agagcctgtt ggaatcaaaa gaaggatgtc aaaaaattct 1261 taaagtctta gctccattag taccaacagg gtcagaaaat ttaaaaagcc tttttaatat 1321 cgtctgcgtc attttttgcc tgcacgcaga agagaaagtg aaagatacag aggaagcaaa 1381 aaagatagca cagagacatc tagcggcgga cacagaaaaa atgccagcta caaataaacc 1441 aacagcacca cctagcggcg gaaattatcc agtgcagcaa ctggctggca actacgtcca 1501 cctgccgcta agcccccgaa ccttaaatgc ttgggtaaag ttagtagaag aaaagaagtt 1561 cggggcagaa gtagtaccag gatttcaggc actatcagaa ggatgcaccc cttatgatat 1621 aaatcagatg ctaaattgtg taggagaaca tcaggcagcc atgcaaatta ttagagaaat 1681 aatcaatgag gaagcagcag actgggacca gcaacacccg tcaccaggcc caatgccggc 1741 aggacaactt agggacccaa gagggtcaga tatagcagga accaccagca cagtagagga 1801 acagatacag tggatgtaca gggcccaaaa tcctgtccca gtgggaaaca tttatagaag 1861 atggattcaa ttaggattgc agaaatgtgt ccgaatgtac aatcctacca acatattaga 1921 cataaagcag ggaccaaagg agcccttcca aagctatgta gatagattct acaaaagctt 1981 acgggcagaa caaacagacc cagcagtgaa aaattggatg acacaaacac tgctgattca 2041 gaatgctaac ccagattgca agttagtgct taagggcttg ggaatgaatc ccaccttaga 2101 ggaaatgcta acggcctgcc aagggatagg aggcccaggg cagaaggcaa ggctaatggc 2161 cgaagcctta aaagaggccc taacacctgc acccataccg tttgctgccg ttcaacaaaa 2221 agcagggaag agagggacag tgacatgctg gaactgtggc aaacagggac acacagccag 2281 gcaatgcagg gcccctagaa gacagggatg ctggaaatgt ggaaaaacag gacacatcat 2341 gtcaaaatgc ccagaaagac aggcgggttt tttagggtta ggaccctggg gaaagaagcc 2401 tcgcaacttc cccatgaccc aagtgcctca gggagtgaca ccatctgcac ccccgatgaa 2461 cccagcagag ggcatgacac ctcggggggc gacaccatct gcgccccctg cagatccagc 2521 agtggagatg ctgaaaagtt acatgcagat ggggagacaa cagagagaga gccgagagag 2581 accctacaag gaggtgacag aggatttgct gcacctcaat tctctctttg gagaagacca 2641 gtagtcaaag catgtatcga gggtcagtca gtagaagtat tactagacac aggagttgac 2701 gactcaatag tagcagggat agaattaggt agcaattaca ccccaaaaat agtaggaggg 2761 ataggagggt tcataaatac caaagaatac aaagatgtag aaatagaagt agtgggaaaa 2821 agagtaaggg caactataat gacaggagat accccaataa acatttttgg cagaaatatt 2881 ttaaatacct tgggcatgac tttaaatttc ccagtggcaa aggtagaacc agtaaaagtt 2941 gagttaaaac ctggaaaaga tgggccaaag atcagacaat ggcctctatc cagggaaaag 3001 atactagccc tcaaagaaat ctgtgaaaaa atggaaaagg agggacaatt agaagaagca 3061 ccccctacta atccatacaa cacacccacc tttgccataa agaagaaaga taagaacaaa 3121 tggagaatgc taatagactt tagagagtta aacaaggtaa cccaggactt cacagaagtc 3181 aactgggtat tccccacccg gcaggtggca gaaaaaagga gaataacagt aatagatgtg 3241 ggagatgcct acttcagtat cccactagat ccaaacttca gacagtatac agcattcacc 3301 ttgccatcag taaacaatgc agagccagga aagagataca tttataaagt tctaccacaa 3361 gggtggaagg ggtcccagtc aatttgtcaa tactccatga gaaaggtatt agatcctttt 3421 agaaaggcca acagcgatgt cattataatt cagtacatgg atgacatcct tatagcaagt 3481 gacagaagtg atctggagca cgacagggta gtgtcccaac taaaagagtt attaaatgac 3541 atgggattct ctaccccaga agaaaagttc caaaaagacc ctccgttcaa atggatgggt 3601 tatgagctct ggccaaaaaa gtggaaactg caaaaaatac aactgccaga aaaagaagtt 3661 tggacagtga atgcaattca aaaactggta ggagtattaa actgggcagc tcaactcttt 3721 cctggaatta agacaaggca catatgcaaa ctaattaggg gaaagatgac cctaacagaa 3781 gaagtacagt ggacagaact agcagaagca gagctacagg agaataaaat catcttagaa 3841 caggaacaag aaggatccta ctacaaggaa agggtaccgc tagaagcaac agtacagaaa 3901 aacctagcaa atcagtggac atacaaaatt catcagggaa ataaagtcct aaaagtagga 3961 aaatatgcaa aggttaaaaa cacgcacacc aacggggtaa gactactggc acatgtagtt 4021 cagaaaatag gcaaagaagc cctagtcatc tggggagaga taccagtgtt ccatctgcca 4081 gtagaaagag agacatggga ccagtggtgg acagattact ggcaagtaac ctggatccca 4141 gagtgggact ttgtctcgac cccaccatta ataagactag cctacaacct agtcaaagac 4201 cccctagaag ggagagaaac ctactacaca gatgggtcct gcaatagaac ctcaaaggaa 4261 ggaaaagcag gatatgtcac tgacagggga aaagataagg ttaaagtgtt agaacagaca 4321 acaaaccaac aagcagaact tgaagcattt gcattagcat taacagactc agaaccacaa 4381 gttaacatca tagtagattc acaatatgtc atgggaataa tagctgcaca gccaacagaa 4441 acagaatcac caatagtagc aaaaataatt gaagaaatga tcaaaaaaga ggcagtatat 4501 gtaggatggg taccagctca caagggactg ggtggtaatc aggaagtaga ccacctagta 4561 agtcaaggaa tcagacaggt cttgttccta gaaaaaatag aaccagccca ggaagagcat 4621 gaaaaatatc atggcaatgt aaaagaactg gtccataaat tcggaattcc acaattagtg 4681 gcaaaacaga tagtaaattc ctgtgataaa tgccaacaaa aaggggaagc tattcatgga 4741 caggtaaatg cagacctagg gacatggcag atggactgta cacatttaga aggaaaaatt 4801 ataatagtgg cagtccatgt agccagtggg tttatagaag cagaggtaat accccaagag 4861 acaggaagac agacagctct cttcctacta aagttggcca gcagatggcc tatcacacac 4921 ctacacacag acaacggtgc caacttcacc tcaccaagtg taaagatggt agcctggtgg 4981 gtaggaatag aacaaacttt tggagtaccc tataacccac aaagtcaagg agtagtggaa 5041 gcaatgaacc atcacctgaa aaatcaaata gacagactca gagaccaagc agtatcaata 5101 gagacagttg tactaatggc aactcactgc atgaatttta aaagaagggg aggaataggg 5161 gatatgaccc ctgcagaaag actagttaac atgataacca cagagcaaga aatacagttc 5221 ttccaagcaa aaaatttaaa atttcaaaat ttccaggtct attacagaga aggcagagat 5281 caactctgga agggacctgg tgaactattg tggaaagggg aaggagcagt catcataaag 5341 gtagggacag aaatcaaagt agtacccagg agaaaagcaa aaattataag gcactatgga 5401 ggaggaaaag gattggattg tagtgccgac atggaggata ccaggcaggc tagagagatg 5461 gcacagtctg attaagtatc ttaagtatag aacaggagag ttgcaacagg tctcttatgt 5521 ccctcaccac aaggtaggat gggcttggtg gacttgcagt agaataatat ttcccctaaa 5581 caaaggagca tggctagaag tccaaggata ttggaaccta accccagaaa ggggattctt 5641 gagctcctat gctgtaagac taacatggta tgagaggaac ttttatacag atgtaacacc 5701 tgatgtggca gaccagctac tgcatgggtc ttatttctct tgcttttcag ccaatgaagt 5761 aaggagagcc atcaggggag aaaagatatt gtcctactgc aactatccat cagctcacga 5821 agggcaggta ccaagcttac agtttctagc cctaagggtc gtacaggaag gaaaaaatgg 5881 atcccaggga gagagtgcca ccaggaaaca gcgacgaaga aacagtagga gaagcattcg 5941 cttggctaga aagaacaata acagagctca acagggtagc ggtcaaccat ttgccccgag 6001 aacttatttt ccaggtctgg cagaggtctt gggcatactg gcgtgaggaa cagggcatgt 6061 caattagcta taccaaatat agatacttgt tgctaatgca gaaagcaatg tttgtgcact 6121 atacaaaggg ctgtaggtgc ctgcaggagg gccatgggcc agggggatgg agatcaggac 6181 ctcctcctcc tcctccccca ggcctggcct aatggcagaa gcagccccag agatccctcc 6241 agagaacgag aacccacaaa gagaaccgtg ggaagagtgg ataggggaga tcctggagga 6301 aataaagcaa gaagccttaa agcattttga tcctcgcttg ctaactgcgc ttggtaactt 6361 tatctacagt aggcatggag atacccttgc aggagcagga gagctcatta aaatcctcca 6421 acgagccctc ttcctccact tcagagccgg ttgtcaacac tcaaggattg gacaatcagg 6481 gggaggaaat cctctctcaa ctataccgcc cccttaaggc atgcgataat acatgctact 6541 gtaagaaatg ctgctaccat tgccagcttt gttttcttaa aaagggtctt gggatatgtt 6601 atgaccgctc gagaaggaga tctgcaaaaa gagctaagac tactgcacct tctgcaccag 6661 acaagtgagt atggcatatt ttagcagccg cctgcctatt gcgctcctgc ttataggtat 6721 cagtgggttt gtatgtaaac aatatgttac tgtcttctat ggcatacccg catggaggaa 6781 cgcaacagtt cccctcattt gtgcaaccac aaacagagac acctggggaa ctgtacagtg 6841 tctcccagac aatggtgact acactgagat caggctaaac ataacagagg cttttgatgc 6901 atgggataat acagtgacac aacaggcagt agatgatgtg tggagactct ttgaaacctc 6961 cataaaacca tgtgtcaaac taaccccact gtgtgtggca atgaactgta gtaaaaccga 7021 aacaaaccca gggaatgcca gtagtactac caccactaag cctactacca cctctcgtgg 7081 gctgaaaacg attaacgaaa cagacccatg cataaaaaat gacagctgca caggactagg 7141 agaagaggaa ataatgcaat gtaattttag tatgacggga ctaagaagag atgagctaaa 7201 acaatataaa gacacctggt actcagaaga tttagagtgt aataatacca ggaagtatac 7261 cagcagatgc tatataagaa cctgcaacac aacaattatc caagagtcat gtgacaaaca 7321 ttattgggac agcttaaggt ttaggtattg tgctcccccg gggttttttc tactaagatg 7381 taatgatacc aactattcag gcttcatgcc caactgcagt aaggtagtag cgtcctcctg 7441 cacaagaatg atggaaacac agtcctctac atggtttggc ttcaatggta caagggcaga 7501 gaacaggaca tatatatatt ggcatgaaaa agacaatagg accatcataa gcttaaatac 7561 atactataat ttgtcaatac actgtaagag gccaggaaac aagacggttg taccaataag 7621 aaccgtgtca ggactacttt tccattcaca gcctatcaat aagagaccca gacaagcttg 7681 gtgctggttt aagggaaact ggacagaagc cataaaggag gtgaaaagga ccatcataaa 7741 acatcccagg tataaaggag gtgcaaaaaa tatcacaagc gtaaagttag tatcagaaca 7801 tggaaaaggt tcagatc // LOCUS HIV2GH1 9480 bp ds-DNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 proviral DNA, gag, pol, vif, vpX, vpR, tat, rev, env and nef genes. ACCESSION M30895 KEYWORDS . SOURCE Human immunodeficiency virus type 2 MOLT-4 cell (isolate GH1) proviral DNA, clone 8. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9480) AUTHORS Hasegawa,A., Tsujimoto,H., Maki,N., Ishikawa,K.-I., Miura,T., Fukasawa,M., Miki,K. and Hayami,M. TITLE Genomic divergence of HIV-2 from Ghana JOURNAL AIDS Res. Hum. Retroviruses 5, 593-604 (1989) STANDARD full staff_review COMMENT GH1 was isolated from a Ghanian AIDS patient; virus was cloned in MOLT-4 clone 8 cells. FEATURES from to/span description pept 544 2112 gag polyprotein pept < 1770 4937 pol polyprotein pept 4867 5514 vif protein pept 5342 5680 vpX protein pept 5680 5997 vpR protein pept 5843 6138 tat protein, exon 2 (first expressed exon) 8281 8377 tat protein, exon 3 (AA at 8282) pept 6069 6138 rev protein, exon 2 (first expressed exon) 8281 8522 rev protein, exon 3 (AA at 8283) pept 6148 8364 env polyprotein (see COMMENT) pept 8540 9307 nef protein BASE COUNT 3257 a 1957 c 2349 g 1917 t ORIGIN 1 cagtcgctct ggcagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctagactct caccagtgct tggccggcac tgggcagacg 121 gctccacgct tgcttgctta aagacctctt aataaagctg ccaattagaa gcaagttaag 181 tgtgtgttcc catctctcct agtcgccgcc tggtcattcg gtgttcatct gaataacaag 241 accctggtct gttaggaccc ttccgctttg ggaatccaag gcaggaaaat ccctagcagg 301 ttggcgccca acagggactt gaaagggact gagaagccct ggaacacggc tagtgaaggt 361 agtaaggcgg caggaacaaa ccacgacgga gtgctcctag aaaagcgcgg gccgaggtac 421 caaaggcggc gtgtggagcg ggagtgaaag aggcctccgg gtgaaggtaa gtgcctacac 481 cgaaaaccgt agccagaaaa ggctagttat cctatctttt agacaggtag aagattgtgg 541 gagatgggcg cgagaaactc cgtcttgaga gggaaaaaag cagatgaatt agaaaagatt 601 aggttacggc ccagcggaaa gaaaaagtac aggctaaaac atattgtgtg ggcagcgaac 661 gaattggaca aattcggatt ggcagagagc ctgttggagt caaaagaagg ttgccaaaag 721 attctcacag ttttagatcc attagtacca acagggtcag aaaatttaaa aagccttttt 781 aacacagtct gcgtcatttg gtgcttgcac gcagaagaga aagtgaaaga tactgaagaa 841 gcaaaaaaac tagtacagag acatctaggg gcagaaacag gaactgcaga aaaaatgcca 901 agtacaagta gaccaacagc accacctagc gggagaggaa gaaacttccc tgtacaacag 961 acaggcggtg gcaactatat ccacgtgcca ctgagccccc gaactctaaa tgcttgggta 1021 aaattagtag aggacaagaa gttcggggca gaagtagtgc caggatttca agcactctca 1081 gaaggctgca cgccctatga tatcaaccaa atgcttaatt gtgtgggcga tcaccaagca 1141 gctatgcaaa taatcagaga gattatcaat gacgaagcag cagattggga tgcacagcac 1201 ccaataccag gccccttacc agcagggcag cttagagacc caagggggtc tgacatagca 1261 ggaacaacaa gcacagtaga agaacagatc cagtggatgt ataggccaca aaatcccgtg 1321 ccggtaggga acatctacag aagatggatc cagatagggc tacagaagtg tgtcaggatg 1381 tacaacccaa ctaacatctt agacgtaaag cagggaccaa aggaaccgtt ccagagctat 1441 gtggacaggt tctataaaag cttgagggca gaacaaacag atccggcagt aaagaactgg 1501 atgacccaaa cgctgctaat acagaatgcc aacccagact gcaagttagt actaaaagga 1561 ctggggatga atcccaccct agaagagatg ctgactgcct gtcagggggt aggtggacca 1621 ggccaaaaag ccagactaat ggccgaagcc ctgaaggagg ctttgacacc accccctatc 1681 ccatttgcag cagcccaaca aagaaaggta attaggtgtt ggaactgcgg aaaggaggga 1741 cactcggcga gacagtgccg agcgcctaga agacagggct gctggaaatg tggcaagaca 1801 ggacacgtca tggcaaaatg cccagaaaga caggcaggtt ttttagggat gggtccatgg 1861 ggaaagaagc cccgcaactt ccccgtggcc caagctcctc cggggctgat accaacagca 1921 cccccagcag atccagcagt ggatctattg gaaagatata tgcagcaggg gagagagcag 1981 agggagcaga gggagagacc atacaaagag gtgacggaag acttactgca cctcgagcag 2041 ggaaaagcac ctcacagaga ggcgacagag gacttgctgc acctcaattc tctctttgga 2101 aaagaccagt agtcacagca tacatcgagg tacagccggt agaagtctta ttagacacag 2161 gggctgatga ctcaatagtg gcaggaatac agttagggga caattacgtt ccaaaaatag 2221 tggggggaat agggggattc ataaacacca aagagattaa aaatatagaa ataaaagtac 2281 taaataaaag agtaagagcc accataatga caggagatac cccaatcaac atctttggca 2341 gaaatatcct gacagcctta ggcatgtcat taaatttacc aatcgccaag atagaaccaa 2401 taaaagtaac gttgaagcca gggaaagatg gaccaagact gagacaatgg cccctaacaa 2461 aagagaaaat agaagcacta agagagatct gcgaaaaaat ggaaaaggag ggccaactag 2521 aagaggcacc tccaactaat ccttataata cccccacatt tgcaattaag aaaaaggaca 2581 agaacaaatg gaggatgcta atagacttta gagaactaaa cagggtgact caagatttca 2641 cagaaattca gctaggaatt ccacacccgg caggattagc caaaaagaaa aggatcactg 2701 tactagatgt aggggatgcc tacttttcca taccgctaca tgaagatttt aggcagtata 2761 ctgcatttac cctaccatca gtaaacaatg cagaaccagg aaaaagatat atatataagg 2821 tcttaccaca aggatggaag ggatcaccag caatttttca acacacaatg aggcaagtct 2881 tagaaccttt cagaaaagca aacccagatg tcattctcat ccaatacatg gatgatattt 2941 taatagctag tgacaggaca ggtttagagc atgacaaagt agtcctgcag ctaaaagaac 3001 ttctgaatgg cctagggttc tctaccccag atgagaagtt ccaaaaagac cctccgcttc 3061 aatggatggg ctatgaacta tggccaacta aatggaagct gcagaaatta caactgcccc 3121 agaaagaaat atggacagtc aacgacatcc agaagctggt gggagtctta aattgggcgg 3181 cacaaatcta tccaggaata aaaaccaaac atttgtgtag gctaatcaaa ggaaaaatga 3241 cactcacaga agaagtgcag tggacagagt tagcagaagc agagctagaa gaaaacaaaa 3301 ttatcttaag ccaggaacaa gaaggatact attaccaaga agaaaaagaa ttagaggcaa 3361 caatccaaaa aaatcaagac aatcagtgga catacaaaat acaccaggaa gagaaaatcc 3421 taaaagtagg gaaatatgca aagataaaaa atacccatac caatggggtc agattactag 3481 cacaggtagt tcagaaaata ggaaaagaag cactagtcat ctggggacgg ataccaaaat 3541 tccacttacc ggtggagaga gagacctggg agcagtggtg ggataactac tggcaagtga 3601 catggatccc agagtgggac ttcgtatcta ccccaccact agtcaggtta acatttaacc 3661 tagtaggaga tcctatacca ggcgcagaga ccttctacac agatggatca tgcaatagac 3721 agtcaaagga aggaaaagca cgatatgtga cagatagagg aagagacaag gtaagagtac 3781 tagaacgaac taccaatcag caagcagaat tagaagcctt tgcgatgaca ctgacagact 3841 cgggtccaaa agttaatatc atagtagact cacagtatgt aatgggaata gtagtaggcc 3901 aaccaacaga gtcagaaagt agaatagtaa accaaatcat tgaggacatg ataaagaaag 3961 aagcagtcta tgtcgcgtgg gtcccagccc ataaaggcat aggaggaaac caggaagtag 4021 accatttagt aagtcagggc atcagacaag tattgttcct agaaaggata gagcccgccc 4081 aagaagaaca tgaaaaatat catagcaata tgaaagaact aacccataaa tttggaatac 4141 cccaactagt agcaagacag atagtaaaca catgtgccca atgccaacag aaaggagagg 4201 ccatacatgg gcaggtaaat gcagaaatag gtgtctggca aatggactgc acacacttag 4261 aaggaaagat cattatagta gcagtacatg ttgcaagtgg attcatagaa gcagaagtta 4321 tcccacagga atcaggaagg cagacagcgc tcttcctatt aaaactggcc agtaggtggc 4381 caataacaca cttacacaca gacaatggct ccaacttcac ttcacaggaa gtgaagatgg 4441 tggcatggtg gataggtata gagcaatcct ttggagtacc ttacaaccca caaagccagg 4501 gagtagtaga agcaatgaat caccacttaa agaatcagat aagtagaatt agagaacagg 4561 caaatacaat agaaaccata gtactaatgg cagttcattg catgaatttt aaaagaaggg 4621 gaggaatagg ggatatgacc ccagcagaaa gactaatcaa catgattacc acagaacaag 4681 aaatacaatt cctccaaaga aaaaattcaa attttaaaaa cttccaggtc tattacagag 4741 aaggcagaga tcagctgtgg aaaggacctg gagaactact gtggaaggga gacggagcag 4801 tcatagtcaa ggtaggggca gacataaaag taataccaag aaggaaggcc aagattatca 4861 gagactatgg aggaaggcaa gaactggata gttcccacct ggagggtgcc agggaggagg 4921 atggagaggt ggcatagtct tgtcaagtac ctaaaataca gaacaagaga cctagaggag 4981 gtgcgctatg ttccccacca caaggtggga tgggcgtggt ggacttgcag cagggtaata 5041 ttccctctaa aaggagaaag tcatctagag atacaggcat attggaacct aacgccagaa 5101 aaaggatggc tctcctctca ttcagtgaga ataacctggt atacagaaag gttctggaca 5161 gatgttaccc cagactatgc agacatccta atacatagca cttatttctc ttgctttacg 5221 gcaggtgaag taagaagagc catcagaggg gaaaagttat tgtcctgctg caactatccc 5281 caagctcata aagtacaggt accatcactt caatacctag ccctagtggt agtgcaacaa 5341 aatgacagac cccagagaaa gggtaccgcc aggaaacagt ggagaagaga ccattggaga 5401 ggccttcgag tggctagaca ggactataga agccttaaac agagaggcag tgaaccatct 5461 gccccgagag ctcattttcc aggtgtggca aaggtcctgg agatactggc atgatgacca 5521 agggatgtca ccaagctaca caaagtatag atatttgtgc ctgatgcaga aagctgtatt 5581 tatacatttc aagagagggt gcacttgcct ggggggagga catggaccgg gaggatggag 5641 atcaggacct ccccctcctc cccctccagg tctagtctaa tgactgaagc accaacagag 5701 tttcccccag aagatgggac cccacggagg gaactagggg gtgactgggt aataagaatt 5761 ctgggagaaa taaaggaaga agccttaaag cattttgatc cccgcttgct aatcgctctt 5821 ggcaactata tccatagtag acatggagac acacctgaag gcgccagaga gctcattaga 5881 atcctacaac gagccctctt cgtgcacctc agagcagggt gtaaccgctc aagaattagc 5941 caaacaagga gaagaactcc tttcccagct gcaccgaccc ctagaggcat gtactaactc 6001 atgctattgt aagcagtgca gcttccattg ccagctgtgt ttcttgaaaa aagggctcgg 6061 gatatggtat gcacgaaaaa gcagacggag aagaactcca agaaagacta agactcattc 6121 gtcttctgca tcagacaagt gagtaagatg tgtggtaaga gtctactatg tgttgccagc 6181 ttgctagcta gtgcttactt agtatattgc acccaatatg tgactgtttt ctatggcgtg 6241 cccgtgtgga gaaatgcatc cattcccctc ttttgtgcaa ctaaaaacag agatacttgg 6301 ggaaccatac agtgcaagcc agacaatgat gattatcaag aaataacctt gaatgtgaca 6361 gaggctttcg acgcatggga taatacagta acagaacaag cagtagaaga tgtctggagt 6421 ctatttgaga catcaataaa accatgcgtc aaactaacac ctctatgtgt agcaatgagt 6481 tgtaacagca ccactaacaa cacaaccaca acagggagca caacagggat gtcagagata 6541 aatgaaactt ctccatcata ctcagacaac tgcacaggat tggggaagga agagatagtc 6601 aattgtcagt tctatatgac aggattagag agagataaga aaaaacagta taatgaaaca 6661 tggtactcaa aagatgtagt ttgtgaatca aataatacca aagatgggaa aaacagatgt 6721 tacatgaacc attgcaacac atcagtcatc acagagtcat gtgataagca ctattgggat 6781 gctataaagt ttagatactg tgcaccaccg ggttatgccc tactaagatg caatgatacc 6841 aattattcag gcttcgagcc aaagtgttct aaagtagtag cttctacatg cacaaggatg 6901 atggaaacgc aaacctccac ttggtttggc tttaatggca ctagggcaga gaatagaaca 6961 tatatctatt ggcatggcag agataatagg actatcatca gcttaaacaa gtattataac 7021 ctttctatac attgtaagag accaggaaac aagacagttg taccaataac actcatgtca 7081 gggttagtgt ttcactccca gccaatcaat acaagaccta ggcaagcatg gtgctggttc 7141 aaaggcaagt ggagggaagc catgcaggag gtaaagcaaa cccttataaa acatcccagg 7201 tataaaggaa ccaacgacac aaagaatatt aactttacaa aaccaggaag aggctcagac 7261 ccagaagtgg catacatgtg gactaactgc agaggagaat ttctctactg caacatgact 7321 tggttcctca attgggtaga aaacagaccg aatcagacac agcacaatta tgcgccgtgc 7381 catataaggc aaataattaa tacctggcac aaggtaggga aaaatgtata tttgcctcct 7441 agagaaggac agttgacctg caactcaaca gtgaccagca taattgctaa cattgacgta 7501 aatagtaacc agacaaatat tacctttagt gcagaggtgg cagaactata ccgattagaa 7561 ttgggagatt ataaattaat agaagtaaca ccaattggct tcgcacctac gagagagaaa 7621 agatattcct cggctccagt gaggaacaaa agaggtgtgt ttgtgctagg gttcttgggt 7681 tttctcgcga cagcaggttc tgcaatgggc gcggcgtccc tgacgctgtc ggctcagtcc 7741 cggactttac tggccgggat agtgcagcaa cagcaacagc tgttggacgt agtcaagaga 7801 caacaagaaa tgttgcgact gaccgtctgg ggaacgaaaa atctccaggc aagagtcact 7861 gctatcgaga agtacttaaa ggaccaggca cagttaaatt catgggggtg tgcgtttagg 7921 caggtctgcc acactactgt accatgggta aatgattctt tatcgcctga ctggaacaat 7981 atgacatggc aggaatggga aaaacaagtc cgctacctag aggcaaatat cagtcaaagt 8041 ttagaacagg cacaaattca acaagaaaag aatatgtatg aattacaaaa gctaaatagc 8101 tgggatgtct ttggcaactg gtttgattta acctcctgga tcaagtatat tcaatatgga 8161 gtttatatag tagtaggagt gatagtttta agaatagcaa tatatatagt gcaattgtta 8221 agtagactta gaaagggcta taggcctgtt ttctcttccc ctcccggtta tctccaacag 8281 atccatatcc acacggaccg gggacagcca gccaacgaag gaacagaaga agacgacaga 8341 gacgacgatg gttacgactt gtagccttgg ccgataaact atatacattt cctgatccac 8401 ctactgactc gcctcttgac cgggctatac aagatctgca gagacttact atccacgaac 8461 tccccgaccc accgactgat ctcccagaat ctaacagcaa tcagggactg gctgagactt 8521 aaagcggcct acctgcaata tgggggcgag tggatccaag aagcattcca agcattcgca 8581 aagactacga gagagactct tgcgagcgca tggggggggt tatgtgcagc agtgcaacgc 8641 gtcgggaggg gaatactcgc agtcccaaga aggatcaggc aaggggcaga aatcgccctc 8701 ctgtgaggga cagcagtatc ggcagggaga tttcatgaac accccatgga gaaccccggc 8761 aatagaagga cagaaaaagt tatacaagca gcaaaatatg gatgatatag attctagtga 8821 tgatgaccta gtgggagttc ctgttacacc aagagtacca ctaagagcaa tgacctataa 8881 attggcagta gatatgtcac attttataaa gaaaagggga ctggatggga tgttttacag 8941 tagggataga catagaatcc tagacttgta cctagaaaag gaggaaggga taataccaga 9001 ttggcagaat tatactcatg ggccaggagt aaggtaccca atgtgtttcg ggtggctatg 9061 gaagctagta ccagtagatg tctcacaaga ggcagaggac gatgagacca actacttaac 9121 acacccagca caaacaagca gacatgatga cgagcatgga gaaacactac tttggcggtt 9181 tgaccccacg ctggcctatg attacaaggc tttcattctg catccagaag agtttgggca 9241 caagtcagga ttgccagaga aagagtggaa ggcaaaactg aaagcaagag ggataccata 9301 tagttaacaa caggaacagc catacttggt caaggcagga agtagctact gagaacagct 9361 gaggctgcag ctgcagggac tttccagaag gggctgtaac caagggaggg acatgggagg 9421 agctggtggg gaacgccctc atacttactg tataaatgta ctcgcttctt acattgtatt // LOCUS HIV2NIHZA 9431 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2) proviral gag, pol, vif, vpX, vpR, tat, rev, env and nef genes. ACCESSION J03654 KEYWORDS . SOURCE Human immunodeficiency virus type 2 (isolate FG) proviral DNA, clone NIHZ. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9431) AUTHORS Zagury,J.F., Franchini,G., Reitz,M., Collalti,E., Starcich,B., Hall,L., Fargnoli,K., Jagodzinski,L., Guo,H.G., Laure,F. and et,al. TITLE Genetic variability between isolates of human immunodeficiency virus (HIV) type 2 is comparable to the variability among HIV type 1 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5941-5945 (1988) STANDARD full staff_review COMMENT Hard copy of sequence kindly provided by G. Franchini (August 1987). The NIHZ clone has a large deletion in the nef cds and LTR relative to other HIV2 sequences; nevertheless, it is biologically active. FEATURES from to/span description pept 547 2106 gag polyprotein pept < 1824 4931 pol polyprotein (NH2 terminus uncertain; AA at 1824) pept 4861 5508 vif protein pept 5336 5674 vpX protein pept 5674 5991 vpR protein pept 5837 6132 tat protein, exon 2 (first expressed exon) 8272 8368 tat protein, exon 3 (AA at 8273) pept 6063 6132 rev protein, exon 2 (first expressed exon) 8272 8525 rev protein, exon 3 (AA at 8274) pept 6139 8709 env polyprotein pept 8543 9085 nef protein LTR < 1 301 5' LTR LTR 8928 > 9431 3' LTR rpt < 1 177 R repeat 5' copy rpt 9254 9431 R repeat 3' copy binding 305 321 primer (Lys-tRNA) binding site signal 9404 9409 poly-A signal BASE COUNT 3213 a 1913 c 2348 g 1957 t ORIGIN Cap site of genomic RNA. 1 agtcgctctg gcagaggctg gcagattgag ccctgggagg ttctctccag cactagcagg 61 tagagcctgg gtgttccctg ctagactctc accagtgctt ggccggcgct gggcagacgg 121 ctccacgctt gcttgcttaa agacctctca ataaagctgc caattagaag caagttaagt 181 gtgtgctccc atctctccta gtcgccgcct ggtcatccgg tgttcacctg tataacaaga 241 ccctggtctg ttaggaccct tcttgctttg ggaaaccgag gcaggcggaa aatccctagc 301 aggttggcgc ccgaacagga cttgaagaag actgagaagc cttggaacaa gggctgggtg 361 aaggcagtaa gggcggcagg aacaaaccag cacggagtgc tcctagaaag gcgcgggccg 421 aggtaccaaa gcggcgtgtg gacgggagtc gagaggcctc cgggtgaagg gtaagtacct 481 acaccaaaaa ctgtagccaa agagggcttg ttatcctacc tttagacagg tagaagattg 541 tgggagatgg gcgcgagaaa ctccgtcttg agagggaaaa aagcagatga attagaaaaa 601 attaggttac ggcccggcgg aaagaaaaag tacaagttaa aacatattgt gtgggcagcg 661 aatgaattgg acagattcgg attagcagag agcctgttgg agtcaaaaga aggttgccaa 721 aaaattctta cagttttaga tccattagta ccgacaggtt cagaaaattt aaaaagcctt 781 tttaatactg tctgcgtcat ctggtgcata cacgcagaag agaaagtgaa agatactgaa 841 ggagcaaaac aaatagtaca gagacatcta gtggcagaaa caggaactgc agagaaaatg 901 ccaaatacaa gtagaccaac agcaccacct agcgggaaaa acttccccgt gcagcaagta 961 gctggcaact atacccatat accactgagt cccggaactc taaatgcttg ggtaaaatta 1021 gtggaggaaa agaaattcgg ggcagaagta gtgccaggat ttcaggcact ctcagaaggc 1081 tgcacgccct atgatattaa tcagatgctt aattgtgtgg gcgaccatca agcagctatg 1141 caaataatca gggaaattat taatgaggaa gcagcagact gggatgttgc acatcccata 1201 ccaggcccct taccagcggg gcagcttaga gagccaagag ggtctgacat agcagggaca 1261 acaagcacag tagaagagca gatccagtgg atgtttagac cacaaaatcc tgtaccagta 1321 ggcaacatct ataggagatg gattcagata ggactgcaaa agtgcgtcag gatgtacaat 1381 ccaaccaaca tcctagacat aaatcaagga ccaaaggaac cattccaaag ctatgtagat 1441 aggttctaca aaagcttgag ggcagaacaa acagatccag cagtaaaaaa ttggatgacc 1501 caaacactgc taatacagaa tgccaaccca gattgtaaat tagtactaaa aggattagga 1561 atgaatccta ccttagaaga aatgctgacc acctgtcagg gggtaggggg accaggccag 1621 aaagctagat taatggcaga agccctaaag gaggtcatgg caccagcccc catcccattt 1681 gcagcagccc agcagagaaa gacattcaag tgctggaatt gtggaaagga aggacactcg 1741 gcaagacaat ggagcgcgcc tagaagacag ggctgctgga agtgtggtaa gtcaggacat 1801 gtcatggcaa attgcccaga tagacaggct ggttttttag ggattggcca ctggggaaag 1861 aaggcccgca acttccccgc ggcccaagtc ccgcaggggc taacaccaac agcaccccca 1921 ttggatccag cagtggacct actggagaaa tatatgcagc aaggaaaaag gcaaagggag 1981 cagagagaga gaccgtacaa ggaagtgaca gaggacttac tgcgtttcga gcaggcagag 2041 acaccatgca gggagacgac agaggacttg ctgcacctca attctctctt tggaaaagac 2101 cagtagtcac agcacacatt gagggtcagc cagtagaagt tttgttagac acaagggcta 2161 acgactcaat agtagcagga atagagttag ggagcaatta tagtccaaaa atagtaggag 2221 gaataggggg attcataaat accaaggaat ataaaaatgt agaaatagaa gtcctaggta 2281 aaagggtaaa agccaccata atgacaggtg ataccccgat caacattttt ggcagaaatg 2341 ttctgacagc tttaggcatg tcattaaacc tgccagttgc caagatagaa ccaataaaaa 2401 taatgctaaa gccagggaaa gatggaccaa gactaaaaca atggccttta acaaaagaaa 2461 aaatagaagc actaaaagaa atctgtgaaa aaatggaaaa agaaggccag ctagaagaag 2521 cacctccaac taatccttat aataccccca catttgcaat caagaaaaag gacaaaaaca 2581 aatggagaat gctaatagac tttagggagc taaacaaggt aactcaggat ttcacagaaa 2641 ttcagttagg aattccacac ccagcaggat tggccaaaaa gaggagaatt actgtactag 2701 atgtagggga tgcttacttt tccataccac tacatgagga ctttagacag tatactgcat 2761 tcactctacc atcagtaaac aatgcagaac caggaaaaag atatatatac aaagtcctac 2821 cacaggggtg gaaggggtca ccagcaattt ttcaatacac aatgaggcag atcttagaac 2881 cattcagaaa agcaaacgag gatgtcatta tcattcagta catggatgat atcttaatag 2941 ccagcgacag gacagactta gaacatgaca aagtggtcct gcagctaaaa gaacttctaa 3001 atggactagg attttccacc ccagatgaga aattccagaa agaccctcca tatcgctgga 3061 tgggctatga attatggcca actaaatgga agttgcaaaa aatacagctg ccccaaaaag 3121 aagtatggac agtcaatgac atccaaaagc tagtgggtgt cctaaattgg gcagcacaaa 3181 tctacccagg gataaagacc aaacacctat gtaggctaat tagaggaaaa atgacactca 3241 cagaagaggt acagtggaca gaattagcag aagcagagct agaggaaaac agaattatct 3301 taagccagaa acaagaagga cactattacc aggaagaaaa aaagttagaa gcaacagtcc 3361 agaaagatca agacaatcag tggacatata aggtacacca gggggagaaa attctcaaag 3421 gtgggaaaat atgcaaagat aaaaaatacc cataccaacg ggtcagattg ttagcacagg 3481 tagttcaaaa aataggaaaa gaagcactag tcatttgggg acggatacca aaatttcacc 3541 taccagtaga gagagatacc tgggagcagt ggtgggataa ctactggcaa gtaacatgga 3601 tcccagactg ggactttgta tctaccccac cactggtcag gctagcattt aacctggtag 3661 gagagcctgt accaggcgca gaaactttct acacagatgg atcctgcaat aggcagtcaa 3721 aagaagggaa agcaggatat ataacagata gagggagaga cagggtaaaa gtattagagc 3781 aaactaccaa tcagcaagca gaattagaag cctttgcaat ggcactaaca gactcaggtc 3841 caaaagctaa tattatagta gactcacaat atgtaatggg gatagtagca ggccaaccaa 3901 cagagtcaga gaatagaata gtaaatcaga tcatagaaga aatgataaag aaagaagcaa 3961 tctatgttgc gtgggtccca gcccacaaag gcataggagg aaaccaggaa gtagatcatt 4021 tagtaagtca gggcattaga caagtattat tcctagagaa aatagaaccc gcgcaggaag 4081 aacatgaaaa atatcatagc aatataaaag aactgtccca taaatttgga atacccaagc 4141 tagtggcaag acaaatagta aacacatgtg cccatgtgca acagaaaggg gaggctatac 4201 atgggcaagt aaatgcagaa ctaggcactt ggcaaatgga ctgcacacat ttagaaggaa 4261 aagtcattat agtagcagta catgttgcaa gtggatttat agaggcagaa gttatcccac 4321 aggaatcagg aaggcaaaca gcactgttct tactgaaact ggccagtagg tggccaataa 4381 cacacctgca cacagataat ggtgccaact tcacttcaca ggaagtgaaa atggtagcat 4441 ggtgggtagg tatagaacaa acctttggag tgccttacaa tccacaaagc caaggagtag 4501 tagaagcaat gaatcatcat ctaaaaaatc agatagacag aattagagag caggcaaata 4561 cagtagaaac aatagtatta atggcagttc attgcatgaa ttttaaaaga aggggaggaa 4621 taggggatat gaccccagca gaaagaataa tcaatatgat caccacagaa caagaaatac 4681 aattcctcca agcaaaaaat tcaaaattaa aaaattttcg ggtctatttc agagaaggca 4741 gagatcagct gtggaaagga cctggggaac tgctgtggaa gggagacgga gcagtcatag 4801 tcaaggtagg gacagaaata aaagtagtac caagaaggaa agccaagatc attaaagact 4861 atggaggaag gcaagagatg gatagtggtt cccatttgga gggtgccagg gaggatggag 4921 agatggcata gcttggtcaa gtacttgaaa tacagaacaa aagacttaga gaaggtatgc 4981 tatgttcccc atcataaggt ggggtgggca tggtggactt gcagcagggt aatattccca 5041 ctgaaagaga acagtcatct ggagatacaa gcatattgga acttaacacc agaaaaagga 5101 tggctctcct ctcactcagt gagaataact tggtatacag aaaagttctg gacagatgtt 5161 accccagact gtgcagatac cctaatacat agcacttatt tctcttgctt tacggcaggt 5221 gaagtaagaa gagccatcag aggggaaaaa ttattgtcct gctgcaaata cccccgagcg 5281 catagatccc aggtaccgtc acttcaattt ctggccttag tggtagtgca acaaaatgac 5341 agatcccagg gaaacagtgc caccaggaaa cagcggcgag gagactatcg aagaggcctt 5401 cgcatggcta gacaggacag tagaggctat aaacagagag gcagtgaatc accacccacg 5461 agagctcatt ttccaggtct ggcagaggtc ctggagatac tggcatgatg aacaagggat 5521 gtcaacaagc tacacaaaat atagatattt gtgcttaata cagatggcaa tgtacatgca 5581 tgctaagaga gatggcactt gcctgggggg aggcatgggc cagaagggtg gagatcaggg 5641 acctccccct cctccacctc caggtttagt ctaatgactg aagcaccaac agaactcccc 5701 ccggaggata ggaccccacc gagggagcca ggggatgcat gggtaataga aatcctgaga 5761 gaaatagaag aagaagcttt aaggcatttt gaccctcgct tgctaactgc tcttggcaga 5821 tatatctata ctaggcatgg agacaccctt gaaggcgcca gagagctcat tagaatcctg 5881 caacgagccc tcttcgcgca cttcagagca ggatgtggcc actcaagaat tggccagaca 5941 aggggaggaa atcctctctc agctataccg acccctagag gcatgcacta actcctgcta 6001 ttgcaagaaa tgctgctacg attgccagct gtgtttttta caaaaggggc tcgggatatg 6061 gtatgaccga aagggcagac gaagaaggac tccaaagaaa actaaggctc atccgtcttc 6121 tgcatcagac aagtgagtat gaagggtagt aagaatcaac tgctgattgc tattgtacta 6181 gctagtgctt acctaataca ttgcaagcaa tttgtgactg ttttctacgg catacccgcg 6241 tggaggaatg catccattcc cctgttttgt gcaaccaaaa atagagatac ttggggaaca 6301 atacagtgct taccagacaa tgatgattat caggaaataa ctctaaatgt gacagaggct 6361 ttcgatgcat ggaataatac agtaacagaa caagcagtag aggatgtctg gaatctattt 6421 gagacatcaa taaaaccatg tgtcaaatta acacccttat gtgtagcaat gaactgtaca 6481 aggaacatga ccacatggac agggaggaca gacacccaaa atatcacaat tataaatgac 6541 acttcccatg cgcgtgcaga caactgcaca ggattaaagg aggaagaaat gatcgactgt 6601 cagtttagta tgacaggatt agagagagac aagagaaaac agtatactga agcatggtac 6661 tcaaaagatg tggtttgtga caataacacc tcaagtcaga gcaagtgtta catgaaccat 6721 tgcaacacat cagtcatcac agagtcatgt gataagcact attgggatgc tatgaggttt 6781 agatactgtg caccaccggg ttttgcctta ctaagatgca atgataccaa ttattcaggc 6841 tttgcaccta actgctctaa agtagtagct gctacatgca ccagaatgat ggaaacgcaa 6901 acttctacat ggtttggctt taatggcact agagcagaaa atagaacata tatctattgg 6961 catggtaaag ataacagaac tattatcagc ttaaataact tttataatct cactatgcat 7021 tgtaagaggc cgggaaataa gacagtgtta ccaataacat ttatgtcagg gtttaagttt 7081 cactcccagc cagtcatcaa taaaaaaccc aggcaagcat ggtgttggtt cgaaggccaa 7141 tggaaggaag ccatgcagga ggtgaaggag acccttgcga aacatcccag atataaaggg 7201 aacaggagcc gcacagagaa tattaaattt aaagcaccag ggagaggctc agacccagaa 7261 gtaacataca tgtggactaa ctgcagagga gaatctctct actgcaacat gacttggttc 7321 ctcaattggg tagagaacag gacgggtcag aaacagcgca attatgcacc gtgccgtata 7381 aggcaaataa ttaatacctg gcacagggta gggaaaaatc tatatttgcc tcccagggaa 7441 ggggagttga cctgcaactc aacggtgacc agcataattg ccaacattga tgcgggagat 7501 caaacaaata ttacctttag tgcagaggcg gcagaactat accgattgga attgggggat 7561 tacaaattag tagaaatcac accaattggc ttcgcaccta catcagtaaa gagatactcc 7621 tctgctcacc agagacatac aagaggtgtg tttgtgctag ggttcctagg ttttctcgca 7681 acagcaggtt ctgccatggg cgcggcgtcc ttgacgctgt cagctcagtc tcggacttta 7741 ttggccggga tagtgcagca acagcaacag ctgttggatg tggtcaagag acaacaagaa 7801 atgttgcgac tgaccgtctg gggaacaaaa aatctccagg caagagtcac tgctatagag 7861 aagtacctaa aggaccaggc gcagctaaat tcatggggat gtgcgtttag acaagtctgc 7921 cacacttctg taccatgggt aaatgatacc ttgacacctg attggaacaa tatgacgtgg 7981 caggaatggg aacaaaaagt ccgctacctg gaggcaaata tcagtcaaag tctagaacaa 8041 gcacaaattc agcaagaaaa gaatatgtat gagctacaaa aattaaatag ctgggatgtt 8101 tttaccaatt ggcttgactt cacctcctgg gtcaggtata ttcaatatgg agtttatgta 8161 gtagtaggaa tagtagcttt aagaatagta atatatatag tacagatgtt aagtagactt 8221 agaaagggct ataggcctgt tttctcttcc ccccccggtt acatccaaca gatccatatc 8281 cacaaggacc aggaacagcc agccagagaa gaaacagaag aagacgttgg aagcaacggt 8341 ggagacagat cttggccttg gccgatagca tatatacatt tcctgatccg cctgctgatt 8401 cgcctcttga ccgggctata caacatctgc agggacttac tatccaggat ctccccgatc 8461 ctccaaccaa tcttccagag tctccagaga gcactaacag caatcagaga ctggctgagg 8521 cttaaagcag cctacctgca gtatgggtgc gagtggatcc aagaagcgtt ccaagccctt 8581 gcaaggacta caagagagac tcttgcaggc gcggggagag acttgtggag ggcgctgcaa 8641 cgaatcggga gggggatact tgcagtccca cgaaggatca ggcagggagc agaactcgcc 8701 ctcctgtgag ggacagcgat atcagcaggg agattttgta aataccccat ggagaacccc 8761 agcagcagaa agggagaaag aattgtacaa acagcaaaat atggatgatg tagatctaga 8821 tgatgatgac caagtaggat tccctgtcac accaagagta ccattaagac caatgacatt 8881 caaattggca gtagatatgt ctcattttat aaaagaaaaa gggggactgg aagggctgtt 8941 ttatagtcag agaagacata gaatcttaga cttatactta gacaaggctt ttactctgta 9001 cccagaggaa tttgggcata attcaggact gccagagaaa gagtggaagg cgagactgaa 9061 agcaagggga ataccattta gttaaaaaca ggaacagcta tacttggtca gggcaggaag 9121 tagctactga gaacagctga gactgcaggg actttccaga aggggctgta accaggagag 9181 ggacatggga ggagctggtg gggaacgccc ttcatacttt ctgtataaat gtacccgctg 9241 cttgcattgt attcagtcgc tctggcagag gctggcagat tgagccctgg gaggttctct 9301 ccagcactag caggtagagc ctgggtgttc cctgctagac tctcaccagt gcttggccgg 9361 cgctgggcag acggctccac gcttgcttgc ttaaagacct ctcaataaag ctgccaatta 9421 gaagcaagtt a // LOCUS HIV2RODZ 9671 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2, complete proviral genome. ACCESSION M15390 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 2 (isolate ROD) proviral DNA, clones lambda-ROD[27,35]. ORGANISM Human immunodeficiency virus type 2 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 9361 to 9671) AUTHORS Clavel,F., Guyader,M., Guetard,D., Salle,M., Montagnier,L. and Alizon,M. TITLE Molecular cloning and polymorphism of the human immunodeficiency virus type 2 JOURNAL Nature 324, 691-695 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 9671) AUTHORS Guyader,M., Emerman,M., Sonigo,P., Clavel,F., Montagnier,L. and Alizon,M. TITLE Genome organization and transactivation of the human immuno- deficiency virus type 2 JOURNAL Nature 326, 662-669 (1987) STANDARD full staff_review COMMENT The ROD isolate was obtained from offshore Senegal in 1985. Its sequence was determined from two clones: bases 1 to 2658 are from ROD 27 and bases 2659 to 9671 are from ROD 35. An infectious clone of this virus has been constructed by Keith Peden, Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD. 21205 (301)955-3652. FEATURES from to/span description pept 546 2114 gag polyprotein pept < 1829 4939 pol polyprotein (NH2 terminus uncertain; AA at pept 4869 5516 vif protein pept 5344 5682 vpX protein pept 5682 5999 vpR protein pept 5845 6140 tat protein, exon 2 (first expressed exon) 8307 8403 tat protein, exon 3 (AA at 8308) pept 6071 6140 rev protein, exon 2 (first expressed exon) 8307 8539 rev protein, exon 3 (AA at 8309) pept 6147 8723 env polyprotein pept 8557 9327 nef protein LTR < 1 299 5' LTR LTR 8942 > 9671 3' LTR rpt < 1 173 R repeat 5' copy rpt 9497 9671 R repeat 3' copy binding 303 319 primer (Lys-tRNA) binding site signal 9649 9654 poly-A signal BASE COUNT 3314 a 1972 c 2401 g 1984 t ORIGIN Cap site of genomic RNA. 1 ggtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctagactct caccagcact tggccggtgc tgggcagacg 121 gccccacgct tgcttgctta aaaacctctt aataaagctg ccagttagaa gcaagttaag 181 tgtgtgctcc catctctcct agtcgccgcc tggtcattcg gtgttcacct gagtaacaag 241 accctggtct gttaggaccc ttcttgcttt gggaaaccga ggcaggaaaa tccctagcag 301 gttggcgcct gaacagggac ttgaagaaga ctgagaagtc ttggaacacg gctgagtgaa 361 ggcagtaagg gcggcaggaa caaaccacga cggagtgctc ctagaaaggc gcgggccgag 421 gtaccaaagg cagcgtgtgg agcgggagga gaagaggcct ccgggtgaag gtaagtacct 481 acaccaaaaa ctgtagccga aagggcttgc tatcctacct ttagacaggt agaagattgt 541 gggagatggg cgcgagaaac tccgtcttga gagggaaaaa agcagatgaa ttagaaagaa 601 tcaggttacg gcccggcgga aagaaaaagt acaggctaaa acatattgtg tgggcagcga 661 ataaattgga cagattcgga ttagcagaga gcctgttgga gtcaaaagag ggttgtcaaa 721 aaattcttac agttttagat ccaatggtac cgacaggttc agaaaattta aaaagtcttt 781 ttaatactgt ctgcgtcatt tggtgcatac acgcagaaga gaaagtgaaa gatactgaag 841 gagcaaaaca aatagtgcgg agacatctag tggcagaaac aggaactgca gagaaaatgc 901 caagcacaag tagaccaaca gcaccatcta gcgagaaggg aggaaattac ccagtgcaac 961 atgtaggcgg caactacacc catataccgc tgagtccccg aaccctaaat gcctgggtaa 1021 aattagtaga ggaaaaaaag ttcggggcag aagtagtgcc aggatttcag gcactctcag 1081 aaggctgcac gccctatgat atcaaccaaa tgcttaattg tgtgggcgac catcaagcag 1141 ccatgcagat aatcagggag attatcaatg aggaagcagc agaatgggat gtgcaacatc 1201 caataccagg ccccttacca gcggggcagc ttagagagcc aaggggatct gacatagcag 1261 ggacaacaag cacagtagaa gaacagatcc agtggatgtt taggccacaa aatcctgtac 1321 cagtaggaaa catctataga agatggatcc agataggatt gcagaagtgt gtcaggatgt 1381 acaacccgac caacatccta gacataaaac agggaccaaa ggagccgttc caaagctatg 1441 tagatagatt ctacaaaagc ttgagggcag aacaaacaga tccagcagtg aagaattgga 1501 tgacccaaac actgctagta caaaatgcca acccagactg taaattagtg ctaaaaggac 1561 tagggatgaa ccctacctta gaagagatgc tgaccgcctg tcagggggta ggtgggccag 1621 gccagaaagc tagattaatg gcagaggccc tgaaagaggt cataggacct gcccctatcc 1681 cattcgcagc agcccagcag agaaaggcat ttaaatgctg gaactgtgga aaggaagggc 1741 actcggcaag acaatgccga gcacctagaa ggcagggctg ctggaagtgt ggtaagccag 1801 gacacatcat gacaaactgc ccagatagac aggcaggttt tttaggactg ggcccttggg 1861 gaaagaagcc ccgcaacttc cccgtggccc aagttccgca ggggctgaca ccaacagcac 1921 ccccagtgga tccagcagtg gatctactgg agaaatatat gcagcaaggg aaaagacaga 1981 gagagcagag agagagacca tacaaggaag tgacagagga cttactgcac ctcgagcagg 2041 gggagacacc atacagggag ccaccaacag aggacttgct gcacctcaat tctctctttg 2101 gaaaagacca gtagtcacag catacattga gggtcagcca gtagaagtct tgttagacac 2161 aggggctgac gactcaatag tagcaggaat agagttaggg aacaattata gcccaaaaat 2221 agtaggggga atagggggat tcataaatac caaggaatat aaaaatgtag aaatagaagt 2281 tctaaataaa aaggtacggg ccaccataat gacaggcgac accccaatca acatttttgg 2341 cagaaatatt ctgacagcct taggcatgtc attaaatcta ccagtcgcca aagtagagcc 2401 aataaaaata atgctaaagc cagggaaaga tggaccaaaa ctgagacaat ggcccttaac 2461 aaaagaaaaa atagaagcac taaaagaaat ctgtgaaaaa atggaaaaag aaggccagct 2521 agaggaagca cctccaacta atccttataa tacccccaca tttgcaatca agaaaaagga 2581 caaaaacaaa tggaggatgc taatagattt cagagaacta aacaaggtaa ctcaagattt 2641 cacagaaatt cagttaggaa ttccacaccc agcagggttg gccaagaaga gaagaattac 2701 tgtactagat gtaggggatg cttacttttc cataccacta catgaggact ttagaccata 2761 tactgcattt actctaccat cagtgaacaa tgcagaacca ggaaaaagat acatatataa 2821 agtcttgcca cagggatgga agggatcacc agcaattttt caacacacaa tgagacaggt 2881 attagaacca ttcagaaaag caaacaagga tgtcattatc attcagtaca tggatgatat 2941 cttaatagct agtgacagga cagatttaga acatgatagg gtagtcctgc agctcaagga 3001 acttctaaat ggcctaggat tttctacccc agatgagaag ttccaaaaag accctccata 3061 ccactggatg ggctatgaac tatggccaac taaatggaag ttgcagaaaa tacagttgcc 3121 ccaaaaagaa atatggacag tcaatgacat ccagaagcta gtgggtgtcc taaattgggc 3181 agcacaactc tacccaggga taaagaccaa acacttatgt aggttaatca gaggaaaaat 3241 gacactcaca gaagaagtac agtggacaga attagcagaa gcagagctag aagaaaacag 3301 aattatccta agccaggaac aagagggaca ctattaccaa gaagaaaaag agctagaagc 3361 aacagtccaa aaggatcaag agaatcagtg gacatataaa atacaccagg aagaaaaaat 3421 tctaaaagta ggaaaatatg caaaggtgaa aaacacccat accaatggaa tcagattgtt 3481 agcacaggta gttcagaaaa taggaaaaga agcactagtc atttggggac gaataccaaa 3541 atttcaccta ccagtagaga gagaaatctg ggagcagtgg tgggataact actggcaagt 3601 gacatggatc ccagactggg acttcgtgtc taccccacca ctggtcaggt tagcgtttaa 3661 cctggtaggg gatcctatac caggtgcaga gaccttctac acagatggat cctgcaatag 3721 gcaatcaaaa gaaggaaaag caggatatgt aacagataga gggaaagaca aggtaaagaa 3781 actagagcaa actaccaatc agcaagcaga actagaagcc tttgcgatgg cactaacaga 3841 ctcgggtcca aaagttaata ttatagtaga ctcacagtat gtaatgggga tcagtgcaag 3901 ccaaccaaca gagtcagaaa gtaaaatagt gaaccagatc atagaagaaa tgataaaaaa 3961 ggaagcaatc tatgttgcat gggtcccagc ccacaaaggc atagggggaa accaggaagt 4021 agatcattta gtgagtcagg gtatcagaca agtgttgttc ctggaaaaaa tagagcccgc 4081 tcaggaagaa catgaaaaat atcatagcaa tgtaaaagaa ctgtctcata aatttggaat 4141 acccaattta gtggcaaggc aaatagtaaa ctcatgtgcc caatgtcaac agaaagggga 4201 agctatacat gggcaagtaa atgcagaact aggcacttgg caaatggact gcacacattt 4261 agaaggaaag atcattatag tagcagtaca tgttgcaagt ggatttatag aagcagaagt 4321 catcccacag gaatcaggaa gacaaacagc actcttccta ttgaaactgg caagtaggtg 4381 gccaataaca cacttgcata cagataatgg tgccaacttc acttcacagg aggtgaagat 4441 ggtagcatgg tggataggta tagaacaatc ctttggagta ccttacaatc cacagagcca 4501 aggagtagta gaagcaatga atcaccatct aaaaaaccaa ataagtagaa tcagagaaca 4561 ggcaaataca atagaaacaa tagtactaat ggcaattcat tgcatgaatt ttaaaagaag 4621 ggggggaata ggggatatga ctccatcaga aagattaatc aatatgatca ccacagaaca 4681 agagatacaa ttcctccaag ccaaaaattc aaaattaaaa gattttcggg tctatttcag 4741 agaaggcaga gatcagttgt ggaaaggacc tggggaacta ctgtggaaag gagaaggagc 4801 agtcctagtc aaggtaggaa cagacataaa aataatacca agaaggaaag ccaagatcat 4861 cagagactat ggaggaagac aagagatgga tagtggttcc cacctggagg gtgccaggga 4921 ggatggagaa atggcatagc cttgtcaagt atctaaaata caaaacaaag gatctagaaa 4981 aggtgtgcta tgttccccac cataaggtgg gatgggcatg gtggacttgc agcagggtaa 5041 tattcccatt aaaaggaaac agtcatctag agatacaggc atattggaac ttaacaccag 5101 aaaaaggatg gctctcctct tattcagtaa gaataacttg gtacacagaa aagttctgga 5161 cagatgttac cccagactgt gcagatgtcc taatacatag cacttatttc ccttgcttta 5221 cagcaggtga agtaagaaga gccatcagag gggaaaagtt attgtcctgc tgcaattatc 5281 cccgagctca tagagcccag gtaccgtcac ttcaatttct ggccttagtg gtagtgcaac 5341 aaaatgacag accccagaga gacagtacca ccaggaaaca gcggcgaaga gactatcgga 5401 gaggccttcg cctggctaaa caggacagta gaagccataa acagagaagc agtgaatcac 5461 ctaccccgag aacttatttt ccaggtgtgg cagaggtcct ggagatactg gcatgatgaa 5521 caagggatgt cagaaagtta cacaaagtat agatatttgt gcataataca gaaagcagtg 5581 tacatgcatg ttaggaaagg gtgtacttgc ctggggaggg gacatgggcc aggagggtgg 5641 agaccagggc ctcctcctcc tccccctcca ggtctggtct aatggctgaa gcaccaacag 5701 agctcccccc ggtggatggg accccactga gggagccagg ggatgagtgg ataatagaaa 5761 tcttgagaga aataaaagaa gaagctttaa agcattttga ccctcgcttg ctaattgctc 5821 ttggcaaata tatctatact agacatggag acacccttga aggcgccaga gagctcatta 5881 aagtcctgca acgagccctt ttcacgcact tcagagcagg atgtggccac tcaagaattg 5941 gccagacaag gggaggaaat cctctctcag ctataccgac ccctagaaac atgcaataac 6001 tcatgctatt gtaagcgatg ctgctaccat tgtcagatgt gttttctaaa caaggggctc 6061 gggatatgtt atgaacgaaa gggcagacga agaaggactc caaagaaaac taagactcat 6121 ccgtctccta caccagacaa gtgagtatga tgaatcagct gcttattgcc attttattag 6181 ctagtgcttg cttagtatat tgcacccaat atgtaactgt tttctatggc gtacccacgt 6241 ggaaaaatgc aaccattccc ctcttttgtg caaccagaaa tagggatact tggggaacca 6301 tacagtgctt gcctgacaat gatgattatc aggaaataac tttgaatgta acagaggctt 6361 ttgatgcatg gaataataca gtaacagaac aagcaataga agatgtctgg catctattcg 6421 agacatcaat aaaaccatgt gtcaaactaa cacctttatg tgtagcaatg aaatgcagca 6481 gcacagagag cagcacaggg aacaacacaa cctcaaagag cacaagcaca accacaacca 6541 cacccacaga ccaggagcaa gagataagtg aggatactcc atgcgcacgc gcagacaact 6601 gctcaggatt gggagaggaa gaaacgatca attgccagtt caatatgaca ggattagaaa 6661 gagataagaa aaaacagtat aatgaaacat ggtactcaaa agatgtggtt tgtgagacaa 6721 ataatagcac aaatcagacc cagtgttaca tgaaccattg caacacatca gtcatcacag 6781 aatcatgtga caagcactat tgggatgcta taaggtttag atactgtgca ccaccgggtt 6841 atgccctatt aagatgtaat gataccaatt attcaggctt tgcacccaac tgttctaaag 6901 tagtagcttc tacatgcacc aggatgatgg aaacgcaaac ttccacatgg tttggcttta 6961 atggcactag agcagagaat agaacatata tctattggca tggcagagat aatagaacta 7021 tcatcagctt aaacaaatat tataatctca gtttgcattg taagaggcca gggaataaga 7081 tagtgaaaca aataatgctt atgtcaggac atgtgtttca ctcccactac cagccgatca 7141 ataaaagacc cagacaagca tggtgctggt tcaaaggcaa atggaaagac gccatgcagg 7201 aggtgaagga aacccttgca aaacatccca ggtatagagg aaccaatgac acaaggaata 7261 ttagctttgc agcgccagga aaaggctcag acccagaagt agcatacatg tggactaact 7321 gcagaggaga gtttctctac tgcaacatga cttggttcct caattggata gagaataaga 7381 cacaccgcaa ttatgcaccg tgccatataa agcaaataat taacacatgg cataaggtag 7441 ggagaaatgt atatttgcct cccagggaag gggagctgtc ctgcaactca acagtaacca 7501 gcataattgc taacattgac tggcaaaaca ataatcagac aaacattacc tttagtgcag 7561 aggtggcaga actatacaga ttggagttgg gagattataa attggtagaa ataacaccaa 7621 ttggcttcgc acctacaaaa gaaaaaagat actcctctgc tcacgggaga catacaagag 7681 gtgtgttcgt gctagggttc ttgggttttc tcgcaacagc aggttctgca atgggcgcgg 7741 cgtccctgac cgtgtcggct cagtcccgga ctttactggc cgggatagtg cagcaacagc 7801 aacagctgtt ggacgtggtc aagagacaac aagaactgtt gcgactgacc gtctggggaa 7861 cgaaaaacct ccaggcaaga gtcactgcta tagagaagta cctacaggac caggcgcggc 7921 taaattcatg gggatgtgcg tttagacaag tctgccacac tactgtacca tgggttaatg 7981 attccttagc acctgactgg gacaatatga cgtggcagga atgggaaaaa caagtccgct 8041 acctggaggc aaatatcagt aaaagtttag aacaggcaca aattcagcaa gagaaaaata 8101 tgtatgaact acaaaaatta aatagctggg atatttttgg caattggttt gacttaacct 8161 cctgggtcaa gtatattcaa tatggagtgc ttataatagt agcagtaata gctttaagaa 8221 tagtgatata tgtagtacaa atgttaagta ggcttagaaa gggctatagg cctgttttct 8281 cttccccccc cggttatatc caacagatcc atatccacaa ggaccgggga cagccagcca 8341 acgaagaaac agaagaagac ggtggaagca acggtggaga cagatactgg ccctggccga 8401 tagcatatat acatttcctg atccgccagc tgattcgcct cttgaccaga ctatacagca 8461 tctgcaggga cttactatcc aggagcttcc tgaccctcca actcatctac cagaatctca 8521 gagactggct gagacttaga acagccttct tgcaatatgg gtgcgagtgg atccaagaag 8581 cattccaggc cgccgcgagg gctacaagag agactcttgc gggcgcgtgc aggggcttgt 8641 ggagggtatt ggaacgaatc gggaggggaa tactcgcggt tccaagaagg atcagacagg 8701 gagcagaaat cgccctcctg tgagggacgg cagtatcagc agggagactt tatgaatact 8761 ccatggaagg acccagcagc agaaagggag aaaaatttgt acaggcaaca aaatatggat 8821 gatgtagatt cagatgatga tgaccaagta agagtttctg tcacaccaaa agtaccacta 8881 agaccaatga cacatagatt ggcaatagat atgtcacatt taataaaaac aaggggggga 8941 ctggaaggga tgttttacag tgaaagaaga cataaaatct taaatatata cttagaaaag 9001 gaagaaggga taattgcaga ttggcagaac tacactcatg ggccaggagt aagataccca 9061 atgttctttg ggtggctatg gaagctagta ccagtagatg tcccacaaga aggggaggac 9121 actgagactc actgcttagt acatccagca caaacaagca agtttgatga cccgcatggg 9181 gagacactag tctgggagtt tgatcccttg ctggcttata gttacgaggc ttttattcgg 9241 tacccagagg aatttgggca caagtcaggc ctgccagagg aagagtggaa ggcgagactg 9301 aaagcaagag gaataccatt tagttaaaga caggaacagc tatacttggt cagggcagga 9361 agtaactaac agaaacagct gagactgcag ggactttcca gaaggggctg taaccaaggg 9421 agggacatgg gaggagctgg tggggaacgc cctcatattc tctgtataaa tatacccgct 9481 agcttgcatt gtacttcggt cgctctgcgg agaggctggc agattgagcc ctgggaggtt 9541 ctctccagca gtagcaggta gagcctgggt gttccctgct agactctcac cagcacttgg 9601 ccggtgctgg gcagacggcc ccacgcttgc ttgcttaaaa acctccttaa taaagctgcc 9661 agttagaagc a // LOCUS HIV2ST 9672 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 2 (HIV-2), complete proviral genome. ACCESSION M31113 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 2 (isolate HIV-2/ST) proviral DNA, clone JSP4-27. REFERENCE 1 (bases 1 to 9672) AUTHORS Kumar,P., Hui,H., Kappes,J.C., Haggarty,B.S., Hoxie,J.A., Arya,S.K., Shaw,G.M. and Hahn,B. TITLE Molecular characterization of an attenuated human immunodeficiency virus type 2 isolate JOURNAL J. Virol. 64, 890-901 (1990) STANDARD full staff_review REFERENCE 2 (sites; corrections bases 6952, 7003, 7926) AUTHORS Hahn,B. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Kindly provided in computer-readable form by B. Hahn, University of Alabama, Birmingham. The corrections of nts 6952, 7003 and 7926, from 'c' to 't' result in amino acid changes. This isolate, obtained from an asymptomatic Senegalese woman, has low pathogenicity, infecting cells at a relatively slow rate and causing little cell death or fusion. FEATURES from to/span description pept 548 2113 gag polyprotein pept < 1717 1767 pol polyprotein (NH2 terminus uncertain; AA at 1717) 1771 4938 pol polyprotein pept 4868 5515 vif protein pept 5343 5681 vpX protein pept 5681 5776 vpR protein 5780 5998 vpR protein pept 5844 6139 tat protein, exon 2 (first expressed exon) 8291 8387 tat protein, exon 3 (AA at 8292) pept 6070 6139 rev protein, exon 2 (first expressed exon) 8291 8544 rev protein, exon 3 (AA at 8293) pept 6149 8728 env polyprotein site 1768 1770 pol in-frame stop codon site 5777 5779 vpR in-frame stop codon revision 6952 1 t in [2]; c in [1] revision 7003 1 t in [2]; c in [1] revision 7926 1 t in [2]; c in [1] pept 8562 9329 nef protein pre-msg 1 9672 genomic mRNA pre-msg 1 9672 tat, rev, nef subgenomic mRNA rpt 9499 9672 R repeat 3' copy binding 305 321 primer (Lys-tRNA) binding site signal 9650 9655 poly-A signal BASE COUNT 3304 a 1991 c 2412 g 1965 t ORIGIN 1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctagactct caccagtgct tggccggcac tgggcagacg 121 gctccacgct tgcttgctta aaagacctct taataaagct gccagttaga agcaagttaa 181 gtgtgtgctc ccatctctcc tagtcgccgc ctggtcattc ggtgttcatc taaagtaaca 241 agaccctggt ctgttaggac cctttctgct ttgggaaacc aaggcaggaa aatccctagc 301 aggttggcgc ccgaacaggg acttgaagaa gactgagaag ccttggaaca cggctgagtg 361 aaggcagtaa gggcggcagg aacaaaccac gacggagtgc tcctagaaaa gcgcaggccg 421 aggtaccaag ggcggcgtgt ggagcgggag tgaaagaggc ctccgggtga aggtaagtgc 481 ctacaccaaa tacagtagcc agaagggctt gttatcctac ctttagacgg gtagaagatt 541 gtgggagatg ggcgcgagaa actccgtctt gagagggaaa aaagcagacg aattagaaaa 601 gattaggtta cggcccggcg gaaagaaaaa atataggcta aaacatattg tgtgggcagc 661 gaatgaattg gacagattcg gattggcaga gagcctgttg gagtcaaaag agggttgcca 721 aaaaattctt acagttttag atccattagt accgacaggg tcagaaaatt taaaaagcct 781 ttttaatact gtctgcgtca tttggtgtat acacgcagaa gagaaagcga aagatactga 841 agaagcaaaa caaaaggtac agagacatct agtggcagaa acaaaaacta cagaaaaaat 901 gccaagtaca agtagaccaa cagcaccacc tagcgggaac ggaggaaact tccccgtaca 961 acaagtggcc ggcaactata cccatgtgcc actaagtccc cgaaccctaa atgcttgggt 1021 aaaactagta gaggaaaaga agttcggggc agaagtagtg ccaggatttc aggcactctc 1081 agaaggctgc acgccctatg atattaatca aatgcttaat tgtgtgggcg accatcaagc 1141 agctatgcaa ataatcaggg aaattattaa tgaagaagca gcagattggg acgcacaaca 1201 cccaatacca ggccccttac cagcggggca gctcagggag ccaaggggat ctgacatagc 1261 agggacaaca agcacagtag aagagcagat ccagtggatg tttaggccac aaaatcctgt 1321 accagtagga agcatctata gaagatggat ccagataggg ctacagaagt gcgtcaggat 1381 gtacaaccca accaacatcc tagacataaa acagggacca aaggagccat tccagagtta 1441 tgtagataga ttctacaaga gcttgagggc agaacaaaca gatccagcag taaaaaattg 1501 gatgacccaa acactgctag tgcagaatgc caacccagac tgtaagttag tactaaaagg 1561 actagggata aatcctacct tagaagaaat gctaaccgcc tgtcaggggg taggtggacc 1621 aggccagaaa gccagattaa tggcagaagc cttaaaggag gccatggcac cagcccccat 1681 cccatttgca gcagcccaac agagaaggac aattaagtgc tggaattgcg gaaaggaagg 1741 gcactcggca agacaatgcc gagcacctag aagacaaggc tgctggaaat gtggcaaggc 1801 aggacacatc atggcaaaat gcccagaaag acaggcgggt tttttagggt tgggcccatg 1861 gggaaagaag ccccgcaatt tccctgtggc ccaaatcccg caggggctga caccaacagc 1921 acccccgata gacccagtag aggacctact agagaagtac atgcagcaag ggaaaaggca 1981 gagagagcag agagagaggc catacaaaga agtgacagag gacttcctgc agctcgagaa 2041 acaagagaca ccatgcagag agacgacaga ggacttgctg cacctcaatt ctctctttgg 2101 aaaagaccag tagtcacagc acatgttgag ggccagccag tagaagtttt gctagacaca 2161 ggggctgacg actcaatagt agcaggcgta gagttaggga gcaattatag tccaaagata 2221 gtagggggaa tagggggatt cataaatacc aaagaatata aaaatgtaga aataagagta 2281 ttaaataaaa gagtaagagc caccataatg acaggtgata ccccaatcaa catttttggc 2341 agaaacattc tgacagcctt aggcatgtca ttaaatctac cagtcgccaa gatagaacca 2401 ataaaaataa tgctgaagcc aggaaaggat ggaccaaaac tgagacaatg gcccttaaca 2461 aaagaaaaaa tagaggcact aaaagagatc tgtgagaaaa tggaaagaga gggccagcta 2521 gaggaggcac ctccaactaa tccttataat acccccacat ttgcaatcaa gaaaaaggac 2581 aaaaacaaat ggagaatgct aatagatttt agagaactaa acaaggtaac tcaagacttc 2641 acagaaatcc agttaggaat tccacaccca gcaggactag ccaagaagaa acgaattact 2701 gtcctagatg taggggatgc ttacttttcc ataccactac atgaggattt tagacagtat 2761 actgcattta ctctaccatc aataaacaat gctgaaccag gaaaaagata catatataaa 2821 gtctcaccac agggatggaa gggatcacca gcaatttttc agtacacaat gaggcaggtc 2881 ttagaaccat tcagaaaagc aaacccggat atcattctca ttcagtacat ggatgatatc 2941 ttgatagcca gcgacaggac agatttagaa catgacagag tggttctgca gctaaaggaa 3001 cttctaaatg gcctgggatt ttccacccca gatgagaagt tccaaaaaga ccctccatac 3061 caatggatgg gctatgaact gtggccaact aaatggaagc tgcaaagaat acaattgccc 3121 caaaaggaag tatggacagt caatgacatc caaaaactgg tgggtgtcct aaattgggca 3181 gcacaaatct acccagggat aaagaccaga aacttatgta ggttaatcag aggaaaaatg 3241 acactcacag aagaggtaca gtggacagaa ttagcagaag cggaactaga agaaaacaaa 3301 atcatcttaa gccaggaaca agaaggatgc tattaccaag aggaaaagga gctagaagca 3361 acagtccaaa aagatcaaga caatcagtgg acatataaga tacaccaggg aggaaaaatt 3421 ctaaaagtag gaaaatatgc aaaggtaaaa aatacccaca ccaacggagt cagactccta 3481 gcacaagtag ttcaaaaaat aggaaaagaa gcactagtca tttggggacg aataccaaaa 3541 tttcacctac cagtagaaag agatacctgg gaacagtggt gggataacta ctggcaagtg 3601 acatggatcc cagactggga cttcatatct accccgccac tggtcagatt agtatttaac 3661 ctggtgaaag atcccatact aggcgcagaa accttctaca cagatggatc ctgcaataag 3721 caatcaagag aaggaaaagc aggatacata acagatagag gaagagacaa ggtgaggcta 3781 ttagagcaaa ccaccaatca gcaagcagaa ttagaagcct ttgcgatggc agtaacagac 3841 tcaggtccaa aggccaacat tatagtagac tcacaatatg taatgggaat agtagcaggc 3901 caaccaacag agtcagagag taaaatagta aatcaaatca tagaagaaat gataaaaaag 3961 gaagcaatct atgttgcatg ggtcccagcc cataaaggca taggaggaaa tcaggaggta 4021 gatcacttag taagtcaggg catcagacaa gtattattcc tagagaaaat agaacccgct 4081 caggaggaac atgaaaaata tcatagcaat gtaaaagaac tatcccataa atttggactg 4141 cccaaattag tggcaagaca aatagtaaac acatgcaccc aatgtcagca gaaaggggag 4201 gctatacatg ggcaagtaaa tgcagaatta ggcacttggc aaatggactg cacacactta 4261 gaaggaaaaa tcattatagt agcagtacat gttgcaagtg gatttataga agcagaagtc 4321 atcccacagg aatcaggaag gcaaacggca ctcttcctac taaaactggc cagtaggtgg 4381 ccaataacac atttgcacac agacaatggt gccaacttca cttcacagga agtaaagatg 4441 gtggcatggt ggataggtat agaacaatcc ttcggagtac cttacaatcc acaaagccaa 4501 ggagtagtgg aagcaatgaa tcaccaccta aaaaatcaga taagcagaat tagagagcag 4561 gcaaacacag tagaaacaat agtactaatg gcagttcatt gcatgaattt taaaaggagg 4621 ggaggaatag gggatatgac cccagcagaa agactaatca atatggtcac tgcagaacag 4681 gaaatacaat tcctccaagc aaaaaattca aaattacaaa attttcgggt ctatttcaga 4741 gaaggcagag atcagctgtg gaaaggacct ggggaactac tgtggaaggg ggacggagca 4801 gtcatagtca aggtaggggc tgacataaaa ataataccaa gaaggaaagc taagatcatc 4861 aaagactatg gaggaaggca agagatggat agcggttcca acttggaggg tgccagggag 4921 gatggagagg tggcatagcc ttatcaagta tctaaaatac agaacaggag atctagagaa 4981 ggtgtgctat gttccccacc ataaggtggg atgggcgtgg tggacttgca gcagggtaat 5041 attcccatta aaaggagaaa gtcatctgga gatacaggca tactggaacc taacaccaga 5101 aaaaggatgg ctctcctcct attcagtaag actaacttgg tatacagaaa aattctggac 5161 agatgttacc ccagactgtg cggactccct aatacatagc acttatttct cttgctttac 5221 ggcaggcgaa gtaagaagag ccatcagagg ggaaaagcta ttatcctgct gcaactaccc 5281 ccaagcccat aagtaccagg taccgtcact ccagtttctg gccttagtgg tagtgcaaca 5341 aaatggcagg ccccagagag acaataccac caggaaacag tggcgaagaa actatcggag 5401 aggccttcga gtggctagac aggacggtag aagccataaa cagagaggca gtgaaccacc 5461 tgccccgaga gcttattttc caggtgtggc aaaggtcctg gagatactgg catgatgaac 5521 aaggaatgtc aataagttac acaaagtata gatatttgtg cctaatgcag aaagctatgt 5581 tcatacattc taagagaggg tgcacttgcc tggggggagg acatgggccg ggaggatgga 5641 gatcaggacc tccccctcct ccccctccag gtctagtcta atgactgaag caccaacaga 5701 gtctcccccg gaggatagga ccccaccgag ggagccaggg gatgagtggg taatagaaac 5761 cctgagagag ataaaataag aagctttaaa gcactttgac cctcgcttgc taattactct 5821 tggcaactat atctatgcta gacatggaga cacccttgaa ggcgccagag ggctcattag 5881 gatcctacaa cgagccctcc tcttgcactt cagagcagga tgcggccgct caaggattgg 5941 tcagcccagg ggacgaaatc ctttatcagc tataccaacc cctagaggca tgcgataaca 6001 aatgttactg taaaaagtgc tgctaccatt gccagatgtg ttttttaaac aaggggctcg 6061 ggatatggta tgaacgaaag ggcagaagaa gaagaactcc gaagaaaact aaggctcatt 6121 cgtcttctgc atcagacaag tgagtaagat gtgtggtagg aatcaactat ttgttgccag 6181 cttgctagct agtgcttgct taatatattg cgtccaatat gtgactgttt tctatggcgt 6241 gcccgtgtgg agaaatgcat ccattcccct cttttgtgca actaaaaata gagatacttg 6301 gggaaccata cagtgcttgc cagacaatga tgactatcag gaaatagctt taaatgtgac 6361 agaggccttc gacgcatgga ataatacagt aacagaacaa gcagtagaag atgtctggag 6421 tctatttgag acatcaataa aaccatgcgt caaactaaca cccttatgtg tagcaatgcg 6481 ttgtaacagc acaactgcaa aaaacacaac ctccacacca acaaccacca caacagcaaa 6541 cacaacaata ggagagaatt cttcatgcat acgcacagac aactgcacag ggttgggaga 6601 agaagagatg gtcgactgtc agttcaatat gacaggatta gagagggata agaaaaaact 6661 atataatgaa acatggtact caaaagatgt agtctgtgaa tcaaatgaca ccaagaaaga 6721 gaaaacatgt tacatgaacc actgcaacac atcagtcatc acagagtcat gtgacaagca 6781 ctattgggat actatgaggt ttagatattg tgcaccaccg ggttttgccc tgctaagatg 6841 caatgatacc aattattcag gctttgagcc caattgttct aaggtagtag ctgctacatg 6901 tacaaggatg atggaaacgc aaacctccac ttggtttggc tttaatggca ctagggcaga 6961 aaatagaaca tatatctatt ggcatggtag ggataataga actatcatta gcttaaacaa 7021 gttttataat ctcaccgtac attgtaagag gccaggaaac aagacagttg taccaataac 7081 actcatgtca gggttagtgt ttcactccca gccaatcaat agaagaccca ggcaagcatg 7141 gtgctggttc aaaggcgagt ggaaggaagc catgaaggag gtgaagctaa cccttgcaaa 7201 acatcccagg tataaaggaa ccaacgacac agaaaaaatt cgttttatag cgccaggaga 7261 acgctcagac ccagaagtgg catacatgtg gactaactgc agaggagaat ttctctactg 7321 caatatgact tggttcctca attgggtaga aaacagaacg aatcagacac agcacaatta 7381 tgtgccatgc catataaagc aaataattaa tacctggcac aaggtaggga aaaatgtata 7441 tttgcctcct agggaaggac agttaacctg caactctaca gtgaccagca taattgctaa 7501 cattgacgga ggagagaacc agacaaatat tacctttagt gcagaggtgg cagaactata 7561 ccgattagaa ttgggggatt ataaattgat agaagtaaca ccaattggct ttgcacctac 7621 accagtaaaa agatactcct ctgctccagt gaggaataaa agaggtgtat tcgtgctagg 7681 gttcttaggt tttctcacga cagcaggagc tgcaatgggc gcggcgtcct tgacgctgtc 7741 ggctcagtct cggactttat tggccgggat agtgcagcaa cagcaacagc tgttggacgt 7801 ggtcaagaga caacaagaaa tgttgcgact gaccgtctgg ggaacaaaaa atctccaggc 7861 aagagtcact gctatcgaga aatacttaaa ggaccaggcg caactaaatt catggggatg 7921 tgcgtttaga caagtctgcc acactactgt accatgggta aatgacacct taacgcctga 7981 ttggaacaac atgacatggc aggaatggga gcaacgaatc cgcaacctag aggcaaatat 8041 cagtgaaagt ttagaacagg cacaaatcca gcaagaaaag aacatgtatg aactacaaaa 8101 attaaatagc tgggatgttt ttggcaactg gtttgattta acctcctgga tcaaatatat 8161 tcagtatgga gtttatatag tagtaggaat aatagtttta agaatagtaa tatatgtagt 8221 acaaatgtta agtagactta gaaagggcta taggcctgtt ttctcttccc cccccgctta 8281 cttccaacag atccatatcc acaaggaccg ggaacagcca gccagagaag aaacagaaga 8341 agacgttgga aacagcgttg gagacaattg gtggccctgg ccgataagat atatacattt 8401 cctgatccgc cagctgattc gcctcttgaa cagactatac aacatctgca gggacttact 8461 atccaggagc ttccagaccc tccaactaat ctcccagagt cttcggagag cattgacagc 8521 agtcagagac tggctgagat ttaacacagc ctacctgcaa tatgggggcg agtggatcca 8581 agaagcgttc cgagccttcg cgagggctac gggagagact cttacaaacg cctggagagg 8641 cttctggggg acactgggac aaattgggag gggaatactt gcagtcccaa gaaggatcag 8701 gcagggggca gaaatcgccc tcctgtgagg gacggcggta tcaacaggga gattttatga 8761 ataccccatg gagagcccca gcagaagggg agaaaggctc gtacaagcaa caaaatatgg 8821 atgatgtaga ttcagatgat gatgacctag taggggtccc tgtcacacca agagtaccat 8881 taagagaaat gacatatagg ttggcaagag atatgtcaca tttgataaaa gaaaaggggg 8941 gactggaagg gctgtattac agtgatagga gacgtagagt cctagacata tacttagaaa 9001 aggaagaggg aataattgga gactggcaga actatactca tggaccagga gtaaggtatc 9061 caaagttctt tgggtggtta tggaagctag taccagtaga tgtcccacaa gagggagatg 9121 acagtgagac tcactgctta gtgcatccag cacaaacaag caggtttgat gacccgcatg 9181 gagaaacatt agtttggagg tttgacccca cgctagcttt tagctacgag gcctttattc 9241 gatacccaga ggagtttggg tacaagtcag gcctgccaga ggatgaatgg aaggcaagac 9301 tgaaagcaag agggataccg tttagctaaa aacaggaaca gctatacttg gtcagggcag 9361 gaagtaacta acagaaaaca gctgagactg cagggacttt ccagaagggg ctgttaccag 9421 gggagggaca tgggaggagc cggtggggaa cgccctcata ctttctgtat aaatgtaccc 9481 gctactcgca ttgtattcag tcgctctgcg gagaggctgg cagattgagc cctgggaggt 9541 tctctccagc actagcaggt agagcctggg tgttccctgc tagactctca ccagtgcttg 9601 gccggcactg ggcagacggc tccacgcttg cttgcttaaa agacctctta ataaagctgc 9661 cagttagaag ca // LOCUS HIVBOS1 253 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 1, envelope region. ACCESSION M27910 KEYWORDS . SOURCE Human immunodeficiency virus type 1 (isolate 1153), clone 1153-1. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 253) AUTHORS Ou,C.-Y.et.al. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer-readable form by C.-Y. Ou, CDC, Atlanta. The sequence below, designated 1153-1, appears to be a major form of 11 sibling sequences taken from a Boston isolate. BASE COUNT 98 a 46 c 54 g 55 t ORIGIN Approximately 1170 nt from env start (BRU coordinates). 1 ttcaacacaa ctgtttaata gtacttggac ttggaatggt actgaaaggt caaataacac 61 tgaaggaaat atcacactcc catgcagaat aaaacaaatt ataaacatgt ggcaggaagt 121 aggaaaagca atgtatgccc ctcccatcag aggacaaatt agatgttcat caaatattac 181 agggctacta ttaacaagag atggtggtaa tgagaccaac gagaccgaga ccttcagacc 241 tggaggagga aat // LOCUS HIVJFL 2553 bp ss-RNA VRL 04-AUG-1990 DEFINITION Human immunodeficiency virus type 1 proviral env gene, 5' end. ACCESSION M31451 KEYWORDS envelope protein. SOURCE Human immunodeficiency virus type 1 (isolate JFL) proviral DNA, clone I-BL2. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 2553) AUTHORS McNearney,T., Westervelt,P., Thielan,B., Trowbridge,D.B., Garcia,J., Whittier,R. and Ratner,L. TITLE Limited sequence heterogeneity among biologically distinct human immunodeficiency virus type 1 isolates from individuals involved in a clustered infectious outbreak JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 1917-21 (1990) STANDARD full staff_review COMMENT This sequence was kindly provided prior to publication by Dr. Lee Ratner, Washington University School of Medicine, St. Louis. Clone I-BL2 and twelve other clones sequenced from seven isolates, taken from one donor and two TA-AIDS recipients, all have pre-mature stop codons in env. The isolate itself is described as monocytropic. FEATURES from to/span description pept 1 1851 envelope polyprotein 1855 1992 envelope polyprotein 1996 > 2553 envelope polyprotein site 1852 1854 envelope polyprotein in-frame stop codon site 1993 1995 envelope polyprotein in-frame stop codon BASE COUNT 890 a 435 c 603 g 625 t ORIGIN Env start codon. 1 atgagagtga aggagatcag gaagaattgt cagcacttat ggagatgggg caccatgctc 61 cttggaatgt tgatgatctg tagtgctgca ggaaaattgt gggtcacagt ctattatggg 121 gtacctgtat ggaaagaagc aaccaccact ctattttgtg catcagacgc taaagcatat 181 gacacagagg tacataatgt ttgggccaca catgcctgtg tacccacaga ccctaaccca 241 caagaagtaa aattggaaaa tgtgacagaa aattttaaca tgtggaaaaa tgacatggta 301 gaacagatgc atgaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaaa 361 ttaaccccac tctgtgttac tttaaattgc attaatttga agaatgttac taatatcaat 421 aatagttggg gaaagatgga ggaaggagaa ataaaaaact gctctttcaa taccaccaca 481 agcataaaaa ataagatgca gagagaatat gcactttttt ataaacttga tgtagtacca 541 atagataagg ataatacaag ctatacattg ataaattgta acacctcaac cattacacag 601 gcctgtccaa aggtatcctt tgaaccaatt ccaatacatt attgtacccc ggctggtttt 661 gcgattctaa agtgtaatga taagaagttc aatggaaaag gaccatgtaa aaatgtcagc 721 acagtacaat gtacacatgg aattaggcca gtagtgtcaa ctcacttgct gttaaatggc 781 agtctagcag aaggagaggt agtaattaga tctgaaaatt tcatggacaa tgctaaaacc 841 ataatagtac agctgaataa atctatagca attaattgta caagacctaa caacaataca 901 agaaaaagta tcactctagg accagggagg gcattttata caacaggaga cataatagga 961 gacataagac aagcacattg taaccttagt agagtacaat ggaataacac tttaaaacag 1021 atagttataa agttaagaga acagtttggg aataaaacaa tagtctttaa gcaatcctca 1081 ggaggggacc cagaaattgt aatgcacact tttaattgtg gaggggaatt tttctactgt 1141 aattcaacac agctgtttaa tagtacttgg ctgtccaatg gtacttggaa tgatactgaa 1201 gagtcaaaag acactagaga acctatcata ctcccatgca gaataaaaca aattataaac 1261 aggtggcagg aagtaggaaa agcaatgtat gcccctccca tcaacggaca aattagatgt 1321 tcatcaaata ttacagggct actattaaca agagatggtg gtaaagagaa tggggccgag 1381 atcttcagac ctggaggagg aaatatgagg gacaattgga gaagtaaatt atataaatat 1441 aaagtagtaa aaattgagcc attaggagta gcacccacca aggcaaagag aagagtggtg 1501 cagagaaaaa aaagagcagt gggaacgata ggagctatgt tccttgggtt cttgggagca 1561 gcaggaagca ctatgggcgc agcgtcgatg acgctgacgg tacaggccag actattattg 1621 tctggtatag tgcaacagca aaacaatttg ctgagggcta ttgaggcgca acagcatctg 1681 ttgcaactca cagtctgggg catcaagcag ctccaggcaa gagtcctggc tgtggaaaga 1741 tacctacagg atcaacagct cctggggatt tggggttgct ctggaaaact catttgcacc 1801 actactgtgc cttggaatgc tagttggagt aataaatctc tggatgagat ttaaaataac 1861 atgacctgga tgcagtggga gagagaaatt agcaattaca caagcctaat atacacctta 1921 attgaagaat cgcagaacca acaagaaaag aatgaactag aattattgga attggataag 1981 tgggcaagtt tgtgaaattg gtttgacata acaaactggc tgtggtctat aaaaatattc 2041 ataatgatag tagcaggttt agtaggttta agaatagttt ttgctgtgct ttctatagtg 2101 aatagagtta ggcagggata ctcaccattg tcattccaga cccgcctccc aaccccgagg 2161 ggacccgaca ggcccggagg aatcgaagaa gaaggtggag agagagacag agacagatcc 2221 ggttacttag tgactggatt cttagcactt atctgggaag acctgtggag cctgtgcctc 2281 ttcagctacc accgcttgag agacttactc ttgattgtaa cgaggattgt ggaactcctg 2341 ggacgcaggg ggtgggaagc cctcaaatat tggtggaatc tcctacagta ttggagtcag 2401 gaactaaaga atagtgctgt tagtttgctt aacgctacag ctatagcagt agctgagggg 2461 acagatagga ttatagaagt agtacaaaga attggtagag ctatcctcca catacctgta 2521 agaataagac agggcttaga aagggctttg cta // LOCUS RATPKATA1 4644 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene, exons 1-5. ACCESSION D90055 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 1 of 4 SOURCE Rat (Sprague-Dawley) liver DNA, clone PT147. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4644) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of twO closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1 Chuo-ku, Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 The sequence 1487-1594 encodes presumptive leader peptide. FEATURES from to/span description pept 1487 1687 3-ketoacyl-CoA thiolase, exon 1 1827 1920 3-ketoacyl-CoA thiolase, exon 2 2800 2857 3-ketoacyl-CoA thiolase, exon 3 4064 4143 3-ketoacyl-CoA thiolase, exon 4 4381 + 4423 3-ketoacyl-CoA thiolase, exon 5 pre-msg 1373 > 4644 3-ketoacyl-CoA thiolase mRNA and introns IVS 1688 1826 3-ketoacyl-CoA thiolase, intron A IVS 1921 2799 3-ketoacyl-CoA thiolase, intron B IVS 2858 4063 3-ketoacyl-CoA thiolase, intron C IVS 4144 4380 3-ketoacyl-CoA thiolase, intron D signal 1223 1228 GC box signal 1263 1268 GC box signal 1295 1300 GC box signal 1346 1351 GC box signal 1202 1206 CAAT box signal 1217 1221 CAAT box signal 1327 1331 CAAT box signal 1238 1244 enhancer core signal 1464 1470 enhancer core BASE COUNT 1013 a 1303 c 1238 g 1090 t ORIGIN 1 ggatcccacg cggggcctcc cgcagacatg gcaagcaacc ctgggccccg gttctgtacg 61 gtgcgctcgc ttgttgagct tcctacttcc cttaaggctc ctgccaaccc ctcctccttt 121 cgctttccga gaagctccgc cccttgctag aatctggaat ccggacccac aacacgcagc 181 ctgaaattca acccacttga gttctcacct ccccttgaag gtgtgagagc ttatagaaac 241 cgcctctgac gtccgccttg gcccaccgac ctgtcctggg cgatctggta gtggtggcta 301 tgcgtggtga agtgattcag ccttgtttag gacttgacag gagttacagc tcttccgcac 361 agtaatttag attctggtaa tagggaggga agagacggac tggcttcgta agcagtaacc 421 tgaaggctct gtctacctta gccatctggg acaagtggtg accgtgcttt ttaaagggaa 481 agtccacgac cttgcccgag aatagacatt tagggcatcc tgtcctttgt tatcttagac 541 agacaacact accttccgag ggtctgcaaa aagcacacgg acctatcact ttactcttta 601 acgttgggct tagaccaagg aagagtgttt cgttatttcg ctcggttaat ttacttaggt 661 gtcaatcacc agtactagat ctcccccttc tcctcccccg ggcttagtag aattccttgg 721 ggaaagtggg cattaatggc gttatcccaa acgaacgtag aacaggctta agacatgtgg 781 ggcagtgcca cagttagaac ctgcctgcca ttcagcacct tcacaccagc cggacaccca 841 aaattaattt gttttctgag agagcttctg tgacagccca atagggttgg ggttgaacta 901 agtacctcaa cattatctag gtcctcggtt ctgggttttg agacccgggg ccgagcacag 961 caaattttga ggagaaacaa gtgtcttccc tggctcttcc agcaaaggga gaagccctga 1021 gtcgagctcc gccttccacg ccgctgcgcg gtgacctcag aaggagggac tccctcgaga 1081 ctgagttcct cttaggaaac cagaagtttg gggcgcctcc tttccgtgag tctaccaaag 1141 gttgtgcaaa cactagtttc cagtctgcct ttctagggga ggattaagag tacgccttat 1201 gattggcttc cgtaagattg gcgggcgggg ctttgcgtgg aaagagtctt aggacgggtc 1261 ttgggcgggg ctctggggca aggggtgagt gggtgggcgg ccctcaagca gacacccctt 1321 aatctaattg gttgtgtcta agagtgggcg ggccctctaa agaccccatc gcaccactct 1381 cagcattatc ctacccgagc tctgatccac tttcaggcct cgtgaggtag agggctggcc 1441 tgcgcctgcg cctgccatca ttttggtttg ttaagcaagg cagagcatga gcgagtcggt 1501 gggacgcacc tccgcgatgc atcggctgca ggtagtgctg ggccacctgg ccggccgacc 1561 cgagtcgagc tccgcgctgc aagccgcgcc ctgctccgct accttcccgc aggcttcggc 1621 ctccgacgtg gtggtggtgc acggacggcg cacccccatc ggccgcgccg gccgcggcgg 1681 cttcaaggtg aggcccctcg ggcgggcgag aggagcagaa ggcggccaga gtgggtggga 1741 tcctgccctc gggggtccgg tgagcgcaga gacggcgggt ggggatgaag gagttgggct 1801 caccgctgct gcacctcgcc ctgcaggaca ccacccccga cgagcttctg tcggccgtgt 1861 tgaccgcggt tctccaggac gtgaagctaa agcctgagtg tttgggagac atctctgtgg 1921 gtgagctccc ctgagggtcc tgcaccaccc cacccagcct ctctttctta ccagaacctg 1981 ggtttcactg agcaagacct acggattcca ccgatttcag tgggatggtc aaagggccct 2041 tttacagggt tcgcctaaga tcattgggaa acacagacat ctacattatg gttcataaca 2101 gtagcagaat tacagttctg aagtagcaat gaaaaataat tttgtggttg ggggtcacca 2161 cagcatgagg aactgtatta aaagggtcac agcactaaga tagttgagaa ccattgatct 2221 aaagcccaga agttcctgga gacttccagg aggtgagaaa acgcagttct ctacctacat 2281 gaacatcctg acccgggtac gggctgcaga tgggttcacc ccaccctgcg ctatccttga 2341 gttacaacag agtataggac atggttttgt ccccaggtgc cagcctgctc ctcgtgcctg 2401 caccatcgca gggtttggag gaggatactt ggggaggtga acagttgcag gatctgacct 2461 ctttcaccca cagcccgtct tctcacagcg ttcaggctgt gcactgtgtc tcttacccta 2521 gtgtctctcc tactaccgag caagcacccc actccccacc ccaaccgtga tttagaagag 2581 cataccttgg ccccttcagc actgggctgg tagatgactg gctgttgggt agcttggctc 2641 cccattttcc agactccaga cttctatagc ttcacacgag cctgggaaaa tcctaaagtg 2701 atccgttaga aagatctgga atgtttcctc tctggaactc acagctgaac ctcagtgtgc 2761 agaacactac ttggtcctga ataaggtttc tatctacagg taacgtactt gagccaggag 2821 ccggagcagt catggcgcgc attgcccaat ttctgaggta acccttctac ctctggtggg 2881 ggattctacc accatgaacc agtccgtagg tgacaggtta tcaggttgga cagtagatag 2941 caagtcagcc agaatccttg gacccaacag tgctcctact gggttaacca acccagaagc 3001 aggataccca ggctttggga ccagagccaa ggtgaccaga tgcttcccag ggccgaactt 3061 ttgttaccct tggaagcatg gactctgacc tcatatgtca gataaataat cacgtgacca 3121 cttgatcagg atctttactt catttgctac ttgagctgcc tgagaaccct gggctcagtc 3181 agttatcaga agcttgacgg tagtggggtg ggggaagcag ggtctggaca tcttggccct 3241 tcaggtaggg aagcagtagc tcatttgcag gattaagagc agcagtagcc agagtccaaa 3301 ccggggaaaa ccttttgtct agagacacgg aaagggtcag acttagcctt cctgtccctc 3361 ttgtaacttt cctgttcatc tctcacctgc ttcttcattt tgactttccc ttctctgctg 3421 tactggccac cctctcccta tacactcgct ctgcctattc tgtcccaata gagagctctc 3481 tccctgctca cctagctctg gccagtccct gcccagacac ctcggaagct ctcaagcatc 3541 agatgagtga ccttcttgtc caccaagtca ctatgtgcct ccctgtcgct aagacacttg 3601 caatggaggg tctggaaaca catgcctgct accttgagct tctaagctca ggcaggggga 3661 gctcgtttgg tccacccaca ggcactcttc ctctccttca gaagttttga caaaggaaag 3721 gtagacatct ttgcccacaa agagactagc caccagccta agagctaggt ctcatgggca 3781 aggcttccaa ccatctcctc agttgcaagt ggctgtagcg acagggtttg cctagtagtt 3841 gacccctctg catctatcta ccctagtggg agcaaggcca gctgttacct cctggtctgt 3901 acactgctgt gggccataat gggaccctaa agactcgtgg gtcctgcttt ccatgacaca 3961 gggtcgagtc cggggctgac cacacatcca aggatgacta accaccatct cccccaggga 4021 tggctgcaca tccaggctga ccgccatctc ccctctgttt cagtggcatc ccagagaccg 4081 tgcctctgtc agcagtcaac agacagtgtt catcgggact gcaggcagtg gccaacattg 4141 ctggtaagta gtgattcctg ggtgcccatg agtggaccct gaccttggtc cttcctcagg 4201 accctaggca ggaagagtgg aggggcaccc attgccacac cccaagcctt tttgggtgct 4261 cccgggtctt ccatgaggct ccgttgaatg cacccctgag aacagggagg gaacgggctg 4321 tgctgctccc agggcgggcc cgctccggcc aaggataatt atggagtgct ttcgttgcag 4381 gtggcatcag aaatgggtct tacgacattg gcatggcctg tgggtaagat ggccttccgc 4441 cgggccttat taaccaacag caataatcaa actaatttct ctctcagttc ggaggggccg 4501 tgattccagg cttctccccg cccacgaaga cagctgaata gggtgggctt ccagaatgtg 4561 tctaggagtg gcccaaccct ggctgctcag gaatgggcca atcttagcac aagcagtaac 4621 cggaattggg tcctctaagg tacc // LOCUS RATPKATA2 1205 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene, exon 6. ACCESSION D90056 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 2 of 4 SOURCE Rat (Sprague-Dawley) liver DNA, clone PT147. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1205) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 283 + 381 3-ketoacyl-CoA thiolase, exon 6 pre-msg < 1 > 1205 3-ketoacyl-CoA thiolase mRNA and introns IVS < 1 282 3-ketoacyl-CoA thiolase intron D IVS 382 > 1205 3-ketoacyl-CoA thiolase intron E BASE COUNT 241 a 351 c 302 g 311 t ORIGIN 1 ggtaccagta ggctgtgcta gggactgcct gtttcctgca tctgccctgg gagtgtgtct 61 ggtagagtgc cgggtcctaa gcttgaggga aagaaggctc ggctaccttg taacctccag 121 ctgcccttct caggcaggct tcctagagag cttacctcac tccccaaggg ctccctgggc 181 ctttattccc tagcacttgt cacacagatg gctgtcccta gggcttccta tgaaaactga 241 gagctcattc cttcttcttc tccccctctt cctcctcctc agggtggagt ccatgtccct 301 gtctaacaga gggaaccctg ggaatatttc ctcccgcctg ctggagagtg acaaagccag 361 agactgcctg attcctatgg ggtgagtgtt catgggaagc tgtgtatgta tccactcacc 421 gttctgcctg tccttcacca gacagcagca tactcttctc tgactggcag tactcagctt 481 cctcttcttc ctctgactgt ctcatgctcg acgggaactg ggtatgagat caatgctgcc 541 cttgacctcg ctttcctgag gtccccccta tacaacacac acacacacac acacacacac 601 acacacacac acactcgttc ctctccaact ctacaaatgt ccaaatgcac ctcactctac 661 attactgtgc ctgaggtgcc tgtgtcttag ggaaggcagt gggagtcctc aaagccttgg 721 gaggcctcaa tagttgaaga cacggggagc ttggcttttg ttattgcggg ggagggctcg 781 ggctctgaga taccttgtaa gaagtctgtt ccatctctcc agtgctttat aattctgcat 841 ccgtgtctct gagctctcct cagtttgtcc aagggcaagg cattctggga ggctcaccat 901 aggagaacca taggctggcc tggctttctc cacatcctgt gtgtttccaa gagcagccta 961 gaagccgcag agaaactggg gccagatagt gtccccagca tcctcctgcc tatgtcttat 1021 gggcagagac gactcacccg gggggaacag ttccttctgt aggccccacg ttcaggctgc 1081 tcttcattct ttgaggtgtg agtctttgag catccgtgga catgggcgct cagtatttaa 1141 gatgagccag gagtccctgg ggcgtggact gaagctctgt ctctgacgtc gacgccagag 1201 gtcca // LOCUS RATPKATA3 470 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) A gene, exon 7. ACCESSION D90057 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 3 of 4 SOURCE Rat (Sprague-Dawley) liver DNA, clone PT147. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 470) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 80 + 160 3-ketoacyl-CoA thiolase, exon 7 pre-msg < 1 > 470 3-ketoacyl-CoA thiolase mRNA and introns IVS < 1 79 3-ketoacyl-CoA thiolase intron E IVS 161 > 470 3-ketoacyl-CoA thiolase intron F BASE COUNT 120 a 91 c 154 g 105 t ORIGIN 1 cgcaagccta gtccttcatc atgtccctgg ccctcaggca ggcagaccca cggcctctgc 61 tttcctgtgt gtgcactagg ataacctcgg agaatgtggc tgagcggttt ggcatctcac 121 ggcagaagca agatgccttc gcgctggcct ctcagcagaa gtgagtgcag agggatgggc 181 tgagggtggg gctggagagt ggatcgggta gagttttgcc aagaaacaaa ctgtaggttg 241 atgtccttat gcaggtgaag gcaggtacag gatagatcga agcctgtcgt tagatgagaa 301 ggaaggatga gcaggagaaa agttttaggg aggaggagac cggagagagg aaagagcagc 361 caggagaact tggaggcaga tgttaacatt tctctctgca catttacagg tcgttatgac 421 tgttcttaag ggatggatac agggctttgt atgtctcggt gaacaataac // LOCUS RATPKATA4 2264 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketocayl-CoA thiolase (EC 2.3.1.16) A gene, exons 8-12. ACCESSION D90058 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 4 of 4 SOURCE Rat (Sparague-Dawley) liver DNA, clone PT147. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2264) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 301 491 3-ketocayl-CoA thiolase, exon 8 633 812 3-ketocayl-CoA thiolase, exon 9 1089 1144 3-ketocayl-CoA thiolase, exon 10 1230 1375 3-ketocayl-CoA thiolase, exon 11 1696 1771 3-ketocayl-CoA thiolase, exon 12 pre-msg < 1 2030 3-ketocayl-CoA thiolase mRNA and introns IVS < 1 300 3-ketoacyl-CoA thiolase intron F IVS 492 632 3-ketocayl-CoA thiolase intron G IVS 813 1088 3-ketocayl-CoA thiolase intron H IVS 1145 1229 3-ketocayl-CoA thiolase intron I IVS 1376 1695 3-ketocayl-CoA thiolase intron J signal 2012 2017 polyadenylation signal BASE COUNT 446 a 615 c 703 g 500 t ORIGIN 1 gagtctgctc agtgaaaccc aggttctggt aagaaagaga ggctgggtgg ggtggtgcag 61 gaccctcagg ggagctcacc cacagagatg tctcccaaac actcaggctt tagcttcacg 121 tcctggagaa ccgcggtcaa cacggccgac agaagctcgt cgggggtggt gtcctgcagg 181 gcgaggtgca gcagcggtga gcccaactcc ttcatcccca cccgccgtct ctgcgctcac 241 cggacccccg agggcaggat cccaaacatg cctatgtctt actgcccacg gtgctggcag 301 ggcagcaagt gcccagagca aaggctgctt ccgtgctgag atcgtacctg tgacaaccac 361 tgtcctcgat gacaagggtg acaggaaaac catcaccgtg tctcaggatg agggtgtccg 421 ccccagcacc accatggagg gcctggccaa gctgaagcct gccttcaagg atggaggctc 481 taccacggct ggtgagcgtg gccaggagta gggccaatct cagtggcttg ttctactctg 541 agacctggag atgaccaggc cgcctgggaa ggtgatggtg gggggctggc tctcctcagg 601 cctgctgacg tctcctggtc caccttctcc aggaaactcc agtcaggtga gtgatggagc 661 agccgccgtc ctgctggccc ggaggtccaa ggctgaagaa ctgggcctcc ccatccttgg 721 cgtcctgagg tcctatgcag tggtcggggt ccctcctgac atcatgggca tcggacctgc 781 ctatgccatc cctgcggcct tgcagaaagc aggtgaggtg gctccttctc accctgtgct 841 tggatccttc attacctgga tctgggactg gggagtgctg gggtttgacc ttctgggtgc 901 tgtggatgga gcgggctagc accctacacc caagaccttc ctggcactac agctggctgg 961 gtctcactct cttgtcccta gaagacaaac atcagactcc tgggatctag gcttgggata 1021 gagtcctggg ggagtccaaa gtcctgccca gtggttgttg tcaattctgg cctatgcctc 1081 acccacaggg ctgactgtga atgacataga catctttgag atcaatgagg cctttgcaag 1141 tcaggtgagc ctgggtgttg tggtaggatt gactgagcca gcagttcaga gcagatggag 1201 actttagctc ctgcgcttgc tccttccagg ccctctactg tgtggagaag ctgggaattc 1261 ctgcagagaa ggtgaacccc ctggggggtg caatagccct gggccacccc ctgggctgca 1321 ccggagcaag gcaggtggtc acgctgctca atgagctgaa gcgccgaggc agacggtaag 1381 gctgctcctc gtggggtgca gtgggggtta ttagagctgg ggtgtctgat agctggggcg 1441 tggggaggtg agcacacacg caggggttct gaacttggga atggaggggt gagagcctgt 1501 cttccagaag ccttgcttca aaaacagcat aaccatgcat gccacctggg tgtgtgtctg 1561 tccggggctc ggttcccaca gatctgctgt gctggcttgg actctgccca tgaacccttg 1621 tgactgcagc ctgcctgatg ggagcatggg ttggggtttg gctgactcga gccaacactg 1681 gttctttttc cccagggctt atggcgtggt gtccatgtgc attgggactg ggatgggagc 1741 cgctgctgtc tttgaatacc ctgggaactg aggccctgac tgcaggcact acccagagag 1801 tcctatagta gtgtctggag agggatggta caggagccat cttcgtggga cactcagcag 1861 tggagggatt tgtcacagca cttcaattca gaagatgtag tcgatgttgg aacaggaggt 1921 ggaactgccc tgtcaagtac cccaagccat gctaaagtga gcatgggaca cccaggttgc 1981 aaagccatct gtacctctga cggatgggtg tagtaaatgt gtgctgtctc attttggtgg 2041 ctgtcatatc cttcgcaggc agctttcgcc tcctggattc tgtgtataaa tgattaagaa 2101 ccagggtgtc tccggtgaga tctagctgta tgcaggctct ggggacctgt catgaaagac 2161 ctcagctgac accattgccc tagaagaggc tcccccatcc caggagtcat ctagccccta 2221 gccacccccc cccccgctct ccagctgtgg gctatgcaga attc // LOCUS RATPKATB1 2234 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exons 1-3. ACCESSION D90059 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 1 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clone PTB1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2234) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 The sequence, 1251-1328 encodes leader peptide. FEATURES from to/span description pept 1251 1421 peroxisomal 3-ketoacyl-CoA thiolase, exon 1 1562 1655 peroxisomal 3-ketoacyl-CoA thiolase, exon 2 1985 + 2042 peroxisomal 3-ketoacyl-CoA thiolase, exon 3 pre-msg 1223 > 2234 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns IVS 1422 1561 peroxisomal 3-ketoacyl-CoA thiolase intron A IVS 1656 1984 peroxisomal 3-ketoacyl-CoA thiolase intron B signal 848 854 enhancer core signal 1037 1045 enhancer core signal 1114 1118 CAAT box signal 1163 1167 CAAT box signal 1192 1198 TATA box homolog site 1099 1108 NFK-B site BASE COUNT 460 a 614 c 647 g 513 t ORIGIN 1 agtgctctta accactgagc tatctctcca gccctccact ttattttctg aggcggtatc 61 tctaaacacg gtgatttgac tagggtaggg agccactctg cccctcggat cctctgcctt 121 ccccacctaa gcactaagga agcctgccta gcatcgatgt gtgcagtggg gatctgatct 181 ctggtctccc acttatgggg tcgagcactt taccctctga gccctctcgc actcccccct 241 cctttcgagt tttgggctct cctggggact gagaaaaccc tgggtgattg tcgtggtttg 301 gggctgacct gtcactgtga ttggggaacc cagccttact ctttaaggtt ggtaaaacat 361 tctagatcca gggagcccca attcatccac tcagcaggaa accattgacc agctaatgta 421 tcaggcacac tgcagacacc tctgctcagg ggggaggaga gtgggcgctg caggggtgtc 481 catagatgag agcatttcaa atgggcatcc agaggtgtca agacccgcag gacactctca 541 gagacctttg aaccacttca gtgctttaat aaaaccggtt cttccagcct gttagctgcc 601 aaagattagg cctgagaatc tcaattactc aagtatccag cttggggcca gggacccagg 661 agagacaaga ggcaaatgga acagaaaaca gagtcgggac ctctatctcc attcatagtg 721 taaagctgtt accatggctc cagaacctcc tgccttttgt gtcatagtgt gcgatggtca 781 tatttgaggg agggtatcct ggttcatgtt cagaaaccca gaagacttcg aggtcctacc 841 tacagtgtgg attgcaaaat gctagaaacg acaggaaagt tccagacact ggaactggga 901 ggggtttcaa aagtgcgcac aaactttttt ttaaattata tttgaaaaaa aaatcttatt 961 ttgtagccca gactggcctt gaacttgcgc agaaaccact tgcaccatca caggtgcaga 1021 cggatgagct taaggagtgg ttttgaagtt tatttgacca gcattgagaa gctgcgaggt 1081 ggtggtggga tggatggtgg gactttcctt ctgattggct gtagaatggg taggcgggac 1141 ttatgaaaaa tgcaaggctc ccattggctc tgccctgtgc cagccggtgt cttttattct 1201 ctggggtcag cacccagagc ccaggctaat cggtgggacg cacctccgcg atgcatcggc 1261 tgcaggtagt gctgggccac ctggccggcc ggtccgagtc gagctccgcg ctgcaagccg 1321 cgccctgctc tgcgggcttc ccgcaggcct cggcctccga cgtggtggtg gtgcacggac 1381 ggcgcacccc catcggccgc gccggccgcg gcggcttcaa ggtgaggccc ctcgggcggg 1441 cgaggggagc agaaggcggc catagtgggt gggatcctgg ccctcggggg tccggtgacc 1501 gcagagacgc cgggtgggaa tgaaggagct gggctcaccg ctgttgcgcc tcgccctgca 1561 ggacaccacc cccgacgagc ttctgtcggc cgtgttgacc gcggttctcc aggatgtgaa 1621 gctaaagcct gagtgtttgg gagacatctc tgtgggtgag ctcccctgag ggtcctgcac 1681 cggctcttcc ccgccctcct ccttcccggg atccgtctct cacctcacag agcagagtta 1741 ctgaagggca ggagtctgga cctctgaggt ctgaaccagg ctgcccccga tggccagtct 1801 caagcctaga accttgggcg ccccctcgtg gcacatctgc caactgagag gcgctggaaa 1861 gcgggaacct ctctcgaaag gctgtgattc ttggagagat ctggactgtc tcctgtctgg 1921 ggcatgtttt acttcaggcc tcgggacacc acctggcact gagtcaggac tcttctgtcc 1981 acaggcaatg tgcttcagcc cggggccgga gccgcaatgg cacgcatcgc ccagttcctg 2041 aggtaatctt tctggatccg tcttagttgt ccaccatgac tgccgaaggc ccttggcact 2101 gtgtgtgtca cggtggaaga tcaggttgaa tgtagagact gaggcaactg gagtccactg 2161 acctgacctc tgctcgataa ccttttccct acttagaggg agtgtgttgc tggcccatgg 2221 ccaaggcgcc cggg // LOCUS RATPKATB2 913 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exon 4 and 5. ACCESSION D90060 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 2 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 913) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 353 432 peroxisomal 3-ketoacyl-CoA thiolase, exon 4 673 + 715 peroxisomal 3-ketoacyl-CoA thiolase, exon 5 pre-msg < 1 > 913 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns IVS 433 672 peroxisomal 3-ketoacyl-CoA thiolase intron D BASE COUNT 173 a 234 c 275 g 231 t ORIGIN 1 tgtctcagat gagcacaaga aatgctgttc catctcagag tcttcaagat agaggaaaga 61 ggttgtctgg gcccacggca ggcctccctg gtccctcttg agaactatgg gacagatgag 121 aatgtgcttg ggggctcttg tgtgtagcta cggagacaga tctgcacagt agctgcttct 181 gctcttgtgg atctgctcca gtgggggttg cctgttgtcc tgagctatat gcacactgct 241 ggggctcttg tgggggccca gaagtctggt ggggcctgct ttccatgaga gagggtcatg 301 tccagggatg gctgcacgtc cagggctgac cgccatctcc cctctgtttc agtggcatcc 361 cagagaccgt gcctctgtca gcagtcaaca gacagtgttc atcgggactg caggcagtgg 421 ccaacattgc tggtaagtga tgattcctgg gtgcccatga gtggaccctg accttggtcc 481 ttgaccctca ggaccctagg caggaagagt ggagggacac ccattgccat accccaagcc 541 tttttgggtg ctcccggggt cttccgtgag gctccgttga atgcactccc gagaacaggg 601 agggaacggg ctgtgctgct ccagggcggg cccgttccgg ccaaggataa ttatggagtg 661 ctcttggtgc aggtggcatc agaaatgggt cttacgacat tggcatggcc tgtgggtaag 721 atggccttcc cgccgggctt attaaccaat gcaataatca aactaaaatt tttctctcct 781 tgactgtgat tctccttgcc cactgtgaca gctggctggg ctgggctccc agggatgcag 841 cccaggttgt ggctctgagt atcgtgacct cttttgttct cacaactttg cttgtcacat 901 tggggatctg cag // LOCUS RATPKATB3 145 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exon 6. ACCESSION D90061 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 3 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 145) AUTHORS Hijikata,M., Wen,J.-k., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 22 + 120 peroxisomal 3-ketoacyl-CoA thiolase, exon 6 pre-msg < 1 > 145 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns BASE COUNT 32 a 38 c 41 g 34 t ORIGIN 1 gagctcactc tttctcctca gggtggagtc catgaccctg tctgagagag gcaaccctgg 61 gaatatttcc tcccgtcttc tggagaacga gaaggccaga gactgcctga ttcctatggg 121 gtgagtggcg acaggaatct ctcaa // LOCUS RATPKATB4 369 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exon 7. ACCESSION D90062 J05269 KEYWORDS 3-ketoacyl-CoA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 4 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clones PTB[1 and 2]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 369) AUTHORS Hijikata,M., Wen,J.-K., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute Tsukiji 5-1-1, Chuo-ku Tokyo 104 Japan Phone: (03)542-2511 Fax: (03)545-3567 FEATURES from to/span description pept + 242 + 322 peroxisomal 3-ketoacyl-CoA thiolase, exon 7 pre-msg < 1 > 369 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns BASE COUNT 66 a 94 c 123 g 86 t ORIGIN 1 ggggggaaca gttccttctg taggccccac gttcaggctg ctcttcactc tttgaggtgt 61 gagtctttga gcatccgtgg acatgggcgc tcagtattta agatgagcca ggagtccctg 121 gggcgtggac tgaagctctg tctctgacgt ggacgcccag aggtgcaagt ctagtccttc 181 atcgtgtccc tggccctcag gccagcagac ccatggcctc tgctttcctg tgtcgcacta 241 ggataacctc ggagaatgtg gctgagcggt ttggcatctc acggcagaag caggatgcct 301 tcgcgctggc ctctcagcag aagtgagtgc agagggatgg gctgaggtgg gcagagatgc 361 tgagctggg // LOCUS RATPKATB5 2787 bp ds-DNA ROD 04-AUG-1990 DEFINITION Rat peroxisomal 3-ketoacyl-CoA thiolase (EC 2.3.1.16) B gene, exons 8-12. ACCESSION D90063 J05269 KEYWORDS 3-ketoacyl-coA thiolase; fatty acid beta-oxidation; peroxisomal protein. SEGMENT 5 of 5 SOURCE Rat (Sprague-Dawley) liver DNA, clone PTB2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2787) AUTHORS Hijikata,M., Wen,J.-k., Osumi,T. and Hashimoto,T. TITLE Rat Peroxisomal 3-Ketoacyl-CoA Thiolase Gene: Occurrence of two closely related but differentially regulated genes JOURNAL J. Biol. Chem. 265, 4600-4606 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Makoto Hijikata Virology Division, National Cancer Center Research Institute 5-1-1 Tsukiji, Chuo-ku Tokyo 104 Japan Phone: 03-542-2511 Fax: 03-545-3567 FEATURES from to/span description pept + 987 1177 peroxisomal 3-ketoacyl-CoA thiolase, exon 8 1326 1505 peroxisomal 3-ketoacyl-CoA thiolase, exon 9 1782 1837 peroxisomal 3-ketoacyl-CoA thiolase, exon 10 1923 2068 peroxisomal 3-ketoacyl-CoA thiolase, exon 11 2387 2462 peroxisomal 3-ketoacyl-CoA thiolase, exon 12 pre-msg < 1 2742 peroxisomal 3-ketoacyl-CoA thiolase mRNA and introns IVS 1178 1325 peroxisomal 3-ketoacyl-CoA thiolase intron H IVS 1506 1781 peroxisomal 3-ketoacyl-CoA thiolase intron I IVS 1838 1922 peroxisomal 3-ketoacyl-CoA thiolase intron J IVS 2069 2386 peroxisomal 3-ketoacyl-CoA thiolase intron K signal 2724 2729 polyadenylation signal BASE COUNT 572 a 722 c 864 g 629 t ORIGIN 1 ctcacggtgg ggcagagatg gcctgagctg gggctctcac tgtggtacta aggtgttggt 61 tagccccatg actttagctg ggaggtggaa tggttcagcc cggagtgtgg ggtgggtctt 121 cccagttctt aactattcac tggggtcggt ctctctagag acctcctcct cctctaccct 181 gcttcttagg acagaaggac ccagttctga acagacatgg tggtcactgg tgatccttag 241 gcctggagtg tggggtcagt ggtgctaaga ccggatcccc ctcctaccac aagatgaaga 301 cctgggtttg cagcggtggc ctgaggccat cattccagaa aaccagagcc ttaggcccgt 361 gggaaacaga agggcaggtt ggccactcaa atcatggatt gagaacagag ggaggatgga 421 gtgtaagcca ggtggcctca ccttttagag ttggccagca gtgtgcgttc cctggaagcc 481 gtgcgttcct gcccacaggc atgactctac ccacttgttc tgttcaccct gacaacagcg 541 tgggaatcag gaaataaatc caggaggtgc acctcgcttt taaatttaag aggggaagca 601 ggtgaagttt cgcttttagc agtcaaacat gggccttgaa gaatccggca cagacattcc 661 caggggtgca gcctcgttgt ggaagagacc tctgagcagg ctaagcagct agcggattaa 721 gaatcaggag acagaatagc agctctgaac tccctgctgt tggctgggga tgagtactgc 781 caaaccggga aaatcaggag ctgctgctga gtctgggaga gcttcgggtg cgcctttccc 841 tccctgggga atcttagctg tcactcagag ctccatcagg gccacaaagg gctgtggttg 901 cattgccttt ctgagttagc caagcccagg gactgggagg aaagactaca aaggggcccg 961 actctcgctg tccatggtgc tggcagggca gcaagcgccc agagcaaagg ctgcttccgt 1021 gctgagatcg tgcctgtgac aaccactgtc ctcgatgaca agggtgacag gaaaaccatc 1081 accgtgtctc aggatgaggg tgtccgcccc agcaccacca tggagggcct ggccaagctg 1141 aagcctgcct tcaaggatgg aggctctacc acggctggtg agcgtgggcc aggattaggg 1201 ccgatctcgg tggcttgttc tactctgaga cctggagctg accaggccgc ctgggaaggt 1261 gatagtcagg ggctggctct ctcaggccct gcctgacgtc tcctcgtccc acccttcctc 1321 cccaggaaac tccagtcagg tgagtgatgg agcagccgcc gtcctgctgg cccggaggtc 1381 caaggctgaa gaactgggcc tccccatcct tggcgtcctg aggtcctatg cagtggtcgg 1441 ggtccctcct gacatcatgg gcatcggacc tgcctatgcc atccctgcgg ccttgcagaa 1501 agcaggtgag gtggctcctt ctcaccctgt gcttggatcc ttcattacct ggatctggga 1561 ctggggagtg ctggggtttg accttctggg tgctgtggat ggagcgggct agcaccctac 1621 acccaagacc ttcctggcac tacagctggc tgggtctcac tctcttgtcc ctagaagaca 1681 aacatcagac tcctgggatc taggcttggg atagagtcct gggggagtcc aaagtcctgc 1741 ccagtgattg ttgtcaattc tggcctatgc ctcacccaca gggctgactg tgaatgacat 1801 agacatcttt gagatcaatg aggcctttgc aagtcaggtg agcctgggtg ttgtggtagg 1861 attgactggg ccagcagttc agagcagatg gagactttag ctcctgcgct tgctccttcc 1921 aggccctcta ctgtgtggag aaggtgggaa ttcctgcaga aaaggtgaac cccctggggg 1981 gtgcaatagc cctgggccac cccctgggct gcaccggagc aaggcaggtg gtcacgctgc 2041 tcaatgagct gaagcgccga ggcagacggt aaggctgctc ctcgtggggt gctgtggggg 2101 ttattagagc tggggtgtcc aatagctggg ggcgtgggga ggtgagcaca cacacagggg 2161 ttctgaactg ggaatggagg ggcgagacct gtcttccaga agccttgctt caaaaacagc 2221 atgactgtgc ctgccactgg gtgtgtgtct gtccggggct cggttcccac agatctgctg 2281 tgctggcttg gactctgccc atgaaccctt gtggctgcag ccctggctgg tgggagcatg 2341 ggttggggtt tggctactcg agccaacact ggttcttttt ccccagggct tatggcgtgg 2401 tgtccatgtg catcgggact gggatgggag ccgctgctgt ctttgaatac cctgggaact 2461 gaggccctga ctgcaggcac tacccagaga gtcctatagt agtgtctgga gagggatggt 2521 acagaagcca tcttcgtggg acactcagca gtggagggat ttgtcacagc actttaattt 2581 agaaaatgta atcgcaatac gggcacaagg gcatccaatc cactcctctc agttgaacca 2641 aagacagcgc acgtcacacg tgaccacagg acaacatgtt tctgagggac agtggcctca 2701 tgaatagtgg gcatgagggg catagtaaat gtatgttatc tcatcttgct tgcagtcatt 2761 tcccccaggc cggctcttgc tttgttg // LOCUS NPOCAP 2102 bp ds-DNA VRL 04-AUG-1990 DEFINITION Orgyia pseudotsugata polyhedrosis virus capsid protein (P39) gene, complete cds. ACCESSION M21042 M23429 KEYWORDS capsid protein; structural protein. SOURCE Orgyia pseudotsugata polyhedrosis virus DNA, clone cosmid #58. ORGANISM Orgyia pseudotsugata polyhedrosis virus Viridae; ds-DNA enveloped viruses; Baculoviridae; Baculovirus; Nuclear polyhedrosis viruses. REFERENCE 1 (bases 1 to 2102) AUTHORS Blissard,G.W., Quant-Russell,R.L., Rohrmann,G.F. and Beaudreau,G.S. TITLE Nucleotide sequence, transcriptional mapping, and temporal expression of the gene encoding p39, a major structural protein of the multicapsid nuclear polyhedrosis virus of Orgyia pseudotsugata JOURNAL Virology 168, 354-362 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by G.W.Blissard, 03-NOV-1988. The p39 protein is found in both the PDV and BV virion phenotypes; P39 mRNAs are approximately 2600 nt in length, are expressed late (24-48 hpi), and have two major transcription initiation sites. At each major transcription start site, two transcripts are initiated within the ATAAG consensus. Transcripts initiate at positions 2 and 3 of the ATAAG consensus. FEATURES from to/span description pept 124 1179 capsid protein mRNA 27 > 1179 capsid protein mRNA (alt.) mRNA 28 > 1179 capsid protein mRNA (alt.) mRNA 75 > 1179 capsid protein mRNA (alt.) mRNA 76 > 1176 capsid protein mRNA (alt.) BASE COUNT 493 a 607 c 585 g 417 t ORIGIN Map position 57-58. 1 ccggcgcagc gcaccgtttc tttatataag aatctaaaat tagatacaag aggtcttggc 61 taaaattgat tgtataagag atttcttgct ctatcaaaac gtcggccccc atcgccgcag 121 aaaatggcgc tcgtgtcacc cggggtgtcg tcgcgacggt cgaccaacca ctgcatcttt 181 ggcgcgattg aaccgttcga ctcgtgcgtc acgtacaggt cgccatgctc gtcggacgcg 241 tcggtggacg acggctggtt catctgcgac taccacctga agttgcgctt caaaatgtcc 301 aaaatggtgc tgcccattta cgacgaagac gacaaccagt acaagcgcac catagcgcgg 361 catttggtgg ggcacaaaga aaggggcgta aagcgcattc tggtgcccac cagagccaac 421 tacatgaccg tgttcaattt gccgggcatg atgctggccg agcagctgat ttttcatttg 481 atctacgaca accgtttgga agtcaatcgc atttgcgcct cgttgaaaaa caacgaaaac 541 ttcattgaca acacgtacag cgttgtggag agcgtgtaca gcgcgacgcg caacattctg 601 tcgctcacag acccgcaagc gtactgctcg cgcgtggcca acgacgacgt gcgctttttc 661 gacgcgaacg tagtcgacaa caactaccag gcgggcaacg gcgacaccgt gtttaacaac 721 atgcccgggt ttttgcgcaa cctaatcagg cgcgccgtgg cgcccgaaac gctgcagatc 781 gacagcgaag acctgcgcct gcgaaactgc aacacctgtg tgattaacaa caccggcttg 841 gtagcaaccg tgacaaacac cgagctgtac aacccggtgc gcagctcgga catcattaaa 901 acgcggccca atcgcttgca gattagaaac gttttgaagt ttgagggcga cacacgcgcg 961 ttggagcgca ccttgggccg ctacgaggaa taccccatgt acgtgccctt gtttttgggc 1021 taccagctgg tcaacctgca aaacgacatt ttgcgcgcca acaacttttt acccgcgccg 1081 tttggcgtgc cgcaggccgt aaacaacctg gaggcccaag cgcccgcagc gcccgcaccc 1141 gcagcgcccg cgcccgcgcc cgcagcgccc gttgtttaat tccgttgcca tggacacggt 1201 gaggctgcag tgccacattt gctgctcggt gggcgaaatc aaaaattatt ttttgcagcc 1261 cgtcgacgca atcaccattt tgcccattgt ggagctgcac acgtgccggc accaattgtg 1321 cgtcatgtgc gtgcgcaaaa ttgcgcagcg cggccgagac aagcgcgtcg agtgccccat 1381 gtgccgccgc aaaaacgcgc actttaacgt gtacagcgtg aaccgtaatt cggtggatgt 1441 gttgcgctgc agcgtggccg acgtgcgcga gcacgggcgc tttggcggat tggcggacgc 1501 cgcgtcgctg gctcgtgggc tgtttgagcc cagcttgctt gaagccgagc ccgcgccgga 1561 caactcgttt ggcccaaacg agttacaact tgtgctaaaa cgtctaaagg cgcaaattga 1621 ggcgcaaaca agaactaatt atgatttgca actgcaagca actgctttgg agcgaacaat 1681 cgaagaagca aacgaccgct tgggtaaaag ccgcggcgac tacagcgacg cgtgcaagct 1741 gatggacgag cttcgcggcg acaggctgcg cgcggagcgc gcggtcaagg cgctcgccga 1801 cgcgcacgcc cagtgggcgg acaagaacgc caaaatgcgg cgcgagaacg acaggttgac 1861 aaatgaaaac attggtttga ttcgggacaa caatttattt aaacaaaaca ctgcacgcaa 1921 acgtaaaatc gctccgtaac cgcctttgtt tgcgcgtcaa agtccacctc gtagacgtgt 1981 cgctcgttta cgctgccgca gcggtcccag ttgtccgtcg tgcaccacgc cgcattgtgt 2041 cggcggcaca aatacagcgc ttcgttgaat tgcgccggcg gcagattgaa gcggccgagc 2101 tc // LOCUS PT2REGA 462 bp ds-DNA PHG 04-AUG-1990 DEFINITION Bacteriophage T2 translational repressor (regA) gene, complete cds. ACCESSION M27737 KEYWORDS RNA binding protein; translational repressor. SOURCE Bacteriophage T2, cDNA to viral mRNA. ORGANISM Bacteriophage T2 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 462) AUTHORS Miller,E.S. and Jozwik,C.E. TITLE Sequence analysis of conserved regA and variable orf43.1 genes in T4-like bacteriophages JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.S.Miller, 08-SEP-1989. The T2 RegA protein coding sequence corresponds to positions 9839-10207 of the T4 regA sequence contained in the entry with accession number M10160. The complete sequence reported herein would correspond to positions 9804-10266. An RNA secondary structure is detected by primer extension reactions on mRNA templates at positions 407-424. An RNA nuclease sensitive site is noted at position 431. FEATURES from to/span description pept 36 404 translational repressor protein (RegA) BASE COUNT 169 a 66 c 88 g 139 t ORIGIN 1 gaaacaactc aaaaaactag cattggaatg gtaaaatgat tgaaattact cttaaaaaac 61 ctgaagattt tctgaaagta aaagaaactt tgactcgtat gggaattgct aataataaag 121 ataaagttct gtatcagtcc tgtcatattc ttcagaaaaa aggactatac tatatcgttc 181 attttaaaga aatgcttcgt atggatggtc gccaagttga aatgacagaa gaagatgaag 241 ttcgtcgtga ttcgattgca tggctattag aagattgggg actaattgaa atcgttcctg 301 gtcaaagaac ttttatgaaa gatttaacta ataacttccg agttatttct tttaaacaaa 361 aacatgaatg gaaactcgtt cctaaatata cgattggtaa ttaagcaagg ggcttcggcc 421 ccttatttgg agtataatat atcaagagcc taataactcg gg // LOCUS PT4G55G43 13062 bp ds-DNA PHG 04-AUG-1990 DEFINITION Bacteriophage T4 genes 55, alpha-gt, 47, 46, 45, 44, 62, regA, and 43. ACCESSION M10160 J02510 X00769 KEYWORDS DNA polymerase; DNA polymerase accessory protein; alpha-glucosyltransferase; exonuclease; polymerase; translational repressor. SOURCE Bacteriophage T4 DNA. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 7513 to 8269) AUTHORS Spicer,E.K., Noble,J.A., Nossal,N.G., Konigsberg,W.H. and Williams,K.R. TITLE Bacteriophage T4 gene 45: Sequences of the structural gene and its protein product JOURNAL J. Biol. Chem. 257, 8972-8979 (1982) STANDARD full staff_review REFERENCE 2 (bases 7513 to 10735) AUTHORS Spicer,E.K. and Konigsberg,W.H. TITLE ; JOURNAL (in) Mathews,C.K., Kutter,E.M., Mosig,G. and Berget,P.B. (Eds); Bacteriophage T4: 299-301; American Society for Microbiology, Washington, DC (1983) STANDARD full staff_review REFERENCE 3 (bases 9747 to 10265) AUTHORS Trojanowska,M., Miller,E.S., Karam,J., Stormo,G. and Gold,L. TITLE The bacteriophage T4 regA gene: Primary sequence of a translational repressor JOURNAL Nucleic Acids Res. 12, 5979-5993 (1984) STANDARD full staff_review REFERENCE 4 (bases 8257 to 9272) AUTHORS Spicer,E.K., Nossal,N.G. and Williams,K.R. TITLE Bacteriophage T4 gene 44 DNA polymerase accessory protein: Sequences of gene 44 and its protein product JOURNAL J. Biol. Chem. 259, 15425-15432 (1984) STANDARD full staff_review REFERENCE 5 (bases 1 to 7084) AUTHORS Gram,H. and Rueger,W. TITLE Genes 55, alpha-gt, 47, and 46 of bacteriophage T4: The genomic organization as deduced by sequence analysis JOURNAL EMBO J. 4, 257-264 (1985) STANDARD full staff_review REFERENCE 6 (bases 6985 to 7512) AUTHORS Hsu,T. and Karam,J. JOURNAL Unpublished (1985) Med U of South Carolina, Charleston, SC. STANDARD full staff_review REFERENCE 7 (bases 9781 to 10291) AUTHORS Adari,H.Y., Rose,K., Williams,K.R., Konigsberg,W.H., Lin,T.-C. and Spicer,E.K. TITLE Cloning, nucleotide sequence, and overexpression of the bacteriophage T4 regA gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1901-1905 (1985) STANDARD full staff_review REFERENCE 8 (bases 7513 to 12985; revises [1],[2],[4],[7]) AUTHORS Spicer,E.K. JOURNAL Unpublished (1986) Yale U School of Med, New Haven, CT. STANDARD full staff_review REFERENCE 9 (bases 7513 to 12985; revises [1],[2],[4],[7],[8]) AUTHORS Spicer,E.K. JOURNAL Unpublished (1987) Yale U School of Med, New Haven, CT. STANDARD full staff_review REFERENCE 10 (bases 10205 to 13062) AUTHORS Spicer,E.K., Rush,J., Fung,C., Reha-Krantz,L.J., Karam,J.D. and Konigsberg,W.H. TITLE Primary Structure of T4 DNA Polymerase: Evolutionary relatedness to eucaryotic and other procaryotic DNA polymerases JOURNAL J. Biol. Chem. 263, 7478-7486 (1988) STANDARD full staff_entry REFERENCE 11 (bases 9184 to 9903) AUTHORS Rush,J., Lin,T.-C., Quinones,M., Spicer,E.K., Douglas,I., Williams,K.R. and Konigsberg,W.H. TITLE The 44P subunit of the T4 DNA polymerase accessory protein complex catalyzes ATP hydrolase JOURNAL J. Biol. Chem. 264, 10943-10953 (1989) STANDARD full staff_entry REFERENCE 12 (bases 10212 to 10215) AUTHORS Andrake,M., Guild,N., Hsu,T., Gold,L., Tuerk,C. and Karam,J. TITLE DNA polymerase of bacteriophage T4 is an autogenous translational repressor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7942-7946 (1988) STANDARD full staff_entry COMMENT Gene 45 protein is required for both T4 DNA replication and T4 late gene transcription [1]. Gene 44 protein is a DNA polymerase accessory protein required for T4 DNA replication [4]. Gene 55 protein associates with the modified host RNA polymerase and is necessary for late RNA transcription [5]. Gene 43 encodes a T4 DNA polymerase [8]. [5] notes several open reading frames not included in the FEATURES table. A draft entry and computer-readable copy of the sequence in [8],[9] were kindly provided by E.K.Spicer 13-MAR-1986 and 27-JUL-1987. FEATURES from to/span description pept 25 582 g55 protein pept 2356 3558 alpha-glucosyltransferase (alpha-gt) pept 3735 4754 exonuclease g47 pept 5198 6880 exonuclease g46 (gtg start codon) pept 7579 8262 DNA polymerase accessory protein g45 pept 8313 9272 DNA polymerase accessory protein g44 pept 9274 9837 DNA polymerase accessory protein g62 pept 9839 10207 translation repressor protein (regA) pept 10286 12982 DNA polymerase (g43) revision 8707 8707 g in [8]; c in [2],[4] revision 8835 8835 c in [8]; g in [2],[4] revision 8950 8950 g in [8]; c in [2],[4] revision 9135 9136 cg in [8]; gc in [2],[4] revision 9484 9484 c in [8]; g in [2] revision 9929 9929 g in [2],[3],[8]; aaag in [7] revision 10212 10215 gggg in [12]; gg in revision 10551 10552 ct in [8]; tc in [2] revision 10807 10807 a in [9]; c in [8] revision 12597 12599 ttg in [9]; tg in [8] revision 12605 12607 aaa in [9]; aa in [8] revision 12609 12610 tt in [9]; t in [8] revision 12612 12613 tt in [9]; t in [8] revision 12618 12619 aa in [9]; a in [8] revision 12644 12646 aaa in [9]; aa in [8] BASE COUNT 4471 a 1954 c 2512 g 4125 t ORIGIN About 45.5 kb on genomic map. 1 tatgaattga gctaagaggt gaaaatgtca gaaactaagc ctaaatataa ttacgtaaac 61 aataaagagc ttttacaagc tattattgat tggaaaacag aattagcaaa taataaagac 121 ccaaataaag tagttcgtca gaatgatact atcggattag ccattatgct tattgcagaa 181 ggcttatcta aacgtttcaa cttttcagga tacacccagt cttggaaaca agaaatgatt 241 gcagatggta tagaagcttc tattaagggg cttcacaatt ttgatgaaac gaaatataaa 301 aacccacatg cgtatataac tcaagcttgt tttaatgcat tcgtccaacg tattaaaaaa 361 gaacgtaagg aagttgcaaa gaaatatagt tacttcgttc acaatgtcta tgacagccgt 421 gacgacgata tggttgcgtt agtagatgaa acttttattc aagacatcta tgataaaatg 481 acgcattacg aagaatcaac ctatagaaca ccgggggctg aaaagaaaag tgttgtagat 541 gattctccta gtttggattt tttatatgag gctaacgatt aacctctccg gattcttgga 601 agaaatacct gaagttgaag ctattcccta tttacttaaa atgtatctca gggaagtttt 661 agctcttgac attgatattg atccagaaaa tccgtatgat accgctttta aatctaatgg 721 tgtagaatta aactatcggt atcatttaac agatgatgat ttttatttta tattagagaa 781 ataatatgac tgataaaccc gaaattaatg atgaagtgga aaagcttatt tcttctattg 841 aagaaaagaa ccgtcttgaa gcagaaagaa aagcaaataa gttattgtct aaaaacaaac 901 gcgaactgaa tcgtctttat aagcacgctc agatcgcagc tgaaaataat aattttgctc 961 aatacgaata tgctatcaag aaaagtcggg atattctaaa acagccatat aacgatgaac 1021 tcatcagtat tctttggaag actactagat cgcagattga ggatatgatt gatgcttaca 1081 cacgtaaaat tcaagcgtct taaaattaat gcaggattta ctgaatcttt gaatggtcat 1141 ctttgcgtga aaatttctga aaaagaatac cgtgatagtt caatcaaaga agttaatcct 1201 cctattgtaa gagcagaccc taatatgaaa gtgtgggttg attcttatca agtcaaaaaa 1261 tggtggcaac tgtgaaagat gaacatccga cttctgaaat agattataat aaaattcgtt 1321 cctctagaga ggaaatgatg agacgcttta aagaggctca tgataaagct aaagcagaag 1381 gaactataac atataagcgc ataaaattta aaagttctaa cgagcctctg tatggcgtat 1441 tatgtggata ggagcttcgg ctcctatatt gctttataaa ttttttggta aaataaacta 1501 aaatgaatta tttaaacata tataatttac tgatcgaaaa aggactaaac agaggcaatg 1561 ataaaagtct tttgacatac tatactgaaa cacatcatat aattcctaga tgtatgggtg 1621 gtacagatga taaaactaat ttagttcttt tgacaccaga agaacatttt accgcgcatt 1681 tattattatt taaaatttat agacttccta aattggcttt agctattcgt atgatgtgtt 1741 atagttctga cggaacaaga ctaaataata aaatgtatgg atggattaaa acagctgtat 1801 cttcttcaat atcggaaagc atgaaagagt tctggaaaga tgatgataat aaaaaataca 1861 tgtctaacgc tagacgaaat gctggaaaac caatttatca atatgattta aatggaaatt 1921 ttattcgcaa atatagatgt attacagatg ctgcagaaga tatgtcatat agctgttcaa 1981 cttcaattaa acaatgtgta gatggtaaaa ggaaaaccgc tggaggattt caatggaaat 2041 attattattc tgataacata ggaaagccct ctaggatgtc taatgcaact aaacaaaaaa 2101 tgtctaaatc taaacgtggg ataactcaga aaagaaatgt gccagtattt caatatgata 2161 ctactggaaa attattacga gtatttccta gaattaaaga cgcagctgta tccgttaaag 2221 gatgcatgtc taatattaaa aagtgcatat ctggaaaaag caaaattgct tatggatatg 2281 tttgggccta ttcgtagcgt tttatagaaa ataaaatatt atttacatga tttattaaat 2341 gaaaagagga aaactatgcg tatttgcatt tttatggctc gaggtcttga aggttgtggt 2401 gtaacaaaat tctcactcga gcaacgtgat tggtttatta aaaatggtca tgaagtaact 2461 ttggtttatg ctaaagataa atcatttact cgtacaagtt ctcatgacca caaatcattt 2521 tcaattccag ttattttagc taaagaatac gataaagcac ttaagctagt aaatgattgt 2581 gatattctaa ttattaattc tgttcctgct acttccgttc aagaagctac gattaataac 2641 tataaaaaac ttttagataa tattaaacct tctattcgtg ttgtagttta tcagcatgat 2701 cattctgttc tttctttgcg tcgaaatttg ggattagaag aaactgttcg tcgagctgat 2761 gttattttta gccattctga taatggtgat tttaataaag ttctgatgaa agaatggtat 2821 ccagaaactg tttctctgtt tgatgatatt gaagaagcac cgacagtata taattttcag 2881 cctcctatgg atattgtgaa ggttcggtca acttattgga aagatgtttc tgaaattaac 2941 atgaatatca accgttggat tggtcgtacg actacatgga aaggttttta ccagatgttt 3001 gattttcatg aaaaattctt aaaacctgct ggtaaatcca ctgtaatgga aggtctggaa 3061 cgttcccctg cttttattgc aattaaggaa aaaggtattc cgtatgaata ttacggtaat 3121 cgtgagattg ataaaatgaa tctcgcgccg aatcaaccgg cacaaatcct agattgttat 3181 attaatagtg aaatgcttga acgaatgagt aaatctggct ttggatatca gttgagtaaa 3241 cttaaccaga aatacttaca acgctcactc gaatatactc atctcgagct tggtgcatgt 3301 ggaacaattc cggtattttg gaaatctact ggcgaaaatt taaaattccg tgttgataat 3361 actcctttga cctcgcatga tagcggtatc atttggtttg atgaaaatga tatggaatca 3421 acatttgaac gtattaaaga actgtcatct gaccgagctc tttatgaccg tgagcgagaa 3481 aaagcatatg aatttttgta tcagcatcaa gattcaagct tctgctttaa agaacagttt 3541 gacattatta caaaataaag ggcttcggcc ctttagcttt atacggagtt tgatataatg 3601 atatttcttg gatatgtgat actttttctt gcattttatc tattcactag agcatgttgg 3661 attgggttct ttagcactcc agatggattt atttcaataa ttttattttg catttcaatg 3721 acggttcttg atatatgaaa attttaaatt taggtgattg gcatttaggc gttaaagctg 3781 atgatgagtg gattcgcggt attcaaattg atggaattaa gcaagcgata gaatattcta 3841 agaaaaatgg aattactacc tggattcaat acggtgatat ttttgatgtg cgaaaagcga 3901 tcacacataa aactatggag tttgcccgtg aaatagttca aacgcttgat gatgctggaa 3961 ttacattaca cactattgta ggaaatcacg atctccacta taaaaatgta atgcatccaa 4021 atgcttctac tgagcttttg gctaaatatc ctaatgttaa agtgtatgat aagcctacta 4081 cagtagattt tgacgggtgt ttgattgatt tgattccttg gatgtgcgaa gaaaatactg 4141 gtgaaattct tgagcatatc aaaacttcat ctgcttcttt ttgtgttggt cactgggaac 4201 tgaatggatt ttatttttat aaaggaatga aatctcatgg tcttgaaccc gatttcctta 4261 agacttataa agaagtgtgg tctggtcact tccatactat ttctgaggct gcaaacgtca 4321 gatatattgg gacaccatgg acactaaccg caggtgacga gaatgaccct cgtgggttct 4381 ggatgtttga tacagaaaca gaacgaacgg aatttattcc aaacaatact acctggcatc 4441 gtagaattca ttatccattt aaaggaaaaa ttgactataa agattttaca aatctatcag 4501 tacgtgttat agtaactgaa gtagacaaaa atctgacgaa gttcgaatct gaactagaaa 4561 aagttgtgca ttcattacga gttgtgtcaa agattgataa ctctgtcgag tcagatgaca 4621 gtgaagaagt tgaagttcaa tcgcttcaga cattaatgga agagtatatt aatgcaattc 4681 cagacatcac tgattctgac cgtgaagcac ttattcaata tgcaaatcag ctatatgtag 4741 aggcaacaca atgacttttg atgaatttaa aaatgttatg atgagtcagc attttaaatg 4801 tgaagtaaaa gatgatattg gtcataaaga aattattgaa tactggtttg aaccgttaga 4861 ggttgaagat aattgtatta aaaaggttac tgtctgcact gactgggctg tatcttttaa 4921 cttcaacatt ttagataatg acacacctaa atcattacga gatatggctg tatcttgtat 4981 taaggatgca tactgtgaag ttttcgacat ttgacattaa tgatgaattc atagcaaata 5041 ttgattatac cgaagaagat tctagatatg ttggaataat ttatatcaca tcaaaaacag 5101 cacaaggcgt tgtttgcatg gctgaatttg atgaatactt tttagattat gatgatatga 5161 tagaatggtc taaaagatac attaaaagga atcttttgtg aagaatttta aacttaatag 5221 agtaaaatat aaaaatataa tgtcagttgg tcaaaatggg attgacattc aattagataa 5281 ggttcaaaaa actcttatta ctggacgaaa tggcggtggt aagtctacta tgctagaagc 5341 catcacattt gggctttttg gtaagccatt tcgcgatgta aagaaaggtc aattaataaa 5401 cagcacaaat aagaaagaac ttttagttga actgtggatg gaatatgatg agaaaaagta 5461 ctatatcaaa agaggacaaa aaccaaacgt tttcgaaatc accgttaacg gtacacgtct 5521 taatgaatct gccagcagta aagatttcca agcagaattt gaacagctta tcggaatgtc 5581 atatgccagt ttcaagcaga ttgttgtcct tggtacagca gggtataccc ctttcatggg 5641 tttgtcgacc cctgcgcgaa gaaagcttgt ggaagacctg cttgaggtag gaacattagc 5701 tgaaatggat aagcttaata aagcactaat acgtgaatta aattcgcaga accaagtgct 5761 tgatgttaaa aaagatagta ttatccaaca aattaaaata tataatgata atgttgaacg 5821 ccagaaaaaa ttaacgggtg acaaccttac tcgtctacaa aatatgtatg atgatttggc 5881 aaaagaagct agaacgctaa aatcggaaat agaagaagct aacgaaagat tagttaatat 5941 tgttttggac gaagacccga ctgatgcatt taataaaatc ggtcaagaag catttttaat 6001 taaatcaaaa attgactcgt ataataaagt cattaatatg tatcacgaag gtggattatg 6061 tccaacctgt ttgtcacaat tgagttccgg tgataaagtt gtttctaaaa ttaaagataa 6121 agtttctgaa tgcacacatt cgtttgaaca gctttcaaca catcgtgata atttaaaagt 6181 tcttgttgat gaataccgag ataatattaa aacccagcag tcgttggcaa atgatattcg 6241 caataaaaag caatctctga tcgcagcagt agataaagct aaaaaagtta aagcggctat 6301 agaaaaagca tcttctgagt ttattgacca tgctgatgaa atagcactgc ttcaagaaga 6361 acttgataaa attgttaaga caaaaactaa tttagtaatg gaaaaatacc accgaggaat 6421 tttgactgat atgctcaaag attctggtat taaaggtgct attattaaaa agtacattcc 6481 attatttaat aagcagatta accattatct taaaataatg gaagcggatt atgtgtttac 6541 attagatgaa gaatttaatg agacaattaa atcccgtggt cgtgaagatt ttagttatgc 6601 ttcatttagt gaaggtgaaa aggcacgaat cgatattgct cttttattta cttggcgtga 6661 tattgcttct atcgtatctg gtgttagtat tagtacatta attcttgatg aagtgtttga 6721 tgggtcattt gatgccgaag gtattaaagg tgtagctaat attataaatt caatgaaaaa 6781 cactaacgtt tttataattt cgcataaaga ccatgacccg caagaatatg gtcagcatct 6841 tcaaatgaag aaagttggtc gatttactgt aatggtttaa tttataagag attatgcttt 6901 aatttattag agtataatct ctatggagga aaaacatgga atattcaact ggacagcatc 6961 tattaactat tcctgaaata aaacgatata ttctgagaaa taatttttct aatgaagagc 7021 atatagttac tgaatctatg cttaggaatg catttaaagc agaatataca aaaataatgt 7081 ccaatagaaa tgaagcttgg actgttactg attattatga ctaaaggtgt attatgacta 7141 aaattactgt gaattatact gttgatgtaa aagatattca gccaaaacac gtgcgttctg 7201 aatcaaatcc acaaaaccaa aataaaattc gtcgagcatg tgttttgtct ctttctgata 7261 acgcaatgga agttattcag aacaaaatta aatctgcacc tgctcgtcat gcgtattatg 7321 aagctatcga tcgtgaagta agtaataaat ggattgaact aatgcgcaaa catactacag 7381 aatccctaaa cgccggtgct aaatttatta tgacttcatg tggtgaaacg ccttgaagat 7441 gattattgcg gtaatgcaga tgaacgtcta attgttgctg ctcaaattgt tgcggaaaca 7501 attgcggctg attttaacgt taattgcttt attaaattag ttataaaatt aaatctcatt 7561 tgaattgaag gaaattacat gaaactgtct aaagatacta ctgctctgct taaaaatttc 7621 gctactatta actctggtat tatgcttaaa tccggtcaat ttattatgac tcgcgcagtt 7681 aatggtacaa cttatgcgga agcaaatatt tctgacgtta ttgattttga tgtagcaatt 7741 tacgatttga acggttttct cggtattctg tctttagtta atgatgcaga aatttcccag 7801 tcagaagatg gaaatattaa aattgctgat gcccgctcaa caattttttg gcgagcagcc 7861 gatccgagta cagatgttgc tcctaataaa ccaattccat tcccggtagc atctgctgtt 7921 actgaaatta aagctgaaga ccttcaacag ctgttgcgtc tatctcgtgg tctgcaaatt 7981 gatacaattg ctatcacggt aaaagaaggt aaaatcgtaa ttaacggttt taataaagta 8041 gaagattctg ctctgacccg tgttaaatat tctttgactc ttggtgatta tgatggtgaa 8101 aatacattta atttcattat caatatggca aatatgaaaa tgcaaccagg aaattataaa 8161 cttctgcttt gggcaaaagg taaacaaggt gctgctaaat ttgaaggtga acacgcgaat 8221 tatgtggtag ctcttgaagc tgattctacc cacgattttt aatagagggc ttcggccttt 8281 ataatttaca ctaaaacttg aatgaggaaa ttatgattac tgtaaatgaa aaagaacaca 8341 ttcttgaaca gaaatatcgt ccatctacta tcgatgaatg tattcttccc gcttttgata 8401 aagaaacctt taaatctatt acaagtaaag gtaagattcc acatattatt cttcattctc 8461 cttctccagg aacaggtaaa acaactgtag caaaagcatt atgtcatgat gtaaatgctg 8521 atatgatgtt tgtgaatggg tcagattgta aaattgattt cgttcgtggt cctttgacta 8581 attttgccag cgccgcttca tttgatggtc gtcaaaaagt aatcgttatt gatgaatttg 8641 accgttcagg gttagcagag tctcagcgac atcttcgttc ctttatggaa gcttatagtt 8701 caaactgtag tattattatt actgctaata atattgatgg tattattaaa ccgcttcagt 8761 cacgctgccg agttattaca ttcggtcaac caactgatga agataaaatt gaaatgatga 8821 agcagatgat tcgtcgattg actgaaatct gcaagcatga aggaattgct atagctgata 8881 tgaaagttgt agcagctttg gttaaaaaga attttcctga ttttcgtaaa actattggcg 8941 agctcgatag ttattcgtct aaaggtgttt tggatgctgg tattttatca ctggttacta 9001 acgatcgtgg tgctattgat gatgttcttg agtctctcaa aaataaagat gttaaacaac 9061 tcagagcttt agcaccaaaa tatgcggctg attattcgtg gttcgtgggt aaacttgccg 9121 aagaaatcta ttcacgtgta actccacaaa gtattattcg tatgtacgaa attgtcggcg 9181 aaaataatca gtatcatggt attgcagcta atactgaatt gcatttagct tatcttttca 9241 ttcaattagc atgcgaaatg cagtggaagt gatatgagct tatttaaaga tgatattcaa 9301 ttaaacgagc atcaagttgc ttggtattca aaagattgga cagctgtcca atccgctgct 9361 gattctttta aagaaaaagc agaaaatgaa ttttttgaaa taattggagc tattaataat 9421 aaaactaaat gctctattgc tcaaaaagat tattcaaaat tcatggttga aaatgcatta 9481 tcacaatttc cagagtgtat gccagctgta tatgctatga atttaattgg atcaggctta 9541 agtgatgaag ctcattttaa ttatctaatg gctgcagttc ctcgtggtaa aagatatggt 9601 aaatgggcaa aactggttga agattccacc gaagtattga ttattaagtt acttgctaag 9661 cggtatcaag ttaatacaaa tgatgcaatt aactataaat caattcttac taaaaatgga 9721 aaacttcctt tagtattaaa agaactaaaa ggtttagtca cggatgattt tttgaaagaa 9781 gtgactaaga acgtaaaaga acagaaacaa ctcaaaaaac tagcattgga atggtaaaat 9841 gattgaaatt actcttaaaa aacctgaaga ttttctgaaa gtaaaagaaa ctttgactcg 9901 tatgggaatt gctaataata aagataaagt tctatatcag tcctgtcata ttcttcagaa 9961 aaaaggacta tactatatcg ttcattttaa agaaatgctt cgtatggatg gtcgtcaagt 10021 tgaaatgaca gaagaagatg aagttcgtcg tgattcgatt gcatggctat tagaagattg 10081 gggactaatt gaaatcgttc ctggtcaaag aacttttatg aaagatttaa ctaataactt 10141 ccgagttatt tcttttaaac aaaaacatga atggaagctc gttcctaaat atacgattgg 10201 taattaagca aggggcttcg gccccttatt tggagtataa tatatcaaga gcctaataac 10261 tcgggctata aactaaggaa tatctatgaa agaattttat atctctattg aaacagtcgg 10321 aaataacatt gttgaacgtt atattgatga aaatggaaag gaacgtaccc gtgaagtaga 10381 atatcttcca actatgttta ggcattgtaa ggaagagtca aaatacaaag acatctatgg 10441 taaaaactgc gctcctcaaa aatttccatc aatgaaagat gctcgagatt ggatgaagcg 10501 aatggaagac atcggtctcg aagctctcgg tatgaacgat tttaaactcg cttatataag 10561 tgatacatat ggttcagaaa ttgtttatga ccgaaaattt gttcgtgtag ctaactgtga 10621 cattgaggtt actggtgata aatttcctga cccaatgaaa gcagaatatg aaattgatgc 10681 tatcactcat tacgattcaa ttgacgatcg tttttatgtt ttcgaccttt tgaattcaat 10741 gtacggttca gtatcaaaat gggatgcaaa gttagctgct aagcttgact gtgaaggtgg 10801 tgatgaagtt cctcaagaaa ttcttgaccg agtaatttat atgccattcg ataatgagcg 10861 tgatatgctc atggaatata tcaatctttg ggaacagaaa cgacctgcta tttttactgg 10921 ttggaatatt gaggggtttg acgttccgta tatcatgaat cgtgttaaaa tgattctggg 10981 tgaacgtagt atgaaacgtt tctctccaat cggtcgggta aaatctaaac taattcaaaa 11041 tatgtacggt agcaaagaaa tttattctat tgatggcgta tctattcttg attatttaga 11101 tttgtacaag aaattcgctt ttactaattt gccgtcattc tctttggaat cagttgctca 11161 acatgaaacc aaaaaaggta aattaccata cgacggtcct attaataaac ttcgtgagac 11221 taatcatcaa cgatacatta gttataacat cattgacgta gaatcagttc aagcaatcga 11281 taaaattcgt gggtttatcg atctagtttt aagtatgtct tattacgcta aaatgccttt 11341 ttctggtgta atgagtccta ttaaaacttg ggatgctatt atttttaact cattgaaagg 11401 tgaacataag gttattcctc aacaaggttc gcacgttaaa cagagttttc cgggtgcatt 11461 tgtgtttgaa cctaaaccaa ttgcacgtcg atacattatg agttttgact tgacgtctct 11521 gtatccgagc attattcgcc aggttaacat tagtcctgaa actattcgtg gtcagtttaa 11581 agttcatcca attcatgaat atatcgcagg aacagctcct aaaccgagtg atgaatattc 11641 ttgttctccg aatggatgga tgtatgataa acatcaagaa ggtatcattc caaaggaaat 11701 cgctaaagta tttttccagc gtaaagactg gaaaaagaaa atgttcgctg aagaaatgaa 11761 tgccgaagct attaaaaaga ttattatgaa aggcgcaggg tcttgttcaa ctaaaccaga 11821 agttgaacga tatgttaagt tcagtgatga tttcttaaat gaactatcga attacaccga 11881 atctgttctc aatagtctga ttgaagaatg tgaaaaagca gctacacttg ctaatacaaa 11941 tcagctgaac cgtaaaattc tcattaacag tctttatggt gctcttggta atattcattt 12001 ccgttactat gatttgcgaa atgctactgc tatcacaatt ttcggccaag tcggtattca 12061 gtggattgct cgtaaaatta atgaatatct gaataaagta tgcggaacta atgatgaaga 12121 tttcattgca gcaggtgata ctgattcggt atatgtttgc gtagataaag ttattgaaaa 12181 agttggtctt gaccgattca aagagcagaa cgatttggtt gaattcatga atcagttcgg 12241 taagaaaaag atggaaccta tgattgatgt tgcatatcgt gagttatgtg attatatgaa 12301 taaccgcgag catctgatgc atatggaccg tgaagctatt tcttgccctc cgcttggttc 12361 aaagggcgtt ggtggatttt ggaaagcgaa aaagcgttat gctctgaacg tttatgatat 12421 ggaagataag cgatttgctg aaccgcatct aaaaatcatg ggtatggaaa ctcagcagag 12481 ttcaacacca aaagcagtgc aagaagctct cgaagaaagt attcgtcgta ttcttcagga 12541 aggtgaagag tctgtccaag aatactacaa gaacttcgag aaagaatatc gtcaacttga 12601 ctataaagtt attgctgaag taaaaactgc gaacgatata gcgaaatatg atgataaagg 12661 ttggccagga tttaaatgcc cgttccatat tcgtggtgtg ctaacttatc gtcgagctgt 12721 tagcggttta ggtgtagctc caattttgga tggaaataaa gtaatggttc ttccattacg 12781 tgaaggaaat ccatttggtg acaagtgcat tgcttggcca tcgggtacag aacttccaaa 12841 agaaattcgt tctgatgtgc tatcttggat tgaccactca actttgttcc aaaaatcgtt 12901 tgttaaaccg cttgcgggta tgtgtgaatc ggctggcatg gactatgaag aaaaagcttc 12961 gttagacttc ctgtttggct gatagaataa atctaggacc tccaggtccc tttttcatac 13021 aagtaatata aatctatact tatgaaaaag agatgattct gg // LOCUS PT6REGA 462 bp ss-mRNA PHG 04-AUG-1990 DEFINITION Bacteriophage T6 translational repressor protein (regA), complete cds. ACCESSION M27738 KEYWORDS RNA binding protein; translational repressor. SOURCE Bacteriophage T6, cDNA to viral RNA. ORGANISM Bacteriophage T6 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 462) AUTHORS Miller,E.S. and Jozwik,C.E. TITLE Sequence analysis of conserved regA and variable orf43.1 genes in T4-like bacteriophages JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by E.S. Miller, 08-SEP-1989. The T2 RegA protein coding sequence corresponds to positions 9839-10207 of the T4 regA sequence contained in the entry with accession number M10160. The complete sequence reported herein would correspond to positions 9804-10266. An RNA secondary structure is detected by primer extension reactions on mRNA templates at positions 407-424. An RNA nuclease sensitive site is noted at position 431. Author address: E.S.Miller Dept. of Microbiology Box 7615 North Carolina State University Raleigh, NC 27695 E-mail: esm@cgc.ncsu.edu FEATURES from to/span description pept 36 404 translational repressor protein (regA) site 431 431 RNA endonuclease site BASE COUNT 170 a 64 c 89 g 139 t ORIGIN 1 gaaacaactc aaaaaactag cattggaatg gtaaaatgat tgaaattact cttaaaaaac 61 ctgaagattt tctgaaagta aaagaaactt tgactcgtat gggaattgct aataataaag 121 ataaagttct gtatcagtcc tgtcatattc ttcagaaaaa aggattatac tatatcgttc 181 attttaaaga aatgcttcgt atggatggtc gccaagttga aatgacagaa gaagatgaag 241 ttcgtcgtga ttcaatagca tggttgttag aagattgggg actgattgaa atcgtacctg 301 gtcaaagaac ttttatgaaa gatttaacta ataacttccg agttatttct tttaaacaaa 361 aacatgaatg gaaactcgtt cctaaatata cgattggtaa ttaagcaagg ggcttcggcc 421 ccttatttgg agtataatat atcaagagcc taataactcg gg // LOCUS RB18REGA 674 bp ds-DNA PHG 04-AUG-1990 DEFINITION Bacteriophage RB18 translational repressor protein (regA) and Orf43.1, complete cds. ACCESSION M33533 M27739 KEYWORDS RNA binding protein; translational repressor. SOURCE Bacteriophage RB18, cDNA to viral mRNA. ORGANISM Bacteriophage RB18 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 674) AUTHORS Miller,E.S. and Jozwik,C.E. TITLE Sequence analysis of conserved regA and variable orf43.1 genes in T4-like bacteriophages JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by E.S.Miller, 03-APR-1990. The T2 RegA protein coding sequence corresponds to positions 9839-10207 of the T4 regA sequence contained in the entry with accession number M10160. The complete sequence reported herein would correspond to positions 9815-10255. Author address: E.S.Miller Dept. of Microbiology Box 7615 North Carolina State University Raleigh, NC 27695 E-mail: esm@cgc.ncsu.edu FEATURES from to/span description pept 25 393 translational repressor protein (regA) pept 395 616 Orf43.1 peptide site 643 643 RegB endonuclease site BASE COUNT 251 a 106 c 115 g 202 t ORIGIN 1 aaaaactagc attggaatgg taaaatgatt gaaattactc ttaaaaaacc tgaagatttt 61 ctgaaagtaa aagaaacttt gactcgtatg ggaattgcta ataataaaga taaagttctg 121 tatcagtcct gtcatattct tcagaaaaaa ggactatact atatcgttca ttttaaagaa 181 atgcttcgta tggatggtcg ccaggttgaa atgacagaag aagatgaagt tcgtcgtgat 241 tcgattgcat ggctattaga agattgggga ctgattgaaa tcgtccctgg tcaaagaact 301 tttatgaaag atttaactaa taacttccga gttatttctt ttaaacaaaa acatgaatgg 361 aaactcgttc ctaaatatac gattggtaat taatatgact gctataactc cacaagaata 421 catggcgtct cttaaagaaa aatataatct ttctgcaaca gaaacacttt ttgatttacc 481 agaaaacctc caactaaaat ttcaggtaga atttcaaaaa ttaattcatc cagaacaaaa 541 acactttact gcagtcgtta agtcaattaa tgcagatgga atgacaattt ttcaccgaca 601 aatagtacta atttaagcaa ggggcttcgg ccccttattt ggagtataat atatcaagag 661 cctaataact cggg // LOCUS RB51REGA 700 bp ss-mRNA PHG 04-AUG-1990 DEFINITION Bacteriophage RB51 translational repressor protein (regA) and Orf43.1, complete cds. ACCESSION M33532 KEYWORDS RNA binding protein; translational repressor. SOURCE Bacteriophage RB51, cDNA to viral mRNA. ORGANISM Bacteriophage RB51 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 700) AUTHORS Miller,E.S. and Jozwik,C.E. TITLE Sequence analysis of conserved regA and variable orf43.1 genes in T4-like bacteriophages JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by E.S.Miller, 03-APR-1990. The T2 RegA protein coding sequence corresponds to positions 9839-10207 of the T4 regA sequence contained in the entry with accession number M10160. The complete sequence reported herein would correspond to positions 9815-10255. Author address: E.S.Miller Dept. of Microbiology Box 7615 North Carolina State University Raleigh, NC 27695 E-mail: esm@cgc.ncsu.edu FEATURES from to/span description pept 25 393 translational repressor protein (regA) pept 404 640 Orf43.1 peptide site 671 671 RegB endonuclease site BASE COUNT 255 a 114 c 125 g 206 t ORIGIN 1 aaaaactagc attggaatgg taaaatgatt gaaattactc ttaaaaaacc tgaagatttt 61 ctgaaagtaa aagaaacttt gactcgtatg ggaattgcta ataataaaga taaagttctg 121 tatcagtcct gtcatattct tcagaaaaaa ggactatact atatcgttca ttttaaagaa 181 atgcttcgta tggatggccg tcaagttgaa atgacagaag aagatgaagt tcgtcgtgat 241 tcgattgcat ggctgttaga agattgggga ctgattgaaa tcgttcctgg tcaaagaact 301 tttatgaaag atttaactaa taacttccga gttatttctt ttaaacaaaa acatgaatgg 361 aaactcgttc ctaaatatac gattggtaat taaggtaaat tatatgacac aatcagagat 421 tcagtttacc gaagcttttc taatattcga aaatctaaaa caaacatatg gtctgtcttc 481 tacccaactg gtttttgacc tcccggaaga taaacaaata gaattccaaa aagaattcca 541 acgattagtt tacccacgcc aacgcgcatt tcatgcgatt atcaaaacca caaataaaga 601 tggcaagtcc gttatatcgc gttgtacaat agaaatttaa agcaaagggg cttcggcccc 661 ttatttggag tataatatat caagagccta ataactcggg // LOCUS HUMQBPC1 235 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein gene, 5' flank with an LTR-like sequence. ACCESSION M35760 J05438 KEYWORDS ubiquinone-binding protein. SEGMENT 1 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 235) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Common protein binding sites in the 5'-flanking regions of human genes for cytochrome c1 and for ubiquinone-binding protein JOURNAL J. Biol. Chem. 265, 8159-8163 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.Suzuki, 08-AUG-1989. FEATURES from to/span description site 26 33 Mt3 element site 103 110 Mt1 element site 114 122 Mt4 element BASE COUNT 52 a 59 c 56 g 68 t ORIGIN 1 tagccaaccg ggttcggatt gtgcagtcca actccagcca atggagtcag gacacagtag 61 cagggacaag ctgcgttaga cataaaaacc tctgctttcc tttgtttagg gtgctctcgt 121 ggcaaccaga cttaccagga gctctattct gcaaaagtaa atttgccttg ctgagagacc 181 ctttgtcctt tggctcagtg ttggttcttc tttgcagcac cgagcatttg tttcc // LOCUS HUMQBPC2 1567 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human ubiquinone-binding protein (QP) gene, exon 1. ACCESSION M35761 M26705 J05438 KEYWORDS ubiquinone-binding protein. SEGMENT 2 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1567) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Common protein-binding sites in the 5'-flanking regions of human genes for cytochrome c-1 and ubiquinone-binding protein JOURNAL J. Biol. Chem. 265, 8159-8163 (1990) STANDARD full staff_review REFERENCE 2 (bases 888 to 1232) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.Suzuki, 08-AUG-1989. FEATURES from to/span description pept 1151 + 1169 ubiquinone-binding protein (QP), exon 1 IVS 1170 > 1567 ubiquinone-binding protein intron A site 425 432 Mt3 element site 525 533 Mt1 element site 559 567 Mt4 element BASE COUNT 442 a 339 c 339 g 447 t ORIGIN 1 cttcctgatt agtagtgtag cagtatagca tatgacccta gaattaagag ccccaaattc 61 gatctggatt ttcagattct ggaagaagaa cctggactct tctagtcatg tgctcttgga 121 caagttattc aacctttctg agcctcagct ttctcctctc tacaatgaga taattacagt 181 acttttagaa ttaaaagata gtttagggga gtgagagatc acctccaacc ctccctcccc 241 tttgctctta tgtgggctct ttggtgaatc taggaaccaa attgacacat ggcagattaa 301 caagagaaaa gcatactaat tttactagtt ttacatgtac atagggatcc gcacaaaaga 361 gtgaagtggc caatgtaaga tgcctttata ctttttacac gaactacaaa tcagaagaaa 421 ggggatctgg ctaggggcag taaatttctc gcggagtcat taggagaaaa gtgtgtgtgt 481 tggggtgtaa acttagtgga agataacggt tacttcggaa agtatattca ggtccaaagc 541 agcccccagt ttcgtctctg gtgataggtg ctattttctc cctctggtaa gggacgggta 601 ctccagagga atcctggagg aagctttatg cgcttgctgc atgcaggaag agataggtca 661 tcaaaccttt tctgaaacta caatttctcc aatattttca actcgaaata atcaatgtac 721 caattccata tatttttggg atggcacatc catcagtcct tcaatagaaa gtaagacgat 781 tgcccttaaa acatttagct tcctgcctgg cacaaagagc tgaattaata ctgtccacgc 841 tcccatagct attaacagca aataacggga acccagcttt aagccttggt ctcctaattc 901 ctgacctgtg ttcttcccat cacatcacaa ccaatacttc ttagaatgta gcacgcaaaa 961 tgccccgcct actccattgg cctcaccctg gccgagaggt gcttcagcag gtgctgaacg 1021 gcagtcgtca gaactgcgcc tgcgcaagcg gcctttctct gttcgcgatg tgacgtaacg 1081 cgcctgcgga ctgggcccag cttgtcctct atgacttacc cagaaggcaa cgcttctctt 1141 tctggtcaaa atggctggta agcaggccgg taagtaactg ggggtcttct tggccggcat 1201 cgtcgcgctt caccgtttat ttttgcctgc aggtcaacgg atcacaggaa ataattgtaa 1261 acttttcatt tcaagagagc tggagttgta gggaaaagac agtgagattt cagcatagac 1321 tggcaggtct aaatgatcct catcctctaa aatttttact cgatattaca gttatttcat 1381 ttggccacat tctttaaaaa tgtatgtatt ttggggagga ttttgatacc attagtcatt 1441 taaacattaa tatggattcg taaaagaagt ccatttacct cctcctaagc aactaagttc 1501 ccccaaggga atggatattg aaaaaggatg agaaataggt aattttaaaa ctgtttgggg 1561 aacaggg // LOCUS HUMQBPC3 236 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein gene, intron A with an LTR-like sequence. ACCESSION M33945 J05438 KEYWORDS ubiquinone-binding protein. SEGMENT 3 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 236) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Common protein binding sites in the 5'-flanking regions of human genes for cytochrome c1 and for ubiquinone-binding protein JOURNAL J. Biol. Chem. 265, 8159-8163 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by H.Suzuki, 08-AUG-1989. FEATURES from to/span description site 26 33 Mt3 element site 103 110 Mt1 element site 114 122 Mt4 element BASE COUNT 57 a 55 c 54 g 70 t ORIGIN 1 aaccgggttc agcttagatt gtacggtcca actctagtca atggagtcag gatacagtag 61 cagggacaag ctgcgttaga cataaaaacc tctgctttcc tttgttcggg gtgctctcgt 121 ggcaaccgaa cctatgagga gcaaccttat gcaaaagtaa atttgccttg ctgagagacc 181 ctttgtcctt tgtctcagtg ctagttcttc tttgcagcac caagcatttg ttttca // LOCUS HUMQBPC4 114 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (QP) gene, exon 2. ACCESSION M26706 KEYWORDS ubiquinone-binding protein cytochrome bc1 complex subunit QP. SEGMENT 4 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 114) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 23 + 94 ubiquinone-binding protein (QP), exon 2 IVS < 1 22 ubiquinone-binding protein intron A IVS 95 > 114 ubiquinone-binding protein intron B BASE COUNT 34 a 17 c 29 g 34 t ORIGIN 1 ttaacagtgt gacaattatc agtttcagca tcaggcaagt ggctggatgg tattcgaaaa 61 tggtattaca atgctgcagg attcaataaa ctgggtaagc agctgtgctg ttta // LOCUS HUMQBPC5 208 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (QP) gene, exon 3. ACCESSION M26707 KEYWORDS QP gene; ubiquinone-binding protein. SEGMENT 5 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 208) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 21 + 188 ubiquinone-binding protein (QP), exon 3 IVS < 1 20 ubiquinone-binding protein intron B IVS 189 > 208 ubiquinone-binding protein intron C BASE COUNT 73 a 36 c 48 g 51 t ORIGIN 1 aactttgtca tctatcatag ggttaatgcg agatgataca atatacgagg atgaagatgt 61 aaaagaagcc ataagaagac ttcctgagaa cctttataat gacaggatgt ttcgcattaa 121 gagggcactg gacctgaact tgaagcatca gatcttgcct aaagagcagt ggaccaaata 181 tgaagagggt agcacagctt attactcc // LOCUS HUMQBPC6 233 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (QP) gene, exon 4. ACCESSION M26730 KEYWORDS QP gene; ubiquinone-binding protein. SEGMENT 6 of 6 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 233) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 21 97 ubiquinone-binding protein (QP), exon 4 IVS < 1 20 ubiquinone-binding protein intron C BASE COUNT 82 a 23 c 49 g 79 t ORIGIN 1 atatttcatt cattctttag aaaatttcta ccttgaaccg tatctgaaag aggttattcg 61 ggaaagaaaa gaaagagaag aatgggcaaa gaagtaatca tgtagttgaa gtctgtggat 121 gcagctgtta tgaagatggt taaacttgaa acaaacaatt ttaagaatta tttggtctga 181 agatgtttta ctttaaataa atgtctattg taatggctgg agtttttgaa ttc // LOCUS HUMQBPCA 498 bp ss-mRNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein mRNA, complete cds. ACCESSION M26700 KEYWORDS ubiquinone-binding protein. SOURCE Human fibroblast SV40 transformed cell line GM637, cDNA to mRNA, clones 2 and 6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 498) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept 53 388 ubiquinone-binding protein BASE COUNT 172 a 72 c 122 g 132 t ORIGIN 1 agcttgtcct ctatgactta cccagaaggc aacgcttctc tttctggtca aaatggctgg 61 taagcagggc gtttcagcat caggcaagtg gctggatggt attcgaaaat ggtattacaa 121 tgctgcagga ttcaataaac tggggttaat gcgagatgat acaatatacg aggatgaaga 181 tgtaaaagaa gccataagaa gacttcctga gaacctttat aatgacagga tgtttcgcat 241 taagagggca ctggacctga acttgaagca tcagatcttg cctaaagagc agtggaccaa 301 atatgaagag gaaaatttct accttgaacc gtatctgaaa gaggttattc gggaaagaaa 361 agaaagagaa gaatgggcaa agaagtaatc atgtagttga agtctgtgga tgcaggtgtt 421 atgaagatgg ttaaacttga aacaaacaat tttaagaatt atttggtctg aagatgtttt 481 actttaaata aatgtcta // LOCUS HUMQBPIC 75 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPI) gene, exon 2. ACCESSION M37387 KEYWORDS ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 75) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept / 3 / 75 ubiquinone-binding protein (HQP1), exon 2 (AA at 5) BASE COUNT 22 a 11 c 21 g 21 t ORIGIN 1 agtttcagca tcggcaagtg gctggatggt attcgaaaat ggtattacaa tgctgcagga 61 ttcaataaac tgggt // LOCUS HUMQBPIIC 172 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPII) gene, complete cds. ACCESSION M26701 KEYWORDS ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 172) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review COMMENT Intron A starts after nucleotide 98. FEATURES from to/span description pept 80 / 170 ubiquinone-binding protein (HQPII), exon 1 IVS 171 > 172 HQPII intron A site 98 99 site of 2.3 kb intron BASE COUNT 42 a 40 c 47 g 43 t ORIGIN 1 gactgaacgc gcctgcggac tgggcccagc ttgtcctcta tgacttaccc agaaggcaac 61 gcttctcttt ctggtcaaaa tggctggtaa gcaggccgtt tcagcatcag gcaagtggct 121 ggatggtatt cgaaaatggt attacaatgc tgcaggattc aataaactgg gt // LOCUS HUMQBPIIIC 100 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPIII) gene, complete cds. ACCESSION M26702 KEYWORDS ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept 80 / 98 ubiquinone-binding protein (HQPIII), exon 1 IVS 99 > 99 HQPIII intron BASE COUNT 20 a 30 c 27 g 23 t ORIGIN 1 gactgaacgc gcctgcggac tgggcccagc ttgtcctcta tgacttaccc acaaggcaac 61 gcttctcttt ctggtcaaaa tggctggtaa gcaggccggt // LOCUS HUMQBPIVC 465 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPIV) pseudogene, complete cds. ACCESSION M26703 KEYWORDS pseudogene; ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 465) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept.ps 80 392 pseudo-ubiquinone-binding protein (HQPIV), exon 1 BASE COUNT 180 a 66 c 110 g 109 t ORIGIN 1 aaaataaaat aaaataaatt agataaataa ataaataaaa agagagaggg agaaggcaac 61 atttcacgtt tgggtcaaga tggctggcag gctggctacc gcagcatcag acaggtagct 121 ggatggtatt tgaaaatggt atgacaatgc tgcaggattc gagaaactaa agtaaatgtg 181 agatgagaca atatatgaag atgaagatgt aaaggaagcc ataagacggc tttctaacgg 241 caggatgttt cgcattagaa cactggacct atgagacagc agatcttgcc ttaatagcac 301 ggaccaaaca tgaagaggat aaattctacc atgagccata tctgaaaccg cttatttggg 361 aaagagaaga atgagaatgg tcaaagaagt catcacatat ttgaagttcg tggatgtagc 421 tgcctcaagt atttttatga aaatggttaa acctgaaaca tataa // LOCUS HUMQBPVC 357 bp ds-DNA PRI 04-AUG-1990 DEFINITION Human mitochondrial ubiquinone-binding protein (HQPV) pseudogene, complete cds. ACCESSION M26704 KEYWORDS pseudogene; ubiquinone-binding protein. SOURCE Human leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 357) AUTHORS Suzuki,H., Hosokawa,Y., Toda,H., Nishikimi,M. and Ozawa,T. TITLE Isolation of a single nuclear gene encoding human ubiquinone-binding protein in complex III of mitochondrial respiratory chain JOURNAL Biochem. Biophys. Res. Commun. 161, 371-378 (1989) STANDARD simple staff_review FEATURES from to/span description pept.ps 71 169 pseudo-ubiquinone-binding protein (HQPV) BASE COUNT 159 a 51 c 64 g 83 t ORIGIN 1 catttttagt tcccaagagc cgggcataaa caagagacct gagggaatcc tcctctctta 61 aaaaacaaag cagtggacca actatgaaga ggaaaatttc tgccttgaac tgtatctgaa 121 agatgttatt tgggaaagaa aagaaagaga agaatgggca aagaagtaat cacatggttg 181 aagtctgtgg atgcaactgt tatgaagatg gttaaacttg aaataaacaa ttttaagaat 241 tatttggtct gcagatattt tactttaaat aaatgtctat tgtaatggta aaaaaaacaa 301 aaacaaaaac aaaacaaaac aaaaacaaaa acaaaacaaa accaaaaaaa cctcagg //
GenBank-Updates@genbank.bio.net (08/07/90)
LOCUS RATKV1AA 3291 bp ds-DNA ROD 07-AUG-1990 DEFINITION Rat potassium channel-Kv1 gene, complete cds. ACCESSION M27158 M37145 KEYWORDS potassium channel-Kv1; transmembrane protein. SOURCE Rat brain DNA and cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 477 to 3291) AUTHORS Swanson,R., Marshall,J., Smith,J., Williams,J., Boyle,M.B., Folander,K., Luneau,C.J., Antanavage,J., Oliva,C., Buhrow,S.A., Bennett,C., Stein,R.B. and Kaczmarek,L.M. TITLE Cloning and expression of cDNA and genomic clones encoding three delayed rectifier potassium channels in rat brain JOURNAL Neuron 4, 929-939 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3291) AUTHORS Luneau,C.J. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Computer readable copy of sequence [1],[2] kindly submitted by R.J.Swanson, 16-AUG-1989, for release after publication. FEATURES from to/span description pept 761 2569 potassium channel-Kv1 BASE COUNT 741 a 905 c 897 g 748 t ORIGIN 1 acaatacgat agcaacatga cagttatgaa gtagcaacaa aaaataaatt tagggtgtgt 61 gtgtgggggg atggtcaaca caacatgagg aactgtatta aaggtcgaga agcactgctc 121 taaaagaaaa acataggaag gactctcttc ttcaatctct tacactggaa ctcagctaaa 181 agcctttccc ttccatctaa agcagttccc accctctggc gtcccctggc tttcagtgca 241 ttcccaggtg atctgagatc ctgaagctag ctattctcat cctgttgccc tgggagtttc 301 agcactcctg gtacaacctg ttcctcgctc ctcctgctct tttcctcctc cccttcctgt 361 cttgcggcag gggccccagc tgaaccctcc agagagacac tctccactcc gtccccaaag 421 gagaaacgcc cgggggagga gcagaggagg gagccacaga gggcggaggc cggggagctc 481 agccagagag gggctgctgg aggttgcatc tgctggaagg aggtttttcc cgctgcctgg 541 taaccagctg ccagaacact gagagggaga gagagaggca gggagcaggg gcagcagctt 601 cttgacgtca ggaccagcca aggatcgggc cagctacccc ggccagccta gagtcagcgg 661 ggccctcggc tggagaggag acctacgcga ggcttggagc gtgagttggg ggtgtgggaa 721 ccggttgtct ggggcgcagc atgccccctg cccccggacc atggagatct ccctggtgcc 781 cctggagaat ggcagtgcca tgaccctcag aggaggaggg gaggcagggg caagctgtgt 841 gcagaccccc aggggagagt gtgggtgccc tccgacgtct ggactcaata atcagtccaa 901 agaaacactg ttgagggggc gcacaacgct cgaggatgcg aaccagggtg gacggccttt 961 gccccctatg gctcaggagc tgccacaacc tagaaggcta tctgctgagg atgaggaggg 1021 agaaggcgac cctggcctgg gcacagtgga ggaggaccag gctcctcagg atgcagggtc 1081 actccatcac cagcgagtcc tcataaacat ctccgggttg cgtttcgaga cgcagctggg 1141 caccctggca cagtttccca acaccctcct gggggaccca gccaagcgcc tgcactactt 1201 cgaccccttg agaaatgaat acttcttcga ccgcaaccgg cccagcttcg atggcatttt 1261 gtactactac cagtctgggg gccgcctgcg caggcccgtc aatgtctccc tggatgtgtt 1321 tgcagatgag atccgctttt accagctggg ggacgaggcc atggagcgct tccgggagga 1381 tgagggcttc atcaaggaag aggagaagcc cctgccccgc aacgagttcc agcgccaggt 1441 atggcttatc ttcgaatacc cagaaagctc tgggtccgca agagccatcg ccatagtgtc 1501 ggtcctggtc attctcatct ctatcatcac cttctgcctg gagactctgc ctgagttcag 1561 ggatgagcgg gagctgctac gccatccccc agtgccgccc cagcccccag cccctgcccc 1621 tgggatcaat ggcagcgtct ctggagcact ttcctctggc cctacggtgg ctccactctt 1681 gcctaggaca ctggccgatc cattcttcat cgtggagacc acatgtgtga tctggttcac 1741 ttttgagttg ctcgtgcgct tctttgcctg ccccagcaag gcagaattct ctcggaatat 1801 catgaacatc atcgatgtcg tggccatctt cccctacttt atcaccctgg gcaccgagct 1861 ggcagagcaa caaccagggg gagggggtca gaatgggcag caggccatgt ccctagccat 1921 cctcagggtg atccgcctgg tgcgggtgtt tcggatcttc aagctctccc gccactccaa 1981 gggactgcag atcctgggta agaccttgca agcatccatg cgggaactcg ggctactcat 2041 cttcttcctc ttcattggag tcatcctctt ctccagcgct gtctacttcg cagaggcaga 2101 caatcacggg tcccatttct ctagtatccc agatgccttc tggtgggcag tagtcactat 2161 gaccactgta ggctatggag acatgagacc catcactgta gggggcaaga tcgtgggttc 2221 actgtgcgcc atagctgggg tcctcaccat tgccctgcct gtccccgtca tcgtctccaa 2281 ctttaattac ttctatcatc gggagacaga ccacgaggag caggctgccc tgaaggaaga 2341 acaaggcaac cagaggcggg agtctgggct ggacacaggg ggtcaacgga aggtcagctg 2401 cagcaaggcc tccttttgca agactggggg ttccctggag agttctgaca gtatcagaag 2461 gggtagctgt cctctagaaa agtgtcacct caaggccaag agcaacgtgg acttgcggag 2521 gtccctgtat gccctctgtc tggacactag ccgtgaaaca gatttgtaaa gagagatcca 2581 ggcagactgg caccagtgaa gctggccaca ggggtgcccc ttgagcctgg gcatctgctt 2641 tacaccaccg agtatttaag cccacctggt cacccaccga tgtctgcctt tctttcaccc 2701 ttctcacctt cctgccctct aacttcccca ttttaactcc tctttccata acccccaggg 2761 tcgcctattt ttaacaagta tgacattcca tgacgcagga gccgttgaag tgctgagccc 2821 tcactggaag atggatgcat tcatagccag ttttctacac ccagcagagg gataatcaaa 2881 caaatttgac tttaaaaagc ttagatccca agagaatcca tagcattccc tacccgtgtt 2941 ccaaatttac tttacatggg attacatttg tgtatgggca aatattattt ttatggccga 3001 tgactgcatt ttgtacagca gtgcacatga agaatatttg ggtgtgctct taagatatac 3061 tttgaattat gaaagtgttg ggggggtcac catttcctgg gacactcagg gacaaagacc 3121 ctaaaagagt tgttcaggtt tctgtgttct caagcttctt attattctgg gatgtactat 3181 tggtgcttct ggtctaggag agagaatgtt ctagaagacc agcaaatcta atcttgtgac 3241 tgtcttcaca gaagcatttt aaggatgttg gaagaaggat ttggagaatt c // LOCUS RATKV2AA 2098 bp ds-DNA ROD 07-AUG-1990 DEFINITION Rat potassium channel-Kv2 gene, partial cds. ACCESSION M27159 M37146 KEYWORDS potassium channel-Kv1; transmembrane protein. SOURCE Rat brain DNA and cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 622 to 2098) AUTHORS Swanson,R., Marshall,J., Smith,J., Williams,J., Boyle,M.B., Folander,K., Luneau,C.J., Antanavage,J., Oliva,C., Buhrow,S.A., Bennett,C., Stein,R.B. and Kaczmarek,L.M. TITLE Cloning and expression of cDNA and genomic clones encoding three delayed rectifier potassium channels in rat brain JOURNAL Neuron 4, 929-939 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2098) AUTHORS Luneau,C.J. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Computer readable copy of sequence [1],[2] kindly submitted by R.J.Swanson, 16-AUG-1989, for release after publication. FEATURES from to/span description pept 865 > 2098 potassium channel-Kv2 BASE COUNT 358 a 588 c 698 g 453 t 1 others ORIGIN 1 agtgtgcatc tctctctctc tctctccagc atccaggctt gctgcagccc ctagctgggt 61 aaggggtgtg atgtgagagt ggggtgggag ggggcacagg gcggggcctg ccacgtcact 121 tggagagtgt gttaaggagg aagggcagag ctgagagctg agcctgctgc tgctgttgct 181 gctgctgctg ttgctgctgc ttgctgttgc tgctgctgct ggggctgctg cgggtgcggg 241 caaggattga tgcggtgggg aggtgggtct ctcgctccgg ncccgggcag gcccagggcg 301 gttccgaggc ctgtagaaca gctcctggga agaaggtggc ggctgcaatc gcgaccttgg 361 ccagacctag ttcgtggtgg acgtagggcg gaggcggagg ccgagcccgg gcaggagtct 421 ttggcgagcc agagggaggc gcatctggcg cttcggtacc agcggcagcc ttgtgagtct 481 tgtgaggctg gagaagtgca gcaggacctg gtgagagcta gcaagtctgg agggcgcctg 541 tgggtcccga gcccagtttt aggcaccacc gagagctaag ccaagcgtct tttcgggcag 601 ccagtttcac cgcgcgacag tttgcgggtt ccaggcatct cagaaatctt gagcacggag 661 gcgcggctac tgagagccag agccacatcc cagacctagc ctggcagaga gaccagctgc 721 agggttcacc gacctaaccg ccaggtcaga gcacgggccc caccctaaag gagggcgcag 781 ccggagctgg gaagccggtg ccgcgctccg gagctcgtgt cgtgggcgcc gtcctagtgg 841 cggggagcgc accgccgagg tgacatgaga tcggagaaat ccctgacgct ggcggcgccg 901 ggggaggtcc gtgggccgga gggggagcaa caggatgcgg gtgagttcca ggaggccgag 961 ggcggcggcg gctgctgtag tagtgagagg ctggtgatca acatctctgg gctgcgctac 1021 gagacgcagc tgcgcacctt gtcgctgttc cctgacacgc tgctaggaga ccctggccgc 1081 agagtccgct tctttgaccc cttgaggaat gagtacttct ttgaccgcaa ccgacccagc 1141 ttcgacgcta tcctttatta ctaccagtcg gggggtcgcc tgcgcaggcc ggttaacgtg 1201 ccccttgaca tctttatgga agagattcgc ttctatcagt tgggagatga agccctggcg 1261 gccttccggg aggatgaggg ttgcctgccc gaaggtggtg aggatgagaa gccactcccc 1321 tcccagcctt tccagcgaca ggtctggctc ctctttgagt atccggagag ttctgggccc 1381 gcccgaggca ttgccatcgt ctcagtgttg gtcatcctca tctccattgt catcttttgc 1441 ctggagacct tgcctcagtt ccgtgcagat gggcgcggtg gaagcaacga ggggagtggg 1501 acccgcatgt ccccggcctc cagggggagc cacgaggagg aagatgaaga cgaggattcc 1561 tatgcatttc ctggtagcat tccctctggg gggttgggga ccggaggaac ttcttcattt 1621 agtactctcg ggggttcctt cttcacagac cccttcttcc tggtggaaac tctgtgtatc 1681 gtctggttca cctttgagct cctggtgcgc ttctctgcct gtcccagcaa ggcggccttc 1741 tttcgcaata tcatgaacat cattgacttg gtggccatct tcccctactt tatcaccctg 1801 ggcaccgagc tagtgcaacg tcacgagcag cagcctgtga gtggtggcag tggtcagaat 1861 gggcagcagg ccatgtccct agccatcctc agggtgatcc gcctggtccg ggtgtttcgg 1921 atcttcaagc tctcccgcca ctccaagggg ttgcagatcc tgggtaagac cttgcaagca 1981 tccatgcggg agctcgggct actcatcttc ttcctcttca ttggagtcat cctcttctcc 2041 agcgctgtct acttcgcaga ggcagatgac gttgactcgc tcttccctag catcccag // LOCUS RATKV3AA 2264 bp ds-DNA ROD 07-AUG-1990 DEFINITION Rat potassium channel-Kv3 gene, complete cds. ACCESSION M31744 KEYWORDS potassium channel-Kv3; transmembrane protein. SOURCE Rat brain DNA and cDNA to mRNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (sites) AUTHORS Swanson,R., Marshall,J., Smith,J., Williams,J., Boyle,M.B., Folander,K., Luneau,C.J., Antanavage,J., Oliva,C., Buhrow,S.A., Bennett,C., Stein,R.B. and Kaczmarek,L.M. TITLE Cloning and expression of cDNA and genomic clones encoding three delayed rectifier potassium channels in rat brain JOURNAL Neuron 4, 929-939 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2264; for [1]) AUTHORS Luneau,C.J. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Computer readable copy of sequence [1],[2] kindly submitted by R.J.Swanson, 16-AUG-1989, for release after publication. [2] Author Address Merch, Sharp, and Dohme Research Labs W42-300 West Point, PA 19486 FEATURES from to/span description pept 508 2085 potassium channel-Kv3 BASE COUNT 436 a 684 c 695 g 449 t ORIGIN 1 gggttcctcc aaggccagtc tggcagggta ggggttccgc ggggtctggg tagtggggtg 61 caagtcgaag gcgacccagt gacaaaagaa gggcgaggaa caggggtggc aagcggaaag 121 cggagccgcg cgcgcagtgg gcgggaaacc cgggggcggc ggcagagagg cgggcgcgcg 181 cctcgcgggg aggggtgtgg aggagggagg aggaggcgtc tggacggtgc agccgggcgg 241 gccgcgccag ccctgacaga gcctcgcggc agccgcagca acaggcggcg gggccgggct 301 cgtggggcag aggcggtggt ggggcgggga gctccaggag gagcgactcg gagctggcca 361 ccgccagcag ggaaggagag catcgccgcc tcccgcgctc ggccgccgca gccctccgcc 421 cgccaccgcg cccaccctgc gctggacccc gcgggcggcg gcgcgcgcat cctgcagagc 481 ccaggctacg ccgagctgcc gccagacatg accgtggtgc ccggggacca cctgctggag 541 ccagaagcgg cgggaggcgg cggcggggac ccgcctcagg gaggctgtgt cagtggcggc 601 ggctgcgacc gctacgaacc gctgccgccc gcgctgcccg ccgcgggcga gcaggattgc 661 tgcggggagc gcgtggtcat caacatctcc gggctgcgct tcgagacaca gctcaagacc 721 ctctgccagt tccctgagac gctgctaggc gaccccaagc ggcgcatgcg gtacttcgac 781 ccgctccgca atgagtactt ctttgaccgc aacagaccca gcttcgacgc catcctctac 841 tactaccagt ccgggggccg catccgccgg ccggtcaacg tgcccatcga catcttctcc 901 gaggagatcc gcttctacca actgggtgag gaggctatgg agaagttccg tgaggacgag 961 ggcttcctgc gggaggagga gcgacccctg ccccgccgtg acttccagcg ccaggtgtgg 1021 ctgctcttcg aataccccga gagctccggg ccggcccggg gcattgccat cgtgtcagtg 1081 ctggtcattc tcatctccat tgtcatcttc tgcttggaga cactacccga gtttcgcgac 1141 gagaaggact atcccgcctc tccgtcgcag gacgtgtttg aggctgccaa caacagcacg 1201 tcgggggcct cctctggagc ctccagcttc tcggacccct tcttcgtagt ggagaccctg 1261 tgcatcatct ggttctcctt tgagctgctg gtgcgattct ttgcttgccc cagtaaagcc 1321 accttctcca gaaatatcat gaacctgata gacattgtag ccatcatccc ttattttatt 1381 actctgggca ctgagctggc tgagcgacag ggtaatgggc agcaggctat gtcactggcc 1441 atcctgaggg tcatccgcct agtaagggtc ttccgcatct tcaagctctc ccgccattct 1501 aaggggctgc agatcctggg acagacactg aaggcttcca tgcgagagct ggggctgctc 1561 attttcttcc ttttcattgg ggtcatcctt ttctccagtg cagtctactt tgctgaggca 1621 gacgaccctt cttcgggttt taacagtatc ccggatgcct tctggtgggc cgtggtaacc 1681 atgacaactg tcggttatgg tgatatgcac ccagtgacca taggaggcaa gattgtgggc 1741 tctctttgtg ccatcgcagg tgtcttgacc attgcattgc cggttcctgt gattgtttcc 1801 aacttcaatt acttctacca ccgggagaca gaaggggaag agcaagccca gtacatgcac 1861 gtgggaagct gccagcacct ctcctcttca gcagaggagc tccgaaaagc ccggagtaac 1921 tccactctga gtaagtcgga gtatatggtg atcgaagagg ggggtatgaa ccacagcgcc 1981 ttcccccaga cccccttcaa aacgggcaac tccactgcca cttgcaccac caacaataat 2041 cccaactcct gtgtcaacat caaaaagata ttcactgatg tctaatagat gatacgattg 2101 ccattctgtg cccagtattg tgtggaacat gcccccttgg tctgtgtatg cccttgattt 2161 atacatttcc agaccattca tcaaggaaag tacatgaaga agtggaaaag cacacttcat 2221 tctccctctc cctattgctt catactgaaa caggtgcctg gttt // LOCUS RATKV3AB 2956 bp ss-mRNA ROD 07-AUG-1990 DEFINITION Rat mRNA for putative potassium channel subunit protein (RCK3). ACCESSION X16001 KEYWORDS membrane protein; potassium channel protein. SOURCE Rattus rattus. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2956) AUTHORS Pongs,O. TITLE ; JOURNAL Unpublished (1989) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 2956) AUTHORS Stuehmer,W., Ruppersberg,J.P., Schoerter,K.H., Sakmann,B., Stocker,M., Giese,K.P., Perschke,A., Baumann,A. and Pongs,O. TITLE Molecular Basis of Functional Diversity of Voltage Gated Potassium Channels in Mammalian Brain JOURNAL EMBO J. 8, 3235-3244 (1989) STANDARD simple automatic COMMENT *source: developmental stage=adult;tissue=brain-cortex; *source: clone=RCK3; [1] Author address Pongs O., Ruhr Universitaet Bochum, Fakultaet f Chemie , Lehrstuhl f Biochemie, Universitsstr 150, D-4630 Bochum, F R G. Submitted (31-JUL-1989) on tape to the EMBL Data Library FEATURES from to/span description pept 8 1585 put. RCK3 protein (AA 1-525) BASE COUNT 733 a 774 c 690 g 759 t ORIGIN 1 gccagacatg accgtggtgc ccggggacca cctgctggag ccagaagcgg cgggaggcgg 61 cggcggggac ccgcctcagg gaggctgtgt cagtggcggc ggctgcgacc gctacgaacc 121 gctgccgccc gcgctgcccg cggcgggcga gcaggattgc tgcggggagc gcgtggtcat 181 caacatctcc gggctgcgct tcgagacaca gctcaagacc ctctgccagt tccctgagac 241 gctgctaggc gaccccaagc ggcgcatgcg gtacttcgac ccgctccgca atgagtactt 301 cttcgaccgc aacagaccca gcctcgacgc catcctctac tactaccagt ccgggggccg 361 catccgccgg ccggtcaacg tgcccatcga catcttctcc gaggagatcc gcttctacca 421 actgggtgag gaggctatgg agaagttccg tgaggacgag ggcttcctgc gggaggagga 481 gcgacccctg ccccgccgtg acttccagcg ccaggtgtgg ctgctcttcg aataccccga 541 gagctccggg ccggcccggg gcattgccat cgtgtcagtg ctggtcattc tcatctccat 601 tgtcatcttc tgcttggaga cactacccga gtttcgcgac gagaaggact atcccgcctc 661 tccgtcgcag gacgtgtttg aggctgccaa caacagcacg tcgggggcct cctctggagc 721 ctccagcttc tcggacccct tcttcgtagt ggagaccctg tgcatcatct ggttctcctt 781 tgagctgctg gtgcgattct ttgcttgccc cagtaaagcc accttctcca gaaatatcat 841 gaacctgata gacattgtag ccatcatccc ttattttatt actctgggca ctgagctggc 901 tgagcgacag ggtaatgggc agcaggctat gtcactggcc atcctgaggg tcatccgcct 961 agtaagggtc ttccgcatct tcaagctctc ccgccattct aaggggctgc agatcctggg 1021 acagacactg aaggcttcca tgcgagagct ggggctgctc attttcttcc ttttcattgg 1081 ggtcatcctt ttctccagtg cagtctactt tgctgaggca gacgaccctt cttcgggttt 1141 taacagtatc ccggatgcct tctggtgggc cgtggtaacc atgacaactg tcggttatgg 1201 tgatatgcac ccagtgacca taggaggcaa gattgtgggc tctctttgtg ccatcgcagg 1261 tgtcttgacc attgcattgc cggttcctgt gattgtttcc aacttcaatt acttctacca 1321 ccgggagaca gaaggggaag agcaagccca gtacatgcac gtgggaagct gccagcacct 1381 ctcctcttca gcagaggagc tccgaaaagc ccggagtaac tccactctga gtaagtcgga 1441 gtatatggtg atcgaagagg ggggtatgaa ccacagcgcc ttcccccaga cccccttcaa 1501 aacgggcaac tccactgcca cttgcaccac caacaataat cccaactcct gtgtcaacat 1561 caaaaagata ttcactgatg tctaatagat gatacgattg ccattctgtg cccagtattg 1621 tgtggaacat gcccccttgg tctgtgtatg cccttgattt atacatttcc agaccattca 1681 tcaaggaaag tacatgaaga agtggaaaag cacacttcat tctccctctc cctattgctt 1741 catactgaaa caggtgcctg gttttgcaag tgggctgcat tctctcagct ctttttctct 1801 ctccctctct ctctccctgt ttcttaattt tgtgaacaac aaacttacat taagcttggt 1861 ttcttgtgca agccctaaag ctacatcctg agaggaaatg aaactataga gtcagagtaa 1921 ctgtgtgacc tcaaaaatca aaagatagtt gtttctttac taagtaaagg aggcagaact 1981 taaatgatgc tcctgttcgg tggacttttc agcattattt cctgactaag tgtaactact 2041 taacctctgg atgtgtgggt ggaaagtcta actagagtgt tgacgtgtaa accctctata 2101 attttttttt ttttggactg gcctttcagt ttgatctccc ctctaagaat ttcaggttca 2161 cataactttg aaccaaaagg gaaatgcccc aaatgtcctg atctgactat taactcttcg 2221 gagtttgcaa agcattttga aaacattaga cacagattcc aatgaagttg tgagttcatg 2281 cccaagtcaa tatctacgaa agtctagaaa ctcattttca gtgctgccag gagaaacaac 2341 agaaattcct gatgcatctg agagataagc ttctgcagta tcacaggaag attaaagtgg 2401 cagacacccc ttccagcgga agttactaat tcggacctga ctgatgcagt tcccatagca 2461 acccgtgttt cctgggaaac ccgaaaaagg ttgtcatggc atcttttgct ctctagcccc 2521 acccccaccc caggccgtgc gtttctacag taacctttcc agatggttcc tacttacacg 2581 acttcataag aaaaaacact atttgaataa agcgcacaaa ttaagtgaaa gactatatag 2641 aggtgaaatg aatcacaaaa tgctttttta cattttatat aaatcattga tgttatccca 2701 taacgactgg atcagaaagg aaactgtcgt ctttgaaatg ttacatgtaa ggcatgatct 2761 gaattaaatt atgtcagctg tttggtaaca atttttaaag atgcttctgt acagttcttt 2821 cactaagaaa tgacaagtca acagatgaaa cgaaaacagc aatataaaca tggtctgtac 2881 ataagcaagt gagagtcagt gtacccgaag tcttacacga taccttcaga tttcctcaat 2941 taaaaaaaaa aaaaag // LOCUS CEUMTFVLA 2682 bp ds-DNA ORG 07-AUG-1990 DEFINITION C.unicolor mitochondrial Phe-, Val-tRNA, 12S, and 16S rRNA genes, complete, and Leu-tRNA gene, 5' end. ACCESSION M35875 KEYWORDS 12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu; transfer RNA-Phe; transfer RNA-Val. SOURCE C.unicolor liver mitochondrion cell line DH-alpha-5 DNA, clone PCUNMR 9.0-18. ORGANISM Mitochondrion Cervus unicolor Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae; Cervinae; Cervus unicolor. REFERENCE 1 (bases 1 to 2682) AUTHORS Miyamoto,M.M., Kraus,F. and Ryder,O.A. TITLE Phylogeny and evolution of antlered deer determined from mitochondrial DNA sequence JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.M.Miyamoto, 03-JUL-1990. FEATURES from to/span description tRNA 1 69 Phe-tRNA rRNA 70 1024 12S ribosomal RNA tRNA 1025 1091 Val-tRNA rRNA 1092 2663 16S ribosomal RNA tRNA 2664 > 2682 Leu-tRNA BASE COUNT 1010 a 555 c 467 g 650 t ORIGIN 1 gttgatgtag cttaaacaac aaagcaaggc actgaaaatg cctagatgag tatattaact 61 ccataaacat acaggtttgg tcccagcctt cctattaacc cttaatagac ttacacatgc 121 aagcatccgc accccagtga aaatgccccc taagttaata agactaagag gagctggtat 181 caagcacaca tccgtagctc acgacacctt gcacagccac acccccacgg gagacagcag 241 tgataaaaat taagccataa acgaaagttt gactaagcca tattaatcag ggttggtaaa 301 tttcgtgcca gccaccgcgg tcatacgatt aacccaagtt aataggcaca cggcgtaaag 361 cgtgttaaag cactatttca aataaagtta aactccaatt aagctgtaaa aagccataat 421 tgcaacaaaa ataaacaacg aaagtaactt tacaaccgct gaaacacgat agctaggacc 481 caaactggga ttagataccc cactatgcct agccttaaac acaaatagtt gtgtaaacaa 541 aactattcgc cagagtacta ccggcaatag cttaaaactc aaaggacttg gcggtgcttt 601 acacccttct agaggagcct gttctataat cgataaaccc cgataaacct caccattcct 661 tgctaataca gtctatatac cgccatcttc agcaaaccct aaaaaggtac aaaagtaagc 721 acaatcataa tacataaaaa cgttaggtca aggtgtaacc tatggaacgg aaagaaatgg 781 gctacatttt ctaatctaag aaaatccaac acgaaagtta ttatgaaatt aataaccaaa 841 ggaggattta gcagtaaact aagaatagag tgcttagttg aattaggcca tgaagcacgc 901 acacaccgcc cgtcaccctc ctcaagtagg cacaatacac tcaaatttat ttacacgtat 961 taatcatatg agaggagaca agtcgtaaca aggtaagcat actggaaagt gtgcttggat 1021 aaatcaagat atagcttaaa taaagcatct agtttacacc tagaagattt cacatatcac 1081 gaatatcttg aaccaattct agcccgcaag cccattcaca ctaaattatc aatgtaacat 1141 aaaataaaac atttattcaa caacaaaaag tataggagat agaaatttta atacggcgct 1201 atagagaaag taccgtaagg gaacgatgaa agaaaaaaat caaagtacaa aaaagcaaag 1261 attacccctt gtaccttttg cataatgagt taactagtaa aaacttaaca aaatgaattt 1321 cagctaagta ccccgaaacc agacgagcta cttatgaaca atttatcgag aaccaactca 1381 tctatgtagc aaaatagtga gaagatttgt aagtagaggt gaaacgccca acgagcctgg 1441 tgatagctgg ttgtccagaa aatgaatatt agttcagctt taaaaatacc aaaaatatga 1501 acaaattata atgtattttt aaaagttagt ctaaaaaggt acagcctttt agaaatggat 1561 acaaccttaa ctagagagta agttttaaca ttaaaccata gtaggcctaa aagcagccac 1621 caattaagaa agcgttaaag ctcaacaata aaatagtatt aatttcaata ataaatagtc 1681 aactcctaat ctaatactgg actaatctat taagaataga agcaataatg ttaatatgag 1741 taacaagaag taacttctcc ccgcataagt ttaagtcagt atctgataat actctgacta 1801 ttaacagcaa aataagaata acctaactat aaataactta ttaactatac tgttaatccg 1861 acacaggaat gcacttaagg aaagattaaa agaagtaaaa ggaactcggc aaacactaaa 1921 ccccgcctgt ttaccaaaaa catcacctcc agcataacta gtattggagg cactgcctgc 1981 ccagtgacaa ccgttaaacg gccgcggtat cctgaccgtg caaaggtagc ataatcactt 2041 gttctctaaa tagggacttg tatgaatggc cacacgaggg ttttactgtc tcttacttcc 2101 aatcagtgaa attgaccttc ccgtgaagag gcgggaatat actaataaga cgagaagacc 2161 ctatggagct ttaactactt agcccaaaga aacaaatttc attaccaagg aaacaacaac 2221 actctttatg ggctaacagc tttggttggg gtgacctcgg agaacaagaa aacctccgag 2281 cgattttaaa gactagacct acaagtcgaa tcacacaatc gtttattgat ccaaaaaatt 2341 gatcaacgga acaagttacc ctagggataa cagcgcaatc ctattcaaga gtccatatcg 2401 acaatagggt ttacgacctc gatgttggat caggacatcc cgatggtgca accgctatca 2461 aaggttcgtt tgttcaacga ttaaagtcct acgtgatctg agttcagacc ggagtaatcc 2521 aggtcggttt ctatctatta tgtatttctc ccagtacgaa aggaccagag aaataaggcc 2581 aacttcaaat aagcgcctta aattagttaa tgatatcatc ttaattaacc tcacaaacaa 2641 actttgccct agagaagggc tttgttaagg tggcagagcc gg // LOCUS CPDBA 76 bp ss-RNA INV 07-AUG-1990 DEFINITION C.colpoda 5.8S ribosomal RNA. ACCESSION M35558 KEYWORDS 5.8S ribosomal RNA. SOURCE C.colpoda ribosomal RNA. ORGANISM Colpidium colpoda Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae. REFERENCE 1 (bases 1 to 76) AUTHORS Van Bell,C.T. TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina (Ciliophora: Hymenostomatida) JOURNAL J. Mol. Evol. 22, 231-236 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 76 5.8S ribosomal RNA BASE COUNT 21 a 14 c 20 g 21 t ORIGIN 1 atgcgaattg cagaacacgc gagtcatcag atctttgaac gcaattggtg ggggtgtaaa 61 aaccttcatg ttcgtt // LOCUS ECOLIVRA 729 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli high-affinity branched-chain amino acid transport repressor (livR) gene, complete cds. ACCESSION M36020 KEYWORDS high-affinity branched-chain amino acid transport repressor. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 729) AUTHORS Antonucci,T.K., Wagner,L.M. and Oxender,D.L. TITLE Cloning, expression, and nucleotide sequence of livR, the repressor for high-affinity branched-chain amino acid transport in Escherichia coli JOURNAL Proteins 1, 125-133 (1986) STANDARD simple staff_review FEATURES from to/span description pept 229 729 high-affinity branched-chain amino acid transport repressor (livR) BASE COUNT 181 a 192 c 170 g 186 t ORIGIN 1 cggcatgtca tttcctctct ctgcaattgt gtacaaatgt acacaacctt gctaatgttg 61 tgttccctct cttatttgac tgatctatga ccgtaaattc ttcacgtaat gcattgaaac 121 gccgaacctg gcattgttta tgttcttctt tttgccaggc ctgttaatgg cgtcctgggc 181 aacccgtacg cctctatccg cgatattctc tctgtctcga tcgctgaaat gggcgtgttc 241 tctttggtct ggatctccat tgccggcgcg caggtacgtc cgcgatcatt gcgccgaaca 301 acagcgcgcg gttctttcgc aatacataag cacgaacaag aaacagataa ccacgcctgc 361 cataacacca tcaagccgaa ctcttcacat gcttggtgca cctgcgaatg taaccgaagg 421 tcggtggtta gcacgcataa agccagcata attccggcga ataacaaccc ggcagcgttc 481 cccatgccga aactgaatgt gctgaactgg aaggtgatca tcccgatcat cagcccaata 541 acaaagaagg cgcagaatgc cagcaggtca gtgacctgga ctgtgaatcg agataaagcc 601 gatgcgatcc agtgttttta cgcggcgggc atcgtggatc ttgtaaaagg tcactttgtt 661 aagcacgacg ttgtcatcta tcggcatctc aatctggcta gaatgacgcg gttaaggaag 721 caaccgtga // LOCUS GCHBA 76 bp ss-RNA INV 07-AUG-1990 DEFINITION G.chattoni 5.8S ribosomal RNA. ACCESSION M35559 KEYWORDS 5.8S ribosomal RNA. SOURCE G.chattoni ribosomal RNA. ORGANISM Glaucoma chattoni Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Glaucomidae. REFERENCE 1 (bases 1 to 76) AUTHORS Van Bell,C.T. TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina (Ciliophora: Hymenostomatida) JOURNAL J. Mol. Evol. 22, 231-236 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 76 5.8S ribosomal RNA BASE COUNT 21 a 13 c 20 g 22 t ORIGIN 1 atgcgaattg cagaatccgc gagtcatcag atctttgaac gcaagtggtg gaggtgtaaa 61 aaccttcatg tttgtt // LOCUS HYRMTFVLA 2680 bp ds-DNA ORG 07-AUG-1990 DEFINITION H.inermis mitochondrial Phe-, Val-tRNA, 12S, and 16S rRNA genes, complete, and Leu-tRNA gene, 5' end. ACCESSION M35876 KEYWORDS 12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu; transfer RNA-Phe; transfer RNA-Val. SOURCE H.inermis liver and spleen mitochondrion cell line DH-alpha-5 DNA, clone PHINMR 4.0-6. ORGANISM Mitochondrion Hydropotes inermis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae; Odocoileinae; Hydropotes inermis. REFERENCE 1 (bases 1 to 2680) AUTHORS Miyamoto,M.M., Kraus,F. and Ryder,O.A. TITLE Phylogeny and evolution of antlered deer determined from mitochondrial DNA sequence JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.M.Miyamoto, 03-JUL-1990. FEATURES from to/span description tRNA 1 70 Phe-tRNA rRNA 71 1027 12S ribosomal RNA tRNA 1028 1094 Val-tRNA rRNA 1095 2661 16S ribosomal RNA tRNA 2662 > 2680 Leu-tRNA BASE COUNT 1010 a 526 c 468 g 676 t ORIGIN 1 gttaatgtag cttaaaaaat aaagcaaggc actgaaaatg cctagatgag tatatttaac 61 tccataaaca caaaggtttg gtcccagcct tcctattgac ctttaataga cttacacatg 121 caagcatccg catcccagtg aaaatgccct ccaagttaat aagactaaga ggagctggta 181 tcaagcacac acccgtagct catgacacct tgcttagcca cacccccacg ggagacagca 241 gtgataaaaa ttaagccatg aacgaaagtt tgactaagcc atattaatta gggttggtaa 301 atttcgtgcc agccaccgcg gtcatacgat taacccaagt taataggcat acggcgtaaa 361 gcgtgttaaa gcaacatata aaaatagagt taaactttaa ttaagctgta aaaagccata 421 attattaacg aaaataaata acgaaagtaa ctttacaatc gctgaaacat gatagctaag 481 acccaaactg ggattagata ccccactatg cttagcccta aacacaaata gttataataa 541 caaaactgtt cgccagagta ctaccggcaa tagcttaaaa ctcaaaggac ttggcggtgc 601 tttatatcct tctagaggag cctgttctat aatcgataaa ccccgatata cctcaccacc 661 ccttgctaat gcagtctata taccgccatc ttcagcaaac cctaaaaagg agcaaaagta 721 agcataatca taatacataa aaacgttagg tcaaggtgta acctatggag tgggaagaaa 781 tgggctacat tttctaattt aagaaaactt aatacgaaag ttattatgaa attaataacc 841 aaaggaggat ttagcagtaa actaagaata gagtgcttag ttgaactagg caatgaagca 901 cgcacacacc gcccgtcacc ctcctcaaat aagcacaata cgcttaaatt tattagtatg 961 tataaactgt atgagaggag acaagtcgta acaaggtaag catactggaa agtgtgcttg 1021 gataaatcaa gatatagctt aaataaagca cctagtttac acctagaaga tttcacacac 1081 cacgaatatc ttgaactaat tctagcccat aaatttattt acactaaact attattaata 1141 ttataaacaa agcatttatc tataattaaa gtataggaga tagaaatttt taatatgacg 1201 ctatagagaa agtaccgtaa gggaatgatg aaagaagaaa attaaagtac aaaaaagcag 1261 agattacccc tcgtaccttt tgcataatga gttaactagt aaaaacttaa caaaacgaat 1321 tttagctaag taccccgaaa ccagacgagc tacttataaa cagtttatca agaaccaact 1381 catctatgtg gcaaaatagt gagaagattt ataagtagag gtgaaacgcc taacgagcct 1441 ggtgatagct ggttgtccag aaaatgaata ttagttcagc tttaaaaata ccaaaaatat 1501 aaacaaatca cactgtattt ttaaaagtta gtctaaaaag gtacagcctt ttagaaatgg 1561 atacaacctt gactagagag taaaatttaa taacaccata gtaggcctaa aagcagccat 1621 caattaagaa agcgttaaag ctcaacaata agtttacatt aatttcaata attataaatc 1681 aactcctagc ttaatactgg actaatctat aaaaatagaa gcaataatgt taatatgagt 1741 aacaagaagt atattctccc tgcataagtt taagtcagta tctgataata tcctgactat 1801 taacagcaaa ataagaataa ctcaactata aatgacttat taattctact gttaatccaa 1861 cacaggaatg cacttaagga aagattaaaa gaagtaaaag gaactcggca aacactaaac 1921 cccgcctgtt taccaaaaac atcacctcca gcataactag tattggaggc actgcctgcc 1981 cagtgacaac cgttaaacgg ccgcggtatc ctgaccgtgc aaaggtagca taatcacttg 2041 ttctctaaat aaggactagt atgaatggcc aaacgagggt tttactgtct cttacttcca 2101 atcagtgaaa ttgaccttcc cgtgaagagg cgggaatata ataataagac gagaagaccc 2161 tatggagctt taactactta acccaaagaa acaaacttaa ccactaaggt gacaacaata 2221 ttctctatgg gttaacagct ttggttgggg tgacctcgga gaacaaaaaa tcctccgagc 2281 gattttaaag actagaccta caagtcgcat cgcacaatcg tttattgatc caaaaattga 2341 tcaacggaac aagttaccct agggataaca gcgcaatcct atttaagagt ccatatcgac 2401 aatagggttt acgacctcga tgttggatca ggacatcctg gtggtgcagc cgctatcaaa 2461 ggttcgtttg ttcaacgatt aaagtcctac gtgatctgag ttcagaccgg agtaatccag 2521 gtcggtttct atctattgtg tatttctccc agtacgaaag gaccagagaa ataaggccaa 2581 ctaaaaaaaa gcgccttaac ccaattaatg atctcatctc aattaaaatc acaaacaaac 2641 tccgccctag aaaagggctt tgttaaggtg gcagagccgg // LOCUS M36004 59 bp ds-DNA BAD 07-AUG-1990 DEFINITION Figure 1. Insert of plasmiod p WL144 and nucleic acid sequence of selA-selB overlap. ACCESSION M36004 REFERENCE 1 (bases 1 to 59) AUTHORS Forchhammer,K., Ruecknagel,K.-P. and Boeck,A. TITLE Purification and biochemical characterization of SELB, a translation factor involved in selenoprotein synthesis JOURNAL J. Biol. Chem. 365, 9346-9350 (1990) STANDARD simple staff_review COMMENT Secondary reference to EMBL entry X16644 for Nature 342, 453-456 (1989) FEATURES from to/span description BASE COUNT 14 a 9 c 18 g 18 t ORIGIN 1 gagcaacggt ttttggagat gttgttgaaa tgattattgc gactgccgga cacgttgac // LOCUS M36102 611 bp ds-DNA BAD 07-AUG-1990 DEFINITION Figure 2. Nucleotide sequence of AVP-NpII precursor mRNA from bovine hypothalamus. ACCESSION M36102 REFERENCE 1 (bases 1 to 611) AUTHORS Brownstein,M.J. TITLE biosynthesis of vasopressin and oxytocin JOURNAL Ann. Rev. Physiol. 45, 129-135 (1983) STANDARD unannotated staff_entry COMMENT Secondary reference to Nature 295, 299-303, accession number J00001. FEATURES from to/span description BASE COUNT 92 a 227 c 205 g 86 t 1 others ORIGIN 1 ngcacagtct acagagcagc actgcgcacg tgtgcccacg cgtgccagga tgcccgacgc 61 cacactgccc gcctgcttcc tcagcctgct ggccttcacc tctgcttgct acttccagaa 121 ctgcccaagg ggcggcaaga gggccatgtc cgacctggag ctgagacagt gtctcccctg 181 cggccccggg ggcaaaggcc gctgcttcgg gcccagcatc tgctgcgggg acgagctggg 241 ctgcttcgtg ggcacggccg aggcgctgcg ctgccaagag gagaactacc tgccgtcgcc 301 ctgccagtcc ggccagaagc cctgcgggag cgggggccgc tgcgccgccg ccggcatctg 361 ctgcaacgat gagagctgcg tgaccgagcc cgagtgccgg gaaggtgtcg gcttcccccg 421 ccgcgttcgc gccaacgacc ggagcaacgc gaccctgctg gacgggccga gcggggcctt 481 gttgctgcgg ctggtgcagc tggcgggggc gccggagccc gcggagcccg cccagcccgg 541 cgtctactga ggcgcgcccc ccccctcccc acccctgccc tcgcagcacg aaaaataaac 601 gttttaaagg c // LOCUS MMUMTFVLA 2682 bp ds-DNA ORG 07-AUG-1990 DEFINITION M.reevesi mitochondrial Phe-, Val-tRNA, 12S, and 16S rRNA genes, complete, and Leu-tRNA gene, 5' end. ACCESSION M35877 KEYWORDS 12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu-tRNA; transfer RNA-Phe; transfer RNA-Val. SOURCE M.reevesi liver and brain mitochondrion cell line JM109 DNA, clone PMREMR 4.8-16. ORGANISM Mitochondrion Muntiacus reevesi Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae; Cervulinae; Muntiacus reevesi. REFERENCE 1 (bases 1 to 2682) AUTHORS Miyamoto,M.M., Kraus,F. and Ryder,O.A. TITLE Phylogeny and evolution of antlered deer determined from mitochondrial DNA sequence JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.M.Miyamoto, 03-JUL-1990. FEATURES from to/span description tRNA 1 69 Phe-tRNA rRNA 70 1028 12S ribosomal RNA tRNA 1029 1196 Val-tRNA rRNA 1197 2664 16S ribosomal RNA tRNA 2659 > 2682 Leu-tRNA BASE COUNT 1001 a 543 c 463 g 675 t ORIGIN 1 gttaatgtag cttaagcaat aaagcaaggc actgaaaatg cctagatgag tgtactagct 61 ccataaacat ataggtttgg tcccagcctt cctattaacc cttaatagac ttacacatgc 121 aagcatccac atcccagtga aaatgccctc taagttaata agactaagag gagctggtat 181 caagcacaca tccgtagctc acgacgcctt gctcagccac acccccacgg gagacagcag 241 tgacaaaaat taagccataa acgaaagttt gactaagcca tattaattag ggttggtaaa 301 tttcgtgcca gccaccgcgg tcatacgatt aacccaagtt aataggtata cggcgtaaag 361 cgtgttaaag cactatttcc aaataaagct aaatttcaat taagctgtaa aaagccataa 421 ttgcaacaaa aataaataac gaaagtaact ttacagttgc tgaaacacga tagctaggac 481 ccaaactggg attagatacc ccactatgcc tagccctaaa cacaaatagt ttccacaaac 541 aaaactattc gccagagtac taccggcaat agcttaaaac tcaaaggact tggcggtgct 601 ttataccctt ctagaggagc ctgttctata atcgataaac cccgatagac ctcaccattc 661 ctcgctaata cagtctatat accgccatct tcagcaaacc ctaaaaagga ataaaagtaa 721 gcgcaatcat aatacgtaaa aacgttaggt caaggtgtaa cctatgggat gggaagaaat 781 gggctacatt ttctaactta agaataattc atatacgaaa gttattatga aattaataac 841 caaaggagga tttagcagta aactaagaat agagtgctta gttgaattag gccatgaagc 901 acgcacacac cgcccgtcac cctcctcaag taagtacgat atgctcaaac ttatttacat 961 atattaacca tatgagagga gataagtcgt aacaaggtaa gcatactgga aagtgtgctt 1021 ggataaatca agatatagct taaataaagt atctagttta cacctagaag atttcactat 1081 accacgaata tcttgaacca attctagccc ataagccgat tcacactaaa ttaccaattt 1141 attataaata aaacatttac ctaccattaa aagtatagga gatagaaatt ttaatatggc 1201 gctatagaga tagtaccgta agggaacgat gaaagaaaaa aatcaaagta caaagaagca 1261 aagattaccc cttgtacctt ttgcataatg agttaactag caaaaactta acaaaatgaa 1321 ttttagctaa gtaccccgaa accagacgag ctacttatga acaatttatc gagaaccaac 1381 tcatctatgt agcaaaatag tgagaagatt tgtaagtaga ggtgaaacgc ctaacgagcc 1441 tggtgatagc tggttgtcca gaaaatgaat attagttcag ctttaaaaat accaaaaata 1501 ttaacaaatt ataatgtatt tttaaaagtt agtctaaaaa ggtacagcct tttagaaacg 1561 gatacaacct taattagaga gtaaaactta atacaaatca tagtaggctt aaaagcagcc 1621 accaattaag aaagcgttaa agctcaacaa taaaacaacc ttaattccaa caacaagtaa 1681 tcaactccta acccaatact ggactaatct attaaaaata gaagcaataa tgttaatatg 1741 agtaacaaga agtatttctc cctgcatgag tttaagtcag tatctgataa tattctgact 1801 attaacagca agataagaat aacctaccca taaataattt attaattata ctgttaatcc 1861 aacacaggaa tgcacttaag gaaagattaa aagaagtaaa aggaactcgg caaacactaa 1921 accccgcctg tttaccaaaa acatcacctc tagcattact agtattagag gcactgcctg 1981 cccagtgaca accgttaaac ggccgcggta tcctgaccgt gcaaaggtag cataatcact 2041 tgttctctaa ataaggactt gtatgaatgg ccacacgagg gttttactgt ctcttacttc 2101 caatcagtga aattgacctt cccgtgaaga ggcgggaata tattaataag acgagaagac 2161 cctatggagc tttaactact tagcccaaag aaacaaaatt tatttctaag gaaacaacaa 2221 cattctctat gggctaacag ctttggttgg ggtgacctcg gagaacaaaa aatcctccga 2281 gcgattttaa agactagacc cacaagtcaa atcacataat cgctcattga tccaaataat 2341 tgatcaacgg aacaagttac cctagggata acagcgcaat cctattcaag agtccatatc 2401 gacaataggg tttacgacct cgatgttgga tcaggacatc ccgatggtgc aaccgctatc 2461 aaaggttcgt ttgttcaacg attaaagtcc tacgtgatct gagttcagac cggagtaatc 2521 caggtcggtt tctatctatt atgtatttct cccagtacga aaggaccaga gaaataaggc 2581 caacttcaaa caagcgcctt aaattaatta atgatattat cttaattaac tatacaaaca 2641 gaccctaccc tagaaaaggg ttttgttaag gtggcagagc cg // LOCUS ODOMTFVLA 2677 bp ds-DNA ORG 07-AUG-1990 DEFINITION O.virginianus mitochondrial Phe-, Val-tRNA, 12S and 16S rRNA, genes, complete, and Leu-tRNA gene, 5' end. ACCESSION M35874 KEYWORDS 12S ribosomal RNA; 16S ribosomal RNA; transfer RNA-Leu; transfer RNA-Phe; transfer RNA-Val. SOURCE O.virginianus brain mitochondrial cell line JM109 DNA, clone POVIMR 4.1-19. ORGANISM Mitochondrion Odocoileus virginianus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Cervidae; Odocoileinae; Odocoileus virginianus. REFERENCE 1 (bases 1 to 2677) AUTHORS Miyamoto,M.M., Kraus,F. and Ryder,O.A. TITLE Phylogeny and evolution of antlered deer determined from mitochondrial DNA sequence JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.M.Miyamoto, 03-JUL-1990. FEATURES from to/span description tRNA 1 69 Phe-tRNA rRNA 70 1022 12S ribosomal RNA tRNA 1023 1091 Val-tRNA rRNA 1092 2658 16S ribosomal RNA tRNA 2659 > 2677 Leu-tRNA BASE COUNT 1000 a 550 c 467 g 660 t ORIGIN 1 gttaatgtag cttaaacaat aaagcaaggc actgaaaatg cctagatgag tataattact 61 ccataaacac agaggtttgg tcccagcctt cctattaatc cctagcagac ttacacatgc 121 aagtatcctc atcccagtga aaatgccctc caagtcaata aaactaagag gagctggtat 181 caagctcaca cccgtagctc atgacgcctt gctcagccac acccccacgg gagacagcag 241 tgataaaaat taagctataa acgaaagttt gactaagtca tgctaattag ggttggtaaa 301 tttcgtgcca gccaccgcgg tcatacgatt aacccaagtt aataggcaca cggcgtaaag 361 cgtgtttaag cactatgcca aatagagtta aactccaatt aagctgtaaa aagccataat 421 caagacgaaa ataaataacg aaagtaactt tacaaccgct gaaacacgat agctaagatc 481 caaactggga ttagataccc cactatgctt agccctaaac ataaatagtt atataaacaa 541 aactattcgc cagagtacta ccggcaatag cttaaaactc aaaggacttg gcggtgcttt 601 atacccttct agaggagcct gttctataat cgataaaccc cgatagacct taccacccct 661 tgctaataca gtctatatac cgccatcttc agcaaaccct aaaaaggaac aaaagtaagc 721 acaatcatta tacataaaaa cgttaggtca aggtgtaacc tatggagtgg aaagaaatgg 781 gctacatttt ctaatctaag aaaactcttt acgaaagtta ctatgaaatt agtaaccaaa 841 ggaggattta gcagtaaact aagaatagag tgcttagttg aattaggcca tgaagcacgc 901 acacaccgcc cgtcaccctc ttcaaatagg cacaatacac ttaaatttaa ttatacgtat 961 taatcatatg agaagagata agtcgtaaca aggtaagcat actggaaagt gtgcttggat 1021 aatcaagata tagcttaaac taaagcatct agtttacacc tagaagattt cacatattat 1081 gaatatcttg aactaaccct agcccacaaa tccatttata ctaaattatc aagatactac 1141 aaataaaaca tttatccact gttaaaagta taggagatag aaattttaga tatggcgcta 1201 tagagaaagt accgtaaggg aacgatgaaa gaaaaaatta aagtacaaaa aagcaaagat 1261 taccccttgt accttttgca taatgagtta actagtaaaa acttaacaaa atgaatttca 1321 gctaagtacc ccgaaaccag acgagctact tatgaacaat ttatcgagaa tcaactcatc 1381 tatgtggcaa aatagtgaga agattcataa gtagaggtga aacgcctaac gagcctggtg 1441 atagctggtt gtccaggaaa tgaatattag ttcagcttta aaaaatacca ataatataaa 1501 caaatatact gtatttttaa aagttagtct aaaagggaac agccttttag aaacggatac 1561 aaccttaact agagagtaaa acctgataat accatagtag gcccaaaagc agccaccaat 1621 taagaaagcg ttaaagctca acaataaaac tatcttaatc ccaataacaa ataatcaact 1681 cctagcctta atactggact aatctataaa aatagaagca ataatgttaa tatgagtaac 1741 aagaagtatc ttctccccgc acgagtttaa gtcagtacct gataatattc tgaccattaa 1801 cagcaaaata agaataacct aattataaat aacttactta ttatactgtt gatccaacac 1861 aggagtgcac ttaaggaaag attaaaagaa gtaaaaggaa ctcggcaaac accaaacccc 1921 gcctgtttac caaaaacatc acctccagca taactagtat tgggggcatt gcctgcccag 1981 tgacaaccgt taaacgccgc ggtatcctga ccgtgcaaag gtagcataat catttgttct 2041 ctaaataagg acttgtatga atggccaaac gagggtttta ctgtctctta cttccaatca 2101 gtgaaattga ccttcccgtg aagaggcggg aatactataa taagacgaga agaccctatg 2161 gagctttaac tacttagtcc aaagaaataa attcactaac caaggaaaca acaatattct 2221 tcatggacta acagctttgg ttggggtgac ctcggagaac agaaaatcct ccgagcgatt 2281 ttaaagacta gactcacaag tcaaatcaca caatcgctta ttgatccaaa aaattgatca 2341 acggaacaag ttaccctagg gataacagcg caatcctatt caagagtcca tatcgacaat 2401 agggtttacg acctcgatgt tggatcagga caccccgatg gtgcaaccgc tatcaaaggt 2461 tcgtttgttc aacgattaaa gtcctacgtg atctgagttc agaccggagt aatccaggtc 2521 ggtttctatc tattacgtat ttctctcagt acgaaaggac cagagaaata gggccaactt 2581 agaacaagcg ccttaaatta actaatgatt tactcttaat tagatacaca aacaaatctc 2641 gccctagaaa agggctttgt taaggtggca gagcccg // LOCUS SHPTGA 470 bp ss-mRNA MAM 07-AUG-1990 DEFINITION Ovine thyroglobulin mRNA, 3' end. ACCESSION M36148 KEYWORDS thyroglobulin. SOURCE Ovine, cDNA to mRNA. ORGANISM Ovis sp. Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 470) AUTHORS Durand,J., Malthiery,Y., Chabaud,O. and Lissitzky,S. TITLE L'extremite C-terminale de la thyroglobuline ovine presente de fortes homologies interspecifiques JOURNAL C R Soc Biol (Paris) 181, 258-266 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 385 thyroglobulin (AA at 2) mRNA < 1 470 thyroglobulin mRNA BASE COUNT 122 a 140 c 112 g 96 t ORIGIN 1 ggggcagttt actctggagg agaaaagcct ttccctgaaa atcatgcagt atttttccaa 61 cttcatccga tccggaaatc ccaactaccc tcatgagttc tccaggagag cgcctgaatt 121 cgcagcccct tggcctgact tcgtcccccg tgatggcgca gagagctaca aggagctcag 181 tgtcctgctc cccaaccgac agggcctgaa aaaagccgac tgctccttct ggtccaagta 241 catccagtct ctgaaggcct cagcagatga aactaaggac gggctgttag cagagagcga 301 agaggaggac cagctggctg gctctgggct gacagaggac ctcctgggtc tcccagagtt 361 agcctccaag acctacagca agtgaccagc cccatcccaa atgccacctt ggacacctta 421 ttctccaaca tagccgctat caataaagtg tctacatgtc agaaaaagtc // LOCUS TELBA 75 bp ss-RNA INV 07-AUG-1990 DEFINITION T.leucophrys 5.8S ribosomal RNA. ACCESSION M35555 KEYWORDS 5.8S ribosomal RNA. SOURCE T.leucophrys ribosomal RNA. ORGANISM Tetrahymena leucophrys Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Van Bell,C.T. TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina (Ciliophora: Hymenostomatida) JOURNAL J. Mol. Evol. 22, 231-236 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 75 5.8S ribosomal RNA BASE COUNT 22 a 13 c 20 g 20 t ORIGIN 1 atgcgaattg cagaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa 61 accttcatgt ttgta // LOCUS TEPBA 74 bp ss-RNA INV 07-AUG-1990 DEFINITION T.paravorax 5.8S ribosomal RNA. ACCESSION M35556 KEYWORDS 5.8S ribosomal RNA. SOURCE T.paravorax ribosomal RNA. ORGANISM Tetrahymena paravorax Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 74) AUTHORS Van Bell,C.T. TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina (Ciliophora: Hymenostomatida) JOURNAL J. Mol. Evol. 22, 231-236 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 74 5.8S ribosomal RNA BASE COUNT 21 a 14 c 18 g 21 t ORIGIN 1 atgcgaattg cagaatccgc gagtcatcag atctttgaac gcaagtggtg aaggctaaaa 61 ccttcatgtt tgtt // LOCUS TEUBA 75 bp ss-RNA INV 07-AUG-1990 DEFINITION T.patula 5.8S ribosomal RNA. ACCESSION M35553 KEYWORDS 5.8S ribosomal RNA. SOURCE T.patula ribosomal RNA. ORGANISM Tetrahymena patula Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Van Bell,C.T. TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina (Ciliophora: Hymenostomatida) JOURNAL J. Mol. Evol. 22, 231-236 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 75 5.8S ribosomal RNA BASE COUNT 22 a 13 c 19 g 21 t ORIGIN 1 atgcgaattg caaaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa 61 accttcatgt ttgtt // LOCUS TEXBA 75 bp ss-RNA INV 07-AUG-1990 DEFINITION T.vorax 5.8S ribosomal RNA. ACCESSION M35554 KEYWORDS 5.8S ribosomal RNA. SOURCE T.vorax ribosomal RNA. ORGANISM Tetrahymena vorax Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Van Bell,C.T. TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina (Ciliophora: Hymenostomatida) JOURNAL J. Mol. Evol. 22, 231-236 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 75 5.8S ribosomal RNA BASE COUNT 22 a 13 c 20 g 20 t ORIGIN 1 atgcgaattg cagaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa 61 accttcatgt ttgta // LOCUS TEYBA 75 bp ss-RNA INV 07-AUG-1990 DEFINITION T.pyriformis 5.8S ribosomal RNA. ACCESSION M35552 KEYWORDS 5.8S ribosomal RNA. SOURCE T.pyriformis ribosomal RNA. ORGANISM Tetrahymena pyriformis Eukaryota; Animalia; Metazoa; Ciliophora; Oligohymenophora; Hymenostomata; Hymenostomatida; Tetrahymenina; Tetrahymenidae. REFERENCE 1 (bases 1 to 75) AUTHORS Van Bell,C.T. TITLE 5S and 5.8S ribosomal RNA evolution in the suborder Tetrahymenina (Ciliophora: Hymenostomatida) JOURNAL J. Mol. Evol. 22, 231-236 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 75 5.8S ribosomal RNA BASE COUNT 22 a 13 c 20 g 20 t ORIGIN 1 atgcgaattg cagaaccgcg agtcatcaga tctttgaacg caagtggtgg aggtgtaaaa 61 accttcatgt ttgta // LOCUS WHTRDNA 140 bp ds-DNA PLN 07-AUG-1990 DEFINITION Wheat ribosomal RNA spacer DNA. ACCESSION M36062 KEYWORDS . SOURCE Wheat seedling DNA, clone pTA250. ORGANISM Triticum sp. Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 140) AUTHORS Appels,R. and Dvorak,J. TITLE The wheat ribosomal DNA spacer region: Its structure and variation in populations and among species JOURNAL Theor. Appl. Genet. 63, 337-348 (1982) STANDARD simple staff_review BASE COUNT 15 a 49 c 42 g 34 t ORIGIN 1 ggatccgcgt caacttccgt ccacgacggc cgccgggcct tttttcccgt gcccacgtac 61 agcccgttca cgggtccgtg taagggtccg tgtacgcggc gtctgccacg tacgtggttt 121 tgcccggttt tccatggcgc // LOCUS HUMINT01 42 bp ss-mRNA PRI 07-AUG-1990 DEFINITION Human leukocyte adhesion glycoprotein p150,95 mRNA, exon 1. ACCESSION M29165 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 1 of 7 SOURCE Human cell line HL-60, cDNA to mRNA, clone lambda-X47. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 42) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE cDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD simple automatic FEATURES from to/span description mRNA < 1 > 42 P150,95 mRNA, exon 1 BASE COUNT 9 a 15 c 10 g 8 t ORIGIN 1 bp upstream of EcoRI site; chromosome 16p11-13.1. 1 gaattcctgc cactcttcct gcaacggccc aggagctcag ag // LOCUS HUMINT02 3690 bp ds-DNA PRI 07-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 2 - 6. ACCESSION M29482 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 2 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3690, exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3690) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept 1028 1064 integrin alpha subunit precursor, exon 2 (first expressed exon) 1666 1771 integrin alpha subunit precursor, exon 3 2391 2494 integrin alpha subunit precursor, exon 4 2795 2865 integrin alpha subunit precursor, exon 5 3020 + 3131 integrin alpha subunit precursor, exon 6 sigp 1028 1064 integrin alpha subunit signal peptide 1666 1685 integrin alpha subunit signal peptide matp 1686 1771 integrin alpha subunit 2391 2494 integrin alpha subunit 2795 2865 integrin alpha subunit 3020 + 3131 integrin alpha subunit pre-msg < 1 > 3690 P150,95 mRNA and introns IVS < 1 975 P150,95 intron A IVS 1065 1665 P150,95 intron B IVS 1772 2390 P150,95 intron C IVS 2495 2794 P150,95 intron D IVS 2866 3019 P150,95 intron E IVS 3132 > 3690 P150,95 intron F BASE COUNT 764 a 1050 c 1020 g 856 t ORIGIN 1 bp upstream of BamHI site; chromosome 16p11-13.1. 1 ggatcccttg ggcccaggag ttcgaagcag cagtgaacta tgcacccact gcactccagc 61 ctgggtggca gagcaagacc ctgtttctga aattaaaaaa aaaaattgat gtacattagg 121 gggcttccac ggcctgagct gcttcccctt gctttcctcc cagtggccct gaccttgtct 181 cttacaactt cccaccctga ctgtctggtt acccattgct gatttcacac acagaccctc 241 ctgtaccctg cctcatccat gtctggctgc tctgtcatct cccaactttg gttgctttca 301 atgctcagct caagcaccac ctctttcagg aagccttctc agaaagccac accttcacaa 361 cccgggtgag gcaccctgtg gtctctgtgc ttccccctca cagcaatgaa cttgctgttt 421 atacatctgc ctctccactg accccagggc tggtgctttg tggtttatat tttcttcccc 481 acctagcaga gggcttgcat ctccaggctc aaattaggct tcttgaataa atgatgaata 541 aatgagtgaa tgaatgaatg aacaaatact cgctctgtgc tcctcctagg gacccggatc 601 ccccactcct tggcccagac tttccaggtc agagtggagg cctcccacca gggtttcctt 661 taggggtcct gaggggtggg catctgccca aaccccctcc agtctggctg aaatttcaag 721 gtcaaggggt ccttctggca gtcaagggtg agcctgggag gggcagggca gggatttgca 781 tccatctaag caaagggcat caagccaagt catctgatga gagtgactcc ggttgggggg 841 tgggggcgtg tgggagccga gcctgtcctc ggatcagttg cgtactctgc ccgccccctc 901 tgactcatgc tgacaatctt cttccttccc ctggccacct ctctgcccac ttgcttcctc 961 agtaccttgg tccagctctt cctgcaacgg cccaggagct cagagctcca catctgacct 1021 tctagtcatg accaggacca gggcagcact cctcctgttc acaggtgagc ctggacccca 1081 atgaagtagg gctggggacc caggcccaag ggagccaggg ccctgaactg ggggctcagg 1141 ctggggggtt aggatctggg taggaagaga gactcagtca agcctgaggg ggaggcaggc 1201 acatagggtt tgagatttgg agtttgtgga gggagaggat attgatgaac caattttggg 1261 agagttccag agatgctgga agagaggcca gttgtctctg tactgcagag atttttaaaa 1321 taggcagaat gcgccaactt gtgctctgtg gacaggatgc tttggtccgc aagttttcct 1381 ggacgcactc tcatagcgcc cgaggtgcac gttggggaaa gatccttttt agagcctggg 1441 tactgctctg cagaaatgga gaactgcaac tcgatagtgg atggtgggca aggggcatcc 1501 ctggaccctg ggaaggagag aaggggatga gttgggtgtc cagaagaccc aggcaccccg 1561 ggcatcaggc tcggagggga gattgggacg ctggggccgg gggtggaggg cagccaggca 1621 gaaggaagac ccttctccaa agctctcttc ccacctcttt cccagcctta gcaacttctc 1681 taggtttcaa cttggacaca gaggagctga cagccttccg tgtggacagc gctgggtttg 1741 gagacagcgt ggtccagtat gccaactcct ggtgaggccc aggtggtgct cctttggctc 1801 catccatcct ctccctgctc aggccccatc cccccggccc tgccctgtta tttgcaaact 1861 ctcctctctg tctggtgtag cgactgccct ggctaatgaa gatttgcctt gaaggcaggc 1921 acggtctcac agctaacatt tacagagcag taagtgcagt gccaggctca tcacaggtgg 1981 atgctgattt agtccacacg acagcctgtg agtaggaatc agtcgtgcaa caaacactta 2041 tttgtttttt ctttcttttt ttctatacat ttaaaaatat atagagacag ggtctcacta 2101 tgttgcctgg gttggtctca aactcctggg ctcaagcaat cctcccgcct cagcctccca 2161 aagtgctggg attccaggtg tgagccacca cacccagact caacaaatat ttcttgtctc 2221 catacgccag agaatccaac agacagaaat cccttccaca tggactttaa attattaaaa 2281 tccatcttgc agatgaggaa gctgaggctc agggagggaa cgcaaacttg ccggagtggc 2341 agctgtcggc gtccacactc ttacctaaag tgttctttgt ctcctcgcag ggtggtggtt 2401 ggagcccccc aaaagataac agctgccaac caaacgggtg gcctctacca gtgtggctac 2461 agcactggtg cctgtgagcc catcggcctg cagggtgagt caccgcccct cccgggaccc 2521 agggccgggc tcccaggctt ccctgctcca ggggcccgtg gactcccgga gtgtcacttt 2581 cagcttccct gtgtctgaga ccctcaccct cagatatgct tcctggcccc ttaaggcctc 2641 cccgcccatc gcactcccgc agctctgtca agacccgaca gcttccttca ccgtcagacc 2701 tccttgtctc ccaggtggag gtgacccctg cccagctctt ccacagcctt ctctgtaggg 2761 cccgagagtg accatgcaca tatctgtccc acagtgcccc cggaggccgt gaacatgtcc 2821 ctgggcctgt ccctggcgtc taccaccagc ccttcccagc tgctggtgag tggccctggg 2881 tcacaggagg cttctgaggg agggagggag gagccggggc cgccgggggc tgggactctc 2941 ctgtagggtg gaggttccgg catctgaggg tgggaggtac atgccaggga gtgcccccag 3001 cagcccgctg tgtccccagg cctgcggccc caccgtgcac cacgagtgcg ggaggaacat 3061 gtacctcacc ggactctgct tcctcctggg ccccacccag ctcacccaga ggctcccggt 3121 gtccaggcag ggtgagtgtc gggaccacca aggctttgag gagctcacgc acatccaatt 3181 gggggtgcgg tgggctagag acagtcttgc cagagtggat cagaaagaag ggatctggaa 3241 aaagagttac ctcgtgttgc agtggttcct gacgctgctg cccgcacatc ctgccgatcg 3301 ccgcacgctg ccggaccttt cctgtgacct taacctctcc aagcctcagt ttcttcatct 3361 gttggatggg gataataaca cacccagcac tgaaagcaac acaggatgat tcatggccag 3421 gggttagcac agcagctagc accaggcgac acccatgccg gccagctgtt gttattttta 3481 gaggagagga ctattttcat ccaatgggtc ctgggatatg accaattggt ttgtgccgta 3541 gtttaggaaa ggtcagtgaa agtgcagtgt gagcaacgtg tgtgtgtaca tgtgtgtata 3601 tgtatgcatg tgtatacatg tgcacatgca catgtacatg catgtgtgtg catgtatgtg 3661 tgtgtgtgca tgtgcatgca ggttgagacg // LOCUS HUMINT03 4863 bp ds-DNA PRI 07-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 7 - 15. ACCESSION M29483 Y00093 KEYWORDS integrin; protein p150,95. SEGMENT 3 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 294 to 3967; exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 4863) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 294 424 integrin alpha subunit precursor, exon 7 511 656 integrin alpha subunit precursor, exon 8 901 1054 integrin alpha subunit precursor, exon 9 1659 1809 integrin alpha subunit precursor, exon 10 2428 2501 integrin alpha subunit precursor, exon 11 2665 2794 integrin alpha subunit precursor, exon 12 3207 3349 integrin alpha subunit precursor, exon 13 3531 3671 integrin alpha subunit precursor, exon 14 3760 + 3969 integrin alpha subunit precursor, exon 15 matp + 294 424 integrin alpha subunit 511 656 integrin alpha subunit 901 1054 integrin alpha subunit 1659 1809 integrin alpha subunit 2428 2501 integrin alpha subunit 2665 2794 integrin alpha subunit 3207 3349 integrin alpha subunit 3531 3671 integrin alpha subunit 3760 + 3969 integrin alpha subunit pre-msg < 1 > 4861 P150,95 mRNA and introns IVS < 1 293 P150,95 intron F IVS 425 510 P150,95 intron G IVS 657 900 P150,95 intron H IVS 1055 1658 P150,95 intron I IVS 1810 2427 P150,95 intron J IVS 2502 2664 P150,95 intron K IVS 2795 3206 P150,95 intron L IVS 3350 3530 P150,95 intron M IVS 3672 3759 P150,95 intron N IVS 3970 > 4863 P150,95 intron O BASE COUNT 947 a 1358 c 1460 g 1097 t 1 others ORIGIN Chromosome 16p11-13.1. 1 acctgtgatc gccccctcgc ctcccaaagt actgggatta cacggtgagc caccacgcct 61 ggctcaatca cagcctcttt aggcaacttt aagagaatga agggccttgt tccaggcaag 121 gggttaggga acgtctgccc ctgatgagga gaggacccag ggtgtggagc ctgactccca 181 tcgccagact aggggcttag ggaggaaggg ttttggagag tgagctcttg caggagccac 241 ggtcctggac tccaggagtg tcacttggag gacggtgcca cctccttccc cagagtgccc 301 aagacaggag caggacattg tgttcctgat cgatggctca ggcagcatct cctcccgcaa 361 ctttgccacg atgatgaact tcgtgagagc tgtgataagc cagttccaga gacccagcac 421 ccaggtgtgc tttgggggag ggaggctgct gggggtgggt gcttggatcc tggtgatagg 481 cctcagccca gccctgtgtg cttctcccag ttttccctga tgcagttctc caacaaattc 541 caaacacact tgactttcga ggaattcagg cgcacgtcaa accccctcag cctgttggct 601 tctgttcacc agctgcaagg gtttacatac acggccaccg ccatccaaaa tgtcgtgtga 661 gtcctgattt cttccaggca cagtcccaaa gcacccaggt cttcccttgg cctcatctga 721 tctccacgag aaggggacag gcagggacca aaatccagcc cgtgataccc ttgccaagct 781 ggggcctctg ggtgggactg gggcctccca aaggaaaagg catcttctaa ttttcacaag 841 ggcaccaggg gctagtgtgg tttggttcac aggcctctaa gacctctcct ttcctgatag 901 gcaccgattg ttccatgcct catatggggc ccgtagggat gccaccaaaa ttctcattgt 961 catcactgat gggaagaaag aaggcgacac gctggattat aaggatgtca tccccatggc 1021 tgatgcagca ggcatcatcc gctatgcaat tggggtaggc ctgggatggc ttcccacttc 1081 tcccacggct tcctctcagg gcaactcccc tttctgtgta tgatgttctt ttctctttga 1141 gacagggtct tgctctatca cccaggaagt ggtgcaatcc tagctcactg cagccttgaa 1201 ctcctgggct ccagtgatcc tcccaccccg cctcccagta gtcgggacca caggtgtgtg 1261 ccatcaagcc tggctatttt ctttttggtt gagatggggt cttgctatgt tgcccaggct 1321 ggtctcaaat tcctggcctt aagcaattct gccaccttgg tctcccaaag gcacagggga 1381 ttacaggcgt gaaccaccgc caacaacatc cctttcaagg atagaaacac cagctctctc 1441 ggctcttact gccttaagga tgaaaactct gccccagact ggagaccatg atgatccttt 1501 ctcctaaact ccctgatgct gtccgggctt cgtgtttctc ctgtgtccac cgggtgtgat 1561 catgttgatc ttgtggggtt attggaagat gttgcaccca gtgcacacag gcacatttga 1621 tttattattt ttactgagtt gatcttttct ggggacaggt tggattagct tttcaaaaca 1681 gaaattcttg gaaagaatta aatgacattg catcgaagcc ctcccaggaa cacatattta 1741 aagtggagga ctttgatgct ctgaaagata ttcaaaccca actgagggag aagatctttc 1801 ccattgaggg tgagtctgaa gggagctctt cgcttgggga atcctcagcc gttaacacct 1861 ttccacttag aacccgaggc tccgtgaaac aggtagacag cgtctcggtt ctcctgcttt 1921 cccgggaccc cgatagccat gtctgtcagc ttgtccccac tgacgtcccc cagcactgtc 1981 agagctgccc caaagtggcc ccagggatgg ccctgctccc cacagagagt gatctcacac 2041 caccaccggc tccactgcag aacaaaagca gtccaggccc aacccaggag acccttccac 2101 ccacaccggg ccctacccag cccacatccc accagccact cactcccctg ggcaaggggc 2161 acacggacac ctggccccct cggtctgctt gtagacctgt ggggggccct gatgaggacc 2221 agatcggtgc tgccatcgct gtccacatcc atggagcaga ggggggcccc gaagtcggag 2281 ctgatctgga ggcagagcct ggtccctgtc acaggcacca gctctccctg tagcctccag 2341 tcttagcttc tcctaaagct gaagtgttct tggacctggc aaagcccgtc tccctccctg 2401 gcactcaagc gtcatgcctt accccaggta cggagaccac aagcagtagc tccttcgaat 2461 tggagatggc acaggagggc ttcagcgctg tgttcacacc tgtgcgtggg gccccttagg 2521 ccgatgatgt gccgtgaggg gagggggggc agggaaggcc agggtgggtg tcaggtgggt 2581 aagaggcgca aggcggaagg catatctctg gtcatgctgt cttcctgctc tcggctctgc 2641 tcagccctgg aatcctttct ccaggatggc cccgttctgg gggctgtggg gagcttcacc 2701 tggtctggag gtgccttcct gtacccccca aatatgagcc ctaccttcat caacatgtct 2761 caggagaatg tggacatgag ggactcttac ctgggtgaga aacagccagg ggttggggac 2821 aggtgggaga tgcactgccc agggtggggt ccagggttct ggggaagggg taggggnatg 2881 ggggctgtgc tgcccagtgt ggggcccagc ttctggggag ggaggatggg cactgtgctg 2941 cccggggtgg gttccagggt tctggggagg gggaatgggg gctgtgctgc ctggggtggg 3001 aatccagggt tctggggaga ggggatgggc gctgtgctgc ctggggtggg ttccagggtt 3061 ctggggagag aggatggggg ctgcattgcc cagggtgggg tccagggttc tggggagggg 3121 agatggtgct gtgctgcccg gggtgggaat ccagggttct ggggaggggg aatgggggcc 3181 tttgtgctga ggcctgggcc cctcaggtta ctccaccgag ctggccctct ggaaaggggt 3241 gcagagcctg gtcctggggg ccccccgcta ccagcacacc gggaaggctg tcatcttcac 3301 ccaggtgtcc aggcaatgga ggatgaaggc cgaagtcacg gggactcagg ttgggcgtga 3361 caggagccac aggccgggaa ttcagggtag gggaggtggc tgggcagaga agaggatgga 3421 ggggctttga gggccttggg ggaggtcctg gtacctgggg agaggtggga cctggcccac 3481 agggctgcct ctggcaggga caggcagcat gacccagctc tgcccttcag atcggctcct 3541 acttcgggcc ctccctctgc tccgtggacg tagacagcga cggcagcacc gacctggtcc 3601 tcatcgggcc cccccattac tacgagcaga cccgaggggc ccaggtgtct gtgtgtccct 3661 tgcccagggg ggtgagtggc tgatgggcct ggtgtgtgtg gggtctggtg tgggtgaggg 3721 gttgcccggg ttgggcctgg cactgttttt tttctgcagt ggagaaggtg gtggtgtgat 3781 gctgttctct acggggagca gggccacccc tggggtcgct ttggggcggc tctgacagtg 3841 ctgggggatg tgaatgggga caagctgaca gacgtggtca tcggggcccc aggagaggag 3901 gagaaccggg gtgctgtcta cctgtttcac ggagtcttgg gacccagcat cagcccctcc 3961 cacagccagg tgaggccgtg tcccatttct gtcactagag cagcctgctt cttgcctctc 4021 ccactctgtc atactggaaa actgtccctt tttacctttt cctacctccc ttgcccagct 4081 ctgagcacct tgtagcagtg gcgtggtctc agctcactgc aacctccgcc tcccaggttc 4141 aagcgattct ctctgcctca gcctccagag tagctgggat tacaggcatg caccaccatg 4201 tccggttatt ttttgtattt tagtagagac acgtttcgcc atgttggcta ggctggtctt 4261 gaactcctga cctcaggtga tctgcctgtc tcggcctccc aaagtgctgg gattataggc 4321 gtgagccgcc atgcccaggc ccctgccagt tttacaaggt acacaggtca ggcacagaaa 4381 acccatttta cagatggaat ctgggacact aggaagacaa gggccttggt ttgttggagg 4441 ttcagagtgg gtccgagatg gtgaaggaac tccggcctcc tgacctctaa cccggtgtgc 4501 agtctcccgg ctccctgctg ctcaccactt aggtccagtc atttcaacct ccctccacct 4561 gcccctctcc tccctggatg ctacatgatt ttattccctt cctgccatca aggtcccacc 4621 aaatgcccat ccctgcagcc tccctccacc ccaagggtag cagggttccc tgagaacgaa 4681 gggctgcctt tcttggcaaa agtcaagaaa gctctgttaa aaaataggca aagggcctgc 4741 tccctggtgg ctcacatctg taattccgac actttgggag gctgaggcag gaggatcact 4801 tgaggccagg agtttcaggc cagctgggca acataggggg accccatctc tagaaaaaat 4861 ttt // LOCUS HUMINT04 2746 bp ds-DNA PRI 07-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 16 - 21. ACCESSION M29484 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 4 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 382 to 2672; exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2746) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 382 512 integrin alpha subunit precursor, exon 16 628 791 integrin alpha subunit precursor, exon 17 924 1078 integrin alpha subunit precursor, exon 18 1656 1787 integrin alpha subunit precursor, exon 19 1875 1948 integrin alpha subunit precursor, exon 20 2531 + 2672 integrin alpha subunit matp + 382 512 integrin alpha subunit 628 791 integrin alpha subunit 924 1078 integrin alpha subunit 1656 1787 integrin alpha subunit 1875 1948 integrin alpha subunit 2531 + 2672 integrin alpha subunit pre-msg < 1 > 2746 P150,95 mRNA and introns IVS < 1 381 P150,95 intron O IVS 513 627 P150,95 intron P (no splice consensus) IVS 792 923 P150,95 intron Q IVS 1079 1655 P150,95 intron R IVS 1788 1874 P150,95 intron S IVS 1949 2530 P150,95 intron T IVS 2673 > 2746 P150,95 intron U BASE COUNT 602 a 821 c 746 g 577 t ORIGIN Chromosome 16p11-13.1. 1 gaattcctat cctgagcatg gctaaactct gagctaatag tatcattata gaaagatgag 61 gaaacggagg cacagacaga ttgagtcctt gcccacggcc tcgtggctca tacgtggagg 121 agtcagaatt ggaactagag actgatcgaa tgaatgacac tcgggtcacc aggacacctt 181 cctatctcca ctcttacatc tgtttcttag caatcatctc ccaactccta cctcctcttt 241 tcaggttctt cttggtgaca tctgttacaa ctcacccctt ctctcccttt ccgatggtcc 301 tacctccata ttccccttgt tacttatttc caacttcttc cctagtttcc atcttgattc 361 acccttctct cctctggcca gcggatcgcg ggctcccagc tctcctccag gctgcagtat 421 tttgggcagg cactgagcgg gggtcaagac ctcacccagg atggactggt ggacctggct 481 gtgggggccc ggggccaggt gctcctgctc aggtgagagc agactttctc agaggctccc 541 catgtggtcc taggttcaga tgggggtgcc cacccacgtg gtgctcccac cagcgacggc 601 tgtcctcagc tcggtgctct gcccgcagac cagacctgtg ctctgggtgg gggtgagcat 661 gcagttcata cctgccgaga tccccaggtc tgcgtttgag tgtcgggagc aggtggtctc 721 tgagcagacc ctggtacagt ccaacatctg cctttacatt gacaaacgtt ctaagaacct 781 gcttgggagc cgtgagtccc ctcccctcca acccaggaca ccctgacctc tggagtcccc 841 catcccaggc ccctgtctcc caccctgctc attgtccacc caaggagttc ctgtctcaac 901 gccgtccctg cgaccgccta caggtgacct ccaaagctct gtgaccttgg acctggccct 961 cgaccctggc cgcctgagtc cccgtgccac cttccaggaa acaaagaacc ggagtctgag 1021 ccgagtccga gtcctcgggc tgaaggcaca ctgtgaaaac ttcaacctgc tgctcccggt 1081 gcgtctgggc atgaacgtgg gtggcggccg cgctggggct ggcagaaggc agggcaggga 1141 gagaacaggc tgtgttccgg cctccctgtg gctcagccca gcacaggacc agccatgcag 1201 gacgtgctta ctgcacgtta gccagtgagt gagtgagcga gcaaacaagt gatgagatcg 1261 tctgcaattt ccagggccac acgattggat ttcaggaaag agaattgggc aacctgagag 1321 agctctgggc ttaccttctg gcttttcagg cattcactga cagggttatc gagctgctcc 1381 tggagacagc cttgcctggg ccatgggcat aggtggccaa aacagtcatt gctgatcggg 1441 aggtctgggg gggggaggaa aaaaacaaag acaaacaagg ggagaggaca gagagggtgt 1501 cagggaggca tcctgaaggc ggtgacgctg agcaggctct ggaggaagtg aagcagagcg 1561 ggagctgggc agaggcagga taagaactgc ggatgaggcc gagcgcagct cttaccctcc 1621 ccttaccctc gctccccgcg acgcccgtcc cccagagctg cgtggaggac tctgtgaccc 1681 ccattacctt gcgtctgaac ttcacgctgg tgggcaagcc cctccttgcc ttcagaaacc 1741 tgcggcctat gctggccgcc gatgctcaga gatacttcac ggcctccgtg agtcctggca 1801 ctgggtctcc cagagagggt gcacagcgtg gggcctgggt ctcggagaaa accccccgtt 1861 gccttcccac gcagctaccc tttgagaaga actgtggagc cgaccatatc tgccaggaca 1921 atctcggcat ctccttcagc ttcccagggt gagcgcccca ccttagatgc cctactgccc 1981 cagcctcctt cctggaatct gggactcctg cctctgctct ccctaacatt gtctcatcct 2041 atagtcaaaa cccaggtgtc ttggctgggc acagtggctc actcctgtaa tccagcactt 2101 tgggaggccg aggtgggagg acttttgagg ccaggagtta gggttacgac ctgggcaaca 2161 gagcgacacc catttccaca aaaacaaaac aacaacaaca acaacaacaa caacaacaac 2221 aacaacatca cttgagtgtg gtagagcatg cctatagtcc cagctacttg ggaggctgaa 2281 gcttaaggct tgcttgagct ctggagttgg aggtctgcag tgagccataa tcacaccact 2341 gcactccagc ctgggtgaaa gagcaggact ctgtctctta aaaaaaaaga agaagaagaa 2401 gaagaagaag aagaacccag gggtccgtcc cctgtctatc tcccaaatcc ccacccaccc 2461 cattttatcc cagaccattt ctagcctcag tcacagaatc atcttatcct ttccttcacc 2521 tgatacccag cttgaagtcc ctgctggtgg ggagtaacct ggagctgaac gcagaagtga 2581 tggtgtggaa tgacggggaa gactcctacg gaaccaccat caccttctcc caccccgcag 2641 gactgtccta ccgctacgtg gcagagggcc aggtgcacct ctggggaagg aggaggaggc 2701 agggctgggc gttagcgtag attcccgtgc gggttcagaa cccggg // LOCUS HUMINT05 1006 bp ds-DNA PRI 07-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 22 - 24. ACCESSION M29485 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 5 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 202 to 665) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1006) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 203 319 integrin alpha subunit precursor, exon 22 404 483 integrin alpha subunit precursor, exon 23 582 + 665 integrin alpha subunit precursor, exon 24 matp + 203 319 integrin alpha subunit 404 483 integrin alpha subunit 582 + 665 integrin alpha subunit pre-msg < 1 > 1006 P150,95 mRNA and introns IVS < 1 202 P150,95 intron U IVS 320 403 P150,95 intron V IVS 484 581 P150,95 intron W IVS 666 > 1006 P150,95 intron X BASE COUNT 228 a 286 c 229 g 263 t ORIGIN Chromosome 16p11-13.1. 1 ttctatcctg gtgacagagt gagacctggt ctcaaaacaa acaaacaaac aaaatataag 61 cttaaggtgg gctccaggaa gctttatcac tacttcgtgg cgtgtctttg gaatgctgtt 121 atattaggtt ggtgcaaaag taattgggtt tttgccattg ctttcaattt caactaatac 181 tcctctactt tctcatgcct agaaacaagg gcagctgcgt tccctgcacc tgacatgtga 241 cagcgcccca gttgggagcc agggcacctg gagcaccagc tgcagaatca accacctcat 301 cttccgtggc ggcgcccagg tcagcctggc ttctgtcccc tcactgctcc cctgccccac 361 cctgtcttta ctgctctgtg acctctcagt tccttttcct cagatcacct tcttggctac 421 ctttgacgtc tcccccaagg ctgtcctggg agaccggctg cttctgacag ccaatgtgag 481 caggtgagcc gggccatggc caggggcagt gcctcatctc cagcctcaca ccccattctc 541 ctctggggcc tctggcaact gagtctctcc tctttctcca gtgagaacaa cactcccagg 601 accagcaaga ccaccttcca gctggagctc ccggtgaagt atgctgtcta cactgtggtt 661 agcaggtcac aggtacccac tgcaggaaaa agggttcttc tctctgaccc tcaaaaagaa 721 aaaaaaaaaa aaggccttga aacgctgcca cagagggtga gataaggtgt ttgaaagtaa 781 aaggtcaggt gtttcagaag acaccttcct tcagccaatg ccttcctcga atttgctgtg 841 tgccaggcag ggtgctgtgg ttattttcca tacattcatt tgacattcat tgaagattta 901 ctgagccccc attatgtgtg atcaaaccag acatgaaccc tcgccttgtg ggtgtgcctt 961 gctggatgtc tcctgtgttc cactctcact gcactgcatg ctgagt // LOCUS HUMINT06 1904 bp ds-DNA PRI 07-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 25 - 30. ACCESSION M29486 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 6 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 86 to 1528; exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1904) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 86 164 integrin alpha subunit precursor, exon 25 280 387 integrin alpha subunit precursor, exon 26 505 588 integrin alpha subunit precursor, exon 27 788 901 integrin alpha subunit precursor, exon 28 1044 1145 integrin alpha subunit precursor, exon 29 1417 + 1527 integrin alpha subunit precursor, exon 30 matp + 86 164 integrin alpha subunit 280 387 integrin alpha subunit 505 588 integrin alpha subunit 788 901 integrin alpha subunit 1044 1145 integrin alpha subunit 1417 + 1527 integrin alpha subunit pre-msg < 1 > 1904 P150,95 mRNA and introns IVS < 1 85 P150,95 intron X IVS 165 279 P150,95 intron Y IVS 388 504 P150,95 intron Z IVS 589 787 P150,95 intron AA IVS 902 1043 P150,95 intron AB IVS 1146 1416 P150,95 intron AC IVS 1528 > 1903 P150,95 intron AD BASE COUNT 381 a 545 c 525 g 453 t ORIGIN Chromosome 16p11-13.1. 1 accacctgtc ctctcatgct ctagccaatg ccttctgcag atgcccatgg tagttcacat 61 ccacttatgc gtcttctctc tccagccacg aacaattcac caaatacctc aacttctcag 121 agtctgagga gaaggaaagc catgtggcca tgcacagata ccaggtcagg tggtggtgta 181 cgcaggaaga ccttgggcat ggggtgggag gctgggtagc cggagactgg ggagggattt 241 ggctttggcg tggctctgcc ctcagtgccc tctgtgcagg tcaataacct gggacagagg 301 gacctgcctg tcagcatcaa cttctgggtg cctgtggagc tgaaccagga ggctgtgtgg 361 atggatgtgg aggtctccct cccccaggta cccaaggact gcatgtggct cctccacgaa 421 tgccctttct acctggattc cttgtgcccc atgtgggtcc ctgatgtccc agctgagaca 481 cttgttctct gcattttccc ccagaaccca tcccttcggt gctcctcaga gaaaatcgcg 541 ggcccagcat ctgacttcct ggcgcacatt cagaagaatc ccgtgctggt gaggagggct 601 ctgggtctgg ccctcactgt aggcccacat cagaggaatt taacccagga gttcatgttc 661 catatccatc ctgctgaagt accctcttgc attcggatat ggccgctgcc ctcaagtcac 721 acgcataatg ctgcctccca ccttcacact catctttctc agccccatgc tatttatctg 781 cccccaggac tgctccattg ctggctgcct gcggttccgc tgtgacgtcc cctccttcag 841 cgtccaggag gagctggatt tcaccctgaa gggcaacctc agctttggct gggtccgcca 901 ggtgtgtggg tgcaacgaca gagcccctgc cccagactca ggcgggacct ggcatgtctg 961 tgcccatctg caagccaggg cacccccaga gctctgagcc tcccccagag ccagttcaac 1021 aggtttcccc cacccctttg cagatattgc agaagaaggt gtcggtcgtg agtgtggctg 1081 aaattacgtt cgacacatcc gtgtactccc agcttccagg acaggaggca tttatgagag 1141 ctcaggtaga gaccatgtgg agggcagcga ccaggcagga aagagggtcc caagggctac 1201 atctgtggtg ctgggtgggg ggtttgcaag ccttggggga ggagggtgaa ggcctctggg 1261 caggatagct gtccctaagg gcacgggtgc tgctgtgtct cacctcttgg agcagggcct 1321 ggggaaggag gggagggagt taaaggttgg ggagcctggg aggagtctgg gatagtagga 1381 ggatgggagt ctctgacagg gtcacttcca cttcagacga caacggtgct ggagaagtac 1441 aaggtccaca accccacccc cctgatcgta ggcagctcca ttgggggtct gttgctgctg 1501 gcactcatca cagcggtact gtacaaagtg agtgttttat gccacccttg acaccaccag 1561 catctggtcc cgctcttttt gcagagtgag aaggagctca ctttgaaggc agaggcacat 1621 tcttactggg tcacttcata tgagaaactg cttcccacct gcaatgtcac cgtgccccag 1681 tggccccctg ctttgtgatt cccaggcttc ctctaatatt tctccctttc tttcctgctc 1741 ttctccatca ttctacgtgt tcctgacagc agattatcat ataaaagcac agacctgggt 1801 tgaatgcgac atcaccacgg gttcttttgt cttgaccata ggccagtgtc tgctccactc 1861 tgggccttga tttccatgtg aggtgatatc acccagctca taga // LOCUS HUMINT07 149 bp ds-DNA PRI 07-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exon 31. ACCESSION M29487 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 7 of 7 SOURCE Human DNA, (library pWE15), clone 30.1, and cell line HL-60, cDNA to mRNA, clone lambda-X47. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 44 to 149) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD simple automatic REFERENCE 2 (bases 1 to 149; revises [1]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL Unpublished (1989) Center for Blood Res., Boston MA 02115 STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 44 148 integrin alpha subunit precursor, exon 31 matp + 44 145 integrin alpha subunit pre-msg < 1 > 149 P150,95 mRNA and introns IVS < 1 85 P150,95 intron AC BASE COUNT 40 a 41 c 37 g 31 t ORIGIN Chromosome 16p11-13.1. 1 actgaatggg cttcctgagt ttcttcttcg tcctcccccc taggttggct tcttcaagcg 61 tcagtacaag gaaatgatgg aggaggcaaa tggacaaatt gccccagaaa acgggacaca 121 gacccccagc ccgcccagtg agaaatgat // LOCUS ECOKDPDE 3600 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli transmembrane protein (kdpD) and cytoplasmic protein (kdpE) genes, complete cds. ACCESSION M36066 KEYWORDS cytoplasmic protein; kdpD gene; kdpE gene; transmembrane protein. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3600) AUTHORS Walderhaug,M.O., Daniel,J.M., Hesse,J. and Epstein,W. TITLE KdpD and KdpE, regulatory proteins for the kdp operon of Escherichia coli, are members of the two component sensor-effector class JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.O.Walderhaug, 03-JUL-1990. Author address: M.O.Walderhaug University of Illinois at Chicago Department of Microbiology and Immunology Box 6998, Chicago, IL 60680 FEATURES from to/span description pept 141 2825 transmembrane protein (kdpD) pept 2822 3499 cytoplasmic protein (kdpE) mRNA 68 > 3499 kpd operon signal 30 36 -35 region signal 55 60 -35 signal binding 130 133 ribosome binding site BASE COUNT 789 a 968 c 1051 g 792 t ORIGIN 16 min on K12 map. 1 acgcgtggcg aaagcgcgta atctcagcgt tgaacagctc acgcaactga tcgcaaaata 61 cagccaacaa ccgctggtga aatatatcgg ccagccggtt gtcaacattg ttgaactcaa 121 tctggcgctg gataaacttg atgaataacg aacccttacg tcccgacccc gatcgtctgc 181 tggaacaaac tgccgcgccg catcggggga agctgaaagt tttcttcggt gcctgtgcag 241 gcgtcgggaa gacctgggcg atgctggcag aagcccagcg actgcgggcg caagggctgg 301 atattgtggt tggcgtggta gaaacccacg ggcgaaaaga taccgccgcc atgctggaag 361 ggctggctgt tctgccgtta aaacgccagg cgtaccgtgg gcggcatatc agcgagtttg 421 atctcgatgc cgccctcgcc cgccgcccgg cgctgatctt aatggacgaa ctggcgcaca 481 gtaatgcgcc aggttcccgt catcccaaac gctggcagga tatcgaagaa ctgctggaag 541 ctggcattga tgttttcact accgtcaacg ttcagcatct ggaaagtctg aatgatgtgg 601 tcagcggcgt caccggaatt caggtacggg aaaccgtgcc cgatcctttt ttcgatgccg 661 ccgacgacgt ggtgctggtg gacttgcccc cggacgatct gcgccagcgg ctgaaagaag 721 gcaaagtcta tattgccggg caggcggagc gcgccattga acattttttc cgcaaaggta 781 atctgatcgc cctgcgcgaa ctggcactgc gccgtactgc cgatcgcgtt gatgagcaaa 841 tgcgcgcctg gcgggggcat cctggcgaag agaaagtgtg gcacacgcgc gacgcgatcc 901 ttttatgcat cggccataac accggcagcg aaaaactggt ccgcgcagcg gcgcggctgg 961 cgtcacggct gggtagcgtc tggcacgcgg tgtatgttga aacccctgcc ctgcaccgct 1021 taccggaaaa aaaacgtcgg gcaattctca gcgccttacg tctggccgag gaactgggcg 1081 cggagacggc aacactttct gatccagcgg aagagaaagc ggtagtgcgt tatgcccgtg 1141 aacataatct cggcaagatt attctcggtc gcccggcctc gcgccgctgg tggcgtcggg 1201 aaacgtttgc tgaccgactg gcgcgcatcg cccccgatct cgatcaggtg ctggtcgcgc 1261 ttgatgaacc acccgcccgc acgattaaca acgcgccgga taaccgctct tttaaagaca 1321 agtggcgtgt acaaattcag ggatgcgtgg ttgccgccgc gttatgcgcc gttatcacct 1381 taattgccat gcagtggctg atggcgtttg atgccgccaa cctggtgatg ctgtatctgc 1441 ttggcgtggt ggtggtggcg ctattttatg gacgctggcc ttcagtggtt gccaccgtca 1501 ttaatgtagt gagtttcgat ctctttttta tcgccccacg cggcacgctc gccgtctctg 1561 atgtgcaata tctgctgacc ttcgcggtga tgttaaccgt cgggctggtg atcgggaacc 1621 ttactgctgg cgtgcgttat caggcgcggg tagcccgtta ccgcgagcaa cgcacacggc 1681 acttatatga aatgtcgaaa gctctggcgg tgggccgcag tccgcaggat atcgctgcca 1741 ccagcgaaca atttattgcc tccacgtttc atgcccgcag tcaggtgttg ttgcccgatg 1801 acaacggtaa attgcagccg ttaacacatc cgcaaggaat gacgccgtgg gacgatgcca 1861 tcgcgcagtg gagttttgat aaaggcctgc ctgcgggcgc gggcaccgac acgttacccg 1921 gtgtaccgta ccagattttg ccgctaaaaa gcggcgagaa aacctacggg ctggtggtgg 1981 tggagccggg gaatctgcgc cagttgatga tcccggaaca gcagcgcctg ctggagacgt 2041 ttacgctgtt agtcgccaat gcccttgagc ggctgacgct aaccgccagc gaagaacagg 2101 cgcggatggc aagcgaacgt gaacagatcc gcaacgccct gctggcggcg ctttcgcatg 2161 atttacgcac gccgcttacg gtgctgtttg gtcaggcaga aatcttaacg ctcgatctgg 2221 caagcgaagg atcaccccac gcccgccagg ccagcgagat ccgtcagcat gtgctgaaca 2281 ctacccgact ggtgaataat ctactggata tggcgcgaat tcagtccggc ggctttaatt 2341 tgaagaaaga gtggttaacg ctggaagaag tagtcggcag cgcgctgcaa atgctggaac 2401 cgggtttatc gtcgcccatc aatctttctc tgccagaacc gctgacctta atccacgttg 2461 acgggccact ctttgaacgg gtgctgatta atctgctgga gaacgcggtg aaatatgcgg 2521 gtgcgcaggc cgaaattggt atcgatgccc acgttgaggg cgaaaatcta caactggatg 2581 tctgggataa cggccccggt cttccgccag gccaggagca gacgatattt gataagtttg 2641 ctcgcgggaa taaagagtcg gcagtaccgg gggtagggct tggactggca atttgtcggg 2701 cgatagtgga tgtacacggg ggcactatta ccgcgttcaa ccgaccggaa ggtggtgcct 2761 gttttcgtgt tacacttccc cagcaaactg cccctgaact tgaagaattt catgaggata 2821 tgtgacaaac gttctgattg ttgaagatga acaggctatt cgtcgctttc tgcgcacggc 2881 gctggagggc gacgggatgc gcgtctttga ggccgaaacg ctgcaacgcg gcttgctgga 2941 agcggcaacc cgtaagccag atttgattat tctcgatctc ggcctgcccg atggtgatgg 3001 gattgagttt atccgcgacc tgcgccagtg gagccgggtg ccggtgattg tgctttccgc 3061 acgcagcgaa gagagcgaca aaatcgccgc gctggatgcc ggagcggatg attatctgag 3121 taagccgttt ggcattggcg aattgcaggc ccgtctgcgc gtcgcattac gccgccactc 3181 tgccaccacc gcgcccgatc cgctggtaaa attttccgat gttaccgtcg atttagccgc 3241 ccgcgtgatt caccggggtg aggaagaggt gcatctcaca ccaattgagt tccgccttgc 3301 tggtcgctgc tcaacaatgc cggaaaagta ctcacccagc ggcccagtcc ttaaccaggt 3361 gtgggggcca aacgcggtcg aacacagtca ctatttgcgt atttatatgg gacatctgcg 3421 acaaaaactg gaacaggatc ccgcccgccc acgccatttc attactgaaa ccggtattgg 3481 ctatcggttt atgctttgaa tattaatttt aatacagcct gccttttatt aattaaagcc 3541 gtaataataa atacggcttt ttatcttaaa caacacacaa aaataacaat tcaatatttt // LOCUS KAEUREX 4761 bp ds-DNA BCT 07-AUG-1990 DEFINITION K.aerogenes urease subunits A (ureA), B (ureB), C (ureC), and urease accessory protein (ureE, ureF and ureG) genes, complete cds. ACCESSION M36068 KEYWORDS urease; urease accessory protein. SOURCE K.aerogenes (strain CG253) DNA. ORGANISM Klebsiella aerogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 4761) AUTHORS Mulrooney,S.B. and Hausinger,R.P. TITLE Sequence of the Klebsiella aerogenes urease genes and evidence for accessory proteins facilitating nickel incorporation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.B.Mulrooney, 03-JUL-1990. Author address: S.B.Mulrooney Michigan State University Dept of Biochemistry East Lansing, MI 48824 FEATURES from to/span description pept 264 566 urease subunit A (ureA) pept 576 896 urease subunit B (ureB) pept 889 2592 urease subunit C (ureC) pept 2602 3078 urease accessory protein (ureE) pept 3080 3754 urease accessory protein (ureF) pept 3763 4380 urease accessory protein (ureG) site 4385 4424 transcription termination signal BASE COUNT 898 a 1542 c 1452 g 869 t ORIGIN 1 ctctcgccga acgtccctgg gtcggcactt tgctgtgcta tccggctacc gatgccctgc 61 tcgacggggt gcgcgacgcg ctggcgccgc tcggtctcta cgccggcgcc agcctgaccg 121 accgcctgct gacggtgcgt ttcctcagtg acgataatct gatttgccag cgggtgatgc 181 gcgacgtatg gcagtttctg cgccctcatc tcaccggtaa atctcccgta cttccccgaa 241 tctggctgac ttaagagaac gttatggaac tgaccccccg agaaaaagac aagctgttgc 301 tgtttaccgc cgcgctggtg gcggagcgtc gcctggcccg cggcctgaag ctcaactatc 361 cggagtccgt ggccctgatc agcgccttta ttatggaagg cgctcgggac ggcaaaagcg 421 tggcctcgct gatggaggaa ggccgtcacg tcctgacccg cgagcaggtg atggagggcg 481 tcccggaaat gatcccggat atccaggtcg aagccacctt cccggacggc tcgaagctgg 541 tcaccgttca caacccgatt atctgaggta gcgccatgat ccccggtgaa tatcacgtta 601 agcccggtca gatagccctg aataccggcc gggcaacctg tcgcgtggtc gttgagaacc 661 acggcgatcg gccgattcag gtcggttcgc actaccattt cgccgaggtt aacccggcgc 721 tgaagttcga ccgtcagcag gccgccggct atcgcctgaa tatcccggcg ggcacggcgg 781 tacgctttga acccggccag aaacgcgagg tcgagctggt ggccttcgcc ggtcaccgcg 841 ccgtcttcgg cttccgcggc gaggtcatgg gccctctgga ggtaaacgat gagtaatatt 901 tcacgccagg cctatgccga tatgttcggc cccaccgtcg gcgacaaggt gcgcctggca 961 gataccgagc tgtggatcga ggtggaggac gatttgacca cctacgggga agaggtcaaa 1021 ttcggcggcg gcaaagtgat ccgcgacggc atgggccagg gacagatgct ggccgccgac 1081 tgtgtcgacc tggtgctcac caacgcgttg atcgtcgatc actgggggat cgttaaggcc 1141 gatatcggcg tgaaggacgg ccggatcttc gccatcggca aagccggcaa ccccgacatc 1201 cagcccaacg tcaccatccc catcggcgct gcgacggaag tgatcgccgc cgaaggaaaa 1261 attgtcaccg ccggcgggat cgatacccat attcactgga tctgtccgca gcaggcggaa 1321 gaggcgctgg tctctggcgt gaccaccatg gtcggcggcg gcaccggccc ggccgcgggc 1381 acccatgcca ccacctgcac cccgggcccg tggtatatct cacgcatgct gcaggcggcc 1441 gacagcctgc cggtcaatat cggcctgctg ggcaagggaa acgtttctca gccggatgcc 1501 ctgcgcgagc aggtggcggc aggcgttatt ggcctgaaga tccatgagga ctggggcgcc 1561 accccggcgg cgatcgactg tgcgttaacc gtcgccgatg aaatggacat ccaggtcgcc 1621 ctgcacagcg acaccctgaa tgaatccggt tttgtggaag acaccctcgc cgccatcggc 1681 gggcgcacca tccacacctt ccataccgaa ggggccggcg gcggccatgc gccggacatc 1741 atcaccgcct gcgcccaccc gaacattttg ccgtcgtcca ccaacccaac gctgccctac 1801 accctcaaca ccatcgatga acatctcgat atgctgatgg tctgccacca tctggacccg 1861 gacatcgccg aggacgtggc ctttgccgag tcgcgcattc gccgggaaac catcgctgcg 1921 gaagacgtgc tgcacgatct cggcgccttc tcgctcacct cctccgattc gcaggccatg 1981 ggccgcgtcg gggaagtgat tctccgcacc tggcaggtgg cgcatcgcat gaaggtgcag 2041 cgcggagcgc tggcggagga gaccggggat aacgacaact tccgcgtgaa gcgctacatc 2101 gccaaataca ccatcaaccc ggcgctgacc cacggcatcg cacacgaagt cggatccatt 2161 gaggtgggta agctggctga cctcgtggtc tggtcaccag ccttcttcgg cgtgaaaccg 2221 gccaccgtga tcaaaggcgg catgatcgcc atcgcgccga tgggcgatat caatgcctct 2281 attccgaccc cgcagccggt gcactaccgc ccgatgtttg gcgcgctggg cagcgcccgc 2341 catcactgcc gcctcacctt cctgtcgcag gcggcggcag ccaatggcgt tgccgagcgg 2401 ctgaacctgc gcagcgcgat cgccgtggtg aaaggctgcc gtacggtgca gaaagccgac 2461 atggtgcaca acagtctgca gcctaacatc accgtcgacg cccagaccta tgaggtgcgg 2521 gtggatggcg aacttatcac cagcgagccg gcagacgttc tgccgatggc gcaacgatat 2581 tttctgtttt aaggagagcg gatgctttat ttaactcaac gtctggagat ccccgccgcc 2641 gcgaccgcca gcgttacgct gccgattgat gttcgcgtca aaagccgggt taaggtcacc 2701 ctcaacgatg gccgggatgc cggcctgctg ctgccccgcg gcctgctact acgcggcggc 2761 gatgtgctca gcaacgaaga aggcaccgag tttgtgcagg tgattgccgc tgatgaagag 2821 gtgtcggtag tgcgctgcga cgatccgttt atgctggcga aggcctgcta ccacctcggc 2881 aaccgtcacg tgccgctgca gatcatgccg ggcgagctgc gctaccatca cgatcacgtg 2941 ctggacgata tgctgcgcca gttcggcctg acggtgacct ttggccagct gccgttcgag 3001 ccggaagccg gcgcttacgc cagcgagagc cacggtcatc atcatgctca tcatgaccac 3061 cacgctcaca gccactagca tgtcgacagc ggaacaacgc ctgcggctga tgcagctggc 3121 cagcagcaac ctgccggtag ggggttacag ctggtcccag gggctggagt gggctgtgga 3181 agccggctgg gtgctggacg tcgcggcctt cgagcgctgg cagcgacgcc agatgacgga 3241 aggctttttt accgttgacc tgccgctgtt cgcccgcctg taccgcgcct gcgaacaagg 3301 cgatatcgct gcggcccagc gctggaccgc ctatctgctg gcctgccggg aaactcgtga 3361 actgcgggag gaagagcgca accgcggcgc ggcgtttgcc cgtctgctga gcgactggca 3421 gccggactgt ccgccgccgt ggcgctccct gtgccagcaa agccagctcg ccgggatggc 3481 ctggctcggc gtgcgctggc gtatcgccct gcccgagatg gccctcagcc tgggctatag 3541 ctggattgag agcgccgtga tggccggcgt caagctggtc cccttcggcc agcaggccgc 3601 ccagcagctg attttacgtc tttgtgacca ctacgcggcc gagatgcccc gcgcgctggc 3661 cgcgccggac ggcgatatcg gatcggccac cccgctcgcc gccatcgcct ctgcccggca 3721 tgaaacccaa tactctcgat tattccgttc ctaggagaag ccatgaactc ttataaacac 3781 ccgctgcgcg tcggcgtcgg cggcccggtc ggctccggta aaaccgctct gctggaagcg 3841 ctgtgtaaag cgatgcgcga tacctggcag ctggcggtgg tcactaacga catctatacc 3901 aaagaagatc agcgcatcct caccgaagcg ggcgcgctgg cgcctgaacg catcgtcggt 3961 gtggaaaccg gcggctgccc gcatacggcg atccgcgaag atgcctcaat gaacctcgcc 4021 gccgtggaag cgctgagtga aaagttcggt aacctcgacc ttatcttcgt ggaaagcggc 4081 ggcgataacc tgagcgccac cttcagcccg gagctggcgg atctgaccat ctacgtcatc 4141 gatgtggccg aaggggagaa gatcccgcgc aaaggcggac cggggatcac caaatccgat 4201 ttcctggtga tcaataaaac cgaccttgcc ccctatgtgg gcgcgtcgct ggaggtgatg 4261 gcgagcgata cccagcgtat gcgcggcgat cgcccatgga ccttcaccaa tctgaagcag 4321 ggcgacggcc tgagcaccat tatcgccttc ctcgaagaca aaggcatgct tggcaaatag 4381 gcctgttgca ccagccgggc gcaagcgcgt ccggatggtg cagcctctgc tctctcttta 4441 tatcatcctg cctccacctc cgcgccacgc ctgccctgca atatggcata aggtttgcta 4501 attcaagtca tgcctaacca ttaaggaatg actatgtcat cactggatct taaccctgaa 4561 ttacccgcga caacgcggac ttccggtacc cgggaaacct tagaagatta caccttacgt 4621 tacgccccgc tgagcttccg ccgctggggt ccgggcgtcg tcgcggtcac cgcgctgggc 4681 ggcatcgcct atctggccga cttttccatc ggcgccagca tcggtatggc ctggggcacc 4741 agcaacgcca tctattcgat c // LOCUS MUSTCP1X 1714 bp ss-mRNA ROD 07-AUG-1990 DEFINITION Mouse t-complex protein (Tcp-1x) mRNA, 3' end. ACCESSION M35797 KEYWORDS t-complex protein. SOURCE Mouse (strain C57BL/6J) 8.5 day-old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1714) AUTHORS Dudley,K., Shanahan,F., Burtenshaw,M., Evans,E.P., Ruddy,S. and Lyon,M.F. TITLE Isolation and characterisation of a cDNA clone corresponding to the mouse t-complex gene Tcp-1x JOURNAL Genet. Res. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Dudley, 29-JUN-1990. FEATURES from to/span description pept < 1 875 t-complex protein (Tcp-1x) BASE COUNT 454 a 387 c 445 g 428 t ORIGIN Chromosome 17, A2/A3 junction. 1 taaaacgacg gccagtgaat tcatgtctta cgggcaagga ggatatgagc aagcctcctc 61 acttgactca cctgagaaca ggagtcagga tgggtgaggt gccactggct gacagcatac 121 tctgtgatgg tctgacagat gcctttcaca actaccacat gggcatcaca gctgaaaacg 181 tagccaaaaa atggcaagtg agcagagagg cccaggacaa ggttgcagtt ctgtcacaga 241 acagggcaga gactgctcag aaagctggcc actttgacaa ggagattgtg ccagtgctgg 301 tgtcttccag aaaaggtctt actgaagtga aaatcgatga atttccccgt catgggagta 361 accttgaagc catgggaacg ctgaaacctt actttcttac tgatgggaca ggaactgtca 421 ccccagcgaa cgcatcagga atgaacgatg gcgctgctgc tgtggtcctt atgaagaaga 481 cagaagctga gaggcggatg ctgaaacctt tagcaagaat agtctcctgg tcccaagccg 541 gtgtggagcc ttctgttatg ggagtaggac caattccagc cataaagcaa gctgttgcaa 601 aggcaggctg gtccctggag gatgttgacc tgtttgaaat caatgaagcc tttgcagcgg 661 tgtctgcggc aatagctaaa gaacttggat taaaccccgg gaaggtcaac atcgatggag 721 gagccattgc cttgggccat cctctgggag catctggctg caggattcta gtgaccttgc 781 tgcacaccct ggagagagtg ggcgggaccc gtggtgttgc agccctgtgc attgggggtg 841 ggatgggagt cgcaatgtgt gttcagagag ggtgaactgc atgacagcca ccaccttgga 901 cagttcttgt taaatcagtg caacactaag cagaaaggga aaccagagga ccaaactgag 961 gacaggaacc caggtggaca gcttgctgta ctttaatgtg agacacccaa ggctacggca 1021 ttcgacctga cactgttgta aataagaggg aaatccaatc agtcatcaag ggctccagag 1081 tgaacagcat tttcataact tccatgttta tcgtctttca tttctggtga tttctctgat 1141 cattgttttg tttttgtttc ttgtttgagt taggctacat atatgcctca aaatcgtttt 1201 gtagatcagg ttgactttga actcccggag atccaccttt ctgtgcctcc tgaatacttg 1261 aattaaaggc atgcaccatt atggcagggc ccccaaaatg atccattcaa aacatgggga 1321 ttcggcttct gttacgaact tcaggctctt cactttaact atggttggtt caaacgcccc 1381 tgcgtgcctg tctctcggtt tcccattgat caaatcaaga ccaatcctgt agtgaaagtc 1441 ggattcagtg agcctcttcc aggctgaggt ggggtaggta ttccagctgt tctagtcaga 1501 agactggcag tgccaacaag tgtgcctctg gattgttgaa aggtgggcag caagcagcat 1561 ccacgtactt atttctctgc ttgacgggat gggatgttac ctgcttcaga tatctacctt 1621 gacttcccca aaatgatgac tggaactgga actgacagtc aaataaactc tttcctccct 1681 taaaaaaaaa aaccttgacc gtaagacatg aatt // LOCUS YSCAER2 2551 bp ds-DNA PLN 07-AUG-1990 DEFINITION S.cerevisiae heme-regulated and catabolite repressed genes repressor (AER2) gene, complete cds. ACCESSION M35861 KEYWORDS catabolite repressed gene repressor; heme-regulated gene repressor. SOURCE S.cerevisiae. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2551) AUTHORS Zhang,M., Rosenblum-Vos,L.S., Lowry,C.V., Boakye,K. and Zitomer,R.S. TITLE A yeast protein with homology to the B-subunit of G-proteins is involved in repression of heme-regulated and catabolite repressed genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Zitomer, 02-JUL-1990. Author address: R.S.Zitomer Department of Biological Sciences State University of New York Albany, NY 12222 FEATURES from to/span description pept 247 2388 repressor AER2 BASE COUNT 783 a 628 c 488 g 652 t ORIGIN 1 aaaacaaaat aacaagtaca taagtaaaca aatagagaga aattccccca tctaccccca 61 attgataaac atcagcgaag caagaacaac tggctgaaca cgtgcccctc tatctgtcct 121 tctgcaattg tttaattttc tttcacgcca agttaccttt cgcacaccac tactcttttt 181 ctattgtttt tttgtctttt ttgataagca ggggaagaaa gaaatcagct ttccatccaa 241 accaatatga ctgccagcgt ttcgaatacg cagaataagc tgaatgagct tctcgatgcc 301 atcagacagg agtttctcca agtctcacaa gaggcaaata cctaccgtct tcaaaaccaa 361 aaggattacg atttcaaaat gaaccagcag ctggctgaga tgcagcagat aagaaacacc 421 gtctacgaac tggaactaac tcacaggaaa atgaaggacg cgtacgaaga agagatcaag 481 cacttgaaac tagggctgga gcaaagagac catcaaattg catctttgac cgtccagcaa 541 cagcggcaac agcaacagca gcaacaggtc cagcagcatt tacaacagca acagcagcag 601 ctagccgctg catctgcatc tgttccagtt gcgcaacaac caccggctac tacttcggcc 661 accgccactc cagcagcaaa cacaactact ggttcgccat cggccttccc agtacaagct 721 agccgtccta atctggttgg ctcacagttg cctaccacca ctttgcctgt ggtgtcctca 781 aacgcccaac aacaactacc acaacagcaa ctgcaacagc agcaacttca acaacagcaa 841 ccacctcccc aggtttccgt ggcaccattg agtaacacag ccatcaacgg atctcctact 901 tctaaagaga ccactacttt accctctgtc aaggcacctg aatctacgtt gaaagaaact 961 gaaccggaaa ataataatac ctcgaagata aatgacaccg gatccgccac cacggccacc 1021 actaccaccg caactgaaac tgaaatcaaa cctaaggagg aagacgccac cccggctagt 1081 ttgcaccagg atcactactt agtcccttat aatcaaagag caaaccactc taaacctatc 1141 ccacctttcc ttttggatct agattcccag tctgttcccg atgctctgaa gaagcaaaca 1201 aatgattatt atattttata caacccggca ctaccaagag aaattgacgt tgagttacac 1261 aaatctttgg atcatacttc agttgtttgt tgcgtgaagt tcagtaacga tggtgaatac 1321 ttagccacag gctgcaacaa aactactcaa gtgtatcgcg tttcagatgg ttctctggtg 1381 gcccgtctat ctgacgattc tgctgccaat aaccatcgaa attcgatcac tgaaaataac 1441 accaccacgt ccacggataa caatacaatg acaaccacta ctaccaccac aattactacc 1501 acagcgatga cttcggcagc agaattggca aaagatgtgg aaaacctgaa cacttcgtct 1561 tccccatcat ccgacttgta tatccgttca gtgtgttttt ctccagatgg gaaatttttg 1621 gcaacaggtg ctgaagacag actgattaga atttgggata ttgaaaatag aaagattgtt 1681 atgattcttc aaggccacga acaagatatt tattcattgg actactttcc ctcaggtgac 1741 aaattagtct ccggttctgg tgaccgtacc gttcgtattt gggacttacg tacaggccag 1801 tgttcattga ctttatccat tgaagatggt gttaccaccg tcgctgtatc accaggtgat 1861 ggtaaataca tcgctgctgg ttctctagat cgtgctgtga gagtttggga ttccgagacc 1921 ggattcttgg tggaaagact agattcggaa aacgaatccg gtacaggcca caaggactct 1981 gtttatagcg ttgtcttcac tagagatgga caaagcgttg tatccggctc attagataga 2041 tctgttaagc tctggaattt gcagaatgca aacaacaaga gcgattcgaa aactccaaat 2101 tccggcactt gtgaagttac gtatatcggg cataaagact ttgtattgtc cgtggccacc 2161 acacaaaatg atgagtacat cttgtccggt tccaaagatc gtggtgtcct gttttgggat 2221 aagaaatccg gcaatccgtt attgatgttg caaggtcata ggaattcagt tatatctgtg 2281 gctgtggcaa acgggtctcc gctgggtcca gaatataacg tttttgctac tggtagcggt 2341 gattgtaaag caaggatttg gaagtataaa aaaatagcgc caaattaaga acagaacaca 2401 aaaggaacac tttacaaatg taactaacta aactattctt caattcattg tcattatatc 2461 tttctctcta ttctttttta tctacttatt tgttatgtta tgatccaggt acgctttaaa 2521 acaaacagtt actttcctca ccattcatcc c // LOCUS ALFHISH3A 1089 bp ds-DNA PLN 07-AUG-1990 DEFINITION Alfalfa histone H3 (H3-1.1) gene, complete cds. ACCESSION M35867 KEYWORDS histone H3. SOURCE Alfalfa (L. cv. Nagyszenasi) DNA, clone lambda-ALH3-1. ORGANISM Medicago sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 1089) AUTHORS Wu,S.C., Boegre,L., Vincze,E., Kiss,G.B. and Dudits,D. TITLE Isolation of an alfalfa histone H3 gene: Structure and expression JOURNAL Plant Mol. Biol. 11, 641-649 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 285 695 histone H3 (H3-1.1) signal 93 97 CAAT box signal 168 174 TATA box signal 817 826 polyA signal signal 963 968 polyA signal signal 1008 1013 polyA signal site 2 6 GATCC motif site 38 43 ACGTCA consensus site 46 50 GATCC motif site 53 46 (c) CGCGGATC octamer BASE COUNT 305 a 271 c 178 g 335 t ORIGIN 1 ggatcctcat cacacaaaca aaacacatcc acacgccacg tcatcgatcc gcgtgtcgca 61 aatactccaa ataaacgaca cccgtcgatt aaccaataat caacggccac aattacaccc 121 cattcaccca cttctcaaat ttcaaaaccc gcacaaaatc atatcactat ataactcacc 181 cctttcatct cttcttcctc atcactctca attcttcaaa gcacaaaaac aaccttcaag 241 tttctctgtt tgatactgtt cttttcagtt atttttcata atcaatggca cgtaccaagc 301 aaaccgctcg caaatccacc ggtggcaaag ctccaaggaa gcaactcgcc acaaaagccg 361 ctcgcaaatc tgctccggcc accggcggag tgaagaaacc tcaccgtttc aggccaggaa 421 ccgtcgctct ccgtgagatc cgcaagtacc agaagagcac tgagctcctc atccgcaaac 481 tccccttcca gcgtcttgtc cgtgagatcg ctcaggattt caagactgat ctccgtttcc 541 agagctccgt cgtgtcggct ttgcaagaag cggccgaggc ttatctcgtc ggtctctttg 601 aggatactaa cctctgcgcc attcatgcta agcgtgtcac tatcatgcct aaggatatcc 661 agctcgctag gcgtatccgt ggcgagcgtg cttgatcttg ttgattcgct ttgttagggt 721 ttgtgtagat aggttcatga tgtagttaaa tcacaaaccg ttgctataag tttctctatg 781 gattttgtta tattgtaatg tgcttaacgc ttaatcaatg aaatcgatca tcttttgtta 841 aactctttgt tcaattactt atgctttttt tttatctttt cttaacccta attttctgtc 901 attttattac actttccgaa cttttgttat ccctaattgg atttgaaatc aaaattaggg 961 ttgataaagg catattgtat aatgttgaaa ttcttgttaa atatattgat aaatacaatt 1021 atttcaaagt aaaaattaaa tgctgttgat tacatcttga aatgtaaaaa taattgcatg 1081 cataagctt // LOCUS APHYEPPC 400 bp ds-DNA SYN 07-AUG-1990 DEFINITION Foot-and-mouth disease virus/SP6 promoter recombinanant DNA. ACCESSION M35873 KEYWORDS . SOURCE Foot-and-mouth disease virus (strain O1K), cDNA to viral RNA, clone pFMDV-YEP-polyC. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 400) AUTHORS Zibert,A., Maass,G., Strebel,K., Falk,M.M. and Beck,E. TITLE Infectious foot-and-mouth disease virus derived from a cloned full- length cDNA JOURNAL J. Virol. 64, 2467-2473 (1990) STANDARD simple staff_entry FEATURES from to/span description RNA 18 396 RNA signal 1 17 SP6 promoter recomb 29 30 plasmid DNA end/foot-and-mouth disease DNA start BASE COUNT 68 a 122 c 112 g 98 t ORIGIN 1 atttaggtga cactatagaa tacaagctct tgaaaggggg cattagggtc tcacccctag 61 taagccaacg acagtccctg cgttgcactc cacacttacg ttgtacacac gcgggacccg 121 atgggctatc gttcacccac ctacagctgg actcacggcg ccgcgtggcc atttagctgg 181 attgtgcgga cgaacacgct tgcgcacctc gcgtgaccgg ttagtactct taccactctc 241 cgcctacttg gtcgttagcg ctgtcttggg cattcctgtg ggggccgttc gacgctccac 301 gggaactctc ctgtgtgaca tctacggtga tggggccgtt tcgcgtgggc tggtcgtttg 361 gactgcttcg gctgtcaccc ggcgcccgcc tttcagctag // LOCUS CYNAAAGFP 282 bp ss-mRNA SYN 07-AUG-1990 DEFINITION C.tetragonoloba alpha-amylase/alpha-galactosidase fusion protein gene, 5' end. ACCESSION M35837 KEYWORDS alpha-amylase; alpha-galactosidase; fusion protein. SOURCE C.tetragonoloba and plasmid DNA, clone pUR2601. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 282) AUTHORS Overbeeke,N., Termorshuizen,G.H.M., Giuseppin,M.L.F., Underwood,D.R. and Verrips,C.T. TITLE Secretion of the alpha-galactosidase from Cyamopsis tetragonoloba (Guar) by Bacillus subtilis JOURNAL Appl. Environ. Microbiol. 56, 1429-1434 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 88 > 282 alpha-amylase/alpha-galactosidase fusion protein binding 77 80 ribosomal binding site (put.) recomb 180 181 alpha-amylase end/alpha-galactosidase start BASE COUNT 88 a 56 c 70 g 68 t ORIGIN 1 gctgcaggta accggatccg aattcccggg gatccgtcct gatatgtaaa atataatttg 61 tataagaaaa tgagagggag aggaaacatg attcaaaaac gaaagcggac agtttcgttc 121 agacttgtgc ttatgtgcac gctgttattt gtcagtttgc cgattacaaa aacatcagcc 181 gcggaaaacg gtttgggtca aaccccacca atgggttgga actcctggaa ccacttcggt 241 tgtgacatta acgaaaacgt tgtcagagaa acagctgatg ca // LOCUS HSODR3E 111 bp ds-DNA VRL 07-AUG-1990 DEFINITION Herpesvirus papio 3' dyad region DNA. ACCESSION M35835 KEYWORDS dyad region. SOURCE Herpesvirus papio DNA, from baboon B-cell. ORGANISM Herpesvirus papio Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 111) AUTHORS Loeb,D.D., Sung,N.S., Pesano,R.L., Sexton,C.J., Hutchison,C.H.III. and Pagano,J.S. TITLE Plasmid origin of replication of herpesvirus papio: DNA sequence and enhancer function JOURNAL J. Virol. 64, 2876-2883 (1990) STANDARD simple staff_entry BASE COUNT 29 a 30 c 28 g 24 t ORIGIN 1 ggacagcaca tactgcctcg cagatagcat ataccgccgg catgatagca tatgctaccc 61 agatggatag catacgctac ctcccgggta gtatttgcta tccgggtaag g // LOCUS HUMMHDQB3A 552 bp ss-mRNA PRI 07-AUG-1990 DEFINITION Human MHC class II DQ3.1ER (DR4) mRNA, partial cds. ACCESSION M35730 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human homozygous cell line ER, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 552) AUTHORS Holbeck,S.L. and Nepom,G.T. TITLE Molecular analysis of DQ-beta-3.1 genes JOURNAL Hum. Immunol. 21, 183-192 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 552 MHC class II DQ3.1ER protein (AA at 3) /hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1" /map="6p21.3" site 269 270 MHC class II DQ3-beta-1 exon end/MHC class II DQ3-beta-2 exon start BASE COUNT 128 a 165 c 162 g 97 t ORIGIN Chromosome 6p21.3. 1 aggatttcgt gtaccagttt aaggccatgt gctacttcac caacgggacg gagcgcgtgc 61 gttatgtgac cagatacatc tataaccgag aggagtacgc acgcttcgac agcgacgtgg 121 aggtgtaccg ggcggtgacg ccgctggggc cgcctgacgc cgagtactgg aacagccaga 181 aggaagtcct ggagaggacc cgggcggagt tggacacggt gtgcagacac aactaccagt 241 tggagctccg cacgaccttg cagcggcgag tggagcccac agtgaccatc tccccatcca 301 ggacagaggc cctcaaccac cacaacctgc tggtctgctc agtgacagat ttctatccag 361 cccagatcaa agtccggtgg tttcggaatg accaggagga gacaaccggc gttgtgtcca 421 ccccccttat taggaacggt gactggacct tccagatcct ggtgatgctg gaaatgactc 481 cccagcatgg agacgtctac acctgccacg tggagcaccc cagcctccag aaccccatca 541 ccgtggagtg gc // LOCUS HUMMHDQB3B 530 bp ss-mRNA PRI 07-AUG-1990 DEFINITION Human MHC class II DQ3.1LUY (DR8) mRNA, partial cds. ACCESSION M35731 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human homozygous cell line LUY, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 530) AUTHORS Holbeck,S.L. and Nepom,G.T. TITLE Molecular analysis of DQ-beta-3.1 genes JOURNAL Hum. Immunol. 21, 183-192 (1988) STANDARD simple staff_entry FEATURES from to/span description pept.ps < 1 > 530 MHC class II DQ3.1LUY protein (AA at 3) /hgml_locus_uid="LS0098W" /nomgen="HLA-DQB1" /map="6p21.3" site 248 249 MHC class II DQ3-beta-1 exon end/MHC class II DQ3-beta-2 exon start BASE COUNT 124 a 157 c 154 g 95 t ORIGIN Chromosome 6p21.3. 1 aggatttcgt gtaccagttt aaggccatgt gctacttcac caacgggacg gagcgcgtgc 61 gttatgtgac cagatacatc tataaccgag aggagtacgc acgcttcgac agcgacgtgg 121 aggtgtaccg ggcggtgacg ccgctggggc cgcctgacgc cgagtactgg aacagccaga 181 aggaagtcct ggagaggacc cgggcggagt tggacacggt gtgcagacac aactaccagt 241 tggagctctg gagcccacag tgaccatctc cccatccagg acagaggccc tcaaccacca 301 caacctgctg gtctgctcag tgacagattt ctatccagcc cagatcaaag tccggtggtt 361 tcggaatgac caggaggaga caaccggcgt tgtgtccacc ccccttatta ggaacggtga 421 ctggaccttc cagatcctgg tgatgctgga aatgactccc cagcatggag acgtctacac 481 ctgccacgtg gagcacccca gcctccagaa ccccatcacc gtggagtggc // LOCUS LGILG106NL 378 bp ss-mRNA PLN 07-AUG-1990 DEFINITION L.gibba negatively light-regulated protein (Lg106) mRNA, 3' end. ACCESSION M35866 KEYWORDS negatively light-regulated protein. SOURCE L.gibba (L. G-3), cDNA to mRNA, clone pLg106. ORGANISM Lemna gibba Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Arecidae; Arales; Lemnaceae. REFERENCE 1 (bases 1 to 378) AUTHORS Okubara,P.A., Flores,S. and Tobin,E.M. TITLE Characterization of a negatively light-regulated mRNA from Lemna gibba JOURNAL Plant Mol. Biol. 11, 673-681 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 261 ORF1 (AA at 1) pept < 1 82 ORF2 (AA at 2) pept < 1 62 ORF3 (AA at 3) BASE COUNT 96 a 96 c 102 g 84 t ORIGIN 1 gacggcggat tattacccaa aaagcctcct ctcatttcca aggaccacga gcgagccttt 61 gactccgctg actgggcatt aggaaagcaa ggaggccccg tggagaagcc gaagcctctg 121 gaggcgctac ggccgaaact ccagccaact cagcagcaag ttcgcttgag atcggtttat 181 gctcctgccg agggcgaaga ttggatccag cggcgcgaca gaagagatgg gcaacgagtg 241 agtcgcctcg gaggaatgta gtatccatcc tatatctctc tctccgattc ctgaataaaa 301 cgatctattt tgtggcatct cctgaagaac cctaattgat tagtagttga ggtttgacct 361 tgggtcaacg gatcagat // LOCUS MUSIGHADM 360 bp ss-mRNA ROD 07-AUG-1990 DEFINITION Mouse Ig heavy chain mRNA V-region, partial cds. ACCESSION M26808 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain C57BL/6) hybridoma line B1-8.V4, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Brueggemann,M., Mueller,H.-J., Burger,C. and Rajewsky,K. TITLE Idiotypic selection of an antibody mutant with changed hapten binding specificity, resulting from a point mutation in position 50 of the heavy chain JOURNAL EMBO J. 5, 1561-1566 (1986) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 360 Ig heavy chain V-region (AA at 1) BASE COUNT 89 a 92 c 99 g 80 t ORIGIN Chromosome 12. 1 caggtccaac tgcagcagcc tggggctgag cttgtgaagc ctggggcttc agtgaagctg 61 tcctgcaagg cttctggcta caccttcacc agctactgga tgcactgggt gaagcagagg 121 cctggacgag gccttgagtg gattggaggg attgatccta atagtggtgg tactaagtac 181 aatgagaagt tcaagagcaa ggccacactg actgtagaca aaccctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgcggtct attattgtgc aagatacgat 301 tactacggta gtagctactt tgactactgg ggccaaggca ccactctcac agtctcctca // LOCUS PLYJCRR 270 bp ds-DNA VRL 07-AUG-1990 DEFINITION Polyomavirus JC regulatory region DNA. ACCESSION M35834 KEYWORDS regulatory region. SOURCE Polyomavirus JC DNA, from human progressive multifocal leukoencephalopathy patient CY urine. ORGANISM Polyomavirus JC Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 270) AUTHORS Yogo,Y., Kitamura,T., Sugimoto,C., Ueki,T., Aso,Y., Hara,K. and Taguchi,F. TITLE Isolation of a possible archetypal JC virus DNA sequence from nonimmunocompromised individuals JOURNAL J. Virol. 64, 3139-3143 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 268 > 270 late leader protein orgrpl 12 < 1 (c) origin of DNA replication signal 15 29 TATA box BASE COUNT 78 a 66 c 77 g 49 t ORIGIN 1 gcctcggcct cctgtatata taaaaaaaag ggaaggtagg gaggagctgg ctaaaactgg 61 atggctgcca gccaagcatg agctcatacc tagggagcca accagctgac agccagaggg 121 agccctggct gcatgccact ggcagttata gtgaaacccc tcccatagtc cttaatcaca 181 agtaaacaaa gcacaagggg aagtggaaag cagccagggg aacatgtttt gcgagccaga 241 gctgttttgg cttgtcacca gctggccatg // LOCUS PSEIAAMH 3677 bp ds-DNA BCT 07-AUG-1990 DEFINITION P.syringae tryptophan 2-monooxygenase (iaaM) and indoleacetamide hydrolase (iaaH) genes, complete cds. ACCESSION M11035 M35690 KEYWORDS iaaH gene; iaaM gene; indoleacetamide hydrolase; indoleacetic acid; indoleacetic acid operon promoter region; tryptophan 2-monooxygenase; tryptophan-2-oxyreductase. SOURCE P.syringae savastanoi (strain EW2009) DNA, clones pLUC2, pCP3, and pCJP12. ORGANISM Pseudomonas syringae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 202 to 3677) AUTHORS Yamada,T., Palm,C.J., Brooks,B. and Kosuge,T. TITLE Nucleotide sequences of the Pseudomonas savastanoi indoleacetic acid genes show homology with agrobacterium tumefaciens T-DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6522-6526 (1985) STANDARD simple staff_review REFERENCE 2 (bases 1 to 312) AUTHORS Gaffney,T.D., da Costa e Silva,O., Yamada,T. and Kosuge,T. TITLE The indoleacetic acid operon of Pseudomonas syringae subsp. savastanoi: Transcription analysis and promoter identification JOURNAL J. Bacteriol. (1990) In press STANDARD simple staff_entry COMMENT The iaaH and iaaM genes are part of an operon that is borne on a plasmid, pIAA, in oleander strains of the pathogen. There appears to be no genetic transformation of host tissue by P.savastanoi. No Pribnow boxes were found, though potential ribosome binding sites are located at positions 435-439 and 2164-2168. Expression of iaaH depends upon the promoter for iaaM. Draft entry and computer-readable sequence for [1] kindly submitted by T.Yamada, 14-JAN-1986. Draft entry and computer-readable sequence for [2] kindly submitted by T.D.Gaffney, 28-JUN-1990. FEATURES from to/span description pept 454 2127 tryptophan 2-monooxygenase (EC 1.13.12.3) pept 2170 3537 indoleacetamide hydrolase pept 82 312 promoter proximal ORF mRNA 54 > 3537 indoleacetic acid mRNA site 19 47 indoleacetic acid operon promoter binding 577 651 FAD binding site BASE COUNT 813 a 917 c 1046 g 901 t ORIGIN 1 bp upstream of EcoRI site. 1 ccacaaaccg gcaagacgtt gaaaatcaat ttcaagcatt gtatgttttt ggtgggacgc 61 cacttggaaa atctggtgct gatgcccgga cggactcctg cgaactggct ggacaaggcg 121 atggtggcag tggcccgaat cagacagcgc aagccccagg cggcagtacc cgcggcgctc 181 ctccaagccc cgcacgctat ggaattcgtt tggagcacca ccaaaacctt taccgaatgc 241 cattgtccgg cttacacccc ctatacgcaa gccagtgcgt ggtcggccag cgcggtcgtt 301 agtgcttatt aattttagag gactagcatg gtttgaagga atggttcagg tgttttggat 361 tttggcggct gccgttaaaa ggtgctgttt cagtttcatt taaagaatcg taatccgggt 421 agcacgtaag gtcgaagagg gtagagtcgc gttatgtatg accattttaa ttcacccagt 481 attgatattt tgtacgacta cggtcccttt ctgaaaaaat gtgaaatgac gggaggcata 541 ggcagctatt cagccggaac gcccacccct cgggtagcga tagtcggtgc cggcatcagt 601 gggctggtcg ctgcaactga actattacgt gcgggagtca aggacgttgt cttatatgaa 661 tcgcgtgatc gaatcggggg acgggtatgg tctcaagttt tcgatcagac tcgtccacgt 721 tacattgcag aaatgggtgc gatgcgcttt cctcccagtg caactggcct tttccactac 781 ctgaaaaagt ttggtatttc gacgtcgacc acctttccgg atcctggtgt ggtggacacg 841 gagctgcatt accgtggcaa gcgctatcac tggccagcgg gcaaaaagcc gcccgaatta 901 ttcaggcgag tctatgaggg gtggcagtct ctattgtccg aaggttacct ccttgaaggc 961 ggttctttag ttgccccgct ggacattacc gcaatgctga agtcgggtcg tctggaagag 1021 gcagcgatcg catggcaggg atggctcaat gtattccggg attgttcatt ctataacgcg 1081 attgtctgta tttttactgg ccgccatccg ccaggcggcg acagatgggc tcgtcctgaa 1141 gactttgagc tgtttggctc gcttggcata ggctcgggcg ggtttttgcc agtctttcag 1201 gctggcttta cggaaatact gcggatggtt atcaacggat accaaagtga ccagcgactg 1261 attccggacg ggatatccag tctggccgcg agactcgctg atcagtcgtt tgacggcaaa 1321 gcgttaaggg accgcgtttg ttttagccgg gtaggtcgca tttccagaga agctgaaaaa 1381 atcatcatcc agacggaagc aggagaacag cgtgtatttg atcgagtaat tgtcactagc 1441 agtaatcggg ccatgcaaat gattcactgc ctcacggata gcgagagctt tctgagtcgt 1501 gatgtcgctc gtgctgtccg cgaaacccat ctgacaggtt catcgaagct tttcattctc 1561 acccgaacca aattctggat aaaaaacaag cttcccacca ccatccagtc ggacggtctg 1621 gtgcgcggcg tctattgtct ggattatcag cccgatgaac ctgaggggca tggcgttgtt 1681 ctgctcagtt acacgtggga agacgacgct caaaaaatgc tggcgatgcc tgacaagaaa 1741 acgcgttgcc aggtactggt tgatgacctt gctgcgatac acccgacgtt cgccagttat 1801 ctcctgcccg ttgatgggga ttatgagcgg tatgtattgc accatgactg gctcaccgat 1861 ccccattctg cgggcgcttt caaactcaat tatcccggcg aggacgttta ctcgcagcga 1921 ttgttttttc aaccaatgac agcgaacagt cccaataaag acacggggct ctatctggct 1981 ggctgcagtt gctcttttgc cggagggtgg atcgaaggtg ctgtccagac agcattgaac 2041 agtgcttgcg cggtgctgcg cagcaccgga gggcaactgt caaaaggcaa cccgctggac 2101 tgtatcaacg cctcctatcg ctattaacag tcctgagagc agcgctaagc taatacgggt 2161 gaaaagagca tgcatgaaat aatcacactt gaatcgctgt gccaggcatt ggctgatggc 2221 gagattgccg ctgcggagct gcgcgagcgt gcgctcgata ctgaggcccg cttggcccgt 2281 ttgaactgtt tcatacgtga gggtgatgcg gtcagccagt ttggcgaggc tgatcatgca 2341 atgaagggta ctccactctg gggaatgccg gtttccttca aggataatat ctgtgttcgc 2401 ggtttgccgt tgacagccgg gacgcgagga atgtccggtt tcgtatctga tcaggatgct 2461 gcgattgtca gtcaactcag agccctcggg gctgtggtcg ctggcaagaa caacatgcac 2521 gaactcagtt tcggagtgac ctccatcaat cctcattggg gaaccgtggg aaaccccgtg 2581 gcccccggtt attgtgccgg aggtagtagt ggtggaagtg ccgccgcagt ggcaagtgga 2641 attgttccgc tgtcggtggg gaccgacacg gggggctcga taagaatacc ggcggccttc 2701 tgcggcatta cgggcttcag acccactact ggacgctggt caacggcagg cattatccct 2761 gtttctcata caaaggattg cgttgggttg ctgacacgaa cggcaggcga cgccggattt 2821 ttgtacggac tgttgtcagg caagcagcag tcttttcccc tgagcaggac ggccccctgt 2881 cgaatcggcc taccggtctc catgtggtcc gatctggatg gcgaggtgga gagggcatgc 2941 gtgaatgcac tcagcctgct gcgcaagaca gggttcgagt ttattgaaat tgatgatgcc 3001 gatattgtcg aactgaacca gacactcacg ttcaccgttc cgctttacga attctttgcc 3061 gaccttgctc agtcattgct ctccttgggc tggaagcacg gtatccatca tatttttgca 3121 caggttgatg atgccaacgt gaaaggcatc atcaatcacc accttgggga gggcgctata 3181 aaaccggctc actatttgag ttcactgcaa aatggcgagc tgttgaaaag aaaaatggat 3241 gagctatttg ctcgtcataa tatcgagctc ctggggtacc ccacggtacc ttgccgggta 3301 cctcatctgg atcatgctga ccgaccagag tttttcagtc aggcaattcg caataccgac 3361 ctggccagca atgcgatgct cccctccatt actattcctg tcgggccaga agggcgattg 3421 ccggtcggct tgagctttga tgctctaaga gggcgagacg cacttttgct gagcagagtc 3481 agtgccatag agcaggtatt aggttttgta cgaaaagtat tgccgcatac aacgtaggct 3541 acaggccagc gtgaccatgg ctgcgtagct cttggccagc ttgtcgtagc gggtgccgat 3601 tcggcggttc tcttttagcc agccaaacat ccgctcaatg atgttgcgct gccgatactt 3661 tggacgatcg aagagtc // LOCUS SVCG3E 68 bp ss-RNA VRL 07-AUG-1990 DEFINITION Spring viremia of carp virus genome, 3' end. ACCESSION M35836 KEYWORDS . SOURCE Spring viremia of carp virus RNA. ORGANISM Spring viremia of carp virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae. REFERENCE 1 (bases 1 to 68) AUTHORS Roy,P., Gupta,K.C. and Kiuchi,A. TITLE Characterization of Spring viremia of carp virus mRNA species and the 3' sequence of the viral RNA JOURNAL Virus Res. 1, 189-202 (1984) STANDARD simple staff_entry BASE COUNT 20 a 13 c 10 g 25 t ORIGIN 1 tgcttctgtt tatttggtaa ctattgtcaa tgtaccaaat aaacagacca ttgtctgtag 61 tactcaca // LOCUS TTHSUCDA 880 bp ds-DNA BCT 07-AUG-1990 DEFINITION T.aquaticus B succinyl-CoA synthetase alpha-subunit (sucD) gene, complete cds. ACCESSION M35832 X12815 KEYWORDS sucD gene; succinyl-CoA synthetase alpha-subunit. SOURCE T.aquaticus (strain B (NCIB 11247)) DNA. ORGANISM Thermus aquaticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 880) AUTHORS Nicholls,D.J., Sundaram,T.K., Atkinson,T. and Minton,N.P. TITLE Nucleotide sequence of the succinyl-CoA synthetase alpha-subunit from Thermus aquaticus B JOURNAL Nucleic Acids Res. 16, 9858-9858 (1988) STANDARD full staff_entry COMMENT Gene product mature structure is (alpha beta)2. Submitted (09-SEP-1988) on tape to the EMBL data library. FEATURES from to/span description pept 14 880 succinyl-CoA synthetase alpha-subunit (sucD; EC 6.2.1.5) binding 2 7 sucD ribosomal binding site (put.) BASE COUNT 153 a 304 c 310 g 113 t ORIGIN 1 tgggaggtgc ggcgtgatcc tggtgaaccg cgagacccgc gtcctggtcc agggcatcac 61 cggccgggag gggcagttcc acaccaagca gatgctggac tacggcacca agatcgtcgc 121 cggggtcacc ccgggcaaag ggggaacgga ggtcctaggg gtccccgtct acgacacggt 181 gaaggaggcg gtggcccacc acgaggtgga cgcctccatc atcttcgtgc ccgccccggc 241 cgcggcggac gccgccctgg aagcggccca cgccgggatc cccctcatcg tcctcatcac 301 cgagggcatc cccaccctgg acatggtgcg ggcggtggag gagatcaagg ccctgggaag 361 ccgcctcatc ggggggaact gccccgggat catcagcgcc cacctaccca agatcgggat 421 catgcccggc cacgtcttca agcggggccg ggtggggatc atcagccgct ccggcaccct 481 tcggcacgag gccgcagccg ccctttccca ggcggggctc ggcaccacca ccacggtggg 541 gatcgggggc gaccccgtca ggcggctcac cttcaaggac ctcctccccc tcttcaacga 601 ggacccggag acggaggccg tggtcctcat cggggagatc ggcggctccg acgaggagga 661 ggcggcggct tgggtgaagg accacatgaa gaagccggtg gtgggcttca tcggaggccg 721 ctccgccccc aagggcaagc gcatgggcca cgccggggcc atcatcatgg gcaacgtggg 781 caccccggag tccaagctcc gggccttcgc cgaggcgggc atccccgtgg ccgacaccat 841 cgacgagatc gtggagctgg tcaagaaggc cctgggctaa // LOCUS MUSMHPCRE 256 bp ds-DNA ROD 07-AUG-1990 DEFINITION Mouse MHC class I Q even processed pseudogene, exons 7 and 8. ACCESSION M30902 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex; pseudogene. SOURCE Mouse adult (strain SK/CamE1) liver. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 256) AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K., Dilwith,R. and Flaherty,L. TITLE Genetic polymorphisms of Q region genes from wild derived mice: Implications for Q region evolution JOURNAL Immunogenetics 31, 315-325 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Flaherty 04-JAN-1990. FEATURES from to/span description pept.ps / 45 82 pseudo-MHC Q even, exon 7 (AA at 46) 191 219 pseudo-MHC Q even, exon 8 IVS < 1 44 pseudo-MHC Q even intron A (no splice consensus) IVS 83 190 pseudo-MHC Q even intron B (no splice consensus) BASE COUNT 52 a 52 c 70 g 82 t ORIGIN Chromosome 17. 1 agatatatct ttgtatatat ttttccctag gcagggacag ctcccagagc tctgatatgt 61 ttctctcaag attgtaaagg tgacattcta tggcctgatt gcagaggggc actgtggaca 121 tggttgtgtt tcagggactc ccacaatccc ctgtgagtgg tgggttgttg ggatattgtc 181 ttcattgtgg tggttcctga ccctcgttct ctattatgaa gacagctgcc tggagtggac 241 tcagtgacag ccagtg // LOCUS MUSMHPCRM 264 bp ds-DNA ROD 07-AUG-1990 DEFINITION Mouse MHC class I Q odd processed pseudogene, exons 7 and 8. ACCESSION M30901 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex; pseudogene. SOURCE Mouse adult (strain Metkovic 2) liver. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 264) AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K., Dilwith,R. and Flaherty,L. TITLE Genetic polymorphisms of Q region genes from wild derived mice: Implications for Q region evolution JOURNAL Immunogenetics 31, 315-325 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Flaherty 04-JAN-1990. FEATURES from to/span description pept.ps / 45 81 pseudo-MHC Q odd, exon 7 (AA at 45) 194 225 pseudo-MHC Q odd, exon 8 IVS < 1 44 pseudo-MHC Q odd intron A (no splice consensus) IVS 82 193 pseudo-MHC Q odd intron B BASE COUNT 59 a 53 c 76 g 76 t ORIGIN Chromosome 17. 1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata 61 gatctctccc agatggtaaa ggtgacactc tggggcctga ttggagaggg gcaatgtgga 121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat 181 gttgtcttca cagtgatggg tcatgtctct cattctctag catgaagaca gctgcctgga 241 ctgcactgag tgacagacga tgtg // LOCUS MUSMHQ4D 261 bp ds-DNA ROD 07-AUG-1990 DEFINITION Mouse MHC class I Q4d Qb-1 processed pseudogene (d haplotype), exons 7 and 8. ACCESSION M30896 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex; pseudogene. SOURCE Mouse adult (strain Balb/c, haplotype d) sperm, clone 65.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 261) AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K., Dilwith,R. and Flaherty,L. TITLE Genetic polymorphisms of Q region genes from wild derived mice: Implications for Q region evolution JOURNAL Immunogenetics 31, 315-325 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Flaherty 04-JAN-1990. FEATURES from to/span description pept.ps / 45 82 peudo-MHC Q4d Qb-1, exon 7 (AA at 46) 191 222 peudo-MHC Q4d Qb-1, exon 8 IVS < 1 44 peudo-MHC Q4d Qb-1 intron A (no splice consensus) IVS 83 190 peudo-MHC Q4d Qb-1 intron B (no splice consensus) BASE COUNT 54 a 52 c 70 g 85 t ORIGIN Chromosome 17. 1 tatagatata tctttgtata tatttttccc taggcaggga cagctcccag agctctgata 61 tgtttctctc aagattgtaa aggtgacatt ctatggcctg attgcagagg ggcactgtgg 121 acatggttgt gtttcaggga ctcccacaat cccctgtgag tggtgggttg ttgggatatt 181 gtcttcattg tggtggttcc tgaccctcat tctctatcat gaagacagct gcctggagtg 241 gacttagtga cagccagtgt g // LOCUS MUSMHQ5B 264 bp ds-DNA ROD 07-AUG-1990 DEFINITION Mouse MHC class I Q5B processed pseudogene (b haplotype), exons 7 and 8. ACCESSION M30897 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex; pseudogene. SOURCE Mouse adult (strain B10, haplotype b) liver, clone B2.5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 264) AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K., Dilwith,R. and Flaherty,L. TITLE Genetic polymorphisms of Q region genes from wild derived mice: Implications for Q region evolution JOURNAL Immunogenetics 31, 315-325 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Flaherty 04-JAN-1990. FEATURES from to/span description pept.ps / 45 81 pseudo-MHC Q5b, exon 7 (AA at 45) 194 225 pseudo-MHC Q5b, exon 8 IVS < 1 44 pseudo-MHC Q5b intron A (no splice consensus) IVS 82 193 pseudo-MHC Q5b intron B BASE COUNT 58 a 55 c 77 g 74 t ORIGIN Chromosome 17. 1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata 61 gatctctccc agatggtaaa ggtgacactc tgggacctga ttggggaggg gcaatgcgga 121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat 181 gttgtcttca cagtgatggg tcgtgtccct cattctctag catgaagaca gctgcctgga 241 ctgcactgag tgacagacga tgtg // LOCUS MUSMHQ5D 264 bp ds-DNA ROD 07-AUG-1990 DEFINITION Mouse MHC class I Q5d processed pseudogene (d haplotype), exons 7 and 8. ACCESSION M30898 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex; pseudogene. SOURCE Mouse adult (strain Balb/c, haplotype d) sperm, clone 65.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 264) AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K., Dilwith,R. and Flaherty,L. TITLE Genetic polymorphisms of Q region genes from wild derived mice: Implications for Q region evolution JOURNAL Immunogenetics 31, 315-325 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Flaherty 04-JAN-1990. FEATURES from to/span description pept.ps / 45 81 pseudo-MHC Q5d, exon 7 (AA at 45) 194 225 pseudo-MHC Q5d, exon 8 IVS < 1 44 pseudo-MHC Q5d intron A (no splice consensus) IVS 82 193 pseudo-MHC Q5d intron B BASE COUNT 58 a 53 c 77 g 76 t ORIGIN Chromosome 17. 1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata 61 gatctctccc agatggtaaa ggtgacactc tggggcctga ttggggaggg gcaatgtgga 121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat 181 gttgtcttca cagtgatggg tcatgtctct cattctctag catgaagaca gctgcctgga 241 ctgcactgag tgacagacga tgtg // LOCUS MUSMHQ6B 261 bp ds-DNA ROD 07-AUG-1990 DEFINITION Mouse MHC class I Q6b Qa-2 processed pseudogene (b haplotype), exons 7 and 8. ACCESSION M30899 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex; pseudogene. SOURCE Mouse adult (strain B10, haplotype b) liver, clone B2.17. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 261) AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K., Dilwith,R. and Flaherty,L. TITLE Genetic polymorphisms of Q region genes from wild derived mice: Implications for Q region evolution JOURNAL Immunogenetics 31, 315-325 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Flaherty 04-JAN-1990. FEATURES from to/span description pept.ps / 45 82 pseud-MHC Q6b Qa-2, exon 7 (AA at 46) 191 222 pseud-MHC Q6b Qa-2, exon 8 IVS < 1 44 pseud-MHC Q6b Qa-2 intron A (no splice consensus) IVS 83 190 pseud-MHC Q6b Qa-2 intron B (no splice consensus) BASE COUNT 53 a 52 c 71 g 85 t ORIGIN Chromosome 17. 1 tatagatata tctttgtata tatttttccc taggcaggga cagctcccag agctctgata 61 tgtttctctc aagattgtaa aggtgacatt ctatggcctg attgcagagg ggcactgtgg 121 acatggttgt gtttcaggga ctcccacaat cccctgtgag tggtgggttg ttgggatatt 181 gtcttcattg tggtggttcc tgaccctcgt tctctatcat gaagacagct gcctggagtg 241 gacttagtga cagccagtgt g // LOCUS MUSMHQ89DA 264 bp ds-DNA ROD 07-AUG-1990 DEFINITION Mouse MHC class I Q8/Q9d Qa-2 processed pseudogene (d haplotype), region surrounding exons 7 and 8. ACCESSION M30900 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex; pseudogene. SOURCE Mouse adult (strain Balb/c, haplotype d) sperm, clone 46.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 264) AUTHORS Tine,J.A., Walsh,A., Rathbun,D., Leonard,L., Wakeland,E.K., Dilwith,R. and Flaherty,L. TITLE Genetic polymorphisms of Q region genes from wild derived mice: Implications for Q region evolution JOURNAL Immunogenetics 31, 315-325 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Flaherty 04-JAN-1990. FEATURES from to/span description pept.ps / 45 81 peudo-MHC Q8/Q9d Qa-2, exon 7 (AA at 45) 194 225 peudo-MHC Q8/Q9d Qa-2, exon 8 IVS < 1 44 peudo-MHC Q8/Q9d Qa-2 intron A (no splice consensus) IVS 82 193 pseudo-MHC Q8/Q9d Qa-2 intron B (no splice consensus) BASE COUNT 57 a 54 c 78 g 75 t ORIGIN Chromosome 17. 1 tatgtacata tgcatataca ttttttaccc ttggcaggga cagctcctag agctctgata 61 gatctctccc aggtggtaaa ggtgacactc tgggacctga ttggggaggg gcaatgtgga 121 tatgattggg tttcagggac tccacgaatc ccctctgagt gagtggtggg ttgttggaat 181 gttgtcttca cagtgatggg tcgtgtccct cattctctag catgaagaca gctgcctgga 241 ctgcactgag tgacagacga tgtg // LOCUS PMUNE1 220 bp ds-DNA PHG 07-AUG-1990 DEFINITION Bacteriophage mu genomic left end. ACCESSION M34919 KEYWORDS . SEGMENT 1 of 2 SOURCE Bacteriophage mu DNA. ORGANISM Bacteriophage mu Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 220) AUTHORS Groenen,M.A.M., Timmers,E. and Van De Putte,P. TITLE DNA sequences at the ends of the genome of Bacteriophage mu essential for transposition JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2087-2091 (1985) STANDARD simple staff_review BASE COUNT 82 a 32 c 44 g 62 t ORIGIN 1 tgtattgatt cacttgaagt acgaaaaaaa ccgggaggac attggattat tcgggatctg 61 atgggattag atttggtggg gcttgcaagc ctgtagtgca aattttagtc cttaatcaat 121 gaaacgcgaa agatagtaaa aaattgcttt tgtttcattg aaaatacgaa aaacaaaaac 181 actgcaaatc atttcaataa cagcttcaaa aaacgttcaa // LOCUS PMUNE2 220 bp ds-DNA PHG 07-AUG-1990 DEFINITION Bacteriophage mu genomic right end. ACCESSION M34920 KEYWORDS . SEGMENT 2 of 2 SOURCE Bacteriophage mu DNA. ORGANISM Bacteriophage mu Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 220) AUTHORS Groenen,M.A.M., Timmers,E. and Van De Putte,P. TITLE DNA sequences at the ends of the genome of Bacteriophage mu essential for transposition JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2087-2091 (1985) STANDARD simple staff_review BASE COUNT 71 a 47 c 41 g 61 t ORIGIN 1 gtggtacaca aatttaatca gtatcgctac atcagattcc tgaacaaacg agcaaggaag 61 cggctaaata ccaaactatt caaggttcag gcatacccta agtgatcccc atgtaatgaa 121 taaaaagcag taattaatac atctgtttca tttgaagcgc gaaagctaaa gttttcgcat 181 ttatcgtgaa acgctttcgc gtttttcgtg cgccgcttca // LOCUS STYRGGA 103 bp ds-DNA BCT 07-AUG-1990 DEFINITION S.typhimurium 23S rRNA gene. ACCESSION M35803 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE S.typhimurium (strain ATCC 13311) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 103) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 > 103 23S ribosomal RNA BASE COUNT 23 a 22 c 38 g 20 t ORIGIN 1 ggaagatgta acggggctaa accatgcacc gaagctgcgg cagcgacgct gatgcgttgt 61 tgggtagggg agcgttctgt aagcctgcga aggtcacctg tga // LOCUS STYRGGB 104 bp ds-DNA BCT 07-AUG-1990 DEFINITION S.typhimurium 23S rRNA gene. ACCESSION M35804 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE S.typhimurium (strain LB5010) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 104) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 > 104 23S ribosomal RNA BASE COUNT 25 a 22 c 34 g 23 t ORIGIN 1 cctgcgcgga agatgtaacg gggctcaaac catacaccga agctacgggt tcatcctttg 61 gatgagcggt agaggagcgt tctgtaagcc tgtgaaggtg agtt // LOCUS STYRGGC 185 bp ds-DNA BCT 07-AUG-1990 DEFINITION S.typhimurium 23S rRNA gene. ACCESSION M35819 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE S.typhimurium (strain ATCC 13311) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 185) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 38 23S ribosomal RNA 145 > 185 23S ribosomal RNA site 39 144 23S rRNA intervening sequence BASE COUNT 49 a 36 c 60 g 40 t ORIGIN 1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac 61 aactttacgg agttgacgat tgacggagcg aagcgacgtc aaagcgttca ttaaagtcga 121 gttggcttag ggatacgttc gttgggtagg ggagcgttct gtaagcctgt gaaggtgtgt 181 cgtga // LOCUS YEPRGGA 85 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.pestis 23S rRNA gene. ACCESSION M35798 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.pestis (strain EV76) DNA. ORGANISM Yersinia pestis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 85) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 > 85 23S ribosomal RNA BASE COUNT 15 a 16 c 34 g 20 t ORIGIN 1 atgcaccgaa gctgcggcag cgacgcttag gcgttgttgg gtaggggagc gttctgtaag 61 ccgttgaagg tgacctgtga gggtt // LOCUS YEPRGGB 120 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.pseudotuberculosis 23S rRNA gene. ACCESSION M35799 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.pseudotuberculosis (strain YPIII/pIB1; serotype III) DNA. ORGANISM Yersinia pseudotuberculosis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 120) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 > 120 23S ribosomal RNA BASE COUNT 23 a 24 c 47 g 26 t ORIGIN 1 gtcggcctgc gcggaagatg taacggggct aaaccatgca ccgaagctgc ggcagcgacg 61 cttaggtgtt gttgggtagg ggagcgttct gtaagccgtt gaaggtgacc tgtgagggtt // LOCUS YEPRGGC 83 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35800 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 761/80; serotype 0:6,310) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 83) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 > 83 23S ribosomal RNA BASE COUNT 14 a 16 c 34 g 19 t ORIGIN 1 gcaccgaagc tgcggcagcg acgcttaggc gttgttgggt aggggagcgt tctgtaagcc 61 gttgaaggtg acctgtgagg gtt // LOCUS YEPRGGD 114 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35801 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 8081; serotype 0:8) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 114) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 > 114 23S ribosomal RNA BASE COUNT 23 a 24 c 43 g 24 t ORIGIN 1 gtcggcctgc gcggaagatg taacggggct aaaccatgca ccgaagctgc ggcagcgaca 61 cttaggtgtt gttgggtagg ggagcgttct gtaagccgtt gaaggtgacc tgtg // LOCUS YEPRGGE 104 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.bercovieri 23S rRNA gene. ACCESSION M35802 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.bercovieri (strain 3016/84; serotype 0:58,16) DNA. ORGANISM Yersinia bercovieri Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 104) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 > 104 23S ribosomal RNA BASE COUNT 21 a 21 c 38 g 24 t ORIGIN 1 cgcggaagat gtaacggggc taaaccatgc accgaagctg cggcagcttc gcttaggtgt 61 tgttgggtag gggagcgttc tgtaagccgt tgaaggtgac ctgt // LOCUS YEPRGGF 197 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35805 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 132; serotype 0:1) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 197) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 37 23S ribosomal RNA 159 > 197 23S ribosomal RNA site 38 158 23S rRNA intervening sequence BASE COUNT 52 a 43 c 60 g 42 t ORIGIN 1 agatgtaacg gggctaaacc atgcaccgaa gctgcggcag cgaacgtatc acttaaaacg 61 cttaacgtga cgcaaagccg gcatgttcaa gccacactaa acgttgagtt ggccggtgtg 121 ctgacaaacg aacaggcgtt ttaaggatac gttcgttggg taggggagcg ttctgtaagc 181 cgttgaaggt ggcctgt // LOCUS YEPRGGG 179 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35806 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 1142; serotype 0:2) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 179) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 19 23S ribosomal RNA 141 > 179 23S ribosomal RNA site 20 140 23S rRNA intervening sequence BASE COUNT 47 a 40 c 52 g 40 t ORIGIN 1 ccatgcaccg aagctgcggc agcgaacgta tcacttaaaa cgcttaacgt gacgcaaagc 61 cggcatgttc aagccacact aaacgttgag ttggccgatg tgctgacaaa cgaacaggcg 121 ttttaaggat acgttcgttg ggtaggggag cgttctgtaa gccgttgaag gtagtctgt // LOCUS YEPRGGH 206 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35807 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 6471/76; serotype 0:3) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 206) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 38 23S ribosomal RNA 160 > 206 23S ribosomal RNA site 39 159 23S rRNA intervening sequence BASE COUNT 54 a 43 c 65 g 44 t ORIGIN 1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacttaaaac 61 gcttaacgtg acgcaaagcc ggcatgttca agccacacta aacgttgagt tggccggtgt 121 gctgacaaac gaacaggcgt tttaaggata cgttcgttgg gtaggggagc gttctgtaag 181 ccgttgaagg tggtctgtga gggctg // LOCUS YEPRGGI 175 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35808 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 7500; serotype 0:5,27) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 175) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 10 23S ribosomal RNA 132 > 175 23S ribosomal RNA site 11 131 23S rRNA intervening sequence BASE COUNT 46 a 36 c 55 g 38 t ORIGIN 1 gaagctgcgg cagcgaacgt atcacttaaa acgcttaacg tgacgcaaag ccggcatgtt 61 caagccacac taaacgttga gttggccgat gtgctgacaa acgaacaggc gttttaagga 121 tacgttcgtt gggtagggga gcgttctgta agccgttgaa ggtgacctgt gaggg // LOCUS YEPRGGJ 190 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35809 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain Ruokola/71; serotype 0:9) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 190) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 30 23S ribosomal RNA 152 > 190 23S ribosomal RNA site 31 151 23S rRNA intervening sequence BASE COUNT 49 a 43 c 58 g 40 t ORIGIN 1 acggggctaa accatgcacc gaagctgcgg cagcgaacgt atcacttaaa acgcttaacg 61 tgacgcaaag ccggcatgtt caagccacac taaacgttga gttggccggt gtgctgacaa 121 acgaacaggc gttttaagga tacgttcgtt gggtagggga gcgttctgta agccgttgaa 181 ggtggcctgt // LOCUS YEPRGGK 192 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35810 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain C36; serotype 0:15) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 192) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 31 23S ribosomal RNA 153 > 192 23S ribosomal RNA site 32 152 23S rRNA intervening sequence BASE COUNT 50 a 42 c 59 g 40 t 1 others ORIGIN 1 aacggggcta aaccatgcac cgaagctgcg gcagcgaacg tatcacttaa aacgcttaac 61 gtgacgcaaa gccggcangt tcaagccaca ctaaacgttg agttggccgg tgtgctgaca 121 aacgaacagg cgttttaagg atacgttcgt tgggtagggg agcgttctgt aagccgttga 181 aggtggtctg tg // LOCUS YEPRGGL 177 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35811 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain E701; serotype 0:4,32) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 177) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 29 23S ribosomal RNA 139 > 177 23S ribosomal RNA site 30 138 23S rRNA intervening sequence BASE COUNT 43 a 40 c 57 g 37 t ORIGIN 1 cggggctaaa ccatgcaccg aagctgcggc agcgaacgta tcacccaagc caattcacgg 61 cagtggataa tgattgacgg agcgcagcga cgtcaatgcg tccaataaag tctagttggc 121 ttagggatac gttcgttggg taggggagcg ttctgtaagc cgttgaaggt ggcctgt // LOCUS YEPRGGM 191 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35812 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 8081; serotype 0:8) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 191) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 38 23S ribosomal RNA 148 > 191 23S ribosomal RNA site 39 147 23S rRNA intervening sequence BASE COUNT 51 a 39 c 63 g 38 t ORIGIN 1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac 61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt 121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg 181 acctgtgagg g // LOCUS YEPRGGN 158 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35813 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 5081; serotype 0:13a,13b) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 158) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 17 23S ribosomal RNA 127 > 158 23S ribosomal RNA site 18 126 23S rRNA intervening sequence BASE COUNT 41 a 33 c 51 g 33 t ORIGIN 1 atgcaccgaa cgtgcggcag cgaacgtatc acccaagaca attcacggca gtggataatg 61 attgacggag cgcagcgacg tcaatgcgtc caataaagtc gagttggctt agggatacgt 121 tcgttgggta ggggagcgtt ctgtaagccg ttgaaggt // LOCUS YEPRGGO 180 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35814 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 9312/78; serotype 0:13,18) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 38 23S ribosomal RNA 148 > 180 23S ribosomal RNA site 39 147 23S rRNA intervening sequence BASE COUNT 50 a 36 c 58 g 36 t ORIGIN 1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac 61 aattcacgga agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt 121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg // LOCUS YEPRGGP 180 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35815 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 874/77; serotype 0:20) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 38 23S ribosomal RNA 148 > 180 23S ribosomal RNA site 39 147 23S rRNA intervening sequence BASE COUNT 49 a 37 c 58 g 36 t ORIGIN 1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac 61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt 121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg // LOCUS YEPRGGQ 180 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35816 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain E736; serotype 0:21) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 38 23S ribosomal RNA 148 > 180 23S ribosomal RNA site 39 147 23S rRNA intervening sequence BASE COUNT 48 a 38 c 58 g 36 t ORIGIN 1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac 61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccactaaagt 121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg // LOCUS YEPRGGR 180 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.enterocolitica 23S rRNA gene. ACCESSION M35817 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.enterocolitica (strain 2139/72; serotype 0:34) DNA. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 180) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA < 1 38 23S ribosomal RNA 148 > 180 23S ribosomal RNA site 39 147 23S rRNA intervening sequence BASE COUNT 49 a 37 c 58 g 36 t ORIGIN 1 aagatgtaac ggggctaaac catgcaccga agctgcggca gcgaacgtat cacccaagac 61 aattcacggc agtggataat gattgacgga gcgcagcgac gtcaatgcgt ccaataaagt 121 cgagttggct tagggatacg ttcgttgggt aggggagcgt tctgtaagcc gttgaaggtg // LOCUS YEPRGGS 153 bp ds-DNA BCT 07-AUG-1990 DEFINITION Y.bercovieri 23S rRNA gene. ACCESSION M35818 KEYWORDS 23S ribosomal RNA; ribosomal RNA. SOURCE Y.bercovieri (strain 3016/84; serotype 0:58,16) DNA. ORGANISM Yersinia bercovieri Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 153) AUTHORS Skurnik,M. and Toivanen,P. TITLE Intervening sequences (IVS) in the 23S ribosomal RNA genes of pathogenic Yersinia enterocolitica strains. The IVSs in Y.enterocolitica and Salmonella typhimurium have common origin JOURNAL Unpublished (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Skurnik, 29-JUN-1990. Author address: M.Skurnik Turku University Dept of Medical Microbiology Kiinanmyllynkatu 13 SF-20520 Turku FINLAND email:mskurnik@finabo FEATURES from to/span description rRNA 110 > 153 23S ribosomal RNA site < 1 109 23S rRNA intervening sequence BASE COUNT 39 a 30 c 51 g 33 t ORIGIN 1 cagcgaacgt atcacccaag acaattcacg gcagtggata atgattgacg gagcgcagcg 61 acgtcaatgc gtccaataaa gtcgagttgg cttagggata cgttcgttgg gtaggggagc 121 gttctgtaag ccgttgaagg tgacctgtga ggg // LOCUS ECOINSW 49 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli insertion site for transposon Tn1545. ACCESSION M31752 KEYWORDS . SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 49) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 26 27 transposon Tn1545 insertion site BASE COUNT 12 a 10 c 0 g 27 t ORIGIN 1 cttttcccaa catttccttt tttattatta aaaatcattt ttttcttca // LOCUS ECOINSX 49 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli insertion site for transposon Tn1545. ACCESSION M31754 KEYWORDS . SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 49) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 26 27 transposon Tn1545 insertion site BASE COUNT 21 a 4 c 6 g 18 t ORIGIN 1 atttgaaaat gctgattctt ttatagataa aaatagcaat gcttaaata // LOCUS ECOINSY 49 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli insertion site for transposon Tn1545. ACCESSION M34227 KEYWORDS . SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 49) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 26 27 transposon Tn1545 insertion site BASE COUNT 16 a 8 c 5 g 20 t ORIGIN 1 tgttggattt catttcactt tctccataaa aaaaaacggt aatcatttt // LOCUS ECOINSZ 49 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli insertion site for transposon Tn1545. ACCESSION M35413 KEYWORDS . SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 49) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 26 27 transposon Tn1545 insertion site BASE COUNT 18 a 6 c 11 g 14 t ORIGIN 1 cacctcatac atttgatgtt tatcagtaaa aaaatgggag gattaaggg // LOCUS ECOT1545A1 136 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli/transposon Tn1545 5' junction. ACCESSION M35311 KEYWORDS . SEGMENT 1 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 136) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 26 27 E.coli end/ transposon Tn1545 start BASE COUNT 49 a 28 c 10 g 49 t ORIGIN 1 cttttcccaa catttccttt tttattttaa aaaatagcat aaaaatctag ttatccgcat 61 aaaaactgga cttatcacac tttatcaagg tcaaaaccac tcaatttact actaatttac 121 tacttatgaa tgagct // LOCUS ECOT1545A2 212 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli/transposon Tn1545 3' junction. ACCESSION M35315 KEYWORDS . SEGMENT 2 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 212) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 189 190 transposon Tn1545 end/ E.coli start BASE COUNT 84 a 29 c 28 g 71 t ORIGIN 1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac 61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac 121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact 181 ttgtttatta ttaaaaatca tttttttctt ca // LOCUS ECOT1545B1 135 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli/transposon Tn1545 5' junction. ACCESSION M35312 KEYWORDS . SEGMENT 1 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 135) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review staff_entry FEATURES from to/span description recomb 26 27 E.coli end/ transposon Tn1545 start BASE COUNT 53 a 23 c 14 g 45 t ORIGIN 1 atttgaaaat gctgattctt ttatagataa aaatagcata aaaatctagt tatccgcata 61 aaaactggac ttatcacact ttatcaaggt caaaaccact caatttacta ctaatttact 121 acttatgaat gagct // LOCUS ECOT1545B2 212 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli/transposon Tn1545 3' junction. ACCESSION M35316 KEYWORDS . SEGMENT 2 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 212) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 189 190 transposon Tn1545 end/E.coli start BASE COUNT 88 a 29 c 30 g 65 t ORIGIN 1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac 61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac 121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact 181 ttgtttctta taaaaatagc aatgcttaaa ta // LOCUS ECOT1545C1 135 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli/transposon Tn1545 5' junction. ACCESSION M35313 KEYWORDS . SEGMENT 1 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 135) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 26 27 E.coli end/ transposon Tn1545 start BASE COUNT 48 a 27 c 13 g 47 t ORIGIN 1 tgttggattt catttcactt tctccattaa aaatagcata aaaatctagt tatccgcata 61 aaaactggac ttatcacact ttatcaaggt caaaaccact caatttacta ctaatttact 121 acttatgaat gagct // LOCUS ECOT1545C2 212 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli/transposon Tn1545 3' junction. ACCESSION M35317 KEYWORDS . SEGMENT 2 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 212) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 189 190 transposon Tn1545 end/ E.coli start BASE COUNT 88 a 28 c 31 g 65 t ORIGIN 1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac 61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac 121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact 181 ttgttttgat aaaaaaaaac ggtaatcatt tt // LOCUS ECOT1545D1 135 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli/transposon Tn1545 5' junction. ACCESSION M35314 KEYWORDS . SEGMENT 1 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 135) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 26 27 E.coli end/ transposon Tn1545 start BASE COUNT 51 a 28 c 13 g 43 t ORIGIN 1 cacctcatac atttgatgtt tatcagctaa aaatagcata aaaatctagt tatccgcata 61 aaaactggac ttatcacact ttatcaaggt caaaaccact caatttacta ctaatttact 121 acttatgaat gagct // LOCUS ECOT1545D2 212 bp ds-DNA BCT 07-AUG-1990 DEFINITION E.coli/transposon Tn1545 3' junction. ACCESSION M35318 KEYWORDS . SEGMENT 2 of 2 SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 212) AUTHORS Caillaud,F. and Courvalin,P. TITLE Nucleotide sequence of the ends of the conjugative shuttle transposon Tn1545 JOURNAL Mol. Gen. Genet. 209, 110-115 (1987) STANDARD simple staff_review FEATURES from to/span description recomb 189 190 transposon Tn1545 end/ E.coli start BASE COUNT 87 a 26 c 37 g 62 t ORIGIN 1 gttcttgctt agtagtacaa atgaatttac tacttattta ccacttctga cagctaagac 61 atgaggaaat atgcaaagaa acgtgaagta tcttcctaca gtaaaaatac tcgaaagcac 121 atagaataag gctttacgag catttaagaa aatataaaaa gataattaga aatttatact 181 ttgttttgat aaaaaaatgg gaggattaag gg //
GenBank-Updates@genbank.bio.net (08/08/90)
LOCUS MUSMHC4B1 278 bp ds-DNA ROD 08-AUG-1990 DEFINITION Mouse MHC class II H2-C4-w7 gene (C3H, w7), 5' flank, segment 1. ACCESSION M15601 KEYWORDS class III gene; complement protein; complement system protein; major histocompatibility complex; protease. SEGMENT 1 of 2 SOURCE Mouse (C3H.W7) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 278) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. TITLE Recombination of two homologous MHC class III genes of the mouse (C4 and Slp) that accounts for the loss of testosterone dependence of sex-limited protein expression JOURNAL J. Immunol. 138, 620-627 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 278; revises [1]) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. JOURNAL Unpublished (1990) STANDARD simple staff_review FEATURES from to/span description revision 81 82 ca in [2]; ac in [1] BASE COUNT 67 a 94 c 62 g 55 t ORIGIN Chromosome 17. 1 gtcaacagtt attgcacatg cttaatgtaa acctgtagcc tctacatgct tgtgtacctg 61 cccagacagt atatgacacc cacacccaca cccacaccca cacccacacc caccccaccc 121 cacttcacac ccaggcagca gtagtggacc caacagggta agtgggtccc tgcttgggtt 181 gcctgtgttt ctggaagctc aaggtccatg catcagcagt aggggaaggg agggttctca 241 gcccaggctg catcatcatg gccgcctacc acctgcag // LOCUS MUSMHSLQ2 314 bp ds-DNA ROD 08-AUG-1990 DEFINITION Mouse MHC class III H2-Slp-w7d gene (C3H,w7), 5' flank, segment 2. ACCESSION M15609 KEYWORDS class III gene; complement protein; complement system protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (C3H.W7) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 314) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. TITLE Recombination of two homologous MHC class III genes of the mouse (C4 and Slp) that accounts for the loss of testosterone dependence of sex-limited protein expression JOURNAL J. Immunol. 138, 620-627 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 314; revises [1]) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. JOURNAL Unpublished (1990) STANDARD simple staff_review FEATURES from to/span description revision 119 121 cac in [1]; c in [1] BASE COUNT 81 a 88 c 72 g 73 t ORIGIN About 1064 bp downstream from segment 1; chromosome 17. 1 tcgactgggg aacccattct agttttgtta ggacctttac ctgtcttatc tatgtgtcat 61 gttggagctg aatacacgca cacatgggca agcacatgta ttcatacaca cacacacaca 121 cccacacgca catgcagagg gacacgcaca cgcacacaaa cacacacaca aggggaagtc 181 cctggtactg ggaataaaca agcaccagat caattttggg gccaagattt ctacagagac 241 cctggtgttt gggttgcctc cccagctctg gcctagggcc agttctgttt cttggccatc 301 acgtggtttc ccgg // LOCUS MUSMHSLR2 320 bp ds-DNA ROD 08-AUG-1990 DEFINITION Mouse MHC class III H2-Slp-w7c gene (C3H,w7), 5' flank, segment 2. ACCESSION M15610 KEYWORDS class III gene; complement protein; complement system protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (C3H.W7) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 320) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. TITLE Recombination of two homologous MHC class III genes of the mouse (C4 and Slp) that accounts for the loss of testosterone dependence of sex-limited protein expression JOURNAL J. Immunol. 138, 620-627 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 320; revises [1]) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. JOURNAL Unpublished (1990) STANDARD simple staff_review FEATURES from to/span description revision 124 126 aca in [2]; a in [1] BASE COUNT 83 a 92 c 71 g 74 t ORIGIN About 1064 bp downstream from segment 1; chromosome 17. 1 tcgactgggg aacccattct agttttttta ggacctttac ctgtcttatc tatgtgtcat 61 gttggagctg aatacacgca cacatgggca agcacatgta ttcgtacaca cacacacaca 121 cacacaccca cacgcacatg cacagggaca cgcacacgca cacaaacaca cacacaaggg 181 gaagtccctg gtactgggaa taaacaagca ccagatcaat tttggggcca agatttctac 241 agagaccctg gtgtttgggt tgcctcccca gctctggcct agggccagtt ctgtttcttg 301 gccatcacgt ggtttcccgg // LOCUS MUSMHSLS2 294 bp ds-DNA ROD 08-AUG-1990 DEFINITION Mouse MHC class III H2-Slp-w7b gene (C3H,w7), 5' flank, segment 2. ACCESSION M15611 KEYWORDS class III gene; complement protein; complement system protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (C3H.W7) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 294) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. TITLE Recombination of two homologous MHC class III genes of the mouse (C4 and Slp) that accounts for the loss of testosterone dependence of sex-limited protein expression JOURNAL J. Immunol. 138, 620-627 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 294; revises [1]) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. JOURNAL Unpublished (1990) STANDARD simple staff_review FEATURES from to/span description revision 232 232 a in [2]; g in [1] BASE COUNT 76 a 80 c 66 g 72 t ORIGIN About 1064 bp downstream from segment 1; chromosome 17. 1 tcgactgggg aacccattct agttttgtta ggacctttac ctgtcttatc tatgtgtcat 61 gttggagctg aatacacgca cacatggaca agcacatgta ttcgtacaca cacacacaca 121 cacacgcaca cgcacacaaa cacacacaca aggggaagtc cctggtactg ggaataaaca 181 aggaccagat caattttggg gccaagattt ctacagagac cctggtgttt gagttgcctc 241 cccagctctg gcctagggcc agttctgttt cttggccatc acgtggtttc ccgg // LOCUS MUSMHSLT1 253 bp ds-DNA ROD 08-AUG-1990 DEFINITION Mouse MHC class III H2-Slp-w7a gene (C3H,w7), 5' flank, segment 1. ACCESSION M15605 KEYWORDS class III gene; complement protein; complement system protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Mouse (C3H.W7) liver DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 253) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. TITLE Recombination of two homologous MHC class III genes of the mouse (C4 and Slp) that accounts for the loss of testosterone dependence of sex-limited protein expression JOURNAL J. Immunol. 138, 620-627 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 253; revises [1]) AUTHORS Nakayama,K., Nonaka,M., Yokoyama,S., Yeul,Y.D., Pattanakitsakul,S.-N. and Takahashi,M. JOURNAL Unpublished (1990) STANDARD simple staff_review FEATURES from to/span description revision 95 96 ct in [2]; ta in [1] revision 138 144 tgggtc in [2]; gg in [1] revision 148 150 ctt in [2]; c in [1] revision 174 176 tca in [2]; t in [1] revision 187 189 atc in [2]; a in [1] BASE COUNT 58 a 76 c 62 g 57 t ORIGIN Chromosome 17. 1 gtcaacagtt attgcacata cttaatgtaa acctgtagcc tctacatgct tgtgtacctg 61 cccagacagt atatgacacg cacacaccca cacactcccc cacacccagg cagctgtggt 121 ggacccaaca gggtaagtgg gtccctgctt gggttgcctg tgtttctgga agctcaatgt 181 ccacgcatca gcagttgggg aagggagggt tctcagccca ggctgcatca tcatggctgc 241 ctaccacctg cag // LOCUS BOVCMPR 7913 bp ss-mRNA MAM 08-AUG-1990 DEFINITION Bovine cation-independent mannose 6-phosphate receptor protein mRNA, complete cds. ACCESSION J03527 M15869 KEYWORDS cation-independent mannose 6-phosphate receptor protein; insulin-like growth factor II receptor protein. SOURCE Bovine (calf), cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 3267 to 7913) AUTHORS Lobel,P., Dahms,N.M., Breitmeyer,J., Chirgwin,J.M. and Kornfeld,S. TITLE Cloning of the bovine 215-kDa cation-independent mannose 6-phosphate receptor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2233-2237 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 7913; revises [1]) AUTHORS Lobel,P., Dahms,N.M. and Kornfeld,S. TITLE Cloning and sequence analysis of the cation-independent mannose 6-phosphate receptor JOURNAL J. Biol. Chem. 263, 2563-2570 (1988) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.Lobel, 22-JUL-1987 and for [2] 23-NOV-1988. FEATURES from to/span description pept 153 7652 cation-independent mannose 6-phosphate receptor protein BASE COUNT 1665 a 2401 c 2413 g 1434 t ORIGIN 1 cgcccctcga gctcccccgg ctcgtccgtg gctcctcgcg ctcctgtccg tctccgcctg 61 ccgccctggc tgcacacccg tgtcccggac gccgcccccg gccgccgcgc gccgctcgcc 121 cggctccgga cgcgcagccc gggcccggcg cgatggaggc ggccgccggc cggagctcgc 181 acctggggcc cgcgcccgcc gggcgcccgc cgcggtgccc gctcctgctg cagctgcagc 241 tgctgctgct gctgctgctg ctgccgccgg gctgggttcc cggggccgcg ggcacccagg 301 gcgccgagtt cccagagctg tgcagttata catgggaagc agtggatacc aaaaataaca 361 tgctttataa aatcaacatc tgtggaaata tgggtgttgc ccagtgtgga ccatcaagtg 421 ctgtctgtat gcatgacttg aagacagaca gctttcattc tgtgggtgac tctcttttga 481 aaacagcaag cagatctctt ctggaattta acacaacagt gaactgtaag cagcagaatc 541 acaaaattca gagtagcatc accttcttat gtgggaaaac cttgggaact cccgagtttg 601 taactgcaac agattgtgtg cattacttcg agtggaggac tactgcagcc tgcaaaaaga 661 atatatttaa agcgaataaa gaggtgccct gttacgcttt cgacagagag ctcaagaagc 721 acgatttaaa cccactgatc aagaccagcg gtgcttactt ggtggacgac tctgacccgg 781 acacatctct gttcatcaat gtctgcaggg acatagaggt gctccgggcc tcgagtccac 841 aagtgcgcgt gtgtcccacc ggcgcggccg cctgcctggt gcgaggggac cgcgcgttcg 901 acgtgggccg gccccaggag gggctgaagc tcgtgagcaa tgacaggctc gtcctgagtt 961 acgtgaagga aggggccggc cagcccgact tctgtgacgg ccacagcccg gcggtgacca 1021 tcacgttcgt gtgcccgtcg gagcgcagag agggcaccat tcccaagctc acagcgaaat 1081 ccaactgccg ctttgagatc gagtgggtca ccgagtacgc ctgccacagg gattacctgg 1141 aaagccggag ctgctccctg agcagcgcgc agcatgacgt ggccgtcgac ctccagccgt 1201 tgagccgggt ggaagcctca gactccttgt tctacacctc ggaggcggac gagtatacat 1261 attatttgag catctgcgga ggaagccaag cgcccatctg taataagaaa gatgctgcag 1321 tgtgccaagt gaaaaaggca gattccactc aagtcaaagt ggccgggaga ccccagaacc 1381 tgaccctccg gtactcggat ggagacctca ccttgatcta tttcgggggt gaagagtgca 1441 gctccggctt ccagcggatg agtgtcatca acttcgagtg caatcagaca gcaggtaaca 1501 atggcagagg ggctcctgtg ttcaccgggg aggtggactg cacctacttc ttcacgtggg 1561 atacgaagta cgcctgtgtc cacgagaagg aggccctgct gtgcggcgtc tccgacggga 1621 aacagcgctt cgacctgtcg gcgctggccc ggcactcaga actggaacaa aattgggaag 1681 ctgtggatgg cagtcagagg gaagcagaaa agaagcattt cttcattaac atctgccaca 1741 gggtcctgca gacgggccag gcacggggct gccccgaaga cgcggccgtg tgtgccgtgg 1801 ataagaatgg aagtaaaaat ctgggcagat ttatttcttc tcccaccaga gagaaaggaa 1861 atattcagct ctcttactca gatggtgatg agtgcggtgg tggccagaag ataataacaa 1921 atataacact catgtgcaaa ccaggtgatt tagaaagtgc cccggtgctg acaacctcca 1981 gggctgacgg ctgcttctac gagtttgagt ggcgcacggc tgcagcctgc gtgctctcca 2041 ggaccgaggg ggacaactgc actgtctttg actcccaggc agggttttct ttcgacttga 2101 cgcctctcac gaagaaggac gcctacaagg tcgagacgga caagtacgag ttccacatca 2161 acgtgtgcgg cccggtgtcc gtgggcgcct gcccgccgga ctcgggggcc tgtcaggtgt 2221 ccaggagtga taggaagtct tggaacttgg gacgcagcaa tgctaagctt tcgtattacg 2281 acgggatgat ccagctgacc tacagggacg gcacacccta caataacgag aagcgcacgc 2341 cgagagccac gctcatcacc ttcctctgtg accgagacgc cggagtgggt ttccccgaat 2401 atcaggagga agataactct acatacaact tccggtggta caccagttac gcctgcccgg 2461 aggagccgct ggagtgcatc gtgactgacc ccgtcacact ggaccagtac gacctctcca 2521 ggctagcgaa atccgagggc ggtcctgggg gaaactggta ctctctggac aacggcgggg 2581 cacgcagcac gtggcggaag tactacatca acgtgtgtcg tcccctgaac ccggtgccgg 2641 gctgtgaccg ctacgcgtcc gcctgtcaga tgaagtacca gggcgagcag ggctcgtact 2701 ctgagaccgt ctccatcagc aacctggggg tggcgaagac gggccccatg gtggaggaca 2761 gcggcagcct gctcctggag tacgtcaacg gctccgcctg caccaccagc gaccagaggc 2821 gcaccaccta caccaccagg atccaccttg tctgctctac cggcagcctg tatacccatc 2881 ccatattttc tctcaactgg gagtgtgtgg tcagcttcct gtggaacacg gcggcagcct 2941 gtcctatccg aatcaccacg gacatagacc aggtctgctc catcaaggac cccaacagcg 3001 ggtacgtgtt tgatctgaac ccactgaaca attcccgagg atacgtggtt ttgggcatcg 3061 ggaagacgtt tctgttcaac gtgtgcggtg acatgcccgc ctgtggcacc ctggatggga 3121 agccagcttc cggctgcgag gcagaagtcc agatggacga catgaagacc ctgaagccgg 3181 gcaggctggt gggcctggag aagagcctgc agctgtccac cgagggcttt ataaccctga 3241 actacacggg gcttccttcc caccccaacg ggagggctga tgccttcatc atccgcttcg 3301 tctgcaatga tgacgtttac ccagggacac ccaagttcct gcaccaggac atcgactcta 3361 gcctggggat ccgggacact ttcttcgagt ttgaaaccgc gctggcctgt gtaccttctc 3421 cggtagattg ccaagtcaca gaccccgccg ggaacgagta tgatctgagt ggcctgagca 3481 aggccaggaa gccgtggact gcggttgaca cgttcgatga ggggaagaag aggaccttct 3541 acctgagcgt gtgcacgcct ctcccgtaca ttcccggctg ccacggcacc gctgtggggt 3601 gctgcctggt gacggaagac agcaagttga acctaggcgt cgtgcagatc agtcctcagg 3661 tgggcgccaa cgggtccctg agcctcgtct acgtcaacgg ggacaagtgc aagaaccagc 3721 gtttctccac caggataaac ctcgagtgtg cccacacaac gggctccccg acctttcagc 3781 tccagaacga ctgtgagtat gtgtttctct ggagaaccgt ggaagcctgt cccgtcgtgc 3841 gtgcggaagg agactactgc gaggtgagag acccaaggca cggcaacctg tataacctga 3901 tacctcttgg tctgaacgac actgtcgtgc gggccggcga atacacctat tacttccgcg 3961 tctgcggaga gctgacatcc ggcgtctgcc caaccagtga caagtccaag gtcatctcat 4021 catgccagga aaagcgggga ccccagggat ttcaaaaagt ggcaggtctg tttaatcaga 4081 agctgaccta cgagaatggg gtgctgaaga tgaactacac cgggggcgac acctgccaca 4141 aggtgtacca gcgttccacc accatctttt tctactgcga ccgcagcacg caggcgcccg 4201 tgtttctcca ggagacgtcc gattgctcct acctgtttga gtggcgcacg cagtacgcct 4261 gcccgcccta cgacctgacc gagtgttcgt tcaaaaacga ggctggggaa acctacgacc 4321 tctcgtctct gtcgaggtac agcgacaact gggaggctgt cacgggcaca gggtccaccg 4381 agcactacct catcaacgtg tgcaagtccc tgtccccgca ggctggctca gatccgtgcc 4441 ctccggaggc ggccgtgtgt ctgctgggcg gccccaagcc cgtgaacctg ggcagggtgc 4501 gggacagtcc tcagtggagc cagggcttga ccctcctgaa gtacgttgac ggtgacttgt 4561 gtccagacca gattcggaag aagtcaacca ccatccgctt cacgtgcagc gagagccacg 4621 tgaactccag gcccatgttc atcagcgctg tggaggactg cgaatacacc ttctcctggc 4681 ccacggccgc cgcctgcgcg gtcaagagca acgtgcacga cgactgccag gtcaccaacc 4741 ccgccacggg acacctgttt gacctgagct ctctgagcgg ccgggccggc ttcaccgccg 4801 cctacagcga gaaggggtta gtctacctca gcgtgtgcgg ggacaacgag aactgcgcca 4861 acggcgtggg ggcctgcttt gggcagacca ggatcagcgt gggcaaggcg agcaagaggc 4921 tgacctacgt ggaccaggtc ttgcaactgg tgtatgaggg cggctccccc tgcccctcca 4981 agaccggcct gagctacaag agcgtcatca gcttcgtgtg caggcctgag gtcgggccca 5041 ccaacaggcc catgctgatc tccctggaca agcgcacgtg cacgcttttc ttctcctggc 5101 acacacccct ggcctgcgag cagacgaccg agtgctccgt gaggaacggc agctccctca 5161 tcgacctgtc cccactcatc caccgcaccg ggggttacga agcatacgat gagagtgagg 5221 acgacggctc cgacaccagc cctgacttct acatcaacat ctgccagccg ctcaacccca 5281 tgcacgggtt ggcctgcccc gccggcacgg ccgtgtgcaa ggttcccgtg gacggccccc 5341 cgatagatat tggccgagtg gcaggacctc cgatcctcaa tcccatagcc aacgaagttt 5401 acttgaactt tgaaagcagc actccttgct tggcggaccg gcacttcaac tacacctcac 5461 tgatcacgtt ccactgtaag cggggcgtga gcatgggaac gccaaaactg ctgaggacca 5521 gtgtgtgtga cttcgtgttt gagtgggaga ctcctctggt ctgtcccgac gaagtgaaga 5581 cggacggctg ctccctcacg gacgaacagc tgtactacag cttcaacctg tccagcctct 5641 ccaagagcac cttcaaggtg acccgaggcc cgcacaccta cagtgtgggg gtgtgcaccg 5701 cagccgcagg cctggacgaa ggaggctgca aggacggtgc tgtctgcctg ctgtccggga 5761 gcaagggggc gtctttcggg cggctggcgt ccatgaagct ggactacagg catcaggacg 5821 aagctgtcat cctgagttac gccaacggag acacttgccc tccggaaact gaggacggcg 5881 agccgtgtgt gttccccttc gtgttcaacg ggaagagcta cgaggagtgt gttgtggaga 5941 gcagggccag gctctggtgc gcgaccaccg ccaactacga cagagaccac gagtggggct 6001 tctgcaagca ctccaccagc caccggacgt ccgtcatcat cttcaagtgt gacgaggacg 6061 ccgacgtggg gcggccccag gtcttcagcg aggtgcgcgg ctgtgaggtg accttcgagt 6121 ggaagacgaa ggtggtctgc cccccgaaga agatggagtg caagttcgtc cagaagcacc 6181 ggacctacga cctgcggctg ctctcgtccc tcaccggctc ctggtccttc gtccacaacg 6241 gagcctcgta ctacatcaac ctgtgtcaga aaatatacaa gggaccccag gactgctcgg 6301 agagagccag cgtgtgcaaa aagagcacct ctggcgaggt gcaggtcctc gggctcgttc 6361 acacacagaa gctggatgtt gtagatgaca gagtcatcgt aacttactct aaaggccact 6421 actgtgggga caataagaca gcgtctgctg tcatcgagct gacctgtgcc aagacagtgg 6481 ggcggccttc gttcacgagg ttcgacgtcg acagctgcac ctaccacttc agctgggact 6541 cacgagcggc ctgcgccgtg aagcctcagg aggtgcagat ggtgaatggg accatcacca 6601 acccggccaa cggccggagc ttcagcctcg gggatattta cttcaaacga ttcagcgcct 6661 ctggggacgt gagaaccaac ggggacaggt acatctacga gatccagctg tcgtccatca 6721 cgggctccag cagccccgcc tgctctgggg ccagcatctg ccagaggaag gccaacgacc 6781 agcacttcag tcgcaaagtc ggaacctcca accaaaccag atactacgtt caagatggcg 6841 acctggatgt ggtgttcacc tcgtcctcca agtgtggaaa agacaagaca aagtctgtgt 6901 cctccaccat cttcttccac tgtgaccccc tggtgaagga cgggatcccc gagttcagcc 6961 acgagactgc cgactgccag tacctcttct cctggcacac ctctgccgtg tgcccgctgg 7021 gggcgggctt cgacgaggag atcgcagggg atgacgccca ggagcacaaa gggctctcag 7081 agcgcagcca ggcggtcggg gcagtgctca gcctgctgct ggtggcactc accgcctgcc 7141 tgctcaccct gctgctgtac aagaaggagc gccgggagat ggtaatgagc aggcttacca 7201 actgctgccg ccggagcgcg aacgtgtcct acaagtactc gaaggtgaac aaggaggagg 7261 aggccgacga gaacgagacc gagtggctga tggaggagat ccagccgccg gcgccgcggc 7321 ccgggaagga gggccaggag aacgggcacg tggccgccaa gtcggtgaga gctgccgaca 7381 cgctgagcgc cctgcacggc gacgagcagg atagcgagga cgaggtcctg acgctgcccg 7441 aggtgaaggt gcgcccgcca ggccgggctc ctggtgccga aggtggcccc ccgctgcggc 7501 cactgcctcg gaaggcgcca ccgccgctgc gggcggacga ccgggtgggg ctggtgcgtg 7561 gggagccggc gcgccggggg cggccccggg cggcggccac gcccatcagc accttccacg 7621 acgacagcga cgaggacctc ctgcacgtct aggctcgccc gcgccggctc cgaccaaatc 7681 cgatgggact ccgtgatgct tctgtccttg gcctttaacg aaaactgtcc aaaaaaggga 7741 agagtgttgg tggtggggga ggaggggggc gcccctctcc gtgggcacgg ggaggaggcg 7801 cggccgctga tggccgcgcc agccccgggt ctgtccccag ccctcacctt gagcgtggcc 7861 gccgtgcgct tcctaaaggc gccggggcca gacgcgtctt gggactgagg gct // LOCUS HUMHBBC 1552 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human hemoglobin-related sequence across the breakpoint for Indian delta-beta thalassemia. ACCESSION M37467 KEYWORDS hemoglobin. SOURCE Human (case V.1) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1552) AUTHORS Mishima,N. and Gilman,J.G. TITLE The 3' breakpoint of the Indian delta-beta thalassemia deletion JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.G.Gilman, 01-AUG-1990. Author address: J.G.Gilman, Montefiore Medical Sickle Cell Center, 111 East 210th Street, The Bronx, NY 10467 This sequence of 1552 bases connects the sequence of <HUMHBB> at position 73309, with the sequence beginning at position 15 under accession number M33706 (position 1464 of this sequence). In <HUMHBB> coordinates, this sequence begins at 73309 and ends at 74860. FEATURES from to/span description BASE COUNT 447 a 271 c 299 g 535 t ORIGIN 1 ttttctcagt cagttaacat tccttcaact agattagttg tgacaaaatt tcaggccttg 61 ctcaactcta acctcagtct ctccatatat ttctgagagt cttccttctt atttgtttat 121 aagaatttag taaatggcaa tattgagaag tagaaaattt tggagacttg tttttcatga 181 accagaacaa tctcaaagca gcaatactaa acaggagcca agcctagaat aaagttgaat 241 ttctggaatc aacaagctag gggaaagtct cccattaagt caaatatttg acaactggga 301 ggtcaagtca gaggtgaaaa atgattaaag ataacaagag gtctgaaaac aaagtgacca 361 attaaaataa aaagaaagag ccaagctggg cacagtggct tgcatctgta gtcccagtaa 421 cccaggaggc tgaagtgata ggatcacttg agcccaggat ttcagggttg cagtgagcaa 481 taatgacacc actgctctct agccttggtg actgagatcc tgtctctgag gaaaaacaaa 541 caaacaagca aaaaacaaac aaacaaacac acagatatta agaacataga agttttgtta 601 cggtgatgat attctccaac acaggtctga accccttgca gtgcctcggg gctgtttctc 661 cagtggattc ttgggttctt cacatcttct caactttgtg aaaagatatt catccaaagg 721 tctcacctgt cttattcaga gcttggattt tggactagct ttttctattt cttgggattc 781 tctagagcca tatattcaga tataacacag atataaccaa aagaagcaaa ggaggcatca 841 cagaaggggc tttgtattct ggggaggggg acatagagtt gagagaaata caggaattta 901 taggactcta ctcaaagagc cctcaaataa agtacggcta gtcttctttg cctaagacaa 961 tgagcagtcc aggaagttat ccatttcttt tagattttct agtttatgta catagagatg 1021 ttcatagtag tttttgatga ttatttgtat ttctgtgggg tgagtgataa tattcccttt 1081 tcatttctaa ctgtgcttat ttggatcttc tctcttgtct tctttattag tctagctagt 1141 gatctacctt attaattttt tcacaaaacc agctcttgga ctcattaatc ttttgaatgg 1201 tattttgtgt tttaatatcc ttcagttagc tttgatcttg tttatttctt gtcttctgct 1261 agatttaggg ttggtttgct cttggttctc tggttctttt agttgtgaca ttaggttgtt 1321 aatttgaggg ctttaagact ttttgatgtg ggcatttagt gtataaattt ctctcttaac 1381 actgtctaag ctgtgtccca gagattccgg tatattgtat ctttgttctc attagtttca 1441 aagaactatt ttatttttgc cttagttttc attatttacc caaaattcat tcaggagcag 1501 gttgtttaat ttccatgtaa ttgcatagtt ttcagctatt cattagtctt ga // LOCUS HUMHBBTHMU 2337 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human hemoglobin DNA with a deletion causing Indian delta-beta thalassemia. ACCESSION M33706 KEYWORDS hemoglobin. SOURCE Human (patient B.T.) leukocyte DNA. REFERENCE 1 (bases 1 to 2337) AUTHORS Mishima,N. and Gilman,J.G. TITLE The 3' breakpoint of the Indian delta-beta thalassemia deletion JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.G.Gilman, 01-AUG-1990. Author address: J.G.Gilman, Montefiore Medical Sickle Cell Center, 111 East 210th Street, The Bronx, NY 10467 FEATURES from to/span description recomb 15 17 overlap of 5' end 3' breakpoint DNA BASE COUNT 942 a 462 c 401 g 529 t 3 others ORIGIN 1 aagcttgggt tttgagtttt cattatttac ccaaaattca ttcaggagca ggttgtttaa 61 tttccatgta attgcatagt tttcagctat tcattagtct tgaaatctat ttttattcct 121 ctgtggtctg agagtgattg atatgatttt tttgcatttg ctgaggattg ttttaggtcc 181 tcccaaggct gaaccaggaa gaaatcaatt tcctgaagag atccatgatg agctctgaaa 241 atgattcagt aataaatagc ctatcaaaca aaaaaagccc tggacaatat gattcacaac 301 caaattctat cagatgcaca aagaagagtt ggaaaaattc ctactgaagc tactccaaaa 361 aattgagggg gagggattca acctcagctc attttatgag gccagcatca tcctgatacc 421 aaaacctgat agacacacaa caacaaaaaa agaaaacttc aggccaatat ccttgaacat 481 tgatgcaaaa atactagcaa gccaaatcca gcaacacatc acaaagctaa tccacctcaa 541 tcaagtaggc tttatccctg agaagcaaga ttggttcagc atatacaaat tgataaatgt 601 gattcatcat atatacagca ctaaaaacag aaaccacatg atcatttcaa tatgtgcaaa 661 aaaagatttt gataaaattc aacttcactt catattaaaa cctctcaaca aactagctat 721 ttaaggacat gacctcaaaa taatatgagc catctatgac aaacccacag ccaacatcat 781 acagaatggg caaaacctgg aagcattctt cttgaaaact ggcacaagac aaggatgccc 841 tctctcacca ctgctattca gtatagtact ggcagtcctg gccagagcaa tcaggcaaga 901 gaagggaata aagggcatcc aaataggaag agagaaagtc aaagtatccc tgttttctgt 961 atctagaaaa caccatagtc aaggcccaaa agctccttca gctgataact tcagcaaagt 1021 ttcaggatat aaaatcaatg tacaaaaatc actagcattc ccatacatca acaatagcca 1081 agcctagagc caaatcagga agtcaatccc atttaccgtt gccacaaaaa gaataaaata 1141 cctaaggata tatctaacca aggagttgaa agatctctac aatgagattt acaaaacact 1201 gctcaaagaa atcagagctg atagaaacaa atggaaaaac tttccatgct catggataaa 1261 aagaatcaat attgttaaaa tgtccatact gcctagagca atttacagtt tcaaagctat 1321 tcctatcaaa ctaccaatga cattcttcac agaactagaa aaatacactt taaagttcat 1381 atggaacaac aacaaaaaaa gtccaaatag cctaggaaat cctaatactc acaggtggga 1441 attgaaaaat gagaacactt ggacacaggg tggggaacat cacacaccgg ggcctgtcgt 1501 ggggtggtag gagagnggag ggataacatt aggagatata tctaatgtaa atgacgagtt 1561 aacgggtgca gcacaccaat atggcacacg tatacatatg taacaaactg cacaggtacc 1621 gtagaactta gagtacaaaa aaaaaaanga gaaaaagaaa aaagctggag ataacacatt 1681 gcctaacttc aaaccatacc acagggctac agtaaccaaa acagtntggt actgttacag 1741 aaaagacaca cagacaaatg gaacagaata gagagcccag aaataagacc acatgcctac 1801 aaccatctga tctttgacaa agctgataaa aacaagcaat gaggaaaaga caccctattc 1861 aacaagtcgt gttgggataa ctgactaacc gtatgtagaa gattgaaacc agaccccgtt 1921 ctatatcata tacacaaata aactcaagat ggattagaga cttaaatgca aagcccaaac 1981 tataaaaatg cttgaaagca acctagacaa taccattccg gacataagca ggggcaaaga 2041 tttcatgaca aagatgccaa gagtcattgc agcaaaagca aaaattgaca aataggatct 2101 aattaaaata aagtttctgc acagcaaaag aaactatcaa ctgagtaaac aacctataga 2161 atgggagaaa atttttgcaa actatgcatc caacaaaggc ctaatattca gaatctataa 2221 ggaacttaaa tgaatctaca agaacaaaac aaacaatgcc acaaaaaatg ggtaaaggac 2281 atgaacagac agtattcaaa agaagacata catgtggcca ccaagcatat gaaaaaa // LOCUS BSUGLPKD 4259 bp ds-DNA BCT 08-AUG-1990 DEFINITION B.subtilis glycerol kinase (glpK) and glycerol-3-phosphate dehydrogenase (glpD) genes, complete cds. ACCESSION M34393 KEYWORDS glycerol kinase; glycerol-3- phosphate dehydrogenase. SOURCE B.subtilis (starin W168) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 4259) AUTHORS Holmberg,C., Beijer,L., Rutberg,B. and Rutberg,L. TITLE Glycerol catabolism in Bacillus subtilis: Nucleotide sequence of the genes encoding glycerol kinase (glpK) and glycerol-3- phosphate dehydrogenase (glpD) JOURNAL Unpublished (1990) University of Lund, Dept. of Microbiology, Solvegatan 21, S-223 62 Lund , SWEDEN STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.M.E.Holmberg, 22-MAY-1990. FEATURES from to/span description pept < 1 679 ORF 1 (AA at 2) pept 698 2188 glycerol kinase (glpK) (EC 2.7.1.30) pept 2329 3996 glycerol-3-phosphate dehydrogenase (glpD) (EC 1.1.99.5) pept 4128 > 4259 ORF 4 binding 682 690 ribosome binding site binding 2315 2321 ribosome binding site binding 4116 4121 ribosome binding site signal 2243 2303 transcription termination signal signal 4000 4033 transcription termination signal signal 4067 4072 -35 region signal 4090 4095 -10 region BASE COUNT 1269 a 853 c 1092 g 1045 t ORIGIN 1 ccatggcgga tacgcggttg gcggcatcag cggtgcccat ttgaatccgg cgctaacgat 61 agcgcttgca tttgtaggag attttccttg gaaagaggtt ccggtttata ttgcggcgca 121 aatgatcgga gcgatcatcg gtgcggtgat tatttatctg cattacctcc cgcactggaa 181 gtcaacggat gatcccgctg ccaagctggg tgttttctca acagggccca gcattccgca 241 tacatttgca aacgttttaa gcgaagtgat tgggacattt gtccttgtac ttggaatctt 301 ggccataggt gcaaatcaat ttacagaagg acttaatcct ttaatcgtcg gtttcctcat 361 tgtagcaatc ggtatttctc tgggaggcac caccggctat gctatcaatc ctgcacgtga 421 cttaggtccg cggatcgccc acgcttttct tccgattccg gggaagggct catcaaactg 481 gaaatacgca tgggttccgg tagtcggccc gattttaggc ggatcattcg gcggtgtatt 541 ttacaacgct gcatttaaag ggcacatcac aagcagcttc tggattgtaa gcgttatatt 601 ggttgtggta ttgttaggac tctatgttta tacgaaatca cattctgcta aaacattatc 661 aaattctaaa tatatttaat caaaggggag acatcttatg gaaacgtaca ttttatcctt 721 agatcagggg acgacaagtt caagagcgat tctgtttaat aaagaaggca aaattgtcca 781 ctctgctcaa aaggaattta cacaatactt cccgcatcct ggctgggttg agcataatgc 841 caatgaaatt tggggctctg tcctcgcggt tatcgcctca gtcatctctg aatcaggaat 901 cagcgcttct caaattgccg gcatcggcat cacgaaccag cgcgagacga cggttgtgtg 961 ggataaagat acaggaagtc ctgtctataa tgcaatcgtt tggcagtcca gacagacgtc 1021 cggcatttgt gaggaacttc gtgaaaaagg atataatgat aaattcagag aaaaaacagg 1081 gcttttaatc gatccttact tctccggcac gaaggtgaag tggattttag acaatgtgga 1141 aggcgcaaga gaaaaagcgg aaaaaggcga gctgctgttt ggaacgattg atacgtggct 1201 catttggaaa atgtcaggcg gaaaagcgca tgtgaccgat tactccaatg cctcaagaac 1261 actgatgttt aatatttacg atttaaaatg ggacgatcaa ctgctcgaca ttctaggcgt 1321 accgaaatcc atgctccctg aagtgaagcc gtcctctcat gtgtatgcgg agactgttga 1381 ttatcacttc ttcggaaaaa atatcccgat tgctggagcg gcaggcgacc agcagtccgc 1441 attgttcggc caggcatgct ttgaagaagg catggggaaa aacacttacg gcacaggatg 1501 tttcatgctg atgaataccg gggaaaaagc aattaagtcc gaacatgggc ttttgacaac 1561 aatcgcttgg ggcattgacg gaaaagtgaa ctatgcgtta gaagggagca tttttgtcgc 1621 aggctctgcc atccagtggc ttagagacgg tttgagaatg ttccaggatt catcgctaag 1681 cgaatcttat gcagaaaaag tggattcaac tgacggcgtg tatgttgttc cagcatttgt 1741 cggactggga acgccttact gggacagcga tgtgcgcggt tcggttttcg gcctgacaag 1801 agggacaaca aaagagcact ttatccgtgc gacactggag tcattggctt atcagaccaa 1861 agatgtgctt gacgcaatgg aagcagattc aaacatttca ttaaagacgc tccgtgtaga 1921 cggaggagct gtaaaaaaca atttcctaat gcagttccaa ggagacctgt tgaatgttcc 1981 tgtggagcgc ccggaaatta atgaaacgac tgcacttggc gcggcttatt tggcgggtat 2041 cgctgtggga ttctggaagg accgttctga aatcgcgaac cagtggaatc tggataaacg 2101 gtttgagcct gaattggaag aagaaaaacg aaatgagctg tataaaggct ggcaaaaagc 2161 cgtgaaagca gctatggctt ttaaataaag taatactatg gtataatggt tacaagttaa 2221 taagaacggt cctgagatga ggagagacca cagcaccaaa gtgtaagcat gcactttggc 2281 tgttgtggtc tctttttcta tttaccgtga caacaaggag gaaacgtaat gatgaatcat 2341 caattttcaa gtcttgaaag agatcgcatg ctgacagaca tgacgaaaaa aacatatgac 2401 ctatttatta tcggaggagg aattacagga gccggaacag ctcttgacgc ggcatcaagg 2461 ggaatgaagg tcgcactaag cgaaatgcag gactttgcgg cgggaacatc aagccgatca 2521 acgaagctgg tacatggcgg cttgcgctat ttaaaacaat ttgaagtgaa aatggtcgct 2581 gaggtaggaa aagagcgggc gattgtgtat gaaaacggcc cgcatgtcac tacaccggaa 2641 tggatgctgc ttccgtttca taaaggcggc acattcggtt catttacaac atctattggg 2701 ttaagggttt atgacttcct tgcaggtgtg aaaaagtcag aacgaagaag catgctttca 2761 gcaaaagaaa cgctgcaaaa agagcctttg gtgaaaaaag acggcttaaa aggcggcggg 2821 tactatgtgg aataccgcac tgacgatgcg agactgacca tcgaagtcat gaaggaagcg 2881 gttaaattcg gggcagagcc tgtgaattac tccaaagtga aggagcttct ttacgaaaaa 2941 ggcaaagccg tcggcgtatt aattgaagat gtgctgacaa agaaagaata taaagtgtat 3001 gcgaaaaaaa ttgtcaatgc tacaggccct tgggtcgatc agctcagaga aaaagaccat 3061 tcgaaaaacg gaaagcattt gcagcataca aaaggcattc accttgtatt tgaccagtct 3121 gtctttccgc tgaaacaggc tgtatatttc gatacacctg atggccggat ggtatttgcg 3181 attcctcgtg aaggcaaaac atacgtggga acaacagaca ctgtttacaa agaggcgctg 3241 gagcatccgc ggatgacaac ggaggatcgt gattatgtca tcaaatcaat caattacatg 3301 ttcccggaac tgaatatcac tgcgaatgac atcgaatcca gctgggcggg gctgcgtccg 3361 ctgattcatg aagaaggcaa agatccttct gaaatttcac ggaaagacga gatttggaca 3421 tctgactcag gcctgatcac cattgccggc ggaaaactga ctggatacag aaaaatggcg 3481 gacgacatcg ttgatcttgt ccgtgatcgc ttaaaagaag agggcgaaaa ggattttgga 3541 ccatgtaaaa cgaaaaacat gccaatctct ggcgggcacg tcggcggttc gaaaaatctt 3601 atgtccttcg ttaccgcgaa aacaaaagaa ggaattgcag ccggtttatc agaaaaagac 3661 gcaaaacagc ttgcgatcag atacggctct aacgtagatc gcgtctttga ccgggtagaa 3721 gcgctgaaag atgaggccgc gaaacgcaac attccggttc atattcttgc tgaggcagaa 3781 tacagtatag aagaagagat gactgcaacc cctgctgact tctttgtccg cagaacggga 3841 cgtttatttt ttgatatcaa ttgggtaaga acatataaag atgccgttat tgattttatg 3901 agcgagcgat tccaatggga tgagcaggcg aaaaacaaac atacagaaaa cctcaacaag 3961 cttttacacg atgcggtcgt accgcttgag caataaatca taacgggctg tctgcagccc 4021 gttatttctt tttacgtgcc gaaaggggga gatctcaggt tatcaagtga cagaggcacc 4081 cgcttcatat aaaataatgg cataagctga tacataggag gacgaatatg acttggagaa 4141 agagctatga acgctggaaa cagacagaac atttagatct ggaattaaaa gagcgcctta 4201 ttgaattaga gggagatgaa caggcccttg aggactgttt ctataaagac cttgaattc // LOCUS CLCMTPLAS 379 bp ds-DNA PLN 08-AUG-1990 DEFINITION C.purpurea mitochondrial plasmid terminal 5' inverted repeat (TIR). ACCESSION M35177 KEYWORDS . SOURCE C.purpurea (strain K) mitochondrial plasmid pClK1 DNA. ORGANISM Claviceps purpurea Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Clavicipitales; Clavicipitaceae. REFERENCE 1 (bases 1 to 379) AUTHORS Duevell,A., Hessberg-Stutzke,H., Oeser,B., Rogmann-Backwinkel,P. and Tudzynski,P. TITLE Structural and functional analysis of mitochondrial plasmids in Claviceps purpurea JOURNAL Mol. Gen. Genet. 214, 128-134 (1988) STANDARD simple staff_review BASE COUNT 97 a 76 c 64 g 142 t ORIGIN 1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt 61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact 121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa 181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac 241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta 301 aactttccgc taaaacccgc acgtcttcgt ctgcatggta tgggcgttag attctattct 361 acaatgtctt ccgccaaaa // LOCUS CLCMTPLASX 379 bp ds-DNA PLN 08-AUG-1990 DEFINITION C.purpurea mitochondrial plasmid 3' terminal inverted repeat (TIR). ACCESSION M35178 KEYWORDS . SOURCE C.purpurea (strain K) mitochondrial plasmid pClK1 DNA. ORGANISM Claviceps purpurea Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Clavicipitales; Clavicipitaceae. REFERENCE 1 (bases 1 to 379) AUTHORS Duevell,A., Hessberg-Stutzke,H., Oeser,B., Rogmann-Backwinkel,P. and Tudzynski,P. TITLE Structural and functional analysis of mitochondrial plasmids in Claviceps purpurea JOURNAL Mol. Gen. Genet. 214, 128-134 (1988) STANDARD simple staff_review BASE COUNT 105 a 75 c 62 g 137 t ORIGIN 1 gagagagcag tcgtttcgat tatatgcctt tttttcgcga ccctataaga gagagaattt 61 gctacccttg atggtctctc cacggtttta ccgttaaatg cttttgatta attattgact 121 tggttctgtt tagatactag cttttgctgt actattaata gttctttgct taataccaaa 181 tttcctgatt tcaggtccga ttgaattttt actatgttaa ttcttttact acaatacaac 241 gtttctaacc cttgcctaac taatttatta gaaaacaaaa ctggcacagg tgttgggtta 301 aactttccgc taaaacccgc acgtcttact ttagctcaac aaagaaagcg tagtgaactt 361 aaaaaagcac gtcttgctc // LOCUS DDISP60 328 bp ds-DNA INV 08-AUG-1990 DEFINITION D.discoideum spore coat 60 (sp60) gene, 5' flank. ACCESSION M34546 KEYWORDS spore coat 60. SOURCE D.discoideum (strain Ax4) DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 328) AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F. TITLE Coordinate regulation of the spore coat genes in Dictyosteiium discoideum JOURNAL Dev. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Fosnaugh, 24-MAY-1990. BASE COUNT 168 a 15 c 11 g 134 t ORIGIN 1 aaaacataga acaaatttaa ataattattt tttattaaat tgtattattt ttttttatta 61 tttatttatt ttttattttc aaaaaaaaaa aaaaaaagtt aacagacaaa aagtataatc 121 tatttaatta ttcaaaaaaa aaaatattaa atcattgtag tattttgttc atattcaaat 181 taaatatcaa tacaattaaa aaaagttaaa atgtaattat taatatatat atatatttat 241 caatatataa tttaatataa aaaaaaaaaa gaaatttaat aaacataaat atttataaaa 301 tcataaattt aaaataaata attttttt // LOCUS DDISP70 563 bp ds-DNA INV 08-AUG-1990 DEFINITION D.discoideum spore coat 70 (sp70) gene, 5' flank. ACCESSION M34547 KEYWORDS spore coat 70. SOURCE D.discoideum (strain Ax4) DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 563) AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F. TITLE Coordinate regulation of the spore coat genes in Dictyosteiium discoideum JOURNAL Dev. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by K.Fosnaugh, 24-MAY-1990. BASE COUNT 207 a 73 c 47 g 236 t ORIGIN 1 gaagtgtgag ttgcgcagtg tgttagtgtg ggtttaatga gagtgttttt tttttttttt 61 tttttttttt tgaacgtaca tatttatttg tttttattaa tatttcataa atgaatttgt 121 aaatataaaa ataaattgtt tttttttatt ttattttttt gattatgaca ataatttcaa 181 taaaatgtaa aatatgataa aaataatttt aattattgaa acacacacac tcactcactt 241 tttcccacta gtattacact ttccaatgaa aaataaatta atattgaaaa agaaggaaat 301 ggaaaaaaat gaaaaataaa aataattaaa taaacaataa tattacaaac acactatttc 361 tttctcaccc acacaaccaa tatactaaca cttctcatac acccaccaac acacctacac 421 ccagttcttc acattcatac acaaagttta atattacact ttaaaaaaca ttcatattaa 481 tgatttttat gatttattta tttattaatt tatttaattg tgtattttga ttattgctct 541 ctacttttta tatatttttt ttc // LOCUS DDISP96 682 bp ds-DNA INV 08-AUG-1990 DEFINITION D.discoideum spore coat 96 (sp96) gene, 5' flank. ACCESSION M34548 KEYWORDS spore coat 96. SOURCE D.discoideum (strain Ax4) DNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 682) AUTHORS Fosnaugh,K., Loomis,K. and Loomis,W.F. TITLE Coordinate regulation of the spore coat genes in Dictyosteiium discoideum JOURNAL Dev. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by K.Fosnaugh, 24-MAY-1990. BASE COUNT 321 a 37 c 63 g 261 t ORIGIN 1 tgaaaaaaaa aaaattttgt gtgtgatagt ttgtcaaaaa tcatttgaaa ttaaaatcat 61 attataatat atattaaaaa aatgttgtat ctttaatttt ttttcatact agtgtgttaa 121 atgaattaaa gtgtggtaat gaataaatga aaaaaaaaaa aaaattaaag tgagagagtg 181 tgtaaaagtg gtaatgaatg tgtgtttgag tgtgttaaaa atgtgtttac aaaatataat 241 aatttaaatt cattcatatt caactatgga gaaagcgtgt tttcaaacac aataaatgtg 301 taatgaaaaa tacaattaaa taattattta attttttgta tgattaaaaa aaaaagagta 361 aattagtagt gccaccctaa aacaataata ataataatta aataatatta atatttattt 421 aattttttta tttaattttt tttcaaaaca attcataata taaatttatt cttttttttt 481 tttttttttt tttttaaaaa tattcaactc agtaataata ataataaaaa taataacaat 541 aataaggaat tataatcaaa taataatata gctataatct ataattacta tagtaaaatt 601 acgtatacat atataaataa ttattattat cattgttttt aaaaataaat aaataaataa 661 taaaaaaaaa aaaaaaaaaa aa // LOCUS GLAA2GIA 1870 bp ds-DNA INV 08-AUG-1990 DEFINITION Giardia lamblia alpha-2 giardin gene, complete cds. ACCESSION M34550 KEYWORDS alpha-2 giardin. SOURCE G.lamblia (strain Portland-1) trophozoite DNA, clone pG2. ORGANISM Giardia lamblia Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Zoomastigophora; Diplomonadida; Diplomonadina; Hexamitidae. REFERENCE 1 (bases 1 to 1213) AUTHORS Alonso,R.A. and Peattie,D.A. TITLE Genomic organization of alpha giardin genes in Giardia lamblia JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1214 to 1870) AUTHORS Alonso,R.A. and Peattie,D.A. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.A.Alonso, 25-MAY-1990. Author address: R.A.Alonso Dept of Tropical Public Health Harvard School of Public Health 665 Huntington Avenue, Boston, MA 02115 FEATURES from to/span description pept 215 1105 alpha-2 giardin BASE COUNT 466 a 468 c 507 g 429 t ORIGIN 1 atactgcaga tacacctcat ggtgtacatg ctggcggcaa gtatcgtata aactctatgg 61 agacatgggg cagaccttgt ctcgctgctt tgacgagcga atggccccta ttttgcgcac 121 gctatggcgc gtcacagatg cataggataa gcgaactcat gatggaaatt caaattacct 181 taaaatatat tttctgagcg cttacattta gaaaatgccg aagctatccc agatcgtcgc 241 tgacatgaag caggccatcg atgccaagga cgaggcccag atcgcgttca tagcgtcgga 301 atactctgca gatgcgcgtc agaggattgc gcaggggtac agggaccaat acggcaagga 361 gctccccgac gacatcaaga aggcgctcaa gggcggctcc gaggagagcc tcctgatgga 421 tctcttcagc gatcgtcacg aggtccgcgc ccagcacatc cgcgacgcgc tttccggcaa 481 gaacgaccac atggccttct tcgacacagt cattctgtgc acccccgaag actggcacga 541 gaccgtcgcc gcctacacga ggatgttcaa gaagcccctc gtcgaggact tcatgaagga 601 tgtcggccgc aaggagaatt ggtgtctttt catggagaag tggatggctc acgagcgcac 661 tagccgtgaa ggcagccctg acgaggaagc agagaaactg aacaaggcgt tcagtgagag 721 cgaccacgac tacatttcca gcttcatggc tggggtgcct cccgaagaat acaagtccat 781 caacaccagc ttcaagagcc ttacaggaaa gggcatcgac caggccttcg ccaccatcta 841 tacggggact gactactact ccctgtactg cgcccacttt gctcttcttg gcatgcacaa 901 gcttgccgcc tacctggtca actgcgcctg caacgataag ggcgacgaga agcgcatgag 961 gaggatcacg ggcatgatgg tcgacaagtg cttggctgca aagtatgcct acaagaccta 1021 cgggagtatg aaagccgacg tcgaaaggtg ctttgataag cgcatggctc cgatcctctg 1081 taccctctgg cgcctaaggg agtgagcgcc tttactgcgg ggtttccttt cgctagtgaa 1141 ttgcctagcg ggtgtttact gccttctgtt tctttttgcc ctgtcctggc ggtccaagat 1201 gctatcctgc gatgagaggt ctttcgagaa cggatgcgag aagtctggga ttgctacatg 1261 ccgtccacct gactgtgagg gtgagtagga ctggagtagc ggcaaggagt gggagctcat 1321 gttgtccata gagctgtatg cctgatcaat ctgcccattt cctgcttgtc ctctgcgggt 1381 tttttgtttc agatctgagc tgcctaaccg agaggggccg tactctatgc ttagttctct 1441 attaaccaac tgaacagcct ttgctcgctt gcttgccagt gtatatttat atctatattt 1501 caagaacaga gagaggaccg tgatgaagag gagcaggatg agtgtcgcaa cggaccagaa 1561 cacgacaata agggctatag acatcttaga atgataacca acaaagctta tgaactcctc 1621 tccactgaaa gaagaggaaa gataccgagt ctcgtaaaac tgaatggtag tcctatcgat 1681 ggtgtgcccc cagatagggt ctactatcgg ctggttggta agattgcttg cagcggtatc 1741 gacagctgcc acaaccacta agtagtcaag gctcgtgctt gtacgaagga aggtcaacgg 1801 gaatgcgaag catggaccag gtgttatttg cttgctcgat aaactcagga gtatatattt 1861 cagtattgtc // LOCUS KAEHUTC 1625 bp ds-DNA BCT 08-AUG-1990 DEFINITION K.aerogenes histidine utilization repressor C (hutC) gene, complete cds. ACCESSION M34604 KEYWORDS histidine utilization repressor C. SOURCE K.aerogenes (strain W-70) DNA. ORGANISM Klebsiella aerogenes Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1625) AUTHORS Schwacha,A. and Bender,R.A. TITLE Nucleotide sequence of the gene encoding the repressor for histidine utilization genes of Klebsiella aerogenes JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.A.Bender, 29-MAY-1990. FEATURES from to/span description pept < 1 669 histidine utilization repressor G (AA at 1) pept 731 1456 histidine utilization repressor C (hutC) pept 1599 > 1625 histidine utilization repressor U signal 667 686 hutC promoter signal 1522 1493 Pc promoter signal 1531 1557 Puh promoter binding 1475 1495 CRP binding site site 34 61 helix-turn-helix BASE COUNT 311 a 476 c 508 g 330 t ORIGIN 1 gtcgacggcg agcagttaga agccgcgcat caggcgctgc gcgaggcggt ggccgactgt 61 cagcgggcgg gtaaacgcac gctggtcctg ggcggcggcc acgagacggc cttcggccac 121 ggcgctgggg tactggatgc cttcccaggc gaaaaagtgg gcattattaa tctcgatgcg 181 catctggatc tgcgctttgc cgactgcgcc agctccggga cgccgttccg ccagttggcg 241 ctggagtgtg acgcgcagca gcgcggtttt cactatacct gcatcggggt tagccgggcg 301 gcgaacaccc aggcgctatg ggatgaagcg gcgcgccgcc aggtggctat cgttgaagat 361 ctggaggtgc tgacagcctt cgaaactcgc gtgttgccgg agcttgagcg caatatcgcg 421 caattcgacc gcctgtatct gactatcgat ctcgacgtgc tgccggcgcg agaaatgccg 481 gcggtgtcgg ccccggcggc gctgggcgtg ccgctgggca cgctgctgcg catcgttgag 541 ccgctgtgcc gcagcggtaa gctgcaggcg gtggatctgg tggagtttaa cccgctgttt 601 gacattgacg gtcagggcgc tcgcgcggcg gcccgtgtgg catggcaaat cgcccattgg 661 tggcgctagc gatccgcgct attattttcg tttcgccgct ccgcttatta cgcataagga 721 aagccaggcc atgtttgcac aacaacctcg ttccgcgccc gcgcctttct atgaaaaggt 781 gaagcaggcg atcagcgaaa agatccacag cggcgtctgg cgaccgcatg accgtatccc 841 gtcggaagct gagctggtgg cccagttcgg cttcagccgg atgaccatca atcgcgcgct 901 gcgcgagctg acggacgaag gcctgctggt caggttgcaa ggggtgggaa cctttgtcgc 961 cgagccgaaa gggcagtcgg cgctgttcga ggtgcgcagt attgccgcag aaattgtcgc 1021 ccgtcaccat cagcaccgct gtgaggtact cctgcttgag gagacccggg cggatcatat 1081 tcaggcgacg gcgctcagcg tcccggaagg cacccgcatc tttcactcgc tgatggtgca 1141 ttacgaaaac gaggtgccgg tgcaaatcga agatcgctgc gtcaatgccg cagtggtacc 1201 ggactatctg catcaggact acaccgccac cacgccccat gattacctgt cgctgatcgc 1261 gccgttaacc gaaggtgaac atattgttga agcggtgcag gccacggcgg aagagtgcgc 1321 gctgctgcat attcacgccc acgatccgtg cctgctgatc cgtcgtcgca cctggtcgac 1381 aacccacatc gtctcccacg cgcgcctgct gttccccggt agccgatatc gtctgcaggg 1441 gcgctttggc tcctgatcca cgctggcgtc agcaaaacgt gattgctgac gcaatataac 1501 aaaattgtat catttctgtt aaatcctggc ttgcgcatgc ttgtatagac aagtatatgt 1561 atctacgtaa acaacgttta ttgtcaggag aaaccccgat gtcgcaaagc aaatatcgcc 1621 agctg // LOCUS MUSMHH2KBM 66 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Mouse major histocompatibility complex antigen H-2Kbm29 mRNA. ACCESSION M34542 KEYWORDS major histocompatibility complex antigen. SOURCE Mouse ((C3H.CAS3(R4)XB6)F2, mutant #6414(bm29)), liver, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 66) AUTHORS Horton,R.M., Loveland,B.E., Parwani,A., Pease,L.R. and Fischer Lindahl,K. TITLE Immunological characterization and molecular analysis of the H-2Kbm29 mutant JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Weiss,E., Golden,L., Zakut,R., Mellor,A., Fahrner,K., Kvist,R. and Flavell,R.A. TITLE The DNA sequence of the H-2Kb gene: Evidence for gene conversion as a mechanism for the generation of polymorphism in histocompatibility antigens JOURNAL EMBO J. 2, 453-462 (1983) STANDARD full automatic COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.M.Horton, 24-MAY-1990. Mutation in codon 89 changes bp 43 and 44 to "gc", otherwise this sequence is identical to reference 2. FEATURES from to/span description pept < 1 > 66 MHC H-2Kbm29 (AA at 3) BASE COUNT 14 a 22 c 19 g 11 t ORIGIN 1 cgagtggacc tgaggaccct gctcggctac tacaaccaga gcgcgggcgg ctctcacact 61 attcag // LOCUS YSCKGD2 2112 bp ds-DNA PLN 08-AUG-1990 DEFINITION S.cerevisiae dihydrolipoyl transsuccinylase (KGD2) gene, complete cds. ACCESSION M34531 KEYWORDS alphaketogluterate dehydrogenase complex; dihydrolipoyl transsuccinylase. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2112) AUTHORS Repetto,B. and Tzagoloff,A. TITLE Structure and regulation of KGD2, the structural gene for yeast dihydrolipoyl transsuccinylase JOURNAL Mol. Cell. Biol. 10, 4221-4232 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Tzagoloff, 29-MAY-1990. FEATURES from to/span description pept 560 1987 dihydrolipoyl transsuccinylase mRNA 306 > 1987 dihydrolipoyl transsuccinylase mRNA mRNA 451 > 1987 dihydrolipoyl transsuccinylase mRNA (alt.) signal 196 200 CAAT box signal 228 232 CAAT box BASE COUNT 649 a 435 c 425 g 603 t ORIGIN 1 tcgaacgtct cgtggaaagc cttgaccgtt gcgtctttac tcgtctctat tcaaccctgt 61 ggtcctatta gaaccgatct ttgttccatc gtcattacta cttcttgtct cttctctctt 121 ggggaataat aaacaaaaca attccagcca cccgcatctc cgtcaaagac gatattaaca 181 accctttttt tgcagccaat caaaactaag gtcctgcccg gagtgagcca ataaagcggc 241 cacaaattac gcaggaatct atggttccta aagcttattc ttattctaca ttgacaccat 301 tctactacag ggctcggatc cacacttggt ctatcagcaa atgcaaatcc tttttccctc 361 aactaggcct acttttcata tataatctct ttttgagctt caggggagga aatatctaac 421 gtaaaaaaaa gaaaaatccg aaataaaaaa ttaggcagct atctttaacg gtatatatca 481 gtgaatagat atattgttat aaacttcact accacatttg ttacaaccaa agacacaact 541 tcagataatt atttaaacaa tgctttccag agcgacgcgt actgcagctg ccaaatcctt 601 agtaaaatct aaagtggcta gaaatgttat ggctgcttct ttcgtcaaga gacatgcttc 661 tacaagtttg ttcaaacaag ctaacaaggt cgaatcctta ggttcaatat atttatccgg 721 caagaaaatt tcagttgcgg cgaatccgtt ctccataact agcaatcgtt ttaaatctac 781 ctctattgaa gttcctccga tggcagagtc cctgactgaa ggctctttaa aggaatatac 841 taaaaacgtt ggtgatttta ttaaggagga cgagctgttg gccactattg agaccgataa 901 aattgatatt gaggtcaatt cgccagtatc aggtactgtt acgaagctaa atttcaaacc 961 agaggacact gtcactgttg gtgaggagtt agctcaggtc gagcctggtg aagcacctgc 1021 tgagggttct ggagaatcta agccagagcc taccgaacaa gcggagcatc gcaaggtgtc 1081 gccgcaaggg aaaactcaag tgaggaaacg gcttcaaaga aagaagctgc tccaaagaaa 1141 gaagccgctc caaagaaaga agttacagaa ccaaaaaagg actgatcaac caaagaagac 1201 cgtctctaag gcgcaggaac ccccagtagc ctctaactct ttcacaccat ttccacgtac 1261 agaaaccagg gtcaaaatga accgtatgag attgaggatt gccgaaagat taaaagagtc 1321 tcaaaacact gctgcttcct taaccacatt caacgaagtt gacatgtcag ctttgatgga 1381 aatgaggaaa ctgtataaag atgagattat taagaagacc ggtactaaat tcggattcat 1441 gggtcttttc tccaaagcat gtaccttggc cgccaaggat attccagccg tcaatggtgc 1501 cattgaaggt gaccagattg tttatcgtga ttacacagat atttctgttg ctgtggccac 1561 tccaaagggt ttggttaccc ccgtcgttcg taatgcagag tcattgagtg ttttagatat 1621 tgagaacgaa attgttcgct tgagtcataa agcgcgtgat ggcaaattaa ccctagaaga 1681 tatgacgggt ggtactttca ccatatctaa tggtggtgtt tttggttcat tatacggtac 1741 tcctatcatc aattcaccac aaacagccgt cctaggcttg catggtgtca aagagagacc 1801 tgtcactgtt aatggacaaa ttgtctcaag accaatgatg tacttggctt tgacttatga 1861 tcatagattg ctagatggtg agaagctgtt atccttcttg aagactgtta aagagttgat 1921 tgaagaccct agaaaatgtt gttatggtga tttgaaattt gcagcccata ccaatctgat 1981 ttcatgatta ttcttgtccg ctattactgt gtgaatctct tctatttaag tatgatattt 2041 aaagatatgc tatgatgata aatggaacta tgaattaatg ttttcttttt ataatgagat 2101 atggaaaagc gc // LOCUS RATALADPS 1612 bp ds-DNA ROD 08-AUG-1990 DEFINITION Rat delta-aminolevulinate dehydratase processed pseudogene, complete cds. ACCESSION J04764 KEYWORDS delta-aminolevulinate dehydratase; pseudogene. SOURCE Rat (strain Sprague-Dawley) adult liver DNA, clone lambda 27D. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1612) AUTHORS Bishop,T.R., Frelin,L.P. and Boyer,S.H. TITLE Isolation and characterization of a rat delta-aminolevulinate dehydratase processed pseudogene JOURNAL Genomics 7, 629-632 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.R.Bishop, 20-APR-1990. FEATURES from to/span description pept.ps 271 1264 pseudo-delta-aminolevulinate dehydratase rpt 101 108 8 bp flanking direct repeat rpt 1385 1392 8 bp flanking direct repeat BASE COUNT 404 a 411 c 418 g 379 t ORIGIN 1 tagatgttta ctctcctaca tgagggataa ttgtctataa tgaatgtcac tcctgtatgt 61 gggattcata aagtctatgt gaggagagaa ggacatgtct taaatctcat tcaggggctt 121 gtgtttttag agcgggagag cgcggtgtcc agaatcccgg cttggagcgg tggtgaggca 181 gctgagcagc ttctctgagc tgaggctcag ggccactcca ggggcctaga tgcaatctta 241 gcagactgct gtgttccgcc ggcccccacc atgcatcatc agtctgttct gcacagaggc 301 tactttcacc cactgcttcg ggcctggcaa gaccaccgcc tccaccgtca gtaccaccaa 361 cctcatctat cccatcttta acacggatat ccctgatgat gtccagccca ttgccagcct 421 cccgggagtg gccaggtatg gcctaaacca gctagaggag atgctgagat ctctggtgga 481 agctggcctg cgctgtgtcc tgatctttgg tgtccccagc agagttccca aggatgaaca 541 gggctctctg cagctgactc agaggactcc ccaactattg aggctgtctg tctgctgagg 601 aagaccttcc ctaccctcct agtggcctgt gatgtctgct tgtgccccta cacctcccat 661 ggccactgtg gcctcctgag tgaaaatgga gcgttcctag cagaggaaag ctgacagcgg 721 ttggcagagg tggcactggc ctatgccaag gaaggctgtc aggttgtagc tccatcagat 781 atgatggatg gacgagttga agccatcaag gctaccctgc taaaacatgg acttggcaac 841 agggtctctg tgatgagcta tagtgccaaa tttgcctcct gtttctatgg tcctttccat 901 gatacagctc agtcaagccc agcttttgga gaccgacgct gttatcagct gcctcctgga 961 gccggtgcca ggccctctga gcagtggcct gggacattca agagggagct gacataatca 1021 tggtgaatcc gggattgccc tacctggata tggtgcagga ggtaaaggac aagcacccct 1081 cgctccctct cgcagtatac caggtgtctg aagagtttgc catgtgtggc agggagccaa 1141 ggctggggcc tttgatctca gtactgctgt actagagtcc atgacagcct tccgcagagc 1201 cggtgccgac attatcatca cctactatgc accccagctg ttgagtgggc tgaaggaaga 1261 gtgaaggaaa gtgaaggact tggtcttgac agagcgctcc ctgggcctta cagaagggaa 1321 aaagtaaacg cgctgttaga accgaaaaaa caaacaaaca aacaaaaacc ctctttcaga 1381 atcaaggaaa tctcaaccca ataggtcaca aagtcataat tacaagagtg aagagttcta 1441 gtcatctggg ttcctgccct tcaggatcag caggaataga gatgtaactt tacaagtctt 1501 caagaagtag actacatctt atgtacatga aacaccacat ctttctagaa aatgagactg 1561 aggcataaga caccacaaca gaaacactgt gttactgtca tgctgtgatc at // LOCUS MCAAAF1 229 bp ds-DNA VRL 08-AUG-1990 DEFINITION Cauliflower mosaic virus aphid acquisition factor 5' end. ACCESSION M32809 KEYWORDS aphid acquisition factor. SOURCE Cauliflower mosaic virus viral DNA, clone pCMS31. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (bases 1 to 229) AUTHORS Vaden,V.R. and Melcher,U. TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications for mechanisms of recombination JOURNAL Virology 177, 717-726 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by U.K.Melcher, 13-MAR-1990. FEATURES from to/span description pept / 36 > 229 aphid acquisition factor (AA at 38) IVS < 1 35 intron X BASE COUNT 88 a 43 c 40 g 58 t ORIGIN 1 aattaatcat cttaacaacc tcaatgagat tgtaggaaga agcttactcg gaatatggaa 61 gatcaactca tacttcggac taagcaaaga cccttcggag tccaaatcaa aaaacccgtc 121 agtttttaat actgcaaaaa ccatttttaa gagtgggggg gttgattact cgagcctaaa 181 ggaaataaaa tcccttttag aagctcaaaa cactagaatt aaaagtcta // LOCUS MCAAAFCP 1125 bp ds-DNA VRL 08-AUG-1990 DEFINITION Cauliflower mosaic virus aphid acquisition factor (3' end), and coat protein (5' end) genes. ACCESSION M32813 KEYWORDS aphid acquisition factor; coat protein. SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (bases 1 to 1125) AUTHORS Vaden,V.R. and Melcher,U. TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications for mechanisms of recombination JOURNAL Virology 177, 717-726 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by U.K.Melcher, 13-MAR-1990. FEATURES from to/span description pept < 1 246 aphid acquisition factor (AA at 1) pept 248 637 ORF pept 619 > 1125 coat protein BASE COUNT 435 a 218 c 223 g 249 t ORIGIN 1 aaaaacccgt cagtttttaa tactgcaaaa accattttta agagtggggg ggttgattac 61 tcgagccaac taaaagaaat aaaatctctt ttagaatctc aaaatactag aattaaaagt 121 ctagaaaaag caattcaatc cttagagcgt aagattgaac cagagccctt aactaaagaa 181 gaagttaaag agcttaaaga atcgattaac tcgatcaaag aaggattaaa gaatattatt 241 ggctaaaatg gctaatctta atcaaatcca aaaagaagtc tctgaaatcc tcagtgacca 301 aaaatccatg aaagcggata taaaagctat cttagaacta ttaggatccc aaaatcctat 361 taaagaaagc ttagaagctg ttgcagcgaa aatcgttaat gacttaagca agctcatcaa 421 tgattgcccc tgtaacaaag agatattaga agccttaggc aatcagccta aagagcaact 481 aatagaacaa cctaaagaga aaggcaaagg ccttaatcta ggaaaatatt cttaccccaa 541 ttacggagta ggaaatgaag aattaggatc ctctggaaac cctaaagctt taacctggcc 601 cttcaaagct ccagcaggat ggccgaatca attttagaca ggaccattaa ccggttctgg 661 tataatctgg gagaagattg tctctcagaa agtcaatttg accttatgat aaggttaatg 721 gaagagtccc ttgacgggga ccaaattatt gatctaacct ctctacctag tgataatttg 781 caggtcgaac aggttatgac aactaccgaa gactcgatct cggaagaatc agaattcctt 841 ctagcaatag gagaaacatc tgaagacgaa agcgattcag gagaagaacc tgaattcgaa 901 caagttcgaa tggatcgaac aggaggaacg gagattccca aagaagaaga tggtgaacca 961 tctagacaca atgagagaaa gagaaagacc ccggaagacc ggtactttcc aactcaacca 1021 aagaccattc caggacaaaa acaaacgtct atgggaatgc tcaacatcga ctgccaaacc 1081 aatcgaagaa ccttaatcga cgactgggca gcagaaatcg gactg // LOCUS MCACPX 106 bp ds-DNA VRL 08-AUG-1990 DEFINITION Cauliflower mosaic virus coat protein, partial cds. ACCESSION M32814 KEYWORDS coat protein. SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (bases 1 to 106) AUTHORS Vaden,V.R. and Melcher,U. TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications for mechanisms of recombination JOURNAL Virology 177, 717-726 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by U.K.Melcher, 13-MAR-1990. FEATURES from to/span description pept < 1 > 106 coat protein precursor (AA at 2) BASE COUNT 52 a 16 c 18 g 20 t ORIGIN 1 atgcaagaaa acatccaaga agaagtatca taagcgatac aagaagaaat ataaggtcta 61 taaaccttat aagaagaaga agaaattccg atccggaaaa tacttc // LOCUS MCACTCTP 221 bp ds-DNA VRL 08-AUG-1990 DEFINITION Cauliflower mosaic virus cell-to-cell transport protein, 5' end. ACCESSION M32812 KEYWORDS cell-to-cell transport protein. SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (bases 1 to 221) AUTHORS Vaden,V.R. and Melcher,U. TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications for mechanisms of recombination JOURNAL Virology 177, 717-726 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by U.K.Melcher, 13-MAR-1990. FEATURES from to/span description pept 43 > 221 cell-to-cell transport protein BASE COUNT 93 a 40 c 32 g 56 t ORIGIN 1 ctctaacgag tatccacaga aaaaataacc ttctgtgttg agatggattt gtatccagaa 61 gaaaataccc aaagcgagca atcgcagaat tctgaaaata atatgcaaat atttaaatca 121 gaaaattcgg atggattctc ctccgatcta atgatctcaa acgatcaatt aaagaatatc 181 tctaaaaccc aattaacctt ggaaaaagaa aagatattca a // LOCUS MCAMPX 972 bp ds-DNA VRL 08-AUG-1990 DEFINITION Cauliflower mosaic virus matrix protein (put.), partial cds. ACCESSION M32811 KEYWORDS matrix protein. SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (bases 1 to 972) AUTHORS Vaden,V.R. and Melcher,U. TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications for mechanisms of recombination JOURNAL Virology 177, 717-726 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by U.K.Melcher, 13-MAR-1990. FEATURES from to/span description pept < 1 187 matrix protein (put.) (AA at 2) ORF 895 > 972 ORF 7 mRNA < 1 283 35S RNA signal 253 257 35S TATA box binding 882 895 primer binding site BASE COUNT 321 a 209 c 200 g 242 t ORIGIN 1 ctgtcacttc atcgaaagga cagtagaaaa ggaaggtggc tcctacaaat gccatcattg 61 cgataaagga aaggctatca ttcaagatgc ctctgccgac agtggtccca aagatggacc 121 cccacccacg aggagcatcg tggaaaaaga agacgttcca accacgtctt caaagcaagt 181 ggattgatgt gacatctcca ctgacgtaag ggatgacgca caatcccact atccttcgca 241 agacccttcc tctatataag caagctcatt tcatttggag aggacacgct gaaatcacca 301 gtctctctct acaaatctat ctctctctct attctccata ataatgtgtg agtagttccc 361 agataaggga attagggttc ttatagggtt tcgctcatgt gttgagcata taagaaaccc 421 ttagtatgta tttgtatttg taaaatactt ctatcaataa aatttctaat tcctaaaacc 481 aaaatccagt actaaaatcc agatctccta aagtccctat agatctttgt cgtgaatata 541 aaccagacac gagacgacta aacctggagc ccagacgccg tttgaagcta gaagtaccgc 601 ttaggcagga ggccgttagg gaaaagatgc taaggcaggg ttggttacgt tgactccccc 661 gtaggtttgg tttaaatatg atgaagtgga cggaaggaag gaggaagaca aggaaggata 721 aggttgcagg ccctgtgcaa ggtaagaaga tggaaatttg atagaggtac gttactatac 781 ttatactata cactaaggga tgcttgtatt ttaccctata ccccctaata accccttatc 841 gatctaaaga aataatccgc ataagccccc gcttaaaaaa ttggtatcag agccatgaat 901 aggtctatga ccaaaactca agaggataaa acctcaccaa aatacgaaag agttcttaac 961 tctaaagata aa // LOCUS MCAORFIX1 74 bp ds-DNA VRL 08-AUG-1990 DEFINITION Cauliflower mosaic virus unidentified protein gene, partial cds. ACCESSION M32808 KEYWORDS . SOURCE Cauliflower mosaic virus viral DNA, clone CMS31. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (bases 1 to 74) AUTHORS Vaden,V.R. and Melcher,U. TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications for mechanisms of recombination JOURNAL Virology 177, 717-726 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by U.K.Melcher, 13-MAR-1990. FEATURES from to/span description pept < 1 / 34 unidentified protein (AA at 2) IVS 35 > 74 intron X BASE COUNT 29 a 12 c 13 g 20 t ORIGIN 1 attatcagtg gacattcacg atgccacagg taaggtatat cttcctttaa tcactaaaga 61 ggagataaat aaaa // LOCUS MCAORFX2 178 bp ds-DNA VRL 08-AUG-1990 DEFINITION Cauliflower mosaic virus unidentified protein gene, 5' end. ACCESSION M32810 KEYWORDS . SOURCE Cauliflower mosaic virus viral DNA, clone pCMS31. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (bases 1 to 178) AUTHORS Vaden,V.R. and Melcher,U. TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications for mechanisms of recombination JOURNAL Virology 177, 717-726 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by U.K.Melcher, 13-MAR-1990. FEATURES from to/span description pept 94 > 178 Unidentified protein binding 82 94 primer binding site BASE COUNT 68 a 38 c 24 g 48 t ORIGIN 1 tgcttgtatt ttaccctata taccctaata accccttatc gatttaaaga aataatccgc 61 ataagccccc gcttaaaaat tggtatcaga gccatgaatc ggtttaaaga ccaaactcaa 121 gagggtaaaa cctcatcaaa atacgaaaga gttcttaact ctaaagataa aagatctt // LOCUS MCARTRA 467 bp ds-DNA VRL 08-AUG-1990 DEFINITION Cauliflower mosaic virus reverse transcriptase (V) gene, partial cds. ACCESSION M32815 KEYWORDS reverse transcriptase. SOURCE Cauliflower mosaic virus viral DNA, clone pLW303S. ORGANISM Cauliflower mosaic virus Viridae; ds-DNA nonenveloped viruses; Caulimovirus. REFERENCE 1 (bases 1 to 467) AUTHORS Vaden,V.R. and Melcher,U. TITLE Recombination sites in Cauliflower mosaic virus DNAs: Implications for mechanisms of recombination JOURNAL Virology 177, 717-726 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by U.K.Melcher, 13-MAR-1990. FEATURES from to/span description pept < 1 > 467 reverse transcriptase (AA at 3) BASE COUNT 157 a 110 c 86 g 114 t ORIGIN 1 agaagatctt ctcttccttc gactgcaagt caggattctg gcaagttctg ctagatcaag 61 aatcaagacc tctaacggca ttcacatgtc cccaaggtca ctacgaatgg aatgtggtcc 121 ctttcggcct aaagcaggct ccatccatat tccaaagaca catggacgaa gcatttcgtg 181 tgttcagaaa gttctgttgc gtttatgtcg acgacattct cgtattcagt aacaacgaag 241 aagatcacct acttcatgta gcgatgatct tacaaaagtg caatcaacat ggaatcatcc 301 tttccaaaaa gaaagcacaa ctcttcaaga agaagataaa cttccttggt ctagaaatag 361 atgaaggaac acataagcct caaggacaca tcttggaaca catcaacaag ttcccagata 421 cccttgaaga caagaagcaa cttcagagat tcttaggcat actaaca // LOCUS HUMOSTRO 1425 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human osteopontin mRNA, complete cds. ACCESSION J04765 KEYWORDS osteopontin; phosphoprotein. SOURCE Human adult osteoblast, cDNA to mRNA, clone HOP-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1425) AUTHORS Young,M.F., Kerr,J.M., Termine,J.D., Wewer,U.M., Wang,M.G., McBride,O.W. and Fisher,L.W. TITLE cDNA cloning, mRNA distribution and heterogeneity, chromosomal location, and RFLP analysis of human osteopontin (OPN) JOURNAL Genomics 7, 491-502 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.F.Young, 04-MAY-1990. FEATURES from to/span description pept 68 1012 osteopontin precursor /hgml_locus_uid="LK0223T" /nomgen="OPN" /map="unassigned" sigp 68 115 osteopontin signal peptide matp 176 1012 osteopontin mRNA < 1 1425 osteopontin mRNA BASE COUNT 449 a 293 c 296 g 387 t ORIGIN Chromosome 4q13. 1 gaccagactc gtctcaggcc agttgcagcc ttctcagcca aacgccgacc aaggaaaact 61 cactaccatg agaattgcag tgatttgctt ttgcctccta ggcatcacct gtgccatacc 121 agttaaacag gctgattctg gaagttctga ggaaaagcag ctttacaaca aatacccaga 181 tgctgtggcc acatggctaa accctgaccc atctcagaag cagaatctcc tagccccaca 241 gaatgctgtg tcctctgaag aaaccaatga ctttaaacaa gagacccttc caagtaagtc 301 caacgaaagc catgaccaca tggatgatat ggatgatgaa gatgatgatg accatgtgga 361 cagccaggac tccattgact cgaacgactc tgatgatgta gatgacactg atgattctca 421 ccagtctgat gagtctcacc attctgatga atctgatgaa ctggtcactg attttcccac 481 ggacctgcca gcaaccgaag ttttcactcc agttgtcccc acagtagaca catatgatgg 541 ccgaggtgat agtgtggttt atggactgag gtcaaaatct aagaagtttc gcagacctga 601 catccagtac cctgatgcta cagacgagga catcacctca cacatggaaa gcgaggagtt 661 gaatggtgca tacaaggcca tccccgttgc ccaggacctg aacgcgcctt ctgattggga 721 cagccgtggg aaggacagtt atgaaacgag tcagctggat gaccagagtg ctgaaaccca 781 cagccacaag cagtccagat tatataagcg gaaagccaat gatgagagca atgagcattc 841 cgatgtgatt gatagtcagg aactttccaa agtcagccgt gaattccaca gccatgaatt 901 tcacagccat gaagatatgc tggttgtaga ccccaaaagt aaggaagaag ataaacacct 961 gaaatttcgt atttctcatg aattagatag tgcatcttct gaggtcaatt aaaaggagaa 1021 aaaatacaat ttctcacttt gcatttagtc aaaagaaaaa atgctttata gcaaaatgaa 1081 agagaacatg aaatgcttct ttctcagttt attggttgaa tgtgtatcta tttgagtctg 1141 gaaataacta atgtgtttga taattagttt agtttgtggc ttcatggaaa ctccctgtaa 1201 actaaaagct tcagggttat gtctatgttc attctataga agaaatgcaa actatcactg 1261 tattttaata tttgttattc tctcatgaat agaaatttat gtagaagcaa acaaaatact 1321 tttacccact taaaaagaga atataacatt ttatgtcact ataatctttt gttttttaag 1381 ttagtgtata ttttgttgtg attatctttt tgtggtgtga ataag // LOCUS VSVGLYPA 1657 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 84-PN-H) glycoprotein gene, complete cds. ACCESSION M35207 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-PN-H, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 482 a 364 c 364 g 447 t ORIGIN 1 aacagagatc aatctgtttc ctcgacacca tgaagtgcct tttgtactta gctctgttat 61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggtaattgga 121 aaaatgttcc ttccagttac cattactgcc catcaagctc agatttgaat tggcataatg 181 gcctaatagg cacagcctta caggtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac 301 cgaaatatat aacacattcc atccggtcct tcacaccatc cgtggagcaa tgcaaggaaa 361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactccc catcatgtac 481 ttgtcgatga gtacacggga gaatgggttg attcccaatt catcaacggg aaatgcagca 541 atgacatctg ccctactgtg cataactcca caacctggca ttccgattac aaagttaaag 601 ggctgtgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggggagc 661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgct tacgaaactg 721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg 781 tttggttcga aatggcagac aaagatctgt ttgctgcagc aaaattccca gaatgtccag 841 aagggtcaag catatctgct ccatctcaga catcagtgga cgtgagtctc atccaggatg 901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc 961 ttcccatctc tccggtagac ctaagctatc ttgctcctaa aaatccagga accggcccag 1021 ctttcacaat catcaatggt actctgaaat attttgagac caggtacatc agagtcgata 1081 ttgctgctcc aattctccca agaatggttg gaacaatcag tggaactacc acagaaagag 1141 aactgtggga agactgggca ccatatgaag atgtggaaat cgggcctaat ggagttctga 1201 aaaccagttc aggatataaa tttcctttat acatgattgg acatggcatg ctggactctg 1261 atctccatct cagctcgaag gctcaagtgt ttgaacaccc tcacattcag gatgctgcat 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttgtcc aaaaatccaa 1381 tcgagctcgt agaagggtgg ttcagcggtt ggaagagctc cattgcttct tttttcttta 1441 tcataggttt gatcattgga ttattcttgg tcctccgagt tggtatctat ctttgcatca 1501 agttaaaaca gaccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaaccaga atcggacacg acagagtaat catatttaaa ccaaatcgac ttgtgatatc 1621 atgctccaag aggctttaat tatgtttcaa ttttatg // LOCUS VSVGLYPB 1657 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 85-CR-B1) glycoprotein gene, complete cds. ACCESSION M35208 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-CR-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 487 a 358 c 366 g 446 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagtcttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPC 1657 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 87-CR-B2) glycoprotein gene, complete cds. ACCESSION M35209 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-CR-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 489 a 358 c 364 g 446 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPD 1657 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 84-CR-B) glycoprotein gene, complete cds. ACCESSION M35210 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-CR-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 488 a 357 c 365 g 447 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 gattgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPE 1665 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 87-OA-B) glycoprotein gene, complete cds. ACCESSION M35211 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-OA-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 476 a 364 c 358 g 467 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga 1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcgtct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta 1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta 1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg // LOCUS VSVGLYPF 1665 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 87-VC-B) glycoprotein gene, complete cds. ACCESSION M35212 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-VC-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 476 a 362 c 358 g 469 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccttg caagttaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga 1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcgtct tagctcaaag gctcaggtat ttgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctcc tttttcttta 1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta 1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg // LOCUS VSVGLYPG 1657 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 84-PN-B1) glycoprotein gene, complete cds. ACCESSION M35213 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-PN-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 488 a 356 c 366 g 447 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttctggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcggg tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccga 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctatgcatca 1501 acttacggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgtttgaa ttgtatg // LOCUS VSVGLYPH 1665 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain Mudd-Summers) glycoprotein gene, complete cds. ACCESSION M35214 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain Mudd-Summers, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 477 a 366 c 353 g 469 t ORIGIN 1 aacagagatc gatctgtttc cttgacacca tgaagtgcct tttgtactta gcttttttat 61 tcatcggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttagtagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtacggac 301 cgaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag cgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttccgactat aaggtcaaag 601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaagaag ggcacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg 1021 tctttaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg 1141 tactgtggga tgactgggct ccatatgaag acgtgggaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt ttgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatggtgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagtttgt agaaggttgg ttcagtagtt ggaagagctc tattgcctct tttttcttta 1441 ccatagggtt aatcattgga ctattcttgg ttctccgagt tggtatttat ctttgcatta 1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 cgtaactcaa atcctgcaca acagattctt catgtttgaa ccaaatcaac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttaattt ttatg // LOCUS VSVGLYPI 1665 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 86-DF-P) glycoprotein gene, complete cds. ACCESSION M35215 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-DF-P, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 477 a 363 c 357 g 468 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttccaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccttg caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 caaagtatat aacacattcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaacccagg tttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggttg attcacagtt catcgacgga aaatgcagca 541 atgacatatg ccccactgtc cataactcca caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctt atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct aggaaaggag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtctc attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ctttcaccat aatcaatggg accctaaaat actttgaaac cagatacatc agggtcgaga 1081 ttgctgctcc gatcctctca agaatggtcg gaatgatcag tggaactacc accgagaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtat tcgaacatcc tcacattcaa gacgctgctt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcctct tttttcttta 1441 tcatagggtt gatcattgga ctattcttgg ttctccgagt tagtatttat ctttgcatta 1501 aattaaaaca cactaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgcttgaa ccaaatcgac ttgtgatatc 1621 atgctcaaag aggccttaat tatattttaa tttttagttt ttatg // LOCUS VSVGLYPJ 1788 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 82-HD-B) glycoprotein gene, complete cds. ACCESSION M35216 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 82-HD-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1788) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 508 a 362 c 344 g 574 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat 1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg // LOCUS VSVGLYPK 1788 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 83-HD-B1) glycoprotein gene, complete cds. ACCESSION M35217 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 83-HD-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1788) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 508 a 362 c 344 g 574 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgtc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcatcgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaagag 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 attaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttat tttttatttt tattttttat 1741 ttttattttt tatcttttta ttttttattt ttatttttat tttttatg // LOCUS VSVGLYPL 1815 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 83-HD-B2) glycoprotein gene, complete cds. ACCESSION M35218 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 83-HD-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1815) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 512 a 358 c 346 g 599 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gattgctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatt agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcactat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggcctaatt catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taattttatt tttttatttt tattttttat tttttatttt ttatttttta 1741 ttttttattt tttattttta ttttttatct ttttattttt tattttttat ttttattttt 1801 atttttattt ttatg // LOCUS VSVGLYPM 1665 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain San Juan 56-NM-B) glycoprotein gene, complete cds. ACCESSION M35219 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain San Juan 56-NM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1665) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 484 a 364 c 355 g 462 t ORIGIN 1 aacagagatc gatctgtttc cttgacacta tgaagtgcct tttgtactta gcctttttat 61 tcattggggt gaattgcaag ttcaccatag tttttccaca caaccaaaaa ggaaactgga 121 aaaatgttcc ttctaattac cattattgcc cgtcaagctc agatttaaat tggcataatg 181 acttaatagg cacagccata caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg tcactacttg tgatttccgc tggtatggac 301 cgaagtatat aacacagtcc atccgatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca aacgaaacaa ggaacttggc tgaatccagg cttccctcct caaagttgtg 421 gatatgcaac tgtgacggat gccgaagcag tgattgtcca ggtgactcct caccatgtgc 481 tggttgatga atacacagga gaatgggttg attcacagtt catcaacgga aaatgcagca 541 attacatatg ccccactgtc cataactcta caacctggca ttctgactat aaggtcaaag 601 ggctatgtga ttctaacctc atttccatgg acatcacctt cttctcagag gacggagagc 661 tatcatccct gggaaaggag ggcacagggt tcagaagtaa ctactttgct tatgaaactg 721 gaggcaaggc ctgcaaaatg caatactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aaggatctct ttgctgcagc cagattccct gaatgcccag 841 aagggtcaag tatctctgct ccatctcaga cctcagtgga tgtaagtcta attcaggacg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttccaatctc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggtcctg 1021 ctttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgatcag tggaactacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgaag acgtggaaat tggacccaat ggagttctga 1201 ggaccagttc aggatataag tttcctttat acatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcaa gacgctgctt 1321 cgcaacttcc tgatgatgag agtttatttt ttggtgatac tgggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtagtt ggaaaagctc tattgcctct tttttcttta 1441 tcatagggtt aatcattgga ctattcttgg ttctccgagt tggtatccat ctttgcatta 1501 aattaaagca caccaagaaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgga ccaaatcaac ttgtgatacc 1621 atgctcaaag aggcctcaat tatatttgag tttttaattt ttatg // LOCUS VSVGLYPN 1812 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 85-ES-B1) glycoprotein gene, complete cds. ACCESSION M35220 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-ES-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1812) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 512 a 362 c 347 g 591 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt 1741 tttatttttt attttttatt tttatttttt atctttttat ttcttatttt tatttttatt 1801 tttattttta tg // LOCUS VSVGLYPO 1861 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 86-ES-B2) glycoprotein gene, complete cds. ACCESSION M35221 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-ES-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1861) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 520 a 363 c 347 g 631 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt 1741 ttttattttt tattttttta ttttttattt tttatttttt atttttattt tttatttttt 1801 tattttttta ttttttattt ttatttttat ttttattttt atttttattt ttatttttat 1861 g // LOCUS VSVGLYPP 1802 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 84-GM-B) glycoprotein gene, complete cds. ACCESSION M35222 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-GM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1802) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 506 a 362 c 348 g 586 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat 61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta 1741 ttttttattt ttatttttta tctctttatt ttttattttt atttttattt ttatttttta 1801 tg // LOCUS VSVGLYPQ 1868 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 87-ES-B) glycoprotein gene, complete cds. ACCESSION M35223 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-ES-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1868) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 520 a 362 c 347 g 639 t ORIGIN 1 aacagagatc aatctgtttt cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat tacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct catcatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catatttcaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt 1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt 1801 atttttttat ttttttattt tttattttta tttttatttt tatttttatt tttattttta 1861 tttttatg // LOCUS VSVGLYPR 1860 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 86-ES-B1) glycoprotein gene, complete cds. ACCESSION M35224 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-ES-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1860) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 520 a 362 c 347 g 631 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gcattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcta caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtccgactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat acttcgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggactagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagactctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttaa tttttaattt tatttttatt tttatttttt attttttatt 1741 ttttattttt tattttttat tttttatttt ttatttttta tttttatttt ttattttttt 1801 atttttttat tttttatttt tatttttatt tttattttta tttttatttt tatttttatg // LOCUS VSVGLYPS 1657 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 69-PN-L) glycoprotein gene, complete cds. ACCESSION M35225 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 69-PN-L, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 487 a 358 c 364 g 448 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcattggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattattgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgccacgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 ctaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca agtgactcct catcatgtat 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggagaac 661 tctcatccct agggaaggaa ggcacgggat tcagaagcaa ttactttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgac aaagatctat ttgctgcagc aaaattcccg gagtgtcccg 841 aaggatcgag tatctccgct ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt ggagcaggtc 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctaaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccacct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg ttcagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttgttcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca ataccacaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPT 1657 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 87-CR-B1) glycoprotein gene, complete cds. ACCESSION M35226 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 87-CR-B1, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 490 a 358 c 363 g 446 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtacggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccagg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gatggggaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg aatcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agaacaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatacgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcat cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttaga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPU 1657 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 85-CR-B2) glycoprotein gene, complete cds. ACCESSION M35227 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-CR-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 486 a 357 c 367 g 447 t ORIGIN 1 aacagagatc aatctgtttc actgacacca tgaagtgcct tttgtactta gctcttttat 61 tcataggagt gaactgtaaa ttcaccatag tattcccaca caatcagaaa gggaattgga 121 aaaatgttcc ttccggttac cattactgcc cgtccagctc agatttaaat tggcacaacg 181 gcttgatagg cacagcttta caagtcaaga tgcccaagag tcacaaggct attcaggcag 241 acggttggat gtgccatgct tctaaatggg ttactacttg cgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcacaccatc agtggaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacatggc tgaatccggg ttttcctcct cagagttgcg 421 gatatgcaac tgtgacggat gctgaagccg tgattgtcca agtgactcct catcatgtac 481 tggttgatga gtacacagga gagtgggtcg attcccagtt catcaatgga aaatgcagca 541 acgacatatg ccccactgta cataattcca caacctggca ttcagattac aaagtgaaag 601 ggttgtgtga tgccaacctc atctccatgg acatcacttt tttctcagag gaaggagaac 661 tttcatccct agggaaagaa ggcacgggat tcagaagcaa ttattttgct tacgagactg 721 gagacaaagc ctgtagaatg cagtattgca agcattgggg agtcagactc ccgtcaggtg 781 tttggtttga aatggctgat aacgatctgt ttgctgcagc aaaattccca gaatgtcccg 841 aaggatcgag tatctccgcc ccatctcaga catcagtgga tgtaagtctc attcaggatg 901 ttgagagaat cctggattac tctctttgcc aagaaacctg gagcaagatt agagcaggta 961 ttcccatttc cccagtggat ctaagctatc tcgctcctaa aaacccaggg acaggcccag 1021 cctttacgat aatcaacggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattcttcca agaatggtcg ggaccatcag tggaaccacc acagaaaggg 1141 aactgtggga tgactgggca ccatatgagg acgtggaaat tggtccaaac ggagttttga 1201 aaaccagctc aggatataaa tttcctttat atatgattgg acatggtatg ttggactccg 1261 atctccgcct cagttcaaag gcccaagtgt tcgaacaccc tcacattcag gacgctgcat 1321 cacagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaacccaa 1381 ttgagctcgt agaaggttgg tttagtggtt ggaagagctc cattgcttct tttttcttta 1441 tcgtagggtt gataattgga ttattcttgg tgctccgagt tggtatctat ctctgcatca 1501 acttaaggca caccaaaaaa agaaaaattt acacagacat tgagatgaat cgacttggaa 1561 agtaatcaca atactgcaca acagaataat catatttgga ccaaatcgac ttgtgatatc 1621 atgctttaag aggctttaac gatgttttga ttgtatg // LOCUS VSVGLYPV 1796 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 86-GM-B) glycoprotein gene, complete cds. ACCESSION M35228 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 86-GM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1796) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 503 a 362 c 349 g 582 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat 61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttcccattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat agtcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttta ttttttattt tttattttta 1741 ttttttattt tttattttta ttttttatct ttttattttt atttttattt tttatg // LOCUS VSVGLYPW 1800 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 85-GM-B) glycoprotein gene, complete cds. ACCESSION M35229 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-GM-B, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1800) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 506 a 362 c 348 g 584 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtacttg gctttgttat 61 tcattggggt gtattgtaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tccaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacctggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gacggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgccccg 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga tgactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttat ctttgcatta 1501 aattaaagca caccaataaa agacagattt atgcagacat agagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggtcttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taatttttat tttttatttt tattttttta ttatttttta ttttttattt 1741 ttatttttta tttttatttt ttcatctttt tattttttat ttttattttt attttttatg // LOCUS VSVGLYPX 1652 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 59-PN-L) glycoprotein gene, complete cds. ACCESSION M35230 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 59-PN-L, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1652) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 487 a 344 c 367 g 453 t 1 others ORIGIN 1 aacagagatc aatctgtctt cctagcatca tgaaatgnat tttgtacttc gcttttctcc 61 tggtaggagt gaactgtaaa tttacaatag ttttcccaca taatcaaaaa gggaactgga 121 aaaatgtgcc atccagttac cattattgtc catctagttc cgatcttaat tggcacaatg 181 gcctgattgg tacagcatta caagtcaaga tgcccaaaag tcacaaagca attcaggcag 241 acggctggat gtgtcatgca tccaaatggg tcaccacatg tgacttcagg tggtacggac 301 ccaaatatat tacgcattct attcggtcat tcacaccctc tgtggaacag tgcagagaaa 361 gcattgaaca gaccaaacaa ggaatctgga taaatcctgg ctttcctcct cagagttgtg 421 gatatgcaac tgtgacggac gcggaagcag taattgtcca ggtcacccca catcatgtgc 481 tcgtagatga gtacacaggc gaatgggtag attcccaatt tttgaacgga aagggcaaag 541 acgacatatg ccccacagtc cataattcga ccacatggca ttcagactac aaagtcaaag 601 ggctttgtga tgctaaccta atctcaatgg acatcacttt cttctcagaa gatggtgagc 661 tgtcctcctt gggcaaagat gggactggat tcagaagcaa ttactttgct tatgaaaatg 721 gagataaggc ttgccgcatg cagtattgca aacattgggg agtcagactc ccttctggag 781 tctggttcga gatggcagac aaagatttgt ttgcaggagc caagttcccc gagtgtccag 841 acggaaccag tatctctgct ccttctcaga catccgtaga tgtaagctta attcaggacg 901 tcgaaagaat cttggactat tccttgtgtc aagagacttg gagtaaaatc aaggcaggcc 961 ttccgatttc tccagttgat ctaagctatc tcgctccaaa aaatccaggg acaggtcccg 1021 ctttcaccat catcaatggg accttgaagt actttgagac cagatacatc agagtggaca 1081 ttgctgctcc aatcataccc agaatggtgg gaactatcag tggaacatct acagaaagag 1141 aattgtggga tgactggtac tcttatgaag atgtggagat tgggcccaat ggagtattga 1201 aaacaacctc aggatataaa tttcctcttt atatgattgg acacggaatg ctggactctg 1261 atcttcacct gacatcgaag gcccaggttt ttgaacatcc ccacattcag gacgctgcat 1321 cacagcttcc tgacggcgag actttatttt ttggtgatac cgggctctca aaaaacccca 1381 ttgaattagt agagggatgg ttcagtgggt ggaaaagctc tattgcttcg ttcttctttg 1441 tcattggatt gatcattgga ttatatttgg ttttaagggt gggaatatat ctttgtctta 1501 aactaaagca tacaaaaaag agaaaaattt acacagacat tgagatgaac agacttggaa 1561 aataaccaaa attgtgtgat ggatttgctt ttttgccaaa gaagcctgtg atatccttct 1621 cagagaagct ttaatctttt attgacaata tg // LOCUS VSVGLYPY 1657 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 84-PN-B2) glycoprotein gene, complete cds. ACCESSION M35231 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 84-PN-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1657) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 489 a 367 c 357 g 444 t ORIGIN 1 aacagagatc catctgtttc ctcgacacca tgaagtgcct tttgtactta gctcttttat 61 tcattggggt gaattgtaaa ttcaccatag tttttccaca caatcaaaag ggaaattgga 121 aaaatgttcc ttccagttac cattactgcc cgtcaagttc agatttgaac tggcataacg 181 gcctaatagg cacagcccta caggtcaaga tgcccaagag tcacaaggct attcaagcag 241 acggttggat gtgtcatgct tccaaatggg ttactacttg cgacttccgc tggtatggac 301 cgaaatatat aacacattcc atccggtcct tcacaccatc tgtggagcaa tgcaaggaaa 361 gcattgaaca gacgaaacag ggaacatggc tgaatccagg ctttcctcct caaagttgcg 421 gatatgcaac tgtgacggat gctgaagcag tgattgtcca ggtaactccc catcatgtac 481 ttgtcgatga gtacacggga gagtgggtag attcccaatt catcaacggg aaatgcagca 541 atgacatatg ccctactgtc cataactcca caacctggca ttccgactac aaagtcaaag 601 ggctgtgtga ttctaacctc atctccatgg acatcacctt cttctcagaa gatggggagc 661 tatcatctct agggaaagag ggcactggat tcaggagtaa ctactttgca tacgaaacag 721 gagacaaggc ctgcagaatg cagtactgca agcattgggg agtcagactc ccttcaggtg 781 tctggtttga aatggccgac aaagatctgt ttgctgcagc aaaattccca gaatgtccag 841 aagggtcaag catctctgct ccatctcaga catcagtgga cgtgagtctt attcaggacg 901 ttgagagaat cctggattat tccctttgtc aagaaacctg gagcaagatc aaagcaggtc 961 ttcccatctc tccggtagat ctaagctatc ttgctccaaa aaacccagga accgggccag 1021 cctttacaat aatcaatggt actctgaaat actttgagac cagatacatc agagtcgata 1081 ttgctgctcc aattctccca agaatggtcg gaacaatcag tggaactacc accgagagag 1141 aattgtggga cgactgggca ccatatgaag atgttgaaat cggaccaaat ggagttctga 1201 aaaccagttc aggatataaa ttccctttat atatgattgg acatggcatg ctggactctg 1261 atctccatct cagctcaaag gctcaggtat ttgaacaccc tcacattcag gatgctgcat 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac cgggttatcc aaaaatccaa 1381 tcgagctcgt agaagggtgg ttcagtggtt ggaaaagctc cattgcttct tttttcttta 1441 tcataggttt gatcattgga ttattcttgg ttctccgagt tggtatctat ctctgcatca 1501 agttaaaaca taccaaaaaa agaaaaatct atacagacat tgagatgaat cgactcggaa 1561 agtaactgga atcagacacg acagaataat catatttaaa ccaaatcgac ttgtgatatc 1621 atgctctaag aggctttaat tatgttttaa ttttatg // LOCUS VSVGLYPZ 1812 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 85-ES-B2) glycoprotein gene, complete cds. ACCESSION M35232 KEYWORDS glycoprotein; surface antigen. SOURCE Vesicular stomatitis virus (strain 85-ES-B2, Indiana serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1812) AUTHORS Bilsel,P.A. and Nichol,S.T. TITLE Polymerase errors accumulating during natural evolution of the glycoprotein gene of vesicular stomatitis Indiana serotype viruses JOURNAL J. Virol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.T.Nichol, 14-JUN-1990. FEATURES from to/span description pept 30 1565 glycoprotein BASE COUNT 513 a 360 c 346 g 593 t ORIGIN 1 aacagagatc aatctgtttc cttgacacta tgaagtgcct tttgtactta gctttgttat 61 tcattggggt gtattgcaag ttcaccacag ttttccctca caacaaaaaa ggagactgga 121 aaaatgttcc ttccaattac cattattgcc catcaagctc agatttaaat tggcataatg 181 acttaatagg cacagcctta caagtcaaaa tgcccaagag tcacaaggct attcaagcag 241 atggttggat gtgtcatgct tcaaagtggg tcactacttg tgatttccgc tggtatggac 301 cgaaatatat aacacattcc atcagatcct tcactccatc tgtagaacaa tgcaaggaaa 361 gcattgaaca gacaaaacaa ggaacttggc tgaatccagg cttccctccc caaagttgcg 421 gatacgcaac tgtgacagat gctgaagcag tgattgtcca ggtgactcct caccatgtgc 481 ttgttgatga atacacagga gaatgggtcg attcacagtt catcaacgga aaatgcagtg 541 atgatatatg ccccactgtt cataactcca caacctggca ttctgactat aaggttaaag 601 gactatgtga ttctaacctc atttccatgg acatcacctt cttctcagaa gatggagagc 661 tatcatccct aggaaaagag ggtacagggt tcagaagtaa ctactttgct tatgaaactg 721 gagacaaggc ctgcaaaatg cagtactgca agcattgggg agtcagactc ccatcaggtg 781 tctggttcga gatggctgat aagaatctct ttgctgcagc caaattccct gaatgcccag 841 aaggttcaag tatctctgct ccttctcaga cctcagtgga tgtgagtctc attcaggatg 901 ttgagaggat cttggattat tccctctgcc aagaaacctg gagcaaaatc agagcgggtc 961 ttcccatatc tccagtggat ctcagctatc ttgctcctaa aaacccagga accggccctg 1021 ccttcaccat aatcaatggt accctaaaat actttgagac cagatacatc agggtcgata 1081 ttgctgctcc aatcctctca agaatggtcg gaatgattag tggaactacc accgaaaggg 1141 aactatggga ggactgggca ccatatgaag acgtggagat tggacccaat ggtgttctga 1201 ggaccagttc aggatataag tttcctttat atatgattgg acatggtatg ttggactccg 1261 atcttcatct tagctcaaag gctcaggtgt tcgaacatcc tcacattcca gacgctactt 1321 cgcagcttcc tgatgatgag actttatttt ttggtgatac agggctatcc aaaaatccaa 1381 tcgagcttgt agaaggttgg ttcagtggtt ggaagagctc tattgcttcc tttttcttta 1441 tcatagggtt aatcattgga ttattctttg ttctccgaat tggtgtttac ctttgcatta 1501 aattaaagca caccaataaa agacagattt atacagacat aaagatgaac cgacttggaa 1561 agtaactcaa atcctgcaca acagattctt catgtttgaa ttaaatccac ttgtgatgca 1621 aagctcaaag aggccttaat catattttaa ttttttaatt tttaattttt aatttttaat 1681 ttttaatttt taattttatt tttttatttt tattttttta ttttttatta ttttttattt 1741 tttatttttt attttttatt tttatttttt atctttttat tttttatttt tatttttatt 1801 tttattttta tg // LOCUS ECODKSA 1273 bp ds-DNA BCT 08-AUG-1990 DEFINITION E.coli dnaK suppressor (dksA) gene, complete cds. ACCESSION M34945 KEYWORDS dnaK suppressor. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1273) AUTHORS Kang,P.J. and Craig,E.A. TITLE Identification and characterization of a new Escherichia coli gene that is a dosage-dependent suppressor of a dnaK deletion mutation JOURNAL J. Bacteriol. 172, 2055-2064 (1990) STANDARD simple staff_review FEATURES from to/span description pept 229 441 ORF 1 pept 619 1074 dnaK suppressor (dksA) BASE COUNT 343 a 301 c 333 g 296 t ORIGIN 1 gacgaaagag gctatcctta atgaatcaat ttcagaactg tcaggctata gctcgctgaa 61 aagcgaagta aaatacggcg cagaacgcag ccgtattgac tttatgttgc aggcggattc 121 gcgtccagac tgctatattg aagtgaaatc ggttacgtta gcggagaacg aacagggata 181 ttttcccgat gcggtcactg aacgaggtca gaaacacttc gggagttgat gagcgtagcg 241 gctgaaggcc agcgtgcggt tatctttttc gccgtgctgc attcagccat tacacggttt 301 tcacccgcgc gccacatcga tgagaaatac gcgcaactat tgtcagaagc tcaacagagg 361 ggggtagaaa ttctggctta caaagcggaa atttctgctg aaggcatggc tcttaaaaaa 421 tcactgccgg ttacattgta gtaaagtaag taactggtta atttacattc tggtcgcgtg 481 cgcaaatacg cttttcctca cacagttgtc aagtgttacg tttagataat tgctatccgg 541 aaaagcatct gctatttata gcggcctcat ttttcccccg aacatgggga tcgatagtgc 601 gtgttaagga gaagcaacat gcaagaaggg caaaaccgta aaacatcgtc cctgagtatt 661 ctcgccatcg ctggggtgga accatatcag gagaagccgg gcgaagagta tatgaatgaa 721 gcccagctgg cgcacttccg tcgtattctg gaagcatggc gtaatcaact cagggatgaa 781 gtcgatcgca ccgttacaca tatgcaggat gaagcagcca acttcccgga cccggtagac 841 cgtgcagccc aggaagaaga gttcagcctc gaactgcgta accgcgatcg cgagcgtaac 901 gtgatcaaaa agatcgagaa gacgctgaaa aaagtggaag acgaagattt cggctactgc 961 gaatcctgcg gtgttgaaat tggtattcgc cgtctggaag cgcgcccgac agccgatctg 1021 tgcatcgact gcaaaacgct ggctgaaatt cgcgaaaaac agatggctgg ctaattacag 1081 ccgttccatc acgtttacca cacggggaaa tcgtcccgcc ttattttttg ttcaaagaga 1141 tgacagacac acagtatatt ggcctgtcgc ccctctcttc cggcgagctt cattttggct 1201 ctctgatcgc tacgctcggc agctatttgc acgtcgcgcc cggcaaggtc gctggctggt 1261 acgcatagaa gat // LOCUS STFCYCLI 2180 bp ss-mRNA INV 08-AUG-1990 DEFINITION Starfish (A.pectinifera) cyclin B (CYC) mRNA, complete cds. ACCESSION M33880 KEYWORDS cyclin B. SOURCE Starfish (A.pectinifera) egg, cDNA to mRNA, clone lambda-gt10-cyc10. ORGANISM Asterina pectinifera Eukaryota; Animalia; Eumetazoa; Echinodermata; Asterozoa; Stelleroidea; Asteroidea; Spinulosida; Asterinidae. REFERENCE 1 (bases 1 to 2180) AUTHORS Tachibana,K., Ishiura,M., Uchida,T. and Kishimoto,T. TITLE The starfish egg mRNA responsible for meiosis reinitiation encodes cyclin JOURNAL Dev. Biol. 140, 241-252 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Kishimoto, 11-MAY-1990. FEATURES from to/span description pept 126 1313 cyclin B (CYC) mRNA < 126 2180 cyclin B mRNA BASE COUNT 650 a 440 c 482 g 608 t ORIGIN 1 ttattatgtt gctcagttct gacctcttta gcaacgtaca tgacgtacat gaagtacacg 61 tatgacgtac atcgtagcga ctgtctgaat ttttcttcga tgactaaaat tcatctggga 121 aaacaatgca gacagcttgt tctggcaatt tgtgtgggta tcaactgatg ttcagtttgt 181 ctactgttgt aactgtatgc agatcactcc gatcccgcaa ccgccactgg tttttgaagc 241 ttttgaggtg tacgtttaac gatcgcatga gatgcgctct ggagaacatc agcaatgtag 301 caaagaacaa tgtacaagct gcagctaaga aggagatcaa acaaaagaga ggaatgacca 361 aatccaaagc tacaagttct ctacagtcgg tcattggtct ccatgtagaa cctgtggaaa 421 aggtccagtc gccagagccc atggacatga gtgaagtcag caatgctctg gaggctttct 481 cacagaacat tcttgagatg ggcgtcgatg acattgacaa agatgaccat gaaaatccac 541 agctgtgcag cgagtacgtc aacgacatct atctatacat gagacatctg gagcgtgagt 601 tcaaagtgag gacagattac atggcaatgc aagagatcac tgagcgtatg agaacgatcc 661 tgattgactg gctggtccaa gtacatctta gattccatct tctacaggaa acactgttcc 721 ttaccatcca gatcctcgac agatacctag agggtgcaag cgtatccaag accaaactcc 781 agctggtcgg tgtgacctcc atgctgattg ctgcctatga agagatgtac gcagagattg 841 gagactttgt ctacatcacg gacaacgctt acagcaaggc acagatccgc gccatggagt 901 gtaacattct ccggaaacta gacttcaatc tgggcaagcc actctgcatt cacttcctca 961 gacgttgctc aaaggctggt ggggttgatg gtcacaagca cacactgtcc aagtacatca 1021 tggagttgac gttacagagt acagctttgt caagtatgac catcgagatt gctgctgcag 1081 ccttgctatc acaagattct gggatgagga tatgtggaat gggaacaaaa tccctggttc 1141 actacagtgc ctacagtgaa ggccacctgg gaccaattgt gcagaagatg gccgtgctat 1201 ctcaacaatc gcacccaagt ccaaattcca ggcttgatca ggaagaagat atggccagca 1261 gcaagttcat gagcgatcag caagctaccc aagaactgaa atcaatcagg tagtcaactg 1321 aatcttgccg acgagaactg ctgagcttcc atccgcccag atgaatggtc atgtaatagt 1381 agtaaatagt agtgtattat agtctttaat taaataacac cccttcagaa gttgacaggt 1441 ttcaacttag tgcatgattt aagcaactcg aggaggtact ccgatttttt ccccccttgg 1501 ttgtcatttt ttaagttggc aagtgcagtt gaatctattt taatcttgta tagatagcaa 1561 tgcttgtact gccatggagg ccaaaggcgt agatagaatt gtgcatgaaa gtacaatgtt 1621 gttgaaatcg ggtggagtgg gattatttga atgatacgct acattttgtg caatgacaga 1681 cgcactacag catgatcgag gtttcaaagt aaaattatgg ctatctaaca ttttgtaagc 1741 attgcatgta taatagcttt ctgcaagtgc aatcagattt ctgatcagag gttcaatgca 1801 taacgtgtca cgaaagccca tctgatcaag cgtaatgtaa aatgaaaagg ggaaattgac 1861 ttctgcaatt tattatgctt ctagaatttt tactcgtcca actttttgtc tgtcgttcat 1921 gacttttgcg ctagatatcc gagaccaatt catttctcca aagaaaaaaa taaacatgag 1981 gttgtttgtc atgaagtttc ccacacaact tcagatgaac agctcatcaa gttgtcagat 2041 ttgcttgttc aaaagttaaa acgaaaaaaa tcatgtctta atgttttatt atttaatatg 2101 taaaattgaa tgattcgtgt tgcagtattt gtacctaaat gcttttgtct gtcagtgttt 2161 gtaataaagt taatggaaat // LOCUS ECOCLPPA 1236 bp ds-DNA BCT 08-AUG-1990 DEFINITION Escherichia coli ATP-dependent clp protease proteolytic component (clpP) gene, complete cds. ACCESSION J05534 KEYWORDS ATP-dependent protease. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1236) AUTHORS Maurizi,M.R., Clark,W.P., Katayama,Y., Rudikoff,S., Pumphrey,J., Bowers,B. and Gottesman,S. TITLE Sequence and structure of ClpP, the proteolytic component of the ATP-dependent Clp protease of Escherichia coli JOURNAL J. Biol. Chem. 265, 12536-12545 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Gottesman, 23-APR-1990. FEATURES from to/span description pept 378 1001 ATP-dependent protease (clpP) mRNA 279 > 1236 clpP mRNA pept < 1 133 ORF signal 244 249 -35 region signal 267 272 -10 region signal 165 193 transcription termination signal binding 367 372 ribosome binding site (put.) BASE COUNT 329 a 291 c 318 g 298 t ORIGIN 1 caaaaacaaa gaactgatgg acaacatgcg caatgttgct ctggaagaac aggctgttga 61 agctgtactg gcgaaagcga aagtgactga aaaagaaacc actttcaacg agctgatgaa 121 ccagcaggcg taatttacgc agcataacgc gctaaattcg cacaaaggcc cgtcaccgcc 181 aggtggtggg cttttttttg tcatgaattt tgcatggaac cgtgcgaaaa gcctctttcg 241 gtgttagcgt aacaacaaaa gattgttatg cttgaaatat ggtgatgccg tacccataac 301 acaggactag ctgataatcc gtccataagg ttacaatcgg tacagcaggt tttttcaatt 361 ttatccagga gacggaaatg tcatacagcg gcgaacgaga taactttgca ccccatatgg 421 cgctggtgcc gatggtcatt gaacagacct cacgcggtga gcgctctttt gatatctatt 481 ctcgtctact taaggaacgc gtcatttttc tgactggcca ggttgaagac cacatggcta 541 acctgattgt ggcgcagatg ctgttcctgg aagcggaaaa cccagaaaaa gatatctatc 601 tgtacattaa ctccccaggc ggggtgatca ctgccgggat gtctatctat gacaccatgc 661 agtttatcaa gcctgatgtc agcaccatct gtatgggcca ggcggcctcg atgggcgctt 721 tcttgctgac cgcaggggca aaaggtaaac gtttttgcct gccgaattcg cgcgtgatga 781 ttcaccaacc gttgggcggc taccagggcc aggcgaccga tatcgaaatt catgcccgtg 841 aaattctgaa agttaaaggg cgcatgaatg aacttatggc gcttcatacg ggtcaatcat 901 tagaacagat tgaacgtgat accgagcgcg atcgcttcct ttccgcccct gaagcggtgg 961 aatacggtct ggtcgattcg attctgaccc atcgtaattg atgccagagg cgcaactgtg 1021 ccgctatact tatccagggc ggcacaacgc tgtaagcgct tgcgcctgag aatggcattt 1081 gcgtcgtcgt gtgcggcaca aagaacaaag aagaggtttt gacccatgac agataaacgc 1141 aaagatggct caggcaaatt gctgtattgc tctttttgcg gcaaaagcca gcatgaagtg 1201 cgcaagctga ttgccggtcc atccgtgtat atctgc // LOCUS HAMACAPT 1837 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA, partial cds. ACCESSION J05590 M22755 KEYWORDS N-acetylglucosamine-1-phosphate transferase; glycosyl transferase; tunicamycin-resistance. SOURCE Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone TRG-[4,22]. ORGANISM Cricetulus longicaudatus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 889 to 960) AUTHORS Lehrman,M.A., Zhu,X. and Khounlo,S. TITLE Amplification and molecular cloning of the hamster tunicamycin- sensitive N-acetylglucosamine-1-phosphate transferase gene: The hamster and yeast enzymes share a common peptide sequence JOURNAL J. Biol. Chem. 263, 19796-19803 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 1837) AUTHORS Zhu,X. and Lehrman,M.A. TITLE Cloning, sequence and expression of a cDNA encoding hamster UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate transferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and sequence for [1] kindly submitted by M. Lehrman, 04-APR-1989. Draft entry and computer-readable sequence for [1] kindly submitted by M.Lehrman, 12-JUN-1990. FEATURES from to/span description pept 148 1374 N-acetylglucosamine-1-phosphate transferase (GPT) binding 67 79 dolichol binding site binding 222 234 dolichol binding site BASE COUNT 363 a 515 c 418 g 541 t ORIGIN 1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg 61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg 121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg 181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc 241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg 301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc 361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc 421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg 481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct 541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta 601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc 661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat 721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta 781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt 841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga 901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc 961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct 1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg 1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc 1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag 1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt 1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc 1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc 1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca 1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt 1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt 1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc 1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat 1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag 1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc 1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt // LOCUS HS4NA3ABC 10089 bp ds-DNA VRL 08-AUG-1990 DEFINITION Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C genes, complete cds. ACCESSION M34440 KEYWORDS nuclear antigen. SOURCE Epstein-Barr virus type 2 (isolate AG876) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 10089) AUTHORS Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E., Rickinson,A. and Kieff,E. TITLE Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their EBNA-3A, EBNA-3B and EBNA-3C genes JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.T.Sample, 23-MAY-1990. Kieff Laboratory Harvard Medical School 75 Francis Street Boston, MA 02115 FEATURES from to/span description pept 189 524 nuclear antigen EBNA-3A, exon 1 610 3051 nuclear antigen EBNA-3A, exon 2 pept 3237 3593 nuclear antigen EBNA-3B, exon 1 3672 6155 nuclear antigen EBNA-3B, exon 2 pept 6277 6636 nuclear antigen EBNA-3C, exon 1 6711 9560 nuclear antigen EBNA-3C, exon 2 IVS 525 609 EBNA-3A intron IVS 3594 3671 EBNA-3B intron IVS 6637 6710 EBNA-3C intron BASE COUNT 2495 a 2837 c 2619 g 2138 t ORIGIN 1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct 61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag 121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg 181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag 241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca 301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc 361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac 421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc 481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg 541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt 601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat 661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg 721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg 781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact 841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg 901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg 961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc 1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct 1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac 1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat 1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg 1261 tacgggcaag acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca 1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg 1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg 1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct 1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa 1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac 1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc 1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc 1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg 1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg 1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa 1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca 1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca 2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag 2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc 2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac 2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg 2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa 2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg 2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg 2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc 2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg 2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg 2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa 2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg 2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct 2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat 2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt 2941 ggccagaatg tcacagacca tgaacctaga agggtggtgg tgtcagctat tgttcacatg 3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg 3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat 3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc 3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga 3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg 3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac 3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag 3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca 3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc 3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag 3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc 3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga 3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac 3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct 3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata 3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa 3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag 4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt 4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga 4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata 4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag 4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct 4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg 4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa 4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg 4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca 4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag 4621 ccagaacaga gcaaccaaga gccaagcctg attcccctgc ccccacagtg gtccttcggc 4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg 4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca 4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca 4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga 4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt 4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac 5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc 5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac 5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct 5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc 5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc 5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc 5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc 5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc 5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc 5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag 5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta 5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg 5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac 5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac 5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc 5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct 6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt 6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt 6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg 6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta 6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg 6301 gactctatac agtcacctga caatgcgcgg ggagatgatg tacagaatac tggtgagcat 6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac 6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc 6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta 6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca 6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc 6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc 6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct 6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg 6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat 6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca 6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg 7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag 7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga 7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc 7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt 7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta 7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc 7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg 7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag 7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc 7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac 7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag 7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg 7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag 7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt 7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga 7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta 7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca agtaagccac 8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc 8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc 8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc 8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc 8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc 8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg 8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac 8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg 8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg 8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata 8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc 8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact 8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc 8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt 8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag 8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc 9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac 9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc 9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc 9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac 9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg 9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag 9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa 9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg 9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca 9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga 9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta 9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca 9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg 9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg 9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg 9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag 9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc 10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt 10081 ggaggatcc // LOCUS HUMPAPB 3061 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human prostatic acid phosphatase mRNA, complete cds. ACCESSION M34840 KEYWORDS glycoprotein; prostatic acid phosphatase; secretory protein. SOURCE Human prostate, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1199) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3061) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.L.Van Etten, 08-JUN-1990. Chemistry Department Purdue University West Lafayette, IN 47907 FEATURES from to/span description pept 15 1175 prostatic acid phosphatase precursor (EC 3.1.3.2) /hgml_locus_uid="LA0168F" /nomgen="ACPP" /map="3q21-qter" sigp 15 110 prostatic acid phosphatase signal peptide matp 111 1172 prostatic acid phosphatase BASE COUNT 917 a 642 c 681 g 821 t ORIGIN Chromosome 3q21-qter. 1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc 61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga 121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg 181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg 241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt 301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg 361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc 421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc 481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg 541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag 601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg 661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga 721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat 781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc 841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc 901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga 961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc 1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt 1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa 1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag 1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc 1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca 1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt 1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa 1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt 1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt 1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga 1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc 1681 aaggagaggc aaagaaagga gatacagtgg agacatctgg aaagttttct ccactggaaa 1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt 1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat 1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata 1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga 1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag 2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag 2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt 2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta 2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca 2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa 2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc 2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac 2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa 2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat 2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac 2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc 2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc 2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat 2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc 2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat 2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta 3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa 3061 a // LOCUS LDV3ECPA 1064 bp ss-RNA VRL 08-AUG-1990 DEFINITION Lactate dehydrogenase-elevating virus capsid protein gene, complete cds. ACCESSION M34856 KEYWORDS viral capsid protein. SOURCE Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones A16, B24, B63, B90, B103, C44, dt4. ORGANISM Lactate dehydrogenase-elevating virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae. REFERENCE 1 (bases 1 to 1064) AUTHORS Godeny,E.K., Speicher,D.W. and Brinton,M.A. TITLE Map location of lactate dehydrogenase-elevating virus (LDV) capsid protein (Vp1) gene JOURNAL Virology (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.K.Godeny, 06-JUN-1990. FEATURES from to/span description pept 637 984 capsid protein pept 135 650 unidentified ORF BASE COUNT 267 a 223 c 265 g 309 t ORIGIN 3' terminus region. 1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc 61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc 121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt 181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg 241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt 301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg 361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc 421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg 481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta 541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga 601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc 661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat 721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc 781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga 841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt 901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat 961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg 1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt // LOCUS MBIFRHADGB 3903 bp ds-DNA BCT 08-AUG-1990 DEFINITION M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase (frhADGB) gene, complete cds. ACCESSION J02914 KEYWORDS 8-hydroxy-5-deazaflavin-reducing hydrogenase. SOURCE M.thermoautotrophicum (strain delta-H) DNA. ORGANISM Methanobacterium thermoautotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 3903) AUTHORS Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T. TITLE Cloning, sequence determination, and expression of the genes encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin reducing hydrogenase from Methanobacterium thermoautotrophicum delta-H JOURNAL Biochemistry (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.Alex, 25-MAY-1990. FEATURES from to/span description pept 298 1515 F420 hydrogenase alpha-subunit (frhA) pept 1524 2000 F420 hydrogenase delta-subunit (frhD) pept 2114 2824 F420 hydrogenase gamma-subuint (frhG) pept 2837 3682 F420 hydrogenase beta-subunit (frhB) binding 284 293 ribosomal binding site (put.) binding 1514 1520 ribosomal binding site (put.) binding 2100 2106 ribosomal binding site (put.) binding 2823 2832 ribosomal binding site (put.) BASE COUNT 1201 a 878 c 965 g 859 t ORIGIN 1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta 61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac 121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt 181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt 241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg 301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg 361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg 421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg 481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc 541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac 601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac 661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt 721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac 781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa 841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc 901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa 961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata 1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca 1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt 1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg 1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta 1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag 1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca 1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc 1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa 1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg 1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt 1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg actggcggtc 1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg 1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg 1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa 1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa 1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg 1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc 2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga 2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag 2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact 2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga 2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga 2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac 2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt 2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc 2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc 2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt 2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct 2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg 2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct 2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac 2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata 2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg 3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa 3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga 3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta 3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat 3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat 3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga 3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg 3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg 3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga 3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga 3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat 3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt 3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa 3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt 3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat 3901 cga // LOCUS MUSLRCNA 1751 bp ds-DNA ROD 08-AUG-1990 DEFINITION Mouse loricrin gene, complete cds. ACCESSION M34398 KEYWORDS cell envelope protein; loricrin. SOURCE Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1751) AUTHORS Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D., Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M., Yuspa,S.H. and Roop,D.R. TITLE Identification of a major keratinocyte cell envelope protein, loricrin JOURNAL Cell 61, 1103-1112 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Rothnagel, 22-MAY-1990. FEATURES from to/span description pept 71 1516 loricrin signal 1740 1745 polyA signal BASE COUNT 223 a 493 c 706 g 329 t ORIGIN 1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc 61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg 121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg 181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg 241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg 301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg 361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg 421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg 481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg 541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg 601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg 661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc 721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg 781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg 841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg 901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg 961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg 1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg 1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc 1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg 1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag 1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg 1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg 1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg 1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac 1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc 1561 cgtgggcgcc gatgggctta actttctcat gaatttgcct gaggtttcca aacccttcac 1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct 1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca 1741 ataaatttgc t // LOCUS PFAGIPA 1906 bp ds-DNA INV 08-AUG-1990 DEFINITION P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete cds. ACCESSION J05544 KEYWORDS glucosephosphate isomerase. SOURCE P.falciparum (strain 7G8) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1906) AUTHORS Kaslow,D.C. and Hill,S. TITLE Cloning metabolic pathway genes by complementation in Escherichia coli: Isolation and expression of Plasmodium falciparum glucosephosphate isomerase JOURNAL J. Biol. Chem. 265, 12337-12341 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.C.Kaslow, 04-JUN-1990. FEATURES from to/span description pept 28 1803 glucosephosphate isomerase (EC 5.3.1.9) BASE COUNT 782 a 174 c 289 g 661 t ORIGIN Chromosome 14. 1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat 61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat 121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct 181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa 241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc 301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata 361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat 421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg 481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa 541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat 601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat 661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct 721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt 781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg 841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta 901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga 961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat 1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat 1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca 1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa 1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat 1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa 1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg 1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa 1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt 1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta 1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt 1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat 1681 gacacaagaa atcaaaagaa atcagataat acctataatt ttaatgaatc tacaaaaaat 1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag 1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt 1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt // LOCUS RATGADA 3198 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Rat glutamic acid decarboxylase (GAD) mRNA, complete cds. ACCESSION M34445 KEYWORDS glutamic acid decarboxylase. SOURCE Rat brain and hypothalamus, cDNA to mRNA, (libraries of D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and pCR22HP. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3198) AUTHORS Wyborski,R.J., Bond,R.W. and Gottlieb,D.I. TITLE Characterization of a cDNA coding for rat glutamic acid decarboxylase JOURNAL Mol. Brain Res. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.I.Gottlieb, 23-MAY-1990. FEATURES from to/span description pept 184 1965 glutamic acid decarboxylase mRNA < 1 3198 glutamic acid decarboxylase mRNA signal 3174 3179 polyA signal BASE COUNT 869 a 742 c 767 g 820 t ORIGIN 1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca 61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc 121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag 181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat 241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc 301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag 361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa 421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct 481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg 541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac 601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac 661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt 721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta 781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc 841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca 901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc 961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc 1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg 1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt 1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat 1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca 1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg 1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc 1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa 1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag 1501 cagtatgacg tctcctacga tacaggggac aaggcgattc agtgtggccg ccatgtggac 1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc 1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt 1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa 1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc 1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg 1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc 1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc 1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca 2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg 2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa 2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat 2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc 2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca 2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag 2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta 2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca 2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac 2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag 2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt 2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta 2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt 2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt 2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc 2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa 3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta 3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt 3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa 3181 taaaaaatat tctccatg // LOCUS RATTSHRA 5270 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Rat thyroid stimulating hormone receptor mRNA, complete cds. ACCESSION M34842 KEYWORDS TSH receptor; thyroid stimulating hormone receptor; transmembrane protein. SOURCE Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5270) AUTHORS Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C., McBride,O.W. and Kohn,L.D. TITLE Cloning, chromosomal assignment and regulation of the rat thyrotropin receptor by thyrotropin, agents which increase cAMP levels and thyroid autoantibodies JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.D.Kohn, 05-JUN-1990. FEATURES from to/span description pept 55 2349 thyroid stimulating hormone receptor precursor sigp 55 117 thyroid stimulating hormone receptor signal peptide matp 118 2346 thyroid stimulating hormone receptor signal 5256 5261 poly-A signal BASE COUNT 1415 a 1343 c 1176 g 1336 t ORIGIN 1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg 61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga 121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag 181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact 241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat 301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa 361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta 421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct 481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct 541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg 601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg 661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga 721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct 781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag 841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac 901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat 961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc 1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag 1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag 1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac 1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca 1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg 1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc 1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc 1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac 1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt 1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc 1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg 1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc 1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac 1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg 1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt 1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc 1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg 2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc 2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag 2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa 2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt 2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca 2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa 2401 catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct 2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc 2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac 2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg 2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag 2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga 2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat 2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc 2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat 2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt 3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta 3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat 3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac 3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga 3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt 3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa 3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct 3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt 3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt 3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata 3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt 3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac 3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat 3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga 3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat 3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag 3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc 4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc 4081 tgtgaagtag ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg 4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa 4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga 4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag 4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc 4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg 4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc 4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct 4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg 4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca 4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa 4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta 4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga 4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg 4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc 4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg 5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca 5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt 5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg 5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg // LOCUS STMBLIPA 840 bp ds-DNA BCT 08-AUG-1990 DEFINITION S.clavuligerus beta-lactamase inhibitory protein gene, complete cds. ACCESSION M34538 KEYWORDS beta-lactamase inhibitory protein. SOURCE S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP. ORGANISM Streptomyces clavuligerus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 840) AUTHORS Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E. TITLE Isolation and characterization of a beta-lactamase inhibitory protein from Streptomyces clavuligerus and cloning and analysis of corresponding gene JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.E.Jensen, 25-MAY-1990. Dept. of Microbiology University of Alberta Edmonton, Alberta Canada T6G 2E9 FEATURES from to/span description pept 67 672 beta-lactamase inhibitory protein precursor (bliP) sigp 67 174 beta-lactamase inhibitory protein signal peptide matp 175 669 beta-lactamase inhibitory protein binding 52 63 ribosomal binding site signal 686 724 transcription termination signal BASE COUNT 140 a 248 c 320 g 132 t ORIGIN 1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga 61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc 121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg 181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc 241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc 301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc 361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg 421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg 481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg 541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc 601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg 661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc 721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg 781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc // LOCUS TRBPARPA 763 bp ss-mRNA INV 08-AUG-1990 DEFINITION T.brucei procyclin (pPS-1) mRNA, complete cds. ACCESSION M34790 KEYWORDS procyclic acidic repetitive protein; procyclin. SOURCE T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone pPS-1. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 763) AUTHORS Dorn,P.L., Aman,R.A. and Boothroyd,J.C. TITLE Inhibition of protein synthesis results in super-induction of procyclin RNA JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Boothroyd, 04-JUN-1990. Stanford University School of Medicine Dept of Microbiology and Immunology Sherman Fairchild Science Building Stanford, CA 94305-5402 USA FEATURES from to/span description pept 29 466 procyclin mRNA < 1 763 procyclin mRNA rpt 215 393 12 bp degenerate repeats BASE COUNT 192 a 215 c 156 g 200 t ORIGIN 1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct 61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc 121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg 181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga 241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga 301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga 361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc 421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt 481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc 541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt 601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc 661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag 721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt // LOCUS W10PPIBCA 4761 bp ds-DNA BCT 08-AUG-1990 DEFINITION Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes, complete cds, and ippD gene, 3' end. ACCESSION M34849 KEYWORDS antigen. SOURCE Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA, clones HB101/pHS[4108,5357]. ORGANISM Plasmid pWR100 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. TITLE Nucleotide sequence of the invasion plasmid antigen B and C genes (ipaB and ipaC) of Shigella flexneri JOURNAL Microb. Pathog. 4, 345-357 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 538 and 3987 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. JOURNAL Unpublished (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Baudry, 05-JUN-1990. Center for Vaccine Development University of Maryland at Baltimore 10 South Pine Street Baltimore, MD 21201 FEATURES from to/span description pept 606 1073 polypeptide I (ippI) pept 1079 2821 polypeptide B (ipaB) pept 2841 3932 polypeptide C (ipaC) pept 3983 > 4761 ipaD protein pept < 1 549 unidentified ORF binding 595 600 ippI ribosomal binding site binding 1065 1070 ipaB ribosomal binding site binding 2829 2834 ipaC ribosomal binding site binding 3971 3977 ipaD ribosomal binding site signal 556 561 ippI -10 region signal 2723 2728 ipaC -35 region signal 2745 2750 ipaC -10 region BASE COUNT 1769 a 952 c 805 g 1235 t ORIGIN 1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat 61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat 121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct 181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca 241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat 301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc 361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg 421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca 481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat 541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag 601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa 661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca 721 tttattcata tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct 781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta 841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat 901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag 961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat 1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat 1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga 1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac 1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa 1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct 1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa 1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga 1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat 1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat 1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga 1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca 1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat 1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa 1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg 1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc 1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga 1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt 2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt 2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca 2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc 2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc 2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac 2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac 2401 cctcacagac cttataccaa agtttctcaa gaatttttct tctcaactgg acgatttaat 2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc 2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc 2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa 2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga 2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc 2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg 2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata 2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc 2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat 3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg 3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag 3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag 3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga 3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc 3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa 3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc 3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa 3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc 3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt 3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc 3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca 3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca 3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc 3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag 3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg 3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc 4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa 4081 aacaacgacc agttctcatc ctgtaagttc ccttactatg ctcaacgaca cccttcataa 4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac 4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc 4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt 4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga 4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata 4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag 4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa 4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc 4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac 4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat 4741 agacaatatg ttaaaaagct t // LOCUS YEPOMPHA 824 bp ss-mRNA BCT 08-AUG-1990 DEFINITION Y.enterocolitica cationic outer membrane protein (ompH) gene, complete cds. ACCESSION M34854 KEYWORDS cationic outer membrane protein; enterobacterial outer membrane protein; lipopolysaccharide binding protein; secretory protein; structural membrane protein. SOURCE Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 824) AUTHORS Hirvas,L., Koski,P. and Vaara,M. TITLE The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and comparison to known ompH sequences JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.H.Hirvas, 06-JUN-1990. University of Helsinki FEATURES from to/span description pept 324 818 ompH protein precursor (gtg start codon) sigp 324 392 ompH protein signal peptide matp 393 815 ompH protein binding 308 317 ompH ribosomal binding site signal 120 125 ompH -35 region signal 143 148 ompH -10 region BASE COUNT 257 a 147 c 198 g 221 t 1 others ORIGIN 1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg 61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg 121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt 181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt 241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa 301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag 361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa 421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag 481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc 541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga 601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc 661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta 721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta 781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac // LOCUS YSCEXG1A 2056 bp ds-DNA PLN 08-AUG-1990 DEFINITION S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase (EXG1) gene, complete cds. ACCESSION M34341 KEYWORDS 1,3-glucan glucanohydrolase; exo-1,3-B-glucanase. SOURCE S.cerevisiae (strain AB320) DNA, clone YEp13. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2056) AUTHORS Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A., Nebreda,A.R., Mendez,E. and del Rey,F. TITLE Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1, in the yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.del Rey, 17-MAY-1990. Instituto de Microbiologia bioquimica Facultad de Biologia CSIC/Universidad de Salamanca 37008 Salamanca, SPAIN FEATURES from to/span description pept 363 1709 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase precursor (EXG1, EC 3.2.1.58) sigp 363 482 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase signal peptide matp 483 1706 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase mRNA 265 > 1709 EXG1 mRNA signal 153 159 TATA box BASE COUNT 610 a 395 c 396 g 655 t ORIGIN Chromosome XII-right arm. 1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg 61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac 121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa 181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat 241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt 301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta 361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta 421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa 481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg 541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg 601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg 661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg 721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc 781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca 841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt 901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca 961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat 1021 acttggacac tgttattggt atcgaattga ttaatgagcc attgggtcct gttctagaca 1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca 1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact 1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct 1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg 1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga 1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg 1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt 1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct 1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg 1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc 1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat 1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt 1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa 1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta 1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat 1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt 2041 gagcttttcc gccggc // LOCUS ACCPCACGH 2391 bp ds-DNA BCT 08-AUG-1990 DEFINITION A.calcoaceticus gamma-carboxymuconolactone decarboxylase (pcaC) and protocatechuate 3,4-dioxygenase alpha and beta subunit (pcaGH) genes, complete cds. ACCESSION M33798 KEYWORDS gamma-carboxymuconolactone decarboxylase; protocatechuate 3,4-dioxygenase. SOURCE A.calcoaceticus DNA, clone pZR2. ORGANISM Acinetobacter calcoaceticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 2391) AUTHORS Hartnett,C., Neidle,E.L., Ngai,K.-L. and Ornston,N. TITLE DNA sequences of genes encoding Acinetobacter calcoaceticus protocatechuate 3,4-dioxygenase: Evidence indicating shuffling of genes and of DNA sequences within genes during their evolutionary divergence JOURNAL J. Bacteriol. 172, 956-966 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 324 728 gamma-carboxymuconolactone decarboxylase (pcaC) (EC 4.1.1.44) pept 750 1463 protocatechuate 3,4-dioxygenase, alpha subunit (pcaH) (EC 1.13.11.3) pept 1491 2120 protocatechuate 3,4-dioxygenase, beta subunit (pcaG) (EC 1.13.11.3) ORF < 1 203 ORF (AA at 3) binding 312 315 ribosomal binding site (put.) binding 737 740 ribosomal binding site (put.) binding 1478 1482 ribosomal binding site (put.) BASE COUNT 670 a 463 c 533 g 725 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttatca aacccaacgt tacttgctct gtttcatttt atgtgcaggt attgccgtca 61 atggtgctca atcttcaatg ccagtattga gtgcgcgttt ctatccaacg caatgtcgtg 121 caacaggtgt ggcatggatg tctggaattg gccgttttgg tcgccgtatt tggtgcatgg 181 attggtgcag ttttattagg taataactgg tcatttacca tgatcctgag catgttgata 241 tcctgctcgt gccgctgcaa ttgcaatttt tgttaaatcc cttgttggca catacagatg 301 caacttgatg tgaggtttct cttatgaatg atgaacaacg ttataaacaa gggctggaag 361 tccgtacaga agttctgggt gaaaagcatg ttaaccgttc acttgagaac ctaaacgact 421 tcaatcagga ttttcaaaac tttattagcc gttttgcatg gggagaggtt tggtctcgtc 481 ctggacttcc tcgtcatacg cgtagtttgg tcacaattgc agtgttgttg gcgcttgggc 541 gtgaagatga gctgcgtatc gatttacgtg cctgtttcaa caatggtgtg accaaagatg 601 aattaaaaga gttgattcta cattgttcac tttatgcagg cttacccgct tcaaatgctg 661 cgatgcatat ggcagaagag gtctttaagg acttgggaat tgcacctgaa aaagtaaaca 721 aggattaatg gatttagagg gagtagaaaa tgtctcaaat tatttgggga gcttacgctc 781 agtcgaatac cgaagatcat ccgccagctt atcgaccagg ttataaaacc agtgtgttac 841 gctcgcctaa gaatcgactc atttctattg cagaaacttt aagtgaagtc actgcaccac 901 attttagtgc tgataaattt ggtccaaaag acaatgactt aatcttgaac tatgccaaag 961 atggtttgcc gattggtgaa cgtgtcattg tgcatggtta tgtacgtgac cagtttggtc 1021 gacctgtaaa aaatgcactt gttgaagtat ggcaagccaa tgcatctggt cgttatcgtc 1081 atccaaatga tcaatatatc ggtgccatgg atcctaattt cggtggttgt ggccgtatgt 1141 tgactgatga taatggttat tatgtattcc gtacgattaa accaggtcca tacccatggc 1201 gtaaccgtat caatgaatgg cgtcctgctc acattcactt ctctttaatt gcagatggtt 1261 gggcacaacg tttaatttcg cagttctatt ttgaaggcga tacattgatt gattcttgcc 1321 caatcctgaa aaccattcct tctgagcaac aacgtcgtgc gctaattgca ttagaagaca 1381 agagcaattt cattgaagct gacagccgtt gttatcgctt tgacattacg cttcgtggtc 1441 gcgcggactt acttcgaaaa tgacttaact taatggatgg agtaaaaaac atgaatggtt 1501 ggaattttca ggaactcaaa gaaaccccat ctcaaacagg tggtccatat gtccatattg 1561 gtcttttgcc aaaacaagcc aatatcgaag tatttgaaca caatttaggt aacaaccttg 1621 tacaggacaa tacgcaaggt caacgcatcc gattagaagg tcaagtgttt gatgggctta 1681 gtttacctct acgtgacgta ctgattgaaa tctggcaagc agataccaat ggcgtttacc 1741 caagtcaagc tgatactcaa ggtaaacaag tcgatccgaa ctttttgggt tggggccgta 1801 caggtgcaga ttttggtaca ggcttctgga gttttaatac cattaagcca ggtgctgttc 1861 caggccgtaa aggttcaact caagcaccac atatttcact gattattttt gcgcgtggta 1921 tcaacattgg tcttcacact cgtgtctatt ttgatgacga agcagaagca aatgcaaaag 1981 atcctgtttt aaacagtatc gagtgggcga cacgtcgtca aacactggtt gccaaacgtg 2041 aagagcgtga tggtgaagtg gtttatcgtt ttgatattcg tattcaaggt gaaaacgaaa 2101 cggtattttt cgatatctaa atttttgact atggaacggg ccgtatatct ttcggtatac 2161 agcctccata attttacaag gaaacgtaaa tgttcaatac aaacaaactg gttaaacaac 2221 tttgtctaag tggtgcgttg ttgatgtcat tgcctatatt gtcaacgacc tatgcggctg 2281 aaaatactgt accagcatct aaatcaactt atgttgtaaa aaatttaaac attggtgatt 2341 tacctgttaa aaccctcgtg ccaatcactg ctaaaacaag agagcaagct t // LOCUS HAMSAA3A 530 bp ss-mRNA ROD 08-AUG-1990 DEFINITION M.auratus serum amyloid A (SAA3) mRNA, complete cds. ACCESSION M33431 KEYWORDS serum amyloid A protein. SOURCE M.auratus (strain LVG) embryo fibroblast, cDNA to mRNA, clone pAA825. ORGANISM Mesocricetus auratus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 530) AUTHORS Gervais,C. and Suh,M. TITLE Serum amyloid A protein-related mRNA expression in herpes simplex virus type 2-transformed hamster cells JOURNAL Mol. Cell. Biol. 10, 4412-4414 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Suh, 30-MAR-1990. FEATURES from to/span description pept 42 410 serum amyloid A signal 510 515 polyA signal BASE COUNT 147 a 122 c 140 g 121 t ORIGIN 1 ggggacctct agagtcactc accaggcatc tcgaaagcag gatgaagcct ttccttgcca 61 tcattttttg cttcttgatc ctgggagttg acagccaaag gtggttccag ttcatgaaag 121 aagctggcca agggagtaca gacatgtgga gagcttactc tgacatgaga gaggccaact 181 ggaaaaactc agataaatac ttccatgccc gggggaacta tgatgctgcc aaaaggggac 241 caggaggagc ctgggctgct aaagtgatca gtgatgccag agagggtatt cagaggttca 301 caggacgtgg agcagcggac tcaagagctg accagtttgc caacaaatgg ggccggagtg 361 gtaaagaccc caaccacttt cgacctgctg gcctgcctag taaatactga gtttcatctt 421 catgctattc ccagccatgc accccaaggt tgcatttact agattaagtt agttcctgag 481 ctctggacgt gtaagcaccc gaaatttaga ataaatacct gtgaaatgct // LOCUS HEIBEXBA 795 bp ds-DNA BCT 08-AUG-1990 DEFINITION H.influenza (strain RM153) capsulation protein (bexB) gene, 5' end. ACCESSION M33787 KEYWORDS capsulation protein. SOURCE H.influenza (type b, strain RM153) DNA. ORGANISM Haemophilus influenzae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 795) AUTHORS Kroll,J.S. and Moxon,E.R. TITLE Capsulation in distantly related strains of Haemophilus influenzae type b: genetic drift and gene transfer at the capsulation locus JOURNAL J. Bacteriol. 172, 1374-1379 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 795 capsulation protein (bexB) BASE COUNT 188 a 133 c 177 g 297 t ORIGIN 1 atgcaatatg gtgataaaac aacatttaaa cagtcattag ctattcaagg acgggtgatt 61 aacgcgttgc ttatgcgtga aataatcact cgttatgggc gtcaaaatat tggctttttt 121 tggctatttg ttgagccttt gctcatgact ttttttattg taatgatgtg gaaatttatc 181 cgtgctgata aattttctac tttaaatatg attgcctttg tgatgacggg gtatccaatg 241 gcaatgatgt ggcgaaatgc ttcaaaccgt gcgattggtt ccatttctgc aaacttgagt 301 ttactttatc accgaaatgt tcgagtgcta gataccattt ttactcgcgt gttgcttgag 361 gttgctggtg cttctattgc tcaaattctt tttatggcaa tcttagtcat gattgattgg 421 attgatgcac cgcatgatgt gttttatatg ctcattgcat ggttcctcat ggcaatgttt 481 gcttttggct tggggttaat catttgtgcg attgctcagc aatttgatgt gtttggtaaa 541 atttggggaa cactcagttt cgttttatta ccaatttctg gcgcattctt cttcgtccat 601 aaccttccag cacaggcaca atctattgca ctttggtttc cgatgattca tggtacagaa 661 atgttccgcc atggctattt tggcgataca gtggttactt acgaaagtat aggctttttg 721 gttgtgagtg atctagcctt gttgttactg ggcttagtga tggttaaaaa ctttagtaaa 781 ggagttgagc cacaa // LOCUS HEIBEXBB 795 bp ds-DNA BCT 08-AUG-1990 DEFINITION H.influenza (strain RM926) capsulation protein (bexB) gene, 5' end. ACCESSION M33788 KEYWORDS capsulation protein. SOURCE H.influenza (type b, strain RM926 ATCC 8468) DNA, clones lambda-926.[8,11,14]. ORGANISM Haemophilus influenzae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Pasteurellaceae. REFERENCE 1 (bases 1 to 795) AUTHORS Kroll,J.S. and Moxon,E.R. TITLE Capsulation in distantly related strains of Haemophilus influenzae type b: genetic drift and gene transfer at the capsulation locus JOURNAL J. Bacteriol. 172, 1374-1379 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 > 795 bexB protein BASE COUNT 188 a 128 c 178 g 301 t ORIGIN 1 atgcaatatg gtgatcaaac aacgtttaaa cagtcattag cgattcaagg aagagtgatt 61 aatgcattac ttatgcgtga aataatcact cgttatgggc gcaaaaatat tggctttcta 121 tggctatttg ttgagccttt actcatgact ttttttattg tgatgatgtg gaaatttatt 181 cgtgctgata aattttcaac tttaaatatg attgcttttg tcatgaccgg gtatcctatg 241 gcgatgatgt ggcgaaatgc ttctaatcga gctattggtt cgatttctgc aaacttgagt 301 ttactttatc accgaaatgt tcgagtgcta gatactatct ttactcgagt attgcttgag 361 gttgcaggtg cttcgattgc tcaaatcctt tttatggcag ttttagtgct tattggttgg 421 attgatgcac cacgtgatgt gttttatatg cttatggcat ggtttctcat ggctatgttt 481 gcttttgcct taggccttat tatttgtgcg gtagctcaac aatttgatgt gttcggtaaa 541 atttggggaa ccttaagttt cgtattgtta ccaatttctg gtgcattttt ctttgttcac 601 aaccttccat cacaggctca atctatcgcg ctttggctcc caatgattca tggtacggaa 661 atgtttcgtc atggctactt tggggatacg gtcgttactt atgagagtat cggattttta 721 gtcgtaagtg atttggcctt attgctaatg ggcttggtga tggttaaaaa ctttagtaag 781 gggattgagc cacaa // LOCUS PSEARCDA 1502 bp ds-DNA BCT 08-AUG-1990 DEFINITION P.aeruginosa membrane protein arcD (complete cds) and arcA (5' end) genes. ACCESSION M33223 KEYWORDS membrane protein. SOURCE P.aeruginosa (strain PAO1) DNA, clone pME183. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1502) AUTHORS Luethi,E., Baur,H., Gamper,M., Brunner,F., Villeval,D., Mercenier,A. and Haas,D. TITLE The arc operon for anaerobic arginine catabolism in Pseudomonas aeruginosa contains an additional gene, arcD, encoding a membrane protein JOURNAL Gene 87, 37-43 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Haas, 26-MAR-1990. FEATURES from to/span description pept 24 1472 membrane protein (arcD) pept 1494 > 1502 membrane protein (arcA) binding 8 15 ribosome binding site (put.) BASE COUNT 247 a 507 c 443 g 305 t ORIGIN 1 attcctatag ggagattccg aaaatgtccc aagaatccag ccaaaaactc cgactaggag 61 cgctaacggc tctggttgtc ggctcgatga tcggaggggg catcttctcc ctcccgcaaa 121 acatggcagc cagcgccgat gtcggcgccg tcctgatcgg atgggcgatc accgccgtcg 181 gcatgctcac cctcgctttc gtcttccaga ccctcgccaa ccgcaagccc gaactcgacg 241 gcggcgtgta cgcctacgcc aaggccggct tcggcgacta catgggtttc tcctcggcct 301 ggggctactg gatcagcgcc tggctgggca acgtcggcta cttcgtcctg ttgttcagca 361 ccctcggcta cttcttcccg atcttcggca agggcgacac cgtcgcggcg atcgtctgcg 421 cctcggtcct gctctgggcc ctgcacttcc tggtgctgcg cgggatcaag gaggcggcgt 481 tcatcaacac cgtcaccacc gtggccaagg tcgtgccgct gttcctgttc atcctgatct 541 gcctgttcgc cttcaagctg gacatcttca ccgccgacat ctggggcaag agcaacccgg 601 acctgggcag cgtgatgaac caggtgcgca acatgatgct ggtcaccgtc tgggtgttca 661 tcggcatcga gggcgcgagc atcttctcct cccgcgcgga aaaacgttcc gacgtcggca 721 aggccaccgt gatcggcttc atcaccgtcc tgctcctgct ggtgctggtc aacgtgctgt 781 ccatgggcgt gatgacccag ccggaactgg ccaagctgca gaacccgtcg atggcgctgg 841 tactcgagca tgtggtcggc cactggggcg ccgtgctgat cagcgtcggc ctgctgatct 901 cgctgctggg cgcgctgctc tcctgggtgc tgctgtgcgc cgagatcatg ttcgccgccg 961 ccaaggacca caccatgccg gagttcctgc gccgcgagaa cgccaaccag gtgccggcca 1021 acgccctgtg gctgaccaac atctgcgtac aggtgttcct ggtggtggtg ttcttcacct 1081 cgggcgaccc ggacggcatg gacccgtaca ccaagatgct gctcctggcc acctcgatga 1141 tcctgatccc gtacttctgg tccgccgcct atggcctgct gctgaccctg aagggcgaga 1201 cctacgagaa cgatgcccgc gaacgcagca aggacctggt catcgccggc atcgccgtgg 1261 cctacgcggt ctggctgctc tacgcggggg gcctgaagta cctgctgctg tccgccctgc 1321 tgtatgcgcc cggcgcgatc cttttcgcca aggccaagca cgaggtcggc cagccgatct 1381 tcaccggaat cgaaaaactg atcttcgccg cagtcgtcat tggcgccctg gtggctgcct 1441 acggcctcta cgacggtttc ctcaccctct gattactcca ctggagacgt gaaatgagca 1501 cg // LOCUS HUMSPTB 6765 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human beta-spectrin (SPTB) mRNA, complete cds. ACCESSION J05500 KEYWORDS beta-spectrin; spectrin. SOURCE Human fetal liver, cDNA to mRNA, clones beta-[28,21A,29,286] and V252. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 6765) AUTHORS Winkelmann,J.C., Chang,J.-G., Tse,W.T., Scarpa,A.L., Marchesi,V.T. and Forget,B.G. TITLE Full length sequence of the cDNA for human erythroid beta-spectrin JOURNAL J. Biol. Chem. 265, 11827-11832 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Winkelmann, 08-MAY-1990. FEATURES from to/span description pept 96 6509 beta-spectrin /nomgen="SPTB" /map="14" /hgml_locus_uid="LS0033T" mRNA < 1 6765 SPTB mRNA signal 6716 6722 poly-A signal BASE COUNT 1626 a 1822 c 2146 g 1171 t ORIGIN Chromosome 14q23-q24. 1 cgccaccccc gggctcgggt ggccccgctt cagtcccagg gcagggatcc ttccatgaag 61 actgaggcag gcggagctgc taagagcctg ctgacatgac atcggccaca gagtttgaaa 121 atgtgggcaa ccagccacct tacagcagga tcaatgcccg ctgggacgcc ccagacgacg 181 agctggataa tgacaatagc tcagccaggc tctttgagag gtcccggata aaggccttgg 241 cagatgagcg ggaagttgtt cagaaaaaga ccttcacgaa atgggtgaac tcgcacctgg 301 ctcgagtgtc ctgccgcatc accgatctct acaaggacct gcgggatggg cgcatgctca 361 tcaagctgct ggaggtgctc tctggagaga tgctgccaaa gcccaccaag gggaagatgc 421 gcatccactg cctggagaat gtggacaagg ctctccagtt cctcaaggag cagcgtgtac 481 acctggagaa catgggctcc catgacattg tagatggcaa ccaccgcctg gtcctgggcc 541 tcatctggac catcatcctc cgcttccaga ttcaggacat tgtggtccaa actcaggaag 601 gtcgtgaaac acgctcagcc aaggatgcgt tgctgttgtg gtgtcagatg aagacggcag 661 gctaccctca tgttaatgtc accaacttta cctccagctg gaaggatggc ttggccttta 721 atgccctgat acacaagcac cggcccgacc tgatcgactt tgataagctg aaggactcca 781 atgcccggca caacctggag cacgcattca atgtggctga gcgccagctg ggcatcatcc 841 cgctcctcga ccccgaagat gtctttacgg aaaaccctga tgagaaatcc atcatcacct 901 atgtggtggc cttttaccac tacttctcca agatgaaggt gctggcagtg gagggcaagc 961 gtgtcggcaa ggttattgac catgccattg agactgagaa gatgattgaa aagtacagcg 1021 ggctagcctc ggacctgctc acctggatcg agcagaccat cactgtcctg aacagccgca 1081 agtttgccaa ctcgctgacg ggcgtccagc agcagctgca ggccttcagc acctaccgca 1141 ccgtggagaa gccgcccaag tttcaagaga aggggaatct ggaagttcta ctttttacca 1201 tccagtcccg gatgagagcc aacaatcaga aagtgtacac accccacgat gggaaactag 1261 tgtctgacat caacagggcc tgggaaagcc tggaggaagc tgggtatcgg cgggagctgg 1321 ccctgagaaa tgagctcatt cggcaggaga agctagagca actagcccgg cgctttgacc 1381 ggaaggccgc aatgagagag acctggctca atgaaaacca gcgcctcgtg gcccaggata 1441 actttgggta tgacctggca gctgtggagg ccgccaagaa gaagcatgag gccatcgaga 1501 ccgacacggc tgcctacgag gagcgggtga gagccctgga ggacctggct caggagctgg 1561 agaaagagaa ctaccatgac cagaagcgca tcacggcccg caaggacaat atactgcgcc 1621 tatggagcta cctgcaggag ctgctgcagt cccggcgcca gaggctcgag accaccctgg 1681 cactgcagaa gctcttccag gacatgctgc acagcatcga ctggatggat gagatcaagg 1741 ctcacctctt gtctgccgag tttgggaagc acttgttgga ggttgaagac ctgctacaga 1801 agcacaagtt gatggaagct gacatcgcca tccaagggga caaagtgaag gccatcaccg 1861 cagccaccct gaagttcacc gaggggaaag ggtaccagcc ttgtgacccc caggtcatcc 1921 aggaccgcat gagccacttg gagcagtgct ttgaggagct gagcaacatg gcagctggcg 1981 caaggaccca actggagcag tccaaacgac tctggaagtt cttctgggag atggatgagg 2041 ctgagagctg gatcaaggag aaggagcaga tctattcttc cctggactat ggcaaagacc 2101 tgaccagtgt gctcatctta cagcgcaagc acaaggcctt tgaggatgag ctccgtgggc 2161 tggatgctca cctggagcag atcttccagg aggctcatgg catggttgcg cgcaagcagt 2221 ttgggcaccc gcagatcgag gcccgcatca aggaggtgtc ggcacagtgg gaccagctga 2281 aggacctggc tgccttctgc aagaagaacc tccaggatgc tgagaacttt ttccagttcc 2341 agggcgatgc ggatgacctg aaggcttggc tgcaagacgc ccaccggctg ctctctggtg 2401 aagatgtggg gcaggacgaa ggggccacgc gggccctggg gaaaaagcac aaggacttcc 2461 tggaggagct ggaggagagc cgtggggtga tggagcacct ggagcagcag gcccagggat 2521 tccccgaaga gtttcgggat tccccagatg tgacccatcg gctgcaggcc ctgcgggagc 2581 tctaccaaca ggtggtggcc caggcggacc tgcgtcagca gaggctgcag gaagccctgg 2641 acctgtacac ggtgttcggg gagacagacg cctgtgagct gtggatggga gagaaggaga 2701 agtggctggc cgagatggaa atgccagaca ccctggagga cctggaggtc gtgcagcaca 2761 ggttcgacat cctggaccag gagatgaaga ccttgatgac tcagattgat ggtgtgaacc 2821 tcgctgccaa cagcttggta gagagtggcc acccacgcag cagggaggtg aagcagtacc 2881 aggaccatct gaacaccagg tggcaggcat ttcagaccct ggtgtcggag cggcgggagg 2941 ctgtggactc agccctccga gtgcacacac tatgcgtaga ttgcgaggag accagcaagt 3001 ggatcacgga caagacaaag gtagtggagt ccacaaaaga cctggggcgg gacctggcag 3061 gtatcatcgc catccagagg aagttgtcag ggctggagcg tgacgtggcc gccatccagg 3121 cccgtgtgga tgccctggag cgtgagtccc agcagctgat ggactcgcac cctgagcaga 3181 aggagaatat tggtcagcgg caaaaacact tggaggagct gtggcagggc ctgcagcaat 3241 ccctgcaggg ccaggaggac ttgctggggg aagtcagcca gctgcaggcc ttcctgcagg 3301 atctggatga cttccaggcc tggctctcca tcacccagaa agctgtggcc tctgaggaca 3361 tgcccgaatc cctcccagag gctgagcagc tcctgcagca gcatgcaggt atcaaggatg 3421 agattgacgg gcaccaagac agctaccagc gtgttaagga gtctggggag aaagtgatcc 3481 aaggccagac ggacccagag tatctgcttc tgggccagcg gctggagggc ctggatactg 3541 gctgggatgc cctgggcagg atgtgggaga gccgcagcca caccctcgct cagtgccttg 3601 gcttccagga gttccagaaa gatgccaagc aggctgaagc catcctcagc aaccaggaat 3661 acactctggc tcacttggag cccccagact ccctggaagc tgcagaggct gggatccgga 3721 agtttgagga tttcttgggg tctatggaga acaaccggga taaggtcttg agtcctgtgg 3781 actctggaaa caagctggta gctgagggaa acctatactc agacaagatc aaggagaagg 3841 tgcagctgat tgaggacagg cacaggaaga acaacgagaa ggcccaggag gcctctgtcc 3901 tactgagaga caacctggag ctacagaact tcctccagaa ctgccaggag ctcactctct 3961 ggatcaacga caagctgctg acatctcagg atgtctccta tgatgaagca cgaaaccttc 4021 acaataaatg gctaaagcac caggcgtttg tggcagagct ggcttcccat gaagggtggc 4081 tagagaacat cgatgcggaa ggaaagcagc tgatggatga gaagccccag tttacagccc 4141 tggtgtccca aaagctggaa gccctgcacc ggctctggga cgagctgcag gccaccacaa 4201 aggagaagac ccagcacctc tcggctgcca ggagctccga cctgcgcttg cagacccatg 4261 ctgacctcaa caagtggatc agcgccatgg aggaccagct gcggtcagac gacccgggca 4321 aggacctgac cagtgtcaat cggatgttgg ctaagctgaa gcgagtggag gaccaagtga 4381 atgtgcggaa agaggagctg ggggagctgt ttgcccaggt gccttcaatg ggagaggagg 4441 gaggagatgc agacttgagc atcgagaagc ggttcctgga cctcctggaa cccctaggaa 4501 ggaggaagaa gcagctggaa tcatccagag ccaagctgca gatcagccgg gacttagagg 4561 atgagacgct ttgggtggag gagaggctgc ctctggccca gtcagccgac tatggcacta 4621 atctgcaaac tgtgcaactg ttcatgaaga agaaccagac actgcagaat gagattctgg 4681 gccatacgcc gcgggttgag gatgtgctgc agagagggca gcagctggtg gaggcggcgg 4741 agatcgactg ccaggacctt gaggagcgcc tggggcacct gcagagctcc tgggacaggc 4801 tgcgggaggc agcggccggg aggctgcagc gactgaggga cgccaatgag gcacagcagt 4861 actacctgga tgcggacgag gctgaggcct ggattggcga gcaggagctc tatgtcatct 4921 ccgatgagat ccccaaggat gaagagggcg ccatcgtgat gctgaagcga catttgcggc 4981 agcagcgtgc ggtggaggac tacggccgga acatcaagca gctggccagc cgggcccagg 5041 gcctgctgtc tgcaggccac cctgaggggg aacagatcat cagacttcag gggcaagtgg 5101 acaagcacta cgcagggctg aaggacgtgg cggaagagcg caagcgcaag ctggagaaca 5161 tgtaccacct gttccagctc aagcgggaga ccgacgacct ggagcagtgg atttcagaaa 5221 aggagctagt ggcctcttcc ccggaaatgg ggcaagactt tgaccacgtg actcttctgc 5281 gggacaagtt ccgggacttt gcccgggaga ccggggcgat tgggcaggag cgggtggaca 5341 atgtgaatgc cttcatcgag cgactcatcg acgcgggcca cagcgaggcg gccaccatcg 5401 ccgagtggaa ggacgggctg aacgagatgt gggcagacct cctggagctc attgacacgc 5461 gcatgcagct gctggccgcc tcctatgacc tgcaccgcta cttctacacg ggtgccgaga 5521 tcctgggcct catcgacgag aagcaccgcg agctgcccga ggacgtgggg ctggacgcca 5581 gcacggccga gtccttccac cgggtgcaca cagccttcga gcgggacgtt cacctgctgg 5641 gtgtccaggt gcagcagttc caggacgtgg ccacccgtct gcagacagca tatgctgggg 5701 agaaggcaga ggccatccag aacaaggagc aggaggtgtc tgccgcgtgg caggcgctgc 5761 tcgatgcctg tgccgggcgc cggacccagc tagtggacac ggcggataaa ttccgcttct 5821 tcagcatggc ccgtgacctc ctctcctgga tggagagcat catccggcag atcgagaccc 5881 aggagaggcc cagggatgtc tcctctgtgg aactgctcat gaagtatcac cagggcatca 5941 atgcagagat tgaaacccgg agcaagaact tcagtgcctg cctggagctt ggcgagtccc 6001 tgctgcagcg gcagcaccag gcctcagagg agatccgcga gaaactgcag caggtgatgt 6061 ccaggaggaa agagatgaat gagaagtggg aagcccgctg ggagcggctc cgcatgttgc 6121 tggaggtgtg ccagttctcg agggatgcct ctgtggctga ggcgtggctg attgcccagg 6181 agccctacct ggccagcggg gactttggac acacagtgga cagtgtggag aagctcatca 6241 agaggcatga ggcttttgag aagtccacgg ccagctgggc agagcgcttt gctgccctgg 6301 agaagcccac cacgcttgag ctgaaagaac gccagattgc agagagaccc gcagaggaga 6361 ctgggcctca agaggaggaa ggcgagacag caggggaggc tccagtttcc caccatgcgg 6421 ccaccgagag aacgtccccg gtcagtctct ggtctcgttt gtctagttcc tgggagtcac 6481 tgcagccaga gccctctcac ccctactagc tcagcccagg tggaggcgag atgagctgcg 6541 cagccccgcc ctccatcctc cccacatccc tgcagccacc tcccagcaga gcaggctacg 6601 tcctcactga ggtgttcttc atgagagtac tagcctcctc cactcctccc cacagcgcag 6661 aggaaacagg ccagcccagt gacatgacgt tattagtttt gttttacctg aatgtaataa 6721 attttattgt ataaatatat caccatttac atgaggggaa acact // LOCUS STYEUTBC 2526 bp ds-DNA BCT 08-AUG-1990 DEFINITION S.typhimurium ethanolamine ammonia-lyase (eutB, eutC) genes, complete cds. ACCESSION J05518 KEYWORDS ethanolamine ammonia-lyase. SOURCE S.typhimurium (strain LT2) DNA, clones pBSE4.5 and pUCE6.5. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2526) AUTHORS Faust,L.R.P., Connor,J.A., Roof,D.M., Hoch,J.A. and Babior,B.M. TITLE Cloning, sequencing, and expression of the genes encoding the adenosylcobalamin-dependent ethanolamine ammonia-lyase of Salmonella typhimurium JOURNAL J. Biol. Chem. 265, 12462-12466 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.M.Babior, 08-MAY-1990. FEATURES from to/span description pept 141 1499 ethanolamine ammonia-lyase (eutB) pept 1518 2378 ethanolamine ammonia-lyase (eutC) binding 130 133 ribosome binding site binding 1507 1510 ribosome binding site BASE COUNT 563 a 687 c 779 g 497 t ORIGIN 1 accgcaactt ccgctggcgg tcatcgatga ggtggtcgtg cgggcgggag actatatcga 61 cattggtacg cctctttttg gcggatcggt tgtgccggtg acgtgaaatc actcgcattt 121 ccttcctgag ggaacgactt atgaaactaa agaccacatt gttcggcaat gtttatcagt 181 ttaaggatgt aaaagaggta ctggctaaag ccaacgaact gcgttcgggg gatgtgctgg 241 ccggggttgc cgcggcaagt tcgcaggagc gcgtagcggc aaaacaggta ctgtcggaaa 301 tgacggtggc ggatatccgc aacaatccgg tgattgccta tgaagaggac tgcgtgacgc 361 gcctgattca ggacgacgtc aacgaaacgg cctataaccg gattaaaaac tggagcatca 421 gcgaactgcg tgaatacgtg ctgagcgatg aaacctccgt ggacgacatc gcgtttaccc 481 gcaaaggcct gacctccgaa gtggtggcgg cagtagcgaa aatctgctcc aacgctgacc 541 tgatctacgg cggcaagaaa atgccggtga tcaaaaaagc caataccacc atcggtattc 601 cgggcacctt tagctgccgt ttgcagccga acgatacccg tgacgatgta cagagtatcg 661 ccgcgcaaat ctacgaaggg ctttctttcg gcgcaggcga tgcggtgatc ggcgttaacc 721 cggtgaccga tgacgtggag aacctgaccc gcgtgctcga caccgtttac gcgttatcga 781 taaattcaat attccgaccc agggctgcgt gctggcgcac gtcaccaccc agatcgaagc 841 gattcgtcgc ggcgcccggg cggactgatt ttccagagca tttgcggcac gagaagggct 901 taaaagagtt cggcgtcgag ctggccatgc tcgacgaagc gcgggctgtg ggggcggagt 961 tcaaccgcat cgccggggaa aactgcctgt actttgaaac cgggcaaggg tctgcgctct 1021 ccgcaggcgc gaactttggt gccgaccagg tgacgatgga agcgcgtaac tacgggctgg 1081 cgcgccacta cgatccgttc ctggtgaaca ccgtggtggg ctttatcggg ccggagtatc 1141 tctacaacga caggcagatt atccgcgccg gtctcgaaga tcactttatg ggcaagctga 1201 gcggcatctc gatgggctgc gactgctgct ataccaacca tgccgacgcc gaccagaacc 1261 ttaacgaaaa cctgatgatt ctgctcgcca ctgccggctg taactacatc atggggatgc 1321 cgctcggcga cgacatcatg ctcaactacc agaccaccgc tttccacgat accgccaccg 1381 tccgtcagtt gctgaattta cggccgtcgc cggagtttga acgctggctg gaaacgatgg 1441 gcattatggc aaacggtcgt ctgaccaaac gggcgggcga tccgtcactg ttcttctgat 1501 gacgcgggga taacaccatg gatcaaaaac agattgaaga aattgtacgt agcgtgatgg 1561 cgtcaatggg acaggacgta ccgcagcccg ccgcgccgtc aacgcaggaa ggcgcaaagc 1621 cgcagtgcgc cgcgccgacg gtgaccgaaa cgtgcgcgct ggatttaggt tccgcggagg 1681 caaaagcctg gattggcgtc gagaacccac atcgtgcgga cgtgctgacc gaactgcgtc 1741 gcagtactgc ggcacgcgtc ttgtacgggg cgtgccgggc cgcgtccgcg cacccaggcg 1801 ctgttgcgtt cctggcggat cactcccgtt cgaaagatac cgtgctcaaa gaagtgccgg 1861 aagagtgggt aaaagcgcaa gggctgctgg aagtgcgttc ggaagagtgg gtaaaagcgc 1921 aagggctgct ggaagtgcgt tcggagatca gcgacaaaaa cctgtacctg acgcgcccgg 1981 atatggggcg tcgcctgagc ccggaagcca ttgacgcgct gaagtcacag tgcgtgatga 2041 acccggatgt gcaggtagtg gtctccgatg gcctctctac ggatgcgatc accgccaact 2101 atgaagagat cctgccgccg ttgcttgccg gtctgaagca ggccgggctg aacgtcggca 2161 cgccgttctt tgtgcgctat ggccgtgtga agattgaaga tcagattggc gaaattctcg 2221 gcgcgaaggt cgtcatcctg ctggtaggcg aacgtccggg gctggggcag tcggaaagcc 2281 tttcctgcta cgcggtctat tccccgcgcg tggcaccacc gtcgaggccg acagaacctg 2341 tatttcaaac attcatcagg gggggacgcc gccagtagaa gccgccgccg tgattgtgga 2401 tttggccaaa cggatgctgg agcatgaaag cgtccggcat caacatgtac ccggttaagg 2461 agacatcatg cctgcattag atttaattcg accttcacgt gactgccata gcgcgtgatt 2521 gcctcc // LOCUS XELPCNA 1018 bp ss-mRNA VRT 08-AUG-1990 DEFINITION X.laevis proliferating cell nuclear antigen (PCNA) mRNA, complete cds. ACCESSION M34080 KEYWORDS nuclear protein; proliferating cell nuclear antigen. SOURCE X.laevis oocyte, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1018) AUTHORS Leibovici,M., Gusse,M., Bravo,R. and Mechali,M. TITLE Characterization and developmental expression of Xenopus proliferating cell nuclear antigen (PCNA) JOURNAL Dev. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Leibovici, 08-MAY-1990. FEATURES from to/span description pept 28 813 proliferating cell nuclear antigen (PCNA) mRNA < 1 1018 PCNA mRNA BASE COUNT 284 a 223 c 237 g 274 t ORIGIN 1 ccgcagtaat cccttacagc cgccgccatg tttgaggctc gcttggtgca gggttccatc 61 ctgaagaagg tgttggaggc gctgaaggac ctaatcgatg aggcgtgctg ggacattaca 121 tccagcggca tcagcttgca gagcatggac tcctcgcacg tctccctggt tcaactcact 181 ctgcgatctg acggctttga cacctaccgg tgtgatcgca atcaatctat cggcgtcaag 241 atgagcagta tgtccaaaat cttgaagtgt gccgcaagtg acgatatcat tactctgagg 301 gcagaagaca atgctgatac agtcacaatg gtgtttgagt cgccaaatca agagaaagtt 361 tcagactatg aaatgaagct aatggacctt gatgtggagc agctgggcat tcctgaacaa 421 gagtacagct gtgtaataaa gatgccatct ggtgaatttg cacgtatctg ccgagatctc 481 agccagattg gtgacgcagt agtaatttct tgtgctaagg atggggtaaa gttctctgca 541 agcggagagc tgggaactgg aaatgtaaag ctgtcacaga cttcaaatgt ggataaagaa 601 gaggaagctg ttacaataga aatgaatgag ccagtacagc ttacatttgc tttgcggtat 661 ctgaacttct tcaccaaagc tacacccctg tccccaacag ttattctcag tatgtctgca 721 gatatcccac ttgttgtgga atacaaaatt gcagatatgg aacatgtgaa atactacctg 781 gctcccaaga ttgaagatga agaagcttct taatgtctga actagcttat tttataaacc 841 tcaactgaac gtccaatggc gctttcacac acctgccttg ttttaacagc tttggctgaa 901 cctacccaac ttgtaccaac tggctgtact tctaggcatg tctgtagata tttttgtaaa 961 tacgtcacga tttttgtaaa atctctgccc taggaggtca ataaatcttt gtaataac // LOCUS YSCAAC2A 1333 bp ds-DNA PLN 08-AUG-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC2) gene, complete cds. ACCESSION M34076 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1333) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator gene in yeast JOURNAL J. Biol. Chem. 265, 12711-12716 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. FEATURES from to/span description pept 235 1158 ADP/ATP-translocator protein (AAC2) BASE COUNT 388 a 209 c 301 g 435 t ORIGIN 1 ataacctgag gtgacgattt gaataagttt cctttttttt tttctttcat gttggttgcc 61 ttcaattaca tatagattct cgagaaggtt tccattgtcc tttcattagg cgttgaagtg 121 aatctaaagt gcgcttgaat gatttcagat agaaagacta aagaagtggt gtgagtataa 181 ttaactcaat tgaagacggt ttacctgaag tgatatactg tgccttgaga aacaatgagt 241 agcgacgcta agcaacaaga aacaaacttt gccattaatt tcttaatggg tggtgtgagt 301 gcggccatcg ctaaaactgc tgcctcacca atcgaaagag tcaagatctt gatccaaaat 361 caagatgaaa tgatcaagca aggaacttta gataaaaagt attccggtat cgtggattgt 421 ttcaagagaa ctgcaaagca agagggacta atatcctttt ggcgaggaaa tactgccaat 481 gttattcgtt attttcccac tcaagctttg aacttcgcct tcaaagataa gattaagttg 541 atgtttggtt tcaagaaaga ggaaggctat ggtaaatggt ttgcaggtaa tctggcttct 601 ggtggtgcag ctggtgctct ttcgttatta tttgtttatt ctttagattt tgccagaacc 661 agacttgctg ctgatgcaaa atcgtcgaaa aagggtggcg ctcgccaatt caatgggttg 721 actgatgttt ataaaaagac cttgaaatcg gatggtatcg caggattata cagaggattc 781 atgccatcag tagtgggtat cgtggtttat agaggactat atttcggtat gtttgattct 841 ctcaagccac tggtgctaac tggttcatta gatggttcat tcttggcttc atttttattg 901 ggatgggtgg tcactacagg tgcctcaaca tgttcttatc cattagacac agtgagaaga 961 agaatgatga tgacttcagg tcaagcagta aagtacaacg gtgctataga ttgtctcaaa 1021 aaaatcgtag cttctgaagg tgtagggtca ttgttcaaag gctgcggggc aaatatcttg 1081 agaagtgttg ctggagctgg tgttatttcc atgtatgacc agttgcaaat gatattgttc 1141 ggtaaaaaat tcaaatgatc agttggatga agaaaaaagt cattttctcg acttctcttc 1201 acctttcgat cgatttgatt ttggccgcca acttgtttat agaaaaaaaa tagtaggaag 1261 gttatgtatc gctttctttt attttttatt atagagtata actgaataaa tttgtaaatc 1321 agccactgtt gtt // LOCUS YSCAAC3 1308 bp ds-DNA PLN 08-AUG-1990 DEFINITION S.cerevisiae ADP/ATP-translocator protein (AAC3) gene, complete cds. ACCESSION M34075 J05542 KEYWORDS ADP/ATP translocase; ADP/ATP-translocator protein. SOURCE S.cerevisiae (strain W303-1B) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1308) AUTHORS Kolarov,J., Kolarova,N. and Nelson,N. TITLE A third ADP/ATP-translocator gene in yeast JOURNAL J. Biol. Chem. 265, 12711-12716 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Nelson, 08-MAY-1990. FEATURES from to/span description pept 78 1034 ADP/ATP-translocator protein (AAC3) BASE COUNT 353 a 228 c 263 g 464 t ORIGIN 1 atatttgtcg ttgttctttt ttgtgtgctc ttttatactt cagaatcata cattaacata 61 catataagca aatagccatg tcttccaacg cccaagtcaa aaccccatta cctccagccc 121 cagctccaaa gaaggaatct aactttttga ttgatttctt aatgggtggt gtcagtgccg 181 ctgtcgccaa aactgctgca tctcccatcg aaagagttaa acttttgatc caaaaccaag 241 atgaaatgat caagcaagga actttagata aaaagtattc cggtatcgtg gattgtttca 301 agagaactgc aaagcaagag ggactaatat ccttttggcg aggaaatact gccaatgtta 361 ttcgttattt ccccactcaa gctttgaact tcgccttcaa agataagatt aagttgatgt 421 ttggtttcaa gaaagaggaa ggctatggta aatggtttgc cggtaacttg gcatctggtg 481 gtgctgctgg tgccttgtca ttactatttg tttactcttt ggattatgca agaactagat 541 tggctgctga ctccaagtcc tctaaaaagg gtggtgctcg tcaattcaac ggtttgatcg 601 atgtctacaa gaagacctta aaatctgatg gtgttgctgg tctttacaga ggtttcttac 661 cttctgtcgt tggtattgtt gtctacagag gtctatactt cggtatgtac gattctttga 721 agcctctatt gttgactggt tctttggaag gttcattctt ggcttcattc ttgttgggtt 781 gggttgttac tactggtgct tctacatgtt cttacccatt ggataccgtt agaagaagaa 841 tgatgatgac ctccggtcaa gctgttaagt acgacggtgc ctttgactgt ttgaggaaga 901 ttgttgctgc tgaaggtgtt ggttctctat tcaagggttg tggtgctaac atcttaagag 961 gtgtcgcagg tgctggtgtt atctcaatgt acgaccaact gcaaatgatc ttgtttggta 1021 agaagttcaa ataagtctaa tctggcttga ttcttaatct aaattctttc tcacattttc 1081 ctttttttct tctttggatt tttgggtgtt taatgagtga cacgatttgt tttgataata 1141 ttattatcct cctatttttt tagaaattct tttcaacaag aatcaaagat tcataaaaaa 1201 agtaaaacga tgaaattttt tgaacaaatt ttacgtataa agaagaaaaa aattaaattc 1261 taaatatcca gtaaatcgtt ttatattagt agtattcttt cccacttt // LOCUS VSVPNA 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 06/85-NM-E) nucleoprotein gene, complete cds. ACCESSION M31845 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 06/85-NM-E, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 438 a 267 c 293 g 331 t ORIGIN 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc 61 aaaattaccg gccaacgagg atccggttga atacccggct gattacttca aaaataatac 121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catatgtata 181 ccagggtatc aagtccggta atccatccat cctccacata aatgcttatc tctacgctgc 241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg 301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc 361 acttcctgat gggaaatcag accctgaccg ttctgctgag gacgataaat ggcttcccat 421 ctacatccta ggtctctaca gagtgggcag atctaaagtt acggattaca gaaagaaact 481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga 541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc 601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg 661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa 721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga 781 cgagctctgc cagatgatgt atccgggaca agaaattgac aaagcagatt catacatgcc 841 gtatatgatt gactttgggt tatctcagaa atccccctat tcatcagtga agaatccagc 901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaacgc 961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt 1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag acaagtacac 1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt 1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt 1201 cgcaagaaga gcagtcaact ctctgcaatc acttcgtgag aagacaattg gcaaatatgc 1261 caaggcagag tttgacaagt gacaaaattt aagatatcac tatcactatt atattctatg 1321 ttatatatg // LOCUS VSVPNB 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 11/83-CA-B) nucleoprotein, complete cds. ACCESSION M31846 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 11/83-CA-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 437 a 265 c 296 g 331 t ORIGIN 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc 61 gaaattaccg gccaacgagg atccggtcga atacccggct gattacttca aaaataatac 121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata 181 ccaggggatc aagtccggta atccatccat cctccacata aatgcttatc tctatgctgc 241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg 301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc 361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat 421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaact 481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctcgtcga 541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc 601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg 661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa 721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga 781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc 841 gtatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc 901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc 961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt 1021 tgctgttggg tcctcagcag acattgaaca gcagttctat attggagaag acaaatacac 1081 aacagaaaaa gatgatggtc tgaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt 1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt 1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgaa aagacaattg gcaaatatgc 1261 caaggcagag tttgacaagt gacaaaattt aagataccac tatcactatt atattctatg 1321 ttatatatg // LOCUS VSVPNC 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 01/84-SN-P1) nucleoprotein gene, complete cds. ACCESSION M31847 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 01/84-SN-P1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 438 a 268 c 292 g 331 t ORIGIN 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc 61 aaaattaccg gccaacgagg atccggttga atacccggct gattacttca aaaataatac 121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata 181 ccagggtatc aagtccggta atccatccat cctccacata aatgcttatc tctacgctgc 241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg 301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc 361 acttcctgat gggaaatcag accctgaccg ttctgctgag gacgataaat ggcttcccat 421 ctacatccta ggtctctaca gagtgggcag atctaaagtt acggattaca gaaagaaact 481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga 541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc 601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg 661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa 721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga 781 cgagctctgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc 841 gtatatgatt gactttgggt tatctcagaa atccccctat tcatcagtaa agaatccagc 901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaacgc 961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt 1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag acaagtacac 1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt 1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt 1201 cgcaagaaga gcagtcaact ctctgcaatc acttcgtgag aagacaattg gcaaatatgc 1261 caaggcagag tttgacaagt gacaaaattt aagatatcac tatcactatt atattctatg 1321 ttatatatg // LOCUS VSVPND 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 07/83-GA-P) Phosphoprotein and nucleocapsid genes. ACCESSION M31848 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 07/83-GA-P, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 435 a 272 c 297 g 325 t ORIGIN 1 aacagatatc aaaatggccc ctacagttaa gagaatcatt aacgactcaa ttattcagcc 61 caaattaccg gccaacgaag atccggttga atacccggcc gattatttca aaaataatac 121 caatatagtg ttatatgtga gcaccaaagt agcactaaat gatttgaggg catacgtata 181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc 241 attgaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg 301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gtcgaagaac tgaagacagc 361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat 421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaatt 481 actggacggg cttgaaaatc aatgcaaagt ggcgtcgacc agatttgaga gtctagtcga 541 agatggtctc gacttctttg acatatggga gaacgatcca aatttcacca agatagtggc 601 tgcagtggat atgttcttcc acatgtttaa aaagcatgag cgtgctccaa tcagatacgg 661 aaccatagtc tcacgattca aggactgtgc agcacttgcg acatttggac atctcagcaa 721 agtcagtggg ctctcaattg aagaactcac aacatgggtc ctgaataggg aggttgcaga 781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc 841 gtatatgatc gactttggat tatcccagaa atccccatat tcatcagtga agaatccagc 901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaacgc 961 tagacagcct gacgacatcg aatacacctc actaacttgt gcaagtttgc tgctgtcatt 1021 cgctgttggg tcatcagcag acattgaaca gcaattctat attggagagg ataaatacac 1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt 1141 agactggctt ggctggtatg atgacaatgg ggggaaaccc acaccagata tgctcaactt 1201 cgcgagaaga gcagttaact ccctgcagtc actccgtgag aagacaattg gcaaatatgc 1261 caaggcagag tttgacaagt gacaaaactt aatataccac tatcactatt atattctatg 1321 ttatatatg // LOCUS VSVPNE 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 11/82-VC-B2) nucleoprotein gene, complete cds. ACCESSION M31849 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 11/82-VC-B2, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description BASE COUNT 435 a 263 c 296 g 335 t ORIGIN 1 aacagacatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc 61 taaattaccg gccaacgagg atccggttga atacccggct gattacttca aaaataatac 121 caatatagtc ttatatgtga gcaccaaagt ggcgctaaat gatttgagag catacgtata 181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc 241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg 301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gttgaagaac tgaagacagc 361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat 421 ctacatccta ggtctttaca gagtgggcag atccaaagtt acggattaca gaaagaaact 481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga 541 ggatggtctt gacttttttg acatatggga gaatgatccg aatttcacca agatagttgc 601 tgcagtggat atgttcttcc acatgtttaa aaagcatgaa cgtgctccaa tcagatacgg 661 aaccatagtc tcaagattca aggactgtgc agcacttgcg acatttggac atctcagcaa 721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga 781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc 841 atatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc 901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaagaatgc 961 tagacagcct gacgacattg aatacacttc actaacttgt gcaagtttac tgctgtcgtt 1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag acaaatacac 1081 aacagaaaaa gatgatggtc ttaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt 1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt 1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgag aagacaatcg gcaaatatgc 1261 caaggcagag tttgacaagt gacaaaattt aagatatcac tatcactatt atattctatg 1321 ttatatatg // LOCUS VSVPNF 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 07/84-OA-B) nucleoprotein gene, complete cds. ACCESSION M31850 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 07/84-OA-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 435 a 260 c 298 g 336 t ORIGIN 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcca ttattcagcc 61 taagttaccg gccaacgagg atccgattga atacccggct gattacttca aaaataatac 121 caatatagta ttatatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata 181 tcaggggatc aagtccggta atccatccat actccacata aatgcttatc tttacgctgc 241 attaaaggga gtggaaggaa ctttggacag agactgggtt agctttggaa gaacaattgg 301 aaagagagag gagaatgtaa aaatcttcga tctagtgaaa gttgaagaac tgaagacagc 361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat 421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaact 481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga 541 ggatggtcta gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc 601 tgcagtggat atgttcttcc acatgtttaa aaagcatgaa cgtgctccaa tcagatacgg 661 aaccatagtc tcaagattca aggactgtgc agcacttgcg acattcggac atctcagcaa 721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga 781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc 841 gtatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc 901 ttttcatttc tggggacaac ttgcggcact cttgctaaga tcgactcgag caaagaatgc 961 tagacagcct gatgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt 1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag ataaatacac 1081 aacagaaaaa gatgatggtc ttaagaaatc ggatgtccca ccaaaaggaa ggaatgtcgt 1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt 1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgag aagacaattg gcaaatatgc 1261 caaggcagag tttgacaagt gacaaaattt aagataccac tgtcactatt atattctatg 1321 ttatatatg // LOCUS VSVPNG 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain ../52-GA-P) nucleoprotein gene, complete cds. ACCESSION M31851 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain ../52-GA-P, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 428 a 260 c 302 g 339 t ORIGIN 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc 61 taaattaccg gccaacgagg atccggttga atacccggcc gattacttca aaaataatac 121 caatatagtg ttgtatgtga gcacaaaagt agcactaaat gatttgaggg catacgtata 181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc 241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg 301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gttgaggaac tgaagacagc 361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat 421 ctacatccta ggtctttaca gggtgggcag atctaaagtt acggattaca gaaagaaatt 481 attggacggg cttgagattc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga 541 agatggtctc gacttctttg atgtatggga gaatgatcca aatttcacca agatagtggc 601 tgcagtggat atgttcttcc acatgttcaa aaagcatgag cgtgctccaa tcagatacgg 661 aaccatagtc tcacgattca aggactgtgc agcacttgcg acatttggac atctcagcaa 721 agtcagtggt ctttcaattg aagaactcac aacatgggtc ctaaataggg aagttgcaga 781 cgagctatgt cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc 841 gtatatgatc gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc 901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc 961 tagacagcct gacgacatcg agtacacttc actaacttgt gcaagtttgc tgctgtcatt 1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag ataaatacac 1081 aacagaaaaa gatgatggtc ttaggaaatc agatgttcca ccaaaaggaa gaaatgtcgt 1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctccaatt 1201 cgcaaggaga gcagttaact ctctgcagtc acttcgtgag aagacaattg gcaaatatgc 1261 caaggccgag tttgacaagt gacaaaactt aagataccac tatcactatt atattctatg 1321 ttatatatg // LOCUS VSVPNH 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 10/84-GM-P) nucleoprotein gene, complete cds. ACCESSION M31852 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 10/84-GM-P, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 441 a 274 c 280 g 334 t ORIGIN 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcaa taattcagcc 61 caaactaccg gccaatgagg atccggtaga atatccagct gattacttca aaaataatac 121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttaaggg cctatgtcta 181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc 241 actaaaggga gtagaaggga cattagacag agactgggtc agctttggaa gaacaatcgg 301 aaagagggag gagaatgtca aaatcttcga tctagttaaa gttgaggaac tgaagacagc 361 tcttcctgac gggaaatcgg atccagatcg atcagctgaa gatgataaat ggcttcctat 421 ttacatccta ggcctctacc gagtaggtag atccaaagtt acagattaca gaaagaaact 481 cctggacggt ctagaaaatc agtgtaaagt ggcatcaacc agatttgaaa gccttgttga 541 agatggtctt gacttcttcg acatctggga aaatgatccc aatttcacta aaatagttgc 601 tgcagtggat atgttcttcc acatgttcaa gaagcatgaa cgtgctccga tcagatacgg 661 aaccatagtc tcacgattca aagactgtgc agcacttgct acatttggac atctcagcaa 721 ggtcagcgga ctttcaattg aggacctaac aacatgggtt ctgaatcgag aagtcgcaga 781 cgaactatgc caaatgatgt acccaggtca agaaattgat aaggcggatt catacatgcc 841 gtatatgatt gattttgggt tatcccaaaa atctccttat tcatcagtta agaatccagc 901 ctttcacttc tggggtcaac tggctgcact cttgctaaga tcaactcgag caaaaaatgc 961 tcggcaaccc gatgacattg agtacacttc actgacttgt gcaagtttac tactgtcgtt 1021 tgcagttgga tcttcagcag acattgaaca acaattctac atcggagagg acaaatacac 1081 aacagaaaaa gatgacggtc tcaagaaatc tgatgtccca ccgaaaggaa gaaatgttgt 1141 cgactggctt ggttggtatg atgacaatgg aggaaaaccc acaccagata tgctcaactt 1201 cgcacgaaga gcagtcagtt cgttgcagtc acttcgcgag aaaacaattg gcaagtacgc 1261 caaggcagaa tttgacaaat gaccaagttt gacatatcaa tatcaatatt atattctgtg 1321 ctatatatg // LOCUS VSVPNI 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 11/84-HD-B1) nucleoprotein gene, complete cds. ACCESSION M31853 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 11/84-HD-B1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 442 a 278 c 283 g 326 t ORIGIN 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa taattcagcc 61 taaactaccg gccaatgagg atccggtaga atatccagct gattacttca aaaataatac 121 caacatagta ttatatgtga gcactaaagt agccctaaat gatttgaggg cctatgtcta 181 ccagggaatc aaatccggta atccatcaat actccacata aatgcttatc tttatgccgc 241 actaaaggga gtagagggaa cattagacag agactgggtt agctttggaa gaacaatcgg 301 aaaaagggag gagaatgtca aaatcttcga cctggtcaaa gttgaagaac tgaagacagc 361 tcttcctgac ggaaaatcag atccagaccg atcagccgaa gatgataaat ggcttcctat 421 ttacatccta ggcctctatc gagtaggtag atccaaagtg acagattaca gaaagaaact 481 cctggatggt ctagaaaatc agtgcaaagt ggcatcaacc agatttgaaa gcctggttga 541 agatggtctt gacttcttcg acatctggga aaatgatccc aatttcacta aaatagttgc 601 tgcagtggac atgttcttcc acatgttcaa gaagcatgaa cgtgctccga tcagatacgg 661 aaccatagtc tcacgattca aagactgtgc ggcacttgct acatttggac atctcagcaa 721 agtcagcgga ctttcaattg aggacctaac aacatgggtt ctgaatagag aagtcgcaga 781 tgaactatgc caaatgatgt acccaggcca agaaattgat aaggcggatt catacatgcc 841 atatatgatt gactttgggt tatcccaaaa atctccgtat tcatcagtga agaatccagc 901 ctttcacttc tggggtcaac tggctgcact cttgctaaga tcgactcggg caaaaaatgc 961 taggcaacct gatgacattg agtacacttc actgacctgt gcaagtttac tcctgtcatt 1021 tgcagttgga tcttctgcag acattgagca acaattctac atcggggaag acaaatacac 1081 aacagagaag gatgacggtc tcaagaaatc cgaagtccca ccgaaaggaa gaaatgttgt 1141 agactggctt ggttggtacg atgacaacgg aggaaaaccc acacctgata tgctcaactt 1201 cgcacgaaga gcagtcagct ctttgcagtc acttcgtgag aaaacaatcg gcaaatacgc 1261 taaagcagaa tttgataaat gaccaggttt gacatatcaa tatcaatatt atattctgtg 1321 ctatatatg // LOCUS VSVPNJ 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 09/82-HD-B) nucleoprotein gene, complete cds. ACCESSION M31854 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 09/82-HD-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 437 a 265 c 296 g 331 t ORIGIN 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aacgactcaa ttattcagcc 61 gaaattaccg gccaacgagg atccggtcga atacccggct gattacttca aaaataatac 121 caatatagta ttgtatgtga gcaccaaagt ggcactaaat gatttgagag catacgtata 181 ccaggggatc aagtccggta atccatccat cctccacata aatgcttatc tctatgctgc 241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaatagg 301 aaagagagag gagaatgtga aaattttcga tctagtgaaa gttgaagaac tgaagacagc 361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat 421 ctacatccta ggtctttaca gagtgggcag atctaaagtt acggattaca gaaagaaact 481 actggacggg cttgaaaatc agtgcaaagt ggcgtcaacc agatttgaga gtctcgtcga 541 ggatggtctc gacttctttg acatatggga gaatgatcca aatttcacca agatagttgc 601 tgcagtggat atgttcttcc acatgttcaa aaagcatgaa cgtgctccaa tcagatacgg 661 aaccatagtc tcaagattca aggactgtgc agcacttgca acatttgggc atctcagcaa 721 agtcagtgga ctctcaattg aggaactcac aacatgggtc ctgaataggg aagttgcaga 781 cgagctatgc cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc 841 gtatatgatt gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc 901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc 961 tagacagcct gacgacatcg aatacacttc actaacttgt gcaagtttac tgctgtcatt 1021 tgctgttggg tcctcagcag acattgaaca gcagttctat attggagaag acaaatacac 1081 aacagaaaaa gatgatggtc tgaagaaatc agatgtccca ccaaaaggaa gaaatgtcgt 1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctcaactt 1201 cgcaagaaga gcagtcaact ctctgcagtc acttcgtgaa aagacaattg gcaaatatgc 1261 caaggcagag tttgacaagt gacaaaattt aagataccac tatcactatt atattctatg 1321 ttatatatg // LOCUS VSVPNK 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain ../49-UT-B1) nucleoprotein gene, complete cds. ACCESSION M31855 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain ../49-UT-B1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 428 a 261 c 302 g 338 t ORIGIN 1 aacagatatc aaaatggctc ctacagttaa gagaatcatt aatgactcca taattcagcc 61 taaactaccg gccaacgagg atccggttga atacccggcc gattacttca aaaataatac 121 caatatagtg ttgtatgtga gcacaaaagt agcactaaat gatttgaggg catacgtata 181 ccaggggatc aagtccggta atccatccat actccacata aatgcttatc tctacgctgc 241 attaaaggga gtggaaggaa ctttagacag agactgggtt agctttggaa gaacaattgg 301 aaagagagag gagaatgtga aaatcttcga tctagtgaaa gttgaggaac tgaagacagc 361 acttcctgat gggaaatcag accctgaccg ttctgctgag gatgataaat ggcttcccat 421 ctacatccta ggtctttaca gggtgggcag atctaaagtt acggattaca gaaagaaatt 481 attggacggg cttgagattc agtgcaaagt ggcgtcaacc agatttgaga gtctagtcga 541 agatggtctc gacttctttg atgtatggga gaatgatcca aatttcacca agatagtggc 601 tgcagtggat atgttcttcc acatgttcaa aaagcatgag cgtgctccaa tcagatacgg 661 aaccatagtc tcacgattca aggactgtgc agcacttgcg acatttggac atctcagcaa 721 agtcagtggt ctttcaattg aagaactcac aacatgggtc ctaaataggg aagttgcaga 781 cgagctatgt cagatgatgt atccgggtca agaaattgac aaagcagatt catacatgcc 841 gtatatgatc gactttgggt tatctcagaa atccccatat tcatcagtga agaatccagc 901 ttttcatttc tggggacaac ttgctgcact cttgctaaga tcaactcggg caaaaaatgc 961 tagacagcct gacgacatcg agtacacttc actaacttgt gcaagtttgc tgctgtcatt 1021 tgctgttggg tcctcagcag acattgaaca gcaattctat attggagaag ataaatacac 1081 aacagaaaaa gatgatggtc ttaggaaatc agatgttcca ccaaaaggaa gaaatgtcgt 1141 ggactggctt ggctggtatg atgacaatgg gggaaaaccc acaccagata tgctccaatt 1201 cgcaaggaga gcagttaact ctctgcagtc acttcgtgag aagacaattg gcaaatatgc 1261 caaggccgag tttgacaagt gacaaaactt aagataccac tatcactatt atattctatg 1321 ttatatatg // LOCUS VSVPNL 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 10/82-CR-B) nucleoprotein gene, complete cds. ACCESSION M31856 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 10/82-CR-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 431 a 268 c 298 g 332 t ORIGIN 1 aacagacatc aaaatggctc ctacagtcaa gagaattgtc aatgactcta ttattcatcc 61 taaattgcca gcccatgaag atccggtgga atacccggct gattacttca aaaacaatac 121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gatttgaggg cttatgtcta 181 ccaggggatc aagtctggca atccatctat actccacata aatgcttacc tttatgcagc 241 attgaagggg gtagaaggga ccttagacag agactgggtc agcttcggac gcacaattgg 301 gaaaagagaa gagagtgtga aaatctttga ccttgttaaa gttgaggaat taaagacagc 361 ccttcctgat ggaaaatcag atcccgaccg ttcagctgag gatgacaaat ggcttcccat 421 ctatatcctg ggtctttaca gagtaggcag atcaaaagtt actgattaca gaaagaaatt 481 gctggatggt ttagagaatc agtgcaaagt agcatcgacc agattcgaaa gcctcgttga 541 agatggattg gatttctttg acatatggga gaatgatccg aacttcacca agatagttgc 601 agcggtagat atgttctttc atatgttcaa gaaacatgag cgtgcaccaa tcagatacgg 661 aaccatagtc tcacggttca aagactgtgc agcattagcg acatttgggc atctaagcaa 721 ggtcagtgga ctctctattg aagatctcac aacatgggtc cttaatagag aagttgcaga 781 cgagctatgc cagatgatgt atccggggca agaaattgat aaggcggatt cttacatgcc 841 atatatgatt gactttggat tgtcgcagaa atcaccatac tcatcagtta aaaacccagc 901 tttccacttc tggggtcaac tagcagcact cctattgaga tctactcgcg caaaaaatgc 961 ccgacagcca gatgacattg aatatacatc attaacttgt gcaagtctac tgctgtcatt 1021 tgcagttggg tcctcagcag acattgagca gcagttctat attggagaag ataaatacac 1081 aacagacagg gatgatggtc tgaagaaatc agatgtacca cccaaaggga ggaatgtcgt 1141 agactggctc ggctggtatg acgataatgg aggaaaacct acaacggaca tgctgaactt 1201 cgcaagaaga gcagtcagct ccctacaagc acttcgcgag aaaacaatcg gcaaatatgc 1261 caaggcagag tttgataaat gaccaaactc tgcataccac tatcaatatt atattctatg 1321 ttacatatg // LOCUS VSVPNM 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 12/82-HD-B) nucleoprotein gene, complete cds. ACCESSION M31857 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 12/82-HD-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 441 a 267 c 291 g 330 t ORIGIN 1 aacagacatc aaaatggctc ctacagtcaa gagaattgtc aatgactcta ttattcatcc 61 taaactgcca gcccatgaag atccggtgga atacccggct gattacttca aaaacaatac 121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gatttgaggg cttatgtcta 181 ccaagggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc 241 attgaaggga gtagaaggga ccttggacag agactgggtc agcttcggac gcacaattgg 301 gaaaagagaa gagagtgtga aaatctttga ccttgttaaa gtggaagaat taaagacagc 361 acttcctgat ggaaaatcag atccagaccg ttcagctgag gatgataaat ggcttccaat 421 ttatatcctg ggtctctaca gagtaggtag atcaaaagtt actgattacc gaaagaaatt 481 gctggacggt ttagagaatc agtgcaaagt agcatcgacc agattcgaaa gcctagttga 541 agacggattg gatttctttg acatatggga gaatgatcca aacttcacca agatagttgc 601 agcagtagat atgttctttc atatgttcaa gaaacatgag cgtgcaccaa tcagatacgg 661 aaccatagtc tcacggttca aagactgtgc agcattggcg acatttgggc atctcagcaa 721 ggtcagtgga ctctctattg aagatctcac aacatgggtt ctaaatagag aagttgcaga 781 cgagctatgc cagatgatgt atccagggca agaaattgat aaggcggatt cttacatgcc 841 atatatgatt gactttggat tgtcgcagaa atcaccatac tcatcagtta aaaacccagc 901 tttccacttc tggggtcaac tcgcagcgct cctattaaga tctactcgcg caaaaaatgc 961 cagacaacca gatgacattg aatatacatc attaacttgt gcaagtctac tgctgtcatt 1021 tgcagttggg tcctcagcag acattgagca gcagttctat attggagaag acaaatacac 1081 aacagataag gatgatggtc taaagaaatc agatgtacca cccaaaggga ggaatgtcgt 1141 agactggctc ggctggtatg acgataatgg gggaaaacct acaccagaca tgctgaactt 1201 cgcaagaaga gcagtcagct ccctacaagc acttcgtgag aaaacaatcg gcaaatatgc 1261 caaggcagag tttgataaat gaccaaactc tacataccac tatcaatatt atattctatg 1321 ttacatatg // LOCUS VSVPNN 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 10/85-HD-B1) nucleoprotein gene, complete cds. ACCESSION M31858 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 10/85-HD-B1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 437 a 264 c 294 g 334 t ORIGIN 1 aacagacatc aaaatggctc ctacagttaa gagaatcgtc aatgactcta ttattcatcc 61 taaactgcca gcccatgaag atccggtgga atacccagct gattacttca aaaacaatac 121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gatttgaggg cttacgtcta 181 ccaagggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc 241 attgaaggga gtagaaggga ctttggacag agactgggtc agctttggac gcacgattgg 301 gaaaagagaa gagagtgtga aaatatttga ccttgttaaa gtggaagaat taaagacagc 361 acttcctgat ggaaaatcag atccagaccg ttcagctgag gatgataaat ggcttcccat 421 ttatatcctg ggtctctaca gagtaggtag atcaaaagta actgattacc gaaagaaatt 481 gctggatggt ttagagaatc aatgcaaagt agcatcgacc agattcgaaa gcctagttga 541 agatggattg gatttctttg acatatggga gaatgatccg aacttcacca agatagttgc 601 agcggtagat atgttctttc atatgttcaa aaaacatgag cgggcaccaa tcagatacgg 661 aaccatagtc tcacggttca aagactgtgc cgcattggcg acatttgggc atctcagcaa 721 ggtcagtgga ctctctattg aagatctcac aacatgggtc cttaataggg aagttgcaga 781 cgagctatgc cagatgatgt atccagggca agaaattgat aaggcggatt cttacatgcc 841 atatatgatt gactttggat tatcgcagaa atcaccatac tcatcagtta aaaacccagc 901 tttccacttc tggggtcaac ttgcagcgct cctattaaga tctactcgcg caaaaaatgc 961 ccgacagcca gatgacattg aatatacatc attgacttgt gcaagtctac tgctgtcatt 1021 tgcagttggg tcctcagcag acattgagca gcagttctat attggagaag ataaatacac 1081 aacagataag gatgatggtc taaagaaatc agatgtacca cccaaaggga ggaatgtcgt 1141 agactggctc ggctggtatg acgataatgg gggaaaacct acaccagaca tgctgaactt 1201 cgcaagaaga gcagtcagct ccctacaagc acttcgtgag aaaacaattg gtaaatatgc 1261 caaggcagag tttgataaat gaccaaactc tacataccac tatcaatatt atattctatg 1321 ctacatatg // LOCUS VSVPNO 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 07/83-NC-P) nucleoprotein gene, complete cds. ACCESSION M31859 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 07/83-NC-P, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 430 a 270 c 293 g 336 t ORIGIN 1 aacagacatc aaaatggctc ctacagtcaa gagaatcgtt aacgactcta ttattcatcc 61 caagctgcca gcccatgaag atccggtaga atacccggct gattacttca aaaacaatac 121 aaatatagtg ttatatgtta gcactaaagt ggcactaaat gacttgaggg cttatgtcta 181 ccaggggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc 241 attgaaagga gtagaaggga ccttggatcg agactgggtc agcttcggac gcaccattgg 301 aaaaagagaa gagagtgtga aaatctttga tcttgtcaaa gtggaagagt taaaaacagc 361 acttcctgac ggaaaatcag accctgatcg ttcagctgag gatgacaaat ggcttcccat 421 ttatatcctg ggtctctaca gagttggtag atcaaaagtt actgattacc ggaagaaatt 481 gctagatggt ctagagaatc agtgcaaggt agcatcaacc agattcgaaa gtttagtcga 541 agatggattg gatttctttg acatatggga gaacgatccg aacttcacca agatagttgc 601 agcagtagat atgttctttc atatgttcaa gaaacatgag cgtgcaccaa tcagatatgg 661 aaccatagta tcacggttca aagactgtgc cgcattagca acatttgggc atctcagtaa 721 agtcagtgga ctttctattg aggatctcac aacatgggtc ctaaataggg aagttgcaga 781 cgaactatgc cagatgatgt acccagggca agaaattgat aaagcggatt cctacatgcc 841 atatatgatt gactttggat tgtcccagaa atcaccatat tcatcagtga aaaacccagc 901 cttccacttc tggggtcaac ttgcagcact cctattaaga tctactcgcg caaagaatgc 961 tcgacagccg gatgacattg aatatacatc attgacttgt gcaagtctat tactgtcatt 1021 tgcagttggg tcctcagcag acattgaaca gcagttctat attggagaag ataagtacac 1081 aacagataag gatgatggtc tgaagaaatc agatgtacca cccaaaggga ggaatgttgt 1141 agactggctc ggctggtatg atgataatgg gggaaaaccc acaccggaca tgctaaattt 1201 cgcaagaaga gcagtcaact ccctacaagc actccgtgag aaaacgattg gcaagtatgc 1261 caaggccgag ttcgataaat gaccagattc tacctactac tatcaatatt atattctctg 1321 ttatatatg // LOCUS VSVPNP 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 01/85-PN-B1) nucleoprotein gene, complete cds. genes. ACCESSION M31860 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain 01/85-PN-B1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 437 a 265 c 293 g 334 t ORIGIN 1 aacagacatc aaaatggctc ctacagtcaa gagaatcgtc aacgactcta ttatccatcc 61 taagctgcca gcccatgaag atccggtaga atacccggct gattacttca agaacaatac 121 gaatatcgtg ttgtatgtaa gcactaaagc agcattaaat gatttaaggg catatgtcta 181 ccaggggatt aagtctggca atccatctat actccacata aatgcttacc tttacgcagc 241 attaaaggga gtagaaggga ccttggacag agattgggtc agtttcggac gcaccattgg 301 gaaaagagaa gagaatgtga aaatatttga tcttgttaaa gtggaagaat taaaaacagc 361 acttcctgat ggaaaatcag accctgatcg ttcggcagag gatgataaat ggctccccat 421 ttatatccta ggtctctaca gagtgggtag atcgaaagtt actgattacc ggaagaaatt 481 gctagatggt ctagagaacc agtgcaaagt agcatcaacc agatttgaaa gtctagtcga 541 agatggattg gatttttttg acatatggga aaatgatccg aatttcacca agatagttgc 601 ggcagtagac atgttctttc atatgttcaa gaaacatgag cgtgcaccaa taagatatgg 661 aaccatagta tcacggttca aggactgtgc agcgttagca acatttggac atctcagtaa 721 agtcagcgga ctttctattg aagatctcac aacatgggtc ctcaataggg aagttgcaga 781 cgaactctgc cagatgatgt acccaggtca agaaattgat aaatcggatt cttacatgcc 841 atatatgatt gactttggat tgtcccagaa atcgccgtac tcatcagtaa aaaacccagc 901 tttccacttc tggggtcaac ttgcagcact cctgttaaga tctactcgtg caaaaaatgc 961 ccgacagcca gatgacattg aatatacatc actgacttgt gcaagcctac tactgtcatt 1021 tgcagtcggg tcttcagcag acatcgaaca gcagttttat attggagaag acaagtacac 1081 aacagaaaag gatgatggtc taaggaactc agatgtacca cccaaaggga ggaatgttgt 1141 agactggctc agctggtatg acgataatgg aggaaaacca acaccggaca tgctaaattt 1201 tgcaaaaaga gcggtcaact ccttgcaggc actccgtgag aaaacgattg gcaagtatgc 1261 caaggcagag ttcgataaat gaccagatta tacataccgc tatcaatatt atattctctg 1321 ttatatatg // LOCUS VSVPNQ 1329 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain ../60-PN-B) nucleoprotein gene, complete cds. ACCESSION M31861 KEYWORDS nucleoprotein. SOURCE Vesicular stomatitis virus (strain ../60-PN-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 1329) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 14 1282 nucleoprotein BASE COUNT 434 a 273 c 290 g 332 t ORIGIN 1 aacagacatc aaaatggctc ctacagtcaa gagaatcgtt aacgactctg ttattcatcc 61 taagctacca gcccatgagg atcccgtaga atatccagct gattacttca aaaacaatac 121 aaatatcata ttatatgtta gcactaaagt ggcactgaat gatttaaggg cttatgtcta 181 ccaggggatc aagtctggca atccatctat actccacata aatgcttacc tttacgcagc 241 attgaaggga gtagaaggga ctttagacag agactgggtc agcttcggac gaaccattgg 301 gaaaagagaa gagagtgtga aaatctttga tcttgtcaaa gtggaagagt taaaaacagc 361 acttcctgat ggaaaatcag accctgatcg ttcggccgag gatgacaaat ggcttcccat 421 ttatatctta ggtctctaca gagtgggtag atcaaaggtt actgattacc ggaagaaatt 481 gctagacggt ctagagaacc agtgcaaagt agcatcaacc agattcgaaa gtctagtcga 541 agatggattg gatttttttg acatatggga aaatgatccg aacttcacca agatagttgc 601 agcagtagac atgttctttc acatgttcaa gaaacatgag cgagcaccaa tcagatatgg 661 aaccatagta tcacggttca aggactgtgc agcattagca acatttgggc atctcagtaa 721 agtcagtgga ctttctattg aggatcttac aacatgggtc ctcaataggg aagttgcaga 781 tgaactatgc cagatgatgt atccggggca agaaattgat aaagctgatt cttacatgcc 841 atatatgatt gactttggat tgtcccagaa atcaccatat tcatcagtta aaaacccagc 901 cttccacttc tggggtcaac ttgcagcact cctgttaaga tctactcgtg caaaaaatgc 961 ccgacagcca gatgacattg aatatacgtc attgacctgt gcaagtctac tactgtcctt 1021 tgcagttgga tcctcagcag acattgaaca gcagttctat attggagaag ataagtacac 1081 gacagacaag gatgatggtc taaagaaatc agatgcacca cccaaaggga ggaatgttgt 1141 agactggctc ggttggtatg acgataatgg gggaaaaccc acaccggaca tgctaaattt 1201 cgcaagaaga gcggtcaact ccctacaagc actccgtgag aaaacgattg gcaaatatgc 1261 caaggcagag ttccataaat gaccagatta tacccaccgc taccaatatt atatcctatg 1321 ttacatatg // LOCUS VSVPPAA 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 11/84-HD-B1) phosphoprotein gene, complete cds. ACCESSION M31871 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 11/84-HD-B1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 290 a 163 c 182 g 221 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttac ttagctactt atgataattt 61 ggattctgcc ttgcaggatg ccaatgaatc tgaagaaaga cgagaggata aatatctcca 121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccggatga 241 aagaatagac aagtgggaag aggacttgcc tgaattagaa aatattgatg atgatatacc 301 ggtgaccttt cctaattgga cacagcctgt aatgaaggaa aatggaggag agaaatcatt 361 gtctctgttc cctccagttg gactaacaaa gattcaaaca gagcaatgga gaaaaaccat 421 tgaggcagtt tgtgagagct caaaatattg gaatttatca gaatgtcaaa ttctgaactt 481 ggaagactgc ctcaccctca aaggccgagt gatgactcct gattgtagtt cctcagtaaa 541 atctcaaaat tctattcaga gttcggaatc tctctactcc tctcattccc caggtcccac 601 actcaaggga tcagaatcca tcaatctatg ggatttgaag tccacagaag tacaattgat 661 ctctaagaga gctggagtta aggacatgac agtcaaattg acagacttct tcgaaagtga 721 ggaagagtat tactcagtat gcccagaagg ggcaccagat ttaataggag ctatcatcat 781 gggattgaaa tataagaaac tattcaatca agcaagaatg aagtatcgtc tctaagtctt 841 tctcaggatc agtatg // LOCUS VSVPPAB 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 09/82-HD-B) phosphoprotein gene, complete cds. ACCESSION M31872 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain , New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 283 a 168 c 185 g 220 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttac ttagctactt atgataattt 61 ggattctgcc ttgcaggatg ccaatgaatc tgaggaaagg cgagaggata aatatctcca 121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccagatga 241 aagaatagac aagtgggaag aagatttgcc tgaattagaa aagattgatg atgatatacc 301 ggtgaccttt cctaattgga cacagcctgt aatgaaggaa aatggaggag agaaatcatt 361 gtctctgttc cctccagttg gattaacaaa aattcaaaca gaacaatgga gaaaaaccat 421 tgaggcggtt tgtgagagtt caaaatattg gaatctatca gaatgccaaa ttctgaactt 481 agaagactgc ctaaccctca aaggccgagt gatgactcct gactgtagtt cttcagtcaa 541 atctcaaaat tctgtccaga gttcggaatc tctctactcc tctcattccc caggccccac 601 actcaaggtg tcagagtcca tcaatctatg ggatttaaag tccacagaag tacaactgat 661 ctctaagaga gccggagtta aggacatgac agtcaaattg acagacttct tcggaagtga 721 ggaagagtat tattcagtat gcccagaagg ggcaccagac ttgatgggag ctatcatcat 781 gggattgaaa tataagaaac tcttcaatca ggcgcgaatg aagtatcgtc tttaattcct 841 tctcatgatc agtatg // LOCUS VSVPPAC 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain ../49-UT-B1) phosphoprotein gene, complete cds. ACCESSION M31873 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain ../49-UT-B1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 284 a 161 c 185 g 226 t ORIGIN 1 aacagatatc atggacagtg ttgataggct caagacttac ttagccactt atgataattt 61 ggattctgcc ttgcaggatg ccaatgaatc tgaggaaaga cgagaggata aatatctcca 121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg attataatgc tgaacatctt acgatgctgt caccggatga 241 aagaatagac aagtgggaag aagatttgcc tgaattagaa aagattgatg atgatatacc 301 ggtgaccttt tctgattgga cacagcctgt aatgaaggaa aatgggggag agaaatcatt 361 gtctctgttc cctccagtcg ggttaacaaa gattcaaaca gaacaatgga aaaaaaccat 421 tgaggcggtt tgtgagagtt caaaatattg gaatttatca gaatgccaaa ttcttaactt 481 ggaagacagc ctcactatca aaggccgatt gatgactcct gattgtagtt cttcagtaaa 541 atctcaaaat tctgtccgga ggtcagaacc tctctactcc tctcattctc caggtccccc 601 actcaaggta tcagagtcca tcaatttatg ggatttaaag tccactgaag tacaattgat 661 ctccaagaga gccggagtta aggacatgac agtcaaattg acagacttct ttggaagtga 721 ggaagagtat tattcagtat gcccagaagg ggcgccagac ttgatgggag ctatcatcat 781 gggactgaag tacaagaaac tcttcaatca ggcaagaatg aaatatcgtc tttaattcct 841 tttcatgatc aatatg // LOCUS VSVPPAD 855 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 10/82-CR-B) phosphoprotein gene, complete cds. ACCESSION M31874 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 10/82-CR-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 855) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 275 a 153 c 199 g 228 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca 121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga 241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgatatacc 301 ggtcaccttc cctaattgga ctcaacctgt aatgaaagag aatggtgggg agaagtcatt 361 atctttgttt cctcccgttg gattgaccaa ggttcagacc gaacaatgga ggaagacgat 421 tgaggcagtc tgtgagagct caaaatattg gaatttatca gagtgccgga ttgtaacttc 481 agatgattgt cttattctca gaggacgagt tatgactcct gattgcagtt catcagttaa 541 atctcagaac tccgtacgga gttcagaatc tctttactcc tcccattcac caggacctgc 601 atctagggca tcagagtcaa tgaatttatg ggacttgaag tcaacagaag tccaattgac 661 ctcaaagaga gcaggagtca aagacatgac agtcaagtta acagacttct ttggaagtga 721 ggaagagtat tattcagtat gcccggaagg agcatcagat ttgatgggag caattattat 781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaatatcgac tttaactcct 841 tactacagtc aatat // LOCUS VSVPPAE 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 12/82-HD-B) phosphoprotein gene, complete cds. ACCESSION M31875 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 12/82-HD-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 277 a 153 c 197 g 229 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca 121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga 241 gagagtagac aagtgggagg aagacttgcc tgaattagag aagattgatg atgatatacc 301 ggtcactttc cctaattgga ctcaacctgt aatgaaagag aatggtgggg agaagtcatt 361 atctttgttt cctcccgttg gattgaccaa gattcagacc gaacaatgga gaaagacgat 421 tgaggcagtc tgtgagagct caaaatactg gaatttatca gagtgccgga ttgtaacttc 481 agatgattgt cttattctca gaggacgagt tatgactcct gattgcagtt catcagttaa 541 atctcagaat tctgtacaga gttcagaatc tctctactcc tcccattcac caggacctgt 601 tcctagggca tcagagtcaa tgaatttatg ggacttgaag tcaacagaag tccaactgat 661 ctcaaagaga gcaggagtca aagacatgac agtcaagtta acagacttct ttggaagtga 721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caattattat 781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaataccgac tttaactcct 841 tactacagtc aatatg // LOCUS VSVPPAF 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 10/85-HD-B1) phosphoprotein gene, complete cds. ACCESSION M31876 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 10/85-HD-B1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 279 a 160 c 195 g 222 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca 121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccagacga 241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgatatacc 301 ggtcatcttc cctaattgga ctcaacctgt aatgaaagag aatggtgggg agaagtcatt 361 atctttgttt cctcccattg gattgaccaa ggttcagacc gaacaatgga gaaagacgat 421 tgaggcagtc tgtgagagct caaaatactg gaatttatca gagtgccgga ttgtaacttc 481 agatgattgt cttattctca gaggacgagt tatgactcct gattgcagtt catcagttaa 541 atctcagaat tctgtacaaa gttcagaatc tctctactcc tccccttcac caggacctgc 601 acctagggca tcagagtcaa tgaatttatg ggacctgaag tcaacagaag tccaactgac 661 ctcaaagaga gcaggagtca aagacatgac agtcaagtta acagacttct ttggaagtga 721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caatcattat 781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaataccgac tttaactcct 841 tactacagtc aatatg // LOCUS VSVPPAG 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 07/83-NC-P) phosphoprotein gene, complete cds. genes. ACCESSION M31877 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 07/83-NC-P, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 279 a 148 c 199 g 230 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaggagaga agagaggata aatatctcca 121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aggaggaaga 181 atcgtcagat tcagatactg actataatgc tgagcatctt acgatgttgt cgccagatga 241 aagaatagac aagtgggaag aggacttgcc tgaattagag aagattgatg atgatatacc 301 ggtcaccttc cctaattgga ctcaacctgt aatgaaggag aatggtgggg agaagtcatt 361 atctttgttt cctccagttg gattgactaa ggttcaaaca gaacaatgga ggaagactat 421 tgaggcagtc tgtgagagct caaaatattg gaatttatca gagtgccgga ttgtaacttc 481 agatgattgt ctcattctca gaggacgagt tatgactcct gattgcagct catcagttaa 541 atctcagaat tctgtacaga gttcagaatc tctctactcc tcccattcac caggacctgc 601 acctagagca tcagagtcaa cgaatctatg ggatttgagg tctacagagg tgcaattgac 661 ttcaaagaga gcaggagtta aggatatgac agtaaagtta acagatttct ttgaaagcga 721 agaagagtat tattcagtat gcccagaggg agcatcagac ttaatgggag caattattat 781 gggactgaag cataagaaac tctttaatca agcaagaatg aaatatcgac tttaacttct 841 tactgcagac aatatg // LOCUS VSVPPAH 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 01/85-PN-B1) phosphoprotein gene, complete cds. ACCESSION M31878 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 01/85-PN-B1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 276 a 156 c 198 g 226 t ORIGIN 1 aacagatatc atggacagtg tcgatcggct caagacgtat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca 121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aagaggaaga 181 atcgtcagat tcagatactg actataatgc tgagcatctt acgatgttgt cgcccgagga 241 aagaatagac aagtgggaag aagacttgcc tgaattggag aagatcgatg acgatatact 301 ggtcacattc cctagttgga ctcaacctgt aataaaagag aatggtgggg agagatcatt 361 atctttattt cctccagtcg gattgactca ggttcagaca gaacagtgga agaagaccat 421 tgagacagtc tgtgagagct caaaatattg gaatttgtca gagtgccaga ttgtaacttc 481 aggaaattgt cttattttga ggggacaagt tatgacctct gattgcagcc catcagctaa 541 atctaagaat tctgtacaga gttccgagtc tccttcccct tccaattcac ccgaacatgc 601 atctagagca tcagcttcac cgaatctatg ggactttaag tttacagagg tgcaattgat 661 ctcaaaaaga gcaggagtta aggacatgac agtaaaatta acagacttct ttgaaagcga 721 agaagagtat tattcagtat gcccggacgg agcatctgat ttgatggggg caattgttat 781 gggcctgaag cataagaaac tctttaatca ggcaaggatg aaatatcgac tttaacttct 841 tactgcagac aatatg // LOCUS VSVPPAI 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain ../60-PN-B) phosphoprotein gene, complete cds. ACCESSION M31879 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain ../60-PN-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 271 a 163 c 206 g 216 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc cgaagagaga agagaggata aatatctcca 121 ggacctcttc atcgaagatc aaggagataa accaactccg tcgtattatc aggaggaaga 181 atcgtcagat tcagatactg actacaatgc tgagcatctt acgatgttgt cgccagatga 241 aagaatagac aagtgggaag aggacttgcc agaattagag aagatcgatg atgatatacc 301 ggtcaccttc cctaattgga ctcaacctgt aatgaaggag aatggtgggg agaagtcatt 361 atcattgttt cctccagttg gattgactaa ggttcagaca gaacagtgga agaagaccat 421 cgaggcagtc tgtgagagct caaaatattg gaacttatca gagtgccgga ttgtgacttc 481 agatgattgt ctcattctga ggggacgagt tatgactcct gattgcagct catcagttaa 541 atctcagaat tctgtacaga gttccgagtc tctctgctcc tcccattcac caggacctgc 601 ccctagagca tcagattcaa cgaatctatg ggacttgaag tccacagagg tgcaactagc 661 ctctaagaga gcaggagtta aggacatgac agtaaagtta acagacttct ttgaaagcga 721 agaagagtat tattcagtat gcccagaagg agcatccgac ttgatggggg caattattat 781 gggactgaag cataagaaac tctttaacca ggcaagaatg aaatatcgga tttaacttct 841 tactgcggac aatatg // LOCUS VSVPPAJ 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain ../76-EC-M) phosphoprotein gene, complete cds. ACCESSION M31880 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain ../76-EC-M, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 278 a 156 c 197 g 225 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg ccaatgaatc tgaagagaga agagaggata aatatctcca 121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aagaagaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga 241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgacatacc 301 ggttaccttt cctaattgga ctcaacctgt catgaaagag aatggtgggg agaaatcatt 361 atctctgttt cctcccgttg gattgaccaa gtttcagacc gaacaatgga ggaagacgat 421 tgaagcggtc tgtgaaagct caaaatattg gaatttatca gaatgccgga tggtaacttc 481 agatgattgt ctgattctca gaggacgagt tataactcct gattgcagtt catcagttaa 541 atctcagaat tctgtacgga gttcagaatc tctctattca tcccactcac caggacctgc 601 acctggggca tcagggtcaa tgaatttatg ggacttgaag tcaacagaag tccaactgac 661 ctcaaagaga gcaggagtca aggacatgac agtcaagtta acagacttct ttggaagtga 721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caattattat 781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaatatcgac tttaacccct 841 tactacagtc aatatg // LOCUS VSVPPR 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain ../76-EC-M) phosphoprotein gene, complete cds. ACCESSION M31862 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain ../76-EC-M, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E., Fitch,W.M. and Nichol,S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 278 a 156 c 197 g 225 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg ccaatgaatc tgaagagaga agagaggata aatatctcca 121 ggacctcttc atcgaagatc aaggagataa atcaactccg tcgtattatc aagaagaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccagatga 241 gagagtagac aagtgggaag aagacttgcc tgaattagag aagattgatg atgacatacc 301 ggttaccttt cctaattgga ctcaacctgt catgaaagag aatggtgggg agaaatcatt 361 atctctgttt cctcccgttg gattgaccaa gtttcagacc gaacaatgga ggaagacgat 421 tgaagcggtc tgtgaaagct caaaatattg gaatttatca gaatgccgga tggtaacttc 481 agatgattgt ctgattctca gaggacgagt tataactcct gattgcagtt catcagttaa 541 atctcagaat tctgtacgga gttcagaatc tctctattca tcccactcac caggacctgc 601 acctggggca tcagggtcaa tgaatttatg ggacttgaag tcaacagaag tccaactgac 661 ctcaaagaga gcaggagtca aggacatgac agtcaagtta acagacttct ttggaagtga 721 ggaagagtat tattcagtat gcccagaagg agcatcagat ttgatgggag caattattat 781 gggactgaag cataagaaac tctttaatca ggcaagaatg aaatatcgac tttaacccct 841 tactacagtc aatatg // LOCUS VSVPPS 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 06/85-NM-E) phosphoprotein gene, complete cds. ACCESSION M31863 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 06/85-NM-E, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and itch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 288 a 161 c 190 g 217 t ORIGIN 1 aacagagatc atggacaata ttgatcggct caaaacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca 121 agacctcttc atcgaagatc agggagataa accaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataattc tgaacatctt acgatgttgt cgccggatga 241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagatcgatg atgatatacc 301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact 361 atctctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat 421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc 481 agatgattgt cttatcctca aaggccgagt tatgactcct gactgcagtt catcaatcaa 541 atctcagaac tccacacagg gttctgaatc tctatcctcc tcacattcac ccggtccagc 601 acccaaggtc aaggagtcaa ccagtctatg ggacttaaga tcaacggaag tacaattgac 661 ttcgaagagg gctggagtga aagacatgat ggtgaaattg acagactttt ttgggagtga 721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat 781 gggactaaaa cataagaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat 841 tccgatgatc gatatg // LOCUS VSVPPT 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 11/83-CA-B) phosphoprotein gene, complete cds. ACCESSION M31864 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 11/83-CA-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 292 a 158 c 186 g 220 t ORIGIN 1 aacagagatc atggacagta ttgatcggct caaaacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca 121 agacctcttc atcgaaaatc agggagataa aacgactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataattc tgaacatctt acgatgttgt cgccggatga 241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc 301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact 361 atctctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat 421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc 481 agatgattgt cttatcctca aaggccgagt tatgactcct gactgcagtt catcaatcaa 541 atctcagaac tccatacaga gttccgaatc tctctcctct tcacattcac ccggtccagc 601 acccaaagtc aaggagtcaa ccagtctatg ggacttaaga tcaacagaag tacaattgat 661 ctcgaagagg gctggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga 721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat 781 gggactaaaa cataagaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat 841 ttcgatgatc aatatg // LOCUS VSVPPU 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 01/84-SN-P1) phosphoprotein gene, complete cds. ACCESSION M31865 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 01/84-SN-P1, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 289 a 160 c 190 g 217 t ORIGIN 1 aacagagatc atggacaata ttgatcggct caaaacttat ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca 121 agacctcttc atcgaagatc agggagataa accaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataagtc tgaacatctt acgatgttgt cgccggatga 241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagatcgatg atgatatacc 301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact 361 atctctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat 421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc 481 agatgattgt cttatcctca aaggccgagt tatgactcct gactgcagtt catcaatcaa 541 atctcagaac tccacacagg gttctgaatc tctatcctcc tcacattcac ccggtccagc 601 acccaaggtc aaggagtcaa ctagtctatg ggacttaaga tcaacggaag tacaattgac 661 ttcgaagagg gctggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga 721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat 781 gggactaaaa cataagaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat 841 tccgatgatc gatatg // LOCUS VSVPPV 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 07/83-GA-P) phosphoprotein gene, complete cds. ACCESSION M31866 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 07/83-GA-P, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 296 a 161 c 188 g 211 t ORIGIN 1 aacagagatc atggacagta ttgatcggct caaaacttac ttggctactt atgataattt 61 ggattctgcc ctgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca 121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataatgt tgaacatctt acgatgttat cgccggatga 241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc 301 agtcactttt tctgactgga cacaacctgt aatgaaggaa aacggagggg agaaatcact 361 atctctgttt cctccggttg gattaacaaa agttcagaag gaccaatgga ggaagacaat 421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc 481 agaagattgt cttatcctca aaggccgaat tatgactccc gactgcagct catcaatcaa 541 atctcagaac tccatacaga gttccgaatc gctctcctcc tcgcattcac ccggtccagc 601 acccaaagtc acggagtcaa ctagtctatg ggatttaaaa tcgacagaag tacaactgat 661 atcgaagagg gccggagtaa aagacatgat ggtgaagttg acagactttt ttggaagtga 721 agaggaatat tattcagtat gcccaaaagg ggcaccagac ttgatgggag caatcatcat 781 gggactaaaa cataaaaagc tttttaatca agcaagaatg aagtatcgtc tctaattaat 841 tccgatgaac gatatg // LOCUS VSVPPW 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 11/82-VC-B2) phosphoprotein gene, complete cds. ACCESSION M31867 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 11/82-VC-B2, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 291 a 164 c 187 g 214 t ORIGIN 1 aacagagatc atggacagta ttgatcggct caaaacttac ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca 121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccggatga 241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc 301 agtcactttc tctgactgga cacaacctgt aatgaaggaa aacggaggtg agaaatcact 361 atctctgttc cctccggttg gattaacaaa agttcagacg gaacaatgga ggaagacaat 421 tgaggcagtc tgtgagagct ctaaatattg gaatttgtca gaatgccaga ttatgaactc 481 agatgattgt cttatcctca aaggccgagt gatgactcct gactgcagtt catcaatcaa 541 atctcagaat tccatacaga gttctgaatc tctctcctcc tcgcattcac ccggtccagc 601 acccaaagtc acggagtcaa ctagtctatg ggacttaaaa tcaacagaag tacaactgat 661 ctcgaagagg gccggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga 721 agatgaatat cattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat 781 gggactaaaa cataaaaaac tctttaatca agcaagaatg aagtatcgtc tctaattcat 841 tccgatgatc gatatg // LOCUS VSVPPX 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 07/84-OA-B) phosphoprotein gene, complete cds. ACCESSION M31868 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 07/84-OA-B, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 294 a 158 c 186 g 218 t ORIGIN 1 aacagagatc atggacagta ttgatcggct caaaacttac ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagagaga agagaggata aatatctcca 121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccggatga 241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc 301 agtcactttt tctgattgga cacaacctgt aatgaaggaa aacggagggg agaaatcact 361 atctctgttt cctccggttg gattaacaaa aattcagacg gaccaatgga ggaagacaat 421 tgaggcagtc tgtgagagct caaaatattg gaatttgtca gaatgccaaa ttatgaactc 481 agataattgt cttatcctca aagtccgagt tatgactcct gactgcagtt catcaatcaa 541 atctcagaac tccatacaga gttccgaatc tttctcctcc tcgcattcac cgggtccagc 601 actaaaagtc acggagtcaa ctagtctatg ggacttaaga tcaacagaag tacaactgat 661 ctcgaagagg gccggagtga aagacatgat ggtgaaattg acagactttt ttggaagtga 721 agatgaatat tattcagtat gcccagaagg ggcaccagac ttgatgggag caatcatcat 781 gggactaaaa cataaaaaac tctttaatca agcaagaatg aagtatcgtc tctaattaat 841 tccgatgatc gatatg // LOCUS VSVPPY 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain ../52-GA-P) phosphoprotein gene, complete cds. ACCESSION M31869 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain ../52-GA-P, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 292 a 164 c 189 g 211 t ORIGIN 1 aacagagatc atggatagtg ttgatcggct caaaaattac ttggctactt atgataattt 61 ggattctgcc ttgcaggatg caaatgaatc tgaagaaaga agagaggata aatatctcca 121 agacctcttc atcgaagatc aaggagataa atcaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgttgt cgccggatga 241 aagaatagac aaatgggaag aggatttgcc cgaattggaa aagattgatg atgatatacc 301 agtcactttt tctgactgga cacaacctgt aatgaaggaa aacggagggg agaaatcact 361 ctatctgttt cctccggttg gattaacaaa agttcagacg gaccaatgga ggaagacaat 421 tgatgcggtc tgtgagagct ccaaatattg gaatttgtca gaatgccaga ttatgaactc 481 agaagattgt cttatcctca aaggccgaat tatgactcct gactgcagct catcaatcaa 541 atctcagaac tccatacaga gttccgaatc gccctcctcc tcgcattcac ccggtccagc 601 acccaaagtc acggaatcaa ctagtctatg ggacttaaaa tcaacagacg tacaactgat 661 ctcgaagagg gccggagtga aagacatgac ggtgaagttg acagactttt ttggaagtga 721 agaggaatat tattcagtat gcccaaaagg ggcaccagac ttgatgggag caatcatcat 781 gggactaaaa cataaaaagc tttttaatca agcaagaatg aagtatcgtc tctaattaat 841 tccgatgaac gatatg // LOCUS VSVPPZ 856 bp ss-RNA VRL 08-AUG-1990 DEFINITION Vesicular stomatitis virus (strain 10/84-GM-P) phosphoprotein gene, complete cds. ACCESSION M31870 KEYWORDS phosphoprotein. SOURCE Vesicular stomatitis virus (strain 10/84-GM-P, New Jersey serotype), cDNA to viral RNA. ORGANISM Vesicular stomatitis virus Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Rhabdoviridae; Vesiculovirus. REFERENCE 1 (bases 1 to 856) AUTHORS Bilsel,P.A., Rowe,J.E. and Fitch,W.M.Nichol.S.T. TITLE Phosphoprotein and nucleocapsid protein evolution of vesicular stomatitis virus, New Jersey JOURNAL J. Virol. 64, 2498-2504 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by S.T. Nichol, 05-FEB-1990. FEATURES from to/span description pept 11 835 phosphoprotein BASE COUNT 283 a 166 c 183 g 224 t ORIGIN 1 aacagatatc atggacagtg ttgatcggct caagacttac ttagctactt atgataattt 61 ggattctgcc ctgcaggatg ccaatgaatc tgaggaaaga cgagaggata aatatctcca 121 agacctcttc atcgaagatc aaggagataa accaactccg tcatattatc aggaagaaga 181 atcgtcagat tcagatactg actataatgc tgaacatctt acgatgctgt cgccggatga 241 aagaatagac aagtgggaag aagacttgcc tgaattagaa aagattgatg atgatatacc 301 ggtaaccttt cctaattgga cacagcctgt aatgaaggaa aatggaggag agaaatcatt 361 gtccctgttc cctccagttg ggttaacaaa gattcaaaca gaacaatgga gaaaaaccat 421 tgaggcggtt tgtgagagtt caaaatattg gaatttatca gaatgccaaa ttcttaactt 481 agaagactgc ctcactctca aaggccgagt tatgactcct gattgtagtt cttcagtcaa 541 atctcaaaat tctgtccgga gttcagaatc tctctactct tctcattccc caggtcccac 601 actcaaggta tcagagtcca tcaatctatg ggatttaaag tccacagaag tacaattgat 661 ctctaagaga gctggagtta aggacatgac agtcaaattg acagacttct tcggaagtga 721 ggaagagtat tattcagtat gcccagaagg ggcaccagac ttgatgggag ccatcatcat 781 gggattgaag tacaagaaac tcttcaatca ggcaagaatg aagtatcgcc tttaattcct 841 tcttatgatc agtatg // LOCUS HUMCYPIIF 1825 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human cytochrome P450IIF1 protein (CYP2F) mRNA, complete cds. ACCESSION J02906 KEYWORDS cytochrome P450. SOURCE Human lung, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1825) AUTHORS Nhamburo,P.T., Kimura,S., McBride,O.W., Kozak,C.A., Gelboin,H.V. and Gonzalez,F.J. TITLE The human CYP2F gene subfamily: Identification of a cDNA encoding a new cytochrome P450, cDNA-directed expression and chromosome mapping JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.T.Nhamburo, 18-MAY-1990. FEATURES from to/span description pept 56 1531 cytochrome P450IIF1 /nomgen="CYP2F" /map="19q13.1-q13.2" /hgml_locus_uid="LV0170W" mRNA < 1 1813 CYP2F mRNA BASE COUNT 356 a 596 c 470 g 403 t ORIGIN 1 gcaggctcag cgcatcccag ccagtgtctc ctgcagctca gcagctgcct tcaccatgga 61 cagcataagc acagccatct tactcctgct cctggctctc gtctgtctgc tcctgaccct 121 aagctcaaga gataagggaa agctgcctcc gggacccaga cccctctcaa tcctgggaaa 181 cctgctgctg ctttgctccc aagacatgct gacttctctc actaagctga gcaaggagta 241 tggctccatg tacacagtgc acctgggacc caggcgggtg gtggtcctca gcgggtacca 301 agctgtgaag gaggccctgg tggaccaggg agaggagttt agtggccgcg gtgactaccc 361 tgcctttttc aactttacca agggcaatgg catcgccttc tccagtgggg atcgatggaa 421 ggtcctgaga cagttctcta tccagattct acggaatttc gggatgggga agagaagcat 481 tgaggagcga atcctagagg agggcagctt cctgctggcg gacgtgcgga aaactgaagg 541 cgagcccttt gaccccacgt ttgtgctgag tcgctcagtg tccaacatta tctgttccgt 601 gctcttcggc agccgcttcg actatgatga tgagcgtctg ctcaccatta tccgccttat 661 caatgacaac ttccaaatca tgagcagccc ctggggcgag ttgtacgaca tcctagaccc 721 cagattcccg agcctcctgg actgggtgcc tgggccgcac caacgcatct tccagaactt 781 caagtgcctg agagacctca tcgcccacag cgtccacgac caccaggcct cgtctccccg 841 ggacttcatc cagtgcttcc tcaccaagat ggcagaggag aaggaggacc cactgagcca 901 cttccacatg gataccctgc tgatgaccac acataacctg ctctttggcg gcaccaagac 961 ggtgagcacc acgctgcacc acgccttcct ggcactcatg aagtacccaa aagttcaagc 1021 ccgcgtgcag gaggagatcg acctcgtggt gggacgcgcg cggctgccgg cgctgaagga 1081 ccgcgcggcc atgccttaca cagacgcggt gatccacgag gtgcagcgct ttgcagacat 1141 catccccatg aacttgccgc accgcgtcac tagggacacg gcctttcgcg gcttcctgat 1201 acccaagggc accgatgtca tcaccctcct taacaccgtc cactacgacc ccagccagtt 1261 cctgacgccc caggagttca accccgagca ttttttggat gccaatcagt ccttcaagaa 1321 gagtccagcc ttcatgccct tctcagctgg gcgccgtctg tgcctgggag agctgctggc 1381 gcgcatggag ctctttctgt acctcaccgc catcctgcag agcttttcgc tgcagccgct 1441 gggtgcgccc gaggacatcg acctgacccc actcagctca ggtcttggca atttgccgcg 1501 gcctttccag ctgtgcctgc gcccgcgcta acgccccggc ccttccagat tcgcctgtga 1561 gcgatgaggc ccacccatgt gggttgctac gtccccttct tggtccacag tctgccctca 1621 tccctctggc agtcacgctg tcttccctgc atgctgtgcc tgccgcgtgc ccttccccca 1681 tccctccaat ctgtgccccg tctgcagggc agaggcagat gtggcatgtc tttttgtacc 1741 cacagagctt gttctatggc acgccctttt ctaggctttt tgtatcattt cttagtacat 1801 tgtaatagat tcaaaccagt cttgg // LOCUS MUSADA3A 891 bp ds-DNA ROD 08-AUG-1990 DEFINITION Mouse adenosine deaminase (ada) gene, 3' end. ACCESSION J05523 KEYWORDS adenosine deaminase. SOURCE Mouse fibroblast cell line B-1/200 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 891) AUTHORS Maa,M.-C., Chinsky,J.M., Ramamurthy,V., Martin,B.D. and Kellems,R.E. TITLE Identification of transcription stop sites at the 5' and 3' ends of the murine adenosine deaminase gene JOURNAL J. Biol. Chem. 265, 12513-12519 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.Ramamurthy, 18-MAY-1990. BASE COUNT 241 a 207 c 262 g 181 t ORIGIN 1 atagctggcc accttgcctt gattctatca aaggcattcc agtcccagca tcaagcacat 61 tctgtcctta tgcctccctt tcagacatca gtcatggtgg ctagcaagca tcatgtactg 121 ggaggaccca aatacacaaa gactctcagt aaggcaaacg gtgttgtaaa ggcctacatg 181 aggggactag tgatggctaa ggatgaaggc aggcagcaaa caaagctgat gagagctgag 241 cagagcacag gagccttgaa gagccccttg gcactgtgcc cggtgtgcca gctgggtcca 301 ctgctcattc ggttggacag tgagcagccg ttgagtaggt acccgtcacg tgccaaacac 361 tgttagtcac agagggcatg agacagcctg tgctggaaca gctcagttca tagggccatg 421 gagatgggga gaaaggggcg cctctgtcag agacaagctg tggtctggga aggccttagc 481 actaaaagca ccacaatgag aagcaaccgc cagaagcagg gcccgcaggc ctttgttcca 541 gctgcaaaga gaaaggaaaa agtggggaat aagagttggg gctgcggagg gggtggggag 601 cattgtgcag gttccgtact tgaacagaaa gcagggacca acacaaggaa ggccaaaggg 661 ctcagcctgg gaaactttgt gaggctatgg aggtaagggg cagggtattc tcccgaggca 721 tagagactgt cataccaatg atggtcattg gctgggcgag cctaagaaga ctccagtttc 781 ccttgacctc agttaaatct attacatgag aaactgggtc cctgtcccca acattgtccc 841 tgttccgtgt tcctaagtca tttctgggca ataaggactg agaactctag a // LOCUS YSCG6PD 3782 bp ds-DNA PLN 08-AUG-1990 DEFINITION S.cerevisiae glucose-6-phosphate dehydrogenase (ZWF1) gene and ORF, complete cds. ACCESSION M34709 KEYWORDS glucose-6-phosphate dehydrogenase. SOURCE S.cerevisiae (strain S288C) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3782) AUTHORS Nogae,I. and Johnston,M. TITLE Isolation and characterization of the ZWF1 gene of Saccharomyces cerevisiae, encoding glucose-6-phosphate dehydrogenase JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Johnston, 18-MAY-1990. FEATURES from to/span description pept 1585 3102 glucose-6-phosphate dehydrogenase (ZWF1) (EC 1.1.1.49) pept 246 1037 pot. ORF mRNA 1472 > 3102 ZWF1 mRNA BASE COUNT 1066 a 899 c 887 g 930 t ORIGIN 1 atcgattagc aaaaccgtag agaagatcat tgcgcataag aaacaaaaag aaaataccgg 61 tgcggaccgc aagcctctat tatctgctgt gtgtccggga tttcttatat acacggaaaa 121 gaccaaaccg cagttggtgc ctatgctgct caatgtcaag tcgcctcagc aaattacggg 181 ctcgttgatc agagctacgt tcgagagtct ctatcgcgcg tgagtccttt taccatctgt 241 cgctgatgcc atgttttgac aagaaactag aagcgtcaag gccagaatcg cttgacgatg 301 ggattgactg tgtcattacg ccacgtgaaa tagtgacaat gttacaagag ttgaatttag 361 acttcaagtc gtttcttact gaggacacaa gtttatatgg cgctatcacc gccgggtggg 421 acccgcgagt ccactgggcg tcgaatctgg gaggcacttg tggtggctat gcctaccagt 481 acgtaacggc tgtccaacga ctacacccag gaagtcagat gatagttctg gagggtagaa 541 acagcgacat tgtcgagtac cgattgctgc acgatgatcg cataatagcc gccgccagcg 601 agctttccgg gtttagaaac atccaaaatc tggtgcgtaa actgacctcg ggctccgggt 661 cagagcgcaa gaggaacatc agctctgcgg aagagagacg aacaggtcca aaggcgaact 721 cacgcgagat ggcgccagcc gccacggccg atccctacca ctcggactac atcgaggtaa 781 acgcgtgtcc gggtgcatgc atgaacggtg gtggcctact caatggcgag caaaacagtc 841 tcaagcggaa gcagttggtt cagacactga ataagcgtca cggagaggag ctagcaatgg 901 tagatcctct aaccctggga cccaagctag aagaggcggc accgcccgct ttcgctagag 961 tacgtcttcg cgcccgtcaa gcaggccgtc gaaaaggatc tcgtctctgt tgggagcacc 1021 tggtaagtaa ggtgtagttt tgcacccgtg tacataagcg tgaaatcacc acaaactgtg 1081 tgtatcaagt acatagtgac atttaaataa tagcaagaac aacaataata gtagcgctac 1141 tggaagcacc acgtaatagt ggaaaagaac tggaaaaacc gctataagat gcatactccg 1201 gcggtcttac gcggagatac aagcttccaa cggtgctaaa agcccggttt cggctcggcc 1261 ggaggaggaa gagagacgaa aaaaaaaaaa atgactaaaa aaaaaatgga atattattaa 1321 tgtgggattt ttggctcaag gtgtggtggc cccttttcta agggtggcga attcttcaat 1381 gtacggaaaa ctcgccaagg ctatcccata tataagcaaa ctgtgggttc atctatatac 1441 cgacacataa cacctaaagt ggcttcctcc tgcccctctc tcccttttct ccactcaccc 1501 ctccttctcc cccttccccc tctccaattg gctgtataga cagaaagagt aaatccaata 1561 gaatagaaaa ccacataagg caagatgagt gaaggccccg tcaaattcga aaaaaatacc 1621 gtcatatctg tctttggtgc gtcaggtgat ctggcaaaga agaagacttt tcccgcctta 1681 tttgggcttt tcagagaagg ttaccttgat ccatctacca agatcttcgg ttatgcccgg 1741 tccaaattgt ccatggagga ggacctgaag tcccgtgtcc taccccactt gaaaaaacct 1801 cacggtgaag ccgatgactc taaggtcgaa cagttcttca agatggtcag ctacatttcg 1861 ggaaattacg acacagatga aggcttcgac gaattaagaa cgcagatcga gaaattcgag 1921 aaaagtgcca acgtcgatgt cccacaccgt ctcttctatc tggccttgcc gccaagcgtt 1981 tttttgacgg tggccaagca gatcaagagt cgtgtgtacg cagagaatgg catcacccgt 2041 gtaatcgtag agaaaccttt cggccacgac ctggcctctg ccagggagct gcaaaaaaac 2101 ctgggggccc tctttaaaga agaagagttg tacagaattg accattactt gggtaaagag 2161 ttggtcaaga atcttttagt cttgaggttc ggtaaccagt ttttgaatgc ctcgtggaat 2221 agagacaaca ttcaaagcgt tcagatttcg tttaaagaga ggttcggcac cgaaggccgt 2281 ggcggctatt tcgactctat aggcataatc agagacgtga tgcagaacca tctgttacaa 2341 atcatgactc tcttgactat ggaaagaccg gtgtcttttg acccggaatc tattcgtgac 2401 gaaaaggtta aggttctaaa ggccgtggcc cccatcgaca cggacgacgt cctcttgggc 2461 cagtacggta aatctgagga cgggtctaag cccgcctacg tggatgatga cactgtagac 2521 aaggactcta aatgtgtcac ttttgcagca atgactttca acatcgaaaa cgagcgttgg 2581 gagggcgtcc ccatcatgat gcgtgccggt aaggctttga atgagtccaa ggtggagatc 2641 agactgcagt acaaagcggt cgcatcgggt gtcttcaaag acattccaaa taacgaactg 2701 gtcatcagag tgcagcccga tgccgctgtg tacctaaagt ttaatgctaa gacccctggt 2761 ctgtcaaatg ctacccaagt cacagatctg aatctaactt acgcaagcag gtaccaagac 2821 ttttggattc cagaggctta cgaggtgttg ataagagacg ccctactggg tgaccattcc 2881 aactttgtca gagatgacga attggatatc agttggggca tattcacccc attactcaag 2941 cacatagagc gtccggacgg tccaacaccg gaaatttacc cctacggatc aagaggtcca 3001 aagggattga aggaatatat gcaaaaacac aagtatgtta tgcccgaaaa gcacccttac 3061 gcttggcccg tgactaagcc agaagatacg aaggataatt agaaaaatgc aagcacattc 3121 atttatcggc taagtcactg aaattttttt tttcgagtga ttttatcttg caagctcact 3181 ctctttgttc ttttctcctc tataatggca tctccccccc accaactcct ctctacttgc 3241 gtgtgtattt gtttgtatac atgtgtaaat atatatacat aaaagaatgt cgtctcatgt 3301 attttttaac ttttagccgc ggtcagtgac attttggctt tcccaccatt ccacgtctga 3361 aaaaaaaaaa aaaaaaagag ataaaatatg aattgaatat atatcaaaaa tgtctgcaaa 3421 aatttttacg aatcagtccg attggacttg tacttatcat tgatttgctc gttatgcgtc 3481 ttatccactt gattagctat tccctgtaaa gtcttagaca ctgcctcagt ggccccaatt 3541 aacggccgaa taattgccac aggtgcagct ctagcaacat aaactgctgc ggcgctcggc 3601 cctcccggtt tatcatcctt catctgtccc tttgccctcc ataccgcatc gtacgcaata 3661 tgcatatgct tttctaatga actatacgcc tccttcaaac ccgtgggcag gtcgaggggc 3721 tgatctgcgt acagactcac tattgtgggt tcgttgtgat ctcccggtga agacatatcg 3781 at // LOCUS BOVLMCB 326 bp ds-DNA SYN 08-AUG-1990 DEFINITION Synthetic bovine liver microsomal lipase-solubilized cytochrome b-5 gene, complete cds. ACCESSION J02905 KEYWORDS cytochrome b-5. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 326) AUTHORS Funk,W.D., Lo,T.P., Mauk,M.R., Brayer,G.D., MacGillivray,R.T.A. and Mauk,A.G. TITLE Mutagenic, electrochemical, and crystallographic investigation of the cytochrome b-5 oxidation-reduction equilibrium: Involvement of asparagine-57, serine-64, and heme propionate-7 JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by G.Mauk, 16-MAY-1990. FEATURES from to/span description pept 27 311 microsomal cytochrome b-5 binding 16 19 ribosome binding site BASE COUNT 106 a 82 c 77 g 61 t ORIGIN 1 gtacggtacc gtaaaggaga acaacaatga gcaaagctgt caaatactac actctggaac 61 aaatccaaaa acacaacaac tcgaaatcga cgtggctgat cctcgactac aaagtatacg 121 acctgactaa attcctggaa gagcacccgg gaggcgaaga agtcctgcgc gaacaggccg 181 gcggtgacgc gactgaaaac tttgaggatg ttggccatag taccgacgct cgtgaactgt 241 cgaaaacgtt catcatcggt gagctgcacc cggacgatcg ttctaaaatc accaaaccgt 301 ccgaatctta atgagaattc gtctga // LOCUS DROEST6P 5424 bp ds-DNA INV 08-AUG-1990 DEFINITION D.melanogaster carboxylesterase 6 and P (Est-6 and Est-P) genes, complete cds. ACCESSION M33780 M33781 KEYWORDS carboxylesterase. SOURCE D.melanogaster (strain Dm145) adult DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 929 to 4027) AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and Richmond,R.C. TITLE Molecular analysis of duplicated esterase genes in Drosophila melanogaster JOURNAL Mol. Biol. Evol. 7, 9-28 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 5424) AUTHORS Collet,C., Nielsen,K.M., Russell,R.J., Karl,M., Oakeshott,J.G. and Richmond,R.C. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.C.Richmond, 19-APR-1990. FEATURES from to/span description pept 1173 2559 carboxylesterase 6 (Est-6) precursor (EC 3.1.1.1), exon 1 2611 2858 carboxylesterase 6 precursor, exon 2 sigp 1173 1235 carboxylesterase 6 signal peptide matp 1236 2559 carboxylesterase 6 2611 2855 carboxylesterase 6 pept 3052 4438 carboxylesterase P (Est-P) precursor (EC 3.1.1.1), exon 1 4495 4742 carboxylesterase P precursor, exon 2 sigp 3052 3111 carboxylesterase P signal peptide matp 3112 4438 carboxylesterase P 4495 4739 carboxylesterase P pre-msg 1132 2867 Est-6 mRNA and intron (alt.) pre-msg 1132 2998 Est-6 mRNA and intron (alt.) IVS 2560 2610 Est-6 intron A IVS 4439 4494 Est-P intron A signal 1104 1110 TATA box BASE COUNT 1536 a 1117 c 1235 g 1536 t ORIGIN Chromosome 3L 69A1-5. 1 aagcttgcta tatatctatc tgtatatacc tatttgctta gaccgaactt aatgattttc 61 cctttttcca aaacttacac aaatatatac ctaatacctt acactgaaaa aactgttcaa 121 ttcactatgt acttattagt agttcttata aaccttgtta ttactaatta cgtacgaaaa 181 cagaacggga aataaatatc taaaaattta aatagtaaac tgagctgatg tttctacaca 241 aggatgaaca caagtatcaa agtgctcata ggtacagcat ttctagaaca caattacacc 301 agtcttttgc tagcaataaa ttttcactca tctcccactt ttctctagaa ctatcctgaa 361 tttatttatt tgattctatt ttgtcgcaag cacactcaat tcttgatctt tttaataata 421 atcgtagcgg aaagctgtat ctgtacaggc aagaccaccg cccctcaaaa tgccaacgaa 481 ctacagtcgg attcaaaagc gtgcacctgc aagtgcaagt tgtatgatat cttgcaccat 541 tttatttcga tacacgtttt ggtacttcga tttaatatgg ctaattgtta ttcagaaaat 601 aaacacaaaa acttaggaca catgttcaaa aatagaaata tgtacatatt tgttgcaaac 661 cttaaatctt attaattatc tttatctata atcaaaaagc ctactcattt gttaaatggt 721 ttcgttttaa agtgcattat caaattcatc tggataaaca ttcaaacaaa aaattgcatg 781 catgtaaagt aattaatcaa gttgactcga agcgattgtg caatataagt ctcacctgaa 841 gtgggattgg caactgcttg ggatgcatgt cgaaaatcta tattaagccc agtcaaatat 901 tttaagcgta aagtaacaaa cctaatttgt atacggctat cgttttaatt cgcacacgcc 961 tatcaactgg atgatgttca cactagagtt actcccattt gaaagccggg cattggaaaa 1021 ctaatctcat gcgtgcccag atctcaattg agactggttg actggatgtt caggtggccg 1081 ggtggcgata agccgatcga tggaataaaa ggggccgcaa ttgccgcatc tcaagatagt 1141 tgcggtctga attcgccgga gtgaggagca acatgaacta cgtgggactg ggacttatca 1201 ttgtgctgag ctgcctttgg ctcggttcga acgcgagtga tacagatgac cctctgttgg 1261 tgcagctgcc ccagggcaag ctacgtggtc gcgataatgg aagctactac agctacgaat 1321 cgattcccta cgccgaaccg cccactggcg atctacgatt cgaggctcca gagccgtaca 1381 aacaaaagtg gtcggatata ttcgatgcca ccaaaacccc ggtggcgtgc ctgcagtggg 1441 atcagttcac gcctggggcc aacaaattgg taggagagga ggattgccta accgtcagcg 1501 tctacaagcc gaagaatagc aagaggaata gctttccggt ggtggcccac attcacggag 1561 gtgcctttat gttcggtgca gcatggcaaa atggacacga gaacgtgatg cgtgagggca 1621 aattcattct ggtgaagata agctatcgcc tggggccatt gggtttcgtg agcaccggcg 1681 atagggatct tcccggaaac tatggactga aagatcaacg gctggctctc aaatggatta 1741 agcagaatat agccagtttt ggtggagaac cgcagaacgt actgttggtt ggtcactccg 1801 ctggaggagc ttcggtccat ctgcagatgc ttcgtgaaga tttcggccag ctggccaggg 1861 cggcattctc gtttagtgga aatgctctag atccatgggt tatacagaag ggagcaagag 1921 gacgagcctt tgaactggga cgcaacgtgg gatgtgaatc ggctgaagac tcgaccagcc 1981 tgaagaaatg cctaaagtca aagccagcca gtgaattagt caccgccgtc cgtaaattcc 2041 ttatattttc ctatgtgccc tttgctccat ttagtcctgt attggagcca tcggatgctc 2101 cagacgccat tatcacccag gatcccaggg atgtcattaa gagcggaaag ttcggacagg 2161 ttccgtgggc tgtttcctat gtcacagagg atggtggcta caatgccgcc ttgcttttga 2221 aggaacggaa atctggaata gttatcgatg atctaaacga gcgttggctt gagttggcac 2281 catatttact attctaccgg gacacgaaga ccaaaaagga tatggacgac tactcgcgga 2341 aaattaagca ggagtatata ggcaatcaga gatttgacat cgaaagctat tcagaattgc 2401 agcggctatt cacggatatt ctcttcaaga atagcacgca ggagtcattg gatcttcatc 2461 gcaaatatgg aaagagtcct gcctacgctt atgtctatga caatccagcc gaaaaaggaa 2521 tcgcacaggt cctggccaat cgaaccgatt atgattttgg taaggaaatc gtacttttaa 2581 atggacttag ttaaatcatt acttttatag gaactgtaca cggtgacgac tactttttga 2641 tattcgaaaa tttcgtacga gatgtggaaa tgcgtccgga tgagcagata atttcgagaa 2701 attttatcaa tatgctggca gattttgctt cgagtgataa tggctctcta aaatatggtg 2761 aatgcgattt caaagataat gtaggtagtg agaaattcca attattagct atttatattg 2821 atggctgcca gaataggcag catgtggaat ttccgtaagt tacatgaata aaatcaaaaa 2881 tttttcgttc tgtgtaattt ttaattattt aattattctc aactggcttt aaatatcatt 2941 tgtacaaaac gtgtttgtcg ctttatattt tggtttttgt tgtttcttta taaagaataa 3001 taaacgtttg tttacccgca gtttgattag tactttgtgg agttccagat catgagtata 3061 ttcaaacggc tgttgtgcct gactttgctg tggatagcag ctttagaatc tgaagctgat 3121 cccttgattg ttgagataac aaatggaaaa atccgtggca aagataatgg gttgtactac 3181 agctacgaat cgattcccta tgccgagcat ccaactggtg ccctccgttt tgaagcacct 3241 cagccgtata gtcatcattg gactgatgtt ttcaatgcca cgcagtctcc agttgagtgc 3301 atgcagtgga atcagtttat aaacgaaaac aataagctga tgggtgatga ggattgctta 3361 acggtaagca tctataagcc aaagaaaccc aatcggagca gctttcctgt cgtagtactc 3421 ctgcatggag gtgctttcat gttcggtagt ggatccatat atggacacga ctccattatg 3481 cgtgagggaa ctttgcttgt ggtaaaaata agctatcgtc ttggaccatt gggttttgca 3541 agtaccggcg atagacactt gccgggaaac tatggtctaa aggatcaacg tctggcccta 3601 caatggatca agaagaacat tgctcacttt ggtggaatgc cagataatat tgtgctcatt 3661 ggtcactctg caggcggtgc ttcggctcat ttgcagctgt tgcacgagga tttcaaacat 3721 ttggccaaag gagcgatttc ggtgagcggc aatgcattgg atccttgggt catacagcag 3781 ggtggacgac gacgtgcatt tgaactgggt cgtattgtcg gttgtggaca cacaaatgtc 3841 tccgcagaac tcaaggactg cttgaagtct aagccggcta gcgatatagt ctctgctgtc 3901 cgaagcttcc ttgtgttttc ctatgtaccc ttcagtgctt ttggacctgt tgtggagccg 3961 tcagatgcac cagacgcctt tctaaccgag gacccaagag cagtgattaa gagcgggaag 4021 tttgcccaag tcccttgggc tgtgacgtac accactgagg acgggggata caacgctgct 4081 cagctgttgg aaagaaacaa attaactggc gagagttgga ttgacctact caatgatcga 4141 tggtttgatt gggcaccata cttgctcttc tatcgggacg ccaagaaaac catcaaagat 4201 atggatgatc tttcatttga tctcaggcag cagtatctag cagatcggcg attcagtgtg 4261 gaaagttatt ggaacgtgca gcgaatgttt actgatgttc ttttcaagaa tagcgtgcca 4321 agtgcaatag atcttcaccg aaagtatggc aaaagtccgg tttattcttt tgtctacgat 4381 aatcctaccg attccggagt gggtcaattg ctttccaatc gaacagatgt acattttggt 4441 gggtacatac ttgcgtttca aatacttaaa gctctagctt tcttattatt aaaggtactg 4501 tccacggaga tgactttttc ttgattttca atacagctgc ataccgtatc ggcattcgtc 4561 cggatgaaga agttatttca aaaaagttta taggtatgct ggaggatttc gcactcaacg 4621 ataagggaac attaacattt ggagaatgta atttccaaaa taatgtgaac agcaaggaat 4681 atcaagtgct gcgtatttca cgaaacgctt gtaaaaacga ggaatatgct cggtttccct 4741 aaggaataaa tgcatttaaa aaatgtaaac agtataataa tgtaattaaa gaacaaattt 4801 attacacctt tatgccgcct caactgtcag aggtttttca ataagttcgt cttcttcttc 4861 tgggtcgttt tcctcgccct ccacattccc atacatctca atcttgctgt cgaggaagcg 4921 aaaatccttg aggatatgca acttgttgtg aaaccagcaa tggaaatgaa tttaatggcg 4981 atgtttgcca aacagcagag tagcaggcag tgtgctccac gggaatattg gcttgcagta 5041 ctccatttag agtgtaggat tgcacatgat tgtgctcacc accgattaag acacaatcgt 5101 ccacaaagtc acatagatgc acgcgtccgg ggaagctaaa gacacatgtg ctttccatgg 5161 aacgcaggtg gaaaattgaa gctttgggtc caccaccgca gagcagccaa tcctcgttga 5221 cagccactgc tcctatccat ttgccccaat ccggacgtaa cagatttggg ttcttgtaag 5281 gctccagcat tgaagtgtgc tgctgctgct tggtgctcca cactcgcaca gtgccatcct 5341 cggcgccaga gaagatctgt ccgttggcat tgcccaccac actgtgcacg tagtctgtgt 5401 gtccgcggta ctcgcgctga attc // LOCUS ECOLASNII 1530 bp ds-DNA BCT 08-AUG-1990 DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds. ACCESSION M34277 KEYWORDS L-asparaginase II. SOURCE E.coli (strain K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1530) AUTHORS Jennings,M.P. and Beacham,I.R. TITLE Analysis of the Escherichia coli gene encoding L-asparaginase II, ansB, and its regulation by cyclic AMP receptor and FNR proteins JOURNAL J. Bacteriol. 172, 1491-1498 (1990) STANDARD simple staff_review FEATURES from to/span description pept 385 1431 L-asparaginase II (ansB) precursor sigp 385 450 L-asparaginase II signal peptide matp 451 1428 L-asparaginase II mRNA 323 > 1431 ansB mRNA BASE COUNT 409 a 381 c 382 g 358 t ORIGIN 1 aaatgggcgc gaaagcggtg ctgaaaagcg gcggtaaccc attacagaat gtgctgggaa 61 gcctgggaag cctggggggg ctgcaatcct caatccaaac cgagtggaaa aagcaggaaa 121 aagatttcca gcagtttggc aaagatgttt gtagccgcgt tgtgactctg gaagatagcc 181 gcaaagccct ggtcgggaat ttaaaataat cctctatttt aagacggcat aatacttttt 241 tatgccgttt aattcttcgt tttgttacct gcctctaact ttgtagatct ccaaaatata 301 ttcacgttgt aaattgttta acgtcaaatt tcccatacag agctaaggga taatgcgtag 361 cgttcacgta actggaggaa tgaaatggag tttttcaaaa agacggcact tgccgcactg 421 gttatgggtt ttagtggtgc agcattggca ttacccaata tcaccatttt agcaaccggc 481 gggaccattg ccggtggtgg tgactccgca accaaatcta actacacagt gggtaaagtt 541 ggcgtagaaa atctggttaa tgcggtgccg caactaaaag acattgcgaa cgttaaaggc 601 gagcaggtag tgaatatcgg ctcccaggac atgaacgata atgtctggct gacactggcg 661 aaaaaaatta acaccgactg cgataagacc gacggcttcg tcattaccca cggtaccgac 721 acgatggaag aaactgctta cttcctcgac ctgacggtga aatgcgacaa accggtggtg 781 atggtcggcg caatgcgtcc gtccacgtct atgagcgcag acggtccatt caacctgtat 841 aacgcggtag tgaccgcagc tgataaagcc tccgccaacc gtggcgtgct ggtagtgatg 901 aatgacaccg tgcttgatgg ccgtgacgtc accaaaacca acaccaccga cgtagcgacc 961 ttcaagtctg ttaactacgg tcctctgggt tacattcaca acggtaagat tgactaccag 1021 cgtaccccgg cacgtaagca taccagcgac acgccattcg atgtctctaa gctgaatgaa 1081 ctgccgaaag tcggcattgt ttataactac gctaacgcat ccgatcttcc ggctaaagca 1141 ctggtagatg cgggctatga tggcatcgtt agcgctggtg tgggtaacgg caacctgtat 1201 aaatctgtgt tcgacacgct ggcgaccgcc gcgaaaaccg gtactgcagt cgtgcgttct 1261 tcccgcgtac cgacgggcgc taccactcag gatgccgaag tggatgatgc gaaatacggc 1321 ttcgtcgcct ctggcacgct gaacccgcaa aaagcgcgcg ttctgctgca actggctctg 1381 acgcaaacca aagatccgca gcagatccag cagatcttca atcagtacta atcgcctcgc 1441 cccggtatcg tgccggggct ttttcacttc agactcacgt ccattgccaa ttttaattac 1501 cctaatgata atcaccggaa taaattattt // LOCUS HUMPADPRP 2085 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human poly(ADP-ribose) polymerase (PADPRP) gene, exon 1. ACCESSION M34337 KEYWORDS poly(ADP-ribose) polymerase. SOURCE Human peripheral blood leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2085) AUTHORS Ogura,T., Nyunoya,H., Takahashi-Masutani,M., Miwa,M., Sugimura,T. and Esumi,H. TITLE Characterization of a putative promoter region of the human poly (ADP-ribose) polymerase gene: Structural similarity to that of the DNA polymerase beta gene JOURNAL Biochem. Biophys. Res. Commun. 167, 701-710 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1943 / 2062 poly(ADP-ribose) polymerase /hgml_locus_uid="LZ0154E" /nomgen="PPOL" /map="1q41-q42" pre-msg 1777 > 2085 PPOL mRNA and introns IVS 2063 > 2085 PPOL intron A BASE COUNT 495 a 535 c 550 g 505 t ORIGIN 1 tttagggatg atatagttgt caacccagag atggcatgat catgcctttt gacttggtca 61 ttctctaagt aaaactttta tttgttccat catattttcc acttattctg tttaccttca 121 aaatatcttt tttttttttt tttgagacag ggtcacactg tcacccaggc tagagtccag 181 tggcactatc atggctcacc acagcctcaa ccttcagggc tcaggtgatc ctcccacttc 241 agcctcccga gtagatggga ctacaggcac ctgccaccac ccccagctaa tttttgtaga 301 gacaaggttt tgccatgttg tccaggctgg tcttgaactc ctgggctcaa gggatccggc 361 cacctcagcc tcccaaagtg ctaggattat aggcatgagc cactgtgccc agcctacctt 421 caacgtatct aactggttac taacttttag gattcggcct atgtctcaca accttcttgc 481 ttactcaaca tccttgtctc ttaagccact agcttcttct ctatggttaa cactttttat 541 gagttttatt catctgctta tttttcttat cctctatacc agaattgaat attttcaaat 601 aaagcacact catgttacaa tctttgaaat ggaaaaaaaa aatgcatagg attagaaaag 661 aaaccaattt taataaacta tattttgaag tatagttcta tattaaacaa caagatctag 721 gccaggtgca gtggctcatg cctgtaatcc cagcaatttg ggaagtcgag gtgggaggat 781 tgcttgaggc caggggttca agaccagcct gggcaacatg gagagattcc ccatctcttt 841 ctttacacac acacacacac acacacacaa aatatctgat agcaacaggt gcagtcatta 901 ccacaatttc gagtagtgat gagcttaata atatttcgag ttatcaccaa caactgtaaa 961 ctaacatgaa aacgtctgtg atgactattg cccacaaagt cacaggtact gctaatactc 1021 ctggtatttg tagtaaattc ataataaagg aaatgctagg tttcagttgg tattttgtcc 1081 cgacggtctg tggacggcag gttagaacgc ccgtccaagc caggagggtg gacctagcac 1141 tgcagggtcc acctcgggcc aatcaactat attcccgagg cgggggcctg cgcttcccgg 1201 acccagctgc cctcagggga gagaggacac acttaagagt ttggggccgg cgtggtagct 1261 catgcccctg atcccagcac ttcgggaggc tgaggcgtga agatcacttg tagcaggagt 1321 ttgagaccag tctagccaac ttggcgagac cctgtcccta aaaaaaattt ttttttaatt 1381 agccagttgt ggtgagcgcc tgtagtccca gctactcggg aggctgaggt gggaggatcg 1441 ctgggctcag gagttccaga ctgcagtgag ccatgatggc ggcactgcac tccagcgcgg 1501 tgagactcag tctcaaaaat aaaaggggga ggggttgggg gtaaaattag ttgtgaaatc 1561 aagtaagact tcctgggaca gaacaatcaa aggggtggcg ccgggtcctc caaagagcta 1621 ctagctcagc ccaagccccg cctcggcccc cagggcagcg gccgcagagc tccacccggc 1681 aggcgcccgg gaaactccgc cccccggccg gcagggcgcg cgcccgccgg ccccgccccg 1741 tggacgcggg ttccgtggcg ttcccgcggc caggcatcag caatctatca gggaacggcg 1801 gtggccggtg cggcgtgttc ggtgcgctct ggccgctcag ccgtggcggc tgggtgagcg 1861 cacgcgaggc ggcgaggcgg caagcgtgtg tttctaggtc gtggcgtcgg gcttccggag 1921 ctttggcggc agctagggga ggatggcgga gtcttcggat aagctctatc gagtcgagta 1981 cgccaagagc gggcgcgcct cttgcaagaa atgcagcgag agcatcccca aggactcgct 2041 ccggatggcc atcatggtgc aggtgcgggc cgctgtgcgg cgggg // LOCUS HUMREVERB1 295 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human Rev-ErbA-alpha mRNA, 5' end. ACCESSION M34339 KEYWORDS . SEGMENT 1 of 2 SOURCE Human fetal skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 295) AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W. TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription from the noncoding DNA strand of a thyroid hormone receptor gene results in a related protein that does not bind thyroid hormone JOURNAL DNA 9, 77-83 (1990) STANDARD simple staff_review FEATURES from to/span description pept 204 > 295 Rev-ErbA-alpha protein (put.) BASE COUNT 58 a 107 c 74 g 56 t ORIGIN 1 ccgttgcctc aacgtccaac ccttcctgca gggctgcagt ccggccaccc caagcccttg 61 ctgcagggtg cttcggatcc tgatcgtgag tcgcggggtc cactccccgc ccttagccag 121 tgcccagggg gcaacagcgg cgatcgcaac ctctagtttg agtcaaggtc cagtttgaat 181 gaccgctctc agctggtgaa gacatgacca ccctggactc caacaacaac acaggtggcg 241 tcatcaccta cattggctcc agtggctcct ccccaagccg caccagccct gaatc // LOCUS HUMREVERB2 285 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human Rev-ErbA-alpha mRNA, partial cds. ACCESSION M34340 KEYWORDS . SEGMENT 2 of 2 SOURCE Human fetal skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 285) AUTHORS Lazar,M.A., Jones,K.E. and Chin,W.W. TITLE Isolation of a cDNA encoding human Rev-ErbA-alpha: Transcription from the noncoding DNA strand of a thyroid hormone receptor gene results in a related protein that does not bind thyroid hormone JOURNAL DNA 9, 77-83 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 285 Rev-ErbA-alpha protein (put.) (AA at 1) BASE COUNT 62 a 79 c 99 g 45 t ORIGIN About 1.7 kb after segment 1. 1 ctcgtaaagg agagagaagt gcagagttcg attctgtaca agggggcagc ggcagaaggc 61 cggccgggcg ggtcactggg cgtccacccg gaaggacagc agcttctcgg aatgcatgtt 121 gttcagggtc cgcaggtccg gcagcttgag cagcagcttg gtgaagcggg aagtctccaa 181 gggccggttc ttcagcacca gagcccgaag agcccgcagc agcgtctcct ggagctgctc 241 caccgaagcg gaattctcca tgcccgagcg gtctgcagag acaag // LOCUS HUMTFEB 1526 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human TFEB protein mRNA, partial cds. ACCESSION M33782 KEYWORDS . SOURCE Human BJAB B cell line, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1526) AUTHORS Carr,C.S. and Sharp,P.A. TITLE A helix-loop-helix protein related to the immunoglobulin E box-binding proteins JOURNAL Mol. Cell. Biol. 10, 4384-4388 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.S.Carr, 19-APR-1990. FEATURES from to/span description pept < 1 > 1526 TFEB protein (AA at 1) BASE COUNT 324 a 495 c 493 g 214 t ORIGIN 1 ggtggagtga cagccggagc ccgggcgccg gcgcggggct cggtgacagc ggaggcggcg 61 gcccgggcgg gacgagggag cggccgcggc gccgacggcg ggcgggaagg gcggacgggg 121 cgggcctccc tggtggcacg gggccggagc aggccgggag caggtggcgg gcacagcttg 181 cgggccaggc acccgaactt gcgacaagtt gccggagccg gggcgcgggc ggcggacaga 241 ttgaccttca gagcgaggga gccagcgccg gcagccacca tggcgtcacg catagggttg 301 cgcatgcagc tcatgcggga gcaggcgcag caggaggagc agcgggagcg catgcagcaa 361 caggctgtca tgcattacat gcagcagcag cagcagcagc aacagcagca gctcggaggg 421 ccgcccaccc cggccatcaa tacccccgtc cacttccagt cgccaccacc tgtgcctggg 481 gaggtgttga aggtgcagtc ctacctggag aatcccacat cctaccatct gcagcagtcg 541 cagcatcaga aggtgcggga gtacctgtcc gagacctatg ggaacaagtt tgctgctgcc 601 cacatcagcc cagcccaggc tctccgaaac ccccaccagc cgcctcccca ggggtgcgag 661 ctggacacgt gctgtcctcc tccgctggca acagtgctcc caataagccc catggccatg 721 ctgcacattg gctccaaccc tgagagggag ttggatgatg tcattgacaa cattatgcgt 781 ctgacgatgt ctcttggcta catcaatcct gaaatgcaga tgcccaacac gctacccctg 841 tccagcagcc acctgaatgt gtacagcagc gacccccagg tcacagcctc cctggtgggc 901 gtcaccagca gctcctgccc tgcggacctg acccagaagc gagagctcac agatgctgag 961 agcagggccc tggccaagga gcggcagaag aaagacaatc acaacttaat tgaaaggaga 1021 cgaaggttca acatcaatga ccgcatcaag gagttgggaa tgctgatccc caaggccaat 1081 gacctggacg tgcgctggaa caagggcacc atcctcaagg cctctgtgga ttacatccgg 1141 aggatgcaga aggacctgca aaagtccagg gagctggaga accactctcg ccgcctggag 1201 atgaccaaca agcagctctg gctccgtatc caggagctgg agatgcaggc tcgagtgcac 1261 ggcctcccta ccacctcccc gtccggcatg aacatggctg agctggccca gcaggtggtg 1321 aagcaggagc tgcctagcga agagggccca ggggaggccc tgatgctggg ggctgaggtc 1381 cctgaccctg agccactgcc agctctgccc ccgcaagccc cgctgcccct gcccacccag 1441 ccaccgtccc cattccatca cctggacttc agccacagcc tgagctttgg gggcagggag 1501 gacgagggtc ccccgggcta ccccga // LOCUS MUSFCERII 2237 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Mouse low affinity IgE receptor (Fc-epsilon-RII) mRNA, complete cds. ACCESSION M34163 KEYWORDS IgE receptor. SOURCE Mouse (strain Balb/c) splenic B cell, cDNA to mRNA, clone p23.18. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2237) AUTHORS Gollnick,S.O., Trounstine,M.L., Yamashita,L.C., Kehry,M.R. and Moore,K.W. TITLE Isolation, characterization, and expression of cDNA clones encoding the mouse Fc receptor for IgE (Fc-epsilon-RII) JOURNAL J. Immunol. 144, 1974-1982 (1990) STANDARD simple staff_review FEATURES from to/span description pept 175 1170 low affinity IgE receptor (Fc-epsilon-RII) mRNA < 1 2237 Fc-epsilon-RII mRNA BASE COUNT 592 a 622 c 533 g 490 t ORIGIN 1 ggagaagact actgtcttca acacactagc ctgagctacc ttatccaagt gctccacata 61 ttccagaagg agaaggacag acttcaagtt caaatcactt ccagagctgt gagtgacaag 121 tgccttggca ggtagtgcac gcctcatcac tgaaaggatc caaacaagac tgccatggaa 181 gaaaatgaat actcaggata ctgggaacct cctagaaagc gttgctgctg tgcaagacgt 241 gggacacagc tcatgttggt ggggctgctg agcacagcaa tgtgggctgg cctgctggcc 301 ctgcttcttc tgtggcactg ggaaacggag aagaatctaa aacagctggg agacactgca 361 attcagaatg tctctcatgt taccaaggac ttacaaaaat tccagagtaa tcaattggcc 421 cagaagtccc aggttgttca gatgtcacaa aacttgcaag aactccaagc tgaacagaag 481 caaatgaaag ctcaggactc tcggctctcc cagaacctga ccggactcca ggaggatcta 541 aggaacgccc aatcccagaa ctcaaaactc tcccagaacc tgaacagact ccaagacgat 601 ctagtcaaca tcaaatccct gggcttgaat gagaagcgca cagcctccga ttctctagag 661 aaactccagg aagaggtggc aaagctgtgg atagagatac tgatttcaaa gggaactgca 721 tgcaacatat gtcccaagaa ctggctccat ttccaacaga agtgctacta ttttggcaag 781 ggctccaagc agtggatcca ggccaggttc gcctgcagtg acctgcaagg gcgactagtc 841 agcatccaca gccaaaagga acaggacttc ctgatgcaac acatcaacaa gaaggattcc 901 tggattggcc tccaggatct caatatggag ggagagtttg tatggtcgga cgggagccct 961 gtgggttata gcaactggaa tccaggggag cccaataacg ggggccaggg tgaggactgt 1021 gtgatgatgc ggggatccgg ccagtggaac gacgccttct gccgcagcta cttggatgca 1081 tgggtgtgtg agcagctggc aacatgtgag atatctgccc ccttagcctc tgtgactcca 1141 acaaggccca ccccaaaaag tgaaccctga caaacttctg ctcacactct tctggatttc 1201 tcctctacct ttatcgtgga aacagctggg ccctgaggat acccctatca gggcccaggg 1261 ctctctgtga ccgaaggctt tgattatgtt cccacccata ctgaagcagc tggtggatgc 1321 cagctcctgc cagctaccca gaaaccctct ccagctctcc agctaagctg gccatcccat 1381 tccatctgcc ttcctcaaac ctgggcccca gccttgctag ctccctgact acgggcatgc 1441 atgtgggcag ctgagccaac cagggagctg ctgagaacaa agatttcgaa ggcttctttt 1501 gcagtcccca cctcctatca agttccccac tttctccccc tcggcatcag agaacagggg 1561 ttccctttcc ccaggatctg ggatgagtcc tcccatcaag tttgcatcag tggtcccagg 1621 actccgaccc tccttggagg ctaccaggtg tgctcctggt gcgggaggta ttgaaggaac 1681 tctaaacagc tccagcaagg cgagcctggc tctgtctggt aggcctggcc cttctctccc 1741 attccttcta ccttactaaa agctgttaga gaacagtcct aaagctagcc cccaaggtct 1801 attcccttat ttggccactt cctcctcctg aggctgacta caaggtccag ctatccaagt 1861 actgaagtct aacatcaaaa gccccctttg tctcacctaa gtagcaatgc ccaatcaaaa 1921 tacaccatca catcatagcc cagtctaaca gaccgccctt tttctcttca taaaattaca 1981 cctgcaacca ggcgtagtgg tgcaggcctt tagtcccagc acttgggagg cagagacaag 2041 cgaatttctg agttcgaggc cagcctggtc tacaaagtga gttccaggac agccagggct 2101 acacagagaa accctgtctc gaagaaagaa aaaaaaaaaa aattacacct gcgaggtcac 2161 ttgggctgct gtttttctgc ctgagtcaga gggcagccac ttaacttttc ttccctgctt 2221 aataaaggat ctctgtg // LOCUS RATATM1 1585 bp ds-DNA ROD 08-AUG-1990 DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 1b. ACCESSION M34137 KEYWORDS alpha-tropomyosin. SEGMENT 1 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1585) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review COMMENT [1] reports isolation of exon 1b which is the first exon coding for the alternatively expressed low-M-r alpha-tropomyosins. FEATURES from to/span description pept 1275 / 1406 low-M-r alpha-tropomyosin BASE COUNT 360 a 408 c 449 g 368 t ORIGIN 1 agaggtgttc cagctttgga agataagtga ggcaggctgt tttcttttct acctttgttt 61 ccattctcca cctcccacat ttaagcaagt tcaggactga ctctttgaca gaggtgtagg 121 attaaaaagc tcggagactt gacaggcttt aaaaaaaaca tagcggaaga aggaatgaca 181 gaggcgggtt cttacagact gtgtccatgt ggctttccag ctggtaacca gaggttctgt 241 cacaggtgtt tgtggtcttg cagacccggg ttcctcagat tccagatatc aggctgaatt 301 gcctgaagtt gcacagttcg ttatggggta cagacaaaat cttatggtag cgtttaatta 361 catagtttga gttcttaaat aagtgaggca tatgtaaatg ttttatatca tctgataact 421 tcaacagcct tgtgaagtgt cctgtactat tatactcgtt tttaagagag gaaatccctg 481 agaagtcatt tgtccaaagg cacacagcca gcagagactt aagagccaga ttgaaacctt 541 aaaaagatgt atttagaaca ggagatttaa ccactacgct ttgccagcta gtagtttaca 601 cttggtgaga tagtttgggt gttgggattc ttgtagtgag gatcttgtgt tctgtgttct 661 tcacaaggat ctttaggata ttcctatttc aggctccgag ttccttgttc agactctggg 721 aaatcgggag ttgccaatgc ctttagtatt cggaatgatg gcgttcttta caactcaaag 781 agtttaggga acgttcaagg tctaggttcc tacccttact cagcttcagg ggaatctcga 841 tttcagagcg ctccaaatgg ccagcaggag tcgccatagg ctttagcatc cagttcctct 901 acttggagtc acccaggctc cgcggaatgg atgcaaacgc cctcctgcgg tagcctcgct 961 ctaggcaaag cagcccagag ccagccattc ccgcgcgcac gcgcagtggt tccaagggta 1021 gccagttcct cgggtccacc ggccccgccc gcctgcccgc cgcgcccatc ccctgcgttc 1081 gcgcacccgc ccgcggctct ctagtgcggc cctggggccg tgacttccgg gctgcttctt 1141 agccgcaggg ggcgccggcg ttgcaaagcg ggcctgggcg ggacgaaccg gacctgggct 1201 ccgtaactgc aggagcccag cagtagcagc tgcagagcgc ggaggcagag cacagcctca 1261 gccgccgcct cgccatggcg ggtagcagct cgctggaggc ggtgcggaga aagatccgga 1321 gcctgcagga acaggcagat gccgcggagg agcgtgcggg cagcctgcag cgagagctgg 1381 accaggagcg gaagctgcgg gaaaccgtaa gggatccacc catcacccga ggcccccacg 1441 ccgcacctgg cgcacctagg ccaactggca cccggctact cagcctgccg gtctatctcc 1501 tcggtctaaa gtaaaacctc caagggggag caagagtgct tgctgttgag aaggttctgg 1561 gggcgagcat gaaatgggtc ccggg // LOCUS RATATM2 462 bp ds-DNA ROD 08-AUG-1990 DEFINITION Rat alpha-tropomyosin (alpha-TM) gene, exon 9c. ACCESSION M34138 KEYWORDS alpha-tropomyosin. SEGMENT 2 of 2 SOURCE Rat DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 462) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review COMMENT [1] reports isolation of exon 9c which is the final exon coding for the alternatively expressed brain alpha-tropomyosins TMBr-1 and TMBr-3. FEATURES from to/span description pept / 375 448 alpha-tropomyosin BASE COUNT 97 a 141 c 100 g 124 t ORIGIN Unknown number of bp after segment 1. 1 ctcgcctaga atctctgctg agctcccaca aagcatgagt gtctggggct gccctccagc 61 ctggccttag tccccaaatc tgttgtccta aactacacat aagtgagtca ctgggtggct 121 ttaatgaggt gcagaaggag cattggttct gcacctcttc gcctctgctc tacaccaagg 181 aaagattctg gcccctcctt tattctgcca gacctccatg cgcctcaagg gcacagggca 241 gagggcctgg ggtcagggta ccttgccgct gtcctttgta ttgtgttctc agtctgttaa 301 ccgtttctca ctgactcttg ctaacctctt ccttctgcct cttcttttct gctaaccctt 361 gctgacctgc ccagatcaac tctaccatca actcgagcaa aaccgccgtc taactaatga 421 actaaagctg gccctgaatg aggattaaaa ccctgggcca ag // LOCUS RATPLPA1 1658 bp ds-DNA ROD 08-AUG-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exons 1 and 2. ACCESSION M34108 KEYWORDS parathyroid hormone-like peptide. SEGMENT 1 of 5 SOURCE Rat liver DNA, clone lambda-rPLPg10-2. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1658) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description pept 1477 + 1577 parathyroid hormone-like peptide (PLP) precursor, exon 2 (first expressed exon) sigp 1477 + 1577 parathyroid hormone-like signal peptide pre-msg 951 > 1658 PLP mRNA and introns IVS 1259 1452 PLP intron A IVS 1578 > 1658 PLP intron B BASE COUNT 325 a 507 c 466 g 360 t ORIGIN 1 gggaaccagt tacacacaca cacacacaca cacacacaca cacacacaca cacacacact 61 ggtaagaacc agtagcccac aacattctcc ccctctcccc acccacccac ccagcagcgc 121 tgagaagctg aaacgactgg cataaggtta ctcagtgaat ctatagcagg acatagagta 181 ggttatcaga ggaagagtgg ctaagttgga ttctgtgtgc tccaggttct ctgagatcac 241 aaagtgggtc ttcgatcctt gttcacacac cctttctagg atctgctcca gatgtgatgt 301 ccccagctgt tctcaaacag ttaagaccaa aaatcgcctt tgctctcctg ggtgctaccc 361 ttagtcccct ccccagggga aggccatgcc aggcaggaag gctggcttgc aacagtgcga 421 gcttgcaaac gagcttggga ctcttgggat tcgcaggcca cccggagttc ctccttgccg 481 ccggtgggga ccagggctcc tggacctggg ctcctggact caggcggcag caggctggtt 541 caggaggagg agagcgtcag atcggcccac gcggcgactg gggtggggct ccgtggctcc 601 cagctcccct gtgctccccg cccccgccgg cccgctcacc ccagtctgct ccgccggcgc 661 gcgttcctcg ggcgccacct ctttgcgact cgctcacttc tcagcaggtt ggccacagag 721 cgtgtgaacc ttccaaggct gggctcacag ctcctctcca acctgcgccg cccagctggg 781 cgtccgcccc gccgccggaa ccgcctcgcg ccacctggcc ctcgcatcca cgacacgcgc 841 gcctgcaact tgttcaaggg cgttgtggaa tcaactttcc ggaagcaacc agcccaccgg 901 aggaggtaga ccgacagcta tgtatatata cgctctctgc cgcaagtggc tttgcaccca 961 gaggcctggt gcgcaggaca gctgactcct gcgggcacac tcgcgtttga agaggggttc 1021 gccctgcccc gcgacccaga gcgctgccgc caagactaat tagacattgc tatgggagcc 1081 acagcaacgc gtcacgcatc cccgacgcct atgtaaaacg gttcggtctt cgctctttca 1141 gaggaaactc tctgattgct ttttcccttc tcgggtccct tttttttttt tttttttttt 1201 ttttttttgc ctgtgcggtt tgagagaggc acagttggag gcgctgattc ctacaaaagt 1261 aagtgctcag gggctcggaa acactttttt gctttctttt tcttgttttc ttttttcctt 1321 ataacttgtg agatgccctc gacattggag tggcttgttg ggaagccacg gggttccaaa 1381 agagggagag gggcagtgag aggaagagat agatggagag ccaggactga ccctcgggtt 1441 ccactcttgc aggtcccaga gccagcgagc ggcacgatgc tgcggaggct ggttcagcag 1501 tggagcgtcc tggtgttcct gctcagctac tccgtgccct cccgcgggcg ctcggtggag 1561 gggcttggtc gcaggctgta agtaccggat cttcctcccg ggtgagggat ggaggagacc 1621 agcaccagag gtgccgaact gaggcgggca atgcggaa // LOCUS RATPLPA2 144 bp ds-DNA ROD 08-AUG-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B (partial). ACCESSION M34109 KEYWORDS parathyroid hormone-like peptide. SEGMENT 2 of 5 SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 144) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description IVS < 1 > 144 parathyroid hormone-like peptide intron B BASE COUNT 42 a 26 c 21 g 55 t ORIGIN 3 kb after segment 1. 1 gcagaacact atgttttgac tgaagtccaa gttgcctctg ataaactttt gcccaatttt 61 tcttttaatt tttttacagt aaccaaagac aaggataaag tataaatcca cttatttttc 121 catgctggca ctttggtttg taat // LOCUS RATPLPA3 145 bp ds-DNA ROD 08-AUG-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, intron B (partial). ACCESSION M34110 KEYWORDS parathyroid hormone-like peptide. SEGMENT 3 of 5 SOURCE Rat liver DNA, clones lambda-rPLPg[10-2,9-0]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 145) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description IVS < 1 > 145 parathyroid hormone-like peptide intron B BASE COUNT 22 a 34 c 34 g 55 t ORIGIN 640 bp after segment 2. 1 agtctccata gggtcaatcc actgtcatgt gcttcttatg ctgtgctgtt tcttgtcccc 61 ttgctgtgac actcacttct gcctttcttg gacttgagaa cacgttgggg acttttgaga 121 atcggagtcc tgttttgtgg acttt // LOCUS RATPLPA4 920 bp ds-DNA ROD 08-AUG-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 3. ACCESSION M34111 KEYWORDS parathyroid hormone-like peptide. SEGMENT 4 of 5 SOURCE Rat liver DNA, clone lambda-rPLPg9-0. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 920) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 94 + 516 parathyroid hormone-like peptide (PLP) precursor, exon 3 sigp + 94 100 parathyroid hormone-like signal peptide matp 101 + 516 parathyroid hormone-like peptide IVS < 1 93 PLP intron B IVS 517 > 920 PLP intron C BASE COUNT 198 a 353 c 166 g 203 t ORIGIN 1.3 kb after segment 3. 1 tccctactct agaggtcaac tgttcaccca tctcattcct tcaagtccaa cacgggcagg 61 tcctcagctg acaactgtct tctcccctga cagaaaacgc gcggtgtctg agcaccagct 121 actgcatgac aagggcaagt ccatccaaga cttgcgccgc cgtttcttcc tccaccatct 181 gattgcggag atccacacag ctgaaatcag agctacctcg gaggtgtccc ctaactccaa 241 acctgctccc aacaccaaaa accaccctgt gcggtttggg tcagacgacg agggcagata 301 cctaactcag gaaaccaaca aggtggagac gtacaaagag cagccactca agacgcccgg 361 gaagaagaag aaaggcaagc cggggaaacg cagagaacag gagaaaaaaa agcgaaggac 421 tcggtctgcc tggccaggca caactgggag tggcctgctt gaggaccccc agccccacac 481 ctccccgacc tcgacctccc tggagcccag ctcaaggtat tgtctgtccg ctgtgctcag 541 tagggctccc ctccctccac cggcctgggt cccagagatg cagaatcctc tgtatcttga 601 tctgagacta catcagttag gccacaatct cagtccattc ctggttagcc tgctagacaa 661 tttaatttcc ctacttccct cccaccagcc caatcaacca tccttcctcc ctccctccct 721 ccttccctcc ctcactccct ccctccctcc atccctccct ccctaccatc ctccctccct 781 tccttctccc tttcctttcc atcccctccc ctcttctccc tttcctttcc atcccctccc 841 ctcttctcct cccctctctc tgattcctgt cttcctcact cctccccaca catagggtct 901 tcctcaagcc aaggctgact // LOCUS RATPLPA5 855 bp ds-DNA ROD 08-AUG-1990 DEFINITION Rat parathyroid hormone-like peptide (PLP) gene, exon 4. ACCESSION M34112 KEYWORDS parathyroid hormone-like peptide. SEGMENT 5 of 5 SOURCE Rat liver DNA, clone lambda-rPLPg9-0. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 855) AUTHORS Karaplis,A.C., Yasuda,T., Hendy,G.N., Goltzman,D. and Banville,D. TITLE Gene-encoding parathyroid hormone-like peptide: Nucleotide sequence of the rat gene and comparison with the human homologue JOURNAL Mol. Endocrinol. 4, 441-446 (1990) STANDARD simple staff_review FEATURES from to/span description pept + 21 30 parathyroid hormone-like peptide (PLP) precursor, exon 4 matp + 21 27 parathyroid hormone-like peptide pre-msg < 1 449 PLP mRNA and introns IVS < 1 20 PLP intron C BASE COUNT 280 a 139 c 157 g 279 t ORIGIN 3.6 kb after segment 4. 1 ccccttcctt gtcttcacag gacgcattga aactttcacc gaagatcttc caagacacgt 61 tacaggattc tgtaatagta aacatatgga aagtattaga catatttatt gcctgtacat 121 actgtaaatg cattgggatc aaactgtctc cccaggaaac tgcacatggg tcatgttacc 181 ataatttatt ttgtcaactg atgtatttat ttgtaaatgt atcttggtgc tgctgactct 241 gtttttttgt aacataatgc actttaggta tacatatcaa gtatgtcgat gaatttaaca 301 cgaaaaagtg tctctatttt gtggttcatt ttgatgagtt ctgaaatata attatctaga 361 ctgatctccc tctgcgcatg taaaaatggc actattttaa atttgtaaag aatgtctaat 421 aaaatataat ctaattacac cgtgacttac aatgttggtt ttaattcttt aaggtttcta 481 atcagagaaa cacagtacaa ttttttttta aattctactt gaataaaatc ctaagcttga 541 tgtgtttgtg tggggatcta ctccaatgtg tccgcattag cttttagtta aaaaccactt 601 aaaatgtatt ttgccaatat ttcagctcaa gtcaatagaa tggctgaaca gcctgaactt 661 caagttgaaa agaaagacaa gaaagggcaa aaaataagca gagaagtcag gagaaagggt 721 caaagagggt tgtccagagt taaaaccaga gaagtgggtc cagacttgaa gactctgaat 781 aaattttgcc aatgtaaagt ttagtttgca agggcttgct ctcgattcat aacaataata 841 aaagatctgc tcggg // LOCUS RATTMBR1A 1050 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Rat brain alpha-tropomyosin (TMBr-1) mRNA, complete cds. ACCESSION M34135 KEYWORDS alpha-tropomyosin. SOURCE Rat brain, cDNA to mRNA, clone pOk4. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1050) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review FEATURES from to/span description pept 41 886 brain alpha-tropomyosin (TMBr-1) mRNA < 1 1050 TMBr-1 BASE COUNT 336 a 220 c 306 g 188 t ORIGIN 1 ctctggctcc tctgcctaag ggcccctcgc caccgccacc atggacgcca tcaagaagaa 61 gatgcagatg ctgaagctcg acaaagagaa cgccttggat cgagcagagc aggcggaggc 121 tgacaagaag gctgcggaag accggagcaa gcagctggaa gatgagctgg tgtcactgca 181 aaagaaactc aaggccactg aagatgaact ggacaaatac tccgaggctc tcaaagatgc 241 ccaggagaaa ctggagctgg cggagaaaaa ggccacagat gctgaagctg acgtagcatc 301 tctgaacaga cgcatccagc tggttgagga ggagttggat cgcgctcagg agcgtctggc 361 cacagctcta cagaagctgg aggaggctga gaaggctgca gatgagagtg agagaggcat 421 gaaagtcatt gaaagccgag cccaaaaaga tgaagaaaag atggagattc aggagatcca 481 gctgaaagag gccaagcaca ttgctgaaga tgctgaccga aagtatgaag aggtggcccg 541 taagctggtc atcatcgaga gcgatctgga gcgtgcggag gagagggctg agctctcgga 601 aggcaaatgt gccgagcttg aagaagagtt gaaaacggtg acgaacaact tgaagtcact 661 ggaggctcag gctgagaagt actctcagaa agaagacaag tatgaagagg agatcaaggt 721 tctctctgac aagctgaagg aggctgagac ccgggctgag tttgcagaga gatcagtaac 781 caaattggag aaaagcattg atgacttaga agatcaactc taccatcaac tcgagcaaaa 841 ccgccgtcta actaatgaac taaagctggc cctgaatgag gattaaaacc ctgggccaag 901 ttctaggaac ggagtccatg ttcagaaaat ctatgactct cggggttaaa aaatgataac 961 ttctagagaa atgggaagga aatgcattgg ctggaaggcc agccttgcct atgttttcct 1021 ctgtatatct taattaaatc ttcaataacc // LOCUS RATTMBR2A 1004 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Rat brain alpha-tropomyosin (TMBr-2) mRNA, complete cds. ACCESSION M34134 KEYWORDS alpha-tropomyosin. SOURCE Rat brain, cDNA to mRNA, clone pOk15. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1004) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review FEATURES from to/span description pept 137 892 brain alpha-tropomyosin (TMBr-2) mRNA < 1 1004 TMBr-2 BASE COUNT 265 a 242 c 321 g 176 t ORIGIN 1 cttagccgca gggggcgccg gcgttgcaaa gcgggcctgg gcgggacgaa ccggacctgg 61 ctccgtaact gcaggagccc agcagtagca gctgcagagc gcggaggcag agcacagcct 121 cagccgccgc ctcgccatgg cgggtagcag ctcgctggag gcggtgcgga gaaagatccg 181 gagcctgcag gaacaggcag atgccgcgga ggagcgtgcg ggcagcctgc agcgagagct 241 ggaccaggag cggaagctgc gggaaaccgc tgaagctgac gtagcatctc tgaacagacg 301 catccagctg gttgaggagg agttggatcg cgctcaggag cgtctggcca cagctctaca 361 gaagctggag gaggctgaga aggctgcaga tgagagtgag agaggcatga aagtcattga 421 aagccgagcc caaaaagatg aagaaaagat ggagattcag gagatccagc tgaaagaggc 481 caagcacatt gctgaagatg ctgaccgaaa gtatgaagag gtggcccgta agctggtcat 541 catcgagagc gatctggagc gtgcggagga gagggctgag ctctcggaag gcaaatgtgc 601 cgagcttgaa gaagagttga aaacggtgac gaacaacttg aagtcactgg aggctcaggc 661 tgagaagtac tctcagaaag aagacaagta tgaagaggag atcaaggttc tctctgacaa 721 gctgaaggag gctgagaccc gggctgagtt tgcagagaga tcagtaacca aattggagaa 781 aagcattgat gacttagaag ataagtttct ttgcttctct cctcccaaga ctccttcatc 841 aagccggatg tcccacctct ctgagctctg catctgtctg ctctccagct gacccaggtt 901 tctttctagt gcccacccac cctagggcca ggcacagacc gtgctttcta ttgtacagag 961 gtgatcctcc cagtgtaaaa taaacactgt acgctagtcc tgtt // LOCUS RATTMBR3A 2488 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Rat brain alpha-tropomyosin (TMBr-3) mRNA, 3' end. ACCESSION M34136 KEYWORDS alpha-tropomyosin. SOURCE Rat brain, cDNA to mRNA, clone pOk10. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2488) AUTHORS Lees-Miller,J.P., Goodwin,L.O. and Helfman,D.M. TITLE Three novel brain tropomyosin isoforms are expressed from the rat alpha-tropomyosin gene through the use of alternative promoters and alternative RNA processing JOURNAL Mol. Cell. Biol. 10, 1729-1742 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 604 brain alpha-tropomyosin (TMBr-3) mRNA < 1 2488 TMBr-3 BASE COUNT 713 a 531 c 579 g 665 t ORIGIN 1 tgaagctgac gtagcatctc tgaacagacg catccagctg gttgaggagg agttggatcg 61 cgctcaggag cgtctggcca cagctctaca gaagctggag gaggctgaga aggctgcaga 121 tgagagtgag agaggcatga aagtcattga aagccgagcc caaaaagatg aagaaaagat 181 ggagattcag gagatccagc tgaaagaggc caagcacatt gctgaagatg ctgaccgaaa 241 gtatgaagag gtggcccgta agctggtcat catcgagagc gatctggagc gtgcggagga 301 gagggctgag ctctcggaag gcaaatgtgc cgagcttgaa gaagagttga aaacggtgac 361 gaacaacttg aagtcactgg aggctcaggc tgagaagtac tctcagaaag aagacaagta 421 tgaagaggag atcaaggttc tctctgacaa gctgaaggag gctgagaccc gggctgagtt 481 tgcagagaga tcagtaacca aattggagaa aagcattgat gacttagaag atcaactcta 541 ccatcaactc gagcaaaacc gccgtctaac taatgaacta aagctggccc tgaatgagga 601 ttaaaaccct gggccaagtt ctaggaacgg agtccatgtt cagaaaatct atgactctcg 661 gggttaaaaa atgataactt ctagagaaat gggaaggaaa tgcattggct ggaaggccag 721 ccttgcctat gttttcctct gtatatctta attaaatctt caataaccaa gaagttttta 781 tgatacatta tgtatatggg aaaatattta taccaatcaa ttcagctctt cctgcatcag 841 caggccctct ttgaagtgat gataatgagg actcccccac caccaccatc cccccaaaaa 901 aaggaaaaaa ttttctattt cctctaaaca gctgtgagtc ctgctttaaa tatctcttct 961 acatttgtat tcctaacatc ctcctaacat gtatagtgtt tgaaatgtga tccaacgtgt 1021 acctatgctt gggcaaaata gcttttgaaa acaggagctc gtaccacaag tccctggttg 1081 aaaggtatgc tttgttaagt ttaacggtgt tggaatctga ttatcgtgtc atgctgttaa 1141 gcgggatact ataaaaagac taagagaatg tccctaatga agtgtgcatg agatgtgttc 1201 atataaccat ttgcgagccg tagaaataaa tctatttaaa aagtggtcag aaccttcctt 1261 acgtcatctt cgttttctgt gaggtcgtgt tctgacccac ctacccgatc tttaactgcc 1321 tgtaactcag aatcctgtgc tggtattaac gtcttcttcg gtctgtgtaa caatgctgta 1381 cctgtatttg gtttttgacc ctcccttgtt ccccgtagca cgcagtgggt gagccttcac 1441 caaacacacc ctgcacgtta cgtcagggtg ggttgcaggt gtaggtgcac ccggggcctt 1501 gtgtagatgt gagtgacttc caaagcatgt gtagtgccac tcatctgtgt gtgactggtg 1561 tttgcttact aaatgtacaa accactaccg agtgtgaagc tcagccattc atgactcaat 1621 tttctgatcc catcacagag aaagtggcac atgccaaaga agaaaacctt agtatgcacc 1681 agatgctgga ccagacttta ctggagctaa acaacatgtg aaaccctcct tagctgcggc 1741 cacattctct cgtttccccc tgttgtttta ttttgtcttt aaacacatgc ttaccatgaa 1801 accccttcaa tgcttttttt ttttatatat atatatacct ttaccactgt cactgcaaca 1861 tctgcagaga gccagctagg gcagggggtg gggaaagaca cacagaaagc aggtgtactt 1921 gttatttaaa tgtgccattt cccaggttga cattgccaca cttcacagag tttaggcatg 1981 cagtttgctt agccagtgta ggaagcctca cgaaaacaaa aggattaaca gcctaagtgc 2041 caatggtggg gtcatcagga agactgatgt tggagacaat caggtgcaga ttggtgctaa 2101 tttaaacaaa agaacccctg gtgtcttttg ttcagtctaa tacaatttag acttctgtcc 2161 aacactaatt tatttttgtc ttgagtagca ctgcgagagg atatgggttc catgtgcccg 2221 tgttcctcac agccaagggt ttgaaagcca cgctggtctt gtaagattcc cattccttcc 2281 cagctgacat gcttgtggtc ccttccaact cagtggggag agggctcagt gctgttcgcc 2341 cccccccccc ccaattctcc atgtcagggt aagagatcta ggcactacat atgtaattgg 2401 taaagaaaag gcattttctc gagagttata actatgtaaa cattgtataa tgatatgaaa 2461 taaaatgcac attgtaggac attttcta // LOCUS RSPNIFHD 1375 bp ds-DNA BCT 08-AUG-1990 DEFINITION R.rubrum dinitrogenase reductase (nifH) gene, complete cds; and dinitrogenase alpha subunit (nifD') gene, 5' end. ACCESSION M33774 KEYWORDS dinitrogenase alpha subunit; dinitrogenase reductase. SOURCE R.rubrum DNA. ORGANISM Rhodospirillum rubrum Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 1375) AUTHORS Lehman,L.J., Fitzmaurice,W.P. and Roberts,G.P. TITLE The cloning and functional characterization of the nifH gene of Rhodospirillum rubrum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.J.Lehman, 18-APR-1990. FEATURES from to/span description pept 367 1254 dinitrogenase reductase (nifH) pept 1370 > 1375 dinitrogenase alpha subunit (nifD) binding 355 358 ribosome binding site binding 1357 1361 ribosome binding site signal 145 160 upstream activating sequence signal 175 190 upstream activating sequence signal 280 296 ntr promoter BASE COUNT 287 a 404 c 431 g 253 t ORIGIN 1 ggggatccca aggagggacc ctccgtcctg accaagaact gtgcgttcgt gttttcgccc 61 attcctttat caccgatcgg gcggcgacgg gaagggcgtt ctgggcagta gcgcaccatc 121 gcgcatagcg tcgcgccgcc ccgctgttgg gtccgcgaca gggttttggg gcgttgtcgg 181 gtttgcgaca cggttgaaaa taaccctaca ttccttgtta gatagattta ttcctttgat 241 taaaaacgat aattttcaaa aaaatcgggc aaatcaggtt tggcacgctg ggtgcattaa 301 cggggtgagc gtggcggtgg agcttccccg ccttaggacc ggatcacgaa gaaaaggatt 361 gattccatga gcgcactgcg tcagatcgcg ttctacggca agggcggtat cggcaagtcg 421 acgacgtccc agaacacgct ggccgctctg gtcgaaatgg gccagcggat cctgattgtc 481 ggctgcgatc ccaaggccga ttccacccgc ctgatcctga acaccaagct gcaggacacc 541 gttcttcatc tggccgccga ggccggctcg gtcgaagacc tcgacgtcgc cgatgtggtg 601 aagatcggct acaagggcat caagtgcacc gaatccggtg gtccggagcc cggcgtcggc 661 tgtgccggcc gtggcgtgat caccgccatc aacttccttg aggaaaacgg cgcctacgac 721 gacctcgatt acgtctctta cgacgtgctg ggcgacgtgg tttgcggtgg cttcgccatg 781 ccgatccgcg agaacaaggc tcaggaaatc tacatcgtca tgtcgggcga gatgatggcg 841 ctgtatgccg ccaacaacat cgccaagggc attttgaaat acgcccacac cggtggggtt 901 cgtctgggcg gtctgatctg caacgagcgt cagactgata aggaagtcga actggccgag 961 gccctggccg gccgtttggg ctgccgcctc atccacttcg tgccgcgcga caacggcgtt 1021 cagcacgccg agctgcgccg ccaaacggtg atccaatacg cccccgacag caagcaggcc 1081 ggtgagtatc gcacgctggc gaccaagatc cacaacaact ccggccaggg cgtggtgccg 1141 accccgatca ccatggaaga cctggaagag atgctgatgg aattcggcat catgaagtcc 1201 gacgaagagg cgctggccga gctcgaagcc aaggaatccg ccgccgccaa ttaaggccgt 1261 cgccgtgcga cgttcccccc ggccggccgg tcaggcgcag gggcgacgga agcgggcgac 1321 ctccaagaag gcctcttcgc gaggcgatag tcgatcagag gagagggata tgagc // LOCUS YSCMTFDEL 70 bp ds-DNA ORG 08-AUG-1990 DEFINITION S.cerevisiae mitochondrial novel junction sequence. ACCESSION M34150 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review FEATURES from to/span description site 33 44 novel junction site BASE COUNT 11 a 25 c 19 g 15 t ORIGIN 1 cttcggagtt cggccccccc ataagggggg gacctcactc cttccccact gcactggatg 61 cgggacttat // LOCUS YSCMTGAP12 457 bp ds-DNA ORG 08-AUG-1990 DEFINITION S.cerevisiae mitochondrial gap 12 sequence. ACCESSION M34149 X15185 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 457) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review COMMENT The gap 12 sequence is a segment of yeast mitochondrial DNA that had not been determined previously. BASE COUNT 161 a 57 c 57 g 182 t ORIGIN 1 tacaatttat aatttaataa agaaggaaat aaataataat aactcctttt ggggttccgg 61 tggggttcac acctttataa ataataaata aagatgttta ctcctcttcg gggttcggtc 121 ccctttttgg gttccggaac taattaatat tttatataat aataataata tattaatata 181 atttcattat taataaatat ctcctgcggg gttcggtccc ccccgtaagg ggggggtcct 241 cactccttcg gagcgtacta ttattataaa taattatata ttataatata attaaaaagt 301 attataattg aaacgaaaat tgtaatttta aatggaataa taattattat atatttaata 361 tatttaataa agttataata tctctttcta ccggactatt ttattttatt ttattttatt 421 tttataaaga aaaatagtat aatattatct tctcctc // LOCUS YSCMTORI1A 70 bp ds-DNA ORG 08-AUG-1990 DEFINITION S.cerevisiae mitochondrial ori1 gene, 5' flank. ACCESSION M34153 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review BASE COUNT 19 a 11 c 24 g 16 t ORIGIN 1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag 61 ggggtgggtg // LOCUS YSCMTORI2A 70 bp ds-DNA ORG 08-AUG-1990 DEFINITION S.cerevisiae mitochondrial ori2 gene, 5' flank. ACCESSION M34152 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 32 > 70 ori2 mRNA (+/- 4 bp) BASE COUNT 23 a 0 c 18 g 29 t ORIGIN 1 tttttttata agataatttt tgtaaatata taagtaataa attaagtttt atagggggag 61 ggggtgggtg // LOCUS YSCMTORI7A 70 bp ds-DNA ORG 08-AUG-1990 DEFINITION S.cerevisiae mitochondrial ori7 gene, 5' flank. ACCESSION M34154 KEYWORDS . SOURCE S.cerevisiae mitochondrial DNA. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 70) AUTHORS Skelly,P.J. and Clark-Walker,G.D. TITLE Conversion at large intergenic regions of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 1530-1537 (1990) STANDARD simple staff_review BASE COUNT 19 a 11 c 24 g 16 t ORIGIN 1 tataaatata taagtcccgg tttcttacga aaccgggacc tcggagacgt aataggggag 61 ggggtgggtg // LOCUS MXAFRZE 2493 bp ds-DNA BCT 08-AUG-1990 DEFINITION M.xanthus gliding motility regulatory protein (frzE) gene, complete cds. ACCESSION M35192 KEYWORDS gliding motility regulatory protein. SOURCE M.xanthus (strain DZF1) DNA. ORGANISM Myxococcus xanthus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; Myxococcaceae. REFERENCE 1 (bases 1 to 2493) AUTHORS McCleary,W.R. and Zusman,D.R. TITLE FrzE of Myxococcus xanthus is homologous to both CheA and CheY of Salmonella typhimurium JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.R.McCleary, 05-JUN-1990. FEATURES from to/span description pept 64 2397 gliding motility regulatory protein (frzE) BASE COUNT 375 a 848 c 926 g 344 t ORIGIN 1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg 61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc 121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag 181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt 241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag 301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc 361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc 421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg 481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc 541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc 601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc 661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc 721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc 781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg 841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg 901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc 961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc 1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag 1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg 1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg 1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc 1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc 1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg 1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc 1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag 1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc 1561 accatcaccc tgcgcctgcc gcagtcgctg gcgttgatga aggtgctgct ggtgcgcctg 1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg 1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg 1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc 1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac 1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc 1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc 1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc 2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag 2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag 2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg 2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg 2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc 2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga 2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg 2461 caaggggctg cgtgcgctcg cggcccgggg cct // LOCUS RATPCBB 463 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Rat PCB-binding protein mRNA, complete cds.. ACCESSION J05536 KEYWORDS PCB binding protein; binding protein. SOURCE Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone pPCB6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 463) AUTHORS Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J., Gillner,M., Gustafsson,J.-A. and Lund,J. TITLE Cloning, structure, and expression of a rat binding protein for polychlorinated biphenyls: Homology to the hormonally regulated progesterone-binding protein uteroglobin JOURNAL J. Biol. Chem. 265, 12690-12693 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.Lund, 15-JUN-1990. FEATURES from to/span description pept 58 348 PCB binding protein precursor sigp 58 114 PCB binding protein signal peptide matp 115 345 PCB binding protein BASE COUNT 117 a 156 c 84 g 106 t ORIGIN 1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg 61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg 121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat 181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag 241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg 301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat 361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta 421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc // LOCUS YSBCPR 2990 bp ds-DNA PLN 08-AUG-1990 DEFINITION C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds. ACCESSION M35199 KEYWORDS NADPH-cytochrome P450 reductase. SOURCE C.tropicalis (ATCC 750) DNA, clone pTS1. ORGANISM Candida tropicalis Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 2990) AUTHORS Sutter,T.R., Sanglard,D. and Loper,J.C. TITLE Isolation and characterization of the alkane-inducible NADPH- cytochrome P450 oxidoreductase gene from Candida tropicalis: Identification of invariant residues within similar amino acid sequences of divergent flavoproteins JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by T.R.Sutter, 15-JUN-1990. Author name and address T.R.Sutter CIIT, P.O. Box 12137 Research Triangle Park, NC 27709 FEATURES from to/span description pept 554 2596 NADPH-cytochrome P450 reductase signal 198 204 TATA box BASE COUNT 988 a 442 c 558 g 1002 t ORIGIN 1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa 61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg 121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag 181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag 241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag 301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa 361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata 421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta 481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc 541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt 601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt 661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa 721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa 781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc 841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt 901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac 961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac 1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg 1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt 1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa 1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga 1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt 1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga 1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa 1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat 1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt gatactgtta ttgaattgaa 1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat 1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg 1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga 1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc 1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt 1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt 1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac 1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca 2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag 2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg 2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg 2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt 2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa 2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa 2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc 2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga 2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca 2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta 2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga 2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat 2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc 2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta 2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt 2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta // LOCUS CPCURA5A 2029 bp ds-DNA BCT 08-AUG-1990 DEFINITION C.neoformans orotidine monophosphate pyrophosphorylase (URA5) gene, complete cds. ACCESSION M34606 KEYWORDS orotidine monophosphate pyrophosphorylase. SOURCE C.neoformans (strain B-3501) asexual phase cell line B-3501 DNA, clone pURA5g2. ORGANISM Cryptococcus neoformans Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina; Filobasidiales. REFERENCE 1 (bases 1 to 2029) AUTHORS Edman,J.C. and Kwon-Chung,K.J. TITLE Isolation of the URA5 gene from Cryptococcus neoformans var neoformans and its use as a selective marker for transformation JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Edman, 29-MAY-1990. Author address: J.C.Edman University of California San Francisco Hormone Research Institute San Francisco, CA 94143-0534 email: edman@socrates.ucsf.edu FEATURES from to/span description pept 714 814 orotidine monophosphate pyrophosphorylase (URA5), exon 1 (E.C. 2.4.2.10) 867 1095 orotidine monophosphate pyrophosphorylase, exon 2 1145 1492 orotidine monophosphate pyrophosphorylase, exon 3 pre-msg < 714 1527 URA5 mRNA and introns IVS 815 866 URA5 intron A IVS 1096 1144 URA5 intron B BASE COUNT 512 a 515 c 479 g 523 t ORIGIN 1 gatcttgggg atggtattga agacgatcca gaaaaagtct ctccatccct cccacgttcg 61 ctgttcggtt tctttgcaaa gagcgaagtt gctcgacctt tgacagaccc taaaatctgg 121 tcgtattttg cgttcaaaga tcgcttactg cgaccgcgat tttttagagg tggatttttt 181 gttgcgagac tagcgcatcg gataggcatg ggtgatatac gtttgcgttc cttatgaggg 241 gtatcatcag tggcaaatta ccgactagat gaagaagacg acctacaccc cgctaaccgg 301 tgtccgatcg acatgatcca cggtgtccaa agcgaagaag gacatgatga ttggagttcg 361 cttacctttg gtataaacca gtgttgtttc tgaagaggtg aggaagaaaa tgatctaggg 421 caataacaga aatcgtttga agaagaagta aaagaaaaga cacacgtcag acaagcatca 481 tcatcattca ttttgaatac tcatcacggc accttcttcc ttgcatgcgc attattcatg 541 atgagatgct cgcagccgcc ggatttcggc tgcaatcaag aacgatatcg cgtacggcga 601 tgtccgcctc cactccatct tccaacaacc accacagtgc ccgataacac tacgacgaaa 661 attgttcttt tttggatctt gtcgacaacg gcgaccaaaa caagaacttg atcatgtcct 721 cccaagccct cgactccgcc aaagttgcct tcatcgaggc tgccatcgaa catggcgtgc 781 ttcttttcgg caactttacc ttgaagtccg gccggtgagc catattgcag cgcttcacag 841 tccaatcgaa tctgacatgt gttcagccaa tccccttact tcttcaatgc cggtctcctt 901 tactcttcat cgcttctctc aactaccgct caggcttacg ccaaggtact ttcctcttct 961 aggattcctg actttgacgt cctcttcggc ccagcttaca agggtatctc cttggctgct 1021 gtctccgctg taagccttta tcagcaaacc ggcaaagata tcggctactg ctacaacagg 1081 aaggagaaga aggacgtgag tctgtcctaa ccagtgcgac agcgatgagc tcataagcca 1141 gtagcacggt gagggcggta ctatggtcgg tgcgcctctc aagggacgaa tcgtcatcat 1201 cgacgatgtt ctcacctctg gcaaggccat ccgtgaagct attgacattc tcaaggcctc 1261 ccctgaagcg aagcttgtcg gaattgtcca gcttgtcgac agacaagaga aaggccagag 1321 cggtagcggc aagagtaccg tacaggaggt tgaggaagag ttcggtgtgc ctgtcgagcc 1381 tattattggt ttggacgaca ttgtgaagta cttagaaagc tccggcaagt gggaaaagga 1441 gctgcaagag gtcaggaagt acagggcgga gtacggtgtt cagaggtctt aagggttttc 1501 ttcttaaatg cacgggttta ggtctagcta atcaagttcc gacatattac aagtttgtaa 1561 gcttgtatca aaggaactta agtacaggca ggcgtcctga ggcgacaaag gaagctgtaa 1621 tatgattgtt ggctgtcaat cttcatcgta tctactttgt caatactgac ttcaatgacc 1681 caataataca attttattag tgttgaccca gaatggttag caggaaactc cccttctctt 1741 cctctcaatc ccaatcatac ttcatatctc ctgctccccc catttccgtc ttcctcgatg 1801 actccctggt cccatccctc ccacctcctg gaggcaagct ggagcacctg gacctaatgg 1861 gtcgtcgccc caaattgcct ccacctctta agacaatcat cgtccaatca aactctgaac 1921 tatcttccaa gccaatggcc ggatctggac acaagatgat gtcgaagccg cttgatgtgc 1981 cagggttggt ccgtggcctg gagacgtaag agcgggtagt actgggatc // LOCUS ECOCHEA 2190 bp ds-DNA BCT 08-AUG-1990 DEFINITION E.coli cheA protein (complete cds), motB protein (3' end), and cheW protein (5' end) genes. ACCESSION M34669 KEYWORDS cheA gene; cheW gene; motB gene. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2190) AUTHORS Kofoid,E.C. and Parkinson,J.S. TITLE Tandem translational starts in the cheA locus of E.coli JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.C.Kofoid, 01-JUN-1990. Author address: E.C.Kofoid University of Utah Dept of Biology Salt Lake City, UT 84112 email: kofoid@bioscience.utah.edu FEATURES from to/span description pept < 1 104 MotB protein pept 109 2073 CheA protein (alt.) pept 400 2073 CheA protein (alt.) pept 2094 > 2190 CheW protein BASE COUNT 575 a 546 c 609 g 460 t ORIGIN 42 min on K12 map. 1 atgaaaacgc cgaaagccag aatgagccag taagcgccct ggaaaaacct gaggttgcac 61 cacaggtcag tgttcccaca atgccatcag ccgaaccgag gtgacagcgt gagcatggat 121 ataagcgatt tttatcagac attttttgat gaagcggacg aactgttggc tgacatggag 181 cagcatttgc tggttttgca gccggaagcg ccagatgccg aacaattgaa tgccatcttt 241 cgggctgccc actcgatcaa aggaggggca ggaacttttg gcttcagcgt tttgcaggaa 301 accacgcatc tgatggaaaa cctgctcgat gaagccagac gaggtgagat gcaactcaac 361 accgacatta tcaatctgtt tttggaaacg aaggacatca tgcaagaaca gctcgacgct 421 tataaacagt cgcaagagcc ggatgccgcc agcttcgatt atatctgcca ggccttgcgt 481 caactggcat tagaagcgaa aggcgaaacg ccatccgcag tgacccgatt aagtgtggtt 541 gccaaaagtg aaccgcaaga tgagcagagt cgcagtcagt cgccgcgacg aattatcctt 601 tcgccgctga aggccgggga agtcgacctg ctggaagaag aactgggaca tctgacaacg 661 ttaactgacg tggtgaaagg ggcggattcg ctctcggcaa tattaccggg cgacatcgcc 721 gaagatgaca tcacagcggt actctgtttt gtgattgaag ccgatcagat tacctttgaa 781 acagtagaag tctcgccaaa aatatccacc ccaccagtgc ttaaactggc agccgaacaa 841 gcgccaaccg gccgcgtgga gcgggaaaaa acgacgcgca gcaatgaatc caccagcatc 901 cgtgtagcgg tagaaaaggt tgatcaatta attaacctcg tcggcgagct ggttatcacc 961 cagtccatgc ttgcccagcg ttccagcgaa ctggacccgg ttaatcatgg tgatttgata 1021 accagcatgg ggcagttaca acgtaacgcc cgtgatttgc aggaatcagt gatgtcgatt 1081 cgcatgatgc cgatggaata tgtttttagt cgctatcccc ggctggtgcg tgatctggcg 1141 ggaaaactcg gcaagcaggt agaactgacg ctggtgggca gttctactga actcgacaaa 1201 agcctgatag aacgcattat cgacccgctg acccacctgg tacgcaatag cctcgatcac 1261 ggtattgaac tgccagaaaa acggctcgcc gcaggtaaaa acagcgtcgg aaatttaatt 1321 ctgtctgccg aacatcaggg cggcaacatt tgcattgaag tgaccgacga tggggcgggg 1381 ctaaaccgtg agcgaattct ggcaaaagcg gcctcgcaag gtttgactgt cagcgaaaac 1441 atgagcgacg acgaagtcgc gatgctgata tttgcacctg gcttctccac ggcagagcag 1501 gtcaccgacg tctccgggcg cggcgtcggc atggacgtcg ttaaacgtaa tatccagaag 1561 atgggcggtc atgtcgaaat ccagtcgaag cagggtactg gcactacgat ccgcatttta 1621 ctgccgctga cgctggccat cctcgacggc atgtccgtac gcgttgcgga tgaagttttc 1681 attctgccgc tgaatgctgt tatggaatca ctgcaacccc gtgaagccga tctccatcca 1741 ctggccggcg gcgagcgggt gctggaagtg cggggtgaat atctgcccat cgtcgaactg 1801 tggaaagtgt tcaacgtcgc gggcgcgaaa accgaagcca cccagggaat tgtggtgatc 1861 ttacaaagtg gcggtcgccg ctacgccttg ctggtggatc aattaattgg tcaacaccag 1921 gttgtggtta aaaaccttga aagtaactat cgcaaagtcc ccggcatttc tgctgcgacc 1981 attcttggcg acggcagcgt ggcactgatt gttgatgtct ccgccttgca ggcgataaac 2041 cgcgaacaac gtatggcgaa caccgccgcc tgaatgagta aaaaggtaac aatatgaccg 2101 gtatgacgaa tgtaacaaag ctggccagcg agccgtcacg gcaggaattt ctggtattta 2161 cccttggtga tgaagagtac ggtattgatg // LOCUS HAMHSP60A 2726 bp ds-DNA ROD 08-AUG-1990 DEFINITION C.griseus chaperonin (HSP60) non-functional pseudogene. ACCESSION M34562 KEYWORDS chaperonin; pseudogene. SOURCE C.griseus cell line CHO DNA, clone GC-1. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2726) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description IVS 1738 1824 HSP60 intron A BASE COUNT 915 a 513 c 579 g 719 t ORIGIN 1 catggtgatt tttctttaga tattctgggc ctcctgaaac tgacaagatt ctgtaaagca 61 aaagacataa tcaacaagat aagacagcag tgcaaacaat gggaaaagtt cttcaccaac 121 cccatatcta gccaagggca aatttctaaa acatacaaag aactgatgaa agcagtcaac 181 aaaacaccaa ataatcgccc tcactctccg ccaccagcct gctttgccag atgagaccag 241 tgtcccgggc gctggctcct catgtcactc gggcctatgc caatgatgta aaatgtgggg 301 caaatccttg atccttaaag cttcaagttg tagacctttt agccgatgct gtagctgtta 361 caatggggcc aaagggaaga acagtgatta ttgaacagag ttgaggaagt cccaaagtaa 421 caaaagatgg ggtcattttt acaaaggcaa ttgatctaaa ggatacatac aaataatatc 481 agagctaaac ttgttcaagt tgttgccaat aactcaaatg aagaggctgg gatggcacta 541 ccactgctac tgttctggca agttctattg ccaacgaagg ctttgagaag atcagcaaag 601 ggtctaatcc agtagaaatc tggagaagtg tgatgttggc tgttgatgct gtaattgctg 661 aactaacaaa acaatctaaa catgtgacaa cccatgaaga aatttctcag gttgctacaa 721 tttctgcaaa tggacacata gacattggaa acatcatatc tgatgcaatg aaacatgttg 781 gagggaaaga cgtcatcaca ctgaaggata gaaataccct gaatgatgaa ttagaaatta 841 ttggaggcat gaagtttgaa agaagatgta tttctcagac atcaataagt caaaaatgtg 901 aattccaaga tgcctatgtt ctattgagtg aaaagaaaat ttctattgtc gagcccatta 961 ctgctcatag gaaacaaatg gtgatatttg ctgaagatgt tgatggagaa gctctaagca 1021 cactggtttt gaacaggcta aaagctggtc ttcagatcgt agcagttaaa gctccaggtt 1081 tttggggaca ataggaagaa taagcttaaa gatacggcta ttgctactga tggtgagcaa 1141 tttggagaac aggggctaaa actaaatatt gatgatggtc aagttcatga tttaggaaaa 1201 tttggagagg tcattgtccc caaagatgat gccatgcttt tgaaaggaac tgatgactaa 1261 actcaaattg agaaaccaat tcaagaaatc actaagcagc tagacatcac aactagtgaa 1321 tatgaaaagg aaaagctaaa cgaaacactt gctaaacttt cagatggtgc agctgtgttt 1381 aaggttggag gaacaaggga tgttgaagtg aaggagaaga aagacagagt tacaaatgct 1441 ctcaatgcta caagagcagc tgttgaaaag ggcattgttt taggacttag ctgtgctctg 1501 cctctgtgca tcccagcctt ggattcctta aagccttcaa atgaagatca gaaaataggt 1561 atagaaatta ttaaaagagc actcaaaatt cctgcagtga caattgctga gaatgcaggt 1621 gttgaataat ctttgagagt tgagaaaatt ctgcagggtt cctagggagt tggttattag 1681 gctatgcccg caaattttgt gaacatggta gaaaaaggaa tcattaatcc aacaaaggta 1741 aatgagtcag caattcttaa acattggatg gcttgttcta tggtatgtgt tttcattaat 1801 cattttataa acattttttt gtaggttgta agaactgctt tagtagatgc ctccttgcta 1861 acttcagcag aagctgtagt gacagaaatt ccaaaagaag agaaggacct tggaatgcgc 1921 acacatgcct ttaatcctag cacttgcgag gcagaggcag gcagatatct gtgagttcga 1981 ggccagcctg gactccggag tgagtgccag gatagtctcc aaagctatcc aaagagaaac 2041 cctgtcttga aaaacaaaca aacaaaaaag acatttgtac attcctgaaa aaaacccaaa 2101 taatccaatt aaactagggg tcagcctaat aaagataccc gaataggttt ttacatctta 2161 agagaaaaat ctactaataa atgacttgga gcaggtcctt cacagacaat atttagtgag 2221 agtaggtagc ctcatgtggg ttagtagttc cagttgcttc aaaatgccct caccttttgt 2281 cataggctca tgtggaggat ctactccatt ggatggtgat attaaaagta gcacatgggt 2341 acatgcctta agatattaag atccccaagt aaaaactgca tctcagatgc tctctgtttc 2401 ttgggccatg atatgggcag tgagatttct gtcacacttg ccaggaaaca cttatgatct 2461 gtctgtcaca atctcttata ttgtcaagta cttaaccaaa cacaatcacc catgacctct 2521 catatcctcc tggacttaga caggaggcac aagtctactc aggtataaat atttttactt 2581 ctgatgcccc acagtaacct acaatggctt cagtccagta cacaggtaag taaacccacc 2641 aatataccat tccagctgat tgataacctc ttagatacat cccatatgct gcagttccca 2701 aatggagaca acccaaagtt tcttga // LOCUS HAMHSP70A 2243 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Chinese hamster heat shock protein (cognate form of hsp70) mRNA, complete cds. ACCESSION M34561 KEYWORDS heat shock protein; stress protein. SOURCE Chinese hamster cell line CHO, cDNA to mRNA, clone RS 23. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 2243) AUTHORS Ahmad,S., Ahuja,R., Venner,T.J. and Gupta,R.S. TITLE Identification of a protein altered in mutants resistant to microtubule inhibitors with the constitutive form of the major heat shock protein (hsp70) JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN% GUPTAR@,SSCVAX.McMASTER.CA FEATURES from to/span description pept 50 1990 heat shock protein (hsp70) mRNA < 1 2243 hsp70 mRNA signal 2220 2225 poly-A signal BASE COUNT 654 a 486 c 551 g 552 t ORIGIN 1 gttttgtgtg gtctcgtcgc cagcacagcc gggcctacag caagcaacca tgtctaaggg 61 acctgcagtt ggtattgatc ttggcaccac ctactcctgt gtgggtgtct tccagcatgg 121 aaaggtggag ataattgcca atgaccaggg taaccgaacc acgccaagct atgttgcttt 181 tactgacaca gaacgattaa ttggggatgc cgcgaagaat caggttgcaa tgaaccccac 241 caacacagtt tttgatgcca aacgtctgat cggacgtagg tttgatgatg ctgttgttca 301 gtctgatatg aagcactggc ccttcatggt ggtgaacgat gcaggcaggc ccaaggtcca 361 agtagaatac aaaggagagg caaaaagttt ctacccagag gaggtgtcct ctatggttct 421 gaccaagatg aaggaaattg cagaagccta tctcgggaag actgttacca atgctgtggt 481 cacagtgcca gcctacttca atgattctca gcgacaggct acaaaagatg ctggaactat 541 tgctggcctg aatgtcctgc gaattatcaa tgaaccaact gctgctgcta ttgcttatgg 601 gctagataag aaggttggag ctgaaaggaa tgtgctcatt tttgacttgg gaggtggtac 661 ttttgatgtg tctatcctca ctattgagga tggaattttt gaagtcaaat caacagccgg 721 agacacccac ttgggtggag aagactttga taacagaatg gtcaaccatt tcattgctga 781 gtttaagcga aacgacaaga aggacatcag cgagaacaag agagctgtca ggcgtctgcg 841 tactgcctgt gaacgggcaa agcgcaccct ctcctccagc acccaggcca gtattgagat 901 tgattctctc tatgagggaa ttgatttcta tacctccatt acccgtgctc gattcgaaga 961 attgaatgct gacctgttcc gaggcactct ggaccctgta gagaaggccc ttcgagatgc 1021 taaactagac aagtcacaga tccatgacat tgtcttggtg ggtggttcca ccagaatccc 1081 caagattcag aagcttctgc aagacttctt caatggaaaa gaactgaata agagcattaa 1141 ccccgatgaa gctgttgctt atggtgcagc tgtccaggcg gccattctat ctggagacaa 1201 gtctgagaat gttcaggatt tgttgctctt ggatgtcact cctctttccc ttggcattga 1261 aactgctggt ggagtcatga ctgtcctcat caagcgcaat accaccatcc ctaccaagca 1321 gacgcagact ttcactacct attctgacaa tcagcctggt gtgctcattc aggtctatga 1381 aggcgaaagg gccatgacca aggataacaa cctgcttgga aagtttgaac ttacaggcat 1441 ccctccagca ccccgtggtg ttcctcagat tgaagttact tttgacattg atgccaatgg 1501 catcctcaat gtctctgctg tagataagag cacaggcaag gagaacaaga tcactatcac 1561 caacgacaag ggccgcttga gcaaggagga tattgaacgc atggtccagg aagctgagaa 1621 gtacaaagct gaagatgaga agcagagaga taaggtttcc tccaagaatt cactggagtc 1681 ctatgctttc aacatgaaag caacagttga agacgagaaa cttcaaggca agatcaatga 1741 tgaggacaaa cagaagattc tagacaagtg taatgaaatc atcagttggc tggataagaa 1801 tcagactgca gagaaggaag aatttgaaca ccagcagaaa gaactggaga aagtctgcaa 1861 ccccatcatt accaagctgt accagagtgc tggtggcatg cctggaggaa tgcccggggg 1921 cttccctggt ggtggagctc ctccatctgg tggtgcttct tcagggccta ccattgaaga 1981 ggtggattaa gtcagtccaa gtagaggtgt agcattgttc cacagggaaa catttgaagg 2041 acccaaattt gcaaagtttc atggcagttt ttaaatttaa actgttaaca ataaattact 2101 gggcattctc aatacttgaa catggaacat gtacacagga aataaaacat tgctctttat 2161 aagcactgta tcctaagtgg aaaactcaat gtctaatatg gtggaaaatg gaatgtctaa 2221 ataaaactat ttaaattggc acc // LOCUS HAMTCP1A 1845 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Chinese hamster T-complex protein 1 (TCP-1) mRNA, complete cds. ACCESSION M34665 KEYWORDS T-complex protein 1. SOURCE Chinese hamster cell line CHO, cDNA to mRNA, clone pRG 501. ORGANISM Cricetulus griseus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 1845) AUTHORS Ahmad,S. and Gupta,R.S. TITLE Cloning of a Chinese hamster ovary cell protein homologous to the mouse T-complex protein TCP-1: Structural similarity to the ubiquitous "Chaperonin" family of heat shock proteins JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 01-JUN-1990. Author address: R.S.Gupta McMaster University 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR@,SSCVAX.McMASTER.CA FEATURES from to/span description pept 89 1759 T-complex protein 1 signal 1818 1823 poly-A signal BASE COUNT 530 a 368 c 472 g 475 t ORIGIN 1 gttccccgct gtggtggccg cagtccggcc tctgagcgga gatcccggcc gtccgcggtg 61 ttagcgaggc cagccaggtt ccctaacgat ggagggccct ttgtccgtgt tcggggaccg 121 cagcactgga gaggcgatcc gctcccagaa tgttatggct gcagcttcca ttgccaacat 181 tgtgaaaagt tctctggggc cagttggctt ggataaaatg ttggtggatg acattggtga 241 tgtaaccatt accaacgatg gggccaccat cctgaagtta ctggaggtag aacaccctgc 301 agctaaagtt ctttgtgagc tggctgactt gcaagacaaa gaagtgggag atggaactac 361 ctctgtggta attattgcag cggagcttct gaaaaatgca gatgaactag tcaaacagaa 421 aattcatcca acatcagtta ttagtggcta tcggcttgcc tgcaaggaag cagtgcgcta 481 tatcagtgag aacctaatta ttaacacaga cgaacttgga agagactgtc tgattaatgc 541 cgctaagaca tccatgtctt ccaaaattat tggaataaat ggtgatttct ttgctaatat 601 ggtggtagat gctgtacttg ctgtcaaata cacagatctg agaggccagc ctcgatatcc 661 agtcaattct gttaatattc tgaaagccca tggaagaagt caggtagaaa gcatgctgat 721 caatggctat gcactgaact gtgtggtggg atctcagggc atgcccaaga ggatagttaa 781 tgcaaaaatt gcatgtcttg acttcagcct gcagaaaaca aaaatgaagc ttggtgtgca 841 ggtggttatt acagatcctg agaaattgga ccaaattaga cagagagaat cagacatcac 901 caaggagaga attgaaaaga ttctggcgac tggggccaat gttattctga ccactggtgg 961 gatagatgat atgtgtctga agtactttgt ggaggctggt gccatggctg ttagacgagt 1021 tctaaaaagg gatctcaagc gtatcgctaa agcttctgga gcaagtatcc tgtccacact 1081 ggccaatttg gaaggcgaag aaacttttga agctaccatg ctggggcaag cagaagaagt 1141 ggtccaggag agaatttgtg atgatgagct gatcttaatc aaaaatacca aagctcgtac 1201 atctgcatca atcatcttac gaggagcaaa tgatttcatg tgtgatgaaa tggagcgatc 1261 tttacatgat gctctttgtg tggtgaagag agttttggag tcaaaatctg tagtcccagg 1321 aggaggtgct gtagaagccg ccctgtccat ataccttgaa aactatgcaa ccagtatggg 1381 gtctcgggaa cagcttgcta ttgcagagtt tgcaaggtct cttcttgtta ttcctaatac 1441 actggcagtg aatgctgccc aagactccac tgacctggtt gccaagttac gagcttttca 1501 caatgaggct caagttaacc cagaacgtaa aaacctgaag tggattggtc ttgatttgat 1561 caatggaaaa ccacgagaca acaagcaagc aggggtgttt gaaccaacca tagttaaagt 1621 gaagagcctg aagtttgcaa cagaagctgc aatcactatt cttcgaatcg atgatctgat 1681 aaaattacac ccggaaacta aagacgataa acatggaagt tatgaaaatg ctgttcactc 1741 tggagccctt gatgactgat tggatttccc ttttatttat agcagtgtca ggtgcaatgt 1801 agccttgggt atctcacatt aaagtacagt gagctgtcca cctga // LOCUS HUMHP601A 1941 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human chaperonin (HSP60) non-functional pseudogene 1. ACCESSION M34660 KEYWORDS chaperonin; pseudogene. SOURCE Human fetal liver DNA, clone PS2-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 111 to 1941) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1941) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description BASE COUNT 624 a 329 c 446 g 541 t 1 others ORIGIN 1 atatgaaata aagcctttat gtgctctacc taaattaata aatttattta ttttttttng 61 tatatttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct 121 gcctcgccat gcgacatgtc ctgccagcac cccacagaaa tgcttccatt ccattaccca 181 cagtctttca ccagatgaga ccagtgtcca ggctactggc tcctcacctc acttgaagtg 241 atggtaaaga tgtaaaattt ggtgctgatg tccgagcctt aatgcttaaa agtgtagacc 301 tttttgtcga tgtagccatt acaatggggc caaagggaag aacagtaatt attgaacata 361 gctgggcaag tcccaaagta acaaaagatg gtgtgactga tgcaaagtca attgacttaa 421 aagataaata taaaagcatt ggagctaaac ttgtccaaga tgttgccaat aacacagatg 481 aagagactgg ggatggcact atcacttgct gttgtactgg tatgctctat ttccaaataa 541 ggtttccaga aggttagcaa aggtgctaat ccagtggaaa tcaggagagg tgtatgttag 601 ctgatgctgt aattgctgaa cttaaaaagc agtctaaacc tgtgaccaaa cctgaagaaa 661 ttgcacaggt tgctacaatt tctgcaaatt gagacaaaga aattggcaac atcatctctg 721 atgcaatgaa aaaagttgga agaaagggca tcatcacagt aaaggatgga aaaacactga 781 ctgtagaatt agaaattatt gaaggcatga aatttgatcg aggatatatt tctccatact 841 ttattaatac atcaaaaggt gagaaatgtg aattccagga tgcctatgtt ctgttgcatg 901 aaaagcaaaa tttctacgtg ctccacgtcc attgtaactg ctcttgcaaa ttgccaatgc 961 ttactgtaag cctttggtca taattgctgg agatattgat ggagaagctc taactacact 1021 catcctgaat aggctaaagg ttggtcttca ggttgtggca gtcaaggctc cagggtttgg 1081 tgacaataga aagaaccagc ttaaagatac ggttattgct actggtggta cagtgtttgg 1141 agaagagggc tcacactaaa tcttgaagac gttcagcctc gtgacgtagg agagttggag 1201 aggtactgtg atcaagatat gccatgctct taaaagcaaa aggtaacaag tctcaaatta 1261 aaaatgtgtt caagaaatca ttgaccagtc agatgtcaca actatgtaat tgaaaaggaa 1321 aaagtgagtg gagaaacttt cagatggagt agctgtgctg aaggttggtg gcaagtgatg 1381 ttgaagttaa tgaagagaaa gacagagtta taggtgcact taatgctaca agagctgctg 1441 ttgaagaagg cattgtttag ggagggggtt gtgccctgct tcgatgcatt ccagccttgg 1501 actcattcac tccagctaat gaagataaaa taattggtat agaaattatt aaaagaacac 1561 tcaaaattcc agcaatgaca attcgtaaga atgcaggtgt tgatgcattt ttgatagttg 1621 agaaaattat gtaaagttcc tcagaagttg gttatgatac tatgttagga gatgtcgtga 1681 atatggtgga aaaagacatt attgacccaa caaaggttgt gagaactgct tcattggatg 1741 ctgctggcat gccctctata ttaactacag cagctgttgt agtcacagaa attcctaaag 1801 aagggaacag ccctggaatg ggtgcatgtg gaatgggagg tggcctgttc taattcctag 1861 aatagtgctt tacctttatt aatgaattgt gataggaacc caaggcagtg ttcctcacca 1921 ataacttcag agaagtcagt t // LOCUS HUMHP602A 1920 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human chaperonin (HSP60) non-functional pseudogene 2. ACCESSION M34662 KEYWORDS chaperonin; pseudogene. SOURCE Human fetal liver DNA, clone PS2-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 117 to 1920) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1920) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description BASE COUNT 619 a 324 c 431 g 544 t 2 others ORIGIN 1 tggattatgn aaatnaaagc ctttatgtgc tctgcctaaa ttaatatttt attttttttt 61 gatacttgct atgtcacctt tagaaagtaa gaggcagctc tcacttgctg ctgatgacct 121 gcctcacctc accatgcaca tgtcctgcca ccaccccaca gaaatgcttc acttacccac 181 agtctttcgc cagatgagac cagtgtccag ggtactggct cctcaccctc acttgaagga 241 tggtaaagat gtaaaatttg gtgctgatgt ctgcgcctta atgcttcaaa gtgtagactt 301 ttgtcagtaa ttattgaaca tagctgggaa gtcccaaagt aacaaaagat ggtctgactg 361 atgcaaagtc aattgactta aaggataaat ataaaagcat tggagctaaa cttctccaag 421 atgttgccta aaacacaaca aatggagaga ctggggatgg cactatcact gctgctgtac 481 tggtatgctc tatttccaaa taagccttcc agaaggttag caaaggtgct aatccagtgg 541 aaatcatcag agagtgtgat gttagctgtt gatgctgtaa ttgctgaact taaaaagcag 601 tctaaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga 661 cacaaagaaa attggtaaca tcatctctga tgcaatgaaa aagtttggaa gaaagggcat 721 catcacagta aaggatggaa aaacactgac tgatgaatta gaaattattg aaggcatgaa 781 aaatttgatc gaggatatat ttctccatac tttcttaata catcaaaagg tgagaaatgt 841 gaattccagg atgcctatgt tctactgagt gaaaagaaca tttctggtgt ccagtccatt 901 gtacctgctc ttgaaattgc cagtgcttac cttaagcctt tggtcacaat tgctgaagat 961 attgatgaag aaacgttaag tacgctcatc ttgaataggc taaaagttgg tcttcaggtt 1021 gtagcagcca aagtttcagg ttttggtgac aacggagcca gcttaaagat atggctattg 1081 ctactggtag tgcagtgttt ggagaagagg ggttaaccct aatcttgaag atgctcagcc 1141 tcatgactca ggaaaatttg gagaggtcat tgtgccaaag tagtgctatg tatttaaaag 1201 gaaaagatga caaagcttaa attgaaaaat gtattcaaga aatcattaag tagttagata 1261 tcataactag tggatatgaa aagaaaaaac tgaatgaata tatggcaaaa ctttcagatg 1321 gagtagccgt gttgatggtt ggttggtgag acaagtgatg ttcaggtgaa agacaagaaa 1381 gacagattta ccgatgtcct taatgctaca agagctgcta ttgaagaagg cattgttttg 1441 ggagggggtt gtgccctgct tcgagcattc cactcattca ctccagctaa tgaagataaa 1501 aataatggca tagaaattat taaaagaaca ctcaaaattc cagcaatgac tattgctaag 1561 aatgcaggtg ttgatttttg atagttgaga aaattatgca aagttcctca gaagttggtt 1621 atgatactat gttaggagat gtcgtgaata tggtggaaaa agacattatt gacccaacaa 1681 aggttgtgag aactgcttta ttggatgctg cttgcatggc tctctattaa ctacagcagc 1741 tgttgtagtc acaaaaattc ctaaagaagg gaacagccct ggaatgggtg caatgtgtgg 1801 aatgggaggt ggcctgttct aattcctaga atagtgcttt acctttatta atgaattgtg 1861 atagggagcc caaggcgtgt tcctcaccaa taacttcaga gaagtcagtt ggagaaaatg // LOCUS HUMHP603A 1912 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human chaperonin (HSP60) non-functional pseudogene 3. ACCESSION M34661 KEYWORDS chaperonin; pseudogene. SOURCE Human fetal liver DNA, clone PS5-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 99 to 1912) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1912) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description BASE COUNT 614 a 319 c 449 g 527 t 3 others ORIGIN 1 ctttatgtgc tcttanctaa attaataatt tatttanntt tttttgtata tttgctatgt 61 cacctttaga aagtaagagg cagctctcac ttgctgctga tgacctgcct cgccatgcac 121 atgtctgcca gcaccccaca gaaatgcttc cattacccac agtctttcac cagatgagac 181 cagtgtccag ggtactggct cctcacctca cttgaagtga tggtaaagat gtaaaatttc 241 ctgctgatgt ccgagcctta atgcttcaaa gtgtagactt ttttgtcgat gctgtagcca 301 ttacaatggg gccaaaggga agaacagtaa ttattgaaca tagctgggga agtcccaaag 361 taacaaaaga tggtgtaact gatgcaaagt caattgactt aaaggataaa tataaaagca 421 ttggagctaa acttgtccaa gatgttgcca ataacacaga tgaagagact gggggatggc 481 actatcactg ctgctgtact ggtatgctct atattcaaat aaggcttcca gaaggttagc 541 aaaggtgcta atccagtgga aatcaagaga ggtgtgatgt tagctgttga tgctgtaatt 601 gctgaactta aaaagcagtg taaacctgtg accaaacctg aaattgcaca ggttgctaca 661 atttctgcaa atggagacaa agaaattggt aacatctctg atgcaatgaa aaagtttgga 721 agaaagggca tcacagtaaa ggatggaaaa acactgactg atgaattaga aattattgaa 781 ggcatgaaaa tttgatcgag gatatatttc tccatacttt attaatacat caaaaggtga 841 gaaatgtgaa ttccaggatg cctatgttct gttgcatgaa aagaaaattt ctagtgtcca 901 gtccattgta actgctcttg aaattgccaa tgcttactgt aagcctttgg tcataattgc 961 tggagacatt gatggagaag ctctaactac actcatcctg aataggctaa aggttggtct 1021 tcaggttgtg gcagtcaaag ctccagggtt tgggtacaat agaaagaacc agcttaaaga 1081 tacggtcatt gctactggtg gtacagtgtt tggagaagag ggctgacact aaatcttgaa 1141 gacgttcagc ctcgtgacgt aggagaagtt ggagaggtca ctgtgaacaa agattatgcc 1201 atgctcttaa aaggaaaagg taacaagtct caaattgaaa aatgtgttca agaaatcatt 1261 gaccagtcag atgtcacaac tagtgaatac gaaaaggaaa aagtgagtgg agaaactttc 1321 agatggagta gctgtgctga aggttgttgg tgggacaagt gatgttgaag tgaatgaaga 1381 gaaagacaga gttacaggtg cacttaatgc tacaagagct gctgttgaag aaggcattgt 1441 ttagggaggg ggttgtgccc tgcttcgatg cattccagcc ttggactcat tcactccagc 1501 taatgaagat aaataattgg tatagaaatt attaaaagaa cactcaaaat tccagcaatg 1561 acaattgcta agaatgcagc tgttgatggc attttgatag ttgagaaaat tatgtaaagt 1621 gcctcagaag ttggttatga tactatgtta ggagatgtcg tgaatatggt gaaaaagaca 1681 ttattgaccc aacaaaggtt gtgagaactg cttcattgga tgctgctggc atggcctctc 1741 tattaactac agcagctgtt gtagtcacag aaattcctaa agaagagaac agccctggaa 1801 taggtgcaat gtgtggaatg ggaggtggcc tgttcaaatt cctagaatag tgctttacct 1861 ttattaatga attgtgatag gaacccaagg cagtgttcct caccaataac tt // LOCUS HUMHP604A 2019 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human chaperonin (HSP60) non-functional pseudogene 4. ACCESSION M34663 KEYWORDS chaperonin; pseudogene. SOURCE Human fetal liver DNA, clone PS5-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 123 to 2019) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 2019) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description BASE COUNT 657 a 327 c 479 g 556 t ORIGIN 1 agagaaatgt aatatgaaat aaagctttat gtgctctacc taaattaata atttatttat 61 ttttttttgt atatttgcta tgtcaccttt agaaagtaag aggcagctct cacttgctga 121 ttgatgacct gatgatgacc tgcctcgcca tgcacatgtc ctgccagcac cccacagaaa 181 tgcttccatt acccacagtc tttcaccaga tgagaccagt gtccaggcta ctggctactg 241 gctcctcacc tcacttgaag tgatggtaaa gatgtaaaat ttggtgctga tgtccgagcc 301 ttaatgcttc aaagtgtata gcttttgtcg agctgttagc cattacaaat gggccaaagg 361 gaagaacagt agattattga acatagctgg ggaagtccca aagtaacaaa agagtggtgt 421 gactgatgca aaggcaattg acttaaagga taaatataaa agcattggag ttaaacttgt 481 ccaagatgtt gccaataaca cagatgaaga agactgggga tggcactatc actgctgctg 541 tactggtatg ctctatttcc aaataaggct tccagaaggt tagcaaaggt gctaatccgt 601 ggaaatcagg agaggtgtga tgttagctgt tgatgctgta attgctgaac ttaaaaagca 661 gtctaacctg tgaccaaacc tgaagaaatt gcacaggttg ctacaatttc tgcaaatgga 721 gacaaagaaa ttggtaacat catctctgat gcaatgaaaa actttggaag aaagggcatc 781 atcacagtaa aggatggaaa aacactgact gatgaattag aaattattga aggcatgaaa 841 atttgatcga ggatatattt ctccatactt tattaataca tcaaaaggtg agaaatgtga 901 attccaggat gcctatgttc tgttgcatga aaagaaaatt tctagtgtcc agtccattgt 961 aactgctctt gaaattgcca atgcttagtg taagcctttg gtcatatatt gctggagaca 1021 ttgatggaga agctctaact acactcatcc tgaataagct aaaggttggt cttcaggttg 1081 tggcagtcaa agctaaagct ccagggtttg gtgacaatag aaagaaccag cttaaagata 1141 cggttattgc tactggtggt acagtgtttg gagaagaggg ctgacactaa atcttgaaga 1201 cgttcagcct cgtgacgtag gagaagttgg agaggtcact gtgatcaaga ttatgccatg 1261 ctcttaaaag gaaaaggtaa caagtctcaa attgaaaaat gtgtgcaaga aatcgttgac 1321 cagtcagatg tcacaactat gtaatacgaa aaggaaaaag tgagtgaaga aactttcaga 1381 tggagtagct gtgctgaagg ttggtgggac aagtgatgtt gaagtgaatg aagagaaaga 1441 cagagttata ggtgcactta atgctacaag agctgctgtt gaagaaggca ttgtttaggg 1501 agggggttgt gccctgcttc gatgcattcc agccttggac tcattcactc cagctaatga 1561 agataaaata attggtatag aaattattaa aagaacactc aaaattccag caatgacaat 1621 tgctaagaat gcaggtgttg atgcattttt gatagttgag aaaattatgt aaagttcctc 1681 agaagttggt tatgatacta tgttaggaga tgtcgtgaat atggtggaaa aagacattat 1741 tgacccaaca aaggttgtga gaactgcttc attggatgct gctgcatggc ctctctatta 1801 ttaactacag cagctgttgt agtcacagaa attcctaaag aagggaacag ccctggaatg 1861 ggtggaatgg gtgcaatgat atgtggaatg ggaggtggcc tgttctaatt cctagaatag 1921 tgctttacct ttattaatga attgtgatag gaaacccaag gcagtgttcc tcaccaataa 1981 cttcagagaa gtcagttgga gaaaatgaag aaaaaggct // LOCUS HUMHSP60A 2202 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human chaperonin (HSP60) mRNA, complete cds. ACCESSION M34664 KEYWORDS chaperonin. SOURCE Human placenta cDNA to mRNA, and DNA, clone PGEM-10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2202) AUTHORS Venner,T.J., Singh,B. and Gupta,R.S. TITLE Nucleotide sequence and novel structural features of human and Chinese hamster HSP60 (chaperonin) gene families JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.S.Gupta, 29-MAY-1990. Author address: R.S.Gupta McMaster University Dept of Biochemistry 1200 Main Street West Hamilton Ontario, CANADA L8N 3Z5 email: IN%GUPTAR.@SSCVAX.McMASTER.CA FEATURES from to/span description pept 25 1746 chaperonin (HSP60) BASE COUNT 699 a 371 c 538 g 594 t ORIGIN 1 cacgcttgcc gccgccccgc agaaatgctt cggttaccca cagtctttcg ccagatgaga 61 ccggtgtcca gggtactggc tcctcatctc actcgggctt atgccaaaga tgtaaaattt 121 ggtgcagatg cccgagcctt aatgcttcaa ggtgtagacc ttttagccga tgctgtggcc 181 gttacaatgg ggccaaaggg aagaacagtg attattgagc agggttgggg aagtcccaaa 241 gtaacaaaag atggtgtgac tgttgcaaag tcaattgact taaaagataa atacaagaac 301 attggagcta aacttgttca agatgttgcc aataacacaa atgaagaagc tggggatggc 361 actaccactg ctactgtact ggcacgctct atagccaagg aaggcttcga gaagattagc 421 aaaggtgcta atccagtgga aatcaggaga ggtgtgatgt tagctgttga tgctgtaatt 481 gctgaactta aaaagcagtc taaacctgtg accacccctg aagaaattgc acaggttgct 541 acgatttctg caaacggaga caaagaaatt ggcaatatca tctctgatgc aatgaaaaaa 601 gttggaagaa agggtgtcat cacagtaaag gatggaaaaa cactgaatga tgaattagaa 661 attattgaag gcatgaagtt tgatcgaggc tatatttctc catactttat taatacatca 721 aaaggtcaga aatgtgaatt ccaggatgcc tatgttctgt tgagtgaaaa gaaaatttct 781 agtatccagt ccattgtacc tgctcttgaa attgccaatg ctcaccgtaa gcctttggtc 841 ataatcgctg aagatgttga tggagaagct ctaagtacac tcgtcttgaa taggctaaag 901 gttggtcttc aggttgtggc agtcaaggct ccagggtttg gtgacaatag aaagaaccag 961 cttaaagata tggctattgc tactggtggt gcagtgtttg gagaagaggg attgaccctg 1021 aatcttgaag acgttcagcc tcatgactta ggaaaagttg gagaggtcat tgtgaccaaa 1081 gacgatgcca tgctcttaaa aggaaaaggt gacaaggctc aaattgaaaa acgtattcaa 1141 gaaatcattg agcagttaga tgtcacaact agtgaatatg aaaaggaaaa actgaatgaa 1201 cggcttgcaa aactttcaga tggagtggct gtgctgaagg ttggtgggac aagtgatgtt 1261 gaagtgaatg aaaagaaaga cagagttaca gatgccctta atgctacaag agctgctgtt 1321 gaagaaggca ttgttttggg agggggttgt gccctccttc gatgcattcc agccttggac 1381 tcattgactc cagctaatga agatcaaaaa attggtatag aaattattaa aagaacactc 1441 aaaattccag caatgaccat tgctaagaat gcaggtgttg aaggatcttt gatagttgag 1501 aaaattatgc aaagttcctc agaagttggt tatgatgcta tggctggaga ttttgtgaat 1561 atggtggaaa aaggaatcat tgacccaaca aaggttgtga gaactgcttt attggatgct 1621 gctggtgtgg cctctctgtt aactacagca gaagttgtag tcacagaaat tcctaaagaa 1681 gagaaggacc ctggaatggg tgcaatgggt ggaatgggag gtggtatggg aggtggcatg 1741 ttctaactcc tagactagtg ctttaccttt attaatgaac tgtgacagga agcccaaggc 1801 agtgttcctc accaataact tcagagaagt cagttggaga aaatgaagaa aaaggctggc 1861 tgaaaatcac tataaccatc agttactggt ttcagttgac aaaatatata atggtttact 1921 gctgtcattg tccatgccta cagataattt attttgtatt tttgaataaa aaacatttgt 1981 acattcctga tactgggtac aagagccatg taccagtgta ctgctttcaa cttaaatcac 2041 tgaggcattt ttactactat tctgttaaaa tcaggatttt agtgcttgcc accaccagat 2101 gagaagttaa gcagcctttc tgtggagagt gagaataatt gtgtacaaag tagagaagta 2161 tccaattatg tgacaacctt tgtgtaataa aaatttgttt aa // LOCUS MUSGCANF 4049 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Mouse guanylate cyclase/atrial natriuretic factor receptor mRNA, complete cds. ACCESSION J05504 KEYWORDS atrial natriuretic factor receptor; guanylate cyclase. SOURCE Mouse (strain C57B1/6J) Leydig cell line MA-10, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 4049) AUTHORS Pandey,K.N. and Singh,S. TITLE Molecular cloning and expression of murine guanylate cyclase/atrial natriuretic factor receptor cDNA JOURNAL J. Biol. Chem. 265, 12342-12348 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.N.Pandey, 31-MAY-1990. FEATURES from to/span description pept 348 3521 guanylate cyclase/atrial natriuretic factor receptor signal 4029 4034 poly-A signal BASE COUNT 879 a 1164 c 1140 g 866 t ORIGIN 1 cagaaaccct cccaaactcc tatagccaca cacacctttc ccggccaaga tccaaacaaa 61 cctctacttt cctcttccct aggagccaga ctcccttcgg gtgctgcgct cgctctcacc 121 tgctctaaag cacctccgct ctcggacgct cccaattccg ccctcctgct cgacggcggg 181 acagtcgcag cctcggcagg cagcttgctc tcgccgctgc ggcttcaacc cagccccctc 241 cctcgctacg gctgggcgct cttgactccc gaccctcgcc tctgagcccg aggacggcga 301 tcagaccatg gtgacagcgc tgctccgtcg ctgcgctcgc tgaggccatg ccgcgttccc 361 gacgcgtccg tccgcgccta agggcgctgc tgctgctacc gccgctgctg ctgctccgaa 421 gcggccacgc gagcgacctg accgtggctg tggtgctgcc cgtgaccaac acctcgtacc 481 cgtggtcctg ggcgcgtgta gggccggcgg tggaactggc tctcgggagg gtgaaggctc 541 ggccggactt gctgccgggt tggacggtcc gtatggtgct gggcagcagc gagaacgcgg 601 cgggcgtctg ctccgacacc gctgcaccgc tggccgcggt ggatctcaag tgggagcaca 661 gccccgccgt gttcctgggc cccggctgcg tatactctgc tgccccggtg gaccgcttca 721 ccgcgcactg gcggttgccg ctgctgacgg ctggcgcccc ggctctgggc atcggggtga 781 aggatgagta cgcgttaacc acccgcacag gacccagcca tgtcaagctg ggcgacttcg 841 tgacggcgct gcatcgacgg ctgggctggg agcaccaggc gcttgtgctc tatgcagatc 901 ggctgggcga cgaccggccg tgcttcttca tagtggaggg gctgtacatg cgggtgcgtg 961 agcgactcaa catcacagta aatcaccagg agttcgtcga gggcgacccg gaccactaca 1021 ccaagctact gcggaccgtg cagcgcaagg gcagagttat ctacatctgc agttctccgg 1081 atgccttcag gaatctgatg cttttggccc tggatgctgg cctgactggg gaggactatg 1141 ttttcttcca cctggatgtg tttgggcaaa gccttcaggg tgctcagggc cctgttccag 1201 agaagccctg ggaaagagac gatgggcagg ataggagagc ccgccagcgc tttcaggctg 1261 caaaaattat tacttacaaa gaacccgata atcctgagta cttggaattc ctgaagcagc 1321 taaaactctt ggctgacaag aaattcaact tcaccatgga ggatggcctg aaaaatatca 1381 tcccagcatc cttccatgac gggctcctgc tctatgtcca ggcagtgaca gagactctgg 1441 cacagggggg cactgtcact gatggagaga acatcactca gcggatgtgg aaccgaagct 1501 tccaaggtgt gacaggatac ctgaaaattg atagaaatgg agatcgggac actgattctc 1561 ctctctggga tatggacccc gagacaggtg ccttcagggt tgtcctgaac tttaatggta 1621 cttcccagga gctgatggct gtgtcagaac acagattata ctggcctctg ggatacccac 1681 ctcctgacat ccctaaatgt ggctttgaca atgaggaccc agcctgcaac caagaccact 1741 tttccacact ggaggttctg gctttggtgg gcagcctctc tctggttagc tttctgatcg 1801 tgtctttctt catatacagg aagatgcagc tggaaaagga gctggtctca gagttgtggc 1861 gggtgcgctg ggaggacttg cagcccagca gcctggagag gcaccttcgg agcgctggca 1921 gtcggctgac cctgagtggg cgaggctcca attatggctc cctgctaacc acggagggcc 1981 agttccaagt ctttgccaag acagcatact ataagggcaa cctcgtggct gtgaaacgtg 2041 tgaaccggaa acgcattgag ttgacacgaa aagtcctgtt tgaacttaaa catatgcggg 2101 atgtgcagaa tgagcaattg accagatttg tgggagcttg taccgaccct cccaacatct 2161 gtatcctcac agagtactgt ccccgtggaa gcctacagga cattctagag aatgagagta 2221 ttaccctgga ctggatgttt cggtactcac tcaccaatga cattgtcaag ggaatgctct 2281 ttctacacaa cggggccatt tgttcccatg ggaacctcaa gtcatccaac tgcgtggtag 2341 atggacgttt tgtgttaaag atcacagact atgggctcga gagcttcaga gacccggagc 2401 cagagcaagg acacaccctc tttgccaaaa aactgtggac tgcacctgag ctcctgcgaa 2461 tggcttcccc acctgcccgt ggctcccaag ctggggatgt ctacagtttt ggtatcatcc 2521 ttcaggaaat tgccctaaga agtggggtct tctatgtgga aggtttggac ctcagcccaa 2581 aagagatcat tgagcgtgtg actcggggtg agcagccccc attccgacct tccatggatc 2641 tgcagagcca cctggaggaa ctggggcagc tgatgcagag gtgctgggca gaggatcctc 2701 aggagcggcc accctttcaa cagatccgcc tggcgctgcg caagttcaac aaggagaaca 2761 gcagcaacat cctggacaac ctgctgtcac gcatggaaca gtacgccaac aacctggagg 2821 aactggtaga ggagagaaca cagccttatc tggaggagaa gcgcaaagct gaggccctgc 2881 tttaccagat tctgcctcac tctgtggctg agcagctgaa gagaggcgag acagtccagg 2941 ctgaggcatt tgatagtgtt actatctatt tcagtgatat cgtgggcttt acagctcttt 3001 cagcagagag cacacccatg caggtggtca ccctgctcaa tgatctgtac acctgttttg 3061 atgctgtcat agacaacttt gatgtgtaca aggtagagac cattggtgat gcttacatgg 3121 tggtatcagg gctcccagtg aggaatggac agctccatgc ccgagaggta gcccgaatgg 3181 cacttgcact gctcgatgct gtacgctcct tccgcatcgg ccataggccc caggaacagc 3241 tgcgcttgcg cattggaatt cacacaggtc ctgtgtgtgc tggtgtggta gggctaaaga 3301 tgccccgata ctgcctcttt ggagacacag tcaacacagc ttcaagaatg gagtctaatg 3361 gggaagccct caggatccac ttgtcttcgg agaccaaggc tgtgctggaa gagttcgatg 3421 gtttcgagct ggagctccga ggggatgtgg aaatgaaggg caaaggcaag gttcgttcct 3481 attggctcct cggggaccgg ggatgcagct ctcgagcctg acctactgcc ctgctattcc 3541 ttgtcacctc ccctccctat cccagcaatg acacgggtct ccaacttccc cctctcccac 3601 agcagctcag ccactgtgga aagattaggg acctaaccag cgcagtcatc agatgtgacc 3661 tctgagagag gatggagatg gtggggactg gagggggact cctaagttta tagggctgac 3721 tgaaataccc agtcactccc gtagcacatg ccccgccccc cccccgcccc cccactcagc 3781 tgcctagcag acagtgattc cttctgccgc cctcaactta gctccactgt gagttagagg 3841 gagggaaatt gccacctgaa ggaaagagaa aagagattct cggggtttgc aggaggcagg 3901 cagtcctgtg tcacaaatac tcccctcact cccagtccac cacctgcccc accgacttcc 3961 cttcccacac agtgcactga ggagaagaga ggcatggggt tgccttgctt ctcctatgag 4021 caaaacccat taaagtcttt attcctgtg // LOCUS MUSMHH2DR 1415 bp ss-mRNA ROD 08-AUG-1990 DEFINITION Mouse MHC class I H-2Dr protein mRNA, complete cds. ACCESSION M34961 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Mouse (strain B10.RIII) spleen, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1415) AUTHORS Zeling,C. and Pease,L.R. TITLE Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Zeling, 29-MAY-1990. Author address: C.Zeling Mayo Clinic 323 Guggenheim 200 1st St. SW Rochester, MN 55905 email: Pease@mayo.edu FEATURES from to/span description pept 1 1089 MHC H-2Dr protein precursor sigp 1 72 MHC H-2Dr protein signal peptide matp 73 1086 MHC H-2Dr protein BASE COUNT 298 a 398 c 442 g 277 t ORIGIN 1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact 61 cagacccgcg cgggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc 121 ctcggggagc cccggtacat agaagtcggc tacgtggacg acacggagtt cgtgcgcttc 181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag 241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagat tttccgagtg 301 aacctgagga ccctgctcag ctactacaac cagagcaagg gcggctctca cacgatccag 361 tggatgttcg gctgtgacgt ggagtcggac gggcgcctcc tccgcgggta ccagcagtac 421 gcatacgacg gctgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg 481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg ctggtgctgc agagagaaag 541 agggcctacc tagagggcac gtgcgtggag tggctcctca gacacctgaa gaacgggaac 601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaaa 661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc 721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca 781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctctcgggaa ggagcagaat 841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct 901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg 961 gccattattg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa 1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt 1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc 1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc 1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc 1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga 1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag 1381 aataagaatt tgaatgtggg tggctggaga gatgg // LOCUS MUSMHH2DS 1415 bp ss-mRNA ROD 08-AUG-1990 DEFINITION M.musculus MHC class I H-2Ds protein mRNA, complete cds. ACCESSION M34962 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE M.musculus (strain B10.RIII) spleen, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1415) AUTHORS Zeling,C. and Pease,L.R. TITLE Diversity among the H-2D/L loci: The structure of H-2Dr and H-2Ds JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Zeling, 29-MAY-1990. Author address: C.Zeling Mayo Clinic 323 Guggenheim 200 1st St. SW Rochester, MN 55905 email: Pease@mayo.edu FEATURES from to/span description pept 1 1089 MHC H-2Dr protein precursor sigp 1 72 MHC H-2Dr protein signal peptide matp 73 1086 MHC H-2Dr protein BASE COUNT 292 a 401 c 444 g 278 t ORIGIN 1 atgggggcga tggctccgcg cacgctgctc ctgctgctgg cggccgccct ggccccgact 61 cagacccgcg ctggccgaca ctcgctgagg tatttcgtca ccgccgtgtc ccgacccggc 121 ctcggggagc cccggtacat agaagtcggc tacgtggaca acacggagtt cgtgcgcttc 181 gacagcgacg cggagaatcc gagagttgag ccgcgggtgc ggtggatgga gcaggtggag 241 cccgagtatt gggagcggaa cacacagatc gccaagggca atgagcagag tttccgagtg 301 aacctgagga ccgcgctccg ctactacaac cagagcaagg gcggctctca cacggtccag 361 gtgatgtctg gctgtgacgt ggggtcggac gggcgcctcc tccgcgggta ccagcagcac 421 gcatacgacg gccgcgatta catcgccctg aacgaagacc tgaaaacgtg gacggcggcg 481 gacacggcgg cgcagatcac ccgacgcaag tgggagcagg gtggtgatgc agagcattac 541 aaggcctacc tagagggcac gtgcgtggag tggctcctca gatacctgga gctcgggaac 601 gagacgctgc tgcgcacaga ttccccaaag gcacatgtga cccatcaccc cagatctaga 661 ggtgaagtca ccctgaggtg ctgggccctg ggcttctacc ctgctgacat caccctgacc 721 tggcagttga atggggagga gctgacccag gacatggagc ttgtggagac caggcctgca 781 ggggatggaa ccttccagaa gtgggcagct gtggtggtgc ctcttgggaa ggagcagaat 841 tacacatgcc atgtgcacca tgaggggctt cctgagcccc tcaccctgag atgggagcct 901 cctccgtcca ctgactctta catggtgatc gttgctgttc tgggtgtcct tggagctatg 961 gccattactg gagctgtggt ggcttttgtg atgatgatga ggagaaacac aggtggaaaa 1021 ggaggggact atgctctggc tccaggctcc cagagctctg aaatgtctct ccaagattgt 1081 aaagcgtgaa gacagctgcc tggactgcac tgagtgacag ccagtgtgtt caggtctctc 1141 ctgtgacatc cagagccctc agttctcttt acacaacgtt gtctgatgtt ccctgtgagc 1201 ttgggttcag tgtgaagaac tgtggagccc agcctgccct gcacaccagg accctatccc 1261 tgcactgccg tgtgttccct tccatagcca accttgctgc tccagccaaa cactggggga 1321 catctgcatc ctgtaagctc catgctaccc tgagctgcag ctcctcactt ccacactgag 1381 aataagaatt tgaatgtggg tggctggaga gatgg // LOCUS CREAPCYN 577 bp ss-mRNA PLN 08-AUG-1990 DEFINITION C.reinhardtii apoplastocyanin (PC6-2) mRNA, complete cds. ACCESSION J05524 KEYWORDS apoplastocyanin. SOURCE C.reinhardtii (strain 2137) vegetative cell, cDNA to mRNA, clone PC6-2. ORGANISM Chlamydomonas reinhardtii Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae. REFERENCE 1 (bases 1 to 577) AUTHORS Merchant,S., Hill,K., Kim,J.H., Thompson,J., Zaitlin,D. and Bogorad,L. TITLE Isolation and characterization of a complementary DNA clone for an algal pre-apoplastocyanin JOURNAL J. Biol. Chem. 265, 12372-12379 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Hill, 11-MAY-1990. FEATURES from to/span description pept 22 459 apoplastocyanin (PC6-2) precursor sigp 22 162 apoplastocyanin signal peptide matp 163 456 apoplastocyanin BASE COUNT 91 a 189 c 171 g 126 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccgta tcactttaaa aatgaaggct actctgcgtg cccccgcttc ccgcgccagc 61 gctgtgcgcc ccgtcgccag cctgaaggcc gctgctcagc gcgtggcctc ggtcgccggt 121 gtgtcggttg cctctctggc cctgaccctg gctgcccacg ccgacgccac cgtcaagctg 181 ggcgctgact ctggtgctct ggagttcgtc cccaagaccc tgaccatcaa gtccggcgag 241 accgtgaact tcgtgaacaa cgctggcttc ccccacaaca tcgtcttcga cgaggatgcc 301 atcccctccg gcgtgaacgc tgatgccatc tcccgcgatg actacctgaa cgcccccggc 361 gagacctact cggtgaagct gaccgctgcc ggcgagtacg gctactactg cgagccccac 421 cagggcgctg gcatggtcgg caagatcatt gtccagtaaa ttgctggcgg ctgccttcat 481 tttgtgaccg tgtgtgtttc ggggtgtggg gtcgggggtt tttgcggcgt ccggatggac 541 gcagagagcg tgtagctctg taactttttc ggaattc // LOCUS HUMTNC1 1085 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human fast skeletal muscle troponin C gene, exon 1. ACCESSION M33771 J05502 KEYWORDS troponin C. SEGMENT 1 of 2 SOURCE Human HeLa cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1085) AUTHORS Gahlmann,R. and Kedes,L. TITLE Cloning, structural analysis, and expression of the human fast-twitch skeletal muscle troponin C gene JOURNAL J. Biol. Chem. 265, 12520-12528 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Gahlmann, 20-APR-1990. FEATURES from to/span description pept 1033 + 1035 troponin C, exon 1 pre-msg 968 > 1085 troponin C mRNA and introns (alt.) pre-msg 969 > 1085 troponin C mRNA and introns (alt.) IVS 1036 > 1085 troponin C intron A BASE COUNT 266 a 276 c 306 g 237 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcccctg agcctaggag ttcgagacca gcctgggcaa catggtgaaa ccccatctct 61 ataaaaatac aaaaaaaaaa aaagccaggg gtggtggtgc acctctagtc ccagctactc 121 gggaggctga agtgggagga tcacttgagc ccaggagata gaggctgcag tgagctataa 181 tagcaccact gcacatcagc cttggtgaca gagtgagtat agacctttat aatttacaaa 241 gctctgcagt caagtgggga taatagttgc tacctcatag gaggaacact agatgagatc 301 atgttaaaat gcttagcaca gtgtgtggca cgtcatgggg ttcagtaaac agtggcctac 361 ctgtcacttt ctcctcctat cccctgcccc ttaccatggt tcaaagacgg gacattcctc 421 ttgccaagca ccaacgctgt gtacccagca caagctaggt gacttatacc cgggttccca 481 tttaccatgg tctctctgtg agggagaccc cacagacagg tccaatccca cagatgaaga 541 aatggctcag agaggcactg ctaccggccc aaggtcacac aactcgcaga ggctggcctt 601 ggatcttggg ttttggactt caaggcttgt actctcccca ttttaccatg ctgcctcacc 661 ctttactact ctgatcatcc ttaccatctt ggagccagtg tgcttcctcg gtcgggggga 721 ctccaaggca gtggaacagg gcaggccttc ctcacaagaa tctaggacgt caaggcctgc 781 cacctgcttg gaggcttaaa tttctctgca agggcccttg gctaaattag gtaatgggtt 841 cagactgtgg gaggggtggg actcgctgac cccaggatct gattgggcag ggtctccagt 901 gctggggagc agggaggtgg gaggggaggg tgcccctaca aatcccgggg gctagagcag 961 gccaggtcat ctttgggtgg tggagtgcaa aggaggcgac ctgcaacaga ggagtcccgg 1021 tcaccagcaa ccatggtaag gacaagaggg gactttgtgc ctttactcac gggagcactg 1081 tggac // LOCUS HUMTNC2 1773 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human fast skeletal muscle troponin C gene, exons 2-6. ACCESSION M33772 J05502 KEYWORDS troponin C. SEGMENT 2 of 2 SOURCE Human HeLa cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1773) AUTHORS Gahlmann,R. and Kedes,L. TITLE Cloning, structural analysis, and expression of the human fast-twitch skeletal muscle troponin C gene JOURNAL J. Biol. Chem. 265, 12520-12528 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Gahlmann, 20-APR-1990. FEATURES from to/span description pept + 51 102 troponin C, exon 2 247 390 troponin C, exon 3 479 593 troponin C, exon 4 758 894 troponin C, exon 5 1506 1537 troponin C, exon 6 pre-msg < 1 1668 troponin C mRNA and introns IVS < 1 50 troponin C intron A IVS 103 246 troponin C intron B IVS 391 478 troponin C intron C IVS 594 757 troponin C intron D IVS 895 1505 troponin C intron E BASE COUNT 358 a 455 c 628 g 331 t 1 others ORIGIN About 2.5 kb after segment 1. 1 ctgctctggg gctgcctctt cagctcagac tttctctgtg tctcctgcag acggaccagc 61 aggctgaggc caggtcctac ctcagcgaag agatgatcgc tggtgagtga ggcgcggcag 121 gctgggggct gggtggatgt atgctgggga catgttggcc tctaggcagg gtgggggttg 181 ggggggggta ggtgtgaggc tgacagtcca gcaggcctca cctctgccct ctgccctccc 241 tcgcagagtt caaggctgcc tttgacatgt ttgatgctga tggtggtggg gacatcagcg 301 tcaaggagtt gggcacggtg atgaggatgc tgggccagac acccaccaag gaggagctgg 361 acgccatcat cgaggaggtg gatgaggacg gtgagcgggt gtccctcgga ggcaggggat 421 ggtggggaga ggcggcagcg gctgggctca ggctcagtct gccacctgct gcccgcaggc 481 agcggcacca tcgacttcga ggagttcttg gtcatgatgg tgcgccagat gaaagaggac 541 gcgaaaggga agagcgagga ggagctggcc gagtgcttcc gcatcttcga caggtgcgct 601 gggggcccgg gagccgaggg aggggcttag cagtcagaac ccggccggca gcgctggaag 661 cttccccgtg ttgggagtgc ggaagcgggg cgtcgagtgt ggtggccagg gcagctcgcc 721 ccggccctga gccctgccct gtccctcgga cccgaaggaa tgcagacggc tacatcgacc 781 cggaggagct ggctgagatt ttcagggcct ccggggagca cgtgacggac gaggagatcg 841 aatctctgat gaaagacggc gacaagaaca acgacggccg cattgacttc gacggtgagg 901 gccaacggga gcttgggaag agcgggtggg agccccagag gacggcgggc actggtgcct 961 ggctccggtg cagtgtcctt ggccttgtgg gctgggctgg gcgtaacttc accaagttcc 1021 ccatagtctc ctgggggcag gtatagaagc gctccacgac tctgtgcagt ttatcaatta 1081 atgaccaggc aggggggcgt ggtggctcag gactgtaaaa tcctagaact ttgggaggat 1141 gaggtgggag gattgcttga ggccaagagt tcgagaccag cctgggcaac atagtgagac 1201 cctgtcttta caaaaaataa aataaaataa aaaggcaggt gtggtggcac acctgtggtc 1261 ccagttgctc cgggggctga ggtaagagga tcgcttgagc ctaggagatc aaggctgcaa 1321 tgagttatga ttgtgccact gcactctacc ctgggtgaca gaccttggcg accttggcaa 1381 cagagtgaga ctctgtctca acaaaaacaa aaacaaaaag tcatcccctc tgtgtggctg 1441 gagccctggc ctggcctctg cgtccccagt gggacccctg accctctccc tctcctttcc 1501 ctcagagttc ctgaagatga tggagggcgt gcagtaagga gtggacagtc gcctctacca 1561 agatcgcgtg tccctagggt gtgggagact ccgccctgcc gggtctccac cagggaggcg 1621 cggccccttg tgggtctttg tctggaagga ataaaagcaa atgttccaaa actcgtggcc 1681 tgaatgaggg nagagccgag gacagcaggt cgcaagggct gtggtccagg gcgccatccc 1741 ggctggcacc acccggagca cccttgggga tcc // LOCUS ENTLACBOHI 1164 bp ds-DNA BCT 08-AUG-1990 DEFINITION Plasmid PDS075 (from E.cloacae) beta-lactamase (Ohio-1) gene, complete cds. ACCESSION M33655 KEYWORDS beta-lactamase. SOURCE Plasmid PDS075 (from Enterobacter cloacae) DNA. ORGANISM Plasmid PDS075 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1164) AUTHORS Shlaes,D.M., Currie-McCumber,C., Hull,A., Behlau,I. and Kron,M. TITLE The Ohio-1 beta-lactamase is part of the SHV-1 family JOURNAL Antimicrob. Agents Chemother. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.M.Shlaes, 10-APR-1990. FEATURES from to/span description pept 143 1003 Ohio-1 beta-lactamase signal 56 62 -35 region signal 80 85 -10 region binding 126 131 ribosome binding site BASE COUNT 233 a 345 c 357 g 229 t ORIGIN 1 cgagtaatac tcacatggca ctgacgcaaa atcatcgatt tattcaacaa agccattgtg 61 aatcagcaaa acgccgggtt attcttattt gtcgcttctt tactcgcctt tatcggccct 121 cactcaagga tctaccgtgg ttatgcgtta ttttcgcctg tgtattatct ccctgttagc 181 caccctgccg ctgcgggtac acgccggacc gcagccgctt gagcaaatta aactaagcga 241 aagccagctg tcgggcagcg taggcatgat agaaatggat ctggccaggc ccggcacgct 301 gaccgcctgg cgcgccgatg aacgctttcc catgatgagc acctttaaag tagtgctctg 361 cggcgcaggt ctggcgcggg tggatgccgg tgacgaacag ctggagcgaa agatccacta 421 tcgccgacag gatctggtgg actactcgcc ggtcagcgaa aaacaccttg ccgacggcat 481 gacggtcggc gaactctgtg ccgccgccat taccatgagc gataacagcg ccgccaatct 541 gctgctgcca gccgtcggcg gccccgcagg attgactgcc tttttgcgcc agatcggcga 601 caacgtcacc cgccttgacc gctgggaaac ggaactgaat gaggcgcttc ccggcgacgc 661 ccgcgacacc actaccgccc gcagcatggc cgcgaccctg cgcaagctgc tgaccagcca 721 gcgtctgagc gcccgttcgc aacggcagct gctgcagtgg atggtggacg atcgggtcgc 781 cggacgtttg atccgctccg tgctgccggc gggctggttt atcgccgata agaccggagc 841 tggcgaacgg ggtgcgcgcg ggattgtcgc cctgcttggc ccgaataaca aagcagagcg 901 gattgtggtg atttatctgc gggatacgcc ggcgagcatg gccgagcgaa atcagcaaat 961 cgccgggatc gccggggcgc tgatcgagca ctggcaacgc taacccggcg gtgggcgcgc 1021 gcgttatccg gctcgtagca cctcgcaggc gtgccggcga tatgactggc ggcggcatcg 1081 gagagatggc ttctggctaa tgatggtcgg tgcaacctgg gtgaaaggta acgccataaa 1141 cggtggccac ctgattgtgt ttct // LOCUS HUMTRPY1B 2609 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human tryptase-I gene, complete cds. ACCESSION M33494 KEYWORDS serine protease; tryptase-I. SOURCE Human adult skin DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2609) AUTHORS Vanderslice,P., Ballinger,S., Tam,E.K., Goldstein,S.M., Craik,C.S. and Caughey,G.H. TITLE Human mast cell tryptase: Multiple cDNAs and genes reveal a multigene serine protease family JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Vanderslice 02-APR-1990. FEATURES from to/span description pept 458 518 tryptase-I, exon 1 673 844 tryptase-I, exon 2 954 1219 tryptase-I, exon 3 1346 1509 tryptase-I, exon 4 1595 1759 tryptase-I, exon 5 pre-msg 225 2072 tryptase-I mRNA IVS 248 457 tryptase-I, intron A IVS 519 672 tryptase-I, intron B IVS 845 953 tryptase-I, intron C IVS 1220 1345 tryptase-I, intron D IVS 1510 1594 tryptase-I, intron E signal 131 135 pot. CAAT box signal 197 201 pot.TATA box BASE COUNT 422 a 941 c 776 g 470 t ORIGIN 1 accagctgac aggtggagct gccagtctcc agtgctcagc cctcagcggg gcctgcctgg 61 cagccccaca cacagagggc atcggggtgg cgggggcacg tgttacacgg gggccctggg 121 tctgagtcat ccacttcctc cgagtctgga tgggaggacc cagcgcccct cctccgcccc 181 ctcctgatct ggaaggataa atggggaggg gagagccact gggtagaagg aacagggagt 241 ggccagggta agtccccact ctcagagacc ctgacatcag cgtcacctgg agcagagtgg 301 cccagcctca gactcagagc accaagaccc aggcccgcag gcctggaccc accccggtcc 361 ccccgtccca gctccattct tcaccccaca atctgtagcc cccagccctg ccctgtgagg 421 cccggccagg cccacgatgc tcctccttgc tccccagatg ctgaatctgc tgctgctggc 481 gctgcccgtc ctggcgagcc gcgcctacgc ggcccctggt gagtcccagc cggggtccac 541 cctgcccctc accacattcc acaggtcagg gcctgggtgg gttctgggga ggtcgggctg 601 gccccccaca cagggaaggg ctgggcccag gcctggggct gcttcctggt cctgacctgg 661 cacctgcccc agccccaggc caggccctgc agcgagtggg catcgtcggg ggtcaggagg 721 cccccaggag caagtggccc tggcaggtga gcctgagagt ccacggccca tactggatgc 781 acttctgcgg gggctccctc atccaccccc agtgggtgct gaccgcagcg cactgcgtgg 841 gaccgtgagt ctcccggggc ctggaggggt ggggaagggc tggatgtgag ccctggctcc 901 cgggtgctcc tgggggctgc ccagggccct gagtgggatc ctccgctgcc cagggacgtc 961 aaggatctgg ccgccctcag ggtgcaactg cgggagcagc acctctacta ccaggaccag 1021 ctgctgccgg tcagcaggat catcgtgcac ccacagttct acaccgccca gatcggagcg 1081 gacatcgccc tgctggagct ggaggagccg gtgaacgtct ccagccacgt ccacacggtc 1141 accctgcccc ctgcctcaga gaccttcccc ccggggatgc cgtgctgggt cactggctgg 1201 ggcgatgtgg acaatgatgg tgggtctggg gacagtggag gtggggccag ggtcttagcc 1261 acagcccagc ccctgggtcc ctctgggctc caggtggggg ttgcccggcc ccctcctgag 1321 gctgcaccct cttccccacc tgcagagcgc ctcccaccgc catttcctct gaagcaggtg 1381 aaggtcccca taatggaaaa ccacatttgt gacgcaaaat accaccttgg cgcctacacg 1441 ggagacgacg tccgcatcgt ccgtgacgac atgctgtgtg ccgggaacac ccggagggac 1501 tcatgccagg tgggccccgc ctgtcccccg ccccccgccc cccaaccccc actcccaggc 1561 ctgttcggcg agcgctgacc tctgaccttc ccagggcgac tccggagggc ccctggtgtg 1621 caaggtgaat ggcacctggc tgcaggcggg cgtggtcagc tggggcgagg gctgtgccca 1681 gcccaaccgg cctggcatct acacccgtgt cacctactac ttggactgga tccaccacta 1741 tgtccccaaa aagccgtgag tcaggcctgg gttggccacc tgggtcactg gaggaccaac 1801 ccctgctgtc caaaacacca ctgcttccta cccaggtggc gactgccccc cacaccttcc 1861 ctgccccgtc ctgagtgccc cttcctgtcc taagccccct gctctcttct gagccccttc 1921 ccctgtcctg aggacccttc cctatcctga gcccccttcc ctgtcctaag cctgacgcct 1981 gcaccgggcc ctccagccct cccctgccca gatagctggt ggtgggcgct aatcctcctg 2041 agtgctggac ctcattaaag tgcatggaaa tcactggtgt gcatcgctgt gtttctggtt 2101 gtggatgtca ctgggagaga aggggtccag gtgtgctgag gacacctgcc acagtgtgag 2161 gtcctagccc tcaaggcaca gccagtcacc gtgggaccac tggaggacca acccctgctg 2221 tccaaaacac cactgcttcc tacccaggtg gcgactgccc cccacacctt ccctgccccg 2281 tcctgagtgc cccttcctgt cctaagcccc ctgctctctt ctgagcccct tcccctgtcc 2341 tgaggaccct tccctatcct gagccccctt ccctgtccta agcctgacgc ctgcaccggg 2401 ccctccagcc ctcccctgcc cagatagctg gtggtgggcg ctaatcctcc tgagtgctgg 2461 acctcattaa agtgcatgga aatcactggt gtgcatcgct gtgtttctgg ttgtggatgt 2521 cactgggaga gaaggggtcc aggtgtgctg aggacacctg ccacagtgtg aggtcctagc 2581 cctcaaggca cagccagtca ccgtgggac // LOCUS HUMTRY1A 1137 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human tryptase-I mRNA, 3' end. ACCESSION M33491 KEYWORDS serine protease; tryptase-I. SOURCE Human adult skin, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1137) AUTHORS Vanderslice,P., Ballinger,S.M., Tam,E.K., Goldstein,S.M., Craik,C.S. and Caughey,G.H. TITLE Human mast cell tryptase: Multiple cDNAs and genes reveal a multigene serine protease family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3811-3815 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Vanderslice, 02-APR-1990. FEATURES from to/span description pept < 1 824 tryptase-I (AA at 3) signal 1119 1124 poly-A signal BASE COUNT 196 a 410 c 328 g 203 t ORIGIN 1 tgaatctgct gctgctggcg ctgcccgtcc tggcgagccg cgcctacgcg gcccctgccc 61 caggccaggc cctgcagcga gtgggcatcg tcgggggtca ggaggccccc aggagcaagt 121 ggccctggca ggtgagcctg agagtccacg gcccatactg gatgcacttc tgcgggggct 181 ccctcatcca cccccagtgg gtgctgaccg cagcgcactg cgtgggaccg gacgtcaagg 241 atctggccgc cctcagggtg caactgcggg agcagcacct ctactaccag gaccagctgc 301 tgccggtcag caggatcatc gtgcacccac agttctacac cgcccagatc ggagcggaca 361 tcgccctgct ggagctggag gagccggtga acgtctccag ccacgtccac acggtcaccc 421 tgccccctgc ctcagagacc ttccccccgg ggatgccgtg ctgggtcact ggctggggcg 481 atgtggacaa tgatgagcgc ctcccaccgc catttcctct gaagcaggtg aaggtcccca 541 taatggaaaa ccacatttgt gacgcaaaat accaccttgg cgcctacacg ggagacgacg 601 tccgcatcgt ccgtgacgac atgctgtgtg ccgggaacac ccggagggac tcatgccagg 661 gcgactccgg agggcccctg gtgtgcaagg tgaatggcac ctggctgcag gcgggcgtgg 721 tcagctgggg cgagggctgt gcccagccca accggcctgg catctacacc cgtgtcacct 781 actacttgga ctggatccac cactatgtcc ccaaaaagcc gtgagtcagg cctgggttgg 841 ccacctgggt cactggagga ccaacccctg ctgtccaaaa caccactgct tcctacccag 901 gtggcgactg ccccccacac cttccctgcc ccgtcctgag tgccccttcc tgtcctaagc 961 cccctgctct cttctgagcc ccttcccctg tcctgaggac ccttccctat cctgagcccc 1021 cttccctgtc ctaagcctga cgcctgcacc gggccctcca gccctcccct gcccagatag 1081 ctggtggtgg gcgctaatcc tcctgagtgc tggacctcat taaagtgcat ggaaatc // LOCUS HUMTRY2A 1128 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human tryptase-II mRNA, 3' end. ACCESSION M33492 KEYWORDS serine protease; tryptase-II. SOURCE Human adult skin, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1128) AUTHORS Vanderslice,P., Ballinger,S., Tam,E.K., Goldstein,S.M., Craik,C.S. and Caughey,G.H. TITLE Human mast cell tryptase: Multiple cDNAs and genes reveal a multigene serine protease family JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Vanderslice, 02-APR-1990. FEATURES from to/span description pept < 1 826 tryptase-II (AA at 2) signal 1121 1126 poly-A signal BASE COUNT 190 a 409 c 329 g 200 t ORIGIN 1 gctgaatctg ctgctgctgg cgctgcccgt cctggcgagc cgcgcctacg cggcccctgc 61 cccaggccag gccctgcagc gagtgggcat cgttgggggt caggaggccc ccaggagcaa 121 gtggccctgg caggtgagcc tgagagtcca cggcccatac tggatgcact tctgcggggg 181 ctccctcatc cacccccagt gggtgctgac cgcagcgcac tgcgtgggac cggacgtcaa 241 ggatctggcc gccctcaggg tgcaactgcg ggagcagcac ctctactacc aggaccagct 301 gctgccggtc agcaggatca tcgtgcaccc acagttctac accgcccaga tcggagcgga 361 catcgccctg ctggagctgg aggagccggt gaaggtctcc agccacgtcc acacggtcac 421 cctgccccct gcctcagaga ccttcccccc ggggatgccg tgctgggtca ctggctgggg 481 cgatgtggac aatgatgagc gcctcccacc gccatttcct ctgaagcagg tgaaggtccc 541 cataatggaa aaccacattt gtgacgcaaa ataccacctt ggcgcctaca cgggagacga 601 cgtccgcatc gtccgtgacg acatgctgtg tgccgggaac acccggaggg actcatgcca 661 gggcgactcc ggagggcccc tggtgtgcaa ggtgaatggc acctggctgc aggcgggcgt 721 ggtcagctgg ggcgagggct gtgcccagcc caaccggcct ggcatctaca cccgtgtcac 781 ctactacttg gactggatcc accactatgt ccccaaaaag ccgtgagtca ggcctgggtt 841 ggccacctgg gtcactggag gaccaacccc tgctgtccaa aacaccactg cttcctaccc 901 aggtggcgac tgccccccac accttccctg ccccgtcctg agtgcccctt cctgtcctaa 961 gccccctgct ctcttctgag ccccttcccc tgtcctgagg acccttcccc atcctgagcc 1021 cccttccctg tcctaagcct gacgcctgca ccgggccctc cggccctccc ctgcccaggc 1081 agctggtggt gggcgctaat cctcctgagt gctggacctc attaaagt // LOCUS HUMTRY3A 1081 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human tryptase-III mRNA, 3' end. ACCESSION M33493 KEYWORDS serine protease; tryptase-III. SOURCE Human adult skin, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1081) AUTHORS Vanderslice,P., Ballinger,S., Tam,E.K., Goldstein,S.M., Craik,C.S. and Caughey,G.H. TITLE Human mast cell tryptase: Multiple cDNAs and genes reveal a multigene serine protease family JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Vanderslice, 02-APR-1990. FEATURES from to/span description pept < 1 805 tryptase-III (AA at 2) BASE COUNT 181 a 398 c 317 g 185 t ORIGIN 1 gctgcccgtc ctggcgagcc gcgcctacgc ggcccctgcc ccaggccagg ccctgcagcg 61 agtgggcatc gttgggggtc aggaggcccc caggagcaag tggccctggc aggtgagcct 121 gagagtccgc gaccgatact ggatgcactt ctgcgggggc tccctcatcc acccccagtg 181 ggtgctgacc gcagcgcact gcgtgggacc ggacgtcaag gatctggccg ccctcagggt 241 gcaactgcgg gagcagcacc tctactacca ggaccagctg ctgccggtca gcaggatcat 301 cgtgcaccca cagttctaca ccgcccagat cggagcggac atcgccctgc tggagctgga 361 ggagccggtg aaggtctcca gccacgtcca cacggtcacc ctgccccctg cctcagagac 421 cttccccccg gggatgccgt gctgggtcac tggctggggc gatgtggaca atgatgagcg 481 cctcccaccg ccatttcctc tgaagcaggt gaaggtcccc ataatggaaa accacatttg 541 tgacgcaaaa taccaccttg gcgcctacac gggagacgac gtccgcatcg tccgtgacga 601 catgctgtgt gccgggaaca cccggaggga ctcatgccag ggcgactccg gagggcccct 661 ggtgtgcaag gtgaatggca cctggctgca ggcgggcgtg gtcagctggg gcgagggctg 721 tgcccagccc aaccggcctg gcatctacac ccgtgtcacc tactacttgg actggatcca 781 ccactatgtc cccaaaaagc cgtgagtcag gcctggggtg tccacctggg tcactggagg 841 accagcccct cctgtccaaa acaccactgc ttcctaccca ggcggcgact gccccccaca 901 ccttccctgc cccgtcctga gtgccccttc ctgtcctaag ccccctgctc tcttctgagc 961 cccttcccct gtcctgagga cccttcccca tcctgagccc ccttccctgt cctaagcctg 1021 acgcctgcac cgggccctcc ggccctcccc tgcccaggca gctggtggtg ggcgctaatc 1081 c // LOCUS YSCAPN1 1815 bp ds-DNA PLN 08-AUG-1990 DEFINITION S.cerevisiae apurinic endonuclease (APN1) gene, complete cds. ACCESSION M33667 KEYWORDS 3'-repair diesterase; apurinic endonuclease. SOURCE S.cerevisiae (strain EJ2169) DNA, clone pSCP8. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1815) AUTHORS Popoff,S.C., Spira,A.I., Johnson,A.W. and Demple,B. TITLE Yeast structural gene (APN1) for the major apurinic endonuclease: Homology to Escherichia coli endonuclease 4 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 4193, 4197 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.C.Popoff, 10-APR-1990. FEATURES from to/span description pept 340 1443 apurinic endonuclease (APN1) signal 265 269 TATA box BASE COUNT 607 a 287 c 372 g 549 t ORIGIN Chromosome 11. 1 cccgagcaca agaaaataaa aaattgaaca aaaataagaa taaagtcaca aagggaagaa 61 gatgaggggg ttctttttcc agactttctt cttttggtcc ttcaccttgg atatatgaac 121 atagtagtat tattgaacac gaactttttt catctaatac ttctcttttt ttgccagttt 181 ttctgggaac ttggaaactg gaatttttca ttcttacctg gtttcttttt cttcatttat 241 ttacatgttg gaaatcattg taaatataag atgacaaact ccgaataaga aacacaaaac 301 gcaacattaa taagcttttg gcatatcgga accatcgtaa tgccttcgac acctagcttt 361 gttagatctg ctgtctcgaa atacaaattt ggtgcgcaca tgtcaggtgc cggtggaatt 421 tctaatagtg taactaatgc atttaacact ggctgtaatt cgtttgccat gtttttaaaa 481 tctccaagaa agtgggtttc tccgcagtat acacaggagg aaatagataa atttaagaaa 541 aactgtgcaa cttacaatta taacccattg acggacgtct taccgcatgg ccaatacttt 601 attaatttgg ctaatccgga tagggaaaag gcagaaaaga gttatgagtc attcatggac 661 gatctgaaca ggtgtgagca attagggata ggcctgtata acttacatcc aggttctact 721 ttgaaaggag accatcagtt gcagttaaaa caactagcct catatttgaa caaagcaatt 781 aaggaaacga aatttgttaa aattgtatta gaaaatatgg ctggtactgg aaatttagta 841 ggaagttctc tggtagattt gaaggaggtt attggaatga ttgaagataa atcaagaatc 901 ggcgtttgca tagatacatg ccatacattt gcagcaggct acgatatcag caccactgaa 961 acgtttaaca atttttggaa agagttcaat gatgtaatag ggttcaaata tctaagtgct 1021 gttcacttaa atgattccaa ggctccttta ggaagcaata gagatttgca tgaacgcttg 1081 ggtcaaggtt atttgggtat agatgtgttt agaatgatcg cgcactctga atacctgcag 1141 ggtatcccta ttgtcttgga aactccatac gaaaatgatg aaggctatgg taatgaaatc 1201 aaacttatgg agtggttgga atcaaagagt gagagcgaat tgttagaaga caaggagtat 1261 aaagaaaaaa atgatacgct gcagaaatta ggtgctaaat cgcgtaagga acagttagac 1321 aagtttgagg ttaaacaaaa gaagcgagct gggggcacca agaggaagaa agccactgca 1381 gagccaagcg ataatgatat cttgtcacaa atgacaaaga agaggaagac taagaaagaa 1441 taatgagaag cgagaagaat tttaaatacg taatcaattt ttgtagatta tctcaacgta 1501 cgtaactaaa tgctaccctt cgtcatccaa gatgctgttg cccatttcat agaaactact 1561 tacttggttt atagatgttt gcgttgcgtt acagaataag gcagcctttt ttttttttag 1621 attttcattt ttcacatact gatggattag cagattataa aagggtaaag ttaaaatcat 1681 tccaaacttg aactaaggaa ggcgctagac catcctctta tggaacgtta ttcctaagca 1741 tagaaagttt tttacttgta atcttgaacg cggggtgtgt tctttatctt ttttttcttt 1801 tttacacaat tagac // LOCUS YSCCKA2 2393 bp ds-DNA PLN 08-AUG-1990 DEFINITION S.cerevisiae casein kinase-II alpha' subunit gene, complete cds. ACCESSION M33759 KEYWORDS casein kinase-II. SOURCE S.cerevisiae (strain S288C) DNA, (library of M.Snyder), clone ScA2-1. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2393) AUTHORS Padmanabha,R., Chen-Wu,J.L.-P., Hanna,D.E. and Glover,C.V.C. TITLE Isolation, sequencing, and disruption of the yeast CKA2 gene: Casein kinase II is essential for viability in Saccharomyces cerevisiae JOURNAL Mol. Cell. Biol. 10, 4089-4099 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.V.C.Glover, 19-APR-1990. FEATURES from to/span description pept 520 1539 casein kinase-2 signal 337 342 TATA box BASE COUNT 771 a 470 c 455 g 697 t ORIGIN 1 agatctacat actgaatgaa ttgtcccctc aactctgaag ttgatttact tgctgtaggt 61 cgattgctcc aaagttggat atccctaatg accaaacttc cctgctttcc gctaagagta 121 aaatttagtg tgcataattt ccgtgacatc aaaagtcagg cgaattggtg aatgctgtcc 181 cctttcaagg cagtctatca tttcttagtt acctgccaac aggcctttat caggcctcaa 241 agaaaaaaat ttccttgcgt agcccgcttt tcgcccgaac ctgacaaagc aaacagacga 301 tcgaaaaatt tccaacgatc ttatctaggt ggaagttata aaaagaagaa taaagcatcg 361 tcaacctttt taacctgaat tgggaaacaa cgggcacaca catttgttgg agtttgtatg 421 attgctaaaa aataaaaagt aattaaaaaa acatcataga gttgtaaggt ctcagggaaa 481 tagaaggaac aataaaccta aaagaataga agaaacagaa tgccattacc tccgtcaaca 541 ttgaaccaga aatctaatag agtctactct gtagctaggg tgtacaagaa tgcctgcgag 601 gagagaccac aagaatactg ggactacgaa caaggggtga ccatcgattg gggaaagatt 661 tccaattacg aaattatcaa caaaattgga agagggaaat attccgaagt gttcagcggt 721 agatgtattg taaacaacca gaagtgtgtt attaaagttt taaaaccagt taaaatgaaa 781 aaaatttata gagagttgaa aattctgacc aatctaacag gcggccccaa tgttgttggc 841 ctttatgata tagtacaaga cgctgactcc aaaatacctg ctttgatctt tgaggaaatc 901 aaaaatgttg atttcagaac tttatatcct acattcaaac ttcctgacat ccagtattat 961 ttcacgcaat tattgattgc gttagactac tgtcactcca tgggcataat gcacagagac 1021 gtaaagcctc agaatgtcat gattgatcct acggaacgta aactaaggct gatcgattgg 1081 ggcctggcgg agttctacca tccaggtgta gattacaacg ttcgtgtcgc ttcgcgttac 1141 cacaagggac cagaactttt agtaaacttg aaccaatatg actactccct agacttatgg 1201 tcagtaggat gcatgctagc agctattgtc ttcaaaaaag aacctttttt caaagggtcg 1261 tctaatccag atcaactggt aaagattgcc acagtactag gaaccaagga actgttaggc 1321 tatttgggta agtacgggtt gcacttacca tctgaatacg acaacattat gagagacttt 1381 acaaaaaaat cgtggacaca ctttataacc tccgagacca aattagctgt tcctgaagtg 1441 gttgatttaa tcgacaattt attaaggtat gaccatcaag aaagattaac agcaaaggag 1501 gctatggatc ataagttttt caaaacgaag tttgaataat tggacaacaa tactcttagc 1561 aaggcaattc tttttccacc actgatttcc atttataaaa agcaggggtt ttcctttttt 1621 gtagcttgat attagagaat atccttaatg tgaatattaa attgcaaact ctatttaaca 1681 acgaatgtag attattatat gaccagaaaa tcactttata tagatataag agccttccgt 1741 aatgctatgc aagttttgat gaattattat attgtcgtga ggcgtgttcc gaggagtttc 1801 gtcgccattg agatagagat gctcgtaatc gtacgtgtct ggaagtgcct gccgacggta 1861 gcgatgtcta ctctcgtccc agacaatgga atagttttga aattgttcta catcatttat 1921 tagaacgcac gattcacgaa atctcccacg tttgtcaata tctcgcctac gaacagtgtc 1981 atcgaatgat attgaacgct gagagagcca tttacctttt gagtaaaata cgctgccttt 2041 tttcagcaat gaagaaccac ttccctcctt tcttacatcc aataattgct tcccttgttc 2101 ctttatattc cccataggaa agtccgttgg agtaggttgc tctctttcat gtattccaac 2161 cagatcaggt ccgtaaagaa cagtaacgtc tatttcttta ttccagttta tgcttagtgg 2221 atcaattttg tattcatcta aattgaagtg ctttatggac cacctcctcc atatacaatg 2281 atacaaccta gccatatata tgctatagtc aaactcattt tcagaagagc aactgggtaa 2341 cctcaataat ttatgcttgc gcattttcag caacaagcgc caggtagtgg aca // LOCUS BSURPOF 3078 bp ds-DNA BCT 08-AUG-1990 DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB), complete cds. ACCESSION M34995 KEYWORDS RNA polymerase; minor sigma-37 factor. SOURCE B.subtilis (strains 168 and Marburg), clones lambda-[21,gt11,1,11,21,31]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 368 to 2297) AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W. TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus subtilis RNA polymerase: Isolation, nucleotide sequence, chromosomal locus, and cryptic function JOURNAL J. Bacteriol. 169, 771-778 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 3078) AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W. TITLE Similar organization of the sigB and spoIIA operons encoding alternate sigma factors of Bacillus subtilis RNA polymerase JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.W.Price, 18-FEB-1987. Draft entry and computer-readable sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990. FEATURES from to/span description pept 200 529 ORF V pept 526 1008 ORF W pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg start codon) pept 1762 2361 ORF X mRNA 131 2368 sigB operon mRNA signal 131 168 sigB operon promoter signal 2368 2401 sigB transcription termination signal binding 188 194 ORF V ribosomal binding site binding 511 519 ORF W ribosomal binding site binding 954 961 sigB ribosomal binding site binding 1750 1753 ORF X ribosomal binding site BASE COUNT 999 a 575 c 755 g 749 t ORIGIN 40 degrees on B.subtilis map. 1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt 61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt 121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg 181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat 241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct 301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat 361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg 421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt 481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct 541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg 601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc 661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg 721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa 781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac 841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat 901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg 961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga 1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg 1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca 1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc 1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg 1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc 1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga 1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg 1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac 1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca 1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct 1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat 1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga 1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta 1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct 1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac 1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc 1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg 2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt 2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc 2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata 2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta 2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat 2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt 2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg 2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac 2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga 2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac 2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca 2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag 2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa 2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt 2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt 2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc 3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg 3061 caaactgctg cgaagctt // LOCUS HUMERPR 1624 bp ss-mRNA PRI 08-AUG-1990 DEFINITION Human erythropoietin receptor mRNA, complete cds. ACCESSION M34986 KEYWORDS erythropoietin receptor. SOURCE Human cell line OCI-M1, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1624) AUTHORS Ehrenman,K. and St John,T. TITLE Sequence of the human erythropoietin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.St.John, 07-JUN-1990. Author address: T.St.John Hutchinson Cancer Res Center 1124 Columbia St. Seattle, WA Email: TSTJOHN@FHCRCVAX FEATURES from to/span description pept 1 1527 erythropoietin receptor precursor sigp 1 72 erythropoietin receptor signal peptide matp 73 1524 erythropoietin receptor BASE COUNT 276 a 537 c 498 g 313 t ORIGIN 1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct 61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc 121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg 181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc 241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt 301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta 361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat 421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc 481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac 541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag 601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc 661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg 721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg 781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag 841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac 901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc 961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg 1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc 1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg 1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc 1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct 1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca 1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct 1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta 1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg 1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata 1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc 1621 agga // LOCUS HUMGFP40H 4379 bp ds-DNA PRI 08-AUG-1990 DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete cds. ACCESSION M30135 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor; megakeryoblastic leukemia cell growth factor. SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4379) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 12-DEC-1989. FEATURES from to/span description pept 796 909 P40 T-cell and mast cell growth factor precursor, exon 1 1022 1057 P40 T-cell and mast cell growth factor precursor, exon 2 1140 1172 P40 T-cell and mast cell growth factor precursor, exon 3 2461 2592 P40 T-cell and mast cell growth factor precursor, exon 4 4096 4215 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 796 849 P40 T-cell and mast cell growth factor signal peptide matp 850 909 P40 T-cell and mast cell growth factor 1022 1057 P40 T-cell and mast cell growth factor 1140 1172 P40 T-cell and mast cell growth factor 2461 2592 P40 T-cell and mast cell growth factor 4096 4212 P40 T-cell and mast cell growth factor IVS 910 1021 hP40 intron A IVS 1058 1139 hP40 intron B IVS 1173 2460 hP40 intron C IVS 2593 4095 hP40 intron D signal 742 747 TATA box signal 4354 4359 polyA signal rpt 2965 3265 Alu repeat binding 370 376 AP-2 binding site binding 624 629 IRF-1 binding site binding 627 633 AP-1 binding site BASE COUNT 1208 a 921 c 904 g 1346 t ORIGIN 1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt 61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc 121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca 181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct 241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact 301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc 361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag 421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa 481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt 541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt 601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa 661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt 721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc 781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc 841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac 901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat 961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta 1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac 1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg 1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg 1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat 1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt 1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt 1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag 1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa 1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc 1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg 1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc 1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc 1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat 1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac 1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc 1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg 1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt 2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc 2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga 2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc 2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa 2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa 2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt 2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag 2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg 2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac 2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt 2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc 2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc 2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct 2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag 2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa 2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg 3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg 3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat 3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca 3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc 3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt 3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac 3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg 3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc 3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt 3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca 3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca 3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact 3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga 3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt 3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc 3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc 3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg 4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa 4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac 4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg 4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc 4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga 4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa // LOCUS MEFDNABP 1710 bp ds-DNA BCT 08-AUG-1990 DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds. ACCESSION M34778 KEYWORDS DNA binding protein HMfB. SOURCE M.fervidus DNA. ORGANISM Methanothermus fervidus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (sites; for [2]) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. TITLE DNA binding protein HMf, isolated from the hypothermophilic archaeal Methanothermus fervidus is most closely related to histones JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1710) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Sandman, 01-JUN-1990. Author address: K.Sandman Department of Microbiology Ohio State University 484 W. 12th Ave. Columbus, Ohio 43210 E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU FEATURES from to/span description pept 578 787 DNA binding protein HMfB pept 1662 805 (c) ORF285 signal 450 457 promoter signal 523 530 promoter signal 790 814 transcription termination signal BASE COUNT 619 a 263 c 265 g 561 t 2 others ORIGIN 1 bp upstream of HindIII site. 1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa 61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag 121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag 181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg 241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat 301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat 361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa 421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt 481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt 541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat 601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct 661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag 721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa 781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat 841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc 901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt 961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat 1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa 1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg 1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata 1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc 1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc 1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc 1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat 1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc 1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt 1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt 1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa 1681 gttttgtttt acccattgct tctaaatatt // LOCUS MUSINT1A 5607 bp ds-DNA ROD 08-AUG-1990 DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds. ACCESSION K02593 M34750 KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene. SOURCE Mouse (BALB/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1098 to 5606) AUTHORS Van Ooyen,A. and Nusse,R. TITLE Structure and nucleotide sequence of the putative mammary oncogene int-1; Proviral insertions leave the protein-encoding domain intact JOURNAL Cell 39, 233-240 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 5607) AUTHORS Nusse,R., Theunissen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A., Otte,A., Schuuring,E. and van Ooyen,A. TITLE The Wnt-1 (int-1) oncogene promoter and its mechanism of activation by insertion of proviral DNA of the mouse mammary tumor virus JOURNAL Mol. Cell. Biol. 10, 4170-4179 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [2] kindly submitted by R.Nusse, 31-MAY-1990. FEATURES from to/span description pept 1779 1882 Wnt-1 protein (exon 1) 2452 2705 Wnt-1 protein (exon 2) 3279 3544 Wnt-1 protein (exon 3) 4002 4490 Wnt-1 protein (exon 4) IVS 1883 2451 Wnt-1 cds intron A IVS 2706 3278 Wnt-1 cds intron B IVS 3545 4001 Wnt-1 cds intron C site 1133 1134 MMTV insertion site in tumor 17 [1] site 1394 1395 MMTV insertion site in tumor 35 [1] site 1631 1632 MMTV insertion site in tumor 102 [1] site 4495 4496 MMTV insertion site in tumor 53 [1] mRNA 1428 5398 Wnt-1 mRNA (alt.) mRNA 1595 5398 Wnt-1 mRNA (alt.) signal 1562 1566 TATA box signal 1573 1577 TATA box BASE COUNT 1188 a 1681 c 1519 g 1219 t ORIGIN Chromosome 15. 1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc 61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac 121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac 181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa 241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg 301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg 361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac 421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct 481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact 541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac 601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag 661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg 721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga 781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca 841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact 901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc 961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca 1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg 1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga 1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg 1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg 1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc 1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc 1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg 1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt 1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa 1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag 1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact 1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt 1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc 1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg 1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg 1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag 1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct 2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc 2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct 2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt 2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg 2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg 2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg 2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt 2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga 2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat 2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg 2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa 2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc 2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag 2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg 2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg 2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc 3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag 3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag 3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact 3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac 3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat 3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc 3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg 3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc 3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg 3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg 3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag 3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag 3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt 3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca 3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac 3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt 3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc 4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc 4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc 4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg 4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat 4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt 4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc 4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg 4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc 4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg 4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc 4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga 4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc 4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac 4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac 4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc 4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa 4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc 5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc 5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac 5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga 5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac 5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct 5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat 5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca 5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga 5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat 5581 gaccctaaat gagaccaact agggatc // LOCUS MUSP40M 3808 bp ds-DNA ROD 08-AUG-1990 DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete cds. ACCESSION M30136 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor. SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210 DNA, clone lambda-S40.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3808) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 21-NOV-1989. FEATURES from to/span description pept 644 757 P40 T-cell and mast cell growth factor precursor, exon 1 846 881 P40 T-cell and mast cell growth factor precursor, exon 2 965 997 P40 T-cell and mast cell growth factor precursor, exon 3 2184 2315 P40 T-cell and mast cell growth factor precursor, exon 4 3393 3512 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 644 697 P40 T-cell and mast cell growth factor signal peptide matp 698 757 P40 T-cell and mast cell growth factor 846 881 P40 T-cell and mast cell growth factor 965 997 P40 T-cell and mast cell growth factor 2184 2315 P40 T-cell and mast cell growth factor 3393 3509 P40 T-cell and mast cell growth factor IVS 758 845 mP40 intron A IVS 882 964 mP40 intron B IVS 998 2183 mP40 intron C IVS 2316 3392 mP40 intron D signal 589 594 TATA box binding 227 233 AP-2 binding site binding 475 480 IRF-1 binding site binding 478 484 AP-1 binding site BASE COUNT 1012 a 903 c 869 g 1024 t ORIGIN Chromosome 13. 1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca 61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt 121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag 181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc 241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga 301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt 361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt 421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga 481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg 541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta 601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat 661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg 721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga 781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac 841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag 901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt 961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc 1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct 1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat 1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga 1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa 1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac 1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac 1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc 1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca 1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta 1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac 1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc 1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct 1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta 1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat 1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg 1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc 1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg 2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa 2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt 2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt 2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa 2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca 2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc 2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag 2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg 2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct 2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag 2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg 2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag 2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat 2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc 2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct 2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc 3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga 3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg 3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga 3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa 3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt 3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc 3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc 3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga 3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg 3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag 3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga 3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac 3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat 3781 aggtaggatg gagcaaagac ttggtaat // LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 08-AUG-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete cds. ACCESSION M29938 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone ZQ30-1. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1105) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 13 777 MHC SLA-DQ alpha chain precursor sigp 13 81 MHC SLA-DQ alpha chain signal peptide matp 82 774 MHC SLA-DQ alpha chain mRNA < 1 1105 MHC SLA-DQ mRNA BASE COUNT 254 a 290 c 255 g 306 t ORIGIN 1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc 61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta 121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag 181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa 241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg 301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact 361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac 421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt 481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc 541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat 601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag 661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc 721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc 781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa 841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc 901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg 961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc 1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1081 aaacaataaa tcacttcatg aggtt // LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 08-AUG-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete cds. ACCESSION M29939 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 69 836 MHC SLA-DQ alpha chain precursor sigp 69 137 MHC SLA-DQ alpha chain signal peptide matp 138 833 MHC SLA-DQ alpha chain BASE COUNT 273 a 313 c 266 g 322 t ORIGIN 1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg 61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga 121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg 181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat 241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta 301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca 361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt 421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca 481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt 541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca 601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata 661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga 721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct 781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca 841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac 901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct 961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga 1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc 1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1141 aaacaataaa tcacttcatg aggtttttgt actg // LOCUS PIGMHDQBA 212 bp ds-DNA MAM 08-AUG-1990 DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end. ACCESSION M29944 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 212) AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F., Gustafsson,K. and LeGuern,C. TITLE Class II genes of miniature swine: II. Molecular identification and characterization of B (beta) genes from SLA-c haplotype JOURNAL Immunogenetics 31, 1-6 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept < 1 12 MHC SLA-DO beta chain BASE COUNT 38 a 58 c 52 g 64 t ORIGIN 1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg 61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc 121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct 181 cttacccttc tgcctttctt cctcctactt gt // LOCUS PP2LEND 266 bp ds-DNA PHG 08-AUG-1990 DEFINITION Bacteriophage P2 left end DNA. ACCESSION M34756 KEYWORDS . SOURCE Bacteriophage P2 DNA. ORGANISM Bacteriophage P2 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 266) AUTHORS Ziermann,R. and Calendar,R. TITLE Characterization of the cos sites of Bacteriophages P2 and P4 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Ziermann, 05-JUN-1990. Author address: R.Ziermann University of California Dept. Mol. Cell Biology Berkeley, CA 94720 Email: calendar@garnet.berkeley.edu FEATURES from to/span description site 1 19 cohesive end of phage DNA site 20 266 left end of phage DNA BASE COUNT 41 a 58 c 99 g 68 t ORIGIN Map position 0.0-0.8%. 1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct 61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt 121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg 181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc 241 gagccagccg tttatctcgc ggatcc // LOCUS RCABCHH 314 bp ds-DNA BCT 08-AUG-1990 DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and F108) genes, 3' end, and 5' end respectively. ACCESSION M34843 KEYWORDS . SOURCE R.capsulata DNA. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 314) AUTHORS Yang,Z. and Bauer,C.E. TITLE Rhodobacter capsulatus genes involved in early steps of the bacteriochlorophyll biosynthetic pathway JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.E.Bauer, 04-JUN-1990. FEATURES from to/span description pept < 1 79 bacteriochlorophyll biosynthesis protein (bchH) (AA at 2) pept 108 > 314 bacteriochlorophyll biosynthesis protein (F108) BASE COUNT 59 a 87 c 119 g 49 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga 61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg 121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg 181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg 241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca 301 agcgcgtgct gcag // LOCUS TREANTAT 999 bp ds-DNA ORG 08-AUG-1990 DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle. ACCESSION M34848 KEYWORDS . SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA. ORGANISM Kinetoplast Trypanosoma evansi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma evansi. REFERENCE 1 (bases 1 to 999) AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S., Steinert,M. and Hamers,R. TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Bajyana Songa, 05-JUN-1990. FEATURES from to/span description site 367 516 kinetoplast minicircle common region (ORI) BASE COUNT 391 a 78 c 203 g 327 t ORIGIN 1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt 61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt 121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg 181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt 241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt 301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat 361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg 421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt 481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag 541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag 601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca 661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat 721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac 781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa 841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg 901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat 961 aaaataatgg ataaaagagt ttataggtga agttgaagt // LOCUS VIBSCRRE 1524 bp ds-DNA BCT 08-AUG-1990 DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds, and sucrose transport (scrA) protein, 5' end. ACCESSION M35009 KEYWORDS Scr repressor protein; sucrose transport protein. SOURCE V.alginolyticus DNA. ORGANISM Vibrio alginolyticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1524) AUTHORS Blatch,G.L. and Woods,D.R. TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr repressor-encoding gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Blatch, 10-JUN-1990. Author address: G.L.Blatch University of Cape Town Dept. Microbiology Private Bag Rondebosch 7700 SOUTH AFRICA Email: bltgre01.uctvax@f4.n494.z5.fidonet.org FEATURES from to/span description pept 36 < 1 (c) sucrose transport protein (scrA) pept 271 555 Scr repressor protein (scrR) pept 791 1090 ORF5 binding 264 267 scrR ribosomal binding site binding 780 785 ORF5 ribosomal binding site signal 178 183 scrA -10 region signal 203 208 scrA -35 region signal 190 195 scrR -10 region signal 162 167 scrR -10 region signal 762 767 ORF5 -35 region signal 742 747 ORF5 -35 region site 78 91 repressor operator site (put.) site 229 242 repressor operator site (put.) site 12 71 ScrR helix-turn-helix motif BASE COUNT 466 a 344 c 321 g 393 t ORIGIN 1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc 61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg 121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt 181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc 241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg 301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa 361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc 421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg 481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt 541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga 601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt 661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac 721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag 781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt 841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga 901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg 961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga 1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta 1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc 1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt 1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt 1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat 1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct 1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca 1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa 1501 gccaattgtg atattgaaga agct // LOCUS XELNRP1A 2431 bp ss-mRNA VRT 08-AUG-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA, complete cds. ACCESSION M34894 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2431) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 74 1117 pot. RNA-binding protein (nrp-1B) mRNA < 1 2431 nrp-1B mRNA signal 2407 2418 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence site 921 941 alternative exon (put.) BASE COUNT 663 a 522 c 543 g 703 t ORIGIN 1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt 61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc 121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact 181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt 241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa 301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt 361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt 421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga 481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac 541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa 601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt 661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat 721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc 781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc 841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc 901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag 961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag 1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc 1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct 1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct 1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga 1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat 1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg 1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat 1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta 1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa 1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat 1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac 1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg 1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc 1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat 1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc 1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga 1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct 2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg 2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc 2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat 2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg 2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca 2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata 2401 gttcaaaata aactatgaat tttattcatt t // LOCUS XELNRP1B 2324 bp ss-mRNA VRT 08-AUG-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA, complete cds. ACCESSION M34895 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2324) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 29 1051 pot. RNA-binding protein (nrp-1B) mRNA < 1 2324 nrp-1B mRNA signal 2306 2311 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence BASE COUNT 647 a 467 c 520 g 690 t ORIGIN 1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc 61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac 121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt 181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca 241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga 301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat 361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca 421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg 481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt 541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc 601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat 661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag 721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac 781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg 841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc 901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat 961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat 1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg 1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct 1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat 1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta 1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa 1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa 1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct 1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa 1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc 1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact 1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg 1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt 1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct 1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag 1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag 1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt 1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta 2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac 2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt 2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat 2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa 2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc // LOCUS YSJSACDHY 2569 bp ds-DNA PLN 08-AUG-1990 DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds. ACCESSION M34929 KEYWORDS saccharopine dehydrogenase. SOURCE Y.lipolytica (strain W29) DNA. ORGANISM Yarrowia lipolytica Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2569) AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and Gaillardin,C. TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia lipolytica JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.M.Fournier, 06-JUN-1990. ORF1 is probably not translated, since a lacZ fusion/expression experiment did not produce results. FEATURES from to/span description pept 649 1929 ORF1 pept 1764 655 (c) saccharopine dehydrogenase mRNA 582 2060 ORF1 mRNA (alt.) mRNA 598 2060 ORF1 mRNA (alt.) mRNA 599 2060 ORF1 mRNA (alt.) mRNA 602 2060 ORF1 mRNA (alt.) mRNA 606 2060 ORF1 mRNA (alt.) mRNA 609 2060 ORF1 mRNA (alt.) mRNA 1807 639 (c) ORF2 mRNA (alt.) mRNA 1809 639 (c) ORF2 mRNA (alt.) mRNA 1811 639 (c) ORF2 mRNA (alt.) mRNA 1812 639 (c) ORF2 mRNA (alt.) mRNA 1814 639 (c) ORF2 mRNA (alt.) signal 521 528 TATA box signal 565 575 TATA box BASE COUNT 668 a 573 c 666 g 662 t ORIGIN 1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag 61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt 121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca 181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa 241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag 301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc 361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc 421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt 481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca 541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt 601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc 661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt 721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc 781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc 841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac 901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt 961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca 1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt 1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc 1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt 1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata 1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc 1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc 1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct 1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg 1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg 1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc 1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt 1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag 1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg 1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag 1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg 1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga 1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa 2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca 2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg 2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc 2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct 2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag 2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga 2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat 2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat 2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc //
GenBank-Updates@genbank.bio.net (08/09/90)
LOCUS INS43AAA 130 bp ds-DNA BCT 09-AUG-1990 DEFINITION Insertion sequence IS2-43. ACCESSION M25093 KEYWORDS RNA polymerase binding site; insertion sequence; insertion sequence IS2. SOURCE Insertion sequence IS2 DNA. ORGANISM Insertion sequence IS2 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 130) AUTHORS Sommer,H., Cullum,J. and Saedler,H. TITLE IS2-43 and IS2-44: New alleles of the insertion sequence IS2 which have promoter activity JOURNAL Mol. Gen. Genet. 175, 53-56 (1979) STANDARD simple staff_entry FEATURES from to/span description BASE COUNT 41 a 23 c 22 g 44 t ORIGIN 1 cctaagacat caatcatctg ttctccaatg actagtctaa aaactagtat taagactatc 61 acttatttaa gtgatatact tatttaagtg atattggttg tctggagatt cagggggcca 121 gtctaatacc // LOCUS PSCIS1IN 146 bp ds-DNA BCT 09-AUG-1990 DEFINITION Plasmid pDG128 insertion element IS1 target region sequence. ACCESSION M25018 KEYWORDS insertion element; insertion element IS1. SOURCE Plasmid pDG128, a derivative of Plasmid pSC101, DNA, clone 128/10R7. ORGANISM Plasmid pSC101 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 146) AUTHORS Sommer,H., Schumacher,B. and Saedler,H. TITLE A new type of IS1-mediated deletion JOURNAL Mol. Gen. Genet. 184, 300-307 (1981) STANDARD simple staff_entry FEATURES from to/span description BASE COUNT 38 a 33 c 34 g 41 t ORIGIN 1 gctgcgaaaa tgccttatct ggcctacaga ttcgatgcga ttcgtaggtc ggataagatg 61 cgcaagcatc gcatccgaca ataagtgccg aatgcgacct acattcacat ggcgcttttt 121 acatctgacg gtttttattg aagtta // LOCUS BRVRNASA 197 bp ss-mRNA VRL 09-AUG-1990 DEFINITION Berne virus ORF5 mRNA, 5'end. ACCESSION M33503 M33501 KEYWORDS core protein. SOURCE Berne virus (strain P138/72) viral RNA. ORGANISM Berne virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Toroviridae. REFERENCE 1 (bases 1 to 197) AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M. TITLE A 3'-coterminal nested set of independently transcribed mRNAs is generated during Berne virus replication JOURNAL J. Virol. 64, 331-338 (1990) STANDARD simple staff_review FEATURES from to/span description pept 137 > 197 ORF5 mRNA 113 > 197 RNA5 BASE COUNT 50 a 27 c 38 g 82 t ORIGIN 1 ttatttcttc ttcctacttt gtggctactt gggttttgtt ggtggtggtt attattttag 61 tatttataat tataagtttt tgtattagta attaagtagg ttagtgagag acactatctt 121 tagagaaaga gccaagatga attctatgct taatccaaat gctgtgccat ttcaaccatc 181 acctcaggtt gttgcat // LOCUS BRVRNASB 179 bp ss-RNA VRL 09-AUG-1990 DEFINITION Berne virus ORF3 mRNA, 5' end. ACCESSION M33502 KEYWORDS core protein. SOURCE Berne virus (strain P138/72) viral RNA. ORGANISM Berne virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Toroviridae. REFERENCE 1 (bases 1 to 179) AUTHORS Snijder,E.J., Horzinek,M.C. and Spaan,W.J.M. TITLE A 3'-coterminal nested set of independently transcribed mRNAs is generated during Berne virus replication JOURNAL J. Virol. 64, 331-338 (1990) STANDARD simple staff_review FEATURES from to/span description pept 153 > 179 ORF3 BASE COUNT 52 a 17 c 34 g 76 t ORIGIN 1 ttataatctt cttcctactt ggattacatg gcttacttta ggttttagtt tgtttagtat 61 agtaataagt ggtattaata ttattttgtt ttttgaaatg aatggtaagg tgaagaaaag 121 ttagtcactt tctttagaag aaggttgcca aaatgtttga gaccaattat tggccattt // LOCUS CHKGLOBA 1204 bp ds-DNA VRT 09-AUG-1990 DEFINITION Chicken pie-alpha-globin gene, fragment H3/H4. ACCESSION M30485 KEYWORDS pie-alpha-globin. SOURCE Chicken AEV transformed erythroblast DNA, fragment H3/H4. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1204) AUTHORS Broders,F., Zahraoui,A. and Scherrer,K. TITLE The chicken alpha-globin gene domain is transcribed into a 17-kilobase polycistronic RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 91 > 1204 pie-alpha-globin mRNA fragment H3/H4 (put.) BASE COUNT 282 a 252 c 263 g 407 t ORIGIN 1 ggatctatct agttgctgca gtcgtttgta tgaaggttgg atccatcctg ttttgtactg 61 gatgactgcc ttcaattcac tggcaatcta ggatcaaatg tgtcctagag aacattcaat 121 atcgcttttt ttctaagctg ttgcaagcca gaatggttac ttttgagctg atctcggtgg 181 agcagttgag ttgttgtaag ttatttctta atggctccag aaaattacat catttaggtg 241 ctataactct ccatttccat cttgtatgcg taattgcatt tcttgaatac ttcagacatt 301 aatttcccgt cctacctgca ggttactggt gtgtattggc tatacagatt acttttccac 361 agatgtaacc ctaggtcttt tgaatataga tcccatctat tgtctgctta gagaccccga 421 taaccctccc gataaatcag agtccatgtt ttttgacagt atatcggtgt gaacatctgg 481 attttagtgc aatatgctag tagcaatctg agtccccgtt tctaagacag agtcatttag 541 tccgagaatg gctgtttaag actccaaatg gcagtcttga gtcttttagt gactgtactc 601 gttcctctac tgagggcagt cttgagtgtt ttagtgactg taccctgtct cttaacttga 661 ccggtctgat agatcttaaa tgacagtcgt ggccgcaatt tcaaatggaa gagctaggag 721 tctcaggaac cgtcgccctt gtttactctt atgtttaccc gttaagccgt catgaaaagg 781 atttttctgt agagaacggt tatatgagtt gtattccatc tagggtcacg gcccctagac 841 caaccaacga cgagtcgatt tgttgtctgg cactttctgt gacttcaagt tttgtggctt 901 tctctattaa ctttccccac aacgtaactg tctaacttag atgttggcgc gagaactaca 961 gtctgaggga cttgtcaaga gctggcacac tcgcctttat gttaaagtgt gtcctttgtc 1021 gatactggta ctaatgctta agctcgagcg ggcccctaga ccaacgacga gtcgatttgt 1081 tgtctggctc tttctgtgac ttcaagtttt gtggctttct ctattaactt tcccacaacg 1141 taactgtcta attagatgtt ggcgcgagaa tacagtctga gggattgtca agagtggact 1201 ggtt // LOCUS CHKGLOBB 582 bp ds-DNA VRT 09-AUG-1990 DEFINITION Chicken pie-alpha-globin gene, fragment H10. ACCESSION M30486 KEYWORDS pie-alpha-globin. SOURCE Chicken AEV transformed erythroblast DNA, fragment H10. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 582) AUTHORS Broders,F., Zahraoui,A. and Scherrer,K. TITLE The chicken alpha-globin gene domain is transcribed into a 17-kilobase polycistronic RNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 503-507 (1990) STANDARD simple staff_review FEATURES from to/span description mRNA 241 > 582 pie-alpha-globin fragment mRNA H10 BASE COUNT 171 a 128 c 108 g 175 t ORIGIN 1 tccaaaaaac ttactctgct tgtaaatgtc gtctcctttt tcggagacaa aaacttgata 61 ccttcttgcc ttgtccgaag tcactttatc ggttatagga cccaagtttt gggccttgct 121 agaaggatac aattccctat gaccgccgta ttttggggta ctcgcattcg cccgacatcg 181 agtggacctc ctttttttct cttgtcgttc gtagaggtta tcgaggtccc cccatatata 241 ataaccctat cgtgagttta gacttcctac aaaaacttct gtcgtttaat gttttcgtac 301 cgtcacggtg actgtccagt aatcaaagtt gtcactgtct aaaaagattc gacaacttcg 361 tcttaccaat gcgaaaactc gactagagac actcgtcaac tcacacattc aataaagaat 421 taccgaggtc ttttaatgta gtgaaatcac gatattgaga ggtaaaggta gaaacatacg 481 cattaaccta aagaacttat gaagtctgta attaaaggac cacaagcaat acgaaagaca 541 atgtatttct tctaacgtcg gataagtatt aggatggacg tc // LOCUS ECOPHOAA 600 bp ds-DNA BCT 09-AUG-1990 DEFINITION E.coli alkaline phosphatase (phoA) gene, 5' end. ACCESSION M33536 KEYWORDS alkaline phosphatase. SOURCE E.coli (strain K-12) cell line BW7710 DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 600) AUTHORS Agrawal,D.K. and Wanner,B.L. TITLE A phoA structural gene mutation that conditionally affects formation of the enzyme bacterial alkaline phosphatase JOURNAL J. Bacteriol. 172, 3180-3190 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.K.Agrawal, 03-APR-1990. The phoA503 mutation does not interfere with export of active enzyme but does interfere with assembly. FEATURES from to/span description pept 283 > 600 alkaline phosphatase precursor (phoA) (EC 3.1.3.1) sigp 283 345 alkaline phosphatase signal peptide matp 346 > 600 alkaline phosphatase variant 413 413 c in wild type; t in phoA503 mutation BASE COUNT 159 a 130 c 151 g 160 t ORIGIN Map position 8.7 minutes; 1 bp upstream of HindIII site. 1 aagctttgga gattatcgtc actgcaatgc ttcgcaatat ggcgcaaaat gaccaacagc 61 ggttgattga tcaggtagag ggggcgctgt acgaggtaaa gcccgatgcc agcattcctg 121 acgacgatac ggagctgctg cgcgattacg taaagaagtt attgaagcat cctcgtcagt 181 aaaaagttaa tcttttcaac agctgtcata aagttgtcac ggccgagact tatagtcgct 241 ttgtttttat tttttaatgt atttgtacat ggagaaaata aagtgaaaca aagcactatt 301 gcactggcac tcttaccgtt actgtttacc cctgtgacaa aagcccggac accagaaatg 361 cctgttctgg aaaaccgggc tgctcagggc gatattactg cacccggcgg tgctcgccgt 421 ttaacgggtg atcagactgc cgctctgcgt gattctctta gcgataaacc tgcaaaaaat 481 attattttgc tgattggcga tgggatgggg gactcggaaa ttactgccgc acgtaattat 541 gccgaaggtg cgggcggctt ttttaaaggt atagatgcct taccgcttac cgggcaatac // LOCUS GCOEARA 1771 bp ds-DNA PLN 09-AUG-1990 DEFINITION G.tikvahiae McLachlan 18S ribosomal RNA gene. ACCESSION M33640 KEYWORDS 18S ribosomal RNA. SOURCE G.tikvahiae McLachlan (isolate Pomquet Harbour-Nova Scotia) DNA. ORGANISM Gracilaria tikvahiae McLachlan Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; Florideophycideae; Gigartinales; Gracilariaceae. REFERENCE 1 (bases 1 to 1771) AUTHORS Liu,Q.-Y., Bird,C.J., Rice,E.L., Murphy,C.A. and Ragan,M.A. TITLE Nucleotide sequence of the 18S ribosomal RNA gene from the red alga Gracilaria tikvahiae mclachlan JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Ragan 08-APR-1990. Atlantic Research Lab, National Research Council of Canada, 1411 Oxford Street, Halifax, Nova Scotia CANADA B3H 3Z1 FEATURES from to/span description rRNA 1 1771 18S ribosomal RNA BASE COUNT 445 a 371 c 501 g 454 t ORIGIN 1 ccacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc 61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct 121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccata 181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt 241 tttttctgat cgcactcgtt gcgacgcacc gttcaaattt ctgacctatc aactttggat 301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg 361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc 421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta 481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc 541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc 601 tcgtagtcgg attttggcgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg 661 cctttgtgga tgggagtcag gtggtgcttc actggatcgc ttggctgccg ccaccgttta 721 ctgtgaaaaa attagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata 781 atagaatagg acccggtcct attttgttgg tttgtttgaa tcgggtaatg attaagaggg 841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg 901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga 961 tcgaagacga tcagataccg tcgtagtctt tactataaac gatgaggact ggagatcgga 1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg 1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg 1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg 1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt 1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag 1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc 1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc 1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat 1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa 1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct 1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat 1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt 1741 ttccgtaggt gaacctgcag aaggatcaag c // LOCUS HS6MCP 4440 bp ds-DNA VRL 09-AUG-1990 DEFINITION Human herpesvirus type 6 major capsid protein (MCP) gene, complete cds. ACCESSION M33515 KEYWORDS major capsid protein. SOURCE Human herpesvirus type 6 DNA. ORGANISM Human herpesvirus type 6 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 4440) AUTHORS Littler,E., Lawrence,G., Liu,M.-Y., Barrell,B.G. and Arrand,J.R. TITLE Identification, cloning, and expression of the major capsid protein gene of human herpesvirus 6 JOURNAL J. Virol. 64, 714-722 (1990) STANDARD simple staff_review FEATURES from to/span description pept 235 4272 major capsid protein (MCP) BASE COUNT 1422 a 1169 c 785 g 1064 t ORIGIN 1 tatcgtgaac gatatttggc ccggacgttt gaaaaatttt ctctatgatt gactcgatct 61 tttccagaac tacaggcatg gatcgcgcta aacgagtttc ctcgtcgcga gacacttcag 121 cggtcagatc acacgaatct ataaaaactg gaatcgaccg tgcacaagtg gaaccaaaac 181 atgaattaac tattaaagtt tcacaattac cggtgtgctg cataacgccg aaacatggaa 241 aattggcagg cgaccgaaat tttacctaag atcgaagcac ctctaaatat tttcaatgac 301 attaaaacat acacagccga acaacttttt gacaatttgc gaatttattt cggtgacgat 361 ccgagccgtt acaacatcag ttttgaagcc ttactcggaa tctactgcaa caaaatagaa 421 tggattaact ttttcaccac gccgatcgcc gttgcagcga acgtaatccg cttcaatgat 481 gtgagtcgaa tgaccctcgg gaaggttctc ttctttattc aattacctag agtcgctaca 541 ggaaacgacg taactgcttc aaaagaaacc accatcatgg tagccaaaca ctcagaaaaa 601 caccccataa acatatcgtt cgatttgagc gctgcctgtc tggaacatct ggaaaacaca 661 tttaaaaaca cagtcatcga tcagatttta aacatcaatg cgttacatac agtcttaaga 721 tctttaaaga attcagccga ttcgctcgag cgaggtttga ttcacgcatt catgcaaacc 781 ttattgagaa aatctccccc gcaatttatc gtcctgacca tgaatgagaa caaagtacat 841 aataaacaag ctctgagccg agtacagcgc agcaacatgt ttcagagcct gaagaacaga 901 ttgttaacgt cattattttt tttgaacagg aataataata tttcatatat ctatagaatt 961 ctaaacgaca tgatggaatc ggtcacggaa agcattctaa atgatacgaa caactacact 1021 tccaaagaaa acgtccccct agatggtgtt ttattaggac cgatcggctc tatccaaaaa 1081 ctcaccagca tactctccca gtacatctcc acacaagtcg tctccgcccc aatctcatat 1141 ggtcacttta ttatgggcaa agaaaacgca gtgactgcga ttgcataccg tgcaatcatg 1201 gccgatttta ctcaattcac cgtgaacgcc gggacagaac aacaagacac taacaacaaa 1261 tcagaaatct tcgacaaaag ccgcgcgtac gccgacctaa agctgaacac gttgaaattg 1321 ggagataaat tagtcgcatt cgaccaccta cacaaagttt acaaaaacac agacgtcaac 1381 gatccgctag aacagagctt acaactaaca ttctttttcc ctttgggtat ctacataccg 1441 agcgagaccg gtttcagtac aatggaaaca cgtgtgaaat taaacgacac catggaaaac 1501 aacctaccca ccagcgtttt tttccacaat aaagaccaag tcgtgcagcg aattgatttt 1561 gccgacatat taccgtcggt ttgccatccc attgtccacg actcgaccat cgtcgaacga 1621 ctcatgaaaa gcgaaccatt gcctaccggc caccgctttt cccaactatg tcaactaaaa 1681 attacccgag aaaacccagc caggatctta cagaccttat acaacttata cgaaagtcga 1741 caagaagtac ccaaaaacac caacgtctta aaaaacgaat taaacattga agatttttac 1801 aaaccggaca atccaacact gccgaccgaa agacacccct tcttcgatct cacgtatatc 1861 cagaaaaacc gagccacaga agtactctgc acaccaagaa taatgatagg caacatacct 1921 ttaccgttag ctccagtctc tttccacgaa gcccgtacaa atcaaatact ggaacatgca 1981 aagacgaact gccaaaagta cgacttcacc ctcaaaattg tcaccgaaag cttgacgagt 2041 ggctcgtacc cagaattggc ttacgttatc gagaccttag tgcatggaaa caagcatgct 2101 tttatgatcc taaaacaagt aattagccag tgtatttctt attggtttaa catgaaacat 2161 atacttcttt tttgcaacag cttcgagatg atcatgctaa tctctaacca catgggcgac 2221 gaactgatcc cgggagcagc tttcgctcac tacagaaatc ttgtgtcgct aattcgccta 2281 gtgaagagaa caatctctat ctccaacctc aacgagcaac tttgcggcga acctctggtg 2341 aatttcgcca acgcgttgtt cgacggacgt ctgttctgcc cgttcgtcca taccatgccc 2401 agaaacgaca cgaatgcaaa aataacagcg gatgatacac cactgacaca gaacaccgta 2461 agagttagaa attacgaaat atccgatgtg caaagaatga atctaataga ttcaagcgtc 2521 gtctttaccg acaatgacag accatcgaac gaaaccacca tcctgagcga gatattttac 2581 ttctgcgtac tcccggcact atcaaataac aaggcctgtg gcgctggcgt caacgtaaag 2641 gaactagttc tagacttatt ctacacggaa ccgttcatca gtccagatga ttatttccag 2701 gagaatccga ttaccagcga cgttctaatg tctctgatcc gagaaggtat gggccctggc 2761 tacaccgtag ccaacacatc ctgtatcgca aaacagttgt ttaaatcgct aatctacatt 2821 aatgaaaata cgaaaatatt ggaagtggaa gtctccttag atcccgcgca gcgacacggc 2881 aactccgttc attttcaatc actacaacac attctataca acgggctttg cctgatctca 2941 ccgatcacca ccctaagacg gtactatcaa ccaatcccat ttcatcgatt cttctccgac 3001 ccgggaatct gcggcaccat gaatgctgat atccaagttt tcctaaatac atttcctcac 3061 tgtcaaagaa acgacggcgg ttttcctctc ccgcccccat tagcattaga attttataat 3121 tggcaacgaa caccgttttc cgtgtactca gccttctgcc ccaattccct gttgagcatt 3181 atgacgcttg ccgccatgca ctcaaaattg tctcccgttg ccatagcgat ccaaagcaaa 3241 aacaaaatcc atccgggctt tgcggccaca ctagtccgga cggataattt cgacgtcgag 3301 tgcctattat acagttccag agcagccaca tctataattt tagacgatcc cacggtcacc 3361 gcggaagcta aagatatcgc aaccacttac aacttcaccc agcacctaag ttttgtagat 3421 atgggcttag gttttagctc taccaccgcc actgccaatc ttaagcgaat taaatcagat 3481 atggggagca agatacaaaa ccttttctcc gccttcccga tacacgcgtt taccaacgcg 3541 gacataaata cgtggattcg acatcacgtc gggatagaaa aacctaatcc ctccgagagc 3601 gaagcactaa acatcataac gttcggcgga attaacaaaa acccaccctc catactactg 3661 catggtcaac aagctatctg cgaagttata ctgaccccgg ttacgacaaa cattaacttt 3721 ttcaaatcgc cccacaaccc aagaggcagg gaatcatgta tgatgggaac ggacccgcac 3781 aacgaagagg cggctagaaa agcattgtac gaccacaccc aaacagacag cgatacattc 3841 gccgcaacca caaacccttg ggcatctcta ccaggctcct taggcgatat tctatacaac 3901 acggcacaca gagaacaact atgttacaac cccaagacat acagtcccaa cgctcaattt 3961 tttaccgaat ctgacatctt aaaaacaaac aagatgatgt acaaagtgat aagcgaatac 4021 tgcatgaaat cgaactcgtg tttaaacagc gatagcgaaa tacaatactc gtgctctgag 4081 ggcacggata gcttcgtaag cagaccatgc cagttcttac aaaacgctct gcctcttcac 4141 tgttcatcca accaagctct attagagagt cggtctaaaa ccggcaatac gcagatcagc 4201 gaaacccatt attgtaatta cgccatagga gaaaccatac ctttccaact cattatcgaa 4261 tcatccatat aaaatggaaa ccgtctactg cactttcgat cacaaactgt cactttccga 4321 tatcagcacc ctatgcaagc tcatgaacat cgtcataccg atcccagctc accaccatct 4381 aataggtagc ggcaatttag gtctttatcc catcgtctcc tccaacaaag attacgtcca // LOCUS HUMSEXREPB 916 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human sex chromosome repeat, clone pDP330. ACCESSION M33524 KEYWORDS sex chromosome repeat. SOURCE Human cell line OXENII DNA, clone pDP320. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 916) AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and Page,D.C. TITLE Human sex-chromosome-specific repeats within a region of pseudoautosomal/Yq homology JOURNAL Genomics 7, 625-628 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.C.Fisher, 03-APR-1990. FEATURES from to/span description site 1 388 low copy flanking sequence rpt 389 916 sex chromosome repeat BASE COUNT 206 a 228 c 229 g 253 t ORIGIN Chromosome Yp. 1 gaattcaggc ctcagtgtat gtctgtaaca caacagacag ggtctgcagg ggtcgaagta 61 ttttgtcatc aaagaggaag gaatgatcat tcatcataaa aggcaagaca tctttggtgc 121 aaggaaaact caagaaaaat accgcagacc atgcaatgag gcactggtcg atggagtgtt 181 gtaaacccgt cttcccagag tggcatgcac atggatccct cagcacatgg gtgacacaca 241 gactatgctt cagcaggtct gtctgggccc aagacacatt gtttctcatc agctcccagg 301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct 361 tcccagagtt gttacaaaga aatcccaaag actgggcccc ttaaacaaca accttgattc 421 tcacagtcct tgaggctaga agtctgagat caagctatgg ccagggctgg ttcctcctga 481 ggcctctctc cttgggttgt agatgctgtc ttctccctgt gtcctcacag ggttgtccct 541 ctgtgtgtgt ctgtgtcctc atctcctctt cttatgaggt gtcttagtcc atttcaggct 601 gctgtcacag catgccgtag actgggtggc ttatcagcaa cagacattga ttctcccaca 661 gtcctggaag ctggacgtct gagatcaggg tatgggcagg gctgcttcct cctgaggcct 721 ctgtcctggg cttgtagatg ctgtcttctc catgtgtccc catgtggtca tccctctgtg 781 ggtgtgtctg tttcctcatc tgctcttcta atgagatgtc ttagtccatt gcaggctgct 841 atcacagaat accataggct gggtggctta taaaccacag agttttattc ttccacagtc 901 ctggaggctg gaattc // LOCUS HUMSEXRPA 918 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human sex chromosome repeat, clone pDP316. ACCESSION M33523 KEYWORDS sex chromosome repeat. SOURCE Human cell line OXENII DNA, clone pDP316. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 918) AUTHORS Fisher,E.M.C., Alitalo,T., Luoh S,-W., de la Chapelle,A. and Page,D.C. TITLE Human sex-chromosome-specific repeats within a region of pseudoautosomal/Yq homology JOURNAL Genomics 7, 625-628 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.C.Fisher, 03-APR-1990. FEATURES from to/span description site 1 388 low copy flanking sequence rpt 389 918 sex chromosome repeat BASE COUNT 207 a 242 c 226 g 243 t ORIGIN Chromosome Yp. 1 gaattcaggc ctcagtgtct gtctgtaacc caacagacgg tgtctgcaga gatcgaagta 61 ttttgtcgtc gaagaggaag gaatgatcat tcatcacaaa aagcaagaca tctttggtgc 121 aaggaaaact cgaggaaaat accgcagacc atgcaatgag gcactggttg acggtgtgtt 181 ataaacccgt cttcccagag tggcatgcac acggatccct caggacatgg gtgacacaca 241 gactatgctt cagcaggtct gtctgggccc aagacacagt gtttctcatc agctcccagg 301 ggatgtcaag gctgcagatc catggatctc actttgcagg acagagactt ggtaatggct 361 tcccagagtt gttacaatgc aatcccaaag actgggcagc ttaaacaaca accttgattc 421 tcccacagtc ctggaagctg gaagtctgag atcaaggtgt gggcagggcg gttcctcctg 481 agtcctctct cctgggcttg tagatgccgt cttctccctg agtccccacg tggtcatccc 541 tctgtgtgcg tctgtgtcct catctcctct tcttatgagg tgtcttagtc catttcaggc 601 tgctgtcaca gcataccata gactgggtgg cttataagca acagacattg attctcccac 661 agccctggag gctggacgtc ttgagatcag gatatgggca aggctgtttc ctcctgaggc 721 ctctgtcctg ggcttgtaga caccatcttc tccctgtgtc cccacgtggt catccctcta 781 tgtgcatgtc tgtgtcctca tctgctcttc ttatgagatg tcttagtcca ttgcaggctg 841 ctatcacaga ataccatagg ctgggtggct tacaaaccac agacttttat tctcccacag 901 tcctggaggc tggaattc // LOCUS IRICAP 2461 bp ds-DNA VRL 09-AUG-1990 DEFINITION Iridescent virus type 1 capsid protein gene, complete cds. ACCESSION M33542 KEYWORDS capsid protein. SOURCE Iridescent virus type 1 DNA. ORGANISM Iridescent virus type 1 Viridae; ds-DNA nonenveloped viruses; Iridoviridae. REFERENCE 1 (bases 1 to 2461) AUTHORS Tajbakhsh,S., Lee,P.E., Watson,D.C. and Seligy,V.L. TITLE Molecular cloning, characterization, and expression of the Tipula iridescent virus capsid gene JOURNAL J. Virol. 64, 125-136 (1990) STANDARD simple staff_review FEATURES from to/span description pept 601 1995 capsid protein mRNA 587 > 2461 capsid protein mRNA ( 5' end +/- 5 bp) BASE COUNT 717 a 462 c 443 g 839 t ORIGIN 1 gaaggtgttg aaagatctac tgaaataggc ttcattagca tttttatttt gtccacaaat 61 tcattatttt taataggctg ttcttcacct ttattcgcat attcaaagta atcgattaaa 121 tttttttgaa tatggacgat atcatccatg aacataaacc aaacttcata atatatagta 181 tggagtaacg ggttaattaa accattgatt ccttttaatt gttttggatt aatgaggttt 241 aaatcatcat aaattttttc tatttttttt aaattttttc gagcaatttt taaatttgat 301 ttaaccaaac aaacttcctc tactttaatt gttacggttg gtacttttaa accattaatt 361 ttatttttag aggaagaaca acgctttatt aaagcgttgg aatccattaa tcgcttgttt 421 tatcataggt tattttttaa ctataaaaaa ataactaaat tactacagtt accaatatgt 481 cggcattagt tctccttcat attttcgtat tttataccct taaatttaac ctaatcaatt 541 tctacattta tttttgggtt caaaattttt agccgaaata ttgctactaa taaattaaac 601 atgtctatgt cctcatcgaa tataacctca gggtttatcg atatcgccac ttttgacgaa 661 atcgaaaaat atatgtatgg cggcccaaca gcaacagcat actttgttag agaaattaga 721 aagtcgactt ggttcactca agtaccagtt ccactatcta gaaatactgg taatgcggct 781 tttggacaag aatggtcggt atctatatca cgtgctggag attatttgtt gcagacctgg 841 ttacgagtca atatcccacc agttactctt agtggtctac ttggtaacac ttactcttta 901 agatggacca aaaatttaat gcataacttg attcgtgaag ccaccattac ctttaatgat 961 ttggttgcag ctcgatttga taactatcat ttggatttct ggtctgcttt caccgtacct 1021 gccagcaaac gcaatgggta tgataacatg attggtaatg tctcttcttt aattaatcca 1081 gttgctccgg gtggtacttt gggtagcgta ggtggtatta accttaatct tccacttcca 1141 tttttcttct ctcgagatac tggtgtagca ctaccaacag ctgctctacc ttacaatgag 1201 atgcaaatca actttaattt cagagattgg catgagcttt tgattttgac taacagtgct 1261 ctagtaccac cagcaagtcc atatgttcca attgttgtag gtactcatat ttcagctgct 1321 ccagttttag gaccagttca agtatgggct aactatgcca tcgtctccaa cgaagaacgt 1381 cgtagaatgg gttgtgccat tcgagacatt ttgattgaac aggttcaaac ggcaccacgt 1441 caaaattatg tacctttgac caatgctagt ccaacatttg atattcgttt ctctcatgca 1501 atcaaagcat tattctttgc tgtacgaaat aaaacatctg cagcagaatg gtcaaattat 1561 gctacttctt ctccagttgt tactggtgca acggttaact acgaaccaac aggttctttt 1621 gaccctattg ccaatacaac attgatttat gagaacacta atcgtttggg tgccatggga 1681 tcagattact tctctttgat taatccattc tatcatgctc caactattcc atcattcatt 1741 ggatatcatt tgtactcata ttctcttcac ttttatgact tggatccgat gggttctacc 1801 aattacggta aactcactaa tgtgtctgtt gtaccccaag ctagtccggc agcaattgcg 1861 gcagcaggag gtactggtgg tcaagcaggt tcagattacc ctcaaaatta tgaatttgtc 1921 atattagctg tcaataataa tattgtcaga atatcaggtg gagaaacacc acaaaattac 1981 atagcagttt gttaaggtaa tttgtaacgc tccacaacag gcggaagtgg tctcgtgaga 2041 gaccgatatt gaggttttat caaccttaat ttgaatcatg aattaacatg atactttggt 2101 accgtctagt cggcttatat gtcgggctaa tggtcttttt tgatcatcaa gtggctataa 2161 gtggtacgtc gacgacagtc gacacctagt ggtttaataa aggtttttta cccaaattaa 2221 actggaacag gcaaggttga tgaaaacggt caaaattcag atagtctcgg gggctatttt 2281 ggacaagacc gtcggtgcag ctaatgcgta agcatcagtg atatcgctat cgactgggtc 2341 atcaatcggt tgtcctatct gactttttaa agtctcagga tggctcaatg tacagtcagc 2401 ccgcagtaag gtgtattccg agctgtcttt gaggataaaa gtaaacttga aaaagaagct 2461 t // LOCUS MUSIGHAAR 363 bp ss-mRNA ROD 09-AUG-1990 DEFINITION Mouse Ig rearranged H-chain mRNA V-D-J region, partial cds. ACCESSION M33679 KEYWORDS diversity exon; immunoglobulin heavy chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J. TITLE Clustered heavy chain somatic mutations shared by anti p azophenylarsonate antibodies confer enhanced affinity and ablate the cross-reactive idiotype JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital, Jackson 1402, Blossom Street Receiving, Boston, MA 02114 FEATURES from to/span description pept < 1 > 363 Ig heavy chain V-D-J region (AA at 1) BASE COUNT 98 a 83 c 89 g 93 t ORIGIN 1 gaggttcagc ttcagcagtc tggagctgag ttgatgaggc ctgggtcctc agtgacgatg 61 tcctgcaagg cttccggata tgcaatcaca agctacggtt taaactgggt gaaacagagg 121 cctggacagg gcctggaatg ggttggatat attcatcctg gaaaaggtta tattcactac 181 aatgaaaaat tcaagggcaa gaccacactg actgtagaca aatcctccaa tacagcctac 241 atgcaggtca gaagcctgac atctgaggac tctgcagtct atttctgtgc aagatcgttt 301 tttgacattt acatgtatta ctttgactac tggggccagg gcaccactct cacagtctcc 361 tca // LOCUS MUSIGKABF 324 bp ss-mRNA ROD 09-AUG-1990 DEFINITION Mouse Ig rearranged L-chain mRNA V-J region, partial cds. ACCESSION M33678 KEYWORDS immunoglobulin light chain; joining exon; processed gene; variable region. SOURCE Mouse (strain A/J) hybridoma cell line 45-49, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Parhami-Seren,B., Wysocki,L.J., Margolies,M.N. and Sharon,J. TITLE Clustered heavy chain somatic mutations shared by anti p azophenylarsonate antibodies confer enhanced affinity and ablate the cross-reactive idiotype JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Parhami-Seren, 11-APR-1990. Massachusetts General Hospital, Jackson 1402, Blossom Street Receiving, Boston, MA 02114 FEATURES from to/span description pept < 1 > 324 Ig light-chain V-J region (AA at 1) BASE COUNT 96 a 77 c 73 g 77 t 1 others ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagntgca gggcaagtca ggacattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctactac acatcaaaat taaagtcagg agtcccatca 181 aggttcagtg gcagtgggtc tggaacagat tattctctca ccattagtga cctggagcat 241 gaagacattg ccacttactt ttgccaacag ggtaatacgc ttcctcggac gttcggtgga 301 ggcaccaagt tggaaatcaa acgg // LOCUS MUSTCVYAN 2567 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse T cell receptor rearranged beta-chain gene, V-2 region, 5' end. ACCESSION M33500 KEYWORDS T cell receptor; beta-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c) DNA, hybridoma B.1.1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2567) AUTHORS Ratanavongsiri,J., Igarashi,S., Mangal,S., Kilgannon,P., Fu,A. and Fotedar,A. TITLE Transcription of the T cell receptor beta-chain gene is controlled by multiple regulatory elements JOURNAL J. Immunol. 144, 1111-1119 (1990) STANDARD simple staff_review FEATURES from to/span description pept 2544 > 2567 T cell receptor beta-chain V-2 region precursor sigp 2544 > 2567 T cell receptor beta-chain signal peptide mRNA 2478 > 2567 T cell receptor beta-chain mRNA BASE COUNT 708 a 560 c 583 g 716 t ORIGIN 1 ctaaagttct tggctactgt tgtgtgcact ttgagtaatg attaagatgc attgggacag 61 ggggtggaga aatgtcccaa ggaggtagcc atgacctcca acactggtcc tgtggaggcc 121 ccgaggagct agctagccat ctgatctgga aacaagaggc ttaacctggc tcagtactga 181 aagctggtca agataagagg gggcaggcag atacctggag gcactgacct tgggaggcag 241 gaaggttagc aagggagata actggagtgt gagagacatt ctgatcccaa tcttgttaga 301 ggattaggct gaagagggtt cagtgtgaag ctcagtaaac tgagaagggc ctaggtttcc 361 ttctcctgga gtctgcttgg ctggacagag cacactgtcc ttagaaaagc aacagagctc 421 tcctggagga gctaggagcc actgacttca gacccaggga atatcttctc taccctcttc 481 cttctggctc ttaaggaggc tcacagggag cttatttagc tttttaagga gatttataga 541 ggctggagga acttgttttt tcaaaagtaa atgctctaga aaaatgaagg ttgaaggtgt 601 tatcaaactt gtgggtcaaa gctaaatgaa aaaaaaaatc aaaagaagga catgtctatt 661 cccaacataa gcagaagact tttattataa atatggtggg agaccatagt cagagacaga 721 gacagctggg aaaggccagc atgaacttga ccctgagcct ggacatctga ggacttgggg 781 gagcaggtgg gaagaaagaa gagagaaaag agagaagagg ggagaccagg agagtaaaga 841 gtagacaaaa ggacagcata gcaaaaatag ctggatttat aggggaaggt agctggggaa 901 aaggcagccc atcccctggg ctggagaagt ttagattaga gggtctgtat tctggccata 961 tcatatacta ggtaggacta aggaatgctg agtgaagctg gcatccaggt ccacaatgac 1021 atgttaaata agaacttcag ttagccattt gctttgggat tgaggcataa taaacgccag 1081 taccccaagc cagctctgtc cacttgtcct cagtaagtga acttaaacag ccaaaccagt 1141 aatctaaata actaactaac taactaacta aatcaatcaa tcaatcaatc aataaaagta 1201 gaaaagattt tttcagtgta aacacattgg taacatggaa aaagatccag agatccagta 1261 aactccctgt gtcagtcttg gggacctgca ggcaagatgg aagtttagag ggccaaggat 1321 aagcaatcta gctcaaagta tggtcctgcc ctgcattgac ccattgccta ggcttgttaa 1381 agctgtgtga aatctctttc caggagatac attcccactc tcgctggtgc ctttcctttc 1441 ttccatgttt tcctggggaa atttctcttt ctttggggtc acttttatca atagcctgct 1501 gttcagattg aaagactgtc tctttagaat gtctttattt ctgccaggtc agttatagaa 1561 agtggcatgt tttcctttat tcaggacaaa actcccattt tgattttctg cttgcattcc 1621 tggagtcaga cagatgagta ttcactgcat acagcctcgt ataaccctgc aaccacctcc 1681 acatgttcac ttaaatggag acattttact ctcttgcaag agcttgaaac tcaaactcag 1741 atctgtgaaa ctataaatcc agtttccttc catccctgct cctggagtga tgaccctgag 1801 actaattatc aataaatgcc tagagcataa gctccagcta gttctctgac ttgctctcaa 1861 cttattatgc cttttattct aacccagctt tagctacatg gctggtttcc tctccttgtc 1921 ttcttacttc agtctcctca gcattacagc tcgaatctct gttctatttc tcaagttcct 1981 ctacctgctg gattatgtcc ttttcctcag tgttccaggc aatctctact tttattctat 2041 cttgagtgac tagttacttc tgctcagctc ccatgattct gacctcctgt gttttgcagg 2101 caaatcttcc atgccctctc ctactatttc ccagaattct ctctattcct gctggatgtc 2161 ccacctactt cctgcatcag ctcattggcc ataagctttt ttattgacag gtgatactta 2221 acacatatca cttccaggaa tatctgttca ccactgagaa gatgcagggg cccagtcact 2281 gcactcagtt ctgtagtgag tgtacaatgt gcatgagtgt ggatgagaga gcattgctca 2341 gaccacagga aagggtgcaa accttcagtt tgaggttttc actttagagg aaagcttagt 2401 cagtttcctg aggaagtcac accctttgga acctcagccc caagacttaa gtttctcgtt 2461 accaccttac tggtttggat tctcttctct tgcctgatgc cctgcatgcc ccacagagat 2521 agagagaacc tgaggtctca gagatgtggc agttttgcat tctgtgc // LOCUS R751TRA 578 bp ds-DNA BCT 09-AUG-1990 DEFINITION Plasmid R751 traJ and traK genes, 5'end. ACCESSION M25422 KEYWORDS inverted repeat; transfer origin region. SOURCE Plasmid R751 (strain HB101, Inc P-beta) DNA. ORGANISM Plasmid R751 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 578) AUTHORS Lanka,E. and Euerste,J.P. TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of plasmid-encoded products with the transfer origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.Lanka, 17-JUL-1989. FEATURES from to/span description pept 48 < 1 (c) traJ protein pept 403 > 578 traK protein signal 243 211 promoter PL signal 266 294 promoter PR rpt 49 64 inverted repeat rpt 118 157 inverted repeat rpt 296 331 inverted repeat BASE COUNT 141 a 168 c 163 g 106 t ORIGIN 1 cggccgtgtt ccttttcgtc gttctccatg cctcgcctcg tctctcatgc cggcggtagc 61 cggctgcctc gcagagcagg atgacccgtt gagcgccccc ggcgcgaata agggacagtg 121 aagatagata accggctcgc cggttagcta acttcacaca tcctgcccgc cttacggcgt 181 taataacacc aaggaaagtc tacaccagcc attacgattt atccgcaact atcgcgctat 241 caggccgcaa aagcagcaac ggatatagcg aaacccgcca caatggccca taatgccgct 301 atcgaagcgt gccaatgcac gccgatagcg gactttttgc gtttccgtag cgccgcttag 361 tagcgttaca tttgcgatga gaggattaga tggacgaaca cgatgccaaa gacctacccc 421 gaagagctgg ctgaatgggt gaagggacgg gaagccaaga agccgcgcca ggacaagcac 481 gtggtcgcgt tcctggccgt caagagcgac gttcaagcgg cgctcgatgc gggctatgcg 541 atgaaaacga tctgggagca catgaaggaa accggccg // LOCUS RP4TRAB 571 bp ds-DNA BCT 09-AUG-1990 DEFINITION Plasmid RP4 traJ and traK genes, 5' end. ACCESSION M25423 KEYWORDS inerted repeat; transfer origin region. SOURCE Plasmid RP4 (strain HB101, IncP-alpha) DNA. ORGANISM Plasmid RP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 571) AUTHORS Lanka,E. and Euerste,J.P. TITLE Conjugative transfer of promiscuous IncP plasmids: Interaction of plasmid-encoded products with the transfer origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1771-1775 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by E.Lanka, 17-JUL-1989. FEATURES from to/span description pept 26 < 1 (c) traJ protein pept 394 > 571 traK protein rpt 48 63 inverted repeat rpt 118 157 inverted repeat signal 219 192 promoter PL rpt 281 318 inverted repeat signal 246 272 inverted repeat BASE COUNT 135 a 148 c 181 g 107 t ORIGIN 1 ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc ggcggtagcc 61 ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata agggacagtg 121 aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg ctgacgccgt 181 tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata tcgtgcgaaa 241 aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta tgcagcggaa 301 aagcgctgct tccctgctgt tttgtggaat atctaccgac tggaaacagg caaatgcagg 361 aaattactga actgagggga caggcgagag acgatgccaa agagctacac cgacgagctg 421 gccgagtggg ttgaatcccg cgcggccaag aagcgccggc gtgatgaggc tgcggttgcg 481 ttcctggcgg tgagggcgga tgtcgaggcg gcgttagcgt ccggctatgc gctcgtcacc 541 atttgggagc acatgcggga aacggggaag g // LOCUS STAREPEBR 2389 bp ds-DNA BCT 09-AUG-1990 DEFINITION S.aureus ethidium resistance (ebr) and replication protein (repA) genes, complete cds. ACCESSION M33479 KEYWORDS ethidium resistance protein; replication protein. SOURCE S.aureus plasmid DNA. ORGANISM Staphylococcus aureus Prokaryota; Bacteria; Firmicutes; Gram-positive cocci; Micrococcaceae. REFERENCE 1 (bases 1 to 2389) AUTHORS Liao,J., C,-H., Moghazeh,S.L. and Projan,S.J. TITLE Genetic mapping and nucleotide sequence of pWBG32, an ethidium bromide resistance plasmid naturally occurring in Staphylococcus aureus JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.J.Projan, 30-MAR-1990. Public Health Res Inst, 455 First Avenue, RM 1166, New York, NY 10016 FEATURES from to/span description pept 1153 1476 ethidium resistance protein (ebr) BASE COUNT 796 a 403 c 290 g 900 t ORIGIN 1 ggtcaatatc tttaagataa tctaaatcgc cattttttaa tttatttctt gcgtctttaa 61 ataatccaga ataaacaaga atttgtttcc ctttaagaga tttataaaat gcgtcgaaca 121 ctttctgatt aattaaatag tcactatcct taccagaata tttagccatt tcatataatt 181 ctttattgct attttgctta attttttgaa catgaacttg cgtaatttca gaaattcctg 241 ttacatctcg ccataaattt aaccattctt tttgactaat ataagctttt gtatctttaa 301 aatatgattt attaacggcc atcaaaacat gaaaatgcgg attataatca tcacgctttg 361 agttatacgt tatctctaat tttcttacat aacctttagt gatcgcattt acttttttgc 421 gtttaaacat cttttgaaag gcatgattat aattcttaat ttcactttct aaatgctcat 481 ctgtaacgtt tggtgtcgta agtgtcaaaa agataaattg cttatcttct tcttgcttaa 541 tatattgcat cattaacgat aatcctaatg catcttttct tgctttacgc cacgcacata 601 ccggacaaaa tcgattctta caaggattcg atttatataa tttctttttt tcaaattttt 661 tatccgtcac aaaagacaaa aatgtattac aatttttaac caaatccatt tgatctcccc 721 gatatgacgt tcaataaaat ttttaaatac ttgatttctt tgctttttct cagtatactt 781 ttccatacga taatacacaa aaacaactta gttttctcaa aaactatgca taaaaaagtt 841 gcttttttct ccttttcttt ttttttcgtt tggattagac acctaaaacg atacaatagt 901 atgctagaaa aagcaacttt ttttgtgctt caaaccagtt ataccaatga attgaaaggg 961 ttatacatcg ccgggaatag ttacccttat tatcaagaca agaagaaact cgttttcaac 1021 tcgtttcaaa aacctttcaa aaaccatcaa tccacaaaaa taccacgcga atgacactca 1081 aaatacaaga ctacaattaa aaaatactta gaataaaatt aaataaaata cgaaaattaa 1141 aaggagttaa aaatgcctta tatttattta ataatagcca taagtactga agttattgga 1201 agtgcatttc ttaaatcttc agaaggcttt tcaaaattta taccatcctt aggaacaata 1261 atttcatttg gaatttgttt ctatttttta agtaaaacaa tgcaacacct accactaaat 1321 ataacttatg caacttgggc gggactaggt ttagtcttaa caaccgtagt ctcaataatt 1381 attttcaaag aacaaataaa tctaataact atagtatcta tagttttaat catagtcggc 1441 gtagtttcgt taaacatttt cggaacatcg cattaattgc tttattccaa ttgctttatt 1501 gacgttgagc ctcggaaccc ttaacaatcc caaaacttgt cgaatggtcg gcttaatagc 1561 tcacgctatg ccgacattcg tctgcaagtt tagttaaggg ttcttctcaa catcaataaa 1621 ttttctcggc ataaatgcca tgctataata gatacacgtc ttctcttagc gtttcatagt 1681 attatcctcg tttattatac ttataattat aggggaaggc ttagagctat cattttgata 1741 gctctttatt tttgttcaaa catttattca aaatcagaat gcctttattt tttaatttta 1801 aggggtattt tgaagaatta agggttattt atatagtttt atacctaaaa acttatatcg 1861 gctcttaaaa cgcaaataag agccgaataa aaataattgc ttttcacaaa caaaaatttg 1921 agcaaaacca gtgttgaatt ttttagacac tgcccatcta catgcaaatt taaaaattgg 1981 cataaaaaat gggcaaccat gctggttgaa cgctatagtt cctgcagggg caaaaaagca 2041 taaaaaaacg ctagctttga tgagctaacg ttagttataa aattcagtaa tatgcttttg 2101 taattcaata gattctcttt cttttttagc ttgtcttttt ttaaaacctt ctgaatttct 2161 agaagcctta tatatatcca ttattttttt ataatcaatg tcgtaaccat atttttgtaa 2221 ctcttctaca aaaaacttat cgcaatttaa tatcattttt cttcctcgat ttcgtttatc 2281 atttgatgat ttattttttc tttttcttgt tcagttaaat cataaatttc acttgctaag 2341 tattcttttt gattccaaat ataaaaaatt tgataaatat attcagtcg // LOCUS XANAVR 2100 bp ds-DNA BCT 09-AUG-1990 DEFINITION X.campestris avirulence protein (avrBs1) gene, complete cds. ACCESSION M32142 J03672 KEYWORDS avirulence protein. SOURCE X.campestris (strain E3, race 2, pv. vesicatoria) DNA. ORGANISM Xanthomonas campestris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2100) AUTHORS Ronald,P.C. and Staskawicz,B.J. TITLE The avirulence gene avrBs-1 from Xanthomonas campestris pv. vesicatoria encodes a 50-kD protein JOURNAL Mol. Plant Microb. Interact. 1, 191-198 (1988) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by P.Ronald, 15-FEB-1990. FEATURES from to/span description pept 308 622 ORF1 pept 713 2050 ORF2 BASE COUNT 656 a 423 c 505 g 516 t ORIGIN 1 ccattgtcgg cggttatccg ggtacttggc gtacaccaaa caactggggc aatgctggca 61 aatcacgtga cgaagccttg gcagacgagc aacagaggat tcaagcgctt aaatcgcaag 121 agacggtaca tatcttccat cgcaaagatg tcaagagcga acccgcaacc cacgcggggc 181 gacgttaagt aagccactga tttttagcga agaagagctt gtgagagctg cgggcgccaa 241 atatgtacgt ttgacagtga cagatcatct ttcaccacgg gcggacgata ttgatgcgtt 301 tattgcaatg gagcgggaga tggcccatga tgagagactg catgtacatt gtggtatggg 361 cctaggccgt acgacaatat ttattgtcat gcatgacata ctaagaaatg ctgcaatgtt 421 atcgtttgat gatatcatcg aacggcaacg taaatttaat ccagggcgaa gcttggataa 481 taataaagac gtttctgaca aggggcgctc agaatttcgt aatgaacggt cagagttcct 541 tcctctattc tacgagtacg ccaagcaaaa tccaaagggc cagccattgt tatggtccga 601 atggctcgac cacaatgcat aaatcgcaag tacattttcg gctatgacgg acttgtgctc 661 gatgcgctgg cggctttctc gataaatatc aattaatata aatatcgaac taatgtccga 721 catgaaagtt aatttctctt caaaaataat agattcaaca cccagtgaag aggaggtcgc 781 cactcagcaa gatagttata cgaaatctgg actggtggcg ccatcgctcg attcacaagc 841 cttgaaaaaa gcacctagaa aaagagtaat aaaagaaaat atagctgctt tgcacacctc 901 atcgttagag cgagttcatc aaaagaaggt attagttcag aatttagcgc agttgcagag 961 agggttggct aagataaatg gtagagtcga actcgaagag ctaattgatg gattttcagt 1021 caaggaattg ctaataaaaa gaaatccaaa gattgctgaa gagtatggag aaggaaatcc 1081 tttaatgatt cgatctctaa gattttcaaa cccccaagag gtgactagta agcttggggc 1141 ggaaggaaaa acgccagcca aaagagaggt tgatacgatt tgcaataaat ccacgctgca 1201 tgacattgtc atgacgcccg cctcccttgt aaaaaaggaa gtgcggatga acctgatatc 1261 tgaagtccca agggcgaagg ataaacaaaa atacagaggt cttccttcag tcgtatatgg 1321 ccaaagcagc cgccgtagtg aatcagacta tctaacgtct cgaaatggtt tcggcgacgt 1381 gcactctttg aaatccaata acgcatttaa ttccgactac gaaaaaatat gtgggtcgct 1441 tagccatgcc gaaaagttgg ggttaattga aaggaatctt actcccttta taaggcatga 1501 tccagataga atctccaccg actttgttca ctctattgaa gaattggctg aacaccagat 1561 gctattgcaa tcaagaaaac ctgccagtgc tttgcggcat aatgaatatt gcaccaagct 1621 tgaactgtgg gatgctaaag ctatagcagt tggtgaatct cgtgccttgg cggtcgctac 1681 cctgattgaa tttaatttgg agatgttgtc gatagcacaa gagatagatg atgatgggca 1741 caagagtaaa atggtcgccg attttatcga gcgccaacta tcatggcttg gcccacaaac 1801 cgcacttgac agcaagtcaa cgcttgaaag ggtttcagcg gtgaccatac aagaaaggga 1861 atttatcgct aatgagatta gccgatcgtt gcgtcaaggt gtttcacttt gcacttacga 1921 taaagatgaa gcaggaagtc atatccgtga aatgagtttg ttggatttta gggttgaaga 1981 aatcatagag gggataagta tttttatttc ctccaagctt ttacatgtta caaatgcagg 2041 agaagcgtaa gagaagaagt atccgccaca atcgtgcgac ggaccgacgt cctaacgccc // LOCUS YSCSCD25 5055 bp ds-DNA PLN 09-AUG-1990 DEFINITION S.cerevisiae SCD25 gene, complete cds. ACCESSION M26647 M31771 KEYWORDS Ras protein; SCD25 gene; cell division cycle. SOURCE S.cerevisiae (strain OL136) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 2129 to 5055) AUTHORS Boy-Marcotte,E., Damak,F., Camonis,J., Garreau,H. and Jacquet,M. TITLE The C-terminal part of a gene partially homologous to CDC25 gene suppresses the CDC25-5 mutation in Saccharomyces cerevisiae JOURNAL Gene 77, 21-30 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 3880) AUTHORS Damak,F., Boy-Marcotte,E., Le-Roscouet,D., Guilbaud,R. and Jacquet,M. TITLE SCD25, a CDC25 like gene, which contains a RAS activating domain is a dispensable gene of Saccharomyces cerevisiae JOURNAL Unpublished (1990) See COMMENT for author address STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by E.Boy-Marcotte, 02-AUG-1989, for [2] by F.Damak, 01-FEB-1990. Laboratoire IGD, Groupe des laboratoires de biologie cellulaire, Centre universitaire d'Orsay, 91405-Orsay Cedex FEATURES from to/span description pept 128 3880 SCD25 protein pept 4319 > 5055 ORF X BASE COUNT 1638 a 973 c 900 g 1544 t ORIGIN 1 ctgcaggctc gcaaaattta aggttccctt ctacaatagt agtcaaaatt gcttttttgc 61 atataacaaa gtgaaaaaaa aaaatatgag agacatatct aaaagacata tataatctgc 121 caccataatg agttgcactg cgtcatatgc cggcatgaca actccggtga aagataagga 181 aggccacggg attccatgct tacaacctat cgatgtagtg gaatgtacct atcaatattt 241 tacaaaatca cggaataaac tgtctttaag ggtaggcgat ttgatttacg tactcactaa 301 aggttctaat ggctggtggg atggtgttct tatcagacac agcgctaata ataataataa 361 taattcgttg atactagaca gaggttggtt ccccccttct tttacacggt ccattctaaa 421 cgaactacac ggggtgcctg acatcggtaa tgaattggaa atatttcaag cgggtcttaa 481 tcttaaactg gaattatcaa gcaacccagt gatcttatca ttggaagact ttttagactg 541 ctgtcgcgat attgaattca aggaacaact ggcttggtca cctactcccg tccacgaaag 601 gaaaggctgc tgtgagctgc tgtactataa ccaggattta gatgtttatt gtcgcacgtt 661 accatattta ccacaaaatc aagttgaaac cgtgaacgac tattcgtctt ttcctgcaat 721 atcgaagatt gctggtaaaa agatgcctat aacgtcaagc cccgatctgt tctatctcaa 781 tgattgtgat gtcgtctatt ggtatgacct cactcgctta gtgtgtcatt atgttaattt 841 aacagagcgc gacctattgg caaatgaacg ggaaaagttt ctaacttcct tggatttatt 901 aacagctcaa ataacctatg tttatatgct tttcaggaat ctccgtttag ttgaagatag 961 tttcaaaaaa accctcaaaa aactaattta caccttgtct aggttttcaa taaatgcaaa 1021 tatttggttt cattccacat cgtttgaaga aagagaagcc atagcctccc agaaggatcc 1081 agaaagaaga tcccctcttc tacagtcaat cctaggaacc ttccaaaaat ttcattttct 1141 actgcgtcta ctacatttcc tctcaaatcc taacgaactt acaatactgc ctcaattgac 1201 tcctcgattt ttcaaggatt ctttcaatac aatttcatgg aataacccgt ttttgcgtac 1261 agtcttcaac cagcatatgt ccatgacctt accgagacag atgattaaag ccgttgctgg 1321 cgcttcagga attgttgcgg aaaatattga tgaaattcca gcttccaaac agggcacttt 1381 catctcgtca gaaacgtctc accattcacc atcagccccg tttcaaagaa ggagaagagg 1441 taccattttc tctaatgtgt caggaagttc cgatgagtct gacaccatat ggtccaaaag 1501 gaaaaaacca tacccgctaa atgaagaaac tctaagcctt gtaagggcca ggaagaagca 1561 gcttgatggt aaactaaaac aaatgatcaa aagtgctaat gaatatctca gtaacacggc 1621 taatttcaaa atgttgaatt ttgaaatgaa cttcaaaacc tacgaagaag taagcggaac 1681 aattcctata attgatattc tggaaaacct agatttaact atttttctaa acttgagaga 1741 gttgggagat gagaatagag tttttgacga agatgtcttt gacgaagatg tcgctattgg 1801 tgatgaagat aaagagtttt tgaaacactc tttatcatcc ctatcgtata tcttatccga 1861 ctattttaat atgaagcaat attttcatga attgtcgccc acgcatttga cattagagga 1921 tcctttcgtt ttctcgccaa tgcaaaacga cttgcctacc ggttattatg aaccaatgaa 1981 accttcatcc ttgaatttag ataatgccaa ggataagaag aatgggagcc aaaatactga 2041 tatccaagag gaggaagatg aatatgagcc agacccggat agtcttattc tcttccacaa 2101 cctcatcaat caagattctg atttcaatga tctaaagttt tttaatctcg cccacgtttt 2161 taaaaaatcc tgtgatgact attttgatgt gcttaaacta gccattgagt tcgtgaatca 2221 attaattcta gaaagagaga atttgttaaa ttatgctgct agaatgatga aaaacaatat 2281 cacggaattg ctattgcgcg gggaagaagg ctatgggtcc tatgacggcg gtgaaactga 2341 aaaaagtgac acgaatgctg tttatgcaga ttcagatact aaagacaatg acgaatggcg 2401 tgacagccaa gtcaaattac cgaggtattt gcagcgcgag tatgacagtg aactgatttg 2461 gggctctaac aataggatta aaggtggttc taaacacgca ctgatctctt acttgacaga 2521 taatgaaaag aaggacctat ttttcaatat tactttttta atcactttca gaagcatctt 2581 tactacaacg gagtttttaa gctacttgat ctcgcaatat aatttggatc caccagagga 2641 tttgtgcttt gaagaataca atgaatgggt gacgaaaaag cttataccgg ttaaatgtag 2701 ggtggttgag attatgacaa cctttttcaa gcaatattgg ttcccgggct atgatgagcc 2761 cgatcttgcg accctaaatc tggattattt tgcgcaagta gcaatcaagg aaaatataac 2821 aggatctgtg gaattactaa aggaggtcaa tcagaagttt aaactaggta atatacaaga 2881 agcgactgca ccaatgaaaa cgttagatca acagatctgc caggaccatt actcgggcac 2941 tttatactct accacggaat ccattttggc cgtcgatcca gttttatttg ccactcaatt 3001 aacgatacta gagcatgaaa tttattgtga gataaccact tttgattgtt tgcaaaaaat 3061 ttggaagaac aagtatacaa aatcgtatgg ggcttcaccg ggtttgaacg agtttatcag 3121 ttttgccaat aaactgacaa atttcatatc ctactctgtt gtaaaggagg ctgataaaag 3181 taagcgcgcc aagctactct ctcattttat ttttatcgca gaatattgta ggaaattcaa 3241 taacttttct tccatgactg acatcatttc agcattatat tcttcaccaa tttatcgttt 3301 agagaaaacc tggcaggcag ttattcctca aacgagagat ctattgcagt cactgaacaa 3361 gttgatggat cccaagaaaa atttcataaa ttacagaaac gagctgaagt ctttacatag 3421 cgctccctgc gtaccgtttt tcggcgttta tttatctgat ctaaccttta ctgattccgg 3481 aaatccggat tatcttgtct tggaacatgg tttaaagggt gtccatgatg agaagaaata 3541 tataaacttc aacaaaagga gcagacttgt tgatatctta caagagatca tatatttcaa 3601 gaaaacacat tatgatttca ctaaagatcg gacggtaatt gaatgtatat caaattcatt 3661 ggaaaacatc ccccatattg agaaacaata ccaattatca ttaattattg aaccaaaacc 3721 aagaaagaaa gtcgttccga attccaattc gaataataaa tcacaagaaa aatccaggga 3781 tgaccaaacc gatgaaggaa aaacatccac taagaaagac agatttccaa aatttcaatt 3841 acataagaca aagaaaaaag ctcccaaggt ttctaagtaa cggcgccgta tgttcgattt 3901 ccttctctcg gtggattaat tattttgttt gttttctcct gttatattat ttattgatca 3961 ctatagtaaa ctatgtccgt catcaagccc gacggctgct atcccacaat gttgatcgta 4021 ttgtttgcct agtttattat atatttgctt atttatagca taccataata tttaaatgcc 4081 ctcaaatttt tggccgtagc gacatcgcga taattccaat tccctttaaa aaattgcgcc 4141 tgagtataag ttaattcagc cagttctcca aattaaaatc gcatactcct gaacctatca 4201 acagattgtc ctcgcatact tttctatacc aaggtctctt ctgaacatat attagcagtg 4261 gttaatttta aagagatcat aaagaaaatt ttgtctaaaa aagattaata taaagacaat 4321 gtcttcacta gaagtggtag atgggtgccc ctatggatac cgaccatatc cagatagtgg 4381 cacaaatgca ttaaatccat gttttatatc agtaatatcc gcctggcaag ccgtcttttt 4441 cctattgatt ggtagctatc aattgtggaa actttataag aacaataaag taccacccag 4501 atttaagaac tttcctacat taccaagtaa aatcaacagt cgacatctaa cgcatttgac 4561 caatgtttgc tttcagtcca cgcttataat ttgtgaactg gccttggtat cccaatctag 4621 cgatagggtt tatccattta tactaaagaa ggctctgtac ttgaatctcc ttttcaattt 4681 gggtatttct ctccctactc aatacttagc ttattttaaa agtacatttt caatgggcaa 4741 ccagcttttc tattacatgt ttcaaattct tctacagctc ttcttgatat tgcagaggta 4801 ctatcatggt tctagtaacg aaaggcttac tgttattagc ggacaaactg ctatgatttt 4861 agaagtgctc cttcttttca attctgtggc aatttttatt tatgatctat gcatttttga 4921 gccaattaac gaattatctg aatactacaa gaaaaatggg tggtatcccc ccgttcatgt 4981 actatcctat attacattta tctggatgaa caaactgatt gtggaaactt accgtaacaa 5041 gaaaatcaaa gatct // LOCUS ADAMLPA1 630 bp ds-DNA VRL 09-AUG-1990 DEFINITION Simian adenovirus 30 major late promoter region DNA. ACCESSION M31631 KEYWORDS promoter. SEGMENT 1 of 3 SOURCE Mastadenovirus s30 viral DNA. ORGANISM Mastadenovirus s30 Unclassified. REFERENCE 1 (bases 1 to 630) AUTHORS Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T., Lee,S.G. and Hung,P.P. TITLE Conservation of essential sequences in the major late promoter and tripartite leader of the simian adenovirus type 30 JOURNAL Gene 89, 275-277 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by C.L.Hsiao, 22-JAN-1990. FEATURES from to/span description pre-msg 126 > 630 leader sequence 1 mRNA and intron IVS 167 > 630 leader sequence 1 intron A signal 95 100 TATA box signal 273 292 downstream promoter element site 64 73 upstream regulatory sequence site 210 220 downstream regulatory sequence BASE COUNT 128 a 150 c 214 g 138 t ORIGIN Map position 16.0-16.6. 1 acggtgtgca ggcagaggtc cccgtcctcc gcatccaaaa aggtgattgg cttgtaggtg 61 taagtcacgt gaccttcctt tgggggcggg gggcgataaa agggggcggc gccgtcgtcg 121 ccgtcactgt cctctgcgtc gctgtggacg atcgccagct gctcgggtga gtagaggcgc 181 tcgaaggcgg gcatgacgtc ggcgctgagg gtgtcagttt ctacaaacga ggaggatttg 241 atgttaacct gcccggagcg atgcctttga gaagggcggg gtcgagctgg tcggcaaaaa 301 caattttttt attgtccagc ttagtggcaa aggacccgta gagggcgtag gtcgtaagaa 361 gcttcttgct ttttttccca cagctcgcga ttcaagaggt actcttggcg gttctgccag 421 tactcgggaa gcggaaaccc ctgcgcgtcg gctcggtaag cgcccagcat gtaaaattcg 481 ttaggcgctg acgatgcatt tgattaactg ctgcgtaggc acttgacgcc aggacctgaa 541 ggcggagaaa tccaccggat cggagaactt gtcgaggaag gcgtgtagcc agtcgcagtc 601 gcaaggtaag ctgaggacgg tttccggggg // LOCUS ADAMLPA2 135 bp ds-DNA VRL 09-AUG-1990 DEFINITION Simian adenovirus 30 leader region 2 DNA. ACCESSION M34220 KEYWORDS promoter. SEGMENT 2 of 3 SOURCE Mastadenovirus s30 viral DNA. ORGANISM Mastadenovirus s30 Unclassified. REFERENCE 1 (bases 1 to 135) AUTHORS Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T., Lee,S.G. and Hung,P.P. TITLE Conservation of essential sequences in the major late promoter and tripartite leader of the simian adenovirus type 30 JOURNAL Gene 89, 275-277 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by C.L.Hsiao, 22-JAN-1990. FEATURES from to/span description IVS < 1 35 leader sequence 2 intron N-1 IVS 108 > 135 leader sequence 2 intron N site 36 107 leader sequence 2 BASE COUNT 28 a 36 c 36 g 35 t ORIGIN About 0.8 kb after segment 1; map postion 26.1-26.6. 1 aggtcgtaag aagcttcttg ctttttttcc cacagctcgc gattcaagag gtactcttgg 61 cggttctgcc agtactcggg aagcggaaac ccctgcgcgt cggctcggta agcgcccagc 121 atgtaaaatt cgtta // LOCUS ADAMLPA3 147 bp ds-DNA VRL 09-AUG-1990 DEFINITION Simian adenovirus 30 leader sequence 3 DNA. ACCESSION M34221 KEYWORDS promoter. SEGMENT 3 of 3 SOURCE Mastadenovirus s30 viral DNA. ORGANISM Mastadenovirus s30 Unclassified. REFERENCE 1 (bases 1 to 147) AUTHORS Hsiao,C.L., Woessner,K., Cheng,S.M., Dheer,S.K., Vince,T., Lee,S.G. and Hung,P.P. TITLE Conservation of essential sequences in the major late promoter and tripartite leader of the simian adenovirus type 30 JOURNAL Gene 89, 275-277 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by C.L.Hsiao, 22-JAN-1990. FEATURES from to/span description IVS < 1 35 leader sequence 3 intron N-1 IVS 123 > 147 leader sequence 3 intron N site 36 122 leader sequence 3 BASE COUNT 34 a 33 c 53 g 27 t ORIGIN About 2.5 kb after segment 2; map position 19.3-19.9. 1 ggcgctgacg atgcatttga ttaactgctg cgtaggcact tgacgccagg acctgaaggc 61 ggagaaatcc accggatcgg agaacttgtc gaggaaggcg tgtagccagt cgcagtcgca 121 aggtaagctg aggacggttt ccggggg // LOCUS TFEMERA 1730 bp ds-DNA BCT 09-AUG-1990 DEFINITION T.ferrooxidans mercuric reductase (merA) gene, complete cds. ACCESSION M32353 KEYWORDS mercuric reductase. SOURCE T.ferrooxidans (strain E-15) DNA, clones pTM31[4,5]. ORGANISM Thiobacillus ferrooxidans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Colorless sulfur bacteria. REFERENCE 1 (bases 1 to 1730) AUTHORS Inoue,C., Sugawara,K., Shiratori,T., Kusano,T. and Kitagawa,Y. TITLE Nucleotide sequence of the Thiobacillus ferrooxidans chromosomal gene encoding mercuric reductase JOURNAL Gene 84, 47-54 (1989) STANDARD simple staff_review FEATURES from to/span description pept 65 1702 mercuric reductase BASE COUNT 323 a 542 c 574 g 291 t ORIGIN 1 gcgaccgacg gctgcgaaac gcccgccccg cgtagctgag cacatagaca ctttggagga 61 tattatgacc gagaacgcgc ccaccgaact cgctatcact ggcatgacct gcgacggttg 121 cgccgcgcat gtgcgcaaag cactcgaagg cgtgcccggc gtacgcgagg cgcaggtgtc 181 ctacccggat gccacggccc gggtcgtgct ggagggcgag gtgccgatgc agcggctaat 241 caaggcggtg gttgcaagtg gctatggtgt gcatccacgg agcgacggtg cctcctccac 301 aaacgatgga caggagctac acatcgctgt gatcggcacc ggcggagcgg cgatggcgtg 361 cgcattgaag gctgtcgagc ggggcgcgcg cgtgacgctg atcgaacgca gcaccatcgg 421 cggcacctgc gtgaacatcg gttgcgtgcc gtccaagatc atgatccgcg ccgcccatat 481 cgcccacctc cgccgggaaa gcccattcga tggcggcatc caggcggtcg cgccgaccat 541 ccagcgcaca gcgctgctgg tccaacagca ggcccgtgtc gatgaactgc gtcacgccaa 601 gtacgaaggc atcctggacg gcaacccggc catcaccgtt ctgcgcggtg aagcgcgttt 661 caaggacagc cggagtgttg tcgtccattt gaacgatggt ggcgagcgcg tcgtaatgtt 721 cgaccgctgc ctggttgcca cgggcgccag tccggccgtg ccgccgattc ccggcttgaa 781 agacactcct tattggacct ccaccgaagg gctggtcagc gaatcgatcc ccgagcgtct 841 ggccgtgatc ggctcgtcgg tggtggcgct ggaactggcg caagccttcg cccggctcgg 901 cagccatgtg acgatcctgg cgcgcggcac cttgttcctc cgggaagacc cggccatcgg 961 tgaggccatc acggcggcgt ttcgcgccga aggcatcgag gtgctggagc acacccaggc 1021 cagccaggtc gcttatgcgg atggcgaatt tgtgctagcc accgggcacg gcgaactgcg 1081 cgccgataag ctgctggtcg ccactggtcg cgcaccgaac acacgccgcc tgaatctgga 1141 agcggcgggc gtggccatca atgcgcaagg ggccatcgtc atcgaccagg gtatgcgcac 1201 gaacagcccg aacatttacg ccgctggcga ctgcaccgac cagccgcaat tcgtctacgt 1261 ggcggcagcg gccggcaccc gtgcggccat caacatgatg ggcggtagtg cagccctgga 1321 cttgacggcg atgccagccg tggtgttcac cgatccgcaa gtggcgactg tgggttacag 1381 cgcggaagcg catcgcgacg gcatcgaaac cgacagccgc atgacgctcg acaacgtgcc 1441 gcgggcgctc gccaatttca atacacgcgg cttcatcaag ctggtagccg aagtgggcag 1501 tggctcgcta atcggcgtgc aggtggtcgc cccggaagcg ggcgagctga tccagactgc 1561 cgcgctggcg attcgtaacc ggatgacggt acaggaactg gctgaccagt tgtttcccta 1621 cctgacgatg gtcgaagggc tgaagcttgc tgcccagacc ttcaccaggg atgtgaagca 1681 gttgtcctgc tgtgcgggtt gagacggatt gataaaggag tccctgttgc // LOCUS MMTELPMA 830 bp ss-RNA VRL 09-AUG-1990 DEFINITION Mouse mammary tumor virus (MMTV) phorbol myristate acetate induced mRNA, clone 14. ACCESSION M37198 M19737 M19738 M22729 KEYWORDS . SOURCE Mouse mammary tumor virus, cDNA to viral RNA, clone 14, passed in EL4.E1 cells. ORGANISM Mouse mammary tumor virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses. REFERENCE 1 (bases 1 to 202; 692 to 720) AUTHORS Elliott,J.F., Pohajdak,B., Talbot,D.J., Shaw,J. and Paetkau,V. TITLE Phorbol diester-inducible, cyclosporine-suppressible transcription from a novel promoter within the mouse mammary tumor virus env gene JOURNAL J. Virol. 62, 1373-1380 (1988) STANDARD simple staff_review REFERENCE 2 (bases 1 to 830) AUTHORS Paetkau,V.H. JOURNAL Unpublished (1990) STANDARD full staff_review FEATURES from to/span description pept 168 575 PMA-induced transcript mRNA 1 > 830 PMA-induced transcript mRNA revision 1 1 c in [2]; g in [1] revision 39 39 g in [2]; a in [1] site 94 95 intron site revision 119 119 a in [2]; g in [1] site 706 707 site of 491 bp deletion relative to MMTV virus BASE COUNT 233 a 183 c 184 g 230 t ORIGIN 1 cactgccaga tcgcctttaa gaaggacgcc ttctgggagg gagacgagtc tgctcctcca 61 cggtggttgc cttgcgcctt ccctgaccaa ggggtgcctt gcgaagagcc ttgaccaaat 121 gcagtcagat cttaacgtgc ttcttttaaa aaagaaaaaa gggggaaatg ccgcgcctgc 181 agcagaaatg gttgaactcc cgagagtgtc ctacacctag gggagaagca gccaaggggt 241 tgtttcccac caaggacgac ccgtctgcgc acaaacgggt gagcccatca gacaaagaca 301 tattcattct ctgctgcaaa cttggcatag ctctgctttg cctggggcta ttgggggaag 361 ttgcggttcg tgctcgcagg gctctcaccc ttgactcttt taatagctct tctgtgcaag 421 attacaatct aaacaattcg gagaactcga ccttcctcct gaggcaagga ccacagccaa 481 cttcctctta caagccgcat cgattttgtc cttcagaaat agaaataaga atgcttgcta 541 aaaattatat tttaccaata agaccaatcc aataggtaga ttattagtta ctatgttaag 601 aaatgaatca ttatctttta gtactatttt tactcaaatt ctgttgttag aaatgggaat 661 agaaaataga aagagacgct caacctcaat tgaagaacag gtgcaaggat gtgagacaag 721 tagtttcctg acttggtttg gtatcaaatg ttttgatcta agctctgaat gttctattct 781 cctatgttct tttgcaactt atccaaggtc ttatgtaaat ggcttagtaa // LOCUS MUSPBGD1 2663 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 1. ACCESSION M28663 M29949 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 1 of 4 SOURCE Mouse (strain C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2663) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene: Structural organization, sequence, and transcriptional analysis JOURNAL J. Biol. Chem. 264, 14829-14834 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept 505 537 porphobilinogen deaminase (housekeeping) exon 1 (EC 4.3.1.8) 2191 2307 porphobilinogen deaminase (housekeeping) exon 2 2457 + 2510 porphobilinogen deaminase (housekeeping) exon 3 pep$ 2475 + 2510 porphobilinogen deaminase (erythroid sp.) exon 1 pre-msg 341 > 2663 PGB deaminase (hk) mRNA and introns IVS 538 2190 PGB deaminase (hk) intron A (no splice consensus) IVS 2308 2456 PGB deaminase (hk) intron B IVS 2511 > 2663 PGB deaminase (hk) intron C pre-msg 2192 > 2663 PGB deaminase (ery.sp.) mRNA and introns IVS 2511 > 2663 PGB deaminase (ery.sp.) intron A binding 1126 1133 NPE binding site binding 312 317 Sp1 binding site binding 329 334 Sp1 binding site site 2101 2107 CACCC box site 2118 2124 CACCC box BASE COUNT 622 a 661 c 707 g 673 t ORIGIN Chromosome 9 1 ccacccccac cccacacaca cacacaaagt aaatagggct ggagagctta gtggttaaga 61 gcactgactg ctctttcaga ggtcctgagt tcaattccca gaaaccacat ggtgctcaca 121 accatctgca atagggtctg atgccctttt ctggtgtgtc taaagaagag agcaatggtg 181 tactcatata cataaaataa ttttttttaa aaagtaaaag ataataaaaa ttgaaaagga 241 aaaaaatctt tttgagttgt tctgtgcagt ggacttgagc gaaaaggctg gctatgtcgc 301 aatcctaatt cccgcccaga ggaaggcacc gccccgttga gggagggcag cggacgtgac 361 gcagagctca gcaggtcctg cagccggagt gaagtgcggg ctcgggcccc atgtgccttc 421 agtcccggcc ggcccaggtc gtcggcttct gcagacacca ggggaccgca gcggcactgc 481 cgcgcctgcg ccctgggcgg agtcatgtcc ggtaacggcg gcgcggccac aaccgcggtg 541 agttctgagc cggtgaccga tgacccgcac ttctcggggc tttctgggtg caacgattgg 601 ccccgggttg ccatgttctc gtcgtctatt ggtcggaata gttagctgtc atttttcccc 661 ccccacacct caaggttttt tttaaagggc cagtaactag gttgccctaa ggcagggaag 721 gagtgatctc gagcagtggg ggcggggttg tgagtggaaa ggtggtccgc cctgggattc 781 catccctgta ggctctggct ggatctctgt tgttcccgac cagtaaagga ttatgcacag 841 acaagatcct tttcacgaag aaggggctga ggcaaatcca gctatctcgg aatacgatcc 901 acttcattca ggggagagca caccccactt cttaaaactg tatacaaaca tcttggaggt 961 tacacgcctt ctcccgttct ccgttatgaa gtcacccagc cttagccacc cacaaaagtc 1021 ctagtagaga cacacctgaa ttgctattgt gagcggggga acccacccct gggccttgtc 1081 atttctggcc tgcctggaaa gttctgaact tgtgggcagg ctgcctgaga taaggctgag 1141 ctgggaagct tgcttatctc ctgcccaggc agtaagcagt agtcttggct atgaaaacat 1201 ttttagagca ctgggttagg gtaggaaggc ctggatttca gcacccactt tctgtctgtt 1261 catagctgtg agatgtttag acagtaattt gaccactctg catctttgct tctgtgacac 1321 gggtggaagt acctaccctg tctaacctag tagggttgtt gcaaggacaa tatgcagaca 1381 ctgctcaaat gctgttctgg gtcaatcaat taaaaaacaa attgtttgaa cttagcaatt 1441 cctttctatg ggctccctgt tgtccgaaat ttctgtgtta tttcaagccc agctaaattg 1501 caaaggctat ctcagagtcg tttgttggag gaatcttcgc agtggagtag actggagtcc 1561 aagagcaagt tttcaccttc agtgaccaag aacttgagtg tctggttata gaagaacctg 1621 tgagatgagg aacctggtgc agggaagggg gacaatctgt acagtgactc ctgtcccctt 1681 tgtatcagac tgcagaaccc agttctacct gcttggccct agacaccttt atccaaggcg 1741 ccttaacaaa agaaagaggt gtgtcctttt gagctcttgg ctctggctta agacaccaga 1801 ggaaacccgt aggcaatgac tgttaggcag tttattcttg tagtcttctg ggacttcttg 1861 aggcatgagg tggcctttaa tttaacaagc ccttgatggg atgatgttcc caaagtcacc 1921 caccaagggc atgaaagggc tgtacattag cttggttgat ttcagtcctt gttaggagta 1981 catcctggtg tctcacccag ggcttagtga ggccttctca agtgcctgag ttgttgtgga 2041 cagtgagctt gttctctagc aatgggaggc ttcagctgtc ctgccccagc ttctgtaggc 2101 cccaccctcc agcagggccc accctcactg tgccgaggct gatgggcctt atcattttgc 2161 ccacctggct gtgtgcagcc ctcccactca gaacctcctt ggccaggctg ggctttgggg 2221 ctcagtgtcc tgttgctgct gccacaacag atcctattac agcttttctt ctggtcttgc 2281 ttctctggat cccgtagagg gcagaaggta ccaaggaaga ttcaaggacc agtcctggga 2341 gtctctcctt cctagcagcc tcacctgcct aggacccggg agtcctctct cctaagcctg 2401 tgatcctagt tctttgaatg aggaaaagat cgtaacctag ggactttctt ctgcaggaag 2461 aaaacggctc aaagatgagg gtgattcgag tgggcacccg taagagccag gtgagtacag 2521 acatagcgcg ttgcctcaag aattgtaatg ctcacgggtc actagtggga accaaaggct 2581 agcatcgagc aaataagagt gtgtgagagt cgatttcatg ggggatggca gctcacttcc 2641 tctgaaaaga gagtctctgg agc // LOCUS MUSPBGD2 2763 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 2. ACCESSION M28664 M29950 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 2 of 4 SOURCE Mouse (strain C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2763) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene: Structural organization, sequence, and transcriptional analysis JOURNAL J. Biol. Chem. 264, 14829-14834 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept + 52 124 porphobilinogen deaminase (housekeeping), exon 4 (EC 4.3.1.8) 487 536 porphobilinogen deaminase (housekeeping), exon 5 619 674 porphobilinogen deaminase (housekeeping), exon 6 983 1060 porphobilinogen deaminase (housekeeping), exon 7 1971 2048 porphobilinogen deaminase (housekeeping), exon 8 2143 + 2218 porphobilinogen deaminase (housekeeping), exon 9 pep$ + 52 124 porphobilinogen deaminase (erythroid sp.) exon 2 487 536 porphobilinogen deaminase (erythroid sp.) exon 3 619 674 porphobilinogen deaminase (erythroid sp.) exon 4 983 1060 porphobilinogen deaminase (erythroid sp.) exon 5 1971 2048 porphobilinogen deaminase (erythroid sp.) exon 6 2143 + 2218 porphobilinogen deaminase (erythroid sp.) exon 7 pre-msg < 1 > 2763 PGB deaminase (hk and ery.sp.) mRNA and introns IVS < 1 51 PGB deaminase (hk) intron C; ery.sp. intron A IVS 125 486 PGB deaminase (hk) intron D; ery.sp. intron B IVS 537 618 PGB deaminase (hk) intron E; ery.sp. intron C IVS 675 982 PGB deaminase (hk) intron F; ery.sp. intron D IVS 1061 1970 PGB deaminase (hk) intron G; ery.sp. intron E IVS 2049 2142 PGB deaminase (hk) intron H; ery.sp. intron F IVS 2219 > 2763 PGB deaminase (hk) intron I; ery.sp. intron G BASE COUNT 704 a 605 c 726 g 728 t ORIGIN Chromosome 9; 150 bp upstream of segment 1. 1 gaaaggcagt ggccagggga ggtgagaaac catctgactc tctttcccca gctggctcgc 61 atacagaccg agactgtggt ggcgatgctg aaagccttgt accctggcat acagtttgaa 121 atcagtaagt tttcttgaga ggagtgattg gtagtgaacg ggaagccagt gaaccggagg 181 acagggcatc tctcgtttgc ctgtggtcaa agcctgcctt gtaagactat tctggctgct 241 tgtgaaggga aagaaagatt gtctcctgtg cacatctcct ccagctgccc gggctagcct 301 gacatttcca tactttctgc tttgggttct tttatgagta tgtctgcttt ttctgtcggt 361 gtgtgtatct gagagagtta ggggctgggt cttctatgcc tcagactcca ctgtgaatcc 421 agtcaaggcc tgaacgaggg gtgactcagt aggtgttaat gggtatctga ttgactctct 481 cctcagttgc tatgtccacc acgggagaca agattgttga tactgcactc tctaaggtaa 541 cgccagtcct tgtcccattc ttcttgtccc tctcccacgt gtaaggggtt cactctgagg 601 ctctctcttg cctggcagat tggagagaag agcctgttta ccaaggagct agaaaacgcc 661 ctggaaaaaa acgagtgagt gaggatggag gaatgtggta ccccgagcct agaaccccaa 721 agtggctctc caatattggc aggattgtcg ggttagactg tggagctcac aggctttcac 781 agagaagaga gccttgcctt ggagtagcct aactacctgg ggaatcagac tgccggggga 841 aaggggtaga gtagttgaga agagaccagg tcttagatct taagatgcta tcttcctgaa 901 cggtcaagga tgctggggtg ggtggtggag ataaggtcac ctactcaaag cctctctctg 961 tgcctccccc tgccgtctcc agagtggacc tggttgttca ctccctgaag gatgtgccta 1021 ccatactacc tcctggcttt actattggag ccatctgcaa gtaagcgggg aggacatgca 1081 tgggacggag ggccctgggc aggattaatc ctactgtggg aatctttgag tttttttttt 1141 ttttttttcc atttggaact taaccgctta gccgtctgtt ttgaaggttc tcagacatag 1201 tgtggcagga aagccaattg gttgacttgg ttgactattt agagtttgtg gagttgggct 1261 cagtggcacg gacctgaaat cccagctact gggaggctaa gacaggatca gagattctgg 1321 gccagcctgg gctacagagg gatttgaacc agcctgagga acttagattg tgccttaggg 1381 gcacagaagg ctggcttaca gtggcttagg tggtaaaggc attttttgct gtcaagccaa 1441 tgacctgagt tcagtccgtg gggtgcactt ggtgaaagaa gagggttgaa tcccacaagt 1501 tgtcatctga ctcatgcata catgctgtag aatgtttatg ctcctcatcc ctcaatgaaa 1561 atggaaacaa tcaaggaaat gaaatataaa acctgctggg tggtggtgcg cacgcctata 1621 atcccagcac ttgggaggca gaggcaggtg aattcaacct ggtctacaaa gtgagttcca 1681 ggactataca gagaaaccca gtcttaaaaa caaaacaaaa ctaaacaaca acaacaacaa 1741 caacaaaaaa gaaaaaacaa agaaagaaat ataaaacctt tccaaagaaa ataaaatgaa 1801 tttggcctgg tggctcatgc tataatctca gcattcagag agctgaggca ggagggttat 1861 tgtgagttaa aggctagctg gggtacagag aaaattttag gtcacctggg ctagagttaa 1921 ccctatctcc aaatgctaat acctttattt catcatcatt tgctttgcag acggcaaaac 1981 ccttgtgatg ctgttgtctt tcacccaaag tttattggaa agaccctgga aaccttgcca 2041 gagaaaaggt gagtgggcct agtgtgcggg ggagagaggc ctggacagtg gagaacagtt 2101 ggcagcctgg gttaagttta attctaaact ctctctgagc agtgccgtgg gaaccagctc 2161 tctgaggaga gtggctcagc tacagagaaa gttccccaac ctggaattca agagtattgt 2221 atcctttcag aagaaggagg ggaaaaagag ggaaagaagg accttccgaa gcaagtggtc 2281 catgcggtca gggggtcgtc tttccatctg tccgtccacc cacccaccca cccatccatc 2341 catccatcca cacatccaca gtcctttaat gttttgcttt tttttttttt tcctgagaca 2401 gggtttctct gtgtagcctg gctgtcctgg aactcacttt gtagaccaag ctggcctgca 2461 aagtgagaaa tccgcctgcc tctgcctcct gagtgctggg attaaaggtg cgccaccact 2521 gcctggcacc ctaatgtttt ttaaactcag gcctggcaat gaggacaatt tgcaaaacaa 2581 acatggttcc ttgttctata cagctgacat gttagacaga caggcaggca ctgcagatac 2641 tgaccggtga ccactcctgg tgcagggaca gaggcgcttc tgcttttact ttctgtgctg 2701 ctaagtggtt ttggttttta cagtgaatat gtgatatgtt tcataaaagt aatttttttt 2761 tct // LOCUS MUSPBGD3 800 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 5. ACCESSION M28665 M29951 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 3 of 4 SOURCE Mouse (strain C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 800) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene: Structural organization, sequence, and transcriptional analysis JOURNAL J. Biol. Chem. 264, 14829-14834 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept + 402 515 porphobilinogen deaminase (housekeeping), exon 10 (EC 4.3.1.8) 739 + 777 porphobilinogen deaminase (housekeeping), exon 11 pep$ + 402 515 porphobilinogen deaminase (erythroid sp.) exon 8 739 + 777 porphobilinogen deaminase (erythroid sp.) exon 9 pre-msg < 1 > 790 PGB deaminase (hk and ery.sp.) mRNA and introns IVS < 1 401 PGB deaminase (hk) intron I; ery.sp. intron G IVS 516 738 PGB deaminase (hk) intron J; ery.sp. intron H IVS 778 > 790 PGB deaminase (hk) intron K; ery.sp. intron I BASE COUNT 181 a 201 c 182 g 236 t ORIGIN Chromosome 9; 500 bp upstream of segment 2. 1 ctgtacccca gctagccttt aactcacaat aaccctcctg cctcagctct ctgaatgctg 61 agattatagc catgagccac caggccaaat tcattttata tttctttctt tcttttttct 121 tttttgttgt tgttgttgtt gttgtttagt tttgttttgt ttttaagact gggaaactct 181 gtatagtcct ggaactcact ttgtagacca gatttagcct tgaattcatg gagatctgta 241 tctgcctcca gtgctgggat ttaaaggtgt atacaccacc actcaacaaa aacacaacaa 301 aaacaaaagt tttttaaaag ttagctagag gggggaaaag agactgtggg gcagagggtg 361 cactgggtag gtcttgactt ctccttagca acgctccaca gcggggaaac ctcaacaccc 421 gccttcggaa gctggatgag ctgcaggaat tcagtgccat tgtcctggct gtggctggcc 481 tacagcgcat gggctggcag aaccgggtgg gccaggtagg agctgccctg ttctgcttcc 541 cattgaatct gcctctctcc tgccttgatt tcttggtgac cattctgcca acaacactac 601 aaccagaagc ccaggctagg gatattggga ctcattgctg gatttcctac ctgtgccttc 661 cccaggcttc ctagattgca aaccctagct cactgccttt gaacatcccc tatcccacca 721 tcttgtctct ctccacagat tttgcaccca gaggaatgca tgtatgctgt gggtcaggta 781 ggtaggtttg cctggagaga // LOCUS MUSPBGD4 1386 bp ds-DNA ROD 09-AUG-1990 DEFINITION Mouse porphobilinogen deaminase (PBG deaminase) gene, exon 4. ACCESSION M28666 M29952 J04981 KEYWORDS hydroxymethylbilanesynthase; porphobilinogen deaminase. SEGMENT 4 of 4 SOURCE Mouse (strain C3H) DNA, clone PBGD. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1386) AUTHORS Beaumont,C., Porcher,C., Picat,C., Nordmann,Y. and Grandchamp,B. TITLE The mouse porphobilinogen deaminase gene: Structural organization, sequence, and transcriptional analysis JOURNAL J. Biol. Chem. 264, 14829-14834 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Grandchamp, 06-OCT-1989. FEATURES from to/span description pept + 154 273 porphobilinogen deaminase (housekeeping), exon 12 (EC 4.3.1.8) 459 512 porphobilinogen deaminase (housekeeping), exon 13 618 704 porphobilinogen deaminase (housekeeping), exon 14 783 956 porphobilinogen deaminase (housekeeping), exon 15 pep$ + 154 273 porphobilinogen deaminase (erythroid sp.) exon 10 459 512 porphobilinogen deaminase (erythroid sp.) exon 11 618 704 porphobilinogen deaminase (erythroid sp.) exon 12 783 956 porphobilinogen deaminase (erythroid sp.) exon 13 pre-msg < 1 1279 PGB deaminase (hk and ery.sp.) mRNA and introns IVS < 1 153 PGB deaminase (hk) intron K; ery.sp. intron I IVS 274 458 PGB deaminase (hk) intron L; ery.sp. intron J IVS 513 617 PGB deaminase (hk) intron M; ery.sp. intron K IVS 705 782 PGB deaminase (hk) intron N; ery.sp. intron L BASE COUNT 372 a 305 c 343 g 366 t ORIGIN Chromosome 9; 80 bp upstream of segment 3. 1 atcagtagtt cctgaaacct gttcatacct tgcacctcta tccatcaata atgttaaaga 61 caggtttgtt gttatgcata acccaggaag cagtagaggt gtgtttctca tcttagctct 121 attactagag aagaacagcc tgttgttctt tagggggccc tagccgtgga agtccgagcc 181 aaggaccagg atatcttgga cctagtgagt gtgttgcacg atcctgaaac tctgcttcgc 241 tgcattgctg aaagggcttt tctgaggcac ctggtaagat gggctcctcc catggtgttg 301 tggggaaacc aggaagggca gtagggaggg agatttgtca agtactcagt atgtaatgtt 361 ttgtatgtat ggagaggacc ttgatctggc ctcttgaggt ctgtggtcaa aagtggtgtt 421 aaaggccctt agagctcaaa ggaacaatat cattgcagga aggaggctgc agcgtgcccg 481 tagcagtgca tacagtgata aaggatgggc aagtaagcca gggaaatgga tgaggggagg 541 gactgtcatt tccatgtgca cccaaacatc taagtaactt tctttaaaca tcctggtaca 601 aacattttat ttcctagctg tacctgactg gtggagtatg gagtctagat ggctcagata 661 gcatgcaaga gactatgcag gccaccatcc aggtccctgt tcaggtattg actgggagat 721 gaggaggaat aaatagaact cttgtaatct tcctcttacc aaaattgtaa cctgtcatcc 781 agcaagaaga tggtccagaa gatgacccac aactggttgg aatcactgcc cggaacattc 841 caagaggagc ccagctagct gctgagaacc tgggcatcag cctggccagc ttgctgctca 901 acaaaggagc caagaacatc ctggatgttg cacggcagct taatgatgtg cgctaactgg 961 tctgtagggc acaggaaccc tggctgccac tccagtgcct acttctggct tccaagtgcc 1021 ctgtgctcca tccctagggg tgtgattatc ccaggaaatt gaaccacagg gttgttgaga 1081 cttccacttt ggaagatatg cctcaccttg gggcctccat atctgccttt ccctcagtag 1141 ttgggggctt catctcttta gagaaagtcc atgccaatct ttgaatgtaa ccaataccac 1201 taataaacca gtttagaatg tggttcttct gatagagttg gggaagatat gaataaaccc 1261 aaagcccttt taaacttgaa tgagtctgag acctttctgt tgtaaaacac gctgtgattt 1321 gcctcatgtt ctcaaaaaaa aaaaaaaaaa tcagccttta attcctacag cctgtcttca 1381 gtcgac // LOCUS HUMIBP3 10884 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human insulin-like growth factor-binding protein-3 gene, complete cds. ACCESSION M35878 M35879 M35880 M35881 M35882 M35883 M35884 M35885 M35886 M36121 M36122 J05537 J05538 KEYWORDS insulin-like growth factor-binding protein-3. SOURCE Human leukocyte DNA and, cDNA to mRNA, clone #HL1006d. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 10884) AUTHORS Cubbage,M.L., Suwanichkul,A. and Powell,D.R. TITLE Insulin-like growth factor binding protein-3: Organization of the human chromosomal gene and demonstration of promoter activity JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Powell, 03-JUL-1990. The sequence presented here appears in Figures 2 and 3 of ref. [1]. FEATURES from to/span description pept 2038 2440 insulin-like growth factor-binding protein-3 precursor (IGFBP-3), exon 1 5726 5952 insulin-like growth factor-binding protein-3 precursor, exon 2 6497 6616 insulin-like growth factor-binding protein-3 precursor, exon 3 8212 8337 insulin-like growth factor-binding protein-3 precursor, exon 4 sigp 2038 2118 insulin-like growth factor-binding protein-3 signal peptide, exon 1 matp 2119 2440 insulin-like growth factor-binding protein-3, exon 1 5726 5952 insulin-like growth factor-binding protein-3, exon 2 6497 6616 insulin-like growth factor-binding protein-3, exon 3 8212 8334 insulin-like growth factor-binding protein-3, exon 4 pre-msg 1906 10775 IGFBP-3 mRNA and introns IVS 2441 5725 IGFBP-3 intron A IVS 5953 6496 IGFBP-3 intron B IVS 6617 8211 IGFBP-3 intron C signal 102 1905 bp 3 promoter binding 1808 1821 Sp1 and AP-2 binding domains signal 1876 1881 TATA box site 5728 5736 potential ASN-linked glycosylation site site 5917 5926 potential ASN-linked glycosylation site site 7087 7255 region homologous to IGFBP-3 genomic sequence signal 10751 10756 Poly-A signal BASE COUNT 2796 a 2578 c 2737 g 2773 t ORIGIN 1 ctgcagacct gggacctcaa gaattgcatt tgatgccgaa cccagctcta atttcagagt 61 caaggtctct gcgagtattt aaggaacgga tgtaaacctg ggggattcgt tttgtttcct 121 tcaattttcc aatgaaatca gagatcctgt tcttgggtgt caacgcagat actagaagga 181 ggtgatacaa gagaaaggaa acagcaagcg acgattatgg cacggtttcc tgtaaacaag 241 gttgagtgta gccacagcct gagcactgtg ggagaagagc tcataagaaa atgacggtgc 301 tgggccttcg tcaccccggg gccctccatt gttcttgtct ttggtctctt tttatttgta 361 gaggtccaat tatttattta tttagtacaa gagggaacga aattgatctt tccattctaa 421 aaggagagta tatatgtata aaaggaagct gtatagatat gggggaagag gtggacaggg 481 ggaaaagggg agaggacgag agagagaaag ggagggagag ggacaaggag agacactggg 541 cgagagatcg attaggagag acagaaatga tgaatgaaga ttaacttcac ccaaggcttc 601 gtcgctggag gggaatggag gagctcctga tttgctatta ctactccaaa ctgcaaaggg 661 ctccttcaag tcacctatcc acctcctaag gcaagcgtcc aatttcaaca gcgttcagga 721 aagtctcctc ccgcggaggt ctcaccgctt cccactccac ccccacaaac tctttggaaa 781 agtgccttga aaaatttaat cctcaatcca atcctggacc accagcgtcc tctgttggtc 841 accgaaggag ggggtgcgca gacaaaactg aagaaactcg agtgccagag aaggccgaca 901 ggagttacag cgacctcagc gcgcaattgc gccccgaact ttactgaaaa gtgtttagat 961 tgcagagata agctagaatc ccaacgcatc gagaatacag taatacgaag tcgccttcaa 1021 aaaatgacaa tgaaaattgc ctattaaagg actatttggt taattacgtt tcagcagtgc 1081 ccagtttatt gtctttatta ttcttttgtc gtgggtgtaa actccatttg aaaacataat 1141 cagggagaat acccaagaca agaagaacag ttgtcattta aaatatttga aaagccctgc 1201 cttaaggagc attcgcttgc cggtccactc ttaattgggg acttgcggtg tagcaacacg 1261 tgagagtctt cttgcgttga gaagtaagcc tggaaaggcg aaggccccgg ggcatcttca 1321 gatgcgtatt tgtgggcccc tggggatata aacagcccag cgggtgtaaa ttaaaccccg 1381 cagtgccttg gctccctgag acccaaatgt aagtcagaaa tgtcccaaga cttcgcctgc 1441 caacggaatt aaattttaga aagctccacg aggtacacac gaatgcggag cgctgtatgc 1501 cagtttcccc gacaccggct cgccgcaggg agacctcacc ccgagagcgg aaggggtaag 1561 ggcggcgggg tcaaggagat cgggggtgct gagttggcca ggagtgactg gggtgaccgg 1621 gggtgctgag gtggcctgga gtgccggggt ggccgggcac accttggttc ttgtagacga 1681 caaggtgacg ggctccgggc gtgagcacga ggagcaggtg cccgggcgag tctcgagctg 1741 cacgcccccg agctcggccc cggctgctca gggcgaagca cgggccccgc agccgtgcct 1801 gcgccgaccc gcccccctcc caacccccac tcctgggcgc gcgttccggg gcgtgtcctg 1861 ggccaccccg gcttctatat acgggccggc gcgcccgggc cgcccagatg cgagcactgc 1921 ggctgggcgc tgaggatcag ccgcttcctg cctggattcc acagcttcgc gccgtgtact 1981 gtcgccccat ccctgcgcgc ccagcctgcc aagcagcgtg ccccggttgc aggcgtcatg 2041 cagcgggcgc gacccacgct ctgggccgct gcgctgactc tgctggtgct gctccgcggg 2101 ccgccggtgg cgcgggctgg cgcgagctcg gggggcttgg gtcccgtggt gcgctgcgag 2161 ccgtgcgacg cgcgtgcact ggcccagtgc gcgcctccgc ccgccgtgtg cgcggagctg 2221 gtgcgcgagc cgggctgcgg ctgctgcctg acgtgcgcac tgagcgaggg ccagccgtgc 2281 ggcatctaca ccgagcgctg tggctccggc cttcgctgcc agccgtcgcc cgacgaggcg 2341 cgaccgctgc aggcgctgct ggacggccgc gggctctgcg tcaacgctag tgccgtcagc 2401 cgcctgcgcg cctacctgct gccagcgccg ccagctccag gtgagccgcc cgccaggtgc 2461 gctgcgtgca gcaccgccac tggcgccgaa gggcctgggg gttgctgggt gccgctgcgg 2521 gagactccgc ttttcttctc actggagata atatgtgggg aaactgaagg cgctccggga 2581 aaggtgaagg cggtcgccga gggaccctcc ccagccggcc ctctacttgc tcgattctct 2641 aagtgcagag tacttgtaaa ttgcaaagcg ctttcagtga aaatgggtaa aggtttccgg 2701 agctgagggg agcggtaccg atgtttagct gttggaaaga tcctggacac aggagattct 2761 cctcgccccg cacgggtgca cacggactgc aatcccaggg atgcttgggg atggggggat 2821 ataggcggat ttggaccaag gaaggtgggt aggcacgttg taggaaatag tacctctctt 2881 ttaaaatact gactttgcac agccttttgg tttgcaaagc aatgtctagt cccggtatgt 2941 ccaaaaacaa gtaaagtgga ttcgggtttt gatatcttct gcggttggaa aacctgaagc 3001 tgaaaaagaa gtaacttctt aaggttaccc agcggccaca acagagtgta ggtttgaact 3061 ccgcgtgcca ctttcagtac cataccattc ttacaactcg ggccacccct gcacctgcgc 3121 cgacctcaaa caaacttcca ggtgcgtggt gggtgcgggc aatgtggact aagtcaattt 3181 caatgacacg gcaagggaat tggaatcagt cctaggctgt ctcccttctt aatctgaaat 3241 gggggggggg aatgagatgt tgttaagggg agccccagaa gaggaaaaat gcaaacattt 3301 ggcagagtta ccctcttgct tagccactat cagtatcagg cagacagcga ctctggtaag 3361 ggcatcacat tgttccctta aaaaaaggag cgggggttgt ttaaatggat ttggcagctg 3421 ttctttcaag cattcttagc cagcctcacc tagttatatg agaaataaag ttcctgcctt 3481 gcacagctga aggctgggag aattctcccc atcctaattc ccccaactcc ccaacgatca 3541 cgttggacag atgtcactgg gcaggccccc atctagggct agcaggatga acagtccctt 3601 tataatttat gtagctgtag agttccacgc ccgggtgaag ttattttctg gctcggcaag 3661 gctggctctg ttcacccctg agaaatgctg gattcatgga aaggcaagat gcctgaaaca 3721 tacactggct ctggtcagct gttaaagctg ctggaggcat ttgtctctcg gggcaaagtt 3781 atgtcatttg ccaagtgtcg tacattattg tgcattttgg ggtattcaaa aagtgatctt 3841 agaaatactg atacacatcg tcattcttgg gctttagcaa tcatcatgat taccacctta 3901 gtagcactgt agtataggtt gatgtgagtt ataagattat aaaaagatct aagtgacttc 3961 tagaatctat ttgacaaaaa aaggtaaatt ttcgacagtc aaaagtcaca attatctgtt 4021 gcttaaatag aactgttttg tcttcatgcc ctagtctgca gcccaggcat taagaagaaa 4081 ccaaggaaat ttaagaaatt actcaaggtt cttagaaaag aagtataaat acgtttattt 4141 acatgttctt agagtattta cattcttagt atctctttta tctcagtatt tccttgaaaa 4201 agaaagcaag ctaagattaa aagaaattga aaccaaatcc tcgcaggtag ggacctcctc 4261 tgtgaggctc tgtgctggac cctgggaatg tgtgcttccc aaggtatgaa accccttggg 4321 gaactttaca gcaggacctc agtgagctgt ttggcaggtg aggaaactaa gacccagaga 4381 ggagagggac tttcctaagg ccctggtgag tgacctgcca gtagccactt ccaggggaga 4441 gcagagcatc tgcagccaaa tcattgcagc cccaggtagc tttctagata gactgtggac 4501 cagatgggcc acctgagctc cctgctaggg ttacacatta tagccctgtt tgtgtagtag 4561 agaaatttca tgactctcaa ttgtggactt aagccgatgc ctccagacct tggcatggtc 4621 cacaggccct gggagcatgg gctctgaatg tagcctttga tccccatagc ggtcttacag 4681 cccctccaag ttcattctga agaaggaatg gagtgagaat cctggctgca gatccagtct 4741 tgaatttagt catatactta aaattccaat tcaactgtta acattccagc atccatttta 4801 agcatcagac tttcttcatt tagcactttt tattataaaa gggagatctg ctggaggggg 4861 atttctccta ccccaccccc acccagggaa ggaaaagctc tttggcactt agaagtctga 4921 gccgtgagtg ggactttggc attgtctgca tccatgtgct gctgtgttca cccggggtga 4981 aaaggactca cttaggcagg caccagcaag atgcacaggg tctgtgtaga ccttgagttt 5041 tagagatgta acggggacct agaaaacaag ccaccaacat gcttgcatga ttctgagccc 5101 ctgaggcaaa acgctttgca ggtaataatt cagttttccc atctgagctg gacaccaagc 5161 tcttataagc gtgtttacct ggtagcattg aggacggtac tggtcaacct tggaattccc 5221 ataagggctt gttacaactc agactcgtgc cgccactcca gcgtttccgg agtggagaat 5281 gtgcatttct tccaagtccc cgggctgccg ctgctcccgc gggtgggagg accacacttg 5341 gagttgactg caaaatttct gagccggcgc tgcagcagcc tcccgtggct caggtctgcc 5401 ccctgccggt ggaagatgaa gcatactgcc ttcacctact gaggggcact gaagcgtttg 5461 tctgccttct ttagttgcag ctacttagga agagcacctg tcagattgac tttcaaacag 5521 ataacttctt gaggtagagc aaccaccatg tagtgagtag tatgatggaa taatacttca 5581 tcgaggtatt taaaaaaaaa acctcacttg gattgccaac taatattgtc atttacatgt 5641 gacctggttg caacgttaag atttttacaa gactgtgata gatattgatg actctcatgt 5701 gtttgtctct cttgggcgtt ttaaggaaat gctagtgagt cggaggaaga ccgcagcgcc 5761 ggcagtgtgg agagcccgtc cgtctccagc acgcaccggg tgtctgatcc caagttccac 5821 cccctccatt caaagataat catcatcaag aaagggcatg ctaaagacag ccagcgctac 5881 aaagttgact acgagtctca gagcacagat acccagaact tctcctccga gtccaagcgg 5941 gagacagaat atgtgagagc ttttcctctt gttaaaggag gagggcaaga cctgccaagc 6001 ctgggtactc agagcctctt gagggcaatt cttactcaac aaaccccagc gcctggctga 6061 tgggtgggca acccctagcc cctctgtgcc ctacctctct cctctcctta cataaagaat 6121 attgaccctt ttggagaatc ttatgaggat caagctgaaa taacactctt aaaagcatat 6181 gggatgtcat aaagacctct gcagataatg aaaatattct cataaagata gttttattta 6241 cttcatcctc tatgcttgtt gacctgctat tggttccatg ccagcttctg tgccttactc 6301 tgggaagagc aaaaaggaga cagggagtga tggttagctt attcggggga ctttcgtgct 6361 acatcagaca taaggtatct gaggagcaaa ttacaggtcc cacttttggt agttgtgcag 6421 catcgtaaga tttttaaagc acacattcta gagtaaaaac tgtgactctg ttgctctggt 6481 ccttcctgat ccccagggtc cctgccgtag agaaatggaa gacacactga atcacctgaa 6541 gttcctcaat gtgctgagtc ccaggggtgt acacattccc aactgtgaca agaagggatt 6601 ttataagaaa aagcaggtga gtgaggtcct cagtgtgttt tcttcctctt ctgttgacac 6661 agaggagaaa cccatgtcac cagcgcccag gctcttgtgg ccatagctct aactctgagc 6721 ctgtgcagca ccagtgccca ggacttggtg ccagtctcag gaggtcagac caagggctgc 6781 tttgacttgt tgctctgagt gctgctatat tggccataat cctcaaccct agtgcctttc 6841 caccacccgc ttcccactcc tgtcctttca atggttcacc cacaggcgga caagatgctg 6901 cccagtggca ccctttataa actgcaagtg gacatgttaa cacatttgtt aatgctgcgt 6961 cagggagtga catttcaaac aactattata gtcagtttcc aagaagtgtg acatgaggtc 7021 ataccacaaa aaagcttacc ctgaaatccc acaatcgtcc cctttcctac tgatgccttc 7081 ccgatagtga gcaggttgca atattaagat tttgaaaagg ctgttgctag atgttggtga 7141 ctcgtgtgtc tctgtctccc ttgggctttt caaggaaatg ctagtgagtg gggggatgac 7201 tgcagcatgg ccagcttgga gagcccagcc atccccagca cataccaggt gtctgtcttg 7261 gcgtggaggg gatggaactt gaaatcagac actcggtcca tgctggggat ggccagtctc 7321 tccaaactgg catgtggtct tcctccgagt cactggcatt tccctagaaa gtccaagtga 7381 gaagaaggca tgagagtcat caacatcaaa caacagtctt ttcaaaatct ttatattgca 7441 acatagtccc attcctggaa aaggaatgga gtgagaatcc tggctacaca tcagccccaa 7501 atgtagtcat tgcctaaaat cccaattaac ctgaaaatga tcaaacaaat ttaagatata 7561 gtaatattaa gctgtaataa atatgcttct ataggctttg tgttatgtga tggcactatt 7621 tcaattggct ttctaattgg acaattgata ctatgctatc tacagaattg gcctttggag 7681 acctaagtga gccacagtgg cctcagggtg accatatact aggattcata gcagtggcca 7741 cagtcagaag cctaagcttt cctccattgc cattgctcgt ttataccacg tttctgtcaa 7801 agtcatattc attcaacaaa gtcatactga gaaggtgtca tgtgaggctg gatgtgggct 7861 ccaaagtcat agctgtgaca ttcgcaggca gcgggatgtt ctcagttcca catttggcag 7921 agaagtcagt caagaggttc tacaagggct ggtgtccacc ttatactcct agaaacacaa 7981 aactgccccc acccccgctt tcttggagca ggaagttaca cccacacgca tgcacaggcg 8041 cacactcagc gggcctaggc agcgtggctc ttgtgttgcc ttagctgaaa tttctgttgt 8101 gctttctcag catagcagag tcacgctggc aaaccatcat gcgccctggc caccgacctg 8161 acaccagacc caggagcatt cacttctctg tcttctgttt ctctcccaca gtgtcgccct 8221 tccaaaggca ggaagcgggg cttctgctgg tgtgtggata agtatgggca gcctctccca 8281 ggctacacca ccaaggggaa ggaggacgtg cactgctaca gcatgcagag caagtagacg 8341 cctgccgcaa gggtgagtac tcaggagggg cagcctgggc tccagggcct cactgtcctt 8401 ggaccagcct caggggctgg gcgtggccac tggccttccc caggcttaca gacccaggag 8461 ctgcagctca gggccagaaa gagcaaagca aataggacag agccctcaga agggtgcagg 8521 gagagggaga ccccatcaac ccaaccaaac aagtgtgggg aaggaggccg gccagtgcac 8581 ctcagggaca ctctgcttta tctcagatac ctcacagcac ctaagctatc attcatccac 8641 acacaaagtg aagattttca aagttaggct ttacccgtga gtctggaggt catttatctt 8701 cacagagaac gtttatcgca gactgctaag atacatgttc taattaagat gtgatgtgag 8761 aacgctgaat gctcgttgga gactcagttg aagtgcagct ttttttctgt caaatatata 8821 atgaatattc tgttagtctg tggctaatat aattttaata aagttaattt aaatctgata 8881 gaaaaatgaa attttaaacg ataattttag agaatgctat tatatccagt cttctttttt 8941 cttttaataa atgagggaac tattggggga aaggaataaa tacattttct ttcattttat 9001 taagacaaat ttagtaagca gaagaaattt gcatgtttag ttataagggt ttcttttttc 9061 cttacaagtt ggaaaaaata attctaattt aagggtaact ctttgacaat gaacactgtg 9121 agcagcatct ggtactcgtt gctttgtttg aaaacatgag ttgagacccc agccgcactt 9181 gcagcctagt gccattagcc tgcaggctgt gctggatatc tcagggcaag agtcgagccc 9241 ttttgatttt ggggggatta tttcaatata tttgcttttt ctttttgttt tagttaatgt 9301 ggagctcaaa tatgccttat tttgcacaaa agactgccaa ggacatgacc agcagctggc 9361 tacagcctcg atttatattt ctgtttgtgg tgaactgatt ttttttaaac caaagtttag 9421 aaagaggttt ttgaaatgcc tatggtttct ttgaatggta aacttgagca tcttttcact 9481 ttccagtagt cagcaaagag cagtttgaat tttcttgtcg cttcctatca aaatattcag 9541 agactcgagc acagcaccca gacttcatgc gcccgtggaa tgctcaccac atgttggtcg 9601 aagcggccga ccactgactt tgtgacttag gcggctgtgt tgcctatgta gagaacacgc 9661 ttcaccccca ctccccgtac agtgcgcaca ggctttatcg agaataggaa aacctttaaa 9721 ccccggtcat ccggacatcc caacgcatgc tcctggagct cacagccttc tgtggtgtca 9781 tttctgaaac aagggcgtgg atccctcaac caagaagaat gtttatgtct tcaagtgacc 9841 tgtactgctt ggggactatt ggagaaaata aggtggagtc ctacttgttt aaaaaatatg 9901 tatctaagaa tgttctaggg cactctggga acctataaag gcaggtattt cgggccctcc 9961 tcttcaggaa tcttcctgaa gacatggccc agtcgaaggc ccaggatggc ttttgctgcg 10021 gccccgtggg gtaggaggga cagagagacg ggagagtcag cctccacatt cagaggcatc 10081 acaagtaatg gcacaattct tcggatgact gcagaaaata gtgttttgta gttcaacaac 10141 tcaagacgaa gcttatttct gaggataagc tctttaaagg caaagcttta ttttcatctc 10201 tcatcttttg tcctccttag cacaatgtaa aaaagaatag taatatcaga acaggaagga 10261 ggaatggctt gctggggagc ccatccagga cactgggagc acatagagat tcacccatgt 10321 ttgttgaact tagagtcatt ctcatgcttt tctttataat tcacacatat atgcagagaa 10381 gatatgttct tgttaacatt gtatacaaca tagccccaaa tatagtaaga tctatactag 10441 ataatcctag atgaaatgtt agagatgcta tatgatacaa ctgtggccat gactgaggaa 10501 aggagctcac gcccagagac tgggctgctc tcccggaggc caaacccaag aaggtctggc 10561 aaagtcaggc tcagggagac tctgccctgc tgcagacctc ggtgtggaca cacgctgcat 10621 agagctctcc ttgaaaacag aggggtctca agacattctg cctacctatt agcttttctt 10681 tattttttta actttttggg gggaaaagta tttttgagaa gtttgtcttg caatgtattt 10741 ataaatagta aataaagttt ttaccattaa aaaaatatct ttccctttgt tattgaccat 10801 ctctgggctt tgtatcacta attattttat tttattatat aataattatt ttattaaaat 10861 gttccctgct ttccctttta gcaa // LOCUS PINCABII2 583 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pinus sylvestris cab II/2 mRNA for chlorophyll a/b-binding protein. ACCESSION M37489 X14507 KEYWORDS Cab gene; chlorophyll a/b-binding protein; thylakoid protein. SOURCE P.sylvestris cotyledones cDNA to mRNA, clone pINE ab 11. ORGANISM Pinus sylvestris Eukaryota; Plantae; Embryobionta; Pinophyta; Pinicae; Pinatae; Pinaceae. REFERENCE 1 (bases 1 to 583) AUTHORS Jansson,S. TITLE ; JOURNAL Unpublished (1989) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 583) AUTHORS Jansson,S. and Gustafsson,P. TITLE Type I and type II genes for the chlorophyll a/b-binding protein in the gymnosperm Pinus sylvestris (Scots pine): cDNA cloning and sequence analysis JOURNAL Plant Mol. Biol. 14, 287-296 (1990) STANDARD simple automatic COMMENT [1] Author address Jansson,S. Plant Physiology Umea University S-901 87 Umea Sweden FEATURES from to/span description pept < 1 455 chlorophyll a/b-binding protein BASE COUNT 135 a 133 c 171 g 144 t ORIGIN 1 cggagctgtt ggttaaaaac ggggtgaaat ttggggaagc tgtgtggttc aaggccgggg 61 cgcagatatt ctcagaggga ggccttgact acctggggaa ccccaacctg atccacgcgc 121 agagcattct agccatctgg gcctgccagg ttgttctcat gggattgatt gaaggataca 181 gagtgggagg aggacccctt ggagaagggt tggaccctct gtacccaggg gatgccttcg 241 acccactggg gctggccgac gaccccgagg ccaaggcgga gctgaaggtg aaggagatta 301 agaacggtcg gctggccatg ttctccatgt tcggtttctt cgttcaggca atcgtgaccg 361 ggaagggccc cattgaaaat ctctacgacc acttggcgga ccccgttgcc aacaatgcct 421 gggcctacgc caccaatttc gttcctggca agtgaaggtg acggaaaata aaagaggcct 481 gtgatctgtg catcaatcat ttgacagcct tagtgttaat aaaatatgtt ctttcagctg 541 tatgtatttg ttggtgatct tcgttaataa aatattttct ttc // LOCUS RATMHCIAB 1563 bp ss-mRNA ROD 09-AUG-1990 DEFINITION Rat MHC class I cell surface antigen mRNA. ACCESSION M25319 KEYWORDS antigen; cell surface antigen; class I gene; glycoprotein; histocompatibility antigen; major histocompatibility complex. SOURCE Rat cDNA to mRNA, clone pARI.5. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1563) AUTHORS Radojcic,A., Stranick,K.S., Locker,J., Kunz,H.W. and Gill,T.J.III. TITLE Nucleotide sequence of a rat class I cDNA clone JOURNAL Immunogenetics 29, 134-137 (1989) STANDARD full staff_entry COMMENT Draft entry and sequence in computer readable form for [1] kindly provided by J.J.Rushton 24-JAN-1990. FEATURES from to/span description pept < 1 1134 MHC class I antigen (AA at 1) sigp < 1 60 MHC class I antigen signal peptide site 61 330 alpha-1 domain (exon 2) site 381 606 alpha-2 domain (exon 3) site 607 882 alpha-3 domain (exon 4) site 883 1131 transmembrane and cytoplasmic domains (exons 5, 6, 7, and 8) signal 1535 1541 poly-A signal BASE COUNT 324 a 412 c 471 g 356 t ORIGIN 1 gcaccgcgca cgctgctcct gctgttggcg gccgccctgg ccccgaccca gattcacgcg 61 ggctcacact cgctgcggta tttcgacatc accgtgtccc ggcccggcct cggggagccc 121 cggttcatct ctgtcggcta cgtggacgac acggagttcg tgcgctacga cagcgacgca 181 gagaatccga gattcaagcc gcgggtccgg tggatggagc gggaggggcc ggagtattgg 241 gagcggatca cacggatcgc caaggaaagc gagcagattt accgagtggg cctgaggacc 301 ctgcgcggtt actacaacca gagcgagggc ggctctcaca ccatccagag attgtctggc 361 tgtgaggtgg ggtcggacgg gatcctcctc cgcgggtatg agcagttcgc ctacgacggc 421 cgcgattaca tcgccctgaa cgaagacctg aaaacgtggg cggcggcgga ctttgcagca 481 gggatcaccc ggaacaagtt ggagcgggat ggtgaggcag agagactcag ggcctacctt 541 gaaggcggga gcgtggagtg gctccgcaga tacttggagc tcaggaagga gacgctgctg 601 cgctcagaac ccccaaaggc acatgtgacc cttcactcca gacctgaagg tgatgtgacc 661 ctgaggtgct gggccttggg cttctaccct gctgacatat tcctgacctg gctgttgaat 721 ggggaggacc tgacccagga catggaactt gtggagacca ggcctgcagg ggatggaacc 781 ttccagaagt gggcatctgt ggtggtgcct cttgggaagg agcagaatta cacatgccat 841 gtggagcatg aggggctgcc tgagccgctc accctgagat gggagggtcc tccctccgcc 901 aactccaaca cgggaatgtc tgttattctt ggaactgtgg ccatcattgc agttatggcc 961 atcattgcag ctgtggcctt cattggacct gttgtgagga agaggtggat aaaaacagct 1021 tttcttctca caagtggaaa aggaggagac tacacccctg ctccaggcag ggacagctcc 1081 cagagctctg atgtgtctct cccagattgt aaagccatga agacagctgc ttgaggtgaa 1141 ctggatgccg gccgatgtgt tcaggtctct cttgtgacat ccggagccct cggttctctt 1201 tggacaccga tgcctgggat tccctatgat cctatgactt cggtataggg gactatggga 1261 cccggcccaa ccctacacac cgggacccta tccctgcact gtttgtgttt cctttcacag 1321 ccaaccttgc tggttcagcc tgggttgggg cctggacatc tgcatcctat cactcagtgg 1381 tgctttgaac tgcaactcct cacttctaca ctgagaataa gaatctgagt gtgaacttga 1441 ctgttcacat ccttgacaca gtgttgactg ctttttaaat tactggattg agaatactta 1501 gaggttgttt tttgtttttg ttttgttttg ttttaaataa atggcaggtg gagaagcttc 1561 cag // LOCUS HUMINT01 42 bp ss-mRNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion glycoprotein p150,95 mRNA, exon 1. ACCESSION M29165 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 1 of 7 SOURCE Human cell line HL-60, cDNA to mRNA, clone lambda-X47. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 42) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE cDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD simple automatic FEATURES from to/span description mRNA < 1 > 42 P150,95 mRNA, exon 1 BASE COUNT 9 a 15 c 10 g 8 t ORIGIN 1 bp upstream of EcoRI site; chromosome 16p11-13.1. 1 gaattcctgc cactcttcct gcaacggccc aggagctcag ag // LOCUS HUMINT02 3690 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 2 - 6. ACCESSION M29482 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 2 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3690, exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3690; exons and intron/exon boundaries only) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 3690; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12752 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 3690; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept 1028 1064 integrin alpha subunit precursor, exon 2 (first expressed exon) 1666 1771 integrin alpha subunit precursor, exon 3 2391 2494 integrin alpha subunit precursor, exon 4 2795 2865 integrin alpha subunit precursor, exon 5 3020 + 3131 integrin alpha subunit precursor, exon 6 sigp 1028 1064 integrin alpha subunit signal peptide 1666 1685 integrin alpha subunit signal peptide matp 1686 1771 integrin alpha subunit 2391 2494 integrin alpha subunit 2795 2865 integrin alpha subunit 3020 + 3131 integrin alpha subunit pre-msg < 1 > 3690 P150,95 mRNA and introns IVS < 1 975 P150,95 intron A IVS 1065 1665 P150,95 intron B IVS 1772 2390 P150,95 intron C IVS 2495 2794 P150,95 intron D IVS 2866 3019 P150,95 intron E IVS 3132 > 3690 P150,95 intron F BASE COUNT 764 a 1050 c 1020 g 856 t ORIGIN 1 bp upstream of BamHI site; chromosome 16p11-13.1. 1 ggatcccttg ggcccaggag ttcgaagcag cagtgaacta tgcacccact gcactccagc 61 ctgggtggca gagcaagacc ctgtttctga aattaaaaaa aaaaattgat gtacattagg 121 gggcttccac ggcctgagct gcttcccctt gctttcctcc cagtggccct gaccttgtct 181 cttacaactt cccaccctga ctgtctggtt acccattgct gatttcacac acagaccctc 241 ctgtaccctg cctcatccat gtctggctgc tctgtcatct cccaactttg gttgctttca 301 atgctcagct caagcaccac ctctttcagg aagccttctc agaaagccac accttcacaa 361 cccgggtgag gcaccctgtg gtctctgtgc ttccccctca cagcaatgaa cttgctgttt 421 atacatctgc ctctccactg accccagggc tggtgctttg tggtttatat tttcttcccc 481 acctagcaga gggcttgcat ctccaggctc aaattaggct tcttgaataa atgatgaata 541 aatgagtgaa tgaatgaatg aacaaatact cgctctgtgc tcctcctagg gacccggatc 601 ccccactcct tggcccagac tttccaggtc agagtggagg cctcccacca gggtttcctt 661 taggggtcct gaggggtggg catctgccca aaccccctcc agtctggctg aaatttcaag 721 gtcaaggggt ccttctggca gtcaagggtg agcctgggag gggcagggca gggatttgca 781 tccatctaag caaagggcat caagccaagt catctgatga gagtgactcc ggttgggggg 841 tgggggcgtg tgggagccga gcctgtcctc ggatcagttg cgtactctgc ccgccccctc 901 tgactcatgc tgacaatctt cttccttccc ctggccacct ctctgcccac ttgcttcctc 961 agtaccttgg tccagctctt cctgcaacgg cccaggagct cagagctcca catctgacct 1021 tctagtcatg accaggacca gggcagcact cctcctgttc acaggtgagc ctggacccca 1081 atgaagtagg gctggggacc caggcccaag ggagccaggg ccctgaactg ggggctcagg 1141 ctggggggtt aggatctggg taggaagaga gactcagtca agcctgaggg ggaggcaggc 1201 acatagggtt tgagatttgg agtttgtgga gggagaggat attgatgaac caattttggg 1261 agagttccag agatgctgga agagaggcca gttgtctctg tactgcagag atttttaaaa 1321 taggcagaat gcgccaactt gtgctctgtg gacaggatgc tttggtccgc aagttttcct 1381 ggacgcactc tcatagcgcc cgaggtgcac gttggggaaa gatccttttt agagcctggg 1441 tactgctctg cagaaatgga gaactgcaac tcgatagtgg atggtgggca aggggcatcc 1501 ctggaccctg ggaaggagag aaggggatga gttgggtgtc cagaagaccc aggcaccccg 1561 ggcatcaggc tcggagggga gattgggacg ctggggccgg gggtggaggg cagccaggca 1621 gaaggaagac ccttctccaa agctctcttc ccacctcttt cccagcctta gcaacttctc 1681 taggtttcaa cttggacaca gaggagctga cagccttccg tgtggacagc gctgggtttg 1741 gagacagcgt ggtccagtat gccaactcct ggtgaggccc aggtggtgct cctttggctc 1801 catccatcct ctccctgctc aggccccatc cccccggccc tgccctgtta tttgcaaact 1861 ctcctctctg tctggtgtag cgactgccct ggctaatgaa gatttgcctt gaaggcaggc 1921 acggtctcac agctaacatt tacagagcag taagtgcagt gccaggctca tcacaggtgg 1981 atgctgattt agtccacacg acagcctgtg agtaggaatc agtcgtgcaa caaacactta 2041 tttgtttttt ctttcttttt ttctatacat ttaaaaatat atagagacag ggtctcacta 2101 tgttgcctgg gttggtctca aactcctggg ctcaagcaat cctcccgcct cagcctccca 2161 aagtgctggg attccaggtg tgagccacca cacccagact caacaaatat ttcttgtctc 2221 catacgccag agaatccaac agacagaaat cccttccaca tggactttaa attattaaaa 2281 tccatcttgc agatgaggaa gctgaggctc agggagggaa cgcaaacttg ccggagtggc 2341 agctgtcggc gtccacactc ttacctaaag tgttctttgt ctcctcgcag ggtggtggtt 2401 ggagcccccc aaaagataac agctgccaac caaacgggtg gcctctacca gtgtggctac 2461 agcactggtg cctgtgagcc catcggcctg cagggtgagt caccgcccct cccgggaccc 2521 agggccgggc tcccaggctt ccctgctcca ggggcccgtg gactcccgga gtgtcacttt 2581 cagcttccct gtgtctgaga ccctcaccct cagatatgct tcctggcccc ttaaggcctc 2641 cccgcccatc gcactcccgc agctctgtca agacccgaca gcttccttca ccgtcagacc 2701 tccttgtctc ccaggtggag gtgacccctg cccagctctt ccacagcctt ctctgtaggg 2761 cccgagagtg accatgcaca tatctgtccc acagtgcccc cggaggccgt gaacatgtcc 2821 ctgggcctgt ccctggcgtc taccaccagc ccttcccagc tgctggtgag tggccctggg 2881 tcacaggagg cttctgaggg agggagggag gagccggggc cgccgggggc tgggactctc 2941 ctgtagggtg gaggttccgg catctgaggg tgggaggtac atgccaggga gtgcccccag 3001 cagcccgctg tgtccccagg cctgcggccc caccgtgcac cacgagtgcg ggaggaacat 3061 gtacctcacc ggactctgct tcctcctggg ccccacccag ctcacccaga ggctcccggt 3121 gtccaggcag ggtgagtgtc gggaccacca aggctttgag gagctcacgc acatccaatt 3181 gggggtgcgg tgggctagag acagtcttgc cagagtggat cagaaagaag ggatctggaa 3241 aaagagttac ctcgtgttgc agtggttcct gacgctgctg cccgcacatc ctgccgatcg 3301 ccgcacgctg ccggaccttt cctgtgacct taacctctcc aagcctcagt ttcttcatct 3361 gttggatggg gataataaca cacccagcac tgaaagcaac acaggatgat tcatggccag 3421 gggttagcac agcagctagc accaggcgac acccatgccg gccagctgtt gttattttta 3481 gaggagagga ctattttcat ccaatgggtc ctgggatatg accaattggt ttgtgccgta 3541 gtttaggaaa ggtcagtgaa agtgcagtgt gagcaacgtg tgtgtgtaca tgtgtgtata 3601 tgtatgcatg tgtatacatg tgcacatgca catgtacatg catgtgtgtg catgtatgtg 3661 tgtgtgtgca tgtgcatgca ggttgagacg // LOCUS HUMINT03 4863 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 7 - 15. ACCESSION M29483 Y00093 KEYWORDS integrin; protein p150,95. SEGMENT 3 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 294 to 3967; exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 4863) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 4863; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 4863; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 294 424 integrin alpha subunit precursor, exon 7 511 656 integrin alpha subunit precursor, exon 8 901 1054 integrin alpha subunit precursor, exon 9 1659 1809 integrin alpha subunit precursor, exon 10 2428 2501 integrin alpha subunit precursor, exon 11 2665 2794 integrin alpha subunit precursor, exon 12 3207 3349 integrin alpha subunit precursor, exon 13 3531 3671 integrin alpha subunit precursor, exon 14 3760 + 3969 integrin alpha subunit precursor, exon 15 matp + 294 424 integrin alpha subunit 511 656 integrin alpha subunit 901 1054 integrin alpha subunit 1659 1809 integrin alpha subunit 2428 2501 integrin alpha subunit 2665 2794 integrin alpha subunit 3207 3349 integrin alpha subunit 3531 3671 integrin alpha subunit 3760 + 3969 integrin alpha subunit pre-msg < 1 > 4861 P150,95 mRNA and introns IVS < 1 293 P150,95 intron F IVS 425 510 P150,95 intron G IVS 657 900 P150,95 intron H IVS 1055 1658 P150,95 intron I IVS 1810 2427 P150,95 intron J IVS 2502 2664 P150,95 intron K IVS 2795 3206 P150,95 intron L IVS 3350 3530 P150,95 intron M IVS 3672 3759 P150,95 intron N IVS 3970 > 4863 P150,95 intron O BASE COUNT 947 a 1358 c 1460 g 1097 t 1 others ORIGIN Chromosome 16p11-13.1. 1 acctgtgatc gccccctcgc ctcccaaagt actgggatta cacggtgagc caccacgcct 61 ggctcaatca cagcctcttt aggcaacttt aagagaatga agggccttgt tccaggcaag 121 gggttaggga acgtctgccc ctgatgagga gaggacccag ggtgtggagc ctgactccca 181 tcgccagact aggggcttag ggaggaaggg ttttggagag tgagctcttg caggagccac 241 ggtcctggac tccaggagtg tcacttggag gacggtgcca cctccttccc cagagtgccc 301 aagacaggag caggacattg tgttcctgat cgatggctca ggcagcatct cctcccgcaa 361 ctttgccacg atgatgaact tcgtgagagc tgtgataagc cagttccaga gacccagcac 421 ccaggtgtgc tttgggggag ggaggctgct gggggtgggt gcttggatcc tggtgatagg 481 cctcagccca gccctgtgtg cttctcccag ttttccctga tgcagttctc caacaaattc 541 caaacacact tgactttcga ggaattcagg cgcacgtcaa accccctcag cctgttggct 601 tctgttcacc agctgcaagg gtttacatac acggccaccg ccatccaaaa tgtcgtgtga 661 gtcctgattt cttccaggca cagtcccaaa gcacccaggt cttcccttgg cctcatctga 721 tctccacgag aaggggacag gcagggacca aaatccagcc cgtgataccc ttgccaagct 781 ggggcctctg ggtgggactg gggcctccca aaggaaaagg catcttctaa ttttcacaag 841 ggcaccaggg gctagtgtgg tttggttcac aggcctctaa gacctctcct ttcctgatag 901 gcaccgattg ttccatgcct catatggggc ccgtagggat gccaccaaaa ttctcattgt 961 catcactgat gggaagaaag aaggcgacac gctggattat aaggatgtca tccccatggc 1021 tgatgcagca ggcatcatcc gctatgcaat tggggtaggc ctgggatggc ttcccacttc 1081 tcccacggct tcctctcagg gcaactcccc tttctgtgta tgatgttctt ttctctttga 1141 gacagggtct tgctctatca cccaggaagt ggtgcaatcc tagctcactg cagccttgaa 1201 ctcctgggct ccagtgatcc tcccaccccg cctcccagta gtcgggacca caggtgtgtg 1261 ccatcaagcc tggctatttt ctttttggtt gagatggggt cttgctatgt tgcccaggct 1321 ggtctcaaat tcctggcctt aagcaattct gccaccttgg tctcccaaag gcacagggga 1381 ttacaggcgt gaaccaccgc caacaacatc cctttcaagg atagaaacac cagctctctc 1441 ggctcttact gccttaagga tgaaaactct gccccagact ggagaccatg atgatccttt 1501 ctcctaaact ccctgatgct gtccgggctt cgtgtttctc ctgtgtccac cgggtgtgat 1561 catgttgatc ttgtggggtt attggaagat gttgcaccca gtgcacacag gcacatttga 1621 tttattattt ttactgagtt gatcttttct ggggacaggt tggattagct tttcaaaaca 1681 gaaattcttg gaaagaatta aatgacattg catcgaagcc ctcccaggaa cacatattta 1741 aagtggagga ctttgatgct ctgaaagata ttcaaaccca actgagggag aagatctttc 1801 ccattgaggg tgagtctgaa gggagctctt cgcttgggga atcctcagcc gttaacacct 1861 ttccacttag aacccgaggc tccgtgaaac aggtagacag cgtctcggtt ctcctgcttt 1921 cccgggaccc cgatagccat gtctgtcagc ttgtccccac tgacgtcccc cagcactgtc 1981 agagctgccc caaagtggcc ccagggatgg ccctgctccc cacagagagt gatctcacac 2041 caccaccggc tccactgcag aacaaaagca gtccaggccc aacccaggag acccttccac 2101 ccacaccggg ccctacccag cccacatccc accagccact cactcccctg ggcaaggggc 2161 acacggacac ctggccccct cggtctgctt gtagacctgt ggggggccct gatgaggacc 2221 agatcggtgc tgccatcgct gtccacatcc atggagcaga ggggggcccc gaagtcggag 2281 ctgatctgga ggcagagcct ggtccctgtc acaggcacca gctctccctg tagcctccag 2341 tcttagcttc tcctaaagct gaagtgttct tggacctggc aaagcccgtc tccctccctg 2401 gcactcaagc gtcatgcctt accccaggta cggagaccac aagcagtagc tccttcgaat 2461 tggagatggc acaggagggc ttcagcgctg tgttcacacc tgtgcgtggg gccccttagg 2521 ccgatgatgt gccgtgaggg gagggggggc agggaaggcc agggtgggtg tcaggtgggt 2581 aagaggcgca aggcggaagg catatctctg gtcatgctgt cttcctgctc tcggctctgc 2641 tcagccctgg aatcctttct ccaggatggc cccgttctgg gggctgtggg gagcttcacc 2701 tggtctggag gtgccttcct gtacccccca aatatgagcc ctaccttcat caacatgtct 2761 caggagaatg tggacatgag ggactcttac ctgggtgaga aacagccagg ggttggggac 2821 aggtgggaga tgcactgccc agggtggggt ccagggttct ggggaagggg taggggnatg 2881 ggggctgtgc tgcccagtgt ggggcccagc ttctggggag ggaggatggg cactgtgctg 2941 cccggggtgg gttccagggt tctggggagg gggaatgggg gctgtgctgc ctggggtggg 3001 aatccagggt tctggggaga ggggatgggc gctgtgctgc ctggggtggg ttccagggtt 3061 ctggggagag aggatggggg ctgcattgcc cagggtgggg tccagggttc tggggagggg 3121 agatggtgct gtgctgcccg gggtgggaat ccagggttct ggggaggggg aatgggggcc 3181 tttgtgctga ggcctgggcc cctcaggtta ctccaccgag ctggccctct ggaaaggggt 3241 gcagagcctg gtcctggggg ccccccgcta ccagcacacc gggaaggctg tcatcttcac 3301 ccaggtgtcc aggcaatgga ggatgaaggc cgaagtcacg gggactcagg ttgggcgtga 3361 caggagccac aggccgggaa ttcagggtag gggaggtggc tgggcagaga agaggatgga 3421 ggggctttga gggccttggg ggaggtcctg gtacctgggg agaggtggga cctggcccac 3481 agggctgcct ctggcaggga caggcagcat gacccagctc tgcccttcag atcggctcct 3541 acttcgggcc ctccctctgc tccgtggacg tagacagcga cggcagcacc gacctggtcc 3601 tcatcgggcc cccccattac tacgagcaga cccgaggggc ccaggtgtct gtgtgtccct 3661 tgcccagggg ggtgagtggc tgatgggcct ggtgtgtgtg gggtctggtg tgggtgaggg 3721 gttgcccggg ttgggcctgg cactgttttt tttctgcagt ggagaaggtg gtggtgtgat 3781 gctgttctct acggggagca gggccacccc tggggtcgct ttggggcggc tctgacagtg 3841 ctgggggatg tgaatgggga caagctgaca gacgtggtca tcggggcccc aggagaggag 3901 gagaaccggg gtgctgtcta cctgtttcac ggagtcttgg gacccagcat cagcccctcc 3961 cacagccagg tgaggccgtg tcccatttct gtcactagag cagcctgctt cttgcctctc 4021 ccactctgtc atactggaaa actgtccctt tttacctttt cctacctccc ttgcccagct 4081 ctgagcacct tgtagcagtg gcgtggtctc agctcactgc aacctccgcc tcccaggttc 4141 aagcgattct ctctgcctca gcctccagag tagctgggat tacaggcatg caccaccatg 4201 tccggttatt ttttgtattt tagtagagac acgtttcgcc atgttggcta ggctggtctt 4261 gaactcctga cctcaggtga tctgcctgtc tcggcctccc aaagtgctgg gattataggc 4321 gtgagccgcc atgcccaggc ccctgccagt tttacaaggt acacaggtca ggcacagaaa 4381 acccatttta cagatggaat ctgggacact aggaagacaa gggccttggt ttgttggagg 4441 ttcagagtgg gtccgagatg gtgaaggaac tccggcctcc tgacctctaa cccggtgtgc 4501 agtctcccgg ctccctgctg ctcaccactt aggtccagtc atttcaacct ccctccacct 4561 gcccctctcc tccctggatg ctacatgatt ttattccctt cctgccatca aggtcccacc 4621 aaatgcccat ccctgcagcc tccctccacc ccaagggtag cagggttccc tgagaacgaa 4681 gggctgcctt tcttggcaaa agtcaagaaa gctctgttaa aaaataggca aagggcctgc 4741 tccctggtgg ctcacatctg taattccgac actttgggag gctgaggcag gaggatcact 4801 tgaggccagg agtttcaggc cagctgggca acataggggg accccatctc tagaaaaaat 4861 ttt // LOCUS HUMINT04 2746 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 16 - 21. ACCESSION M29484 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 4 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 382 to 2672; exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2746) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 2746; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 2746; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 382 512 integrin alpha subunit precursor, exon 16 628 791 integrin alpha subunit precursor, exon 17 924 1078 integrin alpha subunit precursor, exon 18 1656 1787 integrin alpha subunit precursor, exon 19 1875 1948 integrin alpha subunit precursor, exon 20 2531 + 2672 integrin alpha subunit matp + 382 512 integrin alpha subunit 628 791 integrin alpha subunit 924 1078 integrin alpha subunit 1656 1787 integrin alpha subunit 1875 1948 integrin alpha subunit 2531 + 2672 integrin alpha subunit pre-msg < 1 > 2746 P150,95 mRNA and introns IVS < 1 381 P150,95 intron O IVS 513 627 P150,95 intron P (no splice consensus) IVS 792 923 P150,95 intron Q IVS 1079 1655 P150,95 intron R IVS 1788 1874 P150,95 intron S IVS 1949 2530 P150,95 intron T IVS 2673 > 2746 P150,95 intron U BASE COUNT 602 a 821 c 746 g 577 t ORIGIN Chromosome 16p11-13.1. 1 gaattcctat cctgagcatg gctaaactct gagctaatag tatcattata gaaagatgag 61 gaaacggagg cacagacaga ttgagtcctt gcccacggcc tcgtggctca tacgtggagg 121 agtcagaatt ggaactagag actgatcgaa tgaatgacac tcgggtcacc aggacacctt 181 cctatctcca ctcttacatc tgtttcttag caatcatctc ccaactccta cctcctcttt 241 tcaggttctt cttggtgaca tctgttacaa ctcacccctt ctctcccttt ccgatggtcc 301 tacctccata ttccccttgt tacttatttc caacttcttc cctagtttcc atcttgattc 361 acccttctct cctctggcca gcggatcgcg ggctcccagc tctcctccag gctgcagtat 421 tttgggcagg cactgagcgg gggtcaagac ctcacccagg atggactggt ggacctggct 481 gtgggggccc ggggccaggt gctcctgctc aggtgagagc agactttctc agaggctccc 541 catgtggtcc taggttcaga tgggggtgcc cacccacgtg gtgctcccac cagcgacggc 601 tgtcctcagc tcggtgctct gcccgcagac cagacctgtg ctctgggtgg gggtgagcat 661 gcagttcata cctgccgaga tccccaggtc tgcgtttgag tgtcgggagc aggtggtctc 721 tgagcagacc ctggtacagt ccaacatctg cctttacatt gacaaacgtt ctaagaacct 781 gcttgggagc cgtgagtccc ctcccctcca acccaggaca ccctgacctc tggagtcccc 841 catcccaggc ccctgtctcc caccctgctc attgtccacc caaggagttc ctgtctcaac 901 gccgtccctg cgaccgccta caggtgacct ccaaagctct gtgaccttgg acctggccct 961 cgaccctggc cgcctgagtc cccgtgccac cttccaggaa acaaagaacc ggagtctgag 1021 ccgagtccga gtcctcgggc tgaaggcaca ctgtgaaaac ttcaacctgc tgctcccggt 1081 gcgtctgggc atgaacgtgg gtggcggccg cgctggggct ggcagaaggc agggcaggga 1141 gagaacaggc tgtgttccgg cctccctgtg gctcagccca gcacaggacc agccatgcag 1201 gacgtgctta ctgcacgtta gccagtgagt gagtgagcga gcaaacaagt gatgagatcg 1261 tctgcaattt ccagggccac acgattggat ttcaggaaag agaattgggc aacctgagag 1321 agctctgggc ttaccttctg gcttttcagg cattcactga cagggttatc gagctgctcc 1381 tggagacagc cttgcctggg ccatgggcat aggtggccaa aacagtcatt gctgatcggg 1441 aggtctgggg gggggaggaa aaaaacaaag acaaacaagg ggagaggaca gagagggtgt 1501 cagggaggca tcctgaaggc ggtgacgctg agcaggctct ggaggaagtg aagcagagcg 1561 ggagctgggc agaggcagga taagaactgc ggatgaggcc gagcgcagct cttaccctcc 1621 ccttaccctc gctccccgcg acgcccgtcc cccagagctg cgtggaggac tctgtgaccc 1681 ccattacctt gcgtctgaac ttcacgctgg tgggcaagcc cctccttgcc ttcagaaacc 1741 tgcggcctat gctggccgcc gatgctcaga gatacttcac ggcctccgtg agtcctggca 1801 ctgggtctcc cagagagggt gcacagcgtg gggcctgggt ctcggagaaa accccccgtt 1861 gccttcccac gcagctaccc tttgagaaga actgtggagc cgaccatatc tgccaggaca 1921 atctcggcat ctccttcagc ttcccagggt gagcgcccca ccttagatgc cctactgccc 1981 cagcctcctt cctggaatct gggactcctg cctctgctct ccctaacatt gtctcatcct 2041 atagtcaaaa cccaggtgtc ttggctgggc acagtggctc actcctgtaa tccagcactt 2101 tgggaggccg aggtgggagg acttttgagg ccaggagtta gggttacgac ctgggcaaca 2161 gagcgacacc catttccaca aaaacaaaac aacaacaaca acaacaacaa caacaacaac 2221 aacaacatca cttgagtgtg gtagagcatg cctatagtcc cagctacttg ggaggctgaa 2281 gcttaaggct tgcttgagct ctggagttgg aggtctgcag tgagccataa tcacaccact 2341 gcactccagc ctgggtgaaa gagcaggact ctgtctctta aaaaaaaaga agaagaagaa 2401 gaagaagaag aagaacccag gggtccgtcc cctgtctatc tcccaaatcc ccacccaccc 2461 cattttatcc cagaccattt ctagcctcag tcacagaatc atcttatcct ttccttcacc 2521 tgatacccag cttgaagtcc ctgctggtgg ggagtaacct ggagctgaac gcagaagtga 2581 tggtgtggaa tgacggggaa gactcctacg gaaccaccat caccttctcc caccccgcag 2641 gactgtccta ccgctacgtg gcagagggcc aggtgcacct ctggggaagg aggaggaggc 2701 agggctgggc gttagcgtag attcccgtgc gggttcagaa cccggg // LOCUS HUMINT05 1006 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 22 - 24. ACCESSION M29485 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 5 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 202 to 665) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1006) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 1006; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 1006; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 203 319 integrin alpha subunit precursor, exon 22 404 483 integrin alpha subunit precursor, exon 23 582 + 665 integrin alpha subunit precursor, exon 24 matp + 203 319 integrin alpha subunit 404 483 integrin alpha subunit 582 + 665 integrin alpha subunit pre-msg < 1 > 1006 P150,95 mRNA and introns IVS < 1 202 P150,95 intron U IVS 320 403 P150,95 intron V IVS 484 581 P150,95 intron W IVS 666 > 1006 P150,95 intron X BASE COUNT 228 a 286 c 229 g 263 t ORIGIN Chromosome 16p11-13.1. 1 ttctatcctg gtgacagagt gagacctggt ctcaaaacaa acaaacaaac aaaatataag 61 cttaaggtgg gctccaggaa gctttatcac tacttcgtgg cgtgtctttg gaatgctgtt 121 atattaggtt ggtgcaaaag taattgggtt tttgccattg ctttcaattt caactaatac 181 tcctctactt tctcatgcct agaaacaagg gcagctgcgt tccctgcacc tgacatgtga 241 cagcgcccca gttgggagcc agggcacctg gagcaccagc tgcagaatca accacctcat 301 cttccgtggc ggcgcccagg tcagcctggc ttctgtcccc tcactgctcc cctgccccac 361 cctgtcttta ctgctctgtg acctctcagt tccttttcct cagatcacct tcttggctac 421 ctttgacgtc tcccccaagg ctgtcctggg agaccggctg cttctgacag ccaatgtgag 481 caggtgagcc gggccatggc caggggcagt gcctcatctc cagcctcaca ccccattctc 541 ctctggggcc tctggcaact gagtctctcc tctttctcca gtgagaacaa cactcccagg 601 accagcaaga ccaccttcca gctggagctc ccggtgaagt atgctgtcta cactgtggtt 661 agcaggtcac aggtacccac tgcaggaaaa agggttcttc tctctgaccc tcaaaaagaa 721 aaaaaaaaaa aaggccttga aacgctgcca cagagggtga gataaggtgt ttgaaagtaa 781 aaggtcaggt gtttcagaag acaccttcct tcagccaatg ccttcctcga atttgctgtg 841 tgccaggcag ggtgctgtgg ttattttcca tacattcatt tgacattcat tgaagattta 901 ctgagccccc attatgtgtg atcaaaccag acatgaaccc tcgccttgtg ggtgtgcctt 961 gctggatgtc tcctgtgttc cactctcact gcactgcatg ctgagt // LOCUS HUMINT06 1904 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exons 25 - 30. ACCESSION M29486 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 6 of 7 SOURCE Human DNA, (library pWE15), clone 30.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 86 to 1528; exons only) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1904) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 1904; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 1904; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 86 164 integrin alpha subunit precursor, exon 25 280 387 integrin alpha subunit precursor, exon 26 505 588 integrin alpha subunit precursor, exon 27 788 901 integrin alpha subunit precursor, exon 28 1044 1145 integrin alpha subunit precursor, exon 29 1417 + 1527 integrin alpha subunit precursor, exon 30 matp + 86 164 integrin alpha subunit 280 387 integrin alpha subunit 505 588 integrin alpha subunit 788 901 integrin alpha subunit 1044 1145 integrin alpha subunit 1417 + 1527 integrin alpha subunit pre-msg < 1 > 1904 P150,95 mRNA and introns IVS < 1 85 P150,95 intron X IVS 165 279 P150,95 intron Y IVS 388 504 P150,95 intron Z IVS 589 787 P150,95 intron AA IVS 902 1043 P150,95 intron AB IVS 1146 1416 P150,95 intron AC IVS 1528 > 1903 P150,95 intron AD BASE COUNT 381 a 545 c 525 g 453 t ORIGIN Chromosome 16p11-13.1. 1 accacctgtc ctctcatgct ctagccaatg ccttctgcag atgcccatgg tagttcacat 61 ccacttatgc gtcttctctc tccagccacg aacaattcac caaatacctc aacttctcag 121 agtctgagga gaaggaaagc catgtggcca tgcacagata ccaggtcagg tggtggtgta 181 cgcaggaaga ccttgggcat ggggtgggag gctgggtagc cggagactgg ggagggattt 241 ggctttggcg tggctctgcc ctcagtgccc tctgtgcagg tcaataacct gggacagagg 301 gacctgcctg tcagcatcaa cttctgggtg cctgtggagc tgaaccagga ggctgtgtgg 361 atggatgtgg aggtctccct cccccaggta cccaaggact gcatgtggct cctccacgaa 421 tgccctttct acctggattc cttgtgcccc atgtgggtcc ctgatgtccc agctgagaca 481 cttgttctct gcattttccc ccagaaccca tcccttcggt gctcctcaga gaaaatcgcg 541 ggcccagcat ctgacttcct ggcgcacatt cagaagaatc ccgtgctggt gaggagggct 601 ctgggtctgg ccctcactgt aggcccacat cagaggaatt taacccagga gttcatgttc 661 catatccatc ctgctgaagt accctcttgc attcggatat ggccgctgcc ctcaagtcac 721 acgcataatg ctgcctccca ccttcacact catctttctc agccccatgc tatttatctg 781 cccccaggac tgctccattg ctggctgcct gcggttccgc tgtgacgtcc cctccttcag 841 cgtccaggag gagctggatt tcaccctgaa gggcaacctc agctttggct gggtccgcca 901 ggtgtgtggg tgcaacgaca gagcccctgc cccagactca ggcgggacct ggcatgtctg 961 tgcccatctg caagccaggg cacccccaga gctctgagcc tcccccagag ccagttcaac 1021 aggtttcccc cacccctttg cagatattgc agaagaaggt gtcggtcgtg agtgtggctg 1081 aaattacgtt cgacacatcc gtgtactccc agcttccagg acaggaggca tttatgagag 1141 ctcaggtaga gaccatgtgg agggcagcga ccaggcagga aagagggtcc caagggctac 1201 atctgtggtg ctgggtgggg ggtttgcaag ccttggggga ggagggtgaa ggcctctggg 1261 caggatagct gtccctaagg gcacgggtgc tgctgtgtct cacctcttgg agcagggcct 1321 ggggaaggag gggagggagt taaaggttgg ggagcctggg aggagtctgg gatagtagga 1381 ggatgggagt ctctgacagg gtcacttcca cttcagacga caacggtgct ggagaagtac 1441 aaggtccaca accccacccc cctgatcgta ggcagctcca ttgggggtct gttgctgctg 1501 gcactcatca cagcggtact gtacaaagtg agtgttttat gccacccttg acaccaccag 1561 catctggtcc cgctcttttt gcagagtgag aaggagctca ctttgaaggc agaggcacat 1621 tcttactggg tcacttcata tgagaaactg cttcccacct gcaatgtcac cgtgccccag 1681 tggccccctg ctttgtgatt cccaggcttc ctctaatatt tctccctttc tttcctgctc 1741 ttctccatca ttctacgtgt tcctgacagc agattatcat ataaaagcac agacctgggt 1801 tgaatgcgac atcaccacgg gttcttttgt cttgaccata ggccagtgtc tgctccactc 1861 tgggccttga tttccatgtg aggtgatatc acccagctca taga // LOCUS HUMINT07 653 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human leukocyte adhesion protein p150,95 alpha subunit gene, exon 31. ACCESSION M29487 Y00093 KEYWORDS integrin; leukocyte adhesion glycoprotein; protein p150,95. SEGMENT 7 of 7 SOURCE Human DNA, (library pWE15), clone 30.1, and cell line HL-60, cDNA to mRNA, clone lambda-X47. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 44 to 149) AUTHORS Corbi,A.L., Miller,L.J., O'Connor,K., Larson,R.S. and Springer,T.A. TITLE CDNA cloning and complete primary structure of the alpha subunit of a leukocyte adhesion glycoprotein JOURNAL EMBO J. 6, 4023-4028 (1987) STANDARD simple automatic REFERENCE 2 (bases 1 to 653; revises [1]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 2782-2788 (1990) STANDARD full staff_entry REFERENCE 3 (bases 1 to 653; exons and intron/exon boundaries) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. TITLE Genomic structure of an integrin alpha subunit, the leukocyte p150,95 molecule JOURNAL J. Biol. Chem. 265, 12750-12751 (1990) STANDARD full staff_entry REFERENCE 4 (bases 1 to 653; exons and intron/exon boundaries; revises [3]) AUTHORS Corbi,A.L., Garcia-Aguilar,J. and Springer,T.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by J.Garcia-Aguilar, 19-OCT-1989. FEATURES from to/span description pept + 44 148 integrin alpha subunit precursor, exon 31 matp + 44 145 integrin alpha subunit pre-msg < 1 > 149 P150,95 mRNA and introns IVS < 1 85 P150,95 intron AC BASE COUNT 126 a 195 c 150 g 182 t ORIGIN Chromosome 16p11-13.1. 1 actgaatggg cttcctgagt ttcttcttcg tcctcccccc taggttggct tcttcaagcg 61 tcagtacaag gaaatgatgg aggaggcaaa tggacaaatt gccccagaaa acgggacaca 121 gacccccagc ccgcccagtg agaaatgatc cctctttgcc ttggacttct tctcccgcga 181 ttttccccac ttacttaccc tcacctgtca ggctgacggg gaggaaccac tgcaccaccg 241 agagaggctg ggatgggcct gcttcctgtc tttgggagaa aacgtcttgc ttgggaaggg 301 gcctttgtct tgtcaaggtt ccaactggaa acccttagga cagggtccct gctgtgttcc 361 ccaaaaggac ttgacttgca atttctacct agaaatacat ggacaatacc cccaggcctc 421 agtctccctt ctcccatgag gcacgaatga tctttctttc ctttcctttt tttttttttt 481 cttttctttt tttttttttt tgagacggag tctcgctctg tcacccaggc tggagtgcaa 541 tggcgtgatc tcggctcgct gcaacctccg cctcccgggt tcaagtaatt ctgctgtctc 601 agcctcctgc gtagctggga ctacaggcac acgccacctc gcccggcccg atc // LOCUS PEAHSP177A 772 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pisum sativum 17.7 kDa heat shock protein (hsp17.7) mRNA, complete cds. ACCESSION M33901 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 772) AUTHORS Lauzon,L.M., Helm,K. and Vierling,E. TITLE A cDNA clone from Pisum sativum encoding a low molecular weight heat shock protein JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 65 523 17.7 kDa heat shock protein (hsp17.7) BASE COUNT 254 a 127 c 170 g 221 t ORIGIN 1 caaaaatcaa aacgtgcgac aaacacaaaa tcatcccaca aagaaagcaa tggatttcag 61 gctaatggat ttggattctc cactcttcaa cactctccat catataatgg acctcaccga 121 cgacacaacc gagaagaact taaacgctcc aactcgaaca tatgtccgtg acgcaaaggc 181 aatggctgca actccagcgg acgtgaaaga gcatccaaat tcatacgtgt ttatggtgga 241 catgcctggg gtgaaatctg gtgacataaa ggttcaggtg gaagatgaga atgtgctatt 301 gataagtggc gagaggaaga gagaagaaga gaaagaaggt gttaaatatt tgaagatgga 361 aagaaggatt ggtaagttga tgaggaaatt tgtgttacct gagaatgcga atattgaagc 421 tatctctgct atttctcaag atggtgttct tacggttaca gttaataaat tgcctccacc 481 tgaacctaag aaaccaaaaa ctattcaagt taaggttgct tgatcggtgt acgatttcat 541 gtcaacaaat cagaaggaat gtttgtcttt ttagttggtt tgtgtagcaa tggttttgtg 601 tgttttcgcc tagttggccc tatatatgat gatcatcatg cgatgtaatt tgtaacaata 661 tgacatgaat gaattttaat tacttggttt ttctgcttgt aacattgttg cgttgccccc 721 atgataaaat tgagaaactg aagtattaaa gaaaagaaaa tgtttcattt ac // LOCUS PEAHSP179A 700 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pisum sativum 17.9 kDa heat shock protein (hsp17.9) mRNA, complete cds. ACCESSION M33900 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 700) AUTHORS Lauzon,L.M., Helm,K. and Vierling,E. TITLE A cDNA clone from Pisum sativum encoding a low molecular weight heat shock protein JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept < 1 469 17.9 kDa heat shock protein (hsp17.9) BASE COUNT 209 a 123 c 170 g 198 t ORIGIN 1 gataattcca agagtcttcg gtactggacg aagaaccaat gcattcgatc cattctcatt 61 agatttatgg gacccattcc agaacttcca actcgcaaga tccgccaccg gaaccaccaa 121 cgagacggca gcttttgcca acgctcacat tgactggaag gaaacaccgg aggctcacgt 181 gttcaaggct gatcttcccg gagtgaagaa ggaagaagtg aaagttgaaa tagaagaaga 241 tcgtgtgctc aagataagcg gagagaggaa aactgaaaag gaagacaaga acgacacctg 301 gcaccgtgtt gagcgtagtc aggggagttt cctccgccgt ttcaggttgc cggaaaatgc 361 taaagttgat caggtgaagg ctgctatgga aaacggtgtt cttaccgtta ctgttcctaa 421 agaggaggtt aagaagcctg aagctaagcc cattcagatt acaggatgag ctcttattct 481 tcctatattt tgatgtttgt gtctcttaat aaaatgttaa aataaaacaa ataataattg 541 tgtgtagtcg agttccagct ttaagagatt gagacatgta tggacttggc tattacttaa 601 gtgtagtagt ttgtgagtat tttgttgggt tatgttagtg tgtatgcaaa taactttttt 661 gagtatgtga aagtttcttt tgattaagct gtatttatcc // LOCUS PEAHSP181A 862 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pisum sativum 18.1 kDa heat shock protein (hsp18.1) mRNA, complete cds. ACCESSION M33899 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 862) AUTHORS Lauzon,L.M., Helm,K. and Vierling,E. TITLE A cDNA clone from Pisum sativum encoding a low molecular weight heat shock protein JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 12 488 18.1 kDa heat shock protein (hsp18.1) BASE COUNT 278 a 128 c 207 g 249 t ORIGIN 1 ctatatcaaa catgtctctg attccaagtt tctttagtgg ccgaaggagc aatgttttcg 61 atcctttctc cctggacgtc tgggatcctt tgaaggactt tccattttca aattcttcac 121 cttccgcttc attccctcgt gagaatcctg cttttgtgag cacacgagtt gactggaagg 181 aaacaccgga agcgcatgtt ttcaaggctg atcttcctgg gctgaaaaag gaggaagtga 241 aagttgaagt tgaagatgat agggttctac agataagcgg agagagaagc gttgagaaag 301 aagataagaa tgatgaatgg catcgcgtgg aacgtagcag tggaaagttc ttaagaaggt 361 tcagattgcc tgagaatgct aaaatggata aagtgaaagc ttccatggag aacggcgttc 421 tgacagtgac cgttccaaaa gaagagataa agaaggctga ggttaagtct attgagattt 481 ctggttaaac ttagaatgag ctatgttact ctgttgcttt tcttggttat aatgttttcc 541 tttttgtggc gtgtgcaaga aataaatggt catgtaattc tgaaatgtta atgtataaat 601 aaataagtaa acagttgttg ttggttattc agaggtgtta tagtattcat attgtaatgt 661 atcagaatga atcttgagaa aagagctgct ataaatagag cttgaagttt taaataaaaa 721 aaaaggttcc agaaaggaat aaaaaactgg taacagctag cagagagaaa aagctcaaac 781 cactgtgtta aggtgaacag cggaagaaaa tgaagagatg ttcatagccc ttcttcttga 841 gtctctccaa gatggagaat tc // LOCUS PEAHSP227A 795 bp ss-mRNA PLN 09-AUG-1990 DEFINITION Pisum sativum 22.7 kDa heat shock protein (hsp22.7) mRNA, complete cds. ACCESSION M33898 KEYWORDS heat shock protein. SOURCE P.sativum (cv Little Marvel) leaf, cDNA to mRNA. ORGANISM Pisum sativum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 795) AUTHORS Lauzon,L.M., Helm,K. and Vierling,E. TITLE A cDNA clone from Pisum sativum encoding a low molecular weight heat shock protein JOURNAL Nucleic Acids Res. 18, 4274-4274 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Vierling, 01-MAY-1990. University of Arizona Department of Biochemistry Biological Sciences West Building Tucson, AZ 85721 FEATURES from to/span description pept 44 637 22.7 kDa heat shock protein (hsp22.7) BASE COUNT 261 a 122 c 170 g 242 t ORIGIN 1 ccaagttcca aacctcaaga acaaaaaaca cacatttcta agtatgagtc tgaaacctct 61 aaacatgtta ctcgttccat ttcttctgct tattctcgcg gctgattttc ctttgaaagc 121 aaaagcatca ctactaccat tcatagattc tcccaacact ctcttatcgg atctctggtc 181 tgatcgtttc ccagatccgt ttcgcgtctt agaacaaatt ccctatggag ttgagaaaca 241 cgaaccatcc ataacattgt cacatgctag agtagactgg aaggaaactc cagagggaca 301 tgtgataatg gtggacgtgc ctgggttgaa aaaagatgat ataaagatag aagtggaaga 361 gaatagggtg ctaagagtga gtggtgagag gaagaaagaa gaagataaaa aaggagatca 421 ttggcacaga gttgaaagat cttatggaaa gttctggagg cagtttaaat tacctcaaaa 481 tgttgatttg gattctgtca aagctaaaat ggaaaacggt gttcttactt taactcttca 541 taagttgtcg catgataaga ttaaaggtcc tagaatggtt agtattgtgg aagaggatga 601 caaaccatct aagatcgtca atgatgagtt gaaataatta tgtgatttgt actcataaaa 661 atgaaaaatg ttttttcatt gtgttatttg tgaataaagg aatgttacct atgatattgg 721 ttgtttgttg tatgtcaact aaagagtgct gtaaaggctt gttaatttca tagtgaataa 781 cttgttggct tttgt // LOCUS ECOHGRF 140 bp ds-DNA SYN 09-AUG-1990 DEFINITION Synthetic human growth hormone releasing factor (hGRF) gene, complete cds. ACCESSION M26106 KEYWORDS growth hormone releasing factor; somatocrinin. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 140) AUTHORS Cravador,A., Jacobs,P., Van Elsen,A., Lacroix,C., Colau,B., Van Alphen,P., Herzog,A. and Bollen,A. TITLE Total DNA synthesis and cloning in Escherichia coli of a gene coding for the human growth hormone releasing factor JOURNAL Biochimie 67, 829-834 (1985) STANDARD simple staff_review FEATURES from to/span description pept 2 139 synthetic human growth hormone releasing factor (hGRF) BASE COUNT 30 a 35 c 36 g 39 t ORIGIN 1 catgtacgct gacgctatct tcactaactc ttaccgtaaa gttctgggtc agctgtctgc 61 tcgtaaactg ctgcaggaca tcatgtctcg tgagcagggt gaatctaacc aggaacgtgg 121 tgctcgtgct cgtctgtaag // LOCUS HUMACALX 724 bp ss-mRNA PRI 09-AUG-1990 DEFINITION Human calcitonin mRNA, complete cds. ACCESSION M26095 KEYWORDS calcitonin. SOURCE Human cell-line BEN, cDNA to mRNA, clone hBEN-JR2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 724) AUTHORS Craig,R.K., Riley,J.H., Edbrooke,M.R., Broad,P.M., Foord,S.M., Al-Kazwini,S.J., Holman,J.J. and Marshall,I. TITLE Expression and function of the human calcitonin/alpha-CGRP gene in health and disease JOURNAL Biochem. Soc. Symp. 52, 91-105 (1986) STANDARD simple staff_review FEATURES from to/span description pept 35 460 calcitonin precursor sigp 35 109 calcitonin signal peptide matp 287 382 calcitonin matp 383 457 flanking peptide BASE COUNT 163 a 195 c 200 g 166 t ORIGIN 1 ggtgagcccc gagattctgg ctcagagagg tgtcatgggc ttccaaaagt tctccccctt 61 cctggctctc agcatcttgg tcctgttgca ggcaggcagc ctccatgcag caccattcag 121 gtctgccctg gagagcagcc cagcagaccc ggccacgctc agtgaggacg aagcgcgcct 181 cctgctggct gcactggtgc aggactatgt gcagatgaag gccagtgagc tggagcagga 241 gcaagagaga gagggctcca gcctggacag ccccagatct aagcggtgcg gtaatctgag 301 tacttgcatg ctgggcacat acacgcagga cttcaacaag tttcacacgt tcccccaaac 361 tgcaattggg gttggagcac ctggaaagaa aagggatatg tccagcgact tggagagaga 421 ccatcgccct catgttagca tgccccagaa tgccaactaa actcctccct ttccttccta 481 atttcccttc ttgcatcctt cctataactt gatgcatgtg gtttggttcc tctctggtgg 541 ctctttgggc tggtattggt ggctttcctt gtggcagagg atgtctcaaa cttcagatgg 601 gaggaaagag agcaggactc acaggttgga agagaatcac ctgggaaaat accagaaaat 661 gagggccgct ttgagtcccc cagagatgtc atcagagctc ctctgtcctg ctttctgaat 721 gtgc // LOCUS HUMCALARP 234 bp ds-DNA PRI 09-AUG-1990 DEFINITION Human calcitonin gene, exon 5. ACCESSION M26094 KEYWORDS calcitonin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 234) AUTHORS Craig,R.K., Riley,J.H., Edbrooke,M.R., Broad,P.M., Foord,S.M., Al-Kazwini,S.J., Holman,J.J. and Marshall,I. TITLE Expression and function of the human calcitonin/alpha-CGRP gene in health and disease JOURNAL Biochem. Soc. Symp. 52, 91-105 (1986) STANDARD simple staff_review FEATURES from to/span description pept / 36 195 calcitonin precursor, exon 5 (AA at 37) matp 54 165 alpha-calcitonin related peptide matp 166 192 carboxyl-terminal-flanking peptide (PDN-21) IVS < 1 35 alpha-calcitonin related peptide intron C BASE COUNT 59 a 63 c 61 g 51 t ORIGIN 1 cagatcttct cttctttctc catcctgcaa atcagaatca ttgcccagaa gagagcctgt 61 gacactgcca cctgtgtgac tcatcggctg gcaggcttgc tgagcagatc agggggtgtg 121 gtgaagaaca actttgtgcc caccaatgtg ggttccaaag cctttggcag gcgccgcagg 181 gaccttcaag cctgagcagc tgaacgactc aagaaggtca caataaagct gaac // LOCUS PIPVGB 1883 bp ds-DNA BCT 09-AUG-1990 DEFINITION Plasmid pIP630 (from S. aureus) virginiamycin B hydrolase (vgb) gene, complete cds. ACCESSION M36022 KEYWORDS virginiamycin B hydrolase; virginiamycin-resistance. SOURCE Plasmid pIP630 (from Staphylococcus aureus) DNA. ORGANISM Plasmid pIP630 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1883) AUTHORS Allignet,J., Loncle,V., Mazodier,P. and El Solh,N. TITLE Nucleotide sequence of a Staphylococcal plasmid gene, vgb, encoding a hydrolase inactivating the B components of virginiamycin-like antibiotics JOURNAL Plasmid 20, 271-275 (1988) STANDARD simple staff_review FEATURES from to/span description pept 641 1540 virginiamycin B hydrolase BASE COUNT 641 a 284 c 375 g 583 t ORIGIN 1 agatctacgg attttcgcca tgccacgaaa ttagcatcat gctagcaagt taaacgaaca 61 ctgacatgat atattagtgg ttagctatat ttttttactt tgcaacagaa ccattattat 121 ggtttcttaa aaaaatacaa tgctttttcg ttccttttta ttcatcttcc aattctttgg 181 catgactgtg tgcattttaa atttgttcag caaatgtgcc gtgtaatgga atacttttta 241 aatactgtgt aatgataatg caaggcacat actaaaagga atcttcgatt ttgttggctt 301 attatttgac ttttcataac aattatctta aggttaaaca aatcaataat cgaaagggtg 361 aaaaaaagca catgatcata taatcctaat tttaaaagaa atcgatattt tggccttggg 421 ttcaatttca aagtggtttt ggaatgaact ctatttgtta tcggcttttt tctgagatag 481 gattaatgta atgtgctttt ttggctttaa aaagaccttt gttatccaaa aagtcttttt 541 aagtgtcctt atccgtgcca cattgcctcc tatctcgaaa aaagagatgg aggctatttt 601 tgttttggaa atttaattta aataaaacgg aggggataga atggaattta aattacaaga 661 attaaatctt actaaccaag atacaggacc atatggtata accgtttcag ataaggggaa 721 agtttggatt acacaacata aagcaaatat gataagttgc atcaatttag atggaaaaat 781 tacagagtac ccactaccga caccagatgc aaaagtcatg tgtttaacta tatcctcaga 841 tggggaagtt tggtttactg agaatgcagc aaacaaaata gggaggatta caaaaaaagg 901 gattattaag gaatatacat tgcctaaccc agattcagca ccctacggta ttacagaagg 961 accaaatgga gatatatggt ttacagaaat gaatggcaac cgtattggac gtattacgga 1021 cgacggtaaa attcgtgaat acgagctgcc taataaagga tcttaccctt cttttatcac 1081 tttgggttct gataatgccc tgtggttcac agaaaatcaa aataatgcta ttggtagaat 1141 tacagaaagt ggggatatta cagagtttaa aattcctaca cctgcatcag gaccagttgg 1201 tattacaaag gggaacgacg atgctttatg gtttgtggaa attatcggta ataagatagg 1261 gcgaataact cctctggggg aaattaccga attcaaaatt ccaacgccaa acgctcgacc 1321 tcatgcaatt actgctggag caggaattga tttatggttt actgaatggg gggctaataa 1381 aataggaagg ctgacaagca ataatataat tgaggaatac ccaattcaaa tcaaaagtgg 1441 tgaaccacat ggcatttgtt tcgatggtga aacaatttgg tttgcaatgg agtgtgacaa 1501 gataggcaaa ttaactctca ttaaggataa tatggagtga gtcttttgaa tttaaacaat 1561 gaccatggac ctgatcccga aaatatttta ccgataaaag ggaatcggaa tcttcaattt 1621 ataaaaccta ctataacgaa cgaaaacatt ttggtggggg aatattctta ttatgatagt 1681 aagcgaggag aatcctttga agatcaagtc ttatatcatt atgaagtgat tggagataag 1741 ttgattatag gaagattttg ttcaattggt cccggaacaa catttattat gaatggtgca 1801 aaccatcgga tggatggatc aacatatcct tttcatctat tcaggatggg ttgggagaag 1861 tatatgcctt ccttaaaaga tct // LOCUS ECOLIVHMGF 8703 bp ds-DNA BCT 09-AUG-1990 DEFINITION E.coli leucine-specific transport (LS-BP; LIV-BP) system (livHMGF) genes, complete cds. ACCESSION J05516 M13166 M10426 M10427 K02178 KEYWORDS heat shock protein; high affinity branched-chain amino acid transport system; htpR gene; isoleucine binding protein; leucine binding protein; leucine binding protein; livJ gene; livK gene; valine binding protein. SOURCE E.coli (K12 strain AE404) isolate W3110 DNA, clone pOX[1,15]. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 90 to 1312) AUTHORS Landick,R., Vaughn,V., Lau,E.T., VanBogelen,R.A., Erickson,J.W. and Neidhardt,F.C. TITLE Nucleotide sequence of the heat shock regulatory gene of E. coli suggests its protein product may be a transcription factor JOURNAL Cell 38, 175-182 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 1312; revises [1]) AUTHORS Vaughn,V. JOURNAL Unpublished (1985) Univ Michigan Med School, Ann Arbor MI 48109 STANDARD full staff_review REFERENCE 3 (bases 1407 to 2507 and 3503 to 4609) AUTHORS Landick,R. and Oxender,D.L. TITLE The complete nucleotide sequences of the Escherichia coli LIV-BP and LS-BP genes: Implications for the mechanism of high-affinity branched-chain amino acid transport JOURNAL J. Biol. Chem. 260, 8257-8261 (1985) STANDARD full staff_review REFERENCE 4 (bases 4610 to 5696) AUTHORS Nazos,P.M., Antonucci,T.K., Landick,R. and Oxender,D.L. TITLE Cloning and characterization of livH, the structural gene encoding a component of the leucine transport system in Escherichia coli JOURNAL J. Bacteriol. 166, 565-573 (1986) STANDARD simple staff_review REFERENCE 5 (bases 1136 to 8703) AUTHORS Adams,M.D., Wagner,L.M., Graddis,T.J., Landick,R., Antonucci,T.K., Gibson,A.L. and Oxender,D.L. TITLE Nucleotide sequence and genetic characterization reveal six essential genes for the LIV-I and LS transport systems of Escherichia coli JOURNAL J. Biol. Chem. 265, 11436-11443 (1990) STANDARD full staff_review COMMENT Draft entry and sequence in computer readable form for [2] kindly provided by V.Vaughn, 15-NOV-1985. Draft entry and computer-readable sequence for [1] kindly submitted by M.D.Adams 19-APR-1990, for release after publication. The htpR (also known as "hin") gene product appears homologous to the sigma factor of RNA polymerase, and the two proteins are predicted to have similar secondary structures. In addition, two regions of the predicted htpR product resemble protein-DNA contact points conserved in known DNA-binding proteins. The htpR gene encodes a protein, which appears to be identical in size (33 kd by migration on two dimensional polyacrylamide gel) and isoelectric point with the protein, F33.4, normally present in E.coli but deficient in an htpR mutant. A region homologous to the rpoD gene is located at positions 508-549 [2]. The E.coli LIV-I and LS AA transport systems are high-affinity, periplasmic, binding protein-dependent systems that utilize the leucine-, isoleucine-, valine-binding protein (LIV-BP) and leucine-specific binding protein (LS-BP), respectively. These two binding proteins interact with a common set of membrane proteins to transport branched-chain AAs into the cytoplasm. The two BP genes are encoded in a regulon that also contains the genes for the common membrane protein components. FEATURES from to/span description pept 1407 2510 LIV-BP precursor (livJ) sigp 1407 1475 LIV-BP signal peptide matp 1476 2507 LIV-BP mature protein pept 3503 4612 LS-BP precursor (livK) sigp 3503 3571 LS-BP signal peptide matp 3572 4609 LS-BP mature peptide pept 4660 5586 leucine-specific binding protein (livH) pept 5583 6857 livM product pept 6854 7621 livG product pept 7623 8336 livF product mRNA 1303 > 4612 livJ mRNA [2] /nomgen="livJ" ORF 3078 2566 (c) ORF19 revision 101 102 ct in [2]; cgt in [1] revision 270 270 a in [2]; g in [1] revision 280 280 a in [2]; g in [1] revision 284 284 a in [2]; g in [1] revision 304 304 a in [2]; g in [1] revision 310 310 a in [2]; g in [1] revision 322 324 caa in [2]; agg in [1] revision 328 328 g in [2]; a in [1] revision 1453 1453 c in [5]; g in [3] revision 3832 3832 t in [5]; c in [3] BASE COUNT 2039 a 2176 c 2425 g 2062 t 1 others ORIGIN 76 min on the K12 map. 1 ctgcacggat caacattacg ccacttacgc ctgaataata aaagcgtgta tactctttcc 61 tgcaatgggt tccgtagcag ggaaagagac cccgttgtct cttcccggta tttcatctct 121 atgtcacatt ttgtgcgtaa tttattcaca agcttgcatt gaacttgtgg ataaaatcac 181 ggtctgataa aacagtgaat gataacctcg ttgctcttaa gctctggcac agttgttgct 241 accactgaag cgccagaaga tatcgattga gaggatttga atgactgaca aaatgcaaag 301 tttagcttta gccccagttg gcaacctgga ttcctacatc cgggcagcta acgcgtggcc 361 gatgttgtcg gctgacgagg agcgggcgct ggctgaaaag ctgcattacc atggcgatct 421 ggaagcagct aaaacgctga tcctgtctca cctgcggttt gttgttcata ttgctcgtaa 481 ttatgcgggc tatggcctgc cacaggcgga tttgattcag gaaggtaaca tcggcctgat 541 gaaagcagtg cgccgtttca acccggaagt gggtgtgcgc ctggtctcct tcgccgttca 601 ctggatcaaa gcagagatcc acgaatacgt tctgcgtaac tggcgtatcg tcaaagttgc 661 gaccaccaaa gcgcagcgca aactgttctt caacctgcgt aaaaccaagc agcgtctggg 721 ctggtttaac caggatgaag tcgaaatggt ggcccgtgaa ctgggcgtaa ccagcaaaga 781 cgtacgtgag atggaatcac gtatggcggc acaggacatg acctttgacc tgtcttccga 841 cgacgattcc gacagccagc cgatggctcc ggtgctctat ctgcaggata aatcatctaa 901 ctttgccgac ggcattgaag atgataactg ggaagagcag gcggcaaacc gtctgaccga 961 cgcgatgcag ggtctggacg aacgcagcca ggacatcatc cgtgcgcgct ggctggacga 1021 agacaacaag tccacgttgc aggaactggc tgaccgttac ggcgtttccg ctgagcgtgt 1081 acgccagctg gaaaagaacg cgatgaaaaa attgcgtgct gccattgaag cgtaatttcc 1141 gctattaagc agagaaccct agatgagagt ccggggtttt tgttttttgg gcctctgtaa 1201 taatcaattt cccctccggc aaaacgccaa tccccacgca gattgttaat aaactgtcaa 1261 aatagctatt ccaatatcat aaaaatcggg atatgtttta gcagagtatg ctgctaaagc 1321 acgggtagtc atgcataaaa cgaaataaag tgctgaaaaa caacatcaca acacacgtaa 1381 taaccagaag aatggggatt ctcaggatga acacaaaggg caaagcgtta ctggcaggat 1441 tgatcgcgct ggcattcagc aatatggctc tggcagaaga tattaaagtc gcggtcgtgg 1501 gcgcaatgtc cggtccggtt gcgcagtacg gtgaccagga gtttaccggc gcagagcagg 1561 cggttgcgga tatcaacgct aaaggcggca ttaaaggcaa caaactgcaa atcgcaaaat 1621 atgacgatgc ctgtgatccg aaacaggcgg ttgcggtggc gaacaaagtc gttaacgacg 1681 gcattaaata tgtgattggt cacctctgtt cctcatcaac gcagcctgcg tcggatatct 1741 acgaagacga aggcattttg atgatcaccc cagcggcaac cgcgccggag ctgaccgccc 1801 gtggctatca gctgatcctg cgaaccaccg gcctggattc cgaccaaggg ccgacggctg 1861 ccaaatatat tcttgagaaa gtgaaaccgc agcgtattgc tatcgttcac gacaaacagc 1921 aatacggcga aggtctggcg cgagcggtgc aggacggcct gaagaaaggc aatgcaaacg 1981 tggtgttctt tgatggcatc accgccgggg aaaaagattt ctcaacgctg gtggcgcgtc 2041 tgaaaaaaga gaatatcgac ttcgtttact acggcggtta tcacccggaa atggggcaaa 2101 tcctgcgtca ggcacgcgcg gcagggctga aaactcagtt tatggggccg gaaggtgtgg 2161 ctaacgtttc gctgtctaac attgcgggcg aatcagcgga agggctactg gtgaccaaac 2221 cgaagaacta cgatcaggtt ccggcgaaca aacccattgt tgacgcgatc aaagcgaaaa 2281 aacaggaccc aagtggcgca ttcgtttgga ccacctacgc cgcgctgcaa tctttgcagg 2341 cgggcctcaa tcagtctgac gatccggctg aaatcgccaa atacctgaaa gcgaactccg 2401 tggataccgt aatgggcccg ctgacctggg atgagaaagg cgatctgaaa ggctttgagt 2461 tcggcgtatt tgactggcac gccaacggca cggccaccga tgcgaagtaa tcattaatcg 2521 gcaactttgg gttgccgcca aattgctaat atcgagtacg ttgcttcatg ccggatgcgg 2581 cgtaaacgcc ttatccggcc tacaagatcc aaagaaatca gtaaattgca acacacattg 2641 taggcctgat aagcgtagcg catcaggcaa tacacttttg aaatcggact tgacgattaa 2701 cacttctccc agccgccctg ttgtgccgta aaccccagcg cctgcataaa cgccgtcatc 2761 acaccgcgat cttccacgcc gcagccgcca tccaccagca tgaaacgcca agattgttac 2821 gcaaaacctc ttccagcaga tattgcccca ccgcgacggc gggtgacttc ccgcacgcgc 2881 agggaatcca gtgctccctc ggtgccgctt aaggttgccc gcgcggcgcg agcaggcgct 2941 cgttaaacgc gcggcgtaga tacggtggtt atcgtcaacc tgtaacgagg aaggggaata 3001 ctcncggcca agatcttttg cgaggtcaat ccggtcttgg tcgctaaatt tttctaatcg 3061 aatgatggtc agcttcatgg gtaacccgtg taaatcacaa aagtgtaacc agtgtagcga 3121 aataatttaa tcggaggctt tctctttttt atttcttttg gcaggtgatt aattttttaa 3181 cagcaataat tacaaaatta aaacattaga gaatgaaaaa tgtccagcat aatcccctga 3241 atgatagtga attattccgc ccctttgtgc cgttatttta tgctgacaaa ggcacttttt 3301 tctgtttgtc tatcaataaa ttcggaatat tatctgttct taatcgactg aaaaatgggg 3361 attttaatcg ctattatcac aaaatactgc gctaacccct taatcagaca ggcaaaaaca 3421 gtgcagtata aaaaaagaac agtctgattt gttaacacat aaaaacaaag caacacaaca 3481 tcacgaatgg ggatttttga ctatgaaacg gaatgcgaaa actatcatcg cagggatgat 3541 tgcactggca atttcacaca ccgctatggc tgacgatatt aaagtcgccg ttgtcggcgc 3601 gatgtccggc ccgattgccc agtggggcat aatggaattt aacggcgcgg agcaggcgat 3661 taaagacatt aatgccaaag ggggaattaa gggcgataaa ctggttggcg tggaatatga 3721 cgacgcatgc gacccgaaac aagccgttgc ggtcgccaac aaaatcgtta atgacggcat 3781 taaatacgtt attggtcatc tgtgttcttc ttctacccag cctgcgtcag atatctatga 3841 agacgaaggt attctaatga tctcgccggg agcgaccgcg ccggaactaa cccaacgcgg 3901 ttatcaacac attatgcgta ctgccgggct ggactcttcc caggggccaa cggcggcaaa 3961 atacattctt gagacggtga agccccagcg catcgccatc atccacgaca aacaacagta 4021 tggcgaaggg ctggcgcgtt cggtgcagga cgggctgaaa gcggctaacg ccaacgtcgt 4081 cttcttcgat ggtattaccg ccggggagaa agatttctcc gcgctgatcg cccgcctgaa 4141 aaaagaaaac atcgacttcg tttactacgg cggttactac ccggaaatgg ggcagatgct 4201 gcgccaggcc cgttccgttg gcctgaaaac ccagtttatg gggccggaag gtgtgggtaa 4261 tgcgtcgttg tcgaacattg ccggtgatgc cgccgaaggc atgttggtca ctatgccaaa 4321 acgctatgac caggatccgg caaaccaggg catcgttgat gcgctgaaag cagacaagaa 4381 agatccgtcc gggccttatg tctggatcac ctacgcggcg gtgcaatctc tggcgactgc 4441 ccttgagcgt accggcagcg atgagccgct ggcgctggtg aaagatttaa aagctaacgg 4501 tgcaaacacc gtgattgggc cgctgaactg ggatgaaaaa ggcgatctta agggatttga 4561 ttttggtgtg ttccagtggc acgccgacgg ttcatccacg gcagccaagt gatcatccca 4621 ccgcccgtaa aatgcgggcg ggtttagaaa ggttacctta tgtctgagca gtttttgtat 4681 ttcttgcagc agatgtttaa cggcgtcacg ctgggcagta cctacgcgct gatagccatc 4741 ggctacacca tggtttacgg cattatcggc atgatcaact tcgcccacgg cgaggtttat 4801 atgattggca gctacgtctc atttatgatc atcgccgcgc tgatgatgat gggcattgat 4861 accggctggc tgctggtagc cgcgggattc gtcggcgcaa tcgtcattgc cagcgcctac 4921 ggctggagta tcgaacgggt ggcttaccgc ccggtgcgta actctaagcg cctgattgca 4981 ctcatctctg caatcggtat gtccatcttc ctgcaaaact acgtcagcct gaccgaaggt 5041 tcgcgcgacg tggcgctgcc gagcctgttt aacggtcagt gggtggtggg gcatagcgaa 5101 aacttctctg cctctattac caccatgcag gcggtgatct ggattgttac cttcctcgcc 5161 atgctggcgc tgacgatttt cattcgctat tcccgcatgg gtcgcgcgtg tcgtgcctgc 5221 gcggaagatc tgaaaatggc gagtctgctt ggcattaaca ccgaccgggt gattgcgctg 5281 acctttgtga ttggcgcggc gatggcggcg gtggcgggtg tgctgctcgg tcagttctac 5341 ggcgtcatta acccctacat cggctttatg gccgggatga aagcctttac cgcggcggtg 5401 ctcggtggga ttggcggcat tccgggggcg atgattggcg gcctgattct ggggattgcg 5461 gaggcgctct cttctgccta tctgagtacg gaatataaag atgtggtctc attcgccctg 5521 ccgattctgg tgctgctggt gatgccgacc ggtattctgg gtcgcccgga ggtagagaaa 5581 gtatgaaacc gatgcatatt gcaatggcgc tgctctctgc cgcgatgttc tttgtgctgg 5641 cgggcgtctt tatgggcgtg caactggagc tggatggcac caaactggtg gtcgacacgg 5701 cttcggatgt ccgttggcag tgggtgttta tcggcacggc ggtggtcttt ttcttccagc 5761 ttttgcgacc ggctttccag aaagggttga aaagcgtttc cggaccgaag tttattctgc 5821 ccgccattga tggctccacg gtgaagcaga aactgttcct cgtggcgctg ttggtgcttg 5881 cggtggcgtg gccgtttatg gtttcacgcg ggacggtgga tattgccacc ctgaccatga 5941 tctacattat cctcggtctc gggctgaacg tggttgttgg tctttctggt ctgctggtgc 6001 tggggtacgg cggtttttac gccatcggct tacacttttg cgctgctcaa tcactattac 6061 ggcttgggct tctggacctg cctgccgatt gctggattaa tggcagcggc ggcggcttcc 6121 tgctcggttt tccggtgctg cgtttgcgcg gtgactatct ggcgatcgtt accctcggtt 6181 tcggcgaaat tgtgcgcata ttgctgctca ataacaccga aattaccggc ggcccgaacg 6241 gaatcagtca gatcccgaaa ccgacactct tcggactcga gttcagccgt accgctcgtg 6301 aaggcggctg ggacacgttc agtaatttct ttggcctgaa atacgatccc tccgatcgtg 6361 tcatcttcct ctacctggtg gcgttgctgc tggtggtgct aagcctgttt gtcattaacc 6421 gcctgctgcg gatgccgctg gggcgtgcgt gggaagcgtt gggtgaagat gaaatcgcct 6481 gccgttcgct gggcttaagc ccgcgtcgta tcaagctgac tgcctttacc ataagtgccg 6541 cgtttgccgg ttttgccgga acgctgtttg cggcgcgtca gggctttgtc agcccggaat 6601 ccttcacctt tgccgaatcg gcgtttgtgc tggcgatagt ggtgctcggc ggtatgggct 6661 cgcaatttgc ggtgattctg gcggcaattt tgctggtggt gtcgcgcgag ttgatgcgtg 6721 atttcaacga atacagcatg ttaatgctcg gtggtttgat ggtgctgatg atgatctggc 6781 gtccgcaggg cttgctgccc atgacgcgcc ggcaactgaa gctgaaaaac ggcgcagcga 6841 aaggagagca ggcatgagtc agccattatt atctgttaac ggcctgatga tgcgcttcgg 6901 cggcctgctg gcggtgaaca acgtcaatct tgaactgtac ccgcaggaga tcgtctcgtt 6961 aatcggccct aacggtgccg gaaaaaccac ggtttttaac tgtctgaccg gattctacaa 7021 acccaccggc ggcaccattt tactgcgcga tcagcacctg gaaggtttac cggggcagca 7081 aattgcccgc atgggcgtgg tgcgcacctt ccagcatgtg cgtctgttcc gtgaaatgac 7141 ggtaattgaa aacctgctgg tggcgcagca tcagcaactg aaaaccgggc tgttctctgg 7201 cctgttgaaa acgccatcct tccgtcgcgc ccagagcgaa cggctcgacc gcgccgcgac 7261 ctggcttgag cgcattggtt tgctggaaca cgccaaccgt caggcgagta acctggccta 7321 tggtgaccag cgccgtcttg agattgcccg ctgcatggtg acgcagccgg agattttaat 7381 gctcgacgaa cctgcggcag gtcttaaccc gaaagagacg aaagagctgg atgagctgat 7441 tgccgaactg cgtaatcatc acaacaccac tatcttgttg attgaacacg atatgaagct 7501 ggtgatggga atttcggacc gaatttacgt ggtcaatcag gggacgccgc tggcaaacgg 7561 tagcccggag cagatccgta ataacccgga cgtgatccgt gcctatttag gtgaggcata 7621 agatggaaaa agtcatgttg tcctttgaca aagtcagcgc ccactacggc aaaatccagg 7681 cgctgcatga ggtgagcctg catatcaatc agggcgagat tgtcacgctg attggcgcga 7741 acggggcggg gaaaaccacc ttgctcggca cgttatgcgg cgatcccggt gccaccagcg 7801 ggcgaattgt gtttgatgat aaagacatta ccgactggca gacagcgaaa atcatgcgcg 7861 aagcggtggc gattgtcccg gaagggcgtc gcgtcttctc gcggatgacg gtggaagaga 7921 acctggcgat gggcggtttt tttgctgaac gcgaccagtt ccaggagcgc ataaagtggg 7981 cgtatgagct gtttccacgt ctgcatgagc gccgtattca gcgggcgggc accatgtccg 8041 gcggtgaaca gcagatgctg gcgattggtc gtgcgctgat gagcaacccg cgtttgctac 8101 tgcttgatga gccatcgctc ggtcttgcgc cgattatcat ccagcaaatt ttcgacacca 8161 tcgagcagct gcgcgagcag gggatgacta tctttctcgt cgagcagaac gccaaccagg 8221 ggctaaagct ggcggatcgc ggctacgtgc tggaaaacgg ccatgtagtg ctttccgata 8281 ctggtgatgc gctgctggcg aatgaagcgg tgagaagtgc gtatttaggc gggtaataac 8341 acgttgattg atagggagtc aaaagactcc tttgagacag gtgacaaatg taaaattgcc 8401 tgatgcgctg cgcttatcag gcctactggg tgagtggcaa tatgttgaat ttgcacgatc 8461 ttgtaggcct gataagcgtt taccgcgcat ccggcatgaa acgatgagca atctgtagag 8521 tttgattcag accttctata ttttcccgct tatccgtgcc ccatctccca ttttccctca 8581 cccacgccgt caccgccttg tcatctttct gacaccttac tatcttacaa atgtaacaaa 8641 aaagttattt ttctgtaatt cgagcatgtc atgttacccc gcgagcataa aacgcgtgaa 8701 ttc // LOCUS BOVGOA 472 bp ss-mRNA MAM 09-AUG-1990 DEFINITION B.taurus go-alpha mRNA, 3' end. ACCESSION J02900 KEYWORDS go-alpha. SOURCE B.taurus retina, cDNA to mRNA, clone GO3.1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (sites) AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M., Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M. TITLE Multiple forms of go-alpha mRNA: Analysis of the 3'-untranslated regions JOURNAL Biochemistry 29, 5069-5076 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 472) AUTHORS Price,S.R., Murtagh,J.J.Jr., Tsuchiya,M., Serventi,I.M., Van Meurs,K.M., Angus,C.W., Moss,J. and Vaughan,M. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by S.R.Price, 12-JUN-1990. FEATURES from to/span description pept < 1 3 go-alpha (AA at 1) BASE COUNT 130 a 133 c 88 g 121 t ORIGIN 1 tgacctcttg tcctgtatag caacctattt ggtaatgatt ccagcactca cagaaaagct 61 tgcacacata cacacacacc ccacccctcc ccactaacaa atgcaagttg gtaaacaaat 121 tccaaaaagg cataacaaac cttatatata tagacaaata tatattaaag ttttttagtc 181 tgtactagaa agagcttcag acagaactga ccaccattcc attgctcatc aatttcctgg 241 gacagcacct gagcgtgcgc ttacgcgcgt acacacacat agacacgcac tgcgatacaa 301 gtcctgattt gggagtccgt ccttttaaaa acagccacat gctttcacgc tctgagaccc 361 acccgtttct gtgagcaggg ggagggcaag gaaagccctg gcctcagtcc agccttttct 421 ctgcttccac ctgctcaggc tgtgtgctct tggttctgtc ctgcacttgt gt // LOCUS CAJCAT 1334 bp ds-DNA BCT 09-AUG-1990 DEFINITION C.coli plasmid C-589 chloramphenicol acetyltransferase (cat) gene, complete cds. ACCESSION M35190 KEYWORDS chloramphenicol acetyltransferase. SOURCE C.coli plasmid C-589 DNA. ORGANISM Campylobacter coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1334) AUTHORS Wang,Y. and Taylor,D.E. TITLE Chloramphenicol resistance in Campylobacter coli, nucleotide sequence, expression and cloning vector construction JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.E.Taylor, 15-JUN-1990. FEATURES from to/span description pept 309 932 chloramphenicol acetyltransferase (cat) mRNA 277 > 932 chloramphenicol acetyltransferase mRNA signal 242 271 promoter binding 297 301 ribosome binding site signal 960 1006 transcriptional termination signal BASE COUNT 433 a 232 c 282 g 387 t ORIGIN 1 attcccacaa cgccggaaac aagccgtgcc acgagcttat aataaaagag ggaagagaag 61 cgtatttttc ctcacttccg gtgaaggata tcgagaaaaa tctaaatgat aacggaattc 121 cgtcgtcggt atcgtatgga gcggacaacg agtaaaagag tgaccgccga gataacccat 181 tgctcggcgg tgttcctttc caagttaatt gcgtgatata gattgaaaag tggatagatt 241 tatgatatag tggatagatt tatgatataa tgagttatca acaaatcgga atttacggag 301 gataaatgat gcaattcaca aagattgata taaataattg gacacgaaaa gagtatttcg 361 accactattt tggcaatacg ccctgcacat atagtatgac ggtaaaactc gatatttcta 421 agttgaaaaa ggatggaaaa aagttatacc caactctttt atatggagtt acaacgatca 481 tcaatcgaca tgaagagttc aggaccgcat tagatgaaaa cggacaggta ggcgtttttt 541 cagaaatgct gccttgctac acagtttttc ataaggaaac tgaaaccttt tcgagtattt 601 ggactgagtt tacagcagac tatactgagt ttcttcagaa ctatcaaaag gatatagacg 661 cttttggtga acgaatggga atgtccgcaa agcctaatcc tccggaaaac actttccctg 721 tttctatgat accgtggaca agctttgaag gctttaactt aaatctaaaa aaaggatatg 781 actatctact gccgatattt acgtttggga agtattatga ggagggcgga aaatactata 841 ttcccttatc gattcaagtg catcatgccg tttgtgacgg ctttcatgtt tgccgttttt 901 tggatgaatt acaagacttg ctgaataaat aaaatcccag tttgtcgcac tgataaaaac 961 cctttaggaa ctaaagggcg cacttctata ctctctgtcg agagtagtgc gtcctgcgga 1021 gcttcattcc cggtcagcgc gcttatcaat atatctatag aatgggcaaa gcataaaaac 1081 ttgcatggac taatgcttga aacccaggac aataacctta tagcttgtaa attctatcat 1141 aattgtggtt tcaaaatcgg ctccgtcgat actatgttat acgccaactt tgaaaacaac 1201 tttgaaaaag ctgttttctg gtatttaagg ttttagaatg caaggaacag tgaattggag 1261 ttcgtcttgt tattaattag cttcttgggg tatctttaaa tactgtagaa agaggaagga 1321 aataataaat ggct // LOCUS CLOCBA 5120 bp ds-DNA BCT 09-AUG-1990 DEFINITION C.acetobutylicum beta-D-galactosidase (cbgA) and beta-D-galactosidase regulatory protein (cbgR) genes, complete cds. ACCESSION M35107 KEYWORDS beta-D-galactosidase; beta-D-galactosidase regulatory protein. SOURCE C.acetobutylicum (strain NCIB2951) DNA. ORGANISM Clostridium acetobutylicum Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1260 to 5120) AUTHORS Hancock,K.R., Rockman,E., Pearce,L., Maddox,I.S. and Scott,D.B. TITLE Clostridium acetobutylicum beta-galactosidase gene, cbgA, is positively regulated in Escherichia coli by a novel regulatory gene, cbgR JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 5120) AUTHORS Scott,D.B., Hancock,K.R., Pearce,L. and Maddox,I.S. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by D.B.Scott, 11-JUN-1990. Author address:D.B.Scott: Molecular Genetics Unit Department of Microbiology and Genetics Massey University Palmerston North, New Zealand E-mail:D.B.Scott@massey.ac.nz FEATURES from to/span description pept 1560 4253 beta-D-galactosidase (cbgA) pept 4500 4805 beta-D-galactosidase regulatory protein (cbgR) BASE COUNT 1921 a 683 c 876 g 1640 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattccttt tcatatatat ctttaatatt tctactggaa tagaagaggt tgctcaatac 61 aaaaaatgct tctttaaaac tatttgaaac tacttctgaa atattttcta gcttactaaa 121 tagagaatta taatttttat catcaaaatt tagaattaca actatgattt cgttttcaat 181 attagcaatt tgtatattat aattgctatt taatccgtct aaagaaaatt ctttgccgat 241 ttctgaaatt gtaaaatcaa taatttcatg gcgtttgcta taattatcat atatttcttt 301 gcgtttaaac caaataagca aaatgattga aaagtaaata tgtatcaaag tagttaaagt 361 caggatcatg tcaaaacctg atataaggcg atttaaggcg ctattagtga gacttaaaga 421 gtttccttct aaagtatttc ttttcatttt tattgaaatc ttttttagag tacttaataa 481 ctcagaagga tttagagaag gttttaaaat ataatcaaca gcaccatttt gaaaagatga 541 tttaacatat tcaaaatcgc tataactact taagatgata attcttatct taggatattt 601 gtcctgcaca aatttagcta attcaacccc atttatttgg ggcattacaa catcagaaat 661 tataatgtca ggaatatcct tttttatcat ttccagagct tcttgaccat tagaagcctg 721 tcctataatt tgaaagcctt ctttttccca atcaatcata tgagttatgc cttgccgcat 781 aataaattca tcatcaacaa ctaaattttt actatattcg ttcaatagta tagcacccct 841 tattctaaaa ttaccacaac atagataaat attgcttaat actattatac cttatagatt 901 tattgtatgt atctgtatac gttacgttaa ttcatctaca aatttatatg agttttggtt 961 gcacttttag agaaaatctt tttgtctatg gtcttattgt cctataatgg tcaaatcatc 1021 tttaccaaag tctcttgatt taaagagata aaaacaccac tgatccatta ttcctcattt 1081 tggtaatgaa cctatgcggt tgaagatatt aatcagatgt ctaaatactt tagaaaaaaa 1141 gacctttact aatatcttca atatttacac ccctattcta aaattaccac aagatagata 1201 aatattgctt aatactgatt ataccttata gattaaaggt tttcaattaa acaataaatt 1261 actttagtaa agtttagtaa aatataattg attttttact aaaaagataa taaaatgaaa 1321 ctataaattt agttaatagc ataaatctaa catcagaaga taggataaat taaagaagta 1381 atgtaattga ttacgaaaca aaatctcata ttaatattag cccataattt ttttattctc 1441 atatatgttt aagtattaat taaatgtgac tttataaaaa ggttgcattt agttaatacg 1501 attaacaact ttaatttaaa aaagcaataa ctctacaaag tgaaagtgag ggggtaagta 1561 tgattaataa taaaccgtca ttagattggc tagaaaatcc ggaaatattt agagttaata 1621 gaatagatgc tcattctgat acttggtttt atgaaaaatt tgaggatgtt aaattagaag 1681 acaccatgcc tcttaagcaa aatttaaatg gaaaatggag attttcatat agtgaaaatt 1741 catcattaag aattaaagag ttttataagg atgagtttga cgtaagttgg attgattata 1801 ttgaagttcc aggtcatatt cagcttcaag gatatgataa atgtcaatat attaatacta 1861 tgtatccttg ggaaggtcac gatgaattaa gaccacctca tatttcaaaa acatataatc 1921 cggtgggaag ctatgtaaca ttttttgaag ttaaagatga actcaaaaat aagcagactt 1981 ttatttcttt tcaaggtgtt gaaacagcat tttacgtatg ggtaaatgga gaatttgtag 2041 gatatagcga agatacattt acaccatcag aatttgatat tactgattat ttaagagagg 2101 gagaaaataa acttgcagtt gaggtttata aaaggagtag cgcaagttgg atagaagatc 2161 aagatttctg gagattttca ggcatcttta gagatgtata tttatatgca gttccagaaa 2221 ctcatgtaaa tgatatattt ataaaaacag atttatatga cgatttcaaa aacgcaaagt 2281 taaatgctga acttaaaatg attggaaatt cagaaacaac agttgaaaca tatttagaag 2341 ataaagaagg aaataaaata gctatatctg aaaagattcc gttctctgat gagttgactt 2401 tatatttaga tgcgcaaaat ataaacctat ggagtgcaga agagcctaac ttatatacac 2461 tttatatttt agtgaataaa aaagatggta atttaattga ggttgtaact caaaagatag 2521 ggtttaggca ctttgaaatg aaggataaaa ttatgtgtct aaaatggaaa cgtattatct 2581 ttaaaggcgt aaaccgtcac gaatttagcg caagacgtgg acgctcaatt acgaaagagg 2641 acatgttgtg ggatattaag ttcttgaaac aacacaatat taatgctgtt agaacatcac 2701 attatccaaa tcaaagttta tggtacagac tttgcgatga atacgggatt tatttaatag 2761 atgaaacaaa tttagaaagc catggttcat ggcaaaagat ggggcagatt gaaccatcat 2821 ggaatgtgcc aggaagtctt ccacagtggc aggcagcagt tttagatcga gcatcatcaa 2881 tggttgaaag agataaaaat catccatctg tacttatttg gtcatgtggt aatgaatcct 2941 atgcgggtga agatatttat cagatgtcta aatactttag aaaaaaagat ccttcacgtt 3001 tagtgcacta tgaaggggta actagatgca gagaatttat gacacgacga catgaaagta 3061 gaatgtatgc aaaggcagca gaaatagaag aatatcttaa tgataatccg aagaaacctt 3121 atatacagct gcgatacatg cactcaatgg gtaactcaac tggtggaatg atgaaataca 3181 cagaacttga agataaatat ttgatgtatc aaggtggatt catttgggat tacggcgatc 3241 aggcgttgta tagaaaactt ccagatggaa aagaagttct agcttatgga ggagacttta 3301 cagatcgtcc aacagactat aatttctctg gaaatggttt gatttatgca gatagaacta 3361 tatcacctaa agcacaggaa gttaagtatc tatatcaaaa cgtaaaatta gaaccagatg 3421 aaaaaggggt gactattaag aatcaaaatc tttttgttaa tactgataaa tatgatttat 3481 actatatcgt tgaaagagat ggaaaactaa taaaagatgg ttatctaaat gtatctgtag 3541 ctccagatga agaaaaatat atagaacttc caataggaaa ttacaatttt cctgaagaaa 3601 ttgtacttac aacctcatta agattagcac aagctacact ttgggcagaa aaaggatatg 3661 aaatagcatt tggacaaaag gttattaaag aaaaatcaga tatgaataat cataattcag 3721 agtctaaaat gaagatcatt catggagatg taaacatagg ggttcacgga aaagatttca 3781 aggctatatt ctctaaacaa gagggaggaa tcgtatcctt gagatataat aataaggagt 3841 ttataacgag aacgccaaaa actttctatt ggagagcaac aacagataat gatagaggaa 3901 atagacatga atttagatgc agtcaatggc tggctgctac tatggggcag aagtatgtgg 3961 atttttcagt tgaggaattt gatgagaaga ttacattata ttatacttat caattgccaa 4021 cagtgccatc tactaatgtt aagataactt atgaagtatc tggagaagga ataattaaag 4081 taaatgttaa gtataaagga gttagcggat tacctgaatt gcctgtacta ggaatggatt 4141 ttaaattatt agccgaattt aattcattta gctggtatgg aatggggcca gaagaaaact 4201 atatagacag atgtgaaggt gcaaaacttg gaatatatga gagtacacaa tagaaaatct 4261 atcaaggtat ttagtaccac aagaatgtgg taacaggata ggaactagat gggtagtagt 4321 taaaaatcat aagaatgaag gtcttaaatt tacttatgtt aaagttccat ttgaatttag 4381 tgttttacca tacagcagca tggaattaga aaattcactt catatagaag aattaccatc 4441 tgttaatttt acacattgtg aatataatag gtaaacaaat gggtgttggc ggagatgcaa 4501 tgctggggag caccatgata cctaaattct gtatagattc aagtaaggat ttagaatata 4561 gttttataat ttctaaaatt atactacgca catatgggaa ctatagatat ccaaaacaaa 4621 acttagactt atgcaataat ttacgaaagg acaggtactc tgttgtttcg gttactaaga 4681 ataagttgag gctttctaac atcataagtt gcaccatttc agcatgctcc cgagacaagc 4741 tcgtgacaag caaaaatgga acaacttatg atgaagaaat gcctgcaaca tattctttaa 4801 tgtaacactg cacaaaagag tacctgtcct ttctgatata gcagattttt caagctataa 4861 gtatatctca cgaaatcata aatattttga ttccgaaaag ctatgaaaat atcgctgaag 4921 gttctaagca gctggttgtg tgcaccttag catgctccaa ctttcagttt gacaagctaa 4981 aatggaacaa tctacagctc aagaaacttt aacagctcat tttcaaatgt tttctacaca 5041 aatatattta tatttctagt gaagatatga aattaaattt ttagcgactt tgtaaatatg 5101 ttaatctaat atacgaattc // LOCUS ECOPNCB 1490 bp ds-DNA BCT 09-AUG-1990 DEFINITION E.coli nicotinic acid phosphoribosyl transferase (pncB) gene, complete cds. ACCESSION J05568 KEYWORDS nicotinic acid phosphoribosyl transferase. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1490) AUTHORS Wubbolts,G., Terpstra,P., Van Beilen,J.B., Kingma,J., Meesters,H.A.R. and Witholt,B. TITLE Variation of cofactor levels in Escherichia coli: Sequence analysis and expression of the pncB gene encoding nicotinic acid phosphoribosyl transferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by P.Terpstra, 31-MAY-1990. Nicotinic acid phosphoribosyl transferase is the first enzyme of the three enzyme Preiss-Handler pathway leading to the synthesis of NAD. The protein sequence shows similarity to orotate phosphoribosyl transferase (pyr5) from Dictyostelium discoideum (acc P09556, SWISS-PROT) FEATURES from to/span description pept 216 1418 nicotinic acid phosphoribosyl transferase (pncB) (EC 2.4.2.11) mRNA 158 1450 nicotinic acid phosphoribosyl transferase mRNA (3' end put.) signal 124 129 -35 region signal 146 151 -10 region rpt 170 185 inverted repeat binding 197 202 ribosome binding site signal 1426 1450 rho-independent transcription termination signal BASE COUNT 348 a 374 c 364 g 404 t ORIGIN 1 tgttgcgtaa tgcgtatgca gaatcttcat cttttcaggt acaaacgcct ttattgctac 61 atttttataa catacagcgc gtaatgccat cgaccagaaa ggtggcatat ggtgtgatcg 121 gggttcaata aattgcgaaa caaggtatac tccagcagtt cctgaagatg tttattgtac 181 taaacgctcc tgtacgagga cgctactgcg cacctatgac acaattcgct tctcctgttc 241 tgcactcgtt gctggataca gatgcttata agttgcatat gcagcaagcc gtgtttcatc 301 actattacga tgtgcatgtc gcggcggagt ttcgttgccg aggtgacgat ctgctgggta 361 tttatgccga tgctattcgt gaacaggttc aggcgatgca gcacctgcgc ctgcaggatg 421 atgaatatca gtggctttct gccctgcctt tctttaaggc cgactatctt aactggttac 481 gcgagttccg ctttaacccg gaacaagtca ccgtgtccaa cgataatggc aagctggata 541 ttcgtttaag cggcccgtgg cgtgaagtca tcctctggga agttcctttg ctggcggtta 601 tcagtgaaat ggtacatcgc tatcgctcac cgcaggccga cgttgcgcaa gccctcgaca 661 cgctggaaag caaattagtc gacttctcgg cgttaaccgc cggtcttgat atgtcgcgct 721 tccatctgat ggattttggc acccgtcgcc gtttttctcg cgaagtacaa gaaaccatcg 781 ttaagcgtct gcaacaggaa tcctggtttg tgggcaccag caactacgat ctggcgcgtc 841 ggctttccct cacgccgatg ggaacacagg cacacgaatg gttccaggca catcagcaaa 901 tcagcccgga tctagccaac agccagcgag ctgcacttgc tgcctggctg gaagagtatc 961 ccgaccaact tggcattgca ttaaccgact gcatcactat ggatgctttc ctgcgtgatt 1021 tcggtgtcga gttcgctagt cggtatcagg gcctgcgtca tgactctggc gacccggttg 1081 aatggggtga aaaagccatt gcacattatg aaaagctggg aattgatcca cagagtaaaa 1141 cgctggtttt ctctgacaat ctggatttac gcaaagcggt tgagctatac cgccacttct 1201 cttcccgcgt gcaattaagt tttggtattg ggactcgcct gacctgcgat atcccccagg 1261 taaaacccct gaatattgtc attaagttgg tagagtgtaa cggtaaaccg gtggcgaaac 1321 tttctgacag ccctggcaaa actatctgcc atgataaagc gtttgttcgg gcgctgcgca 1381 aagcgttcga ccttccgcat attaaaaaag ccagttaata tcatcaggga gctaatcggc 1441 tccctttttt tacctttaat tccgaaatct ttcgctgcat ttgcgaattc // LOCUS NEUCCON13 2728 bp ds-DNA PLN 09-AUG-1990 DEFINITION N.crassa conidiation-specific protein (con-13) gene, complete cds. ACCESSION M35120 KEYWORDS conidiation-specific protein. SOURCE N.crassa (strain 74-OR23-1A) DNA, clone pCon10a. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 2728) AUTHORS Hager,K.M. and Yanofsky,C. TITLE Genes expressed during conidiation in Neurospora crassa: Molecular characterization of con-13 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.M.Hager, 12-JUN-1990. Author address:K.M.Hager: Dept. of Physiology UCLA Medical School 10833 Le Conte Avenue Los Angeles, CA 90024-1751 E-mail:COTRAN%VXBIO.SPAN@STAR.STANFORD.EDU FEATURES from to/span description pept 1009 1275 conidiation-specific protein (con-13), exon 1 1333 1847 conidiation-specific protein, exon 2 1910 2150 conidiation-specific protein, exon 3 pre-msg 922 2367 con-13 mRNA and introns (alt.) pre-msg 927 2367 con-13 mRNA and introns (alt.) pre-msg 936 2367 con-13 mRNA and intron (alt.) pre-msg 946 2367 con-13 mRNA and intron (alt.) IVS 1276 1332 con-13 intron A IVS 1848 1909 con-13 intron B site 2364 2367 polyadenylation site BASE COUNT 653 a 695 c 720 g 660 t ORIGIN Linkage group IV. 1 gatctcatca tctgaaacgc cgcctgagtc aatgactctt ggcaatcggg ctctgcgtcc 61 ggctagatag acagcgtccc actgatacag acttggtaag ctgccacagt tgccaagttt 121 ttatatcgat tattctttga acttccaagg acagtcttca agggcgcttt ctgtctcagc 181 atcgggagat atgacgcccg tggttcgtat accaatggtt cggcactaag gcgctgcatt 241 tgactcggag atattgacgc ctgccccctt ttgagaggag actgagtgag cgaggcccaa 301 tactatcacc acagttgcgg ttagctgccg agacttatcg gtcaacaccg aaatattggc 361 ccagaagggc aacaaaacgg gctgtcgatg gcttgcaacc attgatatcc ctgattgcca 421 ttcctacact accgcccatt cttcattcaa acctgactct cttactccct ttacagtcta 481 gcagatctgg acgtacctgc atgtaatgcg gccaacgggg ctggtaagct gaacacacca 541 ttcggagcgg ctggcaagtc tgtcatgccc gatcgacagc acatgtacta gactatctta 601 agcctagttc cgtgttcaga aacatccggt ttgattgcga atcaacagta cattgatgtt 661 catccaccgg actctaaacc gatcagctaa ttgttggcgg agcggagttc atcgcgggcg 721 taggaaacaa ggttgatgtt acccgtaaat ggaaatcgtg cttcgctcac ggcgttgctc 781 cgaagtaggg tgaagaggtc cgttggctgt gatggtttgc gctggtgtgt gtcaacgctt 841 agtgatgctg gtgatccaac tccgatccaa atgacaaagc aatgcatata agaaggactg 901 ggcatcacca acagcgcaac ggcggcagac acgaagccct agctcgacaa gcagccttca 961 taccccgacc aaaaagtcac acttgtcgta ccgtaacctc gtcgcaagat gccccaggct 1021 catttcttcg cgttgctgct tgcagccgtt gtaccggccg ttttggcgga cggtcccccg 1081 gaatcgatgg gcgagaagtt cagcggcctc aacgttctgg atgggaacgg cggacttcaa 1141 agtttgaccc cgacacccta caccataagt caatggcctt ggggtactgt acccaagctg 1201 tgctatgaca cgtctgtcaa caacaagtac tgcaacccgt acgatctcga agtatacgat 1261 gtcagataca cggatgtagg taaaagactt gcctcggatt cggaacctgt gcttacctta 1321 acttgacaat agtgccccat tcccaccacc gtctgccgat gcaagaactc acctatggcc 1381 atagacacca ttgcgcagcg tgtcggccaa ctccctgtca aggctcgcca gtataatggc 1441 tatgtgtcca gctttgcggg agacatgtgc tcagcctaca gcgatagctt caacaactac 1501 ttctttggcg actgcggcaa ttccgagtcc gtcttcttcc atgagctcag ccacaacctt 1561 gaccgtcacg ttgcaggggc gtccatcaac gattggtact ccctttcgca agactggaag 1621 gataccgttg ccaaggacac ttgcgtcgca gaccactatt ccaaggccag ctggctcgag 1681 gcatatgccc aggtgggagt catggctgga tacgatgcta cggtacagtc tatctatacc 1741 caaaatgtcg gctgtatggt caatcaggtc aagaaggtgg ttggacagtt gaacagtgtc 1801 tggcgtaaac agcctgggca gatgtgcgat cgttactgga tcaaggagta agtttctttc 1861 aacaagaccc attttcttga tgaccctgtg ctgaccggaa tgtaaacagc accacggttt 1921 gcatgggacc tgatgcggaa gccagtggcc actgtcaagc atccaaagct gatgtcgcgg 1981 cggagtctgg tggtgtaaac ccagtgttgc cggacgggca gcagaagaag cacgacgcct 2041 tggtcaagga gcttcagcgt cacgccgagg ccgcggccgg catttcttcc ggaaaaccgg 2101 cggccgatag aaagaccaag ggtaagaagg gtaccaaatt cagggtctga agcgggaact 2161 atgatcgatt ccaggtcctg ggctctagct gtgagttcag tcagggtgtt gaggaagttg 2221 cgaggcctca gttgtgagcg acgtcatcaa accgtctcct tttgggataa tgataacctt 2281 ttatttctgg ataactggga caggttaggc tgtctttgtc gatagactag gtacgtaaga 2341 attgatttga tgcttgttcg atgcttttaa gttgttgtcg cttgtggttg cgaggtagtc 2401 ggcaggtttg tttggataga cgggagacgc ccactcgcac ccagggcgat gaataacgaa 2461 ggccgatggc tctttccatg tgggaaatac acaagtctgg cattgtccac ttgtttgtct 2521 tcgagcgggg ttacgatttc tgtcaagccc tttgctcctt tcttccgaga acaaaggaag 2581 ttttcgatcc agatcgccaa catccgaaaa gggaggaata gttcgatcga tgtaccttga 2641 cggctcggcc atcgatctga tctgcatttc ccactctgga ttccagggga agggtcatat 2701 gatggaaacg agatcgaaac ccattgag // LOCUS VVUVVHAB 2237 bp ds-DNA BCT 09-AUG-1990 DEFINITION V.vulnificus cytolysin (vvhA) and vvhB gene (pot.), complete cds. ACCESSION M34670 KEYWORDS cytolysin; cytotoxin; hemolysin; toxin. SOURCE V.vulnificus (strain EDL174) DNA, clone pCVD702. ORGANISM Vibrio vulnificus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 2237) AUTHORS Yamamoto,K., Wright,A.C., Kaper,J.B. and Morris,J.G. TITLE The cytolysin gene of Vibrio vulnificus: Sequence and relationship to Vibrio cholerae El Tor hemolysin JOURNAL Infect. Immun. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.B.Kaper, 29-MAY-1990. FEATURES from to/span description pept 745 2160 cytolysin (vvhA) precursor sigp 745 804 cytolysin signal peptide matp 805 2157 cytolysin pept 237 743 pot. cytolysin (vvhB) signal 55 60 -35 region signal 80 85 -10 region signal 87 92 -35 region signal 110 115 -10 region signal 184 189 -35 region signal 206 211 -10 region signal 2185 2219 transcription termination signal binding 54 69 CRP binding site binding 59 74 Fur binding site binding 185 199 Fur binding site binding 226 231 vvhB ribosome binding site binding 730 735 vvhA ribosome binding site BASE COUNT 639 a 498 c 509 g 591 t ORIGIN 1 tatattagat cacttttaaa acaataatag atcagatatt aatctgttga ttttgtgata 61 atgagccaaa aaatactttt attttattta tatgaaatat tttcaggatt attaataaat 121 agccaacagg attttggtgc atatctattc tcaaggacga accaaacaat ctccatacaa 181 atattaatgt tatggagaaa ataacaataa taacccttac tcgtaatgag gaatctatgc 241 ttaataacaa aaatagaaat gtaggacgcc ttaccctact ctgctgtttg tttgcggcga 301 atacttttgc tgatgttcaa attttgggca gcgaaagtga gctttcacaa accattgccg 361 atcagtacca acaaaatgtc acgctgttta acggccagct aaacagtaat gatgtgttgt 421 atgtcaatgt aggaacagca accgatgacg aaatcactca agcaaaaagt catatcatct 481 ccggtagcac cgtggtgatt gatttgactc aaattgctgg tgacgacgca aggcttgatt 541 ggagccaaaa actcactggt ttaggactgt cagcgcctgt tgtggttacg ggggtttatc 601 aaggcgacgc cttagtcaat gcgattgtca gcgatgtcac cgacgagaat gacaacccaa 661 tcaacgatcc ccaagccgag ttagagagcg ttaaactttc tctcactcat gccctagacc 721 gcttccaatc tgagggaaaa taagatgaaa aaaatgactc tgtttaccct ttctctttta 781 cgtaccgcgg tacaggttgg cgcacaagaa tatgtgccga ttgttgagaa acctatttac 841 atcaccagct caaagattaa gtgtgtgttg cacacaagcg gtgatttcaa cgccacacga 901 gactggtgta atgcgggtgc ttccatcgat gttcgcgtca atgtggcaca aatgcgctcg 961 gtacaatcgg caacgtcaga tggttttact cctgacgcca aaattgtccg tttcaccgtc 1021 gatgccgaca agcctggcac gggtattcat ttggttaacg agctacagca agatcacagc 1081 tggttccaga gttgggcaaa ccgccgcact tacattggtc cattcgccag cagttacgac 1141 ctttgggtga aacccgtttc tggttacaca ccgaaaaaag cccgtgacct accgcagaat 1201 gagaacaaaa actaccaaca ccgcgatact tacggttact ccatcggtat taacggcaaa 1261 gtaggtgcgg aagtgaacaa agacggcccg aaagtgggtg gcgaagtcag tggctcattt 1321 acctacaact actcgaagac cttggtgttt gatacaaaag actatcgcat caacaaccgt 1381 tcatcattga gtgattttga tatttcattc gagcgtgaat ttggggaatg tgatgaactg 1441 cgccgccaag agcttggatg ctatttcacc gccgctcact ggggcagtgg ctgggtattt 1501 gataagacga agttcaaccc tatctcttat tccaacttca aaccgaacta tgacgttttg 1561 tacgaagcgc ccgtgtctga aactggcgta acggattttg agatgggcgt gaaactcaac 1621 tatcgtgcac gctttggtac cgttcttcct tcagcgctgt tttcggttta cggctctgcg 1681 ggctcgtcaa ccaacagcag tactgtgaaa caacgtattc gcatcgactg gaatcaccca 1741 ctgtttgaag cggaacgaca cgttacactg cagtcactga gcaacaacga tctctgcctg 1801 gatgtttatg gtgagaacgg tgacaaaacg gttgcgggtg gttcggttaa cggctggagc 1861 tgtcacggca gttggaacca agtttggggc ctagataaag aagaacgtta tcgtagccga 1921 gtggcatccg atcgttgttt gaccgtaaac gcagacaaaa cgctcacagt cgaacagtgt 1981 ggtgcgaact tagcacagaa atggtattgg gaaggcgata agctcattag ccgctatgtt 2041 gatggcagta atactcgcta ccttctaaac attgttggtg gtcgtaatgt tcaagtaacc 2101 cctgaaaatg aagcaaatca ggcgcgttgg aaacccacat tacaacaagt caaactctag 2161 gctctgttga ccttagcgat atccaaacgc tccctgtata ctagggagcg tttttcttta 2221 ttcgccatct attcgtc // LOCUS TOBCPCG 155844 bp ds-DNA circular ORG 09-AUG-1990 DEFINITION N.tabacum (var. Bright Yellow 4) chloroplast, complete genome. ACCESSION Z00044 KEYWORDS 16S ribosomal RNA; 23S ribosomal RNA; 4.5S ribosomal RNA; 5S ribosomal RNA; ATP synthetase; ATPase; NADH dehydrogenase; RNA polymerase; autonomous replication; carboxylase; chloroplast; complete genome; cytochrome; cytochrome b559; cytochrome b6; cytochrome f; initiation factor; phosphoprotein; ribosomal protein; ribosomal protein L14; ribosomal protein L16; ribosomal protein L2; ribosomal protein L20; ribosomal protein L22; ribosomal protein L23; ribosomal protein L33; ribosomal protein S11; ribosomal protein S12; ribosomal protein S15; ribosomal protein S18; ribosomal protein S19; ribosomal protein S2; ribosomal protein S3; ribosomal protein S7; ribosomal protein S8; ribulose bisphosphate carboxylase; transfer RNA-Ala; transfer RNA-Arg; transfer RNA-Asn; transfer RNA-Asp; transfer RNA-Cys; transfer RNA-Glu; transfer RNA-Gly; transfer RNA-His; transfer RNA-Ile; transfer RNA-Leu; transfer RNA-Lys; transfer RNA-Met; transfer RNA-Phe; transfer RNA-Pro; transfer RNA-Ser; transfer RNA-Thr; transfer RNA-Trp; transfer RNA-Tyr; transfer RNA-Val. SOURCE Nicotiana tabacum (var. Bright Yellow 4) chloroplast DNA, clone pHC79 (IR-A and IR-B). ORGANISM Chloroplast Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae; Nicotiana tabacum. REFERENCE 1 (sites) AUTHORS Shinozaki,K., Ohme,M., Tanaka,M., Wakasugi,T., Hayashida,N., Matsubayashi,T., Zaita,N., Chunwongse,J., Obokata,J., Yamaguchi-Shinozaki,K., Ohto,C., Torazawa,K., Meng,B.Y., Sugita,M., Deno,H., Kamogashira,T., Yamada,K., Kusuda,J., Takaiwa,F., Kato,A., Tohdoh,N., Shimada,H. and Sugiura,M. TITLE The complete nucleotide sequence of tobacco chloroplast genome: Its gene organization and expression JOURNAL EMBO J. 5, 2043-2049 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 155844) AUTHORS Sugiura,M. JOURNAL Unpublished (1986) Biology Dept, Nagoya Univ., Nagoya 464, Japan STANDARD full staff_review COMMENT The circular tobacco chloroplast DNA sequence is presented in a linearized form by cutting at the junction between IRA and LSC. The DNA strand which codes for the large subunit of ribulose-1,5-bisphosphate carboxylase is designated as A strand and the complementary strand as B strand. The nucleotide sequence of the B strand is presented. Large single copy region (LSC): 1-86684 (86684 bp) Inverted repeat B (IR-B): 86685-112023 (25339 bp) Small single copy region (SSC): 112024-130505 (18482 bp) Inverted repeat A (IR-A): 130506-155844 (25339 bp) Rps12 consists of three exons. There are two sets of exons 2 and 3. One set is located on the same strand in IR-B 28 kb upstream of exon 1. The oteher set is located on the opposite strand in IR-A 69 kb downstream of exon 1. The tobacco rps12 gene probably consists of three transcription units and requires trans-splicing. The chloroplast DNA segments capable of replication in yeast, ars1 and ars2, are located at positions 112768-113117 and 14570-15088 respectively. Seven open reading frames, (RF236, RF548, RF862, stop codon to stop codon) and (ORF151, ORF90, ORF80, ORF134, start codon to stop codon) are present near the rpoB gene. Four or these ORFs show some homology to portions of the beta'-subunit sequence of E.coli RNA polymerase [1]. Most open reading frames indicated in FEATURES are from start codon to stop codon. The intron boundaries for the ndhA and ndhB are not known and thus the largest possible intron is indicated (from stop codon to stop codon). FEATURES from to/span description tRNA 80 6 (c) His-tRNA (GUG) tRNA 4407 4371 (c) Lys-tRNA (UUU), exon 1 1844 1810 (c) Lys-tRNA (UUU), exon 2 tRNA 7487 7416 (c) Gln-tRNA (UUG) tRNA 8719 8632 (c) Ser-tRNA (GCU) tRNA 9499 9521 Gly-tRNA (UCC), exon 1 10213 10260 Gly-tRNA (UCC), exon 2 tRNA 10430 10501 Arg-tRNA (UCU) tRNA 28783 28854 Cys-tRNA (GCA) tRNA 31999 31926 (c) Asp-tRNA (GUC) tRNA 32191 32108 (c) Tyr-tRNA (GUA) tRNA 32323 32251 (c) Glu-tRNA (UUC) tRNA 33172 33243 Thr-tRNA (GGU) tRNA 37223 37132 (c) Ser-tRNA (UGA) tRNA 38050 38120 Gly-tRNA (GCC) tRNA 38421 38348 (c) fMet-tRNA (CAU) tRNA 47111 47197 Ser-tRNA (GGA) tRNA 48577 48505 (c) Thr-tRNA (UGU) tRNA 49288 49322 Leu-tRNA (UAA), exon 1 49826 49875 Leu-tRNA (UAA), exon 2 tRNA 50232 50304 Phe-tRNA (GAA) tRNA 54390 54353 (c) Val-tRNA (UAC), exon 1 53781 53747 (c) Val-tRNA (UAC), exon 2 tRNA 54581 54653 Met-tRNA (CAU) tRNA 68880 68807 (c) Trp-tRNA (CCA) tRNA 69118 69045 (c) Pro-tRNA (UGG) tRNA 88770 88697 (c) Ile-tRNA (CAU) tRNA 96507 96427 (c) Leu-tRNA (CAA) tRNA 102459 102530 Val-tRNA (GAC) tRNA 104547 104583 Ile-tRNA (GAU), exon 1 105291 105325 Ile-tRNA (GAU), exon 2 tRNA 105390 105427 Ala-tRNA (UGC), exon 1 106137 106171 Ala-tRNA (UGC), exon 2 tRNA 109973 110046 Arg-tRNA (ACG) tRNA 110699 110628 (c) Asn-tRNA (GUU) tRNA 116067 116146 Leu-tRNA (UAG) tRNA 131830 131901 Asn-tRNA (GUU) tRNA 132556 132483 (c) Arg-tRNA (ACG) tRNA 137139 137102 (c) Ala-tRNA (UGC), exon 1 136392 136358 (c) Ala-tRNA (UGC), exon 2 tRNA 137982 137946 (c) Ile-tRNA (GAU), exon 1 137238 137204 (c) Ile-tRNA (GAU), exon 2 tRNA 140070 139999 (c) Val-tRNA (GAC) tRNA 146022 146102 Leu-tRNA (CAA) tRNA 153759 153832 Ile-tRNA (CAU) rRNA 102758 104246 16S rRNA rRNA 106325 109134 23S rRNA rRNA 109236 109338 4.5S rRNA rRNA 109595 109715 5S rRNA rRNA 132934 132814 (c) 5S rRNA rRNA 133293 133191 (c) 4.5S rRNA rRNA 136204 133395 (c) 23S rRNA rRNA 139771 138283 (c) 16S rRNA RNA 32347 31836 (c) Asp-tRNA, Tyr-tRNA, Glu-tRNA RNA precursor pept 1595 534 (c) PSII 32kd protein (psbA) pept 6211 6172 (c) ribosomal protein S16 (rps16), exon 1 5311 5094 (c) ribosomal protein S16 (rps16), exon 2 pept 12148 10625 (c) ATPase alpha subunit (atpA) pept 13452 13308 (c) ATPase I subunit (atpF), exon 1 12612 12203 (c) ATPase I subunit (atpF), exon 2 pept 14099 13854 (c) ATPase III subunit (atpH) pept 16001 15258 (c) ATPase a subunit (atpI) pept 16938 16228 (c) ribosomal protein S2 (rps2) pept 27501 24289 (c) RNA polymerase beta subunit (rpoB) pept 34462 35523 PSII D2 protein (psbD) pept 35471 36892 PSII 44kd protein (psbC) pept 38873 38571 (c) ribosomal protein S14 (rps14) pept 41200 38996 (c) PSI P700 apoprotein A2 (psaB) pept 43478 41226 (c) PSI P700 apoprotein A1 (psaA) pept 48133 47528 (c) ribosomal protein S4 (rps4) pept 55276 54875 (c) ATPase epsilon subunit (atpE) pept 56769 55273 (c) ATPase beta subunit (atpB) pept 57587 59020 RuBisCO large subunit (rbcL) pept 64327 65289 cytochrome f (petA) pept 66860 66741 (c) PSII component (psbF) pept 67121 66870 (c) PSII cytochrome b559 (psbE) pept 70123 70323 ribosomal protein L33 (rpl33) pept 70510 70815 ribosomal protein S18 (rps18) pept 71401 71015 (c) ribosomal protein L20 (rpl20) pept 72326 72213 (c) ribosomal protein S12 A (rps12A), exon 1 100851 100620 (c) ribosomal protein S12 A (rps12A), exon 2 100083 100058 (c) ribosomal protein S12 A (rps12A), exon 3 pept 72326 72213 (c) ribosomal protein S12 B (rps12B), exon 1 141678 141909 ribosomal protein S12 B (rps12B), exon 2 142446 142471 ribosomal protein S12 B (rps12B), exon 3 pept 74950 76476 PSII P680 apoprotein (psbB) pept 77098 77319 PSII 10kd phosphoprotein (psbF) pept 77449 77454 cytochrome b6 (petB), exon 1 78208 78849 cytochrome b6 (petB), exon 2 pept 79845 80264 cytochrome b/f complex subunit 4 (petD) pept 81465 80452 (c) RNA polymerase alpha subunit (rpoA) pept 81947 81531 (c) ribosomal protein S11 (rps11) pept 82465 82175 (c) RF96 pept 83004 82600 (c) ribosomal protein S8 (rps8) pept 83544 83173 (c) ribosomal protein L14 (rpl14) pept 85093 85085 (c) ribosomal protein L16 (rpl16), exon 1 84064 83669 (c) ribosomal protein L16 (rpl16), exon 2 pept 85896 85240 (c) ribosomal protein S3 (rps3) pept 86348 85881 (c) ribosomal protein L22 (rpl22) pept 86680 86402 (c) ribosomal protein S19 (rps19) pept 88231 87841 (c) ribosomal protein L2 (rpl2), exon 1 87174 86741 (c) ribosomal protein L2 (rpl2), exon 2 pept 88531 88250 (c) ribosomal protein L23 (rpl23) pept 100004 99537 (c) ribosomal protein S7 (rps7) pept 125398 125135 (c) ribosomal protein S15 (rps15) pept 142525 142992 ribosomal protein S7 (rps7) pept 153998 154279 ribosomal protein L23 (rpl23) pept 154298 154688 ribosomal protein L2 (rpl2), exon 1 155355 155788 ribosomal protein L2 (rpl2), exon 2 mRNA 1680 441 (c) psbA mRNA (441 +/- 2 bp) mRNA 57025 54637 (c) atpB, atpE mRNA (alt.; 54637 +/- 1 bp) mRNA 57025 54676 (c) atpB, atpE mRNA (alt.; 54676 +/- 2 bp) mRNA 57405 59161 rbcL mRNA IVS 4370 1845 (c) Lys-tRNA intron IVS 6171 5312 (c) rps16 intron IVS 9522 10212 Gly-tRNA intron (no splice consensus) IVS 13307 12613 (c) atpF intron IVS 49323 49825 Leu-tRNA intron (no splice consensus) IVS 54352 53782 (c) Val-tRNA intron IVS 77455 78207 petB intron (no splice consensus) IVS 85084 84065 (c) rpl16 intron IVS 87840 87175 (c) rpl2 intron IVS 98349 97838 (c) ndhB intron IVS 72326 72213 (c) rps12A intron A IVS 141678 141909 rps12A intron B (no splice consensus) IVS 142446 142471 rps12A intron C (no splice consensus) IVS 104584 105290 Ile-tRNA intron (no splice consensus) IVS 105428 106136 Ala-tRNA intron (no splice consensus) IVS 123219 122140 (c) ndhA intron IVS 137101 136393 (c) Ala-tRNA intron IVS 137945 137239 (c) Ile-tRNA intron IVS 144180 144690 ndhB intron (no splice consensus) IVS 154689 155354 rpl2 intron (no splice consensus) rpt 86685 112023 inverted repeat B (IR-B) rpt 130506 155844 inverted repeat A (IR-A) site 1 86684 large single copy region (LSC) site 112024 130505 small single copy region (SSC) ORF 3658 2129 (c) ORF-509A cds ORF 7724 8020 ORF98 cds ORF 19753 17165 (c) RF862 cds ORF 20277 19873 (c) ORF134 cds ORF 20423 20181 (c) ORF80 cds ORF 20646 20374 (c) ORF90 cds ORF 21475 20765 (c) RF236 cds ORF 23127 21481 (c) RF548 cds ORF 24283 23828 (c) ORF151 cds ORF 37558 37241 (c) ORF105 cds ORF 37586 37774 ORF62 cds ORF 44264 44497 ORF77 cds ORF 45394 45146 (c) ORF82 cds ORF 46464 46240 (c) ORF74A cds ORF 48933 49145 ORF70A cds ORF 51457 50981 (c) ORF158 cds ORF 52417 51563 (c) bhpB cds ORF 52659 52297 (c) bhpA cds ORF 59785 61323 ORF512 cds ORF 62630 > 62630 ORF184 ORF 63407 64096 ORF229 ORF 66168 66467 ORF99A ORF 67580 67269 (c) ORF103 ORF 72686 72465 (c) ORF73 ORF 73547 73323 (c) ORF74B ORF 82162 82049 (c) ORF37 ORF 88883 90628 ORF581 ORF 90598 95724 ORF1708 ORF 95815 96078 ORF87 ORF 146472 96057 (c) ORF115 ORF 96116 96394 ORF92 ORF 96553 96792 ORF79 ORF 98889 98350 (c) ndhB, exon 1 97837 97047 (c) ndhB, exon 2 ORF 140581 101948 (c) ORF131 ORF 102099 102311 ORF70B ORF 110820 110593 (c) ORF75 ORF 111025 112077 ORF350 ORF 114198 112066 (c) ndhF ORF 116250 117191 ORF313 ORF 118958 117429 (c) ndhD ORF 119860 119555 (c) ndhE ORF 120383 120084 (c) ORF99B ORF 120612 120196 (c) ORF138 ORF 121512 121009 (c) ORF167 ORF 123840 123217 (c) ndhA, exon 1 122109 121597 (c) ndhA, exon 2 ORF 125023 123842 (c) ORF393 ORF 126482 125796 (c) ORF228 ORF 127561 126740 (c) ORF273 ORF 131501 127767 (c) ORF1244 ORF 131709 131936 ORF75 ORF 140186 140581 ORF131 ORF 140430 140218 (c) ORF70B ORF 145976 145737 (c) ORF79 ORF 146125 146472 ORF115 ORF 146413 146135 (c) ORF92 ORF 146714 146451 (c) ORF87 ORF 151931 146805 (c) ORF1708 ORF 153646 151901 (c) ORF581 ORF 96404 96057 (c) ORF 115 anticdn 45 43 (c) His-tRNA anticodon gtg anticdn 4376 4374 (c) Lys-tRNA anticodon ttt anticdn 7455 7453 (c) Gln-tRNA anticodon ttg anticdn 8685 8683 (c) Ser-tRNA anticodon gct anticdn 10222 10224 Gly-tRNA anticodon tcc anticdn 10463 10465 Arg-tRNA anticodon tct anticdn 28815 28817 Cys-tRNA anticodon gca anticdn 31965 31963 (c) Asp-tRNA anticodon gtc anticdn 32157 32155 (c) Tyr-tRNA anticodon gta anticdn 32289 32287 (c) Glu-tRNA anticodon ttc anticdn 33204 33206 Thr-tRNA anticodon ggt anticdn 37189 37187 (c) Ser-tRNA anticodon tga anticdn 38082 38084 Gly-tRNA anticodon gcc anticdn 38387 38385 (c) fMet-tRNA anticodon cat anticdn 47145 47147 Ser-tRNA anticodon gga anticdn 48544 48542 (c) Thr-tRNA anticodon tgt anticdn 49841 49843 Leu-tRNA anticodon taa anticdn 50265 50267 Phe-tRNA anticodon gaa anticdn 54357 54355 (c) Val-tRNA anticodon tac anticdn 54614 54616 Met-tRNA anticodon cat anticdn 68846 68844 (c) Trp-tRNA anticodon cca anticdn 69084 69082 (c) Pro-tRNA anticodon tgg anticdn 88737 88735 (c) Ile-tRNA anticodon cat anticdn 96474 96472 (c) Leu-tRNA anticodon caa anticdn 102491 102493 Val-tRNA anticodon gac anticdn 104579 104581 Ile-tRNA anticodon gat anticdn 105423 105425 Ala-tRNA anticodon tgc anticdn 110007 110009 Arg-tRNA anticodon acg anticdn 110667 110665 (c) Asn-tRNA anticodon gtt anticdn 116101 116103 Leu-tRNA anticodon tag anticdn 131862 131864 Asn-tRNA anticodon gtt anticdn 132522 132520 (c) Arg-tRNA anticodon acg anticdn 137106 137104 (c) Ala-tRNA anticodon tgc anticdn 137950 137948 (c) Ile-tRNA anticodon gat anticdn 140038 140036 (c) Val-tRNA anticodon gac anticdn 146055 146057 Leu-tRNA anticodon caa anticdn 153792 153794 Ile-tRNA anticodon cat BASE COUNT 47824 a 29991 c 28992 g 49037 t ORIGIN 2692 bp upstream of BamHI site. 1 ttatgggcga acgacgggaa ttgaacccgc gcatggtgga ttcacaatcc actgccttga 61 tccacttggc tacatccgcc ccctcgccta cttacattcc gtttttacat tatttaaatt 121 agaaaacaaa agattcaagt tcgaatatag ctcttctttc ttatttcaat gatattatta 181 tttcaaagat aagagatatt caaagataag agataagaag aagtcaaaat ttgatttttt 241 ttttggaaaa aaaaaatcaa aaagatatag taacattagc aagaagagaa acaagttcta 301 tttcacaatt taaacaaata caaaatcaaa atagaatact caatcatgaa taaatgcaag 361 aaaataacct ctccttcttt ttctataatg taaacaaaaa agtctatgta agtaaaatac 421 tagtaaataa ataaaaagaa aaaaagaaag gagcaatagc accctcttga tagaacaaga 481 aaatgattat tgctcctttc ttttcaaaac ctcctataga ctaggccagg atcttatcca 541 tttgtagatg gagcttcgat agcagctagg tctagaggga agttgtgagc attacgttca 601 tgcataactt ccataccaag gttagcacgg ttaatgatat cagcccaagt attaattaca 661 cggccttgac tgtcaactac agattggttg aaattgaaac catttaggtt gaaagccata 721 gtgctgatac ctaaagcggt aaaccagata cctactacag gccaagcagc taggaagaag 781 tgtaacgaac gagagttgtt gaaactagca tattggaaga tcaatcggcc aaaataacca 841 tgagcggcta cgatgttata agtttcttcc tcttgaccga atctgtaacc ttcattagca 901 gattcatttt ctgtggtttc cctgatcaaa ctagaagtta ccaaggaacc atgcatagca 961 ctgaataggg agccgccgaa tacaccagct acgcctaaca tgtgaaatgg gtgcataagg 1021 atgttgtgct cagcctggaa tacaatcatg aaattgaaag taccagagat tcctagaggc 1081 ataccatcag aaaaacttcc ttgaccaatt gggtagatca agaaaactgc ggtagcagct 1141 gcaacaggag ctgaatatgc aacagcaatc caaggtcgca tacccagacg gaaactaagc 1201 tcccactcac gacccatgta acaagctacg ccaagtaaga agtgtagaac aattagttca 1261 taaggaccac cgttgtataa ccattcatca acggatgccg cttcccagat tgggtaaaaa 1321 tgtaaaccta tagctgcaga agtaggaata atggcaccgg aaataatatt gtttccgtaa 1381 agtagagacc ctgaaacagg ttcacgaata ccatcaatgt ctactggagg agcagcaatg 1441 aaggcaataa taaatacaga agttgccgtc aataaggtag ggatcatcaa aacaccaaac 1501 catccaatgt aaagacggtt ttcagtgcta gttatccagt tacagaagcg accccatagg 1561 ctttcgcttt cgcgtctctc taaaattgca gtcatggtaa aatcttggtt tatttaatca 1621 tcagggactc ccaagcacac tagttttcta caaatcaaaa tagaaaatgg aaggcttttt 1681 attcaacagt ataacatgac ttatatactc gtgtcaacca aggtgtatgt agatctattc 1741 aaatttttaa tgaagttgat tggaaaaata cggacttctc tacagaaaat tagaatttcg 1801 atatgctagt gggttgcccg ggattcgaac ccggaactag tcggatggag tagataagtt 1861 ccttgttaaa taaaataaat gttaatctta aattaaataa acaagtaaag acccctcccc 1921 aagccgtgct tgcatttttc attgcacacg gctttcccta tgtatacatc agttcctttc 1981 ttatagaaat tagaaagact ttaaaaagtt gaatactcag ttgatttacc ccttaattac 2041 tattacaatc aacatttcag aatagtgaaa tttttttatc tcttcatcat ttagaaacaa 2101 atttccattt agaaaatcta agaatgaatc attgataatt cgccagatca ttgatacaaa 2161 aaatatccaa ataccaaatc cgacttctat atactcccca caaactagaa gaagctcgtg 2221 ggaaggtcaa agaaagaact tgttcttccg acgttaagaa ttcttccaat aattccgagc 2281 ccgatctttt caaaaaagtg cgtacagtac ttttgtgttt ccgagctaaa gttctagcac 2341 aagaaagtcg aagtatatac tttattcgat ataaagtctt ttttttggaa gatccgctat 2401 aataatgaaa aagatttctg catatacgcc caaatcggtc aataatatca gaatctgata 2461 aatcggacca aaccggttta ctaatgggat gccctaatac ggtacaaaag tttgctttag 2521 ctaatgatcc aatcaaagga ataattggaa caagggtatc gaacttctta attgcattat 2581 tgattagaaa tgaattttct aacatttgac tacgtaccat tgaaggattt agtcgcacac 2641 ttgaaagata gcccataaag tcacgggaat gattggataa ttggtttata tggatccttc 2701 ctgtgtgaaa gcacagagaa caatgacatt gccaaaaatt gacaaggtaa aatttccatt 2761 tattcatcaa aagaaacgtc ccttttgaag ccagaatgga ttttccttga tacctaacat 2821 aatgcatgaa aggatccttg aataaccata gggtaacctg aaaatcctta gcaaagactt 2881 ctacaagacg ttctattttt ccatagaaat atattcgttc aagaagggct ccaaaagatg 2941 ttgatcgtaa atgagaagat tggttccgta gaaagacgaa agtggattcg cattcatata 3001 cataagaatt atataagaag aagaagaatc tttgattttt ttttgaaaag gagtaaccgg 3061 gcttctttga agtaataaga ctattcaaat tccaaaattc atggagaaag aatcgtaata 3121 aatgtaaaga agaggcatct tttacccaat agcgaagagt ttgaaccaag atttccagat 3181 gaacagggta gggtattagt atatctaata cataatttag atgtgaaaaa ttgtcctcta 3241 aaaaaggaaa tgttgaatga attgatcgta aattataaga tttaaaaatc tttttgcctt 3301 cgaaagaaga taaagaagat attaatcgta gagaaaacgg aatttccaca ataaacgcaa 3361 atccctctga tatcatttga gaatacaaac tcttgttgca ccccaaaaat gaatttttgt 3421 tagaatcatt agtagaaata agaaaatgat tctgttgata cattcgagta attaaacgtt 3481 tcacaattag gaaacttaat ttattgttat aacctggatt ttccaacaaa atcgacctat 3541 ttctatttaa accatgatca tgagcaagtg cataaatata ctcctgaaag ataagtggat 3601 ataggaagtt gtgttgttgc gatctatctg gctgtaaata tctttggatt tcttccattt 3661 gaaattcgat ttgaaccaaa gacggaagat tttgagggtt atcaaatgat acatagtgcg 3721 atacagttaa aacaaagtat tttagtaaga atagatacct tggatacagg taaacttctc 3781 aacggattct ctatcatctc tttttttttt cgtttcgttt aattggtcta tgttatagtg 3841 ttataggata ataagatggt tagaaatcct ttattttttc aacctaatcg ctcttttgac 3901 ttcggaaaaa actttcttta tcaatatact gtttcttcta cacacacatc tccgtaatag 3961 aaaatggtaa tagttaggat tcattaaaaa aatggagaat ccactcatgg gacaagaaac 4021 ccttcccgca tcaggcacta ataaattttt aacgtctaat tagatcggga atcattcaaa 4081 ttaagaacaa aagctcgttg ctttttcttt ctttccctat aatttaattg aagccgcagc 4141 cctatccatt tattcattcg acccaacttt attttgttcc gttccaagaa ttctaacacg 4201 gttttatacc catctaggaa caatgaaata ttctcagaac tttccgttga tacgacatgc 4261 tatttttacc attcattccc tttcaggatc agtcgtggtc ttccaaactt taccgagagt 4321 atggacgaat ccctcacttc atccatatgt gtaaaagaga ctagccgcac ttaaaagccg 4381 agtactctac cgttgagtta gcaacccgaa gaaaatatcg aagaaaaata aataaagaga 4441 ttagacaaga caaccaaaaa ccattgaagg aataaatcta aaaaaataca ttcacatttt 4501 cgaattaatt taaaacataa aataaaacta aatagatcca cttcatttat cacaatgaat 4561 tatatttgtt cgatacactc tgttgtcata aatattgaat agtgaaaaaa aaaaaagaaa 4621 tttcaattga caacaataaa aaatattaaa aaaaaggact tgtgttagat tggcactaca 4681 aatctaatcc aaataaaata gatacaaaaa agtatagatg agagaataaa ttaagtggaa 4741 aacaaaacta caatttattt agatttattt aatccataat ggattcaatc aagttaagtg 4801 agataagcaa acttgatttc ttttttagtt ttagtagagt tccaatgaaa aacggaaaac 4861 cacccaattg aaggaaatgc ccgaattttc tatttcgagg atcaataaaa taagaggttt 4921 tgtcgttata gaacacggaa ttcaatggaa gcaatgataa aaaaatacaa atagaaaagg 4981 aaagggagga aatacaaaaa aatagaagag aaaagtcata caaagttata tacaaatgac 5041 tacccccctt tttgtatttc cttaatttat ttccttaatt gaatttcggt tgattaggat 5101 tgattaggac gaagttcctt aaaaacctcc gccttcttta aaatatcctg aacagttcct 5161 gtaggttgag cccctttttc aaggaaatat aaaatagcag gaacatttaa ataagtttga 5221 ttctttatcg gatcataaaa acccactttc cgaagatctt ttccttctct tcgggatcga 5281 acatcaattg caacgattcg atagacggct cattgggatt gatgtagatg aacaacaccc 5341 cccctagaaa cgtataggaa gctttctcct cgtacggctc gagaaaaatg attgattcga 5401 ggttttatct ctgtatggaa ttctatctaa gaaatgacaa ctgggtccat aaaatgatca 5461 aatcaattaa agatgtaagt cttttttttt cttctttctt cctgaaaatg aaaaagaaac 5521 cattcgtact ctcataactc aagttggata actttcaaac agttcaaagg aaaatctttc 5581 ggcaatttca tttattgagc ggtctttcct ccttttatgt ttgtctcgtt taaaatggat 5641 ttggattctt cagttcgatc cagttattaa gacaataaaa aaggtgtttc cttgttctgg 5701 gatcctttat ctttgtttta ttttaaatca ttgggtttag acattacttc ggtgcttttt 5761 aatcctttca aaatggcagc aacatacccc ttttgcgatt tctatgaaag aatcctacag 5821 acgatggatt cccgcgtgaa acactttgga tcgaaaagtt tgaatcaatt ccaaggaatt 5881 tttgaattgg aaacttgctc gaattggatt ctttcgattt ccataccgaa aatatattta 5941 cgaagttgtt ccaatttttt tattgattgg cattaaccct agactcttgc cccgagaaat 6001 aaattaatac tttctactcg agctccatca tggactattt acattccaag acaacaaaaa 6061 agaggggttc taatgaaaca gaaccaatga tgtcgagcca agagcacctt cattcctaca 6121 taaaatggtg gatgtacaaa tccacaacgg atcctgtcct tcaagtcgca cgttgctttc 6181 taccacatcg tttcaaacga agttttacca taacattcct ctaagaaccg gtctggaatt 6241 gattcaatta tggaatcatg aatagtcatt ggttgggctg atgtataaac accataatct 6301 atactttgtt ctatatctat atactataga gataggtgga taaatatttt tctttagtaa 6361 gaccccatcg ctaatattaa tttatctaac atattaatta atatttaata tataaatata 6421 tatagaaata ataataaata agaataataa taaataagac gaataaatga gttctttttg 6481 attctgcatc ttcacgtgac tcaataggag agattgacct atttcagact tcttcaaata 6541 gcaaagattc cgcttataag gaatgattaa aactatttat atttctaaat ttagaaagtt 6601 cccttttcga catcattatt tgaagaaaat ttgatagtta aagatcactt ttgatcatct 6661 taggaaagaa aaaagataag tctttctttt ttaattgaat catcaacgat ttcaatgatc 6721 taaaatagat aaatacacca aacaacaaat ccaatttttt tttatgagat ggataaaaaa 6781 agattaatat aaggtaagat tttcattctt attctttttt ttttttttca tctgattgat 6841 aaaatccaaa gaatggggag ggtttcgtat ctatcaattc gatcaaatag actgagcaat 6901 tgtcaccgtt tatagatatt gaaatgaatg ccttcccatt actgattaac tcctatctac 6961 cccattctat gggcctgatg cagcataaat caaaagaaaa gaggggggtg tcctagtctt 7021 tttgattttt acgaaatgcg agctgtctag gcacaaagcc aaacaagtcc agattaagtc 7081 aagtttttgc tcctattttt tgatatttta gcctaactca ttgattaaga attaagagac 7141 ttagtgaatt taattagtac caaaaatccc ctcttggcga aaagtcaaga aatccacaaa 7201 aaagaaaatg gaatctaatt aggctaattt aggggataga gaatacgaga tagggaatat 7261 agattctttc gcatctcgat tccgtttttg aaaaaaaaaa atgattcatc gaagaaaaaa 7321 atcagaaaca acaatcacat tccagctaac atttcgattt taaacagaac attgttaaaa 7381 aagcaatcta tattctcata gaatatatat atgttctggg acggaaggat tcgaacctcc 7441 gaatagcggg accaaaaccc gttgccttac cacttggcca cgccccattt agatttctat 7501 tcgatactaa gaaagtatat tgcttgtttt gtttgtttgt caactctagt ccaaatatct 7561 atagaataga ttagattggt actaggattt tgcgatgttt ttggtatgtg tagatataga 7621 attcaactta atttattgat cattacatat aattcaatta agatattgta tgaaaatatg 7681 attttttcga ttctcctttg agaaaaggag gatttttgat tgggtgggtt caaagaaaaa 7741 gaagtatttt ttgtttacct tacttacttt ccctttcctt atatcaataa cgcaatcaaa 7801 atgcaattat ctctccaaga acaaaaagtc tgttatgctt aataccttta gtttgatcgg 7861 tatctgtctt aattcgaccc ttttttcgag tagttttttc ttcggcaaat tgcccgaggc 7921 ctatgctttt ttgaatccaa tcgtagatat tatgccagtc atacctctgt ttttttttct 7981 cttagccttt gtttggcaag ctgctgtaag ttttcgatga gatccttaat aatatcctag 8041 aaaattcatg atttattcga gaaaaattct aaaataaata aaatcagata agctttaccg 8101 tttgaaacct cgattcaaac attgaaattc ttggatagtc acgagaaatc cggcttaact 8161 tatttcctta ttttttgacg ctttcccttc cagtgaaaga ccttattagg ctcctcacaa 8221 tacctaattg tgtatataaa aaaattttgg ttaatgacaa actcttagta gaaaagaatt 8281 tatgaaaatt cttttagaga aagagcttca ttgcttggtg tcaaactagg atatgcggta 8341 gaaaaatgga tgatctattc tctttttttc aaaaaaaatc atcttggaga ttgtgtaatg 8401 cttactctca aactcttcgt ttacacagta gtgatatttt ttgtttctct cttcatcttt 8461 ggattcctat ctaatgatcc cggacgtaat cctggacgtg aagaataaaa taaaaaaggt 8521 ttttccttgc ttgattttcc aattttctta tgatttggtc tattccacac atttaactaa 8581 gaataagaac aaaggatttc gaaatttgaa aaaaaaaaat caagtcatca acggaaagag 8641 agggattcga accctcggta cgattaactc gtacaacgga ttagcaatcc gccgctttag 8701 tccactcagc catctctccc aattgaaaaa gataattact acatgagata gcacataaga 8761 taaaggaaag aatctttctt tctctctttt cttctttcta tattatatag atatgtacaa 8821 cttttatcat caatttcctt tatttcttta tctaaagtaa aggaagggct cagaagagcc 8881 aagaatatca agaaaaataa agaagacctc ttttctttgt cttgattttg ttcgaaagga 8941 ccctcttatt ctcatggcct ggtctggtca gtacccagcc gggcctcttt tgttccaacg 9001 aatttgaatt tgaaaactaa aaagcctgtt atagttgtaa tatttcattt taattgaata 9061 gttaatattc aagcaacaag aaaaaattcc cattttttgt aaaagtaaaa taaaatatat 9121 aaaatagaaa attcgatcaa aataaaagtc tcatttctct ttctgctttt ttattttatg 9181 tttaccacct tactggacta aaaaaaagaa gctttcgagt attccacaat gcatttttat 9241 gttatgattt tagtggtttt gacgagccgt atctctatca aaactcctcc agcaaaagaa 9301 aagataaaac taaattctgt aatttagtta tttaaatgaa ccctcgtttc caaatctcat 9361 caaattggaa tccccccagg aaaaaagatc aacactctaa tttggatgat tctgtgacga 9421 ccctatctta tcctatcttg attaccacaa ttcccctgtt cgacaaaagt tgcatttgta 9481 tacaataatc ggattgtagc gggtatagtt tagtggtaaa agtgtgattc gttctattat 9541 cccttaaata gttaaagggt ccttcggttt gattcgtatt ccgatcaaaa acttgatttc 9601 taaaaaggat ttaatccttt tcctctcaat gacagattcg agaacaaata cacattctcg 9661 tgatttgtat ccaagggtca cttagacatt gaaaaattgg attatgaaat tgcgaaacat 9721 aattttggaa ttggatcaat acttccaatt gaataagtat gaataaagga tccatggatg 9781 aagatagaaa gttgatttct aatcgtaact aaatcttcaa tttcttattt gtaaagaaga 9841 aattgaagca aaatagctat taaacgatga ctttggttta ctagagacat caacatattg 9901 ttttagctcg gtggaaacaa aacccttttc ctcaggatcc tattaaatag aaatagagaa 9961 cgaaataact agaaaggttg ttagaatccc ctcttctaga aggatcatct acaaagctat 10021 tcgttttatc tgtattcaga ccaaaagctg acatagatgt tatgggtaga attctttttt 10081 tttttcgaat tttgttcaca tcttagatct ataaattgac tcatctccat aaaggagccg 10141 aatgaaacca aagtttcatg ttcggttttg aattagagac gttaaaaata atgaatcgtc 10201 gtcgactata acccctagcc ttccaagcta acgatgcggg ttcgattccc gctacccgct 10261 ctatatctat ttattctaaa tattttaatg tattcattaa atcaaattta gtttattagt 10321 attagtacat cattgaatat acaattccaa aaattctttc acatccgatt ctttctgttt 10381 tttttttcaa acaaaaagtt aaaatacgaa aaaaaaatca gaatgaaaag cgtccattgt 10441 ctaatggata ggacagaggt cttctaaacc tttggtatag gttcaaatcc tattggacgc 10501 aatttatttc catatatatt tttttttaga tttcgatagc aagaaagact gtttgaatat 10561 ttgaatccaa gacgcttgat tccttttttt tattaagatt aagacaaaag tgatcaatat 10621 ttctttatgc ttgttcctga agtataaaac ggtccatttg ttcctgaata gcttctttca 10681 aaagggcttc tgcttcctcg gtaaatgtct tggtagaaga tatgatttct tggaactgag 10741 gtttattagt ttttaagtaa gtacgtagct caacaagaaa tttccttacc tgtccaactt 10801 ctaatgaatc aagatagccg tttgttccgg tataaatagt cattatctgc tcttctaccg 10861 tgagaggagc tgattgggat tgtttaagca attcacgtaa tcgttgacct cttgccaatt 10921 gattctgagt agctttatcg agatcagaag caaattgtgc aaaggcttct aattctgcga 10981 attgtgctag ttctaatttt aatttaccag ctacttgttt catggctttt atttgagctg 11041 cggaccccac tctggaaacg gagataccca cattaatagc aggtctgatt ccagaattga 11101 ataggtcggc ggataagaag atttgtccat cagtaatgga aattacatta gtaggaatat 11161 aagccgaaac atctcccgat tgggtttcaa ctattggtaa ggcggtcata cttccttcac 11221 ctaaactaga acttaattta gcggctcttt ccaaaaggcg tgaatgcaaa taaaaaacat 11281 ctcctagata agcttcacga ccgggcggtc ttcgtaatag aagagacatt tggcgataag 11341 cttgcgcttg tttggaggga tcatcataaa tgattaaagt gtgtcgttca cgatacataa 11401 aatattcagc cagagctgct cctgtataag gagcaaggta ttgtaatgta gcaggggaat 11461 ctgccgtttc ggctaccaca atagtgtatt ccatcgctcc cctttcctgt aaagtagtta 11521 cgacctgggc cacagaagat gctttttgcc caatagctac ataaacacat attacatttt 11581 gaccttgttg attgaggatc gtatctgtgg ctactgctgt tttaccggtc tgtctgtccc 11641 caataattaa ttctcgctga ccacgtccta tagggatcat cgaatcaata gcaataagcc 11701 cggtttgaag aggctcatat acggaacggc gcgaaataat acccggggcg gcagattcga 11761 ttaatcgaaa ttcagaagct gaaatttcac ctctaccatc aataggttta gccagggcat 11821 ttataacacg acccaaataa gcctcactca cgggtatctg agcaattctt cccgttgctt 11881 ttacagaact tccttcttgt atcaataaac catcgcccat taatacaaca ccaacattat 11941 ttgattccaa attcagagca atgcctattg taccctcttc aaattcgact aattcacccg 12001 ccattacttc atcaagaccg tgaatacgag caatgccgtc gcctacttga agtacggtac 12061 cggtatttac aatctttact tctctattat attgttcaat acgttcacgg ataatattac 12121 taatttcgtc agctcgaatg gttaccatga ttctttcttt attatttttt gaaagaaaaa 12181 aataatacct acagtagaaa gactaatcag ttatttcttt cattgttccc aacatgccaa 12241 tattggacct aatggtacgt aaatgtaact cgttgttcaa acaactattc agagttccta 12301 gagctcctcg taaggcttgt tggaaaaccc gttgtcggac ttgattaatc gccctttgct 12361 gttcaaactg aatcgtttcg tttttgtaat tttctaattg ttccaaagtc ttataagttg 12421 aattaatcaa attcaatttt tctcgttcta tttcagagta tccattcact cgaaactgct 12481 cggcttcgct ttcgactttc cgtaagcgag aacgagcttt ttcgagttgt tcaatagccc 12541 ctccacgcag ttcttctgaa tttcgaatag tattcaagat cctctgtttt cgattatcta 12601 ataaatcact taatgaaagt agattatttt tccattcctt tccaaaattc cataatccct 12661 tcccgaacca aacatgaatc tttcgattca tttggctctc acgctcaatt acttaaggta 12721 aattctcata tcttttttta tgaatgtaat gagcctatct tctcttcttt gttcatattc 12781 caaaaagata tcgaaactaa tgtaatacca aaatattcgg aggactcttc tgacaaaata 12841 aaaaatatgt aattgtcagc aaagttgttt cttttttttt ttttcaaatc caaaaagctc 12901 ttcttactta gaataggtcg tcgattcagc attagataaa gggggtaaaa tccccgtttt 12961 tacaatttac aataagcggt tcaaatcatt ttatcaatat gagtatccta tatcgataaa 13021 atatttattt tgaaaccacc tctatattaa catagtggta gaaagagtac catgctgcgt 13081 ctagacttca aacagtttgt tttaaccatg ttaatagttc cacattattg gttaatagag 13141 aatcaaaatt gatttaccaa tgaatcgcga aatgctatgg ttcttacata taatttctga 13201 atttattcag aagtaattcg cgagatcatg cacctctctt tcctagttat aacggaaaag 13261 ggtacagctg ggtggtccag cctattcttg aaataaacaa ctcgcacaca ctccctttcc 13321 aaaaaaaatc aatacaccaa gcactacact tagatttatt ggatttgttg ctaaaatatc 13381 ggtattaaac ccgaaactcc cggcagatgg ccagtggccc aaagaaacga aagaatcggt 13441 tacgtttttc atatgatctc ctcttataga tagactaaaa aatcgaacag agttcttttt 13501 gtagcacttc gcccctcttt ttatttattc ttttattttt tctgaaattg agtcaaaaaa 13561 taaaaaatat tcgagttagt tataaattat gaactaacga actagccctt ttattggtta 13621 ttggaacact aacacttact aaaaagagtt tcccttggtc tatgaacggg aaggatgaaa 13681 gcgagtcagt atgctaattc ctcatccgca aatcagccct tcccgtaggt tcttttctca 13741 aagaataaag aattggagga gggaaatctt gatagaattt gaaaaagcaa acgacaagtc 13801 gaaggcaata aaatatgaaa aatgtattta tttttcatat ttctaagcta agattaaaca 13861 aaaggattcg caaataaaag tgctaatgct acaaccagtc cataaattgt taaagcttcc 13921 ataaaagcta gactaagcaa tagcgtacct cgtatttttc cctctgcctc aggctgtctc 13981 gcgataccct ctacagcttg acccgcagca gtcccttgac caactccggg tccaatagaa 14041 gcaagcccta cggccaatcc agccgcaata acggaagcgg cagaaatcag tggattcatg 14101 ataagttcct cgtaccaaaa aaaagaaatg gttaacgata caatcaacca atgagttatg 14161 acttaattat tccctcgcta ggaatcatcc agtcgaagta actaagaact tcggattgaa 14221 gtaataagat tattgaatca tcagaactac ttcgatatat cttttttact ttttagccac 14281 agagtctttg tgaacccata cgactttcgt tcttccattt cttggttcga actgttagtt 14341 gaattatttc ttgatttcat ccgtttattc attcaattca cagtcacaag gggccggaag 14401 gacttctagt ctattagaat cccctagagt agtaaaatta tatctttagt tcatttcata 14461 tataactagc actagtcaat atctaatatc acatatacat gtctttcttc cataacgtaa 14521 accaagcatt catcttagat tcaatcctat tcgagaatca agcgtcgaaa catctagaag 14581 ggttggctta tagttattca attacagata cctccctctc ctaaccgacc ctttctaaaa 14641 tactcaaaaa aatccctttt ttgtaaattc ttttgaacct taccttttct tattattcca 14701 cctagataaa tctaaatgga caaattgatt aggccgaata attccatatg tatagaaata 14761 tcattatttg attgatctaa gttcatgcaa tttattaata aaaatgaata atttatttat 14821 taattattaa tattttggtc aatcgttgaa taaaatcaac tgaaagggaa atcgtttcgc 14881 cctttttaat ttaatttaat tacacgtcgt aaacctatac aacaagaatt ataattattg 14941 acaaaaattc ttatattcaa attgttttaa caatgaatta ataatgagat ggactaagca 15001 atctaaagtg aatattcatt gagacgaagt atgatattaa gtgaaggaaa ggggaatttt 15061 aggaaaaaga tctttttttt ttagatcttt ttccccttac tctttaatat catcgtaatt 15121 tttttgctat cactctagat cgtatataaa atagttgtat atttagattc ccctattcta 15181 ttccctaagt taagtaattc tcttgagcca cccaccatat ttatacattg ctttgggcta 15241 agctaaataa gactatttca atgatggccc tccatggatt cacctatata agccgcggct 15301 aaagttgcaa aaataagagc ttgaatacca cttgtaaata atccaaggag catgacaggt 15361 ataggaacta ctaaaggtac taaagaaaca agaacaacaa ctactaattc atcagctaag 15421 atattcccga aaagtcgaaa actaagtgat aaaggttttg tgaaatcttc taagatgtta 15481 atgggtaaaa ggattggggt tggttgaata tattttccga aataacctaa tccttttttt 15541 gtaagacccg catagaaata tgccactgat gtgagtaaag ccaaagcaac agtagtattt 15601 atatcattcg tgggtgcggc taactcccca tgaggtaatt gtatgatttt ccaaggtaaa 15661 agagctcctg accaattaga aacaaaaata aataaaaaca tagtgccaat aaaaggaacc 15721 cagggcccat attcttcgcc aatttgagtt ttactcacat ctcgaataaa ttcaagaaca 15781 tattcgaaga aattctgacc cccggtcgga atggtttgtg ggttccgaac agctatagtg 15841 gctgaaccta ataagatagc aattacaacc caagaagtaa taagtacttg gccatgtact 15901 tggaaacccc ctatttgcca atagaaatgt tggcctactt ccacaccgga tatatcgtat 15961 aaccccttta gagtattgat ggaacatgat agaacattca tattgccttg ccctctgaaa 16021 aaattgaact ttaaacaaaa ttttttgatt caaccatctc tttgtctact tgaatcggat 16081 attttgaata ccaactaaga tttagaatac taataaatca cataatatcc ccagctattt 16141 ttatctcttt tttgaaattc agaaatagta agcgattcca taagggattt ctgaagtaag 16201 ttatttatct tattatgtta ttattaatca aggatttctt atatagctag aacgaccctc 16261 acaaattgcg aatactaatt tgttaagaat taatcggatt gaggatatgg cgtcatcatt 16321 cgctggaatt gaaatatctg cgagatcggg gtcacaattt gtatcggtta aacaaattgt 16381 tggaattcct aaagtaatac actctcgcag ggccgtatat tcttcgtgct gatcaacgat 16441 gattacaata tcgggtaccc ctgtcatata tttaatcccg cccagatatg tttgcaagcg 16501 agataattgt cttttcaaca tagcagcatc tcttttcggg agacggttga gtctccctgt 16561 tttttgttcc attctcaagt ccctgaactt atgaagtctt gtttcggtag tggaccaatt 16621 cgttaacata ccgccaagcc attttttatt aacataatga caccgggccc ttattgcagc 16681 ccactctact gaatcagctg ctttattttt ggtaccaaca attaagaatt gttttcccct 16741 acttgctgcg tcaaaaacta aatcacaagc ttctgataaa aaacgagcag ttctagtaag 16801 atttgtaata tgaataccct tacgctttgc agaaatataa ggcgccattt taggattcca 16861 tttcctagta ccatgaccaa aatgaactcc tgcctccatc atctcttcca aatttatgtt 16921 ccaatatctt cttgtcattt ctctccacac cccccctttt ttttttattc tttttcaaaa 16981 aaaaaaaaag agacgaggaa ccctgaactg aaataaataa ttgttccgat ggaaccttct 17041 cttctaccgt agattggacg tagatacacg acccaaacca ttattctttt ctattcatta 17101 ttctttttat taccaaagca aataaccata ccaaatgcag atagcgaaag agatgaatcc 17161 gttgttagga atcattaaat cctataaacg attgttcggg tatatcgtgg aaattttttg 17221 aaagacaaga atcaaataat tttttgtggt ggaacaaaat atctctcatc tccccctcga 17281 atagattctt tttttttgtt tccaaaggaa tgttgttatg ttgttttgaa gggtgcacta 17341 atcccttgaa tccggtacca acgggtatca ccccccccaa aacaacgttc tctttcaggc 17401 ctttcaacca atcgatacga ccccggagag ccgcttttgc taaaactcga gcagtttctt 17461 gaaaactcgc ttcagatatg aaactttgag tattgagaga tgctcttgtt attcccaata 17521 agacgactcg gtaacagatc gcttcttcca aagcgcgccc cattcgttct gctcgcaaca 17581 atccaataag ttctccgggt gaaaaaacat tagacattcc atcttctgaa accaacactt 17641 ttgatgttat ttgacgtaca ataatttcta gatgcctatt atgaatctgc accccctggg 17701 agcgataaac ctgttggatt ttattaacca aagagattcg gctttgcgct atagttagct 17761 cagcaccaat caagaatccc caaggaattc caagaattct tgttatacat ttgttccaac 17821 cctcaatcct cttttctaga ttcatggata ttgaatcaac cgaacgcact tctaacacct 17881 gttctacttt tggaagacct tgtgttatat caccagatct cgatttttca tatataaatg 17941 taactaatgt atctccttcg taaagggttt ccccataatg gccatgaaca gttgctccgg 18001 gggtggccaa ataaggctta gctgatcgta tcactatcga atcaacttga acaagtataa 18061 cttgacccga tttgaggggc ggtccatttt tggctataca tacattttca caaataaact 18121 gtccaagact aattatttta gatgtctctt cacaataatt gtgatggaga aaataccaat 18181 tcaaattgaa tggatttaaa ataatgttac gacacggatc gggattaaaa atttttccat 18241 tttcatccat taaataatat ttaaatttaa tcacttgaaa agtctgtttt aaattgtcaa 18301 gttgcaaata gttagttact aagatctgat tatgagttat taaatggtaa gatgaataaa 18361 aattctcaat tggaagggat gttcctaaag ggcccaatga attcctaatt ggaattaggg 18421 gatctttttt aattgatttt tttatcacac tgtgatattt tacatccttg aatggcccca 18481 ttcgagaaca attggctgct gacaaaatta tcaacgactg acattcctta tttctattta 18541 acaacgtatg aatagttcct tgaggttggt taatagattg ttgaattttt gccttggaat 18601 aggaataaat ggaagaaaag gggttgatat tggtacaatc tgatccatta tcagagagca 18661 atcctgaccc cgacggatca ttcctttttc cgatatacga aataggggat ttcactaagt 18721 tgattcttag gaaatgtcga atcaaaccat ttgtccttat ttcaacaaaa gaagcacggg 18781 cttcttcgca agaagaactt tttttgtctt ggttccaatt taatactaaa caagtccgaa 18841 ctaattgaat acttgtgtca gaaattcctc gaatcggttt gccatttcca taaaggatat 18901 aattgacaat tcgaagttgc acattatccc tttcctgcaa tggatccggt ggaaaaaggg 18961 ttgctaaatt tataccgtcc gttatttcat atgtgacgac aggtcgaact aaaacaaaaa 19021 actttttctt gctaggtgta atccgttgga catagatcca atttttcact tttttggatt 19081 ccttggaatt tctttttcct gttcctggtg gtatcaaaac gccggtatgt cgggatatct 19141 tatctgtctc tccaggaaaa tggatatctc cagaaaagat tttaagttca attcgttttt 19201 tttttctctc cacccggacc aacccaccga ctcggcttct tagatttaag gtgatttgtg 19261 tatctacccc aacgatacta ttgttccgta ccattatgga agaagatccg ggcaagatat 19321 gcacctcttc aggaatgaaa aaaaatcgat ctactttcat ttggtatttt ggcctaaatt 19381 ccttgactcc tcgatactca agcaaatcct cttttttgat gactgaatgc gtttctacag 19441 tcccatattt aataatgccc gaactctttc ttctgtatcg aggatcatcg aaataagcaa 19501 gaatactatt tcgacggaaa ataccattta cggggatttc aatcgagata cctgaacagg 19561 gcattagttc attctcgagt tcttgaatcg agtgtagtgg aatgatgaat ttatttcttc 19621 gcctttttga caataaatca gaattctcgt ggagaatagg cgaatatacg agattatact 19681 gaccagcaca tataattcga ttaaggtctg aataatcagg aatcctatct tcttttttac 19741 cagaaaaatc gaactaaata atttctgcct cgcctgatcg ttggttactc gagaggttag 19801 aagtatatct tcgcttgcca gaaagaaaat gcgcattcat ttgatcctga tccttgtgga 19861 tcgaaaggta gactagactg gacctcgagg ccttcctaat aatatccata aatggcttgt 19921 ttttggtaat agatgaacat taccgtatgt aaattcgggt gcatgataga catcggtact 19981 ccagtgcatt tctccgtctg aatcagaata aatatgtttt cgaaccttct ctttaaaatt 20041 caaagtggat attcctgcgc gaatctcagc aattacttgt tctgattcta catattgatc 20101 gttttgaact aaaagcaaac ttttgggtgg aatattcaca ttatgtagaa tatcttcact 20161 ctcaatagtt acatacaagt ctatagaaca tagaaaggcg ggatgcccat gacgtgtacg 20221 tgtcggatga accaaatcct cattgaattt gatttttcca ttagatgggg ctcgcacatg 20281 ttctgcagta ccccccgtga atatctccgg tatgaaaagt tcttaatgtt aattgagtac 20341 ccggttctcc aatcgattga cctgcaataa tacctacagc ttccccaatt caaccaggtc 20401 gccatgagta ggactccggc cataacataa tcgacaaatc caagatgtac tcctacaagt 20461 aaagggagtt cgaatagaga ttggttgtgc ccgaaaggtt atgaatcgat ttacaagtcc 20521 aatgccaatg tcttgatttc tagtggcaat acatcgcgga cccatgtata tatcatctgc 20581 taatacacga ccaattaatg tttggataaa aatcctttcc ggcatcatcc cattccgagg 20641 actcacagaa ataccccggg cggtgcacaa tccgttcgac gtacaacaat gtgttgaact 20701 acttcaacaa gtctgcgagt gagatatcca gcgtctgatg ttcgtacagc agtatccaca 20761 atctttaggg ctcgtagcaa gaaatgatgt attctgttaa agagagtcct tcgcgtaaat 20821 tgctttgaat aggtaaatca atcatttgtc cttgaggatc tgacattaat cctctcatac 20881 ctactaattg atgtacctga gatgcatttc ctctagctcc cgagaaagac attatatgaa 20941 ctggattaaa agggtcagtc atcctaaaat taggattcat ttcttgtcgc aaatattcac 21001 ttgtagcata ccatatttca atggattgac gtaatttttc taccgcgtgt acattcccat 21061 aatgatggtg tttttccaaa atcaaacttt gttgttcagc atcttgaact agccatccct 21121 tagaaggtat tgttaaaaga tcatcaattc ctaatgaaat ggatgtagca gtagcttgtt 21181 ggaaccccag agtttttact tgatccagga tatgtgatgt atatgccatt ccgaagtgat 21241 ctattaatct actaataagt cgtttcatgg cagttccgtt tatcgcttta ttgtgaaaga 21301 ccagattggc ccgttctgcc ataagtacct ccatattccg ctgagtagaa ttcgacaatg 21361 ggtttgagtc ggtgattgta aaacttcctt ttatcgatct tgattcgcgt ataaattccg 21421 gaactatgga cctagctgaa ccggagagcc ccgaagtccc acgggtatca tagaattacg 21481 ttaggtacca gatgaatagg cccgagaaaa cccctgtata gcttcttcga tttctcgata 21541 aagagcaata tgaccaacag tggttcgaat gtatataaaa aggatttgtt tttttagact 21601 tcttactatt agatagtgtc cataaatctc ataaaaagta cctaaagatt catagtgaac 21661 ttcgatggga gtttctcttg aagcaataac gcgttgatct agtcgccacc ggagccacaa 21721 aggactatct aaattgattc gtttctgccg ataagcccca attgcatcat aggaattaga 21781 aaaaaacggt tctttcgtat acttatagtg actattgtca cttctttttt gattttgata 21841 gtttctgcga ttacatggat tatatctatt tacacaaata cctcgatgat ttccgctcgt 21901 taatacatag agtccaataa gcatatcttg cgttggtacg gaaatgggat ccccaatagc 21961 cggagacaaa agattcatat gagaaaacat aagtaaacgg gcctctactt gagcctccaa 22021 ggataaaggt acatgaacag ccatttgatc tccatcaaaa tcagcattga atcccttgca 22081 aactaatgga tgtaaacaaa tagcgcgccc ctccactaaa acgggctgga atgcctgtat 22141 gcctaatcta tgcaaagtgg gtgctctatt cagcaataca ggatgtccct gcataacttc 22201 ttgaaggatt tcccaaacaa tcggctcttt ttctcgaatt ttactcttag caactcctat 22261 gttcgaagca agatgttgtc taattagacc acgaattaca aatgtctgga aaagttctat 22321 tgctatttca cgaggcaatc cacatcgatg taatgaaagt gaaggaccca cgacaatgac 22381 agaacgtcct gaataatcga ctcgtttgcc aagcagagtc tcacgaaatc ttccctcttt 22441 gccttcaatt acatcagaaa atgacttgta aactttatta tgaccgtccc tcattggttg 22501 tccccggatt ccattatcaa gaagtgtatc cacggcttct tgtactaatt tctcctgaca 22561 cattactaat tctcctggcg tagatctact tgttgttaat agatcggtaa gggtattgtt 22621 ccgatagata actcttctat agagttcatt aatatctgag ctcattagtt tacccccatc 22681 tatctgaatg atcggtctca actcaggagg aagaactggt aatagacaca aaaccatcca 22741 ttctggctct atatttgttc gaataaaatg cttagccaat tccacgcgtc taaccaaaaa 22801 gtcctttctt cttccaactt ttcgatcttc ccattcattc cctgtgtgcc cttcttcccc 22861 caattcttcc cattctacca acgaattctc tataataatt cgtaaatcta gatcggctaa 22921 ttgttctcgg atagcacccg cgccagtaga gatttctcga ttgcgaaatg tatcgaaacc 22981 ctgggtagta aaaaaaagtg ggatgctgta tttccaagat tggatttcat attcgaataa 23041 acctcgtaat cgtaagaaag tgggcttttt agttatgggc ctagcaaaag aaaaattggg 23101 ataggattct ataggatctc ccccccttca aaatcggacg tgaaagtttc ctttcatccg 23161 gctcaagtag gtacaccaaa taaggaaagg agttctcgtt ttcaaactct agaaaatccc 23221 aaaataaaaa ggtctactcc ttactcaagt tcccagtgaa gacgaaacaa gatttcagtg 23281 attccgtctt ctattaattc tttattcaaa ttcaattcca acaaataaaa tagaaaattc 23341 ttgagtagtc tacttccctt tgaatgataa atcccttaac tcttaataat taaaggaata 23401 ccttggaacc cataagggat ttacttgtct atatattgtt ccattcgatc ttttaggtcc 23461 cgacttcacc tcgatggtta ggccaccacg cccttaaagt ctatacgcga tagatagact 23521 cctagaacca tgacatattt gcttacttga acataatttc tttccacgaa aagaaaggaa 23581 atgtttcatt ctacaaaata aaaagctttt tttacgatgg tacaaataga aattcctctt 23641 tatttgattt gttacgaaat cgaccataga tcaattccct ttttatttgg gagtattgac 23701 tacaccccaa ttctgagctt catgttactc tttccaagtg cacatgtcag gtccagggca 23761 tcccaattgg attgactggg atgacagttt ctccttccga gtctgtaaaa tcagaatttc 23821 gatcaaatca cacatcgcag tatactaggc cttctaattc tttaagaggt ttatctaaaa 23881 gattcgcaat ataactagga agacgtttta aataccacac atgggttact gggcatgcga 23941 gtttgatata gcccatttga taccttcgta tccgagaatc aacaaattcg accccgcatt 24001 gttcacaaaa tttcgggtct tctttttcat ctccgattac tcgataattt ccacaagcac 24061 aaattccgct ttttatagga ccaaaattct tcacaaaata atccatcttt tccggtttgt 24121 tagttttgta atgaaaagta tagggttttg ttacctctcc aactatctct ccattaggca 24181 ggattttagt ggcccaagca cttatttgtt gaggagaaac tgatccaatt cggagctgtt 24241 gatgtttata tcgatcgatc atagaagaaa aattattatt cattccgatt aagcttcctt 24301 cctattaatc tggaagttct tctcagatac aaggaaatga ttcagttcca gagctaaaga 24361 tcgtagttct cgaacgagca atcgaaaaga ttctggagca tcttcgggat taggtattgt 24421 tcccccaatg atcgtagtac caagtacttc ctggcgagct ctaatatgat ccgatttata 24481 agtaagcatc tcttgtaaaa tatgagcaac cccaaaccct tctagagccc aaacctccat 24541 ttctcctacc cgctgtcccc cctgtttggc tcttcctcta aggggttgtt gtgtaacaag 24601 cgcataatgt ccactggagc gcccatggat tttatcatca acttgatgaa ttaatttcaa 24661 gatataaggc tttcctatta taacgggttg ttcaaaagga ttccccgtcc ttccatcaaa 24721 tattctgctt tttcctggat attcgggttc aaatacccat ggattcgctg tttgcttact 24781 ggcttcatat aattcagaaa acacaagttt tctcgaagct tcttgttcat atctctcatc 24841 aaaaggtgct attcgataat gtctgtctag cagactccct gctaacccta gtgaacattc 24901 aaatatctgt cctacattca ttcgtgaagg tactcctaat gggttaaaga ccatatcaac 24961 ggatcttcca tcttgtaaat aaggcatatc ttgtctaggc aaaattttgg aaatgatacc 25021 tttatttccg tgtcttccag ctactttatc gcctactttg atttcacgtt tctgtaaaat 25081 atatacacga atcgtttcgg gattataact agaaccaccc ctcttctgga tccacctcac 25141 atcaataacc cgacccctgc cacctatagg tagttttaga caagtttctt ttgaagtaga 25201 tacctgaata ccaagtatag ctcgtaacaa tctatcttcc ggggcatacg acgattcttt 25261 cacgacctgg ggtgttaatt tacctactaa aatatcacct gtctctaccc aagatcccag 25321 catcacaatt ccatttttat ctaaattgcg gagtaaatgg gcttctaaat gcggtatttc 25381 attagttact ttttcagggc cttggcttgt cacatgagtc tgaatttcat atttccgtat 25441 gtgaaaagaa gtataaatat cttcatatac caaacgctcg ctaataagta ctgcatcttc 25501 agaattgtaa ccctcccacg gcatataagc tactaatacg tttttcccca aagcaagttc 25561 gccaccaacc gtagcagcac catccgctaa aatttgtccc tttttaatgc atttaccccg 25621 aggaacctgg agtttttgat gcatacaagt atttttattg gaacgttgat atataactaa 25681 tggaatgctt agaatatctc cattacctgc taaaagaatc ttgtcagtat tggtataaac 25741 gacccttccc tcgcgttcgg ctatagcaag agcccccgaa tctagagctg cttgtcgttc 25801 caacccagtt ccaacaatgc atttctcgga gcgagaaaga ggaactgctt gacgttgcat 25861 attagaactc attaaagctc gattcgcatc attatgttcg ataaaaggaa tgagggaagc 25921 tccaatagaa aaatattgaa aaggaaaaat acttcgaaga tgaacctgtt cccatgcaat 25981 agtcaagaat tcttgacggt atcgagctgg aacaacctgt tcttcctgaa tatcctgatt 26041 taaggctaaa gaatttcctg ccgctaccat atagtattca tctctacctg gtgataaata 26101 aagcatccgt accccggttg acctctcaga aatttcataa aaagggcttt ctagagatcc 26161 ccaatgacca atcctcgcat gaattgctaa ggatccaata agtccaacat tgattccttc 26221 agatgtgtca attgggcaaa tacgtccata gtgactagga tggatatctc gtatccgaaa 26281 actagcagtg cgccctgtca gtcctccagg gcccaaataa cttaattttc tcccatgaac 26341 tatttgtgtc aatggattag ttcgatccaa aacttgagat aatgggtgta aaccgaaaaa 26401 ggattcataa gtagttgtta atggagttga ggttaccaaa ttctgaggtg tcggtatcaa 26461 tttatgccga attgctccac atatagtccc ccgaaccaca ttttctaaac gaaccagagc 26521 caatccgaat tgatcttgta aaagatctgc tacagaacga atacgtttat ttttcaaatg 26581 attcatatcg tcaagtgcac ccattccaaa tttcagccca atcaaatgat cggcggctgc 26641 caatatatct cgtggtaaca aaaatgtatt gttctggggt atatcaaggt tcagtcttcg 26701 gttcatattt cgtcgaccaa tccttcctaa ttcacatctt tgttgaaaga atttcttttg 26761 taattcctta cataaggatt cagaaaatac cggatcgcca cctacacaag caaattgttg 26821 ataaaactcc aaaatggcat tttcttttga cccaattttt tttctctcct tatcactcag 26881 aaaagacaaa aaaatttcag gatagcaaac attctctaga atttctctta gattcaaacc 26941 catagctgat gatagaacta gaatagatat tttttgtttc ctacttacac gagcccatat 27001 ccttgctttt ctatcaattt ctaattctga tcttcctccc caatctgata ttatggtgcc 27061 ggtatagacc gaaattccgt tatggtccaa ttctgatcgg taataaatac cgggactttg 27121 caatatttga ttgatcacaa ttctatatat tccattgact atagaagttc ccagggaatt 27181 cattagagga atgtttccga taaaaattgt ttgttcttgc atatccctac tgtttttcca 27241 aattaatccc gcggatacat ataattcaga agaatatgtg agtgattcat acacagcatc 27301 tctttccttt atcaagggtt cgaccaattg atatgtttcc acaaataatt gaaattcaat 27361 ttcttgatct gtatcttcaa tttttggaaa cttataaagt tcttccgtca aaccttgatc 27421 aatgaaccta caaaatcctt caaattgtat ctgattaaat ccaggtattg tagatattcc 27481 ctcatttcca tccccgagca tttttaattt cccatttatc aaaaaatacc actattggtt 27541 cattcttcat ctaattagat agattagata aatgatctag caatgatggc atttctattt 27601 tgtttaccga atcacatgaa attttaccca actccatatc tggaatgtat gaaatacgta 27661 tgaacggagg aagaaagaga attttctact taaattgaat tggaatttat tggaattttc 27721 aacagataca aatggaaaga aattgataaa acatccctag aaacagactt ctgctactta 27781 gacttattaa ttaagttata gaattttgta tagaatatca aaacaaaaat gattccattt 27841 ctaccattat tatgataata cacattccaa cctgcttgaa taccagaaaa ataaatggat 27901 tcgacatttg atcttttcgc tgagataaag gcataaaaat aagaaagaat atatagaatt 27961 agaatcggtt ttttagcatt taaccccctt ttctgttatg gatttcgttg ctaaaaaaat 28021 gatttgtaga gaagagagag attttgttta cggatttttg aatagaatac gattgtgaag 28081 tgtataagaa aagaaggttt gtatggctta accacgtgtg gagatatcta taatatccgt 28141 ctttcttctc ttttattgtt ttattgtcgt tctctgttct attcggggca acccgggttg 28201 tgctctatga aaacagaatt tcaattttct attcaattca aaattcaaat tgaagtatga 28261 tacttttctg atatctgata attctctatc ggaacatata taaataatat ataccgtcta 28321 acaatttctc ttgggggttt acatatactc ataattgttg ttataattaa aattgagaag 28381 gattttttga ttgaaaaaat ccatactgat tagttatata tcaagttgta ttttcttatg 28441 tcattaggaa aacaaaattt ggagattcaa atccaagaat cattcatgca ttctaagtca 28501 atagttaatg gttccgattt tcagaaattt gaattttgga ttttgcgact gaaaatccac 28561 atttgatttt tcaatagaaa ggtaagagaa agctttgaac attatgaatt tggagatcga 28621 aattgaaagg atgaatcaaa cccaatcaaa agggaagaag gattaggatt tctttgactt 28681 ttaggaaaaa ttaaggaaaa cagaactcaa ggtgcaagta caataaaaaa gcagttcagt 28741 aatcctggaa agttttcatc tattttgtat ttgtagcatt ttggcgacat ggccgagtgg 28801 taaggcagag gactgcaaat ccttttttcc ccagttcaaa tccgggtgtc gcctgatcaa 28861 caaaaaactc gaaatctctt cttttcttct gttctgttga tataacccgc cgaatgattc 28921 cccagcagaa gcagagaaag cagactgttg atacttgttt gattctaaac atctggtctg 28981 ggggtttttc taaaaaattg taaatatctt tgcattgcat atttaggctt caaggaaata 29041 ttcgaatgct agaggggcta tcaagacttc gcaattacct tctactacaa atcaaaattt 29101 tctattatta atgcattgta taatgactgg accttgaatt agattggaga gcccgatagg 29161 aaatctaaat agttgtggaa gggggcggaa gatactttat tatatacgag gaactcacga 29221 aaatctctga gtgctcaagc atccaatcaa ttgaaatgag ggtcaacaaa aaaagaatag 29281 gacctattat tcctacatgt tccattagta acattccctt gagatgttac tgcagatttt 29341 gcttgtgttt aatctttccc gattagaaat cctataggaa tttcttataa aatgagcgaa 29401 tttattggat tggtttatta atagtcttcg ttctttttga ctctgcgcca ttgattccac 29461 tattattagt gaggaataac ggaacaattc ctttatattt atagagatag gggacataat 29521 tcatatggat atagtaagtc ttgcttgggc tgctttaatg gtagtcttta ctttttccct 29581 ttcactcgta gtgtggggaa gaagtggact ctaggggtcc tactaattga gttaaggaag 29641 caaactgtat caatatcaat tgctttcgag atcgttctgc aacacgtttt gaacaaaatc 29701 aaaatatctt cattttgaaa ttccattgga ctcgactgga gtaatgtatt ataggaatca 29761 tcctctttca atcaaagagc tatttcaacg attcccatgt ttgtagttcg aaaggaagag 29821 gatcccagga aatttattcg aacctaattc ttccgaaatt ttctattcca atcaacggcc 29881 tcttacaggt gatactgagg agggccggac ccttttttta tttctttctc tctttactgt 29941 tcaaagaaga ggtagttttg ttaagtgtat acgcactttg tatgagaaag aaaggatata 30001 aacatagtgg ttgtctaacg agatactatg cagaataaga tcttcagatg agtcacatat 30061 tgcgcattta ccgctttcga atttttgaaa ttggatttat gctttatcga cttatttcat 30121 atcatggttc aggcgttaaa aatcggtgag gtttactctt ccttttcgat gcccgtggaa 30181 ctactgtcaa tggtttactc aattacttct tgggaatgtt aaaaaaaaga ttactacgtg 30241 attttttgaa tctgcctata tctatcgctt ttccttcatt gatttgattc tttcaataga 30301 taccgagatt cagattggaa atcaaaaatc tagtaattca aactataaga cataagagta 30361 atttagattg atcagaacaa atagatatag caaataaatg gaattggatg ctatgtcaat 30421 cccatatatg gaattgatat tcacatatat caagataata ttgtagattg atctatagat 30481 ccatatcaaa agatccatat caaatgcagc ctctatcttt attttattcc agggggcagc 30541 tttataacta caatctaact aataaatagt atggtagaaa gaaatagatg aatctttctt 30601 tctaccatac tatctatcta ttagaatact gccgattcta gtccatacat tttcatttaa 30661 gacatgaaat tagaatcttt ttcattttat ttcgtcaatt ttggctaaga actcagaagt 30721 caagtttcat tcaaattagt taataattaa tcgttttgac tgactgtttt tacgtaaatg 30781 ataagtagaa aagcggtagg aactagaata aatagtgcag tagcaataaa tgcaagaata 30841 tttacttcca taatctcatc ggttttttac ttcgcaataa ctcgggattt aatcccatag 30901 agatgataaa tctttggcct gtaaattcaa tgaatgaata ttacctctcg atgatcttga 30961 atcggatcaa tatcatgaat aacaatatct gaactatcaa atcaattcgt cgtcgagaat 31021 tgaatagtat aacataggaa gttcttttat ccataccgcc ccaaacttgg attcctgacc 31081 caatccaaaa ttcctttatt tatttatcat tatcattttt tctcatctgt tctttttttc 31141 tctctaatct atctagttcc ttcttgtaca atcatctgat gaagtctcat caaatagctc 31201 ttccacttcc agtggtcaca catagttaca aacccaaaca aacaataaaa gctaaatgga 31261 aaaagaaagg agtttagaac taaactattt ttgacttgga agacaaagaa gtgtgataaa 31321 gatgagaccg tataaaatga atattcatca aattgactat tttccgattt gttctttcgt 31381 cgatgggggc cttaaaacaa aatgaaaaat cggaaaaatg attcattccc ctttctaaga 31441 ggagtaggat ctttcctttc ccctcctttc ttcgtagatt attagccccg ggacacctat 31501 accaaaagct cagtgtgcaa tttgcatgaa atctattttt caacttcaaa ctagtaagtg 31561 aggttccata aatccgtagc cagaaaaata aattgttttt ttttttgttt tttctgggaa 31621 agtattttct tatattaaat tttgtattgg acaagaaagg aattcccctt gtgtatgcgc 31681 gcctcaaaaa ggtatagtac tcgattccat tacatgcatc gggggcaatc gaaaaagcca 31741 gcatttcttg gaatactgac tataatgcta ccaataatcg tactaatcca accgcatatg 31801 tctttctcct accaaaagga aagaaaaaag aaataaggat ttcccctttg ctttgacaat 31861 gaaattctgc ccccggtccc cttcataaaa agggagagat ttattgatat atttattgga 31921 tccatcggga ctgacggggc tcgaacccgc agcttccgcc ttgacagggc ggtgctctga 31981 ccaattgaac tacaatccca gggaaatacg ggatctagca gaaaatttga ttctttttta 32041 tctccggatc gggtatttct gaagtacgaa gggggttata tcatctcatg gcggattggc 32101 gaatttttgg gccgagctgg atttgaacca gcgtagacat attgccaacg aatttacagt 32161 ccgtccccat taaccgctcg ggcatcgacc caagaagaat caattttaga cttattggta 32221 atccatgatc aacttccttt cgtagtaccc tacccccagg ggaattcgaa tccccgctgc 32281 ctccttgaaa gagagatgtc ctaaaccact agacgatggg ggcctgcttg accaaccgcc 32341 atcatactat gatcatagta tgatcagttt tttgaaattg tcaatataat cgaatgattc 32401 tatccgaggg atctttcccc ctttcagaat tgcatagaat ttttttattc gtcattgatg 32461 aattattcat tagaatcgcc attagaaatc tagtagtagt attttttttt ttttggaatt 32521 atttcaattg aatttctttc gattatttta gtttagatta tttagtattt agaattttct 32581 ttttttatta taaataaaaa aaaaattaat aaatacaaaa aatagaaata ataaggaaga 32641 gtaggatttt tgcagggaat gattggtccg tcagaaaagg aaaaaggtgt gaaattctat 32701 ttctttcact ttcatttgat tcattgttaa gacgagatat ccttatctcc ctcccaccaa 32761 gacaggaaat taacaaacga gaaatctagt aagcgggatc aagaagaaaa ttcttttttc 32821 tccaagaatt tagttcagga gacaagtaga atctcttcat tccatgattc gatgaaatat 32881 cttgaatttt atgttgaatt gctaggtgta tgtacatgta tcaatcaagt gaattttgtt 32941 ctggtgggat caattcaata aaagaaaaaa agcaattcga gtcggtcttg aaacaattca 33001 ttgcattttc tcctagactt cctaggtaaa tccattttat tattcaacaa tgagccacta 33061 gacactatgt atctactgca tgtacttatg catatatact tatgtttata atatatgtac 33121 ctatagatat tttatccaca tagtgaataa ttccggaatt aaatcaaaaa ggccctttta 33181 actcagtggt agagtaacgc catggtaagg cgtaagtcat cggttcaaat ccgataaggg 33241 gctttgtaaa actccaatct agtattcata tttgagggga gaattgtatt tttatttgta 33301 ataaaaaaag taactaactg gataatacat tatcattata cttaattatt atacttagtt 33361 ataaagttga acatttgttt agtcaatttt cattattatg aatttctgaa taatgaaaag 33421 tcacttcttg aactcaccga atattcctat tttccattat accaaccaaa tccattcgaa 33481 aggttagaaa tcaacaaaag aaaaagtaag tggacctgac ctattgaatc atgactatat 33541 ccgctattct gatattaaaa ttcgatagag atgaaattgg agcagttgat ttttttttaa 33601 tttcattttt ttgttttgga ttccacaaga atttgtcgat atttccgatt aaatcttctt 33661 gttactagat tttctatagg aaaaattata ggaataaatt gttattcctt tcctctacag 33721 agaaaccttt cttccaagtc acaccataag agccatttat tatctttctt tgattccaga 33781 tcaaagatta atttcatcat taatttctat ctagattata tatctatatt atattaagta 33841 gattgtagat ttcgatgtat atctatcaga tcgtggcttc atgtaccaaa tatttcaata 33901 tcgttgcatc cggtattttt gttttgttcc aacagtgtga tgaagaatag atccgagaaa 33961 gagactttca ttttcagtct cttatttatt ttatttttat tgaattttcg attttctaaa 34021 aggaaaatct aaaaggaaaa atagtagatt atctcttttt ctaacagata aaagaatcta 34081 aaaataaata ttcgatcgaa ctgtcttttt tccttcgatc cgtggaaaga tatactctgg 34141 ggttttagat ttatttatat gaagtatgaa ggaaagggat cgcttggtcc ttgaagagtt 34201 ctttcaaaac aaaggattga ttgaattgtc ttattaggac aattaatggt tcatatgctt 34261 agtcagaagg aataatccaa tggagttcat ggatttacct aggtcagttt atgggctaat 34321 caataaagca tttttatctt cgaaacccat tggaaagggc agtgcaagag aaatcataca 34381 aaaatgatcg aatcttcgga cgccccgaaa aagatatgag gtgctcggaa atggtcgaag 34441 tagttgaata ggaggatcac tatgactata gcccttggta agtttaccaa agacgaaaat 34501 gatttatttg atattatgga tgactggtta cggagggacc gtttcgtttt tgtaggctgg 34561 tccggtctat tgctctttcc ttgtgcctat ttcgctgtag ggggttggtt cacaggtaca 34621 acctttgtaa cttcatggta tacccatgga ttggccagtt cttatttgga aggctgcaat 34681 ttcttaactg ccgcggtttc tactcctgct aatagtttag cacattcgtt gttgttacta 34741 tggggtcctg aagcacaagg agattttact cgttggtgtc aattgggggg tctgtggact 34801 tttgttgctc tccatggagc ttttggccta ataggtttca tgttacgtca attcgagctt 34861 gctcgatctg ttcaattgag accttataat gcaatcgcat tctctggtcc aattgctgtt 34921 tttgtttctg tatttctgat ttatccactg ggtcagtctg gttggttctt tgcacctagt 34981 tttggtgtag cagctatatt tcgattcatc ctcttttttc aagggtttca taattggacg 35041 ttgaacccat ttcatatgat gggagttgcc ggtgtattgg gcgctgcttt gctatgcgcc 35101 attcatggtg ctaccgtaga aaatacttta tttgaagacg gtgatggtgc aaatacattc 35161 cgtgctttta acccaactca agccgaagaa acttattcaa tggtcaccgc taaccgcttt 35221 tggtcccaaa tctttggggt tgctttttcc aataaacgtt ggttacattt ctttatgtta 35281 tttgtaccag taaccggttt atggatgagt gctcttggag tagtcggtct agccctgaac 35341 ctacgtgcct atgacttcgt ttctcaggaa attcgcgcag cggaagatcc tgaatttgag 35401 actttctaca ccaaaaatat tctcttaaac gaaggtattc gcgcttggat ggcggctcaa 35461 gatcagcctc atgaaaacct tatattccct gaggaggttc taccacgtgg aaacgctctt 35521 taatggaact ttagccttag ctggtcgtga ccaagaaacc actggtttcg cttggtgggc 35581 cgggaatgcc cgacttatca atttatccgg taaactacta ggggctcatg tagcccatgc 35641 tggattaatc gtattctggg ccggagcaat gaacctattt gaagtggccc atttcgtacc 35701 agagaagcct atgtatgaac aaggattaat tttacttccc cacctagcta ctctaggttg 35761 gggggtaggc cctgggggag aagttataga cacctttcca tactttgtat ctggagtact 35821 tcatttaatt tcttctgcag tattgggctt tggcggcatt tatcatgcac ttctgggacc 35881 tgagacactt gaagaatctt ttcccttctt tggttatgtc tggaaagatc gaaataaaat 35941 gaccacaatt ttaggtattc acttaatctt gttaggtcta ggtgcttttc ttctagtatt 36001 caaggctctt tattttgggg gcgtatatga tacctgggct ccgggagggg gagatgtaag 36061 aaaaattacc aacttgaccc ttagcccgag tatcatattt ggttatttac taaaatcccc 36121 ttttggaggg gaaggatgga ttgttagtgt ggacgattta gaagatataa tcggaggaca 36181 tgtatggtta ggttccattt gtatacttgg tggaatctgg catatcttaa ccaaaccctt 36241 cgcatgggct cgacgcgcac ttgtatggtc tggagaggct tacttatctt atagtttagg 36301 ggctttatcc gtctttggtt tcattgcttg ttgttttgtc tggttcaata ataccgctta 36361 tcctagtgaa ttttacggac ctactggacc agaagcttct caagctcaag catttacttt 36421 tctagttaga gaccaacgtc ttggggctaa cgtgggatcc gctcaaggac ctactggttt 36481 aggtaaatat ctaatgcgtt ccccgactgg agaagtcatt tttggaggag aaactatgcg 36541 tttttgggat ctgcgtgctc catggttaga gcctctaagg ggtccaaatg ggttagactt 36601 gagtaggttg aaaaaagaca tacaaccttg gcaggaacgg cgttccgcag aatatatgac 36661 tcatgctcct ttaggttctt taaattccgt gggtggtgta gctaccgaga tcaatgcagt 36721 caattatgtc tctcctagaa gttggttagc tacctctcat tttgttctag gattcttctt 36781 cttcgtaggt catttgtggc acgcgggaag ggctcgtgca gctgcagcag gatttgaaaa 36841 aggaattgat cgtgactttg aacctgttct ttccatgacc cctcttaatt gagatgagac 36901 aggagatcca atgcttgaat gaagtaaaaa tcactttgat tcaatcatac atcttggaat 36961 cagcctaagt attccttttt tgtattcctt ttttcttttt ttttttcaat tcattttatc 37021 taatttattt ttctggcttg gctaggtggg atagccgagc cattcccttt tctttcggat 37081 agcaggttgg gcaaaaccac taaagaaaaa aatctattca attagcaaaa aaggagagag 37141 agggattcga accctcgata gttctttgtt aaaactatac cggttttcaa gaccggggct 37201 atcaaccgct cagccatctc tccgaaagac tatttttatt ttattcctcc gaatagaaca 37261 tggccatagg ggtggatacc cccactatct gtactatctg taaaaagatc tcaggtgcga 37321 atccaccggt cgatctatct atccgtatat agatatatga tctagcatgc ccatttgtga 37381 aataaaaaat aaaattccat ttccccccac tccatgtacg aataaagtgc gaaaggggga 37441 gtagtaataa gtcatataga atcaatggat tcatgataaa gtaaaatccc tcgatgacat 37501 attttatcac aattaatatt ttttggctga tagagggatc aaatggtata tagttcattt 37561 gttggtagct tggaggatta aaagcatgac tcttgctttc caattggctg tttttgcatt 37621 aattgctact tcattaatct tattgattag cgtacccgtt gtatttgctt ctcctgatgg 37681 ctggtcaagt aacaaaaatg ttgtattttc tggtacatcc ttatggattg gattagtctt 37741 tctggtgggt atccttaatt ctctcatctc ttgaacctat tcgtcgcaga cccaaaacca 37801 aaatgacccc cctaattttt ctcggttgtg agacacatta aattggaatc taagtcccca 37861 aagaaaacgc aaatcaaata aagaaaacaa aaaaattaga ggggggtcaa acttcttgaa 37921 taaaaagaat acaattaaaa aaataattgg aatcgttccg aagagaatat gtgtcccggc 37981 actgcacaaa aaagatccgg ttatatatca tatatgtggg tacatattgt gtatcaagaa 38041 caaaaaaatg cggatatggt cgaatggtaa aatttctctt tgccaaggag aagatgcggg 38101 ttcgattccc gctatccgcc caagatccaa gataaagtaa ttttattact atttatttat 38161 tatttaattt cataaatagc attaaatata tccttaaatt aaggatttgg tatagttggc 38221 cgtgatagtg tagtgattct atccctcccc tacgttttct ttttccttcc acccccaaaa 38281 agcgaaaggc gggaattaat tactagttaa cagagtcaac cctaaaatag tttggcaaaa 38341 caagatgttg cggagacagg atttgaaccc gtgacctcaa ggttatgagc cttgcgagct 38401 accaaactgc tctaccccgc gccgaagata agaactgaaa actaatagat aaacaaggat 38461 taaatgcgcc cctccaccct atctgtacaa atagaatagc ccatttatac agaatggtaa 38521 aggggcttct atgatcatcg accatagaaa tagaaatgaa gcgttaatcc ttaccaactt 38581 gatcttgttg ctcctggcaa caaacatgca tgaaccattt cacgaagtat gtgtccggat 38641 agtccaaagt ctcgatagtt agctctcggc cttccggtca aaaaacaacg tcgatgaagg 38701 cgtgtaggtg cactattccg tggtggggat tgtaactttc cataaatttc ccatttgtca 38761 ctcaacgacg gaaccttgct tatttctttc tttgaggatc gacgaatcga atgatatttc 38821 tgttccaatt tttgcctctt cttctccctc tgaatcaaac ttttccttgc cataatggtt 38881 gaattcctat tagtatccat gatacaagtc gaatcctaga tgtagaaata gaagaaggtg 38941 gaccccctct ccgtcgaaag aaatgagatt atcgcagata cacacattaa aaatattaac 39001 caaatttgcc cgacgtagag gcaatcaaga aagccgcata agtgaatata taacctacag 39061 aaaagtgagc taatccaacc aatcttgctt gtacaatgga aagggccact ggtttatctc 39121 tccagcgaat caaattggcc aaaggtgtgc gttcatgagc ccatgctaaa gtttcaatca 39181 attcctgcca atatccacgc caagaaatta agaacataaa tccagtagcc caaacaagat 39241 gtccaaataa gaacatccat gcccaaaccg ataaactatt cataccaaaa ggattatatc 39301 cgttgataag ttgtgaagag tttaaccata aataatccct taaccagccc atcaaataag 39361 tggaagattc attaaactgt gaaacgttac cctgccataa tgtgatgtgc ttccaatgcc 39421 aataaaaagt aacccatcca atagtattta acatccaaaa aactgccaaa taaaacgcgt 39481 cccatgccga aatatcacaa gtaccgcctc gtcctgggcc atcgcacgga aaactataac 39541 cgaaatcctt tttatctggc attaacttgg aaccacgtgc atctaaagca ccttttacta 39601 agatcaatgt agttgtatgt aaaccaagag caatagcatg atgaaccaaa aagtctccag 39661 gacctattgt taaaaataat gaattactat tttcattaac agcatttaac caacccggca 39721 accagatgct tcgacccgca ttgaatgctg gaccactcgt tgaagataaa agtacatcga 39781 acccatatga agttttacca tgagcggatt gtatccattg agcaaatata ggttcaatca 39841 agatttgctt ctccggagtg ccaaaggcaa gcatgacatc attatgaaca taaagtccca 39901 gggtatggaa tcccagaaag aggctggccc aacttaaatg agatatgata gcttctttat 39961 gctctaacat tcttgccaat acattatctt cattttgctc cggattgtaa tctctaatga 40021 aaaatatagc tccatgagca aaagctcctg tcatgatgaa tcctgcgata tattggtggt 40081 gggtatataa tgcagcttga gtagtaaagt cttgtgctat gaatgcataa gcaggtaaag 40141 agtacatgtg ttgagctacc aaagaagtaa taacccctaa agaagctaga gcaaggccta 40201 attgaaaatg aagcgaatta ttgattgtgt cataaagacc cttatgtcca cgccccaatc 40261 gtcccccggg gggaatatgt gcatctaaaa ggtctttcat actgtgccca atcccgaaat 40321 tggttctata catatgacca gcaacgagaa aaataaatgc aatagctaaa tggtgatggg 40381 caatatcagt cagccataaa ctttgcgttt gtggatggaa tcccccgaga agagttagaa 40441 tggcagttcc cgccccttgg gcggtaccaa ataaatgact acttgaatcg gggttttgag 40501 cataaagatt ccattgacct gtaaaaagtg ggcctaaccc ttggggatgc ggtaatacat 40561 ctaagaaatt attccaccga acgtactccc ctctggatgc aggaatagca acatgaacta 40621 aatgccctgt ccaagccaag gaacttacgc caaagagtcc tgacaaatga tgattcagac 40681 gagattcggc atttttgaac caggaaacgc tcggtttcca tttcggttgt aggtgtaacc 40741 aacctgctat taaggatatg gcagaaagaa ataatagaaa aagagcacca gtataaagat 40801 cttcattagt gcgtaaaccg attgtatacc accactgata aacaccagaa taagcgatat 40861 tcactgggcc aagagcaccc cctcgagtaa aagcttccac ggccggttga ccaaaatgag 40921 gatcccaaat tgcatgagca ataggtctta catgtaaagg gtcctgtacc cacgactcaa 40981 aatttccttg ccaagctaca tgaaacagat ttccggaagt ccacagaaaa attattgcta 41041 attgaccaaa gtgagaagca aaaatattct gataaagacg ttcctcagta atatcatcat 41101 gactctcgaa gtcatgtgcg gtagcaatac caaaccaaat acgacgagta gtggggtcct 41161 gagctaagcc ttggctaaac cttggaaatc gtaatgccat aatgcttttc aaatcctcct 41221 agccattatc ctactgcaat aattcttgct aagaagaatg cccatgttgt ggcaattcca 41281 cccagaaggt aatgggttac tcctacagca cgtccttgta taatgctcaa ggctctcggc 41341 tgagtagcag gagcaacttt taatttatta tgagcccaaa cgatggattc aataagttct 41401 tgccaataac cacgtccact gaatagaaac attaaactaa aagcccagac aaaatgagca 41461 cctaggaaaa aaaggccata tgcagataat gaagaaccat aagactgaat tacctgggat 41521 gcctgtgccc ataagaaatc gcggagccac ccattaatag taatagaact ttgcgcaaag 41581 tttcctcccg tgatatgagt tactacccct tgatcactta cactgcccca aacatctgac 41641 tgcattttcc aactgaaatg gaatattact accgaaattg cattgtacat ccagaatagt 41701 cctaagaaga catgatccca ggccgatact tgacatgtac cccctcttcc aggtccatca 41761 caaggaaaac gaaaaccaag gtttgcttta tccggtgtca aacgggaact gcgagcaaat 41821 agaacacctt tcaagagtat cagtgccgtc acatgaatcg taaatgcatg aatgtgatgt 41881 accaagaaat ccgcggttcc taatggaata ggcaacaaag ccaccttgcc acccactgcc 41941 actaaatcac caccccccca agttaaactg gtacttgctg ttgcaccagg agccgttgca 42001 ccaggtgcta aagcatgggt gttttgtatc cattgagcaa aaacgggttg taattgtata 42061 gcggtatctg aaaacatatc ttgaggacgc cctaaagcgc tcatggtatc attatgaata 42121 tacaaaccaa aactgtgaaa gcctagaaat atacatgccc agttgagatg ggatatgatt 42181 gcatcacgat gtctaaggac acgatctaat agatcgttgt accgagtagt tggatcataa 42241 tctcttacca taaaaatggc tgcatgcgcg gcagcaccaa ctatgagaaa tccaccaatc 42301 cacatgtgat gtgtgaacaa tgacagttgt gtaccatagt cagtagctag atacggataa 42361 gggggcatgg aatacatatg gtgagctaca acaatggtta aagagcctaa catagctaag 42421 ttaagagata attgagcatg ccatgacgtt gttaggatct catataggcc tttatggccc 42481 tgacctgtaa atggaccttt atgagcttct aaaatatctt ttagtccatg accaataccc 42541 cagttggtcc tatacatgtg acccgctatc aggaaaagaa ttgcaatagc taaatggtga 42601 tgggcaatat cagtcagcca cagaccccca gttactggat ctaatcctcc acgaaaagta 42661 agaaagtccg catattttga ccaattcaag gtgaaaaatg gggttgctcc ctcggcaaaa 42721 ctgggataaa gttgagccaa aagatctcga ttcaagataa attcatgagg aagtggtatc 42781 tctttaggat ctactccagc gtttagaaat tggttaatcg gtaaagatac atgtacttga 42841 tgccccgccc aagagagaga cccaagtcct agtagccctg ccaaatggtg attcagcata 42901 gattctacat cttgaaacca agccaatttt ggcgccgctt tatgataatg aaaccaacca 42961 gcaaaaagca ttaacgctgc aaagaccaat gccccaattg ctgtacaata gagttgtaat 43021 tcactagtta ttccagatgc tcgccaaatc tgaaaaaaac cagaggttat ttgtattcct 43081 cggaaacccc cgcctacgtc accatttaat atttcttggc ccactattgg ccaaaccacc 43141 tgggcactag gcccaatgtg agttggatca cttagccacg cttcataatt agaaaaacga 43201 gcaccgtgga aatacatgcc gctcagccaa agaaagatga tggagagttg accgaaatgt 43261 gcactaaata cttttcgaga gatctcctcc aaatcactgg tatggctatc gaaatcgtga 43321 gcatcagcat gtaggttcca gatccaagtg gtagtatcag gccctttagc tattgttctt 43381 gagaaatgac ccggtctggc ccattcctcg aacgaagttt ttacgggatc cctatctacc 43441 aaaattttaa cttctggttc cggcgaacga ataatcattg agtcctcctc tttccggaca 43501 acacatacaa agagacccgc caacagtcaa ataattagtg aaccttagag atagagagat 43561 atttctataa ttagttcgtt tctcttctat ttttctatct cccatctatc tattttcttt 43621 agttatttac tagagcaatt atgatctgga agtcgatccg gggcaagtgt tcggatctat 43681 tatgacatag ccttgaggcg ctcaacggac cttttaacct tctaaaaacc tttttgggct 43741 ttggattgat ccaaaaacga cttttttgtg caacctagtg tatattcata gaagttatta 43801 gatggagctc tttaattttt tacctagaag attttaatta ctctattcca aatcacgcga 43861 gtagccatta gacattacta agagacatcc ccgctatata tatttagtga ttcgagggtt 43921 tattttatta gttttaataa taagaatttt gtttaattta atataataaa caaagtctat 43981 tttgtactct atctgtgtat ccttttttat tcctaaaaaa tagcagatga aatagaaggc 44041 ttagaaggga gataatgaaa ttatgtgatt gggtcttcca aaagcaaagg aataatccgt 44101 tttttagtta actgatctga tgggtccaac aaacaataaa ttataacaaa tatctaaatt 44161 ctaaataaaa aaatcaaaaa taatagacta agattctaaa taaaggataa taaataaacg 44221 ggatcttctt ttattcgaaa cgtctcgtga tcttcaacca attatgcgct tcaatataat 44281 taccgggagt aagcgctata gcctgtttcc aatactcagc ggcttgatcg aaccaagcct 44341 ctgcaatttc agaatctccc tgttgaatgg cctgttctcc ccggccggaa taggtagttc 44401 aattccttcc cttagaaccg tacttgagaa tttcttacct catacggctc agcagtcaat 44461 tcttttggtg tcccattttg atctatacca tatctaataa aatctaatga gatttctcat 44521 ggatctatcc cagttttagg gttaaccaaa agccaaatag gttaattaca tgagtttcaa 44581 actgaaattt ggatgaataa tccgtttatt tagttttatc ttttttccca ccttcagaag 44641 aataaagcat aggcatttct actagtgtta gaattttatg aaaggtaact atctcggttt 44701 catagataaa tttatataga atctttgaaa aagactttct ttcataagaa agaaaatact 44761 tactatcttt gggatctgat cctacaccgc tgctcaagac tttagtggat cgactctatt 44821 acataagtta attcctaatt tttatttcac atcatgagat aagtatttct tccatcatga 44881 cataagtacg cagttattat tgtatcggcc caaaacctcg ctaattgatc tttacggtgc 44941 ttcctctatc tctatcaatt aaagccttat atccatagaa aaaagttgct aggcattttt 45001 attttttcct attttgactt ctatgaagtt tctttctttg ctacagctga taaaaatcgt 45061 tgttttagac gatgcatatg tagaaagcct atttggttct actagttact ttactagatt 45121 tttctttttt tttttttttt tctttctata gtggagatag tcgcacgtaa tgacagatca 45181 cggccatatt attaaaagct tgtggtaaga atgggtttcg ttctagtgct cgaaaataat 45241 attccaaagc tttcgtatgt tctccattac ttgtgtggat aagccctata ttatagagta 45301 tataacttcg atcataggga tcaatttcta gtcgcatagc ttcataataa ttctgcaaag 45361 cttccgcgta atttccttcg gattgagccg acatccgtta cggtcgtcat tcaattgaaa 45421 gaatctccgt tccagaaccg tacgtgagat tttcacctca tacggctcct cccttatgtg 45481 cataatgaga ataatacata gaatcaaaaa agattcaacg atgaaaatat tctcattatg 45541 aactcagcag ggctagtgtt tttacaagaa atctctagcc aaccttcctg caagagattc 45601 tttcttaaca tcaagcctat tgggactaga tagaaatgat aagataactc caacaatttc 45661 tttgttttta acgcctccta atttccagga attagtcact tcaatagcct tcgatggtta 45721 tacgggtatc caaaggacga acgagatgga tgtttgttgt cccaaccatt cttttagtcc 45781 caagcccgct aaggaaaggg ctgacttaga acaaagtttt cgtgttgttg attcctaggt 45841 gtagtgcttc ttcccctctg ctgcctatta gcgctagtag agtaggattg acccgtaata 45901 cagaacctct aggcgtaacc tttcgcttaa tactagaatc gagaatcgaa acatagcatc 45961 tgaggttgca ttaatcgagg atacacgaca gaaggaattg ttctatttcc aaacttcacc 46021 ttcaaaaagc gtagattttt tcaaaaattt tctcgaatca cgtgtttttc tcctcgtaag 46081 actgagagaa atgactaaat atgaaataaa aaaaaaaaaa gaatcaaatc gcaccatctc 46141 tgtaataggt aaatgcctct ttttctcctg aagttgtcgg aattactcgt aataagatat 46201 tggctacaat tgaaaaggtc ttatcaataa aatttccatt tatccgtgat ctaggcatag 46261 gtagcaatcc attctagaat tcttctcatt acctctcatg ggaaaaagat cccacaaaga 46321 aaagaattgt atagtacgaa ataacataaa aacttctttt ttttttaaga aaaaaacaaa 46381 agatatgaat cctctattcc aattgttcct ttttgacagg aatcgataag aaataagaaa 46441 tatttcaagg cgattcgatt tcatactaat gtagtagtat aggaactatt ccgatttcgg 46501 tgaagttaca aattcgaaga actcgagaaa ttttgattga atcatgatac aaattacaaa 46561 gaagaaaaaa gaccgaataa tcattctatg atgaaaatag aataactgcc aattttgtgt 46621 acataacggg tatacactat acaatcaaat ctaaattttt tttatgaatt tctattctaa 46681 tagaggggta ggtgtttgtt gttgagaact ccaaaaccga aaagtaattt gaaaattttt 46741 ctggtatgga atcatagtct atataattag aattatgatt taagagtatc cattaactat 46801 agtctaaaag atatagacca tcaatcagtt gattcgttct aattcattga attaatccgt 46861 tataaaatat cagaaaaaga aaaagaaggg aacgttgttt tgcaaacatg aatcgaattt 46921 tttttttcac aatttttacg caaaattgta tctttatccc ggagcctcga aggaaagaaa 46981 aatcgttctt tgctttgact ttgatgaaaa attttcagtt aaaatggatt gatcatacct 47041 atccaataat ggaatatgga ttatgactga ctcgctattc actcggtttt tgggtcataa 47101 tcgttatgta ggagagatgg ccgagtggtt gaaggcgtag cattggaact gctatgtagg 47161 cttttgttta ccgagggttc gaatccctct ctttccgtac cttcgcttaa ttcaccaatt 47221 ttactaacaa caagggctca aatagcaatg gataccatta ttccaacagc tagacccttc 47281 tttgatctaa agatatagat tctcaattcc taattgctgt gacgcgtaaa atagaatact 47341 aaaaaataat aataatcaaa atactggaaa gaaaagagta gacaaggaat gaaaatagat 47401 ccttggtcta tgatacaaaa atgggggaaa tccagatcaa actcggattt atcttactta 47461 accttaggtt aatttacttc gcctaaaggg aagaaaattt tccgaaccct cggtttcagt 47521 ctgaggttta agtctgacga gaataatatt ctacgactag caattcattt attttcaaac 47581 cgacccattt actatctatt atttgattga ctaatccttt atattggaat gggtgaaggg 47641 tcaaatggtt tggtaattcc tcatgagggg atgaatcgag agaaatttga atcagagctc 47701 tggatttttg ttcatccttt gccgtaataa tatctcgggg tttgcagcga taactcggta 47761 tatctactat acgaccatta actaaaatat gtcgatggtt aactaattga cgggctgcgg 47821 gaatagttga agccataccc aatcgaaaaa ggatgttatc caaacgcatt tcaagtaatt 47881 gtagtaaaac ttgacctgtt gaccccttgg cttttctggc gatacgaacg tatttaagta 47941 attgtcgttc tgtaagacca taatgaaaac gcaatttttg tttttcttct agacgaatac 48001 gatattgaga ttttttcccg gaacgcgatt ggtttctaag atcacttccg ttcctaggct 48061 ttttattagt tagtcctggt aaagccccca ggcggcgtat ttttttgaaa cgaggtcctc 48121 ggtaacgcga cataaagact ccttattctt atttcttatt tagtatttcg aattaattct 48181 tatttctatt tattttattt tttattgaat tttattttac agaataaacc taaactaaaa 48241 ctaaactgaa tctaaatgaa gcgaagttta ctgaaatagt gtacttgtac tattactata 48301 aagaaaagaa gaatgggatg aattggataa atatacagac ccccttctat tatatatata 48361 atcctttccc gacataattg gaagttccta taataaattg atagcttttg gaaaaggaag 48421 aaggcgctat ttcaatattc tttgatttca aaggaacatt atcaatcatc taaaaaatgg 48481 aataaaaaaa aaagaatagg gaaaagccgg ctatcggaat cgaaccgatg accatcgcat 48541 tacaaatgcg atgctctaac ctctgagcta agcgggccca cataacagaa atcttatatg 48601 catagtaatt gactaaacta ttggaattgg aatcttagtt attaactatt caatattata 48661 ttgaatattc tagaacataa ggattaatat agcgatatag aatttcgatt tatcacaatt 48721 ctaataacaa ttctaatact aatattatta aatagtgatt gtaaatattg ttaatattct 48781 ttttttttca ttttccattt gaatggtaaa tgttcttttt catttctttt tttgtcattt 48841 gaaatccttt tgatttttta ttacagttct atattttatt ctatatcata tatatctctc 48901 attctatatt tatttcaaat tctaattgtt taatggaatg gttagttata actaatgaga 48961 cattcctccg ctttcaggcg aaagtgaaga taaaaaaaaa gaatcgaccg ttcaagtatt 49021 ccaaattgaa tggcaaaatg gcaggaagag agacatatag atggggtata tatccatcta 49081 tattgaattg cggattccga aatgataaaa tcatttttga ttggacaaaa aaaggtctcc 49141 tatagaagat agttaagaaa atcaaagagg agaaaacacg ttttcgagat aggaatcggt 49201 atctaatgaa ttcaatggtt ccagtataaa tgaaagaaaa agaaaaagga atgacatcac 49261 aacgagatcc taatctcaaa aagaaagggg gatatggcga aatcggtaga cgctacggac 49321 ttaattggat tgagccttgg tatggaaact tactaagtga tcactttcaa attcagagaa 49381 accctggaat taacaaaaat gggcaatcct gagccaaatc ctgttttccg aaaacaaaca 49441 aaggttcaga aaaaaaggat aggtgcagag actcaatgga agctattcta acaaatggag 49501 ttaaatgcgt tggtagagga atctttacat cgaaacttca gaaagaaaaa gaatgaagtg 49561 aaggataaac gtatatacat acgtattgaa tactatatca aaatcaaatg attaatgatg 49621 acccgaatct gtattttttc tataaaaaat agaagaattg gtgtgaatcg attctacatt 49681 gaagaaagaa tcgaatattc attgatcaaa ccattcactc catagtctga tagatctttt 49741 gaagaactga ttaatcggac gagaataaag atagagtccc gttctacatg tcaataccgg 49801 caacaatgaa atttatcgta agaggaaaat ccgtcgactt taaaaatcgt gagggttcaa 49861 gtccctctat ccccaaaaag actatttcac tccccaacta tttatccgac cccctttcct 49921 tagcggttcc aaattcctta tctttctcat tcactctatt cttttagaaa tggatttgag 49981 cgtaaatggc tttctcttat cacaagtctt gtgatatata tgatacacat agaaatgaac 50041 gtctttgagc aaggaatccc tagttgaatg attccctatc aatatcatta ctcatactga 50101 aacttacaaa gtcatctttt tgaagatcga agaaattccc cggctttgag aaaattttta 50161 atctactttt gtccttgtaa ttgacataga ccccagttct ctaataaaat gaggatacta 50221 cattgggaat agccgggata gctcagttgg tagagcagag gactgaaaat cctcgtgtca 50281 ccagttcaaa tctggttcct ggcacatgat taatttgtat gggtctctct tccctcgaat 50341 taatttctaa ttaattgata tgaatcaaca tacatattct tttagagtct agattagaat 50401 aatagcttta tccagtttgg cgagatatac cccatctatg ttctagatgg gtagagtttc 50461 ttagataaag tatctaaaag aattggattc tatctcctct tttttttctc ctctcgttca 50521 accgaatttg aatacgtaat acatattcga aaggttcaat tggttaattg ttgaaaggct 50581 caaaagtcga atccgaatct aggggggttg aaatagacaa gattcagctc agatccaaag 50641 aaatagaatc cgatattctc tcatttcttt gtcttttctt tcatattcga tttcttcatt 50701 ccggatttct ccattccttc ctatatgcct ttctagaacc catctaagta atgtgcgcag 50761 tacaaagttc atgatgcaga actcatttgg ttcatcctat tggtgtgacc catccgaaat 50821 aagtatcttc caaataaatg tgagaattcc aatgaatccc taattgtctt tttttgttag 50881 cctatcgata attccctaaa ttagacctgc ttaatctaga acagaacgtg caatccttga 50941 atatctgaaa ttgtctaagt ggaaatagct ttcttatcat tcaatgagca tcttgtattt 51001 cataaaaatt gggggcaata taatccttac gtaagggcca tcctatccaa ctttcaggca 51061 ttaagatacg tttcaagcgt ggatgattat cataagagat tcccaacata tcatatgatt 51121 ctcgttcttg aaaatccaca cttttccaaa cccagaaaac agacggaatt ctaggattcc 51181 tcctggaggc aaatactttt atgcatacct cctctggttg atccacacca tcctctattc 51241 tcgtaagatg atacacacta gctaacagcc cgccaggcgc tacatcatag gcacattgag 51301 agcggagata gttgtaccca tatacataaa aaatgacagc aatggaatgc caatcctcgg 51361 gctttatttg taaagtctct attccttggt aatcaaagcc caaagatcta tgaattagcc 51421 catgcttgac tagccaagca gacaaacgac cctgcatctt ttttatctct cccgcatttt 51481 tatttatata agtatttcac atttacgatg aaatttctga aaattgaccc accacttttt 51541 attctggaca aaggaatcct gtctaattca ctaattcggg ggaagatact gaatttttgt 51601 atttgaaaaa gatttccgta gggatctctg aagtagatgg gggttgataa agaactcttt 51661 gatcataatt tcccgtatga atactgtgtt gaacatgaaa cttgtgattg gtagtaaaac 51721 accgattcgc tcgttgagac ctaattcgat cttcatagag ttctcgagat attttcttac 51781 gaagttttgt tatagcatct ataaccgctt ccggtttagg tgggcaacct ggcaaatata 51841 catctacagg aattagctta tcgactcccc gaacagtact ataagaatcg gtactgaaca 51901 tcccgcctgt aattgtacag gctcccatag caataacata ttttggttca ggcatttgct 51961 catataatct cactaaagag ggggccattt tcattgttac tgttccggct gttaaaatta 52021 gatccgcttg tctaggactc gatcttggta ctagtccata acgatcaaag tcgaagcgtg 52081 agcctattag tgaagcaaat tcaatgaagc aacaactggt accatagaga agcggccata 52141 aactagagag tcttgaccaa tttgaaagat catttaatgt agttgaaata actgaatttt 52201 gggttgttcg atcaagtaaa ggaaactgaa tggaattcat aactgtctca atcttatttt 52261 ttccgttttt ctttttattg tctgaatatt caggagctaa gaccattcca atgccccctt 52321 tcgccatgca taaactaaac caataattaa gataagcacg aaaatgaaag cttctataaa 52381 tacagataca cccaatacgt cgaaactcat tgcccatgga taaagaaaaa ccgtttcaac 52441 atcaaaaaca acaaaaacta gagcaaacat ataataacgg attcgaaatt gtaaccaagc 52501 atcgcccatt ggttctatac ccgactcata agtagaaagt ttctccggcc ctttgctaat 52561 cggggctaac actccggaaa ttaaaaatgc caaaatagga acaaggatag atattattag 52621 aaatgcccaa aaaaaatcat attcgtaaag cagaaacata aacgcactcc tatgaacgtg 52681 gaaaatatac cggattcgat tggtcgattc gaattggaat tgtcaagtca tccataacta 52741 tttagtcaaa acaagaattc attttgatcg aaccgtctag tttgctttgt ttattggttt 52801 attgtagggc atatctcatt gcaagattca tcgactggaa tccgatttta tttccattat 52861 acttatttcc attttattta gttagtagaa ccttctaact atatattact cttatacaaa 52921 ttctcttgtt tctcttgttt tcatccagga ttttctctaa agacggggaa ttctaaatta 52981 attacttatc ttatttcttc tttaattaga aattctttaa agatttctat ttttttctat 53041 aaatagaatc aggaggtctt ttttcttatt ttttcttagt gatttagaat agaacaagta 53101 atcaaataga agagaatgta taggaatttc catctcaaga tttagaagat cttgtgttgg 53161 tatattcctt attattatta tttaataata gtattagggt tcgaatccag gtgacggggt 53221 ttttcttggt tgaatacaga aaaagaggac tggccttttt cgtgttgtgc ttcgctaggt 53281 cgaggtaagt aaggtatacg aaggaaaagc ctatttgaca atgaaagtga ccaaaggtat 53341 tcgtttttca aaaaacttta gcttgtacac aaatacagca ggcccttcct aaatccatgt 53401 gaattcctct tcgtagtttt tcatttcacc aggcccgtga aatgatttga cttccacaac 53461 tcaataagat tggggatatc aaaagaaagg gagtctcact aattctttta ttgtggatat 53521 gaatatgtaa ttcgcctccg aagattaatg acgaaaggtt ggtttcttta tccgcaattg 53581 aaaaaatcaa tatcgattgg atccgttgat atgcattttt tctttcatct gcttaaacga 53641 ttgccgtgag taaacttata ggaataattg gatttcactt agttacaagc aagaaataat 53701 aatgaagaaa tgaaaattat agaatttttt ggattttgca tttttatagg gctatacgga 53761 ctcgaaccgt agaccttctc ggtaaaacag gtcaaactta ttattattaa aatgatctga 53821 actgtttcaa agacccaaca tgcatttttt ttgcattggg ctctttcatt aactgatata 53881 aatatcagtt agtctgccat tttttttctt gacagaaaaa aagataagga aatggctcca 53941 tgtgctctga ttcattattt gggagcatta ccaaagtgtt tcaaaggtgg gattatcttg 54001 acgtaggtct gtctctggcc tagatcaacc taagttaaat gaagtctcta tcgttctgct 54061 gaaaaaatca aatatgaaac ttcatacacc ttaaagttca tatgacgaaa agagattttt 54121 ttgaggtcct tatactcatt atgcctagca ttgaatagac tgggtattca ccttatcaag 54181 atctcaaatc aatgatgggg tctgtttggc acctcctaaa tgggcgtcca aattggaccg 54241 aactctttgt caggctatgg ttccctcaaa gttatggagt aagacatcga tttctcaaca 54301 agatcaattt ttctgattgt atgatgaact cccttgaaaa acattggcgc gcgtgtaaac 54361 gagttgctct accaactgag ctatagccct tagtgcttgt gatacatatt ttatcatgta 54421 gataaattct tgtcaagata aatattccat gatccaacat caacaatctt tgatctcttt 54481 gagcggtatt ccttagatta gtattgctta ttaagtaata tgatatttat aatccatcga 54541 caggatgggt ttcatttggt tctctttggg atgataaatg acctacttaa ctcagtggtt 54601 agagtactgc tttcatacgg cgggagtcat tggttcaaat ccaatagtag gtaaaactta 54661 ttagatacca gagtcaatgg tatctaataa ggtttacgac ccacccttag tgatattgat 54721 tttttgattt tgtatctttt ctatttcatt tttgaatttg aatttttgca tcagaattgg 54781 attctgtttg attgtatttg attgtattca cccgacagaa tctaaatagg attagaaaga 54841 gaacttcttt ttattattcg aacgtaccaa ctagttatga aatcggattg atagcctcca 54901 cccgtgttct agctcgtcgg agagctagat ttgcctcaat tttttgtctc cttccttcag 54961 cctttttcac attagcttcc gctagttcaa gagtttgctg agcttcttgt ggatcaatgt 55021 cactaccctt ctccgcatca tttactaaaa cagtgatctc attattgcct attctagcaa 55081 aaccacccat cagagccatc gttaaccatt ggtcgttaag acgtattctc aaaatcccta 55141 tatctacagc tgtggcaata ggggcgtgat ttggtaatat gccaatttga ccgctattag 55201 tagataaaac aatttcttcc acttctgaat cccaaacaat tcgattaggg gtcagtacac 55261 taagatttaa ggtcatttct tcaaattgct ctccatttct aagttcatag ccttcgcggt 55321 agcttcatcg atattaccta ccaaataaaa ggcctgttca ggaagaccat ctaattctcc 55381 ggaaaggatc aattgaaatc ctcgaattgt ttctgctaga ccaacatatt tacctggaga 55441 accggtaaat acttctgcta cgaaaaaggg ttgtgataag aaacgctcaa tttttcgcgc 55501 tcttgctacg agtaaacgat cctcttcgga taattcgtcc aatccaagga tagctataat 55561 gtcctgaagt tctttgtaac gttgtaaagt ttgcttaact ctttgggcgg tttcgtaatg 55621 ttcctcacca acgatccgag gttgaagcat ggttgacgtt gaatctaaag gatctactgc 55681 tggataaata cctttggcag ccaatcctct tgatagtacg gtagtagcat ctaaatgtgc 55741 aaatgtcgta gcaggagcag ggtcggtcaa atcgtctgcg ggtacataaa ctgcttgaat 55801 agaggttatg gacccttctt tggtagaagt aattctttct tgtaaagaac ccatttcggt 55861 actcagggtg ggttgataac ccacagcgga aggcattcta cccaataagg ccgatacttc 55921 ggatcctgct tggacgaaac ggaagatatt gtcaataaaa agaagtacgt cttgctcatt 55981 aacatctcgg aaatattccg ccatagttag ggcagtcaaa ccaactctca tacgagctcc 56041 cggcggttca ttcatctgac cataaactag ggctactttt gattctgcaa tattttcttc 56101 attaattact ccagattctt tcatttccat gtaaagatca tttccttccc gagtacgttc 56161 acccactccg ccaaatacgg atacgccccc gtgagcttta gcaatattgt taatcaattc 56221 cataataagt actgttttac ccactccagc tcccccgaat agtccgattt ttcctccacg 56281 gcgataaggg gctaaaagat ctactacttc aattcctgtt tcaaaaatag ataattttgt 56341 atccaactgt ataaaggcgg gcgcagatct atgaatagga gacgttgtac tagtatctac 56401 aggccctaaa ttatcaacag gttctccgag cacgttaaaa attcgtccca gagtcgctcc 56461 cccgaccgga acacttatag gagctcctgt gtcaatcact tccattcctc tcgttagacc 56521 ctctgtagca ctcatagcta tagccctaac tcgattattt cctaataatt gctgtacctc 56581 acaagccaca ttaattggtt gaccaacact atctcgacct tgaactacca gagcgttata 56641 aatattcggc atcttgcccg ggggaaaggc tacatctagt accggaccga tgatttggac 56701 gacacgcccc gggttttttt tttcaagcgt ggaaacccca gaaccagaag tagtaggatt 56761 gattctcata ataataaaat aaataaatat gtcgaaatgt ttttgcaaaa attatcgaat 56821 tcaaaataaa tgtccgctag cacgtcgatc ggttaattca ataaaatggg aattagcact 56881 cgatttcgtt ggcaccatgc aattgaaccg attcaattgt ttacttattc actgagactg 56941 agtgaatttg caagcccacc caacctattt taattttaaa atctcaagtg gatgaatcag 57001 aatcttgaga aagtctttca tttgtctatc attatagaca atcccatcca tattatctat 57061 tctatggaat tcgaacctga actttatttt ctatttctat tacgattcat tatttgtatc 57121 taattggctc ctcttcttat ttatttttga tttcaatttc agcatatcga tttatgccta 57181 gcctattctt ttctttgtgt ttttctttct tttttatacc tttcatagat tcatagagga 57241 attccgtata ttttcacatc taggatttac atatacaaca tataccactg tcaaggggga 57301 agttcttatt atttaggtta gtcaggtatt tccatttcaa aaaaaaaaaa agtaaaaaag 57361 aaaaattggg ttgcgctata tatatgaaag agtatacaat aatgatgtat ttggcaaatc 57421 aaataccatg gtctaataat caaacattct gattagttga taatattagt attagttgga 57481 aattttgtga aagattccta tgaaaagttt cattaacacg gaattcgtgt cgagtagacc 57541 ttgttgttgt gagaattctt aattcatgag ttgtagggag ggatttatgt caccacaaac 57601 agagactaaa gcaagtgttg gattcaaagc tggtgttaaa gagtacaaat tgacttatta 57661 tactcctgag taccaaacca aggatactga tatattggca gcattccgag taactcctca 57721 acctggagtt ccacctgaag aagcaggggc cgcggtagct gccgaatctt ctactggtac 57781 atggacaact gtatggaccg atggacttac cagccttgat cgttacaaag ggcgatgcta 57841 ccgcatcgag cgtgttgttg gagaaaaaga tcaatatatt gcttatgtag cttacccttt 57901 agaccttttt gaagaaggtt ctgttaccaa catgtttact tccattgtag gtaacgtatt 57961 tgggttcaaa gccctgcgcg ctctacgtct ggaagatctg cgaatccctc ctgcttatgt 58021 taaaactttc caaggtccgc ctcatgggat ccaagttgaa agagataaat tgaacaagta 58081 tggtcgtccc ctgttgggat gtactattaa acctaaattg gggttatctg ctaaaaacta 58141 cggtagagcc gtttatgaat gtcttcgcgg tggacttgat tttactaaag atgatgagaa 58201 cgtgaactca caaccattta tgcgttggag agatcgtttc ttattttgtg ccgaagcact 58261 ttataaagca caggctgaaa caggtgaaat caaagggcat tacttgaatg ctactgcagg 58321 tacatgcgaa gaaatgatca aaagagctgt atttgctaga gaattgggcg ttccgatcgt 58381 aatgcatgac tacttaacgg ggggattcac cgcaaatact agcttggctc attattgccg 58441 agataatggt ctacttcttc acatccaccg tgcaatgcat gcggttattg atagacagaa 58501 gaatcatggt atccacttcc gggtattagc aaaagcgtta cgtatgtctg gtggagatca 58561 tattcactct ggtaccgtag taggtaaact tgaaggtgaa agagacataa ctttgggctt 58621 tgttgattta ctgcgtgatg attttgttga acaagatcga agtcgcggta tttatttcac 58681 tcaagattgg gtctctttac caggtgttct acccgtggct tcaggaggta ttcacgtttg 58741 gcatatgcct gctctgaccg agatctttgg ggatgattcc gtactacagt tcggtggagg 58801 aactttagga catccttggg gtaatgcgcc aggtgccgta gctaatcgag tagctctaga 58861 agcatgtgta aaagctcgta atgaaggacg tgatcttgct caggaaggta atgaaattat 58921 tcgcgaggct tgcaaatgga gcccggaact agctgctgct tgtgaagtat ggaaagagat 58981 cgtatttaat tttgcagcag tggacgtttt ggataagtaa aaacagtaga cattagcaga 59041 taaattagca ggaaataaag aaggataagg agaaagaact caagtaatta tccttcgttc 59101 tcttaattga attgcaatta aactcggccc aatcttttac taaaaggatt gagccgaata 59161 caacaaagat tctattgcat atattttgac taagtatata cttacctaga tatacaagat 59221 ttgaaataca aaatctagaa aactaaatca aaatctaaga ctcaaatctt tctattgttg 59281 tcttggatcc acaattaatc ctacggatcc ttaggattgg tatattcttt tctatcctgt 59341 agtttgtagt ttccctgaat caagccaagt atcacacctc tttctaccca tcctgtatat 59401 tgtccccttt gttccgtgtt gaaatagaac cttaatttat tacttatttt tttattaaat 59461 tttagatttg ttagtgatta gatattagta ttagacgaga ttttacgaaa caattatttt 59521 tttatttctt tataggagag gacaaatctc ttttttcgat gcgaatttga cacgacatag 59581 gagaagccgc cctttattaa aaattatatt attttaaata atataaaggg ggttccaaca 59641 tattaatata tagtgaagtg ttcccccaga ttcagaactt tttttcaata ctcacaatcc 59701 ttattagtta ataatcctag tgattggatt tctatgctta gtctgatagg aaataagata 59761 ttcaaataaa taattttata gcgaatgact attcatctat tgtattttca tgcaaatagg 59821 gggcaagaaa actctatgga aagatggtgg tttaattcga tgttgtttaa gaaggagttc 59881 gaacgcaggt gtgggctaaa taaatcaatg ggcagtcttg gtcctattga aaataccaat 59941 gaagatccaa atcgaaaagt gaaaaacatt catagttgga ggaatcgtga caattctagt 60001 tgcagtaatg ttgattattt attcggcgtt aaagacattc ggaatttcat ctctgatgac 60061 acttttttag ttagtgatag gaatggagac agttattcca tctattttga tattgaaaat 60121 catatttttg agattgacaa cgatcattct tttctgagtg aactagaaag ttctttttat 60181 agttatcgaa actcgaatta tcggaataat ggatttaggg gcgaagatcc ctactataat 60241 tcttacatgt atgatactca atatagttgg aataatcaca ttaatagttg cattgatagt 60301 tatcttcagt ctcaaatctg tatagatact tccattataa gtggtagtga gaattacggt 60361 gacagttaca tttatagggc cgtttgtggt ggtgaaagtc gaaatagtag tgaaaacgag 60421 ggttccagta gacgaactcg cacgaagggc agtgatttaa ctataagaga aagttctaat 60481 gatctcgagg taactcaaaa atacaggcat ttgtgggttc aatgcgaaaa ttgttatgga 60541 ttaaattata agaaattttt gaaatcaaaa atgaatattt gtgaacaatg tggatatcat 60601 ttgaaaatga gtagttcaga tagaattgaa cttttgatcg atccgggtac ttgggatcct 60661 atggatgaag acatggtctc tctagatccc attgaatttc attcggagga ggagccttat 60721 aaagatcgta ttgattctta tcaaagaaag acaggattaa ccgaggctgt tcaaacaggc 60781 ataggccaac taaacggcat tcccgtagca attggggtta tggattttca gtttatgggg 60841 ggtagtatgg gatccgtagt cggagagaaa atcacccgtt tgattgaata cgctgccaat 60901 caaattttac cccttattat agtgtgtgct tctggggggg cgcgcatgca ggaaggaagt 60961 ttgagcttga tgcaaatggc taaaatatcg tctgctttat atgattatca attaaataaa 61021 aagttatttt atgtatcaat ccttacatct ccgacaactg gtggagtgac agctagtttt 61081 ggtatgttgg gggatatcat tattgccgaa cccaacgcct acattgcatt tgcaggtaaa 61141 agagtaattg aacaaacatt gaataaaaca gtacccgaag gttcacaagc agctgaatac 61201 ttattccaga agggtttatt cgacctaatt gtaccacgta atcttttaaa aagcgttctg 61261 agtgagttat ttaagctcca cgcctttttt cctttgaatc aaaagtcaag caaaatcaag 61321 tagagcacta agttcaatta ttttatttgt gtttgtagca aaaaagtagt tagtttgtcg 61381 gaatcaaagt aaataagata ataatggcgc tttctttggt gatagaagat ctaattgtag 61441 aaagaatcaa aactaaagtt gaggataact ctttttttga cctatattcc tgattacgaa 61501 tcaagaagcc tttatcaaca agagtgagtt cttcctttcg tgaaattagg aaaataaaac 61561 gaatttcttc ttcttgtctt aggtatataa tttgaaattc aaatatagat aatagagttt 61621 tgtatctttc tctatctccc gaaaaaccat tttagctaaa aattcatgtt gggtcggatt 61681 cgaacgaatc tttcgataat ctgtaagaaa ctctttatct atttttagaa aattagaaga 61741 caagaacaaa agacaaagaa atgaagaaaa ataataaagt ttattatgat acatatcttt 61801 ctcatgtagg ggatgaataa gtccatttat ttagttctac agttctacat tctttgcact 61861 tattatacct actcagttag atttagatat atagatactt agatctatac taagaatttc 61921 aaattcttca aattctatta ataataaata ttatctaatt tctaattagt aattagaatt 61981 caaattctta atttaattat aattattaca agatatcttt atttatataa taacataata 62041 acagatacaa atagtaaatc gaggtacccc ttctatgaca aatttgaacc ttccatctat 62101 ttttgtgccg ttagtaggcc tagtctttcc ggcaattgca atggcttctt tatttcttca 62161 tgttcaaaaa aataagattg tttagatccg ctgggaccca atctcatcca tttttttttt 62221 gaaaacgtgg acttgtatca taacacagat atctatttat tggaatatag tataacatgt 62281 gatttccacc gaacataaag gaaaaaactc ttatgcccgc agaaatatga tatatggata 62341 tatcaattct aacaattttc aaatagatca ggatcgctgg atggctgaaa tgtagtcggt 62401 gaatctctat gtatatcgat atgtatagtg ggatcgtatt aaataaagag tatgttatta 62461 ttttagattt aaccaatttg atgaattact cctaaaggtt gacatcaaac tagtgctagt 62521 tcacctcaaa ctagtgctag ttgatgagag ttacttcgga aacaaaaaag taaagtcaaa 62581 tttctctggg gtattatctc aattccaata aaatgcaatc gggtaaagta tgacttggcg 62641 atcagaacat atatggatag aacttataac ggggtctcga aaaataagta atttctgctg 62701 ggcctttatc ctttttttag gttcattagg cttcttatta gttggaactt ccagttatct 62761 tggtagaaat ttgatatctt tttttccgcc tcagcaaatc attttttttc cacaaggact 62821 cgtgatgtct ttctacggaa ttgcgggtct ctttattagc tcttatttgt ggtgcacaat 62881 ttcctggaat gtaggtagtg gttatgatcg attcgataga aaggaaggaa tagtctgtat 62941 ttttcgttgg ggatttccgg gaaaaaatcg tcgcatattc ctccgattcc ttataaaaga 63001 tattcagtcc gttagaatag aagttaaaga gggtatttct gctcgtcgtg ttctttatat 63061 ggacatccga ggccaggggt ccattccctt gactcgtact gatgagaatt tgactccacg 63121 agaaattgaa caaaaggctg ctgaattagc ctatttcttg cgtgtaccaa ttgaagtatt 63181 ttgagaaatt gagatatcag tatcaggaaa caatattctg aatttcttca ttcgaagtga 63241 attcttagct tttttctgga ttctttctag attcaaagac taaccacaaa atcacaaaga 63301 aaatagattc attagtccga taccttgtat aaaactcatg tgtgtaagaa atattcgatc 63361 gcatagagtg tacgaatggg ttgattaaca attcacagat gaaaaaatgg caaaaaagaa 63421 agcattcact cctcttttct atcttgcatc tatagtattt ttgccctggt ggatttcttt 63481 ctcagttaat aaatgtctgg aatcttgggt taccaattgg tggaatactg ggcaatccga 63541 aatttttttg aataatattc aagaaaagag tcttctagaa aaattcatag aattagagga 63601 actcctcttc ttggacgaaa tgatcaagga atactcggaa acacatctcg aagagtttgg 63661 gataggaatc cataaagaaa cgatccaatt aatcaagata caaaatgaga atcgtatcca 63721 tacgattttg cacttctcga caaatatcat ctgttttatt attctaagcg ggtattcaat 63781 tttgggtaat gaaaaacttg ttattcttaa ctcttgggct caggaattcc tatataactt 63841 aagtgacaca gtaaaagctt tttctattct tttattaact gatttatgta tcggattcca 63901 ttcaccccac ggttgggaat taatgattgg ctctatctat aaagattttg gatttgttca 63961 taatgatcaa atcatatctg gtcttgtttc cacctttcca gtcattctcg atacaatttt 64021 taaatattgg attttccgtt atttaaatcg tctgtctccg tcacttgtag ttatttatca 64081 ttcaatgaat gactgataaa ggatccattg atattaatct aatccaatta gaatgcttgg 64141 tactttgtag ttgtacataa gcaaagtatt gaaaatcata tttactcttt ctatttctaa 64201 ccatcgggga gattcatcct atattattcc tagattattc cagcaaatag cagaatcgtg 64261 gctagggaac tatactagcg acctacccaa tttattgtag aaattttcgc gatcaatgat 64321 tggaccatgc aaactagaaa tgctttttct tggctaaaga aacagattac tcgatctatt 64381 tccgtatcgc tcatgatata tatcttaact cggacatcca tttcaagtgc atatcccatt 64441 tttgcacagc agggttatga aaatccacga gaagcgactg ggcgtattgt atgtgccaat 64501 tgccatttag ctaataagcc cgtggagatt gaggttccac aagcggtact tcctgatact 64561 gtatttgaag cagttgttcg aattccttat gatatgcaac tgaaacaggt tcttgctaat 64621 ggtaaaaggg gggggttgaa cgtgggggct gttcttattt taccggaggg gtttgaatta 64681 gctcctcccg atcgtatttc tcccgagatg aaagaaaaga ttggcaattt gtcttttcag 64741 agctatcgcc ccaataaaaa aaatattctt gtgataggcc ctgtccctgg tcaaaaatat 64801 agtgaaataa ccttccctat tctttccccg gaccctgcta ctaagaagga tgttcacttc 64861 ttaaaatatc ctatatacgt aggcgggaac aggggaaggg gtcagattta tcccgacggc 64921 agcaagagta acaatactgt ttataatgct acagcagcag gtatagtaag caaaatcata 64981 cgaaaagaaa agggtgggta tgagataacc ataacggatg cgtcggatgg acgtcaagtg 65041 gttgatatta tccctcccgg accagaactt cttgtttccg agggcgaatc tatcaaattt 65101 gatcaaccat taacgagtaa tcctaatgta ggcggatttg gtcagggaga tgcagaaata 65161 gtacttcaag atccattacg tgtccaagga cttttgttct tcttggcatc tgttattttg 65221 gcacaaatct ttttggttct taaaaagaaa cagttcgaga aggttcaatt ggccgaaatg 65281 aatttctaga ttcgcagatt tgtcgacatc aagttcgtaa aaagaaccaa attcttgttg 65341 gcgattattt atgatcaaaa aaatgaaatt ctgaaaactc ctttgtctta tttatactct 65401 tcttcaaaat ctacatacta tgtggtacaa gggattccca gcatctcgta gaaaaagagt 65461 atgtaatgta gaatttgaag aagagtattt gactttcatt atttttattt cgttttttaa 65521 aattggagta gtgtgactat gttactattg acagatttca atgccataag acgtatcaat 65581 agttttctat tctaaataga aagaaagtca aatttgtcta aatactagac ataaggaagc 65641 aggggataaa tgcggggaac aaaaaattct aggagggatt atttgtcttc ctagtcttcg 65701 acacaagaaa ggggtgtaga aaaatccttt tttcttgtgt cgaaacgaaa gagtaatgat 65761 tcttgatcct gtttgttaaa aattcctagt cttggtttcg atttttccag atgtatcaga 65821 aaccctttac cttaccccca ccccctttac gtataatata ctaagtggtg gacaaacaaa 65881 acaaaaaaag agaggaaatt ttattaatta aataaaactt cttcaatcaa cttatcttat 65941 acaaaatttg atgatgaaat atgaaaacaa taaaaaataa atagagtaat gtaatagaga 66001 gagtaaggtt ctacattaga ttagtataga aaggatttgc acgatatcta atatattata 66061 gcagccaaga aattgagtga ttccttcttt cttccaactt tgaaagtacc gatagatact 66121 atcatagaaa aagaagaggt ggtccgaata gtgaattttt caaaaacatg atcagaaaaa 66181 tgagaaaaat ggagtttttg aaaagaaaaa gaaatccatt ttatcattta gacgaaaaaa 66241 atattatgat tcttaagaac tcaacgggcc cttccccttc gaatcaaaca aacaaagaag 66301 ggaattccgt tgagttctta cgctttcatg ttgacgactc aattcattcg attactagag 66361 ggatgaaccc aatccggaat atgaaccata aaagaaaata cctattaaac cgattacaag 66421 aataccagct acagtaccta ttatccaaag aggaatcctt ccagtagtat cggccattta 66481 ccccacttcc ctccagattt catcaagtgg tcatgctaga gacataaaca gtcatggata 66541 attaaattat gagatccttc cgaatgagct aagagaatct tattgattct ctttcgtttt 66601 cttaattgaa gaaataattg gaaaataaaa cagcaagtac aaaaatgagt aataaccccc 66661 agtagagact ggtacgattc aattcaacat tttgttcgtt cgggtttgat tgtgtcgtag 66721 ctctataatt cggattaagt ttatcgttgg atgaactgca ttgctgatat tgatcccaaa 66781 aaaaagacgg taggtacagc taggccgtga acagccaacc atcgtactgt aaaaattgga 66841 taggttcgat ctatagtcat tagggcctcc taaaacgatc tactaaattc atcgagttgt 66901 tccaaaggat caaaacggcc agttattaat ggaattcctt gtcggctctc tgtaaaatac 66961 tcgtttggcc gagggcttcc aaacacatcg taagctaaac cggtgctgac aaataaccaa 67021 cccgcaatga atagggaagg tatagtaatg ctatgaatga cccagtatcg aatactggta 67081 ataatatcag caaacgaacg ttctcctgtg cttccagaca tgctgagctc cacatattct 67141 tgtacagtca aagaagatcg attccgtaaa agatgagatc agtaaatgac aattcactga 67201 aatttcatct ttgtgagatc gtcaatattg taccgaaggc gtctttagag tataccgaat 67261 cagtatagct atccttcttc tgacacagca acgcaatttg aaatagtatc aaaagtaagt 67321 actaaataat ttcttttttc ctttacttgt tgatgtaaaa tcatcttcca ttcaatagaa 67381 aattctttca attcaacgaa agagattctc atattcacac aatttaagta gatgcgagat 67441 atagaaattt gcttttcgta gttgtggaag cagttttgtt gttggaatcc tttttttaaa 67501 gaagaagtta atggtcgagt aagaaataag agtagtagat catattcgag gaaagaaaaa 67561 atcgaataat tggaatccat agttgtgatg cattgttgtg gatctcgatc caaaggttct 67621 ttcttgatct agctacaagg atggggcagt agggaaagat aaaatgtgga acctaataga 67681 aattactagt tttagaatct agttggacaa aaaaaagatt ttttcaagcg attgtgtgat 67741 aactttttct tcttctccat cattcaagat attatgtgaa ttaatatatt actaaatcta 67801 atgagttaaa cttaaatgaa agtaaaaaga aaaagtttta taaggtaact gttcgcttta 67861 aaatcgaaaa tggagtcgat acaattcaac agaatctaag aaatgatcaa attcgaaaat 67921 catttctatt tttattctat aaaaattcaa gtttcatttt tgaatgcagt tagacgatac 67981 agctcttatt agtttaatag tttactcaag agttactcaa tgaatcggtt gattggaatt 68041 gcgggatgga tagatgttac agatgatgaa tcaatttctt ttatatgtct gtcactttat 68101 ctttgttagt gctgtctgcc tataatgata gataaatcaa aaacttttca ttcaacttat 68161 tctttcaatt gaaattgaga tttttgccta tcctcctatt ttattttgaa aaatttgaaa 68221 cttaggtaag tgctttttaa acatatgtat aaaaagaaca tatttcattt aatttagccc 68281 cttcatgctt actataacta gttatttcgg ttttctatta gcggctttaa ctataacctc 68341 agctctattt attggtctga gcaagatacg acttatttaa actgaatatt taaaatgaac 68401 aattcataaa aagaaatcct tctgtgggat tacgcgtatt ctatatttac ttacgttacc 68461 aattgtcaat tcttgttcat tgtcattgag attcatgtca attcggatta atatttaggt 68521 atcgatatta cctctttttt tctcctttca aacaaataaa aatgattgaa gtttttctat 68581 ttggaatcgt gttaggtcta attcctatta ctttggctgg attattcgta actgcatatt 68641 tacaatatag gcgtggtgat cagttggacc tttgattaat taacatctct ttttgattga 68701 cctcctcctt tctttaattc acaggcacag gaggtcaaat tccgattgtt gtgaaagtta 68761 ctgaatgaat ctattttatt ctaattcgat ctaagaagaa aaaaatcacg ctctgtagga 68821 tttgaaccta cgacatcggg ttttggagac ccacgttcta ccgaactgaa ctaagagcgc 68881 tttcttatca gaatagataa gactgtaaac aaaaggattc ttttcataac cccaatacat 68941 tttgtatgca tatactagaa tagcatgata aaaatcaaag attatgtcca atttgaggcg 69001 atctcaattg atccctcgtt actgctcctt tgagcagtaa taggtaggga tgacaggatt 69061 tgaacctgtg acattttgta cccaaaacaa acgcgctacc aagctgcgcc acatcccttc 69121 aattgttcca cagtgtaatt gtagagaatt cctgtcttgt tttccacatg gttatttcct 69181 ccattgatat atacaaattt tctgctcatt tcgtcttttt ggtctcattt aacatataat 69241 agtaaaataa aaggaaaaga cttctcttat agattatata gaaaatactt atatacaatt 69301 atatacaaaa tatataaata cagaacccgt cgtaaaaatc aattagtatt tttcggaaat 69361 tctcggtaag aaagaagggg atgtattttt tttttctgtt ttaagaaaag gaaaatctta 69421 tttcccgaat cattgtacat tgcaatttga attaggaatt ctgtgtccaa ctctaagcag 69481 cccttaacta catatgcatc tgattatata tgtattatct attccaacaa ataatacaaa 69541 agaaggaggt ttttcaatgc gagatctaaa aacatatctc tctgtggcac cagtactaag 69601 tacgctatgg ttcggggctt tagcaggtct attgatagag attaatcgtt ttttcccgga 69661 tgcgttgaca ttcccctttt tttcattcta gttattgtca tgggaaggaa tgaagaagat 69721 tagagatcca atcaaatatt ggtgatgaat ccctctcccc ctcttttctc ttttttccct 69781 ttttagaata agggaggaaa gagaaagaat aaaaaaagtg gattcaacat tcgggctcaa 69841 gttcgaatta actgaatatt aataatagag gaatgggggt agaatagaag atctagggca 69901 agagtattat acaagatact taaatgatta cttcaatttg aaatatactt tagaaaaatc 69961 gttgtatttt actatgactt tgctttacta ttactttatt ttcttgattt taatctttta 70021 cttttagaat tggatttcaa gttagtaact tctattttat cctttcttcg ttttgaatcg 70081 aaaatagaag agttgagtaa atcaaaaatc caaaggaggt tcatggccaa ggggaaagat 70141 gtccgagtaa cggtgatttt ggaatgtact agttgtgtcc gaaacagtgt tgataaggta 70201 tcaagaggta tttccagata tattactcaa aagaaccggc acaatacgcc taatcgatta 70261 gaattgaaaa aattctgtcc ctattgttac aaacatacga ttcatgggga gataaagaaa 70321 tagagcgaac caagtacctg tgtcttaccc tttcaaggaa ggggaaaaaa tgacattata 70381 tatataacat atttaaatag aaaataaaca aatcttattt tttaaaaatc ctattttggg 70441 tggatttaaa ctgaattaga attaagaaat aggattttag ggataaggaa taaattaaac 70501 aaacaaacca tggataaatc caagcgacct tttcttaaat tcaagcgatc ttttcgtagg 70561 cgtttgcccc cgattcaatc gggggatcga attgattata gaaacatgag tttaattagt 70621 cgatttatta gtgaacaagg aaaaatatta tcaagacgag tgaatagatt gaccttgaaa 70681 caacaacgat taattactct tgctataaaa caagctcgta ttttatcttt gttacccttt 70741 ctcaataatg agaaacaatt tgaaagaacc gagtcgaccg ctagaactac tggttttaaa 70801 gcccgaaata aataggctta ctttttcttc acttgaatca taattacaag aatctagatt 70861 tgagtatcgt gtcgtaagaa aaaaaatgaa tcggaaaaaa agatttcttt ttttattgaa 70921 ttgaacgtgt tcattcattt tgactacttt agcatatttt ctcatagaaa tttctactct 70981 accttcccgg agttcattct ccggggaact ccatttaaat tattctggtg gattctttcc 71041 aatctacttc ctttatgatt tcgttcgaaa tcatataaag acaattccta tttgatatag 71101 ctatttgtgc aagtatttta cggttaagaa gcaactgtct cttgtacaga tcgtgtatta 71161 atctactata actataggat actccccttt cgcgaattac tgcgtttatc cgagtgatcc 71221 acaaacgacg aaaatctctc tttttcctat ccctatcccg atgagccgaa actaaagctc 71281 ttattttctg ttgagtaata gttcgagtaa gccttgaatg agccccccga aagcttgatg 71341 caaataaacg aatttttgtt ctacgtctcc gagctatata tccccgttta attctggtca 71401 ttgaataaat gaaactttga cgaataacta atcgattgcc tttctttcag ttattctttt 71461 cccccttcct agtctattaa taacaaaacg gatttttcca atgtataaaa taaaaattcc 71521 aatggctttg gctactctaa ccttcccgac cacgattttt tctttttttt ttttttaggt 71581 atttcactgc gaaataagaa agaaataaaa aattgtattt tcctaggtat caaaaatcta 71641 gtaaataaaa gaaatcaaaa aataaagtag tgggttcctt cgtttctatg gttacttctt 71701 aaacggtgag gtcttctcta tacaccggag cctttacttt atactttaat ttaatattta 71761 atcaactaat tgatgttatt gggaacttgt atagttcaca ctctttggct ctacccatga 71821 attatccagt aataggtctt tcacaatcag atctacctat acagtaagcg gtatttaatt 71881 atgaaagttt gctgggtagc tgaccctctt agtccgttct tgccagagtg ggagcctgcc 71941 taatctttat gttttatgct ttttaaataa gatttcctcc gcttaatgga taaccatttg 72001 ttaccaatgg agaatttctt atcatctgtg attggattta caccaacgga aaccataaac 72061 ttcatacaca atagagggat atgagagagt tttttttaaa taatgaatgg agttccttct 72121 tccatcctat cccattcacc ggtactgatc attgatactg taaaagtcgt tttcttgctt 72181 ttgtgccagc tcatgatcta aacgagtcgc acatacaccc tagtacatgt tcctcgacgc 72241 tgaggacagc cccgaagagc gggggatttc gtgacatttc tgattggctg tcttgtattt 72301 ctaataagtt gtttaatagt tggcatgttg aatcgtatac ataatatgat gggttggttt 72361 agattgatcc taaccgaatg atgatgaatt acttctattt aatagaatat tcaattcgaa 72421 gataaaatct caaatcacag atttgcgcga aatccatgtt attttcattc aaccgctaca 72481 agatcaacaa ttccataagc ttgggcttct gttgctgaca taaaaacatc tctttccata 72541 tcttcggata caacccataa gggtttcccc gttctttgta cataaaccct tgtgagggtt 72601 tcacgcagtt tcagcagttc ttccgcttcc aggacaaatt cgcctgtttg tgcctcataa 72661 aaagaactag caggttgatg gatcattacc ctgatgatat aacaaaataa aagcttcccc 72721 tatctcgcat gataaagcaa agagaaaaga aagataaaga atagaaaaaa gatagaattg 72781 aaccaaccgt acaggccatc ttttgtgcat acggcctcta caagaaaatt gacctcccct 72841 cctttctatt gaagaaagag aaaaaataga atctatcaga ctcagatggg taaatgatca 72901 aattccgatc cttcctttcg gaggagttaa aaaatactat gatggctccg ttgctttata 72961 tgtttatttt ttcttttttt ttttttgtct gtgattcacg aatcccaaag tttcttttta 73021 atccgatcaa ataaggaaaa aagtcttttt tttttttttt cgtactcttt cataacataa 73081 atattgttaa gaactctccg gcatgaaaac aaaaaagttt gtgacgctga actgaactcc 73141 cgatagataa gagaaaatcg gaaatacccc ttatctcata ctactctctc gatacagaat 73201 ctaatgtttt gaaaaaaaaa caatacaaaa atttctcata tcgaattcga agtgccatgc 73261 tattattact tagtattcat atggcgaagg catagtcttc ttttttctct caaataaaaa 73321 cctcattggc gccaagcgtg agggaatgct agacgtttgg taatttctcc tccgaccagg 73381 ataaaagatc ccattgaagc ggctaatccc atgcatattg tatggacatc tggtcgcaca 73441 aattgcatag tatcataaat agccacccca ggtattaccc agcccccagg agagtttata 73501 aacaaataca gatctttggt ctcatcctcg atactgagat ataccataag accaataagt 73561 tgattcgaaa tctcgctatc aacctcttgg cctaaaaaaa gtaatctttc tcgataaagt 73621 cggttgatta gggtaaaatt gtatccctta ggaaccgtac atgcgccttt tgatgcatac 73681 ggttcaaaaa aaaaatggtg aatcaatgta tagattccag tcctctttct ttttttctag 73741 aaaggttctt tcttacttct aacgaaaggg cttttcttcg attttttaat aaagacgagt 73801 tttgactcct tttttatatt ttcgattttc cattataaaa tttgaagtta taagaaaggg 73861 tcattaaact tatcgaatta acttctcatt gatgtattct ttcatcgaga tttaatccaa 73921 accgcgatgg tattttcttg ttcctgaatg ggtctgtttc atctttttag gtttatgctc 73981 tactccgggt aaagatccgc ccgatttgga tttgtacata taggacaaat gctcccatta 74041 ccatttcttt ttgtatttct tttttttttt caattcattt tatacaagta tttcttagag 74101 ttgagataac tttgcttgac aattaggatc tctttacaaa gaaaaaatat gaatagcaat 74161 catagatatc ttaccaatcc aattgggttt tttctaaacg gagcctggat acttcatttt 74221 tttagtccaa ccaagccaac cataaattat tctaattgaa tttttctaat tgataatagt 74281 aatatgaatc ccctcaaaaa tggatctaat tgcacttcac gctccaaatt tttgatgatt 74341 aaatttatct ttcttgggtg aaacggggga tatctcgatc gggggagaga acggggaaat 74401 accatatgac ccaatatatc tgacaagtcg cactatacgt caacccaaga tgcatcttcc 74461 tctccaggac ttcggaaagg gacttttgga acaccaatag gcattaaatg aaagaaagaa 74521 ctaaatacta tatttcactt tgaggtggaa acgtaacaat tttttttatt gtctttataa 74581 tattcatatt ggtttttatc gtatttattt tatccataga ttataaaaat tcataaagaa 74641 agacagaatg aataaactca aattattacg aataggtctt tctaatgata aataagtatg 74701 gactcattcg ctcatagaaa atgggatcaa ctcccccatt gcgtattggt acttatcgag 74761 tatagaataa atctgcttct ctttgttcct acgaacagaa ttgttccatt attaccaaca 74821 gaatagaaca cccttgttcg gaaataatcg actgaacaag agtggtccat aggatagtca 74881 tattatagtc ttttccaatg caataaagtt acgtagtgtc tatttatctt tgatataagg 74941 ggtatttcca tgggtttgcc ttggtatcgt gttcataccg ttgtattgaa tgatcccggt 75001 cggttgcttt ctgttcatat aatgcataca gctctggttg ctggttgggc cggttcgatg 75061 gctctgtatg aattagcggt ttttgatcct tctgatcctg ttcttgatcc aatgtggaga 75121 cagggtatgt tcgttatacc cttcatgact cgtttaggaa taaccaattc atggggcggt 75181 tggagtatca caggggggac tgtaacgaat ccgggtattt ggagttacga aggtgtagct 75241 ggagcacata ttgtgttttc tggcttatgc tttttggcag ctatctggca ttgggtgtat 75301 tgggatctag aaatattttg tgatgaacgt acaggaaaac cttctttgga tttgccaaag 75361 atctttggaa ttcatttatt tctctcaggg gtggcttgct ttggttttgg tgcatttcat 75421 gtaacaggct tgtatggtcc cggaatatgg gtgtccgacc cttatggact aacgggaaaa 75481 gtacaacctg taaatccagc gtggggcgtg gaaggttttg atccttttgt tccaggagga 75541 atagcctctc atcatattgc agcaggaaca ttgggcatat tagcgggcct attccatctt 75601 agcgtccgtc cgccacaacg tctatacaaa ggattgcgta tgggaaatat tgaaaccgtc 75661 ctttccagta gtatcgctgc tgtctttttt gcagcttttg ttgttgccgg aactatgtgg 75721 tatggttcgg caacaacccc gattgaatta tttgggccca ctcgttacca atgggatcag 75781 gggtacttcc agcaagaaat atatcgaaga gttagtgctg ggctagcaga aaatcaaagt 75841 ttatcagaag cctggtctaa aattcctgaa aaattagctt tttatgatta catcggcaat 75901 aatccggcaa aagggggatt attcagagcg ggctcaatgg ataacgggga tggaatagcg 75961 gttggatggt taggacaccc tatctttaga gataaagaag gccgtgaact ttttgtacgt 76021 cgtatgccta ctttttttga aacatttccg gtcgttttgg tagatggcga tggaattgtt 76081 agagccgatg ttccttttag aagggcagaa tcgaagtata gtgttgaaca agtaggtgta 76141 actgttgagt tctacggcgg tgaactcaac ggcgtcagtt atagtgatcc tgctactgtg 76201 aaaaaatatg ctagacgtgc tcaattgggt gaaatttttg aattagatcg tgctactttg 76261 aaatccgatg gtgtttttcg tagcagtcca aggggttggt ttacttttgg gcatgcttcg 76321 tttgctttgc tcttcttctt cggacacatt tggcatggtg ctagaacctt gttcagagat 76381 gtttttgctg gtattgaccc agatttagat gctcaagtcg aatttggagc attccaaaaa 76441 cttggagatc caactacaaa aagacaggca gcctgataca acattacttt ggtatctttc 76501 tttcgccctt attttctttc ttttactttt attgacatag ggtaccagag aaatctttat 76561 ttgaatcaac ttcgttttta ctcttgttcg ttctttatcc ggaagatgac aaaaaaaaga 76621 aaataaaaag aaacaaacag gtatgaaagc tataattgta aaccacgatc gaatctatgg 76681 aagcattggt ttatacattc ctcttagtct cgactctagg gataattttt ttcgctatct 76741 tttttcgaga accgcctaaa gttccaacta aaaagaacta aaaaggtgaa ataattcttc 76801 attatctcag ttgaagtact gagcctcccg ataccgggag gctcagtact tcaactagtc 76861 tccatgttcc tcgaatggat ctcttagttg ttgagaaggt tgcccaaaag cggtatataa 76921 ggcgtaccca gtaaaactta caagtaaacc agatataaag atggcgacta gggttgctgt 76981 ttccattctt atcatattta taaaatttca agaccccaat ggatctatga taggatcgtt 77041 tatttacaac ggaatggtat acaaagtcaa cagatctcaa tgaatacaat aggatttatg 77101 gctacacaaa ctgttgaaaa cagttctaga tctggtccaa gacgaactgc ggtaggagat 77161 ttattaaaac cattgaattc ggaatatggt aaagtagctc ctgggtgggg aactactcct 77221 ttgatgggtg tcgcaatggc cttatttgcg gtatttctat ctattatttt ggagatttat 77281 aattcttccg ttttattgga tggaatttca atgaattaga tctataagaa ccgcaaagtt 77341 cttgcttttg agtccaaaat gaatcattta gagctccgat ttctagtcca ttctattttc 77401 ttttggtagt tcgatcgtgg aatttctttg tttctgtatt tccggagtat gagtgtgtga 77461 cttgttataa ttgatcctat tgatagtaca gagaatgggt ctgtcatctt gatagagatg 77521 gttctacttc gtcagatatt tattctaata tttggaacac gaaatagatt aagaaatatt 77581 tgaactatga ttcatactta atattcagac ctcgtgtccg ggctccaaaa aattttcaaa 77641 caaagaattc taatttctaa atcgaaagat tcttttcttt caacccctat ttatattttg 77701 accaaaagca aaacctttct ttgaattttt agtcattcta tttattcagg gaataagtga 77761 tgatccgagg attcttactc agggaatcct tgatttgatt taggttaggt ttttttattg 77821 aatcatcgtg gttctagtat gaatctgagg ttttaatcga ttcatagggt cttaacaaga 77881 gaattcctat caataataaa gaaaacaaat aataaaagcc atattccaca aaaacaaatt 77941 ctagaaagaa atagggaaaa agagaattca agaggcccat aagtatcaaa ataaagataa 78001 agacgactgc gccaacttga tattttggta ttatcgccac aaagaagagc tttcggattt 78061 tccagagaag atgggatcag aacttaataa atttaaaact ttctattcca tatccgttgc 78121 aactagtatt tgggtgtttt tgcttgagct gtacgagatg aaagtctcat atacggttct 78181 cagaggggga gttccgccta tctcaataaa gtatatgatt ggttcgaaga acgtctcgag 78241 attcaagcaa ttgcggatga tataactagt aaatacgttc ctccccacgt caatatattt 78301 tattgtttag ggggaattac gcttacttgt tttttagtac aagtagctac tgggtttgct 78361 atgacttttt actatcgtcc gaccgttact gaggcttttg cttctgttca atacataatg 78421 actgaagcca actttggttg gttaatccga tcagttcatc gatggtcggc aagtatgatg 78481 gtcctaatga tgatcctgca tgtatttcgt gtgtatctca ccggcggatt taaaaaacct 78541 cgcgaattga cttgggttac aggtgtggtt ctggctgtat taaccgcatc ttttggcgta 78601 actggttatt ccttaccttg ggaccaagtc ggttattggg cagtgaaaat agtaacaggt 78661 gtccctgacg ctattcctgt aataggatca cccttggtcg aattattgcg cggaagcgct 78721 agtgtgggac aatctacttt gacccgtttt tatagtttac acacttttgt attgccgctt 78781 cttactgccg tatttatgtt aatgcacttt ccaatgatac gtaaacaagg tatttctggg 78841 cctttataga gaaaagaaaa atagatccta aatatttgta atcaatcatt tatcacttgg 78901 tggaggaata tatagtattt cattgctaca agtatggatt attgaaaata ataagacatg 78961 gatttggata tttcccttta actattcatg tcaactaaac ggggggattg aagggaattt 79021 tgtgaagaga aaatggatta tgggagtgtg tgacttgaac tattgattgg tctgtgtaga 79081 tatatgcctg ccacatggga attcacaacc aaatgtgtct ttgttccaat cgccgtgtaa 79141 gccctataca gaggataggc tggttcgctt aaagagaatc ttttctatga tcaggtccga 79201 atcatgttgt acatgagcag gctccgtaag atccagtata agtgaactag ataaaacgga 79261 atcaagattc cgttttatct agttcactta taagattaaa tagtatgtaa atgtattcat 79321 ttcctctgca gtgacacgat caatactact atcggagtga aacaagggat ctaaagaaga 79381 agagaggcta gactatatta gtaacaagca aaccttgtat gtgtatctcc aaatattttg 79441 gagataaata ccaattagaa ggtctgagac gacccagaaa gcacttgatc atatcatgat 79501 ctgatttgta agcctacttg ggtcttgagt atttacttgt aagaacggaa ttctttgttt 79561 tgtaatggat agttgcaact ccgtaaaaaa gaattcagtc aaatttttct tacattgaac 79621 cattcctata tcatatatgt gtatgtgtaa atacaggtac catatatata ttttatatgg 79681 atatatggag tcatttggtt ctttttattc ttgctcgagc tggatgatta aaaattatca 79741 tgtccagttc cctcggggga tggatctata agaattcacc tatcccaata acaaaaaaac 79801 ctgacttgaa tgatcctgta ttaagagcta aattggctaa aggtatgggt cataattatt 79861 atggagagcc cgcatggccc aatgatcttt tatatatttt tccagtagta attctaggta 79921 ctattgcatg taatgtaggc ttagccgttt tagaaccatc aatgattggt gaaccggcag 79981 atccatttgc aacccctttg gaaatattac ctgaatggta tttctttcct gtatttcaaa 80041 tacttcgtac agtgcccaat aaattattgg gggttctttt aatggtttca gtacctgcgg 80101 gattattaac agtacctttt ttagagaatg ttaataaatt ccaaaatcca tttcgccgtc 80161 cagtagcgac gactgtcttt ttgattggta ccgcagtcgc cctttggttg ggcattggtg 80221 caacattacc tattgataaa tccctaactt taggtctttt ttaaattttt aaattgattc 80281 aattgtgaaa taacacgaca tgtgtatcta gggaatagtt tcttcaaagc gaattctccc 80341 tagatacatc tattcaattt aattctgaat ttattttgaa tatatgatat attaatatat 80401 taattgtgct aaagagtttc aatctatttt cactaagtaa gtccaataga tttaaaactt 80461 attttttgct aaatcaatta cgaaatattt ttctaaaatg cccaatatcc gttttacatc 80521 ttcgctacga aaatgttcaa ttttcataag atcttcttgg ctgttattca aaaggtccaa 80581 caatgtatat atattggaca ttttgaggca attatagatc ctggaaggca attctgattg 80641 gtcaataaaa atcgatttca atgctatttt ttttttgttt tttatgagtt tagccaattt 80701 atcatgaaag gtaaaagggg ataaaggaac cgtgtgttga ttgtcctgta aatataagtt 80761 gtcttcctcc atatgtaaaa agggaataaa taaatcaatt aaatttcggg atgcttcatg 80821 aagtgcttct ttcggagtta aacttccgtt tgtccatatt tcgagaaaaa gtatctcttg 80881 tttttcattc ccattcccat aagaatgaat actatgattc gcgtttcgaa caggcatgaa 80941 tacagcatct ataggataac ttccatcttg aaagttatgt ggcgttttta taagatatcc 81001 acgatttctc tctatttgta atccaataca aaaatcaatt ggttccgtta aactggctat 81061 atgttgtgta ttatcaacga tttctacata aggcggcaag atgatatctt gggcagttac 81121 agatccagga cccttgacac aaatagatgc gtcagaagtt ccatatagat tacttcttaa 81181 tataatttct ttcaaattca ttaaaatttc atgtaccgat tcttgaatgc ccgttatggt 81241 agaatattca tgtgggactt tctcagattt tacacgtgtg atacatgttc cttctatttc 81301 tccaagtaaa gctcttcgca tcgcaatgcc tattgtgtcg gcttggcctt tcataagtgg 81361 agacagaata aagcgtccat aataaaggcg tttactgtct gttcttgatt caacacactt 81421 ccactgtagt gtccgagtag atactgttac tttctctcga accatagtac tattatttga 81481 ttagatcatc gaatctttta tttctcttga gatttcttca atgttcagtt ctacacacgt 81541 ctttttttcg gaggtctaca gccattatgt ggcataggag ttacatcccg tacgaaagtt 81601 aatagtatac cacttcgacg aatagctcgt aatgctgcat ctcttccgag accgggacct 81661 tttatcatga cttctgctcg ttgcatacct tgatccacta ctgtacggat agcgtttgct 81721 gctgcggttt gagcagcaaa cggtgttcct cttctcgtac ctttgaatcc agaagtaccg 81781 gcggaggacc aagaaactac tcgaccccgt acatctgtaa cagtgacaat ggtattattg 81841 aaacttgctt gaacatgaat aactcccttt ggtattctac gtgcaccctt acgtgaacca 81901 atacgtccat tcctacgcga actaattttc ggtatagctt ttgccatatt ttatcatctc 81961 gtaaatatga gtcagagata tatggatata tccatttcat gtcaaaacag attctttatt 82021 tgtacatcgg ctcttctggc aagtctgatt atccctgtct ttgtttatgt ctcgggttgg 82081 aacaaattac tataattcgt ccccgcctac ggattagtcg acatttttca caaattttac 82141 gaacggaagc tcttattttc atatttctca ttccttacct taattctgaa tctatttctt 82201 ggaagaaaat aagtttcttg aaatttttca tctcgaattg tattcccacg aaaggaatgg 82261 tgaagttgaa aaacgaatcc ttcaaatctt tgttgtggag tcgataaatt atacgccctt 82321 tggttgaatc ataaggactt acttcaattt tgactctatc tcctggcagt atccgtataa 82381 aactatgccg gatctttcct gaaacataat ttataatcag atctaaacaa acccggaaca 82441 gaccgttggg aaggcgattc agtaattaaa gcttcatgac tcctttttgg ttcttaaagt 82501 ccctttgagg tatcaactaa taagaaagat attagacaac cccccttttt tctttttcac 82561 aaataggaag tttcgaatcc aatttggata ttaaaaggat taccagatat aacacaaaat 82621 ctctccacct attccttcta gtcgagcctc tcggtctgtc attatacctc gagaagtaga 82681 aagaattaca atccccattc cacctaaaat tcgcggaatt cgttgataat tagaatagat 82741 tcgtagacca ggtcgactga ttcgttttaa atttaaaata tttctatagg gtcttttcct 82801 attccttcta tgtcgcaggg ttaaaaccaa aaaatatttg tttttttctc gatgttttct 82861 cacgttttcg ataaaacctt ctcgtaaaag tatttgaaca atattttcgg taatattagt 82921 agatgctatt cgaaccaccc tttttcgatc catatcagca tttcgtatag aagttattat 82981 ctcagcaata gtgtccctac ccatgatgaa ctaaaattat tggggcctcc aaatttgata 83041 taatcaacgt gttttttact tatttttttt ttgaatatga tatgaattat taaagatata 83101 tgcgtgagac acaatctact aattaatcta tttctttcaa ataccccact agaaacagat 83161 cacaatttca ttttataata cctcgggagc taatgaaact attttagtaa aatttaattc 83221 tctcaattcc cgggcgattg caccaaaaat tcgagttcct tttgatttcc ttccttcttg 83281 atcaataaca actgcagcat tgtcatcata tcgtattatc atcccgttgt cacgtttgag 83341 ttctttacag gtccgcacaa ttacagctct gactacttct gatctttcta ggggcatatt 83401 tggtacggct tctttgatca cagcaacaat aacgtcacca atatgagcat atcgacgatt 83461 gctagctcct atgattcgaa tacacatcaa ttctcgagcc ccgctgttat ccgctacatt 83521 taaatgggtc tgaggttgaa tcattttttt aatccgttct ttgaatgcaa agggcgaaga 83581 aaaaaaagaa atatttttgt ccaaaaaaaa agaaacatgc ggtttcgttt catatctaag 83641 agccctttcc gcattttttt ctattacatt acgaaataat gaattgagtt cgtataggca 83701 ttttagatgc tgctagtgaa atagcccttc tggctatatt ttctgttact ccacccattt 83761 cataaagtat tcgacccggt ttaacaacag ctacccaata ttcaggggat ccttttcctg 83821 aacccatacg tgtttctgcg ggtcttagtg taactggttt gtctggaaat atacgtaccc 83881 atatttttcc accacgacgt gcatttcgtg tcattgctcg tcggcctgct tctatttgtc 83941 tagatgtaat ccaagcaggt tcaagtgcct gaagagcata tttaccgaaa gaaatatgat 84001 tacctcgatg agatattccc ttcattcttc ctctatgttg tttacggaat ctggttcttt 84061 tggggttata gttgatggtt gtttctgaat tccatctcta ctacagaacc ggacgtgaga 84121 gtttcttctc atccagctcc tcgcgaataa aaggattcaa aaaatttaat tagaattaag 84181 ctagaatagt caatcttaag ttaagatata tatgtattta ctgagtaata ccttgaacgt 84241 gggattcttt gagatttcat tcaatctatt agtaatttgt atatcttgtt tgaatagata 84301 actaaacttt tgagttttat aaatagaaat ctaaaaaaaa attgtattat tataccaaat 84361 ccttattttg tcctttattg tattgtccta aattttgcaa taaaaaaagt tttcgcgggc 84421 gaatattgac tctttcaatc cctatttcat ttgtagggtt aactcgtgac ttctcagatc 84481 tccgaataca tgaattaatc tctggttcgt tccgccatcc cgaccagtga atcattaaga 84541 ttcctttttc aatagaatct tttgcattca caagttccgt cgttcccatc acttcttact 84601 taatggttag gtccgaattc tacaatggag ctcagaatga aattggttct tgagtcaatc 84661 ttctcagtct ttattggctc gaagctcttg attttttgtt ctatttctat aagaagattc 84721 attttattat ggtatgaatg cgtattgatg ctttattaca ctgcctttta tgagattact 84781 catagacctt acatattgga attttatatc attggtattc tttttctctc tttctctcat 84841 ccttccattt atccacatct tttttgtcta ttttgcttta caacttagaa tcagatttcc 84901 ttttttgttt atgcaaaaga tttcagttgc tacaaagata tgacctatat atcatatctt 84961 gactggttct ttagatccag ataatgcgaa gtgatgggtt ggttattagt tctatagttt 85021 ttagttcata ctatgtgggc tggtcttttt taatcctaac cctaaaaaac caacggagtc 85081 acacactaag catagcaatt atatcaaatg gtcaatcgaa tttttattca accttataga 85141 attaagaatt agaaatgttt cccttgattg attagaaaaa gaatgaattt gtcttttttt 85201 gttcaatcat tggatagaag ggaaagacaa gtagtaaaat tattcctcgt ctagaaatat 85261 ccaaattttg atgcccaata ctccatagat agttcgaact gtataagagc aataatcaat 85321 tttcgctcga atcgtttgta ggggaaccct accttctctg atccattcga cacgtgcaat 85381 ttcttttccg tcgatacgcc ccgcaatttg tatttgaatt ccttttgtat ctgcttgttc 85441 tgttaattca atagcctttt tcattgcttt tcgaaaggaa actctattct ttaattgtcc 85501 agctataaat tctgcaagaa tattagggtt tccataaggt tttgcaattc ttgtgacagc 85561 aatgttcagt tttcggttta cacaatgaaa ttctttttgt aaggtcgttt gtaattcttc 85621 gattccgcgc ggtcgacttt ctattaataa ttttgggaat cccataaaga ttatgacctg 85681 gatcagatcg attctttttt gaatctctat acgtgcaatt ccctcgacgc cagaggacgt 85741 tctcatattc ttttgtacat aattcttgat acaatctctt attttttgat cttcttgtaa 85801 accttcagaa taattttttg gttgtgaaaa ccaaagggaa tgatgacctt gggttgtacc 85861 cagtctgaaa ccaagtggat ttattttttg tcccataatc ccccactatt atacatatca 85921 cgatacggca tagctgtaga tttttttttc catctcgttt tttttaacga atacatctct 85981 acatattcat catctaaaga tatatctttc attacaatag ttatatgaca ggtcgatctt 86041 tttattggaa aactacgtcc tcgagctcga ggtttcaatt tcttcacagt agtacctcca 86101 ttgacttcgg ctttactaat gactaaattg gcttcgctgg aacccatatt gtaactagca 86161 tttgctgctg cagaataaat caatttcaaa atgggataac atgctcgata gggcatgagt 86221 tctagtatca taagcgtttc ctcataggaa cggccgcgaa tttgattaat tactcttcgt 86281 gctttgtcag cagacataga tatatgttca cctaaagcat atacttctgt ttttttcttc 86341 tttagcataa ggtttgcctc ctactactga atcataagca tctagatttt ttttattaat 86401 attaacgacg agatctatta tcgctttttg catgtcctct aaaatttaat gtaggtgcaa 86461 attctcccaa tttgtggcct accatactat ccgttatata aataggcaaa tgctcttttc 86521 cattatggat agcaatcgta tgaccgatca ttgtgggtat aatggtagat gcccgggacc 86581 aagttactat tatttctttt tctgcttttg tgttaagctt atcaattttt tttaataaat 86641 gattggctac aaagggattt ttttttagtg aacgtgtcac aagcttactc ctattttttt 86701 tttttttgta aaaacgaaga atttaattcg attttctctc ctatttacta cggcgacgaa 86761 gaatcaaatt atcactatat ttattccttt ttctacttct tcttccaagt gcaggataac 86821 cccaaggggt tgtgggtttt tttctaccaa ttggggctct cccttcacca cccccatggg 86881 gatggtctac agggttcata actactcctc ttactacagg acgcttacct agccaacgct 86941 tagatccggc tctacccaaa cttttctggt tcaccccaac attccccact tgtccgactg 87001 ttgctgagca gtttttggat atcaaacgga cctccccaga aggtaatttt aatgtggccg 87061 atttcccctc ttttgcaatc agtttcgcta cagcacccgc tgctctagct aattgtccac 87121 cctttccaag tgtgatttct atgttatgta tggccgtgcc taagggcata tcggttgaag 87181 tagattcttc ttttgatcaa tcaaaacccc ttcccaaact gtacaagctt cttccaaagc 87241 atacttcttt ctggatgtag atgatgatat ctatacagat ggatcttata tatatcgtag 87301 aatgaagtac cacatgggtg gatatatata tgaatccaaa tctgccgaat cactcatgtt 87361 atgatcttct acatcctggg tcttcccgtt ccgtcatctg gcttatgttc ttcatgtagc 87421 attcagaccg aatgactcta tgaaattacg tcgatacttc cacatattat gggtaacgta 87481 ggagacatct ctatttttcc cccggggaat ctttagaatt cccactgctt aactttcaat 87541 tcgcctctga ccatcaaatg aaatgtgaat aacccgtcct cctctctttg aaagaagggg 87601 cgcttccggt tctgtcggtg cttgaaacaa ttttgtcttc tccatattac tatatctcta 87661 gagtcaataa ttttatatga ggaactactg aactcaatca cttgctgccg ttactcttca 87721 gttttctgtt gaggtctatc ctgcagaggt actcaaattg gatcagtgat cgatttctag 87781 gtttcgtcgt aaacctaatt ggttatttcc aattacgtaa atcaatagtt caaaccgcac 87841 tcaaaggtag ggcatttccc atttttatag gaacttctgt accagaaaca atggtatctc 87901 caattatagc ccctctggga tgtaaaatat atctcttctc accatcccca tagtgtatga 87961 gacaaatgta tgcatttcga ttagggtcgt attctatggt tacgattcta ccatatatgt 88021 ctttttcatt ccgtcgaaaa tcgattttac ggtatagacg cttatgacct ccccctctat 88081 gccttgcggt aatgattcct ctggcattac gacctttacc acaatgatgc tgtccataga 88141 tcaaattatt tcgtggattg gatttcactt gactgtctac ggttccattg cgtgtgctcg 88201 gggtagaagt tttgtataaa tgtatcgcca tgctattaag tatttttttt taagttcttt 88261 tctttctaag aggtggaata gaataacccg gttgaagcgt aatgatcata cgtctgtaat 88321 gcattgtatg tcccataata ggtcccattc ttctactctt tcccggaagt cgatgactat 88381 tcatagctat taccttgaca ccaaagaaga gttcgaccca atgctttatt tctgtcctag 88441 ttgatcctga ttcgacatta gaagtatatt gatttttccc caataaccga atacttttgt 88501 ctgtaaatac tgcatatttg attccatcca taaatcgatt ttcttcccta tgagttatag 88561 tctcaataag aatgctagtt cttactgttc atatattatg atatgaatat accacaccaa 88621 ttcgttatgt atggatgatg agattccatt gatacagagc caattccaat agacttattg 88681 gagggtccca ttggcgtgca tccagtagga attgaaccta cgaattcgcc aattatgagt 88741 tgggcgcttt aaccattcag ccatggatgc ttagcgggga tcctcgtaca tggtgaataa 88801 ccaaattcca attgaaatga aatctttagg ataaatcaat gcaatttagt taggataaat 88861 caatgcaatt taggaggaat caatgagagg acatcaattc aaatcctgga ttttcgaatt 88921 gagagagata ttgagagaga tcaagaattc tcaccatttc ttagattcat ggacccaatt 88981 caattcagcg ggatccttca ttcacatttt tttccaccaa gaacgttttc taaaactctt 89041 tgacccccga atttggagta tcctactttc acgcaattca cagggttcaa caagcaatcg 89101 atatttcacg atcaagggtg taatactctt tgtagtagcg gtccttatat atcgtattaa 89161 caatcgaaat atggtcgaaa gaaaaaatct ctatttgata gggcttcttc ctatacctat 89221 gaattccatt ggacccagaa atgatacatt ggaagaatcc gttgggtctt ccaatatcaa 89281 taggttgatt gtttcgctcc tgtatcttcc caaaggaaaa aagatctctg agagttgttt 89341 cctgaatccg aaagagagta cttgggttct cccaataact aaaaagtgta gcatgcctga 89401 atctaactgg ggttcgcgtt ggtggaggaa ctggatcgga aaaaagaggg attctagttg 89461 taagatatct aatgaaaccg tcgctggaat tgagatctta ttcaaagaga aagatctcaa 89521 atatctggag tttctttttg tatattatat ggatgatccg atccgcaagg accatgattg 89581 ggaattgttt gatcgtcttt ctctgaggaa gagtcgaaat agaatcaact tgaattcggg 89641 accgctattc gaaatcttag tgaaacactg gatttcttat ctcatgtctg cttttcgtga 89701 aaaaatacca attgaagtgg agggtttctt caaacaacaa ggggctgggt caactattca 89761 atcaaatgat attgagcatg tttcccatct cttctcgaga aacaagtggg ctatttcttt 89821 gcaaaactgt gctcaatttc atatgtggca attccgccaa gatctcttcg ttagttgggg 89881 gaagaatccg cccgaatcgg attttttgag gaacgtatcg agagagaatt ggatttggtt 89941 agacaatgtg tggttggtaa acaaggatcg gttttttagc aaggtacaga atgtatcgtc 90001 aaatattcaa tatgattcca caagatctag tttcgttcaa gtaacggatt ctagccaact 90061 gaaaggatct tctgatcaat ccagagatca tttggattcc attagtaatg aggattcgga 90121 atatcacaca ttgattaatc aaagagagat tcaacaacga aaagaaagat cgattctttg 90181 ggatccttcc tttcttcaaa cggaacgaaa agagatagaa tcaggccgat tcccgaaatg 90241 cctttctgga tattcctcaa tgtcccggct attcacggaa cgtgagaagc agatgattaa 90301 tcatctgttt ccggaagaaa tcgaagaatt tcttgggaat cctacaagat ccgttcgttc 90361 ttttttctct gatagatggt cagaacttca tctgggttcg aatcctactg agaggtccac 90421 tagggatcag aaattgttga agaaacaaca agatctttct tttgtccctt ccaagcgatc 90481 ggaaaataaa gaaatggtta atatattcaa gataattacg tatttacaaa ataccgtctc 90541 aattcatcct atttcatcag atccgggatg tgatatggtt ccgaagatga accggatatg 90601 gacagttcca ataagatttc attcttgaac aaaaatccat tttttgattt atttcatcta 90661 ttccatgacc ggaacagggg aggatacacg ttacactacg attttgaatc agaagagaga 90721 tttcaagaaa tggcagatct attcactcta tcaataaccg agccggatct ggtgtatcat 90781 aagggatttg ccttttctat tgattcctgc ggattggatc aaaaacaatt cttgaatgag 90841 gccagggatg aatcgaaaaa gaaatcttta ttggttctac ctcctatttt ttatgaagag 90901 aatgaatctt tttctcgaag gatcagaaaa aaatgggtcc ggatctcctg cgggaatgat 90961 ttggaagatc caaaaccaaa aatagtggta tttgctagca acaacataat ggaggcagtc 91021 actcaatata gattgatccg aaatctgatt caaatccaat atagtaccta tgggtacata 91081 agaaatgtat tgaatcgatt ctttttaatg aatagatccg atcgcaactt cgaatatgga 91141 attcaaaggg atcaaatagg aaaggatact ctgaatcata gaactataat gaaatatacg 91201 atcaaccaat atttatcgaa tttgaaaaag agtcagaaga aatggttcga gcctcttatt 91261 ttgatttctc gaaccgagag atccatgaat cgggatcctg atgcatatag atacaaatgg 91321 tccaatggga gcaagaattt ccaggaacat ttggaacagt ccgtttcgga gcagaagagc 91381 cgttttcaag tagtgttcga tcgattacgt attaatcaat attcgattga ttggtctgag 91441 gttatcgaca aaaaagattt gtctaagcca cttcgtttct ttttgtccaa gtcacttctt 91501 tttttgtcca agttgctttt ctttttgtct aactcacttc cttttttctg tgtgagtttc 91561 ggaaatatcc ccattcatag gtccgagatc tacatctatg aattgaaagg tccgaatgat 91621 caactctgca atcagttgtt agaatcaata ggtcttcaaa ttgttcattt gaaaaaatgg 91681 aaacccttct tattggacga tcatgatact tcccaaaaat cgaaattctt gatcaatgga 91741 ggaacaatat caccattttt gttcaataag ataccaaagt ggatgattga ctcattccat 91801 actagaaata atcgcaggaa atcctttgat aacccggatt cctatttctc aatgatattc 91861 cacgatcaag acaattggct gaatcccgtg aaaccatttc atagaagttc attgatatct 91921 tctttttata aagcaaatcg acttcgattc ttgaataatc cacatcactt ctgcttctat 91981 tggaacacaa gattcccctt ttctgtggaa aaggcccgta tcaataattc tgattttacg 92041 tatggacaat tcctcaatat cttgttcatt cgcaacaaaa tattttcttt gtgcgtcggt 92101 aaaaaaaaac atgctttttg ggggagagat actatttcac caatcgagtc acaggtatct 92161 aacatattca tacctaacga ttttccacaa agtggtgacg aaacgtataa cttgtacaaa 92221 tctttccatt ttccaagtcg atccgatcca ttcgttcgta gagctattta ctcgatcgca 92281 gacatttctg gaacacctct aacagagggg caaatagtca attttgaaag aacttattgt 92341 caacctcttt cagatatgaa tctatctgat tcagaaggga agaacttgca tcagtatctc 92401 aatttcaatt caaacatggg tttgattcac actccatgtt ctgagaaaga tttatcatcc 92461 gaaaagagga aaaaacggag tctttgtcta aagaaatgcg ttgagaaagg gcagatgtat 92521 agaacctttc aacgagatag tgctttttca actctctcaa aatggaatct attccaaaca 92581 tatatgccat ggttccttac ttcgacaggg tacaaatatc taaatttgat atttttagat 92641 actttttcag acctattgcc aatactaagt agcagtcaaa aatttgtacc catttttcat 92701 gatattatgc atggatcagg tatatcatgg cgaattcttc agaaaaaatt gtgtcttcca 92761 caatggaatc tgataagtga gatctcgagt aagtgtttac ataatcttct tctgtccgaa 92821 gaaatgattc atcgaaataa tgagtcacca ttgatatcga cacatctgag atcgccaaat 92881 gctcgggagt tcctctattc aatccttttc cttcttcttg ttgctggata tctcgttcgt 92941 acacatcttc tctttgtttc ccgggcctct agtgagttac agacagagtt cgaaaaggtc 93001 aaatctttga tgattccatc atctatgatt gagttgcgaa aacttctgga taggtatcct 93061 acatctgaac cgaattcttt ctggttaaag aatctctttc tagttgctct ggaacaatta 93121 ggagattctc tagaagaaat acggggttct gcttctggcg gcaacatgct tggtcccgct 93181 tatggggtca aatcaatacg ttctaagaag aaagattgga atatcaatct catcgagatc 93241 atcgatctca taccaaatcc catcaatcga atcacttttt cgagaaatac gagacatcta 93301 agtcatacaa gtaaagagat ctattcattg ataagaaaaa gaaaaaacgt gaacggggat 93361 tggattgatg ataaaataga atcctgggtc gcgaacagtg attcgattga tgatgaagaa 93421 agagaattct tggttcagtt ctccacctta acgacagaaa ataggattga tcaaattcta 93481 ttgagtctga ctcatagtga tcgtttatca aagaatgact ctggttatca aatgattgaa 93541 caaccgggag caatttactt acgatactta gttgacattc ataaaaagca tctaatgaat 93601 tatgagttca atccatcctg tttagcagaa agacggatat tccttgctca ttatcagaca 93661 atcacttatt cacaaacttc gtgtggggaa aatagttttc atttcccatc tcatggaaaa 93721 cccttttcgc tccgcttagc cttatccccc tctaggggta ttttagtgat aggttctata 93781 ggaactggac gatcctattt ggtcaaatac ctagcgacaa actcctatgt tcctttcatt 93841 acggtatttc tgaacaagtt cctggataac aagcctaaag gttttcttct tgatgagatc 93901 gatattgatg atagtgacga tattgatgat agtgacaatc ttgatgctag tgacgatatc 93961 gatcgtgacc ttgatacgga gctgaaactg ctaactagga tgaatgggct aactatggat 94021 atgatgccgg aaatagaccg attttatatc acccttcaat tcgaattagc aaaagcaatg 94081 tctccttgca taatatggat tccaaacatt catgatctgg atgtgaatga gtcgaatgac 94141 ttagccctcg gtctattagt gaaccatctc tccagggatt gtgaaagatg ttctactaga 94201 aatattcttg ttattgcttc gactcatatt ccccaaaaag tggatcccgc tctaatagct 94261 ccgaataaat taaatacgtg cattaagata cgaaggcttc ttcttccaca acaacgaaag 94321 cactttttca ctctttcata tactagggga tttcacttgg aaaagaaaat gttccatact 94381 aacggattcg ggtccataac catgggttcc aatgcacgag atcttgtagc acttaccaat 94441 gaggtcctat cgattagtat tacacagaag aaatcaatta tagacactaa tacaattaga 94501 tccgctcttc atagacaaac ttgggatttg cgatcccagg taagatcggt tcaggatcat 94561 gggatccttt tctatcagat aggaagggct gtagcacaaa atgtacttct aagtaattgc 94621 cccatagatc ctatatctat ctatatgaag aagaaatcat gtaacgaagg ggattcttat 94681 ttgtacaaat ggtacttcga gcttggaacg agcatgaaga gattaacgat acttctttat 94741 cttttgagtt gttctgccgg atcggtcgct caagatcttt ggtctttatc cggacccgat 94801 gaaaaaaatg ggatcacttc ttatggactc gttgagaatg attctgatct agttcatggc 94861 ctattagaag tagaaggcgc tctggtggga tcttcacgga cagaaaaaga ttgcagtcag 94921 tttgataatg atcgagtgac attgcttctt cggcccgaac cgaggaatcc cttagatatg 94981 atgcaaaacg gctcttgttc tatccttgat cagagatttc tctatgaaaa atatgaatcg 95041 gagtttgaag aaggggaggg agaaggagcc cttgacccgc aggaggattt attcaatcac 95101 atagtttggg ctcctagaat atggcgccct tggggctttc tatttgattg tatcgaaagg 95161 cccaatgaat tgggatttcc ctattggtcc aggtcatttc ggggcaagcg gatcatttat 95221 gatgaagagg atgagcttca agagaatgat tcggagttct tgcagagtgg aaccatgcag 95281 taccagacac gagatagatc ttccaaagaa caaggccttt ttcgaataag ccaattcatt 95341 tgggaccctg cagatccact ctttttccta ttcaaagatc agccccctgg ctctgtgttt 95401 tcacatcgag aattatttgc agatgaagag atgtcaaagg ggcttcttac ttcccaaaca 95461 gaccctccta catctatata taaacgctgg tttatcaaga atacgcaaga aaagcacttc 95521 gaattgttga ttaatcgtca gagatggctt agaaccaaca gttcattatc taatggatct 95581 ttccgttcta atactctatc cgagagttat cagtatttat caaatctgtt cctatctaac 95641 ggaacgctat tggatcaaat gacaaagaca ttgttgagaa aaagatggct tttcccggat 95701 gaaatgaaaa ttggattcat gtaacaggag aaagatttcc cattccttag ccggaaagat 95761 atgtggccat gaaagaggga ttaagtggaa cagaattgac tgggtggtag agtcgtggaa 95821 acgcttgttt cttccatatt ttggacctta gctccatgga agaatatgtt actgctgaaa 95881 cacggaagaa ttgaaatctt agatcaaaac actatgtatg gatggtatga actgcctaaa 95941 caagaattct tgaacagcaa acaaccagtt cagatattca cgaccaagaa gtactggatt 96001 ctctttcgga taggccctga aaggagaagg aaggctggaa tgccaacagg cgtctattat 96061 attgaattta cccgatagtc cccattttgg gaacgtccag tgccaaagtc actgaatggg 96121 taagtcgcca atccctggac tatgtaatgt actttatctg ctgggttacg ggcgggcatt 96181 ttaccagagg tttctaatct acccttgtgt gattcctgtt gaagcatata ctcggggggt 96241 gggtgcaggg cggacgattt taaagcggac tccccattca ttagatagag aagatcacca 96301 agatttcgcg atccgctgcc gaatttattc caattccaag agctcggatc gaatcggtat 96361 atcaataccg attcgatccg agctctctta ttgagaatgc tcattcaatg agcattctca 96421 atattatgcc ttgaagagga ctcgaacctc cacgctattt agcacgagat tttgagtctc 96481 gcgtgtctac catttcacca ccaaggcatc ttgaaagtga atcgtattcc atgaatatga 96541 tatctatcta gtgtgatgta tggaatatat gacaaaggtg gatctattga tcggtcatgt 96601 catataggcc cgagttggac atccaattgc ttcgatttga attatccgga gaatgcaatg 96661 cctgatatat atcaaaaaga tggacaatca aacctatttc tcgattcact caaagaggtg 96721 aatagggtcc caatagagat atgtaaaaag caggtccgat tacgcgtatt cctaatccta 96781 aatggaatgt aatgatgtag gaatccatat gtaaacatag tatctattta gataggcccg 96841 aatgacccct tctcataatg agaatgtata taaccctatt ccggcctggt ccggtatgga 96901 atgaacttat aatcatggaa tcgactcgat catcagatta taagttcata accctagccc 96961 attcccattt tgggcggaac agatctacta attctttgat tccagttagt aagagggatc 97021 ttgaactaag aaatagaccc tagaagctaa aaaaggctat cctgagcaat tgcaataatt 97081 gggttcattg atattcctgg tatagtagat gctatcacac atacaatcat actcaattcg 97141 atggaattgt ttgatcttaa aggggatctt ctataatttc gcacgtgagg ggttatttct 97201 tggtttcgtc cagtcattaa taactttatt atttttagat aatagtagat agaaacaacg 97261 cttgtaagga gtcctattaa aaccaagaaa tataggcctg cctgccatcc acaccagaat 97321 aaatagagtt ttccgaaaaa acctgctagt ggaggaagac ctcctaggga taagagacat 97381 agggctaaag agagagccaa aaaaggatct tttgtgtata atcctgcata atctcgaatg 97441 ttatcagttc cggtacgtag accaaataat acaatgcaag caaaagttcc tagattcatg 97501 gagatataga acagcatata agttatcatg cttgcatatc catcatttga gtctccaaca 97561 attattccaa taattacata tccgatttgg cctatggacg aatatgcaag catacgtttc 97621 atgcttgttt gagtaatagc aatgagattt cccaatatca tgctaagaat agctaggatt 97681 tccagaagaa gatgccattc gtttgatgag aaataaaaag gaatatcgaa aattcgagtg 97741 gctgaagctg aagcagctac tttcgaagta acagaaagaa aagcaacgac tggagtggga 97801 gagtcagagt cgaaaagagg attcctcact tctttctctc attcaaaacc gtgcatgaga 97861 ctttcatctc acacggctcc taagtgataa aagaaagaag aacccatttt ctttcttttt 97921 tgattacctt cctcgcgtat gtataagacc gaatccattc gatttctaaa aaggattact 97981 aatccttaac ttttcgagga atccttcatc agtggttgtg aatgactgat tttttcaatc 98041 ttttcgacct tggtttcgta ggagcaagtc agaaagattg agaaatagaa ccatctgatt 98101 taattcgttc tcaatagcca cgagatgatc atcttagggt gatccttttg tcgacggatg 98161 ctcttattac actcgtagtc tctgaaggat gagaaccaac tatgtagcat ctacatcgag 98221 aattcaagta ttgtatacgt cattagtccg atcctttgta ggaactaccc gtaataacga 98281 acttgcaaaa tggatctgtt tatcataaag agattcgtcg ttcctgaccc tgcttcacct 98341 taattgttat ttgaacaagt aaaagttctg tcttggtccg agtggggata gcatttctct 98401 tctgcatgtc catggagttt tgaaaaatcc aaacatctca gagatagata gagaggtagg 98461 aatttctcga acgaaccgca ctccttcgta tacgtcagga gtccattgat gagaaggggc 98521 tggggaaagc ttgaacccaa ttcctacggt aatgaatatg agcgcaattg aaattcctgg 98581 ggagttatac atttgtgtat tgataagacc gtttactatt tcttgaagct caatctctcc 98641 cccggatgaa ccatatagcc aagagaaacc atgaaccaga atagaagagc ttgccccacc 98701 catgagtaaa tatttcatag tagcctcatt agaccgtaca tctttcttgg tatatccaga 98761 taataggtag gagcataaac tgaaacattc tggggctaca aagatagtta ttaaatcgtt 98821 agcaccgcat aaaaacattc cccctagagt agctgttaat acgaataaga gaaactctgt 98881 tatagccatt tctgtacatt caatgtactc tacggataga ggaatacata gagttgaaca 98941 tagtaaaata agaaattgaa agatttcgtt gaaattgttc gtttggaaat ttcccgaaaa 99001 gctaatcata ggttcttctc tccatcggaa caatagggcc gttatgctca ttactaaact 99061 tgttgaagag atgaaatata accaaggtat atctttttga tcagaggttg aatcgatcat 99121 cagaagaaga attaggccaa aaattaggat acattctggg aaaatcaaac ttccatcgaa 99181 gagaagcaaa tgaaaggctt tcataaaaat tctcgtagaa tcgagaatga agttttcatt 99241 ctgtacatgc cagatcatga attagtaact gcttccaatt tccaaaaaaa atcccaattg 99301 tgtcgaactt tccatttttg gaatagttac ggaatctcca tgaataggat caaaccttat 99361 tccatggtat ttacatgagg ttcctcttta agaaagtccc cgagaggctt agttgatcca 99421 tgatttatgt ttcatctttc cttttcgttt gtttcgagaa atctatcgat caattccgat 99481 tctttctttt tctcttgatt cttttccgat cgagatgtat agatcctgtt catggattaa 99541 cgaaaatgtg caaaagctct atttgcctct gccattctat gagtctcttc ctttttgcgt 99601 atggcatcgc cactcccttt ggcagcatcc actaattcgg aacttaattt gaaagccata 99661 tttcgacccg gacgttttcg ggatgccgct aataaccaac gaatggcaag tgcttttcct 99721 tgtgtggatc ctatttcaat gggaacttga tgagtcgatc cacctacacg tcttgctttt 99781 actgttatat cgggagttac tccacgtatt gcttgacgta aaacggatag tggatttgtt 99841 tctgtctttt gttgaatctt tttcacggct cgatagataa tttgataagc caatgatttt 99901 tttccgtgtt tcagaatacg gttaaccaac atgttaacta atcgattacg ataaattgga 99961 tcggattttg ctgttttttt ttctgcagta cctcgacgtg acatgagcgt gaaaggggtt 100021 caagaatcag ttttcttttt ataagggcta aaatcactta ttttggcttt tttaccccat 100081 attgtagggt ggatctcgaa agatatgaaa gatctccctc caagccgtac atacgacttt 100141 catcgaatac ggctttccgc agaattctat atgtatctat gagatcgagt atggaattct 100201 gtttactcac tttaaattga gtatccgttt ccctcccttt cctgctagga ttggaaatcc 100261 tgtattttac atatccatac gattgagtcc ttgggtttcc gaaatagtgt aaaaagaagt 100321 gcttcgaatc attgctattt gactcggacc tgttctaaaa aagtcgaggt atttcgaatt 100381 gtttgttgac acggacaaag tcagggaaaa cctctgaaat tatttcaata ttgaaccttg 100441 gacatataag agttccgaat cgaatctctt tagaaagaag atcttttgtc tcatggtagc 100501 ctgctccagt ccccttacga aactttcgtt attgggttag ccatacactt cacatgtttc 100561 tagcgattca catggcatca tcaaatgata caagtcttgg ataagaatct acaacgcact 100621 agaacgccct tgttgacgat cctttactcc gacagcatct agggttcctc gaacaatgtg 100681 atatctcaca ccgggtaaat ccttaaccct tccccctctt actaagacta cagaatgttc 100741 ttgtaaatta tggccaatac cgggtatata agcagtgatt tcaaatccag aggttaatcg 100801 tactctggca actttacgta aggcagagtt tggttttttt ggggtgatag tggaaaagtt 100861 gacagataag tcacccttac tgccactcta cagaaccgta catgagattt tcacctcata 100921 cggctcctcg ttcaattctt tcgaattcat tggatccttt ccgcgttcga gaatcccccc 100981 cttcttccac tccgccccga agagtaacta ggaccaattt agtcacgttt tcatgttcca 101041 attgaacact gtccattttt gattattctc aaaggataag attattctct ttaccaaaca 101101 tatgcggatc caatcacgat cttatatata agaagaacaa aagatctttc ttgatcaatc 101161 cctttgcccc tcattcttca agaataagga agatcctttt caagtttgaa tttgttcatt 101221 tggaatctgg gttcttctac ttcatattta tttaatatga atattttccc tctctttttt 101281 ttatatcatt ccttaagtcc cataggtttg atcctgtaga atttgaccca ttttctcatt 101341 gaacgaaagg tacgaaataa atcagattga taaaagtacc atgtgaaatc ttcggttttt 101401 ccccttcctc gatccctatc ccataggtta ggtacagtgt ttgaatcaat agagaacctt 101461 ttcttctgta tgaatcgata ttattccatt ccaaatcctt cccgatacct cccaaggaaa 101521 atctcgaatt tggatcccaa attgacgggt tagtgtgagc ttatccatgc ggttatgcac 101581 tctttgaata ggaatccgtt ttctgaaaga tcctggcttt cgtactttgg tgggtctccg 101641 agatcctttc gatgacctat gttgaaggga tatctatcta atccgatcga ttgcgtaaag 101701 cccgcggtag caacggaacc ggggaaagta tacagaaaag acagttcttt tctattatat 101761 tagtattttc tattatatta gatatattag actattatat tagattagta ttagttagtg 101821 atcccgactt agtgagtctg atgaattgtt ggcaccagtc ctacattttg tctctgtgga 101881 ccgaggagaa aaggggctcg gcgggaagag gagtgtacca tgagagaagc aaggaggtca 101941 acctctttca aatatacaac atggattctg gcaatgtagt tggactctca tgtcgatccg 102001 aatgaatcat cctttccacg gaggtaaatc tttgcctgct aggcaagagg atagcaagtt 102061 ccaaattctg tctcggtagg acatgtattt ctattactat gaaattcata aatgaagtag 102121 ttaatggtag ggttaccatt atcctttttg tagtgacgaa tcttgtatgt gttcctaaga 102181 aaaggaattt gtccattttt cggggtctca aaggggcgtg gaaacgcata agaactcttg 102241 aatggaaaag agatgtaact ccagttcctt cggaatcggt agtcaatcct atttccgata 102301 ggggcagttg acaattgaat ccgattttga ccattatttt catatccgta atagtgcgaa 102361 aagaaggccc ggctccaagt tgttcaagaa tagtggcgtt gagtttctcg accctttgac 102421 ttaggattag tcagttctat ttctcgatgg ggcggggaag ggatataact cagcggtaga 102481 gtgtcacctt gacgtggtgg aagtcatcag ttcgagcctg attatcccta agcccaatgt 102541 gagtttttct agttggattt gctcccccgc cgtcgttcaa tgagaatgga taagaggctc 102601 gtgggattga cgtgaggggg cagggatggc tatatttctg ggagcgaact ccgggcgaat 102661 atgaagcgca tggatacaag ttatgccttg gaatgaaaga caattccgaa tccgctttgt 102721 ctacgaacaa ggaagctata agtaatgcaa ctatgaatct catggagagt tcgatcctgg 102781 ctcaggatga acgctggcgg catgcttaac acatgcaagt cggacgggaa gtggtgtttc 102841 cagtggcgga cgggtgagta acgcgtaaga acctgccctt gggaggggaa caacagctgg 102901 aaacggctgc taataccccg taggctgagg agcaaaagga ggaatccgcc cgaggagggg 102961 ctcgcgtctg attagctagt tggtgaggca atagcttacc aaggcgatga tcagtagctg 103021 gtccgagagg atgatcagcc acactgggac tgagacacgg cccagactcc tacgggaggc 103081 agcagtgggg aattttccgc aatgggcgaa agctgacgga gcaatgccgc gtggaggtag 103141 aaggcccacg ggtcgtgaac ttcttttccc ggagaagaag caatgacggt atctggggaa 103201 taagcatcgg ctaactctgt gccagcagcc gcggtaatac agaggatgca agcgttatcc 103261 ggaatgattg ggcgtaaagc gtctgtaggt ggctttttaa gtccgccgtc aaatcccagg 103321 gctcaaccct ggacaggcgg tggaaactac caagctggag tacggtaggg gcagagggaa 103381 tttccggtgg agcggtgaaa tgcgtagaga tcggaaagaa caccaacggc gaaagcactc 103441 tgctgggccg acactgacac tgagagacga aagctagggg agcgaatggg attagatacc 103501 ccagtagtcc tagccgtaaa cgatggatac taggcgctgt gcgtatcgac ccgtgcagtg 103561 ctgtagctaa cgcgttaagt atcccgcctg gggagtacgt tcgcaagaat gaaactcaaa 103621 ggaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgat gcaaagcgaa 103681 gaaccttacc agggcttgac atgccgcgaa tcctcttgaa agagaggggt gccttcggga 103741 acgcggacac aggtggtgca tggctgtcgt cagctcgtgc cgtaaggtgt tgggttaagt 103801 cccgcaacga gcgcaaccct cgtgtttagt tgccatcgtt gagtttggaa ccctgaacag 103861 actgccggtg ataagccgga ggaaggtgag gatgacgtca agtcatcatg ccccttatgc 103921 cctgggcgac acacgtgcta caatggccgg gacaaagggt cgcgatcccg cgaggtgagc 103981 taaccccaaa aacccgtcct cagttcggat tgcaggctgc aactcgcctg catgaagccg 104041 gaatcgctag taatcgccgg tcagccatac ggcggtgaat tcgttcccgg gccttgtaca 104101 caccgcccgt cacactatgg gagctggcca tgcccgaagt cgttacctta accgcaagga 104161 gggggatgcc gaaggcaggg ctagtgactg gagtgaagtc gtaacaaggt agccgtactg 104221 gaaggtgcgg ctggatcacc tccttttcag ggagagctaa tgcttgttgg gtattttggt 104281 ttgacactgc ttcacacccc caaaaaaaag aagggagcta cgtctgagtt aaacttggag 104341 atggaagtct tctttccttt ctcgacggtg aagtaagacc aagctcatga gcttattatc 104401 ctaggtcgga acaagttgat aggaccccct tttttacgtc cccatgttcc ccccgtgtgg 104461 cgacatgggg gcgaaaaaag gaaagagagg gatggggttt ctctcgcttt tggcatagcg 104521 ggcccccagt gggaggctcg cacgacgggc tattagctca gtggtagagc gcgcccctga 104581 taattgcgtc gttgtgcctg ggctgtgagg gctctcagcc acatggatag ttcaatgtgc 104641 tcatcggcgc ctgaccctga gatgtggatc atccaaggca cattagcatg gcgtactcct 104701 cctgttcgaa ccggggtttg aaaccaaact cctcctcagg aggatagatg gggcgattcg 104761 ggtgagatcc aatgtagatc caactttcga ttcactcgtg ggatccgggc ggtccggggg 104821 ggaccaccac ggctcctctc ttctcgagaa tccatacatc ccttatcagt gtatggacag 104881 ctatctctcg agcacaggtt tagcaatggg aaaataaaat ggagcaccta acaacgcatc 104941 ttcacagacc aagaactacg agatcgcccc tttcattctg gggtgacgga gggatcgtac 105001 cattcgagcc gtttttttct tgactcgaaa tgggagcagg tttgaaaaag gatcttagag 105061 tgtctagggt tgggccagga gggtctctta acgccttctt ttttcttctc atcggagtta 105121 tttcacaaag acttgccagg gtaaggaaga aggggggaac aagcacactt ggagagcgca 105181 gtacaacgga gagttgtatg ctgcgttcgg gaaggatgaa tcgctcccga aaaggaatct 105241 attgattctc tcccaattgg ttggaccgta ggtgcgatga tttacttcac gggcgaggtc 105301 tctggttcaa gtccaggatg gcccagctgc gccagggaaa agaatagaag aagcatctga 105361 ctacttcatg catgctccac ttggctcggg gggatatagc tcagttggta gagctccgct 105421 cttgcaattg ggtcgttgcg attacgggtt ggatgtctaa ttgtccaggc ggtaatgata 105481 gtatcttgta cctgaaccgg tggctcactt tttctaagta atggggaaga ggaccgaaac 105541 gtgccactga aagactctac tgagacaaag atgggctgtc aagaacgtag aggaggtagg 105601 atgggcagtt ggtcagatct agtatggatc gtacatggac ggtagttgga gtcggcggct 105661 ctcccagggt tccctcatct gagatctctg gggaagagga tcaagttggc ccttgcgaac 105721 agcttgatgc actatctccc ttcaaccctt tgagcgaaat gcggcaaaag aaaaggaagg 105781 aaaatccatg gaccgacccc atcatctcca ccccgtagga actacgagat caccccaagg 105841 acgccttcgg catccagggg tcacggaccg accatagaac cctgttcaat aagtggaacg 105901 cattagctgt ccgctctcag gttgggcagt cagggtcgga gaagggcaat gactcattct 105961 tagttagaat gggattccaa ctcagcacct tttgagtgag attttgagaa gagttgctct 106021 ttggagagca cagtacgatg aaagttgtaa gctgtgttcg ggggggagtt attgtctatc 106081 gttggcctct atggtagaat cagtcggggg acctgagagg cggtggttta ccctgcggcg 106141 gatgtcagcg gttcgagtcc gcttatctcc aactcgtgaa cttagccgat acaaagcttt 106201 atgatagcac ccaatttttc cgattcggcg gttcgatcta tgatttatca ttcatggacg 106261 ttgataagat ccatccattt agcagcacct taggatggca tagccttaaa agtgaagggc 106321 gaggttcaaa cgaggaaagg cttacggtgg atacctaggc acccagagac gaggaagggc 106381 gtagtaatcg acgaaatgct tcggggagtt gaaaataagc atagatccgg agattcccga 106441 atagggcaac ctttcgaact gctgctgaat ccatgggcag gcaagagaca acctggcgaa 106501 ctgaaacatc ttagtagcca gaggaaaaga aagcaaaagc gattcccgta gtagcggcga 106561 gcgaaatggg agcagcctaa accgtgaaaa cggggttgtg ggagagcaat acaagcgtcg 106621 tgctgctagg cgaagcagcc cgaatgctgc accctagatg gcgaaagtcc agtagccgaa 106681 agcatcacta gcttatgctc tgacccgagt agcatggggc acgtggaatc ccgtgtgaat 106741 cagcaaggac caccttgcaa ggctaaatac tcctgggtga ccgatagcga agtagtaccg 106801 tgagggaagg gtgaaaagaa cccccatcgg ggagtgaaat agaacatgaa accgtaagct 106861 cccaagcagt gggaggagcc agggctctga ccgcgtgcct gttgaagaat gagccggcga 106921 ctcataggca gtggcttggt taagggaacc caccggagcc gtagcgaaag cgagtcttca 106981 tagggcaatt gtcactgctt atggacccga acctgggtga tctatccatg accaggatga 107041 agcttgggtg aaactaagtg gaggtccgaa ccgactgatg ttgaagaatc agcggatgag 107101 ttgtggttag gggtgaaatg ccactcgaac ccagagctag ctggttctcc ccgaaatgcg 107161 ttgaggcgca gcagttgact ggacatctag gggtaaagca ctgtttcggt gcgggccgcg 107221 agagcggtac caaatcgagg caaactctga atactagata tgacctcaaa ataacagggg 107281 tcaaggtcgg ctagtgagac gatgggggat aagcttcatc gtcgagaggg aaacagcccg 107341 gatcaccagc taaggcccct aaatgatcgc tcagtgataa aggaggtagg ggtgcagaga 107401 cagccaggag gtttgcctag aagcagccac ccttgaaaga gtgcgtaata gctcactgat 107461 cgagcgctct tgcgccgaag atgaacgggg ctaagcgatc tgccgaagct gtgggatgta 107521 aaaatacatc ggtaggggag cgttccgcct tagagagaag cctccgcgcg agcggtggtg 107581 gacgaagcgg aagcgagaat gtcggcttga gtaacgcaaa cattggtgag aatccaatgc 107641 cccgaaaacc taagggttcc tccgcaaggt tcgtccacgg agggtgagtc agggcctaag 107701 atcaggccga aaggcgtagt cgatggacaa caggtgaata ttcctgtact gccccttgtt 107761 ggtcccgagg gacggaggag gctaggttag ccgaaagatg gttatcggtt caagaacgta 107821 aggtgtccct gctttgtcag ggtaagaagg ggtagagaaa atgcctcgag ccaatgttcg 107881 aataccaggc gctacggcgc tgaagtaacc catgccatac tcccaggaaa agctcgaacg 107941 actttgagca agagggtacc tgtacccgaa accgacacag gtgggtaggt agagaatacc 108001 taggggcgcg agacaactct ctctaaggaa ctcggcaaaa tagccccgta acttcgggag 108061 aaggggtgcc tcctcacaaa gggggtcgca gtgaccaggc ccgggcgact gtttaccaaa 108121 aacacaggtc tccgcaaagt cgtaagacca tgtatggggg ctgacgcctg cccagtgccg 108181 gaaggtcaag gaagttggtg acctgatgac aggggagccg gcgaccgaag ccccggtgaa 108241 cggcggccgt aactataacg gtcctaaggt agcgaaattc cttgtcgggt aagttccgac 108301 ccgcacgaaa ggcgtaacga tctgggcact gtctcggaga gaggctcggt gaaatagaca 108361 tgtctgtgaa gatgcggact acctgcacct ggacagaaag accctatgaa gcttcactgt 108421 tccctgggat tggctttggg cctttcctgc gcagcttagg tggaaggcga agaaggcctc 108481 cttccggggg ggcccgagcc atcagtgaga taccactctg gaagggctag aattctaacc 108541 ttgtgtcagg acctacgggc caagggacag tctcaggtag acagtttcta tggggcgtag 108601 gcctcccaaa aggtaacgga ggcgtgcaaa ggtttcctcg ggccggacgg agattggccc 108661 tcgagtgcaa aggcagaagg gagcttgact gcaagaccca cccgtcgagc agggacgaaa 108721 gtcggcctta gtgatccgac ggtgccgagt ggaagggccg tcgctcaacg gataaaagtt 108781 actctaggga taacaggctg atcttcccca agagctcaca tcgacgggaa ggtttggcac 108841 ctcgatgtcg gctcttcgcc acctggggct gtagtatgtt ccaagggttg ggctgttcgc 108901 ccattaaagc ggtacgtgag ctgggttcag aacgtcgtga gacagttcgg tccatatccg 108961 gtgtgggcgt tagagcattg agaggacctt tccctagtac gagaggaccg ggaaggacgc 109021 acctctggtg taccagttat cgtgcccacg gtaaacgctg ggtagccaag tgcggagcgg 109081 ataactgctg aaagcatcta agtagtaagc ccaccccaag atgagtgctc tcctattccg 109141 acttccccag agcctccggt agcacagccg agacagcgac gggttctctg cccctgcggg 109201 gatggagcga cagaagtttt tttgagaatt caagagaagg tcacggcgag acgagccgtt 109261 tatcattacg ataggtgtca agtggaagtg cagtgatgta tgcagctgag gcatcctaac 109321 agaccggtag acttgaacct tgttcctaca tgacctgatc aattcgatca ggcactcgcc 109381 atctattttc attgttcaaa tctttgacaa cacgaaaaaa ccattgttca actctttgac 109441 aacatgaaaa aaccaaaagc tctgccctcc ctctctatct atccaaggga tggaagggca 109501 gaggcctttg gtgtcccctc cagtcaagaa ttggggcctc acaatcacta gccaatatgc 109561 ttttctctca tgcctttctt cgttcatggt tcgatattct ggtgtcctag gcgtagagga 109621 accacaccaa tccatcccga acttggtggt taaactctac tgcggtgacg atactgtagg 109681 ggaggtcctg cggaaaaata gctcgacgcc aggatgataa aaagcttaac acctctcatt 109741 cttattactt tttcaatatg aaaacgaaaa aaaaaaaaat gaaaaatcaa aaggtcgttt 109801 tattcaaaac cccaattgtg acatcccttc tctcccactt cacacctcgg aacgcaccct 109861 tcttatagag ataaacgcgc cttcacatct tcttaacccg aaatggctgg ggagaggaaa 109921 ggttcctttt tttgagggta ctcccgggaa cagatccagt ggagacgggg tggggcctgt 109981 agctcagagg attagagcac gtggctacga accacggtgt cgggggttcg aatccctcct 110041 cgcccacaac cggcccaaaa gggaagtacc tttccctctg ggggtaggaa aatcatgatc 110101 gggatagcga accaaaagct atggaacttg ggtgtgggtc ttttgtcgaa atggaatggc 110161 ttttcttttt ctctttttat ttatcgtgaa tgggggaatc attacacata gtatgcccgg 110221 tcagcatatt tttttgtttt acgccccgta actcttcctc agccaggctt gggcagaata 110281 gcagagcaag tattagtagc ataacaaaaa agccttcctc gtcattaata tctttgctcg 110341 cggcaattgt gacctctcgg gagaatcgat gactgcatct ttgatgcagt gctagtatat 110401 ctgagacttc ttaattggct agttgtaaat agccccaggg ctatggaaca aaggattatc 110461 tcggacctag accgaggtat tgatggtgat tttctaatct cgcagaacag aatgtgatac 110521 gatgagatag aatgcaatag aaacaaagac agggaacggg ttacctactc ttaacgggca 110581 aagcgagccc ctttattctg aattctttaa ttcagaatca atcaaatctc cccaagtagg 110641 attcgaacct acgaccaatc ggttaacagc cgaccgctct accactgagc tactgaggaa 110701 caacaggaga ttcgatctca tagagttcaa ttcccgttcc caacccatga ccaatatgag 110761 ctcgaagctt ccttcgtaac tcccggaact tcttcgtagt ggctccctta catgcctcat 110821 ttcagaggga acctcaaagt ggctctattt cattatattc catccatatc ccaattccat 110881 tcatttaata tccctttggt gtcattgaca taacagatgt cgtttctagt ctatctcttt 110941 ctatttcttt tctatatatg gaaagttcaa aaatcatcat ataataatcc agaaattgca 111001 atagaaaaga aataagggag gtttgtgatg atttttcaat cttttctact aggtaatcta 111061 gtatccttat gcatgaagat aatcaattcg gtcgttgtgg tcggactcta ttatggattt 111121 ctgaccacat tctccatagg gccctcttat ctcttccttc tccgagctct ggttatggaa 111181 gaaggaaccg agaagaaggt atcagcaaca actggtttta ttacggggca gctcatgatg 111241 ttcatatcga tctattatgc gcctctgcat ctagcattgg gtagacctca tacaataact 111301 gtcctagctc taccatatct tttgtttcat ttcttctgga acaatcacaa acactttttt 111361 gattatggat ctactaccag aaattcaatg cgtaatctca gcattcaatg tgtattcctg 111421 aataatctca tttttcaatt attcaaccat ttcattttac caagttcaat gttagccaga 111481 ttagtcaaca tttatctctt tcgatgcaac agcaagatct tatttgtaac aagtggtttt 111541 gttggttggt taattggtca cattttattc atgaaatggc ttggattggt attagtctgg 111601 atacggcaaa atcattctat tagatcgaat aagtacattc gatctaataa gtaccttgtg 111661 ttagaattga gaaattctat ggctcggatc tttagtattc tcttatttat tacctgtgtc 111721 tactatttag gcagaatacc ctcacccatt cttactaaga aactaaaaga agcctcaaaa 111781 acagaagaaa gggtggaaag tgaggaagaa agagatgtag aaatagaaac agcttccgaa 111841 atgaagggga ctaaacagga acaagaggga tccactgaag aagatcctta tccttctcct 111901 tccctttttt cggaagaaag gtgggatccg gacaaaatcg atgaaacgga agaaatccga 111961 gtgaatggaa aggacaaaat aaaggataaa ttccactctc accttacaga gacaggctat 112021 aattgtaatt gtgaattaaa aaaaacagaa aataaggaat ttgattcaca aagttgaaaa 112081 gagtaagtaa taaactaata aaaagattga aacataagct aaatacaaga aaagataaga 112141 agagatgcgt ccgcccccta tatatttgat accttctcct acaatgaaac taataacccc 112201 aaccccgtta tcagtcccat caattactcg tcgatcaaaa aaatgagtaa attcagctaa 112261 tcctcttatc ccaccaacta agaatcttgt ataaaaagca tctatgtaag cacgattata 112321 tgaccaatca tatatgccat ttataatttt gtcccacaga attctcttag gacccttttt 112381 aacaaaagaa ttaattaact caaaattttt taaagaagaa taaatgggtt tatataaaaa 112441 ggatgctata aatattccga aataagctaa ccatttcgat aatatatcca aattccctcc 112501 ctcttggttg aaaggaattc ctatagatcc aacaaacaaa gtaaagagtc ctaatacaaa 112561 tattgggaat agcatagtat tgtccgattc ataaggatag gaataaaccg ctttatgctc 112621 aaaatgagca atagtcataa aaggtcgtgt catctttctt ccatttttat caattggata 112681 tttagttttt gcaaaaaaat aagtactttc attattattc atagttaata aacaagagtt 112741 tttcttaact ccgtttttac cccatagaga tattgaatag aagggggttt tttgtttccc 112801 accataattt ggaaaatgag cgtttaaatg cccttcaaaa gtaagtaaat agatccgaaa 112861 catataaaat gcggttaatc ccgccgtggc ccaagctatt attgcgaaaa ttggcgaata 112921 caaccaacta tcattaagaa tttcatcttt ggaccaaaaa caagcaagag gtggaatacc 112981 acaaagagaa agtgtaccta ataaaaatgt gattttgcta attggtacat gttttcttaa 113041 acctcccata agacccatat tctgactttt agctggagaa tatccaacaa tagtttccat 113101 tgaatgaata atggatccgg atcctaaaaa taataatgct ttggaataag catgagtaat 113161 caaatgaaat aaagcgcttc gataagaccc cataccaaga gctaacatca tataacccaa 113221 ttgagacatt gtggaatagg ctaaacctct cttaatgtct ttttgagcaa gagctaaagt 113281 agctcctaat aatactgtta ttattcctat aaccgagatc aaatacatta tgtaaggtat 113341 aactctgaaa agaggaagaa gccgagctac aagaaaaatt cccgccgcta ccatagtagc 113401 agcatgtata agagccgaaa tgggagtagg cccctccatg gcatcaggta accatacatg 113461 aagggggaat tgggcggatt tagcaactgc accggcaaat aagagaacag cacataaagt 113521 aacaaataaa aaatcgactt cattattata aatcaagtta ttgaatattt cgaataaatc 113581 cctaaattcg aaactccctg ttatccaata aaaacctaaa attcctaata ataaaccaaa 113641 atcccctaca cgattagtta caaacgcttt ttgacaagca tttgccgcaa caggtcgtgt 113701 aaaccaaaat cctattaata gataggaaca cagcccaacc aattcccaaa aaatataaat 113761 ttgtatcaaa ttcgaactag taactaatcc caacatggaa gtactgaaaa aactcatata 113821 agcaaaaaat ctcaaatagc cttgatcatg agccatataa ttatcactat aaataagaac 113881 cataattcca accgtagtga ttaatattga cataatagaa gtaagtgggt cgatcaagta 113941 tccgaagtct aaagaaaaat cattattgat gatccaagac catacatatt gataaaaaga 114001 actgctattt atttgctgaa tagacaggta gattgaaaaa accatgacta tgcttaacaa 114061 taaaacactc tgaaaagccc acatacggcg aaaacttttt gttgccgttg gaaaaagaaa 114121 aagtcccgct cctattaaca tagggactgg aagtggaatg aaaggtatga tccacgcata 114181 ttcatatgtc tgttccataa aaaagttttg aattcttaat taattgtttc cgattcaccg 114241 gatcttacct cttttgaaag gagtcaataa aaagtcaaaa tatggactaa ctgaaactaa 114301 tttaaaactt aaatcgaatt ttctattctt acttattctg agtctttgct aaatacttca 114361 actattgaaa tcaagaagtt acaattggtc aaatgatatg aaagggatta attactagtc 114421 tcttttgaaa taggcctatt tttctccaag tttgaccagt gaatcgaacg gggattcaag 114481 tttttcattt catgaagtaa aaatgcggtt cttatcttta aacctttcga ggtattttat 114541 tgcatgtaaa tgaaatgtgg aaccataaat agaaatcgag tattttttgg attctttatt 114601 ttattttttc tttttattaa gttcaactaa tttcctttct acagaacagc cgattagcaa 114661 attctatagg tatagatttt atgaatcaaa aataatgtga aataaagata ccagtcaata 114721 gagaaccttt tttttacaat tatgaatgtt ttatggaata gaaaaacttg aaaaaaacac 114781 atattgacct tcttttttta tttccagtat tatgcaattt tcacacatct tttgcctatc 114841 tcgataatgt tttattttag gacgacacta ttagctcgaa aataaatagt agtaaaaaga 114901 attcgttttg aacaatagat gtctttcaca tccagctata acaatgagta attttttaat 114961 ttctaaatgg cagttccaaa aaaacgcact tcgacatcaa aaaagcgtat tcgtaaaaat 115021 atttggaaaa ggaagggata ttcgatcgca ttaaaggctt tttcattagc gaaatctctt 115081 tctaccggga attcaaaaag tttttttgta cgccaaacaa aaataaataa gtaataaaac 115141 gttcgaataa tttgaatcaa cttgaaaaaa gaattcaatt attcttaaat tattcaatta 115201 gataataatt gaataattta acgatttccc tttcatattt gatattgatt agctcaccaa 115261 tcaatacgta atggaactcg cttcgctttt ctgattgata gataaaataa tagaattagg 115321 aaatcctcta tttactgaat aataactttt ttgttgacaa aagagtaaac atcatttcta 115381 ttccaaggtg gggagtttca ttttccccat cgacctattt gcagaattcc attaaaaaaa 115441 aattctatat ttccattcta tttccatatc tatagaagaa cgtatataaa aatctttagt 115501 gaaattagtg aaagttaaga actcattgaa actaattgat tctattttga aacctttttg 115561 ttttgtctaa ctttctaact ctttattttc tctgaattat tatatagata cccatgtata 115621 tcttgccctt aacccaatag agaaaattgc ttaatgaaat tctgtatgac tggttgtcaa 115681 ttttgagcga tgcaaaatag gttcttttct ttctattttg tcttcaaaat ccattttttg 115741 ttttagattt ctgaaataaa ataaatagga aatagctgat taaacaatga aaacaaaaaa 115801 tttgggaact ctattcctta attgagtata gaacggttta gttacaagag ttcaattcga 115861 ggaaagcata aaatatggga aagtcccagg ttaaataaaa aaaactaaga ctctaaactc 115921 aaatctaaaa taatgaacct tcaacttcaa attcctattt gaacaacttt ttattgttat 115981 tgatccattt gaatcattac taaactaaaa tagcttcctc aatctcgacg attgcttatt 116041 cataggctat tatgagttca agacaggccg ctatggtgaa attggtagac acgctgctct 116101 taggaagcag tgctaatgca tctcggttcg agtccgagtg gcggcatacc gtcttctaaa 116161 aaggataaat agatcttata atgaattcaa ttcccgattt cctttttaga attatgtaat 116221 taagggactc ttctttttta agatttttta tgatattttc aaccttagag catatattaa 116281 ctcacatttc cttttcgatc gtttcaattg taattacaat tcatttgata acctttttag 116341 tcgatgaaat cgtaaaacta tacgattcat cagaaaaggg cataatagtt acttttttct 116401 gtataacagg attattagtt actcgttgga tttcttctgg acatttccca ctaagcgatt 116461 tatatgaatc attaattttc ctttcatgga gtttctccct tattcatata attccgtatt 116521 tcaaaaaaaa tgttttaatt ttaagtaaaa taactggccc tagtgctatt tttacccaag 116581 gctttgctac gtcaggtatt ttaactgaaa tacaccaatc tgtaatatta gtacctgctc 116641 ttcaatccga gtggttaata atgcacgtaa gtatgatgat attgggctat gcagctcttt 116701 tatgtggatc attattatca gtagcacttc tagtgattac atttcgaaaa aacagacagc 116761 ttttttataa gagcaatggt tttttaaacg agtcattttt cttgggtgaa aatgttttac 116821 aaaatacttc ttttttttct gctaaaaatt attacaggtc ccaattgatt caacaattgg 116881 attattggag ttatcgggtt attagtttag gatttacttt tttaaccata ggaatccttt 116941 cgggagcggt atgggctaat gaagcgtggg ggtcgtattg gaattgggac ccaaaagaaa 117001 cttgggcatt tattacttgg atcgtatttg caatttattt acatactcga acaaatagaa 117061 atttgcgggg tgcaaattct gcaattgtag cgtctatagg ctttcttata atttggatat 117121 gctattttgg ggtcaatctt ttaggaatag ggttacatag ttatggttct tttccatcaa 117181 catttaattg aattcaagac aagttattac aaatacaaga gcgggcggcg cattgtatga 117241 accagcgtgc ggaccgtgtg aatcatcaat acaatatttg attcacacgg ttttctacca 117301 tatgtagttc aatttcattg tttttactta acttaagagt taagagaaga aaaaaagtct 117361 tctttttttc attgtccaag aatgtttttc aaaacaaaca taggtttttt ttatttcagt 117421 catccaaatt atctataaaa aaaattagat agaataactt cgaccttgtc aactgctaat 117481 gaaagaacga aatccgggta tataccaata cctattacgg gtaaaaagat ggagatcgaa 117541 agaaataact ctcgcggtcc agaatcaaaa aaagaatcct tcggggcatt aaatagcttg 117601 tatccataga acatctggcg tgacatagat aatgaataaa taggagttaa tatcattcca 117661 attgccatta caaaagtaat tagtattttt ggaattaaaa gatatttttg gccggtaatt 117721 attccaaaaa atactatcaa ttcggcaaca aaaccactca tacctggtaa tgcaagggaa 117781 gccatcgaaa agctactgaa catcgtgaac atttttggca ttggaatagc tattccgccc 117841 atttcgtcaa gataaacaag gcggattcta tcataagtcg ttcccgccaa gaaaaaaagt 117901 gcagcaccaa taaatccatg agatattatt tgtaaaaggg ctccattaag tcccgtgtcg 117961 gttagagaac taattcctat aattatgaaa cccatatgag agacagagga ataggctatt 118021 ctttttttta aattccgttg gccaagagat gttaaagctg catagattat ttgtattgta 118081 cctattatca tcaaccaagg agaaaatata gaatgggcat gaggtaataa ttccatattg 118141 attcgaatta atccatacgc tcccattttt aataaaattc cggctagaag catacaagta 118201 ctgtaatgtg cttctccatg ggtatctggt aaccatgtgt gtagggggat aatgggcgat 118261 ttgacagcaa aagcaataaa aaatccaata tagaatatta tttctaaaac cacaggatat 118321 gactgattaa ctgatgtttc aaaatttaat gttggttcat tagaaccata taaagcaaga 118381 cccaaaactc ccattaagag aaaaacagaa ccccccgccg tgtacaaaat aaattttgta 118441 gctgagtaca gacgtttctt tcctccccac atgcatagaa gtagataaac aggaattaat 118501 tctaactccc acatgatgaa aaaaagtaaa aggtcccgag acgaaaatga tccaatttga 118561 ccactgtaca ttgctaacat gagaaaatgg aataatcgag aatctcgagt aactggccaa 118621 gccgctaaag tagctaaagt agtgataaat cctgttaata aaatgggtcc tatagaaagt 118681 ccatctattc ctaatctcca atggaaatca aaaaaattga tccatttata atcctccact 118741 agttggatta atggatcatc cgattggaaa tgataacaaa atgcataagt cgttagaagg 118801 agttctaaaa tacatataca tatcgtatac cacctaatta ccctatttcc tttatgggga 118861 agaaagaaaa ttaaggaacc cgcaaatatt ggaaaaacta caattattgt taaccaagga 118921 aaataattcg tagtaaagac aagatacact tggaccataa aaacccgtgc tcaaaatatt 118981 gtgattttcg agcacaggtt tgtcggtaaa aaaaattaaa tggattcaag tagagttttc 119041 tcgaacgtat caataagcta gacccatact gcgagttgtt tcatgccata aataaactcg 119101 gacactcaag aaatctgttg gacaggcgga ttcacatctc ttacaaccaa cacagtcctc 119161 tgttcgtgga gcagaagcaa tttgtttagc cttacaaccg tcccaaggta tcatttctaa 119221 tacatcggtg ggcaggctcg gacacattga gtacatccta tacacgtatc ataaatcttt 119281 actgaatgtg acattgggtc tatacgtttt tgaatgttag aaattttcga tctagtaaac 119341 ttagaaacga atcatataat catatattta tataccagat gaatcaatga gttatcataa 119401 ttttctaatc aacccccttc tggattggtt tatgagatat gagagagggc caaaatactt 119461 tgatttctta tgttttgcaa acaagatcac accttacgta gcaaacatgc taattaaaat 119521 cgatttatca atattagaat ctagatgatt aatactaatt attcaacaaa tttgattggt 119581 tgatacgagt tgattttctg ttacggtaaa ttgatgaaac aatagccagt ccaatggctg 119641 cttcagcggc tgcaatagct ataacaaaaa ttgagaaaat gtctcctttt aattgacgat 119701 tatcaaaaaa atcagaaaat gttacaaaat ttatattaac cgcattcaat ataagttcaa 119761 gacacataag ggctctaacc atatttcgac ttgtgatcaa tccatagatc ccgatagaaa 119821 ataaataggc actcaaaaca agtacatgtt cgagaatcat taaacaactc cttatcaatc 119881 tcgactcctt tcaatatgaa caacaattca accgatttaa ttgactagta tataacaagt 119941 atggaacaaa gaaatatatt ggtactagat tgacctaaag tctttctatt tatacaacag 120001 gaattcaaat agaattgaag gaaaatgaat gtgataagac agaacaaaat tttatttgaa 120061 ttccaagttt taatagaaat tttttattga cgagctacag caattgcacc tattaaagca 120121 actaaaagga ttattgaaat cagttcaaat ggaagaaaaa aatctgttga taaatgaatt 120181 ccaatttgtt gactattact tataaaatct tgctctataa tctggtttga tcttgtagtc 120241 caaataatcc cgtaccatga cgtatctgaa atagtagtaa ttagtgaaat aaaaagactt 120301 atacaaacca tcgaagtaat tccatctcct acggtccaaa gatgaaaatc tttgtaatat 120361 tctgaaccat tcatgaacat cacagcaaaa atgattaaaa catttatagc tcctacgtaa 120421 ataagtactc gcagcagcta caaaatagga gttagataga atatagaata acgatgtaca 120481 aacaagaacc aatcccaagg aaaaggcaga ataaattgga ttgggaagta ataccactcc 120541 tagaccccct aatataagac ccgaccctag aaagactaaa agaaaatcat gtattggttc 120601 agataaatcc attttttatc aaaaatcaaa aacgaagaat ttcatgactt tattgacctg 120661 accaggaaaa aagaagtttt tcaatttttt atgatacttc ttaattgtta attgaatgaa 120721 attgtaatgg gtatgaattg acgtagatgc ttttatttta ttggaccact atcaattctt 120781 tattcgtcga acgagtagtt taaacctatc gattttggat atcatttatc tactttgaaa 120841 ccattactat tattataact ataatataga aatccgtttt gttttcaatc taaattaagc 120901 taggagtctc attaaccaac cactagtttg aattgaacaa gcaaaaatat cattctttta 120961 gatccgaact aagccttcgt aattcggaat ttttttcgaa tttagggttt attcattttt 121021 tatttgaggt aaattcgaaa ttgttcgaat tgtgtaatca tcaattactg acattggtaa 121081 gcgacccaaa gcgatttgat tataattcaa ttcgtgacga tcataagtag aaagttcata 121141 ttcttcggtc attgataaac aatttgttgg acaatactca acgcaattac cacaaaatat 121201 acagattcca aaatcaatac tgtaattaag caatcgtttc tttcgaatat cagtttccaa 121261 cttccaatca acaacgggta aatctatagg acatacacgc acacatactt cacaagcaat 121321 gcatttatca aattcaaagt ggattcggcc tcggaaacgt tccgatgtga tcaatttttc 121381 gtaggggtat tgaatagtta caggtaaacg atttgcgtgg gacagggtaa tcatgaaacc 121441 ttggccgatg tatctggcgg ctcgtattgt ttgttgacca taatttatga attcagttat 121501 catagggagc atatttagaa tatctataaa aaagatttta tgcttgtttc tttctcttgt 121561 ttgagacaag tcgtgaatct agaatattgt agtcttttac agtgaaagaa gttgggacga 121621 ggttgtcaat aatagattac ctagagaaat aggtaaaaga aatttccacc caagatttaa 121681 tagttggtcc attctcagcc tcggtaaagt ccatcttgtt gcaataggaa tgaacaaaaa 121741 caaataagtt ttggctaatg tgataaagat accaattagt gttccaaaga ctttacccct 121801 tttatttatg ccaaatagct caggaacaaa tatgtacgga atagaaagat tccaacctcc 121861 caaataaaga actgttacaa ataatgaaga aactagtaga ttcagatatg aagcaactgt 121921 aaaatcaaac caaatttgat acctgaatat tcggtttgat accctgctac taattcttct 121981 tctgcttctt ggtaaatcaa aaggtaatct ttcacactcg gctagagaag aaattagaaa 122041 aacgataaac ccgatgggtt gacgccacaa attccacccc caaaagccat attttgactg 122101 cgcttccact atatcaactg tacttaaact gttagataat catagtcgat gataacatca 122161 ctgtgcccat cgctattaca gaaccgtacg tgagattttc atctcatacg gctcctcaga 122221 ggtcacaaat aaatctaagg accctttcct attctttatc ttgatatgtt tgtcagatag 122281 agtaaaaatc tatcctaagg tcccaaatta gaccaatgga attctgtctg ctatatttaa 122341 aactaataaa tacgggcttc tgaattgatc tcatctttta agaattttca tttttctttg 122401 ttgattaata accttatcat taaataaaat gcgctttata gcaatatcac atatacattt 122461 caacctcgaa ttctcaatta cgaaaaaaat tagagagtcc attagttcat gaatcatgac 122521 aaaaaatttc tctctcgaac tagaaatcaa aatggaatta taggaaagaa agaataaaaa 122581 caaaaaaaga aaaaagtaag aaaaaaaaag acatcccccc tttttgcttt tgcaattaga 122641 ttcttttctt tctatttcta ttttatttca ttcctattct cctttctcag aaaaagggcc 122701 tttaaccaaa gtaaaagatt acttcgttct tgatagttat ttacttactc agtggatagg 122761 aacatactct ggatcagaat catggggagt acttcttgat catttctacg aacgtaaagc 122821 cccaattcga attcctttta tgtacagaaa tatcctcttg gataacttac ataatctcaa 122881 ttactaatcc tttgtgtatc ttggtcttcc taaccatcca ctcatttttg ctttcaacct 122941 cccgttgtgg aaatccatct atggtaatag acagtaaaaa ctccatacag ttgatctttt 123001 gaacccgctt caagctatca tgacaattca ccaatcttgg ggtaaacaat ctctattgct 123061 tatgtttact tttttcacca tttgattctt gtacatagga aatgagactc aaccttttta 123121 ctgcaaattt agaagccgtt ttctttcact catataacta tctggtttag ttcatcaacc 123181 caaatgctga ataaaaatga aaatatatat attcaatcaa atctttttac ctttgtttct 123241 agaaagaaaa gaatttggag aaattttagg tctcaccgaa tcacacgtag agatattgat 123301 aacacacata gagctaatgg tattttcata actaattgat tgagcagctg cccgtagacc 123361 acctaaaaaa gaatatttat tatttgatcc atatcccgac ataagaagtc caacgggagc 123421 aatacttgaa atggcaatcc agaaaaaaac accaatacta agatcggcta gaacaaggtg 123481 atcaccaaaa ggaattactg aataacttag aaagatggat attactgcta tggatggtcc 123541 gatactgaat aaacgagtat ctcctgtaga tggaataagg ttctctttca aaagtagttt 123601 tgtcccatct gctagagctt gaagaattcc taaagggcca gcatattcag gtccgatacg 123661 ttgttgtatt cctgcagata tttctctttc taaccaaaca attactagta cacctattgt 123721 gattcctaat acaagagtca aaatagggaa aagcatccat atgatcccat agacttcttt 123781 taaggattcc aatttggaaa aagaattgat agtttctatt tctgttgtat caattatcat 123841 ttcaacgatc aacttctccc ataatgatat ctatgctacc tagtattgtc ataatatcag 123901 ccaatttcat tcttttaact aactgaggaa gaatttgcaa attgataaaa cctggtgggc 123961 gaattttcca tctccaagga aaaacgctct gatctcctat gagaaaaatt cccaattctc 124021 cttttggggc ttcaactctc acataaagtt cttgtttcga caattcaaaa gttggagaag 124081 gttttttact aataaaccga tattcaaaat cattccattc aggatctttt aatctgtcaa 124141 aacgtcggat ttctaaattt tcgtaaggcc ctcctggaat tccttccaga gcctgttgaa 124201 taatctttat ggattctgtc atttcaccga ttcgtactaa ataacgagct aatgaatccc 124261 cttctcgttg ccattgaacc tgccaatcaa attcgtcgta agactcataa tgatcaactt 124321 tacgaagatc ccattctatt ccggaagctc gtagcattgg tcccgataac ccccaattta 124381 atgcttcgtc tcccccaata atgcctacgc cttcaactcg ttctaaaaaa ataggattcc 124441 gggtaataag tttttgatac tcagcaaccc ctgttaaaaa ataatcgcaa aaatccaaac 124501 atttatctat ccagccatag ggtagatcgg cagccactcc cccgatacga aaataattat 124561 gcatcattcg cataccggtg gcagcttcga agaggtcata tatcaattct ctttctcgaa 124621 aaatatagaa gaaaggggtc tgcgcaccaa tatccgccat aaaagggcct agccataaca 124681 aatgagaagc tatccgactc aactccaaca taatgactct gatatagcta gcccttttag 124741 gtacttgaat attgcctaat tgttcgggtc catttatggt tattgcttct gtgaacatag 124801 tagctaaata atcccaacgt gttacataag gcaaatattg tataattgtt cggttttccg 124861 caattttctc catccctcta tgtaaataac ccaatattgg ttcgcagtcg acaacatctt 124921 caccatctag agtaacgatg agtcgaagaa caccgtgcat tgatgggtgc tgaggcccca 124981 tattgactat catgaggtct tttcttgtag ttggtgcagt cataagtttt ttaccgattc 125041 attcttccat gaattgctga aagtgaaaag aagttcatca aaatttaatc gaaacatata 125101 agtgaaaatg aaatgactct tcaaataaat caaattaacg agtttttgtc tctcgaatgt 125161 ccaactgatt aattaattct ttataacgta ctctattttt ttttgacaaa taagctagga 125221 gtcgttgacg ttttcccaaa attttcttca aacctctctg agataaatag tcttttttgt 125281 gcaattctaa atgtgaagta agtctccgta tcttattggt gaaattgaat acttgaaatt 125341 caacagatcc tctcttttct tcttgagaaa taactgaaat gacagaattt tttaccataa 125401 aagaatttcc cctttcttta ttttacagat atggatttta tcgaatttta tcgatcagta 125461 ataataatgc cagtaatttg aacgtggtat atagacttaa tttctttatg aactcctaat 125521 tttatcaatt ccaataaatt aatcaaattc aaaatttgat tcagatagga atccaaaaag 125581 atggtaggta cttttttttt cattcacaaa agcgactaat ttaaacctaa aatcctaaaa 125641 tgaagaagat tttgttgatt cctttctaga tctaatcgat actttattga tttagtatcg 125701 tctactcgaa ttagattcga atgagatgta agaaaaagca tgtgtacatt tgtttacttt 125761 cagatactct atacgaaaca ggatatatag tactatcaat ttattttcaa ttgtggatac 125821 atatgtatcc ttaagatact gaaacgacta ccattattgg tatcaaacca ataacgattc 125881 atacaagcta aatcttctaa tcgataatta ggccaaagaa agaacttcaa tttaattaat 125941 tcatttttct ctttataaag aggtttcctt tcatccaaaa attgactcca gttttttaca 126001 ttgttttcgt tgcaaaatac tgaatttcta tcgatgccat tccaattcaa agaattaaac 126061 aaacttcgaa ttctcaattc tctacgacgt ctagaccata aaatattttc aggaacaagc 126121 aaatcaaaat gatttttgtc tgtatttatt ctttgagttt gaggttgcag aatgaattca 126181 tcaaaattct ttttatcaac atatctttgt tcggggtatc tttgattagt ttggtgttta 126241 cttttatgaa ccaatgaaat acctatggtt tgatacataa taaattgtcc attatttttt 126301 acagacaacc gaataggttc gataattaat atccccttct tcatcaattc tgtaagagtt 126361 aaattcttct gaatcagcat tatatccaaa ctcatttctc tcctttgaat tgacgatata 126421 gcaattttgc ttggatttat cagtcgaagc aggagacaat ataccttgat attctcgatc 126481 attctttgat tcaaagcatc gttccatctc aattgaaaaa gcaaataacg tttcaagaac 126541 aaatctagtt ctgcttccgt gttgcttttg tattgttttt tctttttacc cttctttgtg 126601 tctgattccg cgtaatcttt tttaagagcg ttttgatgtt ttgagagaac agggcccaga 126661 tttcctttgt tttctatatc tgatccacgc tctttttctc cttgacttgc gggttctttt 126721 gcttcttgaa ttcgattctt tattttttta tttgatcgta gaaaaaagtt ttgtttttgg 126781 tttttattga tgtttttatt tgactaacat tttcatttgt attcaaattt aaaagaagta 126841 atttgcttgg tataatccac ggttttattt tatatacatt ataaagtggt acaaattctg 126901 ggaagaacca aaattccaga ttcaatatgg gacgatttaa tattttttca ttcattccca 126961 tccaatcaaa aaaggctttt ttcgaatttt tttgattgtt ttctggattt tgatgaatcg 127021 taagataaaa aaagcctttt ttatcaattt tatcaattat ttgataatta ttaataccaa 127081 ttttagtatt tggattactg ttggtatcga tcttaaccca ggcctcaata tcttcttttt 127141 gtctaagaga aaaatggata attttccaat caaaatattt tctatcgaga tttctttcta 127201 tatatagaat attgcctttt cttagataat tattgatatg aagattgccg agcatatcaa 127261 aaaggttgtg tttggacgtg ttggaattag aagaaatttc gaggttctta tttacttgaa 127321 agggtaatct agaaataaaa gagtcatttt ttttttcata attaatcgat ttatatgcta 127381 aaagatcata tctataacat ttttgaaaat tatctttttg gtttgctaat gaatagagct 127441 cagaatcatt ttcttttttg taatgaatta attggtcttt ttcatatgaa ttccatttgt 127501 ttaaatttcg attttgagcc atacaacctt gattaaccct atttcgccat ttttgtggca 127561 ttaatctaga ccatctaatc tgagataaat cgtattgata atgccgtctt aaccagtttt 127621 tccattgatt gattctataa ctctgaagtt tcttatgttt taattcagaa tgaaatattc 127681 ctagtgttcg aaaatagtcc tttattttag tcttaaggaa aaaagacgtt ctgttatatt 127741 gaagaacaga tcttaattta gacaaattaa taacttgggg ttgtgataat ttgtaaaata 127801 cgatatgctt gtgataagta ggataaatca aaaaaaatat gtgaattttt cttactaata 127861 ttataaagtg acttttttat agtcgaaata aagtgaattt ttttttgatt attaattttt 127921 tcttgattta tttcattatt ggaaatgtat ttatcaatca atttgtttgt tgattcaaga 127981 aagagttgtg tattaattct gggaatatta atgatagata aaaatagatc gatgtataat 128041 ctttgaatga ataattttag aaaataatgg aatttccata ttaatcgagt atttcttctt 128101 tttaatattt ggaaaatctt ttttggcgat tcgaattttt taatattatt tgttttatta 128161 ggactaatgt ctatttctgg agttactttc tttttctctt ttgtaattct ttctatttga 128221 tttttgattg tacttgttct atcagtcaaa tccttcattt tgctttctat cagtgaagaa 128281 tttggccaat ttccagattc aatttgacta aatgattcgt taattatctg attactcatt 128341 agagaatctt tttctttttt cgtttcattc gattcatcta tttctttgag tctaaataat 128401 acaattggat ttacttttga aagttctttt ttcatttttt ttataaatag actacttttg 128461 ataagccatt ttttggtttc ttttgaaatt cttcgaaata attttatttt tcctttgaaa 128521 acttttagag ttataaaata tttctttttg aattttccaa tttttttttc gagttcctta 128581 aaaatgggct caaaaaaaga agggcgtttt cggggagaac caaagggaag ttcagcttcc 128641 attccccaaa ctgttaaaaa acaaaaatca tctttttgtt ttttcttttt cattagctct 128701 ccacgggagg agtacagttt agatatatgc caaggtttca gacaaaaagg aaataatatt 128761 ttgatctgaa tgccatcttt caaccaattt tttggaaatt ctgtttctga taattgaaca 128821 ccattataag tacatttaat atgcatttct ctattccatt cctgcaaatc ttcagaccat 128881 tcaggaagtt gcaagactaa catacgcccg agatttttgg ctattatcaa tgaaggtaat 128941 acaatatatt ttcgaagaat tgattgagtt attaacatgt aacctcttat tatttgcgca 129001 aaaggaatgg tatcccaggc ttctgctatc tctatccgtg ctttttcctt tcttttgttc 129061 tccccttttt tgtccttttc ctttttctct tctctttttg tttgttcttc tctagactct 129121 agaatcttga attctccttc tttacctgac caatttcgaa aaattggttt aatcagtcca 129181 gagatatcaa aagaaaaaag aaaggggggg gttattctgt caagaaaaag gggggaatgc 129241 acatttgctt gaaagagttt ccaaataact gttttgcgcc tttgagcccg catagagcct 129301 ttgattatac ctcgccgaaa atctggttgt tgcgaatagc gtattaaagc cacttccttt 129361 gtttgatctt gatctgcggt atcagtatct ttggtatcag gatcgttatt ctggttgttg 129421 gcagtaaaaa tcactacacg tttggctttt cttgaacgaa tttgatgatc cagtggtacg 129481 ccctcttgat agtcacccga ttgttgttcc aattcggtga ttaatttatg tgaccagcga 129541 ggtatttttt tactgatttc ttttattcca atcgattttt tttcagatgt tgtcccatta 129601 ggagcaattg cattgaatac aaattttaca aatttagttc ttttttctga attcactctt 129661 ccctgttctt ggtctgaaaa taaagaaagg tctttcaaat ttaaactcga ttttggttcg 129721 ttaccaaatt cattgattaa agttaagaac tcgtcaattt ctgttgataa tggtttttta 129781 gcaaccgtat ccactttttg ttccaattct tggtaatcag tattcggaag aaagatagta 129841 tgaatcctat ttattctaac cctctctttc aaattttcta gcgaagtatt gtttatgatt 129901 gaaggtgaaa actttttttt gattgttcct cgatatggtc catttaacaa aggatcatac 129961 attttaggca cgtattcttt tttagtatca tcattacaca atctagtcct tgtttcaagt 130021 atatcgagag aaaaagattc cttgtctaga acttcaagtc gatttaaaaa ttccttattc 130081 agattattac ttttttcttt gttggtagaa atccactgat tgtccagttc attagggagt 130141 gttttttgga gtgacaatag gggtatcctt ctttttatca ttttccaaaa agttgataaa 130201 cttggcgggt atgtaaaaga tattctttgt tttccatcac ttttacatgt gttaaaaaaa 130261 tattgtgaca tttccgttct tatggcctgt tcaaatcgat tattctttat gtagcgaaat 130321 ggtcgattcc atcgattata atcgaaaaga agactcacaa gaggctgttg aaaccagaag 130381 aggtctttat tttcattttt tttatcaagc agttgcaatt taaaaatttc tgtattcccc 130441 gtgttattat tattcagata agaatcctca taatcataaa ttggactatt actagtatta 130501 atattattat agcctgtctc tgtaaggtga gagtggaatt tatcctttat tttgtccttt 130561 ccattcactc ggatttcttc cgtttcatcg attttgtccg gatcccacct ttcttccgaa 130621 aaaagggaag gagaaggata aggatcttct tcagtggatc cctcttgttc ctgtttagtc 130681 cccttcattt cggaagctgt ttctatttct acatctcttt cttcctcact ttccaccctt 130741 tcttctgttt ttgaggcttc ttttagtttc ttagtaagaa tgggtgaggg tattctgcct 130801 aaatagtaga cacaggtaat aaataagaga atactaaaga tccgagccat agaatttctc 130861 aattctaaca caaggtactt attagatcga atgtacttat tcgatctaat agaatgattt 130921 tgccgtatcc agactaatac caatccaagc catttcatga ataaaatgtg accaattaac 130981 caaccaacaa aaccacttgt tacaaataag atcttgctgt tgcatcgaaa gagataaatg 131041 ttgactaatc tggctaacat tgaacttggt aaaatgaaat ggttgaataa ttgaaaaatg 131101 agattattca ggaatacaca ttgaatgctg agattacgca ttgaatttct ggtagtagat 131161 ccataatcaa aaaagtgttt gtgattgttc cagaagaaat gaaacaaaag atatggtaga 131221 gctaggacag ttattgtatg aggtctaccc aatgctagat gcagaggcgc ataatagatc 131281 gatatgaaca tcatgagctg ccccgtaata aaaccagttg ttgctgatac cttcttctcg 131341 gttccttctt ccataaccag agctcggaga aggaagagat aagagggccc tatggagaat 131401 gtggtcagaa atccataata gagtccgacc acaacgaccg aattgattat cttcatgcat 131461 aaggatacta gattacctag tagaaaagat tgaaaaatca tcacaaacct cccttatttc 131521 ttttctattg caatttctgg attattatat gatgattttt gaactttcca tatatagaaa 131581 agaaatagaa agagatagac tagaaacgac atctgttatg tcaatgacac caaagggata 131641 ttaaatgaat ggaattggga tatggatgga atataatgaa atagagccac tttgaggttc 131701 cctctgaaat gaggcatgta agggagccac tacgaagaag ttccgggagt tacgaaggaa 131761 gcttcgagct catattggtc atgggttggg aacgggaatt gaactctatg agatcgaatc 131821 tcctgttgtt cctcagtagc tcagtggtag agcggtcggc tgttaaccga ttggtcgtag 131881 gttcgaatcc tacttgggga gatttgattg attctgaatt aaagaattca gaataaaggg 131941 gctcgctttg cccgttaaga gtaggtaacc cgttccctgt ctttgtttct attgcattct 132001 atctcatcgt atcacattct gttctgcgag attagaaaat caccatcaat acctcggtct 132061 aggtccgaga taatcctttg ttccatagcc ctggggctat ttacaactag ccaattaaga 132121 agtctcagat atactagcac tgcatcaaag atgcagtcat cgattctccc gagaggtcac 132181 aattgccgcg agcaaagata ttaatgacga ggaaggcttt tttgttatgc tactaatact 132241 tgctctgcta ttctgcccaa gcctggctga ggaagagtta cggggcgtaa aacaaaaaaa 132301 tatgctgacc gggcatacta tgtgtaatga ttcccccatt cacgataaat aaaaagagaa 132361 aaagaaaagc cattccattt cgacaaaaga cccacaccca agttccatag cttttggttc 132421 gctatcccga tcatgatttt cctaccccca gagggaaagg tacttccctt ttgggccggt 132481 tgtgggcgag gagggattcg aacccccgac accgtggttc gtagccacgt gctctaatcc 132541 tctgagctac aggccccacc ccgtctccac tggatctgtt cccgggagta ccctcaaaaa 132601 aaggaacctt tcctctcccc agccatttcg ggttaagaag atgtgaaggc gcgtttatct 132661 ctataagaag ggtgcgttcc gaggtgtgaa gtgggagaga agggatgtca caattggggt 132721 tttgaataaa acgacctttt gatttttcat tttttttttt ttcgttttca tattgaaaaa 132781 gtaataagaa tgagaggtgt taagcttttt atcatcctgg cgtcgagcta tttttccgca 132841 ggacctcccc tacagtatcg tcaccgcagt agagtttaac caccaagttc gggatggatt 132901 ggtgtggttc ctctacgcct aggacaccag aatatcgaac catgaacgaa gaaaggcatg 132961 agagaaaagc atattggcta gtgattgtga ggccccaatt cttgactgga ggggacacca 133021 aaggcctctg cccttccatc ccttggatag atagagaggg agggcagagc ttttggtttt 133081 ttcatgttgt caaagagttg aacaatggtt ttttcgtgtt gtcaaagatt tgaacaatga 133141 aaatagatgg cgagtgcctg atcgaattga tcaggtcatg taggaacaag gttcaagtct 133201 accggtctgt taggatgcct cagctgcata catcactgca cttccacttg acacctatcg 133261 taatgataaa cggctcgtct cgccgtgacc ttctcttgaa ttctcaaaaa aacttctgtc 133321 gctccatccc cgcaggggca gagaacccgt cgctgtctcg gctgtgctac cggaggctct 133381 ggggaagtcg gaataggaga gcactcatct tggggtgggc ttactactta gatgctttca 133441 gcagttatcc gctccgcact tggctaccca gcgtttaccg tgggcacgat aactggtaca 133501 ccagaggtgc gtccttcccg gtcctctcgt actagggaaa ggtcctctca atgctctaac 133561 gcccacaccg gatatggacc gaactgtctc acgacgttct gaacccagct cacgtaccgc 133621 tttaatgggc gaacagccca acccttggaa catactacag ccccaggtgg cgaagagccg 133681 acatcgaggt gccaaacctt cccgtcgatg tgagctcttg gggaagatca gcctgttatc 133741 cctagagtaa cttttatccg ttgagcgacg gcccttccac tcggcaccgt cggatcacta 133801 aggccgactt tcgtccctgc tcgacgggtg ggtcttgcag tcaagctccc ttctgccttt 133861 gcactcgagg gccaatctcc gtccggcccg aggaaacctt tgcacgcctc cgttaccttt 133921 tgggaggcct acgccccata gaaactgtct acctgagact gtcccttggc ccgtaggtcc 133981 tgacacaagg ttagaattct agcccttcca gagtggtatc tcactgatgg ctcgggcccc 134041 cccggaagga ggccttcttc gccttccacc taagctgcgc aggaaaggcc caaagccaat 134101 cccagggaac agtgaagctt catagggtct ttctgtccag gtgcaggtag tccgcatctt 134161 cacagacatg tctatttcac cgagcctctc tccgagacag tgcccagatc gttacgcctt 134221 tcgtgcgggt cggaacttac ccgacaagga atttcgctac cttaggaccg ttatagttac 134281 ggccgccgtt caccggggct tcggtcgccg gctcccctgt catcaggtca ccaacttcct 134341 tgaccttccg gcactgggca ggcgtcagcc cccatacatg gtcttacgac tttgcggaga 134401 cctgtgtttt tggtaaacag tcgcccgggc ctggtcactg cgaccccctt tgtgaggagg 134461 caccccttct cccgaagtta cggggctatt ttgccgagtt ccttagagag agttgtctcg 134521 cgcccctagg tattctctac ctacccacct gtgtcggttt cgggtacagg taccctcttg 134581 ctcaaagtcg ttcgagcttt tcctgggagt atggcatggg ttacttcagc gccgtagcgc 134641 ctggtattcg aacattggct cgaggcattt tctctacccc ttcttaccct gacaaagcag 134701 ggacacctta cgttcttgaa ccgataacca tctttcggct aacctagcct cctccgtccc 134761 tcgggaccaa caaggggcag tacaggaata ttcacctgtt gtccatcgac tacgcctttc 134821 ggcctgatct taggccctga ctcaccctcc gtggacgaac cttgcggagg aacccttagg 134881 ttttcggggc attggattct caccaatgtt tgcgttactc aagccgacat tctcgcttcc 134941 gcttcgtcca ccaccgctcg cgcggaggct tctctctaag gcggaacgct cccctaccga 135001 tgtattttta catcccacag cttcggcaga tcgcttagcc ccgttcatct tcggcgcaag 135061 agcgctcgat cagtgagcta ttacgcactc tttcaagggt ggctgcttct aggcaaacct 135121 cctggctgtc tctgcacccc tacctccttt atcactgagc gatcatttag gggccttagc 135181 tggtgatccg ggctgtttcc ctctcgacga tgaagcttat cccccatcgt ctcactagcc 135241 gaccttgacc cctgttattt tgaggtcata tctagtattc agagtttgcc tcgatttggt 135301 accgctctcg cggcccgcac cgaaacagtg ctttacccct agatgtccag tcaactgctg 135361 cgcctcaacg catttcgggg agaaccagct agctctgggt tcgagtggca tttcacccct 135421 aaccacaact catccgctga ttcttcaaca tcagtcggtt cggacctcca cttagtttca 135481 cccaagcttc atcctggtca tggatagatc acccaggttc gggtccataa gcagtgacaa 135541 ttgccctatg aagactcgct ttcgctacgg ctccggtggg ttcccttaac caagccactg 135601 cctatgagtc gccggctcat tcttcaacag gcacgcggtc agagccctgg ctcctcccac 135661 tgcttgggag cttacggttt catgttctat ttcactcccc gatgggggtt cttttcaccc 135721 ttccctcacg gtactacttc gctatcggtc acccaggagt atttagcctt gcaaggtggt 135781 ccttgctgat tcacacggga ttccacgtgc cccatgctac tcgggtcaga gcataagcta 135841 gtgatgcttt cggctactgg actttcgcca tctagggtgc agcattcggg ctgcttcgcc 135901 tagcagcacg acgcttgtat tgctctccca caaccccgtt ttcacggttt aggctgctcc 135961 catttcgctc gccgctacta cgggaatcgc ttttgctttc ttttcctctg gctactaaga 136021 tgtttcagtt cgccaggttg tctcttgcct gcccatggat tcagcagcag ttcgaaaggt 136081 tgccctattc gggaatctcc ggatctatgc ttattttcaa ctccccgaag catttcgtcg 136141 attactacgc ccttcctcgt ctctgggtgc ctaggtatcc accgtaagcc tttcctcgtt 136201 tgaacctcgc ccttcacttt taaggctatg ccatcctaag gtgctgctaa atggatggat 136261 cttatcaacg tccatgaatg ataaatcata gatcgaaccg ccgaatcgga aaaattgggt 136321 gctatcataa agctttgtat cggctaagtt cacgagttgg agataagcgg actcgaaccg 136381 ctgacatccg ccgcagggta aaccaccgcc tctcaggtcc cccgactgat tctaccatag 136441 aggccaacga tagacaataa ctcccccccg aacacagctt acaactttca tcgtactgtg 136501 ctctccaaag agcaactctt ctcaaaatct cactcaaaag gtgctgagtt ggaatcccat 136561 tctaactaag aatgagtcat tgcccttctc cgaccctgac tgcccaacct gagagcggac 136621 agctaatgcg ttccacttat tgaacagggt tctatggtcg gtccgtgacc cctggatgcc 136681 gaaggcgtcc ttggggtgat ctcgtagttc ctacggggtg gagatgatgg ggtcggtcca 136741 tggattttcc ttccttttct tttgccgcat ttcgctcaaa gggttgaagg gagatagtgc 136801 atcaagctgt tcgcaagggc caacttgatc ctcttcccca gagatctcag atgagggaac 136861 cctgggagag ccgccgactc caactaccgt ccatgtacga tccatactag atctgaccaa 136921 ctgcccatcc tacctcctct acgttcttga cagcccatct ttgtctcagt agagtctttc 136981 agtggcacgt ttcggtcctc ttccccatta cttagaaaaa gtgagccacc ggttcaggta 137041 caagatacta tcattaccgc ctggacaatt agacatccaa cccgtaatcg caacgaccca 137101 attgcaagag cggagctcta ccaactgagc tatatccccc cgagccaagt ggagcatgca 137161 tgaagtagtc agatgcttct tctattcttt tccctggcgc agctgggcca tcctggactt 137221 gaaccagaga cctcgcccgt gaagtaaatc atcgcaccta cggtccaacc aattgggaga 137281 gaatcaatag attccttttc gggagcgatt catccttccc gaacgcagca tacaactctc 137341 cgttgtactg cgctctccaa gtgtgcttgt tccccccttc ttccttaccc tggcaagtct 137401 ttgtgaaata actccgatga gaagaaaaaa gaaggcgtta agagaccctc ctggcccaac 137461 cctagacact ctaagatcct ttttcaaacc tgctcccatt tcgagtcaag aaaaaaacgg 137521 ctcgaatggt acgatccctc cgtcacccca gaatgaaagg ggcgatctcg tagttcttgg 137581 tctgtgaaga tgcgttgtta ggtgctccat tttattttcc cattgctaaa cctgtgctcg 137641 agagatagct gtccatacac tgataaggga tgtatggatt ctcgagaaga gaggagccgt 137701 ggtggtcccc cccggaccgc ccggatccca cgagtgaatc gaaagttgga tctacattgg 137761 atctcacccg aatcgcccca tctatcctcc tgaggaggag tttggtttca aaccccggtt 137821 cgaacaggag gagtacgcca tgctaatgtg ccttggatga tccacatctc agggtcaggc 137881 gccgatgagc acattgaact atccatgtgg ctgagagccc tcacagccca ggcacaacga 137941 cgcaattatc aggggcgcgc tctaccactg agctaatagc ccgtcgtgcg agcctcccac 138001 tgggggcccg ctatgccaaa agcgagagaa accccatccc tctctttcct tttttcgccc 138061 ccatgtcgcc acacgggggg aacatgggga cgtaaaaaag ggggtcctat caacttgttc 138121 cgacctagga taataagctc atgagcttgg tcttacttca ccgtcgagaa aggaaagaag 138181 acttccatct ccaagtttaa ctcagacgta gctcccttct tttttttggg ggtgtgaagc 138241 agtgtcaaac caaaataccc aacaagcatt agctctccct gaaaaggagg tgatccagcc 138301 gcaccttcca gtacggctac cttgttacga cttcactcca gtcactagcc ctgccttcgg 138361 catccccctc cttgcggtta aggtaacgac ttcgggcatg gccagctccc atagtgtgac 138421 gggcggtgtg tacaaggccc gggaacgaat tcaccgccgt atggctgacc ggcgattact 138481 agcgattccg gcttcatgca ggcgagttgc agcctgcaat ccgaactgag gacgggtttt 138541 tggggttagc tcacctcgcg ggatcgcgac cctttgtccc ggccattgta gcacgtgtgt 138601 cgcccagggc ataaggggca tgatgacttg acgtcatcct caccttcctc cggcttatca 138661 ccggcagtct gttcagggtt ccaaactcaa cgatggcaac taaacacgag ggttgcgctc 138721 gttgcgggac ttaacccaac accttacggc acgagctgac gacagccatg caccacctgt 138781 gtccgcgttc ccgaaggcac ccctctcttt caagaggatt cgcggcatgt caagccctgg 138841 taaggttctt cgctttgcat cgaattaaac cacatgctcc accgcttgtg cgggcccccg 138901 tcaattcctt tgagtttcat tcttgcgaac gtactcccca ggcgggatac ttaacgcgtt 138961 agctacagca ctgcacgggt cgatacgcac agcgcctagt atccatcgtt tacggctagg 139021 actactgggg tatctaatcc cattcgctcc cctagctttc gtctctcagt gtcagtgtcg 139081 gcccagcaga gtgctttcgc cgttggtgtt ctttccgatc tctacgcatt tcaccgctcc 139141 accggaaatt ccctctgccc ctaccgtact ccagcttggt agtttccacc gcctgtccag 139201 ggttgagccc tgggatttga cggcggactt aaaaagccac ctacagacgc tttacgccca 139261 atcattccgg ataacgcttg catcctctgt attaccgcgg ctgctggcac agagttagcc 139321 gatgcttatt ccccagatac cgtcattgct tcttctccgg gaaaagaagt tcacgacccg 139381 tgggccttct acctccacgc ggcattgctc cgtcagcttt cgcccattgc ggaaaattcc 139441 ccactgctgc ctcccgtagg agtctgggcc gtgtctcagt cccagtgtgg ctgatcatcc 139501 tctcggacca gctactgatc atcgccttgg taagctattg cctcaccaac tagctaatca 139561 gacgcgagcc cctcctcggg cggattcctc cttttgctcc tcagcctacg gggtattagc 139621 agccgtttcc agctgttgtt cccctcccaa gggcaggttc ttacgcgtta ctcacccgtc 139681 cgccactgga aacaccactt cccgtccgac ttgcatgtgt taagcatgcc gccagcgttc 139741 atcctgagcc aggatcgaac tctccatgag attcatagtt gcattactta tagcttcctt 139801 gttcgtagac aaagcggatt cggaattgtc tttcattcca aggcataact tgtatccatg 139861 cgcttcatat tcgcccggag ttcgctccca gaaatatagc catccctgcc ccctcacgtc 139921 aatcccacga gcctcttatc cattctcatt gaacgacggc gggggagcaa atccaactag 139981 aaaaactcac attgggctta gggataatca ggctcgaact gatgacttcc accacgtcaa 140041 ggtgacactc taccgctgag ttatatccct tccccgcccc atcgagaaat agaactgact 140101 aatcctaagt caaagggtcg agaaactcaa cgccactatt cttgaacaac ttggagccgg 140161 gccttctttt cgcactatta cggatatgaa aataatggtc aaaatcggat tcaattgtca 140221 actgccccta tcggaaatag gattgactac cgattccgaa ggaactggag ttacatctct 140281 tttccattca agagttctta tgcgtttcca cgcccctttg agaccccgaa aaatggacaa 140341 attccttttc ttaggaacac atacaagatt cgtcactaca aaaaggataa tggtaaccct 140401 accattaact acttcattta tgaatttcat agtaatagaa atacatgtcc taccgagaca 140461 gaatttggaa cttgctatcc tcttgcctag caggcaaaga tttacctccg tggaaaggat 140521 gattcattcg gatcgacatg agagtccaac tacattgcca gaatccatgt tgtatatttg 140581 aaagaggttg acctccttgc ttctctcatg gtacactcct cttcccgccg agcccctttt 140641 ctcctcggtc cacagagaca aaatgtagga ctggtgccaa caattcatca gactcactaa 140701 gtcgggatca ctaactaata ctaatctaat ataatagtct aatatatcta atataataga 140761 aaatactaat ataatagaaa agaactgtct tttctgtata ctttccccgg ttccgttgct 140821 accgcgggct ttacgcaatc gatcggatta gatagatatc ccttcaacat aggtcatcga 140881 aaggatctcg gagacccacc aaagtacgaa agccaggatc tttcagaaaa cggattccta 140941 ttcaaagagt gcataaccgc atggataagc tcacactaac ccgtcaattt gggatccaaa 141001 ttcgagattt tccttgggag gtatcgggaa ggatttggaa tggaataata tcgattcata 141061 cagaagaaaa ggttctctat tgattcaaac actgtaccta acctatggga tagggatcga 141121 ggaaggggaa aaaccgaaga tttcacatgg tacttttatc aatctgattt atttcgtacc 141181 tttcgttcaa tgagaaaatg ggtcaaattc tacaggatca aacctatggg acttaaggaa 141241 tgatataaaa aaaagagagg gaaaatattc atattaaata aatatgaagt agaagaaccc 141301 agattccaaa tgaacaaatt caaacttgaa aaggatcttc cttattcttg aagaatgagg 141361 ggcaaaggga ttgatcaaga aagatctttt gttcttctta tatataagat cgtgattgga 141421 tccgcatatg tttggtaaag agaataatct tatcctttga gaataatcaa aaatggacag 141481 tgttcaattg gaacatgaaa acgtgactaa attggtccta gttactcttc ggggcggagt 141541 ggaagaaggg ggggattctc gaacgcggaa aggatccaat gaattcgaaa gaattgaacg 141601 aggagccgta tgaggtgaaa atctcatgta cggttctgta gagtggcagt aagggtgact 141661 tatctgtcaa cttttccact atcaccccaa aaaaaccaaa ctctgcctta cgtaaagttg 141721 ccagagtacg attaacctct ggatttgaaa tcactgctta tatacccggt attggccata 141781 atttacaaga acattctgta gtcttagtaa gagggggaag ggttaaggat ttacccggtg 141841 tgagatatca cattgttcga ggaaccctag atgctgtcgg agtaaaggat cgtcaacaag 141901 ggcgttctag tgcgttgtag attcttatcc aagacttgta tcatttgatg atgccatgtg 141961 aatcgctaga aacatgtgaa gtgtatggct aacccaataa cgaaagtttc gtaaggggac 142021 tggagcaggc taccatgaga caaaagatct tctttctaaa gagattcgat tcggaactct 142081 tatatgtcca aggttcaata ttgaaataat ttcagaggtt ttccctgact ttgtccgtgt 142141 caacaaacaa ttcgaaatac ctcgactttt ttagaacagg tccgagtcaa atagcaatga 142201 ttcgaagcac ttctttttac actatttcgg aaacccaagg actcaatcgt atggatatgt 142261 aaaatacagg atttccaatc ctagcaggaa agggagggaa acggatactc aatttaaagt 142321 gagtaaacag aattccatac tcgatctcat agatacatat agaattctgc ggaaagccgt 142381 attcgatgaa agtcgtatgt acggcttgga gggagatctt tcatatcttt cgagatccac 142441 cctacaatat ggggtaaaaa agccaaaata agtgatttta gcccttataa aaagaaaact 142501 gattcttgaa cccctttcac gctcatgtca cgtcgaggta ctgcagaaaa aaaaacagca 142561 aaatccgatc caatttatcg taatcgatta gttaacatgt tggttaaccg tattctgaaa 142621 cacggaaaaa aatcattggc ttatcaaatt atctatcgag ccgtgaaaaa gattcaacaa 142681 aagacagaaa caaatccact atccgtttta cgtcaagcaa tacgtggagt aactcccgat 142741 ataacagtaa aagcaagacg tgtaggtgga tcgactcatc aagttcccat tgaaatagga 142801 tccacacaag gaaaagcact tgccattcgt tggttattag cggcatcccg aaaacgtccg 142861 ggtcgaaata tggctttcaa attaagttcc gaattagtgg atgctgccaa agggagtggc 142921 gatgccatac gcaaaaagga agagactcat agaatggcag aggcaaatag agcttttgca 142981 cattttcgtt aatccatgaa caggatctat acatctcgat cggaaaagaa tcaagagaaa 143041 aagaaagaat cggaattgat cgatagattt ctcgaaacaa acgaaaagga aagatgaaac 143101 ataaatcatg gatcaactaa gcctctcggg gactttctta aagaggaacc tcatgtaaat 143161 accatggaat aaggtttgat cctattcatg gagattccgt aactattcca aaaatggaaa 143221 gttcgacaca attgggattt tttttggaaa ttggaagcag ttactaattc atgatctggc 143281 atgtacagaa tgaaaacttc attctcgatt ctacgagaat ttttatgaaa gcctttcatt 143341 tgcttctctt cgatggaagt ttgattttcc cagaatgtat cctaattttt ggcctaattc 143401 ttcttctgat gatcgattca acctctgatc aaaaagatat accttggtta tatttcatct 143461 cttcaacaag tttagtaatg agcataacgg ccctattgtt ccgatggaga gaagaaccta 143521 tgattagctt ttcgggaaat ttccaaacga acaatttcaa cgaaatcttt caatttctta 143581 ttttactatg ttcaactcta tgtattcctc tatccgtaga gtacattgaa tgtacagaaa 143641 tggctataac agagtttctc ttattcgtat taacagctac tctaggggga atgtttttat 143701 gcggtgctaa cgatttaata actatctttg tagccccaga atgtttcagt ttatgctcct 143761 acctattatc tggatatacc aagaaagatg tacggtctaa tgaggctact atgaaatatt 143821 tactcatggg tggggcaagc tcttctattc tggttcatgg tttctcttgg ctatatggtt 143881 catccggggg agagattgag cttcaagaaa tagtaaacgg tcttatcaat acacaaatgt 143941 ataactcccc aggaatttca attgcgctca tattcattac cgtaggaatt gggttcaagc 144001 tttccccagc cccttctcat caatggactc ctgacgtata cgaaggagtg cggttcgttc 144061 gagaaattcc tacctctcta tctatctctg agatgtttgg atttttcaaa actccatgga 144121 catgcagaag agaaatgcta tccccactcg gaccaagaca gaacttttac ttgttcaaat 144181 aacaattaag gtgaagcagg gtcaggaacg acgaatctct ttatgataaa cagatccatt 144241 ttgcaagttc gttattacgg gtagttccta caaaggatcg gactaatgac gtatacaata 144301 cttgaattct cgatgtagat gctacatagt tggttctcat ccttcagaga ctacgagtgt 144361 aataagagca tccgtcgaca aaaggatcac cctaagatga tcatctcgtg gctattgaga 144421 acgaattaaa tcagatggtt ctatttctca atctttctga cttgctccta cgaaaccaag 144481 gtcgaaaaga ttgaaaaaat cagtcattca caaccactga tgaaggattc ctcgaaaagt 144541 taaggattag taatcctttt tagaaatcga atggattcgg tcttatacat acgcgaggaa 144601 ggtaatcaaa aaagaaagaa aatgggttct tctttctttt atcacttagg agccgtgtga 144661 gatgaaagtc tcatgcacgg ttttgaatga gagaaagaag tgaggaatcc tcttttcgac 144721 tctgactctc ccactccagt cgttgctttt ctttctgtta cttcgaaagt agctgcttca 144781 gcttcagcca ctcgaatttt cgatattcct ttttatttct catcaaacga atggcatctt 144841 cttctggaaa tcctagctat tcttagcatg atattgggaa atctcattgc tattactcaa 144901 acaagcatga aacgtatgct tgcatattcg tccataggcc aaatcggata tgtaattatt 144961 ggaataattg ttggagactc aaatgatgga tatgcaagca tgataactta tatgctgttc 145021 tatatctcca tgaatctagg aacttttgct tgcattgtat tatttggtct acgtaccgga 145081 actgataaca ttcgagatta tgcaggatta tacacaaaag atcctttttt ggctctctct 145141 ttagccctat gtctcttatc cctaggaggt cttcctccac tagcaggttt tttcggaaaa 145201 ctctatttat tctggtgtgg atggcaggca ggcctatatt tcttggtttt aataggactc 145261 cttacaagcg ttgtttctat ctactattat ctaaaaataa taaagttatt aatgactgga 145321 cgaaaccaag aaataacccc tcacgtgcga aattatagaa gatccccttt aagatcaaac 145381 aattccatcg aattgagtat gattgtatgt gtgatagcat ctactatacc aggaatatca 145441 atgaacccaa ttattgcaat tgctcaggat agcctttttt agcttctagg gtctatttct 145501 tagttcaaga tccctcttac taactggaat caaagaatta gtagatctgt tccgcccaaa 145561 atgggaatgg gctagggtta tgaacttata atctgatgat cgagtcgatt ccatgattat 145621 aagttcattc cataccggac caggccggaa tagggttata tacattctca ttatgagaag 145681 gggtcattcg ggcctatcta aatagatact atgtttacat atggattcct acatcattac 145741 attccattta ggattaggaa tacgcgtaat cggacctgct ttttacatat ctctattggg 145801 accctattca cctctttgag tgaatcgaga aataggtttg attgtccatc tttttgatat 145861 atatcaggca ttgcattctc cggataattc aaatcgaagc aattggatgt ccaactcggg 145921 cctatatgac atgaccgatc aatagatcca cctttgtcat atattccata catcacacta 145981 gatagatatc atattcatgg aatacgattc actttcaaga tgccttggtg gtgaaatggt 146041 agacacgcga gactcaaaat ctcgtgctaa atagcgtgga ggttcgagtc ctcttcaagg 146101 cataatattg agaatgctca ttgaatgagc attctcaata agagagctcg gatcgaatcg 146161 gtattgatat accgattcga tccgagctct tggaattgga ataaattcgg cagcggatcg 146221 cgaaatcttg gtgatcttct ctatctaatg aatggggagt ccgctttaaa atcgtccgcc 146281 ctgcacccac cccccgagta tatgcttcaa caggaatcac acaagggtag attagaaacc 146341 tctggtaaaa tgcccgcccg taacccagca gataaagtac attacatagt ccagggattg 146401 gcgacttacc cattcagtga ctttggcact ggacgttccc aaaatgggga ctatcgggta 146461 aattcaatat aatagacgcc tgttggcatt ccagccttcc ttctcctttc agggcctatc 146521 cgaaagagaa tccagtactt cttggtcgtg aatatctgaa ctggttgttt gctgttcaag 146581 aattcttgtt taggcagttc ataccatcca tacatagtgt tttgatctaa gatttcaatt 146641 cttccgtgtt tcagcagtaa catattcttc catggagcta aggtccaaaa tatggaagaa 146701 acaagcgttt ccacgactct accacccagt caattctgtt ccacttaatc cctctttcat 146761 ggccacatat ctttccggct aaggaatggg aaatctttct cctgttacat gaatccaatt 146821 ttcatttcat ccgggaaaag ccatcttttt ctcaacaatg tctttgtcat ttgatccaat 146881 agcgttccgt tagataggaa cagatttgat aaatactgat aactctcgga tagagtatta 146941 gaacggaaag atccattaga taatgaactg ttggttctaa gccatctctg acgattaatc 147001 aacaattcga agtgcttttc ttgcgtattc ttgataaacc agcgtttata tatagatgta 147061 ggagggtctg tttgggaagt aagaagcccc tttgacatct cttcatctgc aaataattct 147121 cgatgtgaaa acacagagcc agggggctga tctttgaata ggaaaaagag tggatctgca 147181 gggtcccaaa tgaattggct tattcgaaaa aggccttgtt ctttggaaga tctatctcgt 147241 gtctggtact gcatggttcc actctgcaag aactccgaat cattctcttg aagctcatcc 147301 tcttcatcat aaatgatccg cttgccccga aatgacctgg accaataggg aaatcccaat 147361 tcattgggcc tttcgataca atcaaataga aagccccaag ggcgccatat tctaggagcc 147421 caaactatgt gattgaataa atcctcctgc gggtcaaggg ctccttctcc ctccccttct 147481 tcaaactccg attcatattt ttcatagaga aatctctgat caaggataga acaagagccg 147541 ttttgcatca tatctaaggg attcctcggt tcgggccgaa gaagcaatgt cactcgatca 147601 ttatcaaact gactgcaatc tttttctgtc cgtgaagatc ccaccagagc gccttctact 147661 tctaataggc catgaactag atcagaatca ttctcaacga gtccataaga agtgatccca 147721 tttttttcat cgggtccgga taaagaccaa agatcttgag cgaccgatcc ggcagaacaa 147781 ctcaaaagat aaagaagtat cgttaatctc ttcatgctcg ttccaagctc gaagtaccat 147841 ttgtacaaat aagaatcccc ttcgttacat gatttcttct tcatatagat agatatagga 147901 tctatggggc aattacttag aagtacattt tgtgctacag cccttcctat ctgatagaaa 147961 aggatcccat gatcctgaac cgatcttacc tgggatcgca aatcccaagt ttgtctatga 148021 agagcggatc taattgtatt agtgtctata attgatttct tctgtgtaat actaatcgat 148081 aggacctcat tggtaagtgc tacaagatct cgtgcattgg aacccatggt tatggacccg 148141 aatccgttag tatggaacat tttcttttcc aagtgaaatc ccctagtata tgaaagagtg 148201 aaaaagtgct ttcgttgttg tggaagaaga agccttcgta tcttaatgca cgtatttaat 148261 ttattcggag ctattagagc gggatccact ttttggggaa tatgagtcga agcaataaca 148321 agaatatttc tagtagaaca tctttcacaa tccctggaga gatggttcac taatagaccg 148381 agggctaagt cattcgactc attcacatcc agatcatgaa tgtttggaat ccatattatg 148441 caaggagaca ttgcttttgc taattcgaat tgaagggtga tataaaatcg gtctatttcc 148501 ggcatcatat ccatagttag cccattcatc ctagttagca gtttcagctc cgtatcaagg 148561 tcacgatcga tatcgtcact agcatcaaga ttgtcactat catcaatatc gtcactatca 148621 tcaatatcga tctcatcaag aagaaaacct ttaggcttgt tatccaggaa cttgttcaga 148681 aataccgtaa tgaaaggaac ataggagttt gtcgctaggt atttgaccaa ataggatcgt 148741 ccagttccta tagaacctat cactaaaata cccctagagg gggataaggc taagcggagc 148801 gaaaagggtt ttccatgaga tgggaaatga aaactatttt ccccacacga agtttgtgaa 148861 taagtgattg tctgataatg agcaaggaat atccgtcttt ctgctaaaca ggatggattg 148921 aactcataat tcattagatg ctttttatga atgtcaacta agtatcgtaa gtaaattgct 148981 cccggttgtt caatcatttg ataaccagag tcattctttg ataaacgatc actatgagtc 149041 agactcaata gaatttgatc aatcctattt tctgtcgtta aggtggagaa ctgaaccaag 149101 aattctcttt cttcatcatc aatcgaatca ctgttcgcga cccaggattc tattttatca 149161 tcaatccaat ccccgttcac gttttttctt tttcttatca atgaatagat ctctttactt 149221 gtatgactta gatgtctcgt atttctcgaa aaagtgattc gattgatggg atttggtatg 149281 agatcgatga tctcgatgag attgatattc caatctttct tcttagaacg tattgatttg 149341 accccataag cgggaccaag catgttgccg ccagaagcag aaccccgtat ttcttctaga 149401 gaatctccta attgttccag agcaactaga aagagattct ttaaccagaa agaattcggt 149461 tcagatgtag gatacctatc cagaagtttt cgcaactcaa tcatagatga tggaatcatc 149521 aaagatttga ccttttcgaa ctctgtctgt aactcactag aggcccggga aacaaagaga 149581 agatgtgtac gaacgagata tccagcaaca agaagaagga aaaggattga atagaggaac 149641 tcccgagcat ttggcgatct cagatgtgtc gatatcaatg gtgactcatt atttcgatga 149701 atcatttctt cggacagaag aagattatgt aaacacttac tcgagatctc acttatcaga 149761 ttccattgtg gaagacacaa ttttttctga agaattcgcc atgatatacc tgatccatgc 149821 ataatatcat gaaaaatggg tacaaatttt tgactgctac ttagtattgg caataggtct 149881 gaaaaagtat ctaaaaatat caaatttaga tatttgtacc ctgtcgaagt aaggaaccat 149941 ggcatatatg tttggaatag attccatttt gagagagttg aaaaagcact atctcgttga 150001 aaggttctat acatctgccc tttctcaacg catttcttta gacaaagact ccgttttttc 150061 ctcttttcgg atgataaatc tttctcagaa catggagtgt gaatcaaacc catgtttgaa 150121 ttgaaattga gatactgatg caagttcttc ccttctgaat cagatagatt catatctgaa 150181 agaggttgac aataagttct ttcaaaattg actatttgcc cctctgttag aggtgttcca 150241 gaaatgtctg cgatcgagta aatagctcta cgaacgaatg gatcggatcg acttggaaaa 150301 tggaaagatt tgtacaagtt atacgtttcg tcaccacttt gtggaaaatc gttaggtatg 150361 aatatgttag atacctgtga ctcgattggt gaaatagtat ctctccccca aaaagcatgt 150421 ttttttttac cgacgcacaa agaaaatatt ttgttgcgaa tgaacaagat attgaggaat 150481 tgtccatacg taaaatcaga attattgata cgggcctttt ccacagaaaa ggggaatctt 150541 gtgttccaat agaagcagaa gtgatgtgga ttattcaaga atcgaagtcg atttgcttta 150601 taaaaagaag atatcaatga acttctatga aatggtttca cgggattcag ccaattgtct 150661 tgatcgtgga atatcattga gaaataggaa tccgggttat caaaggattt cctgcgatta 150721 tttctagtat ggaatgagtc aatcatccac tttggtatct tattgaacaa aaatggtgat 150781 attgttcctc cattgatcaa gaatttcgat ttttgggaag tatcatgatc gtccaataag 150841 aagggtttcc attttttcaa atgaacaatt tgaagaccta ttgattctaa caactgattg 150901 cagagttgat cattcggacc tttcaattca tagatgtaga tctcggacct atgaatgggg 150961 atatttccga aactcacaca gaaaaaagga agtgagttag acaaaaagaa aagcaacttg 151021 gacaaaaaaa gaagtgactt ggacaaaaag aaacgaagtg gcttagacaa atcttttttg 151081 tcgataacct cagaccaatc aatcgaatat tgattaatac gtaatcgatc gaacactact 151141 tgaaaacggc tcttctgctc cgaaacggac tgttccaaat gttcctggaa attcttgctc 151201 ccattggacc atttgtatct atatgcatca ggatcccgat tcatggatct ctcggttcga 151261 gaaatcaaaa taagaggctc gaaccatttc ttctgactct ttttcaaatt cgataaatat 151321 tggttgatcg tatatttcat tatagttcta tgattcagag tatcctttcc tatttgatcc 151381 ctttgaattc catattcgaa gttgcgatcg gatctattca ttaaaaagaa tcgattcaat 151441 acatttctta tgtacccata ggtactatat tggatttgaa tcagatttcg gatcaatcta 151501 tattgagtga ctgcctccat tatgttgttg ctagcaaata ccactatttt tggttttgga 151561 tcttccaaat cattcccgca ggagatccgg acccattttt ttctgatcct tcgagaaaaa 151621 gattcattct cttcataaaa aataggaggt agaaccaata aagatttctt tttcgattca 151681 tccctggcct cattcaagaa ttgtttttga tccaatccgc aggaatcaat agaaaaggca 151741 aatcccttat gatacaccag atccggctcg gttattgata gagtgaatag atctgccatt 151801 tcttgaaatc tctcttctga ttcaaaatcg tagtgtaacg tgtatcctcc cctgttccgg 151861 tcatggaata gatgaaataa atcaaaaaat ggatttttgt tcaagaatga aatcttattg 151921 gaactgtcca tatccggttc atcttcggaa ccatatcaca tcccggatct gatgaaatag 151981 gatgaattga gacggtattt tgtaaatacg taattatctt gaatatatta accatttctt 152041 tattttccga tcgcttggaa gggacaaaag aaagatcttg ttgtttcttc aacaatttct 152101 gatccctagt ggacctctca gtaggattcg aacccagatg aagttctgac catctatcag 152161 agaaaaaaga acgaacggat cttgtaggat tcccaagaaa ttcttcgatt tcttccggaa 152221 acagatgatt aatcatctgc ttctcacgtt ccgtgaatag ccgggacatt gaggaatatc 152281 cagaaaggca tttcgggaat cggcctgatt ctatctcttt tcgttccgtt tgaagaaagg 152341 aaggatccca aagaatcgat ctttcttttc gttgttgaat ctctctttga ttaatcaatg 152401 tgtgatattc cgaatcctca ttactaatgg aatccaaatg atctctggat tgatcagaag 152461 atcctttcag ttggctagaa tccgttactt gaacgaaact agatcttgtg gaatcatatt 152521 gaatatttga cgatacattc tgtaccttgc taaaaaaccg atccttgttt accaaccaca 152581 cattgtctaa ccaaatccaa ttctctctcg atacgttcct caaaaaatcc gattcgggcg 152641 gattcttccc ccaactaacg aagagatctt ggcggaattg ccacatatga aattgagcac 152701 agttttgcaa agaaatagcc cacttgtttc tcgagaagag atgggaaaca tgctcaatat 152761 catttgattg aatagttgac ccagcccctt gttgtttgaa gaaaccctcc acttcaattg 152821 gtattttttc acgaaaagca gacatgagat aagaaatcca gtgtttcact aagatttcga 152881 atagcggtcc cgaattcaag ttgattctat ttcgactctt cctcagagaa agacgatcaa 152941 acaattccca atcatggtcc ttgcggatcg gatcatccat ataatataca aaaagaaact 153001 ccagatattt gagatctttc tctttgaata agatctcaat tccagcgacg gtttcattag 153061 atatcttaca actagaatcc ctcttttttc cgatccagtt cctccaccaa cgcgaacccc 153121 agttagattc aggcatgcta cactttttag ttattgggag aacccaagta ctctctttcg 153181 gattcaggaa acaactctca gagatctttt ttcctttggg aagatacagg agcgaaacaa 153241 tcaacctatt gatattggaa gacccaacgg attcttccaa tgtatcattt ctgggtccaa 153301 tggaattcat aggtatagga agaagcccta tcaaatagag attttttctt tcgaccatat 153361 ttcgattgtt aatacgatat ataaggaccg ctactacaaa gagtattaca cccttgatcg 153421 tgaaatatcg attgcttgtt gaaccctgtg aattgcgtga aagtaggata ctccaaattc 153481 gggggtcaaa gagttttaga aaacgttctt ggtggaaaaa aatgtgaatg aaggatcccg 153541 ctgaattgaa ttgggtccat gaatctaaga aatggtgaga attcttgatc tctctcaata 153601 tctctctcaa ttcgaaaatc caggatttga attgatgtcc tctcattgat tcctcctaaa 153661 ttgcattgat ttatcctaac taaattgcat tgatttatcc taaagatttc atttcaattg 153721 gaatttggtt attcaccatg tacgaggatc cccgctaagc atccatggct gaatggttaa 153781 agcgcccaac tcataattgg cgaattcgta ggttcaattc ctactggatg cacgccaatg 153841 ggaccctcca ataagtctat tggaattggc tctgtatcaa tggaatctca tcatccatac 153901 ataacgaatt ggtgtggtat attcatatca taatatatga acagtaagaa ctagcattct 153961 tattgagact ataactcata gggaagaaaa tcgatttatg gatggaatca aatatgcagt 154021 atttacagac aaaagtattc ggttattggg gaaaaatcaa tatacttcta atgtcgaatc 154081 aggatcaact aggacagaaa taaagcattg ggtcgaactc ttctttggtg tcaaggtaat 154141 agctatgaat agtcatcgac ttccgggaaa gagtagaaga atgggaccta ttatgggaca 154201 tacaatgcat tacagacgta tgatcattac gcttcaaccg ggttattcta ttccacctct 154261 tagaaagaaa agaacttaaa aaaaaatact taatagcatg gcgatacatt tatacaaaac 154321 ttctaccccg agcacacgca atggaaccgt agacagtcaa gtgaaatcca atccacgaaa 154381 taatttgatc tatggacagc atcattgtgg taaaggtcgt aatgccagag gaatcattac 154441 cgcaaggcat agagggggag gtcataagcg tctataccgt aaaatcgatt ttcgacggaa 154501 tgaaaaagac atatatggta gaatcgtaac catagaatac gaccctaatc gaaatgcata 154561 catttgtctc atacactatg gggatggtga gaagagatat attttacatc ccagaggggc 154621 tataattgga gataccattg tttctggtac agaagttcct ataaaaatgg gaaatgccct 154681 acctttgagt gcggtttgaa ctattgattt acgtaattgg aaataaccaa ttaggtttac 154741 gacgaaacct agaaatcgat cactgatcca atttgagtac ctctgcagga tagacctcaa 154801 cagaaaactg aagagtaacg gcagcaagtg attgagttca gtagttcctc atataaaatt 154861 attgactcta gagatatagt aatatggaga agacaaaatt gtttcaagca ccgacagaac 154921 cggaagcgcc ccttctttca aagagaggag gacgggttat tcacatttca tttgatggtc 154981 agaggcgaat tgaaagttaa gcagtgggaa ttctaaagat tccccggggg aaaaatagag 155041 atgtctccta cgttacccat aatatgtgga agtatcgacg taatttcata gagtcattcg 155101 gtctgaatgc tacatgaaga acataagcca gatgacggaa cgggaagacc caggatgtag 155161 aagatcataa catgagtgat tcggcagatt tggattcata tatatatcca cccatgtggt 155221 acttcattct acgatatata taagatccat ctgtatagat atcatcatct acatccagaa 155281 agaagtatgc tttggaagaa gcttgtacag tttgggaagg ggttttgatt gatcaaaaga 155341 agaatctact tcaaccgata tgcccttagg cacggccata cataacatag aaatcacact 155401 tggaaagggt ggacaattag ctagagcagc gggtgctgta gcgaaactga ttgcaaaaga 155461 ggggaaatcg gccacattaa aattaccttc tggggaggtc cgtttgatat ccaaaaactg 155521 ctcagcaaca gtcggacaag tggggaatgt tggggtgaac cagaaaagtt tgggtagagc 155581 cggatctaag cgttggctag gtaagcgtcc tgtagtaaga ggagtagtta tgaaccctgt 155641 agaccatccc catgggggtg gtgaagggag agccccaatt ggtagaaaaa aacccacaac 155701 cccttggggt tatcctgcac ttggaagaag aagtagaaaa aggaataaat atagtgataa 155761 tttgattctt cgtcgccgta gtaaatagga gagaaaatcg aattaaattc ttcgttttta 155821 caaaaaaaaa aaaaatagga gtaa // LOCUS TIPNPSS 6425 bp ds-DNA SYN 09-AUG-1990 DEFINITION A.tumefaciens T-DNA vector containing octopine T-DNA borders and markers: neomycin-phosphotransferase - octopine synthase (3' end) and Sp/Sm adenyltransferase. complete cds. ACCESSION M35007 KEYWORDS neomycin phosphotransferase; streptomycin/spectinomycin adenyltransferase. SOURCE N.tabacum T-DNA inserts in A.tumefaciens DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 6425) AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and Depicker,A. TITLE Cloning and sequence analysis of truncated T-DNA inserts from Nicotiana tabacum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.R.Gheysen, 01-JUN-1990. FEATURES from to/span description pept 2782 1985 (c) neomycin phosphotransferase (gtg start codon) pept.ps 5009 4042 (c) streptomycin/spectinomycin adenyltransferase (gtg start codon) mRNA / 1811 1105 (c) octopine synthase (3' end) recomb 24 25 T-DNA end/plant DNA start recomb 40 41 plant DNA end/T-DNA start recomb 1094 1095 plant DNA end/T-DNA start recomb 2786 2787 T-DNA end/plant DNA start recomb 3044 3045 T-DNA end/plant DNA start recomb 3354 3355 plant DNA end/T-DNA start recomb 5585 5586 T-DNA end/plant DNA start recomb 6389 6390 T-DNA end/plant DNA start signal 1650 1645 (c) poly-A signal signal 1686 1681 (c) poly-A signal site 1634 1633 (c) major poly-A site site 1 24 left T-DNA border site 372 395 24 bp border-like sequence site 1569 1592 24 bp border-like sequence site 1669 1692 24 bp border-like sequence site 1779 1756 (c) 24 bp border-like sequence site 2128 2105 (c) 24 bp border-like sequence site 2449 2472 24 bp border-like sequence site 2485 2462 (c) 24 bp border-like sequence site 3660 3683 24 bp border-like sequence site 3875 3898 24 bp border-like sequence site 4359 4336 (c) 24 bp border-like sequence site 5868 5891 24 bp border-like sequence BASE COUNT 1509 a 1754 c 1710 g 1452 t ORIGIN 1 cggcaggata tattcaattg taaatggctt catgtccggg aaatctacat ggatcagcaa 61 tgagtatgat ggtcaatatg gagaaaaaga aagagtaatt accaattttt tttcaattca 121 aaaatgtaga tgtccgcagc gttattataa aatgaaagta cattttgata aaacgacaaa 181 ttacgatccg tcgtatttat aggcgaaagc aataaacaaa ttattctaat tcggaaatct 241 ttatttcgac gtgtctacat tcacgtccaa atgggggctt agatgagaaa cttcacgatc 301 gatgccttga tttcgccatt cccagatacc catttcatct tcagattggt ctgagattat 361 gcgaaaatat acactcatat acataaatac tgacagtttg agctaccaat tcagtgtagc 421 ccattacctc acataattca ctcaaatgct aggcagtctg tcaactcggc gtcaatttgt 481 cggccactat acgatagttg cgcaaatttt caaagtcctg gcctaacatc acacctctgt 541 cggcggcggg tcccatttgt gataaatcca ccatcacaat agatagtcta atggacgaaa 601 aaggcgaata tttcgatgct gagattcgac gcaattaatt cgagaaaaat cccgtgattg 661 atgctgttga gttaccaata atatgggcag cgaaggccat ttaattataa gatcctgcaa 721 gcctcgtcgt cctggccgga ccacgctatc tgtgcaaggt ccccggcccc ggacgcgcgc 781 tccatgagca gagcgcccgc cgccgaggcg aagagtcggg cggcgccctg cccgtcccac 841 caggtcaaca ggcggtaacc ggcctcttca tcgggaatgc gcgcgacctt cagcatcgcc 901 ggcatgtccc cctggcggac gggaagtatc cagctcgacc aaagcggcca tcgtgcctcc 961 ccactcctgc agttcggggg catggatgcg cggatagccg ctgctggttt cctggatgcc 1021 gacggatttg cactgccggt agaactccgc gaggtcgtcc agcctcaggc agcagctgaa 1081 ccaactcgcg aggggatcga gcccctgctg agcctcgaca tgttgtcgca aaattcgccc 1141 tggacccgcc caacgatttg tcgtcactgt caaggtttga cctgcacttc atttggggcc 1201 cacatacacc aaaaaaatgc tgcataattc tcggggcagc aagtcggtta cccggccgcc 1261 gtgctggacc gggttgaatg gtgcccgtaa ctttcggtag agcggacggc caatactcaa 1321 cttcaaggaa tctcacccat gcgcgccggc ggggaaccgg agttcccttc agtgaacgtt 1381 attagttcgc cgctcggtgt gtcgtagata ctagcccctg gggccttttg aaatttgaat 1441 aagatttatg taatcagtct tttaggtttg accggttctg ccgctttttt taaaattgga 1501 tttgtaataa taaaacgcaa ttgtttgtta ttgtggcgct ctatcataga tgtcgctata 1561 aacctattca gcacaatata ttgttttcat tttaatattg tacatataag tagtagggta 1621 caatcagtaa attgaacgga gaatattatt cataaaaata cgatagtaac gggtgatata 1681 ttcattagaa tgaaccgaaa ccggcggtaa ggatctgagc tacacatgct caggtttttt 1741 acaacgtgca caacagaatt gaaagcaaat atcatgcgat cataggcgtc tcgcatatct 1801 cattaaagca gggggtgggc gaagaactcc agcatgagat ccccgcgctg gaggatcatc 1861 cagccggcgt cccggaaaac gattccgaag cccaaccttt catagaaggc ggcggtggaa 1921 tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc 1981 ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg 2041 cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct tcagcaatat 2101 cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg ccacagtcga 2161 tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca tcgccatggg 2221 tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac agttcggctg 2281 gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg gcttccatcc 2341 gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag gtagccggat 2401 caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg gcaggagcaa 2461 ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag tcccttcccg 2521 cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc agccacgata 2581 gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc ttgacaaaaa 2641 gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag ccgattgtct 2701 gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa cctgcgtgca 2761 atccatcttg ttcaatccac atgatcagat ctctaggcgc gtgggtgcgg acgtagtcag 2821 cgccattgcc gatcgcgtga agttccgccg caaggccgct ggacccagat cctttacagg 2881 aaggccaacg gtggcgccca agaaggattt ccgcgacacc gagaccaata gcggaagccc 2941 caacgccgac ttcagctttt gaaggttcga cagcacgtgc agcgatgttt ccggtgcggg 3001 gctcaagaaa aatcccatcc ccggatcgag gatgagccgg tcggcagcga ccccgctccg 3061 tcgcaaggcg gaaacccgcg cctcgaagaa ccgcacaatc tcgtcgagcg cgtcttcggg 3121 tcgaaggtga ccggtgcggg tggcgatgcc atcccctgcg ctgagtgcat aaccaccagc 3181 ctgcagtccg cctcagcaat atcgggatag agcgcagggt caggaaatcc ttggatatcg 3241 ttcaggtagc ccacgccgcg cttgagcgct agcgcgggtt tccggttgga agctgtcgat 3301 tgaaacacgg tgcatctgat cggacagggc gtctaagagc ggcgcaatac gtctgatctc 3361 atcggccggc gatacaggcc tcgcgtccgg atggctggcg gccggtccga catccacgac 3421 gtctgatccg actcgcagca tttcgaccgc cgcggtgaca gcgttggtgg ggtctagcag 3481 tacgtcaatc gaagaaggag tcctcggtga gattcagaat gccgaacacc gtcaccatgg 3541 cgtcggcctc cgcagcgact tccacgatgg ggatcgggcg agcaaaaagg cagcaattat 3601 gagccccata cctacaaagc cccacgcatc aagcttttga ccctgaagca actaggcaat 3661 ggctgtaatt atgacgacgc cgagtcccga accagactgc ataagcaaca accgacagaa 3721 tggatttcga aaccagagaa agaaaataaa tgcgatgcca taaccgatta tgaacaacgg 3781 cggaaggggc aagcttagta aatgcctcgc tagattttaa tgcggatgtt gcgattactt 3841 cgccaactat tgcgataaca agaaaaagcc agcctttcat gatatatctc ccaatttgtg 3901 tagggcttat tatgcacgct taaaaataat aaaagcagac ttgacctgat agtttggctg 3961 tgagcaatta tgtgcttagt gcatctaatc gcttgagtta acgccggcga agcggcgtcg 4021 gcttgaacga attgttagac attatttgcc gactaccttg gtgatctcgc ctttcacgta 4081 gtggacaaat tcttccaact gatctgcgcg cgaggccaag cgatcttctt cttgtccaag 4141 ataagcctgt ctagcttcaa gtatgacggg ctgatactgg gccggcaggc gctccattgc 4201 ccagtcggca gcgacatcct tcggcgcgat tttgccggtt actgcgctgt accaaatgcg 4261 ggacaacgta agcactacat ttcgctcatc gccagcccag tcgggcggcg agttccatag 4321 ocgttaaggt ttcatttagc gcctcaaata gatcctgttc aggaaccgga tcaaagagtt 4381 cctccgccgc tggacctacc aaggcaacgc tatgttctct tgcttttgtc agcaagatag 4441 ccagatcaat gtcgatcgtg gctggctcga agatacctgc aagaatgtca ttgcgctgcc 4501 attctccaaa ttgcagttcg cgcttagctg gataacgcca cggaatgatg tcgtcgtgca 4561 caacaatggt gacttctaca gcgcggagaa tctcgctctc tccaggggaa gccgaagttt 4621 ccaaaaggtc gttgatcaaa gctcgccgcg ttgtttcatc aagccttacg gtcaccgtaa 4681 ccagcaaatc aatatcactg tgtggcttca ggccgccatc cactgcggag ccgtacaaat 4741 gtacggccag caacgtcggt tcgagatggc gctcgatgac gccaactacc tctgatagtt 4801 gagtcgatac ttcggcgatc accgcttccc tcatgatgtt taactttgtt ttagggcgac 4861 tgccctgctg cgtaacatcg ttgctgctcc ataacatcaa acatcgaccc acggcgtaac 4921 gcgcttgctg cttggatgcc cgaggcatag actgtacccc aaaaaaacag tcataacaag 4981 ccatgaaaac cgccactgcg ccgttaccac cgctgcgttc ggtcaaggtt ctggaccagt 5041 tgcgtgaggc catacgctac ttgcattaca gcttacgaac cgaacaggct tatgtccact 5101 gggttcgtgc cttcatccgt ttccacggtg tgcgtcaccc ggcaaccttg ggcagcagcg 5161 aagtcgaggc atttctgtcc tggctggcga acgagcgcaa ggtttcggtc tccacgcatc 5221 gtcaggcatt ggcggccttg ctgttcttct acggcaagtg ctgtgcacgg atctgccctg 5281 gcttcaggag atcggaagac ctcggccgtc cgggcgcttg ccggtggtgc tgaccccgga 5341 tgaagtggtt cgcatcctcg gttttctgga aggcgagcat cgtttgttcg cccagcttct 5401 gtatggaacg ggcatgcgga tcagtgaggg tttgcaactg cgggtcaagg actggatttc 5461 gatcacggca cgatcatcgt gcgggagggc aagggctcca aggatcgggc cttgatgtta 5521 cccgagagct tggcacccag cctgcgcgag cagctgtctc gtgcacgggc atggtggctg 5581 aaggactagg ccgagggccg cagcggcgtt gcgcttcccg acgcccttga gcggaagtat 5641 ccgcgcgccg ggcattcctg gccgtggttc tgggtttttg cgcagcacac gcattcgacc 5701 gatccacgga gcggtgtcgt gcgtcgccat cacatgtatg accagacctt tcagcgcgcc 5761 ttcaaacgtg ccgtagaaca agcaggcatc acgaagcccg ccacaccgca caccctccgc 5821 cactcgttcg cgacggcctt gctccgcagc ggttacgaca ttcgaaccgt gcaggatctg 5881 ctcggccatt ccgacgtctc tacgacgatg atttacacgc atgtgctgaa agttggcggt 5941 gccggagtgc gctcaccgct tgatgcctgc cgcccctcac tgtgagaggt agggcagcgc 6001 aagtcaatcc tagcggattc actacccctg cgcgaaggcc atcggtgccg catcgaacgg 6061 ccggttgcgg aaagtcctcc ctgcgtccgc tgatggccgg cagcagcccg tcgttgaagg 6121 atccctgaaa gcgacgttgg atgttaacat ctacaaattg ccttttctta cgaccatgta 6181 cgtaagcgct tacgtttttg gtggaccctt gaggaaactg gtagctgttg tgggcctgtg 6241 gtctcaagat ggatcattaa tttccacctt cacctacgat ggggggcatc gcaccggtga 6301 gtaatattgt acggctaaga gcgaatttgg cctgtagacc tcaattgcga gctttctaat 6361 ttcaaactat tcgggcctaa cttttggtgt gatgatgctg actggcagga tatataccgt 6421 tgtaat // LOCUS TOBPRMMG 200 bp ds-DNA PLN 09-AUG-1990 DEFINITION N.tabacum promoter activating a promoterless nptII marker gene. ACCESSION M34757 KEYWORDS . SOURCE N.tabacum (strain SR1) DNA. ORGANISM Nicotiana tabacum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 200) AUTHORS Gheysen,G.D.R., Herman,L., Breyne,P., Gielen,J., Van Montagu,M. and Depicker,A. TITLE Cloning and sequence analysis of truncated T-DNA inserts from Nicotiana tabacum JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.R.Gheysen, 01-JUN-1990. FEATURES from to/span description pept 198 > 200 ORF mRNA 130 > 200 mRNA (5' end +/- 2 bp) recomb 193 194 T-DNA end/plant DNA start signal 25 30 CAAT box signal 47 55 CAAT box signal 101 107 TATA box BASE COUNT 64 a 53 c 32 g 51 t ORIGIN 1 caagcctcgc tagtcaaaag tgtaccaaac aacgctttac agcaagaacg gaaatgcgcg 61 tgacgctcgc ggtgacgcca tttcgccttt tcagaaatgg ataaatagcc ttgcttccta 121 ttatatcttc ccaaattacc aatacattac actagcatct gaatttcata accaatctcg 181 atacaccaaa tcggatcatg //
GenBank-Updates@genbank.bio.net (08/10/90)
LOCUS HUMMPRCI 9081 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human cation-independent mannose 6-phosphate receptor mRNA, complete cds. ACCESSION J03528 KEYWORDS mannose 6-phosphate receptor. SOURCE Human placenta, cDNA to mRNA (library of J.E.Sadler), clones PP[4,112,188), RB3 and RC5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 9081) AUTHORS Oshima,A., Nolan,C.M., Kyle,J.W., Grubb,J.H. and Sly,W.S. TITLE The human cation-independent mannose 6-phophate receptor: Cloning and sequence of the full-length cDNA and expression of functional receptor in COS cells JOURNAL J. Biol. Chem. 263, 2553-2562 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.Miller, 04-JAN-1988. FEATURES from to/span description pept 148 7623 mannose 6-phosphate receptor precursor /hgml_locus_uid="LY0166H" /nomgen="M6PR" /map="12" sigp 148 267 mannose 6-phosphate receptor signal peptide matp 268 7620 mannose 6-phosphate receptor mRNA < 1 9081 MPR-CI mRNA BASE COUNT 2243 a 2233 c 2476 g 2129 t ORIGIN Chromosome 12; 475 bp upstream of EcoRI site. 1 gagcccagtc gagccgcgct cacctcgggc tcccgctccg tctccacctc cgcctttgcc 61 ctggcggcgc gaccccgtcc cgggcgcggc cccagcagtc gcgcgccgtt agcctcgcgc 121 ccgccgcgca gctccgggcc cggcgcgatg ggggccgccg ccggccggag cccccacctg 181 gggcccgcgc ccgcccgccg cccgcagcgc tctctgctcc tgctgcagct gctgctgctc 241 gtcgctgccc cggggtccac gcaggcccag gccgccccgt tccccgagct gtgcagttat 301 acatgggaag ctgttgatac caaaaataat gtactttata aaatcaacat ctgtggaagt 361 gtggatattg tccagtgcgg gccatcaagt gctgtttgta tgcacgactt gaagacacgc 421 acttatcatt cagtgggtga ctctgttttg agaagtgcaa ccagatctct cctggaattc 481 aacacaacag tgagctgtga ccagcaaggc acaaatcaca gagtccagag cagcattgcc 541 ttcctgtgtg ggaaaaccct gggaactcct gaatttgtaa ctgcaacaga atgtgtgcac 601 tactttgagt ggaggaccac tgcagcctgc aagaaagaca tatttaaagc aaataaggag 661 gtgccatgct atgtgtttga tgaagagttg aggaagcatg atctcaatcc tctgatcaag 721 cttagtggtg cctacttggt ggatgactcc gatccggaca cttctctatt catcaatgtt 781 tgtagagaca tagacacact acgagaccca ggttcacagc tgcgggcctg tccccccggc 841 actgccgcct gcctggtaag aggacaccag gcgtttgatg ttggccagcc ccgggacgga 901 ctgaaggtgg tgcgcaagga caggcttgtc ctgagttacg tgagggaaga ggcaggaaag 961 ctagactttt gtgatggtca cagccctgcg gtgactatta catttgtttg cccgtcggag 1021 cggagagagg gcaccattcc caaactcaca gctaaatcca actgccgcta tgaaattgag 1081 tggattactg agtatgcctg ccacagagat tacctggaaa gtaaaacttg ttctctgagc 1141 ggcgagcagc aggatgtctc catagacctc acaccacttg cccagagcgg aggttcatcc 1201 tatatttcag atggaaaaga atatttgttt tatttgaatg tctgtggaga aactgaaata 1261 cagttctgta ataaaaaaca agctgcagtt tgccaagtga aaaagagcga tacctctcaa 1321 gtcaaagcag caggaagata ccacaatcag accctccgat attcggatgg agacctcacc 1381 ttgatatatt ttggaggtga tgaatgcagc tcagggtttc agcggatgag cgtcataaac 1441 tttgagtgca ataaaaccgc aggtaacgat gggaaaggaa ctcctgtatt cacaggggag 1501 gttgactgca cctacttctt cacatgggac acggaatacg cctgtgttaa ggagaaggaa 1561 gacctcctct gcggtgccac cgacgggaag aagcgctatg acctgtccgc gctggtccgc 1621 catgcagaac cagagcagaa ttgggaagct gtggatggca gtcagacgga aacaaagaag 1681 aagcattttt tcattaatat ttgtcacaga gtgctgcagg aaggcaaggc acgagggtgt 1741 cccgaggacg cggcagtgtg tgcagtggat aaaaatggaa gtaaaaatct gggaaaattt 1801 atttcctctc ccatgaaaga gaaaggaaac attcaactct cttattcaga tggtgatgat 1861 tgtggtcatg gcaagaaaat taaaactaat atcacacttg tatgcaagcc aggtgatctg 1921 gaaagtgcac cagtgttgag aacttctggg gaaggcggtt gcttttatga gtttgagtgg 1981 cacacagctg cggcctgtgt gctgtctaag acagaagggg agaactgcac ggtctttgac 2041 tcccaggcag ggttttcttt tgacttatca cctctcacaa agaaaaatgg tgcctataaa 2101 gttgagacaa agaagtatga cttttatata aatgtgtgtg gcccggtgtc tgtgagcccc 2161 tgtcagccag actcaggagc ctgccaggtg gcaaaaagtg atgagaagac ttggaacttg 2221 ggtctgagta atgcgaagct ttcatattat gatgggatga tccaactgaa ctacagaggc 2281 ggcacgccct ataacaatga aagacacaca ccgagagcta cgctcatcac ctttctctgt 2341 gatcgagacg cgggagtggg cttccctgaa tatcaggaag aggataactc cacctacaac 2401 ttccggtggt acaccagcta tgcctgcccg gaggagcccc tggaatgcgt agtgaccgac 2461 ccctccacgc tggagcagta cgacctctcc agtctggcaa aatctgaagg tggccttgga 2521 ggaaactggt atgccatgga caactcaggg gaacatgtca cgtggaggaa atactacatt 2581 aacgtgtgtc ggcctctgaa tccagtgccg ggctgcaacc gatatgcatc ggcttgccag 2641 atgaagtatg aaaaagatca gggctccttc actgaagtgg cttccatcag taacttggga 2701 atggcaaaga ccggcccggt ggttgaggac agcggcagcc tccttctgga atacgtgaat 2761 gggtcggcct gcaccaccag cgatggcaga cagaccacat ataccacgag gatccatctc 2821 gtctgctcca ggggcaggct gaacagccac cccatctttt ctctcaactg ggagtgtgtg 2881 gtcagtttcc tgtggaacac agaggctgcc tgtcccattc agacaacgac ggatacagac 2941 caggcttgct ctataaggga tcccaacagt ggatttgtgt ttaatcttaa tccgctaaac 3001 agttcgcaag gatataacgt ctctggcatt ggtaagattt ttatgtttaa tgtctgcggc 3061 acaatgcctg tctgtgggac catcctggga aaacctgctt ctggctgtga ggcagaaacc 3121 caaactgaag agctcaagaa ttggaagcca gcaaggccag tcggaattga gaaaagcctc 3181 cagctgtcca cagagggctt catcactctg acctacaaag ggcctctctc tgccaaaggt 3241 accgctgatg cttttatcgt ccgctttgtt tgcaatgatg atgtttactc agggcccctc 3301 aaattcctgc atcaagatat cgactctggg caagggatcc gaaacactta ctttgagttt 3361 gaaaccgcgt tggcctgtgt tccttctcca gtggactgcc aagtcaccga cctggctgga 3421 aatgagtacg acctgactgg cctaagcaca gtcaggaaac cttggacggc tgttgacacc 3481 tctgtcgatg ggagaaagag gactttctat ttgagcgttt gcaatcctct cccttacatt 3541 cctggatgcc agggcagcgc agtggggtct tgcttagtgt cagaaggcaa tagctggaat 3601 ctgggtgtgg tgcagatgag tccccaagcc gcggcgaatg gatctttgag catcatgtat 3661 gtcaacggtg acaagtgtgg gaaccagcgc ttctccacca ggatcacgtt tgagtgtgct 3721 cagatatcgg gctcaccagc atttcagctt caggatggtt gtgagtacgt gtttatctgg 3781 agaactgtgg aagcctgtcc cgttgtcaga gtggaagggg acaactgtga ggtgaaagac 3841 ccaaggcatg gcaacttgta tgacctgaag cccctgggcc tcaacgacac catcgtgagc 3901 gctggcgaat acacttatta cttccgggtc tgtgggaagc tttcctcaga cgtctgcccc 3961 acaagtgaca agtccaaggt ggtctcctca tgtcaggaaa agcgggaacc gcagggattt 4021 cacaaagtgg caggtctcct gactcagaag ctaacttatg aaaatggctt gttaaaaatg 4081 aacttcacgg ggggggacac ttgccataag gtttatcagc gctccacagc catcttcttc 4141 tactgtgacc gcggcaccca gcggccagta tttctaaagg agacttcaga ttgttcctac 4201 ttgtttgagt ggcgaacgca gtatgcctgc ccacctttcg atctgactga atgttcattc 4261 aaagatgggg ctggcaactc cttcgacctc tcgtccctgt caaggtacag tgacaactgg 4321 gaagccatca ctgggacggg ggacccggag cactacctca tcaatgtctg caagtctctg 4381 gccccgcagg ctggcactga gccgtgccct ccagaagcag ccgcgtgtct gctgggtggc 4441 tccaagcccg tgaacctcgg cagggtaagg gacggacctc agtggagaga tggcataatt 4501 gtcctgaaat acgttgatgg cgacttatgt ccagatggga ttcggaaaaa gtcaaccacc 4561 atccgattca cctgcagcga gagccaagtg aactccaggc ccatgttcat caacgccgtg 4621 gaggactgtg agtacacctt tgcctggccc acagccacag cctgtcccat gaagagcaac 4681 gagcatgatg actgccaggt caccaaccca agcacaggac acctgtttga tctgagctcc 4741 ttaagtggca gggcgggatt cacagctgct tacagcgaga aggggttggt ttacatgagc 4801 atctgtgggg agaatgaaaa ctgccctcct ggcgtggggg cctgctttgg acagaccagg 4861 attagcgtgg gcaaggccaa caagaggctg agatacgtgg accaggtcct gcagctggtg 4921 tacaaggatg ggtccccttg tccctccaaa tccggcctga gctataagag tgtgatcagt 4981 ttcgtgtgca ggcctgaggc cgggccaacc aataggccca tgctcatctc cctggacaag 5041 cagacatgca ctctcttctt ctcctggcac acgccgctgg cctgcgagca agcgaccgaa 5101 tgttccgtga ggaatggaag ctctattgtt gacttgtctc cccttattca tcgcactggt 5161 ggttatgagg cttatgatga gagtgaggat gatgcctccg ataccaaccc tgatttctac 5221 atcaatattt gtcagccact aaatcccatg cacggagtgc cctgtcctgc cggagccgct 5281 gtgtgcaaag ttcctattga tggtcccccc atagatatcg gccgggtagc aggaccacca 5341 atactcaatc caatagcaaa tgagatttac ttgaattttg aaagcagtac tccttgctta 5401 gcggacaagc atttcaacta cacctcgctc atcgcgtttc actgtaagag aggtgtgagc 5461 atgggaacgc ctaagctgtt aaggaccagc gagtgcgact ttgtgttcga atgggagact 5521 cctgtcgtct gtcctgatga agtgaggatg gatggctgta ccctgacaga tgagcagctc 5581 ctctacagct tcaacttgtc cagcctttcc acgagcacct ttaaggtgac tcgcgactcg 5641 cgcacctaca gcgttggggt gtgcaccttt gcagtcgggc cagaacaagg aggctgtaag 5701 gacggaggag tctgtctgct ctcaggcacc aagggggcat cctttggacg gctgcaatca 5761 atgaaactgg attacaggca ccaggatgaa gcggtcgttt taagttacgt gaatggtgat 5821 cgttgccctc cagaaaccga tgacggcgtc ccctgtgtct tccccttcat attcaatggg 5881 aagagctacg aggagtgcat catagagagc agggcgaagc tgtggtgtag cacaactgcg 5941 gactacgaca gagaccacga gtggggcttc tgcagacact caaacagcta ccggacatcc 6001 agcatcatat ttaagtgtga tgaagatgag gacattggga ggccacaagt cttcagtgaa 6061 gtgcgtgggt gtgatgtgac atttgagtgg aaaacaaaag ttgtctgccc tccaaagaag 6121 ttggagtgca aattcgtcca gaaacacaaa acctacgacc tgcggctgct ctcctctctc 6181 accgggtcct ggtccctggt ccacaacgga gtctcgtact atatgaatct gtgccagaaa 6241 atatataaag ggcccctggg ctgctctgaa agggccagca tttgcagaag gaccacaact 6301 ggagacgtcc aggtcctggg actcgttcac acgcagaagc tgggtgtcat aggtgacaaa 6361 gttgttgtca tgtactccaa aggttatccg tgtggtggaa ataagaccgc atcctccgtg 6421 atagaattga cctgtacaaa gacggtgggc agacctgcat tcaagaggtt tgatatcgac 6481 agctgcactt actacttcag ctgggactcc cgggctgcct gcgccgtgaa gcctcaggag 6541 gtgcagatgg tgaatgggac catcaccaac cctataaatg gcaagagctt cagcctcgga 6601 gatatttatt ttaacctgtt cagagaatct ggggacatga ggaccaatgg ggacaactac 6661 ctgtatgaga tcctactttc ctccatcaca agctccagaa acccggcgtg ctctggagcc 6721 aacatatgcc aggtgaagcc caacgatcag cacttcagtc ggaaagttgg aacctctgac 6781 aagaccaagt actaccttca agacggcgat ctcgatgtcg tgtttgcctc ttcctctaag 6841 tgcggaaagg ataagaccaa gtctgtttct tccaccatct tcttccactg tgaccctctg 6901 gtggaggacg ggatccccga gttcagtcac gagactgccg actgccagta cctcttctct 6961 tggtacacct cagccgtgtg tcctctgggg gtgggctttg acagcgagaa tcccggggac 7021 gacgggcaga tgcacaaggg gctgtcagaa cggagccagg cagtcggcgc ggtgctcagc 7081 ctgctgctgg tggcgctcac ctgctgcctg ctggccctgt tgctctacaa gaagaagagg 7141 agggaaacaa tgataagtaa gctgacctct tgctgtagga gaagttccaa cgtgtcctac 7201 aaatactcaa aggtgaataa ggaagaagag acagatgaga atgaaacaga gtggctgatg 7261 gaagagatcc agctgcctcc tccacggcag ggaaaggaag ggcaggagaa cggccatatt 7321 accaccaagt cagtgaaagc cctcagctcc ctgcatgggg atgaccagga cagtacggat 7381 gaggttctga ccatcccaga ggtgaaagtt cactcgggca ggggagctgg ggcagagagc 7441 tcccacccag tgagaaacgc acagagcaat gcccttcagg agcgtgagga cgatagggtg 7501 gggctggtca ggggtgagaa ggcgaggaaa gggaagtcca gctctgcaca gcagaagaca 7561 gtgagctcca ccaagctggt gtccttccat gacgacagcg acgaggacct cttacacatc 7621 tgactccgca gtgcctgcag gggagcacgg agccgcggga cagccaagca cctccaacca 7681 aataagactt ccactcgatg atgcttctat aattttgcct ttaacagaaa ctttcaaaag 7741 ggaagagttt ttgtgatggg ggagagggtg aaggaggtca ggccccactc cttcctgatt 7801 gtttacagtc attggaataa ggcatggctc agatcggcca cagggcggta ccttgtgccc 7861 agggttttgc cccaagtcct catttaaaag cataaggccg gacgcatctc aaaacagagg 7921 gctgcattcg aagaaaccct tgctgcttta gtcccgatag gtatttgacc ccgatatatt 7981 ttagcatttt aattctctcc ccctatttat tgactttgac aattactcag gtttgagaaa 8041 aaggaaaaaa aaacagccac cgtttcttcc tgccagcagg ggtgtgatgt accagtttgt 8101 ccatcttgag atggtgaggc tgtcagtgta tggggcagct tccggcggga tgttgaactg 8161 gtcattaatg tgtcccctga gttggagctc attctgtctc ttttctcttt tgctttctgt 8221 ttcttaaggg cacacacacg tgcgtgcgag cacacacaca tacgtgcaca gggtccccga 8281 gtgcctaggt tttggagagt ttgcctgttc tatgccttta gtcaggaatg gctgcacctt 8341 tttgcatgat atcttcaagc ctgggcgtac agagcacatt tgtcagtatt tttgccggct 8401 ggtgaattca acctgcccaa agattgattt gtgtgtttgt gtgtgtgtgt gtgtgtgtgt 8461 gtgtgtgtgt gagtggagtt gaggtgtcag agaaaatgaa ttttttccag atttggggta 8521 taggtctcat ctcttcaggt tctcatgata ccacctttac tgtgcttatt tttttaagaa 8581 aaaagtgttg atcaaccatt cgacctataa gaagccttaa tttgcacagt gtgtgactta 8641 cagaaactgc atgaaaaatc atgggccaga gcctcggccc tagcattgca cttggcctca 8701 tgctggaggg aggctgggcg ggtacagcgc ggaggaggag ggaggccagg cgggcatggc 8761 gtggaggagg tgggaggccg ggcggtcaca gcatggagga ggagggaggc gctgctggtg 8821 ttcttattct ggcggcagcg cctttcctgc catgtttagt gaatgacttt tctcgcattg 8881 tagaattgta tatagactct ggtgttctat tgctgagaag caaaccgccc tgcagcatcc 8941 ctcagcctgt accggtttgg ctggcttgtt tgatttcaac atgagtgtat tttttaaaat 9001 tgatttttct cttcattttt ttttcaatca actttactgt aatataaagt attcaacaat 9061 ttcaataaaa gataaattat t // LOCUS HUMMRA 5185 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human mannose receptor mRNA, complete cds. ACCESSION J05550 KEYWORDS mannose receptor. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5185) AUTHORS Taylor,M.E., Conary,J.T., Lennartz,M.R., Stahl,P.D. and Drickamer,K. TITLE Primary structure of the mannose receptor contains multiple motifs resembling carbohydrate-recognition domains JOURNAL J. Biol. Chem. 265, 12156-12162 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Drickamer, 18-MAY-1990. FEATURES from to/span description pept 104 4474 mannose receptor precursor sigp 104 157 mannose receptor signal peptide matp 158 4471 mannose receptor BASE COUNT 1602 a 999 c 1198 g 1386 t ORIGIN 1 gggaacttgg attaggtgga gaggcagttg gggggcctcg ttgttttgcg tcttagttcc 61 gccctcctgt ccatcaggag aaggaaagga taaaccctgg gccatgaggc tacccctgct 121 cctggttttt gcctctgtca ttccgggtgc tgttctccta ctggacacca ggcaattttt 181 aatctataat gaagatcaca agcgctgcgt ggatgcagtg agtcccagtg ccgtccaaac 241 cgcagcttgc aaccaggatg ccgaatcaca gaaattccga tgggtgtccg aatctcagat 301 tatgagtgtt gcatttaaat tatgcctggg agtgccatca aaaacagact gggttgctat 361 cactctctat gcctgtgact caaaaagtga atttcagaaa tgggagtgca aaaatgacac 421 acttttgggg atcaaaggag aagatttatt ttttaactac ggcaacagac aagaaaagaa 481 tattatgctc tacaagggat cgggtttatg gagcaggtgg aagatctatg gaaccacaga 541 caatctgtgc tccagaggtt atgaagccat gtatacgcta ctaggcaatg ccaatggagc 601 aacctgtgca ttcccgttca agtttgaaaa caagtggtac gcagattgca cgagtgctgg 661 gcggtcggat ggatggctct ggtgcggaac cactactgac tatgacacag acaagctatt 721 tggatattgt ccattgaaat ttgagggcag tgaaagctta tggaataaag acccgctgac 781 cagcgtttcc taccagataa actccaaatc cgctttaacg tggcaccaag cgaggaaaag 841 ctgccaacaa cagaacgctg agctcctgag catcacagag atacatgagc aaacatacct 901 gacaggatta accagttcct tgacctcagg actctggatt ggacttaaca gtctgagctt 961 caacagcggt tggcagtgga gtgaccgcag tcctttccga tatttgaact ggttaccagg 1021 aagtccatca gctgaacctg gaaaaagctg tgtgtcacta aatcctggaa aaaatgctaa 1081 atgggaaaat ctggaatgtg ttcagaaact gggctatatt tgcaaaaagg gcaacaccac 1141 tttaaattct tttgttattc cctcagaaag tgatgtgcct actcactgtc ctagtcagtg 1201 gtggccgtat gccggtcact gttacaagat tcacagagat gagaaaaaaa tccagaggga 1261 tgctctgacc acctgcagga aggaaggcgg tgacctcaca agtatccaca ccatcgagga 1321 attggacttt attatctccc agctaggata tgagccaaat gacgaattgt ggatcggctt 1381 aaatgacatt aagattcaaa tgtactttga gtggagtgat gggacccctg taacgtttac 1441 caaatggctt cgtggagaac caagccatga aaacaacaga caggaggatt gtgtggtgat 1501 gaaaggcaag gatgggtact gggcagatcg gggctgtgag tggcctcttg gctacatctg 1561 caagatgaaa tcacgaagcc aaggtccaga aatagtggaa gtcgaaaaag gctgcaggaa 1621 aggctggaaa aaacatcact tttactgcta tatgattgga catacgcttt caacatttgc 1681 agaagcaaac caaacctgta ataatgagaa tgcttattta acaactattg aagacagata 1741 tgaacaagcc ttcctgacta gtttcgttgg cttaaggcct gaaaaatatt tctggacagg 1801 actttcagat atacaaacca aagggacttt tcagtggacc atcgaggaag aggttcggtt 1861 cacccactgg aattcagata tgccagggcg aaagccaggg tgtgttgcca tgagaaccgg 1921 gattgcaggg ggcttatggg atgttttgaa atgtgatgaa aaggcaaaat ttgtgtgcaa 1981 gcactgggca gaaggagtaa cccacccacc gaagcccacg acgactcccg aacccaaatg 2041 tccggaggat tggggcgcca gcagtagaac aagcttgtgt ttcaagctgt atgcaaaagg 2101 aaaacatgag aagaaaacgt ggtttgaatc tcgagatttt tgtcgagctc tgggtggaga 2161 cttagctagc atcaataaca aagaggaaca gcaaacaata tggcgattaa taacagctag 2221 tggaagctac cacaaactgt tttggttggg attgacatat ggaagccctt cagaaggttt 2281 tacttggagt gatggttctc ctgtttcata tgaaaactgg gcttatggag aacctaataa 2341 ttatcaaaat gttgaatact gtggtgagct gaaaggtgac cctactatgt cttggaatga 2401 tattaattgt gaacacctta acaactggat ttgccagata caaaaaggac aaacaccaaa 2461 acctgagcca acaccagctc ctcaagacaa tccaccagtt actgaagatg ggtgggttat 2521 ttacaaagac taccagtatt atttcagcaa agagaaggaa accatggaca atgcgcgagc 2581 gttttgcaag aggaattttg gtgatcttgt ttctattcaa agtgaaagtg aaaagaagtt 2641 tctatggaaa tatgtaaaca gaaatgatgc acagtctgca tattttattg gtttattgat 2701 cagcttggat aaaaagtttg cttggatgga tggaagcaaa gtggattacg tgtcttgggc 2761 cacaggtgaa cccaattttg caaatgaaga tgaaaactgt gtgaccatgt attcaaattc 2821 agggttttgg aatgacatta actgtggcta tccaaacgcc ttcatttgcc agcgacataa 2881 cagtagtatc aatgctacca cagttatgcc taccatgccc tcggtcccat cagggtgcaa 2941 ggaaggttgg aatttctaca gcaacaagtg tttcaaaatc tttggattta tggaagaaga 3001 aagaaaaaat tggcaagagg cacgaaaagc ttgtataggc tttggaggga atctggtctc 3061 catacaaaat gaaaaagagc aagcatttct tacctatcac atgaaggact ccactttcag 3121 tgcctggact gggctgaatg atgtcaattc agaacacacg ttcctttgga cggatggacg 3181 aggagtccat tacacaaact gggggaaagg ttaccctggt ggaagaagaa gcagtctttc 3241 ttatgaagat gctgactgtg ttgttattat tggaggtgca tcaaatgaag caggaaaatg 3301 gatggatgat acctgcgaca gtaaacgagg ctacatatgc cagacacgat ccgacccttc 3361 cttgactaat cctccagcaa cgattcaaac agatggcttt gttaaatatg gcaaaagcag 3421 ctattcactc atgagacaaa aatttcaatg gcatgaagcg gagacatact gcaagcttca 3481 caattccctt atagccagca ttctggatcc ctacagtaat gcatttgcgt ggctgcagat 3541 ggaaacatct aatgaacgtg tgtggatcgc cctgaacagt aacttgactg ataatcaata 3601 cacttggact gataagtgga gggtgaggta cactaactgg gctgctgatg agcccaaatt 3661 gaaatcagca tgtgtttatc tggatcttga tggctactgg aagacagcac attgcaatga 3721 aagtttttac tttctctgta aaagatcaga tgaaatccct gctactgaac ccccacaact 3781 gcctggcaga tgcccggagt cagatcacac agcatggatt cctttccatg gtcactgtta 3841 ctatattgag tcctcatata caagaaactg gggccaagct tctctggaat gtcttcgaat 3901 gggttcctct ctggtttcca ttgaaagtgc tgcagaatcc agttttctgt catatcgggt 3961 tgagccactt aaaagtaaaa ccaatttttg gataggattg ttcagaaatg ttgaagggac 4021 gtggctgtgg ataaataaca gtccggtctc ctttgtcaac tggaacacag gagatccctc 4081 tggtgaacgg aatgattgtg tagctttaca tgcgtcttct gggttttgga gtaatattca 4141 ctgttcttcc tacaaaggat atatttgtaa aagaccaaaa attattgatg ctaaacctac 4201 tcatgaatta cttacaacaa aagctgacac aaggaagatg gacccttcta aaccgtcttc 4261 caacgtggcc ggagtagtca tcattgtgat cctcctgatt ttaacgggtg ctggccttgc 4321 cgcctatttc ttttataaga aaagacgtgt gcacctacct caagagggcg cctttgaaaa 4381 cactctgtat tttaacagtc agtcaagccc aggaactagt gatatgaaag atctcgtggg 4441 caatattgaa cagaatgaac actcggtcat ctagtacctc aatgcgattc tgagatattt 4501 gaatttcata aaattgtaac tgaaatttaa aatttttagt tcaatgtgat tgttttcttt 4561 aaaatgagta ctgaattgta ctggtctgtc cttttttcct ttgcctaatt gaagaaataa 4621 ttgcttgttt tctagcctgg caagatattt tcataaaaga gggataacaa tgctgattac 4681 taccttttaa aatattttag ataaatgcac agcaccacag caccacatct aagcattagt 4741 gatgggtagc tgatgtcagc ttcatgtgga ttttaagcac tctagaaaca atgaagcttc 4801 ttggcatatt ttaaggagct cccaaaatgt gttacctatt aaattgtaac tcagcaagta 4861 gaagaccatt tgaaaagtca ggtacaaatt tcctcaagtg gcataaaaat gtagtcagtt 4921 ttctctttta ccagttttta tttccactcc aattatttag aactttattt gtacatgtgc 4981 agaagaataa ggcagctgag aatcttgttt cccccaagag agttttacag gctgagtgtt 5041 gcaaatgtgt tctttgtcct gttatatgta tatcaggaat acaaggatgt gaaataaaac 5101 tgtaaatttg cataactgga tgtacttaga taatgtgaaa taaacattaa agacaaggtc 5161 tatttttaat aaaaaaaaaa aaaaa // LOCUS BFRACP 1788 bp ss-RNA PHG 10-AUG-1990 DEFINITION Bacteriophage fr maturation and coat protein genes, complete cds, and replicase gene, 5' end. ACCESSION M31635 KEYWORDS coat protein; maturation protein; replicase. SOURCE Bacteriophage fr RNA. ORGANISM Bacteriophage fr Viridae; ss-RNA nonenveloped viruses; Isometric ss-RNA viruses; Leviviridae. REFERENCE 1 (bases 1 to 1788) AUTHORS Berzin,V., Avots,A., Jansone,I., Gintnere,L. and Tsimanis,A. TITLE Sequence of the genes coding for the A-protein and coat protein of bacteriophage fr JOURNAL Nucleic Acids Res. 15, 6741-6741 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 121 1302 maturation protein (gtg start codon) pept 1328 1720 coat protein pept 1754 > 1788 replicase protein BASE COUNT 443 a 454 c 473 g 418 t ORIGIN 1 cccctttcgg ggtcctgctc gacttcctgt caagctaaat gccattttta atgtctttag 61 cgagacgcta ccatggctat cgctgtaggt agccgcaatt ccattgctag ggagcctcgt 121 gtgcgaaagt tcataccaac cgagagaatg agtaagagcc acgtggtctc cgtccgcgaa 181 tatgctgacg gggaactcga ggataactct ttaccgttga tttaccgatc caattggtcg 241 ccgggccagt acacaagtac gggtccgcgc acaaaggaat ggcactaccc atcgtcctac 301 tcacgtgggg cgataggaat caaggctctt gatcaaggta agtacgcacg attgggcaca 361 tcgtggggcc gcgagttcga agaacgtgcc ggttatggca tgtctatcga cgcacgtagt 421 tgttatagcc tattccccgt tagtcaaaac ttaacgtgga tagacgtacc aacgaacgta 481 gctaatcgcg ccacgactga ggtcttaggt aaggtcactc aaggtaattt taaccttggc 541 gtggcccttg ctgaagcccg gtctacggcc tcacaactgt cgacgcaaac catcgcgttg 601 attaaggcgt acaccgcggc tcgccgcggg aactggcgcc aggcactccg ctacttagcg 661 ctgaacgaga atcgaaaatt caattcgaag tccgtcgcaa gcaggtggct ggagttgcag 721 ttcgggtgga tgccgcttct cagcgatatc caaggtgcgt atgagatgct caccaaagtg 781 catcttaagg catttatgcc tatgcgtgcc gtgagtcaag tcggacaaaa cgtcagtttg 841 tctggccggc tcacttctcc ggctgcaagc tataagtcta cgtgcaacat atcacgacgc 901 attgtgatat ggttttacat aaacgatgca cgtctggctt ggctgtcctc cttagggatt 961 ttgaacccgc taggaatagt gtgggaaaag gtcccgttct ctttcctggt cgattggttg 1021 ctgccggttg ggaacatgct tgaggggctt accgccccga taggctgttc gtatcaatcg 1081 ggaacagtaa ccgacgtaat atcaggagag tcgacaataa tcgccgatga catctatggt 1141 tgggatacag tgcgacctgc aaccgctaag gtgcaaatca gtgctgtcca ccggggggta 1201 caaagcgtgt ggcccacaac gggcgtatac gttaagtcac ctttctcgat ggtccatacc 1261 ttagatgcct tggcactttt caggcaacgc ctctggaaat agaggcccta accgaaggga 1321 gagccacatg gcttcgaact ttgaagagtt cgttctcgtc gacaatggcg gaacgggaga 1381 tgtaaaagtc gctccgagca acttcgctaa cggggttgca gaatggatca gctcgaactc 1441 acgttctcag gcttacaaag tgacctgtag cgtgcgtcag agctctgcga acaatcggaa 1501 atacaccgtc aaggtcgagg tcccgaaagt ggcaactcag gtccaaggcg gcgttgagct 1561 tcctgttgcg gcgtggcgct cgtacatgaa tatggaatta actattccgg tattcgcgac 1621 gaacgacgac tgtgccttaa tcgttaaggc attgcaaggc acctttaaaa ctggtaaccc 1681 aattgcaaca gccatcgcag ccaactcggg aatctactaa gaaacccgtg ccattccaac 1741 atgaggaata cccatgtcaa aatcaacaaa gaagttcaac tctttatg // LOCUS HUMERCC1A 1098 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human excision repair protein (ERCC-1) mRNA, complete cds. cDNA clone pcDE. ACCESSION M28650 KEYWORDS alternative splicing; excision repair protein. SOURCE Human, cDNA to mRNA, clone pcDE. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1098) AUTHORS Hoeijmakers,J.H.J. TITLE Characterization of genes and proteins involved in excision repair of human cells JOURNAL J. Cell Sci. Suppl. 6, 111-125 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 143 1036 excision repair protein site 863 916 alternatively spliced out exon BASE COUNT 243 a 352 c 312 g 191 t ORIGIN 1 aagtgctgcg agccctgggc cacgctggcc gtgctggcag tgggccgcct cgatccctct 61 gcagtctttc ccttgaggct ccaagaccag caggtgaggc ctcgcggcgc tgaaaccgtg 121 aggcccggac cacaggctcc agatggaccc tgggaaggac aaagaggggg tgccccagcc 181 ctcagggccg ccagcaagga agaaatttgt gatacccctc gacgaggatg aggtccctcc 241 tggagtggcc aagcccttat tccgatctac acagagcctt cccactgtgg acacctcggc 301 ccaggcggcc cctcagacct acgccgaata tgccatctca cagcctctgg aaggggctgg 361 ggccacgtgc cccacagggt cagagcccct ggcaggagag acgcccaacc aggccctgaa 421 acccggggca aaatccaaca gcatcattgt gagccctcgg cagaggggca atcccgtact 481 gaagttcgtg cgcaacgtgc cctgggaatt tggcgacgta attcccgact atgtgctggg 541 ccagagcacc tgtgccctgt tcctcagcct ccgctaccac aacctgcacc cagactacat 601 ccatgggcgg ctgcagagcc tggggaagaa cttcgccttg cgggtcctgc ttgtccaggt 661 ggatgtgaaa gatccccagc aggccctcaa ggagctggct aagatgtgta tcctggccga 721 ctgcacattg atcctcgcct ggagccccga ggaagctggg cggtacctgg agacctacaa 781 ggcctatgag cagaaaccag cggacctcct gatggagaag ctagagcagg acttcgtctc 841 ccgggtgact gaatgtctga ccaccgtgaa gtcagtcaac aaaacggaca gtcagaccct 901 cctgaccaca tttggatctc tggaacagct catcgccgca tcaagagaag atctggcctt 961 atgcccaggc ctgggccctc agaaagcccg gaggctgttt gatgtcctgc acgagccctt 1021 cttgaaagta ccctgatgac cccagctgcc aaggaaaccc ccagtgtaat aataaatcgt 1081 cctcccaggc caggctca // LOCUS MZECPPETE 3916 bp ds-DNA ORG 10-AUG-1990 DEFINITION Maize chloroplast cytochrome b559 alpha- and beta-subunit (psbE and F), cytochrome b6-f complex (petE), and photosystem II polypeptide, complete cds, Pro-tRNA and Trp-tRNA. ACCESSION J04502 KEYWORDS cytochrome; photosystem II protein; transfer RNA-Pro; transfer RNA-Trp. SOURCE Maize (strain FR9 cms + FR37) DNA, clone pZmc503. ORGANISM Chloroplast Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Zea mexicana. REFERENCE 1 (bases 2565 to 2725) AUTHORS Haley,J. and Bogorad,L. TITLE A 4-kDa maize chloroplast polypeptide associated with the cytochrome b-6-f complex: Subunit 5, encoded by the chloroplast petE gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 1534-1538 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3916) AUTHORS Haley,J. and Bogorad,L. TITLE Expression of the maize chloroplast petE gene JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by L.Bogorad, 07-MAY-1989. Author address: J.Haley Department of Cellular and Developmental Biology Harvard University Biological Laboratories 16 Divinity Ave Cambridge, MA 02138 Ph: 617-495-4260 FEATURES from to/span description pept 454 332 (c) ORF40 pept 696 580 (c) photosystem II protein (psbL) pept 838 719 (c) cytochrome b559 beta subunit (psbF) pept 1100 849 (c) cytochrome b559 alpha subunit (psbE) pept 2337 2432 ORF31 pept 2596 2709 cytochrome b6-f complex subunit 5 (petE) pept 3499 3627 ORF42 tRNA 2903 2830 (c) Trp-tRNA tRNA 3115 3042 (c) Pro-tRNA BASE COUNT 1242 a 684 c 658 g 1332 t ORIGIN 1 bp upstream of BamHI site. 1 ggatcctctc ccctctttct tgttgcttca taagagtgaa tccattttat gggcgaaagg 61 cgggggcttt aaatcgaccg atggtttgct tcactcacta acatcattaa caaacaaaag 121 aataaatgga gggattccaa ccatcagagc aaaggttttc tctttgttat ttttacaaat 181 agaaataggt aaccaatttc tagattatgg aacaaaaccg cgttataaca ataagaattc 241 cgcgggccct ttccgctcta atcagataaa ggggagtaag gacccgctaa gttcctactt 301 tttcatgttt acaatctgga tcctccgatt actatagaga tgaacccaat ccagaatacg 361 aaccgtaaaa gaaaaaacct attaaaccaa tcacaagaat accagttaca gtacctatca 421 gccaaagagg aattcttcca gtagtatcgg ccatttcccc tactttcctc cacattttct 481 caagtggtca tgctagagac aaaaacagtc atggatagtt ataaagatgg tatccttcca 541 aatgggataa gagaattctt actactctct ttctttctct caattgaaga agtaattgga 601 aaataaaaca gcaagtacaa aaatgagtaa taaaccccag tatagactgg tacgattcaa 661 ttcaacattt tgttcattcg ggtttgattg tgtcatagtt ctatagttgg aatttggttt 721 atcgttggat gaactgcatt gctgatattg atcccaagaa aaaaaccgtg ggtacagcta 781 atccgtgaac agccagccat cgcactgtaa aaataggata ggttcgatct atggtcattg 841 ggggcctcct aaaaggatct actaaattca tctagttgtt ctaaagaatc aaaacggtcg 901 gttattaatg gaattccttg tcggctttcc gtgaaatact cgtttggtcg aggacttcca 961 aacacgtcat aagctaaacc cgtactgaca aataaccaac ccgcaatgaa tagggaaggt 1021 atagtaatgc tatgaataac ccagtatcga atactggtaa taatatcagc aaaagaacgt 1081 tctcccgtgc ttccagacat gctgagctcc caaaattttt attcaaaaaa ggaattgatt 1141 ccgtaaaaga tgggatccac cagtaaatag aaaattactg atatttcatc cttgtgagat 1201 tgtcaatttt gtaccaaagg tgtattttga gtataccgaa ttagtatagc tatccttcct 1261 atggcacagc aatccagttt tgcttggtcc cgaaacagaa ttcctttttt ctcttttttg 1321 ttccttgtct ataggaaaat tacatgttat tcaaggcatc aatagaaccc cacaattttt 1381 tgggtcctac ttattttcat tgtcttcgga atagtagaat aatttaattt ggaatagcgg 1441 ccaagatctt gggaaaatct aagttaatga ttaataagat aaagaattta ggaaggatat 1501 tctcatattg acgcaataca aagataagta tatgcaaaat cgatcccttt ttagttaaag 1561 gaatttaatt ggttagcata aaataatatc taataaatag aaaatcgaat agcggataat 1621 ccgttatgaa agaaacggaa tacattcttt gaagaatcaa gattcgtaat caatccttgt 1681 cttgtttgtt ggattaggtc taattttctt gaccaaacag caagcatgga actttacaag 1741 aagaacttaa aagaaaagga taattgaagt aactttgaat ccacttttgt tggggttcaa 1801 aaaacgaata aaaataaata aaaaaaagta aatttaagga atagttccct ttttgagggg 1861 gccctcgggg gtcgtggaat gcttttcttc tcctcttatt ccatatggaa tacaatgagt 1921 taaaatttgc tccaaaaaga ggatgaaatc catcctatta aaaaagaaat aatccgaatc 1981 cgaaatagaa agaatttttt ttttttcaaa ttcaattctt tatttatctt ttattccaaa 2041 attctcccga aaatccaatt tcatttttca atggggttag atgatctagt tcttaatatt 2101 attagtttac ttaactgaca gattccacaa caaatctctt gattcggaat tagggactca 2161 tgttccgtct gatgaatcga ttttctcttt tttttacact tctgtatctc actctatctt 2221 gttttttagt attatctaaa ataaccgatg aattatgaat tttccataac ttaaacttag 2281 gtaagtgctt taccaacata tgtagtgtag taaaaaaaat ggaattgaac cccttcatgc 2341 ttactataac tagttatttc ggttttctac tggctgcttt aactataacc ccagctctat 2401 ttattagctt gaacaagata cgtcttattt gaaatgaatt gaatagaaga atctttcttt 2461 tggattcttg gtattctaga ctcttttcca cactaattac caattctttt cttggtcatt 2521 gagattcgtg ggtagtttag actattattt agagatagat cgtacctctt tttttatccc 2581 ctcgaacaaa tcgaaatgat tgaagttttt ctatttggaa tcgtcttagg cctaattcct 2641 attactttag cgggattatt cgtgactgcg tatttgcaat acaggcgtgg ggatcagttg 2701 gatctttgat tgagtaatat ttcttttttg attgacctcc tctctggtct ggaggaggtc 2761 aaattggagt tgcaattcta ctttgttttt ttttaagtta ttttagtttc gacataagat 2821 atatggaatc acgctctgta ggatttgaac ctacgacatc gggttttgga gacccgcgtt 2881 ctaccaaact gaactaagag cgctttcaaa acaaaaaaaa aaaagaaaat ccttttctat 2941 tcctaacgtg tctcacgtcc gtatagtatc cacaaattca agttataccc actttaatcg 3001 atctccccac tactgcccat aacgaagaga gaattaatag gtagggatga caggatttga 3061 acctgtgaca ttttgtaccc aaaacaaacg cgctaccaag ctgcgctaca tcccttttcc 3121 aaattgttgt acaatgccat tgtacacaat tcctttcttg ttttccacat cgtaattttc 3181 ttctatttct ttatccatat agaactttct tgtcatttct tgtttttggt ctcatataag 3241 tcaaggaagg ggtataaact aaaatccagt caaatttcac ctataaaaga aagactacta 3301 ttccttagta atgtatagga agaaggggtc atctttttta gggataggaa aatctcgtct 3361 atacggttca ttctatatat atataatatt tattttgttt ttttagttag gaatttcgcc 3421 taaacaaaag aaatacaaag gatcttgggc aagagtatct gatcatatat gtattccaat 3481 acggaaggag gattttcaat gcgggatata aaaacatatc tctctgtagc acccgtgcta 3541 agtactctat ggtttggggc tttagcaggt ttattgatag aaatcaatcg tttattccca 3601 gatgctttgt cattcccttt tttttaatta tagttattcc tatgcgagag atagaattct 3661 tcgtgacatg acgaaaattc cccctttttg aattcttttt tagtatatga agcaaaaaga 3721 aagaaaagat ggataaggat tgtattcttt aattatttct ctatgtttta ttacttaatt 3781 tacgaatttc aaaaattttg tattctattg gattggattt gttcgagaat tcgaagaatt 3841 acaacaaaat ctttagaaat cacattttta gttaggaact tctatggatt ttattcttct 3901 tcttttcttt ggatcc // LOCUS RATLOX 2672 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Rat aorta lysyl oxidase mRNA, complete cds. ACCESSION J02903 KEYWORDS lysyl oxidase. SOURCE Rat neonatal aorta, cDNA to mRNA, (library of Clontech), clones 7, 13, 8-1 and IIB. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2672) AUTHORS Trackman,P.C., Pratt,A.M., Wolanski,A., Tang,S.-S., Offner,G.D., Troxler,R.F. and Kagan,H.M. TITLE Cloning of rat aorta lysyl oxidase cDNA: Complete codons and predicted amino acid sequence JOURNAL Biochemistry 29, 4863-4870 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by P.C.Trackman, 07-FEB-1990. FEATURES from to/span description pept 287 1516 lysyl oxidase precursor sigp 287 349 pot. lysyl oxidase signal peptide matp 350 1513 lysyl oxidase site 2252 2274 region of dyad symmetry BASE COUNT 650 a 763 c 622 g 637 t ORIGIN 192 bp upstream of TaqI site. 1 cttctacttc agacactgtg cgctctcccg gaccgtcgtg cgccgctccc cgtcgccttc 61 caggactggg aaaggggaga ggaggacggt gccacgtccg acggcctcct gggctggggg 121 cagggtctgc cgctcgccct tggcaccagt ccctgcgacc catccccgag cctcgccctc 181 ctcctccctg ctcgaagagg tctccctcct tcgcgggatc tgagtccctg tcttcatttt 241 tctcctagcc acgtccctcc ccgagaaggg acgagccggg agcatcatgc gtttcgcctg 301 gaccgtgctc tttctgggac agctgcagtt ctgtcccctt ctccgctgcg ccccgcaggc 361 cccgcgcgag cctcccgccg cccccggtgc ctggcgccag acaatccaat gggagaacaa 421 cgggcaggtg ttcagtctgt tgagcctggg ggcgcagtac cagcctcagc gacgccgcga 481 ctccagcgcc actgccccga gagccgacgc aacgctgcag cacagccacg cacgcccatt 541 ctgctgctgc gtgacaaccg cactgcctct gcccgtgcga ggactccaag cccatctggg 601 gtcgccgcgg gtcgtccccg gcccgcagcc cgccactggt tccaagttgg tttctcgccg 661 tcgggggccg gcgatggagc ctcaaggcgc gcagaaccgg actgcgtcgc cacagcctcc 721 gcagctcagt aatctgaggc cacccagcca cgtagatcgc atggtggcga cgacccctac 781 aatccctaca agtactccga cgacaacccc tattataact actatgacac ttatgagaga 841 ccggtccggg agcaggcacc gacctggata tggcaccggt tacttccagt acggtctccc 901 ggacctggta cccgatccct actacatcca ggcatccacg tacgtacaaa agatgtctat 961 gtacaacctg agatgcgctg cggaagaaaa ctgcctggcc agttcagcat atagggcgga 1021 tgtcagagac tatgaccaca gggtactgct acgatttcct cagagagtga aaaaccaagg 1081 gacgtctgac ttcttaccaa gccgcccccg ctactcctgg gagtggcaca gctgccacca 1141 acattaccac agcatggatg aattcagcca ctacgacctg ctggatgcca gcacacagag 1201 gagagtggcc gagggccaca aagcaagctt ctgtctggag gacacttcct gtgattatgg 1261 gtaccacaga cgatttgcct gtactgcaca cacacagggg ttgagtcccg gatgttatga 1321 tacttatgca gcagacatag actgccagtg gattgatatt acagatgtac aacccggaaa 1381 ttacattcta aaggtcagtg taaaccccag ctacctggtg cctgaatcag actacagtaa 1441 caatgtcgta cgctgtgaaa ttcgctacac aggacatcac gcctatgcct caggctgcac 1501 catttcaccg tattagaaag aagctcacct tcccaaagga tgaagcagta cctggtgttt 1561 ggacctatga aaaccgtaga ttagcttaag taggaagact tagatatttt aaaaggcaaa 1621 cggaaaaaca acaaagaagg ttttgtttgg actctttcac aacaaatcac ataactggat 1681 tttgagtgtt taaatcagca ttagattggc acattttaaa tacttattca tgttgcttta 1741 tgaagtaatg gtgtttcaat tctgtgggtg catagtgggc tctttcaaag aattctgaat 1801 ttcttacctt cttttgaaat tatagtgcaa aaagaagagg atattttaat gaatgagcca 1861 caatttgaac tgattacttt ctaaattgcc agacccatga gacaatgatg atgggtttgt 1921 atttgcctca acatagattc gctttttaaa aagggtgttc ctattgtata ggcaaaaatg 1981 gatacacttg gtgctgagga agggtcaaat actaactatt gttgtcacga aatataggtc 2041 tacagcagag agatggtgag tatatattca gatagttaca tccctatata aactatgttt 2101 acattttaga tgcttttctt tctgttaatt gcttaatctc actctgactt gaggtacaac 2161 ttctgttttg gaatgaatta gataattcca gattctggtt tgataattgt tgacattccc 2221 ccatgctact ttttctgagg gcagaaacgt ctaatgtgac gactcttcac attaccatta 2281 cgaggataca cagcacagcg aaatcattcc gatgacaggt gtgatagatg gagagctaac 2341 atgcaactgc cgagtgtttc actgttagcc agaactaagt cacttgcccc acacagcaat 2401 tacaccatga atctctaaca tcacaacctt ctttcaaata cccacggact catccatcct 2461 tccatccgtc atccatccat ccgtccgtcc gtccgtcctg actgcctagt gccactgtct 2521 ggctaggcac acccactatc aacctggttc acctgtcatg gcagcctgta cccacccccg 2581 ccacacaccc cgacgctggc ctatagtgca aaggttgtgc gggctggtcc ttcccacaat 2641 gcagtactgt aatccccgtc cctcctggag cc // LOCUS RHMDCTABA 764 bp ds-DNA BCT 10-AUG-1990 DEFINITION R.meliloti dctA and dctB genes, 5' end. ACCESSION M33555 KEYWORDS regulatory protein. SOURCE R.meliloti DNA. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 764) AUTHORS Wang,Y.-P., Birkenhead,K., Boesten,B., Manian,S. and O'Gara,F. TITLE Genetic analysis and regulation of the Rhizobium meliloti genes controlling C-4-dicarboxylic acid transport JOURNAL Gene 85, 135-144 (1989) STANDARD simple staff_review FEATURES from to/span description pept 726 > 764 dctA protein BASE COUNT 149 a 239 c 252 g 124 t ORIGIN 1 cccgggccgc ttgctgaccg tgcccatggc gaaatgttcg gccatgccgt cggcggacgg 61 cgagccggaa ataatcgcgg aaggcatagt cgttgccgac gaagctcgtc ggctcctgcc 121 agttgctggc cgcgacggcg acgccgctgc ggtcgatcag ataaatgacc gcggcttcgg 181 cgcttgtcgc cagggcctcg agcttacggt tgatgcggtc gagcgacggc ctgtccggcg 241 aaagcaatgc gccacgaatg gctgcgtcgt cggcgaggac aagcggcagg gcgcgctgcc 301 gttccacgac tgctcgaaga agcgaggctt tcaggctggc gtcgatccgg ctctgaccgg 361 caaggccggc gagcgcctgc gaccggccgt agtctcgcgc aagaagaagg ccggctgcca 421 gaaggaccag cgcgactgcc gcgaaaacga gccaggaccg tcgggcccgg ctgcgaacgc 481 atgcgatcgc tcgcttctgc aggaagtttg accatgcgaa cattgtgcat gttttcgccc 541 aggacgccag cacttctgtg cggaaatccg cacatatcca cgaacggcaa gcgagcaccg 601 ctcccaaaaa tgtcatggat tcaatcgcat aggccttccg cggcaaactg gcacgcatgt 661 tgctgacaag ctccacaagg cagccactgc tgtcgatctt cggaaagcgg cccgggaggc 721 ccggcatgtt gccggactgg gcctgccacg tggaggatat catg // LOCUS BSUCOMGA 6278 bp ds-DNA BCT 10-AUG-1990 DEFINITION B.subtilis comG-[1,2,3,4,5,6,and 7] proteins in comG operon, complete cds. ACCESSION M29691 M22854 KEYWORDS exogenous DNA-binding. SOURCE B.subtilis (strain 168) DNA, clone pED4. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 6278) AUTHORS Albano,M., Breitling,R. and Dubnau,D.A. TITLE Nucleotide sequence and genetic organization of the Bacillus subtilis comG operon JOURNAL J. Bacteriol. 171, 5386-5404 (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly provided by D.A.Dubnau, 02-MAR-1989. ORF1 is required for the ability of competent cultures to resolve into two populations with different cell densities on Renografin gradients, as well as for full expression of comE, another late competent locus. ORF1 shows significant similarity to the virB ORF11 protein from Agrobacterium tumefaciens, which is probably involved in T-DNA transfer. The N-terminal sequences of comG ORF3 and, to a lesser extent, the comG ORF4 and 5 proteins are similar to a class of pilin proteins from members of the genera Bacteroides, Pseudomonas, Neisseria and Moraxella. All of the comG proteins, except ORF1, possess hydrophobic domains that are potentially capable of spanning the bacterial membrane and may be part of the DNA transport machinery. FEATURES from to/span description pept 1298 2368 comG-1 protein (ttg start codon) pept 2420 3391 comG-2 protein pept 3405 3701 comG-3 protein pept 3691 4122 comG-4 protein (ttg start codon) pept 4106 4453 comG-5 protein pept 4863 5237 comG-7 protein ORF 4365 4862 comG-6 protein (put.) ORF < 1 886 ORF1 (AA at 2) ORF 6126 6276 ORF2 mRNA 1278 > 6278 comG mRNA signal 1100 1124 pot. transcription termination signal signal 5501 5533 transcription termination signal signal 5696 5715 transcription termination signal signal 1243 1248 comG -35 region signal 1266 1271 comG -10 region BASE COUNT 1960 a 1122 c 1383 g 1813 t ORIGIN 1 bp upstream of BglII site. 1 agatctaatc catttttctc actggcctca gtgtgaaaag tggtttgaaa ataaccatca 61 cgttaatttt ttgcgagtag atacaactga aacggaaaat gaagcagtat ttgggtcgat 121 tgtttatgat caggggcttg gtgaagaaaa agaccatact gtttttcact tttatatcac 181 cagacaatat ttttttacaa tcaactttga cttttcaatt ttgagagaga ttaaaggcaa 241 agaagttgtt cggcaaatgg aaagagcgga caatgcgata gaggggtttt taattcttct 301 cggcgaacta atgaatgcgt atttaatcgg tgttgatgaa tttgaagtca agctgagaaa 361 gctcagatgg caaattaaag acgacaatag caaaagcatt ttaaaccgcg tccatctcct 421 gcgccatgaa ctgatgattt ggaaaaattt gatattaagc gctaaaaaaa ttgaaatggc 481 gttgaaagaa acctttttac ctcaaaatga agggaaaaag gattatcagc ggacacaact 541 gaagattgac aggggattta catacatcag cgaatttgaa ggggagctta acaatctgct 601 gcattcagag gaagtcatta cctcacatag ggggaatgaa attgtaaaag cgctgaccat 661 tttcacgacg ctttttactc cgattacagc tctgggtgcc ttatggggga tgaacttttc 721 agtgatgccg gaactgaatt ggaaatacgg atatctcttt tccctcttat tgattgtcac 781 atctacagtt ctgatctatc tctatttgag aaaaaaaggc tggacgggag atatgctgca 841 ggagcggaag aagaaaaaga aacctcgaaa aaggcggact ctataggatg tttcatattt 901 tgtgcagcgt gccccgcttt ttcaccagac atatcagggt gaccggatac gatgtcaagg 961 ggcttatgac agagcattaa atccgcagtt tatcgattct tgaaaatgac caaatgaccg 1021 gtattgttgc attaggcgat ctttccgttg agaaagatac tggtcaataa gcgaaaacag 1081 cataatgaaa atggaatcta gcaggcatgg tgaccatgtc tgctttttta tttataggga 1141 aaattataat gacaggggta cattcagttg aaagtctttt ttcttgccag aaagaattgg 1201 tttttcagca tataacatct cacaaaatca cgttttccct gtttgattac cttttcttct 1261 ttttctacaa tatgcgttga aaggagaggg aatcaaattg gattcaatag aaaaggtaag 1321 caaaaacttg attgaagagg catatctaac aaaggcttct gatattcaca ttgtgccgag 1381 ggagcgggac gctatcattc attttcgggt cgatcatgcc ttgctgaaaa aaagggacat 1441 gaaaaaagaa gagtgcgtaa gactgatttc acattttaaa tttctttcag caatggatat 1501 aggtgaaagg cgaaagccgc aaaacggttc gcttacgtta aagttgaaag agggaaatgt 1561 tcatttaaga atgtcaacgc tgcccacaat taatgaagaa agcctcgtga tcagagtgat 1621 gccccaatac aatatccctt cgattgataa attgtcgcta tttccgaaga caggagccac 1681 attactctcg tttttaaaac attcccatgg catgctcatt tttaccgggc cgactggttc 1741 agggaagact accacattat actctctcgt tcaatatgca aaaaaacact ttaatcgaaa 1801 tattgtcaca ttagaggacc ctgttgaaac aagggacgaa gatgttcttc aggttcaggt 1861 gaatgaaaaa gccggtgtaa cttattccgc aggtctgaaa gcaattttgc gccatgaccc 1921 cgatatgatt attttaggtg agatcagaga cgcggaaaca gctgaaattg cggtgcgggc 1981 agcgatgacg ggacatctgg tactaacgag ccttcatacg agagacgcaa agggcgcaat 2041 ttacagactg cttgaattcg gtatcaatat gaatgaaatc gaacagactg tcattgcaat 2101 agcggctcag cgcttggttg atttggcttg cccgttttgt gaaaacggat gttcatcagt 2161 gtattgccga cagtcacgaa atactaggag agctagcgtt tatgagcttc tatacgggaa 2221 aaatcttcag caatgtatcc aggaggcaaa aggaaatcat gcaaattacc aatatcaaac 2281 gcttcgtcaa attatcagaa aaggaattgc gctcggctat ttaacgacaa acaactatga 2341 ccggtgggtt tatcatgaaa aagattagaa agtctggttg ttaaaggatc aagccaggtt 2401 attaaagagg ctcggtgaaa tgactgcggg cggatataca cttctggatg gattacgcct 2461 gatggaactt cagatgaata agaggcaggc ggctgacttg actgattcgg tcacttgttt 2521 gagggaaggg gctccgtttt atcaagtact aaagagtttg tcatttcata aggaagccgt 2581 aggtatttgt tattttgctg aaacacatgg tgaactgcct gcttcaatga tccagagcgg 2641 agagctgctg gaacgaaaaa ttgcacaggc agaccagctg aaaagagtgc tgcgctatcc 2701 gcttttcctc atctttacgg tcgctgtcat gttttatatg ttacagtcca tcatcattcc 2761 tcagttttcc ggtatctatc aatcgatgaa tatggaaacc tcacgttcaa ccgatatgct 2821 ttttgctttt tttcagcata ttgatcttgt gatcattttg cttgttcttt ttacagcagg 2881 tatcgggatt tattattggc ttgtgtttaa gaaaaaatca cctgcccggc aaatgctgat 2941 ttgtatcagg attcctttgg ttggaaagct tgtaaagctg tttaacagct actttttttc 3001 tttgcagcta agcagccttt taaaatcagg cctctcaatt tatgacagcc ttaatgcatt 3061 taaacatcaa acgtttctcc ctttctaccg ctgcgaggct gaacaattga ttgaacggct 3121 aaaagccggt gagtcaattg aatccgctat ttgtggaagc cttttttatg aaactgattt 3181 atcaaaagtc atatctcacg gccagctgag cggccgattg gatcgggagc ttttcacata 3241 cagccaattc atattacagc ggctggaaca caaagcgcaa aaatggacag gcatccttca 3301 gccaatgatt tatggatttg ttgcagcgat gatcttactt gtgtatttat ctatgcttgt 3361 gcctatgtat cagatgatga atcaaatgtg aaaggaagag gctgatgaat gagaaaggat 3421 ttacacttgt tgaaatgtta atcgtgctct ttattatttc gattttgctt ttaattacga 3481 taccgaacgt cacgaaacat aatcaaacca ttcaaaaaaa gggctgtgaa ggcttacaaa 3541 acatggttaa ggcacaaatg actgcatttg agcttgatca tgaaggacaa actccgagcc 3601 ttgccgattt acagtcagag ggctatgtga aaaaggatgc tgtctgtcca aatggtaagc 3661 gcattatcat caccggcgga gaagttaagg ttgaacatta aattaaacga ggagaagggg 3721 tttacccttt tagaaagttt gcttgtgtta agccttgcct ctatcctcct ggtggccgtc 3781 ttcactacac ttcctcctgc ttatgacaat acagctgtcc gacaggcagc aagtcagctg 3841 aaaaatgata ttatgctcac acagcagact gctatttccc gtcaacaaag aacaaaaatt 3901 ctctttcata aaaaagaata tcaattagtc attggtgata cggttattga acgtccgtat 3961 gcaacgggac tttctataga actgctgaca ttaaaagacc gtttggaatt taatgagaaa 4021 gggcacccga atgcaggcgg aaaaatacga gtaaaaggcc atgccgttta tgacataaca 4081 gtttatctag ggagcgggag agtcaatgtg gagagaaaat aaaggttttt ctacaataga 4141 aacaatgtct gcgctaagcc tgtggctgtt tgtgctgctg acagtcgtcc ccttgtggga 4201 caagctgatg gctgatgaaa aaatggcgga atcacgagaa attggctatc agatgatgaa 4261 tgagagcatt agcaaatatg tcatgagtgg tgaaggagcc gcgtcaaaaa cgattacaaa 4321 gaacaatcat atctatgcaa tgaagtggga ggaggagggc gaatatcaaa acgtatgtat 4381 caaagccgca gcttataaag aaaaatcatt ttgcctcagc attttgcaga cagaatggct 4441 acacgcttct taacgtatta ttttcgctct cagtcttttt gctcatatca ggatcgttag 4501 ctgcgattat ccatctgttt ttgtctcgac agcaggaaca tgacggtttc acacagcagg 4561 aatggatgat ttcgatagaa cagatgatga atgaatgcaa ggaatcacag gcagttaaga 4621 cagccgagca tgggagcgtg ttaatctgca ccaatctttc cggacaagac atccgttttg 4681 acatttatca ttcaatgata agaaaaagag tggatggcaa agggcatgtt ccgattttag 4741 atcatattac tgccatgaaa gctgatattg aaaatggtgt tgttttgctg aaaattgaga 4801 gtgaagacca aaaagtgtat caaactgctt ttccagtcta ttcgtattta ggaggggggt 4861 gaatgtatcg tacaagaggg tttatttatc cagctgttct ttttgtgtca gcgcttgtgc 4921 tgttaatcgt gaactttgtt gctgctcaat atatttcacg ctgcatgttt gagaaggaaa 4981 caaaagagtt atacatagga gagaatttgc ttcaaaatgg ggtgcttctt tcgattcggc 5041 atgttctaga ggaacggaaa ggccaggagg gtacgcagca atttctatat ggacgggttt 5101 cttattacat tcatgataca tcgataaaag aacaaaaaga aatcaactta agagtgtcaa 5161 cggattcggg aacagaaaga actgcacaga tcgtgtttga ccaaaaacag aaaaaactgc 5221 tgagatggac agaataaaac agtgtaaagg gtataaaaaa agtcatgtga gacaacactc 5281 ataataattg aatgatgagg tgatcacgtg aaaacgaatg attatgttaa atatatgacg 5341 cagcaatttg tcaaatatat agatactccg agagatgagc gaaaagaacg aaaagaggtg 5401 cggaaagaaa caaaaacgcc tgtttcccag cagtggttcg gtattttacc ctatggcttc 5461 cgactttggc tgaaacggaa aaaataaccg caaataaacg aataaggtcc ttcaaaaaat 5521 ggaggacctt attgatattc ttctaatatg gcaattttat tgaccttttg gctataagga 5581 tcaaatgaaa tcgtcacaaa aacgccgaat tcttttgacc cttccctcag agttaaatgg 5641 tattgcttca ctgcttcatc ttttctttta cggtcccata ctttttgttt gaacagtacc 5701 tgtgcgagcg ggtacctttt ttttgcttct tttacagcaa tctcttccca tttggacatg 5761 tggcgggcgg ttacaagcgg tgtttcttct gcgtgagcgg ctgtggtgcc aaagacgaga 5821 agagatagac aaatcacaca ttgtttgatc atcatgctgt cacctttctt tgtttattat 5881 taccaaataa taatgggata tgcatttaac ttctcacata acaatcccaa aaatttctaa 5941 aaaattgaaa aaatgagcaa tactgagcaa gactttgtaa tatgatgaaa acattctttt 6001 aaacgaacaa aatgagcgat ttcggtgttt ttaaatctat aaatcgttga ttatactcta 6061 tttgtgaagt tctttaaaga gaacgattgt catatcaagt tacagtgttt tacaggaggt 6121 aagatatgtt tcgattgttt cacaatcagc aaaaggcgaa gacgaaactg aaagttctgc 6181 ttatctttca gctttcagtc attttcagtc tgactgccgc aatatgctta caattttcga 6241 tgatacaagc gctgcttttc atgatattga aacatttg // LOCUS CELTC1A 234 bp ds-DNA INV 10-AUG-1990 DEFINITION C.elegans Tc1(1664) element with a DNA insert. ACCESSION M22301 M29977 KEYWORDS Tc1 element; Tc1 repetitive element. SOURCE C.elegans (var. Bristol, strain N2) DNA, clone pCeh45. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 234) AUTHORS Harris,L.J. and Rose,A.M. TITLE Structural analysis of Tc1 elements in Caenorhabditis elegans var. Bristol (strain N2) JOURNAL Plasmid 22, 10-21 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by L.J.Harris, 18-JAN-1989. FEATURES from to/span description rpt < 1 > 234 Tc1(1664) insertion element rpt 85 94 direct repeat copy A rpt 107 116 direct repeat copy B rpt 128 137 direct repeat copy C recomb 87 88 Tc1(1664) DNA end/inserted DNA start recomb 141 142 inserted DNA end/Tc1(1664) DNA start BASE COUNT 67 a 36 c 32 g 99 t ORIGIN 3 bp upstream of EcoRV site. 1 gatatccact tttggttttt tgtgtgtaac ttttttctca agcatccatt tgacttgaat 61 ttttccgtgt gcataaagcg aaatgtttcg caaattaagc aaataaagtt tcgcaaagaa 121 ttaaattgtt tcgcaaacga aacgcaaatt tgcggaccaa acattacatg attatcgatt 181 ttttctgaat tttatttcaa ttttttgatt ttttcgtttt tccaattttc atta // LOCUS HUMDEF1A 448 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human neutrophil peptide (defensin) 1 mRNA, complete cds. ACCESSION M21130 M22160 KEYWORDS cytotoxic protein; defensin 1; microcidal protein; neutrophil peptide. SOURCE Human leukocyte (myeloid cell line HL-60), cDNA to mRNA, clones HNP1a and HNP1b. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 448) AUTHORS Daher,K.A., Lehrer,R.I., Ganz,T. and Kronenberg,M. TITLE Isolation and characterization of human defensin cDNA clones JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7327-7331 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K.Daher, 25-NOV-1988. Submission form for [1] kindly provided by M.Kronenberg, 01-JAN-1989. FEATURES from to/span description pept 51 335 neutrophil peptide 1 precursor /nomgen="DEF1" /map="8p23" /hgml_locus_uid="LR0109A" sigp 51 107 neutrophil peptide 1 signal peptide matp 243 332 neutrophil peptide 1 mRNA < 1 448 neutrophil peptide 1 mRNA BASE COUNT 106 a 127 c 110 g 105 t ORIGIN Unreported. 1 tctgccctct ctggtcaccc tgcctagcta gaggatctgt gaccccagcc atgaggaccc 61 tcgccatcct tgctgccatt ctcctggtgg ccctgcaggc ccaggctgag ccactccagg 121 caagagctga tgaggttgct gcagccccgg agcagattgc agcggacatc ccagaagtgg 181 ttgtttccct tgcatgggac gaaagcttgg ctccaaagca tccaggctca aggaaaaaca 241 tggcctgcta ttgcagaata ccagcgtgca ttgcaggaga acgtcgctat ggaacctgca 301 tctaccaggg aagactctgg gcattctgct gctgagcttg cagaaaaaga aaaatgagct 361 caaaatttgc tttgagagct acagggaatt gctattactc ctgtaccttc tgctcaattt 421 cctttcctca tcccaaataa atgccttg // LOCUS MUSMIPAA 753 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse macrophage inflammatory protein (MIP)-1a mRNA, complete cds. ACCESSION X12531 M19382 KEYWORDS lymphokine; macrophage inflammatory protein. SOURCE Mouse cell line RAW264.7., cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 753) AUTHORS Davatelis,G., Tekamp-Olson,P., Wolpe,S.D., Hermsen,K., Luedke,C., Gallegos,C., Coit,D., Merryweather,J. and Cerami,A. TITLE Cloning and characterization of a cDNA for murine macrophage inflammatory protein (MIP), a novel monokine with inflammatory and chemokinetic properties JOURNAL J. Exp. Med. 167, 1939-1944 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by G.Davatelis, 27-MAY-1988. FEATURES from to/span description pept 71 349 macrophage inflammatory protein precursor sigp 71 139 macrophage inflammatory protein signal peptide matp 140 346 macrophage inflammatory protein mRNA < 1 753 MIP mRNA signal 753 753 polyA signal BASE COUNT 190 a 203 c 143 g 217 t ORIGIN 1 aagcagcagc gagtaccagt cccttttctg ttctgctgac aagctcaccc tctgtcacct 61 gctcaacatc atgaaggtct ccaccactgc ccttgctgtt cttctctgta ccatgacact 121 ctgcaaccaa gtcttctcag cgccatatgg agctgacacc ccgactgcct gctgcttctc 181 ctacagccgg aagattccac gccaattcat cgttgagtat tttgaaacca gcagcctttg 241 ctcccagcca ggtgtcattt tcctgactaa gagaaaccgg cagatctgcg ctgactccaa 301 agagacctgg gtccaagaat acatcactga cctggaactg attgcctgag agtcttggag 361 gcagcgagga accccccaaa cctccatggg tcccgtgtag agcaggggct tgagccgaac 421 attcctgcca cctgcatagc tccatctcct ataagctgtt tgctgccaag tagccacatc 481 gagggactct tcacttgaaa ttttatttaa tttaatccta ttggtttaat actatttaat 541 tttgtaattt attttattgt catacttgta tttgtgacta tttattctga aagacttcag 601 gacacgttcc tcaaccccca tctccctccc agttgttcac actgtttggt gacagctatt 661 ctaggtagac atgatgacaa agtcatgaac tgacaatgta caatagatgc tttgtttata 721 ccagagaagt aataaatatg ccctttaaca agt // LOCUS ONGSSDNA 154 bp ds-DNA INV 10-AUG-1990 DEFINITION O.volvulus species-specific DNA. ACCESSION J03990 M22037 KEYWORDS . SOURCE O.volvulus (strain Liberian) DNA, clone C1A1-2. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 154) AUTHORS Harnett,W., Chambers,A.E., Renz,A. and Parkhouse,R.M.E. TITLE An oligonucleotide probe specific for Onchocerca volvulus JOURNAL Mol. Biochem. Parasitol. 35, 119-126 (1989) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by W.Harnett, 18-JAN-1989. BASE COUNT 50 a 23 c 27 g 54 t ORIGIN 1 gaattcaaaa aaatatagtc gaatattttt cttaggaccc agtttgaagg tacgtacccg 61 ttttttgaaa ttatatgtca taggtcatct gttaataatc aattttgcaa aatgcgtttt 121 tcgccgaaaa aatcgccgtg taaatgtgga attc // LOCUS RHMDCTAB 2300 bp ds-DNA BCT 10-AUG-1990 DEFINITION R.meliloti C-4-dicarboxylate carrier protein (dctA) gene, complete cds, and dctB protein gene, partial cds. ACCESSION M26399 M29729 KEYWORDS C4-dicarboxylate carrier protein; membrane protein; transport protein. SOURCE R.meliloti (strain 2011) DNA, clone pRmSC121. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 2300) AUTHORS Engelke,T., Jording,D., Kapp,D. and Puehler,A. TITLE Identification and sequence analysis of the Rhizobium meliloti dctA gene encoding the C4-dicarboxylate carrier JOURNAL J. Bacteriol. 171, 5551-5560 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Jording, 13-OCT-1989. FEATURES from to/span description pept 750 2111 C4-dicarboxylate carrier A (put.) pept 786 2111 C4-dicarboxylate carrier A' (put.) pept 555 < 1 (c) dctB protein signal 582 597 activator sequence signal 673 689 5' promoter region BASE COUNT 388 a 758 c 707 g 447 t ORIGIN 1 gtcgacgcgc cgggaaatat aaagcccggg ccgattgctg accgtgccca tggcgaaatg 61 ttcggccatg ccgtcgcgga cggcgagccg gaaataatcg cggaaggcat agtcgttgcc 121 gacaaagctc gtcggctcct gccagttgct ggccgcgacg gcgacgccgc tccggtcgat 181 cagataaatg accgcggctt cggcgcttgt cgccagggcc tcgagcttac ggttgatgcg 241 gtcgagcgac ggcctgtccg gcgaaagcaa tgcgccacga atggctgcgt cgtcggcgag 301 gacaagcggc agggcgcgct gccgttccac gactgctcga agaagcgagg ctttcaggct 361 ggcgtcgatc cggctctgac cggcaaggcc ggcgagcgcc tgcgaccggc cgtagtctcg 421 cgcaagaaga aggccggctg ccagaaggac cagcgcgact gccgcgaaaa cgagccagga 481 ccgtcgggcc cggctgcgaa gcgcatgcgg atcgctcgct tctgcaggaa gtttgaccat 541 gcgaacatgg tgcatgtttt cgcccaggac gccagcactt ctgtgcggaa atccgcacat 601 atccacgaac ggcaagcgag caccgctccc aaaaatgtca tggattcaat cgcataggcc 661 ttccgcggca aactggcacg catgttgctg acaagctcca caaggcagcc actgctgtcg 721 atcttcggaa agcggcccgg gaggccggca tgttgccgga ctgggcctgc cacgtggagg 781 atatcatgat catcgaacat tccgcggagg tccgcggcaa gacacccctt taccgccatc 841 tctatgttca ggtgctggcg gcgatcgccg cgggcatcct gctcgggcat ttctatcctg 901 atatcggcac ggagctcaaa ccgcttggcg acgccttcat caggctcgtg aagatgatca 961 tcgcgccggt gatcttcctg acggtcgcga ccgggattgc cggcatgacc gatctcgcca 1021 aggtcggccg cgtcgccggc aaggcgatga tctactttct cgccttctcc accctcgcgc 1081 ttgtcgtcgg cctcgtcgtc gcaaacgtgg tgcagccggg cgcgggcatg catatcgatc 1141 cggcctcgct ggacgccaag gcggtcgcga cctatgccga gaaagcgcat gagcagtcga 1201 tcaccggctt cctgatgaac atcatcccga cgacgctcgt cggcgccttc gccgaaggcg 1261 acatcctgca ggtcctgttc atctcggtgc tcttcggtat ctcgctggcg atcgtcggca 1321 agaaagccga gcccgtggtc gatttcctgc aggcgctgac gctgccgatc ttccggctcg 1381 ttgcgatcct gatgaaggcc gccccgatcg gcgccttcgg cgccatggcc ttcaccatcg 1441 gcaagtacgg tatcgcctcg atcgccaatc tcgccatgct gatcggcacc ttctatctga 1501 cgtcgtttct cttcgtcttc atcgttctcg gcgcggtcgc acgctataac ggcttctcga 1561 tcctctcgct catccgctac atcaaggagg agctgctgct ggtgctcggg acgtcctctt 1621 cggaggcggc tcttccgggc ctcatgaaca agatggagaa ggccggctgc aagcgctcgg 1681 tcgtcggtct cgtcattccg accggttatt ccttcaacct ggacggcacc aatatctaca 1741 tgacccttgc ggccctgttc atcgcccagg cgaccgatac gccgctctca tacggcgacc 1801 agattctgct gctcctcgtc gcaatgctga gctcgaaggg tgcggccggc attaccggcg 1861 ccggcttcat cacgcttgcc gcaacgctct cggtcgttcc ctccgtgccg gtcgccggca 1921 tggcgctgat cctcggcatc gaccgcttca tgtcggaatg ccgcgccctg accaatttcg 1981 tcggcaacgc ggttgcgacg atcgtggtgg cgaagtggga gggcgagctc gatcaggcgc 2041 agctttccgc agctctcggc ggcgaggcgt ccgtcgaggc catcccggcg gtcgtccagc 2101 ccgccgaata agcttgcctc ccagggcgag accgcacgct ggtgcgcttg gcgcggtccg 2161 gttctccggg ccgcgctttt ttatcgccgg ccaggcctga gggggtgggc actcggcaca 2221 agatcagatg ccgtgcagca cctgcgtcgc cttcactgcc gccgaggccc ggttttcgac 2281 gccgagcttc acatagatct // LOCUS HUMMYOHP 6016 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human perinatal myosin heavy chain mRNA, complete cds. ACCESSION M36769 M35250 Y00821 KEYWORDS myosin; myosin heavy chain. SOURCE Human fetal muscle, cDNA to mRNA, clone pSMHCP. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 2651 to 6016) AUTHORS Leinwand,L.A. TITLE ; JOURNAL Unpublished (1989) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 2651 to 6016) AUTHORS Feghali,R. and Leinwand,L.A. TITLE Molecular genetic characterization of a developmentally regulated human perinatal myosin heavy chain JOURNAL J. Cell Biol. 108, 1791-1797 (1989) STANDARD simple automatic REFERENCE 3 (bases 1 to 2650) AUTHORS Karsch-Mizrachi,I., Feghali,R., Shows,T.B. and Leinwand,L.A. TITLE Generation of a full-length human perinatal myosin heavy-chain-encoding cDNA JOURNAL Gene 89, 289-294 (1990) STANDARD full staff_review COMMENT Author address: L.A.Leinwand Albert Einstein College of Medicine 1300 Morris Park Avenue Bronx, N.Y. 10461, USA. Submitted (12-APR-1989) on tape to the EMBL data library. Draft entry and computer-readable sequence for [3] kindly submitted by I.Karsch-Mizrachi, 18-JUN-1990. FEATURES from to/span description pept 74 5887 perinatal myosin heavy chain mRNA < 1 6016 perinatal myosin heavy chain mRNA BASE COUNT 1953 a 1300 c 1586 g 1177 t ORIGIN 1 gtggaacact tctgaacctg catttttatc tggaactcca gaagcagaat cctttgctaa 61 ataaatcgca gccatgagtg cgagctcaga cgctgagatg gctgtttttg gcgaagctgc 121 tccctacctt cgaaaatcag aaaaggagcg gattgaggcc caaaacaagc cgtttgatgc 181 taaaacatct gtctttgtgg cggagcccaa ggaatcctat gtgaagagca ctatacaaag 241 caaagaagga gggaaagtaa ccgtaaagac tgaaggtgga gcaactctaa ctgtcaggga 301 agaccaagtc ttccctatga accctccgaa atatgacaaa attgaggaca tggccatgat 361 gactcatcta cacgagcctg gagtgctgta caacctcaaa gagcgctatg cagcctggat 421 gatctacacc tactcaggcc tcttctgtgt caccgtcaac ccctacaagt ggctgccggt 481 gtacaagccc gaggtggtgg ctgcctacag aggcaaaaag cgccaggagg ccccgcccca 541 catcttctcc atctctgaca atgcctatca gttcatgttg actgatcgag agaatcagtc 601 catcctgatc accggagaat ctggtgccgg aaagactgtg aacaccaagc gtgtcatcca 661 atactttgca acaattgcag ttactggaga gaagaagaag gatgaatctg gcaaaatgca 721 ggggactctg gaagatcaaa tcatcagcgc caatccccta ctggaggcct ttggcaatgc 781 caaaaccgtg aggaatgaca actcctctcg ctttggtaaa ttcattagaa tccactttgg 841 tactacaggg aagctggcat ctgctgatat agaaacatat cttttagaaa agtccagagt 901 tactttccag ctaaaggcgg aaagaagcta ccatattttt tatcagatca cttccaataa 961 gaagccagat ctaattgaaa tgctcctgat caccaccaac ccatatgact atgccttcgt 1021 cagtcagggg gagatcacag ttcccagtat tgatgaccaa gaagagttga tggccactga 1081 tagtgccatt gacatcctgg gcttcactcc tgaagagaaa gtgtccatct ataaactcac 1141 aggggctgtg atgcattatg ggaacatgaa attcaagcaa aagcagcgtg aggagcaagc 1201 tgagccagat ggcacagaag tcgctgacaa ggcagcctat ctccagagtc tgaactctgc 1261 agacctactc aaagccctct gctaccctag ggtcaaggtt ggcaatgagt atgtcaccaa 1321 aggccagact gtgcagcagg tgtacaatgc cgtgggtgct ctggccaaag ccgtctacga 1381 gaagatgttc ctgtggatgg tcacccgcat caaccagcag ctggacacca agcagcccag 1441 gcagtacttc atcggggtct tggacattgc tggctttgaa atctttgatt ttaacagcct 1501 ggagcagctg tgcatcaact tcaccaacga gaaactgcaa cagtttttca accaccacat 1561 gtttgtgcta gagcaggagg agtacaagaa ggaaggcatc gagtggacgt tcattgactt 1621 tgggatggac ctggctgcct gcattgagct cattgagaag ccactgggca tcttctccat 1681 cctggaagag gagtgcatgt tccctaaggc cacggacacc tccttcaaga acaagctgta 1741 tgaccagcac ctgggcaagt ctgccaactt ccagaagccc aaggtggtca aaggcaaggc 1801 tgaggcccac ttctctctga ttcactatgc tggcactgtg gactacaaca ttactggctg 1861 gctggacaaa aataaggacc ccctgaatga tactgtggtt gggctgtacc agaagtctgc 1921 aatgaagact ctagccagtc tcttttccac gtatgctagt gctgaagcag atagcagcgc 1981 gaagaaaggt gctaagaaaa agggctcttc tttccagact gtgtctgccc ttttcaggga 2041 aaatttaaat aaattgatga cgaatctgag gagcacacac cctcacttcg tacggtgtat 2101 cattcccaat gaaaccaaaa ctcctggggc aatggaacat gaacttgtgt tgcaccagct 2161 gaggtgtaat ggtgtgctgg aaggcatccg catctgtagg aaaggattcc caagcagaat 2221 cttatatggt gatttcaaac aaagatacaa ggttttaaat gcaagtgcta ttccagaggg 2281 acagttcatt gacagcaaga aggcttctga gaaacttctt gcatctattg atattgatca 2341 tactcaatat aaatttggac ataccaaggt tttcttcaaa gctggacttc tgggtcttct 2401 ggaagaaatg agagatgaaa aattagccca aattataaca agaacacaag ctgtctgtag 2461 gggattccta atgagggtag aatatcagaa gatgttgcaa aggagagaag cacttttctg 2521 catccagtat aatgtccgtg ccttcatgaa cgtcaagcac tggccctgga tgaaactctt 2581 tttcaagatt aagcccctcc tcaagagtgc agagaccgag aaagagatgg ccaccatgaa 2641 ggaagaattc cagaaaacca aagatgaact cgccaagtca gaggcaaaac ggaaggagct 2701 agaggaaaaa atggtcactc tcttaaaaga gaaaaatgac ctgcaactcc aggttcaatc 2761 tgaagcagat agcttggctg atgcagagga aaggtgtgag caactgatta aaaacaaaat 2821 ccaacttgag gccaaaatca aagaggtgac tgaaagagct gaggaggagg aagagatcaa 2881 tgctgagctg acagccaaga agagaaaact ggaggatgaa tgttcagaac tcaagaaaga 2941 cattgatgac cttgagctga cactggccaa ggttgagaag cagaaacatg ccacggagaa 3001 caaggtgaaa aatcttacag aagagatggc aggcctggat gaaaccattg caaaactgtc 3061 caaggagaag aaggctctcc aagagaccca ccagcagacc ctggatgacc tgcaggcaga 3121 ggaggacaaa gtcaacatcc tgaccaaagc taaaaccaag ctagaacagc aagtggatga 3181 tcttgaaggg tctctggaac aagaaaagaa gcttcgaatg gatctagaaa gagcaaagcg 3241 gaaactggag ggtgacctca aattggccca agaatccaca atggatatgg aaaatgacaa 3301 acagcaactt gatgaaaagc ttgaaaagaa agaatttgaa atcagcaatt tgataagcaa 3361 aattgaagat gagcaagctg tagaaattca actacagaag aagatcaaag agttgcaggc 3421 ccgcattgag gagctggggg aagaaatcga ggcagagagg gcgtcccgag ccaaagcgga 3481 gaagcagcgc tctgacctct cccgggaact ggaggagatc agcgagaggc tggaagaagc 3541 cggtggggca acttctgctc aggtggaatt gaacaagaag cgggaggctg agtttcagaa 3601 actgcgcagg gacctggagg aggccaccct gcagcatgaa gctatggtgg ctgctcttcg 3661 gaagaagcac gcagacagta tggctgagct tggggagcag attgacaact tgcagcgggt 3721 caaacagaag ctggagaagg agaagagtga gctgaagatg gagactgatg acctcagcag 3781 taacgcagag gccatttcca aagccaaggg ccaccttgaa aagatgtgcc gctctctaga 3841 agatcaagtg agtgggctta agaccaagga agaggagcag cagcggctga tcaatgacct 3901 cacagcacag agagcgcgcc tgcagacaga agcgggtgaa tattctcgac aattagatga 3961 gcaagatgct ttagtctctc agctttcaag gagcaagcaa gcatctactc agcagattga 4021 agagctgaaa catcaactag aggaagaaac taaagccaag aacgccctgg cacacgccct 4081 gcagtcctcc cgccatgact gcgacctgct gcgggaacag tatgaggaag agcaggaagg 4141 caaagctgag ctgcagaggg cgctgtccaa ggccaacagt gaggttgccc agtggagaac 4201 caaatacgag acggatgcca tccagcgcac agaggagctg gaggaggcca agaaaaagtt 4261 ggcccagcgc ctgcaagaag ctgaggaaca tgtagaagct gtgaacgcca aatgtgcttc 4321 ccttgagaag acgaagcagc ggctccagaa tgaagttgaa gacctcatgc ttgatgtgga 4381 aaggtctaat gcagcctgtg cagcccttga taagaagcaa aggaactttg acaaggtcct 4441 atcagaatgg aagcagaagt atgaggaaac tcaggctgaa cttgaggcct cccagaagga 4501 gtcacgttct cttagcactg agctgttcaa ggtgaagaat gtctatgagg aatccctgga 4561 tcaactcgaa acgctaagaa gagcacataa gaacttgcaa caggagattt ctgacctcac 4621 tgagcagatt gcagagggag gaaagcaaat tcatgaattg gagaaaataa agaagcaagt 4681 agaacaagag aaatgtgaaa ttcaggctgc tttagaggaa gcagaggcat ctcttgaaca 4741 tgaagaagga aagattctgc gtatccagct tgagttaaac caagtcaagt ctgaagttga 4801 tagaaaaatc gcagaaaagg atgaggaaat tgaccagctg aagagaaacc acactagagt 4861 cgtggagaca atgcagagca cgctggatgc agagattaga agcagaaatg atgctctgag 4921 agtcaagaag aaaatggaag gagatctgaa tgaaatggaa atccagctga accatgccaa 4981 tcgcttagct gcagagagtt taaggaacta caggaacacc caaggaatcc tgaaggaaac 5041 ccagctccac ctggatgatg ctctccgggg ccaggaggac ctcaaggaac agctggcaat 5101 tgtggagcgc agagccaacc tgctgcaggc tgagatcgag gagctgtggg ccactctgga 5161 acagacagag agaagcagga aaatcgccga acaggagctc ctggatgcca gtgagcgtgt 5221 ccagctcctc cacacccaga ataccagtct cattaacacc aagaagaaat tagaaaatga 5281 cgtttcccaa ctccaaagtg aagtggaaga agtaatccaa gaatcacgca atgcagaaga 5341 gaaagccaag aaggccatca ctgatgctgc catgatggct gaggagctga agaaggaaca 5401 ggacaccagc gcccacctgg agcggatgaa gaagaacctg gagcagacgg tgaaggacct 5461 gcagcatcgt ctagatgagg ccgagcagct ggcgctgaag ggtgggaaga agcagatcca 5521 gaaactggag gccagggtac gtgagcttga aggagaggtt gaaaatgaac agaaacgtaa 5581 tgcagaggct gttaaaggtt tacggaaaca tgaccgacga gtaaaagaac tcacctacca 5641 gactgaagaa gatcgcaaga atgttctcag gctgcaggac ttggtagata aattacaggc 5701 gaaggtgaaa tcatacaaga gacaagctga ggaggctgag gaacaatcca atgctaatct 5761 atctaaattc cgcaaactcc agcatgagct ggaggaggcc gaggaacggg ctgacattgc 5821 tgagtcccag gtcaacaaat tgcgagtgaa gagccgagag gttcacacaa aaatcagtgc 5881 agagtaaaca cacctgcctg atgctatcaa gaggctgaag aaaggcacaa aatgtgctat 5941 ttttggtcac ttgctttatg acgtttattt tcctgttaaa gctgaataaa taaaaactac 6001 agtaaatgta tacatt // LOCUS CDCMANABD 2155 bp ds-DNA BCT 10-AUG-1990 DEFINITION C.saccharolyticum beta-D-mannanase (manA) gene, complete cds. ACCESSION M36063 KEYWORDS beta-D-mannanase. SOURCE C.saccharolyticum DNA, clone lambda-NZP2. ORGANISM Caldocellum saccharolyticum Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 2155) AUTHORS Leuthi,E., Bhana Jasmat,N., Grayling,R.A., Love,D.R. and Bergquist,P.L. TITLE Cloning, sequence analysis and expression in Escherichia coli of a gene coding for a beta-mannanase from the extremely thermophilic bacteria Caldocellum saccharolyticum JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Luethi, 03-JUL-1990. Author address: E.Luethi University of Auckland Department of Cellular and Molecular Biology Private Bag Auckland NEW ZEALAND email:SYSTEM@CELVAX.AUKUNI.AC.NZ FEATURES from to/span description pept 841 1881 beta-D-mannanase precursor (E.C. 3.2.11.78) sigp 841 963 beta-D-mannanase signal peptide matp 964 1878 beta-D-mannanase pept < 1 685 ORF1 (AA at 2) binding 818 830 manA ribosome binding site BASE COUNT 721 a 334 c 590 g 510 t ORIGIN 1 gggtagcaat acatggtttg gattccaggc atggtcgatg cagagggtag cggagtatta 61 ctatgtgaca ggagataaag atgcaggggc actgcttgag aagtgggtaa gctggataaa 121 gagtgtagtg aagttgaata gtgatggtac atttgcgata ccatcgacgc ttgattggag 181 tgggcagcca gacacatgga atgggacata tacaggtaat ccgaacttgc atgtgaaggt 241 agtagattat gggacggatt taggaataac ggcatcactt gcgaatgcac tactttatta 301 cagtgcaggg acgaagaagt atggggtatt tgatgaggaa gcgaagaatt tagcgaagga 361 attgctggac aggatgtgga agttatacag ggatgagaaa ggtttatcgg cgccagagaa 421 gagagcggac tacaagaggt tctttgagca agaggtatac attccggcag gatggacagg 481 gaagatgccg aatggagatg taataaagag cggagttaag tttatagaca taaggagcaa 541 gtacaaacaa gatcctgatt ggccgaagtt agaggcggca tacaagtcag ggcaggtacc 601 ggagttcaga tatcacaggt tctgggcaca gtgtgacata gcaattgtta atgcaacata 661 tgaaattctg ttcggtaatc aataatgagt aggtaaatgg aaatttagcg gggtggcaca 721 tctataagtt tggtgtgctg cctcgctaaa atcctgtatg gaagtgttcg aaaaatagta 781 caaaaaaatg gcgaggtaaa atggcaaaaa tgcataaaaa agagggggtt aggttttaag 841 atgagactaa aaacaaaaat aagaaagaaa tggttgagtg tactatgtac agttgttttt 901 ttattgaaca ttttgtttat agcaaatgta acgattttac ccaaagttgg tgcggctaca 961 tctaatgatg gagtagtgaa gatagatacg agtacgttaa taggaaccaa tcacgcacat 1021 tgctggtaca gagatagact tgatacggca ttgcgtggaa ttaggtcatg gggtatgaac 1081 tctgtgaggg tagtgctgag taatggttac cgatggacga agataccagc aagtgaagtg 1141 gcaaatatta tatcattgtc aagaagtctt ggtttcaaag ctattatatt agaagtgcac 1201 gacacaacag gatatggaga agatggggca gcatgttcat tggcacaagc agtggaatat 1261 tggaaggaga taaagagcgt attagacggt aacgaagatt ttgtaattat aaacattggt 1321 aatgagccgt atgggaacaa taactatcaa aactgggtta atgacacgaa gaacgctata 1381 aaagcactta gagatgcagg attcaagcac acgataatgg tggatgcgcc gaactggggt 1441 caggattggt ctaatactat gagagataat gcccagagca taatggaagc agatccgctg 1501 cgcaatttgg tattttcgat tcatatgtat ggcgtataca atacagcaag caaggtcgaa 1561 gagtacatca aatcatttgt tgataagggg ttaccattgg ttattggaga atttggacat 1621 cagcacacag atggtgaccc tgatgaagaa gctattgtca ggtatgcaaa acagtacaag 1681 ataggattat ttagttggtc gtggtgtgga aattcgagct atgttgggta tttggacatg 1741 gtaaacaact gggaccccaa taatccaact ccatggggac aatggtataa aactaatgca 1801 attggtacat cttctacacc aacacctacg tcgacagtaa cgccaacgcc acccccacgc 1861 caacaccaac accgacagta acagcgacac caacaccaac acctacacct gttagcacac 1921 ctgcgacaag tgggcagata aaggtactgt atgctaacaa ggagacaaac agcactacaa 1981 acacgataag gccgtggttg aaggtagtga atagtggtag cagtagcata gatttgagca 2041 gggtaacgat aaggtactgg tacacggtag atggtgagag ggcacagact ggatatcaga 2101 ctgggcacag ataggagcaa gcaatgtcaa cattcaagtt tgtgaagctg agagt // LOCUS MUSMLHRCA 1431 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse lymph node homing receptor mRNA, complete cds. ACCESSION M36005 KEYWORDS lymph node homing receptor. SOURCE Mouse cell line EL-4/MEL-14hi, cDNA to mRNA, clone Ly-22+(C3H). ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1431) AUTHORS Siegelman,M.H., Cheng,I.C., Weissman,I.L. and Wakeland,E.K. TITLE The mouse lymph node homing receptor is identical with the lymphocyte cell surface marker Ly-22: Role of the EGF domain in endothelial binding JOURNAL Cell 61, 611-622 (1990) STANDARD simple staff_review FEATURES from to/span description pept 52 1170 lymph node homing receptor precursor sigp 52 165 lymph node homing receptor signal peptide matp 166 1167 lymph node homing receptor BASE COUNT 412 a 323 c 362 g 334 t 0 others ORIGIN Chromosome 1. 1 ggtggaggag gctgaggctg cagagagact tgcagagaga cccagcaagc catggtgttt 61 ccatggagat gtgagggtac ttactggggc tcgaggaaca tcctgaagct gtgggtctgg 121 acactgctct gttgtgactt cctgatacac catggaactc actgttggac ttaccattat 181 tctgaaaagc ccatgaactg ggaaaatgct agaaagttct gcaagcaaaa ttacacagat 241 ttagtcgcca tacaaaacaa gagagaaatt gagtatttag agaatacatt gcccaaaagc 301 ccttattact actggatagg aatcaggaaa attgggaaaa tgtggacatg ggtgggaacc 361 aacaaaactc tcactaaaga agcagagaac tggggtgctg gggagcccaa caacaagaag 421 tccaaggagg actgtgtgga gatctatatc aagagggaac gagactctgg gaaatggaac 481 gatgacgcct gtcacaaacg aaaggcagct ctctgctaca cagcctcttg ccagccaggg 541 tcttgcaatg gccgtggaga atgtgtggaa actatcaaca atcacacgtg catctgtgat 601 gcagggtatt acgggcccca gtgtcagtat gtggtccagt gtgagccttt ggaggcccct 661 gagttgggta ccatggactg catccacccc ttgggaaact tcagcttcca gtccaagtgt 721 gctttcaact gttctgaggg aagagagcta cttgggactg cagaaacaca gtgtggagca 781 tctggaaact ggtcatctcc agagccaatc tgccaagtgg tccagtgtga gcctttggag 841 gcccctgagt tgggtaccat ggactgcatc caccccttgg gaaacttcag cttccagtcc 901 aagtgtgctt tcaactgttc tgagggaaga gagctacttg ggactgcaga aacacagtgt 961 ggagcatctg gaaactggtc atctccagag ccaatctgcc aagagacaaa cagaagtttc 1021 tcaaagatca aagaaggtga ctacaacccc ctcttcattc ctgtagccgt catggtcacc 1081 gcattctcgg ggctggcatt tctcatttgg ctggcaaggc ggttaaaaaa aggcaagaaa 1141 tctcaagaaa ggatggatga tccatactga ttcatccttt gtgaaaggaa agccatgaag 1201 tgctaaagac aaaacattgg aaaataacgt caagtcctcc cgtgaagatt ttacacgcag 1261 gcatctccca cattagagat gcagtgtttg ctcaacgaat ctggaaggat ttcttcatga 1321 ccaacagctc ctcctaattt cccctcgctc attcatccca ttaaccctat cccataatgt 1381 gtgtctatac agagtagtat tttatcatct tttctgtgga ggaacaagca a // LOCUS MUSMLHRCB 1431 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse lymph node homing receptor mRNA, complete cds. ACCESSION M36058 KEYWORDS lymph node homing receptor. SOURCE Mouse cell line EL-4/MEL-14hi, cDNA to mRNA, clone Ly-22-(SJL). ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1431) AUTHORS Siegelman,M.H., Cheng,I.C., Weissman,I.L. and Wakeland,E.K. TITLE The mouse lymph node homing receptor is identical with the lymphocyte cell surface marker Ly-22: Role of the EGF domain in endothelial binding JOURNAL Cell 61, 611-622 (1990) STANDARD simple staff_review FEATURES from to/span description pept 52 1170 lymph node homing receptor sigp 52 165 lymph node homing receptor signal peptide matp 166 1167 lymph node homing receptor BASE COUNT 412 a 322 c 362 g 335 t 0 others ORIGIN Chromosome 1. 1 ggtggaggag gctgaggctg cagagagact tgcagagaga cccagcaagc catggtgttt 61 ccatggagat gtgagggtac ttactggggc tcgaggaaca tcctgaagct gtgggtctgg 121 acactgctct gttgtgactt cctgatacac catggaactc actgttggac ttaccattat 181 tctgaaaagc ccatgaactg ggaaaatgct agaaagttct gcaagcaaaa ttacacagat 241 ttagtcgcca tacaaaacaa gagagaaatt gagtatttag agaatacatt gcccaaaagc 301 ccttattact actggatagg aatcaggaaa attgggaaaa tgtggacatg ggtgggaacc 361 aacaaaactc tcactaaaga agcagagaac tggggtgctg gggagcccaa caacaagaag 421 tccaaggagg actgtgtgga gatctatatc aagagggaac gagactctgg gaaatggaac 481 gatgacgcct gtcacaaacg aaaggcagct ctctgctaca cagcctcttg ccagccaggg 541 tcttgcaatg gccgtggaga atgtgtggaa actatcaaca atcacacgtg catctgtgat 601 gcagggtatt acgggcccca gtgtcagtat gtggtccagt gtgagccttt ggaggcccct 661 gagttgggta ccatggactg catccacccc ttgggaaact tcagcttcca gtccaagtgt 721 gctttcaact gttctgaggg aagagagcta cttgggactg cagaaacaca gtgtggagca 781 tctggaaact ggtcatctcc agagccaatc tgccaagtgg tccagtgtga gcctttggag 841 gcccctgagt tgggtaccat ggactgcatc caccccttgg gaaacttcag cttccagtcc 901 aagtgtgctt tcaactgttc tgagggaaga gagctacttg ggactgcaga aacacagtgt 961 ggagcatctg gaaactggtc atctccagag ccaatctgcc aagagacaaa cagaagtttc 1021 tcaaagatca aagaaggtga ctataacccc ctcttcattc ctgtagccgt catggtcacc 1081 gcattctcgg ggctggcatt tctcatttgg ctggcaaggc ggttaaaaaa aggcaagaaa 1141 tctcaagaaa ggatggatga tccatactga ttcatccttt gtgaaaggaa agccatgaag 1201 tgctaaagac aaaacattgg aaaataacgt caagtcctcc cgtgaagatt ttacacgcag 1261 gcatctccca cattagagat gcagtgtttg ctcaacgaat ctggaaggat ttcttcatga 1321 ccaacagctc ctcctaattt cccctcgctc attcatccca ttaaccctat cccataatgt 1381 gtgtctatac agagtagtat tttatcatct tttctgtgga ggaacaagca a // LOCUS RATMHRT1B 562 bp ds-DNA ROD 10-AUG-1990 DEFINITION Rat MHC class II A-beta RT1.B-b-beta gene, partial cds. ACCESSION M36151 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Rat (strain BUF-(RT-b)) DNA. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 562) AUTHORS Figueroa,F., Guenther,E. and Klein,J. TITLE MHC polymorphism pre-dating speciation JOURNAL Nature 335, 265-267 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 562 MHC A-beta RT1.B-b-beta cell surface glycoprotein (AA at 1) BASE COUNT 122 a 171 c 173 g 96 t ORIGIN 1 agagactccc caagggattt cgtgtaccag ttcgagggcc agtgctacta caccaccggg 61 acgcagcgca tgcggctcgt gaccagacac atctacaacc gggaggagta cgtgcgcttc 121 gacagcgacc tgggcgagta ccgcgcgctg accgagctgg ggcggccctc agccgagtac 181 tggaataagc agtacctcga gcagacgcgg gccgagctgg acagggtctg cagatacaac 241 tacgaggggc cgggggctct cacctccctg agacggcttg agcagcccaa tgtggccatc 301 tccctgtcca ggacagaggc ccttaaccac cacaacctgc tggtctgctc agtgacagat 361 ttctacccag cccagatcaa agtgcgctgg ttccggaatg gccaggagga gacgacgggg 421 gtcgtgtcca cacagcttat taggaatggg gactggacct tccagatcct ggtcatgctg 481 gagatcacgc ctcagcgggg agatgtgtac acctgccatg ttgaccaccc cagccttcag 541 agccctgtca cagtggagtg gc // LOCUS WHTMTTGPAA 1743 bp ds-DNA ORG 10-AUG-1990 DEFINITION Wheat mitochondrial Pro-tRNA gene. ACCESSION M36018 KEYWORDS transfer RNA-Pro. SOURCE Wheat mitochondrial DNA, clone H-P2. ORGANISM Mitochondrion Triticum aestivum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Triticum aestivum. REFERENCE 1 (bases 1 to 1743) AUTHORS Joyce,P.B.M., Spencer,D.F. and Gray,M.W. TITLE Multiple sequence rearrangements accompanying the duplication of a Pro-tRNA gene in wheat mitochondrial DNA JOURNAL Plant Mol. Biol. 11, 833-843 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA 831 905 Pro-tRNA BASE COUNT 483 a 396 c 414 g 450 t ORIGIN 1 aagcttcatg aaggaagggt ccaaggacct aagcgacaac ctctacgaaa aagaaaaacc 61 tatatataga tagtggtatg tgttcgctga cactaaatag acttggctcg gggcagaact 121 gagtggctag gctaagttcc ctagtcttct attggcctac ccacccctac gacatggagg 181 cgcttaccct tctccaacaa gcgttccacc acgcggcgat tgattaccgc tttccgtctc 241 aatcagttca gaccaagccc cttctctgat cgggggaagg tcattctaaa acatcagatc 301 tcccattgat gctgatcaaa aataagttta tccacctaga atatatgata ggtaccgaaa 361 gattggatgt catttctaaa caccaggcgg gacgagcagc cctataccac gtgtagccac 421 actctagtgt ccttttctac ttagttggac agatcacttc agaaaatcgt ataaaaatca 481 agcaagaaaa cggatgcgct aacgcgcaac ggctttcgcg ctagttgctc aaaaaatcgt 541 ataaaaatca agcaagaaaa aggttctggc aggctgcgtg ggactgtaaa tcctctttcg 601 ctggctgggc cctttggact cgaaatccaa acggagtgag tggttcgatt ccactctcag 661 aacgagactg aacgaaataa aatgcaagtg aaacgagacg agaatcaaat tgtaggcttc 721 tttcctaaaa gcggtggttc tcgcctcccc gtgcccaaag cggggtgggc gacacagcgc 781 tgcggttctt ttctttatcg atcgggtaga tccatatgtt ctgaggggga cgaggtgtag 841 cgcagtctgg tcagcgcatc tgttttgggt acagagggcc ataggttcga atcctgtcac 901 cttgatgtgg tattcacaca atggggccga agtgcaaagc cccgcagcct atccgtggtc 961 gggaaggcag gggtattaaa agtgaaactt ggcttgcgtg tgcgcgcaca caaaggagac 1021 tgagttggat tgcaaagcct ctcagacgga aggaacctgt tttaaaaacc cctttctaca 1081 atcttgagaa atctcacgca agcaactaca cgccgcaagt cttcggtatt gtcgcaagaa 1141 ttgaaaagat gactccgctt tttctttcga gcatgcaagg cctcagtaga gcgagtctgt 1201 gcatgagtag cgcagtttac aaactagtga atggtcaaat gaataggctc gtttcatcag 1261 gggtaaccaa ggagtaagaa tgatttgata aagtaaacat tccctttgag tgcggttcct 1321 tcagaacaga ctcttccctt ttcagattgc ttctttgggc cacttgagca tcgactgggc 1381 aaggttggat tccctctact tcggggtggc atctatctca catgttttac attgatccat 1441 agccctccgt tagataggaa cagctttgct aaatactgag aactctcgaa tagaatatga 1501 gaagggaaag atcctttaca taaggaacta ttgatcatag gcgtacactc tgaaaagaat 1561 tgaatgaaac tctctgtttc gtcactttga tatcattagc tgtctctgta gctggctctc 1621 atgtctggtg ttccaagtaa aaacttttca actagctggg tcaccgctgt tttaaaccct 1681 gacttggaat cagagccata aacgagtgat tcggtcggac ggacgattga gttcttaaag 1741 ctt // LOCUS YSCRSDSBA 76 bp ss-tRNA PLN 10-AUG-1990 DEFINITION S.cerevisiae Lys-tRNA. ACCESSION M36150 KEYWORDS transfer RNA-Lys. SOURCE S.cerevisiae tRNA, clone alpha-S288c. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 76) AUTHORS Madison,J.T., Boguslawski,S.J. and Teetor,G.H. TITLE Nucleotide sequence of a lysine transfer ribonucleic acid from bakers' yeast JOURNAL Science 176, 687-689 (1972) STANDARD simple staff_review FEATURES from to/span description tRNA 1 76 Lys-tRNA anticdn 34 36 anticodon stt modified 1 1 p modified 9 9 m2a modified 16 16 d modified 17 17 d modified 20 20 d modified 26 26 m22g modified 27 27 p modified 28 28 p modified 37 37 t6a modified 46 46 m7g modified 47 47 d modified 48 48 m5c modified 55 55 p modified 58 58 m1a modified 67 67 p BASE COUNT 15 a 18 c 22 g 16 t 5 others ORIGIN 1 tccttgttag ctcagddggd agagcgttcg gctsttaacc gaaatgdcag gggttcgagc 61 cccctatgag gagcca // LOCUS TNCPOLY 3660 bp ss-RNA VRL 10-AUG-1990 DEFINITION Tobacco necrosis virus RNA dependent RNA polymerase and coat protein genes, complete cds. ACCESSION M33002 KEYWORDS RNA dependent RNA polymerase; coat protein; polymerase. SOURCE Tobacco necrosis virus (strain A, isolate TNV-A-FM1B) Wk6 cell line, cDNA to viral RNA, clones pFM2[0,1,2,3]. ORGANISM Tobacco necrosis virus Viridae; ss-RNA nonenveloped viruses. REFERENCE 1 (bases 1 to 3660) AUTHORS Meulewaeter,F., Seurinck,J. and van Emmelo,J. TITLE Genome structure of tobacco necrosis virus strain A JOURNAL Virology 177, 699-709 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Meulewaeter, 20-MAR-1990, for release after publication. FEATURES from to/span description pept 60 665 RNA dependent RNA polymerase (put.) /transl_except=(666..668) 669 2234 RNA dependent RNA polymerase pept 2218 2436 ORF 3 pept 2440 2610 ORF 4 pept 2613 3443 coat protein pept 3467 3646 ORF 6 site 666 668 readthrough codon BASE COUNT 1034 a 890 c 895 g 841 t ORIGIN 1 agtattcata ccaagaatac caaataggtg caaggcctta ctcagctaaa gagtctaaaa 61 tggagctacc aaaccaacac aagcaaacgg ccgccgaggg tttcgtatct ttcctaaact 121 ggctatgcaa cccatggaga cgacagcgaa cagtcaacgc tgcagttgcg ttccaaaaag 181 atcttctcgc cattgaggat tccgagcatt tggatgacat caatgagtgt ttcgaggagt 241 ctgctggggc acaatctcag cgaactaagg ttgtcgccga cggagcatat gcccccgcaa 301 aatccaacag gacccgccga gttcgtaagc agaagaagca caagtttgta aaatatcttg 361 tcaacgaagc tcgtgccgag tttggattgc ccaaaccaac tgaggcaaac agacttatgg 421 tccaacattt cttgctcaga gtgtgcaagg attggggcgt tgttactgcc cacgtacacg 481 gcaatgttgc actagctttg ccactggtgt tcatcccaac ggaagatgat ctgctatcac 541 gagcattgat gaacacacat gctactagag ccgctgtacg aggcatggac aatgtccaag 601 gggaggggtg gtggaacaat aggttgggga ttgggggcca ggtcggactg gccttccggt 661 ccaaataggg gtgccttgaa aggaggccag gattctccac gtccgtttcg cgtggggaac 721 atcctgatct ggtggtcata ccatcagggc gccctgagaa acagcgtcag ttgttacgct 781 atagtggtat aggcggccat ttattaatcg gcatccacaa caactctctt tccaacctgc 841 gtaggggctt gatggaaaga gtattctatg tcgaggggcc caatgggctt caagacgccc 901 ctaagcccgt caagggagct tttcgaaccc ttgataagtt tcgtgatctc tatactaaaa 961 atagttggcg tcatacccct gtaactagtg aacaattcct aatgaattac acgggcagga 1021 aactgactat ttacagagag gcggttgata gtttgtcgca tcaacccctt agctcacgag 1081 atgcgaaact aaagacattc gtgaaggccg aaaaattaaa tctttctaag aagcctgacc 1141 ctgctcccag ggtcatccaa cctagatcgc ctcggtataa cgtttgtttg ggcaggtacc 1201 tccgacatta tgagcatcac gcgtttaaaa ccattgccaa gtgctttggg gaaatcacgg 1261 tcttcaaagg gtttactctg gagcaacaag gggaaatcat gcgctcgaag tggaataaat 1321 atgttaatcc cgtcgcagtc ggactcgacg ccagtcgttt cgaccaacac gtgtctgttg 1381 aagcactcga gtatgagcat gaattttacc tcagagacta cccaaatgat aaacagctaa 1441 aatggctgct aaagcagcaa ttgtgcaacg taggaacggc attcgccagt gacggcatta 1501 taaaatacaa gaagaagggt tgtagaatga gcggagacat gaacacgagc ttgggcaact 1561 gcattctaat gtgcgccatg gtctacgggt tgaaagaaca cttaaacatc aatttgtccc 1621 ttgcaaataa tggggatgac tgcgtcattg tctgtgagaa agcggattta aagaaattga 1681 caagcagcat cgagccatat ttcaagcagt ttggattcaa gatggaagtg gaaaaacccg 1741 tggatatatt tgagcgcata gaattttgcc aaacccaacc tgtgttcgat ggatcccagt 1801 acatcatggt acgcaaacct tctgtggtaa catctaaaga cgtcactagc cttatcccat 1861 gtcaaacgaa agcacaatac gcagaatggc tgcaagctgt aggtgagtgt ggcatgagca 1921 ttaacggtgg gattcctgtc atgcagaatt tctaccaaaa gctccaaact ggcatccgcc 1981 gcacaaaatt caccaagacc ggcgagttcc agacgaacgg attggggtat cactctagat 2041 atatgcatag agtggcccgg gttccttcgc ctgaaacccg tttatccttc tatctagctt 2101 tcggtatcac accagacctc caagaagcat tggagatctt ctatgatacc cacaagcttg 2161 agttggatga tgttatccca actgatacct accaagtgtc aggagagcat ttgatcaatg 2221 gattaccaaa ctgatgtaac ggaggacaat gtgcaaatac gcggtcgggc taggagcgtt 2281 gagggtaaga aacacaatgg ttcgggatta actggcgtta agcgtcacgc ggtgagcgaa 2341 acatctcaga aatcacagca aggtactggc aatggaacta tgaccaatat agccgaagaa 2401 cagaccatta ccgtgacata caactttaac ttttaagtta tggctgcgtg tcgctgttgt 2461 gatacttcac caggtattac actattccct tactttgcaa ttctcatcct tatattggca 2521 atacttgttg tagggactcc caatcaacaa tatcaccatt ctccaagcac ttacgagtac 2581 aagactcaac acatttcgat cgcaaaatag acatggcagg aaagaagaac aacaacaacg 2641 gtcagtatat aatactgcgt actccagagc aacaggtgga gatagaccag cgcaacgccc 2701 gtcgtgctca aatgggtcgc atgaagaagg ctagacagcc cgttcagcga tacttacagc 2761 aacacgggtt gcgaaacgga ttgtccggta gagggggcta catagtggct cccacctccg 2821 ggggggttgt cactcgaccc atagtgccga aattctccaa caggggagat tccactatag 2881 tccgtaacac tgagattttg aacaaccaaa tcttagcggc gctaggcgca ttcaatacaa 2941 caaactccgc actgattgca gcagcaccat catggctggc tagcatcgct gatctttaca 3001 gtaaatacag atggctctca tgtgagatca tctacattcc aaaatgcccc accaccacca 3061 gtggatcaat tgccatggct ttcacatacg acagaaatga cgctgcaccc accgcaaggg 3121 ctcagctgtc acaatcttac aaggccatca attttccacc gtatgcggga tacgacggag 3181 cagcatattt gaattcgaac cagggagctg ggtcagccat cgccgttcaa cttgatgtta 3241 ccaagttgga caagccatgg taccccacta tctcctctgc cggcttcggg gcgctcagcg 3301 tcctcgatca gaaccaattc tgccccgcgt cccttgtggt cgctagcgat gggggacccg 3361 ctactgctac tccagcaggg gaccttttca tcaagtacgt gattgagttc attgaaccaa 3421 tcaacccaac aatgaacgtc tagttctttg tactgtaact tggctaatgc ctaaggtgga 3481 gtcacaccat tggagacgga gacggatcct gggaaacagg cttgacgggc ggggggtggt 3541 gcccccgacg acgcatcact ccggatacca atggtacacc actatggcag ggtctgccaa 3601 ggtcttgtgc accaagaacc cctggaaacg ggggggaggg gggtagcaca tatcatccag // LOCUS BMORGA1 120 bp ds-DNA INV 10-AUG-1990 DEFINITION B.mori 5.8S ribosomal RNA upstream region. ACCESSION M36169 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SEGMENT 1 of 2 SOURCE B.mori DNA. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 120) AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and Ishikawa,H. TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and its presumed ancestor JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987) STANDARD simple staff_review BASE COUNT 34 a 15 c 20 g 51 t ORIGIN 1 cgaaacaaaa actcacagac gttttagttt tcttaacgtg tgtgtgcgtg cgttgtacca 61 ccgtgtttta tatgtttatt attatatatt ttgagattcg aatattataa tatttatgta // LOCUS BMORGA2 354 bp ds-DNA INV 10-AUG-1990 DEFINITION B.mori 5.8S ribosomal RNA. ACCESSION M36302 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SEGMENT 2 of 2 SOURCE B.mori DNA. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 354) AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and Ishikawa,H. TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and its presumed ancestor JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 139 305 5.8S rRNA BASE COUNT 103 a 74 c 74 g 103 t ORIGIN About 50 bp after segment 1. 1 gctcgtgtct ctctcgtcgt cgttgacggt ttcgtaaatt aacacaaaat acattttatt 61 ttatttattg tttttttttc gacaagaaaa aaaaaaaaga aattttttat ttttatttaa 121 acattacaga aaaaacgaaa atcattaccc tggacggtgg atcacttggc tcgcgggtcg 181 atgaagaacg cagttaactg cgcgtcatag tgtgaactgc aggacacatt tgaacatcga 241 catttcgaac gcacattgcg gtccgtggag acacatccag gaccactcct gtctgagggc 301 cggctgcata aaaaacatta atcacattgc gtggctcgtg gcgcgctcgt gatc // LOCUS BMORGB1 120 bp ds-DNA INV 10-AUG-1990 DEFINITION B.mandarina 5.8S ribosomal RNA upstream region. ACCESSION M36170 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SEGMENT 1 of 2 SOURCE B.mandarina DNA. ORGANISM Bombyx mandarina Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 120) AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and Ishikawa,H. TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and its presumed ancestor JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987) STANDARD simple staff_review BASE COUNT 34 a 15 c 20 g 51 t ORIGIN 1 cgaaacaaaa actcacagac gttttagttt tcttaacgtg tgtgtgcgtg cgttgtacca 61 ccgtgtttta tatgtttatt attatatatt ttgagattcg aatattataa tatttatgta // LOCUS BMORGB2 138 bp ds-DNA INV 10-AUG-1990 DEFINITION B.mandarina 5.8S ribosomal RNA. ACCESSION M36303 KEYWORDS 5.8S ribosomal RNA; ribosomal RNA. SEGMENT 2 of 2 SOURCE B.mandarina DNA. ORGANISM Bombyx mandarina Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 138) AUTHORS Fujiwara,H., Maekawa,H., Ninaki,O., Takada,N., Miyajima,N. and Ishikawa,H. TITLE Comparative studies on the rDNA of the silkworm, Bombyx mori and its presumed ancestor JOURNAL Comp. Biochem. Physiol. 88, 761-766 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 89 5.8S rRNA BASE COUNT 35 a 39 c 34 g 30 t ORIGIN About 50 bp after segment 1. 1 ctgcaggaca catttgaaca tcgacatttc gaacgcacat tgcggtccgt ggagacacat 61 ccaagaccac tcctgtctga gggccggctg cataaaaaac attaatcaca ttgcgtggct 121 cgtggcgcgc tcgtgatc // LOCUS BOVINTERA 1003 bp ss-mRNA MAM 10-AUG-1990 DEFINITION Bovine interleukin-1 alpha mRNA, complete cds. ACCESSION M36182 KEYWORDS interleukin-1 alpha. SOURCE Bovine macrophage, cDNA to mRNA. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1003) AUTHORS Leong,S.R., Flaggs,G.M., Lawman,M. and Gray,P.W. TITLE The nucleotide sequence for the cDNA of bovine interleukin-1 alpha JOURNAL Nucleic Acids Res. 16, 9053-9053 (1988) STANDARD simple staff_review FEATURES from to/span description pept 33 839 interleukin-1 alpha precursor sigp 33 368 interleukin-1 alpha signal peptide matp 369 836 interleukin-1 alpha mRNA < 1 1003 interleukin-1 alpha mRNA BASE COUNT 313 a 218 c 215 g 257 t ORIGIN 1 tgttgctagc tcggttcagc aaagaagtga agatggccaa agtccctgac ctctttgaag 61 acctgaagaa ctgttacagt gaaaatgaag actacagttc tgaaattgac cacctctctc 121 tcaatcagaa gtccttctat gatgcaagct atgagccact tcgtgaggac cagatgaata 181 agtttatgtc cctggatacc tcggaaacct ctaagacatc caagcttagc ttcaaggaga 241 atgtggtgat ggtggcagcc agtgggaaga ttctgaagaa gagacggttg agtttaaatc 301 agttcatcac cgatgatgac ctggaagcca ttgccaataa tacagaagaa gaaatcatca 361 agcccagatc agcacattac agcttccaga gtaacgtgaa atacaacttt atgagagtca 421 tccaccagga atgcatcctg aacgacgccc tcaatcaaag tataattcga gatatgtcag 481 gtccatacct gacggctact acattaaata atctggagga ggcagtgaaa tttgacatgg 541 ttgcttatgt atcagaagag gattctcagc ttcctgtgac tctaagaatc tcaaaaactc 601 aactgtttgt gagtgctcaa aatgaagacg aacccgtctt gctaaaggag atgcctgaga 661 cacccaaaat catcaaagat gagaccaacc tcctcttctt ctgggaaaag catggctcta 721 tggactactt caaatcagtt gcccatccaa agttgtttat tgccacaaag caagaaaaat 781 tggtgcacat ggcaagtggg ccgccctcga tcactgactt tcagatattg gaaaaatagc 841 cttgactgtg cactctactt acttgtaaag tggtgaccat ccgtatgtac tatgtacatg 901 aaggagtcga gcccttcact gttagtcact cgctgagcat gtgctgagct tttgtaattc 961 taaatgaatg tttactctct ttgtaagaga gaacacaaag tcc // LOCUS CHKCALMA 450 bp ss-mRNA VRT 10-AUG-1990 DEFINITION Chicken calmodulin mRNA, complete cds. ACCESSION M36167 KEYWORDS calmodulin. SOURCE Chicken, cDNA to mRNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 450) AUTHORS Iida,Y. TITLE cDNA sequences and molecular evolution of calmodulin genes of chicken and eel JOURNAL Bull. Chem. Soc. Jpn. 57, 2667-2668 (1984) STANDARD simple staff_review FEATURES from to/span description pept 1 450 calmodulin BASE COUNT 160 a 62 c 122 g 106 t ORIGIN 1 atggctgatc aactgacaga agagcagatt gcagaattca aagaagcttt ttcactattt 61 gacaaggatg gtgatggtac tataactaca aaggagttgg ggactgtgat gagatcactt 121 ggtcagaacc ccacagaagc agaattacag gacatgatca atgaagtaga cgctgatggc 181 aatggcacaa ttgacttccc agagtttctg acaatgatgg caagaaaaat gaaagataca 241 gatagcgaag aagaaattag agaagcgttc cgtgtgtttg acaaggatgg taatggttac 301 attagtgctg cagaacttcg tcatgtgatg acaaatcttg gggagaagct aacagatgaa 361 gaagttgatg aaatgattag ggaagcagac attgatggtg atggtcaagt aaactatgaa 421 gagtttgtac agatgatgac agcgaagtga // LOCUS FSBCALMA 450 bp ss-mRNA VRT 10-AUG-1990 DEFINITION E.electricus calmodulin mRNA, complete cds. ACCESSION M36168 KEYWORDS calmodulin. SOURCE E.electricus, cDNA to mRNA. ORGANISM Electrophorus electricus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Cypriniformes; Characoidei; Electrophoridae. REFERENCE 1 (bases 1 to 450) AUTHORS Iida,Y. TITLE cDNA sequences and molecular evolution of calmodulin genes of chicken and eel JOURNAL Bull. Chem. Soc. Jpn. 57, 2667-2668 (1984) STANDARD simple staff_review FEATURES from to/span description pept 1 450 calmodulin BASE COUNT 133 a 96 c 137 g 84 t ORIGIN 1 atggcagatc agctgactga ggaacagatt gctgagttca aggaggcgtt ttccctcttt 61 gacaaagatg gtgacggcac catcaccacc aaagagctgg gtactgtgat gcgctctctg 121 ggccaaaacc ccaccgaggc agagctgcag gacatgatca atgaagtgga tgctgacggc 181 aatggaacaa tagacttccc ggagttcctg accatgatgg ccaagaaaat gaaggacaca 241 gacagtgaag aagagatccg agaagccttc cgagtttttg acaaggacgg taatggctac 301 atcagtgcag ccgagttgcg acatgtcatg actaacttgg gcgagaagct gacggacgag 361 gaggtggatg agatgatccg agaggccgac atcgatggcg acggccaggt gaactatgaa 421 gagttcgtgc aaatgatgac tgcaaagtga // LOCUS HALTRL1L 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-1. ACCESSION M35741 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Leu-tRNA anticdn 37 39 Leu-tRNA anticodon gag modified 29 29 m22g modified 40 40 m1g modified 42 42 m5c modified 59 59 m5c modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm modified 69 69 m1i BASE COUNT 16 a 29 c 29 g 13 t 1 others ORIGIN 5' end of mature tRNA. 1 gcgtgggtag ccaanccagg ccaacggcgc agcgttgagg gcgctgtcct gtagaggtcc 61 gccggttcaa atccggtccc acgcacca // LOCUS HALTRL2L 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-2. ACCESSION M35742 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Leu-tRNA anticdn 37 39 Leu-tRNA anticodon cag modified 29 29 m22g modified 40 40 m1g modified 60 60 m5c modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm modified 69 69 m1i BASE COUNT 18 a 28 c 25 g 16 t 1 others ORIGIN 5' end of mature tRNA. 1 gcagggatag ccaantctgg ccaacggcgc agcgttcagg gcgctgtctc ataggagtcc 61 gcaggttcaa atcctgctcc ctgcacca // LOCUS HALTRL3L 87 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-3. ACCESSION M35743 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 87) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 87 Leu-tRNA anticdn 36 38 Leu-tRNA anticodon caa modified 28 28 m1i modified 39 39 m1g modified 41 41 p modified 59 59 m5c modified 64 64 m1g modified 65 65 p modified 66 66 p modified 67 67 cm modified 68 68 m1i BASE COUNT 20 a 27 c 25 g 14 t 1 others ORIGIN 5' end of mature tRNA. 1 gcgagggtag ctaantcagg aaaaagcggc ggactcaaga tccgctcccg taggggtccg 61 tgggttcaaa tccctcccct cgcacca // LOCUS HALTRL4L 86 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-4. ACCESSION M35744 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 86) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 86 Leu-tRNA anticdn 35 37 Leu-tRNA anticodon tag modified 27 27 m22g modified 38 38 m1g modified 48 48 m5g modified 53 53 m1g modified 54 54 p modified 55 55 p modified 56 56 cm BASE COUNT 16 a 27 c 28 g 14 t 1 others ORIGIN 5' end of mature tRNA. 1 gcgcgggtag ccaantggcc aaaggcgcag cgcttaggac gctgtggtgt agaccttcgc 61 aggttcgaac cctgtcccgc gcacca // LOCUS HALTRL5L 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Leu-tRNA-5. ACCESSION M35745 KEYWORDS transfer RNA; transfer RNA-Leu. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Leu-tRNA anticdn 37 39 Leu-tRNA anticodon taa modified 29 29 m2g modified 40 40 m1g modified 42 42 p modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm BASE COUNT 16 a 28 c 29 g 14 t 1 others ORIGIN 5' end of mature tRNA. 1 gcgggggtgg ctganccagg ccaaaagcgg cggacttaag atccgctccc gtaggggttc 61 gcgagttcga atctcgtccc ccgcacca // LOCUS HALTRS1S 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Ser-tRNA-1. ACCESSION M35746 KEYWORDS transfer RNA; transfer RNA-Ser. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Ser-tRNA anticdn 37 39 Ser-tRNA anticodon gct modified 29 29 m22g modified 40 40 t6a modified 60 60 m5c modified 61 61 m5c modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm BASE COUNT 15 a 31 c 27 g 14 t 1 others ORIGIN 5' end of mature tRNA. 1 gttgcggtag ccaancctgg cccaaggcgc tgggttgcta actcagtggc gtcaagcccc 61 cggggttcga atccccgccg caacgcca // LOCUS HALTRS2S 88 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Ser-tRNA-2. ACCESSION M35747 KEYWORDS transfer RNA; transfer RNA-Ser. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 88) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 88 Ser-tRNA anticdn 36 38 Ser-tRNA anticodon cga modified 13 13 p modified 28 28 m22g modified 36 36 ac4c modified 41 41 p modified 60 60 m5c modified 65 65 m1g modified 66 66 p modified 67 67 p modified 68 68 cm modified 69 69 m1i BASE COUNT 20 a 28 c 22 g 17 t 1 others ORIGIN 5' end of mature tRNA. 1 gccgaggtag cctancccgg ccaaggcggt agattcgaaa tctactgtcc attcggacac 61 gtgagttcaa atctcaccct cggcgcca // LOCUS HALTRS3S 84 bp ss-tRNA RNA 10-AUG-1990 DEFINITION H.volcanii Ser-tRNA-3. ACCESSION M35748 KEYWORDS transfer RNA; transfer RNA-Ser. SOURCE H.volcanii tRNA. ORGANISM Halobacterium volcanii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Halobacteriales; Halobacteriaceae. REFERENCE 1 (bases 1 to 84) AUTHORS Gupta,R. TITLE Transfer RNAs of Halobacterium volcanii: Sequences of five leucine and three serine tRNAs JOURNAL Syst. Appl. Microbiol. 7, 102-105 (1986) STANDARD simple staff_review FEATURES from to/span description tRNA 1 84 Ser-tRNA anticdn 33 35 Ser-tRNA anticodon gga modified 25 25 m22g modified 56 56 m5c modified 61 61 m1g modified 62 62 p modified 63 63 p modified 64 64 cm modified 65 65 m1i BASE COUNT 14 a 26 c 28 g 15 t 1 others ORIGIN 5' end of mature tRNA. 1 gccaggatgg ccgancggta aggcgcacgc ctggaaagcg tgttccctct gggatcgggg 61 gttcaaatcc ctctcctggc gcca // LOCUS HUMIGHVY 920 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human Ig germline H-chain gene D5-region. ACCESSION M35647 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Human placental DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 920) AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T. TITLE Human immunoglobulin D segments: Isolation of a new D segment and polymorphic deletion of the D-1 segment JOURNAL Immunol. Lett. 17, 329-334 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 604 / 633 Ig H-chain D-region (AA at 604) signal 456 462 heptamer recombination signal signal 488 497 nonamer recombination signal signal 575 583 nonamer recombination signal signal 596 602 heptamer recombination signal signal 634 640 heptamer recombination signal signal 654 662 nonamer recombination signal BASE COUNT 256 a 269 c 255 g 140 t ORIGIN 1 ggatccaggc agcaacatca cgggagaatg cagggctccc agacatccca accctcttgc 61 aggcctctcc tgggaagaga ctgcatccac caccaaacag catggagccc gctgcatagt 121 aactgagtca gtgactgacc tggagggcag gggagcagtg agccggagcc cagaccatag 181 ggaccgagac cagccgctga catcccgagc ccctcactgg tggccccaga aaatatggaa 241 acaggacgga cccacattcc cagctggacc agggcggaga ctgctgagcc ccgagcacca 301 gccccaagaa acaccaggca aaatcatcag aggaggctcc tgagaaagag aggaggggag 361 gtctccctca ccagcaaatg cttcccttaa ccaaacacag ggtccacgca actcccccca 421 ggacaaagga ggagccccct gtacagcact gggctcagag tcctctccaa gacaccctga 481 gtttcagaca aaaaccccct ggaaatcaca gtatcagcag gagaactggc cagagacagc 541 aaggggggac tccgtgacac ccgtggggac aggaggattt tgtgggggct tgtgtcactg 601 tgagaatatt gtaatagtac tactttctat gcccacagtg acacagcccc agtcccaaag 661 ccctgctgta aatgcttcca cttctggagc tgagggggag cgtctgggaa gtagggccta 721 gggtgtccac caatgccaaa acacaccaga cttcccccag acatcacccc caccagccag 781 cgagcagagt aaacagaaaa tgagaagcaa gctgggaact gctggcacag gccccaagga 841 aagagctggg gttgtgcaag agaagatgca ggcagagcta gcagggcctt ttactgtttc 901 tgctttctgg cagagaattc // LOCUS HUMIGHVZ 688 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human Ig germline H-chain gene D4-region. ACCESSION M35648 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Human peripheral blood cell DNA, clone D31. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 688) AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T. TITLE Human immunoglobulin D segments: Isolation of a new D segment and polymorphic deletion of the D-1 segment JOURNAL Immunol. Lett. 17, 329-334 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 70 / 100 Ig H-chain D5-region (AA at 71) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" signal 42 50 nonamer recombination signal signal 63 69 heptamer recombination signal signal 101 107 heptamer recombination signal signal 121 129 nonamer recombination signal BASE COUNT 185 a 160 c 209 g 134 t ORIGIN 1 tcagcaggag aactagccag agacagcaag aggggacagg aggattttgt gggggctcgt 61 gtcactgtgt ggatattgta gtagtaccag ctgctatgcc cacagtgaca cagccccatt 121 cccaaagccc tgctgtaaac gcttccactt gctggagctg aggggctggg gggagcgtct 181 gggaagtagg cctaggtggc catcaatgcc caaaacgcac cagactcccc cccagacatc 241 accccactgg ccagtgagca gagtaaacag aaaatgagaa gcagctggga agcttccaca 301 ggcccaagga aagagctttg gcgggtgtgt aagaggggat gcgggcagag cgagcagggc 361 cttttgctgt ttctgctttc ctgtgcagag agttcagcag ctggtgttca agatcgatgg 421 ctgggagtga gccgagacgg acagtgtggg aagggcacag ggaaggagaa gcagccgcta 481 tcctacactg tcatctttca agagtttgcc ctgtgcccac aatgctgcat catgggatcg 541 ttaacactga tgtagacaca gctaaagaga gaatcagtga atggatttgc agcacagatc 601 tgaataaatt ctcagaatgt ggagccacac agaagcaagc acacggaaag tggcctgatc 661 gaatttcaag tacagtggca cttcagct // LOCUS HUMIGHWA 690 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human Ig germline H-chain gene D4-region. ACCESSION M35649 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Human peripheral blood cell DNA, deletion clone D30. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 690) AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Hongo,T. TITLE Human immunoglobulin D segments: Isolation of a new D segment and polymorphic deletion of the D-1 segment JOURNAL Immunol. Lett. 17, 329-334 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 70 / 100 Ig H-chain D5-region (AA at 71) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" signal 42 50 nonamer recombination signal signal 62 68 heptamer recombination signal signal 101 107 heptamer recombination signal signal 121 129 nonamer recombination signal BASE COUNT 186 a 160 c 212 g 132 t ORIGIN 1 tcagcaggag aactagccag agacagcaag aggggacagg aggattttgt gggggctcgt 61 gtcactgtgt ggatattgta gtagtaccag ctgctatgcc cacagtgaca cagccccatt 121 cccaaagccc tgctgtaaac gcttccactt gctggagctg aggggctggg gggagcgtct 181 gggaagtagg cctaggtggc catcaatgcc caaaacgcac cagactcccc cccagacatc 241 accccactgg ccagtgagca gagtaaacag aaaatgagaa gcagctggga agcttccaca 301 ggcccaagga aagagctttg gcgggtgtgt aagaggggat gcgggcagag cgagcagggc 361 cttttgctgt ttctgctttc ctgtgcagag agttcagcag ctggtgttcg agatcaatgg 421 ctgggagtga gccgagcagg acagcgtggg aagagcacag ggaaggagga gcagccgcta 481 tcctacactg tcatctttcg aaagtttgcc ttgtgcccac actgctgcat catgggatcg 541 ttaacactga tgtagacaca gctaaagaga gaatcatgga atggatttgc agcacagatc 601 tgaataaatt ctcagaatgt ggagcagcac agaagcaagc acacagaaag tggcctgatg 661 caaggacaag ttcagtggca ctagcatgct // LOCUS HUMIGHWB 681 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human Ig germline H-chain gene D1-region. ACCESSION M35650 KEYWORDS diversity exon; germline; immunoglobulin heavy chain. SOURCE Human peripheral blood cell DNA, clone D31. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 681) AUTHORS Zong,S.Q., Nakai,S., Matsuda,F., Lee,K.H. and Honjo,T. TITLE Human immunoglobulin D segments: Isolation of a new D segment and polymorphic deletion of the D-1 segment JOURNAL Immunol. Lett. 17, 329-334 (1988) STANDARD simple staff_review FEATURES from to/span description pept / 69 / 99 Ig H-chain D5-region (AA at 70) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" signal 41 49 nonamer recombination signal signal 61 67 heptamer recombination signal signal 100 106 heptamer recombination signal signal 120 128 nonamer recombination signal BASE COUNT 187 a 156 c 203 g 135 t ORIGIN 1 ccagcaagat cgggctccag tgacaccgag acaggacgga ggattttgtg ggggctcgtg 61 tcactgtgag gatattgtac taatggtgta tgctataccc acagtgacac agccccattc 121 ccaaagccct actgcaaaga ttccacttgc tgggctgagg ggctggggag cgtctgggaa 181 atagggctag ggaccagcaa tgcccaaaac gcagattccg tcccatacat caccccattc 241 accagtcagc agagtaaaca gaaaatgaga agcagctggg aagcttccac aggcccaagg 301 aaagagcttt ggcgggtgtg taagagggga tgcgggcaga gcgagcaggg ccttttgctg 361 tttctgcttt cctgtgcaga gagttcagca gctggtgttc gagatcaatg gctgggagtg 421 agccgagcag gacagcgtgg gaagagcaca gggaaggagg agcagccgct atcctacact 481 gtcatctttc gaaagtttgc cttgtgccca cactgctgca tcatgggatc gttaacactg 541 atgtagacac agctaaagag agaatcatgg aatggatttg cagcacagat ctgaataaat 601 tctcagaatg tggagcagca cagaagcaag cacacagaaa gtggcctgat gcaaggacaa 661 gttcagtggc actagcatgc t // LOCUS HUMTCAGVR 508 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human T-cell receptor germline gamma-chain gene V4-region (V4; subgroup I), exons 1 and 2. ACCESSION M36285 KEYWORDS T-cell antigen receptor; T-cell receptor gamma-chain; germline; variable region. SOURCE Human EBV-transformed lymphoblastoid cell line DNA, clone 601. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 508) AUTHORS Font,M.P., Chen,Z., Bories,J.C., Duparc,N., Loiseau,P., Degos,L., Cann,H., Cohen,D., Dausset,J. and Sigaux,F. TITLE The V-gamma locus of the human T cell receptor gamma gene JOURNAL J. Exp. Med. 168, 1383-1394 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1 43 T-cell receptor gamma-chain (V4), exon 1 /hgml_locus_uid="LE0226F" /nomgen="TCRGV4" /map="7p15" 159 > 508 T-cell receptor gamma-chain (V4), exon 2 IVS 44 158 T-cell receptor gamma-chain (V4) intron A BASE COUNT 128 a 119 c 122 g 139 t ORIGIN 1 atgcagtggg ccctagcggt gcttctagct ttcctgtctc ctggtgagtg cgctgcctac 61 agagaggatc atgggttttg ttttctttat tttcttcttt tgcaaggatt gccatactaa 121 ggaattcctc attatatttt gtgttgttcc cattgcagcc agtcagaaat cttccaactt 181 ggaagggaga acgaagtcag tcatcaggca gactgggtca tctgctgaaa tcacttgtga 241 tcttgctgaa ggaagtaccg gctacatcca ctggtaccta caccaggagg ggaaggcccc 301 acagcgtctt ctgtactatg actcctacac ctccagcgtt gtgttggaat caggaatcag 361 cccagggaag tatgatactt acggaagcac aaggaagaac ttgagaatga tactgcgaaa 421 tcttattgaa aatgactctg gagtctatta ctgtgccacc tgggatgggc acagtgattc 481 agatccgccc tacaccacac tgaaaacc // LOCUS HUMTCAGVS 509 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human T-cell receptor germline gamma-chain gene V5-region (V5; subgroup I), exons 1 and 2. ACCESSION M36286 KEYWORDS T-cell antigen receptor; T-cell receptor gamma-chain; germline; variable region. SOURCE Human EBV-transformed lymphoblastoid cell line DNA, clone 601. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 509) AUTHORS Font,M.P., Chen,Z., Bories,J.C., Duparc,N., Loiseau,P., Degos,L., Cann,H., Cohen,D., Dausset,J. and Sigaux,F. TITLE The V-gamma locus of the human T cell receptor gamma gene JOURNAL J. Exp. Med. 168, 1383-1394 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1 43 T-cell receptor gamma-chain (V5), exon 1 /hgml_locus_uid="LF0226G" /nomgen="TCRGV5" /map="7p15" 160 > 508 T-cell receptor gamma-chain (V5), exon 2 IVS 44 159 T-cell receptor gamma-chain (V5) intron A BASE COUNT 131 a 117 c 122 g 139 t ORIGIN 1 atgcggtggg ccctactggt gcttctagct ttcctgtctc ctggtgagta cgctgcctac 61 agagaggctc acaggttggg ttttgttttg tttacttctt ttgaaagggg tgcgatacaa 121 aggaatacct cattaaattt tgtgttgttc ccattgcagc cagtcagaaa tcttccaact 181 tggaaggggg aacgaagtca gtcacgaggc cgactaggtc atctgctgaa atcacttgtg 241 accttactgt aataaatgcc ttctacatcc actggtacct acaccaggag gggaaggccc 301 cacagcgtct tctgtactat gacgtctcca actcaaagga tgtgttggaa tcaggactca 361 gtccaggaaa gtattatact catacaccca ggaggtggag ctggatattg atactacgaa 421 atctaattga aaatgattct ggggtctatt actgtgccac ctgggacagg cacagtgatt 481 cagacctgtc ctacaccaca ctgaaaatc // LOCUS MUSNM23A 664 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse tumor metastatic process-associated protein (NM23) gene mRNA, 3' end. ACCESSION M35970 KEYWORDS tumor metastatic process-associated protein. SOURCE Mouse cell line K-1735, cDNA to mRNA, clone pNM23. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 664) AUTHORS Steeg,P.S., Bevilacqua,G., Kopper,L., Thorgeirsson,U.P., Talmadge,J.E., Liotta,L.A. and Sobel,M.E. TITLE Evidence for a novel gene associated with low tumor metastatic potential JOURNAL J Natl Cancer Inst 80, 200-204 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 501 tumor metastatic process-associated protein NM23 (AA at 1) mRNA < 1 664 NM23 mRNA BASE COUNT 150 a 154 c 201 g 159 t ORIGIN 1 cagtcgcagc cggcggtaaa gccttgtcat ctgaagggga ccatggccaa cagtgagcgt 61 accttcattg ccatcaagcc tgatggggtc cagcgggggc tggtgggcga gatcatcaag 121 cggttcgagc agaaggggtt ccgccttgtt ggtctgaagt ttctgcaggc ttcagaggac 181 cttctcaagg agcactacac tgacctgaag gaccgcccct tctttactgg cctggtgaaa 241 tacatgcact caggaccagt ggttgctatg gtctgggagg gtctgaatgt ggtgaagaca 301 ggccgcgtga tgcttggaga gaccaacccc gcagactcta agcctgggac catacgagga 361 gacttctgca ttcaagttgg caggaacatc attcatggca gcgattctgt aaagagcgca 421 gagaaggaga tcagcttgtg gtttcagcct gaggagctgg tggagtacaa gagctgtgcg 481 cagaactgga tctatgagtg ataggacggt gccggttttc tacctgctta ctcttgttct 541 cacaggcagg ggaccagcaa ccctagatat ttctggaact tctttgacct ggaaggaacc 601 tttgggagcc gtgactccct gtgcagtgtt acgtgccact gttagattaa agtgtttaat 661 ctgt // LOCUS SYNHGHOA 582 bp ds-DNA SYN 10-AUG-1990 DEFINITION Synthetic human growth hormone gene, complete cds. ACCESSION M36282 KEYWORDS growth hormone. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 582) AUTHORS Ohtsuka,E., Taniyama,Y., Iwai,S., Kitano,K., Miyamoto,S., Ohgi,T., Sakuragawa,Y., Fujiyama,K., Ikari,T., Kobayashi,M., Miyake,T., Shibahara,S., Tokunaga,T., Ikehara,M., Ono,A., Ueda,T., Tanaka,T., Baba,H., Miki,T., Sakurai,A. and Oishi,T. TITLE The synthesis of human growth hormone gene JOURNAL Nucleic Acids Symp Ser 12, 79-82 (1983) STANDARD simple staff_review FEATURES from to/span description pept 1 579 growth hormone BASE COUNT 146 a 153 c 129 g 154 t ORIGIN 1 atgttcccaa ctattccact gagtcgcctg ttcgataacg cgatgctgcg tgcgcatcgt 61 ctgcaccaac tggctttcga cacttaccag gagttcgaag aagcatacat cccgaaagaa 121 cagaaataca gcttccttca gaacccacag acctcgttgt gtttctctga aagtatcccg 181 accccttcta accgcgaaga gacccagcag aaatcgaacc ttgaactgct tcgtatctcg 241 ctgcttctca ttcagtcgtg gctggagcca gtacagttcc tgcgttcggt tttcgcaaac 301 tcactggtat acggtgcgtc tgacagtaac gtttacgacc tgctgaaaga ccttgaagaa 361 gggatccaga ccctgatggg tcgcctggaa gatggttcac cacgcactgg tcagatcttc 421 aaacagactt actccaaatt cgatactaac tctcataacg atgatgctct gctgaaaaac 481 tacggcctgc tgtactgttt ccgtaaagat atggataaag ttgaaacttt cctgcgtatc 541 gttcagtgtc gttctgttga agggtcgtgt ggcttctaat ag // LOCUS AC2TFDC 803 bp ds-DNA BCT 10-AUG-1990 DEFINITION Plasmid pAC27 (from Pseudomonas putida) pyrocatechase (tfdC) gene, complete cds. ACCESSION M36279 KEYWORDS pyrocatechase. SOURCE Plasmid pAC27(from Pseudomonas putida, strain AC867) DNA. ORGANISM Plasmid pAC27 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 803) AUTHORS Ghosal,D. and You,I.-S. TITLE Nucleotide homology and organization of chlorocatechol oxidation genes of plasmids pJP4 and pAC27 JOURNAL Mol. Gen. Genet. 211, 113-120 (1988) STANDARD simple staff_review FEATURES from to/span description pept 21 803 pyrocatechase (tfdC) BASE COUNT 174 a 240 c 250 g 139 t ORIGIN 1 atcttgaaag gagacgagtc atggataaac gagttgccga ggtcgcaggc gcgatcgtcg 61 aggcagtacg caaaattttg ctggacaagc gcgtcacgga agccgaatac cgcgcgggtg 121 tcgactatct caccgaggtc gcacagacgc gggaaaccgc gctgcttctg gacgttttcc 181 tgaacagcac catcatcgaa ggcaaggcgc agcgctcgcg gacctctgcg cctgcgatcc 241 aggggccgta cttcctggaa ggtgctcctg tagttgaagg cgtcctcaag acctacgata 301 ccgacgacca caaaccgctg atcattcgcg gtacggtgcg ctcggacacg ggcgagttgc 361 tcgctggcgc tgtcatcgac gtgtggcact cgacgcctga tggcttgtac agcgggatcc 421 acgacaacat ccccgtggac tactaccgcg gaaaactcgt gacggattcc cagggcaact 481 atcgcgtgcg caccacgatg ccagtgccat accagatccc ctacgagggg ccgactgggc 541 gtctgctggg ccacctgggc agccatacct ggcgtccgcc gcacgtgcac ttcaaggtgc 601 gcaaggacgg tttcgaaccg ttgaccacgc aatactactt cgaagggggc aaatgggtgg 661 acgatgactg ctgtcacggc gtcacccccg acctgattac gcccgagacg atcgaggacg 721 gggtgcgggt catgaccctg gacttcgtaa tcgagcgtga gcaggccgag caacgcaagt 781 cggctacgga gacagtggca tga // LOCUS AFATFDC 788 bp ds-DNA BCT 10-AUG-1990 DEFINITION Plasmid pJP4 (from Alcaligenes eutrophus) pyrocatechase (tfdC) gene, complete cds. ACCESSION M36280 KEYWORDS pyrocatechase. SOURCE Plasmid pJP4 (from Alcaligenes eutrophus, strain JMP134) DNA. ORGANISM Plasmid pJP4 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 788) AUTHORS Ghosal,D. and You,I.-S. TITLE Nucleotide homology and organization of chlorocatechol oxidation genes of plasmids pJP4 and pAC27 JOURNAL Mol. Gen. Genet. 211, 113-120 (1988) STANDARD simple staff_review FEATURES from to/span description pept 21 788 pyrocatechase (tfdC) BASE COUNT 178 a 203 c 240 g 167 t ORIGIN 1 gttttcatga cggaggcaaa gtgaacaaaa gagtcaagga tgttgtcgat gcgatcgtcg 61 cggcggtaca gcgggtgctc gaccagaagg aggttactga ggcggaatac cgaactgcgg 121 ttcattacct catgcaagta gccgaacagc gtgaaaccgc tcttctttgc gatgtttttt 181 tcaatagcac cgtggctgcg acgaaggctc gtattagcga aggttcaacg cctgctatcg 241 agggacccta ctatcgcgac gacgcccctc tggtcgatga ccggctcaag acttacgaca 301 cggacgacca caagcctttg cttatccagg gaacggtcaa agcggtcgac gggagcgttg 361 tcgaggacgt gacgattgat gtctggcatt cgacgcccga tgggaagtac agcggtttcc 421 atgacgacat cccgactgat ttttatcgag ggaagctcag ggtgggcacc gatggcagct 481 tccgcgtgcg cacaacgatg ccggtgccgt atcagatccc ggatcagggt cccacgggcg 541 cattgctcga aaccatgggt ggtcactcgt ggcgtcccgc tcatgtacat ttcaaggtga 601 aggcgccggg atatgaaacg ttgaccacgc agtactactt cgaaggtgga gattggatca 661 cggacgactg ttgcaacggc gtccagtcta gtctgattac tcccgatatt gtggaagagg 721 gtgtccgcct gatgaacatc aacttcgtta tcgagcccgc gcgtgcgcag gcgggggcaa 781 acccgtga // LOCUS ECOHSEII 530 bp ds-DNA BCT 10-AUG-1990 DEFINITION E.coli heat-stable enterotoxin II (STII) gene, complete cds. ACCESSION M35729 KEYWORDS enterotoxin II. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 530) AUTHORS Picken,R.N., Mazaitis,A.J., Maas,W.K., Rey,M. and Heyneker,H. TITLE Nucleotide sequence of the gene for heat-stable enterotoxin II of Escherichia coli JOURNAL Infect. Immun. 42, 269-275 (1983) STANDARD simple staff_review FEATURES from to/span description pept 166 381 enterotoxin II (STII) BASE COUNT 188 a 86 c 88 g 168 t ORIGIN 1 taaataccta caacgggtga ttgacactac actcattaac tatactgcaa gtagcattaa 61 aaatcttaat aaaggagagc ttcgtcacat ttttttgact tgactcatat aaaagcccac 121 tggtataagt tttattgctt atagcaataa ggttgaggtg attttatgaa aaagaatatc 181 gcatttcttc ttgcatctat gttcgttttt tctattgcta caaatgccta tgcatctaca 241 caatcaaata aaaaagatct gtgtgaacat tatagacaaa tagccaagga aagttgtaaa 301 aaaggttttt taggggttag agatggtact gctggagcat gctttggcgc ccaaataatg 361 gttgcagcaa aaggatgcta atatatttat caatagcatt cagcaccata tacacaaaaa 421 taatttttca taaaaagaac tctataaaat aaatattttt tgtgacaatg tcctaacgca 481 agacggacat tgtccatttc tcactgcagg taaatgatct gtaaatagtc // LOCUS RAT6P26B 1086 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Rat 6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase mRNA, 3' end. ACCESSION M15685 KEYWORDS 6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase. SOURCE Rat liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1086) AUTHORS Colosia,A.D., Lively,M., El-Maghrabi,M.R. and Pilkis,S.J. TITLE Isolation of a cDNA clone for rat liver 6-phosphofructo 2- kinase/fructose 2,6-bisphosphatase JOURNAL Biochem. Biophys. Res. Commun. 143, 1092-1098 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 915 6-phosphofructo 2-kinase/fructose 2,6-bisphosphatase (AA at 1) BASE COUNT 290 a 279 c 264 g 253 t ORIGIN 1 gcagaaaaca tcaagcaagt gaaacttggt agtcctgatt acatagactg tgaccaagaa 61 aaggttttgg aagactttct aaagagaata gagtgctatg agatcaacta ccaacctttg 121 gatgaggaat tggacagcca cctgtcctac atcaagatct tcgacgtggg cacacgctac 181 atggtaaatc gagtgcagga ccacgttcag agccgtacag cctactacct catgaacatc 241 catgtcacac ctcgatctat ctacctatgc cgccatggtg agagtgaact caaccttaga 301 ggccgcattg gaggtgactc tggcctctca gctcggggca agcagtatgc ctatgcacta 361 gccaacttca tccggtctca aggcatcagc tccctgaaag tatggactag ccacatgaag 421 aggaccattc agaccgctga agccctaggt gtcccctatg aacagtggaa ggccctgaat 481 gagattgatg cgggtgtctg tgaagagatg acctatgaag aaattcagga acactaccct 541 gaggaatttg cactacggga ccaggataaa tatcgttacc gctatccgaa gggagagtcc 601 tatgaggatc tggttcagcg tcttgaacca gttataatgg agctagaacg gcaagaaaat 661 gtactggtga tctgtcacca ggctgtcatg cggtgcctcc tggcatactt cctggataaa 721 agttcagatg agctgcccta tctcaagtgt cctctgcata ctgtgctcaa actcacacct 781 gtggcttatg gctgcagagt ggagtccatc tacctgaatg tggaggctgt gaacacacac 841 cgggacaagc ctgagaatgt ggacatcacc cgtgaagctg aggaagcctt ggacactgta 901 cctgcccatt actgagccct tttcaagtga tcagattgtc tctgttctca cccccttcct 961 cctgtaggag ctgctgccct tgttctcctt aagcagactc tggctatggc ctgagagtgc 1021 cctacctcca gtgaagaagt ccgtagcagc tcccaaacag gtctcaattc ctagccacaa 1081 ctaagg // LOCUS RATFABP 491 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Rat fatty liver acid binding protein (FABP) mRNA, complete cds. ACCESSION M35991 KEYWORDS fatty acid binding protein. SOURCE Rat adult liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 491) AUTHORS Gordon,J.I. and Lowe,J.B. TITLE Analyzing the structures, functions and evolution of two abundant gastrointestinal fatty acid binding proteins with recombinant DNA and computational techniques JOURNAL Chem. Phys. Lipids 38, 137-158 (1985) STANDARD simple staff_review FEATURES from to/span description pept 40 423 fatty acid binding protein (FABP) mRNA < 1 491 fatty acid binding protein mRNA BASE COUNT 161 a 95 c 128 g 107 t ORIGIN 1 ctgttggtgg cagctgggaa aggaaacctc attgccacca tgaacttctc cggcaagtac 61 caagtgcaga gccaagagaa ctttgagccc ttcatgaagg cgatgggtct gcctgaggac 121 ctcatccaga aagggaagga catcaagggg gtgtcagaaa tcgtgcatga agggaagaaa 181 gtcaaactca ccatcaccta tgggtccaag gtgatccaca atgagttcac cttgggggag 241 gagtgcgaac tggagaccat gactggggaa aaggtcaagg cagtggttaa gatggagggt 301 gacaataaaa tggtgacaac tttcaaaggc ataaagtccg tgactgaatt caatggagac 361 acaatcacca ataccatgac actgggtgac atcgtctaca agagagtcag caagagaatt 421 tagacaaggc tgtatttcat attcttttac agtataaaat taatacaata aagttacctc 481 tgttttgaaa t // LOCUS RATFABPX 564 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Rat intestinal fatty acid binding protein (FABP) mRNA, complete cds. ACCESSION M35992 KEYWORDS fatty acid binding protein. SOURCE Rat adult small intestinal epithelium, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 564) AUTHORS Gordon,J.I. and Lowe,J.B. TITLE Analyzing the structures, functions and evolution of two abundant gastrointestinal fatty acid binding proteins with recombinant DNA and computational techniques JOURNAL Chem. Phys. Lipids 38, 137-158 (1985) STANDARD simple staff_review FEATURES from to/span description pept 13 411 fatty acid binding protein (FABP) BASE COUNT 179 a 108 c 146 g 131 t ORIGIN 1 acagctgaca tcatggcatt tgatggcact tggaaagtag accggaatga gaactatgaa 61 aagttcatgg agaaaatggg cattaacgtg gtgaagagga agcttggagc tcatgacaac 121 ttgaaactga cgatcacaca ggaaggaaat aaattcacag tcaaagaatc aagcaacttc 181 cgaaacattg atgttgtgtt tgaactcggc gtcgactttg cctatagtct agcagatgga 241 acagaactca ctgggacctg gaccatggag ggaaataaac ttgttggaaa attcaaacgt 301 gtagacaatg gaaaggagct gattgctgtc cgagagattt ctggtaacga actaatccaa 361 acctacacat atgaaggagt ggaggccaag cggatcttta agaaggaata ggccagcttc 421 ttggagcctg gcgcaatgca gaagggctag cttggtgtcg gattcctttc tccatctcgc 481 tgataatgca ggctcactca ctgtgtcatg agcactgctt ccaagccttg tcagagccaa 541 agaagtaaaa gttgattagg actt // LOCUS BSTRR5SA 117 bp ds-DNA RNA 10-AUG-1990 DEFINITION B.stearothermophilus 5S RNA. ACCESSION M25591 KEYWORDS 5S ribosomal RNA; ribosomal RNA. SOURCE B.stearothermophilus RNA. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 117) AUTHORS Zimmermann,J. and Erdmann,V.A. TITLE Binding sites of E.coli and B.stearothermophilus ribosomal proteins on B.stearothermophilus 5S RNA JOURNAL Nucleic Acids Res. 5, 2267-2288 (1978) STANDARD simple staff_entry COMMENT FEATURES from to/span description rRNA 1 117 5S ribosomal RNA BASE COUNT 25 a 36 c 38 g 18 t ORIGIN 1 cctagtggtg atagcggagg ggaaacaccc gttcccatcc cgaacacgga agttaagccc 61 tccagcgccg atggtagttg gggccagcgc ccctgcaaga gtaggccgct gctaggc // LOCUS CHKVIM1 2842 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken vimentin gene, exons 1 and 2. ACCESSION M15850 J02759 M25923 KEYWORDS intermediate filament; vimentin. SEGMENT 1 of 3 SOURCE Chicken DNA, library of Dodgson. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2842) AUTHORS Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M. TITLE The chicken vimentin gene: Nucleotide sequence, regulatory elements, and comparison to the hamster gene JOURNAL J. Biol. Chem. 262, 8112-8120 (1987) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE The chicken vimentin gene: Aspects of organization and transcription during myogenesis JOURNAL Ann. N.Y. Acad. Sci. 455, 79-94 (1985) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by B.A.Roe, 11-MAY-1987. FEATURES from to/span description pept 824 1368 vimentin, exon 1 1458 + 1518 vimentin, exon 2 pre-msg 737 > 2842 VIM mRNA IVS 1369 1457 VIM cds intron A IVS 1519 > 2842 VIM cds intron B BASE COUNT 569 a 827 c 882 g 564 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttgagg atttctgtcc tgcctttgag gctctttggg gatcacttct aatgaccttt 61 ttgtaagagc tctttcccga gacagtgacc gtgtgcctac agaaactaaa agcaaacaga 121 ccctcggtta aaaccgctcc tgccgctccc ccgctgacca ggagcgctgt gcccgaagca 181 aagcgatgcc cctcctgcag ccgtgcgagt gcgcagcgcc gtccgcaatg agcgcaactc 241 cgtcggcacc gagcaggctg ggcgggcccc gggagctgcg ggagcgcgct gagcccatga 301 gcacaggacg ggaggctgag gggggccgtc ggaagcagct aaccccaaac ccaccgacgg 361 ttcacgtttc tatctgtttg ctttggacgt tcggtagccc ttcgtgaata gcatcactcc 421 cccgatctcg gtgcctttca gagagctaag aaagaaggga cggggctcgc acaaagttca 481 acgcatgcat ttttaagcac tcggcccgtt ctaggagcgc tggcgagcac cgcagcccgt 541 cccgccctcg gaatcactgc gctgtcgaag gggcgtaggg cgcaaggcac ctcggcagcc 601 gcccgggtcc ctccgccttt gtcccagcgc cgccaccgcc gccctccctc cgcccctccc 661 cgggcccgac gcggcattgg cgacggacgg ggcggggcgg cgcggcggcg ggcggggcgg 721 cggccccata agaggggccc cggggtgggg acgccgctct tcttcgccgc cgcgctccga 781 gcccgtcgct cccggattac aaagccgctc cgttcctcgc gccatgagct tcaccagcag 841 caagaactcc tcgtaccgcc gcatgttcgg cgggggcagc cggcccagca gcggcacccg 901 ctacatcacg tccagcaccc gctattccct gggcagcgcc ctgcggccca gcagcgcccg 961 ctacgtgtcc gcctcgcccg gcggcgtgta ccgcaccaag gcgacgtcgg tgcggctgcg 1021 gagcagcatg ccgcccatgc ggatgcacga cgccgtggac ttcaccctgg cggacgccat 1081 caacacggag ttcaaggcga accgcaccaa cgagaaggta gagctgcagg agctcaacga 1141 ccgcttcgcc aactacatcg acaaggtgcg cttcctggag cagcagaaca agatcctgct 1201 ggccgagctg gagcagctca agggcaaagg cacgtcccgc ttgggcgacc tgtacgagga 1261 ggagatgcgg gacgtgcggc gccaggtgga ccagctgacc aacgacaagg cccgcgtcga 1321 ggtggagcgc gacaacctgg ccgacgacat catgcgcctg cgggagaagt gagtggggcc 1381 gcgggccggg gccgggaagc gggcgggcac ccacccctcc ctcacctcac ctcacctcac 1441 cctgtcgctg cttgcaggtt gcaggaggag atgctgcagc gggaggaggc cgagagcacc 1501 ctgcagtcct tccgacaggt gggcgcggcg ccggggcggt gggggcggaa ggtcggggcg 1561 gggctgggct gggggctgct ctgcccgcac aaaggcgctc gcgcagcctc ttgttctccc 1621 cgttgttttc tttagggggg tgccgcagtc cgggggggag ggtgaggcta ttgtcccgct 1681 ttatccggtc tggccggctc cacaaaagcg ctccgacgtg gaatcgcttt ctgatccaat 1741 agttgtgaaa acaggccatc tgcgcgccgg gctgaagggt attaatggtt tctatgggat 1801 tcacagagga aatgcagatc caggcattat ggccggtgtg gtggcagcag attgaaataa 1861 tggatccctt tgtgtcggag gggagggtgc cgtgggctgc attcctacca aatgcgtaat 1921 ggtgggggca tattcctggg ggagactgtg tgaaatacag aactccttct ggatgagcca 1981 cgtcggtctg cgctctcaga agactgtccc acatttgaag tatttcctat cagtggtgtc 2041 tgccgatcgg aagcagacaa tgggctcctt gggatgagct cgcagcagct gtctgcgtcc 2101 ccttgacgtg cattaaaaat gtagaattta atcctatata agcaaagatg tgaatgtaag 2161 aaacagtgca ctggagatga gtgtgtagga agctgcggtg tcacacagat gaaccaaagc 2221 cgatgtgaca gccaaaacca actgtgaagc ctgtgaaaag ttgtgagctc aagttctgat 2281 ttgtgcaaaa acagctctcg gcttggattg agggctgttt ggactagcag atgtttagga 2341 atgtgccttc tcaaaacaag ataacttcct gaaaactgct ggtgtgaaac aacactgctg 2401 tcactcctct gagctggaac gagctgccct gtatctccag acagctgggc gcaggctgac 2461 cgtcccatag agctctccat aggcaatagg tgtcaccatg caggaagttg tcatttgaat 2521 tttagcattt tccctagcgg tcttagtgag acagtgtggg agcagacaca tggcagggtc 2581 tgggaatgag gtgtccactg acgttctggg ggctgaggct tgaaatagca gctccaggta 2641 ctgcctttaa cctgtgctgc tgtgttacca ggctgagccg atccaggttg ttgccaagtc 2701 agtgcagatg catttccctt tttttgtttt aaattactag agttcaagca caagctcctt 2761 tgacttaaag ctagtttaat catgcgtgct tctctggtgc gcttcaaggc atgtcttcct 2821 gtccctccaa ccgatgaagc tt // LOCUS CHKVIM2 685 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken vimentin gene, exons 3 and 4. ACCESSION M15851 J02759 M25923 KEYWORDS intermediate filament; vimentin. SEGMENT 2 of 3 SOURCE Chicken DNA, library of J.B.Dodgson. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 685) AUTHORS Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M. TITLE The chicken vimentin gene: Nucleotide sequence, regulatory elements, and comparison to the hamster gene JOURNAL J. Biol. Chem. 262, 8112-8120 (1987) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE The chicken vimentin gene: Aspects of organization and transcription during myogenesis JOURNAL Ann. N.Y. Acad. Sci. 455, 79-94 (1985) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by by B.A.Roe, 11-MAY-1987. FEATURES from to/span description pept + 7 102 vimentin, exon 3 246 + 407 vimentin, exon 4 pre-msg < 1 > 685 VIM mRNA IVS < 1 6 VIM cds intron B IVS 103 245 VIM cds intron C IVS 408 > 685 VIM cds intron D BASE COUNT 189 a 142 c 158 g 196 t ORIGIN About 3 kb after segment 1. 1 gatcaggatg ttgacaatgc ctctctggca ggccttgatc ttgagcgccc cgttgagtcc 61 ctgcaagaag aaattgtctt cttgaagaag cttcatgatg aggtaagctg aataatgagg 121 tctgtgtctg ggcaccgtca gtcaagtcaa caagtaaatt gtctctggtt ctggccccga 181 gtcctaacct gtgcattttc tgcatagtgt tgtttattaa ctccaagtgt ctacttctct 241 ctaaggaaat ccgggaactg caggctcaac tccaggaaca gcacatccaa atcgatatgg 301 atgtttctaa gcctgatctt actgctgccc tgcgcgatgt tcgtcaacaa tatgaaagcg 361 ttgctgctaa gaatcttcag gaagctgaag agtggtacaa gtccaaagta agtacaatgc 421 gttttaggat tatgcagatg cagtaggcat tcacatgaat tttaggtaac agctttgctg 481 tgttccttca gtactgaatg ttagcacagc cacagagtag ggtagtcttc aaactatcag 541 tcagggaaga atacagttgg aaaagtcatt tgtgccctaa gtgcagcatc agactaatgg 601 gctcatttcc tgttcttccc tattcttgta gctggaggca gttcttaagt atatacctaa 661 gaaaggaatg aacaaatgca ttaaa // LOCUS CHKVIM3 2857 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken vimentin gene, exons 5-9. ACCESSION M15852 J02759 M25922 KEYWORDS intermediate filament; vimentin. SEGMENT 3 of 3 SOURCE Chicken DNA, library of J.B.Dodgson. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 2857) AUTHORS Zehner,Z.E., Li,Y., Roe,B.A., Paterson,B.M. and Sax,C.M. TITLE The chicken vimentin gene: Nucleotide sequence, regulatory elements, and comparison to the hamster gene JOURNAL J. Biol. Chem. 262, 8112-8120 (1987) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE The chicken vimentin gene: Aspects of organization and transcription during myogenesis JOURNAL Ann. N.Y. Acad. Sci. 455, 79-94 (1985) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by by B.A.Roe, 11-MAY-1987. Polyadenylation signals are located at positions 2546-2551, 2594-2599, 2829-2834, and 2850-2855. Multiple mRNAs are found for vimentin, each using a different polyadenylation site. FEATURES from to/span description pept + 159 284 vimentin, exon 5 901 1121 vimentin, exon 6 1216 1259 vimentin, exon 7 1685 1770 vimentin, exon 8 2256 2297 vimentin, exon 9 pre-msg < 1 > 2297 VIM mRNA IVS < 1 158 VIM cds intron D IVS 285 900 VIM cds intron E IVS 1122 1215 VIM cds intron F IVS 1260 1684 VIM cds intron G IVS 1771 2255 VIM cds intron H BASE COUNT 816 a 536 c 611 g 892 t 2 others ORIGIN About 0.5 kb after segment 2. 1 tcagatagga aaggtcagtg taacgttcat gtttaaatat ttcaagcgct agaacttccc 61 tgtaatggct tcagttgcct ccaaatcctg ggtctagaag tggtccatta caccattttt 121 gtgaagtgtt atgctaacca atctgggtgt ttccacagtt tgcagatctc tccgaagctg 181 ctaataggaa caatgatgcc ctgcgccagg ccaaacaaga agctaatgaa tatcgcagac 241 agattcagtc tctcacctgt gaagttgatg cccttaaagg aagtgtaagt agacgttctt 301 cctgattcag ctttttaagc actatgctgt agtttagggg atgtgtgtgg ctgcagctca 361 tctcccacag aagacagcag tgcacccatc cttcctgaaa aagactctct gggctccatt 421 tgtctgcatg tactttgtag tgcactgctc agattgctac actggactgg tctttctcaa 481 ggtgtaaaca atacagaatg agctatacga agtggaaaag tgcagatctt gcacttccta 541 gaatacagtt gcctgaaagg ctggcttgaa atggaagtaa ctgtatagta ctcctgtaga 601 ttccctcacc aagtactgtc tgcatgctga cttgtactgt atgtgtttgg cttactgtgg 661 gggtttgctt tgctgtctgg tagaataaat gtgaaacatc tctcatcaga caccagaata 721 acacacctgg actttgcaag gggaggataa ggacatctcc ttatggggac atatttgagc 781 atgttcagaa aggacactgt cattcaactt ccctttgcgt ttttttctta agttatgaag 841 ggtcttgaca ttagaaagag tggctgaaat gtagttaaat ttgactttat tctctcctag 901 aatgaatccc tggagcgcca gatgcgtgaa atggaggaga attttgctgt tgaagctgct 961 aactaccagg acactattgg ccgcctgcag gatgagattc agaacatgaa ggaagaaatg 1021 gctcgccatc ttcgtgagta ccaggacctg ctgaatgtaa agatggctct tgatattgag 1081 attgctacct acagaaaact gctggaggga gaagagagca ggtaaaaatc acatgcaggc 1141 acctaccttc tagcagttac agtttagaga ttccttacca ttgtctgact tgatctattt 1201 cttcttttcc ttcaggatta acatgcctat tccaaccttt gcttctttga acctgagagg 1261 taagctgttt catttgggcc tgtgtaagca gctgttttta atctttagca aaaaggtcat 1321 ggatagttcc ttttgctgta gtatttgaac tggtgtaaat gcagttgtat atgaaagtca 1381 ttcttacaca caactgccag tatttgcagt tacttaattt tggatgagag gtggtacttt 1441 gttcctgggt gtacacgtaa gggagtatca atacataaca agtgacttgg atttctgttg 1501 aagctgcttt aagaggaata agaagaaaat aagcaaagct agtttataat tcattatcat 1561 gcaaaagaga agaggacttc tttgtataca aaactgggat tctgaaagta gatactgaca 1621 ttcctcagtt ctttgaggag gcacttcatc tagactcttg gtaacaatac attttcttct 1681 gcagaaacca acattgagtc tcagccaatt gttgacactc actcgaagag gacacttcta 1741 attaagaccg tggaaactag agatggacag gttggtggtt actttgattt tttttcattc 1801 tatctcagct ggttactttg cccagtgctg tagtctgggc cactcattgg tgcagcctgt 1861 atatggcaga cagtgtctgg cagtccaagc tgcatggctg aatgtgctga tgtagcaaat 1921 gtacttctgt ggaatagacc tcaaaacaga tctgagaaca gcttgagtga ctgacagccc 1981 gtgtttagag gagactttag gttcagttga gttgcaagtc ctcaagattt ctcccattgc 2041 ataagaaaaa tgttgcagga gaagcgttta ttaggggcag gtagtgtatt tgtactaaat 2101 tgaagtttag atggttaagc gtgtgcagct accatttaaa cttctcattc ataggcaaat 2161 gctgatacct ttgttcacat ctgtgctact atttgtttca gttctaactc tttataacaa 2221 ctgtcttaag cttnnatgcc tattttccaa cacaggttat taatgaaact tcccagcatc 2281 acgatgactt ggagtaaagt gaagatgcaa acttaatgca ggagaaattc ttaccagcaa 2341 ggttttaaaa agttcatgtc ttaaaggaag aaacagcttt caagtgcctt tctccagttt 2401 tccatgagcg caagattatt atgctaggaa ataggtctta gatcttgcaa actgactctc 2461 cctgaaggat tagagtttac aatggagtct agtttacaaa tagcaatatc ttgtgctgca 2521 atactgtttt taagtatctg aatttaataa aactgctttt tccagcacag tatgagcaac 2581 ctgtcgctac ttcaataaat ctttggaaaa tggctcttga tgtgttctaa tttaacttca 2641 tgactttctg caaagccata acttaatgct ggaattacta tacggttgac aactccagta 2701 ctgattgtgt gaaatattgt tttcagatta actagacaaa ctgtcttccc atttactgct 2761 taggttttgg aaccaattaa aatggactat aactggcaga tgcataatgt attgatactt 2821 atcagttgaa taaaatgata cttcaagcta ataaaaa // LOCUS CHKVIMN 591 bp ds-DNA VRT 10-AUG-1990 DEFINITION chicken intermediate filament protein vimentin gene, 3' end. ACCESSION X00185 J00915 KEYWORDS vimentin. SOURCE Chicken DNA and cDNA to embryonic muscle mRNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 591) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE Characterization of the chicken vimentin gene: Single copy gene producing multiple mRNAs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 911-915 (1983) STANDARD simple staff_review REFERENCE 2 (bases 30 to 591) AUTHORS Zehner,Z.E. and Paterson,B.M. TITLE Vimentin gene expression during myogenesis: Two functional transcripts from a single copy gene JOURNAL Nucleic Acids Res. 11, 8317-8332 (1983) STANDARD simple staff_review COMMENT FEATURES from to/span description mRNA < 1 308 vimentin mRNA (alt.) mRNA < 1 367 vimentin mRNA (alt.) mRNA < 1 591 vimentin mRNA (alt.) BASE COUNT 193 a 101 c 105 g 192 t ORIGIN 1 cagcatcacg atgacttgga gtaaagctga agtgaagatg caaacttaat gcaggagaaa 61 ttcttaccag caacctttta aaaagttcat gtcttaaagg aagaaacagc tttcaagtgc 121 ctttctccag tttttccatg agcgcaagat tattatgcta ggaaataggt cttagatctt 181 gcaaactgac tctccctgaa ggattagagt ttacaatgga gtctagttta caaatagcaa 241 tatcttgtgc tgcaatactg tttttaagta tctgaattta ataaaactgc tttttccagc 301 acagtatgag caacctgtcg ctacttcaat aaatctttgg aaaatggctc ttgatgtgtt 361 ctaatttaac ttcatgactt tctggaaagc cataacttaa tgctggaatt actatacggt 421 tgacaactcc agtactgatt gtgtggaata ttgttttcag attaactaga caaactgtct 481 tcccatttac tgcttaggtt ttggaaccaa ttaaaatgga ctataactgg cagatgcata 541 atgtattgat acttatcagt tgaataaaat gatacttcaa gctaataaaa a // LOCUS ECORRD 1541 bp ss-rRNA RNA 10-AUG-1990 DEFINITION E.coli 16S ribosomal RNA. ACCESSION J01859 M24828 M24911 K02555 M24996 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE Escherichia coli rRNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1541) AUTHORS Ehresmann,C., Stiegler,P., Carbon,P. and Ebel,J.-P. TITLE Recent progress in the determination of the primary sequence of the 16 S RNA of Escherichia coli JOURNAL FEBS Lett. 84, 337-341 (1977) STANDARD full staff_review REFERENCE 2 (bases 1 to 1541) AUTHORS Carbon,P., Ehresmann,C., Ehresmann,B. and Ebel,J.-P. TITLE The sequence of Escherichia coli ribosomal 16 S RNA determined by new rapid gel methods JOURNAL FEBS Lett. 94, 152-156 (1978) STANDARD full staff_review REFERENCE 3 (bases 1 to 1541) AUTHORS Carbon,P., Ehresmann,C., Ehresmann,B. and Ebel,J.-P. TITLE The complete nucleotide sequence of the ribosomal 16-S RNA from Escherichia coli: Experimental details and cistron heterogeneities JOURNAL Eur. J. Biochem. 100, 399-410 (1979) STANDARD full staff_review REFERENCE 4 (bases 1346 to 1541) AUTHORS Taylor,B.H., Prince,J.B., Ofengand,J. and Zimmermann,R.A. TITLE Nonanucleotide sequence from 16S ribonucleic acid at the peptidyl transfer ribonucleic acid binding site of the Escherichia coli ribosome JOURNAL Biochemistry 20, 7581-7588 (1981) STANDARD full staff_review REFERENCE 5 (bases 1442 to 1541) AUTHORS Schnare,M.N. and Gray,M.W. TITLE 3'-terminal sequence of wheat mitochondrial 18S ribosomal RNA: Further evidence of a eubacterial evolutionary origin JOURNAL Nucleic Acids Res. 10, 3921-3932 (1982) STANDARD full staff_review REFERENCE 6 (bases 1 to 1541) AUTHORS Ehresmann,C., Stiegler,P., Fellner,P. and Ebel,J.-P. TITLE The determination of the primary structure of the 16s ribosomal rna of escherichia coli: III. Further studies JOURNAL Biochimie 57, 711-748 (1975) STANDARD simple staff_entry REFERENCE 7 (bases 1 to 1541) AUTHORS Ehresmann,C., Stiegler,P., Fellner,P. and Ebel,J.-P. TITLE the determination of the primary structure of the 16S ribosomal RNA of Escherichia coli: (2) nucleotide sequences of products from partial enzymatic hydrolysis JOURNAL Biochimie 54, 901-967 (1972) STANDARD simple staff_entry REFERENCE 8 (sites) AUTHORS Zimmermann,R.A. and Singh-Bergmann,K. TITLE Binding sites for ribosomal proteins S8 and S15 in the 16S RNA of Escherichia coli JOURNAL Biochim. Biophys. Acta 563, 422-431 (1979) STANDARD simple staff_entry REFERENCE 9 (bases 558 to 890) AUTHORS Gregory,R.J., Zeller,M.L., Thurlow,D.L., Gourse,R.L., Stark,M.J.R., Dahlberg,A.E. and Zimmermann,R.A. TITLE Interaction of ribosomal proteins S6, S8, S15 and S18 with the central domain of 16S ribosomal RNA from Escherichia coli JOURNAL J. Mol. Biol. 178, 287-302 (1984) STANDARD simple staff_review COMMENT [2], [3] contain updated sequence data for the original work by the same laboratory [1]. There were too many discrepancies between [1] and [2], [3] to list each revision in our sites table. The sequence shown is from [3]. [1], [2], [3] point to a number of cistron heterogeneities. There is uncertainty, however, with regard to assigning these various heterogeneities to specific cistrons. The RNA method used by [1], [2], [3] gives the average of all the cistrons present in the cell [3]. The heterogeneities are classified by their relative proportions into major, minor and undetermined species. The sequence shown corresponds to the major species. The heterogeneities were annotated as variations in the sites table. It is not known which of the residues "c" (base 633) or "a" (base 641) undergoes a deletion, giving rise to the minor component "atctg". [3] suggests the existence of one or two mutated cistrons among the known seven cistrons of ribosomal RNA. With the exception of a single base deletion, this sequence is identical to the current 16S rDNA sequence for the E.coli rRNB gene. FEATURES from to/span description rRNA 1 1541 16S rRNA variant 76 76 g in major species; c in undetermined species variant 80 80 a in major species; c in undetermined species variant 88 89 tc in major species; tgc in undetermined species variant 90 92 ttt in major species; substitution of one of the t by c in undetermined species variant 179 179 t in major species; c in minor species variant 182 182 c in major species 1; t in major species 2 variant 193 194 ca in major species; cga in undetermined species variant 267 272 tcacct in major species; ccacct or tcaccc in minor species variant 283 285 ccc in major species; cc in minor species conflict 348 349 ag in [3]; ga in [2] modified 526 526 m7g = 7-methylguanosine variant 630 631 ct in major species; cct in minor species variant 632 634 gca in major species; ga in minor species (see comment) variant 640 642 tac in major species; tc in minor species (see comment) conflict 853 855 ttc in [3]; ctt in [2] revision 915 916 tg in [3]; tcg in [1],[2] (typing error) variant 965 965 g in undetermined species 1; 2-methylguanosine in species 2 conflict 1027 1029 ctt in [3]; ttc in [2] variant 1071 1072 gt in major species; gtt in undetermined species variant 1074 1074 t in major species; a in undetermined species modified 1099 1099 m5c = 5-methylcytidine modified 1206 1206 m2g = 2-methylguanosine variant 1280 1280 c in major species; t in minor species variant 1320 1322 tcg in major species; tg in minor species modified 1401 1401 m4c = n4-methylcytidine modified 1402 1402 cm = 2'-o-methylcytidine variant 1405 1405 t in major species; c in minor species variant 1406 1406 5-methylcytidine in major species; c in minor species conflict 1490 1493 gaag in [3]; gag in [4] modified 1497 1497 um = 2'-o-methyluridine modified 1517 1517 m66a = n6,n6-dimethyladenosine modified 1518 1518 m66a = n6,n6-dimethyladenosine BASE COUNT 389 a 352 c 487 g 313 t ORIGIN 5' end of mature rRNA. 1 aaattgaaga gtttgatcat ggctcagatt gaacgctggc ggcaggccta acacatgcaa 61 gtcgaacggt aacaggaaga agcttgctct ttgctgacga gtggcggacg ggtgagtaat 121 gtctgggaaa ctgcctgatg gagggggata actactggaa acggtagcta ataccgcata 181 acgtcgcaag accaaagagg gggaccttcg ggcctcttgc catcggatgt gcccagatgg 241 gattagctag taggtggggt aacggctcac ctaggcgacg atccctagct ggtctgagag 301 gatgaccagc cacactggaa ctgagacacg gtccagactc ctacgggagg cagcagtggg 361 gaatattgca caatgggcgc aagcctgatg cagccatgcc gcgtgtatga agaaggcctt 421 cgggttgtaa agtactttca gcggggagga agggagtaaa gttaatacct ttgctcattg 481 acgttacccg cagaagaagc accggctaac tccgtgccag cagccgcggt aatacggagg 541 gtgcaagcgt taatcggaat tactgggcgt aaagcgcacg caggcggttt gttaagtcag 601 atgtgaaatc cccgggctca acctgggaac tgcatctgat actggcaagc ttgagtctcg 661 tagagggggg tagaattcca ggtgtagcgg tgaaatgcgt agagatctgg aggaataccg 721 gtggcgaagg cggccccctg gacgaagact gacgctcagg tgcgaaagcg tggggagcaa 781 acaggattag ataccctggt agtccacgcc gtaaacgatg tcgacttgga ggttgtgccc 841 ttgaggcgtg gcttccggag ctaacgcgtt aagtcgaccg cctggggagt acggccgcaa 901 ggttaaaact caaatgaatt gacgggggcc cgcacaagcg gtggagcatg tggtttaatt 961 cgatgcaacg cgaagaacct tacctggtct tgacatccac ggaagttttc agagatgaga 1021 atgtgccttc gggaaccgtg agacaggtgc tgcatggctg tcgtcagctc gtgttgtgaa 1081 atgttgggtt aagtcccgca acgagcgcaa cccttatcct ttgttgccag cggtccggcc 1141 gggaactcaa aggagactgc cagtgataaa ctggaggaag gtggggatga cgtcaagtca 1201 tcatggccct tacgaccagg gctacacacg tgctacaatg gcgcatacaa agagaagcga 1261 cctcgcgaga gcaagcggac ctcataaagt gcgtcgtagt ccggattgga gtctgcaact 1321 cgactccatg aagtcggaat cgctagtaat cgtggatcag aatgccacgg tgaatacgtt 1381 cccgggcctt gtacacaccg cccgtcacac catgggagtg ggttgcaaaa gaagtaggta 1441 gcttaacctt cgggagggcg cttaccactt tgtgattcat gactggggtg aagtcgtaac 1501 aaggtaaccg taggggaacc tgcggttgga tcacctcctt a // LOCUS HUMRSALPA 320 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone pRI-680-05, dimer 1. ACCESSION M28029 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 320) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 320 alphoid repeat, dimer 1 BASE COUNT 100 a 54 c 59 g 107 t ORIGIN 1 tgatgtgtgt attcaacgca cagagttgaa cgttccttta gacagagtag attggaaaca 61 ctctttttgt agaattttca agtggaggta tcaagcgctt tgaggccaat gatagaaaag 121 gaaatacctt cgtataataa ttagacagaa tcattctcag aaaccgcttt gcaatgtgtg 181 cgttcaactc acagtgttta acctttcttt tcatacagtt gtttcgaaac actcttttta 241 cagaatctac aagtaaatat ttggacctct ttgaagtctt cgttagaaat gagatttctt 301 catataatac tagctagaag // LOCUS HUMRSALPB 314 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone pRI-680-05, dimer 2. ACCESSION M28030 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 314) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 314 alphoid repeat, dimer 2 BASE COUNT 89 a 51 c 65 g 109 t ORIGIN 1 ttctctcagt aacttgtatt tgctcggtgc tatcaactct cagagttgaa ctttccttta 61 gaaacagtag atttgaaact ctctttttgc ggaatttgta agtggagata tcaagagttt 121 tgaggccaat ggtagaaaag gaaatatctt cgtatgtaaa ctagatagaa tcattctcag 181 aaactacttt ggcacgtgtg cgttcaactc acagtgttta acctttcttt tcatagagca 241 gtttggaaac actcagtttg taaagtcagc aactggatat ttggatgtat ttgattcctt 301 cgttggaaat ggga // LOCUS HUMRSALPC 2550 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone pHS53. ACCESSION M28031 M28033 M28032 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2550) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 850 alphoid pentamer rpt 851 1700 alphoid pentamer rpt 1701 2550 alphoid pentamer BASE COUNT 743 a 496 c 524 g 787 t ORIGIN 1 gaatctgcaa gtggatattt ggacttctct gaggatttcg ttggaaacgg gataaacttc 61 ccagaactac acggaagcat tgtgagaaac ttctttgtga tgtttgcatt caactcacag 121 agttgaacct tgctttcata gttcagcttt caaacactct ttttgtagaa tctgcaagtg 181 gatatttgga ccactttgtg gccttccttc gaaacgggta tatcttcaca tcaaacctag 241 accgaagcat tctcagaatg tttcctttga tgactgcatt caactcacag aggtgaacaa 301 tcctgctgat ggagcagttt tgaaactctc tttctttgga ttctgcaagt ggatatgtgg 361 acctctgtga agatttcttt ggaaacgggt tcatcttcac agaaaaacta aacagaagca 421 ttctcagaaa ctgctttgtg atgtttgtgt tccacttcag gaattaaact ttcctcttga 481 cagagcagct ctgaaaccct cttattctag aatctgcaag tggacatttg gaggactttg 541 aggcctgtgg tggaaaagga aaatcttcac ataaaaacta gatggaagca ttctcagaaa 601 ctactttgtg atgattgcat tcgactcaca gagttgaaca ttcctataga tagagcaggt 661 tgtaaacaat ctttttgtag aatctgcgat tggagatttg gactgctttg aggcctactg 721 tagtaaagga aataacttca tctaaaaacc aaacggaagc attcacagac aattcttagt 781 gataattgca ttgatctaac agagctgaac attcctttag atggcgtagt ttccaaacac 841 actttctgta gaatctgcaa gtggatattt ggacttctct gaggatttcg ttggaaacgg 901 gataaacttc ccagaactac acggaagcat tgtgagaaac ttctttgtga tgtttgcatt 961 caactcacag agttgaacct tgctttcata gttcagcttt caaacactct ttttgtggaa 1021 tctgcaagtg gatatttgga ccactttgtg gccttccttc gaaacgggta tatcttcaca 1081 tcaaacctag accgaagcat tctcagaatg tttcctgtga cgactgcatt caactcacag 1141 agatgaacaa tcctgctgat ggagcagttt tgaaactctc tttctttgga ttctgcaagt 1201 tgatatgtgg acctctgtga agatttcgtt ggaaacgggt tcatcttcac agaaaaacta 1261 aacagaagca ttctcagaaa ctgctttgtg atgtttgtgt tccacttcaa gaattgaact 1321 ttcctcttga cagagcagct ctgaaaccct ctttttctag aatctgcaag tggacatttg 1381 gagggctttg aggcctgtgg tgcaaaagga aaatcttcac ataaaaacta gatggaagca 1441 ttctcagaaa ctactttgtg atgattgcat tcgactcaca gagttgaaca ttcctataga 1501 tagagcaggt tgtaaacaat ctttttgtag aatctgcgat tggagatttg gactgctttg 1561 aggcctactg tagtaaagga aataacttca tctaaaatcc aaacggaagc attcacagac 1621 aattcttagt gataattgca ttgatctaac agagctgaac attcctttag atggcgtagt 1681 ttccaaacac actttctcta gaatctgcaa gtggatattt ggacttctct gaggatttcg 1741 ttggaaacgg gataaacttc ccagaactac acggaagcat tgtgagaaac ttctttgtga 1801 tgtttgcatt caactcacag agttgaacct tgctttcata gttcagcttt caaacactct 1861 ttttgtggaa tctgcaagtg gatatttgga ccactttgtg gccttccttc gaaacgggta 1921 tatcttcaca tcaaacctag acagaagcat tctcagaatg tttcctgtga cgactgcatt 1981 caactcacag aggtgaacaa ccctgctgat ggagcagttt tgaaactctc tttctttgga 2041 ttctgcaagt ggatatgtgg acctctgtga agatttcgtt ggaaacggtt tcatcttcac 2101 agaaaaacta aacaggagca ttctcagaaa ctgctttgtg atgtttgtgt tccacttcaa 2161 gaattgaact ttcctcttga cagagcagct ctgaaaccct ctttttctag aatctgcaag 2221 tggacatttg gagggctttg aggcctgtgg tggaaaagga aaatcttcac ataaaaacta 2281 gatggaagca ttctcagaaa ctactttgtg atgattgcat tcgactcaca gagttgaaca 2341 ttcctataga tagagcaggt tgtaaacaat gtttttgtag aatctgcgat tggagatttg 2401 gactgctttg aggcctactg tagtaaagga aataacttca tctaaaaacc aaacgcaagc 2461 attcacagac aattcttagt gatcattgga ttgaactaac agagctgaac attcctttag 2521 atggagaagt ttccaaacac actttctgca // LOCUS HUMRSALPD 134 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-11A. ACCESSION M28034 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 134) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 134 alphoid repeat BASE COUNT 47 a 23 c 27 g 37 t ORIGIN 1 tcacagagtt gaacgatcct ttacacagag cagacttgta acactctttt tgtggaattt 61 gcaagtggag atttcagccg ctttgaagtc aaaggtagaa aaggaaatat cttcctataa 121 aaactagaca gaat // LOCUS HUMRSALPE 146 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-23A. ACCESSION M28035 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 146) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 146 alphoid repeat BASE COUNT 46 a 28 c 26 g 43 t 3 others ORIGIN 1 ttttttccca actcacacga gttaacgatc ctttacacag agcagacttg tannnctctc 61 tttatggaat ttgcaagtgg agatttcagc cgctttgagg tcaatggtag aaaaggaaat 121 atcttcctat aaaaactaga cagaat // LOCUS HUMRSALPF 171 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-10A. ACCESSION M28036 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 171) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 171 alphoid repeat BASE COUNT 53 a 27 c 38 g 53 t ORIGIN 1 aattctcaga gtcttctttg tgatgtgtgc tttcaactca ccgagataaa gatttctctt 61 gatagagcaa attggaaaca ctctttttgt agaatttgca agggtacatt gagagcgctt 121 tcaggcctat ggtagaaaag ggaatatctt tccataaaag gtagacagaa g // LOCUS HUMRSALPG 137 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-11B. ACCESSION M28037 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 137) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 137 alphoid repeat BASE COUNT 33 a 27 c 30 g 47 t ORIGIN 1 gattctcaga aactcctttg tgatgtgtgc gttcaactca cagagtttaa cctttctttt 61 catagagcag ttaggaaaca ctctgtttgt gaagtctgca agtggatatt cagacctctt 121 tgaggccttc gttggaa // LOCUS HUMRSALPJ 158 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-03B. ACCESSION M28040 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 158) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 158 alphoid repeat BASE COUNT 44 a 31 c 40 g 43 t ORIGIN 1 gcaactcaga aactactttg tgatgtgtgc attcaactca ccgagtgcaa ccgagcgcaa 61 gatagagcag tttggaaaca ttgtgtctgt agaatctgca agtggatata tggaccgctt 121 tgaggccttc gttggaaacg ggatttcttc ctataaag // LOCUS HUMRSALPK 138 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alphoid repetitive DNA, subclone p340-10B. ACCESSION M28039 KEYWORDS alphoid repetitive sequence; repetitive DNA. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 138) AUTHORS Zaitsev,I.Z. and Rogaev,E.I. TITLE Structural analysis of alphoid DNA of primates JOURNAL Mol. Biol. 20, 530-538 (1986) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt 1 138 alphoid repeat BASE COUNT 37 a 27 c 32 g 42 t ORIGIN 1 caatctcaga aactactttg tgatgtgtgc attcaactca ccgagtgcaa cattcctctt 61 gatagagcag tttggaaaca ttgtgtctgt agaatctgca agtggatata tggaccgctt 121 tgaggccttc gttggaaa // LOCUS HUMHBGF1 1082 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human heparin-binding growth factor 1 (hbgf-1), exon 1. ACCESSION M23017 KEYWORDS acidic fibroblast growth factor; heparin-binding growth factor 1. SEGMENT 1 of 3 SOURCE Human fetal liver DNA, clone ChIK117-1.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1082) AUTHORS Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M. TITLE Cloning of the gene coding for human class 1 heparin-binding growth factor and its expression in fetal tissues JOURNAL Mol. Cell. Biol. 9, 2387-2395 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.-M.Chiu, 13-MAR-1989. FEATURES from to/span description pept 602 + 770 heparin-binding growth factor 1, exon 1 /nomgen="FGFA" /map="5q31.3-q33.2" /hgml_locus_uid="LD0150G" pre-msg 568 > 1082 heparin-binding growth factor 1 mRNA and intron IVS 771 > 1082 heparin-binding growth factor 1 intron A BASE COUNT 258 a 285 c 262 g 277 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttccct taacatacta accctttact ttccctgttg tgtccctgaa aggcctcctg 61 tgcctttggc tgcaggtccc gaacgtccag gccatctgtg ctatctgctt cgcggtacct 121 caccaacgca acgtgagggt ggagggcaga accttggtcc tggcctctca gcttttgtgg 181 gtttcagcca gaccctaggt gttattttag tgcaactttg gtgtttaatt tgaggatgtg 241 tgtggaccag aaggagggac caaaacatga ttcttttccc catggtcaga tgattaaatt 301 tgaagttcta aaaaatgcag tttggtccaa agctgtgtcc aattgggaag agagaaaaat 361 gccctggaaa cccctcccag gcctgggacc atccttcctt aaccaccagc cacctcacag 421 gcccgcggac tgcgggcatc acctgggcag gctgtgctta ctcactaccc gggaaccctg 481 tgccctggag ctgtccttcc tctcttcaaa gtgcattttg tgcctttgct ggaagaaccg 541 actacaggtt tgttcaattt cttacagtct tgaaagcgcc acaagcagca gctgctgagc 601 catggctgaa ggggaaatca ccaccttcac agccctgacc gagaagttta atctgcctcc 661 agggaattac aagaagccca aactcctcta ctgtagcaac gggggccact tcctgaggat 721 ccttccggat ggcacagtgg atgggacaag ggacaggagc gaccagcaca gtaagcccat 781 ctctatggca ccccccttcc ctttctgaca tcttctgtag tcaaggtggg aggaaggtgc 841 acatttaagt acaggtactt gcttctccaa ggttctattc aggcatgaca cattcagagg 901 tggagtcaca taaatgcgta aaatgtctgg gaaatgaaaa tagggacttg tgggggccac 961 cacttaccca aacgtgtcct atttcaagtt ttttaaagca ctctctgctg acccaacaga 1021 acgggctgcc ggtgctcaat tgctgtatgt tttcccaggt ttctgtaact agtgaaagat 1081 ct // LOCUS HUMHBGF2 427 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human heparin-binding growth factor 1 (hbgf-1), exon 2. ACCESSION M23086 KEYWORDS acidic fibroblast growth factor; heparin-binding growth factor 1. SEGMENT 2 of 3 SOURCE Human adult liver DNA, clone ChIL208-1.4.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 427) AUTHORS Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M. TITLE Cloning of the gene coding for human class 1 heparin-binding growth factor and its expression in fetal tissues JOURNAL Mol. Cell. Biol. 9, 2387-2395 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.-M.Chiu, 13-MAR-1989. FEATURES from to/span description pept + 186 + 289 heparin-binding growth factor 1, exon 2 /nomgen="FGFA" /map="5q31.3-q33.2" pre-msg < 1 > 427 heparin-binding growth factor 1 mRNA and introns IVS < 1 185 heparin-binding growth factor 1 intron A IVS 290 > 427 heparin-binding growth factor 1 intron B BASE COUNT 106 a 85 c 108 g 128 t ORIGIN Unknown number of bp after segment 1. 1 cagctttctt tggaaggcaa agaaaaaggg actgtatttc tatgttttga ttaatctgag 61 gctcatcctg agggctccgt gaaatgaatg agcagaattt tccatggcca actgtcctgg 121 ctgccgggtc ctatcggcaa aagcgtagtg tttatttact tttgctcgtg ttatttttat 181 tccagttcag ctgcagctca gtgcggaaag cgtgggggag gtgtatataa agagtaccga 241 gactggccag tacttggcca tggacaccga cgggctttta tacggctcag taagtatgaa 301 gctgacatgc ttccagacgt tggccaaggt ttgaggtttc cagaaatctt gttacatgga 361 gtgaggcaaa ctataaagca acaattagtc tctgtttgtt attttttcca gaaggattcc 421 caccctc // LOCUS HUMHBGF3 664 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human heparin-binding growth factor 1 (hbgf-1), exon 3. ACCESSION M23087 KEYWORDS acidic fibroblast growth factor; heparin-binding growth factor 1. SEGMENT 3 of 3 SOURCE Human fetal liver DNA, clone ChIK105a-2.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 664) AUTHORS Wang,W.-P., Lehtoma,K., Varban,M.L., Krishnan,I. and Chiu,I.-M. TITLE Cloning of the gene coding for human class 1 heparin-binding growth factor and its expression in fetal tissues JOURNAL Mol. Cell. Biol. 9, 2387-2395 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.-M.Chiu, 13-MAR-1989. FEATURES from to/span description pept + 304 498 heparin-binding growth factor 1, exon 3 /nomgen="FGFA" /map="5q31.3-q33.2" pre-msg < 1 > 498 heparin-binding growth factor 1 mRNA and intron IVS < 1 303 heparin-binding growth factor 1 intron A BASE COUNT 181 a 153 c 142 g 188 t ORIGIN Unknown number of bp after segment 2. 1 tgaggactct tagaagtgct cttatcagta gcatcttaat tactttacaa tggattttaa 61 atggaaagga agtttacaat aatagcaaat gcatattgac agctctttag tgcccggtgc 121 tgttctaagt ccttatgact accctgtgaa ataagttcca ccatgacccc aattttcctg 181 aaaaggagac tgaggcatgg agagctttag tattttgccc aatgtcacac agctagtaaa 241 tggggacccc catgtgaaac tactcactga ttgtcctact ctcttgtggt tttatctttt 301 tagcagacac caaatgagga atgtttgttc ctggaaaggc tggaggagaa ccattacaac 361 acctatatat ccaagaagca tgcagagaag aattggtttg ttggcctcaa gaagaatggg 421 agctgcaaac gcggtcctcg gactcactat ggccagaaag caatcttgtt tctccccctg 481 ccagtctctt ctgattaaag agatctgttc tgggtgttga ccactccaga gaagtttcga 541 ggggtcctca cctggttgac ccaaaaatgt tcccttgacc attggctgcg ctaaccccca 601 gcccacagag cctgaatttg taagcaactt gcttctaaat gcccagttca cttctttgca 661 gagc // LOCUS HUMASAT3 1741 bp ds-DNA PRI 10-AUG-1990 DEFINITION Human alpha satellite and satellite 3 DNA. ACCESSION M21305 KEYWORDS alpha satellite; satellite 3; satellite DNA. SOURCE Human DNA, clone pTR9. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1741) AUTHORS Vissel,B., Earle,E. and Choo,K.H. TITLE A clone containing contiguous satellite 3 and alpha satellite sequences, separated by a direct repeat of 9 bp JOURNAL Unpublished (1989) Royal Children's Hospital, Victoria, Australia STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by B.Vissel 29-MAR-1989. Indirect evidence suggests that this sequence may represent a natural junction sequence between alpha satellite and satellite 3 sequences on chromosome 9. FEATURES from to/span description site 1 1482 human satellite 3 DNA site 1501 1741 human alpha satellite DNA rpt 1483 1491 direct repeat 1A rpt 1492 1500 direct repeat 1B signal 5 9 caat box (put.) signal 71 77 tata box (put.) ORF 38 207 ORF 1 (alt.) ORF 43 172 ORF 1 (alt.) signal 630 634 caat box (put.) signal 718 722 tata box (put.) ORF 752 1109 ORF 2 (alt.) ORF 757 925 ORF 2 (alt.) BASE COUNT 661 a 152 c 534 g 394 t ORIGIN 1 gaattcaata gaatggaatg gtatcagctc gagtggaatg gaatggaatg gaatggaatg 61 gcatggaatc gtataaagtg gaatggaatc aactcgagtg gaatggaatg gaatggaatg 121 gaatggaatg cagtacaatg caatagaatg gaatggaatg aactcgagtt gactggaatg 181 gaatggaatg gaatgcattt gaattgaatg gacctgaaga gaatggaatg gaatggaatg 241 gaatgaagtg gcctcgaatg caatagatta gattgcaacg gaatcatctt gagtggaatg 301 gagtgaaatg gaacggaatg gaatggaatg gaatggaata gaatggaatg gaatggaatg 361 gaatagaatg gaatggaatg gaatggaatg aaatgcaata gaatggaatg gtatcaactc 421 cagtggaatg gaatggaatg caatggaatg gaatggaatg gaatggaatg gaatcaattc 481 acctggaacg gaatggaatg caatggaatg gaatggaatg gaatgcagtg gaatggaatg 541 gaatagaatg gaatggaatg gaatggaatg gaatccactt gagtggaatg gaatggaatg 601 gaatggattc gaacggaatg gaatggaatg caatggaatg gaatgcatgt gattgaatgg 661 acacgaaaag aattgaatgg aaatgaatgg aatcgaatga agtggcctcg aatgcaatat 721 aatggaatgg aatcaagtcg agtggaatgg aatggaatgg gttggaatgg aatggaatgg 781 aatggaatgc attagaatgg aatggaatca actcgagtgg aatggaatgg aatggaatgg 841 aatggaatgg aatgcaatgg aatggaatgg aatggaatgg aatggaatgg aatggaatgc 901 aatggaatgg aatggaatgc aatagaatgg aatggaatca cctcgactgg aatggaatgg 961 aatggaatgc atttgaattg aatggacccg aagagaatgg aatggaatgg aatggaagga 1021 gtggcctcaa atgcaataga atgcaatgga atcatctcga gtggaatgga gtgaaatgga 1081 atggaatgga atggaatgga atgctataga atgaaatgga atgaagtcaa gtggaatgga 1141 atggaatgga atgggttgga agggaatgaa atggaatgca atagaatgga aaggaatgga 1201 atggaatgaa ctggccaatg caatagaatt gaatggaatc aactggagtg gaatggaatg 1261 gaatggaatg gattggaatg gaatgcaata gaatggaatg gaatcaactc gagtaaaatg 1321 gaatggaatg gaatggaatg gaatggaatg gaatggaatt gaatggaatg gaatggaatg 1381 catttgaatt gaatggacac gaaaagaatg gattggaatg gaatggaatg aagtagcctc 1441 gagtgcaatg atatggaatg gcatcaactc gagtggaatg gagtgaaaaa ggtgaaaaag 1501 gaaatacctt cacacaaaaa ctagacagaa gcattctcag aatcttcctt ttgatgtgtg 1561 cattcaactc acagagttga acctttcttt tgatagagca gttttgaaac actctctttg 1621 taggatctgc aactggacat ttggagctct ttgataccta cagtgaaaaa ggaaatatct 1681 tcacataaaa acttcacaga agcattctca aaaacatttt tcagatgtgt gcattgaatt 1741 c // LOCUS DROLINEJA 5154 bp ds-DNA INV 10-AUG-1990 DEFINITION D.melanogaster LINE element J-1. ACCESSION M22874 KEYWORDS LINE element; dispersed repetitive element; long interspersed element. SOURCE D.melanogaster DNA, clone J-1. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 5154) AUTHORS Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V. TITLE The Drosophila mobile element jockey belongs to LINEs and contains coding sequences homologous to some retroviral proteins JOURNAL Gene 70, 253-262 (1988) STANDARD simple staff_entry COMMENT FEATURES from to/span description pept / 414 2165 ORF1, nucleic-acid binding protein (AA at 1) (put.) pept 2162 4912 ORF2, reverse transcriptase (put.) rpt 115 5134 jockey dispersed repetitive element BASE COUNT 1580 a 1226 c 1072 g 1276 t ORIGIN 1 catacccaac ctgaacacgt ttttctcaat atttgtgcca caaaatactt cgccagtaaa 61 ttaacatgaa gtcgctaaat gcgattacaa ttgacgacgc aatcatatac aaaaaaaaat 121 cattcacatg ggagatgagc aatcgagtgg acgtgttcac agaagtcgcg agataaaaca 181 aaaacgtaat tgtgatccat cacaaacatc tgcgcagatc gtgtgcttat ctcacaaaca 241 aaatctattt ttagtcactg cataacggtg acggcttcgg ttcgcgaaac ttatcagcaa 301 ctagcaattt ctaagctgtg ttgtttttgc ccctcgccct gcgcgctgcg caagcgggag 361 gttgttacaa tttaccttac aagtaaaccg gtaaatctta tcgtgtttag taaatatcaa 421 ttgcattata cggcataagt ataaagacaa ttgatataat ggagaattca tttgctcaat 481 cgcgacctag caatgggtgc gataaatttg agaaaatgag gaaagtagca ggtgttgagc 541 caggagaatt acgctcccaa ctccgcgcca gctgtgcagt tgtttcccct aacctggaag 601 gtatgccaac tcaatctgcg gtctccagct taatggtgac aatcagcagc aacaccaatg 661 caagtgttac ctgcactatt tctaacgtac aggccaacat gatctgtact cctacataca 721 ctgattgcac aaccgtgacc actagcattt gcccaactac gccttatgac aatggactgc 781 cgacacctct gtcatcactg cccaataagc catctaaagc gaattgcccc tttcaagcac 841 atgatcgtac tgtcaacagg aaacgaaaag gcgtgtctca gcccccatta cctatcctca 901 ccccttctcc aagccgtaaa actaaaaggc aggccactat gccactcaat gaggaggcct 961 ctacctccac tgcagcagca ttaaataaca atcgcttcgc gcttttgtcc gctgaagcgg 1021 agaatatgga gcaagacgtg tcggatgctg attctgacat tgaagactct gctgcccgag 1081 atggtggtgg acaatccgct aaatatagca aacccccagc catatgcgta ccaagtgtaa 1141 gcgatccggt caccttggaa cgggctctca atctgagcac cggctcctca aactactaca 1201 tccgcatttc tagatttggt gtatccagaa tctatacagc caaccctgat gctttccgca 1261 ccgctgtaaa agaactaaat aagttaaatt gtcaattctg gcatcaccaa cttaaagaag 1321 aaaaacccta cagagtagtg cttaaaggaa tccatgctaa tgttcctagt tcgcagatag 1381 aacaagcatt tagtgatcac ggctatgagg tccttaatat ctattgcccc agaaagtctg 1441 actggaagaa cattcaggta aacgaagatg ataatgaagc tacaaaaaac ttcaaaacta 1501 gacaaaattt gttttatatt aatcttaaac aaggcccgaa tgttaaagag tctcttaaga 1561 taactcgact tggcagatac agagtcactg ttgagcgcgc tacacgtaga aaagaactgc 1621 tacaatgtca aagatgccaa atttttggac actctaagaa ctattgcgcc caggatccta 1681 tttgtggtaa atgtagtggt ccccatatga ccgggttcgc tttgtgcata agtgacgtat 1741 gtctgtgtat aaattgtggt ggtgatcatg tctcgacaga caaaagctgc cctgtcagag 1801 cagagaaagc caagaagcta aaaccaaggt ccaggctacc gatgactaat aatattgcca 1861 cactcaaacc tccacaacgt tcttcaagcg gttacatacc agctgaggca ttaagaacca 1921 acatctctta tgctgatatt gctcgacgca acacgactca atctagggct cgtgctactg 1981 tgcaggctga agttatacca acgtcggaca atagccttaa caataaattt atgacgttag 2041 acaactccat tcgggccatc aatacgagaa tggacgaact atttaagctt atacacgaaa 2101 ctgtagaggc taataaagct ttcagagaac tggttcaggt tctaattaca cgtattccta 2161 aatgactcaa ccaaccttaa aaatcggatt gtggaacgct cgcggattaa caaggggctc 2221 tgaggagctt cggatattcc tcagcgatca cgatatagac gtaatgctta ccacggaaac 2281 acacatgcga gttggtcagc gcatctatct cccagggtat cttatgtatc acgcccacca 2341 ccccagtggt aacagtagag gtggctctgc agtcatcata aaatctagac tttgtcacag 2401 ccctctgaca cctatctcta ctaatgacag gcagatagcg agagtgcacc tgcaaacatc 2461 ggttgggacc gtcactgtag ctgctgttta tctacctcca gcagaaagat ggatagtaga 2521 tgacttcaaa tccatgtttg ctgcgttagg caacaaattt attgctggtg gtgattacaa 2581 tgccaaacat gcatggtggg ggaacccaag atcctgtcct agaggtaaaa tgttgcaaga 2641 agtcattgca catgggcaat accaagttct ggctacgggc gaacccactt tctactctta 2701 caaccctttg ttaacaccat cagcccttga tttttttata acctgtgggt acggcatggg 2761 caggctagat gtacaaactc tccaggaact ctcgtcggac catcttccta ttctggctgt 2821 attgcacgct acgccgttaa agaaaccaca acgcgtacga ctacttgccc ataatgctga 2881 cataaacata ttcaaaaccc atcttgaaca gctgagtgag gtaaatatgc aaattctgga 2941 ggcggtggac attgataatg ccacaagcct tttcatgagc aaactaagtg aggctgctca 3001 gcttgctgca ccgagaaatc ggcatgaagt agaggccttc agaccacttc aacttccttc 3061 cagtatattg gcactgctca ggctaaaacg aagagttcga aaagaatatg ctagaacagg 3121 tgatccccgc atgcaacaga tccacagtag actggccaac tgcctgcata aggcccttgc 3181 tcgaagaaag caggcccaaa tagatacctt cttggataac ttgggtgctg acgcgagcac 3241 aaattactca ctgtggcgta tcacgaaacg gttcaaagct cagcccaccc caaaatcagc 3301 aatcaaaaat ccgtctggtg gctggtgtcg cactagcttg gaaaaaactg aagtgttcgc 3361 taacaacctt gagcaacgtt ttacacccta taactatgca ccggaaagtc tctgtcgtca 3421 ggttgaagaa tacttggaat cgccctttca aatgagcctg cctctgagtg ctgtcacact 3481 ggaagaagtg aagaatttaa tagccaagct gccacttaag aaagctcctg gagaagatct 3541 tcttgataat agaaccatta gacttctccc agatcaagca ttgcagttcc ttgccttaat 3601 attcaacagc gttcttgatg ttggctactt tccgaaagct tggaaatcgg cgagcataat 3661 tatgatccat aagactggaa aaacaccgac agacgttgac tcgtacaggc ccaccagctt 3721 actcccatct ctgggtaaaa ttatggagag gctgatccta aacaggctgc tcacatgcaa 3781 ggatgttacc aaagcgattc ccaaatttca gtttggcttc cggttgcagc acggtactcc 3841 tgagcaacta catagagtag tgaactttgc tctggaagct atggaaaaca aggagtatgc 3901 agtaggtgcc tttcttgata ttcaacaggc atttgacaga gtctggcacc ctgggctcct 3961 gtacaaagcg aagaggctgt tcccgccgca gctatatttg gttgttaaaa gtttcctgga 4021 agaacgcaca ttccacgtct ctgttgatgg gtacaaatca tcaatcaagc caattgcagc 4081 tggagttcct caaggaagcg ttcttggccc aaccctatac tcagtttttg cttcggacat 4141 gcctactcac acaccagtca cagaggtaga cgaagaagat gtgctcatag ccacctacgc 4201 tgacgatact gctgtgctca cgaaaagtaa aagtatcctg gctgccactt ctggtctaca 4261 ggaatacctg gatgcattcc agcaatgggc tgagaactgg aatgtgcgca tcaacgctga 4321 gaagtgtgcc aatgtgacgt tcgccaaccg aacaggtagc tgtccgggtg tcagtctgaa 4381 tggaagactg atcagacacc atcaggctta taaatacctt ggtattaccc tcgataggaa 4441 gctcaccttc agcaggcaca tcacaaatat tcagcaagcg ttcaggacca aggttgctcg 4501 gatgtcttgg ctcattgcac cacgcaacaa actgtcgctt ggctgcaagg tcaatattta 4561 caagtccata ttggccccct gcctgttcta cggcctgcag gtatacggca ttgctgcgaa 4621 gagtcacctt aataagatcc ggattttaca ggcgaagacc ttaagaagaa tttcgggggc 4681 tccttggtat atgagaacaa gagacatcga acgcgacctc aaggtgccca aattaggaga 4741 caagctccag aacatcgccc aaaaatatat ggaaaggctt aatgtacacc ccaacagcct 4801 agcaaggaag ctaggaactg cagctgtggt caatgctgac cctcggacta gagtcaaaag 4861 aagactcaag cgacaccacc ctcatgacct ccctaacctg gttttgacct agaaagtctt 4921 agttttaaaa ttcattagaa taatcaaata aataataatt actatgttat atcaactatt 4981 ataattctcc ctatcatttt tagattaaaa atctgttagt cttaagtaac caagacacat 5041 tgtaaaataa aataatttaa gcagatcaaa ttaagttgcc gcatgggtaa cagtgcgttg 5101 atcaaataat aaaaacatca taaaaaaaaa aaaaatatac aaaattaatg atgg // LOCUS DROLINEJB 490 bp ds-DNA INV 10-AUG-1990 DEFINITION D.melanogaster LINE element J-1. ACCESSION M22875 KEYWORDS LINE element; dispersed repetitive element; long interspersed element. SOURCE D.melanogaster DNA, clone J-0. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 490) AUTHORS Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V. TITLE The Drosophila mobile element jockey belongs to LINEs and contains coding sequences homologous to some retroviral proteins JOURNAL Gene 70, 253-262 (1988) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt < 1 490 jockey dispersed repetitive element BASE COUNT 218 a 64 c 51 g 157 t ORIGIN 1 tttgttatta aaattcatta gaataatcaa ataaataata attactatgt tatatcaact 61 attataattc tccctatcat ttttaggatt aaaaatctgt tagtcttaag taaccaagac 121 acattgtaaa ataaaataat ttaagcagat caaattaagt tgccgcatgg gtaacagtgc 181 gttgatcaaa taataaaaac atcatcatat aaagacctcc ctaacctggt tttgacctag 241 aaagtcttag ttttaaaatt cattagaata atcaaataaa taataattac tatgttatat 301 caactattat aattctccct atcattttta ggattaaaaa tctgttagtc ttaagtaacc 361 aagacacatt gtaaaataaa ataatttaag cagatcaaat taagttgccg catgggtaac 421 agtgcgttga tcaaataata aaaacatcat cgtatataat taaaaaaaaa aaaaaaaaaa 481 aaaaaaaaaa // LOCUS DROLINEJC 367 bp ds-DNA INV 10-AUG-1990 DEFINITION D.melanogaster LINE element J-1. ACCESSION M22876 KEYWORDS LINE element; dispersed repetitive element; long interspersed element. SOURCE D.melanogaster DNA, clone J-4. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 367) AUTHORS Priimaegi,A.F., Mizrokhi,L.J. and Ilyin,Y.V. TITLE The Drosophila mobile element jockey belongs to LINEs and contains coding sequences homologous to some retroviral proteins JOURNAL Gene 70, 253-262 (1988) STANDARD simple staff_entry COMMENT FEATURES from to/span description rpt < 1 367 jockey dispersed repetitive element BASE COUNT 135 a 76 c 53 g 103 t ORIGIN 1 ctatctcccg atctctattg tacaccccaa cagcctagca aggaagctag gaactgcagc 61 tgtggtcaat gctagccctc agcgctagag tcaaaagaag actcaagcga caccaccctc 121 atgacctccc taacctggtt ttgacctaga aagtcttagt tttaaaattc attagaataa 181 tcaaataaat aataattact atgttatatc aactattata attctcccta tcatttttag 241 gattaaaaat ctgttagtct taagtaacca agacacattg taaaataaaa taatttaagc 301 agatcaaatt aagttgccgc atgggtaaca gtgcgttgat caaataataa aaacatcatc 361 gtaaaaa // LOCUS HUMPOVRA 1254 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human poliovirus receptor mRNA, clone H20A. ACCESSION M24407 KEYWORDS oncogene; poliovirus receptor; transmembrane protein; tyrosine kinase; viral receptor. SOURCE Human Hela cell cDNA to mRNA, clone H20A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1254) AUTHORS Mendelsohn,C.L., Wimmer,E. and Racaniello,V.R. TITLE Cellular receptor for poliovirus: Molecular cloning, nucleotide sequence, and expression of a new member of the immunoglobulin superfamily JOURNAL Cell 56, 855-865 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1254; revises [1]) AUTHORS Racaniello,V.R. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence kindly provided by V.Racaniello, 27-APR-1989. FEATURES from to/span description pept 1 1254 poliovirus receptor revision 1020 1020 a in [1]; g in [1] BASE COUNT 255 a 389 c 362 g 248 t ORIGIN 1 atggcccgag ccatggccgc cgcgtggccg ctgctgctgg tggcgctact ggtgctgtcc 61 tggccacccc caggaaccgg ggacgtcgtc gtgcaggcgc ccacccaggt gcccggcttc 121 ttgggcgact ccgtgacgct gccctgctac ctacaggtgc ccaacatgga ggtgacgcat 181 gtgtcacagc tgacttgggc gcggcatggt gaatctggca gcatggccgt cttccaccaa 241 acgcagggcc ccagctattc ggagtccaaa cggctggaat tcgtggcagc cagactgggc 301 gcggagctgc ggaatgcctc gctgaggatg ttcgggttgc gcgtagagga tgaaggcaac 361 tacacctgcc tgttcgtcac gttcccgcag ggcagcagga gcgtggatat ctggctccga 421 gtgcttgcca agccccagaa cacagctgag gttcagaagg tccagctcac tggagagcca 481 gtgcccatgg cccgctgcgt ctccacaggg ggtcgcccgc cagcccaaat cacctggcac 541 tcagacctgg gcgggatgcc caatacgagc caggtgccag ggttcctgtc tggcacagtc 601 actgtcacca gcctctggat attggtgccc tcaagccagg tggacggcaa gaatgtgacc 661 tgcaaggtgg agcacgagag ctttgagaag cctcagctgc tgactgtgaa cctcaccgtg 721 tactaccccc cagaggtatc catctctggc tatgataaca actggtacct tggccagaat 781 gaggccaccc tgacctgcga tgctcgcagc aacccagagc ccacaggcta taattggagc 841 acgaccatgg gtcccctgcc accctttgct gtggcccagg gcgcccagct cctgatccgt 901 cctgtggaca aaccaatcaa cacaacttta atctgcaacg tcaccaatgc cctaggagct 961 cgccaggcag aactgaccgt ccaggtcaaa gagggacctc ccagtgagca ctcaggcata 1021 tcccgtaacg ccatcatctt cctggttctg ggaatcctgg tttttctgat cctgctgggg 1081 atcgggattt atttctattg gtccaaatgt tcccgtgagg tcctttggca ctgtcatctg 1141 tgtccctcga gtacagagca tgccagcgcc tcagctaatg ggcatgtctc ctattcagct 1201 gtgagcagag agaacagctc ttcccaggat ccacagacag agggcacaag gtga // LOCUS HUMPOVRB 1179 bp ss-mRNA PRI 10-AUG-1990 DEFINITION Human poliovirus receptor mRNA, clone H20B. ACCESSION M24406 KEYWORDS oncogene; poliovirus receptor; transmembrane protein; tyrosine kinase; viral receptor. SOURCE Human Hela cell cDNA to mRNA, clone H20B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1179) AUTHORS Mendelsohn,C.L., Wimmer,E. and Racaniello,V.R. TITLE Cellular receptor for poliovirus: Molecular cloning, nucleotide sequence, and expression of a new member of the immunoglobulin superfamily JOURNAL Cell 56, 855-865 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1179; revises [1]) AUTHORS Racaniello,V.R. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence kindly provided by V.Racaniello, 27-APR-1989. FEATURES from to/span description pept 1 1179 poliovirus receptor revision 1020 1020 a in [1]; g in [1] BASE COUNT 234 a 368 c 341 g 236 t ORIGIN 1 atggcccgag ccatggccgc cgcgtggccg ctgctgctgg tggcgctact ggtgctgtcc 61 tggccacccc caggaaccgg ggacgtcgtc gtgcaggcgc ccacccaggt gcccggcttc 121 ttgggcgact ccgtgacgct gccctgctac ctacaggtgc ccaacatgga ggtgacgcat 181 gtgtcacagc tgacttgggc gcggcatggt gaatctggca gcatggccgt cttccaccaa 241 acgcagggcc ccagctattc ggagtccaaa cggctggaat tcgtggcagc cagactgggc 301 gcggagctgc ggaatgcctc gctgaggatg ttcgggttgc gcgtagagga tgaaggcaac 361 tacacctgcc tgttcgtcac gttcccgcag ggcagcagga gcgtggatat ctggctccga 421 gtgcttgcca agccccagaa cacagctgag gttcagaagg tccagctcac tggagagcca 481 gtgcccatgg cccgctgcgt ctccacaggg ggtcgcccgc cagcccaaat cacctggcac 541 tcagacctgg gcgggatgcc caatacgagc caggtgccag ggttcctgtc tggcacagtc 601 actgtcacca gcctctggat attggtgccc tcaagccagg tggacggcaa gaatgtgacc 661 tgcaaggtgg agcacgagag ctttgagaag cctcagctgc tgactgtgaa cctcaccgtg 721 tactaccccc cagaggtatc catctctggc tatgataaca actggtacct tggccagaat 781 gaggccaccc tgacctgcga tgctcgcagc aacccagagc ccacaggcta taattggagc 841 acgaccatgg gtcccctgcc accctttgct gtggcccagg gcgcccagct cctgatccgt 901 cctgtggaca aaccaatcaa cacaacttta atctgcaacg tcaccaatgc cctaggagct 961 cgccaggcag aactgaccgt ccaggtcaaa gagggacctc ccagtgagca ctcaggcata 1021 tcccgtaacg ccatcatctt cctggttctg ggaatcctgg tttttctgat cctgctgggg 1081 atcgggattt atttctattg gtccaaatgt tcccgtgagg tcctttggca ctgtcatctg 1141 tgtccctcga gtgagcatca ccagagctgc cgtaattga // LOCUS MUSSNAP25 2040 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse SNAP-25 mRNA encoding synaptosomal associated protein 25, complete cds. ACCESSION M22012 KEYWORDS synaptosomal associated protein 25. SOURCE Mouse (strain BALB/c) adult neural brain cell, cDNA to mRNA, clones pMuBr8, p8.1, p8.51, p8.52, p8.53, and p8.71. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2040) AUTHORS Oyler,G.A., Higgins,G.A., Hart,R.A., Battenberg,E., Billingsley,M., Bloom,F.E. and Wilson,M.C. TITLE The identification of a novel synaptosomal associated protein, SNAP25, differentially expressed by neuronal subpopulations JOURNAL J. Cell Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.Wilson 18-JAN-1989 FEATURES from to/span description pept 164 784 synaptosomal associated protein 25 mRNA < 1 2037 synaptosomal associated protein 25 mRNA BASE COUNT 613 a 462 c 430 g 535 t ORIGIN Chromosome 2. 1 cccgaggttt ggagctgtct ttccttccct ccctacccgg cggctcctcc actcttgcta 61 cctgcaggga tcagcggaca gcatcctctg aagaagacaa ggttccttaa ctaagcacca 121 ctgacttgct ggccccggcg cccagcaacc ccccaccact accatggccg aggacgcaga 181 catgcgtaat gaactggagg agatgcagag gagggctgac cagctggctg atgagtccct 241 ggaaagcacc cgtcgcatgc tgcagctggt cgaagagagt aaagatgctg gcatcaggac 301 tttggttatg ttggatgagc aaggcgaaca actggaacgc attgaggaag ggatggacca 361 aatcaataag gatatgaaag aagcagaaaa gaatttgacg gacctaggaa aattctgcgg 421 gctttgtgtg tgtccctgta acaagcttaa atccagtgat gcttacaaaa aagcctgggg 481 caataatcag gatggagtag tggccagcca gcctgcccgt gtggtggatg aacgggagca 541 gatggccatc agtggtggct tcatccgcag ggtaacaaac gatgcccggg aaaatgaaat 601 ggatgaaaac ctagagcagg tgagcggcat catcggaaac ctccgtcata tggccctaga 661 catgggcaat gagattgaca cccagaatcg ccagattgac aggatcatgg agaaggctga 721 ctccaacaaa accagaattg atgaagccaa ccaacgtgca acaaagatgc tgggaagtgg 781 ttaaatctgc cgttctgctg tgctgtcctc caatgttgtt ggacaagaga gaagagagct 841 ccttcatgct tctctcatgg tattacctag taagacttac acacacacac acacacacac 901 acacacacac acacacacac acacacacac agagtagtca cccccattgt aaatgtctgt 961 gtggtttgtc agcttcccaa tgataccatg tgtcttttgt tttctccggc tctctttctt 1021 tgccaaaggt tgtacatagt ggtcatctgg tgactctatt tcctgactta agagttcttg 1081 ggtctctctc tttcttttct cagtggcgtt tgctgaatga caacaattta ggaatgctca 1141 atgtactgtt gatttttctc aatacacagt attgttcttg taaaactgtg acttaccaca 1201 gagctactac cacagtcctt tcttagggtg tcaggctctg aatctctcca aatgtgctct 1261 ctttggttcc tcagtgctat tctttgtctt tatgatttca taattagaca atgtgaaatt 1321 acataacagg cattgcacta aaagtgatgt gatttatgca tttatgcatg agaactaaat 1381 agacttttag atcctactta aacaaaaact tccatgacag tagcatactg acaagaaaac 1441 acacacaaca gcaacaataa caaagcaaca actacgcatg ctcagcattg ggacactgtc 1501 aagattaagt cataccagca aaacctgcag ctgtgtcacc ttcttctgtc aacatacaga 1561 ctgatcataa tgatcccttc tttacacaca cacacacaca cacacacaca cacacacaca 1621 caaatggaat ttaaccaact tcccagaatt gatgaagcaa atatatgttt ggctgaaact 1681 attgtaaatg ggtgtaatat agggtttgtc gaatgctttt gaaagctctg ttttccagac 1741 aatactcttg tgtggaaaac gtgaagatct tctaagtctg gctcttgtga tcaccaaacc 1801 ctggtgcatc agtacaacac tttgcgctaa tctagagcta tgcacaacca aattgctgag 1861 atgtttagta gctgataaag aaacctttaa aaaattatat aaatgaatga aatatagata 1921 aactgtgaga taaatatcat tacagcatgt atattaaatc cctcctgtct cctctgttgg 1981 tttgtgaagt gatttgacat tttgtagcta gtttaaaatt attaaaaatt atagatgtta // LOCUS CHKNCAMC1 26 bp ss-mRNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12. ACCESSION M24014 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 1 of 6 SOURCE Chicken embryonic day 10 cardiac muscle cDNA to mRNA, clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 7 to 18) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 26) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) STANDARD full staff_entry FEATURES from to/span description pept < 1 + 18 neural cell adhesion molecule, exon 12 (AA at 1) IVS 19 > 26 intron BASE COUNT 10 a 6 c 5 g 5 t ORIGIN 1 tcaagacaca gccagttcgt aagtaa // LOCUS CHKNCAMC2 29 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12A. ACCESSION M23991 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 2 of 6 SOURCE Chicken embryonic day 10 cardiac muscle DNA, clone 6B1, and cDNA to mRNA, clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 7 to 21) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 6; 22 to 29) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021 STANDARD full staff_entry FEATURES from to/span description pept + 7 + 21 neural cell adhesion molecule exon 12A IVS < 1 6 N-CAM intron IVS 22 > 29 N-CAM intron BASE COUNT 6 a 10 c 6 g 7 t ORIGIN 1 ctgcaggtat ccctcactca cgtaagtgc // LOCUS CHKNCAMC3 54 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12B. ACCESSION M23992 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 3 of 6 SOURCE Chicken embryonic day 10 cardiac muscle DNA, clone 12AC3, and cDNA to mRNA, clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 13 to 46) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 12; 47 to 54) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021 STANDARD full staff_entry FEATURES from to/span description pept + 13 + 46 neural cell adhesion molecule exon 12B IVS < 1 12 N-CAM intron IVS 47 > 54 N-CAM intron BASE COUNT 11 a 13 c 12 g 18 t ORIGIN 1 ttctttcttt agcaagtgct gctgccccca ccatgtttag cagagagtga gtat // LOCUS CHKNCAMC4 62 bp ds-DNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12C. ACCESSION M23993 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 4 of 6 SOURCE Chicken embryonic day 10 cardiac muscle DNA, clone 7C1, and cDNA to mRNA, clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 13 to 54) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 12; 55 to 62) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) Rockefeller U., 1230 York Ave., NYC NY 10021 STANDARD full staff_entry FEATURES from to/span description pept + 13 + 54 neural cell adhesion molecule exon 12C IVS < 1 12 N-CAM intron IVS 55 > 62 N-CAM intron BASE COUNT 18 a 20 c 12 g 12 t ORIGIN 1 atgtttccat agcaactcag cctctccctg catccgaaag cacagagcca ccaagtgagt 61 ag // LOCUS CHKNCAMC5 3 bp ss-mRNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 12D. ACCESSION M23994 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 5 of 6 SOURCE Chicken embryonic day 10 cardiac muscle cDNA to mRNA clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 3) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry COMMENT Exon 12D represents a very small exon. FEATURES from to/span description pept + 1 + 3 neural cell adhesion molecule, exon 12D BASE COUNT 2 a 0 c 1 g 0 t ORIGIN 1 aag // LOCUS CHKNCAMC6 33 bp ss-mRNA VRT 10-AUG-1990 DEFINITION Chicken cardiac neural cell adhesion (N-CAM) gene, exon 13. ACCESSION M37649 J04140 KEYWORDS neural cell adhesion molecule. SEGMENT 6 of 6 SOURCE Chicken embryonic day 10 cardiac muscle cDNA to mRNA clone pEC101B. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 33) AUTHORS Prediger,E.A., Hoffman,S., Edelman,G.M. and Cunningham,B.A. TITLE Four exons encode a 93-base-pair insert in three neural cell adhesion molecule mRNAs specific for chicken heart and skeletal muscle JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 9616-9620 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 33) AUTHORS Prediger,E.A. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT FEATURES from to/span description pept + 13 > 33 neural cell adhesion molecule, exon 13 IVS < 1 12 intron BASE COUNT 9 a 11 c 7 g 6 t ORIGIN 1 ccattgtttc aggggaaccc agtgcaccca aac // LOCUS BADLDH 1147 bp ds-DNA BCT 10-AUG-1990 DEFINITION B.caldolyticus lactate dehydrogenase (LDH) gene, complete cds. ACCESSION M19394 M19386 KEYWORDS lactate dehydrogenase. SOURCE B.caldolyticus (strain DSM 405) DNA, clone pLDHY1. ORGANISM Bacillus caldolyticus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1147) AUTHORS Zuelli,F., Weber,H. and Zuber,H. TITLE Nucleotide sequences of lactate dehydrogenase genes from the thermophilic bacteria Bacillus stearothermophilus, B. caldolyticus and B.caldotenax JOURNAL Biol. Chem. Hoppe-Seyler 368, 1167-1177 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.Zuelli, 27-MAY-1988. FEATURES from to/span description pept 146 1099 lactate dehydrogenase (AC 1.1.1.27) BASE COUNT 282 a 275 c 321 g 269 t ORIGIN 301 bp upstream of NcoI site. 1 tgctcccgtt gtttgccgcg gcgtctgtca tgaaatggac aaacaatagt caaacaatcg 61 ccacaatgcc gcatgcatcg cggtgcgcct ttcgcgtaaa atagtttgtg aaagtgttca 121 cattgtattg agggggatga atgcaatgaa aaacaacggt ggaacccgag tagtggtcat 181 cggcaccggg tttgtcggcg ccagttatgc gtttgcctta atgaatcaag ggattgccga 241 tgagatcgtg ctcatcgatg cgaatgaaag caaggccata ggcgatgcga tggacttcaa 301 ccatgggaaa gtatttgcgc cgaagccggc tgacatttgg cacggcgatt acgatgattg 361 ccgcgatgcc gatttggttg tcatttgcgc cggcgccaac caaaaaccgg gcgagacgcg 421 gcttgatctt gtggacaaaa acattgccat tttccgctcg atcgttgagt cggtcatggc 481 atccggattt caaggactgt ttctcgtcgc caccaatccg gtcgacattt taacgtacgc 541 gacgtggaaa ttcagcggcc tgccgcatga gcgggtgatc ggttcgggga cgattttaga 601 tacggcgcgg ttccgctttt tgttgggcga gtatttctct gtcgctccgc aaaatgttca 661 tgcctatatt attggggaac acggcgacac tgaactcccg gtctggagcc aggctgatat 721 cggcggcgtg ccgatccgca agctggtcga gtccaaaggg gaagaagcgc aaaaagagct 781 cgagcgcatt tttgtcaatg tgcgcgatgc cgcctaccaa attattgaga aaaaaggagc 841 gacgtactac ggaattgcga tgggacttgc ccgcgtgacg cgcgccattt tgcataacga 901 aaacgctatt ttgaccgtat cggcctacct cgatggccca tatggggagc gcgacgtcta 961 catcggcgtg ccggctgtca ttaaccgcaa tggcatccgc gaggtgatcg aaattgaatt 1021 gaatgatgac gaaaaaaatc gattccatca tagcgcagcc acattaaaaa gcgtgctagc 1081 ccgttttttt acgcgatgag gcaacaaacg aaaagttgtg ccaacagtct ttctcgtcca 1141 aacaggg // LOCUS BMELDHA 1423 bp ds-DNA BCT 10-AUG-1990 DEFINITION B.megaterium L-lactate dehydrogenase gene. ACCESSION M22305 KEYWORDS L-lactate dehydrogenase. SOURCE B.megaterium (strain DSM 090) DNA, clone pMH/pML. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1423) AUTHORS Waldvogel,S., Weber,H. and Zuber,H. TITLE Nucleotide sequence of the L-lactate dehydrogenase gene from the mesophilic bacterium B.megaterium. Preparation and properties of a hybrid lactate dehydrogenase comprising moieties of the B. megaterium and B.stearothermophilus enzymes JOURNAL Biol. Chem. Hoppe-Seyler 368, 1391-1399 (1987) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1423; revises [1]) AUTHORS Suter,F. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by S.Waldvogel 02-OCT-1987, and [2] by F.Suter 16-MAY-1989. Author address [2] F.Suter ETH-Hoenggerberg Inst. f. Molekularbiologie und Biophysik CH 8093 ZUERICH Switzerland FEATURES from to/span description pept 250 1206 L-lactate dehydrogenase (AA at 1) BASE COUNT 449 a 263 c 277 g 434 t ORIGIN 1 taacatacaa atagagaatg ttaaactctt tagctttctg caaaacaagt tgcctttact 61 caacacttta aacgtttaag atctttctaa atgtggcact tcatgtagaa ggacattttt 121 ttaaatcgcc ctgtaaaatt gtttactatt ttttaaaggg tgtgattttt atcacagctg 181 aagctattta ttcttgttac actaacattg tgaaaaacat cacaaaataa aattcaaagg 241 atgatacaaa tgaaaacaca atttacacca aaaacacgaa aagttgccgt tatcggaact 301 ggttttgttg gctcaagcta cgctttttca atggtgaatc aaggtattgc caatgaatta 361 gtgttaatcg atatgaacaa agaaaaagca gaaggtgaag cacgtgatat caatcatgga 421 atgccatttg ccacaccgat gaaaatctgg gctggagatt ataaagactg tgctgacgct 481 gatttagcag ttattacagc gggcgctaat caagctccag gggaaacacg cttagatcta 541 gttgaaaaaa acgttaaaat tttcgaatgc attgtaaaag atattatgaa cagcggattt 601 gacggcatca ttttagtggc aacaaatcca gttgatattc tcgcacacgt tacacaaaaa 661 gtatcaggat taccaaacgg acgggtaatt ggttcaggaa cgattcttga cacagctcgc 721 ttccgctact tgttaagcga ctatttcgaa gtagattctc gcaacgtcca cgcttatatt 781 atgggggaac atggagatac ggaatttcct gtttggagcc acgcgcaaat tggcggtgtg 841 aagctcgaac attttatcaa tactgccgct attgaaaaag aaccggatat gcagcatcta 901 ttcgaacaaa cccgcgatgc ggcttaccat attattaatc gaaaaggagc gacttattac 961 ggaattgcaa tggggcttgt acgcattacc aaggctattt tagatgatga aaattctatt 1021 ttaacagtat ctgctttatt agaaggacaa tacggtattt ctgatgtgta tatcggcgta 1081 ccagctatca ttaataaaaa cggcgtgcgt caaattattg aattgaattt aactcctcac 1141 gaacagcagc agctcgagca ctctgctagc attcttaagc aaactcgcga cagagctttt 1201 gtgtaacatc taaagatttt tgcgggggac tcccccgcac cttgttttaa atatgtagta 1261 gaggtgtatg aatatgacgt ggactcaggt atataatcct ttagataata tttggctttc 1321 tgcactaatt gcactcattc ccattatctt tttctttatt gctttaactc ttttgaaatt 1381 aaaaggacac attgcttgcc ggtattacgg tgcttctttc tat // LOCUS BSTLDH 1150 bp ds-DNA BCT 10-AUG-1990 DEFINITION B.stearothermophilus lactate dehydrogenase (LDH) gene, complete cds. ACCESSION M19396 M19386 KEYWORDS lactate dehydrogenase. SOURCE B.stearothermophilus (strain NCIB 8924) DNA, clone pLDHY1. ORGANISM Bacillus stearothermophilus Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1150) AUTHORS Zuelli,F., Weber,H. and Zuber,H. TITLE Nucleotide sequences of lactate dehydrogenase genes from the thermophilic bacteria Bacillus stearothermophilus, B. caldolyticus and B. caldotenax JOURNAL Biol. Chem. Hoppe-Seyler 368, 1167-1177 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by F.Zuelli, 27-MAY-1988. FEATURES from to/span description pept 149 1102 lactate dehydrogenase (EC 1.1.1.27) BASE COUNT 283 a 270 c 321 g 276 t ORIGIN 304 bp upstream of NcoI site. 1 gtgctcccgt tgtttgccgc ggcgtctgtc atgaaatgga caaacaatag tcaaacaatc 61 gccacaatcg cgcatgcatt gcggtgcgcc tttcgcgtaa aatatttata tgaaagtgtt 121 cgcattatat tgagggagga tgaatgcaat gaaaaacaac ggtggagccc gagtagtggt 181 catcggcgcc gggtttgtcg gcgccagtta tgtgtttgcc ttaatgaatc aagggattgc 241 cgatgagatc gtgctcatcg atgcgaatga aagcaaggcc ataggcgatg cgatggactt 301 caaccatggg aaagtatttg cgccgaagcc ggttgacatt tggcacggcg attacgatga 361 ttgccgcgat gccgatttgg ttgtcatttg cgccggcgcc aaccaaaaac cgggcgagac 421 gcggcttgat cttgtggaca aaaacattgc cattttccgc tcgatcgttg agtcggtcat 481 ggcatccgga tttcaaggac tgtttctcgt cgccaccaat ccggtcgaca ttttaacgta 541 cgcgacgtgg aaattcagcg gcctgccgca tgagcgggtg atcggttcgg ggacgatttt 601 agatacggcg cggttccgct ttttgttggg cgagtatttc tctgtcgctc cgcaaaatgt 661 tcatgcctat attattgggg aacacggcga cactgaactc ccggtctgga gccaggctta 721 tatcggcgtc atgccgatcc gcaagctggt cgagtccaaa ggggaagaag cgcaaaaaga 781 tctcgagcgc atttttgtca atgtgcgcga tgccgcctac caaattattg agaaaaaagg 841 agcgacgtac tacgggattg cgatggggct tgcccgcgtg acgcgcgcca ttttgcataa 901 cgaaaacgct attttgaccg tatcagccta cctcgatggc ctatatgggg agcgcgacgt 961 ctacatcgga gtgccggctg tcattaaccg caatggcatc cgcgaggtga tcgaaattga 1021 attgaatgat gacgaaaaaa atcgattcca tcatagcgca gctacattaa aaagcgtgct 1081 agcccgtgct tttacgcgat gaggcaacaa acgaaaagtt gtgccgacag tctttctcgt 1141 ccaaacaggg // LOCUS MUSIGVDJA 351 bp ss-mRNA ROD 10-AUG-1990 DEFINITION Mouse Ig active H-chain mRNA variable region (V186-2 DFL16.1 J2) (AA at 1) anti-NP hybridoma, clone H1-7. ACCESSION M18548 KEYWORDS immunoglobulin; immunoglobulin heavy chain; joining exon; rearranged gene; variable region. SOURCE Mouse (strain C57BL/6J) anti-NP hybridoma mRNA, clone H1-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 351) AUTHORS Blier,P.R. and Bothwell,A. TITLE A limited number of B cell lineages generates the heterogeneity of a secondary immune response JOURNAL J. Immunol. 139, 3996-4006 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by A.Bothwell 03-MAY-1988. FEATURES from to/span description ORF < 1 > 351 Ig heavy-chain V-region (V-D-J2) (AA at 1) recomb 293 294 V(186-2) region end/ D(FL16.1) start recomb 308 309 D(FL16.1) region end/J2 start BASE COUNT 84 a 92 c 89 g 79 t 7 others ORIGIN Unreported. 1 caggtcaaac tgcagcagcc tggggctgag cttgtgaagc ctggggcttc agnnaagctg 61 tcctgcaagg cttctggcta caccttcacc acctncttga tncactgggt gaaacagagg 121 cctggacgag gccttgagtg gattggaagg attgatccta atggtggtgg tactaagtac 181 gatgnaagtt caagagcaag gccacactga ctgtagacga accctccagc acagcctaca 241 tgcagctcag cagcctgaca tctgaggact ctgcggncta ttattgtgca acctattact 301 acggtctcta ctttaactac tgggnaaggc accactctca cagtctcctc a //
GenBank-Updates@genbank.bio.net (08/13/90)
LOCUS MUSCZSOD 535 bp ss-mRNA ROD 13-AUG-1990 DEFINITION Mouse Cu-Zn superoxide dismutase mRNA, complete cds. ACCESSION M35725 KEYWORDS Cu-Zn superoxide dismutase. SOURCE Mouse SWR/J liver, cDNA to mRNA, clone pS61-10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 535) AUTHORS Bewley,G.C. TITLE cDNA and deduced amino acid sequence of murine Cu-Zn superoxide dismutase JOURNAL Nucleic Acids Res. 16, 2728-2728 (1988) STANDARD simple staff_review FEATURES from to/span description pept 5 469 Cu-Zn superoxide dismutase (EC 1.15.11) mRNA < 1 469 Cu-Zn superoxide dismutase mRNA BASE COUNT 148 a 112 c 159 g 116 t ORIGIN 1 aagcatggcg atgaaagcgg tgtgcgtgct gaagggcgac ggtccggtgc agggaaccat 61 ccacttcgag cagaaggcaa gcggtgaacc agttgtgttg tcaggacaaa ttacaggatt 121 aactgaaggc cagcatgggt tccacgtcca tcagtatggg gacaatacac aaggctgtac 181 cagtgcagga cctcatttta atcctcactc taagaaacat ggtggcccgg cggatgaaga 241 gaggcatgtt ggagacctgg gcaatgtgac tgctggaaag gacggtgtgg ccaatgtgtc 301 cattgaagat cgtgtgatct cactctcagg agagcattcc atcattggcc gtacaatggt 361 ggtccatgag aaacaagatg acttgggcaa aggtggaaat gaagaaagta caaagactgg 421 aaatgctggg agccgcttgg cctgtggagt gattgggatt gcgcagtaaa cattccctgt 481 gtggtctgaa gtctcagact caactgctac cctcaaacca ttaaactgta atctg // LOCUS MZECPSIIA 721 bp ds-DNA ORG 13-AUG-1990 DEFINITION Maize chloroplast ribosomal protein S11 (rpS11) gene, complete cds, and rpoA gene, 5' end. ACCESSION M35831 KEYWORDS ribosomal protein S11; rpS11 gene; rpoA gene. SOURCE Maize chloroplast DNA. ORGANISM Chloroplast Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Zea mexicana. REFERENCE 1 (bases 1 to 721) AUTHORS Markmann-Mulisch,U. and Subramanian,A.R. TITLE Nucleotide sequence of maize chloroplast rpS11 with conserved amino acid sequence between eukaryotes, bacteria and plastids JOURNAL Biochem. Int. 17, 655-664 (1988) STANDARD simple staff_review FEATURES from to/span description pept 200 631 ribosomal protein S11 (rpS11) pept 695 > 721 rpoA gene product BASE COUNT 252 a 120 c 147 g 202 t ORIGIN 1 agacaggggt aatctttcga aaaagaagct tttctttcta atagcttttc tttctaatta 61 aagtaaaaag tacccacgga aatgcccaaa ttgcaaataa aaaaaatgaa agtaaaggat 121 atattttaac ctgaaacgga tatctttgta tcttttttct ttttgttatt tctaactcat 181 atttatgaga taataaaata tgacaaaagc tataccaaaa ataggttcac gtaagaaagt 241 gcgtattggt ttgcgtagga atgcccgttt tagtttacgg aagagtgcac gtagaataac 301 aaaagggatt attcatgttc aagctagttt caacaatacc attataaccg ttacagaccc 361 acaaggtcgg gtcgttttct ggtcctccgc aggtacttgt ggattcaaaa gctcaagaaa 421 agcatcaccc tatgctggtc aaagaacagc agtagatgct attcgtacag tgggtttgca 481 acgagcagaa gttatggtaa aaggtgctgg tagcggaaga gatgccgcat tacgagccat 541 tgctaaaagt ggtgtacggt taagttgtat acgcgatgta acacctatgc cgcataatgg 601 atgtcgacct cctaaaaaaa gacgtctgta aaaaaaaatg aaaccgcttt caagagaaat 661 aaacgattca atgatcaaat aatactagtc tgttatggtt cgagaagaga taacaggatc 721 c // LOCUS ACUCPCG 103 bp ds-DNA ORG 13-AUG-1990 DEFINITION A.calamus chloroplast 4.5S rRNA gene. ACCESSION M36166 KEYWORDS 4.5S ribosomal RNA. SOURCE A.calamus chloroplast DNA. ORGANISM Chloroplast Acorus calamus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Arecidae; Arales; Araceae; Acorus calamus. REFERENCE 1 (bases 1 to 103) AUTHORS Bobrova,V.K., Troitsky,A.V., Ponomarev,A.G. and Antonov,A.S. TITLE Low-molecular-weight rRNAs sequences and plant phylogeny reconstruction: Nucleotide sequences of chloroplast 4.5S rRNAs from Acorus calamus (Araceae) and Ligularia calthifolia (Asteraceae) JOURNAL Pl. Syst. Evol. 156, 13-27 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 102 4.5S rRNA BASE COUNT 30 a 20 c 31 g 22 t ORIGIN 1 taaggtcacg gcgagacgag ccgtttatca ttacgatagg tgccaagtgg aagtgcagtg 61 atgtatgcag ctgaggcatc ctaacagacc gagagatttg aac // LOCUS LIGCPCG 103 bp ds-DNA ORG 13-AUG-1990 DEFINITION L.calthifolia chloroplast 4.5S rRNA gene. ACCESSION M36165 KEYWORDS 4.5S ribosomal RNA. SOURCE L.calthifolia chloroplast DNA. ORGANISM Chloroplast Ligularia calthifolia Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Asterales; Asteraceae; Ligularia calthifolia. REFERENCE 1 (bases 1 to 103) AUTHORS Bobrova,V.K., Troitsky,A.V., Ponomarev,A.G. and Antonov,A.S. TITLE Low-molecular-weight rRNAs sequences and plant phylogeny reconstruction: Nucleotide sequences of chloroplast 4.5S rRNAs from Acorus calamus (Araceae) and Ligularia calthifolia (Asteraceae) JOURNAL Pl. Syst. Evol. 156, 13-27 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 1 103 4.5S rRNA BASE COUNT 29 a 20 c 32 g 22 t ORIGIN 1 gaaggtcacg gcgagacgag ccgtttatca ttacgatagg tgtcaagtgg aagtgcagtg 61 atgtatgcag ctgaggcatc ctaacagacc ggtagacttg aac // LOCUS MBICPR1A 128 bp ss-rRNA RNA 13-AUG-1990 DEFINITION M.thermoautotrophicum 5S ribosomal RNA. ACCESSION M36186 KEYWORDS 5S ribosomal RNA; ribosomal RNA. SOURCE M.thermoautotrophicum (strain delta-H; variant 1) RNA. ORGANISM Methanobacterium thermoautotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 128) AUTHORS Willekens,P., Huysmans,E., Vandenberghe,A. and De Wachter,R. TITLE Archaebacterial 5S ribosomal RNA: Nucleotide sequence in two methanogen species, secondary structure models, and molecular evolution JOURNAL Syst. Appl. Microbiol. 7, 151-159 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 128 5S rRNA BASE COUNT 18 a 30 c 41 g 39 t ORIGIN 1 ggggcatggc ggtcatggcg caggggttat acctgatctc gtttcgatct cagtagttaa 61 gtcctgctgc gttgtgggtg tgtactgcgg ttttttgctg tgggaagccc acttcactgc 121 cagccact // LOCUS MBICPR2A 131 bp ss-rRNA RNA 13-AUG-1990 DEFINITION M.thermoautotrophicum 5S ribosomal RNA. ACCESSION M36187 KEYWORDS 5S ribosomal RNA; ribosomal RNA. SOURCE M.thermoautotrophicum (strain delta-H; variant 2) RNA. ORGANISM Methanobacterium thermoautotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 131) AUTHORS Willekens,P., Huysmans,E., Vandenberghe,A. and De Wachter,R. TITLE Archaebacterial 5S ribosomal RNA: Nucleotide sequence in two methanogen species, secondary structure models, and molecular evolution JOURNAL Syst. Appl. Microbiol. 7, 151-159 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 131 5S rRNA BASE COUNT 17 a 30 c 41 g 43 t ORIGIN 1 gggttttggc ggtcatggcg tgggggttta tacctgatct cgtttcgatc tcagtagtta 61 agtcctgctg cgttgtgggt gtgtactgcg gttttttgct gtgggaagcc cacttcactg 121 ccagaccacc c // LOCUS MESCPRA 130 bp ss-rRNA RNA 13-AUG-1990 DEFINITION M.barkeri 5S ribosomal RNA. ACCESSION M36188 KEYWORDS 5S ribosomal RNA; ribosomal RNA. SOURCE M.barkeri (strain DSM 800) RNA. ORGANISM Methanosarcina barkeri Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanomicrobiales; Methanosarcinaceae. REFERENCE 1 (bases 1 to 130) AUTHORS Willekens,P., Huysmans,E., Vandenberghe,A. and De Wachter,R. TITLE Archaebacterial 5S ribosomal RNA: Nucleotide sequence in two methanogen species, secondary structure models, and molecular evolution JOURNAL Syst. Appl. Microbiol. 7, 151-159 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 130 5S rRNA BASE COUNT 27 a 40 c 33 g 30 t ORIGIN 1 ggagtttggc gccatagcgg cagtgtaact cctgtaccca tcccgaacac agtagataag 61 cctgcccgcg ttccttactg tactgaagtg tgcgagcctt cgggaactct ggatcgctgc 121 catactcacc // LOCUS RICCPQBP 1442 bp ds-DNA ORG 13-AUG-1990 DEFINITION O.sativa L. chloroplast Q-B protein (psbA) gene, complete cds. ACCESSION M36191 KEYWORDS Q-B protein; thylakoid membrane protein. SOURCE O.sativa L. (var. Labelle) chloroplast DNA. ORGANISM Chloroplast Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Oryza sativa. REFERENCE 1 (bases 1 to 1442) AUTHORS Wu,N.-H., Cote,J.-C. and Wu,R. TITLE Structure of the chloroplast psbA gene encoding the Q-B protein from Oryza sativa L JOURNAL Dev. Genet. 8, 339-350 (1987) STANDARD simple staff_review FEATURES from to/span description pept 257 1318 Q-B protein (psbA) mRNA 177 > 1442 Q-B protein mRNA BASE COUNT 350 a 275 c 302 g 515 t ORIGIN 1 ggttcgagtc ccgggcaacc catatggaaa ctagaaagga aaaggagcaa tctgagtttt 61 gatttttcac tcacttcatt tacaaaattt tttggtttgg taaattttgt gtatggatat 121 acaactgtcg gggctggctt ggttgacatt ggtatatagt ctatattata ctgttaaata 181 acaagccttc tattatcttt ctagttaata cgtgtgcttg ggagtccttg caatttgaat 241 aaaccaagat cttaccatga ctgcaatttt agagagacgc gaaagtacaa gcctgtgggg 301 tcgcttctgc aactggataa ctagcaccga aaaccgtctt tacatcggat ggttcggtgt 361 tttgatgatc cctaccttat tgaccgcaac ttctgtattt attatcgcct tcatcgctgc 421 ccctccagta gatattgatg gtattcgtga gcctgtttct ggttctttac tttacggaaa 481 caatattatc tctggtgcca ttattcctac ttctgcggcg atcggattgc acttttaccc 541 aatttgggaa gctgcatctg ttgatgagtg gttatacaat ggtggtcctt atgagctaat 601 tgttctacac ttcttacttg gtgtagcttg ttatatgggt cgtgagtggg aactgagttt 661 ccgtctgggt atgcgtcctt ggattgctgt tgcatattca gctcctgttg cagctgctac 721 tgctgttttc ttgatttacc ctattggtca aggaagtttc tctgatggta tgcctttagg 781 aatatctggt actttcaact ttatgattgt attccaggca gagcataaca tccttatgca 841 tccatttcac atgttaggtg tagctggtgt attcggcggt tccctattca gtgctatgca 901 tggttccttg gtaacctcta gtttgatcag ggaaaccact gaaaacgaat ctgctaatga 961 gggttacaga tttggtcaag aggaagaaac ttataatatt gtggccgctc atggttattt 1021 tggccgatta atcttccaat atgctagttt taacaactct cgttctttac acttcttctt 1081 ggctgcttgg cctgtagtag gaatttggtt taccgcttta ggtattagta ctatggcttt 1141 caatctaaac ggattcaatt tcaaccaatc tgtagttgat agccaaggtc gcgttattaa 1201 tacttgggct gatatcatca accgtgctaa tcttggtatg gaagtaatgc acgaacgtaa 1261 tgctcacaac ttccctctag acctagctgc tcttgaagtt ccatctctta atggataagg 1321 cttttctgct aacatatagc aatttttgaa gaaaggaaag ctagaaatac ccaatatctt 1381 gctgaagcaa gatattgggt atttcttttt ttttttattt tgaatctttc tattctgaat 1441 tc // LOCUS RATC7AH7A 3561 bp ss-mRNA ROD 13-AUG-1990 DEFINITION Rat cytochrome P450 cholesterol 7-alpha-hydroxylase (P450 VII) mRNA, complete cds. ACCESSION J05509 KEYWORDS cholesterol 7-alpha-hydroxylase; cytochrome P450. SOURCE Rat (strain Sprague-Dawley) cholestyramine-treated female liver, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3561) AUTHORS Li,Y.C., Wang,D.P. and Chiang,J.Y.L. TITLE Regulation of cholesterol 7-alpha-hydroxylase in the liver: Cloning, sequencing, and regulation of cholesterol 7-alpha-hydroxylase mRNA JOURNAL J. Biol. Chem. 265, 12012-12019 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Chiang,J.Y., 08-MAY-1990, for release after publication. FEATURES from to/span description pept 47 1558 cholesterol 7-alpha-hydroxylase (EC 1.14.13.17) mRNA < 1 1983 P450 VII mRNA BASE COUNT 1048 a 719 c 695 g 1099 t ORIGIN 1 gctaggcaaa gagtctcccc tttggaaatt ttcctgcttt tgcaaaatga tgactatttc 61 tttgatttgg ggaattgccg tgttggtgag ctgttgcata tggtttattg ttggaataag 121 gagaaggaaa gctggtgaac ctcctttgga gaacgggttg attccgtacc tgggctgtgc 181 tctgaaattt ggatctaatc ctcttgagtt cctaagagct aatcaaagga agcatggtca 241 cgtttttacc tgcaaactga tggggaaata tgtccatttc atcacaaact ccctgtcata 301 ccacaaagtc ttatgtcatg gaaaatattt tgactggaaa aaatttcatt acactacttc 361 tgcgaaggca tttggacaca gaagcattga cccaaatgat ggaaatacca cggaaaatat 421 aaacaacact tttaccaaaa ccctccaggg agatgctctg tgttcacttt ctgaagccat 481 gatgcaaaac ctccaatctg tcatgagacc tcctggcctt cctaaatcaa agagcaatgc 541 ctgggtcacg gaagggatgt atgccttctg ttaccgagtg atgtttgaag ccggctatct 601 aacactgttt ggcagagata tttcaaagac agacacacaa aaagcactta ttctaaacaa 661 ccttgacaac ttcaaacaat ttgaccaagt ctttccggca ctggtggcag gccttcctat 721 tcacttgttc aagaccgcac ataaagctcg ggaaaagctg gctgagggat tgaagcacaa 781 gaacctgtgt gtgagggacc aggtctctga actgatccgt ctacgtatgt ttctcaatga 841 cacgctctcc acctttgacg acatggagaa ggccaagacg cacctcgcta tcctctgggc 901 atctcaagca aacaccattc ctgcaacctt ttggagctta tttcaaatga tcaggagtcc 961 tgaagcaatg aaagcagcct ctgaagaagt gagtggagct ttacagagtg ctggccaaga 1021 gctcagctct ggagggagtg ccatttactt ggatcaagtg caactgaatg acctgccggt 1081 actagacagc atcatcaagg aggctctgag gctttccagt gcatccttga atatccgcac 1141 agctaaggag gacttcactc tccatcttga ggacggttcc tataacatcc gaaaagatga 1201 catgatagct ctttatccac agttaatgca cttggatcct gaaatctacc cagacccttt 1261 gactttcaaa tatgaccggt accttgatga aagcgggaaa gcaaagacca ccttctacag 1321 taatggaaac aagctgaagt gtttctacat gcccttcgga tcaggcgcga caatatgtcc 1381 tggaagactc tttgccgtcc aagaaatcaa gcagtttttg atcctgatgc tctcctgctt 1441 tgaactggag tttgtggaga gccaagtcaa gtgtccccct ctagaccagt cccgggcagg 1501 cttgggaatt ttgccaccac tacatgatat tgagtttaaa tataaactga aacactgata 1561 cgtggttgga agaagcgaac actggatgat gtcacttggc ggctgagagt catcacttaa 1621 acaggccttc gggaccaatg ctcactgatg cgccctagcg actggattag tgggaagaac 1681 tttgttctcg ctgcccacat tcctgggtgt tcacatagct ggggccagag cttcatcact 1741 ttcagaaagc aatgtctttt gtatttattt tcaaaatgaa gatattccaa ttggcaggat 1801 atttttccta aggaaattgc tttatatttt tatgaaaact accaattaat tatgaaaggc 1861 ttgaaattca cgttttagtg aaattactga tttttcacta gtaaggttct tcaggtgtga 1921 aactgtatta taaaaatgtt gtaatgggtc acactgtgct ttgcataaag gtaaaggaaa 1981 ctatgtttca gccttttctg tgtctatgag attcgaaaat aatcttactg ttctagaaac 2041 actggggaga atccgacatc gtctagctat attttatttt actgttgcga gaaattttaa 2101 ttccagtttt caactacctt atctttcccc cattttgaca tgcatgccaa tgagaagagt 2161 attttttagg aattaacaag gcacctccca gaaccctacc ctgagacttt taagccttta 2221 atcccagcac tcgagaagta agagccaggc aagatctctg agtctgaggt tattctggtc 2281 tacatcagct ccagataagc caggactaca gaatgggatc ttgtctaaaa aatacagcta 2341 atctttatgt cataactgat tatgaatcaa cctaaaagat aaattttcaa tcaggactca 2401 gagaaaatga gcaattaaaa aacttagctc tgaggtatgt ggaattcatt aagtacaagt 2461 tgacattaca tgttctttaa aaatagttta tgttttatct ctaaatgccc tgcagatgaa 2521 gaataataat gaaaagttga ataatactgt ttaaacacta agtgcaataa tgctttggta 2581 atgtacttta agagaatcat tagccgtgcc agttttacta aaatatattt atatgtaaat 2641 tatatttatc tttttcttat accataaata taaaaatatt gcaacattta ggtaattttt 2701 aaaattacat acctttcaga aaattcgaat ggtatgaatg tttgtatgtt ttttaacttt 2761 gaacagaaca tttaaattat tcatctacgg tgatttttat cttatttatt tctttttgtc 2821 tcattcatat cttgaagaaa tccaaaaata tctgaaggaa tcgctcactc aaatctctcc 2881 ctatcgttac agaaaaattc aataccatct ctttttcctc ggggactgaa gcagggtctc 2941 gtgggtcccg agcagaggct cctgctgcag cgagctttat ccacgggact ccttaaacct 3001 tttaaaatct tatcactatt atcatgcagt tattacctaa gtaggatatt tccctttcct 3061 ttttcatttc agcagagtcc cttagcaacc caggctgact gggaccctcc atgtagctta 3121 agctgtgaac tcactgtact tctgttttca cttattttag gaagtaattt tccctatcag 3181 aaattttaat tgtttagatg atgtataaga gtaacacaat tctgttatat actaatctgt 3241 agtaactaat tgttcttaga cagttgatga ctctcaattg atgatcatac atctttccat 3301 ggcttcttga atgcccattt ctcatacaca gaataatggg tttcacggtg atatcttcct 3361 ttcatgtctt tattcttgtg cggtgatggt tggcaaatga tacccatgga gcaaggttac 3421 tcttcctatt tctgtgcagc ctaagtgtta agaaataaat tttttaaata cttagaggga 3481 aggcacattt tgtgtcatat gtgagtgaca tgtgacacac agactagcaa tccatgagta 3541 aaattttatt gggattaccg g // LOCUS MUSTEAA 2397 bp ss-mRNA ROD 13-AUG-1990 DEFINITION Mouse tea (20.5) mRNA, complete cds. ACCESSION M32484 KEYWORDS cell surface protein. SOURCE Mouse (strain AKR/J) T-lymphoma cell line SL12.4.10, cDNA to mRNA, clone 20.5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2397) AUTHORS MacLeod,C.L., Finley,K., Kakuda,D., Kozak,C.A. and Wilkinson,M.F. TITLE Activated T cells express a novel gene on chromosome 8 that is closely related to the murine exotropic retroviral receptor JOURNAL Mol. Cell. Biol. 10, 3663-3674 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.L.MacLeod, 02-MAR-1990, for release after publication. FEATURES from to/span description pept 410 1771 Tea gene product (20.5) BASE COUNT 548 a 571 c 587 g 691 t ORIGIN Chromosome 8. 1 gggtgtcttt cctcatcgct gccctggcct cggttatggc cggcctttgc tatgctgaat 61 ttggggcccg agtacccaag actggatctg cgtatctata cacttacgtc acggtcggag 121 agctgtgggc cttcatcact ggctggaatc tcatcctgtc atatgtcata ggtacgtcca 181 gtgtcgcaag agcatggagt ggcacctttg acgaacttct taataaacag attggccagt 241 ttttcaaaac gtacttcaaa atgaattaca ctggtctggc agagtatcca gacttctttg 301 ccgtgtgcct tgtattactc ctggcaggtc ttttatcttt tggagtaaaa gagtctgctt 361 gggtgaataa atttttacag ctattaatat cctggtcctt ctctttgtca tggtggctgg 421 gtttgtgaaa ggaaatgtgg ctaactggaa gatcagtgaa gagtttctca aaaatatatc 481 agcaagtgct agagaaccac cttctgagaa cggaacaagc atctacgggg ctggcggctt 541 tatgccctat ggctttacag ggacgttggc tggtgctgca acgtgctttt atgcctttgt 601 gggctttgac tgcattgcaa caaccggtga agaggttcgg aatccacaaa aggcgatccc 661 catcggaata gtgacgtcct tacttgtctg ctttatggct tactttgggg tttctgcagc 721 tttaacgctt atgatgcctt actacctcct ggatgagaaa agtccactcc cagtcgcgtt 781 tgagtatgtc agatggggcc ccgccaaata cgttgtcgca gcaggctccc tctgcgcctt 841 atcaacaagt cttcttggat ccattttccc aatgcctcgt gtaatctatg ctatggcgga 901 ggatgggttg cttttcaaat gtctagctca aatcaattcc aaaacgaaga caccagtaat 961 tgctactttg tcatcgggtg cagtggcagc tgtgatggcc tttctttttg acctgaaggc 1021 cctcgtggac atgatgtcta ttggcaccct catggcctac tctctggtgg cagcctgtgt 1081 gcttattctc aggtaccaac ctggcttgtg ttacgagcag cccaaataca cccctgagaa 1141 agaaactctg gaatcatgta ccaatgcgac tttgaagagc gagtcccagg tcaccatgct 1201 gcaaggacag ggtttcagcc tacgaaccct cttcagcccc tctgccctgc ccacacgaca 1261 gtcggcttcc cttgtgagct ttctggtggg attcctggct ttcctcatcc tgggcttgag 1321 tattctaacc acgtatggcg tccaggccat tgccagactg gaagcctgga gcctggctct 1381 tctcgccctg ttccttgtcc tctgcgctgc cgtcattctg accatttgga ggcagccaca 1441 gaatcagcaa aaagtagcct tcatggtccc gttcttaccg tttctgccgg ccttcagcat 1501 cctggtcaac atttacttga tggtccagtt aagtgcggac acttggatca gattcagcat 1561 ctggatggcg cttggctttc tgatctattt cgcctatggc attagacaca gcttggaggg 1621 taaccccagg gacgaagaag acgatgagga tgccttttca gaaaacatca atgtagcaac 1681 agaagaaaag tccgtcatgc aagcaaatga ccatcaccaa agaaacctca gcttaccttt 1741 catacttcat gaaaagacaa gtgaatgttg atgctggccc tcggtcttac cacgcatacc 1801 ttaacaatga gtacactgtg gccggatgcc accatcgtgc tgggctgtcg tgggtctgct 1861 gtggacatgg cttgcctaac ttgtacttcc tcctccagac agcttctctt cagatggtgg 1921 attctgtgtc tgaggagact gcctgagagc actcctcagc tatatgtatc cccaaaacag 1981 tatgtccgtg tgcgtacatg tatgtctgcg atgtgagtgt tcaatgttgt ccgttattag 2041 tctgtgacat aattccagca tggtaattgg tggcatatac tgcacacact agtaaacagt 2101 atattgctga atagagatgt attctgtata tgtcctaggt ggctggggaa atagtggtgg 2161 tttctttatt aggtatatga ccatcagttt ggacatactg aaatgccatc ccctgtcagg 2221 atgtttaaca gtggtcatgg gtggggaagg gataaggaat gggcattgtc tataaattgt 2281 aatgcatata tccttctcct acttgctaag acagctttct taaacggcca gggagagtgt 2341 ttctttcctc tgtatgacaa gatgaagagg tagtctgtgg ctggagatgg ccaatcc // LOCUS MUS195A 1183 bp ss-mRNA ROD 13-AUG-1990 DEFINITION Mouse 19.5 mRNA, complete cds. ACCESSION M32486 KEYWORDS cell surface protein. SOURCE Mouse (strain AKR/J) T lymphoma cell line AKR/J SL12.4, cDNA to mRNA, clone 19.5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1183) AUTHORS MacLeod,C., Fong,A., Seal,B., Walls,L. and Wilkinson,M. TITLE Isolation of novel cDNA clones from T-lymphoma JOURNAL Cell Growth and Differ. 1, 271-279 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by C.L. MacLeod, 02-MAR-1990, for release after publication. FEATURES from to/span description pept 55 876 19.5 protein BASE COUNT 297 a 250 c 255 g 381 t ORIGIN Chromosome 16. 1 cctaactgac aaagtgggga gagtaaggtg tgcgcaaaca ggacaagttg ggtcatgggg 61 agtttcaaag gacatgctct ccctgggagt ttcttcttcg ccatgggctt ttggtggact 121 atgaagaaca tcctgaaatc tgtctacaaa aggcaaactc gaacctgcta ccttaactct 181 aaaacattat tacgtcggac agagatttgg gaaggagttg ttgtgctttt aatgtctctc 241 actggtatag ctggtgaaca gtttatctca ggaggacctg ccttgatctt gcataaagat 301 ggccagtgga accagatcct gggctggcat cacacaacca tgtacttatt ctttgggcta 361 cagggtataa cccaaatcat atgtttcact actaatgtac ttccactttc ctcaagcaag 421 ttaatgttat caattgccat ctttgtggag acatttatgt tctacaacca cacacacggt 481 cgggaaatga ttgacatttt tgtacaccaa cttctggtct tcgttggcac attttcgggt 541 ctggttgcct tcttggagtt cctcgtaaag aacaacgcac ttctggagct cctgcggtgc 601 agtctcctca tgtttcaagg aacctggttc tggcagatgg cgtttgtgct gtacccccca 661 tgtggaagtg ctacatggaa cctgtcagat attcaaaata aaatgtttct ctcaatgtgc 721 ttttgctggc attatgcatc aatccttatc ctcattggag taaaatatgc tttggccaac 781 tggttagtca agtctaggct gaggaagggc tgcacctcag aagttggact cctgaagcat 841 gctgaccgtg agcaagaatc agaagaagaa gtatgatctt gaagtctttc ttgataagcc 901 ttctcccttt gcgttgcctt tgttcatggc tttgtttcct gacctctggt ctcaagaaca 961 cttgtctgag gctgactcca tgctgtttgt acttccagtt ttgttaaagt gttggacttt 1021 aagtatctta ctttcagctc tgaaagaacc atgagtgata aattcacttt ttacactgtg 1081 catgccatgt aattcaagac caatcataat tgttttccaa agtttagttt cgtgtccatt 1141 tattaaaaat attttttttt attttccggg tagatacctt caa // LOCUS MUSCYR61A 2018 bp ss-mRNA ROD 13-AUG-1990 DEFINITION Mouse Cyr61 mRNA, complete cds. ACCESSION M32490 KEYWORDS cyr61 product. SOURCE Mouse (strain BALB/c) fibroblast cell line 3T3, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2018) AUTHORS O'Brien,T.P., Yang,G.P., Sanders,L. and Lau,L.F. TITLE Expression of cyr61, a growth factor-inducible immediate-early gene JOURNAL Mol. Cell. Biol. 10, 3569-3577 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by L.F.Lau, 02-MAR-1990, for release after publication. FEATURES from to/span description pept 190 1329 Cyr61 product mRNA < 1 2018 Cyr61 mRNA site 1453 1551 49 bases repeat BASE COUNT 500 a 536 c 527 g 455 t ORIGIN 1 agaccgtgag cgagagcgcc ccagagaagc gcctgcaatc tctgcgcctc ctccgccagc 61 acctcgagag aaggacaccc gccgcctcgg ccctcgcctc accgcactcc gggcgcattt 121 gatcccgctg ctcgccggct tgttggttct gtgtcgccgc gctcgccccg gttcctcctg 181 cgcgccacaa tgagctccag caccttcagg acgctcgctg tcgccgtcac ccttctccac 241 ttgaccagac tggcgctctc cacctgcccc gccgcctgcc actgccctct ggaggcaccc 301 aagtgcgccc cgggagtcgg gttggtccgg gacggctgcg gctgctgtaa ggtctgcgct 361 aaacaactca acgaggactg cagcaaaact cagccctgcg accacaccaa ggggttggaa 421 tgcaatttcg gcgccagctc caccgctctg aaagggatct gcagagctca gtcagaaggc 481 agaccctgtg aatataactc cagaatctac caaaacgggg aaagcttcca gcccaactgt 541 aaacaccagt gcacatgtat tgatggcgcc gtgggctgca ttcctctgtg tccccaagaa 601 ctgtctctcc ccaatctggg ctgtcccaac ccccggctgg tgaaagtcag cgggcagtgc 661 tgtgaagagt gggtttgtga tgaagacagc attaaggact ccctggacga ccaggatgac 721 ctcctcggac tcgatgcctc ggaggtggag ttaacgagaa acaatgagtt aatcgcaatt 781 ggaaaaggca gctcactgaa gaggcttcct gtctttggca ccgaaccgcg agttcttttc 841 aaccctctgc acgcccatgg ccagaaatgc atcgttcaga ccacgtcttg gtcccagtgc 901 tccaagagct gcggaactgg catctccaca cgagttacca atgacaaccc agagtgccgc 961 ctggtgaaag agacccggat ctgtgaagtg cgtccttgtg gacaaccagt gtacagcagc 1021 ctaaaaaagg gcaagaaatg cagcaagacc aagaaatccc cagaaccagt cagatttact 1081 tatgcaggat gctccagtgt caagaaatac cggcccaaat actgcggctc ctgcgtagat 1141 ggccggtgct gcacacctct gcagaccaga actgtgaaga tgcggttccg atgcgaagat 1201 ggagagatgt tttccaagaa tgtcatgatg atccagtcct gcaaatgtaa ctacaactgc 1261 ccgcatccca acgaggcatc gttccgactg tacagcctat tcaatgacat ccacaagttc 1321 agggactaag tgcctccagg gttcctagtg tgggctggac agaggagaag cgcaagcatc 1381 atggagacgt gggtgggcgg aggatgaatg gtgccttgct cattcttgag tagcattagg 1441 gtatttcaaa actgccaagg ggctgatgtg gacggacagc agcgcagccg cagttggaga 1501 atgccaaggg gctgatgtgg acggacagca gcgcagccgc agttggagaa gacttcgctt 1561 catagtactg gagcgggcat tattgctcca tattggagca tgtttacgga tgacgttctg 1621 ttttctgttt gtaaattatt tgctaagtgt atttttttgc tccagacccc cccccccttt 1681 cttggttcta caattgtaat agagacaaaa taagattagt tgggccaagt gaaagccctg 1741 cttgtccttt gacagaagta aatgaaagcg cctctcattc cttcccgagc ggagggggac 1801 actctgtgag tgtccttggg gcagctacct gcactctaaa actgcaaaca gaaaccaggt 1861 gttttaagat tgaatgtttt tttatttatc aaagtgtagc ttttggggag ggaggggaaa 1921 tgtaatactg gaataatttg taaatgattt taattttata tcagtgaaga gaatttattt 1981 ataaaattaa tcatttaata aagaaatatt tacctaaa //
GenBank-Updates@genbank.bio.net (08/14/90)
LOCUS RATNCAMC 1195 bp ds-DNA ROD 14-AUG-1990 DEFINITION Rat neural cell adhesion molecule (NCAM-C) gene, exon 1. ACCESSION M32612 KEYWORDS neural cell adhesion molecule. SOURCE Rat (strain Sprague-Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1195) AUTHORS Chen,A., Reyes,A. and Akeson,R. TITLE Transcription initiation sites and structural organization of the extreme 5' region of the rat neural cell adhesion molecule gene JOURNAL Mol. Cell. Biol. 10, 3314-3324 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Akeson 06-MAR-1990, for release after publication. FEATURES from to/span description pept 1142 / 1193 neural cell adhesion molecule, exon 1 (NCAM-C) pre-msg 947 > 1193 NCAM-C mRNA and introns (alt.) pre-msg 948 > 1193 NCAM-C mRNA and introns (alt.) pre-msg 950 > 1193 NCAM-C mRNA and introns (alt.) pre-msg 951 > 1193 NCAM-C mRNA and introns (alt.) pre-msg 952 > 1193 NCAM-C mRNA and introns (alt.) pre-msg 955 > 1193 NCAM-C mRNA and introns (alt.) pre-msg 956 > 1193 NCAM-C mRNA and introns (alt.) IVS 1194 > 1195 NCAM-C intron 1. site 307 484 homopurine-homopyrimidine region BASE COUNT 324 a 275 c 333 g 263 t ORIGIN 1 gaattcttgc tcttgctcct gtgccgtaat acacaccgaa aagccagggc agcagtggag 61 ggagaggaag agaggattcc aatacttgca agcagtttct ggtaatttct gaatactact 121 tctaaatgtc tctgcatttt cttctaactg cccccccgca cccccccccc acttctgaga 181 tctgagttgt gctgctgtga gttgctccct ctgtgggaag gtaaacttca taggtaagag 241 aaaacaaaca acaaagcaaa cctccacaat cctcatttct ctccaaattt ccaatttcct 301 tcacttagga ggaagaggag gaagaggagg aagagggtga ggaggaggaa gaggaagaag 361 aggaagaaga ggaagaagag gaagaagagg aagaagagga agaagaggat gagggaaagg 421 aggagaagag gaggaggagg aggaggaaga aaaggagaag aaggagaagg agaaaaggag 481 gaagccattt gcttctattg caaggacttt agtaactaca tggggtgggt gtgggggctc 541 tacaactttg aaaagcgagc tgaatctaaa aattctttct ttttttttcc taattattta 601 aacgttcaaa ttcccgatta aggaacgctg gagggcagaa ggatctccaa aggcttgagt 661 gaaggaaaag ggtgaaaaga agaaatccca gcttctgcct ggaaggttct gtgtgaaaga 721 gcccggctcc cttggtgact ccaggccgtg ttttgcaggc ggccgcatct gcctcccctg 781 tctcttacct cctggatgtt aggaactatt tgtggtcggc gtggtggaag gacacagtga 841 ggttctcacc tccgcccccc gcccgtcgct cgcatcccca gttccatcaa agccaacccg 901 ggccagcgca gggatctccg agttgcgagt gtgctgaggc tggggctgtc actcattctc 961 cgctcagcgg tgaacgcagc tcggcagtgg ctggcaagaa acaattctgc aaaaataatc 1021 atacccagcc tggcaattgt ctgctcctcg gtccattgct ccgccgccgt ccacagtcgc 1081 ttgcaaggga aggcactgaa tttaccgcgg ccagaacatc cctcccagcc ggcagtttac 1141 aatgctgcga actaaggatc tcatctggac tttgtttttc ctgggaactg caggt // LOCUS YSCTEC1A 2308 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae TEC1 protein gene, complete cds. ACCESSION M32797 KEYWORDS TEC1 protein. SOURCE S.cerevisiae (strain GRF18) haploid cell DNA, clone pBC43-2. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2308) AUTHORS Laloux,I., Dubois,E., Dewerchin,M. and Jacobs,E. TITLE TEC1, a gene involved in the activation of Ty1 and Ty1-mediated gene expression in Saccharomyces cerevisiae: Cloning and molecular analysis JOURNAL Mol. Cell. Biol. 10, 3541-3550 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [1] kindly submitted by I.M.Laloux 13-MAR-1990. FEATURES from to/span description pept 395 1855 TEC1 protein (TEC1) signal 289 294 TATA box BASE COUNT 781 a 513 c 410 g 604 t ORIGIN Chromosome 2, right arm. 1 tgcgtgtttc agattgcgtt tgcttattat tctgggaaac tctccggaaa caagctgagc 61 tggactccac gaatggtccc tcacaaggct cttggctcat gcacagttaa aaagcacctg 121 tagctaccat cagcaatcca tggtgctgga gtttctctcg atggtacggc ccatagctgg 181 agaaaatagc ttattaccaa aaaagtaatt cccccgcaga ggtatatatt aagtttgaaa 241 caaaaggctt gcttaatttt ttacatatga caaccctttt cttactttta taaatgttta 301 tgaaacagga gattcttcga ctggtgccag attcctccca tatcgacaag aagaataatc 361 cacctatttc aacaattctg atacctgttt aaccatgagt cttaaagaag acgactttgg 421 caaggataat tctagaaata tagaatcata tactggtaga atttttgacg tatatataca 481 aaaagattcg tattcacagt cggccttgga tgatatgttt ccagaagccg tagtttcaac 541 cgccgcttgt gtgaaaaatg aagcggagga taacatcaat ctcatagaca cgcatcctca 601 attcgaactg gtaaatactg gactgggtgc taaatcggac gatttgaaat ctccatcagc 661 aaaggctacg ttcactgaca agcagaggaa gaatgaagta ccaaatatat ctgtgagcaa 721 ctactttccc ggacaaagta gcgaaacgtc gtcaacaacg gaatcttgga ctatcggttg 781 tgataagtgg tcagaaaagg tagaagaggc attccttgag gcacttagac tgataatgaa 841 aaatgggacc acaaaaataa aaataagaaa tgccaatttt ggaagaaacg agctgatttc 901 attatatatc aagcacaaaa ccaacgagtt cagaaccaaa aagcaaattt cttcccatat 961 tcaagtctgg aagaagacca tacaaaacaa aatcaaggac tcgctgaccc tatcatcaaa 1021 ggagaaggag cttctacacc ttatcgaaca tggcgctgaa caaactactg aaaactcaaa 1081 cctgttttat gacatatttg aagaaattat cgactctcta ccttcagtca gtgattctgg 1141 aagtttaacc cctaaaaacc tctatgtaag taataatagc agtggattgt cagtacattc 1201 aaaactgctt acgccaatca ctgcttccaa cgagaaaaaa attgaaaatt tcataaaaac 1261 taatgctgca tctcaagcca aaacccccct catttacgct aagcacattt atgaaaacat 1321 agacggctac aagtgcgttc cgtcaaagag gcctcttgaa caactttccc ccacggaact 1381 ccaccaggga gatcgcccca ataaggctag cttttccaac aagaaggcaa tcctggagag 1441 tgcaaaaaaa atcgaaatag agcagagaaa gataatcaac aaataccaaa gaatttcccg 1501 catacaagaa catgaaagta atcctgagtt cagttccaat tccaattccg gttcagagta 1561 cgaatcggag gaagaagtag tcccaagatc agccacagtc acacaactcc aaagcagacc 1621 agtgccatac tacaagaata atggaatgcc ctactcactc tccaaagtac gaggaaggcc 1681 catgtatcca agacctgctg aagatgctta caatgccaat tatattcaag gtctgcccca 1741 gtaccaaaca tcttattttt cgcagctgtt attatcatca ccccagcatt acgaacattc 1801 tccacatcaa aggaacttta cgccatccaa ccaatcgcat gggaactttt attaaatgtc 1861 tacatacata catacatctc gtacataaat acgcatacgt atcttcgtag taagaaccgt 1921 cacagatatg attgagcacg gtacaattat gtattagtca aacattacca gtctcgaaca 1981 aaagctactc ctgcaacact cttctatcgc acatgtatgg ttcttattgt ttcccgagtt 2041 cttttttact gacgcgccag aacgagtaag aaagttctct agcgccatgc tgaaattttt 2101 ttcacttcaa cggacagcga ttttttttct ttttcctccg aaataatgtt gcagcggttc 2161 tcgatgcctc aagaattgca gaagtaaacc agccaataca catcaaaaaa caactttcat 2221 tactgtgatt ctctcagtct gttcatttgt cagatattta aggctaaaag gaaatgttgt 2281 cgagactatc tttattgagt aactcgag // LOCUS YSCACO1A 3130 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae aconitase (ACO1) gene, complete cds. ACCESSION M33131 KEYWORDS aconitase; isocitrate hydro-lyase. SOURCE S.cerevisiae (strain S288C) DNA, clone SG7-1. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3130) AUTHORS Gangloff,S.P., Marguet,D. and Lauquin,G.J.-M. TITLE Molecular cloning of the yeast mitochondrial aconitase gene (ACO1) and evidence of a synergistic regulation of expression by glucose plus glutamate JOURNAL Mol. Cell. Biol. 10, 3551-3561 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.J.-M.Lauquin 21-MAR-1990, for release after publication. FEATURES from to/span description pept 648 2987 aconitase (ACO1) (EC 4.2.1.3) binding 357 364 HAP2-HAP3 binding site signal 503 513 TATA box BASE COUNT 907 a 604 c 668 g 951 t ORIGIN 1 gaattccaaa ggctcacaaa tggcattccg tactgatata cttcgcactt tacatatgct 61 tatataaaaa tgccagtttc ccatcgtcaa ttacgcagag gtagccttcg tttattttct 121 ctctttttgt atatcattgt attaatcata atccattgta ttttactttg tcttatctgg 181 ctttaatgaa tgacgccggt cacacgcggg tgcctgattc tcgattgtgc caagccattt 241 gggcacggtg tcaaattacc taaaaaatgg ccgagagccg caaaagggag gtccgcgggg 301 ccgggcaata ccctttgttt ttcgagcatt tcggcgccga aatcggaaag gtcctgacca 361 atcaatagag aaattagtgc atacgagaaa aatttgaacc cttcagttgt tctccgcagg 421 cgactttaac catcaaacct ccaatcgctg ccggttttcg gaaaggcaag cacaaaaagg 481 gaggacaagg aaaatctttt gttatatata taatcttccg gttttagagg ttaattagga 541 ggagtatgtt gcttaattcc gttgtccctt gttctgttca ctctttctta gttattacat 601 agtagaacga aggtaataaa tactatcatt attactatag atatacaatg ctgtctgcac 661 gttctgccat caagagaccc attgttcgtg gtcttgcgac agtctccaac ttgactagag 721 attcaaaagt caaccaaaac ttattagaag atcattcttt tattaactac aagcagaatg 781 tggaaacgct ggatatcgta agaaaaagat taaacaggcc atttacctac gcggaaaaga 841 ttttgtacgg tcacttggat gaccctcatg gtcaagatat tcagagaggt gtttcatacc 901 taaaattaag accagatcgt gttgcctgtc aagatgctac tgctcaaatg gctattttac 961 aatttatgtc cgctggttta ccacaggttg ctaagccagt cactgtccac tgtgaccatt 1021 tgattcaagc acaagttggt ggtgaaaaag atttgaagag agctatagat ctaaacaagg 1081 aagtttatga tttcttggcc tctgccactg cgaaatataa catgggtttc tggaagccag 1141 gttccggtat cattcaccaa attgttctgg aaaactacgc tttcccaggt gctttgatca 1201 ttggtactga ctcccataca ccaaatgctg gtggtttagg tcaattggct attggtgttg 1261 gtggtgctga tgccgttgat gttatggcag gtcgtccatg ggaattgaag gctccaaaga 1321 tcttaggtgt taagttgact ggtaagatga acggttggac ttctccaaag gatattattt 1381 tgaaattggc tggtatcaca actgtcaaag gtggtactgg taaaattgtt gaatattttg 1441 gtgatggtgt tgacactttc tccgctactg gtatgggtac catttgtaat atgggtgctg 1501 aaatcggtgc taccacatct gttttcccat tcaacaaatc tatgattgaa tatttggaag 1561 caactggtcg tggtaagatc gctgactttg ctaaattata ccacaaggat ctattatctg 1621 ctgataagga tgctgaatac gatgaggtcg tcgaaattga cttgaacact ctggaaccat 1681 acatcaatgg gccatttacc cccgatttgg ctactccagt ttctaagatg aaggaagttg 1741 ctgttgctaa taactggcca ttggatgtca gagtcggttt gatcggttct tgtaccaatt 1801 cctcttatga agatatgtct cgttcagcat ccattgtcaa ggacgctgct gctcatggtt 1861 tgaaatccaa gaccattttc actgttactc caggttctga acaaatcaga gccactattg 1921 aacgtgatgg ccaattagaa accttcaaag aatttggtgg tatcgttttg gcaaacgcct 1981 gtggcccatg tattggtcaa tgggatcgta gagatatcaa gaaaggtgac aagaatacta 2041 ttgtttcctc ttacaacaga aatttcactt ctagaaatga tggtaaccca caaactcatg 2101 cttttgttgc atctccagaa ttagtaactg cgttcgccat tgcgggtgat ttgagattca 2161 accctctaac agacaaatta aaggacaagg atggtaatga gttcatgttg aaaccaccac 2221 atggtcgatg gtttgcctcg aaagaggtta tgatgctggt gagaacactt accaagctcc 2281 acctgcagac cgtagccacc gttgaagtta aagtttctcc aacttcagac cgtctacaac 2341 tgttgaaacc attcaaacct tgggatggta aggatgctaa agacatgcca atcttgatta 2401 aggccgtcgg taagacaact actgatcata tttctatggc tggtccatgg ttgaaataca 2461 gaggtcattt agaaaacatt tctaataact atatgattgg tgctattaat gctgaaaaca 2521 agaaggctaa ctgtgttaaa aatgtatata ctggtgaata caaaggtgtt ccagacactg 2581 ctagagatta cagagaccaa ggtatcaagt gggttgttat tggtgatgaa aactttggtg 2641 aaggttcctc tcgtgaacac gctgctttgg aaccaagatt cttgggcggt ttcgctatca 2701 tcacaaagtc tttcgctcgt atccatgaaa ctaacttgaa aaaacaaggt ctattgccat 2761 tgaacttcaa gaacccagct gactatgaca agatcaaccc tgatgacaga atcgatattc 2821 tgggtctagc tgaattggct ccaggtaagc ctgtaacaat gagagttcat ccaaagaatg 2881 gtaagccatg ggatgctgtg ttgacccata ctttcaacga tgagcaaatt gaatggttca 2941 aatatggttc tgccttaaat aaaattaagg ccgatgagaa gaaataatga aaacattgtt 3001 ataatctttt aaaggttatt atttattttg tcttctgtac acgtaccctt gtttatcttt 3061 tctgccttaa atttaatgac gttcggctgg agaagtcaag actatgaaat atatctcgta 3121 atttatgatc // LOCUS HIVTRA1 59 bp ss-RNA VRL 14-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), gag region. ACCESSION M27956 KEYWORDS . SEGMENT 1 of 2 SOURCE Human immunodeficiency virus type 1 (isolate TRA), clone 01. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 59) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL j acquired immuno defic syndr 2, 344-52 (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer readable form by S. Wain-Hobson. Isolated in Paris. Sibling sequences are available (see 1989: I-A-169). FEATURES from to/span description pept < 1 > 59 gag protein (AA at 2) BASE COUNT 26 a 8 c 10 g 15 t ORIGIN 1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacca // LOCUS HIVTRA2 268 bp ss-RNA VRL 14-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), envelope region. ACCESSION M27948 KEYWORDS . SEGMENT 2 of 2 SOURCE Human immunodeficiency virus type 1 (isolate TRA), clone 01. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 268) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL j acquired immuno defic syndr 2, 344-52 (1989) STANDARD full staff_review COMMENT Sibling sequences are available (see 1989: I-A-170). FEATURES from to/span description pept < 1 > 268 env protein (AA at 1) BASE COUNT 95 a 46 c 54 g 73 t ORIGIN 1 caattaaccc cactctgtgt tactttaaat tgcactgatt atttggggaa tgctactaat 61 accactagta gtagcggggg agggatggag agaggagaaa taaaaaactg ctctttcaat 121 atcacgacaa gcataagaga taagatgcag agagaatatg cactttttta taaacttgat 181 gtagtaccaa cagataatga taatactagc tataggttga taagttgtaa cacctcagtc 241 attacacagg cctgtccaaa agtatcct // LOCUS FLAH1424 1749 bp ss-RNA VRL 14-AUG-1990 DEFINITION Influenza A/Mallard/Gurjev/263/82 hemagglutinin subtype H14 gene. ACCESSION M35997 KEYWORDS hemagglutinin. SOURCE Influenza A/H14/263/82 RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (sites) AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R. TITLE Molecular characterization of a new hemagglutinin, subtype H14, of influenza A virus JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1749) AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Kawaoka, 27-JUN-1990. Author address: Y.Kawaoka St. Jude Childrens Research Hospital 332 N. Lauderdale P.O. Box 318, Memphis, TN 38101 BASE COUNT 575 a 360 c 412 g 402 t ORIGIN 1 agcaaaagca ggggaaaatg attgcactca tattggttgc actggctctg agccacactg 61 cttattctca gatcacaaat gggacaacag gaaaccccat tatatgcttg gggcatcatg 121 cagtggaaaa cggcacatct gttaaaacac taacagacaa tcacgtagaa gttgtgtcag 181 ctaaagaatt agttgagacg aaccacactg atgaactgtg cccaagcccc ttgaagcttg 241 tcgacgggca agactgccac ctcatcaatg gtgcattggg gagtccaggc tgtgaccgtt 301 tgcaggacac cacttgggat gtcttcattg aaaggcccac tgcagtagac acatgttatc 361 cattcgacgt cccagattac cagagtctca gaagcatcct agcaagcagt gggagtttgg 421 agttcatcgc cgaacaattc acctggaatg gtgtcaaagt tgacggatca agcagtgctt 481 gtttgagggg cggtcgcaac agcttcttct cccgactaaa ctggctaacc aaagcaacaa 541 atggaaacta tggacctatt aacgtcacta aagaaaatac gggctcttat gtcaggctct 601 atctctgggg agtgcatcac ccatcaagcg ataatgagca aacggatctc tacaaggtgg 661 caacagggag agtaacagta tctacccgct cggaccaaat cagtattgtt cccaatatag 721 gaagtagacc gagggtaagg aatcagagcg gcaggataag catctactgg accctagtaa 781 acccagggga ctccatcatt ttcaacagta ttgggaattt gattgcacca agaggccact 841 acaaaataag caaatctact aagagcacag tgcttaaaag tgacaaaagg attgggtcat 901 gcacaagccc ttgcttaact gataaaggtt cgatccaaag tgacaaacct tttcagaatg 961 tatcaaggat tgctatagga aactgcccga aatatgtaaa gcaagggtcc ctgatgttag 1021 caactggaat gcgcaacatc cctggcaaac aggcaaaggg cttatttggg gcaattgctg 1081 gattcattga aaatggttgg caaggcctga ttgatgggtg gtatggattc aggcaccaaa 1141 atgctgaagg aacaggaact gctgcagacc tgaagtcaac tcaggcagcc attgatcaga 1201 taaatggcaa gctgaacaga ttgatagaga agacaaatga aaaatatcac caaatagaaa 1261 aggaattcga acaggtggaa ggaagaatac aagaccttga gaagtacgtt gaggacacta 1321 agattgattt gtggtcatac aatgctgaat tgctagtagc actagagaat cagcacacaa 1381 tagatgtcac agactccgaa atgaacaagc tttttgaaag agtaagaagg caattaagag 1441 agaatgcaga agatcaaggc aacggttgtt tcgagatatt ccatcagtgt gacaacaatt 1501 gtatagaaag cattagaaac ggaacttatg accacaacat ctacagggat gaagccatca 1561 acaatcgaat caaaataaat cctgtcactt tgacgatggg gtacaaggac ataatcctgt 1621 ggatttcttt ctccatgtca tgctttgtct tcgtggcact gattctggga tttgttctat 1681 gggcttgtca aaacgggaat atccgatgcc aaatctgtat ataaagaaaa aacacccttg 1741 tttctactc // LOCUS FLAH14244 1716 bp ss-RNA VRL 14-AUG-1990 DEFINITION Influenza A/Mallard/Gurjev/244/82 hemagglutinin subtype H14 gene. ACCESSION M35996 KEYWORDS hemagglutinin. SOURCE Influenza A/H14/244/82 RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (sites) AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R. TITLE Molecular characterization of a new hemagglutinin, subtype H14, of influenza A virus JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1716) AUTHORS Kawaoka,Y., Yamnikova,S., Chambers,T., Lvou,D. and Webster,R. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by Y.Kawaoka, 27-JUN-1990. Author address: Y.Kawaoka St. Jude Childrens Research Hospital 332 N. Lauderdale P.O. Box 318, Memphis, TN 38101 BASE COUNT 562 a 346 c 405 g 403 t ORIGIN 1 ttggttgcac tggctctgag ccaaactgct tattctcaga tcacaaatgg gacaacagga 61 aaccccatta tatgcttggg gcaccatgca gtggaaaacg gcacatctgt taaaacacta 121 acagacaatc acgtggaagt tgtgtcagct aaagaattag ttgagacgaa acacactgat 181 gaactgtgcc caagcccctt gaagcttgtc gacgggcaag actgcgacct catcaatggt 241 gcattgggga gtccaggctg tgatcgtttg caggacacca cttgggatgt cttcattgaa 301 aggcccactg cagtagacac atgttatcca ttcgacgtcc cagattacca gagtctcaga 361 agcattctag caagcagtgg gagtctggag ttcatcgccg aacaattcac ctggaatggt 421 gtcaaagttg acggatcaag cagtgcttgt ttgaggggcg gtcgcaacag cttcttctcc 481 cggctaaact ggctaaccaa agcaacaaat ggaaactatg gacctattaa cgtcactaaa 541 gaaaatacgg gctcttatgt caggctctat ctctggggag tgcatcaccc atcaagcgat 601 aatgagcaaa cggatctcta caaggtggca acagggagag tgacagtatc tacccgctcg 661 gatcaaatca gtattgttcc caatatagga agtagaccga gagtaaggaa tcagagcggc 721 aggataagca tctactggac cctagtaaac ccaggggact ccatcatttt caacagtatt 781 gggaatttga ttgcaccaag aggccattac aaaataagta aatctactaa gagcacagtg 841 cttaaaagtg acaaaaggat tgggtcatgc acaagccctt gcttaactga taaaggttcg 901 atccaaagtg acaaaccttt tcagaatgta tcaaggattg ctataggaaa ctgcccgaag 961 tatgtaaagc aagggtcctt gatgttagca actggaatgc gcaacatccc tggcaaacag 1021 gcaaagggct tatttggggc aattgctgga ttcattgaaa atggttggca aggcctgatt 1081 gatgggtggt atggattcag gcaccaaaat gctgaaggaa caggaactgc tgcagacctg 1141 aagtcaactc aggcagccat tgatcagata aatggcaagc tgaacagatt aatagaaaaa 1201 acaaatgaaa aatatcacca aatagaaaag gaattcgaac aggtggaagg aagaatacaa 1261 gaccttgaga agtacgttga ggacactaag attgatttgt ggtcatacaa tgctgaattg 1321 ctagtcgcac tagagaatca gcacacaata gatgtcacag actccgaaat gaacaagctt 1381 tttgaaagag taagaaggca attaagagag aatgcagaag atcaaggcaa cggttgtttc 1441 gagatattcc atcagtgtga caacaattgt atagaaagta ttagaaacgg aacttatgac 1501 cacaacatct acagggatga agccatcaac aatcgaatca aaataaatcc tgtcactttg 1561 acgatggggt acaaggacat aatcctgtgg atttctttct ccatgtcatg ctttgtcttc 1621 gtggcactga ttttgggatt tgttctatgg gcttgtcaaa acgggaatat ccgatgccaa 1681 atctgtatat aaagaaaaaa cacccttgtt tctact // LOCUS DOGCPLA01 458 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 1 (non-coding). ACCESSION M28140 J04961 KEYWORDS pancreatic lipase. SEGMENT 1 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 458) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pre-msg 386 > 458 pancreatic lipase mRNA and intron IVS 419 > 458 pancreatic lipase intron A signal 274 277 CAAT box signal 351 354 TATA box BASE COUNT 133 a 94 c 99 g 132 t ORIGIN 1 ttggagggca tttccaccta ttctaagtga gaaatactac cctccattat tggattttgc 61 tgctgtgttc ttactggaac ttgtaaacta tttgatgata acatagaaaa tgggaaagag 121 gggccgcatt ggctcagaat tacatgcaca ggtacatcgg tacaaaggta gggatctttt 181 acctcctttt aatcaagcat tctttcatct tcccagactg tgggaagatc tgactactat 241 aattagtctt gttttgtgta tttgagcaga ttacaatatt aagtgcacac agcgtgttct 301 atcatcacct gctgtttcct tgaagcggcc tccaaatgat aaggctccag tataaaaaaa 361 aaaaaaaagg cctgggtact tgattgaaag ggagggaagg cccagaacat cccacagggt 421 aagtcacctc ttctgtcctc ccaccagggt tgaagcag // LOCUS DOGCPLA02 129 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 2. ACCESSION M28141 J04961 KEYWORDS pancreatic lipase. SEGMENT 2 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 129) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept 41 + 89 pancreatic lipase precursor, exon 2 (EC 3.1.1.3) (first expressed exon) sigp 41 + 89 pancreatic lipase signal peptide IVS < 1 40 pancreatic lipase intron A IVS 90 > 129 pancreatic lipase intron B BASE COUNT 36 a 26 c 37 g 30 t ORIGIN About 38 bp after segment 1. 1 aataatcggt gagactgaga atgttccgaa tttattgtag atggtgagca tctggacaat 61 cgcgctgttt ctgctgggag cagccaaagg taggaaacgt cacaccggcc cctcgggaag 121 gacttaaat // LOCUS DOGCPLA03 235 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 3. ACCESSION M28142 J04961 KEYWORDS pancreatic lipase. SEGMENT 3 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 235) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 195 pancreatic lipase precursor, exon 3 sigp + 41 42 pancreatic lipase signal peptide matp 43 + 195 pancreatic lipase IVS < 1 40 pancreatic lipase intron B IVS 196 > 235 pancreatic lipase intron C BASE COUNT 59 a 66 c 59 g 51 t ORIGIN About 506 bp after segment 2. 1 ggggagctaa gccgtgagca gagtcatcct tcctctacag caaaggaagt ttgctatgaa 61 caaatcggat gcttttctga cgcggagccc tgggccggga cagcaatcag gcctctgaaa 121 gttctcccct ggagccccga gagaatcggc acccgcttcc tgctctacac caacaagaac 181 ccaaacaact ttcaagtgag acctctgtca ttttagtgcc attgtgactg gagtg // LOCUS DOGCPLA04 206 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 4. ACCESSION M28143 J04961 KEYWORDS pancreatic lipase. SEGMENT 4 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 206) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 166 pancreatic lipase precursor, exon 4 matp + 41 + 166 pancreatic lipase IVS < 1 40 pancreatic lipase intron C IVS 167 > 206 pancreatic lipase intron D BASE COUNT 55 a 60 c 44 g 47 t ORIGIN About 445 bp after segment 3. 1 ggctacaaaa taatgcgacc ctttcccact tttatcacag actctccttc cctctgatcc 61 atcaacgatt gaggcatcaa attttcaaac agacaagaag acccggttca tcatccatgg 121 cttcatagac aagggagaag agaactggct gctggatatg tgcaaggtag gggccggctc 181 ccaggctcac agtctgccct gctccc // LOCUS DOGCPLA05 215 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 5. ACCESSION M28144 J04961 KEYWORDS pancreatic lipase. SEGMENT 5 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 215) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 175 pancreatic lipase precursor, exon 5 matp + 41 + 175 pancreatic lipase IVS < 1 40 pancreatic lipase intron D IVS 176 > 215 pancreatic lipase intron E BASE COUNT 44 a 58 c 67 g 46 t ORIGIN About 2090 bp after segment 4. 1 gtgcccctgt catgacatcc tggtgtctct ccaacatcag aacatgttca aggttgagga 61 ggtgaactgc atctgcgtgg actggaagaa aggttcccaa acctcatata cacaggctgc 121 caacaacgtg cgggtggtgg gcgcccaggt ggcccagatg ctcagcatgc tctcggtgag 181 tctgccggct gggctcctgc tgaggtaggc agtgc // LOCUS DOGCPLA06 189 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 6. ACCESSION M28145 J04961 KEYWORDS pancreatic lipase. SEGMENT 6 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 189) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 149 pancreatic lipase precursor, exon 6 matp + 41 + 149 pancreatic lipase IVS < 1 40 pancreatic lipase intron E IVS 150 > 189 pancreatic lipase intron F BASE COUNT 49 a 55 c 49 g 36 t ORIGIN About 957 after segment 5. 1 accacaaaaa tgttctgaaa atatgatctt ccttctccag gcaaactaca gctactcacc 61 ttcccaagtc cagctcatcg gccacagcct gggagcccac gtggctgggg aggcagggag 121 caggactcca ggtctgggca ggattacagg taagacccaa ggggcaggct ccagattttg 181 tccccagta // LOCUS DOGCPLA07 380 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 7. ACCESSION M28146 J04961 KEYWORDS pancreatic lipase. SEGMENT 7 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 380) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 1 + 120 pancreatic lipase precursor, exon 7 matp + 1 + 120 pancreatic lipase IVS 121 > 380 pancreatic lipase intron G BASE COUNT 99 a 89 c 101 g 91 t ORIGIN About 1340 bp after segment 6. 1 ggttggatcc tgtagaagca agtttccagg gcactcctga agaggttcga cttgatccca 61 ctgatgctga ctttgttgat gtgattcaca cagatgcagc tcccctgatc ccattcctgg 121 gtgagcccaa tgacgcttta gttgcaaaca cgtgcaataa tgtttgaacc aaaaagagcc 181 ctgtgtggtg aaaagctcat tgctgctcca aacatttcag gttttggaac aagccaacag 241 atgggtcacc ttgacttctt ccccaatggt ggagaggaaa tgccaggatg caagaagaat 301 gccctgtcac agatcgtgga cctagatggc atttgggaag gtaaagccca agcagaacgg 361 ggcgtcactt ccctggggtg // LOCUS DOGCPLA08 198 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 8. ACCESSION M28147 J04961 KEYWORDS pancreatic lipase. SEGMENT 8 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 198) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 159 pancreatic lipase precursor, exon 8 matp + 41 + 159 pancreatic lipase IVS < 1 40 pancreatic lipase intron G IVS 160 > 198 pancreatic lipase intron H BASE COUNT 38 a 61 c 40 g 59 t ORIGIN About 1366 bp after segment 7. 1 ccccaaccac ccgtgcagac ttccttgttt ctttctccag gaactcggga cttcgtggct 61 tgcaatcacc tgagaagtta caagtattac tcagagagca tcctcaaccc tgatggattt 121 gcttcctacc cctgtgcttc ctacagggcc tttgaatctg taagctatcg tccttccttg 181 agtccctcgt ggacggtt // LOCUS DOGCPLA09 211 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 9. ACCESSION M28148 J04961 KEYWORDS pancreatic lipase. SEGMENT 9 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 211) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 42 + 171 pancreatic lipase precursor, exon 9 matp + 42 + 171 pancreatic lipase IVS < 1 41 pancreatic lipase intron H IVS 172 > 211 pancreatic lipase intron I BASE COUNT 55 a 56 c 41 g 59 t ORIGIN About 675 bp after segment 8. 1 gatttacatc ttacagctgg ctctctctct tttctcctta gaacaagtgc ttcccctgcc 61 cagatcaagg gtgcccacag atgggtcact atgctgataa atttgctgtc aagacaagtg 121 atgagacaca gaaatacttc ctgaacaccg gagattccag caattttgct cgtaagtttt 181 accttgaccc tgctcaccca tgtcaaggag a // LOCUS DOGCPLA10 189 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 10. ACCESSION M28149 J04961 KEYWORDS pancreatic lipase. SEGMENT 10 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 189) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 149 pancreatic lipase precursor, exon 10 matp + 41 + 149 pancreatic lipase IVS < 1 40 pancreatic lipase intron I IVS 150 > 189 pancreatic lipase intron J BASE COUNT 47 a 38 c 43 g 61 t ORIGIN About 1871 bp after segment 9. 1 gtgcatgagt caattgtact cttttctatc ccctgtgcag gctggagata cggggtttct 61 ataacattgt ctgggaaaag agccactggt caggctaaag ttgctttgtt tggaagtaag 121 ggaaatactc atcaattcaa tatcttcaag taagtttcct ctttcagcgt cacgtctcac 181 tggatgcta // LOCUS DOGCPLA11 248 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 11. ACCESSION M28150 J04961 KEYWORDS pancreatic lipase. SEGMENT 11 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 248) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 41 + 208 pancreatic lipase precursor, exon 11 matp + 41 + 208 pancreatic lipase IVS < 1 40 pancreatic lipase intron J IVS 209 > 248 pancreatic lipase intron K BASE COUNT 72 a 42 c 59 g 75 t ORIGIN About 756 after segment 10. 1 gtttctatgc atgagaatgt ttctatctct ttatttcaag ggggattctc aaaccaggct 61 ctactcattc caatgagttt gatgcaaagc ttgatgttgg aacaattgag aaagtcaagt 121 ttctttggaa taacaacgtg gtaaacccaa cctttcccaa agtgggtgca gccaagatca 181 ccgtgcaaaa gggagaggag aaaacagtgt atgtatatct ttgctggatt gtgtctaagg 241 atgttggt // LOCUS DOGCPLA12 184 bp ds-DNA MAM 14-AUG-1990 DEFINITION Canine pancreatic lipase gene, exon 12. ACCESSION M28151 J04961 KEYWORDS pancreatic lipase. SEGMENT 12 of 12 SOURCE Canine spleen DNA, clones L[13,20,4]. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 184) AUTHORS Mickel,F.S., Weidenbach,F., Swarovsky,B., LaForge,K.S. and Scheele,G.A. TITLE Structure of the canine pancreatic lipase gene JOURNAL J. Biol. Chem. 264, 12895-12901 (1989) STANDARD simple staff_review FEATURES from to/span description pept + 37 100 pancreatic lipase precursor, exon 12 matp + 37 97 pancreatic lipase pre-msg < 1 156 pancreatic lipase mRNA and intron IVS < 1 36 pancreatic lipase intron K signal 135 140 poly-A signal BASE COUNT 44 a 60 c 39 g 41 t ORIGIN About 3005 bp after segment 11. 1 tcacaacact tacaaagctc tcattttggt ctgcaggcac agcttctgca gcgaaagcac 61 cgtgagggag gatgttctgc tcaccctcac gccctgttaa cgtccaggcg caacccgggc 121 accgtgttaa cagcaataaa aaccactgat gcatttaccc actcccacgt ctgtcgctgc 181 tgtc // LOCUS MZECPSECX 395 bp ds-DNA ORG 14-AUG-1990 DEFINITION Maize chloroplast 50S ribosomal protein (secX) gene, complete cds. ACCESSION M35956 KEYWORDS 50S ribosomal protein. SOURCE Maize chloroplast DNA. ORGANISM Chloroplast Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Zea mexicana. REFERENCE 1 (bases 1 to 395) AUTHORS Markmann-Mulisch,U., von Knoblauch,K., Lehmann,A. and Subramanian,A.R. TITLE Nucleotide sequence and linkage map position of the secX gene in maize chloroplast and evidence that it encodes a protein belonging to the 50S ribosomal subunit JOURNAL Biochem. Int. 15, 1057-1067 (1987) STANDARD simple staff_review FEATURES from to/span description pept 165 278 50S ribosomal protein (secX) binding 151 154 secX ribosomal binding site (put.) BASE COUNT 153 a 55 c 74 g 113 t ORIGIN 1 gaagcgtacc gaagactcga aggatactga agatttgaag gataccaaag attcaaagga 61 ttaggttttt ctggggtaat aacttccaag tttcaaaatt taagtgaaga gacttatttt 121 ttccaaaaga atagattcat agtttaagaa aggaataccc atatatgaaa ataagagctt 181 ccgttcgtaa aatttgtaca aaatgtcgac tgattcgcag gcgtgggcga attagagtta 241 tttgttccaa tccgaagcat aaacaaagac aggggtaatc tttcgaaaaa gaagcttttc 301 tttctaatag cttttctttc taattaaagt aaaaagtacc cacggaaatg cccaaattgc 361 aaataaaaaa aatgaaagta aaggatatat tttaa // LOCUS ATHTGYA 210 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana tRNA-Tyr gene. ACCESSION M35957 KEYWORDS transfer RNA-Tyr. SOURCE A.thaliana leaf DNA, clone pAtT1. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 210) AUTHORS Stange,N., Gross,H.J. and Beier,H. TITLE Wheat germ splicing endonuclease is highly specific for plant pre-tRNAs JOURNAL EMBO J. 7, 3823-3828 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA 71 107 Tyr-tRNA, exon 1 120 155 Tyr-tRNA, exon 2 RNA 71 181 precursor Tyr-tRNA (+/- 5 bp) IVS 108 119 Tyr-tRNA intron (no splice consensus) BASE COUNT 59 a 37 c 44 g 70 t ORIGIN 1 ctaacttaat acaaatcatt gtgttcatag aatatttaga tcagtacaca tgcatgaaat 61 agaatacaat ccgaccttag ctcagttggt agagcggagg actgtagtag acgcagatta 121 tccttaggtc actggttcga atccggtagg tcggaatttg ctcccacatg agagcttttt 181 atttttcttt cgttgtgaca ttaaggtttt // LOCUS ATHTGYB 210 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana tRNA-Tyr gene. ACCESSION M35958 KEYWORDS transfer RNA-Tyr. SOURCE A.thaliana leaf DNA, clone pAtT3. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 210) AUTHORS Stange,N., Gross,H.J. and Beier,H. TITLE Wheat germ splicing endonuclease is highly specific for plant pre-tRNAs JOURNAL EMBO J. 7, 3823-3828 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA 71 107 Tyr-tRNA, exon 1 120 155 Tyr-tRNA, exon 2 RNA 71 178 precursor Tyr-tRNA (+/- 5 bp) IVS 108 119 Tyr-tRNA intron (no splice consensus) BASE COUNT 66 a 34 c 41 g 69 t ORIGIN 1 tcataagttt aaatttgatt aaaaagcttc tcttgaatca tcaacatgca taaagtgtta 61 taatacaaaa ccgaccttag ctcagttggt agagcggagg actgtagttg acgcagataa 121 tccttaggtc actggttcga atccggtagg tcggatcatt aaaattgaaa gttttttttt 181 tccatttcgt ttcaaagtgc ttagacaaga // LOCUS STMCHOPA 3104 bp ds-DNA BCT 14-AUG-1990 DEFINITION Streptomyces sp. cytochrome-P450-like protein (choP) gene and cholesterol oxidase (choA) gene, complete cds. ACCESSION M31939 J03356 KEYWORDS cholesterol oxidase; cytochrome-P450-like protein. SOURCE Streptomyces sp. (strain SA-COO) DNA, clone pCO-[1,3]. ORGANISM Streptomyces sp. Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 963 to 3104) AUTHORS Ishizaki,T., Hirayama,N., Shinkawa,H., Nimi,O. and Murooka,Y. TITLE Nucleotide sequence of the gene for cholesterol oxidase from a Streptomyces sp JOURNAL J. Bacteriol. 171, 596-601 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1450) AUTHORS Horii,M., Ishizaki,T., Paik,S.-Y., Manome,T. and Murooka,Y. TITLE An operon containing the gene for cholesterol oxidase and a cytochrome P-450-like protein from Streptomyces sp JOURNAL J. Bacteriol. 172, 3644-3653 (1990) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence for [1] kindly provided by Y.Murooka, 01-DEC-1989. Draft entry and printed sequence for [2] kindly submitted by Y.Murooka, 02-FEB-1990, for release after publication. FEATURES from to/span description mRNA 100 > 1283 choP mRNA pept 141 1286 cytochrome-P450-like protein (choP) pept 1361 3001 cholesterol oxidase precursor (choA, EC 1.1.3.6) sigp 1361 1486 cholesterol oxidase signal peptide matp 1487 2998 cholesterol oxidase mRNA 1384 > 3104 choA mRNA signal 65 70 -35 region signal 89 94 -10 region binding 129 134 choP ribosomal binding site (put.) binding 1340 1345 choA ribosomal binding site (put.) BASE COUNT 518 a 1234 c 932 g 420 t ORIGIN 1 ggtacccccg gtaaccgaac cccgtgcccg tacccgcagg tagcccgccg ggggaacgat 61 ctcgttgaca gccttcacat cgcctccata cggtcatttc gacggcgccg aacatgccga 121 tcactcgtga aagggcatac atgacccagg ctgcgccggt gaccttctcg acagtccgtg 181 agaattactt cggtccgccg gcggagatgc aggcactgcg gcacaaggca cccgtgacgc 241 gcacggcctt cgccgacggc cgccccggct ggctcgtcac cggatattcc gccgcgcgtg 301 cggtgttatc cgattcccgt ttcaccgccc gcggagaacg cgagcacccc gccgtgccgc 361 gcgccgcgac cctggaggac gagcgctgcc gcaggctgat cgccggacag ttcaccgccc 421 gccggatgcg gcagctcacc gggcggaccg agcggatcgt ccgcgaacac ctcgacgcca 481 tggagcacat gggctccccg gccgacctcg tcgagcactt cgccctcccc gtcccctcgc 541 tcgtcatcgc cgaactgctc ggcgtgcccc cccccgaccg ggagcacttc cagcacgaca 601 ccctccggtg gggcggcttc ggccggtcca ccgaggaggt caccgaggcc ttcgtctccc 661 tcggcggcca actgcagcgc ctcgtgcggc tcaagcgcac cgagccgggc gacgacctcc 721 tcagcggcct gatcgccgcc gaccccgcgc tcaccgacga ggaactcgcc tcgatcgcct 781 tcctgttgct ggtcgccggc cacgggacca ccgcccacca gatcgccctc ggcgccttcc 841 tgctgctcga acaccccgac cagctggccg cgctgcgcgc cgaccccgcc ctgacggaga 901 gcgcggtcga ggagctgctg cgccacctga gcgtcgtcca ccacggcccc acccgtgccg 961 cgctgcagga cgccgacatc gagggcaccc ccgtcaaggc gggcgaggtc gtcgtggtct 1021 cgctcggcgc cgccaaccgc gaccccgcac gcttcgagcg gcccgacgcc gtggatgtca 1081 cccgcgagga caccggccac ctcgccttcg ggcacggcat gcaccagtgc ctcggccggc 1141 aactggcccg catcgaactg cgcgtcgccc tcaccgccct cctcgaacgc ttcccgcacc 1201 tgcgcctggc ctgccccgcg gcggagatcc cgctccgcca cgacatgcag gtctacggcg 1261 ccgaccggct gccggtcgcc tggtgaccgt ccccgtcacc cccccgaacc accttcccca 1321 cccctgaaga aacgtctcgg aaaggtcagg tagaccccgc gtgactgcac aacagcacct 1381 gtcccgccgc cgcatgctcg gcatggccgc cttcggcgcc gccgccctcg ccgggggcac 1441 caccatcgcc gccccccgtg cggccgccgc cgccaagtcc gcggcggaca acggcggtta 1501 cgtccccgcc gtcgtcatcg gcaccggcta cggcgcggcc gtctccgcgc tgcgcctcgg 1561 cgaggcgggt gtgcagaccc tgatgctgga gatgggccag ctgtggaacc agcccggccc 1621 cgacggcaac atcttctgcg gcatgctcaa cccggacaag cggtccagct ggttcaagaa 1681 ccgcaccgag gccccgctcg gcagcttcct ctggctcgac gtcgtcaacc ggaacatcga 1741 cccctacgcg ggtgtcctgg accgtgtgaa ctacgaccag atgtcggtct acgtgggccg 1801 cggcgtcggc ggcggctcgc tcgtcaacgg cggcatggcc gtggagccca agcgctcgta 1861 cttcgaggag atcctcccgc gggtcgactc ctccgagatg tacgaccgct acttcccccg 1921 cgccaactcc atgctccgcg tcaaccacat cgacaccaag tggttcgagg acaccgagtg 1981 gtacaagttc gcccgcgtct cgcgcgagca ggcgggcaag gccggtctcg gcaccgtctt 2041 cgtccccaac gtctacgact tcggctacat gcagcgcgag gccgcgggcg aggtgcccaa 2101 gtccgccctg gcgaccgagg tcatctacgg caacaaccac ggcaagcaga gcctggacaa 2161 gacctacctg gccgccgcac tcggcaccgg caaggtcacc atccagaccc tgcaccaggt 2221 caagacgatc cgtcagacga aggacggcgg ctacgcgctg accgtcgagc agaaggacac 2281 cgacggcaag ctcctggcca ccaaggagat ctcctgccgc tacctgttcc tcggcgcggg 2341 cagcctcggc tccaccgaac tgctggtgcg cgcccgcgac accggcaccc tgccgaacct 2401 caactccgag gtgggcgcgg gctggggccc caacggcaac atcatgaccg cccgggccaa 2461 ccacatgtgg aaccccaccg gcgcccacca gtcctccatc cccgccctcg gcatcgacgc 2521 gtgggacaac agcgactcct cggtcttcgc ggagatcgcc cccatgccgg ccggcctgga 2581 gacgtgggtc agcctctacc tcgcgatcac caagaacccc cagcgcggca ccttcgtgta 2641 cgacgccgcg acggaccgcg cgaagctcaa ctggacccgt gaccagaacg cccccgcggt 2701 caacgcagcc aaggcgctgt tcgaccggat caacaaggcg aacggcacga tctaccggta 2761 cgacctcttc ggcacccagc tgaaggcctt cgccgacgac ttctgctacc acccgctcgg 2821 cggctgcgtc ctgggcaagg cgacggacga ctacggccgc gtcgccggtt acaagaacct 2881 ctacgtgacc gacggttcgc tgatcccggg ttccgtcggc gtcaacccgt tcgtgaccat 2941 cacggcgctg gccgagcgga acgtcgagcg catcatcaag caggacgtca cggcgtcgta 3001 atcacgcacc gaccgggccg gaagggaaga acggcccggt caccgcacca tcccggaagg 3061 aacgcaccat tcccgtgaac gccgacgaca acagcctgtg gatc // LOCUS STMCEFDA 2494 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.clavuligerus isopenicillin N epimerase (cefD) gene and deacetoxycephalosporin C synthetase (DAOCS) gene, complete cds. ACCESSION M32324 M24140 KEYWORDS deacetoxycephalosporin C synthetase; isopenicillin N epimerase. SOURCE S.clavuligerus DNA, clone pOW380. ORGANISM Streptomyces clavuligerus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1520 to 2494) AUTHORS Kovacevic,S., Weigel,B.J., Tobin,M.B., Ingolia,T.D. and Miller,J.R. TITLE Cloning, characterization, and expression in Escherichia coli of the Streptomyces clavuligerus gene encoding deacetoxycephalosporin C synthetase JOURNAL J. Bacteriol. 171, 754-760 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 1706) AUTHORS Kovacevic,S., Tobin,M.B. and Miller,J.R. TITLE The beta-lactam biosynthetis genes for isopenicillin N epimerase and deacetoxycephalosporin C synthetase are expressed from a single transcript in Streptomyces clavuligerus JOURNAL J. Bacteriol. 172, 3952-3958 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.R.Miller, 26-FEB-1990, for release after publication. FEATURES from to/span description pept 284 1480 isopenicillin N epimerase (cefD) pept 1559 2494 deacetoxycephalosporin C synthetase (DAOCS) mRNA 152 > 1706 cefD mRNA BASE COUNT 387 a 931 c 791 g 385 t ORIGIN 1 ccatgggtga taactcctgt ggagcgggac aaaaggtcga aagattcaaa tgccgagcag 61 cccggacggt ttactgcata atgtgggcta cctatgctgg acgacaggac tcttgaagtg 121 ctcttcggct ggtcttcaga atctcttcgc tatttttccg aagcggacag aaaatcttcc 181 tgttcgagat cgagtcatga ctgctacgct gcatcgggcc gatcgaattt cgatggctgc 241 actgaattga ctgttcttct ggatttgcgg gagatgcgtt gacatggcgg tagccgactg 301 ggaagaagcc cgcggccgta tgctgctcga ccccaccgtc gtcaacctca acaccggctc 361 cgggggaccg ctgccgcgca gcgccttcga gcgggtcacc ggcttccgcg cccatctcgc 421 ggccgagccg atggacttcc tgctccgcga ggtccccgca ctgctgtggc aggcgcggga 481 gagcctcgcc cgcctcatcg gcggggaccc gctgcgcctc gccctggcca ccaacgtcac 541 cgccgccgtc aacctcgtcg cgtcgtcact gcgcctcgaa gcgcccggcg agatcctgct 601 cagcgacgac gagtacacgc ccatgcgctg gtgctgggag cgggtcgccc ggcggcacgg 661 cctggagctg aggacgttcc ggctgcccga gctgccctcg gacccggccg agatcaccgc 721 ggcggcggtc gccgcgatgg gaccgcgcac ccggctgttc ttcttcagcc atgtcgtctc 781 cacgaccggg ctgatcctgc ccgccgccga actgtgcgag gaggcccgcg cacggggcat 841 caccaccgtg gtcgacggcg cccacgcacc cggcttcctc gacctcgacc tctcccggat 901 cccctgcgac ttctacgcgg gcagcggcca caaatggctg ctcgccccca ccggggtcgg 961 ctttctccac ctcgcccccg gccgcctgga agaactggag cccacccagg tgagctgggc 1021 gtacgagccc ccggagggca gcggcccgcc ggccgcgcgc gaccgcttcg gcagcacacc 1081 cgggctgcgc agactcgaat gcgaggggac ccgggacatc tgcccctggc tcgccacacc 1141 ggagtcgatc gacttccagg ccgagctggg ccccggggcg atccgcgccc gccgccgcga 1201 gctgacggac cacgcgcgcc gcctgctcgc cgaccgcccg ggccgcacgc tcctcacccc 1261 cgactccccg gagctgtccg gcggcatggt ggcctaccgg ctgcccccgg gaaccgacgc 1321 ggccgagctg cgccgggggc tctgggagcg cttccggatc gaggccgcgg tggccgagca 1381 gccgcccggg ccggtgctcc ggatctcggc gaacttctac accaccgaag aggagatcga 1441 ccgcctggcg gacgcgctgg acgcgctgac gggcgagtga tccccccggc tcgcggaccg 1501 cctcccccgc gctgttgacc acccggttca cggattacga gaggatcagt gagagttgat 1561 ggacacgacg gtgcccacct tcagcctggc cgaactccag cagggcctgc accaggacga 1621 gttccgcagg tgtctgaggg acaagggcct cttctatctg acggactgcg gtctgaccga 1681 caccgagctg aagtcggcca aggacctcgt catcgacttc ttcgagcacg gcagcgaggc 1741 ggagaagcgc gccgtcacct cgcccgtccc caccatgcgc cgcggcttca ccgggctgga 1801 gtcggagagc accgcccaga tcaccaatac cggcagctac tccgactact cgatgtgcta 1861 ctcgatgggc accgcggaca acctcttccc gtccggtgac ttcgagcgga tctggaccca 1921 gtacttcgac cgccagtaca ccgcctcccg cgcggtcgcc cgggaggtcc tgcgggcgac 1981 cgggaccgag cccgacggcg gggtcgaggc cttcctcgac tgcgagccgc tgctgcggtt 2041 ccgctacttc ccgcaggtcc ccgagcaccg cagcgccgag gagcagcccc tgcggatggc 2101 gccgcactac gacctgtcga tggtcaccct catccagcag acaccctgcg ccaacggctt 2161 cgtcagcctc caggccgagg tcggcggcgc gttcacggac ctgccctacc gtccggacgc 2221 cgtcctcgtc ttctgcggcg ccatcgcgac cctggtgacc ggcggccagg tcaaggcccc 2281 ccggcaccat gtcgcggccc cccgcaggga ccagatagcg ggcagcagcc gcacctccag 2341 tgtgttcttc ctccgtccca acgcggactt caccttctcc gtcccgctgg cgcgcgagtg 2401 cggcttcgat gtcagcctgg acggcgagac cgccacgttc caggattgga tcgggggcaa 2461 ctacgtgaac atccgccgca catccaaggc atag // LOCUS AMTMTCYB 225 bp ds-DNA ORG 14-AUG-1990 DEFINITION Salamander (A.tigrinum) mitochondrial cytochrome b, partial cds. ACCESSION M26230 M28046 KEYWORDS cytochrome; cytochrome b. SOURCE A.tigrinum liver mitochondrial DNA. ORGANISM Mitochondrion Ambystoma tigrinum Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Caudata; Ambystomatidae; Ambystoma tigrinum. REFERENCE 1 (bases 1 to 225) AUTHORS Kocher,T.D., Thomas,W.K., Meyer,A., Edwards,S.V., Paeaebo,S., Villablanca,F.X. and Wilson,A.C. TITLE Dynamics of mitochondrial DNA evolution in animals: Amplification and sequencing with conserved primers JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6196-6200 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 225) AUTHORS Kocher,T.D. and White,T.J. TITLE Evolutionary analysis via PCR JOURNAL (in) Erlich,H. (Ed.); PCR Technology: Princip. and Appl. for DNA amplification: 137-147; Stockton Press, New York (1989). STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1],[2] kindly submitted by T.D.Kocher, 18-JUL-1989. Bernie 12-SEP-1989 initial full entry [1] FEATURES from to/span description pept < 1 > 225 cytochrome b (AA at 1) BASE COUNT 68 a 39 c 35 g 83 t ORIGIN 1 acagggttat ttttagctat acattataca gcagatacat catcagcatt ctcatccgta 61 gcccacattt gccgagatgt aaattatggt tgacttatac gaaatattca cgcaaacggc 121 gcttcattct tttttatttg tatctttctt catattggtc gaggaatata ttacggctca 181 tatatgttca aagaaacatg aaacattgga gtaattttat tattt // LOCUS BLYLEC 972 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Barley root-specific lectin mRNA, complete cds. ACCESSION M29280 M24846 KEYWORDS lectin. SOURCE Barley (var. Betzes) embryo, cDNA to mRNA, clone BLc3. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 972) AUTHORS Lerner,D.R. and Raikhel,N.V. TITLE Cloning and characterization of root-specific barley lectin JOURNAL Plant Physiol. 91, 124-129 (1989) STANDARD simple staff_review COMMENT Draft entry and sequence for [1] kindly submitted by N.Raikhel, 16-MAY-1989. FEATURES from to/span description pept 16 654 lectin precursor sigp 16 93 lectin signal peptide (put.) matp 94 606 lectin BASE COUNT 197 a 285 c 291 g 199 t ORIGIN 1 cagaaaacaa gaaggatgaa gatgatgagc accagggccc tcgctctcgg cgcggccgcc 61 gtcctcgcct tcgcggcggc gaccgcgcac gcccagaggt gcggcgagca gggcagcaac 121 atggagtgcc ccaacaacct ctgctgcagc cagtacgggt actgcggcat gggcggcgac 181 tactgcggca agggctgcca gaacggcgcc tgctacacca gcaagcgctg cggcactcag 241 gccggcggca agacatgccc taacaaccac tgctgcagcc agtggggtta ctgcggcttc 301 ggcgccgagt actgcggcgc cggctgccag ggcggcccct gccgcgccga catcaagtgc 361 ggcagccagg ccggcggcaa gctttgcccc aacaacctct gctgcagcca gtggggttac 421 tgcggcctcg gctccgagtt ctgcggcgag ggctgccagg gcggtgcttg cagcaccgac 481 aagccgtgcg gcaaggccgc cggcggcaaa gtttgcacca acaactactg ctgcagcaag 541 tggggatcct gtggcatcgg cccgggctac tgcggcgcag gttgccagag cggcggctgc 601 gacggtgtct tcgccgaggc catcgccgcc aactccactc ttgtcgcaga atgatgatct 661 tgctaatggc agtattattg caacgacgaa taatccgtgg cagttttgtt gccacgtacg 721 gtctcccttc acttactttt agcactagtc cttaataatt ctccagcctt gcaatatgac 781 gtgcaggttg ctacatgcat ggacatattg cagtgagaag tactgtgtgg caatataggg 841 tgtactattg ttgccacaaa tttagttctt tcttgttacg tacgtacagt tgtcaggatg 901 catgcatccc cgttgtaatg ttggagtact ccatgatttc gttgcaatat atatattgcc 961 atgagtctaa ag // LOCUS CHKMTCYB 225 bp ds-DNA ORG 14-AUG-1990 DEFINITION Chicken mitochondrial cytochrome b, partial cds. ACCESSION M26231 M28048 KEYWORDS cytochrome; cytochrome b. SOURCE Chicken mitochondrial DNA. ORGANISM Mitochondrion Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae; Gallus gallus. REFERENCE 1 (bases 1 to 225) AUTHORS Kocher,T.D., Thomas,W.K., Meyer,A., Edwards,S.V., Paeaebo,S., Villablanca,F.X. and Wilson,A.C. TITLE Dynamics of mitochondrial DNA evolution in animals: Amplification and sequencing with conserved primers JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6196-6200 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 225) AUTHORS Kocher,T.D. and White,T.J. TITLE Evolutionary analysis via PCR JOURNAL (in) Erlich,H. (Ed.); PCR Technology: Princip. and Appl. for DNA amplification: 137-147; Stockton Press, New York (1989). STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1],[2] kindly submitted by T.D.Kocher, 18-JUL-1989. Bernie 12-SEP-1989 initial full entry [1] Mia 13-AUG-1990 citation update FEATURES from to/span description pept < 1 > 225 cytochrome b (AA at 1) BASE COUNT 56 a 84 c 35 g 50 t ORIGIN 1 accgccctac tactagccat gcactacaca gcagacacat ccctagcctt ctcctccgta 61 gcccacactt gccggaacgt acaatacggc tgactcatcc ggaatctcca cgcaaacggc 121 gcctcattct tcttcatctg tatcttcctt cacatcggac gaggcctata ctacggctcc 181 tacctctaca aggaaacctg aaacacagga gtaatcctcc tcctc // LOCUS HUMMUC18A 2940 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human MUC18 gene mRNA (3' variant), complete cds. ACCESSION M29277 KEYWORDS MUC18 glycoprotein; cell adhesion molecule; immunoglobulin-like protein; integral membrane glycoprotein. SOURCE Human (isolate JuSo) melanoma (cell line Mel JuSo), cDNA to mRNA, clone zapy1, drop 1, drop 4, drop 4.2, drop 4.7, drop 4.11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2940) AUTHORS Lehmann,J.M., Riethmueller,G. and Johnson,J.P. TITLE MUC18, a marker of tumor progression in human melanoma, shows sequence similarity to the neural cell adhesion molecules of the immunoglobulin superfamily JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 9891-9895 (1989) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by J.P.Johnson, 06-OCT-89. FEATURES from to/span description pept 8 1819 MUC18 glycoprotein precursor sigp 8 91 MUC18 glycoprotein signal peptide matp 92 1816 MUC18 glycoprotein site 122 373 immunoglobin like (V set) domain I site 464 691 immunoglobin like (V set) domain II site 793 985 immunoglobin like (C2 set) domain site 1073 1246 immunoglobin like (C2 set) domain site 1334 1522 immunoglobin like (C2 set) domain BASE COUNT 668 a 861 c 856 g 555 t ORIGIN 1 gggaagcatg gggcttccca ggctggtctg cgccttcttg ctcgccgcct gctgctgctg 61 tcctcgcgtc gcgggtgtgc ccggagaggc tgagcagcct gcgcctgagc tggtggaggt 121 ggaagtgggc agcacagccc ttctgaagtg cggcctctcc cagtcccaag gcaacctcag 181 ccatgtcgac tggttttctg tccacaagga gaagcggacg tcatcttccg tgtgcgccag 241 ggccagggcc agagcgaacc tggggagtac gagcaagcgg ctcagcctcc aggacagagg 301 ggctactctg gccctgactc aagtcacccc ccaagacgag cgcatcttct tgtgccaggg 361 caagcgcctc ggtcccagga gtaccgcatc cagctccgcg tctacaaagc tccggatgcc 421 aaacatccag gtcaaccccc tgggcatccc tgtgaacagt aaggagcctg aggaggtcgc 481 tacctgtgta gggaggaacg ggtaccccat tcctcaagtc atctggtaca agaatggccg 541 gcctctgaag gaggagaaga accgggtcca cattcagtcg tcccagactg tggagtcgag 601 tggtttgtac accttgcaga gtattctgaa ggcacagctg gttaaagaag acaaagatgc 661 ccagttttac tgtgagctca actaccggct gcccagtggg aaccacatga aggagtccag 721 ggaagtcacc gtccctgttt tctacccgac agaaaaagtg tggctggaag tggagcccgt 781 gggaatgctg aaggaagggg accgcgtgga aatcaggtgt ttggctgatg gcaaccctcc 841 accacacttc agcatcagca agcagaaccc cagcaccagg gaggcagagg aagagacaac 901 caacgacaac ggggtcctgg tgctggagcc tgcccggaag gaacacagtg ggcgctatga 961 atgtcaggcc tggaacttgg acaccatgat atcgctgctg agtgaaccac aggaactact 1021 ggtgaactat gtgtctgacg tccgagtgag tcccgcagcc cctgagagac aggaaggcag 1081 cagcctcacc ctgacctgtg aggcagagag tagccaggac ctcgagttcc agtggctgag 1141 agaagagaca gaccaggtgc tggaaagggg gcctgtgctt cagttgcatg acctgaaacg 1201 ggaggcagga ggcggctatc gctgcgtggc gtctgtgccc agcatacccg gcctgaaccg 1261 cacacagctg gtcaagctgg ccatttttgg ccccccttgg atggcattca aggagaggaa 1321 ggtgtgggtg aaagagaata tggtgttgaa tctgtcttgt gaagcgtcag ggcacccccg 1381 gcccaccatc tcctggaacg tcaacggcac ggcaagtgaa caagaccaag atccacagcg 1441 agtcctgagc accctgaatg tcctcgtgac cccggagctg ttggagacag gtgttgaatg 1501 cacggcctcc aacgacctgg gcaaaaacac cagcatcctc ttcctggagc tggtcaattt 1561 aaccaccctc acaccagact ccaacacaac cactggcctc agcacttcca ctgccagtcc 1621 tcataccaga gccaacagca cctccacaga gagaaagctg ccggagccgg agagccgggg 1681 cgtggtcatc gtggctgtga ttgtgtgcat cctggtcctg gcggtgctgg gcgctgtcct 1741 ctatttcctc tataagaagg gcaaggctgc cgtgcaggcg ctcagggaag caggagatca 1801 cgctgccccc gtctcgtaag accgaacttg tagttgaagt taagtcagat aagctcccag 1861 aagagatggg cctcctgcag gcagcagcgg tgacaagagg gctccgggag accagggaga 1921 gaaatacatc gatctgaggc attagccccg aatcacttca gctcccttcc ctgcctggac 1981 cattcccagc tccctgctca ctcttctctc agccaaagct caaagggact agagagaagc 2041 ctcctgctcc cctcgcctgc acaccccctt tcagagggcc actgggttag gacctgagga 2101 cctcacttgg ccctgcaagg cccgcttttc agggaccagt ccaccaccat ctcctccacg 2161 ttgagtgaag ctcatcccaa gcaaggagcc ccagtctccc gagcgggtag gagagtttct 2221 tgcagaacgt gttttttctt tacacacatt atgctgtaaa tacgctcgtc ctgccagcag 2281 ctgagctggg tagcctctct gagctggttt cctgccccaa aggctggcat tccaccatcc 2341 aggtgcacca ctgaagtgag gacacaccgg agccaggcgc ctgctcatgt tgaagtgcgc 2401 tgttcacacc cgctccggag agcaccccag cagcatccag aagcagctgc agtgcaagct 2461 tgcatgcctg cgtgttgctg caccaccctc ctgtctgcct cttcaaagtc tcctgtgaca 2521 ttttttcttt ggtcagaggc caggaactgt gtcattcctt aaagatacgt gccggggcca 2581 ggtgtggctc acgcctgtaa tcccagcact ttgggaggcc gaggcggcgg atcacaaagt 2641 cagacgagac catcctggct aacacggtga aaccctgtct ctactaaaaa tacaaaaaaa 2701 aattagctag gcgtagtggt tggcacctat agtcccagct actcggaagg ctgaagcagg 2761 agaatggtat gaatccagga ggtggagctt gcagtgagcc gagaccgtgc cactgcactc 2821 cagcctgggc aacacagcga gactccgtct cgagccggcc ggttgcgcgg gccctcggac 2881 cctcagagag gcgagggttc gagggcacga gttcgaggcc aacctggtcc acatgggttg // LOCUS HUMTELB 262 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human DNA from the end of the chromosomal telomere. ACCESSION M19947 J04078 KEYWORDS repetitive sequence. SOURCE Human placental DNA, clone pHuR 143. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 262) AUTHORS Moyzis,R.K., Buckingham,J.M., Cram,L.S., Dani,M., Deaven,L.L., Jones,M.D., Meyne,J., Ratliff,R.L. and Wu,J.R. TITLE A highly conserved repetitive DNA sequence, (TTAGGG)n, present at the telomeres of human chromosomes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6622-6626 (1988) STANDARD full staff_review COMMENT Draft entry and computer readable sequence [1] kindly submitted by S.Jett 27-JUL-88. The (ttaggg)n telomeric sequence is found at the ends of all human chromosomes as well as in some other higher eukaryotic genomes. FEATURES from to/span description site 80 81 rare insertion/substitution site 243 244 rare insertion/substitution BASE COUNT 44 a 0 c 131 g 87 t ORIGIN Unreported. 1 tttagggtta gggttagggt tagggttagg gttagggtta gggttagggt tagggttagg 61 gttagggtta gggttaggga gtagggttag ggttagggtt agggttaggg ttagggttag 121 ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag 181 ggttagggtt agggttaggg ttagggttag ggttagggtt agggttaggg ttagggttag 241 ggtgttaggg ttagggttag gg // LOCUS LCOLHICDH 1320 bp ds-DNA BCT 14-AUG-1990 DEFINITION L.confusus L-2-hydroxyisocaproate dehydrogenase (L-HicDH) gene, complete cds. ACCESSION M31425 M28050 KEYWORDS L-2-hydroxyisocaproate dehydrogenase. SOURCE L.confusus (strain DSM20196) DNA. ORGANISM Lactobacillus confusus Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1320) AUTHORS Lerch,H.-P., Frank,R. and Collins,J. TITLE Cloning, sequencing and expression of the L-2-hydroxyisocaproate dehydrogenase-encoding gene of Lactobacillus confusus in Escherichia coli JOURNAL Gene 83, 263-270 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 328 1260 L-2-hydroxyisocaproate dehydrogenase ORF < 1 243 ORF (AA at 1) mRNA 301 1304 LDH mRNA (3' end put.) signal 1267 1303 transcription termination signal binding 318 323 ribosome binding site signal 267 272 -35 region signal 290 295 -10 region BASE COUNT 333 a 263 c 353 g 371 t ORIGIN 1 ttctttttgt acaccatgac aatgtttgct gctaagacat cttcgccagt tgaaacggcc 61 cgtttgtcag gtatggcgca agctggtggt tactttatgt ctgcattcgg tccaatgctt 121 tatggtatgg cttttaccgc aaatccgaat ggggtcatcc aaaacgtcgt ttatttggta 181 ttggtgattg tgatgattgt cgctgccgtg atgatggcga tgactaagca tttgtttgac 241 taaaataatg atgagatgtt gttcagtggt gaacaacgtc tcttttttgt ataatgaaca 301 tacgacaaaa acaatttggg ggatattatg gcacgtaaga ttggaattat cggccttgga 361 aacgttgggg ctgcagtagc gcacggattg attgcacaag gtgtagccga cgactacgtc 421 tttattgatg caaacgaagc aaaggtgaag gctgatcaaa ttgatttcca agacgcaatg 481 gcgaacttgg aagcgcacgg taacattgtg attaacgatt gggcagcctt ggctgatgct 541 gatgttgtga tttcaacact ggggaacatc aagttgcaac aagacaaccc aaccggtgac 601 cgttttgctg agttgaagtt taccagcagc atggtgcaat cagtcggcac aaacttgaag 661 gaatctggtt tccacggcgt attggtcgtg atttcaaacc cggtcgacgt gattacggcc 721 ttgttccaac acgtgactgg tttcccagct cacaaggtta tcggaaccgg tactttgctt 781 gacacggcgc gtatgcaacg tgcagttggt gaggcgtttg atttggatcc acgttctgtt 841 tcaggttaca acttgggtga gcacggtaac tcacaattcg tagcttggtc aacggtgcgc 901 gtgatgggtc aaccaatcgt gacgttggct gatgccggcg atattgactt ggcggccatc 961 gaagaggaag cacgtaaggg tggcttcacg gtcttgaatg gtaagggcta cacgagttat 1021 ggtgttgcaa cgtcagcaat ccgcattgcc aaggctgtta tggctgacgc gcatgctgaa 1081 ttggttgtct caaatcgtcg cgatgacatg ggaatgtact tgtcataccc agcgattatt 1141 ggtcgcgatg gtgtcttggc agaaacgacg cttgatttga cgacggatga gcaagaaaag 1201 cttttgcaat cacgtgacta catccaacaa cgtttcgacg aaattgtgga tacactctaa 1261 aaacacaaaa agtggcttca atctggtgat tggaccactt ttttctaatt gtgcatgaaa // LOCUS SIVAGM266 300 bp ss-RNA VRL 14-AUG-1990 DEFINITION Simian (African green monkey) immunodeficiency virus, isolate 266, partial pol cds. ACCESSION M22974 KEYWORDS . SOURCE Simian (African green monkey) immunodeficiency virus proviral DNA, isolate 266. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 300) AUTHORS Li,Y., Naidu,Y.M., Daniel,M.D. and Desrosiers,R.C. TITLE Extensive genetic variability of simian immunodeficiency virus from African green monkeys JOURNAL J. Virol. 63, 1800-1802 (1989) STANDARD full staff_review COMMENT Hard copy of sequence kindly provided by R.C. Desrosiers, 25-JAN-1989. FEATURES from to/span description pept < 1 > 300 pol polyprotein (AA at 1) pre-msg < 1 > 300 genomic mRNA pre-msg < 1 > 300 tat, rev, nef subgenomic mRNA BASE COUNT 108 a 59 c 74 g 59 t ORIGIN 1 gcaggactac tggcaggcag ttggatacct gactggacct ttgtaagcgt cccacctctg 61 gtaacattgt ggtacacctt aacaaaggaa cccatcccag gagaagatgt gtattatgta 121 gatggagctt gcaacaggaa ctcaagagaa ggaaaagcag ggtatataac tcagcaaggg 181 aaacagagag tggagaagct agaaaatacc acaaatcagc aagcagaatt aacagctata 241 aaaatggcct tagaagacag tggtcccagg gtaaacattg ttacagactc tcaatatgca // LOCUS STYPOL3E 454 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.typhimurium DNA polymerase III epsilon subunit gene, 5' end. ACCESSION M26045 M29726 KEYWORDS polymerase III. SOURCE S.typhimurium (strain LT2) DNA, clone pFF16. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 454) AUTHORS Lancy,E.D., Lifsics,M.R., Kehres,D.G. and Maurer,R. TITLE Isolation and characterization of mutants with deletions in dnaQ, the gene for the editing subunit of DNA polymerase III in Salmonella typhimurium JOURNAL J. Bacteriol. 171, 5572-5580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.A.Maurer, 10-JUL-1989. FEATURES from to/span description pept 59 > 454 polymerase III epsilon (editing) subunit BASE COUNT 107 a 90 c 96 g 104 t 57 others ORIGIN 1 gtagacttcc tgtaattgaa atcgataaca aaacgcaagt ctgacataaa tgaccgatat 61 gagcactgca attacacgac agatcgtcct cgataccgaa accaccggta tgaatcagat 121 aggcgcgcac tatgaaggtc acaagattat tgagatcggt gcggttgagg tgataaaccg 181 tcgtctgacc ggcaacaatt ttcatgttta cctgaagccc gatcgccttg tcgatccaga 241 ggcttttggc gtacacggta ttgccgatga gtttctgctg gataagccgg tttttgctga 301 tgtggtcgat gagtttcttg attatatnnn nggcgcggag ctggtcatcc ataacgcatn 361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncctaaaac 421 caatactttc tgcaaagtta ccgacagcct ggcg // LOCUS RHMSYRMA 2636 bp ds-DNA BCT 14-AUG-1990 DEFINITION R.meliloti syrM gene, complete cds. ACCESSION M33495 KEYWORDS symR gene. SOURCE R.meliloti (strain 1021) DNA, clones pMB[1,2]. ORGANISM Rhizobium meliloti Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 2636) AUTHORS Long,S.R. JOURNAL Unpublished (1990) STANDARD full staff_review REFERENCE 2 (bases 850 to 1990) AUTHORS Barnett,M.J. and Long,S.R. TITLE DNA sequence and translational product of a new nodulation-regulatory locus: SyrM has sequence similarity to NodD protein JOURNAL J. Bacteriol. 172, 3695-3700 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.R.Long, 02-APR-1990, for release after publication. FEATURES from to/span description pept 991 1971 SyrM protein BASE COUNT 534 a 769 c 792 g 541 t ORIGIN 1 gcatgcgcga cgagctgctg aacgagagtc tgttcttcgg tctcagccat gccagcaggg 61 ccatttccaa atggtcgacg actgcaatac gttccggccg cactcgtcgc tcggataccg 121 caccccggca gcctatcctg ggatcatcgc cgcaaccgag ccagttggtg tatcgaaaac 181 ggtcgaggct ctaatcgccg ctggatgaaa agttcagtgg caggtccgaa aggccacgta 241 ttcggagcca acctcctcta gcgatcgctc tctcctgaaa acttgacagg gcgtcccccg 301 accgccttgc cgaggcggcg cgtgcccgac ggcggcctcg acagcaagcc ggtctcgacg 361 gtcttctatc ggcgctgcag gcaatgggca tttgaaaccg tttcaagagt ctccttaaca 421 gagctcgcgc cgggtgaaga ccctgtatga tggatagtga agtgcagggg cggctgctta 481 ccggccgttg agtgaggcta tgtatccaaa acgtggatga cagtcatgcg aaaacgcgat 541 ttctcctgtc tcgccgaatg ccattacaag aacgttagat ataaccggag ggcagaagtg 601 ggcagcgaga tcaattccga cagactcctt tgcaatgaaa gcatgatttg cgaccagggc 661 ctcagatgtg ttgaggcagt tttcttcagt aagtttcagg cggggccact ttggcgagat 721 cattttgtgg cttgctcgtt ggcactgaaa tatgatgtga gccggcaatt cctcgaaagc 781 attctggacc agcgggtctt cgacatccat caggctaggc ctgcgaggcc cggggggcgt 841 cacagccaag tacttctgat taacggaaaa tcacgacgct tcccatttgt cggcgggcca 901 ggaaagtctt tggcacgcac acccatattg tgacgacctg gatcttcggg atgtggcgcg 961 ggaacattac ctgccgagga gaatagatcc atggatcagc ccacttggaa gcggccgcat 1021 agggccaaat ttgccggtgt cagtgacgcc gcacaacagc ggcaaatgcc gaaccttgcg 1081 tccatcgacc taaacctact ggtggatctt gaggctctgc tgcagtatcg gcacattact 1141 caagcggccc agcacgtcgg ccgtagccaa ccggcgatga gcagagcctt gtcaaggctg 1201 cgcggcatgt taaaggacga tcttctggtt gccggctcta gaggcttggt cctgacgcct 1261 ctggctgaat gcttgaccca gatgctgcct tcagtactag acgcgattcg ccagatgatg 1321 aacctcagct tggctccggc gcaacggcga tggaaggtga caatggctat gcccgatcac 1381 caagcggtgg ttctgctgcc acatctgttg ccgcggttac acgagcgcgc ccctcatctc 1441 gacattgtga ccgatccgct tttgggcggc gcgctggggt tacttgaaca aggtgagatc 1501 gacgtggtcg tcgggcagat gggcgccgct ccgctcggct acttgcggcg cagactctac 1561 gccgacagct tcacctgcgt gctacgccac aatcacccgg ccttggcgca ggagtggacg 1621 atcgaggctt tcgcggcctt gcgccacgtc gccattgcct cggaacctga cgagcttttc 1681 ggtcagatct atgacaggtt aaccaaactc ggactgcagc gtggtgatcc gatggtggtt 1741 tccaccgtac tgaccgcggc ggtcttgatc gcggcgactg actcagtgct ggttgtacca 1801 agccgcgtgg cgacccgtgt cgccgccatg ctgtcgctcg cggtgattcc cccgcctgtg 1861 gaactgagac catacgaagt cgcgctgatc tggcacgagc ggtgccatcg agacccagag 1921 catcgctggc tgcgtggcga aatcgccgcc gcagcgtcca cggccggtta gacgaagact 1981 agcgaatagg cgaggatggg tgagcataat gaatgacgac ggtttgatga ttgactatcc 2041 cggggcggtg gaggacccgg tcgatgcgcg atttgcccga tgactcacac ccgggagggg 2101 gtgagtccga gcgcgcctac cacatgcggc tgggtccaga caccgccgtt ttcgtcaacc 2161 ggccgaagtc gcaagggggc acgagcgagg atcgagttgc aacgatgagc cgtcatagcg 2221 tcatagcggc tgtgcacgcc ccaatcccgg cgggttcgcg gcgttgtgtc aaatgctcct 2281 ctcatcgcta ccgatgacaa tacgcgaccc tgctccgaac gtaggtgttc cgtcagtcgc 2341 cccacgtcgc cagcgcggac tgcgatccgc tgagacggaa ctatgcttag ccgccgcatt 2401 ccgtcgcggt gtgggaaatt gcacgcgcag cagagtcttg acgatgctaa aaccatgagg 2461 agcccttgga ggcaccggtc ttttcagagg tgctctctct gctgcgcgcc atccatactt 2521 tggacgatct gcaccgtcga gaacgcaata acgctcctgg cttctgccta acgatacggt 2581 ccgacccggc gtgaatcgac atgagcttag cgtacccatc gtcgcaaggt gcatgc // LOCUS ECCCPL 615 bp ss-mRNA INV 14-AUG-1990 DEFINITION E.granulosus cyclophilin mRNA, 3'end. ACCESSION J04664 M24616 KEYWORDS cyclophilin. SOURCE E.granulosus, cDNA to mRNA, clone EA21. ORGANISM Echinococcus granulosus Eukaryota; Animalia; Eumetazoa; Platyhelminthes; Cestoda; Eucestoda; Cyclophyllidea; Taeniidae. REFERENCE 1 (bases 1 to 615) AUTHORS Lightowlers,M.W., Haralambous,A. and Rickard,M.D. TITLE Amino acid sequence homology between cyclophilin and a cDNA-cloned antigen of Echinococcus granulosus JOURNAL Mol. Biochem. Parasitol. 36, 287-290 (1989) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Lightowlers, 02-MAY-1989. This E.granulosus antigen exhibits 71% amino acid homology with human cyclophilin. FEATURES from to/span description pept < 1 488 antigen ORF (cyclophilin) (AA at 3) mRNA < 1 615 CPL mRNA BASE COUNT 145 a 135 c 166 g 169 t ORIGIN 1 tgggcgtgaa gtgcttcttc gacatctcca ttggcggtaa gccagcgggt cgcatcgtct 61 ttgccctgtt tgatgacgtc ccaaagacag tagagaactt ccgtgcactg tgcaccggtg 121 agaagggttt cggatacaag ggctcgaagt ttcaccgaat tattcccggt tttatgtgcc 181 aaggtggtga ttttactgcc gggaatggta ccggtggcaa gagcatatac gggagcaaat 241 ttgaggatga aaatttcaat cacaagcaca gcaagccgat gatgctctcg atggcgaatg 301 cgggtaagaa caccaatggc tcgcaattct tcatcactac cgccgtcacc agctggcttg 361 atgggaagca tgttgtcttt ggtgaggtag aaagtggcga agatgttgtt aaggatatgg 421 aggccgtcgg tagcagtagt ggcaagactt cccaagaagt actgattact gactgcggtc 481 agctgtaaat caatcggccg aacgttcact ctattccctt tattaattgt atcgcgctaa 541 tcctgctcct cattgcttct ttacccgttt gtggccgtaa cgtgtttcaa caatgaagtc 601 gttgtgcgta ttttg // LOCUS ECOINTDLP 2269 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli ArgU-tRNA gene overlap with prophage DLP12 integrase gene, complete cds. ACCESSION M27155 KEYWORDS insertion element IS3; integrase; transfer RNA-ArgU. SOURCE E.coli (strain K12) DNA, clones pDC1 and pDM1614. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2269) AUTHORS Lindsey,D.F., Mullin,D.A. and Walker,J.R. TITLE Characterization of the cryptic lambdoid prophage DLP12 of Escherichia coli and overlap of the DLP12 integrase gene with the tRNA gene ArgU JOURNAL J. Bacteriol. 171, 6197-6205 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.F.Lindsey, 16-AUG-1989. A homologous region to bacteriophage P22 excisionase is located at positions 1423-1205. Positions 1727-1422 are 97.4 % identical to phage lambda positions 31627-31322 encoding the C-terminal end of exonuclease, and bases 1726-2126 are 92 % identical to phage lambda bases 39972-40372 encoding the C-terminal 102 residues of the acid P gene product and the N-terminal 31 residues of the ren gene product. FEATURES from to/span description tRNA 73 149 Arg-tRNA tRNA 48 227 Arg-tRNA precursor (alt.) tRNA 48 237 Arg-tRNA precursor (alt.) pept 1328 165 (c) DLP12 integrase trns 2127 > 2269 insertion element IS3 (alpha-4/beta-4) signal 14 19 -35 region signal 36 41 -10 region BASE COUNT 539 a 596 c 512 g 622 t ORIGIN 1 agatcaaaag ccattgactc agcaagggtt gaccgtataa ttcacgcgat tacaccgcat 61 tgcggtatca acgcgccctt agctcagttg gatagagcaa cgaccttcta agtcgtgggc 121 cgcaggttcg aatcctgcag ggcgcgccat tacaattcaa tcagttacgc cttctttata 181 tcctccataa tttcagagtg ggacatattt gggacattat caccaaaaat gtcgtctatt 241 ttcctcgcat gctctgtcaa atgattaggc gcaaggtgag catacctacg aaccatttct 301 atggactccc atccgcccat ttcctgaagc actgataatg ggacgcctga ctgaatcagc 361 cagcttgccc aggtgtgtct gaggtcatgg aaacggaaat cttcaattcc tgcacgacga 421 caagctgata gccatgatgt cttgctgtcg atgcgcatct tcctgaccgc aggcgttgat 481 gttccatctg ctcgcttagc cgccttggta tgtacaaaca cccatttgtg atgcttgcct 541 atttgatcac gcaacacttt acaggcggta tcgttcagcg ccacaccaat ggcgcggttt 601 gatttgctct cttctggatt cacccaggca actcgtcgct gcatgtcgat ttgttgccat 661 tccagattta tgatgttcga ctttctcaga ccagttgcca gcgcaaactt gacgacagat 721 ttcagtggtt cggggcactc atcaataagg cgttttgctt cctccttttc cagccatctg 781 actcgcttgt ttctgaccgc tggtatcttg atgacaggcg ctttttccag ccacttccag 841 tcgcgttctg cagcacggag aatggccttt atcatggcaa gatgctttgc ctttgtctga 901 gttgatactg gctttggttc ataaacaggc agttctttac ctttcctgat ggcggcctga 961 actttctgtt tccatatttc tttcgtcttt ctgttatgca ttctgcttac agcagagtaa 1021 atctttgcct ccgagatatc tttaagcctt ataccctcaa aatgttcaag ccagaactca 1081 atccggcttt tatctgaatc gagagatttt ttatcagctt tttcctcaag ccatcttagg 1141 caggcctctt caaaagtgac atcaggtaaa tcccctagct tttctactcg ccagagttct 1201 gcttttcgct tgtcgtgcaa ctcctgagct tgccgtttgt cctttgtgcc aagagattcc 1261 ttaattcgtt tcccgcccgg gagcgaatac gaggcatacc atatttcatt tctgcggaag 1321 agtgacattt tctttcctct gttatgccat cacccgcgct cacctggaca gtatgcagcg 1381 gagactgaag cgccgcaatg caggcttgcc gtgttgtgag gtacccggat attatcgtga 1441 ggatgcgtca tcgccattgc tccccaaata caaaaccaat ttcagccagt gcctcgtcca 1501 ttttttcgat gaactccggc accatctcgt caaaactcgc catgtacttt tcattccgct 1561 caatcacgac ataatgcagg ccttcacgct tcatgcgcgg gtcatagttg gcaaagtacc 1621 aggcatcttt tcgcgtcacc cacatgctgt actgcacctg ggccatgtaa gccgatttta 1681 ttgcctcgaa accaccgagc cggaatttca tgaaatcccg ggaggtacga gtattgccgg 1741 aagcgtggcc tgtatccgga tgcagagtct tatccgtgga aatcgaacgc gcattactgg 1801 ttggttacca acttgtacca gaacatgcgg gccaatgcgc tggctgacgc ggaattacgg 1861 cgcaaggctg ccgatgaact gacctgtatg acagcgcgaa ttaaccgtgg tgagacgata 1921 cctgaaccag taaaacaact tcctgttatg ggcggtagac ctctaaatcg tgcacaggct 1981 ctggcgaaga tcgcagaaat taaagctaag ttcggactga aaggagcaag tgtatgacgg 2041 gcaaagaggc aattattcat tacctgggga cgcataagag cttctgtgca caggacgttg 2101 ccgcggtaac aggcgcaacc gtaatctgat cttacccagc aatagtggac acgcggctaa 2161 gtgagtaaac tctcagtcag aggtgactca catgacaaaa acagtatcaa ccagtaaaaa 2221 accccgtaaa cagcattcgc ctgaatttcg cagtgaagcc ctgaagctt // LOCUS MHVJHM2CA 49 bp ss-RNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone JHM-a. ACCESSION M22059 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain JHM) RNA, clone JHM-a. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 49) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 16 a 10 c 4 g 19 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taatctaaac tttataaacg gcacttcct // LOCUS MHVJHM2CB 44 bp ss-RNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone JHM-b. ACCESSION M23059 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain JHM) RNA, clone JHM-b. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 44) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 14 a 9 c 4 g 17 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taaactttat aaacggcact tcct // LOCUS MHVJHM2CC 49 bp ds-DNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone B1-c. ACCESSION M23060 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain B1) RNA, clone B1-c. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 49) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 16 a 10 c 4 g 19 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taatctaaac tttataaacg gcacttcct // LOCUS MHVJHM2CD 44 bp ds-DNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone B1-d. ACCESSION M23061 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain B1) RNA, clone B1-d. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 44) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 14 a 9 c 4 g 17 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taaactttat aaacggcact tcct // LOCUS MHVJHM2CE 45 bp ds-DNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone JHM-2c-e. ACCESSION M23062 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain JHM-2c) RNA, clone JHM-2c-e. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 45) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 14 a 10 c 4 g 17 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taatctaatc taaacggcac ttcct // LOCUS MHVJHM2CF 40 bp ds-DNA VRL 14-AUG-1990 DEFINITION Mouse hepatitis virus leader RNA, clone JHM-2c-f. ACCESSION M23063 KEYWORDS . SOURCE MHV mouse hepatitis virus (strain JHM-2c) RNA, clone JHM-2c-f. ORGANISM Murine hepatitis virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Coronaviridae. REFERENCE 1 (bases 1 to 40) AUTHORS Makino,S. and Lai,M.M.C. TITLE Evolution of the 5'-end of genomic RNA of murine coronaviruses during passages in vitro JOURNAL Virology 169, 227-232 (1989) STANDARD simple staff_entry COMMENT Draft entry kindly provided by S.Makino, 03-JAN-1989. BASE COUNT 12 a 9 c 4 g 15 t ORIGIN 47 bp upstream of 5' end of genomic RNA. 1 cttgtagttt aaatctaatc taatctaaac ggcacttcct // LOCUS MUSPDGFA 906 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse platelet-derived growth factor A chain (PDGFA) mRNA, complete cds. ACCESSION M29464 KEYWORDS platelet-derived growth factor; platelet-derived growth factor A chain. SOURCE Mouse F9 embryonal carcinoma cell line, cDNA to mRNA, clone F9A5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 906) AUTHORS Mercola,M. JOURNAL Unpublished (1989) STANDARD full staff_review REFERENCE 2 (sites for [1]) AUTHORS Mercola,M., Wang,C., Kelly,J., Brownlee,C., Jackson-Grusby,L., Stiles,C. and Bowen-Pope,D. TITLE Selective expression of PDGF A and its receptor during early mouse embryogenesis JOURNAL Dev. Biol. 138, 114-122 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Mercola, 26-OCT-1989. FEATURES from to/span description pept 63 653 platelet-derived growth factor A chain BASE COUNT 217 a 250 c 270 g 169 t ORIGIN 1 cgggctggag cgcccgcccc ggccctcgcc tccctgccga gcttccagcg cctcgggacg 61 cgatgaggac ctgggcttgc ctgctgctcc tcggctgcgg atacctcgcc catgccctgg 121 ccgaggaagc cgagataccc cgggagttga tcgagcggct ggctcgaagt cagatccaca 181 gcatccggga cctccagcga ctcttggaga tagactccgt aggggctgag gatgccttgg 241 agacaagtct gagagcccat gggtcccatg ccattaacca tgtgcccgag aagcggcctg 301 tgcccattcg caggaagaga agtattgagg aagccattcc tgcagtttgc aagaccagga 361 cggtcattta cgagatacct cggagccagg tggaccccac atcggccaac ttcctgatct 421 ggcccccatg tgtggaggtg aagcgctgca ctggctgttg taacaccagc agcgtcaagt 481 gccagccttc acgggtccac caccgcagtg tcaaggtggc caaagtggag tatgtcagga 541 agaagccaaa attgaaagag gtccaggtga ggttagagga acacctggag tgtgcatgtg 601 cgacctccaa cctgaaccca gaccatcggg aggaggagac agatgtgagg tgagatagag 661 ccggccgccc ctctcccggg gacacgatgt gcgtggcgtg tgacattcct gaacatacta 721 tgtatggtgc ttcattgcca atgtgcgtgc ggtctttgtt ctcctccgtg aaaaaccgtg 781 tccgaggaca aagagacagt gtccgtttgt tcagtgtgac atcaaagcaa gtactgtagc 841 actcagagag acagtgagaa agcttccttg tcacagagag cgaaaacaaa accacaaaaa 901 agaaaa // LOCUS ORFPRTPS 660 bp ds-DNA VRL 14-AUG-1990 DEFINITION Orf virus homologue of retroviral pseudoprotease gene, complete cds. ACCESSION M30023 J04371 KEYWORDS pseudoprotease. SOURCE Orf virus (strain NZ2) DNA. ORGANISM Orf virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Parapoxvirus. REFERENCE 1 (bases 1 to 660) AUTHORS Mercer,A.A., Fraser,K.M., Stockwell,P.A. and Robinson,A.J. TITLE A homologue of retroviral pseudoproteases in the parapoxvirus, orf virus JOURNAL Virology 172, 665-668 (1989) STANDARD full staff_entry COMMENT Draft entry and clean copy of sequence [1] kindly submitted by A.A.Mercer 12-JUL-1989. FEATURES from to/span description pept 100 579 pseudoprotease homologue BASE COUNT 157 a 170 c 166 g 167 t ORIGIN 1 gctggacttg cgccgcgcca tgctgacccg cccctgattt ttctggagaa agtgtaaatt 61 gtacaccccg tagtcgatcg gccgctcgcc accctagcca tggagttctg ccacacggaa 121 actcttcagg ttgttcgcct cagccaaaac gccactatac cagccagagg atctcctggt 181 gccgcaggat tagacctatg tagtgcatac gactgtgtca tcccttcgca ttgcagtcgc 241 gtagtattta ctgaccttct gataaagccc ccttctggtt gctacggacg catagcacca 301 cgttctggcc ttgcagtcaa acactttatc gacgtgggcg ctggcgtgat cgacgaagac 361 taccgaggca acgtcggagt ggtgctgttt aactttggaa acagtgactt tgaggttaag 421 aagggagaca ggatagctca gctaatatgc gaacgcatat cgtgcccagc agttcaagag 481 gttaactgct tggacaacac tgaccgcggc gactctggat ttggttctac tggatcgggt 541 gcttgtggtg gccgcgatac agcgtggtac atcagctaat caaaattgta aaatgtagct 601 tctttttatt cgagagtctc gcacagttgc gtagataaca ctaattacaa cacactttaa // LOCUS SP8ALU156 192 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage SP82 Alu156 promoter region DNA. ACCESSION M22133 M20661 J03904 KEYWORDS promoter. SOURCE Bacteriophage SP82 (in B.subtilis) DNA, clone Alu156. ORGANISM Bacteriophage SP82 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 192) AUTHORS McAllister,C.F. and Achberger,E.C. TITLE Effect of polyadenine-containing curved DNA on promoter utilization in Bacillus subtilis JOURNAL J. Biol. Chem. 263, 11743-11749 (1988) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by E.C.Achberger, 14-SEP-1988. The nucleotide sequence contains a promoter from bacteriophage SP82 that is recognized by the major B.subtilis RNA polymerase, the sigma-43-containing enzyme. DNA with this sequence displays sequence-dependent DNA curvature. FEATURES from to/span description site 131 136 -35 region site 147 152 -10 region mRNA 159 > 174 Bacteriophage SP82 mRNA start mRNA 156 > 192 Alu156 mRNA BASE COUNT 62 a 45 c 38 g 47 t ORIGIN 1 gaattccccg gatccgtcct tccaagaaag atatcctaac agcacaagag cggaaacacg 61 ttttgttcta catccagaac aacctctgct aaaattcctg aaaaattttg caaaaagttg 121 ttgactttct ctacgaggtg tggcataata atcttaacaa cagcaggacg ctaggacgga 181 tccggggaat tc // LOCUS SP8BAL129 156 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage SP82 Bal129 promoter region DNA. ACCESSION M22134 M20660 J03904 KEYWORDS promoter. SOURCE Bacteriophage SP82 (in B.subtilis) DNA, clone Bal129. ORGANISM Bacteriophage SP82 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 156) AUTHORS McAllister,C.F. and Achberger,E.C. TITLE Effect of polyadenine-containing curved DNA on promoter utilization in Bacillus subtilis JOURNAL J. Biol. Chem. 263, 11743-11749 (1988) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by E.C.Achberger, 14-SEP-1988. The nucleotide sequence contains a promoter from bacteriophage SP82 that is recognized by the major B.subtilis RNA polymerase, the sigma-43-containing enzyme. This is the site of transcription initiation for a major early mRNA for Bacteriophage SP82. FEATURES from to/span description site 60 65 -35 region site 83 88 -10 region mRNA 95 > 126 Bacteriophage SP82 mRNA start mRNA 95 > 156 Bal129 mRNA BASE COUNT 60 a 29 c 29 g 38 t ORIGIN 1 gaattccccg gatccgtcac ccctaagaaa aaatatctac agaaaatatg aaaaagttgt 61 tgacatttct tcccatccat gctataataa agtcatagag aacaacacta tcaaatgaat 121 ggagagattg atatgatgaa tggggatgac ggatcc // LOCUS XELBGAI 2027 bp ds-DNA VRT 14-AUG-1990 DEFINITION X.laevis adult beta-globin gene (beta-AI), 5' flank. ACCESSION M27933 KEYWORDS beta-globin; globin. SOURCE X.laevis adult DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2027) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta-globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.Knoechel, 13-SEP-1989. BASE COUNT 792 a 266 c 284 g 685 t ORIGIN 1 agatcttgat acgttaactt tactagaaaa taatttaaac cccaatagcc tggttttgct 61 tccaatatgg tttaattata ccttagtttt caggataatg gatctttctg taatttggat 121 cttcatgcct taactgtacc agaaaatcat ttaaacttta aataaaccca atttgcttcc 181 agtacagttt aattatatct tagtttggat aagtacaagg tactgtttta ttattacagt 241 gaaaaaggta atcattttaa aaaaaaaata tatattattt ggataaaatg gagtctatgt 301 gtgatggcct ttccgtaatt ctcggtttct ggcaaacgga tctcatacct gtaataggta 361 tataaaaaac acacattaaa aaatactaca tatatattta tattcttttt tttttttaaa 421 gtgtgtaaat tcatgtcttt aaaataataa aatgtattta tatatatata tatatatata 481 tatatatata tatatatata tatatatata tatatatata tacttcaaca aaaaatttgc 541 caaattcata catacaaaaa aaataaaata ataattttaa ataattgaat ctgtctagct 601 gtttatattc tctgctctgc tggatctgac tcctgaaaaa atgtgcagaa gccatttgat 661 ttacagagct ggaggagaat ggctacatta gtttaaaagc cagaaccagg agaggatgca 721 ggcaacaaaa atggatacac acaaattaac gtctattaca attatattta caaataacct 781 taaagccaac ttttttaaaa attattatat attgtaaagt tgcttagaaa ccaatttttt 841 acttataggg agcaaaaaat agggagatcc tgtaaaacag aagctgcacc aaacatagat 901 caagctatcg agctttccat acgtatacat ttatttgaaa ggcactgtta aggagccacg 961 gtgctgtaca gtgcataaaa gtacaatata tatatataaa agtatacaca gggaagacaa 1021 atcacacaat gaatatacac agagctcata tcagaacaaa cagcttaagt gctttgtggt 1081 aagagacaca gtgggaagga ggtccctgtc ccgtagagct tacagtctta cagagctcag 1141 ttcaatccat atgactcaac catttaatta ataaaataat ttgcctttta atcattaatt 1201 aattccacac ttccatgtat aaatggaata tatatgtaag atttatatgt aatagctata 1261 tatgtaagat ttgatatttt ttttgtagga ataaaatgaa aatcaggcaa ataaaaaaca 1321 acatatatat gtttaaaaac ggtgttaatt tctatgcaac atgacatgaa aaagactttt 1381 caatattttt acatatgtat acataagata tatgattgaa ccatttgaat aataaaatag 1441 cttttgcctt accatcatta attattctac catactgtat gtaaaaagca ttgctatatg 1501 taagatttga ttatattttg ttgtaggaat aaaatgaatt ccaggcatat aaaaacacat 1561 ttataaaaaa catttataaa aaacactaca tatacatata catatatata tatatatata 1621 tatatatata tatatatatg tatatatata agtttaaaaa gtgtgttaat ttataatgtc 1681 tttctggaaa tagaatttca cacttcattg tatacaaaat tattaatatt tgtaatattt 1741 gattatatta tgttgtaggg ataaaatgaa taccaggcat ataaaaacac actttaaaaa 1801 aaaaaaaata catagataat aatttaataa tttgtattta ttttttctta atattctagc 1861 tctgctgtaa taaaaaaaac atgcatctaa aagtggtgcc aaatgggagg gtacaaatgg 1921 gctgggcaaa tgtaacgtgt gcttatccta gccaatcaac aggcagagtg gaaaggggca 1981 gtgcatcctt acagctacat aaagtctgat ggatggagaa ttagagc // LOCUS XELBGLI 6777 bp ds-DNA VRT 14-AUG-1990 DEFINITION X.laevis larval beta-globin gene (beta-LI), 5' flank. ACCESSION M27934 KEYWORDS beta-globin; globin. SOURCE X.laevis larval DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 6777) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta- globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.Knoechel, 13-SEP-1989. BASE COUNT 2121 a 1247 c 1228 g 2181 t ORIGIN 1 gaattcaaag cttttttttt tattaaacag ttttattgta ttttcaaacg aaaaacaagc 61 agaggtaaga cagtcaacag gttaacatta atgctgcgtg aagggtacta tacattgtgt 121 ttacattaca acttgttgga cattgatatg tcacttctgt gaatttgaag ctttacattt 181 aactaaaatt tgtgatggaa tgtctaacct gcatcccagt ccaaagaaat ttcaaagtag 241 aagatgacat aatgttggta gagatagtga tgagcggatt ttttgccagg tatggatttt 301 ggcaaaattc cgcgcttctt cgtctgcaat tttttttccc aaaactgcag caaaaatcca 361 ccataaccaa aaagtcacaa agacaaaatt gtcgcagaga caagaaagtc acagtaagac 421 ttgatgctcc tgattcactt gcactgacca caccactgta ttaaagggca gagaggggac 481 tataatgcag agacccatgg tccaggctcc tatgaccatg gggtctgctg tatagctgag 541 ctagttacac cagtaaggcc aaaacaaaat ggacttgcat gctggaataa tatgcaaaac 601 tgctgcagtg cctgttttta atctgctggg taaagagtgc aattacaaat gttaggataa 661 ttgcttatca tactctgctg catacactta ggggcccatt tacttagctc gagtgaagga 721 atagaggaaa aaaacttaga atttcgattg ttttttttgg ctacttcgac ttcgacctta 781 gacttcgaat cgaacgattc aaactaaaaa tcgtttgact attcgaccat tcgatagtca 841 aagtactgtc tctttaagaa aaaactttga ccacctagtt cgccacatta aagctaccga 901 agtcaatgtt agcctatggg gaaggtcccc atatgctttg ctagcttttt ttggtcaaaa 961 ataaaccatt cgatcgatgg attaaaatcc ttcgaatcga tcgaacgaat aatgctaaat 1021 cctttgactt cgatattcga actcgaagga tttaacttcg acagtcgaaa atcgagggtt 1081 aattaaccct cgatattcga ccttaagtaa atttgcactt attattgcaa atatttgggt 1141 ccatgacaga gtcatctgta tataatgtga aattacaaat actggtgcct cccctgtttt 1201 actttgctct atgtgagaaa aataatggag tcagtgccat acatatcctt gtgtgtatgg 1261 tggaaattgt agatgtcttg ggggcaaatt tactaaaggg cgaagtggct aacgctaggg 1321 aaaattcgcc agcgttacgt caatttgcca cttcgacaat ttagtttacg gttaccatgg 1381 cgaaaattcg ctagcaatgt aaatagacca gcgcaacttc acaccctaac gctggcgaag 1441 tcaggatgcc cacattcccc ctacatttcc taacatatgg cacctgaatt atactagggg 1501 cacatgtgta gggctttttt taagtttccc tgggcctctg tagtgttatg tatttgctgc 1561 agcaatatac atgtatacaa atttccaatc ggtagcgtaa cctcgaaccg ctgatcgtaa 1621 catcactagc gcaacttcgc aaatgattgg taacttgtgt gcaacttcgg atcttcgtga 1681 atttgcgcag ccactgcgaa gctatgcctg gcgaagtgcg gcgaatgcaa gtctcgggat 1741 ctccgcaggt aagtaaattt gccccatggt cagaggcaag gccagattat gtactaggtg 1801 acctaagaat caatactgtc cattctaaaa gtgcaagttc ataagtgccc gcaactacag 1861 aaacaatagg ggagaactaa caatctgttg taaacaacat tacaaggttg gctccctcat 1921 tgtttatatt atagctgtat aactgtaatg atgagtacga tctaagatat aatgaatctt 1981 attgcaggca aaacaatcct gttgattaat taatgcttaa attatcagaa attacaaaaa 2041 cctcaggtcc tgtgcattct ggataacagg tcccatacct gtactaaaac atgggaccag 2101 ggtgtctgca ttgatcaatc acctctttta tgattgtttt gggccatcac tctacttcaa 2161 gatgctgatg atatattacc aataaatgtt atattatata cttaaaaatc ttaattgaat 2221 taatatagtc aaatccttga tggagacaga cctagtagta tcatggataa taaaactagc 2281 aacagcaagc attggcccga cttgccatct tggagtcttg aaggaatctt ccacctttga 2341 ggaaaattgg agacagcttg tctatttttc aacctcttct aatatctaat tgaagaagat 2401 ccttacatac tgtatgtggt ggaaaatgca tgtttcttta aagatatgct gattgttgca 2461 ccaatctttg ctcaaagatc ttataagaaa tctttaagca tgactgtctg caactatgac 2521 tattataaaa tcctttccat gtagagtttt catccttttt gtgggtcaaa ggctgcccct 2581 cagcaatatc aggggaatga aattaaagtc acaaagagca aaacaattcg caccaatagg 2641 actaaaaatc cacatctcgc aatgcaatat tgttccttaa actgttattg taattgcgaa 2701 ttttaattgg ccattgcgga ttttaattgc gcactcttaa gaagtgcttg aagttgtcgt 2761 aatcttttgg agcaaacata acgacttttt cattaagagg tttaattaca ttgacgcatt 2821 ggcgcaaact ataaaatttg caaatggtct tccactgtcg gaagtggtcg caaaacagtt 2881 tctgggctcg caaaagctat attaaatttg cgaaagcaaa atgtgttcgc gcaaaggtat 2941 aacttttgca ttgcgaatag ttttccgtta gcaactttta ttgcattccc ctgtaaatat 3001 ctaataagca tggcctcgag cccaaaagac acctttttag gtaaagaaat aaatggggat 3061 ttcattctat aagtaattga atttgcacta aatattagta agtcggtttc ttgccctact 3121 ccaaccaaac tcaagaactt tcatttatta aagcacaaga aactctaact cacatattaa 3181 caaatagtta tagttggtca aattgtagct cagttaaggg tatattatat atttctgttt 3241 gttcgggtgt ggtgggccag tttttgaaac agtcaactgt tttacttaca gcagatgtcc 3301 aggtggcttg ccatatcttt gtcaaaaaca aatatattgt cagtattgtt ttttcaacat 3361 ctgccttagt tagataagaa ttgacaatat agaaccaagg gtatctaaaa atgctgctct 3421 gccttgcaat ctaatggtgg gtgggtccaa tgatttgtgt atttgcctga aaaaaaggga 3481 atattgttct ctcccttacc ttttttccca aagaaattgt ttcttttaat gtgtccaaaa 3541 tacagcaact tcagtcttgt gatttgagct tcaagtgaga tcagagacat gatttgctca 3601 gggatccatt tgtttgtgtt cctttcttcc acagtattct caaaagtctt tactaaaacc 3661 aaaatttatt agtacatttc cttgtactgc cacatttaca tctattaaga gtgacatcaa 3721 atactataac tggacaattc ccaaagtaac ctcagtacat gttaaaatat cgttgacgtc 3781 ttccatgtct cattctaagt gtcaatctgc tacttgacta taagattttt gttgtttata 3841 agtgacccag taaggcaaaa gctatacata actagctgcc cacaaactgg ccaatataaa 3901 gggagaagga aaattgttgt tcccactgga gttgttcccc tggttgggga aaaaatacta 3961 ttttgtatac aaaatgctgt tctggggtca ccaggagaga gcttctgatg ttcagggcca 4021 ggtagtgaca taagcctgag aataagactt aggggcacat tcaagctcgg gtgaatgaat 4081 agagggaaaa aaactcctcg actatcgaat tggcgtaaat tcgcctgagt agaatgattc 4141 aaatagattg agcgaaaaaa cgctgcgact attcgcccat cgatagtcga agtattgtct 4201 cttttaaaaa tcatttgact gcctacttcg ccagataaaa cctaccgaat tgctttaaaa 4261 gcctatggga aagtcccata ggcttctttt ctacgttttt gatcgaataa aaaggcattc 4321 gatcgaatat tcgatcgaat gaaaatcctt cgattgaata ttcgatcgtg cccattcgat 4381 tattcgccag cgcgtaaatt tgcccgaatt ccctattcga ttccattctc cagtcgaatt 4441 tcgagggatt taacccctcg aaattcgacc cttgatacat ctgcccctta gtgtgccaac 4501 ttgctcattg tgtgcatgtg tgtgacatgc cataaggctc tcttattaag cgcatgtatg 4561 tgatgaaaca taaccatccc cactgggagc tccttcatgg tttagcagaa tagcgctcac 4621 taccagcttt ttattcaaaa actgatattg tttccctcaa ccagagtata agctctatta 4681 gcttgcacca tcagtggggg aatttttttt cccctattag gtttccttta agctgcaaac 4741 ttgacctctc cttcccatct gcagtatatt gaccaatata agggaccaac cccacagtaa 4801 gatatctatt gtgtatgttt caaaatccca ttaggtaagg acagtacatt tatgtggtcc 4861 ctataggccc tcattatgat ctaattattg ggtcaatccg tcgtttttgg tacagtggtg 4921 ccagccttga actagagtgg taaaagaggg ctttgttggc tctttgagca tatcatagag 4981 ccttcagcaa aagttcactt tttaaatgta caccaatgaa tggagatttt tgaggccccc 5041 aaaattgtat tgctgtagat cctgcaacag ccaatgatcc ctttatctgc tctgaaatct 5101 tttttgtcgc tgctgctgct actggttaaa tacagtatag ttgaaaaaat ataggctttg 5161 agaataaaac ctgatgttca tttgcttttt aattattact ttacatcccc tttaaaaata 5221 tatacacatc actattccat gcattacact catttttaat tagacaaatc tataagaaat 5281 tctgcgagat gacacttttc atgataagca ttttgtaaaa ttgtaatatg ttcagttttt 5341 ttttttaaaa gttcaatgcc acactttatt tcaaaatgta ttaaggtgca gtaattatat 5401 taaataaatg tattgtaggg tacatgaata tatgtaacat ttaaaatgtg tgtttatgca 5461 cttctttcaa gtacagtaca tttgcactgt gatcaaatat taatttgaac tttaacagtc 5521 ctatctctac acctttatct tgtcctgggg atcagtctgt tttttagtga tatcttgtaa 5581 cacagaactt taaacaaaag ggctccgttt tgcacgtaga cctgtttgtg aatccatggc 5641 aattctgcca cctaaagcat acataacatt tagcatcttt ttttggtgtt ttttagacag 5701 atgatggtat agccatttgt gcaaataaaa tcagatattt tatcccaaat tatttgtgct 5761 gttagttgta tagggtttca acaaaatatc ttatttatca tttagagcaa atacttatgt 5821 gttacagtat ctgcaagtag tcaagtttga gcttaaaatt cccataattc ataattaagg 5881 ggatggctta gtataaaaaa acgtggaaaa aaaaacgtgt acagttatgc ttttatattg 5941 ccttgtaagt tcttttttat actattatta ttttaatgac cacgttttga attattgcat 6001 ggatttatga aaaccagttt aattgcaaag aggctcctaa aaattattta ttataagtta 6061 aaatttagta tatgcgtgca tgtatatgta acaatgcact ctcatatcta gtaaaaatca 6121 aagttgaagt aaagtgtata actaagtttg acctttctca ggcattaatg atcccagagg 6181 aaggccacac tatgtgacca aaacattgga ctacatttat taaatacatt taccttgatt 6241 tcttcaacac aatttgaaag ttcctccatg agctaatata aatttataaa gagagagagt 6301 gagtaaaaca tttttatcag aaaacagtgg cagagtaaat tctttcatac ttacaaaaga 6361 gtgctactat gcgcaacatt aacttgacat ttttgaattg tacctaatgc aattcatgat 6421 atttaaattg aatacattaa ttttaattat ttaattgtcc tgaaatctct acaggttcaa 6481 aaaaataatt ccatttatta catttatttt gtacacttaa ttatctactg ttaagtgtca 6541 caattgccct catttgatgt gggtttaagt ttcatgttgt tataaagaat caactttaca 6601 atttaagaac tatatggcat tccacatata caaaagatat attagcttaa ggttaaaaat 6661 ttattttgaa ggcaataggg tggggtggag gaaaaaaaat atgacacagc agaaatgcac 6721 aatgggtgtg actcagcatg gccatataaa gcaaggccaa caactcaaag gaacagc // LOCUS XELBGLII 910 bp ds-DNA VRT 14-AUG-1990 DEFINITION X.laevis larval beta-globin gene (beta-LII), 5' flank. ACCESSION M27932 KEYWORDS beta-globin; globin. SOURCE X.laevis larval DNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 910) AUTHORS Meyerhof,W., Stalder,J., Koester,M., Wirthmueller,U. and Knoechel,W. TITLE Sequence analysis of the upstream regions of Xenopus laevis beta-globin genes and arrangement of repetitive elements within the globin gene clusters JOURNAL Mol. Biol. Rep. 14, 17-26 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by W.Knoechel, 13-SEP-1989. BASE COUNT 322 a 152 c 128 g 308 t ORIGIN 1 ggatcccttg tctggaaacc agttatccag agggctccaa attatggaaa ggccatctcc 61 catagactca attttaatca aattattatt attttttttt tacaaattaa tgcctttttc 121 aatgtattaa taaaacagta ccttgatccc aaaattggag gcaaaacaat ccagtttgtt 181 ttatttaatg tttaaatatt ttttttaata ttgttttgat ccaaattaca gaaaggcccc 241 ttatccagaa aaacctccat ttaggataag gataacaggt ccaatacatt cataccctgt 301 acaaatctat gctatgttta attacttata aatagatcca catttcaatg gatatttcta 361 gaatatcgta ataacggtat atacttgttc aaagacaaac acatttaatg acctatgcct 421 aactggaata acagtcaagg aaatttaatg gaataatagg tatttcggag ctttccattt 481 attaacccta caaacaacta gttgttgttt caggaaacag cagtagttct atttggctta 541 catcttgaac aaaagcaaag ttgctatagt tttctttttc gtgtaaggaa agaaatgact 601 tgtgtcttta tctctacatt aaaaatgtat ctgccacaca gaatactttc tttttttaac 661 ttatctatag ataacgtatg tgcacccaaa ttgtagctgt gttacatcag cataattaag 721 tgcacacatg aagaaaaaaa atgacagatt gacaaaatgt tatattatat ggtaaggtct 781 cttggataat agcccttatc agtcataact ggttacaaat acagaaaaaa tgaggtgaca 841 cagcataaat gatatgaata cgtcactaac ttacacccct ataaatcaca aggttaaaat 901 attttttttt // LOCUS YSCADK1 2367 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae adenylate kinase (ADK1) gene, complete cds, and histone (H2A-1) gene, 3' end. ACCESSION M18455 KEYWORDS adenylate kinase; histone. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2367) AUTHORS Konrad,M. JOURNAL Unpublished (1988) STANDARD full staff_entry REFERENCE 2 (sites for [1]) AUTHORS Konrad,M. TITLE Analysis and in vivo disruption of the gene coding for adenylate kinase (ADK1) in the yeast Saccharomyces cerevisiae JOURNAL J. Biol. Chem. 263, 19468-19474 (1988) STANDARD simple staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly provided by M.Konrad 21-JUL-1988. FEATURES from to/span description pept 917 1585 adenylate kinase (EC 2.7.4.3) pept < 1 359 histone H2A-1 protein (AA at 3) ORF 2009 > 2367 ORF BASE COUNT 777 a 473 c 435 g 682 t ORIGIN 1 bp upstream of HindIII site; chromosome 4. 1 aagcttctca atctagatct gctaaggctg gtttgacatt cccagtcggt agagtgcaca 61 gattgctaag aagaggtaac tacgcccaaa gaattggttc tggtgctcca gtctacttga 121 ctgctgtctt ggaatatttg gccgctgaaa ttttagaatt agctggtaat gctgctaggg 181 ataacaagaa gaccagaatt attccaagac atttgcaatt ggctatcaga aatgatgacg 241 aattgaacaa gctattgggt aacgttacca ttgcccaagg tggtgttttg ccaaacatcc 301 atcaaaactt gttgccaaag aagtctgcca aggctaccaa ggcttctcaa gaattataag 361 atcggttctg gtattttaaa gaaggcggaa ggaactaaac tgcttctcca ttgtaactat 421 attaagtgtc tattagacct ttctgttttt cctttttatt tttctatata taatttattg 481 ggtatataat taacggtaac atatatcatg catgatataa attttttgtt gtgtcttcct 541 gttttctctg ttgctctgtt taggttcatt gggcactgtt gcttacttga acatgattga 601 gtagcaaaca cattctgtgc tttgttcact ttgatagtgt gacgaaattt tcacaaccaa 661 aggaaaaaag ctcacgattg cgcgatcctg tcacggagaa ctgtccgcag cagcccgcgg 721 cttctgtcct tattcgccca tttttttttg attttcgact ttttcactct ggctagtttt 781 attacgcata tttgtatata tatatacgca taaatttctg aaatggatta cctggtttat 841 cagaaacgag agctttaata aagaaacatt aacgtttctc tggtaaagtc accacacagc 901 atcaaatata acagtaatgt ctagctcaga atccattaga atggtcctaa ttggcccacc 961 tggtgccggt aaaggtactc aagctccaaa tttgcaagag cgtttccatg ccgctcactt 1021 ggccactggt gacatgttga gatctcaaat cgcaaagggc actcaattag gtttggaagc 1081 aaagaaaatt atggaccaag gtggtttagt ctctgatgac attatggtta acatgatcaa 1141 ggatgaattg accaacaatc cagcttgtaa gaatgggttc atcttggacg gtttcccaag 1201 aaccattcct caggctgaaa aattggacca aatgttgaaa gaacaaggaa ctcctttgga 1261 aaaagccatc gaattgaagg ttgatgatga attgttggtt gccagaatta ccggtagatt 1321 aattcaccca gcctctggca gatcctacca caagatcttt aacccaccaa aggaagacat 1381 gaaggatgac gtcaccggtg aagctttagt tcaaagatct gatgacaatg cagacgcctt 1441 gaagaagaga ttagctgctt accatgctca aaccgaacca attgttgact tttacaaaaa 1501 gaccggtatc tgggctggtg ttgatgcttc ccaacctcct gctactgttt gggctgacat 1561 cttgaacaag ctaggtaagg attaatgacc ttgcgcaatg tcttctaaat atcttttctt 1621 ttttttaaat taatgaaaat ataataaatt tttgtatgtg agtacatata tccgcagtat 1681 actataaatg aaataataac ataagtgttt gtaactggct ataacgtcct taaacatgtg 1741 cactgccatt aagtacttag tcaatatttt tatcgttgag aacgaacgca aatcatatta 1801 ctaaatagat gaacaaaagc gcatcacata aacaaaaacg gtataacagt aatagtgaag 1861 ctttttataa acctataacg aaataagcgc ttttcaggtt ttataattca gtcagagttc 1921 taactggaca tcgttttgca ggggataaaa aaaaaaagga agcttcaacc cacaatacca 1981 aaaaagcgaa gaaaacagcc aatgccaaat gacaataaga cacccaaatg ggtccagcac 2041 tcccaagttt actaaaaaac cagtaacccc gaatgataaa attcctgaaa gagaagaaaa 2101 atccaatgaa gtgaagacac ctaaaattcc attattcacg tttgccaaaa gcaaaaacta 2161 ttcaaggccg tcaaccgcaa ttcatacctc acctcatcaa ccaagtgatg taaaaccgac 2221 ttcccataaa cagttgcaac agccaaaatc ctcaccactg aaaaaaaata actataattc 2281 ttttcctcac tcaaatctgg aaaaaataag caacagcaaa ctactctccc ttcttcggtc 2341 caaaacgtca gcaggaagaa ttgaatc // LOCUS YSCSSB1A 1842 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Yeast (S.cerevisiae) heat shock protein 70 (YG101, SSB1) mRNA, complete cds. ACCESSION M25395 KEYWORDS heat shock protein. SOURCE Yeast (S.cerevisiae, strain GRF-18) DNA, and cDNA to mRNA, clone F430. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (sites for [2]) AUTHORS Normington,K., Kohno,K., Kozutsumi,Y., Gething,M.-J. and Sambrook,J. TITLE S.cerevisiae encodes an essential protein homologous in sequence and function to mammalian BiP JOURNAL Cell 57, 1223-1236 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1842) AUTHORS Normington,K.D. JOURNAL Unpublished (1989) Univ, of Texas SW Medical Center, Dallas, Texas STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.D.Normington, 09-JUN-1989. FEATURES from to/span description pept 1 1842 heat shock protein 70 BASE COUNT 510 a 385 c 418 g 529 t ORIGIN 1 atggctgaag gtgttttcca aggtgctatc ggtatcgatt taggtacaac ctactcttgt 61 gttgctactt acgaatcctc cgttgaaatt attgccaacg aacaaggtaa cagagtcacc 121 ccatctttcg ttgctttcac tccagaagaa agattgattg gtgatgctgc caagaaccaa 181 gctgctttga acccaagaaa cactgtcttc gatgctaagc gtttgattgg tagaagattc 241 gacgacgaat ctgttcaaaa ggacatgaag acctggcctt tcaaggttat cgacgtcgat 301 ggtaacccag tcatcgaagt ccaatacttg gaagaaacca agactttctc cccacaagaa 361 atttccgcta tggttttgac caagatgaag gaaattgctg aagctaagat tggtaagaag 421 gttgaaaagg ccgtcattac tgtcccagct tactttaacg acgctcaaag acaagctacc 481 aaggatgccg gtgccatttc tggtttgaac gttttgcgta tcatcaacga acctactgcc 541 gctgctattg cttacggtct aggtgctggt aagtccgaaa aggaaagaca tgttttgatt 601 ttcgatttgg gtggtggtac tttcgatgtt tccttgttgc acattgctgg tggtgtttac 661 actgttaaat ctacttccgg taacactcac ttgggtggtc aagatttcga caccaacttg 721 ttggaacact tcaaggccga attcaagaag aagactggtt tggacatctc cgacgatgcc 781 agagctttga gaagattgag aactgctgct gaaagagcta agagaacctt atcttctgtc 841 actcaaacta ccgttgaagt tgactctttg tttgacggtg aagatttcga atcctctttg 901 actagagcta gatttgaaga cttgaacgcc gcattgttca agtctacttt ggaacctgtt 961 gaacaagttt tgaaggatgc taagatctct aagtctcaaa tcgacgaagt tgtcttggtt 1021 ggtggttcca ccagaattcc aaaggtccaa aagttgttgt ctgacttctt tgacggtaag 1081 caattggaaa aatctattaa cccagatgaa gctgttgctt acggtgctgc tgttcaaggt 1141 gctatcttga ccggccaatc cacatctgac gaaaccaagg acttgttgtt gttagatgtt 1201 gctccattat ctctaggtgt tggtatgcaa ggtgacatgt tcggtatcgt tgttccaaga 1261 aacactactg ttccaaccat caagagaaga acctttacta catgtgctga caaccaaacc 1321 accgttcaat tcccagtcta ccaaggtgaa cgtgttaact gtaaagaaaa cactttgttg 1381 ggtgaattcg acttgaagaa catcccaatg atgccagctg gtgaaccagt cttggaagct 1441 atcttcgaag ttgatgctaa cggtatcttg aaggttactg ccgtcgaaaa gtctaccggt 1501 aagtcttcta acatcactat ctctaacgct gttggtagat tgtcttctga agaaattgaa 1561 aagatggtta accaagctga agagttcaag gctgccgatg aagcttttgc caagaagcac 1621 gaagctagac aaagattgga atcctacgtt gcctccatcg aacaaactgt cactgaccca 1681 gtcttgtctt ctaaattgaa gagaggttcc aagtccaaga ttgaagctgc tttgtccgat 1741 gctttggctg ctttgcaaat cgaagaccca tctgctgatg aattgagaaa ggctgaagtt 1801 ggtttgaaga gagttgtcac caaggccatg tcttctcgtt aa // LOCUS ATHGPA1 2030 bp ss-mRNA PLN 14-AUG-1990 DEFINITION A.thaliana G protein alpha subunit 1 (GP-alpha-1) mRNA, complete cds. ACCESSION M32887 KEYWORDS G protein. SOURCE A.thaliana (strain Landsberger) flow DNA and, cDNA to mRNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 2030) AUTHORS Ma,H., Yanofsky,M.F. and Meyerowitz,E.M. TITLE Molecular cloning and characterization of GPA1, a G protein alpha subunit gene from Arabidopsis thaliana JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3821-3825 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.M.Meyerowitz, 16-MAR-1990, for release after publication. FEATURES from to/span description pept 600 1751 G protein alpha subunit 1 (GP-alpha-1) pept 490 552 ORF1 pept 568 591 ORF2 mRNA 401 1885 GP-alpha-1 mRNA mRNA 401 1921 GP-alpha-1 mRNA (alt.) signal 312 318 pot. TATA box rpt 17 36 inverted repeat 1A rpt 188 207 inverted repeat 1B rpt 323 339 inverted repeat 2A rpt 341 357 inverted repeat 2B BASE COUNT 631 a 346 c 413 g 640 t ORIGIN Chromosome 2 near erecta. 1 gttaacttaa tagtatataa aataaaaatg catataggtt ccgtaattaa tctcttatcg 61 tcacgagagg cacatctttt tccaacattt gaccactctc tctctctctc tcaggacctt 121 tcggcgtaat ttcgtcttcc cctttgctta acattttctt tctttctttt tgaccaaata 181 ttaaaaatat atccattttt attttatttt taattaaatt cataatttgc atttgtaata 241 gaaaaaaaaa gaagagaata aatccaaaag agtgaagcaa aaacattaaa gcggaaagaa 301 agtggtaaaa caataataga aacaggagaa gcagaagtac tacttcttct tcttctgctc 361 tcttctcaga ccttgttttg tactttcttc ttcttcttct ttcttcttct tgtttgcgaa 421 ctccgatatc ttcttcacta cctttgactc catttctttt tcttcttcag gtgtaggcat 481 tgtcttgtta tgagaagcaa ctgtagctgg aagctcaagt atttgttttt agctgtggag 541 cttgaatctt gatagttttc gacttctatg ttattacctg tggggatata gaaacaatca 601 tgggcttact ctgcagtaga agtcgacatc atactgaaga tactgatgag aatacacagg 661 ctgctgaaat cgaaagacgg atagagcaag aagcaaaggc tgaaaagcat attcggaagc 721 ttttgctact tggtgctggg gaatctggaa aatctacaat ttttaagcag ataaaacttc 781 tattccaaac gggatttgat gaaggagaac taaagagcta tgttccagtc attcatgcca 841 atgtctatca gactataaaa ttattgcatg atggaacaaa ggagtttgct caaaatgaaa 901 cagattctgc taaatatatg ttatcttctg aaagtattgc aattggggag aaactatctg 961 agattggtgg taggttagac tatccacgtc ttaccaagga catcgctgag ggaatagaaa 1021 cactatggaa ggatcctgca atccaggaaa cttgtgctcg tggtaatgag cttcaggttc 1081 ctgattgtac gaaatatctg atggagaact tgaagagact atcagatata aattatattc 1141 caactaagga ggatgtactt tatgcaagag ttcgcacaac tggtgtcgtg gaaatacagt 1201 tcagccctgt gggagagaat aaaaaaagtg gtgaagtgta ccgattgttt gacgtgggtg 1261 gacagagaaa tgagaggagg aaatggattc atctgtttga aggtgtaaca gctgtgatat 1321 tttgtgctgc catcagcgag tacgaccaaa cgctctttga ggacgagcag aaaaacagga 1381 tgatggagac caaggaatta ttcgactggg tcctgaaaca accctgtttt gagaaaacat 1441 ccttcatgct gttcttgaac aagttcgaca tatttgagaa gaaagttctt gacgttccgt 1501 tgaacgtttg cgagtggttc agagattacc aaccagtttc aagtgggaaa caagagattg 1561 agcatgcata cgagtttgtg aagaagaagt ttgaggagtt atattaccag aacacggcgc 1621 cggatagagt ggacagggta ttcaaaatct acaggacgac ggctttggac cagaagcttg 1681 taaagaaaac gttcaagctc gtagatgaga cactaagaag gagaaattta ctggaggctg 1741 gccttttatg accttattat tacatatctc tagtaaatta cctctcctta ttattataag 1801 aaaaactcga aaactgaatg accgtgtaat ttatctttcg ggacaaaaga cttagcgatt 1861 caaaatctaa tgtgtctcga tggctacgac tagtttctat tttatcattg tttttgttaa 1921 cattcctctg tctttgactt cttatttttt ttctcatcaa aaacatctca ttttgatctt 1981 gtttttgggg ttatattatt attaaaatga ggcatccaca tcccgaaatc // LOCUS BLYABA 998 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Barley abscisic acid (ABA) mRNA, complete cds. ACCESSION M36000 KEYWORDS abscisic acid. SOURCE Barley (H.vulgare L. cv. Himalaya) seed, cDNA to mRNA, clone pHV A1-1. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 998) AUTHORS Hong,L., Uknes,S.J. and Ho,T.-h.D. TITLE Cloning and characterization of a cDNA encoding a mRNA rapidly- induced by ABA in barley aleurone layers JOURNAL Plant Mol. Biol. 11, 495-506 (1988) STANDARD simple staff_review FEATURES from to/span description pept 120 761 abscisic acid (ABA) mRNA < 1 998 ABA mRNA BASE COUNT 249 a 284 c 323 g 142 t ORIGIN 1 gtgccggtag taaatcatga gcatctcttg cgactcgaaa cgtagtacag caacagccta 61 aagcgagtcc gagtggtgat tccagttcgt gtttgtttga gctagatcgt gagacgaaga 121 tggcctccaa ccagaaccag gggagctacc acgccggcga gaccaaggcc cgcaccgagg 181 agaagaccgg gcagatgatg ggcgccacca agcagaaggc ggggcagacc accgaggcca 241 ccaagcagaa ggccggcgag acggccgagg ccaccaagca gaagaccggc gagacggccg 301 aggccgccaa gcagaaggcc gccgaggcca aggacaagac ggcgcagacg gcgcaggcgg 361 ccaaggacaa gacgtacgag acggcgcagg cggccaagga gcgcgccgcc cagggcaagg 421 accagaccgg cagcgccctc ggcgagaaga cggaggcggc caagcagaag gccgccgaga 481 cgacggaggc ggccaagcag aaggccgccg aggcaaccga ggcggccaag cagaaggcgt 541 ccgacacggc gcagtacacc aaggagtccg cggtggccgg caaggacaag accggcagcg 601 tcctccagca ggccggcgag acggtggtga acgccgtggt gggcgccaag gacgccgtgg 661 caaacacgct gggcatggga ggggacaaca ccagcgccac caaggacgcc accaccggcg 721 ccaccgtcaa ggacaccacc accaccacca ggaatcacta gacgcatgcg ttcgcgctta 781 atttccgttc ctttagtcgt gtttggtcgt tcgagggcct tctacatatt tcatatttgt 841 atgtttccac tctttcatga tttccgctca tttagtgtaa agtttgcctc cgatttgatg 901 tactcgtctc tggttctgta atgagttata atccatgggc tttggtgtaa atggataacg 961 aggacactcg aaggcggcaa taaagttgta tgtgatcg // LOCUS MVARRNAG 1466 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.vannielii 16S ribosomal RNA gene. ACCESSION M36507 KEYWORDS 16S rRNA; small subunit ribosomal RNA. SOURCE M.vannielii DNA. ORGANISM Methanococcus vannielii Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanococcales; Methanococcaceae. REFERENCE 1 (bases 1 to 1466) AUTHORS Jarsch,M. and Boeck,A. TITLE Sequence of the 16S ribosomal RNA gene from Methanococcus vannielii: Evolutionary implications JOURNAL Syst. Appl. Microbiol. 6, 54-59 (1985) STANDARD simple staff_review FEATURES from to/span description rRNA 1 1466 16S ribosomal RNA BASE COUNT 335 a 372 c 473 g 286 t ORIGIN 1 attccggttg atcccgccgg aggctactgc tattggggtt cgactaagcc atgcgagtct 61 atggtttcgg ccatggcgga cggctcatta acacgtggtt aacttaacct caggtggagc 121 ataaccttgg gaaactgagg ataattctcc ataagaaaag cagtctggaa cgattctttt 181 ctgaaagcat atgcgcccga ggataggact gcgctcgatt aggtagttgg tggggtaatg 241 gcccaccaag cctacgatcg atacgggcct tgagagaggg agcccggaga tggggactga 301 gacacggccc caggccctac ggggcgcagc aggcgcgaaa cctccgcaat gcacgaaagt 361 gcgacggggg gaccccaagt gctcatgcac agcatgggct tttatcaagt gtaaacagct 421 tgaggaataa gggctgggca agttcggtgc cagcagccgc ggtaataccg acggcccgag 481 tggtagccac tcttattggg cctaaagcgt ccgtagccgg tccagtaagt ccctgtttaa 541 attctctggc ttaaccagag gactggcagg gatactgctg gacttgggac cgggagagga 601 caagggtact ccaggggtag cggtgaaatg tgttgatcct tggaggacca cctatggcga 661 aggcacttgt ctggaacggg tccgacggtg agggacgaaa gccaggggcg cgaaccggat 721 tagatacccg ggtagtcctg gccgtaaact ctgcgaacta ggtgtcacct gggcctcgag 781 cccaggtggt gccgaaggga agccgttaag ttcgccgcct ggggagtacg gtcgcaagac 841 tgaaacttaa aggaattggc gggggagcac cacaacgggt ggagcctgcg gtttaattgg 901 attcaacgcc gggcatctca ccacgagcga cagcatgatg acggccaggt tgacgacctt 961 gcctgaagcg ctgagaggtg gtgcatggcc atcgtcagct cgtaccgcga ggcgtcctgt 1021 taagtcaggt aacgagcgag acccgtgccc tatgttgcga ctactttctc cggaaggtaa 1081 gcactcatag gggaccgcta gcgctaagct agaggaagga gcgggcaacg ataggtccgc 1141 atgccccgaa tctcctgggc tacacgcggg ctacaatggc taggacaatg ggctgctacc 1201 ctgaaaaggg acgcgaatct ccgaaaccta gtcgtagttc ggatcgtggg ctgtaactcg 1261 cccacgtgaa gctggaatcc gtagtaatcg cagttcataa tactgcggtg aatgtgtccc 1321 tgctccttgc acacaccgcc cgtcacacca cccgagttgg gttcaggtga ggccttggcc 1381 tttggctagg gtcgaacctg ggctcagcga ggggggtgaa gtcgtaacaa ggtagccgta 1441 ggggaacctg cggctggatc acctcc // LOCUS TOM2A11 526 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Tomato fruit-specific protein mRNA, complete cds. ACCESSION M21775 X13741 KEYWORDS fruit-specific protein. SOURCE Tomato (strain UC82B) red fruit stage, cDNA to mRNA, clone 2A11. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 526) AUTHORS Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M. TITLE Isolation and characterization of a fruit-specific cDNA and the corresponding genomic clone from tomato JOURNAL Plant Mol. Biol. 13, 639-651 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.E.Rose, 09-DEC-1989, for release after publication. FEATURES from to/span description pept 34 324 fruit-specific protein BASE COUNT 163 a 97 c 92 g 174 t ORIGIN 1 tgctcatcaa ttagcaatta atccaaaacc attatggctg ccaaaaattc agagatgaag 61 tttgctatct tcttcgttgt tcttttgacg accactttag ttgatatgtc tggaatttcg 121 aaaatgcaag tgatggctct tcgagacata cccccacaag aaacattgct gaaaatgaag 181 ctacttccca caaatatttt gggactttgt aacgaacctt gcagctcaaa ctctgattgc 241 atcggaatta ccctttgcca attttgtaag gagaagacgg accagtatgg tttaacatac 301 cgtacatgca acctgttgcc ttgaacaata tcaatgatct atcgatcgat ctatctatct 361 atttatctgt ctctgcgcgt atagtgttgt ctgtaccttt ggtgtgaaga atatgaataa 421 agggatacat atatctagat atattctagg taatgtccta ttgtatttaa aatttgtagc 481 aatgattgtt tgaataaaaa cataccatga gtgaaataat tattcc // LOCUS TOM3H11 774 bp ss-mRNA PLN 14-AUG-1990 DEFINITION Tomato fruit-specific protein mRNA, complete cds. ACCESSION M21776 X13742 KEYWORDS fruit-specific protein. SOURCE Tomato (strain UC82B) red fruit stage, cDNA to mRNA, clone 3H11. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 774) AUTHORS Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.E.Rose, 09-DEC-1989, for release after publication. Author address: R.E.Rosen, Calgene Inc. 1920 Fifth St. Davis, CA 95616 FEATURES from to/span description pept 214 504 fruit-specific protein BASE COUNT 247 a 143 c 119 g 265 t ORIGIN 1 tttttttgag caaagggcaa ctcagatatc caaagatgaa tccaacatat agcttacagc 61 tgggagaaca ttgtctaact cttctgaaat ttaaatgtta tccagaatcc ttcatcataa 121 aataatatca aaatgcaaat ctattttttc tactcttgtc tagcttcaac tttcttcttc 181 tgctcatcaa ttagcaatta atccaaaacc attatggctg ccaaaaattc agagatgaag 241 tttgctatct tcttcgttgt tcttttgacg accactttag ttgatatgtc tggaatttcg 301 aaaatgcaag tgatggctct tcgagacata cccccacaag aaacattgct gaaaatgaag 361 ctacttccca caaatatttt gggactttgt aacgaacctt gcagctcaaa ctctgattgc 421 atcggaatta ccctttgcca attttgtaag gagaagacgg accagtatgg tttaacatac 481 cgtacatgca acctgttgcc ttgaacaata tcaatgatct atcgatcgat ctatctatct 541 atttatctgt ctctgcgcgt atagtgttgt ctgtaccttt ggtgtgaaga atatgaataa 601 agggatacat atatctagat atattctagg taatgtccta ttgtatttaa aatttgtagc 661 aatgattgtt tgaataaaaa cataccatga gtgaaataat tattccacat taattcacgt 721 atttatttca cttatgatac gtatttttgt tcctttcgcg taaaaaaaaa aaaa // LOCUS TOMFSPG 4654 bp ds-DNA PLN 14-AUG-1990 DEFINITION Tomato fruit-specific protein gene, complete cds. ACCESSION M21777 X13743 KEYWORDS fruit-specific protein. SOURCE Tomato (strain VFNT Cherry) DNA. ORGANISM Lycopersicon esculentum Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae. REFERENCE 1 (bases 1 to 4654) AUTHORS Pear,J.R., Ridge,N., Rasmussen,R., Rose,R.E. and Houck,C.M. TITLE Isolation and characterization of a fruit-specific cDNA and the corresponding genomic clone from tomato JOURNAL Plant Mol. Biol. 13, 639-651 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.E.Rose, 09-DEC-1989, for release after publication. FEATURES from to/span description pept 1379 1445 fruit-specific protein, exon 1 2160 2383 fruit-specific protein, exon 2 pre-msg 1326 2640 mRNA and introns IVS 1446 2159 fruit-specific protein intron signal 1064 1067 TATA-box (put.) signal 1135 1138 TATA-box (put.) signal 1190 1193 TATA-box (put.) signal 1291 1297 TATA box (put.) signal 2640 2640 polyA signal BASE COUNT 1709 a 723 c 642 g 1580 t ORIGIN 1 ctcgagccct ttaaaaagta tagtcaatat ttacggtgac cgtgaatttc ttaattatga 61 tatataattt aaaagaaatc atgatcacat tctactgatg agaacatgtg ctaatcaagg 121 gaaaacatgg atgtgaaaaa tactttttgt taaaagtaaa aaaaaatgtg aaattttgtt 181 agttatttac tacctataca ttatttgagc atgtgcaaac tttacaaata cctaatagaa 241 gattttcacc tgcctgtata tatgtaaatt aattataatg aacactctca cataaaataa 301 ttatcagtat atacattaat acttgccctc cacaatgaat taaataaaat gtagaacatg 361 atctacactt caataaaact aagaccataa agaataattt caaaatatac acatgtcaac 421 aataaattat ttgcatatta tattaactta ctaaacaatc tttacttttg aaatataaaa 481 ataatcaagt tataagtctg ctcaaagtaa agcacttgtt agactcatct gattttgaga 541 aggtaagcaa attgatggtg cataatagtc acaagtaaaa tataaaatag atttcattag 601 taaaattgtt ttttactttc tttatatata attatcaata tccttcaatg gtaggttaat 661 tatattgtta acttcttgtt gaattaaagc aataagacaa gaatattaaa gataaaagaa 721 caataaaaat agaaagacta agagataaga gttttcttat tcttctttca ataagtatca 781 tcaagtgtat acaatataaa tttttgtatt tttgatctat ctatttataa tgttatatat 841 aagcatacaa aagatcagtc ataaatatga ctttaatcat gaaaataatg aaagagatta 901 tgaaggcgta aggttactag aataatagtc attaaaaaaa ggggttatct ttataattga 961 ataattgatg aagtaatgga gataattagt gagcataaat ttttttaaaa aaatggacat 1021 ttacactata atattttata acactttccc ttaaacatct aggtataaat aatgagtctt 1081 gtcaaaatct tagtaggaaa aattctgtga aattttttta gtgaaaacaa atgatataaa 1141 tatcttgaat actcattatt tgttgtctca ttaaaaatct tatctgacct ataaaataaa 1201 ttatttgctc aactcaaaat agtttttcat tctaaaatta gtataattat tagtgaatat 1261 ttaattaaca taattgtata ctaaggggcc tataaattgg attcttctca aagaaaaata 1321 aaatcaccac acaactttct tcttctgctc atcaattagc aattaatcca aaaccattat 1381 ggctgccaaa aattcagaga tgaagtttgc tatcttcttc gttgttcttt tgacgaccac 1441 tttaggttca caacacttct cccttatttt gttttcttaa tttcttggaa gtcatatgca 1501 tgtgtttggt atcatggtat atatataaag gaaaatattt ttcttaatta ctggttttct 1561 aatgtttggt aggtaatcgg aaattattat gagataatga acttgcaaag tcattattat 1621 ataacttttt ttttatactt tgatttaaga attcattttt ctcattttat ataaacttat 1681 ttttcaacag aaaatatttt tcgaactatt caaacacacc ctaagacatt acatatatat 1741 atatatacac cctccgtttt atattactta atgcctattg agttggccca ccctttaaga 1801 atgattcaat tagagatatg ttttactaaa ttaacctatg ctttaagact ctaaatttgg 1861 ctattactat tttacgttgt aatttaatga caaacatttc ataatgacta tagtctgaac 1921 ttaattagac agacgtatct atagtttgct tactaatgat tcatagctat atatttggag 1981 aggagagaga caaacgatat taagaaaggg aggagagagg cgaggtaaat ctgaaataga 2041 gaagagaaag gcaaccaatt ttgatcatct atcatacttt tgattattat ttttattata 2101 tgtacgttta cattacagtt ttcgaattct tacattaatc ttaatcataa tatatacagt 2161 tgatatgtct ggaatttcga aaatgcaagt gatggctctt cgagacatac ccccacaaga 2221 aacattgctg aaaatgaagc tacttcccac aaatattttg ggactttgta acgaaccttg 2281 cagctcaaac tctgattgca tcggaattac cctttgccaa ttttgtaagg agaagacgga 2341 ccagtatggt ttaacatacc gtacatgcaa cctgttgcct tgaacaatat caatgatcta 2401 tcgatcgatc tatctatcta tttatctgtc tctgcgcgta tagtgttgtc tgtacctttg 2461 gtgtgaagaa tatgaataaa gggatacata tatctagata tattctaggt aatgtcctat 2521 tgtatttaaa atttgtagca atgattgttt gaataaaaac ataccatgag tgaaataatt 2581 attccacatt aattcacgta tttatttcac ttatgatacg tatttttgtt cctttcgcgt 2641 agatttttga tccttttccc ttttgaatat taaacattaa acacaaataa tgtttattaa 2701 attaagttaa tatttttatt tagctattta tatttttatt tgaaatcaaa cttgataaat 2761 atttataaag ataattaaca agtaatgtga cactaacacc atgtaatatt atcttgtcgt 2821 tatttatgat aatattttaa aattataatt tcagttaaaa aattattaaa aaaacatact 2881 tttaaaaagt gagttagcct ccgctaccca catacttatg aattggacta gttgtttttt 2941 gacccacaaa aagaatgggc taattaaacc tgacctatca aatttcagaa tctgcataga 3001 ttagtccgaa cgaaatgagt cagcccgtat tgaacaaaat atcaacaagg acgttatgta 3061 aagatgttta agaaggaaaa aagatttcta atacatatgg actttcaata tcccaacttt 3121 gtctggcgat ctgaaccctg cttagtttgt tgatcattaa cttgtcttgc tatgtattta 3181 agatttaaac tttatatgtt taaacttaca gaaaatacat ataaatctct caagacttgg 3241 caacataatt tactttagta cttaaactac atgaaaattt aaatatcctt ttaacatctt 3301 tgaagtgaat taaattatca caatccgagc ctacaccttg gacgtggccg gcactcaaga 3361 accagtgctg gtccccaagc taaccctcat cctgactgac tacaagcgga aggctaactt 3421 aagtatacaa aagcttaaaa ctgaataaaa taaactttac aaggttttaa cacaaatgaa 3481 caactttgaa gaaaataata tattcaacta gccataaaat agacaacttt agtctttaaa 3541 acatttaata aaataaatgc aaaatataga ctccttaact aaactgacta tctatggagc 3601 ctctaattga taaagatgga agtcgggaca agaccacgac atcctgacta aactgagaag 3661 taaataaaat cccccggaaa aaaaggagcc tcaccatggc taactcgaac tcggggatat 3721 atcaatgaag ctcctgttga tgatcttgaa gacatgtctc tgcatcatca aaagatgcag 3781 gccaaatggc tcagtacgta aaatgtacga gtatgtaagg gaaattctaa agtataacat 3841 aagcttgata cttgaataaa aggaaacata cttacctctt ttcaactcaa ctcaaattaa 3901 gaataagata ctcaactcaa agattaggta ttcaacgcaa atatggcact ctactcaatg 3961 aagtacaaat taactcagga tactcgactt aagatactca actcccgaca ctcaactgaa 4021 ctcatttcaa tataaagcag cttaaaacaa gttcagtata aagtaaagtt gtttaaaaac 4081 atgatgtcaa ctctgtgtgt ataataagga tacaacataa ctttgaaatg tatataaaaa 4141 tacaattaac tgatgtatat aaaaatacat taatctatgg gagattctct aaccgacaac 4201 catcacttaa gggctaagat gatgatatag cgatctaccg cacgctgcca tcgcatctta 4261 tacccggcca aaggtataag acctgaactg cctaatgaat ccactaataa actgttaaaa 4321 ggaatcatct aaaaagtatg acccttttct acccatagtg gctaacatgg tttatggggg 4381 ctgtgagtta tctgaactct cccccatatc ggtgctcaat actactccaa aaaatatact 4441 gctcttatgt ttaaaaacat actgattctg tggtttgaaa ttattgctta aagcttagat 4501 ttttgaaaag ctctcttttg aaaatcgtag tttccttttt cttctattaa agctagacat 4561 aggctatgta gaactctagc ttaccttcct tctcaaaagt ttgaaaacat ttgcttagat 4621 tcttagggac tacttagttc ccttgttgga attc // LOCUS ACLP322P 1337 bp ds-DNA SYN 14-AUG-1990 DEFINITION Synthetic plasmid pWH1266 origin of replication (ori) region. ACCESSION M36473 KEYWORDS . SOURCE A.calcoaceticus lwoffi plasmid and pBR322 DNA, clone pWH1266. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 1337) AUTHORS Hunger,M., Schmucker,R., Kishan,V. and Hillen,W. TITLE Analysis and nucleotide sequence of an origin of DNA replication in Acinetobacter calcoaceticus and its use for Escherichia coli shuttle plasmids JOURNAL Gene 87, 45-51 (1990) STANDARD simple staff_entry FEATURES from to/span description orgrpl 310 337 origin of replication BASE COUNT 447 a 229 c 251 g 410 t ORIGIN 1 gatcgtagaa atatctatga ttatcttgaa gaacgcaacc ctatagcagc tattgaaatt 61 gatgatttaa ttgaagaaaa gacagattta gttgttgata atcgactgat ggggcgcaca 121 ggcagacaga aagatactag ggagttagtg atacatccgc attatgtggt tgtatatgac 181 atcactgata taatacggat actcagagtg ctacacacat cgcaggagtg gtcatgactt 241 actcatgtac tttggattat ttagtgttat aaaatcctga tttataaatt ttttttgtta 301 aaaaagataa aagccccttg caattgcttg gggctttacc gtaatttatg gggtacagat 361 cttcgatact gacatatcgg caatcgaaag cattaaggtt tgacgaccgc taatgatttc 421 accacagggg cttaatgtac ctgtcttaaa ttctaaggtt ttaactcgct ttgtcaagca 481 tagaccccaa aaatttagcc aatgtctgta actcaatctg tccatgtgtg ggtgatgagg 541 tacagtgacg ctagcacaca tcggaaaaac gctattacta ggggaactga acagagtagc 601 ggacgcaatg agtagtcatt taattggcgg ttatgagcgt gttcaggcgg tgctatcaat 661 cgtaatcata acagtggcag cttgatacag tgatgtcatc cctgatgcga aagcgaccga 721 ccgacggtac atcgaatggg aatactttag ggtgattttt aagaatcgct ctagggtgag 781 tatttcccat tcagctctgc tccctccctc tggtacttta atcaaaagca ctactaaaca 841 tatgttttta aataaaaaat attgatatag agataatatt agtaagaata attaaacaat 901 tgaatataga taaatcattg ttaaataaag attaattatt aaaatgaatg tatacttata 961 tataaatcaa tgatttaaaa tatttgataa agaaaacttt tcaaaaaaaa tataattgag 1021 attgtgtcat ttcggtcaat tcttaatatg ttccacgcaa gttttagcta tggtgctaaa 1081 cagaaatttg ctgaaaaaga acttttcact gaactggtta aaatgtaagc agcctgagag 1141 ccgccaaaaa ttttaaaaac aaaccgcctt aatcatcttc aaaaaatacc tctaaaacct 1201 caccatttgc gttttaagac ccatatttca tcctgccctt atgttcccat gctgatagct 1261 ataaagtgtc tgtaatcgct tcctatgacg ttctaggctg ttgataactt ttggaacaac 1321 gcaaaatgtt aaaatcc // LOCUS ATHUXP1A 1078 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana ubiquitin extension protein (UBQ1) gene, complete cds. ACCESSION J05507 KEYWORDS gene UBQ1; ubiquitin extension protein. SOURCE A.thaliana (variety Columbia) DNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 1078) AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D. TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure, localization, and expression of their promoters in transgenic tobacco JOURNAL J. Biol. Chem. 265, 12486-12493 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Callis, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 44 146 ubiquitin extension protein (UBQ1) 234 320 ubiquitin extension protein 406 508 ubiquitin extension protein 888 954 ubiquitin extension protein 1031 1057 ubiquitin extension protein pre-msg < 1 > 1078 UBQ1 mRNA and introns IVS 147 233 UBQ1 intron A IVS 321 405 UBQ1 intron B IVS 509 887 UBQ1 intron C IVS 955 1030 UBQ1 intron D BASE COUNT 295 a 195 c 214 g 374 t ORIGIN 1 ccttgtttag ggtttctacg tgagagaaga cgaaacacaa aagatgcaga tcttcgtgaa 61 aaccttgacc ggcaagacca tcactctcga ggtcgagagc agcgacacca tcgacaatgt 121 caaggccaag atccaagaca aagaaggtat cattcttcct cactcaatct ggattcttct 181 ctttagcttt ttgaaattca gatctcttat catttacttg tttctccttt aaggaatccc 241 tccggatcag cagagattga tcttcgccgg aaagcagctc gaagatggcc gtactttggc 301 tgactacaac atccagaaag gtacgaaatc atccgaatcc ttctgttgat catttcgatg 361 atctgattgt ataaactcta atggattgtt atcatttgta aacagaatct acacttcatc 421 ttgtgttgag gcttagagga ggtattattg agccttcctt gatgatgctt gctcgtaagt 481 acaatcagga taagatgatt tgccgcaagt aagtttcaat ttctccctaa gattgtagtt 541 ttgttgatta atagttcaaa gtaccaaatg tttattagtg tgttagggtt ttagttgttt 601 taatgggaat taacttagct tctcaagcct tatacaatgc gtgttgaggt tataaatgtt 661 cccaatatca aacatttcca ttctctaaac tttgtcagct ggtcaaattt ctcatctttc 721 cagatgattt tcttgaatgg tacatttttg caaatggttt tgaaaatgtg ttgaagcttg 781 tatttttgat tgctgaaaca tcttgtttgt tggatagttt gtcatctatg gtccgatctg 841 atgtctcaat gagattattc taatgtaaat ttcgcatctt ttggtaggtg ctatgctcgt 901 cttcacccaa gagctgtcaa ctgcaggaag aagaagtgtg gtcacagcaa ccaggtttgt 961 ttttgttact tgtttctttt tggtgtttat cagctgaaat gatatctgaa caaacttttt 1021 ttatttgcag ttgaggccta agaagaagat caagtagaga gactcttatc aagaatcc // LOCUS ATHUXP2A 976 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana ubiquitin extension protein (UBQ2) gene, complete cds. ACCESSION J05508 KEYWORDS gene UBQ2; ubiquitin extension protein. SOURCE A.thaliana (variety Columbia) DNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 976) AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D. TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure, localization, and expression of their promoters in transgenic tobacco JOURNAL J. Biol. Chem. 265, 12486-12493 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Callis, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 62 164 ubiquitin extension protein (UBQ2) 243 329 ubiquitin extension protein 433 535 ubiquitin extension protein 771 837 ubiquitin extension protein 929 955 ubiquitin extension protein pre-msg < 1 > 976 UBQ2 mRNA and introns IVS 165 242 UBQ2 intron A IVS 330 432 UBQ2 intron B IVS 536 770 UBQ2 intron C IVS 838 928 UBQ2 intron D BASE COUNT 262 a 164 c 209 g 341 t ORIGIN 1 cttcattttt tacactgcaa acgccgtttc gagacttccg gcgaagaaga cgaacgcaaa 61 gatgcagatt ttcgtgaaaa cgctaaccgg caagaccatc accctcgagg tcgagagcag 121 tgacaccatc gacaatgtca aggccaagat ccaggacaaa gaaggtatat tttctaatgc 181 tttttaaaaa ttgtaacttt gaaattgtga tctcttagtg attcgttttt gctctgtttc 241 aggaattcct ccggaccagc agaggttgat cttcgccgga aaacagcttg aagatggtcg 301 taccttggct gactacaaca tccagaaagg tctctttctc gaatttgaat gttttggatt 361 ctttagaaaa tgaaactagt atagctatga ttgttttgtt gatccaattc tgttttgttt 421 actataaaac agagtcgact ttgcatcttg ttctgaggct taggggaggt atcattgagc 481 cttcattgat gatgttggct cgtaagtata accaagacaa aatgatttgt cgcaagtaag 541 atctctgatc ttttctcttt gttgttctaa gtgtaccatg tccatgtgtg atgattatag 601 taaaaatggg gaattaactt aggggtgaac tattattcat tttctacgtg gattatgatt 661 ctcttagttt gctgataaat gtattgaaat ctatctagtt ctttcactgt aacactatgg 721 ttgtgtgaaa actgtgtctc atgtttgagt tgttaacctt gttggtttag gtgttatgct 781 cgtctccacc caagagctgt gaactgcagg aagaagaagt gtggccatag caaccaggtt 841 tggttttttg tctctttaga ttttaaaatt ccacagaatt tgtttgattt gttagttgtt 901 aagtgtgatt ttctgtttat ttttgcagtt gaggccaaag aagaagatca agtgatcttt 961 tgttcaccaa aagctt // LOCUS ATHUXP5A 720 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana ubiquitin extension protein (UBQ5) gene, complete cds. ACCESSION J05539 KEYWORDS gene UBQ5; ubiquitin extension protein. SOURCE A.thaliana (variety Columbia) DNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 720) AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D. TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure, localization, and expression of their promoters in transgenic tobacco JOURNAL J. Biol. Chem. 265, 12486-12493 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Callis, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 112 585 ubiquitin extension protein (UBQ5) site 339 340 ubiquitin-extension protein junction BASE COUNT 209 a 159 c 164 g 188 t ORIGIN 1 aagcccaaat aaaataaggg tagtaacgta attgagctaa gaaaccctag aggtctcctt 61 cgcctacaaa atccccattt tgataatctt cagcagccgt tgcctcaaaa gatgcagatc 121 ttcgtgaaaa ccctaacggg gaagaccata acccttgagg ttgaatcatc cgacaccatc 181 gacaatgtga aggcgaagat ccaagacaag gaaggaatcc caccggacca gcagcgattg 241 attttcgccg ggaaacagct tgaagacggc cgtaccctcg ccgactacaa catccagaag 301 gaatcgacgc ttcatctcgt cctccgtctc cgtggtggtg ctaagaagag gaagaagaag 361 acttacacca agccgaagaa gatcaagcac aagcataaga aggttaagct cgctgttctt 421 cagttttaca aaatcgatgg atctggaaag gttcagcgtt tgaggaagga gtgccctaac 481 gcaacctgtg gagctgggac tttcatggcg agtcattttg atcgccatta ctgtggtaag 541 tgtggactca cctacgttta ccagaaagaa ggagttgaag cttgatctgt tgtagcggta 601 gatcgatcct tctctttctc tcttatcttt taaaaactgc atttctattt gggaattttg 661 taagctcttt aatttgagtt atcatggatt ctatgttgaa catctttcgt tctggattat // LOCUS ATHUXP6A 681 bp ds-DNA PLN 14-AUG-1990 DEFINITION A.thaliana ubiquitin extension protein (UBQ6) gene, complete cds. ACCESSION J05540 KEYWORDS gene UBQ6; ubiquitin extension protein. SOURCE A.thaliana (variety Columbia) DNA. ORGANISM Arabidopsis thaliana Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 681) AUTHORS Callis,J., Raasch,J.A. and Vierstra,R.D. TITLE Ubiquitin extension proteins of Arabidopsis thaliana: Structure, localization, and expression of their promoters in transgenic tobacco JOURNAL J. Biol. Chem. 265, 12486-12493 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Callis, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 102 575 ubiquitin extension protein (UBQ6) site 329 330 ubiquitin-extension protein junction BASE COUNT 191 a 154 c 153 g 183 t ORIGIN 1 gagctccaaa accctagttt ctcaaccact actcttttat ttcttctcac cacttaaaga 61 gtttccccag aaattttctt ccgccgtaaa agcaaaaaaa gatgcagatc ttcgtgaaaa 121 ccctaacggg gaagacgatc actctcgagg tcgagtcctc tgacaccatc gacaatgtca 181 aggccaagat ccaagacaag gaaggaatcc caccggacca gcagcgattg attttcgccg 241 gaaagcagct cgaagacgga cgtaccttag ccgattacaa catccagaag gaatcaacgc 301 ttcaccttgt ccttcgtctc cgtggaggtg ctaagaagag gaagaagaag acctacacca 361 agcctaagaa gatcaagcac aagcacaaga aggtcaagct cgctgttctt cagttctaca 421 aggttgatgg ttcaggtaag gttcagcgtt tgaggaagga gtgccctaac gcaacctgtg 481 gtgctgggac tttcatggcg agtcatttcg atcgtcacta ctgtggtaag tgtggtctca 541 cctacgttta ccagaaagaa ggagctcagg aatgattttc atctcgatct ctatcatttt 601 gaattgaata ctgctttttt tttggaattt ggaagttgtt tttggatgtt gtggatctta 661 tgttgaactt gtttgaattt c // LOCUS BLYG1HORDA 1614 bp ds-DNA PLN 14-AUG-1990 DEFINITION Barley gamma-1 hordein storage protein gene, complete cds. ACCESSION M36378 KEYWORDS gamma-hordein; storage protein. SOURCE Barley (cv. Carina) leaf DNA, clone lambda-hor-gamma-1. ORGANISM Hordeum vulgare Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 1614) AUTHORS Cameron-Mill,N. and Brandt,A. TITLE A gamma-hordein gene JOURNAL Plant Mol. Biol. 11, 449-461 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 379 1296 gamma-1 hordein precursor sigp 379 435 gamma-1 hordein signal peptide matp 436 1293 gamma-1 hordein storage protein signal 1372 1377 polyA signal signal 1428 1433 polyA signal BASE COUNT 516 a 433 c 271 g 394 t ORIGIN 1 agatccagtt tatccctcac aacttttaaa acatgacaaa gttagtttta gaaggattgg 61 aagatagata cttatgtgag atgtaaagtg aataagatga gtcagcacct attatcaacc 121 tcatcgtgca cttagaattt atgtacaacc acaaagcaac tttgatgatg tgtccaaaat 181 tacctttgta tgtagtgcca ccaaacacaa acattccaaa tgatcagttc gataagcacg 241 aacgcacatt aaaagaggga agaaaccatg aaaaggaacc atgatatgac atctataaag 301 aggcctacat tttgtagatc acataccatc atccatcctt tacattacta agaaaggttt 361 agctaacaca aatccaccat gaagatctta atcatcctta caatccttgc catggcgact 421 accttcgcca cctctgagat gcaagtcaac cctagtgtcc aagtacaacc gacacaacaa 481 caaccatatc ccgagtctca gcaaccattc atcagtcaat cacaacaaca atttccccag 541 ccacaacaac cattccctca gcaaccacaa cagccatttc ctcaatccca gcagcaatgc 601 ctccagcaac cacaacacca atttccccaa cccacacaac aattccccca acggccacta 661 ctaccattta ctcacccgtt cctaacattc cccgaccaac tactccctca gccaccacac 721 caatcattcc cccagccccc acaatcatac cctcagccac cgttacaacc ttttcctcag 781 cccccgcaac aaaaatatcc cgagcaacca caacaaccat ttccttggca acaaccaact 841 attcagctat atctccaaca acaactgaat ccgtgcaagg agtttctctt gcaacagtgc 901 aggcctgtgt cattgttatc gtacatctgg tcaaagatcg tgcaacagag cagttgtcgg 961 gtgatgcagc aacaatgttg cctgcaattg gcacagatcc ctgaacaata caaatgcaca 1021 gcaatcgata gcattgtgca tgccatcttc atgcagcaag gccaaagaca aggcgtgcag 1081 atcgtgcaac aacaacctca gccgcaacag gtgggtcaat gtgtgctcgt ccagggtcag 1141 ggcgtcgtcc aaccgcaaca actagctcaa atggaagcaa tcagaacgtt ggtgttgcag 1201 tccgttccat ccatgtgcaa ctttaatgtc ccacctaact gctccaccat caaagcacca 1261 tttgttggtg tagtcaccgg tgttggagga caatgaaaag atgagagcta taataattag 1321 tacatggatc atcgttgcat aatcgatgga ccaatcgatg taataacgat aaataaagtg 1381 tcgtgcacca tgatgtgtga ttcgaccagg acttgtccaa tcgtgggaat aaaagtcata 1441 aaaagttctt ctaagtgctt catttcttat actttttccc attcattgtg tatgtaaaag 1501 tcgatacaaa actggaatcg taagtctatc cgatctatga tttagtttag acaactagat 1561 taatgttcct tcggtgttag tcgtgcattg tagtaggttc aatctggtgg tacc // LOCUS BOLSLSGRA 1286 bp ss-mRNA PLN 14-AUG-1990 DEFINITION B.campestris S-locus-specific glycoprotein mRNA, partial cds. ACCESSION M36301 KEYWORDS S-glycoprotein; S-locus-specific glycoprotein; glycoprotein. SOURCE B.campestris, cDNA to mRNA. ORGANISM Brassica campestris Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 1286) AUTHORS Takayama,S., Isogai,A., Tsukamoto,C., Ueda,Y., Hinata,K., Okazaki,K. and Suzuki,A. TITLE Sequences of S-glycoproteins, products of the Brassica campestris self-incompatibility locus JOURNAL Nature 326, 102-105 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 1257 S-6-glycoprotein BASE COUNT 345 a 278 c 316 g 347 t ORIGIN 1 ggatccgtcg tcttgattct attttgtcct gccttttcga tcaacacttt gtcgtctaca 61 gaatctctta gaatctcaag caacagaaca cttgtatctc caggtaataa cttcgaactc 121 ggcttcttcc gaaccaactc aagttctcgt tggtatctcg ggatatggta caagaaattg 181 ctcgacagaa cctatgtatg ggttgccaac agagataacc cactctccaa tgccattgga 241 accctcaaaa tctcaggcaa taatcttgtc ctccttggtc acaccaataa atctgtttgg 301 tcgacgaatc ttactagagg aaatgagaga cttccggtgg tggcagacgt tctctctaat 361 ggaaacttcg tgatgcgaga ctccagtaac aacgacgcaa gtgaatactt gtggcaaagt 421 ttcgattacc ctacggatac tttgcttcca gagatgaaac tgggttacga cctcaaaaca 481 gggttgaaca ggttccttac atcatggaga agttcagatg atccatcaag cggggatttc 541 tcgtacaagc tcgaaacccg aagccttcct gagttttatc tatggcatgg gatctttcca 601 atgcatcgga gtggtccatg gaatggagtc cgatttagtg gcataccaga ggaccaaaag 661 ctgagttaca tggtgtacaa cttcacagag aatagtgaag aggtcgctta tacattccga 721 atgaccaaca acagcatcta ctcgagattg acactaagtt ccgaaggcta ttttcagcga 781 cttacgtgga atccgtcaat agggatatgg acagcgttct ggtcttctcc agtggacccc 841 cagtgcgata catacataat gtgcgggcct tacgcttact gtggcgtgaa cacatcacct 901 gtttgtaact gtatccaagg gttcaatccc cggaatatac agcagtggga tcagagagtc 961 tgggcaggtg ggtgtataag gaggacgcgg cttagctgca gtggagatgg ttttacaagg 1021 atgaagaaca tgaagctgcc agaaactacg atggcgattg tcgaccgcag tattggtgtg 1081 aaagaatgtg agaagaggtg ccttagcgat tgtaattgta ctgcttttgc aaatgcggat 1141 atccggaatg gtgggacggg ttgtgtgatt tggaccggac ggcttgacga tatgcggaat 1201 tacgttgctc acggtcaaga tctttatgtc agattggctg ttgctgacct tgtttagctc 1261 tttctcttaa aataaagcac ggatcc // LOCUS BOVFPKPA 1835 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Bovine fructose 6-phosphate, 2-kinase: fructose 2,6-bisphosphatase mRNA, complete cds. ACCESSION M34241 KEYWORDS bifunctional enzyme; fructose 2,6-bisphosphatase; fructose 6-phosphate,2-kinase. SOURCE Bovine heart, cDNA to mRNA, clones BH[9G,7H]. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1835) AUTHORS Sakata,J. and Uyeda,K. TITLE Bovine heart fructose 6-phosphate, 2-kinase/fructose 2,6-bisphosphatase: Complete amino acid sequence and localization of phosphorylation sites JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4951-4955 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Uyeda, 11-MAY-1990, for release after publication. FEATURES from to/span description pept / 30 1622 fructose 6-phosphate,2-kinase:fructose 2,6-bisphosphatase (EC 2.7.1.105 and EC 3.1.3.46) BASE COUNT 466 a 493 c 493 g 383 t ORIGIN 1 caaccgacat ctcctgaaga gttgccatgt ccgggaatcc tgcctcttcc tcagaacaga 61 acaacaatag ctatgaaacc aaagcaagtc tccgaatatc agagaagaaa tgttcatggg 121 catcttacat gacaaactct ccaaccctca tcgttatgat tggcttgcca gcccgtggca 181 aaacctacgt gtccaagaaa cttacacgct acctcaactg gattggggtg cccaccaaag 241 tgtttaatct tggggtgtat cggcggcaag cagtcaagtc ctataagtcc tatgacttct 301 tccggcatga caacgaggag gccatgaaga ttcgcaaaca gtgtgccctg gtggcactga 361 aagacgtgaa ggcgtatctc acggaggaga gcgggcagat tgcggtgttt gatgctacca 421 ataccactcg agagaggagg gacttgattt tgaactttgc cgaggagaat tctttcaagg 481 tgttctttgt ggagtccgtg tgcgatgatc ctgatgtcat tgctgccaac atcctggagg 541 taaaggtgtc gagccccgac taccctgaaa ggaacaggga gaatgtgatg gatgacttcc 601 taaagaggat tgagtgctac aaagtcacct atcaacccct tgacccagac agccatgaca 661 aggatctttc tttcatcaag gtgataaacg tgggccagag atttctcgtg aacaaagtcc 721 aggactacat ccagagcaag atcgtctact acctcatgaa tatccacgtc caccctcgca 781 ccatctacct ttgccggcat ggagagagcg agttcaacct cttggggaag attgggggtg 841 actcaggcct ctcagtgcga ggaaaacagt ttgcccaggc tctaaggaag tttctggagg 901 aacaggagat agcagacctc aaagtgtgga cgagccagtt gaaaaggact atccagaccg 961 cagaatctct gggggtgacc tacgagcagt ggaagattct gaacgagatt gatgctggcg 1021 tgtgcgagga gatgacttac gcggagattc aggagcagta tccggatgag tttgcgcttc 1081 gagatgaaga gaaatatctg taccgatatc ctggagggga gtcataccag gacctcgtgc 1141 agcggttgga gccagtcatc atggagctgg agcgccaggg caatgtcctc gttatctccc 1201 accaggctgt catgcgctgc ctcctggcct acttcttgga caagggtgca gatgagctac 1261 catacctgag gtgccctctc cataccatct tcaaacttac tcctgtggcc tacgggtgca 1321 aagtggaaac aattaaactc aatgtggagg ctgtgaacac gcaccgtgac aagccaacta 1381 acaactttcc caagagccaa acccctgtaa ggatgagaag gaacagcttt acgcctctgt 1441 ccagttcgaa tacaatcagg cgtccaagaa attacagtgt tgggagccgg cccctccagc 1501 ccctcagccc gctccgtgcc ctggacacgc aagaaggggc cgaccagccg aagacccaag 1561 cagagacctc gcgggctgca cacaggctcc cgtctccagc gccccccacg tcgccctcct 1621 gatggcgtgg aggctgaggc cagaacgctc ccccggagac tggggtctgc tgaagacctg 1681 ggcgtgtcag ctccactggg gctggagcag gaagtcccgt ggggctcgga cattcggggc 1741 cacccgagtg agcctggaaa agctgccctc catttcttcc cctcatgcct aatgaaaacc 1801 gtgaaaagct cggcttccct gtacctggcc cctcg // LOCUS BOVTHYRA 2045 bp ds-DNA MAM 14-AUG-1990 DEFINITION Bovine thyroglobulin gene, 5' end. ACCESSION M35823 KEYWORDS thyroglobulin. SOURCE Bovine DNA, clone CBT1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 2045) AUTHORS Ledent,C., Parnentier,M. and Vassart,G. TITLE Tissue-specific expression and methylation of thyroglobulin- chloromyhenicol acetyltransferase fusion gene in transgenic mice JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by C.Ledent, 27-JUN-1990. FEATURES from to/span description mRNA 2037 > 2045 thyroglobulin mRNA, exon 1 signal 2006 2011 TATA box rpt 826 1025 bovine monomeric repeat rpt 1712 1816 bovine monomeric repeat BASE COUNT 467 a 508 c 579 g 491 t ORIGIN Chromosome 14q11-q15. 1 gatccaggag ggagaggtga gagggccaga ggtgaggagc ctgcagagag aatggagacc 61 caagccagga gcgggctcac ggaggccaga ccaaggagtt ggtgctgagt ttccgtagag 121 catcctcctg gctcacgagg catcagatga tggtaggagc tgagtcatcg tgagaaagag 181 gttttcacgg tgtggaggca gcgcctggca tggctctctc caagctcatc gcttcattgc 241 tggagtggag agaagcacct ctccttccag gtacactgtt ttgactcaga aggaagcccg 301 ggcggctctc accctcactg actctgctgt gcctcttccc aggggccagg ccagggagaa 361 tgactcagca cacaagctgg cctgtgcaag accctgcctg catgcatggg tgggggaagc 421 atgaggagtg ggctttctgg tagacctcag tcatggtctc agcattattt tttaaaagga 481 ttttatttat ttgttttgat gtggaccagt tttaaaatct ttattgaatt tgttaccatg 541 ttgcttctgt tttatgtttt ggttttttgg ccacgaggca tgggggatgt tagctccccg 601 accagggact gaacccttgc cccctgcatg gaaaggtcaa gtcttaactt cttgactgcc 661 taggaagtcc cggtttcaac attttagaaa attcatctgc tttagaacac ctggacctgg 721 agagctggca agggctgcac atggccaggc tcacggtaat attacagcgt ctccatagcc 781 tgggcagaac ttggcttgtg tgattggggc tgtgggctcc ccaggtggtg ctagtggtaa 841 agaacccttc tgccagtgca ggagacataa gagatgcggg ttctatccct ggatcaggaa 901 gatcccctgg agaaggaaat ggcaacctgc tccaggattc ttgcctggag aatcccaagg 961 acagaagagc ctggcgggcc gtagtccatg gcgttgcaaa gagtcagaca caaccgaagc 1021 gacttagcac ccaatggggg ctggagaggg ccaagctggg ggattctaat tctccagtaa 1081 atgccagcat gtcttctcaa atcatcttta cctaaaacct aaggctcatt gtgatctccc 1141 tcactcccaa gtgctctccc acacctgccc acctaccttc tgaacaccca tggagcctgt 1201 ctccatcccc caccccttta atgggagaag ccactggaga aagctcactg atgtacaggc 1261 tttatgtgga agacaagctt cctgctgcct tttggttgtc tgacgtcctg ggacagaggg 1321 gaaaggggga tgactacgag tatgactgtg cgtgtgtttg gcttatctca tcaaaatctc 1381 tacattctgt gttaatggat ctgcctgttt tgttccctgc catatcctca tggcctagaa 1441 tagtgtctgc ttctctatca gactctaaag aaacattgct aggagggaag gaaggagcat 1501 ggatgaggag ggagggagca ttgtgtttct ctcacggtgg gcctgaacgt gtggcccacc 1561 aagttgttaa ctttggcctt tacccctgaa gatgaattat gaagccacac ccccagttct 1621 tccttggtgg ctcagatggt caagaatcca cctgcaatgc gggagacctg ggtttgatcc 1681 ctgggttggg aagatcccct ggagaaggga atggctaccc actccagtat tctggcctgg 1741 agaatcccat ggacagagga gcctggcggg atgcagtcca tggggtctca gagagtcaga 1801 tgtgactgag cgactttcac acacattcgt ccctggttct gctcccctac agcctccaca 1861 agattttcac cccacactgg ccacatgagt gtcctccagg ggaacagacg caggtggagg 1921 acctccttgt gaccagcaga gaaaacaggg tgggcactgc ttccctgagt gcctgtgggt 1981 gggggctaag tacccacagc agtgctataa aggctccttg gccagagccc taaggtgggc 2041 agcag // LOCUS BRPTUBBA 4571 bp ds-DNA INV 14-AUG-1990 DEFINITION B.pahangi beta-tubulin gene, complete cds. ACCESSION M36380 KEYWORDS beta-tubulin. SOURCE B.pahangi DNA, clone pBTY. ORGANISM Brugia pahangi Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Gnathostomatidae. REFERENCE 1 (bases 1 to 4571) AUTHORS Guenette,S., Prichard,R.K., Klein,R.D. and Matlashewski,G. TITLE Characterization of a beta-tubulin gene and beta-tubulin gene products of Brugia pahangi JOURNAL Biochem. Pharmacol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Guenette, 11-JUL-1990. FEATURES from to/span description pept 396 452 beta-tubulin, exon 1 891 999 beta-tubulin, exon 2 2115 2341 beta-tubulin, exon 3 2424 2552 beta-tubulin, exon 4 2718 2880 beta-tubulin, exon 5 3002 3192 beta-tubulin, exon 6 3299 3394 beta-tubulin, exon 7 3630 3816 beta-tubulin, exon 8 4047 4234 beta-tubulin, exon 9 pre-msg < 395 > 4400 beta-tubulin mRNA and introns IVS 453 890 beta-tubulin intron A IVS 1000 2114 beta-tubulin intron B IVS 2342 2423 beta-tubulin intron C IVS 2553 2717 beta-tubulin intron D IVS 2881 3001 beta-tubulin intron E IVS 3193 3298 beta-tubulin intron F IVS 3395 3629 beta-tubulin intron G IVS 3817 4046 beta-tubulin intron H signal 66 69 reverse CAAT box signal 90 95 weak TATA box signal 4395 4400 polyA signal BASE COUNT 1278 a 783 c 910 g 1600 t ORIGIN 1 ggtgtgcaga tagcacgagg aaatgagagg agcagagaag tagggcgcag tcagacagtt 61 ggatgattgc gcctcccgtg tcacttccct atataccttt gcaaggagga catgacaaag 121 ttcgaggtta tttctcgacg cgtctctctg tatactcaca tgtctgtctg tctgtctgtt 181 tgtccgtctg cctgtctgcc ttggttgcta ggaagcccgt cgactttctc cacagtttag 241 tgagtgtttg tgtgatggat aggatggttg gcgcgcggat gggtcggcca aacagttctg 301 tgactgattg tagttgccgg atgacttgat cagtaattac cggattgatt gcaggtctca 361 tttcggtcga caagatttca ttaagtgttt aagctatgag agaaattgtc cacgttcaag 421 ctggtcaatg tggcaaccag attggtgcca aggtattgat tttctgctct ttctctttat 481 cctttaggaa gatgactcat cagagggtat ctttccgcag aaattttccc gagatataat 541 tcccacttcc cctcattgaa attgttttgg tatttaggta tcatcgtcaa aatattccat 601 ttcatttctt aatgtgctat cgtttatcat ctgctcttcg attgtatctt ctcccttctt 661 tgcttcattg attcctaatg caatacataa tacctaccat ctatcacggt taataaaatt 721 cgatgttaga tcacaaagat caagttgtga taatctttcg tataataagt tgttgttttc 781 aacaatctgc agttaggtaa gatgtctgag gagcaagcaa aatgtggatg taggggatta 841 ctgaagttaa aaattgaaaa ttttaaacat aacttgagtt tttttttcag ttctgggaag 901 taatatcgga tgaacatggt gttcaacctg atggtacata caaaggtgat tcagacctgc 961 aaattgaacg aatcaacgtc tactataatg aagcgaatgg ttggtccatt attggagttt 1021 tttttcttct ttgctttgga atgttctttt ttgttttctc tactagttgt tgccactcgt 1081 tttcacacac attatacata gttctacata cacatgcata ccacacatag tcccatatat 1141 aaacaattgt tgtctgacat cgaacattag ttcggcggat gaggttcatc agggcgtcct 1201 caaaataatg tgttcgagaa ttaaatgaag aatgtctgcg tctcttttca aggtccatcg 1261 tggtgaaatc atttgagaca taaggagatg gatggactgt ttaaattaga taaatttaaa 1321 tacaagtttc agaataaccg acataatcaa agatgttttt gttacagaaa tatttgggaa 1381 tgattatagc aagattgtaa aacagatatt ttgacatcta cctgtctttg atcacaacta 1441 ttgtttatga tatttttggg atttttgcaa tgtaatgaca cttattttta gaatatcgta 1501 aacactttct gacctaactt gtgatttgtt cgaagcctgg tagtgaaatt tatttggcgg 1561 ttgtgaaaat acagtaaata accgcactat ataaggaaag aataacattg tcactttaac 1621 caaacaaatc tcaaattgtc gtcacttcaa ccaaaacctt atcaaacagt gtttttgacc 1681 tattctaact tctaacccca gtgtttcttg attgtcttga ttgttattct ataacaccta 1741 caaagaataa gaattccttc aattcaaact tttcacgagt tattcgtctt tgtcagtcta 1801 cactatgagg acttcgtcgt caacttattc caagataaaa aactttcagt gcaacagtct 1861 tctttttttt ctcatctctt cgtaatggaa ttaaaaccaa acaagacgtt cgtcatatgt 1921 tagcttgtca gtaatgtttg taatacatct ttattttcgt ctttttgttt ttcctctctt 1981 cattatgcag tgaaaaggag tatttttctt taggttttct gtaaagatat acttgataat 2041 agattgccat aatgaaccaa gcatcgttgt aaatcttcac ttcgtctcaa gcgatattaa 2101 ttttttgtgt tcagggggca aatatgtgcc acgagcagtc cttgttgatt tggaaccagg 2161 taccatggat tctattcgag gaggtgagtt cgggcaacta ttccgacctg acaattttgt 2221 ttttgggcaa agtggagctg gcaacaactg ggctaaggga cattatacgg aaggtgcgga 2281 actagttgat aatgtgttgg acgtgatacg aaaagaagct gagggatgcg attgtcttca 2341 ggtacggatt gccatagttt tataagacat tttaattgtg gatgtgtttt tattttaaag 2401 aaaaaattaa aaaatatttt cagggatttc aactaacgca ttcacttggt ggtggtaccg 2461 gttccggcat gggaacattg ctgatctcga aaattcgtga ggagtatccg gatcgaatta 2521 tgagctcttt ttcggttgtg ccatcgccca aagtatgtat ttgaatttct gattcttatt 2581 tttaacatct gttatatcta ttttaaacat agctccataa cttttaagtg gcggttgtct 2641 cacaaagtgg tttttcggct atcatttcat tttcatagtt gaagaaaaag tatgctagac 2701 gtattaatga tttccaggta tcagatgttg tgttggaacc ctacaatgca acattatcag 2761 tccaccaact agttgaaaac actgacgaaa ctttctgcat tgataacgag gctttgtatg 2821 acatctgctt ccgaacgttg aagttggcaa atccaactta cggtgacctc aaccatttgg 2881 gttcgtcctt taatttattc ttcgaactgt ccggtttaaa tgcattctat ttaactattt 2941 ataaaattcg gatttaattg taatttattc aactcttcta aaggttaaaa tttctattta 3001 gtgtctgtga caatgtcggg agtaacaact tgcttacgtt tccctggaca gttgaacgcc 3061 gatctccgta aacttgccgt caatatggtg ccattcccac ggttgcattt ctttatgcca 3121 ggatttgctc ctctctctgc tcgtgatgct gctgcttatc gagccctcaa tgttgctgaa 3181 cttactcaac aggtctctgt ttatttttat ggtcagggtc tctttttttg aaagtaatcc 3241 atttataatc gctttaaatg tctcctgttt tatgtttttt aatatcttat ttcttcagat 3301 gtttgatgcc aaaaatatga tggcagcatg tgatccgcgt catggtcgtt acctaaccgt 3361 agctgccatg ttccgaggta gaatgtctat gcgggtaagt gaattttgat aatctccatg 3421 ttaaactcag ccgctgaagc atgaagattt taatccatgt gatgtcggtt tttcttgacc 3481 tgaacttagt ggtgtatcac ttggtcttag ttatgttctt cctatttcca gagtgcttgc 3541 ttaactgcaa ttcttttgtg gaagcttgtt tttgatgtag ggtttttttc ttcatcgctg 3601 tttcaaagtt tttaataagt aggtcttagg aagtagacga gcaaatgatg caagtacaga 3661 ataagaattc atcgtatttc gttgaatgga ttccaaataa cgtaaagaca gctgtttgcg 3721 acattccacc acgtggatta aagatgagcg caacatttat tggaaataca acagctatac 3781 aagaactttt caagcgaatt tccgaacagt ttactggtga acttattaat tccatctcat 3841 ttgagcttag tttgatagct aaaaactaaa tttctggcaa gattgaaaag agatacaatt 3901 gggtatgtgt aggttacatg atttttttgc catattatga gttttatgca aaaggtgaac 3961 atttttaata tgtttgttcc gaatcgtctg aacttgcgtt ctatggaggt tatacttaag 4021 tcgatccgat tttttaatga tttcagccat gttccgacgt aaagcattct tgcattggta 4081 tactggcgaa ggtatggatg aaatggaatt cacggaagcg gagagtaata tgaatgactt 4141 ggtgtccgaa tatcaacaat atcaggatgc gacggctgat gaagaaggtg atcttcagga 4201 aggtgaatcg gaatacattg aacaggaaga gtgagcacaa agtgtcagct ttgtgaaaaa 4261 ggaactttta cagtatttta tttatcgtct tttccatgtt tttatgttgt tacattttat 4321 tttgttactc gctttcgaca tatattagca agttttaaag aattatagcg gcaaatctca 4381 ttgttgtact tctcaataaa ttacaatcaa aagttaatga ttcctaatgc aaagtcgtaa 4441 ggcagaaatc acgtgaaaaa cagtaatcat gattctctga aagttccaat ttgaaataaa 4501 gtactaattt tatgtgggca gtggacggta ttccgtgaca gaagatgcta cagaagattg 4561 cagcagtttt t // LOCUS BSPCELALKA 3498 bp ds-DNA BCT 14-AUG-1990 DEFINITION Bacillus sp. alkaline cellulase gene, complete cds. ACCESSION M27420 KEYWORDS alkaline cellulase; endo-beta-1,4-glucanase. SOURCE Bacillus sp. (strain KSM-635) DNA, clone pBC100. ORGANISM Bacillus sp. Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 3498) AUTHORS Ozaki,K., Shikata,S., Kawai,S., Ito,S. and Okamoto,K. TITLE Molecular cloning and nucleotide sequence of a gene for alkaline cellulase from Bacillus sp. KSM-635 JOURNAL J. Gen. Microbiol. 136, 1327-1334 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Ito, 30-AUG-1989, for release after publication. FEATURES from to/span description pept 605 3430 alkaline cellulase precursor (EC 3.2.1.4) sigp 605 691 alkaline cellulase signal peptide matp 692 3427 alkaline cellulase binding 530 536 SD1 ribosomal binding site binding 595 600 SD2 ribosomal binding site signal 471 476 -35 region signal 494 499 -10 region BASE COUNT 1133 a 605 c 704 g 1056 t ORIGIN 1 ggcacctgag gaagtagcga atcctgttct tgatgcacat ccatacttaa cttctggatt 61 tgccttcatg agtcgtgatg aaaacggaag tgcaccatta catggactgt ttgcatttaa 121 ttattcggca ctgattagct gtggcatttc cgcttctgct ctttctggaa tgaagtacgg 181 ggtcccaaga cttgtcactg ccattgccga tcagttattc caagatgatc gagacgagat 241 tctaaaggac ttctttgagt atgatgagaa ggagtttgtc ggaaactggc ctttaaacgt 301 ctaaatgaac ataatagcga aagggcttaa ccaaaaatat gaattgaacc cacataaatt 361 tgtgggtttt tattaatcaa aaaaatggta aataaaccta ttttaacaat gcttataacc 421 atttttctat ttattgcata aaaaaatcag taaaaaaatt catttatatg tagacgtaaa 481 ttaacaaata ttatattata tatacgaaag cggtttcgaa aaatagagga aggaggagag 541 tttttagttt ttgttgtttg tttattgtaa gcgtttacta ttaatacatt tctgggaggt 601 tattatgaaa ataaagcaaa ttaaacaatc tttatctttg cttttaatca tcacactcat 661 tatgtcacta tttgttccta tggcttcagc aaacacaaat gagtctaagt ctaatgcatt 721 tcctttttct gatgttaaaa aaacttcttg gtcttttcca tatataaagg atttatatga 781 gcaagaagtt attacaggaa catctgcaac aacgttctct ccaacagatt ccgttactcg 841 tgcacaattt acagtgatgc ttacccgtgg tcttggacta gaagcatctt ctaaagatta 901 cccttttaaa gatcgtaaaa actgggctta caaagaaatt caagctgcat atgaagctgg 961 aattgtaact gggaaaacaa acggtgaatt tgcaccaaat gaaaacatta ctcgtgaaca 1021 aatggctgct atggccgtac gtgcttatga atacttagaa aatgagctat ctttaccaga 1081 agagcaaaga gaatataatg actcttcttc tatttcaacc tttgctcaag atgctgttca 1141 aaaagcatac gtattagagc taatggaagg aaatacagat ggatattttc aaccaaaaag 1201 aaactctact agagaacagt ctgctaaagt tatctctact ttactttgga aagtagctag 1261 tcatgattat ttataccata cagaagctgt taaaagccct tcagaagctg gtgcgcttca 1321 gttagtagaa ctaaacggac aattaacact agctggtgaa gatggtactc ccgttcaatt 1381 acgtggaatg agtacacatg gcctacaatg gttcggtgaa atcgtaaacg aaaacgcttt 1441 cgtagcacta tcgaatgatt ggggatctaa catgattcgt ctcgctatgt acattggcga 1501 aaatggatat gcaacaaacc ctgaagtaaa agatttagtt tatgaaggaa ttgaattagc 1561 gtttgagcac gatatgtatg taattgttga ctggcatgta catgctcctg gtgatcctag 1621 agcggatgta tactcaggtg cttatgattt cttcgaagaa attgctgatc attacaaaga 1681 tcatccgaaa aaccattata tcatttggga actagcaaac gaaccaagtc caaataataa 1741 cggtggacct ggattaacaa atgatgaaaa aggttgggaa gctgtaaaag aatatgcaga 1801 gccaatcgtt gaaatgttgc gtgaaaaagg tgacaacatg attttagttg gaaatcctaa 1861 ctggagccaa cgtcctgact tatcagctga caacccaatt gatgcagaaa atatcatgta 1921 ttctgttcac ttctacacag gctcacatgg cgcttctcac attggttacc ctgaaggaac 1981 accaagctct gaacgttcta atgttatggc taacgttcgt tatgctctag acaatggcgt 2041 tgctgtgttt gcgacagagt ggggtacgag tcaagcgaat ggagatggag gaccttattt 2101 tgatgaagct gatgtttggc ttaatttctt aaacaaacat aacattagct gggcaaactg 2161 gtcgttaacg aacaaaaatg agatttctgg agcatttaca ccttttgagc ttggtagaac 2221 agatgctaca gatcttgatc caggtgctaa tcaagtatgg gcacccgagg aactaagttt 2281 atctggtgaa tatgttcgtg ctcgtattaa aggaattgag tatacaccta tcgaccgcac 2341 aaaattcaca aagcttgttt gggattttaa cgatggaaca acacaaggat tccaagttaa 2401 tggagacagc cctaacaaag aaagcattac tttaagtaat aataatgatg cattacaaat 2461 tgaaggatta aatgtaagta atgatatttc tgaaggaaac tactgggata atgtacgcct 2521 gtcagctgat ggctggagtg aaaatgtaga tattttaggt gctacagagc ttacaattga 2581 tgttatcgtt gaagaaccga caacagtttc aattgctgct attccacaag gacctgctgc 2641 tggctgggct aacccgacta gagcaattaa agtaactgaa gacgatttcg aatctttcgg 2701 agatggatac aaagctctcg taactattac ttctgaagat tcaccttcac ttgaaaccat 2761 tgcaactagt cctgaagaca atacaatgag caatatcatt ctatttgtag gtactgaaga 2821 tgcagatgtt atttctttag ataatatcac ggtttctggt actgagattg aaattgaagt 2881 tattcacgat gaaaaaggaa cagcaacact tccttctact tttgaagatg gaactcgcca 2941 aggctgggat tggcatacag aatcaggagt taagacagct cttacaattg aagaagctaa 3001 tggatctaac gctctttcat gggaatatgc gtatcctgaa gtaaaaccaa gtgatggttg 3061 ggctactgct cctcgtctag acttctggaa agacgaacta gttcgtggca caagcgacta 3121 tattagtttt gacttttaca tcgatgcagt tcgtgcttct gaaggtgcta tatcaattaa 3181 cgccgttttc caaccacctg caaacgggta ttggcaagaa gttccaacta catttgaaat 3241 tgatttaaca gagcttgatt ctgcaactgt aacttctgat gagttgtatc attatgaagt 3301 aaaaattaac attagagaca ttgaggctat tacagacgat acagagcttc gtaacttatt 3361 actaatcttt gctgatgaag acagtgattt tgctggtaga gtttttgttg ataatgtaag 3421 atttgaataa tttaaaaaca gtagatagag agactctcta tctacctgtt tattgcttac 3481 tattcgtctt ccactttt // LOCUS BSUPHOA3A 743 bp ds-DNA BCT 14-AUG-1990 DEFINITION B.subtilis alkaline phosphatase (phoAIII) gene, complete cds. ACCESSION M33634 KEYWORDS alkaline phosphatase. SOURCE B.subtilis (strain 168) DNA. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 743) AUTHORS Bookstein,C., Edwards,C.W., Kapp,N.V. and Hulett,F.M. TITLE The Bacillus subtilis 168 alkaline phosphatase III gene: Impact of a phoAIII mutation on total alkaline phosphatase synthesis JOURNAL J. Bacteriol. 172, 3730-3737 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.M.Hulett, 06-APR-1990, for release after publication. FEATURES from to/span description pept 503 > 743 alkaline phosphatase precursor (phoAIII, EC 3.1.3.1) sigp 503 598 alkaline phosphatase signal peptide matp 599 > 743 alkaline phosphatase BASE COUNT 221 a 149 c 131 g 242 t ORIGIN 1 gatccgtaaa catggctgct tccattttat ttctgtatgt aaatttggat atttattttt 61 tatccaagtt aacacctctt ccgttctttc tcggtggaac gggtcagcaa tccccgctaa 121 gtactcagaa aaaacgtcca tgttgttccc tcctaaaata aaatgatgcc actccatatg 181 aactgctcag acatgtgctt acacaatacc tcttgccaac gtaatacttt aaattgttca 241 agatgtaagc gggttgcttc tatgcgacta tagcagattt cagtccgcct atccacggcc 301 agtatctatt cctccaattt atttctttag tactatcgct ctttccttct aaaacttctc 361 ataaaagaat aaccattatt taagggtgcc agttcattat tcttgtaaat ccaatcttta 421 aaatcgatta atactagctt aacagtttaa aaatataatt gggttgtcat tgagattcat 481 ctatatttag gaggttatcc agttgaaaaa attcccaaag aaattactgc ctatcgcggt 541 tttatcatca attgcgttca gcagcttagc cagcggcagt gtgcctgaag ccagcgccca 601 ggaaaagaaa aaggggaacc aagacgaaat taaaaatgtt attgttctga ttggtgatgg 661 tatgggtgtg tcttatacgt ctgcttatcg atacttaaag gataataaaa agacaaaagt 721 tgtggagcct acggcttttg atc // LOCUS CHTSIGAA 2180 bp ds-DNA BCT 14-AUG-1990 DEFINITION C.trachomatis sigma-A protein (sigA) gene, complete cds. ACCESSION M36475 KEYWORDS sigma factor; sigma-70 homolog. SOURCE C.trachomatis (strain MoPn) DNA. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 2180) AUTHORS Engel,J.N. and Ganem,D. TITLE A polymerase chain reaction-based approach to cloning sigma factors from eubacteria and its application to the isolation of a sigma -70 homolog from Chlamydia trachomatis JOURNAL J. Bacteriol. 172, 2447-2455 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 360 2075 sigma-A protein (sigA) BASE COUNT 679 a 380 c 517 g 603 t 1 others ORIGIN 1 tctagaggga tttgcccaat gggagcgaga tttattaaca gaatctcgaa cagaacgctc 61 tggcaaaatc gtaattaaaa cgaaagcttt tggagccact ccgttctact attgtgtagt 121 aacgttagat aaaggacctt tggcagaaca tgtattaggg gttttatacc ctgcaaaagc 181 aagttttttt acaaatcttt cctatattta atgataaata gggatgtcgg tgtaggaagt 241 ttttctaggg ccgaatctga tcagctggtt ttttatctaa aaatccctag atttcttgtt 301 ttccttaaga taaactgtca ctttctaggc cgatttttcc ttagttttaa tttgttagca 361 tgcgcatcca tacgctagat agtcaacgtg ctgatgcagc ccaagaagaa gaaatccaaa 421 gaaagttaga agagctagtc actcttgcta aggatcaagg gttcatcacg tatgaagaaa 481 ttaatgaaat tcttccccct tctttcgatt cgccagaaca gatagatcaa gttttaattt 541 ttctggcggg gatggacgtt caagtcttaa accaagcaga cgtagagcgg cagaaagaaa 601 gaaaaaaaga agctaaagag ctagaagggt tggctaagcg ttctgaggga acgcctgatg 661 atccagtgcg tatgtatctg aaggaaatgg gtacagttcc tcttcttaca agagaagagg 721 aggtggagat ttctaaaagg atagaaaaag ctcaggtaca aatagaaaga attattttac 781 gctttcgtta ttcgactaaa gaagcggttt ctattgcgca atacttaatt aatggtaagg 841 aacgatttga taagatcgtt tccgnaaaag aggtggaaga taagacacat tttcttaatc 901 ttttgccgaa gttaatctct ttgctgaaag aagaggactc ttacttagaa gagcggcttt 961 tggctttgaa agatccggca ctgtccaagc aagatcaagc aaagttgaac gatgagcttg 1021 aaaaatgccg tattcgaaca caagcttatc tgagatgttt tcattgtcgt cacaatgtca 1081 cggaagattt tggagaagtt gtttttaaag cttatgactc gttcttacag ttggaacaac 1141 aaatcaatga tttaaaggtt cgtgctgaaa gaaataagtt tgctgcagca aaactagctg 1201 cagcccgacg caagctttac aaacgagaag ttgcagcggg acgtactctt gaagagttta 1261 aaaaagatgt gcggatgttg cagcgttgga tggataaaag ccaggaagca aagcaagaga 1321 tggtggaatc caacttacgt ttggtgatct ctatcgcgaa aaaatatacc aaccgagggc 1381 tgtctttctt ggatttgatt caagaaggaa atatgggctt aatgaaagct gttgaaaaat 1441 ttgagtatcg ccggggttat aaattttcaa cttatgccac ttggtggatt cgtcaggctg 1501 tgacgcgagc tattgccgat caggcaagaa ccattcgaat tcctgttcat atgatcgaga 1561 ccattaataa ggtgcttcgt ggagccaaga aattaatgat ggaaactgga aaagagccta 1621 cgcccgaaga actcggagag gaactaggtt tcactccaga ccgtgttcga gaaatttata 1681 agatcgctca gcatccgatt tctttacagg ctgaggttgg agatagtgga gaaagctctt 1741 ttggagattt cttggaagat acagctgttg aatctccggc agaggcaaca ggctactcca 1801 tgttgaaaga caaaatgaag aaagtgctaa aaacgcttac tgatcgcgaa cgttttgttt 1861 tgatccatcg gtttggcctt ttagatggcc gtcccaaaac tttggaagag gtaggctccg 1921 cgttcaacgt gacacgagag cggattcgac aaatcgaagc caaagctttg cgaaaaatgc 1981 gtcatcctat ccgttccaaa cagctacgag cattcttgga tttattggaa gaagagaaga 2041 ctggttcggg caagattaag agttataaga attaaggatt ccttttcata aggagtatgc 2101 ttgtatcggt tagacgtaac gaactttcgt gtttgggtat ctataggggt ctcagagcag 2161 gaacgctatc acaagcaacc // LOCUS CUSRGSBHA 1137 bp ds-DNA PLN 14-AUG-1990 DEFINITION Melon 17S rRNA, 5.8S rRNA, and 25S rRNA gene region. ACCESSION M36377 KEYWORDS 17S ribosomal RNA; 25S ribosomal RNA; intraribosomal spacer region. SOURCE C.melo 4-wk old cotyledon DNA, clone pML17. ORGANISM Cucumis melo Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Violales; Cucurbitaceae. REFERENCE 1 (bases 1 to 1137) AUTHORS Kavanagh,T.A. and Timmis,J.N. TITLE Structure of melon rDNA and nucleotide sequence of the 17-25S spacer region JOURNAL Theor. Appl. Genet. 76, 673-680 (1988) STANDARD simple staff_entry FEATURES from to/span description rRNA < 1 259 17S rRNA rRNA 476 638 5.8S rRNA rRNA 876 > 1137 25S rRNA BASE COUNT 253 a 333 c 324 g 227 t ORIGIN 1 atagatcatt gcaattgttg gtcttcaacg aggaattcct agtaagcgcg agtcatcagc 61 tcgcgttgac tacgtccctg ccctttgtac acaccgcccg tcgctcctac cgattgaatg 121 gtccggtgaa gtgttcggat cgcggcgacg tgggcggttc gctgcccgcg acgtcgcgag 181 aagtccactg aaccttatca tttagaggaa ggagaagtcg taacaaggtt tccgtaggtg 241 aactgcggaa ggatcattgt cgatgcctaa acatcaaacg acccgcgaac gcgtttaaaa 301 acaaactgtt cgcgttaggg gcggggggaa gcatgctctt tggctgcctc ctccccttcc 361 aacgcgttta aacaaaaccc cgcgcaggtc gcgccaagga acttgaaatg aattcgcctg 421 tcccctgccc cggcctcggc gtgcggggga tggagcattc tagtcgtatt actaacaacg 481 actctcggca acggatatct cggctctcgc atcgatgaag aacgtagcga aatgcgatac 541 ttggtgtgaa ttgcaggatc ccgcgaacca ccgagtcttt gaacgcaagt tgcgcccgga 601 gccttctggc cgagggcacg tctgcctggg cgtcacgcat cgctgccccc accacacaac 661 tctccccatg cggggtcgtt gtgaaggcag ggacacacac tggcctcccg tacgcaccgt 721 cgtgcggatg gcttaaattt gagtcctcga tgctcgtcgt cgcgacacta cggtggttga 781 ttcaactcgg tgacgcgtct cgacctcgac gtcgacttca cggactcctt cacgaccctt 841 cgaacgccgc cccttaaaag gacgacgctc tcgacgcgac cccaggtcag gcgggactac 901 ccgtgagagt ttaagcatat caataagcgg aggaaaagaa acttacaagg attcccctag 961 taacggcgag cgaaccggga agagcccagc ttgagaatcg ggcgtcctcg acgtccgaat 1021 tgtagtctgg agaagcgtcc tcagcggcgg accgggcaca agtccctgga aggggcgcca 1081 gagagggtga gagagcccgt tgcgctcgga ccctgtcgca ccacgaggcg ctgtcaa // LOCUS DOGFIXA 3080 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Canine blood clotting factor IX mRNA, complete cds. ACCESSION M33826 KEYWORDS blood clotting cascade protein; blood clotting factor IX; protease. SOURCE Dog hemophilic liver, cDNA to mRNA, clone dF9.28. ORGANISM Canis familiaris Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Carnivora; Caniformia; Canoidea; Canidae. REFERENCE 1 (bases 1 to 3080) AUTHORS Axelrod,J.H., Read,M.S., Brinkhous,K.M. and Verma,I.M. TITLE Phenotypic correction of factor IX deficiency in skin fibroblasts of hemophilic dogs JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5173-5177 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.H.Axelrod, 23-APR-1990, for release after publication. FEATURES from to/span description pept 22 1380 factor IX protein precursor sigp 22 138 factor IX signal peptide matp 139 1377 factor IX blood circulatory protein matp 577 1377 factor IX activated protein matp 673 1377 factor IX activated protein mRNA < 1 3080 factor IX mRNA BASE COUNT 878 a 640 c 677 g 885 t ORIGIN Chromosome X. 1 aagcagcgcc tgaataggat catggcagaa gcatcgggcc tcgtcaccgt ctgcctttta 61 ggatatctac tcagtgccga atgtgcagtt tttcttgatc gtgaaaatgc caccaaaatt 121 ctgagtcggc caaagaggta taattcaggt aaactggaag agtttgttcg agggaacctt 181 gagagagaat gtatagaaga aaagtgcagt tttgaagaag cacgggaagt ttttgaaaac 241 actgaaaaaa ccactgaatt ttggaagcaa tatgttgatg gagatcaatg tgaatccaat 301 ccatgtttaa atgacggtgt atgcaaggat gacattaatt cctatgaatg ttggtgtcga 361 gctggatttg aaggaaagaa ctgtgaatta gatgtaacat gcaacattaa gaatggcaga 421 tgcaagcagt tttgtaaatt gggccccgat aacaaggtgg tttgttcctg tactacggga 481 taccaacttg cggaagacca aaggtcctgt gaaccagcag tgccatttcc atgtggaaga 541 gtttctgtcc ctcacatttc tatgacacgc acccgtgctg aaactctttt ttccaatatg 601 gactatgaaa attcaactga agtggaaaaa attttggata acgtcaccca accgcttaac 661 gacttcactc gagttgttgg tggaaaagat gccaaaccag gtcaattccc ttggcaggtc 721 cttttgaatg ggaaagttga tgcattctgc ggaggttcca tcatcaatga aaaatgggtg 781 gtaactgcag cccactgtat tgagcctgat gttaaaatta ccatagttgc aggtgagcat 841 aacaccgaga agagggaaca tacagagcag aagcgaaacg tgattcgcac tattcttcac 901 cacagctata atgcaactat taataagtac aaccatgaca tcgcccttct ggaactggat 961 gagcccttaa cgctgaacag ctatgtaaca cctatttgca ttgctgacag ggaatactcg 1021 aacatcttcc tcaaatttgg gtctggctat gtgagtggct gggggagagt cttcaacaaa 1081 gggcgatcgg cttcaattct tcaatacctt aaagttccac ttgttgaccg agccacgtgc 1141 cttcggtcca cgaagttcac catttataac aacatgttct gtgctggctt ccatgaggga 1201 ggtaaagatt catgccaggg cgatagtggg ggaccccatg tcaccgaagt agaaggcata 1261 agtttcttaa ctgggattat tagctggggt gaagagtgtg cgatgaaagg gaagtatgga 1321 atatatacca aggtgtcccg gtatgtcaac tggattaaag aaaagacgaa gctcacctaa 1381 agaataatgt atttccaagg ttgacacgtt tagggtagaa aatggacaag gtcctttact 1441 aactaatcac tttttttatc tctttagatt tgcactatat acattctctg catactgctt 1501 tttctctttc tggggagaaa tctatctaga attcctattt tactagacta agtgaattag 1561 aaaatgtaat cactacggga atgtactgtg atgggacttg tgaccactcc cacaggtcta 1621 gcccttggca ccattgtgag gttaggttat tcctccgtcc cgtcagtgtt aagtttctcc 1681 actggggcaa ctccctgatt ctccctcctt ggcagcattc catgttccag acctttctta 1741 cctttcccat ggaaagcatc atcaaaatgt gttagatcta catccaggat atttgatcta 1801 gttcagaaca agtctaacac ctcacttgtt aaggaagaac acaggagcaa ctgacaggtt 1861 gcaactcacc agaaaacact atttcctttt ctgtatgctt attcctgctt ccttatctct 1921 tctgtttcct aatcctgaaa tcagtgtctc tctttctctt tctctctctc tttttcctac 1981 agaggtttaa aggagggaag ggacacatca tgctgtttta ctactgtcca cagttataca 2041 tgtctatcaa acccagactt gctttcagtt tggtctttga cttgcttttc ggagcatagg 2101 gatgaagcca ggtgcctgaa gaccttcagg aaaatttgtc tgaaagagtc atgttactga 2161 aatacatggc atcgtggaag gaatgaccga cccatcagaa tacgtccaat aagcatttgt 2221 aattgtgttg tgatagaggt aactaagaag agtgacatca actcctgcgt ctcatccccc 2281 atgtgaaaaa actgtgaact aaaagagaac agccagtgtg caacgtagaa ctagtagagt 2341 cttaaggaag aattcattgg tgggtctcca gtactggccg gagctaagga agaagttgcc 2401 ctcgaccaaa gaacatgagc accctgtctc ttaaactagc atgtccccac agtggagaag 2461 gatgtactgg tggcttaaag gcatgagtca ggggtgtctg ggtggctcag tggttgagcc 2521 tttggctcag ggcatgatcc cagcgtcctg ggattgagtc ccgcatcagg cttcctgcag 2581 ggggcctgct tctccctctg cttatgtctc tgcttctctg tgtgtgtgtg tgtgtctctc 2641 atgaaaaaaa taaataaaat ctttaaaaaa aaaaaaggta tgagtcattc cagtgagcca 2701 tctcactccc ccttttctgg tttcaggttc tctaggcaac cttttgatta taattaggcc 2761 ttccatattg aattttctaa agagttgctg accaacctct gcatgtactt tagtgtgtta 2821 ggctggctcc ctttatgaat taataaactg ttgttctggt tgatactgta gctttttgtg 2881 aatgtggctg atgtgaatca gtcatcccac atgttgttat atatgagact actgacaaaa 2941 tcatgtttgg gaccactttg acctgaccaa gctgctgcct tcacctcacc cccagccagg 3001 catttacttt agtccttcaa atattaagta ccaataaatg tcttttcaaa tttcaaaaaa 3061 aaaaaaaaaa aaaaaaaaaa // LOCUS ENHSREHPA 722 bp ss-mRNA INV 14-AUG-1990 DEFINITION E.histolytica serine rich protein (SHEHP) mRNA, complete cds. ACCESSION M34438 KEYWORDS serine rich protein. SOURCE E.histolytica (strain HM1:IMSS), cDNA to mRNA. ORGANISM Entamoeba histolytica Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Schizopyrenida; Tubulina; Entamoebidae. REFERENCE 1 (bases 1 to 722) AUTHORS Stanley,S.L.Jr., Becker,A., Kunz-Jenkins,C., Foster,L. and Li,E. TITLE Molecular cloning and expression of a membrane antigen of Entamoeba histolytica possessing multiple tandem repeats JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4976-4980 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.L.Stanley,Jr., 22-MAY-1990, for release after publication. FEATURES from to/span description pept 4 705 serine rich protein precursor sigp 4 42 serine rich protein signal peptide matp 43 702 serine rich protein ORF 188 688 ORF2 site 247 582 multiple tandem repeats BASE COUNT 315 a 121 c 118 g 168 t ORIGIN 1 tcaatgttcg catttttatt gtttattgca ttcactagtg caactaatat cattcttgat 61 ttggatcaag aagtaaaaga tactaatatc tacggtgtat ttttaaagaa cgaagctagt 121 cctgaaaagc ttgaagaagc tgaagaaaaa gaaaaaagta gctcagcaaa accagaatca 181 agttcaaatg aagataatga agatgatgaa gatgaaaaag caagttcaag tgataactca 241 gaatcaagct caagtgataa accagataat aaaccagaag caagttcaag tgataaacca 301 gaagcaagtt caagtgataa accagataat aaaccagaag caagttcaag tgataaacca 361 gataataaac cagaagcaag ttcaagtgat aaaccagata ataaaccaga agcaagttca 421 agtgataaac cagataataa accagaagca agctcaagtg ataaaccaga taataaacca 481 gaagcaagct caactaataa accagaagca agctcaacta ataaaccaga agcaagctca 541 actaataaac cagaagcaag ctcaactaat aaaccagaag caagctcaac tagtaattca 601 aatgataaat caggaagtag ttcagataac gataataata accttgatgc tgcatcaagt 661 ccattcattg ttttctgtgc tatcattata gctatcatct tctaaatgag ttatttgact 721 tt // LOCUS EUBBAIOA 6212 bp ds-DNA BCT 14-AUG-1990 DEFINITION Eubacterium sp. bile acid-inducible operon (baiBCDEA2F), complete cds. ACCESSION M36292 KEYWORDS 7-alpha-dehydroxylation; bile acid-inducible operon. SOURCE Eubacterium sp. DNA. ORGANISM Eubacterium sp. Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 1 to 6212) AUTHORS Mallonee,D.H., White,W.B. and Hylemon,P.B. TITLE Cloning and sequencing of a bile acid-inducible operon from Eubacterium sp. VPI-12708 JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.Mallonee, 10-JUL-1990. Medical College of Virginia-VCU Dept. of Microbiology Box 678 MCV Station Richmond, VA 23298-0678 FEATURES from to/span description pept 24 1586 baiB protein pept 1634 3256 baiC protein pept 3235 3552 baiD protein pept 3597 4097 baiE protein pept 4130 4879 baiA2 protein pept 4932 6212 baiF protein BASE COUNT 1864 a 1324 c 1698 g 1326 t ORIGIN 1 aaaagatatt aagcattaag aaaatgcaca aaaaatcagc gtgtgagagg gagggcaagg 61 agttgaagcg tgactttttt aacaagttta atttggggac atcgaacttt gtcacgccgg 121 gaaaacagtt ggaatacgtt tcggaatgca agccagattc tactgcggtc atttgcttag 181 ataaagaaca gaactgttcc gttattactt ggcatcagct gcacgtctat tccagccagc 241 tggcatggta ccttatagaa aatgagattg gcccggggtc gatcgtactt acaatgtttc 301 cgaacagcat cgagcacatt attgcggtat ttgcaatctg gaaggcgggc gcctgctata 361 tgcccatgtc ctataaggcg gcggaatccg agatcaggga ggcctgcgat accatccacc 421 cgaatgcggc ttttgcggaa tgcaagattc caggattaaa attctgcctt agcgcagacg 481 agatatatga ggcgatggaa ggaagatcca aggagatgcc ttcggaccgt ctggccaatc 541 cgaacatgat atccttatca ggcggaacca gcggaaagat gaagttcatc cgtcagaacc 601 ttccatgcgg gctggacgat gagacgatca gaagctggtc tttgatgtct ggaatgggat 661 ttgagcagcg ccagctgctg gtaggcccgc tgtttcatgg cgcgcctcac tccgcggcgt 721 ttaatggact gttcatgggc aacaccctgg tactgaccag gaacctttgc ccgggaaata 781 tcctgaacat gattaagaaa tataagattg aatttataca gatggtgccg accctgatga 841 accggcttgc caaactggag ggagtcggaa aagaagactt tgcatccctg aaggcgctgt 901 gccatacagg gggcgtctgt tctccctggc ttaagcagat ctggatcgac ctgctggggc 961 ctgaaaagat ctatgagatg tattccatga cggaatgcat cggccttacc tgcatccggg 1021 gagacgagtg ggtgaagcat ccgggaagca tcggacggcc agtgggcgat agcaaggtgt 1081 ctatccggga tgagaatggc aaggaagttg cgccttttga gattggcgag atctatatga 1141 cagcgccggc ctcctatctg gttaccgagt acatcaattg ggaaccgctg gaagtgaaag 1201 agggaggctt ccgaagcgta ggggatatcg gctacgtgga tgagcagggc tatctgtact 1261 tttctgaccg gcgcagcgac atgctggtat caggcggaga aaacgtgttc gccaccgaag 1321 tcgagacggc gcttttgaga tataaggata tcctggacgc tgtagtggta gggataccgg 1381 atgaagatct ggggcgaagg ctccatgcgg tcattgagac agggaaagag ataccggcag 1441 aggaactgaa aacattcctg agaaagtatc tgactccata taagatacca aagacgttcg 1501 agttcgtaag gagcatacga aggggagaca atggaaaggc cgacaggaag cggatcctgg 1561 aagattgtat tgcccgcggg ggatgattct ataaatgcaa agaaaacaaa ttatataaag 1621 gaggagtaac aaaatgagtt acgaagcact tttttcacca ttcaaggtca gaggactgga 1681 acttaaaaac cgtatcgtcc tgcctggaat gaacaccaag atggcaaaga acaagcacga 1741 cataggcgag gatatgatag cctaccatgt tgccagggca aaagcgggat gcgcgttaaa 1801 tatatttgaa tgcgtagcat tatgtccggc gcctcacgct tatatgtata tggggcttta 1861 tacggaccat catgtagaac agcttaagaa attgacggat gcagtccatg aagcaggcgg 1921 caagatgggc atccagctgt ggcatggagg attcagcccg cagatgttct ttgacgagac 1981 caacaccctg gaaactccgg acactcttac ggtagagagg attcatgaga tcgtagaaga 2041 attcggacgc ggcgcaagga tggctgttca ggctggattt gacgcagtag aattccatgc 2101 ggctcacagt tatctgcctc acgagttctt aagccctgga atgaacaaac gtacggatga 2161 gtacggcgga agttttgaga accgctgcag attctgttat gaagtcgttc aggcaatccg 2221 ttccaatatc ccggatgaca tgccattctt tatgcgtgca gactgcatcg acgaattaat 2281 ggaacagacc atgacagagg aagagatcgt tacatttatc aataagtgcg cagaacttgg 2341 cgtggatgtg gcagaccttt cccgtggaaa cgcgacttca ttcgcaaccg tatatgaagt 2401 tccgccattc aacctggctc atggcttcaa catagagaat atttacaaca tcaaaaagca 2461 gatcaatatc ccggttatgg gagttggccg tatcaataca ggagagatgg caaacaaggt 2521 cattgaagaa ggcaagtttg acctggtagg catcggacgc gcccagcttg cagatccaaa 2581 ctggatcacc aaagtaagag aaggcaaaga agacctgatc cgccactgta tcggatgtga 2641 ccagggatgc tatgacgcag tcatcaatcc aaagatgaag catatcacct gcacccacaa 2701 tccaggattg tgcttagagt atcagggaat gccaaagaca gacgctccta agaaagtcat 2761 gatcgtagga ggcggaatgg caggcatgat cgctgcggaa gtattaaaga ccagaggcca 2821 taacccggta atcttcgagg catccgacaa gcttgcagga cagttcaggc tggcaggcgt 2881 agcgccgatg aagcaggatt gggcagatgt tgcagaatgg gaagcaaaag aagtagagcg 2941 ccttggaatc gaagtacgtc tgaataccga agtgactgca gagaccatca aggaattcaa 3001 tccggataat gtcatcatcg cagtaggctc tacctatgcg ctgcctgaga ttccgggaat 3061 cgacagccca agcgtatact cccagtatca ggtactgaaa ggggaagtaa atccgacagg 3121 ccgtgtagcc gttatcggat gcggactggt tggtacggaa gtcgcagaac ttctggcatc 3181 cagaggcgca caggtaatcg cgatcgagag gaagggcgta ggtaccggcc ttagatgctt 3241 cgcagaatgt tcatgaaccc ggaattcaaa tattacaaga tcgccaagat gtccggaaca 3301 aatgtcaccg ctttagagca gggcaaggtt cactacatca tgacagacaa gaagaccaaa 3361 gaagtgacgc agggagtcct ggaatgcgac gctaccgtta tctgtacagg aattaccgca 3421 cgtccaagcg atgggcttaa ggcaagatgc gaagaacttg gaatcccggt tgaggtgatc 3481 ggagacgctg ctggcgcaag agactgcacg atcgcgacac gcgaaggcta tgacgcagga 3541 atggcaatct agaaaatcag aacttatcaa tcttacatat agaaaggatg atacatatga 3601 cattagaaga gagagttgaa gcattagaaa aagaattgca ggagatgaag gatattgagg 3661 caatcaagga actgaaagga aagtatttcc gctgcctgga cggaaagatg tgggatgagc 3721 tggagaccac cctgtcacca aatatcgtaa cctcttattc caacgggaaa ctggtattcc 3781 atagcccgaa ggaagttacc gattacttaa agagctcgat gccaaaagaa gagatcagca 3841 tgcatatggg ccacacgccg gagatcacca ttgacagcga gactacggct acgggcagat 3901 ggtatctgga agatagactg atctttacgg acggtaagta caaagacgta ggaatcaatg 3961 gcggcgcgtt ctatacagac aaatatgaga agatagacgg ccagtggtac atccttgaaa 4021 ccggctatgt acgaatctat gaagaacatt tcatgcgtga tccaaagatc catatcacga 4081 tgaacatgca caaataagaa tattgtaaaa gaaaggcagg agtaagagta tgaatctcgt 4141 acaagacaaa gttacgatca tcacaggcgg cacaagaggt attggattcg ccgctgccaa 4201 aatatttatc gacaatggcg caaaagtatc catcttcgga gagacgcagg aagaagtaga 4261 tacagcgctt gcacagttaa aagaacttta tccggaagaa gaggttctgg gattcgcgcc 4321 ggatcttaca tccagagacg cagttatggc agcggtaggc caggtagcac agaaatatgg 4381 cagactggat gtcatgatca acaatgcagg aattaccagc aacaacgtat tctccagagt 4441 gtctgaagaa gagttcaagc atattatgga catcaacgta acaggcgtat tcaacggcgc 4501 atggtgcgca taccagtgca tgaaggatgc caaaaagggc gttatcatca acacggcatc 4561 cgttacaggc atcttcggat cactctcagg cgtaggatat ccggccagca aggcaagcgt 4621 gatcggactc acccatggac ttggaagaga gatcatccgc aagaatatcc gtgtagtagg 4681 agtggctcct ggagttgtga acacggatat gaccaatggc aatcctccgg agatcatgga 4741 aggatatctg aaggcgcttc cgatgaagag aatgcttgag ccggaagaga tcgctaatgt 4801 atacctgttc ctggcatctg acttggcaag cggcattacg gctactacgg tcagcgtaga 4861 cggggcttac agaccataat tttaattttt actaagtaga atatgtgata tagaaaagga 4921 gatataaaaa catggctgga ataaaagatt ttccaaaatt cggagctctt gcagggctta 4981 agatacttga cagcggatct aacatcgccg gacctttagg cggaggcctt ctggcagaat 5041 gcggagcaac ggtcatccat tttgaaggac caaagaaacc tgataaccag agaggatggt 5101 acggctatcc acagaatcac cgtaatcagc tgtctatggt agcagacatc aaatctgaag 5161 aaggaagaaa gatcttcctt gatctgatca aatgggcaga tatctgggta gagtcatcca 5221 aaggcggaca gtatgacagg ctgggacttt ccgatgaagt catctgggaa gtaaatccta 5281 agattgccat cgtgcacgta tccggatatg gacagacagg agacccgtct tacgttacac 5341 gtgcatccta tgacgcagta ggccaggcat tcagcggcta tatgtcactg aacggaacaa 5401 cggaagcgct gaagatcaat ccttatctga gcgatttcgt atgcggactt accacatgct 5461 gggctatgct tgcctgctat gtaagcacca ttcttaccgg aaaaggcgaa tctgttgacg 5521 ttgcacagta cgaagcgctg gcacgtatca tggacggacg tatgatccag tacgctacag 5581 acggcgtgaa gatgccaaga accggcaata aggatgcgca ggctgccctg ttcagcttct 5641 acacctgtaa agacggacgt acgatcttta tcggaatgac tggcgcggaa gtatgtaaga 5701 gaggcttccc gatcatcgga cttccggtac ctggaaccgg agacccggac ttcccggaag 5761 gcttcacagg ctggatgatc tatactcctg taggacagag aatggaaaag gctatggaga 5821 agtatgtatc tgagcatacg atggaagaag tagaggctga gatgcaggca caccagattc 5881 catgccagag agtatacgag ctggaagact gcctgaacga tcctcactgg aaagcacgtg 5941 gaactattac ggagtgggat gacccgatga tgggacatat cacaggcctt ggactgatca 6001 acaagttcaa gagaaatcct tccgaaatct ggagaggcgc tccgctgttc ggtatggata 6061 accgcgatat cctgaaagac ctgggatatg acgatgcaaa gatcgatgaa ctctatgagc 6121 agggcatcgt caatgaattc gaccttgaca ctactatcaa acgctataga ctggatgaag 6181 taattccaca tatgagaaag aaagaggagt aa // LOCUS FDICPECDA 2371 bp ds-DNA BCT 14-AUG-1990 DEFINITION F.diplosiphon phycoerythrin-associated linker protein (cpeCD) gene, complete cds. ACCESSION M33832 KEYWORDS cpeC gene; cpeD gene; phycoerythrin-associated linker protein. SOURCE F.diplosiphon DNA. ORGANISM Fremyella diplosiphon Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. REFERENCE 1 (bases 1 to 2371) AUTHORS Federspiel,N.A. and Grossman,A.R. TITLE Characterization of the light-regulated operon encoding the phycoerythrin-associated linker proteins from the cyanobacterium Fremyella diplosiphon JOURNAL J. Bacteriol. 172, 4072-4081 (1990) STANDARD full staff_entry COMMENT Authorin copy for [1] kindly submitted by N.A.Federspiel, 24-APR-1990, for release after publication. FEATURES from to/span description pept 454 1314 phycoerythrin-associated linker protein (cpeC) pept 1355 2104 phycoerythrin-associated linker protein (cpeD) mRNA 267 > 2371 cpeCD mRNA binding 440 444 cpeC ribosomal binding site binding 1339 1343 cpeD ribosomal binding site BASE COUNT 679 a 529 c 514 g 649 t ORIGIN 1 gatctgctca agcagtagat ttattaatca aagcaggggc gacggagtga agtggggatt 61 ggggactggg gatgaggagg atgaggggga tgaggaggat gagggggatg aggggaaata 121 accattaccc attccccatt ccccattacc caatttccca tgcccattac aaatagtttg 181 tgcaaattga gtgcaaaatt ctctcatttc atcaagattt acaaatcttg atgtacatct 241 ttccgcgctc ataagaaaat gatcataaca aagcgtataa aacgcctaaa accttgagca 301 tctaggaaat cacagctttc atgcttatgg aagctggtaa tttgggttgc aacaaatcaa 361 tccaaagaat tgatgagcct ggatgagcaa tctgaaattt atttacaaat cgaaacaaat 421 ttcttaaact ttcgttaaca ggagaaacaa ttaatgccat ttggaccagc ttcacgcttg 481 ggagtcagcc tatttgatga aactcctccc gttgagtggg taccaggtcg ctcacaagaa 541 gaagcagaaa caatcattcg ggcaatctat cggcaagtat taggtaatgc ctatgtgatg 601 gaaagtgagc ggcttgctgt gcctgaatcc cagtttaagc ggggtgagtt gagcgtccgc 661 gagtttgtca gagcagtggc taaatctgaa ctatatcgtt ctcgcttttt caccagttgt 721 gcgcgctacc gagccattga actcaacttc cgccatctat tgggtcgtcc accactagat 781 ttagaagaaa tgcgctccca cagcacaatc cttgatactc aagggtttga agctgagatt 841 gattcttata tcgatggtga tgagtatcag tctacttttg gcgagaacat tgtaccttac 901 atccgaggct ataaaaccga agcgcttcag agcatggtgc aatttactca taccttccaa 961 ctggtacgag gtgcttctag cagcagcctg aagggtgact tatctggcaa ggctcctaag 1021 ctgaatgcat tagtgattca aagcacacca acagcagtaa tttcacctgc tagcgctgga 1081 gcaaccttct cgacaccacc tactggtgcc cgtacccgtc ttggagtcga tgctagtgct 1141 ggtggcaaag tttaccgcat tgaagttaca ggttatcgtg ccaaaacctt caataatatt 1201 tccaagtttc gccgttccaa tcaagtcttt ctggtgccct acgaaaagct ctctcaagag 1261 tatcaacgga ttcaccagca aggcggcgtg atcgcaagta tcactcctgt ataaattagg 1321 tgcaaactta aaaaattgag gagcagaaat tttaatggca tcccagacaa ttcttgaact 1381 ttggccctct agtagcttag aagaagttca aactattatc cgtgcagttt acaaacaggt 1441 tttaggcaac cctcatgtta tggagagtga gcggttggtg acagcagaat cacaattatg 1501 cgatcgctcc atcaccgtgc gggaatttgt ccgcagcgtt gccaagtctg atttttatcg 1561 caaccgctac ttccaatcct gcgctcccta ccgatttgta gaacttaact tcttacattt 1621 gcttggtcgc gcaccccagg atcaaagaga agtttccgaa cacatcgttc gtactgtagc 1681 tgaaggctac gatgctgaaa ttgactccta tatcgatagt agtgaatatg aagcagcctt 1741 tggtgaaaac gtagtgcctt actatcgtgg tagaagtagc gaagccaact ccaagcaagt 1801 aggcttcaac cgcatatttg cccttgatcg cggccctgcc caaattgata gtgcagttaa 1861 atcggctcaa ttggtctatg ctgttgctac taacagcgcc aacgcgatca aagcctcttc 1921 atccacagtc attggctctg gaactgaaaa acgattcaaa atcttggtgc aaggttccaa 1981 attcgacagt ccccgacgca tcagtaccac tgagtacatt gttccagcta gtaagatgac 2041 tccccaaatt cagcggatta atcgtacttc tggcaaaatc gtcagcatta ctgaaattgt 2101 ctaaccttta acagggtggg cattaatact tgtctgttct aaatgttgaa ccgattagta 2161 ttaacaactc gtaattcgta gttactaatt cgtaattacg ttacggatag ggatttagct 2221 cccacctgta acggcatgat tttatagaag tcggagactc aaaccctcag cttaattacg 2281 aattacgaat tacgaattac gaattagtaa tttaggttga gacattgccc gccccagatc 2341 attcactttt ttaatttgta attttcaaaa a // LOCUS HIVPRBAC 582 bp ds-DNA SYN 14-AUG-1990 DEFINITION Synthetic HIV type-I protease (syn BAC) gene, complete cds. ACCESSION M36450 KEYWORDS HIV type-1 protease; HIV-PR gene. SOURCE Synthetic oligonucleotide DNA, clone pET3AM. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 582) AUTHORS Cheng,Y.-S.E., McGowan,M.H., Kettner,C.A., Schloss,J.V., Erickson-Viitanen,S. and Yin,F.H. TITLE High-level synthesis of recombinant HIV-1 protease and the recovery of active enzyme from inclusion bodies JOURNAL Gene 87, 243-248 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 7 573 HIV type-I protease (BAC) BASE COUNT 120 a 125 c 170 g 167 t ORIGIN 1 gaattcatgg aagatctggc tttcctgcaa ggtaaggctc gtgagttctc ttctgaacag 61 actcgtgcta actctccgac tcgtcgtgag ctgcaggttt ggggtcgtga taacaactct 121 ccgtctgagg ctggcgccga tcgtcagggt accgtttctt ttaactttcc gcaggttact 181 ctgtggcagc gtccactggt tactattaag attggtggtc agctgaagga agctctgctg 241 gataccggtg ctgatgatac tgtactggag gagatgtctc tcccgggccg ttggaagccg 301 aagatgatcg gtggtattgg tggtttcatt aaggtacgtc agtatgatca gattctgatt 361 gagatctgtg gccataaggc gatcggtacc gttctggtgg gcccgactcc ggttaacatt 421 atcggccgta acctgctgac tcagatcggt tgtactctga actttccgat ttctccgatt 481 gagactgttc cggttaagct gaagccgggt atggatggtc cgaaggttaa gcagtggcca 541 ctgactgagg agaagattaa ggctctggtt taatagaagc tt // LOCUS HNNMTPIT 1336 bp ds-DNA ORG 14-AUG-1990 DEFINITION H.annuus mitochondrial plasmid pIT DNA. ACCESSION M36422 KEYWORDS mitochondrial plasmid. SOURCE H.annuus (strain CANP3) mitochondrial plasmid pIT DNA, clone pITC20. ORGANISM Mitochondrion Helianthus annuus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Asterales; Asteraceae; Compositae; Helianthus annuus. REFERENCE 1 (bases 1 to 1336) AUTHORS Perez,C., Dujon,B., Heizmann,P. and Berville,A. TITLE Sequence of a mitochondrial plasmid of sunflower (Helianthus annuus) and its relationship to other mitochondrial plasmids JOURNAL Plant Sci. 58, 59-69 (1988) STANDARD simple staff_entry BASE COUNT 382 a 314 c 290 g 350 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattctttt gttccctcaa caagcaacta atcacaagga aagaaaacaa aagacaaaga 61 aaggcggttt tgagacataa aaaataaatg ctaaacttgc tttttaatct caaaaggtcg 121 atcttgttgt tttgagaact caaatgcagc tatcattatc ggggtgctcc gccacccctc 181 gtttccatca tccctcgact gcagtcgagg gtacgtctcc cttccctaaa aaaagatcga 241 gctttttact tattacagaa gagagagagc accttacaga agagagacac atcgaccgtt 301 ttttacttat tacagaagag agatagcacc ttacagagag atagcacatc gaccgttttt 361 tacttattac agaagagaga gacctaaaaa ggttcaaatg agaaatatct taagtgatcg 421 tggaaactga gctttgtaaa ggagtagaga gtgtaagacg gggttgcgag tctttgcgag 481 ttgggtagtc acctctccaa cccattccgg acgttgcccc aagcccccta gcagcgctct 541 cgaaactcta caatgcgcag acaagaaggc gattgagtta acggaagcgt ctcaaaaaca 601 ttctttttga agaacaagaa gaagactcag cgctgggtaa gtggtttaag acttttcccg 661 accctcggct ccttggggag tagtgccccc aagacacagc cacgccaatc cgacgcggct 721 ccatacccct agtttagtat atagagctgt ttttcttttt ttctctgagg agcaaagaaa 781 aaagtaaaaa atttttttat ttttggacgg ccagcccggt aggaaggtga gataaatctc 841 cccctccccg gcctccccct ctgttaaggt cgagtcctta ctcaattcat cggatgaacg 901 gactcggctg gacttgttag ctgcataaaa gcatagaaca tagcccggag aatgaagatc 961 acccgaactt cattcccctg gctacaaccg gcagcaggag aagctcttct ctttgctggc 1021 caagttatcg ccatggggcc cccccccctc ccatggcttg ctttgttgct gctcttttac 1081 tgcctatagc tgacaagtcc gctctccccg acttaacacc gagaaaaaca aaaaacagct 1141 ctatatacta ggggaggaga gcggagtgag tgcaggccct tcaacgagag cattgttttg 1201 tggtccgggt tcggtattga attatgaaac tcctttctta ttgatgattt acggcctttt 1261 aaaacagttg ttcagtctca aatggtaccc tatattgatc ttgaaagaaa tcattttgtc 1321 tgtcaagatg gtcata // LOCUS HSE1GBA 3958 bp ds-DNA VRL 14-AUG-1990 DEFINITION Equine herpesvirus type 1 glycoprotein B (gB) gene, complete cds. ACCESSION M36298 KEYWORDS antigen; glycoprotein B; structural protein. SOURCE Equine herpesvirus type 1 (strain Ab1) DNA, from COS cells. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 3958) AUTHORS Bonass,W.A., Elton,D.M., Stocks J,-M., Killington,R.A., Meredith,D.M. and Halliburton,I.W. TITLE Molecular analysis of the Equine herpesvirus type-1 strain Ab1, glycoprotein B gene and its expression in COS cells JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.A.Bonass, 10-JUL-1990. Leeds University Dept. of Microbiology Leeds LS2 9JT, UNITED KINGDOM FEATURES from to/span description pept 603 3545 glycoprotein B precursor (gB) sigp 603 860 glycoprotein B signal protein matp 603 3542 glycoprotein B mRNA 475 > 3958 glycoprotein B mRNA signal 370 373 CAT box signal 453 458 TATA box signal 3554 3560 polyA signal site 2165 2166 internal cleavage site BASE COUNT 1019 a 1075 c 1010 g 854 t ORIGIN 1 gttaacgtgt atggcgaagt gctgctacat acgtacgtat cgtcctacaa cgggttttgc 61 gccaggcgcg ggttttgcgc ggcggtgagt cgagcgggta ccatcataga taaccgctct 121 agcacgtccg cgttcgactc gcatcagttc atgaaggcgg cgctgcttcg ccaccccatt 181 gaccagtcgc tcatgccgtc cataacacac aagtttttcg agctgatcaa cgggcccgtg 241 tttgacaacg ctggccacaa ctttgcgcag ccgccaaaca cggcattata ttacagcgtt 301 gaaaacgttg ggttgttacc gcatctcaag gaggaactag ctcggtttat gattactgcg 361 gctaaaggtg attggtcaat tagcgagttt caaaggtttt attgctttga gggagtgaca 421 ggtgtgacgg ccacgcagcg gctggcgtgg aaatatatcg gggagctcat cctagccgcc 481 gcagtattct cctcggtttt ccactgtgga gaggtgcgcc tcctgcgcgc agatcgtacc 541 tacccggact ccagcggcgc acagcgctgc gtgagcggca tttacataac ctacgaggcg 601 tcatgtcctc tggttgccgt tctgtcggcg gctccacatg gggcaattgg cgcggagacg 661 gtggtgattt acgacagcga cgtgttctct ctcctgtatg cagtgctcca gcagctggct 721 cctggatcgg gagccaacta ggcaatgttg gaaacttact cgccaccccc cacccgctgg 781 gaaagccggc atcatcgagg gtgggcacaa tagttctagc ctgtttgttg ctttttggaa 841 gctgtgttgt tagagccgta cccaccacgc caagcccccc aactagtact cccacttcca 901 tgtcaacgca ctcccatggg acagtagacc ctacgctgct ccccacagaa acgcccgacc 961 cactcagact ggctgtgcgc gagtccggta tactcgctga ggatggagac ttttacacct 1021 gcccaccgcc taccggatcc accgtcgtac gcatcgaacc acctagaact tgccccaagt 1081 ttgaccttgg gagaaacttc acggagggga ttgctgttat ttttaaggaa aacatcgctc 1141 cctacaaatt cagggcaaac gtatactaca aggacatcgt tgtaacacgt gtgtggaaag 1201 gatacagcca tacgtccctg tccgacagat acaatgacag ggttccggtt tcggtggagg 1261 agatcttcgg tctcatcgac agtaagggaa aatgttcgtc aaaggccgag tacctcagag 1321 ataacatcat gcaccacgcg taccacgacg acgaggacga ggtggagctt gatttggtgc 1381 cgtccaagtt tgcaactccg ggggccagag cctggcagac caccaacgat actacgtctt 1441 acgtggggtg gatgccatgg aggcactaca cgtcaacgtc tgtcaactgc atcgtcgagg 1501 aggtggaggc gcggtccgtc tacccctacg actccttcgc cctgtccacc ggtgatattg 1561 tgtacgcgtc tccgttttac ggcctgaggg ctgccgctcg catagagcac aatagctacg 1621 cgcaggacag tttcaggcaa gttgaagggt acaggccccg cgacttagac agtaaactac 1681 aagccgaaga gccggttacc aaaaatttta tcactacccc gcatgtcacc gtcagctgga 1741 actggaccga gaagaaagtc gaggcgtgta cgctgaccaa atggaaagag gtcgacgaac 1801 tcgtcaggga cgagttccgc gggtcctaca gatttactat tcgatccatc tcgtctacgt 1861 ttatcagtaa cactactcaa tttaagttgg aaagtgcccc ccttactgaa tgtgtatcca 1921 aagaagcaaa ggaagccata gactcgatat acaaaaagca gtacgagtct acgcacgtct 1981 ttagcggtga tgtggaatat tacctggcac gcggggggtt cttaattgca ttcagaccta 2041 tgctctccaa cgaactcgcc aggctgtacc tgaacgagct tgtgagatct aaccgcacct 2101 acgacctaaa aaatctattg aaccccaatg caaacaataa caataacacc acgcgaagac 2161 gcaggtctct cctgtcagta ccagaacctc agccaaccca agatggtgtg catagagaac 2221 aaattctaca tcgcttgcac aaacgagcag tggaggcaac ggcaggtacc gattcttcca 2281 acgtcaccgc caaacagctg gagctcatca aaaccacgtc gtctatcgag tttgccatgc 2341 tacagtttgc atacgatcac atccaatccc acgtcaatga aatgctaagt agaatagcaa 2401 ctgcgtggtg taccctccaa aacaaagagc ggaccctatg gaacgaaatg gtgaagatta 2461 acccgagcgc catagtctcc gcaacccttg acgagcgagt tgcagcgagg gtcctggggg 2521 acgtgatagc tataacgcac tgcgccaaaa tagagggcaa cgtgtacttg caaaactcca 2581 tgcgctcgat ggacagtaac acgtgctact cccgcccccc cgtaacattt acaattacta 2641 agaatgcaaa caacagaggg tcgatagaag gccagctggg agaggagaac gagattttca 2701 cggagcgcaa gctgatcgag ccgtgcgccc tcaatcagaa gcgctacttt aagtttggca 2761 aagagtacgt ttactacgag aactacacgt tcgtccgcaa agtgcccccc acggaaatcg 2821 aggttatcag cacgtacgtt gaactaaact tgaccctttt ggaagaccgc gagtttctgc 2881 ccctggaggt gtacacgcgg gctgagctgg aggacaccgg cctgctagac tacagcgaaa 2941 tacagcgccg caaccagctc cacgctctca ggttttacga catcgacagc gtggtcaacg 3001 tggacaatac cgcagtgatt atgcagggga tcgccagctt tttcaagggc ctgggtaaag 3061 tgggggaggc cgtgggaacg ctcgttctcg cggccgccgg cgctgttgtt tcaaccgtat 3121 ctggaatagc ttcgttttta aacaacccat ttggggggct agccatcggc ctgctggtaa 3181 tcgccggcct ggtagctgcg ttttttgctt acagatatgt aatgcagatc cgcagtaacc 3241 ccatgaaagc tctatacccc ataacaacaa aggccttgaa aaacaaagcc aaaacttcct 3301 acggccagaa cgaggaggac gatgggagcg actttgatga ggccaagctt gaagaggctc 3361 gcgaaatgat caaatacatg tctatggttt cggccctgga aaagcaggaa aagaaagcta 3421 taaagaaaaa cagtggggtt ggcctgatcg ccagtaacgt ctcaaagctg gccctgcgaa 3481 ggcgcggtcc caaatatacc cgactccaac agaacgatac catggaaaat gaaaaaatgg 3541 tttaaacatg tttaataaat attatgacac gtactcaaag tgtgacctca tatttgcata 3601 accactttct agttccggcc ccaaggatat ttaagcctag tatctccgcc gaggtttcat 3661 cctcattcac caactcacac ttagagttga cgcttcctct tgcgcctttg ctctcgccgc 3721 tcctgtgtta gcgtatactg cccaagaaat ggattctcca cgcggtatct ccacagctac 3781 cggtgatgcc cacgccgagg ccgcggtttc cccagccgcg gaaatccaga taaaaacgga 3841 agcccccgat gtagacggac cagaagccac tactgagtgt ttagaccaca cctacaccca 3901 acagacaagc gggggtgatg gcctagatgc tatcgatacg gacgatctct ggagatgg // LOCUS HSE1GPEIA 4707 bp ds-DNA VRL 14-AUG-1990 DEFINITION Equine herpes virus type 1 glycoproteins gE, gI and 10 kDa protein genes, complete cds. ACCESSION M36299 KEYWORDS glycoprotein gE; glycoprotein gI. SOURCE Equine herpesvirus type 1 (strain Ab1) DNA, clone pUC19. ORGANISM Equine herpesvirus type 1 Viridae; ds-DNA enveloped viruses; Herpesviridae; Alphaherpesvirinae. REFERENCE 1 (bases 1 to 4707) AUTHORS Elton,D.M., Bonass,W.A., Killington,R.A., Meredith,D.M. and Halliburton,I.W. TITLE DNA sequence analysis of the 4.7Kb BamHI to EcoRI fragment of the Equine herpesvirus type-1 short unique region JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.M.Elton, 10-JUL-1990. University of Leeds Dept of Microbiology Leeds LS2 9JT, UNITED KINGDOM FEATURES from to/span description pept 594 1868 glycoprotein I precursor (gI) sigp 594 701 glycoprotein I precursor (gI) matp 702 1865 glycoprotein I precursor (gI) pept 2101 3753 glycoprotein E precursor (gE) sigp 2101 2208 glycoprotein E precursor (gE) matp 2209 3750 glycoprotein E precursor (gE) pept 4478 > 4707 10 kDa protein ORF < 1 484 ORF1 ORF 3750 4140 ORF4 signal 544 550 gI TATA box signal 1924 1929 gI polyA signal signal 1955 1958 gE CAT box signal 4139 4144 gE polyA signal BASE COUNT 1204 a 1291 c 1107 g 1105 t ORIGIN 1 gaattcaact tcccacaagg agagcatatg acatggttga agttctggtt cgtctacgat 61 ggtggaaacc taccagtgca gttttatgaa gcccaggcat tcgcaagacc cgtgcctccg 121 gataaccacc ctggatttga ttctgttgag tcggagatta cacaaaataa aacagacccg 181 aaaccaggcc aggcggaccc caaacccaat cagcctttta agtggcccag catcaaacac 241 ttggccccaa gactcgatga ggtggatgag gtcatagagc ccgtaacaaa gcccccaaaa 301 acgtctaaga gcaactctac gtttgtgggc atcagcgtcg gtttgggtat cgccggccta 361 gtattggtgg gcgtcattct atacgtctgc ttgcgtcgga agaaggaact gaaaaagtct 421 gcacagaacg gcttgactcg cctacgctcg acctttaagg atgttaaata tacccagctt 481 ccgtaaacag tgttgcgtaa cctgctggga ggtgtccacg gccttaaagc ttcgcggttt 541 ggagatataa cgcacaacct acaacaaacg cgacacagca agtagtagtc gctatggcca 601 aactcactgg gatgttcagc gctgcgatat tactgtctat ggctatatgc tcaaccgcaa 661 tcatatatcg cggagaacat atgagcatgt acctaaacgc cagctctgag tttgccgtgt 721 accccactga tcagtccctt gttttggttg gccacttgct ctttctcgac ggacaacgct 781 tacccaccac caactatagt gggctgatcg aattgattca ttacaactac tccagcgttt 841 gctacactgt tatccaaacg atatcgtatg aatcatgccc gcgtgtagcc aacaatgctt 901 tcagatcgtg cctccacaaa acttctaagc actaccacga ctatttccga gtcaatgcct 961 ctgttgaaac caacgttctc ttaaacatca caaagccaca gcctacagat tccggggcgt 1021 atatccttcg cgtaaaactt gaccacgcgc caaccgcaga tgtttttgga gtttccgcct 1081 ttgtttacga tctaaaatct aaaacggtcc ccgatccaat gcccaccaca caaacggtag 1141 aacctacaac gagctatgtg tcgactccca catacgacta taccgatgac gtaaccaccg 1201 aaactgaatc cacatcaaca tctacccaac aggcgatgac ctccactcaa acccctagcg 1261 ctacatgggg aacccagcta accacagagc tgccgacaaa cgaaactgtg gttattggtc 1321 aggaggccct gttatgccat tggttccagc catcgacaag ggtgccgacc ctgtatctgc 1381 atctgttggg acgcactggc aatctcccgg aagatgttct actggtcgaa gactctgagt 1441 ttcttcgtac cacatcgcct gcacataggc cttctgcatc acccgctgac ggtgatgatt 1501 ttaaacagac aaactcaact tcccttaagg cgcgcaacaa gatcgtcgca atggtggtta 1561 tcccgaccgc gtgtgtacta atgctcctgt tggtggttgt cggtgccatc ataaacggtg 1621 ccgtgcgcaa acatttattg agttgcgcaa gccgcaggat ctaccgctcc ggacaggggg 1681 gcgcatcggc ggccgaacgg agacggctga cttgcggtcc tactttagcc gcgtcatcgg 1741 agtcgctggc cgacgataca acgtcatcac ctccaacccc caaaccttcg aagaaaacca 1801 agttggagac cgatccgctt atggaacagc tgaaccggaa actggaggcc atcaaagaag 1861 aatcatagtt gtgggggtag atggggttgg tattaaagtt tgtgtattat cgattttata 1921 tttattaaaa tttgtgaaac ataaacatct tgtgcaatgt ttacattatt tgtgattggg 1981 acggtccact gggaggtggt acaactcggg tttaaagctc tggatgtttg gtaggaaact 2041 cacagttctc cactttggcg tcaaagcaat cagacgtcta attcgaagta gaacgtcaca 2101 atggagctgt tggccgcaag tcgcgcttgt atattttttg ggctagtaac agtactcgat 2161 gcgtggggag tccaacaagt tgaactttcc gagggggctt gggctatgat cgacggaagg 2221 gacgttttaa cccctactaa cacaactact cgggtcacaa aggcctggac gtttttggaa 2281 acccctcccg gttgcgctgg cgacatatca gttaagaagg tgtgcgtgag ccatagtctg 2341 tgcgaagata acattataat aggaaagcac tgtaacctct taactgggga acatggcatt 2401 gcgttggccg agtttaacgt agtaaacgga tcgctgcgca gaacagacga tgtgtacttt 2461 gtgaatggta cagtctttcc aatccttgcc gaaacccgca gcgtcctaca aatccatagg 2521 gcaaccccct ctatcgcagg ggtttacacc ctccacgttt ccatcgacgg aatgatgaaa 2581 cactccgtcg tgctgctcac cgtcaagaag ccgcccaaac aaccgcaacc acgcttgcgc 2641 gttaagaccc cgccacccgt aaccgttcct caggttcccg taaagaccca cacggatttt 2701 gtggtgcacg gataccactc gcgcgtgtac gctgatggcg aatctttcga gctgtcggtg 2761 aacctggagt cacatatcgt agagcccagc ttcagcgcgg agattcagtg gtactatatg 2821 aatacatcat cgtcatcatg cgatctattt cgagttttcg aaacctgcat ctttcacccg 2881 acagccatgg cctgcctgca cccggaacaa cacacctgca gcttcacatc ccccatcaga 2941 gcgaccaaga tcctacaccg ggtgtatgga aactgcagcg atcatggaaa ttcgtggcct 3001 tctaggtgcc atagcactct gctgggcaat cgtctatact ttattcaacc agcacagaac 3061 agagtggacc tgttgttcaa agacactccc gcgtcggcta ccgggctgta tgtgtttgta 3121 ttattgtaca acggacatcc ggaggcgtgg acgtatacgc tgctgtcaac cgcaaatcac 3181 tttatgaatg tgcttactga cgtgacccgc ccacggctag gagagcactt ttatacggac 3241 ctcgggcaca aaatcatcac tcctcatcca tctgtagcta ccactgaaga gttgggagct 3301 tggactcgac actacctcgc ctttttgctg gttattatct gcacgtgcgc ggcgctgcta 3361 gttgcattgg tggtgtgggg ctgtattctc tacatccgaa gcaaccgtaa gccgtatgaa 3421 gtgctgaacc cctttgaaac ggtttacacg agcgttccaa gcaacgaccc ctcggacgag 3481 gtcttggtgt ttgagcgcct agcttcggac tctgacgact ccttcgactc tgattcagac 3541 gaagagttgg aatacccacc acctcccaaa ccagctccac agctcccacc ataccagttt 3601 gtagacgggg gagacgcccc tagcggcagg tccggattca aggtttggtt ccgcgataca 3661 cccgaggcgt ccccggttcc tcttcataaa ccaacgctac agggtccaga ctacagccgg 3721 gtagcgtcga agctaaagtc gatactaaaa tgagcagcaa cagcgataac acagagtgct 3781 tcgggggagt caactatgcc gagggaatgc gcaacggtaa acgcaaccct gtcagaaaca 3841 gcacctttca agagtatctc gacgcgcgta acgcgcgtta tcccagatcc ggctcaacct 3901 ccgattccga cgaggactac acaaccagat caaagtacga gtcagatgtc agcgagttta 3961 aaaaaatgat ggatctggaa actctacctc ccccaaaggc tgagccgcaa gctcagaagg 4021 ccgagcctga tgctgcgaag gaggagccag tcagcaccac tagctacatc ttaaacgaat 4081 gggtggctcc tatgattggg cattttctgg caatgtgtat gtatgagttg cttttcaaat 4141 aaaaacaaac attaacccct gtaaacatcc gtttgtctac tgtgtatgat agagttaaac 4201 ccaaccctag agagttatgt atttaatgcc ctgggacccc gcggaagtca tatatccctc 4261 ggccccctca tttgggcgca cattgcctgc ccggcggcag tcttactccc ttagctcgcc 4321 ctcttgcata agataaacta ttcccctccc agctagtttc acccaccaga ttaagcgagg 4381 ttttccctct cagcgatcac ttttcaccac cgaagaacag gccctcatcg gtttccctcc 4441 gtgttttccc atccatctat ccaaccacta cattttcatg gagaaggcgg aggctgccgc 4501 agttgttata cccctgtcag tttccaaccc cagctaccgt ggaagcggta tgtccgacca 4561 agaagtaagc gaagaacaat ctgctggaga tgcctgggtg tctgcagcaa tggcagccgc 4621 agaggcggtg gctgctgccg ctacctccac cggaattgat aacactaacg actacacgta 4681 caccgctgct tctgagaatg gggatcc // LOCUS HUMADRA2RA 2072 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human alpha-2-adrenergic receptor (alpha-2 c2) gene, complete cds. ACCESSION M34041 KEYWORDS G-protein coupled receptor protein; alpha-2-adrenergic receptor; plasma membrane protein. SOURCE Human placenta DNA, clone alpha-2 C2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2072) AUTHORS Lomasney,J.W., Lorenz,W., Allen,L.F., King,K., Regan,J.W., Yang-Feng,T.L., Caron,M.G. and Lefkowitz,R.J. TITLE Expansion of the alpha-2-adrenergic receptor family: Characterization of a human alpha-2-adrenergic receptor subtype, the gene for which is located on chromosome 2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5094-5098 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.W.Lomasney, 03-MAY-1990, for release after publication. FEATURES from to/span description pept 413 1765 alpha-2-adrenergic receptor (alpha-2 C2) /hgml_locus_uid="LY0100G" /nomgen="ADRA2RL1" /map="2" BASE COUNT 316 a 705 c 660 g 391 t ORIGIN Chromosome 2. 1 ctgcaggcgc ggctggggcg aaacgtgcga gctgagcggg cgcaaggtcc tccgcgcctc 61 ctttaagaac cggcccagcc cggcccgcgc ccccagagcg tacggcatcc gcgtggcgga 121 gggcgcgact ttctccggtc ccgggcggga cggggacggc ggcgggacaa cttgggaaac 181 ttctctgggg cggacggcag ggaccccggg caccgtggag gaggatgtag gagggcggct 241 gctggtcctg ggtgttcccg acctcctagg ccccgctcgt ccaggccatg gggctccagc 301 gccctcgcgg cgccggaggg gcgacgctct tgtctagccg agccgggcag cgctgtcgtc 361 cacggtgcgc actgggcggg cagcgctccc tctgcccacc tcccgccccg tcatggacca 421 ccaggacccc tactccgtgc aggccacagc ggccatagcg gcggccatca ccttcctcat 481 tctctttacc atcttcggca acgctctggt catcctggct gtgttgacca gccgctcgct 541 gcgcgcccct cagaacctgt tcctggtgtc gctggccgcc gccgacatcc tggtggccac 601 gctcatcatc cctttctcgc tggccaacga gctgctgggc tactggtact tccggcgcac 661 gtggtgcgag gtgtacctgg cgctcgacgt gctcttctgc acctcgtcca tcgtgcacct 721 gtgcgccatc agcctggacc gctactgggc cgtgagccgc gcgctggagt acaactccaa 781 gcgcaccccg cgccgcatca agtgcatcat cctcactgtg tggctcatcg ccgccgtcat 841 ctcgctgccg cccctcatct acaagggcga ccagggcccc cagccgcgcg ggcgccccca 901 gtgcaagctc aaccaggagg cctggtacat cctggcctcc agcatcggat ctttctttgc 961 tccttgcctc atcatgatcc ttgtctacct gcgcatctac ctgatcgcca aacgcagcaa 1021 ccgcagaggt cccagggcca agggggggcc tgggcagggt gagtccaagc agccccgacc 1081 cgaccatggt ggggctttgg cctcagccaa actgccagcc ctggcctctg tggcttctgc 1141 cagagaggtc aacggacact cgaagtccac tggggagaag gaggaggggg agacccctga 1201 agatactggg acccgggcct tgccacccag ttgggctgcc cttcccaact caggccaggg 1261 ccagaaggag ggtgtttgtg gggcatctcc agaggatgaa gctgaagagg aggaagagga 1321 ggaggaggag gaggaagagt gtgaacccca ggcagtgcca gtgtctccgg cctcagcttg 1381 cagccccccg ctgcagcagc cacagggctc ccgggtgctg gccaccctac gtggccaggt 1441 gctcctgggc aggggcgtgg gtgctatagg tgggcagtgg tggcgtcgaa gggcgcacgt 1501 gacccgggag aagcgcttca ccttcgtgct ggctgtggtc attggcgttt ttgtgctctg 1561 ctggttcccc ttcttcttca gctacagcct gggcgccatc tgcccgaagc actgcaaggt 1621 gccccatggc ctcttccagt tcttcttctg gatcggctac tgcaacagct cactgaaccc 1681 tgttatctac accatcttca accaggactt ccgccgtgcc ttccggagga tcctgtgccg 1741 cccgtggacc cagacggcct ggtgagcccg cctgcgctgc ccctgtgggt tggtgcggtg 1801 gcgccggggt caccctgctt cttgccctgc tgtgtgtggc tgcctcccct gggctttctg 1861 ctccctgccc agatcctgta ggcctcatct taggaacccc ttgggagggg tgggcagggg 1921 ggctgctagc aagggtccca gtgaagcttc cccttgccgg cttagctgtg ggggacccct 1981 tctccaccct ctccctgagc acaggccgat ggaggtggtt caaatcctct ggaacatagc 2041 caagaccagg agaagagaga gcactttctt cc // LOCUS HUMARF1BA 1816 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human ADP-ribosylation factor (ARF1) mRNA, complete cds. ACCESSION M36340 M25203 KEYWORDS ADP-ribosylation factor. SOURCE Human foreskin fibroblast, cDNA to mRNA, clone pCD2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 29 to 866) AUTHORS Bobak,D.A., Nightingale,M.S., Murtagh,J.J., Price,S.R., Moss,J. and Vaughan,M. TITLE Molecular cloning, characterization, and expression of human ADP-ribosylation factors: Two guanine nucleotide-dependent activators of cholera toxin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 6101-6105 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1816) AUTHORS Kahn,R.A., Kern,F.G., Clark,J., Gelmann,E.P. and Rulka,C. TITLE Human ADP-ribosylation factors (ARF's): A functionally conserved family of GTP-binding proteins JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer copy of sequence [1] kindly submitted by D.Bobak, 02-JUN-1989. Draft entry and computer-readable sequence for [2] kindly submitted by R.A.Kahn, 10-JUL-1990. National Cancer Institute Lab of Biol. Chem. DCT Bldg. 37, RM 5D-02 Bethesda, MD 20892 FEATURES from to/span description pept 78 623 ADP-ribosylation factor (ARF1) BASE COUNT 405 a 512 c 469 g 430 t ORIGIN 1 ggggaaaacc aacgcctggc tcggagcagc agcctctgag gtgtccctgg ccagtgtcct 61 tccacctgtc cacaagcatg gggaacatct tcgccaacct cttcaagggc ctttttggca 121 aaaaagaaat gcgcatcctc atggtgggcc tggatgctgc agggaagacc acgatcctct 181 acaagcttaa gctgggtgag atcgtgacca ccattcccac cataggcttc aacgtggaaa 241 ccgtggagta caagaacatc agcttcactg tgtgggacgt gggtggccag gacaagatcc 301 ggcccctgtg gcgccactac ttccagaaca cacaaggcct gatcttcgtg gtggacagca 361 atgacagaga gcgtgtgaac gaggcccgtg aggagctcat gaggatgctg gccgaggacg 421 agctccggga tgctgtcctc ctggtgttcg ccaacaagca ggacctcccc aacgccatga 481 atgcggccga gatcacagac aagctggggc tgcactcact acgccacagg aactggtaca 541 ttcaggccac ctgcgccacc agcggcgacg ggctctatga aggactggac tggctgtcca 601 atcagctccg gaaccagaag tgaacgcgac ccccctccct ctcactcctc ttgccctctg 661 ctttactctc atgtggcaaa cgtgcggctc gtggtgtgag tgccagaagc tgcctccgtg 721 gtttggtcac cgtgtgcatc gcaccgtgct gtaaatgtgg cagacgcagc ctgcggccag 781 gctttttatt taatgtaaat agtttttgtt tccaatgagg cagtttctgg tactcctatg 841 caatattact cagctttttt tattgtaaaa agaaaaatca actcactgtt cagtgctgag 901 aggggatgta ggcccatggg cacctggcct ccaggagtcg ctgtgttggg agagccggcc 961 acgcccttgg cttagagctg tgttgaaatc cattttggtg gttggtttta acccaaactc 1021 agtgcatttt ttaaaatagt taagaatcca agtcgagaac acttgaacac acagaaggga 1081 gaccccgcct agcatagatt tgcagttacg gcctggatgc cagtcgccag cccagctgtt 1141 cccctcggga acatgaggtg gtggtggcgc agcagactgc gatcaattct gcatggtcac 1201 agtagagatc cccgcaactc gcttgtcctt gggtcaccct gcattccata gccatgtgct 1261 tgtccctgtg ctcccacggt tcccaggggc caggctggga gcccacagcc accccactat 1321 gccgcaggcc gccctaccca ccttcaggca gcctatggga cgcaggcccc atctgtccct 1381 cggtccgcgt gtggccagag tggtccgtcg tccccaacac tcgtgctcgc tcagacactt 1441 tggcaggatg tctggggcct caccagcagg agcgcgtgca agccgggcag gcggtccacc 1501 tagacccaca gcccctcggg agcaccccac ctctgtgtgt gatgtagctt tctctccctc 1561 agcctgcaag ggtccgattt gccatcgaaa aagacaacct ctactttttt cttttgtatt 1621 ttgataaaca ctgaagctgg agctgttaaa tttatcttgg ggaaacctca gaactggtct 1681 atttggtgtc gtaggaacct cttactgctt tcaatacacg attagtaatc aactgttttg 1741 tatacttgtt ttcagttttc atttcgacaa acaagcactg taattatagc tattagaata 1801 aaatctctta actatt // LOCUS HUMARF2A 1529 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human ADP-ribosylation factor 2 (ARF2) mRNA, complete cds. ACCESSION M36341 M31890 KEYWORDS ADP-ribosylation factor 2. SOURCE Human breast cancer cell line MDA-MB 231, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 766) AUTHORS Monaco,L., Murtagh,J.J., Newman,K.B., Tsai,S.-C., Moss,J. and Vaughan,V. TITLE Selective amplification of an mRNA and related pseudogene for a human ADP-ribosylation factor, a guanine nucleotide-dependent protein activator of cholera toxin JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 66 to 1529) AUTHORS Kahn,R.A., Kern,F.G., Clark,J., Gelmann,E.P. and Rulka,C. TITLE Human ADP-ribosylation factors (ARF's): A functionally conserved family of GTP-binding proteins JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence [1] kindly submitted by L.Monaco, 02-FEB-1990. Draft entry and computer-readable sequence for [2] kindly submitted by R.A.Kahn, 10-JUL-1990. National Cancer Institute Lab of Biol. Chem. DCT Bldg. 37, RM 5D-02 Bethesda, MD 20892 FEATURES from to/span description pept 130 672 ADP-ribosylation factor 2 (ARF2) BASE COUNT 427 a 294 c 295 g 513 t ORIGIN 1 ctgcctccct ctttcttcct ccgctctttc tcttccctct cgtttagttt gcctggagct 61 tgaaaggaga aagcacgggg tcgccccaaa ccccttctgc ttctgcccat cacaagtgcc 121 actaccgcca tgggcctcac tatctcctcc ctcttctccc gactatttgg caagaagcag 181 atgcgcattt tgatggttgg attggatgct gctggcaaga caaccattct gtataaactg 241 aagttagggg agatagtcac caccattcct accattggtt ttaatgtgga aacagtagaa 301 tataagaaca tttgtttcac agtatgggat gttggtggtc aagatagaat taggcctctc 361 tggaagcatt acttccagaa tacccagggt cttatttttg tggtagatag caacgatcgt 421 gaaagaattc aggaagtagc agatgagctg cagaaaatgc ttctggtaga tgaattgaga 481 gatgcagtgc tgctactttt tgcaaacaaa caggatttgc caaatgctat ggccatcagt 541 gaaatgacag ataaactagg gcttcagtct cttcgtaaca gaacatggta tgttcaagcc 601 acttgtgcaa cacaaggaac tggtctgtat gaaggacttg actggctgtc aaatgagctt 661 tcaaaacgtt aaatgaaatt ggatatctaa ccaaggacat gtttgataaa attggtctag 721 gcttgttaca acaaaattag tttgtatctt ggttattaaa cagtatctgg gactggtttg 781 ggcagaatat taaacttatt ttgttgccaa ttattgttta ccgagtataa tgttgctatt 841 tagcaatgtg cttggtttta aagaaattct ccttgggaaa aaagtatcct cttttaattt 901 tacttcccat aagcgtaaat gcctggacat agctcttgtg aacctttaaa taaattgttt 961 gagtgttttt gagccccaga caaataatgt tttaaagtta tcccttgcta ctttactgat 1021 acctttatca ttcctgagac agtttgctaa tttaaaaatg tagcattcca tttgtattta 1081 tttctctccc ttgccaaaaa gattttctaa tactgcttgt accagccaga gaaagatcca 1141 aaacactact cagctctctt gcactgagga aatttttccc cctacattga ctcctggcct 1201 acatcagcca aacttaacct tggtggggtt tggatttgat agccaattag ttctgtgctg 1261 gttgcaaaga attgatattt agatggtttt taatactcag cagattgtct tcccatattg 1321 tgtctttttt atgttgcatg ttgcttttgt tatcagcctg attttttgct cagtatatga 1381 tagttctgct gatgttttgt ttattgggca gacatatctt cattaagagt ttttggaaaa 1441 ctcatcaaat tcgatgaata cattttcttc ataacccatt tggaattatt cctaataaaa 1501 tgataaaata cgtaaaaaaa aaggaattc // LOCUS HUMCDC25HS 2055 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human cdc25Hs mRNA, complete cds. ACCESSION M34065 KEYWORDS mitotic inducer. SOURCE Human HeLa cell line D98/AH-2, cDNA to mRNA, clone BSK1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2055) AUTHORS Sadhu,K., Reed,B.I., Richardson,H. and Russell,P. TITLE Human homolog of fission yeast cdc25 mitotic inducer is predominantly expressed in G-2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5139-5143 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Russell, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 211 1632 CDC25Hs ORF mRNA < 1 2055 cdc25Hs mRNA BASE COUNT 576 a 472 c 477 g 530 t ORIGIN 1 caggaagact ctgagtccga cgttggccta cccagtcgga aggcagagct gcaatctagt 61 taactacctc ctttccccta gatttccttt cattctgctc aagtcttcgc ctgtgtccga 121 tccctatcta ctttctctcc tcttgtagca agcctcagac tccaggcttg agctaggttt 181 tgtttttctc ctggtgagaa ttcgaagacc atgtctacgg aactcttctc atccacaaga 241 gaggaaggaa gctctggctc aggacccagt tttaggtcta atcaaaggaa aatgttaaac 301 ctgctcctgg agagagacac ttcctttacc gtctgtccag atgtccctag aactccagtg 361 ggcaaatttc ttggtgattc tgcaaaccta agcattttgt ctggaggaac cccaaaatgt 421 tgcctcgatc tttcgaatct tagcagtggg gagataactg ccactcagct taccacttct 481 gcagaccttg atgaaactgg tcacctggat tcttcaggac ttcaggaagt gcatttagct 541 gggatgaatc atgaccagca cctaatgaaa tgtagcccag cacagcttct ttgtagcact 601 ccgaatggtt tggaccgtgg ccatagaaag agagatgcaa tgtgtagttc atctgcaaat 661 aaagaaaatg acaatggaaa cttggtggac agtgaaatga aatatttggg cagtcccatt 721 actactgttc caaaattgga taaaaatcca aacctaggag aagaccaggc agaagagatt 781 tcagatgaat taatggagtt ttccctgaaa gatcaagaag caaaggtgag cagaagtggc 841 ctatatcgct ccccgtcgat gccagagaac ttgaacaggc caagactgaa gcaggtggaa 901 aaattcaagg acaacacaat accagataaa gttaaaaaaa agtatttttc tggccaagga 961 aagctcagga agggcttatg tttaaagaag acagtctctc tgtgtgacat tactatcact 1021 cagatgctgg aggaagattc taaccagggg cacctgattg gtgatttttc caaggtatgt 1081 gcgctgccaa ccgtgtcagg gaaacaccaa gatctgaagt atgtcaaccc agaaacagtg 1141 gctgccttac tgtcggggaa gttccagggt ctgattgaga agttttatgt cattgattgt 1201 cgctatccat atgagtatct gggaggacac atccagggag ccttaaactt atatagtcag 1261 gaagaactgt ttaacttctt tctgaagaag cccatcgtcc ctttggacac ccagaagaga 1321 ataatcatcg tgttccactg tgaattctcc tcagagaggg gcccccgaat gtgccgctgt 1381 ctgcgtgaag aggacaggtc tctgaaccag tatcctgcat tgtactaccc agagctatat 1441 atccttaaag gcggctacag agacttcttt ccagaatata tggaactgtg tgaaccacag 1501 agctactgcc ctatgcatca tcaggaccac aagactgagt tgctgaggtg tcgaagccag 1561 agcaaagtgc aggaagggga gcggcagctg cgggagcaga ttgcccttct ggtgaaggac 1621 atgagcccat gataacattc cagccactgg ctgctaacaa gtcaccaaaa agacactgca 1681 gaaaccctga gcagaaagag gccttctgga tggccaaacc caagattatt aaaagatgtc 1741 tctgcaaacc aacaggctac caacttgtat ccaggcctgg gaatggatta ggtttcagca 1801 gagctgaaag ctggtggcag agtcctggag ctggctctat aaggcagcct tgagttgcat 1861 agagatttgt attggttcag ggaactctgg cattcctttt cccaactcct catgtcttct 1921 cacaagccag ccaactcttt ctctctgggc ttcgggctat gcaagagcgt tgtctacctt 1981 ctttctttgt attttccttc tttgtttccc cctctttctt ttttaaaaat ggaaaaataa 2041 acactacaga atgag // LOCUS HUMDBTG 1401 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human delta-beta-thalassemia gene region DNA. ACCESSION M36375 KEYWORDS delta-beta-thalassemia. SOURCE Human 30 yr old japanese female blood cell DNA, clone Jp-thal. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1401) AUTHORS Shiolawa,S., Yamada,H., Takihara,Y., Matsunaga,E., Ohba,Y., Yamamoto,K. and Fukumaki,Y. TITLE Molecular analysis of japanese delta-beta-thalassemia JOURNAL Blood 72, 1771-1776 (1988) STANDARD simple staff_entry BASE COUNT 390 a 341 c 237 g 433 t ORIGIN 1 gagctcttta gtttaattag attccattgg tcaatttttg cttttgctgc aattgctttt 61 cacgctttca tcatgaaatc tgtgcccgtg tttatatcat gaatagtatt gccttgattt 121 ttttctaggc tttttatagt ttggggtttt tcatttaagt ctctaatagc tcatagaatc 181 tttcttcaac tcctgcaatc aaagggaaac gtaataggat tcttgtaaca gagggaactc 241 tattagcaaa catgggtcca agaacagctc cccccaccac caccaccacc acccgccccc 301 accatccctg ctcctggagc tggagtgcaa tggcacaatc acctactcct cccaggttca 361 agcaattccc ttgcctcagc ctcccgagta gctaggacta caggtacatg ccaccacacc 421 cagctaattt tttgtatttt agtagagatg gggtttcacc atgttggcca ggatggtctc 481 gatctcctga cctcgtgatc tgcccacttc agcctcccaa agtgctggga ttacaggcgt 541 gagccactgc tcccggccaa taatagcttt ttaaaaaaaa ttctgtgatt cttaaaccag 601 tcttgagttg tcctgcgcta aaactaatga gcaaatatta aactcttctt aaacagttct 661 ctttaagtgt ctctttctcc cccactgaga attcttccat gacagagcct ataacttaaa 721 catccctatt accagactac caggaacatc taaatagaca gccctcacaa caatgcttca 781 cattcattac tatcacaaca ccattattgg gtattaataa ctccccttta tgggtgagaa 841 agcagggtaa gattagataa tcgagtctca gaggaaacat acgacgttct caaattcacc 901 caacaagcca atggcagatt caggaattga gctctgttct tactccaaac gccatgctct 961 cgcaaactcc ttatttaaat ttttttttct ttttttactg tggcatgaac aaacacatgc 1021 tgatgactgt agtctatatt tggaataact attttgaacg ctacaacgat atttttgagt 1081 tgacataacg tatgaaaata cactactgag ttcatgctca catttactct gaatacttcc 1141 gcgtttgttt catatactat ctgaaggtta tacagtacct aatttattta taactgtctt 1201 caatgcacta caacactgac cttgtgcaca gaattcttgg gtctggaagg tgaggttatt 1261 tgatcctgcc ccctgctatt tgaaggatca tcccacacag cagagaatct attaatgcaa 1321 agcaagtgag agagagttat ctcaacctca ctaacttact tgcagaaagg ttctgtacta 1381 ctatagctgc ggaatcaccc c // LOCUS HUMHER3A 4975 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human epidermal growth factor receptor (HER3) mRNA, complete cds. ACCESSION M34309 KEYWORDS HER3 gene; epidermal growth factor receptor. SOURCE Human cell line MDA-MB-361, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4975) AUTHORS Plowman,G.D., Whitney,G.S., Neubauer,M.G., Green,J.M., McDonald,V.L., Todaro,G.J. and Shoyab,M. TITLE Molecular cloning and expression of another epidermal growth factor receptor-related gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 4905-4909 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.D.Plowman, 16-MAY-1990, for release after publication. FEATURES from to/span description pept 199 4227 HER3 protein precursor sigp 199 255 HER3 signal peptide matp 256 4224 HER3 protein BASE COUNT 1195 a 1309 c 1327 g 1144 t ORIGIN Chromosome 12q13. 1 ctctcacaca cacacacccc tcccctgcca tccctccccg gactccggct ccggctccga 61 ttgcaatttg caacctccgc tgccgtcgcc gcagcagcca ccaattcgcc agcggttcag 121 gtggctcttg cctcgatgtc ctagcctagg ggcccccggg ccggacttgg ctgggctccc 181 ttcaccctct gcggagtcat gagggcgaac gacgctctgc aggtgctggg cttgcttttc 241 agcctggccc ggggctccga ggtgggcaac tctcaggcag tgtgtcctgg gactctgaat 301 ggcctgagtg tgaccggcga tgctgagaac caataccaga cactgtacaa gctctacgag 361 aggtgtgagg tggtgatggg gaaccttgag attgtgctca cgggacacaa tgccgacctc 421 tccttcctgc agtggattcg agaagtgaca ggctatgtcc tcgtggccat gaatgaattc 481 tctactctac cattgcccaa cctccgcgtg gtgcgaggga cccaggtcta cgatgggaag 541 tttgccatct tcgtcatgtt gaactataac accaactcca gccacgctct gcgccagctc 601 cgcttgactc agctcaccga gattctgtca gggggtgttt atattgagaa gaacgataag 661 ctttgtcaca tggacacaat tgactggagg gacatcgtga gggaccgaga tgctgagata 721 gtggtgaagg acaatggcag aagctgtccc ccctgtcatg aggtttgcaa ggggcgatgc 781 tggggtcctg gatcagaaga ctgccagaca ttgaccaaga ccatctgtgc tcctcagtgt 841 aatggtcact gctttgggcc caaccccaac cagtgctgcc atgatgagtg tgccgggggc 901 tgctcaggcc ctcaggacac agactgcttt gcctgccggc acttcaatga cagtggagcc 961 tgtgtacctc gctgtccaca gcctcttgtc tacaacaagc taactttcca gctggaaccc 1021 aatccccaca ccaagtatca gtatggagga gtttgtgtag ccagctgtcc ccataacttt 1081 gtggtggatc aaacatcctg tgtcagggcc tgtcctcctg acaagatgga agtagataaa 1141 aatgggctca agatgtgtga gccttgtggg ggactatgtc ccaaagcctg tgagggaaca 1201 ggctctggga gccgcttcca gactgtggac tcgagcaaca ttgatggatt tgtgaactgc 1261 accaagatcc tgggcaacct ggactttctg atcaccggcc tcaatggaga cccctggcac 1321 aagatccctg ccctggaccc agagaagctc aatgtcttcc ggacagtacg ggagatcaca 1381 ggttacctga acatccagtc ctggccgccc cacatgcaca acttcagtgt tttttccaat 1441 ttgacaacca ttggaggcag aagcctctac aaccggggct tctcattgtt gatcatgaag 1501 aacttgaatg tcacatctct gggcttccga tccctgaagg aaattagtgc tgggcgtatc 1561 tatataagtg ccaataggca gctctgctac caccactctt tgaactggac caaggtgctt 1621 cgggggccta cggaagagcg actagacatc aagcataatc ggccgcgcag agactgcgtg 1681 gcagagggca aagtgtgtga cccactgtgc tcctctgggg gatgctgggg cccaggccct 1741 ggtcagtgct tgtcctgtcg aaattatagc cgaggaggtg tctgtgtgac ccactgcaac 1801 tttctgaatg gggagcctcg agaatttgcc catgaggccg aatgcttctc ctgccacccg 1861 gaatgccaac ccatgggggg cactgccaca tgcaatggct cgggctctga tacttgtgct 1921 caatgtgccc attttcgaga tgggccccac tgtgtgagca gctgccccca tggagtccta 1981 ggtgccaagg gcccaatcta caagtaccca gatgttcaga atgaatgtcg gccctgccat 2041 gagaactgca cccaggggtg taaaggacca gagcttcaag actgtttagg acaaacactg 2101 gtgctgatcg gcaaaaccca tctgacaatg gctttgacag tgatagcagg attggtagtg 2161 attttcatga tgctgggcgg cacttttctc tactggcgtg ggcgccggat tcagaataaa 2221 agggctatga ggcgatactt ggaacggggt gagagcatag agcctctgga ccccagtgag 2281 aaggctaaca aagtcttggc cagaatcttc aaagagacag agctaaggaa gcttaaagtg 2341 cttggctcgg gtgtctttgg aactgtgcac aaaggagtgt ggatccctga gggtgaatca 2401 atcaagattc cagtctgcat taaagtcatt gaggacaaga gtggacggca gagttttcaa 2461 gctgtgacag atcatatgct ggccattggc agcctggacc atgcccacat tgtaaggctg 2521 ctgggactat gcccagggtc atctctgcag cttgtcactc aatatttgcc tctgggttct 2581 ctgctggatc atgtgagaca acaccggggg gcactggggc cacagctgct gctcaactgg 2641 ggagtacaaa ttgccaaggg aatgtactac cttgaggaac atggtatggt gcatagaaac 2701 ctggctgccc gaaacgtgct actcaagtca cccagtcagg ttcaggtggc agattttggt 2761 gtggctgacc tgctgcctcc tgatgataag cagctgctat acagtgaggc caagactcca 2821 attaagtgga tggcccttga gagtatccac tttgggaaat acacacacca gagtgatgtc 2881 tggagctatg gtgtgacagt ttgggagttg atgaccttcg gggcagagcc ctatgcaggg 2941 ctacgattgg ctgaagtacc agacctgcta gagaaggggg agcggttggc acagccccag 3001 atctgcacaa ttgatgtcta catggtgatg gtcaagtgtt ggatgattga tgagaacatt 3061 cgcccaacct ttaaagaact agccaatgag ttcaccagga tggcccgaga cccaccacgg 3121 tatctggtca taaagagaga gagtgggcct ggaatagccc ctgggccaga gccccatggt 3181 ctgacaaaca agaagctaga ggaagtagag ctggagccag aactagacct agacctagac 3241 ttggaagcag aggaggacaa cctggcaacc accacactgg gctccgccct cagcctacca 3301 gttggaacac ttaatcggcc acgtgggagc cagagccttt taagtccatc atctggatac 3361 atgcccatga accagggtaa tcttgggggg tcttgccagg agtctgcagt ttctgggagc 3421 agtgaacggt gcccccgtcc agtctctcta cacccaatgc cacggggatg cctggcatca 3481 gagtcatcag aggggcatgt aacaggctct gaggctgagc tccaggagaa agtgtcaatg 3541 tgtagaagcc ggagcaggag ccggagccca cggccacgcg gagatagcgc ctaccattcc 3601 cagcgccaca gtctgctgac tcctgttacc ccactctccc cacccgggtt agaggaagag 3661 gatgtcaacg gttatgtcat gccagataca cacctcaaag gtactccctc ctcccgggaa 3721 ggcacccttt cttcagtggg tctcagttct gtcctgggta ctgaagaaga agatgaagat 3781 gaggagtatg aatacatgaa ccggaggaga aggcacagtc cacctcatcc ccctaggcca 3841 agttcccttg aggagctggg ttatgagtac atggatgtgg ggtcagacct cagtgcctct 3901 ctgggcagca cacagagttg cccactccac cctgtaccca tcatgcccac tgcaggcaca 3961 actccagatg aagactatga atatatgaat cggcaacgag atggaggtgg tcctgggggt 4021 gattatgcag ccatgggggc ctgcccagca tctgagcaag ggtatgaaga gatgagagct 4081 tttcaggggc ctggacatca ggccccccat gtccattatg cccgcctaaa aactctacgt 4141 agcttagagg ctacagactc tgcctttgat aaccctgatt actggcatag caggcttttc 4201 cccaaggcta atgcccagag aacgtaactc ctgctccctg tggcactcag ggagcattta 4261 atggcagcta gtgcctttag agggtaccgt cttctcccta ttccctctct ctcccaggtc 4321 ccagcccctt ttccccagtc ccagacaatt ccattcaatc tttggaggct tttaaacatt 4381 ttgacacaaa attcttatgg tatgtagcca gctgtgcact ttcttctctt tcccaacccc 4441 aggaaaggtt ttccttattt tgtgtgcttt cccagtccca ttcctcagct tcttcacagg 4501 cactcctgga gatatgaagg attactctcc atatcccttc ctctcaggct cttgactact 4561 tggaactagg ctcttatgtg tgcctttgtt tcccatcaga ctgtcaagaa gaggaaaggg 4621 aggaaaccta gcagaggaaa gtgtaatttt ggtttatgac tcttaacccc ctagaaagac 4681 agaagcttaa aatctgtgaa gaaagaggtt aggagtagat attgattact atcataattc 4741 agcacttaac tatgagccag gcatcatact aaacttcacc tacattatct cacttagtcc 4801 tttatcatcc ttaaaacaat tctgtgacat acatattatc tcattttaca caaagggaag 4861 tcgggcatgg tggctcatgc ctgtaatctc agcactttgg gaggctgagg cagaaggatt 4921 acctgaggca aggagtttga gaccagctta gccaacatag taagaccccc atctc // LOCUS HUMMEHA 959 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human microsomal epoxide hydrolase (HmEH) mRNA, partial cds. ACCESSION M36374 KEYWORDS microsomal epoxide hydrolase. SOURCE Human liver, cDNA to mRNA, clone HmEH7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 959) AUTHORS Craft,J.A., Jackson,M.R. and Burchell,B. TITLE Partial nucleotide sequence of a cloned cDNA for human liver microsomal epoxide hydrolase JOURNAL Biochem. Soc. Trans. 15, 708-709 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 959 microsomal epoxide hydrolase (AA at 3) (EC 3.3.2.3) BASE COUNT 217 a 272 c 251 g 219 t ORIGIN 1 cttcagtgct gggctttgcc atctactggt tcatctcccg ggacaaagag gaaactttgc 61 cacttgaaga tgggtggtgg gggccaggca cgaggtccgc agccagggag gacgacagca 121 tccgcccttt caaggtggaa acgtcagatg aggagatcca cgacttacac cagaggatcg 181 ataagttccg tttcacccca cctttggagg acagctgctt ccactatggc ttcaactcca 241 actacctgaa gaaagtcatc tcctactggc ggaatgaatt tgactggaag aagcaggtgg 301 agattctcaa caagtaccct cacttcaaga ctaaaattga agggctggac atccacttca 361 tccacgtgaa gcccccccag ctgcccgcag gccatacccc gaagcccttg ctgatggtga 421 acggctggcc cggctctttc tacgagtttt ataagatcat cccactcctg actgacccca 481 agaaccatgg cctgagcgac gagcacgttt ttgaagtcat ctgcccttcc atccctggct 541 atggcttctc agaggcatcc tccaagaagg ggttcaactc ggtggccacc gccaggatct 601 tttacaagct gatgctgcgg ctgggcttcc aggaattcta cattcaagga ggggactggg 661 ggtccctgat ctgcactaat atggcccagc tggtgcccag ccacctgaaa ggcctgcact 721 tgaacatggc tttggtttta agcaacttct ctaccctgac cctcctcctg ggacagcgtt 781 tcgggaggtt tcttggcctc actgagaggg atgtggagct gctgtacccc gtcaaggaga 841 aggtattcta cagcctgatg agggagagcg gctacatgca catccagtgc accaagcctg 901 acaccgtagg ctctgctctg aatgactctc ctgtgggtct ggctgcctat attctagag // LOCUS HUMRNPSMBA 847 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human small nuclear ribonucleoprotein particle SmB mRNA, 3' end. ACCESSION M34081 KEYWORDS lupus autoantigen; small nuclear ribonucleoprotein particle. SOURCE Human HeLa cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 847) AUTHORS Elkon,K.B., Hines,J.J., Chu J,-L. and Parnassa,A.P. TITLE Epitope mapping of recombinant HeLa SmB and B' peptides obtained by the polymerase chain reaction JOURNAL J. Immunol. 145, 636-643 (1990) STANDARD full staff_entry COMMENT Authorin copy for [1] kindly submitted by K.B.Elkon 23-APR-1990, for release after publication. FEATURES from to/span description pept < 1 675 small nuclear ribonucleoprotein particle (SmB) mRNA < 1 847 snRNP SmB mRNA BASE COUNT 170 a 251 c 252 g 174 t ORIGIN 1 aagatgctgc agcatattga ttacaggatg aggtgcatcc tgcaggacgg ccggatcttc 61 attggcacct tcaaggcttt tgacaagcac atgaatttga tcctctgtga ctgtgatgag 121 ttcagaaaga tcaagccaaa gaactccaaa caagcagaaa gggaagagaa gcgagtcctc 181 ggtctggtgc tgctgcgagg ggagaatctg gtctcaatga cagtagaggg acctcctccc 241 aaagatactg gtattgctcg agttccactt gctggagctg ccgggggccc agggatcggc 301 agggctgctg gcagaggaat cccagctggg gttcccatgc cccaggctcc tgcaggactt 361 gctgggccag tccgtggggt tggcgggcca tcccaacagg tgatgacccc acaaggaaga 421 ggtactgttg cagccgctgc agctgctgcc acagccagta ttgccggggc tccaacccag 481 tacccacctg gccgtggggg tcctccccca cctatgggcc gaggagcacc ccctccaggc 541 atgatgggcc cacctcctgg tatgagacct cctatgggtc ccccaatggg gatcccccct 601 ggaagaggga ctccaatggg catgccccct ccgggaatgc ggcctcctcc ccctgggatg 661 cgaggccttc tttgaccctt ggccacagag tatggaagta gctccgcaga ggcgtgggct 721 cgattcctca gggccacgtt accacagacc tgtttgtttc ttatgctgtt gttcgtggag 781 tctcatggga ttgtctggtt tccttacagg gccccctccc ccgggaatgc gcccaccaag 841 gccctag // LOCUS HUMRNPSMBB 702 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human small nuclear ribonucleoprotein particle SmB' mRNA, 3' end. ACCESSION M34082 KEYWORDS lupus autoantigen; small nuclear ribonucleoprotein particle. SOURCE Human HeLa cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 702) AUTHORS Elkon,K.B., Hines,J.J., Chu J,-L. and Parnassa,A.P. TITLE Epitope mapping of recombinant HeLa SmB and B' peptides obtained by the polymerase chain reaction JOURNAL J. Immunol. 145, 636-643 (1990) STANDARD full staff_entry COMMENT Authorin copy for [1] kindly submitted by K.B.Elkon 23-APR-1990, for release after publication. FEATURES from to/span description pept < 1 702 small nuclear ribonucleoprotein particle (SmB') BASE COUNT 147 a 215 c 212 g 128 t ORIGIN 1 aagatgctgc agcatattga ttacaggatg aggtgcatcc tgcaggacgg ccggatcttc 61 attggcacct tcaaggcttt tgacaagcac atgaatttga tcctctgtga ctgtgatgag 121 ttcagaaaga tcaagccaaa gaactccaaa caagcagaaa gggaagagaa gcgagtcctc 181 ggtctggtgc tgctgcgagg ggagaatctg gtctcaatga cagtagaggg acctcctccc 241 aaagatactg gtattgctcg agttccactt gctggagctg ccgggggccc agggatcggc 301 agggctgctg gcagaggaat cccagctggg gttcccatgc cccaggctcc tgcaggactt 361 gctgggccag tccgtggggt tggcgggcca tcccaacagg tgatgacccc acaaggaaga 421 ggtactgttg cagccgctgc agctgctgcc acagccagta ttgccggggc tccaacccag 481 tacccacctg gccgtggggg tcctccccca cctatgggcc gaggagcacc ccctccaggc 541 atgatgggcc cacctcctgg tatgagacct cctatgggtc ccccaatggg gatcccccct 601 ggaagaggga ctccaatggg catgccccct ccgggaatgc ggcctcctcc ccctgggatg 661 cgagggcccc ctcccccggg aatgcgccca ccaaggccct ag // LOCUS HUMTPMYOC 1191 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human skeletal muscle beta-tropomyosin mRNA, complete cds. ACCESSION X06825 M36268 KEYWORDS actin-binding protein; beta-tropomyosin. SOURCE Human skeletal muscle, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1191) AUTHORS Liautard,J.P. JOURNAL Unpublished (1988) CRBM du CNRS, 34033 Montpelier Cedex, France STANDARD simple automatic REFERENCE 2 (bases 1 to 1191) AUTHORS Widada,J.S., Ferraz,C., Capony,J.-P. and Liautard,J.-P. TITLE Complete nucleotide sequence of the adult skeletal isoform of human skeletal muscle beta-tropomyosin JOURNAL Nucleic Acids Res. 16, 3109-3109 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 107 961 beta-tropomyosin mRNA < 1 1191 beta-tropomyosin mRNA BASE COUNT 321 a 320 c 368 g 182 t ORIGIN 1 cccgctccgt cctcctcgcc tgccaccggt gcacccagtc cgctcaccca gcccagtccg 61 tccggtcctc accgcctgcc ggccggccca ccccccaccg caggccatgg acgccatcaa 121 gaagaagatg cagatgctga agctggacaa ggagaacgcc atcgaccgcg ccgagcaggc 181 cgaagccgac aagaagcaag ctgaggaccg ctgcaagcag ctggaggagg agcagcaggc 241 cctccagaag aagctgaagg ggacagagga tgaggtggaa aagtattctg aatccgtgaa 301 ggaggcccag gagaaactgg agcaggccga gaagaaggcc actgatgctg aggcagatgt 361 ggcctccctg aaccgccgca ttcagctggt tgaggaggag ctggaccggg cccaggagcg 421 cctggctaca gccctgcaga agctggagga ggccgagaag gcggctgatg agagcgagag 481 aggaatgaag gtcatcgaaa accgggccat gaaggatgag gagaagatgg aactgcagga 541 gatgcagctg aaggaggcca agcacatcgc tgaggattca gaccgcaaat atgaagaggt 601 ggccaggaag ctggtgatcc tggaaggaga gctggagcgc tcggaggaga gggctgaggt 661 ggccgagagt aaatgtgggg acctagagga ggagctgaaa attgttacca acaacttgaa 721 atccctggag gcccaggcgg acaagtattc caccaaagaa gataaatatg aagaggagat 781 caaactgttg gaggagaagc tgaaggaggc tgagacccga gcagagtttg ccgagaggtc 841 tgtggcaaag ttggagaaaa ccatcgatga cctagaagat gaagtctatg cccagaagat 901 gaagtacaag gccattagcg aggaactgga caacgcactc aatgacatca cctccctctg 961 agccccacgc ccagcgtgcc acctcagctc tcttctctcc tctcctttcc attctctcta 1021 tggggagggg agagcaggca ggaggagcag aaattgccaa cattgcacag ccaggctggg 1081 agcagcctag ggagagcccc catcatgccc accacccact ctggcactgg cttcatcctt 1141 tacctatccc cttccaccct cctttgcttg cttaataaat tctgaacttg g // LOCUS HUMTRHA 1322 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human thyroid releasing hormone (TRH) mRNA, complete cds. ACCESSION M36317 KEYWORDS thyroid releasing hormone. SOURCE Human brain, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1322) AUTHORS Mandel,G. and Goodman,R.H. TITLE Using the brain to screen cloned genes JOURNAL Trends Neurosci. 10, 101-104 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 103 870 thyroid releasing hormone precursor (TRH) sigp 103 174 thyroid releasing hormone signal peptide matp 175 867 thyroid releasing hormone BASE COUNT 315 a 344 c 372 g 291 t ORIGIN 1 tccttggatt cgggagtatt gcaaactcta cccagccagt ttgcactctt cagctcagca 61 tcttggaaag ctctgcagag tctccacttc gcagactcca ggatgccggg accttggttg 121 ctgctggctc tggctttgat cttcacccta actggtatcc ctgaatcctg cgccttgccg 181 gaggcagccc aggaggaagg tgcagtgact cctgaccttc ctggcctgga gaatgttcag 241 gtccggccag aacgtcgatt cttgtggaaa gacctccagc gggtgagagg ggacctcggt 301 gctgccttag actcctggat cacaaaacgc cagcatccag gcaaaaggga ggaggaggaa 361 aaagacattg aagctgaaga gaggggagac ttgggagaag ggggagcctg gagactccac 421 aaacgacagc accccggccg acgtgccaac caggacaagt attcatgggc agatgaggag 481 gacagtgact ggatgccacg gtcctggtta ccagatttct ttctggattc ctggttctca 541 gatgtccccc aagtcaagcg gcagcaccct ggcaggcgat ccttcccctg gatggagtct 601 gatgtcacca agaggcaaca tccaggccgg aggttcatag atcccgagct ccaaagaagc 661 tgggaagaaa aagagggaga gggtgtctta atgcctgaga aacgccagca tcctggcaaa 721 agggcattgg gtcatccctg tgggccccag gggacttgtg gtcaaacagg cctgctccag 781 cttctaggtg acctgagcag gggtcaggag accctggtga agcaaagccc acaagtggaa 841 ccctgggaca aggaacctct ggaggagtaa ggccagagtc aggctttagg tctaggatga 901 tgtaagccct gtattcccta tcctgttccc ttcactagct gtcctctctt agatgctaac 961 cctgggccct ctgtacatcg tccacccaaa ccccttcctt accgacttca gagactttag 1021 aaagccagtc aggaagttaa aaccctactt atcccttcaa gcaagggggt gggagtcaca 1081 ccccttcagc actggccaag atggttcttt cacaccttct aagggacctc ctcagaaagg 1141 aagggtagaa ttgaaatgtt ttggtgttaa aacttctgta atctgcccca tgtggtaaga 1201 ctgacctggt atagcttcag cgcatcctcc aaggttgggg tccctgagca gtttgggaga 1261 tgtttagata tgtccttggg gtggggggac gcatcttgta tataaccctg ctcttgggaa 1321 tc // LOCUS MAARNA1A 163 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-1. ACCESSION M35975 KEYWORDS RNA-1 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-S) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 163) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 103 > 163 115 kDa protein ORF 92 > 163 ORF1 ORF 107 > 163 ORF2 BASE COUNT 39 a 38 c 24 g 62 t ORIGIN 5' end of RNA-1. 1 ggtttttatc ttacacacgc ttgtgcaaga tagttaatcc atttattttt tccttgtatt 61 ttccacagta ttacgttcat tcaatactgt gatgaattca ctatgaatgc tgacgcccta 121 tccaccgatt cctgccttag tatgcgagaa cctttatccc atg // LOCUS MAARNA1B 115 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-1. ACCESSION M35976 KEYWORDS RNA-1 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-B) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 115) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 101 > 115 115 kDa protein ORF 105 > 115 ORF1 BASE COUNT 29 a 25 c 18 g 43 t ORIGIN 5' end of RNA-1. 1 ggtttttatc ttacacacgc ttgtgcaaga tagttaatcc atttattttc cttgcgcttt 61 ccacagcatt acgttcattc aatactgtga agaattcatt atgaatgctg acgcc // LOCUS MAARNA2A 108 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-2. ACCESSION M36389 KEYWORDS RNA-2 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-S) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 108) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 56 > 108 100 kDa protein BASE COUNT 26 a 19 c 19 g 44 t ORIGIN 5' end of RNA-2. 1 ggtttttatc ttttcgcgat tcaaaagata agtttcagag tctaatcttt tcaatatgtt 61 cactcttttg agatgtcttg gatttggcgt caacgacctt accgatat // LOCUS MAARNA2B 109 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-2. ACCESSION M36390 KEYWORDS RNA-2 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-B) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 109) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 57 > 109 100 kDa protein BASE COUNT 25 a 19 c 20 g 45 t ORIGIN 5' end of RNA-2. 1 ggtttttatc ttttcgcgat tgaaaagata agtttcctga gtctaatctt ttcaatatgt 61 tcactctttt gagatgtctt ggatttggcg tcaacgacct taccgatat // LOCUS MAARNA3B 305 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-S) 5' end of RNA-3. ACCESSION M36391 KEYWORDS RNA-3 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-S) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 305) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 259 > 305 35 kDa protein BASE COUNT 87 a 57 c 48 g 113 t ORIGIN 5' end of RNA-3. 1 ggttttcatc ttacacacgc ttgtgcaaga tagttaatca ttccaattca actcaattaa 61 cgttttttac agtgtaattc gtacttttcg taagtaagtt tctgtaaaag cgtttcttgt 121 tttaatttgg cctaacacgt aattcgtact cttcgtgagt aagttgtgtt agccatacct 181 atcctttaaa ttcctgtcaa tttaaaaaga aaatcattcc cattcgcgta attcgtactc 241 ttcgtgagta agttgcaaat ggagaataca aaaacaaatg cctcgagttc tggaatgttt 301 tcttc // LOCUS MAARNA3C 290 bp ss-RNA VRL 14-AUG-1990 DEFINITION Alfalfa mosaic virus (strain AlMV-B) 5' end of RNA-3. ACCESSION M36392 KEYWORDS RNA-3 leader sequence. SOURCE Alfalfa mosaic virus (strain AlMV-B) genomic RNA. ORGANISM Alfalfa mosaic virus Viridae; ss-RNA nonenveloped viruses; Alfamoviridae. REFERENCE 1 (bases 1 to 290) AUTHORS Ravelonandro,M., Godefroy-Colburn,T. and Pinck,L. TITLE Structure of the 5'-terminal untranslated region of the genomic RNAs from two strains of alfalfa mosaic virus JOURNAL Nucleic Acids Res. 11, 2815-2826 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 244 > 290 35 kDa protein BASE COUNT 90 a 52 c 41 g 107 t ORIGIN 5' end of RNA-3. 1 ggttttaata ccattttcaa aatattccaa ttcaactcaa ttaacccttt tacagtgtaa 61 ttcgtacttt tcgtaagtaa gtttctgtaa aagcgtttct tgttttaatt tggcctaaca 121 cgtaattcgt actcttcgtg agtaagttgt gttagccata tctatccttt aaatttttgt 181 caaatttaaa aaagaaaatc attcccattc gcgtaattcg tactcttcgt gagtaagttg 241 caaatggaga atacaaaaac aaatgcctcg agaactggaa tatcttcttc // LOCUS MBFRRDA 1476 bp ds-DNA RNA 14-AUG-1990 DEFINITION M.formicicum 16S rRNA. ACCESSION M36508 KEYWORDS 16S ribosomal RNA. SOURCE M.formicicum (strain DSM 1312) rRNA. ORGANISM Methanobacterium formicicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 1476) AUTHORS Lechner,K., Wich,G. and Boeck,A. TITLE The nucleotide sequence of the 16S rRNA gene and flanking regions from Methanobacterium formicicum: The phylogenetic relationship between methanogenic and halophilic archaebacteria JOURNAL Syst. Appl. Microbiol. 6, 157-163 (1985) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 1476 16S rRNA BASE COUNT 348 a 355 c 472 g 301 t ORIGIN 1 agtccgtttg atcctggcgg aggccactgc tattgggttt cgattaagcc atgcaagtcg 61 aacgatcctt cggggtcgtg gcgtacggct cagtaacacg tggataacct aaccttagga 121 ctgggataac cctgggaaac tggggataat accggatatg tagggctgcc tggaatggtt 181 ccctattgaa atgttccgac gcctaaggat ggatctgcgg cagattaggt agttggcggg 241 gtaaatgccc accaagccag taatctgtac gggttgtgag agcaagagcc cggagatgga 301 acctgagaca aggttccagg ccctacgggg cgcagcaggc gcgaaacctc cgcaatgcac 361 gaaagtgcga cgggggaaac ccaagtgcca ctcttaacgg ggtggctttt cttaagtgta 421 aaaagctttt ggaataagag ctgggcaaga ccggtgccag ccgccgcggt aacaccggca 481 gctcaagtgg tggccgtttt tattgggcct aaagcgttcg tagccggctt gataagtctc 541 tggtgaaatc tcacggctta accgtgagaa ttgctggaga tactattagg cttgaggccg 601 ggagaggtta gcggtactcc cggggtaggg gtgaaatcct ataatcccgg gaggaccacc 661 tgtggcgaag gcggctaact ggaacgggcc tgacggtgag taacgaaagc caggggcgcg 721 aaccggatta gatacccggg tagtcctggc cgtaaacgat gtggacttgg tgttgggatg 781 gctccgagct gccccagtgc cgaagggaag ctgttaagtc caccgcctgg gaagtacggt 841 cgcaagactg aaacttaaag gaattggcgg gggagcacca caacgcgtgg agcctgcggt 901 ttaattggat tcaacgccgg acatctcacc aggggcgaca gcagaatgat agccaggttg 961 atgaccttgc ttgacaagct gagaggaggt gcatggccgc cgtcagctcg taccgtgagg 1021 cgtcctgtta agtcaggcaa cgagcgagac ccacgccctt agttaccagc ggatccttcg 1081 ggatgccggg cacactaagg ggaccgccag tgataaactg gaggaaggag tggacgacgg 1141 taggtccgta tgccccgaat cccctgggct acacgcgggc tacaatggtt aggacaatgg 1201 gttccgacac tgaaaggtgg aggtaatctc ctaaacctgg ccttagttcg gattgagggc 1261 tgtaactcgc cctcatgaag ctggaatgcg tagtaatcgc gtgtcataac cgcgcggtga 1321 atacgtccct gctccttgca cacaccgccc gtcacgccac ccaaaaaggg tttggatgag 1381 gccatagtct ttggttatgg tcgaatctag gttctttgag gagggcgaag tcgtaacaag 1441 gtagccgtag gggaacctgc ggctggatca cctcct // LOCUS MSHLTRERA 190 bp ss-RNA VRL 14-AUG-1990 DEFINITION Harvey murine sarcoma virus LTR 75-bp direct-repeat enhancer region. ACCESSION M26322 KEYWORDS enhancer region; long terminal repeat. SOURCE Harvey murine sarcoma virus, cDNA to viral RNA, clone pM14-1. ORGANISM Harvey murine sarcoma virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Oncovirinae; Type C oncovirus group; Mammalian type C oncoviruses; Murine sarcoma viruses. REFERENCE 1 (bases 1 to 190) AUTHORS Ostrowski,M.C., Huang,A.L., Kessel,M., Wolford,R.G. and Hager,G.L. TITLE Modulation of enhancer activity by the hormone responsive regulatory element from mouse mammary tumor virus JOURNAL EMBO J. 3, 1891-1899 (1984) STANDARD simple staff_entry FEATURES from to/span description rpt 21 95 75 bp direct-repeat enchancer rpt 96 170 75 bp direct-repeat enchancer BASE COUNT 31 a 54 c 48 g 57 t ORIGIN 1 ggctggaccg catctgggga ccatctgttc ttggccctga gccggggcag gaactgtcta 61 ccacagatat cctgtttggc ccatattcag ctgttctatc tgttcttggc cctggagccg 121 ggcaggaact gtctaccaca gatatcctgt ttggcccata ttcagctgtt tctctgttcc 181 tgaccttgat // LOCUS MUSKERDA 1381 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse keratin D mRNA, complete cds. ACCESSION M36376 KEYWORDS keratin D. SOURCE Mouse F9 teratocarcinoma stem cell, cDNA to mRNA, clone B53. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1381) AUTHORS Alonso,A., Weber,T. and Jorcano,J.L. TITLE Cloning and characterization of keratin D, a murine endodermal cytoskeletal protein induced during in vitro differentiation of F9 teratocarcinoma cells JOURNAL Roux's Arch. Dev. Biol. 196, 16-21 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 50 1321 keratin D mRNA < 1 1381 keratin D mRNA site 28 42 TCTCC motif BASE COUNT 366 a 373 c 401 g 241 t ORIGIN 1 aactcctgtt ctggtctctc gcttcgctct cctctcctct ccagacaaga tgagcttcac 61 aacccgctcc accaccttct ccaccaacta ccggtccctg ggctctgtgc ggactcccag 121 ccagcgggtc cggcctgcca gcagcgcagc cagcgtctat gcaggtgctg ggggctccgg 181 gtcacggata tccgtgtccc gctccgtctg gggtggctct gtggggtccg caggcctggc 241 gggaatgggt ggaatccaga ccgagaagga gaccatgcaa gacctgaacg atcgcctggc 301 cagctaccta gacaaggtga agagtctgga aactgagaac aggagactgg agagcaaaat 361 ccgggaacat ctggagaaga aggggcccca gggcgtcaga gactggggcc actacttcaa 421 gatcatcgaa gacctgaggg ctcagatcct tgcgaattct gtggacaatg cccgcatcgt 481 cttgcagatc gacaatgccc gccttgccgc cgatgacttt agagtcaagt atgagacaga 541 actagccatg cgccagtctg tggagagcga catccatgga ctccgcaagg tggtagatga 601 caccaacatc acaaggctgc agctggagac agaaatcgag gcactcaagg aagaacttct 661 gttcatgaag aagaatcatg aagaggaagt ccaaggtctg gaagcccaga ttgccagctc 721 tggattgact gtggaagtgg atgcccccaa atctcaggac ctcagcaaga tcatggcgaa 781 catccgcgcc cagtatgaag cgctgggcca gaagaaccgc gaggaactgg acaagtactg 841 gtctcagcag attgaggaga gtaccacagt tgtcaccacc aagtctgccg aaatcaggga 901 cgctgagacc acactcacgg agctgagacg caccctccag accttggaga ttgacttgga 961 ctccatgaaa aaccagaaca tcaacttgga gaacagcctc ggggatgtgg aggcccgata 1021 caaggcacag atggagcagc tcaatggggt ccttctgcat ctggagtcag agctggcaca 1081 aactcgggca gagggccagc gccaggccca ggaatatgaa gccctcttga acatcaaggt 1141 gaagcttgag gcagagattg ccacctaccg ccgcttgctg gaggatggag aagatttcag 1201 tctcaacgat gccctagact cgagcaactc catgcaaact gtgcagaaga caactacccg 1261 taagatcgtg gatggcagag tggtgtccga gactaatgac accagagttc tgaggcactg 1321 aggcagagaa ggagggaacc cctaggaact gagggaccaa taaaagttga gagctcactg 1381 g // LOCUS MUSMBPMLD1 792 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse myelin basic protein (shi-mld) gene, exon 1. ACCESSION M36273 M36274 KEYWORDS myelin basic protein; shi-mld gene. SEGMENT 1 of 2 SOURCE Mouse mld mutated DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 792) AUTHORS Okano,H., Tamura,T., Miura,M., Aoyama,A., Ikenaka,K., Oshimura,M. and Mikoshiba,K. TITLE Gene organization and transcription of duplicated MBP genes of myelin deficient (shi-mld) mutant mouse JOURNAL EMBO J. 7, 77-83 (1988) STANDARD simple staff_entry FEATURES from to/span description mRNA 713 > 792 shi-mld mRNA signal 623 628 GC box signal 630 634 CAAT box signal 680 685 TATA box BASE COUNT 194 a 234 c 206 g 158 t ORIGIN Chromosome 18. 1 cctccagcgg aatccgggag agggattgct cagtgccctg cttccctgga ctgtaagctg 61 cagaaagatg tgggaagtcc tgttctccac tgagaacact aaaagcacct tttgtcaaac 121 gaccgcttca catctggggc ttgtgcactg gtggcctttt aaaccagaga caacccacaa 181 gatacctaac ctgcggggct ctctggtaca gtgagcaact caggaaatgc tttggcttga 241 ttgctgtggg ctctcaggcc atcgccctct ggagtggttc ttttaatgag aacctgaaga 301 ttggcccctg agccatgtat accaagcaag ctcaatccag gttagctccc tctggttggg 361 gcaagctaac gtgctccttg ggccccgcgc gtaactgtgc gttttatagg agacagctag 421 ttcaagaccc caggaagaaa gcggctttgt ccccctctag gcctcgtaca ggcccacatt 481 catatctcat tgttgttgca ggggaggcag atgcgatcca gaacaatggg acctcggctg 541 aggacacggc ggtgacagac tccaagcaca cagcagaccc aaagaataac tggcaaggcg 601 cccacccagc tgacccaggg aaccgccccc acttgatccg cctcttttcc cgagatgccc 661 cgggaaggag gacaacacct tcaaagacag gccctcagag tccgacgagc ttcagaccat 721 ccaagaagac cccacagcag cttccggagg cctggatgtc atggcatcac agaagagacc 781 ctcacagcga tc // LOCUS MUSMBPMLD2 200 bp ds-DNA ROD 14-AUG-1990 DEFINITION Figure 7. Nucleotide sequences near the inverted exon 3. ACCESSION M36275 KEYWORDS myelin basic protein; shi-mld gene. SEGMENT 2 of 2 SOURCE Mouse mld mutated DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 200) AUTHORS Okano,H., Tamura,T., Miura,M., Aoyama,A., Ikenaka,K., Oshimura,M. and Mikoshiba,K. TITLE Gene organization and transcription of duplicated MBP genes of myelin deficient (shi-mld) mutant mouse JOURNAL EMBO J. 7, 77-83 (1988) STANDARD simple staff_entry FEATURES from to/span description pept / 43 / 144 shi-mld myelin basic protein, exon 3 (AA at 43) pre-msg < 1 > 200 shi-mld mRNA and introns IVS < 1 42 shi-mld intron B IVS 145 > 200 shi-mld intron C BASE COUNT 54 a 62 c 40 g 44 t ORIGIN Chromosome 18; about 11 kbp after segment 1. 1 ctacgccctc tccatcctca gctgctcgct tctctctttc aggactcaca cacgagaact 61 acccattatg gctccctgcc ccataattcg cagcacggcc ggacccaaga tgaaaaccca 121 gtagtccatt tcttcaagaa cattgtaagt gactgtgcac aggggaacca agaaatcatc 181 aagggaaagg ttgtggatcc // LOCUS MUSMIGP4A 1284 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse monokine induced by gamma interferon (MIG) mRNA, complete cds. ACCESSION M34815 KEYWORDS MIG protein; gamma interferon-induced; monokine induced by gamma interferon; platelet factor 4-related protein; secreted protein. SOURCE Mouse (strain BAB/14) macrophage cell line RAW 264.7, cDNA to mRNA, clone 119/24. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1284) AUTHORS Farber,J.M. TITLE A macrophage mRNA selectively induced by gamma-interferon encodes a new member of the platelet factor 4 family of cytokines JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5238-5242 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.M.Farber, 04-JUN-1990, for release after publication. FEATURES from to/span description pept 95 475 gamma interferon induced monokine precursor (MIG) sigp 95 157 gamma interferon induced monokine signal peptide matp 158 472 gamma interferon induced monokine signal 6 16 TATA region BASE COUNT 401 a 316 c 247 g 320 t ORIGIN 1 tttcctaaat aaatatgatc cccaagaaca tgctctctaa agacattctc ggacttcact 61 ccaacacagt gactcaatag aactcagctc tgccatgaag tccgctgttc ttttcctttt 121 gggcatcatc ttcctggagc agtgtggagt tcgaggaacc ctagtgataa ggaatgcacg 181 atgctcctgc atcagcacca gccgaggcac gatccactac aaatccctca aagacctcaa 241 acagtttgcc ccaagcccca attgcaacaa aactgaaatc attgctacac tgaagaacgg 301 agatcaaacc tgcctagatc cggactcggc aaatgtgaag aagctgatga aagaatggga 361 aaagaagatc aaccaaaaga aaaagcaaaa gagggggaaa aaacatcaaa agaacatgaa 421 aaacagaaaa cccaaaacac cccaaagtcg tcgtcgttca aggaagacta cataagagac 481 cattacttta ccaacaagca ccctgaatct taatgggttt tagattgtac tgaaaagcct 541 tccctggcag agcagccttt aatacatagg cttttaatac attaactcaa ctacaaaaca 601 taaagtgtta atttgaaatt ataactaact ttaggaagtt aattgcaaaa ctccaatagt 661 aacaattgct agaggcaaaa actctgtgtt ctacacagcc aacaaaattt catcacgccc 721 ttgagcctag tcgtgataac atcagatctg ggcaagtgtc cctttccttc atagctatcc 781 aatgcacaac agctgtctgg cttccagagc cacacatttg gcagcctccg gagacttctg 841 aggctcacgt caccaagtcc caggcctgtc tgtttgctgg tgagctagat agacctcacc 901 aagctggaga ggccctcggc agctgcattt gggtcagcct agagcccctg cacacattgt 961 gtctcagaga tggtgctaat ggttttgggg ttctacagtg gagaccacca gagttggcct 1021 tcagaacctc ccacgtagct ttcgagacca tgggatttca ttattaactt gatcccatct 1081 tcagagctta ttctaagttt gcctcttcaa taaaactctc ctagaaggtt gtggctgtag 1141 cttagtggca gaacacttgg tgttgcaggg accaggtcct tcactaacag tgcaaaaact 1201 taaccaattt aaagaacatt ttctggctac tcaaattctc ttaaatttat tcctgtttca 1261 caagtaaaca cttcgctgct atct // LOCUS MUSZFPB1 561 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger (Zfp-35) gene, exon 1. ACCESSION J04770 KEYWORDS zinc finger protein. SEGMENT 1 of 3 SOURCE Mouse (strain 129) cosmid library pcos2EMBL DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 561) AUTHORS Cunliffe,V., Williams,S. and Trowsdale,J. TITLE Genomic analysis of a mouse zinc finger gene Zfp-35, which is unregulated during spermatogenesis JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by Cunliffe,V., 06-JUN-1990. FEATURES from to/span description pre-msg 233 > 561 zinc finger protein (Zfp-35) mRNA and introns IVS 286 > 561 Zfp-35 intron A BASE COUNT 141 a 135 c 149 g 136 t ORIGIN Chromosome 18 bands B3-C. 1 ccctagaagt tttgacatcg aaccacagca aagataagtt tgagtcaagc gggtccccct 61 atgctaaagc tgggagtcgc tttagggaac ttcctgttcc taattcacag agcgcgtact 121 tgggagggtg caccatagag ctgtgatcct cccacattcc tagagcggcc aagcgaggtc 181 gcattgcatc caaacggatg ttgaaggaat tgtgggtggc ggcgtttgac cggtagtttc 241 ttcatcgtcg gagagtaagg gaggcttctt gggcatggag agcaggtggg ttcttggcct 301 agtgttctgt ggacagcaag ttcgtgctcc ctgggccttt cattcaactg catgaacgtc 361 cgagatgttg gggctggtgg aataacgaag aggatgagcc cacactaatc agcctgcaga 421 gctggaaaag agatacctga cttgggttag aatcgaagga acgagagaga ataacatact 481 cttaattttt aaagccagta ttaccccccc cccccccaca cacacacacc gaattatcgt 541 tccatttacg cctctacttt a // LOCUS MUSZFPB2 340 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger (Zfp-35) gene, exon 2. ACCESSION M36145 KEYWORDS zinc finger protein. SEGMENT 2 of 3 SOURCE Mouse (strain 129) cosmid library pcos2EMBL DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 340) AUTHORS Cunliffe,V., Williams,S. and Trowsdale,J. TITLE Genomic analysis of a mouse zinc finger gene Zfp-35, which is unregulated during spermatogenesis JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by V,Cunliffe, 06-JUN-1990. FEATURES from to/span description pept 231 + 279 zinc finger protein (Zfp-35) pre-msg < 1 > 340 Zfp-35 mRNA and introns IVS < 1 66 Zfp-35 intron A IVS 280 > 340 Zfp-35 intron B BASE COUNT 90 a 79 c 66 g 105 t ORIGIN Chromosome 18 bands B3-C; unknown number of bps after segment 1. 1 gtcaccaaga ggaaacgcac atgaaagtta aatatatttt ttctccttta cgtttttcat 61 atctagaaac tctagtctca gaaatgacat ctctcctctg tacaatggtt ctcatctgag 121 ctcagggcta ccaagttgcc tgctggcttc aaaccattat gtggacaaaa ggatcctaca 181 gaagaaatgg taattctcca gttgcttcct gaggtgtcca gttgtgccct atggagatcc 241 aattcagtta cgaatctcaa gaccatcact ttctgtcagg tgagcctcag tgcatttcag 301 gttacgtggg tgatagcagt attcttcact gtctacactc // LOCUS MUSZFPB3 2268 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger (Zfp-35) gene, exon 3. ACCESSION M36146 KEYWORDS zinc finger protein. SEGMENT 3 of 3 SOURCE Mouse (strain 129) cosmid library pcos2EMBL DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2268) AUTHORS Cunliffe,V., Williams,S. and Trowsdale,J. TITLE Genomic analysis of a mouse zinc finger gene Zfp-35, which is unregulated during spermatogenesis JOURNAL Genomics (1990) In press STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by V,Cunliffe, 06-JUN-1990. FEATURES from to/span description pept + 87 1780 zinc finger protein (Zfp-35) pre-msg < 1 2166 Zfp-35 mRNA and introns IVS < 1 86 Zfp-35 intron B signal 2125 2130 polyA signal signal 2148 2153 polyA signal BASE COUNT 685 a 488 c 526 g 569 t ORIGIN Chromosome 18 bands B3-C; unknown number of bps after segment 2. 1 ctgcagggaa atactcattt gttctacctg gagctttctt acaacatgga gcgcattttc 61 catttagatt ttgtttatta ttccagatgg tgagactaag atcaagattg gagagccagc 121 tacagaggag gaaatgacag gaaagattgg aacagtgact gaagagtccg gcagccttga 181 ggaggatgtt ccccatgact ccagaggcaa agaattccgt gaatttggag aagaattgaa 241 cgatcagatg cttttcagaa gaagacagta taactgtgat gaatgtgacc aaagctttgc 301 ttggagtaca ggtctcatta ggcaccaaag aactcattgg aaaccttacg aatgtgagga 361 atgtggaaag gcctttcgga tgagctcagc cctggttctg catcagagaa ttcatactgg 421 agagaagccc tatccttgta gttggtgtat taaaagtttt agtcggagct cagaccttat 481 taaacatcaa agagtccaca ctggtgaaaa accttacaag tgtgatgagt gtggcaaggc 541 cttcagtcag agctcagatc ttatgataca tcagagaatc cacacaggag aaaaacccta 601 ccaatgtagt cattgtagta aaagctttag ccagcactca ggcatggtta aacatctgag 661 aatccacact ggagagaagc cttatatgtg taaccattgt tacaaacatt tcagtcagag 721 ctctgatctt ataaaacatc aaaggatcca cactggggag aaaccataca agtgtgatgt 781 gtgtgggaag gcctttagtc agagctctga tcgtattctc catcagagaa tccacactgg 841 ggagaagcca tatccgtgtg ctcagtgtaa caaaagtttt agtcaaaatt cagaccttat 901 taaacacaga aggatccaca ctggagagaa accgtataaa tgtagtgaat gtgggaaggc 961 ttttaaccag agctcagtcc ttattctgca tcaaagaatt cataccggag agaaacccta 1021 tccgtgtaac cagtgtacca aaagctttag taggctttca gatcttatta atcatcaacg 1081 aattcacacc ggagagaagc cttacccatg tagtcagtgc agtaaaatgt ttagtagaag 1141 gtcacacctt gttaaacatt acagaatcca cactggggag aagccctatg agtgtgataa 1201 gtgtggcaaa actttcagtc agagttccaa ccttattctc caccagagaa tccacactgg 1261 agagaaacca tacccgtgca acagctgttc taaaagcttc agtcgcggtt cagatctcat 1321 aaagcatcag agggtccaca ctggagaaaa accatacacg tgcaatctgt gcagtaagag 1381 tttcagtcaa agttcagacc tcactaaaca tcagagagtg cattctgggg agaagcccta 1441 ccactgtagt agttgtaaca aagcctttcg tcagagttct gaccttattc tccaccacag 1501 agttcacacg ggagaaagac catatgcgtg tacacagtgc cctagaagtt tcagtcaaaa 1561 gtcagacctg attaaacacc agagaattca cactggagag aagccatata aatgtatgtg 1621 tgggaaggct ttcagtcagt gctcagcctt tacccttcat cagagaatcc acactggaga 1681 gaaaccgtat ccctgtgctc agtgtggcaa aagcttcagt cagcgctctg atctggttaa 1741 ccatcaaaga gttcacgctg atcaaaaact acaaatgtga ctgatacctg tgtgaagcct 1801 tcagcactgt gaccaatctt actggatatc agaggatctg caggagccgt ttgcacctct 1861 ctgaaacgag aaaaaggaag agtgcctccc aggcggaatc aaagcaggtc agttggcaga 1921 gttcccggct tgctgatgca tggaacaatg agaagaggtg tgcttgaggc tcaagtcaag 1981 ggctgaatct tctgttgacc ttggtggaag gaacggggct tactggatgg gaacagatgt 2041 cttgtgtgta tattgcagca aattgttaat actggcttgt ggctttgctc cccatggtac 2101 tgtagtataa aaaaggctat gagaaataca ctcactgctg gcgtggtaat gacctgcagc 2161 cctcccatgt cagtctcttg catcgttttt ctgtctttcc tataatcatc ctcacttccc 2221 cctcaggggc tgttcgactt cacgccggcc ggactagttc aaggatcc // LOCUS PP1PAR1 285 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage miniplasmid P1 parA gene, 5' end. ACCESSION M36424 KEYWORDS centromeric-like DNA. SEGMENT 1 of 2 SOURCE Bacteriophage P1 plasmid prophage DNA, from E.coli. ORGANISM Bacteriophage P1 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 285) AUTHORS Austin,S. and Abeles,A. TITLE The partition functions of P1, P7, and F miniplasmids JOURNAL Basic Life Sci. 30, 215-226 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 232 > 285 parA protein binding 218 222 parA ribosomal binding site signal 149 154 parA -35 region signal 171 176 parA -10 region BASE COUNT 98 a 57 c 60 g 70 t ORIGIN 1 aagcttggct gcccattgaa gcaggaaatc accaaaatga ttcaggctac aacctgaacg 61 tagaagaaat ccgcgtcctt tatgcgtgga ggatgccaaa gcatgttgtg acacacttgg 121 caaaggagta agcatgcaga gaatgctatg tacaagcatc tacgcataca ttattatttt 181 atgcagcatt tttaattaaa ttcaaaaata cagcataaag gatgactttc gatgagtgat 241 tccagccagc ttcacaaggt tgctcaaaga gcaaacagaa tgctc // LOCUS PP1PAR2 178 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage miniplasmid P1 parB gene, 3' end. ACCESSION M36425 KEYWORDS centromeric-like DNA. SEGMENT 2 of 2 SOURCE Bacteriophage P1 plasmid prophage DNA, clone pALA161, from E.coli. ORGANISM Bacteriophage P1 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 178) AUTHORS Austin,S. and Abeles,A. TITLE The partition functions of P1, P7, and F miniplasmids JOURNAL Basic Life Sci. 30, 215-226 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 51 parB protein BASE COUNT 62 a 42 c 21 g 53 t ORIGIN 2150 bp after segment 1. 1 tcgataaaaa gccgaagcct taaactttcg ccattcaaat ttcactatta actgactgtt 61 tttaaagtaa attactctaa aatttcaagg tgaaatcgcc acgatttcac cttggatttt 121 accttcctcc cctcctcccg aaaaaaataa aaaaattgct tgtcacgaga aagtcaac // LOCUS PP7C4RNA 612 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage P7 c4-RNA (c4, orfx, ant1) gene, 5' end. ACCESSION M35139 KEYWORDS antirepressor protein; antisense RNA; c4 repressor. SOURCE Bacteriophage P7 (strain c1.48ts) DNA, from E.coli (strain C600). ORGANISM Bacteriophage P7 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 612) AUTHORS Citron,M. and Schuster,H. TITLE The c4 repressors of bacteriophages P1 and P7 are antisense RNAs JOURNAL Cell (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Schuster, 06-JUL-1990. FEATURES from to/span description pept 362 583 orfx pept 580 > 612 ant1 protein binding 346 354 orfx ribosomal binding site mut 278 278 c in P7c1.48ts; t in P7c4.2ts signal 46 74 promoter 51a signal 86 114 promoter 51b site 31 47 Op 51 site 146 152 presumptive c4 target site a1 site 160 167 presumptive c4 target site b1 site 271 278 c4 element b' site 285 291 c4 element a' site 315 321 c4 target site a2 site 344 351 c4 target site b2 BASE COUNT 147 a 158 c 128 g 179 t ORIGIN 1 acaaataata tatctattgc acgacactaa tctgctctaa taaatttgta tttttaagtc 61 acgaatgcta tcttttcgca tcatattgac cttttaatcg ttcaggctta tagttccgcc 121 gtcgtagcaa attctgcgac cgggtttagc agcctgaatg attgtgcgga cagccgcaga 181 tatccgatat tgcggtattt ttgtgtccgt aaaaccgcgt tacgcccaaa ttatggtggg 241 gcgtgatggg gaggcttcgg cctgctggtt tcacaatcgc cagtctgcta accccgtcac 301 gtcctgccac ctgtttagca gcgggtagca ggttgttaaa cctgattgtg aggccgtaac 361 tatggttaat gccaatcctt gcgcacgcca agaattcatc tggcgtttct attcctgtaa 421 aaaacaccac tatcacttcg tcatcgcagc aacagaagac gaagcacgct ctcaattgcc 481 tgatggcccc tgcattttta ctgcccgttt ttcaactaac tcgcgcaatt cacttagtta 541 ctggaacctc cccttctctg ccgacgttca ggggggttta tgaaaaaacc tctcgtcacc 601 cgtaatgaca ta // LOCUS PSEPGLAA 1986 bp ds-DNA BCT 14-AUG-1990 DEFINITION P.solanacearum polygalacturonase A (PGA, pglA) gene, complete cds. ACCESSION M33692 KEYWORDS alpha-1,4-endopolygalacturonase A. SOURCE P.solanacearum Race 1 (strain AW) DNA, clone pJE8. ORGANISM Pseudomonas solanacearum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1986) AUTHORS Huang,J. and Schell,M.A. JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1715) AUTHORS Huang,J. and Schell,M.A. TITLE DNA sequence analysis of pglA and mechanism of export of its polygalacturonase product from Pseudomonas solanacearum JOURNAL J. Bacteriol. 172, 3879-3887 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Schell, 13-APR-1990, for release after publication. FEATURES from to/span description pept 100 1689 polygalacturonase A precursor (pglA) sigp 100 162 polygalacturonase A signal peptide matp 163 1686 polygalacturonase A mRNA 58 > 1986 pglA mRNA signal 27 59 promoter region signal 27 31 -30 Pseudomonas promoter consensus binding 89 92 pglA ribosomal binding site (put.) BASE COUNT 351 a 685 c 616 g 334 t ORIGIN 1 ctgcagcacg cccttcgcgc agacgaaggc gtgccgctgc ttcgctgatc ccggcgcatt 61 gcgtcctcac aaaagacctc tcttttcgga gaattgacct tgaatcaccg ctacacactc 121 ctcgcactcg ccgccgcagc cctgtctgcc ggagcgcacg caaccggtac gtccgtcacg 181 gccccgtggg gcgaggtggc cgagcccagc ctgccggcgg attcggcggt ctgcaagacg 241 ctgtcggcct cgatcacgcc gatcaaaggc tccgtcgatt cggtggacgg caaccccgcc 301 aactcccagc cggacgcgag ccgcatccag tccgccatcg acaactgccc ggccggccag 361 gccgtcaagc tcgtgaaggg cagcgccggc gaatccggct tcctgagcgg ctcgctgaag 421 ctcaagtcgg gcgtgacgct gtggatcgac accggcgtga ccctgttcgc ctcgcgcaac 481 ccggccgact acgacaatgg cctgggcacc tgcggcaccg ccaccaccag caacgacaag 541 tcgtgcaacg cgctgatcgt ggcgcgcgat acggcgggca gcggcatcgt cggcgccggc 601 gccatcgatg gccgcggcgg cagcctggtc accagcggcc cgaacgcgaa ccggctcacc 661 tggtgggaca tcgcttacct gaacaagacc aaggggctga accagcagaa cccgcgcctg 721 atccagacgt ataacggcag cgcgttcacg ctgtacggcg tgacagtgca gaactcgccc 781 aacttccata tcgtcaccac cggcacgtcg ggcgtaacgg catggggcat caagatcgtg 841 acgccgagcc tggcctacgc cgtggcgggc tacaagtgcc cgtccggctc cacgcccgac 901 aaggtcacgc ccgcgacctg cttcacgccc gagacggtca agaacaccga cggcttcgac 961 ccggggcaat cgaccaacgt ggtgctggcc tattcctaca tcaacacggg tgacgaccac 1021 gtcgcggtca aggccagcag cggcccgacg cgcaacctgc tcttcgcgca caaccatttc 1081 tactatgggc acggcctgtc gatcggcagc gaaaccaata ccggcgtgag caacatgctg 1141 gtgaccgacc tgacgatgga cggcaacgac agctcggccg gcaacggcct gcgcatcaag 1201 tccgatgcct cgcgcggcgg caaggtcacc aacatcgtct acgacggcat ctgcatgcgc 1261 aacgtcaagg agccgctggt gttcgacccc ttctacagca gcgtgaaggg ttcgctctat 1321 ccgaacttca ccaacatcgt cgtcaagaac ttccatgacc tgggcagcgc caagtccatc 1381 aagcgcacga tgacgttcct gggctacaaa gccaacaagc agaagaaccc gctgacgatc 1441 acgctcgaca acgtggtctt cgacggcacg ctgccggcct tcgaggggtc ccactacggc 1501 gggccggctt cgcccaacgg ggtgcacttc accttcggcg gcaccggtcc ggtgagcttt 1561 gccgacgcca tcgtcacgtc gtcgaccacc gatgtgacgg tgaccggcac gccgggcacg 1621 gccgcggcgg tggattgcag caaggccttc gtgccgctga agtccgtggc accgacctcg 1681 ccgatctgat cgggcgactc cgctcagcgg tacccgttct gccgcttgta ctggcggatg 1741 gcttcgtcgt ggttggggtc ctcgtgtgca ttgtgcgcgc gctgatcttc gaactcacgg 1801 atggcattgg cctgagcctc cgtcagtttg ggcagcggcg gcttgcgttg gggtgacggg 1861 gtggtgctgt acgtcaccgg ctggctgtcg gcggagttcg tcttcgtagg cgatgtggtg 1921 tccgcccttt tcaccgtggt agtcgttgcg cgcttcggtc tcgcggtcca gttgggtccg 1981 ctgcag // LOCUS RATCATHHA 1362 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat cathepsin H mRNA, complete cds. ACCESSION M36320 Y00708 KEYWORDS cathepsin H; cysteine protease. SOURCE Rat (strain Wistar) kidney, cDNA to mRNA, clone lambda-H44. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1362) AUTHORS Ishidoh,K. JOURNAL Unpublished (1987) Tokyo Metro Inst of Med Sci,Tokyo 113, Japan STANDARD simple staff_entry REFERENCE 2 (bases 1 to 1362) AUTHORS Ishidoh,K., Imajoh,S., Emori,Y., Ohno,D., Kawasaki,H., Minami,Y., Kominami,E., Katunuma,N. and Suzuki,K. TITLE Molecular cloning and sequencing of cDNA for rat cathepsin H JOURNAL FEBS Lett. 226, 33-37 (1987) STANDARD simple staff_entry COMMENT EMBL features not translated to GenBank features: key from to description SITE 873 874 processing site SITE 211 213 pot. glycosylation site SITE 298 301 pot. glycosylation site SITE 685 688 pot. glycosylation site FEATURES from to/span description pept 4 1005 cathepsin H pre-pro-peptide sigp 4 342 cathepsin H signal peptide matp 343 1002 cathepsin H mRNA < 1 1362 cathepsin H mRNA signal 1320 1325 poly A signal BASE COUNT 369 a 341 c 343 g 309 t ORIGIN 1 gcgatgtgga ctgcgctgcc cctgctgtgc gctggggcct ggctgctgag tgctggggcc 61 accgctgagc tgaccgtgaa tgccatagaa aagtttcact ttacgtcatg gatgaaacag 121 catcaaaaga cgtacagctc gagggagtac agccacaggc tgcaggtgtt tgccaacaac 181 tggaggaaga ttcaagccca caaccagagg aaccacacat ttaaaatggg attgaaccag 241 ttttcagata tgagctttgc cgaaataaaa cacaaatacc tctggtcaga gcctcagaat 301 tgctcagcca ccaaaagtaa ctacctccgt ggtactggcc cctacccatc ctccatggac 361 tggaggaaga aaggaaatgt cgtttcacca gtgaagaacc agggggcctg tggcagctgc 421 tggactttct caaccactgg ggccctagag tcagctgtgg ctattgccag tgggaaaatg 481 atgaccttgg ctgagcagca gctggtggat tgtgcccaga acttcaacaa tcatggctgc 541 caaggaggtc tccccagcca ggccttcgag tacatcctgt acaacaaggg catcatggga 601 gaggacagct acccttacat aggcaagaat ggtcagtgca aattcaaccc agaaaaggcc 661 gtcgcgttcg tcaagaatgt tgtcaacatc acactcaatg atgaggctgc aatggtagag 721 gctgtggccc tatacaatcc tgtgagcttt gcctttgagg tgactgaaga ttttatgatg 781 tataaaagtg gtgtctactc cagtaactcc tgtcataaaa ctccagataa agtaaaccat 841 gcagtcctgg ctgttggcta tggagaacag aatggattac tctactggat tgtgaaaaac 901 tcttggggct ccaactgggg gaacaatggg tacttcctca ttgagcgtgg aaagaacatg 961 tgtggcctgg ctgcctgtgc ctcctacccc atccctcagg tataagccac ggctgcacag 1021 gccaactgct tggcagacaa agggaggaac tggtcctacg atgagaatgc cgccctggag 1081 aaagttgttc agaaatccac ccagaggccc tctcactcct gagtctagac gcctaaagac 1141 aagtaaggaa gaacttgacc agcaacaagc ccgcccatgt gacgacatca ccagccatac 1201 gctttgtttg aatatggttt ttaaatgacc caaaaccatg tggacctaga atcttctctt 1261 ttcagctctc ttcatatagg gaaagctccg atggcttacc ttttctatgt tgtatattca 1321 ataaatgcag tgaacgcctg cctgggctgg gcctagaccc ct // LOCUS RATDRTKCA 471 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat delayed-rectifier-type K+ channel mRNA, complete cds. ACCESSION M36461 KEYWORDS delayed-rectifier-type K+ channel. SOURCE Rat neonatal heart and ovariectomized, diethylstilbestrol-primed uterus, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 471) AUTHORS Folander,K., Smith,J.S., Antanavage,J., Bennett,C., Stein,R.B. and Swanson,R. TITLE Cloning and expression of the delayed-rectifier I-s-K channel from neonatal rat heart and diethylstilbestrol-primed rat uterus JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2975-2979 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 41 433 delayed-rectifier-type K+ channel BASE COUNT 110 a 132 c 122 g 107 t ORIGIN 1 agagttttgc tccacatcag ggaaaccgtg aagccccagg atggccctgt ccaattccac 61 aactgttctg ccttttctgg cctcgctgtg gcaggaaaca gatgagccgg gtggcaatat 121 gtcggcggac ttggctcgta ggtcccagct ccgagatgac agcaagctgg aggctctcta 181 tatcctcatg gtgctgggtt tcttcggctt cttcaccctg ggcatcatgc tgagttacat 241 ccgatccaag aagctggaac actcgcacga ccctttcaac gtgtacatcg agtcggacgc 301 ctggcaggag aaaggcaagg ccctcttcca ggcccgcgtt ctggagagct tcagagcttg 361 ctatgtcatt gaaaaccagg cggccgtaga acaacctgcc acacaccttc ctgaactgaa 421 gccactgtca tgaaccccat agttaattaa tagacaagtg ataagtgggt c // LOCUS RATPAPA 1603 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat prostatic acid phosphatase (rPAP) mRNA, complete cds. ACCESSION M32397 KEYWORDS prostatic acid phosphatase. SOURCE Rat (strain Sprague-Dawley) prostate, cDNA to mRNA, clone rPAP-[145,343A]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1603) AUTHORS Roiko,K., Jaenne,O.A. and Vihko,P. TITLE Primary structure of rat secretory acid phosphatase and comparison to other acid phosphatases JOURNAL Gene 89, 223-229 (1900) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Vihko, 02-MAR-1990, for release after publication. FEATURES from to/span description pept 41 1186 prostatic acid phosphatase precursor (rPAP; EC 3.1.3.2) sigp 41 133 prostatic acid phosphatase signal peptide matp 134 1183 prostatic acid phosphatase mRNA < 1 1603 rPAP mRNA signal 1583 1588 polyA signal BASE COUNT 434 a 414 c 387 g 368 t ORIGIN 1 cacctcctac agctgccaca agcagctctc ttcccccacc atgagagctg tccctctgca 61 cctcgtcggg acagcaagcc tcacccttgg cttcttgctc ctgctatctc tccgcctgga 121 cccaggccaa gccaaggagt tgaagtttgt gacattggtg ttccggcatg gagaccgagg 181 tcccatcgag acctttccta atgaccccat taaggaatcc tcgtggccac aaggatttgg 241 ccaactcacc aagtggggca tgggacagca ctacgaactc ggaagttata taaggagaag 301 atacgggaga ttcttgaaca actcctataa acatgaccag gtttatatcc gaagcacaga 361 tgttgacagg actctgatga gcgctatgac aaacctcgca gccctgtttc cccctgaggg 421 gatcagcatc tggaatccca gactgctctg gcagcccatc ccagtgcaca ccgtgtctct 481 ctctgaggat cggttgctat acctgccttt cagggactgt cctcgctttc aagaactcaa 541 gagtgagact ttaaaatctg aggagttcct gaagaggctt caaccatata aaagcttcat 601 agacaccttg ccatcgctgt cgggattcga ggaccaggat ctttttgaaa tctggagtag 661 gctttacgac cctttatatt gcgagagtgt tcacaatttc accttccgca cctgggccac 721 agaggacgcc atgactaagt tgaaggagtt gtcagaatta tctctgttat ctctttatgg 781 aattcacaag cagaaagaga aatctagact ccaggggggc gtcctggtca atgaaattct 841 caagaacatg aagcttgcaa ctcaaccaca gaaggccagg aagttgatca tgtattctgc 901 atatgacact actgtgagtg gcctgcagat ggcgctagag ctttataatg gacttctacc 961 tccctacgct tcctgccaca taatggaatt gtaccaggat aatgggggga ccttcgtgga 1021 gatgtactac cggaatgaga cccagaacga gccctaccca ctcacgctgc cgggctgtac 1081 ccacagctgc cctctggaga agtttgcaga gctactggac cccgtgatcc cccaggactg 1141 ggccacagag tgtatgggca caagcaacca ccaagcgtcg ctgtaaaaga acaaccagtt 1201 gccagtgtaa tcaggaaaag agttaggtca aacttgtggg tctgccgggg atccatgatt 1261 agcacaaaga actctattgc tgaaggattc gaagattttg cttgagcggg tgagtaaagc 1321 tgccctggga atgtctgatg taaaaacagg tttcagacga agcctaccaa gttagcgtgc 1381 aagcatgaaa aagaagacac ttatcccagg ctaatgggca aaggacctga ttcacctgac 1441 aacaacgctc acctgttccc ctgccttctc tcacgaggca aaaagatgag acaggaggac 1501 acctggagga gttttccata cccagcctag aacgatcaga ttttatatgt atgtcaaaag 1561 acgctgggta gcagaactga ataataaaag cctcttggtg tcc // LOCUS RHLNIFHA 524 bp ds-DNA BCT 14-AUG-1990 DEFINITION R.leguminosarum nitrogen fixation protein nifH gene, 5' end. ACCESSION M36435 KEYWORDS nitrogen fixation protein. SOURCE R.leguminosarum DNA, clone pGB5. ORGANISM Rhizobium leguminosarum Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rhizobiaceae. REFERENCE 1 (bases 1 to 524) AUTHORS Roelvink,P.W., Harmsen,M., van Kammen,A. and van den Bos,R.C. TITLE The nifH promoter region of Rhizobium leguminosarum: Nucleotide sequence and promoter elements controlling activation by NifA protein JOURNAL Gene 87, 31-36 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 435 > 524 nifH protein mRNA 364 > 524 nifH mRNA binding 420 426 nifH ribosomal binding site BASE COUNT 124 a 139 c 137 g 124 t ORIGIN 1 gttaacccta tgcgggcacc agacgcggca tggcgcttgt catcgccgga gacagcgctt 61 tgcggcaagt gatcagaatt tgttacggcc ggtctggtga tggctgctct aacaaagtcc 121 tagaagctac gcatggataa atgagacatc ctgccggttg gtcagatatt gcacaaacgg 181 ccaccattca ttggccctca gagggtgaca atctatttcg acattgtcat cacctttgtc 241 ggttacttga caagcgtttg tgttctcaat cccgtatttt acgcccacat agctggaatc 301 gcggcgaatt attgttcgtc agctcaatcg gccgcttggc acgaatcttg agagctattg 361 agaggcagcg gaacggccgc cgcattccgt tgcgggtaac caaattgctt cgaacacatg 421 aaggaacgcc aagcatggca gctctgcgtc agatcgcatt ctatggaaaa ggcggaattg 481 gcaagtccac tacgtcccaa aacacgctgg ccgcccttgt cgac // LOCUS STYAHPCFA 2608 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.typhimurium alkyl hydroperoxide reductase (ahpCF) gene, complete cds. ACCESSION J05478 KEYWORDS alkyl hydroperoxide reductase; oxidoreductase. SOURCE S.typhimurium (strain TN1379) DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 2608) AUTHORS Tartaglia,L.A., Storz,G., Brodsky,M.H., Lai,A. and Ames,B.N. TITLE Alkyl hydroperoxide reductase from Salmonella typhimurium: Sequence and homology to thioresoxin reductase and other flavoprotein disulfide oxidoreductases JOURNAL J. Biol. Chem. 265, 10535-10540 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.Tartaglia, 05-JUN-1990. FEATURES from to/span description pept 166 720 alkyl hydroperoxide reductase C22 protein (aphC) pept 969 2534 alkyl hydroperoxide reductase F52a protein (aphF) mRNA 142 > 2608 ahpCF mRNA binding 155 160 aphC ribosomal binding site (put.) binding 959 964 aphF ribosomal binding site (put.) signal 105 110 ahpCF -35 region signal 129 134 ahpCF -10 region BASE COUNT 652 a 655 c 724 g 577 t ORIGIN 13 minutes on S.typhimurium map. 1 tcaggcggcc tttttacttt agatggctga cagggcgcag tgttcacaat gtcgcccgaa 61 tcgggttgtt agttaacgct tattgatttg ataatggaaa cgcattagcc gaatcagcaa 121 aatttcgtta acttactcct caacgaaaac acggaggaag tatagatgtc cttaattaac 181 accaaaatca aacctttcaa aaaccaggcg ttcaaaaacg gtgaattcat tgaagtcacc 241 gagaaagata ccgaaggccg ctggagcgtc ttcttcttct acccggccga ttttaccttt 301 gtttgcccga ctgaactggg tgacgttgcc gaccattacg aagaactgca gaagctgggc 361 gtagacgttt attccgtctc aaccgatact cacttcacgc acaaagcatg gcacagcagc 421 tctgaaacta tcgcaaaaat caaatatgcg atgatcggcg acccgactgg cgccctgacc 481 cgtaacttcg acaacatgcg tgaagatgaa ggtctggcag acgtgcgacc tttgttgttg 541 acccgcaggg tatcatccag gcgatcgaag ttaccgctga aggtatcggc cgtgacgcgt 601 ctgacctgct gcgtaaaatt aaagcagcac agtacgttgc cgctcaccca ggcgaagtat 661 gcccggcgaa atggaaagaa ggcgaacgac tctggctcca tccttagacc tggtcggtaa 721 aatctaaatt tccgtcgtct ttcacgccat agctgcgttg gcgtcgcctg ctcaccccgg 781 tcacttactt cagtaagctc ccggggattc tcaggctagc cgccttactc tgacgtgaaa 841 tactttggaa attgtttcag tcatgggcgc acccgcgccc atttcatttc agcaccatga 901 cgcaagttgc attaatgcag tcccgcgaag tcggcttgca tgatgatgtt ttcagagagg 961 gaagaagaat gctcgacaca aatatgaaaa cccagctcag ggcttacctt gagaaactga 1021 ccaaacctgt tgagctgatt gccacgctgg atgacagcgc taaatcggcg gaaatcaagg 1081 aactgctggc tgaaatcgcc gagctgtccg acaaagtcac gtttaaagaa gacaatactc 1141 tgccggtgcg caagccgtct ttcctgatta cgaatccagg ttctcagcaa gggccgcgct 1201 ttgccggttc tccgctgggc cacgaattta cctcgctggt tttggcgctg ctgtggaccg 1261 gcggtcatcc gtcaaaagaa gcgcagtcgc tgctggagca gattcgcgat attgacggcg 1321 attttgaatt cgaaacttat tattcactct cctgccataa ctgcccggac gtagtgcagg 1381 cgctgaacct gatggcggtg ctgaacccgc gtattaaaca tacggcaatt gatggcggca 1441 ccttccagaa cgaaatcacc gaacgtaacg tgatgggcgt tccggcggta tttgttaacg 1501 gtaaagagtt cggccaggga cgtatgactc tgaccgagat tgtcgctaaa gtggatactg 1561 gcgcagaaaa acgcgcggcg gaagcgctga acaaacgcga tgcgtatgac gtactgattg 1621 tcggctccgg cccggcgggc gcggcggcgg cggtctactc ggcgcgcaaa ggcatccgta 1681 ccggtctgat gggcgagcgc ttcggcggtc aggtgctgga taccgtggat atcgaaaact 1741 atatctcggt gccgaaaacc gaaggccaga aactggcggg cgcgctgaaa gcgcatgtca 1801 gcgattacga cgttgacgtt atcgacagcc agagcgcaag taaactggtt ccggccgcaa 1861 cagaaggcgg cttgcatcag attgaaacgg cgtccggcgc agtattgaaa gcgcgtagta 1921 tcatcatcgc caccggcgcg aaatggcgca atatgaatgt gccgggcgag gatcagtatc 1981 gcaccaaagg cgtcacctat tgtccgcact gcgacggtcc gttatttaaa gggaagcgtg 2041 tggcggtcat cggcggcggc aactctggcg ttgaagcggc tatcgatctg gcaggtattg 2101 tagaacatgt tacgttgctg gagttcgcgc cggagatgaa ggcggatcag gtgctacagg 2161 acaaagtgcg tagtctgaaa aatgtcgata tcattttgaa tgcgcaaacg acagaagtga 2221 agggcgacgg ttcgaaagtg gtgggactgg agtatcgtga ccgtgtcagc ggcgatattc 2281 atagcgtcgc gctggcaggg atttttgtgc agattggtct gttgccgaat acccactggc 2341 tggaaggcgc gcttgagcgt aaccgcatgg gcgagatcat tattgacgcc aaatgtgaaa 2401 ccagcgttaa gggcgtcttc gccgcaggcg attgcaccac cgttccttac aaacagatta 2461 tcatcgccac gggcgaaggg gcgaaagcct cgttgagcgc ctttgattat ctgattcgca 2521 ccaaaatcgc ataaaaagaa gtaagcacac ctgcaaggcc agttaactgg ccatcgtaaa 2581 tggccgatag tgtaagctat tcgggcct // LOCUS TMGCG 6355 bp ss-RNA VRL 14-AUG-1990 DEFINITION Tobacco mild green mosaic virus complete genome. ACCESSION M34077 M22483 KEYWORDS coat protein. SOURCE Tobacco mild green mosaic virus (strain U2-TMV), cDNA to viral RNA, from N.tabacum cv. Samsum. ORGANISM Tobacco mild green mosaic virus Viridae; ss-RNA nonenveloped viruses; Rod-shaped ss-RNA viruses; Tobamovirus. REFERENCE 1 (bases 6127 to 6355) AUTHORS Garcia-Arenal,F. TITLE Sequence and structure at the genome 3' end of the U2-strain of Tobacco mosaic virus, a histidine-accepting tobamovirus JOURNAL Virology 167, 201-206 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 6355) AUTHORS Solis,I. and Garcia-Arenal,F. TITLE The complete nucleotide sequence of the genomic RNA of the tobamovirus tobacco mild green mosaic virus JOURNAL Virology 177, 553-558 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Garcia-Arenal, 10-FEB-1989. The RNA appears to have a tRNA- like, L-shaped structure at the 3' terminus, linked to a quasi- continuous double-helical stalk, with five pseudoknots involved in the formation of the whole structure. However, the structure of U2-TMV RNA is less stringently conserved than the 3' termini of "vulgare" and other histidine-accepting tobamoviruses. Draft entry and computer-readable sequence for [1] kindly submitted by F.Garcia-Arenal, 08-MAY-1990, for release after publication. FEATURES from to/span description pept 71 3406 126 kDa protein pept 71 3403 183 kDa protein 3407 4900 183 kDa protein pept 4890 5660 28.5 kDa protein pept 5666 6145 coat protein BASE COUNT 1933 a 1112 c 1489 g 1821 t ORIGIN 1 gatgttttaa tagttttcga caacaacaat taaaacaaaa acaacatatt acaaacaaca 61 aacaacaaca atggcacaca tacaatctat aattagcaac gcccttcttg aaagcgtgag 121 tggtaaaaac actctcgtta atgaccttgc aagaaggcgc atgtacgata cggccgtgga 181 agaatttaac gcccgcgacc gtagaccaaa ggtcaacttt tccaaaacta ttagcgaaga 241 gcaaacgctt ctagtctcca acgcgtaccc ggagttccag attacctttt ataatactca 301 aaatgccgta cacagtttgg ctggaggttt gagagcatta gaattggaat atctgatgct 361 acaagttccc tatggatcgc cgacatatga tataggtggg aactttgcag cacatttgtt 421 caaaggcagg gattacgtgc attgctgtat gcccaatctg gacatacgag atataatgag 481 gcacgaagga caaaaggact caattgagat gtatttgtcc agattgtctc gttctaacaa 541 ggtaattcct gagtttcaaa gggaggcttt taacaggtat gcagaagctc ccaacgaagt 601 ctgctgctct aaaacttttc aggattgtcg aatacatccg ccagagaata gtggtagaag 661 atacgctgtt gctctgcaca gtttgtatga tattcctgtg catgagtttg gagctgcgtt 721 aatatctaag aatatacatg tatgttatgc agcttccatt ttggcagaag cattattact 781 agaccagacg gaggttacgc ttaatgaaat aggcgcaact ttcaaaagag aaggtgatga 841 tgtttctttt ttctttgctg atgaaagtac tttaaattat agtcataaat acaaaaatat 901 cttgcattat gtagttaaat cttactttcc tgcttctagt agaatagttt actttaagga 961 atttttagtc actagggtta atacttggtt ttgtaaattt accaaagtag atacctatat 1021 tctgtacaag agtgttagac aagtagggtg tgatagtgat cagttctatg aggcgatgga 1081 agacgccttt gcttacaaga aaaccttggc catgttcaac actgaaagag caatctttag 1141 agacacggct tcggttaact tttggttccc taagatgaag gacatggtga tagtaccgct 1201 gtttgagggt tctattacca gcaaaaagat gacaaggagt gaggtcattg ttaatcgtga 1261 cttcgtttac acagtgctta atcatatcag aacatatcaa gccaaagcgt taacttacca 1321 gaacgtatta tctttcgtgg agtctataag atcccgcgtg ataatcaatg gtgttactgc 1381 taggtctgaa tgggatgtag ataaagcaat tcttcaaccc ttgtcaatga ctttcttctt 1441 gcagactaag ctggctgcgc ttcaagacga tatagtaatg ggaaagtttc ggtgcttgga 1501 taagaccact tctgaactta tttgggatga ggtgggcaaa ttttttggaa acgttttccc 1561 cactatcaaa gagagattgg tgagcaggaa aattctggat gtaagtgaga atgctctgaa 1621 gatcaagatc ccagatctgt atgtcacatg gaaagacagg ttcgtagctg aatacaccaa 1681 gtctgaggag ttaccgcatc tagatatcaa gaaggactta gaagaagctg agcaaatgta 1741 cgacgcgtta tcagaattat ctatccttaa gggtgctgat aatttcgata tcgcgaagtt 1801 caaagacatg tgcaaggctt tagatgttag tcctgatgtg gcagcacgag taatcgttgc 1861 agtggccgag aatagaagcg gtttaactct tacttttgat aagccaaccg aggagaatgt 1921 ggctaaggct cttaaaagca cggcgtctga ggccgtggta tgtcttgaac cgacatccga 1981 agaggtgaac gtaaataaat tttctattgc tgagaaaggg agattgcctg tgtgtgcaga 2041 aagtcatggt ttgacgaatg ctaacttaga gcaccaggag ttggagtccc tcaacgattt 2101 ccataaggct tgcgtggata gtgtgattac aaagcaaatg gcatcggttg tctacactgg 2161 ctcactcaaa gttcaacaaa tgaagaacta tgtggacagt ttggcagctt cgttgtccgc 2221 cactgtatca aatctatgca agtcactaaa ggatgaagtc gggtatgatt ctgattccag 2281 ggagaaagtt ggtgtttggg atgtcacttt gaaaaagtgg ctcctcaaac ctgcggccaa 2341 aggtcattca tggggagttg tcctggatta caaggggaaa atgtttactg cacttctatc 2401 ttatgaagga gatagaatgg tgactgagag cgactggagg agggtggctg tatcatctga 2461 tacaatggta tattctgata ttgcaaagct ccaaaatctg aggaaaacaa tgagagacgg 2521 tgaaccccac gaacctactg caaagatggt acttgtggat ggggtgcctg gttgtggaaa 2581 gtacaaagga gattttgaaa gatttgatct tgatgaggat ttgatcttgg ttcctggaaa 2641 acaagctgct gctatgatca gaagaagggc taattcatct ggactgataa gagccacaat 2701 ggacaatgtg agaacggtag attcacttct aatgcatcca aaaccgcgat cacacaagag 2761 gctttttatt gatgaagggt tgatgctgca caccggttgt gttaacttcc tggtgcttat 2821 ctctggttgc gacatcgcat acatttacgg agatacacag cagattcctt tcattaacag 2881 agttcagaat ttcccgtatc ccaaacattt tgagaagctg caagtggatg aagttgagat 2941 gaggaggacc acactgagat gcccaggtga tgtgaatttt ttcctacaat cgaagtacga 3001 aggagcggtg acaaccactt caactgtaca acgatcggtc tcatctgaga tgataggcgg 3061 taagggagta ctaaacagtg tttccaaacc actaaaaggg aaaattgtaa ctttcactca 3121 ggctgataaa tttgagttag aggagaaggg ctataagaat gtgaacaccg ttcatgagat 3181 ccaaggagaa acctttgaag atgtgtcgct ggtcagattg acggcaactc cactgactct 3241 gatttccaag tcttccccgc atgttctagt cgctctgact agacacacaa agagcttcaa 3301 atattacacc gtagtgttag atcctttagt acagataatt agtgatttgt cttctttaag 3361 ctccttcctt ttagaaatgt atatggtaga agcaggtagt agatagcaat tacagatgga 3421 tgcagtgttc aaaggtcata atctctttgt ggcaacacct aaatcaggag actttccaga 3481 tctacagttc tattacgatg tatgcctccc tggtaatagt actatactta acaagtatga 3541 tgctgttacc atgaggttac gtgataatag tcttaatgtg aaggattgtg ttcttgattt 3601 ttccaaaagt attccgatgc caaaggaggt gaaaccatgt ctagagccag ttttgcgtac 3661 cgcggcggaa ccgccaaggg ctgcaggact actcgaaaat ctggttgcaa tgattaaaag 3721 aaatttcaac gcaccagacc tgacggggac gattgacatt gagagcaccg catctgttgt 3781 agtagataag ttttttgata gctattttat taaaaaagaa aaatacacaa aaaatattgc 3841 tggagtgatg acgaaggatt caatgatgag atggttggaa aacaggaaag aagtactatt 3901 ggacgacttg gctaactaca attttacaga tctgccggcc atcgatcagt acaagcacat 3961 gatcaaggct caaccaaaac agaaattgga cctttcaatt cagaatgaat accctgctct 4021 gcaaacaatt gtctaccatt cgaagcagat caacggtatt ttggccggtt tctcagagct 4081 tacaaggttg ctgctcgagg catttgattc taagaagttt cttttcttta ctaggaaaac 4141 tccagaacag attcaagaat ttttctcgga tctcgactcg cacgttccta tggatgtgtt 4201 agaactggat atttctaagt atgataagtc acagaacgag tttcattgtg ctgtagagta 4261 tgaaatatgg aaaagattgg gtctcaatga gtttttggcc gaagtgtgga aacaagggca 4321 caggaaaaca actttgaagg attacattgc tggaatcaag acatgtctgt ggtatcaaag 4381 gaaaagcggt gatgtgacta ctttcatcgg caatactgtt ataatagcag cttgcttggg 4441 ttcaatgtta ccgatggaaa aggtcataaa aggtgctttt tgtggagacg attccgtttt 4501 gtattttcca aagggtttgg atttccctga cattcagtca tgtgctaatc tcatgtggaa 4561 ttttgaggcc aaactgtata gaaagaggta cggttacttt tgtggtagat acatcataca 4621 ccatgataag ggagcaatag tgtattatga tcctttgaag ttgatctcca aacttggggc 4681 aaaacatatc aaggattatg atcacttaga agagttaagg gtgtctttgt gcgatgttgc 4741 ttgttcgctc ggaaactggt gcttaggctt tccgcagctg aacgcagcta tcaaggaggt 4801 tcataaaacc gcgattgatg gttcgtttgc ttttaattgt gttaacaaat ttttgtgtga 4861 taaattttta tttagaactt tgtttttaaa tggctgttag tctcagagat actgtcaaaa 4921 ttagcgagtt cattgatctt tcgaaacagg atgagatact tccggcattc atgactaagg 4981 tcaagagtgt tagaatatcg actgtggaca agattatggc tgttaagaat gatagtcttt 5041 ctgatgtaga tttacttaaa ggtgttaagt tagttaagaa agggtatgtg tgcttagctg 5101 atttggtagt gtctggggag tggaatctcc cggataactg ccgtggtggt gtcagtgttt 5161 gtattgtaga taagagaatg aaaaggagta aggaagcaac gctgggtgcg tatcacgccc 5221 ctgcttgcaa aaagaatttt tcttttaagc taatccctaa ttattcaata acatccgagg 5281 atgctgagaa gcacccgtgg caagtgttag tgaatatcaa aggagtggct atggaagaag 5341 gatactgtcc tttatctttg gagttcgttt caatttgtgt agtacataaa aataatgtaa 5401 gaaaaggttt gagggaacgt attttgagtg tgacagacgg ctcgccaatt gaactcactg 5461 aaaaggttgt tgaggagttc gtggatgaag taccaatggc tgtgaaactc gaaaaggttc 5521 cggaaaacaa aaaagaaatg gtaggtaata atgttaataa taagaaaata aataacagtg 5581 gtaagaaggg ttttaaaatt gaggaaattg aggataatgt aagtgatgac gagtctatcg 5641 cgtcatcgag tacgttttaa tcaatatgcc ttatacaatc aactctccga gccaatttgt 5701 ttacttatct tccgcttacg cagatcctgt gcagctgatc aatctgtgta caaatgcatt 5761 gggtaaccag tttcaaacgc aacaagctag gacaacagtc caacagcaat ttgcggatgc 5821 ctggaaacct gtgcctagta tgacagtgag atttcctgca tcggatttct atgtgtatag 5881 atataattcg acgcttgatc cgttgatcac ggcgttatta aatagctttg atactagaaa 5941 tagaataata gaggttgata atcaacccgc accgaatact actgaaatcg ttaacgcgac 6001 tcagagggta gacgatgcta ctgtagctat aagggcttca atcaataatt tggctaatga 6061 actggttcgt ggaactggca tgttcaatca agcaggcttt gagactgcta gtggacttgt 6121 ctggaccaca actccggcta cttagctatt gttgtgagat ttcctaaaat aaagtcgctg 6181 aagacttaaa attcagggtg gctgatacca aaatcagcag tggttgttcg tccacttaaa 6241 tataacgatt gtcatatctg gatccaacag ttaaaccatg tgatggtgta tactgtggta 6301 tggcgtaaaa catcggagag gttcgaatcc tcccctaacc gccggtagcg gccca // LOCUS TRNTNPIA 1333 bp ds-DNA BCT 14-AUG-1990 DEFINITION Transposon Tn21 integrase (tnpI) gene, complete cds, and adenyltransferase (aadA), 5' end. ACCESSION M33633 KEYWORDS integrase. SOURCE Transposon Tn21 DNA, clone pMON90. ORGANISM Transposon Tn21 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1333) AUTHORS Mercier,J., Lachapelle,J., Couture,F., Lafond,M., Vezina,G., Boissinot,M. and Levesque,R.C. TITLE Structural and functional characterization of tnpI a recombinase locus in Tn21 and related transposons JOURNAL J. Bacteriol. 172, 3745-3757 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.C.Levesque, 06-APR-1990, for release after publication. FEATURES from to/span description ORF 209 613 potential ORF1 ORF 665 1060 potential ORF2 ORF 1150 137 (c) potential ORF3 ORF 1298 > 1333 adenyltransferase (aadA) site 1288 1291 integration site GTTA BASE COUNT 286 a 396 c 408 g 243 t ORIGIN 1 ggatccatca ggcaacgacg ggctgctgcc ggccatcagc ggacgcaggg aggactttcc 61 gcaaccggcc gttcgatgcg gcaccgatgg ccttcgcgca ggggtagtga atccgccagg 121 attgacttgc gctgccctac ctctcactag tgaggggcgg cagcgcatca agcggtgagc 181 gcactccggc accgccaact ttcagcacat gcgtgtaaat catcgtcgta gagacgtcgg 241 aatggccgag cagatcctgc acggttcgaa tgtcgtaacc gctgcggagc aaggccgtcg 301 cgaacgagtg gcggagggtg tgcggtgtgg cgggcttcgt gatgcctgct tgttctacgg 361 cacgtttgaa ggcgcgctga aaggtctggt catacatgtg atggcgacgc acgacaccgc 421 tccgtggatc ggtcgaatgc gtgtgctgcg caaaaaccca gaaccacggc caggaatgcc 481 cggcgcgcgg atacttccgc tcaagggcgt cgggaagcgc aacgccgctg cggccctcgg 541 cctggtcctt cagccaccat gcccgtgcac gcgacacgtg ctcgcgcagg ctgggtgcca 601 agctctcggg taacatcaag gcccgatcct tggagccctt gccctcccgc acgatgatcg 661 tgccgtgatc gaaatccaga tccttgaccc gcagttgcaa accctcactg atccgcatgc 721 ccgttccata cagaagctgg gcgaacaaac gatgctcgcc ttccagaaaa ccgaggatgc 781 gaaccacttc atccggggtc agcaccaccg gcaagcgccg cgacggccga ggtcttccga 841 tctcctgaag ccagggcaga tccgtgcaca gcaccttgcc gtagaagaac agcaaggccg 901 ccaatgcctg acgatgcgtg gagaccgaaa ccttgcgctc gttcgccagc caggacagaa 961 atgcctcgac ttcgctgctg cccaaggttg ccgggtgacg cacaccgtgg aaacggatga 1021 aggcacgaac ccagtggaca taagcctgtt cggttcgtaa gctgtaatgc aagtagcgta 1081 tgcgctcacg caactggtcc agaaccttga ccgaacgcag cggtggtaac ggcgcagtgg 1141 cggttttcat ggcttgttat gactgttttt tggggtacag tctatgcctc gggcatccaa 1201 gcagcaagcg cgttacgccg tgggtcgatg tttgatgtta tggagcagca acgatgttac 1261 gcagcagggc agtcgcccta aaacaaagtt aaacatcatg agggaagcgg tgatcgccga 1321 agtatcgact caa // LOCUS TTERRDA 1504 bp ss-rRNA RNA 14-AUG-1990 DEFINITION T.tenax 16S rRNA. ACCESSION M35966 KEYWORDS 16S ribosomal RNA. SOURCE T.tenax RNA. ORGANISM Thermoproteus tenax Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Thermoproteales; Thermoproteaceae. REFERENCE 1 (bases 1 to 1504) AUTHORS Leinfelder,W., Jarsch,M. and Bock,A. TITLE The phylogenetic position of the sulfur-dependent archaebacterium Thermoproteus tenax: Sequence of the 16S rRNA gene JOURNAL Syst. Appl. Microbiol. 6, 164-170 (1985) STANDARD simple staff_entry FEATURES from to/span description rRNA 1 1504 16S rRNA BASE COUNT 297 a 448 c 563 g 196 t ORIGIN 1 aaaccggttg atcctgccgg acctgaccgc tatcggggtg gggctaagcc atgcgagtcg 61 cgcgcccggg gcgccgggcg cggcgcacgg ctcagtaaca cgtacccaac ctaacctcgg 121 gagggggaca accccgggaa actggggctg atcccccata ggggaagggc gctggaaggc 181 cccttcctcc aaagggatcg cgggcgatct cccgcggtcc gcccgagggt gggggtacgg 241 cccatcaggt tgttggcggg gtaacggccc gccaagccga agacgggtag gggcggtgag 301 agccgtgagc cccgagatgg gcactgagac aagggcccag gccctacggg gtgcagcagg 361 cgcgaatact ccgcaatgcg ggcaaccgcg acggggccac cccgagtgcc gggcgaagag 421 cccggctttt gcccggtgta aggagccggg cgaataagcg gggggtaagt ctggtgtcag 481 ccgccgcggt aataccagcc ccgcgagtgg tcagggtgat tactgggctt aaagcgcccg 541 tagccggccc ggcaagtcgc tcctgaaatc cccaggctca acctgggggc agggggcgat 601 actgccgggc tagggggcgg gagaggccgc cggtactccg ggggtagggg cgaaatccta 661 taatccccgg aggaccacca gtggcgaaag cgggcggcca gaacgcgccc gacggtgagg 721 ggcgaaagcc gggggagcaa aggggattag atacccctgt agtcccggcc gtaaacgatg 781 cgggctagct gtcggccggg cttagggccc ggccggtggc gtagggaaac cgttaagccc 841 gccgcctggg gagtacggcc gcaaggctga aacttaaagg aattggcggg ggggcaccac 901 aaggggtgaa gcttgcggct taattggagt caacgccgga aaccttaccc ggggcgacag 961 caggatgaag gccaggctaa cgaccttgcc ggacgagctg agaggaggtg catggccgtc 1021 gtcagctcgt gccgtgaggt gtccggttaa gtccggcaac gagcgagacc cccaccccta 1081 gttgctaccc cgctcttcgg ggcggggggc acactagggg gactgccggc gtaagccgga 1141 ggaaggaggg ggcgacggca ggtcagtatg ccccgaaacc ccggggctgc acgcgagctg 1201 caatggcggg gacagcggga tccgaccccg aaagggggag gcaatcccgt aaaccccgcc 1261 ccagtaggga tcgagggctg caactcgccc tcgtgaacgt ggaatcccta gtaaccgcgt 1321 gtcaccaacg cgcggtgaat acgtccctgc cccttgcaca caccgcccgt cgcaccaccc 1381 gagggagttc tctgcgaggc ccctcgcttg gggcaaccca ggtgggggga cgagcagaga 1441 actcccgagg ggggtgaagt cgtaacaagg tagccgtagg ggaacctgcg gttggatcac 1501 ctcc // LOCUS VACPOLRDA 15212 bp ds-DNA VRL 14-AUG-1990 DEFINITION Vaccinia virus RNA polymerase (rpo30), polyA polymerase large subunit and DNA polymerase genes, complete cds. ACCESSION M36339 KEYWORDS DNA polymerase; RNA polymerase; polyA polymerase large subunit. SOURCE Vaccinia virus (strain WR) DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 15212) AUTHORS Gershon,P., Jones,E.V., Moss,B. and Ahn,B.Y. TITLE Eukaryotic transcriptional elongation factor homolog is an integral subunit of the vaccinia virus encoded RNA polymerase JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 2041 to 8280) AUTHORS Gershon,P., Jones,E.V., Moss,B. and Ahn,B.Y. TITLE Eukaryotic transcriptional elongation factor homolog is an integral subunit of the vaccinia virus encoded RNA polymerase JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Authorin copy for [1] kindly submitted by B.Y.Ahn, 10-JUL-1990. National Institutes of Health NIAID/LVD 9000 Rockville Pike Bethesda, Maryland 20892 FEATURES from to/span description pept 1621 182 (c) polyA polymerase large subunit pept 3831 1618 (c) ORF1 pept 4530 3958 (c) ORF2 pept 5365 4586 (c) RNA polymerase subunit (rpo30) pept 5414 6439 ORF3 pept 6576 8279 ORF4 pept 8361 8861 ORF5 pept 8986 9807 ORF6 pept 12834 9814 (c) DNA polymerase pept 12866 13153 ORF7 pept 13537 13148 (c) ORF8 BASE COUNT 4992 a 2561 c 2400 g 5259 t ORIGIN 1 aagcttcctg cgtactcaaa gtagataaac cctcatcgcc cgcgtgtgag agaagacctt 61 cgtccccgtc cagatgcgag agaatgaata accctggaaa acaagttccg tttatgagga 121 cggacatgct acaaaatatg ttcgcggcta atcgcgacaa cgtggcgtcg agacttttga 181 actaaaatac aattatatcc ttttcgatat taataaatcc gtgtcgtcca ggttttttat 241 ctctttcagt atgtgaatag ataggtattt tatctctatt catcatcgaa tttaagagat 301 ccgataaaca ttgtttgtat tctccagatg tcagcatctg atacaacaat atatgtgcac 361 ataaacctct ggcacttatt tcatgtacct tccccttatc actaaggaga atagtatttg 421 agaaatatgt atacatgata ttatcatgaa ttagatatac agaatttgta acactctcga 481 aatcacacga tgtgtcggcg ttaagatcta atatatcact cgataacaca ttttcatcta 541 gatacactag acatttttta aagctaaaat agtctttagt agtgacagta actatgcgat 601 tattttcatc gatgatacat ttcatcggca tattattacg cttaccatca aagactatac 661 catgtgtata tctaacgtat tctagcatgg ttgccatacg cgcattaaac ttttcaggat 721 ctttggatag atcttccaat ctatctattt gagaaaacat ttttatcatg ttcaatagtt 781 gaaacgtcgg atccactata tagatattat ctataaagat tttaggaact acgttcatgg 841 tatcctggcg aatattaaaa ctatcaatga tatgattatc gttttcatct tttatcacca 901 tatagtttct aagatatggg attttactta atataatatt atttcccgta ataaatttta 961 ttagaaatgc caaatctata agaaaagttc tagaattagt ctgaagaata tctatatcgc 1021 cgtatagtat atttggatta attagatata gagaatatga tccgtaacat atacaacttt 1081 tattatggcg tctaagatat tcttccatca acttattaac atttttgact agggaagata 1141 cattatgacg tcccattact tttgccttgt ctattactgc gacgttcata gaatttagca 1201 tatctcttgc caattcttcc attgatgtta cattataaga aattttagat gaaattacat 1261 ttggagcttt aatagtaaga actcctaata tgtccgtgta tgtggtcact aatacagatt 1321 gtagttctat aatcgtaaat aatttaccta tattatatgt ttgagtctgt ttagaaaagt 1381 agctaagtat acgatctttt atttctgatg cagatgtatc aacatcggaa aaaaatcttt 1441 ttttattctt ttttactaaa gatacaaata tgtctttgtt aaaaacagtt attttctgaa 1501 tatttctagc ttgtaatttt aacatatgat attcgttcac actaggtact ctgcctaaat 1561 aggtttctat aatctttaat gtaatattag gaaaagtatt ctgatcagga ttcctattca 1621 ttttgaggat ttaaaactct gattattgtc taatatggtc tctacgcaaa ctttttcaca 1681 gagcgataga gtttttgata actcgttttt cttaagaaat ataaaactac tgtctccaga 1741 gctcgctcta tcttttattt tatttaattc gatacaaact cctgatactg gttcagaaag 1801 taattcatta attttcagtc ctttatagaa gatatttaat atagataata caaaattttc 1861 agttcttgat atcgatctga ttgatcctag aactagatat attaataacg tgctcattag 1921 gcagtttatg gcagcttgat aattagatat agtatattcc agttcatatt tattagatac 1981 cgcattgccc agattttgat attctatgaa ttcctctgaa aataaatcca aaataactaa 2041 acattctatt ttttgtggat tagtgtactc tcttccctct atcatgttca ctactggtgt 2101 ccacgatgat aaatatctag agggaatata atatagtcca taggatgcca atctagcaat 2161 gtcgaataac tgtaatttga ttcttcgttc ttcattatga attgattctt gaggtataaa 2221 cctaacacaa attatattat tagacttttc gtatgtaatg tctttcatgt tataagtttt 2281 taatcctgga atagaatcta ttttaatgag gcttttaaac gcagagttct ccaacgagtc 2341 aaagcataat actctgttgt ttttcttata tacgatgtta cgattttctt ctttgaatgg 2401 aataggtttt tgaattagtt tataattaca acataataga taaggaagtg tgcaaatagt 2461 acgcggaaaa aacataatag ctcccctgtt ttcatccatg gttttaagta aatgatcact 2521 ggcttcttta gtcaatggat attcgaacat taaccgtttc atcatcattg gacagaatcc 2581 atatttctta atgtaaagag tgatcaaatc attgtgttta ttgtaccatc ttgttgtaaa 2641 tgtgtattcg gttatcggat ctgctccttt ttctattaaa gtatcgatgt cgatctcgtc 2701 taagaattca actatatcga catatttcat ttgtatacac ataaccatta ctaacgtaga 2761 atgtatagga agagatgtaa cgggaacagg gtttgttgat tcgcaaacta ttctaataca 2821 taattcttct gttaatacgt cttgcacgta atctattata gatgccaaga tatctatata 2881 attattttgt aagatgatgt taactatgtg atctatataa gtagtgtaat aattcatgta 2941 tttcgatata tgttccaact ctgtctttgt gatgtctagt ttcgtaatat ctatagcatc 3001 ctcaaaaaat atattcgcat atattcccaa gtcttcagtt ctatcttcta aaaaatcttc 3061 aacgtatgga atataataat ctattttacc tcttctgata tcattaatga tatagttttt 3121 gacactatct tctgtcaatt gattcttatt cactatatct aagaaacgga tagcgtccct 3181 aggacgaact actgccatta atatctctat tatagcttct ggacataatt catctattat 3241 accagaatta atgggaacta ttccgtatct atctaacata gttttaagaa agtcagaatc 3301 taagacctga tgttcatata ttggttcata catgaaatga tctctattga tgatagtgac 3361 tatttcattc tctgaaaatt ggtaactcat tctatatatg ctttccttgt tgatgaagga 3421 tagaatatac tcaatagaat ttgtaccaac aaactgttct cttatgaatc gtatatcatc 3481 atctgaaata atcatgtaag gcatacattt aacaattaga gacttgtctc ctgttatcaa 3541 tatactattc ttgtgataat ttatgtgtga ggcaaatttg tccacgttct ttaattttgt 3601 tatagtagat atcaaatcca atggagctac agttcttggc ttaaacagat atagtttttc 3661 tggaacgaat tctacaacat tattataaag gactttgggt agataagtgg gatgaaatcc 3721 tattttaatt aatgcgatag ccttgtcctc gtgcagatat ccaaacgctt ttgtgatagt 3781 atggcattca ttgtctagaa acgctctacg aatatctgtg acagatatca tctttagaga 3841 atatactagt cgcgttaata gtactacaat ttgtattttt taatctatct caataaaaaa 3901 attaatatgt atgattcaat gtataactaa actactaact gttattgata actagaatca 3961 gaatctaatg atgacgtaac caagaagttt atctactgcc aatttagctg cattattttt 4021 agcatctcgt ttagattttc catctgcctt atcgaatact cttccgtcga tgtctacaca 4081 ggcataaaat gtaggagagt tactaggccc cactgattca atacgaaaag accaatctct 4141 cctagtaatt tggcagtact cattaataac ggtgacaggg ttagcacctt tccaatcaat 4201 aattttttta gccggaataa catcatcaaa agacttatga tcctctctca ttgatttttc 4261 gcgggataca tcatctatta tgacgtcagc catagcatca gcatccggct tatccgcctc 4321 cgttgtcata aaccaacgag gaggaatatc gtcggagctg tacaccatag cactacgttg 4381 aagatcgtac agagctttat taacttctcg cttctccata ttaagttgtc tagttagttg 4441 tgcagcagta gctccttcga ttccaatggt tttaatagcc tcacacacaa tctctgcgtt 4501 agaacgctcg tcgatataga ttttagacat ttttagagag aactaacgca atcagtaata 4561 aaactaattt attttatcat tttttttatt catcatcctc tggtggttcg tcgtttctat 4621 cgaatgtagc tctgattaac ccgtcatcta taggtgatgc tggttctgga gattctggag 4681 gagatggatt attatctgga agaatctctg ttatttcctt gttttcatgt atcgattgcg 4741 ttgtaacatt aagattgcga aatgctctaa atttgggagg cttaaagtgt tgtttgcaat 4801 ctctacacgc gtgtctaact agtggaggtt cgtcagcggc tctagtttga atcatcatcg 4861 gcgtagtatt cctactttta cagttaggac acggtgtatt gtatttctcg tcgagaacgt 4921 taaaataatc gttgtaactc acatccttta ttttatctat attgtattct actcctttct 4981 taatgcattt tataccgaat aagagatagc gaaggaattc tttttcggtg ccgctagtac 5041 ccttaatcat atcacatagt gttttatatt ccaaatttgt ggcaatagac ggtttatttc 5101 tatacgatag tttgtttctg gaatcctttg agtattctat accaatatta ttctttgatt 5161 cgaatttagt ttcttcgata ttagattttg tattacctat attcttgatg tagtactttg 5221 atgatttttc catggcccat tctattaagt cttccaagtt ggcatcatcc acatattgtg 5281 atagtaattc tcggatatca gtagcggtta ccgccattga tgtttgttca ttggatgagt 5341 aactactaat gtatacattt tccatttata acacttatgt attaactttg ttcatttata 5401 ttttttcatt attatgttga tattaacaaa agtgaatata tatatgttaa taattgtatt 5461 gtggttatac ggctacaatt ttataattag tgaaagtcag tgtccgatga tcaatgacga 5521 tagctttact ctgaaaagaa agtatcaaat cgatagtgcg gagtcaacaa taaaaatgga 5581 taagaagagg acaaagtttc agaatagagc caaaatggta aaagaaataa atcagacaat 5641 aagagcagca caaactcatt acgagacatt gaaactagga tacataaaat ttaagagaat 5701 gattaggact actactctag aagatatagc accatctatt ccaaataatc agaaaactta 5761 taaactattc tcggacattt cagccatcgg caaagcatca cggaatccaa gtaagatggt 5821 atatgctctg ctgctttaca tgtttcccaa tttgtttgga gatgatcata gattcattcg 5881 ttatagaatg catccaatga gtaaaatcaa acacaagatc ttctctcctt tcaaacttaa 5941 tcttattaga atattagtgg aagaaagatt ctataataat gaatgcagat ctaataaatg 6001 gagaataatt ggaacacaag ttgataaaat gttgatagct gaatctgata aatatacaat 6061 agatgcaagg tataacctaa aacccatgta tagaatcaag ggaaaatctg aagaagatac 6121 cctctttatc aaacagatgg tagaacaatg tgtgacatcc caggaattgg tggaaaaagt 6181 gttgaagata ctgtttagag atttgttcaa gagtggagaa tacaaagcgt acagatacga 6241 tgatgatgta gaaaatggat ttattggatt ggatacacta aaattaaaca ttgttcatga 6301 tatagttgaa ccatgtatgc ctgttcgtag gccagtggct aagatactgt gtaaagaaat 6361 ggtaaataaa tactttgaga atccgctaca tattattggt aaaaatcttc aagagtgcat 6421 tgactttgtt agtgaatagg catttcatct ttctccaata ctaattcaaa ttgttaaatt 6481 aataatggat agtataaata gttattagtg ataaaatagt aaaaataatt attagaataa 6541 gagtgtagta tcatagataa ctctcttcta taaaaatgga ttttattcgt agaaagtatc 6601 ttatatacac agtagaaaat aatatagatt ttttaaagga tgatacatta agtaaagtaa 6661 acaattttac cctcaatcat gtactagctc tcaagtatct agttagcaat tttcctcaac 6721 atgttattac taaggatgta ttagctaata ccaatttttt tgttttcata catatggtac 6781 gatgttgtaa agtgtacgaa gcggttttac gacacgcatt tgatgcaccc acgttgtacg 6841 ttaaagcatt gactaagaat tatttatcgt ttagtaacac aatacaatcg tacaaggaaa 6901 ccgtgcataa actaacacaa gatgaaaaat ttttagaggt tgccaaatac atggacgaat 6961 taggagaact tataggcgta aattatgact tagttcttaa tccattattt cacggagggg 7021 aacccatcaa agatatggaa atcatttttt taaaactgtt taagaaaaca gacttcaaag 7081 ttgttaaaaa attaagtgtt ataagattac ttatttgggc ttacctaagc aagaaagata 7141 caggcataga gtttgcggat aatgatagac aagatatata tactctattt caacaaactg 7201 gtagaatcgt ccatagcaat ctaacagaaa cgtttagaga ttatatcttt cccggagata 7261 agactagcta ttgggtgtgg ttaaacgaaa gtatagctaa tgatgcggat attgttctta 7321 atagacacgc cattaccatg tatgataaaa ttcttagtta tatatactct gagataaaac 7381 agggacgcgt taataaaaac atgcttaagt tagtttatat ctttgagcct gaaaaagata 7441 tcagagaact tctgctagaa atcatatatg atattcctgg agatatccta tctattattg 7501 atgcaaaaaa cgacgattgg aaaaaatatt ttattagttt ttataaagct aattttatta 7561 acggtaatac atttattagt gatagaacgt ttaacgagga cttattcaga gttgttgttc 7621 aaatagatcc cgaatatttc gataatgaac gaattatgtc tttattctct acgagtgctg 7681 cggacattaa acgatttgat gagttagata ttaataacag ttatatatct aatataattt 7741 atgaggtgaa cgatatcaca ttagatacaa tggatgatat gaagaagtgt caaatcttta 7801 acgaggatac gtcgtattat gttaaggaat acaatacata cctgtttttg cacgagtcgg 7861 atcccatggt catagagaac ggaatactaa agaaactgtc atctataaaa tccaagagta 7921 gacggctgaa cttgtttagc aaaaacattt taaaatatta tttagacgga caattggctc 7981 gtctaggtct tgtgttagat gattataaag gagacttgtt agttaaaatg ataaaccatc 8041 ttaagtctgt ggaggatgta tccgcattcg ttcgattttc tacagataaa aaccctagta 8101 ttcttccatc gctaatcaaa actattttag ctagttataa tatttccatc atcgtcttat 8161 ttcaaaggtt tttaagagat aatctatatc atgtagaaga attcttggat aaaagcatcc 8221 atctaaccaa gacggataag aaatatatac ttcaattgat aagacacggt agatcataga 8281 acagaccaaa tatattatta ataatttgta tatacataga tataattatc acatattaaa 8341 aattcacaca tttttgataa atgggaactg ctgcaacaat tcagactccc accaaattaa 8401 tgaataaaga aaatgcagaa atgattttgg aaaaaattgt tgatcatata gttatgtata 8461 ttagtgacga atcaagtgat tcagaaaata atcctgaata tattgatttt cgtaacagat 8521 acgaagacta tagatctctc attataaaaa gtgatcacga gtttgtaaag ctatgtaaaa 8581 atcatgcaga gaaaagttct ccagaaacgc aacaaatgat tatcaaacac atatacgaac 8641 aatatcttat tccagtatct gaagtactat taaaacctat aatgtccatg ggtgacataa 8701 ttacatataa cggatgtaaa gacaatgaat ggatgctaga acaactctct accctaaact 8761 ttaacaatct ccgcacatgg aactcatgta gcataggcaa tgtaacgcgt ctgttttata 8821 cattttttag ttatctgatg aaagataaac taaatatata agtataatcc cattctaata 8881 ctttaacctg atgtattagc atcttattag aatattaacc taactaaaag acataacata 8941 aaaactcatt acatagttga taaaaagcgg taggatataa atattatggc tgccaccgtt 9001 ccgcgttttg acgacgtgta caaaaatgca caaagaagaa ttctagatca agaaacattt 9061 tttagtagag gtctaagtag accgttaatg aaaaacacat atctatttga taattacgcg 9121 tatggatgga taccagaaac tgcaatttgg agtagtagat acgcaaactt agatgcaagt 9181 gactattatc ccatttcgtt gggattactt aaaaagttcg agtttctcat gtctctatat 9241 aaaggtccta ttccagtata cgaagaaaaa gtaaatactg aattcattgc taatggatcg 9301 ttctctggta gatacgtatc atatcttcga aagttttctg ctcttccaac aaacgagttt 9361 attagttttt tgttactgac ttccattcca atctataata tcttgttctg gtttaaaaat 9421 actcagtttg atattactaa acacacatta ttcagatacg tctatacaga taatgccaaa 9481 cacctggcgt tggctaggta tatgcatcaa acaggagact ataagccttt gtttagtcgt 9541 ctcaaagaga attatatatt taccggtccc gttccaatag gtatcaaaga tataaatcac 9601 cctaatctta gtagagcaag aagtccatcc gattatgaga cattagctaa tattagtact 9661 atattgtact ttaccaagta tgatccggta ttaatgtttt tattgtttta cgtacctggg 9721 tattcaatta ctacaaaaat tactccagcc gtagaatatc taatggataa actgaatcta 9781 acaaagagcg acgtacaact gttgtaaatt attttatgct tcgtaaaatg taggttttga 9841 accaaacatt ctttcaaaga atgagatgca taaaacttta ttatccaata gattgactat 9901 ttcggacgtc aatcgtttaa agtaaacttc gtaaaatatt ctttgatcac tgccgagttt 9961 aaaacttcta tcgataattg tttcatatgt tttaatattt acaagttttt tggtccatgg 10021 tacattagcc ggacaaatat atgcaaaata atatcgttct ccaagttcta tagtttctgg 10081 attattttta ttatattcag taaccaaata catattaggg ttatctgcgg atttataatt 10141 tgagtgatgc attcgactca acataaataa ttctagagga gacgatctac tatcaaattc 10201 ggatcgtaaa tctgtttcta aagaacggag aatatctata catacctgat tagaattcat 10261 ccgtccttca gacaacatct cagacagtct ggtcttgtat gtcttaatca tattcttatg 10321 aaacttggaa acatctcttc tagtttcact agtaccttta ttaattctct caggtacaga 10381 ttttgaattc gacgatgccg agtatttcat cgttgtatat ttcttcttcg attgcataat 10441 cagattctta tataccgcct caaactctat tttaaaatta ttaaacaata ctctattatt 10501 aatcagtcgt tctaactcct ttgctatttc tatggactta tctacatctt gactgtctat 10561 ctctgtaaac acggagtcgg tatctccata cacgctacga aaacgaaatc tgtaatctat 10621 aggcaacgat gttttcacaa tcggattaat atctctatcg tccatataaa atggattact 10681 taatggattg gcaaaccgta acataccgtt agataactct gctccattta gtaccgattc 10741 tagatacaag atcattctac gtcctatgga tgtgcaactc ttagccgaag cgtatgagta 10801 tagagcacta tttctaaatc ccatcagacc atatactgag ttggctacta tcttgtacgt 10861 atattgcatg gaatcataga tggccttttc agttgaactg gtagcctgtt ttaacatctt 10921 tttatatctg gctctctctg ccaaaaatgt tcttaatagt ctaggaatgg ttccttctat 10981 cgatctatcg aaaattgcta tttcagagat gaggttcggt agtctaggtt cacaatgaac 11041 cgtaatatat ctaggaggtg gatatttctg aagcaagagc tgattattta tttcttcttc 11101 caatctattg gtactaacaa cgacaccgac taatgtttcc ggagatagat ttccaaagat 11161 acacacatta ggatacagac tgttataatc aaagattaat acattattac taaacatttt 11221 ttgttttgga gcaaatacct taccgccttc ataaggaaac ttttgttttg tttctgatct 11281 aactaagata gttttagttt ccaacaatag ctttaacagt ggacccttga tgactgtact 11341 cgctctatat tcgaatacca tggattgagg aagcacatat gttgacgcac ccgcgtctgt 11401 ttttgtttct actccataat actcccacaa atactgacac aaacaagcat catgaataca 11461 gtatctagcc atatctaaag ctatgtttag attataatcc ttatacatct gagctaaatc 11521 aacgtcatcc tttccgaaag ataatttata tgtatcatta ggtaaagtag gacataatag 11581 tacgacttta aatccatttt cccaaatatc tttacgaatt actttacata taatatcctc 11641 atcaacagtc acataattac ctgtggttaa aacctttgca aatgcagcgg ctttgccttt 11701 cgcgtccgta gtatcgtcac cgatgaacgt catttctcta actcctctat ttaatacttt 11761 acccatgcaa ctgaacgcgt tcttggatat agaatccaat ttgtacgaat ccaatttttc 11821 agatttttga atgaatgaat atagatcgaa aaatatagtt ccattattgt tattaacgtg 11881 aaacgtagta ttggccatgc cgcctactcc cttatgacta gactgatttc tctcataaat 11941 acagagatat acagcttcct ttttgtccgg agatctaaag ataatcttct ctcctgttaa 12001 taactctaga cgattagtaa tatatctcag atcaaagtta tgtccgttaa aggtaacgac 12061 atagtcgaac gttagttcca acaattgttt agctattcgt aacaaaacta tttcagaaca 12121 taaaactagt tctcgttcgt aatccatttc cattagtgac tgtatcctca aacatcctct 12181 atcgacggct tcttgtattt cctgttccgt taacatctct tcattaatga gcgtaaacaa 12241 taatcgttta ccacttaaat cgatataaca gtaacttgta tgcgagattg ggttaataaa 12301 tacagaagga aacttcttat cgaagtgaca ctctatatct agaaataagt acgatcttgg 12361 gatatcgaat ctaggtattt ttttagcgaa acagttacgt ggatcgtcac aatgataaca 12421 tccattgtta atctttgtca aatattgctc gtccaacgag taacatccgt ctggagatat 12481 cccgttagaa atataaaacc aactaatatt gagaaattca tccatggtgg cattttgtat 12541 gctgcgtttc tttggctctt ctatcaacca catatctgcg acggagcatt ttctatcttt 12601 aatatctaga ttataactta ttgtctcgtc aatgtctata gttctcatct ttcccaacgg 12661 cctcgcatta aatggaggag gagacaatga ctgatatatt tcgtccgtca ctacgtaata 12721 aaagtaatga ggaaatcgta taaatacggt ctcaccattt cgacatctgg atttcagata 12781 taaaaatctg ttttcaccgt gactttcaaa ccaattaatg caccgaacat ccatttatag 12841 aatttagaaa tatattttca tttaaatgaa tcccaaacat tggggaagag ccgtatggac 12901 cattattttt atagtacttt cgcaagcggg tttagacggc aacatagaag cgtgtaaacg 12961 aaaactatat actatagtta gcactcttcc atgtcctgca tgtagacggc acgcgactat 13021 cgctatagag gacaataatg tcatgtctag cgatgatctg aattatattt attatttttt 13081 catcagatta tttaacaatt tggcatctga tcccaaatac gcgatcgatg tgacaaaggt 13141 taacccttta taaacttaac ccattataaa acttatgatt agtcacgact gaaataaccg 13201 cgtgattatt ttttggtata attctacacg gcatggtttc tgtgactatg aattcaaccc 13261 ccgttacatt agtgaaatct ttaacaaaca gcaagggttc gtcaaagaca taaaactcat 13321 tgtttacaat cgaaatagac cccctatcac acttaaaata aaaaatatcc ttatccttta 13381 ccaccaaata aaattctgat tggtcaatgt gaatgtattc acttaacagt tccacaaatt 13441 tatttattaa ctccgaggca catacatcgt cggtattttt tatggcaaac tttactcttc 13501 cagcatccgt ttctaaaaaa atattaacga gttccattta tatcatccaa tattattgaa 13561 atgacgttga tggacaaatg atacaaataa gaaggtacgg tacctttgtc caccatctcc 13621 tccaattcat gctctatttt gtcattaact ttaatgtatg aaaacagtac gccacatgct 13681 tccatgacag tgtgtaacac tttggataca aaatgtttga cattagtata attgttcaag 13741 actgtcaatc tataatagat agtagctata atatattcta tgatggtatt gaagaagatg 13801 acaaccttgg catattgatc atttaacaca gacatggtat caacagatag cttgaatgaa 13861 agagaatcag taattggaat aagcgtcttc tcgatggagt gtccgtatac caacatgtct 13921 gatattttga tgtattccat taaattattt agttttttct ttttattctc gttaaacagc 13981 atttctgtca acggacccca acatcgttga ccgattaagt tttgattgat ttttccgtgt 14041 aaggcgtatc tagtcagatc gtatagccta tccaataatc catcgtctgt gtgtagatca 14101 catcgtacac tttttaattc tctatagaag agcgacagac atctggagca attacagaca 14161 gcaatttctt tattctctac agatgtaaga tacttgaaga cattcctatg atgatgcaga 14221 attttggata acacggtatt gatggtatct gttaccataa ttcctttgat ggctgatagt 14281 gtcagagcac aagatttcca atctttgaca atttttagca ccattatctt tgttttgata 14341 tctatatcag acagcatggt gcgtctgaca acacaaggat taagacggaa agatgaaatg 14401 attctctcaa catcttcaat ggataccttg ctattttttc tggcattatc tatatgtgcg 14461 agaatatcct ctagagaatc agtatccttt ttgatgatag tggatctcaa tgacatggga 14521 cgtctaaacc ttcttattct atcaccagat tgcatggtga tttgtcttct ttcttttatc 14581 ataatgtaat ctctaaattc atcggcaaat tgtctatatc taaaatcata atatgagatg 14641 tttacctcta caaatatctg ttcgtccaat gttagagtat ttacatcagt tttgtattcc 14701 aaattaaaca tggcaacgga tttaatttta tattcctcta ttaagtcctc gtcgataata 14761 acagaatgta gataatcatt taatccatcg tacatggttg gaagatgctt gttgacaaaa 14821 tctttaattg tcttgatgaa ggtgggacta tatctaacat cttgattaat aaaatttata 14881 acattgtcca taggatactt tgtaactagt tttatacaca tctcttcatc ggtaagccca 14941 gacagaatat cgtgaacagg tggtatatta tattcatcag atatacgaag aacaatgtcc 15001 aaatctatat tgtttaatat attatataga tgtagtgtag ctcctacagg aatatcttta 15061 actaagtcaa tgatttcatc aaccgttaga tctattttaa agttaatcat ataggcattg 15121 atttttaaaa ggtatgtagc cttgactaca ttctcattaa ttaaccattc caagtcactg 15181 tgtgtaagaa gattatattc tatcataagc tt // LOCUS YSCERD2A 3327 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae ER lumen protein retaining receptor (ERD2) gene, complete cds. ACCESSION M34777 KEYWORDS ER lumen protein retaining receptor; HDEL receptor. SOURCE S.cerevisiae (strain SEY2102) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3327) AUTHORS Semenza,J.C., Hardwick,K.G., Dean,N. and Pelham,H.R.B. TITLE ERD2, a yeast gene required for the receptor-mediated retrieval of luminal ER proteins from the secretory pathway JOURNAL Cell 61, 1349-1357 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Semenza, 01-JUN-1990, for release after publication. FEATURES from to/span description pept 1337 1358 ER lumen protein retaining receptor, exon 1 (ERB2) 1456 2093 ER lumen protein retaining receptor, exon 2 ORF 2492 2960 ORF1 IVS 1359 1455 ERB2 intron A site 1359 1364 consensus splice donor site 1416 1422 branchpoint sequence site 1453 1455 acceptor sequence BASE COUNT 996 a 639 c 574 g 1118 t ORIGIN 1 aagcttcctg acggaatgtt aattctcgtt gaccctgagc aggctgttga gcccggggat 61 ttctgcatag ccagacttgg gggtgatgag tttaccttca agaaactgat ccatccaaac 121 catttttggg tttagttgct gcttccgctg gtatccttca agatgtcatt gaaggtaagt 181 acgatcttga ggccggcgaa aacaaattca acttttaaat caaaatttta gaataaaagt 241 ctactgcatt aacaaagata aatttctaat ttttttatgt atattatatt ttttggaaga 301 gaaaatattt gtagtgcttc tctacactgt tttttttttt tcgttcatgt tgtctttcct 361 ctcctgaata tatttaattt aataaatcta tcttatctga catatagaag ctgtttcgat 421 agagttttcg actggccaac agttgttact gactggataa aaatttacga tttttcctcc 481 aatcttggct ttcaaaatga gaaataagag tcttgaaatt caaagaatgt atctatgtat 541 gaatatagaa tatgctcaaa tatttagttt tgtataagga agttctaaat aattttccat 601 ggcgtgcttt taccaaagta ctgaacaggg agattattta tcattttttt tcttactctt 661 cgagttgaag tcgattgtag gtgatgacat gaacattaaa taaaatgtac ttgaacaaat 721 cagaccagca aacacccaca tcgcaatggt actgggtttt ctcgttattc tgtggaagaa 781 ttcaattacc ggattattgt taaaaaagcc cattagaagt atcctaacga ttgattaaag 841 taaattgtta ttcacgctac tatcgcaatc gtccaactga accttattct ttattgtata 901 taatctctgc cgagaaattc ttatacttat gtatctggga attcaccagg ttagaatacc 961 ttcccctcct caatttctaa tttttttttc tacccttatt tttctctctt gcttcatcgc 1021 tctcaaagct aaaagctaat acagcaaaca aaagtgttat taggaattca attgagtcgt 1081 gaaagaaaaa taatattcgt cttacttgag gactcacttt ggtcaattga gattaaagtg 1141 tagtctgcac cactgaattc ctcgttttcc tttaaagatc cgggatggct tatctgcggt 1201 acccggactg aaaacgacgt tttagctaaa tattgaacac gttaaaaata agttcaaatt 1261 tcgaggcaga tgaatttgct gagggcactg ttcaagagtg agatatacgc gcaagttgag 1321 gcataatcag taagcaatga atccgtttag aatcttaggt atgttactat ttggagtttc 1381 atgaggcttt tcccgccgta gatcgaaccc aatcttacta acagagaaag ggctttttcc 1441 cgaccatcaa gacaggtgat ttatcacatc taaccagtat actgatcctg attcataata 1501 tcaagaccac aaggtacatt gaaggtattt ctttcaagac ccaaacgttg tacgctttgg 1561 ttttcataac acgatacttg gatctcttga cttttcactg ggtatcccta tacaatgctc 1621 taatgaaaat atttttcatt gtatctaccg cttacattgt agtgctatta caagggtcta 1681 aaagaaccaa caccattgcg tataatgaaa tgcttatgca tgataccttt aagatccagc 1741 atttactaat tgggagtgct ctaatgagtg tttttttcca tcacaagttc acttttcttg 1801 aattagcatg gagtttttct gtatggttgg agagtgtggc tattctacct caattgtaca 1861 tgctatctaa gggtgggaag actagaagtc taactgttca ttatattttt gccatgggat 1921 tatacagagc attgtatatt cctaactgga tttggaggta cagcacggaa gataaaaaat 1981 tggacaagat tgccttcttc gcgggacttt tgcaaactct gttgtactct gatttctttt 2041 acatttacta cactaaagtc atcagaggaa agggtttcaa actgccaaaa taaaaaaaaa 2101 acctgtttta aatagtgttc gcagcaatat atacaaattt caatatatca atgctatacc 2161 aaaagaaaaa aaaaatacat aaaattgatg ttgatattgg tctatttctt tttatactat 2221 gatatgtatg cattaatctc tttttagctc ataaaattct ttccttacac catccttggt 2281 gacaataagg atttccagcc catcacccac ttgtatatgt ctttctgtag cagaagtgaa 2341 cgagtctctc accagtttga tgacttcttc cacggacaag tatttcaaag gctttttgac 2401 tttaccgttt gtacctggct catattgaat ttttgaaatt aacctgattg tccaaaaatg 2461 gcatgatcaa tgatgccgca gcaccacctg ctctacactg ttctctttcg taggagccaa 2521 actgggtcga acgaatagac agcgccctta ccatcttcgt caagacccgc aatgatcgta 2581 tgaacgtagt aagggaaaaa cctcttcccg tacagaagat gttgaatgtt ccttgctgca 2641 gagtttatag atagtttttt gtcgttgtgg tcgaaatggt accattttac actatttttg 2701 aatcttttta ctaaagcgtc gccgtctgct gcaaatccat tcgccgacat gactatgtta 2761 tcaccacaat caaaaacctt gggttcataa cgagaattaa ttgagtaatc ggtgatattt 2821 cttgtatcgc ctgctaacac tgcgaaatct tcacctgcaa tgcccaggat tgtaccacca 2881 ttatcaccgt aaggattgaa ttgatgttca atgggtgtat ttgacgcctc cgaagagtat 2941 tctgatgcaa tagtggccat attttgttta actttatagt tcaatagtct tggctactct 3001 ctttccaact cagttcacct tgttattata ccgcttgttt ttgccaccct ttgagtttct 3061 cgatccttta agttggaaaa gataaaaaca attatcgctc cgtacgaaca ccaaatctct 3121 gcgacgcaga gacggcaata caaaatcaga aaaaaaaaac aatttatcct atgcttcact 3181 agtcacaggc tctacgaaac attctatcgc ctcactgcct ttacatcact taagcaagaa 3241 gttttaaata ttacggcaat tcctttgcag caataccttt actttatttc gagagaaaat 3301 agatcgtaaa atgcctactc caagctt // LOCUS YSCLYS2A 4980 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae alpha-aminoadipate reductase (LYS2) gene, complete cds. ACCESSION M36287 M14967 M14968 KEYWORDS LYS2 gene; alpha-aminoadipate reductase. SOURCE S.cerevisiae (strain S288C) DNA, clone pDP6. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 746 and 3921 to 4980) AUTHORS Fleig,U.N., Pridmore,R.D. and Philippsen,P. TITLE Construction of LYS2 cartridges for use in genetic manipulations of Saccharomyces cerevisiae JOURNAL Gene 46, 237-245 (1986) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 4632) AUTHORS Morris,M.E. and Jinks-Robertson,S. TITLE Nucleotide sequence of the LYS2 gene of Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.Jinks-Robertson, 10-JUL-1990. Emory University Dept. of Biology 1510 Clifton Road N.E. Atlanta, GA 30322 FEATURES from to/span description pept 297 4475 alpha-aminoadipate reductase (LYS2; EC 1.2.1.31) BASE COUNT 1523 a 942 c 1026 g 1489 t ORIGIN Chromosome II. 1 tctagaggca tcgcacagtt ttagcgagga aaactcttca atagttttgc cagcggaatt 61 ccacttgcaa ttacataaaa aattccggcg gtttttcgcg tgtgactcaa tgtcgaaata 121 cctgcctaat gaacatgaac atcgcccaaa tgtatttgaa gacccgctgg gagaagttca 181 agatatataa gtaacaagca gccaatagta taaaaaaaaa tctgagttta ttacctttcc 241 tggaatttca gtgaaaaact gctaattata gagagatatc acagagttac tcactaatga 301 ctaacgaaaa ggtctggata gagaagttgg ataatccaac tctttcagtg ttaccacatg 361 actttttacg cccacaacaa gaaccttata cgaaacaagc tacatattcg ttacagctac 421 ctcagctcga tgtgcctcat gatagttttt ctaacaaata cgctgtcgct ttgagtgtat 481 gggctgcatt gatatataga gtaaccggtg acgatgatat tgttctttat attgcgaata 541 acaaaatctt aagattcaat attcaaccaa cgtggtcatt taatgagctg tattctacaa 601 ttaacaatga gttgaacaag ctcaattcta ttgaggccaa tttttccttt gacgagctag 661 ctgaaaaaat tcaaagttgc caagatctgg aaaggacccc tcagttgttc cgtttggcct 721 ttttggaaaa ccaagatttc aaattagacg agttcaagca tcatttagtg gactttgctt 781 tgaatttgga taccagtaat aatgcgcatg ttttgaactt aatttataac agcttactgt 841 attcgaatga aagagtaacc attgttgcgg accaatttac tcaatatttg actgctgcgc 901 taagcgatcc atccaattgc ataactaaaa tctctctgat caccgcatca tccaaggata 961 gtttacctga tccaactaag aacttgggct ggtgcgattt cgtggggtgt attcacgaca 1021 ttttccagga caatgctgaa gccttcccag agagaacctg tgttgtggag actccaacac 1081 taaattccga caagtcccgt tctttcactt atcgcgacat caaccgcact tctaacatag 1141 ttgcccatta tttgattaaa acaggtatca aaagaggtga tgtagtgatg atctattctt 1201 ctaggggtgt ggatttgatg gtatgtgtga tgggtgtctt gaaagccggc gcaacctttt 1261 cagttatcga ccctgcatat cccccagcca gacaaaccat ttacttaggt gttgctaaac 1321 cacgtgggtt gattgttatt agagctgctg gacaattgga tcaactagta gaagattaca 1381 tcaatgatga attggagatt gtttcaagaa tcaattccat cgctattcaa gaaaatggta 1441 ccattgaagg tggcaaattg gacaatggcg aggatgtttt ggctccatat gatcactaca 1501 aagacaccag aacaggtgtt gtagttggac cagattccaa cccaacccta tctttcacat 1561 ctggttccga aggtattcct aagggtgttc ttggtagaca tttttccttg gcttattatt 1621 tcaattggat gtccaaaagg ttcaacttaa cagaaaatga taaattcaca atgctgagcg 1681 gtattgcaca tgatccaatt caaagagata tgtttacacc attattttta ggtgcccaat 1741 tgtatgtccc tactcaagat gatattggta caccgggccg tttagcggaa tggatgagta 1801 agtatggttg cacagttacc catttaacac ctgccatggg tcaattactt actgcccaag 1861 ctactacacc attccctaag ttacatcatg cgttctttgt gggtgacatt ttaacaaaac 1921 gtgattgtct gaggttacaa accttggcag aaaattgccg tattgttaat atgtacggta 1981 ccactgaaac acagcgtgca gtttcttatt tcgaagttaa atcaaaaaat gacgatccaa 2041 actttttgaa aaaattgaaa gatgtcatgc ctgctggtaa aggtatgttg aacgttcagc 2101 tactagttgt taacaggaac gatcgtactc aaatatgtgg tattggcgaa ataggtgaga 2161 tttatgttcg tgcaggtggt ttggccgaag gttatagagg attaccagaa ttgaataaag 2221 aaaaatttgt gaacaactgg tttgttgaaa aagatcactg gaattatttg gataaggata 2281 atggtgaacc ttggagacaa ttctggttag gtccaagaga tagattgtac agaacgggtg 2341 atttaggtcg ttatctacca aacggtgact gtgaatgttg cggtagggct gatgatcaag 2401 ttaaaattcg tgggttcaga atcgaattag gagaaataga tacgcacatt tcccaacatc 2461 cattggtaag agaaaacatt actttagttc gcaaaaatgc cgacaatgag ccaacattga 2521 tcacatttat ggtcccaaga tttgacaagc cagatgactt gtctaagttc caaagtgatg 2581 ttccaaagga ggttgaaact gaccctatag ttaagggctt aatcggttac catcttttat 2641 ccaaggacat caggactttc ttaaagaaaa gattggctag ctatgctatg ccttccttga 2701 ttgtggttat ggataaacta ccattgaatc caaatggtaa agttgataag cctaaacttc 2761 aattcccaac tcccaagcaa ttaaatttgg tagctgaaaa tacagtttct gaaactgacg 2821 actctcagtt taccaatgtt gagcgcgagg ttagagactt atggttaagt atattaccta 2881 ccaagccagc atctgtatca ccagatgatt cgtttttcga tttaggtggt cattctatct 2941 tggctaccaa aatgattttt accttaaaga aaaagctgca agttgattta ccattgggca 3001 caattttcaa gtatccaacg ataaaggcct ttgccgcgga aattgacaga attaaatcat 3061 cgggtggatc atctcaaggt gaggtcgtcg aaaatgtcac tgcaaattat gcggaagacg 3121 ccaagaaatt ggttgagacg ctaccaagtt cgtacccctc tcgagaatat tttgttgaac 3181 ctaatagtgc cgaaggaaaa acaacaatta atgtgtttgt taccggtgtc acaggatttc 3241 tgggctccta catccttgca gatttgttag gacgttctcc aaagaactac agtttcaaag 3301 tgtttgccca cgtcagggcc aaggatgaag aagctgcatt tgcaagatta caaaaggcag 3361 gtatcaccta tggtacttgg aacgaaaaat ttgcctcaaa tattaaagtt gtattaggcg 3421 atttatctaa aagccaattt ggtctttcag atgagaagtg gatggatttg gcaaacacag 3481 ttgatataat tatccataat ggtgcgttag ttcactgggt ttatccatat gccaaattga 3541 gggatccaaa tgttatttca actatcaatg ttatgagctt agccgccgtc ggcaagccaa 3601 agttctttga ctttgtttcc tccacttcta ctcttgacac tgaatactac tttaatttgt 3661 cagataaact tgttagcgaa gggaagccag gcattttaga atcagacgat ttaatgaact 3721 ctgcaagcgg gctcactggt ggatatggtc agtccaaatg ggctgctgag tacatcatta 3781 gacgtgcagg tgaaaggggc ctacgtgggt gtattgtcag accaggttac gtaacaggtg 3841 cctctgccaa tggttcttca aacacagatg atttcttatt gagatttttg aaaggttcag 3901 tccaattagg taagattcca gatatcgaaa attccgtgaa tatggttcca gtagatcatg 3961 ttgctcgtgt tgttgttgct acgtctttga atcctcccaa agaaaatgaa ttggccgttg 4021 ctcaagtaac gggtcaccca agaatattat tcaaagacta cttgtatact ttacacgatt 4081 atggttacga tgtcgaaatc gaaagctatt ctaaatggaa gaaatcattg gaggcgtctg 4141 ttattgacag gaatgaagaa aatgcgttgt atcctttgct acacatggtc ttagacaact 4201 tacctgaaag taccaaagct ccggaactag acgataggaa cgccgtggca tctttaaaga 4261 aagacaccgc atggacaggt gttgattggt ctaatggaat aggtgttact ccagaagagg 4321 ttggtatata tattgcattt ttaaacaagg ttggattttt acctccacca actcataatg 4381 acaaacttcc actgccaagt atagaactaa ctcaagcgca aataagtcta gttgcttcag 4441 gtgctggtgc tcgtggaagc tccgcagcag cttaaggttg agcattacgt atgatatgtc 4501 catgtacaat aattaaatat gaattaggag aaagacttag cttcttttcg ggtgatgtca 4561 cttaaaaact ccgagaataa tatataataa gagaataaaa tattagttat tgaataagaa 4621 ctgtaaatca gctggcgtta gtctgctaat ggcagcttca tcttggttta ttgtagcatg 4681 aatcatattt gccttttttt cctgtaattc aatgattctt gcttctatac tatcctcaat 4741 gcaaaacctt gtgatcttca caggtcgata ctgaccaatt ctatgaactc tatcaccact 4801 ttgccattca acactagggt tccaccatgg gtctaaaatg aatacttgcg aagcttcaca 4861 aagattcaaa gcaacaccgc ccgcctttaa actgaccaag aaaacctcgc attgaatgtt 4921 gttcatgaaa tacttgatgg tttcatctct ttgcgtcggt gacatactac cctgaagctt // LOCUS YSCMTATPSA 2820 bp ds-DNA ORG 14-AUG-1990 DEFINITION Yeast mitochondrial oxi3 gene, 3' end, ATPase subunit 3 (aap1) and ATPase subunit 6 (oli2) gene, complete cds. ACCESSION M36379 KEYWORDS oli2 region. SOURCE Yeast (strain J69-1B) mitochondrial DNA, clone. ORGANISM Mitochondrion Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae; Saccharomyces cerevisiae. REFERENCE 1 (bases 1 to 2820) AUTHORS Novitski,C.E., Macreadie,I.G., Maxwell,R.J., Lukins,H.B., Linnane,A.W. and Nagley,P. TITLE Biogenesis of mitochondria: Genetic and molecular analysis of the oli2 region of mitochondrial DNA in Saccharomyces cerevisiae JOURNAL Curr. Genet. 8, 135-146 (1984) STANDARD simple staff_entry FEATURES from to/span description pept < 1 63 cytochrome c oxidase subunit I (oxi3) pept 1019 1165 ATPase subunit 3 (aap1) pept 1872 2651 ATPase subunit 6 (oli2) binding 701 709 ribosomal binding site C (put.) binding 1814 1821 ribosomal binding site A (put.) binding 1835 1845 ribosomal binding site B (put.) BASE COUNT 1159 a 200 c 183 g 1278 t ORIGIN 1 gaattcttat taacttctcc accagctgta cactcattta atacaccagc tgtacaatct 61 taagttataa tatttaatta tttacttaat aattaaaaaa aaaagtaaat attatatcta 121 aaacttaata atataataat aatattctta taaaaaaaaa tatataaaaa aaaaatatat 181 aaaatttatt aaaatatctt aatatattat cctttttggg aactatataa ataaatacta 241 atataatcct attatatata tatatatata aaataatata tatatataat taatataaat 301 aatatttata attatttttt taataatata tataatttaa tatattaatt aatattatat 361 aattattaaa tatattataa tattattatt attttataat aaaaatattt ttaatactaa 421 ttattattta ttatttataa atatataaat agtatgttta atattattaa tactaaaaaa 481 aagtataatt ataattagga tctacaatac atttatctga taaatattaa tattaatatt 541 aatatttata ttaataaagg gattaaatta attgtatcca atttaattaa attatagata 601 tattatttat aatattaata tattgtttta ttaaaaaggt aaaaatagtt tttattttat 661 atataaatat aggatataaa taaatatatt atatatatat taagaatata tttatatttt 721 acatataatt atttataata taaatatctc cgcaaagccg gattaatgta attatttaat 781 aattttattt aataatttat taaaataaat atttacattt aataatattt atattatgtc 841 agttatttta tattaatgtt taatctatta taatattttt ttttataaat atattattta 901 tattaataaa tatatatatt atttttataa tatatatata tttttattaa atatttatta 961 aatatttatt aaattattat aatgttgtta ttaatcttat taaaaaatat atataaaaat 1021 gccacaatta gttccatttt attttatgaa tcaattaata tatggtttct tattaatgat 1081 tctattatta attttattct cacaattctt tttacctatg atcttaagat tatatgtatc 1141 tagattattt atttctaaat tataatatat attattaatt tatttattca tataaatatt 1201 attattatat ataaatatta ataatattta tacttattta ataataataa taaaataaaa 1261 aaaataatta taatttaata tatttaatat atttccttac agactatata tttatatata 1321 tatattaaat acaatttaat ttaatttaat tatgttattt attaaataaa gttatattat 1381 aatataataa caatattata tattattata taattataat atattttaat ataattatca 1441 aaagaaataa taaaaaatat taataagaat ataatttaat aattattaaa aaaattctta 1501 tttatagtcc ggtccgcccc cgcggggcgg accccaaagg aggagtaata aaaattatta 1561 aatacaaata ttatatatat ataattcatt atatatatat atatataata attaatctta 1621 ttttttatat atttatttat atatctattt atatttatat tttatatata tttatttata 1681 tatctaaggg gtttggtccc tcccccgtaa gtataagtat acgggggagg gtccctcact 1741 atttatattt atatttatat ttatatttat atttatattt tatatatttt atatatttat 1801 aaataaagta taataagata taattatgat taattattta taagttatag ttttataaat 1861 ttataattat tatgtttaat ttattaaata catatattac atcaccatta gatcaatttg 1921 agattagact attatttggt ttacaatcat catttattga tttaagttgt ttaaatttaa 1981 caacattttc attatatact attattgtat tattagttat tacaagttta tatctattaa 2041 ctaataataa taataaaatt attggttcaa gatgattaat ttcacaagaa gctatttatg 2101 atactattat aaatatgctt aaaggacaaa ttggaggtaa aaattgaggt ttatatttcc 2161 ctatgatctt tacattattt atgtttattt ttattgctaa tttaattagt atgattccat 2221 attcatttgc attatcagct catttagtat ttattatctc tttaagtatt gttatttgat 2281 taggtaatac tattttaggt ttatataaac atggttgagt attcttctca ttattcgtac 2341 ctgctggtac accattacca ttagtacctt tattagttat tattgaaact ttatcttata 2401 ttgctagagc tatttcatta ggtttaagat taggttctaa tatcttagct ggtcatttat 2461 taatggttat tttagctggt ttactattta attttatgtt aattaattta tttactttag 2521 tattcggttt tgtaccttta gctatgatct tagctattat gattttagaa ttcgctattg 2581 gtattatcca atcttatgtt tgacttatct taacagcatc atacttaaaa gatacattat 2641 acttacatta aattataaaa taaaaattat ataaaaaaaa tataataata ataataatat 2701 aaaataaaaa ataaaaaata aaacaatgaa aaaacaaaat ttaaattcta ttttattaat 2761 gtatattaat tatattatta attattttaa taatattcat aaaaatcaat taaaaaaaga // LOCUS YSCSPO7A 1846 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae SPO7 gene, complete cds, and FUN14 and FUN37 genes, 5' end. ACCESSION M36073 KEYWORDS antimutator phenotype; premeiotic DNA replication. SOURCE S.cerevisiae (strain AB972) DNA, clone lambda-G12b and pLF31. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1846) AUTHORS Whyte,W., Koepp,L.H., Lamb,J., Crowley,J.C. and Kaback,D.B. TITLE Molecular cloning of chi I DNA from S.cerevisiae JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.B.Kaback, 05-JUL-1990. FEATURES from to/span description pept 564 1343 SPO7 protein pept 1624 > 1832 FUN14 protein pept 375 < 1 (c) FUN37 protein BASE COUNT 556 a 308 c 415 g 567 t ORIGIN Chromosome I. 1 gtcgactgtg gtgttgtcac tactcaacgt attcgcacta ctaacactga aattgaggtt 61 tggttgcaat tgtctgtatg tttcggtggc atcttgtaat gggatatcag tagagttgcg 121 catgaatttc tccaattgct gtgcatcgga gtataaataa ctcagagaac catttatcct 181 ggacctcgta gaaaaatcta aagaattgaa tgtattggga gtagatttgt tggaaatttg 241 caggtgtatt gctgagggaa ttcggaaatc taataatgtt ctcgatgtgg ccgttatatc 301 ctcgtagcta ttttgcgtac tccaatgggt gctctgataa aatgccctta gtacttggtc 361 catatagggt agcatcaaga tcggtcttct ctgttcgtgt ctttttccta acgtatattt 421 gctttgtttc ttcactcaac aataaagtca aagtaaaatt aaatactaat tattcttaaa 481 agggaagatg cgaaatttag cgaaaatcta ttgattatac acacaaagga agaaaggtag 541 tggaaagcta aataaaggag gtcatggagc cagagagcat aggcgatgtg gggaaccatg 601 cccaggatga tagtgccagt atagtgtccg ggcctcgcag gcgttctact agcaagacat 661 ccagtgcgaa gaatatacgg aactccagta atatctctcc agcatcgatg attttcagga 721 atttgttgat actggaggat gatttaagac gccaagctca cgaacaaaag atactgaagt 781 ggcaattcac tttgttctta gcgtctatgg ccggtgtagg cgcatttacc ttctacgaac 841 tttatttcac ttcagattat gtcaagggcc tccatagggt tattttgcaa ttcactcttt 901 ctttcatttc cattactgta gttctttttc atatcagtgg acaatataga agaactatcg 961 tcattccaag aagatttttt acctctacta ataaagggat taggcagttt aatgtgaagc 1021 tagttaaagt acagtctacg tgggacgaga aatacacaga ttcagtaaga tttgtgagtc 1081 gaacaattgc ttattgtaat atttattgtt tgaaaaaatt tctgtggctt aaagacgata 1141 atgccattgt gaaattttgg aaaagtgtca cgatacaatc ccaaccgagg atcggagctg 1201 tggatgtgaa attagtcctc aaccccagag catttagtgc agagattaga gaaggatggg 1261 agatttatag agacgagttt tgggccaggg aaggtgctag aagacgcaaa caagcgcacg 1321 aactccgacc taaatcagaa tgaaagagtt ggagggcttc ttccttcgaa taagaggtca 1381 tatttaccta tgtaaaattg taaccatcta tgttcacaca taaattatat tttatacatt 1441 attagaagtg aagctgttgt gtcgtgaaaa ttttacaaat ccgtcatttc atatttaagt 1501 tttccaacaa gtgctagaaa acctaggggt tgttgaaatt ggttaaacaa ggcatcttat 1561 tatacataca acagcataac gctagagggg caagaaggaa gaacttaaaa taataggtgt 1621 aaaatgactt tggcttttaa tatgcaacgg ttggtgtttc gtaatttgaa tgttgggaag 1681 cgcatgttca agaacgtccc cttatggagg tttaatgtcg ccaataaatt aggaaagccc 1741 ttaactcgct ctgtagggtt aggcggtgct ggcatagttg ctggtggctt ttacttgatg 1801 aatcgccagc cttctaagtt gatattcaat gattctttag gggcag // LOCUS YSKERD2A 1248 bp ds-DNA PLN 14-AUG-1990 DEFINITION K.lactis ER lumen protein retaining receptor (ERD2) gene, complete cds. ACCESSION M34844 KEYWORDS ER lumen protein retaining receptor; HDEL receptor. SOURCE K.lactis (SEY2102) DNA. ORGANISM Kluyveromyces lactis Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1248) AUTHORS Lewis,M.J., Sweet,D.J. and Pelham,H.R.B. TITLE The ERD2 gene determines the specificity of the luminal ER protein retention system JOURNAL Cell 61, 1359-1363 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Lewis, 06-JUN-1990, for release after publication. FEATURES from to/span description pept 390 414 ER lumen protein retaining receptor, exon 1 (ERD2) 572 1206 ER lumen protein retaining receptor, exon 2 IVS 415 571 ERD2 intron A site 415 420 consensus splice donor site 528 534 branchpoint sequence site 569 571 acceptor sequence BASE COUNT 368 a 235 c 202 g 443 t ORIGIN 1 gaattcttca ctcttcaccc ctcctctcta acgagactcg gataaacaga ctgtaacgat 61 gacgacgacg acgatgatga taatgattat gatgatgatg attgaatgtg ccatttgacg 121 tttttgacct gtaaacccct tttgtaaata gcctctgtcc tttttaagta cttatgcggt 181 gtctgttaat acaatctttg tggctcacgt gatttattat ttttctcaat gtgggccaaa 241 attgcaatct cttaactctg aaaaataaaa caaggcagta ctaatcgctt cttaaacact 301 tgttctttaa agattgaaat aaactaccga catttgtcgt tctaagttaa attcaaatca 361 aaacagcttt caattgagtg tgtcacatta tgttgaacgt tttcagaata gcaggtatgt 421 gaaaagaact gattttatat cattgttgga cgtcaacctc gatcggaacg ttatatcttg 481 gttctgaatg atgatttcca tcggcttatc tctactggca gcatctttac taacattgtt 541 attaccaact attttgtaat cgagtgttta ggtgatttct ctcatttggc tagtatcatc 601 attttgatac aatcgatcac aacatctaac tcagttgatg gtatctcatt gaaaactcaa 661 ctgctataca ccttggtctt tatcacacgt tatttgaacc tatttaccaa atggacctcc 721 ttgtacaact tcttaatgaa aattgttttc atttcatctt cggtttacgt cattgtgtta 781 atgcgccaac aaaaatttaa aaaccctgtc gcatatcaag acatgatcac cagagatcaa 841 tttaaaatca agtttttaat agtaccatgc attctcctag gattaatttt caattatcgt 901 ttcagtttta tacaaatatg ctggtccttc tctctatggt tggaaagtgt tgcaatcctt 961 cctcaattgt ttatgttgac taaaacaggt aaagcaaaac aattgacatc tcattatatt 1021 tttgcattgg ggttataccg tgccttgtac attccaaatt ggatatggag gtattatacc 1081 gaagaaagat tcgataaatt atcagttttc accggtgtga ttcaaacttt agtgtactct 1141 gatttcttct acatctatta tcagaaggtt atcaaactcg gtggagattt agaattacca 1201 caatagatga caggatttgc atctccactt gcagtcagtt cgaagctt // LOCUS CELCKINII 3180 bp ds-DNA INV 14-AUG-1990 DEFINITION C.elegans casein kinase II-alpha gene, complete cds. ACCESSION J05274 KEYWORDS casein kinase II-alpha. SOURCE C.elegans DNA. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 3180) AUTHORS Hu,E. and Rubin,C.S. TITLE Casein kinase II from Caenorhabditis elegans: Properties and developmental regulation of the enzyme; Cloning and sequence analyses of cDNA and the gene for the catalytic subunit JOURNAL J. Biol. Chem. 265, 5072-5080 (1990) STANDARD simple staff_review FEATURES from to/span description pept 250 288 casein kinase II-alpha, exon 1 533 703 casein kinase II-alpha, exon 2 763 890 casein kinase II-alpha, exon 3 940 1024 casein kinase II-alpha, exon 4 1084 1380 casein kinase II-alpha, exon 5 1838 1962 casein kinase II-alpha, exon 6 2018 2099 casein kinase II-alpha, exon 7 2301 2456 casein kinase II-alpha, exon 8 pre-msg 211 3073 casein kinase II-alpha mRNA and introns IVS 289 532 casein kinase II-alpha intron A IVS 704 762 casein kinase II-alpha intron B IVS 891 939 casein kinase II-alpha intron C IVS 1025 1083 casein kinase II-alpha intron D IVS 1381 1837 casein kinase II-alpha intron E IVS 1963 2017 casein kinase II-alpha intron F IVS 2100 2300 casein kinase II-alpha intron G BASE COUNT 974 a 580 c 544 g 1082 t ORIGIN 1 actttcgcag cgggaatatc tcaccacgca cgagttcaca tttggttttt tctctgcgct 61 ctctcgcatt tgcgtactct tttcatgttc tggcatgtat tttcaaaatt attcgggttt 121 tatcagtttt cccgagattt tactatttct actgaaaatt tcgttctaat aattcaatta 181 taacagccgt tttgtcaaaa cttgctcaaa aataacgtgt tacttgtttc agatcgttat 241 tcttgggaaa tgcctccgat tcctagccgg gcgagggtct acgccgaggt gagcaagttt 301 gtatagccag gagttaatta ttgaacagaa tatattgctc atacataact cacaatgacg 361 gaattaacgt tatccgtttt agttgtggaa tactgtgaaa cactgtacgc cacatttgtt 421 agattttata aatgttcata ggaatttata gtctcgtaaa gaaaaactat ttgtaaaagt 481 cactaaatta tgtttttatg agtttatata ttataaatca caattgtttc aggtgaatcc 541 gtcgagacca cgcgagtact gggactatga ggctcacatg atcgaatggg gacaaatcga 601 tgattatcag ctggtacgca agttgggacg tggaaaatat tcggaggttt ttgaaggatt 661 caagatgagc actgacgaga aagttgttgt caagattctc aaggtaagtt ttacaaaaat 721 aattgtaata aagctttcaa aatgtaattt gtttgttttc agccagtcaa aaagaagaag 781 atcaagcgtg agattaaaat tttggagaat cttcgtggcg ggacaaacat catcaccctc 841 ctcgacgtcg tcaaggatcc tatttcgaga actccggctc ttattttcga gttggtttac 901 tatttgaaga tttttagaat taataatttc tatttgaagg catgttaaca attcggactt 961 caagcaattg tatcaaaccc tcagtgacta tgatatccgt tactacctct acgagctctt 1021 gaaggtactt ttttcgtctg tccattaatt tttgaattca taacgtttca atttattctc 1081 caggctctcg acttctgcca ttctcaagga attatgcaca gagatgttaa gccgcacaat 1141 gtgatgattg acgccgaaaa acgggaactt cgattaatag attggggatt agccgaattc 1201 tatcatccaa gacaggacta taatgttcgt gttgcttcta gatatttcaa gggacccgag 1261 cttcttgtcg attatcaatg ttacgattat tcgttggata tgtggagttt gggatgcatg 1321 ttggctagta tgatcttcag gaaggaacca ttcttccacg gccatgataa ctacgatcag 1381 gtaagcactt taagctgaag tagcaaagaa ccgcaaatcc acaaatttca ttaaaatcca 1441 atattggaat atagaaaaca tttgatttat ctgattatgg cttttatttc agtgagaaca 1501 atggggaaaa acgcttaaaa aattaccgga aagtttaaaa tttccgaaaa aaaccgaaaa 1561 ttttccaatt ttgagcattt ttcaattgtt ccagctgcaa aaaactattt tcagataaag 1621 aaattatttt tttgaattcc atagctaaaa ttttactgat aagtttggct tattccgtag 1681 aatttcttgc cggcgattga ttcgcacccc tgagctgaaa tttaaatgaa taaataatat 1741 tgtcagggta tttattaaat attcatcaat aatttttatg atttttaata tttataaatt 1801 cacgaaatct gattttctta taaaaagttt tttgcagcta gtccgcattg caaaggtatt 1861 gggaactgat gagctttacg agtacattgc gagatatcac attgatctcg atccacgctt 1921 caacgatatt ctcgggcgac actcaagaaa acgctgggaa aggtatattt tgatcgaagc 1981 gaagtttcat gtttttctta aaattcattt gttgtagatt tatccacgct gaaaatcaac 2041 atcttgtcac cccagaggct cttgatttcc ttgataaact tcttcgctac gatcacgcgg 2101 taagaatcgg aaaaaagaga aattacacca aattagttac tgaaaaccag tttaacaaga 2161 acacagagaa ataaaaacaa ttttcacttg ttttattgaa aaaaaagaca tttgatactc 2221 gaaactaaac tataaaatat cttttatttg ttaaattaat ttatttccag aatcttttaa 2281 ttaaatttca ccatattcag gaacgcctaa ctgctcaaga ggccatgggt cacgaatact 2341 tccgcccagt tgttgaagca catgctcgtg ccaatggaac tgaacaagct gacggacagg 2401 gtgcatcgaa ttccgcttct tctcaatcct ccgatgccaa aattgacggc gcttaaaatt 2461 tctttctatt ttttttttaa ttttcctgtt gactatttcc ttttttatta atttccacac 2521 cactcctctc catcattatt cctttgctct aaaaattggc taagatatta aatcttcccc 2581 gcttctccga aaatttccct ctacatacca caaaagtgcc taatcttttc gtgagaaacg 2641 ttctttgcaa attctatttt taggattttt ttgtgtaatt catgcctctg aaattccgtt 2701 tcaattttaa atcttttttg tcccatttca tattgttctc cttcttctcc cctttcctct 2761 tcctcaaaca accacaatca tcatttttat aaatcgaatc acagttcgtg agaaaaaaaa 2821 ctgttacttg tatatcaata ttgtctttgg ttttgaaaaa cctactattg ttcctatgtt 2881 gagttgtgaa ctatgttcac ttttagagtt ttagatttaa aatgctcaac ggacgagtgc 2941 tccttcgatc gattgctcgc cggacaatgg catcgaaaat tgagccagga tctcgcagcc 3001 gatttatttg gatgtccagg ctaccgcgcc aatggtaatt aaatggaaat aataaagcat 3061 gattaaaaat aatataaaaa acttgtatct ggaaaaaatt aacgaaactt ttcagataca 3121 tttagatttc gaatgagttt ttttctaact aggcacagtt tagctagtag gcattctaaa // LOCUS DMORGX 1650 bp ds-DNA BCT 14-AUG-1990 DEFINITION D.mobilis ribosomal 16S RNA gene. ACCESSION M36474 KEYWORDS ribosomal 16S RNA. SOURCE D.mobilis DNA. ORGANISM Desulfurococcus mobilis Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Thermoproteales; Desulfurococcaceae. REFERENCE 1 (bases 1 to 1650) AUTHORS Kjems,J., Garrett,R.A. and Ansorge,W. TITLE The sequence of the 16S RNA gene and its flanking region from the archaebacterium Desulfurococcus mobilis JOURNAL Syst. Appl. Microbiol. 9, 22-28 (1987) STANDARD simple staff_review FEATURES from to/span description rRNA 71 1565 16S ribosomal RNA BASE COUNT 323 a 492 c 602 g 233 t ORIGIN 1 ccgacgaggg ggagggagcc acttaagccg agcgctccag aaaccccggc cccaacccca 61 agcggccgta actccggttg atcctgccgg tcccgaccgc tatcggggtg gggctaagcc 121 atgggagtcg cacgctccgc cgctgcgggg cgtggcggac ggctgagtaa cacgtggcta 181 acctaccctc gggaggggga taacaccggg aaactggtgc taatccccca taggggagga 241 ggcctggaag ggttcctccc cgaaagggtg tggcaggggt taacgctgct acaccgcccg 301 aggatggggc tacggcccat taggttgttg gcggggtaac ggcccgccaa gccgataatg 361 ggtaggggcc gtgagagcgg gagcccccag atgggcactg agacaagggc ccaggcccta 421 cggggcgcac caggcgcgaa acctccgcaa tgcgggaaac cgtgacgggg ccaccccgag 481 tgccccctta cgggggcttt tccccgctgt aggaaggcgg gggaataagc ggggggcaag 541 tctggtgtca gccgccgcgg taataccagc cccgcgagtg gtcgggacga ttattgggcc 601 taaagcgccc gtagccggcc cggcaagtcc cctcctaaat tcccgggctc aacccgggga 661 ctggagggga tactgccggg ctagggggtg ggagaggccg agggtactcc cggggtaggg 721 gcgaaatcct ataatcccgg gaggaccacc agtggcgaag gcgctcggct ggaacacgcc 781 cgacggtgag gggcgaaagc cgggggagcg aaccggatta gatacccggg tagtcccggc 841 tgtaaacgat gcgggctagg tgttgggtgg gcttagagcc cacccagtgc cgcagggaag 901 ccgttaagcc cgccgcctgg ggagtacggc cgcaaggctg aaactcaaag gaattggcgg 961 gggagcacca caaggggtgg agcctgcggt tcaattggag tcaacgccgg gaatctcacc 1021 gggggagaca gcaggatgac ggccaggtta aaggccttgc ctgacgcgct gagaggaggt 1081 gcatggccgt cgccagctcg tgctgtgaag tgtccggtta agtccggaaa cgagcgagac 1141 ccccacccct agttgctacc cggggctacg gctccggggc acactagggg gactgccgcc 1201 gtttaaggcg gaggaaggag ggggccacgg caggtcagca tgccccgaac cccccgggct 1261 acacgcgggc tacaatggcg gggacagcgg gatccgaccc cgaaaggggg aggcaatccc 1321 tcaaaccccg ccgtggttgg gatcgagggc tgcaactcgc cctcgtgaac gaggaatccc 1381 tagtaaccgc gcgtcaacat cgcgcggtga atacgtccct gctccttgca cacaccgccc 1441 gtcgctccac ccgaggggag ggggagtgag gcccggcccc ttgggtcggg tcgaactccc 1501 cctccctgag gggggagaag tcgtaacaag gtagccgtac cggaaggtgc ggctggatca 1561 cctcctgcct caggccgggg ctggggccgg ggctggaatg cgctaaggct taagtggctc 1621 ccgcctcctc tattcattga tgcaacacgg // LOCUS ECOHBB221 3703 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli beta-globin gene, complete cds. ACCESSION M34058 KEYWORDS beta-globin. SEGMENT 1 of 2 SOURCE E.coli (strain HB101) DNA, isolated from patient GD. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 3703) AUTHORS Thein,S.L., Hesketh,C., Taylor,P., Temperley,I.J., Hutchinson,R.M., Old,J.M., Wood,W.G., Clegg,J.B. and Weatherall,D.J. TITLE Molecular basis for dominantly inherited inclusion body beta-thalassemia JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.L.Thein, 07-MAY-1990. Author address: S.L.Thien Institute of Molecular Medicine, MRC Molecular Haematology Unit John Radcliffe Hospital Oxford, OX3 9DU U.K. FEATURES from to/span description pept 1612 1703 beta-globin, exon 1 1834 2056 beta-globin, exon 2 2907 + 3015 beta-globin, exon 3 IVS 1704 1833 beta-globin intron A IVS 2057 2906 beta-globin intron B IVS 3016 > 3703 beta-globin intron C signal 1484 1492 CAAT box signal 1532 1538 TATA box BASE COUNT 1055 a 707 c 736 g 1205 t ORIGIN 1 tctatttatt tagcaataat agagaaagca tttaagagaa taaagcaatg gaaataagaa 61 atttgtaaat ttccttctga taactagaaa tagaggatcc agtttctttt ggttaaccta 121 aattttattt cattttattg ttttatttta ttttatttta ttttatttta ttttgtgtaa 181 tcgtagtttc agagtgttag agctgaaagg aagaagtagg agaaacatgc aaagtaaaag 241 tataacactt tccttactaa accgacatgg gtttccaggt aggggcagga ttcaggatga 301 ctgacagggc ccttagggaa cactgagacc ctacgctgac ctcataaatg cttgctacct 361 ttgctgtttt aattacatct tttaatagca ggaagcagaa ctctgcactt caaaagtttt 421 tcctcacctg aggagttaat ttagtacaag gggaaaaagt acagggggat gggagaaagg 481 cgatcacgtt gggaagctat agagaaagaa gagtaaattt tagtaaagga ggtttaaaca 541 aacaaaatat aaagagaaat aggaacttga atcaaggaaa tgattttaaa acgcagtatt 601 cttagtggac tagaggaaaa aaataatctg agccaagtag aagacctttt cccctcctac 661 ccctactttc taagtcacag aggctttttg ttcccccaga cactcttgca gattagtcca 721 ggcagaaaca gttagatgtc cccagttaac ctcctatttg acaccactga ttaccccatt 781 gatagtcaca ctttgggttg taagtgactt tttatttatt tgtatttttg actgcattaa 841 gaggtctcta gttttttatc tcttgtttcc caaaacctaa taagtaacta atgcacagag 901 cacattgatt tgtatttatt ctatttttag acataattta ttagcatgca tgagcaaatt 961 aagaaaaaca acaacaaatg aatgcatata tatgtatatg tatgtgtgta catatacaca 1021 tatatatata tatttttttt cttttcttac cagaaggttt taatccaaat aaggagaaga 1081 tatgcttaga actgaggtag agttttcatc cattctgtcc tgtaagtatt ttgcatattc 1141 tggagacgca ggaagagatc catctacata tcccaaagct gaattatggt agacaaaact 1201 cttccacttt tagtgcatca atttcttatt tgtgtaataa gaaaattggg aaaacgatct 1261 tcaatatgct taccaagctg tgattccaaa tattacgtaa atacacttgc aaaggaggat 1321 gtttttagta gcaatttgta ctgatggtat ggggccaaga gatatatctt agagggaggg 1381 ctgagggttt gaagtccaac tcctaagcca gtgccagaag agccaaggac aggtacggct 1441 gtcatcactt agacctcacc ctgtggagcc acaccctagg gttggccaat ctactcccag 1501 gagcagggag ggcaggagcc agggctgggc ataaaagtca gggcagagcc atctattgct 1561 tacatttgct tctgacacaa ctgtgttcac tagcaacctc aaacagacac catggtgcac 1621 ctgactcctg aggagaagtc tgccgttact gccctgtggg gcaaggtgaa cgtggatgaa 1681 gttggtggtg aggccctggg caggttggta tcaaggttac aagacaggtt taaggagacc 1741 aatagaaact gggcatgtgg agacagagaa gactcttggg tttctgatag gcactgactc 1801 tctctgccta ttggtctatt ttcccaccct taggctgctg gtggtctacc cttggaccca 1861 gaggttcttt gagtcctttg gggatctgtc cactcctgat gctgttatgg gcaaccctaa 1921 ggtgaaggct catggcaaga aagtgctcgg tgcctttagt gatggcctgg ctcacctgga 1981 caacctcaag ggcacctttg ccacactgag tgagctgcac tgtgacaagc tgcacgtgga 2041 tcctgagaac ttcagggtga gtctatggga cccttgatgt tttctttccc cttcttttct 2101 atggttaagt tcatgtcata ggaaggggag aagtaacagg gtacagttta gaatgggaaa 2161 cagacgaatg attgcatcag tgtggaagtc tcaggatcgt tttagtttct tttatttgct 2221 gttcataaca attgttttct tttgtttaat tcttgctttc tttttttttc ttctccgcaa 2281 tttttactat tatacttaat gccttaacat tgtgtataac aaaaggaaat atctctgaga 2341 tacattaagt aacttaaaaa aaaactttac acagtctgcc tagtacatta ctatttggaa 2401 tatatgtgtg cttatttgca tattcataat ctccctactt tattttcttt tatttttaat 2461 tgatacataa tcattataca tatttatggg ttaaagtgta atgttttaat atgtgtacac 2521 atattgacca aatcagggta attttgcatt tgtaatttta aaaaatgctt tcttctttta 2581 atatactttt ttgtttatct tatttctaat actttcccta atctctttct ttcagggcaa 2641 taatgataca atgtatcatg cctctttgca ccattctaaa gaataacagt gataatttct 2701 gggttaaggc aatagcaata tttctgcata taaatatttc tgcatataaa ttgtaactga 2761 tgtaagaggt ttcatattgc taatagcagc tacaatccag ctaccattct gcttttattt 2821 tatggttggg ataaggctgg attattctga gtccaagcta ggcccttttg ctaatcatgt 2881 tcatacctct tatcttcctc ccacagctcc tgggcaacgt gctggtctgt gtgctggccc 2941 atcactttgg caaagaattc accccaccag tgcagccaca cctatcagtg gtgtggctaa 3001 tgccctggcc cacaagtatc actaagctcg ctttcttgct gtccaatttc tattaaaggt 3061 tcctttgttc cctaagtcca actactaaac tgggggatat tatgaagggc cttgagcatc 3121 tggattctgc ctaataaaaa acatttattt tcattgcaat gatgtattta aattatttct 3181 gaatatttta ctaaaaaggg aatgtgggag gtcagtgcat ttaaaacata aagaaatgaa 3241 gagctagttc aaaccttggg aaaatacact atatcttaaa ctccatgaaa gaaggtgagg 3301 ctgcaaacag ctaatgcaca ttggcaacag ccctgatgcc tatgccttat tcatccctca 3361 gaaaaggatt caagtagagg cttgatttgg aggttaaagt tttctatgct gtattttaca 3421 ttacttattg ttttagctgt cctcatgaat gtcttttcac tacccatttg cttatcctgc 3481 atctctcagc cttgactcca ctcagttctc ttgcttagag ataccacctt tcccctgaag 3541 tgttccttcc atgttttacg gcgagatggt ttctcctcgc ctggccactc agccttagtt 3601 gtctctgttg tcttatagag gtctacttga agaaggaaaa acagggggca cggtttgact 3661 gtcctgtgag cccttcttcc ctgcctcccc cactcacagt gac // LOCUS ECOHBB222 552 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli beta-globin gene, complete cds. ACCESSION M34059 KEYWORDS beta-globin. SEGMENT 2 of 2 SOURCE E.coli (strain HB101) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 552) AUTHORS Thein,S.L., Hesketh,C., Taylor,P., Temperley,I.J., Hutchinson,R.M., Old,J.M., Wood,W.G., Clegg,J.B. and Weatherall,D.J. TITLE Molecular basis for dominantly inherited inclusion body beta-thalassemia JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by S.L.Thein, 07-MAY-1990. Author address: S.L.Thein Institute of Molecular Biology, MRC Molecular Haematology Unit John Radcliffe Hospital, Oxford, OX3 9DU, U.K. FEATURES from to/span description pept + 169 272 beta-globin, exon 4 IVS < 1 168 beta-globin intron C BASE COUNT 153 a 113 c 108 g 178 t ORIGIN 1 tctgcatata aattgtaact gatgtaagag gtttcatatt gctaatagca gctacaatcc 61 agctaccatt ctgcttttat tttatggttg ggataaggct ggattattct gagtccaagc 121 taggcccttt tgctaatcat gttcatacct cttatcttcc tcccacagct cctgggcaac 181 gtgctggtct gtgtgctggc ccatcacttt ggcaaataat tcaccccacc agtgcaggct 241 gcctatcaga aagtggtggc tggtgtggct aatgccctgg cccacaagta tcactaagct 301 cgctttcttg ctgtccaatt tctattaaag gttcctttgt tccctaagtc caactactaa 361 actgggggat attatgaagg gccttgagca tctggattct gcctaataaa aaacatttat 421 tttcattgca atgatgtatt taaattattt ctgaatattt tactaaaaag ggaatgtggg 481 aggtcagtgc atttaaaaca taaagaaatg aagagctagt tcaaaccttg ggaaaataca 541 ctatatctta aa // LOCUS HUMETN1 682 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human endothelin (ETN) gene, complete cds. ACCESSION M25377 J04819 J05489 KEYWORDS endothelin. SEGMENT 1 of 4 SOURCE Human leukocyte DNA and human umbilical vein endothelial cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 682) AUTHORS Bloch,K.D., Friedrich,S.P., Lee,M.-E., Eddy,R.L., Shows,T.B. and Quertermous,T. TITLE Structural organization and chromosomal assignment of the gene encoding endothelin JOURNAL J. Biol. Chem. 264, 10851-10857 (1989) STANDARD simple staff_review REFERENCE 2 (bases 2 to 421) AUTHORS Lee,M.-E., Bloch,K.D., Clifford,J.A. and Quertermous,T. TITLE Functional analysis of the endothelin-1 gene promoter: Evidence for an endothelial cell-specific cis-acting sequence JOURNAL J. Biol. Chem. 265, 10446-10450 (1990) STANDARD simple staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by K.D.Bloch (06-AUG-89). FEATURES from to/span description pept 519 + 582 endothelin prepropeptide, exon 1 /nomgen="EDN1" /map="unassigned" /hgml_locus_uid="LU0066V" pre-msg 252 > 682 ETN mRNA and introns (5' end put.) IVS 583 > 682 ETN intron A BASE COUNT 121 a 178 c 169 g 213 t 1 others ORIGIN Chromosome 6. 1 ggcaggttta gcaaaggtct ctaatgggna ttttattttt cttagccctg cccccgaatt 61 gtcagacggc ggcgtctgcc tctgaagtta gcagtgattt ctttcgggcc tggccttatc 121 tccggctgca cgttgcctgt tggtgactaa taacacaata acattgtctg gggctggaat 181 aaagtcggag ctgtttaccc ccactctaat aggggttcaa tataaaaagc cggcagagag 241 ctgtccaagt cagacgcgcc tctgcatctg cgccaggcga acgggtcctg cgcctcctgc 301 agtcccagct ctccaccgcc gcgtgcgcct gcagacgctc cgctcgctgc cttctctcct 361 ggcaggcgct gcttttctcc ccgttaaagg gcacttgggc tgaaggatcg ctttgagatc 421 tgaggaaccc gcagcgcttt gagggacctg aagctgtttt tcttcgtttt cctttgggtt 481 cagtttgaac gggaggtttt tgatcccttt ttttcagaat ggattatttg ctcatgattt 541 tctctctgct gtttgtggct tgccaaggag ctccagaaac aggtaggcac gctcgttgac 601 ttgtaagtct cggaattaca agttagtgtg ttcttatcca ccttcatgct tttcttgctt 661 ctatttttcc ccgttctttt ta // LOCUS MUSKER19 5422 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse keratin 19 gene, complete cds. ACCESSION M36120 KEYWORDS keratin 19. SOURCE Mouse (strain C34/HeNCrlBR) embryo fibroblast DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5422) AUTHORS Lussier,M., Filion,M., Compton,J.G., Nadeau,J.H., Lapointe,L. and Royal,A. TITLE The mouse keratin 19-encoding gene: Sequence, structure and chromosomal assignment JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.Lussier, 06-JUL-1990. FEATURES from to/span description pept 375 803 keratin 19, exon 1 3728 3810 keratin 19, exon 2 4029 4185 keratin 19, exon 3 4553 4714 keratin 19, exon 4 4832 4957 keratin 19, exon 5 5070 5324 keratin 19, exon 6 pre-msg 304 > 5422 keratin 19 mRNA and introns IVS 804 3727 keratin 19 intron A IVS 3811 4028 keratin 19 intron B IVS 4186 4552 keratin 19 intron C IVS 4715 4831 keratin 19 intron D IVS 4958 5069 keratin 19 intron E signal 184 188 CAAT box signal 278 284 ATA box signal 5402 5408 poly-A signal BASE COUNT 1255 a 1429 c 1431 g 1307 t ORIGIN 1 gatatcgtca ctccggaagt tagagttgtc taagcctgtg tagtaagggg ctgaagggcc 61 agaaaaggga cgtgacatgt tggcagtagc tttggagtgg gctggggcgg ggcagctctg 121 ggaaggactg agacctctgg ctcctgggag gggagaggta ggagcagaat cgccaggaat 181 tgaccaatgg ggaaagagcc catatttgca ctctgggagc ttggaaattt ctgatacccg 241 ccccttcaac atctccatcc cccttcccgc cccgggcata aaaagccaca ggtgagggcc 301 ttgtcactcc tcctgcggcc agcagttctc agacctgcgt ccctttttcc ttcgctctgg 361 tctccctcct catcatgact tcctatagct atcgccagac ctcagctatg tcttcctttg 421 ggggtacggg cgggggttca gtacgcattg ggtcaggggg tgttttccgc gcacccagca 481 tccacggggg ctcaggtggc cgcggcgtgt ccgtgtcctc cacccgcttc gtgacctcgt 541 cctccgggag ctatggcgga gtccgcggtg gaagttttag tgggaccctg gctgtgtctg 601 atgggctgct gtctggcaat gagaagatca ccatgcaaaa cctcaatgat cgtctcgcct 661 cctacttgga caaggtgcgc gccctagagc aggccaatgg cgagctggag gtgaagatcc 721 gcgactggta ccagaagcag ggacccggac cctcccgaga ttacaaccac tactttaaga 781 ccatcgagga cttgcgcgac aaggtgggta gcggccaggc cctgggaggt gcacctgttg 841 gggtggtgcg gcagggcggg gaggttgtgg aaactcagcc cgcgggtgga caagcccgtt 901 actacgggcc gggcaggaac ccgcagggtg gggcacaaag tctgggctgg accctgggat 961 tcgagtggag gcggggtgga agggaactga gacaggttaa gtgggtgggg ctttcggaag 1021 cggtggggac ctgcttggat agtcagggat atctatccaa agagagcatt taacacagct 1081 ttgggctgcc cagagatgaa gataccgagc cgcagcctca gctgggggtg gggtgttggg 1141 gagggctggg gagctgccgg atattagtct gtaggctcag aggtcacctg agcctgtggc 1201 cctcttctgg gactcttgag cattgctttc tccagttgtg tgggttgttc ctgtctggtt 1261 ttctttacag cagctgtcct ctcactctcc tcctctccag cccaggggac cacctaggct 1321 cacacggagc tacgagaaga atgtaaaacc tgggggctgc acagtaaaac cagatcttaa 1381 aaaaaaaaac ttccagtgag gtgtggtagc tcaggcctgt aatcccagga gcaggtggat 1441 gagttcatag ctatgcccag ctacagttaa cttgaatcta gtcagggcta catgagtctc 1501 taggctccca cccggccgtc aaacagcccg aacagacaga cagacagaca gacagacaga 1561 cagacagaca gacaacaact ctaacacgaa acaaaagaaa aataaaaaca attattttag 1621 taagtaagag ggaggtggaa gtcttgaccc tctgcttctg tggacctctc tcctggagtc 1681 ccccggtcca gggaagacaa cctgcaagtc tctgtgggct ccaacatcca acaggctggt 1741 tgggtggggc attgccctca ctcctaggga agagctatgt cctgaagaaa tgtaccagag 1801 accaggcagg cgtttaatag cgtcctctcg ctagtgctcc caacatgcag ttctcaagag 1861 cattttccag ggctgtgtga acttagcatg gagctagaca cacaacacag caagcagcta 1921 gagaggtgtt tttttgaccc agtgatggac agctagaatc ctaggcggta cacatctttc 1981 tttctcttgc tttgtccctt ccccttccac acactcccag cacgcatatc tagaagcaat 2041 tcttactccg ctgtgaatgc tctcctccgg gtttttatat tctgttgttt tcgtcaacaa 2101 ccccccaccc ccaaactatc tatgtagctg agcgtggtgg ctgatacgtg gtggcttatc 2161 cagcattgag aaactgaggc agaaagaagg acgatgaagt taatgccagt tgaaccatac 2221 agtgagactc tgtctcaaaa tgtcaaacca gagctggggc gcagtggcac acacctttaa 2281 tcccagcact tggggaggca gagaaggtgg atctctgtga attcaaggcc agtctacaga 2341 gtgagtccag gacagccagg gctacaaaga gaaatcctgt ctcgaacccc ctagtccttc 2401 acccccaaag tctaacaaac cagaggaaac caacccaacc caacccaacc caacccaacc 2461 caacccaacc caacccaacc aaaaacatta aaccaaaata atcctgcaga ttccaacctg 2521 gtgacaggat aagtggccat gaccttaatc tttgaaaaac actgaaccct ggtactgaat 2581 tggtggtaaa cttaatgatt tgagtgaacc cctcaaacca gagtcttcat attgggtaaa 2641 ctcaggattt ccagtcacgt acagtggcag tactaaccta ggagtgaggc tgagttacca 2701 tgaagagagt acaaggtgcc tggcacatca ccgaagcaga atgtgctcta agtgtgggac 2761 acagacaggg ataagtgttg cttttctttt ctttaggatt tatttattta cttatctatt 2821 aatttattta tatgtatatt ggtgttttgc ttgcatgtat gtctgtgtga ggaacttgaa 2881 ttgcaaacag ttgtgagctg ccacgagggt gctgcaaatt gaactctgtt cctttggaag 2941 aatatccagg gctcttaatc tccaagttac ctctccagtc cctgggataa atgatccatc 3001 catccttcct tccttccttc cttccttcct tccttccttc cttccttcct tccgtctttc 3061 ctctttcttt cttttttttt tttttttggt ttttggtttt tggttttttg agacagagtt 3121 tctctgtgta gccctggctg gctcaaactc agaaatcctt ctgcctctgc ctcccaagtg 3181 gtggggttaa aggcatgtgc caccactgcc tggagggata aatgattctt ttttttttaa 3241 gatttattta tttattatat ctaagtacac tgtagctgtc ttcagacact ctggaagaga 3301 gcgtcagatc tagtttcgga tggttgtgag ccaccatgtg gttgctggga tttgaactca 3361 ggaccttagg aagagcagtc agtgctctta accgctgagc catctctcca gcccaggata 3421 aatgattctt aaagcttcat gtttctaggt ccttttcaga tcacttagag tttcacctgc 3481 acaggtcttg gccatgccct tctggaattg cctctttgcc tgcctccctt cacccaacct 3541 cctttacagt gtgttgtgct ccagctctgg ggctgtggat cgatgccaaa gtctcccaga 3601 gagcagacta gtgggtagtg tcaagggagt gggggctatc cattcctctc tccttctctg 3661 ggaagaaagt agattgataa tactggcttg ccaaactggt gccaactgcc agcctcctcc 3721 cttctagatt cttggtgcca ccattgacaa ctccaagatt gtcctacaga ttgacaatgc 3781 tcgcctggct gcagatgact tcagaaccaa gtaagtgtcc ctgtattggg ggctgcggaa 3841 gccagggcaa gtgagaggag agcagcggca ggtcttatag tgcttgatga agtagaggcc 3901 tggactctgc ccttctccac ttgagctggc tcaaagcaat gccccccctc cccgactcct 3961 aaaccctggg aaagttggca gagacacaga tccagcacta gcctgactgc aactatttct 4021 tcggaaaggt ttgagacaga acacgccttg cgtctgagcg tggaggccga catcaacggc 4081 ctgcgccggg tgctggatga gctgactctg gccaggactg acctggagat gcagattgag 4141 agcctgaagg aggagctggc ctacctgaag aagaaccatg aggaggtggg tcaagctggg 4201 ccttctgtcc attttgtccc aggattggga gaccctatta tcttgggctt actgaccatg 4261 acctcatctc ttggccctgg ctttgatcat gcctctggca gtctcagcag ggccatgaca 4321 tcatcactat gatgacctta ctgaccatga catcatcact atggccctgg ctttgatcat 4381 gcctctcgca gtctcagcag ggcatggtaa ccatgacatc tactttccag tgctggctcc 4441 tagatttgta cctccaaacc agccgtcttc agctcagggc ttgggggagt gacctgatcc 4501 aagttcagca tgtgtaggtc ttgacatgcc cttgactgtg gtgtcttcac aggaaattac 4561 tgccctgagg agccaggtgg gtggccaggt cagtgtggag gtggattcca ctcccggtgt 4621 cgacctagcc aagatcctga gtgagatgag aagtcagtat gagatcatgg ccgagaagaa 4681 ccggaaggat gctgaagcca cctaccttgc tcgggtatgc agaggatgct ggtatcccgt 4741 gagggtggcg aggagagacc ctgaaccctc acccatccac ccgcagagag aagcttgttg 4801 cttattttct ccttaacttg tcttgttaca gattgaggag ctgaacaccc aggtcgccgt 4861 ccactctgag cagatccaga taagcaagac cgaagtcacg gaccttcgac ggaccctcca 4921 gggccttgag attgagctgc agtcccagct cagcatggta cgtgtctcca ccctacggcc 4981 tgcacacttg tgccctggca acctcggtgc ctgtgcccgt gttcaggaac ctgcctctgc 5041 tgagtcacgc cccctttccc tcccctcaga aagctgccct ggaaggcacg ctggcagaga 5101 cggaggcccg ttatggagtc cagctgtcac agatccagag cgtgatcagc ggttttgaag 5161 cccagctgag cgacgtgcgt gccgacatag agcgccagaa ccaggagtat aagcagctca 5221 tggacatcaa gtccaggctg gagcaggaga tcgccaccta ccgcagcctg ctggagggcc 5281 aggaagccca ctacaacaat ctgcccaccc ccaaggccat ctgagctacc agcgagactc 5341 ccctgggaag gggcctgact ggggtgataa aagtttactc taacccctcc ctcgacttgt 5401 caataaaact atcctccaag gg // LOCUS VIBHEMOX 1586 bp ds-DNA BCT 14-AUG-1990 DEFINITION V.parahaemolyticus thermolabile hemolysin (TL) gene, complete cds. ACCESSION M36437 KEYWORDS thermolabile hemolysin. SOURCE V.parahaemolyticus DNA. ORGANISM Vibrio parahaemolyticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1586) AUTHORS Taniguchi,H., Hirano,H., Kubomura,S., Higashi,K. and Mizuguchi,Y. TITLE Comparison of the nucleotide sequences of the genes for the thermostable direct hemolysin and the thermolabile hemolysin from Vibrio parahaemolyticus JOURNAL Microb. Pathog. 1, 425-432 (1986) STANDARD simple staff_review FEATURES from to/span description pept 124 1380 thermolabile hemolysin precursor (TL) sigp 124 183 thermolabile hemolysin signal peptide matp 184 1377 thermolabile hemolysin BASE COUNT 466 a 378 c 361 g 381 t ORIGIN 1 tcgactgtct ggagtattta ctcagaatta gagggtttag tgagaaaaat tctcatctaa 61 acaatgttat agccaagtat tttttcaatg tgcttgggtc aataaccatt acaagaagag 121 tgaatgatga aaaaaacaat cacactatta actgcattac tcccgcttgc ttctgcagtt 181 gccgaagagc caaccttatc accagaaatg gtttcagcgt ctgaagtgat cagcacgcaa 241 gaaaaccaaa cctataccta tgttcgctgt tggtatcgca ccagctactc gaaagatgat 301 ccagcgaccg attgggaatg ggcaaaaaac gaagatggta gctacttcac cattgacggc 361 tactggtgga gctccgtttc atttaaaaac atgttctaca ccaacacgtc gcaaaacgtt 421 atccgtcagc gttgtgaagc aacattagat ttggcgaacg agaacgcaga cattacgttc 481 ttcgccgctg acaatcgctt ctcatacaac cacacgatct ggagcaacga cgcagcaatg 541 cagccagatc aaatcaacaa agtggttgca ctcggtgaca gcttgtctga tacaggcaac 601 atctttaacg catcacaatg gcgcttccct aacccgaaca gctggttctt aggtcacttc 661 tccaacggtt ttgtgtggac agaatacatt gccaaagcga agaaccttcc gctctacaac 721 tgggcagttg gcggcgcggc tggtgagaac caatacatcg cgctaacagg ggttggtgat 781 caagtttctt cgtacttaac ctacgcaaaa ctggcgaaga actacaaacc agcaaacacc 841 ttgtttacgc ttgagtttgg tttgaatgac ttcatgaact acaaccgtgg cgttccagaa 901 gtgaaagcgg attatgcaga agcactgatt cgtttgacgg acgcaggtgc gaagaacttc 961 atgttgatga cactgccaga tgcgacgaaa gcgcctcagt ttaagtactc aacacaagaa 1021 gagatcgaca aaattcgtgc gaaagtgctt gagatgaacg agttcatcaa ggcacaagcg 1081 atgtactaca aagcgcaagg ttacaacatc acgttgtttg atactcacgc cttgttcgag 1141 acgctaactt ctgcgcccga agagcacggt ttcgtgaacg cgagcgatcc ttgtttggac 1201 atcaaccgct catcgtctgt cgattacatg tacacccacg cattgcgctc tgagtgtgca 1261 gcgtctggtg ctgagaagtt tgtgttctgg aatgtcacgc atccaacaac agcaactcac 1321 cgctatgttg cagagaaaat gctagaaagt agcaacaact tagccgagta ccgtttctaa 1381 ccggacacgg cttctgagtt gaaaccttat cttcgtacac acgttgataa cgaacacatc 1441 gtggccattt ttatcgaagg aacgttgtgg tcacagcagt cacaacgcta aacaagttac 1501 agtggcgcga cgtcggttcc ccctaaagac tggtaaagcg taacctgagt gagaaactgg 1561 ttgtagcggt tctccaacaa agaagc // LOCUS CELCAPKCC1 195 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 1. ACCESSION M37114 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 1 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 195) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 121 + 190 cAMP-dependent protein kinase catalytic subunit C, exon 1 pre-msg 1 > 195 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS 191 > 195 intron A BASE COUNT 63 a 47 c 48 g 37 t ORIGIN 1 gtttgaggta acttaaacag aaagcagcga agaccacaca agagaagact gcaagtgtct 61 tctgaagtgt gacaacgtcc catccagtcc cgatcgtggt cttcgtctac cagcaaaaca 121 atgctcaagt ttctgaaacc aaaatcgtcg gacgagggaa gctcgaagga caacaagaac 181 tcggcgtcgc gtatg // LOCUS CELCAPKCC2 75 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 2. ACCESSION M37115 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 2 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 75) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 70 cAMP-dependent protein kinase catalytic subunit C, exon 2 pre-msg < 1 > 75 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron A IVS 71 > 75 intron B BASE COUNT 26 a 17 c 18 g 14 t ORIGIN 1 attttcagtc aaagaattcc tggacaaggc acgcgaagac ttcaaacaac gatgggagaa 61 tccggctcag gtaca // LOCUS CELCAPKCC3 142 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 3. ACCESSION M37116 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 3 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 142) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 137 cAMP-dependent protein kinase catalytic subunit C, exon 3 pre-msg < 1 > 142 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron B IVS 138 > 142 intron C BASE COUNT 44 a 27 c 35 g 36 t ORIGIN 1 attttcagaa cacggcgtgt cttgacgact ttgataggat aaaaaccctt ggaaccggat 61 cattcgggcg cgtgatgctt gtgaagcata agcagtcggg caactattat gccatgaaaa 121 ttctagacaa acaaaaggtt gt // LOCUS CELCAPKCC4 113 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 4. ACCESSION M37117 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 4 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 113) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 108 cAMP-dependent protein kinase catalytic subunit C, exon 4 pre-msg < 1 > 113 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron C IVS 109 > 113 intron D BASE COUNT 30 a 23 c 27 g 33 t ORIGIN 1 attttcaggt tgtaaaactc aagcaagtgg agcacacgtt gaacgaaaag cgcattctcc 61 aggctatcga ttttccgttt ttggtgaaca tgacgttctc tctcaagggt gag // LOCUS CELCAPKCC5 222 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 5. ACCESSION M37118 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 5 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 222) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 217 cAMP-dependent protein kinase catalytic subunit C, exon 5 pre-msg < 1 > 222 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron D IVS 218 > 222 intron E BASE COUNT 50 a 60 c 44 g 68 t ORIGIN 1 tatttcagac aactccaact tgtacatggt gctcgaattc atctcgggag gtgagatgtt 61 ctcacattta cgccgaatcg gacgtttctc cgagccccac tcgcgtttct acgccgctca 121 aattgtgctt gcctttgagt accttcactc cttagatctc atctatcgtg atctgaaacc 181 cgagaacttg cttatcgatt ccactggata tctaaaggta gg // LOCUS CELCAPKCC6 397 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 6. ACCESSION M37119 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 6 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 397) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 + 392 cAMP-dependent protein kinase catalytic subunit C, exon 6 pre-msg < 1 > 397 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron E IVS 393 > 397 intron F BASE COUNT 119 a 73 c 99 g 106 t ORIGIN 1 attttaaggt aaccgatttc ggatttgcga aacgtgtcaa aggacgaacg tggacattgt 61 gcggcacacc agaatacttg gcacccgaga ttattttgtc aaaaggatac aataaggctg 121 tcgattggtg ggcacttggt gtgttgatct acgaaatggc cgccggttat ccgccgttct 181 ttgccgatca accgattcaa atttatgaga agattgtgtc aggaaaagtg aaattcccgt 241 cgcacttctc gaatgagttg aaagacttgc tgaagaattt gctccaagtt gatcttacaa 301 agcgatacgg taacttgaaa aatggagttg ccgatatcaa aaatcacaaa tggtttggaa 361 gtaccgactg gattgccatc tatcagaaga aggtgag // LOCUS CELCAPKCC7 729 bp ss-mRNA INV 14-AUG-1990 DEFINITION C.elegans cAMP-dependent protein kinase catalytic subunit C gene, exon 7. ACCESSION M37120 J05289 M35424 KEYWORDS cAMP-dependent protein kinase catalytic subunit C. SEGMENT 7 of 7 SOURCE C.elegans (strain Bristol N2), cDNA to mRNA and DNA, clones lambda-CC[2,3,5]. ORGANISM Caenorhabditis elegans Eukaryota; Animalia; Metazoa; Nemata; Secernentea; Spiruria; Spirurida; Spirurina; Filarioidea; Filariidae. REFERENCE 1 (bases 1 to 729) AUTHORS Gross,R.E., Bagchi,S., Lu,X. and Rubin,C.S. TITLE Cloning, characterization, and expression of the gene for the catalytic subunit of cAMP-dependent protein kinase in Caenorhabditis elegans JOURNAL J. Biol. Chem. 265, 6896-6907 (1990) STANDARD simple staff_entry FEATURES from to/span description pept + 9 134 cAMP-dependent protein kinase catalytic subunit C, exon 7 pre-msg < 1 729 cAMP-dependent protein kinase catalytic subunit C mRNA and introns IVS < 1 8 intron F BASE COUNT 197 a 197 c 86 g 249 t ORIGIN 1 aatttcagat cgaagcaccg ttccttccaa aatgccgtgg ccctggcgac gcatctaact 61 ttgacgacta cgaggaagag ccgcttcgta tctcgggcac cgagaaatgc gccaaagagt 121 ttgccgagtt ttaattcatt atctttgtcg attccttcca taagcttttt gtttcaaatt 181 tcactctctc tctctctctt ttccatattc ctcaaaattg tactagtcca aaacaaaacc 241 ctgtccctat tgtccggtgg tttgtcttcc ctcacccaaa ccttctcatt ctcagccaaa 301 ttgaattcaa tattgttgcg acaattgaaa atgatgaaga tgcttactat ttcttccctt 361 ccttctgaac cccttatttt ttaatgttca tacattttct tgccccgatc ccgtccccct 421 aataataatt tgtaaaaata atagattgga atttgctcta atatatataa ttattatata 481 tcaaaatttt ctagtcaacc cctgactttc cctcctccaa ttcagcctcc cgcaaacttt 541 tttctagaaa agttgacttc tgtctcaaca aagaaatgga atcccacaaa accattaatt 601 gtatgttttt ttctcatttt ccttttcctc tctcatgtat acatatatat gtattttaaa 661 cacttttcac cccccccccc cccacacaca ctcaccgcaa aaatccccca gtcgtcataa 721 aaagtagtt // LOCUS DRODEC1A 3784 bp ds-DNA INV 14-AUG-1990 DEFINITION D.melanogaster defective chorion-1 fc125 (dec-1) gene, complete cds. ACCESSION M35887 M35688 X07697 KEYWORDS dec-1 protein; eggshell protein. SOURCE D.melanogaster ovary, cDNA to mRNA and genomic DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 121) AUTHORS Hawley,R.J. and Waring,G.L. TITLE Cloning and analysis of the dec-1 female-sterile locus, a gene required for proper assembly of the Drosophila eggshell JOURNAL Genes Dev. 2, 341-349 (1988) STANDARD simple automatic REFERENCE 2 (bases 44 to 3784) AUTHORS Waring,G.L., Hawley,R.J. and Schoenfeld,T. TITLE Multiple proteins are produced from the dec-1 eggshell gene in Drosophila by alternative RNA splicing and proteolytic cleavage events JOURNAL Dev. Biol. (1990) In press STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Waring, 28-JUN-1990. EMBL features not translated to GenBank features: key from to description PRM 14 18 put. TATA-box SITE 44 44 transcription initiation site FEATURES from to/span description pept 119 3490 fc125 protein precursor (dec-1) sigp 119 187 fc125 protein signal peptide matp 188 3487 fc125 protein BASE COUNT 1103 a 999 c 1004 g 678 t ORIGIN Chromosome 7C-X. 1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga 61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat 121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag 181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga 241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc 301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag 361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa 421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc 481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg 541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc 601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc 661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc 721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc 781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg 841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca 901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc 961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa 1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca 1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga 1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt 1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag 1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc 1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg 1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc 1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt 1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa 1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag 1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc 1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca 1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa 1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg 1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat 1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat 1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca 2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag 2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca 2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt 2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca 2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag 2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca 2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca 2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga 2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc 2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga 2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg 2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa 2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga 2821 aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca 2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac 2941 aatggccagg acaataaacc cgaagcaacc aggcgaagtc ggtggatcgg agagccagaa 3001 gagcaactct aatcctccaa cgacgttgac accagctcca caagagcaac cgcaagagca 3061 tcgagtccac aaaagtccat catcagcacc atccgaaacc gaaatcgaaa acgcaccatc 3121 atctgacccc caagtgggtt ccattttcac ctacggcgag ggattgctgc atccgtttat 3181 gggtctgctg ccggtggaga ggcccgatga tccgtggaac cagaagccat acgatcctca 3241 ccatcctctc tacacgggcg gtgggagcta cgatgcctat ctgagggacg gtcgccatcg 3301 aagggacacg cacatcatgg gtcaaggtac gcagcatggc atcctgacac ccggcatgtt 3361 ggagagactt cttcggatca agatggattt ccagcgtaga tttcccccat ttatacaagg 3421 gtatgctgaa tcaccatacg aatcttaccc gtgtagaggt acagcctccg gttctgggca 3481 agatttctaa gccgaaaact aagaccaagc ccaagaatga ggacgaaccc gtcttcgaat 3541 tgggagccgc tgagcgtagc ctgttcgagg acgagaccaa cgattcttta gagaaagatc 3601 ccgaaccaga accggatgag gaggatgacc gggatgtgga agagcccagc gagagcagtg 3661 aacctagggg attcagcagc aaaaaaagcc gggatgagaa cgacatcgat tacttcaatt 3721 tcgatgacga ttgatgttga tgattgactt ttattataat tgccaataaa attaatatta 3781 taaa // LOCUS DRODEC1B 3798 bp ds-DNA INV 14-AUG-1990 DEFINITION Drosophila melanogaster defective chorion-1 fc106 (dec-1) gene, complete cds. ACCESSION M35888 M35688 X07697 KEYWORDS dec-1 protein; eggshell protein. SOURCE D.melanogaster ovary, cDNA to mRNA and genomic DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 121) AUTHORS Hawley,R.J. and Waring,G.L. TITLE Cloning and analysis of the dec-1 female-sterile locus, a gene required for proper assembly of the Drosophila eggshell JOURNAL Genes Dev. 2, 341-349 (1988) STANDARD simple automatic REFERENCE 2 (bases 44 to 3798) AUTHORS Waring,G.L., Hawley,R.J. and Schoenfeld,T. TITLE Multiple proteins are produced from the dec-1 eggshell gene in Drosophila by alternative RNA splicing and proteolytic cleavage events JOURNAL Dev. Biol. (1990) In press STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Waring, 28-JUN-1990. EMBL features not translated to GenBank features: key from to description PRM 14 18 put. TATA-box SITE 44 44 transcription initiation site FEATURES from to/span description pept 119 2971 fc106 protein precursor (dec-1) sigp 119 187 fc106 protein signal peptide matp 188 2968 fc106 protein BASE COUNT 1106 a 1003 c 1008 g 681 t ORIGIN Chromosome 7C-X. 1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga 61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat 121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag 181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga 241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc 301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag 361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa 421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc 481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg 541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc 601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc 661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc 721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc 781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg 841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca 901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc 961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa 1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca 1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga 1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt 1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag 1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc 1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg 1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc 1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt 1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa 1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag 1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc 1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca 1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa 1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg 1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat 1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat 1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca 2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag 2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca 2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt 2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca 2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag 2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca 2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca 2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga 2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc 2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga 2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg 2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa 2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga 2821 aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca 2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac 2941 aatggccagg gcgacgcatt tcagacaata aacccgaagc aaccaggcga agtcggtgga 3001 tcggagagcc agaagagcaa ctctaatcct ccaacgacgt tgacaccagc tccacaagag 3061 caaccgcaag agcatcgagt ccacaaaagt ccatcatcag caccatccga aaccgaaatc 3121 gaaaacgcac catcatctga cccccaagtg ggttccattt tcacctacgg cgagggattg 3181 ctgcatccgt ttatgggtct gctgccggtg gagaggcccg atgatccgtg gaaccagaag 3241 ccatacgatc ctcaccatcc tctctacacg ggcggtggga gctacgatgc ctatctgagg 3301 gacggtcgcc atcgaaggga cacgcacatc atgggtcaag gtacgcagca tggcatcctg 3361 acacccggca tgttggagag acttcttcgg atcaagatgg atttccagcg tagatttccc 3421 ccatttatac aagggtatgc tgaatcacca tacgaatctt acccgtgtag aggtacagcc 3481 tccggttctg ggcaagattt ctaagccgaa aactaagacc aagcccaaga atgaggacga 3541 acccgtcttc gaattgggag ccgctgagcg tagcctgttc gaggacgaga ccaacgattc 3601 tttagagaaa gatcccgaac cagaaccgga tgaggaggat gaccgggatg tggaagagcc 3661 cagcgagagc agtgaaccta ggggattcag cagcaaaaaa agccgggatg agaacgacat 3721 cgattacttc aatttcgatg acgattgatg ttgatgattg acttttatta taattgccaa 3781 taaaattaat attataaa // LOCUS DRODEC1C 5740 bp ds-DNA INV 14-AUG-1990 DEFINITION Drosophila melanogaster defective chorion fc177 (dec-1) gene, complete cds. ACCESSION M35889 M35688 X07697 KEYWORDS dec-1 protein; eggshell protein. SOURCE D.melanogaster ovary, cDNA to mRNA and genomic DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 121) AUTHORS Hawley,R.J. and Waring,G.L. TITLE Cloning and analysis of the dec-1 female-sterile locus, a gene required for proper assembly of the Drosophila eggshell JOURNAL Genes Dev. 2, 341-349 (1988) STANDARD simple automatic REFERENCE 2 (bases 44 to 5740) AUTHORS Waring,G.L., Hawley,R.J. and Schoenfeld,T. TITLE Multiple proteins are produced from the dec-1 eggshell gene in Drosophila by alternative RNA splicing and proteolytic cleavage events JOURNAL Dev. Biol. (1990) In press STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Waring, 28-JUN-1990. EMBL features not translated to GenBank features: key from to description PRM 14 18 put. TATA-box SITE 44 44 transcription initiation site FEATURES from to/span description pept 119 4888 fc177 protein precursor (dec-1) sigp 119 187 fc177 protein signal peptide matp 188 4885 fc177 protein BASE COUNT 1716 a 1481 c 1496 g 1046 t 1 others ORIGIN Chromosome 7C-X. 1 gtggcaaaaa atctatttgg accaccaacc ggctgcagtc ggcagtcagt cggccaacga 61 gctccggcga acacagatcg atctttagcg atttttccga gttttgccca aatacaggat 121 gagattgttt agccttctgc cgctcctggc gcttcttgtc gtccaggttg ccggacaaag 181 tgaagtaacc tctgatgatc ctgcaacgga tgctggatca acgaccaatt ccaccacgga 241 caccaaacca aggattccca gtcaggatga gatccttggc cagatgccgt ccattaatcc 301 catccgcact ggcaatcccc agatggacgc attttacatg atgttcccgg cgttgggcag 361 cctgctcaag tggggcagcc ttttcccagc ctactcgatt ctgggcgcca ttcccgacaa 421 tctacagcct acagcggcag cctccaaggt ggtccttgtc ctggccgatg atgcgacggc 481 caagactcga gttgcccgtc aaaatccgcc accaaatcca cttggtcagc taatgaattg 541 gcccgctctg ccgcaggact tccaactgcc ttccatggat ctgggaccgc aagtgggctc 601 gtttttggcc caactgcctg ctatgcctac cgtgcctggt cttctgggtg ccgctgctcc 661 agttcctgct ccagctcctg ctcccgctgc tgctcctcct ccggcgccag ctccagctgc 721 agatcctccg gcagcacctg ttccagatgc accccaacca gccatactgg gagaagccgc 781 tctgcagaac gctttcacct tctttaaccc ggctaacttt gatgcctcca gtcttctggg 841 ccagagtgta cccacatttg ctcctccaaa tcttgatttc gtcgctcaaa tgcaaaggca 901 attcttccca ggaatgacac cggcccaacc tgcggccgct ggcacggatg cccaggcctc 961 cgacatttcc gaggtaaggg tacgtcctga ggatccgtat tcgcaagagg cacagatgaa 1021 gatcaaatcg gcacttgaaa tggagcagga gaggcaacaa caggctcagg tcaaggatca 1081 ggagcaagtg cctctcctct ggttccgaat gcccacaact cagaatcagg atgcgactga 1141 agaaaagact ctggagcatc tgcgggtcga ggccaaattg agggcattcg agcgccaggt 1201 gatagccgaa ttgagaatgc tgcagaagat cgaacttatg gctaagcaga tgaggtccag 1261 caccgcagct caaaacggag attcacccta caggatcagt tatccactga gtcggacacc 1321 cattcacaag atcacccgtg ctgatattga gcaagccctt cgggatgatt atgtccgccg 1381 attggtcaac aaggaagcgc aacgaagggc tagaaattcc ggcataaaca cccagaaggc 1441 aaatgccttg aagcgacagg ccaaatccca ggatcagact ctgtccaagg aggatatcgt 1501 ccagattatg gcgtatgcct atcgcatggc caacgaacag atggagagcg agaagggcaa 1561 gcaggacaag gtttacgcgg cctacaggac ggaacagaat ccaatgatga tgcagcaaag 1621 acaatggtcg gaggagcagg ccaagatcca acagaatcaa cagcagatcc agcaaaatcc 1681 aatgatgatg cagcaacgac aatggtcgga ggagcaggcc aagatccaac agaatcaaca 1741 gcagatccag caaaatccaa tgatgatgca gcaacgacaa tggtcggagg agcaggccaa 1801 gatccaacag aatcaacagc agatccagca aaatccaatg atgatgcagc aaagacaatg 1861 gtcggaggag caggctaaga tccaacagaa tcaacagcag atccagcaaa atccaatgat 1921 ggtgcagcaa agacaatggt cggaggagca ggccaagatc caacagaatc aacagcagat 1981 ccagcaaaat ccaatgatga tgcagcaaag acaatggtcg gaggagcagg ccaagattca 2041 gcatgatcaa cagatggcac aacagatggc acagcaaggt ctcatgatga cggagcagag 2101 gcaaaggcag tggtcagaag atcaggccaa aattcagcag gctcaacaga tggcccaaca 2161 gacacccatg atgatgccac agatgcaaca aaggcagtgg acagaggatc cccaaatggt 2221 tcagcagatg caacagaggc agtgggcaga agatcagacc aggatgcaaa tggcccaaca 2281 gaatccaatg atgcaacagc agcgacagat ggcagagaat ccgcaaatga tgcagcaaag 2341 gcaatggtcc gaggagcaga ccaaaatcga acaggctcag cagatggcac aacagaatca 2401 aatgatgatg cagcagatgc agcaaagaca atggtcggag gatcaggccc agattcaaca 2461 gcagcagaga cagatgatgc agcagacccc catgatgatg aaggagcgcc agtgggcgga 2521 ggagaacccc cagtctgtcc agcagcaagg gcccatgatg atgcagcagc agatgccatc 2581 gatgatgcag cgtgaagtag aggacgaaga taataaggca gaggatgact tagttggcga 2641 ggcgggaccc caaatggccg gagaacgaag gcaccgccag gcacaaagtc gatgccctgg 2701 gagttggcgg caacaagcgc aagaagtcca agtccaagtc ggcgccgcaa cggtaattaa 2761 ctattactat gcggcaccca gcgtccggtg gttcagagtt acggaacaag ctacggcgga 2821 aggtggctat ggatcgaatg cctatggtgt tcctcgtcca gtcaattcgt atcaaagtca 2881 aggctacagg gctgccgtgg gtaacgatga ggtagatgag atgttgcgcc agcaccagac 2941 aatggccagg acaataaacc cgaagcaacc aggcgaagtc ggtggatcgg agagccagaa 3001 gagcaactct aatcctccaa cgacgttgac accagctcca caagagcaac cgcaagagca 3061 tcgagtccac aaaaggttag cacatttcca caggtttggg caagaggcag ggttaaatgc 3121 gaccacatcc aagggttgcg gatgtggcag gttggattgt ctgtgcggca ggagttgtcg 3181 atgcggaaga agaggcttgg aatctagagt ggtttctagt agaacttcgg gaacttgcca 3241 gtgcaaagcc agtcaccgga acaagcgttg tgaatatgga accttggaga ccatcgatga 3301 gggttctctt aacgaactta gaagagaata taaactggga ctaaaagaga tcacgctgag 3361 tccagatgaa gatcccgccg aggccctgat gcgctacaat gcggcctcca ttcgcgaagc 3421 tttggagcga gccagtatgg aaccgctgga gatcggtggg gatcagtatg aagaggatgc 3481 gcaacaggag cccatggagg aggagcaact acagcatgat cccaacacag agccccagta 3541 caatcataag gactttgtgc ggctaaccac ttccacagca tctccgatca caagtaccac 3601 agaagctgcc actccaacag gcagtgactc tacttcagaa gcaactgtga cacctgaagt 3661 caccaccacc actagcacat ccacaacaac aacaaccgaa agcaccaaag atgagggact 3721 ggacatgcag caagactcac aggctgaggc tgagagttct catgtcacca agtcgatatc 3781 gaagcaggag gccganattc atcagctgca cagtattgtg gaggagctga agaatgagat 3841 ccttaaactg aatttgcgtt gcagcacgat aatatcaaac aatgtggcca aagaaccggt 3901 cacagagaag aacccacccg tggtggagga accatccaag caggaggata agccaaaggt 3961 cgaggagaaa gtgattgccg aagagcaagc cccagtggag cccgaggaag atctggaaga 4021 ggacgaggat agtacctcta ttagtacgac cacagagact ccttcaccaa gtggtagtta 4081 ctcaacaaaa cctggacttt cccttggatc gcctagagtg gatgaacaaa gtggctcgtc 4141 caataaactg gactacgacg acgataacaa ttggcaacgc attctggcca atcgtggcta 4201 tgacacggac tacctaacca aatcccatga gcgtcagttt gcccagggtc aaaacctgga 4261 gatgcccaag aactgcaatt acgatggaaa tggcagccaa gagtatggcc cctatccgga 4321 gttccaagca gatgaaccca gcacggatac ggagggcaag gctaaaaggg ctttaagtgt 4381 gaaacaacag gcgcagctgc tcaatgcggc cctaaatgat agtggaagcg attcgagcga 4441 tggcaccacc accaccacca caccctctcc gtacgcgatg aggggaaagt ttgtgagacg 4501 gaggagcacg gcaagaaggg ttcccattcc aaagatagga aaggcgagcg acgaggtatg 4561 ggtacgatct cctcgtcaag cgaaaatgcc ccaacgaccc aagaagtcca tgtccaagcc 4621 gaaaaagcag agctcgcagg tgactacgca ggccaccgtg agcagcacta agctggacag 4681 tttggtggat gtgctcaagg atctggtgcg cctgcaaatt caaaaggaga aaaaatccag 4741 cctattaaga acccagagca ataacctttc gaaaaccaaa cccaaatcta ttaagcccgt 4801 caaagtcatc aagcgaaaaa ggttgcgcag gaggcaacac aagtctattg ccacgaccat 4861 caggagtccc attcaaacga aagcataaga tttcagccca tttaactaat tcacagatcc 4921 caactaatcc tatacataat ttagatatag tataagcata ttcactattc gttcgcaatt 4981 cattcatatc catccatctt tcatttgcat atccctcgtc catatcgcca gtccatcatc 5041 agcaccatcc gaaaccgaaa tcgaaaacgc accatcatct gacccccaag tgggttccat 5101 tttcacctac ggcgagggat tgctgcatcc gtttatgggt ctgctgccgg tggagaggcc 5161 cgatgatccg tggaaccaga agccatacga tcctcaccat cctctctaca cgggcggtgg 5221 gagctacgat gcctatctga gggacggtcg ccatcgaagg gacacgcaca tcatgggtca 5281 aggtacgcag catggcatcc tgacacccgg catgttggag agacttcttc ggatcaagat 5341 ggatttccag cgtagatttc ccccatttat acaagggtat gctgaatcac catacgaatc 5401 ttacccgtgt agaggtacag cctccggttc tgggcaagat ttctaagccg aaaactaaga 5461 ccaagcccaa gaatgaggac gaacccgtct tcgaattggg agccgctgag cgtagcctgt 5521 tcgaggacga gaccaacgat tctttagaga aagatcccga accagaaccg gatgaggagg 5581 atgaccggga tgtggaagag cccagcgaga gcagtgaacc taggggattc agcagcaaaa 5641 aaagccggga tgagaacgac atcgattact tcaatttcga tgacgattga tgttgatgat 5701 tgacttttat tataattgcc aataaaatta atattataaa // LOCUS DROGLDYYR 6458 bp ds-DNA INV 14-AUG-1990 DEFINITION D.melanogaster glucose dehydrogenase (GLD) gene, complete cds. ACCESSION M29298 X07358 X13581 X13582 KEYWORDS YYRR box; glucose dehydrogenase. SOURCE D.melanogaster (strain OregonR) DNA, clone pEG25D20. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1460 to 1531) AUTHORS Cavener,D.R. JOURNAL Unpublished (1988) Vanderbilt Univ,Dept Mol Bio,Nashville,TN 37235 STANDARD full staff_entry REFERENCE 2 (bases 1460 to 1531) AUTHORS Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M., Schonbaum,C. and Xiao,X. TITLE The YYRR box: A conserved dipyrimidine-dipurine sequence element in Drosophila and other eukaryotes JOURNAL Nucleic Acids Res. 16, 3375-3390 (1988) STANDARD full staff_entry REFERENCE 3 (bases 78 to 1142 and 3792 and 4208) AUTHORS Whetten,R., Organ,E., Krasney,P., Cox-Foster,D. and Cavener,D. TITLE Molecular structure and transformation of the glucose dehydrogenase gene in Drosophila melanogaster JOURNAL Genetics 120, 475-484 (1988) STANDARD full staff_entry REFERENCE 4 (bases 1 to 6458) AUTHORS Krasney,P.A., Carr,C.M. and Cavener,D.R. TITLE Evolution of the glucose dehydrogenase gene in Drosophila JOURNAL Mol. Biol. Evol. 7, 155-177 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Cavener, 20-OCT-1990. FEATURES from to/span description pept 3838 4005 glucose dehydrogenase (GLD), exon 2 (EC 1.1.99.10) 4079 4199 glucose dehydrogenase, exon 3 4539 6088 glucose dehydrogenase, exon 4 pre-msg 502 6458 glucose dehydrogenase mRNA and introns IVS 837 3828 GLD intron A IVS 4006 4078 GLD intron B IVS 4200 4538 GLD intron C signal 394 400 GLD TATA box site 1457 1532 YYRR box tetranucleotide repeat region BASE COUNT 1782 a 1464 c 1418 g 1785 t 9 others ORIGIN Chromosome 3-84C8. 1 tatctgtctg acattaatat ccaaaaatgt aataaattca catccacaaa ttgtgaaaaa 61 aaatcaacaa cgcagttctg cagccgttcg actttatttt gccagtgctt cttaacttgg 121 ctggaaatcg ttaaactcgc aggcgcacga gcaagcagct ttttgtgtgg gttgtttagc 181 cgaaagcggt ggttgaagaa acctgtgacg tcttagccga agtcaggggt gcttaaagaa 241 agttttacaa cactagacca tattcatgag taaagggttg agtaataaaa tacataaaac 301 gtaagaaata ataataatac agattctaaa agttattagg taaaatttag accaatttag 361 acctactcat tgcaaacact caaaagctcc cgattcagac caagtttcag agagcgcagc 421 tttgcggcca gctttaagct gtctttcgtt gagttcgagc ttttcgtcag tttaaaaaga 481 ctggcgcctg ctggtcagaa gctgagtcgg taacggtctg cgtctgcgcg cagttcgaac 541 aagttgagaa agagaccaac agaaagccca tccaagtgca gtgatcaata cggtaactga 601 caaaaaccct agaagtcagg gcttaaaaac gattttgcag cgctgccagt gggtttttgt 661 gtgataaaaa aaaagcggct cagaaaaact tgctgacagc agatagcaca cacgtttttg 721 tcgtcttcgg tccattgaaa aattttcccg aggcattttc tataaggaat aaacaattaa 781 ttcaatattt aaagcataga aggaaactag acaccacatc accggactct acgatcgtaa 841 gttgatgcaa tcgtctttat ttctattatt ttctgccttt tcggtttttg cacaacccca 901 aaatccaaaa attcgcgatg tccgtttctg gcattgagga agctcaaaag atttggacag 961 ctttttggcc cgaagtctgc tggaaatttg ccattgtcat aagcccaaga aacggaattt 1021 tgttcggtct tctgatggct cacggcgata aatttactgc actttgtttg caatagctcc 1081 aattcgattc ggtttgtttt tctggcacag agcacagtat gcctcacgat ttctgctcta 1141 gactaaaact ataatcccaa atatgaaaac cgaattagct aattagttaa ggctaaagtc 1201 tgaactggaa tcttctgact aagatcccac cccttccaaa cgtggacctt acacaattca 1261 ctcaaaattc gcatagcttt atctggcgcc tttgtggcac tcataaattg cattacttta 1321 tgattattat tctaatatat gcacggttcc acattaataa gcacccgaaa aactatgcct 1381 aaacatatgg cagaagcggc aacgagataa ctgtataaca aggcccaaaa tcaagccggt 1441 atctacggta ttgttcacac tgactggctg actggctgac tggctgactg actgcctgat 1501 tgagtgactg actgactgac tgactaactg acaggcagct caaagaaacc ttcaaacgca 1561 gcaacgaaga caatgccaag ctatatcatt gaagttatcg tgcgagtgca aaacaaacca 1621 ccagaagaca actgcagctc gcgtcggttt tattcgacac ccgtgtcgct tcagtgaaca 1681 agggattatg caccatgctt tacatccgat aaagttgtta attccaaaaa aaaaagtcta 1741 attatgacag cgccattgat aatcgtgtca gataagttac acaggaagtg cgagtaatca 1801 aatgataaca agcaaaaatg tcgttaggct gattaaataa ttgtcttcat gttctcacca 1861 ttaaacaaaa aaatatttta aaagtcattg gtttctctga attttttatt taaattaaaa 1921 attctccatc tgctataaca catattttta tttttttaac gaaacattat tttaacttaa 1981 ctggctaaac tgcagataaa atcagaagtc caagtcctcc aaaaatctgt atattagaga 2041 tcatttaacc ctaataaacc aggcggaaaa taattaagtc ggggaaaatg tattgcggct 2101 atcccctttt cgtatcccta ggattttccc accaatttcc acgagccgcc acgcagaagt 2161 ggacctaaat ctgttagcta ggcaaaactt gccccgaaca acttttgatt ttctgagtgc 2221 tgcacgactt ttgcacgcgt ttttctcggc gtttgaagtt cgtctgtggc cacatttgtt 2281 gcatggcttt tactaaactg ctggaaaacg ggctgcgagg tcaactgcct ctcttttcga 2341 ttttccacgt tcgcgtgagn tgcgtatgtt ctgcgacttt gttttgactg tttttttgtt 2401 tttctatttt tttgtgggct cgcgctctgc ttcactttcc tggcatcctt caactttaaa 2461 agcgaaatgc aattgaaatg aatgaaaaat cgtgttccaa atgtcagatg cgagccccat 2521 tgtggcatcc aaaacagacg aactgtagca attgcatggt cttctcgatc tccgcttagg 2581 tgattctctt gtttcactct cggtttcagc ttgcgtttct gtttcaatta cttttctgct 2641 ttggttttgg cttcggtttc tgtttctgtt tctgtttctg ttttcggatg gatacaaatt 2701 gcaatttgat tgcttcaagc tgttcgagtt aattgaggtc aaaagcgctg tcagcatatt 2761 ccaaaaatgc actgatttcg gggatcggtt cggtttgggt ttcggtttca attttgggcg 2821 cctaattaac cgcccaaaaa ggagtgtgaa gagcaatatt ggaggcaagc aattaagatg 2881 gtttcttcac ataaaaggag cttagaacgg atcccgtttc agagctgctc gggagtcaat 2941 ctgatcagac ccgggggtca taagtcgacc accaacgaga cgaacaaatg acataattac 3001 attgcacaat actcaaccgt ggttggctac cgtaattcca tgcagcctca ttgcttctgt 3061 tttaggcatt tgctcgtcaa tttcgaaatt ccaaatagga aagaaaaggg aagccagcca 3121 gacatttttg gccgctcggg gctaaggtca tgtgaaacta tgcagaagct gccgccgacg 3181 gaacaaaagc caagctcgga tgatgagcta tgacttctgt ccgtccggcg ttctttttag 3241 acctttcgta atcgccaatg cagggggcag ttgccgtttg tttgcaaatg gcaagatata 3301 gttgccgntt gtttgcaaat ggcnnagata taaaaaatat cttttttgnt ttcaatttca 3361 tttcgntttt ttcctttcgg ccgcttatgc cgctgtgtgc atgggacaga cgacgcccat 3421 tgtctctcgg tgagtactct tattaattac agtactgctg ctggagcgtt actatgataa 3481 tgtttagtct ttgtctcagt ggacaaggtt aaacagaagt tcatgttgcg gtcttcggaa 3541 aggtttatga ctttcccaat ggctttcaat taatcagctg gaaattaaat aattaaatga 3601 ccatatcatt ttgcgattgc ttttaataaa aagagtatat tgctgacaac gtctttatcg 3661 ttctaacttt gtaaaacttt aaacaagcta gaaagagact tagaacacga acttgtagct 3721 ttatttanaa aaaaagttcg tgctctaagn cntttgtatt ttgaaacttg atcatagttt 3781 tttacctatt taatacttaa aataagaaat tcccattgat ttttttagtc tatcaacatg 3841 tccgccagcg cctcagcctg cgattgtttg gtgggcgtac ccactgggcc caccctggcc 3901 tccacatgtg gtggtagcgc cttcatgctg ttcatgggcc tcctggaggt ctttatccgc 3961 tcccagtgtg atctcgagga tccctgcgga agggccagca gtcgggtaag ctgtaaatga 4021 cgaaggccac taagagtggc tacccatatc tcaatataga tttccaacat tattacagtt 4081 tcgatcggag ccggactacg agtacgattt cattgtcatt ggcggcggct cagcgggctc 4141 tgtggtggcc tctcgactgt ccgaggtgcc ccaatggaag gtgcttttga ttgaagccgg 4201 taagggatgt gccccagcaa ctcataccca tacacatacc tacatactta agttacactg 4261 ctccggcaat tagaggcacg ttatgtggca tttgtttaga agcaattgaa gcttaacagc 4321 ggcataaact gcaacaatgg caccaacact aatgccactt gggcaagact cggcggcaaa 4381 gaagcaaccg caactgtcac gagcattatg gagcgataat ttctagtcct aatatagaac 4441 cgacaaacta atcatacact tatataaact acaaaaaact ataacaacat gatatttttt 4501 aattatgtat ttaacagtat cttaaccgtc ttacacaggt ggcgatgaac ccgtgggagc 4561 ccagataccc tcgatgttcc tcaactttat tggcagcgac attgactacc gctacaacac 4621 ggaacctgag ccaatggcct gtctgtcctc catggagcag cgctgctact ggccacgtgg 4681 caaggtactt ggcggaacct cggtattgaa cggaatgatg tatgtccgcg gcaaccggga 4741 ggactatgat gattgggcag cggatggaaa tcctggttgg gcctacaacg acgtgcttcc 4801 gttcttcaaa aagtcagagg acaacctgga tctggatgaa gtgggcacgg aatatcacgc 4861 caagggtggc ctcctgccag tgggcaagtt cccatacaac ccacctctat cctacgctat 4921 tctgaaggcc ggcgaggagt tgggcttctc cgttcacgat ctaaatggtc agaactctac 4981 tggatttatg attgcccaga tgacggcccg aaatggcatc agatatagct cagcccgggc 5041 tttcctgcgt ccggctcgca tgcgcaacaa cctgcatatc ctgctgaaca ccacggccac 5101 caagatcctt attcatccgc acactaagaa tgttctggga gtggaagtca gtgaccaatt 5161 cggcagcacc cgaaaaatcc tggtcaagaa ggaggtggtt ttaagtgctg gcgctgtgaa 5221 ctctccgcat atcctactac taagtggagt gggtcccaag gacgaactgc agcaggtgaa 5281 tgtgaggacc gtgcacaacc taccaggtgt gggcaagaat ctacacaatc acgtgaccta 5341 ctttaccaac ttcttcattg acgatgcgga cacggcgcca cttaactggg ccacggctat 5401 ggagtacttg ctctttcggg atggtctcat gtccggcact ggcatttcgg atgtgaccgc 5461 aaagctggcc actcgctatg ccgacagtcc cgaacgtccc gatctccagc tgtactttgg 5521 cggctatctg gccagttgtg cccgtacagg acaggtgggc gagttgctct cgaataactc 5581 tcggtccatt cagatcttcc cagctgtttt gaaccctcga tcgcggggat ttattggcct 5641 gcgatctgca gatcctctgg agccgccgcg tattgtggcc aattacctaa cacacgagcg 5701 ggatgtgaag acgctggtgg agggcattaa gtttgtcatc cgattgtcgc aaacgacgcc 5761 gttgaagcag tacggcatgc ggctggacaa gacggtggtc aagggatgcg aggcccatgc 5821 ctttggtagc gatgcctact gggagtgcgc cgtgaggcag aacacgggtc cggagaatca 5881 ccaggctggc tcctgcaaaa tgggtcccag ccacgacccc atggcggtgg tcaaccacga 5941 gttgcgggtc cacggaattc gcggactccg agtcatggac accagcataa tgcccaaggt 6001 cagctcggga aacacccatg cccccgccgt gatgatcgct gagaagggcg cctacctgct 6061 gaagagggcc tggggcgcca aggtctgacg cgtggatgcg acgtggacgt tgcatagagt 6121 aatttaatca gaatccaagc gaaaattaca gcgagtaaaa ttttaagcga aattacactt 6181 atggcacctc agatccaaaa ctaaactatt ttaccctaat ccaatcttat ttaatccgat 6241 cctattatga aggtttctca ctctaggcac accgcgttgc gcaatttcca ttaaagttat 6301 ctataagtta acaggctccc ttaaaactaa ctactgttac ccactgaata tgccgtcaaa 6361 cttcttgaac agcacactca gctcaaatta aagacaaata aaacttatta ggtatctagt 6421 aaatttacta aaattccaaa ttaaatttat gtgaattc // LOCUS DRPGLDYYR 7382 bp ds-DNA INV 14-AUG-1990 DEFINITION D.pseudoobscura glucose dehydrogenase (GLD) gene, complete cds. ACCESSION M29299 X07359 KEYWORDS YYRR box; glucose dehydrogenase. SOURCE D.pseudoobscura DNA, clone pBy2.1. ORGANISM Drosophila pseudoobscura Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 2131 to 2198) AUTHORS Cavener,D.R. JOURNAL Unpublished (1988) Vanderbilt Univ,Dept Mol Bio,Nashville,TN 37235 STANDARD full staff_entry REFERENCE 2 (bases 2131 to 2198) AUTHORS Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M., Schonbaum,C. and Xiao,X. TITLE The YYRR box: A conserved dipyrimidine-dipurine sequence element in Drosophila and other eukaryotes JOURNAL Nucleic Acids Res. 16, 3375-3390 (1988) STANDARD full staff_entry REFERENCE 3 (bases 1 to 7382) AUTHORS Krasney,P.A., Carr,C.M. and Cavener,D.R. TITLE Evolution of the glucose dehydrogenase gene in Drosophila JOURNAL Mol. Biol. Evol. 7, 155-177 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Cavener, 20-OCT-1990. FEATURES from to/span description pept 3959 4126 glucose dehydrogenase (GLD), exon 2 (EC 1.1.99.10) 4203 4323 glucose dehydrogenase, exon 3 5530 7079 glucose dehydrogenase, exon 4 pre-msg 777 > 7382 GLD mRNA and introns IVS 1136 3948 GLD intron A IVS 4127 4202 GLD intron B IVS 4324 5529 GLD intron C site 2129 2200 YYRR box tetranucleotide repeat region BASE COUNT 1983 a 1667 c 1611 g 2121 t ORIGIN Chromosome 3-84C8. 1 gaattccaat catattaatt ttaatatcat gaagcacacc aggagttttc ttatctattt 61 gttttttttt ttcaaagcat acatgactct acttttcttt ggtttcatat cactttttgg 121 aagttctcaa ctctcaaata caaagtatat tctgaacaag tattattaca tattactatc 181 ttactttttt atcaaaacct ttacaaatat ttttccactt tcaatgccca ggcttggcga 241 aaacaacaat actagaacgt agttccactc agaggtactt tattttacct ctgttcggtc 301 cgcttggaaa tcattaaacc acgcagggcc tctccgcgga aagagtgcct ttttgtgtgg 361 gctgtgtgct gtctgagagc gggggttgaa gaaacctgtg acgtcttatt gaagcatagg 421 ggtggctggc tgccataatc tggagttttg cagcactgtc aaagctgtat gagtcatcaa 481 atattcagta aagttcttat agaactaaag cttttaatgc atagttttta ttttgattgt 541 tatttttaga agttttagaa tgaattgtat ttgtttttac aaaaagaaat ctctcttttg 601 attaacagca aagcccaact taacgaagcc aaacgacgca gcgcgctagc aagcttgcca 661 accagcttta agctgccgct tcgtcaaatg aaaccttcgt taagccaaaa gcttacctca 721 gtggcgcagc gcggcttttg ggaagcttaa aaagcctcgc gctcgccggc cagagactga 781 gtcggtaacg gtctgcgtct gcgcgcagtt caaaacaggt tgaaaaaaac gaggcccagt 841 gcttaaaaga agggaacact aaaaacccta gaaggcagcc caacaaatac cgttgtacgc 901 atgagtgtct ttgtgttgtg tccaagaaaa taagaagaag aagaaactgc aacaagaaac 961 gtttaaataa attgtttaca attgtgcatt ttttacgagt cgaacagttc gtttttatca 1021 tctccatgcc attgagcaat tggcctagag gcatttctat ttttcaataa ggaataaata 1081 attaaatttt caaaaattta acaaacagaa aaggaaacct gaaccgacat agaccgtaag 1141 ttgatgcaat cgcctgatgc ccgatacctg atggctgatg cctgatgcct gagtttttct 1201 tttatttcgc tattttctgc gtgtttcggc catgtccgtt ttggcattga ggaagctaaa 1261 cgaaaactgt tttggccagc ctctgcctct tggactgctg gaaattttac cattgtcata 1321 agcccaagac ccaagaaacg gaacttcaac agagtcttgt gttgtgtcta ttggtcttct 1381 gatgcctcat ggcgataaat tgagttgcac ttttattgta tttttgtttt ttgcaatagc 1441 tccaattcca attcgattcg ttttgtttgc cgtcagtgta catgtctcac gattttactg 1501 cctctagaag accaaaaccc gttccagttc atagttggag gtccattggc caaatgaatg 1561 actacaattt gccgcaatgt gtttgagcat taaatttgct atagttttta tccactgatt 1621 atgttccagt ttttaagcgg aggggaagtt gaaaattgct ggtctgcaaa ttgactccaa 1681 agtagatgtg aaatttcttg aaatgatgtg aatttttctc agtttaattg aatgatggat 1741 tgccaatgga tgccaatgaa gttcttcaaa ttattgggtg tgaaagtctc acaaattttg 1801 tgaatttgat ggggaagcta ttattatgac aggtccaaag atctcacccc atccaaagag 1861 ccaaagacag agcccttaca caatccccgc actataatca catggtttta tctggcgcct 1921 ttgtcatggc acgcataaat tgcattactt tatgattata attctaataa atgcacagtt 1981 cccacattaa taagcgccct cgccctcccc tactcagaaa ccatcgctaa acatatggca 2041 aggaggtaca actaaaactc caaccaaacc caaaaaaact ctatctctgt ctgcggtatt 2101 gttcacaccc ctgagcacaa gaccgacata ctgactgact gactgactgg ctgactgact 2161 gactcactaa ctgactgatt gactgactgg cagactgaca actcaaagaa accttcaaat 2221 gcaacggcag caatgaagac aatgccaagc gatatcaata caaaaaccac aagaagacaa 2281 ctgcaacgca actacgatta taatcgacat gcgggatcgg atcgggttgg atcgcttcag 2341 tgcacaagat attatgtaca tccgataaaa ttgttaatac aaaaaaatca agctgttcat 2401 aattatgaaa aaaaactcac agataacatt gttgtcgcaa aaggaagtgt gaataggcca 2461 aagaagaggc acaaaaagtg tcattaatta attgaataat tacagagaca tgacagagag 2521 cagcacttgc ttaggatttc ccttaagttt tccatagatt tccatgccac aatactggaa 2581 aagcaaaact tgtcccgaac aacttttaat tttctgattg ctgcacgact tttgcacgcg 2641 tttttctttg gcttttgtga agttcgcctg tacttgtacc cgtagccaaa gcctgcctgg 2701 aaaacttgtt gcgaggtcaa ctgcctctct taaagccaca gttgcgtatg cactctggct 2761 ttagtttggc tttgctttac tttcgtggaa tccttcaact ttaaaacaag aaaaggacac 2821 caaaatccca ttcaaattga atgaaaaatc gtgtttcaaa tgtcaaatgc gaacaccaaa 2881 gaaacaaaaa aaaaacagac gaactgtagc aattgtagca tgatcttctc tggtgattct 2941 ctggctctga ttttttgggt ttcattttca gtttcgggct ttggtggttg tttttgtttt 3001 tggcttccac gtttggtttg gttgcaaatt gcaatttgat tgctttaagc cgtttcgatt 3061 taattgaggt caaaaacgtt gtcggattct tcccaaaaat gtgctgattt gtgggttggg 3121 ttttgggctt taaaagtaca tatttgtcta attgtcatgc ctggatgtgg tatgtagcaa 3181 ctttatcggt ctccccaaaa ctctcactta aaaatcactc aaatctcacc cgcccagagg 3241 tcatcaaaac gtttgcaaat gacataatta taatggacat tactcaatct ctcattgctt 3301 cttgttttag gcagtaggcc ccagcagaga gtacgagtaa tttgaaattt caaataggaa 3361 aagagccaga gcccgagcct tggccgctcg tccggggcta aggtcatgtg aaactatgca 3421 gctgactgaa caaaagccaa gagctaagac ccaaccaaga tgatgagctc tgtctgtctg 3481 acttggcctt ttggtaattg ccaatgctgc tgctgctgct ggccagttgc cgtttgccaa 3541 atgcccaacg atatcaaaaa agtatctttt tctgatttta attccatttt attcagcttt 3601 tgtttttact tcttcgactg cgatgacgcc cattgtctgc tgcggttggt acttttatta 3661 attgcagtac tgctgctgtt gctggagaat tacttagatg attttttgtc tttgccatgc 3721 cccatggatc ggcaaggtta aacagaagtt tggaattgcg gaatttggaa ttatatttat 3781 gattttgccc tttggctttg gttttgtgtt taatcaacgg caaattatta aaacaaacaa 3841 aaatacgagt agatacattt aaatattgaa ttataatttc tgaagctttg tttctcaagt 3901 tcgtttctta aatctttcgt tttctgctct tttcattcca cagcacagcc ttatcataat 3961 ggccaccagt ccatcctcct gcgattgttt ggtgggcgta cccacggggc ccaccctcgc 4021 ctccacctgc ggcggcagcg ccttcatgct gttcatgggc cttttggagg tcttcatacg 4081 ctcccaatgc gacctcgagg atccttgcgg acgggccagc acgcgggtaa gccaaagatc 4141 cgtatatggt caagacgtgt ggctaaccag atcacattca atccaatcca ttttgttggc 4201 agtttcgttc ggagccggat tacgagtacg atttcattgt cattggcggc ggctcggccg 4261 gctctgtggt tgcctcacgt ctctccgagg tgccccagtg gaaggtgttg ctgatcgaag 4321 ccggtaagtg ttgttgtttc gtctctcgcg tgtgtgtggg tgtgtgtgag tgttgcatgc 4381 aacatgttgc cagccgcagt gtgctcacgg gcaattagat gcacgttatg tatgtgcatt 4441 gtttagaagt atattgaagc ctaacagcac cagcagcagc agcagcacca gcagcagcta 4501 agacttgggc aaccgcaact ctgccacgag cattatgtaa actaaacgag ctgctgccac 4561 tgccacctag agcagcggct ccattgctgc ccccaaggca gttccccatg caaatgtctt 4621 cttgctatat aggatttgct gcatgccccc gccaatgcct catggaggtt ttttacgttc 4681 cgcttgactt gacttttttg cattggaata attgttggtt ttttgtacct tacatttttt 4741 tatgaatgaa ttataatgca tgaacaatac ggggaacgtg gaacttcgga acgagtgtgt 4801 agggcattca cttttcctta acgagaagca aatatgcggt aaactctgct gaagaggctt 4861 tttagcactg aataatttta catttaattt gcgatatttg aaaactggat ctatagcacc 4921 aggttttaga ggaaagcatt agcaaatcac ttaaatcttt gatggaatat attacaatcg 4981 tttaaaaccc aaaaaacaaa acgaaaatcc ctctatttgg tcggagcacc aaaaaacacc 5041 tcattaaaca tcataatttt tcgccattca tcactaataa cttaattagt tattcaggtg 5101 tctagaaaat cacgttaata cctctctcta tcgctgggaa aacattctca tagatcccat 5161 tagacccgtc ttatcagcat tcttctttct ggttggctta ctttgtgggt taccgtctat 5221 tcgtcgtttt caattgtata cgtgacaatt tttatgccca gtcactaatt acactaattg 5281 cataatgaaa ccctcgagga aagagaaaca ttttcaaagt tgtcaatcgg gttggcggta 5341 cttatacaat gccccagtca ggctgcagtt aagtgcaatt agttgtggcc attacttggc 5401 ttattgtcta tgaaaactac actttcaaag actccaaact catgctcatc acaagtatgg 5461 tttaatcttg ggttgacatt agacatgcaa ttcatcaaat aaaaattcgt attgtttgta 5521 ccatttcagg tggcgatgag ccggtgggtg cacagatccc ctccatgttc ctgaatttca 5581 ttggcagtga cattgactat cgctacaata cggagcccga gaggatggcc tgcttgtcct 5641 ccatggagca gcgttgctac tggcctcgcg gcaaggtcct aggcggcact tcggtgatga 5701 atggcatgat gtacatacgc ggcaatcgcg aggactacga cgaatgggcg gcccagggaa 5761 atccgggctg gtcctaccag gatgtcctgc cgttcttcaa gaaatcagag gataatctcg 5821 aattggatgc cgtgggcacc gaataccatg ccaagggcgg ccttttgcca gtgggcaagt 5881 tcccctacaa tcctccactg tcttatgccc tgctgaaggc cggcgaggag atgggcttct 5941 cggtgcagga tctaaacgga cagaattcga caggcttcat gattgcccag atgacggcca 6001 ggaatggcat tcgctatagt tcggccaggg ctttcctaag acccgctcgc atgcgtaaca 6061 atctgcacat attgctgaac accactgtga ccaaggtgct catccatccg ggaaccaaaa 6121 acgttgtggg tgtggaggtt agcgatcagt ttggcagcat gcgcaagatc cttgtcaaga 6181 aggaggtgat cgtcagtggt ggtgcggtga actcgccaca gatcctgctg ctgagtggcg 6241 taggacccaa agaggatctg caaaaggtga atgtccgtcc agtccatcat ctcccgggcg 6301 tgggaaagaa tctccagaac catgtggcgt acttcacgaa cttcttcatc gatgatgcgg 6361 atacggcgcc cctcaactgg gccacggcca tggagtactt gctcttccgc gatggcctga 6421 tgtcgggcac tggcatctcc gatgtcaccg caaagatggc cacccgctgg gccgatcgcc 6481 ccaatctgcc cgatctgcag ctctattttg gcggctacct ggccagctgt gcccgcaccg 6541 gacaggtggg agaactcctc tcgaacaact ctcgagccat acagatattc ccggctgtac 6601 tcaaccccaa gtctcgtggt tacattaccc tgcgctcggc cgatcccctg gatccccccc 6661 gcatctttgc caactatctg acggatgagc gcgatgttaa gaccctggtg gagggcatca 6721 agttcgccat aaggctgtcg cagacgtcac ccttgaagca gtacggcatg cggctggaca 6781 agaccgtggt gaagggctgt gagtcgcacg ccttcgcgag cgatgcctac tgggagtgcg 6841 ctgtgcgcca gaacaccggg ccggagaatc atcaggcggg ctcctgtaaa atgggtccct 6901 cccacgatcc catggcggtg gttaatcacg agctgagggt ccatggcgtg cggggactgc 6961 gcgtgatgga caccagcatt atgcccaagg tgacggcggg caatacccat gcaccggcgg 7021 tgatgattgc ggagaagggc gcctatctgc tgaagcgggc ctggggtgcc aaggtctgac 7081 gcgtggatgc gacgtggacg ttgcatagag taatttaatc aaattccaac gaaaattact 7141 acaaattcaa atttaagcga aaattattat taaggaagct ctgaactttt tctataccta 7201 ttataaaacc tttttggccc gatccaatcc gatctgaacc catgccccac tctaagcact 7261 ctgcgttgcg caatctacga ctcgacaatc gctctcgata agtttaacca tctaacagta 7321 ctattattta aactaattct tgttaaaaac ccattgaaat atcgaaattt cgaatatcac 7381 tc // LOCUS DRVGLDYYR1 945 bp ds-DNA INV 14-AUG-1990 DEFINITION D.pseudoobscura glucose dehydrogenase (GLD) gene, 5' end. ACCESSION M29300 KEYWORDS glucose dehydrogenase. SEGMENT 1 of 2 SOURCE D.virilis DNA, clone pCG79. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 945) AUTHORS Krasney,P.A., Carr,C.M. and Cavener,D.R. TITLE Evolution of the glucose dehydrogenase gene in Drosophila JOURNAL Mol. Biol. Evol. 7, 155-177 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.R.Cavener, 20-OCT-1990. FEATURES from to/span description pept 943 > 945 glucose dehydrogenase (GLD; EC 1.1.99.10) mRNA 545 > 945 GLD mRNA BASE COUNT 325 a 163 c 189 g 268 t ORIGIN Chromosome 3-84C8. 1 gcatcctcgg taaaaagtgc acttcgagga tattttttgg tgatatttaa attttttcgg 61 atttgcttca aacgtattat ttaaagctat acataaaaac gttatgaata cgtgttttaa 121 cgatttggaa aatttaaacc ggcaacattg gtaaatttgg ggaaaacgtt tgtatgaaag 181 ttgtttactt accgtccgat cagttttaaa atcattttca aagctatacc cgttagttta 241 atatatatta ttgatcgttt tcattcaaag tatatatttc gttaacgctc gtaactttaa 301 tgtttgcatg ttaataaata gtcatttcaa tagttagaat atgtatttat attaaattaa 361 aaaaataaat attatgccag caagcaattg gagagcgaca agcgagcttt aagctacaca 421 acgtgaggcg cagctggaac cagttggctc ccgaaccaac tgaaaaacga accagccaac 481 agctcaagct tcgtattggc gtttttgggg ggagtttaaa aagacgcgcg ctgccggcct 541 ttcactgagt cgacgccggt ctgcgattgc gcgcagttca aaaccaagtt gctaaaatca 601 tagccaggcc taaactgcat gcagacttgt gagctgcccg ttaagccaaa gaaggcaaaa 661 ctaagtactg tgcgtgtgtt actgtgctac tgtgtgtgct aacgaatgaa aaagaaaaaa 721 aatcaccaaa aaataaaccg aaaaacattt aaaaaatagt ttttgtcaaa ttattgtgca 781 cgtcttcggt ggttgtgcaa tttgccaaaa aggcattacc gataaggaga ggcacaataa 841 gaaaataaaa gccaaaagag ctaattgttt gttttgcggc aaaccgaaaa ggaaactggc 901 caacaactgg acacgggcat agaaaacttg aattaataca acatg // LOCUS DRVGLDYYR2 50 bp ds-DNA INV 14-AUG-1990 DEFINITION D.virilis YYRR box located in intron-1 of Gld. ACCESSION X07360 KEYWORDS YYRR box; glucose dehydrogenase. SEGMENT 2 of 2 SOURCE D.virilis DNA, clone pCG79. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 50; enum. 1 to 50) AUTHORS Cavener,D.R. JOURNAL Unpublished (1988) Vanderbilt Univ,Nashville,TN 37235 STANDARD simple automatic REFERENCE 2 (bases 1 to 50) AUTHORS Cavener,D.R., Feng,Y., Foster,B., Krasney,P., Murtha,M., Schonbaum,C. and Xiao,X. TITLE Title is missing from this entry JOURNAL Nucleic Acids Res. (1988) In press STANDARD simple automatic COMMENT *source: library=Charon 30; clone=pCG79; EMBL features not translated to GenBank features: key from to description RPT 3 50 tetranucleotide repeat SITE 3 50 CTGA motif BASE COUNT 15 a 14 c 11 g 10 t ORIGIN About 2 kbp downstream of segment 1. 1 aactgactca ctgactcact gactgactga ctgacggact gacagactga // LOCUS HUMSYB1A1 727 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 1. ACCESSION M36196 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 1 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 727) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept 721 + 722 synaptobrevin 1 (SYB1), exon 1 IVS 723 > 727 synaptobrevin 1 intron A signal 544 549 TATA box BASE COUNT 130 a 217 c 257 g 123 t ORIGIN Chromosome 12. 1 aagcttcgtg cttgctttgg tgggacggaa ccccgcatcc ggcatctggc accgggagct 61 tgtggctgtg tctcgaggct cctgcaccct ctcagagcgt tagtcagagg cggggcggtg 121 gtgcggagcg agggccagag cagggcacca gggctgggcg cgcaccagga agccgccgcc 181 tccagctgaa tgcgcgagtc ccggcgcggg gtggggcagg ggcgggcacg gtcttcagta 241 actgcctagc gattggttgg gaggcaccac aaatgcctaa caaactgctg aagcgccctc 301 cgaagggaca gacctcatct ccaccccgcc ccgtccctcc gccagactgg acgcaaacta 361 tggtcgcatt gatcactgaa ggatcctcac agcaaccgct cctttccgga gtcggatgag 421 aggagagttg tgactggcaa ttggcagggg cggggcgggc taggcctgta gcgctgggcg 481 accgtcctgg gcatggattg ggccgcgggg ttgtcaccgt tatccgggag gcgtggtcag 541 cactaataaa ggcggaggcc ggcgcggcag ctgcagtaag ttccagcgca cgtagaccgc 601 ggggtggtcg gcgcgaggcg gagcttggca gttccgtcca cttcagccgc agcgtccctc 661 gccgggtgtc tcgccgcagc ctccggagag gaacagaccc tcactctctc tgtcagaaaa 721 atgtgag // LOCUS HUMSYB1A2 153 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 2. ACCESSION M36197 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 2 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 153) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 22 + 148 synaptobrevin 1 (SYB1), exon 2 pre-msg < 1 > 153 SYB1 mRNA and introns IVS < 1 21 SYB1 intron A IVS 149 > 153 SYB1 intron B BASE COUNT 37 a 49 c 42 g 25 t ORIGIN Chromosome 12. 1 tctgcacatg tctttccaca ggtctgctcc agctcagcca cctgctgaag ggacagaagg 61 gactgcccca ggtgggggtc cccctggccc tcctcctaac atgaccagta acagacgact 121 acagcaaacc caggcacaag tggaggaggt agg // LOCUS HUMSYB1A3 186 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 3. ACCESSION M36198 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 3 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 186) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 23 + 181 synaptobrevin 1 (SYB1), exon 3 pre-msg < 1 > 186 SYB1 mRNA and introns IVS < 1 22 SYB1 intron B IVS 182 > 186 SYB1 intron C BASE COUNT 50 a 36 c 60 g 40 t ORIGIN Chromosome 12. 1 tttgtttctg tgccctcctc aggtggtgga catcatacgt gtgaacgtgg acaaggtcct 61 ggagagggac cagaagctgt cagagctgga tgaccgagct gatgccttgc aggcaggagc 121 atcacaattt gagagcagtg ctgccaagct aaagaggaag tattggtgga aaaactgcaa 181 ggtgaa // LOCUS HUMSYB1A4 79 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 4. ACCESSION M36199 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 4 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 79) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 23 + 74 synaptobrevin 1 (SYB1), exon 4 pre-msg < 1 > 79 SYB1 mRNA and introns IVS < 1 22 SYB1 intron C IVS 75 > 79 SYB1 intron D BASE COUNT 16 a 14 c 19 g 30 t ORIGIN Chromosome 12. 1 ttttttgtcc atgtttcctc agatgatgat catgctggga gccatctgtg ccatcatcgt 61 ggtagttatt gtaagtaag // LOCUS HUMSYB1A5 428 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 1 (SYB1) gene, exon 5. ACCESSION M36200 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 5 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 428) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 22 38 synaptobrevin 1 (SYB1), exon 5 pre-msg < 1 > 428 SYB1 mRNA and introns IVS < 1 21 SYB1 intron D BASE COUNT 85 a 139 c 89 g 115 t ORIGIN Chromosome 12. 1 cttcctgatt cctgtgtcca gtctactttt ttacttgaga atgtacccac cccttccctg 61 ttgtccattg ccatccacat tcatgtcctc tgccctctgt ttgctctctc aacacacttc 121 cccacccacc gtcctccatt ccagcccagg cttctccatc acccattcct cctttttcgt 181 tgcgttcatt tgcactctgt ccctcaacac tagaaatgct gctcgtggca caatctaagt 241 cattacccga agagcaacag tggcgcctcc tccctgcctg ctttttctgt actctcaagt 301 tcccccaaag ccccaaagag ttggaggcca agggaagggg cagggagggg agtggctgag 361 gcgaagtacc catgaagctg cccagacttg ggaggagaag agtatcggtg cccatggtga 421 cttctaga // LOCUS HUMSYB2A1 954 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 1. ACCESSION M36201 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 1 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 954) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept 948 + 949 synaptobrevin 2 (SYB2), exon 1 IVS 950 > 954 SYB2 intron A BASE COUNT 125 a 361 c 305 g 163 t ORIGIN Chromosome 17. 1 ctgcagtcgg gtcgctgacc taagcctggg gcaaagctag gtcttggcca gtcagcagcc 61 accacgtgaa gcagcccctg cccccatcac ccccggcagc cttgtgccct cctcctcggg 121 gtcctcagct caccccacct cagtcgggcc tgctccagcg gttaagtacc cctagtctga 181 accttaatcg tgaggcgggg cagccccctc cccacaccct gtagacactc ccccgtcctc 241 cgccccctgc ttttcttcct accctcctac ccagtggcca gggaccccag gcttcctctg 301 agacatccac cagcggtgat gggttgggac caatctcccc cactcctctc caccctagca 361 gtaaggtccc cgctctcgat actgccccag cacctgcact ggccctctct gatttcccaa 421 ctccagccca ttccaagtgc tgaaggcggg tcgctaggtg ctgggacctg ggcgcggggc 481 aagaccttgg gctgtgtggg gtgcaggctg ggtttactgg gttcacgatt gagggagtgg 541 tctgggggtt aggtgtaggg ccaggctgcg ttgtgctcta aggggcaggc ggcaggcggt 601 ttgggggggc tgtggtcaga cagggtgggg tggcggtggg tttggggaag ccgcagcagc 661 ccgccttggg gtcccgcgag tgccgaggga gctgccgggg catggtggcg cggggcccgt 721 ggtcctgccc ggcacgaccc ccgccccgca gcgccccgca gcgccccgcc cccgcggccc 781 cgctcgcgac gcgtctcccg cggcgccccg cccccgcccg cagtgcccgg atgcgggtga 841 cgcgcggccg ccatctttcc gtcccgggca gccagcgcca gtcggagcca gcgcgagcgc 901 cgccgccatc actgccgctg ccaagtcctc cacccgctgc ccccgccatg tgag // LOCUS HUMSYB2A2 148 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 2. ACCESSION M36202 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 2 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 148) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 23 + 143 synaptobrevin 2 (SYB2), exon 2 pre-msg < 1 > 148 SYB2 mRNA and introns IVS < 1 22 SYB2 intron A IVS 144 > 148 SYB2 intron B BASE COUNT 25 a 58 c 41 g 24 t ORIGIN Chromosome 17. 1 ttaatccgtc cctgctcctc aggtctgcta ccgctgccac ggccccccct gctgccccgg 61 ctggggaggg tggtccccct gcaccccctc caaacctcac cagtaacagg agactgcagc 121 agacccaggc ccaggtggat gaggtgag // LOCUS HUMSYB2A3 186 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 3. ACCESSION M36203 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 3 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 186) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 23 + 181 synaptobrevin 2 (SYB2), exon 3 pre-msg < 1 > 186 SYB2 mRNA and introns IVS < 1 22 SYB2 intron B IVS 182 > 186 SYB2 intron C BASE COUNT 47 a 49 c 61 g 29 t ORIGIN Chromosome 17. 1 gcctcgtggg cccttctccc aggtggtgga catcatgagg gtgaacgtgg acaaggtcct 61 ggagcgagac cagaagctgt cggagctgga cgaccgtgca gatgcactcc aggcgggggc 121 ctcccagttt gaaacaagcg cagccaagct caagcgcaaa tactggtgga aaaacctcaa 181 ggtaag // LOCUS HUMSYB2A4 77 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 4. ACCESSION M36204 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 4 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 77) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 21 + 72 synaptobrevin 2 (SYB2), exon 4 pre-msg < 1 > 77 SYB2 mRNA and introns IVS < 1 20 SYB2 intron C IVS 73 > 77 SYB2 intron D BASE COUNT 16 a 24 c 15 g 22 t ORIGIN Chromosome 17. 1 tcctcgcctc tcacccccag atgatgatca tcttgggagt gatttgcgcc atcatcctca 61 tcatcatcat aggtgag // LOCUS HUMSYB2A5 146 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human synaptobrevin 2 (SYB2) gene, exon 5. ACCESSION M36205 J05611 KEYWORDS intrinsic membrane protein; synaptic vesicle protein; synaptobrevin 1. SEGMENT 5 of 5 SOURCE Human DNA, clone pcVP18-[1,5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 146) AUTHORS Archer,B.T.III., Ozcelik,T., Jahn,R., Francke,U. and Sudhof,T.C. TITLE structure and chromosomal localizations of two human genes encoding synaptobrevins 1 and 2 JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Sudhof, 06-JUL-1990. FEATURES from to/span description pept + 27 43 synaptobrevin 2 (SYB2), exon 5 pre-msg < 1 > 146 SYB2 mRNA and introns IVS < 1 26 SYB2 intron D BASE COUNT 24 a 65 c 19 g 38 t ORIGIN Chromosome 17. 1 tcatcttctt ccccctctct ccacagttta cttcagctct taaatccccg aggagtctgc 61 cctgcctaga gaagggcctc tcccccaacc ctcagccgtt cctccacctc tcagccatat 121 ctttcagccc ccactcccct ggatcc // LOCUS SPICPATP 4066 bp ds-DNA ORG 14-AUG-1990 DEFINITION Spinach plastid genes atpI-H-F for ATP synthase CF(O) subunits IV, III and I. ACCESSION X03775 M35501 KEYWORDS ATP synthase proteolipid; ATP synthetase; ATP synthetase CFo genes; atpA gene; atpF gene; atpH gene; atpI gene; synthetase. SOURCE Spinach chloroplast DNA. ORGANISM Chloroplast Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae; Spinacia oleracea. REFERENCE 1 (bases 1793 to 2048) AUTHORS Alt,J., Winter,W., Sebald,W., Moser,J.G., Schedel,R., Westhoff,P. and Herrmann,R.G. TITLE Localization and nucleotide sequence of the gene for the ATP synthase proteolipid subunit on the spinach plastid chromosome JOURNAL Curr. Genet. 7, 129-138 (1983) STANDARD simple staff_entry REFERENCE 2 (bases 1 to 4066) AUTHORS Hennig,J. and Herrmann,R.G. TITLE Chloroplast ATP synthase of spinach contains nine nonidentical subunit species, six of which are encoded by plastid chromosomes in two operons in a phylogenetically conserved arrangement JOURNAL Mol. Gen. Genet. 203, 117-128 (1986) STANDARD simple staff_entry COMMENT EMBL features not translated to GenBank features: key from to description OPR 1369 1414 pot. stem-loop structure OPR 2049 2079 pot. stem-loop structure OPR 3775 3799 stem-loop structure pot. transcription terminator INVREP 2319 2326 inverted repeat A INVREP 2334 2341 inverted repeat A' INVREP 2522 2534 inverted repeat B INVREP 3348 3359 inverted repeat B' SITE 2602 2604 pot. translation stop codon SITE 3312 3314 pot. translation stop codon Between positions 2584 and 3340 six stem-loop structures I(1)-I(6) are described. They designate secondary structures resembling those of class II introns FEATURES from to/span description pept 357 1100 CFo subunit IV (atpI) pept 1793 2038 CFo subunit III (atpH) proteolipid pept 2434 2578 CFo subunit I (atpF) 3343 3752 CFo subunit I (atpF) pept 3818 > 4066 CF1 subunit alpha (atpA) pre-msg 2434 3749 primary transcript of atpF gene IVS 2579 3342 put. intron (no splice consensus) binding 348 351 atpI ribosome binding site binding 1780 1783 atpH ribosome binding site binding 2423 2426 atpF ribosome binding site binding 3810 3813 atpA ribosome binding site signal 206 211 atpI -35 region signal 241 246 atpI -10 region signal 1693 1698 atpH -35 region signal 1710 1715 atpH -35 region signal 1724 1728 atpH -10 region signal 1744 1749 atpH -10 region BASE COUNT 1362 a 591 c 777 g 1336 t ORIGIN 1 ctatttgttt aattgataca aattgtaacc cggatctcgc agctctttcg attccggcaa 61 atgatgacgc tatagcttca attcgattaa ttcataccaa attagtattt gcaatttgtg 121 aaggccgctc tagctatata agaaatcctt gattcataat aaaatcaaaa attgacttcc 181 taaattctct atcggttact agattttgaa tctcaaaaac tagttcaaaa taacagggga 241 tattatgtaa ttaatcagta tccgaaatag aaaattttaa ttaaagtaga caagtcgaga 301 aagagctgat tgaatcaaaa taattttttt aagttatatt tctgtaagag gacaatatga 361 atgttctatc atattcaatc aacccgctaa aggggttata tgctatatcg ggtgtggaag 421 taggtcaaca tttttattgg caaataggag gtttccaaat ccatgacaag gcccttataa 481 cttcttgggt tgtaattgct atcttattag gttcagctgc tatagctgtt cggagtccgc 541 aaacaattcc gactggtggt caaaattttt ttgaatatgt ccttgaattc atccgagacg 601 tgagcaaaac tcaaattggc gaagaatatc gcccgtgggt tccctttatt gggactatgt 661 ttctatttat ttttgtttct aattggtcag gggctctttt accttggaaa atcatacagt 721 tacctcatgg ggagttagcc gcacccacga acgatataaa tactactgtt gctttagctt 781 tactcgcgtc agtagcctat ttctatgcag gtcttacaaa aaaaggatta ggttattttg 841 gtaaatacat tcaaccaact ccaattcttt tacccattaa catcttagaa gatttcacaa 901 aaccgctatc acttagtttt cgacttttcg gaaatatatt agcggatgaa ttagtagttg 961 ttgttcttgt ttctttagta cctttagtgg ttcctatacc tgtcatgttt cttggcttat 1021 ttacaagtgg tattcaggct cttatttttg caactttagc cgcagcttat ataggcgaat 1081 ccctggaagg tcatcattga tttgtcttag gaagagttta tctcctagtt tagatatatg 1141 tgtatgtgtg gctcaagata ctctataaag ataatctatt tagagcatat aaatatccaa 1201 atacatacag tctagtggta atagaaaaaa cgatattcga gaagtgtaaa aaaaaaagac 1261 gttggttagt cgagagggga tacccctgtt atatggaatc taatgactat aagctaattc 1321 ttgcagattc gatgtttcga agaatgttca aaaattcgat tgaatttcaa atataatagg 1381 cggtttacgt tatgtaagaa acctatgtat attttatatt agatattgac aagttatata 1441 tgaaagaata tttaatttgc cctacttgaa tttggataga gacaccaacc gacgaagtcc 1501 tttcctattc gtttatgact gcgaattgaa tggataaaca gacaaaatat aaaaaaaggt 1561 cgaagaatga ttagaaaaag aaaaaggaaa tggaaaaact caagttgtat tgattcagaa 1621 agactcaaca atataggaac taaaaaaatc aagtctttct aattatttaa taatattaat 1681 tattatttca actggataaa tattagcaat ggaataatta agtcataatg cattggttga 1741 ttgtatcatt aaccatttct tttttttttt tgtgtgtgtg aggaacttta tcatgaatcc 1801 actgattgct gccgcatccg ttattgctgc tggattggct gtagggttgg cttctattgg 1861 acctggagtt ggtcaaggta ctgctgcggg acaagctgta gaaggtattg cgagacagcc 1921 cgaagcagaa ggaaaaatac gaggtacttt attacttagt ttagcgttta tggaagcttt 1981 aacaatttat ggattggttg tagcattagc gcttttattt gcgaatcctt ttgtttaatc 2041 cgaaaaggaa aagaaatagg agaaatacac atttctttta tagtcttgaa cttgcaggtt 2101 gctttttcac atttataaga aaatatcgct cctacacaat tacttattcg ttgagaaaat 2161 aatacacgga aaggacttaa tttgaggatg aagaattcgt gttacccact cggtttcttc 2221 tttcctcccc ctttttagtc cgaaggagaa gtgttgcaac aaagagagta tttcgaaatt 2281 cacatgaaac ctagtaccta attagtaatt ctataaatcc aataagtatt attcttattg 2341 ggaatctcaa taaaaaaaat aaaattcatt tagaagtagc aaacaattga aataatacaa 2401 cgattttttt gtttatctat aagaggagat catatgaaaa atgtaaccga ttctttcgtt 2461 ttcttgggtc actggccatc cgccgggagt ttcgggttta ataccgatat tttagcaaca 2521 aatctaataa atctcagtgt agtgcttggt gtattgatct tttttggaaa gggagtgtgt 2581 gcgggttgtt tatttcaaaa ataggttgga ttcaaccaac tgtaccgctt tttttaatta 2641 gggcgaaagg tgcatgattt cacgaatgac ttctgaaatt aataataaag aaatcatatg 2701 taagaactag agcatttcgt gatttgttgg taaatatact ttgattctct atcaaccaat 2761 aatgtgggac cataacatgg ttaaaactta aattgtttga agtccaggca cagcagggta 2821 ttctttctac caccatgtta atacttaaat ataccgaata ccgaggctaa aattaaataa 2881 agtaaagaaa catagttaca aatttttcga tatataacac tcatgtcgat aaaattgttg 2941 gaatctctta tttattgtta taggaaaaat gtttcggctt tttttttatt aagtaaataa 3001 atgccaaagg ctgagtcgat gacctacgta taaaataaga aacatttttg gatttgaata 3061 aaaaaaaaac aactttgctg acaattactt atatattttt ttttgggggg tcagaagagt 3121 tctccgaata ttctagtctt gattagtgat gggtttccat tttttgtcga ctatgaacag 3181 agaagagagg ataagttcat tacattcaaa aaagatatgg aaatttgcca taaaaaattg 3241 aagtaattga gcgtgagagc caaatgaatt gaaaaattca cgtttggttc gggaagggat 3301 catgaatgtt ttgaaatgaa tggaaagata atctactttc attaagtgat ttattagata 3361 atcgaaaaca gaggatcttg aatactattc gaaattcaga agaactacgc ggaaaggcca 3421 ttgaacagct ggaaaaagcc cgggctcgct taaagaaagt cgaaatggac gcggatcagt 3481 ttcgagtgaa tggatattct gaaatagaac gagaaaagat gaatttgatt aattcaactt 3541 ataaaacttt agaacaattt gaaaattaca aaaacgaaac cattcagttt gaacaacaaa 3601 aagcgattaa tcaagtccga cagcgggttt tccaacaagc cttacaagga gctctaggaa 3661 ctctgaatag ttgtttgaac aatgagttac atttacggac catcaatgct aatattggca 3721 tgtttggtgc gatgaacgaa ataactgatt agtccttcta tttaattgta ggtattattt 3781 ttttgtttca aaaaaaaaaa gaatttaaga aagactcatg gcaaccattc gagcagatga 3841 aattagcaaa attatccgtg aacgtattga aggatataat cgagaagtaa aggttgtaaa 3901 taccggtaca gtgcttcaag tgggtgacgg cattgctcgt attcacggtc ttgatgaagt 3961 aatggcaggt gaattagtag aatttgaaga gggaacaata ggtattgctc tgaatttgga 4021 atcaaataat gttggtgttg tattaatggg tgacgggttg atgata // LOCUS DRVSEV1 4245 bp ds-DNA INV 14-AUG-1990 DEFINITION D.virilis sevenless gene, exon 2. ACCESSION M34543 KEYWORDS sevenless protein. SEGMENT 1 of 3 SOURCE D.virilis DNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (sites) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. TITLE Comparison of the sevenless genes of Drosophila virilis and Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5351-5353 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 4245) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.M.Rubin, 24-MAY-1990, for release after publication. FEATURES from to/span description pept / 2804 + 3629 sevenless protein, exon 2 (AA at 2805) (5' end is putative) IVS 3630 > 4245 sevenless protein intron A BASE COUNT 1103 a 957 c 922 g 1261 t 2 others ORIGIN 1 tataagtgtg tcgtattgtt ggccaataat tagccgagaa tgtcaaatca cttgctgtta 61 attgatgttt atcgagagta gagtcgaact aatcgttgtt acagcttggc tctgcttgtg 121 ctttgaatag tgtactttga gctgcataaa ttcagctgat agagttggcg cattttcggt 181 ggcacttcaa gcattatgtg taactattcc agatagatac ctttgtgctc tggaatcgcc 241 catgacgctg ggtaaatggg gctagctggc tacgtcaacc gaagtttcgt cgcatctttg 301 tcgttgttga tcgtcttgct ctggcttgcg acttcggctg gggttgtttt tatactctgt 361 acacatggaa attggacaaa tggggtataa tgggcagacg gtagcaactc gtaaaccatg 421 aatttcccat atgtatttct gatcgggaca aagagctgcg tcggcctggc tatgtctgtc 481 cgtccgttaa gataatgaaa attcaatact taacaaacgc agttccagct tttcttggaa 541 gtttttaaga aatggaagaa acttttatat ttgtgtgcgg ctagagtctt acgcttgggc 601 ttacacacac ttcttatgct ttatattaag cggcgaaatt tcataacgat cggactataa 661 ataccgaagt tctgccaaaa cacgttgact ttgatgtggg tgctaggcaa gcaagcggca 721 tgctttgggg tgtttgtttt tttttgtgta caatctttgt aaacgacagc ttgtcgcttt 781 gtccagggta acgcttagtc ggtcaattct gatcagagca cttacttgtt tttggagctg 841 tcgcagtcgt tgtcgcagtc gcaatcgccg gttgccgctt gtcgtttgtc gtttgccgtt 901 tgccgtttgc cggttccatc agttaagcga cgatcgcggc acggttaagt acgcagcgct 961 cagatataaa tacatagcca caggctgcag atacatttgt ggatataaag atacggctgt 1021 gacatattag ccgcgcgact ctctgaagta gcttttgaag ttcaatcccg tcccgcactg 1081 tcccgcttcg cccaatcttg caaaaagggc tacaataaaa tgcaattaag ctgctcgaat 1141 caattctcag caaattgaat taaagcgcgt gaagagctcc tctctacaca cacacacccc 1201 ccctccaccc ctccacccac acccataagc aataacaaca agcccctgat atgtgggcga 1261 gtgttatccg aaactcgaag tccatagcct cgagctcgag aactttgaga actttgccca 1321 cggagtgctg tgaaaatata caaaatgcag atgggcataa acagaaatta acgccaaacg 1381 gaattgctcg ctcatagaat aatgcgaata attaattaat tacgggcaat tttttgcaca 1441 attttttttt ttttttttgt tttttgatgc aattaaaaat gttttgtgtg catattttct 1501 ctggcatttt cgcgcgctta attagtgatt tatttatttt atttatgcat gtttttcttt 1561 tttttttttt tggttttcat ttttctcttg tgctgtgtgt gtttttggag ttaattaata 1621 aagcggcatt ttaacatttt ttgttatttt atgtgcgaat cggcaaatca attgcatttc 1681 attaatatat tcctccgacg gaatttatag cgattccatg cggtttcttc ttaatctgca 1741 tttatgcatt tgtgcggaga aagtgatccc cgtcccaccc aaagacccgc gctctatcac 1801 tcgcacctaa atatagtttt tcccataagg cttaaattgt tgagggggga cttttgtagt 1861 ttttgtggtt tgttgtgatt atttgatgat tattttcata tttattgatt atacacccaa 1921 aaatgttgac aggaatctcc cgatttgccc gattgtattt cactttcccc ttattcagct 1981 tgcaatcgaa gctagaacct tgtgcttgga tatgtgcaca tatttgggtg tcatatttca 2041 gtctcgaaan tttcatatag atcgggcaat atgcagtgca gttaacagag aacggggacc 2101 gaagccaagc cgtggtgtgt ctctataggc agtagctaga gtctgatccg aactccaatc 2161 tgtgttctct atataaaaag ttctccgatt ctttgccaag tatctgaatc aattgcaagc 2221 tttttttttt tatgctcagc agttgcgagc aggagacaaa ccaaacttta agccgggctt 2281 ctatttcagg agattccaat tccaattgtt tgcgtagatc gtaacagatc aaaaataggt 2341 acattgaact gttttggggc aattggttta gtttattatt gttgcgtgtt tgtcgattaa 2401 cataatatgt taattcattt atatgcatgt gccgagacgt ttctgatttt tcgagttttt 2461 cgttttcaat ttcagcaaat tgtgcgctaa aatgtaaatg caaaaacgtc gagccgagct 2521 tcttcaaatc gaaatcgaac gacaacaaat tagaaaccag cccagagaca gtcgcacaca 2581 cacacacaca aacatacact cgagcacaca cacacactcg gacacgcaca accattcgca 2641 cacacacaca cacacacata ccgtggtcaa agtcagagtc ggacggacgg cagctccggc 2701 cagttccgac cgccggaacg cagagtccca cttgccactt ggcattgccg tggcgtaggt 2761 cgagcttaat ttttgtgctt cgatgaacat ttaaattgtg gactatgttt tggcgagaag 2821 atgcagcaca gcagcagcag caacagcaac aacaacaaca gcaacagcag cagcagcaac 2881 agccgccgca tccacccaaa cgcttgagct ttagtttcaa tgttaaaatc gctgtcaatg 2941 tgaacaccaa gatgtccacg actcacatca accaggagcg ctccaagcag caaacaacaa 3001 ccggatctcg atctcgatct cgttccaatt ccaattccag tgtcagttgc aaaggcgacg 3061 gcgatcgccg tgttcggcgg cacacaacgc ggcttgtggg cctgcgtcag cagctgttgc 3121 atctgggacg tcagctaaac ccgggccaat tcttggtcac gggtcacggc ggcatctcca 3181 cgatactaat tgccaatttg ctgctgctgc tgctgctctc gctctgctgc aatgtctgtt 3241 gccgctccca catcgagccg gatcaaaact tgacaccaac aacaacatca ccagctgctg 3301 ttgcagttgt tccgatgttg ctgcccctgg cccagaccca tatgcgaccc cagctggaca 3361 gcgatgtggt cgagaaggtg gccgtttgga caaagcatgt cggtgcggcg ccacccagca 3421 tcgccgaggg cattgccatc agcagcgtgg tcagaatgcc acccagcatc caaaccccaa 3481 ccgaaactgt gcgcaggcaa gaacaacaac gacaacagca gcaacagcaa caagaggcgg 3541 ctgcggctgc ggctgccgac gccgccatcg atgagcgcat cgtactggag cgcgtcaccc 3601 gcgactgtgt gcagcgctgc attgtggagg tgagtcgcta gttgattaac gggcggttta 3661 aactataact aaatggaatt gttattgtta ttgttattgt gaccttgagc aacgtgcttg 3721 tgcaccgtaa gtacagtttg gtatgtccgt tattccatac ttcctggcaa agcgcatgtt 3781 agttaacaga atccttcaac ttatcgtaac ttacttgctt tgggtgtccc aatgcacttg 3841 acttgccttg acagttcttt aatactctta aggactttct agttattgta gagctttgct 3901 tgcactaaac taacttgttc agatcgttct aaaatctttc tggcaaagct catgttagtg 3961 aactacagca cacttatcat aaagctgtag ctctggcaag caaggcttta agggcttaag 4021 acttaaagag tttctagtgc tggtatatca atgaagccat aaagcttttt tataaactta 4081 acttgttccg acatttacat tatacttcct ggcaaagcgt atgttagtaa aatgcagcat 4141 tttaaacgta tcgtaacttg ctctggatat caacatacgt ggctttaggc catagattgt 4201 gctagcaagc ttaagaattt aacattaacg ctccganaaa aagtt // LOCUS DRVSEV2 5687 bp ds-DNA INV 14-AUG-1990 DEFINITION D.virilis sevenless gene, exon 3,4,5,6 and 7. ACCESSION M34544 KEYWORDS sevenless protein. SEGMENT 2 of 3 SOURCE D.virilis DNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (sites) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. TITLE Comparison of the sevenless genes of Drosophila virilis and Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD full staff_entry REFERENCE 2 (bases 1 to 5687) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.M.Rubin, 24-MAY-1990. FEATURES from to/span description pept + 1897 1971 sevenless protein (sev), exon 3 4050 4115 sevenless protein, exon 4 4177 4512 sevenless protein, exon 5 4574 4810 sevenless protein, exon 6 4867 + 5186 sevenless protein, exon 7 IVS < 1 1896 sev intron B IVS 1972 4049 sev intron C IVS 4116 4176 sev intron D IVS 4513 4573 sev intron E IVS 4811 4866 sev intron F IVS 5187 + 5687 sev intron G BASE COUNT 1665 a 1222 c 1168 g 1629 t 3 others ORIGIN 1 ccaacagata taaccatgtc cctctgtcct tttatctgtc accaagcgta ctagtctctt 61 agttttaaag ctattgtctc gaaacctctc tgttgcaagc agtatatatg ttagtactgg 121 atcggattgg atcgaagttc ttgtaagaaa aacttctttt caagatatct tgaccaaaac 181 cgtcctctac ttgttaaact atgctccttc tatataagct caattatatg taatttccat 241 ccttaagata ggtaaatgat caaaatgcga tgcccaaatg tttataagct ataaacttat 301 ctaaagtctg gcattttgta gacatttgag ctgtttataa gttcagttta agctgagata 361 atatgaaaac ttgaatataa tttagtattc ggttggactt tgtattaagt tttgacataa 421 ataagggctt tttttatagc ttgttgtttc gcagctttaa tgctgagaca gaccataacc 481 tgcgcttgtc tgttagaaat ttgtctgtta aagacctttg atccattatt ttactatgta 541 ttactaggtt aaccaagatt cgagaagctc tcgagttgtt tgccttcgct ctgaattggc 601 atgtcgctat ttttggctgc ccctcgccgc aacaacaatg aatcgatcgt ttgcggaccg 661 agtcggagat ctctttacga tcgatcacgt gctaaaagcg aaaagaaaga gaaaaaaagg 721 cagaaactcg tttggcgtga cgccgttaaa tgtcgatttg gtcaatagat ttatgcattc 781 gtaatgcgcg aatatctcag aggttgttaa aatgcctcta aagtcaataa tcaggtctat 841 gtttgagttg gcggctgtca cttgagaaaa gtagctaaat caaatcatca aaatagctat 901 aaaaagctgc attacctaag cagagtgatc tacaaaagcg ctgctgcttt ttgcggagtt 961 ttcttttcaa aaaaaagtta gactaaataa attagctaca aaatagccag aactgcaggc 1021 aatggcaaat atgacaagtt tcgtgtgaaa aacagcccat tctagcttgt aaatgacagc 1081 acaaaggtcc cagctctatt tcggcagtaa gccagaaaaa gggcacttct ttgttttctc 1141 gctgaaaata gccaaaatct agttttgaaa tagccataac tgactataaa atagccaaac 1201 tagcagacac ttgcaaatgt cagctattat caagttgggt agttgggtaa aaaaactcta 1261 gctagaaaat ggcaaaattt gttcgttgcg gatagggaga atagccaagt ctcgctataa 1321 aatagctaaa ttgacaacac aactctagcc tttttttttc aataaagcta gaacaagggc 1381 taaatcagct aaaaaagaat ttccggtatg caaatagaaa attctagtta aataaagcag 1441 aaatagtaat atactagctc tagctctttg ctctatttag cttgaaaatg gccaagtata 1501 cccaaaaaaa aaaatagctg atttctttct ttgctgtaat cagttagaaa atagtctagt 1561 ctagctagaa aatagtgaat tgctaacata ctgatccaag ctcactctct ctgttttgct 1621 tatatttagc ttgaaaatag ctatcttggc aatattggct gtagccaatt ctcgccctgc 1681 ctgctaccga tctgcctttt cacattgctt ttatctataa atgttaaaga ctgtgaattg 1741 attcattgcc attgacattg aattgaaaat tcttcccacg atttggattt cccttactct 1801 ttctttcctt gcccctgctt gctgcggcat ttccgttgca aatcggggct cttttgtgct 1861 aattgatttt cacttggatt tctttctttt ttccaggagg atctgtttct ggacgagttt 1921 ggcattaaat gcgagaaggc cgacaacagt gacaagtgct ataaaacacg agtaagtctg 1981 ccccccaaaa accaacacgt ccccctcgcc gaaacggtgt caagtgtttt gacaataacc 2041 ccccctcaag catgccccat tatccccaac taagcaaaag gccacaaagt cgcgtaattg 2101 gaaaccacct ttcaacttta ccagacaacg tgacgacgga ccgagcgacc gaccgaccga 2161 atgacccata aaaaaaaaca agaacaaatt cccagaaacc agaaacgaaa cgtatttacg 2221 gctcaaaaac cctgacaaaa aggaatatac ttaagcaaac gcacgaaaag aatggcaatg 2281 tctgtcttca ttgagccgaa gattaaatac ccttttatac agctcgtaat tgatgagaat 2341 atcttgcaat ggattctacc ttaccatagg atctatatga tatagtagtc caatgataac 2401 agaaagcaag gatatctctc acnagtatag aaaaagattc tctagctgtg cgatataaat 2461 ataaatagaa ncccaacaca aataactttt gcctgtgaca tggctattca caaatgccat 2521 atgccctttt gatacacact tgcaacagtt acagggtatc aggagtggca agcgagggat 2581 agccagcggt aaggccatca ggcacttgtt aaggaaaaat cataaacaca ttcgatatcc 2641 aagccagttc tgcctaacat gtatttcgga ttaaccacag ccagttgagg ttgggagtag 2701 ggagtgggat cgacctttga tgtgattaag ttaccaagca acacagctga gcagcttttt 2761 actggctttt aattgtcgcc atgctcgtga gtttagctag ttgcatacga actcaaagtg 2821 catcaggcgc ggccaatcgt gctccagctc cagacagcaa aggaaaccgt cgaacaacaa 2881 ttgtaaacgc tgctgtcggc tctcaattag tttgcttgca aggcatcgga tccggcggtt 2941 aatttccgtt tgagttttac agcaatccat cagggtacaa tagtttatct agaaattagc 3001 tgtgagcttg gctagcgcta agagacttac agactcgtac ttacctaata ataacatgga 3061 attttgctat cacgaaatac aagagtagtt attgaacata aaatatattt tcagagcata 3121 gtctaacagc tcactagtta tctgaattgt gtacgcccaa aggtaaatag tgttgagtgt 3181 gaaatagttt aaatatgggg aacagttggc tctgcttttt atacccttgc agagggtatt 3241 ataattttgt cgtgaaatgt gtaacgcata gaaggagaca tctctgaccc cataaagtat 3301 atatattctt aatcagcatc aacagccgag tcgatatagc catgtccgtc tgttcgtctg 3361 tccgtctgtc tgtttccatg tcgaactagt ccctcagttt taaagctatc ttaatgaaac 3421 tttgcagaac tccctctttc tgttgcactc agcacatatg tgaaaaccag ctggatcgga 3481 ccactatatc atatagctgc cataggaacg atcggtcgaa aattaagttt ttgtatgaaa 3541 aacattttgt tatcaagata tcttgaccaa actcggcatt tattagtctt actatgctcc 3601 tcatatatat gcaaaatctt attaagatcg gaccactata tcatatagct gccataggaa 3661 cgatcggtcg aaaattaagt tgtatgaaaa aacattttgt ttatcaagat atcttgacca 3721 aactcggcat ttactatttt cccggtactt cttagatagg ggcaaagcac tatgagcatt 3781 atgaaaaggt tgggtctgca agggtattag atctttggcg tgccgaagat agcccttctt 3841 tctcgttttt actggataag caaacgagtg cgaggcaaac gagggtatct cctagactac 3901 ttttgtagtg cactgatatt aatagctttt taaggctgat taaactttca gctggcttta 3961 gataagcctc actttactat gcacctcctg tcttttccct gatgggccaa ctcaacctca 4021 actaatctga tcgtaatctc tctttgcagt gcaacaaggg ctgtgcgcaa tggtatcgcg 4081 cgctgaagga gatcgagccc tgccaggagg cctgtgtgag tgatatctga cagctgccgt 4141 cttaatgata gcttcttaac tgcattattt tgataggctt ccacacagtt ttatccgtac 4201 gatatgccct gcattggagc ctgcgagacg gcgcagcgcg attattggca tatgcagcgg 4261 ctggccatgg ctcggttggt ggagacaacg cagccgcagc tgttggagat gacggatgag 4321 tcgagcactt tgaccatcaa atgggccatg cagttcccgg agaactattt ggccagtcga 4381 ccctttaata tacaatacca gcaagtggac aatcaatcgg aacctgaatg gcacaatctg 4441 gcggactacg actgcgatga atactacgta tgcgaaattc tggaggcgct tgtgccctac 4501 acccgctaca aggtatgtcc atcatatatt aagtgaaata tctctggaga tttaaatttg 4561 aactcttttt cagtttcgct tcgagctgcc atttggggag agcagcgaag acgtgctcta 4621 ttcgccagca acgcctgttt acgaaacgcc gatggagggc gcccccatct ctgcgcccat 4681 tatcgtggca ctgctggcgc tggatgagca tcatgtgttc gtgcactggc gtcccggacg 4741 ctatagcaat gcgcccattg aaggctatcg cgtgctcttg accagcgcag gcaacaccag 4801 ccgcgaacag gtacgtaacg agctccaact gccaagctgc caaatctaaa cgctcatctc 4861 taccagctgc tgcctgctca acgtactagc tgcatctttg cccagctgca gccgctcacc 4921 aactacaccg tggcgctgac catgatcaac aaacagggcg agggaccgtc tacggtggtc 4981 agcattgtga ccaagtcgcc cctggagcca cagcagctgc agagcgtgct gcttgcgagc 5041 gagcacagca tcatctggca atcactggag ccagcgggcg agacacgcct cctctacacc 5101 tctgagccgg cggccattag cgattttacc ttttcgcagc gcgagcagcg gctctggctg 5161 ctggacgaat tgggtcagtt gcacaggtga gcaattggcc atttcagtcg ctcttcaatt 5221 atcttttcga gagtgaagat tatttttttt tgtaggattt cgaggtgaaa ttacgtaaag 5281 tttctacagt aggaaaagct ttctagctag atcatcagtc agtcagtcat atatggatgg 5341 ggagctgcag ttaagctcta gccaagacca gattttgttc agcctcttta gtcacagtta 5401 aatcagtcag tcagtcaatc actcagtcag tcgaatgaat atataattta gtacgatcaa 5461 ttgtgactgt tccttgatca tcagtcagtt agttattcaa tcngtttaat caaaaatcaa 5521 tcagtaaagt aaattggcta ctacagctac aggttgtcca aaaaaatgga tagtttagga 5581 tacattcaat cagccgtcaa tcaatcagtc agtcataata tgatatactt ctattatgcc 5641 tctctatgag ttcagctaca gctaagacca actggggtgt gtgttga // LOCUS DRVSEV3 8779 bp ds-DNA INV 14-AUG-1990 DEFINITION D.virilis sevenless gene, exons 8, 9, 10, 11 and 12. ACCESSION M34545 KEYWORDS sevenless protein. SEGMENT 3 of 3 SOURCE D.virilis DNA. ORGANISM Drosophila virilis Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (sites) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. TITLE Comparison of the sevenless genes of Drosophila virilis and Drosophila melanogaster JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5351-5353 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 8779) AUTHORS Michaeil,W.M., Bowtell,D.D.L. and Rubin,G.M. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by G.M.Rubin, 24-MAY-1990. FEATURES from to/span description pept + 340 4973 sevenless protein (sev), exon 8 5836 6113 sevenless protein (sev), exon 9 6174 6365 sevenless protein (sev), exon 10 7286 7722 sevenless protein (sev), exon 11 7791 8175 sevenless protein (sev), exon 12 IVS < 1 339 sev intron H IVS 4974 5835 sev intron I IVS 6114 6173 sev intron J IVS 6366 7285 sev intrn K IVS 7723 7790 sev intron L BASE COUNT 1930 a 2456 c 2435 g 1958 t ORIGIN 1 tctggtagat gagcgttaga ttggcagctt ggcagttgga gctctcaagc tctatatata 61 tataccattt gtcctgactg attgactgat tgacggactg attggggatc taaatttccc 121 actcccataa ctcggccaaa actcaaccga ttttcatgag gcttggcttt ttattcatgg 181 tttggcctgt atatcaattt ggcattttca aagatccttg agagaattca ttttataggc 241 gccacgggca agggcgggtc taccggctgg tccgtcataa gtacatcgcc caatctctaa 301 gctctacttc agtttagact tgcaattttc tgtttacagc cagctgctgg atgagacgac 361 gacaagcgct gcccggcgtc tgcggctgga gctgccctca aatgggagca gccagtggac 421 gccacgcaag ctgagcctgg actggttgca gcgacgcctc tacatagcag cgcaggcgaa 481 cagcagcgac ggggcagaag gaggatttga gctgtttagc agcaatctgg agggcggcga 541 tgtgcagatg gcgggcgtgc agctgggcct ggtcgtggag cagctggagc tggacgccct 601 gaacggctgg ctcttctggt gcgatgcaga cagcctctgg cgtctcgatc tgagcagcaa 661 gcagcagctg cgccttacac agcccgcagg agcacccgga cgtttcatgt tggagccgca 721 gcgctggctg ctgcacgtgc tgctgccgca ggagaatcag ctgctggagc tgagctacga 781 tggtgggcac aagcatgcgc tggcactgtc caacgacagc tggcgcggat ttgcctggtc 841 cagcgatcag gcacagctgc tactggcgaa tgagacacag ttgcagctgc tggacggaca 901 gacgctggtg ccgcttgcca actggtcgcc ggatggcggc tgctgcgctt tgctgcccct 961 ggaacgtcgc cgccagccgc tgtcactgga gccgccggcg ccgcgcgagt tgcgcgccct 1021 gctaggagcg cagggcgcgc acataacatg gcagccgccg gcagctaatc cctaccagac 1081 tgcaacggcg gccgcgcgca atttcagcta cgagctggag gtgctcgacg tggccagcca 1141 aagcgcctac aacatacgca acatacgcgt gccccacttt ggcctggagc gcctgcaggc 1201 ggacaatctc tatcagctgc gcgtgcgcgc aaacaatgcg gccggccgtg cgggcgtgtg 1261 gacagcgccg ctggcgacgc gcacctggcc cctgggcgat catcgtctgc gctgggccac 1321 gcagcgcggc tcgctgtaca ccacgaacga gctgggcggc cagctgcagc cgctgcccgt 1381 gcagctggca tcgtcgcccg ggccgctagc cctggtcaat gccagcgtgg cgtactatgt 1441 cagcgggcgg gagcaatcgc tgcactgcgt caatttgctg cagccgcagc tgagctgcac 1501 cgacgagcgg ctggagcatg taggcgccgt tgcctatgac tggcgcggcg gtctgctcta 1561 ctggacggat ctggcgcgcg attgtgtcca gcgtctggat ccgttcagcg gtgagcgcga 1621 gctgttgccc atctttggcg cccgccattt ggccctcgac tcggcccagg gacatctcta 1681 ctactcgagc tcggcgcatt tggcgcgccg ttcgctcagc gctctctcca cccatcagcc 1741 ggagctggag tactatcatg tcaatggact ggccggccag attagcggct tctgtctgga 1801 tctgccgcag cgacacatct actggcttgt cgccggcaac tcggcgctgc atctctatcg 1861 cacagcattg tcggcgggcg gcagccaggc cgccgtccca ctgcagctgc tgaccacgct 1921 gccggcggcg gacgcactgc cgcatacgct gcagcatctg gcgccgctgg gtgcgctgct 1981 ctggctggcg gcggatggcc ggggcgcgca tctgctgcgg ctcgcggccc aactggagac 2041 ggacacggat actatgcgtc tgctgccgga gggcctggtc gagccgctca gcgccgtgca 2101 gctgctggag cgttcggctg ggccgccgcc accgccgccg gacgagggcg tgcgtccgct 2161 cgccgtgccg ccggacagtg tgcacatcga cgagggcggc cactggaatg actttcgtgt 2221 gcgctggcag cctgctgcca gtggcggcaa tcacagcgtc tgctacaagc tgctgctgga 2281 gcacggcagc gagcgtctga tcacgctgga gctcctcacg ccctttgccc gcatcacaca 2341 actggcccag gcgccgctcg gtctgcgcat cagcattacg ccgcacaccg cctggcgcgc 2401 cggatccacc acccgggtgc agctggacac gcccgttgcg gcgcccacgc agccgcggcg 2461 gctgcgcgtc tttgtcgagc gtcaggcggc gccgctgcag ttggcgccga atgtaagcgc 2521 tctgctgcgc tgggatgtgc ccgaggagca tgcgggcagt cagtcgctgc agtatcgcat 2581 cagctgctgg cgcggcagcg agttgcattc ggagctgctg ctcaatcaga gcacattgga 2641 ggcgcgcgtc gagcatctgc agccggagga gacgtatcgt tttcaggtgc aggcccatgt 2701 ggccgccacg ggcctggccg ccggcgccac cagccatgcg ctgcacgttt cgcccgaggt 2761 gcaatcggtg ccgcgtctgc tctatgccaa tgccgagcac attggcgaac tggatctgga 2821 tacgggccat cgcaagcagc tggtgcacac cgccagtcct gtcgagcatt tggttgtgct 2881 gcagggcgag cagcgtttgc tttgggtcaa cgagcacgtc gagctgttga gccatgtgcc 2941 gggtaaggcg ccggccaagc tggcacgcat gcgtgccgag gtgctggccc tgaccgtcga 3001 ctgggtgcag cgcatcgtct actgggccga gctggatgcc gcggacggcg gctgtgtcat 3061 ctatagcctg gatctctgtc gcttcgatgg acgcattctg cagggcgaac gtctctggag 3121 cacgccgcgc ggccagctgc tgcgcgatct tgtcgccctg ccgcatgccc gccagcttgt 3181 ctggctgcag cacgatctgg actcgcgcaa tgccacgctg cagggtcgca gcctggccaa 3241 tggcagcgcg ctcacctttg agggcgtcac gttaccgctg tggcgcctct tcgagggcag 3301 ccaggagccg ctggccgaga cccttaatct ggtggatcat ttgggccgtc tgtgcgttta 3361 ccatgtggcc cggcaactgt gcacgtccag cgcactgcga gcccagctca atctgctgaa 3421 cgatgatatc ggccagctgg cccaggatcc gggctatctg tatgctctgc gcaatggtag 3481 tgtgcgcgca tatggccgcc gccggcagca actggaattt ctgctggaac tgcagccgga 3541 cgaggtacgt ctgctgcgcg cctacaacta tcaggcgtat cccagtcgtc gctgcctgct 3601 gctgcccacc acggctgccg ccctcgaatc aacgccgtcc agctgcgagg agacgcagtg 3661 cagcctccag ctgccggcgc tcagcgccgc accagactgc ccgctgccgg tgcccgggct 3721 taactatcag ctgaatctta gcagcagcag cagaagcgca cagctggagc tgcgctcgct 3781 gcactccgcc gccggcctga cgctcaacat aagtcaactg cagccctatc aggcctacga 3841 gctgcgcgct caggtcggca gctactatca gcaacaactg ggccaggagc cgctgcagct 3901 gcccgtcctg acgctgcaca cggctgcggc cacgcccagt gcgccacgaa atttcagcgg 3961 acgggcactg agtcccagcg aactggagct cagctggctg gcgccgctgg agctgcgcag 4021 cgcgagtgtc tattatacgc tgcactggca gctgcagctg gaggataccg aggagcagtc 4081 acaggagcag cccgcccagg agcagcgcgt ggagacggcg ggcgtgcagc ggctgacggg 4141 tctgcagccc gcacgactct atcaggtctg gctgcaggcg catgccacgc ccagcaaata 4201 caatagcagc gggcggctgc tgatccgcag ctatgcgccg ctgccgccgc tgcagctgat 4261 cgagctgaac gcgtacggaa tgacactcgc ctggccgggc acaccggatg cgctcagttc 4321 gctcaccctg gaatgtcagt cgctgcgcga acagctccag ttcaatgtgg ccggaaatca 4381 tacacaaatg cggctggcac cgctgcagcc gaagacgcgc tactcgtgcc gcctggccct 4441 ggcgtatgcc gccacgccgg gcgcacccat ttactttggt cccagtcatg agtacgaaac 4501 gctgggtgat gcgcccagcg cgccgggtcg gccgcagctg gagcatatag ccggcgaaat 4561 ctttcgcgtc agctggacgc cagctctgga caacggttca cccattttgc tatacaatct 4621 ggaggctctg caggcccggc gcaccaatcg tcgtcggcgg cggcgccgag agactacgct 4681 ttccctattg ccctgggccg aggagccgct ggtcatcgag gatcagtggc tggacttttg 4741 taataccacc gagctgagtt gcattgtgcg cgaattgcat acccgtcgcc tgctgctgtt 4801 tcgcgtccgt gcccgcaatc ggccgcacgg ctggggaccc tacagcgagg acagtgaacg 4861 catcgctgag ccctttgtct cgcccgagaa gcgcggctcc ctcgtcttgg ccatcattgc 4921 gcccgctgcc attgtttcca gctgtgtgct cgccttggtg ctcgtgcgca aatgtaagtt 4981 caaaatgaat atttttccgc ccgacaatca gtcaatcatt caactagtcg tcggtcagtt 5041 tgtcaatctt tcagtcgaag aatcaatcga taagaaaatg agtggctaaa taagtaagtt 5101 agtcaatcaa atagtctgtc tttttgtcaa tcagtcgatc aatcaatcgc tcaatccatc 5161 tggtattgcc attcagccaa ttaaccagtc aatccagcaa tcattcaatt agtagtccgt 5221 aggttatcag tcaactatga taaccaatca atccgtcagc ttgataattt gtcgtttaac 5281 tagtcaagaa atcaatgtta atcaagtaat tggccaatta atcagtcagt tcagcagact 5341 gtctagtata actgtgaatt gaaggttcat gaatctgtca gtcaaacggc caatcaaatc 5401 aatcatttgg aaaatttttg atcacccaat ctcaaagtaa ttaagtgagg caagcagtca 5461 attaggcaga taaacagtta atcaggatgt cagtcaatca atcaatcata aatagtctgc 5521 cacctaatcc atcatgattc aatcagtcaa tcagccaata ctcacgtagt caattgatat 5581 tccggacaat cagtcagcca gtctgaaatt aatcagtaaa tcaatcgaat atgagtcagt 5641 caatcaatca attaacacgt aatcaatcaa tcaatcagtc aatggctcat taaatcaata 5701 agcagatcag aaaaatgttt aatcattcac tcaatcagtc agcctaatca atcatgaccc 5761 tttcattcaa tcaatcaatc aatcagtcag tcacatacac aagactcgct aattagtaaa 5821 tctgcgattt tgcagtgcaa aagcgtcgtc atcgcgccaa aaagctgctg cagcagagcc 5881 gcccaagcat ctggagcaat ctttcggctc tgcagacgca gcagcagctg ctggccgcac 5941 ggagtcgcac cttctcgatg tcgctgagcg atgcggacat tgcgctgctg ccgcagatca 6001 actggaaccg gctgacgctg ttgcgtttcc tgggcagcgg cgcctttggc gaggtgtacg 6061 agggtcagct gcaggccgag gacgaggcgc agccgcagcg tgttgccatc aaggtgagtg 6121 gagcagcagt tttttttttt gccacaattc aaggattctc ctcgccttta cagagtctgc 6181 gcaagggcgc cagcgagttc gcggagctgc tgcaggaagc gcagctgatg agcaatttca 6241 agcacgaaaa catcgtttgc ctgattggca tctgctgtga cacggactcc atatcgctga 6301 ttatggagca catggaggcc ggcgatttgc tcagctattt gcgtgcggca aggcccagtt 6361 cacaggtgag cgagcaagca ttaatattta ttatacgctg tagccattgc aaatgggtaa 6421 agggtaaact gtttttgtgc atatgcatgt gacgcacagt atgtccatgc ctttaatctt 6481 gatcaggacg cactgcgttc gtctaggcag gttcctcggt ctctcctttc gttttccaaa 6541 tatcgtaaaa tatcgaaaca aggttttttt tatttacttt atttaattta tttgttaatg 6601 gtcgacaaac acgagtgtct ttctaattat ttaatacatc aaatcatgaa aattaaatac 6661 taaaaaccta aaagcatcac cgactagata tataaagcgt taagtagata ggataaagag 6721 ataccatatg gatatattta gagctcatag cgggaaatag tcagtgagtt cccgaaaata 6781 tatgtacatg ttgaagatta taaatgctag atacaccaca atatggatga ggattaagcg 6841 cctttttgga tgcctccaat caaaactatt ccaaaatcaa aaggatttag aggttccgca 6901 ttcagctttt gtatgtacca acattttgat gccatattta cacgctgaaa gttttacaaa 6961 gatcggacaa taaacaccaa agttcctcga gaatgccata actgaaaatg tggaaccaaa 7021 gaaaagtaga cagaagagag acaaatgtga gtctgttgtg gcagccggat acaggatatc 7081 tctagtcgag cacttgaatt gagcactctg atgccatatt tatacgctga aagtttcatt 7141 atgatcggac cataaacacc gaagatcctc gagaattcca tatctgaaaa tgtggaatca 7201 agcaatagta gagattcagc tagaatgtgt gtgtggctac agggaatatt ctagtcgagc 7261 actcttatgt atggttcgtt tgcaggaagc gctatccaag ctgcagctgc cggagctgtt 7321 gagcatgtgc ctggacgtgg ccaatggctg cagctatatg gaggatatgc actttgtgca 7381 tcgcgatctc gcctgccgca attgtctggt gtcggatgga gccgcaattg ggggacggcg 7441 cattgttaaa attggcgact ttgggctggc acgcgacatc tacaagagcg actattaccg 7501 gaaggagggc gagggactgc tgcccgtgcg ctggatggcg ctcgagagcc tggtggatgg 7561 cctatttagc acacagtcag atgtgtgggc cttcggtgtg ctctgctggg agatattcac 7621 gctcggccag caaccgtatg cggcgcgcaa caacttcgag gtgctcgccc atgtcaagga 7681 gggcggacgg ctgcagcagc cggagcgctg tcccgagaag ctgtaagtgg gccagctctt 7741 ccagttaact tctctctact caatcactgt cccatttatg ctgtccacag ttatgcgctg 7801 ctcttgcaat gctggcgctc tgagccctgg gagagaccca gctttaagcg ctgcctcagc 7861 acgctgcagg ccctcagctc ggatttgcgg cgcactgaaa tgctggccac ggatgagacg 7921 ccgttggttt ctgcattgtg cgctttcaag ccggacgcca aggtgcgttt cgatgatgcg 7981 ccgcagcgtc ttacgctgca cttggacgcc aaagatacgg ttagcacaac cgatgccgat 8041 acgacaggaa gcccaaccac acccaccgca ccaaccacgc ccaccacgac caccagcaca 8101 atcgcagtcg taagcacagc tccatcgtcc gaaaatggcc agctctatgc aaatgagggc 8161 atttcgggcc tttgaaatcc cctttcattt ggatctgccg tcgcaagttg cggcgccaca 8221 aattgatgaa cattcgcatt cgcatgtttg ggcgcactca accatgtaac gcccacttgc 8281 cgaaacaccg cccacacgcg catacacgat tgtgggctta gaattatggt ttctatttaa 8341 tcgatttact attctgtact gaatccaaaa gctaaacaag accaaaagat tgacctcaac 8401 aagtcctaca tgtaaatcaa atggttctgt ttagtttttc ttttttgttt ttgttgtttt 8461 tttttttttt ttggtatacc caacaagacg agtatgctta ttcgaatata caagtagagc 8521 ctaaagtcga atcaataaat atcatgatat ttttgtagaa atagccttta tcattaaatt 8581 attatatttt gcctcaatca ggctgattga cggttgatga tcaatcaaat ataaggtatg 8641 cgctactcga aaatattata aataaatgaa ctcaacagta aatgaagata ttcttgtaaa 8701 accaggctgc atgttttagt ctgcctgatt gattagctaa cggactgact gatcatcagt 8761 caatcttttc gagtatgcg // LOCUS ECOLAMBAA 186 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli lambda receptor (lamB) gene, 5' end. ACCESSION M24997 KEYWORDS lamB; lambda receptor. SOURCE E.coli (K-12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 186) AUTHORS Clement,J.-M., Hedgpeth,J., Hofnung,M. and Jacob,F. TITLE Sequence de l'ADN correspondant a la region du peptide signal pour le recepteur de lambda chez E. coli K12 JOURNAL C. R. Acad. Sci., D, Sci. Nat. 289, 1033-1036 (1979) STANDARD simple staff_entry FEATURES from to/span description pept 82 > 186 lambda receptor precursor sigp 82 156 lambda receptor signal peptide matp 157 > 186 lambda receptor BASE COUNT 51 a 46 c 48 g 41 t ORIGIN 1 ccggcgttta agcaccccac aaaacacaca aagctgtcac aggtgatgtg aaaaaagaaa 61 agcaatgact caggagatag aatgatgatt actctgcgca aacttcctct ggcggttgcc 121 gtcgcagcgg gcgtaatgtc tgctcaggca atggctgttg atttccacgg ctatgcacgt 181 tccggt // LOCUS HUMCRPR 1797 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human cysteine-rich peptide mRNA, complete cds. ACCESSION M33146 KEYWORDS cysteine-rich protein. SOURCE Human normal term placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1797) AUTHORS Liebhaber,S.A., Emery,J.G., Urbanek,M., Wang,X. and Cooke,N.E. TITLE Characterization of a human cDNA encoding a widely expressed and highly conserved cysteine-rich protein with an unusual zinc-finger motif JOURNAL Nucleic Acids Res. 18, 3871-3879 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.A.Liebhaber, 22-MAR-1990, for release after publication. FEATURES from to/span description pept 36 617 cysteine-rich protein BASE COUNT 405 a 507 c 493 g 392 t ORIGIN 1 cctgccgccc ctgcgccgcc gagccagctg ccagaatgcc gaactgggga ggaggcaaga 61 aatgtggggt gtgtcagaag acggtttact ttgccgaaga ggttcagtgc gaaggcaaca 121 gcttccataa atcctgcttc ctgtgcatgg tctgcaagaa gaatctggac agtaccactg 181 tggccgtgca tggtgaggag atttactgca agtcctgcta cggcaagaag tatgggccca 241 aaggctatgg ctacgggcag ggcgcaggca ccctcagcac tgacaagggg gagtcgctgg 301 gtatcaagca cgaggaagcc cctggccaca ggcccaccac caaccccaat gcatccaaat 361 ttgcccagaa gattggtggc tccgagcgct gcccccgatg cagccaggca gtctatgctg 421 cggagaaggt gattggtgct gggaagtcct ggcataaggc ctgctttcga tgtgccaagt 481 gtggcaaagg ccttgagtca accaccctgg cagacaagga tggcgagatt tactgcaaag 541 gatgttatgc taaaaacttc gggcccaagg gctttggttt tgggcaagga gctggggcct 601 tggtccactc tgagtgaggc caccatcacc caccacaccc tgcccactcc tgcgcttttc 661 atcgccattc cattcccagc agctttggag acctccagga ttatttctct gtcagccctg 721 ccacatatca ctaatgactt gaacttgggc atctggctcc ctttggtttg ggggtctgcc 781 tgaggtccca ccccactaaa gggctcccca ggcctgggat ctgacaccat caccagtagg 841 agacctcagt gttttgggtc taggtgagag caggcccctc tccccacacc tcgccccaca 901 gagctctgtt cttagcctcc tgtgctgcgt gtccatcatc agctgaccaa gacacctgag 961 gacacatctt ggcacccaga ggagcagcag caacaggctg gagggagagg gaagcaagac 1021 caagatgagg aggggggaag gctgggtttt ttggatctca gagattctcc tctgtgggaa 1081 agaggttgag cttcctggtg tccctcagag taagcctgag gagtcccagc ttagggagtc 1141 actattggag gcagagaggc atgcaggcgg ggtcctagga gcccctgctt ctccaggcct 1201 cttgcctttg agtctttgtg gaatggatag cctcccacta ggactgggag gagaataacc 1261 caggtcttaa ggaccccaaa gtcaggatgt tgtttgatct tctcaaacat ctagttccct 1321 gcttgatggg aggatcctaa tgaaatacct gaaacatata ttggcattta tcaatggctc 1381 aaatcttcat ttatctctgg ccttaaccct ggctcctgag gctgcggcca gcagagccca 1441 ggccagggct ctgttcttgc cacacctgct tgatcctcag atgtggaggg aggtaggcac 1501 tgcctcagtc ttcatccaaa cacctttccc tttgccctga gacctcagaa tcttcccttt 1561 aacccaagac cctgcctctt ccactccacc cttctccagg gacccttaga tcatcactcc 1621 acccctgcca ggccccaggt taggaatagt ggtgggagga aggggaaagg gctgggcctc 1681 accgctccca gcaactgaaa ggacaacact atctggagcc acccactgaa agggctgcag 1741 gcatgggctg tacccaagct gatttctcat ctggtcaata aagctgttta gaccaga // LOCUS HUMCSPBB 910 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human cytotoxin serine protease-C mRNA, complete cds. ACCESSION M36118 KEYWORDS cytotoxin serine protease-C. SOURCE Human lymphokine activated cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 910) AUTHORS Klein,J.L., Selvakumar,A., Trapani,J.A. and Dupont,B. TITLE Characterization of a novel human cytotoxic lymphocyte specific serine protease cDNA clone (CSP-C) JOURNAL Tissue Antigens (1900) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Klein, 06-JUL-1990. FEATURES from to/span description pept 53 793 cytotoxin serine protease-C precursor sigp 53 157 cytotoxin serine protease-C signal peptide matp 158 790 cytotoxin serine protease-C BASE COUNT 231 a 245 c 250 g 184 t ORIGIN 1 ttcgatatcg agctcctcca acagctctga cctgggcagc cttcctgaga aaatgcagcc 61 attcctcctc ctgttggcct ttcttctgac ccctggggct gggacagagg agatcatcgg 121 gggccatgag gccaagcccc actcccgccc ctacatggcc tttgttcagt ttctgcaaga 181 gaagagtcgg aagaggtgtg gcggcatcct agtgagaaag gactttgtgc tgacagctgc 241 tcactgccag ggaagctcca taaatgtcac cttgggggcc cacaatatca aggaacagga 301 gcggacccag cagtttatcc ctgtgaaaag acccatcccc catccagcct ataatcctaa 361 gaacttctcc aacgacatca tgctactgca gctggagaga aaggccaagt ggaccacagc 421 tgtgcggcct ctcaggctac ctagcagcaa ggcccaggtg aagccagggc agctgtgcag 481 tgtggctggc tggggttatg tctcaatgag cactttagca accacactgc aggaagtgtt 541 gctgacagtg cagaaggact gccagtgtga acgtctcttc catggcaatt acagcagagc 601 cactgagatt tgtgtggggg atccaaagaa gacacagacc ggtttcaagg gggactccgg 661 ggggcccctc gtgtgtaagg acgtagccca aggtattctc tcctatggaa ataaaaaagg 721 gacacctcca ggagtctaca tcaaggtctc acacttcctg ccctggataa agagaacaat 781 gaagcgcctc taacagcagg catgagacta accttcctct gggcctgacc atctctggga 841 cagaggcaag aatccccaag ggtgggagtc ggggttgcag gactgtaata aatggatcga 901 gcacgatatc // LOCUS HUMFKBP 1532 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human FK506-binding protein (FKBP) mRNA, complete cds. ACCESSION M34539 KEYWORDS FK506-binding protein; immunosuppressive agent FK506-binding protein; peptidylprolyl cis-trans isomerase. SOURCE Human peripheral blood T-cell, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1532) AUTHORS Maki,N., Sekiguchi,F., Nishimaki,J., Miwa,K., Hayano,T., Takahashi,N. and Suzuki,M. TITLE Complementary DNA encoding the human T-cell FK506-binding protein, a peptidylprolyl cis-trans isomerase distinct from cyclophilin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5440-5443 (1990) STANDARD full staff_entry COMMENT Draft entry and printed sequence for [1] kindly submitted by N.Takahasi, 5-16-1990, for release after publication. FEATURES from to/span description pept 79 405 FK506-binding protein (FKBP) signal 1507 1512 Poly-A signal BASE COUNT 341 a 386 c 339 g 466 t ORIGIN 1 gaattcgggc cgccgccagg tcgctgttgg tccacgccgc ccgtcgcgcc gcccgcccgc 61 tcagcgtccg ccgccgccat gggagtgcag gtggaaacca tctccccagg agacgggcgc 121 accttcccca agcgcggcca gacctgcgtg gtgcactaca ccgggatgct tgaagatgga 181 aagaaatttg attcctcccg ggacagaaac aagcccttta agtttatgct aggcaagcag 241 gaggtgatcc gaggctggga agaaggggtt gcccagatga gtgtgggtca gagagccaaa 301 ctgactatat ctccagatta tgcctatggt gccactgggc acccaggcat catcccacca 361 catgccactc tcgtcttcga tgtggagctt ctaaaactgg aatgacagga atggcctcct 421 cccttagctc cctgttcttg gatctgccat ggagggatct ggtgcctcca gacatgtgca 481 catgagtcca tatggagctt ttcctgatgt tccactccac tttgtataga catctgccct 541 gactgaatgt gttctgtcac tcagctttgc ttccgacacc tctgtttcct cttccccttt 601 ctcctcgtat gtgtgtttac ctaaactata tgccataaac ctcaagttat tcattttatt 661 ttgttttcat tttggggtga agattcagtt tcagtctttt ggatataggt ttccaattaa 721 gtacatggtc aagtattaac agcacaagtg gtaggttaac attagaatag gaattggtgt 781 tggggggggg gtttgcaaga atattttatt ttaatttttt ggatgaaatt tttatctatt 841 atatattaaa cattcttgct gctgcgctgc aaagccatag cagatttgag gcgctgttga 901 ggactgaatt actctccaag ttgagagatg tctttgggtt aaattaaaag ccctacctaa 961 aactgaggtg gggatgggga gagcctttgc ctccaccatt cccacccacc ctccccttaa 1021 accctctgcc tttgaaagta gatcatgttc actgcaatgc tggacactac aggtatctgt 1081 ccctgggcca gcagggacct ctgaagcctt ctttgtggcc tttttttttt ttcatcctgt 1141 ggtttttcta atggactttc aggaattttg taatctcata actttccaag ctccaccact 1201 tcctaaatct taagaacttt aattgacagt ttcaattgaa ggtgctgttt gtagacttaa 1261 cacccagtga aagcccagcc atcatgacaa atccttgaat gttctcttaa gaaaatgatg 1321 ctggtcatcg cagcttcagc atctcctgtt ttttgatgct tggctccctc tgctgatctc 1381 agtttcctgg cttttcctcc ctcagcccct tctcacccct ttgctgtcct gtgtagtgat 1441 ttggtgagaa atcgttgctg cacccttccc ccagcaccat ttatgagtct caagttttat 1501 tattgcaata aaagtgcttt atgcccgaat tc // LOCUS HUMINTA2 272 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human interferon alpha-2' gene, partial cds. ACCESSION M28028 KEYWORDS interferon alpha-2'. SOURCE Human leukocytes DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 272) AUTHORS Ovchinnikov,Y.A., Sverdlov,E.D., Monastyrskaya,G.S., Tsarev,S.A., Zaitseva,E.M., Arsenyan,S.G., Chakhmakhcheva,O.G., Novokhatskii,A.S., Aspetov,R.D. and Kuznetsov,V.P. TITLE Expression of mutant human interferon alpha-2 in Escherichia coli cells JOURNAL Mol. Biol. 18, 36-45 (1984) STANDARD simple staff_entry FEATURES from to/span description pept < 1 > 272 interferon alpha-2' (AA at 2) BASE COUNT 63 a 76 c 66 g 67 t ORIGIN 1 actggtggcc ctcctggtgc tcagctgcaa gtcaagctgc tctgtgggct gtgatctgcc 61 tcaaacccac aacctgggta gcaggaggac cttgatgctc ctggcacaga tgaggagaat 121 ctctcttttc tcctgcttga aggacagaca tgactttgga tttccccagg aggagtttgg 181 caaccagttc caaaaggctg aaaccatccc tgtcctccat gagatgatcc agcagatctt 241 caatctcttc agcacaaagg actcatctgc tg // LOCUS HUMMLC1AA 836 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human embryonic myosin alkali light chain (MLC1) mRNA, complete cds. ACCESSION M36172 KEYWORDS embryonic myosin alkali light chain. SOURCE Human embryo, cDNA to mRNA, clone GT14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 836) AUTHORS Arnold,H.-H., Lohse,P., Seidel,U. and Bober,E. TITLE A novel human myosin alkali light chain is developmentally regulated: Expression in fetal cardiac and skeletal muscle and in adult atria JOURNAL Eur. J. Biochem. 178, 53-60 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 57 650 embryonic myosin alkali light chain (MLC1) BASE COUNT 199 a 238 c 232 g 167 t ORIGIN 1 cagtctctcg gtttcttctc agatcactcc tctgccaaag atcccaacaa gacaacatgg 61 ctcccaagaa gcctgagcct aagaaggagg cagccaagcc agctccagct ccagctccag 121 cccctgcacc agcccctgcc ccagctcctg aggctcccaa ggaacctgcc tttgacccca 181 agagtgtaaa gatagacttc actgccgacc agattgaaga gttcaaagag gccttttcat 241 tgtttgaccg gaccccgact ggagagatga agatcaccta cggccagtgc ggggatgtac 301 tgcgggccct gggccagaac cctaccaatg ccgaggtgct gcgtgtgctg ggcaagccca 361 agcctgaaga gatgaatgtc aagatgctgg actttgagac gttcttgccc atcctgcagc 421 acatttcccg caacaaggag cagggcacct atgaggactt cgtggagggc ctgcgtgtct 481 ttgacaagga gagcaatggc acggtcatgg gtgctgagct tcggcacgtc cttgccaccc 541 tgggagagaa gatgactgag gctgaagtgg agcagctgtt agctgggcaa gaggatgcca 601 atggctgcat caattatgaa gcctttgtca agcacatcat gtcagggtga agcagagtct 661 tccaggtgcc tggcccttgg ctttagccat accagggtga gttaaagaga ggccccggct 721 gggtgagctg agatggagtc ctcgacttat caccacacca ctgccccaag gaccttacag 781 gccctccctg ttaataaaca gctctaacac ggccaggctg ggctctggga ttctga // LOCUS MUSAIC2B 3113 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse interleukin 3 receptor-like protein (AIC2B) mRNA, complete cds. ACCESSION M34397 KEYWORDS IL-3 receptor; interleukin 3 receptor. SOURCE Mouse, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3113) AUTHORS Gorman,D.M., Itoh,N., Kitamura,T., Schreurs,J., Yonehara,S., Yahara,I., Arai K,-I. and Miyajima,A. TITLE Cloning and expression of a novel interleukin 3 receptor-like gene: Identification of another member of the cytokine receptor gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5459-5463 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.M.Gorman, 22-MAY-1990, for release after publication. FEATURES from to/span description pept 288 2978 interleukin 3 receptor-like protein (AIC2B) precursor sigp 288 353 interleukin 3 receptor-like protein signal peptide matp 354 2975 interleukin 3 receptor-like protein signal peptide BASE COUNT 727 a 955 c 806 g 625 t ORIGIN 1 gagctgactg ggagatggtg gggagcgagc tactctggca gaactaaatg tcatggggca 61 cagataaata ggaagagcct gcaactcact ggcacttgga ggctcccgaa ggaggctgcc 121 tgtcgcccaa gcacagagcc acaaaggatg cagtctagga gggaagaatc acaagccctg 181 taagatgagt ggagccaaac ccccagccaa gcaccaatac agaaccccgg gacaatgagg 241 acacccccct gcccatagct tccagtgcag ccaccaaaag tgccaaaatg gaccagcaaa 301 tggcactcac atgggggctg tgctacatgg cactggtggc tctctgttgg ggacacgggg 361 tgacagaggc agaagaaacg gtccctctga agactctgca gtgctacaat gactacacca 421 accacatcat ctgcagctgg gcggacacag aggatgccca ggggctaatc aacatgaccc 481 tctatcacca gctagagaaa aaacagccag tgtcctgtga gctcagtgag aaactcatgt 541 ggtcagagtg cccgtcatcc caccgctgtg tgcccagaag atgtgtcatc ccctatacac 601 gattttccat cacaaacgaa gactactact ccttccggcc agatagtgat ctgggcatcc 661 agctcatggt gccacttgcc cagaatgtgc agccaccact tcccaagaac gtcagcatca 721 gctcctctga ggatcgtttc ctgctggagt ggagtgtgtc ccttggggat gcccaggtct 781 cctggctttc atcaaaggac atagagtttg aggtggctta taagcggctt caggactcct 841 gggaggatgc ctacagtctc cacactagca aatttcaggt gaatttcgag ccaaagctat 901 tcctacccaa cagcatctat gcgccccgtg tgcgcactcg gctgtacccg ggttcaagct 961 tgtctgggag acccagcaga tggagcccag aggctcactg ggactcccag ccaggggaca 1021 aggcccagcc acagaacctt caatgcttct ttgatgggat ccagtccctc cactgctcct 1081 gggaggtgtg gacccagacg actggctctg tttcctttgg gctcttctat cgccccagcc 1141 ctgtagctcc ggaggagaaa tgctctccgg tggtgaagga gccgccgggg gccagtgtct 1201 acacccgcta ccattgcagt ctacctgtgc ctgagcccag tgcacacagc cagtacacag 1261 tctctgttaa gcacctggaa caagggaagt tcatcatgag ctataaccac atccagatgg 1321 agcctccaac cctcaacctg accaagaaca gagacagcta cagcctgcat tgggaaactc 1381 agaagatggc ttactcattc attgagcaca cattccaggt ccagtacaag aagaaatcgg 1441 acagctggga ggacagcaag acagagaacc tagatcgagc ccatagcatg gacctctccc 1501 agctggagcc agacacctca tactgcgcca gggtgagggt caagcccatc tctaactacg 1561 atgggatctg gagcaagtgg agcgaagagt acacttggaa gactgactgg gtgatgccca 1621 cgctgtggat agtcctcatc ctggtctttc tcatcctcac cttgctcctg atccttcgct 1681 ttggctgtgt ctctgtatac aggacgtaca ggaagtggaa ggaaaagatc cccaacccca 1741 gcaagagcct cctgttccag gatggaggta aaggtctctg gcctcctggc agcatggcag 1801 ccttcgccac taagaacccc gctctccagg ggccacagag caggcttctt gctgagcaac 1861 agggggagtc atatgcacat ttggaagaca acaacgtgtc acctctcact atagaggacc 1921 ctaatataat tcgagttcca ccatccgggc ctgatacaac cccagctgcc tcatccgaat 1981 ccacagagca acttcccaat gttcaagtag agggaccaac tcctaacaga cctaggaagc 2041 aattacccag ctttgacttc aatgggccct acctggggcc tccccaatcc cactctctgc 2101 ctgatctccc agaccagctg ggttcccccc aggtgggtgg gagcctgaag ccagcactgc 2161 caggctcctt ggagtacatg tgtctggccc ctggaggtca agtgcaactg gttccattgt 2221 cccaggtgat ggggcagggc caggctatgg atgtgcagtg tgggtccagc ctggagacct 2281 cagggagccc ttctgtggag ccaaaggaga accctccagt tgagctgagc atggaggaac 2341 aggaggcacg ggacaaccca gtgactctgc ccataagctc tgggggccct gagggcagta 2401 tgatggcctc tgattatgtc actcctggag atccggtgct cactctgccc acagggcccc 2461 tgtctacctc tctgggcccc tctctagggt tgccctcagc ccaaagcccc agtctctgtc 2521 ttaagctgcc cagggtcccc tctggaagcc cagctctagg gccaccaggg tttgaggact 2581 atgtggagct gcctccaagt gtgagccagg ctgccaagtc ccctccaggc catcctgctc 2641 ctcctgtggc aagcagcccc acagtgatcc caggagagcc cagggaggaa gtgggcccag 2701 catccccaca tcccgaaggc ctccttgttc ttcagcaggt tggggactac tgcttcctcc 2761 ctggcctggg acctggctcc ctctcaccac acagtaagcc accctctcca agtctgtgtt 2821 ctgagactga ggacctagtc caggacttgt ctgtcaaaaa gtttccctat cagcccatgc 2881 cccaggcgcc agccattcag tttttcaagt ccctaaagca tcaggactac ctgtccctgc 2941 ccccttggga caatagccag tctgggaagg tgtgctgagt ctgtctcctc ccaatctcac 3001 cagcagcctg gcaccgcagc ctgtggtcct cagcctgagc atcaccacag aagcctctct 3061 gagttcacac tcctccttgc tcccagccct gacatggcaa tacccccacc tgt // LOCUS MUSIGHADP 363 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36209 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 363 Ig heavy-chain V region (AA at 1) BASE COUNT 88 a 89 c 98 g 88 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagaaatcgc 301 tactatggta actactggta cttcgatgtc tggggcgcag ggaccacggt caccgtctcc 361 tca // LOCUS MUSIGHADQ 357 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36210 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H146-24B1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 357 Ig heavy-chain V region (AA at 1) BASE COUNT 88 a 83 c 99 g 87 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagaggaaac 301 tataggtacg acgagtttgc ttactggggc caagggactc tggtcactgt ctctgca // LOCUS MUSIGHADR 336 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36211 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H147-25H1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 336) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 336 Ig heavy-chain V region (AA at 1) BASE COUNT 81 a 77 c 94 g 84 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatggggc 301 tactatggta acctactggt acttcgatgt ctgggg // LOCUS MUSIGHADS 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H163-130F2. ACCESSION M36212 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130F2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 86 a 83 c 97 g 88 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatactat 301 aggtacgaca ggtttgctta ctggggccaa gggactctgg tcactgtctc tgca // LOCUS MUSIGHADT 357 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H163-130H9. ACCESSION M36213 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130H9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 357 Ig heavy-chain V region (AA at 1) BASE COUNT 86 a 91 c 102 g 78 t ORIGIN 1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg 61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aaggggctac 301 ggtagccact ggtacttcga tgtctggggc gcagggacca cggtcaccgt ctcctca // LOCUS MUSIGHADU 331 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H2-4C2. ACCESSION M36214 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H2-4C2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 331 Ig heavy-chain V region (AA at 2) BASE COUNT 94 a 79 c 83 g 75 t ORIGIN 1 acctgagctg aagaagcctg gagagacagt caagatctcc tgcaaggctt ctggatatac 61 cttcacaaac tatggaatga attgggtgaa gcaggctcca ggaaagggtt taaagtggat 121 gggctggata aacacctaca ctggagagcc aacatatgct gatgacttca agggacggtt 181 tgccttccct ttggaaacct ctgccagcac tgcctatttg cagatcaaca acctcaaaaa 241 tgaggacacg gctacatatt tctgtgcaag atatgactac aactactggt acttcgatgt 301 ctggggcgca gggaccacgg tcaccgtctc c // LOCUS MUSIGHADV 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-1VH. ACCESSION M36215 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 88 a 86 c 99 g 81 t ORIGIN 1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg 61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aagtcagggg 301 tatggttacg ttgctatgga ctactggggt caaggaacct cagtcaccgt ctcc // LOCUS MUSIGHADW 360 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-10VH. ACCESSION M36216 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 360 Ig heavy-chain V region (AA at 1) BASE COUNT 89 a 90 c 92 g 89 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatcttac 301 tacggtagta gctactactt tgactactgg ggccaaggca ccactctcac agtctcctca // LOCUS MUSIGHADX 360 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-22VH. ACCESSION M36217 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-22. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 360 Ig heavy-chain V region (AA at 1) BASE COUNT 91 a 89 c 96 g 84 t ORIGIN 1 caggtgcagc tgaaggagtc aggacctggc ctggtggcgc cctcacagag cctgtccatc 61 acttgcactg tctctgggtt ttcattaacc agctatggtg tacactgggt tcgccagcct 121 ccaggaaagg gtctggagtg gctgggagta atatgggctg gtggaagcac aaattataat 181 tcggctctca tgtccagact gagcatcagc aaagacaact ccaagagcca agttttctta 241 aaaatgaaca gtctgcaaac tgatgacaca gccatgtact actgtgccag agggcactat 301 ggttacgacg gttactatgc tatggactac tggggtcaag gaacctcagt caccgtctcc // LOCUS MUSIGHADY 342 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-23VH. ACCESSION M36218 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-23. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 342) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 342 Ig heavy-chain V region (AA at 1) BASE COUNT 89 a 83 c 92 g 78 t ORIGIN 1 gaggtccagc ttcagcagtc aggacctgag ctggtgaaac ctggggcctc agtgaagata 61 tcctgcaagg cttctggata cacattcact gactacaaca tgcactgggt gaagcagagc 121 catggaaaga gccttgagtg gattggatat atttatcctt acaatggtgg tactggctac 181 aaccagaagt tcaagagcaa ggcaacattg actgtagaca attcctccag cacagcctac 241 atggagctcc gcagcctgac atctgaggac tctgcagtct attactgtgc aagagggggg 301 ctacgccctt ttgcttactg gggccaaggg actctggtca ct // LOCUS MUSIGHADZ 357 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-5VH. ACCESSION M36219 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 357) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 357 Ig heavy-chain V region (AA at 1) BASE COUNT 87 a 88 c 100 g 82 t ORIGIN 1 caggtccagc tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagatg 61 tcctgcaagg ctgctggata caccttcact aactactgga taggttgggt aaagcagagg 121 cctggacatg gccttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cacagcctac 241 atgcagctca gcagcctgac atctgaggac tctgccatct attactgtgc aagatgccct 301 tatggtaact acgtttggta cttcgatgtc tggggcgcag ggaccacggt caccgtc // LOCUS MUSIGHAEA 366 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-7VH. ACCESSION M36220 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 366) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 366 Ig heavy-chain V region (AA at 1) BASE COUNT 90 a 92 c 94 g 90 t ORIGIN 1 gaggtgaagc tggtggagtc tggaggaggc ttggtacagc ctgggggttc tctgagactc 61 tcctgtgcaa cttctgggtt caccttcact gattactaca tgagctgggt ccgccagcct 121 ccaggaaagg cacttgagtg gttgggtttt attagaaaca aagctaatgg ttacacaaca 181 gagtacagtg catctgtgaa gggtcggttc accatctcca gagataattc ccaaagcatc 241 ctctatcttc aaatgaacac cctgagagct gaggacagtg ccacttatta ctgtgcaaga 301 gacccccctt actacggcta cgctatggac tactggggtc aaggaacctc agtcaccgtc 361 tcctca // LOCUS MUSIGHAEB 328 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H230-2VH. ACCESSION M36221 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 328) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 328 Ig heavy-chain V region (AA at 2) BASE COUNT 87 a 86 c 80 g 75 t ORIGIN 1 acctggcctg gtggcgccct cacagagcct gtccatcaca tgcactgtct cagggttctc 61 attaaccagc tatggtgtaa gctgggttcg ccagcctcca ggaaagggtc tggagtggct 121 gggagtaata tggggtgacg ggagcacaaa ttatcattca gctctcatat ccagactgag 181 catcagcaag gataactcca agagccaagt tttcttaaaa ctgaacagtc tgcaaactga 241 tgacacagcc acgtactact gtgccaaacc ttatgattac gaagggtact actttgacta 301 ctggggccaa ggcaccactc tcacagtc // LOCUS MUSIGHAEC 331 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H230-5VH. ACCESSION M36222 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 331 Ig heavy-chain V region (AA at 2) BASE COUNT 74 a 88 c 86 g 83 t ORIGIN 1 ccctgggata ttgcagccct cccagaccct cagtctgact tgttctttct ctgggttttc 61 actgagcact tctggtatgg gtgtgagctg gattcgtcag ccttcaggaa agggtctgga 121 gtggctggca cacatttact gggatgatga caagcactat aacccatccc tgaagagccg 181 gctcacaatc tccaaggata cctccagcaa ccaggtattc ctcaagatca ccagtgtgga 241 cactgcagat actgccacat actactgtgc tcgaagagat ggttacgacg aggggtttgc 301 ttactggggc caagggactc tggtcactgt c // LOCUS MUSIGHAED 334 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H250-6VH. ACCESSION M36223 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H250-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 334 Ig heavy-chain V region (AA at 2) BASE COUNT 75 a 81 c 95 g 83 t ORIGIN 1 gggaggctta gtgaagcctg gagggtccct gaaactctcc tgtgcagcct ctggattcac 61 tttcagtagc tatgccatgt cttgggttcg ccagtctcca gagaagaggc tggagtgggt 121 cgcagaaatt agtagtggtg gtagttacac ctactatcca gacactgtga cgggccgatt 181 caccatctcc agagacaatg ccaagaacac cctgtacctg gaaatgagca gtctgaggtc 241 tgaggacacg gccatgtatt actgtgcaag ggatctcgac tatggtaact cctggtttgc 301 ttactggggc caagggactc tggtcactgt ctct // LOCUS MUSIGHAEE 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H280-10VH. ACCESSION M36224 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 88 a 90 c 90 g 86 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagatctaag 301 atcccgtact actttgacta ctggggccaa ggcaccactc tcacagtctc ctca // LOCUS MUSIGHAEF 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H280-15VH. ACCESSION M36225 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 95 a 88 c 89 g 82 t ORIGIN 1 caggtccagc ttcagcagtc tggggctgaa ctggcaaaac ctggggcctc agtgaagatg 61 tcctgcaagg cttctggcta cacctttact agctactgga tgcactgggt aaaacagagg 121 cctggacagg gtctggaatg gattggatac attaatccta gcactggtta tactgagtac 181 aatcagaagt tcaaggacaa ggccacattg actgcagaca aatcctccag cacagcctac 241 atgcaactga gcagcctgac atctgaggac tctgcactgt attactgtgc aagatgggtc 301 tattactatg ctatggacta ctggggtcaa ggaacctcag tcaccgtctc ctca // LOCUS MUSIGHAEG 360 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H35-C10VH. ACCESSION M36226 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 360) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 360 Ig heavy-chain V region (AA at 1) BASE COUNT 84 a 87 c 100 g 89 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc aagagggcga 301 actatgattc ccgcctggtt tgcttactgg ggccaaggga ctctggtcac tgtctctgca // LOCUS MUSIGHAEH 345 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H35-C7VH. ACCESSION M36227 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 345) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 345 Ig heavy-chain V region (AA at 1) BASE COUNT 85 a 78 c 96 g 83 t 3 others ORIGIN 1 caggttcagc tgcagcagtc tggagctgaa ctggtaaagc ctggggcttc agtgaagttg 61 tcctgcaagg cttctggcta caccttcaca agctatgata taaactgggt gaggcagagg 121 cctgaacagg gacttgagtg gattggatgg atttttcctg gagatggtag tactaagtac 181 aatgagaagt tcaagggcaa ggccacactg actacagaca aatcctccag cacagcctac 241 atgcagctca gcaggctgac atctgaggac tctgctgtct atttctgtgc aaactggnac 301 gggtttgctt actggnncca agggactctg gtcactgtct ctgca // LOCUS MUSIGHAEI 354 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H61-15VH. ACCESSION M36228 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H61-15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 354) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 354 Ig heavy-chain V region (AA at 1) BASE COUNT 90 a 89 c 93 g 82 t ORIGIN 1 caggtgcagc tgaaggagtc aggacctggc ctggtggcgc cctcacagag cctgtccatc 61 acttgcactg tctctgggtt ttcattaacc agctatggtg tacactgggt tcgccagcct 121 ccaggaaagg gtctggagtg gctgggagta atatgggctg gtggaagcac aaattataat 181 tcggctctca tgtccagact gagcatcagc aaagacaact ccaagagcca agttttctta 241 aaaatgaaca gtctgcaaac tgatgacaca gccatgtact actgtgccag agggcattac 301 tacggctact atgctatgga ctactggggt caaggaacct cagtcaccgt ctcc // LOCUS MUSIGHAEJ 351 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H9-A15VH. ACCESSION M36229 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-A15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 351) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 351 Ig heavy-chain V region (AA at 1) BASE COUNT 96 a 82 c 91 g 81 t 1 others ORIGIN 1 gttcagctgc agcagtctgg agctgagctg atgaagcctg ggncctcagt gaagatatcc 61 tgcaaggcta ctggctacag attcagtagc tactggatgg agtgggtaaa gcagaggcct 121 ggacatggcc ttgagtggat tggagagctt ttacctggaa atagtaatac taattacaat 181 gagaagttca agggcaaggc cacattcgct gcagatacat cctccaacac agcctacatg 241 caactcagca gcctgacatc tgaggactct gccgtctatt actgtgcaag agatctgaaa 301 aatggtgact cgttctactt tgactactgg ggccaaggca ccactctcac a // LOCUS MUSIGHAEK 333 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H9-D3VH. ACCESSION M36230 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-D3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 333) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 333 Ig heavy-chain V region (AA at 1) BASE COUNT 95 a 73 c 84 g 80 t 1 others ORIGIN 1 cagatccagt tggtccagtc tgnacctgag ctgaagaagc ctggagagac agtcaagatc 61 tcctgcaagg cttctgggta taccttcaca aactatggaa tgaactgggt gaagcaggct 121 ccaggaaagg atttaaagtg gatgggctgg ataaacacct atactggaga gccaacattt 181 gctgatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat 241 ttgcagatca acaacctcaa aaatgaggac acggctacat atttctgtgc aagaagagaa 301 aattcctact ggtatttcga tgtctggggc gca // LOCUS MUSIGHAEL 334 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. IC5-3A4HV. ACCESSION M36231 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma IC5-3A4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 334) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 334 Ig heavy-chain V region (AA at 2) BASE COUNT 86 a 69 c 95 g 84 t ORIGIN 1 gggagactta gtgaagcctg gagggtccct gaaactctcc tgtgcagcct ctggattcac 61 tttcagtagt tatggcatgt cttggtttcg ccagattccg gagaagaggc tggagtgggt 121 cgcaaccatt agtggtggtg gtagttatat ttactatgta gacagtgtga aggggcgatt 181 caccatctcc agagacaatg ccaagaacaa cctgtacctg caaatgagca gtctgaggtc 241 tgaggacacg gccttgtatt attgtgcaag agggaatata atataccacg aggactatgc 301 tatggactac tggggtcaag gaacctcagt cacc // LOCUS MUSIGHAEM 328 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L11-2E2HV. ACCESSION M36232 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L11-2E2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 328) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 328 Ig heavy-chain V region (AA at 2) BASE COUNT 85 a 78 c 96 g 69 t ORIGIN 1 ggcagagctt gtgaagccag gggcctcagt caagttgtcc tgcacagctt ctggcttcaa 61 cattaatgac acctatatgc actgggtgaa gcagaggcct gaacagggcc tggagtggat 121 tggaaggatt gatcctgcga atggtaatac taaatatgac ccgaagttcc agggcaaggc 181 cactataaca gcagacacat cctccaacac agcctacctg cagttcagca gcctgacatc 241 tgaggacact gccgtctatt actgtgatag agaggactac ggtagtaggt acggggggtg 301 gtacttcgat gtctggggcg cagggacc // LOCUS MUSIGHAEN 331 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L2-10C1HV. ACCESSION M36233 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-10C1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 331) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 331 Ig heavy-chain V region (AA at 2) BASE COUNT 90 a 78 c 89 g 74 t ORIGIN 1 acctgagctg aagaagcctg gagagacagt caagatctcc tgcaaggctt ctgggtatac 61 cttcacaaac tatggaatga actgggtgaa gcaggctcca ggaaagggtt taaagtggat 121 gggctggata aacacctaca ctggagagcc aacatatgct gatgacttca agggacggtt 181 tgccttctct ttggaaacct ctgccagcac tgcctatttg cagatcaaca acctcaaaaa 241 tgaggacacg gctacatatt tctgtgcaag atgggactac gggtactggt acttcgatgt 301 ctggggcgca gggaccacgg tcaccgtctc c // LOCUS MUSIGHAEO 328 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L2-5D2HV. ACCESSION M36234 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-5D2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 328) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 328 Ig heavy-chain V region (AA at 2) BASE COUNT 76 a 85 c 82 g 83 t 2 others ORIGIN 1 ccctgggata ttgcagccct cccagaccct cagtctgact tgttctttct ctgggttttc 61 actgagcact tctggtatgg gtgnnagctg gattcgtcag ccttcaggaa agggtctgga 121 gtggctggca cacatttact gggatgatga caagcgctat aacccatccc tgaagagccg 181 gctcacaatc tccaaggata cctccagaaa ccaggtattc ctcaagatca ccagtgtgga 241 cactgcagat actgccacat actactgtgc tcggtcgtat ggtaatgggg attactatgc 301 tatggactac tggggtcaag gaacctca // LOCUS MUSIGHAEP 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H120-2HV. ACCESSION M36235 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H120-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 89 c 78 g 74 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgta cacgttcgga 301 ggggggacca agctggaaat aaaa // LOCUS MUSIGHAEQ 317 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H130-5VK. ACCESSION M36236 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 317) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 317 Ig heavy-chain V region (AA at 1) BASE COUNT 91 a 76 c 71 g 79 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttccgtacac gttcggaggg 301 gggaccaagc tggaaat // LOCUS MUSIGHAER 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H130-6VK. ACCESSION M36237 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 92 c 78 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAES 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H146-24B1VK. ACCESSION M36238 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H146-24B1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 90 c 75 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc 301 tcggggacaa agttggaaat aaaa // LOCUS MUSIGHAET 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H147-25H1VK. ACCESSION M36239 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H147-25H1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 84 a 89 c 77 g 74 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact aacgttcgga 301 ggggggacca agctggaaat aaaa // LOCUS MUSIGHAEU 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H163-130H9. ACCESSION M36240 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130-F2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 92 c 78 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAEV 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H163-130H9. ACCESSION M36241 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H163-130H9. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 92 c 78 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccgct cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAEW 321 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H2-4C2. ACCESSION M36242 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H2-4C2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 321 Ig heavy-chain V region (AA at 1) BASE COUNT 93 a 79 c 69 g 80 t ORIGIN 1 aatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca 121 gatggaactc ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccacttacta ttgtcagcac tatagtgagc ttccgtggac gttcggtgga 301 ggcaccaagc tggaaatcaa a // LOCUS MUSIGHAEX 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-1. ACCESSION M36243 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 90 c 75 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc 301 tcggggacaa agttggaaat aaaa // LOCUS MUSIGHAEY 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-10. ACCESSION M36244 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 90 c 75 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggc 301 tcggggacaa agttggaaat aaaa // LOCUS MUSIGHAEZ 303 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-22. ACCESSION M36245 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-22. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 303 Ig heavy-chain V region (AA at 1) BASE COUNT 82 a 78 c 73 g 70 t ORIGIN 1 tctcacaaat tcatgtccac atcagtagga gacagggtca gcatcacctg caaggccagt 61 caggatgtgg gtactgctgt agcctggtat caacagaaac cagggcaatc tcctaaacta 121 ctgatttact gggcatccac ccggcacact ggagtccctg atcgcttcac aggcagtgga 181 tctgggacag atttcactct caccattagc aatgtgcagt ctgaagactt ggcagattat 241 ttctgtcagc aatatagcag ctatcctcgg acgttcggtg gaggcaccaa gctggaaatc 301 aaa // LOCUS MUSIGHAFA 303 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-23. ACCESSION M36246 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-23. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 303 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 71 c 70 g 83 t 1 others ORIGIN 1 tctccatcct ccttatctgc ctctctggga gaaagagtca gtctcacttg tcgggcaagt 61 caggacattg gtagtagctt aaactggctt cagcaggaac cagatggaac tattaaacgc 121 ctgatctacg ccacatccag tttagattct ggtgtcccca aaaggttcag tggcagtagg 181 tctgggtcag attattctct caccatcagc agccttgagt ctgaagattt tgtagactat 241 tactgtctac aatatgctag ttctccgtac acgttcggag gggggaccaa gctgnaaata 301 aaa // LOCUS MUSIGHAFB 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-5. ACCESSION M36247 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 79 a 92 c 77 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAFC 297 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H220-7. ACCESSION M36248 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H220-7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 297) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 297 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 72 c 73 g 69 t 5 others ORIGIN 1 aaattcatgt ccacatcagt aggaganagg gtcagcatca cctgcaaggc cagtcaggat 61 gtgagtactg ctgtagcctg gtatcaacag aaaccaggac aatctcctaa actactgatt 121 tactcggcat cctaccggta cactggagtc cctgatcgct tcactggcag tggatctggg 181 nnggatttca ctttcaccat cagcagtgtg caggctgaag acctggcagt ttattactgt 241 cagcnacatn atagtactcc gtggacgttc ggtggaggca ccaagctgga aatcaaa // LOCUS MUSIGHAFD 315 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H230-2. ACCESSION M36249 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 315) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 315 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 80 c 75 g 77 t ORIGIN 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccattcac gttcggctcg 301 gggacaaagt tggaa // LOCUS MUSIGHAFE 303 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H230-5. ACCESSION M36250 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H230-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 303) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 303 Ig heavy-chain V region (AA at 1) BASE COUNT 77 a 77 c 75 g 74 t ORIGIN 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccgtggac gttcggtgga 301 ggc // LOCUS MUSIGHAFF 318 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H250-6. ACCESSION M36251 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H250-6. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 318 Ig heavy-chain V region (AA at 1) BASE COUNT 83 a 81 c 77 g 76 t 1 others ORIGIN 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc 61 atcacctgca aggccagtca ggatgtgagt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaactact gatttactcg gcatcctacc ggtacactgg agtccctgat 181 cgcttcactg gcagtggatc tgggacggat ttcactttca ccatcagcag tgtgcaggct 241 gaagacctgg cagtttatta ctgtcagcaa cattatagta ctccgtgnac gttcggtgga 301 ggcaccaagc tggaaatc // LOCUS MUSIGHAFG 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H280-10. ACCESSION M36252 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 79 a 92 c 77 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAFH 312 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H280-15. ACCESSION M36253 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H280-15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 312 Ig heavy-chain V region (AA at 1) BASE COUNT 72 a 88 c 79 g 73 t ORIGIN 1 gaaaatgtgc tcacccagtc tccagcaatc atgtctgcat ctccagggga aaaggtcacc 61 atgacctgca gggccagctc aagtgtaagt tccagttact tgcactggta ccagcagaag 121 tcaggtgcct cccccaaact ctggatttat agcacatcca acttggcttc tggagtccct 181 gctcgcttca gtggcagtgg gtctggcacc tcttactctc tcacaatcag cagtgtggag 241 gctgaagatg ctgccactta ttactgccag cagtacagtg gttacccgta cacgttcgga 301 ggggggacca ag // LOCUS MUSIGHAFI 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H35-C10. ACCESSION M36254 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 79 a 92 c 77 g 76 t ORIGIN 1 gaaattgtgc tcacccagtc tccagcactc atggctgcat ctccagggga gaaggtcacc 61 atcacctgca gtgtcagctc aagtataagt tccagcaact tgcactggta ccagcagaag 121 tcagaaacct cccccaaacc ctggatttat ggcacatcca acctggcttc tggagtccct 181 gttcgcttca gtggcagtgg atctgggacc tcttattctc tcacaatcag cagcatggag 241 gctgaagatg ctgccactta ttactgtcaa cagtggagta gttacccact cacgttcggt 301 gctgggacca agctggagct gaaa // LOCUS MUSIGHAFJ 302 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H35-C7. ACCESSION M36255 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H35-C7. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 302) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 302 Ig heavy-chain V region (AA at 1) BASE COUNT 75 a 76 c 77 g 74 t ORIGIN 1 gcttctttgg ctgtgtctct agggcagagg gccaccatat cctgcagagc cagtgaaagt 61 gttgatagtt atggcaatag ttttatgcac tggtaccagc agaaaccagg acagccaccc 121 aaactcctca tctatcgtgc atccaaccta gaatctggga tccctgccag gttcagtggc 181 agtgggtcta ggacagactt caccctcaca attaatcctg tggaggctga tgatgttgca 241 acctattact gtcagcaaag taatgaggat ccgctcacgt tcggtgctgg gaccaagctg 301 ga // LOCUS MUSIGHAFK 321 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H61-15. ACCESSION M36256 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H61-15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. BASE COUNT 95 a 75 c 78 g 71 t 2 others ORIGIN 1 gacatcaaga tgacccagtc tccatcctcc atgtatgcnt cgctgggaga gagagtcact 61 atcacttgca aggcgagtca ggacattaaa agctatttaa gctggtacca gcagaaacca 121 tggaaatctc ctaagaccct gatctattat gcaacaagct tggcagatgg ggtcccatca 181 agattcagtg gcagtggatc tgggcaanat tattctctaa ccatcagcag cctggagtct 241 gacgatacag caacttatta ctgtctacag catggtgaga gcccgtacac gttcggaggg 301 gggaccaagc tggaaataaa a // LOCUS MUSIGHAFL 324 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H9-A15. ACCESSION M36257 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-A15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 324) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 324 Ig heavy-chain V region (AA at 1) BASE COUNT 72 a 94 c 76 g 82 t ORIGIN 1 caaattgttc tttcccagtc tccagcagtc atgtctgcat ctcctgggga gaaggtcacc 61 ttgacctgca gtgccagctc aagtgtaagt tccacctact tgtactggta ccagcagaag 121 ccaggatcct cccccaaact ctggatttat agcacatcca acctggcttc tggagtccct 181 gctcgcttca gtggcagtgg gtctgggacc tcttactctc tcacaatcag cagcatggag 241 gctgaagatg ctgcctctta tttctgccat cagtggagta gttacccacc gacgttcggt 301 ggaggcacca agctggaaat caaa // LOCUS MUSIGHAFM 300 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. H9-D3. ACCESSION M36258 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H9-D3. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 300) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 300 Ig heavy-chain V region (AA at 1) BASE COUNT 81 a 69 c 63 g 87 t ORIGIN 1 cagattacat cctccctgtc tgtctctctg ggagacagag tcaccatcag ttgcagtgca 61 agtcagggca ttaacagtta tttaaactgg tatcagcaga aaccggatgg aactgttaaa 121 ctcctgatct atttcacatc aattttatat tcaggagtcc catcaaggtt cagtggcagt 181 gggtctggga cagattattc tctcaccatc agcaacttgg aacctgaaga tattgccact 241 tactattgtc agcagtatac tattcttcca ttcacgttcg gctcggggac aaagttggaa // LOCUS MUSIGHAFN 318 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. IC5-3A4. ACCESSION M36259 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma IC5-3A4. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 318) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 318 Ig heavy-chain V region (AA at 1) BASE COUNT 87 a 80 c 75 g 76 t ORIGIN 1 gacattgtga tgacccagtc tcacaaattc atgtccacat cagtaggaga cagggtcagc 61 atcccctgca aggccagtca ggatgtaggt actgctgtag cctggtatca acagaaacca 121 ggacaatctc ctaaaatact gatttactgg gcatccaacc ggcaaattgg agtccctgat 181 cgcttcacag gcagtggatc tgggacagat ttcactctca ccattagcta tgtgcagtct 241 gaagacttgg cagattattt ctgtcagcaa tatagcacct atcccacgtt cggtgctggg 301 accaagctgg agctgaaa // LOCUS MUSIGHAFO 312 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L11-2E2. ACCESSION M36260 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L11-2E2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 312) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 312 Ig heavy-chain V region (AA at 1) BASE COUNT 92 a 76 c 67 g 77 t ORIGIN 1 gacatccaga tgacacagtc tccatcctca ctgtctgcat ctctgggagg caaagtcacc 61 atcacttgca aggcaagcca agacattaac aagtatatag cttggtacca acacaagcct 121 ggaaaaggtc ctaggctgtt catacattat acatctacat tacagccagg catcccatca 181 aggttcagtg gaagtgggtc tgggagacat tattccttca gcatcagcaa cctggagcct 241 gaagatattg caacttatta ttgtctacag tatgataatc ttctgacgtt cggtggaggc 301 accaagctgg aa // LOCUS MUSIGHAFP 321 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L2-10C1. ACCESSION M36261 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-10C1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 321) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 321 Ig heavy-chain V region (AA at 1) BASE COUNT 93 a 77 c 71 g 80 t ORIGIN 1 gatatccaga tgacacagac tacatcctcc ctgtctgcct ctctgggaga cagagtcacc 61 atcagttgca gtgcaagtca gggcattagc aattatttaa actggtatca gcagaaacca 121 gatggaactg ttaaactcct gatctattac acatcaagtt tacactcagg agtcccatca 181 aggttcagtg gcagtgggtc tgggacagat tattctctca ccatcagcaa cctggaacct 241 gaagatattg ccacttacta ttgtcagcag tatagtaagc ttccgtggac gttcggtgga 301 ggcaccaagc tggaaatcaa a // LOCUS MUSIGHAFQ 288 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. L2-5D2. ACCESSION M36262 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma L2-5D2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 288) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 288 Ig heavy-chain V region (AA at 1) BASE COUNT 78 a 65 c 68 g 73 t 4 others ORIGIN 1 cagtctccag cctccctgtc tgcatctgtg ggagaaactg tcaccatcac atgtcgagca 61 agtgggaata ttcacaatta tttagcatgg tatcagcaga nacagggaaa atctcctcag 121 ctcctggtct ataatgcaaa aaccttagca gatggtgtgc catcaaggtt cagtggcagt 181 ggatcaggaa cacaatattc tctcaagatc aacagcctgc agcctgaaga ttttgggagt 241 tattactgtc aacatttttg gnnnactccg tacacgttcg gagggggg // LOCUS MUSIGHAFR 363 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36207 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H120-2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 363) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 363 Ig heavy-chain V region (AA at 1) BASE COUNT 85 a 88 c 101 g 89 t ORIGIN 1 caggtccagt tgcagcagtc tggagctgag ctggtaaggc ctgggacttc agtgaagata 61 tcctgcaagg cttctggcta caccttcact aactactggc taggttgggt aaagcagagg 121 cctggacatg gacttgagtg gattggagat atttaccctg gaggtggtta tactaactac 181 aatgagaagt tcaagggcaa ggccacactg actgcagaca catcctccag cactgcctac 241 atgcagctca gtagcctgac atctgaggac tctgctgtct atttctgtgc gaggatttac 301 tacggtagta gctactggta cttcgatgtc tggggcgcag ggaccacggt caccgtctcc 361 tca // LOCUS MUSIGHAFS 348 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse Ig heavy-chain mRNA V region, partial cds. ACCESSION M36208 KEYWORDS immunoglobulin heavy-chain; processed gene; variable region. SOURCE Mouse (strain BALB/c), cDNA to mRNA, from hybridoma H130-5. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Kavaler,J., Caton,A.J., Staudt,L.M., Schwartz,D. and Gerhard,W. TITLE A set of closely related antibodies dominates the primary antibody response to influenza virus hemagglutinin JOURNAL J. Immunol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Kavaler, 06-JUL-1990. FEATURES from to/span description pept < 1 > 348 Ig heavy-chain V region (AA at 1) BASE COUNT 96 a 80 c 93 g 78 t 1 others ORIGIN 1 cagatccagt tggtccagtc tgnacctgag ctgaagaagc ctggagagac agtcaagatc 61 tcctgcaagg cttctgggta taccttcaca aactatggaa tgaactgggt gaagcaggct 121 ccaggaaagg gtttaaagtg gatgggctgg ataaacacct acactggaga gccaacatat 181 gctgatgact tcaagggacg gtttgccttc tctttggaaa cctctgccag cactgcctat 241 ttgcagatca acaacctcaa aaatgaggac acggctacat atttctgtgc aagaggagat 301 ggtaactact ggtacttcga tgtctggggc gcagggacca cggtcacc // LOCUS MUSMHEA2A1 478 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II A-beta 2 gene, exon 2. ACCESSION M30158 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Mouse (haplotype SAX2) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 478) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept / 154 + 423 A-beta 2 chain, exon 2 (AA at 156) IVS < 1 153 A-beta 2 chain intron A IVS 424 > 478 A-beta 2 chain intron B BASE COUNT 118 a 99 c 132 g 129 t ORIGIN 1 aaataaatgc caataagcat taaatgacat tgactttgtc tcttaatgtt gggcttttga 61 gcacattgct atctgactta gtgtaccagg tacaggcttt ccctgaggtg cggctggttc 121 ctaactggtt cctaatcctt ttatggtttc cagagaattt tgtgattcag gcaaaggcag 181 actgttactt caccaatggg acagaaaagg ttcacttcct ggtcaggttc atcttcaacc 241 tggaggagta tttacatttc gacagtgacc tggggatgtt cgtggcgctg acggagctgg 301 gggaacctga cgctgaccag tggaacaaac gcctagatct cctggagatg agcagagccg 361 ctgtgaacat ggtctgcagg cagaagtaca agctgggggc ccccttcatc gtggagagaa 421 atggtgggta acagcaaggt ctaggggttc tccgtgagaa ccttgtaata ggcagttt // LOCUS MUSMHEA2A2 526 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II A-beta 2 gene, exon 3. ACCESSION M30159 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (haplotype SAX2) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 526) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept + 49 / 330 A-beta 2 chain, exon 3 (AA at 49) IVS < 1 48 A-beta 2 chain intron B IVS 331 > 526 A-beta 2 chain intron C BASE COUNT 115 a 135 c 136 g 140 t ORIGIN 1 tctagatgct cctgtccttg ctgtgccttt tgtttcttct cttgccagtg cctccagagg 61 tgacagtgta cccagagagg accccattgc tgcagcagca caacctgctg ctctgctctg 121 tgacaggctt ctaccccagg gacataagtg tcaagtggtt ccggaatgga caggaggagc 181 ggtctggggt catgtccact ggccttgtta ggaatggaga ctggaccttc cagacaacag 241 taatgctgga aatgatccca gagcttggtg acatctacag ctgcctcgtg gagcaccccg 301 gcctcctgag acctgtttct gtggcatgga gtgagcttca tttagttttc tgtatttctg 361 gaccttgacc tgtaggagct atcatttctg tgttcatctt gatgacaaga caagttcccc 421 tgatctaaga atcctagtgc cagggcagga gaaaacagga caggcatcca ttctctgaat 481 gtccaagaga ggctgttaac ccacaacttg cctgtaatcc tgggac // LOCUS MUSMHEAII1 491 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II A-beta 2 gene, exon 2. ACCESSION M30156 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Mouse (haplotype SAX1) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 491) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept / 158 + 427 A-beta 2 chain, exon 2 (AA at 160) IVS < 1 157 A-beta 2 chain intron A IVS 428 > 491 A-beta 2 chain intron B BASE COUNT 123 a 97 c 135 g 136 t ORIGIN 1 aaataaatgt caataagcat taaatgacat taagtgactt tgtctcttaa tgttgggctt 61 ttgagcacat tgctatctga cttagtgtac caggtacagg ctttcccaga ggtgcggctg 121 gttcctaact ggttcctaat ccttttatgg tttccagaga attttgtgat tcaggcaaag 181 gcggactgtt acttcaccaa tgggacagaa aaggttcact tcctggtcag gttcatcttc 241 aacctggagg aatatttaca tttcgacagt gacctgggga tgtttgtggc gctgacggag 301 ctgggggaac ctgacgctga ccagtggaac aaacgcctgg atctcctgga gacgagcaga 361 gccgctgtga acatggtctg caggcagaag tacaatctgg gggccccctt cattgtggag 421 agaaatggtg ggtaacagca aggtctaggg gttctccgtg agaaccttgt aataggcatt 481 tgaaaggttt t // LOCUS MUSMHEAII2 592 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II A-beta 2 gene, exon 3. ACCESSION M30157 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (haplotype SAX1) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 592) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept + 49 / 330 A-beta 2 chain, exon 3 (AA at 50) IVS < 1 48 A-beta 2 chain intron B IVS 331 > 592 A-beta 2 chain intron C BASE COUNT 138 a 147 c 148 g 159 t ORIGIN 1 tctagatgct cctgtccttg ctgtgccttt tgtttcttct cttgccagtg tctccagagg 61 tgacagtgta cccagagagg accccattgc tgcagcagca caacctgctg ctctgctctg 121 tgacaggctt ctatcccggg aacataagtg tcaagtggtt ccggaatgga caggaggaga 181 ggtctggggt catgtccact ggccttgtta ggaatggaga ctggaccttc cagacaacag 241 taatgctgga aataatccca gagcttggtg acatctacag ctgcctcgtg gagcaccccg 301 gcctcctgag acctgtttct gtggcatgga gtgagcttta ttagttttct gtaattctgg 361 accttgacct gtaggagctg tcatttctgt gtcttgatga caagacaagt tcccctgatc 421 taagaatcct aatgccaggg gcaggagaaa acaggacagg catccattct ctgaatgtcc 481 aagagataga taaggaggct gttaacccac aacttgcctc taaccctggg acctagtatc 541 ctttagcact ttaatcttta cagacactgc cactggctat aagtcaggga tc // LOCUS MUSMHEB2B1 518 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II E-beta 2 gene, exon 2. ACCESSION M30160 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Mouse (haplotype SAX) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 518) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept / 92 + 358 E-beta 2 chain, exon 2 (AA at 94) IVS < 1 91 E-beta 2 chain intron A IVS 359 > 518 E-beta 2 chain intron B BASE COUNT 120 a 94 c 151 g 153 t ORIGIN 1 ctgcaggtgt cacagttcct ttcactgact gccattctgg agcattgtct gtcctcacag 61 acatcctgta attgtgttca tgtacccaca gcacgttttc tggagcagtt gaaggctgag 121 tgtcactact tcaatgggaa ggagcgtgtg tggagtgtga ccagattcat ctataaccag 181 gaagagtttg tccgctttaa cactgacttt gggaagttcc tggcagtgac tgagctgggg 241 cggcccatag ctgagtactt gaacacccag aaggacatgc tggacaatta tcgtgcctca 301 gtggacaggt gcagaaataa ctatgacctt gtggatatct tcatgttgaa cttaaaaggt 361 aagcattaga tagagagtag atgggttgag gggtgtgtgt gtgtgtgtgt gtgtgtgtgt 421 gtgtgtgtgt gtgtgtaaga gaggatagtt ctgtgtcacg tgtgtggaga tcataagaca 481 ggttactgat tctcgccatc tatcctatgg tcctgaga // LOCUS MUSMHEB2B2 573 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II E-beta 2 gene, exon 3. ACCESSION M30161 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (haplotype SAX) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 573) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept + 97 / 378 E-beta 2 chain, exon 3 (AA at 97) IVS < 1 96 E-beta 2 chain intron B IVS 379 > 573 E-beta 2 chain intron C BASE COUNT 138 a 155 c 143 g 137 t ORIGIN 1 gagatcttta gcctgggtgg atggaggcag gtaggcagcc cacagggaga acagctctat 61 cctcagggct ctctctcacc tccctttctc ctctagctga gcccaaggtg actgtgtatc 121 ctgcaaagac acagcctctg gaacaccaca acctcctggt ctgctctgtg aatgacttct 181 accctggcag cattgaagtc agatggttcc ggaatggcga ggaggagaag actggagttg 241 tgtccaccgg actgatccaa aatggagatt ggacctacca gaccctgttg atgctggaga 301 tggttcctca gggtggagag gtttacacct gccaggtgga gcatcccagc ctgaccagcc 361 ctgtcacagt ggagtggagt gagtgggaaa cccttgactc tgcaaatgcc caaccaccat 421 ataataggag tgactttctc tgcctgttcc ctttctctga ccctgtaaat ccctgcctac 481 cagggaagca catgggtgac tccacaggaa aaatgaatac aatttccaag ttactagctt 541 ttattcctca cctagttcaa catctatgcc agt // LOCUS MUSMHEBI1 445 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II E-beta 1 gene, exon 2. ACCESSION M30154 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 1 of 2 SOURCE Mouse (haplotype SAX) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 445) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept / 11 + 280 E-beta 1 chain, exon 2 (AA at 13) IVS < 1 10 E-beta 1 chain intron A IVS 281 > 445 E-beta 1 chain intron B BASE COUNT 101 a 108 c 161 g 75 t ORIGIN 1 gtctccacag cacggttttt ggaacaagct aaacatgagt gtcatttcta caacggaacg 61 cagcgcgtgc ggtttctgtt gagacaaatc cacaacaggg aggagaacgt gcgcttcgac 121 agcgacctgg gcgagttccg cgcagtgaac gagctggggc ggccagacgc cgagtacttc 181 aacagccaga aggaggtcct ggaggatttg cgggccgctg tggacacgta ctgcagacac 241 aactacgaga tcttagataa cttccttgtg ccgcggagag gtgagacagg aggcggtggg 301 tggggcggaa ccacggtgag ggtggggctg tggggagcag caggaggcgg tgcgcacgtg 361 cgcaggagcc gcaggggatg ctgggttccc tgccgctgga gccacaggcg gctttaagca 421 gcctcatggc aggagaacag aattc // LOCUS MUSMHEBI2 525 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse MHC class II E-beta 1 gene, exon 3. ACCESSION M30155 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SEGMENT 2 of 2 SOURCE Mouse (haplotype SAX) liver DNA. ORGANISM Mus saxicola Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 525) AUTHORS Cam,P., Jouvin-Marche,E., Leguern,C. and Marche,P.N. TITLE Structure of class II genes in wild mouse Mus saxicola: Functional and evolutionary implications JOURNAL Eur. J. Immunol. 20, 1337-1343 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Cam, 27-NOV-1989, for release after publication. FEATURES from to/span description pept + 102 / 383 E-beta 1 chain, exon 3 (AA at 102) IVS < 1 101 E-beta 1 chain intron C IVS 384 > 525 E-beta 1 chain intron D BASE COUNT 116 a 144 c 141 g 124 t ORIGIN 1 ggatccagtc tggatggatg gaggcaggca ggcaggcagg cagcctacag gaaggacagc 61 tccaccctca tggcaccttc tcacctctct ttctcctcta gttgagccta cagtgactgt 121 gtaccctaca aagacacagc ccctggaaca ccacaacctc ctggtctgct ctgtgagtga 181 cttctaccca ggcagcattg aagtcagatg gttccggaat ggtgaggagg agaagacggg 241 aattgtgtcc actggcctga tccgaaatgg agactggact ttccagatgc tggtgatgct 301 ggagacggtt cctcaggtgg aagaggttta cacctgccag gtggagcatc ccagcctgac 361 cagccctgtc acagtccagt ggagtgagtg ataacttcca tactctgtga atgtcattcc 421 accagggagg gggcgtggct tatccctgcc tgtcagcttt ctctgcccac atactctttc 481 cactggtttt atgctgtcct gactttcacc acagcttaaa gggaa // LOCUS MUSMMTVA 448 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse mammary tumor virus long terminal repeat. ACCESSION M35959 KEYWORDS long terminal repeat. SOURCE Mouse lymphoma cell line MLA DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 448) AUTHORS Yanagawa,S.-I., Murakami,A. and Tanaka,H. TITLE Extra mouse mammary tumor proviruses in DBA/2 mouse lymphomas acquire a selective advantage in lymphocytes by alteration in the U3 region of the long terminal repeat JOURNAL J. Virol. 64, 2474-2483 (1990) STANDARD simple staff_entry BASE COUNT 150 a 70 c 95 g 133 t ORIGIN 1 aatccaatag gtagattatt aattactatg ttaagaaatg aatcattatc ttttagtact 61 atttttactc aaattcagaa gttagaaatg ggaatagaaa atagaaagag acgctcagcc 121 tcagttaaag aacaggtgca agagctaacg gccgcaggcc tagaagtaaa aaagggaaag 181 aagagtgtgc ttgtcaaaat aagacacagg tggtggcaac cagggactta taggggacct 241 tacatctaca gaccaacaga tgcccccttt cttaaaacaa ggatgtgaga caggtggttt 301 cctgagtcgg tttggtatca aatgttctaa tctgagctct tagtgttcta ttttcctgtg 361 ttcttttgga atctatccaa gtcttatgta aatgcttatg taaaccatga tataaaagag 421 tgctgatttt tttgagtaaa tttgcaac // LOCUS MUSMMTVB 527 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse mammary tumor virus long terminal repeat. ACCESSION M35960 KEYWORDS long terminal repeat. SOURCE Mouse lymphoma cell line DL8 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 527) AUTHORS Yanagawa,S.-I., Murakami,A. and Tanaka,H. TITLE Extra mouse mammary tumor proviruses in DBA/2 mouse lymphomas acquire a selective advantage in lymphocytes by alteration in the U3 region of the long terminal repeat JOURNAL J. Virol. 64, 2474-2483 (1990) STANDARD simple staff_entry BASE COUNT 169 a 82 c 120 g 156 t ORIGIN 1 aatccaatag gtagattatt aattactatg ttaagaaatg aatcattatc ttttagtact 61 atttttactc aaattcagaa gttagaaatg ggaatagaaa atagaaagag acgctcagcc 121 tcagttgaag aacaggtgca agagctaacg gccgcaggcc tagaagtaaa aaagggaaag 181 aagagtgtgc ttgtcaaaat aggagacagg tggtggcaac cagggactta taggggacct 241 tacatctaca gacggttagc ttggcttccc agggttcaaa ctgttcttaa aacaaggatg 301 tgagacaagt gcgcaacggt tagcttggct tcccagggtt caaactgttc ttaaaacaag 361 gatgtgagac aagtggtttc ctgagttggt ttggtatcaa atgttctaat ctgagctctt 421 agtgttctat tttcctatgt tcttttggaa tctatccaag tcttatgtaa atgcttatgt 481 aaaccatgat ataaaagagt gctgattttt ttgagtaaac ttgcaac // LOCUS PFDSYN 85 bp ds-DNA SYN 14-AUG-1990 DEFINITION Bacteriophage fd synthetic G2 promoter region DNA. ACCESSION M24953 KEYWORDS promoter. SOURCE Synthetic DNA. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 85) AUTHORS Ovchinnikov,Y.A., Efimov,V.A. and Chakhmakcheva,O.G. TITLE Synthesis of a polynucleotide corresponding to the promoter region of bacteriophage fd DNA JOURNAL FEBS Lett. 100, 341-346 (1979) STANDARD simple staff_entry FEATURES from to/span description mRNA < 1 14 G2 mRNA BASE COUNT 34 a 17 c 15 g 19 t ORIGIN 1 aaatcagtct ttaccctgtc tattatcgtc agaagcaaag cgaattgcat caaaaagatt 61 aagaggaagc ccgaaagact tcaaa // LOCUS PRMSPCA 200 bp ds-DNA BCT 14-AUG-1990 DEFINITION P.vulgaris spc r-protein operon L24 gene, 3' end, and L5 gene, 5' end. ACCESSION M36264 KEYWORDS r-protein. SOURCE P.vulgaris DNA. ORGANISM Proteus vulgaris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 200) AUTHORS Mattheakis,L.C., Vu,L. and Nomura,M. TITLE Nucleotide sequence of the L24/L5 intercistronic region of Proteus vulgaris, Serratia marcescens and Salmonella typhimurium JOURNAL J. Mol. Biol. 204, 326-329 (1988) STANDARD simple staff_entry REFERENCE 2 (sites) AUTHORS Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M. TITLE Translational regulation of the spc operon in Escherichia coli: Identification and structural analysis of the target site for S8 repressor protein JOURNAL J. Mol. Biol. 204, 309-329 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 57 r-protein L24 pept 71 > 200 r-protein L5 BASE COUNT 64 a 36 c 47 g 53 t ORIGIN 1 ttcgaagacg gcaaaaaagt tcgtttcttc aagtctaata aagaaactat caagtaattt 61 ggagtatacg atggcgaaac tgcatgatta ctacaaagac gaggtagtcc aaaaactgat 121 gtctcagttt ggttaccatt ctgtcatgca agtccctcgg gtcgagaaga tcaccctgaa 181 tatgggtgtt ggtgaagcga // LOCUS PSCTETR 1110 bp ds-DNA BCT 14-AUG-1990 DEFINITION Plasmid pSC101 tetracycline resistance element repressor protein (tetR) gene, complete cds. ACCESSION M36272 KEYWORDS tetracycline resistance element repressor protein. SOURCE Plasmid pSC101 DNA. ORGANISM Plasmid pSC101 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 1110) AUTHORS Brow,M.A.D., Pesin,R. and Sutcliffe,J.G. TITLE The tetracycline repressor protein of pSC101 JOURNAL Mol. Biol. Evol. 2, 1-12 (1985) STANDARD simple staff_entry FEATURES from to/span description pept 42 701 tetracycline resistance element repressor protein (tetR) BASE COUNT 235 a 298 c 351 g 226 t ORIGIN 1 aagcttatcg atgataagct gtcaaacatg agaattcgcg aatgaacaag ctccaacgcg 61 aggccgtgat ccgaaccgcg ctcgaactgc ttaacgacgt gggcatggaa ggtctaacga 121 cgcgccgact ggctgagcgc ctcggggtgc aacagccagc gctctactgg catttcaaga 181 acaagcgtgc gttgctcgac gcacttgccg aagccatgct gacgataaat cacacgcatt 241 cgacgccaag ggatgacgac gactggcgtt cgttcctgaa gggcaatgca tgcagttttc 301 gacgggcgtt gctcgcttat cgcgatggcg cgcgtattca tgccgggacg cggccagccg 361 cgccgcagat ggaaaaagcc gacgcgcagc ttcgcttcct ttgcgatgct ggcttttcgg 421 caggtgacgc gacctatgcg ttgatggcaa tcagctactt caccgtcggc gctgttcttg 481 agcagcaagc tagcgaggca gacgccgagg agcggggcga agatcagttg accacctcag 541 cgtctacgat gccggcgcgc ctacagagcg cgatgaaaat cgtctacgaa ggcggtccgg 601 acgcggcatt cgagcgaggc ctggctctca tcatcggcgg tcttgaaaaa atgaggctca 661 ctacgaacga cattgaggtg ctgaagaatg ttgacgaatg acagggggcg gcaggtgcgg 721 agggcgcggt tgcttcgtca tatgaagcaa agtcacctag ctgaattaat gggtgtggat 781 caggcaaccg tgtcgcgctg ggagcggggc acccttgcat tgtcggatgg gaggtggtca 841 gcggttcttc aattgcttac cgggccttcc gattcatcgt acgacgctgc gctgaagcgt 901 ctggtgcaat cctccgccca caaagtccat ctggtagcga ccggacacat tgtttgctcg 961 cggcatctcc ggccaggcaa agggaattgc ggattgacct agccgaactc cttggtaaat 1021 cgctgcgtgt ttatgcgtcc ccgagatagt tgcggccgac tctgcgctta atgggctcgg 1081 ttggcatgag gggcggctgg ggtcactcga // LOCUS PTVCOMPL 359 bp ss-RNA VRL 14-AUG-1990 DEFINITION Potato spindle tuber viroid complete sequence. ACCESSION M36163 KEYWORDS . SOURCE Potato spindle tuber viroid RNA. ORGANISM Potato spindle tuber viroid Viridae; Nonclassified viruses. REFERENCE 1 (bases 1 to 359) AUTHORS Gross,H.J. and Riesner,D. TITLE Viroids: A class of subviral pathogens JOURNAL Angew Chem. Int. Ed. Engl. 19, 231-243 (1980) STANDARD simple staff_entry BASE COUNT 73 a 108 c 102 g 76 t ORIGIN 1 cggaactaaa ctcgtggttc ctgtggttca cacctgacct cctgagcaga aaagaaaaaa 61 gaaggcggct cggaggagcg cttcagggat ccccggggaa acctggagcg aactggcaaa 121 aaaggacggg ggggagtgcc cagcggccga caggagtaat tcccgccgaa acagggtttt 181 cacccttcct ttcttcgggt gtccttcctc gcgcccgcag gaccacccct cgcccccttt 241 gcgctgtcgc ttcggctact acccggtgga aacaactgaa gctcccgaga accgcttttt 301 ctctatctta cttgcttcgg ggcgagggtg tttagccctt ggaaccgcag ttggttcct // LOCUS RATCBFB 1415 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat CCAAT binding transcription factor-B subunit (CBF-B) mRNA, complete cds. ACCESSION M34238 KEYWORDS CCAAT binding transcription factor-B subunit; heteromeric mammalian transcription factor; nuclear protein; transcription activator. SOURCE Rat (strain Sprague-Dawley) adult liver, cDNA to mRNA, clone pCBF-B1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1415) AUTHORS Maity,S.N., Vuorio,T. and de Crombrugghe,B. TITLE The B subunit of a heteromeric rat CCAAT-binding transcription factor shows a striking sequence identity with the yeast Hap2 transcription factor JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5378-5382 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.N.Maity, 11-MAY-1990, for release after publication. FEATURES from to/span description pept 171 1196 CCAAT binding transcription factor-B subunit (CBF-B) BASE COUNT 371 a 371 c 417 g 256 t ORIGIN 1 cagcggcagt ggcggcggcg cggcggctgg agcctctgat tgggtttcgg ggtccggtac 61 tggagccaat cagcgcgggc agcgaaccgg gggagcgagg cacggagtcc ctacctggca 121 gctgcctggg atctgtagag tgaagcttca ggacacaacg tggccgggcc atggagcagt 181 atacagcaaa tagcaatagt tccacagagc agatcgtggt gcaggctggc cagattcagc 241 agcagcagca gggtggtgtc actgctgtcc agctgcagac tgaggcccag gtggcatccg 301 cctcaggcca gcaagtccag accctccagg tagttcaggg gcagccatta atggtgcaag 361 tcagtggagg ccagctgatt acatcaactg gccaacccat catggtacag gctgtgcctg 421 gtggacaagg ccaaactatc atgcaagtac ctgtgtctgg aacacaaggt ttacagcaga 481 tacagttggt accccctgga cagatccaga tccagggcgg gcaggctgtg caggtgcaag 541 gccagcaggg acagacccag cagatcatca ttcagcagcc acagaccgcg gtcaccgctg 601 gccagactca gacacaacaa cagattgctg tccagggaca gcaagtggcc cagactgctg 661 aagggcagac tattgtctac cagccagtta atgcagatgg cacaatcctc cagcaaggca 721 tgatcaccat cccagcagcc agtttggcag gggcacagat cgttcagaca ggagccaata 781 ccaacacaac cagtagtgga caagggactg tcactgtgac actgccagtg gcagggaatg 841 tggtcaactc aggaggaatg gtcatgatgg taccaggggc tggctctgtg cctgctatcc 901 aaagaatccc tttacctggg gcagagatgc tggaagaaga gccgttgtat gtgaatgcca 961 aacagtatca ccgcatcctt aagaggagac aagcgagggc caagctagag gcagaaggga 1021 agatcccaaa ggaaagaagg aaatacctcc atgagtctcg gcaccgacac gccatggcac 1081 ggaagcgtgg ggaagggggc cgcttcttct ctccaaaaga aaaggacagt cctcacatgc 1141 aggatccaaa ccaagctgat gaagaagcca tgacacagat catccgagtt tcctaaccac 1201 aggagggtgg cagagcggat gagtcacagt cctctccact gttcctggaa atggactgac 1261 ttccagttgg gcactgacag tctcactctg ccctttctca ggacagaaac tacttagctc 1321 agtattacag ctgcagtgat ggctgacaaa ctgaagaggc aagcttttgt ctctcccttt 1381 cattgaggac ctgttcagac tgttggtgac attga // LOCUS RATRTG2 902 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat thyroglobulin (rTg-2) mRNA, complete cds. ACCESSION M35965 KEYWORDS thyroglobulin. SOURCE Rat (strain Fisher) thyrocyte, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 902) AUTHORS Graves,P.N. and Davies,T.F. TITLE A second thyroglobulin messenger RNA species (rTg-2) in rat thyrocytes JOURNAL Mol. Endocrinol. 4, 155-161 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 1 732 thyroglobulin (rTg-2) mRNA < 1 902 thyroglobulin mRNA BASE COUNT 239 a 216 c 234 g 213 t ORIGIN 1 atgatgacct tggtcttgtg ggtctcgact ttgttgagct cagtctgcct ggtagcagcc 61 aacatctttg agtaccaagt ggatgcacag ccactccgcc cctgtgagct gcaaagggag 121 aaggcctttc tgaagcagga tgaatatgtt ccacagtgct ctgaagatgg aagtttccag 181 acagttcagt gccaaaatga tggccaatct tgctggtgtg tggattctga tggcacggaa 241 gtacctggca gcaggcagct gggaaggcca acagcatgtc tgtccttctg ccagctacac 301 aagcaacgaa tcctgctgag cagctacatt aacagcacag atgctctcta ccttcctcag 361 tgccaggatt cagggaacta tgcaccagtg cagtgcgacc tgcagcaagt gcaatgctgg 421 tgtgtagaca cagaggggat ggaggtgtat ggcacccgcc agcaggggag gccaacacgg 481 tgtccaagga gctgtgagat aagaagccgc cgtctcctcc atggagtagg agacaaatcg 541 ccaccccagt gcgatgcaga tggggagttt atgcctgtcc agtgcaagtt tgtcaacacc 601 acagacatga tgatttttga tctgattcac aactacaaca gcaacaaacc tcacctcaaa 661 tgccctagta catctgttat actgtggcaa gagtcaagag cacctttaac atgtgctgaa 721 gaatcatcct aattggaata tagaagacta tggggaaatc ctgagaggca gatcgtctag 781 tcacctgcaa gaccacctat ggtggccaat ttgacttgtg tgtattgtta tgctgtgtat 841 agctttctgc caaagactgc ctgccctggc ccaggagaga ataaacacac tgttaattta 901 tt // LOCUS SMASPCA 201 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.marcescens spc operon r-protein L24 gene , 3' end, and L5 gene, 5' end. ACCESSION M36265 KEYWORDS r-protein. SOURCE S.marcescens DNA. ORGANISM Serratia marcescens Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 201) AUTHORS Mattheakis,L.C., Vu,L. and Nomura,M. TITLE Nucleotide sequence of the L24/L5 intercistronic region of Proteus vulgaris, Serratia marcescens and Salmonella typhimurium JOURNAL J. Mol. Biol. 204, 326-329 (1988) STANDARD simple staff_entry REFERENCE 2 (sites) AUTHORS Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M. TITLE Translational regulation of the spc operon in Escherichia coli: Identification and structural analysis of the target site for S8 repressor protein JOURNAL J. Mol. Biol. 204, 309-329 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 57 r-protein L24 pept 72 > 201 r-protein L5 BASE COUNT 65 a 40 c 46 g 50 t ORIGIN 1 ttcgaagacg ctaaaaaagt ccgtttcttc aagtctaaca gcgaaactat caagtaattt 61 ggagtaatac gatggcgaaa ctgcatgatt actacaaaga cgaggtagtc aaacaactga 121 tgtctcagtt tgattacaac tctgtcatgc aagtccctcg ggtcgagaag atcaccctga 181 acatgggtgt tggtgaagcg a // LOCUS STYSPCA 201 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.typhimurium spc operon r-protein L24 gene, 3' end, and L5 gene, 5' end. ACCESSION M36266 KEYWORDS r-protein. SOURCE S.typhimurium DNA. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 201) AUTHORS Mattheakis,L.C., Vu,L. and Nomura,M. TITLE Nucleotide sequence of the L24/L5 intercistronic region of Proteus vulgaris, Serratia marcescens and Salmonella typhimurium JOURNAL J. Mol. Biol. 204, 326-329 (1988) STANDARD simple staff_entry REFERENCE 2 (sites) AUTHORS Cerretti,D.P., Mattheakis,L.C., Kearney,K.R., Vu,L. and Nomura,M. TITLE Translational regulation of the spc operon in Escherichia coli: Identification and structural analysis of the target site for S8 repressor protein JOURNAL J. Mol. Biol. 204, 309-329 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 57 r-protein L24 pept 72 > 201 r-protein L5 BASE COUNT 67 a 38 c 46 g 50 t ORIGIN 1 ttcgaagacg gtaaaaaagt ccgtttcttc aagtctaaca gcgaaactat caagtaattt 61 ggagtagtac gatggcgaaa ctgcatgatt actacaaaga cgaagtagtt aacaaactca 121 tgactgagtt taactacaat tctgtcatgc aagtccctcg ggtcgagaag atcaccctga 181 acatgggtgt tggtgaagcg a // LOCUS GCOADDA 1771 bp ds-DNA PLN 14-AUG-1990 DEFINITION G.verrucosa 18S ribosomal RNA gene. ACCESSION M33638 KEYWORDS 18S ribosomal RNA. SOURCE G.verrucosa (isolate Norway) male tetrasporangial DNA. ORGANISM Gracilaria verrucosa Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; Florideophycideae; Gracilariaceae; Gracilariaceae. REFERENCE 1 (bases 1 to 1771) AUTHORS Bird,C.J., Rice,E.L., Murphy,C.A., Liu,Q.Y. and Ragan,M.A. TITLE Nucleotide sequences of the 18S ribosomal RNA gene from the red algae Gracilara tikvahiae McLachlan, Gracilaria verrucosa (Hudson) Papenfuss and Gracilariopsis sp JOURNAL Nucleic Acids Res. 18, 4023-4024 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Ragan, 08-APR-1990. Atlantic Res. Lab., National Research Council of Canada, 1411 Oxford St., Halifax, Nova Scotia, CANADA B3H 3Z1 BASE COUNT 454 a 366 c 498 g 453 t ORIGIN 1 caacctggtt gatcctgcca gtggtatatg cttgtttaaa ggactaagcc atgcaagtgc 61 aagtatgagt gaattgtaca acgaaactgc gaatggctcg gtaaaacagc tataatttct 121 tcggtgctaa atactactcg gatacccgta gtaattctag agctaatacg tgcctccaaa 181 acgacgcaag tcgtggtaca aattagagat acaagccaac ttgttggtga ttctagattt 241 tttttctgat cgcaattatt gcgacgcacc gttcaaattt ctgacctatc aactttggat 301 ggtaaggtat tggcttacca tggttgtgac gggtaacgga ccgtgggtgc gggattccgg 361 agagggagcc tgagagacgg ctaccacatc caaggaaggc agcaggcgcg caacttaccc 421 aatccggaca ccgggaggta gtgacaagaa atatcaatag agggcccgat gggttttcta 481 attggaatga gaacaaggta aacagcttat cgaggagcca gcagagggca agtctggtgc 541 cagcagccgc ggtaattcca gctctgtaag cgtataccaa agttgttgca gttaaaacgc 601 tcgtagtcgg attttggtgt ctgacttggg tcgtcctcgc ggacgctctc aggttgggcg 661 cctttgtgga tgggagctag gtggtgctta attggatcac ctagctgccg ccaccgttta 721 ctgtgaaaaa aatagagtgt tcaaagcagg cgattgccct gaatacatta gcatggaata 781 atagaatagg acccggtcct attttgttgg tttgcttgaa tcgggtaatg attaagaggg 841 acggttgggg gcattcgtat tccgacgtca gaggtgaaat tcttggattg tcggaagacg 901 aacagctgcg aaagcgtctg ccaaggacgt tttcattgat caagaacgaa agtaagggga 961 tcgaagacga tcagataccg tcgtagtctt tactataaac gatgaggact ggagatcgga 1021 taagactgat atatggctta tccggcatcc ttcgagaaat caaagtgttt gctttctggg 1081 gggagtatgg tcgcaaggct gaaacttaaa ggaattgacg gaagggcatc accgggtgtg 1141 gagcctgcgg cttaatttga ctcaacacgg gaaaacttac caggtcagga catagtaagg 1201 attgacagat tgagagctct ttcttgattc tatggttggt ggtgcatggc cgttcttagt 1261 tggtggagtg atctgtctgg ttaattccgt taacgagcga gacctgggcg tgctagctag 1321 gcgccgttac tatttttggt agcgaggctt gccttcctag acggactgtg ggcgtctagc 1381 ccacggaagc tccaggcaat aacaggtctg agatgccctt agatgtcctg ggccgcacgc 1441 gtgctacact gaacgggtca acgagttagg atatgcgaaa gcatttccca atctctaaat 1501 ccgttcgtga tggggatcga cggttgcaat tttccgtcgt caacgaggaa taccttgtaa 1561 gcgcgggtca tcatcccgcg ctgaatacgt ccctgccctt tgtacacacc gcccgtcgct 1621 cctaccgatt gagtggtccg gtgaggcctt gggagagcta gatgaactga ttattcagat 1681 cttttggctt gaacttggtc aaaccttatc acttagagga aggagaagtc gtaacaaggt 1741 ttccgtaggt gaacctgcag aaggatcaga a // LOCUS GRO18SAA 1782 bp ds-DNA PLN 14-AUG-1990 DEFINITION Gracilariopsis sp. 18S ribosomal RNA gene. ACCESSION M33639 KEYWORDS 18S ribosomla RNA. SOURCE Gracilariopsis sp. (isolate England-1) diploid tetrasporophyte DNA. ORGANISM Gracilariopsis sp. Eukaryota; Plantae; Thallobionta; Rhodophycota; Rhodophyceae; Florideophycideae; Gracilariaceae; Gracilariaceae. REFERENCE 1 (bases 1 to 1782) AUTHORS Bird,C.J., Rice,E.L., Murphy,C.A., Liu,Q.Y. and Ragan,M.A. TITLE Nucleotide sequences of the 18S ribosomal RNA gene from the red algae Gracilara tikvahiae McLachlan, Gracilaria verrucosa (Hudson) Papenfuss and Gracilariopsis sp JOURNAL Nucleic Acids Res. 18, 4023-4024 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Ragan, 08-APR-1990. BASE COUNT 443 a 372 c 515 g 452 t ORIGIN 1 caacctggtt gatcctgcca gtggtatatg cttgtctata ggactaagcc atgcaagtgc 61 aagtatgagt gtattataca acgaaactgc gaatggctct gtaaaacagc tataatttct 121 tcggtgctaa atactactcg gataaccgta gtaattctag agctaatacg tgcctacaca 181 gcgacgcaag tcgtggtaca aattagatac acaagccagc ttttttggtg attctagatt 241 ttttttctga tcgcacattg ttgcgacgca ccgttcaaat ttctgaccta tcaactttgg 301 atggtaaggt aatggcttac catggttgtg acgggtaacg gaccgtgggt gcgggactcc 361 ggagagggag cctgagagac ggctaccaca tccaaggaag gcagcaggcg cgcaacttac 421 ccaatccgga caccgggagg tagtgacaag aaatagcaat agagggcccg atgggttttc 481 taattggaat gagaacaaga taaacagctt atcgaggagc cagcagaggg caagtctggt 541 gccagcagcc gcggtaattc cagctctgta agcgtatacc aaagttgttg cagttaaaac 601 gctcgtagtc gaattttggc gtgtgatttg ggtcgttctc gcggacgcgc tcaggttgtg 661 cgcctttgtg gagagggggt gtggtggtgc ttgagtgcgc tgccatgctg ccgccaccgt 721 ttactgtgaa aaaattagag tgttcaaagc aggcgattgc cctgaataca ttagcatgga 781 ataatagaat aggacccggt tctattttgt tggtttgttt gaatcaggta atgattaaca 841 gggacggttg ggggcattcg tattccggcg ctagaggtga aattcttgga ttgtcggaag 901 acgaacagct gcgaaagcgt ctgccaagga cgttttcatt gatcaagaac gaaagtaagg 961 ggatcgaaga cgatcagata ccgtcgtagt ctttactata aacgatgagg actggagatc 1021 ggacaagact gatttatggc ttgttcggca tccttcgaga aatcaaagtg tttgctttct 1081 ggggggagta tggtcgcaag gctgaaactt aaaggaattg acggaagggc atcaccgggt 1141 gtggagcctg cggcttaatt tgactcaaca cgggaaaact taccaggtca ggacatagta 1201 aggattgaca gattgagagc tctttcttga ttctatggtt ggtggtgcat ggccgttctt 1261 agttggtgga gtgatctgtc tggttaattc cgttaacgag cgagacctgg gcgtgctagc 1321 taggcgccgt taccaattga gtttggtagc gaggcggtgc cttcctagac ggactgtggg 1381 cgtctagccc acggaagctc caggcaataa caggtctgag atgcccttag atgtcctggg 1441 ccgcacgcgt gctacactga acggcccaac gagtggagga tgtgcgagag catttccgaa 1501 tctccaaatc cgttcgtgat ggggatcgac ggttgcaatt ttccgtcgtc aacgaggaat 1561 accttgtaag cgcgggtcat catcccgcgc tgaatacgtc cctgcccttt gtacacaccg 1621 cccgtcgctc ctaccgattg agtggtccgg tgaggccttg ggagggctgg atgcgctgtt 1681 tttatgcaga cattttggcc tgaacctggt caaaccttat cacttagagg aaggagaagt 1741 cgtaacaagg tttccgtagg tgaacctgca gaaggatcag aa // LOCUS REO1SIG1 1463 bp ds-RNA VRL 14-AUG-1990 DEFINITION Reovirus type 1 sigma-1 protein (S1) gene, complete cds. ACCESSION M35963 KEYWORDS cell-attachment protein; hemagglutinin; sigma-1 protein. SOURCE Reovirus type 1 (strain Lang) RNA. ORGANISM Reovirus type 1 Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1463) AUTHORS Nibert,M.L., Dermody,T.S. and Fields,B.N. TITLE Structure of the reovirus cell-attachment protein: A model for the domain organization of sigma-1 JOURNAL J. Virol. 64, 2976-2989 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 14 1426 sigma-1 protein BASE COUNT 426 a 291 c 370 g 376 t ORIGIN 1 gctattcgcg cctatggatg catctctcat tacagagata cggaaaatag tactccaact 61 atctgtatca agcaatggct cccagtcaaa agaaatcgag gaaatcaaga aacaagtcca 121 ggtcaacgtt gatgatatca gggctgccaa tattaaactc gacggacttg gaagacagat 181 tgctgacatc agcaatagca tctcaaccat tgagtcaaga ttgggtgaga tggataatcg 241 acttgtgggt atctcgagtc aggtcacgca attatctaac tcagttagcc agaacactca 301 gagcatatcc tcattgggtg acagaatcaa tgctgtcgaa ccacgagttg acagtctgga 361 tacggtcacg tctaatctca ctggacgaac atccactttg gaggcagatg ttggaagctt 421 acggacagaa ctagcagcgc taacaacacg ggtgacaact gaggttacaa ggttagatgg 481 tctaatcaat agtggccaga attcgattgg tgagctatcc acaagactat ccaatgtgga 541 gacgtctatg gtgacgacgg ctggacgggg actgcagaaa aacggaaaca ccttgaacgt 601 cattgtaggt aatggaatgt ggtttaatag ttctaatcaa ttgcagctcg acctttcggg 661 gcaatcaaaa ggggtgggat ttgtcggcac aggaatggtg gttaagattg atactaatta 721 ttttgcttac aatagtaatg gagagattac attggtgagt caaatcaatg aattgccatc 781 gcgcgtatca acactggaat cagcgaaaat cgattcagtt ttacctccat taaccgtacg 841 cgaagcgagc ggcgtacgta ccctgagctt tggttatgat acgagcgatt ttacaatcat 901 caactccgta ctgtcgttac ggtcacgttt gactcttccg acatacaggt accctctgga 961 gctcgacaca gcaaataata gagtgcaggt ggcagatcgt tttggcatgc gcacgggtac 1021 ttggacggga caattgcaat atcagcaccc acaattgagt tggagagcaa atgtcacttt 1081 gaatttgatg aaggtggatg attggttggt gttgagcttt tctcagatga cgactaactc 1141 aataatggca gatgggaaat ttgtgattaa ttttgtgtct gggttatctt ctggatggca 1201 gacgggggat actgaaccat cgtcaactat tgatccatgg tctacgacat ttgccgcggt 1261 ccaatttcta aataacggtc aacgcattga tgcgtttagg atcatgggag tatcggaatg 1321 gacggatgga gaattagaga ttaagaatta tggtggcaca tacaccggtc atactcaagt 1381 atattgggct ccgtggacga tcatgtatcc atgcaatgtg aggtgaatct agcgcgaacc 1441 ctcggcacaa ggggtcaatc atc // LOCUS REO2SIG1 1440 bp ds-RNA VRL 14-AUG-1990 DEFINITION Reovirus type 2 sigma-1-s protein (S1) gene, complete cds. ACCESSION M35964 KEYWORDS cell-attachment protein; hemagglutinin; sigma-1-s protein. SOURCE Reovirus type 2 (strain Jones) RNA. ORGANISM Reovirus type 2 Viridae; ds-RNA nonenveloped viruses; Reoviridae. REFERENCE 1 (bases 1 to 1440) AUTHORS Nibert,M.L., Dermody,T.S. and Fields,B.N. TITLE Structure of the reovirus cell-attachment protein: A model for the domain organization of sigma-1 JOURNAL J. Virol. 64, 2976-2989 (1990) STANDARD simple staff_entry FEATURES from to/span description pept 14 1402 sigma-1-s protein BASE COUNT 384 a 314 c 383 g 359 t ORIGIN 1 gctattcgca ctcatgtcgg atctagtgca gctcataaga agggagatct tactgttaac 61 tgggaatgga gaatcagcca actcgaaaca cgagatcgag gaaattaaga aacaaattaa 121 agacatctct gctgatgtca acaggatcag taacatcgtt gattcaatcc aaggacaact 181 gggtggatta tctgtacgcg tgtcagccat tgaatcggga gttagtgaga acggcaatcg 241 aattgataga ctcgagcgag atgtctccgg catatcggct agcgttagcg gaatcgattc 301 gcgtttatcc gagctgggtg accgagtcaa tgttgcagaa cagcgaattg gccagttgga 361 tacagtcacg gataatctcc ttgagcgagc atcaagactg gaaactgaag tatcagccat 421 tactaatgac cttggatcat tgaatacgag ggtgacgact gaattgaacg atgtccgcca 481 aactattgct gcgatagaca cgcgtctcac gacactggag accgatgccg tgacgtcggt 541 tggtcaaggg cttcagaaga ctgggaactc gattaaggtt attgtgggta cggggatgtg 601 gttcgaccgc aataatgttc tgcagttatt cgtatcgaac cagcagaaag ggttgggatt 661 catagacaat ggaatggtag tgaaaataga tacccagtat ttcagcttcg atagcaatgg 721 caacataact ctgaacaaca acataagtgg tctgccggcg cgaacaggtt ccctcgaggc 781 atctcgtatc gatgtggtag cgccaccgct tgtgatacag tctactggta gcactcggct 841 actgcgtctc atgtacgagg ctgtggactt cgtggttact aacaacgttc tcacactgag 901 aaatcgatcg gtcacgccaa cattcaagtt tcctctggag ttgaatagtg ctgataactc 961 agtgagcatt catagaaatt accgcattag acttgggcaa tggtcaggtc aattggaata 1021 tcacacgccg agtttgcgtt ggaatgctcc cgtcacggtt aatttgatgc gagtagacga 1081 ttggctcatt ttgagtttta ctcggttttc gacgagcggc atcttagcgt caggaaagtt 1141 tgtattgaac ttcgtaactg gtttgtctcc agggtgggcg actgggagta ccgagccctc 1201 gacaactact aacccactgt caacgacgtt tgctgcaatt cagttcatca atgggtcatc 1261 tcgcgtagac gcctttagaa tcttgggagt cgcagagtgg aatgccgggg aactagagat 1321 cacgaattat ggcggaacat atacagcgca taccaatgtc gactgggcgc cgatgaccat 1381 tatgtaccca tgtctgggct gaggatccgg gtgctccact cggcacagtg gcgactcatc // LOCUS MARCMYC 100 bp ds-DNA ROD 14-AUG-1990 DEFINITION Woodchuck mutant c-myc gene, intron A and breakpoint region. ACCESSION M30385 KEYWORDS c-myc oncogene. SOURCE Woodchuck (individual W64, chronically infected with woodchuck hepatitis virus [WHV]) hepatocellularcarcinoma (HCC) DNA. ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 100) AUTHORS Moeroey,T., Marchio,A., Etiemble,J., Trepo,C., Tiollais,P. and Buendia,M.-A. TITLE Rearrangement and enhanced expression of c-myc in hepatocellular carcinoma of hepatitis virus infected woodchucks JOURNAL Nature 324, 276-279 (1986) STANDARD simple staff_review FEATURES from to/span description IVS < 1 > 100 c-myc intron A recomb 48 49 cellular DNA of unknown function end/c-myc DNA start BASE COUNT 24 a 26 c 29 g 21 t ORIGIN 3 bp upstream of BGlII site. 1 ggagatctca ggaagtaagg cactcagaga gacagtgggc gattcgtagg attccccacg 61 gcgattccaa ctcgcccccg ctctttttaa gaagttggcg // LOCUS MARWHV1 120 bp ds-DNA ROD 14-AUG-1990 DEFINITION Woodchuck DNA/woodchuck hepatitis virus DNA, 5' junction. ACCESSION M23563 KEYWORDS c-myc oncogene. SEGMENT 1 of 2 SOURCE Woodchuck (individual W74) hepatocellular carcinoma DNA. ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 120) AUTHORS Hsu,T.-y., Moeroey,T., Etiemble,J., Louise,A., Trepo,C., Tiollais,P. and Buendia,M.-A. TITLE Activation of c-myc by woodchuck hepatitis virus insertion in hepatocellular carcinoma JOURNAL Cell 55, 627-635 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 6 c-myc oncogene recomb 92 95 woodchuck DNA end/virus DNA start BASE COUNT 41 a 23 c 25 g 31 t ORIGIN 1 gcctaagttg aactgttgga gggaggaact ggaatctcct aaattctcac tccttactaa 61 gggaaagcaa ggaaaaaggt tccttctgac tgaaggcata aaaaaccatt atcctaattt // LOCUS MARWHV2 40 bp ds-DNA ROD 14-AUG-1990 DEFINITION Woodchuck DNA/woodchuck hepatitis virus DNA, 3' junction. ACCESSION M23564 KEYWORDS c-myc oncogene. SEGMENT 2 of 2 SOURCE Woodchuck (individual W74) hepatocellular carcinoma DNA. ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 40) AUTHORS Hsu,T.-y., Moeroey,T., Etiemble,J., Louise,A., Trepo,C., Tiollais,P. and Buendia,M.-A. TITLE Activation of c-myc by woodchuck hepatitis virus insertion in hepatocellular carcinoma JOURNAL Cell 55, 627-635 (1988) STANDARD simple staff_entry FEATURES from to/span description recomb 15 16 virus DNA end/woodchuck DNA start BASE COUNT 5 a 14 c 7 g 14 t ORIGIN About 900 bp after segment 1. 1 agggcagcat tgatctacgc ttcctgcttc cctctttctc // LOCUS BSUPAIA 1367 bp ds-DNA BCT 14-AUG-1990 DEFINITION B.subtilis negative regulator of protease synthase and sporulation (pai) genes, complete cds. ACCESSION M36471 KEYWORDS regulatory protein. SOURCE B.subtilis (strain DB104) DNA, clone pPAI121. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1367) AUTHORS Honjo,M., Nakayama,A., Fukazawa,K., Kawamura,K., Ando,K., Hori,M. and Furutani Y, TITLE A novel Bacillus subtilis gene involved in negative control of sporulation and degradative-enzyme production JOURNAL J. Bacteriol. 172, 1783-1790 (1990) STANDARD simple staff_review FEATURES from to/span description pept 100 618 negative regulator pai 1 pept 640 1263 negative regulator pai 2 BASE COUNT 504 a 205 c 283 g 375 t ORIGIN 1 tttttggaat ggtaacaagg tcagtatgtg ttccgtgttt aaatatataa aagggacaca 61 agacaattga attttaaata aaggggctga atccgaacaa tgagtgtaaa aatgaaaaaa 121 tgcagccggg aagatttaca aacacttcaa caattgagta ttgaaacatt caatgacatt 181 tttaaagaac agaactcacc tgaaaatatg aaagcctatt tagaaagcgc atttaacact 241 gagcagctgg aaaaagagtt atctaatatg tcttcgcaat tcttttttat ttactttgat 301 catgaaatcg ctggatatgt aaaggtcaat atcgatgatg ctcagtctga agaaatgggt 361 gctgaatcac ttgaaatcga gagaatttat ataaagaaca gctttcaaaa acatgggctt 421 ggcaaacatc tgctgaataa agcgatagaa attgcgctgg aacgtaataa aaagaacatt 481 tggctaggtg tgtgggaaaa aaatgaaaat gccattgcct tttataagaa aatggggttt 541 gttcagaccg gcgcccactc attttatatg ggtgatgaag aacaaacgga tttaatcatg 601 gctaaaacac tcatataata tttttcgaag gggatgatga tgtatattcc aaaatatttt 661 aaagtcacaa atgctgaaga aatttggaat tttgtacaag aaaactcctt tggtacggtt 721 gtcacaacag aacaaggaaa accaattgcc acacatttgc cgttaggatt caataaaaaa 781 gacgatcatt actatattac ggggcatttc gcttatggaa atcctcagtg gagaacgttt 841 gaagcctgtg aagatgtgct tgtgatgttt cagggaccgc acgcttatat ttcttcttcc 901 tggtattcgc gcgaaaatgt tccgacgtgg aattatcagg ctgtccacat gtatggaaaa 961 gcaagtatgt tagagaaaga tgaattagca gaagaattaa caatcatgtt ggagaaatac 1021 gaaaaacatc gtgacaatcc tgtgttatgg gataagcttt ctccaaaact cttagaaagt 1081 gaattgaaag gtattgttgg atttaagatt aaggtggaag acattcaagc ggcatataaa 1141 ttaagccaga accgaaatga aacggattat atgaacgtca ttgaacagtt gcaaaatgaa 1201 gaaaatccaa atgcaaaaca aatggcagaa ctgatggaag ataaactcaa aaagcaaata 1261 taagcttact ttgctgaaga agtagcagaa aacgccaact caagcggaga accctggcac 1321 ttggcgtttt tttatgatta atatttatcc cattaaaaat ggaattc // LOCUS HUMA2MGL 2041 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human alpha-2-macroglobulin mRNA, 3' end. ACCESSION M36501 KEYWORDS alpha-2-macroglobulin. SOURCE Human liver, cDNA to mRNA, clone pha2ml. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2041) AUTHORS Bell,G.I., Rall,L.B., Sanchez-Pescador,R., Merryweather,J.P., Scott,J., Eddy,R.L. and Shows,T.B. TITLE Human alpha-2-macroglobulin gene is located on chromosome 12 JOURNAL Somat. Cell Mol. Genet. 11, 285-289 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 1932 alpha-2-macroglobulin (AA at 1) /hgml_locus_uid="LH0012N" /nomgen="A2M" /map="12p13.3-p12.3" BASE COUNT 561 a 528 c 481 g 471 t ORIGIN 1 cccgccttcc tagctgtccc agtggagaag gaacaagcgc ctcactgcat ctgtgcaaac 61 gggcggcaaa ctgtgtcctg ggcagtaacc ccaaagtcat taggaaatgt gaatttcact 121 gtgagcgcag aggcactaga gtctcaagag ctgtgtggga ctgaggtgcc ttcagttcct 181 gaacacggaa ggaaagacac agtcatcaag cctctgttgg ttgaacctga aggactagag 241 aaggaaacaa cattcaactc cctactttgt ccatcaggtg gtgaggtttc tgaagaatta 301 tccctgaaac tgccaccaaa tgtggtagaa gaatctgccc gagcttctgt ctcagttttg 361 ggagacatat taggctctgc catgcaaaac acacaaaatc ttctccagat gccctatggc 421 tgtggagagc agaatatggt cctctttgct cctaacatct atgtactgga ttatctaaat 481 gaaacacagc agcttactcc agagatcaag tccaaggcca ttggctatct caacactggt 541 taccagagac agttgaacta caaacactat gatggctcct acagcacctt tggggagcga 601 tatggcagga accagggcaa cacctggctc acagcctttg ttctgaagac ttttgcccaa 661 gctcgagcct acatcttcat cgatgaagca cacattaccc aagccctcat atggctctcc 721 cagaggcaga aggacaatgg ctgtttcagg agctctgggt cactgctcaa caatgccata 781 aagggaggag tagaagatga agtgaccctc tccgcctata tcaccatcgc ccttctggag 841 attcctctca cagtcactca ccctgttgtc cgcaatgccc tgttttgcct ggagtcagcc 901 tggaagacag cacaagaagg ggaccatggc agccatgtat ataccaaaga cctgctggcc 961 tatgcttttg ccctggcagg taaccaggac aagaggaagg aagtactcaa gtcacttaat 1021 gaggaagctg tgaagaaaga caactctgtc cattgggagc gccctcagaa acccaaggca 1081 ccagtggggg atttttacga accccaggct ccctctgctg aggtggagat gacatcctat 1141 gtgctcctcg cttatctcac ggcccagcca gccccaacct cggaggacct gacctctgca 1201 accaacatcg tgaagtggat cacgaagcag cagaatgccc agggcggttt ctcctccacc 1261 caggacacag tggtggctct ccatgctctg tccaaatatg gagcagccac atttaccagg 1321 actgggaagg ctgcacaggt gactatccag tcttcaggga cattttccag caaattccaa 1381 gtggacaaca acaaccgcct gttactgcag caggtctcat tgccagagct gcctggggaa 1441 tacagcatga aagtgacagg agaaggatgt gtctacctcc agacatcctt gaaatacaat 1501 attctcccag aaaaggaaga gttccccttt gctttaggag tgcagactct gcctcaaact 1561 tgtgatgaac ccaaagccca caccagcttc caaatctccc taagtgtcag ttacacaggg 1621 agccgctctg cctccaacat ggcgatcgtt gatgtgaaga tggtctctgg cttcattccc 1681 ctgaagccaa cagtgaaaat gcttgaaaga tctaaccatg tgagccggac agaagtcagc 1741 agcaaccatg tcttgattta ccttgataag gtgtcaaatc agacactgag cttgttcttc 1801 acggttctgc aagatgtccc agtaagagat ctgaaaccag ccatagtgaa agtctatgat 1861 tactacgaga cggatgagtt tgcaattgct gagtacaatg ctccttgcag caaagatctt 1921 ggaaatgctt gaagaccaca aggctgaaaa gtgctttgct ggagtcctgt tctcagagct 1981 ccacagaaga cacgtgtttt tgtatcttta aagacttgat gaataaacac tttttctggt 2041 c // LOCUS HUMPPA1CO 1500 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human alpha-1 collagen type I mRNA, 5' end. ACCESSION M36546 KEYWORDS alpha-1 collagen type I. SOURCE Human, cDNA to mRNA, clone pHUCI. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1500) AUTHORS Tromp,G., Kuivaniemi,H., Stacey,A., Shikata,H., Baldwin,C.T., Jaenisch,R. and Prockop,D.J. TITLE Structure of a full-length cDNA clone for the prepro-alpha1(I) chain of human type I procollagen JOURNAL Biochem. J. 253, 919-922 (1988) STANDARD simple staff_review FEATURES from to/span description pept 84 > 1500 alpha-1 collagen type I /hgml_locus_uid="LG0047H" /nomgen="COL1A1" /map="17q21.3-q22" sigp 84 149 alpha-1 collagen type I signal peptide matp 567 > 1500 alpha-1 collagen type I matp 618 > 1500 alpha-1 collagen type I alpha chain BASE COUNT 257 a 488 c 488 g 267 t ORIGIN 1 ggcacgcgga gtgtgaggcc acgcatgagc ggacgctaac cccctcccca gccacaaaga 61 gtctacatgt ctagggtcta gacatgttca gctttgtgga cctccggctc ctgctcctct 121 tagcggccac cgccctcctg acgcacggcc aagaggaagg ccaagtcgag ggccaagacg 181 aagacatccc accaatcacc tgcgtacaga acggcctcag gtaccatgac cgagacgtgt 241 ggaaacccga gccctgccgg atctgcgtct gcgacaacgg caaggtgttg tgcgatgacg 301 tgatctgtga cgagaccaag aactgccccg gcgccgaagt ccccgagggc gagtgctgtc 361 ccgtctgccc cgacggctca gagtcaccca ccgaccaaga aaccaccggc gtcgagggac 421 ccaagggaga cactggcccc cgaggcccaa ggggacccgc aggcccccct ggccgagatg 481 gcatccctgg acagcctgga cttcccggac cccccggacc ccccggacct cccggacccc 541 ctggcctcgg aggaaacttt gctccccagc tgtcttatgg ctatgatgag aaatcaaccg 601 gaggaatttc cgtgcctggc cccatgggtc cctctggtcc tcgtggtctc cctggccccc 661 ctggtgcacc tggtccccaa ggcttccaag gtccccctgg tgagcctggc gagcctggag 721 cttcaggtcc catgggtccc cgaggtcccc caggtccccc tggaaagaat ggagatgatg 781 gggaagctgg aaaacctggt cgtcctggtg agcgtgggcc tcctgggcct cagggtgctc 841 gaggattgcc cggaacagct ggcctccctg gaatgaaggg acacagaggt ttcagtggtt 901 tggatggtgc caagggagat gctggtcctg ctggtcctaa gggtgagcct ggcagccctg 961 gtgaaaatgg agctcctggt cagatgggcc cccgtggcct gcctggtgag agaggtcgcc 1021 ctggagcccc tggccctgct ggtgctcgtg gaaatgatgg tgctactggt gctgccgggc 1081 cccctggtcc caccggcccc gctggtcctc ctggcttccc tggtgctgtt ggtgctaagg 1141 gtgaagctgg tccccaaggg ccccgaggct ctgaaggtcc ccagggtgtg cgtggtgagc 1201 ctggcccccc tggccctgct ggtgctgctg gccctgctgg aaaccctggt gctgatggac 1261 agcctggtgc taaaggtgcc aatggtgctc ctggtattgc tggtgctcct ggcttccctg 1321 gtgcccgagg cccctctgga ccccagggcc ccggcggccc tcctggtccc aagggtaaca 1381 gcggtgaacc tggtgctcct ggcagcaaag gagacactgg tgctaaggga gagcctggcc 1441 ctgttggtgt tcaaggaccc cctggccctg ctggagagga aggaaagcga ggagctcgag // LOCUS MACTRIIS1 280 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 1. ACCESSION M37561 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 1 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 280) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept 156 + 270 triosephosphate isomerase, exon 1 (E.C. 5.3.1.1) IVS 271 > 280 triosephosphate isomerase intron A signal 46 49 CAAT box signal 93 98 TATA box BASE COUNT 50 a 77 c 111 g 42 t ORIGIN 1 gcaggagggc gggcgggggg cagggtctcc gggggaccgg tcgggccatg gcggaggacg 61 gcgaggaggc ggagttccac ttctcggcgc tctatataag cgggcagtgg cccggctgcg 121 cgcagacact gaccttcagc gtctcgctcc atgccatggc gccttccagg aagttcttcg 181 ttggggggaa ctggaagatg aacgggcgga agcagaatct gggggagctc atcggcactc 241 tgaacgcggc caaggtgccg gcagacaccg gtaagcctcg // LOCUS MACTRIIS2 144 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 2. ACCESSION M37562 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 2 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 144) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 134 triosephosphate isomerase, exon 2 (E.C. 5.3.1.1) IVS < 1 10 triosephosphate isomerase intron A IVS 135 > 144 triosephosphate isomerase intron B BASE COUNT 33 a 38 c 39 g 34 t ORIGIN 1 ttgccctcag aagtggtttg tgctcccccc actgcctata tcgacttcgc ccggcagaag 61 ctagatccca agattgctgt ggctgcacag aactgctaca aagtgactaa tggggccttt 121 actggggaga tcaggtgaga tcga // LOCUS MACTRIIS3 105 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 3. ACCESSION M37563 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 3 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 105) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 95 triosephosphate isomerase, exon 3 (E.C. 5.3.1.1) IVS < 1 10 triosephosphate isomerase intron B IVS 96 > 105 triosephosphate isomerase intron C BASE COUNT 23 a 23 c 35 g 24 t ORIGIN 1 cttcctttag ccctggcatg atcaaagact gcggagccac gtgggtggtc ctggggcact 61 cagagagaag acatgtcttt ggggagtcag atgaggttag tagcc // LOCUS MACTRIIS4 153 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 4. ACCESSION M37564 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 4 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 153) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 143 triosephosphate isomerase, exon 4 (E.C. 5.3.1.1) IVS < 1 10 triosephosphate isomerase intron C IVS 144 > 153 triosephosphate isomerase intron D BASE COUNT 39 a 29 c 49 g 36 t ORIGIN 1 ttctcaacag ctgattgggc agaaagtggc ccatgctctg gctgagggac tcggagtaat 61 cgcctgtatt ggggagaagc tagatgaaag ggaagctggc atcactgaga aggttgtttt 121 cgagcagaca aaggtcatcg caggtatctc ttg // LOCUS MACTRIIS5 102 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 5. ACCESSION M37570 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 5 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 102) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 7 + 92 triosephosphate isomerase, exon 5 (E.C. 5.3.1.1) IVS < 1 6 triosephosphate isomerase intron D IVS 93 > 102 triosephosphate isomerase intron E BASE COUNT 26 a 25 c 31 g 20 t ORIGIN 1 ctgcagataa tgtgaaagac tggagcaagg ttgtcctggc ctatgagcct gtgtgggcca 61 ttggtactgg caagactgca acgccccaac aggtaaccga cg // LOCUS MACTRIIS6 108 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 6. ACCESSION M37571 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 6 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 108) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 98 triosephosphate isomerase, exon 6 (E.C. 5.3.1.1) IVS < 1 10 triosephosphate isomerase intron E IVS 99 > 108 triosephosphate isomerase intron F BASE COUNT 26 a 24 c 32 g 26 t ORIGIN 1 cttgttctag gcccaggaag tacatgagaa gctccgagga tggcttaagt ccaacgtctc 61 tgaagcagtg gctcagagca cccgtatcat ttatggaggt gagtggct // LOCUS MACTRIIS7 632 bp ds-DNA PRI 14-AUG-1990 DEFINITION M.mulatta triosephosphate isomerase gene, exon 7. ACCESSION M37572 M35724 KEYWORDS triosephosphate isomerase. SEGMENT 7 of 7 SOURCE M.mulatta DNA. ORGANISM Macaca mulatta Unclassified. REFERENCE 1 (bases 1 to 632) AUTHORS Old,S.E. and Mohrenweiser,H.W. TITLE Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta JOURNAL Nucleic Acids Res. 16, 9055-9055 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 129 triosephosphate isomerase, exon 7 (E.C. 5.3.1.1) IVS < 1 10 triosephosphate isomerase intron F BASE COUNT 122 a 201 c 154 g 155 t ORIGIN 1 cccttcccag gctctgtgac tggggcaacc tgcaaggagc tggccagcca gcctgacgtg 61 gatggcttcc ttgtgggtgg tgcttccctc aagcccgaat tcgtggacat catcaatgcc 121 aaacaatgag ccccatccac cttccctacc cttcctgcca agccagggac taagcagccc 181 agaagcccag tcactgcccc ttccccgcac atgcttctga tggtgtcgcc tgctccttcc 241 tgtggcctca tccaaactgt accttccttt actatcttca gcctgtaatg gttgggacca 301 ggccaatccc ttctccactt actataatgg ttggaactaa atgtcaccaa ggtggcttct 361 cctcggctga gagatggaag gggtgggatt tgctcctggg ttccctaggc cctagtgagg 421 gcaggagaga agccatcctc tcccttctta cacagtgagg ccgagatcct cccctccgaa 481 ggcaggagtg ccgccctctc ccatggtgcc catgcctgtg tgctgtgtac gtgaaccacc 541 cacatgtgag ggaataaaca cctagcacta ggtcttgtgg tctgtctgcc ttcactggac 601 ttgcccagat aatcttcctt tttgaggcag ct // LOCUS MUS2NF3MK 2470 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger protein (mkr3) mRNA, 3' end. ACCESSION M36514 KEYWORDS mkr3 gene; zinc finger protein. SOURCE Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 15. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2470) AUTHORS Chowdhury,K., Rohdewohld,H. and Gruss,P. TITLE Specific and ubiquitous expression of different Zn finger protein genes in the mouse JOURNAL Nucleic Acids Res. 16, 9995-10011 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 1287 zinc finger protein (mkr3; AA at 1) BASE COUNT 695 a 501 c 488 g 786 t ORIGIN 1 ataaaaactc acactggtga gaagcccttc cggtgtaaag tatgtgcaag gacttttaga 61 aattcctcat gccttaagac taactttcga attcacactg gaataaaacc atacaaatgt 121 aattactgtt ggaaagcctt cactgcacgc tcaggcctta ctaagcatgt actaattcac 181 aatggtgaga agccctatga gtgtaaggag tgtgggaaag ccttcagtac atcttccggc 241 cttgttgaac atataagaat tcatacagga gagaagccct ttgaatgtta ccagtgtggg 301 aaagccttgg tccattcctc atcccttgtt ggacatttaa gaactcacac tggagagaaa 361 ccctttgagt gtaatcagtg tgacaaaact tttacacgat cttcttatct tcgtattcat 421 atgcgaactc acaccggaga gaaaccatat gaatgtaaag agtgtgggaa aactttccct 481 gagcgctcat gccttactaa acacataagg acacacactg gtgaaaggcc ctatgaatgt 541 aaggagtgtg acaaaggctt tattagcttt gctcagctta ctgtacacat aaaaactcat 601 agttctgaga gaccctttca gtgtaaggta tgcacaaaat cttttagaaa ctcttcatcc 661 cttgagaccc acttccgaat tcacactgga gtaaaaccct ataaatgcag ttactgtggg 721 aaagccttca ctgctcgttc aggccctacc atccatttac gcaatcacac tggggagaaa 781 tcctatgcat gccaagagtg tggaaaggcc tttagcactt cctcaggcct tattgcacat 841 ataagaagtc acaaaggaga gaaacccttt gaatgtgacc actgtgggaa ggcctttcct 901 tcttcatctt atcttaatgt gcatttgaaa attcacactg gggaaaaacc ctttcggtgt 961 acggtatgtg ggaaaacatt tacgtgttct tcttaccttc cggttcacat gcgaactcac 1021 actggaggga ggccttttcg gtgtataata tgtggaaggt catttttgtg gtcctcgtac 1081 cttagagttc acatgcgaat tcacactgga gagaaaccct atgtatgtca gtactgtgga 1141 aaagctttta cagagcactc aggccttaat aaacatttac ggaaacacac aggagagaaa 1201 ccatatgaat ataaggaatg tggggaaaac ttcactactt ctgctgatgc taatgaacat 1261 gaaactcccc actggggaga cctttgaatg taagggatca gaaaattgat agccgttgac 1321 catcatttgc aatcaaaatt tatagtctat acaagtctta ttgataagca acgtataaaa 1381 aacattcggt tgctatgtgc tgtaaaaaca actcattctg aggatgctct gagtttaagg 1441 attgtgggag aaatcaaaat ttcatgatat atgtatgaac tgaaggtttg taatataggg 1501 aacgaaaacc tttagtgttt ctatgagtgt tgatgcaaga catggagaaa tgtttctatg 1561 ttcacgtgtg gtctcacagt tcgagaaatg tctttctgaa gtttgggaag taactttaca 1621 tttgaacaca agctacatgc gtgtaagagg ggatttgctt cagtgtcttg gatggcaaca 1681 cttggctttc ttgggtgctt ctctatgaac tttgaatatt tctccatttg tgtattttta 1741 tgacctcaat tccattgtta gttttccttt ttaaaatatt cctgtttgat gtgattgaaa 1801 cttgttttta tctagtatct gttatatgtc ccatgttggc attgtgaacc taatgacaca 1861 aactcccatc tcctcactac ttgttagttt atttgttgtt gtttttttcc cctcttttct 1921 ttggagacag tctcactttc tggcctagta taacatgaaa gttatgtagc ctagactggt 1981 ccctagctca tggcagtcta gacccagcct tctgagtgca tagattacag gtgcaagcca 2041 ccatacctgc tctttctact gttttaagta taaacttcac tagcactgaa aaaaccatca 2101 tgttttacaa ttgtgttcag gttcttaaca caagtaggtt tggatctgac caatacaatc 2161 ccttactctg agttctaagt ctttggtagc cattccattc tctgcttgga gttcagcttt 2221 tctttaggct ttgcttctta gatcatgtag tatctgtatt aactctactt agctttttta 2281 tttttgtgac agggtcttgc tgtagctcaa actgacctca aactcgcaaa cctttttgcc 2341 tcagacttac tgctacagtt ataggcatgt gccttcatgc ctagctggcg tatatcaatc 2401 tacataatgt tctcaaggtg gaactgtgtt atggtcagtt attttaagac atctaattat 2461 cctttggcct // LOCUS MUS2NF4MK 1258 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger protein (mkr4) mRNA, partial cds. ACCESSION M36515 KEYWORDS mkr3 gene; zinc finger protein. SOURCE Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 14. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1258) AUTHORS Chowdhury,K., Rohdewohld,H. and Gruss,P. TITLE Specific and ubiquitous expression of different Zn finger protein genes in the mouse JOURNAL Nucleic Acids Res. 16, 9995-10011 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 1258 zinc finger protein (mkr4; AA at 1) BASE COUNT 356 a 335 c 321 g 246 t ORIGIN 1 aaagtccttt ttccaagtat cttctctgtt caggcaccgg agatccacac tggagaaaag 61 ctctacgact gtagccattg tgggaaaggc ttctcttaca actcagacct caggatacat 121 cagaagatcc atacaggaga gaaacgccat ggctgcgtcg actgtggcaa agccttcaca 181 caaaagtcca cactgaggat gcaccagaag atccacacgg gcgagagggc ctatgtctgt 241 attgagtgtg gacaggcttt tatccagaag acacacctgg tcgcacaccg aagaattcac 301 tcaagagtga ggccctgtgt atctctcgac cgtgcgaaac ccttcagcag cgctcccaac 361 ctccttccgc gtaagaaagt tcaaatgaga gagaaatcgt ccatctgcgc ggagtgtggg 421 aaggccttta cctacaggtc ggagctaatc attcatcaga gaactcacac gggagagaag 481 ccttatcagt gcggggactg tggcaaagcc ttcacccaga agtcagccct caccgtgcac 541 cgaagaatcc acacggggga gaagtcgtat gtgtgtgtga agtgcgggct agccttcgtg 601 caaagggcac acttggatgc acaccaggtg attcacactg gagaaaaacc ttaccagtgt 661 ggtcattgtg ggaaattctt cacttccaag tcgcaactcc acgtgcacaa gcggattcac 721 acgggggaaa aaccctatgt gtgcagtaac tgtgggaagg cctttgccaa caggtcaaat 781 ctcatcacac atcagaaaac tcatacaggg gagaaagcct atgtctgtgc aaggtgtggg 841 aaagcattca ctcagaggtc agaacctgtt acacaccaga ggatacatac cggagagaag 901 ccatatgggt gtagaccctg cgggaaagcc tttacccaga aatcacacct cagtatacac 961 gagaagattc acaccggaga gagacagtac ggatgccgag actgtgggaa agccttcaac 1021 cagaaatcca tactcatcgt gcaccagaaa atccacacgg gggagaaacc gcaggtgtgt 1081 gccgagtgcg ggcgagcttt catccgcaag tccaacttca tcacccatca gcgcatccat 1141 actggcgaga aaccttatgg atgcaccgac tgcggcaagt cctttacctc caagtcgcag 1201 ctcctagtgc accggccaat ccacaccggc gagaaacctt acgtgtgtgc cgagtgtg // LOCUS MUS2NF5MK 2433 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Mouse zinc finger protein (mkr5) mRNA, 3' end. ACCESSION M36516 KEYWORDS mkr3 gene; zinc finger protein. SOURCE Mouse 8.5 day cell line C57BL embryo, cDNA to mRNA, clone 10. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2433) AUTHORS Chowdhury,K., Rohdewohld,H. and Gruss,P. TITLE Specific and ubiquitous expression of different Zn finger protein genes in the mouse JOURNAL Nucleic Acids Res. 16, 9995-10011 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 1845 zinc finger protein (mkr5; AA at 1) BASE COUNT 752 a 516 c 536 g 629 t ORIGIN 1 cagcagtctg tacaggaagc ccaggatctg cttccaaggc aggactcaca tgctgaacgg 61 gtaacaggca gaacttggag cactaaactt gagtgctcca ctttcagaga tcaggattct 121 gagtgtacgt ttgaaaggaa tgagcaagag acagtcactc cgaacagagc cttctcagag 181 gggagagatg gtatgtgtat cgaatctgga agatggtttc atttgaacag ttcagacgag 241 agaagtcata actgtgactc aggtaaaagt ttctcctcaa atccagtagt tgtaaaggaa 301 accggaatct gttcaggaaa aaaacttttc caatgtaatg aatgtaagaa aactttcacc 361 cagagctcct cccttaccgt tcaccagaga attcatactg gagagaagcc ctataaatgc 421 aatcagtgtg ggaaggcctt cagtgacggg tcctccttcg cacggcacca gaggtaccat 481 acaggcaaga agccgtacga gtgcccggag tgtggcaaag ctttcataca gaacacctcg 541 ctcgttcgtc actggaggta ctatcacact ggggagaaac ccttcgattg catcgactgt 601 gggaaagcct tcagtgacca catagggctt aatcaacaca ggaggattca cactggagag 661 aaaccataca cgtgtgaagt gtgtcacaaa tcctttaggt acggctcatc cctcactgtg 721 caccaaagga ttcatactgg agagaaaccg tacgagtgtg agatttgcag aaaagccttc 781 agccaccatg catccctcac tcagcatcaa cgagtgcatt ctggagaaaa gccttttaaa 841 tgtaaagagt gtgggaaagc ttttaggcag aatatacacc ttgctagtca ctggaggatc 901 cataccgggg agaagccctt tgagtgtggg gaatgtggga aatctttcag catcagctca 961 cagcttgcca cgcatcagag aattcataca ggagagaagc cctttgagtg taaggtttgt 1021 aggaaagctt ttaggcagaa tatacacctt gctagtcact ggagaatcca taccggggag 1081 aagccctttg agtgtgggga atgtgggaaa tctttcagca tcagctcaca gcttgccacg 1141 catcagagaa ttcatacagg agagaagcct tatgaatgta aggtttgtag gaaagcattt 1201 acgcagaagg ctcaccttgc acaacatcag aagactcata cgggggagaa gccatacgag 1261 tgcaaggaat gtggcaaggc cttcagccag accactcacc tcattcaaca ccagagggtt 1321 catactggag agaagcccta taaatgcctg gaatgtggga aggcctttgg tgataactca 1381 tcctgtactc aacatcagag acttcacact ggcctgaggc cttatgagtg tgtggagtgt 1441 ggaaagacat tcaagaccaa gtcatccctt atctgttatt gtagatgtca cacgggagag 1501 aaaccttatg agtgcagtgc atgtggtaaa gcctttagcc actggcagtc ccttagtgta 1561 catcagagaa ttcattcggg gaagaaaccg tatgaatgta aggaatgtcg gaaaaccttc 1621 atccagattg gacaccttaa ccaacataag agagtccata ccggggagag aacgtacaac 1681 tacaagaagg gcaggagggc cttcaggcag actgcacact ttgctcacca tcagcagatt 1741 cattccggaa agtcacctgc tcaccactct ttgccttcca catcgaatcc tgtggatctc 1801 ttctccaaat tcgtctggaa tccatcctca ctgccatcat cataacctca tattttcacc 1861 acacttgttt atcaacttgt ctctttgccc ctttgttcat tcttgtccct tatgagtcct 1921 tcatattaca accaggttgg attatttttg gagtgtaaat gtaattaatt tgctcattta 1981 aagcttatat tcactgggga aaaaaaaaga agttagctga aacataaaga aacctcatga 2041 atctaagaaa tgttccatat cttgctctgg gtagtatatc tgtaaatctt agcaagccct 2101 gtatgtagta ttttaagaac tttctgactt tgttatcctt taatttttaa tgctgttata 2161 ttaagaaaaa cacatttata ataaatatgg tattttaaat aaaaagtatt atatcttata 2221 gctatttttc agataaacaa aaacaaattt gatacaaagt aaaatatgta taaggttggt 2281 gttccagaac aaaatctgca ggaatattta cagcatggta taaagcaacg aaaccaggtc 2341 atccaccaac attttgcctt agattttcta atatttcagc atatgagtag cacagtatta 2401 aaacactata ggtagtagca tttcacagaa ttc // LOCUS ORCHRCOM 1579 bp ds-DNA MAM 14-AUG-1990 DEFINITION O.orca highly repetitive element DNA. ACCESSION M36451 KEYWORDS repetitive element. SOURCE O.orca DNA. ORGANISM Orcinus orca Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Odontoceta; Delphinidae; Orcininae. REFERENCE 1 (bases 1 to 1579) AUTHORS Widegren,B., Arnason,U. and Akusjaervi,G. TITLE Characteristics of a conserved 1,579-bp highly repetitive component in the killer whale, Orcinus orca JOURNAL Mol. Biol. Evol. 2, 411-419 (1985) STANDARD simple staff_review FEATURES from to/span description rpt 1 1579 highly repetitive element BASE COUNT 368 a 358 c 333 g 520 t ORIGIN 1 atcttatgat gtttcttttt ttgtgtgact tatttcatgt agaatcatcg tacctgaatc 61 cactcattat gctgctacgg gcctgatgac atagatttca ttgctgagtg atattgcatt 121 gtacgtaagt accacaaagt ctttatccat ttttcgcttt ctgcgatact gaacttgtac 181 cataaacgag gttcttgtaa acagagccgt cacaaacttt gggatggctg tgtctttttc 241 attttaattt caactaagct ataggaccat aagtggaagt gccctaggct ctgttgcttt 301 gttgtttaga tgtttcagga aacaccatac acttctccag agtggctgtt ggcaatttac 361 atcccgccca tcagcataac aaggctccca attctccatg gcctgtcccg cctttctgga 421 ttttacactt ttttcagatg gcccttttga ccgtggggca gtgagacttc atgtagtgca 481 gatttccttt gcaagcttgc ttggttggcc aaaaagttcg tatgcgtttc tttctgaata 541 tattcaggaa aaaacgcata agaccttttt ggccaagtgc atcattgtgg acgttctgcc 601 tgttttccta tgctttacat gcaattccag tctacctcct gaaatcggtg tctggcaatt 661 ttgccccgct ttcaagtcct cttggcagcc ttacttcaat atatttttgg acgatagctg 721 tcatttataa ctctgcaggt ttgtgaatca cagtgcccct gagctccttt cttcaactgg 781 ctttcttgtg agctggctgc aacaccgcag gattgcttga ggccctcgtg tggttccggc 841 agggcacgct gagccttttg ttaattcctc ttcctggtgg gaaatgaggg ttaaatttgc 901 ccgtccagac acctccagct agtctctcat tgattctccc tattcctgtt cattttccgc 961 agaaattgca aactgggcca aacaggaggt taaaggcact gactctccaa gtggggagag 1021 tgttagtaaa gcgtctggaa tcttgcaccc gagtaccagg ggacgaaaac tgagacacat 1081 ttgaacacgt ttcccgatca cacggtggat catactctgg gttccacatg catgttttag 1141 ctgaaggaag aatcccttaa acctggagag ttgagaccca tggaatgggt accatgcaat 1201 atgacttcaa agggtctgta tttgctaact gaaactcact aatcctatca ctgctgcgtt 1261 tatgccactg tacacacgct tgattccctt tcggagacat ataagtccac aggttttaag 1321 attcttacta gtcaggtata ttcttaggag tttaatatgg agtgttgagt ccactcgatc 1381 agcaaggagt agctcttgtc tattaaatat ttggcttatg gaacggtatc tgtgctgatt 1441 tcaatctctg gttttatgca gcacctcaac tcacctttcc ccttaagcaa gcataagttg 1501 ctgttctaca tttgagaccc tgttctgttt tgtaattcag ttcctgtgta gccaagttta 1561 cattccgtgt agtagtgat // LOCUS RICADH2A 3779 bp ds-DNA PLN 14-AUG-1990 DEFINITION Rice alcohol dehydrogenase (adh2) gene, complete cds. ACCESSION M36469 KEYWORDS adh2 gene; alcohol dehydrogenase. SOURCE Rice DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 3779) AUTHORS Xie,Y. and Wu,R. TITLE Molecular analysis of an alcohol dehydrogenase-encoding genomic clone (adh2) from rice JOURNAL Gene 87, 185-191 (1990) STANDARD simple staff_review FEATURES from to/span description pept 639 669 alcohol dehydrogenase (adh2), exon 1 768 904 alcohol dehydrogenase, exon 2 1017 1063 alcohol dehydrogenase, exon 3 1672 1998 alcohol dehydrogenase, exon 4 2114 2195 alcohol dehydrogenase, exon 5 2301 2376 alcohol dehydrogenase, exon 6 2471 2529 alcohol dehydrogenase, exon 7 2628 2723 alcohol dehydrogenase, exon 8 2854 3012 alcohol dehydrogenase, exon 9 3149 3247 alcohol dehydrogenase, exon 10 IVS 670 767 adh2 intron A IVS 905 1016 adh2 intron B IVS 1064 1671 adh2 intron C IVS 1999 2113 adh2 intron D IVS 2196 2300 adh2 intron E IVS 2377 2470 adh2 intron F IVS 2530 2627 adh2 intron G IVS 2724 2853 adh2 intron H IVS 3013 3148 adh2 intron I BASE COUNT 1002 a 799 c 867 g 1111 t ORIGIN 1 taatcattac caacagatat cttagaaaaa aaagctatta ttcctcaaaa ggaaaaaaaa 61 ggctattatt ccttacaaga aaaaagaaga aggcaatgca ttatgctttt ctaaatgtct 121 agattaattc atagttggac agatatgacc gcatatcaac gttaattact gccgatttga 181 ctgttgatta gacatatctt agcactaaac ctcactgaaa attcggagaa tttactattg 241 gagagaagaa aaggcatcca tcctccttgt cttcactctg caatctgcat catcatccgc 301 ggaaaccgga aaaccaccga agctcgaaaa ccggcggagg caagagcagc tagcagcatg 361 cagcagacgc agagccttcc tcttcttcca agcaaaaaat attcagatga cgcctcctcc 421 tctttccccc ctcctcccac cttgcttggt tcctttcagg cgtacttgcc cgcgagactc 481 ctaccactct ctgatcccta tatatacacc ccctcctccc ctctcgtctc ctcaccagaa 541 ttccaagcaa cgaactgcga gtgattcaag aaaaaagaaa acctgagctt tcgatctcta 601 cggagtggtt tcttgttctt tgaaaaagag ggggattaat ggcgacaggg aaggtgatca 661 agtgcaaagg tccgttcttg ctgttcttgg agcggttcgt gagaaaatta gtttgatgtt 721 ttgtggggag gagaagctga ttagatcggg ttttggggtt tttgcagcgg cggtggcatg 781 ggaggccggg gaagcgctgt cgatcgagga ggtggaggtt gcgccgccgc agcgcatgga 841 ggtccgcgtc aagatcctct acaccgccct ctgccacacc gacgtctact tctgggaggc 901 caaggtaatt aatcagcatc tgccatggct gagctttctc accttctcaa gcttgcaaga 961 atgatcagtg actaactaac ttggtgtgtg tgtggtttct gctttgatct tcgcaggggc 1021 aaacacctgt tttccctagg atcttgggcc atgaagctgg agggtatgtg taatctgaac 1081 caaccctgtc caattcttgt tcttcttcta ctacactgtt tagtggttca gtctattctg 1141 ttcttgtatt gttttttttt tgttgaaaat ggtttttggt gtttagtgga atttagtgtg 1201 tacttgtacg tcataggttg tttggtttat ggtattcggt tactaactat agttagtgtg 1261 cacgttgggg agtcctgatt agtcagtggc attttcatta agttcagtaa agaaatctac 1321 caagtgtcca attgtgcatg atattttacc tggtttggat aaatcatatg aaggaatcaa 1381 agaacagcgt cattgatgac atatacaacg acaattggat tcctaggata atatactttt 1441 cttcatgtgc tcctctatgc tagtgcttta ttactttctc tcctttgatg ttttgaggta 1501 ctcaaaggaa aaagacacac tgaatcatta tccatcttat gaactgttta aacttgactt 1561 acttatcact tctctgctca aatggacccc tttttttttt catttccatg gaagatatat 1621 gtgtaatcct aacaatcaaa ttgatctaac aaaacaattt tttgctcaca gcattgtgga 1681 gagtgtggga gagggtgtga ccgaactcgc cgcgggcgac catgtcctcc cggtgttcac 1741 cggcgagtgc aaggagtgtg atcactgcaa atcggaggag agcaacatgt gtgacctcct 1801 caggatcaac gtcgaccgcg gcgtcatgat cggcgacggc aagtcccgat tcaccatcaa 1861 ggggaagccc atcttccact ttgttggcac ttccaccttc agcgagtaca ccgtcatcca 1921 tgtcggctgc ctcgcgaaga tcaacccgga ggcgcctctc gacaaggtct gcattctcag 1981 ctgcggtttc tcgtccaggt aagatgatgg atggcttcat gattatgatg acgtagtagt 2041 tttcacagtg aaatttggaa aggtcttttt ttttttggat gcattaagtt gatgggtttt 2101 gctgttcttg caggtttggt gcgacagtga atgtcgcgaa accgaaaaag ggccagaccg 2161 tagctatttt cggtcttgga gctgtaggtc ttgctgtaat gggcacacgc acaaagatat 2221 attcatttcc atttgatttg cagattgttg agatcttgaa actcaaactg atgaatgttt 2281 gtaaaattac atgattatag gctatggaag gtgccaggct gtctggagca tcgaggatca 2341 ttggtgtgga cctgaaccct gcgaaattcg aacaaggtac aatcttttat catctttgag 2401 tgaaaattac atgtttggta tatgttctgg attgaacact gaactctgca acttgttttc 2461 atctcttcag ctaagaagtt tggctgcact gacttcgtaa accccaagga ccacagcaag 2521 ccagtgcagg tacatcacat tgcctcacta atttctctta ttatcacaca ctgttcatga 2581 catggagtgt acatacttaa acatcaactc tgcaatggtg atctcaggtg ctgattgaga 2641 tgaccaacgg tggactcgac agggccgttg aatgcacggg caacatcaac gccatgatat 2701 cctgcttcga atgtgtccat gatgtcagtc tctttcttac aaacatgaaa catgatagta 2761 acaaattgaa gatgatcatt tcagcaatgc ctccagttct gctgtaaact gtatcctgaa 2821 aactaagagt ggcaatctga aacctgaatc cagggatggg gcgttgcggt gctggtcggc 2881 gtgaaggatg atgtgttcaa gacccacccg atgaacttcc tcaacgagaa gacgctgaag 2941 gggacgttga tcttcggtaa ctacaagccg cgcaccgacc tgcccaacgt cgtcgagctg 3001 tacatgaaga aggtaaaaat aattgcagaa cagaatcaga tttgcaacct gaatgtgcat 3061 tgttcagaaa tctatggtca gtgattccaa tgagccactg attgatcact attttatgct 3121 aatgatttct ggtgtttgat gtctgaagga gctggagctg gagaagttca tcacgagcgt 3181 gccgttctcg gagatcaaca cggcgttcga tctcatgctc aaggggagag tctccgctgc 3241 gtcatgagta tgtatgagta gagatgatga tgatctatct ctctctctct ctctctgcag 3301 tttcagtgca tacaagaaac caaaaatgaa agcaaaaaat ggatgagaag tttgaggttt 3361 tccatttaag tttaataatg gtcgtgagac agggcttgtg tggagtcagc tctgtttact 3421 ccctcctttc atggcttgat cttgtttatt gtcagcattg tggcttagca gattttcacc 3481 agttttaccc tgtaaattag taccattctg atcaatcgta ataaactact agcagtgtga 3541 ggtactattg ctggaactga attgattgtg cagtaaagag gtgaatagca gccagtatga 3601 aggttttaca atttgaacat cgaaacgaaa tccacacgaa attcataaat ttcatcgcat 3661 cttcgtgaag cttaccagat tctatcgatt cgatctagct ccaataggtc gatgggagca 3721 tctattgatt ttttcgacag acgaaaattt tgcaagggac tagctaggga gttgtcgac // LOCUS YSCCTA1A 2785 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae catalase A (CTA1) gene, complete cds. ACCESSION M36510 KEYWORDS catalase A. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2785) AUTHORS Cohen,G., Rapatz,W. and Ruis,H. TITLE Sequence of the Saccharomyces cerevisiae CTA1 gene and amino acid sequence of catalase A derived from it JOURNAL Eur. J. Biochem. 176, 159-163 (1988) STANDARD simple staff_review FEATURES from to/span description pept 820 2367 catalase A (CTA1) BASE COUNT 917 a 529 c 571 g 768 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcttag aaggtgaaga aatagtacta gattcatatg attgttataa taatggtgaa 61 ttgttgcctc aactaatact tttggtccaa atcttgacaa ttctttgcca aattccaggt 121 ttatgcaaac tggacataaa agcaatggaa aggcaagtgg agagaattgt aaagaagtgt 181 ttacaattga tagaaggtgc ccgcgccact acaaactgta gtgccacatg gaaacgttgt 241 attatgaagc gtctagccga ttaccccata aaaaagtgcg tttctatcga aaaaccttcg 301 aaaggaaact cattaacaag ggaagaacta agagatgtta tggctcggag agttttgaaa 361 agcgaaatag attcgctgca agtttgtgaa gaaaccatcg acaagaatta caaggttatt 421 cctgatgaaa agctgctaac taatatttta aagagaaagt tgacagagga agaaaaaagc 481 tctgtcaaac gtccttgcgt gaagaagtga gcggttgttc taaccactat ttaaagccgc 541 aattagtaat gcaaaaagtt ggccggaatt agccgcgcaa gttggtgggg tcccttaatc 601 cgaaaaagga cggctttaac aaatataaac tccgaaaatc cccacagtga cagaattgga 661 gaaacaacca gttttgatat cgccatacat ataaagagat gtagaaagca ttcttcactg 721 taatgtccaa atcgtacatt tgaatttctt gtaggtttat ttaaaaggta agttaaataa 781 atataatagt acttacaaat aaatttggaa ccctagaaga tgtcgaaatt gggacaagaa 841 aaaaatgaag taaattactc tgatgtaaga gaggatagag ttgtgacaaa ctccactggt 901 aatccaatca atgaaccatt tgtcacccaa cgtattgggg aacatggccc tttgcttttg 961 caagattata acttaattga ttctttggct catttcaaca gggaaaatat tcctcaaagg 1021 aatccacatg ctcatggttc tggtgccttc ggctattttg aagtaaccga tgacattact 1081 gatatctgcg ggtctgctat gtttagtaaa attgggaaaa gaacgaaatg tctaacaaga 1141 ttttcgactg tgggtggtga taaaggtagt gccgacacgg ttcgtgatcc aagggggttt 1201 gccaccaaat tctacactga agaaggtaat ttagattggg tctacaataa tacaccggta 1261 ttctttatca gagacccttc caagttccct cactttatcc acacacagaa gagaaaccca 1321 caaaccaacc taagggatgc tgacatgttt tgggatttcc tcaccactcc tgaaaatcag 1381 gtggccattc atcaagtaat gatccttttt tcagaccgtg gtacccctgc caactaccgt 1441 agtatgcatg gttattctgg tcatacctat aaatggtcca ataaaaacgg agattggcat 1501 tatgtgcaag ttcatatcaa aaccgatcaa ggaataaaga atttgaccat agaagaggct 1561 accaaaattg cgggatccaa tccagattac tgccagcagg atttatttga ggctattcag 1621 aatggaaact atccttcctg gacagtttat attcaaacaa tgaccgaacg cgatgccaaa 1681 aaattaccat tttcagtctt tgatttgact aaagtatggc ctcaggggca attcccttta 1741 cggcgtgtgg gtaagattgt tttgaacgag aatccactga acttcttcgc acaggtggaa 1801 caagctgcct tcgcccccag taccacggtt ccttaccaag aagcaagcgc tgatccagta 1861 ttacaggccc gtttgttttc atatgcggat gctcatagat acaggctagg tcctaacttc 1921 catcaaatac ccgtaaactg tccatatgca tctaaatttt tcaatcccgc tatcagagat 1981 ggaccgatga atgttaacgg caacttcggc tcagaaccta catatttggc caacgataaa 2041 tcgtacacgt atatccaaca ggacagaccc attcaacaac accaagaggt atggaatggg 2101 ccagctatcc cttatcattg ggcaacatcc ccaggtgatg tagatttcgt gcaagcaaga 2161 aatctctacc gcgttttggg taaacaacct ggacagcaaa agaacttggc atataacatc 2221 ggcattcatg tagaaggcgc ctgtcctcaa atacagcagc gcgtttatga tatgtttgct 2281 cgtgttgata agggactatc tgaggcaatt aaaaaagtag ctgaggcaaa acatgcttct 2341 gagctttcga gtaactccaa attttgaaac gctcaagtaa caaatgagtg gcgttgtttc 2401 cacgacaatt atttatgata gtgtgtattt ttaacacatt ttatttatta caatttatgt 2461 attttgttat gaattattta tttatacgac taataggtga tgctcatatt ctcgtgttag 2521 aaagttaaaa aaattatcat ttcacacata ggaaagctcg tcgcgccggg gaaaaagctg 2581 aggaatctct attattaggg gtaaagttca acacattcag tatgagataa gtgtgtcttc 2641 aagagagatg cagcactgag tagggaacca agaaacgatg tctgaattac tagatagctt 2701 tgagacagag tttgcgaaat tttataccga cagcaatctg gaagagacaa accttcaaaa 2761 atgtcttgat catactcatg aattc // LOCUS YSCILV1A 2766 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae threonine deaminase (ILV1) gene, complete cds. ACCESSION M36383 KEYWORDS ILV1 gene; threonine deaminase. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2766) AUTHORS Kielland-Brandt,M.C., Holmberg,S., Petersen,J.G.L. and Nilsson-Tillgren,T. TITLE Nucleotide sequence of the gene for threonine deaminase (ILV1) of Saccharomyces cerevisiae JOURNAL Carlsberg Res. Commun. 49, 567-575 (1984) STANDARD simple staff_review FEATURES from to/span description pept 661 756 ORF1 pept 778 2508 threonine deaminase (ILV1) BASE COUNT 797 a 569 c 510 g 890 t ORIGIN 1 gaattcgttc tcattacatg gccgcccacc atgcaccttt acatttcgta tttctttcct 61 ctctgcgcta tatctttggg aaagatttgc aatcaaggca aacacccgct tgtccccttt 121 caattcttgt gttatttcaa aaaccatcct ttcataatat atttgtattt ctttcgccct 181 ggactctacc agttcatatt tagaaccttc cacatatcgt ccaattctgc agcccacatc 241 tttttccacc acgatacggg aaacagaatg ggtccttgga ttctcgctaa acagtctctc 301 tattcccctg ttcaaaaccc tcaagatatt tgtttcccgc aacagctgca attgcaattg 361 atcaatccta tgcgaaaatg ccgagtttat gttattcaag acgcatttta aaaaattcac 421 tagcggctcc ttgaaattca ttatgtctga tgaatatgaa aaccttttcc tgactaccaa 481 gactctttaa ctcttctctc tttattgcat attatctctg ctattttgtg acgttcaatt 541 ttaattgacg cgaaaaagaa aaaataagaa gggcaaaaag aaaaagcgca gcgggtagca 601 aatttggaat cgcataaaaa gaaaaaaaaa atatcaaaga aaaagagtca tctcaaacat 661 atgtctgcag atacttcatt atcagctttg aaaacttttg ttgttgctgc tttgagttct 721 ttcttgtgtg agtgctacaa gccacattta aactaagtca attacacaaa gttagtgatg 781 tcagctactc tactaaagca accattatgt acggttgttc ggcaaggtaa acagtccaaa 841 gtgtctggat tgaacctttt gagactaaag gctcatttgc acagacaaca cctgtcacct 901 tccttgataa aactacactc tgaattgaaa ttggatgagc tgcaaactga taacacccct 961 gattacgtcc gtttagtttt aaggtcctct gtatacgatg ttattaatga atctccaatc 1021 tctcaaggtg taggtttgtc ttcccgtcta aacacgaatg tcatcttgaa aagagaagat 1081 ctattgcctg ttttctcttt caagcttcgt ggtgcctata acatgattgc caagttggac 1141 gattctcaaa gaaaccaggg tgttattgcc tgttcagctg ggaatcatgc ccaaggtgtg 1201 gcctttgctg ctaaacactt gaaaatacct gctactatcg ttatgcctgt ttgtacacca 1261 tctattaagt atcaaaatgt ctcgagatta gggtctcaag tcgtcctata tggtaacgat 1321 tttgacgagg ctaaggctga atgtgccaaa ttggctgaag agcgtggctt gacgaacatt 1381 cctcctttcg atcatcctta tgtcattgcc ggtcaaggta ctgtagctat ggaaatccta 1441 agacaagtac gtaccgctaa taagatcggt gctgtctttg ttcccgtcgg cggtggtggt 1501 ttaattgctg gtattggtgc ttatttgaaa agggttgctc ctcatatcaa aactattggt 1561 gttgaaactt acgatgcggc cactttacat aattccttgc aacgcaacca gagaactcct 1621 ttacctgtgg tgggtacttt tgccgatggt acgtctgtgc gtatgattgg tgaagaaaca 1681 tttagagtcg cccaacaagt ggttgatgaa gttgttcttg ttaacactga cgaaatctgt 1741 gctgcagtaa aggatatttt tgaagatact agaagtattg tagaaccatc tggtgccctt 1801 tcagtagccg gtatgaagaa atacatctct accgtacatc cagaaattga ccacactaaa 1861 aacacctatg ttcccatcct ttctggtgct aacatgaact ttgatagatt aagatttgtt 1921 tccgaacgtg ctgttcttgg tgaaggaaag gaagtcttca tgttagttac tttacccgac 1981 gtccctggtg cgttcaagaa aatgcaaaag atcatccacc caagatctgt cactgaattc 2041 tcttaccgtt acaatgaaca tcgtcatgag tcctctagtg aagtgcccaa ggcttacatt 2101 tacacttctt tcagcgtcgt tgacagagaa aaggaaatca agcaagttat gcaacagttg 2161 aatgctttag gttttgaagc tgtggatatc tccgataacg aattggctaa atctcatggt 2221 agatacttgg ttggtggtgc ttctaaggtt cctaatgaaa gaattatttc atttgaattc 2281 cctgaaagac caggtgcctt gactaggttc cttggaggcc taagcgattc ttggaatctt 2341 actttattcc attatagaaa ccatggtgcc gatatcggta aggttttagc tggtatttcc 2401 gttcctccaa gggaaaactt aaccttccaa aaattcttgg aagatttagg ctacacttat 2461 catgatgaaa ctgataacac tgtttatcaa aaattcttga aatattaaaa ccgacaattt 2521 actttataaa tttacgcaac aacttgttag gtaaaatgaa cgtcgtcata atttaaatac 2581 tagacagcgc gtgtatgtta ctcatattta tactcatatc atgtgaacct ataaaccaat 2641 tcaatacaag atttacagtt ctcttgtttt cttcttccag ttttcccctt ttatttcctg 2701 tagaataaac ttaaacagtg atcattaatt tataattttc tgctttgtaa atacggcggt 2761 aatgga // LOCUS YSPRPS6A 1940 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.pombe ribosomal protein S6 (rps6) gene, complete cds. ACCESSION M36382 KEYWORDS ribosomal protein S6. SOURCE S.pombe DNA. ORGANISM Schizosaccharomyces pombe Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1940) AUTHORS Gross,T., Nischt,R., Gatermann,K., Swida,U. and Kaeufer,N.F. TITLE Primary structure of the ribosomal protein gene S6 from Schizosaccharomyces pombe JOURNAL Curr. Genet. 13, 57-63 (1988) STANDARD simple staff_review FEATURES from to/span description pept 495 1214 ribosomal protein S6 (rps6) precursor sigp 495 614 ribosomal protein S6 signal peptide matp 615 1211 ribosomal protein S6 mRNA 478 1240 rps6 mRNA (alt.) mRNA 478 1260 rps6 mRNA (alt.) BASE COUNT 592 a 371 c 355 g 622 t ORIGIN 1 gatcatctct atacgctagc gttttgttga gtaaataatt ctgttaatga tgtacaaacc 61 attgagttca gctagttatt taaaaaatgc aacgagtata tcttataaaa ttaagtcata 121 caagacatca atttcctcgt aacgatgata aatcacatct atggaaacat agcagactat 181 gactttaata ttaattacct ttctccataa attaaacgaa aattaaagat gagatgaaaa 241 gtgataatta tagtaacttg taagttatat tcaacagtaa aatcctaatt tccttttcaa 301 ttttctgatt cacaatgtat atattagcta cccgcgtgtt tgtttactaa aaatagtgca 361 ttaatcatcc atcgcaaaat atttataacg agagatataa agacttgtaa atctatccca 421 aattgtgact gctcttagag gagaaactta gctagtcaaa aatacaacat tactgccggg 481 aacctcaacg caaaatgaag ttaaacattt cttatccagc aaatggtaca caaaaattga 541 ttgaaatcga cgatgatcgt cgtcttcgag tctttatgga gaagagaatg ggacaagagg 601 ttccaggtga ctctgtaggc ccggagtttg ctggatatgt cttcaaaatt accggtggta 661 acgacaaaca aggatttccc atgtttcaag gcgtcttgct tcctcaccgt gttcgtcttt 721 tgcttcgtgc tggccaccct tgctatcgcc cacgccgtga tggtgagcgc aagcgtaaat 781 ctgtccgtgg ttgcatcgtc ggccaagatt tggctgtcct tgctcttgcc attattaaac 841 aaggtgaaca agatatccct ggtttgaccg atgttaccgt ccccaagcgt cttggtccaa 901 agcgtgcctc taagatccgc cgcttcttta acttgtccaa ggaagatgat gtccgtcaat 961 tcgttattcg tcgtgaggtt gttcccaaaa aggaaggaaa gaagccttat accaaggccc 1021 ctaagattca acgtcttgtt actcctagaa ctcttcaaca caagcgccac cgctttgctc 1081 ttaagcgcag acaagctgaa aagaaccgtg aagaggctgc tgagtttgct caacttatgg 1141 ctaagcgtgt cgctgaggct aagcaaaaga gagaggtcgt caaggctcgc cgtgcctctt 1201 ctttgaaaaa gtaaataggc atttttaagt aactttggtc atctccatat tattattttg 1261 ctggaacttt tcttatgggg tttattgttg ttgaataaac gtagttagta acattctcta 1321 agcatcgttt gtattcactt atgagtagga atgttaactg taattaggat tatcttcacg 1381 aagctgcttt ttgtggtaat ccttcatgct tttaaataat tttaggatct atatagtaga 1441 ggtaattaat gatattctaa tcaatctagc ttcaacacac acatgatcaa tttttatacg 1501 ttttttaatc attcatatac actacgctaa aaccttttac agattgtttt actaaaataa 1561 ttattatctg cgaaacactc cttctatttt cttgaggata tgctaggaga aaatattata 1621 ctataagtat aagagggggt ttgataaaac tatttaatta cctctgctca ctagagtttt 1681 tatcctttat gttcctatct aagtcgccta aatctcaaca atttggcgca gacaaaagtt 1741 gtttgcgttg cgctagtttc tacaaccaca ccactaatac caactagcaa gaaccagaat 1801 ttctgagcta ctttaggtag taaaataaga ggtgaattgt gagccccggt ttgctttttt 1861 agacataagg cgaacacgag ttagtgctgt ctgttgccaa attattttaa ttcggcttta 1921 tcaaaacaat atagcatcga // LOCUS CNAGMDHA 1334 bp ss-mRNA PLN 14-AUG-1990 DEFINITION C.vulgaris glyoxysomal malate dehydrogenase mRNA, complete cds. ACCESSION M33148 KEYWORDS glyoxysomal malate dehydrogenase. SOURCE C.vulgaris (variety Sugar Baby) cotyledon, cDNA to mRNA. ORGANISM Citrullus vulgaris Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Violales; Cucurbitaceae. REFERENCE 1 (bases 1 to 1334) AUTHORS Gietl,C. TITLE Glyoxysomal malate dehydrogenase from watermelon is synthesized with an amino-terminal transit peptide JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5773-5777 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Gietl, 22-MAR-1990. FEATURES from to/span description pept 70 1140 glyoxysomal malate dehydrogenase precursor (EC 1.1.1.37) sigp 70 180 glyoxysomal malate dehydrogenase signal peptide matp 181 1137 glyoxysomal malate dehydrogenase mRNA < 1 1334 glyoxysomal malate dehydrogenase mRNA BASE COUNT 340 a 275 c 340 g 379 t ORIGIN 1 caacgctaag ttcccaaagg tttctgatct tgaagcggtt ggtttgtttt tctgtttgtc 61 aaactaatta tgcagccgat tccggatgtt aaccagcgca ttgctcgaat ctctgcgcat 121 cttcatcctc ccaagtctca gatggaggag agttcagctt tgaggagggc gaattgccgg 181 gctaaaggcg gagctcccgg gttcaaagtc gcaatacttg gcgctgccgg tggcattggc 241 cagccccttg cgatgttgat gaagatgaat cctctggttt ctgttctaca tctatatgat 301 gtagtcaatg cccctggtgt caccgctgat attagccaca tggacacggg tgctgtggtg 361 cgtggattct tggggcagca gcagctggag gctgcgctta ctggcatgga tcttattata 421 gtccctgcag gtgttcctcg aaaaccagga atgacgaggg atgatctgtt caaaataaac 481 gcaggaattg tcaagactct gtgtgaaggg attgcaaagt gttgtccaag agccattgtc 541 aacctgatca gtaatcctgt gaactccacc gtgcccatcg cagctgaagt tttcaagaag 601 gctggaactt atgatccaaa gcgacttctg ggagttacaa tgctcgacgt agtcagagcc 661 aatacctttg tggcagaagt attgggtctt gatcctcggg atgttgatgt tccagttgtt 721 ggcggtcatg ctggtgtaac cattttgccc cttctatctc aggtgaagcc tccaagttct 781 ttcacacaag aagagattag ttacctgact gataggattc aaaatggtgg aacagaagtt 841 gtcgaggcca aagcaggagc tggctcagca actctctcaa tggcttatgc tgccgttaag 901 tttgcagatg catgcctcag gggcttaaga ggagatgctg gtgtcattga atgcgcgttt 961 gtgtcttctc aggtgactga acttccattc tttgcatcaa aagtacgact tggtcgcaat 1021 ggtatcgaag aagtatactc ccttggcccg ctaaatgagt atgagaggat tggattggag 1081 aaagcgaaga aagagttggc aggaagcatt gagaagggag tttccttcat cagaagctga 1141 agagatgcca attaccatta gttttaatag aaacattcca tctcttatag attacttgtg 1201 ctcaatgttt tcctggagat tgaagttgat tgaaatgata ccacaccacg tatttttata 1261 ctaataaaac tatatcgcca tcatgtcgat atttaatgca caaccaaaag ggttggatta 1321 gagtaccttt tatg // LOCUS HAMACAPT 1837 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Hamster N-acetylglucosamine-1-phosphate transferase (GPT) mRNA, partial cds. ACCESSION J05590 M22755 KEYWORDS N-acetylglucosamine-1-phosphate transferase; glycosyl transferase; tunicamycin-resistance. SOURCE Chinese hamster ovary cell line CHO-K1, cDNA to mRNA, clone TRG-[4,22]. ORGANISM Cricetulus longicaudatus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 889 to 960) AUTHORS Lehrman,M.A., Zhu,X. and Khounlo,S. TITLE Amplification and molecular cloning of the hamster tunicamycin- sensitive N-acetylglucosamine-1-phosphate transferase gene: The hamster and yeast enzymes share a common peptide sequence JOURNAL J. Biol. Chem. 263, 19796-19803 (1988) STANDARD full staff_review REFERENCE 2 (bases 1 to 1837) AUTHORS Zhu,X. and Lehrman,M.A. TITLE Cloning, sequence and expression of a cDNA encoding hamster UDP-GlcNAc: dolichol phosphate N-acetylglucosamine-1 phosphate transferase JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT Draft entry and sequence for [1] kindly submitted by M. Lehrman, 04-APR-1989. Draft entry and computer-readable sequence for [1] kindly submitted by M.Lehrman, 12-JUN-1990. FEATURES from to/span description pept 148 1374 N-acetylglucosamine-1-phosphate transferase (GPT) binding 67 79 dolichol binding site binding 222 234 dolichol binding site BASE COUNT 363 a 515 c 418 g 541 t ORIGIN 1 tcaggtcata gttgggtttt gttcctgctg gcgcggaata ctcttgcgtg taggaagcgg 61 ctctaccgac tgccgaggcc tctgggattg tgccgggttg ctagttagct cagttggcgg 121 cggagcggcg gcccctggcg ggtcaccatg tgggccttcc cggagttgcc gctgccgctg 181 ctggtgaatt tgttcggctc gctgctggga tttgtggcta ctgtgaccct catccctgcc 241 ttccgtagcc actttatcgc cgcgcgcctc tgtggccagg acctcaacaa gctcagccgg 301 cagcagatcc cagaatccca gggagtgatc tgcggtgctg ttttccttat catcctcttc 361 tgcttcatcc ctttcccctt cctgaactgc tttgtggagg agcagtgtaa ggcattcccc 421 caccatgaat ttgtggccct gataggtgcc ctccttgcca tctgctgcat gatcttcctg 481 ggcttcgctg atgatgtact caatctgccg tggcgccata agctgctgct gcccacagct 541 gcctctctac ctctcctcat ggtttacttc actaactttg gcaatacaac cattgtggta 601 cccaagccct tccgctggat tcttggcctg catttggact tgggaatcct atactatgtc 661 tacatgggac tgcttgcggt gttctgtacc aatgccatca acatcctagc aggaattaat 721 ggcctagagg ctggtcagtc actagtcatc tctgcttcta tcattgtctt caacctggta 781 gagctggaag gtgattatcg ggatgatcat gtcttttccc tctacttcat gataccattt 841 ttttttacca ccttgggatt gctataccat aactggtacc catcacaggt gtttgtggga 901 gataccttct gttattttgc tggcatgacc tttgccgtgg tgggaatctt gggacacttc 961 agcaagacca tgctactctt ctttattcca caagtgttca atttcctcta ctcgctgcct 1021 cagctccttc acgccatccc ctgccctcga caccgcatac ccagactcaa tccgaagacg 1081 ggcaaactgg agatgagcta ttccaagttc aagaccaaga acctctcttt cttgggcacc 1141 tttattttaa aggtagcaga gcgcctccag ctagtgacag ttcaccgagg cgagagtgag 1201 gatggtgcct tcactgaatg taacaacatg accctcatca acttgctact caaaatcttt 1261 gggcccatac atgagagaaa cctcacactg ctcctgctgc ttttgcagat cctgagcagc 1321 gctgtcacct tctccattcg ataccagctt gtccgactct tctatgatgt ctgagttccc 1381 tgaagactga ctgcccatag cctccgttgg acctcaccca ggaccaacct gtctggtcca 1441 agactgcctt ctggtccagg cctctccaac tgttggtttt ttcagttttg ttttcagctt 1501 ctcccatcac ctgtaatgat tattggcctg gaccttattg gactttaaag tccattagtt 1561 gcactttgcc cctggctttc ttcagcttgc tactcttccc ctttctgtcc catctgcagc 1621 accataaggg ggatgtagca gctcttatgc aaatacagct caactttcag agccctgcat 1681 ttgctggacc ttgagagaga acctgggcta tgtgctagag ttagggccct atactccaag 1741 gtaacctcac atttgacttt aaaattaagt gttctgatta ggaagatcag aggcagggcc 1801 atgtgctcaa aatggtgaca ataaaggatt gtctttt // LOCUS HS4NA3ABC 10089 bp ds-DNA VRL 14-AUG-1990 DEFINITION Epstein-Barr virus nuclear antigen EBNA-3A, EBNA-3B, and EBNA-3C genes, complete cds. ACCESSION M34440 KEYWORDS nuclear antigen. SOURCE Epstein-Barr virus type 2 (isolate AG876) DNA. ORGANISM Epstein-Barr virus Viridae; ds-DNA enveloped viruses; Herpesviridae; Gammaherpesviridae. REFERENCE 1 (bases 1 to 10089) AUTHORS Sample,J., Young,L., Martin,B., Chatman,T., Kieff,E., Rickinson,A. and Kieff,E. TITLE Epstein-barr virus type 1 (EBV-1) and 2 (EBV-2) differ in their EBNA-3A, EBNA-3B and EBNA-3C genes JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.T.Sample, 23-MAY-1990. Kieff Laboratory Harvard Medical School 75 Francis Street Boston, MA 02115 FEATURES from to/span description pept 189 524 nuclear antigen EBNA-3A, exon 1 610 3051 nuclear antigen EBNA-3A, exon 2 pept 3237 3593 nuclear antigen EBNA-3B, exon 1 3672 6155 nuclear antigen EBNA-3B, exon 2 pept 6277 6636 nuclear antigen EBNA-3C, exon 1 6711 9560 nuclear antigen EBNA-3C, exon 2 IVS 525 609 EBNA-3A intron IVS 3594 3671 EBNA-3B intron IVS 6637 6710 EBNA-3C intron BASE COUNT 2495 a 2837 c 2619 g 2138 t ORIGIN 1 gaattccaga atctcaacat tgaaaaaacc aggatcatca cgcgtgagtt ggataaggct 61 ctggatggtg tactgacaca caagcaaggc tgcctccatt gtctcagcac cgatttctag 121 gcagcaccct ctttaatagg tgcaaggggg gtgcggtgtt ggtgagtcac actttcgttg 181 cagacaaaat ggacaaggac aggccgggtc tcccggcccc ggatgacaac atagaagaag 241 taccatctac ctcgggtgtt caggaacggg cgtctgaggg agattgggaa aatgtcctca 301 tagagatatc agatagcagc tcagaagagg aagcagaaga tgcccacctg gagccatccc 361 agaggggtaa aaagagaaaa cgggtcgatg atgatgccgg tggttcagct ccagcacaac 421 acgtgccccc ccctcagctg gatcaccctg gtcgagaagc cattctctac aggtttccgc 481 tagatttaag aaggtttatt caagcaattg gagccgcagc tacggtgagc ttccctatgg 541 cccaggtgtg tgatgtgtgt ttttgcccat cgcacaacaa ggtaagtgat ttgttgccgt 601 tggtttcagc accccgacac gcgagccata gaccagtttt tcggatccca gatttcaaat 661 accgacctgt acgtaatgta tgccatggcc attcgacagg ccattagaga tcgtcggagg 721 aatccagctt ctcgtagaag tcaggtcaaa tggagaatga ccaccctggc cgctggttgg 781 cctatgggtt accaggcata cagtagctgg atgtacagct acaccgatcc ccaggtgact 841 gccacgatca tacatctgca ggcgactctt gggtgcgcaa gtggccgtag gtgtcatgtg 901 accttttctg ccggcacctt taggccgccg cgatgtagtc ccggggatcg ccagtggttg 961 tatgttcaga gccgcgtggg tgaccttgtg cagagttcta atccatgcta cagtattttc 1021 tttgactaca tggccataca taggagcctc acgaaaatct gggatgaagt ggtaacacct 1081 gatcagcgtg ttacatttat ggaattcctg ggctttttgc aacgcacgga gttggtctac 1141 atcaagagct ttgtcagcta tgccttgggc accactagta tcgaaacacc gtggatggat 1201 gagaatccta gcacagagac ggcacaggct tggaatgccg gcttgctccg ggggcgtgcg 1261 tacgggcaag acttgcttag aactgaagga gaacatggcg aaggtgctac ctgtgaaaca 1321 cgggaagaaa gtgaggacac agagagcgat ggggatgatg aagaacttcc tcgtgtagtg 1381 tccagggatg gaactaagca cagacgaccc cctatatttt taagacgcct gcacaggttg 1441 ctgttgatga gagcgggcaa aggaaaggaa cgggccaggg agacactggc gaaggcccct 1501 aggcgcactt atggcacacc taggccgcca gttcagaaac caagaccaga ggtcccgcaa 1561 agctatgaga cagctaccag tcacgggtcg gcgcaagtcc cagaaccccc acccacccac 1621 ccattacatc agcaacacag catggccccg tgtatggtag ctcagaaccc acgtgcaccc 1681 ttaggggacc aactcccagg tgttcctaaa gatggacgag gggcgtgtgc accggtaccc 1741 gccctggccg ggcctattgt ccggccctgg gagtcatccc tgttacagtc tccgggaagg 1801 gcctttgcac ccgttagccc acaacccatg ccagtagaac ccgtccctgt ccctactgtg 1861 gcacttgagc gaccagtttg tcccgcacct cctgagattg ctatgcaggg cccgggggaa 1921 ccttctggca ttaaacgcac acgggagcgt tggaggcccg caccgtggac gccaaaccca 1981 ccccgctctc ccagtcagat gtccgtgcgt gaccgtctgg ctcgtttgcg tgctgaggca 2041 caggcccgtc aggctagtgt tgaggtgcag cccacccagt tgacccaagt atcccctcag 2101 caaccaatgg agaggccgtt ggaaccagag cagcagatgt tccctggttc cccctttagc 2161 caggtcgctg atgttgcccg ggaatctggg gtacctgcaa tgcagcctca gtactttgac 2221 ctccccttaa ctcaacccat tagccagggg gcacccgcgg ccccgttgag ggctagtatg 2281 ggcccggtac ctccggtacc ggcaacacag ccacagtatt ttgacatccc cttaactgaa 2341 cccattaacc agggggcatc cgcggcccat tttctccctc agcaaccaat ggaggggccg 2401 ttggtacccg agcggtggat gttccaaggt gccaccctga gccagagtgt taggccaggg 2461 gtagcgcagt cacaatattt tgacctcccc ttaactcaac ccattaacca tggggcaccc 2521 gcagcccatt tcctccatca gccaccaatg gaggggccgt gggtacccga gcagtggatg 2581 ttccaaggtg ccccccctag ccaaggcact gacgtggtcc aacatcagct ggatgatttg 2641 gggtatccac tccatgatct caaccatccc ggggttcccg tgtctcctgc cgttaaccaa 2701 tatcatttca gccaggctgc ctttgggtta cctattgatg aggatgagag tggcgagagg 2761 tccgatacct ccgagccgta tgaagctctt gatttgtcaa tccatggcag gccctgccct 2821 caggcccccg aatggcctgt tcaaggggag ggtggccagg atgccaccga ggttcttgat 2881 ttgtcaatcc atggcaggcc ccgccctcgg acccccgagt ggcctgttca aggggagagt 2941 ggccagaatg tcacagacca tgaacctaga agggtggtgg tgtcagctat tgttcacatg 3001 tgtcaggatg acgagtttcc ggatctacaa gatcctccag atgaggccta agcaaaggtg 3061 tagaagtgtg tccccctcca ttccacccac tgatatacgc ccgacaataa agttgatgat 3121 attgaattcc acacctactc gtgtttgtga ttttatttca tattccatga gagagacctc 3181 gcatatttgc agagggtcac tgaaacattt tatcttaaaa cagttacacc tgaaaaatga 3241 agaaagcgtg gctcagcaga gcacagcaag ccgatgccgg gggggcatct ggctccgagg 3301 acccaccaga ttatggagat caaggtaatg tgcaacaggt gggatctgat cctatttcac 3361 ctgcgattgg cccctttgaa ctctctgcgg ccagtgagga tgatcctcaa tctgggccag 3421 tggaagagaa tttagatgcc gctgcaagag aggaagagga acctgatgag caggagcaca 3481 atggtggtga tgatcccttg gaagtccata ctcggcagcc tagatttgtg gatgtgaacc 3541 caacgcaggc tccagtgatc caactagtcc atgctgttta tgattccatg ttggtaagag 3601 gcacctagaa catttccaga tgtttagctt ggatattttg gccagtctta atttattgtc 3661 attggtttca gcaatcggac ctccggtctc taggcagttt attccttgag caaaacctga 3721 acatcgaaga gtttatatgg atgtgcatga cagtgcgtca cagatgtcag gccatcagac 3781 aaaaaccatt accgattgat aagcagaggc ggtggaagct cctgtcacct tacagaacct 3841 ggcgtatggg ttaccgtacg caaaccctca atgtaaacag ttttgagaca gggggagata 3901 aagtccaccc actccttgtg actgctacgc taggatgtga agagggcctg cggcatgcaa 3961 taacttacag tgctggcatt gtacagctac cacgaatgtc agaccaaaac caaaagatag 4021 aaacagcctt cttgatggca cgtcgtgcta ggtcactttc ggcagaaaga tatactttgt 4081 tctttgattt agtatcctct ggaaacaccc tgtatgccat atggattggg ctgggcacga 4141 gaaaccgagt tgcatttgtc gagtttgtag gatggctatg taagaaggac cacactcata 4201 tacgtgaatg gttccgccag tgcaccggga gaccctcacc atccaagcca tggatgagag 4261 cgcatcccgt cgccgttcct tatgacgatc cattaacaag tgaggagact gacctggcct 4321 atgcccgtgg actggccatg agtatcgagg ctgctagact gccagatgat ccaataattg 4381 ttgaggatga cgatgaaagt gaggaaattg aagataaatg tgataaggat gaagaggaaa 4441 gtggaacgga agatgttaca agcataccgc aaacactgcc gcacagtcca acagtatacg 4501 gcaggccctc ggtgttttac cgaaagccag atactaaatc aaccaaaaaa tgcagggcca 4561 tagtgactga ccttagtata atcaaggtca ttgaagatga acacagaaag aagaagacag 4621 ccagaacaga gcaaccaaga gccaagcctg attcccctgc ccccacagtg gtccttcggc 4681 gaccacccac gcaaaaggtg actggccctg ccggttcact gagtgtccag gctcagctgg 4741 agccatggca acctttgtcc tggccacatg agacaagagt tatacttcac ggaccaccca 4801 cgcagggtga ccaagcacac ggttccatgc tagaccttct tgaaaaggac gaccagcaca 4861 tggagcagca ggttatggca accctactgc caccagaacc acaccagccc cggtctggga 4921 gaagagcccc ttgtgtctac accgctgacc tagacataga aagtgatgag cccgccacgt 4981 cagagccggt tcttgatcag ctactgcccg ccccaggact tggacctctt gcaattcaac 5041 cattaacgtc ccccaccacg tctcaactcc gcagttcagc accgagccac gcacaaactc 5101 catggccggt tacccaccca agtcagactc caggtggccc aacgacacag tccctggcac 5161 cggaaacaga agccccgcgc cagtggccaa tgccactgcg acctatccct ctgcacccct 5221 tgcggatgca gccaatatca tttaatcctg cagtgagacc cactccccat cagccacctc 5281 aggtggagcc cactttctat cagtccactt gggtgaaacc ccctcaacaa taccagcctc 5341 agatggggca cattccatat cagccccgac caacgggtca ctctactatg ctccggcccc 5401 agtgggcacc caccaccatg cagccaccac caagggcgcc cactcccatg ccgccacctc 5461 aggggccacc caccgctatg cagaggcctc agggggcgcc cactcccatg ccgccacctc 5521 aggggccacc caccgccatg cagaggcctc ggggggcgcc cactcccatg ccgccacctc 5581 aggggccacc caccgccatg cagctgtcac caagggcact taccggccag aaggggccag 5641 caaagcacat tctgcgccag ttgttaacgg ggggcgtcaa gagtgggaga ccatcactta 5701 agtttaaggc tgcccttgag cgtcaagccg ctgcgggctt gcgaccttca ccagggtctg 5761 gaacgggtgc caagattgtg caggcacctg ttttctatcc acccgtccta cagcccatac 5821 aagttatgtg gcaagtgggt tcctcaaagg ccgtggccgc ctcaacggtg acacaggcac 5881 ccacggaata taccggggaa aggaggttag gggggcctat gtctcccact gatattccgc 5941 cgtctaaacg ggtgaagaaa aaggcctatc cagagcgcaa gacgccgcat ggggggccct 6001 cacactcttc caccgttatg tgggagaatg tcagccaggg acaacagcag actctggagt 6061 gcggaggaac tgataaacag gaaaggaaca tgttggggat gggggacatt gcagtttctt 6121 ccccttcctc ttctgaaaca tcgaatgatg agtgatttca accacgtaac aagaactggg 6181 atgaaccctc gggcagtatc agactgcggg gaggggcagt gataagtcat gacaatttta 6241 gatgaggtag acattttgca tattttcaga cccaccatgg aatcatttga aggagagggg 6301 gactctatac agtcacctga caatgcgcgg ggagatgatg tacagaatac tggtgagcat 6361 attcaggacc ccgggccggg gccttcaacc ggcggggctt ctgagggatt ggtgcagaac 6421 gagccggact caagagatca acagtcccgg gggcagagaa ggggtgatga aaacagaggc 6481 tggatgcagc gcatcaggcg aaggcggaga agacgggccg cattgtccgg ccatctttta 6541 gacatggaag acaatgtgcc gccgtggttt cctccacacg atatcacacc atatgtcgca 6601 aggaatatca gggacgctgc ctgccaggct gtcaaggtga gcatgcctct aactgggttc 6661 atgggggcca tctaaggccc acgtgtgacc catgtttcca ttaattttag cactcgcacc 6721 tgcaagcgct atcaaacctg atactcgata gtggtttaga cacacaacac ctcttgtgct 6781 tcgtgatggc agccaggcag cgtcttcagg acattcgacg tggacccttg gttgtagagg 6841 gaggtgttgg ttggcgacat tggcttctga catctcccag ccgatcctgg tccatgggat 6901 atcgcacagc aacactacgc acattaactc ccgtgcctaa cagggttggg gctgacagca 6961 tcatgttaac tgccacgttt ggatgccaaa atggggcact agctataaac accttctccg 7021 ccaccgtgtg gataccaccc cctgctggac caagagagca agaaagatac gctcgggaag 7081 ccgaggtgcg cttccttcgt ggtaaatggc agaggcggtt ccgaagaatc tttgatttga 7141 tagaactgtg tggctctctg caccacgtct ggcaaaacat gctgcagacc gaggagaacc 7201 ttttagattt cgtgcgtttc atgggtgtca tgtccagctg caatagttca tctgtgaatt 7261 actggtttca caagacaatc ggaaacttta agccatatta cccgtggaat gcaccaccta 7321 atgaaaatcc atatcacgca cggagaggca taaaagaaca agtaatccag aaagcatttc 7381 taaaggcaca aagacagggt ttatcaatgt tagcaacggg aggtggaccc agaggtgatg 7441 ctactagtga aacgagcagc gatgaggata ccggtagaca gggttcggat gtggagctag 7501 agtcatcgga cgatgagctg ccatatatcg atcccaacat ggagccagtt cagcagaggc 7561 ccgtcatgtt tgtgagccgt gtgcctgtaa ggaaaccgag gacactgccg tggcctacac 7621 ccaagacgca cccagtgaag cgcacaattg ttaagacctc ctatagatct gatgaggcag 7681 aagaagcaca gagcacccct gaaaggccgg gcccttccaa acaaccatca gagcccgtgg 7741 agcccgccca cacaacccca gcggggaggt caacggtgat tctccacgaa ccacctcgag 7801 agcccgaagc tgtttccttc aagcctccgc caccaccttc ccggaggaga aggggagcgt 7861 gtgttgtata tgacgatgat atcatagagg tgattgatgt tgaaaccacc gaagaggaga 7921 cgacgtcaat gcaaagacag cctccgctcg ggcaacaacc gccccccccc gtgatttcta 7981 ccgggagtgc catgtcttcc agtcacacgg atccatcagt aacacagcca agtaagccac 8041 atcggaaacc tcaagacgga tttcaacgtt caggccgacg tcaaaaacga gccatgcctc 8101 ccccagtgag tccttctgac gctgggcctc cttccaccag gcctcgtgtc atggcgcctc 8161 cttccaccgg gcctcgtgtc atggcgactc cttccaccgg gcctcgtgac atggcgcctc 8221 cttccaccgg gcctcgtgac atggcgcctc cttccaccgg gcctcgtgac atggcgcctc 8281 cttccaccgg gcctcgtgac atggcgccta ccgtcgtaca tatgtttacg agggagcgcc 8341 tgctcaccca gtccaccggc cctgcacctc ggagcttctg ggaaatgcgg gccggccgtg 8401 atgctcccaa aattcaacaa gaaccaagtt cacaacagca gccagccact cagtctacac 8461 cgccttgcca atcatgggtc ccatctgtct atgtcctccc ggcagtggat gctggtaatg 8521 cccagcccct acaaatatca cacttgagct ccatgtcgcc cacgcagccg atatcgcatg 8581 aagaacaacc ccggtatgag gatcctgaca ctcctctgga tttgagttta catccagata 8641 ctgcaactct gccgcccacc caggatttat accctggacg cgaggatctg caggccaccc 8701 aggctccata cccgggatac gaggagccac ggccccccca ggctcctttt gtaggtgact 8761 atggttttgt tcaaattccc tcggctcaat gggagccaca cccctctcag ggcacatacc 8821 agggccacat tgatccccag cttccggctg ctctagattt agggccagag cagccccggt 8881 tcccacaaga tccatatgtg tattccgggg gccaattgtc ttcatgtcca ggttatgcag 8941 gtccctggcc ctcgaggcct caacatccaa gatataggca caccttggca ttgtggcctc 9001 gagaacccag gcatgggcac tctcagggcc catggaaacc ctggtcagca catctcccac 9061 ctcagtggga tggatctgca ggacatggcc aggatcaggt ctcccagttc ccacatctgc 9121 attcggagac aggcccacca cgtcttcaac tttcatcggt gccacaggtg ctgtacccgc 9181 agccactggt ctcatcctct gcaccatcgt ggtcatctcc ccagccccga gcccccatac 9241 gccccattcc aacaagattc ccccctcccc ctatgccgtt acaagatagc atggccgtgg 9301 ggtgcgactc atcaggtaca gcatgcccaa gcatgccctt tgccagtgat tacagtcaag 9361 gtgcatttac cccactggac attaatgccc ccacgccaaa aagtcctcga gtagaagaaa 9421 gttctcacgg acccgcccgg tgttcccaag ctacttctga agcacaggag attcttagtg 9481 acaattctga gatctccgtg ttcccaaaag atgcaaagca gactgactat gatgcatcca 9541 ctgaaagtga gctagattaa ggggatccaa ggtgacccct gttagctatt tgatctttga 9601 ctgacacata aacatggttt aaggaatgaa cactcatggt gtgagactgg aactgtacta 9661 aatttgctga catatgtaca atgagagcca aaaatttgat aaaccttaaa agtcccccca 9721 tctaatgatg tccagttccc ttctcccacc ctgtacaccc cgacccaaag ggactcaatg 9781 gcattcagat ttctagttac cacaggtaga atatcgggcg ttggcccata aaaataagtg 9841 catggatata gctctgcaca ggcttggaaa cacccattcc aggtgtgctt ctttttggtg 9901 aaataaaaac agcgttcttt atatgaaaat gtgtattctc ttgtgttgca gtatgtacag 9961 ttagctttgg tatagttttt gggtacctga aacgtgtgca gggtgggtgt ccaatgtggc 10021 agttttatct ctttgtcccc atactcctgc tcggccgtct tgttaaagtt aaccggcggt 10081 ggaggatcc // LOCUS HUMPAPB 3061 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human prostatic acid phosphatase mRNA, complete cds. ACCESSION M34840 KEYWORDS glycoprotein; prostatic acid phosphatase; secretory protein. SOURCE Human prostate, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1199) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3061) AUTHORS Van Etten,R.L., Davidson,R., Stevis,P.E., MacArthur,H. and Moore,D.L. TITLE Covalent structure, disulfide bonding and identification of reactive surface and active site residues of human prostatic acid phosphatase JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.L.Van Etten, 08-JUN-1990. Chemistry Department Purdue University West Lafayette, IN 47907 FEATURES from to/span description pept 15 1175 prostatic acid phosphatase precursor (EC 3.1.3.2) /hgml_locus_uid="LA0168F" /nomgen="ACPP" /map="3q21-qter" sigp 15 110 prostatic acid phosphatase signal peptide matp 111 1172 prostatic acid phosphatase BASE COUNT 917 a 642 c 681 g 821 t ORIGIN Chromosome 3q21-qter. 1 cggctctcct caacatgaga gctgcacccc tcctcctggc cagggcagca agccttagcc 61 ttggcttctt gtttctgctt tttttctggc tagaccgaag tgtactagcc aaggagttga 121 agtttgtgac tttggtgttt cggcatggag accgaagtcc cattgacacc tttcccactg 181 accccataaa ggaatcctca tggccacaag gatttggcca actcacccag ctgggcatgg 241 agcagcatta tgaacttgga gagtatataa gaaagagata tagaaaattc ttgaatgagt 301 cctataaaca tgaacaggtt tatattcgaa gcacagacgt tgaccggact ttgatgagtg 361 ctatgacaaa cctggcagcc ctgtttcccc cagaaggtgt cagcatctgg aatcctatcc 421 tactctggca gcccatcccg gtgcacacag ttcctctttc tgaagatcag ttgctatacc 481 tgcctttcag gaactgccct cgttttcaag aacttgagag tgagactttg aaatcagagg 541 aattccagaa gaggctgcac ccttataagg attttatagc taccttggga aaactttcag 601 gattacatgg ccaggacctt tttggaattt ggagtaaagt ctacgaccct ttatattgtg 661 agagtgttca caatttcact ttaccctcct gggccactga ggacaccatg actaagttga 721 gagaattgtc agaattgtcc ctcctgtccc tctatggaat tcacaagcag aaagagaaat 781 ctaggctcca agggggtgtc ctggtcaatg aaatcctcaa tcacatgaag agagcaactc 841 agataccaag ctacaaaaaa cttatcatgt attctgcgca tgacactact gtgagtggcc 901 tacagatggc gctagatgtt tacaacggac tccttcctcc ctatgcttct tgccacttga 961 cggaattgta ctttgagaag ggggagtact ttgtggagat gtactatcgg aatgagacgc 1021 agcacgagcc gtatcccctc atgctacctg gctgcagccc tagctgtcct ctggagaggt 1081 ttgctgagct ggttggccct gtgatccctc aagactggtc cacggagtgt atgaccacaa 1141 acagccatca aggtactgag gacagtacag attagtgtgc acagagatct ctgtagaaag 1201 agtagctgcc ctttctcagg gcagatgatg ctttgagaac atactttggc cattaccccc 1261 cagctttgag gaaaatgggc tttggatgat tattttatgt tttagggacc cccaacctca 1321 ggcaattcct acctcttcac ctgaccctgc ccccacttgc cataaaactt agctaagttt 1381 tgttttgttt ttcagcgtta atgtaaaggg gcagcagtgc caaaatataa tcagagataa 1441 agcttaggtc aaagttcata gagttcccat gaactatatg actggccaca caggatcttt 1501 tgtatttaag gattctgaga ttttgcttga gcaggattag ataagtctgt tctttaaatt 1561 tctgaaatgg aacagatttc aaaaaaaatt cccacaatct agggtgggaa caaggaagga 1621 aagatgtgaa taggctgatg gggaaaaaac caatttaccc atcagttcca gccttctctc 1681 aaggagaggc aaagaaagga gatacagtgg agacatctgg aaagttttct ccactggaaa 1741 actgctacta tctgttttta tatttctgtt aaaatatatg aggctacaga actaaaaatt 1801 aaaacctctt tgtgtccctt ggtcctggaa catttatgtt ccttttaaag aaacaaaaat 1861 caaactttac agaaagattt gatgtatgta atacatatag cagctcttga agtatatata 1921 tcatagcaaa taagtcatct gatgagaaca agctatttgg gcacaacaca tcaggaaaga 1981 gagcaccacg tgatggagtt tctccagaag ctccagtgat aagagatgtt gactctaaag 2041 ttgatttaag gccaggcatg gtggtttacg cctataatcc cagcattttg ggactccgag 2101 gtgggcagat cacttgagct caggagctca agatcagcct gggcaacatg gtgaaacctt 2161 gtctctacat aaaatacaaa aacttagatg ggcatggtgc tgtgtgccta tagtccacta 2221 cttgtggggc taaggcagga ggatcacttg agccccggag gtcgaggcta cagtgaccca 2281 agagtgcact actgtactcc agccagggca agagagcgag accctgtctc aataaataaa 2341 taaataaata aataaataaa taaataaaaa caaagttgat taagaaagga agtataggcc 2401 aggcacagtg gctcacacct gtaatccttg cattttggaa ggctgaggca ggaggatcac 2461 tttaggcctg gtgtgttcaa gaccagcctg gtcaacatag tgagacactg tctctaccaa 2521 aaaaaggaag gaagggacac atatcaaact gaaacaaaat tagaaatgta attatgttat 2581 gttctaagtg cctccaagtt caaaacttat tggaatgttg agagtgtggt tacgaaatac 2641 gttaggagga caaaaggaat gtgtaagtct ttaatgccga tatcttcaga aaacctaagc 2701 aaacttacag gtcctgctga aactgcccac tctgcaagaa gaaatcatga tatagctttc 2761 catgtggcag atctacatgt ctagagaaca ctgtgctcta ttaccattat ggataaagat 2821 gagatggttt ctagagatgg tttctactgg ctgccagaat ctagagcaaa gccatccccc 2881 ctcctggttg gtcacagaat gactgacaaa gacatcgatt gatatgcttc tttgtgttat 2941 ttccctccca agtaaatgtt tgtccttggg tccattttct atgcttgtaa ctgtcttcta 3001 gcagtgagcc aaatgtaaaa tagtgaataa agtcattatt aggaagttca aaaaaaaaaa 3061 a // LOCUS LDV3ECPA 1064 bp ss-RNA VRL 14-AUG-1990 DEFINITION Lactate dehydrogenase-elevating virus capsid protein gene, complete cds. ACCESSION M34856 KEYWORDS viral capsid protein. SOURCE Lactate dehydrogenase-elevating virus, cDNA to viral RNA, clones A16, B24, B63, B90, B103, C44, dt4. ORGANISM Lactate dehydrogenase-elevating virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Togaviridae. REFERENCE 1 (bases 1 to 1064) AUTHORS Godeny,E.K., Speicher,D.W. and Brinton,M.A. TITLE Map location of lactate dehydrogenase-elevating virus (LDV) capsid protein (Vp1) gene JOURNAL Virology (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.K.Godeny, 06-JUN-1990. FEATURES from to/span description pept 637 984 capsid protein pept 135 650 unidentified ORF BASE COUNT 267 a 223 c 265 g 309 t ORIGIN 3' terminus region. 1 gtttttcgga accgcagttc cgtccttgtt gagcagcacg gaaaagtatt gctgcaaggc 61 caacccattg aggtgaagac agttgtgctc gacggtgtca aagctgtccg ggctaaaacc 121 gttccagccg aaaaatggga ggcctagaat tttgtgatca aacttcctgg tatcagattt 181 tcattgcctt tagtttgact tataccccaa ttgccatcta tagcttaaaa gtttttcgtg 241 gcactttggc aggtattgta aatattttta tttttattaa ttgttgtgta agttttgtgt 301 atttgatgta tcatcattct gtcaccaaca caattgcttt gtcattgggt gctgtgattg 361 ctcttgtgtg gggcatttac acccttgtga agatagttga ttggttggtg atcaggtgcc 421 ggttgtgctt cctaggtaga agttacatcc tagccccacc cagccacgtg gacacctctg 481 acgggcgtca gagcctaacc acatcgttaa caaccgcctt tgtggttagg aagccaggta 541 gtacccttgt aaacgggcag ttggtcccgg actttcaaag actcgtgctt gggggcaaga 601 aggccgtgtc aaaaggtgcg gttaaccttt tgaaatatgt ctcaaaataa gaagaagggc 661 ggccagaata aaggggcaaa ccagcaactt aatcagctga ttagtgcgtt gcttcgcaat 721 gccggacaaa ataaaggaaa ggggcaaaag aaaaagaagc agcccaagct gcatttcccc 781 atggcaggtc catctgacct gcggcatgtt atgactccca acgaggttca gatgtgtcga 841 tcttcattgg tgaccctttt taatcagggc ggaggtcagt gcacattggt tgactctggt 901 ggaataaatt tcaccgtgag tttcatgttg ccaacgcatg ccacagtccg gttaattaat 961 gcttctgcta attcatctgc ttaggagtta caatgtaagt catgtcagtc agatgcagcg 1021 actcagcctt ttgtaattaa ttgcgatttg gctgggccgg aatt // LOCUS MBIFRHADGB 3903 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.thermoautotrophicum hydroxy-5-deazaflavin-reducing hydrogenase (frhADGB) gene, complete cds. ACCESSION J02914 KEYWORDS 8-hydroxy-5-deazaflavin-reducing hydrogenase. SOURCE M.thermoautotrophicum (strain delta-H) DNA. ORGANISM Methanobacterium thermoautotrophicum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 3903) AUTHORS Alex,L.A., Reevell,J.N., Orme-Johnson,W.H. and Walsh,C.T. TITLE Cloning, sequence determination, and expression of the genes encoding the subunits of the Ni containing 8-hydroxy-5-deazaflavin reducing hydrogenase from Methanobacterium thermoautotrophicum delta-H JOURNAL Biochemistry (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.A.Alex, 25-MAY-1990. FEATURES from to/span description pept 298 1515 F420 hydrogenase alpha-subunit (frhA) pept 1524 2000 F420 hydrogenase delta-subunit (frhD) pept 2114 2824 F420 hydrogenase gamma-subuint (frhG) pept 2837 3682 F420 hydrogenase beta-subunit (frhB) binding 284 293 ribosomal binding site (put.) binding 1514 1520 ribosomal binding site (put.) binding 2100 2106 ribosomal binding site (put.) binding 2823 2832 ribosomal binding site (put.) BASE COUNT 1201 a 878 c 965 g 859 t ORIGIN 1 ctgcagcaac agctccaatt atcaggacta tcgccaccga tgctagaacc tttgagttta 61 tcatcaactc accttcctgg tgttaacaac acatttagta tgtgggtcaa gatatataac 121 actcagtatt actgaacaaa aaattaatat tataaactct gttttctact aataataagt 181 gatctataaa atgatataaa tagttttcac tttttttata attgaaaaac ttataacatt 241 taaaaatagt tataatattg aaggtttttt gaactttaaa aacaagaggt gatacatttg 301 agcgaaagga ttgttatatc gccgacatca cgacaagaag gacatgcaga acttgtcatg 361 gaagtcgatg atgagggaat cgtgacaaag gggcgatact tcagtattac tcctgtcagg 421 ggccttgaga aaatagtgac aggtaaagca cctgaaacag cacctgtcat cgtccagagg 481 atatgtggag tatgccctat accacacacc ctggcttccg ttgaggcaat agacgactcc 541 cttgacattg aggttccaaa ggccggaaga cttctccgtg aacttacact tgcagcacac 601 cacgtaaaca gccacgcaat acatcacttc ctcatagcac cggacttcgt gcctgaaaac 661 ctcatggcag acgccataaa ctcagtctcc gagataagga aaaacgcaca gtacgtagtt 721 gacatggttg caggtgaggg tatacaccca tccgatgtca ggataggtgg aatggccgac 781 aacataacgg aactcgcaag gaaaaggctg tatgcgaggt taaagcagct caaaccaaaa 841 gtcgacgaac acgttgaact catgatagga ttaatcgagg ataagggcct tccaaagggc 901 cttggagtcc acaaccagcc aacacttgca agccaccaga tctacggtga caggacaaaa 961 ttcgatctgg acaggttcac cgaggtaatg cctgagagct ggtacgatga ccctgaaata 1021 gccaagaggg cctgctcaac aataccactc tacgatggaa gaaacgtgga ggtcggtcca 1081 agagcaagga tggttgaatt ccagggattc aaggagaggg gtgttgttgc acagcacgtt 1141 gcaagggcac ttgagatgaa aacggcactc gcaagggcaa ttgaaatcct tgatgaactg 1201 gacacatccg caccggtaag ggcagacttc gatgagagag gcacaggtaa actgggtgta 1261 ggtgcaattg aaggtccaag gggactggac gttcacatgg cccaggttga aaatggtaag 1321 atacagttct acagtgcact cgtcccaaca acctggaaca tcccaacaat gggtcctgca 1381 actgaaggat tccaccatga atacgggcca cacgtcatac gcgcttacga cccctgtctc 1441 tcatgcgcaa cacacgtgat ggttgttgat gatgaggaca gatcagtaat cagggatgaa 1501 atggtcagac tctagggaat aacatgccat acgacgctga gattctagtg gtgggctgtg 1561 gaaacatcct tttcaaggac gatggattcg gcccagaggt tatcaaggcc cttgaagagt 1621 acttcaagga tagggaaaaa ccagataatg ttatgttcat tgatgctggg actggcggtc 1681 cgcactttgt cttctccctc ccccatgagg agtggaagaa gatgatagtt gtggatgttg 1741 ttgaattcaa cgcagaaccc ggaacactca ggaaattcga tgtcactgag attccaaagg 1801 gatcctatga aaatatgcat acatggccag tgagccagcc cctccatgaa ctcagtgaaa 1861 agatcgatgt tgtggtaata gggtgtaaac ccaaggagat atcagctccc aatgtggaaa 1921 tgggcctcac acccccagta aaaaaggcta ttcccagagc cattcagatg atcttagatg 1981 agattggggt ttctaaatga gcttaattgc ccgcatcaaa agatttttag gattggaggc 2041 tgaagctaag agggaagaac ccgaaaaaga aaaatcggaa cctgttggag cttcaaaaga 2101 ggaggttgaa aaagtggctg aagaaaatgc aaaaccaaga ataggttaca ttcacctcag 2161 tggatgtacc ggagatgcca tgtcgttaac tgaaaattac gacattctag cagaattact 2221 caccaacatg gtggacatag tatacggaca gaccctggtg gatctctggg agatgccaga 2281 gatggatctg gcccttgttg agggatctgt ctgtctgcag gacgaacaca gcctgcacga 2341 actcaaagaa ctgagggaga aggcaaaact cgtctgcgcc ttcggttcat gcgcacagac 2401 aggctgcttc acaaggtact caaggggcgg acagcaggca cagccatcac acgagtcctt 2461 tgtaccaata gcagacctca tagatgtgga cctcgccatt ccagggtgcc caccatcacc 2521 tgagataata gcaaaggcag tcgttgcact cctcaacaat gacatggagt acctccagcc 2581 aatgctggac cttgcaggct acacagaggc atgcggatgc gacctccaga caaaggttgt 2641 aaaccagggt ctctgcactg gatgtggaac atgtgcaatg gcctgccaga caagggccct 2701 tgacatgacc aacggaagac ccgaactcaa cagcgaccgc tgtataaaat gtggaatctg 2761 ctatgtgcag tgcccaagaa gctggtggcc agaagaacag atcaaaaagg agttagggct 2821 ataggaggct ggaaaaatgg ttttaggtac ttacaaggaa atagtttccg ccagatcaac 2881 tgacagagag attcagaaat tagcccagga tggaggaata gttacaggtc ttctagcata 2941 tgcccttgac gagggcataa tcgaaggcgc agttgttgca ggacccggaa aggagttctg 3001 gaagccagaa ccaatggtcg ccatgacctc agatgaactc aaggcggctg caggtaccaa 3061 gtacacattc tcaccaaacg tcctgatgct caagaaggca gtgaggcagt acggtataga 3121 gaaactgggt acagttgcaa taccctgcca gaccatgggt ataaggaagg cccagaccta 3181 cccattcggt gtcaggtttg ttgcagataa gataaaactc ctggtcggta tctactgcat 3241 ggagaacttc ccatacacat cactgcagac cttcatctgt gaaaaactcg gattaaacat 3301 ggagctcgtt gagaagatgg acataggtaa aggaaaattc tgggtctaca cccaggacga 3361 tgtctacaca ctccccctca aggagaccca tggatacgaa caggcaggat gcaagatctg 3421 caaggactat gtggctgagc tggcagacgt atcaacaggt tcagtgggat caccggatgg 3481 ctggtcaaca gtcataacaa ggaccgattc aggggactca atattcaagc aggctgttga 3541 ggcaggtata ttcgagacca aaccaataga ggaagtcaaa cctggcctgg gacttcttga 3601 aaaactctct gcacagaaga aggaaaaggc agagaagaac atcgctgcaa ggaaagagat 3661 gggattacca acaccatact gatcccatcc ctaaattttc tatttttcag aacattcagt 3721 ccattcaact gcatcctcaa gttaactatt ctgttataac cgtgcagccg tcctcctcaa 3781 caatgactgt atgttcccac tgggcgacca ttgcaccgct cttttccctc agcacatggt 3841 aggggtatat ggcccttgac tgtatcagca ttctcatgga ggcattcagt ctctttgcat 3901 cga // LOCUS MUSLRCNA 1751 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse loricrin gene, complete cds. ACCESSION M34398 KEYWORDS cell envelope protein; loricrin. SOURCE Mouse (strain BALB/c) new born epidermal keratinocyte, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1751) AUTHORS Mehrel,T., Hohl,D., Rothnagel,J.A., Longley,M.A., Bundman,D., Cheng,C., Lichti,U., Bisher,M.E., Steven,A.C., Steinart,P.M., Yuspa,S.H. and Roop,D.R. TITLE Identification of a major keratinocyte cell envelope protein, loricrin JOURNAL Cell 61, 1103-1112 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Rothnagel, 22-MAY-1990. FEATURES from to/span description pept 71 1516 loricrin signal 1740 1745 polyA signal BASE COUNT 223 a 493 c 706 g 329 t ORIGIN 1 acatcagcat cacctccttc cctcactcat cttccctggt gcttcagggt tccccttctc 61 cttaaacaag atgtctcacc agaaaaagca gcccactccc tgccctcctg tgggttgtgg 121 aaagacctct ggtggaggag gaggcggcgg cggctattat agcggtggcg gctctggctg 181 cggaggcggc tcatctggag gaggctctag ctgtggaggc ggaggcggtg gttcctatgg 241 aggtggttcc agctgcggcg gtggaggcgg ctccggtggg ggcgtcaagt actccggagg 301 cggcggtggc tctagctgcg gcggcggcta ctccggaggc ggtggtggct ctagctgcgg 361 cggtggctac tctgggggcg gcggcggctc cagctgcgga ggtggctact ccggaggcgg 421 cggcggctcc agctgcggcg gcggcagcta ctccgggggt ggctccagct gtggaggcgg 481 tggcggctct ggtgggggcg tcaagtactc cggaggtggt ggcggcggcg gctctagctg 541 cggcggcggc tcctccgggg gcggcggcgg cggctccagc tgcggaggcg gatcaggagg 601 cggcggctcc tactgcggag gctcctctgg aggcggcagc tccggtggct gcggcggcgg 661 ttccggaggc ggcaagtact ctggtggcgg cggtggctcc agctgcggag gcggctattc 721 cggcggcggt ggaagcagcg gcggctctag ctgtggcggc ggctactcag gtggcggtgg 781 atccagctgc ggcggcggcg gcggctattc cggtggcggc ggcacgagct gcggaggtgg 841 ttcctccggt ggcggcggcg gcggatcgtc ccaacagtat cagtgccaga gctacggagg 901 cggttctagc ggtggctcca gctgcggcgg cggctactcc gggggcggag gctccagctg 961 cggtggcggc tactccgggg gcggaggctc tagctgcgga ggcggctcct ctggtggtgg 1021 ctccagttgc ggcggcagcg gcggcggcgg ctattccggt ggtggcggtg gcagctgcgg 1081 cggcggctcc tctggcggcg gagggggcta ttactcctct cagcagacca gtcagacctc 1141 ctgcgccccc cagcagagct acggaggggg ctcttccgga ggaggtggta gctgtggagg 1201 tggctcctct ggcggcggtg gcggcggtgg ctgctactcc agcggtggtg gcggcagcag 1261 cggtggctgc ggtggaggct actccggagg cggcggtggc tgtggcggcg gctcttccgg 1321 gggcagcggc ggtggctgcg gaggtggctc ttccggaggc agcggcggtg gctgcggagg 1381 aggctactcc ggaggcggag gcggtggctc cagctgcgga ggcggctcct ctggtggcgg 1441 ctctggaggt ggcaagggtg tgccagtctg ccaccagacc cagcagaagc aggcgcctac 1501 ctggccgtgc aagtaaggtc accgggttgc aacggagaca acagagctgg aagagttctc 1561 cgtgggcgcc gatgggctta actttctcat gaatttgcct gaggtttcca aacccttcac 1621 attttaagcg ccccttcccc cagaagaagc cattgagtcg ctcaaggtgt atcctgttct 1681 gcagattttt catcttggtt tctgaatgac tacctcccaa ttctagtgtc tcctcagtca 1741 ataaatttgc t // LOCUS PFAGIPA 1906 bp ds-DNA INV 14-AUG-1990 DEFINITION P.falciparum glucosephosphate isomerase (Pfggi.R1) gene, complete cds. ACCESSION J05544 KEYWORDS glucosephosphate isomerase. SOURCE P.falciparum (strain 7G8) DNA. ORGANISM Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae. REFERENCE 1 (bases 1 to 1906) AUTHORS Kaslow,D.C. and Hill,S. TITLE Cloning metabolic pathway genes by complementation in Escherichia coli: Isolation and expression of Plasmodium falciparum glucosephosphate isomerase JOURNAL J. Biol. Chem. 265, 12337-12341 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.C.Kaslow, 04-JUN-1990. FEATURES from to/span description pept 28 1803 glucosephosphate isomerase (EC 5.3.1.9) BASE COUNT 782 a 174 c 289 g 661 t ORIGIN Chromosome 14. 1 tttttttttt tttattaata tttttctatg aatatggaga ttacaaattt gaagagctat 61 aaggaacttg tcaccttaag cgctgaagaa aaaacaaagg atctaaagga ctatttaaat 121 gataagaata gatctgaatc gttaataaag aagtttaaga atttttatat ggatttatct 181 cgtcaaaggt atagcgaaaa gactttaaac aaattagttg aatatgctga agaggtagaa 241 ttaaagaaga aggttgaaaa aacatttatg ggagaaaaag ttaatatgac agagaatcgc 301 agtgttttac atacagcttt aagaataccc atagagaaaa taaatacaca taaaataata 361 attgataata agaatgtgtt agaagacgta catggagtat taaaaaaaat agagaaatat 421 tcagatgaca ttagaaatgg ggtaataaaa acttgtaaaa atacaaaatt taaaaatgtg 481 atttgtattg gtattggagg ttcatattta ggtactgaat ttgtttatga agctatgaaa 541 tactattatt ataacatgga attaaataag aatgagaaag atcaagtaaa taattttaat 601 aataattatg atcaagataa tgtatttaat gtaaggtttt tagctaatgt agatccaaat 661 gatgtaaata gagctattca aaatttggat cagtatgata ctttagttat tataatatct 721 aaaactttta ctactgctga aactatgtta aatgctaggt caataaaaaa atggttaagt 781 ttaaaaataa aagatgatga aaatttaagt aaacatatgg tagctgtaag tactaatttg 841 aaattaactg atgaatttgg aatatcaaga gataatgtat ttgaattttg ggattgggta 901 ggaggtcgat tttctgtaac tagttctgtt ggtatattac ccttatcaat tgcttttgga 961 tataaaaata tgagaaattt tttaaatggt tgtcatgata tggatgaaca ttttttacat 1021 gcagatttaa aagaaaacat tcctgtctta ttagcattaa ctagttttta taatagtcat 1081 ttttttgatt ataaaaatgt tgctatatta ccatatttcc aaaatttatt aaaattttca 1141 gcacatattc aacaattatc tatggaaagt aatggaaaat cagtagatag aaataatcaa 1201 ccaatccatt ataatacttg tcaagtatat tttggtgaac ctggtacaaa tggtcagcat 1261 agtttttatc aattaataca tcaaggacaa gttatacctg ttgaattaat tggatttaaa 1321 cattctcatt tcccaattaa atttgataaa gaagtagtaa gtaatcatga tgaattaatg 1381 actaactttt ttgcacaagc tgatgcatta gctattggga aaacatacga acaagtaaaa 1441 gaagaaaatg aaaaaaataa aatgtctcca gaattattaa ctcataaagt ttttaacggt 1501 aatagaccat caaccttatt attatttgat gaattaaatt tctatacatg tggtttatta 1561 ttatccttat atgaaagtag aattgtcgct gaaggatttc tattaaatat taacagcttt 1621 gatcaatggg gggtagagtt gggtaaggtt ctagcaaaag aagtaagaaa ttattttaat 1681 gacacaagaa atcaaaagaa atcagataat acctataatt ttaatgaatc tacaaaaaat 1741 tttattaaat tattacttgt ccaaataaaa aaaaaaaaaa aaattaacac aaatttaaag 1801 taatatatgt aaatatatat atataaatat ataaatatat atatatatat atatatactt 1861 atatatttat gtgtgaaagt caaatatggg tttcttttct tctttt // LOCUS RATGADA 3198 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat glutamic acid decarboxylase (GAD) mRNA, complete cds. ACCESSION M34445 KEYWORDS glutamic acid decarboxylase. SOURCE Rat brain and hypothalamus, cDNA to mRNA, (libraries of D.Chikaraishi and S.Heinemann) clones RB10.7, ZAPG, pCR24RI and pCR22HP. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3198) AUTHORS Wyborski,R.J., Bond,R.W. and Gottlieb,D.I. TITLE Characterization of a cDNA coding for rat glutamic acid decarboxylase JOURNAL Mol. Brain Res. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.I.Gottlieb, 23-MAY-1990. FEATURES from to/span description pept 184 1965 glutamic acid decarboxylase mRNA < 1 3198 glutamic acid decarboxylase mRNA signal 3174 3179 polyA signal BASE COUNT 869 a 742 c 767 g 820 t ORIGIN 1 gaattccgaa gcagccccgg ggtgacaccc agcacgtacg tctgtggcag agcaaagcca 61 agcgggggac gttcgcggag gagtcgcggg agggtccagc tccctgtggc tgaatcgagc 121 ccgttcctgc gcccagaccg cgggggacac ttgaacagta gagaccccaa gaccaccgag 181 ctgatggcat cttccacgcc ttcgcctgca acctcctcga acgcgggagc ggatcctaat 241 actaccaacc tgcgtcctac aacatatgat acttggtgtg gcgtagccca tggatgcacc 301 agaaaactgg gcctgaagat ctgtggtttc ttgcaaagga ccaatagcct ggaagagaag 361 agtcgtcttg tgagcgcctt cagggagagg caggcctcca agaacctgct ttcctgtgaa 421 aacagtgacc ctggtgcccg cttccggcgc acagagacgg acttctccaa cctgtttgct 481 caagatctgc ttccagctaa gaacggggag gagcaaactg tgcagttctt actggaggtg 541 gttgacatac tcctcaacta tgtccgcaag acgtttgatc gctccaccaa ggttttggac 601 ttccaccacc cacaccagtt gctggaaggc atggaaggtt ttaatttgga gctgtctgac 661 caccccgagt ctctggagca gatcctggtt gactgtagag acaccctaaa gtacggggtt 721 cgcacaggtc accctcggtt tttcaaccag ctctctactg gtttggatat cattggttta 781 gctggcgaat ggctgacatc aactgccaat accaatatgt tcacatatga aattgcaccc 841 gtgtttgttc ttatggaaca gatcacactt aagaagatgc gagagatcat tggatggtca 901 aataaagatg gtgatgggat attttctcct gggggagcca tatccaacat gtacagcatc 961 atggcggctc ggtacaagta ctttccagaa gtgaaaacaa aaggcatggc ggctgtgccc 1021 aaactggtcc tcttcacctc agaacacagt cactattcca taaagaaggc tggggctgcg 1081 cttggctttg gaactgacaa tgtgatcctg ataaagtgca atgaaagggg gaagataatt 1141 ccagctgatt tagaggcaaa aattcttgat gccaaacaaa agggatttgt tcccctttat 1201 gtcaatgcaa ccgcaggcac gactgtttat ggagcgtttg atccgatcca ggagattgca 1261 gacatatgtg agaaatacaa cctttggctg catgtggatg ctgcctgggg tggtgggctg 1321 ctcatgtccc ggaagcatcg ccacaaactc agcggcatag aaagggccaa ttcagtcacc 1381 tggaaccctc acaagatgat gggtgtgctg ctccagtgtt ctgccatcct ggtcaaggaa 1441 aagggtatac tccaaggatg caaccagatg tgtgcaggct acctcttcca gccagacaag 1501 cagtatgacg tctcctacga tacaggggac aaggcgattc agtgtggccg ccatgtggac 1561 atcttcaagt tctggctgat gtggaaagca aagggcactg tgggatttga aaatcagatc 1621 aacaaatgcc tggagctggc tgaatacctc tatgccaaga ttaaaaacag agaagagttt 1681 gagatggttt tcaatggtga gcctgagcac acaaatgtct gtttctggta cataccacaa 1741 agcctccgag gggttccaga cagccctgag cgacgagaaa aactacacag ggtggctccc 1801 aagatcaaag ccctgatgat ggagtcagga acaaccatgg ttggttacca gcctcaaggg 1861 gacaaggcca acttcttccg gatggtcatc tccaacccag ccgccaccca gtctgacatc 1921 gatttcctca tcgaggaaat agagaggttg ggtcaagatc tgtaatcccc ccccctttgc 1981 agaaccgtaa tcaccggcca tagccctgcc cctctggcac cctagaacgc agttctgtca 2041 gtagttgaca cacctaggcc atttcactga aggaaattat aatctcttaa agaatatttg 2101 tcacatctca cgtaagcttg tttgttagaa ttagcgggaa ataatgttct tttaaaaaaa 2161 ttgcacatta gaaacacagt atatatgtac agttatatat atatatatac ctctctctat 2221 gtactatgta tgtatagtga atgtggctta gtcctagacc acagcatgtc gctcgctccc 2281 agggaattag cctcaccccc agcaattact aagaggctaa ccatctcgca agcaactgca 2341 ggacagatgg tacccccact gcagtgtcct agggacccag ggaaaggccg ttgctgggag 2401 cctaccaccc tgctagagct gtccccacct gaatggatgg atgatggatg gcaggaagta 2461 ccagtagatg acaaatgtca caccctccct gttagtgccc tgctagggga tatagtagca 2521 gagtctttgt cacaattgtg ctgttgctgc gttttagaga ttaatctgtg taaactgtac 2581 attaccgttg tctatctttg ggcaggggga agtgcatata atgatttaat tgtacgtcag 2641 tgagatattt gcttatttat attcaaatat atatcatgtt aaagagacat cttgtatttt 2701 cttcccattt gtaatgtatc ttatttatat attaatggag taagttctgg atactgttta 2761 tggtattttc gtgcatttgt gagccaaaga gaaaagatga aaattagtga gacttgcatt 2821 tatatttgag tgcccttaac ataatgattt gaacacatgt gtactgtctg gaaacgaatt 2881 ctgatactgt acatagagtg gcgttatgga aatcttgctt cagtagcctt tgctcttctc 2941 tttcccccct caggctgtac gtcaggtgtt ctcaaagctt ttctagtaac tgttgaaaaa 3001 taatgactag atctcctgta attttgtagt agtatatgac caatctctgt gactcgctta 3061 gctgaaacct aaggctatgt ttccgaagat ctccgataca ctgaccagtc ccacaagtgt 3121 ttttgaagac atgaaaccca cactgtgcat ttagagtatg caagaagaat ataaataaaa 3181 taaaaaatat tctccatg // LOCUS RATTSHRA 5270 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat thyroid stimulating hormone receptor mRNA, complete cds. ACCESSION M34842 KEYWORDS TSH receptor; thyroid stimulating hormone receptor; transmembrane protein. SOURCE Rat (strain Fisher) thyroid epithelial cell line FRTL-5, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 5270) AUTHORS Akamizu,T., Ikuyama,S., Saji,M., Kosugi,S., Kozak,C., McBride,O.W. and Kohn,L.D. TITLE Cloning, chromosomal assignment, and regulation of the rat thyrotropin receptor by thyrotropin, agents which increase cAMP levels and thyroid autoantibodies JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5677-5681 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.D.Kohn, 05-JUN-1990. FEATURES from to/span description pept 55 2349 thyroid stimulating hormone receptor precursor sigp 55 117 thyroid stimulating hormone receptor signal peptide matp 118 2346 thyroid stimulating hormone receptor signal 5256 5261 poly-A signal BASE COUNT 1415 a 1343 c 1176 g 1336 t ORIGIN 1 gcggccgcgt tgacagcgcg caacgatgaa gtagcactgg aggtcccttg gaaaatgagg 61 ccagggtccc tgctccagct cactctgctg ctcgccctgc ccaggagcct ctggggcaga 121 gggtgtactt ctccaccctg cgaatgccac caggaggacg acttcagagt cacctgcaag 181 gaactccacc aaatccccag cctaccgccc agcacccaga ctctgaagct catcgagact 241 cacctgaaga ccattcccag tcttgccttt tcgagcctgc ccaatatttc caggatctat 301 ctatccatag atgccactct gcagcgactg gagccacatt ctttctacaa tttgagtaaa 361 atgactcaca tagaaatccg gaacaccaga agcttgactt acatagaccc tgatgcctta 421 acagagctcc ccttgctcaa gtttcttgga attttcaata ctggacttag aatattccct 481 gacttgacca aaatttattc cacggatgtg ttctttatac tagaaatcac agacaaccct 541 tacatgacgt cagtccctga aaacgcattc cagggactat gcaatgaaac cttgaccctg 601 aaactataca acaatggatt tacttcaatc caaggacatg ctttcaatgg aacaaagctg 661 gatgctgttt acctaaacaa gaataaatat ctgacagcta tagacaaaga tgcctttgga 721 ggagtataca gtggacccac tttgctagac gtgtcttcca ccagcgttac tgctcttcct 781 tccaaaggcc tggagcacct caaagagctg atcgcgaaga acacctggac tctcaaaaag 841 ctccccctgt ccttgagctt cctccacctc actcgggctg acctctctta cccaagtcac 901 tgctgtgctt ttaagaacca gaagaaaatc aggggaatcc tagagtcttt gatgtgtaat 961 gagagtagta tccggaacct gcgtcaaaga aagtcagtga acgtcatgag gggtcccgtc 1021 taccaggaat atgaagaagg tctgggtgac aaccatgttg ggtacaaaca aaactccaag 1081 ttccaggagg gcccaagcaa ctctcactat tacgtcttct ttgaagaaca agaggacgag 1141 atcatcggtt tcggccaaga gctcaaaaat cctcaggaag agactctcca agccttcgac 1201 agccactatg actacactgt gtgtggggac aacgaggaca tggtgtgtac ccccaagtca 1261 gacgagttta acccctgtga agatatcatg ggctacaagt tcctgagaat cgtggtatgg 1321 tttgtcagtc cgatggctct cctgggcaac gtctttgtcc tgttcgtcct gcttactagc 1381 cactacaaac tgaccgtgcc acgcttcctg atgtgcaacc tggcctttgc agatttctgc 1441 atgggggttt acctgcttct cattgcctcc gtagacctgt acacacacac tgagtactac 1501 aaccacgcca ttgactggca gacgggccct ggatgcaaca cggctggttt cttcactgtt 1561 tttgccagcg agttatcagt gtacacactg acggtcatca ccctggagcg atggtatgcc 1621 atcaccttcg ccatgcgcct ggataggaag atccgcctca ggcacgctta taccatcatg 1681 gcgggggggt gggtttcctg cttccttctc gccttgctcc caatggtggg aataagcagc 1741 tacgccaagg tcagcatctg cctgccgatg gacacagaca cccctcttgc cctggcgtac 1801 atcgcccttg ttctcctgct caacgttgtt gcctttgtta ttgtctgctc ctgctatgtg 1861 aagatttaca tcaccgtccg aaatccccag tacaaccctc gagataaaga caccaagatt 1921 gccaagagga tggccgtgtt gatcttcact gacttcatgt gcatggcgcc catctccttc 1981 tatgccctgt cagcgcttat gaacaagcct ctgatcactg tcactaactc cggggtcttg 2041 ttggttctct tctacccact caactcctgt gctaacccat ttctctatgc cattttcacc 2101 aaggccttcc agagggatgt gttcatccta ctcagcaagt ttggcctctg caaacaccag 2161 gcccaagcct atcaggccca gagagtctgt cccaacaaca acactggtat tcagatccaa 2221 aagatccccc aggacacaag gcagagtctc cccaatgtgc aagataccta tgaaccgctt 2281 ggaagctccc acctgactcc gaaactacag ggacggatct cagaagagta cacgcaaaca 2341 gccctgtaaa ggaagcgtac gctagtcatg gtgggactta caaaaggctg gtttctagaa 2401 catgcactcc agttccagga cacgtgaatg cctaggtcca cggggtcaga gttcatctct 2461 agggagtatt gcctccaaga agggagagaa gctatcggcg tgactgagtc ctaggtgatc 2521 taaaagtcgt ccaatctctc tggaagattt gctcgatgct tgcctgtgat gccgggcaac 2581 gcgctccact gataaccagc tggactgttc ccacagctct ctactccatt ctacggagtg 2641 ttttatatga aagattcagc aagtggccag tgttcattta cgcgagcagc tccacattag 2701 ctcccctgag gtaaccaaag aaagtctttg ttccttaaaa ataaaatgcc atacttggga 2761 ggtggaggca gggggaccca gttcatactt aactacccag aaaattccaa gccaacctat 2821 gctccaacaa tagcactggt attcagatcc aaaagatccc ccaggacgca aggtagagtc 2881 cttctccatc tcaagagacc aaaagaaccc tgaaacgcca aacataactt gatgtcacat 2941 aattgctttt attaaactga aaaaagagaa agtctagttt tgagagttct ttttttcctt 3001 tgactctgaa acagttatct cttcaccggg cttgatgagc tcgatgtaac ccagctgtta 3061 cacacattgg ctagagaaac tgctaagatt tcagcttttg tggctgagcg aactaagaat 3121 tatcttctcg gacaagacat gagccctggg agagatttct aaggagcata tggtgaggac 3181 agggcgcact tagtcacctg ttgaataaga caactatccc cagttaaccg tcagctttga 3241 gcaagtccct ggacctctaa attctgtaga aatggggaga cttgatgccc tctcctactt 3301 aaaaaaaacc ctcagtatat ctctccctca aaagatagtg tctgtgacaa agagaaaaaa 3361 gcattactta agtggaatct aatttttttt ttctaccttg tgaggctgct ggctgcttct 3421 ttttggaacc gggacatgtc accgaaaaca ctttttcttt atttcatttt gatcatgcgt 3481 ctgagccaat actattgagt ggctatacgg agccagatct taattattca tctaattatt 3541 atattctgta ctgctctcaa gtccaaaaca taacagaact tagagaatct gttactaata 3601 cagcctggct tggtagtgta cgcctttaat ctcagcactg gggaggcaga ggctgaattt 3661 gaggccagcc tggtccaaga cagccaggtc tacacagaga aaccctgttt caaagaaaac 3721 aaaacaaaat agactaataa tggagttaac acgtgactgt gtaggctcac ttcaaaacat 3781 ctttttcttt tacttttttt tatagctctc tgggctcaca ccacactaac actgtcaaga 3841 gagaagtcgt ttcacccgac actgccagtt ccttctatcc ctgccagtca catgagacat 3901 gtgtgccctg atagtagcct tgcttccaac aggaactgta cctacaggtg ttccttctag 3961 gttgcctgtc tccttagctc cttaccatat tgttacggtg tccccccaag cagatctgcc 4021 atttgcttct cagtactcat gaaccaaaca ggaaagataa gggttttgcc accttaagtc 4081 tgtgaagtag ggtttatggg tcacctgaca tggctcctat ctgactggcc tggaactctg 4141 tcacagtcac gtaagtgtag gctttcccag gtgctcgctt gtaactactg aaataaataa 4201 atgtggtgcc ctgaggcatc cgtttctctc ttacgtcact tttggaatgt cacatctgga 4261 cagataaaat atttcatctc gagtagccga tctaatgctt tgaaaaccaa gtcctagaag 4321 agtgacttcg acaaaattgc tagtaaaatt caacttgaca gcatgaagta aagaaacctc 4381 ctgtagtaag atatatagta ttttttttta atttgtcaat ttcttttgag gttaaccttg 4441 tatgtgcatg tgttaaaagc aaaaagggaa catctgctgt ggcactcagg agagtagccc 4501 tgcccctccc cttggtcata cggtagatct gaccccggat gtgggagttg caggtgagct 4561 ggcccggagg acatgaattc agaagagcgg gccctgcttt gccttctgtg catgcagtgg 4621 catagactag ggaaagatgt ccttctaccc tctcttatta cttgtcacct atggcaggca 4681 ggagagctgt gcccaaagac atgagagcag gggtatggcc ctgtccctca ctatctgcaa 4741 cactctggag agcaggccct acaccttgcc tgggcagcag ggtagagctg gccctggtta 4801 tgggagtcac tggtgagcca gttctgaagg caggagaacc agtggactga ccagctcaga 4861 taccactcag cccagatcca gggctttaaa ttggctcact ctaacatcta cccatcagtg 4921 aactactgga gtacatgaag caggcagtcc tacagatcca aaactacaca caggatctcc 4981 atgatgcacg acaacaacag gatatccaag aagagccctg gtgaggttcc agcattgatg 5041 gtggagcaga aggcagaggg cttataccag aacaaggact cgctgcaatg aacatttgca 5101 agcaaaagaa gtgtggacaa aagggtatac tggacactac agcatctgca gtgaggtttt 5161 gtccctctgt tgggacagga agcatgcgtg gaggatggat atgaggggaa aagatgagtg 5221 gtttggggta cagggtgtaa agttcacaaa gaatcaataa aaaattttcg // LOCUS STMBLIPA 840 bp ds-DNA BCT 14-AUG-1990 DEFINITION S.clavuligerus beta-lactamase inhibitory protein gene, complete cds. ACCESSION M34538 KEYWORDS beta-lactamase inhibitory protein. SOURCE S.clavuligerus (strain NRRL 3585) DNA, clone pBLIP. ORGANISM Streptomyces clavuligerus Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 840) AUTHORS Doran,J.L., Leskiw,B.K., Aippersbach,S. and Jensen,S.E. TITLE Isolation and characterization of a beta-lactamase inhibitory protein from Streptomyces clavuligerus and cloning and analysis of corresponding gene JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.E.Jensen, 25-MAY-1990. Dept. of Microbiology University of Alberta Edmonton, Alberta Canada T6G 2E9 FEATURES from to/span description pept 67 672 beta-lactamase inhibitory protein precursor (bliP) sigp 67 174 beta-lactamase inhibitory protein signal peptide matp 175 669 beta-lactamase inhibitory protein binding 52 63 ribosomal binding site signal 686 724 transcription termination signal BASE COUNT 140 a 248 c 320 g 132 t ORIGIN 1 atcgatgcca gggttgagca ggccctcagc gagggcggag gcttggtcga cggaagggga 61 cgggcgatga ggacagtggg gatcggcgcg ggggttcgga ggctggggcg ggccgtggtc 121 atggcggcgg ccgtgggtgg gctggtgctc gggtcggcgg gggcttcgaa cgcggcgggg 181 gtgatgaccg gggcgaagtt cacgcagatc cagttcggga tgacacgtca gcaggtcctc 241 gacatagccg gtgcggagaa ctgtgagacc ggcgggtcgt tcggggacag catccactgc 301 cgggggcacg cggcagggga ctactacgcc tacgccacct tcggcttcac cagcgccgcc 361 gccgacgcga aggtggactc gaagagccag gagaagctgc tggccccgag cgccccgacg 421 ctcaccctcg ccaagttcaa ccaggtcacc gtggggatga ccagggccca ggtactggcg 481 accgtcgggc aggggtcctg caccacctgg agtgagtact acccggccta tccgtcgacg 541 gccggggtga ccctcagcct gtcctgcttc gatgtggacg gttactcgtc gacggggttc 601 taccgaggct cggcgcacct ctggttcacg gacggggtgc ttcagggcaa gcggcagtgg 661 gaccttgtat aaggacgcgg tttcactgtg cggggcggat cacccggtga tccgccccgc 721 acggccatga accaggggga tgggaaacat cagtacgtca gaaattctcg tcgccgaagg 781 agtcgacctc tcctacggcg atcagcccgc cgtgcgggat gcccggatct cggtggtacc // LOCUS TRBPARPA 763 bp ss-mRNA INV 14-AUG-1990 DEFINITION T.brucei procyclin (pPS-1) mRNA, complete cds. ACCESSION M34790 KEYWORDS procyclic acidic repetitive protein; procyclin. SOURCE T.brucei brucei (strain TREU 427, MITat 1.4), cDNA to mRNA, clone pPS-1. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 763) AUTHORS Dorn,P.L., Aman,R.A. and Boothroyd,J.C. TITLE Inhibition of protein synthesis results in super-induction of procyclin RNA JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Boothroyd, 04-JUN-1990. Stanford University School of Medicine Dept of Microbiology and Immunology Sherman Fairchild Science Building Stanford, CA 94305-5402 USA FEATURES from to/span description pept 29 466 procyclin mRNA < 1 763 procyclin mRNA rpt 215 393 12 bp degenerate repeats BASE COUNT 192 a 215 c 156 g 200 t ORIGIN 1 ccaattacac caaaaagtaa aattcacaat ggcacctcgt tccctttatc tgctcgctct 61 tcttctgttc agccccaacc tcttcgctgg cgtgggattt gccgcagccg ctgaaggacc 121 agaagacaag ggtcttacta agggaggcaa aggcaaaggc gagaagggaa ccaaggtcgg 181 cgccgacgat accaatggca ctgaccccga ccctgaaccc gaacctgaac ccgaacctga 241 acctgaaccc gaacctgaac ccgaacctga acctgaacct gaacccgaac ccgaacctga 301 acctgaacct gaacctgaac ccgaacctga acctgaacct gaacctgaac ccgagccgga 361 acctgaacca gaacctgaac cagaacctga acctggtgct gcaacgctga aatccgttgc 421 acttccgttc gcaatcgcgg ctgctgctct cgttgccgca ttctaagcgg atgcaagcgt 481 gtaaagcgcc tcggaggaac gaaacccttt gaaaaggttc ctttcattta tatcgcctcc 541 atatggtgca tcgtgtttgt ttcctgctgt ttcttgtaaa acaagtgtgg acattcattt 601 aatatttttt cgttatattt ttttggtgac atcctttcta atgccttatt aaccatcgcc 661 tgagacccac agccctgtag atttctgtga tgtttcggtt gcgtattcca taattttaag 721 cgtttcactt ctattttttt tcattccttt gaatttggat ctt // LOCUS W10PPIBCA 4761 bp ds-DNA BCT 14-AUG-1990 DEFINITION Plasmid pWR100 (from S.flexneri) antigen ippI, ipaB, ipaC genes, complete cds, and ippD gene, 3' end. ACCESSION M34849 KEYWORDS antigen. SOURCE Plasmid pWR100 (from Shigella flexneri type 5, strain M90T) DNA, clones HB101/pHS[4108,5357]. ORGANISM Plasmid pWR100 Prokaryota; Bacteria. REFERENCE 1 (bases 1 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. TITLE Nucleotide sequence of the invasion plasmid antigen B and C genes (ipaB and ipaC) of Shigella flexneri JOURNAL Microb. Pathog. 4, 345-357 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 538 and 3987 to 4761) AUTHORS Baudry,B., Kaczorek,M. and Sansonetti,P.J. JOURNAL Unpublished (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by B.Baudry, 05-JUN-1990. Center for Vaccine Development University of Maryland at Baltimore 10 South Pine Street Baltimore, MD 21201 FEATURES from to/span description pept 606 1073 polypeptide I (ippI) pept 1079 2821 polypeptide B (ipaB) pept 2841 3932 polypeptide C (ipaC) pept 3983 > 4761 ipaD protein pept < 1 549 unidentified ORF binding 595 600 ippI ribosomal binding site binding 1065 1070 ipaB ribosomal binding site binding 2829 2834 ipaC ribosomal binding site binding 3971 3977 ipaD ribosomal binding site signal 556 561 ippI -10 region signal 2723 2728 ipaC -35 region signal 2745 2750 ipaC -10 region BASE COUNT 1769 a 952 c 805 g 1235 t ORIGIN 1 aagcttgtaa agaaaatatt atctgttttt aacttaaaac aacgctttcc tcagaagaat 61 tttggatgcc cggtgaatat aaataaaatc agggacagcg tcatagacaa aataaaagat 121 agtaattcgg ggaatcaact attttgttgg atgagccaag agcgaaccac ttatgtctct 181 tcaatgataa acagaagtat agatgaaatg gccatacaca atggagttgt tttgacctca 241 gataacaaaa aaaacatatt tgcagctatt gaaaaaaaat tcccggatat aaaacttgat 301 gaaaaatcag cacaaacatc tatcagtcat acagcactaa acgagattgc ctcatctggc 361 ctcagagcaa agattttaaa acgctatagc agtgacatgg atttgtttaa cacacaaatg 421 aaggatctta caaatctagt atccagttcc gtctacgata aaatatttaa tgaatcaaca 481 aaagttttac aaatagaaat aagcgcagag gttctaaagg ctgtataccg tcaaagcaat 541 acaaattaaa tcttttatat taaatcttat acttattaaa gtaaagcaga caaaaaggag 601 accttatgtc tttaaatatc accgaaaatg aaagcatctc tactgcagta attgatgcaa 661 ttaactctgg cgctacactg aaagatatta atgcaattcc tgatgatatg atggatgaca 721 tttattcata tgcttatgac ttttacaaca aaggaagaat agaggaagct gaagttttct 781 tcaggttttt atgtatatac gacttttaca atgtagacta cattatggga ctcgcagcta 841 tttatcagat aaaagaacag ttccaacaag cagcagacct ttatgctgtc gcttttgcat 901 taggaaaaaa tgactataca ccagtattcc atactggaca atgtcagctt cggttgaaag 961 cccccttaaa agctaaagag tgcttcgaac tcgtaattca acacagcaat gatgaaaaat 1021 taaaaataaa agcacaatca tacttggacg caattcagga tatcaaggag taattattat 1081 gcataatgta agcaccacaa ccactggttt tcctcttgcc aaaatattga cttccactga 1141 gcttggagac aatactatcc aagctgcaaa tgatgcagct aacaaattat tttctcttac 1201 aattgctgat cttactgcta accaaaatat taatacaact aatgcacact caacttcaaa 1261 tatattaatc cctgaactta aagcaccaaa gtcattaaat gcaagttccc aactaacgct 1321 tttaattgga aaccttattc aaatactcgg tgaaaaatct ttaactgcat taacaaataa 1381 aattactgct tggaagtccc agcaacaggc aagacagcaa aaaaacctag aattctccga 1441 taaaattaac actcttctat ctgaaactga aggactaacc agagactatg aaaaacaaat 1501 taataaacta aaaaacgcag attctaaaat aaaagaccta gaaaataaaa ttaaccaaat 1561 tcaaacaaga ttatccgaac tcgatccaga gtcaccagaa aagaaaaaat taagccggga 1621 agaaatacaa ctcactatca aaaaagacgc agcagttaaa gacaggacat tgattgagca 1681 gaaaaccctg tcaattcata gcaaacttac agataaatca atgcaactcg aaaaagaaat 1741 agactctttt tctgcatttt caaacacagc atctgctgaa cagctatcaa cccagcagaa 1801 atcattaacc ggacttgcca gtgttactca attgatggca acctttattc aactagttgg 1861 aaaaaataat gaagaatctt taaaaaatga tctggctcta ttccagtctc tccaagaatc 1921 aagaaaaact gaaatggaga gaaaatctga tgagtatgct gctgaagtac gtaaagcaga 1981 agaactcaac agagtaatgg gttgtgttgg gaaaatactt ggggcacttt taactatcgt 2041 tagtgttgtt gcagcagctt tttctggagg agcctctcta gcactggcag ctgttggttt 2101 agctcttatg gttacggatg ctatagtaca agcagcgacc ggcaattcct tcatggaaca 2161 agccctgaat ccgatcatga aagcagtcat tgaaccctta atcaaactcc tttcagatgc 2221 atttacaaaa atgctcgaag gcttgggcgt cgactcgaaa aaagccaaaa tgattggctc 2281 tattctgggg gcaatcgcag gcgctcttgt cctagttgca gcagtcgttc tcgtagccac 2341 tgttggtaaa caggcagcag caaaacttgc agaaaatatt ggcaaaataa taggtaaaac 2401 cctcacagac cttataccaa agtttctcaa gaatttttct tctcaactgg acgatttaat 2461 cactaatgct gttgccagat taaataaatt tcttggtgca gcgggtgatg aagtaatatc 2521 caaacaaatt atttccaccc atttaaacca agcagtttta ttaggagaaa gtgttaactc 2581 tgccacacaa gcgggaggaa gtgtcgcttc tgctgttttc cagaacagcg cgtcgacaaa 2641 tctagcagac ctgacattat cgaaatatca agttgaacaa ctgtcaaaat atatcagtga 2701 agcaatagaa aaattcggcc aattgcagga agtaattgca gatctattag cctcaatgtc 2761 caactctcag gctaatagaa ctgatgttgc aaaagcaatt ttgcaacaaa ctactgcttg 2821 atacaaataa ggagaatgtt atggaaattc aaaacacaaa accaacccag actttatata 2881 cagatatatc cacaaaacaa actcaaagtt cttccgaaac acaaaaatca caaaattatc 2941 agcagattgc agcgcatatt ccacttaatg tcggtaaaaa tcccgtatta acaaccacat 3001 taaatgatga tcaactttta aagttatcag agcaggttca gcatgattca gaaatcattg 3061 ctcgccttac tgacaaaaag atgaaagatc tttcagagat gagtcacacc cttactccag 3121 agaacactct ggatatttcc agtctttctt ctaatgctgt ttctttaatt attagtgtag 3181 ccgttctact ttctgctctc cgcactgcag aaactaaatt gggctctcaa ttgtcattga 3241 ttgcgttcga tgctacaaaa tcagctgcag agaacattgt tcggcaaggc ctggcagccc 3301 tatcatcaag cattactgga gcagtcacac aagtaggtat aacgggtatc ggtgccaaaa 3361 aaacgcattc agggattagc gaccaaaaag gagccttaag aaagaacctt gccactgctc 3421 aatctcttga aaaagagctt gcaggttcta aattagggtt aaataaacaa atagatacaa 3481 atatcacctc accacaaact aactctagca caaaattttt aggtaaaaat aaactggcgc 3541 cagataatat atccctgtca actgaacata aaacttctct tagttctccc gatatttctt 3601 tgcaggataa aattgacacc cagagaagaa cttacgagct caataccctt tctgcgcagc 3661 aaaaacaaaa cattggccgt gcaacaatgg aaacatcagc cgttgctggt aatatatcca 3721 catcaggagg gcgttatgca tctgctcttg aagaagaaga acaactaatc agtcaggcca 3781 gcagtaaaca agcagaggaa gcatcccaag tatctaaaga agcatcccaa gcgacaaatc 3841 aattaataca aaaattattg aatataattg acagcatcaa ccaatcaaag aattcggcag 3901 ccagtcagat tgctggtaac attcgagctt aatatatcca agagccataa taatatatgg 3961 ctcttcctgt aaggaaataa ccatgaatat aacaactctg actaatagta tttccacctc 4021 atcattcagt ccaaacaata ccaacggttc atcaaccgaa acagttaatt ctgatataaa 4081 aacaacgacc agttctcatc ctgtaagttc ccttactatg ctcaacgaca cccttcataa 4141 tatcagaaca acaaatcagg cattaaagaa agagctttca caaaaaacgt tgactaaaac 4201 atcgctagaa gaaatagcat tacattcatc tcagattagc atggatgtaa ataaatccgc 4261 tcaactattg gatattcttt ccaggaacga atatccaatt aataaagacg caagagaatt 4321 attacattca gccccgaaag aagccgagct tgatggagat caaatgatat ctcatagaga 4381 actgtgggct aaaattgcaa actccatcaa tgatattaat gaacagtatc tgaaagtata 4441 tgaacatgcc gttagttcat atactcaaat gtatcaagat tttagcgctg ttctttccag 4501 tcttgccggc tggatctctc ccggaggtaa cgacggaaac tccgtgaaat tacaagtcaa 4561 ctcgcttaaa aaggcattgg aagaactcaa ggaaaaatat aaagataaac cgctatatcc 4621 agcaaataat actgttagtc aggaacaagc aaataaatgg cttacagaat taggtggaac 4681 aatcggcaag gtatctcaaa aaaacggggg atatgttgtc agtataaaca tgaccccaat 4741 agacaatatg ttaaaaagct t // LOCUS YEPOMPHA 824 bp ss-mRNA BCT 14-AUG-1990 DEFINITION Y.enterocolitica cationic outer membrane protein (ompH) gene, complete cds. ACCESSION M34854 KEYWORDS cationic outer membrane protein; enterobacterial outer membrane protein; lipopolysaccharide binding protein; secretory protein; structural membrane protein. SOURCE Y.enterocolitica (strain 03 isolate EH822) DNA, clone pUCHS115. ORGANISM Yersinia enterocolitica Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 824) AUTHORS Hirvas,L., Koski,P. and Vaara,M. TITLE The ompH gene of Yersinia enterocolitica: Cloning, sequencing, and comparison to known ompH sequences JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by L.H.Hirvas, 06-JUN-1990. University of Helsinki FEATURES from to/span description pept 324 818 ompH protein precursor (gtg start codon) sigp 324 392 ompH protein signal peptide matp 393 815 ompH protein binding 308 317 ompH ribosomal binding site signal 120 125 ompH -35 region signal 143 148 ompH -10 region BASE COUNT 257 a 147 c 198 g 221 t 1 others ORIGIN 1 tattcctgat tacagtaaag ccagcaatat tcgtgtctct gccggtgtgg cactacaatg 61 gatgtctcca ttggggcctt tagtgttcgc atatgctaaa ccggttaaag attacgaagg 121 tgataagtca gagcaattcc agtttaacat ggtaaaactt ggtaattgat tggcaaaagt 181 agttgcaaat tttaaagaca gcactggtag ttgaaataac ctaaggtagc gctgatgagt 241 ttggcgctat tttaggcatt attcagtatt aagtgcactt caaggtgtct ctgacacaaa 301 caggtaatgg taaggagttt atagtgaaaa agtggttgtg tgcggcaagt cttggtttag 361 cattggcagc ttctgccagy gttcaagccg caaaaattgc tattgttaac gtttccagaa 421 ttttccaaca attacctgag agtgaaaccg tagctaaaca gttggagaat gaattcaaag 481 gccgagcaac cgaactgcaa ggaatggaga gcgatctgca gactaaaatg cagaaactgc 541 aacgtgacgg ttctaccatg aaagccagtg atcgtaccaa actggaaaat gacgtaatga 601 aacagcgtga aactttctct actaaagccc aggcttttga gcaagataat cgccgtcgtc 661 agatggaaga gcgtaataaa atcctgagcc gtattcagga tgctgttaaa tctgttgcta 721 gcaaaggcgg ttatgacgtg gtgattgatg caaatgctgt tgcatatgca gatccttcta 781 aagatatcac tgctgacgtg ctgaaacagg ttaaataaaa gtac // LOCUS YSCEXG1A 2056 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase (EXG1) gene, complete cds. ACCESSION M34341 KEYWORDS 1,3-glucan glucanohydrolase; exo-1,3-B-glucanase. SOURCE S.cerevisiae (strain AB320) DNA, clone YEp13. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2056) AUTHORS Vazquez de Aldana,C.R., Correa,J., San Segundo,P., Bueno,A., Nebreda,A.R., Mendez,E. and del Rey,F. TITLE Nucleotide sequence of the exo-1,3-B-glucanase-encoding gene, EXG1, in the yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.del Rey, 17-MAY-1990. Instituto de Microbiologia bioquimica Facultad de Biologia CSIC/Universidad de Salamanca 37008 Salamanca, SPAIN FEATURES from to/span description pept 363 1709 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase precursor (EXG1, EC 3.2.1.58) sigp 363 482 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase signal peptide matp 483 1706 exo-1,3-B-glucanase/1,3-B-D-glucan glucanohydrolase mRNA 265 > 1709 EXG1 mRNA signal 153 159 TATA box BASE COUNT 610 a 395 c 396 g 655 t ORIGIN Chromosome XII-right arm. 1 ctcgagcacg ccgttataat tttagcgtgt tcctgtacct gtgtgcacat caataagcgg 61 tgtaacaaac ttgaacttgc catctcatat cgtcatatga gcagttgcag agaaaggcac 121 tttaaataaa aaggcgtgga tgataaaaaa tgtatataag ttggatggat ttttgggaaa 181 aagtaatgtt tttgcagacg ttttaaatac tccctccctt ttcttagtaa tttttattat 241 gtattgacta agtcaaaaat aactatagaa aactaaagtt tacgagagga cccaaaagtt 301 ttgaataaca cgtgcctttg attttttgtt tactttcttt ttctagttaa ttaccaacta 361 aaatgctttc gcttaaaacg ttactgtgta cgttgttgac tgtgtcatca gtactcgcta 421 ccccagtccc tgcaagagac ccttcttcca ttcaatttgt tcatgaggag aacaagaaaa 481 gatactacga ttatgaccac ggttccctcg gagaaccaat ccgtggtgtc aacattggtg 541 gttggttact tcttgaacca tacattactc catctttgtt cgaggctttc cgtacaaatg 601 atgacaacga cgaaggaatt cctgtcgacg aatatcactt ctgtcaatat ttaggtaagg 661 atttggctaa aagccgttta cagagccatt ggtctacttt ctaccaagaa caagatttcg 721 ctaatattgc ttcccaaggt ttcaaccttg tcagaattcc tatcggttac tgggctttcc 781 aaactttgga cgatgatcct tatgttagcg gcctacagga atcttaccta gaccaagcca 841 tcggttgggc tagaaacaac agcttgaaag tttgggttga tttgcatggt gccgctggtt 901 cgcagaacgg gtttgataac tctggtttga gagattcata caagtttttg gaagacagca 961 atttggccgt tactacaaat gtcttgaact acatattgaa aaaatactct gcggaggaat 1021 acttggacac tgttattggt atcgaattga ttaatgagcc attgggtcct gttctagaca 1081 tggataaaat gaagaatgac tacttggcac ctgcttacga atacttgaga aacaacatca 1141 agagtgacca agttatcatc atccatgacg ctttccaacc atacaattat tgggatgact 1201 tcatgactga aaacgatggc tactggggtg tcactatcga ccatcatcac taccaagtct 1261 ttgcttctga tcaattggaa agatccattg atgaacacat taaagtagct tgtgaatggg 1321 gtaccggagt tttgaatgaa tcccactgga ctgtttgtgg tgagtttgct gccgctttga 1381 ctgattgtac aaaatggttg aatagtgttg gcttcggcgc tagatacgac ggttcttggg 1441 tcaatggtga ccaaacatct tcttacattg gctcttgtgc taacaacgat gatatagctt 1501 actggtctga cgaaagaaag gaaaacacaa gacgttatgt ggaggcacaa ctagatgcct 1561 ttgaaatgag agggggttgg attatctggt gttacaagac agaatctagt ttggaatggg 1621 atgctcaaag attgatgttc aatggtttat tccctcaacc attgactgac agaaagtatc 1681 caaaccaatg tggcacaatt tctaactaag tcgccctcag tccgctcatt ttagctgaat 1741 tttctaatgt tatttttcat cagcaaaact taacagaacg ttaatttatc tacccctttt 1801 agttcattat ctctttttta tccaacattt tacagagatc tctcacttaa gtctaagtaa 1861 agacattatt ttatatggta cacttataga atatacgata ataataataa aaactatgta 1921 acataacctt cagaatttaa tattagtttc ctttttacct cattgcacta ataaaaaaat 1981 tctacagaat ctccgaaaaa gaaaatccag cttactcttt ttgttttctt cttcacacgt 2041 gagcttttcc gccggc // LOCUS CORDTXRA 2529 bp ds-DNA BCT 14-AUG-1990 DEFINITION C.diphtheriae toxin repressor (dtxR) gene, complete cds. ACCESSION M34239 KEYWORDS regulatory protein; toxin repressor. SOURCE C.diphtheriae (strain C7) DNA, clone pHH2500.. ORGANISM Corynebacterium diphtheriae Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 1 to 1390) AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R. TITLE Molecular cloning and DNA sequence analysis of the diphtheria tox iron-dependent regulatory element (dtxR) from Corynebacterium diphtheriae JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5968-5972 (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 2529) AUTHORS Boyd,J., Oza,M.N. and Murphy,J.R. JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.R.Murphy, 11-MAY-1990. University Hospital Dept Medicine-Rm E613 88 East Newton Street Boston, MA 02118-2393 FEATURES from to/span description pept 681 1361 toxin repressor protein (dtxR) pept 25 456 ORF1 pept 1384 2370 ORF3 signal 595 629 promoter sequence binding 664 669 ribosomal binding site (put.) binding 1200 1283 DNA-binding region (put.) BASE COUNT 691 a 615 c 593 g 630 t ORIGIN 1 bp upstream of HindIII site. 1 aagctttctc gaatcaaacg cgagatgtat caacagctcg gacgtgaagc aaccaacgaa 61 gagcttgcgg aagaatctgg cattgaagaa tccaaaattg aaatgttgct tcgccagtct 121 cgtgacccag taagcctaga tatgccagtt ggtgccgatg aagaggctcc acttggtgac 181 tttatcgaag attccgaggc gactgatgcc gaatcagctg tagttgcgtc gttgcgtcac 241 tctgatatca gaacagtctt ggacactttg gagcaacgtg aacaggacgt tattcgcctt 301 cgttatgggc tcgatgatgg agttccccgc actttagatc aaattggtcg tcaatttgga 361 ctgtctcgtg aaagagttcg ccaaattgaa cgcgaggtca tgtcgaaact gcgtgacggt 421 gcccgtgccg aaaaacttcg agcatacgca caataaagcg tttgcttaga tatgcctacc 481 aataaagaca taaacgccta ttaaaagcaa tctttagatt aggcgttata attaaagtct 541 catcgaaaaa cgcgctgcgg gactacaacg caacaagaaa actattccat atttttcacg 601 ctacaattcg ttgtagattg ataggaattg atcaccagca cacaacagtc tccatggcac 661 tataaggaaa gaggcttaca atgaaggact tagtcgatac cacagagatg tacttgcgta 721 ctatctatga gctggaagaa gagggagtca cccctcttcg cgctaggatc gctgagcgtc 781 tggaacaatc tggacctaca gttagccaaa ccgttgcccg tatggagcgc gatggacttg 841 tcgttgtcgc ctcagaccgc agtctacaaa tgacaccgac aggccgcact ttagcgactg 901 cagttatgcg taaacatcgc ttagctgagc gccttcttac cgatatcatt ggtctagata 961 tcaataaagt tcacgatgaa gcctgccgct gggaacacgt tatgagtgac gaagttgaac 1021 gcaggctcgt gaaagtattg aaagatgtca gtcggtcccc cttcggaaac ccaattccag 1081 gtctcgacga actcggcgta ggcaattctg acgcggcagc ccccggaact cgcgttattg 1141 acgctgccac cagcatgccc cgcaaagtac gcattgttca gattaacgaa atctttcaag 1201 ttgaaacgga tcagtttaca cagctcctcg atgctgacat ccgtgttgga tcagaagtcg 1261 aaattgtaga tagagacggc cacatcacgt tgagccacaa tggaaaagat gtcgaactcc 1321 tcgatgatct ggctcacact attcgtatcg aagaactcta aatactaaag gcggcgaaat 1381 tagatgaaac tcctcgttac cggtggcgcc ggatacgtag gaagtgtctg ttccactgtt 1441 ctgctcgaac aggggcacga agtaacaatt gtcgataatc ttacaaccgg caatcgcgat 1501 gctgttccgc taggagccac ttttgtcgag ggtgatatca aagacgttgc ggaaaacgtt 1561 ttatcatctg attcttttga tgcggttctt cacttcgcgg ctcgttcgct tgtaggcgaa 1621 tcagttgaaa agccagatga atattggcag cacaacatgg tgaccacact cgcacttctt 1681 gatgccatga aacgtaataa cgtgcgaaac attgtctttt cttctacagc tgctacatac 1741 ggcgaacccg aaacagtccc gattactgaa gacgctccta cccacccaac gaatccatac 1801 ggcgcaacca agctgtctat cgactacgcg atcacatcat acgcacatgc atatgggttt 1861 gctgcgacga gtctgcggta cttcaacgta gctggcgcat atggtttggt gggtgaaaat 1921 cgagaaatag aaacccatct cattccgctc gttttacaag tagcactggg acaccgtgac 1981 aagattttca tgtttggtga cgactggcca actgaagatg gaacccctat ccgcgattac 2041 attcatattc gtgatcttgc agatgcccat attttggcac ttcaaagtaa cgtcgaaggc 2101 agccaccgga tctttaatct cggatccggt gaaggatatt cggtcaaaca ggtcattgat 2161 acctgcagag aagttaccgg tcatcccatc cccgctgagg tagctcctcg acgggctggt 2221 gatcctgcag tgctcattgc atcatcggct aaggcgcaat ccgaactagg atggaagcca 2281 caacgcactg atctgcacac gattgtttct gatgcttggg cttttacgtc tcaactcgga 2341 gacaaagccc atagcgcttc tcgtggataa caacgagtag gtttatcttt tgcagtccgc 2401 ggtgtgatta cttaagtatt cacaccgcgg attttacatt tgttgactca ccaaaatgtt 2461 tctgagcttc ctgactccca ttgataacaa gatcaacaat ggatccagat agacctcgat 2521 aataagctt // LOCUS ECOANSBA 1643 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli L-asparaginase II (ansB) gene, complete cds. ACCESSION M34234 KEYWORDS L-asparaginase II; ansB gene. SOURCE E.coli (strain K12 isolate JM108) DNA, clones lambda-ansB and pansB7. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1643) AUTHORS Bonthron,D.T. TITLE L-asparaginase II of Escherichia coli K-12: Cloning and sequencing of the ansB gene and its position on the chromosomal map JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by D.T.Bonthron, 11-MAY-1990. FEATURES from to/span description pept 346 1392 L-asparaginase II (ansB) (EC 3.5.1.1) sigp 346 411 L-asparaginase II signal peptide matp 412 1389 L-asparaginase II binding 335 339 ribosomal binding site (put.) site 1400 1425 transcription termination signal (put.) BASE COUNT 442 a 404 c 405 g 392 t ORIGIN 63.8 min on K12 map. 1 gcggtaaccc attacagaat gtgctgggaa gcctgggggg gctgcaatcc tcaatccaaa 61 ccgagtggaa aaagcaggaa aaagatttcc agcagtttgg caaagatgtt tgtagccgcg 121 ttgtgactct ggaagatagc cgcaaagccc tggtcgggaa tttaaaataa tcctctattt 181 taagacggca taatactttt ttatgccgtt taattcttcg ttttgttacc tgcctctaac 241 tttgtagatc tccaaaatat attcacgttg taaattgttt aacgtcaaat ttcccataca 301 gagctaaggg ataatgcgta gcgttcacgt aactggagga atgaaatgga gtttttcaaa 361 aagacggcac ttgccgcact ggttatgggt tttagtggtg cagcattggc attacccaat 421 atcaccattt tagcaaccgg cgggaccatt gccggtggtg gtgactccgc aaccaaatct 481 aactacacag tgggtaaagt tggcgtagaa aatctggtta atgcggtgcc gcaactaaaa 541 gacattgcga acgttaaagg cgagcaggta gtgaatatcg gctcccagga catgaacgat 601 aatgtctggc tgacactggc gaaaaaaatt aacaccgact gcgataagac cgacggcttc 661 gtcattaccc acggtaccga cacgatggaa gaaactgctt acttcctcga cctgacggtg 721 aaatgcgaca aaccggtggt gatggtcggc gcaatgcgtc cgtccacgtc tatgagcgca 781 gacggtccat tcaacctgta taacgcggta gtgaccgcag ctgataaagc ctccgccaac 841 cgtggcgtgc tggtagtgat gaatgacacc gtgcttgatg gccgtgacgt caccaaaacc 901 aacaccaccg acgtagcgac cttcaagtct gttaactacg gtcctctggg ttacattcac 961 aacggtaaga ttgactacca gcgtaccccg gcacgtaagc ataccagcga cacgccattc 1021 gatgtctcta agctgaatga actgccgaaa gtcggcattg tttataacta cgctaacgca 1081 tccgatcttc cggctaaagc actggtagat gcgggctatg atggcatcgt tagcgctggt 1141 gtgggtaacg gcaacctgta taaatctgtg ttcgacacgc tggcgaccgc cgcgaaaacc 1201 ggtactgcag tcgtgcgttc ttcccgcgta ccgacgggcg ctaccactca ggatgccgaa 1261 gtggatgatg cgaaatacgg cttcgtcgcc tctggcacgc tgaacccgca aaaagcgcgc 1321 gttctgctgc aactggctct gacgcaaacc aaagatccgc agcagatcca gcagatcttc 1381 aatcagtact aatcgcctcg ccccggtatc gtgccggggc tttttcactt cagactcacg 1441 tccattgcca attttaatta ccctaatgat aatcaccgga ataaattatt ccgcgtgagg 1501 tttttcgggt gaaaaagcaa tggattgttg gcacggcact gcttatgttg atgactggta 1561 atgcctgggc agatggcgaa ccgccaactg aaaatatctt aaaagatcaa ttcaaaaagc 1621 agtatcacgg cattctcaag ctt // LOCUS HUMADE5RC 1946 bp ds-DNA SYN 14-AUG-1990 DEFINITION Synthetic human/adenovirus type 5 recombination junction. ACCESSION M34061 KEYWORDS . SOURCE Human fetal bone marrow fibroblasts transformed with Adeno5/SV40 recombinant virus (cell line H13.1) DNA, clone pH13.1SV1. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 1946) AUTHORS Romani,M., De Ambrosis,A., Alhadeff,B., Purrello,M., Gluzman,Y. and Siniscalco,M. TITLE Preferential viral integration at the highly recombinogenic chromosomal site 1p36 in human cells transformed with an Adeno5/SV40 hybrid JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Romani 08-MAY-1990. FEATURES from to/span description recomb 429 430 adenovirus type 5 end/Human DNA start rpt 430 1134 tandem Alu repeats BASE COUNT 492 a 548 c 510 g 396 t ORIGIN Chromosome 1p36. 1 tttgacccgg aacgcggaaa acacctgaga aaaacacctg ggcgagtctc cacgtaaacg 61 gtcaaagtcc ccgcggccct agacaaatat tacgcgctat gagtaacaca aaattattca 121 gatttcactt cctcttattc agttttcccg cgaaaatggc caaatcttac tcggttacgc 181 ccaaaattta ctacaacatc cgcctaaaac cgcgcgaaaa ttgtcacttc ctgtgtacac 241 cggcgcacac caaaaacgtc acttttgcca catccgtcgc ttacatgtgt tccgccacac 301 ttgcaacatc acacttccgc cacactacta cgtcacccgc cccgttccca cgccccgcgc 361 cacgtcacaa actccacccc ctcattatca tattggcttc aatccaaaat aaggtatatt 421 attgatgata tataatatag aggctgaggc aggagggttg cttgagcctg ggaggctgcg 481 gctgcagtga gccatgatca caccactgca cttcagcctg ggcaacagag cgagactccg 541 tctcaaaaaa tagtaatagg ccaggcgcgg tggctcatgc cagtaatccc agcgctttgg 601 gaggctgagg caggcagatc acctgaggtc aggagttcaa gaccagcctg gccaacatgg 661 tgaaaccctg tctctactaa aaatacaaaa attagttggg catggtggcg gcgcacacca 721 gtaatcccag ctactcaaga ggctgagggc acttgggcag cagaggaggt tgtagtgagc 781 agagatcacg ccactgcact ctagcctggg caacagaggg agactctgtc tcaaaaaata 841 ataggccggg cacggtggct catgcctgta atcccagcac tttgggaggc cgaggcgggt 901 ggatcacctg agatcaggag tttaagacca gcctggccaa catggtgaaa ccccatctct 961 actaaaaata caaaaaatta gccaggcgtg gtggcgggcg tctgtaatcc cagctacttg 1021 ggaggctgaa gcaggagaat cacttgagcc caggaggcag aggttgcagt gagctgagat 1081 catgccatta cactccagcc tgggcaacaa gagtgaaact ccgtctcaaa aaataataat 1141 aataatgata ataataataa taattgcatc tattcactga gcgcctcctg tacaccaggc 1201 accatgtggc attaggggcc catgctgtca tcatattctg ccagccaccc atggggtggt 1261 ggggtaattc cccaggccac acctgtccac acctctcttc aggtactgcg tgggcatgtg 1321 cggagacggc gccaatgact gtggggccct gaaggcggct gatgtcggca tctcgctgtc 1381 ccaggcagaa gcctcagtgg tctcaccctt cacctcgagc atggccagta ttgagtgcgt 1441 gcccatggtc atcaggtaag gcaggcaggg accggtgggt gagggctcag cagggctggt 1501 cagccaacct gggtccccta atgtccgtgc cctgccaccc agggaggggc gctgttccct 1561 tgacacttcg ttcagcgtct tcaagtacat ggctctgtac agcctgaccc agttcatctc 1621 cgtcctgatc ctctacacgg tgagtatctg cagagctccg tacccagagc cgtcccagga 1681 ctagggaggg gacacagccc tgtgccttgg agctggcagg tcccagatag atgacgtggg 1741 cccagatcct gaccctgcca ctctccggct gtgtggttgg agaggtggca aggcttgggg 1801 gagatgagaa aaacaccagg cacggagcct ggtacccact aggtcctcac aaatggcagc 1861 agctgtcatg ggctaacatc ttgaccgcaa ctctgggagc tccccaatct gcagggaaga 1921 ctgaacggag gcagggccaa aaccct // LOCUS MCLPCCA 1701 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.laminosus phycoerythrocyanin (PCC) alpha and beta subunit genes, complete cds, and linker protein, 5' end. ACCESSION M34254 KEYWORDS light-harvesting biliprotein; phycoerythrocyanin. SOURCE M.laminosus (strain Fischerella PCC 7603) DNA. ORGANISM Mastigocladus laminosus Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria. REFERENCE 1 (bases 1 to 1701) AUTHORS Eberlein,M. and Kufer,W. TITLE Genes encoding both subunits of phycoerythrocyanin, a light- harvesting, biliprotein from the cyanobacterium Mastigocladus laminosus JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.Kufer, 11-MAY-1990. Botanisches Institut Ludwig-Maximilians- Universitaet Muenchen Menzingerstr. 67 D-8000 Muenchen 19 Federal Republic of Germany FEATURES from to/span description pept 279 797 phycoerythrocyanin beta-subunit pept 884 1372 phycoerythrocyanin alpha-subunit pept 1528 > 1701 phycoerythrocyanin linker protein precursor matp 1531 > 1701 phycoerythrocyanin linker protein BASE COUNT 526 a 312 c 383 g 480 t ORIGIN 1 agctttacat ttaaggatat ggcgctcaaa gaatgggagt tttacagtta ttaaataaag 61 ttcataaaaa agaatacaag taatatataa aatacgaaac taaggtttga aagtaattgc 121 aaatatcaag gcaaacgaaa agataaaggc taattgttgc acgtaaaaat ggtatttttt 181 tgatttttag gtgtagttat tggcggattt tatctggtct cggtttgtct tgattaaaaa 241 tgcaatagct gccatgaaat cgaggagaaa atagaaaaat gcttgatgct ttttccagag 301 tagttgaaca agctgataaa aaaggtgctt acctaagcaa tgatgaaata aatgctttgc 361 aggctatagt tgcagatagc aacaaacgtt tggatgtagt gaatcggctt acaagcaacg 421 cctcctcaat tgtggcgaat gcttatcgtg ctttagtagc agaacgtccg caggtattta 481 atcctggtgg tccttgtttt caccatcgca accaagctgc ttgcatccgc gatttaggat 541 ttattctgcg ttatgttacg tactccgtac ttgctggtga cactagtgtc atggacgatc 601 gctgcttgaa tggtttgcgc gaaacctatc aagctcttgg tactcccggt gatgctgtag 661 cttctggaat taaaaaaatg aaagaagctg cacttaaaat tgctaacgat cccaatggca 721 ttaccaaagg tgattgcagt caattaatgt ctgaattagc aagttatttt gaccgagcag 781 cggctgctgt tgcttaatgc agtcaaaggc gcgactaagt gacaaaattt tcaaactctt 841 ttttagtaat taatttgaca agtttttgga gaagaagcga atcatgaaga ccccgttaac 901 cgaagcgatc gccgccgccg atcttcgcgg ttcttatttg agcaacacag aattgcaagc 961 agtatttggt cgtttcaatc gcgctcgtgc tggtttagaa gcagcaagag cattcgctaa 1021 caacggtaag aaatgggcgg aagcagcagc caatcacgta tatcaaaaat ttccctacac 1081 tacccaaatg caaggacctc agtatgcttc tactccagaa gggaaagcaa agtgcgtgcg 1141 ggatatcgac cattacttac gcacaattag ctactgctgt gtagttggcg gtactggtcc 1201 cttggatgac tatgtagttg caggcttaaa ggaatttaac agcgccctgg gtttatctcc 1261 cagctggtac atagctgctc tagaatttgt ccgcgacaat cacggtttga ccggtgatgt 1321 tgctggtgaa gccaacacct acatcaacta tgcaattaat gcattgagct aaatctaaac 1381 agcgaatgtc ctttgtattt gcttaatagc taatggctga tagctgatgg ctaatggcta 1441 atggaaaaat gcaattagca attagcagtg agttattggc cattaactgt gtttgagaaa 1501 aatgataaaa atggagcgat tgttaacatg agtacatcag tcgcagaacg cctcgcaatt 1561 aaagatgaag tagacaaaaa aattgaattg cgccctaatt ggagcgaaga cgagctgcaa 1621 attgttttta aaacagccta cgagcaagtt tttggacgtc agggattata tgccagtcaa 1681 agatttgcaa ctgccgaagc t // LOCUS YSCTSDNAA 238 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-ii. ACCESSION M34310 KEYWORDS . SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-ii. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 238) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. 10, 4415-4419 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 146 g 92 t ORIGIN 1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg 61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg 121 gtgtggtgtg tgggtgtggt gtgtgggtgt ggtgtgggtg tgggtgtggt gtgtgtgggt 181 gtggtgtgtg ggtgtgggtg tggtgtggtg tgtgtgggtg tgtgggtgtg gtgtgtgt // LOCUS YSCTSDNAB 297 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-2-i. ACCESSION M34311 KEYWORDS . SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-2-i. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 297) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 184 g 113 t ORIGIN 1 tgtgtgtggg tgtggtgtgg gtgtggtgtg tgggtgtggg tgtggtgtgg tgtgtgtgtg 61 ggtgtggtgt gggtgtggtg tgtgggtgtg gtgtgtgggt gtgtgggtgt gtgggtgtgg 121 gtgtggtgtg ggtgtgggtg tgggtgtggt gtgtgtgtgg gtgtggtgtg tgggtgtggg 181 tgtgtgtggg tgtgggtgtg ggtgtggtgt ggtgtgtggg tgtggtgtgt gggtgtggtg 241 tgtgtgggtg tggtgtgtgt gggtgtgggt gtgtgtgggt gtgggtgtgg tgtgtgg // LOCUS YSCTSDNAC 325 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-ii. ACCESSION M34312 KEYWORDS . SOURCE S.cerevisiae extrachromosomal plasmid DNA, clone YLP108CA-4-ii. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 325) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 199 g 126 t ORIGIN 1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg 61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg 121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg gtgtgtgggt gtggtgtggg tgtggtgtgt 181 gggtgtgtgg gtgtgggtgt gtgtggtgtg tggtgtgtgt ggtgtgggtg tggtgtgtgg 241 gtgtgggtgt ggtgtgtggg tgtgggtgtg gtgtgtgtgt gtgggtgtgg tgtgtgggtg 301 tgggtgtggt gtgtgggtgt gtggg // LOCUS YSCTSDNAD 248 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae telomeric sequence DNA, clone YLP108CA-4-i. ACCESSION M34313 KEYWORDS . SOURCE S.cerevisiae (strain XS595-6C) extrachromosomal plasmid DNA, clone YLP108CA-4-i. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 248) AUTHORS Wang,S.-S. and Zakian,V.A. TITLE Sequencing of Saccharomyces telomeres cloned using T4 DNA polymerase reveals two domains JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.-S.Wang, 16-MAY-1990. BASE COUNT 0 a 0 c 151 g 97 t ORIGIN 1 tgtgggtgtg tgtgtgggtg tggtgtgtgt gggtgtggtg tgtgggtgtg gtgtggtgtg 61 ggtgtgggtg tggtgtgggt gtggtgtgtg tgggtgtggt gtgtgtgggt gtggtgtgtg 121 gtgtgtgtgg gtgtgtgggt gtggtgtgtg tgtgtgtggg tgtggtgtgg gtgtggtgtg 181 tgggtgtgtg tgggtgtggt gtgtgtgggt gtgtgggtgt ggtgtgtgtg ggtgtggtgt 241 gtgggtgg // LOCUS PLYORIA 375 bp ds-DNA VRL 14-AUG-1990 DEFINITION Human polyomavirus BK (strain WWT) origin of replication. ACCESSION M34048 KEYWORDS origin of replication. SOURCE Polyomavirus BK (strain WWT) DNA, from human urine. ORGANISM Polyomavirus BK Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 375) AUTHORS Sundsfjord,A., Johansen,T., Flaegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two type of control regions can be found among naturally occurring BK virus strains JOURNAL J. Virol. 64, 3864-3871 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 375 regulatory region site 143 210 P block site 211 249 Q block site 250 302 R block BASE COUNT 123 a 86 c 95 g 71 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctgaaattct caaataaaca caagaggaag tggaaactgg ccaaaggagt 301 ggaaagcagc cagacagaca tgttttgcgg gcctaggaat cttggccttg tccccagtta 361 aactggacaa aggcc // LOCUS PLYORIB 424 bp ds-DNA VRL 14-AUG-1990 DEFINITION Human polyomavirus BK (strain UT) origin of replication. ACCESSION M34049 KEYWORDS origin of replication. SOURCE Human polyomavirus BK (strain UT) DNA, from human urine. ORGANISM Polyomavirus BK Viridae; ds-DNA nonenveloped viruses; Papovaviridae; Polyomaviruses. REFERENCE 1 (bases 1 to 424) AUTHORS Sundsfjord,A., Johansen,T., Flaegstad,T., Moens,U., Villand,P., Subramani,S. and Traavik,T. TITLE At least two type of control regions can be found among naturally occurring BK virus strains JOURNAL J. Virol. 64, 3864-3871 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.Johansen, 07-MAY-1990. University of Tromso P.O. Box 977 N-9001, Tromso, NORWAY e-mail: terje.johansen@esuit.uit.no FEATURES from to/span description orgrpl 97 119 polyomavirus origin of replication site 1 424 regulatory region site 143 210 P block site 211 249 Q block site 250 261 R block site 263 315 P block site 316 349 Q block site 350 361 R block BASE COUNT 134 a 103 c 105 g 82 t ORIGIN 1 ttttgcaaaa attgcaaaag aatagggatt tccccaaata gttttgctag gcctcagaaa 61 aagcctccac acccttacta cttgagagaa agggtggagg cagaggcggc ctcggcctct 121 tatatattat aaaaaaaaag gccacaggga ggagctgctt acccatggaa tgcagccaaa 181 ccatgacctc aggaaggaaa gtgcatgact gggcagccag ccagtggcag ttaatagtga 241 aaccccgccc ctaaaattct ctcttaccca tggaatgcag ccaaaccatg acctcaggaa 301 ggaaagtgca tgactgggca gccagccagt ggcagttaat agtgaaaccg gaaagcagcc 361 agacagacat gttttgcgag cctaggaatc ttggccttgt ccccagttaa actggacaaa 421 ggcc // LOCUS MXAFRZE 2493 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.xanthus gliding motility regulatory protein (frzE) gene, complete cds. ACCESSION M35192 KEYWORDS gliding motility regulatory protein. SOURCE M.xanthus (strain DZF1) DNA. ORGANISM Myxococcus xanthus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Myxobacteria; Myxococcaceae. REFERENCE 1 (bases 1 to 2493) AUTHORS McCleary,W.R. and Zusman,D.R. TITLE FrzE of Myxococcus xanthus is homologous to both CheA and CheY of Salmonella typhimurium JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5898-5902 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.R.McCleary, 05-JUN-1990. FEATURES from to/span description pept 64 2397 gliding motility regulatory protein (frzE) BASE COUNT 375 a 848 c 926 g 344 t ORIGIN 1 gagctcatca agcggttcaa ggccgactag cgcttcctgg aaggggaggg gcgtacgccg 61 gccatggaca ccgaggctct caagaaatcc ctcctgaaga agttccagga ggtcaccgcc 121 gaccgcctcc agaagattca gctgggggta ttggacctgg agaaggagac cgcggaccag 181 gccgcggagg acgtcgcgcg cgagctgcac acgatgaagg gcgaggcccg catgttgggt 241 ctggccgcca tcgggcagct ggcgcacgcc gccgaggacg tcctgcgcgc agagcgcgag 301 ggcaagacgg ccaccgaggt ggccaccgac gtcctgctca gggcgtgcga cgtcctctcc 361 gatctcaacg aagacctgtc cggcgccaac acgggcaacc cggccagcga ggagatggtc 421 cgcatgctcg cggaagtctc cggacagacg ccgcccgcca tcgctggcgc acggcccgtg 481 gcgccgcctc ctgcaccgcc gccggcccct gtcgccgcgc ccgtggtgac accggcagcc 541 gtcgccgcgc cgcccgcgcc ggtgcaggct ccggtggctc cgcccccgac gcaggccccc 601 gtggccgagc cgggggcgca cgccgccgca gccgcgcctc acccggctgc cgcgcatggc 661 cgtgacgagg aggccccgag cgccgcgaag tccgcggtgg ccgaccggag catccgcgtc 721 aacgtggagg tgctcgacgc gttggggttg ctcgcgggcg acctgctggt ggagagcgcc 781 cgcggccggc tgcgcagctc ggagacggag gcgttgttcg agcgcttcag ccgcctgggg 841 gaccgcttcc tccggctggc ggaagagatc gacatctcga acgaggtgcg tgagcagttg 901 gaccgcgtgg agagcgacct ccacatgctg cgcgacgacg cgttccgctt cgtgcgccgc 961 aacgacgacg gcatcaacac gctgcacggc aacctggcga agatggcgga ccacgtggcc 1021 gaggcccggc tggtgccgct gtccaccgtg ttcgacgcct tcccgcgcgc cgttcgcgag 1081 atgtcgcgca cgcagggcaa ggaagtggac ctggtcatcg agaacgccga catcggcgtg 1141 gaccggtcca tgctgggcga cgtgcgagac gcgctggtgc acctgctgcg caactcggtg 1201 gaccacggcg tggagtcccc ggacacgcgc cagcagttgg gcaagccgct caacggccgc 1261 atccgcatcc gcgtgcgcgt ggacggcgac atgctccaca tcgaggtgga ggacgacggc 1321 cgcggcatcg acccggagcg gctgcgtcag gcggccatct ccaagcgcct catcaacgcg 1381 gtgcaggccg ccgcgctgtc ggagcgcgag gccatcgagc tcatcttccg ccccggcttc 1441 tccacccgcg accaggtcag cgagctgtct ggccgtggcg tgggcatgga cgtggtgaag 1501 cgcaaggtgg agacgctggg cggctcggtg ggcgtgagca gccgaatcgg ccgtggctcc 1561 accatcaccc tgcgcctgcc gcagtcgctg gcgttgatga aggtgctgct ggtgcgcctg 1621 ggggacgacg tctacggcat gcccgccgcg gacgtggaag ccgtcatgcg cgtcaagccg 1681 gatgaccggc tggagatctt cggcacgctg gccgtgcggc accgtggcaa gcccacggcg 1741 ctggtggcgc tggggccgct gttgggcctc aacggcggca atcgcttcga caagccaccc 1801 gcggtggtgg tgcgtcacgg cgaggaccac gcggcgctgg tggtggacgg cttcgtggac 1861 gagcgcgaag tggccgtgaa gccttgcggc ggcgagttcc tcaaggccgc gcccttcatc 1921 gccggcaccg cggcgctgga ggacgggcgc atcgccgtgc tgctccatgt cccggacatc 1981 atggcggagg tgcgacggat ggcccgcccc gtcacccagg cccccgccgc caagcgcctc 2041 cgggtgctgc tggtggacga ctcgcccatt gcccgcgcta cggaaggggc gctggtgaag 2101 gcgctggggc actcggtgga ggaagcccag gacggcgaag aggcctacgt gaaggtgcag 2161 aacaacacct acgacctcat cctcacggac gtgcagatgc ccaagctgga cgggttctcg 2221 ctggcgcggc ggctcaagtc gacgcccgcg gtggctcgca ttccggtcat catcctgtcg 2281 tcgctcgcct cgcccgagga caagcggcgc gggttggatg ccggcgcgga cgcgtacctc 2341 gtcaagggcg agctgggcgt ggaggttctc gcgcaggcca tcgaccggct gacctgagga 2401 gccaggcttg ggcggtggcg cggtagtcgc aggaatggcg tttcgggtgc tcatggtggg 2461 caaggggctg cgtgcgctcg cggcccgggg cct // LOCUS RATPCBB 463 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat PCB-binding protein mRNA, complete cds.. ACCESSION J05536 KEYWORDS PCB binding protein; binding protein. SOURCE Rat (strain Sprague Dawley) lung Clara cell, cDNA to mRNA, clone pPCB6. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 463) AUTHORS Nordlund-Moeller,L., Andersson,O., Ahlgren,R., Schilling,J., Gillner,M., Gustafsson,J.-A. and Lund,J. TITLE Cloning, structure, and expression of a rat binding protein for polychlorinated biphenyls: Homology to the hormonally regulated progesterone-binding protein uteroglobin JOURNAL J. Biol. Chem. 265, 12690-12693 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.Lund, 15-JUN-1990. FEATURES from to/span description pept 58 348 PCB binding protein precursor sigp 58 114 PCB binding protein signal peptide matp 115 345 PCB binding protein BASE COUNT 117 a 156 c 84 g 106 t ORIGIN 1 cctcacatta caacatcagc ccacatctac agacagccca agcctccggc ctctaccatg 61 aagatcgcca tcacaatcac tgtgctcatg ctgtccatct gctgcagctc agcctcttcg 121 gacatctgcc caggatttct tcaagtcctt gaggccctcc tcctaggctc agagtctaat 181 tatgaggcag ccctgaagcc tttcaaccct gcctcagacc tgcaaaatgc aggaacccag 241 ctgaagaggc tggtggatac cctcccacag gagaccagaa taaacatcgt gaagctcacg 301 gagaagatcc taacaagtcc tctgtgtgag caagatttaa gagtctgaag ctcactggat 361 ttcagagata ttctacgcta aagccccctg tcattcccct ctccctcctt gacctcccta 421 caccaacccc gccttcctct ttgcaataaa cttcgagcat ctc // LOCUS YSBCPR 2990 bp ds-DNA PLN 14-AUG-1990 DEFINITION C.tropicalis NADPH-cytochrome P450 reductase gene, complete cds. ACCESSION M35199 KEYWORDS NADPH-cytochrome P450 reductase. SOURCE C.tropicalis (ATCC 750) DNA, clone pTS1. ORGANISM Candida tropicalis Eukaryota; Plantae; Thallobionta; Basidiomycotina; Deuteromycotina. REFERENCE 1 (bases 1 to 2990) AUTHORS Sutter,T.R., Sanglard,D. and Loper,J.C. TITLE Isolation and characterization of the alkane-inducible NADPH- cytochrome P450 oxidoreductase gene from Candida tropicalis: Identification of invariant residues within similar amino acid sequences of divergent flavoproteins JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by T.R.Sutter, 15-JUN-1990. Author name and address T.R.Sutter CIIT, P.O. Box 12137 Research Triangle Park, NC 27709 FEATURES from to/span description pept 554 2596 NADPH-cytochrome P450 reductase signal 198 204 TATA box BASE COUNT 988 a 442 c 558 g 1002 t ORIGIN 1 tgtcatgtct attcaatgct tcgatggggg gggttgcgta gagtaaaacc ccaaaaataa 61 tggtggtggt agcggtggtc ggtgtagcat tctgcttttg tctcactcta gctcttagtg 121 aataaatgat ccgatatcga aattgcaccg ataaatcctc ctgagataaa actctaatag 181 acgaaaagaa aggacagtta tatatacaat aataattata gagatgaaat ttggtctaag 241 ttggtacaca ttacctgact gtcagaaatg atacatattt caaagtcagt atggtcccag 301 tatcgtaata cataaacgac gttgggtccc gtaattgaac atcagaagag gaacaaaaaa 361 aaaaaagaat aaaaaatttc atacgacaat cttcaaacga actgaacgag agaaaaaata 421 aaaaaaaaga ttttaaacat ttcaaatttc caatttttct ttgttttttt tttatttcta 481 ccttttattc tttttctttt cttgatccat tactgctcta tcaactaaga ctattgtttc 541 aatcgctcaa gtcatggcat tagataagtt agatttatat gttattataa cattggtggt 601 tgcaattgca gcttattttg caaagaatca gtttcttgac caacaacaag ataccgggtt 661 ccttaatact gatagtggag atggtaattc aagagatatc ttacaagctt tgaagaagaa 721 caataaaaat acgttattat tatttggatc ccaaacaggt acagcagaag attatgccaa 781 caaattgtca agagaattgc attcaagatt tggtttgaaa accatggttg ctgatttcgc 841 tgattatgat ttcgaaaact tcggagatat tactgaagat atcttggttt tctttattgt 901 tgctacttat ggtgaaggtg aaccaaccga taatgctgac gaatttcaca cttggttgac 961 tgaagaagct gacaccttga gtactttgaa atatactgtt tttggtttgg gtaattcaac 1021 ttatgaattc ttcaatgcta ttggtagaaa atttgacaga ttgttgggag aaaaaggtgg 1081 tgacagattt gctgaatacg gtgaaggtga cgatggtact ggtactttag atgaagattt 1141 cttggcctgg aaggataacg tgtttgattc cttaaagaat gatttgaatt ttgaagaaaa 1201 agagttgaaa tacgaaccaa atgttaaatt gactgaaaga gatgatttat ctggcaatga 1261 tccagatgtc tccttgggtg aaccaaatgt caaatacatt aaatctgaag gtgttgactt 1321 aactaaaggt ccatttgatc atactcatcc atttttggct agaattgtta aaactaaaga 1381 attgtttact tctgaagaca gacattgtgt tcatgttgaa tttgatattt ctgaatcaaa 1441 cttgaaatat accaccggtg atcatcttgc aatctggcca tctaactctg atgaaaacat 1501 taagcaattt gccaaatgtt ttggtttaga agacaaactt gatactgtta ttgaattgaa 1561 agctttggat tccacttatt ccatcccatt ccctaatcca atcacttatg gagctgttat 1621 tagacaccat ttggaaattt caggtcctgt ttctagacaa tttttcttat ctattgctgg 1681 atttgcccct gatgaagaaa ctaaaaagtc atttactaga attggtggtg ataagcaaga 1741 atttgctagt aaagtcaccc gtagaaaatt caacattgcc gatgctttat tatttgcttc 1801 caacaacaga ccatggtccg atgttccatt cgaattcctt attgaaaatg tccaacactt 1861 aactcctcgt tattactcca tttcttcttc ctcattaagt gaaaagcaaa ccattaatgt 1921 tactgctgtt gttgaagccg aagaagaagc tgatggaaga ccagttactg gtgttgtcac 1981 caacttgttg aagaatattg aaattgaaca aaacaaaact ggtgaaaccc caatggttca 2041 ttatgatttg aatggtccaa gaggcaaatt tagcaagttc agattgccag ttcacgttag 2101 aagatctaat ttcaaattac caaagaatag cactacccca gttattttga ttggtccagg 2161 taccggtgtt gcaccattga gaggttttgt tagagaaaga gttcaacaag ttaaaaatgg 2221 tgttaatgtt ggtaagactg tattgtttta tggatgtaga aattccgaac aagatttctt 2281 gtacaaacaa gaatggagtg aatatgcctc agtattggga gaaaatttcg aaatgtttaa 2341 tgccttctca agacaagatc caactaagaa agtttatgtt caagataaga ttttagaaaa 2401 tagtgctctt gttgatgagt tattatctag tggagcaatt atttatgttt gtggtgatgc 2461 cagtagaatg gctagagatg ttcaagctgc aattgccaag attgttgcca aaagtagaga 2521 tatccacgaa gataaagctg ctgaattggt taaatcttgg aaagttcaaa atagatacca 2581 agaagatgtc tggtaaggaa aatatgaaca tttaattaaa gttaagatta gttttggtta 2641 atattttttt tttttttagt gattgggata tgttttattt gacgtatgta tatttcatga 2701 aatatatagt tgacacatat gtatatatgt actgaataat aataaaaaaa atgacaatat 2761 gaaagattaa tgatttctac tctttggctt gcactttgtc gtcattgttg ttgttggttc 2821 tgcctggata tttagtacaa tatctccaat ttggattgga actattgtat atggcttgta 2881 aatcattttc aattttcaac caatatttat acttcatttt ggacttcaag tccaaatcgt 2941 aaatattaca agtaagttaa aatttttcac cgtcaacgat ataaacatta // LOCUS HLTLYS 642 bp ss-mRNA INV 14-AUG-1990 DEFINITION H.rufescens sperm lysin mRNA, complete cds. ACCESSION M34388 KEYWORDS sperm lysin. SOURCE H.rufescens testis, cDNA to mRNA, clones r13 and r14. ORGANISM Haliotis rufescens Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae. REFERENCE 1 (bases 1 to 642) AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D. TITLE Species-specific sequences of abalone sperm lysin, the sperm protein that creates a hole in the egg envelope JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5792-5796 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.D.Vacquier, 22-MAY-1990. FEATURES from to/span description pept 16 480 sperm lysin precursor sigp 16 69 sperm lysin signal peptide matp 70 477 sperm lysin mRNA < 1 642 sperm lysin mRNA signal 625 630 poly-A signal BASE COUNT 202 a 124 c 151 g 165 t ORIGIN 1 cgaacagatt acaagatgaa gctgttggtg ctttgcattt ttgcgatgat ggcgacattg 61 gcaatgtcac gcagctggca ttatgttgaa cccaagtttt taaacaaggc ttttgaagtt 121 gcactgaagg tacagatcat tgctggattc gatagagggc tggtcaaatg gcttagagtt 181 catggcagaa cattgtctac cgttcagaag aaggcgctgt acttcgtcaa tagacgttac 241 atgcagaccc actgggcgaa ttacatgctc tggataaata agaagatcga tgcactgggc 301 agaactccag ttgttggtga ctacacacgg ttaggtgccg aaattggaag acgtattgat 361 atggcttact tttacgactt cctgaaagat aagaacatga tcccaaaata tctgccatac 421 atggaagaaa ttaatagaat gcgcccggca gatgtcccag ttaaatacat gggcaagtga 481 aacccatgaa gaagacagct tcatgcaccg tgcatgtcaa cagatacaga aaacatcgtc 541 aaacacgggg agctatgaca ccatcctaac atgggttggc tatttgttgc acgattttta 601 aaacggaaag attgtacatt ccagaataaa tagatttact ac // LOCUS HLTLYS2 650 bp ss-mRNA INV 14-AUG-1990 DEFINITION H.corrugata sperm lysin mRNA, complete cds. ACCESSION M34389 KEYWORDS sperm lysin. SOURCE H.corrugata testis, cDNA to mRNA, clones p1 and p3. ORGANISM Haliotis corrugata Eukaryota; Animalia; Eumetazoa; Mollusca; Gastropoda; Prosobrnachia; Archeogastropoda; Pleurotomariacea; Haliotidae. REFERENCE 1 (bases 1 to 650) AUTHORS Vacquier,V.D., Carner,K.R. and Stout,C.D. TITLE Species-specific sequences of abalone sperm lysin, the sperm protein that creates a hole in the egg envelope JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5792-5796 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by V.D.Vacquier, 22-MAY-1990. FEATURES from to/span description pept 16 483 sperm lysin precursor sigp 16 69 sperm lysin signal peptide matp 70 480 sperm lysin mRNA 1 650 sperm lysin mRNA signal 635 640 poly-A signal BASE COUNT 194 a 134 c 162 g 160 t ORIGIN 1 ggaacagatt acaagatgaa gctgttggtg ctttgccttt ttgcgatgat ggcgacattg 61 gctgtgtcgc gccaccgctt ccgttttatt ccacataagt atatacgcaa ggagtttgaa 121 gttgcactga aggtagagat cattgccgga ttcgatagaa cgctggtcaa atggcttaga 181 gttcatggcg gacgtttgag tacagttcag aagaaggcgc tgtacttcgt caatagacgt 241 tacatgcaga cccactggca gaattacatg ctctggatag tgaggaagac cgatgcactg 301 ggcagacctc cagttgttgc tgactacagt cggttaggtg ccgaaattgg aagacgtatt 361 gatatggcat atttttacaa cttcctgaac gggaggaaca tgatcccaaa atatctgcca 421 tacatggagg aaattaatag aatgcgcccg gcagatgtcc cagttgcaaa cagaggcaag 481 tgaaaccaac gaagaagaaa gctgcatgca ccctgcatgt caacagatac agaaaacatc 541 gtccgtcaaa cacgaggagc tgtgacagca tccatcccaa catgggttgg ctatttgttg 601 cacgatttta acacagaaag attgtacatt ccagaataaa tagatttact // LOCUS HUMROSSA 1843 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human 52 kD ribonucleoprotein Ro/SSA, complete cds. ACCESSION M34551 KEYWORDS ribonucleoprotein. SOURCE Human thymocyte, cDNA to mRNA, clone FI18.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1843) AUTHORS Itoh,K., Itoh,Y. and Frank,M.B. TITLE Heterogeneity in human Ro/SSA ribonucleoproteins: The 52 and 60 kd Ro/SSA proteins are encoded by separate genes JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable [or printed] sequence for [1] kindly submitted by M.B.Frank, 18-MAY-1990. Author address: M.B.Frank Oklahoma Medical Res Foundation Arthritis and Immunology program 825 WE 13th St, Oklahoma City, OK 73104 FEATURES from to/span description pept 39 1466 52 kD Ro/SSA ribonucleoprotein signal 1827 1832 poly-A signal BASE COUNT 474 a 458 c 488 g 423 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcgggc acactgctgt ttaacggcac acttgacaat ggcttcagca gcacgcttga 61 caatgatgtg ggaggaggtc acatgcccta tctgcctgga ccccttcgtg gagcctgtga 121 gcatcgagtg tggccacagc ttctgccagg aatgcatctc tcaggttggg aaaggtgggg 181 gcagcgtctg tcctgtgtgc cggcagcgct ttctgctcaa gaatctccgg cccaatcgac 241 agctagccaa catggtgaac aaccttaaag aaatcagcca ggaggccaga gagggcacac 301 agggggaacg gtgtgcagtg catggagaga gacttcacct gttctgtgag aaagatggga 361 aggccctttg ctgggtatgt gcccagtctc ggaaacaccg tgaccacgcc atggtccctc 421 ttgaggaggc tgcacaggag taccaggaga agctccaggt ggcattaggg gaactgagaa 481 gaaagcagga gttggctgag aagttggaag tggaaattgc aataaagaga gcagactgga 541 agaaaacagt ggaaacacag aaatctagga ttcacgcaga gtttgtgcag caaaaaaact 601 tcctggttga agaagaacag aggcagctgc aggagctgga gaaggatgag agggagcagc 661 tgagaatcct gggggagaaa gaggccaagc tggcccagca gagccaggcc ctacaggagc 721 tcatctcaga gctagatcga aggtgccaca gctcagcact ggaactgctg caggaggtga 781 taattgtcct ggaaaggagt gagtcctgga acctgaagga cctggatatt acctctccag 841 aactcaggag tgtgtgccat gtgccagggc tgaagaagat gctgaggaca tgtgcagtcc 901 acatcactct ggatccagac acagccaatc cgtggctgat actttcagaa gatcggagac 961 aagtgaggct tggagacacc cagcagagca tacctggaaa tgaagagaga tttgatagtt 1021 atcctatggt cctgggtgcc cagcactttc actctggaaa acattactgg gaggtagatg 1081 tgacaggaaa ggaggcctgg gacctgggtg tctgcagaga ctctgtgcgc aggaaggggc 1141 actttttgct tagttccaag agtggcttct ggacaatttg gttgtggaac aaacaaaaat 1201 atgaggctgg cacctacccc cagactcccc tccaccttca ggtgcctcca tgccaagttg 1261 ggattttcct ggactatgag gctggcatgg tctccttcta caacatcact gaccatggct 1321 ccctcatcta ctccttctct gaatgtgcct ttacaggacc tctgcggccc ttcttcagtc 1381 ctggtttcaa tgatggagga aaaaacacag cccctctaac cctctgtcca ctgaatattg 1441 gatcacaagg atccactgac tattgatggc tttctctgga cactgccact ctccccattg 1501 gcaccgcttc tcagccacaa accctgcctc ttttccccat gaactctgaa ccacctttgt 1561 ctctgcagag gcatccggat cccagcaagc gagctttagc agggaagtca cttcaccatc 1621 aacattcctg ccccagatgg ctttgtgatt ccctccagtg aagcagcctc cttatatttg 1681 gcccaaactc atcttgatca accaaaaaca tgtttctgcc ttctttatgg gacttaagtt 1741 ttttttttct cctctccatc tctaggatgt cgtctttggt gagatctcta ttatatcttg 1801 tatggtttgc aaaagggctt cctaaaaata aaaacccgaa ttc // LOCUS TRBBS12 4004 bp ds-DNA INV 14-AUG-1990 DEFINITION T.brucei expression site-associated protein (BS1.1 and BS1.2) genes, 5' end and complete cds. ACCESSION M34520 KEYWORDS expression site-associated protein. SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA, clone BS1.1-2. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 4004) AUTHORS Hobbs,M.R. and Boothroyd,J.C. TITLE An expression-site-associated gene family of trypanosomes is expressed in vivo and shows homology to a VSG gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Hobbs, 22-MAY-1990. FEATURES from to/span description pept < 1 129 expression site-associated protein (BS1.1) pept 1318 2151 expression site-associated protein (BS1.2) site 3083 3604 RIME sequence BASE COUNT 1181 a 650 c 1052 g 1121 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattcaact atactgcaga accggtccgg ggacatttca cgggagcggg gtccaacgcg 61 gcagcagtac atttgagtgt ttctaccgct gcactttgtt tttcagtttt attgttggga 121 gtgctgtgaa ggaaggatgc gacagaacgt gcgctgctta gtgtgaaaga ttatggtaat 181 ggagggttgt gaaagattgg gggaacaaaa acctatttct tttatttggg ggaacaaatg 241 ggcaaaagta acgtaagttt ccagtgggag tggtatgtgt gtgtgtatgg ggctggctaa 301 ggaaagatgt gagttcggca tgtggtatgt acaagctacg aaaacgtgtg aaacaaaacg 361 agatgtaagg ggaaaatgta acaaccaact atgttaaatt tcaggagact atttttcaaa 421 tttagttaca acaaagtaaa tgtcaaataa tgccaactat aaaggaaaac tagactaatt 481 ctactgttta atgtgacgag aattgtggcg tgaaacctag taactgctta tcatggcgtc 541 gaatttcact tctccagaat ctttatttta ggtgcgctat agatgatagt ccaaagagaa 601 ggaattacaa ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga 661 ttcttttcta tgtcacttgt gaaggtcgat cttggaacaa tatcgaacgt gtgaagcctg 721 aaatggggcg tttttttgct tttttttttg gtggttgtta gagttatcta ccgctatggt 781 atgatgcata gaggtaacgc taaaaataag tgacaaggtg ctttagttga gcgtatggta 841 ggtggaagcg gctaatgatt gctagcagca ccaacagagg aagttaattg ttgacgtagc 901 cgtttgtgaa tgagatagaa ggttattggg tggaacatta tcttggaaag agtttttctt 961 ccactaaaaa tgtgtagagt ggtggtttcg agaaaatggt ttagatattg aggaagaaac 1021 aaggatttga ggtgcaagtt gatgcgctgt atgatttgtc ttgccgttgc tgttacttat 1081 tgttgagttg gtatttgagg aaaatgtttt ttcagtgtgc gttgttatat acggctgtga 1141 gttggtgatg tttatttgaa tgatagcttg ataggataag atgaaataat actaagcatg 1201 tattcttctg gctgttcgcc gaaatggtaa tctatgttac ccatagcacg aaaaaagttt 1261 ttttactcag taactgaagc ggaaagtttt aaaaagcata atttaaaaaa gttgataatg 1321 atgaagtttt ggtttgtgct gttggctctt ttgggaaaag aaacacatgc atattatgaa 1381 aataaaagga atgcattaaa tgcaaccgcc gctaataaag tgtgtgggct atcgaactat 1441 cttaaaggaa tagcgcacag agtaaacagc gagagtgccg tggttacgaa aaaactatca 1501 gatttgaaaa tgagaagcat ccagttgcag ttatcagtaa tgcggaacag agtgccttct 1561 ggcgagcagg attgtaaaga catcaggaca ctcttgaaaa cagtattgag gaatgagttt 1621 acattccagc aggagttgga ggaaatgagg aagccatccg cgttagcagc agctgcagct 1681 gggatagcag ctggaagact ggaagagtgg atttttgtat ttgcacaggc agcaggcagg 1741 tcatcacagt tttgcataag tgtggggaag catatcccgg ctgagcacgg caacttgcaa 1801 gaatgttttg atggaataat tggacctgaa accctttaca aaattgagga ttcacgtgta 1861 aaagagtcgg cgcagaaaag cttgcaactc catgaagtgt tatcatccat ttccttcaat 1921 agcttgggtg cggaaaatat ccgaggagga aatgggaggc atggatgtaa tttggtgaga 1981 acagatactg acggtgtatt ggaagggggg tcggtgagac ggcacaactt aacgtggggt 2041 ggtggagtga tgaattttgg atcctatcaa aacggaagta tgtacgttga gggtggtgaa 2101 tacgtgatgc tacagagtac ggtgcagtac ggtggaccga ggatccagta aagtgagcat 2161 atttaaggat gtcatccgcc tgtttgcgcg cttccaagaa gcaaaaaatg aagtgatgaa 2221 taaaataaaa actactgtgg atgaattggc gaaatgcata gggcagaaag aggttgagct 2281 cactgatgat cagctttacg aggaatttat ctgggagacc atacacagat tggagctgtc 2341 aaagagagtg agtgaacaac tctctttagg tgaagaagag gagacgatcc tgaaatctaa 2401 ctatactgca gaaccggtcc gtggactttc acggtagcgg ggtccaacgc ggcagcagta 2461 catttgagtg tttctaccgc tgcactttgt ttttcagttt tattgttggg agtgctgtga 2521 aggaaggatg cgacagaagc tgcgctgctt agtgtgaaag attatggtaa tggagggttg 2581 tgaaagattg ggggaacaaa aacctatttc ttttatttgg gggaacaaat gggcaaaagt 2641 aacgtaagtt tccagtggga gtggtatgtg tgtgtgtatg gggctggcta aggaaagatg 2701 tgagttcggc atgtggtatg tacaagctac gaaaacgtgt gaaacaaaac gagatgtaag 2761 gggaaaatgt aacaaccaac tatgttaaat ttcaggagac tatttttcaa atttagttac 2821 aacaaagtaa atgtcaaata atgccaacta taaaggaaaa ctagactaat tctactgttt 2881 aatgtgacga gaattgtggc gtgaaaccta gtaactgctt atcatggcgt cgaatttcac 2941 ttctccagaa tctttatttt agtgcgctat agatgatagt ccaaagagaa ggaattacaa 3001 ttattataac ccatccaaag ctatgtcaga gactcctctt ctccgagtga ttcttttcta 3061 tgtctcttgt gaaggtcgat ctttttgcta ccccctggcg atgccggcca cctcaacgtg 3121 gtgccagggt ccagtgaccc cgtatcattg ggggaagcca agagccagca gcgttccttt 3181 catggggaac actgctgtag ctccggctac gcatcataca gcacagggat cagcaggcgt 3241 cttgctggga caccgttttt catttgtcgg tccctgggca cgtgccagcg tcgatcagca 3301 gtatcatccg cgctaagatg ctgctgtccg gtgatgtgga cctccaaaaa aggattgcca 3361 ttttggcatc ttttggagag tccagggtgg gaggcttctc gccccatctg ctgtattccg 3421 ttcaactgcg gacgtacaac aaaaaattat agagggtgtg ttaggatgaa tgaaaaaggg 3481 agactctgcc acagtcgcca gaccgatagc actctcaggg ctctacggtg ttggctgatg 3541 gccgcgcaga ggggggaaac tctcacgaag gcacgaagaa aattcaaaaa aaaatttttg 3601 ctaccacagg ctgtaaggat aacgctggag atggagccga ctgtggtttt tcttgtgaat 3661 gcgtatcttc gtaaccggtt gttaagaagg gatcggacga aggaatttgt ctgagtatga 3721 ggtaatgtcg aggttttctt tcattggatg tagtaataat ctccaaattt gttggaacat 3781 ataatattat tgcaataaat gcttgaagag tggaattatt gatggattga cttgagtatc 3841 atgattccaa tgaacactct tataatcctt cgaatagcaa acgtatgaac aaattaacgt 3901 agcgtcatgt agacatggac atttttgtaa tgatacgact gctattgcac atctgaaaat 3961 ttctatgacg gaccagtgcc gttaatcacc gtacaattga attc // LOCUS TRBBS134 2396 bp ds-DNA INV 14-AUG-1990 DEFINITION T.brucei expression site-associated protein (BS1.3 and BS1.4) genes, 3' and 5' ends. ACCESSION M34521 KEYWORDS expression site-associated protein. SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.5 DNA, clone BS1.3-4. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 2396) AUTHORS Hobbs,M.R. and Boothroyd,J.C. TITLE An expression-site-associated gene family of trypanosomes is expressed in vivo and shows homology to a VSG gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Hobbs, 22-MAY-1990. FEATURES from to/span description pept 1836 > 2396 expression site-associated protein (BS1.4) ORF < 1 1206 expression site-associated protein (BS1.3) BASE COUNT 744 a 357 c 642 g 653 t ORIGIN 1 bp upstream of HindIII site. 1 aagcttgcaa ctccatgaag tgttatcatc catttccttc ggtagcttgg gtgtgaaaaa 61 tattcgagga ggaaatggga gggatggatg caatttggtg agaacagaca ctaacggtat 121 attaaatggg ggttcaccaa cacggcacaa cttaacgtgg ggtggtggag tgatgaattt 181 tggatcctat caaaacggaa gtatgtacgt tgagggtggt gaatacggtg atgctacaga 241 gtacggtgcg gtacggtgga ctgaagatcc cagtaaagtg agcatatttg aggatgtcat 301 tcgcctgttt gcgcgcttcc aagaagcaaa aaatgcagtg atgaggagaa taaaaactac 361 tgtggatgaa ttgacgaaat gtatagggca gaaggaggct gaactcacta atgatcagat 421 ttacgaggaa tttatctggg agaccataca cagattggag ctgtcaaaga gagtgagtga 481 acaaccctct ttaggtgaag aagaggagac aatcctgaaa tctaactata ctgcagaacc 541 tgtccgtgga cctttcacgg gagcggggtc caacacagta gcagtacatt tgagtgtttc 601 taccgctgca ctctgttgtt tagctttatt gttgggagtg ctgtgaggga aggatgcgac 661 agaaactgcg ctgcttagcg tgaaagatta tggtaatgga gggttgggaa agattggggg 721 aacaaaaacc tatttctttt atttggggga acaaatgggc aaaagtaacg tgagtttcca 781 gcgggagtgg tgtgtgtgtg catggggctg actaaggaaa gatgtaagtt cggaatgtgg 841 tatgtacaag ctacgaaaac gtgtgaaaca aaacgatatg taaggggaaa atgtaacaac 901 caactatgtt aaatttcaag agactatttt tcaaatttag ttacaacaaa gtaaatgtca 961 aataatgcca actataaagg aaaactagac taattctact gtttaatgtg acgagaattg 1021 tggcgtgaaa cctagtaacg accacgtcac gatgtggaat ttcacttctc cagaatcttt 1081 gttttagtgc gctatagatg atagtccaaa gagaaggaat tgcagttatt ataacccatc 1141 caaagataga tcaaagactc ctcttctccg agtgattctt ttctatgtca cttgtggaag 1201 gtcgatcttg gaacaatatc gaacgtgtga agcctgaaat gaggcgttat tattgcttct 1261 ctttttggtg gttgttagag ttatataccg ctatggtatg atgcacagag gtaacgctaa 1321 aaataagtga caaggcgttg tagtagggcg tatggtagtt ggaagcggct aatgattgct 1381 ggcctcacca acagaggaag ttaattgttg acgtagccgt ttgtgaatga gatagaaggt 1441 tatcgggtgg aacattatct tggaaagagt ttttcatcac ctaaaaatgt gtagagtggt 1501 ggtttggaga aaatggttta gatattgagg aggaaacaag gatttgaggt gcaagttgat 1561 gcgctgtaag atttgtatta ccgttggtgt tacttattgt tgagtgagta tttgagggaa 1621 atgttatggg aggtgcttcg ttgttacata cggctctgag tcagtggtgt tcatttgaat 1681 gaaatcttga taggataaga tgaaataata ctaagcatgt attcttctgt gctgccgccg 1741 aaatggtaat ctatcttacc tacagcacga aaaatttatt ttttttactc agtaacaaag 1801 cggaaagttt taaaagcata atttaaaaaa gtgatatgat gaggttttgg tttgtgttgt 1861 tggccctttt gggaaaaaaa acacatgcat attatgaaaa tgaaaggaat gcattaaacg 1921 caaccgccgc taataaagtt tgtgggctat cgaactatct taaaggagta gcgcacagag 1981 taaacagcga gagtgctgtg gttacggaaa aactatcaga tttgaaaatg agaagcatcc 2041 agttgcagct atcagtaatg cggaacagag tgccttctgg cgagaaggat tgtaaagaca 2101 taaggacact cttgaaaaca gtattgagga atgagtttac attccagcag gagttggagg 2161 aaatgaggaa cgcatccgcg ttagcagcag ctgcagctgg gttggcagct ggaagactgg 2221 aagagtggat ttttgtattt gcacaggcag cagacaggtc atcacagttt tgcataagtg 2281 tggggaagca tatcgcggct gagcacggca acttgcaaga atgttttgat ggaacaattg 2341 gacctgaaac cctttacaaa attgaggatt cacgtgtaaa agagtcggcg cagaaa // LOCUS TRBBS15 714 bp ss-mRNA INV 14-AUG-1990 DEFINITION T.brucei expression site-associated protein (BS1.5) gene, partial cds. ACCESSION M34522 KEYWORDS expression site-associated protein. SOURCE T.brucei (strain 427) bloodstream stage isolate MiTat 1.4, cDNA to mRNA, clone BS1.5. ORGANISM Trypanosoma brucei Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 714) AUTHORS Hobbs,M.R. and Boothroyd,J.C. TITLE An expression-site-associated gene family of trypanosomes is expressed in vivo and shows homology to a VSG gene JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Hobbs, 22-MAY-1990. FEATURES from to/span description pept < 1 > 714 expression site-associated protein (BS1.5) (AA at 1) BASE COUNT 226 a 116 c 203 g 169 t ORIGIN 1 agcgagagtg ccgtggttac ggaaaaacta tcagatttga aaatgagaag catccagttg 61 cagttatcag taatgcggaa cagagtgcct tctggcgaga aggattgtaa agacatcagg 121 acactcttga aaacagtatt gaggaatgag tttacattcc agcaggagtt ggaggaaatg 181 aggaacgcat ccgcgttagc agcagctgca gctgggttgg cagctggaag actggaagag 241 tggatttttg tatttgcaca ggcagcagac aggtcatcac agttttgcat aagtgtgggg 301 aaaactatcc caccggaaca aaacaacttg caagaatgtt ttgatggaac aattggacct 361 gaaacccttt acaaaattga ggattcacgt gtaaaagagt ctgcaaagaa aagcttgcaa 421 ctccatgaag cgttatcatc catttccttc aatagtttgg gtgcggaaag tatccgagga 481 ggaaatggga aggatggatg taatttggtg agaacagaca ctgacggtat attaaatggg 541 ggttcaccaa cacggcacaa tttaacgtgg ggtggtggag tgatgaattt tggatcctat 601 caaaacggaa gtatgtacgt tgagggggga gaatacggtg atgctacaga gtacggtgca 661 gtacggtgga ctgaagatcc cagtaaagta agcatattta aggatgtcat tcgc // LOCUS ECOPOLBDA 4666 bp ds-DNA BCT 14-AUG-1990 DEFINITION E.coli DNA polymerase (polB) gene, 5' flank. ACCESSION M35371 KEYWORDS DNA polymerase; polB gene. SOURCE E.coli (strain W3110) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 4666) AUTHORS Chen,H., Sun,Y., Stark,T., Beattie,W. and Moses,R. TITLE Nucleotide sequence and deletion analysis of the polB gene of E.coli JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Chen, 20-JUN-1990. Author address: H.Chen Baylor College of Medicine Cell Biology and IMG One Baylor Plaza Houston, TX 77030 email: hchen@vulture.bcm.tmc.edu FEATURES from to/span description signal 1209 1214 -35 region signal 1229 1234 -10 region binding 1250 1255 ribosome binding site site 1193 1208 lexA box BASE COUNT 1100 a 1278 c 1257 g 1031 t ORIGIN 1 aagcttgcat gcctgcaggt cgactctaga ggatcctcgc tggtggcgcg caccataccg 61 tcttcagcca tgcactgaac ctcaacgata tgcgccaatt cgccgagatg cacgacattg 121 aaatcacggt gattgataac gacacacgcc tgccagcgtt taaagacgcg ctgcgctgga 181 acgaagtgta ttacgggttt cgtcgctaag tagccgcatc cggtatgtaa cgcctgatgc 241 gacgctgacg cgtcttatct ggcctacacg ctgcgatttt gtaggccgga taagcaaagc 301 gcatccggca ttcaacgcct gatgcgacgc tggcgcgtct tatcaggcct acgcgctgcg 361 attttgtagg ccggataagc aaagcgcatc cggcattcaa cgcctgatgc gacgctggcg 421 cgtcttatca ggcctacacg ctgcgatttt gtaggccgga taagcaaagc gcatccggca 481 cgaaggagtc aacatgttag aagatctcaa acgccaggta ttagaagcca acctggcgct 541 gccaaaacac aacctggtca cgctcacatg gggcaacgtc agcgccgttg atcgcgagcg 601 cggcgtcttt gtgatcaaac cttccggcgt cgattacagc gtcatgaccg ctgacgatat 661 ggtcgtggtt agcatcgaaa ccggtgaagt ggttgaaggt acgaaaaagc cctcctccga 721 cacgccaact caccggctgc tctatcaggc attcccctcc attggcggca ttgtgcatac 781 gcactcgcgc cacgccacca tctgggcgca ggcgggtcag tcgattccag caaccggcac 841 cacccacgcc gactatttct acggcaccat tccctgcacc cgcaaaatga ccgacgcaga 901 aatcaacggc gaatatgagt gggaaaccgg taacgtcatc gtagaaacct ttgaaaaaca 961 gggtatcgat gcagcgcaaa tgcccggcgt tctggtccat tcccacggcc cgtttgcatg 1021 gggcaaaaat gccgaagatg cggtgcataa cgccatcgtg ctggaagagg tcgcttatat 1081 ggggatattc tgccgtcagt tagcgccgca gttaccggat atgcagcaaa cgctgctgga 1141 taaacactat ctgcgtaagc atggcgcgaa ggcatattac gggcagtaat gactgtataa 1201 aaccacagcc aatcaaacga aaccaggcta tactcaagcc tggttttttg atggattttc 1261 agcgtggcgc aggcaggttt tatcttaacc cgacactggc gggacacccc gcaagggaca 1321 gaagtctcct tctggctggc gacggacaac gggccgttgc aggttacgct tgcaccgcaa 1381 gagtccgtgg cgtttattcc cgccgatcag gttccccgcg ctcagcatat tttgcagggt 1441 gaacaaggct ttcgcctgac accgctggcg ttaaaggatt ttcaccgcca gccggtgtat 1501 ggcctttact gtcgcgccca tcgccaattg atgaattacg aaaagcgcct gcgtgaaggt 1561 ggcgttaccg tctacgaggc cgatgtgcgt ccgccagaac gctatctgat ggagcggttt 1621 atcacctcac cggtgtgggt cgagggtgat atgcacaatg gcactatcgt taatgcccgt 1681 ctgaaaccgc atcccgacta tcgtccgccg ctcaagtggg tttctataga tattgaaacc 1741 acccgccacg gtgagctgta ctgcatcggc ctggaagcgt gcgggcagcg catcgtttat 1801 atgctggggc cggagaatgg cgacgcctcc tcgcttgatt tcgaactgga atacgtcgcc 1861 agccgcccgc agttgctgga aaaactcaac gcctggtttg ccaactacga tcctgatgtg 1921 atcatcggtt ggaacgtggt gcagttcgat ctgcgaatgc tgcaaaaaca tgccgagcgt 1981 taccgtcttc cgctgcgtct tgggcgcgat aatagcgagc tggagtggcg cgagcacggc 2041 tttaaaaacg gcgtcttttt tgcccaggct aaaggtcggc taattatcga cggtatcgag 2101 gcgctgaaat ccgcgttctg gaatttctct tcattctcgc tggaaactgt cgctcaggag 2161 ctattaggcg aaggaaaatc tatcgataac ccgtgggatc gaatggacga aattgaccgc 2221 cgtttcgccg aagataaacc tgcgctggca acttataacc tgaaagattg cgagctggtg 2281 acgcagatct tccacaaaac tgaaatcatg ccatttttac tcgaacgggc aacggtgaac 2341 ggcctgccgg tggaccgaca cggcggttcg gtggcggcat ttggtcatct ctattttccg 2401 cgaatgcatc gcgctggtta tgtcgcgcct aatctcggcg aagtgccgcc gcacgccagc 2461 cctggcggct acgtgatgga ttcacggcca gggctttatg attcagtgct ggtgctggac 2521 tataaaagcc tgtacccgtc gatcatccgc acctttctga ttgatcccgt cgggctggtg 2581 gaaggcatgg cgcagcctga tccagagcac agtaccgaag gttttctcga tgcctggttc 2641 tcgcgagaaa aacattgcct gccggagatt gtgactaaca tctggcacgg gcgcgatgaa 2701 gccaaacgcc agggtaacaa accgctgtcg caggcgctga aaatcatcat gaatgccttt 2761 tatggcgtgc tcggcaccac cgcctgccgc ttcttcgatc cgcggctggc atcgtcgatc 2821 accatgcgtg gtcatcagat catgcggcaa accaaagcgt tgattgaagc acagggctac 2881 gacgttatct acggcgatac cgactcaacg tttgtctggc tgaaaggcgc acattcggaa 2941 gaagaagcgg cgaaaatcgg tcgtgcactg gtgcagcacg ttaacgcctg gtgggcggaa 3001 acgctgcaaa aacaacggct gaccagcgca ttagaactgg agtatgaaac ccatttctgc 3061 cgttttctga tgccaaccat tcgcggagcc gataccggca gtaaaaagcg ttatgccgga 3121 ctgattcagg agggcgacaa gcagcggatg gtgtttaaag ggctggaaac cgtgcgcacc 3181 gactggacgc cgctggccca gcagtttcag caggagctat acctgcgcat cttccgcaac 3241 gagccatatc aggaatatgt acgcgaaacc atcgacaaac tgatggcggg tgaactggat 3301 gcgcgactgg tttaccgtaa acgccttcgc cgtccgctga gcgagtatca gcgtaatgtg 3361 ccgcctcatg tacgcgccgc tcgccttgcc gatgaagaaa accaaaagcg tggtcgcccc 3421 ttgcaatatc agaatcgcgg caccattaag tacgtatgga ccaccacagg cccggagccg 3481 cctggactac caacgttcac cactggatta cgaacactat ctgacccgcc agctacaacc 3541 cgtggcggag ggaatactcc cttttattga ggataatttt gctacactta tgaccgggca 3601 acttgggcta ttttgagcaa aaaaaagagt tcgccagata ccattttgat gcgtgacgaa 3661 tgctttgcca tccagtacca tagcgccctt tccattcctg gacctgaata acaccactac 3721 ctcataagca cggtagcggg tggttattgc ctgcaattaa agatatagag ccgaacacat 3781 atgcctttta cacttggtca acgctggatc agcgatacag aaagcgaatt gggacttgga 3841 accgttgtcg cggtggatgc gcgaactgtc actttacttt tcccatctac tggtgaaaac 3901 cgtctgtacg cacgcagtga ttcccccgtg acccgcgtga tgttcaaccc tggtgatacc 3961 attaccagcc atgacggctg gcagatgcaa gtcgaagaag taaaagaaga aaatggcttg 4021 ctgacctata tcggtactcg cctggatact gaagaggtcc ggcgtagccc tgcgtgaagt 4081 tttccttgat agcaaactgg tgttcagcaa accgcaggca ccgtctgttt gccgggcaga 4141 ttgaccgtat ggaccgcttt gcgctgcgtt atcgcgcgcg taaatattcc agcgaacagt 4201 tccgtatgcc gtacagcggc ctgcgcggtc agcgtaccag cctgatccgc atcagctcaa 4261 catcgctcat gatgttggtc gccgccacgc gccgcgcgtc ctgctggctg acgaagtggg 4321 tttagggaaa accattgaag ccgggatgat cctgcatcag caactgctct ctggcgctgc 4381 tgaacgtgtg ctaattatcg tcccggaaac cttacagcat cagtggctgg tagaaatgct 4441 gcgccgtttc aacctgcgct ttgcgctatt tgatgatgag cgttatgccg aagctcagca 4501 cgatgcttac aacccgtttg acaccgtgaa gcggcgcacg aaaaacgcga aagcgtttca 4561 cgataaatgc gaaaacttta gctttcgcgc ttcaaatgaa acagatgtat taattactgc 4621 tttttattca ttacatgggg atccccgggt accgagctcg aattcc // LOCUS HUMBIGFII 1387 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human insulin-like growth factor binding protein 2 (IGFBP2) mRNA, complete cds. ACCESSION M35410 KEYWORDS insulin-like growth factor binding protein 2. SOURCE Human 67-year old retina, cDNA to mRNA, clone AS200. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1387) AUTHORS Agarwal,N., Hsieh,C.-L., Sills,D., Swaroop,M., Desai,B., Francke,U. and Swaroop,A. TITLE Sequence analysis, expression and chromosomal localization of a gene, isolated from a subtracted human retina cDNA library, that encodes an insulin-like growth factor binding protein (IGFBP2) JOURNAL Exp. Eye Res. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Swaroop, 20-JUN-1990. FEATURES from to/span description pept 64 1050 insulin-like growth factor binding protein 2 (IGFBP2) precursor /hgml_locus_uid="LN0220S" /nomgen="IBP1" /map="7p13-p12" sigp 64 180 insulin-like growth factor binding protein 2 signal peptide matp 181 1047 insulin-like growth factor binding protein 2 mRNA < 1 1387 IGFBP2 mRNA signal 1362 1367 poly-A signal binding 175 197 ATP binding site BASE COUNT 232 a 455 c 477 g 223 t ORIGIN 1 gtgccacctg cccgcccgcc cgctcgctcg ctcgcccgcc gcgccgcgct gccgaccgcc 61 agcatgctgc cgagagtggg ctgccccgcg ctgccgctgc cgccgccgcc gctgctgccg 121 ctgctgccgc tgctgctgct gctactgggc gcgagtggcg gcggcggcgg ggcgcgcgcg 181 gaggtgctgt tccgctgccc gccctgcaca cccgagcgcc tggccgcctg cgggcccccg 241 ccggttgcgc cgcccgccgc ggtggccgca gtggccggag gcgcccgcat gccatgcgcg 301 gagctcgtcc gggagccggg ctgcggctgc tgctcggtgt gcgcccggct ggagggcgag 361 gcgtgcggcg tctacacccc gcgctgcggc caggggctgc gctgctatcc ccacccgggc 421 tccgagctgc ccctgcaggc gctggtcatg ggcgagggca cttgtgagaa gcgccgggac 481 gccgagtatg gcgccagccc ggagcaggtt gcagacaatg gcgatgacca ctcagaagga 541 ggcctggtgg agaaccacgt ggacagcacc atgaacatgt tgggcggggg aggcagtgct 601 ggccggaagc ccctcaagtc gggtatgaag gagctggccg tgttccggga gaaggtcact 661 gagcagcacc ggcagatggg caagggtggc aagcatcacc ttggcctgga ggagcccaag 721 aagctgcgac caccccctgc caggactccc tgccaacagg aactggacca ggtcctggag 781 cggatctcca ccatgcgcct tccggatgag cggggccctc tggagcacct ctactccctg 841 cacatcccca actgtgacaa gcatggcctg tacaacctca aacagtgcaa gatgtctctg 901 aacgggcagc gtggggagtg ctggtgtgtg aaccccaaca ccgggaagct gatccaggga 961 gcccccacca tccgggggga ccccgagtgt catctcttct acaatgagca gcaggaggct 1021 cgcggggtgc acacccagcg gatgcagtag accgcagcca gccggtgcct ggcgcccctg 1081 ccccccgccc ctctccaaac accggcagaa aacggagagt gcttgggtgg tgggtgctgg 1141 aggattttcc agttctgaca cacgtattta tatatggaaa gagaccagca ccgagctcgg 1201 cacctccccg gcctctctct tcccagctgc agatgccaca cctgctcctt cttgctttcc 1261 ccgggggagg aagggggttg tggtcgggga gctggggtac aggtttgggg agggggaaga 1321 gaaattttta tttttgaacc cctgtgtccc ttttgcataa gattaaagga aggaaaagta 1381 aagtgtg // LOCUS HUMLBPA 1431 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human lipopolysaccharide binding protein (LBP) mRNA, complete cds. ACCESSION M35533 KEYWORDS lipopolysaccharide binding protein. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites; for [2]) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. TITLE Structure and function of lipopolysaccharide binding protein JOURNAL Science (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1431) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Tobias, 21-JUN-1990. Author address: P.S.Tobias Department of Immunology, IMM-12 10466 N. Torrey Pines Rd La Jolla, CA 92037 FEATURES from to/span description pept 1 > 1431 lipopolysaccharide binding protein (LBP) precursor sigp 1 75 lipopolysaccharide binding protein signal peptide matp 76 1431 lipopolysaccharide binding protein BASE COUNT 319 a 417 c 359 g 336 t ORIGIN 1 atgggggcct tggcaagagc cctgccgtcc atactgctgg cattgctgct tacgtccacc 61 ccagaggctc tgggtgccaa ccccggcttg gtcgccagga tcaccgacaa gggactgcag 121 tatgcggccc aggaggggct attggctctg cagagtgagc tgctcaggat cacgctgcct 181 gacttcaccg gggacttgag gatcccccac gtcggccgtg ggcgctatga gttccacagc 241 ctgaacatcc acagctgtga gctgcttcac tctgcgctga ggcctgtccc cggccagggc 301 ctgagtctca gcatctccga ctcctccatc cgggtccagg gcaggtggaa ggtgcgcaag 361 tcattcttca aactacaggg ctcctttgat gtcagtgtca agggcatcag catttcggtc 421 aacctcctgt tgggcagcga gtcctccggg aggcccacag gttactgcct cagctgcagc 481 agtgacatcg ctgacgtgga ggtggacatg tcgggagatt cggggtggct cttgaacctc 541 ttccacaacc agattgagtc caagttccag aaagtactgg agagcaggat ttgcgaaatg 601 atccagaaat cagtgtcctc cgatctacag ccttatctcc aaactctgcc agttacaaca 661 gagattgaca gtttcgccga cattgattat agcttagtgg aagcccctcg ggcaacagcc 721 cagatgctgg aggtgatgtt taagggtgaa atctttcatc gtaaccaccg ttctccagtt 781 accctccttg ctgcagctga ggaacacaac aaaatggtct actttgccat ctcggattat 841 gtcttcaaca cggccagcct ggtttatcat gaggaaggat atctgaactt ctccatcaca 901 gatgacatga taccgcctga ctctaatatc cgactgacca ccaagtcctt ccgacccttc 961 gtcccacggt tagccaggct ctaccccaac atgaacctgg aactccaggg atcagtgccc 1021 tctgctccgc tcctgaactt cagccctggg aatctgtctg tggaccccta tatggagata 1081 gatgcctttg tgctcctgcc cagctccagc aaggagcctg tcttccggct cagtgtggcc 1141 actaatgtgt ccgccacctt gaccttcaat accagcaaga tcactgggtt cctgaagcca 1201 ggaaaggtaa aagtggaact gaaagaatcc aaagttggac tattcaatgc agagctgttg 1261 gaagcgctcc tcaactatta catccttaac accctctacc ccaagttcaa tgataagttg 1321 gccgaaggct tcccccttcc tctgctgaag cgtgttcagc tctacgacct tgggctgcag 1381 atccataagg acttcctgtt cttgggtgcc aatgtccaat acatgagagt t // LOCUS HUMPEC12L 2344 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human cell 12-lipoxygenase gene, complete cds. ACCESSION M35418 KEYWORDS lipoxygenase. SOURCE Human platelet/erythroleukemia cell DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2344) AUTHORS Funk,C.D., Furci,L. and FitzGerald,G.A. TITLE Molecular cloning, primary structure, and expression of the human platelet/erythroleukemia cell 12-lipoxygenase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5638-5642 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.D.Funk, 20-JUN-1990. FEATURES from to/span description pept 42 2033 12-lipoxygenase signal 2324 2329 poly-A signal mRNA < 1 2344 12-lipoxygenase mRNA BASE COUNT 514 a 692 c 642 g 496 t ORIGIN 1 ggaggacccg gctcccctcg cctaagctgc tggggggcgc catgggccgc taccgcatcc 61 gcgtggccac cggggcctgg ctcttctccg ggtcgtacaa ccgcgtgcag ctttggctgg 121 tcgggacgcg cggggaggcg gagctggagc tgcagctgcg gccggcgcgg ggcgaggagg 181 aggagtttga tcatgacgtt gcagaggact tggggctcct gcagttcgtg aggctgcgca 241 agcaccactg gctggtggac gacgcgtggt tctgcgaccg catcacggtg cagggccctg 301 gagcctgcgc ggaggtggcc ttcccgtgct accgctgggt gcagggcgag gacatcctga 361 gcctgcccga gggcaccgcc cgcctgccag gagacaatgc tttggacatg ttccagaagc 421 atcgagagaa ggaactgaaa gacagacagc agatctactg ctgggccacc tggaaggaag 481 ggttacccct gaccatcgct gcagaccgta aggatgatct acctccaaat atgagattcc 541 atgaggagaa gaggctggac tttgaatgga cactgaaggc aggggctctg gagatggccc 601 tcaaacgtgt ttacaccctc ctgagctcct ggaactgcct agaagacttt gatcagatct 661 tctggggcca gaagagtgcc ctggctgaga aggttcgcca gtgctggcag gatgatgagt 721 tgttcagcta ccagttcctc aatggtgcca accccatgct gttgagacgc tcgacctctc 781 tgccctccag gctagtgctg ccctcgggga tggaagagct tcaggctcaa ctggagaaag 841 aacttcagaa tggttccctg tttgaagctg acttcatcct tctggatgga attccagcca 901 acgtgatccg aggagagaag caatacctgg ctgcccccct cgttatgctg aagatggagc 961 ccaatgggaa gctgcagccc atggtcatcc agattcagcc tcccagcccc agctctccaa 1021 ccccaacact gttcctgccc tcagaccccc cacttgcctg gctcctggca aagtcctggg 1081 tccgaaattc agatttccaa ctgcacgaga tccagtatca cttgctgaac actcacctgg 1141 tggctgaggt catcgctgtc gccaccatgc ggtgcctccc aggactgcac cccatcttca 1201 agttcccgat cccccatatc cgctacacca tggaaatcaa cacccgggcc cggacccaac 1261 tcatctcaga tggaggaatt tttgataagg cagtgagcac aggtggaggg ggccatgtac 1321 agttgctccg tcgggcggca gctcagctga cctactgctc cctctgtcct cctgacgacc 1381 tggctgaccg gggcctgctg ggactcccag gtgctctcta tgcccatgat gctttacggc 1441 tctgggagat cattgccagg tatgtggagg ggatcgtcca cctcttctac caaagggatg 1501 acatagtgaa gggggaccct gagctgcagg cctggtgtcg ggagatcacg gaggtggggc 1561 tgtgccaggc ccaggaccga ggtttccctg tctccttcca gtcccagagt caactctgcc 1621 atttcctcac catgtgcgtc ttcacgtgca ctgcccagca tgccgccatc aaccagggcc 1681 agctggactg gtatgcctgg gtccctaatg ctccatgcac aatgcggatg cccccaccca 1741 ccaccaagga agatgtgacg atggccacag tgatggggtc actacctgat gtccggcagg 1801 cctgtcttca aatggccatc tcatggcatc tgagtcgccg ccagccagac atggtgcctc 1861 tggggcacca caaagaaaaa tatttctcag gccccaagcc caaagctgtg ctaaaccaat 1921 tccgaacaga tttggaaaag ctagaaaagg agattacagc ccggaatgag caacttgact 1981 ggccctatga atatctgaag cccagctgca tagagaacag tgtcaccatc tgagccctag 2041 agtgactcta cctgcaagat ttcacatcag ctttaggact gacatttcta tcttgaattt 2101 catgctttcc taaagtctct gctgctaagg ctctatttcc tcccccagtt aaacccctac 2161 attagtatcc cactagccca ggggagcagt aaactttctc tgcaaagact agatcctttt 2221 ttacgctttg cagaccgcat agtcactgtc tcaactactc agctctcctg ctgcagcatg 2281 aaggcagcca cagacaacat ggaaatgagt gtgactatgt tccaataaaa ctttatggac 2341 actg // LOCUS HUMRALBA 1327 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human GTP-binding protein (RALB) mRNA, complete cds. ACCESSION M35416 KEYWORDS GTP-binding protein. SOURCE Human retina, cDNA to mRNA, clone AS181. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1327) AUTHORS Hsieh,C.-L., Swaroop,A. and Francke,U. TITLE Chromosomal localization and cDNA sequence of human RALB, a GTP binding protein JOURNAL Somat. Cell Mol. Genet. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by A.Swaroop, 20-JUN-1990. FEATURES from to/span description pept 171 791 GTP-binding protein (RALB) mRNA < 1 1327 RALB mRNA signal 1303 1308 poly-A signal BASE COUNT 368 a 273 c 373 g 313 t ORIGIN Chromosome cen-q13. 1 gagcccggca gctcaatgac aaatcggtgg aggacggctg gggtccggcc ccgggagggc 61 ccggggcgcg tttaagagct gcgggccggg tgcggacggc ggaggcggcg ggactggtcc 121 ctgctcttca gtgggtcatc tgtgtgtcac agcctcagaa gaccagcgag atggctgcca 181 acaagagtaa gggccagagc tccttggccc tccacaaggt gatcatggtt ggcagcggag 241 gcgttggcaa gtcagccctg acgcttcagt tcatgtatga cgagtttgta gaagactatg 301 aacctaccaa agctgacagt tatagaaaga aagtggttct tgatggggaa gaagttcaga 361 tagatattct ggacaccgct gggcaagagg actacgcagc cattcgagat aactactttc 421 ggagtgggga agggtttctt cttgtgttct caatcacaga acatgaatcc tttacagcaa 481 ctgccgaatt cagggaacag attctccgtg tgaaggctga agaagataaa attccactgc 541 tcgtcgtggg aaacaagtct gacctagagg agcggaggca ggtgcctgtg gaggaggcca 601 ggagtaaagc cgaagagtgg ggcgtgcagt acgtggagac gtcagcgaag acccgggcca 661 acgtggacaa ggtgttcttt gacctaatga gagaaatcag aacaaagaag atgtcagaaa 721 acaaagacaa gaatggcaag aaaagcagca agaacaagaa aagttttaaa gaaagatgtt 781 gcttactatg agtgtcaagg tgacggatga agccagctgc tcctaaggac acagggctgg 841 gttggtaaag agaaggctat ggttgacttc ttgcttgtgc ttcccactct ccccgacttc 901 attcactcaa acttctttaa atggggaaaa atatttgtga ctctgtggct ggcagaagaa 961 ataagcccat gcaagtggaa gggctgcttt gtcaggaggt tgtggaattt ctttcttctc 1021 cccttcttcc ctcccaaaag cttagctatg tataaagtgc cacagatagg aaacagctgt 1081 taattacaaa gagaaagaat tgtcatagca tcttattttg ttcctagttt tataacatta 1141 ccatccttcg ttttgaacta cagatgttgt agtgggtttt ggaggaggga gtggagtaag 1201 atgccctccc acttttatca gtttagtagt agtactgaga aaaatccctt cagctctaag 1261 aacactgaaa aatccaccga ttttttgggt aagcttcttg gcaataccct gtggatctga 1321 aacagct // LOCUS LACLACR 1332 bp ds-DNA BCT 14-AUG-1990 DEFINITION L.lactis lactose phosphotransferase system repressor (lacR) gene, complete cds. ACCESSION M35375 KEYWORDS lactose phosphotransferase system repressor; lactose repressor. SOURCE L.lactis (strain MG1820) DNA. ORGANISM Lactococcus lactis Prokaryota; Bacteria; Firmicutes; Regular asporogenous rods; Lactobacillaceae. REFERENCE 1 (bases 1 to 1332) AUTHORS Van Rooijen,R.J. and Devos,W.M. TITLE Molecular cloning, transcriptional analysis, and nucleotide sequence of LACR, a gene encoding the repressor of the lactose phosphotransferase system of Lactococcus lactis JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.J.Van Rooijen, 20-JUN-1990. Author address: R.J.Van Rooijen Netherlands Inst for Dairy Research Kernhemseweg 2 P.O. Box 20 6710 BA EDE THE NETHERLANDS FEATURES from to/span description pept 370 1155 lactose repressor (lacR; alt.) pept 388 1155 lactose repressor (lacR; alt.) mRNA 79 1245 lactose repressor mRNA signal 1215 > 1155 transcription termination signal binding 353 357 ribosomal binding site signal 45 51 -35 region signal 68 74 -10 region BASE COUNT 469 a 207 c 198 g 458 t ORIGIN 1 gatatcaaac attcaaacaa aacgcaacta tttttgttaa ttttttgttt ttttttattt 61 gtttttttaa aaaatagata acaccgttaa attattgttc atttttgttc atttaatcca 121 tcacaaaatg gacgtgaaat atctattcag gtattacaaa agtcttttac tttctataac 181 ttactgatta agaggtccta ctttattttc gtcttataca aaatctgacc taagctaata 241 tacgtcaatc ctctgttctt atttcatcat ctaacgtttg tttttgtttg aaattgtttg 301 ttttaccttg aaaatattat cttttatgat acaattaaaa gagaattatc tttggaaaaa 361 aattacttta tgaaagaaag tcttcatatg aacaaaaaac gacgattaga aaaaatttta 421 gatatgttaa agattgatgg gaccataacc ataaaagaaa taatagatga actagatatt 481 tccgatatga cagcccgtag agaccttgat gctctagaag ctgatggact tttaacacgt 541 actcatggtg gtgcacaatt gctttcctct aaaaagccac ttgaaaagac acatatcgag 601 aagaaaagtc taaatacaaa agaaaaaatt gacattgcta aaaaagcctg ctctttaatc 661 aaagatggcg atactatttt tattggaccc ggaactacac ttgtacaact ggcattagaa 721 ttgaaaggtc gtaaaggtta taaaattcgt gtcattacaa atagtctccc tgtgttcttg 781 attctaaatg atagcgaaac cattgattta ttgcttcttg gcggtgaata tagagaaata 841 actggagctt ttgtaggttc aatggcttcg acaaatttaa aagcaatgag atttgccaaa 901 gcttttgttc gtgcaaatgc tgttacccat aattctattg ctacatatag tgacaaggaa 961 ggtgtgattc aacaacttgc cctaaacaat gctgtagaaa aattcttatt agtagacagt 1021 actaaattcg atcgatacga tttctttaac ttctacaatc tagatcaact cgataccatc 1081 attacagata accagattag ccctcaacac ttagaggaat ttagccagta cactactatt 1141 ttaaaagcgg actagaatta tgacttataa aaatattgga ctactcttaa ataattagac 1201 ataaaaaaag caccgtatga atcaaacaat tctacggtgt ttttttgtta tttctaatgt 1261 atggtttgtc gaaaatatgt acacattatt taactttcca aaaaattgga gttttcttga 1321 taattggata tc // LOCUS MARCMYCA 1391 bp ds-DNA ROD 14-AUG-1990 DEFINITION Woodchuck c-myc protein gene, exon 1. ACCESSION M35498 KEYWORDS c-myc protein. SOURCE Woodchuck (Marmota monax) DNA. ORGANISM Marmota monax Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Sciuromorpha; Sciuridae; Sciurinae; Marmotini. REFERENCE 1 (bases 1 to 1391) AUTHORS Wei,Y., Hsu,T.Y., Tiollais,P., Buendia,M.A. and Etiemble,J. TITLE Evolutionary conservation of target sequences for cis-acting regulation in c-myc exon 1 and its upstream sequences JOURNAL Gene (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.A.Buendia, 21-JUN-1990. FEATURES from to/span description pre-msg 818 > 1391 c-myc mRNA and introns IVS 1378 > 1391 c-myc intron 1 site 159 220 P0 promoter site 790 794 TATA box site 964 968 TATA box BASE COUNT 317 a 385 c 383 g 306 t ORIGIN 1 ctcagcgatt agtgcgtctt gcgggaatag ccgcttccca cacccggccg ggtggaagtc 61 tgagcctgct gggcaaaacg agcgatatct gctgttttgg cagcaaacta ggggattcat 121 tctgggtggg aagtgcccaa tctagatagc tgtgcataca taatgcataa tgaattacac 181 tcacacaacc tcaagaaatg taataggtat gtattcataa cactctccaa gtatatgtgg 241 caaggcattg ctgcgttatt ttaattattc cagaaatcat tttcctccct acctcctctg 301 tcatttatcc ctaacactcc atatactgaa tgcgcactca taaatattcc ttctgcccgc 361 ctgtcttcat aagacttatt ttcaaaatgc tgctctttcc ccagccttag ggaggcgccc 421 ggccgcccgg gacgtgcgtg cgcggccgtg ggtacatggt gtattctcag tgttgagggt 481 gagggcagct gttccacctg tgttaattgg aacacgcagg acgagaatgc agtttgtcag 541 agtactgcgc cagaggagca gcagagaaag ggaaaggatt taaacaggag caaaagaaaa 601 tggtaggcgc gcgcagttaa ttcttgctgc gcccttatac tgtttacatc cgatagctgg 661 agtgccgggc tgcggggctg agtctcctcc ccttccctca ctcggcagtg cccctcccag 721 gttcccaaag ccgagggcgg ggagaaagaa aaaaaaaaga ttccgtggaa tccccgccca 781 ccagcccttt ataatacgag ggtctgcgcg cccgaggacc cctgagctgc gcttctcgtg 841 gccgccaaca tcgccgcgcc ccggcggccg ctcttggctc ccctcctgcc tagagaaggg 901 cagggcttct tagaggcttg gcgggaaaaa gaagcgaggg ggagggatcg cgcgtaacag 961 cagtataaaa gtcgttttcg gggctttatc tcactcgctg tagtaattcc agcgagagac 1021 agagggagtg agcgggcgaa cccgtgaggt ggaagaaccg agcagagctg ctccgggcgt 1081 cctgggaagg gaaacccgga gtgaaaggag acttagtctt ctgaccagcg cccccacccc 1141 agccctcccg cggagcccct ccagggtccg caaccgcgaa actttgccct ttgctgcggg 1201 cggacacttt gcactggaac ttaaaatacc cgatcgagga cgcgactctc cggagcgggg 1261 aggctatact gcctatttgg ggacactttt ccccgccttt acccaggacc cgctcctctg 1321 aaagcgctcc tggctgccgt ttgaaggctg gatttccttc gggtagttga aaacccggta 1381 agcaccagat c // LOCUS ONGOSTLE 214 bp ds-DNA INV 14-AUG-1990 DEFINITION O.volvulus recombinant antigen gene, 3' end. ACCESSION M35370 KEYWORDS T-cell epitope; recombinant antigen. SOURCE O.volvulus DNA. ORGANISM Onchocerca volvulus Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Similiidae. REFERENCE 1 (bases 1 to 214) AUTHORS Colina,K.F., Perler,F.B., Matsumura,I., Meda,M. and Nutman,T.B. TITLE The identification of an Onchocerca-specific recombinant antigen containing a T cell epitope JOURNAL Unpublished (19900 STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.B.Nutman, 19-JUN-1990. Author address: T.B.Nutman Inst. Lab. of Parasitic Diseases NIAID National Institutes of Health Bldg 4, Rm 126 9000 Rockville Pike Bethesda, MD 20892 email: tbn@helix.nih.gov FEATURES from to/span description pept < 1 93 recombinant antigen (AA at 1) site 91 93 nematode splice junction BASE COUNT 81 a 37 c 31 g 65 t ORIGIN 1 gaattcagtg taagaagcag cagaacattt caatcattac gaagatatat atacaacatt 61 tctttcttct tcattcttga gttgcatatg taaattcaaa aataattacg atttaatgaa 121 ttgagcaagc ataacttttc ccagcaagta taacaaagtt ttgcgaggaa cgaactcaga 181 aaactttcac ttatgtaaaa ttgcgcacaa gacc // LOCUS PSEIAAL 2766 bp ds-DNA BCT 14-AUG-1990 DEFINITION P.syringae IAA-lysine synthetase (iaaL) gene, complete cds. ACCESSION M35373 KEYWORDS IAA-lysine synthetase. SOURCE P.syringae savastanoi DNA. ORGANISM Pseudomonas syringae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2766) AUTHORS Roberto,F.F., Klee,H., White,F., Nordeen,R. and Kosuge,T. TITLE Expression and fine structure of the gene encoding N-epsilon-(indole-3-acetyl)-L-lysine synthetase from Pseudomonas savastanoi JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5797-5801 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.J.Klee, 20-JUN-1990. FEATURES from to/span description pept 1100 2287 IAA-lysine synthetase (iaaL) pept 95 1003 ORF1 BASE COUNT 637 a 760 c 754 g 615 t ORIGIN 1 gaattccata gcgtgcgggg cttggaggag cgccgcggcc tgagtatctg tggctaaccc 61 ttgcggcttc ggtgctggtc gctgtcgagc agctatgcgt gcagatcctg cgcagtgcag 121 gcttcggcaa acaggcgatg tggaccctgc tgacggggac ggccgccgtt gccatcgcag 181 atcccctgct tattgtggcg ttcgatctgg gcctggtggg tgccggcatc gctacctgat 241 atcgagcctg gtatcggcct gtctggggtt ttactacgtt caccgagtcg cccatctgac 301 ctgtcgggtc agcctgaaga acctgtcagg tgacatcaga aatatcgggc gaaccgcctt 361 gccagcggtg attggcaacc tggcaactcc agtgggcatg gcctacgtaa tggctgcgat 421 ggcgccgttc ggatctcagg cgctggcgac tatcggggtg atcgacaggg tcattcaggt 481 tgctttttgt gtcgtgttcg ccttgcccgg tgcgctgatc ccgatactgg ggcaaaacct 541 gggcgcaatg aacactgctc gcgtgtctca agccataaag atgacgtacg gattgttgat 601 cggctacggc tcagtgacct cgctgttact cattctgctc gctgagccat tagccagctt 661 gtttcatctc gccgctgaac gccaagtcgt gttcttcgcg ttctgccgat ggggcggcgc 721 tctggacgct catcgggctg caattcattg ccacctcagt cttcctcagt atggggcgac 781 cggcgtacgt cacactgttc ggctggttcc gcgccacctg ggaaccatgc cgttcgtgtg 841 gtatggggca cataaatttg gcagcgtcgg ggtaatgctc gggcagttgc tgggtaacac 901 catagtggcc ttttgtgcct gcgtggctcg cgcatctgct catgaaaaag atgttggaca 961 tcgagatcca ttcaataggg aaccgatccc tccacaggag taactgataa tccacgtttt 1021 gcccaccctt ggctgtcgtc aggtgggcag gatgtccagg atgtccagga aatcaaaaaa 1081 cggactatag aggactcgca tgactgccta cgatatggaa aaggaatgga gtagaatttc 1141 cattactgcc gctaaaatcc accagaacaa cgattttgaa ggattcactt atcaggactt 1201 cagaacccac gtaccgatca tggacaaaga cggcttcgcg gcacagactg aacgctgtct 1261 agagcgcaat gagcgaaact gcctgatcgg ctttaccagt ggcaccagcg gcaacatcaa 1321 acgctgttat tactactacg actgcgaagt cgatgaagac agctccctct ccaacgtctt 1381 ccgcagcaac ggctttattc tgcccggtga tcgctgcgcc aacctgttca cgatcaacct 1441 gttttctgct ctgaacaaca cgattaccat gatggccggt aactgcggtg ctcacgtcgt 1501 gtccgtaggt gacatcaccc tggtgaccaa gagccatttc gaagcgctta actcgatcaa 1561 gctcaacgta ctgctcggcg tgccatccac tatcttgcag ttcatcaatg ccatgcaaca 1621 taacggtgtg cacatcaata tcgagaaggt tgtcttcacc ggcgagagcc tgaaaacttt 1681 ccagaagaaa atcatcaggc aagcctttgg cgaacaagtc tccatcgtcg gtgtgtatgg 1741 cagttccgag ggcggcattc tcggtttcac caacagccct tgccacactg aatacgagtt 1801 tctgtccgac aagtatttca tcgaaaaaga aggcgacagc atcctcatca cctcgctgac 1861 ccgagaaaac tttacgccgc tgctgaggta tcgcctagga gacaccgcaa ccctttcgat 1921 gaaaggcgac aagctctacc tgacagacat ccagcgggag gacatgagct tcaacttcat 1981 gggcaacctc atcgggctgg gcatcattca gcaaacgatt aaacagacac tgggccgatc 2041 gctggaaatc caggttcacc tgtcagtgac cgaagagcgc aaggaactgg tgaccgtttt 2101 cgttcaggcc tctgaagtcg atgaagacga acgcgtcaga atcgaaacag ccatcgccga 2161 tatccccgac atcaaagagg cgtatcagaa aaaccaaggc accgtgtcgg tcctgcgcaa 2221 ggatgccaga gactacgcgg tctcggagcg aggcaaaatg ctctacatca tcgaccgccg 2281 aaactgaatg gctgatgtga acgagtgagt agctgcaccg acggggcctt tggcggtgtc 2341 ggtgcagttt tttagaggat tcggaagcgc cagaggtcag agtccacgaa actggaacga 2401 actgggcagc ctgcggctgc aaattgtggg attttgaaat cggttatcat agccgaaatc 2461 gagtcgatcc ctcctcagca caggcttaca catggcgtca gagaccaaaa aacgtaaacg 2521 ggcgagccgg gcaaaagcca aggcaaagca gacccgtctc caacgcgccg ggcatactac 2581 cttcgtgccc gataccgact tttccttcga tatcgatcct ttcggtgatg tcgatctttg 2641 tagttgctgc cagacaacgt atctgaacga catgtttccc gacgcttctt gcgtaaggct 2701 ttagatgaga gaagggccag gcggattcgc atcaccgccg tcattcacca cgatgaggag 2761 ccgcct // LOCUS RABLPBA 1446 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Rabbit lipopolysaccharide binding protein (LBP) protein mRNA, complete cds. ACCESSION M35534 KEYWORDS lipopolysaccharide binding protein. SOURCE Rabbit liver, cDNA to mRNA. ORGANISM Oryctolagus cuniculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Lagomorpha; Leporidae. REFERENCE 1 (ases 1 to 1446ites; for [2] AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. TITLE Structure and function of lipopolysaccharide binding protein JOURNAL Science (1990) In press STANDARD full staff_review REFERENCE 2 (bases 1 to 1446) AUTHORS Schumann,R.R., Leong,S.R., Flaggs,G.W., Gray,P.W., Wright,S.D., Mathison,J.C., Tobias,P.S. and Ulevitch,R.J. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.S.Tobias, 21-JUN-1990. Author address: P.S.Tobias Department of Immunology, IMM-12 10466 N. Torrey Pines Rd La Jolla, CA 92037 FEATURES from to/span description pept 1 > 1446 lipopolysaccharide binding protein (LBP) precursor sigp 1 78 lipopolysaccharide binding protein signal peptide matp 79 1446 lipopolysaccharide binding protein BASE COUNT 316 a 454 c 391 g 285 t ORIGIN 1 atggggacct gggccagggc cctgctgggg tccaccctgc tgagcctgct gctcgcagct 61 gccccgggag ctctgggcac caaccccggc ctcatcacca ggatcaccga caaaggcctg 121 gagtacgcgg ccagggaggg gctgctggct ctgcagagaa agctcctgga agtcacgctg 181 ccggattccg atggggactt caggatcaaa catttcgggc gtgcacagta caagttctac 241 agtctgaaaa tccccagatt cgagctgctc cgtggcaccc tgaggcccct ccccggccag 301 ggcctgagtc tcgacatctc cgacgcctac atccacgtgc ggggcagctg gaaggtgcgc 361 aaggcgttcc tgagactgaa gaactccttt gacctgtatg tcaagggcct caccatttcc 421 gtccacctcg tgttgggcag cgagtcctcc gggaggccca cggtcaccac ctccagctgc 481 agcagcgaca tccagaacgt ggagttggac atagaggggg acctggagga gctgctgaac 541 ctcctccaaa gccagatcga tgccaggctg cgcgaagtgc tggagagcaa gatttgcagg 601 cagattgagg aagccgtgac ggcccacctg cagccttatc tacagacact gccagtcaca 661 acgcagatcg acagctttgc cggcattgac tacagcttga tggaggcccc ccgggcaaca 721 gctgggatgt tggatgtgat gtttaagggt gaaattttcc ctctggatca ccgcagccca 781 gtggacttcc ttgctccagc catgaacctc cccgaggctc acagccgaat ggtctacttt 841 tccatctccg attacgtctt caacaccgcc agcctggcct accacaagtc agggtactgg 901 aacttctcca tcacagacgc catggttccg gccgacctca acatccggcg gaccaccaag 961 tccttccgac ccttcgttcc cctgcttgcc aatctctacc ccaacatgaa cttggagctc 1021 caagggacag tgaactcgga acaactggtg aacctcagca ccgagaatct gttagaggaa 1081 cccgagatgg atattgaggc cttggtggtc ctgcccagct ctgccaggga gcctgtcttc 1141 cggctgggtg tggccactaa tgtgtctgcc acactgacct tgaacaccag gaagatcact 1201 gggttcctga agccgggaag gctacaggtg gaactgaaag aatccaaagt cggaggattc 1261 aatgtggagc tgttggaagc tctcctcaac tactacattc tcaacaacct ctaccccaag 1321 gtcaatgaga agttggccca ccgcttcccg ctccctctgc tgaggcacat tcagctctac 1381 gacctgcttc tccagaccca cgagaacttc ctgctcgtgg gcgccaacat ccagtacagg 1441 agagtt // LOCUS RATUKATPA 1484 bp ss-mRNA ROD 14-AUG-1990 DEFINITION R.norvegicus gastric (H+,K+)-ATPase beta-subunit mRNA, complete cds. ACCESSION M35535 KEYWORDS (H+,K+)-ATPase beta-subunit. SOURCE R.norvegicus gastric mucosa oxyntic cell, cDNA to mRNA, clone RG4. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1484) AUTHORS Canfield,V.A., Okamoto,C.T., Chow,D., Dorfman,J., Gros,P., Forte,J.G. and Levenson,R. TITLE Cloning of the H,K-ATPase beta subunit: Tissue-specific expression, chromosomal assignment, and relationship to Na,K-ATPase beta subunits JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by V.A.Canfield, 21-JUN-1990. Author address: V.A.Canfield Yale University School of Medicine Dept. of Cell Biology P.O. Box 3333 New Haven, CT 06510 email: levenson@YALEMED FEATURES from to/span description pept 176 1060 gastric (H+,K+)-ATPase beta-subunit (E.C. 3.6.1.3) mRNA 1 1484 gastric (H+,K+)-ATPase beta-subunit BASE COUNT 363 a 457 c 362 g 302 t ORIGIN 1 ctgacttctg ggacagtgga ggacagatag cacgcaagcc ccagccctcc cttatgttta 61 tagaggcgat agcggagaac tgatagctgg ttctgatgcc tttggcctca cacagaggag 121 actataagcc ccagaggacg ctccctgggc ccagtccagg caagcaggag aggacatggc 181 agccctgcag gagaagaagt catgcagcca gcgcatggcc gaattccggc aatactgttg 241 gaacccggac actgggcaga tgctgggccg caccccagcc cggtgggtgt ggatcagcct 301 gtactatgca gctttctacg tggtcatgac tgggctcttt gccttgtgca tctatgtgct 361 gatgcagacc attgatccct acacccccga ctaccaggac cagttaaagt caccgggggt 421 aaccttgaga ccggatgtgt atggggaaag agggctgcag atttcctaca acatctctga 481 aaacagctcc tgggctggcc tcacacacac cctccacagc ttcttagcgg gctacacccc 541 agcatcccag caggacagca tcaactgttc gtctgaaaag tacttcttcc aggagacctt 601 ttctgctccg aaccatacca agttctcctg caagttcacg gcggacatgc tacagaattg 661 ctcaggcctg gtggacccca gtttcggctt tgaggaggga aagccctgct tcattattaa 721 aatgaacagg attgtcaagt tcctgcccag caacaacacg gctccccgag tggactgcac 781 cttccaggat gacccccaaa agccccggaa ggacattgaa cccctgcagg tccagtacta 841 tccccccaat ggtaccttca gtctccacta cttcccctac tacggcaaga aagcacagcc 901 ccactacagc aaccctctgg tggcggcaaa gttcctcaac gtccccaaaa acacgcaagt 961 cctcatcgtg tgcaagatca tggcggacca cgtgaccttc gacaaccccc acgaccccta 1021 tgaagggaag gtggagttca agctcacaat acagaagtaa ggagtaggcg tggctgtcca 1081 ccccagagcc tggtggaccc tgagggacca ctcttcctga ctgacatcat cggctggcca 1141 gcatgcacgg ccacttcatg gttcagagct gacaccactg cccatctgcc gacagcagga 1201 agtgctcctt cccagcactc cctgagcacc accagctttg aactgaaacc cgacgtgcgc 1261 acgcacgttt gcaatcccgt gcggttaaca caggaaccca gagtccggct accactaagg 1321 gacaacccat ctgtagggca tttctatcct gtgaccattt gtctgtcctg cactttgata 1381 tgaactatgg gtccacatca gtgtaacact ggtcaccccg gcctccagtt tgtgcttctg 1441 gggccacagc ccctaggtca ttaaaacaaa ctatagtaaa gtta // LOCUS YSCMYO2A 5675 bp ds-DNA PLN 14-AUG-1990 DEFINITION S.cerevisiae myosin-1 isoform (MYO2) gene, complete cds. ACCESSION M35532 KEYWORDS myosin-1. SOURCE S.cerevisiae (strain GRF88) DNA, clone 10-2B. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 5675) AUTHORS Johnston,G.C., Prendergast,J.A. and Singer,R.A. TITLE The S.cerevisiae MYO2 gene encodes an essential myosin for vectorial transport of vesicles JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.C.Johnston, 21-JUN-1990. Author address: G.C.Johnston Dalhousie University 7E Tupper Medical Bldg. Halifax, N.S. CANADA B3H 4H7 email: JOHNSTON@AC.DAL.CA FEATURES from to/span description pept 581 5305 myosin 1 isoform (MYO2) binding 138 171 ATP-binding site binding 443 523 actin-binding site rpt 926 981 heptad repeat rpt 1010 1086 heptad repeat BASE COUNT 1994 a 986 c 1110 g 1585 t ORIGIN Chromosome 15 right arm. 1 gatcaataaa taaataggct cgaagacgcc tcagaactcc ggtcactggt ttgtcttgtt 61 gatatacgat gtgccaagcg ccgtttctcg atgcttatct ggtttagttt acgctgttaa 121 aaccaaaacc ccaacagatt ttcgacccta acgtatgtag ggctaaaata gatattgagt 181 aggttacaat taattattgg caattgcacc tagtgacaca tttacgaaaa cgtagggcaa 241 aaactattac ccgacccagg gctattttgt gattttttcc ttttttttgt ttatgatcgc 301 gcttctcgaa aagccaaata tcagaaatcc caaacacgcc ttcatttgat acgattcgta 361 gcctgcgttt cagagatcta tcaactttgc aaggccaatc agagaacaaa aaagtctcgc 421 aaagtcattt cacttttctc gcttgaaatt attcgttcga tttctggctg cttgcttgtt 481 ttttgttttc taaggtacta ttcgacacca ttccattgga cagcgatact tataccattg 541 tacatatagg acataaaaac agcagatatt acagcgtata atgtcttttg aagtgggtac 601 acgatgctgg tatccccata aagaattggg ctggattggg gcggaagtaa tcaaaaatga 661 gttcaacgac ggcaagtacc acctggagtt acaattggaa gacgatgaaa tcgtgtccgt 721 ggacacaaaa gacttgaata acgataagga ccaatctcta ccgcttctta gaaaccctcc 781 cattttggaa gcaacggaag atttgacctc tttatcttac ttgaatgagc cagctgtttt 841 acatgccatc aaacagcgct attctcaatt gaatatctac acatactcgg gtattgttct 901 gattgctaca aacccttttg atcgtgtcga ccagctttat acacaagaca tgatccaagc 961 atatgcggga aagcgcagag gtgaactgga acctcacttg tttgccattg ccgaagaagc 1021 gtataggttg atgaaaaatg acaaacaaaa tcaaaccatt gtggtaagtg gtgaatctgg 1081 tgctggaaaa acggtttctg ccaagtatat tatgcgttat tttgcttctg tagaagagga 1141 aaattccgct actgtacaac atcaagtgga aatgtcggaa acagaacaaa agattctagc 1201 tacaaaccct atcatggaag catttggtaa tgctaagact accagaaatg acaattcttc 1261 cagatttggt aagtatctag aaattttatt cgataaggac acatctatta ttggagcaag 1321 gatccgcaca tacttgttgg aacggtccag attagtttac cagccgccaa ttgagagaaa 1381 ctaccacata ttttatcaat taatggctgg attaccagct caaaccaagg aggaattgca 1441 tcttaccgat gcctcagatt acttctacat gaaccaaggc ggtgacacca agatcaacgg 1501 tattgatgat gccaaagaat acaaaattac agtagatgca ttgacattag tcggaatcac 1561 caaggaaact caacaccaaa tatttaagat cttggccgca cttctgcata tcggtaacat 1621 agaaattaaa aaaactagaa atgatgcatc actatcagct gatgagccaa acctgaaact 1681 ggcgtgcgaa ttgctgggaa ttgatgccta caactttgcc aaatgggtca ccaaaaagca 1741 gatcattaca aggtcagaga aaattgtttc gaatctaaat tatagtcaag ctctggttgc 1801 caaagattcc gtggctaagt ttatttattc cgcccttttc gattggcttg tggaaaatat 1861 caacaccgtg ttatgcaacc cggctgtgaa cgaccaaatt agctcattta ttggtgttct 1921 ggatatttat gggtttgaac attttgaaaa aaattcattt gaacaatttt gtattaacta 1981 tgccaacgaa aaactacaac aagagttcaa ccaacatgtt ttcaaattag agcaagaaga 2041 atacgttaaa gaagaaattg aatggtcttt tatagagttt aatgataatc aaccttgtat 2101 tgatctgatt gaaaacaagt tgggtatttt atcactgctt gacgaagaaa gtaggttacc 2161 tgctggttcc gacgaatctt ggacccaaaa actttatcaa actttggata aatctcctac 2221 gaacaaagta ttttctaaac caagattcgg gcaaactaaa tttatcgtga gccattatgc 2281 tctagatgtc gcttatgatg tggaaggatt tattgaaaaa aatagagaca ccgtatctga 2341 cggacatttg gaagtgttga aggcttctac caacgagaca ctaataaata tcttagaggg 2401 attagaaaaa gctgccaaaa aactggaaga agcgaaaaag cttgaattag agcaggctgg 2461 cagtaaaaag ccaggtccga taagaacggt taacaggaaa cccactttag gttccatgtt 2521 taagcaatct ttgattgaac taatgaatac catcaactca actaatgttc attatattcg 2581 ttgtataaag cctaatgcag ataaagaagc ttggcaattt gataatttga tggtgttgtc 2641 tcaactcaga gcctgtggtg ttttggaaac tattagaata tcttgtgctg ggtttccttc 2701 taggtggact tttgaagaat ttgtattaag atattacatc ttgataccac atgagcagtg 2761 ggacctaatc ttcaaaaaaa aggaaactac agaagaagat atcatatcag tggttaaaat 2821 gatcctagat gctactgtaa aggacaaatc caagtaccag attggtaata caaaaatttt 2881 cttcaaagca ggtatgcttg catatctgga aaaacttaga agcaataaga tgcataattc 2941 aattgttatg atccagaaga aaattagagc taaatattac cgtaagcagt atttgcaaat 3001 atctcaggcc atcaagtatt tgcagaacaa catcaaaggt ttcatcattc gtcaacgcgt 3061 taatgatgaa atgaaagtta actgtgcaac tttattacag gccgcttaca ggggtcattc 3121 catccgtgcc aatgtgttca gcgtattgag aacaattaca aatttgcaaa agaaaattag 3181 aaaggaacta aaacaaagac aactgaaaca agaacatgaa tataatgctg cggtaactat 3241 tcaaagtaaa gttaggacct ttgagccgag atcgagattt ttacgcacta aaaaagacac 3301 tgttgttgtc caatctttga tcagaagaag agctgctcaa aggaaattga aacaattgaa 3361 ggcagacgct aaatcagtta atcatctgaa agaagtgagc tataaattag agaataaagt 3421 gattgaactg acgcagaatc tagcatccaa ggtcaaagaa aataaagaaa tgacagaaag 3481 aattaaagaa ctacaggttc aagtggaaga aagtgccaag ttacaagaga cattagaaaa 3541 tatgaaaaaa gagcacttaa tagatattga taatcagaaa tctaaggata tggaattaca 3601 aaaaactatt gagaacaatt tgcaatccac tgaacaaact ctaaaggacg ctcaattaga 3661 gttggaggac atggttaaac aacatgatga attgaaagaa gaatctaaaa agcaacttga 3721 agaattagag caaacaaaga aaacattggt tgaataccag acattaaacg gagacttgca 3781 aaacgaagtt aaatctttaa aggaagaaat tgctaggtta caaactgcca tgtcgctggg 3841 caccgttact actagtgtac tacctcaaac accattaaag gatgtaatgg gaggcggtgc 3901 ttcaaatttc aacaatatga tgcttgagaa ttccgactta tctcctaatg atttgaatct 3961 aaagtctaga tctactccat cgtccggaaa caaccacatt gattcattga gtgtcgatcg 4021 cgaaaatggt gtcaatgcta cacaaatcaa tgaagagtta tacaggttat tggaggacac 4081 tgaaattttg aatcaagaaa tcacggaagg cctgttaaag ggattcgaag taccggatgc 4141 tggtgtagct attcaactaa gtaaaagaga cgttgtttat ccggctagaa tactgattat 4201 agttttaagt gaaatgtgga gatttgggct gaccaagcaa agtgaaagct ttcttgccca 4261 agtattgact acaattcaaa aagttgtcac tcaattgaag ggtaacgatt taattccaag 4321 cggtgtattc tggttagcaa acgttagaga gttatactca tttgtggtgt ttgctctaaa 4381 ctctatttta accgaagaaa cgttcaaaaa cggcatgacc gatgaggagt ataaggagta 4441 tgtttcattg gtcacagaac taaaggatga tttcgaagct ctaagttata atatatataa 4501 catttggctg aagaaattgc agaagcaatt gcaaaaaaag gccatcaatg ctgtggtcat 4561 ctccgaatca ttaccaggtt tcagcgcggg agaaaccagc gggtttttga acaaaatttt 4621 tgctaacact gaagaatata caatggacga cattttgacc tttttcaaca gcatatactg 4681 gtgcatgaaa tcttttcata ttgagaatga agtgttccat gctgtagtca caaccttatt 4741 gaattatgtg gatgcaattt gttttaacga attaatcatg aaacgtaatt tcttgtcgtg 4801 gaaaaggggt cttcaattga actacaacgt tactagatta gaggaatggt gcaagacgca 4861 tggcttgaca gatggtactg agtgcttaca acatttgatt cagaccgcta agctactgca 4921 agtccgtaag tatactatcg aagacattga tatcttaaga ggaatttgtt attcgctaac 4981 acctgcacaa ttgcaaaaat tgatttcaca ataccaggtg gcagactatg agtctccaat 5041 tccacaggaa atcttaagat acgttgctga tatagttaag aaagaagctg cgttatcttc 5101 atcaggtaat gattctaagg gtcacgagca tagcagcagt atatttatca ctccagaaac 5161 aggtccattt actgacccat tcagtttgat aaagacaaga aaatttgacc aagtagaagc 5221 ctatatacca gcgtggttat ccttgccctc aactaagaga atagttgacc ttgttgccca 5281 acaagtcgtt caagacggcc actaaaactg atggcgcgag aaacaaaatt gtacatgaat 5341 gctaaaaaaa gaaatgacaa aaaaagagaa aaaaaaaaat gaaactacat agttaattaa 5401 taatagaagt atttgtcaat agtatgataa tgaaatcgat attatggaag atattaaccg 5461 cgcgccgtat tagtgtacac tatattaaac tacattttgc ttcttactga atttataaat 5521 tatgattata ttattattac tattatgact actgtatata tttttagaat tagatcggga 5581 accgatgagc gttagctgaa atggacgacg ataaggaacg ataattacca ctagtaaaat 5641 aataacaact aagaataaac acattctcat tttta // LOCUS BSURPOF 3078 bp ds-DNA BCT 14-AUG-1990 DEFINITION B.subtilis minor sigma-37 factor of RNA polymerase (rpoF, sigB), complete cds. ACCESSION M34995 KEYWORDS RNA polymerase; minor sigma-37 factor. SOURCE B.subtilis (strains 168 and Marburg), clones lambda-[21,gt11,1,11,21,31]. ORGANISM Bacillus subtilis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 368 to 2297) AUTHORS Duncan,M.L., Kalman,S.S., Thomas,S.M. and Price,C.W. TITLE Gene encoding the 37,000-dalton minor sigma factor of Bacillus subtilis RNA polymerase: Isolation, nucleotide sequence, chromosomal locus, and cryptic function JOURNAL J. Bacteriol. 169, 771-778 (1987) STANDARD full staff_review REFERENCE 2 (bases 1 to 3078) AUTHORS Kalman,S., Duncan,M.L., Thomas,S.M. and Price,C.W. TITLE Similar organization of the sigB and spoIIA operons encoding alternate sigma factors of Bacillus subtilis RNA polymerase JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.W.Price, 18-FEB-1987. Draft entry and computer-readable sequence for [2] kindly submitted by C.W.Price, 07-JUN-1990. FEATURES from to/span description pept 200 529 ORF V pept 526 1008 ORF W pept 968 1762 37 kd minor sigma factor (rpoF, sigB; ttg start codon) pept 1762 2361 ORF X mRNA 131 2368 sigB operon mRNA signal 131 168 sigB operon promoter signal 2368 2401 sigB transcription termination signal binding 188 194 ORF V ribosomal binding site binding 511 519 ORF W ribosomal binding site binding 954 961 sigB ribosomal binding site binding 1750 1753 ORF X ribosomal binding site BASE COUNT 999 a 575 c 755 g 749 t ORIGIN 40 degrees on B.subtilis map. 1 ctgcagaagc tcattgagga acatatgtgt tcctctgcgc aggaaatggt caaaaacatt 61 tatgacagcc tcctcaaatt gcaggatttt cagcttcacg atgattttac gttaattgtt 121 ttgcggagaa aggtttaacg tctgtcagac gagggtataa agcaactagt gatttgaagg 181 aaaatttgag gtgatacgaa tgaatataaa tgttgatgtg aagcaaaacg agaatgatat 241 acaagtaaac attgcaggag aaattgatgt atactcagcc ccggtgctta gagagaagct 301 cgttcctctg gcagaacaag gagctgactt aagaatttgc ctgaaagatg tcagctacat 361 ggacagtacc ggattgggcg tttttgtagg gacctttaaa atggtgaaaa aacaaggtgg 421 ttcgctgaaa cttgaaaatc tttctgaacg gctgatccga ctgtttgaca ttacaggctt 481 gaaggacatc attgatattt ctgcaaagtc agaaggtgga gtgcaatgaa gaataatgct 541 gattacatcg aaatgaaagt gccggcccaa cctgaatatg tgggaattat aagactgacg 601 ctgtcagggg tcgcaagcag aatgggctat acgtacgatg aaattgaaga cttgaaaatc 661 gcagtcagtg aggcgtgcac aaatgcggtt cagcacgctt acaaagaaga taaaaatggg 721 gaagtgtcaa tacgattcgg tgtgtttgaa gaccgtttag aggttattgt ggcggatgaa 781 ggagacagct ttgactttga tcaaaagcag caggatctag ggccgtacac accttcgcac 841 acagttgatc aattatcaga aggagggctc ggtctatatt taatggaaac gctcatggat 901 gaagtcagag tgcaaaacca ctccggcgtc accgtagcga tgacaaagta tttaaatggg 961 gagcgagttg atcatgacac aaccatcaaa aactacgaaa ctaactaaag atgaagtcga 1021 tcggctcata agcgattacc aaacaaagca agatgaacaa gcgcaggaaa cgcttgtgcg 1081 ggtgtataca aatctggttg acatgcttgc gaaaaaatac tcaaaaggca aaagcttcca 1141 cgaggatctc cgccaggtcg gcatgatcgg gctgctaggc gcgattaagc gatacgatcc 1201 tgttgtcggc aaatcgtttg aagcttttgc aatcccgaca atcatcggtg aaattaaacg 1261 tttcctcaga gataaaacat ggagcgttca tgtgccgaga cgaattaaag aactcggtcc 1321 aagaatcaaa atggcggttg atcagctgac cactgaaaca caaagatcgc cgaaagtcga 1381 agagattgcc gaattcctcg atgtttctga agaagaggtt cttgaaacga tggaaatggg 1441 caaaagctat caagccttat ccgttgacca cagcattgaa gcggattcgg acggaagcac 1501 tgtcacgatt cttgatatcg tcggatcaca ggaggacgga tatgagcggg tcaaccagca 1561 attgatgctg caaagcgtgc ttcatgtcct ttcagaccgt gagaaacaaa tcatagacct 1621 tacgtatatt caaaacaaaa gccaaaaaga aactggggac attctcggta tatctcaaat 1681 gcacgtctcg cgcttgcaac gcaaagctgt gaagaagctc agagaggcct tgattgaaga 1741 tccctcgatg gagttaatgt aatgatccag gttgaagaaa acgagcatat tcaaactcta 1801 gtatatcaac tgaataaaga agggaaatcg atttgcggtg acagtttttt tatgaaagct 1861 gatgataagg agttaatttg cgcggttgct gacggactgg gaagtggatc acttgctaac 1921 gaatcctctg cagccatcaa agacttagtg gaaaactatg cgagtgaaga cgtagaaagc 1981 attatcgaac gctgtaatca ggcgatgaaa aacaaaagag gcgctacagc atctatcctg 2041 aaaatcaatt ttgagcaaag gcagtttacg tactgctctg tcggaaatgt acggtttatt 2101 ctgcattccc cgtctggtga aagcttttat cctctgccga tttcgggcta tttatcaggc 2161 aagccgcaaa aatacaaaac gcacaccgcc acctatgaaa agggttcaaa gttcattata 2221 catacagatg gactcaacgt acctgatatc cgctcccatt tgaaaaaagg ccaatcggta 2281 gaagaaatat cgaattcgct gaaaatgtat acgacatcga gaaaagatga cctgacctat 2341 attctcggac agctgtccta aaaaaccaga aaaagaagct ggacatccgg cttctttttt 2401 ttgcggttga ctattaataa gtgcgcttat tataattaat aagtgaactt atcattctgg 2461 gagcttatgg gatgtggaat tatgaataca cagcgatagc tgatgtggat gtgggaaaac 2521 ctggaactgt tatcaggaaa cagaaacaat ggatcaatgg gagacgggta ttgcatcaga 2581 ttctttgaaa ggacaatttc aaaatggaac ggaaggaaca ttgcttctgc aaggaaaaac 2641 aattctctta tggcttgagc atgtggagaa acatcgggcg ttttcttgtc atggtcaaca 2701 tgccccgcat agcaatgtaa taatggagtt ttttcacgtg cttcaaaagc aaaaaagaag 2761 atttgtgtat cacatatgga attgtattag aaggcgctga agcaaaaaag tggggagaaa 2821 tcattgggac tgaactgtct aaagacatgc cgacggctgt cagcaggctt gtccatcttt 2881 acggaggagt gataaaatga acacaaatta ctggatcggc gttgtgtcag agcaacatgt 2941 attaaaaggc gctgccggcg gttttgcaca gctgtgccat ggaaaaaaag ccccgctcgc 3001 caaaatgaaa gagggagatt ggctgattta ttattctccg agagacgcat atccagacgg 3061 caaactgctg cgaagctt // LOCUS HUMERPR 1624 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human erythropoietin receptor mRNA, complete cds. ACCESSION M34986 KEYWORDS erythropoietin receptor. SOURCE Human cell line OCI-M1, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1624) AUTHORS Ehrenman,K. and St John,T. TITLE Sequence of the human erythropoietin gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.St.John, 07-JUN-1990. Author address: T.St.John Hutchinson Cancer Res Center 1124 Columbia St. Seattle, WA Email: TSTJOHN@FHCRCVAX FEATURES from to/span description pept 1 1527 erythropoietin receptor precursor sigp 1 72 erythropoietin receptor signal peptide matp 73 1524 erythropoietin receptor BASE COUNT 276 a 537 c 498 g 313 t ORIGIN 1 atggaccacc tcggggcgtc cctctggccc caggtcggct ccctttgtct cctgctcgct 61 ggggccgcct gggcgccccc gcctaacctc ccggacccca agttcgagag caaagcggcc 121 ttgctggcgg cccgggggcc cgaagagctt ctgtgcttca ccgagcggtt ggaggacttg 181 gtgtgtttct gggaggaagc ggcgagcgct ggggtgggcc cgggcaacta cagcttctcc 241 taccagctcg aggatgagcc atggaagctg tgtcgcctgc accaggctcc cacggctcgt 301 ggtgcggtgc gcttctggtg ttcgctgcct acagccgaca cgtcgagctt cgtgccccta 361 gagttgcgcg tcacagcagc ctccggcgct ccgcgatatc accgtgtcat ccacatcaat 421 gaagtagtgc tcctagacgc ccccgtgggg ctggtggcgc ggttggctga cgagagcggc 481 cacgtagtgt tgcgctggct cccgccgcct gagacaccca tgacgtctca catccgctac 541 gaggtggacg tctcggccgg caacggcgca gggagcgtac agagggtgga gatcctggag 601 ggccgcaccg agtgtgtgct gagcaacctg cggggccgga cgcgctacac cttcgccgtc 661 cgcgcgcgta tggctgagcc gagcttcggc ggcttctgga gcgcctggtc ggagcctgtg 721 tcgctgctga cgcctagcga cctggacccc ctcatcctga cgctctccct catcctcgtg 781 gtcatcctgg tgctgctgac cgtgctcgcg ctgctctccc accgccgggc tctgaagcag 841 aagatctggc ctggcatccc gagcccagag agcgagtttg aaggcctctt caccacccac 901 aagggtaact tccagctgtg gctgtaccag aatgatggct gcctgtggtg gagcccctgc 961 acccccttca cggaggaccc acctgcttcc ctggaagtcc tctcagagcg ctgctggggg 1021 acgatgcagg cagtggagcc ggggacagat gatgagggcc ccctgctgga gccagtgggc 1081 agtgagcatg cccaggatac ctatctggtg ctggacaaat ggttgctgcc ccggaacccg 1141 cccagtgagg acctcccagg gcctggtggc agtgtggaca tagtggccat ggatgaaggc 1201 tcagaagcat cctcctgctc atctgctttg gcctcgaagc ccagcccaga gggagcctct 1261 gctgccagct ttgagtacac tatcctggac cccagctccc agctcttgcg tccatggaca 1321 ctgtgccctg agctgccccc taccccaccc cacctaaagt acctgtacct tgtggtatct 1381 gactctggca tctcaactga ctacagctca ggggactccc agggagccca agggggctta 1441 tccgatggcc cctactccaa cccttatgag aacagcctta tcccagccgc tgagcctctg 1501 ccccccagct atgtggcttg ctcttaggac accaggctgc agatgatcag ggatccaata 1561 tgactcagag aaccagtgca gactcaagac ttatggaaca gggatggcga ggcctctctc 1621 agga // LOCUS HUMGFP40H 4379 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human P40 T-cell and mast cell growth factor (hP40) gene, complete cds. ACCESSION M30135 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor; megakeryoblastic leukemia cell growth factor. SOURCE Human (cell line CESS) DNA, clone lambda-H40.3a1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4379) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 12-DEC-1989. FEATURES from to/span description pept 796 909 P40 T-cell and mast cell growth factor precursor, exon 1 1022 1057 P40 T-cell and mast cell growth factor precursor, exon 2 1140 1172 P40 T-cell and mast cell growth factor precursor, exon 3 2461 2592 P40 T-cell and mast cell growth factor precursor, exon 4 4096 4215 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 796 849 P40 T-cell and mast cell growth factor signal peptide matp 850 909 P40 T-cell and mast cell growth factor 1022 1057 P40 T-cell and mast cell growth factor 1140 1172 P40 T-cell and mast cell growth factor 2461 2592 P40 T-cell and mast cell growth factor 4096 4212 P40 T-cell and mast cell growth factor IVS 910 1021 hP40 intron A IVS 1058 1139 hP40 intron B IVS 1173 2460 hP40 intron C IVS 2593 4095 hP40 intron D signal 742 747 TATA box signal 4354 4359 polyA signal rpt 2965 3265 Alu repeat binding 370 376 AP-2 binding site binding 624 629 IRF-1 binding site binding 627 633 AP-1 binding site BASE COUNT 1208 a 921 c 904 g 1346 t ORIGIN 1 ctagtatgta gtaagttctc agtaaatgtt agctactata ctctttcaag tgctgggttt 61 ttacttgatg tcatacagtg ttatataaga tctccaaaga tactgaggag tcctcaaggc 121 caattttaac aagcatggtt gccgcattct tgtgcttata gttgaacatt tcttctttca 181 gacacttgca caaagggata cttctaagat gcatttgcat taggtggcaa acttcatcct 241 gggtatgaaa aacattgaga tttgggaata aagcatagta agactgaggt tgcaattact 301 aaaggaaaac cccaacagag ataagtgaag ttctgcaata tcatgcaccc tcccccaacc 361 cgctctgtct ccccaggccc cccttcgtta gaacacccat gactggctat attatatcag 421 catttcccat aatgtaaaaa gggaaaatac agacctgggc gttcatggaa agtattctaa 481 ctctcacaac cagaatccct gtctttgaat tttttttctt ggtttttaga tctttaactt 541 ttccttcagc atttcagtac tcaacttttt gaaaatcatc ttttctgagg aatgatattt 601 cctggcacag catcatctct gtcaagtgac tcagtttgat ttttttgttt gttagtataa 661 agtggcccca acttacagag aaaaagtggg ctcttggtat cagtttgatg tcagggtttt 721 tccgtgtttg agagggagct ttaaatacca ctcgatttga aggtgtctgc aagcgagctc 781 cagtccgctg tcaagatgct tctggccatg gtccttacct ctgccctgct cctgtgctcc 841 gtggcaggcc aggggtgtcc aaccttggcg gggatcctgg acatcaactt cctcatcaac 901 aagatgcagg taggctgcag ggggagccca tgggaaagac agctactgac aaagtgaaat 961 atgtatgagg atgaaaaaac tcggggctga ctaaaggttc ttatctctct atctacttta 1021 ggaagatcca gcttccaagt gccactgcag tgctaatgtg agtgaatgct ctttaagaac 1081 tttccaaatt aattttaatt ttcacatctg gaatcttcac tctgaaattt cccttgcagg 1141 tgaccagttg tctctgtttg ggcattccct ctgtaagtat agtgaaataa cataatgttg 1201 accttggatt tttttggttt gtttttaagt aaaaataagt tgctttattt aatatttaat 1261 gttatacatt gttgcttaat ttaattgtta cagattagta ttccctgtta aaaccacatt 1321 gttacaaatt attccctttt aaaactacga tcttgaaatc ctatattatg aacatttctt 1381 tgtatttaat taactttatg cctcttgaga agtttgaaca cttttcaaca ttaaaaaaag 1441 aatcctgaat atctttttag ataggtggcc atgtgcacaa ttaaataaaa ctggaactaa 1501 ggatataata attgctgtag ctcatatcat attgctttct aactcattta ctgataactc 1561 tagagttgtg aaacaatgta aataaaatga caactcctta tctttcatct gtcatgaatg 1621 atctatgcgc tatacctccc cctccctgcc tcctcccttc ctccccacca ccctgttgtc 1681 tgtctagctg attagagtga ctgttggttt gaatgctgcc ctctgggcag gtagaggatc 1741 tgaggttgtg agtggaagga gggcttccag agggccactg cccactacgg caggaaggat 1801 gggtggcagg aaagttctga ttcctaattc aaactcctgg ttagggtgag gaggaggcac 1861 ttctccaagg tgcagtgctt tattctttct catgcaaggc ctgggagaat ctgaagaatc 1921 tgagcttctt gccctggcta gggtaagaca tcgcacccat cgcggtccat ccattagatg 1981 agaagaggat agagtgcctt ctgggcagga accaggcaga cagcacagcc cctgtccctt 2041 ggagtacagt ccatgttttt agctgctgct gaaataccag ctgcattcaa ttgtcacatc 2101 ccattagctg gtgtgaaaag gcttttcctc actctgcact ttcagactta caagccttga 2161 agccgggaag cacccgttga aaagaacatt cagagccgac tatttcaggg cccagagccc 2221 tcatgtttcc tggatgtaac atacaggaag tctcctccag gggatgtcac tgtggaaaaa 2281 tggcatcccc tttaaatacg ggagatcact tcctacattg gcaagggacc tgtctaaaaa 2341 taatgcaagt ttgagtaatg gtgattaaat aaaaatcatc tctattatat tgctctttgt 2401 gatatatttc caaagctgtc ctcagaatat ttctttgaat aaatccttac tatttaccag 2461 gacaactgca ccagaccatg cttcagtgag agactgtctc agatgaccaa taccaccatg 2521 caaacaagat acccactgat tttcagtcgg gtgaaaaaat cagttgaagt actaaagaac 2581 aacaagtgtc cagtaagttt gttttcatat gtgatatgtt cctgttggtg atttctatgt 2641 gaatggtgat gccaaccctg tttgaacgca aaaggatgat aaagttggaa ttggtagttc 2701 aaggttgata aaagacatct aagaatttta atcagaagta atataattaa agtgagatcc 2761 actgaaacaa tagaattaaa gtgagataga tcattgttcc tgacgaggcc atttacttct 2821 ctctactatg gaataatgaa agaatccttt ctgagtgtaa ttagaagcta caatctagag 2881 aatcagggat gtagctcaca taatactaaa ttatcctaga gattcaatgt actaactgaa 2941 tggatgttgt taacagggat ttttttttcc tgttggttaa ggaggttttg ttttgttttg 3001 gagacagagt cttgctctgt tgcccaggct ggagtgcagt ggtgccatct gagctcactg 3061 cagcctctgc ctcccgggtt caagtgatta tcctgcctca gcctcccgag tagctggcat 3121 tacaggtgcg tgccaccatg cctggctaat ttttgtattt ttaatagaga tggggtttca 3181 ccatgttggc caggttgctc tccaactcct gaactcaagt gatttgcccg ccttgacctc 3241 ccaaagtgct gggatgacag gtgtgagcca ccatgcctgg cctgcattaa ggaggtattt 3301 aaagggcaat gcacccaggt caaggtggaa gcttgctact catcctgaat gcccatccac 3361 acattctttt cttcagcata taccctagtc cctgacagca gactgggatg gcaagttggg 3421 tagaggtgac ctccctctgt tttttgggta ttagcatctc cacacaagat cctagaaggc 3481 tgaaagccct gagctcagct gtttagctgc atgcgtttct accatcaatg gcatctagtt 3541 ctaagtgctt aatatatgct gtctcactga ataaatacat accttaggga caattattca 3601 atttattact ctcagtgagg ttaactaatt tgcctaaggc tgcatatttg ataagtggca 3661 gagctgagat ttgaactcag gcctatatga cctcagagcc ccactcttag ccattgtact 3721 gtcaaatgac cttggaaaga caacctaaaa ggataatgat acaattttag gcctcaaaga 3781 gtccccagaa aaggctttct ctaatgcaga gatttagggc cacttaatag gggtgtgtgt 3841 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaagaccc ctgaaatcca atttgaggtc 3901 aaccacctat gctgtcttta caccacatga gctagcctgg acctgcccac ctatttgctc 3961 tgtgtctcaa gccacttccc ttcccatccc cacaatcctc accaccgact ctggctcttg 4021 gcaggtaggc ttctggggct gcttggctct acatcatttg agtcactctg tccttatcaa 4081 ctttcatccc cacagtattt ttcctgtgaa cagccatgca accaaaccac ggcaggcaac 4141 gcgctgacat ttctgaagag tcttctggaa attttccaga aagaaaagat gagagggatg 4201 agaggcaaga tatgaagatg aaatattatt tatcctattt attaaattta aaaagctttc 4261 tctttaagtt gctacaattt aaaaatcaag taagctactc taaatcagta tcagttgtga 4321 ttatttgttt aacattgtat gtctttattt tgaaataaat acatatgtgg aaaaaacaa // LOCUS MEFDNABP 1710 bp ds-DNA BCT 14-AUG-1990 DEFINITION M.fervidus DNA binding protein HMfB gene, complete cds. ACCESSION M34778 KEYWORDS DNA binding protein HMfB. SOURCE M.fervidus DNA. ORGANISM Methanothermus fervidus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (sites; for [2]) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. TITLE HMf, a DNA binding protein isolated from the hypothermophilic archaeon Methanothermus fervidus, is most closely related to histones JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5788-5791 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 1710) AUTHORS Sandman,K., Krzycki,J.A., Dobrinski,B., Lurz,R. and Reeve,J.N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by K.Sandman, 01-JUN-1990. Author address: K.Sandman Department of Microbiology Ohio State University 484 W. 12th Ave. Columbus, Ohio 43210 E-mail KENDRICK-K@OSU-20.IRCC.OHIO-STATE.EDU FEATURES from to/span description pept 578 787 DNA binding protein HMfB pept 1662 805 (c) ORF285 signal 450 457 promoter signal 523 530 promoter signal 790 814 transcription termination signal BASE COUNT 619 a 263 c 265 g 561 t 2 others ORIGIN 1 bp upstream of HindIII site. 1 aagctttgag agcaatatat gaatccaatn gaactgccct aancgtcagt gatgaagaaa 61 tacttaaggc tcaaagacta ttagcaaaaa aaagaggtat aggtgttgag ccagcatcag 121 ctgcaagtat tgcagactaa tcaaattaat agatgatggc tatttagatc atggagaaag 181 agttgtatgc gtattgacag ggcatcttct aaaagatcca aaaattgcta taaatgcttg 241 taaaaagcct ttagaagtag aaccagattt taatttacta aaaaagatta ttggataaat 301 agaaaatttt attctataaa aaaacgcctt tgatacaata tctatattat ccatttatat 361 ggctatcaaa gcacaatata aactttttat ggttaataag aatttcacaa ccaccatcaa 421 aataaaacat caacgatgca aacaataaat ttatatagga taaatttgat aatattcttt 481 cgtaagaata gagaggaaaa catcaacgat gcaaacaata aatttatata ggataaattt 541 gataatattc tttcgtaaga atagagaggt ggtaagtatg gaattaccaa tagcaccaat 601 tggtagaatt ataaaagatg caggagccga aagagtcagt gatgatgcaa gaataactct 661 agcaaaaatt ttagaagaaa tgggtagaga cattgcttca gaagctataa aactagcaag 721 acacgccgga aggaaaacaa taaaagcaga ggacattgaa ctagcagttc gaagatttaa 781 gaaataatat ttttcttttt ccttttatat ttttattgca aataaaaaat catacaaaat 841 ttttgctgca ttcatcgctg cggggtcgcc tattctatct gttgcaactt caacaatgtc 901 aatgccaata attttttttc tagctatttt ttcaatgaat ttttccatat ggtagggagt 961 gatacctaca ggcgttggat ttccaacacc tggagcataa ccaggatcta atacatctat 1021 atccactgtt acatagaatg gtccatctaa tttatttatt tcatttaaaa ccatgttaaa 1081 tttttcaatt atgtctcttg atgtatagta tttgatattt tcattaagaa caaattcatg 1141 ttcttctttt gtacaagatc ttacaccaat ttggacaatt ttcttcggat gtaattcata 1201 tattcgtcgc atgactgttg catgcgaaaa ctttttacca gcataagtat ttgccatatc 1261 catatgtgca tcaaagtgta gaatagttac atcacttaga tcaaatgttt tcacaacacc 1321 atatgttatg ctatgttctc caccaaggac aataggaatc attccttttt ttaaaacttc 1381 ttgaactttt tcagaaatat tacgacaagt atttttaaag tcacctagaa ttacctcaat 1441 atcacctaaa tcaatgatag gaacatctag tttttttcta tagtgtaaac tatagttttc 1501 aaaattgtag gatgcttgac gaatcatcaa tggtccaaat cttgaccctg gcttatatgt 1561 ggatgtgcta tcaaaaggta ctcctaacaa tccaaaaact tttttattag aaaggtattt 1621 aagggaggaa gggtccttag aatatgcaaa ttttagaccc atattattta actcttaaaa 1681 gttttgtttt acccattgct tctaaatatt // LOCUS MUSINT1A 5607 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds. ACCESSION K02593 M34750 KEYWORDS Wnt-1 oncogene; int-1 oncogene; oncogene; proto-oncogene. SOURCE Mouse (BALB/c) DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1098 to 5606) AUTHORS Van Ooyen,A. and Nusse,R. TITLE Structure and nucleotide sequence of the putative mammary oncogene int-1; Proviral insertions leave the protein-encoding domain intact JOURNAL Cell 39, 233-240 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 5607) AUTHORS Nusse,R., Theunissen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A., Otte,A., Schuuring,E. and van Ooyen,A. TITLE The Wnt-1 (int-1) oncogene promoter and its mechanism of activation by insertion of proviral DNA of the mouse mammary tumor virus JOURNAL Mol. Cell. Biol. 10, 4170-4179 (1990) STANDARD full staff_review COMMENT Draft entry and computer readable sequence for [2] kindly submitted by R.Nusse, 31-MAY-1990. FEATURES from to/span description pept 1779 1882 Wnt-1 protein (exon 1) 2452 2705 Wnt-1 protein (exon 2) 3279 3544 Wnt-1 protein (exon 3) 4002 4490 Wnt-1 protein (exon 4) IVS 1883 2451 Wnt-1 cds intron A IVS 2706 3278 Wnt-1 cds intron B IVS 3545 4001 Wnt-1 cds intron C site 1133 1134 MMTV insertion site in tumor 17 [1] site 1394 1395 MMTV insertion site in tumor 35 [1] site 1631 1632 MMTV insertion site in tumor 102 [1] site 4495 4496 MMTV insertion site in tumor 53 [1] mRNA 1428 5398 Wnt-1 mRNA (alt.) mRNA 1595 5398 Wnt-1 mRNA (alt.) signal 1562 1566 TATA box signal 1573 1577 TATA box BASE COUNT 1188 a 1681 c 1519 g 1219 t ORIGIN Chromosome 15. 1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc 61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac 121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac 181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa 241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg 301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg 361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac 421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct 481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact 541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac 601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag 661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg 721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga 781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca 841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact 901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc 961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca 1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg 1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga 1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg 1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg 1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc 1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc 1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg 1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt 1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa 1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag 1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact 1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt 1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc 1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg 1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg 1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag 1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct 2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc 2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct 2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt 2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg 2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg 2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg 2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt 2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga 2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat 2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg 2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa 2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc 2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag 2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg 2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg 2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc 3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag 3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag 3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact 3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac 3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat 3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc 3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg 3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc 3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg 3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg 3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag 3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag 3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt 3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca 3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac 3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt 3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc 4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc 4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc 4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg 4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat 4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt 4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc 4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg 4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc 4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg 4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc 4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga 4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc 4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac 4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac 4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc 4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa 4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc 5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc 5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac 5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga 5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac 5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct 5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat 5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca 5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga 5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat 5581 gaccctaaat gagaccaact agggatc // LOCUS MUSP40M 3808 bp ds-DNA ROD 14-AUG-1990 DEFINITION Mouse P40 T-cell and mast cell growth factor (mP40) gene, complete cds. ACCESSION M30136 KEYWORDS T-cell growth factor; cytokine; mast cell growth factor. SOURCE Mouse (strain C57B1/6; haplotype H2b) lymphoid T cell line L1210 DNA, clone lambda-S40.1a. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3808) AUTHORS Renauld,J.C., Goethals,A., Houssiau,F., Merz,H., Van Roost,E. and Van Snick,J. TITLE Human P40/Il9: Expression in activated CD4+ cells, genomic organization, and comparison with the mouse gene JOURNAL J. Immunol. 144, 4235-4241 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Van Snick, 21-NOV-1989. FEATURES from to/span description pept 644 757 P40 T-cell and mast cell growth factor precursor, exon 1 846 881 P40 T-cell and mast cell growth factor precursor, exon 2 965 997 P40 T-cell and mast cell growth factor precursor, exon 3 2184 2315 P40 T-cell and mast cell growth factor precursor, exon 4 3393 3512 P40 T-cell and mast cell growth factor precursor, exon 5 sigp 644 697 P40 T-cell and mast cell growth factor signal peptide matp 698 757 P40 T-cell and mast cell growth factor 846 881 P40 T-cell and mast cell growth factor 965 997 P40 T-cell and mast cell growth factor 2184 2315 P40 T-cell and mast cell growth factor 3393 3509 P40 T-cell and mast cell growth factor IVS 758 845 mP40 intron A IVS 882 964 mP40 intron B IVS 998 2183 mP40 intron C IVS 2316 3392 mP40 intron D signal 589 594 TATA box binding 227 233 AP-2 binding site binding 475 480 IRF-1 binding site binding 478 484 AP-1 binding site BASE COUNT 1012 a 903 c 869 g 1024 t ORIGIN Chromosome 13. 1 ggatcctcaa ggccaatgct agcaagcaca gttccagcat ccacgtactt acagaaaaca 61 actcatatac tcgcatagag cgacttctca gaggtgtatg tacgaggtag aacttttgtt 121 cctaggcctg acatactgtg agtatctagc aatcaagtaa actgaggctc caatagccag 181 aggaaaaccc caatgagtga aaggcaagtc ttgctttccc ctgatatccc cagtgtgacc 241 ccttcattac cacccctgta actcactgtc tatcagcatt tctcactaat gtggagggga 301 aaacacagac ctgggctttc atggaaagtg ttttgattct cacaaccaga attcctgctt 361 ttaaaggggg ttggggctag atctttaact tttcctttag tatttcagaa cccgactatt 421 tgaagagcat cttttctgag gaatggtatt tcctggcata agacagcctt tgtcaagtga 481 ctcagactga tttttttttc aatatctcaa ttggcctcaa cttacagaga aaaaagtggg 541 cactgggtat cagtttgatg tcagggtttt tcccggtttg aagagcttta aatacagcta 601 gactggaaga tgctggtaga ctgagttcca gactcccgtc aacatgttgg tgacatacat 661 ccttgcctct gttttgctct tcagttctgt gctgggccag agatgcagca ccacatgggg 721 catcagagac accaattacc ttattgaaaa tctgaaggta gatttagggg actggggaga 781 gtcatctgct ggaggatggc aaagggggct caactcactg ggggttatct ctttgcctac 841 tgtaggatga tccaccgtca aaatgcagct gcagcggcaa cgtgagtgaa taatctttag 901 aactttccag acttctctca gtgttcacag ctagaatgct catgctgaga tttctctctt 961 gcaggtgacc agctgcttgt gtctctccgt cccaactgta agtatagtcg gaaaaaggtc 1021 aggggacact gtttacgctt tgaagagaaa tcacgcaact catttgacat tctaaattct 1081 aaattactgt ttaagtggga ctgttaaagt atagacccca cctttaaaac ttatatttat 1141 atatgcatca tacacacaac tatatgttta agctagcatc atgcccttga gaaatctgga 1201 tatttttcag tgtctaaaac gtattgacta tcttctgaga taaatgggca tgtaaatgaa 1261 aaaagaaaag aatattgctt actggttgat ctgtggtaag tccagatttg tgaaacaaac 1321 aaaatagtaa catcaggcag ggatcatcta gatatcacaa acacacacac acacacacac 1381 acacacacac acacacacac acacacacct cctttctctg gcatcttcct actctttccc 1441 cttcccagtc tctatctccc cctccttctg cttctccttg ctggtcttta gtggctccca 1501 aatggttcca actgtcctct gggaagacag tatctaagtt tggtgccgaa tgggacccta 1561 agagagcaac tcccatcctg gcaggaaggg gaaggcagga ggctctggaa ctattctaac 1621 tcctggttat gatgagggag cagtcaccca ggcctgtagc cactgcaggg ctcactcttc 1681 aagttggacc agggaaaagc tgagcttaga accccatgga agtcagaagc cccatttgct 1741 ggagttctat ggctagggga gggtccaatc taccccctcc cttctcgatg cagggttgta 1801 gcttgcggct aacaggagtc ctatgcaaat ggcttgtccc atcagctggc atgagaaaat 1861 gcttttcctt tctctggact ttcaaacact cagccactgg acctggacag ggccaccagg 1921 tgcgagacag atgcagtacc acttttgcta gggcatcagg tctacatttc ctgaggagtc 1981 tcctcaggaa tgtcactggg aaactgacct cacctcaaat tacttaagat agtttcctcg 2041 ggtggccagg aagaagagca tgagtttggg cgacactggt taaattaagt tgctctttaa 2101 aatcctcttg gcgattcttc ctgaaagcag tcctaggcag tttttattat aaatcctttt 2161 tataagtcct tgttatttac caggatgatt gtaccacacc gtgctacagg gagggactgt 2221 tacagctgac caatgccaca cagaaatcaa gactcttgcc tgttttccat cgggtgaaaa 2281 ggatagttga agtcctaaag aacataacgt gtccggtgag ttttttgtct gagtgtgaca 2341 aaagtagggc ctctgggtca gagcctgtgg cagggggcct tgagagcaac aactgtctgc 2401 tcacatccaa gcaaatgaga gaggatgata aagttagagc aggtaggccc aggctggaag 2461 ggggtcaggc aaaaaatgta accaggaaca agatcactgc agtggaacag atctgctctg 2521 gggtctgtgg ttcccctggt taaatggaga ccgttgacac ccaagtggcc aatgtcttct 2581 ctccacctgt tggatagtga aagaattcac ctaattgtaa gcaaaagcca gactcaaaag 2641 agtcagactg taactcatgc agcactaaat catcctaaaa attcacttta ctaatcgagg 2701 atgtgggagc ctctgagtca ggtttgtgtg tgtgtgttgg tttgttgaat gactgaaaag 2761 ttttgtaaaa gaaaagacac caaggtctag gtaagtactt ccacctgccc tactgactat 2821 cagcatacac cctggttcct gaggcagtgg cccgtcgctg tccgttctcc ctgtgtgtcc 2881 tttcatagga gaacaccagc agtcccaagt gctgtggttg gccagcctag tcacatatct 2941 ctgcccctag tggcacctgc cttggctatg ttacgtgagc cattaccctg agcacctgcc 3001 ttagaggcca gctttagctc tgttcgccag ctccataggg aggaatgtga tgcccgggga 3061 aagttatttt tctgatgcta ttgtatttta taagagacaa cgctgtgact taaaccctgg 3121 tctatgtgaa tccaaagtct cactcagaac tacgtgtgtg tacgtgcaca gactcctgga 3181 atcgtgaccc caaattaaca cttacatggt cttaccaaat gctagctagc ccaggatcaa 3241 ccttcttatt tccctcccat ctcattcctc tctcccctgc ccttcaactg gccaccaggt 3301 ttgagggaag ctggtcctgg gctgctagga gtgtattgct gcaaagtggg ctgcggatgc 3361 tcaccctgtt ctgcccactt ctctccctgc agtccttttc ctgcgaaaag ccatgcaacc 3421 agaccatggc aggcaacaca ctgtcatttc tgaagagtct cctggggacg ttccagaaga 3481 cagagatgca aaggcagaaa agccgaccat gaagacagat gctatttatt ctatttattg 3541 aatttacaaa acctcccctc cttaactgtt acagtgaaga aataaactaa gctattctag 3601 accaaggccc ttttgtgtcc ttttgcttca cagttgcttt tggaactatg gggagggaga 3661 agtaaaaggt atctggaata aagatccacg gggagggaga gagacaagct gtctccagac 3721 cctttccccg tgcctactcc ttcctgttgc tcagagcaaa tgaagctccc ggtgttgcat 3781 aggtaggatg gagcaaagac ttggtaat // LOCUS PIGMHDQAA 1105 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype c), complete cds. ACCESSION M29938 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype c) adult spleen, cDNA to mRNA, clone ZQ30-1. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1105) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 13 777 MHC SLA-DQ alpha chain precursor sigp 13 81 MHC SLA-DQ alpha chain signal peptide matp 82 774 MHC SLA-DQ alpha chain mRNA < 1 1105 MHC SLA-DQ mRNA BASE COUNT 254 a 290 c 255 g 306 t ORIGIN 1 cttgagaaga gcatggtccc aggccgagtt ctgatgtggg gggccctcgc cctgaccacc 61 gtgatgagcg cctgtggagg tgaagacatt gcggccgacc atgttgcctc ctatggctta 121 aatgtctacc agtcttacgg tcccagaggc tactttaccc atgaatttga tggcgacgag 181 caattctacg tggacctgga gaagaaggag actgtctggc ggctgcctct gtttagtgaa 241 tttacaagtt ttgacccgca gggtgcactg aggaacatag ctacgttaaa acataacttg 301 aacattgtga ctaaacgctc caacaacact gcggctgtca ataaggttcc tgaggtgact 361 gtgttttcca agtctccagt gatactgggt cagcccaaca ccctcatctg tcatgtggac 421 agcatctttc ctcctgtgat caacatcacg tggttgaaga acgggcactc tgtcaaaggt 481 ttttctgaga ccagcttcct ctccaaaaat gatcattcct tcctcaagat cagttatctc 541 accttcctcc cttctgatga tgatttttat gactgcaaag tggagcactg gggcctggat 601 aagccacttc tgaaacactg ggaacctgag attccagccc ccatgtcaga gctgacagag 661 actgtggtct gcgccctggg attgatcgtg ggccttgtgg gcatcgtggt gggcactgtc 721 ttcatcattc aaggcctgcg ctcaggtggt ccctctagac accaagggtc cttgtgagtc 781 acactccaga agggaaggag cactgcccgc ctacaagagc tgaagagtgg atgtgctcaa 841 cgacctagaa ctattttctg gccaaattca tcatatacct tctctcttcc tacattcttc 901 ttctcacctc ttctctggaa cttaaggtgc tgtgtcatct cagaattcat atgcccttgg 961 aatcctcttc ctgaccttct aggtcttttt cttttctcag ttgttacctg ttacgggatc 1021 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1081 aaacaataaa tcacttcatg aggtt // LOCUS PIGMHDQAB 1174 bp ss-mRNA MAM 14-AUG-1990 DEFINITION Pig MHC class II SLA-DQ alpha chain mRNA (haplotype d), complete cds. ACCESSION M29939 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Miniature pig (haplotype d) adult spleen, cDNA to mRNA, clone 16.2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1174) AUTHORS Hirsch,F., Sachs,D.H., Gustafsson,K., Pratt,K., Germana,S. and LeGuern,C. TITLE Class II genes of miniature swine: III. Characterization of an expressed pig class II gene homologous to HLA-DQA JOURNAL Immunogenetics 31, 52-56 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept 69 836 MHC SLA-DQ alpha chain precursor sigp 69 137 MHC SLA-DQ alpha chain signal peptide matp 138 833 MHC SLA-DQ alpha chain BASE COUNT 273 a 313 c 266 g 322 t ORIGIN 1 ctcagctcca tcctcatcat tgctctacaa ctccgaagag caacagctga gaccaccttg 61 agaagagcat ggtcccaggc cgagttctga tgtggggggc cctcgccctg accgccgtga 121 tgagcgcctg tggaggtgaa gacattgcgg ccgaccatgt tgcctcctat ggcttaaatg 181 tctaccagtc ttacggtccc agcggctatt atacccatga atttgatggc gacgaggaat 241 tctacgtgga cctggggaag aaggagactg tctggcagct gcctctgttt agcaaattta 301 gaagttttga cccacagggt gcactgagga acatagctac ggcaaaacat aatttgaaca 361 tcctgattaa acgttccaac aacaccgcgg ctgtcaatca ggttcctgag gtgactgtgt 421 ttcccaagtc tccagtgatg ctgggtcagc ccaacaccct catctgtcat gtggacaaca 481 tctttcctcc tgtgatcaac atcacgtggt tgaagaacgg gcactctgtc acagaaggtt 541 tttctgagac cagcttcctc tccaaaaatg atcattcctt cctcaagatc agttatctca 601 ccttcctccc ttctgatgat gatttttatg actgcaaagt ggagcactgg ggcctggata 661 agccacttct gaaacactgg gaacctgaga ttccagcccc catgtcagag ctgacagaga 721 cagtggtctg cgccctggga ttgatcgtgg gccttgtggg catcgtggtg ggcactgtct 781 tcatcattca aggcctacgc tcaggtggtc cctctagaca ccaagggtcc ttgtgagtca 841 cactccagaa gggaaggtgc actgcccgcc tacaagagct gaagagtgga tgtgctcaac 901 gacctagaac tattttctgg ccaaattcat catatacctt ctctcttcct acattcttct 961 tctcacctct tctctggaac ttaaggtgct gtgtcatctc agaattcata tgcccttgga 1021 atcctcttcc tgaccttcta ggtttttttt cttttctcag ttgttacctg ttatgggatc 1081 tctgagatat cccactcggc tacctgatcc ctcagtgacc ctaaactaat atgtacatgg 1141 aaacaataaa tcacttcatg aggtttttgt actg // LOCUS PIGMHDQBA 212 bp ds-DNA MAM 14-AUG-1990 DEFINITION Pig MHC class II SLA-DO beta chain gene (haplotype c/c), 3' end. ACCESSION M29944 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Pig (haplotype c/c) adult spleen DNA, clone D26. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 212) AUTHORS Pratt,K., Sachs,D.H., Germana,S., El-Gamil,M., Hirsch,F., Gustafsson,K. and LeGuern,C. TITLE Class II genes of miniature swine: II. Molecular identification and characterization of B (beta) genes from SLA-c haplotype JOURNAL Immunogenetics 31, 1-6 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.LeGuern, 17-NOV-1989. FEATURES from to/span description pept < 1 12 MHC SLA-DO beta chain BASE COUNT 38 a 58 c 52 g 64 t ORIGIN 1 cagccacact aaggtcctag gcgaacgtct ccccctggag cctgaagtgg aactgagtgg 61 cctgggccct ggacaaagtc aaggacgttc aggaggtcga tgttctggat gattcctttc 121 cacgagccct tggaggtgtc ctgaactgat tctagttttg tgttctgtga tcattcatct 181 cttacccttc tgcctttctt cctcctactt gt // LOCUS PP2LEND 266 bp ds-DNA PHG 14-AUG-1990 DEFINITION Bacteriophage P2 left end DNA. ACCESSION M34756 KEYWORDS . SOURCE Bacteriophage P2 DNA. ORGANISM Bacteriophage P2 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 266) AUTHORS Ziermann,R. and Calendar,R. TITLE Characterization of the cos sites of Bacteriophages P2 and P4 JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by R.Ziermann, 05-JUN-1990. Author address: R.Ziermann University of California Dept. Mol. Cell Biology Berkeley, CA 94720 Email: calendar@garnet.berkeley.edu FEATURES from to/span description site 1 19 cohesive end of phage DNA site 20 266 left end of phage DNA BASE COUNT 41 a 58 c 99 g 68 t ORIGIN Map position 0.0-0.8%. 1 ggcgaggcgg ggaaagcact gcgcgctgac ggtggtgctg attgtatttt ttcagcgtct 61 cagcgcgtcg tgacggcact tagtctgccc gttgaggcgt tgtgtgtctg cggggtgttt 121 tgtgcggtgg tgagcgtgtg aggggggatg acggggtgta aaaaagccgc ccgcaggcgg 181 cgatgttcag tcgttgtcag tgtccagtga gtagttttta aagcggatga cctcctgacc 241 gagccagccg tttatctcgc ggatcc // LOCUS RCABCHH 314 bp ds-DNA BCT 14-AUG-1990 DEFINITION R.capsulata bacteriochlorophyll biosynthesis protein (bchH and F108) genes, 3' end, and 5' end respectively. ACCESSION M34843 KEYWORDS . SOURCE R.capsulata DNA. ORGANISM Rhodobacter capsulatus Prokaryota; Bacteria; Gracilicutes; Anoxyphotobacteria; Purple nonsulfur bacteria. REFERENCE 1 (bases 1 to 314) AUTHORS Yang,Z. and Bauer,C.E. TITLE Rhodobacter capsulatus genes involved in early steps of the bacteriochlorophyll biosynthetic pathway JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.E.Bauer, 04-JUN-1990. FEATURES from to/span description pept < 1 79 bacteriochlorophyll biosynthesis protein (bchH) (AA at 2) pept 108 > 314 bacteriochlorophyll biosynthesis protein (F108) BASE COUNT 59 a 87 c 119 g 49 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccggcg acgctggcgg ctttgcaagc cgcggcggac gagctcgaag accggatgga 61 aggcgttgcg gcagaatgag atggacgact acgaaaagga gggccgcatg agcccgcgcg 121 acgatattcc cgacctgaag ggcttcgacg gtgacggcga gggctcggtg caggtgcacg 181 acagcgaaga catcggcctc gatgtcggcg gcgcgcgggt gttctcggtc tatggcaagg 241 gggggatcgg caaatcgacg acttcctcga acctctcggc cgcgttctcg cttctgggca 301 agcgcgtgct gcag // LOCUS TREANTAT 999 bp ds-DNA ORG 14-AUG-1990 DEFINITION T.evansi AnTat 3/3 kinetoplast DNA minicircle. ACCESSION M34848 KEYWORDS . SOURCE T.evansi AnTat 3/3 kinetoplast minicircle DNA. ORGANISM Kinetoplast Trypanosoma evansi Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae; Trypanosoma evansi. REFERENCE 1 (bases 1 to 999) AUTHORS Bajyana Songa,E., Paindavoine,P., Viseshakul,N., Muyldermans,S., Steinert,M. and Hamers,R. TITLE Evidence for kinetoplast and nuclear DNA homogeneity in T.evansi JOURNAL Mol. Biochem. Parasitol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by E.Bajyana Songa, 05-JUN-1990. FEATURES from to/span description site 367 516 kinetoplast minicircle common region (ORI) BASE COUNT 391 a 78 c 203 g 327 t ORIGIN 1 gaatcagtgt cttttgaggg aagtaaagta atataataga tagaaacata ataataattt 61 aatttgatag tatatacata tcaacaacga caaagagtca gtgaaattag agataaagtt 121 attgtagtta tataattaaa ttaatctatc tattatttat tttcttttat agaggagagg 181 gaataagagg gaaaattcat ggagatacta gggtgagaga gttaatagag taattgtagt 241 tgggaatatg gagtagttat aattatattg gtgaaaggaa agggctaaaa gtcgtgtagt 301 agaatagagg ttgataggaa taagtggtgg aatttgtaga aggggttggt aaaaatctat 361 aaaaattgtt aaaattggct aaaaatcggg ctgaaaaaac ggaaaatctt atgggcggtg 421 caaattttca ccatacacaa acacgtgcta ttttcggggg tttttaggtc cgaggtactt 481 cgagaggggt tggtgtaata cacacacggt ttttctcagg gtttttgagg caattcgcag 541 ttttcctgag gttctcagta cacttaattt ggatttaatt gattctatag agaaaaatag 601 aataatagat aagtaagtaa ttcatgaata tagatatata taattgtaca tataccaaca 661 aacagaataa ctaatgcaca gtgatgatga tagttaatta attatatata aagttctaat 721 ctatctatta ttatatttaa ttgagtgatg tgagaataag gtgatatttc aatcctaaac 781 aaaagaaatg tgtaatagat agaagataat gagaagttaa ttataaatat atcatacaaa 841 ataacaatga tcacagataa tgagtgaata tagatagaga atttaattat tattattgtg 901 tatattgaat tacatattta ttattttatt ttagtatata ggacgcagaa atagcagtat 961 aaaataatgg ataaaagagt ttataggtga agttgaagt // LOCUS VIBSCRRE 1524 bp ds-DNA BCT 14-AUG-1990 DEFINITION V.alginolyticus Scr repressor protein gene (scrR), complete cds, and sucrose transport (scrA) protein, 5' end. ACCESSION M35009 KEYWORDS Scr repressor protein; sucrose transport protein. SOURCE V.alginolyticus DNA. ORGANISM Vibrio alginolyticus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Vibrionaceae. REFERENCE 1 (bases 1 to 1524) AUTHORS Blatch,G.L. and Woods,D.R. TITLE Nucleotide sequence and analysis of the Vibrio alginolyticus Scr repressor-encoding gene JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.L.Blatch, 10-JUN-1990. Author address: G.L.Blatch University of Cape Town Dept. Microbiology Private Bag Rondebosch 7700 SOUTH AFRICA Email: bltgre01.uctvax@f4.n494.z5.fidonet.org FEATURES from to/span description pept 36 < 1 (c) sucrose transport protein (scrA) pept 271 555 Scr repressor protein (scrR) pept 791 1090 ORF5 binding 264 267 scrR ribosomal binding site binding 780 785 ORF5 ribosomal binding site signal 178 183 scrA -10 region signal 203 208 scrA -35 region signal 190 195 scrR -10 region signal 162 167 scrR -10 region signal 762 767 ORF5 -35 region signal 742 747 ORF5 -35 region site 78 91 repressor operator site (put.) site 229 242 repressor operator site (put.) site 12 71 ScrR helix-turn-helix motif BASE COUNT 466 a 344 c 321 g 393 t ORIGIN 1 agttaacaac tcttttgcta ctgccggata attcatttca taccccattg gattattatc 61 tgagcttgca gaattttcgg gaacgtttgc aaaacagatt tttgtcaata aaaccaaatg 121 atcaaagata aagcagagag aatgtgagaa tgatcgtcaa attctctccc taagagcagt 181 ataggaggtt agaatcgcta cttatcaaag agcctttcat cttctctttg caaacaatcc 241 caaaaacgaa aaacagaatg gataggaaca atggcgagtc tgcatgatgt cgcccgtctg 301 gctggcgtat caaaatcaac agtgtcacgg gttataaacg atgagtatgg cgtaaaagaa 361 gcgacaaaac agaaggtccg ccaagcagta gcggagtgtg gctatgtgcc gaaccaagtc 421 gctaaagatc ttaaagaaga aagctctgcc attattgttc aagacaaaac aggcaaagtg 481 ctgtttgtaa aagaaggcgc acttgagcaa gatgaaattg caaaggtgat cgagctaatt 541 aaacagaaca tctagtctcg ttgcttggct ccttaatatc cgtaatacac aatataagga 601 gccatgcgac caaacttaaa gctgagattt ctgttgcccc tagttgaacg cttagtgatt 661 ccaccaagga gactgtctca gattcggcga catcacatgt gttgacgtaa attttcccac 721 tagccaacaa aagagtgaac ttttactcat tatgggtgtt atattataac agtcggtaag 781 aggagctcag atgtaccacc accagcaaaa gatacggaag cattggcatc gcactgtttt 841 atttttcagt gtcgcgttgc tgatcgcttg gaactttgcg gtaatccttc atcaagttga 901 tctgactccc gaacaccaca cacaccatca ttgccagcta ttttctgggg ttcagcacgg 961 catagccaaa gctcaaccga ccctatcgac gccaacattt acgcgcatcc aataccatga 1021 tgtctttcag cgccttgtta atagtgaaga cattcgtggt gcagctcgtg ccccgcctta 1081 ttttgcttaa tttccaaact attccacact caacataatt tgtgcgtatt cctgcacctc 1141 acgtgtcagc gaatcgcacc ttagtaaata caaatctgat ccggtattgc tcgggttagt 1201 gccagcaaaa ttaggaaaca caaaatgtca tcgaaacata ttttagcgct tgtcattggt 1261 ctctctttat cgacagcagc aaatgcggac gaataccgcc agcacggtgc tcatgtgcat 1321 ggtcacgtcg aatttaatat cgcccaagat ggaaaggatc tgttagtgga gattactgct 1381 cccggtgccg atgtggtcgg tttcgagcat gctccagaaa atgaacaaca agagcaagca 1441 cttaaacaag ccattgccac cctagaagac agcaacacac tgtttgcgat taacgcgcaa 1501 gccaattgtg atattgaaga agct // LOCUS XELNRP1A 2431 bp ss-mRNA VRT 14-AUG-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1A) mRNA, complete cds. ACCESSION M34894 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA, clones 24-39. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2431) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 74 1117 pot. RNA-binding protein (nrp-1B) mRNA < 1 2431 nrp-1B mRNA signal 2407 2418 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence site 921 941 alternative exon (put.) BASE COUNT 663 a 522 c 543 g 703 t ORIGIN 1 gtggctacac cgccccttcc aatcctcttg tccgcgggag ctgcctccga tctcaggagt 61 cgccagagac cgaatggaga cagaagcgcc ccagcccgga ctgtcgcccc cggactccgc 121 gcacgatccc tgcaaaatgt tcattggggg actgagctgg cagacaacac aagagggact 181 gcgggagtac ttcagtcatt ttggggatgt aaaagagtgt cttgtcatga gagatccatt 241 gaccaaaaga tcaaggggtt tcggatttgt aactttcatg gaccaagcgg gagtggacaa 301 agttttggct caatcacgac atgagctgga ctcaaagacg attgatccaa aagtagcatt 361 tcctcgtaga gctcaaccca agatggtaac acggacaaag aagatttttg tagggggctt 421 gtcagtgaat acaacagttg aagatgtgaa acagtatttt gagcaatttg ggaaggtgga 481 tgatgccatg ttaatgtttg ataaaacaac aaacaggcac agaggattcg ggtttgtgac 541 ttttgaagga gaggatattg tggagaaaat ttgcgatatt cacttccatg agatcaacaa 601 taagatggtt gagtgtaaga aggcccaacc caaggaggtg atgtcaccaa caggatctgt 661 gagaggccga tctcgggtca tgccctatgg aatggatgct ttcatgctgg gaataggcat 721 gctaggatac ccgggatttc aggcagctac atatgccagt cgcagctaca caggcattgc 781 tcctgggtat acttatcagt tcccagagtt tcgtgtagag agaacccctc tcccgggcgc 841 cccagtcctc ccagaactca cagctatccc cctgacggca tatggtcctg tagcggcagc 901 agctgcagca gtggtgagag gctccactcc aacccgcaca ggggggtttc ttggaacaag 961 cagcccagga ccaatggcag aactttatgg agcagccaat caagaatcgg cagttagcag 1021 ctatatcagt gcagcaagcc cagcacccag cacaggcttt ggacacagcc ttgggggtcc 1081 tctgattgcc actgctttca ccaacggcta ccactgaaac agcaagtcac tgcagaggct 1141 tgagtctgga agcatctctt gcacgattct gttcctcccc tctctgaata tcacatggct 1201 gtacctcacc ttgtgttcct gcacagtaac tggtgcctcc tgcacaacct tcacctagga 1261 atttcttatt tatttctatt tttgttttgg ggtttttttt tcattctcct tttattagat 1321 ttgtgttttg tccaaattag aaatcaggga gacatttcca taatcttcat ggttattctg 1381 ttctggtaag atgcagagaa tcctcggtac gaaacgattc tgcagagaga gaggggagat 1441 tgtatttcga ctgtctggaa atgtgttttt tttcttcttt tttttttata tttctatgta 1501 aatttcggct cttaaaaaca tgtaaaactg aaatttggag acaacagtgc acaagaacaa 1561 gacaaatgta tattttgcta aatgaaaaag aactatttgt agaacagaaa ttattttaat 1621 tttctcaaag ctggtttgaa gttttttttt ttctgtgtaa aatatgtaaa tattgcagac 1681 tgattgttct gccaccggaa aggtctcttt ggcccagatt atcttttttg cctacttggg 1741 cctgtgtttt agcaatattt tcctaaagcc tgaatggatt gtcgggtttt tttatacatc 1801 ttttattatg atggatgtca gtccacagga atatgatgag ttactctgat gcaggggaat 1861 gcagatgcca tgctgcaaaa cttcttactc tttagcctct gataattaac tctttaaggc 1921 tcacttgtcc tttgctcatg gatataatga gagggctata tggagggcta taaggtggga 1981 ctaacctcag agaaccagaa aagctacagt aaagtgctgt tacacactga catacttcct 2041 cagaaccctc agaaaagtaa gtgtatgccc agtcccgatt tatgaactgt ctgccctttg 2101 atctgctctg cactctttat agttttgggt cctataaatt aaatgcagat ctaccttttc 2161 tttatgtgca gtgtgtaagt ccacattttt tgttatgagg aatcttctta acaataaaat 2221 catgtggaat tagcccccaa ccaaaccccc cccacaaact gtctctcacg tgagcaaagg 2281 aaggtgctac actataaaga gtttcttatc actgggccac acacattact gcaatgtaca 2341 atgtactgca acgtcaaagc aataatcttt gttactgcaa ctggactttt ttaaaagata 2401 gttcaaaata aactatgaat tttattcatt t // LOCUS XELNRP1B 2324 bp ss-mRNA VRT 14-AUG-1990 DEFINITION X.laevis nervous system-specific RNA-binding protein (nrp-1B) mRNA, complete cds. ACCESSION M34895 KEYWORDS RNA-binding protein; nervous system-specific protein. SOURCE X.laevis stage 24, cDNA to mRNA. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 2324) AUTHORS Richter,K., Good,P.J. and Dawid,I.B. TITLE A developmentally regulated, nervous system-specific gene in Xenopus encodes a putative RNA-binding protein JOURNAL New Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.J.Good, 05-JUN-1990. FEATURES from to/span description pept 29 1051 pot. RNA-binding protein (nrp-1B) mRNA < 1 2324 nrp-1B mRNA signal 2306 2311 poly-A signal site 22 27 RNP2 consensus sequence site 58 68 RNP1 consensus sequence site 111 116 RNP2 consensus sequence site 147 157 RNP1 consensus sequence BASE COUNT 647 a 467 c 520 g 690 t ORIGIN 1 tccgatctca ggagtcgcca gagaccgaat ggagacagaa gcgccccagc ccggactgtc 61 gcccccggac tccgcgcacg atccctgcaa aatgttcatt gggggactga gctggcagac 121 aacacaagag ggcctgcggg agtactttag tcattttggg gatgtaaaag agtgtcttgt 181 catgagagat ccattgacca aaagatcaag gggttttgga ttcgtaactt tcatggacca 241 agcaggagtg gacaaagttt tggctcaatc acgacatgag ctggactcaa agacgattga 301 cccaaaagta gcatttcctc ggagagctca acccaagatg gtaacacgga caaagaagat 361 ttttgtaggg ggcttgtcag tgaatacaac agttgaagat gtgaaacaat attttgagca 421 gtttgggaag gtggatgatg ccatgttaat gtttgataaa acaacaaaca ggcacagagg 481 gttcgggttt gtgacttttg aaggagagga tattgtggag aaagtttgtg atatacactt 541 ccatgagatc aacaataaaa tggttgagtg taagaaggcc cagcccaagg aagtgatgtc 601 accaacaggg tctgtgagag gccgatctcg ggtcatgcca tatggaatgg atgccttcat 661 gttgggcata ggcatgctag gatacccggg atttcaggct gctacatatg ccagtcgcag 721 ctacacaggc attgcacctg gatatactta tcagttccca gagtttcgtg tagagagaac 781 ccctctcccg ggcgccccag tcctcccaga actcacagct atccccctga ctgcatatgg 841 tcccgtagca gcagctgctg cagcagtggt gagagggggg tttcttggaa caagcagccc 901 agggccaatg gcagaacttt atggagcagc caatcaagaa tcagcagtta gcagctatat 961 cagtgccgca agcccagcac ccagcacagg ctttggacac agtcttgggg gccctctgat 1021 tgccactgct tttaccaacg gctaccactg aaacagcaag tcactgcaga ggcttgagtg 1081 tggaagcatc tcttgcacga ttctgttcct cccctctctg aatatcagat ggctgtacct 1141 caccttgtgt tcctgcacag taactggtgc ctcctgcaca accttcacct aggaatttat 1201 ttatttcttt tttttttttt tttaatcctt ctttattaga tttgtgtttt gtccaaatta 1261 gaaatcaggg agacatttcc gaaatattca tggtattttg ttttggtaag atgcggagaa 1321 tcctcggtac taaatgattt tgcagaggga gaggggagat tgtatttcgc ctgtctggaa 1381 atgtgttttt tttttttcct tctttcctat tttttttatc ttatattttt atgtaaatct 1441 cggctcttaa aaacatgtaa aactgaaatt tgaagacaac agtgcacaag aacaagacaa 1501 atgtatattt tgctaaatga aaaaaaaact atttgtagaa cagaaattat tttaattttc 1561 tcaaagctgg tttgaagttt tttttttttt tctttgtaaa atatgtaaat attgcagact 1621 gattgttctg ccaccggaag gtctctttgg cccagattat cttttgccta cttgggcctg 1681 tgttttagca atattttcct aaagcctgaa tggattgtca ggttttttta tacatctttt 1741 attataatgg atgtcagtcc acaggaatat gatgcagggg aatgcagaac ttcttactct 1801 ttagcctctg ataattaact ctttaagttt cacatgtcct ttgctcatgg atataaggag 1861 agggctgtaa agtggggcta acctcagaga atcagctcca gggcacagga tttcggtgag 1921 ctacaaagct gctgtaaaat accgttacac actgacatac ttaggaatcc tcagcaaagt 1981 aaatgtatga actgtctgct ctttgatccg ctctgcactc tttatagttt tggagtacta 2041 taaattaaat gcagatctag ttcttctata tgtgcagtgt gcaagtccat ttgtgggcac 2101 ccactctcat tatggggcat cttttcaaca ataaaatcat atagaaattc ccatgaagtt 2161 gtcgctcacg tgagataagg aagttgcaac actataaaga gtttcttatc actgggccat 2221 atacaatact gcaatgtaca atgtactgca aattcaaagc aataatcttt gttactgcaa 2281 ctggactttt ttaaagatag ttcaaaataa attgaatttt attc // LOCUS YSJSACDHY 2569 bp ds-DNA PLN 14-AUG-1990 DEFINITION Y.lipolytica saccharopine dehydrogenase (LYS5) gene, complete cds. ACCESSION M34929 KEYWORDS saccharopine dehydrogenase. SOURCE Y.lipolytica (strain W29) DNA. ORGANISM Yarrowia lipolytica Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2569) AUTHORS Xuan J,-W., Fournier,P.M., Declerck,N., Chasles,M. and Gaillardin,C. TITLE Overlapping reading-frames at the LYS5 locus in the yeast Yarrowia lipolytica JOURNAL Mol. Cell. Biol. (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.M.Fournier, 06-JUN-1990. ORF1 is probably not translated, since a lacZ fusion/expression experiment did not produce results. FEATURES from to/span description pept 649 1929 ORF1 pept 1764 655 (c) saccharopine dehydrogenase mRNA 582 2060 ORF1 mRNA (alt.) mRNA 598 2060 ORF1 mRNA (alt.) mRNA 599 2060 ORF1 mRNA (alt.) mRNA 602 2060 ORF1 mRNA (alt.) mRNA 606 2060 ORF1 mRNA (alt.) mRNA 609 2060 ORF1 mRNA (alt.) mRNA 1807 639 (c) ORF2 mRNA (alt.) mRNA 1809 639 (c) ORF2 mRNA (alt.) mRNA 1811 639 (c) ORF2 mRNA (alt.) mRNA 1812 639 (c) ORF2 mRNA (alt.) mRNA 1814 639 (c) ORF2 mRNA (alt.) signal 521 528 TATA box signal 565 575 TATA box BASE COUNT 668 a 573 c 666 g 662 t ORIGIN 1 gagtctggaa cggtccctct tctcagaatt gtgaacgaaa ggaacaggtt caacgaacag 61 aacagattcc atagccacac ctttgcctat ggcttcacaa ccgaaggcaa ttcgagaggt 121 cgcgcttatg gaatcgactc gtataaagct gaagggaaag ggagacgttc cgagcgctca 181 gatgcaatag tcgtccagct aatgtggatt caaaaacaac cccaacagta atcttgaaaa 241 tttgaacgga tcaatctgaa cactcttgct ccaggtcatt cttctaacgc acatccccag 301 agtctagagg gagttgtgtt gtgaacatcc taataaacaa tgcaatggat tcgggatatc 361 ttctgtctcg ccccctactc gatgtcgagt aaaccgatca ccaactaaca atactcctcc 421 gcgttctgcc attgactctc aaacagacat cgctatcaac ggaacagcat attttagctt 481 cttaggacaa taaatattga taatgccggc tctccctcgg tatattaagc aatccattca 541 tacactcatt catcaggtta attttatata tataatttgt ctattcaaac accgtaaatt 601 actggtacca tcatctcctc cttttcaaat acacgtctat ttgcattaat gaaattactc 661 gccaattcgc agaacgtgtt tgtcgaacag agccttagct cgggtccaga caggagcagt 721 gtctcgctga ggaagctgca ggagagaagg cagcagagcc tcagaaaagg cctcggaagc 781 ctctcgcggc agaagagagg gcaggtggtc gatcgagcac acggacagct tggggccagc 841 agtggtctca acaggcacgg tgggatggtc gaacgtggta gcaattgtgt acacggggac 901 agggttgtgg gggttggtgg tgtcagcaga gacgtcgaca atgacactga gctttcgggt 961 ctccttgttg agcagatcgt agttgatgaa aggaggaatg ggctgagaca ggtagatgca 1021 gttgatgaag atatccgcgt cagcaatctc ttggaagggt ccacccttct tggtctcgtt 1081 catgtcccat cgaatgatgt tctcttcggg gataccgacc tttcgggcca gatcaatggc 1141 accggatcca catcggccaa gagcaccaat caccaggacg gtagggagct tggagccctt 1201 ctcaacagca gcggcaagat ccttcttaat cttgtcgaca agctcggtct cattgggata 1261 ggcagagaca ccgggcaggt tttcgctgtc ggggtgggtc tgctggaagg cccaagtctc 1321 gacaccgatg gcggcaccgg cgaatccagc gtggaagcca aaggcggcaa ctcgtcgtcc 1381 attgtcatcc tccaggaact caaggtcgta cagagttccg tttcctgcgg ggaatcggct 1441 cagaacgtcc ttccatccgc cctgatcctt gtagcagtga gcaaactgga tgtgctcgtg 1501 agacagaggg aaagattcct caggcagctc cttaagacca ataatcattc ggtcctcggg 1561 ggcagagacc caagagccct cctcgacaag agtggctccg acatcgacaa actcctggtc 1621 gtcgaagatt cgaaggggag acttctccac aaagacctcg aatccagcat caagcagctt 1681 tcgggtggta gtaggcgtga gagcagatcg gtgctcgaga ggcttggtct cggctcggag 1741 atggagcttc actggtgcgg tcattgtgtc tgtgttgttg tatgtggtga agtgacaatg 1801 gatggtgtgt cgtatcaggc tggggaacaa ttgtgcttaa gtatgctgca gttgagtaag 1861 agtcatcgat ccaccaaaat aaagtttgcc attagggttg gagagagaga tggtggctgg 1921 aagaattaaa tgacatcaag ctgaggattg tgggtgtgca ataacacatg ttaggggtga 1981 cctgtggctc gaaatctgat aattattttg taactttatg attattctta gattttttaa 2041 tattcctcta tataacacat aagtagctgt cgtctagttg ttcatagcct gactcctgca 2101 atagattagt gcagagtgat tttgtgcaat tgagagccac ggttgagtca agtgactttg 2161 tgtgtgaagt catcttacgt ttcaagtctc acaggttact caattggttg gttgtctgcc 2221 ctttacagat atttacagta cctgagcgta aagtcgttca tccacggaat gactgttcct 2281 gtcacgcagt catgatcatg gatgtggctg gtcaggaacc attttggata ggagacttag 2341 ggattggact attattgaaa aaactgagcc gaatatgata tagttctatt tgaatgcaga 2401 acttctgatg gtcaattcac ttatttcagg catatcggtc atggtggcag ctgccacgat 2461 gttatctcgt tggaaacctc gtccgccatt aaccggaaca tgcttctaga agatgagaat 2521 acagcatgta cttgtacaag attctcggca ttgatggtct caggcatgc // LOCUS HUMVINC 5102 bp ss-mRNA PRI 14-AUG-1990 DEFINITION Human vinculin mRNA, complete cds. ACCESSION M33308 KEYWORDS cytoskeletal protein; vinculin. SOURCE Human endothelial cells, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5102) AUTHORS Weller,P.A., Ogryzko,E.P., Corben,E.B., Zhidkova,N.I., Patel,B., Price,G.J., Spurr,N.K., Koteliansky,V.E. and Critchley,D.R. TITLE Complete sequence of human vinculin and assignment of the gene to chromosome 10 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 5667-5671 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.A.Weller, 28-MAR-1990. FEATURES from to/span description pept 51 3251 vinculin BASE COUNT 1379 a 1248 c 1273 g 1202 t ORIGIN Chromosome 10. 1 gaattccact tctctgtcgc ccgcggttcg ccgccccgct cgccgccgcg atgccagtgt 61 ttcatacgcg cacgatcgag agcatcctgg agccggtggc acagcagatc tcccacctgg 121 tgataatgca cgaggagggc gaggtggacg gcaaagccat tcctgacctc accgcgcccg 181 tggccgccgt gcaggcggcc gtcagcaacc tcgtccgggt tggaaaagag actgttcaaa 241 ccactgagga tcagattttg aagagagata tgccaccagc atttattaag gttgagaatg 301 cttgcaccaa gcttgtccag gcagctcaga tgcttcagtc agacccttac tcagtgcctg 361 ctcgagatta tctaattgat gggtcaaggg gcatcctctc tggaacatca gacctgctcc 421 ttaccttcga tgaggctgag gtccgtaaaa ttattagagt ttgcaaagga attttggaat 481 atcttacagt ggcagaggtg gtggagacta tggaagattt ggtcacttac acaaagaatc 541 ttgggccagg aatgactaag atggccaaga tgattgacga gagacagcag gagctcactc 601 accaggagca ccgagtgatg ttggtgaact cgatgaacac cgtgaaagag ttgctgccag 661 ttctcatttc agctatgaag atttttgtaa caactaaaaa ctcaaaaaac caaggcatag 721 aggaagcttt aaaaaatcgc aattttactg tagaaaaaat gagtgctgaa attaatgaga 781 taattcgtgt gttacaactc acctcttggg atgaagatgc ctgggccagc aaggacactg 841 aagccatgaa gagagcattg gcctccatag actccaaact gaaccaggcc aaaggttggc 901 tccgtgaccc tagtgcctcc ccaggggatg ctggtgagca ggccatcaga cagatcttag 961 atgaagctgg aaaagttggt gaactctgtg caggcaaaga acgcagggag attctgggaa 1021 cttgcaaaat gctagggcag atgactgatc aagtggctga cctccgtgcc agaggacaag 1081 gatcctcacc ggtggccatg cagaaagctc agcaggtatc tcagggtctg gatgtgctca 1141 cagcaaaagt ggaaaatgca gctcgcaagc tggaagccat gaccaactca aagcagagca 1201 ttgcaaagaa gatcgatgct gctcagaact ggcttgcaga tccaaatggt ggaccggaag 1261 gagaagagca gattcgaggt gctttggctg aagctcggaa aatagcagaa ttatgtgatg 1321 atcctaaaga aagagatgac attctacgtt cccttgggga aatatctgct ctgacttcta 1381 aattagcaga tctacgaaga caggggaaag gagattctcc agaggctcga gccttggcca 1441 aacaggtggc cacggccctg cagaacctgc agaccaaaac caaccgggct gtggccaaca 1501 gcagaccggc caaagcagct gtacaccttg agggcaagat tgagcaagca cagcggtgga 1561 ttgataatcc cacagtggat gaccgtggag tcggtcaggc tgccatccgg gggcttgtgg 1621 ccgaagggca tcgtctggct aatgttatga tggggcctta tcggcaagat cttctcgcca 1681 agtgtgaccg agtggaccag ctgacagccc agctggctga cctggctgcc agaggggaag 1741 gggagagtcc tcaggcacga gcacttgcat ctcagctcca agactcctta aaggatctaa 1801 aagctcggat gcaggaggcc atgactcagg aagtgtcaga tgttttcagc gataccacaa 1861 ctcccatcaa gctgttggca gtggcagcca cggcgcctcc tgatgcgcct aacagggaag 1921 aggtatttga tgagagggca gctaactttg aaaaccattc aggaaagctt ggtgctacgg 1981 ccgagaaggc ggctgcggtt ggtactgcta ataaatcaac agtggaaggc attcaggcct 2041 cagtgaagac ggcccgagaa ctcacacccc aggtggtctc ggctgctcgt atcttactta 2101 ggaaccctgg aaatcaagct gcttatgaac attttgagac catgaagaac cagtggatcg 2161 ataatgttga aaaaatgaca gggctggtgg acgaagccat tgataccaaa tctctgttgg 2221 atgcttcaga agaagcaatt aaaaaagacc tggacaagtg caaggtagct atggccaaca 2281 ttcagcctca gatgctggtt gctggggcaa ccagtattgc tcgtcgggcc aaccggatcc 2341 tgctggtggc taagagggag gtggagaatt ccgaggatcc caagttccgt gaggctgtga 2401 aagctgcctc tgatgaattg agcaaaacca tctccccaat ggtgatggat gcaaaagctg 2461 tggctggaaa catttccgac cctggactgc aaaagagctt cctggactca ggatatcgga 2521 tcctgggagc tgtggccaag gtcagagaag ccttccaacc tcaggagcct gacttcccgc 2581 cgcctccacc agaccttgaa caactccgac taacagatga gcttgctcct cccaaaccac 2641 ctctgcctga aggtgaggtc cctccaccta ggcctccacc accagaggaa aaggatgaag 2701 agttccctga gcagaaggcc ggggaggtga ttaaccagcc aatgatgatg gctgccagac 2761 agctccatga tgaagctcgc aaatggtcca gcaagggcaa tgacatcatt gcagcagcca 2821 agcgcatggc tctgctgatg gctgagatgt ctcggctggt aagagggggc agtggtacca 2881 agcgggcact cattcagtgt gccaaggaca tcgccaaggc ctcagatgag gtgactcggt 2941 tggccaagga ggttgccaag cagtgcacag ataaacggat tagaaccaac ctcttacagg 3001 tatgtgagcg aatcccaacc ataagcaccc agctcaaaat cctgtccaca gtgaaggcca 3061 ccatgctggg ccggaccaac atcagtgatg aggagtctga gcaggccaca gagatgctgg 3121 ttcacaatgc ccagaacctc atgcagtctg tgaaggagac tgtgcgggaa gctgaagctg 3181 cttcaatcaa aattcgaaca gatgctggat ttacactgcg ctgggttaga aagactccct 3241 ggtaccagta ggcacctggc tgagcctggc tggcacagaa acctctacta aaaagaagga 3301 aaatgatctg agtcccagga gctgcccaga gttgctggga gctgaaaaat cacatcctgg 3361 cctggcacat cagaaaggaa tgggggcctc ttcaaattag aagacattta tactcttttt 3421 tcatggacac tttgaaatgt gtttctgtat aaagcctgta ttctcaaaca cagttacact 3481 tgtgcaccct ctatcccaat aggcagactg ggtttctagc ccatggactt cacataagct 3541 cagaatccaa gtgaacacta gccagacact ctgctctgcc cttgttccct aggggacact 3601 tccctctgtt tctctttcct tggctcccat tcactcttcc agaatcccaa gacccagggc 3661 ccaggcaaat cagttactaa gaagaaaatt gctgtgcctc ccaaaattgt tttgagcttt 3721 ccatgttgct gccaaccata ccttccttcc ctgggctgtg ctacctgggt ccttttcaga 3781 agtgagcttt gctgctacag gggaaggtgg cctctgtgga gccccagcat atgggggcct 3841 ggattcattt cctgcccttc ctcagtttaa tccttctagt ttcccacaat ataaaactgt 3901 acttcactgt caggaagaaa tcacagaatc atatgattct gcttttacca tgcccctgag 3961 caatgtctgt gctagggaaa ctccccgtcc catatcctgc ctcagcccgc caaggtagcc 4021 atcccatgaa cacactgtgt cctggtgctc tctgccactg gaagggcaga gtagccaggg 4081 tgtggccctg ccatcttccc agcagggcca ctcccggcac tccatgctta gtcactgcct 4141 gcagaggtct gtgctgaggc cttatcattc attcttagct cttaattgtt cattttgagc 4201 tgaaatgctg cattttaatt ttaaccaaaa catgtctcct atatcctggt ttttgtagcc 4261 ttcctccaca tcctttctaa acaagatttt aaagacatgt aggtgtttgt tcatctgtaa 4321 ctctaaaaga tcctttttaa attcagtcct aagaaagagg agtgcttgtc ccctaagagt 4381 gtttaatggc aaggcagccc tgtctgaagg acacttcctg cctaagggag agtggtattt 4441 gcagactaga attctagtgc tgctgaagat gaatcaatgg gaaatactac tcctgtaatt 4501 cctacctccc tgcaaccaac tacaaccaag ctctctgcat ctactcccaa gtatggggtt 4561 caagagagta atgggtttca tatttcttat caccacagta agttcctact aggcaaaatg 4621 agagggcagt gtttcctttt tggtacttat tactgctaag tatttcccag cacatgaaac 4681 cttatttttt ccaaagccag aaccagatga gtaaaggagt aagaaccttg cctgaacatc 4741 cttccttccc acccatcgct gtgtgttagt tcccaacatc gaatgtgtac aacttaagtt 4801 ggtcctttac actcaggctt tcactatttc ctttaaaatg aggatgatta ttttcaaggc 4861 cctcagcata tttgtatagt tgcttgcctg atataaatgc aatattaatg cctttaaagt 4921 atgaatctat gccaaagatc acttgttgtt ttactaaaga aagattactt agaggaaata 4981 agaaaaatca tgtttgctct cccggttctt ccagtggttt gagacactgg tttacacttt 5041 atgccggatg tgcttttctc caatatcagt gctcgagaca cagtgaagca aattaaaaaa 5101 aa // LOCUS HUMPSBGA01 134 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 1. ACCESSION M32622 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 1 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 134; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept 50 + 113 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 1 sigp 50 + 113 pregnancy-specific beta-1-glycoprotein 4 signal peptide IVS 114 > 134 PSG4 intron A BASE COUNT 33 a 44 c 36 g 21 t ORIGIN Chromosome 19q13.2-13.3. 1 ggatcctagg cttatctcca cagaggagaa cacacaagca gcagagacca tggggcccct 61 ctcagcccct ccctgcacac acctcatcac ctggaagggg gtcctgctca caggtgagga 121 gagaacttcc tggg // LOCUS HUMPSBGA02 1010 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 2. ACCESSION M32623 M22436 D51537 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 2 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2] and hsCGM4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1010) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 3 (bases 1 to 1010; for [2]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Thompson, 07-FEB-1989; for [2],[3], 06-MAR-1990. FEATURES from to/span description pept + 468 + 833 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 2 sigp + 468 505 pregnancy-specific beta-1-glycoprotein 4 signal peptide matp 506 + 833 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 467 PSG4 intron A IVS 834 > 1010 PSG4 intron B BASE COUNT 281 a 225 c 267 g 237 t ORIGIN Chromosome 19q13.2-13.3, about 650 bp after segment 1. 1 gagctcacac agtcatcggg gggggaagac agacatgcta agtgatctag aatgtgagat 61 caggtgttga caagaaccct ggagggagga gagcagggaa aggtcagaaa gggaagaccc 121 agggtctctg aaggaggtat caggaaagaa gtctaaggat gccctgatgt gagcaggacc 181 tgagggcagt gtggaggggg ccgtgcggac cctggggaag aggaatccaa aaagaaaaat 241 gccaaggtca gaagtgttga aggaatgggg gtcatgctgc tgatcttgac ctagtgggac 301 agtaggacac acacacatac actcacgccc ctttagtgtg tgtatgtgtt tgtatgtgtg 361 tgtttgtgtg tcttcaaggc tgaggattga agagaccttc tcaggaccca tcttttcacc 421 ccaatacata ggtctcaata ttaactgatg ctctctgtac ctcctagcat cacttttaaa 481 cttctggaat ccgcccacaa ctgcccaagt cacgattgaa gcccagccac ccaaagtttc 541 tgaggggaag gatgttcttc tacttgtcca caatttgccc cagaatcttg ctggctacat 601 ttggtacaaa gggcaaatga catacctcta ccattacatt acatcatatg tagtagacgg 661 tcaaagaatt atatatgggc ctgcatacag tggaagagaa agagtatatt ccaatgcatc 721 cctgctgatc cagaatgtca cgcaggagga tgcaggatcc tacaccttac acatcataaa 781 gcgacgcgat gggactggag gagtaactgg acatttcacc ttcaccttac accgtgagtg 841 attccacatg atccctgggt gttgggggac aggggtcact tctacttcac acacacagga 901 ttctcaggcc tggactctgc ctgtgtccct ctctgcatta agtccatgct ggggtttggg 961 catttagtgc aggacacaca gaggagacaa atttcaacag atcagaattc // LOCUS HUMPSBGA03 430 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 3. ACCESSION M32624 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 3 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 430; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 124 + 402 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 3 matp + 124 + 402 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 123 PSG4 intron B IVS 403 > 430 PSG4 intron C BASE COUNT 108 a 117 c 103 g 102 t ORIGIN Chromosome 19q13.2-13.3, about 5500 bp after segment 2. 1 ttccatgggc ttttaaggac ttgggtgggc tgagaggtgg gaaatgccaa ctctgattga 61 aaaatgcctt tggaagaatc aaagatgcca cacagggcaa acctctctct gttttctgca 121 cagtggagac tcccaagccc tccatctcca gcagcaactt aaatcccagg gaggccatgg 181 aggctgtgat cttaacctgt gatcctgcga ctccacccgc aagctaccag tggtggatga 241 atggtcagag cctccctatg actcacaggt tgcagctgtc caaaaccaac aggaccctct 301 ttatatttgg tgtcacaaag tatattgcag gaccctatga atgtgaaata cggaacccag 361 tgagtgccag ccgcagtgac ccagtcaccc tgaatctcct ccgtgagtat cctctgttcc 421 tctgtgagcc // LOCUS HUMPSBGA04 573 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 4. ACCESSION M32625 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 4 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 573; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 181 365 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 4 369 + 435 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 4 matp + 181 + 435 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 180 PSG4 intron C IVS 436 > 573 PSG4 intron D site 366 368 internal stop codon BASE COUNT 146 a 149 c 132 g 146 t ORIGIN Chromosome 19q13.2-13.3, about 270 bp after segment 3. 1 agcttgtgat gggagaaaca ggtgtaaggt ctcaggctcc agatcagtga acacagcggg 61 gatttggctg ggacttcagt gttgcgactt ggctcacagg gtcactgtgg cccttccaca 121 gaccaggatt ttcccttccc tctgacaatg tcacctgtga ctttattctc tttgctccag 181 atggcctgga tgcccccacc atttcttcct catacaccta ttaccataca ggggaattcc 241 ccaatctctc ctgcctcaca gactctcacc cactggcaga gcattcttgg ctgattgatg 301 ggaagttcca gcaatcagca caagtgtgct ttatccccca aatcactaaa acatatagag 361 gggtttaggt ctgtttcatc cataactcag ccactgatgg aacaaatctc ataatcaaga 421 ggatcatagt gcctggtaag tggatccctg gagcattggc aatatgtttt ccagtgaagc 481 tatctatgct atcagggaaa agccaccgtc cctcgtcaaa gggagaggga aaatcaaaaa 541 cccagcacag ggaatatgtt tcgtctccgg ggg // LOCUS HUMPSBGA05 647 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 5. ACCESSION M32626 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 5 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 647; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 158 + 436 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 5 matp + 158 + 436 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 157 PSG4 intron D IVS 437 > 647 PSG4 intron E BASE COUNT 183 a 168 c 147 g 149 t ORIGIN Chromosome 19q13.2-13.3, about 1800 bp after segment 4. 1 gatcatcatg catctatctt gtgacagaag caccagctat tggctttcaa ggactcgggt 61 gggctgagag gtgggagatg ccaactctga ttgaagtatg cctgtggagg aatcaaaggt 121 gccacacagg acaatcttct ctctgttatc cacacagcaa agctgtccaa gccctacatc 181 acaatcaaca acttaaaccc cagagagaat aaggatgtct taaccttcac ctgtgaacct 241 aagagtaaga actacaccta catttggtgg ctaaatggtc agagcctccc tgtcagtccc 301 agggtaaagc gacccattga aaacaggatc ctcattctac ccaatgtcac gagaaatgaa 361 acaggacctt atcaatgtga aatacgggac cgatatggtg gcatccgcag tgacccagtc 421 accctgaatg tcctctgtga gtatcttttt gttcctctgt gggccaggac accagcttaa 481 atccaaacca ccagaggcca ggtctctcag tctctctcca gtccaagtat agacaccttt 541 acttctggac atccgaggtg gccatgactc cctgccctgg gaaaacctag gtaggcacag 601 cttaaccaag aatataaggg gaggggacgc tcttgtcatg gagactt // LOCUS HUMPSBGA06 558 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 4 (PSG4) gene, exon 6. ACCESSION M32627 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 6 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 558; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 45 / 299 pregnancy-specific beta-1-glycoprotein 4 (PSG4) precursor, exon 6 matp + 45 / 299 pregnancy-specific beta-1-glycoprotein 4 IVS < 1 44 PSG4 intron E IVS 300 > 558 PSG4 intron F BASE COUNT 161 a 144 c 103 g 150 t ORIGIN Chromosome 19q13.2-13.3, about 200 bp after segment 5. 1 ttcctcggat gacatcacct atggctttat tctctttgcg ccagatggtc cagacctccc 61 cagcatttac ccttcattca cctattaccg ttcaggagaa aacctctact tgtcctgctt 121 cgccgagtct aacccacggg cacaatattc ttggacaatt aatgggaagt ttcagctatc 181 aggacaaaag ctctctatcc cccaaataac tacaaagcat agtgggctct atgcttgctc 241 tgttcgtaac tcagccactg gcaaggaaag ctccaaatcc atcacagtca aagtctctgg 301 taagtggatc ccagcatcct tggcaatagg gatttaggtg gagtctatct ggcattcagg 361 gaagagtgag gaaaacattt ttattcccag cctgtgtccc atgggcacaa gcaaatccca 421 aattctactc ctgaacactc ccaatttgtc taagaacttc caaaacttta acaaacaggc 481 tgatatcttc ttaaaattcc cagcctagac caagcaggga gaacattgat ttcattgaaa 541 taattgacaa tattgagg // LOCUS HUMPSBGA07 621 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 1. ACCESSION M32628 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 7 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 621; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept 360 + 423 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 1 sigp 360 + 423 pregnancy-specific beta-1-glycoprotein 5 signal peptide IVS 424 > 621 PSG5 intron A BASE COUNT 176 a 135 c 193 g 113 t 4 others ORIGIN Chromosome 19q13.2-13.3, about 7000 bp after segment 6. 1 tatgtgtgtg actgtgtggg tctgtgtgtg tctcttctgt gtgtgtgtgt ctgcacaaag 61 tgtggttgag gtttggtgaa agaatcactg ctgaaaaatg cagaggcctc cacaattccc 121 agggacctga aacacagaca aaaggaaaaa cagaaggagg acaaggaggc aggactgaga 181 gaggagggga cagagaggtg tcctgggcct gaccccaccc atgagcctga gaagtgctcc 241 tgccctggag agaggctcag cacagaagga ggaaggacag cacagcctac agccgtgctc 301 aggaagtttc tggatcctag gctcagctcc acagaggaga acacgcaggc gcagagacca 361 tggggcccct ctcagcccct ccctgcacac agcacatcac ctggaagggg ctcctgctca 421 caggtgagga gagaacttcc tgggagagga caggaggagg aagaagagtg actggattgg 481 gtctcctgga gaggatnnnn ttctaaaaaa taaaagaagt ccagcacttt gggaggatga 541 ggtgggtgga ttatgagatc aggagttcaa ggtcagtcct gccaacacag tgaagccctg 601 tctctactaa aaatacaaaa a // LOCUS HUMPSBGA08 173 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, intron A. ACCESSION M32629 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 8 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 173; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description IVS < 1 > 173 pregnancy-specific beta-1-glycoprotein 5 intron A BASE COUNT 56 a 29 c 62 g 26 t ORIGIN Chromosome 19q13.2-13.3, about 560 bp after segment 7. 1 gagctcacgc tgtcatgggg aggaagacag acatgcaaag agatctagaa tgtgaggtca 61 ggtgttgaca agaaccctgg agggagcaga gcagggaaag gtcagaaagg gaagacccag 121 ggtctctgaa ggacgtgtca ggaaagaagt ctagagatgc cctgatgtga gca // LOCUS HUMPSBGA09 547 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 2. ACCESSION M32630 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 9 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 547; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 9 + 374 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 2 sigp + 9 47 pregnancy-specific beta-1-glycoprotein 5 signal peptide matp 48 + 374 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 8 PSG5 intron A IVS 375 > 547 PSG5 intron B BASE COUNT 149 a 139 c 117 g 142 t ORIGIN Chromosome 19q13.2-13.3, about 300 bp after segment 8. 1 cctcctagca tcacttttaa acttctggaa cctgcctatc actgctcaag tcacgattga 61 agccctgcca cccaaagttt ccgaggggaa ggatgttctt ctacttgtcc acaatttgcc 121 tcagaatctt gctggctaca tctggtacaa aggacaactg atggacctct accattacat 181 tacatcatat gtagtagacg gtcaaataaa tatatatggg cctgcataca ctggacgaga 241 aacagtatat tccaatgcat ccctgctgat ccagaatgtc acccgggaag acgcaggatc 301 ctacacctta cacatcataa agcgaggtga taggactaga ggagtaactg gatatttcac 361 cttcaactta taccgtgagt gattccacat gatccctggg tgttggggga caggggtcat 421 ttctacttca cacacacagg attgtcaggc ctgcactgtg cctgtgtccc tctctgcatt 481 atgtcccatg ctgcggtttg ggcatttagt gcaggacaca cacagaggag acaaatttca 541 acagatc // LOCUS HUMPSBGA10 458 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 3. ACCESSION M32631 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 10 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 458; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 167 + 445 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 3 matp + 167 + 445 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 166 PSG5 intron B IVS 446 > 458 PSG5 intron C BASE COUNT 117 a 122 c 108 g 111 t ORIGIN Chromosome 19q13.2-13.3, about 5200 bp after segment 9. 1 gagctctcag attgccatgc atctgtcttg taacacacac acctgccatg ggcttttaag 61 gacttgggtg ggctgacagg tgggaaatgc caactctgat tgaaaagtgc ctttggagga 121 atcaaaggtg ccacacaggg caatcttctc tctgttttct gcaaagtgga aactcccaag 181 ccctccatct ccagtagcaa cttaaacccc agggaggcca tggaggctgt gatcttaacc 241 tgtgatcctg cggcttcaga cacaagctac ctgtggtgga tgaatggtca gagcctccct 301 atgactcaca ggttgcagct gtccaaaacc aacaggaccc tctttatatt tggtgtcaca 361 aagtatattg caggacccta tgaatgtgaa atacggaacc cagtgagtgc cagccacagt 421 gacccagtca ccctgaatct cctccgtgag tatcctct // LOCUS HUMPSBGA11 359 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 4. ACCESSION M32632 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 11 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 359; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 95 267 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 4 272 + 347 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 4 matp + 95 + 347 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 94 PSG5 intron C IVS 348 > 359 PSG5 intron D site 268 270 internal stop codon BASE COUNT 89 a 108 c 65 g 97 t ORIGIN Chromosome 19q13.2-13.3, about 300 bp after segment 10. 1 acttggctca cagggtcact gtggcccttc cacagaccag gattttccct tccctctgac 61 aatgtcacct gtgactttat tctctttgct ccagatggcc tggatgcccc caccatttct 121 tcctcataca cctattacca tacaggggaa gtccccaatc tctcctgcct cacagactct 181 cacccactgg cagagcattc ttggctgatt gatgggaagt tccagcaatc agcacaagtg 241 tgctttattc cccaaatcac caaaacatag agggatctat gtctgtttca tccataactc 301 agccactgat ggaacaaatc tcataatcaa gaggatcata gtccctggta agtggatcc // LOCUS HUMPSBGA12 381 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 5. ACCESSION M32633 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 12 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 381; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 33 + 311 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 5 matp + 33 + 311 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 32 PSG5 intron D IVS 312 > 381 PSG5 intron E BASE COUNT 111 a 105 c 81 g 84 t ORIGIN Chromosome 19q13.2-13.3, about 2000 bp after segment 11. 1 acaggacaat cttctctctg ttatccacac agtgaagctg cccaagccct acatcaccat 61 caacaactca aaacccaggg agaataagga tgtcttagcc ttcacctgtg aacctaagag 121 tgagaactac acctacattt ggtggctaaa tggtcagagc ctcccggtca gtcccagggt 181 aaagcgaccc attgaaaaca ggatcctcat tctacccagt gtcacgagaa atgaaacagg 241 accctatgaa tgtgaaatac gggaccgaga tggtggcatg cgcagtgacc cagtcaccct 301 gaatgtcctc tgtgagtatc ttttgttcct ctgtgggcca ggacaccagc ttaaatctaa 361 atgaccacaa gccagtcctc t // LOCUS HUMPSBGA13 540 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 6. ACCESSION M32634 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 13 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 540; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 40 + 294 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 6 matp + 40 + 294 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 39 PSG5 intron E IVS 295 > 540 PSG5 intron F BASE COUNT 132 a 142 c 108 g 158 t ORIGIN Chromosome 19q13.2-13.3, about 300 bp after segment 12. 1 ggatgacatc acctgtggct tcattcttct ttgctccaga tggtccagac ctccccagca 61 tttacccttc attcacctat taccgttcag gagaaaacct ctacttgtcc tgcttcgcgg 121 aatctaaccc accggcagag tatttttgga caattaatgg gaagtttcag caatcaggac 181 aaaagctctc tatcccccaa attactacaa agcatagagg gctctatact tgctctgttc 241 gtaactcagc tactggcaag gaaagctcca aatccatgac agtcgaagtc tctggtaagt 301 ggatcccagc atccttggca atagggtttt aggtggagtc tacctggctt tcagagaaga 361 gttaggaaaa catttttatt cccagcctgt gtcccatggg cacaagcaaa tcccaaattc 421 tcctcctgaa cccttccaat ttgtctctac aaactctctt ctccttgttt ttctgttttc 481 tcatggctga ccttgtgtct ggcctaagaa agttagggag ggggctttat cagccctgag // LOCUS HUMPSBGA14 320 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 5 (PSG5) gene, exon 7. ACCESSION M32635 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 14 of 14 SOURCE Human placenta DNA, clones cosPSG4/5-[1,2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 320; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Barnert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 202 245 pregnancy-specific beta-1-glycoprotein 5 (PSG5) precursor, exon 7 matp + 202 245 pregnancy-specific beta-1-glycoprotein 5 IVS < 1 201 PSG5 intron F IVS 286 > 320 PSG5 intron G BASE COUNT 94 a 78 c 54 g 94 t ORIGIN Chromosome 19q13.2-13.3, about 4300 bp after segment 13. 1 ggatcaccat cctagaaaca tggtgagagg gaaaaaaaaa actgcaagaa tataatcatc 61 tcccatcaat ttcccaagag gaataatgta gtacttgaag aaacaattat agagtacctc 121 atgctacatg cctcttcctg aggctccccc atgtaaaata acatcacctt cattccttct 181 ttacttttct ttccatgaca gctccttcag gaataggacg tcttcctctc cttaatccaa 241 tatagcagcc gtgaagtcat ttctgtattt caggaagact ggcaggtatg atggcctttt 301 ctcttatcct ggtcctgcag // LOCUS HUMPSBGB1 237 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 4. ACCESSION M32617 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 1 of 5 SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 237; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept < 1 + 14 pregnancy-specific beta-1-glycoprotein (PSG7), exon 4 (AA at 2) IVS 15 > 237 PSG7 intron D BASE COUNT 76 a 52 c 50 g 59 t ORIGIN Chromosome 19q13.2-13.3. 1 gatcatagtc cctggtaagt ggatcctgga gcattggcaa tatgttttcc agggaagtct 61 atctggctat cagggaagag ccacctgccc tctgcagagg agagggaaaa tcaaaactca 121 ggaaagggaa tatatttctg ctccaaaacc accaggtttt cctgtacctc actctttcta 181 gatcattctt cagactatgc gcactaacaa tgaacaatct gaaaagaaat taagaaa // LOCUS HUMPSBGB2 435 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 5. ACCESSION M32618 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 2 of 5 SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 435; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 25 + 303 pregnancy-specific beta-1-glycoprotein 7 (PSG7), exon 5 IVS < 1 24 PSG7 intron D IVS 304 > 435 PSG7 intron E BASE COUNT 120 a 123 c 87 g 105 t ORIGIN Chromosome 19q13.2-13.3; about 1800 bp after segment 1. 1 atcttctctc tgttattcac acagtgaagc tgcctaagcc ctacatcacc atcaataact 61 taaaccccag ggagaataag gatgtctcaa ccttcacctg tgaacctaag agtgagaact 121 acacctacat ttggtggcta aatggtcaga gcctcccggt cagtcccagg gtaaagcgac 181 gcattgaaaa caggatcctc attctaccca gtgtcacgag aaatgaaaca ggaccctatc 241 aatgtgaaat acgggaccga tatggtggca tccgcagtga cccagtcacc ctgaatgtcc 301 tctgtgagta tccttgagta tcctttcttt cctctgtggg ccaggacaca ccagcttaac 361 tccaaacgac cagaggccag tcctctcagt ctctctccgg tccaagtata gacatattta 421 cttctggaca tccga // LOCUS HUMPSBGB3 274 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 7 (PSG7) gene, exon 6. ACCESSION M32619 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 3 of 5 SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 274; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept + 8 / 262 pregnancy-specific beta-1-glycoprotein 7 (PSG7), exon 6 IVS < 1 7 PSG7 intron E IVS 263 > 274 PSG7 intron F BASE COUNT 72 a 78 c 52 g 72 t ORIGIN Chromosome 19q13.2-13.3; about 300 bp after segment 2. 1 cctccagatg gtccagacct ccccagaatt tacccttcat tcacctatta ccattcagga 61 gaaaacctct acttgtcctg ttttgcggac tctaacccac cggcacagta ttcttggaca 121 attaatggga agtttcagct atcaggacaa aagctttcta tcccccagat tactacaaag 181 catagcgggc tctatgcttg ctctgttcgt aactcagcca ctggcaagga aagctccaaa 241 tccgtgacag tcagagtctc tggtaagtgg atcc // LOCUS HUMPSBGB4 206 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 6 (PSG6) gene, exon 1. ACCESSION M32620 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 4 of 5 SOURCE Human leukocyte DNA, clone lambda-PSG6/7-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 2 (bases 1 to 206; for [1]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly submitted by J.Thompson, 06-MAR-1990. FEATURES from to/span description pept 50 + 113 pregnancy-specific beta-1-glycoprotein 6 (PSG6) precursor, exon 1 sigp 50 + 113 pregnancy-specific beta-1-glycoprotein 6 signal peptide IVS 114 > 206 PSG6 intron A BASE COUNT 59 a 50 c 66 g 31 t ORIGIN Chromosome 19q13.2-13.3; about 7800 bp after segment 3. 1 ggatcctagg ctcatctcca caggggagaa cacacagaca gcagagacca tgggacccct 61 ctcagcccct ccctgcactc agcacatcac ctggaagggg ctcctgctca caggtgagga 121 gaggacttcc tgggagagga caggaggagg aagcagagtg actggatggg gtatctggag 181 aggatggggt tctaaaaaat aaaaga // LOCUS HUMPSBGB5 1026 bp ds-DNA PRI 14-AUG-1990 DEFINITION Human pregnancy-specific beta-1-glycoprotein 6 (PSG6) gene, exon 2. ACCESSION M32621 M22435 D51537 KEYWORDS pregnancy-specific beta-1-glycoprotein. SEGMENT 5 of 5 SOURCE Human leukocyte DNA, clones lambda-PSG6/7-1 and hsCGM3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1026) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S., Shively,J.E. and Zimmermann,W. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review REFERENCE 2 (sites) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. TITLE The human pregnancy-specific glycoprotein genes are tightly linked on the long arm of chromosome 19 and are coordinately expressed JOURNAL Biochem. Biophys. Res. Commun. 167, 848-859 (1990) STANDARD full staff_review REFERENCE 3 (bases 439 to 846; for [2]) AUTHORS Thompson,J., Koumari,R., Wagner,K., Bamert,S., Schleussner,C., Schrewe,H., Zimmermann,W., Mueller,G., Schempp,W., Zaninetta,D., Ammaturo,D. and Hardman,N. JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.A.Thompson, 07-FEB-1989; for [2],[3], 06-MAR-1990. This gene was named human carcinoembryonic antigen (hsCGM3) in [1], but [2] says it is actually one of the pregnancy-specific beta-1-glycoproteins. FEATURES from to/span description pept + 484 / 846 pregnancy-specific beta-1-glycoprotein 6 (PSG6) precursor, exon 2 sigp + 484 521 pregnancy-specific beta-1-glycoprotein 6 signal peptide matp 522 / 846 pregnancy-specific beta-1-glycoprotein 6 IVS < 1 483 PSG6 intron A IVS 847 > 1026 PSG6 intron B BASE COUNT 291 a 235 c 264 g 236 t ORIGIN Chromosome 19q13.2-13.3; about 1200 bp after segment 4. 1 gagctcacac tctcatgggg aggaagacag acatgcaaag agatatagaa tgtgaggtca 61 ggtgttgaca agaaccctag agggagcaga gcagggaaag gtcagaaagg gaagacccag 121 ggtctctgaa gcaggcatca ggaaagaagt ctaaggatgc cctgatgtga gcaggacctg 181 agggcagtgt ggagggggcc gtgcggaccc ctggggaaga ggattgcaaa cagaaaaatg 241 ccaaggtcag gagtgttgaa ggaatggggg tcatgctgct gaccttgacc tagtaggaca 301 gtaggacaca cacacataca cacacacaaa cacacatgcc cttttgtgtg tgtgtgtttg 361 tatgtgtgtg tgtgcatatc ttcaaggctg atgattgaag agaccttctc aggacacagg 421 gccccatctt ttcaccccaa tacataggtc caaatattaa ctgatgctgt ctctacctcc 481 tagcatcact tttaaacttc tggaacctgc ccaccactgc ccaagtaata attgaagcca 541 agccacccaa agtttccgag gggaaggatg ttcttctact tgtccacaat ttgccccaga 601 atcttactgg ctacatctgg tacaaagggc aaatgacgga cctctaccat tacattacat 661 catatgtagt acacggtcaa attatatatg ggcctgccta cagtggacga gaaacagtat 721 attccaatgc atccctgctg atccagaatg tcacacagga ggatgcagga tcctacacct 781 tacacatcat aaagcgaggc gatgggactg gaggagtaac tggatatttc actgtcacct 841 tatactgtga gtgattccgc atgatccctg ggtgttgggg ggcaggggtc atttctactt 901 cacacacaca gaattgtcag gcctggactc tgcctgtgtc actctctgca ttatgtccca 961 tgctggggtt tgggcattta gtgcaggaca cacacagagg agacacattt caacagatca 1021 gaattc // LOCUS MSQRTR1A 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 16bs. ACCESSION M35448 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 16bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 76 a 78 c 64 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gatctcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttactca atcttcactc tttaagctt // LOCUS MSQRTR1B 348 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 916. ACCESSION M35449 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 916. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 348) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 348 T1 retroposon BASE COUNT 79 a 75 c 63 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatat actcgacagt cgtcttaact ttaaactgca 61 gcttgatgag gttctactaa aagctaatcg aactcttggg tttattttac gttttacctc 121 tatttttaga gattaaagct tcttaagaaa cctttattat gctctggtaa ggcctcttct 181 tgaatatgct agcatcatct ggaatcctcc tactattgat ggctgttcga gaattgaaag 241 cattcagcgc ctttttacca gggttgcttt tcgtcgtttg ttcggtgctg cctcactacc 301 tccctatgaa acgagattgc agttattcaa tcttcactct ttaagctt // LOCUS MSQRTR1C 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 52a. ACCESSION M35450 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 52a. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 78 a 76 c 64 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcccattc 181 ttgaatatgc tagtatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gccctactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1D 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone L34. ACCESSION M35451 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone L34. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 77 a 77 c 64 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gccctactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1E 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 914a. ACCESSION M35452 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 914a. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 77 a 76 c 65 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctgtggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1F 343 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 13bs. ACCESSION M35453 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 13bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 343) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 343 T1 retroposon BASE COUNT 78 a 73 c 60 g 132 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta taatttacag tcgtcttaac tttaaactgc 61 agcttgatga gattctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagtatcatc tggaattgtc ctactattga tggctgttcg agaattgaat 241 gccttcagcg cctctttacc aggattgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcagtta ttcaatcttc actctttaag ctt // LOCUS MSQRTR1G 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 12bs. ACCESSION M35454 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 12bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 78 a 79 c 62 g 130 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acccttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcatcgttt tttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttactca atcttcactc tttaagctt // LOCUS MSQRTR1H 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 9bs. ACCESSION M35456 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 9bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 77 a 77 c 64 g 131 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1I 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 6bs. ACCESSION M35457 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 6bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 78 a 76 c 64 g 131 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gaactcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagtatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1J 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 4bs. ACCESSION M35461 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 4bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 76 a 77 c 64 g 132 t ORIGIN 1 gtgttttgtc catccgtgac cttggtatta tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggttctacta aaagctaatc gatctcttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcttaagaa acctttattg tgctctggta aggcctcttc 181 ttgaatatgc tagcatcatc tggaatcctc ctactattga tggctgttcg agaattgaaa 241 gcattcagcg cctttttacc agggttgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc tttaagctt // LOCUS MSQRTR1K 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 1bs. ACCESSION M35462 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 1bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 78 a 80 c 64 g 127 t ORIGIN 1 gtgttgtgtc catccgtgac cttggtataa tactcgacag tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagctaatc gaacccttgg gtttatttta cgttttacct 121 ctatttttag agatcaaagc ttcctaagca acctttattg tgctttggta aggcctattc 181 ttgaatatgc tagtatcctc tggaatcctc ctactattga ttgctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgctt ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1L 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 5bs. ACCESSION M35463 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 5bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 81 a 80 c 64 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1M 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 3bs. ACCESSION M35464 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 3bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 80 a 80 c 65 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggagtcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1N 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 11bs. ACCESSION M35465 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 11bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 82 a 80 c 64 g 123 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacca 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1O 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 14bs. ACCESSION M35466 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 14bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 81 a 82 c 64 g 122 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaacc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgtct gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1P 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 31b. ACCESSION M35467 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 31b. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 80 a 80 c 65 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctgcta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1Q 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 15bs. ACCESSION M35468 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 15bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 81 a 80 c 64 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1R 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 17bs. ACCESSION M35469 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 17bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 82 a 79 c 64 g 124 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgacttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agattaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1S 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 18bs. ACCESSION M35470 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 18bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 82 a 81 c 64 g 122 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaaccctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgatag cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1T 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 19bs. ACCESSION M35471 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 19bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 81 a 82 c 64 g 122 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatga ggtgctacta aaagcaaacc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc aagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgtct gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS MSQRTR1U 349 bp ds-DNA INV 14-AUG-1990 DEFINITION A.gambiae T1 retroposon, clone 28bs. ACCESSION M35472 KEYWORDS retroposon; retrotransposon. SOURCE A.gambiae (colony G3) DNA, clone 28bs. ORGANISM Anopheles gambiae Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Nematocera; Culicoidea; Culicidae; Anophelelinae. REFERENCE 1 (bases 1 to 349) AUTHORS Besansky,N.J. TITLE Evolution of the T1 retroposon family in the Anopheles gambiae complex JOURNAL Mol. Biol. Evol. 7, 229-246 (1990) STANDARD simple staff_review FEATURES from to/span description trns 1 349 T1 retroposon BASE COUNT 79 a 80 c 64 g 126 t ORIGIN 1 gtgttatgtc catccgtgac cttggtatta tacttgactg tcgtcttaac tttaaactgc 61 agcttgatgt ggtgctacta aaagcaaatc gaacccttgg gtttatctta cgttttacct 121 ctatttttag agatcaaagc atcctaagaa tcctttattg tgctttggta aggcctattc 181 ttgaatatgc tagtatcatc tggaatcctc ccactattga tggctgttcg agaattgaaa 241 gcattcagcg cctctttacc aggattgcct ttcgtcgttt gttcggtgct gcctcactac 301 ctccctatga aacgcgattg cagttattca atcttcactc cttaagctt // LOCUS SHRMTRRDA 1136 bp ss-RNA RNA 14-AUG-1990 DEFINITION Brine shrimp mitochondrion 16S ribosomal RNA precursor. ACCESSION X12965 M21833 KEYWORDS 16S ribosomal RNA; ribosomal RNA. SOURCE Brine shrimp mitochondrion, cDNA to RNA, clones pArLSU-[1,2]. ORGANISM Mitochondrion Artemia salina Eukaryota; Animalia; Metazoa; Arthropoda; Crustacea; Branchiopoda; Sarsostraca; Anostraca; Artimiidae; Artemia salina. REFERENCE 1 (sites) AUTHORS Sastre,L. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 2 (bases 1 to 1136; enum. 1 to 1136) AUTHORS Sastre,L. TITLE ; JOURNAL Unpublished (1988) see COMMENT for author address STANDARD simple automatic REFERENCE 3 (bases 518 to 1129) AUTHORS Palmero,I., Renart,J. and Sastre,L. TITLE Isolation of cDNA clones coding for mitochondrial 16S ribosomal RNA from the crustacean Artemia JOURNAL Gene 68, 239-248 (1988) STANDARD full staff_review REFERENCE 4 (bases 1 to 1136; revises [1],[2],[3]) AUTHORS Sastre,L. JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT Author address: Sastre L., Instituto de Investigaciones Biomedicas del CSIC, Av Arzobispo Morcillo 4, 28029 Madrid, Spain. Submitted (22-SEP-1988) on tape to the EMBL data library. [1] Submitted (17-NOV-1988) to the EMBL data library. [2] Draft entry and computer-readable sequence for [3] kindly provided by L.Sastre, 01-MAR-1989; for [4], 12-MAY-1989. FEATURES from to/span description rRNA 1 1136 16S ribosomal RNA revision 548 550 cct in [2]; ct in [1] revision 555 557 act in [2]; at in [1] revision 778 780 gtc in [2]; gc in [1] revision 906 908 tta in [2]; ta in [1] revision 981 982 tc in [2]; ttc in [1] revision 1003 1007 cagcc in [2]; cc in [1] revision 1111 1112 ta in [4]; tca in [2] BASE COUNT 382 a 196 c 218 g 340 t ORIGIN 1 atttaaaaca atattatatt agtaaaagaa tcgaatttat tcaaggtcta agtactgaaa 61 aggaaagtct actaattaca aagaagttat tgtttgtacc ttttgtatca gggtttatta 121 aatttcaatc tttattgctg aatcccgaaa caaggtgatc tatattggtt aaagatgatt 181 gtttcataaa tctcttcaag accatatcgt tgcgaaaata tttcgtacct ttatatctgg 241 ttggcagaat actcctttag ggtactccgc aaggaagatt ttctagggga cgagctctag 301 aaaaaattat attataatta tcaagaaaaa catgactagg cttaaaagtt gcctcggttt 361 aataaccatt gttagtaaaa tgataatttc aaaataatta attaagactg gccgatgcta 421 attacataaa tttagcaatc acaatgatag aattagtaga actataaatg cacttaacac 481 aatattcctc acaaaatccc aaaggaattc ggcaaaactc atctccgcct gtttaacaaa 541 gacatcgcct cttaactttg ggaggtcgag cctgcccact gatgaattaa agggccgtgg 601 tatactgacc atgcgaaggt agcataatca ttagcctttt gatttgaggc tggaatgaat 661 ggtttgacga gagatggtct gtctcttcga ttaaattgaa gttaatcttt aagtgaaaaa 721 gcttaaatgt acttggaggg cgataagacc ctatagatct ttacatttaa ttcttttgtc 781 ttgcggtagg taattagaca gagtaaaaca atgttcggtt ggggcgacgg taagaacaga 841 ataaacactt acaacataaa cacatcaata aatgaccatt gatccttaga tgaataaaga 901 ccaagttacc ttagggataa cagcgtaatt ctttttgaga gttcaaatcg acaaaagagt 961 ttgcgagcct cgatgttggt tcagggaccc tactcggtgc agcagcctag agaggcagtc 1021 tgttcgactg ttaaacccct acgtgatctg agttcagacc gacgtaagcc aggttagttt 1081 ctacccccaa ggcaaatcca ctcatgatag tacgaaagga cctcttgaga ttttat // LOCUS RATI145TR 9852 bp ss-mRNA ROD 14-AUG-1990 DEFINITION Rat inositol-1,4,5-triphosphate receptor mRNA, complete cds. ACCESSION J05510 KEYWORDS inositol-1,4,5-triphosphate receptor. SOURCE Rat adult brain, cDNA to mRNA, clones pI16, pI17, pI19, and pI25. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 9852) AUTHORS Mignery,G.A., Newton,C.L., Archer,B.T.III. and Suedhof,T.C. TITLE Structure and expression of the rat inositol-1,4,5-triphosphate receptor JOURNAL J. Biol. Chem. 265, 12679-12685 (1900) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.C.Suedhof, 19-APR-1990. FEATURES from to/span description pept 330 8579 inositol-1,4,5-triphosphate receptor (alt., clone pI16) pept 330 1280 inositol-1,4,5-triphosphate receptor, exon 1 (alt., clones pI17, pI19, and pI25) 1326 8579 inositol-1,4,5-triphosphate receptor, exon 2 (alt., clones pI17, pI19, and pI25) IVS 1281 1325 inositol-1,4,5-triphosphate receptor intron A (no splice consensus) signal 9033 9083 poly-A signal signal 9832 9837 poly-A signal variant 752 752 a in pI16; g in pI25 variant 802 802 g in pI16; a in pI17 variant 1923 1923 g in pI16; a in pI17 variant 2181 2181 g in pI16; a in pI17 variant 3340 3340 g in pI16; a in pI17 BASE COUNT 2688 a 2324 c 2498 g 2342 t ORIGIN 1 tggggtcact tccttcacac aggactttgg cggaaatctc tacacccgtg tctcgtcatc 61 cccttcgaga gcattaacta tttaaaacca ggagagaaag cgcacgccga gaggaggtgt 121 gggtgttccg ctcccatcct aacggaacga gctccctctt cgcggacatg ggattgccca 181 gcggctgcta acccctgtcc tggtcctgat ccctcaaacc ggcgtggctc cccggtcacc 241 aaggagctga ctacaaagga tttgcatcct tggctgggag tccattggct acagagtgcc 301 tgacctgggt caggcttttc aacaaggaca tgtctgacaa aatgtctagt ttcctacata 361 tcggagacat ttgttctctg tatgcagagg gatctacgaa tggatttatc agcaccttgg 421 gcttggttga tgaccgttgc gttgtacagc cagaagctgg ggaccttaac aatccaccca 481 agaaattcag agactgcctc tttaagctat gtcctatgaa tcgatattct gcacagaagc 541 agttctggaa agctgctaag cccggggcca acagtacaac agatgcggtg ctgctcaaca 601 aattgcatca tgctgcagac ttggaaaaga agcagaatga gacagaaaac aggaaattgc 661 tggggacggt catccagtat ggcaacgtga tccagctcct acatttgaaa agcaataaat 721 acttaactgt gaataagagg cttcctgccc tactggagaa gaacgccatg agagtgacgt 781 tggatgaggc tggaaatgaa gggtcttggt tttacattca gccgttctac aagctccgct 841 ccatcggaga cagtgtggtc ataggtgaca aggtagtttt gaaccctgtc aatgctgggc 901 agccgctaca tgccagcagc catcagctgg tggataaccc gggctgcaat gaggtcaact 961 ccgtcaactg taacacaagc tggaaaatag tgcttttcat gaaatggagt gataacaaag 1021 atgacattct aaaaggaggt gatgtggtaa gactcttcca tgctgagcaa gagaagtttc 1081 tcacgtgcga tgagcacagg aagaagcagc acgtcttcct gaggacaacc ggcagacagt 1141 cagccacgtc ggccaccagc tccaaagccc tgtgggaagt ggaggtagtc cagcatgacc 1201 catgtcgggg tggagctggg tactggaata gcctcttccg gttcaagcac ctggctacag 1261 ggcattactt ggcagcagag gtagaccctg actttgagga agaatgcctg gagtttcagc 1321 cctcagtgga ccctgatcag gacgcatctc ggagcaggtt gagaaatgcc caagaaaaaa 1381 tggtgtactc tctggtctct gtgcctgagg gcaacgacat ctcctccatc tttgagctag 1441 accccacaac tctacgtgga ggtgacagcc ttgtcccaag gaactcctat gtccggctca 1501 gacacctgtg tacaaacacc tgggttcaca gcacaaacat ccccattgac aaggaagagg 1561 aaaaacctgt catgctgaag attggtacct ctcccctgaa ggaggacaag gaagcttttg 1621 ccatagtccc cgtttctcct gctgaggttc gggacctgga ctttgccaat gatgccagca 1681 aggtgctggg ctccatcgct gggaagttgg aaaagggcac catcacccag aatgagagaa 1741 ggtctgtcac caagctgctg gaagacttgg tttactttgt cacgggtgga acgaactctg 1801 gccaagacgt gcttgaagta gtcttctcta agcccaaccg agagcggcag aagctgatga 1861 gggaacagaa tattctcaag cagatcttca agcttttgca ggcccccttc acggactgcg 1921 gggatggccc aatgcttcgg ctggaggagc tgggggacca gcggcacgct cctttcagac 1981 acatttgccg actctgctac agggtcctga gacactcaca gcaagactac aggaagaacc 2041 aggagtacat agccaagcag tttggcttca tgcagaagca gattggctat gatgtgctgg 2101 ccgaagacac catcactgcc ctgctccaca acaaccgaaa gctcctggaa aagcacatca 2161 ccgcagcaga gattgacacg tttgtcagcc tggtgcgaaa gaacagggag cccaggttct 2221 tggattacct ctctgacctc tgtgtatcca tgaataagtc aatccctgtg acccaggagc 2281 tcatttgtaa agctgtgctg aatcccacca atgctgacat cctgattgag accaagttgg 2341 ttctttctcg ttttgagttt gaaggcgttt ccaccggaga gaatgctctg gaagctgggg 2401 aagatgagga agaggtgtgg ctgttctgga gggacagcaa caaagagatc cgtagcaaga 2461 gtgttaggga attggcacaa gatgctaaag agggacagaa ggaagacagg gatgtcctca 2521 gctactacag gtatcagctg aacctctttg caaggatgtg tctggaccgc cagtacctgg 2581 ccatcaatga aatatccggg cagctggatg ttgacctcat cctccgctgc atgtccgatg 2641 agaacctccc ctatgacctc agagcatcct tttgccgcct catgcttcac atgcatgtgg 2701 accgagatcc ccaggagcag gtgacccctg tgaaatacgc ccgtctgtgg tcagaaattc 2761 cctccgagat cgccattgac gactatgaca gcagtggagc atcgaaggat gaaatcaagg 2821 agaggtttgc tcagacgatg gagtttgtgg aggagtacct gagagatgtg gtttgtcaaa 2881 gattcccctt ttctgataag gagaaaaata agctcacatt tgaggttgtg aacttagcca 2941 ggaacctcat atactttggc ttctacaact tttctgacct tctccgatta accaagatcc 3001 tcttggcaat cttagactgt gttcatgtga caaccatctt ccccattagc aagatgacga 3061 aaggcgaaga gaataaaggc agcaacgtga tgagatctat ccatggagtt ggggagctca 3121 tgacccaggt ggtgcttcgg ggaggaggct ttttgcccat gactcctatg gctgctgccc 3181 ctgaaggaaa tgtgaagcag gcagagccag agaaagagga tatcatggtc atggacacca 3241 agttaaagat cattgaaata ctccagttta ttttgaatgt gagattggat tataggatct 3301 cctgcctcct gtgtatattt aagcgggagt ttgatgaaag caattcccag tcatcagaaa 3361 catcctccgg aaacagcagc caagaagggc caagtaatgt accaggtgct cttgactttg 3421 aacacattga agagcaagca gaaggcatct ttgggggaag tgaggagaac acacctctgg 3481 acttggatga tcatggtggc agaaccttcc tcagggtcct gctccacttg acaatgcatg 3541 attacccacc cctggtgtct ggggccctgc agctcctctt ccggcatttc agccagaggc 3601 aggaggtcct tcaggccttc aaacaggttc aactgctggt tactagccaa gatgtggaca 3661 actacaaaca gatcaagcaa gacttggacc agctaagatc catcgtggag aagtctgagc 3721 tctgggtata caaaggccag ggtcccgatg agcctatgga cggagcctct ggtgaaaacg 3781 aacataagaa aactgaggag gggaccagta agccactgaa gcacgaaagc accagcagct 3841 acaactaccg agtggtgaaa gagattctga ttcgacttag caagctctgt gttcaagaga 3901 gcgcgtcggt gaggaagagc aggaagcagc agcagcgact gctgaggaac atgggtgccc 3961 atgctgtggt gctggagctg ctgcagatcc cctatgagaa ggccgaagat accaagatgc 4021 aggagatcat gaggctggct cacgaatttc tgcagaattt ctgcgcaggc aaccagcaga 4081 accaagcttt gctacataaa cacataaacc tgtttctcaa cccagggatc ctggaagcag 4141 tgaccatgca gcacatcttc atgaacaact tccagctgtg cagtgagatc aacgagagag 4201 tggtccagca tttcgttcac tgcatagaga cccacggtcg aaatgtccag tatatcaagt 4261 ttctccagac tattgtcaag gccgaaggga aattcattaa aaagtgccaa gacatggtca 4321 tggctgagct tgtcaattct ggagaggacg tcctcgtgtt ctacaatgac agagcctctt 4381 tccagactct gatccagatg atgcggtccg agcgtgaccg gatggatgag aacagccctc 4441 tcttcatgta ccacatccat ctggtggagc tcttggctgt gtgcacagag ggcaagaatg 4501 tgtacacaga gatcaagtgc aactccctgc tcccgctcga tgacatcgtt cgtgtggtca 4561 ctcatgaaga ctgtatcccc gaggttaaga ttgcttacat caacttcctg aatcactgct 4621 atgtggacac tgaggtggag atgaaggaga tttacacaag taatcacatg tggaagttgt 4681 ttgagaattt cctcgtggac atctgcaggg cctgtaacaa cacaagtgac aggaagcatg 4741 cagactcggt tctggagaag tacgtcactg aaatcgtgat gagcattgtc accaccttct 4801 tcagctctcc cttctcagac cagagtacca ctctgcagac tcgccagcct gtctttgtgc 4861 agctcctgca aggcgtgttc cgagtttacc actgcaactg gttgatgccg agccagaaag 4921 cctcggtgga gagctgtatc cgcgtgctct ctgacgtagc caagagccgg gccatagcca 4981 ttcctgttga cctagacagc caagtcaaca acctcttcct gaagtcccac aacattgtgc 5041 agaaaacagc catgaactgg cggttatcag cccgcaacgc tgctcgtaga gactctgtcc 5101 tggcagcttc cagagactac cgaaatatta ttgagaggtt acaggacatc gtgtctgccc 5161 tggaggaccg actcaggccc ctggtccagg ccgagctgtc tgtgctcgtg gatgttctac 5221 acagaccgga actgcttttc cctgagaaca cggatgccag gaggaaatgc gaaagtggag 5281 gtttcatctg caagctcata aaacacacca agcagctgct agaggagaat gaagagaaac 5341 tgtgcattaa agtcttacag accctcaggg aaatgatgac caaagataga ggctatggag 5401 aaaagcaaat ttccattgat gaattggaaa atgccgagct gccacagcca ccggaagctg 5461 agaactccac agaggagctt gaaccaagtc cacccctgag gcaactggaa gaccataaaa 5521 ggggtgaggc actccggcaa attttggtca accgttacta tggaaacatc agaccttcag 5581 gaagaagaga gagccttacc agctttggca atggtccact atcaccagga ggacccagca 5641 agcctggggg aggaggggga ggtcctgggt ccggctccac aagccggggt gagatgagcc 5701 tggctgaggt tcaatgtcat ctcgacaagg aaggcgcctc caatctggtc atcgacctca 5761 taatgaatgc atccagcgac cgagtattcc acgaaagcat tctcctggcc atcgcacttc 5821 tggaaggagg caacaccact atccagcact catttttctg ccggctgacg gaagataaga 5881 aatcagagaa gttcttcaag gtcttttacg atcgaatgaa ggtggcccag caggaaatca 5941 aggcgacagt gaccgtgaac accagcgatt tgggaaacaa aaagaaagat gatgaagtgg 6001 accgggatgc cccatcccga aagaaagcca aagagcccac aacacagata acagaagagg 6061 tccgggatca gctcttggaa gcatctgctg ccaccagaaa agcctttacc accttccgga 6121 gggaggccga ccctgatgac cattaccaat ctggggaggg cacccaggct acaacggaca 6181 aagccaagga tgacctagag atgagtgcag tcatcaccat catgcagcct atcctgcgct 6241 tcctgcagct cctgtgtgaa aaccacaacc gcgatctgca gaatttcctt cgttgccaaa 6301 ataataagac caactacaac ttggtgtgtg agacactgca gtttctggac tgtatctgtg 6361 ggagcacaac tggaggcctt ggtcttctcg gactgtacat aaatgaaaag aatgtagctc 6421 ttatcaacca aaccctggag agtctgactg agtactgtca agggccttgc catgaaaacc 6481 agaactgcat cgccacccac gagtccaatg gcatcgatat catcacagcc ctcatcctta 6541 acgatatcaa ccctctgggg aagaaacgga tggacctggt gttagaactg aagaacaatg 6601 cttcaaagct actcctggcc atcatggaaa gcagacacga tagtgaaaat gcagagagga 6661 tcctgtacaa catgaggccc aaggagctgg tggaagtgat caagaaggcc tacatgcaag 6721 gcgaagtgga gtttgaggat ggggagaacg gtgaggatgg agcagcctcc cccaggaacg 6781 tgggccataa catctacatc ctcgctcacc agttggctcg gcataacaaa gaacttcaaa 6841 ccatgctgaa acctggaggc caggtggatg gcgatgaagc tctggagttc tatgcgaagc 6901 acacagcaca aattgagatt gtcagactgg accgaactat ggaacagatt gtcttccctg 6961 tgcccagcat ctgtgaattt ctgactaagg aatcgaaact tcgaatatat tacaccacag 7021 agagggatga gcaaggcagc aagatcaatg acttcttcct gcgctccgag gacctcttta 7081 atgagatgaa ctggcagaag aaacttcgag cccagcctgt cttgtactgg tgcgcccgaa 7141 acatgtcttt ctggagcagc atctccttca acctggccgt cctgatgaac ctgctggtgg 7201 cgtttttcta cccatttaaa ggagtgagag gaggaacact agagccgcac tggtccggcc 7261 tcctgtggac agccatgctc atctctctgg caattgtcat tgcgctgccc aagccccatg 7321 gcatccgggc cttaattgct tctacaatcc tacgactgat attttcagtt ggattacagc 7381 ccacactgtt tctgctggga gctttcaatg tctgcaataa aatcatcttc ctgatgagct 7441 ttgtgggcaa ctgtgggacc ttcaccagag gctaccgagc catggttctg gacgttgagt 7501 tcctctatca tttgctgtat ctactgatct gtgccatggg cctcttcgta catgagtttt 7561 tctatagttt gctgcttttt gatttagtgt acagagagga gactttgctt aatgtcatta 7621 aaagtgtcac ccgcaatgga cggcccatca tcttgacagc ggccctggct ctgattctgg 7681 tttacctgtt ctcaattgtg ggctacctgt tcttcaagga tgactttatc ttggaagtag 7741 acaggttgcc caatgagaca gctggtccag aaactggtga gagtttggcc aacgatttcc 7801 tgtactctga tgtgtgccgg gtagagacgg gggagaactg cacctctccg gcacccaaag 7861 aagagctgct ccctgtggaa gaaacggaac aggataagga gcacacgtgt gagacactgc 7921 tcatgtgcat cgtcaccgtt ctgagtcacg ggctgcggag tgggggcgga gtaggagatg 7981 tgctcaggaa gccatccaaa gaggagcctc tgtttgctgc aagggtgatc tatgacctcc 8041 tcttcttctt catggtcatc atcattgtcc tgaacctcat ttttggggtc atcatcgaca 8101 cctttgctga cctgaggagt gagaagcaga agaaggaaga gatcttaaaa accacgtgct 8161 tcatttgtgg cttggaaagg gacaagtttg acaataagac cgtcaccttt gaagagcaca 8221 ttaaggaaga acacaacatg tggcactatc tgtgcttcat cgtgctggtg aaagtgaagg 8281 actccaccga gtacaccggg cccgagagct acgtggcaga gatgatcagg gaaagaaacc 8341 ttgattggtt ccccaggatg agagccatgt ccctggtcag cagtgactct gaaggggaac 8401 agaacgagct gaggaacctg caggagaagc tggagtccac catgaagctg gtcaccaatc 8461 tttctggcca gctgtcagaa ctaaaggacc agatgacaga acaaaggaag cagaaacaaa 8521 gaattggcct tctaggacat cctcctcaca tgaatgtcaa cccacagcag ccagcctaag 8581 caaatgagac ggagggactg tactcaaccc tttttaaatt attgtcagcg tgggtctggc 8641 tcttggttct gatctgccct ctaaggttac atgtgcgctt agtacatttg taaatactcg 8701 gttttgtatt gtatgtatat gattgcaatt ctcagaggtt tggactttcg tattgtaatt 8761 agctctgttg gcatggtgac ttgtcactcc tgccaaaaat attaaaaatg ccttttttgg 8821 aaggactaca gaaagtacct gatttgcact tgaaccagat tatagattta aaaagtattt 8881 gacatgtatt ttgttattta aaactagaat agccagtatt tatgtttttt ataaaactgt 8941 gcaatacaaa ttatgcaatc accataactc tgtaactcct gagtgtctta agggagtgcg 9001 catctttgaa gctgatctgt taatacttat gtaataaatg gttaaatatc aaatgatgat 9061 gctgctgcca aaattatatt aatagcaagt ttctggcccc tgggcaattt tgtaccttgt 9121 aattatccta tggtgatgct gtttctcgtt gctaatggca ttagtgcctc gtatcctagt 9181 gataactcca ggtctgtgaa ccattcaaac aacattcatt tttgagacaa agcaacttta 9241 gtttcaaggg taattttaag cttcaaaatt aatcatttaa agtatttctt taagagagcc 9301 atgttagagg ctcacacgtt agcttgaaag gagtcgatga aataattttt ttaaagggaa 9361 ctttttacac gacgtttgga ataacagcat attgctgacc agtcagtgtc atctcccagg 9421 tgaattttga tgtcacgtta tagtcaaatg agctagctta tggtttctag attctcttca 9481 tctgaaccaa catgatgcag taggtgagaa gttattatgc atatatacac atatacatcc 9541 atataaggca aagtaggagc tgtcccctta ggatgcatag ctgccccctt agggtgcaga 9601 gctcaacact gacagtgctg ttctgagaga gccacgtttg ggttttattt ctttgtcaca 9661 tgatttcttt tctggatgtg tgcaagtata acaggaagtg ttttctacct gttgcttgtt 9721 ttgtacctgg gtctcgtttt actagaccat ctctgcacaa aagtttaaaa aatgaaccgt 9781 aagcagagtt ctgaagcaag tcaagtttgt aagtgcatac ctgaaattat ttaataaatg 9841 acgcagaata ct //
GenBank-Updates@genbank.bio.net (08/15/90)
LOCUS RHPNIFDK 3500 bp ds-DNA PLN 15-AUG-1990 DEFINITION Parasponia rhizobium nifD and nifK genes coding for the alpha- and beta-subunits of the Mo-Fe protein of nitrogenase, complete cds. ACCESSION X01139 KEYWORDS nitrogenase. SOURCE Parasponia rhizobium (strain ANU289) DNA, clones pR289nif-[3,4,5]. ORGANISM Parasponia rhizobium Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Hamamelidae; Urticales; Ulmaceae. REFERENCE 1 (bases 1 to 3500) AUTHORS Weinman,J.J., Fellows,F.F., Gresshoff,P.M., Shine,J. and Scott,K.F. TITLE Structural analysis of the genes encoding the molybdenum-iron protein of nitrogenase in the Parasponia rhizobium strain ANU289 JOURNAL Nucleic Acids Res. 12, 8329-8344 (1984) STANDARD simple staff_review COMMENT EMBL features not translated to GenBank features: key from to description PRM 108 124 consensus promotor sequence SITE 135 135 transcription start RBS 163 168 pot. ribosome binding site RBS 1753 1758 pot. ribosome binding site SITE 3322 3355 pot. stem-loop structure FEATURES from to/span description pept 176 1678 Mo-Fe protein alpha-subunit pept 1767 3308 Mo-Fe protein beta-subunit BASE COUNT 826 a 957 c 992 g 725 t ORIGIN 1 bp upstream of EcoRI site. 1 gaattctccg tgcaaagcgc gatgtcgcct tcgcaacaac aaccagcccc atcggacgaa 61 acgcgctaac tgtttttatt tattctgctt tttgtgctcg cgccgcgctg gcatgctcgt 121 tgcagtcttg ttcaagaagc tgctcccgca cagttaattc ttgaaggaca tcagcatgag 181 tctcgccacg acccagagca tcgcagaaat cagggctcgc aataaagagc tgatcgagga 241 ggtgctgaaa gtctatccgg agaagaccgc gaaacggcgt gccaagcacc tcaacgttca 301 ccaagccggc aagtcggact gcggggtcaa gtccaacatc aaatcaatac ctggtgtgat 361 gacaatcaga ggctgcgcct atgcaggatc caaaggggtg gtctggggac cgatcaagga 421 catggtccat atcagccatg gcccggtcgg ctgtggtcag tattcgtggg gctcgcgtcg 481 caactattat gttggcacga cgggcgtcga tagtttcgtg accctgcagt tcacctccga 541 cttccaggaa aaggacatcg tatttggcgg cgacaagaag ctgatcaaag tccttgacga 601 aatccaggag ctgttcccgc tcaacaacgg catcaccatc caatcggaat gcccgatcgg 661 actgatcggg gacgacatcg aggctgtgtc aagatcgaaa tccaaagaat acggcggcaa 721 gaccatcgtg cctgttcgct gtgagggctt tcgcggcgtg tcgcaatcgc ttggccacca 781 cattgccaat gacgcggtgc gcgattggat cttcgacaag ctagagcccg agggcgaacc 841 aaagttccag ccgacgccct acgacgttgc gatcatcgga gactacaata ttggcggcga 901 tgcctggtca tcgcgcattc tgctggaaga aatgggcttg cgggtgattg cgcagtggtc 961 cggcgacggt tccctcgccg aactcgaagc aacgccgaag gcaaagctca atattctgca 1021 ttgctaccgt tccatgaact acatctcccg ccacatggag gagaagtttg gcatcccctg 1081 gtgcgagtac aacttcttcg gaccgtcgaa gatcgcagaa tcgctgcgca agattgcggg 1141 ctatttcgac gacaagatca aggaaggcgc cgagcgagta attgaaaaat accagccact 1201 ggtggacgcc gtaatcgcaa aatatcgccc ccgcctggag ggcaagactg tgatgctgta 1261 cgtcggcggg cttcgtccac gtcatgtgat tggcgcgtac gaggatctcg gcatggaagt 1321 cgtgggcacc ggatacgagt tcggccacaa cgacgattat cagcgcaccg cccagcacta 1381 cgttaaggac agcacgctca tctacgacga cgtcaatggc tatgaattcg agcgcttcgt 1441 cgaaaaggtc caaccagatc tggttggctc gggcatcaag gagaaatacg ttttccaaaa 1501 gatgggtgtg ccgttcccgg agatgcattc ctgggactat tccggcccat atcacggcta 1561 tgacggcttt gcgatcttcg cgcgggacat ggacatggct gtcaactcgc cgatctggaa 1621 gaagacgaag gccccctgga aggaagctgc gaagccgaag ctcttggctg cagaataaca 1681 agcacttggt tccacaatag agcgatcaat cccgctctct gcggagagct ggggcgacat 1741 catttcgata gtgaaggatc ttaacaatgg cgcagagtgc agaccatgtg ctcgatcatc 1801 tcgaactgtt ccgcggtcca gaataccaac aaatgctggc cgacaagaag atgttcgaga 1861 atccccgcga tcctgccgag gtcgaacgta tccgagcagt gacgaaaacg cccgaatatc 1921 gcgagaagaa ttttgcggag gcgcttgcgg taaatccggc caaggcttgc cagccgcttg 1981 gcgccgtatt cgtctcggtt ggttttgaag gcacgctgcc cttcgtccat ggctcgcagg 2041 gctgcgtggc ctattaccgc agccatctgt cgcggcactt caaggagccg agctcctgcg 2101 tgtcttcgtc gatgacggaa gacgccgctg tattcggggg gctgaacaat atgatcgatg 2161 gcctcgccaa cagctacaac atgtacaaac ccaagatgat ttgctcgacg acctgcatgg 2221 ccgaggtgat cggcgatgac ctgaacgcct tcatcaagac atcaaaagaa aaaggctcgg 2281 ttcggcggag ttcgactcct ttcgcgcaca ctccagcgtt cgtcggcagc cacgtcaccg 2341 gctatgacaa cgcactcaag ggcattctcg agcacttttg gaacggcaag gccggaacgg 2401 cgccgaagct ggagcgcaaa ccaaacgagg caatcaacat catcggcggt ttcgatggca 2461 ataccgttgg aaaccttcgt gagatcaagc gaatcttagc gttgatgggc atcaaacaca 2521 cgattctcgc cgataactct gaagtcttcg ataccccgac tgatggcgag ttccggatgt 2581 atgacggcgg tacccacgtg gaggacacgg ccaacgcgat tcacgccaag gcgacaatct 2641 ccatgcagca atggtgtacg gaaaaaacgc tgccgttcgt gtccgagcat ggacaggacg 2701 ttgtgtcttt caattacccg gtaggtgtat ccgcgacgga tgatcttctc gtggccttgt 2761 cacgcatcag cggcaaggag attccggagc aactcgcgcg agagcgtggc cgcttggttg 2821 atgccatcgc ggattccagc gcgcatatcc atggcaagaa gttcgcgatc tacggcgatc 2881 cggatctctg ctatgggttg gctgcctttc tgctcgaact cggcgccgag cctactcatg 2941 tgctgtccac caacggcaac aacgtggcag gagaaaatgc gacgctgttt gcaggctcgc 3001 catttggaga acttccagcc tatccgggac gagacctctg gcacatgcgc tcgctcttgt 3061 tcacagagcc ggttgacttt ctgattggca acacccatgg caagtacctg gagcgtgaca 3121 ctggaacgcc attgatccgc atcggctttc caatttttga tcggcatcac catcaccgct 3181 tccctgtatg gggctatcag ggcggcctga atgtgctggt gaagatcctc gacaagatct 3241 tcgacgaaat cgacaagaag accagcgttc ttggcaaaac tgactacagt ttcgacatca 3301 ttcgttgatg acgggcagtg cgcgtgggct cgccgaaaca gcggcgagcc cacgctgggc 3361 actggttgac attgaaattt tcttccgctg agaggaaaat gctgatgagt tcgtctagtc 3421 ggccacggtc cagggtattt tcaggcgaac cgggctgccg aagaatggaa gtaagtcgga 3481 ggctgagcgc aagaagggct // LOCUS RHPNIFH 2030 bp ds-DNA PLN 15-AUG-1990 DEFINITION Parasponia rhizobium nitrogenase (nifH) gene, iron protein component. ACCESSION K00487 KEYWORDS nifH gene; nitrogenase; unidentified reading frame. SOURCE Parasponia rhizobium (strain ANU289) DNA. ORGANISM Parasponia rhizobium Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Hamamelidae; Urticales; Ulmaceae. REFERENCE 1 (bases 1 to 2030) AUTHORS Scott,K.F., Rolfe,B.G. and Shine,J. TITLE Nitrogenase structural genes are unlinked in the nonlegume symbiont Parasponia rhizobium JOURNAL DNA 2, 141-148 (1983) STANDARD full staff_review COMMENT [1] states the iron protein subunit is encoded on a separate operon from other components of the nitrogenase enzyme complex, unlike previously studied nitrogen-fixing prokaryotes. FEATURES from to/span description pept 576 1460 nifH (nitrogenase iron protein) mRNA 421 > 1460 nifH mRNA BASE COUNT 430 a 572 c 618 g 410 t ORIGIN 5 bp upstream of PstI site 1 ctgcagggcc cttgtaaggc gcttcttgct gcctttaagc tcatgcgcac cgatctgatc 61 agctggatca atcgggaggt cagccgcaca attgatctcg tcatcctcga ccacgaaccc 121 catcgccggc cacttgcctt gaggttctga cctcgacctg catattgctc tccgcggatt 181 gccgccactg gcttgcaaga agaggagcaa gtcccgttcc agttgaggaa atcgaaccag 241 atcatgccaa accggcgttt tccggttgat gggtgtggcc gttgttcgtt ttctgacagc 301 cgcgcagatc ctgtccggtg caaacctccc tggggtagct cagcggctcg ttggcttttt 361 agagcgtaat caagaagctt aataagcgcg gacagtgttg gcatggcgat tgctgttgag 421 ttgcagcaac actgagtgag ggctgggtgc acgccgacgc gtaagacgag cgatgcgctc 481 cttcccttga acccgtgtgc cccgtttctg agagagaaac aagctcgcgt gtcggaagca 541 cgcaactttt ggcaaatcgg ttgatggaga acaacatgtc ttcactgaga caaatcgcgt 601 tctacggaaa gggcggcatc ggcaagtcga ccacgtccca gaatacgttg gcggcactgg 661 ccgagatggg ccagaaaatc ctgatcgtgg gatgcgatcc taaggcggac tcgacgcgcc 721 tcatcctgca cgcgaaggcg caggacacga ttttgagcct tgcagcgagc gctggcagcg 781 tggaagacct cgaactcgag gacgtgatga aggtcggcta caaggacatc cgatgcgtgg 841 agtccggtgg tcccgagccg ggtgtcggct gcgcgggccg cggcgtcatc acctcgatca 901 atttcctgga ggagaacggc gcctatgaga acattgacta tgtctcatat gacgtgctcg 961 gcgacgtcgt ttgcggtggc tttgcgatgc cgatccggga aaacaaggcg caggagatct 1021 atatcgtgat gtctggagaa atgatggcaa tgtatgccgc aaacaatatc tccaaaggta 1081 tcctgaaata cgccaactct ggcggcgtgc ggctgggcgg cctgatctgc aacgagcggc 1141 agaccgataa ggagctggag ctggcggagg cgctggccaa gaagttaggt actcagctga 1201 tctacttcgt gccgcgcgac aatgtggtgc agcatgccga gctacggcgc atgacggtgc 1261 tggagtatgc ccctgagtcg cagcaggccg atcactatcg caatcttgcg accaaggttc 1321 acaacaatgg cggcaaaggc atcattccga ctccgatctc catggatgag ctcgaggaca 1381 tgctgatgga gcatggcatt atgaagcccg tcgacgaatc catcgtcggc aagaccgccg 1441 ccgaactcgc ggcctcgtaa aggtcgcggg tcgcggcctt gtgaaggcgc gcgacggatg 1501 ccggtctccc tcacccccca tccggggaga ccggcattct gacgattatc tgaccagcca 1561 gagtggagct ggcaaccgtg accgctatgg gaacccaaaa catcatgaca ggagcgcact 1621 tccttccgct tatggcttct tgcgccgtcg aggcgagcag caaggtgcaa agaggaattg 1681 cgacctaccg agcgctcact ggcgtcctcc tgaagaggcc gacattgcga ccgacagcaa 1741 tttcgattgc catgtcctgg cgtcaatcct ggcggccgct cgatggatgg tggcccgctt 1801 cccgagcgcc ctgtccgcca ccagctggcg accctgctcg cagcaatttc catcggttga 1861 ggtcgatatc tcggagcagc tcctggcgtc taagtgcgat gagaatgacg agatcgcgat 1921 ggtgcgcgat cttttgctca agcaacgctc gacggacggg catattcggg ctggctagcc 1981 gcgatgattg cgcgccgcgc catagagcca gatcacctgt gggaagatct // LOCUS RHPHBEM 1520 bp ds-DNA PLN 15-AUG-1990 DEFINITION P.andersonii haemoglobin gene, complete cds. ACCESSION M36509 KEYWORDS haemoglobin. SOURCE P.andersonii DNA. ORGANISM Parasponia andersonii Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Hamamelidae; Urticales; Ulmaceae. REFERENCE 1 (bases 1 to 1520) AUTHORS Landsmann,J., Dennis,E.S., Higgins,T.J.V., Appleby,C.A., Kortt,A.A. and Peacock,W.J. TITLE Common evolutionary origin of legume and non-legume plant haemoglobins JOURNAL Nature 324, 166-168 (1986) STANDARD simple staff_review FEATURES from to/span description pept 198 313 haemoglobin, exon 1 436 550 haemoglobin, exon 2 877 993 haemoglobin, exon 3 1153 1293 haemoglobin, exon 4 IVS 314 435 haemoglobin intron A IVS 551 876 haemoglobin intron B IVS 994 1152 haemoglobin intron C BASE COUNT 470 a 281 c 276 g 493 t ORIGIN 1 ttatcttact aaaaagaaaa cgaaaataaa aaacccaaag atatggctcc ccaataccct 61 gaagagttac acacgatccc cattttttct actatatata cagagtgcct tcaccagatt 121 ttccaaacac actccaacat atcccattgc ccaaataaaa atttctcagc ttttagtccc 181 ctcaacccac agaagccatg agcagctcag aagttaacaa agttttcaca gaggagcagg 241 aagctctggt ggtgaaagca tgggctgtaa tgaagaagaa ctctgctgaa ctgggtcttc 301 aattcttcct caagtaagtc aaaattatat atagtacact ttttatttac tttgcttctt 361 ttatagacca agtttttgaa taaaagggta ctattttttt ttcctgaaaa aaattggttg 421 attgaaactt tgcaggatat ttgagattgc accgtctgcc aagaacttgt tctcttattt 481 gaaggactct ccggttcctt tggagcagaa cccaaagctc aagccccatg ctacgactgt 541 cttcgttatg gtaaagccaa cttttgttct cctattccct tatcctaatt ttacaagaat 601 ctaatgttaa taaaatagta ttttgcctat ttaaacaacc aaaaatttag acacaactat 661 ataaaacatt taaattcttg tggtttatga taccttgatc tacaatgatt ccaacttccc 721 gtgttgcatt tatgagttgt gctagcaaca gtcgcatcac agtcgtctat tccagaaagg 781 acgactgtga ctcttgagac atatcaaagc aaagctcagc aatttttatg tttctcactt 841 gctctgttct ttttctctgg tacttgtcct ggaaagacat gtgagtctgc ggttcaactt 901 cggaaagccg gaaaagtgac agtgaaagaa tcagacttga aaagaattgg ggctatccac 961 ttcaaaactg gcgtagttaa tgaacatttt gaggtactac cctggccact tagtagatat 1021 aattccctaa gtgtaatcca aacatttgtt gtttagagtc aaattattat tattctgtat 1081 ggtggttctt gaataatcga tcttattatg gtatttacta attatattat gcatgggaaa 1141 aacgatttgt aggtcacaag gtttgcactt ttggagacca taaaggaagc agtaccagaa 1201 atgtggtcac ctgagatgaa gaacgcatgg ggagtagctt atgatcagtt ggttgctgcc 1261 atcaagttcg aaatgaaacc ctccagtact tgagaatttt tatagttctt ggaacaattg 1321 ggtttgaata atgtgacaaa acttatactt aattacgttt gcatgagaga gaggtaataa 1381 ttgcatagtg tataacttgc atatgtatca tagtgtgacg caatctctcc acttgtgttg 1441 ttcatcttgt tcaaaaggaa ttagtctttc actttacatt ttgggtggaa gtatggaatg 1501 aaatcagagt ttcattgatt // LOCUS PT7RNAA 266 bp ds-DNA PHG 15-AUG-1990 DEFINITION Bacteriophage T7 RNA polymerase gene 1, 3' end. ACCESSION M24964 M24965 ORGANISM Bacteriophage T7 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 266) AUTHORS Osterman,H.L. and Coleman,J.E. TITLE T7 ribonucleic acid polymerase-promoter interactions JOURNAL Biochemistry 20, 4884-4892 (1981) STANDARD simple staff_review FEATURES from to/span description pept < 1 201 RNA polymerase (gene 1; AA at 1) mRNA < 1 266 gene 1 mRNA BASE COUNT 72 a 66 c 63 g 65 t ORIGIN 1 ccggctgacg ctgcgaacct gttcaaagca gtgcgcgaaa ctatggttga cacatatgag 61 tcttgtgatg tactggctga tttctacgac cagttcgctg accagttgca cgagtctcaa 121 ttggacaaaa tgccagcact tccggataaa ggtaacttga acctccgtga catcttagag 181 tcggacttcg cgttcgcgta acgccaaatc aatacgactc actatagagg gacaaactca 241 aggtcattcg caagagtggc ctttat // LOCUS PT7RNAB 139 bp ds-DNA PHG 15-AUG-1990 DEFINITION Bacteriophage T7 class III RNA polymerase promoter L1 fragment. ACCESSION M24966 ORGANISM Bacteriophage T7 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 139) AUTHORS Osterman,H.L. and Coleman,J.E. TITLE T7 ribonucleic acid polymerase-promoter interactions JOURNAL Biochemistry 20, 4884-4892 (1981) STANDARD simple staff_review FEATURES from to/span description mRNA 58 > 139 L1 mRNA BASE COUNT 44 a 25 c 28 g 42 t ORIGIN 1 cggtatttaa ttaaatattc tccctgtggt ggctcgaaat taatacgact cactataggg 61 agaacaatac gactacggga gggttttctt atgatgacta taagacctac taaaagtaca 121 gactttgagg tattcactc // LOCUS PT7RNAC 141 bp ds-DNA PHG 15-AUG-1990 DEFINITION Bacteriophage T7 L2 nonpromoter fragment. ACCESSION M24967 ORGANISM Bacteriophage T7 Viridae; ds-DNA nonenveloped viruses; Podoviridae. REFERENCE 1 (bases 1 to 141) AUTHORS Osterman,H.L. and Coleman,J.E. TITLE T7 ribonucleic acid polymerase-promoter interactions JOURNAL Biochemistry 20, 4884-4892 (1981) STANDARD simple staff_review BASE COUNT 36 a 35 c 35 g 35 t ORIGIN 1 cggaagtgct ggcattttgt ccaattgaga ctcgtgcaac tggtcagcga actggtcgta 61 gaaatcagcc agtacatcac aagactcata tgtgtcaacc atagtttcgc gcactgcttt 121 gaacaggttc gcagcgtcag c // LOCUS SIVSMMM7 1210 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) pol region. ACCESSION M27256 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate SMM-M7). ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 1210) AUTHORS Li,Y. JOURNAL Unpublished (1989) STANDARD full staff_entry COMMENT This sequence corresponds to the 3' third of the pol gene. Kindly provided in computer-readable form by Yen Li. Author address:Y.Li New England Regional Primate Research Center Southborough, Massachusetts 01772 (508-481-0400). BASE COUNT 478 a 207 c 269 g 256 t ORIGIN 1 gcccggccag taatccgccc accattgctc ccgaatttcg acccctcctc tagtcagatt 61 agtgttcaat ttggtaaagg atcccatcga agaaatagga acattttatg tggatggctc 121 ttgcaataaa cagtcaaaag agggaaaagc aggatacata acagacagaa ggaggagcaa 181 aataaagttc ttagaacaga ctaccaatca gcgagcagaa ttagaagcct ttctcatggc 241 agtaacagat tcaggagcag aggcaaatat tatagtagat tctcaatatg tgatggggat 301 agtgacaagg caacccactg aatcagaaag taaaatagta aatcagataa tagaagaaat 361 gatcaaaaag acagcagtat atgtgacata ggtaccagct cataaaggtc taggaagaaa 421 tcaagaaata gaccatttag ttagtcaaag gattaggcaa gtcttgttcc tagaaaagat 481 agaaccagcc caagaagagc acgaaaaata tcacagcaat gtaaaagaat tggtctttaa 541 atttaggata ccaagattag tagcaaaaca gatagtagat acctgtgata aatgccagca 601 gaaaggagaa gctatacata gacaggtaaa cacagagtta agaatttggc aaatagactg 661 cacacaccta gagggcaaag ttgttatagt agcagtacat gtggctagtg gattcataga 721 ggcagaagta atcccacaag aaacaggaag acagacagca ttgttcctgt taaaattagc 781 tagcaggtgg cccatcacac acctgcacac agataatggt gctaactttg cttcgcaaga 841 agtaaagatg gtagcctagt gggcagatat agaacacacc tttaaggtac catataatcc 901 acaaagtcaa agagtagtag aagcaatgaa tcatcaccta aagaatcaga tagagagaat 961 tagagagcag gcaaattcag tagaaacaat agtgctcatg gcagttcatt gcatgaattt 1021 taaaagaagg ggaggaatag gggatatgac cccagcagaa agattaatta atatgatcac 1081 cacagaacaa gaaatacaat tccaacaatc aaaaaattca aaatttaaaa attttcgggt 1141 ctatttcaga gaaggcagag accaactgtg gaaaggaccc ggtgaattac tgtggaaagg 1201 ggaaggagca // LOCUS ADEAD5A 180 bp ds-DNA VRL 15-AUG-1990 DEFINITION Adenovirus type 5 packaging domain region. ACCESSION M36423 KEYWORDS . SOURCE Adenovirus type 5 (strain dl309) DNA. ORGANISM Mastadenovirus h5 Viridae; ds-DNA nonenveloped viruses; Adenoviridae. REFERENCE 1 (bases 1 to 180) AUTHORS Graeble,M. and Hearing,P. TITLE Adenovirus type 5 packaging domain is composed of a repeated element that is functionally redundant JOURNAL J. Virol. 64, 2047-2056 (1990) STANDARD simple staff_review FEATURES from to/span description site 1 156 packaging domain BASE COUNT 48 a 27 c 52 g 53 t ORIGIN 1 gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag taaatttggg 61 cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga agtgaaatct 121 gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg gcatttgacc // LOCUS CAJFJAAB 1932 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.coli flagellin (flaB) gene, complete cds. ACCESSION M35141 KEYWORDS flaB gene; flagellin. SOURCE C.coli (strain VC167, serogroup LIO 8) DNA. ORGANISM Campylobacter coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1932) AUTHORS Guerry,P., Logan,S.M., Thornton,S. and Trust,T.J. TITLE Genomic organization and expression of Campylobacter flagellin genes JOURNAL J. Bacteriol. 172, 1853-1860 (1990) STANDARD simple staff_review FEATURES from to/span description pept 211 1932 flagellin (flaB) mRNA 185 > 1932 flagellin mRNA BASE COUNT 638 a 325 c 387 g 582 t ORIGIN 1 taacaaatcc aagcctagta gtaatactag gcttttttat ttctaaataa aacttggaac 61 attctttagc gtttactgta atttatacaa atccaagcct agtagtaata ctaggctttt 121 tttatttcta aataaaattt caatttgaat caaaacttgg aacacttctt gctttaatct 181 tttcgatgca atattttgaa aggatttaaa atgggtttta gaataaacac caacatcggt 241 gcattgaacg cacatgcaaa ttcagttgtt aatgctaggg agcttgacaa gtctttaagt 301 agacttagct caggtcttag aatcaactcc gcagcagatg atgcttcagg gatggcgata 361 gcagattctt tgcgttcaca agcagcaact ttaggtcaag ctataaacaa tggtaatgat 421 gctataggta tcttgcaaac tgcagataag gctatggatg agcaacttaa aatcttagat 481 accatcaaga ctaaagcgac tcaagctgct caagatggtc aaagcttaaa aacaagaact 541 atgcttcaag cagacatcaa ccgtttgatg gaagaacttg ataatatcgc aaataccact 601 tcatttaatg gcaaacaact tttaagtggt ggttttacca atcaagaatt ccaaatcggt 661 tcaagttcaa atcaaactat taaagcaagt ataggagcaa ctcagtcttc taaaatcggt 721 gtaacaagat ttgaaacagg ttcacaaagt ttttcttcag gcactgtagg acttactatt 781 aaaaactaca acggtatcga agattttaaa tttgatagtg tagtgatttc tacttcagta 841 ggaacaggtc ttggagcttt ggctgaagag atcaacagaa atgcagataa aacaggaatt 901 cgtgcaactt ttgatgtaaa atctgtagga gcctatgcaa taaaagcagg aaatacttct 961 caggattttg ctatcaatgg ggttgttatc ggacaaataa attataatga cggtgataac 1021 aatggtcaac ttatctcagc tatcaatgct gtaaaagata caactggtgt tcaagcctct 1081 aaagatgaaa atggtaaact tgttcttact tcggccgatg gtagagggat taaaatcaca 1141 ggtagcatag gtgtaggagc tggtatattg cacactgaaa attatggaag gttatcttta 1201 gttaaaaatg atggtagaaa tatcaatata agtggaacag gtctttcagc tataggtatg 1261 ggtgctacag acatgatttc tcaatcttca gtatctctaa gagagtcaaa agggcaaatt 1321 tcagcagcca atgctgatgc tatgggcttt aatgcttata atggcggcgg cgctaagcaa 1381 attattttcg cttctagtat tgcaggattt atgtctcagg ctggttcagg cttctctgct 1441 ggttcgggat tttcagtagg tagtggtaaa aattattcag ccattttatc agcttctata 1501 cagatagtat ctagcgcagc ttctatcagt agcacctatg ttgtttctac tggttcaggt 1561 ttctctgctg gttcaggtaa ttctcaattt gcagctttaa gaataagtac agtaagtgct 1621 catgatgaaa ctgcaggtgt aactacactt aagggtgcaa tggctgtgat ggatatagca 1681 gaaactgcta ttaccaattc tgatcaaatc agagcggata taggtgctgt gcaaaatcag 1741 ctccaagtaa cgataaataa tattaccgta acccaggtaa atgttaaagc agcagaatca 1801 accataagag atgtggattt cgctgcagaa agtgcaaatt tttctaagta caatatcctt 1861 gcgcagtcgg gttcatatgc tatgagccaa cgtaacgctg tgcaacaaaa tgtcttaaaa 1921 cttttacaat aa // LOCUS CAJFLA 1719 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.coli flagellin gene, complete cds. ACCESSION M26945 KEYWORDS flagellin. SOURCE C.coli (strain VC167) DNA. ORGANISM Campylobacter coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic/microaerophilic, motile, helical/vibrioid bacteria. REFERENCE 1 (bases 1 to 1719) AUTHORS Logan,S.M., Trust,T.J. and Guerry,P. TITLE Evidence for posttranslational modification and gene duplication of Campylobacter flagellin JOURNAL J. Bacteriol. 171, 3031-3038 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 1 1719 flagellin BASE COUNT 563 a 284 c 365 g 507 t ORIGIN 1 atgggatttc gtattaacac aaatgttgca gcattaaatg ctaaagcaaa ttcggatcta 61 aacagcagag cattagatca atcactttca agactcagtt caggtcttag aatcaactcc 121 gcagcagatg tagcttcagg gatggcgata gcagatagtt taagatctca ggcaaatact 181 ttgggtcagg ctatatctaa tggtaatgat gctttaggta tcttgcaaac tgcagataag 241 gctatggatg agcaacttaa aatcttagat accatcaaga ctaaagcgac tcaagctgct 301 gaagatggtc aaagcttaaa aacaagaact atgcttcaag cagacatcaa ccgtttgatg 361 gaagaacttg ataatatcgc aaataccact tcatttaatg gcaaacaact tttaagtggt 421 ggttttacca atcaagaatt ccaaatcggt tcaagttcaa atcaaactat taaagcaagt 481 ataggagcaa ctcagtcttc taaaatcggt gtaacaagat tgaacaggtt cacaaagttt 541 tcttcaggca ctgtagggct tactatcaaa aactacaacg gtatcgaaga ttttaaattt 601 gatagtgtag tgatttctac ttcagtagga acaggtcttg gagctttggc tgaagagatc 661 aacagaaatg cagataaaac aggaattcgt gcaacttttg atctaaaatc tgtaggagcc 721 tatgcaataa aagcaggaaa tacttctcag gattttgcta tcaatggggt tgttataggt 781 aaggttgatt attcagatgg tgatgagaat ggttctttaa tttcagctat caatgctgta 841 aaagatacaa ctggtgttca agcctctaaa gatgaaaatg gtaaacttgt tcttacttcg 901 gccgatggta gagggattaa aatcacaggt agcataggtg taggagctgg tatattgcac 961 actgaaaatt atggaaggtt atctttagtt aaaaatgatg gtagagatat caatataagt 1021 ggaacaggtt tttcagctat aggtatgggt gctacagaca tgatttctca atcttcagta 1081 tctctaagag agtcaaaagg gcaaatttca gcagccaatg ctgatgctat gggctttaat 1141 gcttataatg gcggcggcgc taagcaaatt attttcgctt ctagtattgc agggtttatg 1201 tctcaggctg gttcaggctt ctctgctggt tcgggatttt cagtaggtag tggtaaaaat 1261 tattcagcca ttttatcagc ttctatacag atagtatcta gcgcagcttc tatcagtagc 1321 acctatgttg tttctactgg ttcaggtttc tctgctggtt caggtaattc tcaatttgca 1381 gctttaagaa taagtacagt aagtgctcat gatgaaactg caggtgtaac tacacttaag 1441 ggtgcaatgg ctgtgatgga tatagcagaa actgctatta ccaatcttga tcaaatcaga 1501 gcggatatag gttctgtgca aaatcaaatc acatcgacta taaacaacat tactgtaacc 1561 caggtaaatg ttaaatcagc agaatcacaa atcagagatg tagattttgc aagcgagagt 1621 gcaaattact ctaaagcaaa tatattggct caaagtggtt cttatgctat ggctcaagca 1681 aattcaagcc agcaaaatgt tttaagatta ctacagtag // LOCUS CHKLNKPA1 215 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 2. ACCESSION M35035 KEYWORDS cartilage link protein. SEGMENT 1 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP532 and lambda gLP12.1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 51 to 65 and 182 to 195) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 215) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1,2] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pept 87 + 186 cartilage link protein, exon 2 (first expressed exon pre-msg < 1 > 215 cartilage link protein mRNA and introns IVS < 1 60 cartilage link protein intron A IVS 187 > 215 cartilage link protein intron B BASE COUNT 63 a 43 c 47 g 62 t ORIGIN 1 gaattccata aagggttcca aaaaattgat gagcctttct gttatgtgat gcccttacag 61 tgaagaagat tcttgtgact gtgaagatga caagtctact ctttctggtg ctgatttctg 121 tctgctgggc agaacctcat cctgacaact caagcctgga gcatgagagg attattcaca 181 tccaaggtaa ggaaatacat cagaaaacgc ctttt // LOCUS CHKLNKPA2 460 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 3. ACCESSION M35036 KEYWORDS cartilage link protein. SEGMENT 2 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP39.13 and lambda gLP33.7. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 49 to 63 and 429 to 442) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 460) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pept + 59 + 433 cartilage link protein, exon 3 pre-msg < 1 > 460 cartilage link protein mRNA and introns IVS < 1 58 cartilage link protein intron B IVS 434 > 460 cartilage link protein intron C BASE COUNT 145 a 90 c 113 g 112 t ORIGIN 1 tctgtaaaag gtggagtgca gactaattct cctttttgtt tttctccttg aattgtagaa 61 gaaaatggac cccgcctact tgtggtagca gaacaagcta agatcttctc tcagcgaggt 121 ggcaacgtca cactgccttg taaattttac catgaacaca catcaacagc tggctcagga 181 acccacaaaa tccgggtcaa gtggaccaaa ctcacctcag attacctcaa agaagtggat 241 gtctttgtcg caatgggaca ccacagaaag agctacggaa agtatcaggg cagagtgttt 301 ctgagggaaa gcagtgagaa cgatgcctct cttataatca cgaatataat gctggaggat 361 tatgggagat acaagtgcga agtgattgaa ggattagagg acgacacagc agtggtagct 421 ctgaatttgg aaggtaggta acatctaatg tagacttaaa // LOCUS CHKLNKPA3 427 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 4. ACCESSION M35037 KEYWORDS cartilage link protein. SEGMENT 3 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP33.7 and lambda gLP10.1. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 44 to 58 and 352 to 365) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 427) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pept + 54 + 356 cartilage link protein, exon 4 pre-msg < 1 > 427 cartilage link protein mRNA and introns IVS < 1 53 cartilage link protein intron C IVS 357 > 427 cartilage link protein intron D BASE COUNT 99 a 108 c 105 g 115 t ORIGIN 1 aaaaaccctt ctagtgggga ttacccccag ctcacctctt tttgccattt caggtgttgt 61 tttcccctat tctccacgtc tgggtcgtta caacctaaac ttccatgagg ctcagcaagc 121 ttgcctggac caggactcca tcattgcctc cttcgaccag ctctacgagg cctggaggtc 181 agggctggac tggtgcaatg ctggctggct cagtgatggt tcagtgcagt accctatcac 241 caagcccaga gagccctgtg gagggaagaa tacggtgccc ggtgtcagaa actatggctt 301 ctgggataaa gagaggagcc gatatgatgt tttctgcttt acttcaaact tcaatggtaa 361 gaacctggtt tacatttacc ttgcaagggt ctttttccat gctttaaaaa gaaagagatg 421 ccagcgg // LOCUS CHKLNKPA4 826 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 5. ACCESSION M35038 KEYWORDS cartilage link protein. SEGMENT 4 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP10.1 and lambda gLP39.23. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 15 to 29) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 6399-6403 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1 to 826) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pept + 25 314 cartilage link protein, exon 5 pre-msg < 1 > 826 cartilage link protein mRNA and introns IVS < 1 24 cartilage link protein intron D signal 786 792 AATAAA sequence BASE COUNT 262 a 166 c 162 g 236 t ORIGIN 1 atggctccct ccgtctctcc ccaggtcgtt tttactacct aatacaccca accaagctga 61 cctatgatga agccgtgcag gcctgcctga aggatggcgc tcagattgcc aaggttgggc 121 agatattcgc tgcctggaag ctccttggtt atgaccgctg tgatgccggc tggctggcag 181 acggcagcgt ccgctacccc atctccagac ccagaaagcg ctgcagcccc aacgaggctg 241 ccgtccgctt tgtaggcttt cctgataaaa agcacaagct gtatggtgtc tactgtttca 301 gagcttacaa ctgaaaatac ctagagctgc aacagtcttt aattcattaa gaacatgtga 361 aatatttcga tatgaactcg tgcaagttac caaaactgtg ataaaccttt cttacttact 421 gtagagtcat tttcataaac caaaaccatt aatttgtttt tgtttctgtt taaatatttt 481 tgtaaaagta tcattccata gatatttaaa aataatataa gtttaatgga agctctaggt 541 aagaagagcc aaattcttta agctacgtca tcccaacaaa atataatttt catgaatggg 601 gcatgcaata gagcttgaca attgctagga cacaattatg gaatgtaagg ctactcaaag 661 cagaagcttt taaaagcaca aattttacat gtttgtaccc gtttgagata cacagcaaat 721 tgattgtatc tggagttttg aattaagatg tttttgttta taggggtcag tgaggttttg 781 caaaaaataa aaattaaaaa aaaaaaaaaa aaaaaaaaag gccgcc // LOCUS CHKLNKPA5 217 bp ds-DNA VRT 15-AUG-1990 DEFINITION Chicken cartilage link protein gene, exon 6. ACCESSION M35039 KEYWORDS cartilage link protein. SEGMENT 5 of 5 SOURCE Chicken (domesticus, strain White Leghorn) 9-day embryo DNA, clones lambda gLP10.1 and lambda gLP39.23. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 217) AUTHORS Kiss,I., Deak,F., Mestric,S., Delius,H., Soos,J., Dekany,K., Argraves,W.S., Sparks,K.J. and Goetinck,P. TITLE Structure of the chicken link protein gene: Exons correlate with the protein domains JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by I.Kiss, 04-JUN-1990. FEATURES from to/span description pre-msg < 1 217 cartilege link protein mRNA and intron signal 44 49 poly-A signal signal 98 103 poly-A signal BASE COUNT 102 a 24 c 22 g 69 t ORIGIN 1 tataatattt aatatttctt aagctattta cacatcacaa gaaaataaaa aattggaaaa 61 aaaaatcaaa tgatcaagtc ttagaagaag attattgaat aaaatctgaa accagctatt 121 aaggtttaga agagaagaag tactttattt ccttacatct tatctgtatc taaatataca 181 tctgtttttt aaactatcaa tgaaaaaaaa aaaaaaa // LOCUS CHTCRPA 3012 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.trachomatis 9-kD and 60-kD cysteine-rich and 15 kD serine-rich outer membrane protein genes, complete cds. ACCESSION M35148 M23180 M35161 KEYWORDS cysteine-rich outer membrane protein; serine-rich outer membrane protein. SOURCE C.trachomatis (serovar L1) DNA. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 753 and 1715 to 2577) AUTHORS Lambden,P.R., Everson,J.S., Ward,M.E. and Clarke,I.N. TITLE Sulfur-rich proteins of Chlamydia trachomatis: Developmentally regulated transcription of polycistronic mRNA from tandem promoters JOURNAL Gene 87, 105-112 (1990) STANDARD simple staff_review REFERENCE 2 (bases 483 to 3012) AUTHORS Clarke,I.N., Ward,M.E. and Lambden,P.R. TITLE Molecular cloning and sequence analysis of a developmentally regulated cysteine-rich outer membrane protein from Chlamydia trachomatis JOURNAL Gene 71, 307-314 (1988) STANDARD simple staff_review FEATURES from to/span description pept 185 451 9-kDa cysteine-rich outer membrane protein pept 703 2259 60-kDa cysteine-rich outer membrane protein precursor sigp 703 735 60-kD serine-rich outer membrane protein signal peptide matp 736 2256 60-kDa cysteine-rich outer membrane protein pept 2437 2889 15-kDa serine-rich outer membrane protein mRNA 93 2296 CrP operon mRNA (alt.) mRNA 159 2296 CrP operon mRNA (alt.) mRNA 160 2296 CrP operon mRNA (minor alt.) mRNA 2406 2965 SrP mRNA BASE COUNT 898 a 537 c 678 g 899 t ORIGIN 1 tttgtttgct ttgatttgct aattacctgt tattagacga tttgttttaa aaaacaattg 61 atataatttt tattttataa tgtaatattg tctatgaggg ctagtttctt ttattattaa 121 aagaattgct tttatcgata aaagaaactt caagagccct tttctagaaa ggagtctgga 181 agttatgaaa aaaactgctt tactcgctgc tttatgtagt gttgtttctt taagtagttg 241 ttgtcgtatc gttgactgtt gcttcgaaga tccatgcgca cctatccaat gttcaccttg 301 tgaatctaag aagaaagacg tagacggtgg ttgcaactct tgtaacgggt atgtcccagc 361 ttgcaaacct tgcggagggg atacgcacca agatgctgaa catggccctc aagctagaga 421 aattccagtt gacggcaaat gcagacaata ggtagcgcaa gttaagagcc tacccacaac 481 agatgtagtt agtaaggaag ttggcttcct tactaactat ttcggctaac aagaaaatgt 541 tgagggtaaa agttagttaa taacaatttc tacccgatgg cagacaaaaa ataatctatg 601 cgaataggag atcctatgaa caaactcatc agacgagcag tgacgatctt cgcggtgact 661 agtgtggcga gtttatttgc tagcggggtg ttagagacct ctatggcaga gtttatctct 721 acaaacgtta ttagcttagc tgacaccaaa gcgaaagaca acacttctca taaaagcaaa 781 aaagcaagaa aaaaccacag caaagagact cccgtaaacc gtaaaaaggt tgctccggtt 841 catgagtcta aagctacagg acctaaacag gattcttgct ttggcagaat gtatacagtc 901 aaagttaatg atgatcgtaa tgttgaaatc acacaagctg ttcctaaata tgctacggta 961 ggatctccct atcctgttga aattactgct acaggtaaaa gggattgtgt tgatgttatc 1021 attactcagc aattaccatg tgaagcagag ttcgtacgca gtgatccagc gacaactcct 1081 actgctgatg gtaagctagt ttggaaaatt gaccgcttag gacaaggcga aaagagtaaa 1141 attactgtat gggtaaaacc tcttaaagaa ggttgctgct ttacagctgc aacagtatgc 1201 gcttgtccag agatccgttc ggttacaaaa tgtggacaac ctgctatctg tgttaaacaa 1261 gaaggcccag agaatgcttg tttgcgttgc ccagtagttt acaaaattaa tgtagtgaac 1321 caaggaacag caacagctcg taacgttgtt gttgaaaatc ctgttccgga tagttacgct 1381 cattcttctg gacagcgtgt actaacgttt actcttggag atatgcaacc tggagagcac 1441 agaacaatta ctgtagagtt ttgtccgctt aaacgtggtc gtgctaccaa tatagcaatg 1501 gtttcttact gtggaggaca taaaaataca gcaagcgtaa caactgtgat caacgagcct 1561 tgcgtacaag taagtattgc aggagcagat tggtcttatg tttgtaagcc tgtagaatat 1621 gtgatctccg tttccaatcc tggagatctt gtgttgcgag atgtcgtcgt taaagacact 1681 ctttctcccg gagtcacagt tcttgaagct gcaggagctc aaatttcttg taataaagta 1741 gtttggactg tgaaagaact gaatcctgga gagtctctac agtataaagt tctagtaaga 1801 gcacaaactc ctggacaatt cacaaataat gttgttgtga agagctgctc tgactgtggt 1861 acttgtactt cttgcgcaga agcgacaact tactggaaag gagttgctgc tactcatatg 1921 tgcgtagtag atacttgtga ccctgtttgt gtaggagaaa atactgttta ccgtatttgt 1981 gtcaccaaca gaggttctgc agaagataca aatgtttctt taatgcttaa attctctaaa 2041 gaactgcaac ctgtatcctt ctctggacca actaaaggaa cgattacagg caatacagta 2101 gtattcgatt cgttacctag attaggttct aaagaaactg tagagttttc tgtaacattg 2161 aaagcagtat cagctggaga tgctcgtggg gaagcgattc tttcttccga tacattgact 2221 gttccagttt ctgatacaga gaatacacac atctattaat ctttgatttt atcgatgtgt 2281 aggtgccgtc cagggattcc tgggcggctt tttttgttat ctatatgaaa ataaaagagt 2341 tcattttcgt tctcagagca tattctagat gggtttttga aaaaaataag tgtttgtgta 2401 gactccctgc tcacaaccaa aaaaggaatg taaaatatga gcactgtacc cgttgttcaa 2461 ggagctggat cttccaattc ggcacaggat atttccacta gttctgtacc attaacactg 2521 caagggcgta tatcgaatct tctatcttcc actgcattta aggtgggatt agtggtgatg 2581 ggactacttt tagtgatggc tacgatattc ctagtttcgg cagcttcgtt tgtaaatccc 2641 atctatctag ctattcctgc tattgtggga tgcgtgaata tctgcgtagg aattttatcc 2701 atggaaggat actgttctcc ggagagatgg agcttatgta agaaggtatt aaaggcttca 2761 gaagatatca tcgatgatgg gcagataaac aactctaata aagtgtttac tgatgagagg 2821 ttgaatgcca taggtggggt agtggaatct ctatctagaa gaaatagtct ggtggatcag 2881 acccaatgat aagagattgc tctataggca aaagatgata gcggcagttt ttatggatga 2941 tctgctgaca gatgatgtat ggaaagggag gaggaaagag tcctcctccc agattttatt 3001 gagctggagt tt // LOCUS DDIGP80A 1545 bp ss-mRNA INV 15-AUG-1990 DEFINITION D.discoideum membrane-associated glycoprotein (gp80) mRNA, complete cds. ACCESSION M36545 KEYWORDS gp80 gene; membrane-associated glycoprotein. SOURCE D.discoideum, cDNA to mRNA. ORGANISM Dictyostelium discoideum Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Eumycetozoa; Dictyostelia; Dictyosteliida; Dictyosteliidae. REFERENCE 1 (bases 1 to 1545) AUTHORS Siu,C.-H., Wong,L.M., Lam,T.Y., Kamboj,R.K., Choi,A. and Cho,A. TITLE Molecular mechanisms of cell-cell interaction in Dictyostelium discoideum JOURNAL Biochem. Cell Biol. 66, 1089-1099 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1 1545 membrane-associated glycoprotein (gp80) precursor sigp 1 48 membrane-associated glycoprotein (gp80) signal peptide matp 49 1542 membrane-associated glycoprotein (gp80) BASE COUNT 502 a 332 c 209 g 502 t ORIGIN 1 atgaaatttt tattagtatt gataatatta tataatattt taaatagtgc acattcagct 61 ccaacaataa cagctgtttc aaatggaaaa tttggtgttc caacatatat taccattaca 121 ggtactggat ttacaggaac tccagttgta actattggtg gccagacctg tgatccagtt 181 attgtagcca ataccgcatc gttacaatgc caattttctg ctcaattagc tccaggaaat 241 tcaaattttg atgttattgt aaaggttggt ggtgtaccat ctacaggtgg taatggtctt 301 tttaaatata cacctccaac tctttcaaca atatttccaa ataatggaag aattggtatg 361 attttagttg atggaccatc caatatatct ggatacaaat taaatgtgaa cgactctatt 421 aactctgcta tgttatctgt tactgctgat tcagtatccc caacaattta tttcctcgtg 481 ccaaatacaa tcgctggtgg tctacttaat cttgaactca ttcaaccatt tggcttttca 541 acaattgtaa cttccaaatc agtgttttct ccaaccatta catcaatcac cccattagct 601 tttgatctca caccaaccaa tgtaaccgtc actggtaaat actttgttac tacagctagt 661 gttacaatgg gaagtcatat ctatacagga ttgactgttc aagatgatgg aacaaattgt 721 catgttattt ttactactcg ttcagtttat gaatcatcaa atactataac tgctaaagct 781 tcaacaggtg tcgatatgat ttatttagac aatcaaggta atcaacaacc aataactttt 841 acatataacc caccaaccat tacttcaaca aaacaagtca atgactctgt tgagatctca 901 acaaccaata ctggtactga tttcactcaa atttctttaa ccatgggaac ctcaagccca 961 acaaaccttg taatcactgg tacaaatgaa aagattgtta taactcttcc acatgctctt 1021 ccagaaggtg aaattcaatt caatttgaaa gctggtatct caaatgttgt cacatcaact 1081 ttattagtta ctccggttat aaatagtgtc actcaagcac ctcacaatgg tggaagtatt 1141 acaatttcag gtatcttttt aaacaatgcc catgtttcga ttgttgttga ccaaaatact 1201 actgatatag tttgtgctcc agattcaaat ggtgaatcaa tcatttgtcc agttgaagct 1261 ggtagtggta ctattaattt agtcgttaca aactataaaa actttgcttc agatccaact 1321 attaaaactg aagccacaac ctctacaacc tatacaattc cagacactcc aactccaact 1381 gatacagcca ccccatctcc aactccaact gaaacagcca ccccatctcc aactccaaaa 1441 ccaaccagca caccagaaga aactgaagca ccttcatcag caacaactct tatttcacca 1501 ttatctttaa ttgttatttt catttctttt gttttattaa tttaa // LOCUS ECOMANXF 1474 bp ds-DNA BCT 15-AUG-1990 DEFINITION E.coli enzyme III-Man function protein (manX (ptsL)) gene, complete cds, and manY (pel) gene, 5' end. ACCESSION M36404 KEYWORDS enzyme III-Man function protein; manX gene; manY gene; pel gene; ptsL gene. SOURCE E.coli (strain K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1474) AUTHORS Saris,P.E.J., Liljestroem,P. and Palva,E.T. TITLE Nucleotide sequence of manX (ptsL) encoding the enzyme III-Man (II-a-Man) function in the phosphotransferase system of Escherichia coli K-12 JOURNAL FEMS Microbiol. Lett. 49, 69-73 (1988) STANDARD simple staff_review FEATURES from to/span description pept 258 1205 enzyme III-Man function protein (manX (ptsL)) pept 1268 > 1474 manY (pel) gene product mRNA 120 > 1474 manXYZ operon mRNA (5' end put.) BASE COUNT 411 a 319 c 376 g 368 t ORIGIN 1 cctttgcaaa cgaatgtgac aaggatattt tacctttcga aatttctgct aatcgaaagt 61 taaattacgg atcttcatca cataaaataa ttttttcgat atctaaaata aatcgcgaaa 121 cgcaggggtt tttggttgta gcccttatct gaatcgattc gattgtggac gacgattcaa 181 aaatacatct ggcacgttga ggtgttaacg ataataaagg aggtagcaag tgaccattgc 241 tattgttata ggcacacatg ggttggggct gcagagcagg ttgcttaaaa cggcagaaag 301 tgctgttagg cgagcaggaa aacgtcggct ggatcaattt cgttccaggt gaaaatgccg 361 aaacgctgat tgaaaagtac aacgctcagt tggcaaaact cgacaccact aaaggcgtgc 421 tgtttctcgt tgatacatgg ggaggcagcc cgttcaatgc tgccagccgc attgtcgtcg 481 acaaagagca ttatgaagtc attgcaggcg ttaacattcc aatgctcgtg gaaaggttaa 541 tggcccgtga tgatgaccca agctttgatg aactggtggc actggcagta gaaacaggcc 601 gtgaaggcgt gaaagcactg aaagccaaac cggttgaaaa agccgcgcca gcacccggtg 661 ccgcagcacc aaaagcggct ccaactccgg caaaaccaat ggggccaaac gactacatgg 721 ttattggcct tgcgcgtatc gacgaccgtc tgattcacgg tcaggtcgcc acccgctgga 781 ccaaagaaac caatgtctcc cgtattattg ttgttagtga tgaagtggct gcggataccg 841 ttcgtaagac actgctcacc caggttgcac ctccgggcgt aacagcacac gtagttgatg 901 ttgccaaaat gattcgcgtc tacaacaacc cgaaatatgc tggcgaacgc gtaatgctgt 961 tatttaccaa cccaacagat gtagagcgtc tcgttgaagg cggcgtgaaa atcacctctg 1021 ttaacgtcgg tggtatggca ttccgtcagg gtaaaaccca ggtgaataac gcggtttcgg 1081 ttgatgaaaa agatatcgag gcgttcaaga aactgaatgc gcgcggtatt gagctggaag 1141 tccgtaaggt ttccaccgat ccgaaactga aaatgatgga tctgatcagc aaaatcgata 1201 agtaacgtat tgtgttgatt atcactcagt tttcacactt aagtcttacg taaacaggag 1261 aagtacaatg gagattacca ctcttcaaat tgtgctggta tttatcgtag cctgtatcgc 1321 aggtatggga tcaatcctcg atgaatttca gtttcaccgt cctctaatcg cgtgtaccct 1381 ggtgggctat cgttcttggg gatatgaaaa ccggtattat tatcggtggt acgctggaaa 1441 tgatcgcgct gggctggatg aacatcggtg ctgc // LOCUS FSCCKPA 1428 bp ss-mRNA VRT 15-AUG-1990 DEFINITION T.californica creatine kinase mRNA, complete cds. ACCESSION M36427 KEYWORDS creatine kinase. SOURCE T.californica electric organ, cDNA to mRNA, clone CK52g8. ORGANISM Torpedo californica Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Chondrichthyes; Elasmobranchii; Euselachii; Neoselachii; Squalomorphii; Torpediniformes; Torpedinoidea; Torpedinidae. REFERENCE 1 (bases 1 to 1428) AUTHORS West,B.L., Babbitt,P.C., Mendez,B. and Baxter,J.D. TITLE Creatine kinase protein sequence encoded by a cDNA made from Torpedo californica electric organ mRNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 7007-7011 (1984) STANDARD simple staff_review FEATURES from to/span description pept 90 1235 creatine kinase (E.C. 2.7.3.2) BASE COUNT 348 a 398 c 394 g 288 t ORIGIN 1 ggtcacccac accagcggta gttccagcac caagcaggac aaggtccaga gtggttcacc 61 gtgcgccagg agtcagccaa cctccaacca tgcctttcgg aaacactcac aataaatgga 121 agctgaacta ttcggcggcg gaagaattcc ccgacctcag caagcacaac aaccacatgg 181 ccaaggcttt aaccctggac atctacaaga aacttcggga caaggagact ccaagtggct 241 tcaccctcga tgatatcatc cagacaggag tggacaaccc aggtcacccc ttcatcatga 301 ccgtgggctg cgtggctggc gatgaggaat gctacgaggt tttcaaggac ctgttcgatc 361 ccgtcattga ggaccgccac ggtggctaca aaccaactga caagcacaag actgacctga 421 accaggagaa cctgaagggc ggcgatgacc tcgacccgaa ttacgtcctg agcagccggg 481 tgcgcactgg ccgcagcatc aagggcatcg ccctgcctcc tcactgcagc cgcggggagc 541 gccgtctggt tgagaagctc tgcatagacg gtctcgccac cttgacgggc gagttccagg 601 gcaagtacta ccccctctcc tccatgtctg atgcagagca gcagcagctg atcgatgacc 661 acttcctgtt tgacaaaccc atctctcctc tgcttctcgc ctctggcatg gctcgggact 721 ggcccgatgg ccggggcatt tggcataaca acgacaagac cttcctggtc tgggtcaacg 781 aggaggacca cctccgagtc atctcgatgc agaaaggtgg caacatgaag gaggtcttca 841 ggcgcttctg cgttggtctg aagaagatcg aggacatttt cgtgaaggct ggccgtggct 901 tcatgtggaa cgagcacctg ggctacgtcc tgacctgccc gtccaacctg ggcactggcc 961 tccgtggtgg tgtccacgtg aaaatccctc acctctgcaa gcacgagaag ttcagcgagg 1021 tcctcaagag aacgaggctg cagaaacgtg ggacaggtgg agtggatacc gcagcggttg 1081 gcagcatcta tgacatctcc aacgccgacc gtctgggctt ctccgaggtg gaacaggtcc 1141 agatggtggt ggacggtgtg aagctgatgg tcgagatgga gaagaggctg gaaaatggga 1201 aaagcatcga tgacctgatg ccggctcaga agtagacctt gggttggctg ggtgcctgcc 1261 actctgagat gccttgaaat atcacaggtc gcgaactttg aactttccca ctccaatctt 1321 tcttggccac agatctcgtg tctcaaatga ggaagcagaa ggtttggttt catcacattc 1381 agatttgcta gacacaattt taaccttgat gacacattaa taaaatat // LOCUS HUMLBPP2A 1541 bp ss-mRNA PRI 15-AUG-1990 DEFINITION Human phosphatase 2A-beta catalytic subunit mRNA, complete cds. ACCESSION M36511 KEYWORDS phosphatase 2A-beta catalytic subunit. SOURCE Human lung fibroblast, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1541) AUTHORS Hemmings,B.A., Wernet,W., Mayer,R., Maurer,F., Hofsteenge,J. and Stone,S.R. TITLE The nucleotide sequence of the cDNA encoding the human lung protein phosphatase 2A-beta catalytic subunit JOURNAL Nucleic Acids Res. 16, 11366-11366 (1988) STANDARD simple staff_review FEATURES from to/span description pept 22 951 phosphatase 2A-beta catalytic subunit BASE COUNT 436 a 296 c 327 g 482 t ORIGIN 1 ccgagcccca gcccggccgc catggacgac aaggcgttca ccaaggagct ggaccagtgg 61 gtcgagcagc tgaacgagtg taagcagctg aacgagaacc aagtgcggac gctgtgcgag 121 aaggcaaagg aaattttaac aaaagaatca aatgtgcaag aggttcgttg ccctgttact 181 gtctgtggag atgtgcatgg tcaatttcat gatcttatgg aactctttag aattggtgga 241 aaatcaccgg atacaaacta cttattcatg ggtgactatg tagacagagg atattattca 301 gtggagactg tgactcttct tgtagcatta aaggtgcgtt atccagaacg cattacaata 361 ttgagaggaa atcacgaaag ccgacaaatt acccaagtat atggctttta tgatgaatgt 421 ctgcgaaagt atgggaatgc caacgtttgg aaatatttta cagatctctt tgattatctt 481 ccacttacag ctttagtaga tggacagata ttctgcctcc atggtggcct ctctccatcc 541 atagacacac tggatcatat aagagccctg gatcgtttac aggaagttcc acatgagggc 601 ccaatgtgtg atctgttatg gtcagatcca gatgatcgtg gtggatgggg tatttcacca 661 cgtggtgctg gctacacatt tggacaagac atttctgaaa cctttaacca tgccaatggt 721 ctcacactgg tttctcgtgc ccaccagctt gtaatggagg gatacaattg gtgtcatgat 781 cggaatgtgg ttaccatttt cagtgcaccc aattactgtt atcgttgtgg gaaccaggct 841 gctatcatgg aattagatga cactttaaaa tattccttcc ttcaatttga cccggcgcct 901 cgtcgtggtg agcctcatgt tacacggcgc accccagact acttcctata aatttctcct 961 gggaaacctg cctttgtatg tggaagtata cctggctttt taaaatatat gtatttaaaa 1021 acaaaaagca acagtaatct atgtgtttct gtaacaaatt gggatctgtc ttggcattaa 1081 accacatcat ggaccaaatg tgccatacta atgatgagca tttagcacaa tttgagactg 1141 aaatttagta cactatgttc tagataggtc agtctaacag tttgcctgct gtatttatag 1201 taaccatttt cctttggact gttcaagcaa aaaaggtaac taactgcttc atctcctttt 1261 gcgcttattt ggaaatttta gttatagtgt ttaactggca tggattaata gagttggagt 1321 tttattttta agaaaaattc acaagctaac ttccactaat ccattatcct ttattttatt 1381 gaaatgtata attaacttaa ctgaagaaaa ggttcttctt gggagtatgt tgtcataaca 1441 tttaaagaga tttcccttca tttaaactaa attactgttt tatgttgatc tgcatatttc 1501 tgtatatttg tcatgacagt gcttgcatcc tatttggtgt g // LOCUS HUMPDEGA 978 bp ss-mRNA PRI 15-AUG-1990 DEFINITION Human cGMP phosphodiesterase gamma-subunit (PDEG) mRNA, complete cds. ACCESSION M36476 KEYWORDS cGMP phosphodiesterase gamma-subunit. SOURCE Human retina, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 978) AUTHORS Tuteja,N., Danciger,M., Klisak,I., Tuteja,R., Inana,G., Mohandas,T., Sparkes,R.S. and Farber,D.B. TITLE Isolation and characterization of cDNA encoding the gamma-subunit of cGMP phosphodiesterase in human retina JOURNAL Gene 88, 227-232 (1990) STANDARD simple staff_review FEATURES from to/span description pept 102 365 cGMP phosphodiesterase gamma-subunit (PDEG) mRNA < 1 978 PDEG mRNA BASE COUNT 213 a 341 c 257 g 167 t ORIGIN 1 ccgcactcac agcacagccc cctgagaccc gccctgcact tgaccgcagc aggagggagt 61 ccaggagcca aggttgccgc ggtgtctccg tcagcctcac catgaacctg gaaccgccca 121 aggctgagtt ccggtcagcc accagggtgg ccgggggacc tgtcaccccc aggaaagggc 181 cccctaaatt taagcagcga cagaccaggc agttcaagag caagccccca aagaaaggcg 241 ttcaagggtt tggggacgac atccctggaa tggaaggcct gggaacagac atcacagtca 301 tctgcccttg ggaggccttc aaccacctgg agctgcacga gctggcccaa tatggcatca 361 tctagcacga ggcccctgct gaagtccaga ccctccccct cctgcccact atgctaaacc 421 ctgctcagga ttcctgttga ggagatgacc tccctagccc cagatggcac ctggacacca 481 ggatgggact gcaacctcag gtctccccct acatattaat accagtcacc aggagcccac 541 cacctccctc taggatgccc cctcagggtg gccaggccct gctcaacatc tggagacaca 601 ggcccacccc tcagtcctgc ccacagagag gcttggtcgg tctccactcc cagggagaac 661 gggaagtgga ccccagcccg ggagcctgct ggaccccaga tcgtcccctc ctcccagctg 721 gaaagctagg gcaggtctcc ccagagtgct tctgcacccc agccccctgt cctgcctgta 781 aggggataca gagaagctcc ccgtctctgc atcccttccc aggggggtgc ccttagtttg 841 gacatgctgg gtagcaggac tccagggcgt gcacggtgag cagatgaggc cccaagctca 901 tcacaccagg gggccatcct tctcaataca gcccgccctt gcagtcccta tttcaaaata 961 aaattagtgt gtccttgc // LOCUS HUMSON3A 1449 bp ds-DNA PRI 15-AUG-1990 DEFINITION Human son3 protein gene, partial cds. ACCESSION M36428 KEYWORDS son3 protein. SOURCE Human placenta DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1449) AUTHORS Berdichevskii,F.B., Chumakov,I.M. and Kiselev,L.L. TITLE Determination of the nucleotide sequence of the son3 fragment of the human genome: Identification of a new protein with an unusual structure and homology with DNA-binding proteins JOURNAL Mol. Biol. 22, 639-646 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 1449 son3 protein (AA at 1) BASE COUNT 487 a 348 c 329 g 285 t ORIGIN 1 cgggctctgc tcagccctaa agaaagtagt ggaggagaaa aagaagtacc tccccctcct 61 aaagagacac tgcctgattc aggattttct gccaatattg aggatattaa tgaagcagat 121 ttagtgagac cgttacttcc taaggacatg gaacgtctta caagccttag agctggcatt 181 gaaggacctt tacttgcaag tgatgttgga cgtgacagat ctgctgccag cccggttgta 241 agtagtatgc cagaaagagc ttcagagtct tcttcagagg aaaaagatga ttatgaaatt 301 tttgtaaaag ttaaggacac tcacgaaaaa agcaagaaaa ataagaaccg tgataagggg 361 gagaaagaga agaaaagaga tcctcattta agatctcgaa gtaagcgttc caaatcttct 421 gaacacaaat cacgcaagcg taccagtgaa tctcgttcta gggcaagaaa gagatcatct 481 aagtccaagt ctcatcgctc tcagacacgt tcacggtcac gttcaagacg caggaggaga 541 agcagcagat caagatcaaa gtctagagga agaagatctg tatcaaaaga gaagcgcaaa 601 agatctccaa agcacagatc caagtctagg gaaagaaaaa gaaaaagatc aagctccagg 661 gataaccgaa agacagttag agctcgaagt cgaaccccaa gtcgtcggag tcggagtcat 721 actccaagtc gtcgacgaag gtctagatct gtgggtagaa gaaggagctt tagcatttcc 781 ccaagccgcc gcagccgcac ccccagccgc cgcagccgca cccccagccg ccgcagccgc 841 acccccagcc gccgcagccg cacccccagc cgccggagcc gcacccctag ccgtcggagc 901 cgcaccccaa gccgccggag aagatcaagg tctgtggtaa gaagacgaag cttcagtatc 961 tcaccagtca gattaaggcg atcaagaaca cccttaagaa gaaggtttag cagatctccc 1021 atccgtcgta aaagatccag gtcttctgaa cgaggcagat cacccaaacg tctgacagat 1081 ttggataagg ctcaattact tgaaatagcc aaagctaatg cagctgccat gtgtgctaag 1141 gctggtgtcc ctttaccacc aaacctaaag cctgcacctc cacctactat agaagagaaa 1201 gttgctaaaa agtcaggagg agctactata gaagaactaa ctgagaaatg taaacagatc 1261 gcacagagta aagaagatga tgatgtaata gtgaataaac ctcatgtttc ggatgaagag 1321 gaagaagaac ctccttttta tcatcatccc tttaaactca gtgaacccaa acctattttt 1381 ttcaatctga atattgctgc agcaaaacca actccaccaa aaagccaggt aacattaaca 1441 aaagaattc // LOCUS MYXGFA 2269 bp ds-DNA VRL 15-AUG-1990 DEFINITION Myxoma virus growth factor and M-T9 genes, complete cds. ACCESSION M15806 M35234 KEYWORDS M-T9 gene product; growth factor. SOURCE Myxoma virus (strain Lausanne) DNA, clone pMYH-1. ORGANISM Myxoma virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Leporipoxvirus. REFERENCE 1 (bases 1 to 1421) AUTHORS Upton,C., Macen,J.L. and McFadden,G. TITLE Mapping and sequencing of a gene form myxoma virus that is related to those encoding epidermal growth factor and transforming growth factor alpha JOURNAL J. Virol. 61, 1271-1275 (1987) STANDARD full staff_review REFERENCE 2 (bases 584 to 2269) AUTHORS Upton,C., Macen,J.L., Wishart,D.S. and McFadden,G. TITLE Myxoma virus and malignant rabbit fibroma virus encode a serpi protein important for virus virulence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Computer-readable sequence for [1] kindly provided by C.Upton, 09-MAY-1987. Draft entry and computer-readable sequence for [2] kindly submitted by C.Upton, 14-JUN-1990. Author address: C.Upton University of Alberta Dept of Biochemistry 471 Med Sci Bldg Edmonton Alberta, CANADA T6G 2H7 email: USERCU11@ualtamts FEATURES from to/span description pept 204 461 growth factor pept 717 2246 M9-R gene product BASE COUNT 685 a 441 c 540 g 603 t ORIGIN 1239 bp upstram of DdeI site; about 13 kb from 3' viral end. 1 ttaaacaaga tacaacatac ggacgcggct atgttctcgg aagtcataga cggtattgtc 61 gcggaagaac agcaggtgat tggatttatt cagaaaaaat gtaaatataa cacgacatac 121 tacaatgtac gtagcggcgg gtgtaaaata tccgtctatc taaccgcggc agttgttggc 181 tttgtcgcat acggaatact aaaatggtac cgagggacct agtcgcaact ctcttatgtg 241 cgatgtgtat tgtacaggca acgatgcctt cgttggataa ttatctgtat attattaaac 301 gtattaaact atgtaacgac gactataaaa actattgtct aaataacgga acctgtttca 361 ccgtagcatt aaacaatgtt tcacttaacc cgttttgtgc gtgtcatatt aactacgtgg 421 gaagccgatg tcagtttatt aatctaatta ccattaagta acccgtttta catgtataat 481 aatacatacg tatttttaga taactttaat aaataacatt gtataaactt acttatcata 541 tacggtacac ataacgaata acactacatg tttttatata tacataggtt tggaaaaaac 601 ttaatcacga acgtatcatt agacaatgac tccatctagg aggggttttg ggaactacgt 661 acacgatata ttcacatcgc gaaaacataa ataataattt tttacaacga ttcacgatgt 721 cgcgcacttt attgagattt ctggaagatg gtgcaatgag cgacgtaaca gtcgtcgccg 781 gggactcgac gtttctcggg cataaagtta ttttatctct tcactcggat tacttctatc 841 gtctgtttaa tggagacttt acctcgcccg atacggttac gctggacgcg acggacgatg 901 ccgttcgtac ggtgtttacg tatatgtacg cgggatgtga cgggttaaac gatcgtacga 961 tagacgattt acaatccatt atcgtattgg cggactacct gggtataacg aaactggtgg 1021 acgaatgcgt acgtcgtatc gtatctaaag tggacgtatt aaactgcgta ggggtatata 1081 cgtttgcgga gacgtatcat ataacggact tgcagcgggc ggccaaaacg tttttaacag 1141 aactactggg gtctaaagaa gcgttcgaag aactatccca agacgatgcg gttatcgcgt 1201 taagggaaac gcgtaacatt gtcgatagac gatccattct tagagcgatc ctgttatggg 1261 ttcgaaaatg tccagatcgt atcgaacaac taaaggtgtt agtcgccgcc gtagacgacg 1321 tagacgacga tgacaacgta tatacgatct acgagagata cgctgaagaa ctaaaggata 1381 tgatcgcgtg tccattatcc tataattgcg tcgttgtggt cgacagagat agatacgttc 1441 gcctcattaa cccagacacc ctatggagta aacgcgtgac gtacatacgt aaacgcgcca 1501 taggcgatcg attcaccgtc gtttgtatga acaacgttct atactgttta gggggtacgt 1561 tagacggggc acccacgtgt gacgtgttgg cctacgatct actgacgaac gaatacagtt 1621 taatgccgga gatgggacac tatagacgta atgcgtcggc gtgtatcgta aatggatata 1681 tatacgtcgt aggaggcgta gacgaagaaa acagattaat cggttccgta gagtactggc 1741 aacccggaat ggaggaatgg cacgacgctc cttatctaca ggcgaacgta gaaacggcta 1801 cggtgtgtta caggaacgag ttgtggatcg taggaggcac cgtggactta tatcatccca 1861 cgtttataag cgcagttaag aaattaacag acaatcgatg gatgtcgatg gaacctcttc 1921 ccgaaccacg atcgggtgct acgaccgtcg tgtataataa tcgattatac tgcataggcg 1981 gaaggataca cggtggcgcg tacacaaatc acgtctacaa ctatttagac gagtcacgta 2041 cgtgggaacg ggtaggggat atggcgaacg tacgcagaaa tcccagttgt tgtgtgtaca 2101 ataaggcgat ttacgtattg ggagggaata caaacgccgt agagaaatac aacgggtgga 2161 agtggcaaga ggtaggtaat atatccacgt atcccgcgtg taataatacc gcgtatccat 2221 ttttttatac caacgacgag atataaaacg agtatgatat acaagtcgt // LOCUS MYXMAP1A 2204 bp ds-DNA VRL 15-AUG-1990 DEFINITION Myxoma virus MAP1 gene, complete cds, and M-T8 gene, 5' end. ACCESSION M35233 KEYWORDS M-T8 gene product; MAP1 gene product serpi protein. SOURCE Myxoma virus (strain Lausanne) DNA, clone pBU-3. ORGANISM Myxoma virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Leporipoxvirus. REFERENCE 1 (bases 1 to 2204) AUTHORS Upton,C., Macen,J.L., Wishart,D.S. and McFadden,G. TITLE Myxoma virus and malignant rabbit fibroma virus encode a serpi protein important for virus virulence JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.Upton, 14-JUN-1990. Author address: C.Upton University of Alberta Dept of Biochemistry 471 Med Sci Bldg Edmonton Alberta, CANADA T6G 2H7 email: USERCU11@ualtamts FEATURES from to/span description pept 363 1472 MAP1 gene product pept 1450 > 2204 M-T8 gene product BASE COUNT 592 a 520 c 569 g 523 t ORIGIN 1 ggatccgtaa caacacgtgt gtcgtagcgt atacataatg ccgtaaatga cagtcataaa 61 accatcgagt cgtcccaggc cgaggaaaaa caaaaatata aaagtaaata catacagaac 121 gagcgccatg gatctctctc cgggaagtgt ccacgagggt atcgtatatt ttaaagacgg 181 aatattcaaa gtccgcctac tcggatacga gggacacgag tgtattcttt tggactatct 241 gaactacagg caagacacgt tggatcggtt gaaggaacga ctcgtgggac gcgtgattaa 301 aacgcgagtc gttcgcgcgg acggtttata cgtggacctg cgacgttttt tttgagggtt 361 aaatgaagta tctggtcctc gtcttatgtt taacgtcgtg cgcgtgtcga gatatcggac 421 tatggacgtt ccgatacgtc tacaacgaaa gcgacaacgt cgtgttctca ccgtacggct 481 tgacctccgc gttgtccgtg ttacggatcg cggcgggcgg taacacgaaa cgagaaatag 541 acgtccccga atccgtcgtg gaggactccg acgcctttct cgcgttacgg gagttgttcg 601 tagacgcatc cgttccgtta cgtcccgagt ttacggcgga gttctcctcg cgattcaata 661 cctccgtgca acgcgtgacg tttaactcgg agaacgtcaa agacgtcatt aactcgtacg 721 ttaaggataa gacgggagga gacgtcccac gcgtattgga cgcctcccta gaccgagata 781 ctaaaatgct gctattgagc tccgttcgta tgaagacgag ctggagacac gtattcgacc 841 cttcgttcac gacggatcaa cctttttatt ccggaaacgt cacatacaag gtacgtatga 901 tgaataaaat agatacgttg aaaacggaga cgtttacgct tagaaacgtg ggatactccg 961 taacggaact gccgtataaa cggcgtcaaa cggccatgtt gctcgtcgtt ccggacgact 1021 tgggagagat cgtgcgggcc ctcgatcttt ctctagtacg cttctggata cgcaacatga 1081 ggaaagacgt gtgtcaggtg gtaatgccca agttctccgt cgaatcggtc ctggatctga 1141 gggacgccct ccagagactg ggggtgcgag acgcgttcga tccatcccgg gcggacttcg 1201 gtcaggcgtc cccgtcgaac gatctatacg tcacgaaggt gttacagacg tccaagatag 1261 aggcggacga acggggaacg acggcgtcga gcgacacagc catcaccctc atccccagga 1321 acgccctcac ggcgatcgtg gcgaacaaac cgtttatgtt tctcatctat cacaagccta 1381 caacgaccgt gttgtttatg ggaacgataa caaagggtga aaaagtaata tacgatacgg 1441 agggtcgaga tgatgtcgta tcctctgtat aaactctttt tgaagggtaa actatgcgac 1501 gtcgaaatcg tcgcggaagg caaaagcatc cgagcgcatc ggttggtgct ttccgcgtat 1561 tctaaatact tttacaactt gtttaatggg aatttcttag aaaaaaacgt agacgtaatc 1621 gacttagaag cggattataa aaccgtattt gacgtgattt attacatgta tacagaatcg 1681 atagaattac acaaagggaa taccgaatcc attttctcat tggttcatta cctacagatt 1741 aaacccctga ttaaaaaatg tatctacgag tttaacagca tcgtgaacga agaaaactgt 1801 atacgtctgt ttaagttcgc cgaattatac gacctgtccg agttgaaacg cagggcgcga 1861 tggcttatgc ccagtctcgt tatgaatgag aaagatcgcc tgcgggagat gtccttggac 1921 gacctatccc tgatgttagt ccagatacgg aacacggtcg atcgaagtat cgctttgtcg 1981 gcgatcacgg aatggataca gacaaacgtt cgcgaacgta ggagacacgc cgtccatctg 2041 gcgacgtgtt taggggatgt cccaggaacc gcatcctcca gagccgtata caaacactac 2101 atgtcggaac tacgtattcg ggttacggaa tttcaaccgg cgtatcacaa ctgcgtcgtg 2161 tacctgggag gatcgatgaa aggtcgcgtc accgccctgg atcc // LOCUS MZEMT2BATP 2054 bp ss-mRNA ORG 15-AUG-1990 DEFINITION Maize mitochondrial F-1-ATPase subunit-2 mRNA, complete cds. ACCESSION M36087 KEYWORDS ATPase subunit-2. SOURCE Maize (inbred line A188) embryo kernel mitochondrion, cDNA to mRNA. ORGANISM Mitochondrion Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Zea mexicana. REFERENCE 1 (bases 1 to 2054) AUTHORS Ehrenshaft,M. and Brambl,R. TITLE Respiration and mitochondrial biogenesis in germinating embryos of maize JOURNAL Plant Physiol. 93, 295-304 (1990) STANDARD simple staff_review FEATURES from to/span description pept 6 1667 F-1-ATPase subunit-2 BASE COUNT 421 a 539 c 558 g 536 t ORIGIN 1 cggccatggc gtcccgccgg gtcgtctcct cgctcctccg ctccgcgtcc cgcctgcggg 61 ccgcctcgcc cgctgctcca cgaccgcgcg cgccaccgca ccgcccgtcc ccggccgggt 121 acctcttcaa ccgcgctgcc gcctacgcct cttccgccgc ggcccaggcg gcacctgcca 181 ccccgccgcc ggccaccggg aagaccgggg ggggcaagat caccgacgag ttcaccggcg 241 ctggcgccat cggccaggtg tgccaggtga tcggcgccgt cgttgacgtg cgcttcgatg 301 agggcctccc gcccatcctc acggcgctcg aggtgctcga caacaacatc cgcctcgtgc 361 tcgaggtggc gcagcacctt ggcgagaaca tggtgcgcac catcgctatg gacggcacgg 421 aggggctcgt ccgcggccag cgcgtcctca acactggctc ccccatcacc gtgcctgttg 481 gcagggctac ccttggacgc atcataaatg ttattggtga accgattgat gagaagggtg 541 acataaagac aaaccacttc ctccctattc atcgtgaagc ccctgctttt gttgagcagg 601 ccactgagca gcaaattctt gttactggaa tcaaggtcgt ggatcttctt gcaccctacc 661 aaaggggtgg aaagattggt ctcttcggtg gtgcaggagt gggtaaaact gtgctcatta 721 tggagttgat caacaatgtt gctaaggccc atggtggttt ctctgtgttt gctggtgttg 781 gagaacgtac ccgtgaaggt aatgatctgt acagggaaat gattgaaagt ggtgtcatta 841 agctagatga caagcagagc gaaagcaagt gtgctcttgt ttacgggcag atgaatgagc 901 ccccgggtgc tcgtgctcgt gttgggttga ctggtttgac tgttgctgaa catttccgtg 961 atgctgaagg acaagatgtg cttctgttta ttgacaacat tttccgtttt actcaggcaa 1021 actctgaggt gtctgctctt cttggacgta tcccatctgc tgtgggatac cagccaaccc 1081 ttgccactga tcttggagga ctgcaagagc gtattacgac aacaaagaag ggttctatta 1141 catctgtgca ggccatctac gtgcctgccg atgacttgac ggatcctgct cctgctacta 1201 cctttgccca tcttgatgct acaactgtgt tgtcacgaca gatctctgag cttggtattt 1261 atcctgctgt tgatccactg gattccacat caagaatgct ttctccccac gtgctgggtg 1321 aggatcacta caacactgct cgtggtgtgc agaaggttct tcagaactac aaaaatcttc 1381 aggatattat tgctatcttg ggtatggatg agctcagtga ggatgacaag ctgacagtcg 1441 cccgtgcaag aaagattcag cgtttcctga gccagccttt ccatgtcgct gaagttttca 1501 cgggtgctcc aggaaagtat gtggagctga aggaaagcgt gaagagtttc cagggtgttt 1561 tggatgggaa gtatgatgac ctccctgagc agtcattcta catggttggt ggcattgagg 1621 aagtcattgc taaggctgag aaaattgcca aggagtctgc ttcataagga ggcttcttgc 1681 ttgttcaacc ctgtacaagt tccatttttg gattttaagc gtttatttat gcttttccca 1741 gttaggcatg acgagctgga gagtccatct cctgctgaga gatgtttgtt ttacccttct 1801 ttgcttcctc caccttacac ccaaataagc aactgcagtg ccgttggttt tggctgcacc 1861 caaactacat gactgaagaa acttgtggcc tgtgtaacgc gaatccatca gaacgccaaa 1921 gttatggctt ctggttgtgg caaattatgg ttcctccctg ttcggttgag tggttgcatt 1981 ctggaggtat tgttctggac tcaggctaat gattgtgcgt gcaactgttt cggagtcatt 2041 tcaaagggtt atcc // LOCUS PFAMTSSU 935 bp ds-DNA ORG 15-AUG-1990 DEFINITION P.falciparum mitochondrial small subunit rRNA gene. ACCESSION M23443 KEYWORDS small subunit ribosomal RNA. SOURCE P.falciparum (strain C10) mitochondrial DNA. ORGANISM Mitochondrion Plasmodium falciparum Eukaryota; Animalia; Protozoa; Microspora; Microsporea; Microsporida; Haemosporina; Plasmodiidae; Plasmodium falciparum. REFERENCE 1 (bases 1 to 935) AUTHORS Gardner,M.J., Bates,P.A., Ling,I.T., Moore,D.J., McCready,S., Gunasekera,M.B.R., Wilson,R.J.M. and Williamson,D.H. TITLE Mitochondrial DNA of the human malarial parasite Plasmodium falciparum JOURNAL Mol. Biochem. Parasitol. 31, 11-18 (1988) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 > 935 small subunit ribosomal RNA BASE COUNT 376 a 91 c 132 g 336 t ORIGIN 1 aagcttgata aagtaatatt tcttttagga agacagtatt attaaaatat tgtaaacttt 61 ttattttatt tttaaatatt gataaaaata aaaaatagta tttgctattt tctgtgccag 121 cagcagcggt aatacagaaa tgcaagcgtt attcatttta ttaggcgtaa agcgttttaa 181 ggttttatat taattttatg tttaaatatt taaattaaat ttaaaataaa ttaataaata 241 ataatataat agagtattat aaaagtatta agaatttttt gagaagtagt gaaatacaat 301 gatacaaaaa agaatatcaa aggcggaagc ataatactat ataattactg acacttaaaa 361 acgaaagcta aggtagcaaa taggattaga taccctagta gtcttagctg taaactatga 421 atattttata tttatatttt ataaatataa taactaacgt gataaatatt ccgcctgagt 481 agtatattcg caagaatgaa attcaaagga attgacggga gcttatacaa gtggtggaac 541 atgtggctta attcgatgca acacgataaa ccttaccaaa atttaacaat atttttaata 601 ttaagaaatt aatattttaa taaaatatat aggtagtgca tggctgtcgt cagttcgtgc 661 tgtgaagtgt taattttagt attataacga acgtaacctt ttataaaaaa aatttttata 721 ataaataata ataaagatta cgtcaagtca ttatgctcct tatattttgg gctgctcacg 781 tgttacataa aatattacaa tattttatta tatgttaaat ataataatta aaatatattt 841 atagttcaga ttataaattg aaactcattt atataaagat ggaatcacta gtaatcgcta 901 atcagaatta tagcggtgaa taagttctta agctt // LOCUS PSEALGR3A 120 bp ds-DNA BCT 15-AUG-1990 DEFINITION P.aeruginosa alginate synthesis regulatory protein (algR3) gene, 5' end. ACCESSION M35259 KEYWORDS alginate synthesis regulatory protein. SOURCE P.aeruginosa (strain 8882) DNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 120) AUTHORS Kato,J., Misra,T.K. and Chakrabarty,A.M. TITLE AlgR3, a protein resembling eukaryotic histone H1, regulates alginate synthesis in Pseudomonas aeruginosa JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2887-2891 (1990) STANDARD simple staff_review FEATURES from to/span description pept 70 > 120 alginate synthesis regulatory protein (algR3) mRNA 45 > 120 algR3 mRNA BASE COUNT 24 a 39 c 37 g 20 t ORIGIN 1 cgaacccgtt ggcgagaggg ggtttgcggg tctagtatgg gcgcaaccac gtccgcctgg 61 aggcacgtca tgtcggccaa caagaagccc gtcaccaccc ccttgcacct gttgcagcaa // LOCUS STYOMPH 992 bp ds-DNA BCT 15-AUG-1990 DEFINITION S.typhimurium cationic 16 kD outer membrane protein (ompH) gene, complete cds. ACCESSION J05101 M36486 KEYWORDS ompH gene; outer membrane protein. SOURCE S.typhimurium (strain LT2 subline, isolate SH5014) DNA, clones pUCHS[14,16]. ORGANISM Salmonella typhimurium Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 992) AUTHORS Koski,P., Rhen,M., Kantele,J. and Vaara,M. TITLE Isolation, cloning, and primary structure of a cationic 16 kDa outer membrane protein of Salmonella typhimurium JOURNAL J. Biol. Chem. 264, 18973-18980 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 992) AUTHORS Koski,P., Hirvas,L. and Vaara,M. TITLE Complete sequence of the ompH gene encoding the 16-kDa cationic outer membrane protein of Salmonella typhimurium JOURNAL Gene 88, 117-120 (1989) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.S.Vaara 02-SEP-1989. FEATURES from to/span description pept 311 796 cationic outer membrane protein precursor (gtg start codon) sigp 311 370 cationic outer membrane protein signal peptide matp 371 793 cationic outer membrane protein signal 142 147 -35 region signal 165 170 -10 region signal 854 879 transcription termination signal binding 293 305 ribosome binding site BASE COUNT 281 a 224 c 260 g 227 t ORIGIN 334 bp upstream of PstI site. 1 gatccgtcat ctgcgccgtc agatgtaccg gattacagcg atccaggcaa catccgtatg 61 tccgcgggta tcgcattaca atggatgtcc cattggggcc gttggtcttc tcctacgccc 121 agccgtttaa aaagtacgat ggagacaaag ccgagcagtt ccagtttaac attggtaaaa 181 cctggtaatt gttcactgca aaggaatgca ttggtagtgt agcgatgact tttggcgatg 241 cccccaggga tcgccaggcc acgcaaagag ctgtaccttc gggtgcaaat gggatggtaa 301 ggagtttatt gtgaaaaagt ggttattagc tgcaggtctt ggtttggcga tggtaacgtc 361 cgcacaggct gctgacaaaa ttgcaatcgt caacatgggt aatctgttcc aacaggttgc 421 gcagaagacg ggtgtatcca atacactgga aaacgaattt aaaggccgtg cggctgaact 481 gcaaaaaatg gaaaccgatc tgcaatctaa aatgcagcgt ctgcaatcca tgaaagcagg 541 tagcgatcgt actaagctgg aaaaagacgt gatgtctcag cgccagactt tcgcacaaaa 601 agcgcaggct tttgagaaag atcgcgctcg tcgttccaac gaagaacgca acaaactggt 661 gactcgtatc cagactgcgg tgaaaaaagt ggctaacgac cagagtatcg atctggtggt 721 agacgcaaac accgttgctt acaacagcag cgatgtgaaa gacatcaccg ctgacgtact 781 gaaacaggtt aaataagtaa tgcccttcaa ttcgactggc tgacttagca gaacagttgg 841 atgcagaatt acacggtgat ggcgatatcg tcatcaccgg cgttgcgtcc atgcaatgtg 901 caacaacagg ccacattacg tttatggtga atcctaagta ccgtgaacac ttaggtttat 961 gccaggcttc tgcggttgtc atgacgcagg ac // LOCUS SIVAGM155 9794 bp ds-DNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV), complete genome. ACCESSION M29975 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate 155) proviral DNA, clone 4. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9794) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted prior to publication and in a computer-readable form by Phillip Johnson, Georgetown University, Rockville MD (301- 496-2976). The 155 isolate is from a monkey imported from Kenya. FEATURES from to/span description pept 931 2493 gag polyprotein pept 2199 5342 pol polyprotein pept 5260 5958 vif protein pept 5741 6100 vpx protein pept 6051 6268 tat protein, exon 2 (first expressed exon) 8492 8633 tat protein, exon 3 (AA at 8493) pept 6208 6268 rev protein, exon 2 (first expressed exon) 8492 8700 rev protein, exon 3 (AA at 8494) pept 6275 8581 env polyprotein pept 8724 9416 nef protein LTR 1 726 5' LTR LTR 9070 9794 3' LTR rpt 1 625 R repeat 5' copy rpt 9578 9794 R repeat 3' copy binding 727 744 primer (Lys-tRNA) binding site signal 9675 9680 poly-A signal BASE COUNT 3321 a 1905 c 2450 g 2118 t ORIGIN 1 tggatgggat ttattactcc gataggagaa ataagatcct taatctgtat gccctcaatg 61 aatggggaat cattgatgat tggaacgcat ggtcaaaagg acctgggata agatacccga 121 ggtgctttgg cttctgcttc aagctagtac cggttgccct gcatgaggaa gcagaaacat 181 gtgaaaggca ttgcttggta cacccagcac aactgcatga agaccctgat ggtataaatc 241 atggagaaat attggcatgg aagtttgatc caatgttggc tgttcagtac gacccctcaa 301 gggagtactt tacagactta tattcaacag ttggtacagg aaactagccg accacaggct 361 tgcggtttcc tggttgccta ggagatgaca ttaagaactg ctgacgggac tttccagcac 421 gggactttcc aaggcgggac atgggcggta cggggagtgg ctttaccctc agagctgcat 481 aaaagcagat gctcgctggc ttgtaactca gtctcttact aggagaccag cttgagcctg 541 ggtgttcgct ggttagccta acctggttgg ccaccagggg taaggactcc ttggcttaga 601 aagctaataa acttgcctgc attagagctt atctgagtca agtgccctca ttaatgcctc 661 actcttgaac gggagaagtt ccttactggg ttctctctca aacccaggcg agagaaactc 721 cagcatggcg cccgaacagg gacttgagtg aaggcacgta cagctgagaa gacgtcggac 781 gcgaaggaac cgcggggtgc gacgtgaccg agaagggctc ggtgagtagg cttctcgagt 841 gccgggaaaa agctcgagcc tagttagagg actaggaagg gccgtagccg taactactct 901 gggcaagtag ggcaggcgga cgggtacgta atgggggcgg ctacctcagc actgaatagg 961 agacaattag atgaatttga gcatatacga cttcgcccga acggaaagaa aaagtatcaa 1021 attaaacatt taatatgggc aggcaagaag atggaccgct tcggcctcca tgagaagtta 1081 ttggagacag aggaaggttg taaaaagatc atagaagttc tctctcccct agaaccaaca 1141 gggtcggaag gaatgaaaag tctgtataat ctggtgtgcg tattgctttg cgtccaccaa 1201 gaaaagaaag tgaaagacac agaggaagct ttagcaatag taagacaatg ctgccaccta 1261 gtggacaaag aaaaaactgc agttacgcca cctggtggac agcagaaaaa taacacagga 1321 ggaacagcga cacctggtgg cagccaaaat tttcccgcac aacagcaagg gaatgcatgg 1381 gtgcatgtac cactttcacc tcgcacccta aatgcatggg taaaagcagt agaagagaaa 1441 aaatttgggg cagaaatagt acccatgttc caagccctct cagaaggctg caccccatat 1501 gacatcaatc agatgcttaa tgtcttagga gatcatcagg gggccttgca aatagtgaaa 1561 gaaataatta atgaggaagc agcccagtgg gatgtaaccc acccaccgcc ggcaggcccc 1621 ttgccagcgg gacagctcag ggatccgggg ggatcagata tagcagggac cactagtaca 1681 gtgcaagagc agctagagtg gatctatact gctaacccaa gggtagatgt aggggccatc 1741 tatcgaagat ggatcatcct agggttacaa aaatgtgtaa aaatgtacaa tccagtgtct 1801 gttttagata tcagacaagg gcccaaagaa ccattcaaag attatgtaga cagattctat 1861 aaagcaataa gagcagaaca agcttcagga gaagtcaaac aatggatgac agaatctttg 1921 ctcattcaga atgccaaccc agattgcaaa gtaattttga agggcctagg gatgcacccc 1981 actcttgaag aaatgctgac agcctgtcaa ggggtgggag gcccaagtta caaagccaaa 2041 gtcatggcag aaatgatgca gaacctgcag agtcagaaca tggtacagca gggaggtgga 2101 aggggaagac caagaccccc gccaaagtgt tacaactgtg gaaaatttgg ccacatgcag 2161 aggcagtgtc ctgagccaag aaaaataaaa tgtcttaaat gtggaaagcc agggcactta 2221 gcaaaagact gcaggggaca ggtgaatttt ttagggtatg gccggtggat ggggacaaaa 2281 ccaagaaatt ttcccgcagc cactcttggg gcggaaccaa gtgcgccccc tccaccgaac 2341 aactctacac cttacgaccc agcaaagaag ctcctgcagc agtatgcaga gaaagggaaa 2401 caaatgagaa atcagaacag aaacccccca gcgaacaatc cagattggaa cgagggatat 2461 tctttgaact ccctctttgg agaagaccaa taaggacctg tataatagga ggaactgccg 2521 ttaaggcatt attagataca ggggcagatg acactataat aaaggataca gatttacaat 2581 taaggggatc atggagacca aaaatagtag gaggaattgg gggagggtta aacgtaaaag 2641 aatatgataa tgtagaagta caattggaag acaagatatt aagaggaaca gtcctcatag 2701 gagcaactcc catcaatatc ataggaagaa actttttagc ccaggcagga gccaaattag 2761 tgatggggca attgtcgcag acaataccaa tcaccccggt acgcttaaag gaaggggcca 2821 gaggaccacg attgaagcaa tggccactct ctaaagaaaa aataatagcc ctgcaagaaa 2881 tttgcaaaac attagaggaa gaaggaaaat taagcagggt agggggagac aatgcataca 2941 atacaccagt attctgtata aggaaaaaag acaaatcaca gtggagaatg ctggtagatt 3001 tcagggaact caacaaagct acacaagact tctttgaagt ccaattaggt ataccccatc 3061 cagcagggtt aaagaaaatg aagcaaataa ccattataga tgtgggggat gcatattata 3121 gcataccact ggatcctgag tttagaaaat acacagcttt caccatccct acggtaaaca 3181 atgagggacc aggcataaga tatcaattta attgcctacc gcagggctgg aagggatccc 3241 cgacaatttt ccaaaacaca gcatcaaaaa ttctagaaga aataaagaaa gaattaaaac 3301 agctgacgat tgtccagtac atggatgacc tctgggtagg atcacaagaa gagggtccaa 3361 agcatgatca gctagtacaa acacttagga atagattgca agaatgggga ttagaaacac 3421 cagagaaaaa ggtgcaaaga gaacctccct ttgagtggat gggatataaa ttatggcctc 3481 ataaatggaa gttacaaagt atagaattag agaagaaaga acaatggaca gtgaatgatc 3541 ttcagaaatt ggtagggaaa ttaaattggg cagcacaatt atatccagga ttgagaacaa 3601 aaaatatctg taagctactt agaggaaaga aaaatttatt agacgtggta gaatggaccc 3661 cagaggcaga agcagagtac gaagaaaaca aggagatcct aaaaacagag caagaaggta 3721 cttattatgc accagaaaaa ccccttaggg cagcagtaca gaaattagga gatgggcaat 3781 ggtcatacca attcaagcag gaaggaaaaa tcttaaaggt agggaagttc gccaaacaga 3841 aagctactca caccaatgag ttgcgtgtac tagcaggagt agtacagaaa atagggaaag 3901 aggccctagt aatttgggga caattaccca cttttgaact cccagtggag agggacacat 3961 gggaacaatg gtgggcagac tattggcaag tcagttggat acccgaatgg gactttgtca 4021 gtgttccgcc cttagtaact ttgtggtata cactgactaa ggaacccatc ccgggagagg 4081 atgtctacta tgtagatgga gcctgtaata gacagtcgaa agagggaaaa gcaggctaca 4141 taacccaaca aggcaaacaa agagtacaac agctagaaaa cacaacaaat caacaagctg 4201 aactgacagc cataaaaatg gccttggagg atagcggccc taaagtcaat atagtaacag 4261 attcacaata tgcgatgggc atattgacag cacagcccac acagagtgac tccccactag 4321 tagaacaaat aatagcacag atggtacaga aagaagccat ctatctgcaa tgggtacctg 4381 ctcataaagg tatagggggc aatgaagaaa tagacaaatt agtaagcaag ggagttagaa 4441 gaatattgtt cattggcagg atagaagaag cacaagaaga acatgatagg tatcacagta 4501 actggagaaa tctagcagac acatttggat tgccacaaat agtagctaaa gaaattgtag 4561 caatgtgccc aaaatgtcaa gtaaaagggg aaccaataca tggacaagta gatgcttcac 4621 caggagtgtg gcagatggac tgcacacata tagaaggaaa aatagtgata gtagcggtcc 4681 atgtagccag tgggtttata gaagcagagg ttatccctag ggaaacagga aaagagacag 4741 caaagttctt gttaaaaata ataggaagat ggcccatcac tcacctccat acagataatg 4801 gaccaaattt cacttctcag gaagtagctg ctatgtgctg gtggggaaag gtagaacaca 4861 caacgggggt accatataat ccacagtccc agggatctat agaaagtatg aacaaacaat 4921 tgaaagagat aattggaaaa ataagagatg actgtcaata tacagaaaca gcagtactta 4981 tggcctgcca cattcacaat tttaaaagaa agggaggaat aggggggcta acagctgcag 5041 agagactaat aaatatgata acaacacaat tagaaatcaa cactctacaa accaaaatcc 5101 aaaaaatttt gaattttaga gtctactaca gagaaggcag agatccagtg tggaagggac 5161 ctgctcgcct gatctggaaa ggagaaggcg cggtagttct caaggaaggt gaagaactga 5221 aggtagttcc gagaaggaaa gcaaaaatca taaaagacta tgagccaaga aaaacattgg 5281 gtgatgagac tcacctggaa ggtgcaggag gaagtgatca ccaaatggca ggggatagtt 5341 agatattgga tgaataaaag gaatctgaaa tgggaataca aaatgcatta tcaaatcact 5401 tgggcatggt acactatgag cagatatgta atacccctcc caggaagtgg agaaatccat 5461 gtggatatct attggcattt agctccaaaa caaggatggc tctcaactta tgcagtagga 5521 atacaatatg ttagcctagt aaatgataaa tatagaacag aattagatcc caatacagca 5581 gactccatga tacattgtca ttattttacc tgttttacag atagagccat ccaacaggca 5641 ctaaggggaa acaggttcat cttctgtcaa tttccaggag gacataaact aacaggtcag 5701 gtaccctcct tgcaatattt agcattacta gcccatcaaa atggcctcag gaagagatcc 5761 cagagaggag agaccaggag gactagaaat ttgggatctc agcagggagc cgtgggacga 5821 atggctcaga gatatggtag aagaaatcaa caacgaagcc aaactgcatt ttggccgaga 5881 actcctatac caagtatgga attattgtca ggaggaaggg gagagacagg gaagacccat 5941 agcggaaagg gcatataagt attatcgctt agttcagaaa gctctctttg tgcatttccg 6001 gtgtggatgt cgcaggagac aaccctttga gccatacgag gagaggagaa atggacaagg 6061 gggaggaaga ccaggacgtg tcccaccagg acttgattaa acaatacagg aaaccccttg 6121 agacatgtac aaataaatgc ttttgcaaaa aatgctgtta tcattgccaa ttctgcttct 6181 tacggaaagg actaggtatt acctatcatg cctttaggac cagaagaaag aagattgctt 6241 cggctgatcg cattcctgta ccgcagcagt aagtatgaca aagttcttag gaatttttat 6301 agtattagga atagggatag gaatagggat aagtacaaaa cagcagtgga taacagtgtt 6361 ctatggagta ccagtatgga aaaacagctc agtccaagct ttttgcatga cacctactac 6421 taggttgtgg gcaactacta attgcatacc agatgatcat gactatacag aagtaccact 6481 gaatataaca gagccatttg aagcatgggc agacagaaat cccttagtag cacaagcagg 6541 aagtaacatt cacctgctgt ttgaacagac attaaagccc tgtgtaaagc tatcacctct 6601 atgtatcaaa atgaattgtg tagagttaaa aggctccgca acctctaccc cagcaacctc 6661 tactacggca ggaaccaaac taccctgtgt tagaaataaa acagactcca acctacagtc 6721 atgcaacgac accatcatag aaaaggagat gaatgacgag gcagcgtcaa actgcacctt 6781 tgctatggct gggtacatta gggaccaaaa gaagaattac tcagtagtat ggaatgatgc 6841 agaaatcttt tgtaagcgta gtacatcgca taatgggaca aaagagtgct atatgatcca 6901 ctgtaatgat tcagttataa aggaagcttg tgataagaca tattgggatg aattaagact 6961 aagatattgt gctccagcag gatacgcttt gcttaaatgt aatgattggg attatgcagg 7021 atttaagcca gaatgttcta atgtttcagt agtgcattgc acaactttaa tgaatacaac 7081 agtaaccact ggtctgttat tgaatggaag ctattcagaa aatcgaaccc agatctggca 7141 aaaacatgga gtgagcaatg actcagtgtt aatcttgctc aataagcatt ataacctgac 7201 agttacatgc aaaaggccag ggaataagac agtcttgcca gtaacgataa tggcaggatt 7261 agtcttccac tcacagaagt ataatacaag actaaggcag gcctggtgcc acttccaggg 7321 caattggaaa ggagcttgga aggaagtaca agaggaaata gtaaaattac caaaagaacg 7381 gtaccaaggc accaatgata caaacaaaat ctttttgcaa agacaatttg gagacccaga 7441 agcagcaaat ctatggttca actgtcaagg ggaattcttc tactgtaaaa tggactggtt 7501 tttaaattat ctgaataatt taacagtgga tgctgatcat aatcattgta aaaacaacgc 7561 agggaaaggt cgaagtccag gtccctgtgt acagagaact tatgttgcct gccatatccg 7621 atctgtcata aatgattggt atactatatc aaagaaaaca tatgctccac caagagaagg 7681 acatttgcag tgcacgtcca cagttactgg gatgacagta gagctaaact ataataacca 7741 gaacaggaca aatgtaacat tgagtcccca gatagaaacc atctgggcgg cagaattggg 7801 cagatacaaa ttggtagaga ttacaccaat tggatttgca cccacagaag tcaggcgata 7861 cacgggaggc caagagaggc aaaaacgagt cccgttcgtg ctagggttcc taggcttctt 7921 gggagctgct gggactgcaa tgggagcagc ggcgacagcc ctgacggtcc agtctcagca 7981 tttacttgct gggatattgc agcagcagaa gaatctgctg gcggctgtgg gagctcaaca 8041 gcagatgttg aagctgacca tttggggtgt gaaaaacctc aatgcccgcg tcacagctct 8101 tgagaagtac ctggcggatc aggcacggtt aaacgcttgg gggtgcgcgt ggaaacaagt 8161 atgtcataca acagtaccct ggacgtggaa taatacacca gagtggaata atatgacctg 8221 gttggagtgg gaaaaacaga tagaaggatt ggagggcaac ataacaaaac aattggaaca 8281 ggcaagggaa caagaggaaa agaatttgga tgcttatcaa aagttgtcag actggtcgag 8341 tttttggtct tggttcgatt tttcaaaatg gctgaacatt ttaaagatag gctttttggc 8401 agtaataggc gttatagggt taagattgct ttacacatta tatacttgca tagctagggt 8461 taggcagggt tactctcctt tatctcctca gatccatatc catccgtgga agggacagcc 8521 agacaacgca ggagagccag aagaaggtgg aagaacaggc aaaagcaaat ctacgcatta 8581 gcagaaagaa tttgggggac gagacaagag gaccagttgg tgcaggcaat tgaccaattg 8641 gttcttgaca ctcagcatct ggttacacaa cagctgcctg accctccttc tcaagcttag 8701 aagcgcctgg cagtacttac aatatgggct tggggagctc aaagccgcag cacaagaagc 8761 agttaaccat ctggcgagct ttgcacgcaa cgcggcacac cagatatggc ttgcttgcag 8821 atccgcttat cgggcaatca tcaactctcc aagaagagtg cgacaagggc ttgaggaagt 8881 ccttaattag gaagagaaat ggcaacatga ctccagaagg aagacgtcta caggacgggg 8941 accaatggga tgaatggtca gatgaagaag atgaagtggg atttccagta agaccaagag 9001 tgccactaag acaaataaca tacaaacttg cagtagattt ttcgcacttt ttaaaagaaa 9061 agggaggact ggatgggatt tattactccg ataggagaaa taagatcctt aatctgtatg 9121 ccctcaatga atggggaatc attgatgatt ggaacgcatg gtcaaaagga cctgggataa 9181 gatacccgag gtgctttggc ttctgcttca agctagtacc ggttgccctg catgaggaag 9241 cagaaacatg tgaaaggcat tgcttggtac acccagcaca actgcatgaa gaccctgatg 9301 gtataaatca tggagaaata ttggcatgga agtttgatcc aatgttggct gttcagtacg 9361 acccctcaag ggagtacttt acagacttat attcaacagt tggtacagga aactagccga 9421 ccacaggctt gcggtttcct ggttgcctag gagatgacat taagaactgc tgacgggact 9481 ttccagcacg ggactttcca aggcgggaca tgggcggtac ggggagtggc tttaccctca 9541 gagctgcata aaagcagatg ctcgctggct tgtaactcag tctcttacta ggagaccagc 9601 ttgagcctgg gtgttcgctg gttagcctaa cctggttggc caccaggggt aaggactcct 9661 tggcttagaa agctaataaa cttgcctgca ttagagctta tctgagtcaa gtgccctcat 9721 taatgcctca ctcttgaacg ggagaagttc cttactgggt tctctctcaa acccaggcga 9781 gagaaactcc agca // LOCUS SIVAGM3 9625 bp ds-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) proviral, complete genome. ACCESSION M30931 KEYWORDS complete genome. SOURCE Simian immunodeficiency virus (isolate AGM3) from African Green monkey proviral genomic DNA. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9625) AUTHORS Baier,M., Garber,C., Mueller,C., Cichutek,K. and Kurth,R. TITLE Complete nucleotide sequence of a simian immunodeficiency virus from African green monkeys: A novel type of intragroup divergence JOURNAL Unpublished (1990); Paul-Ehrlich-Institute, 6070 Langen 1, Germany STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.Baier 20-DEC-1989. This sequence was taken from an infectious molecular clone (used for heterologous infection of the pigtail macaque). The 3' LTR sequence does not appear to match the 5' LTR sequence. FEATURES from to/span description pept 431 1996 gag polyprotein pept > 1687 4827 pol polyprotein (NH2 terminus uncertain) pept 4763 5461 vif protein pept 5244 5603 vpX protein pept 5554 5771 tat protein, exon 2 (first expressed exon) 8013 8154 tat protein, exon 3 (AA at 8014) pept 5711 5771 rev protein, exon 2 (first expressed exon) 8013 8221 rev protein, exon 3 (AA at 8015) pept 5778 8411 env polyprotein pept 8245 8934 nef protein BASE COUNT 3324 a 1827 c 2383 g 2091 t ORIGIN 1 cagtctctta ctaggagacc agcttgagcc tgggtgttcg ctggttagcc taacctggtt 61 ggccaccagg ggtaaggact ccttggctta gaaagctaat aaatcttcgc tgcattagag 121 cttctctgag tcaagtgccc tcattgacgc ctcactcttg aacgggtaaa acttccttac 181 tgggttctct ctcaacccag gcgagagaaa ctccagcagt ggcgcccgaa cagggacttg 241 acttgagtga aggcacgtac agctgagaag acgtcggacg cgaaggaagg cgcggggtgc 301 gacgtgacca agaagggctt ggtgagtagg cttctcgagt gccgggaaaa agctcgagcc 361 tagttagagg actaggaagg gccgtagcca taactactct gggcaagtag ggcaggcgga 421 cgggtacgca atgggggcgg ctacctcagc actaaatagg agacaattag acaaatttga 481 gcatatacga cttcgcccga ccggaaagaa aaagtaccaa attaaacatt taatatgggc 541 aggcaaggaa atggagcgct tcggcctcca tgagagatta ctagaatcag aagaaggatg 601 taagaagatc atagaagtac tctacccgct agaaccaaca gggtcggagg gcttaaaaag 661 tctgtttaac cttgtgtgcg tattgttttg cgtacacaaa gataaggaag tgaaagacac 721 agaagaagca gtagcaatag taagacaatg ctgccatcta gtggagaaag aaagaaatgc 781 agaaagaaat acaacagaga catctagtgg acaaaagaaa aatgacaagg gagtaacagt 841 gccacctggt ggcagtcaaa atttcccagc acaacaacag ggaaatgcat ggatacatgt 901 gcccttgtca ccacgcacct taaatgcgtg ggtaaaagca gtagaggaga aaaaattcgg 961 agcagaaata gtgcccatgt tccaggcttt atcagaaggg tgcacaccct atgacatcaa 1021 tcaaatgctt aatgtcctgg gagaccatca aggggcgcta caaatagtaa aagaaatcat 1081 caatgaggaa gcagcccagt gggatatagc tcacccacca ccagcaggac cattaccagc 1141 aggacaactc agagacccta gaggctctga catagcagga accaccagca cagtgcaaga 1201 acagctggaa tggatataca cagccaatcc cagagtagat gtgggtgcca tctatagaag 1261 gtggattatc ctggggttgc aaaaatgtgt aaaaatgtac aacccagtgt ctgtcttaga 1321 cataagacag gggcccaaag aagcattcaa agactacgta gataggttct acaaagcaat 1381 aagagctgag caggcctcag gagaagtaaa acagtggatg acagaatcat tactcattca 1441 gaatgctaat ccagactgta aagtcatcct aaagggcctg ggaatgcatc ccactctaga 1501 agaaatgtta actgcctgtc aaggagtggg aggaccaagt tacaaagcaa aagtgatggc 1561 agaaatgatg caaaatatgc aaagccagaa catgatgcaa cagggcggtc agagaggaag 1621 accaagaccc ccagtaaagt gttacaattg tggaaaattt ggccatatgc aaagacaatg 1681 ccctgaacca agaaagatga gatgcttgaa atgtgggaaa ccagggcatt tagcaaaaga 1741 ttgcagagga caggtaaatt ttttagggta tggccggtgg atgggagcga aacccagaaa 1801 ttttcccgcc gctactcttg gggtggagcc aactgcgccc cctccaccga gtccatacga 1861 ccctgcaaag aagctcctgc agcaatatgc agacaagggg aagcagttga gggaacaaag 1921 gaaaaaacca ccagcagtga atcccgattg gacagaggga tattctttga actccctctt 1981 tggagaagac caataaaaac agtttacata gaaggggtcc ccatcagagc attattagat 2041 acgggggcag atgataccat tataaaagaa gcagatttac aattatcagg aacatggaaa 2101 ccaaaaataa tagggggcat tggaggggga ctcaatgtaa aagagtatag tgatagggaa 2161 gtaagattgg aagacaaaat tttgagaggg accatattga taggaagcac tcccataaac 2221 ataattggaa gaaatatatt agcaccagca ggagccaaat tagtaatggg tcaactgtca 2281 gaacaaattc ccattacccc tgtgaaatta aaagaagggg ctagaggacc tttcttaaaa 2341 caatggcccc tctccaaaga aaaaataaaa gccttacagg aaatatgtga ccaattagag 2401 aaagaaggaa aaattagcaa gataggagga gagaatgcat acaacactcc agtgttttgc 2461 ataaagaaaa aagacaagtc acaatggaga atgttagtag attttaggga actaaacaaa 2521 gcaacacaag attttttcga agtacagtta ggcatacctc atccatcagg gttcgaaaag 2581 atgacggaaa taacagtatt agacataggg gatgcctatt attcaatacc attagaccca 2641 gagtttagaa agtataccgc ttttaccatt ccatcagtaa ataatcaagg gccaggtact 2701 agatatcagt tcaactgtct tccacaagga tggaagggat ccccaactat ttttcagaac 2761 acagcagctt ccattctaga agaaataaaa aaggagttaa aacccctaac cattgtgcaa 2821 tacatggatg acctatgggt agggtctcag gaagatgaat acacgcatga tcggttggta 2881 gaacaactaa gaatgaaatt aagtgcctgg ggattagaaa caccagacaa gaaagtacag 2941 aaaaaaccac cttatgagtg gatgggatac aaattgtggc cacacaagtg gcagataagc 3001 agcatagaat tagaagacaa agaagaatgg actgtaaatg atatacaaag actagtgggg 3061 aaactaaatt gggcagcaca gctttaccca ggactcagaa ctaaaaactt gtgtaaatta 3121 atcagaggaa aaaagaactt actagaaaca gtaacctgga cagaggaagc agaagcagaa 3181 tatgcagaaa acaaagagat cttaaaaacg gaacaggaag ggacctacta caaaccagga 3241 agacccatca gagcagcagt gcaaaaacta gaaggaggtc aatggagtta ccaattcaag 3301 caagagggac aagtattaaa agtaggtaaa tacacaaagc agaaaaacac tcataccaat 3361 gagttccgtg tattggcagg attagtacaa aaactttgta aagaatcttt agttatatgg 3421 ggagagttgc cagtccttga actcccaata gagagggaag tatgggaaca atggtgggct 3481 gattactggc aggtaagttg gattccagac tgggaatttg tcagtacccc acccctagta 3541 aaattatggt ataccctgac aaaagaaccc ataccaaagg aagatgtcta ctatgtggat 3601 ggagcttgta atagaaattc aagggaagga aaagcaggat atatcacaca atatgggaaa 3661 caaagggtgg aaaaattaga aaatacaaca aaccagcaag cagaattaat ggccataaaa 3721 atggcactag aagatagtgg gcctaatgta aacatagtaa cagattcaca atatgcaatg 3781 ggaatattaa ctgcccaacc cacacagagt gactcaccct taatagaaca aattatagca 3841 ctaatggtac aaaaacatca gatatacttg caatgggtac cagcagacaa agggatagga 3901 ggcaatgaag agatagataa actagtaagt caagggatga ggaaaatttt atttttagaa 3961 aaaatagaag aagcccagga ggaacatgaa aggtaccata ataattggag gaacttagca 4021 gacacttatg ggctaccaca aattgtggca aaagaaatag tagccatgtg tccaaaatgt 4081 cagataaaag gggaaccagt ccatgggcaa gtagatgcct cgccaggggt atggcaaatg 4141 gactgtacac atttagaagg caaggtaatc atagtagcag tccatgtagc cagtggattc 4201 atagaagcag aagttatacc tagagaaaca gggaaagaaa cagcaaaatt tttattaaag 4261 atactaagta gatggcccat aacccaactg catacagaca atggacccaa ttttacgtct 4321 caagaagtag cagcaatgtg ttggtgggga aaaatagaac acaccacagg tgtaccctat 4381 aaccctcaat cacaaggctc tatagagagt atgaataaac agttaaaaga aataattggg 4441 aaaataagag atgactgtca atacacagaa acagcagtac ttatggcatg ccacatccac 4501 aattttaaaa gaaagggagg aatagggggg ttaacaccgg cagagagatt aatcaatatg 4561 attactacac aattagaatt acaacaccta caaaccaaaa ttcaaaaaat tttaaatttt 4621 agagtctact acagagaagg gagagatcct gtctggaaag gaccaggaca gttaatttgg 4681 aaaggggaag gtgcagtggt catcaaagga ggtgtggaat taaaagaata cccaagaagg 4741 aaagcaaaaa ttataaagga ttatgaacca agaaaaagaa tgggtgatga gagtaacttg 4801 gaaggtgccg gaggagctga taactaaatg gcaagggata gtgaggtact ggatgaggac 4861 tagaaaatta gactggaaat atcgaatgca ctaccaaatt acatgggcat ggtacacaat 4921 gagtagatat gagatacccc tagggcaaca tggaagtata catgtagatc tatattggca 4981 tctgacacca gaaaagggat ggctatcaac atatgctgag gggatacagt atctaagcaa 5041 tagggatcct tggtatagga cagaattgga tcctgcaaca gcagatagcc tgatacatac 5101 ccattatttt acttgtttta cagaaagggc catcaggaaa gccctattgg gacagaggtt 5161 caccttctgt cagttccccg agggacacaa gaaaacagga caggtaccct ctttgcaata 5221 cttagctctc cttgcacacc aaaatggcct caggcagaga tcccagagaa gcaagaccgg 5281 gggaactaga aatatgggat ttgagcaggg agccgtggga cgaatggcta agagacatgc 5341 tagaagatat caatcaggaa gccaagatgc attttgggcg cgagctcctg ttccaagtat 5401 ggaactattg tcaggaggag ggagaaagga atcgcactcc catgctagaa agggcttata 5461 aatattataa attggtgcaa aaagctctct ttgtgcattt ccggtgtgga tgccgcagaa 5521 gacaaccctt tgaaccatac gaagaaagga gggatggaca agggggagga cgagcagggc 5581 gcgtaccacc aggacttgat tgaacaactc aaagcacccc tgaagcggtg tacaaacaag 5641 tgctattgta aatgttgctg ttatcactgt cagctttgct ttttacaaaa gggattaggt 5701 gttacctatc atgcccctag gatcagaaga aagaagattg ctccgcttga tcgctttcct 5761 gaacaaaaac agtgagtatg aagctgacat tactgatagg gatactatta atagggatag 5821 gagtagtgct taatacaagg caacaatggg tcacagtatt ttatggagta ccagtatgga 5881 aaaacagctc agtacaggct ttctgcatga cacccaccac cagactatgg gcaactacta 5941 actcgatacc agatgatcat gactacacag aggtaccatt aaacatcact gaaccatttg 6001 aagcatgggc tgacagaaac cccttagtag cacaagcagg aagtaatata cacctgctat 6061 ttgagcagac tctgaagcca tgtgtaaaat tatcaccttt gtgcattaaa atgtcctgtg 6121 tagaattgaa ctcctctgag cctaccacca ctcctaaaag taccacggcc tcaacaacca 6181 atatcacagc ctcaacaacc actttgccgt gtgtccagaa caagacaagt actgtgttag 6241 aatcatgtaa tgaaacaatc atagaaaagg aattaaatga agagcctgct tctaattgta 6301 catttgcaat ggcagggtat gtaagagatc agaaaaagaa gtattcagtg gtgtggaatg 6361 atgcagaaat catgtgtaag aagggtaaca attctaacag agaatgttat atgattcatt 6421 gtaatgattc agttataaaa gaagcctgtg ataaaacata ttgggatgag ttaagattaa 6481 ggtactgtgc cccggcaggg tttgctttat taaaatgcaa cgattatgat tatgcagggt 6541 ttaagacaaa ctgttctaat gtttcagtgg tgcattgtac taacttgata aatacaacag 6601 tgactactgg actgttgttg aatgggagct actcagagaa tcgaacccag atatggcaga 6661 aacatagagt aagcaatgac tcagtgttag tgttatttaa taaacattac aatctaacag 6721 ttacttgcaa aagaccagga aacaaaacag tcttaccagt aacaatcatg gcagggctag 6781 tgtttcattc tcagaggtac aatacaaggc tgagacaagc ttggtgtcac ttccagggca 6841 actggagagg agcctggaaa gaagtaaaaa atgaaatagt aaaattacca aaagatagat 6901 accaaggaac caatgatact gaagagattt atctgcagag actatttgga gatccagaag 6961 cagcaaattt atggtttaat tgtcaggggg aattcttcta ttgtaaaatg gattggtttc 7021 taaattacct gaataatcgt acagtagatc cggaccataa tccgtgtaat ggtacgaagg 7081 gaaaaggtaa ggcaccagga ccctgtgcac aaagaacata tgttgcttgc catatacgat 7141 ctgtcattaa tgattggtac acactatcaa ggaaaaccta tgcaccgcca agagaagggc 7201 acttgcaatg cacatccacg gtaacgggta tgtcagtgga gctaaattac aatagtaaga 7261 acaggactaa tgtaacatta agtccccaga tagaaaccat ctgggcagca gaattgggca 7321 ggtacaaatt agtagaaatt acaccaattg gcttcgcacc cacagaagta agaaggtata 7381 cgggaggtca tgacagaaca aagcgagtcc cgttcgtgct agggttccta ggcttcttag 7441 gagctgctgg gactgcaatg ggagcagcgg cgacagccct gacggtccag tctcagcatt 7501 tacttgctgg gatactgcag cagcagaaga atctgctggc ggctgtggag gctcaacagc 7561 agatgttgaa gctgaccatt tggggtgtga aaaacctcaa tgcccgcgtc acagctcttg 7621 agaagtacct agaggaccag gcgcggttga atgcttgggg gtgcgcatgg aagcaagtct 7681 gtcatacaac cgtaccgtgg cagtggaata ataggacccc tgattggaat aatatgactt 7741 ggctggaatg ggaaagacag atatcgtatt tggaaggtaa cataacaaca caattagagg 7801 aagccagagc acaggaggag aagaatttgg atgcatacca aaaattaagt agttggtcag 7861 atttctggtc ttggttcgat ttctcaaagt ggctgaacat tctaaaaata ggatttttgg 7921 atgtactagg tattatagga ttaagattgc tttatacagt atattcttgc atagctaggg 7981 ttaggcaggg ttactctcct ctttctccac agatccatat ccacccgtgg aagggacagc 8041 cagacaacgc agaagggcca ggagaaggtg gagacaagcg caagaacagc tccgagcctt 8101 ggcagaaaga atctggcaca gcagagtgga agagcaactg gtgcaagcga ttgaccaatt 8161 ggtgctcgat cagcagcatc tggctataca acagttgcct gaccctccta gttcatctta 8221 ggagcgcttt ccagtacata caatatgggc ttggggaact caaagccgca gcacaagaag 8281 cagttgtcgc tttggcacgc cttgcacaaa acgcgggcta ccagatatgg cttgcttgca 8341 gatccgctta tagggcaatc atcaactctc caagaagagt gcgacaaggc cttgaaggaa 8401 tccttaatta ggaagagaaa tggtaaaatg actccagaag gaagaaaatt acaagaagga 8461 gataaatggg atgaatggtc tgatgaagaa gatgaagtag gatttccagt aagaccaaga 8521 gtgccgctaa gacaaatgac ctataaatta gcggtggact tttcgcactt tttaaaagaa 8581 aaggggggac tggatgggat ttattactcc gacaggagga atcagatcct aaacctgtac 8641 gccctcaatg agtggggaat cattgatgat tggaatgctt ggtcagaagg accaggaatc 8701 agatacccaa gatgcttcgg cttctgcttt aaattggtac cagtagacct gcatgaggaa 8761 gcagagactt gtgagagaca ttgcctggtg catccagcac aagtgaggga agaccctgat 8821 ggaatcaacc atggagaagt cttggtctgg aagtttgatc ccatgttagc agtccaatat 8881 gaccctaaca gaaaatatct cactgacatg catgatcttg gcaagaggaa gtagctaacc 8941 gcaggcttgt ggttaagcac atcaccatgg tgatgacatt aagaactgct gacgggactt 9001 tccagcaagg gactttccag ggcgggtcat gggcggtacg gggagtggct ttaccctcag 9061 agctgcataa aagcagatgc tcgctggctt gtaactcagt ctcttactag gagaccagct 9121 tgagcctggg tgttcgctgg ttagcctaac ctggttggcc accaggggta aggactcctt 9181 ggcttagaaa gctaataaat cttcgctgca ttaggcagag acttgtgaga gacattgcct 9241 ggtgcatcca gcacaagtga gggaagaccc tgatggaatc aaccatggag aagtcttggt 9301 ctggaagttt gatcccatgt tagcagtcca atatgaccct aacagaaaat atctcactga 9361 catgcatgat cttggcaaga ggaagtagct aaccgcaggc ttgtggttaa gcacatcacc 9421 atggtgatga cattaagaac tgctgacggg actttccagc aagggacttt ccagggcggg 9481 tcatgggcgg tacggggagt ggctttaccc tcagagctgc ataaaagcag atgctcgctg 9541 gcttgtaact cagtctctta ctaggagacc agcttgagcc tgggtgttcg ctggttagcc 9601 taacctggtt ggccaccagg ggtaa // LOCUS SIVAGM691 683 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) long terminal repeat. ACCESSION M33719 KEYWORDS . SEGMENT 1 of 2 SOURCE Simian immunodeficiency virus (isolate ver-1 (692)) from African green monkey proviral DNA. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 683) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted prior to publication in computer-readable form by Phillip Johnson. The ver-1 isolate is from a monkey imported from Ethiopia. Author address:Phillip Johnson Georgetown University Rockville, MD (301-496-2976) FEATURES from to/span description LTR 1 683 long terminal repeat BASE COUNT 174 a 149 c 187 g 173 t ORIGIN 1 tggatgggat ttattactcc gaaagaaggg aaaagatttt gaacctgtat gcattaaatg 61 aatggggaat catagatgat tggcaagctt atactccagg tccaggcatc agatatccaa 121 gatgctttgg gttctgtttt gaattagtgc cagtggacct tagtgaggaa gcgcaaggat 181 gtgaaaggca ctgtctggtc catcctgctc aattacagga ggatccagat ggtatctggc 241 atggagaaac attggtctgg agattcaatc ccatgctagc atgcaaggcc atgccaggag 301 tgttcaatga catgcatgca acagtgggga agtagcttgc ggttagcgcg tccgggacct 361 gtgtaccaac cagcatagca accatgctaa tgagctaggg actttccaga aggggagtgg 421 tttaaccctc agatattgta tataagcaga tgctcttggg cttgtaactc agtgctctta 481 ctaggagcca gctagagcct gggtgttcgc tggtagccta acctggactg gccctccagg 541 ggtaagagcc tccacggctt gaatgcttaa taaaccttgc ctgcattaga agtacttcga 601 gtcgtgtggt cccattgccg cctccgttca cgggaatcct caatactggg ttctctcttg 661 cccaggggag agaaactcca gca // LOCUS SIVAGM692 1542 bp ss-DNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) gag gene, complete cds. ACCESSION M29974 KEYWORDS . SEGMENT 2 of 2 SOURCE Simian immunodeficiency virus (isolate ver-1 (692)) from African green monkey proviral DNA, clone ver-1(692). ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 1542) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted prior to publication in computer-readable form by Phillip Johnson. The ver-1 isolate is from a monkey imported from Ethiopia. Author address:Phillip Johnson Georgetown University Rockville, MD (301-496-2976) FEATURES from to/span description pept 1 1542 gag polyprotein BASE COUNT 532 a 299 c 405 g 306 t ORIGIN 1 atgggttcgg gttcctcagc actgtcaggg agaaaattag accaatttga acatatacgt 61 cttcgcccga acggaaagaa aaagtaccaa ttgaaacatt taatatgggc aggcaaggaa 121 atggagcgct ttggcctcca tgaaaagttg ttagaaacag aagaggggtg taaaaagatc 181 atagaagtat tgcttccctt agaaccaacc gggtcggaag gtttaaaaag cctgttcaat 241 ttgacctgcg tcatttgctg cattcatcag gaagcgaaag tgaaagacac agaggaagca 301 gtaataagaa taaagcaaca gtgccatcta gtggacaaag gtgagaatgc agccaaagga 361 atagataaga caacaccgac acctagtggt aggagtcaaa attacccggc acaacagcag 421 aataatgtat gggtacatgt gccacttagc cccagaacat taaatgcttg ggtaaaagta 481 attgaagaaa agaaatttgg agcagagata gttcccatgt ttcaggccct gtcagaagga 541 tgtaccccat atgatgtgaa ccaaatgttg aatgttctag gagaccatca gggggccctg 601 cagatagtga aagaggtcat caatgaagaa gctgcccagt gggacattac acatccccca 661 ccagcagggc cgctcccagc agggcaattg agagatccaa gggggtcaga catagcaggg 721 actactagta ccattcaaga acaactagaa tggatttaca cagccaaccc aagaatagac 781 gtgggagcta tctataggag atgggtaata gcagggctgc aaaaatgtgt cagaatgtat 841 aatccaacag gggttctgga tataagacaa ggaccaagag aatcttttag cgattatgta 901 gatagattct acaaggccct gagagcagaa caagcctctc aggatgttaa gaattggatg 961 acagacactc tgttgattca aaatgctaac ccagagtgta aggtcattct gaaagggcta 1021 ggcatgcacc ctaccttgga agaaatgctt acggcatgcc agggagtagg gggaccccaa 1081 tacaaagcca aattgatggt agaaatgatg aatcaaatgc agggggtcaa catggtacag 1141 caagcaggaa taggaggtag agggagagga agaccagtta aatgctacaa atgtggaaaa 1201 tttgggcatg tgcagaaaaa ttgcactcaa aaagggccag tagtatgcct gaaatgtgga 1261 aaacctggcc attttgctcg agattgcaga ggagcagtaa attttttagg gtatggcagg 1321 tggatgggag caaaaccaaa aaatttttta gaacacagag cagcagtccc ctccgcccct 1381 ccaccgccgc acaacccagg ggcgtacgac gaagccactc ggcttctgga gaaatatacc 1441 caagagggag cccaacaaag gagaaaagta gagaagagct cccaagcggg gagggaggaa 1501 gaggattatt ccttgaaatc cctctttgga gaagaccaat aa // LOCUS SIVAGM90 723 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) long terminal repeat. ACCESSION M33718 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate 90) from African green monkey proviral DNA, PCR clone 03F. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 723) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted in computer-readable form prior to publication by Phillip Johnson, Georgetown University, Rockville, MD (301-496- 2976). Author address:Phillip Johnson Georgetown University Rockville, MD (301-496-2976) FEATURES from to/span description LTR 1 723 long terminal repeat BASE COUNT 187 a 160 c 202 g 174 t ORIGIN 1 tggatgggat ttattactcc gaaaggagga atagaatcct caacctatat gctcttaatg 61 aatggggaat cattgatgat tggaatgcat ggtcagcagg accaggcata agatatcccc 121 gctgctttgg cttttgcttc aagttagtac cggtagagat gcatgaagag gcagaaacct 181 gtgagagaca ttgcttggtg catcctgcac aagtaaaaga ggaccccgat ggcatcagtc 241 atggagagac cttggtctgg aagtttgacc cctatgttag cagtgcagta tgacccaaac 301 agacagtatt tagaagacat gcatgcactg gtgaagagga agtagctaac cgcaggcttg 361 tggttaagcc gttgccgggg agatgacatt tgaaactgct gacaagggac tttccaaggg 421 actttccagg gcgggccatg ggcggtacgg ggagtggttt taccctcaga gctgcataaa 481 agcagatgct cgctggcttg taactcagtc tcttactagg agaccagctt gagcctgggt 541 gttcgctggt tagcctaacc tggttggcca ccaggggtaa ggactccttg gcttggaaag 601 ctaataaaca ttgcctgcat tagagcttat ccgagtcaag tgccctcatt gacgcctcac 661 tcaagcaggg gaaccgttcc ttactgggtt ctctctctga cccaggcgag agaaactcca 721 gca // LOCUS SIVMNDGB1 9215 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) gag, pol, vif, vpR, tat, rev, env and nef genes. ACCESSION M27470 X15781 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate GB1) from African mandrill. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9215) AUTHORS Tsujimoto,H., Hasegawa,A., Maki,N., Fukasawa,M., Miura,T., Speidel,S., Cooper,R.W., Moriyama,E.N., Gojobori,T. and Hayami,M. TITLE Sequence of a novel simian immunodeficiency virus from a wild-caught African mandrill JOURNAL Nature 341, 539-541 (1989) STANDARD full staff_entry COMMENT The mandrill virus is distinct from all other primate immuno- deficiency viruses, thus it can be regarded as a type 4 virus. There is neither a vpX nor a vpU coding sequence. The splice sites and coding regions for tat and rev are tentative. FEATURES from to/span description pept 450 1958 gag polyprotein pept < 1745 4774 pol (NH2-terminus uncertain; AA at 1745) pept 4728 5246 vif pept 5227 5541 vpR protein pept 5471 5730 tat protein, exon 2 (first expressed exon) 7950 8037 tat protein, exon 3 (AA at 7951) pept 5590 5677 rev protein, exon 2 (first expressed exon) 7950 8167 rev protein, exon 3 (AA at 7952) pept 5661 8126 env polyprotein pept 8170 8814 nef binding 275 291 primer (Lys-tRNA) binding site signal 9191 9196 poly-A signal BASE COUNT 3323 a 1478 c 2196 g 2218 t ORIGIN 1 ggagtctcta ctacagaggc taagggttgt atctctgagc agatcccctt agagcaagga 61 ccagagtcct gagtgactgg gtctgagcac ctcactcggg gctgatcacc tcgaggtagt 121 ggaactcctt gcttgcttgc tattgtcttc aataaagtaa cttagaatta gagcaagtga 181 gtaagtgtta tccattgtgc gcctctcttc taaacctgtt gtgttctcat ttagagaaca 241 gaaggacttc tagttaaccc tagaagcctt tcagtggcgc ccgaacagga cttgaagaga 301 ggcactgaca cttgaggcag agcactccgc ctggaagaag caggttgaag gagagtggac 361 tggtctgaag acgccaggag gtgagtcagt gggactgact ttacaagaat tagttgtacc 421 ctagtgtaag gggcagcata gtcagagcaa tgggtaatgg gaactctgcc ttgttaggga 481 ctgatttgga taaatttgag aaaataagat taaagagagg tggtaaaaaa tgttatagat 541 tgaaacacct ctgttggtgt aaaggtgaat tagatagatt tggcttatcg gataaactcc 601 ttgaaacaca gcaaggatgt gaaaaaatcc tctcagtatg ttggccatta tatgaccaag 661 gatcagataa tctaaaagct ttggtaggga cagtctgtgt tgtagcctgc atacacgcag 721 gtatagaaat taagagcaca caagatgctt taaaaaaatt aaaagtcata acaagaaagg 781 aagaaaagca ggaggatgaa agtaagaatt tccctgtaca aagggatgca gcaggacagt 841 atcagtatac tccaataagt cctaggatta tacagacatg ggtaaaaaca gtggaagaaa 901 agaagtggaa accggaggtc atccctctat tctcagcatt gacagaagga gcaatcagtc 961 atgatttgaa tatcatgctg aatgcagtag gagatcatca gggagcaatg caagtcttaa 1021 aagatgtaat taatgagcaa gcagcagaat gggatctaac acatcctcaa caacaaccag 1081 cacaaccagg aggaggatta aggacccctt caggctctga tatagcagga actacttcta 1141 cagtggaaga acaattggca tggatgaata tgcaacaaaa tgcaatcaat gtaggaacaa 1201 tctataagag ttggattata ctgggcatga atagattggt aaaaagtcat tgtccaataa 1261 gtataacaga tgtaagacag ggaccaaagg aagcttttaa agactatgta gatagattct 1321 acaatgtaat gagagcagaa caagcttcag gagaagtaaa gatgtggatg cagcagcatc 1381 tgcttataga aaatgcaaac ccagaatgca agcagatttt gagaagctta gggaaaggag 1441 caactttaga ggaaatgttg gaagcatgtc agggagtagg tgggccacaa cataaagcca 1501 gattaatggc agaaatgatg agaacagtgg taggacaatc acaaaatttt gtgcagcaga 1561 gagggcctca aagaggacca gttagacaac ctactggaag gaaacctatc tgcttcaact 1621 gtaataaaga agggcatgta gcaaggttct tcaaggcccc tagaaggaaa gggtgctgga 1681 attgtggagc aatggatcat cagaaagctc aatgccctaa gccagctcag cagcagaggg 1741 ttaatttttt agggtatggc ccttggggtc cctccaaacc ggggaattat ccggcacaag 1801 aggtgactcc aacagctcca ccattagagg agaaacctct gcagaaaact ctgagcactt 1861 atcagaaatt agggagaggg ctcaggcaga agatgaagga ggagaagaga gaggaggatt 1921 ttcattccct gagtactctc tttcaagaag accaatagaa gaggtctcag tggatggtgt 1981 cactataaga gctctactag atacaggagc tgatgatacc atctttaatg aaagaaatat 2041 aaaattaaaa ggaaattggc agccaaaaat tataggggga ataggtggaa acttaagagt 2101 aaaacagtat gataatgtat atgtagaaat aagagggaag ggaacatttg ggacagtatt 2161 gataggacct actccaatag atataatagg gagaaacata atggaaaaat taggaggaaa 2221 attaatattg gcacaattgt ctgataaaat accaataaca aaagtgaaat taaaaccagg 2281 agtagatgga cccagaataa aacaatggcc tttaagtaaa gagaaaatag ttggtcttca 2341 gaaaatatgt gatagattag aggaggaagg aaaaattagt agggtagatc caggaaataa 2401 ttacaataca cctatctttg ccataaagaa gaaggataaa aatgaatgga gaaaattaat 2461 agactttaga gaattaaaca agttaacaca ggattttcat gaattacagt taggtatacc 2521 tcacccagca ggaataaaaa agtgtaaaag aataacagtc ctagatatag gggatgccta 2581 ttttagtata cctctggatc cagattatag accctatact gcctttacgg taccatcagt 2641 taataatcaa gcaccaggaa aaagatacat gtataatgtt cttcctcaag ggtggaaggg 2701 aagtccatgt atctttcaag ggacagtagc atcactgctg gaggtattta gaaagaacca 2761 tccaacagta cagttatatc aatacatgga tgatttgttt gtagggtcag actatacagc 2821 agaagagcat gagaaagcta tagtagaatt aagggcttta ttaatgacat ggaacttaga 2881 aacacctgaa aagaaatatc agaaagaacc tccctttcat tggatggggt atgagttaca 2941 cccagataag tggaagatag aaaaggttca actaccagaa ttagcagaac agccaacagt 3001 aaatgaaata cagaaattgg taggtaaatt aaattgggct gcacagttat atcctgggat 3061 caaaacaaaa caactgtgca agctaataag aggaggacta aacataacag agaaagtcac 3121 aatgacagaa gaagcaagac tggaatatga acaaaataaa gagatcttgg ctgaagaaca 3181 agaagggtct tattatgatc ctaataagga attatatgta agatttcaga aaacaacagg 3241 aggagatata tcatttcaat ggaagcaagg aaataaggtt ttaagagcag ggaaatatgg 3301 gaaacagaaa acagcacata gtaatgacct catgaaattg gcaggtgcta cgcagaaggt 3361 aggaagagaa agtatagtaa tctggggttt tgtaccaaaa atgcagatac ccactacaag 3421 ggagatatgg gaagattggt ggcatgagta ttggcagtgt acatggatac cagaagtaga 3481 atttatcagc acacctatgt tagaaaggga atggtatagc ttgtccccag aacctctaga 3541 gggggtagaa acatattatg ttgatggagc agctaacagg gacagtaaaa tgggaaaagc 3601 aggatatatt acagatagag gttttcaaag ggtagaagaa tatctaaata ccaccaatca 3661 gcagacagaa ttacatgcag taaaactagc tctagaagat agtggaagtt atgttaacat 3721 agtaacagat tcacaatatg tagtaggtat actagcaagc agacctactg aaacagatca 3781 ccccatagta aaggaaataa tagaattaat gaaaggaaaa gaaaaaattt atttaagttg 3841 gctaccagca cacaaaggga taggagggaa tgagcaaata gataagctag taagttcagg 3901 aatcagaaaa gtcttattcc tacaaaatat agaaccagca caggaagaac atgagaaata 3961 tcatagcaat gaagcacaat taagagagaa attccactta ccagctctag tagccaaaca 4021 gattgtgcaa agttgcagta agtgctgtca tcatggagag cccataaagg gacagacaga 4081 tgcttcactt ggagtctggc agatagattg cacacatctg gaaaatcaaa ttattatagt 4141 agcagtgcat gtagcttcag gcttcatgaa ggcagaagtt ataacagcag aaactggaaa 4201 aaagacagca gagtttctgt taaagttagc agcacaatgg cctattagta aactacacac 4261 agataatggg cctaacttta ctagtcagga agtagaaacc atgtgttggt ggttagggat 4321 agaacacaca tttggaatac cctataaccc acaaagtcag ggggtagtgg aaaataaaaa 4381 taagtatcta aaagaattga ttgagaaaat aagagaagat tgcaaagaat taaaaacagc 4441 agtagccatg gccacattca ttcataattt taaacaaagg ggaggactag gggggatgac 4501 agcaggagag agaatagtaa atatgatcaa tacagaatta gaatatcaat atcaacaaaa 4561 tcaaatttca aaaaatttaa attttaaggt ttacttcaga gaaggaagag atcagctgtg 4621 gaaaggacct ggtatccttt tgtggaaagg agaaggggca gtagttttaa aatatcaaga 4681 agagataaag atagtaccta gaagaaagtg taaaataata aaagattatg gagagagtgg 4741 aaagaatagt caggttaact tggaaagtgt ctagtcagag aatagaaaag tggcactggt 4801 tagtaagaag acagatggca tgggccactg caaataatga ggaaggatgt tggtggctgt 4861 atcctcattt tatggcttat aatgaatggt atacttgcag taaagtagtg attataataa 4921 atagggacat aagattaata gttagaagct attggcattt gcaaatagag gtaggatgct 4981 taagtactta tgcagtaagc atagaagcag tagttagacc gccacccttt gagaaagagt 5041 ggtgtacaga gataactcca gaggtagcag atcatctaat acatttacat ttttatgact 5101 gcttcatgga cagtgcagtt atgaaagcca tcaggggaga agaagtgtta aaagtttgta 5161 gatttccagc tggccataaa gcacaaggtg ttctctcttt gcagtttctc tgcttgagag 5221 tcatctatgg gccagaagag agatgagcaa gtatcagaag atcaaggacc tcccagagag 5281 ccatacaatc agtggctagc agatactatg gaggaaataa aggaagaagc aagaaagcac 5341 ttccctctca ttatcctaaa tgcagtatca gaatattgtg tgcaaaacac agggagtgag 5401 gaagaggcct gtgagaaatt tattacctta atgaatagag ccatttgggt ccacctagct 5461 caagggtgtg atggaacctt cagggaaaga agaccacaac tgcccccctc aggattcagg 5521 ccaagaggag atagattata agcaactgct agaagagtat tatcagcctt tgcaagcttg 5581 tgagaataaa tgctggtgca agaaatgctg ctttcattgt atgctttgct ttcaaaagaa 5641 gggtttagga ataaggtacc atgtctacag gaaacgtgta ccaggaacta ataagaagat 5701 acctggtagt ggtgaagaag ctatacgaag gtaagtatga agtgtccagg tctttttctt 5761 atactatgtt tagcctacta gtaggtatta taggaaaaca atatgtgaca gtcttctatg 5821 gagtaccagt atggaaggaa gctaaaacac atttgatttg tgctacagat aattcaagtc 5881 tctgggtaac cactaattgc ataccttcat tgccagatta tgatgaggta gaaattcctg 5941 atataaagga aaattttaca ggacttataa gggaaaatca gatagtttat caagcatggc 6001 atgctatggg aagtatgtta gataccatac ttaagccatg tgtaaagatt aacccatatt 6061 gtgttaagat gcaatgtcag gaaacagaaa atgtatcagc aacaacagct aagcctataa 6121 ctacacctac tactacatct acagttgcaa gtagtacaga gatttactta gatgtagata 6181 aaaataatac agaagaaaag gtagagagga atcatgtatg taggtataac ataacaggac 6241 tatgcaggga ttcgaaggaa gaaatagtaa caaattttag aggggatgat gtgaaatgtg 6301 aaaataatac ttgctatatg aatcattgta atgagtcagt taatacagaa gactgtcaga 6361 agggactttt gataagatgt attttaggtt gtgtgcctcc aggatatgtc atgttaagat 6421 ataatgagaa gttaaataat aataaattgt gtagcaatat atcagcagtg cagtgtactc 6481 agcacttagt agccacagta agtagctttt ttggctttaa tggaactatg cataaggaag 6541 gagaattgat acccatagat gataaatata ggggcccaga ggaatttcat caaaggaagt 6601 ttgtctataa ggtgccagga aaatatggct taaagataga atgtcacaga aaaggaaata 6661 ggtcagtagt gagtactcca tcagctacag gattattatt ttatcatggg ttagaacctg 6721 gaaagaattt aaagaaaggc atgtgcacct tcaaaggacg ttgggggtta gcactttgga 6781 gtctagctaa agaactaaat aaattaaatg actccatcaa agtgaaccag acctgtaaaa 6841 attttactag cactggagag gagaacaaac aaaacacgga caagcaaaag gagtttgcca 6901 aatgcataaa gactcttaag atagataatt atactacatc aggagataga gcagcagaaa 6961 tgatgatgat gacatgtcaa ggtgaaatgt tcttctgtaa tgtaacaaga atcatgaggg 7021 catggaatga tcctaatgag aagaagtggt atccttatgc ctcatgtcaa attaggcaaa 7081 tagtagatga ctggatgcaa gtaggaagaa agatatattt accacctaca tcaggattta 7141 ataatcacat aaggtgtaca catagggtaa cagaaatgta ctttgaaatg caaaagatag 7201 atagtaatga aacaaaaatg caaattaaat tcttgcctcc cagtgaaacc tccaatcaat 7261 ttgttgctta tggagctcat tataaattag tcaaaataat gccaattggc atagcaccta 7321 cagatgtgaa aagacacact ttacctgaac atcataaaga gaagagagga gcagtaatac 7381 ttggtatcct tggtctgctc tcgctggcag gatccgcgat gggctcagtg tcggtggcac 7441 tgactgtcca atctcagtct ttggtgactg ggatagtgga acaacaaaaa cagttgttga 7501 agctcataga gcaacagtct gaactcttaa aactcaccat atggggagta aagaatttac 7561 agactcgcct gaccagtttg gagaattata tcaaggacca agctttgctg tctcaatggg 7621 ggtgttcatg ggcacaggtg tgtcatactt ctgtagagtg gactaataca agcatcactc 7681 caaattggac atcagaaact tggaaggaat gggagacaag aactgattat ctgcaacaaa 7741 acattacaga aatgttaaaa caggcatatg atcgagagca aagaaacaca tatgaattac 7801 agaagttagg agaccttaca tcttgggcaa gttggtttga ctttacttgg tgggttcaat 7861 acttaaaatg gggagttttc ttagtgttag gaattatagg attaagaatt ttgttagcct 7921 tatggaatac aataagtagg tttaggcagg gctatcgacc tgtcttttca caggactgcc 7981 agcagaacct ataccgcaaa cggccagaca acggagaaga agaaagcaac agcttagaac 8041 taggagagca caactccgag aacttgaagg aagaatcctt aaacagatcc ttgatagagg 8101 acctgaccag ctttgccagg gagtgaccaa tttggctttg gctgaaaaat ctgagagcag 8161 caattgaata tgggttcctc gcagtccaag aagcgatcag aagcttgggt tcgctactcg 8221 tcagctttgc ggcaattagt tggagggccg gttacaccgg atggctacaa gcaaatagaa 8281 tcttcacagg gtgcagagaa gcaatcattg ctgcggggac gtgcatatgg cacatactca 8341 gaaggattag acaaagtgca gaacgacccc ttaactaaag atgagaaact tgacttaaca 8401 cagcaggatc cagaagagga ggaagaagtt ggatttcctg tgtgtcgcca agtttcctta 8461 agagtgccat catacaaaga tctgatagac ttctctcatt ttataaaaga aaagggggga 8521 ctgggaggga tatattatag caggagaaga gaagaaatcc tagatctcta tgcagagaat 8581 gagtggggat ttgaacctgg atggcaacag tatacgacag gtccaggaac cagatatcct 8641 aagacatttg gattcctgtt taagctggaa ccagtgagca gagctatagg agatgagtat 8701 gcagctaaca atcatctgtt acactcctcc cagttatgtc ctcaggaaga tccagaagga 8761 gagaccctca tgtggtctgg gaccctcatc ttgcctatga ctttgcagca ttaacatatc 8821 accctgagtg tttcaataag gctaagagta ttgaacatct gccattttgg aagaggaagt 8881 agcctaaccg caaaaccaca tcctactgca gaactgtagt tgcttggcaa cctgcttagc 8941 aacctggact ggcgcttgcg cgctaggaag ggactttcca aacagggagg gggaggctcg 9001 ccccatgctg ctatataagc agctgcattt cgcttgttcg ggagtctcta ctacagaggc 9061 taagggttgt atctctgagc agatcccctt agagcaagga ccagagtcct gagtgactgg 9121 gtctgagcac ctcactcggg gctgatcacc tcgaggtagt ggaactcctt gcttgcttgc 9181 tattgtcttc aataaagtaa cttagaatta gagca // LOCUS SIVMNE 9628 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus (SIV) complete proviral genome. ACCESSION M32741 KEYWORDS complete genome. SOURCE Simian immunodeficiency virus from captive Macaque nemestrina proviral DNA, clone 8. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9628) AUTHORS Benveniste,R.E., Heidecker,G., Greenwood,J. and Gonda,M.A. TITLE ; JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Kindly submitted in computer-readable form by R. Benveniste. The gag protein sequence was reported in J. Virol. 62, 2587-2595, 1988. This molecular clone, after transfection into T-cell lines, produces infectious viral particles. In particular, clone 8 has been inoculated intravenously into two pig-tailed macaques causing CD4 lymphocyte depletion; see J. Virol. 62, 2091-2101, 1988. The env cds is truncated as is the case with MM251 and MM142. Author address:R.Benveniste National Cancer Institute Frederick, MD (301-698-5836) FEATURES from to/span description pept 533 2053 gag polyprotein pept < 1708 4878 pol polyprotein (NH2-terminus uncertain) pept 4808 5452 vif protein pept 5280 5618 vpX protein pept 5619 5924 vpR protein pept 5770 6065 tat protein, exon 2 (first expressed exon) 8280 8376 tat protein, exon 3 (AA at 8281) pept 5996 6065 rev protein, exon 2 (first expressed exon) 8280 8533 rev protein, exon 3 (AA at 8282) pept.ps 6072 8779 env protein (premature stop codon) pept 6072 8276 env protein 8280 8717 env protein pept 8551 9342 nef protein site 8277 8279 env protein in-frame stop codon BASE COUNT 3294 a 1807 c 2379 g 2148 t ORIGIN 5' end of 5' LTR R region (putative mRNA start). 1 agtcgctctg cggagaggct ggcagattga gccctgggag gttctctcca gcactagcag 61 gtagagcctg ggtgttccct gctagactct caccagcact tggccggtgc tgggcagagt 121 ggctccacgc ttgcttgctt aaagacctct tcaataaagc tgccttttag aagtaagcca 181 gtgtgtgctc ccatctctcc tagtcgccgc ctggtcaact cggtactcga taataagaag 241 accctggtct gttaggaccc tttctgcttt gggaaaccga agcaggaaaa tccctagcag 301 attggcgccc gaacagggac ttgaaggaga gtgagagact cctgagtacg gctgagtgaa 361 ggcagtaagg gcggcaggaa ccaaccacgg cggagtgctc ctagaaaggc gcgggtcggt 421 accagacggc gtgaggagcg ggagagaaga ggcctccggt tgcaggtaag tgcaacacaa 481 aaaagagata gctgtctttt atccaggaag ggataataag atagagtggg agatgggcgc 541 gagaaactcc gtcttgtcag ggaagaaagc agatgaatta gaaaaaatta ggctacgacc 601 cggcgggaag aaaaagtaca tgttgaagca tgtagtatgg gcagcaaatg aattagatag 661 atttggatta gcagaaagcc tgttggagaa caaagaagga tgtcaaaaaa tactttcggt 721 cttagctcca ttagtgccaa caggctcaga aaatttaaag agcctttata atactgtctg 781 cgtcatctgg tgcattcacg cagaagagaa agtgaaacac actgaggaag caaaacagat 841 agtgcagaga cacctagtgg tggaaacagg aacagcagaa actatgccaa aaacaagtag 901 accaacagca ccatctagtg gcagaggagg aaattaccca gtacaacaag taggtggtaa 961 ctatacccac ctaccattaa gcccgagaac attaaatgcc tgggtaaaat tgatagagga 1021 gaagaaattt ggagcagaag tagtgccagg atttcaggca ctgtcagaag gctgcacccc 1081 ctatgacatt aatcagatgt taaattgtgt gggagaacat caagcagcta tgcagattat 1141 cagagaaatt ataaacgagg aggctgcaga ttgggacttg cagcacccac aacaagctcc 1201 acaacaagga cagcttaggg agccgtcagg atcagacatt gcaggaacaa ctagtacagt 1261 agatgaacaa atccagtgga tgtacagaca acagaacccc ataccagtag gcaacattta 1321 caggagatgg atccaactgg ggttgcaaaa atgtgtcaga atgtataacc caacaagcat 1381 tctagatgta aaacaagggc caaaagagcc atttcagagc tatgtagaca ggttctacaa 1441 aagcttaaga gcagaacaaa cagatccagc agtaaagaat tggatgactc aaacactgct 1501 gattcaaaat gctaacccag attgcaagct agtgctgaag gggctgggta tgaatcccac 1561 cctagaagaa atgctgacgg cttgtcaagg agtaggagga ccaggacaaa aggcaagatt 1621 aatggcagaa gccctgaaag aggcccttgc accagggcca ctcccttttg cagcagccca 1681 acagaaggga ccaagaaagc caattaagtg ttggaattgt gggaaagagg gacactctgc 1741 aaggcaatgc agaaccccaa gaagacaggg ctgctggaaa tgtggacaaa tgggccatgt 1801 tatggccaaa tgcccagaca gacaggcagg ttttttaggc tttggcccat ggggaaagaa 1861 gccccgcaat ttccccatgg cccaaatgca tcaggggctg acgccaactg ctcccccaga 1921 ggacccagct gtggatctgc taaaaaacta catgcagttg ggcaaacagc agagagaaag 1981 caaaaggaag ccttacaagg aggtgacaga ggatttgctg cacctcaatt ctctctttgg 2041 agaagaccag tagtcactgc tcatattgag ggacagcctg cagaagtatt attagataca 2101 ggggctgatg attctattgt agcaggaata gagttaggtc cacattatac cccaaaaata 2161 gtaggaggaa taggaggttt tattaatact aaagaataca aaaatgtaaa aatagaagtt 2221 ttaggcaaaa ggattaaagg gacaatcatg acaggggaca ccccgattaa catttttggt 2281 agaaatttgc taacagctct ggagatgtct ctaaatttcc ccatagctaa ggtagagcct 2341 gtaaaagtca ccttaaagcc aggaaaagat ggaccaaaat tgaggcagtg gccattatca 2401 aaagaaaaga tagttgcatt aagagaaatc tgtgaaaaga tggaaaagga tggtcagttg 2461 gaggaagctc ccccgaccaa tccatacaac acccccacat ttgccataaa gaaaaaggac 2521 aagaacaaat ggagaatact gatagatttt agggaactaa ataaggtcac tcaggacttt 2581 acagaagtcc aattgggaat accacaccct gcaggactag caaaaaggaa gaggatcaca 2641 gtactggatg taggtgacgc atatttctcc atacctctag atgaagaatt taggcagtac 2701 actgctttta ctttaccatc agtaaataat gcagaaccag gaaaacgata catttataag 2761 gttctgcctc aggggtggaa ggggtcacca gccatcttcc aacacactat gagaaatgtg 2821 ctggaaccct tcaggaaggc aaatccagat gtgaccttag tccagtatat ggatgacatc 2881 ttagtagcta gtgacaggac agacctggaa catgacaggg tagttttaca gttaaaggaa 2941 ctcttaaata gcatagggtt ttctacccca gaagagaagt tccaaaaaga tcccccattt 3001 caatggatgg ggtatgaatt gtggccaaca aaatggaagt tgcaaaagat agagttgcca 3061 caaaaagaga cctggacagt gaatgatata cagaagttag taggagtatt aaattgggca 3121 gctcaaattt atccaggtat aaaaaccaaa catctctgta ggttaattag aggaaaaatg 3181 actctaacag aggaagttca gtggactgag atggcagagg cagaatatga ggaaaataaa 3241 ataattctca gtcaggaaca agaaggatgt tattaccaag aaggcaagcc attagaggcc 3301 acggtaataa agaatcagga caatcagtgg tcttataaga ttcaccaaga agacaaaata 3361 ctaaaagtag gaaaatttgc aaagataaaa aatacacata ccaatggagt tagactatta 3421 gcacatgtaa tacagaaaat aggaaaggaa gcaatagtga tctggggaca ggtcccaaaa 3481 ttccacttac cagttgagaa agatgtatgg gaacagtggt ggacagacta ttggcaggta 3541 acctggatac cgaaatggga ttttatctca acaccaccac tagtaagatt agtcttcaat 3601 ctggtaaagg accctataaa gggagaagaa acctattatg tagatggatc atgtaataaa 3661 cagtcaaaag aagggaaagc aggatatatc acagataggg gcaaagacaa agtaaaagtc 3721 ttagaacaga ctactaatca acaagcagaa ttggaagcat ttctcatggc attggcagac 3781 tcagggccaa aggcaaatat tatagtagat tcacaatatg ttatgggaat aataacagga 3841 tgccctacag aatcagagag caggctagtt aaccaaataa tagaagaaat gattaaaaag 3901 acagaaattt atgtagcatg ggtgccagca cacaaaggta taggaggaaa ccaagaaata 3961 gaccacctag ttagtcaagg gattagacaa gttctcttct tggaaaagat agagccagca 4021 caagaagaac atgataaata ccatagtaat gtaaaagaat tggtattcaa atttggatta 4081 cccagactag tggccaaaca gatagtagac acatgtgata aatgtcatca gaaaggagaa 4141 gctatacatg ggcaggtaaa ttcagatcta gggacttggc aaatggattg tacccatcta 4201 gagggaaaaa taatcatagt tgcagtacat gtagctagtg gattcataga agcagaagta 4261 attccacaag agacaggaag acagacagca ctatttctgt taaaattggc aagcagatgg 4321 cctattacgc atctacacac agataatggt gccaactttg cttcgcaaga agtaaagatg 4381 gttgcatggt gggcagggat agagcacacc tttggggtac catacaatcc acagagtcag 4441 ggagtagtgg aagcaatgaa tcaccatcta aaaaatcaaa tagatagaat cagggaacaa 4501 gcaaattcaa tggaaaccat agtattaatg gcagttcatt gcatgaattt taaaagaagg 4561 ggaggaatag gggatatgac tccagcagaa agattactta acatgatcac tacagaacaa 4621 gaaatacaat tccaacaatc aaaaaactca aaatttaaaa attttcgggt ctattacaga 4681 gaaggcagag atcagctgtg gaaaggacct ggtgagctat tgtggaaagg ggaaggagca 4741 gtcgtcttaa aggtagggac agacattaag gtagtaccca gaagaaaggc taagattatc 4801 aaagattatg gaggaggaaa agaggtggat agcagttccc acatggagga taccggagag 4861 gctagagagg tggcatagcc tcataaaata tctgaaatat aaaactaaag atctacaaaa 4921 ggtttgctat gtgccccatc ataaggtcgg atgggcatgg tggacctgca gcagagtaat 4981 cttcccacta caagaaaaaa gccaattaga agtacaaggg tattggaatt tgacaccaga 5041 aagagggtgg ctcagtactc atgcagtgag aataacctgg tactcaagga acttttggac 5101 agatgtaaca ccagactgtg cagacatttt actgcatagc acttatttcc cttgctttac 5161 agcgggagaa gtgagaaggg ccatcagggg agaacaactg ctgtcttgct gcaggttccc 5221 gagagctcat aagacccagg taccaagtct acagtactta gcactgagag tagtaagtta 5281 tgtcagatcc cagagagaga atcccacctg gaaacagtgg agaagagaca ataggagaag 5341 ccttcgaatg gctaaacaga acagtagagg agataaacag agaggcagta aaccacctac 5401 caagggagtt gattttccag gtttggcaaa ggtcttggga atactggcat gatgaacaag 5461 ggatgtcgca aagctatgta aagtacagat acttgtgttt aatacaaaag gctttattta 5521 tgcattgcaa gaaaggctgt agatgtctag gggaaggaca tggggcaggg ggatggagac 5581 caggacctcc tcctcctccc cctccaggac tagcataaat ggaagaaaga cctccagaag 5641 atgaaggccc acaaagggaa ccatgggatg aatgggtagt ggaggttctg gaggaactga 5701 aagaagaagc tttaaaacat tttgatcctc gcttgctaac tgcgcttggt aatcatatct 5761 ataatagaca tggagacacc cttgagggag caggagaact cattaaaatc ctccaacggg 5821 cgctcttcat gcacttcaga ggcggctgca cccactctag aatcggccaa tctggaggag 5881 gaaatcctct ctcaactata ccgccctcta gaagaatgct ataacacatg ctattgcaaa 5941 aagtgttgct accattgcca gttttgtttt cttaaaaagg gcttggggat atgttatgag 6001 cagtcacgca gaagaagaag aactccgaag aaggctaagg ctaatacatc ttctgcatca 6061 aacaagtaag tatgggatgt cttgggaatc agctgcttat cgccatcttg tttctaagtg 6121 cctatgggat ctattgcatt caatatgtca cagtctttta tggtgtacca gcttggagga 6181 atgcgacaat tcccctcttc tgtgtaacca ggaataggga tacttgggga acaactcagt 6241 gcctaccaga taatgatgat tattcagaat tggcccttaa tattacagaa agctttgatg 6301 cttgggagaa tacagtcaca gaacaggcaa tagaggatgt atggcatctc tttgagacct 6361 caataaagcc ttgtgtaaaa ttaaccccat tatgcattac tatgaaatgc aacaaaagtg 6421 agacagataa atggggattg acaaaatcat caacaacaac agcaccaaca gcaataccaa 6481 caaaagcaga ggcaataaaa gtggtcaatg agaatagtcc ttgtataaat catgataatt 6541 gcacaggctt ggaacaagag ccaatgataa gctgtaaatt caacatgaca gggttaaaaa 6601 gagacaagag aagagagtac aatgaaactt ggtactctgc agatttggtt tgtgaacaag 6661 gtaatagcac tgaaaatgaa agtagatgtt acatgaatca ctgtaacact tctgttattc 6721 aagaatcttg tgacaaacat tattgggatg ctattagatt taggtattgt gcacctccag 6781 gttatgcttt gcttagatgt aatgacacaa attattcagg ctttatgcct aactgttcta 6841 aggtggtggt ctcttcatgc acaagaatga tggagacaca gacttctact tggtttggct 6901 ttaatggaac tagagcagaa aatagaactt atatttactg gcatagcaaa gataatagga 6961 ctataattag tttgaataag tattataatc taacaatgaa atgtagaaga ccaggaaata 7021 agacagtttt accagtcacc atcatgtctg gattggtttt ccactcacaa ccaatcaatg 7081 ataggccaaa acaggcatgg tgtaggtttg aaggaaattg gaaggaggca ataaaagagg 7141 taaagcagac cattgtcaaa catcccaggt atactggaac taacaatact gataaaatca 7201 atttgacggc tcctggagga ggagatccgg aagttacctt catgtggaca aattgcagag 7261 gagagtttct ctactgtaaa atgaattggt ttctaaattg ggtagaagat aagaatctga 7321 ctggaactac ccagaagcca caggaacggc ataaaaggaa ttacgtgcca tgtcatatta 7381 gacaaataat caacacttgg cataaagtag gcagaaatgt ttatttgcct ccaagagagg 7441 gagacctcac gtgtaattcc acagtgacca gtctcatagc aaacatagat tggattgatg 7501 gaaaccaaac taatatcacc atgagtgcag aggtggcaga actgtatcga ttggaattgg 7561 gagattataa attagtagag atcactccaa ttggcttggc ccccacaaat gtgaagaggt 7621 acactactgg tggcacctca agaaataaaa gaggggtctt tgtgctaggg ttcttaggtt 7681 ttctcgcaac ggcaggttct gcaatgggcg cggcgtcgtt gacgctgacc gctcagtccc 7741 ggactttatt ggctgggata gtgcagcaac agcaacagct gttggacgtg gtcaagagac 7801 aacaagaatt gttgcgactg accgtctggg gaacaaagaa cctccagact agagtcactg 7861 ccatcgagaa gtacttaaag gaccaggcgc agctaaatgc ttggggatgt gcatttagac 7921 aagtctgcca tactactgta ccatggccaa atgcaaatct aacaccaaat tggaacaatg 7981 agacttggca agagtgggag cgaaaggttg acttcttgga ggaaaatata acggcccttt 8041 tagaagaggc acaaattcaa caagaaaaga acatgtatga attacaaaag ttgaatagct 8101 gggatgtgtt tggcaattgg tttgaccttg cttcttggat aaggtatata caatacggag 8161 tttatatagt tgtaggagta atactgttaa gaatagtgat ctatatagta caaatgctag 8221 ctaagttaag gcaagggtat aggccagtgt tctcttcccc accttcttat ttccagtaga 8281 cccatatccg acaggaccag gcactgccaa ccaaagaagg aacagaagga gacggtggag 8341 gcagcggtgg caacagctcc tggccttggc agatagaata tattcatttc ctgatccgcc 8401 aactaatacg cctcttgact tggttattca gcaactgcag aaccttgcta tcgagagcat 8461 accagatcct ccaaccaata ttccagagat tctccacgac cctacagaga atccgagaag 8521 tcctcaggac tgaactaacc tacctacaat atgggtggag ctacttccaa gaggcggtcc 8581 aagtcgcctg gagatctgcg acagagactc ttgcgggcgc gtggggagac ttatgggaga 8641 ctctgggaag agttggaaga tggatactcg caatccctag gaggatcaga caagggctcg 8701 agcttactct cttgtgaggg acagaaatac aatcagggac agtttatgaa tactccatgg 8761 aaaaacccag ctggagagag ggaaaaatta gcatacagaa aacaaaatat agatgatata 8821 gatgaagaag ataatgactt ggtaggggta ccagtgaggc cacgagttcc cttaagaata 8881 ataagttaca aattggcagt agatatgtct cattttataa aagaaaaggg gggactggaa 8941 gggatttatt acagtgaaag aagacataaa atcttagaca tgtacttaga aaaggaagaa 9001 ggcatcatgc cagattggca gaattacacc tcgggaccag gacctagata cccaaagaca 9061 tttggctggc tatggaaatt agtccctgta aatgtatcag atgaggcaca ggagggtgag 9121 gagaattatt tactgcatcc agctcaaact tcccagtggg atgacccttg gggagaggtt 9181 ctagtatgga agtttgatcc aactctagcc tacacttatg aggcatatat tagataccca 9241 gaagagtttg gaagcaagtc aggcctgtca gaggaagagg ttagaagaag gctaaccgca 9301 agaggcctct taaaaatggc tgacaagagg gaaactagct gagacagcag ggactttcca 9361 taaggggatg tcatggggag gtactgggga ggagccggtc gggaacaccc actttcttga 9421 tgtataaata tcactgcatt tcgctctgta ttcagtcgct ctgcggagag gctggcagat 9481 tgagccctgg gaggttctct ccagcactag caggtagagc ctgggtgttc cctgctagac 9541 tctcaccagc acttggccgg tgctgggcag agtggctcca cgcttgcttg cttaaagacc 9601 tcttcaataa agctgccttt tagaagta // LOCUS SIVAGM677 2438 bp ss-RNA VRL 15-AUG-1990 DEFINITION Simian immunodeficiency virus LTR and gag gene, complete cds. ACCESSION M29973 KEYWORDS . SOURCE Simian immunodeficiency virus (isolate 677,(gri-1)) from African green monkey. ORGANISM Simian immunodeficiency virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 2438) AUTHORS Johnson,P.R., Fomsgaard,A., Allan,J., Gravell,M., London,W.T., Olmstead,R.A. and Hirsch,V.M. TITLE Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity JOURNAL J. Virol. 64, 1086-92 (1990) STANDARD full staff_entry COMMENT Kindly submitted prior to publication by P. Johnson, Georgetown University, Rockville MD (301-496-2976). The remainder of this complete genomic sequence will become available later in 1990. The gri-1 isolate is from a monkey imported from Ethiopia. Author address:P.Johnson Georgetown University Rockville, MD (301-496-2976) FEATURES from to/span description pept 897 2438 gag polyprotein LTR 1 688 5' LTR rpt 461 588 R repeat 5' copy binding 689 706 primer (Lys-tRNA) binding site BASE COUNT 752 a 503 c 689 g 494 t ORIGIN 1 tggatgggat atattactct gaaagaagag aaaagatcct gaatttgtat gccttgaacg 61 agtggggaat aatagatgat tggcaagctt actcaccagg cccggggata aggtacccga 121 gagtctttgg cttctgcttt aagctagtcc cagtggacct gcatgaggag gcacgcaact 181 gtgagagaca ctgtctgatg catccagcac agatggggga agatcctgat ggaatagatc 241 atggagaagt cttggtctgg aagtttgacc cgaagttggc ggtggagtac cgcccggaca 301 tgtttaagga catgcacgaa catgcaaagc gctagtgtca gcactttgcg gttgggactt 361 tccgccaggg actttccaca gtgggtggat cggaggcggt acaggggcgg tactgggagt 421 ggctttcccc tcagagctgc ataaaagcag atgctcgctg gcttgtaact cagtctctta 481 ctaggagacc agctagagcc tgggtgttcg ctggttagcc taacccggtt ggccaccggg 541 ggtaaggact ccttggcttc atatagctca ataaacctgc tcgcttagtc gctatattgg 601 agtcaagtgc tcattgctgc gccgagcctc tagaggtgaa cctctcttac tgggttctcc 661 tgtacccagg tgggagaaac tccagcagtg gcgcccgaac agggacttga gaagaggcat 721 cggcaccgac cgctgagttg ctgagcgtcg gagagggacg actcaggtag ggtgagagcc 781 tacgagtttt ttgctaccta gtcagcgaga aaggctaggc cgcgacaggg gcgcgggtcc 841 cattagtggc aaccaaccca gttggacgaa gggttggtag gggacgggtc ggagcaatgg 901 gcgggggtca ctcagcactg tcagggagaa gcctcgacac gttcgagaag attaggctac 961 gtccgaacgg gaaaaagaag taccaaatta aacatttaat atgggcagga aaagaaatgg 1021 aacgatttgg gttacatgag aaacttttag aaacaaaaga aggctgtcaa aaaatcatag 1081 aagttttaac cccgttggaa ccgacaggct ccgaggggct aaaagctctg tttaatttgt 1141 gctgcgtcat ttggtgcatt cacgcagaac agaaagtgaa agacacagag gaagctgtag 1201 taacagttaa gcaacactac catctagtgg acaaaaatga gaaagcagct aaaaagaaaa 1261 atgagacaac agcgccacct ggtggcgaat caagaaatta cccagtagta aatcagaata 1321 atgcctgggt acaccagcct ttgtctccgc gcacgttaaa tgcgtgggtc aaatgcgtgg 1381 aggaaaaaag gtggggagca gaagtagtcc ccatgttcca agcactctca gagggatgtc 1441 tctcctatga tgtaaatcag atgctcaatg taataggaga ccatcagggg gcattacaaa 1501 ttcttaagga agtcattaat gaagaagcag cagagtggga caggacacac agaccaccag 1561 ctggcccgtt accagcaggg cagctaagag acccgacagg gtcagatata gcaggaacta 1621 ccagctcaat tcaggaacaa atagagtgga ccttcaatgc caatccaaga atagacgtag 1681 gggcacaata cagaaaatgg gttattttgg gcttacaaaa ggtagtgcag atgtacaatc 1741 cccaaaaggt cctagacatt cgacagggac ctaaagaacc cttccaggac tatgtagaca 1801 gattctataa agccctgaga gcagaacaag caccacagga tgttaaaaat tggatgacac 1861 aaactttgct tatccagaat gccaatccgg attgtaaatt gattctgaaa ggattgggaa 1921 tgaatccaac cttggaggaa atgctaatag cttgccaggg agtaggaggg ccacaacata 1981 aggctaagct aatggtagaa atgatgagta atggacagaa tatggtccaa gtgggacctc 2041 agaaaaaggg cccccgaggg ccgctaaaat gctttaattg tggcaaattt ggacatatgc 2101 aaagggaatg caaggcacca agacagatca aatgctttaa gtgcggcaaa attggccata 2161 tggcaaaaga ctgcaagaat ggacaggcaa attttttagg gtatggccat tggggaggag 2221 cgaaaccaag aaattttgtg caatacagag gagacacagt tggtctggaa ccaacagccc 2281 ccccaatgga aacagcttac gatccagcaa agaagctcct ccagcagtat gcagagaagg 2341 gacagcgcct gagagaggag agagaacaga caaggaaaca gaaggagaaa gaagtggagg 2401 atgtttcctt gagctccctc tttggaggag accaatga // LOCUS BOVMHDQBQ1 624 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exon 2. ACCESSION M30008 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 1 of 2 SOURCE Bovine (Holstein individual 2042) DNA, clone Q1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 624) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept / 172 + 438 MHC DQ-beta cell surface glycoprotein, exon 2 (AA at 174) pre-msg < 1 > 624 MHC DQ-beta mRNA and introns IVS < 1 171 MHC DQ-beta intron A IVS 439 > 624 MHC DQ-beta intron B BASE COUNT 103 a 192 c 243 g 86 t ORIGIN Chromosome 23. 1 cccgggttca cagcgggagg cgcagggccg ggctggagcg caacaggggt tgagaggcgg 61 cgggtttcag gtttagggac cctctggcgg cggcggcacc tccccatctg gccgagcggc 121 gccgcgtggg gctgtggggc tgagcctgac cgagcggctg tctccccgca gaggatttcg 181 tggtccagtt taagggcctg tgttacttca ccaacgggac ggagcgagtg cggctcgtgg 241 tcagacacat ctacaaccgg gaggagtacg cgcggtttga cagcgacgtg aacgagtacc 301 gggcggtgac ctctggggcg ccgcacgccg agtactggaa cagccagaag gacctcctgg 361 agcagaggcg ggccgaggtg gacagggtgt gcagacacaa ctaccaggtg gctgccccct 421 tcacctggca gcggctaggt gagtacgggc tgccctccgc gggcccgccc tccacccgag 481 actcagcgcg ggagggggcc gggtctccag ggcggggttc ccaggcccgc atagggacag 541 ggaggccggg gcttcgcgga ggggcaggga ccgacgctcc gcggaaatgg acactcgcag 601 ccctggacct ctccccgcag aggc // LOCUS BOVMHDQBQ2 1151 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exons 3 and 4. ACCESSION M30007 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 2 of 2 SOURCE Bovine (Holstein individual 2042) DNA, clone Q1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1151) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 113 394 MHC DQ-beta cell surface glycoprotein, exon 3 870 / 980 MHC DQ-beta cell surface glycoprotein, exon 4 pre-msg < 1 > 980 MHC DQ-beta mRNA and introns IVS < 1 112 MHC DQ-beta intron B IVS 395 869 MHC DQ-beta intron C BASE COUNT 243 a 310 c 324 g 274 t ORIGIN Chromosome 23, about 3.7 kb after segment 1. 1 tggaatccgg ggatcttcct actctggaac cgaggaagga ctcttctcca tgggagacgt 61 gctgtgcggt ctcatgtctc actgtgtctt ttcctgtctg ttcctccctc agtggaacct 121 acagtgacca tctccccgtc caggactgag gctctaaacc accacaacct gctggtctgc 181 tcggtgacag atttctatcc gggccagatc aaggttcggt ggttccggaa tgaccgggag 241 gagacagctg gtgttgtgtc cacccctctt attaggaacg gggactggac cttccagatc 301 ctcgtgatgc tggaaatgac cccccagcga ggagatgtct acacctgccg cgtggagcac 361 cccagcctcc agagtcccat ctcagtggag tggcgtaagg gcacttggtc tcctttcact 421 gtgggcccta caggataggg cagacagagc ttcccgggtt catcccatct cacctctagt 481 ccccagcatc cctactgaaa tcagaggaca caagagtgct catacctcat agcaggggca 541 ttggaagagc ctagttacat tgtctttcca gatacgggag ctcactcaca caccatggcc 601 ccagagcccc acccagggag ctctgcagga gtgacaggtc caaggttatg catgtgtcct 661 tgaggggcag ggattggctt tctctgctta ttcaccttcc cagtctgtcc aaggatcttt 721 tgctgggtcc ctcacctggg ggtggttaga atgaagaact gagttcccct ggtacttcca 781 cttcctgtac ctcagactgg acttcaggat tctcaaggga cactgtggga tgtggagaca 841 aatgctgaca ctcaggctct gctccccagg ggcgcagtct gaatctgccc agagcaagat 901 gctgagtggt gttgggggct tcgtgctggg gctgatcttc ctcgggctgg gcctcattat 961 ccgtcacagg agccagaagg gtaaggagct ctggggacat ggggaagact ttgactggga 1021 ccttcttctc agggaggctc tagatgtagc tcttttccct gaccctgaca taaaggaggt 1081 taaggtggtg gcaggaagaa acaagcaacc tagggagaga ctgaagtctt actttactga 1141 ttgaaaggta g // LOCUS BOVMHDQBY1 779 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exon 1. ACCESSION M30006 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 1 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone Y1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 779) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept 467 + 575 MHC DQ-beta cell surface glycoprotein, exon 1 pre-msg 467 > 779 MHC DQ-beta mRNA and introns IVS 576 > 779 MHC DQ-beta intron A signal 351 357 CAAT box signal 384 391 TATA box site 292 304 X box site 324 333 Y box BASE COUNT 204 a 179 c 164 g 232 t ORIGIN Chromosome 23. 1 ggatcctgaa gggctacagt ccatggggtc gtaaagagta gaacacaact cattaattaa 61 cactttcact tttattttcc catacctcaa attctaagaa caacaggttt taaataaata 121 tcacagaaat atctactctt gaatcatttt ttttcattat ttaaactcct aaggcattca 181 atattcagat attttataac tgagagaaca ttttcatctc tatccagtgt aatttgatta 241 ggacacagtg ccaggcatta gattaagaac cttcaaaaaa aaaatgtcta cccagaaaca 301 gatgaagttt ttccgctcca ctgctgattg gtcccttttc tagggactct ccaatcttgc 361 catacatgga agctctcata ggctttttat tctgtgaagt aggctcacca gatccactgt 421 gtttgagctg tgttgactac cattagttct tcctttgttc tcaattatgt ttgggatggt 481 ggctctgcgg atccccagag ccctctggac agcagttgtg atggtgaccc tggtgatgct 541 gagcacccca ggggctgagg gcagagactc accaagtaag tgcagggcag ctgctccctg 601 gagccaccac actggggagc aggctctgag ggacccttgg gctggggtgt gatcttggga 661 tactgtcttt tatcacacat ttcctcccat tgggaatgag ggctatgtta cattctcatt 721 tccaccctct aaggacaagg tgaggacaat tcccctccca caggtttaac cctgggaat // LOCUS BOVMHDQBY2 977 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exon 2. ACCESSION M30005 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 2 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone Y1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 977) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 559 + 825 MHC DQ-beta cell surface glycoprotein, exon 2 pre-msg < 1 > 977 MHC DQ-beta mRNA and introns IVS < 1 558 MHC DQ-beta intron A IVS 826 > 977 MHC DQ-beta intron B BASE COUNT 191 a 264 c 338 g 182 t 2 others ORIGIN Chromosome 23, about 0.9 kb after segment 1. 1 actggcgcaa ctgttggaag gcgatcggtg cgggcctctt cgctattagc cagctggacg 61 aaagggggat gtgctgcaag gcgattaagt tgggtaacgc cagggttttc ccagtcacga 121 cgttgtaaaa cgacgccagt gccaagctta attctacagg tcctttctca tcccttgaac 181 tctcctgttg tcgtttgtct ctgaggttcc caggagttca gggtaaaatg ggatttaatg 241 tgagaatctt ttaagtatag agatggatgc aaaatcaacc tgccgccctg tttacttgat 301 tctgagcctc tagggatcac aggtcctagg gctctctcag cgtcaggcct cctcacatcc 361 tgggagccct cagagggggc ggnaagcccg ggttcacagc gggaggcgca gggccgggct 421 ggagcggaac agggtttgag aggcggctgg tttcaggttt aaagaccccg tggcggcggc 481 ggcacctccc catctggccg agcggcgccg cgtggggctg tggggctgag cctgacagag 541 cggctgtctc ccccgcagag gatttcgtgg tccagtttat gggccagtgt tatttcacca 601 acgggacgga gcgggtgcgg tacgtgacca gatacatcta caaccaggag gagtacgcgc 661 gcttcgacag cgactggggc gagtaccggg cgctgacccg ctggcggccg gccgccgagt 721 actggaacag ccagaaggac atcctggagc agacgtgggc cgaggtggac agggtgtgca 781 gaaacaacta ccaggtggaa gcccccttca cctggcagcg gcaaggtgag tgccggnctc 841 tccgcggggc cgccctccac ccgccaggac ttcgcgcagg gagggactga gtcctccgag 901 gcggtcccca gaccctcgaa tgggacagag gggcgctgag ggacagggga ccgagggcac 961 agcgtatggg gcggggg // LOCUS BOVMHDQBY3 1199 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exons 3 and 4. ACCESSION M30004 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 3 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone Y1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1199) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 129 + 410 MHC DQ-beta cell surface glycoprotein, exon 3 pept + 907 + 1017 MHC DQ-beta cell surface glycoprotein, exon 4 pre-msg 467 > 1199 MHC DQ-beta mRNA and introns IVS < 1 128 MHC DQ-beta intron B IVS 411 906 MHC DQ-beta intron C IVS 1018 > 1199 MHC DQ-beta intron D BASE COUNT 263 a 338 c 321 g 277 t ORIGIN Chromosome 23, about 3.7 kb after segment 2. 1 atctaaatcc aagccttgga atccaacgat ctttccactc tggtatcaag gaatgactcc 61 tgcccatggg agacatgctg tgcggtctca tgtctcactg tgtcttttcc tgtctgttcc 121 tccctcagtg gaacctacag tgaccatctc cccgtccagg acagaggctc taaaccacca 181 caacctgctg gtctgctcgg tgacggattt ctatccgggc cagatcaagg ttcggtggtt 241 ccggaatgac cgggaggaga cagccggcgt tgtgtccacc cctcttatag ggaatgggga 301 ctggaccttc cagatcctcg tgatgctgga aatgaccccc cagcgaggag atgtctacac 361 ctgccgcgtg gagcacccca gcctccagag ccccatcatg gtggagtggc gtaagggcac 421 ttggtttcct ttcactgtgg gcctaccgga cagggcagac agagcttccc ctgtccatgc 481 cctctcatcc cttgtcccca gcatcactac tgaactggaa atcacaggac acaagagtgc 541 tcatgcctcc tagcacaggc atcagaagag ccaaatcaca ttgtcttttc acatacaggg 601 agctcactgt acacatcatg gccccagagc ccagcctggt agctctgtag aactgactgg 661 tgaccatagt cttaaggtct aaggttatgg aagtgtccct gagagcaggg atccactttc 721 accttctctc acctgcccac tgtgtccaaa gatctgttgg tgggtccctc ccctggggtg 781 gtcagaatgg agagccacgt tcccctgaca cctccacctc ctgtacctca gactagacct 841 caagcttcct aaaggaatac catgagatgt ggggacaaac gctgacactc gggctctgct 901 ccccaggggc acagtctgaa tctgcccaga gcaagatgct gagtggtgtt gggggcttcg 961 tgctggggct gatcttcctc gggctgggcc tcattatccg tcacaggagc cagaagggta 1021 aggaactctg gggaaatggg aagatgggct gtgattcaga ccctctgttc agatcagcct 1081 ctgcctctga atgtagctct ttcctcctga tcctgaaacg gggaggcggg gctggggatg 1141 ggaggaaatg aacaacctag ggagacattg gagtttgact ttactagttt gaaagggta // LOCUS BOVMHDQBY4 883 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DQ-beta gene, exon 5. ACCESSION M30003 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 4 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone Y1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 883) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 521 534 MHC DQ-beta cell surface glycoprotein, exon 5 pre-msg < 1 841 MHC DQ-beta mRNA and introns IVS < 1 520 MHC DQ-beta intron D site 263 276 MHC DQ-beta g/t cluster implicated to contribute additional information to polyadenylation BASE COUNT 200 a 201 c 226 g 256 t ORIGIN Chromosome 23, about 0.3 kb after segment 3. 1 tttgtgtcat gagatctttt gtagacattg tgacccctag cagaaggtgc tctatttctg 61 ttctgtgtca gtgggattgt gggacaggta aaggagggaa gggtgtgaga tgagtgtgcc 121 tgggcgcagt gtctcattca tgacctgttc cctgctatgg aatcaagagt tagggaagaa 181 gtttctgtag gaggttctgt aggaagctcc tgaggttgtt ccccagaacc aggccataac 241 tttgatggca cctttctgtg aaacttggag ccagagctct ggtttgaaag atagacacca 301 ggatatcacc tactttgtgc cacatgttgg tgcctactgc ctgtgggcat ttataagtga 361 ttgaatgtgg tagaaagaag gtgaactatc actgcaattt actaaaaaat tgaaatcttc 421 atatccctca gaaggacaac agctgcttcc tggcttccca tgcctccttg ttaggttgaa 481 tgtgcgtgcc tgtgtgctga tcactctctc tcttctacag ggctcatgcg ctgactcctg 541 aggatatttt gggattggtg tttgctcttc tataatgtgt gcctgatctt gcccggaatt 601 cccagattcc tgtcagcctg tcccactctg agatcagagt caggtcacca ggtcatttcc 661 cgtggccatc ccccaaccac ggatctggct gtgatgctgc ttcctccact gaccctggaa 721 tctctgcctg tgcgttgtca gctgaatcta ctcagatccc aaaagcttct gacatagaca 781 tcagaagggg gacggagagt gtccccgcta gtctttagcc cagtgtttag aagctattaa 841 tcagataaga gagacacctc aaggttgatg gagtttcacc agg // LOCUS BOVMHDRB1 459 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta gene, exon 2. ACCESSION M30012 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 1 of 3 SOURCE Bovine (Holstein individual 2042) DNA, clone A1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 459) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept / 21 + 290 MHC DR-beta cell surface glycoprotein, exon 2 (AA at 23) pre-msg 21 290 MHC DR-beta mRNA and introns IVS < 1 20 MHC DR-beta intron A IVS 291 > 459 MHC DR-beta intron B BASE COUNT 108 a 92 c 169 g 90 t ORIGIN Chromosome 23. 1 gatctatcct ctctctgcag cacatttcct ggagtattct aagagcgagt gtcatttctt 61 caacgggacc gagcgggtgc ggttcctgga cagatactac actaatggag aagagaccgt 121 gcgcttcgac agcgactggg gcgagttccg ggcggtgacc gagctggggc cgcaggaccg 181 cgagtactgg aacagccaga aggacttcct ggaggagaag cgggccgagg tggacagggt 241 gtgcagacac aactacgggg gtatggagag tttcactgtg cagcggcgag gtgagcgcgg 301 gggtggactg gccagtgtgg agcagtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 361 gtgtgagaga gagagagaga gacagagaca gagacagaga cagagataga cagacagaaa 421 cagagatact tcactcactc tggtcgagtg tgtaccgac // LOCUS BOVMHDRB2 427 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta gene, exon 3. ACCESSION M30013 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 2 of 3 SOURCE Bovine (Holstein individual 2042) DNA, clone A1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 427) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 71 + 352 MHC DR-beta cell surface glycoprotein, exon 3 pre-msg < 1 > 427 MHC DR-beta mRNA and introns IVS < 1 70 MHC DR-beta intron B IVS 353 > 427 MHC DR-beta intron C BASE COUNT 95 a 124 c 107 g 101 t ORIGIN Chromosome 23, about 2.7 kb after segment 1. 1 ctgaaaggca gctaaccaag gagacttact ctgttgtcct cactgattcc ctccaccttt 61 tctctcctag tggagcctac agtgactgtg tatcctgcaa agactcagcc cctgcagcac 121 cacaacctcc tggtctgctc tgtgaacggt ttctacccag gccacattga agtcaggtgg 181 ttccggaacg cccatgaaga ggaggctggg gtgatctcca caggcctgat ccagaatgga 241 gactggacct tccagaccat ggtgatgctt gaaacagttc ctcagagtgg agaggtctac 301 acctgccaag tggatcaccc cagccggacg agccctatca cagtagaatg gagtgagctt 361 tctgatctca taaatccctc acccactgtg gagggggctt gctttcctct gagtgtcccc 421 tgagtgt // LOCUS BOVMHDRB3 276 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta gene, exon 4. ACCESSION M30014 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein. SEGMENT 3 of 3 SOURCE Bovine (Holstein individual 2042) DNA, clone A1. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 276) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept + 116 / 226 MHC DR-beta cell surface glycoprotein, exon 4 pre-msg < 1 > 226 MHC DR-beta mRNA and introns IVS < 1 115 MHC DR-beta intron C BASE COUNT 61 a 63 c 68 g 84 t ORIGIN Chromosome 23, about 0.35 kb after segment 2. 1 attctgattc ttccgggtag ccttctttcc tcattcccat agttcacaat ttcagcatca 61 caattagaga agagaatttg ggataaaaat gactaaaact ggcttctttt ctcaggggca 121 cggtctgact ctgctcagag caagatgatg agtggagtcg ggggcttcgt tctgggtctg 181 ctcttccttg ccgtggggct cttcatctac ttcaggaatc agaaaggtaa ggagcttgtt 241 ctttggacag ctgagcctcc ccactgactt ttggag // LOCUS BOVMHDRBE1 483 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta pseudogene, exon 1. ACCESSION M30011 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein; pseudogene. SEGMENT 1 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone E4. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 483) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept.ps 167 + 236 pseudo-MHC DR-beta, exon 1 pre-msg < 167 > 483 pseudo-MHC DR-beta mRNA and introns IVS 237 > 483 pseudo-MHC DR-beta intron A signal 9 14 CAAT box BASE COUNT 114 a 100 c 124 g 145 t ORIGIN Chromosome 23. 1 gagctcaccc aatccaggaa caaagatatg agccatttgt tggtatcact tggaatgtgg 61 gtggaggagg gctcatgtct ttactgagtg agacttccct gctcccccac accttgtctt 121 ttcctgttct ccagcatggt gtgactgttt ccccagaggc tcctggatgg cagctctgac 181 agtgatactg atggtgatga accctcccct ggcttgggcc agggacaccc acataagtgc 241 gtacctttcc ggcgggggtg aggggggtga gctatcatgg gatgggggga aggaagggag 301 ctagctttgt cactgtattc aggccatgtc ccttaaaatt gtgacatatt cttcatacta 361 tatatagtgg ctaagctgag tctgaataat tggtaacatt ttctgatgtt catatgtaac 421 atcagtgtac cttatggtat atttcaatat ataggggaat ttattcattc acattatatt 481 gaa // LOCUS BOVMHDRBE2 929 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta pseudogene, exon 2. ACCESSION M30010 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein; pseudogene. SEGMENT 2 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone E4. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 929) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept.ps + 228 + 500 pseudo-MHC DR-beta, exon 2 pre-msg < 1 > 929 pseudo-MHC DR-beta mRNA and introns IVS < 1 227 pseudo-MHC DR-beta intron A IVS 501 > 929 pseudo-MHC DR-beta intron B BASE COUNT 203 a 209 c 302 g 215 t ORIGIN Chromosome 23, about 5.4 kb after segment 1. 1 gtcgaccact gaagccactt ggagacctga ggggtctcct ctgcccacct tcgcctccct 61 gcactgtagg cagatgaaag aagggcccgt ggtagttcag gggtgcctgt ggagccaatg 121 agggagccct agtggccttc ctgtgcttgg gcagccctca ttggtggccg tcacatcagt 181 tccttcctgg gagcccacca ggtgaccgaa tcctggtgtg cccacagcac atttgatggt 241 gcagggcaag tccgagtgtc atttctccat ccggactgag caggtacgat tcttggccag 301 atacttctat aaccagaagg agttggtgca ttttgtcagc aacgatgtgg gtgagttcag 361 ggcagtgacc gagcggggca ggctcttcgc tgagagttgg aatcatcaga aggacttagt 421 ggagtgaacg caggctgtgg tggacacgtt ctgcagatac aactactgga ttggggagag 481 cttcatcctg cagcagcaag gtgagcacag gggtgggcgg ccaggggact ggggacagtg 541 tgtgtgtgtg tgtgtgtgtg tgagagagag agagagagac aaagagatag agagactgag 601 tcccggtgaa tgtgttgtat tatgagcaag tatgcttaag gagagttcct gtgagagcat 661 gttgcctgga gaaatgacac ttggacttgc cctgcaccat gaaatttgct gtgggaacag 721 caggattcgg tcaccctggt gggctcccag gaaggaactg atgtgacggc caccaatgac 781 gggctgccca agcacaggag ggccactagt gctccctcat tggctttaca ggcacccctc 841 aactaccatg ggttcttctt tcatctgcct gtatgacttt gtcagttatt gtgaaggaag 901 agacagtgtg tgtggtgggg ggagtacct // LOCUS BOVMHDRBE3 548 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta pseudogene, exon 3. ACCESSION M30002 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein; pseudogene. SEGMENT 3 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone E4. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 548) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept.ps + 12 + 293 pseudo-MHC DR-beta, exon 3 pre-msg < 1 > 548 pseudo-MHC DR-beta mRNA and introns IVS < 1 11 pseudo-MHC DR-beta intron B IVS 294 > 548 pseudo-MHC DR-beta intron C site 425 427 in-frame stop codon BASE COUNT 122 a 139 c 134 g 153 t ORIGIN Chromosome 23, about 5.8 kb after segment 2. 1 tttcctccta gtggaggatc ctacagtgac tgtgtatcct gcaaagaccc agcctctgca 61 gcaccacaac ctcctggtct gctctgtgaa tggtttctat ccaggacacg ttgaagtcag 121 gtggttccag aacggccatg aagaggctgg agtgatctcc acaggcctga tccagaatgg 181 agactggacc ttccagaccg tggtgatgct tgaaacagtt cctcagagtg gagaggtcta 241 cgcctgccaa gtggagcacc ccagccggac gagccctctc acagtggaat ggagtgagaa 301 gctttctgat ctcgtaagtt cctcacccac caagaagggg gcttgctcac ctctgagtgt 361 caggtttctc ctctctccat accatatttt ttatttgctt catgctcttt ctttcttagc 421 acaaattgtt ggggagtagc tctgtgatag cctgtgttag aaatcctctg atagtttaca 481 gatatcgttt gatagtttct atcaatacct atacctgctg gtgagacagt tcttcctggc 541 aggcagag // LOCUS BOVMHDRBE4 206 bp ds-DNA MAM 15-AUG-1990 DEFINITION Bovine MHC class II DR-beta pseudogene, exon 4. ACCESSION M30009 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility protein; pseudogene. SEGMENT 4 of 4 SOURCE Bovine (Holstein individual 2042) DNA, clone E4. ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 206) AUTHORS Groenen,M.A.M., Van der Poel,J.J., Dijkhof,R.J.M. and Giphart,M.J. TITLE The nucleotide sequence of bovine MHC class II DQB and DRB genes JOURNAL Immunogenetics 31, 37-44 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by M.A.M.Groenen, 20-NOV-1989. FEATURES from to/span description pept.ps + 86 / 196 pseudo-MHC DR-beta, exon 4 pre-msg < 1 > 196 pseudo-MHC DR-beta mRNA and introns IVS < 1 85 pseudo-MHC DR-beta intron C BASE COUNT 47 a 42 c 54 g 63 t ORIGIN Chromosome 23, about 0.35 kb after segment 3. 1 cttccaggca accttcttct cccatcctca aaagcttagg gaagttggat tgggataaga 61 tcactgaaac ttacttcttt tctaggggca tgatctgact ctgctcagag caggatgatg 121 agtggagtca ggggctttgt tgtgggtctg ctcttccttg ggatcaggtt gttcatctac 181 tttaggaatc agaaaggtaa ggatcc // LOCUS VECPCE30 143 bp ds-DNA SYN 15-AUG-1990 DEFINITION Expression vector pCE30, partial sequence. ACCESSION M36426 KEYWORDS expression vector. SOURCE Synthetic DNA. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 143) AUTHORS Elvin,C.M., Thompson,P.R., Argall,M.E., Hendry,P., Stamford,N.P.J., Lilley,P.E. and Dixon,N.E. TITLE Modified bacteriophage lambda promoter vectors for overproduction of proteins in Escherichia coli JOURNAL Gene 87, 123-126 (1990) STANDARD simple staff_entry BASE COUNT 37 a 35 c 40 g 31 t ORIGIN 1 agggcagcat tcaaagcaga aggctttggg gtgtgtgata cgaaacgaag cattgggatc 61 cccgggaatt cactggccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc 121 caacttaatc gccttgcagc aca // LOCUS CHKPPPTH 1723 bp ss-mRNA VRT 15-AUG-1990 DEFINITION Chicken parathyroid hormone mRNA, complete cds. ACCESSION M36522 KEYWORDS parathyroid hormone. SOURCE Chicken parathyroid gland, cDNA to mRNA, clones cPTH-[11,12,3]. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 1723) AUTHORS Khosla,S., Demay,M., Pines,M., Hurwitz,S., Potts,J.T.Jr. and Kronenberg,H.M. TITLE Nucleotide sequence of cloned cDNAs encoding chicken preproparathyroid hormone JOURNAL J. Bone Miner. Res. 3, 689-698 (1988) STANDARD simple staff_review FEATURES from to/span description pept 128 487 parathyroid hormone precursor sigp 128 202 parathyroid hormone signal peptide matp 221 484 parathyroid hormone BASE COUNT 626 a 311 c 331 g 455 t ORIGIN 1 ttttaaagtt agatttaagg gatccactaa accaattcag tagtctttaa atatacttga 61 catcaagaca cagccatctg ctgacatacc ccaaccagaa aactgttaag gacaatatct 121 gataaaaatg acttctacaa aaaatctggc caaggccata gtgattttat atgctatatg 181 tttttttaca aactctgatg gaagaccaat gatgaagaga tcggtgagtg agatgcaatt 241 aatgcataac cttggagagc atcgacacac tgtggagaga caggactggc ttcagatgaa 301 gctgcaggat gtgcacagtg cccttgagga tgccaggacc cagaggcctc gaaacaagga 361 ggatattgtc ctgggggaga taagaaaccg gaggctgctc cctgagcatt tgcgggcagc 421 agtgcagaag aaatccattg acctggacaa agcttacatg aatgtactct ttaaaactaa 481 gccatgatga aaagaccaag agcattataa ctgtccaagt aagcacatgt ctgtagatca 541 ctgaccagtt agggcatttt atttattatt ttttttttaa ctcaaactat gataaggatt 601 aaaggctcca tgccagactg tagccccact gagatgggta tttcacaact aaatagtaaa 661 gtgtatttat aggccaccca tggccattgc tgctaactcc caggtatctt ttaaatggct 721 aatgtaactc attaacttcc aggagaatta aaaacaaatg gcaaaacaaa aaacaacaaa 781 gaccacctgc aatagaataa gaaagttgaa aaacatttaa gaccagttct accactccta 841 tatggagagc atttgtctgt aatctttaga cctactagta ctgtaaacta acaacgtaat 901 ataggcataa ctgcattatg cctagggtta aacttcaagt ttgtcctaat gaaaggaacg 961 caaacttaaa tccactctta ctttcccaag aaggcctaaa gccagaccaa tgtcagtaac 1021 atagacaaag ctgcatgata ataacttagg attaaagagt gcgaacatga aaaatagaag 1081 gaacccaaag cttaagatta aagtagaatg aaataaattg tgcatgaaaa agaagaacga 1141 agttttacaa gatactgaaa tgaaagggag gtttattaac tttccctctt aattatgagc 1201 tgtcaccttt tggaactgca ggaacagtga gagcagagat tgtagcatat atgtatgcaa 1261 agccctaact atagaactgg gaaatggttc aacacgagat aaaaacaaga cttgtttcaa 1321 ttgttatcat ctctccttca gtcaataatc tatgagtttc tgtatattgt gcttaggcca 1381 catgggtaag tggctcacat aaaattactc atcttcacat gtgcacttat acagaattgg 1441 gatttcagtt tgttaaaacc ctgaaattac aaccattaaa atatagaaat caaaacctgg 1501 gaaccatcag ttaaaatata agcaggattc agaaagaatt tgacaggaac atggatggga 1561 gaaaatgatg ataataatat agaaaagaaa gcagcaaata taaaatgatt ttgaattgta 1621 tagacaagta tgtgcttatg acctcgacca cttctgaata ataagaatat ttcccctgta 1681 gaagtgacag cagtttcctc ccaatgttcc actgtgagaa ttc // LOCUS CUC11SGB 1684 bp ss-mRNA PLN 15-AUG-1990 DEFINITION Pumpkin 11-S globulin beta-subunit mRNA, complete cds. ACCESSION M36407 KEYWORDS 11-S globulin beta-subunit. SOURCE Pumpkin (cv. Kurokawa Amakuri Nankin) cotyledon mRNA, clone pPG-beta-2. ORGANISM Cucurbita pepo Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Violales; Cucurbitaceae. REFERENCE 1 (bases 1 to 1684) AUTHORS Hayashi,M., Mori,H., Nishimura,M., Akazawa,T. and Hara-Nishimura,I. TITLE Nucleotide sequence of cloned cDNA for pumpkin 11-S globulin beta- subunit JOURNAL Eur. J. Biochem. 172, 627-632 (1988) STANDARD simple staff_review FEATURES from to/span description pept 31 1473 11-S globulin beta-subunit precursor sigp 31 93 11-S globulin beta-subunit signal peptide matp 94 918 11-S globulin beta-subunit gamma-chain matp 919 1470 11-S globulin beta-subunit delta-chain BASE COUNT 457 a 406 c 463 g 358 t ORIGIN 1 ctaatagccc ttctcttctc cataccagca atggctcgct cttctctttt taccttttta 61 tgtttagcag ttttcatcaa tggctgcctc tctcagattg agcagcagag cccctgggaa 121 ttccaaggca gcgaagtatg gcaacagcac cgctaccaat ctcctagagc ctgtcgtctt 181 gagaatcttc gagctcaaga ccccgttcgc cgggctgagg cggaggcgat cttcactgaa 241 gtctgggacc aggacaacga tgagttccag tgcgccggcg tcaatatgat ccgccataca 301 atccggccca aaggtctgct tcttcctggt ttctctaatg ctcctaaact catcttcgtc 361 gcccaaggct tcggtattcg cggcattgca atccccggct gtgcagagac ttaccagact 421 gatttacgaa gatcgcaatc ggccggatct gcgttcaaag accagcatca gaagatccgc 481 cccttcagag agggagatct cctcgtcgtc ccggccggag tttctcactg gatgtataat 541 cgaggacagt ccgatctcgt tttgatcgta ttcgctgaca ctcgcaacgt cgcaaaccaa 601 atcgatccct acctcagaaa attctacctt gccggaaggc cagagcaggt agaaagaggc 661 gtagaggaat gggaaagaag tagccgaaag ggatcttccg gcgagaaatc aggcaatata 721 ttcagcggat ttgcagacga atttctagag gaagctttcc agatcgacgg tggactggtt 781 aggaagctaa agggagaaga cgacgagaga gacagaatcg tgcaggtcga cgaagatttc 841 gaggtgcttc taccggagaa agatgaagaa gagagatcga gaggaagata catcgaatca 901 gaatcagaat cggagaatgg cttagaagaa accatttgca cactccgatt aaagcaaaac 961 atcggccgat ctgttcgcgc cgacgtgttc aacccacgcg gcggccgaat ctccacggcc 1021 aactaccata ccctccccat tctccgccaa gtccgcctta gcgccgaacg aggagtcctc 1081 tacagcaacg cgatggtggc gccgcactac acagtgaaca gtcactcagt gatgtacgcg 1141 acgagaggca acgcgagagt gcaggtggtg gacaacttcg ggcagtcagt gttcgacggc 1201 gaggtccggg aaggacaggt actgatgatt ccgcagaact tcgtggtgat taaacgagca 1261 agcgacagag gattcgagtg gatcgcattc aagacgaacg acaacgcaat cacgaatctg 1321 ctggcggggc gagtgtcgca gatgaggatg ttgccgctgg gagtgctgtc gaacatgtac 1381 cggatctcga gagaggaggc gcagaggctg aagtacgggc agcaggagat gagggtgctc 1441 agccccggaa ggtcgcaggg aagaagagag tgaaaatgaa gaagtgggta gtgggtaatg 1501 ggtaatggga aatatatata tatggtagta gtaatctaat gtaatttagt gaataaagag 1561 cgagctttca ggtgatgccg ccgacgagcc ctgcttgtta ccggccggaa aaaatggaga 1621 aatctctcag aaagacaccg agttttaata ataaaagtaa taatattcgc ctcttttttc 1681 cttc // LOCUS DROKINLA 2175 bp ds-DNA INV 15-AUG-1990 DEFINITION D.melanogaster kinesin-like protein (nod) gene, complete cds. ACCESSION M36195 KEYWORDS kinesin-like protein; nod gene. SOURCE D.melanogaster DNA. ORGANISM Drosophila melanogaster Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Cyclorrhapha; Schizophora; Drosophiloidea; Drosophilidae. REFERENCE 1 (bases 1 to 2175) AUTHORS Zhang,P., Brodeur,B.A., Goldstein,L.S.B. and Hawley,R. TITLE A kinesin-like protein required for the distributive chromosome segregation in Drosophila JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by P.Zhang, 06-JUL-1990. Author address: P.Zhang Albert Einstein College of Medicine Molecular Genetics Dept. 1300 Morris Park Avenue Bronx, NY 10461 FEATURES from to/span description pept 72 2072 kinesin-like protein (nod) mRNA 1 2175 nod mRNA BASE COUNT 557 a 594 c 561 g 463 t ORIGIN 1 caaagtaaaa taattacggt gaatgcaagc caattgtgca ttattcaaac aacttcaatt 61 cttcaatctg catggagggc gccaaattaa gcgcagttcg gattgcggtc cgcgaggcgc 121 cgtaccgcca gttcttgggg cgtcgggagc ccagcgtcgt ccagtttccg ccatggagcg 181 acggaaagtc gttaatagtg gatcagaatg aattccactt cgatcacgcc tttcccgcga 241 ccatcagcca ggatgagatg taccaggcgc tgatcttgcc gctggtggac aagctgctcg 301 agggattcca gtgcactgca ctcgcctacg gccagacggg aacgggcaag agctactcaa 361 tgggcatgac acctccggga gagatactgc ccgagcacct gggtattctg cctcgcgccc 421 tgggcgacat ttttgagcgc gtgaccgccc ggcaggagaa caacaaggat gcgattcagg 481 tgtacgcctc cttcatagag atctacaatg agaaaccctt cgatctgctg ggctccacgc 541 cacatatgcc catggtggcg gcgcgttgcc agcgatgcac ctgccttcct ttgcacagcc 601 aggcggatct gcatcacatc ttggagctag gcactcgcaa tcgacgcgtt cgtcccacca 661 atatgaattc caatagttcg cgatcccatg ccatagtcac cattcacgtg aagagtaaaa 721 cccatcactc gcggatgaat attgtggatc tggccggttc agaaggcgtg cggcgaactg 781 ggcacgaggg cgtggccagg caggagggcg tcaacatcaa tctgggcctg ttgagcatca 841 acaaggtggt gatgtccatg gcggcgggcc acacagtgat accataccgc gacagcgtcc 901 ttaccacagt tctgcaggcc tcgctaaccg cgcagtcgta tctgaccttt ctggcctgca 961 tcagtccgca tcaatgcgat ctcagcgaga cgttgtccac cctgcgtttt ggcaccagtg 1021 ccaagaagct tcggctgaat ccgatgcaag tggcgcgcca gaagcaatcg ctggccgcac 1081 ggacaacaca cgtcttccgc caagcgctat gcacctcgac ggccatcaag tcaaacgcag 1141 ccaatcataa tagcatagtg gttccaaaat ccaaatatag cacaaccaag ccgctgagcg 1201 ccgtgctcca tcgaactcgc tccgaacttg gcatgacgcc caaagctaag aaaagggctc 1261 gcgagctatt ggagctggag gagaccacgc tggagctctc gtctatacac attcaggaca 1321 gcagtctgag tctgttgggt ttccatagcg atagcgataa ggataggcat ttaatgcctc 1381 ccccaacagg gcaagagcca aggcaagcca gcagccagaa ctctacgcta atgggcattg 1441 tcgaagagac cgagcccaag gaatcgtcaa aggtgcaaca gtcaatggtt gcccccacgg 1501 tgcccacaac tgtacgctgc cagctgttca acaccaccat cagtcccatc agtctacggg 1561 catccagctc tcagcgagaa cttagcggca tccagccaat ggaggagaca gtagtggctt 1621 cgccacagca gccatgcctt cgtcgttccg tgcgtctagc gagtagcatg cgttcgcaga 1681 actatggagc cattcccaag gttatgaatt tgcggcgcag cacgcggctg gcgggaatcc 1741 gggaacatgc cacctccgtt gttgtgaaaa acgagacgga tgcgataccg caccttcgaa 1801 gtacagtgca aaaaaaacgt acgcgaaacg tgaaacctgc gcccaaggcc tggatggcca 1861 ataatacaaa atgttttctg gacctgctta acaatggaaa cgttaagcaa ttgcaggaga 1921 ttccagggat cggtccaaag tccgccttta gtttggcctt gcacagatcc cgcctgggtt 1981 gcttcgagaa tctttttcaa gtcaaatccc tgcccatttg gtcgggaaat aaatgggaac 2041 gattttgtca aattaactgt ctcgacactt gatacaatta ctaattaaat agcattttaa 2101 ttcgaatata gtatagtgat tgttatttat gtggcatata ctttgatttt acaactatag 2161 taggagtaaa aaaag // LOCUS HAMCADCA 3902 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Hamster carbamoyl-phosphate synthetase mRNA, partial cds. ACCESSION J05503 KEYWORDS carbamoyl-phosphate synthetase. SOURCE Hamster cell line 165-28, cDNA to mRNA, clone pCAD142. ORGANISM Mesocricetus auratus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Cricetidae; Cricetinae; Cricetini. REFERENCE 1 (bases 1 to 3902) AUTHORS Simmer,J.P., Kelly,R.E., Rinker,A.G.Jr., Scully,J.L. and Evans,D.R. TITLE Mammalian carbamyl phosphate synthetase (CPS): cDNA sequence and evolution of the CPS domain of the Syrian hamster multifunctional protein CAD JOURNAL J. Biol. Chem. 265, 10395-10402 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 3902 carbamoyl-phosphate synthetase (E.C.6.3.5.5; AA at 3) BASE COUNT 829 a 1056 c 1135 g 882 t ORIGIN 1 tcaggcccct ggcaccagag gtttctatta agaccccacg ggtattcaat gcagggggtg 61 cccctcggat ctgtgccttg gactgcggcc tcaagtataa tcagatcaga tgtctctgcc 121 agcttggggc tgaggttact gtggtgccct ggaaccacga attagacagt cagaagtatg 181 atggcctttt tctgagtaat ggacctggcg atcctgcctc ttatcctggt gtggtagcca 241 cactgaaccg cgtcttgtct gagcccaatc cccgacctgt gtttggaatc tgccttggac 301 accagctgtt ggctttagcc attggggcca aaacttacaa aatgaggtat ggaaaccgag 361 gccacaacca gccctgttta ctggtgggca ccgggcgctg ctttctgacg tctcagaatc 421 acgggtttgc cgtggatgca gactcgctgc cagcaggctg gactccgctc ttcaccaatg 481 ccaacgactg ttccaacgaa ggcattgtac atgacagcct gccctttttc agtgtccagt 541 ttcacccaga gcaccgagct ggcccttcag atatggaact gctttttgat gtatttctgg 601 agactgtgag agaggctgta gctgggaacc ccgggggcca gacagttaaa gagcggttgg 661 tgcagcgcct ctgtccccct gggcttctca ttcctggttc tgggcttcca ccaccacgga 721 aggttctgat cctaggctct gggggcctct ccattggcca ggctggagaa tttgactact 781 caggctctca ggccattaaa gccctgaagg aggagaacat ccagacgctg ctgatcaacc 841 ccaacattgc tacagtgcag acctcgcagg ggctggcaga caaggtctac ttccttccca 901 ttacacctca ctacgtaacc caggtgattc ggaatgaacg cccagatggt gtgttactga 961 cttttggggg ccaaacagcc cttaactgcg gtgtagaact gaccaaagcc ggagtgctag 1021 ctcggtatgg ggttcgggtc ttgggtacac ctgtggagac cattgaactg actgaggacc 1081 gacgagcctt cgcggccagg atggctgaga tcggagagca tgtagccccc agcgaagcgg 1141 caaattctct tgaacaggct caggcagctg ctgagcgact gggctaccct gtgctggtgc 1201 gtgcagcctt tgccctgggt ggtcttggtt ctggctttgc ttccaccaaa gaggaactct 1261 cagctcttgt ggctccagct ttcgcccata ccagccaggt gctgatagac aagtctctga 1321 agggctggaa ggagattgaa tatgaggtgg tgagagacgc ctatggcaac tgtgtgacgg 1381 tatgtaacat ggagaactta gacccactgg gcatccacac tggtgagtcc atagtggtgg 1441 cgcccagcca gacgctgaat gacagagagt accaacttct gcgacggaca gctatcaaag 1501 tcacccagca cctggggatc gtcggggagt gcaacgtgca gtatgccttg aacccggagt 1561 ctgagcagta ttacatcatt gaagtaaatg ccaggctgtc tcgaagctct gccctggcca 1621 gtaaggccac aggctatcct ctagcctatg tggcagccaa gctggcgttg ggcattcccc 1681 tgccggagct caggaactct gtcactgggg gaacagcagc ctttgagcct agcctggact 1741 actgtgtggt aaagattcct cgatgggacc tcagcaagtt cttgcgtgtc agtacgaaga 1801 ttgggagctg tatgaagagt gttggtgaag tcatgggcat tggacgctca tttgaagagg 1861 ccttccaaaa ggccctgcgc atggtggatg agaactgtgt gggcttcgac catacagtga 1921 agccagtcag tgatgtggag ttggagacac caacagataa gcggatcttt gtggtggctg 1981 ctgctctgtg ggctggctac tcggtggagc gcctgtatga gctcacacgc atcgactgct 2041 ggttcctgca tcgaatgaag cgtatcgtga cccacgccca gttgctggaa caacaccgag 2101 gacagccgtt gtctcaagac ctgctgcacc aggccaagtg cctcggcttc tcagacaaac 2161 aaattgccct tgcagtcctg agcacagagc tggcggttcg aaagctacgt caggaactgg 2221 gaatctgccc tgcagtgaaa cagattgaca cagttgcggc tgagtggcca gcacagacca 2281 attacctgta cctgacatac tggggcaaca cccatgacct cgactttcga actcctcacg 2341 tcctggtcct tggctctggt gtctaccgca tcggctccag tgttgagttt gactggtgtg 2401 ccgtcggctg catccagcag ctccggaaga tgggttataa gaccatcatg gtgaactaca 2461 acccagagac agtcagcaca gactatgaca tgtgcgaccg actctacttt gatgagatct 2521 cctttgaggt ggtgatggac atctatgagc tggagaaccc cgacggcgtg atcctgtcca 2581 tgggtggaca gctgcccaac aacatggcca tggctctgca tcggcagcag tgccgagtgc 2641 tgggcacctc cccggaagcg atcgattcag ctgagaaccg gttcaagttc tcccggcttc 2701 tagataccat cggcatcagc cagcctcagt ggcgtgaact cagtgacctc gagtctgctc 2761 gccagttctg ccagactgtg gggtacccct gtgtggtgcg cccctcctat gtgctcagcg 2821 gtgccgctat gaatgtggcc tacactgatg gggacctgga gcgcttcctg agcagtgcgg 2881 ccgctgtctc caaggagcac cccgtggtca tctccaaatt catccaggaa gcaaaggaga 2941 ttgatgtgga cgctgtggcc tgcgatggcg tcgtgtcagc cattgccatc tccgagcacg 3001 tggagaatgc aggtgtgcat tcaggggatg ctacgctggt caccccccca caagacatca 3061 cccccaaaac tctggagcgg atcaaagcca ttgtgcatgc cgtggggcag gaactacagg 3121 tcacagggcc cttcaatctg cagctcattg ccaaggatga ccagctgaaa gttattgagt 3181 gcaatgtgcg tgtctctcgc tccttcccct tcgtgtctaa gacgctgggt gttgacctag 3241 tggccttggc cacgaggatc atcatgggag agaaggtaga acccatcgga ctcatgacgg 3301 gctctggagt cgtgggagta aaggtgcctc agttctcctt ctcgcgcttg gcgggtgctg 3361 atgtggtgct gggcgtggag atgaccagta ctggagaagt agctggcttt ggagagagcc 3421 gttgtgaggc ctacctcaaa gccatgctta gcactggctt taagatcccc aagaagaaca 3481 tcctgctgac catcggcagc tacaagaaca aaagtgagct gctcccgact gtgcggttgc 3541 tggagagcct gggctatagc ctctacgcca gcctgggtac ggcggacttc tacactgagc 3601 acggggtcaa ggtgacagct gtggactggc actttgaaga ggctgtggat ggcgagtgcc 3661 cgccacagcg gagcatcttg gatcagctgg ctgagaatca ctttgagtta gtgattaacc 3721 tgtcaatgcg tggggccggg ggtcgacggc tttcctcctt cgtcaccaag ggctaccgca 3781 cgcggcgcct ggctgctgac ttctctgtgc ctctcatcat cgacatcaag tgcaccaaac 3841 tcttcgtgga ggccctgggt cagattggcc ccgccccgcc tttgaaggtt catgtagact 3901 gc // LOCUS LEIKPDNP 376 bp ds-DNA ORG 15-AUG-1990 DEFINITION L.aethiopica kinetoplast DNA. ACCESSION M36194 KEYWORDS . SOURCE Kinetoplast L.aethiopica (strain 1467/85) promastigote, clone R3,. ORGANISM Kinetoplast Leishmania aethiopica Eukaryota; Animalia; Protozoa; Sarcomastigophora; Mastigophora; Kinetoplastida; Trypanosomatina; Trypanosomatidae. REFERENCE 1 (bases 1 to 376) AUTHORS Laskay,T., Kiessing,R., Rinke de Wit,T.F. and Wirth,D.F. TITLE Generation of species-specific DNA probes for Leishmania aethiopica JOURNAL Unpublished (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by T.F.Rinke de Wit, 06-JUL-1990. Author address: T.F.Rinke de Wit Leiden University Hospital Rijnsbugerweg 10 2300 RC Leiden THE NETHERLANDS email:WBLGIPHAR@HLERUL52.BITNET BASE COUNT 113 a 99 c 70 g 94 t ORIGIN 1 ctctaatagc ccaggaccta tcgtcgccac tctccgaact atagaaagac ccgcgctgta 61 ggcacaatag gaccaactgt actacctgca gtggctagac cactactggc aaatcaatag 121 aactattacc tttaactata agtgatttaa ctttaaccta taatagaaca ttattcgtcg 181 ctcattcccg ggccccacgt agcctttccc atgaagttcg tataccgact ctacggttca 241 agtttatata ccggttcact ccgttgcacc atggtgacct tacgtcacta gatacaattg 301 atattaataa ttaaatacag ccaagatagg cggcatgtgc cacagagtag cggcaggaag 361 ccagccaatg agcata // LOCUS LMIB19KP 938 bp ss-mRNA INV 15-AUG-1990 DEFINITION L.migratoria basic 19kD hemolymph protein mRNA, complete cds. ACCESSION M36206 KEYWORDS basic 19k protein. SOURCE L.migratoria adult female fat body, cDNA to mRNA, clone lambda-LmF2. ORGANISM Locusta migratoria Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Orthoptera; Caelifera; Acrididea; Acridoidea; Acrididae. REFERENCE 1 (bases 1 to 938) AUTHORS Kanost,M.R., Bradfield,J.Y., Cook,K.E., Locke,J., Wells,M.A. and Wyatt,G.R. TITLE Gene structure, cDNA sequence, and developmental regulation of a low molecular weight hemolymph protein from Locusta migratoria JOURNAL Arch. Insect Biochem. Physiol. 8, 203-217 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by M.R.Kanost, 06-JUL-1990. FEATURES from to/span description pept 54 572 basic 19k protein precursor sigp 54 95 basic 19k protein signal peptide matp 108 569 basic 19k protein BASE COUNT 231 a 267 c 222 g 218 t ORIGIN 1 agctctgctg tctcctgtcc actccacacc acaggctcag taccaggatc aggatgaagc 61 tggtggtggc tgcagttctc gcgatggccg cgtcgcggtg gcggcgcctg tcggcccacg 121 gccaggtgcc gtccagcacg tgcgccgaca tgctgcccgt gcacggcaac gcaatgccca 181 gcacagccct gccctacacc atcaccgtgt cgcccacctc cgtcaacggc ggcgacaccg 241 tcagagtgca catctcgggc acggaggagt tccgcggcgt ctacctgcag cgaggagggg 301 ccaagagcag taggagagtt cctgctgccc gccggagaga acaacaagat cgccctgtcc 361 gactgcccgc cggacacaac aacgccttct catacatttc gcgcacaccc ctggacacac 421 tggacatcga ctggaaggca ccatacacca gcgatgaaat cgttttcagg gctactttcg 481 tcaagagctt ctccgagttc tgggtcggcg ttgagtcacc gaagatcaca ttgggaccgc 541 tacgtcaact tgacaacgca gttgctgctt agtgactgaa gtcgccatat tcatatacga 601 gcacatccag tactgatgtc ctagtttatc acaacatcgc cgcaccacca ctttcacgtt 661 ctctactact aaaatggtag ataaatcgct tattacagct gttagctgca tataagagaa 721 gcgtttcaaa acgagaaact ctttttgatt ttgtactgag ggaattcaag taaagatttg 781 acaggcagac gtcaccatct tgttcaagac ttggcatcca gtttgcctgt ctgctgtgtg 841 tttgtagatg ctcacacttc ttgtgatatt tactaccaca aattttgtac tcaagacttg 901 aagaattgaa atatattctc taattaatat aaaaaaaa // LOCUS MUSALDAA 8190 bp ds-DNA ROD 15-AUG-1990 DEFINITION Mouse aldolase A gene, complete cds. ACCESSION J05517 KEYWORDS aldolase A. SOURCE Mouse (strain RIII S/J and Blue Spruce (outbred Swiss Webster)) adult DNA, clone lambda 16. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 8190) AUTHORS Stauffer,J.K., Colbert,M.C. and Ciejek-Baez,E. TITLE Nonconservative utilization of aldolase A alternative promoters JOURNAL J. Biol. Chem. 265, 11773-11782 (1990) STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.K.Stauffer, 15-JUN-1990. FEATURES from to/span description pept 4301 4412 aldolase A, exon 4 (E.C. 4.1.2.13) (first expressed exon) 4493 4704 aldolase A, exon 5 5125 5179 aldolase A, exon 6 5278 5438 aldolase A, exon 7 5727 5810 aldolase A, exon 8 5908 6082 aldolase A, exon 9 6222 6421 aldolase A, exon 10 6519 6614 aldolase A, exon 11 pre-msg 1700 > 6614 aldolase A mRNA and introns (alt.) pre-msg 1740 > 6614 aldolase A mRNA and introns (alt.) IVS 1804 1951 aldolase A intron A (put.) IVS 2023 4278 aldolase A intron B (put.) IVS 2205 4278 aldolase A intron C (alt.) IVS 3256 4278 aldolase A intron C (alt.) IVS 4413 4492 aldolase A intron D IVS 4705 5124 aldolase A intron E IVS 5180 5277 aldolase A intron F IVS 5439 5726 aldolase A intron G IVS 5811 5907 aldolase A intron H IVS 6083 6221 aldolase A intron I (no splice consensus) IVS 6422 6518 aldolase A intron J signal 3032 3036 CAAT box signal 1673 1676 TATA box signal 2235 2240 TATA box signal 3089 3094 TATA box signal 3132 3137 TATA box BASE COUNT 1676 a 1884 c 2061 g 1876 t 693 others ORIGIN 1 gatccttgct ttttgaagcc ttagaatgaa gccagcattc ctggccttgg gagggcaggc 61 acgggagact ccaaggcctg gggaaagcaa ctctagtcca aaccagtttc tcttgctggt 121 tgtagtcttt tgggcaaacc actgagtttc tatctcatta ttttgtgatg agccccccac 181 gagtgtgacc cccattcaag gtggctcaga agcagagtgc ttgccttgtg tttgtgacat 241 cccaagttca attcatcact gaggaaaccc ctccctttaa gatttatctt atctctgaac 301 gttttcccga ttgtatgact cgtatgtatc tgaggaagtc agaagaaatg tcagatcccc 361 caggatcttg ggatctggag tcgtgatggc tgtgagtcac tgtatatatg tgctggagct 421 gaactcaggt cctctggaac agccattgct cttaaccact gagccatggt ccggacacct 481 ggcttagaca gggtcccttt ctgtcagtgg ttctcaacct gtgggttatg gccctttgtg 541 ggggtggagg tgggtattaa cttatacagg gctgacctaa ggttataaaa acccagatat 601 ttatgattca taacagcaaa attacaggtg taaagtagca acaaaaattc ttttttggtt 661 gggagtacca caacatgggg aactgtatta aaaggtagca ttaggaaggt tgggcaccac 721 tgctctcgta gccctggcta tcctagaact caaatagtag atcaggctgg tccaaactga 781 cagagatcta tctctgccag cgtcagcact aggaagtgag taaattccat gatagccagg 841 ccatacagtg aaaccctgtc tcaaaacagg acaagaggaa ccccagtact tagtaggttg 901 aagtaaggat tgtcattttt tttgaggcca gcttgggttt catggctctt gactagtctg 961 agctgtagag ggagagcctg tctcacgagg aagcttagga gggagatatt atagtttggt 1021 ttatgccagc aagaaagtcc aaagtcccag aaattatctt catgaggatt gaaacatgtt 1081 ttctggtcct gacttcctct aggttgcata gggctttgag agtatagtat acctactatg 1141 tgcgcataca cacacgcgcg cgcgcgcgca cgcgacacac acaggaccca gtgggacaga 1201 tactttatca ctgctgctgt tcagcatgga gggagcttct ttccagtgct ttgtctctcc 1261 gtccactggg cctggtgggt gggtgctcct cagccctctg cttacccacc tctctcttct 1321 cctttagggt tgggcccctc gatgccctgg cctgctgccc actgtgtgac tgtgcctgtg 1381 cctgccagct cccagactgc cagagcctca actgcctctg tttcgagatc aagctcagat 1441 gaaagatggg gctggggacg ttgttctttg gggagtggcc agtccccagg gccccctcta 1501 tgatcctcag gacatcatta tactggagct atggatggca ggcccagcct aattacctgg 1561 gttccttgag ttctctgaaa ggcaggattc tgagagccct tggaccgctg aaaagggcct 1621 gatgctctgg ccagtgcccc tgcctttctt cctctccctt ccctgataaa ctattgtatg 1681 tgaggtagga tcgagacatt gctcacccag gcaacagtgt gggaggtttc tgccaacctg 1741 gactatcagg ataaagggat ggccagccac accctgcctt tagactcctg gttattttaa 1801 gaggtgagta tcctgcctga ctctgctctc ctttggaaaa aaaaaaaaag ttcaaccacc 1861 agcaggcacc agagtcaagg gaggagggaa ccagaggagg gcagtgggag gcaatatcta 1921 gatgttttcc cttcttgttc tgccttaaca gatcctggac ctgagactga tttcttgact 1981 aatttcactg tatttccaag gaagaggttc ctctaaagac cggtgagtga gcagtggcac 2041 ctcctcctct caaggcaaac caaagctgcc tcttcttcac cccccacgca gggatgaatg 2101 tcaggagcct caggtttccc taaatatagg tcccggccgc gggattcgtg gtggggaaag 2161 ggcaggggtt accgagaagg tctgggacac tggtgcgggg gtgtgtaggg gaggggtggg 2221 gagtaggagc tgccttaaaa cccagccctg gactgccggg ctcactctct gctgaccggg 2281 ctctgcggct tctgtcactg cgccacaggt gggccgctat ccggattgca ggatgggaat 2341 gggggttgcg gattgggacc tgaggaaact gactgctctg agagttacag ggtgacaaga 2401 gagctccgag acggattttt ttattttgga gaaggaaatc aggttcggga aagacctgtc 2461 tggcttgggc cagtccttgt cggtcatttc ctcaaactgg gtgtgtttag ctcgcgggtg 2521 gtgcctcccg ccaatctgct aggcaacgcc aggcctggat acgccactca gttccgatgt 2581 ggccggcaca ctagttctgg gaggttttgc ctgcgtacca tgtcactcgc cgtgctctgg 2641 ccagggagag atggaatgng ccctgcattt tagtcaagcg acgaagcagg caggcaggga 2701 ggctccgaag ctctgcgttc ttagcagtga cgtcaggctg caactacaca gccggaagcc 2761 tgggtcttgg aggagaggcc agccaccatc tcactctgac cccctcccta ctcttcgcca 2821 acccacattc cggctgagtc acatgttccg cgcgcgccag gcaggggttg gggggggggg 2881 tgttgggggg ggggggtggt gacctgcggg atgtggctcg agtcacgtcc tagcggggcg 2941 gaggagggat cgtgttctag ccgcttgtct cctccccagt gccgcctcct atcggagcat 3001 cttggggcgg tctgcgcaca gtgcccacct tcaattgacg gttcccgtcc ctgcaaggga 3061 aaaaacctgc agagggcgga gcggcgcctt taaatgtccg gggccccgcc tccggtcccc 3121 cccaacccag ctgaataggc tgggttctct tggaacgcgc agcagaacca ggttctggtg 3181 accctagccg ttcgctcctt agtcctttcg cctacccacc ggcgtaccag gcagacccac 3241 cccgtcctgt gccaggtgag cgccatttac acgtgctcgg ggaagggtct atggggttag 3301 gatcttgggc cggtggcggg cagtgcagag ccgtcttccc cacggcccct cacttctcct 3361 ttttctaccc ccacgcttgc ccccagcccn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3481 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3601 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3661 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3781 nnnnnnnnnc tggttctctc ttaactctcg cctttgggtt gctatgtggc tttgagcaca 3841 gatcatttct ttcttgggct ctttcagatg agggtattag gctcctgccc tattcgtgat 3901 ccttaaattc taaaatatcc cggttcaatt ttgtttctag gcaaggtgac ccatggcaac 3961 gcgcaggcca gatgggtcag cttcaacatg accgctgtcc tggctctggc ttcttcttcc 4021 ccagttggcc agtgagcgaa cccactctga gctgggcaac acccagcaac agacagagtt 4081 aggaaaggta caggaagagg caggtctagt atagggaagt cgggagtagg ggagagctct 4141 gggacaggaa gtatcccagg accctcaggg agtggggcag gggaggtggg ggctagtgcc 4201 ctggcctcca ggaagctttg taccggggag accatgggat ggtccaacta agcgctggtc 4261 tctgcctccc tcacccagga aagcaactgc caccggcacc atgccccacc catacccagc 4321 actgaccccg gagcagaaga aggagctgtc tgacatcgct caccgcattg tggctccggg 4381 caagggcatc ctggctgcag atgagtccac cggtgcggta caggagaaga aagggaggag 4441 gacccaggtt ggagctagca ggctgatccc ttatctccat catgactttt aggaagcatt 4501 gccaagcgcc tgcagtccat tggcaccgag aacaccgagg agaacaggcg cttctaccgc 4561 cagctgctgc tgactgcaga cgaccgtgtg aatccctgca ttgggggggt gatcctcttc 4621 cacgagacac tgtaccagaa ggcagatgat ggacgtccct tcccccaagt tatcaagtcc 4681 aagggtggtg ttgtgggcat taaggtaaga gggcagactc tggggggggg gtaagattag 4741 aggaggatct cggagaaagg gattaatagg tagggagggg gtaatatggc tagcaggcct 4801 agagactcag gtggatgtat cagcataatt ttttttcagt gtttggggtg aacttaggtc 4861 cttgtgcatg tcggcaagcg cgctgttgcc aacttaatgg ttccctgtga tacaagaagg 4921 tgatttcatg gtgaagaagt gaaaaggttt tctcagtgtg cagtagcacc aggtccctct 4981 agtccagtta acattctctc aaatatacac atcttttctc ataaatatgt gcaagccatg 5041 agaggctaca gtgaaaggtg aagtttgggc ctgggtagag gagacagggg ccataaagct 5101 gactgctggt ctcctccctg gcaggtagat aagggtgtgg tgcccctggc aggaaccaat 5161 ggcgagacaa ctacccaggg taagaatgat ctgcctgcct ccttcccttc tccaccagct 5221 catcagagtt ccagagtgag tctgatcaaa agccttctct ttattcttcc ccttcagggc 5281 tggatgggct gtctgaacgc tgtgcccagt ataagaagga tggagccgac tttgccaagt 5341 ggcgctgtgt gctaaagatt ggggaacata ctccctcggc cctggccatc atggaaaatg 5401 ccaatgttct ggcccgttat gccagcatct gccagcaggt gggattggac tacttcctaa 5461 cacattgatg cagcgcgggc tagctttctg tctatctgcc aggatatctg cctcctcaga 5521 gcagctgctc tcaatacccg ctgtggccag gtcttgagtg gaggtctgca atgtagaggt 5581 ggcaacaggt gtacaggcag attgatagga ttgcttgtcc cctgtaaact gctgaggcct 5641 ttgaagcctg ggtctctgtc atcaagttaa tggtgaggag gctcctagtc aggaggcctt 5701 gcctcattac cctgtccctc ccacagaatg gcattgtacc cattgtggag cctgaaattc 5761 tccctgatgg ggaccatgac ttgaagcgct gccagtatgt tactgagaag gtagtgccat 5821 ctgctgtaga tagtgtgtgc tgcgcgtagt atcgtttcac ttctcgtctg cnnnnnnnnn 5881 nnnnnnnnnn nccctgctgt cttccaggtc ctggcggctg tctacaaggc tctgagcgac 5941 caccatgtct atctggaagg cacattgctg aagcccaaca tggtcacccc tggccatgct 6001 tgcacccaga aattttccaa tgaggagatt gccatggcaa cggtcacagc acttcgtcgc 6061 acagtgcccc ctgctgtcac tggtgaggcc actcctcatc ttggtggtga ggtggatgca 6121 ccatcacatt tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6181 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6241 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6301 nnngccatgg gccttgactt tctcctatgg tcgagccctg caggcctctg ctctaaaggc 6361 ctggggtggg aagaaggaga acctgaaggc agcccaggag gagtacatca agcgcgccct 6421 ggtaaggcag gcaggcaggc gtggaagtgt gaacaggtgc ctgggcgggg tggggaggga 6481 ctcaagaaga gaattcctct gattcctctt ccttttaggc caacagcctc gcttgtcaag 6541 gaaagtatac cccaagtggc cagtctggag ccgcagccag tgaatctctc ttcatctcta 6601 accatgccta ctaaccagag ctgaactaag gctgctccat caacactcca ggcccctgcc 6661 tacccacttg ctattgaaga ggggtcttca ggctctttcc catcactctt gctgccctcg 6721 tgtgcggtgt tgtctgtgaa tgctaaatct gccatccctt ccagcccact gccaataaac 6781 aactatttaa gggggagtct gttgttcatg tcttgtaggg tataggggag ggctgaggaa 6841 agagctactt gggttcttct tcttggacag taaaaggaag gggttttttg accagagctt 6901 tgagaaaggc atagtattat gggatgttct ttgcctacat ctaattgaag gtaactttta 6961 cactaattaa tattcagttt aagccaacca agggcttatg aatacttggc aaggattgta 7021 tcagggctaa cacatttatg cgttttgggg actatggagc tttggagacg agatctctct 7081 gcagtgacat aggtatacag ctcactgcag aactcttggg ttccaggttg agaatggagc 7141 ctcagagctg ctgatgttcc ctggtgatag aataagaagc acatcaaacc atgggccact 7201 gtatcttgcc acattatatt gagtgtagtc ggtgtgctag tgcacacttt aatccagcac 7261 tcaggaggca gaggcaggca ggaggcaact ggaactcaca aagtgagttc caggacagcc 7321 agggctatac agagaaaccc tgtcttgaaa aaaaaaaatt ctggcctaaa tgaatggata 7381 cagtgtatct gcctttggag gccaaaaggc gtgtatcaag tgctagcttc tggcaagata 7441 agaaacctta aggagtaggg cttcgactat actcagtagc agagtcttgc atggtactca 7501 tggttgtgag cacatgtggt gctaactgct gagtctctct cagtccatca tactctagta 7561 tatagtcaga gactctagat actgacgact agactagact cgtcgtctnn nnnnnnnnnn 7621 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7681 nnnnnnnnnn nnnnnnnnnn nnnnnnnntt ccttcccaag catctttttc tttgacactt 7741 tcgttttcag tgatctgcgt agaattgtct tactaggagt atcaaagcat agtctccact 7801 gtcctaatat tcccatgtat tggccaatag tcaaagctat gcgcaggctg tggatagagc 7861 ccagtggctg agtacccaaa gctctggttc cttccccagt gctgcaaggg aaaactcaaa 7921 tccctatgct tccccaaact tcagcctccc attttactgc tcatcacgta cttgtagcct 7981 tgctctctag aattctgtag cccacactgg ccttgaactc tcaagatctg ctttccaagt 8041 actgggatga aaggcatgtg ctattctcct agcttctatg aggcgatcct ttttatttta 8101 tatacattgg tattaactga atgtgtgtat gtgtgtagtg tgatccggta cgagctcgag 8161 cgtatagtga gtcgatacat catgcgcgct // LOCUS MUSCR2AA 2102 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse complement receptor (Cr2) gene, 5' end. ACCESSION M36470 KEYWORDS Cr2 gene; complement receptor. SOURCE Mouse (strain Balb/c) spleen, cDNA to mRNA, clone 31-1. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2102) AUTHORS Kurtz,C.B., O'Toole,E., Christensen,S.M. and Weis,J.H. TITLE The murine complement receptor gene family. IV. Alternative splicing of Cr2 gene transcripts predicts two distinct gene products that share homologous domains with both human Cr2 and Cr1 JOURNAL J. Immunol. 144, 3581-3591 (1990) STANDARD simple staff_review FEATURES from to/span description pept 67 > 2102 complement receptor (Cr2) BASE COUNT 590 a 472 c 452 g 588 t ORIGIN 1 ctcttcctct ccttgctaca ggctcacaac tcacagagcg caacctgcca ttggactgct 61 gcacacatgg gatccttggg ttcgctctgg gttttcttca ctctcatcac tccaggagtt 121 cttggtcagt gtaagttgct gccaaagtat tcttttgcta aaccttctat tgtgagtgat 181 aaatctgagt ttgccattgg aacaacttgg gaatacaaat gtcgccctgg gtattttagg 241 aagtcattta ttatcacctg cttagaaacc tccaagtggt cagatgctca gcagttctgt 301 aaacgtaaac catgtatgaa tcctcaagaa cccctccatg gttctgtgca tataaacacg 361 ggtatcgagt ttgggtcaac aattacgtat tcttgtaatc aaggatatcg actcattggt 421 gactcgtctg ctacatgtat tgtatcagac aatactgtaa tgtgggataa tgatatgcct 481 ctttgtgaat ctattccttg tgagtcacct ccagccatct ccaatggaga cttctacagc 541 agcagcagag acagcttttt ctatgggatg gtagtaactt attattgcca taccggaaag 601 aatagggaaa aactgtttga tctggtgggt gagaagtcaa tatattgtac cagcaaagac 661 aatcaagttg gcatctggaa tagtccacct cctcagtgta ttcctagagt caagtgccca 721 atgccagaaa ttgaaaatgg actagtggag tctggattta aacactcctt cttcttaaat 781 gatacagtaa tatttaagtg caaatctggc tttaccatga aaggcagcag aatagcatgg 841 tgccagccaa acagcaaatg gagccctcca ttgccaacat gcttcatggg atgtctacca 901 cctcaaaata tcctccatgg tgattataac aaaaaggatg agttcttctc tgttggccag 961 aaagtgtcat atacgtgtaa ccctggctat actctcattg gaactaacct cgtggagtgt 1021 acatccttgg gaacctggag caatacagtc ccgacatgtg aagtgaaatc atgtgatgca 1081 attccaaacc atcttctcca tggccgtgtg tttcttcccc ctaatctcca gcttggggca 1141 gaggtttcct ttgtttgtga cttagggttc cagttaaaag gcaaaccttc tagtcagtgt 1201 atcccagaag gagagacagt aatctggaat aataagtttc ctgtctgtga acagatttct 1261 tgtgaccctc ctcctgaagt caaaaatgct cggaaaccct attattctct tcccatagtt 1321 cctggaactg ttctgaggta cacttgttca cctagctacc gcctcattgg agaaaaggct 1381 atcttttgta taagtgaaaa tcaagtgcat gccacctggg ataaagctcc tcctatatgt 1441 gaatctgtga ataaaaccat ttcttgctca gatcccatag taccaggggg attcatgaat 1501 aaaggatcta aggcaccatt cagacatggt gattctgtga catttacctg taaagccaac 1561 ttcaccatga aaggaagcaa aactgtctgg tgccaggcaa atgaaatgtg gggaccaaca 1621 gctctgccag tctgtgagag tgatttccct ctggagtgcc catcacttcc aacgattcat 1681 aatggacacc acacaggaca gcatgttgac cagtttgttg cggggttgtc tgtgacatac 1741 agttgtgaac ctggctattt gctcactgga aaaaagacaa ttaagtgctt atcttcagga 1801 gactgggatg gtgtcatccc gacatgcaaa gaggcccagt gtgaacatcc aggaaagttt 1861 cccaatgggc aggtaaagga acctctgagc cttcaggttg gcacaactgt gtacttctcc 1921 tgtaatgaag ggtaccaatt acaaggacaa ccctctagtc agtgtgtaat tgttgaacag 1981 aaagccatct ggactaagaa gccagtatgt aaagaaattc tctgcccacc acctccacct 2041 gttcgtaatg gaagtcatac aggcagcttt tcagaaaatg taccatatgg aagcacagtt 2101 ac // LOCUS NEUALCA 1639 bp ds-DNA PLN 15-AUG-1990 DEFINITION N.crassa allantoicase (alc) gene, complete cds. ACCESSION J02927 KEYWORDS allantoicase. SOURCE N.crassa (strain Oak Ridge), clone pALC-1. ORGANISM Neurospora crassa Eukaryota; Plantae; Thallobionta; Eumycota; Ascomycotina; Pyrenomycetes; Sordariales; Sordariaceae. REFERENCE 1 (bases 1 to 1639) AUTHORS Lee,H., Fu,Y.-H. and Marzluf,G.A. TITLE Nucleotide sequence and DNA recognition elements of alc, the structural gene which encodes allantoicase, a pirine catabolic enzyme of Neurospora crassa JOURNAL Biochemistry (1990) In press STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.A.Marzluf, 12-JUL-1990. FEATURES from to/span description pept 250 340 allantoicase (alc), exon 1 413 1386 allantoicase (alc), exon 2 IVS 341 412 alc intron A signal 129 135 TATA box BASE COUNT 383 a 441 c 443 g 372 t ORIGIN 1 cgttgcagat cgaatacgac ggttaggtac gacgaagaag gaccacgatt gtcgttgctg 61 ttacgtactt tgacctcctc aacgcactat cttgcttaag ctatcgctct tgtctgtcgc 121 tgtggtgata taaattctgc gcctgctctt ggtttattcc gaggacgctc gttccatctc 181 tgtttttttt ttctctctgt gacatcgagg actgaagtct cacttattca aatacacatt 241 tccctcacca tgaccgacat cgattacaag ctcgaggctg ttccggccac tcggattgcc 301 gccgatgata tcgacaagac tttccgttcc agcaccatcg gtccgtagca tccatctcac 361 caaacatggc aacccaaacc tttcaactaa cggaagtcga gctgggatac agatcttatc 421 tcaggggctc tcggtggcaa ggtttccggt ttctcggacg aatggttcgc cgaagcagcc 481 aacctcctca ctcctacagc cccaatccgc cagccgggaa agatggttta caccggcgcc 541 tggtatgacg gatgggagac aaggagacac aaccctgccg agttcgactg ggttgtgatc 601 cgtctgggcg tcgcctcggg taccgtcgag ggtgtcgaga ttgacacggc tttcttcaac 661 ggcaaccatg cgcccgccat ctcggtcgag ggttgcttca gccaaaacga cgatgaggtt 721 ctgtcatgga agggcgagct gggtggatgg gagactattc ttggcgttca agagtgcggc 781 ccttcgcaga gattctgctg gaaactcgag aaccctacca agaagcagta cacccatgtg 841 cgactaaaca tgtaccccga cggcggcatt gccaggttcc gtctgtttgg acacgccgta 901 ccggtcttcc ccgacaatac ggatgccatc tttgacttgg cggctgccca gaacggcgga 961 gttgcgatct cctgcagtga ccagcacttt ggtaccaagg acaaccttat ccttccgggc 1021 cgcggcaagg acatgggcga cggttgggag acagcacgct cgcgcaccaa gggccacgtc 1081 gactggacca tcatcagact cggcgcgccc ggctacattc agaatttcat ggtcgacacg 1141 gctcacttcc gcggtaacta cccccagcag gtcaagctgc aacgtatcga gtggaagagc 1201 gagggcaggc cgggagcgga ttctgagggc tggacagagg ttgttgagcc catcaagtgc 1261 ggtcccgatc aggaacaccc tgtcgagagc ttggtgaagg acaagccgtt cacccacgtc 1321 aagctcatca ttgtgcctga cggcggagtg aaaagactgc gggtgtttgc gaagagggct 1381 gtttaagaaa ttaccaagct atatatctga aggcaattat tcggtgagag cagcatttac 1441 ggggagccat caacagcgag cgatccacat aaaaaggggg aggacctcat ttagtatgat 1501 gggcaacgag tgcagtcatt tagccgcgaa gaatcgaaat ctctcagatc tttgattgtc 1561 tgcgcttaag taacaaagtc taattctcaa tcagctttcg tcgtagagta aaattagaag 1621 gatgcacggc tgcccacga // LOCUS RATINHA 1561 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat inhibin alpha-subunit mRNA, complete cds. ACCESSION M36453 KEYWORDS inhibin. SOURCE Rat female (strain Sprague-Dawley) ovary, cDNA to mRNA, clone rINA-13. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1561) AUTHORS Woodruff,T.K., Meunier,H., Jones,P.B.C., Hsueh,A.J.W. and Mayo,K.E. TITLE Rat inhibin: Molecular cloning of alpha- and beta-subunit complementary deoxyribonucleic acids and expression in the ovary JOURNAL Mol. Endocrinol. 1, 561-568 (1987) STANDARD simple staff_review FEATURES from to/span description pept 256 1356 inhibin alpha-subunit precursor sigp 256 954 inhibin alpha-subunit signal peptide matp 955 1353 inhibin alpha-subunit mRNA < 1 1561 inhibin alpha-subunit mRNA BASE COUNT 308 a 465 c 440 g 348 t ORIGIN 1 ggacactaga atgctgtgtt gttagaggag tggagagagg aagatgtgct aagtgtagca 61 gtacacacct ataatcctag cacttgagag gttgaaggca ggaggatgag acattcaggt 121 cattcttagc tacatgaaga gtttaaggcc agcacggatt acaggatatc tgtttctggg 181 gaaaaaggag gggaagagag agaggaaagg gcaaagggca gagtgtgggc tccctgtcgt 241 cagggcaaga gaactatggt gatccagccg tctctgctgc tccttttgct gttgactcta 301 caggatgtgg acagctgcca ggggccagaa cttgtccggg agcttgtcct ggccaaagtg 361 aaggcactat tcctagatgc cttggggccc ccagcaatgg atggggaagg tgggggtcct 421 ggaataaggc ggctgcctcg aagacatgcc cttgggggct tcatgcacag gacctctgaa 481 ccagaggagg aggatgtctc ccaggccatc cttttcccag ccacaggtgc cacctgtgag 541 gatcaggcag ctgctggagg gcttgcccag gagcctgagg aaggtctctt cacttatgta 601 ttccggccat cccaacacat acgcagccac caggtgactt cagcccagct gtggttccac 661 acggggctcg acaggaagag cacagcagcc tccaatagct ctaggcccct gctagatctt 721 ctggtgctgt catctggggg gcccatggct gtgcctgtgt ccttgggaca gagcccccca 781 cgctgggctg tcctgcacct ggcggcctcc gctttccctc tgttgaccca ccccatcctc 841 gtgttgctgc tgcggtgccc actctgttct tgctcaggcc ggcctgagac cactcctttc 901 ctggtggccc acactagggc tcgagccccc agtgcggggg agagggctcg acgttcagct 961 ccctcgatgc cttggccttg gtctcctgca gccttgcgtt tgctgcagag gcctccagag 1021 gaaccctctg cccatgcctt ctgccatcga gctgccctca acatctcctt ccaggagctg 1081 ggctgggacc gctggatcgt acaccctccc agcttcattt tccactactg ccatggtagc 1141 tgcgggatgc ccacatctga tctgcccctg ccagtccctg gggctccccc taccccggct 1201 cagcccctgt ttttggtgcc aggggccaag ccctgctgtg cagctctacc agggagcatg 1261 aggtccctac gcgtccgaac cacctcagat ggaggctact ctttcaagta tgagatggta 1321 ccgaacctca ttacacaaca ctgtgcttgt atctaaaagc acctcgtctc ctcctccaca 1381 gccactggcc accatcacct caccatccca cggtcggtcg gtcggtcggt cgtcagctag 1441 gaggaaggtg ggtgtggaaa gtagacagtt tccacttcct tttcccttca tctttctgtc 1501 tgaggcttcc acaccccact ccacccaggt cctgtggata acaataaaga aggaagtgtg 1561 t // LOCUS RATINHB 1543 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat inhibin beta-A-subunit mRNA, complete cds. ACCESSION M37482 KEYWORDS inhibin. SOURCE Rat female (strain Sprague-Dawley) granulosa cell, cDNA to mRNA, clone rINB-5. ORGANISM Rattus rattus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1543) AUTHORS Woodruff,T.K., Meunier,H., Jones,P.B.C., Hsueh,A.J.W. and Mayo,K.E. TITLE Rat inhibin: Molecular cloning of alpha- and beta-subunit complementary deoxyribonucleic acids and expression in the ovary JOURNAL Mol. Endocrinol. 1, 561-568 (1987) STANDARD simple staff_review FEATURES from to/span description pept 163 1437 inhibin beta-A-subunit precursor sigp 163 1086 inhibin beta-A-subunit signal peptide matp 1087 1434 inhibin beta-A-subunit mRNA < 1 1543 inhibin beta-A subunit mRNA BASE COUNT 435 a 356 c 454 g 298 t ORIGIN 1 ctctgacctc atgagacaag agccggctgg caaaacagaa gggacccgaa agagaatttg 61 ctgaagagga gaaggaaaaa agtccaaaaa acctgtacgt gaggggtggg gaggaaaagc 121 agggccttta aagaaggcaa ccacacgact tttgctgcca ggatgccctt gctttggctg 181 agaggatttc tgttggcaag ttgctggatt atagtgagga gttcccccac cccaggatcc 241 gaggggcacg gcgcagcccc ggactgcccg tcctgtgcgc tggccaccct tccgaaggat 301 ggacctaact ctcagccaga gatggtagag gctgtcaaga agcacatctt aaacatgctg 361 cacttgaaga agagacccga tgtcacccag ccggtaccca aggcggcgct tctcaacgcg 421 atcagaaagc ttcatgtggg taaagtgggg gaaaacgggt atgtggagat agaggacgac 481 attggcagga gggccgaaat gaatgaactc atggagcaga cctcggagat catcaccttt 541 gccgagtcag gcacagccag gaagacactg cattttgaga tttccaagga aggcagtgac 601 ctgtcagtcg tggagcgtgc agaagtctgg ctcttcctga aagtccccaa ggccaacagg 661 accaggacca aagtcaccat ccgtctgttt cagcagcaga agcatccaca gggcagcttg 721 gacatggggg atgaggccga ggaaatgggc ttgaaggggg agaggagtga actgttgcta 781 tcagagaaag tggtagatgc tcggaagagc acttggcaca tcttcccagt gtctagcagc 841 atccagcgcc tgctggacca ggggaagagt tccctggatg tgcggattgc ttgtgaacag 901 tgccaggaga gcggtgccag cctagtgctc ctgggcaaga agaagaagaa agaggtggat 961 ggagacggga agaagaaaga cggaagtgac ggagggctgg aagaggaaaa agaacagtca 1021 cacagacctt tcctcatgct gcaggctagg cagtctgaag accatcctca ccgcaggcgt 1081 aggcggggct tggagtgtga tggcaaggtc aacatttgct gtaagaaaca gttctttgtc 1141 agcttcaagg atattggctg gaatgactgg atcattgctc cctctggcta tcatgccaac 1201 tattgtgagg gtgagtgccc aagccacata gcaggcacct ctgggtcctc actctccttc 1261 cactcaacag tcattaacca ctaccgcatg aggggtcaca gcccctttgc caaccttaag 1321 tcatgctgtg tgcccaccaa gctgagaccc atgtccatgc tgtattatga tgatggtcaa 1381 aacattatca aaaaggacat tcagaacatg attgtggagg agtgtggctg ctcctagagt 1441 tgccaggtcc cagagcaaat ggatctaggg tgtccaggaa aagacagtgg caaatgaaga 1501 aaaatatata agatttctgc ctaaacaaga caaccagaaa aat // LOCUS RSBMNP 1201 bp ss-RNA VRL 15-AUG-1990 DEFINITION Bovine syncytial virus major nucleocapsid protein (N) mRNA, complete cds. ACCESSION M35076 KEYWORDS major nucleocapsid protein. SOURCE Bovine syncytial virus (strain A51908) MDBK cell, cDNA to mRNA. ORGANISM Bovine syncytial virus Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Spumavirinae. REFERENCE 1 (bases 1 to 1201) AUTHORS Samal,S.K., Zamora,M., McPhillips,T.H. and Mohanty,S.B. TITLE Molecular cloning and sequence analysis of bovine respiratory syncytial virus mRNA encoding the major nucleocapsid protein JOURNAL Unpublished (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by S.K.Samal, 12-JUL-1990. Author address: S.K.Samal Univ Maryland at College Park Dept. Veterinary Medicine College Park, MD 20742 FEATURES from to/span description pept 16 1191 major nucleocapsid protein mRNA 1 1200 major nucleocapsid protein mRNA BASE COUNT 434 a 196 c 270 g 301 t ORIGIN 1 ggggcaaata caaaaatggc tcttagcaag gtcaaactaa atgacacttt caacaaggat 61 caactgttat caaccagcaa atatactatt caacgtagta caggtgacaa cattgatata 121 cccaattatg atgtacaaaa acatctcaat aagttgtgtg gtatgctact aataacagaa 181 gatgccaatc ataaatttac aggattgata ggtatattat atgctatgtc ccgattgggg 241 agagaagata cccttaaaat actcaaagat gcaggctacc aagtaagggc caatggggtt 301 gatgtgataa cacatcgaca ggatgtgaat ggaaaagaaa tgaaatttga agtgctaaca 361 ttagtcagct taacatcaga agttcaaggc aatatagaaa tagagtcaag gaagtcttac 421 aaaaagatgc taaaagagat gggagaggta gccccagaat acagacatga ctctcctgat 481 tgtggtatga tagtgctatg tgttgctgct ttggttataa caaaattagc agcaggtgat 541 agatcaggcc tcactgcagt cattaggaga gccaacaatg tactaaggaa tgaaatgaaa 601 cgatacaaag gacttatccc gaaagatata gctaacagct tctatgaagt gattgaaaag 661 taccctcatt acatagatgt attcgtacat tttggcattg ctcaatcctc aactagagga 721 ggtagtaggg tagaaggaat ctttgcaggg ttattcatga atgcatatgg agcaggtcaa 781 gtgatgttaa gatggggtgt attagccaaa tcagtcaaga acattatgct tggtcatgcc 841 agcgtgcaag cagaaatgga acaggttgta gaggtctatg aatatgcaca aaagttaggt 901 ggagaagctg gtttttatca catattgaac aaccctaaag catcactgtt atccttaaca 961 caattcccca acttctctag tgtagtccta ggcaatgctg caggactagg tataatgggt 1021 gagtatagag gtacaccaag aaaccaagac ttgtatgatg ctgccaaagc atatgcggaa 1081 caattaaaag agaatggggt catcaattac agtgtattag atctgactac agaggaacta 1141 gaggcaatca agaaccaatt gaatcccaaa gacaatgatg tggaactgtg agttaataaa 1201 a // LOCUS URELOCAB 558 bp ds-DNA BCT 15-AUG-1990 DEFINITION U.urealyticum urease locus proteins A and B, complete cds. ACCESSION M36190 KEYWORDS urease locus-encoded protein. SOURCE U.urealyticum (serotype 8) DNA. ORGANISM Ureaplasma urealyticum Prokaryota; Bacteria; Tenericutes; Mollicutes; Mycoplasmas; Mycoplasmatales; Mycoplasmataceae. REFERENCE 1 (bases 1 to 558) AUTHORS Willoughby,J.J., Russell,W.C., Thirkell,D. and Burdon,M.G. TITLE PCR primers that detect Ureaplasma species and a study of the urease locus by 'PCR walking' JOURNAL Unpublished (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.J.Willoughby, 27-JUN-1990. Author address: J.J.Willoughby University of St. Andrews Biochemistry and Microbiology North Street St. Andrews, Fife KY16 9AL SCOTLAND FEATURES from to/span description pept 23 349 urease locus protein A pept 436 522 urease locus protein B BASE COUNT 209 a 78 c 111 g 160 t ORIGIN 1 tttataagga gataatgatt atatgtcagg atcatcaaat caattcactc caggtaaatt 61 agtaccagga gcaattaact tcgctgaagg cgaaaatgtg atgaacgaag gtagagaagc 121 aaaagtaatc agcattaaaa atactggtga ccgtcctatc caagttggat cacatttgca 181 cttatttgaa acaaatagtg cattagtatt ctttgatgaa aaaggaaacg aagacaaaga 241 acgtaaagtt gcttatggac gtcgtttcga tattctcagt actgctattc gttttgaacc 301 aggagacaaa aaagaagttt cagttattga tttagtcgga acacgttgaa gtttgaggtg 361 taaacggctt agttaacggc aaaaccttaa aaaataatct atttacaagt ttctatatag 421 acgaagggga acattatgtt taaaatttca agaaaaaatt actcagatct atatggtatc 481 acaactggtg atagcgttag attaggagac acaaatcttt gagttaaagt tgaaaaagac 541 ttaactactt atggcgaa // LOCUS YSCFUR1A 2123 bp ds-DNA PLN 15-AUG-1990 DEFINITION S.cerevisiae uracil phosphoribosyltransferase (FUR1) gene, complete cds. ACCESSION M36485 KEYWORDS uracil phosphoribosyltransferase. SOURCE S.cerevisiae (strain FL100, ATCC 28383) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 2123) AUTHORS Kern,L., de Montigny,J., Jund,R. and Lacroute,F. TITLE The FUR1 gene of Saccharomyces cerevisiae: Cloning, structure and expression of wild-type and mutant alleles JOURNAL Gene 88, 149-157 (1990) STANDARD simple staff_review FEATURES from to/span description pept 895 1650 uracil phosphoribosyltransferase (FUR1) mRNA 886 1791 FUR1 mRNA (alt.) mRNA 888 1791 FUR1 mRNA (alt.) signal 841 848 TATA box signal 1896 1901 poly-A signal BASE COUNT 659 a 427 c 392 g 645 t ORIGIN 1 atcgataaaa gaactaatgt ttcccaaaga aataggaaaa agggaataaa gaataatagg 61 ccccacaaag acataaacag cagtcctgac tggggcaact gcacagagga accgattggc 121 agagcgaaaa agcaaacggc atgaacaggg ccaagaactc tcggaatttt accactaata 181 ttaaattgca gcgacaacat tttggcgaag aaatacaagg tggccagcca gccttgtgat 241 atctacaaat tcagatgctt cagataaatt gttaatgcta ttcaacctaa ctttgggagt 301 aaaccaagaa aacttgaaaa atgttctgga aaacatttct caggtgcaga tagctcaaat 361 tagggttaga gacctgcctt caggatctgc caccgctaag gtccgtctgg catatcctac 421 aacacagtct ttggagaagg taagaaaact gttccatggc gctctagttg atggaaggcg 481 catccaagtg gtgattgcat ctgatgaatc gtcccacttg tcgtattaga gtttgtcaac 541 gacactcaca aggtatttaa tcagcaaaat ccccgccaca aactattttt ttgaagacat 601 gctttctcat gactgcctaa taacaatacc tcattctact agtaatcgac ctatgtaatt 661 atttcataaa ctataaagca ggtcattgca ataacagaaa ggccggtttt tctataagct 721 tatctcatcg cataaaaaat cgacagttgt aattatctcc ggcggacttt tccctttccg 781 tctttttttt caaaattttt ttttttttca cttcttcttt caaagctgcc tcaaaagaga 841 tatatatatt ggtaagaatc ctcttccaat actagcttca tttcttcttg aaccatgaac 901 ccgttattct ttttggcttc tccattcttg taccttacat atcttatata ttatccaaac 961 aaagggtctt tcgttagcaa acctagaaat ctgcaaaaaa tgtcttcgga accatttaag 1021 aacgtctact tgctacctca aacaaaccaa ttgctgggtt tgtacaccat catcagaaat 1081 aagaatacaa ctagacctga tttcattttc tactccgata gaatcatcag attgttggtt 1141 gaagaaggtt tgaaccatct acctgtgcaa aagcaaattg tggaaactga caccaacgaa 1201 aacttcgaag gtgtctcatt catgggtaaa atctgtggtg tttccattgt cagagctggt 1261 gaatcgatgg agcaaggatt aagagactgt tgtaggtctg tgcgtatcgg taaaatttta 1321 attcaaaggg acgaggagac tgctttacca aagttattct acgaaaaatt accagaggat 1381 atatctgaaa ggtatgtctt cctattagac ccaatgctgg ccaccggtgg tagtgctatc 1441 atggctacag aagtcttgat taagagaggt gttaagccag agagaattta cttcttaaac 1501 ctaatctgta gtaaggaagg gattgaaaaa taccatgccg ccttcccaga ggtcagaatt 1561 gttactggtg ccctcgacag aggtctagat gaaaacaagt atctagttcc agggttgggt 1621 gactttggtg acagatacta ctgtgtttaa ataaatcaca cccgaacacc atcttgaagg 1681 ttcagaacgg ctgaagccat atcaactttg ggtttctact gttttaaatt tcctttctcg 1741 ttttaaactt ttgttgccgt ctcttctact atcaattttt gttgttcatg catgtttaat 1801 tacctttttt gtaaaaataa tataaacgta ccaatggtca tttataacaa atatgcttga 1861 aaaatctaac gactctgttt cttacattag gttcgaataa acacggtaca tgtcctctag 1921 ccaatctgac atttttggtc caaagtcttt gaaaggtaga taaccccgtt aaaatagaac 1981 caccaatcca tgtagtatat tttctttctg aaggggctat aatctttatc taggatgttc 2041 ctttggttaa tgcctccaaa tcccatagca ttcggtctcc aaagccttta agcgttgtag 2101 ttccgccact taggattatc gat // LOCUS YSCMET16A 1986 bp ds-DNA PLN 15-AUG-1990 DEFINITION S.cerevisiae 3'-phosphoadenylyl sulfate reductase (MET16) gene, complete cds. ACCESSION J05591 KEYWORDS 3'-phosphoadenylyl sulfate reductase. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1986) AUTHORS Thomas,D., Barbey,R. and Surdin-Kerjan,Y. TITLE Gene-enzyme relationship in the sulfate assimilation pathway of Saccharomyces cerevisiae: Study of the 3'-phosphoadenylylsulfate (PAPS) reductase structural gene JOURNAL J. Biol. Chem. (1990) In press STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Surdin-Kerjan, 28-JUN-1990. FEATURES from to/span description pept 792 1562 3'-phosphoadenylyl sulfate reductase (MET16) signal 686 692 TATA-box site 643 648 cis-acting element in general control of AA synthesis site 612 618 UAS (methionine metabolism) BASE COUNT 670 a 368 c 379 g 569 t ORIGIN 1 atgcatcttg cctctttgat attggttgga tcttcttatg gcttccacga actctcttgt 61 gtaaatatct ggatttctac cgtcctcaat gtattgaaca acttccaagg gaatgtccac 121 cttagacaag ctggattgag gatcgttgct tctcacgttc agcttgtaca agcgatccac 181 atttctttgc aagttggtga tcattccctt ggtggcttct ggagtaccag gaaaatcata 241 tatcgagaca cctaattcaa cgaaggactc aataatcgaa gccacttggt cttgagtagt 301 ggccagttct tgctgcaatt gttcattgtt agtgctgttt ccattcatct tatcggttta 361 tttttctata tatttgcctc tttctcaaac aggagttagt agttaaaagt acgaagttct 421 tgttctttaa tgcgcgctga caaaagaatt ggataaaaga gaatggtggg gggacaagaa 481 ggaaatttgt cctagtttaa catgaatggc atcttgttac cgggtggaca tcacctattg 541 attctaaata tctttacggt ttatcatact gttctttatt ccgtcgttat tctttttatt 601 tttatcatca tttcacgtgg ctagtaaaag aaaagccaca acatgactca gcaaatctcg 661 acaaagtaaa agctcataga gatagtatta tattgatata aaaaaagtat actgtactgt 721 ttgtaacctt ttcaatgctt taagatcaaa actaaggcca gcaaaggtat caacccatag 781 caactcataa aatgaagacc tatcatttga ataatgatat aattgtcaca caagaacagt 841 tggatcattg gaatgaacaa ctaatcaagc tggaaacgcc acaggagatt attgcatggt 901 ctatcgtaac gtttcctcac cttttccaaa ccactgcatt tggtttgact ggcttggtta 961 ctatcgatat gttgtcaaag ctatctgaaa aatactacat gccagaacta ttatttatag 1021 acactttgca ccatttccca caaactttaa cactaaaaaa cgagattgag aaaaaatact 1081 accagcctaa aaatcaaacc attcacgtat ataagccgga tggatgtgaa tcggaggcag 1141 attttgcctc gaaatacggg gatttcttat gggagaaaga tgatgacaag tacgattatc 1201 tggccaaagt ggaacctgca catcgtgcct acaaagagct acatataagt gctgtgttta 1261 ctggtagaag aaaatcacaa ggttctgccc gctcccaact gtcgattatt gaaatagacg 1321 aacttaatgg aatcttaaaa ataaatccat tgatcaattg gacgttcgag caggttaaac 1381 agtatataga tgcaaacaat gtaccataca acgaactttt ggaccttgga tatagatcca 1441 ttggtgatta ccattccaca caacccgtca aggaaggtga agatgagaga gcaggaagat 1501 ggaagggcaa ggcaagaccg agtgtggaat tcatgaagcc agccgattcg cgcaattttt 1561 aaagcaagat gcctagatag atagagtacg atatataacc atatgtatgt gactaattat 1621 ttattcctta ataacaccaa tgattacaac tttctaaagc tggcggagaa ttcgcgctgt 1681 acgagaaaag agcgaaaaca gaggaatatt caaactaaga accaaactgc gataaagagg 1741 attgaaagga aaaacgaaag aaaaggtaaa ctgacaaata tatacattaa ccgatgggta 1801 atttcagatt tcctataaaa accaagctac caccagggtt tatcaatgct cgcatactta 1861 gggataactt caaaagacaa caatttaaag agaatgaaat ccttgttaaa tctttgaaat 1921 tcatcgctag aaatatgaac cttccaacaa aactgaggtt ggaggctcag ttaaaactaa 1981 atgcat // LOCUS CHT59KD 2429 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.trachomatis 59-kDa immunogenic protein (SK59) gene, complete cds. ACCESSION M31119 KEYWORDS antigen; immunogenic protein. SOURCE C.trachomatis L2 (strain LGV-2 434BU) elementary body DNA, clone beta-1. ORGANISM Chlamydia trachomatis Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Rickettsias and Chlamydias; Chlamydiales; Chlamydiaceae. REFERENCE 1 (bases 1 to 2429) AUTHORS Kahane,S., Weinstein,Y. and Sarov,I. TITLE Cloning, characterization and sequence of a novel 59-kDa protein of Chlamydia trachomatis JOURNAL Gene 90, 61-67 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by Y.Weinstein, 05-JAN-1990, for release after publication. FEATURES from to/span description pept 466 2043 59-kDa immunogenic protein BASE COUNT 700 a 626 c 417 g 686 t ORIGIN 1 ggatcccgaa ttgggtaact ctcagaccca cacataaggc catatgctcg agtacgtgag 61 ccactccact agaatcttgc gggcaagtcc gaaaagaaat attgaacaca ttttcatcat 121 catcattcac gatcatcatg atcgttgccc cggtcggagt atgttccact tcaattagct 181 tgctctcgat ctcgggaaga tcctgactca acttgactac aaaatttcta taggtatccc 241 agttttcata tcccactcaa tcttctataa tagagaagct tgttgcatct ccctattttc 301 gattcaccta acatagaaga cagctactgt gagctcttat atccacacaa atattctttc 361 tgaaggcttc tcttattaaa aaaaaagacg ggactcgatt gagtccccat actagactag 421 cttcctaaaa tataaggcca ggactactcg tctgatttca agacgatgaa tcgcaccaca 481 tctcccttga gaaaccataa ggagaacatt ctctcccttt cgagtttttc aaaacctgat 541 ttaactcttc aacggaagcg acctcgctgc ctattcaccg ctaagataag ctgtccagga 601 gcgacgcctg cagaagctgc aggcgagctg cctccacagc aactaccaga atcctcgggt 661 atctgctgcc aatccgagtt tcttacaaat ttctggagta atgtaagtct cacggactcc 721 catcttctgg caacgctgaa acgccatcct ctgttgggtc tgtgtaaccg tcacaggtat 781 ctcgcttgtt ttcccttcac gacgattttt aaaataacac gagtccctgg catcattagg 841 gaaatggcat tacgcaacgc actcaaagac tctacttctt tttccattgt aagccacaat 901 gacatcttct tccagccccg ctttttctgc tggagaacct ttaacaacat ccgtcaccaa 961 acgttccgta cacttttcca atttgtaaca agtagccaat tcagaatcta tcggttgcaa 1021 ggtaactccc aaaaagcctc ttgttacctg cccatcacta atcaattgat caatgactcg 1081 tttagccatc aagctaggaa tagcaaaccc tattccaata tatcccccgc taccactgac 1141 aatggcagta ttaaccccga taacttgacc attgattgtt taacaatgga ccgccctgaa 1201 ttcccaggat taatggcagc atctgttgta acaaatcttc gaaatctaca atatgtagat 1261 gatttcttcc tttcagcact aacgaccccg atagtgaccg ttgcttgcaa tccaaaagga 1321 tttccaatag caatagccca gtcacctatc tgcagtcgat cagaattccc aaaagtcaaa 1381 aatggtaatt tctctgctgt aattttgatc acagcaagat ctgtttttgg atctaacccc 1441 acgatcttag gctgtgtatt tttgtccatc gtggagagta acatgaattt ttcctgcatc 1501 ctcgactaca tgatggttag taacaacata accattcgaa ttcagaaaca tagaacccag 1561 ttcctcttac agcatcacgc cgctgctgcg gacgctgctg ctctctatcg aaggcaaccc 1621 aaaaaatcga ttaaaaaatt cgtcattaaa ataatcaaaa acaaaagggt tctcttgaag 1681 cctcttttgt ttcctggaga agcaatagcc tggttccctg ttttaggaaa attttcatat 1741 atatcaactc caggacgttg ccttagacgc gcgacccgag taaaacctcg ggatacttct 1801 ttaggagatc ttcttgtgaa acctcttgat ctccgtgagg atactgcaag acaaatatca 1861 gccattaaga atctttcttt gacgcactat agcctagcat tggcgaagag aaaacgtgat 1921 gtcgatagca acacacataa taataatctt ttcaatcatc ttttccttga taagcgatct 1981 gcgtctagcc cggtttttca tttatgcacc ataacaagca gatatgcagc atacaaaatc 2041 taatgatgca aatcaaggag actactctga tgattctcca atctaaaaaa ctaacgtggt 2101 tttagaacgg atgcaaccgg cctctccaat cagtgcagga gattctacaa cggtaacccc 2161 tgcctgtctc aaagcttctt gtttgctaaa agcatcccca cttttccctg aaataatagc 2221 tcctgcatgt cccatacgtt tccctttggg gagccgtagc tcctgcaata aatgcaatca 2281 caggcttact actatgttga cgtatccaat ctgcagcttc ctcttcagcg cttccaccaa 2341 tctccaatca taagaacagc ttctgtttgg ctatcctttt caaactcttg gagaggcatc 2401 gataaaagat gtgccacttt aaaggatcc // LOCUS AFAAZU 810 bp ds-DNA BCT 15-AUG-1990 DEFINITION A.denitrificans azurin (azu) gene, complete cds. ACCESSION M30388 KEYWORDS azurin. SOURCE A.denitrificans (strain NCTC8582) DNA. ORGANISM Alcaligenes denitrificans Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Aerobic rods and cocci; Alcaligenaceae. REFERENCE 1 (bases 1 to 810) AUTHORS Hoitink,C.W.G., Woudt,L.P., Turenhout,J.C.M., van De Kamp,M. and Canters,G.W. TITLE Isolation and sequencing of the Alcaligenes denitrificans azurin-encoding gene: Comparison with the genes encoding blue copper proteins from Pseudomonas aeruginosa and Alcaligenes denitrificans JOURNAL Gene 90, 15-20 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.W.Canters, 01-DEC-1989, for release after publication. FEATURES from to/span description pept 307 756 azurin (azu) precursor sigp 307 366 azurin signal peptide matp 367 753 azurin site 190 203 fnr-box site 251 267 ntrA-box binding 296 299 ribosome binding site signal 778 800 terminator BASE COUNT 160 a 238 c 245 g 167 t ORIGIN 1 cccgccgctg tgctgccttg catgctcgaa ctctacttgt ttgcaattgt ttgcaggcat 61 cctacgaaga tggaagaccc ttcgtattgc ggtttgtcaa tgggcacggt ttcggtgcgc 121 cggatgggcc aataccccta tgcggcatgg ggatttcccc tgtttttggg catctgaacg 181 gggtgggatt gatgtccgtc aatagcgcgc ttttttcgcc gtcttagact tgtgcgtggc 241 ggcagcgacg caggcatgtg cctggcgcga gtcgaagaat ggccgccctg tttacggaga 301 gtctccatgc tggcaaaagc caccctagct atcgttctgt ccgcagccag cctgcccgtg 361 ctggctgctc aatgcgaagc aaccatcgaa agcaacgacg ccatgcagta caacctgaag 421 gaaatggtcg ttgacaaaag ctgcaagcag ttcacggtgc acctcaagca cgtcggcaag 481 atggccaagg tcgccatggg ccacaactgg gtgctgacca aggaagccga caagcagggc 541 gtcgccactg acggcatgaa cgccggcctg gcgcaggact acgtgaaggc gggcgatacc 601 cgtgtcatcg cgcacaccaa ggtcatcggc ggcggcgaat cggattcggt aacgttcgac 661 gtgtccaagc tgaccccggg cgaagcctat gcctacttct gctcgttccc cggccactgg 721 gccatgatga agggcacgct caagctgagc aactgacccc gccctagcgc gcagataccg 781 gcccagggcc ggtttttttt gtcttggggc // LOCUS PSEAZU 1287 bp ds-DNA BCT 15-AUG-1990 DEFINITION P.aeruginosa azurin (azu) gene, complete cds. ACCESSION M30389 KEYWORDS azurin. SOURCE P.aeruginosa (strain CIT135) DNA. ORGANISM Pseudomonas aeruginosa Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 1287) AUTHORS Hoitink,C.W.G., Woudt,L.P., Turenhout,J.C.M., Van De Kamp,M. and Canters,G.W. TITLE Isolation and sequencing of the Alcaligenes denitrificans azurin-encoding gene: Comparison with the genes encoding blue copper proteins from Pseudomonas aeruginosa and Alcaligenes denitrificans JOURNAL Gene 90, 15-20 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by G.W.Canters, 01-DEC-1989, for release after publication. FEATURES from to/span description pept 213 < 1 (c) ORF1 pept 489 935 azurin (azu) precursor sigp 489 548 azurin signal peptide matp 549 932 azurin pept > 1287 985 (c) ORF2 (AA at 1287) site 318 333 ntrA-box site 403 416 fnr-box signal 958 985 terminator (bidirectional azu and ORF2) binding 476 481 ribosome binding site binding 224 220 (c) ribosome binding site (ORF1) BASE COUNT 229 a 423 c 428 g 207 t ORIGIN 1 ctgcaggctc tgcgggatga tcccgatcac ttcgctgccg gcggccaatg cggcgtccgc 61 cacggtgccc atcagaccga ccgcgccgcc accgtagacc agggtcaggc cgcgctcggc 121 caggtgccgg ccgagggcca cggcggcttc ctggtagacc ggggaagcgc cggggctggc 181 gccacagaat acgcagacgg aacgcaaggt catgatcgac tcctgtcggg ggtggaaaaa 241 ggcgcacagg gtagcggctg ggagcgcttc gaccaagccg tgcgaagcgt tgccggacgt 301 tgcgtcgcag gcgcgaagcg gcacatctgt gctaaaacag gagttccccg tagtaaacgc 361 cgggcagatc ccgctcgatg ccccgccacg tccggttcgg gtttgacctg aatcagtgga 421 actcggtgcc cgatcgggca gtctgctctt tcaggattca tcgcccaacc tgcctaggag 481 gctgctccat gctacgtaaa ctcgctgccg tatccctgct gtccctgctc agtgcgccgc 541 tgctggctgc cgagtgctcg gtggacatcc agggtaacga ccagatgcag ttcaacacca 601 atgccatcac cgtcgacaag agctgcaagc agttcaccgt caacctgtcc caccccggca 661 acctgccgaa gaacgtcatg ggccacaact gggtactgag caccgccgcc gacatgcagg 721 gcgtggtcac cgacggcatg gcttccggcc tggacaagga ttacctgaag cccgacgaca 781 gccgcgtcat cgcccacacc aagctgatcg gctcgggcga gaaggactcg gtgaccttcg 841 acgtctccaa gctgaaggaa ggcgagcagt acatgttctt ctgcaccttc ccgggccact 901 ccgcgctgat gaagggcacc ctgaccctga agtgatgcgc gagcgatccg ctgcatgaaa 961 aagcccggcc gctgccgggc tttttcatgg gcgcgcgccg ggctcagcgc gcgtagctgc 1021 cgccatcgcc tcgccggcca gttggtgcac gcgccgggtc ggatgccact cgtcccagaa 1081 gtagtactgg tccgggttgg cgcaggccgg gcggacgctg ggctgggtcg gctggcaggg 1141 cgcgtccagc tccaccaggc catagcgcgc cgggttgcgc cgcaagtggc ggctgaaggt 1201 gagatggtcg aaccagctca gctccaggcc gcgggtcttg cgcagggcgg cgagctggat 1261 cggcaggctg gcgttgactg cctgcag // LOCUS MZEADH1CM 6167 bp ds-DNA PLN 15-AUG-1990 DEFINITION Z.mays alcohol dehydrogenase (ADH-1 C-m allele) gene, complete cds. ACCESSION M32984 KEYWORDS alcohol dehydrogenase. SOURCE Z.mays DNA. ORGANISM Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 6167) AUTHORS Osterman,J.C. and Dennis,E.S. TITLE Molecular analysis of the ADH1-Cm allele of maize JOURNAL Plant Mol. Biol. 13, 203-212 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.C.Osterman, 18-MAR-1990. FEATURES from to/span description pept 1217 1250 alcohol dehydrogenase, exon 1 (ADH-1) (EC 1.1.1.1) 1785 1921 alcohol dehydrogenase, exon 2 2019 2065 alcohol dehydrogenase, exon 3 2482 2807 alcohol dehydrogenase, exon 4 2894 2976 alcohol dehydrogenase, exon 5 3070 3145 alcohol dehydrogenase, exon 6 3487 3548 alcohol dehydrogenase, exon 7 3636 3731 alcohol dehydrogenase, exon 8 3823 3984 alcohol dehydrogenase, exon 9 4085 4201 alcohol dehydrogenase, exon 10 pre-msg 1110 > 4201 ADH-1 mRNA and introns IVS 1251 1784 ADH-1 intron A IVS 1922 2018 ADH-1 intron B IVS 2066 2481 ADH-1 intron C IVS 2808 2893 ADH-1 intron D IVS 2977 3069 ADH-1 intron E IVS 3146 3486 ADH-1 intron F IVS 3549 3635 ADH-1 intron G IVS 3732 3822 ADH-1 intron H IVS 3985 4084 ADH-1 intron I BASE COUNT 1574 a 1335 c 1378 g 1880 t ORIGIN 1 bp upstream of BamHI site. 1 ggatccaata ggctagtcac ttttacttta gcttctgaga tccaaacagt cacttaggac 61 atgtttggaa gcacaccagt ttttaaaaaa ctttttccta tcctcaattt ctagaaaatg 121 gtttatgaaa aaaaatttgg gtgggatgtt tgtaacccag tttctagttt tttttataaa 181 gagagtagct tcttggtttt agttagagga gagtagcttc ttggttttta agaaactggg 241 aatccagttt ctataaactg gaacataaat aagtatattt ggaatcactt tagtttgtac 301 aaaccgattt cttagaaatt ggatgcttat aaataggccc tcaatgtcct tgttgggttt 361 atgaaattta catctattac cacattttta aaaatagagg aagagtatgc tagtagttat 421 gtataaaaaa actagaaact gtttttttta aaaaaaaact gagttccagt ttcctttatc 481 taattctttt ataagctatt ttttagaaaa ggatagaaac tgtttttaaa aaaactggtg 541 tgcttctgtt taactcttcg taagaacagt gttacgtccc gtgtctatat tttgcttttg 601 ttgaaagcca tcgtaagtac atgcttgcgt gggtgaaatg ccatcgcaat gctacaactt 661 ttcggctccc tcctgcttcg gtgcttccac atgccctgca cggcgtctag aaaccctaat 721 gattcagcag cacacctgtc cgcctagccg cctacgcgta cacagaaaac aaattttttg 781 tccacacacg cgcgcgctcc gagccgcaga tccgagctag cgcggcgcat ccgacggcca 841 cgacagcgcg gtgccgtcct ccgccgccac cgcttggcgc ttgtccgcac cccccaccag 901 tccaccacct cccccacgag cgaaaaccac ggtccacgga ccacggctat gttccactcc 961 aggtggaggc tgcagccccg gtttcgcaag ccgcgccgtg gtttgcttgc ccacaggcgg 1021 ccaaaccgca ccctccttcc cgtcgtttcc catctcttcc tcctttagag ctaccactat 1081 ataaatcagg gctcattttc tcgctcctca caggctcgtc tcgctttgga tcgattggtt 1141 tcgtaagtgg tgagggactg agggtctcgg agtggattga tttgggattc tgttcgaaga 1201 tttgcggagg ggggcaatgg cgaccgcggg gaaggtgatc aagtgcaaag gtccgccttg 1261 tttctcctct gtctcttgat ctgactaatc ttggtttatg attcgttgag taattttggg 1321 gaaagcttcg tccacagttt tttttttcga tgaacagtgc cgcagtggcg ctgatcttgt 1381 atgctatcct gcaatcgtgg tgaacttatt tcttttatat ccttcactcc catgaaaggc 1441 tagtaatctt tctcgatgta acatcgtcca gcactgctat taccgtgtgg tccatccgac 1501 agtctggctg aacacatcat acgatattga gcaaagatct atcttccctg ttctttaatg 1561 aaagacgtca ttttcatcag tatgatctaa gaatgttgca acttgcaagg aggcgtttct 1621 ttctttgaat ttaactaact cgttgagtgg ccctgtttct cggacgtaag gcctttgctg 1681 ctccacacat gtccattcga attttaccgt gtttagcaag ggcgaaaagt ttgcatcttg 1741 atgatttagc ttgactatgc gattgctttc ctggacccgt gcagctgcgg tggcatggga 1801 ggccggcaag ccactgtcga tcgaggaggt ggaggtagcg cctccgcagg ccatggaggt 1861 gcgcgtcaag atcctcttca cctcgctctg ccacaccgac gtcgacttct gggaggccaa 1921 ggtatctaat cagccatccc atttgtgatc tttgtcagta gatatgatac aacaactcgc 1981 ggttgacttg cgccttcttg gcggcttatc tgtcttaggg gcagactccc gtgttccctc 2041 ggatctttgg ccatgaggct ggagggtatg ttctattccc cgatttactt cactatgttg 2101 ctgactatat atgtgctgtg tttatatttt gcatatttat tatgtttttg cgtctgaatt 2161 tatgggtatg gttggtggtc tttgtttact gttttactag atgcatgtgg aagagtcaga 2221 agaaatagtt tttgtttgaa atggtatacc aacggttgga tattatctgt gtggacatca 2281 gatgttctgg gttactggca gtggactttt gacagattta tctatgattc tttcattagc 2341 agtttcttcg gctaatttac tcttactatt ttttcagtat acaaaggcac gtacagcttg 2401 gattgtgtag aatcatttta gatctgttat ctgaggcaaa tttgcttatt ctagccgcct 2461 gaaaattctt gattttgcca gtatcataga gagtgttgga gagggtgtga ctgacgtagc 2521 tccgggcgac catgtccttc ctgtgttcac tggggagtgc aaggagtgcg ctcactgcaa 2581 gtcggcagag agcaacatgt gtgatctgct caggatcaac accgaccgcg gtgtgatgat 2641 tgccgatggc aagtcgcggt tttcaatcaa tgggaagcct atctaccact ttgttgggac 2701 ttccaccttc agcgagtaca ctgtcatgca tgtcggttgt gttgcaaaga tcaatcctca 2761 ggctcccctt gataaagttt gcgtccttag ctgtggtatt tctaccggta agttcattta 2821 ctacattttg gtgtggatgc tggagtacat ttatcttgag atgctgagtt acacaaattc 2881 tttatctgtt taggtcttgg tgcatcaatt aatgttgcaa aacctccgaa gggttcgaca 2941 gtggctgttt tcggtttagg agccgttggt cttgccgtaa gtgttgaaac gatttgcttg 3001 ttctatgacc tttcaattgc aatgagaacg tgtgttgggt ttgcatctga ttaccctgcg 3061 catggttagg ctgcagaagg tgcaaggatt gctggagcgt caaggatcat tggtgtcgac 3121 ctgaacccca gcagattcga agaaggtaca gtacacacac atgtatatat gtatgatgta 3181 tcccttcgat cgaaggcatg ccttggtata atcactgagt agtcatttta ttactttgtt 3241 ttgacaagtc agtagttcat ccatttgtcc cattttttca gcttggaagt ttggttgcac 3301 tggccttggt ctaataactg agtagtcatt ttattacgtt gtttcgacaa gtcagtagct 3361 catccatctg tcccattttt tcagctagga agtttggttg cactggcctt ggactaataa 3421 ctgattagtc attttattac attgtttcga caagtcagta gctcatccat ctgtcccatt 3481 tttcagctag gaagttcggt tgcactgaat ttgtgaaccc aaaagaccac aacaagccgg 3541 tgcaggaggt ctgtttcttt acccaaggca acaaaaggtt atcacagctt atgctgaact 3601 tggccataac attcaataat tcctttatgg tctaggtact tgctgagatg accaacggag 3661 gggtcgaccg cagcgtggaa tgcactggca acatcaatgc tatgatccaa gctttcgaat 3721 gtgttcatga tgtaagtata tgtatacact ctcagctact ttcattctcc aggttccctt 3781 catccagaca tgcatgttct aaccgccgcc ctcgtgatcc agggctgggg tgttgccgtg 3841 ctggtgggtg tgccgcataa ggacgctgag ttcaagaccc acccgatgaa cttcctgaac 3901 gaaaggaccc tgaaggggac cttctttggc aactataagc cacgcactga tctgccaaat 3961 gtggtggagc tgtacatgaa aaaggtaaat tgcaaagtgc tgttccttcg gtttccttac 4021 cagccgagct tttgctgaaa aactgttaag aatcgttcct gcaattctgc ttggctctgc 4081 acaggagctg gaggtggaga agttcatcac gcacagcgtc ccgttcgccg agatcaacaa 4141 ggcgttcgac ctgatggcca agggggaggg catccgctgc atcatccgca tggagaacta 4201 gatttcgctg tctagtttgt gatctggctg ggcttggggt taataaagga ggcaatgcta 4261 gcctgccctt tcgatgagga ggtacataca cgctggcgat ggaccgcgct tgtgtgtcgc 4321 gttcagtttg gcttttgcca agcagtaggg tagcttcccg tgtcggtaat tatatggtat 4381 gaaccatcac cttttggcgc aatacatggt atgaacgtaa gatacaaatt ccaactacct 4441 ctagctcgct tgtgtgctat atgtatctct ctcgacggat gacacaagat cgcttctata 4501 tccgaagtga aactaaaagg agaaggaaaa gaaggtaaca gaataggaac cggtttggtg 4561 agaattggag aggattcatg aaagagaaaa tcccttttca ttaaatttta aatagcaagt 4621 gatttactct ctcatgatct cctccagttt ccatttcatc aaaacaaacc ttattcattt 4681 tcccctctaa tctctttctt gtcaccaccg gtggagcaag gtgattaaag agactaaatt 4741 attattcaat gaatagtagg ggttttagcc cctcaattcc tccaatacct ttgctcccaa 4801 ataagggggt gtttggtttc tagggactaa ttcctccaat acctttgctc ccaaataagg 4861 gggtgtttgg tttctaggga ctaatgttta gtcccatcat ttttttttct attttagtct 4921 ataaattgct aaatatagaa actaaaataa attaaaatat agttttagtt tctatatttg 4981 acaattttag aactaaaatg gaataaaatg tagggactaa aaattagtct agaaatcaaa 5041 caccccctaa atccctaaga gccgaggaag gggattaaaa aggataaaat cttctttgtg 5101 ttcaatttta aataggactc gccgtatcgg taaggccttg ttcgtttaca ttggattgca 5161 cctggaatcg ttccggctaa tcaaagttta tataaattag agaagcaatc cggatcggaa 5221 tcgttccgac ccaccaatcc gacgcaaacg aacaaggcct aaggcttcgc ggcggggctc 5281 gcagtccgga cgccggagag ggggagtgga gatggagaat gacaaggggg tgttctggaa 5341 agtttccttt ccaagagtaa gggtggttgg tttcgtacac taatttttaa gagcgtttgg 5401 ttaagaaaca gagaaaaatg gagtaactct attcttattt tttatgttta gttttcatta 5461 aaaaaggagc agaataccac ttgaagttct tatatagaaa tttatcataa atagttaaaa 5521 tgctctcact ccataaaaac aatcggatgc tagcgctctt cttcctatcc taccctctat 5581 attcatatga ctctttaacc aaacagagaa cggagcggct ccgctctatt ttactcttca 5641 accaaataaa aaaggagcaa ctctgtttgt catacgcgga atagaacgga tttatcctca 5701 aaaactagaa tggagcccct ctattttagt cgattctcca accaaacgca tagtgtctcc 5761 atttcattct attttagtct ctaaattgac aaatacataa actaaattat attttaagtt 5821 ttcgtattta atcaatccct accaaccaaa cactccctaa tttcgcatat cagccccaaa 5881 tcaagagtgg ttgacccatc gagacgttat cggcggatca aaggcatgcc ccgctaagca 5941 ataagtgtct aaactaacgt gccgtcgatc tcattaaaca gcaccacgag ctaaacagaa 6001 tgccaacctc aaaatcaaac atcacctgga tgctggatct gacatccgac ctaggtgcta 6061 ggcaacgatt gtgcgtagtg ctgaccatat ttgagatttt cactttattt attaaaaaaa 6121 agaggccagc agggtgggcc gctacccggc ctggtggccg agctaga // LOCUS CFICMCASE 1828 bp ds-DNA BCT 15-AUG-1990 DEFINITION C.uda endoglucanase gene, complete cds. ACCESSION M36503 KEYWORDS endoglucanase. SOURCE C.uda CB4 DNA. ORGANISM Cellulomonas uda Prokaryota; Bacteria; Firmicutes; Irregular asporogenous rods. REFERENCE 1 (bases 1 to 1828) AUTHORS Nakamura,K., Misawa,N. and Kitamura,K. TITLE Sequence of a cellulase gene of Cellulomonas uda CB4 JOURNAL J. Biotechnol. 4, 247-254 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 105 1184 endoglucanase BASE COUNT 348 a 542 c 557 g 381 t ORIGIN 1 ctgcagagtc agggaggcag cgctcacgta atattgcagc gtgaccgcgt gttctctgtc 61 tctgacgttc agtttcttta ctaccatcca taatgagtga atttatgccc ctgcgtgctt 121 tagtggcggt gatagtgaca acggcagtaa tgctggtgcc ccgggcgtgg gcgcagacgg 181 cctgggagcg ttataaggcc cgttttatga tgccggacgc gcgtatcatt gataccgcca 241 atggcaatgt gtcgcatacg gaaggccagg gcttcgccat gctcctggcg gtggcgaata 301 acgatcgccc ggcgttcgac aagctgtggc agtggacgga cagcaccctg cgcgacaagt 361 ctaacgggct gttttactgg cgctataacc cggtggcgcc ggacccgatc gccgataaaa 421 acaacgccac cgatggcgat accctgatcg cctgggcgct gctgcgcgcg caaaagcagt 481 ggcaggacaa gcgctacgcc acggcctccg atgccatcac cgcctccctg ctgaaatata 541 cggtggtgac tttcgccggt cgccaggtga tgctcccggg cgtgaagggg tttaaccgca 601 acgaccacct gaaccttaac ccctcctatt tcatcttccc ggcctggcgg gcctttgcgg 661 agcggacgca cctgaccgcc tggcggacat tgcagagtga cgggcaggcg ctgctggggc 721 aaatgggctg ggggaaatcg catctgccca gcgactgggt ggcgctgcgg gcggatggca 781 agatgctgcc ggccaaagag tggccgccgc ggatgagttt cgatgcgatc cgtatcccgc 841 tgtatatctc gtgggtcgat ccgcacagcg ccttgctcgc accgtggaaa gcctggatgc 901 agagttaccc gcgcctgcaa actccggcgt ggatcaacgt tagcaccaac gaggtcgccc 961 cgtggaatat ggccggcggc ctgctggcgg tgcgtgattt aacgcttggc gaaccgctgg 1021 aacgccgcag attgacgaca aggatgatta ttactccgcc agcctcaagc tgctggtctg 1081 gctggcgaaa caggatcagc gctagcgctg tgatggcttt gcaggtttct cagcccgtat 1141 gcctgcgggc tgagagaaaa gagcaggaac gtctcacgat gtaaggccgc cagaataggc 1201 ggccttgtcg cttattgcgg ataaggcacc caactgccgc cattcagctg gacataaggc 1261 ttgccctgat actggataac gatggcgttg gcgttttcgg acaccgccgc gctctgcggc 1321 aggttggcga catactgctg ccagttgacg ctgtcttcgc tgaacatttt gccgtcgagg 1381 gcgcgcgcac caccagctcc gacaccgcca ggtagctgct gggctgatcg atgataattg 1441 gcgcgccttc atgtggcgcc ttcatgccga agaatttcac cgccgtcggg acgttagtga 1501 tcgacgggct cgggatatcc cgcaggccag acacctgcat cttatcgccc ttcagcgcgc 1561 cgccgtgttc cggcaccacc accaccatca ccttacgccc cgatttttcc agttcggtga 1621 agaagttatc caggtcgtca aacagcttct gcgcccgcac tttgtagtcc gcggttttgc 1681 tttgccccgg gaagtgattg ccgtcatgca gcggcagggt gttatagaac gtggcgctcc 1741 gcggattgct gctggcctct tcggtcttca gccacgggtt gagaaccgcg agatcctcat 1801 acactggcga accatcaaat gcctgcag // LOCUS HUMTAPA1 1496 bp ss-mRNA PRI 15-AUG-1990 DEFINITION Human 26-kDa cell surface protein TAPA-1 mRNA, complete cds. ACCESSION M33680 KEYWORDS 26-kDa cell surface protein TAPA-1; target of antiproliferative antibody. SOURCE Human cell line OCI-LY8, cDNA to mRNA, clones 7-3 and 8-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1496) AUTHORS Oren,R., Takahashi,S., Doss,C., Levy,R. and Levy,S. TITLE TAPA-1, the target of an anti-proliferative antibody, defines a new family of transmembrane proteins JOURNAL Mol. Cell. Biol. 10, 4007-4015 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by S.Levy, 10-APR-1990, for release after publication. FEATURES from to/span description pept 239 949 26-kDa cell surface protein TAPA-1 signal 1455 1460 Poly-A signal BASE COUNT 257 a 504 c 413 g 322 t ORIGIN 1 ccattgtgct ggaaaggcgc gcaacggcgg cgacggcggc gaccccaccg cgcatcctgc 61 caggcctccg cgcccagccg cccacgcgcc cccgcgcccc gcgccccgac cctttcttcg 121 cgcccccgcc cctcggcccg ccaggccccc ttgccggcca cccgccaggc cccgcgccgg 181 cccgcccgcc gcccaggacc ggcccgcgcc ccgcaggccg cccgccgccc gcgccgccat 241 gggagtggag ggctgcacca agtgcatcaa gtacctgctc ttcgtcttca atttcgtctt 301 ctggctggct ggaggcgtga tcctgggtgt ggccctgtgg ctccgccatg acccgcagac 361 caccaacctc ctgtatctgg agctgggaga caagcccgcg cccaacacct tctatgtagg 421 catctacatc ctcatcgctg tgggcgctgt catgatgttc gttggcttcc tgggctgcta 481 cggggccatc caggaatccc agtgcctgct ggggacgttc ttcacctgcc tggtcatcct 541 gtttgcctgt gaggtggccg ccggcatctg gggctttgtc aacaaggacc agatcgccaa 601 ggatgtgaag cagttctatg accaggccct acagcaggcc gtggtggatg atgacgccaa 661 caacgccaag gctgtggtga agaccttcca cgagacgctt gactgctgtg gctccagcac 721 actgactgct ttgaccacct cagtgctcaa gaacaatttg tgtccctcgg gcagcaacat 781 catcagcaac ctcttcaagg aggactgcca ccagaagatc gatgacctct tctccgggaa 841 gctgtacctc atcggcattg ctgccatcgt ggtcgctgtg atcatgatct tcgagatgat 901 cctgagcatg gtgctgtgct gtggcatccg gaacagctcc gtgtactgag gccccgcagc 961 tctggccaca gggacctctg cagtgccccc taagtgaccc ggacacttcc gagggggcca 1021 tcaccgcctg tgtatataac gtttccggta ttactctgct acacgtagcc tttttacttt 1081 tggggttttg tttttgttct gaactttcct gttacctttt cagggctgat gtcacatgta 1141 ggtggcgtgt atgagtggag acgggcctgg gtcttgggga ctggagggca ggggtccttc 1201 tgcccctggg gtcccagggt gctctgcctg ctcagccagg cctctcctgg gagccactcg 1261 cccagagact cagcttggcc aacttggggg gctgtgtcca cccagcccgc ccgtcctgtg 1321 ggctgcacag ctcaccttgt tccctcctgc cccggttcga gagccgagtc tgtgggcact 1381 ctctgccttc atgcacctgt cctttctaac acgtcgcctt caactgtaat cacaacatcc 1441 tgactccgtc atttaataaa gaaggaacat caggcatgct aaaaaaaaaa aaaaaa // LOCUS DROSYNCL 3727 bp ds-DNA SYN 15-AUG-1990 DEFINITION Synthetic cloning vector encoding heat-shock protein 82/neomycin phosphotransferase fusion protein (hsp82-neo) gene, complete cds. ACCESSION M32616 KEYWORDS heat-shock protein 82; neomycin phosphotransferase. SOURCE Synthetic, D.pseudoobscura, D.melanogaster and bacterial DNA, clone pHS85. ORGANISM Cloning vector Artificial sequences; Cloning vehicles. REFERENCE 1 (bases 1 to 3727) AUTHORS Sass,H. TITLE P-transposable vectors expressing a constitutive and thermoinducible hsp82-neo fusion gene for Drosophila germline transformation tissue-culture transfection JOURNAL Gene 89, 179-186 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Saas 06-MAR-1990, for release after publication. FEATURES from to/span description pept 2068 2925 heat-shock protein 82/neomcyn phosphotransferase fusion protein (hsp82-neo) IVS 1005 2067 hsp82 intron A pre-msg 6 3610 hsp82-neo fusion protein mRNA and intron site 6 868 D.pseudoobscura hsp82 gene 5' flank site 869 1004 D.pseudoobscura heat-shock protein 82, exon 1 site 2068 2126 D.pseudoobscura hsp82 truncated exon 2 site 2127 2142 coding linker site 2143 3269 neomycin phosphotransferase coding sequence site 3270 3610 non-coding 3'flank of D.melanogaster hsp82 gene with Poly-A signal site 3620 3726 multiple cloning site (MCS) BASE COUNT 950 a 866 c 882 g 1029 t ORIGIN 23 on XR. 1 ggatccgatg gatttttacc atattattat tatttctagc cacgttgcaa ctctatgtca 61 gtaccggaaa tagcagccct ggagtctctt agcctctaga aacggctaga acattctacg 121 cttgtggttg gttttcattg aaagcaggcg tcttttatat actttacggt atatagctac 181 atgtatataa tggtatactt catcaatatc atcaatctat gaattttaat ttttaagagt 241 acatatataa attaacatgg gggatatagt tctcaatacc caagtatttg aattttccat 301 ctctcatcgg gggtaattca tgaaccggtt ccagccgaaa aatgaacgaa attcatgaga 361 gattattttt tcgggattgc ttgccaatac atttcggaaa aacaaaatgt actacatttt 421 tgtcatctca gggtgctcca attaattatg aatgctacga cactacaaag cagcttggaa 481 atccgaattt taacaataat taaaggaaat agggtatagc gtatataggg tatcatagct 541 gaaacgggta taccaacaat aatgacgcag cacttacgtt tcactccgta ctcacttacg 601 atttatgctt ataatttttg ttcacctctt ttacttaaac ctcactttaa aaacaatcaa 661 ataaatggga gtatttatgt atatttctaa gattacggcg gtattgttct gctgtctgcg 721 gtcacactgg ttttcagcct cggtgcaact ctgtttcagt accggaaata gcagccctgg 781 attctcgtag cctctagaaa cgtctagaaa attctacgct tggggttggt ttgctataaa 841 agcaggcggg ccgactgttg ccggctcgag tcttgaaaaa tttttgtcca gtgaaggtgc 901 gtttgcttag agcgcagtgc aacaaagtga atttattcta cacaaatcga agtgaaaata 961 tatatatatt tttatctctg ctgttaaatt aaaacacata caaggtaagc gttaacaatg 1021 aaagtgcatt tatttaacaa aatgtaaaga tctgctgtgg tgcaatgctt gctgcgcgtc 1081 tgctgatgaa aagttcttga cccaaatgca gaaaatcaat agaatctgtg aaatcttcta 1141 taatcttaaa attagattaa agttctattt ttttgcccga gtttgtaacc acgggcgata 1201 aaaagtagct ttacgcctcg cacaccaata cacgaacaga aaaattatgc cggctgtaat 1261 atgagctcgg cgcgaaattt ctagatgacc ggttcttaga acatcaacct tgcatgtcca 1321 acaaatgctg gttaattaaa gacgtgcctt aacttaattt tcttggcaca cgtgcttatt 1381 tgaattcagt cttttgcact tgccatgcac acagccacac atatgtgaat ttgcgaattt 1441 gccactcatg catacactca tgtatgttcc atcatcgaga aaattcgaaa atcgtgaatc 1501 aaacttcggc atgaatcaaa tttcaaagag gtctttgttt ccacctggtt ctagaagttt 1561 cctttcgcgt gcttggatac ctatcttatg cataaacggt ttctgcacat gtaacttgaa 1621 cacatacaca cttgcaaaca tatgtatgta catatgcata ccctgaccac aaaattttca 1681 gcaaacttta gccgtacatc aaaccaccaa agagctgtgc tgttgtcaag gagaattttc 1741 ttccagaaag cttcaattag attgtttatc tgggggtgat gtacgcattg gacaacccta 1801 tgcgctctag aaacttccag taaatgttaa ctggatgtac aatgggtaca tccctaagcg 1861 tgcgagtgta tgcgtgttcg ctaactgtaa tgtatgtgtg ttcgtgtgcg aaagagaaaa 1921 ggatgagaag tctgccattt tgaaataaaa agattttgtg ctaggggggt ggggaaatat 1981 gattatcgaa aatgggcagt gaacaatgca gctgcatatt taatgagttg tgactaattc 2041 tcgtgtggta ttttcttgct cttccagatg cccgaagaag ctgagacttt cgcattccag 2101 gctgagattg ctcagcttat gtcgttgatc cggccaagct tggatggatt gcacgcaggt 2161 tctccggccg cttgggtgga gaggctattc ggctatgact gggcacaaca gacaatcggc 2221 tgctctgatg ccgccgtgtt ccggctgtca gcgcaggggc gcccggttct ttttgtcaag 2281 accgacctgt ccggtgccct gaatgaactg caggacgagg cagcgcggct atcgtggctg 2341 gccacgacgg gcgttccttg cgcagctgtg ctcgacgttg tcactgaagc gggaagggac 2401 tggctgctat tgggcgaagt gccggggcag gatctcctgt catctcacct tgctcctgcc 2461 gagaaagtat ccatcatggc tgatgcaatg cggcggctgc atacgcttga tccggctacc 2521 tgcccattcg accaccaagc gaaacatcgc atcgagcgag cacgtactcg gatggaagcc 2581 ggtcttgtcg atcaggatga tctggacgaa gagcatcagg ggctcgcgcc agccgaactg 2641 ttcgccaggc tcaaggcgcg catgcccgac ggcgaggatc tcgtcgtgac ccatggcgat 2701 gcctgcttgc cgaatatcat ggtggaaaat ggccgctttt ctggattcat cgactgtggc 2761 cggctgggtg tggcggaccg ctatcaggac atagcgttgg ctacccgtga tattgctgaa 2821 gagcttggcg gcgaatgggc tgaccgcttc ctcgtgcttt acggtatcgc cgctcccgat 2881 tcgcagcgca tcgccttcta tcgccttctt gacgagttct tctgagcggg actctggggt 2941 tcgaaatgac cgaccaagcg acgcccaacc tgccatcacg agatttcgat tccaccgccg 3001 ccttctatga aaggttgggc ttcggaatcg ttttccggga cgccggctgg atgatcctcc 3061 agcgcgggga tctcatgctg gagttcttcg cccaccccgg gctcgatccc ctcgcgagtt 3121 ggttcagctg ctgcctgagg ctggacgacc tcgcggagtt ctaccggcag tgcaaatccg 3181 tcggcatcca ggaaaccagc agcggctatc cgcgcatcca tgcccccgaa ctgcaggagt 3241 ggggaggcac gatggccgct ttggtcgatc gatgataaac ataaaaccaa ataaacaaca 3301 agcaaatgtg ttttaaaaat ctaacttctg agcgagtatt tattgggggg aataaacaat 3361 ctatgaatcg gattctttgc gcagcagctg ctcaatggcc tccaccgtgg acactccgtt 3421 ggttatcatt attatcttgt ttcgcgatcg agatcccttg tccaaagaaa cgtcgctctt 3481 tcgaagacct agaactttcg acagaaactt gaccagttcg gcgttagctt ctccctcgct 3541 gggcggagcg gcgatttgga cgcccactcc ttcaaagcca attcctgtga ttccgttctg 3601 cttagccccc ccggaattgg gtacccccac cgcggtggcg gccgctctag aactagtgga 3661 tcccccgggc tgcaggaatt cgatatcaag cttatcgata ccgtcgacct cgaggggggg 3721 cccggta // LOCUS ECOARGD 1221 bp ds-DNA BCT 15-AUG-1990 DEFINITION E.coli acetylornithine aminotransferase (argD) gene, complete cds. ACCESSION M32796 KEYWORDS acetylornithine aminotransferase. SOURCE E.coli (K12) DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1221) AUTHORS Heimberg,H., Boyen,A., Crabeel,M. and Glansdorff,N. TITLE Escherichia coli and Saccharomyces cerevisiae acetylornithine aminotransferases: Evolutionary relationship with ornithine aminotransferases JOURNAL Gene 90, 69-78 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by A.H.T.Boyen 13-MAR-1990, for release after publication. FEATURES from to/span description pept 1 1221 acetylornithine aminotransferase (argD) (EC 2.6.1.11) BASE COUNT 261 a 290 c 376 g 294 t ORIGIN 73 minutes. 1 atggcaattg aacaaacagc aattacacgc gcgactttcg atgaagtgat cctgccgatt 61 tatgctccgg cagagtttat tccggtaaaa ggtcagggca gccgaatctg ggatcagcaa 121 ggcaaggagt atgtcgattt cgcgggtggc attgcagtta cggcgttggg ccattgccat 181 cctgcgctgg tgaacgcgtt aaaaacccag ggcgaaactc tgtggcatat cagtaacgtt 241 ttcaccaatg aaccggcgct gcgtcttggg cgtaaactga ttgaggcaac gtttgccgaa 301 cgcgtggtgt ttatgaactc cggcacggaa gctaacgaaa ccgcctttaa actggcacgc 361 cattacgcct gtgtgcgtca tagcccgttc aaaaccaaaa ttattgcctt ccataacgct 421 tttcatggtc gctcgctgtt taccgtttcg gtgggtgggc agccaaaata ttccgacggc 481 tttgggccga aaccggcaga catcatccac gttcccttta acgatctcca tgcagtgaaa 541 gcggtgatgg atgatcacac ctgtgcggtg gtggttgagc cgatccaggg cgagggcggt 601 gtgacggcag cgacgccaga gtttttgcag ggcttgcgcg agctgtgcga tcaacatcag 661 gcattattgg tgtttgatga agtgcagtgc gggatggggc ggaccggcga tttgtttgct 721 tacatgcact acgcgttagc gccggatatt ctgacctctg cgaaagcgtt aggcggcggc 781 ttcccgatta gcgccatgct gaccacggcg gaaattgctt ctgcgtttca tcctggttct 841 cacggttcca cctacggcgg taatcctctg gcctgtgcag tagcgggggc ggcgtttgat 901 atcatcaata cccctgaagt gctggaaggc attcaggcga aacgccagcg ttttgttgac 961 catctgcaga agatcgatca gcagtacgat gtatttagcg atattcgcgg tatggggctg 1021 ttgattggcg cagagctgaa accacagtac aaaggtcggg cgcgtgattt cctgtatgcg 1081 ggcgcagagg ctggcgtaat ggtgctgaat gccggaccgg atgtgatgcg ttttgcaccg 1141 tcgctggtgg tggaagatgc ggatatcgat gaagggatgc aacgtttcgc ccacgcggtg 1201 gcgaaggtgg ttggggcgta a // LOCUS YSCARG8 1272 bp ds-DNA PLN 15-AUG-1990 DEFINITION S.cerevisiae acetylornithine aminotransferase (ARG8) gene, complete cds. ACCESSION M32795 KEYWORDS acetylornithine aminotransferase. SOURCE S.cerevisiae FL100 DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1272) AUTHORS Heimberg,H., Boyen,A., Crabeel,M. and Glansdorff,N. TITLE Escherichia coli and Saccharomyces cerevisiae acetylornithine aminotransferases: Evolutionary relationship with ornithine aminotransferases JOURNAL Gene 90, 69-78 (1990) STANDARD full staff_entry COMMENT Draft entry and computer readable sequence for [1] kindly submitted by A.H.T.Boyen 13-MAR-1990, for release after publication. FEATURES from to/span description pept 1 1272 acetylornithine aminotransferase (ARG8) (EC 2.6.1.11) BASE COUNT 404 a 230 c 283 g 355 t ORIGIN 1 atgtttaaaa gatatttatc cagtacgtca tcaagaagat ttacaagcat tttagaggaa 61 aaggcctttc aagtgaccac ttactctaga cctgaagatc tatgtataac tagaggtaaa 121 aatgcaaagc tgtatgatga cgtgaatggt aaagaatata tcgatttcac cgcaggtatt 181 gcggtgaccg cattaggcca tgcaaatcct aaagtggcag aaattctgca ccatcaggct 241 aacaaactgg ttcattcctc caacctttac ttcactaagg aatgtttgga tttaagtgaa 301 aagattgttg aaaagaccaa gcaattcggt ggtcaacacg acgcctcaag agtattttta 361 tgtaattctg gtacggaagc aaatgaagct gctttgaagt ttgcaaagaa acatggtata 421 atgaaaaatc ctagcaagca aggcattgtt gcatttgaga actcttttca tggccgtact 481 atgggcgctt tatctgtcac ttggaatagt aaatatagaa ctccttttgg ggatttggtt 541 ccccatgtct cattcttaaa tttgaatgac gaaatgacca aactacaaag ttatatcgag 601 accaaaaagg acgagattgc tggtttaatt gtcgagccca tacaaggtga aggtggggtt 661 tttcccgtag aagttgaaaa gctaaccgga ttgaagaaaa tatgtcaaga taatgatgtg 721 attgtcattc atgatgaaat tcaatgcggt ttgggccgtt caggtaaact atgggctcat 781 gcttatttac caagtgaggc tcatccggat atttttacat ctgccaaagc attgggaaat 841 ggcttcccca tcgctgccac catcgtcaat gaaaaagtta ataatgcttt gagagttggt 901 gaccacggca ccacgtatgg tggtaatccg ctggcctgtt ctgtaagcaa ctatgttttg 961 gataccatag cagacgaagc ttttttgaaa caagtctcta agaagagtga tatcttacaa 1021 aagcgcttgc gcgaaattca agccaaatat ccaaatcaaa taaagactat cagaggaaaa 1081 ggtttgatgc ttggtgctga gttcgtcgaa ccacccaccg aggtcatcaa aaaggccaga 1141 gaattgggac ttttgatcat taccgctggt aagagtaccg ttagatttgt tcccgcatta 1201 acgattgaag acgaactaat cgaagaaggg atggatgctt ttgaaaaggc tattgaagcg 1261 gtttacgctt aa // LOCUS MZEMTMINI 1445 bp ds-DNA ORG 15-AUG-1990 DEFINITION Maize mitochondrion 1.4 kb minicircle DNA open reading frame. ACCESSION M36398 KEYWORDS . SOURCE Maize mitochondrion 1.4 kb minicircle DNA. ORGANISM Mitochondrion Zea mays Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae; Zea mexicana. REFERENCE 1 (bases 1 to 1445) AUTHORS Smith,A.G. and Pring,D.R. TITLE Nucleotide sequence and molecular characterization of a maize mitochondrial plasmid-like DNA JOURNAL Curr. Genet. 12, 617-623 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 120 353 ORF 1 pept 1240 1356 ORF 2 pept 1245 1403 ORF 3 BASE COUNT 375 a 327 c 320 g 423 t ORIGIN 1 gaattccttc ctttggtcgg actactcttt ttaggttatt gccttcggtc aaccctaaat 61 aagttgattg tcaaattgcg ctgtaactgc attcagttga atatgcggat attttatcaa 121 tgaatctcga tatcctgttg ataaagattg gatttcttgc gattctgatc gttttatcaa 181 tccaaatcat cgatgaatat ttccataaag tgatctgtga tcctttagtc tcaatatcag 241 ttgtttcctg ccgggataac ttgggttatg ctagccacct acttctacaa acaggtgaga 301 tccacctggg tgggttcgaa tcccatctgc tagatgcgtg gtcatggaat tgaaacctct 361 atggctggcc caagggaacc ggtcttgtcg attgacctag cttaggaaga gcccagtgaa 421 cctatccaca agtcaacccc cagggataat ggaaaacctc attcgcccat tggcaaacac 481 ttaaatatga ggacattcct ctggcaagac aggttagaga cttgagagac taaagacaag 541 aaggcacagg ttgtagtttt cttccaaggc caaaagcccc gcatggtgga agaagctact 601 ggtaagtccg agggggggct taactgcgat agttgaccga cgcgacgcta taccggaaag 661 gccttcgggg tgttgaaagt atggaacttt tattctcgca tagcttggga aagggtatcc 721 ggtgaaactc cccttaaaag ggtttttccc ccgtaccccc ttttcccaaa aaatttttta 781 aaaaaagtgg atcagtgaac ctatctttat ctgattaaat cagtggttag gttcactact 841 atttatagat aacaacccta gccttggggg gacaccccct ccccccaatc ccccctgtct 901 ggttttgttt taaaccaagt ttgcagggcg agcttgtttt gttatttata attagttatt 961 tcatgtttga tccgagcttc gggataggga acctctcttg tcagaaaggc ttccctctcc 1021 cttggtctct tgaaacagga cttttattca ctcagctatg cttcccggaa atccggatta 1081 aagaataaag acttctatac ctttccggga agcagagcag agggaaacgg agccctcgcc 1141 ccggagggga atcaattctc tggtttatcg ttcttatgct gttgcggtta taacgatagg 1201 aattactaga taacatcctc taggaattac tagataacaa tggaatggtt gagcctacta 1261 tctcaagtgt tggaaggctc aacctacttg cttgtccctc tccactatcg ttccggtctt 1321 accttccctc gagtccgatc tcgggaaggc gcttaggcag gggccccaag actaagcagg 1381 taatacaata cctatattta tagagggctt ttacctcgat aaatgagggc gcttcctata 1441 atgtg // LOCUS NGOTEM1A 1199 bp ds-DNA BCT 15-AUG-1990 DEFINITION N.gonorrhoeae plasmid pFA7 beta-lactamase (TEM-1) gene, 3' end. ACCESSION M36543 KEYWORDS beta-lactamase. SOURCE N.gonorrhoeae plasmid pFA7 DNA. ORGANISM Neisseria gonorrhoeae Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Neisseriaceae. REFERENCE 1 (bases 1 to 1199) AUTHORS Sanchez-Pescador,R., Stempien,M.S. and Urdea,M.S. TITLE Rapid chemiluminescent nucleic acid assays for detection of TEM-1 beta-lactamase-mediated penicillin resistance in Neisseria gonorrhoeae and other bacteria JOURNAL J. Clin. Microbiol. 26, 1934-1938 (1988) STANDARD simple staff_entry FEATURES from to/span description pept < 1 21 beta-lactamase (TEM-1) (AA at 1) BASE COUNT 344 a 237 c 178 g 440 t ORIGIN 1 tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat 61 ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg 121 accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccta tctataaact 181 cttggcttgg ttctaatccc tctaaacgat tattatcaat agccgctcta accgcttttt 241 ctcggcttaa tttttctgtc tctgttataa aattgcttat tcattcttgt tcttctttca 301 aaaaaaagtt aagtaaaata cctacctaaa tttttactag ttcgcaatct acgagcttat 361 aacctcgttt tttcaattca tttaaaaaat cagattttga gcctaatttg atctattgct 421 atcgttaccc gctagaaata cccagtaatt acgcaaatct tcattggtaa ctttcgtaat 481 atctgtgtaa tgatcttcga gtatttttaa gcaatctcta gcccataaac cgtactcgtg 541 attgctcatc ttagggtttt gcttatcgag tttgacgaac ttcccatact tgtttttatg 601 tggaaatact ggccgtttgc aacttcttca attttttgag ctgttcgttt tttactacca 661 atcacaaaat ttaaagagtg aatagtacgc ccacgcttga tttgttcaac ctcaacgact 721 aaatcagatt tctcgttaat ctcagttatt gcaggttcca aaacacgttg atttaatgaa 781 ttaaatctag gtattattca acctgaagcc attctttagt tttctactgt aatttcacga 841 ctaccaacag agcgatattg tgtaattagc tcataaattc gaattgaatg tacactgttg 901 aaataagcga tatgtttgag ttgatattgc gtgaattgcc ctttaagttg cgttaggtat 961 ggcataactt catcagtcat tgcaattcta aaacgcccct ctttctgaaa tatgttctag 1021 aggaaaccca acgaaattca gttacacggt ctttatcttc agttttaaca cttcggtcat 1081 aaatccgttt tatagccgcc tgaatttgct tataggcgtt atcttggctt atttctggaa 1141 actcacggac aaaatcagcc accgtaaaat caaaaatttt ttgattagat ttcggatcc // LOCUS FLANAX 1461 bp ss-RNA VRL 15-AUG-1990 DEFINITION Influenza A/Chile/1/83 (H1N1), neuraminidase (seg 6), cDNA to mRNA. ACCESSION M24783 M33023 KEYWORDS neuraminidase. SOURCE Influenza virus type A, cDNA to viral RNA. ORGANISM Influenza virus type A Viridae; ss-RNA enveloped viruses; Negative strand RNA viruses; Orthomyxoviridae; Influenzavirus; Influenza A viruses. REFERENCE 1 (bases 1 to 1461) AUTHORS Schreier,E., Roeske,H., Driesel,G., Kuenkel,U., Petzold,D.R., Berlinghoff,R. and Michel,S. TITLE Complete nucleotide sequence of the neuraminidase gene of the human influenza virus A/Chile/1/83 (H1N1) JOURNAL Arch. Virol. 99, 271-276 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 21 1433 neuraminidase BASE COUNT 466 a 263 c 343 g 388 t 1 others ORIGIN 1 agcaaaagca ggagtttaaa atgaatccaa atcagaaaat aataaccatt ggatcaatct 61 gtatgacaat cggaataatt agtctaatat tgcaaatagg aaatattatt tcaatatggg 121 ttagccactc aatccaaact ggaagtcaaa accacactgg aatatgcaac caaagaatca 181 ttacttatga aaatagcacc tgggtaaatc aaacatatgt caatattaac aacactaacg 241 ttgttgctgg aaaggacaca acttcagtga cattagccgg caattcatct ctttgtccta 301 tccgtgggtg ggctatatac agcaaagaca acagcataag aattggttcc aaaggagatg 361 tttttgtcat aagagaacct tttatatcat gttctcactt ggaatgcaga accttttttc 421 tgacccaagg tgctctatta aatgacaagc attcaaatgg gaccgttaag gacagaagcc 481 cttatagggc cttaatgagc tgtcctatag gtgaagctcc gtctccatac aattcaaggt 541 ttgaatcagt tgcttggtca gcaagcgcat gtcatgatgg catgggctgg ctaacaatcg 601 gaatttctgg tccagatgat ggagcagtgg ctgtactaaa atacaacggc ataataactg 661 aaaccataaa aagttggagg aagcgaatat taagaacaca agagtctgaa tgtgtctgtg 721 taaacggttc atgttttacc ataatgaccg atggcccgag taatggacct gcctcgtaca 781 gaatcttcaa aatcgagaag gggaagatta ctaaatcaat adagttggat gcacccaatt 841 ctcattacga ggaatgttcc tgttacccag acaccggcac agtgatgtgt gtgtgcagag 901 acaattggca tggttcgaat cgaccttggg tgtcttttaa tcaaaacctg gattatcaaa 961 taggatacat ctgcagtggg gttttcggtg acaatccgcg tcccaaagat ggaaaaggca 1021 gctgtgatcc agtaactgtt gatggagcag acggagtaaa ggggttttca tacaggtatg 1081 gtaatggtgt ttggatagga aggactaaaa gtaacagctc cagaaaggga tttgagatga 1141 tttgggatcc taatggatgg acagataccg atagtaattt cttagtgaaa caggatgtag 1201 tggcaatgac tgattggtca gggtacagcg gaagtttcgt tcaacatcct gagctaacag 1261 gattggactg tatgaggcct tgcttctggg ttgaattaat cagaggacga cctagagaaa 1321 agacaacaat ctggactagt gggagcagca tttctttttg tggcgtgaat agtgatactg 1381 caaattggtc ttggccagac ggtgccgagt tgccattcac cattgacaag tagtccgttg 1441 aaaaaactcc ttgtttctac t // LOCUS YSPURA4 1764 bp ds-DNA PLN 15-AUG-1990 DEFINITION S.pombe orotidine-5'-phosphate decarboxylase (ura4) gene. ACCESSION M36504 KEYWORDS orotidine-5'-phosphate decarboxylase. SOURCE S.pombe DNA. ORGANISM Schizosaccharomyces pombe Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1764) AUTHORS Grimm,C., Kohli,J., Murray,J. and Maundrell,K. TITLE Genetic engineering of Schizosaccharomyces pombe: A system for gene disruption and replacement using the ura4 gene as a selectable marker JOURNAL Mol. Gen. Genet. 215, 81-86 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 534 1328 orotidine-5'-phosphate decarboxylase (ura4) mRNA 484 > 1328 ura4 mRNA BASE COUNT 550 a 274 c 349 g 591 t ORIGIN 1 aagcttagct acaaatccca ctggctatat gtatgcattt gtgttaaaaa agtttgtata 61 gattatttaa tctactcagc attctttctc taaataggaa tttgttactt aatggagaaa 121 aaaatgtttc gatttaccta gtgtatttgt ttgtatactc acgtttaatt tcaaacatcc 181 attctatctt gtgtaatttt tggcatggtg aaaaagataa tcagccttat aatctttaca 241 aaagtaagaa attctgtaaa taagccttaa tgcccttgct ttaaattaaa atggttcttt 301 ttcatgataa tgtttgcact ttgtgaatat attttagata gttctgtgag gtataattaa 361 gatgttttag agacttatac aattttgtct ttataaattc ttaattgatt ttaccatccc 421 agtttaacta tgcttcgtcg gcatctctgc acatgtcgtg ttttcttacc gtattgtcct 481 accaagaacc tcttttttgc ttggatcgaa attaaaggtt taaaagcaaa gttatggatg 541 ctagagtatt tcaaagctat tcagctagag ctgaggggat gaaaaatccc attgccaagg 601 aattgttggc tttgatggaa gaaaagcaaa gcaacttgtc agtcgcggtc gatttgacga 661 agaaatccga aatcttagaa ttggtagata aaattggacc ctatgtctgt gttatcaaga 721 cacatattga cgttgtcgag gatttcgacc aggatatggt agaaaaactg gtggccttag 781 gtaaaaagca tcgttttctt atctttgagg atcgcaaatt cgcagacatt ggaaataccg 841 tcaagctaca atatgcatct ggtgtgtaca aaattgcttc ttgggctcat atcacaaatt 901 gccatacagt gccaggcgag ggtattatac aaggcctcaa agaagttggt ttacctttgg 961 gacgtggtct cttgcttttg gctgaaatgt cttccaaagg ctctttggct actggttcct 1021 acacagagaa aaccttagaa tggtttgaga agcataccga tttttgcttt ggctttatag 1081 ctggtcgtcg atttcctaac cttcaaagcg actacataac tatgtcccct ggtatcggct 1141 tggatgttaa aggagacggg ctgggacagc aatatcgtac tcctgaagaa gtgattgtaa 1201 actgcggtag cgatatcatc attgttggtc gtggagtcta tggagctggt cgtaatcctg 1261 ttgtcgaagc caagagatat agagaagctg gttggaaggc atatcagcaa agactttctc 1321 agcattaaaa aaagactaat gtaaaatttt tttggttggt tattgaaaaa gtcgatgcct 1381 tgtttgcgtt tgttttccta ggcgttttat gtcagaaggc atttagaatt agtatacaag 1441 tactctttgg taaaatttta tgtagcgact aaaatattaa ctattataga taaacacctt 1501 gggaataaaa agtaatttgc tatagtaatt tattaaacat gctcctacaa cattaccaca 1561 atcttttctc ttggattgac attgaataag aaaagagtga atttttttag acttgtaatg 1621 ataactatgt acaaagccaa tgaaagatgt atgtagatga atgtaaaata ccatgtagac 1681 aaacaagata aaacttggtt ataaacattg gtgttggaac agaataaatt agatgtcaaa 1741 aagtttcgtc aatatcacaa gctt // LOCUS BMEGDH1 2834 bp ds-DNA BCT 15-AUG-1990 DEFINITION B. megaterium glucose dehydrogenase gene and ORFs. ACCESSION D90043 KEYWORDS glucose dehydrogenase. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2834) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura, Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan. FEATURES from to/span description pept 1964 2749 glucose dehydrogenase (EC 1.1.1.47) ORF 181 867 ORF1 ORF 1086 1946 ORF2 signal 89 94 put. -35 region for ORF1 signal 116 121 put. -10 region for ORF1 signal 167 172 ORF2 ribosome binding site signal 893 928 termination signal signal 972 977 put. -35 region for ORF2 signal 997 1003 put. -10 region for ORF2 signal 1070 1076 ORF2 ribosome binding site signal 1949 1954 glucose dehydrogenase ribosome binding site signal 2751 2783 termination signal for glucose dehydrogenase BASE COUNT 896 a 442 c 644 g 852 t ORIGIN 1 gatcaggtag cgagaatctt tgatgaaggt ttttcaacca aagcaaagga aaatagagga 61 attggtttgc atttagtaaa acaaattgtt gaaaaaggaa acggtcagat cgaagtagag 121 tcagaattag atgttggaac gacttttatc attacattct ttttataggg ggagtgggaa 181 atgaataaaa aagcatggac cgtgcttctc atagaagacg atcctatggt acaagaagtg 241 aaccgccaat ttattgaaca agttgaaggg ttcactgtta tcgctgcagc ttcgaatggt 301 ttagaggggg tacagctcat taaacagcat cagcctgatt taacgattat tgatatgtat 361 atgcctagtc aagatggctt aaccacctta cagcaaattc gagcaaatgg ctataaaaca 421 gacgtgatag cagttacggc tgcaagtgat attgaaaccg tacgcaaagt tcttcaatat 481 ggcgctgtgg attatattat gaaaccgttc aagtttgaac gaatgaagca agcgcttgag 541 cagtatcgtt cgtttcaagt taaaataagt caaaaagaac atattactca gtctgaatta 601 gattctatgc tgtttcagca attcgaagaa aaagccgatt tgcttcccaa ggggctaaat 661 gcggttacgt taaggaggat acaacaatat ctttccgaac aaaatcatcc aatttctgct 721 gaagaagtgg cggacggcgt aggaattgcg cgtgttacag caagaaggta tttagagttt 781 ttagaacagg aaaacgagct gaaattatca gttgaatacg gcagagtggg gagacctatt 841 aatcgctata tgttaaaaat aaattaaatc atacagaaca gcttttattt ggaaaagctg 901 tttttttgcg ttagaaagta tatctttttc tctcctagaa caaattaagg tatacagttt 961 tcgctaccca aagaatattt cgtgcggtca ttaatccata aaatgtccct gaaaaggatt 1021 aatggcggaa aaattgggga atatgcactt tgacatttaa ttttaacaca ggaaggtttt 1081 gaaacatgga catattttta gccgtcttac cagccatatt ttggggaagc attgtgcttt 1141 ttaatgtgaa actaggcgga ggaccttata gtcaaacgct tggaaccaca ttgggagctt 1201 taattttctc catcggtatt tatatttttg tacaccctac gtttacacct ttaatctttg 1261 gggttggagt tgtttcgggg ctattttggg cagttggaca aagtaatcag ctgaaaagta 1321 ttgatttaat tggagtttct aaaacgatgc ctatttcaac ggggcttcag ttagtttcca 1381 cttcattatt tggagtaatt gtgtttcacg agtggtctac aaaaacttca atcattcttg 1441 gtgtgctcgc tcttatcttt attattgtag ggattgtttt agcatcactt caaagcaaag 1501 aagagaaaga ggctgaagaa ggaaaaggaa acttcaaaaa aggaattgtt attttattaa 1561 tttcaaccgt tggttattta gtttatgttg tagtagcccg tctatttaat gtagacggat 1621 ggtcggcttt attacctcaa gcaattggta tggttattgg aggagtattg ctgacgttca 1681 agcataagcc atttaataaa tatgcaattc gcaacattat cccaggtctt atttgggccg 1741 ctggtaatat gtttttattc atctcacaac ctaaagtagg cgtagcgaca agcttttcgc 1801 tttctcaaat gggaatcgtc atttcaacat taggcgggat cattatttta ggtgagaaga 1861 aaacgaagcg tcagttagtt gggattatta ttgggattat actgatcatc atagcaggag 1921 tcatgttagg gctcgccaaa agctaactag gaggttatta acaatgtata aagatttaga 1981 agggaaagta gttgtcataa caggttcatc taccggttta ggaaaagcaa tggcgattcg 2041 ttttgcgaca gaaaaagcta aagtagttgt gaattatcgt tctaaagaag aagaagctaa 2101 cagcgtttta gaagaaatta aaaaagtcgg cggagaggca attgccgtta aaggtgacgt 2161 aacagttgag tctgacgtga tcaatttagt tcaatcttct attaaagaat ttggaaagtt 2221 agacgttatg attaataacg caggaatgga aaatccggtt tcatctcatg aaatgtcttt 2281 aagcgattgg aataaagtaa ttgatacgaa cttaacggga gcatttttag gcagccgtga 2341 agcgattaaa tattttgtgg aaaatgatat taagggaaca gttattaaca tgtcgagtgt 2401 tcacgagaaa attccttggc cattatttgt tcattacgca gcaagtaaag gcggaatgaa 2461 gctcatgacc gaaacacttg cattagaata cgctccaaaa ggtattcgtg taaataacat 2521 tggaccggga gcgattaata caccgattaa cgctgagaaa tttgctgatc ctgagcagcg 2581 tgcggatgta gaaagcatga ttccaatggg atacattgga gagccggaag aaattgcagc 2641 ggttgctgca tggctagctt cttcagaggc aagttatgta acagggatta cgctctttgc 2701 tgacggcggt atgacccagt acccatcatt ccaagcagga cgcggataag aaaaaacgca 2761 ctctataata gagtgcgttt tttagtttcc ctgagctttt ttttggttct taggagctga 2821 ctggtgttga attc // LOCUS BMEGDH2 1202 bp ds-DNA BCT 15-AUG-1990 DEFINITION B. megaterium glucose dehydrogenase (EC 1.1.1.47) gene. ACCESSION D90044 KEYWORDS glucose dehydrogenase; isozyme. SOURCE Bacillus megaterium (strain IAM1030) DNA. ORGANISM Bacillus megaterium Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 1202) AUTHORS Mitamura,T., Ebora,R.V., Nakai,T., Makino,Y., Negoro,S., Urabe,I. and Okada,H. TITLE Active and silent isozyme genes of glucose dehydrogenase from Bacillus megaterium IAM1030 JOURNAL J. Bacteriol. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Toshihide Mitamura Department of Fermentation Technology Osaka University 2-1 Yamada-oka Suita, Osaka 565 Japan Phone: 06-877-5111 x4373 Fax: 06-876-9036 FEATURES from to/span description pept 125 910 glucose dehydrogenase signal 27 32 put. -35 region signal 49 55 put. -10 region signal 111 116 SD sequence signal 928 959 termination signal BASE COUNT 427 a 190 c 268 g 317 t ORIGIN 1 tgaatgacag tttgagaaag aagagataga aaaatgttta ttcccttctt aaaacttaaa 61 ctgtatctgt aattagtaca gtataacaag acatatcagg cagaaaaagt aggaggactt 121 caagatgtat acagatttaa aagataaagt agtagttgta acaggtggat caaaagggtt 181 gggtcgcgcc atggccgttc gttttggtca agagcagtca aaagtagttg taaactaccg 241 cagcaatgaa gaggaagcgc tagaagtgaa aaaagaaatt gaagaagctg gcggtcaagc 301 tattattgtt cgaggcgacg ttacaaaaga agaagacgtt gtgaaccttg tagagacagc 361 tgttaaagaa tttggttcat tagacgttat gattaataat gcaggtgttg aaaacccggt 421 tccttctcat gaattatcat tagaaaactg gaaccaagtg attgatacaa acttaacagg 481 ggcattttta ggaagccgtg aagcaattaa atatttcgtc gaaaatgaca ttaaaggaaa 541 cgttattaac atgtccagcg ttcacgaaat gattccttgg ccattatttg ttcactatgc 601 agcaagtaaa ggcggtatga aattaatgac ggaaacattg gctcttgaat atgcgccaaa 661 aggtatccgc gtaaataaca ttggaccagg tgcaatcgat acgccaatca acgctgaaaa 721 attcgcagat ccggaacagc gtgcagacgt agaaagcatg attccaatgg gctatatcgg 781 caaaccggaa gaaatcgcat cagttgcagc attcttagca tcatcacaag caagctatgt 841 aacaggtatt acattatttg ctgatggcgg tatgacaaaa tatccttctt tccaagcggg 901 aagaggttaa taaataaagc taaaaggaaa aagacctcgg aatattccga ggtctttttt 961 gtattgtcat aaatgtacgg attatttacc gaatattgaa acttttattg aagtgttacg 1021 tatataagct aacgacgaat aaaggacgtg ttgatatgct acccgaaacg attcaacaaa 1081 aagtagatca gtatagaggt ttttatatca gcttaaaaaa tgaactcaaa tggaaagtgg 1141 cagatcccaa gcagtttatg gctatcgctt ctatgtatgc agtgaaaggt aaatcgctcg 1201 ag // LOCUS BMOPTTHP1 1023 bp ss-mRNA INV 15-AUG-1990 DEFINITION B.mori PTTH mRNA. ACCESSION D90082 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th-instar larva brain, cDNA to mRNA, clones P1, P2, C2, C9 and C19. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 1023) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept 34 708 preproPTTH matp 379 708 PTTH subunit signal 768 773 polyadenylation signal signal 826 831 polyadenylation signal signal 997 1002 polyadenylation signal signal 1004 1009 polyadenylation signal BASE COUNT 358 a 179 c 186 g 300 t ORIGIN 1 atcgttcagt tgagttatcc agcattccca atcatgatta ctcgaccgat tatattagtc 61 attttgtgtt acgctattct tatgatagtg cagtcattcg tgcctaaagc ggtagcgctg 121 aaaagaaaac cagacgtggg tggttttatg gtagaagacc aacgcacaca taaaagtcac 181 aactacatga tgaaaagagc aagaaatgac gttttgggag ataaagaaaa cgtcaggccg 241 aatccttact acacggagcc ttttgaccca gacacgagcc cagaagaatt gtccgcttta 301 atagttgatt acgccaatat gattaggaac gatgttattc tgttggataa ttccgttgaa 361 acgagaactc gaaaaagggg aaacattcaa gttgaaaacc aagctattcc ggatccacct 421 tgcacttgca aatacaagaa agaaatagaa gacttgggcg aaaactctgt tccacgcttc 481 attgaaacca gaaactgtaa taaaacacaa cagccgactt gtcgaccccc ctacatttgc 541 aaagaaagtt tatacagtat aactatttta aaaagaaggg aaactaaatc gcaggagtct 601 ctcgagatac cgaatgaatt gaaatatcga tgggtggcgg aatctcaccc cgtcagcgtg 661 gcgtgtttgt gtacaagaga ctaccaacta cgatataata ataattaatt gttttgactt 721 acgcctgatg atttgttccg aatcgaattt atttaattac tttatacaat aaagcttata 781 ttaaaaatta atgataatca attttaatta aaccaaattg aaaaaaataa aaatttcctc 841 cgattttttg tttttagtgg tggtacattc agcgaagcac tgttttgcta ggccagatgt 901 tagtagatca atacagtttt gatgcttacc ttgaaagctg tgctcttatt atactattca 961 aataagatta tatagttaaa tatattatgt atatctatta aatattaaaa gacacaattt 1021 aaa // LOCUS BMOPTTHP4 944 bp ss-mRNA INV 15-AUG-1990 DEFINITION B.mori preproPTTH mRNA. ACCESSION D90083 KEYWORDS PTTH; preproPTTH. SOURCE B.mori (Kinshu X Showa strain) 5th instar larva brain, cDNA to mRNA, clone P4 and C21. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 944) AUTHORS Kawakami,A., Kataoka,H., Oka,T., Mizoguchi,A., Kimura-Kawakami,M., Adachi,T., Iwami,M., Nagasawa,H., Suzuki,A. and Ishizaki,H. TITLE Molecular cloning of the Bombyx mori prothoracicotropic hormone JOURNAL Science 247, 1333-1335 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Hironori Ishizaki Department of Biology, School of Science, Nagoya University Chikusa-ku Nagoya 464-01 Japan Phone: 052-781-5111 x2472 Fax: 052-783-0719 Telex: SCUNAG J: 447-7323 FEATURES from to/span description pept < 1 631 preproPTTH matp 302 631 PTTH subunit signal 691 696 polyadenylation signal signal 749 754 polyadenylation signal signal 918 923 polyadenylation signal signal 925 930 polyadenylation signal BASE COUNT 337 a 163 c 177 g 267 t ORIGIN 1 tcttatgata gtgcagtcat tcgtgcctaa agcggtagcg ctgaaaagaa aaccagacgt 61 gggtggtttt atggtagaag accaacgcac acataaaagt cacaactaca tgatgaaaag 121 agcaagaaat gacgttttgg gagataaaga aaacgtcagg ccgaatcctt actacacgga 181 gccttttgac ccagacacga gcccagaaga attgtccgct ttaatagttg attacgccaa 241 tatgattagg aatgatgtta ttctgttgga taattccgtt gaaacgagaa cgcgaaaaag 301 gggaaacatt caagttgaaa accaagctat tccggaccca ccttgcactt gcaaatacaa 361 gaaagaaata gaagacttgg gcgaaaactc tgttccacgc ttcattgaaa ccagaaactg 421 taataaaaca caacagccga cctgtcgacc cccctacatt tgcaaagaaa gtttatacag 481 tataactatt ttaaaaagaa gggaaactaa atcgcaggag tctctcgaga taccgaatga 541 attgaaatat cgatgggtgg cggaatctca ccccgtcagc gtggcgtgtt tgtgtaccag 601 agactaccaa ctacgatata ataataatta attgttttga ctcacgcctg atgatttgtt 661 ccgaatcgaa tttatttaat tactttatac aataaagctt atattaaaaa ttaatgataa 721 tcaattttaa ttaaaccaaa ttgaaaaaaa taaaaatttc ctcagatttt tggtttttag 781 tgctggtaca ttcagggaag tactgttttg ctaggccaga tgttagtaga tcaatagagt 841 ttttatgctt gccttgaaag ctgtgctctt attatattat gctattcaaa taagattata 901 tagttaaata tatatctatt aaatattaaa agacacaatt taaa // LOCUS HUMMTSDHB 958 bp ss-mRNA ORG 15-AUG-1990 DEFINITION Human mitochondrial succinate-ubiquinone oxidoreductase (EC 1.3.99.1) iron sulfur subunit (sdh B) mRNA. ACCESSION D90047 KEYWORDS Ip; complex II; iron sulfur subunit; sdh B; succinate-ubiquinone oxidoreductase. SOURCE Human liver mitochondrion, cDNA to mRNA. ORGANISM Mitochondrion Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae; Homo sapiens. REFERENCE 1 (bases 1 to 958) AUTHORS Kita,K., Oya,H., Gennis,R.B., Ackrell,B.A.C. and Kasahara,M. TITLE Human complex II(succinate-ubiquinone oxidoreductase): cDNA cloning of iron sulfur(Ip) subunit of liver mitochondria JOURNAL Biochem. Biophys. Res. Commun. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kiyoshi Kita Department of Parasitology Juntendo University 2-1-1,Hongo Bunkyo-ku, Tokyo 113 Japan Phone: 03-813-3111 x3542 Fax: 03-814-9300 FEATURES from to/span description pept < 1 789 succinate-ubiquinone oxidoreductase (sdh B) (AA at 1) site 74 95 iron-sulfur binding site I site 167 179 iron-sulfur binding site II site 224 236 iron-sulfur binding site III BASE COUNT 319 a 212 c 204 g 223 t ORIGIN 1 tggcggacgt gcctgcaggc ctcccgagga gcccagacag ctgcagccac agctccccgt 61 atcaagaaat ttgccatcta tcgatgggac ccagacaagg ctggagacaa acctcatatg 121 cagacttata aggttgacct taataaatgt ggccccatgg tattggatgc tttaatcaag 181 attaagaatg aagttgactc tactttgacc ttccgaagat catgcagaga aggcatctgt 241 ggctcttgtg caatgaacat caatggaggc aacactctag cttgcacccg aaggattgac 301 accaacctca ataaggtctc aaaaatctac cctcttccac acatgtatgt gataaaggat 361 cttgttcccg atttgagcaa cttctatgca cagtacaaat ccattgagcc ttatttgaag 421 aagaaggatg aatctcagga aggcaagcag cagtatctgc agtccataga agagcgtgag 481 aaactggacg ggctctacga gtgcattctc tgtgcctgct gtagcaccag ctgccccagc 541 tactggtgga acggagacaa atatctgggg cctgcagttc ttatgcaggc ctatcgctgg 601 atgattgact ccagagatga cttcacagag gagcgcctgg ccaagctgca ggacccattc 661 tctctatacc gctgccacac catcatgaac tgcacaagga cctgtcctaa gggtctgaat 721 ccagggaaag ctattgcaga gatcaagaaa atgatggcaa cctataagga gaagaaagct 781 tcagtttaac tgtttccatg ctaaacatga tttataacca gctcagagct gaacataatt 841 tatatctaat ttgagttcct ttaaagatct tggttttcca tgaatacagc atgtataata 901 aaaattttaa gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa // LOCUS HUMNCAW 2287 bp ss-mRNA PRI 15-AUG-1990 DEFINITION Human nonspecific cross-reacting antigen (NCA-W272) mRNA. ACCESSION D90064 KEYWORDS CEA; CEA gene family; PI-anchored membrane protein. SOURCE Human white blood cells, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2287) AUTHORS Arakawa,F., Kuroki,M., Misumi,Y., Oikawa,S., Nakazato,H. and Matsuoka,Y. TITLE Characterization of a cDNA clone encoding a new species of the nonspecific cross-reacting antigen (NCA), a member of the CEA gene family JOURNAL Biochem. Biophys. Res. Commun. 166, 1063-1071 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Fumiko Arakawa First Department of Biochemistry School of Medicine Fukuoka University 7-45-1 Nanakuma Jonan-ku Fukuoka 814-01 Japan Phone: 092-801-1011 x2892 Fax: 092-801-3600 FEATURES from to/span description ORF 87 1136 nonspecific cross-reacting antigen ORF BASE COUNT 618 a 593 c 453 g 623 t ORIGIN 1 ggacagcaca gctgacagcc gtgctcagaa agtttctgga tcccaggctc atctccacag 61 aggagaacac gcaggcagca gagaccatgg ggcccatctc agccccttcc tgcagatggc 121 gcatcccctg gcaggggctc ctgctcacag cctcactttt caccttctgg aacccgccca 181 ccactgctca gctcactatt gaagctgtgc catccaatgc tgcagagggg aaggaggttc 241 ttctacttgt ccacaatctg ccccaggacc ctcgtggcta caactggtac aaaggggaaa 301 cagtggatgc caaccgtcga attataggat atgtaatatc aaatcaacag attaccccag 361 ggcctgcata cagcaatcga gagacaatat accccaatgc atccctgctg atgcggaacg 421 tcaccagaaa tgacacagga tcctacaccc tacaagtcat aaagctaaat cttatgagtg 481 aagaagtaac tggccagttc agcgtacatc cggagactcc caagccctcc atctccagca 541 acaactccaa ccccgtggag gacaaggatg ctgtggcctt cacctgtgaa cctgagactc 601 agaacacaac ctacctgtgg tgggtaaatg gtcagagtct cccggtcagt cccaggctgc 661 agctgtccaa tggcaacagg accctcactc tactcagtgt cacaaggaat gacgtaggac 721 cctatgaatg tgaaatacag aacccagcga gtgcaaactt cagtgaccca gtcaccctga 781 atgtcctcta tggcccagat gcccccacca tttccccttc agacacctat taccatgcag 841 gggtaaatct caacctctcc tgccatgcgg cctctaatcc accctcacag tattcttggt 901 ctgtcaatgg cacattccag caatacacac aaaagctctt tatccccaac atcactacaa 961 agaacagcgg atcctatgcc tgccacacca ctaactcagc cactggccgc aacaggacca 1021 cagtcaggat gatcacagtc tctgatgctg tagtacaagg aagttctcct ggcctctcag 1081 ctagagccac tgtcagcatc atgattggag tactggccag ggtggctctg atatagtagc 1141 tctggtgtag tttctgcatt tcaagaagac tggcagacag ttgtttttat tcttcctcaa 1201 agcatttgca atcagctacc attcaaaatt gcttcttctt caagatttat ggaaaatact 1261 ctgacgagta ctcttgaaca caagttcctg ataactttaa gatcacgcca ctggactgtc 1321 tatgaacttg caaacaggct gatacctttg tgaagttgcc caccaaaaca cagaaggaaa 1381 aaaacatgaa tttcattgaa ctaaataata atgaggataa tgtttttaag attttttttt 1441 tttttttttt tgagatggaa tctcgctctg tcgcccaggc tggagtgcag tggcacgatc 1501 tcaactcact gcaacgtccg cctcctgggt tcacaccatt ctcctgcctc agcctcctga 1561 gtagctggga ctacaggcgc ctgccacaac gcccggctaa ttttttgtat ttttagtaga 1621 gacggggttt cactgtggtc tcaatctcct gacttcatgg tccgcctgcc tcagcctccc 1681 aaagttctgg gattacaggt gtgagccacc gcgcccagcc cgtttttaag attttttatt 1741 tgaaaaattg ccaattcttt aagtgttttc tttttcagat ttatgaattt ctttatcttt 1801 taagctatct ataccttact gcaatttggt aaagcagact tttgtgaaca aaaattataa 1861 catttacttt tgctccctac ctgactgcca cagaactggg caactattca tgagtattca 1921 tatgtttatg gtaattcagt tatttgcaca agttcagtga gaatctgctg tctttataat 1981 gggatatagt ttaaaacatt ggttatatta ccaaggcttt gattgggatg ttatatttga 2041 gaaaatacag agaatgatag attaacggag tgtctaatct atcgtgtcaa ccccaaattt 2101 ttacgtatga gatcctttag tccacccaat ggctgacagt aacagcatct ttaacacaac 2161 tctttgttca aatgtactat ggtctctttt agagtcagac tcctagactc acttgttctc 2221 actgtctgtt ttaatttaac ccaggcatgc aatgctagat aataaaattg ctccctattg 2281 gctgatc // LOCUS PIGDESTN 1666 bp ss-mRNA MAM 15-AUG-1990 DEFINITION Porcine destrin mRNA. ACCESSION D90053 J05290 KEYWORDS actin-binding protein; cofilin; destrin. SOURCE Pig adult brain, cDNA to mRNA, clone PD2. ORGANISM Sus scrofa Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Suiformes; Suidae. REFERENCE 1 (bases 1 to 1666) AUTHORS Moriyama,K., Nishida,E., Yonezawa,N., Sakai,H., Matsumoto,S., Iida,K. and Yahara,I. TITLE Destrin, a mammalian actin-depolymerizing protein, is closely related to cofilin: Cloning and expression of porcine brain destrin cDNA JOURNAL J. Biol. Chem. 265, 5768-5773 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Moriyama Department of Biophysics and Biochemistry Faculty of Science University of Tokyo 7-3-1 Hongo Bunkyoku Tokyo 113 Japan Phone: 03-821-2111 x4408 FEATURES from to/span description pept 54 551 destrin mRNA < 1 1666 destrin mRNA signal 1645 1660 polyadenylation signal BASE COUNT 492 a 322 c 372 g 480 t ORIGIN 1 actcggctcc ggccggctcg gtctcccgcg cttctgcgac cgccgaggcg aacatggctt 61 caggagtgca agttgctgat gaagtatgtc gcatttttta tgacatgaaa gttcggaagt 121 gctccacacc agaagaaatc aagaaaagaa agaaggctgt cattttttgt ctcagtgcag 181 acaaaaagtg catcattgta gaagaaggca aagagatctt agttggagat gttggtgtaa 241 ccataaccga tcctttcaag catttcgtgg ggatgcttcc tgagaaagat tgtcgctatg 301 ctttgtatga tgcaagcttt gaaaccaagg aatccagaaa agaggagttg atgttttttc 361 tgtgggcacc agaactagca cctctgaaaa gtaaaatgat ctatgccagc tccaaggacg 421 caatcaaaaa gaaatttcaa ggcataaaac atgaatgtca agcaaatggg ccagaagacc 481 tcaatcgggc ttgtattgct gaaaagctag gtggatcctt aattgtagcc tttgaaggat 541 gccctgtgta gatgatcatt cagtgccaca gatcgaaagc ttccgtgttc aatgttatcc 601 tcttgctata taagtaaagc aaacactgag gccagggact cactgagggg agctgtcttg 661 tcatttgtta gagtaaacta actattctat gaacatgtgc acatggccct aaatcaatct 721 aaactctact ttttttgggg gtgtgtgtga aagtcttatt ggccaaaata tctattttga 781 tgagtctgct tgtagagatt tttgttaagc tcatgatttt taatcgtttc aacgtgtggt 841 tcattaaaca atgcaaggcc agatgaagag aattattgca tctttgttaa cttcagcagt 901 tactttgttt cttttgctta gagaattggt cataatcagt tatattggtc atataatttt 961 ggcccaaatt cttgagtctc tgctgagcta acctgaataa tggaaaataa ttctactcac 1021 aacaggtaac agcactaata tgctaactac agtaagatta aatcaggcca gattctacca 1081 gacgtggata ctgcctccaa aactgtgtgc acttagaacc agcgctgagc ttgcaaagca 1141 ctatttcaag cacgtagttg aaacacagca aacagctcct gcacttgaag tgagctgctt 1201 gctcactagt cagaaggctg tacagagagt gaccttgcat cttggaaatc agaacatgta 1261 ctgtcttgta ccaactaatt agagtacaaa ttagggctcc gttgtaatat gctttattag 1321 tggaaatggt aagatggtat atcaacaagc tgggtaccta tgctatcttt aatttatctc 1381 ctttggaact gtgttgcttc tggtacagta aggtgtagaa gaacattctg tttactctgg 1441 ggcctgggag aacctcttta ccttcctaga gcagtttgcc gactgtatgt gatacgggga 1501 ccagctatga cggcagcatc cacaggaagc cactgcctga tgacacttgg aagtgattgt 1561 ctttaacatc acaggcataa cactctgaac agtatagaga tgcaccaaca gttgaattta 1621 gaagtagcag tactggcttt acgtaataaa ggaaccattt taactt // LOCUS RATPMP70X 3324 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat liver 70-kDa peroxisomal membrane protein (PMP70) mRNA. ACCESSION D90038 J05256 KEYWORDS PMP70; peroxisomal membrane protein. SOURCE Rat(Wistar) liver, cDNA to mRNA, clones lambda-cPM[36,102,156,181, 189,201]. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3324) AUTHORS Kamijo,K., Taketani,S., Yokota,S., Osumi,T. and Hashimoto,T. TITLE The 70-kDa Peroxisomal Membrane Protein Is a Member of the Mdr(P-Glycoprotein)-Related ATP-binding Protein Superfamily JOURNAL J. Biol. Chem. (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Keiju Kamijo Department of Biochemistry Shinshu University School of Medicine Matsumoto 390 Japan Phone: 263-35-4600 x5182 Fax: 263-33-6458 FEATURES from to/span description pept 36 2015 peroxisomal membrane protein (PMP70) signal 3271 3276 polyadenylation signal (put.) BASE COUNT 921 a 658 c 773 g 972 t ORIGIN 1 gaattccagt gcggctcgct cgccctgccg gtgccatggc ggccttcagc aagtacttga 61 cggcgcggaa ctcctcgctg gcgggggccg cgttcctgct gttctgcctg ctccacaagc 121 ggcgtcgcgc cctcggcctg cacggtaaga aaagtggaaa accgccatta cagaataatg 181 agaaagaagg aaagaaagag cgagctgtgg tggacaaagt gtttttatca aggctctcac 241 agatcctaaa aattatggtc cctagaacat tttgtaaaga gacagggtac ttgatactta 301 ttgctgttat gctggtatct cgaacatact gtgatgtttg gatgattcaa aatggcacac 361 tgattgaaag tggcatcatt ggtcgtagca gtaaagattt caagagatac ttattcaact 421 tcatcgctgc catgcctctt atctctctgg ttaataactt cttgaagtat gggttaaatg 481 agctcaaact gtgcttccgt gtgcggctca ctagatacct ctatgaggag tatctccaag 541 ccttcaccta ctataaaatg ggcaacctgg ataacagaat agcaaaccca gaccagctgc 601 ttacacaaga tgtagaaaag ttttgtaaca gtgtagttga tctttattcg aatcttagta 661 agccattttt agacatagtt ttgtatattt tcaagttaac aagtgcaatt ggagctcagg 721 gcccggcaag catgatggcc tacttgcttg tttctgggct attcctaact cgactcagaa 781 gacccatcgg taaaatgacg attatggagc agaagtatga aggagaatat agattcgtta 841 attcacggct tatcactaat agtgaagaaa ttgcctttta caatgggaat aaacgagaaa 901 agcagacaat ccactctgtc ttccgaaaac tggtggaaca cctacataat ttcattttct 961 tccggttttc tatgggtttc attgatagca tcattgccaa atatattgcc actgtagttg 1021 ggtacctggt tgtcagtcgc ccgttcctag acctggcgca tccgcgacac cttcacagca 1081 cccactcaga gctgctggag gattactacc aaagtggaag aatgcttttg agaatgtctc 1141 aagctttggg gcggatagtt ttggctgggc gtgaaatgac tagattggct ggttttacgg 1201 ctcggattac ggaattaatg caagtactaa aggatttaaa tcatggcaaa tatgaacgta 1261 caatggtgtc acaacaggat aagggtattg aaggagcaca agctagtccc ttgatacctg 1321 gtgctggaga aatcatcaat gcagacaaca ttataaagtt tgatcatgtt cctttagcaa 1381 caccaaatgg agatatcttg atccaagacc ttagttttga agttcgatct ggggccaacg 1441 ttctcatttg tggtccaaat ggctgtggaa agagctccct cttccgtgtt cttggtgaat 1501 tatggcctct ctttggagga catcttacta aacctgagag aggaaagtta ttttatgttc 1561 ctcagcgacc ctatatgacc ctgggaacac tgagagacca agtaatatat ccagatggaa 1621 aggaggatca gaagaagaag gggatatctg accaagtgct gaaggggtac ttggacaatg 1681 tacagttggg ccatatcctt gagcgggaag gaggctggga cagtgttcag gactggatgg 1741 atgtactcag cggaggagaa aaacaaagaa tggcgatggc aagattgttt tatcataaac 1801 cccagtttgc cattctggat gagtgcacaa gtgcagttag tgtggatgtg gaagactaca 1861 tttacagcca ctgtcggaag gttggcatca ccctcttcac tgtctcacac aggaaatccc 1921 tttggaaaca ccacgagtac tacctgcaca tggatggcag aggcaattat gaattcaaaa 1981 agatcacaga agacacagtt gagttcggat catagagacc atctggagaa cttcacactt 2041 cacaagagaa tgaatgaaca gaatgcattt gtaaacaacg tgcattgtaa aataaagtta 2101 agcttgtttt ttttaaaaaa acaaagctac aaattgacta gatataggat aattgaaaca 2161 tgttaaaaca tttaatattg tataggatat tgctaattgt gtatatgttg gtttaattat 2221 taattatgta ctaagaatgt ccttattctt gtggttaaaa aacctgcctg aattaaattg 2281 ggcttaaatc agtgtaacct gattcatggg atgtaaacca tttgaagtca gctaatttga 2341 cttttatagc tctgtctttt tctttaatga agaaccctat ttaaaactgg gtcattagct 2401 gtttattcta acaaagtagt cttgagttcc tttttgggtt tttttttttt tttttttttt 2461 tttttttttg tgccccatgg tagtgggaac caaaccaatc acaatgtttt attggaacat 2521 attccatcat cacaggatag catttattaa acagtggcgg atttctctag ctgctacatt 2581 tattctcatt cctcatacat accttgaggt gcatttgatt ccaggagagc catttgggtt 2641 ttctttagct aaataataaa tgtacccgtc tcagtctttt ggactgagtc gttctgaagg 2701 ctctcgtgtg gacagcagtg tgtgcagtct cttacagtcc gtgcctgctc cacatggtac 2761 cagtcttacc agtgcttgag agctcagaca caccctgctg catgaagttg gaggtctcgg 2821 gagggtttta gattttgtga cgggaaccgg aaaggctcgt cagagtgtgg ctgtgtcatg 2881 gtgagcacca cgtggctgta gaggcccgac atgaggtaat gcactgagca cacaacgcca 2941 ctgctgctgt ctgtggctgt gggttcttaa aagtgctgga ctttgtcatg ctcgtgggcc 3001 aatgacattt cctaggagcg gcctctgact cctgtgcagc tgcgtctgtg tcagctctgg 3061 ctccctggaa ccacgagtga ctttgcacaa aggagggctg agagcggact tgatcagtaa 3121 gtcgtcgtga atcagtttgc ttgagtgggc tcggaatggg ccttatcacg atggttttgt 3181 ttcttcgtaa ctcataatca ctggctacca ggataaccct gatgtattga ttccgtgaat 3241 acatcacatt caatcttacc atgtctcctt agcaaacgtg tgtacttatt ttctgttcag 3301 attaaaaaaa aaaaaaagga attc // LOCUS VACSANT 1525 bp ds-DNA VRL 15-AUG-1990 DEFINITION Vaccinia virus surface (S) antigen gene. ACCESSION D90076 KEYWORDS S gene; surface antigen. SOURCE Vaccinia virus DNA. ORGANISM Vaccinia virus Viridae; ds-DNA enveloped viruses; Poxvirinae; Orthopoxvirus. REFERENCE 1 (bases 1 to 1525) AUTHORS Ueda,Y., Morikawa,S. and Matsuura,Y. TITLE Identification and nucleotide sequence of the gene encoding a surface antigen induced by Vaccinia virus JOURNAL Virology 177, 588-594 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Yoshiaki Ueda National Institute of Health Gakuen, Musashimurayama Tokyo 190-12 Japan Phone: 0425-61-0771 Fax: 0425-65-3315 FEATURES from to/span description pept 382 1437 surface antigen S BASE COUNT 568 a 229 c 258 g 470 t ORIGIN 1 tctagacact acactatatg cagttttaag atgccataat tcgaaaaagt taagaagata 61 cctcaacgag ttaaaaaaat ataataacga taagtccttt aaaatatatt ctaatattat 121 gaatgagaga taccttaatg tatattataa agatatgtac gtgtcaaagg tatatgataa 181 actatttcct gttttcacag ataaaaattg tctactaaca ttactacctt cagaaattat 241 atacgaaata ttatacatgc tgacaattaa cgatctttat aatatatcgt atccacctac 301 caaagtatag ttgtattttt ctcatgcgat gtgtgtaaaa aaactgatat tatataaata 361 ttttagtgcc gtataataaa gatgacgatg aaaatgatgg tacatatata tttcgtatca 421 ttattgttat tgctattcca cagttacgcc atagacatcg aaaatgaaat cacagaattc 481 ttcaataaaa tgagagatac tctaccagct aaagactcta aatggttgaa tccagcatgt 541 atgttcggag gcacaatgaa tgatatagcc gctctaggag agccattcag cgcaaagtgt 601 cctcctattg aagacagtct tttatcgcac agatataaag actatgtggt taaatgggaa 661 aggctagaaa aaaatagacg gcgacaggtt tctaataaac gtgttaaaca tggtgattta 721 tggatagcca actatacatc taaattcagt aaccgtaggt atttgtgtac cgtaactaca 781 aagaatggtg actgtgttca gggtatagtt agatctcata ttaaaaaacc tccttcatgc 841 attccaaaaa catatgaact aggtactcat gataagtatg gcatagactt atactgtgga 901 attctttacg caaaacatta taataatata acttggtata aagataataa ggaaattaat 961 atcgacgata ttaagtattc acaaacggga aagaaattaa ttattcataa tccagagtta 1021 gaagatagtg gaagatacaa ctgttacgtt cattacgacg acgttagaat caagaatgat 1081 atcgtagtat caagatgtaa aatacttacg gttataccgt cgcaagacca caggtttaaa 1141 ctaatactag atccaaaaat caacgtaacg ataggagaac ctgccaatat aacatgcact 1201 gctgtgtcaa cgtcattatt gattgacgat gtactgattg aatgggaaaa tccatccgga 1261 tggcttatag gattcgattt tgatgtatac tctgttttaa ctagtagagg cggtatcacc 1321 gaggcgacct tgtactttga aaatgttact gaagaatata taggtaatac atataaatgt 1381 cgtggacaca actattattt tgaaaaaacc cttacaacta cagtagtatt ggagtaaata 1441 cacaatgcat ttttatatac attactgaat aattattatt attatttata tcgtatttgt 1501 gctatagaat gaatgaggat acgcg // LOCUS YSCA1 881 bp ds-DNA PLN 15-AUG-1990 DEFINITION S. cerevisiae acidic ribosomal protein A1 (YSCA1). ACCESSION D90072 X13682 KEYWORDS acidic ribosomal protein; ribosomal protein. SOURCE S. cerevisiae (strain IFO-40028) DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Unpublished (1990) STANDARD full staff_entry REFERENCE 2 (bases 1 to 315; 631 to 881) AUTHORS Mitsui,K. and Tsurugi,K. TITLE Identification of A1 protein as the fourth member of 13 kDa-type acidic ribosomal protein family in yeast Saccharomyces cerevisiae JOURNAL Biochem. Biophys. Res. Commun. 161, 1001-1006 (1989) STANDARD full staff_entry REFERENCE 3 (bases 277 to 742) AUTHORS Tsurugi,K. and Mitsui,K. TITLE cDNA and deduced amino acid sequence of acidic ribosomal protein A1 from Saccharomyces cerevisiae JOURNAL Nucleic Acids Res. 16, 3574-3574 (1988) STANDARD simple automatic COMMENT These data kindly submitted in computer readable form by: Kazuhiro Mitsui Department of Biochemistry Yamanashi Medical college Tamaho, Nakakoma-gun Yamanashi 409-38 Japan Phone: 0552-73-1111 x2257 FEATURES from to/span description pept 313 633 acidic ribosomal protein A1 signal 125 135 UASrpg box1 signal 182 192 UASrpg box2 signal 716 721 poly(A) signal variant 303 303 a in [1]; g in [3] variant 684 685 tt in [1]; t in [3] variant 719 719 a in [1]; t in [3] BASE COUNT 270 a 165 c 157 g 289 t ORIGIN 1 gatcttatta aactctagta tcttgtctaa tacttcattt aaaagaagcc ttaaccctgt 61 agcctcatct atgtctgcta catatcgtga ggtacgaata tcgtaagatg ataccacgca 121 actttgtaat gatttttttt ttttcatttt ttaaagaatg cctttacatg gtattgaaaa 181 aaatatctat aactttgcga tcctccttct gttctgaata atttttagta aaagaaatca 241 aaagaataag aaatagtccg ctttgtccaa tacaacagct taaaccgatt atctctaaaa 301 taacaagaag aaatgtctac tgaatccgct ttgtcttacg ccgccttgat tttggctgac 361 tctgaaatcg aaatctcttc tgaaaagttg ttgactttga ctaacgctgc caatgtccca 421 gatgaaaata tctgggctga tatttttgct aaggctttgg acggccaaaa cttgaaggac 481 ttattggtca acttcagcgc tggtgctgct gccccagctg gtgtcgctgg tggtgtcgct 541 ggtggtgaag ccggtgaagc cgaagctgaa aaggaagaag aagaagctaa agaagaatcc 601 gatgacgaca tgggtttcgg tttatttgat tagaagtgcc gcactgttta gaagaaattg 661 catattctaa catttaaaat tttttataat ttttctatat agtcgctttt aatacaataa 721 gacagtactt tctttttgtt caataccatc tttcgcatct cttctatgct atatataatg 781 ccacgttgtg ctcgaaggaa aagcctgcaa acctgactac tactaataca ataatgttcc 841 atcatatcaa gaaaactgcg ctaacttgta aaaatactgt c // LOCUS YSCCDC23X 3107 bp ds-DNA PLN 15-AUG-1990 DEFINITION S. cerevisiae CDC23 protein gene. ACCESSION D90081 KEYWORDS CDC23 protein. SOURCE Saccharomyces cerevisiae (strain X2180-1A or X2180-1B; cell line D22) DNA, clone YX34. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 3107) AUTHORS Doi,A. and Doi,K. TITLE Cloning and nucleotide sequence of the CDC23 gene of Saccharomyces cerevisiae JOURNAL Gene (1990) In press STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Kenji Doi The Institute of Scientific and Industrial Research Osaka University 8-1 Mihogaoka Ibaraki Osaka 567 Japan Phone: 06-877-5111 Fax: 06-877-4977 FEATURES from to/span description ORF 765 2645 ORF for CDC23 site 153 161 calcium-binding site BASE COUNT 1010 a 595 c 629 g 873 t ORIGIN 1 tcgagaatac cctgaagttt ctcagatgga acccatttat ccatttcata cactgtcact 61 gatggatcag acacttccac ctgctttgct aaatcaacag aaagtcgctt cagtaaattt 121 atgtacctta aagtatccct attcaaatgt tcgaaagtag aatagtactc gctaatactc 181 ttaggattct gtactcccgc tgcaacgtcc cttccagttt ttgtatcctc caaaagctgt 241 gcttctcttg tttgatattt atcgtatcgc aggcggatgg aactatttat cagctccctg 301 tgtaaatcag gcaacttctt gagggattca gtaagcagat catcagatga tctagggtct 361 gccaatactg ataatatatc taaaatattt aataagtggg tttggctttc ctgcaaactt 421 tgttcctcct cgcagagaga ttcaaaatac gtacgacctt cttcctttgt catgctatga 481 acttgataac ttgagcagtg taaacctgat aaactagtcg ctgttgtttc ttactgtaag 541 atactgcact tctgcagctt cttaagtatt ctacttacca agtttctatt atttttcaat 601 gcgcgtacat aaaaagcact tcgggtaaaa caaacacttc ataatagcag accaagtact 661 gcggtactca catcaaatta agaggaagaa gggagtatta gcgagcggaa aactgaaatc 721 tggatatata ctgatcagaa tcagattgtg aagcatttag aaccatgaat gacgacagcc 781 aggataaaat aatacatgat atacgtattc agctacgaaa ggctgccaca gaattatcac 841 gatggaagct atacggctcc tcaaagtggg cagcagaggc gctagcaggt cttgcagaag 901 ctattgatgt tgatcaaaca cactctttag ccgatgaatc gccactaaga aataaacaag 961 gtgtaccgaa acagatgttt gaaataccac aaaacgggtt tggcctatca gagactgagt 1021 atgacctgta cctccttggt tctacgttgt ttgatgctaa agagtttgat cgatgcgttt 1081 tttttctaaa agatgtcact aatccatacc ttaagttctt aaaattatac agtaaatttc 1141 tatcgtggga taagaaaagc caggaaagta tggaaaatat cttaactaca gggaagttta 1201 cggacgaaat gtacagagct aacaaagatg gggatggtag tgggaatgag gatataaatc 1261 aaagtgggca ccaacgcgcc aatttaaaaa tggtcagcaa tgagcatgag tcacaatcga 1321 acatatcatc tattttgaag gaaattaaca catttctgga gtcttatgaa ataaagatag 1381 acgatgatga ggccgattta gggttagcac tgttgtatta tttacgaggg gtcatcttaa 1441 agcaagagaa gaatatttct aaggcaatgt cgtcattctt gaaatctctg agttgctact 1501 cctttaactg gtcctgctgg ctggagttaa tggactgttt acaaaaggtt gacgatgcat 1561 tgcttttaaa taattatcta tatcaaaatt tccaattcaa attttctgaa aatcttggta 1621 gtcaacgaac gatagaattt aatataatga tcaaattttt caagctaaaa gtgtttgagg 1681 agcttaatgg ccagttagag gactactttg aagatttaga gtttttgtta caagttttcc 1741 ccaatttcac ttttttaaag gcttacaatg ctactattag ttacaacaat ttggattatg 1801 ttaccgcaga aagccgattt gatgacatcg ttaaacaaga tccgtaccgt ctcaacgatt 1861 tggaaaccta ctccaatatt ctatacgtca tgcagaagaa ttcaaaatta gcctatttgg 1921 cgcaattcgt ctcccaaata gatagattta gaccggaaac atgttgtatc atagcgaact 1981 attacagtgc ccgacaggaa catgaaaaat ctatcatgta tttccgtcga gcactaactt 2041 tggataaaaa aacaacaaac gcatggactt tgatgggtca cgaatttgtt gaactaagca 2101 attcacatgc cgcaatagaa tgctatcgtc gggccgtaga tatatgccct cgagacttca 2161 aagcatggtt tggtttgggc caggcttatg ctctcctgga catgcattta tattctcttt 2221 actacttcca gaaagcttgc actttgaaac cttgggatcg tcggatttgg caagtattgg 2281 gagaatgtta tagtaagacg ggaaataagg tagaagctat aaaatgctac aaaagatcca 2341 taaaagcttc acaaacggtc gatcaaaata cttcaatata ttaccggtta gcgcaactat 2401 atgaagaact tgaagacttg caagaatgta agaagttcat gatgaaatgt gtagatgtgg 2461 aagaacttct ggaaggtata gtaacagatg aaaccgtgaa ggctaggctt tggctggcaa 2521 tatttgagat taaggcagga aactaccaat tggcttatga ttatgccatg ggggtatcta 2581 gtggaacgtc tcaagagatt gaagaggctc gtatgctggc tcgggagtgc agaaggcata 2641 tgtagtgaag tgaacataca catagctatt cgtactaaat gatatgaaat ttttataaat 2701 gccaggctat atagctattt aaagtgacca tggcagaagg atgaaccgag gtaatacggc 2761 tagtacaaaa gcaacaaagt taggaataca atttgagaaa cgaagaccat agaaaatact 2821 tgtgcgattg aacttccttc caaaaaaaaa atagcgtcaa agaaagatga gtggactacc 2881 gcccccacct cctggttttg aagaggacag cgacttagca cttccaccac caccaccacc 2941 accgcctgga tacgaaatcg aagaactgga taatccgatg gtgccatcat cggtaaatga 3001 ggatacattc cttccgcctc caccacctcc tccaagcaac ttcgaaataa acgctgaaga 3061 aattgtggac ttcacattac caccgccacc accccctcca ggtctag // LOCUS BSPRSDA 2996 bp ds-DNA BCT 15-AUG-1990 DEFINITION Bacillus sp. raw-starch-digesting amylase gene. ACCESSION D90112 KEYWORDS alpha amylase; raw-starch-digesting amylase. SOURCE Bacillus sp.(strain B1018) DNA. ORGANISM Bacillus sp. Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 2996) AUTHORS Itokor,P., Tsukagoshi,N. and Udaka,S. TITLE Nucleotide sequence of the raw-starch-digesting amylase gene from Bacillus sp. B1018 and its strong homology to the cyclodextrin glucanotransferase genes JOURNAL Biochem. Biophys. Res. Commun. 166, 630-636 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Shigezo Udaka Department of Food Science and Technology, Faculty of Agriculture Nagoya University Furo-cho, Chikusa-ku Nagoya 464 Japan Phone: 052-782-5111 x6356 Fax: 052-781-4447 FEATURES from to/span description pept 313 2454 raw-starch-digesting amylase precursor (EC 3.2.1.1) sigp 313 393 raw-starch-digesting amylase signal peptide matp 394 2454 raw-starch-digesting amylase mature peptide binding 302 306 ribosome binding site signal 101 106 -35 region signal 125 130 -10 region rpt 2567 2580 inverted repeat rpt 2585 2598 inverted repeat BASE COUNT 764 a 852 c 774 g 606 t ORIGIN 10 bp upstream of RsaI site. 1 ttatttgagt acattttatg tattcccaca ttgcgcccga tatctacgct tagaaaaaaa 61 tcgtcggaaa agcgccccaa aaaattttta ttgttattta ttgacagttg tattcgcttt 121 catctacaat gatggaggaa cgcaatactc gatataattt aagggccatg cattccgtga 181 ccgcacaccc ggtatggaac aaccccggta tctcgatgga gaagccgggg ttttttgtcg 241 ccctttttta ggaggtgatc cggcgacagc ggatcaagcc tggaattcaa ataattacat 301 aggaggtata acatgaagaa atttctgaaa atgacagccg cgttttccct gggattatcc 361 ctggcgttcg ggcttttcag ccccgcccag gccgcgccgg atacctcggt atccaacaag 421 caaaatttca gcaccgacgt catctatcaa attttcaccg acaggttttc ggacggcaat 481 cccgccaaca atccgaccgg cgcggcgttt gacggaacct gcacgaacct ccggctgtat 541 tgcggcggcg actggcaggg catcatcaac aaaatcaacg acggttacct gaccgggatg 601 ggcgttaccg ccatctggat ctcccagccg gtcgaaaaca tctacagcat catcaattat 661 tccggcgtca acaacacggc ctatcacggc tactgggccc gggacttcaa gaagacgaat 721 ccggcctacg gcacgattgc ggacttccag aacctgatcg ccgccgcgca tgccaaaaac 781 atcaaagtca ttatcgactt cgccccgaac catacgtcgc ccgcctcgtc cgaccagcct 841 tcctttgcgg aaaacggccg gctgtacgat aacggcacgc tgctcggggg atacacgaac 901 gatacgcaga acctgttcca ccataacggc ggcacggact tttccacgac cgaaaacggc 961 atctacaaaa acctgtacga tctcgccgac ctgaaccata acaacagcac gtcggacgtc 1021 tacttgaagg acgcgatcaa aatgtggctg gatctcggca tcgacggcat ccgcatggat 1081 gcggtgaagc atatgccgtt cggctggcag aagagcttta tggctgccgt caacaactat 1141 aagccggtct ttaccttcgg cgaatggttc ctgggcgtaa acgaagtagg cccggaaaac 1201 cataagtttg ccaacgaatc cggcatgagc ctgcttgatt tccgttttgc ccaaaaggtg 1261 cggcaggtgt tccgggacaa caccgacaat atgtacggcc tgaaggcgat gctggagggc 1321 tccgcagccg attacgccca ggtggatgac caggtgacgt tcatcgacaa ccatgacatg 1381 gagcgtttcc acgcaagcaa tgcaaaccgc cggaagctgg agcaagcgct ggcgttcacg 1441 ctgatcctcg cgcgcgtccc cgccatttat tacggcaccg agcagtacat gtcgggtggg 1501 accgatccgg acaaccgggc gcggatccct tccttctcca cgtcgacgac cgcctatcaa 1561 gtcattcaaa agctggcgcc gctgcgcaag tccaacccgg ccatcgccta cggatcgacg 1621 caggagcgct ggatcaacaa cgacgtgctc atttatgagc gcaaattcgg cagcaacgtt 1681 gccgtcgttg ccgtcaaccg caatttgaac gcgccggctt ccatttcggg acttgtcact 1741 tccctgccgc aaggcagcta caatgacgtc cttggcggcc ttctgaacgg caacacgtta 1801 acggtaggct ccggcggagc cgcctccaat ttcacgcttg cggccggcgg cacggcggtg 1861 tggcagtaca ccgcggcaac ggcgacgccg accatcgggc atgtcgggcc gatgatggcc 1921 aagccgggcg tgacgatcac gatcgacggc cgcggcttcg gctctagcaa aggcaccgtc 1981 tacttcggca cgacggcggt gagcggcgcc aacatcacgt cttgggaaga cacgcagatc 2041 aaagtgaaaa ttccggccgt cgcaggcggc atctacaaca ttaaagtcgc aaacgccgcc 2101 ggaacggcaa gcaacgtgta cgacaacttc gaggtattgt ccggagacca ggtcagcgtc 2161 cgcttcgtgg tcaacaacgc gacaacggcc cttgggcaaa atctctacct gacgggcaat 2221 gtcagcgagc tggggaactg ggacccggca aaagcgatcg ggccgatgta caaccaggtc 2281 gtttaccaat atccgaactg gtattatgac gtcagcgttc cggccggcaa aacgatcgag 2341 ttcaagtttt tgaaaaaaca aggctccacc gtcacgtggg aaggcggcag caaccacacc 2401 ttcaccgcgc cgtccagcgg caccgcgacc attaacgtga attggcagcc ataaggcgtg 2461 agggataggc ggctggcatt cattggaaaa ggcggactat atgacgtccg ttccgtgagc 2521 aacgctcatc gctccgttca aaccgccaca aggctgatct tcagccaaaa aaagagggga 2581 cctttcccct ctttttttat ttccgttgac taacggtatt cccaaaaatt acattggggg 2641 ataagctccc tcccctctaa tagcaataac aagagcgtaa acccaaccag gtgatccata 2701 gcgtgcggtc gcctttaatc ccggtatcaa aatgtatcct accttacaaa aatgatcgga 2761 tcatacaaaa tagtgcgtac tactcaacga aatagaacct acatacagaa cgatcgatcc 2821 agatttcaac gaacggcacg gtcgtttaaa aaaatggtgt gcggggtgcg agaatatgca 2881 agaatatcaa ctgactttga aagataagcg gatcgtatgg gggaaggcga tcgaccttga 2941 gcctctcatt ggcaaatatc ctggcgactc gattagacag ggcatgaacg aagctt // LOCUS HUMALPL 3101 bp ds-DNA PRI 15-AUG-1990 DEFINITION Human alkaline phosphatase (EC 3.1.3.1) gene. ACCESSION D90054 KEYWORDS alkaline phosphatase. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3101) AUTHORS Matsuura,S., Kishi,F. and Kajii,T. TITLE Characterization of a 5'-flanking region of the human liver/bone/ kidney alkaline phosphatase gene: Two kinds of mRNA from a single gene JOURNAL Biochem. Biophys. Res. Commun. 168, 993-1000 (1990) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Fumio Kishi Department of Pediatrics Yamaguchi University School of Medicine Ube, Yamaguchi 755 Japan Phone: 0836-22-2258 Fax: 0836-22-2696 FEATURES from to/span description pre-msg 2130 3101 alkaline phosphatase mRNA and intron IVS 2341 3101 alkaline phosphatase intron rpt 375 664 Alu sequence rpt 2631 2926 Alu sequence BASE COUNT 775 a 752 c 692 g 882 t ORIGIN chromosome 1; map position p34-36.1. 1 aagctttctc cagcgagtat gatggtttct gcaggttctt ggcataaagc ctttatcaga 61 ttaaggaaat tcttttcaat acctggtttg ctgagggctt ctgtcacatc gttttctgtg 121 accccattcc ctctccctag gtgagcacgt caagtttgat cagggtgtta aactgccacc 181 cctgtgccta tgattcccaa atttatactc taacccagac ttctttttca aatgccagag 241 ccaaatattc agctgcctcc ttagtgtctc cacttctaaa agacatctcc aactcaacat 301 atccaaaaac aagttcctga ttgtctccac ctcatgcctc aaaagaccac cccaaacgcc 361 gaaaggctga atgctttttt ctttttcttt tttttttttt tctgagatgg agtctcactc 421 tgttgcccag gctggactgc agtgatgcga tctcagctca ctgcaaactc tgcttcctgg 481 gttcaagtga ttctcctacc tcagcctctc aggtagctgg gactacaggt gcacaccacc 541 atgcccagct aatttttgta gagagagttt caccatgttg gccaggctgg tctcaaacac 601 ctgaccttaa gggatccacc cgcctcagcc tctcaaagtg ctgggattac aggtgtgagc 661 catcgcactt ggctcggtag tatatggctc agaaacattg ccatttacaa tagttcccca 721 aaaagcaaaa ttcttaggta taaatctgga ttcagagtcc agaatgctaa ccattacacg 781 atggaacccg taggtataaa tctaagaaaa catatccaag atctacaggc tgaagactac 841 agagtgctga taaaaccgaa gaactctgac tgaatgagtg gagagacgtg gtgtcttcat 901 gactgggcaa ctccatgtgg tatagacgta aaccctccca cattgatctg tggatttaat 961 accataccta tcaaaaacac agtggtggag gacagatcag ggatcgccag gtttagggat 1021 ggggggattg tgtaactata aagaacgcaa gagagatttt tggggtggca gagctgttct 1081 gggtcctgac ggtggcggtg gtggttacat aaatctatcc atgtgtcaaa cgtcagaaca 1141 ctcattttac acttgggggc aacagaaatc cctccctctg gagggggtga ctgatggtaa 1201 cctgattgct aattctggaa tcaggagccc tgtggtcagg tttctgctct gcaacttcct 1261 gttggtaacc ttgggcaagt ctccgtccag agccttggtt ttctcatctg taaaaggaga 1321 tgataggtcc ttttctgtcc actgcatagc tgattagtga aacatcatgg tgaaattctt 1381 tatgaactat ggagtgcagc acatagactt gctttcattt tgtcagtatc ctttatagat 1441 tgttcatgta agctcccaaa gagtagtatt tattttattg aaataaaatg cacgtagaga 1501 aaaatgtgtg tatcatacat tgacagctga acccaccgtg taaccagcac ccacccaccc 1561 agatcaatca taaaccgaac cgcaccagca ccccagcagc ccgttcccgt ttccgtaccc 1621 tccacgtgga gcctccgttc tgtctcccaa cgccctgggt tagtttttat actttctgtc 1681 atcggaatca cactgtaagt gctcttgggt ttagcttcct ttgctcaagc ttaccttgtg 1741 cgattcattc atgttgttgt gaggagctgt ggatcatcca ttctccttgc tgtctgtggt 1801 ggtttctgtg ttgtgaacac acacaatgta ttatccagcc tgccgtagat ggaggcagtt 1861 ttgaagccat tataaacagg gctgatgtgc acattctgct ggagagaaac gggtcccagg 1921 gtacaggtag gatgatcagc ttcggtagat cctgccggtt ttcccatgcg ctgtgcctgt 1981 ctgcactcca ccaacggcga gcggaccttc cggtagttaa acatcttcac gaactcttgg 2041 actttcctgc acacacagag aagataattt tggatggctc ttcccttccc cccacaacct 2101 tccttagggc actggctttc aactgatgta aatatttact atgccaagca ctaggagggc 2161 agagacaaac aagacaaagt cctcacactt agaaactccc ggtgtggcag ctgagatggc 2221 ccaggaaaga actatattac cttcaaaaag agaggtacat gcgatgtttg aggtggcatg 2281 aagctcagtg gtgttatatt ggaatgagtg agtgaccatc ctggagcctt cctgaaagag 2341 gtgacttcat ttttaagtga ttttaaataa tagtttaatg aattagtatt tcgtattcag 2401 ttaataacat ttttctgatt ttaggatttg ctatagaaat atttggaaac cgtaaagtag 2461 aacaaaaaaa aaatgtagga atcatctgaa attccaaatt ctaccactca cagttaagtg 2521 ttgttagatg ttagatgtgg gatattgcct tttaatttcc actctgcgcc gctaccccca 2581 gcccctaccc cagagccgtc acttctggca ctggagcgca gcttgcgtgg tttttttttt 2641 tttttttttt tttttgagac agagtcctgc ctgtcgccca ggctggagtg cagtggcgcg 2701 atctcggctc actgcaactc cccctcccgg gttcacgcca ttctcctgcc tcagcctcct 2761 gagtagctgg gactacaggc gcccgctacc tctcccggct aattttttgt atttttagta 2821 gagacggggt ttcactgtgt tagccaggat ggtctcgatc tcctgacttc gtgatccgcc 2881 cgcctcggcc tcccaaagtg ggcagatcac ctgaggtaga gagttcgaga ccagacctga 2941 ccaacatgga ccccatctct actaaaaata caaaattggc cagggcatgg tggcgcaagc 3001 tgctaatccc agccactcag ggaggctgag gctggaaaat tgcttgaacc cgacctgcag 3061 gcatgcaagc ttggcgtaat catggtcata gctgttttcc t // LOCUS RATCNRAA 2337 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat calcineurin A alpha mRNA, complete cds. ACCESSION D90035 KEYWORDS calcineurin; calcineurin A alpha; calmodulin binding protein; calmodulin-dependent protein phosphatase; isoform. SOURCE Rat brain, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 2337) AUTHORS Ito,A., Hashimoto,T., Hirai,M., Takeda,T., Shuntoh,H., Kuno,T. and Tanaka,C. TITLE The Complete Primary Structure of Calcineurin A, a Calmodulin Binding Protein Homologous with Protein Phosphatases 1 and 2A JOURNAL Biochem. Biophys. Res. Commun. 163, 1492-1497 (1989) STANDARD full staff_entry COMMENT These data kindly submitted in computer readable form by: Takayoshi Kuno Department of Pharmacology Kobe University School of Medicine 7-5-1 Kusunoki-cho, Chuo-ku Kobe 650 Japan Phone: 078-341-7451 x3273 Fax: 078-351-6531 Peptides, 78-329 and 391-414, seem to be putative catalytic domain and calmodulin binding domain, respectively. FEATURES from to/span description ORF 208 1773 calcineurin A alpha signal 1944 1950 polyadenylation signal BASE COUNT 649 a 523 c 596 g 569 t ORIGIN 1 cgggaggagg agtgaaggcg gcggcggcgg aggagggacg cgcggagccg gcagtaactt 61 tcgagccagc ccagagcccg gagctccagc cgagcggttt gcagcgcggc ggcgcggcgc 121 tgagtgtctg gcccgccggt gcggtcgggg tgtgcagtcg gacgggacca gcagcgcgtc 181 gctgtccccc cctcccggtg actggagatg tccgagccca aggcgattga tcccaagttg 241 tcgactacgg acagggtggt gaaagccgtt ccatttccgc caagtcaccg gctgacagca 301 aaggaagtgt ttgataacga tgggaagcct cgtgtggata tcttaaaagc acatctcatg 361 aaggaaggca ggctggaaga aagtgtcgcg ttgagaataa taacagaggg tgcttcgatt 421 ctccgacagg aaaaaaactt gctggatatt gatgccccag tcacagtttg cggggacatc 481 catggacaat tctttgactt gatgaagctc tttgaagtgg gaggatctcc tgccaacact 541 cgctacctct tcttagggga ctatgttgac agagggtact tcagtatcga atgtgtgctg 601 tatttgtggg ccttgaaaat tctttacccc aaaacactgt ttttacttcg tggaaaccat 661 gaatgtaggc acctaacaga gtatttcacg tttaaacaag aatgtaaaat aaagtattca 721 gaacgcgttt atgacgcctg tatggatgcc ttcgactgcc ttcccctggc tgcgctgatg 781 aaccaacaat tcctgtgtgt acacggtggt ttgtctccag agattaacac tctagatgac 841 atcagaaaat tagaccgatt caaagaacca cctgcttatg ggcctatgtg tgacatcttg 901 tggtcagacc ccctggagga ctttggaaat gagaagactc aggaacattt cactcacaac 961 acagtcaggg gttgttcgta cttctacagt tacccggctg tatgtgactt cctgcagcac 1021 aataatttgt tgtccatact ccgagcccac gaagcccagg acgcagggta ccgcatgtac 1081 aggaaaagcc aaacaactgg cttcccgtct ctaattacga tcttctcggc accaaattac 1141 ttagatgtgt acaataataa agctgcagtg ttgaagtacg agaacaacgt gatgaacatc 1201 aggcagttca actgctcccc ccatccgtac tggctcccaa atttcatgga tgttttcacc 1261 tggtcgctgc catttgttgg ggagaaagtg actgagatgc tggtaaacgt cctgaacatc 1321 tgctcagatg atgaactggg gtcagaagaa gatggatttg acggagccac ggctgcagcc 1381 cggaaggagg tcatcaggaa caagatccga gcaataggca aaatggccag agtattctca 1441 gttctcagag aagagagtga gagcgttcta actctgaagg gcctgacccc gactggcatg 1501 ctccccagcg gagtgctctc tggcgggaaa caaactctgc aaagcgctac tgttgaggcc 1561 attgaggctg atgaagccat caaaggattc tcaccacaac ataagattac cagcttcgag 1621 gaggccaagg gcttagaccg aattaacgag aggatgccgc ctcgcagaga cgccatgcct 1681 tccgacgcca accttaactc catcaacaag gctctcgcct cagagactaa cggcacagac 1741 agcaacggca gtaatagcag caatattcag tgaccacttc ctgttcactt tttttttttg 1801 agctgcaggg catgatgggt ttgctgcatc tcagcagttg gatgttcttg cctctgacgg 1861 tagcttgttt gctctggggg ggccaggaat tggattcagt ttacactatc atgaaaaaaa 1921 aaaagaggga gagagagaga gataataaaa ctatattttg gtgagggtgg tgattaaaca 1981 cctcttttgg gtatgccttt aaaaatgctt ctaggaaaaa aaaagtttta aaaagaaagc 2041 taatgctagt ctatacttca atgttagggg aatgaacacg ttttcctagc gcactgggga 2101 cttttagata ggttaatgaa aggcctttta ttctgttact ggacacgaaa actttgtcta 2161 atttcttata ctctattgta cgtttacagt cgcagcacta aaaatggatg acatcaaaca 2221 tttttaaaca gaaaaaaaag atgtacaaac taaataagga ctatttattg ataatgtttt 2281 gctactcttg tcagacaatg gctataaact gaattaggca gtcttaaaaa aaaaccg // LOCUS PHALPO 5710 bp ds-DNA PLN 15-AUG-1990 DEFINITION P.chrysosporium lignin peroxidase genes, complete cds. ACCESSION M37701 M22720 KEYWORDS lignin peroxidase. SOURCE P.chrysosporium (strain BKM-F-1767 (ATCC 24725)) DNA. ORGANISM Phanerochaete chrysosporium Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes; Agaricales; Corticiaceae. REFERENCE 1 (bases 3402 to 5365) AUTHORS Walther,I., Kaelin,M., Reiser,J., Suter,F., Fritsche,B., Saloheimo,M., Leisola,M., Teeri,T., Knowles,J.K.C. and Fiechter,A. TITLE Molecular analysis of a Phanerochaete chrysosporium lignin peroxidase gene JOURNAL Gene 70, 127-137 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3543; 5096 to 5710) AUTHORS Huoponen,K., Ollikka,P., Kaelin,M., Walther,I., Maentsaelae,P. and Reiser,J. TITLE Characterization of lignin peroxidase-encoding genes from lignin-degrading basidiomycetes JOURNAL Gene 89, 145-150 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by J.Reiser, 22-FEB-1989. FEATURES from to/span description pept 652 712 lignin peroxidase lpoB, exon 1 770 923 lignin peroxidase lpoB, exon 2 976 1032 lignin peroxidase lpoB, exon 3 1087 1301 lignin peroxidase lpoB, exon 4 1354 1395 lignin peroxidase lpoB, exon 5 1474 1552 lignin peroxidase lpoB, exon 6 1603 2026 lignin peroxidase lpoB, exon 7 2079 2143 lignin peroxidase lpoB, exon 8 2197 2218 lignin peroxidase lpoB, exon 9 pept 5098 5038 (c) lignin peroxidase lpoA, exon 1 4977 4824 (c) lignin peroxidase lpoA, exon 2 4770 4714 (c) lignin peroxidase lpoA, exon 3 4659 4445 (c) lignin peroxidase lpoA, exon 4 4391 4350 (c) lignin peroxidase lpoA, exon 5 4296 4218 (c) lignin peroxidase lpoA, exon 6 4167 3744 (c) lignin peroxidase lpoA, exon 7 3689 3625 (c) lignin peroxidase lpoA, exon 8 3562 3541 (c) lignin peroxidase lpoA, exon 9 IVS 713 769 lpoB intron A IVS 924 975 lpoB intron B IVS 1033 1086 lpoB intron C IVS 1302 1353 lpoB intron D IVS 1396 1473 lpoB intron E IVS 1553 1602 lpoB intron F IVS 2027 2078 lpoB intron G IVS 2144 2196 lpoB intron H IVS 2219 769 lpoB intron I IVS 5037 4978 (c) lpoA intron A IVS 4823 4771 (c) lpoA intron B IVS 4713 4660 (c) lpoA intron C IVS 4444 4392 (c) lpoA intron D IVS 4349 4297 (c) lpoA intron E IVS 4217 4168 (c) lpoA intron F IVS 3743 3690 (c) lpoA intron G IVS 3624 3563 (c) lpoA intron H BASE COUNT 1204 a 1613 c 1594 g 1299 t ORIGIN 1 agctcacttt acctatacac atctgcattc agtccttcca gttctctgac cctaacatcc 61 ggtaaatgta ccttcagtga tcgggacgga aggtatgggc ctttcgcata ggtgggtaat 121 ctgcgactgt atgttttgta tggtaccctg agacagtcac ttactgtttc tgctcgctcc 181 aggtaccatt gtcccgcctc tgcgtgattt ccgaggctgg actggcccat ctctgcccac 241 cctgtcctca tctgccaaga gccatcggaa tgccaagccg tgaccactcc aaccggtccc 301 gttctctcag ccactgcgca agtttcttac aggagggctg cttcgccgtt cattcgcggc 361 ctccggatag ctagcgagct tcgatgctcg tggccaatta tggaagcagt cgttgatcgc 421 accggtcccg tactgccttc gctcacaagc cgtgttgttg cgagactctc attcgctggc 481 tcagggtatt gtgcctgttt gctgaggcac agtgcagtca atacacactt gtctcgtcag 541 gacgcggttt gacattccgt ggtgcgtgaa acggtataaa agggatacgc gatttgcagc 601 atatcctcag gccattcgtc ttctacagcc caagttccaa gtcaaacggt catggccttc 661 aagcagctcg tcgcagcgat ttccctcgca ctctcgctca ccactgccaa tggtacgcac 721 cgcttctgca tgctgtgata acgggccccg actaacgcct ccgctgcagc cgccgtggtc 781 aaggagaagc gcgccacctg ctccaacggc gccaccgttg gcgacgcgtc ctgctgtgct 841 tggttcgatg tcctcgacga catacagcag aacctgttcc aaggaggcca gtgcggcgct 901 gaggcccacg agtctatccg tctgtaagtc aatacgctgg tgttgcgcca aggtcataga 961 ttcactttgc tgcagcgtgt tccacgatgc tattgccatc tctcctgcta tggaggccca 1021 gggcaagttc gggtatgtct ttccggcatg gcaatatttt acagcagaca ctgagatatt 1081 gcgcagcggt ggtggtgctg acggctccat catgatcttc gacgacatcg agcccaactt 1141 ccaccctaac attggcctcg acgagattat caacctccag aagccgttcg tccagaagca 1201 cggtgtcacc cctggtgact tcatcgcctt cgccggtgct gtcgcgctca gcaactgccc 1261 gggtgcccca cagatgaact tcttcactgg tcgtcgtcct ggtacgtctc ctctacgaat 1321 cgatctcgac acctcattca tatcgcctta tagctaccca gcccgcaccc gatggtctcg 1381 ttcccgagcc tttccgtgag tttgcagacc acttcatcgc atagttctta gctgacctct 1441 tcatcgcata gttcttagct gacttcagca cagacaccgt cgaccagatc atcgctcgtg 1501 ttaacgatgc cggcgagttc gacgagctcg agcttgtctg gatgctttcc gcgtaagtga 1561 ctgccgcctc gaatttccat cccgacttac accccgattc agccactccg ttgctgcagt 1621 caacgacgtg gacccgaccg tccagggcct gcccttcgac tccacccccg gaatcttcga 1681 ctcgcagttc ttcgtcgaga ctcagttccg tggtatcctc ttccccggct ccggtggcaa 1741 ccagggtgag gtcgagtccg gtatggctgg cgagatccgc atccagaccg accacactct 1801 cgcccgcgac tcccgcaccg cttgcgagtg gcagtcgttc gtcaacaacc agtccaagct 1861 cgtctccgac ttccagttca tcttccacgc cctcacccag ctcggccagg acccgaacgc 1921 gatgaccgac tgctcggatg tcatcccgat ctcgaagccc atccccggca accttccgtt 1981 ctcgttcttc ccccctggca agagcatgaa ggatgttgag caggctgtag tatccgattc 2041 agtccttgtc gcagagctta tgctgacggc ttctgcagtg cgccgagacc cccttcccca 2101 gcctcgtcac tctccccggc cccgcgacct ctgtcgctcg catgtgagta tctccgacgg 2161 tctatgaagc ccccagctga catattcctc ttccagcccc ccgccgccgg gtgcttaagt 2221 cattctatcg gtcatctttg gctgaaacgg agtatttgga atacggctca ctcgtaacgg 2281 taacttgcgc tcaagtgttt agaaatgtct cctttgtatc tacgcgattg gtccgctttt 2341 gacgatagat cgttactgtg ttcattgaaa ttctcgtccg cgcgccctgg agcgaaccgg 2401 ttagcattgc cacacgagag ctcttccgtt gctccaactc gagctgtaat ggtccaacgc 2461 tccacgctac atcaatttaa cctctcatgg gtacggtgta ttcggcaagt ttatctcaca 2521 taataagagg cacgctatca ttcgacgata caagaacatg agccttcgct tcgtttatga 2581 tattggttca ctgtcgagct aatttctgag ggttagcgct ctgacatgat cagctacagg 2641 aacggaggcc gtaccttgaa tgtgcccata aacccgctgt cttattcttc tcaaattgat 2701 tcttcatgtt tgaatcacgt ttgcaggtgc attcgtgtac ctgcggcgcg tacacgcggt 2761 atgtattggt cgcaaatcgc atcatggtga gatcttgctc ttcactcttg aagttgctac 2821 cgtataccac catgtgcagg aattctcgta catccctgtt tctcctcgaa tgtatgtgga 2881 gccagggaaa ccctaacccc ggattctgct gagatgcgtc gatgcacgca gccgtagcgg 2941 aggtccgtga ggtccgctcc ggccacgaag caggggccgt cctgaccggt cgaaggtcat 3001 gtcgtgcgac atagtcggct tccaggagga cgatatcgac caatacgtcg aaaggaggag 3061 actgcgggtc taggctggac gctgtttgcg agggcccggg ggagaacgag gccattggga 3121 gtcagcgaga ttattgaata gtcgaagggt attcattgag tcactaaggg aaacacttct 3181 gagccgctgg tagtacttgt gtatgcccgg gttctgcgcc tgataattag cctcgctcct 3241 ccgttgacgt tgggttttgg caataggaca tcaccacttt caccacgcgg acgcaatgcg 3301 aagggcacga gtggtatctc aatagctagt taccttccaa gaccctcaat catgatcgga 3361 agaagaggat gtgcaccgat atttcataag cccacggcag atatcgtaag agagtagacg 3421 aatgagattc gtagttaggt gcagagatac gatgaatgaa atctagtaaa gccgaagttc 3481 cgtcacgagt tagccggcca ccgttacagt cggtttgagg agtattctgt atggcatcat 3541 ttaagcaccc ggaggcggag ggctggagaa ggagcatgtc agcccagatt gcatttcctg 3601 aaagatctca tggattgtac tcacatgcgc tggacggacg tctcggggcc cgggagagtg 3661 gtgagagtcg ggaagggggt ctccgcacac tgtcatgcga tgttcagcag ccactctact 3721 gcatggtggg gtgaaatacg caccgcctgc tcaacgtcct tgatggtctt gccagcgggg 3781 aagaacgaga atgggaggtt gccagggatg ggcttggact gcgggataac atccgagcag 3841 tcggtcatcg cgttcgggtc ctggccgagc tgggtgaggg cgaggaagat gaactggaag 3901 tcatcgacga gcttggactg gttgttgacg aaggactgcc attcacacgc cgtgcgcgag 3961 tcgcgggcga tagtgtggtc ggactggatg cgaatttcgc cagggagcgg cgactcgacc 4021 tcgccttggt tgccaccaga gccggggaag gcggtaccac gaagctgagt ctcgacgaag 4081 aactgggagt cgaagattcc gggggtcgag tcaaagggca gaccctggac ggtcgggtcg 4141 acgtcgttca ccgctgcgac ggagtgcctg tcgaggtctc aggaagggag tgtcgaagtc 4201 aacagtgagt gacttacgcg gagagcatcc agacaagctc gagctcatcg aactcgcctg 4261 cgtcgttgac acggttgatg atttggtcga cagtgtctgc atgctagtca gtatagaccg 4321 cacctaactg cttggataag accacttacg gaagggctcg gggacaaggc catcaggagc 4381 gggctgggta gctaaagcag acagttagtt cgtaccatcc gcaaagcgag ttttgcaggt 4441 ataccaggtg cacgaccagt gaagaagttc atctgcgggg caccagggca gttgctgagc 4501 gcgacacgac cagcgaaggc gatgaagtca ccaggggtga caccgtgctt ctgaacgaat 4561 ggcttctgga gcttgacgat ctcgtcgaga ccgatgttag ggtggaacgc agtctcgata 4621 tcgtcgaaga tcatgatgga gccgtcagca ccaccgccgc tgcaaggagg gatcagcaaa 4681 cgactaggtg gcgcaacgcg ggtggcaact tacccgaact tgccctgtgc ctccatggcg 4741 ggcgaaattg cgatggagtc gtggaagacg ctgggcgggg tgttcaaaca tgcatagcag 4801 gagatcgcga cgggatcact cacagacgaa tcgactcgtg cgcctcagcg ccgcactggc 4861 cgccgtggaa caggttctgc tggatatcat ccaggacgtc gaaccaagcg cagcacgacg 4921 catcgccgac ggtcttgccg ttggaacagg tggcgcgctt ctcgatcacc gcagccgctg 4981 cacaagacga cgttcagcat gcagtccact ggtcaacgct aactgcgatg ggcataccgt 5041 tcgcagccga gagcaagaga gcgagagaga tagctgcgaa gagctgcttg aaggccatgt 5101 ccgctgtgtt gctggtgctg agtgggactg aagagactgg atgtctgagg gactgcggtg 5161 gtcctgtcgc ccttttatac cctaggcgtg gtcgacgtcc tggtattgtt cgccgtagaa 5221 cagtgtcgaa tcgacgtgac gcggtgcgcg gacatgcacg acactgcgcc agccaatgag 5281 gacgctgcca aaacgcagcc tgtgagcgag ttggtgcggt gccggcaacc atcaccgact 5341 cgtctcacat ttgggccact gcgtcgagcg cagttcgcgc cggcaccgct gttgaatagc 5401 acgcgagctc tgcaagaaag aatagggcgg cccatgagaa cagaaatccg agtcagagga 5461 attaactgcg cgtgccgatg agtcttgaca tgaggatgat ctaacgaaga gaccttgcat 5521 tgagccgttt ccagtgctgc caggggtaat cagtcggcat tactgccaag tccggggatg 5581 tactgctagc tcactcccat cgcaatatgt caccgagtat tgcctttgtg aacataccat 5641 tgattcggtc ccgatcatgc acgaacgact cccgcaaagt ggggccagtg actatcacgt 5701 ccgtgctcag // LOCUS XANXCAA 2333 bp ds-DNA BCT 15-AUG-1990 DEFINITION X.campestris major extracellular endoglucanase (engXCA) gene, complete cds. ACCESSION M32700 KEYWORDS major extracellular endoglucanase. SOURCE X.campestris DNA. ORGANISM Xanthomonas campestris Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Pseudomonadaceae. REFERENCE 1 (bases 1 to 2333) AUTHORS Gough,C.L., Dow,J.M., Keen,J., Henrissat,B. and Daniels,M.J. TITLE Nucleotide sequence of the engXCA gene encoding the major endoglucanase of Xanthomonas campestris pv. campestris JOURNAL Gene 89, 53-59 (1990) STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by C.L.Gough, 09-MAR-1990. FEATURES from to/span description pept 383 1864 major extracellular endoglucanase (engXCA) precursor sigp 383 457 major extracellular endoglucanase signal peptide (put.) matp 458 1861 major extracellular endoglucanase (put.) BASE COUNT 444 a 800 c 734 g 355 t ORIGIN 1 gaattcccgg ggatcacaaa cgacgcgaac aagccgacct gcgggtccac gcctgcgacg 61 aacgcaaagg cgatgacttc gggaatcagg gcgaacgtgg caacggcgcc agccatcagt 121 tcgcgcgcag gcgaggcgcc attgcgccag ttcggtgcgc aggaaggaca tgggggacac 181 tccagggaca agaacgacat gcctgcggac agcgcgcagg gggcactagt gtgcgggaaa 241 cggccgctcc cgcagccgcg atgtgatcgg tgcggcaatg gtgttttctg tggggacgat 301 cacaccacgc gacgcgcgca cagaccaaga tgcccgcctt accgcgctcg ggtgtcgagc 361 ccggttctct agggagatca ccatgtccat attcaggacc gcaagcacgc tcgctttggc 421 caccgccctc gcactggccg ccgggccggc cttcagctat tccatcaaca acagcaggca 481 gatcgtcgac gacagcggca aggtcgtgca gctcaagggt gtgaacgtgt tcggcttcga 541 aaccggcaac cacgtgatgc atggcctgtg ggcacgcaac tggaaggaca tgatcgtgca 601 gatgcagggc ctgggcttca acgccgtgcg cctgccgttc tgcccggcca cgctgcgtag 661 cgacaccatg ccggccagca tcgactacag ccgcaacgcc gacctgcagg gcctgacctc 721 gctgcagatc ctcgacaagg tgatcgccga attcaatgcg cgcggcatgt atgtgctgct 781 ggatcaccac acccccgatt gcgccggcat ttccgagctc tggtacaccg gctcctatac 841 cgaggcacag tggctggccg acctgcgctt tgtggccaac cgctacaaga acgtgccgta 901 tgtactcggc ctggatctga agaacgaacc gcacggcgcc gccacctggg gtaccggcaa 961 cgccgccacc gattggaaca aggctgccga gcgcggctcg gccgcggtgt tggcggtcgc 1021 gccgaagtgg ctgatcgcgg tggaaggcat caccgacaac ccggtgtgct ccaccaacgg 1081 cggcatcttc tggggcggca acctgcagcc gctggcctgc accccgctca acatcccggc 1141 caaccgcctg ctgctggccc cgcacgtgta cggcccggac gtgttcgtgc agtcgtactt 1201 caacgacagc aacttcccca acaacatgcc cgccatctgg gaacgccatt tcggtcagtt 1261 cgccggcacg catgcgctgt tgctgggcga gttcggtggc aagtacggcg aaggcgacgc 1321 acgcgacaag acctggcagg acgcgctggt gaagtacctg cgcagcaagg gcatcaacca 1381 gggcttctac tggtcgtgga atcccaacag cggcgacacc ggcggcatcc tgcgcgatga 1441 ctggaccagc gtgcgccagg acaagatgac cctgctgcgc acgctgtggg gcaccgccgg 1501 caataccacg ccgacgccga ctcccacacc tacgcccaca ccgacaccga cgcctacccc 1561 cacgccgacg cccaccccgg gcaccagcac cttcagcacc aaggtgatcg cctcgccggt 1621 ggtggggtcg gcagcgcgaa aactgccggc ggcatcgcgg ctggcttgcc attggccggc 1681 cagcagcacg ggttggagag tctgggtcat cgcggcacct tcggttacgt ggaagcgccc 1741 gcacgcagca cgggcgatcg aacggcggat gagggtaacg cgcctgcgac gtgccacccg 1801 tttgaatcgt ggaccactac cggcaccggc ccatacaacg cagcacgcac cgcggctgcg 1861 ctaaacaagg ccgcgcgacg gcggtggcgc gtgctcagtg caggctgggc gcggtggcga 1921 tggcgtggtc gatcaccttc agcgctgcct cgcgctcggc accgtccacc accaggcgtg 1981 gcgcacggac acgctcgctg cccaggccca ccttttcctg caccagtttg atcagctgca 2041 cgaacttggg cacggtatcc aggcgcagca gcggcaggaa ccagtcgtac agttccttgg 2101 cggcggggta accgccgtcg cgtgccagtt cgaacaggcg taccgactcc ttcggactac 2161 tgtacttgac cagcccggcg atccacccct tggcgcccat gctcaggcct tcgacgatgg 2221 cgtcgtccat gccgaccagc agcgccagac gatcgcccag caattcctgc agcgcggcga 2281 agcggcgcac atcgccggaa gattccttta ctgcctgcag gattggggaa ttc // LOCUS FLAHANENJ8 1458 bp ss-RNA VRL 15-AUG-1990 DEFINITION Influenza virus A/NJ/8/76 (H1N1) hemagglutinin/neuraminidase (seg 4) gene, complete cds. ACCESSION M27970 KEYWORDS hemagglutinin/neuraminidase. SOURCE Influenza virus A/NJ/8/76, cDNA to viral RNA, clones pNA[6,28], passed in embryonated eggs. REFERENCE 1 (bases 1 to 1458) AUTHORS Miki,T., Nishida,Y., Hisajima,H., Miyata,T., Kumahara,Y., Nerome,K., Oya,A., Fukui,T., Ohtsuka,E., Ikehara,M. and Honjo,T. TITLE The complete nucleotide sequence of the influenza virus neuraminidase gene of A/NJ/8/76 strain and its evolution by segmental duplication and deletion JOURNAL Mol. Biol. Med. 1, 401-413 (1983) STANDARD simple staff_entry FEATURES from to/span description pept 21 1430 hemagglutinin/neuraminidase precursor sigp 21 125 hemagglutinin/neuraminidase signal peptide matp 126 1427 hemagglutinin/neuraminidase BASE COUNT 462 a 257 c 343 g 396 t ORIGIN 1 agcaaaagca ggagtttaaa atgaatacaa atcaaagaat aataaccatt gggacaatct 61 gtctaatagt tggaataatt agtctattat tgcagatagg aaatataatc ttgttatgga 121 tgagccattc aattcagact ggagaaaaaa gccatcctaa ggtatgcaac caaagtgtca 181 ttacctatga aaacaacaca tgggtgaacc agacttatgt aaacattagc aataccaata 241 ttgctgctgg acagggtgtg actccaataa tactagccgg caattcctct ctttgcccaa 301 tcagtgggtg ggctatatac agcaaagaca atagcataag gattggttcc aaaggagaca 361 tttttgtcat gagagagcca ttcatttcat gctctcactt ggaatgcaga accttttttc 421 tgacccaagg cgctttgctg aatgacaggc attctaatgg aaccgtcaag gacaggagtc 481 cttatagaac cttaatgagc tgccccatcg gtgaagctcc atctccgtac aattcaaggt 541 tcgaatcagt tgcttggtca gcaagtgcat gccatgatgg aatgggatgg ctaacaatcg 601 ggatttccgg tccagataat ggagcagtgg ctgttttaaa atacaatggt ataataacag 661 atacaataaa aagttggaga aacaaaatat taagaacaca agagtctgaa tgtgtttgta 721 taaacggttc gtgttttact ataatgactg acggcccaag caatgggcaa gcctcgtaca 781 aattattcaa aatggagaaa gggaagatta ttaagtcaat tgagctggat gcacctaatt 841 accactatga ggaatgctcc tgttaccctg atacaggcaa agtggtgtgt gtgtgcagag 901 acaattggca tgcttcgaat cgaccatggg tctctttcga tcagaatctt gattatcaaa 961 tagggtacat atgcagtggg gttttcggtg ataatccgcg ttctaatgat gggaaaggca 1021 attgtggccc agtactttct aatggagcaa atggagtgaa ggggttttca tttagatatg 1081 gcaatggtgt ttggatagga agaactaaaa gtatcagctc tagacgtgga tttgagatga 1141 tttgggatcc taatggatgg acagaaactg atagtagttt ctctatgaag caagatatta 1201 tagcattaac tgattggtcg ggatacagtg gaagttttgt ccaacatcct gaattaacag 1261 gaatgaactg cataaggcct tgtttctggg tagagttaat cagagggcaa cccaaggaga 1321 gcacaatctg gactagtgga agcagcattt ctttctgtgg cgtgaacagt ggcactgcaa 1381 gctggtcatg gccagacgga gctgatctgc cattcaccat tgacaagtag tttatccaaa 1441 aaactccttg tttctact // LOCUS HUMHIS3PRM 1125 bp ds-DNA PRI 15-AUG-1990 DEFINITION Human histone H3 gene, complete cds. ACCESSION M26150 KEYWORDS histone. SOURCE Human HeLa cell DNA, clone pST519. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1125) AUTHORS Marashi,F., Helms,S., Shiels,A., Silverstein,S., Greenspan,D.S., Stein,G. and Stein,J. TITLE Enhancer-facilitated expression of prokaryotic and eukaryotic genes using human histone gene 5' regulatory sequences JOURNAL Biochem. Cell Biol. 64, 277-289 (1986) STANDARD simple staff_entry FEATURES from to/span description pept 557 964 histone H3 /hgml_locus_uid="LV0006C" /nomgen="H3F2" /map="1q21" mRNA 520 > 964 histone H3 mRNA (5' end + / - 4 bp) signal 422 425 CAAT box signal 463 468 CAAT box signal 485 492 TATA box BASE COUNT 298 a 283 c 267 g 277 t ORIGIN 1 gcagcggcgt gataacagct cactgtaacc tcgaactcgg gctcaagcga tcctcatcga 61 cagccttctg agtagctggg attacaggcg agagcgccac gcccgactaa gagcattttc 121 taattgccca cacttcttat gcgacaccca gaaaaataca attttaaata aagcgcatat 181 gcaaataacc ctaatcgtct ccaatattca ctgatttctt ttttatattt taactagaaa 241 caattggagg tttccgcgtt gctttgtgtg gttgtaaatt ttaagacttc aggaaacttt 301 tccagtacaa gacttgtcca acagtggata tagcagctaa ggggttaaca aaatgacgtc 361 agagtagcta cggtaatggg caggagcctc tcttaatctg caaccaagca cagagatgga 421 ccaatccagg aagggcgcgg ggatttttga atttacttgg gtccaatggt tggtggtctg 481 actctataaa agaagagtag ctctttcctt tcctccacag acgtctctgc aggcaaagct 541 tttctgtggt tttgccatgg ctcgtactaa acagacagct cggaaatcca ccggcggtaa 601 agcgccacgc aagcagctgg ctaccaaggc tgctcgcaag agcgcgccgg ctaccggggg 661 cgtgaaaaag cctcaccgtt accgcccggg cactgtggct ctgcgcgaga tccgccgcta 721 ccaaaagtcg accgagttgc tgattcggaa gctgccgttc cagcgcttgg tgcgagaaat 781 cgcccaagac ttcaagaccg atcttcgatt ccagagctcg gcggtgatgg cgctgcagga 841 ggcttgtgag gcctacttgg tagggctctt tgaggacaca aacctttgcg ccatccatgc 901 taagcgagtg actattatgc ccaaagacat ccagctcgct cgccgcattc gcggagaagc 961 gtaaatgtaa agtcactttt tcatcagtct taaaacccaa aggctctttt cagagccacc 1021 cacttattcc aacgaaagta gctgtgataa ttttttgttg tcttaacaga acaaatttct 1081 aaggaccccc ccggaaagca ttagactatg gcttaaagtt gatac // LOCUS MUSTUBMA1 786 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse alpha-tubulin gene M-alpha-1, 3' end. ACCESSION M28729 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 786) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 597 alpha-tubulin (AA at 1) signal 773 778 poly-A signal BASE COUNT 186 a 187 c 210 g 203 t ORIGIN 1 gaattccaga ccaacctggt accctaccct cgtatccact tccctctggc cacttatgcc 61 cctgtcatct ctgctgagaa agcctaccac gagcagcttt ctgtagcaga gatcaccaat 121 gcctgctttg agccagccaa ccagatggtg aaatgtgacc ctcgccatgg taaatacatg 181 gcttgctgcc tgctgtaccg tggtgatgtg gttcccaaag atgtcaatgc tgccattgcc 241 accatcaaga ccaagcgtac catccagttt gtggactggt gccccactgg cttcaaggtt 301 ggcattaact accagcctcc cactgtggta cccggtggtg acctggccaa ggtgcagaga 361 gctgtgtgca tgctgagcaa caccacagcc attgctgagg cctgggctcg cctagatcac 421 aagtttgatc tgatgtatgc caagcgtgcc tttgtgcact ggtatgtggg tgagggcatg 481 gaggagggtg agttctctga ggcccgtgag gacatggctg ccctagagaa ggattatgag 541 gaggttggtg tggattctgt ggaaggcgag ggggaggaag aaggagagga atactaaatt 601 aaatgtcaca aggtgctgct tccacaggga tgtttattgt gttccaacac agaaagttgt 661 ggtctgatca gttaatttct atgtggcaat gtgtgctttc atacagttac tgacttatga 721 atgattgatt ttgacagaga ccccaagctg cccatttcac ttatgggttt taaataaaat 781 actccc // LOCUS MUSTUBMA2 1198 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse alpha-tubulin gene M-alpha-2, 3' end. ACCESSION M28727 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1198) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 1059 alpha-tubulin (AA at 1) BASE COUNT 259 a 329 c 303 g 307 t ORIGIN 1 gcaaataact atgcccgtgg ccactacacc attggcaagg agatcattga ccttgtcctg 61 gacaggattc gcaagctggc tgaccagtgc acgggtctcc agggcttgtt cgttttccac 121 agctttggcg ggggaactgg ctctggcttc acctccctgc tgatggagcg gctctctgtg 181 gattacggaa agaagtccaa gctggagttc tccatttacc cagcccccca ggtttccact 241 gctgtggttg agccctacaa ttccatcctc accacccaca ccaccctgga gcactctgat 301 tgtgccttca tggtagacaa tgaggccatc tatgacatct gtcgtagaaa cctcgacatt 361 gagcgcccaa cctacaccaa ccttaaccgc cttattagcc agattgtgtc ttccatcact 421 gcttccctca gatttgatgg ggccctcaat gttgatctga cagaattcca gaccaacctg 481 gtaccctacc ctcgcatcca cttccctctg gccacttatg cccctgtcat ctctgctgag 541 aaagcctacc atgagcagct ttctgtagca gagatcacca atgcctgctt tgagccagcc 601 aaccagatgg tgaaatgtga ccctcgccat ggtaaataca tggcttgctg cctgctatac 661 cgtggtgatg tggttcccaa agatgtcaat gctgccattg ccaccatcaa gaccaagcgc 721 acgatccagt ttgtagactg gtgccccact ggcttcaagg ttggcattaa ttaccagcct 781 cccactgtgg tacccggtgg tgacctggcc aaggtgcaga gagctgtgtg catgctgagc 841 aacaccacag ccattgctga ggcctgggct cgcctagatc acaagtttga tctgatgtat 901 gccaagcgtg cctttgtgca ctggtatgtg ggtgagggca tggaggaggg tgagttctct 961 gaggcccgtg aggacatggc tgccctagag aaggattatg aggaggttgg tgtggattct 1021 gtggaaggcg agggggagga agaaggagag gagtactaag tccattcctt gagccccctg 1081 tgtcgtcaaa tgctccagta ttagttgcag gcacctgatg cttctgtgct gtttccattc 1141 tgtgatcatg tcttctccat gttgtacctc ttaagttttc catgatgtct caaactaa // LOCUS MUSTUBMB2 488 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse beta-tubulin gene M-beta-2, 3' end. ACCESSION M28739 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 488) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 303 beta-tubulin (AA at 1) signal 472 477 poly-A signal BASE COUNT 121 a 116 c 131 g 120 t ORIGIN 1 cccaacaacg tcaagacggc cgtgtgtgac atccctcctc gtggcctcaa gatgtcagcc 61 accttcattg gcaacagcac tgccatccag gagctgttca agcgcatctc ggagcagttc 121 actgccatgt tccggcgcaa ggctttcctg cactggtaca cggctgaggg catggacgag 181 atggagttca ccgaggcgga gagcaacatg aatgacctgg tgtctgagta ccagcagtac 241 caggatgcca cggccgatga gcagggcgag ttcgaggagg aggagggtga agatgaggct 301 tgagaacttc tcagatacag tgtgcaccct tagtgaactt ctgttgtcct ccagcattgg 361 tctttctatt tgtaaattat ggtgctcagt ttgcctctgt cagaaattca ctgttgatgt 421 aatagtgtga acctctttca agatcacagt attgtctcag aaatctatat gaataaaaaa 481 gcatgtgg // LOCUS MUSTUBMB4 1454 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse beta-tubulin gene M-beta-4, 3' end. ACCESSION M28730 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1454) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 1140 beta-tubulin (AA at 1) BASE COUNT 327 a 456 c 376 g 295 t ORIGIN 1 gtcgacctgg aacccggcac catcgactct gtccgctccg gcccttttgg ccagatcttt 61 cggccagaca actttgtatt tggtcaatcc ggagcaggca acaactgggc caagggtcac 121 tacaccgagg gcgcgcagtt agtggatgcc gtcctggacg tggtgcgcaa agaggcggaa 181 agctgcgact gtctccaggg cttccagctc acccactcgc tcggaggtgg caccggctca 241 ggcatgggga ccttgctcat cagcaagatc cgagaggagt ttccagacag gatcatgaat 301 acgttcagcg tggtgccatc acccaaggtg tctgacacgg tggtggagcc ctacaatgcc 361 acactgtctg tgcatcagct ggtggagaac actgatgaga cctactgcat cgacaacgag 421 gccctgtacg acatctgctt ccgtacgctc aagctgacca cgcccacgta cggggacctc 481 aaccacctcg tgtcagccac catgagtgga gtcaccacct gcctacgttt cccgggccag 541 ctcaatgcag acctacgcaa gctggctgtg aacatggtgc cattcccccg tctccacttc 601 ttcatgccag gattagcacc cttgaccagc aggggcagcc agcagtaccg ggccctcacc 661 gtccctgagc tgacccaaca ggtgttcgat gctaagaaca tgatggctgc gtctgacccg 721 agacacggtc gctacctgac tgtggctgct gtcttccggg gacggatgtc catgaaggag 781 gtagacgagc agatgttaag tgtgcagagc aagaacagca gttacttcgt tgagtggatc 841 cccaacaatg tgaaggcagc cgtatgtgac atcccgcccc gcggcctgaa gatggcagcc 901 accttcatcg gcaacagcac tgccatccag gagctgttca agcgcatctc ggagcagttc 961 accgccatgt tcagacgcaa ggccttcctg cactggtaca cggccgaagg catggacgag 1021 atggagttta cggaagcaga gagcaatatg aacgacctgg tgtccgagta ccagcagtac 1081 caggatgcca ctgctgaaga gggcgagttc gaagaggagg ctgaagagga ggtggcttaa 1141 gtctcctgcc atcactctgt ccctggggcc caccagcaaa gctttgaccc taagcatcac 1201 acccctgcac ctagttgcct cattccctag gaccccatga gcatcttcac catgaggcca 1261 agcccaggtt gcttctattt gcttcacctt taactcctaa accccactgt ctctccaacc 1321 tgccagggaa gggctcttct agttcccatg agcgcccctc aacacatgta cacacgcaca 1381 cacactccac cttcttagat cttgaaaatc ctttccttta tgccctgtcc cttccccagc 1441 actcctgaac cgat // LOCUS MUSTUBMB5 542 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Mouse beta-tubulin gene M-beta-5, 3' end. ACCESSION M28732 KEYWORDS alpha-tubulin. SOURCE Mouse 15-21 day old brain, and 18 day old embryo, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 542) AUTHORS Lewis,S.A., Lee,M.G.-S. and Cowan,N.J. TITLE Five mouse tubulin isotypes and their regulated expression during development JOURNAL J. Cell Biol. 101, 852-861 (1985) STANDARD simple staff_entry FEATURES from to/span description pept < 1 363 beta-tubulin (AA at 1) signal 524 529 poly-A signal BASE COUNT 126 a 136 c 149 g 131 t ORIGIN 1 gaggtggatg agcagatgct caatgtgcag aacaagaata gcagctactt cgtggaatgg 61 atccccaaca atgtcaagac agctgtctgt gacatcccac cgcgtggcct caagatggca 121 gtcaccttca ttggaaacag cacagccatc caggagctgt tcaagcgcat ctctgagcag 181 tttacggcta tgttccgccg gaaggctttc ctccactggt acacggctga gggcatggac 241 gagatggagt tcaccgaggc tgagagcaac atgaacgacc tggtgtctga gtaccagcag 301 taccaggatg ccaccgctga agaggaagag gatttcggag aggaggcaga agaggaggcc 361 taacggcaga gagccctgca tcagctcagg ctgcttagac tccctcagcc tttctccaac 421 tgccctttgt cctccagttt ctttctgctg cctctgtctt gtatttgttt tgcttctgtt 481 ttctcattgg gggtaaatgg tgcctggcac atggcaggca ctcaataaat atttgtttgt 541 gg // LOCUS XELPAL 353 bp ss-mRNA VRT 15-AUG-1990 DEFINITION X.laevis parvalbumin mRNA, 3' end. protein. ACCESSION M28644 KEYWORDS parvalbumin. SOURCE X.laevis tadpole, cDNA to mRNA, clone lambda-PV1. ORGANISM Xenopus laevis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 353) AUTHORS Kay,B.K., Shah,A.J. and Halstead,W.E. TITLE Expression of the Ca2+ -binding protein, parvalbumin, during embryonic development of the frog, Xenopus laevis JOURNAL J. Cell Biol. 104, 841-847 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 339 parvalbumin (AA at 1) BASE COUNT 92 a 86 c 90 g 85 t ORIGIN 1 agatttacta tggcattcgg tggtatcctg agtgaggctg acatctctgc tgccctgcag 61 aactgccaag ctgctgactc cttcaacttc aaaactttct ttgcccagtc tggtctgagc 121 agcaagtccg cagatgatgt gaaaaacgtc tttgccatcc tcgaccagga caggagcggc 181 ttcattgagg aagaggaact gaagttgttc ctccagaact tcagcgcaag tgccagggca 241 ctgactgatg ctgaaaccaa ggccttcctg gcagctggtg actctgatgg tgatggcaaa 301 attggagttg aagaattcca gtccctagtc aaaccttgaa gaagtaagac caa // LOCUS RATMLVI4 100 bp ss-mRNA ROD 15-AUG-1990 DEFINITION Rat Moloney murine leukemia provirus Mlvi-4 mRNA, partial sequence. ACCESSION M36432 KEYWORDS provirus. SOURCE Rat Moloney murine leukemia virus-induced T-cell lymphoma, cDNA to mRNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 100) AUTHORS Tsichlis,P.N., Lee,J.S., Bear,S.E., Lazo,P.A., Patriotis,C., Gustafson,E., Shinton,S., Jenkins,N.A., Copeland,N.G., Huebner,K., Croce,C., Levan,G. and Hanson,C. TITLE Activation of multiple genes by provirus integration in the Mlvi-4 locus in T-cell lymphomas induced by Moloney murine leukemia virus JOURNAL J. Virol. 64, 2236-2244 (1990) STANDARD simple staff_entry FEATURES from to/span description mRNA < 1 > 100 Mlvi-4 mRNA recomb 73 74 Rat DNA end/provirus DNA start BASE COUNT 24 a 29 c 19 g 28 t ORIGIN 1 ttactggaag ccctcctcat catgggattt catcacagta aacaacaatc tcacctctga 61 ccaggctgtc caggattctc ctcatggttt gtcgaaggtc //
GenBank-Updates@genbank.bio.net (08/16/90)
LOCUS THARGA 635 bp ds-DNA BCT 11-AUG-1990 DEFINITION T.acidophilum 5S rRNA gene. ACCESSION M32297 KEYWORDS 5S ribosomal RNA. SOURCE T.acidophilum (strain 122-1B2) DNA, clone pT5PH-23. ORGANISM Thermoplasma acidophilum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Thermoplasmales. REFERENCE 1 (bases 1 to 635) AUTHORS Ree,H.K. and Zimmerman,R.A. TITLE Organization and expression of the 16S, 23S and 5S ribosomal RNA genes from the archaebacterium Thermoplasma acidophilum JOURNAL Nucleic Acids Res. 18, 4471-4478 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.K.Ree 23-FEB-1990. FEATURES from to/span description rRNA 213 335 5S rRNA RNA 211 346 5S RNA ( 3' end +/- 4 bp) site 184 191 put. promoter site 333 354 put. terminator BASE COUNT 175 a 140 c 140 g 180 t ORIGIN 1 ctgcagcctt atcatgggaa aatgggcatc tgctggagac gactatgtac ttcatgcatc 61 catattatct tatgatatta aaatgctaaa gcgcatgatc gcttcggaag gaatatcgac 121 agggcgagat tcgctgtctc ataatcgccc ggatctgcgc tgaagccgta aaatcacgaa 181 aatcttatat agatgtgttc tatatagtgt tcggcaacgg tcatagcagc agggaaacac 241 cagatcccat tccgaactcg acggttaagc ctgctgcgta ttgcgttgta ctgtatgccg 301 cgagggtacg ggaagcgcaa tatgctgtta ccacttttga aatgaaagtt ttttacatct 361 atctttaatt cagtggcaat gattaatcct tggtcatcat ctgatttctt cgactatgag 421 aggctaaaga gaggaattcg gcataggcga tatgaacgtt cccttcgatc attttctctt 481 caggacttat acttggccag cgatccatag actatatcga ctacgcttta aagcatcaca 541 tgaaattcaa cgttatgact ggcctcatgc cttcaggaga gatgcatctt ggcaacaaga 601 gcgccataga ccaggtcatc ttcttccaga agctt // LOCUS THARGG 4154 bp ds-DNA BCT 11-AUG-1990 DEFINITION T.acidophilum 23S ribosomal RNA gene. ACCESSION M32298 KEYWORDS 23S ribosomal RNA. SOURCE T.acidophilum (strain 122-1B2) DNA, clones pTH1-1, pL8 and pTH3-7. ORGANISM Thermoplasma acidophilum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Thermoplasmales. REFERENCE 1 (bases 1 to 4154) AUTHORS Ree,H.K. and Zimmermann,R.A. TITLE Organization and expression of the 16S, 23S and 5S ribosomal RNA genes from the archaebacterium Thermoplasma acidophilum JOURNAL Nucleic Acids Res. 18, 4471-4478 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.K.Ree 23-FEB-1990. FEATURES from to/span description rRNA 521 3426 23S RNA RNA 429 3473 23S RNA gene (3' end +/- 5 bp) BASE COUNT 1097 a 903 c 1177 g 977 t ORIGIN 1 tggtgttaag gtggatgcac tatatctatg agagctaccg tgagatggcc ataagggaga 61 tacgcgatat gatagagtag ctgcataagg caattcggaa taacagacgc ggcggttgtt 121 catcgcatag gtagagtaaa tctcacagag gactccgttg ttatagcggt atctgcgcct 181 catagaggac aggcgtttga cgcatgcaga tacataatcg accagataaa ggagagagcg 241 cccatatgga aacgtgacat cgtgcaggga tctcctgcga gctggcactg aacccatttt 301 ttacggtatc agcaagaaaa agttaattaa aatatatttc aaattctgaa aagtgagtat 361 tataaaatta ttggataatt gggcttccgg atcaaaatgc ttatatccct cttaatgata 421 tagtccatac acgcttacaa tgcacctttt aatgcatcac attaaaaggg aagatgtcga 481 attttgatga gactagatga gagatggcac aatggattgt gctctgctaa tctgcctaga 541 ggatggcttg gttcgggcgc cgaagaagga cgtgccaagc tgcgataagc cttggggagg 601 cgcatggagc cttagatcca aggatctccg aatgggactt cctgccgtaa ggcactccga 661 aaggagaggg aacccgggga attgaaacat cttagtaccc ggaggaaaag aaatcaattg 721 agataccgtt agtaaaggcg atcgaaagcg gtagaaggca aaccgaatag cccttcgaaa 781 gaagggaaag atgtggagtt tggtcttcct ctaatgcctc ctgaagcgag atgaatcttc 841 tggaaagaag agccttagaa ggtgatagcc ctgtaatcga agcttcagaa gctacaaggg 901 gaagtaacca gagtaccatg cgtcgttttt cgcgtgggaa tttgggtggc actaacatcc 961 aaccttaaat acgtcccgag tccgatagcg aacaaagtac cgtgagggaa agctgaaaag 1021 aaacccggaa gggtggtgaa aagagcctga aactaggcag agataaactt atagggcagt 1081 taagaggtga agtcgttaac tacgatggat cgctgtccta ttgtccgtgt tgaagaacgg 1141 gccagggagt tctgacgagt ggcaaggtta atcctgaaag gagtagccgt agcgaaagca 1201 actacccgca cagcaatggg ggggtggcgt ggtaaacgcg tttagtcact cgtgagagac 1261 ccgaagccgg tcgatctaca cctgagtagg ttgaagctca gtgaaagctg ggtggaggac 1321 cgaacctatt ctgatgtgca aatcgtttgg atgacttggg tgtaggggtt aaaggccaat 1381 ctaggccggc aatagcgggt tccccccgat actacccgca ggtagacctc gatggagatt 1441 ctcggcgagg tagagcgacc gattggttgg taagcagtcg aaaggctgcg ccgacttgtc 1501 aaactccgaa cttgtcgaga tcgtagaagt cgggtgctag ggggcaggga taagctttgc 1561 ttccgtgatg ggaacaacca agacgagggt taaggtccct aagttctagt taagtgcact 1621 aaataggttt gtggccaaag acagtgggga ggtaggctca gaagcagcca tccttcaaag 1681 agtgcgtaac agctcactca ccgaggtcac atgcctagaa gatggaaggg gctaaaacta 1741 gacaccgaga ccttcgagca ccgaaaggtg atctggtagg ggggcgtgcc atgtggatag 1801 aagtctcccc gagaggaggg atggaccgca tggtatcgcg gatcctggtg aaagtagcag 1861 agaagaaccg tgagaatcgg ttccgccgaa agggctaggg ttccttggca atgttcgtca 1921 gccgagggtt agtcgatcct aaggccatac ctaacaggat atggtcgaag gggaagccgg 1981 ttaatattcc ggcacactga acgttttgcc ctgtatgaga aggttcaggg taggggcggt 2041 acgggtgcca acgtatttat gctcataagc ggatggagag tcgtaatgac gagaagttcg 2101 cgaaagagcg tatgttcccg tttgggaatc gcctcgatcc ccggatccca tgaaaatcat 2161 gcaggggtca ggttcagtat cgtaccaaga accgacactg gtgcccctag gtgagaagcc 2221 taaggcgttt tgggataatg gacgcgaggg aaatcggcaa aatagctccg tatcttcggt 2281 ataaggagtg cctattccgt gagggaatag gtcgcagtga cgaagggact ccgactgttt 2341 accacaaaca cagatcgctg ctagtccgta aggatgtgta tagcggttga aacctgccca 2401 gtgctggtac ctgaaagccc cgtacaaggg gaagaagggc cagtaaacgg cgggggtaac 2461 tatgaccctc ttaaggtagc gtaataccta gccgcttaat tggcggcttg catgaaggtt 2521 caacgtgggt cccactgtcc ccgcgttcag cccagtgaaa ttgatgtact ggtgcacaat 2581 ccagtctctc ccacgtgaaa gcgaagtccc cgtggagctt tactgcagcc tgtagctgtg 2641 gtgcgatcct gactgcgtag tgtaggaagg agccgtcgaa gctctggttt cggccggagt 2701 ggaggcgcca atgaaacact tccctctcgg gattgcgtca ctaacctctt cggagggaca 2761 actattggtg ggcagtttgg gtggggcgcc acgcccctaa caacgtaaca ggggccccca 2821 aaggtcagct caggagggtc agaaatcctc cgtagagtgt aaaagcaaaa gctggcttga 2881 ctgtgttgca gacaactagc aacgcagatg cgaaagcagg gtttagcgaa ccacccagtt 2941 cctccttagt gggggcgggt gataagagag aagttacccc agggataact gagtcgtcct 3001 cggcaagagt acacatcgac ccgagggttt gctacttcga tgtcgtctgt tcctatcctg 3061 gtgctgcata aggtgccaag ggtggggctg ttcgcccatt aaaagggatc ctgagatggg 3121 ttcactacgt cgcgagacag tagggttgct tctccgtggg agtgctcgat gtctgagggg 3181 aaggggcctt tagtacgaga ggaacaaggg ctcgtgacct ctagtttacc ggttgtctgg 3241 caaggcatcg ccgggtagcc acgtcatacg cggataagag ctgaaagcat ctaagctcga 3301 agccgccccc gaaaatagac atcgtcatca gatcgcctct agaagagagg tttgatagag 3361 ccgggatgta aggatcgagc ttcggcgaga ttttaagtcc acggctacta aagatcgaag 3421 gcacaatcca tgctaaattc tcgtctagtt gaagtcataa gcgtgtttga tttttgtttt 3481 ttattttttt gccgttgtag tgtgaactat gatgaaaaat tatattgata taaaagagaa 3541 atagaaaaaa tgatcactgc tttatatttt ctatctgagc ctctttcagt atctgatcgg 3601 ttatgctgtc gaaaatctca cgtatcttat catttataac gacggcaggt attccgttgt 3661 caccgttttc tgctatttct ggtattagag gtatttttcc caggaacggg aggttgtatt 3721 cctttgcgga tttctccccg ccgccgttct tgaatatgta tgtgagctta ccgcagtgcg 3781 cacacaaaac cgctcatgtt ctctattatg ccaaggacag ggagcttcag ctgttttgca 3841 aagtttattg ccttctttgc atcgagaagg gcgacatcct gcggcgttac gactatgaca 3901 acgccatttg attctggtat cagctgggcc acgctcagtg caacgtctcc ggtgccaggc 3961 ggcatgtcaa gcacaacaac atccgtatcc ttccagttta cgtcttccag aaactgctgt 4021 attgccttgt gcatcagcga accacgccat attatcggcg tatcctctgt cggtatgagg 4081 aaacccatgg acaccacgcg aacaccgtat cttgtctctg ccggtattat tccctcctcg 4141 tcagcgtaaa gctt // LOCUS HIVANT70 583 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral LTR region. ACCESSION M31171 KEYWORDS . SOURCE Human immunodeficiency virus type 1 (isolate ANT70) proviral DNA. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 583) AUTHORS De Leys,R., Vanderborght,B., Haesevelde,M.V., Heyndrickx,L., van Geel,A., Wauters,C., Bernaerts,R., Saman,E., Nijs,P., Willems,B., Taelman,H., van der Groen,G., Piot,P., Tersmette,T., Huisman,J.G. and van Heuverswyn,H. TITLE Isolation and partial characterization of an unusual human immunodeficiency retrovirus from two persons of west-central African origin JOURNAL J. Virol. 64, 1207-16 (1990) STANDARD full staff_review COMMENT Kindly provided prior to publication in computer-readable form by R. De Leys, Innogenetics, Antwerp. BASE COUNT 162 a 133 c 153 g 135 t ORIGIN 5'-terminus of 5'LTR. 1 tggaagggtt aatttactcc cataaaagag cagaaatcct ggatctttgg gtgtataaca 61 ctcagggatt cttccctgat tggcagaact acacaccagg accaggaacc aggttcccac 121 tgacatttgg atggttgttt aaactagtac cagtgtcaga agaagaggca gaaagactag 181 gaaatacatg tgagagggct aatctcctgc atccagcatg tgcccatgga tttgaagata 241 cacataaaga aatactgatg tggaagtttg atagatctct aggcaacacc catgttgcta 301 tgataactca cccagagctc ttccagaagg actaaaaact gctgacctga agattgctga 361 cactgtggaa ctttccagca aagactgctg acactgcggg gactttccag tgggagggac 421 agggggcggt tcggggagtg gctaaccctc agaagctgca tataagcagc cgctttctgc 481 ttgtaccggg tctcggttag aggaccaggt ctgagcccgg gagctccctg gcctctagct 541 gaacccgctc gttaacgctc aataaagctt gccttgagtg aga // LOCUS HIVBAN1 59 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral gag region. ACCESSION M27953 KEYWORDS . SEGMENT 1 of 2 SOURCE Human immunodeficiency virus type 1 (isolate BAN) proviral DNA. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 59) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL j acquired immuno defic syndr 2, 344-52 (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer readable form by M. Goodenow. Sibling sequences are available (see 1989: I-A-172). BASE COUNT 26 a 7 c 10 g 16 t ORIGIN 1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacta // LOCUS HIVBAN2 250 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral gag region. ACCESSION M27944 KEYWORDS . SEGMENT 2 of 2 SOURCE Human immunodeficiency virus type 1 (isolate BAN) proviral DNA. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 250) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL j acquired immuno defic syndr 2, 344-52 (1989) STANDARD full staff_review COMMENT Sibling sequences are available (see 1989: I-A-173). BASE COUNT 96 a 40 c 42 g 72 t ORIGIN 1 aaattaaccc cactctgtgt tactttaaat tgcactaagt tgaagaatgt tacttatgcc 61 aataatagta gtagggaaaa agaagaaatg aaaaactgct ctttcaccac aagaataggg 121 aataaggtgc agaaagaata tgcacttttt tataaacttg atgtagtacc aatagataag 181 gataatacaa gctttatatt gatacattgt aatacctcga ccattacaca ggcttgtcca 241 aaggtatcct // LOCUS HIVBRU 9229 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete genome. ACCESSION K02013 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 1 (HIV-1), isolate BRU proviral DNA clone lambda-J19. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9229) AUTHORS Wain-Hobson,S., Sonigo,P., Danos,O., Cole,S. and Alizon,M. TITLE Nucleotide sequence of the AIDS Virus, LAV JOURNAL Cell 40, 9-17 (1985) STANDARD full staff_review REFERENCE 2 (bases 1713 to 1748; revision of [1]) AUTHORS Alizon,M., Wain-Hobson,S., Montagnier,L. and Sonigo,P. TITLE Genetic variability of the AIDS virus: Nucleotide sequence analysis of two isolates from African patients JOURNAL Cell 46, 63-74 (1986) STANDARD full staff_review COMMENT The original LAV, sometimes called LAV-1 to distinguish it from HIV2 (LAV-2), is now referred to as HIV-1bru. An infectious clone of this virus has been constructed by Keith Peden, Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD 21205 (301) 955-3652. HIVNL43 is also an infectious clone having for its 3' half a clone of the BRU isolate. The tat sequence for the L20 clone is available (see 1989: I-A-54). FEATURES from to/span description pept 336 1874 gag polyprotein pept < 1631 4678 pol polyprotein (NH2-terminus uncertain; AA at 1631) pept 4623 5201 vif protein pept 5141 5431 vpr protein pept 5412 5626 tat protein, exon 2 (first expressed exon) 7972 8017 tat protein, exon 3 (AA at 7973) pept 5551 5626 rev protein, exon 2 (first expressed exon) 7972 8246 rev protein, exon 3 (AA at 7974) pept 5643 5888 vpu protein pept 5803 8388 envelope polyprotein pept 8390 9010 nef protein IVS 290 5358 tat, rev, nef subgenomic mRNA intron 1 IVS 5627 7971 tat cds intron 2 IVS 5627 7971 rev cds intron 2 LTR < 1 180 5' LTR LTR 8679 > 9229 3' LTR rpt < 1 97 R repeat 5' copy rpt 9133 9229 R repeat 3' copy binding 182 199 primer (Lys-tRNA) binding site revision 1712 1749 atttcttcagagcagaccagagccaacagccccaccag in [2]; ag in [1] signal 9205 9210 poly-A signal BASE COUNT 3289 a 1656 c 2232 g 2052 t ORIGIN Cap site of genomic RNA. 1 ggtctctctg gttagaccag atttgagcct gggagctctc tggctaacta gggaacccac 61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt 121 gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca 181 gtggcgcccg aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag 241 gactcggctt gctgaagcgc gcacggcaag aggcgagggg aggcgactgg tgagtacgcc 301 aaaaattttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa 361 gcgggggaga attagatcga tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat 421 ataaattaaa acatatagta tgggcaagca gggagctaga acgattcgca gttaatcctg 481 gcctgttaga aacatcagaa ggctgtagac aaatactggg acagctacaa ccatcccttc 541 agacaggatc agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtgc 601 atcaaaggat agagataaaa gacaccaagg aagctttaga caagatagag gaagagcaaa 661 acaaaagtaa gaaaaaagca cagcaagcag cagctgacac aggacacagc agccaggtca 721 gccaaaatta ccctatagtg cagaacatcc aggggcaaat ggtacatcag gccatatcac 781 ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtga 841 tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa 901 acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag 961 ctgcagaatg ggatagagtg catccagtgc atgcagggcc tattgcacca ggccagatga 1021 gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat 1081 ggatgacaaa taatccacct atcccagtag gagaaattta taaaagatgg ataatcctgg 1141 gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac 1201 caaaagaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag 1261 cttcacagga ggtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcgaacccag 1321 attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag 1381 catgtcaggg agtgggagga cccggccata aggcaagagt tttggctgaa gcaatgagcc 1441 aagtaacaaa ttcagctacc ataatgatgc aaagaggcaa ttttaggaac caaagaaaga 1501 ttgttaagtg tttcaattgt ggcaaagaag ggcacatagc cagaaattgc agggccccta 1561 ggaaaaaggg ctgttggaaa tgtggaaagg aaggacacca aatgaaagat tgtactgaga 1621 gacaggctaa ttttttaggg aagatctggc cttcctacaa gggaaggcca gggaattttc 1681 ttcagagcag accagagcca acagccccac catttcttca gagcagacca gagccaacag 1741 ccccaccaga agagagcttc aggtctgggg tagagacaac aactccctct cagaagcagg 1801 agccgataga caaggaactg tatcctttaa cttccctcag atcactcttt ggcaacgacc 1861 cctcgtcaca ataaagatag gggggcaact aaaggaagct ctattagata caggagcaga 1921 tgatacagta ttagaagaaa tgagtttgcc aggaagatgg aaaccaaaaa tgataggggg 1981 aattggaggt tttatcaaag taagacagta tgatcagata ctcatagaaa tctgtggaca 2041 taaagctata ggtacagtat tagtaggacc tacacctgtc aacataattg gaagaaatct 2101 gttgactcag attggttgca ctttaaattt tcccattagt cctattgaaa ctgtaccagt 2161 aaaattaaag ccaggaatgg atggcccaaa agttaaacaa tggccattga cagaagaaaa 2221 aataaaagca ttagtagaaa tttgtacaga aatggaaaag gaagggaaaa tttcaaaaat 2281 tgggcctgaa aatccataca atactccagt atttgccata aagaaaaaag acagtactaa 2341 atggagaaaa ttagtagatt tcagagaact taataagaga actcaagact tctgggaagt 2401 tcaattagga ataccacatc ccgcagggtt aaaaaagaaa aaatcagtaa cagtactgga 2461 tgtgggtgat gcatattttt cagttccctt agatgaagac ttcaggaagt atactgcatt 2521 taccatacct agtataaaca atgagacacc agggattaga tatcagtaca atgtgcttcc 2581 acagggatgg aaaggatcac cagcaatatt ccaaagtagc atgacaaaaa tcttagagcc 2641 ttttagaaaa caaaatccag acatagttat ctatcaatac atggatgatt tgtatgtagg 2701 atctgactta gaaatagggc agcatagaac aaaaatagag gagctgagac aacatctgtt 2761 gaggtgggga cttaccacac cagacaaaaa acatcagaaa gaacctccat tcctttggat 2821 gggttatgaa ctccatcctg ataaatggac agtacagcct atagtgctgc cagaaaaaga 2881 cagctggact gtcaatgaca tacagaagtt agtgggaaaa ttgaattggg caagtcagat 2941 ttacccaggg attaaagtaa ggcaattatg taaactcctt agaggaacca aagcactaac 3001 agaagtaata ccactaacag aagaagcaga gctagaactg gcagaaaaca gagagattct 3061 aaaagaacca gtacatggag tgtattatga cccatcaaaa gacttaatag cagaaataca 3121 gaagcagggg caaggccaat ggacatatca aatttatcaa gagccattta aaaatctgaa 3181 aacaggaaaa tatgcaagaa cgaggggtgc ccacactaat gatgtaaaac aattaacaga 3241 ggcagtgcaa aaaataacca cagaaagcat agtaatatgg ggaaagactc ctaaatttaa 3301 actacccata caaaaggaaa catgggaaac atggtggaca gagtattggc aagccacctg 3361 gattcctgag tgggagtttg tcaatacccc tcctttagtg aaattatggt accagttaga 3421 gaaagaaccc atagtaggag cagaaacgtt ctatgtagat ggggcagcta gcagggagac 3481 taaattagga aaagcaggat atgttactaa tagaggaaga caaaaagttg tcaccctaac 3541 tgacacaaca aatcagaaga ctgagttaca agcaattcat ctagctttgc aggattcggg 3601 attagaagta aatatagtaa cagactcaca atatgcatta ggaatcattc aagcacaacc 3661 agataaaagt gaatcagagt tagtcaatca aataatagag cagttaataa aaaaggaaaa 3721 ggtctatctg gcatgggtac cagcacacaa aggaattgga ggaaatgaac aagtagataa 3781 attagtcagt gctggaatca ggaaagtact atttttagat ggaatagata aggcccaaga 3841 tgaacatgag aaatatcaca gtaattggag agcaatggct agtgatttta acctgccacc 3901 tgtagtagca aaagaaatag tagccagctg tgataaatgt cagctaaaag gagaagccat 3961 gcatggacaa gtagactgta gtccaggaat atggcaacta gattgtacac atttagaagg 4021 aaaagttatc ctggtagcag ttcatgtagc cagtggatat atagaagcag aagttattcc 4081 agcagaaaca gggcaggaaa cagcatactt tcttttaaaa ttagcaggaa gatggccagt 4141 aaaaacaata catacagaca atggcagcaa tttcaccagt actacggtta aggccgcctg 4201 ttggtgggcg ggaatcaagc aggaatttgg aattccctac aatccccaaa gtcaaggagt 4261 agtagaatct atgaataaag aattaaagaa aattataggc caggtaagag atcaggctga 4321 acatcttaag acagcagtac aaatggcagt attcatccac aattttaaaa gaaaaggggg 4381 gattgggggg tacagtgcag gggaaagaat agtagacata atagcaacag acatacaaac 4441 taaagaatta caaaaacaaa ttacaaaaat tcaaaatttt cgggtttatt acagggacag 4501 cagagatcca ctttggaaag gaccagcaaa gctcctctgg aaaggtgaag gggcagtagt 4561 aatacaagat aatagtgaca taaaagtagt gccaagaaga aaagcaaaga tcattaggga 4621 ttatggaaaa cagatggcag gtgatgattg tgtggcaagt agacaggatg aggattagaa 4681 catggaaaag tttagtaaaa caccatatgt atgtttcagg gaaagctagg ggatggtttt 4741 atagacatca ctatgaaagc cctcatccaa gaataagttc agaagtacac atcccactag 4801 gggatgctag attggtaata acaacatatt ggggtctgca tacaggagaa agagactggc 4861 atctgggtca gggagtctcc atagaatgga ggaaaaagag atatagcaca caagtagacc 4921 ctgaactagc agaccaacta attcatctgt attactttga ctgtttttca gactctgcta 4981 taagaaaggc cttattagga catatagtta gccctaggtg tgaatatcaa gcaggacata 5041 acaaggtagg atctctacaa tacttggcac tagcagcatt aataacacca aaaaagataa 5101 agccaccttt gcctagtgtt acgaaactga cagaggatag atggaacaag ccccagaaga 5161 ccaagggcca cagagggagc cacacaatga atggacacta gagcttttag aggagcttaa 5221 gaatgaagct gttagacatt ttcctaggat ttggctccat ggcttagggc aacatatcta 5281 tgaaacttat ggggatactt gggcaggagt ggaagccata ataagaattc tgcaacaact 5341 gctgtttatc catttcagaa ttgggtgtcg acatagcaga ataggcgtta ctcaacagag 5401 gagagcaaga aatggagcca gtagatccta gactagagcc ctggaagcat ccaggaagtc 5461 agcctaaaac tgcttgtacc acttgctatt gtaaaaagtg ttgctttcat tgccaagttt 5521 gtttcacaac aaaagcctta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa 5581 gacctcctca aggcagtcag actcatcaag tttctctatc aaagcagtaa gtagtacatg 5641 taatgcaacc tatacaaata gcaatagcag cattagtagt agcaataata atagcaatag 5701 ttgtgtggtc catagtaatc atagaatata ggaaaatatt aagacaaaga aaaatagaca 5761 ggttaattga tagactaata gaaagagcag aagacagtgg caatgagagt gaaggagaaa 5821 tatcagcact tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg 5881 atctgtagtg ctacagaaaa attgtgggtc acagtctatt atggggtacc tgtgtggaag 5941 gaagcaacca ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat 6001 aatgtttggg ccacacatgc ctgtgtaccc acagacccca acccacaaga agtagtattg 6061 gtaaatgtga cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcatgag 6121 gatataatca gtttatggga tcaaagccta aagccatgtg taaaattaac cccactctgt 6181 gttagtttaa agtgcactga tttggggaat gctactaata ccaatagtag taataccaat 6241 agtagtagcg gggaaatgat gatggagaaa ggagagataa aaaactgctc tttcaatatc 6301 agcacaagca taagaggtaa ggtgcagaaa gaatatgcat ttttttataa acttgatata 6361 ataccaatag ataatgatac taccagctat acgttgacaa gttgtaacac ctcagtcatt 6421 acacaggcct gtccaaaggt atcctttgag ccaattccca tacattattg tgccccggct 6481 ggttttgcga ttctaaaatg taataataag acgttcaatg gaacaggacc atgtacaaat 6541 gtcagcacag tacaatgtac acatggaatt aggccagtag tatcaactca actgctgttg 6601 aatggcagtc tagcagaaga agaggtagta attagatctg ccaatttcac agacaatgct 6661 aaaaccataa tagtacagct gaaccaatct gtagaaatta attgtacaag acccaacaac 6721 aatacaagaa aaagtatccg tatccagagg ggaccaggga gagcatttgt tacaatagga 6781 aaaataggaa atatgagaca agcacattgt aacattagta gagcaaaatg gaatgccact 6841 ttaaaacaga tagctagcaa attaagagaa caatttggaa ataataaaac aataatcttt 6901 aagcaatcct caggagggga cccagaaatt gtaacgcaca gttttaattg tggaggggaa 6961 tttttctact gtaattcaac acaactgttt aatagtactt ggtttaatag tacttggagt 7021 actgaagggt caaataacac tgaaggaagt gacacaatca cactcccatg cagaataaaa 7081 caatttataa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagcgga 7141 caaattagat gttcatcaaa tattacaggg ctgctattaa caagagatgg tggtaataac 7201 aacaatgggt ccgagatctt cagacctgga ggaggagata tgagggacaa ttggagaagt 7261 gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca 7321 aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagcttt gttccttggg 7381 ttcttgggag cagcaggaag cactatgggc gcacggtcaa tgacgctgac ggtacaggcc 7441 agacaattat tgtctggtat agtgcagcag cagaacaatt tgctgagggc tattgaggcg 7501 caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagaatcctg 7561 gctgtggaaa gatacctaaa ggatcaacag ctcctgggga tttggggttg ctctggaaaa 7621 ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctggaacag 7681 atttggaata acatgacctg gatggagtgg gacagagaaa ttaacaatta cacaagctta 7741 atacattcct taattgaaga atcgcaaaac cagcaagaaa agaatgaaca agaattattg 7801 gaattagata aatgggcaag tttgtggaat tggtttaaca taacaaattg gctgtggtat 7861 ataaaaatat tcataatgat agtaggaggc ttggtaggtt taagaatagt ttttgctgta 7921 ctttctatag tgaatagagt taggcaggga tattcaccat tatcgtttca gacccacctc 7981 ccaaccccga ggggacccga caggcccgaa ggaatagaag aagaaggtgg agagagagac 8041 agagacagat ccattcgatt agtgaacgga tccttagcac ttatctggga cgatctgcgg 8101 agcctgtgcc tcttcagcta ccaccgcttg agagacttac tcttgattgt aacgaggatt 8161 gtggaacttc tgggacgcag ggggtgggaa gccctcaaat attggtggaa tctcctacag 8221 tattggagtc aggaactaaa gaatagtgct gttagcttgc tcaatgccac agccatagca 8281 gtagctgagg ggacagatag ggttatagaa gtagtacaag gagcttgtag agctattcgc 8341 cacataccta gaagaataag acagggcttg gaaaggattt tgctataaga tgggtggcaa 8401 gtggtcaaaa agtagtgtgg ttggatggcc tactgtaagg gaaagaatga gacgagctga 8461 gccagcagca gatggggtgg gagcagcatc tcgagacctg gaaaaacatg gagcaatcac 8521 aagtagcaat acagcagcta ccaatgctgc ttgtgcctgg ctagaagcac aagaggagga 8581 ggaggtgggt tttccagtca cacctcaggt acctttaaga ccaatgactt acaaggcagc 8641 tgtagatctt agccactttt taaaagaaaa ggggggactg gaagggctaa ttcactccca 8701 acgaagacaa gatatccttg atctgtggat ctaccacaca caaggctact tccctgattg 8761 gcagaactac acaccagggc caggggtcag atatccactg acctttggat ggtgctacaa 8821 gctagtacca gttgagccag ataaggtaga agaggccaat aaaggagaga acaccagctt 8881 gttacaccct gtgagcctgc atggaatgga tgaccctgag agagaagtgt tagagtggag 8941 gtttgacagc cgcctagcat ttcatcacgt ggcccgagag ctgcatccgg agtacttcaa 9001 gaactgctga catcgagctt gctacaaggg actttccgct ggggactttc cagggaggcg 9061 tggcctgggc gggactgggg agtggcgagc cctcagatgc tgcatataag cagctgcttt 9121 ttgcctgtac tgggtctctc tggttagacc agatttgagc ctgggagctc tctggctaac 9181 tagggaaccc actgcttaag cctcaataaa gcttgccttg agtgcttca // LOCUS HIVELI 9176 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete genome. ACCESSION K03454 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 1, isolate ELI, proviral DNA, clone E-H12. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9176) AUTHORS Alizon,M., Wain-Hobson,S., Montagnier,L. and Sonigo,P. TITLE Genetic variability of the AIDS virus: Nucleotide sequence analysis of two isolates from African patients JOURNAL Cell 46, 63-74 (1986) STANDARD full staff_review COMMENT Sibling sequences of the ELI isolate are available (see 1989: I-179) FEATURES from to/span description pept 336 1838 gag polyprotein pept < 1634 4642 pol polyprotein (NH2-terminus uncertain; AA at 1634) pept 4587 5165 vif protein pept 5105 5395 vpr protein pept 5376 5590 tat protein, exon 2 (first expressed exon) 7915 7999 tat protein, exon 3 (AA at 7916) pept 5515 5590 rev protein, exon 2 (first expressed exon) 7915 8195 rev protein, exon 3 (AA at 7917) pept 5607 5852 vpu protein pept 5770 8331 envelope polyprotein precursor pept 8333 8953 nef protein IVS 5591 7914 tat cds intron 2 IVS 5591 7914 rev cds intron 2 LTR < 1 180 5' LTR LTR 8625 > 9176 3' LTR rpt < 1 97 R repeat 5' copy rpt 9078 9175 R repeat 3' copy binding 182 199 primer (Lys-tRNA) binding site signal 9151 9156 mRNA polyadenylation signal BASE COUNT 3333 a 1632 c 2179 g 2032 t ORIGIN Cap site of genomic RNA. 1 ggtctctctg gttagaccag atttgagcct gggagctctc tggctagcta gggaacccac 61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt 121 gtgactctgg taactagaga tccctcagac ccctttagtc agagtggaaa atctctagca 181 gtggcgcccg aacagggacc tgaaagcgaa agtagaacca gaggagctct ctcgacgcag 241 gactcggctt gctgaagcgc gcacggcaag aggcgagggg cagcgactgg tgagtacgct 301 aaaatttttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa 361 gcgggggaaa attagataaa tgggaaaaaa ttcggttacg gccaggagga aagaaaaaat 421 atagactaaa acatatagta tgggcaagca gggagctaga acgatatgca cttaatcctg 481 gccttttaga aacatcagaa ggctgtaaac aaataatagg gcagctacaa ccagctattc 541 agacaggaac agaagaactt agatcattat ataatacagt agcaaccctc tattgtgtac 601 ataaaggaat agatgtaaaa gacaccaagg aagctttaga aaagatggag gaagagcaaa 661 acaaaagtaa gaaaaaggca cagcaagcag cagctgacac aggaaacaac agccaggtca 721 gccaaaatta tcctatagtg cagaacctac aggggcaaat ggtacatcag gccatatcac 781 ctagaacttt gaacgcatgg gtaaaagtaa tagaagaaaa ggctttcagc ccagaagtaa 841 tacccatgtt ttcagcatta tcagaaggag ccaccccaca agatttaaac accatgctaa 901 acacagtggg gggacatcaa gcagccatgc aaatgctaaa agagaccatc aatgaagaag 961 ctgcagaatg ggataggtta catccagtgc atgcagggcc tattgcacca ggccagatga 1021 gagaaccaag gggaagtgat atagcaggaa ctactagtac ccttcaggaa caaatagcat 1081 ggatgacaag taacccacct atcccagtag gagaaatcta taaaagatgg ataattgtgg 1141 gattaaataa aatagtaaga atgtatagcc ctgtcagcat tttggacata agacagggac 1201 caaaggaacc ttttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag 1261 cttcacagga tgtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcaaacccag 1321 attgcaagac tatcttaaaa gcattgggac cacaggctac actagaagaa atgatgacag 1381 catgtcaggg agtggggggg cccagccata aagcaagagt tctggctgag gcaatgagcc 1441 aagcaacaaa ttcagttact acagcaatga tgcagagagg caattttaag ggcccaagaa 1501 aaattattaa gtgtttcaat tgtggcaaag aagggcacat agcaaaaaat tgcagggccc 1561 ctaggaaaaa gggctgttgg agatgtggaa aggaaggaca ccaactaaaa gattgcactg 1621 agagacaggc taatttttta gggagaattt ggccttccca caagggaagg ccggggaact 1681 ttctccaaag cagaccagag ccaacagccc caccagcaga gagcttcggg tttggggaag 1741 agataacccc ctctcaaaaa caggagcaga aagacaagga actgtatcct ttaacttccc 1801 tcaaatcact ctttggcaac gaccccttgt cgcaataaaa atagggggac agctaaagga 1861 agctctatta gatacaggag cagatgatac agtattagaa gaaatgaatt tgccaggaaa 1921 atggaaacca aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca 1981 aatacccata gaaatctgtg gacagaaagc tataggtaca gtattagtag gacctacgcc 2041 tgtcaacata atcggaagaa atttgttgac ccagattggc tgcactttaa attttccaat 2101 tagtcctatt gaaactgtac cagtaaaatt aaagccagga atggatggcc caaaagttaa 2161 acaatggcca ttgacagaag aaaaaataaa agcattaaca gaaatttgta cagatatgga 2221 aaaggaagga aaaatttcaa gaattgggcc tgaaaatcca tacaatactc caatatttgc 2281 cataaagaaa aaagacagta ccaagtggag aaaattagta gatttcagag aacttaataa 2341 gagaactcaa gatttctggg aagttcaatt aggaataccg catcctgcag ggctgaaaaa 2401 gaaaaaatca gtaacagtac tggatgtggg tgatgcatat ttttcagttc ccttagatga 2461 agattttagg aaatataccg cctttaccat atctagtata aacaatgaga caccagggat 2521 tagatatcag tacaatgtgc ttccacaggg atggaaagga tcaccggcaa tattccaaag 2581 tagcatgaca aaaatcttag agccctttag aaaacaaaat ccagaaatgg ttatctatca 2641 atacatggat gatttgtatg taggatctga cttagaaata gggcagcata ggacaaaaat 2701 agagaaatta agagaacatc tattgaggtg gggatttacc agaccagata aaaaacatca 2761 gaaagaaccc ccatttcttt ggatgggtta tgaactccat cctgataaat ggacagtaca 2821 gtctataaaa ctgccagaaa aggagagctg gactgtcaat gatatacaga acttagtgga 2881 gagattaaac tgggcaagcc agatttatcc aggaattaaa gtaagacaat tatgtaaact 2941 ccttagggga accaaagcac taacagaagt aataccacta acagaagaag cagaattaga 3001 actggcagaa aacagggaaa ttttaaaaga accagtacat ggagtgtatt atgacccatc 3061 aaaagactta atagcagaaa tacagaaaca agggcacggc caatggacat accaaattta 3121 tcaagaacca tttaaaaatc tgaaaacagg aaagtatgca agaatgaggg gtgcccacac 3181 taatgatgta aagcaattag cagaggcagt gcaaagaata tccacagaaa gcatagtgat 3241 atggggaagg actcctaaat ttagactacc catacaaaag gaaacatggg aaacatggtg 3301 ggcagagtat tggcaagcca cttggattcc tgagtgggaa tttgtcaata cccctccttt 3361 agtaaaatta tggtaccagt tagagaagga acccataata ggagcagaaa ctttctatgt 3421 agatggggca gctaatagag agactaaatt aggaaaagca ggatatgtta ctgacagagg 3481 aagacagaaa gttgtccctt tgactgacac gacaaatcag aagactgagt tacaagcaat 3541 taatctagcc ttgcaggatt cgggattaga agtaaacata gtaacagatt cacaatatgc 3601 attaggaatc attcaagcac aaccagataa gagtgaatca gagttagtca atcaaataat 3661 agagcagtta ataaaaaagg aaaaggttta cctggcatgg gtaccagcac acaaaggaat 3721 tggaggaaat gaacaagtag ataaattagt cagtcaagga atcaggaaag tactattttt 3781 ggatggaata gataaggctc aagaagaaca tgagaaatat cacaacaatt ggagagcaat 3841 ggctagtgat tttaacctac cacccgtggt agcaaaagaa atagtagcta gctgtgataa 3901 atgtcagcta aaaggagaag ccatgcatgg acaagtagac tgtagtccag gaatatggca 3961 attagattgt acacacttag aaggaaaagt tatcctggta gcagttcatg tagccagtgg 4021 ctatatagaa gcagaagtta ttccagcaga aacagggcag gaaacagcat attttctttt 4081 aaaattagca ggaagatggc cagtaaaagt agtacataca gacaatggca gcaatttcac 4141 cagtgctgca gttaaggccg cctgttggtg ggcaggtatc aaacaggaat ttggaattcc 4201 ctacaatccc caaagtcaag gagtagtaga atctatgaat aaagaattaa agaaaattat 4261 aggacaggta agagatcaag ctgaacatct taagacagca gtacaaatgg cagtattcat 4321 ccacaatttt aaaagaagaa gggggattgg gggatacagt gcaggggaaa gaataataga 4381 cataatagca acagacatac aaactaaaga attacaaaaa caaattataa aaattcaaaa 4441 ttttcgggtt tattacagag acagcagaga tccaatttgg aaaggaccag caaagctcct 4501 ctggaaaggt gaaggggcag tagtaataca agacaagagt gacataaagg tagtaccaag 4561 aagaaaagta aagattatta gggattatgg aaaacagatg gcaggtgatg attgtgtggc 4621 aagtagacag gatgaggatt aaaacatgga aaagtttagt aaaacaccat atgtatgttt 4681 caaagaaagc taacagatgg ttttatagac atcactatga aagcccccac ccaaaaataa 4741 gttcagaagt acacatccca ctaggagaag ctagactggt aataaaaaca tattggggtc 4801 tgcatacagg agaaagagaa tggcatctgg gtcagggagt ctccatagaa tggaggaaaa 4861 ggagatatag cacacaagta gaccctggcc tggcagacca actaattcat atgtattatt 4921 ttgattgttt ttcagaatct gctataagaa aagccatatt aggagatata gttagtccta 4981 ggtgtgagta tcaagcagga cataacaagg taggatccct acagtatttg gcactaacag 5041 cattaatagc accaaaacag ataaagccac ctttgcctag tgttaggaag ctaacagaag 5101 atagatggaa caagccccag cagaccaggg gccacagagg gagccataca atgaatgggc 5161 attagagctt ttagaggagc ttaagagtga agctgttaga cattttccta ggatatggct 5221 ccatagctta ggacaacata tttatgaaac ttatggggat acctgggtag gagttgaagc 5281 tataataaga atactgcaac aattactgtt tattcatttc agaattgggt gtcaacatag 5341 cagaataggc attattcgac agagaagagc aagaaatgga tccagtagat cctaacctag 5401 agccctggaa ccatccagga agtcagccta ggactccttg taacaagtgt cattgtaaaa 5461 agtgttgcta tcattgccca gtttgcttct taaacaaagg cttaggcatc tcctatggca 5521 ggaagaagcg gagacagcga cgaggacctc ctcaaggcgg tcaggctcat caagttccta 5581 taccaaagca gtaagtagta catgtaatgc aacctttagg gataatagca atagcagcat 5641 tagtagtagc aataatacta gcaatagttg tgtggaccat agtattcata gaatatagaa 5701 ggataaaaaa gcaaaggaga atagactgtt tacttgatag aataacagaa agagcagaag 5761 acagtggcaa tgagagcgag ggggatagag agaaattgtc aaaactggtg gaaatggggc 5821 atcatgctcc ttgggatatt gatgacctgt agtgctgcag acaatctgtg ggtcacagtt 5881 tattatgggg tgcctgtatg gaaggaagca accaccactc tattttgtgc atcagatgct 5941 aaatcatatg aaacagaggc acataatatc tgggccacac atgcctgtgt acccacggac 6001 cccaacccac aagaaatagc actggaaaat gtgacagaaa actttaacat gtggaaaaat 6061 aacatggtgg aacagatgca tgaggatata atcagtttat gggatcaaag cctaaaacca 6121 tgtgtaaaat taaccccact ctgtgtcact ttaaactgta gtgatgaatt gaggaacaat 6181 ggcactatgg ggaacaatgt cactacagag gagaaaggaa tgaaaaactg ctctttcaat 6241 gtaaccacag tactaaaaga taagaagcag caagtatatg cactttttta tagacttgat 6301 atagtaccaa tagacaatga tagtagtacc aatagtacca attataggtt aataaattgt 6361 aatacctcag ccattacaca ggcttgtcca aaggtatcct ttgagccaat tcccatacat 6421 tattgtgccc cagctggttt tgcgattcta aagtgtagag ataagaagtt caatggaaca 6481 ggcccatgca caaatgtcag cacagtacaa tgtacacatg gaattaggcc agtggtgtca 6541 actcaactgc tgttgaatgg cagtctagca gaagaagagg tcataattag atccgaaaat 6601 ctcacaaaca atgctaaaaa cataatagca catcttaatg aatctgtaaa aattacctgt 6661 gcaaggccct atcaaaatac aagacaaaga acacctatag gactagggca atcactctat 6721 actacaagat caagatcaat aataggacaa gcacattgta atattagtag agcacaatgg 6781 agtaaaactt tacaacaagt agctagaaaa ttaggaaccc ttcttaacaa aacaataata 6841 aagtttaaac catcctcagg aggggaccca gaaattacaa cacacagttt taattgtgga 6901 ggggaattct tctactgtaa tacatcagga ctgtttaata gtacatggaa tattagtgca 6961 tggaataata ttacagagtc aaataatagc acaaacacaa acatcacact ccaatgcaga 7021 ataaaacaaa ttataaagat ggtggcaggc aggaaagcaa tatatgcccc tcctatcgaa 7081 agaaacattc tatgttcatc aaatattaca gggctactat tgacaagaga tggtggtata 7141 aataatagta ctaacgagac ctttagacct ggaggaggag atatgaggga caattggaga 7201 agtgaattat ataaatataa ggtagtacaa attgaaccac taggagtagc acccaccagg 7261 gcaaagagaa gagtggtgga aagagaaaaa agagcaatag gattaggagc tatgttcctt 7321 gggttcttgg gagcagcagg aagcacgatg ggcgcacggt cagtgacgct gacggtacag 7381 gccagacaat taatgtctgg tatagtgcaa cagcaaaaca atttgctgag ggctatagag 7441 gcgcaacagc atctgttgca actcacggtc tggggcatta aacagctcca ggcaagaatc 7501 ctggctgtgg aaagatacct aaaggatcaa cagctcctag gaatttgggg ttgctctgga 7561 aaacacattt gcaccactaa tgtgccctgg aactctagtt ggagtaatag atctctaaat 7621 gagatttggc agaacatgac ctggatggag tgggaaagag aaattgacaa ttacacaggc 7681 ttaatatata gcttaattga ggaatcgcag acccagcaag aaaagaatga aaaagaattg 7741 ttggaattgg acaagtgggc aagtttgtgg aattggttta gcataacaca atggctgtgg 7801 tatataaaaa tattcataat gataatagga ggcttgatag gtttaagaat agtttttgct 7861 gtgctttctt tagtaaatag agttaggcag ggatactcac ctctgtcgtt tcagaccctc 7921 ctcccagccc cgaggggacc cgacaggccc gaaggaacag aagaagaagg tggagagcga 7981 ggcagagaca gatccgtgag attgctgaac ggattctcgg cacttatctg ggacgacctg 8041 cggagcctgt gcctcttcag ctaccaccgc ttgagagact taatcttaat tgcagtgagg 8101 attgtagaac ttctgggacg cagggggtgg gacatcctca aatatctgtg gaatctccta 8161 cagtattgga gtcaggaact gaggaacagt gctagtagct tgtttgatgc catagcaata 8221 gcagtagctg aggggacaga tagagttata gaaataatac aaagagcttg cagagctgtt 8281 cttaacatac ccagaagaat aagacagggc ttagaaaggt ctttacttta aaatgggtgg 8341 caaatggtca aaaagtagta tagtgggatg gcctgctata agggaaagaa taagaagaac 8401 taatccagca gcagatgggg taggagcagt atctcgagac ctggaaaaac atggggcaat 8461 cacaagtagc aatacagcaa gtactaatgc tgactgtgcc tggctagaag cacaagaaga 8521 gagcgacgag gtgggctttc cagtcagacc ccaggtacct ttaagaccaa tgacttacaa 8581 agaagctcta gatctcagcc actttttaaa agaaaagggg ggactggaag ggctaatttg 8641 gtccaaaaag agacaagaga tccttgatct ttgggtctac aacacacaag gcatcttccc 8701 tgattggcaa aactacacac cagggccagg gatcagatat ccactaacct ttggatggtg 8761 ctacgagcta gtaccagttg atccacagga ggtagaagaa gacactgaag gagagaccaa 8821 cagcttgtta caccctatat gccagcatgg aatggaggac ccggagagac aagtgttaaa 8881 atggagattt aacagcagac tagcatttga gcacaaggcc cgagagatgc atccggagtt 8941 ctacaaaaac tgatgacacc gagctttcta caagggactt tccgctgggg actttccagg 9001 gaggcgtgga ctgggcggga ctggggagtg gctaaccctc agatgctgca tataagcagc 9061 tgctttttgc ctgtactggg tctctctggt tagaccagat ttgagcctgg gagctctctg 9121 gctagctagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaa // LOCUS HIVHXB2 9718 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete genome. ACCESSION K03455 KEYWORDS . SOURCE Human immunodeficiency virus type 1 (isolate HXB2) proviral DNA. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 653) AUTHORS Starcich,B., Ratner,L., Josephs,S.F., Okamato,T., Gallo,R.C. and Wong-Staal,F. TITLE Characterization of long terminal repeat sequences of HTLV-III JOURNAL Science 227, 538-540 (1985) STANDARD full staff_review REFERENCE 2 (bases 493 to 674; 9577 to 9718) AUTHORS Ratner,L., Haseltine,W., Patarca,R., Livak,K.J., Starcich,B., Josephs,S.J., Doran,E.R., Rafalski,J.A., Whitehorn,E.A., Baumeister,K., Ivanoff,L., Petteway,S.R.Jr., Pearson,M.L., Lautenberger,J.A., Papas,T.S., Ghrayeb,J., Chang,N.T., Gallo,R.C. and Wong-Staal,F. TITLE Complete nucleotide sequence of the AIDS virus, HTLV-III JOURNAL Nature 313, 277-284 (1985) STANDARD full staff_review REFERENCE 3 (bases 1 to 9718) AUTHORS Ratner,L., Fisher,A., Jagodzinski,L.L., Mitsuya,H., Liou,R.-S., Gallo,R.C. and Wong-Staal,F. TITLE Complete nucleotide sequences of functional clones of the AIDS virus JOURNAL AIDS Res. Hum. Retroviruses 3, 57-69 (1987) STANDARD full staff_review REFERENCE 4 (bases 5611 to 5611; revises [3]) AUTHORS Ratner,L. JOURNAL Unpublished (1987) Washington U. Med School, St. Louis, MO STANDARD full staff_review REFERENCE 5 (bases 6224 to 8794) AUTHORS Reitz,M.S. JOURNAL Unpublished (1988) STANDARD full staff_review COMMENT Submitted in computer-readable form by L. Ratner. A second envelope gene sequence for the HXB2D clone has been independently determined by Reitz and co-workers ([5]); the seven or so differences from the Ratner sequence ([3] and [4]) have not been resolved nor annotated herein. The vpU cds annotated below does not possess a start codon in the normal position (bases 6061 to 6063; "ACG"). As there are no nearby in-frame upstream or downstream start codons, HXB2 may not possess a functional vpU cds. This clone is infectious [3]. Sibling sequences are available (see 1989: I-A-47). FEATURES from to/span description pept 789 2291 gag polyprotein pept < 2084 5095 pol polyprotein (NH2-terminus uncertain; AA at 2084) pept 5040 5618 vif protein pept 5558 5794 vpr protein pept 5830 6044 tat protein, exon 2 (first expressed exon) 8378 8423 tat protein, exon 3 (AA at 8379) pept 5969 6044 rev protein, exon 2 (first expressed exon) 8378 8652 rev protein, exon 3 (AA at 8380) pept < 6061 6309 vpu protein (AA at 6061, put.) pept 6224 8794 envelope polyprotein pept 8796 9167 nef protein (premature termination) IVS 743 5776 tat, rev, nef mRNA intron 1 IVS 6045 8377 tat cds intron 2 IVS 6045 8377 rev cds intron 2 IVS 6045 8377 tat, rev, nef mRNA intron 2 LTR 1 634 5' LTR LTR 9085 9718 3' LTR rpt 454 551 R repeat 5' copy rpt 9538 9635 R repeat 3' copy binding 636 653 primer (Lys-tRNA) binding site signal 9611 9616 poly-A signal BASE COUNT 3411 a 1773 c 2370 g 2164 t ORIGIN 5' terminus of 5' LTR-U3. 1 tggaagggct aattcactcc caacgaagac aagatatcct tgatctgtgg atctaccaca 61 cacaaggcta cttccctgat tagcagaact acacaccagg gccagggatc agatatccac 121 tgacctttgg atggtgctac aagctagtac cagttgagcc agagaagtta gaagaagcca 181 acaaaggaga gaacaccagc ttgttacacc ctgtgagcct gcatggaatg gatgacccgg 241 agagagaagt gttagagtgg aggtttgaca gccgcctagc atttcatcac atggcccgag 301 agctgcatcc ggagtacttc aagaactgct gacatcgagc ttgctacaag ggactttccg 361 ctggggactt tccagggagg cgtggcctgg gcgggactgg ggagtggcga gccctcagat 421 cctgcatata agcagctgct ttttgcctgt actgggtctc tctggttaga ccagatctga 481 gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct 541 tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc 601 agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacctgaaag 661 cgaaagggaa accagagctc tctcgacgca ggactcggct tgctgaagcg cccgcacggc 721 aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactagcg gaggctagaa 781 ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat cgatgggaaa 841 aaattcggtt aaggccaggg ggaaagaaaa aatataaatt aaaacatata gtatgggcaa 901 gcagggagct agaacgattc gcagttaatc ctggcctgtt agaaacatca gaaggctgta 961 gacaaatact gggacagcta caaccatccc ttcagacagg atcagaagaa cttagatcat 1021 tatataatac agtagcaacc ctctattgtg tgcatcaaag gatagagata aaagacacca 1081 aggaagcttt agacaagata gaggaagagc aaaacaaaag taagaaaaaa gcacagcaag 1141 cagcagctga cacaggacac agcaatcagg tcagccaaaa ttaccctata gtgcagaaca 1201 tccaggggca aatggtacat caggccatat cacctagaac tttaaatgca tgggtaaaag 1261 tagtagaaga gaaggctttc agcccagaag tgatacccat gttttcagca ttatcagaag 1321 gagccacccc acaagattta aacaccatgc taaacacagt ggggggacat caagcagcca 1381 tgcaaatgtt aaaagagacc atcaatgagg aagctgcaga atgggataga gtgcatccag 1441 tgcatgcagg gcctattgca ccaggccaga tgagagaacc aaggggaagt gacatagcag 1501 gaactactag tacccttcag gaacaaatag gatggatgac aaataatcca cctatcccag 1561 taggagaaat ttataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata 1621 gccctaccag cattctggac ataagacaag gaccaaagga accctttaga gactatgtag 1681 accggttcta taaaactcta agagccgagc aagcttcaca ggaggtaaaa aattggatga 1741 cagaaacctt gttggtccaa aatgcgaacc cagattgtaa gactatttta aaagcattgg 1801 gaccagcggc tacactagaa gaaatgatga cagcatgtca gggagtagga ggacccggcc 1861 ataaggcaag agttttggct gaagcaatga gccaagtaac aaattcagct accataatga 1921 tgcagagagg caattttagg aaccaaagaa agattgttaa gtgtttcaat tgtggcaaag 1981 aagggcacac agccagaaat tgcagggccc ctaggaaaaa gggctgttgg aaatgtggaa 2041 aggaaggaca ccaaatgaaa gattgtactg agagacaggc taatttttta gggaagatct 2101 ggccttccta caagggaagg ccagggaatt ttcttcagag cagaccagag ccaacagccc 2161 caccagaaga gagcttcagg tctggggtag agacaacaac tccccctcag aagcaggagc 2221 cgatagacaa ggaactgtat cctttaactt ccctcaggtc actctttggc aacgacccct 2281 cgtcacaata aagatagggg ggcaactaaa ggaagctcta ttagatacag gagcagatga 2341 tacagtatta gaagaaatga gtttgccagg aagatggaaa ccaaaaatga tagggggaat 2401 tggaggtttt atcaaagtaa gacagtatga tcagatactc atagaaatct gtggacataa 2461 agctataggt acagtattag taggacctac acctgtcaac ataattggaa gaaatctgtt 2521 gactcagatt ggttgcactt taaattttcc cattagccct attgagactg taccagtaaa 2581 attaaagcca ggaatggatg gcccaaaagt taaacaatgg ccattgacag aagaaaaaat 2641 aaaagcatta gtagaaattt gtacagagat ggaaaaggaa gggaaaattt caaaaattgg 2701 gcctgaaaat ccatacaata ctccagtatt tgccataaag aaaaaagaca gtactaaatg 2761 gagaaaatta gtagatttca gagaacttaa taagagaact caagacttct gggaagttca 2821 attaggaata ccacatcccg cagggttaaa aaagaaaaaa tcagtaacag tactggatgt 2881 gggtgatgca tatttttcag ttcccttaga tgaagacttc aggaagtata ctgcatttac 2941 catacctagt ataaacaatg agacaccagg gattagatat cagtacaatg tgcttccaca 3001 gggatggaaa ggatcaccag caatattcca aagtagcatg acaaaaatct tagagccttt 3061 tagaaaacaa aatccagaca tagttatcta tcaatacatg gatgatttgt atgtaggatc 3121 tgacttagaa atagggcagc atagaacaaa aatagaggag ctgagacaac atctgttgag 3181 gtggggactt accacaccag acaaaaaaca tcagaaagaa cctccattcc tttggatggg 3241 ttatgaactc catcctgata aatggacagt acagcctata gtgctgccag aaaaagacag 3301 ctggactgtc aatgacatac agaagttagt ggggaaattg aattgggcaa gtcagattta 3361 cccagggatt aaagtaaggc aattatgtaa actccttaga ggaaccaaag cactaacaga 3421 agtaatacca ctaacagaag aagcagagct agaactggca gaaaacagag agattctaaa 3481 agaaccagta catggagtgt attatgaccc atcaaaagac ttaatagcag aaatacagaa 3541 gcaggggcaa ggccaatgga catatcaaat ttatcaagag ccatttaaaa atctgaaaac 3601 aggaaaatat gcaagaatga ggggtgccca cactaatgat gtaaaacaat taacagaggc 3661 agtgcaaaaa ataaccacag aaagcatagt aatatgggga aagactccta aatttaaact 3721 gcccatacaa aaggaaacat gggaaacatg gtggacagag tattggcaag ccacctggat 3781 tcctgagtgg gagtttgtta atacccctcc cttagtgaaa ttatggtacc agttagagaa 3841 agaacccata gtaggagcag aaaccttcta tgtagatggg gcagctaaca gggagactaa 3901 attaggaaaa gcaggatatg ttactaatag aggaagacaa aaagttgtca ccctaactga 3961 cacaacaaat cagaagactg agttacaagc aatttatcta gctttgcagg attcgggatt 4021 agaagtaaac atagtaacag actcacaata tgcattagga atcattcaag cacaaccaga 4081 tcaaagtgaa tcagagttag tcaatcaaat aatagagcag ttaataaaaa aggaaaaggt 4141 ctatctggca tgggtaccag cacacaaagg aattggagga aatgaacaag tagataaatt 4201 agtcagtgct ggaatcagga aagtactatt tttagatgga atagataagg cccaagatga 4261 acatgagaaa tatcacagta attggagagc aatggctagt gattttaacc tgccacctgt 4321 agtagcaaaa gaaatagtag ccagctgtga taaatgtcag ctaaaaggag aagccatgca 4381 tggacaagta gactgtagtc caggaatatg gcaactagat tgtacacatt tagaaggaaa 4441 agttatcctg gtagcagttc atgtagccag tggatatata gaagcagaag ttattccagc 4501 agaaacaggg caggaaacag catattttct tttaaaatta gcaggaagat ggccagtaaa 4561 aacaatacat actgacaatg gcagcaattt caccggtgct acggttaggg ccgcctgttg 4621 gtgggcggga atcaagcagg aatttggaat tccctacaat ccccaaagtc aaggagtagt 4681 agaatctatg aataaagaat taaagaaaat tataggacag gtaagagatc aggctgaaca 4741 tcttaagaca gcagtacaaa tggcagtatt catccacaat tttaaaagaa aaggggggat 4801 tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca tacaaactaa 4861 agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca gggacagcag 4921 aaattcactt tggaaaggac cagcaaagct cctctggaaa ggtgaagggg cagtagtaat 4981 acaagataat agtgacataa aagtagtgcc aagaagaaaa gcaaagatca ttagggatta 5041 tggaaaacag atggcaggtg atgattgtgt ggcaagtaga caggatgagg attagaacat 5101 ggaaaagttt agtaaaacac catatgtatg tttcagggaa agctagggga tggttttata 5161 gacatcacta tgaaagccct catccaagaa taagttcaga agtacacatc ccactagggg 5221 atgctagatt ggtaataaca acatattggg gtctgcatac aggagaaaga gactggcatt 5281 tgggtcaggg agtctccata gaatggagga aaaagagata tagcacacaa gtagaccctg 5341 aactagcaga ccaactaatt catctgtatt actttgactg tttttcagac tctgctataa 5401 gaaaggcctt attaggacac atagttagcc ctaggtgtga atatcaagca ggacataaca 5461 aggtaggatc tctacaatac ttggcactag cagcattaat aacaccaaaa aagataaagc 5521 cacctttgcc tagtgttacg aaactgacag aggatagatg gaacaagccc cagaagacca 5581 agggccacag agggagccac acaatgaatg gacactagag cttttagagg agcttaagaa 5641 tgaagctgtt agacattttc ctaggatttg gctccatggc ttagggcaac atatctatga 5701 aacttatggg gatacttggg caggagtgga agccataata agaattctgc aacaactgct 5761 gtttatccat tttcagaatt gggtgtcgac atagcagaat aggcgttact cgacagagga 5821 gagcaagaaa tggagccagt agatcctaga ctagagccct ggaagcatcc aggaagtcag 5881 cctaaaactg cttgtaccaa ttgctattgt aaaaagtgtt gctttcattg ccaagtttgt 5941 ttcataacaa aagccttagg catctcctat ggcaggaaga agcggagaca gcgacgaaga 6001 gctcatcaga acagtcagac tcatcaagct tctctatcaa agcagtaagt agtacatgta 6061 acgcaaccta taccaatagt agcaatagta gcattagtag tagcaataat aatagcaata 6121 gttgtgtggt ccatagtaat catagaatat aggaaaatat taagacaaag aaaaatagac 6181 aggttaattg atagactaat agaaagagca gaagacagtg gcaatgagag tgaaggagaa 6241 atatcagcac ttgtggagat gggggtggag atggggcacc atgctccttg ggatgttgat 6301 gatctgtagt gctacagaaa aattgtgggt cacagtctat tatggggtac ctgtgtggaa 6361 ggaagcaacc accactctat tttgtgcatc agatgctaaa gcatatgata cagaggtaca 6421 taatgtttgg gccacacatg cctgtgtacc cacagacccc aacccacaag aagtagtatt 6481 ggtaaatgtg acagaaaatt ttgacatgtg gaaaaatgac atggtagaac agatgcatga 6541 ggatataatc agtttatggg atcaaagcct aaagccatgt gtaaaattaa ccccactctg 6601 tgttagttta aagtgcactg atttgaagaa tgatactaat accaatagta gtagcgggag 6661 aatgataatg gagaaaggag agataaaaaa ctgctctttc aatatcagca caagcataag 6721 aggtaaggtg cagaaagaat atgcattttt ttataaactt gatataatac caatagataa 6781 tgatactacc agctatagct tgacaagttg taacacctca gtcattacac aggcctgtcc 6841 aaaggtatcc tttgagccaa ttcccataca ttattgtgcc ccggctggtt ttgcgattct 6901 aaaatgtaat aataagacgt tcaatggaac aggaccatgt acaaatgtca gcacagtaca 6961 atgtacacat ggaattaggc cagtagtatc aactcaactg ctgttaaatg gcagtctagc 7021 agaagaagag gtagtaatta gatctgtcaa tttcacggac aatgctaaaa ccataatagt 7081 acagctgaac acatctgtag aaattaattg tacaagaccc aacaacaata caagaaaaag 7141 aatccgtatc cagagaggac cagggagagc atttgttaca ataggaaaaa taggaaatat 7201 gagacaagca cattgtaaca ttagtagagc aaaatggaat aacactttaa aacagataga 7261 tagcaaatta agagaacaat tcggaaataa taaaacaata atctttaagc aatcctcagg 7321 aggggaccca gaaattgtaa cgcacagttt taattgtgga ggggaatttt tctactgtaa 7381 ttcaacacaa ctgtttaata gtacttggtt taatagtact tggagtactg aagggtcaaa 7441 taacactgaa ggaagtgaca caatcaccct cccatgcaga ataaaacaaa ttataaacat 7501 gtggcagaaa gtaggaaaag caatgtatgc ccctcccatc agtggacaaa ttagatgttc 7561 atcaaatatt acagggctgc tattaacaag agatggtggt aatagcaaca atgagtccga 7621 gatcttcaga cttggaggag gagatatgag ggacaattgg agaagtgaat tatataaata 7681 taaagtagta aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt 7741 gcagagagaa aaaagagcag tgggaatagg agctttgttc cttgggttct tgggagcagc 7801 aggaagcact atgggcgcag cctcaatgac gctgacggta caggccagac aattattgtc 7861 tggtatagtg cagcagcaga acaatttgct gagggctatt gaggcgcaac agcatctgtt 7921 gcaactcaca gtctggggca tcaagcagct ccaagcaaga atcctagctg tggaaagata 7981 cctaaaggat caacagctcc tagggatttg gggttgctct ggaaaactca tttgcaccac 8041 tgctgtgcct tggaatgcta gttggagtaa taaatctctg gaacagatct ggaatcacac 8101 gacctggatg gagtgggaca gagaaattaa caattacaca agcttaatac actccttaat 8161 tgaagaatcg caaaaccagc aagaaaagaa tgaacaagaa ttattggaat tagataaatg 8221 ggcaagtttg tggaattggt ttaacataac aaattggctg tggtatataa aattattcat 8281 aatgatagta ggaggcttgg taggtttaag aatagttttt gctgtacttt ctatagtgaa 8341 tagagttagg cagggatatt caccattatc gtttcagacc cacctcccaa tcccgagggg 8401 acccgacagg cccgaaggaa tagaagaaga aggtggagag agagacagag acagatccat 8461 tcgattagtg aacggatcct tggcacttat ctgggacgat ctgcggagcc tgtgcctctt 8521 cagctaccac cgcttgagag acttactctt gattgtaacg aggattgtgg aacttctggg 8581 acgcaggggg tgggaagccc tcaaatattg gtggaatctc ctacagtatt ggagtcagga 8641 actaaagaat agtgctgtta gcttgctcaa tgccacagcc atagcagtag ctgaggggac 8701 agatagggtt atagaagtag tacaaggagc ttgtagagct attcgccaca tacctagaag 8761 aataagacag ggcttggaaa ggattttgct ataagatggg tggcaagtgg tcaaaaagta 8821 gtgtgattgg atggcttact gtaagggaaa gaatgagacg agctgagcca gcagcagatg 8881 gggtgggagc agcatctcga gacctggaaa aacatggagc aatcacaagt agcaacacag 8941 cagctaccaa tgctgcttgt gcctggctag aagcacaaga ggaggaggag gtgggttttc 9001 cagtcacacc tcaggtacct ttaagaccaa tgacttacaa ggcagctgta gatcttagcc 9061 actttttaaa agaaaagggg ggactggaag ggctaattca ctcccaaaga agacaagata 9121 tccttgatct gtggatctac cacacacaag gctacttccc tgattgacag aactacacac 9181 cagggccagg ggtcagatat ccactgacct ttggatggtg ctacaagcta gtaccagttg 9241 agccagataa gatagaagag gccaataaag gagagaacac cagcttgtta caccctgtga 9301 gcctgcatgg gatggatgac ccggagagag aagtgttaga gtggaggttt gacagccgcc 9361 tagcatttca tcacgtggcc cgagagctgc atccggagta cttcaagaac tgctgacatc 9421 gagcttgcta caagggactt tccgctgggg actttccagg gaggcgtggc ctgggcggga 9481 ctggggagtg gcgagccctc agatcctgca tataagcagc tgctttttgc ctgtactggg 9541 tctctctggt tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg 9601 cttaagcctc aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt 9661 gactctggta actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagca // LOCUS HIVMAB 59 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) gag region. ACCESSION M27955 KEYWORDS . SOURCE Human immunodeficiency virus type 1 (isolate MAB), clone 01. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 59) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL j acquired immuno defic syndr 2, 344-52 (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer readable form by S. Wain-Hobson. Sibling sequences are available (see 1989: I-A-184). FEATURES from to/span description pept < 1 > 59 gag protein (AA at 2) BASE COUNT 24 a 7 c 12 g 16 t ORIGIN Within gag cds. 1 ctataaaaga tggataatcc tggggttaaa taaaatagta agaatgtata gccctgtca // LOCUS HIVMN 9738 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete genome. ACCESSION M17449 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 1 (isolate MN), proviral DNA (non-infectious). ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9738) AUTHORS Gurgo,C., Guo,H.-G., Franchini,G., Aldovini,A., Collalti,E., Farrell,K., Wong-Staal,F., Gallo,R.C. and Reitz,M.S.Jr. TITLE Envelope sequences of two new United States HIV-1 isolates JOURNAL Virology 164, 531-536 (1988) STANDARD full staff_review COMMENT Sequence kindly provided in computer-readable form by M. Reitz, N.C.I., Bethesda, MD. 20892 U.S.A. The MN isolate was taken from a pediatric AIDS patient in 1984. The pol coding sequence shows an in-frame stop codon at 3783; the nef protein is prematurely truncated at position 9357. The vpU protein is prematurely truncated at position 6142. FEATURES from to/span description pept 787 2307 gag polyprotein pept < 2091 3782 pol polyprotein (NH2-terminus uncertain; AA at 2091; in-frame stop codon at 3783) 3786 5111 pol polyprotein pept 5056 5634 vif protein pept 5574 5864 vpr protein pept 5845 6059 tat protein, exon 2 (first expressed exon) 8396 8486 tat protein, exon 3 (AA at 8397) pept 5984 6059 rev protein, exon 2 (first expressed exon) 8396 8667 rev protein, exon 3 (AA at 8398) pept 6076 6144 vpu protein (premature termination) pept 6239 8809 envelope polyprotein pept 8811 9359 nef protein (premature termination at 9357 relative to other HIV-1 sequences) LTR 1 633 5' LTR LTR 9106 9738 3' LTR rpt 453 550 R repeat 5' copy rpt 9558 9655 R repeat 3' copy binding 635 652 primer (Lys-tRNA) binding site site 3783 3785 pol cds in-frame stop codon signal 9631 9636 poly-A signal BASE COUNT 3463 a 1769 c 2344 g 2162 t ORIGIN Left end of viral genome. 1 tggatgggtt aatttactcc caaaagagac aagacatcct tgatctgtgg gtctaccaca 61 cacaaggcta cttccctgat tggcagaact acacaccagg gccagggatc agatatccac 121 tgacctttgg atggtgcttc aagctagtac cagttgagcc agagaagata gaagaggcca 181 ataaaggaga gaacaactgc ttgttacacc ctatgagcca gcatggatgg atgacccgga 241 gagagaagtg ttagtgtgga agtctgacag ccacctagca tttcagcatt atgcccgaga 301 gctgcatccg gagtactaca agaactgctg acatcgagct atctacaagg gactttccgc 361 tggggacttt ccagggaggt gtggcctggg cgggaccggg gagtggcgag ccctcagatc 421 gtgcatataa gcagctgctt tctgcctgta ctgggtctct ctggttagac cagatctgag 481 cctgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa agcttgcctt 541 gagtgcttca agtagtgtgt gcccgtctgt tatgtgactc tggtagctag agatccctca 601 gatcctttta ggcagtgtgg aaaatctcta gcagtggcgc ccgaacaggg acttgaaagc 661 gaaagaaaaa ccagagctct ctcgacgcag gactcggctt gctgaagcgc gcacggcaag 721 aggcgagggg cggcgactgg tgagtacgcc aaaaattctt gactagcgga ggctagaagg 781 agagagatgg gtgcgagagc gtcggtatta agcgggggag aattagatcg atgggaaaac 841 attcggttaa ggccaggggg aaagaaaaaa tataaattaa aacatgtagt atgggcaagc 901 agggagctag aacgattcgc agtcaatcct ggcctgttag aaacatcaga aggctgtaga 961 caaatactgg gacagctaca accatccctt cagacaggat cagaagaact taaatcatta 1021 tataatacag tagcaaccct ctattgtgtg catcaaaaga tagagataaa agacaccaag 1081 gaagctttag agaaaataga ggaagagcaa aacaaaagta agaaaaaagc acagcaagca 1141 gcagctgaca caggaaacag aggaaacagc agccaagtca gccaaaatta ccccatagtg 1201 cagaacatcg aggggcaaat ggtacatcag gccatatcac ctagaacttt aaatgcatgg 1261 gtaaaagtag tagaagagaa ggctttcagc ccagaagtaa tacccatgtt ttcagcatta 1321 tcagaaggag ccaccccaca agatttaaac accatgctaa acacagtggg gggacatcaa 1381 gcagccatgc aaatgttaaa agagaccatc aatgaggaag ctgcagaatg ggatagattg 1441 catccagtgc atgcagggcc tattacacca ggccagatga gagaaccaag gggaagtgac 1501 atagcaggaa ctactagtac ccttcaggaa caaataggat ggatgacaaa taatccacct 1561 atcccagtag gagaaatcta taaaagatgg ataatcctgg gattaaataa aatagtaagg 1621 atgtatagcc cttccagcat tctggacata agacaaggac caaaggaacc ctttagagac 1681 tatgtagacc ggttctataa aactctaaga gccgagcaag cttcacagga ggtaaaaaac 1741 cggacgacag aaaccttgtt ggtccaaaat gcgaacccag attgtaagac tattttaaaa 1801 gcattgggac cagcagctac actagaagaa atgatgacag catgtcaggg agtgggagga 1861 cctggtcata aagcaagagt tttggcggaa gcgatgagcc aagtaacaaa ttcagctacc 1921 ataatgatgc agagaggcaa ttttaggaat caaagaaaga ttatcaagtg cttcaattgt 1981 ggcaaagaag ggcacatagc caaaaattgc agggccccta ggaaaagggg ctgttggaaa 2041 tgtggaaagg aaggacacca aatgaaagat tgtactgaga gacaggctaa ttttttaggg 2101 aagatctggc cttcctgcaa gggaaggcgg aattttcctc agagcagaac agagccaaca 2161 gccccaccag aagagagctt caggtttggg gaagagacaa caactcccta tcagaagcag 2221 gagaagaagc aggagacgat agacaaggac ctgtatcctt tagcttccct caaatcactc 2281 tttggcaacg acccattgtc acaataaaga taggggggca actaaaggaa gctctattag 2341 atacaggagc agatgataca gtattaggag aaatgaattt gccaagaaga tggaaaccaa 2401 aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcag ataaccatag 2461 gaatctgtgg acataaagct ataggtacag tattagtagg acctacacct gtcaacataa 2521 ttggaagaaa tctgttgact cagcttgggt gcactttaaa ttttcccatt agtcctattg 2581 aaactgtacc agtaaaatta aagccaggaa tggatggccc aaaagttaaa caatggccat 2641 tgacagaaga aaaaataaaa gcattaatag aaatttgtac agaaatggaa aaggaaggga 2701 aaatttcaaa aattgggcct gaaaatccat acaatactcc agtatttgcc ataaagaaaa 2761 aagacagtac taaatggaga aaattagtag atttcagaga acttaataag aaaactcaag 2821 acttctggga agttcaatta ggaataccac atcctgcagg gttaaaaaag aaaaaatcag 2881 taacagtact ggatgtgggt gatgcatatt tttcagttcc cttagataaa gacttcagga 2941 agtatactgc atttaccata cctagtataa acaatgaaac accagggatt agatatcagt 3001 acaatgtgct tccacaggga tggaaaggat caccagcaat attccaaagt agcatgacaa 3061 aaatcttaga gccttttaga aaacaaaatc cagacatagt tatctatcaa tacatggatg 3121 atttgtatgt aggatctgac ttagaaatag ggcagcatag agcaaaaata gaggaactga 3181 gacgacatct gttgaggtgg ggatttacca caccagacaa aaaacatcag aaagaacctc 3241 cattcctttg gatgggttat gaactccatc ctgataaatg gacagtacag cctatagtgc 3301 taccagaaaa agacagctgg actgtcaatg acatacagaa gttagtggga aaattgaatt 3361 gggcaagtca gatttacgca gggattaaag taaagcaatt atgtaaactc cttagaggaa 3421 ccaaagcact aacagaagta ataccactaa cagaagaagc agagctagaa ctggcagaaa 3481 acagggaaat tctaaaagaa ccagtacatg gagtgtatta tgacccatca aaagacttaa 3541 tagcagaagt acagaagcag gggcaaggcc aatggacata tcaaatttat caagagccat 3601 ttaaaaatct gaaaacaggc aaatatgcaa gaatgagggg tgcccacact aatgatgtaa 3661 aacaattaac agaggcagtg caaaaaatag ccacagaaag catagtaata tggggaaaga 3721 ctcctaaatt tagactaccc atacaaaaag aaacatggga aacatggtgg acagagtata 3781 cgtaagccac ctggattcct gagtgggagg ttgtcaatac ccctccctta gtgaaattat 3841 ggtaccagtt agagaaagaa cccatagtag gtgcagaaac tttctatgta gatggggcag 3901 ctaacaggga gactaaaaaa ggaaaagcag gatatgttac taacagagga agacaaaagg 3961 ttgtctccct aactgacaca acaaatcaga agactgagtt acaagcaatt catctagctt 4021 tgcaagattc agggttagaa gtaaacatag taacagactc acaatatgca ttaggaatca 4081 ttcaagcaca accagataaa agtgaatcag agttagtcag tcaaataata gagcagttaa 4141 taaaaaagga aaaggtctat ctggcatggg taccagcaca caaaggaatt ggaggaaatg 4201 aacaagtaga taaattagtc agtgctggaa tcaggaaagt actattttta gatggaatag 4261 ataaggccca agaagaccat gagaaatatc acagtaattg gagagcaatg gctagtgact 4321 ttaacctacc acctatagta gcaaaagaaa tagtagccag ctgtgataaa tgtcagctaa 4381 aaggagaagc catgcatgga caagtagact gtagtccagg aatatggcaa ctagattgta 4441 cacatttaga aggaaaagtt atcctggtag cagttcatgt agccagtgga tacatagaag 4501 cagaagttat tccagcagag acagggcagg agacagcata ctttctctta aaattagcag 4561 gaagatggcc agtaaaaaca atacatacag acaatggccc caatttcacc agtactacgg 4621 ttaaggccgc ctgttggtgg acgggaatca agcaggaatt tggcattccc tacaatcccc 4681 aaagtcaagg agtaatagaa tctatgaata aagaattaaa gaaaattata ggacaggtaa 4741 gagatcaggc tgaacatctt aagagagcag tacaaatggc agtattcatc cacaatttta 4801 aaagaaaagg ggggattggg gggtacagtg caggggaaag aatagtaggc ataatagcaa 4861 cagacataca aactaaagaa ctacaaaaac aaattacaaa aattcaaaat tttcgggttt 4921 attacaggga cagcagagat ccactttgga aaggaccagc aaagcttctc tggaaaggtg 4981 aaggggcagt agtaatacaa gataataatg acataaaagt agtgccaaga agaaaagcaa 5041 aggtcattag ggattatgga aaacagacgg caggtgatga ttgtgtggca agcagacagg 5101 atgaggatta gaacatggaa aagtttagta aaacaccata tgtatatttc aaagaaagct 5161 aaaggacggt tttatagaca tcactatgaa agcactcatc caagaataag ttcagaagta 5221 cacatcccac taggggatgc tagattggta ataacaacat attggggtct gcatacagga 5281 gaaagagact ggcatttagg tcagggagtc tccatagaat ggaggaaaaa gagatatagc 5341 acacaagtag accctgacct agcagaccac ctaattcatc tgcattactt tgattgtttt 5401 tcagactctg ccataagaaa ggccatatta ggacatagag ttagtcctat ttgtgaattt 5461 caagcaggac ataacaaggt aggacctcta cagtacttgg cactaacagc attaataaca 5521 ccaaaaaaga taaagccacc tttgcctagt gttaagaaac tgacagagga tagatggaac 5581 aagccccaga agaccaaggg ccacagaggg agccatacaa tcaatgggca ctagagcttt 5641 tagaggagct taagaatgaa gctgttagac attttcctag gatatggctc catggcttag 5701 ggcaacatat ctatgaaact tatggggata cttgggcagg agtggaagcc ataataagaa 5761 ttctacaaca actgctgttt attcatttca gaattgggtg tcgacatagc agaataggca 5821 ttattcgaca gaggagagca agaaatggag ccagtagatc ctagactaga gccctggaag 5881 catccaggaa gtcagcctaa gactgcttgt accacttgct attgtaaaaa gtgttgcttt 5941 cattgccaag tttgtttcac aaaaaaagcc ttaggcatct cctatggcag gaagaagcgg 6001 agacagcgac gaagagctcc tgaagacagt cagactcatc aagtttctct accaaagcag 6061 taagtagtac atgtaatgca acctttagta atagcagcaa tagtagcatt agtagtagca 6121 ggaataatag caatagttgt gtgatccata gtattcatag aatataggaa aataagaaga 6181 caaagaaaaa tagacaggtt aattgataga ataagcgaaa gagcagaaga cagtggcaat 6241 gagagtgaag gggatcagga ggaattatca gcactggtgg ggatggggca cgatgctcct 6301 tgggttatta atgatctgta gtgctacaga aaaattgtgg gtcacagtct attatggggt 6361 acctgtgtgg aaagaagcaa ccaccactct attttgtgca tcagatgcta aagcatatga 6421 tacagaggta cataatgttt gggccacaca agcctgtgta cccacagacc ccaacccaca 6481 agaagtagaa ttggtaaatg tgacagaaaa ttttaacatg tggaaaaata acatggtaga 6541 acagatgcat gaggatataa tcagtttatg ggatcaaagc ctaaagccat gtgtaaaatt 6601 aaccccactc tgtgttactt taaattgcac tgatttgagg aatactacta ataccaataa 6661 tagtactgct aataacaata gtaatagcga gggaacaata aagggaggag aaatgaaaaa 6721 ctgctctttc aatatcacca caagcataag agataagatg cagaaagaat atgcacttct 6781 ttataaactt gatatagtat caatagataa tgatagtacc agctataggt tgataagttg 6841 taatacctca gtcattacac aagcttgtcc aaagatatcc tttgagccaa ttcccataca 6901 ctattgtgcc ccggctggtt ttgcgattct aaaatgtaac gataaaaagt tcagtggaaa 6961 aggatcatgt aaaaatgtca gcacagtaca atgtacacat ggaattaggc cagtagtatc 7021 aactcaactg ctgttaaatg gcagtctagc agaagaagag gtagtaatta gatctgagaa 7081 tttcactgat aatgctaaaa ccatcatagt acatctgaat gaatctgtac aaattaattg 7141 tacaagaccc aactacaata aaagaaaaag gatacatata ggaccaggga gagcatttta 7201 tacaacaaaa aatataatag gaactataag acaagcacat tgtaacatta gtagagcaaa 7261 atggaatgac actttaagac agatagttag caaattaaaa gaacaattta agaataaaac 7321 aatagtcttt aatcaatcct caggagggga cccagaaatt gtaatgcaca gttttaattg 7381 tggaggggaa tttttctact gtaatacatc accactgttt aatagtactt ggaatggtaa 7441 taatacttgg aataatacta cagggtcaaa taacaatatc acacttcaat gcaaaataaa 7501 acaaattata aacatgtggc aggaagtagg aaaagcaatg tatgcccctc ccattgaagg 7561 acaaattaga tgttcatcaa atattacagg gctactatta acaagagatg gtggtaagga 7621 cacggacacg aacgacaccg agatcttcag acctggagga ggagatatga gggacaattg 7681 gagaagtgaa ttatataaat ataaagtagt aacaattgaa ccattaggag tagcacccac 7741 caaggcaaag agaagagtgg tgcagagaga aaaaagagca gcgataggag ctctgttcct 7801 tgggttctta ggagcagcag gaagcactat gggcgcagcg tcagtgacgc tgacggtaca 7861 ggccagacta ttattgtctg gtatagtgca acagcagaac aatttgctga gggccattga 7921 ggcgcaacag catatgttgc aactcacagt ctggggcatc aagcagctcc aggcaagagt 7981 cctggctgtg gaaagatacc taaaggatca acagctcctg gggttttggg gttgctctgg 8041 aaaactcatt tgcaccacta ctgtgccttg gaatgctagt tggagtaata aatctctgga 8101 tgatatttgg aataacatga cctggatgca gtgggaaaga gaaattgaca attacacaag 8161 cttaatatac tcattactag aaaaatcgca aacccaacaa gaaaagaatg aacaagaatt 8221 attggaattg gataaatggg caagtttgtg gaattggttt gacataacaa attggctgtg 8281 gtatataaaa atattcataa tgatagtagg aggcttggta ggtttaagaa tagtttttgc 8341 tgtactttct atagtgaata gagttaggca gggatactca ccattgtcgt tgcagacccg 8401 ccccccagtt ccgaggggac ccgacaggcc cgaaggaatc gaagaagaag gtggagagag 8461 agacagagac acatccggtc gattagtgca tggattctta gcaattatct gggtcgacct 8521 gcggagcctg ttcctcttca gctaccacca cagagactta ctcttgattg cagcgaggat 8581 tgtggaactt ctgggacgca gggggtggga agtcctcaaa tattggtgga atctcctaca 8641 gtattggagt caggaactaa agagtagtgc tgttagcttg cttaatgcca cagctatagc 8701 agtagctgag gggacagata gggttataga agtactgcaa agagctggta gagctattct 8761 ccacatacct acaagaataa gacagggctt ggaaagggct ttgctataag atgggtggca 8821 aatggtcaaa acgtgtgact ggatggccta ctgtaaggga aagaatgaga cgagctgaac 8881 cagctgagct agcagcagat ggggtgggag cagcatcccg agacctggaa aaacatggag 8941 cactcacaag tagcaataca gcagctacca atgctgattg tgcctggcta gaagcacaag 9001 aggaggagga agtgggtttt ccagtcaaac ctcaggtacc tttaagacca atgacttaca 9061 aagcagcttt agatcttagc cactttttaa aagaaaaggg gggactggat gggttaattt 9121 actcccaaaa gagacaagac atccttgatc tgtgggtcta ccacacacaa ggctacttcc 9181 ctgattggca gaactacaca ccagggccag ggatcagata tccactgacc tttggatggt 9241 gcttcaagct agtaccagtt gagccagaga agatagaaga ggccaataaa ggagagaaca 9301 actgcttgtt acaccctatg agccagcatg gatggatgac ccggagagag aagtgttagt 9361 gtggaagtct gacagccacc tagcatttca gcattatgcc cgagagctgc atccggagta 9421 ctacaagaac tgctgacatc gagctatcta caagggactt tccgctgggg actttccagg 9481 gaggtgtggc ctgggcggga ccggggagtg gcgagccctc agatcgtgca tataagcagc 9541 tgctttctgc ctgtactggg tctctctggt tagaccagat ctgagcctgg gagctctctg 9601 gctaactagg gaacccactg cttaagcctc aataaagctt gccttgagtg cttcaagtag 9661 tgtgtgcccg tctgttatgt gactctggta gctagagatc cctcagatcc ttttaggcag 9721 tgtggaaaat ctctagca // LOCUS HIVNDK 9143 bp ds-DNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral complete genome. ACCESSION M27323 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 1 (isolate NDK), proviral DNA infectious clone. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9143) AUTHORS Spire,B., Sire,J., Zachar,V., Rey,F., Barre-Sinoussi,F., Galibert,F., Hampe,A. and Chermann,J.-C. TITLE Nucleotide sequence of HIV1-NDK a highly cytopathic strain of the human immunodeficiency virus HIV1 JOURNAL Gene 81, 275-84 (1989) STANDARD full staff_review COMMENT Kindly provided prior to publication by J.-C. Chermann, Pasteur Institute, Marseille. This is an infectious molecular clone of an isolate estimated to be 10,000 times more cytopathic in vitro than a prototypical HIV-1. [1] reports that only minor sequence differences appear to be responsible for the "acute biological effect". FEATURES from to/span description pept 334 1827 gag polyprotein pept < 1623 4631 pol polyprotein (NH2-terminus uncertain) pept 4576 5154 vif protein pept 5094 5384 vpR protein pept 5365 5579 tat protein, exon 2 (first expressed exon) 7883 7928 tat protein, exon 3 (AA at 7884) pept 5504 5579 rev protein, exon 2 (first expressed exon) 7883 8157 rev protein, exon 3 (AA at 7885) pept 5596 5841 vpU protein pept 5759 8299 env polyprotein pept 8301 8924 nef protein pre-msg 1 9143 genomic mRNA BASE COUNT 3307 a 1617 c 2193 g 2026 t ORIGIN 5'-terminus of 5' R repeat (putative mRNA start). 1 ggtctctctg gttagaccag atttgagcct gggagctctg gctaattagg gaacccactg 61 cttaagcctc aataaagctt gccttgagcg cttcaagtag tgtgtgcccg tctgttgtgt 121 gactctggta actagagatc cctcagatcc gtttagtccg agtgaaaaat ctctagcagt 181 ggcgcccgaa cagggacctg aaagcgaaag tagaaccaga gaagatctct cgacgcagga 241 ctcggcttgc tgaagcgcgc acagcaagag gcgaggggca gcgactggtg agtacgctga 301 aatttttgac tagcggaggc tagaaggaga gagatgggtg cgagagcgtc agtattaagc 361 gggggaaaat tagatacatg ggaaagaatt cggttacggc caggaggaaa gaaaaaatat 421 gcactaaaac atttgatatg ggcaagcagg gagctagaac gatttacact taatcctggc 481 cttttagaga catcagaagg ctgtaaacaa ataataggac agctacaacc atctattcaa 541 acaggatcag aagaaattag atcattatat aatacagtag caaccctcta ttgtgtacat 601 gaaaggatag aggtaaaaga caccaaagaa gctgtagaaa agatggagga agaacaaaac 661 aaaagtaaga aaaagacaca gcaagcagca gctgatagca gccaggtcag ccaaaattac 721 cctatagtgc agaacctaca ggggcaaatg gtacatcagg ccatatcacc tagaactttg 781 aacgcatggg taaaagtaat agaagaaaag gccttcagcc cggaagtaat acccatgttt 841 tcagcattat cagaaggagc caccccacaa gatttaaaca ccatgctaaa cacagtgggg 901 ggacatcaag cagctatgca aatgctaaaa gagaccatca atgacgaagc tgcagaatgg 961 gacagattac atccagtgca tgcagggcct gttgcaccag gccaaatgag agaaccaagg 1021 ggaagtgata tagcaggaac tactagtacc cttcaggaac aaatagcatg gatgacaagc 1081 aacccaccta tcccagtagg agaaatctat aaaagatgga taatcctggg attaaataaa 1141 atagtaagaa tgtatagccc tgtcagcatt ttggacataa gacagggacc aaaggaacct 1201 tttagagact atgtagaccg gttctataaa actctaagag ccgagcaagc ttcacaggat 1261 gtaaaaaact ggatgacaga aaccttgttg gtccaaaatg caaacccaga ttgtaaaact 1321 atcttaaaag cattgggacc acaggctaca ctagaagaaa tgatgacagc atgccaggga 1381 gtgggggggc ccggccataa agcaagagtt ttggctgagg caatgagcca agtaacaggt 1441 tcagctactg cagtaatgat gcagagaggc aattttaagg gcccaagaaa aagtattaag 1501 tgtttcaact gtggcaagga agggcacaca gcaaaaaatt gcagggcccc tagaaaaaag 1561 ggctgttgga aatgcggaag ggaaggacac caaatgaaag attgcactga aagacaggct 1621 aattttttag ggaagatttg gccttcccac aagggaaggc cggggaattt tcttcagagc 1681 agaccagagc caacagcccc accagcagag agcttcgggt ttggggagga gataaccccc 1741 tctcagaaac aggagcagaa agacaaggaa ctgtatcctt tagcttccct caaatcactc 1801 tttggcaacg acccctcgtc acaataaaga tagggggaca gctaaaggaa gctctattag 1861 atacaggagc agatgataca gtattagaag aaataaattt gccaggaaaa tggaagccaa 1921 aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcaa atactcatag 1981 aaatctgtgg atataaagct atgggtacag tattagtagg acctacacct gtcaacataa 2041 ttggaagaaa tttgttgacc cagattggct gcactttaaa ttttccaatt agtcctattg 2101 aaactgtacc agtaaaatta aagccaggaa tggatggccc aaaagttaaa caatggccat 2161 tgacagaaga aaaaataaaa gcattaacag aaatttgtac agaaatggaa aaggaaggaa 2221 aaatttcaag aattgggcct gaaaatccat ataatactcc aatatttgcc ataaagaaaa 2281 aagacagtac caagtggaga aaattagtag atttcagaga acttaataag agaactcaag 2341 atttctggga ggttcaatta ggaataccgc atcctgcagg gctgaaaaag aaaaaatcag 2401 taacagtact ggatgtgggt gatgcatatt tctcagttcc cttagatgaa gattttagga 2461 aatataccgc atttaccata cctagtataa acaatgagac accagggatt agatatcagt 2521 acaatgtgct cccacaggga tggaaaggat caccggcaat attccaaagt agcatgacaa 2581 aaatcttaga gccctttaga aaacaaaatc cagaaatagt tatctatcaa tacatggatg 2641 atttgtatgt aggatctgac ttagaaatag ggcagcatag aacaaaaata gaggaattaa 2701 gagaacatct attgaggtgg ggatttacca caccagataa aaaacatcag aaagaacctc 2761 catttctttg gatgggttat gaactccatc ctgataaatg gacagtacag cctataaacc 2821 tgccagaaaa agaaagctgg actgtcaatg atatacagaa gttagtgggg aaattaaact 2881 gggcaagcca gatttatgca ggaattaaag taaagcaatt atgtaaactc cttaggggaa 2941 ccaaagcact aacagaagta gtaccactaa cagaagaagc agaattagaa ctggcagaaa 3001 acagggaaat tctaaaagaa ccagtacatg gagtgtatta tgacccatca aaagacttaa 3061 tagcagaact acagaaacaa ggggacggcc aatggacata ccaaatttat caagaaccat 3121 ttaaaaatct aaaaacagga aagtatgcaa gaacgagggg tgcccacact aatgatgtaa 3181 aacaattaac agaggcagtg caaaaaatag ccacagaaag catagtgata tggggaaaga 3241 ctcctaaatt taaactaccc atacaaaagg aaacatggga aacatggtgg atagagtatt 3301 ggcaagccac ctggattcct gagtgggaat ttgtcaatac ccctccttta gtaaaattat 3361 ggtaccagtt agagaaggaa cccataatag gagcagaaac tttctatgta gatggggcag 3421 ctaatagaga gactaaatta ggaaaagcag gatatgttac tgacagagga agacagaaag 3481 ttgtcccttt cactgacacg acaaatcaga agactgagtt acaagcaatt aatctagctt 3541 tacaggattc gggattagaa gtaaacatag taacagattc acaatatgca ctaggaatca 3601 ttcaagcaca accagataag agtgaatcag agttagtcag tcaaataata gagcagctaa 3661 taaaaaagga aaaggtttac ctggcatggg taccagcaca caaaggaatt ggaggaaatg 3721 aacaagtaga taaattagtc agtcagggaa tcaggaaagt actatttttg gatggaatag 3781 ataaggctca ggaagaacat gagaaatatc acaacaattg gagagcaatg gctagtgatt 3841 ttaacctacc acctgtggta gcgaaagaaa tagtagctag ctgtgataaa tgtcagctaa 3901 aaggagaagc catgcatgga caagtagact gtagtccagg aatatggcaa ttagattgta 3961 cacatctgga aggaaaagtt atcctggtag cagttcatgt agccagtggc tatatagaag 4021 cagaagttat tccagcagaa acggggcaag aaacagcata ctttctctta aaattagcag 4081 gaagatggcc agtaaaagta gtacatacag ataatggcag caatttcacc agtgctacag 4141 ttaaggccgc ctgttggtgg gcagggatca aacaggaatt tggaattccc tacaatcccc 4201 aaagtcaagg agtagtagaa tctatgaata aagaattaaa gaaaattata ggacaggtaa 4261 gagatcaagc tgaacatctt aagacagcag tacaaatggc agtatttatc cacaatttta 4321 aaagaaaagg ggggattggg ggatacagtg caggggaaag aataatagac ataatagcaa 4381 cagacataca aactagagaa ttacaaaaac aaatcataaa aattcaaaat tttcgggttt 4441 attacaggga cagcagagat ccaatttgga aaggaccagc aaagcttctc tggaaaggtg 4501 aaggggcagt agtaatacaa gacaatagtg acataaaggt agtaccaaga agaaaagtaa 4561 agatcattag ggattatgga aaacagatgg caggtgatga ttgtgtggca agtagacagg 4621 atgaggatta acacatggaa aagtttagta aaataccata tgtatgtttc aaagaaagct 4681 aacagatggt tttatagaca tcactatgac agccaccacc caaaaataag ttcagaagta 4741 cacatcccac taggagaagc tagactggta gtaacaacat attggggtct gcatacagga 4801 gaaaaagaat ggcatctggg tcagggagtc tccatagaat ggaggaaaag gagatatagc 4861 acacaagtag accctggcct ggcagaccaa ctaattcata tgtattattt tgattgtttt 4921 gcagaatctg ctataagaaa agccatatta ggacatatag ttagtcctag ttgtgagtat 4981 caagcaggac ataacaaggt aggatcctta cagtatttgg cactagcagc attaatagca 5041 ccaaaaaaga taaagccacc tttgcctagt gttaggaagc taacagaaga tagatggaac 5101 aagccccaga agaccaaggg ccgcagaggg agccatacaa tgaatggaca ttagagcttt 5161 tagaggagct taagagtgaa gctgtcagac attttcctag gatatggctc catagcttag 5221 gacaacatat ctatgaaact tatggggata cctgggcagg tgttgaagct ataataagaa 5281 ttctgcaaca actactgttt attcatttca gaattgggtg tcaacatagc agaataagta 5341 ttactcgaca gagaagagca agaaatggat ccagtagatc ctaatctaga gtcctggaac 5401 catccaggaa gtcagcctag gactgcttgt aataagtgtc attgtaaaaa gtgttgctat 5461 cattgccaag tttgcttcat aacgaaaggc ttaggcatct cctatggcag gaagaagcgg 5521 agacagcgac gaaaacctcc tcaaggcgat caggctcatc aagttcctat accagagcag 5581 taagtattcc atgtaatgca acctttagtg ataatagcaa tagcagcatt agtagtagca 5641 ataataatag caatagttgt gtggaccata gtatacatag aatataggag gataaaaagg 5701 caaagaaaaa tagactgttt aattgataga ataagagaaa gagcagaaga cagtggcaat 5761 gagagcgagg gagaaagaga ggaattgtca aaacttgtgg aaatggggca tcatgctcct 5821 tgggatgttg atgacctgta gtgctgcaga agatttgtgg gtcacagttt attatggggt 5881 gcctatatgg aaggaagcaa ctaccactct attttgtgca tcagatgcta aagcatataa 5941 aaaagaggca cataatatct gggccacaca tgcctgtgta cccacggacc ccaacccaca 6001 agaaatagaa ttggaaaatg tgacagaaaa ctttaacatg tggaaaaata acatggtgga 6061 acaaatgcat gaggatataa tcagtttatg ggatcaaagc ctaaaaccat gtgtaaaatt 6121 aactccactc tgtgtcactt taaactgcac tgatgaattg aggaacagca agggcaatgg 6181 gaaggtagaa gaggaggaaa aaaggaaaaa ctgctctttc aatgtaaggg ataagaggga 6241 gcaagtatat gcactttttt ataaacttga tatagtgcca atagacaata ataataggac 6301 caatagtact aattataggt taataaattg tgatacctca accattacac aggcttgtcc 6361 aaagatatcc tttgaaccaa ttcccataca tttttgtgcc ccagctggtt ttgcaattct 6421 aaagtgtaga gataagaagt tcaatgggac aggcccatgc tcaaatgtca gcacagtaca 6481 atgtacacat ggaattagac cagtggtgtc aactcaactg ctgttgaatg gcagtctagc 6541 agaagaagag atcataatta gatctgaaaa tctcacaaac aatgttaaaa ccataatagt 6601 acagcttaat gcatctatag taattaattg tacaaggccc tacaaatata caaggcaaag 6661 gacatcgata ggactaaggc aatcactcta tacaataaca ggaaaaaaga agaaaacagg 6721 atacatagga caagcacatt gtaaaattag cagagcagaa tggaataaag ctttacaaca 6781 ggtagctaca aaactaggaa accttcttaa caaaacaaca ataactttta agccatcctc 6841 aggaggggac ccagaaatta catcacacat gcttaattgt ggaggggact tcttctactg 6901 taatacatca agactgttta atagtacatg gaatcagact aatagtacag ggttcaataa 6961 tggcacagtc acactcccat gcagaataaa acaaattgta aacttgtggc agagagtagg 7021 aaaagcaatg tatgcccctc ccatcgaagg actaattaaa tgttcatcaa atattacagg 7081 gctactatta acaagagatg gtggtgcaaa taatagttct catgagacca tcagacctgg 7141 aggaggagat atgagggaca attggagaag tgaattatat aaatataaag tagtaaaaat 7201 tgaaccaata ggagtagcac ccaccaaggc aaggagaaga gtggtggaaa gagaaaaaag 7261 agcaatagga ctaggagctg tgttccttgg gttcttggga gcagcaggaa gcacgatggg 7321 cgcagcgtca gtgacgctga cggtacaggc cagacagtta atgtctggta tagtgcacca 7381 gcaaaacaat ttgctgaggg ctatagaggc gcaacaacat ctgttgcaac tcacggtctg 7441 gggcattaaa cagctccagg caagagtcct ggctgtggaa agatacctaa gggatcaaca 7501 gctcctagga atttggggtt gctctggaag gcacatttgc accactaatg taccctggaa 7561 ctctagttgg agtaatagat ctctagatga gatttggcag aacatgacct ggatggagtg 7621 ggaaagagaa attgacaatt acacaggctt aatatacagc ttaattgaag aatcgcagat 7681 ccagcaagaa aagaatgaaa aagaattatt ggaattggac aaatgggcaa gtttgtggaa 7741 ttggtttagc ataacaaaat ggctgtggta tataaaatta ttcataatga tagtaggagg 7801 cctgataggt ttaagaatag tttttgctgt gctttctgta gtaaatagag ttaggcaggg 7861 atactcacct ctgtcatttc agaccctcct cccagtcccg aggggacccg acaggcccga 7921 agaaatagaa gaagaaggtg gagagcgagg cagagacaga tccattcgat tggtgaacgg 7981 attatttgca cttttctggg acgacctgag gaacctgtgc ctcttcagct accaccgctt 8041 gagagactca atcttaattg cagcgaggat tgtggaactt ctgggacgca gggggtggga 8101 agccctcaaa tacctgtgga acctcctgca gtattggagt caggaactga ggaatagtgc 8161 tagtagcttg cttgatacca tagcaatagc agtagctgag aggacagata gggttataga 8221 agtagtacaa agagcttgca gagctattct taacgtaccc agaagaataa gacagggctt 8281 ggaaaggctt ttgctataaa atgggtggca aatggtcaaa aagtagttta gttggatggc 8341 ctgctataag ggaaagaata agaaaaactg atccagcagc agatggggta ggagcagtat 8401 ctcgagacct ggaaaaacat ggggcaatca caagtagcaa tacagcaagt actaatgata 8461 cctgtgcatg gctggaagca caagaggaga gcgaggaggt gggctttcca gttagacctc 8521 aggtaccttt aagaccaatg acttacaaag aagctgttga tcttagccac tttttaaaag 8581 aaaagggggg actggaaggg ctaatttggt ccaagaaaag acaagagatc cttgatcttt 8641 gggtctacaa cacacaaggc atcttccccg attggcagaa ctacacacca gggccaggga 8701 tcagataccc actgaccttt ggatggtgct tccagctagt accagttgat ccacaggagg 8761 tagaagaggc cactgaaaga gaggacaact gcttgttaca ccctatgtgc cagcaaggaa 8821 tggaggaccc ggagagacaa gtgttaatgt ggagatttaa cagcagacta gcacttgagc 8881 acaaggcccg agagctgcat ccggagttct acaaagactg ctgacatcga gtttcctaca 8941 agggactttc cgctggggac tttccaggga ggcgtggcct gggcgggact tgggagtggc 9001 taaccctcag atgctgcata taagcagctg cttttcgcct gtactgggtc tctctggtta 9061 gaccagattt gagcctggga gctctctggc taattaggga acccactgct taagcctcaa 9121 taaagcttgc cttgagtgct tca // LOCUS HIVOYI 9190 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1) proviral DNA, complete genome. ACCESSION M26727 KEYWORDS complete genome. SOURCE Human immunodeficiency virus type 1 (isolate OYI) proviral DNA. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 9190) AUTHORS Wain-Hobson,S. and Pasteur Institute,Paris. TITLE ; JOURNAL AIDS Res. Hum. Retroviruses (1990) In press STANDARD full staff_review COMMENT Kindly provided in computer-readable form by Simon Wain-Hobson, Pasteur Institute, Paris. OYI, designated elsewhere as isolate 397, was taken from a healthy Gabonese presenting an atypical Western Blot. Tree analysis (see Part III) reveals that the sequence is closely related to the North American isolate SF2 and the European virus HAN(across the genome). This is the first report of a virus from Africa that clusters with North American rather than Zairean viruses: OYI and SF2 differ by approximately 7% in envelope. The single C -> S substitution at residue 22 of the OYI tat protein renders it inactive, but may not account for the avirulence of the virus. The pattern of env termination and nef start is not what is typically encountered. Sibling sequences for OYI(397) are available (see 1989:I-A-181). FEATURES from to/span description pept 336 1835 gag polyprotein pept < 1628 4639 pol polyprotein pept 4584 5162 vif protein pept 5102 5392 vpR protein ORF 5599 5721 vpu ORF pept 5373 5587 tat protein, exon 2 (first expressed exon) 7918 8008 tat protein, exon 3 (AA at 7919) pept 5512 5587 rev protein, exon 2 (first expressed exon) 7918 8192 rev protein, exon 3 (AA at 7920) pept 5767 8334 env polyprotein pept 8336 8971 nef protein binding 182 199 primer (Lys-tRNA) binding site pre-msg 1 9190 genomic mRNA signal 9166 9171 poly-A signal BASE COUNT 3304 a 1630 c 2202 g 2054 t ORIGIN 1 ggtctctcta gctagaccag atctgagccc gggagctctc tggctaacta gggaacccac 61 tgcttaagcc tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt 121 gtgactctgg taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca 181 gtggcgcccg aacagggacc ttaaagtgaa agtggaacca gaggagctct ctcgacgcag 241 gactcggctt gcttagcgcg cacggcaaga ggcgaggggc ggcgactggt gagtacgcca 301 aaaatttttg actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa 361 gcgggggaga attagataaa tgggaaaaaa ttcggttaag gccaggggga aagaaaaaat 421 atcaactaaa acatatagta tgggcaagca gggagctaga acgattcgca attaatcctg 481 gcctgttaga aacatcagaa ggttgtagac aaatactggg acagctacaa ccatccctta 541 agacaggatc agaagaaatt agatcattat ataatacagt agcaactctt tattgtgtgc 601 atcaaaagat agaggtaaaa gacaccaagg aagctttaga taagatagag gaagagcaaa 661 acaaaagtaa gaaaaaagca cagcaaacag cagctgacac aggaaacagc agccaggtca 721 gccaaaatta ccctatagta cagaaccttc aggggcaaat ggtacatcag cccatatcac 781 ctagaacttt aaatgcatgg gtaaaagtag tagaagagaa ggctttcagc ccagaagtaa 841 tacccatgtt ttcagcatta gcagaaggag ccaccccaca agatctaaac accatgctaa 901 acacagtggg gggacatcaa gcagccatgc aaatgttaaa agagaccatc aatgaggaag 961 ctgcagaatg ggatagattg catccagtac atgcagggcc tattgcacca ggccagatga 1021 gagaaccaag gggaagtgac atagcaggaa ctactagtac ccttcaggaa caaataggat 1081 ggatgacaaa taatccacct atcccagtag gagaaatcta taaaagatgg ataatcctgg 1141 gattaaataa aatagtaaga atgtatagcc ctaccagcat tctggacata agacaaggac 1201 caaaggaacc ctttagagac tatgtagacc ggttctataa aactctaaga gccgagcaag 1261 cttcacagga tgtaaaaaat tggatgacag aaaccttgtt ggtccaaaat gcaaacccag 1321 attgtaagac tattttaaaa gcattgggac cagcagctac actagaagaa atgatgacag 1381 catgtcaggg agtgggggga cccggccata aagcaagagt tttggctgaa gcaatgagcc 1441 aagtaaattc agtcaccgta atgatgcaga aaggcaattt taagaaccaa agaaagactg 1501 ttaagtgttt caattgtggc aaagaagggc acatagccaa aaattgcagg gctcctagga 1561 aaaagggctg ttggaaatgt ggaagggaag gacaccaaat gaaagattgt actgagagac 1621 aggctaattt tttagggaag atctggcctt cccacaaggg aaggccaggg aattttcttc 1681 agaacagacc agagccaaca gccccaccag cagagagctt cgggtttggg gaagagacaa 1741 caactccccc tcagaagcag gagccgatag acaagggact gtatccttta acctccctca 1801 gatcactctt tggcaacgac ccatcgtcac aataaagata ggggggcaac taaaagaagc 1861 tctattagat acaggagcag atgatacagt attagaagaa atgaatttgc caggaagatg 1921 gaaaccaaaa atgatagggg gaattggagg ttttatcaaa gtaagacagt atgatcagat 1981 actcatagaa atctgtggac ataaagctat aggtacagta ttagtaggac ctacacctgt 2041 caacataatt ggaagaaatc tgttgactca gcttggttgt actttaaatt ttcccattag 2101 tcctattgaa actgtaccag taaaattaaa gccaggaatg gatggcccaa aagttaaaca 2161 atggccattg acagaagaga aaataaaagt attaatagaa atttgtacag aaatggaaaa 2221 ggaagggaaa atttcaaaag ttgggcctga aaatccatac aatactccag tatttgccat 2281 aaagaaaaaa gacagtacta aatggagaaa attagtagat ttcagagaac ttaataagag 2341 aactcaggac ttctgggaag tccaattagg aataccacat ccagcagggt taaaaaagaa 2401 aaaatcagta acagtactgg atgtgggtga tgcatacttt tcagttccct tagataaaga 2461 cttcagaaag tatactgcat ttaccatacc tagtataaac aatgagacac cagggattag 2521 atatcagtac aatgtgcttc cacagggatg gaaaggatca ccagcgatat tccaaagtag 2581 tatgacaaaa atcttagagc cttttagaaa acaaaatcca gacatagtta tctatcaata 2641 catggatgat ttgtatgtag gatctgactt agaaataggg cagcatagaa caaaaataga 2701 ggaactgaga caacatctgt tgaggtgggg atttaccaca ccagacaaaa aacatcagaa 2761 agaacctcca ttcctttgga tgggttatga actccatcct gataaatgga cagtacagcc 2821 tataatgctg ccagaaaaag acagctggac tgtcaatgac atacagaagt tagtgggaaa 2881 attaaattgg gcaagtcaga tttatgcagg gattaaagta aagaacttat gtaaactcct 2941 taggggaacc aaagcactaa cagaagtaat accactaaca gaagaagcag agctagaact 3001 ggcagaaaac agggagattc taaaagaacc agtacatgga gtgtattatg acccatcaaa 3061 agacttagta gcagaattac agaaacaggg acaaggccaa tggacatatc aaatttatca 3121 agagccattt aaaaatctga aaacaggaaa gtatgcaaga atgaggggtg cccacactaa 3181 tgatgtaaaa cagttaacag aggcagtgca aaaaataacc caagaaagca tagtaatatg 3241 gggaaagact cctaaattta aactacccat acaaaaagaa acatgggaag catggtggac 3301 ggagtattgg caagccacct ggattcctga atgggagttt gtcaataccc ctcccttagt 3361 aaaattatgg taccagttag agaaagaccc catagtagga gcagaaactt tctatgtaga 3421 tggggcagct aatagggaga ctaaattagg aaaagcagga tatgttactg acagaggaag 3481 acagaaagtt gtctccctaa ctgacacaac aaatcagaag actgaattac aagcaattca 3541 tctagctctg caggattcgg gattagaagt aaacatagta acagactcac aatatgcatt 3601 aggaatcatt caagcacaac cagataaaag tgaatcagag ttagtcagtc aaataataga 3661 gcagttaata aaaaaggaaa aggtctacct ggcatgggta ccagcacaca aaggaattgg 3721 aggaaatgaa caagtagata aattagtcag tgctggaatc aggaaagtac tatttttgga 3781 tggaatagat aaggcccaag aggaacatga gaaatatcac agtaactgga gagcaatggc 3841 tagtgatttt aacctgccac ctgtagtagc aaaagaaata gtagccagct gtgataaatg 3901 ccagctaaaa ggagaagcca tgcatggaca agtagactgt agtccaggaa tatggcaact 3961 agattgtaca catttagaag gaaaaattat cctggtagca gttcatgtag ccagtggata 4021 tatagaagca gaagttattc cagcagagac agggcaggaa acagcatact ttatcttaaa 4081 attagcagga agatggccag taaaaacaat acatacagac aatggcagca atttcaccag 4141 tactacggtt aaggccgcct gttggtgggc agggatcaag caggaatttg gcattcccta 4201 caatccccaa agtcaaggag tagtagaatc tatgaataat gaattaaaga aaattatagg 4261 acaagtaaga gatcaggctg aacatcttaa gacagcagta caaatggcag tatttatcca 4321 caattttaaa agaaaagggg ggattggggg gtacagtgca ggggaaagaa tagtagatat 4381 aatagctaca gacatacaaa ctaaagaact acaaaaacaa attacaaaaa ttcaaaattt 4441 tcgggtttat tacagggaca gcagagaacc actttggaaa ggaccagcaa agcttctttg 4501 gaaaggtgaa ggggcagtag taatacaaga taatagtgac ataaaagtag tgccaagaag 4561 gaaagcaaag atcattaggg attatggaaa acagatggca ggtgatgatt gtgtggcaag 4621 tagacaggat gaggattaga acatggaaaa gtttagtaaa acaccatatg tatgtttcaa 4681 agaaagctaa gggatggttt tatagacatc actatgaaag cactcatcca agaataagtt 4741 cagaagtaca catcccacta ggggatgcta ccttggtagt aacaacatat tggggtctgc 4801 atacaggaga aagagaatgg catttgggcc agggagcctc tatagaatgg aggaaaaaga 4861 gatatagcac acaagtagac cctggcctag cagaccaact aattcataca tattattttg 4921 attgtttttc agaatctgct ataagaaatg ccatactagg aaatatagtt agtcctaggt 4981 gtgaatatcc agcaggacat aacaaggtag gatctctaca atacttggca ctagcagcat 5041 tgataaaacc aaaaaagata aagccacctt tgcctagtgt tacgaaacta acagaggata 5101 gatggaacaa gccccagaag accaagggcc acagagggag ccatacaatg aatggacact 5161 agagctttta gaggagctta agagtgaagc tgttagacat tttcctagga tatggctcca 5221 tagcttagaa caatatatct atgaaactta tggggatact tgggaaggag tggaagccat 5281 aataagaatt ctgcaacaac tgctgtttat tcatttcaga attgggtgtc aacatagcag 5341 aataggcatt actcgacaga gaagagcaag aaatggagcc agtagatcct agactagagc 5401 cctggaagca tccaggaagt cagcctaaga ctgctagtaa caattgctat tgtaaaaggt 5461 gttgccttca ctgtcaagtt tgtttcacaa aaaaaggctt aggcatttcc tatggcagga 5521 agaagcggag acagcgacga agagctcctc aagacagtaa gactcatcaa gtttctctat 5581 caaagcagta agtaatacat gtaatatact ctttacaaat attagcaata gtagcattag 5641 tagtagtaac aataatagca atagttgtgt ggaccatagt actcttagaa tataggaaaa 5701 tattaagaca aagaaaaata gacaggttaa ttgatagaat aagagaaaga gcagaagaca 5761 gtggcaatga cagcgagggg gaccaggaag aattatcagc gcttgtggag atggggcacc 5821 atgctccttg ggatgttgat gatatgtagt gctgcagaaa atttgtgggt cacagtctat 5881 tatggggtac ctgtgtggaa agaagcaacc accactctat tctgtgcatc agatgctaga 5941 gcatatgcta cagaggtaca taatgtttgg gccacacatg cctgtgtacc cacagacccc 6001 aacccacaag aagtagtatt gggaaatgtg acagaaaatt ttgacatgtg gaaaaataat 6061 atggtagaac aaatgcaaga agatataatc agtttatggg atcaaagcct aaagccatgt 6121 gtaaaattaa ccccactctg tgttacttta gattgcactg atgttaatac cactagtagt 6181 agtttgagga atgctactaa taccacaagt agtagttggg aaacgatgga gaaaggagaa 6241 ttaaaaaact gctctttcaa taccaccaca agcataagag ataagatgca ggaacaatat 6301 gcactttttt ataaacttga tgtattacca atagataaga atgatactaa atttaggtta 6361 atacattgta acacctcaac cattacacag gcctgtccaa agatatcctt tgagccaatt 6421 cccatgcatt attgtactcc ggctggtttt gcgattctaa agtgtaatga taagaagttc 6481 aatggaacag gaccatgtac aaatgtcagc acagtacaat gtacacatgg aattaagcca 6541 gtagtgtcaa ctcaactgct gttaaatggc agcctagcag aagaagaggt aataattaga 6601 tctagcaatt tcacaaacaa tgctaaaatc ataatagtac agctgaataa atctgtagaa 6661 attaattgta caagacccaa caacaataca agaaacagga tatcaatagg accagggaga 6721 gcatttcata caacaaaaca aataatagga gatataagac aagcacattg taaccttagt 6781 agagcaacat gggagaaaac tttagaacag atagctacaa aattaagaaa acaatttagg 6841 aataaaacaa tagcctttga tcgatcctca ggaggggatc cagaaattgt aatgcacagt 6901 tttaattgtg gaggggaatt tttctactgt aatacatcac aactgtttaa tagtacttgg 6961 aatgatacta caagggcaaa tagcactgaa gtaactatca cactcccatg tagaataaaa 7021 caaattgtaa acatgtggca ggaagtagga aaagcaatgt atgcccctcc catcagtgga 7081 caaattagat gttcatcaaa gattacaggg ctgctattaa caagagatgg tggtaagaac 7141 accacgaacg ggatcgaaat cttcagacct gcaggaggag acatgaggga caattggaga 7201 agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc acccaccaag 7261 gcaaggagaa gagtggtgca gagagaaaaa agagcagtgg gaatgctagg agctatgttc 7321 cttgggttct tgggagcagc aggaagcact atgggcgcac ggtcgatgac gctgacggta 7381 caggccagac aattattgtc tggtatagtg cagcagcaga acaatctgct gagggctatt 7441 gaggcgcaac agcacctgtt gcaactcaca gtctggggca tcaagcagct ccaggcaaga 7501 gtcctggctg tggaaagata cctaaaggat caacagctcc tagggatttg gggttgctct 7561 ggaaagctca tttgcaccac tactgtgcct tggaatgcta gttggagtaa taaatctctg 7621 aatgagattt gggataacat gacctggatg cagtgggaaa gagaaattga caattacaca 7681 cacttaatat acaccttaat tgaagaatcg cagaaccaac aggaaaagaa tgaacaagaa 7741 ttattggaat tggataagtg ggcaggtttg tggagttggt ttagcataac aaactggctg 7801 tggtatataa gaatattcat aataatagta ggaggcttgg taggtttaag aatagttttt 7861 gctgtacttt ctatagtgaa tagagttagg cagggatact caccattatc gtttcagacc 7921 cgcctcccaa cccagagggg acccgacagg cccgaaggaa tcgaagaaga aggtggagag 7981 agagacagag acagatccgg aagattagtg gatggattct tagcacttat ctgggacgat 8041 ctgcggagcc tgtgcctctt cagctaccac cgcttgagag acttaatctt gattgtagcg 8101 aggattgtgg aacttctggg acgcaggggg tgggaagtcc tgaaatattg gtggaatctc 8161 ctgcagtatt ggagccagga actaaagaat agtgtcatta gcttgctcaa cgccacagct 8221 atagcagtag ctgaggggac agatagggtt atagaaatag tacaaagagc ttatagagct 8281 tttctcaata tacctagaag aataagacag ggcttggaaa gggctttgct ataagatggg 8341 tggcaagtgg tcaaaatgta gtatgaaggg atggcctact ataagggaaa gaatgaagcg 8401 agctgagcta cagccacctg agccagcagc agaaggggtg ggagcagcat ctcgagacct 8461 ggaaaaacat ggagcaatca ctagtagcaa tacagcagct actaatgctg attgtgcctg 8521 gctagaagca caagaggatg aagaggtggg ttttccagtc agacctcagg tacctttaag 8581 accaatgact tacaagggag ctttagatct tagccacttt ttaaaagaaa aggggggact 8641 ggaagggcta atttactccc agaaaagaca agatattctt gatctgtggg tctatcacac 8701 acaaggctat ttccctgatt ggcagaacta cacaccaggg ccagggatca gatatccact 8761 gtgctttgga tggtgcttca agttagtacc aatggatcca gaccaggtag aagaggccaa 8821 cgaaggagag aacaacagct tgttacatcc tataagcctg catgggatgg atgacccaga 8881 gaaagaagtg ttagtgtgga agtttgacag ccgcctagca tttcgtcaca tggcccgaga 8941 ggtgcatccg gagtactaca aagactgctg acatcgagct ttctacaggg gactttccgc 9001 tggggacttt ccagggaggc gtggcctggg cgggaccggg gagtggcgag ccctcagatg 9061 ctgcatataa gcagctgctt tttgcctgta ctgggtctct ctagctagac cagatctgag 9121 cccgggagct ctctggctaa ctagggaacc cactgcttaa gcctcaataa agcttgcctt 9181 gagtgcttca // LOCUS HIVPHIL1 259 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), envelope region. ACCESSION M27911 KEYWORDS . SOURCE Human immunodeficiency virus type 1 (isolate 799), clone B7. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 259) AUTHORS Ou,C.-Y. and colleagues CDC Atlanta GA, JOURNAL Unpublished (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer readable form by C.-Y. Ou, CDC, Atlanta. The sequence below, designated 799-B7, appears to be a major form of the sibling sequences 799, taken from a hemophilia patient in Philadelphia. FEATURES from to/span description pept < 1 > 259 env region (AA at 2) BASE COUNT 99 a 45 c 56 g 59 t ORIGIN Approximately 1172 nt from env start (BRU coordinates). 1 ttcaacacaa ctgtttaata gtacttggaa tgttactgaa gggttaaata acactgcaaa 61 taacagtgaa ggaactatca cactcccatg cagaataaaa caaattataa acatgtggca 121 ggaagtagga aaagcaatgt atgcccctcc catcagagga caaattagat gttcatcaaa 181 tattacaggg ctgctattaa caagagatgg tggtaatgaa actaacggga ccgagatctt 241 cagacctgga ggaggagat // LOCUS HIVSWB881 59 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), gag region. ACCESSION M27952 KEYWORDS . SEGMENT 1 of 3 SOURCE Human immunodeficiency virus type 1 (isolate SWB88), clone 01. ORGANISM Human immunodeficiency virus type 1 Viridae; ss-RNA enveloped viruses; Positive strand RNA virus; Retroviridae; Lentivirinae. REFERENCE 1 (bases 1 to 59) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV-1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL j acquired immuno defic syndr 2, 344-52 (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer readable form by S. Wain-Hobson. B88 is a so-called "slow virus" isolated in Sweden. Sibling sequences available (see 1989: I-A-174). FEATURES from to/span description pept < 1 > 59 gag region (AA at 2) BASE COUNT 26 a 8 c 10 g 15 t ORIGIN 1 ctataaaaga tggataatcc tgggattaaa taaaatagta agaatgtata gccctacca // LOCUS HIVSWB882 314 bp ss-DNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), tat first expressed exon of isolate L1.01. ACCESSION M31462 KEYWORDS . SEGMENT 2 of 3 SOURCE Human immunodeficiency virus type 1 (isolate B88). REFERENCE 1 (bases 1 to 314) AUTHORS Meyerhans,A., Cheynier,R., Albert,J., Seth,M., Kwok,S., Sninsky,J., Morfeldt-Manson,L., Asjo,B. and Wain-Hobson,S. TITLE Temporal fluctuations in HIV quasispecies in vivo are not reflected by sequential HIV isolations JOURNAL Cell 58, 901-910 (1989) STANDARD full staff_review COMMENT Kindly submitted in computer-readable form by S. Wain-Hobson, Pasteur Institute, Paris. The B88 isolate is considered to be a "slow-low" viral form. Sibling sequences from pbl's and viruses propagated on pbl's are available. This sequence corresponds to the earliest sample major form, L1.01. FEATURES from to/span description pept 30 / 244 tat exon 2 (first expressed exon) BASE COUNT 111 a 57 c 70 g 76 t ORIGIN 1 cattattcaa cagaggagag caagaagaaa tggagccagt agatcctaga ctagagccct 61 ggaagcatcc aggaagtcag cctaagactg cttgtaacaa ttgctattgt aaaaagtgtt 121 gctttcattg ccaagtttgt ttcacaaaaa aaggcttagg catctcctat ggcaggaaga 181 agcggagaca gcgacgaaga gctcctcagg acagtgagac tcatcaagtt tctctatcaa 241 agcagtaagt agtacatgta atgcaatctt tagaaatatt agcaatagta gcattagtag 301 tagcaataat acta // LOCUS HIVSWB883 250 bp ss-RNA VRL 11-AUG-1990 DEFINITION Human immunodeficiency virus type 1 (HIV-1), envelope region. ACCESSION M27947 KEYWORDS . SEGMENT 3 of 3 SOURCE Human immunodeficiency virus type 1 (isolate SWB88), clone 01. REFERENCE 1 (bases 1 to 250) AUTHORS Goodenow,M., Huet,T., Saurin,W., Kwok,S., Sninsky,J. and Wain-Hobson,S. TITLE HIV1 isolates are rapidly evolving quasispecies: Evidence for viral mixtures and preferred nucleotide substitutions JOURNAL J. Acq. Immun. Defic. Syndrome. 2, 344-8 (1989) STANDARD full staff_review COMMENT [1] kindly submitted in computer readable form by S. Wain-Hobson. This stretch comes from the first hypervariable region of env. Sibling sequences are available (see 1989: I-A-175). FEATURES from to/span description ORF < 3 245 env ORF (AA at 3) BASE COUNT 96 a 40 c 42 g 72 t ORIGIN 1 aaattaaccc cactctgtgt tactttaaat tgcactaagt tgaagaatgt tacttatgcc 61 aataatagta gtagggaaaa agaagaaatg aaaaactgct ctttcaccac aagaataggg 121 aataaggtgc agaaagaata tgcacttttt tataaacttg atgtagtacc aatagataag 181 gataatacaa gctttatatt gatacattgt aatacctcga ccattacaca ggcttgtcca 241 aaggtatcct // LOCUS ALFLBA 578 bp ss-mRNA PLN 11-AUG-1990 DEFINITION Alfalfa leghemoglobin mRNA, complete cds. ACCESSION M36100 KEYWORDS leghemoglobin. SOURCE Alfalfa 15 day old nodule, cDNA to mRNA, clone pNL549. ORGANISM Medicago sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 578) AUTHORS Barker,D.G., Gallusci,P., Lullien,V., Khan,H., Gherardi,M. and Huguet,T. TITLE Identification of two groups of leghemoglobin genes in alfalfa (Medicago sativa) and a study of their expression during root nodule development JOURNAL Plant Mol. Biol. 11, 761-772 (1988) STANDARD simple staff_review FEATURES from to/span description pept 7 450 leghemoglobin BASE COUNT 189 a 91 c 126 g 172 t ORIGIN 1 agaaatatga gtttcaccga taagcaggag gctttagtca atagctcatg ggaggcattc 61 aaacaaaacc ttcctcgtta tagtgttttt ttctacactg tcatattaga gaaagcacct 121 gcagcaaaag gtttgttctc ttttcttaag aactcagctg aagtacaaga tagtcctcaa 181 ctccaagctc acgctgaaaa ggtttttgga ctggtgcgtg actcagctgt tcaacttcga 241 gcaacagggg gggtagtttt gggagatgct acgttgggtg ctatccacgt tcgaaaggga 301 gttgttgatc ctcattttgt ggtggttaaa gaggctttgc tcaaaacaat aaaagaagca 361 gctggagaca aatggagtga agagcttaac actgcttggg aagtagctta tgatgcactg 421 gcaactgcaa ttaaaaaagc aatgagttaa acctgtgatg atctattatc ataaaaagat 481 taataaataa aatatgtatt actaaaactt gttaaacaag ttcctataat aaattatgat 541 aaaaatattt atgaaatatt gttagtgttt atgccggt // LOCUS ALFLBB 521 bp ss-mRNA PLN 11-AUG-1990 DEFINITION Alfalfa leghemoglobin mRNA, complete cds. ACCESSION M36101 KEYWORDS leghemoglobin. SOURCE Alfalfa 15 day old nodule, cDNA to mRNA, clone pNL154. ORGANISM Medicago sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 521) AUTHORS Barker,D.G., Gallusci,P., Lullien,V., Khan,H., Gherardi,M. and Huguet,T. TITLE Identification of two groups of leghemoglobin genes in alfalfa (Medicago sativa) and a study of their expression during root nodule development JOURNAL Plant Mol. Biol. 11, 761-772 (1988) STANDARD simple staff_review FEATURES from to/span description pept > 1 396 leghemoglobin BASE COUNT 175 a 77 c 112 g 157 t ORIGIN 1 aaatcattta aacaaaaccc tggtaatagt gttttgttct acactataat attggagaaa 61 gcccctgcag caaagggcat gttctctttt cttaaggact cagctggagt acaggatagc 121 cctaaactcc aaagccatgc tgaaaaagtt tttggaatgg tgcgcgattc agctgctcaa 181 ctccgagcaa caggaggagt agttttggga gatgctacat tgggtgcaat ccacattcag 241 aaaggagttg ttgatcctca ttttgcggtg gttaaagaag ctctgctgaa aacaataaag 301 gaagtatcag gagataaatg gagcgaagaa ttgaacactg cttgggaagt agcctatgat 361 gcattggcaa ctgcaattaa gaaggcaatg gtttaaattt gggatgatct atagttatta 421 ctcaatttaa taaataaaaa atgtattact aaaacttgtt taacaagttt ctataataaa 481 tgttgttgaa atgactatta aatattattt gagtttatgt t // LOCUS CLTPHYC 1077 bp ds-DNA BCT 11-AUG-1990 DEFINITION Calothrix sp. phycocyanin alpha (PC2-alpha) and phycocyanin beta (PC2-beta) genes, complete cds. ACCESSION M36276 KEYWORDS phycocyanin. SOURCE Calothrix sp. (strain PCC 7601 (Fremyella diplosiphon UTEX 481)) DNA. ORGANISM Calothrix sp. Prokaryota; Bacteria; Gracilicutes; Oxyphotobacteria; Cyanobacteria; Nostocales; Rivulariaceae. REFERENCE 1 (bases 1 to 1077) AUTHORS Capuano,V., Mazel,D., Tandeau de Marsac,N. and Houmard,J. TITLE Complete nucleotide sequence of the red-light specific set of phycocyanin genes from the cyanobacterium Calothrix PCC 7601 JOURNAL Nucleic Acids Res. 16, 1626-1626 (1988) STANDARD full staff_review FEATURES from to/span description pept 1 519 phycocyanin beta (PC2-beta) pept 589 1077 phycocyanin alpha (PC2-alpha) BASE COUNT 276 a 259 c 242 g 300 t ORIGIN 1 atgcttgacg cttttactaa agtagtttcc caggctgata ctagaggcgc atacattagc 61 gatgcagaaa tcgatgcatt gaagacaatg gttgctgctg gttctaagcg catggatgtt 121 gttaaccgca ttaccggcaa tgcctccaca atcgttgcta atgcagctcg cgctttattt 181 gaagaacaac cccagttgat tgcacctggt ggtaatgctt acaccaaccg tcgtatggct 241 gcttgcttac gcgacatgga aattatctta cgctatgtaa cctacgcagt atttgcaggt 301 gatgctagcg ttttagacga tcgctgcttg aacggtcttc gtgaaacata tcaagcatta 361 ggcgttcctg gcgcttccgt atctactggc gttcaaaaga tgaaagaagc tgcgatcgct 421 attgctaacg accccagcgg tgttacccgt ggtgactgta gctccttaat gtctgaattg 481 ggtagctact tcgatcgcgc tgctgctgca gttggctagt aaatcgctag ttaattattt 541 ctactttctg acgcataact ttagatattt gttgagagaa taatcagcat gaaaacacct 601 ttaaccgaag ccgtagctac cgcagattcc caaggtcggt ttttgagctc cacagaactt 661 caagttgctt ttggtcgttt ccgtcaagca tctgctagct tggatgcagc taaagcatta 721 agcagcaaag ctaacagctt ggctcaaggt gccgttaacg cagtttacca aaagttcccc 781 tacaccaccc aaatgcaagg taagaacttc gcttctgacc aacgcggtaa agacaagtgt 841 gcacgcgaca tcggttacta cattcgcatc gttacctact gtctagttgc tggtggtaca 901 ggccctctag atgactacct aattggtggt ttggcagaaa tcaaccgcac ctttgatctg 961 tctcccagct ggtacgttga agctctcaaa tacatcaaag ctaatcatgg tttgagtggc 1021 gatcctgctg ttgaagcaaa ctcctacatt gactacgcaa tcaacgccct aagctag // LOCUS PINACT 745 bp ds-DNA PLN 11-AUG-1990 DEFINITION P.contorta (lodgepole pine) actin gene, last two exons. ACCESSION M36171 KEYWORDS actin. SOURCE P.contorta (var. latifolia) seed DNA. ORGANISM Pinus contorta Eukaryota; Plantae; Embryobionta; Pinophyta; Pinicae; Pinatae; Pinaceae. REFERENCE 1 (bases 1 to 745) AUTHORS Kenny,J.R., Dancik,B.P., Florence,L.Z. and Nargang,F.E. TITLE Nucleotide sequence of the carboxyl-terminal portion of a lodgepole pine actin gene JOURNAL Can. J. For. Res. 18, 1595-1602 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 420 actin, exon X (AA at 1) 555 620 actin, exon X + 1 IVS 421 554 actin intron A BASE COUNT 196 a 124 c 172 g 253 t ORIGIN 1 aagcttgctt atgtagccct tgactatgag caagaacttg aaacatcgaa gagtagttct 61 tcacaagaaa agaattatga gcttcctgat ggacaggtca ttaccgttgg tgctgagcgt 121 ttcagatgtg ctgaggtttt gttccagcca tctctcattg gaatggaagc agcgggtatc 181 catgagacta catacaattc cattatgaaa tgtgatgtgg atatcagaaa agatctctat 241 ggaaacattg ttcttagtgg tggttctacc atgtttcctg gtatagctga tcgtatgagc 301 aaagagatca ctgcacttgc tcccagtagc atgaaaatca aagttgtagc accccctgag 361 aggaagtaca gtgtttggat cggaggttct atcttggcat ctcttagcac atttcagcag 421 gttattctca actttctaca tgccggaact tgaggttctt gtttggtttg gtggtattta 481 atgtgctttg caatatataa atattatttc tttgattgta tatttaacca tgccttgatt 541 gtttttgtat gcagatgtgg attgccaaga ccgagtatga ggagtcaggt ccctcaattg 601 tccacagaaa gtgtttttaa acagagtgct ctattgatta tgctttatga ttggccttta 661 ggtatattgt gagggtggaa gacacgatta ccgagatttt gatttagtat catgctttag 721 gatttgcatc tgtatagttg aattc // LOCUS RATMHCI 1355 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Rat MHC class I RT1.C/E mRNA, 3' end. ACCESSION X16979 KEYWORDS cell surface antigen; cell surface glycoprotein; class I gene; integral membrane protein; major histocompatibility complex. SOURCE Rat liver, cDNA to mRNA, clone RT.BM1. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1355) AUTHORS Parker,K.E., Carter,C.A. and Fabre,J.W. TITLE A rat class I cDNA clone with an Alu-like sequence and mapping to two genes in RT1.C/E JOURNAL Immunogenetics 31, 211-214 (1990) STANDARD simple staff_review FEATURES from to/span description pept < 1 857 MHC RT1.C/E (AA at 3) mRNA < 1 1355 MHC RT1.C/E mRNA rpt 1210 1349 type 2 Alu-like repeat BASE COUNT 322 a 334 c 387 g 312 t ORIGIN 1 attgggagcg ggagactcgg aaagccaggg acaccgggag gaacttcaaa gtgaatctga 61 ggaccctgct ccgctactac aatcagagtg acgacgaatc tcacacgctg cagtggatgt 121 acggctgcga cgtggggccc gatgggcacc tgctccgcgg ttattgtcag gaggcttatg 181 acggccggga ttatatctcc ttgaacgagg acctgcgctc ctggaccgcg actgacatgg 241 cctcacaggc ctctaagatc aagtcagagg aagttggtga ggcccatcac cagagggcct 301 acctgcaagg tccttgcgtg gagtggctcc acacatacct acatctggga aaggagacgc 361 tgctgcgctc agacccccca aaggcacatg tgacccttca ccccagacct gaaggtgatg 421 tcaccctgag gtgctgggcc ctgggcttct accctgctga catcaccttg acctggcaat 481 tgaatgggga ggacctgacc caggacatgg agcttgtgga gaccaggcct gcaggggatg 541 gaaccttcca gaagtgggca gctgtggtag tgccttctgg ggaggagctg aagtacacat 601 gccatgtgga gcatgaggga ctgcctgaac ctctcgccct gagatgggag ccttctccat 661 tcaccgactc cagcatgcca gtgattgttg tccttggagc tgtggccatc attggagctg 721 tggccatcat tggagctgtg gccatcattg gagctgttgt gaggaggagg aagagaaaca 781 caggtgaaaa aggcagctat gctcatgttc taggcagcaa ggctttccag atctctgact 841 ggcctcagaa ggcatgaaaa gacagttgcc tcgtggggct gagcgacaag ttctgactaa 901 acatctgagg gagctgcttc ccctgaggac aatgtgaaga actggcaatc ccagaccttc 961 ttgcctccct ggacacctga cttcacaaca gaaaataatc tgactctggg ctgggtgttt 1021 cctattgctt ttatctgcac tgagtggaaa tgcggtttcc aacagtgtaa cgcagtaccg 1081 ttgtgtggtg gtagtgacat tgcacacagg tttgtgagct gggggtgtga tgtgcttttg 1141 ttactgtagg cagaacaagc tagaagtcca ccattcctct tatggtcatt gttttaaaac 1201 aaaacaaatg ctggagagat ggctcagtgt taagagcact gacttctctt acagaggtcc 1261 tggttcaaat cccagcaacc acatggtggc tcacaaccat atgtaatgag atctgatgct 1321 ctcttctgat gtgtctgaac acagctatat atgat // LOCUS SCOMTCOIII 1120 bp ds-DNA ORG 11-AUG-1990 DEFINITION S.commune mitochondrial cytochrome oxidase III (COIII) gene, complete cds. ACCESSION M36270 KEYWORDS cytochrome oxidase III. SOURCE S.commune mitochondrial DNA. ORGANISM Mitochondrion Schizophyllum commune Eukaryota; Plantae; Thallobionta; Basidiomycotina; Hymenomycetes; Agaricales; Schizophyllaceae; Schizophyllum commune. REFERENCE 1 (bases 1 to 1120) AUTHORS Phelps,L.G., Burke,J.M., Ullrich,R.C. and Novotny,C.P. TITLE Nucleotide base sequence of the mitochondrial COIII gene of Schizophyllum commune JOURNAL Curr. Genet. 14, 401-403 (1988) STANDARD simple staff_review FEATURES from to/span description pept 147 953 cytochrome oxidase III (COIII) BASE COUNT 387 a 155 c 158 g 420 t ORIGIN 1 ggatcttaca aagaatatta tctttcaact ttcttttaaa taataaaatt acttgttaat 61 taattcaaat tacaagtgta aagccatagt ataaaacaaa atttataaat aaataaataa 121 aaaataacaa ataataaata acaaaaatga caagtattaa attctatcaa tcatttagtg 181 cacacttagt acaacattca ccttggccaa ttttagttag tttcagtctt tttaatttag 241 ctattggtac agtattaact atgcatggat acagtcacag ttcaacaact tttgatttag 301 gtttagcagt aactgttgga tcaattttat tatggactag agatattgta attgaaggat 361 ctttcttagg agatcacaca aaacaagtac aagaaggatt aatcatagga tttatcttat 421 ttattataag tgaagtattc gcttttattt cagtattctg ggcttatttc catagtgctt 481 tatcaccagc tgtagaatta ggatcaacat ggccaccagt aggaataatt ccattagata 541 ccttttcatt accattattt aatacaataa ttttattatc tagtggagct tttgttacct 601 atggacacca tgctattttc tcaggaaaaa gattagattc aatcatagga ttatttttaa 661 cagtagcctt agcattaatc tttagttact ttcaagcatt tgaatatatt catgctggat 721 tttccatgtc cgatagtgta tttggaacag tattttttgc aagtacagga ctacatggga 781 tacatgttat gcttgggact ctttttcttt tcgtatcttt tcttagacaa gttaactatc 841 aaactacaaa agaacacaat attggacttg aaacaagtat tctatattgg cactttgttg 901 atcttgtgtg gttattctta tttcttgttg tttacttttg gggtggtgca taattatttc 961 ttttctttta ttgaaagtaa taactattca ttagaactta taaactatgt gcaatttcca 1021 attttattcg gatttatcaa tacaggggga aaaacttctc ccttagaaaa tacaagaaat 1081 tgaagacaaa gaattaaatg taataaataa aataattaca // LOCUS STYHSQ 135 bp ds-DNA SYN 11-AUG-1990 DEFINITION S.potsdam and S.typhimurium hsdS recombinant DNA. ACCESSION M15940 KEYWORDS DNA binding protein; type I restriction and modification system. SOURCE S.potsdam and S.typhimurium DNA, clone lambda-hsd SQ. ORGANISM Artificial gene Artificial sequences; Genes. REFERENCE 1 (bases 1 to 135) AUTHORS Fuller-Pace,F.V. and Murray,N.E. TITLE Two DNA recognition domains of the specificity polypeptides of a family of type I restriction enzymes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9368-9372 (1986) STANDARD full staff_review COMMENT The hsdS genes of the SB (S.typhimurium) and SP (S.potsdam) have a conserved sequence of around 100 bp flanked by two nonhomologous variable regions of around 500 bp. Recombination between the hsdS genes of SB and SP generates a system (SQ) with a different recognition specificity. The crossover point lies in the central conserved region. Concomitant with the generation of a new combination of flanking variable regions is the recombination of minor differences in the central conserved region. A polypeptide domain encoded on the 5' side of the crossover dictates recognition of the trinucleotide component "aac" (positions 25-27) of the target sequence, and a second domain, encoded on the 3' side of the crossover (positions 50-54), governs recognition of the tetra- or penta-nucleotide component. Parts of the variable regions are implicated in the determination of the specificity of interaction between protein and DNA. FEATURES from to/span description pept < 1 > 135 hsdS specificity protein (AA at 1) recomb 32 101 SP DNA end/SB DNA start (uncertain within span) BASE COUNT 46 a 39 c 24 g 26 t ORIGIN Unreported. 1 ataccaatcc cgtcacttgc tgaacaaaaa atcatcgccg aaaaactcga tacgctgctg 61 gcgcaggtag acagcaccaa agcacgtctt gagcaaatcc cacaaatcct gaaacgtttt 121 cgccaatcag tgata // LOCUS RATCYP2A3A 14315 bp ds-DNA ROD 11-AUG-1990 DEFINITION Rat cytochrome P450 II A3 (CYP2A3) gene, complete cds. ACCESSION M33190 KEYWORDS coumarin 7-hydroxylase; cytochrome P450 II A3. SOURCE Rat (Sprague-Dawley) DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 14315) AUTHORS Ueno,T. and Gonzalez,F. TITLE Complete sequence of the rat CYP2A3 gene specifically transcribed in lung JOURNAL Nucleic Acids Res. 18, 4623-4623 (1990) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Gonzalez 23-MAR-1990, for release after publication. FEATURES from to/span description pept 3446 3625 cytochrome P450 II A3, exon 1 3892 4054 cytochrome P450 II A3, exon 2 4736 4885 cytochrome P450 II A3, exon 3 5184 5344 cytochrome P450 II A3, exon 4 6879 7055 cytochrome P450 II A3, exon 5 8457 8598 cytochrome P450 II A3, exon 6 9049 9236 cytochrome P450 II A3, exon 7 9806 9947 cytochrome P450 II A3, exon 8 11123 11304 cytochrome P450 II A3, exon 9 pre-msg 3422 11497 cytochrome P450 II A3 mRNA and introns IVS 3626 3891 intron A IVS 4055 4735 intron B IVS 4886 5183 intron C IVS 5345 6878 intron D IVS 7056 8456 intron E IVS 8599 9048 intron F IVS 9237 9805 intron G IVS 9948 11122 intron H signal 11466 11471 poly-A signal BASE COUNT 3819 a 3801 c 2926 g 3769 t ORIGIN Chromosome 7. 1 gaattccaac agagacccca cttgtgcacc cgaaatgata cacccagaag aaggtacaag 61 cagagctaag cgaggcagag gctcccatca taacctacta atcaaaacga tagcagcgtg 121 aacatgttca caagcatgga ggacttgggg acaacagtgg ctcgctaagg aggagaaagc 181 aaacatgagt cgatagtcta cagtgagaga tgatagacaa caaggcaaag tcctgtgcag 241 gcttcagctc catccagagc aatctgaaca tctaacgctg atcagaaatt gcccaacagg 301 gcaccaagcc tccaggataa acttcctact cacagatcct tgagtatggt ggaagagggc 361 actgatgctt ggctttgatg aagaagcact ggggtgggag gcttctcaca caccaccagc 421 ttctttaggg ttgtcctgga ggcatttcaa aatgagacag gccttgcttg cccaccgtaa 481 ctgaatgtta ccaaagacac caggagcttt ggctacagga catagaaata tcaatctcaa 541 actgaccagg aaattgtctc cttagtggtt ctgggagtat cattggtctg accaagtgtg 601 gaccctctcg ctacaataaa aacaagaggc tcccaaaggc agattgaagt tcagaggaac 661 atcaatccat atcctttttt cttgtcaagt aataagatgt gataataaca gatgaataaa 721 atagctattt attgacttta gcattaactc aaccattact accaagaaga agattcacct 781 ataaagttca gtttgttatt gaagtcagct cttcaaggac acacaacaga caagatctca 841 gggacactga ttggattgcc caggagttgg tatctgccca ggttagggtt ccattctatg 901 ctgtgccatt gctctcaatc acggaaccag aaagggagac tggaaccatc agagggagac 961 ttcccaaggt tgctgcttat tcactctgga aacctgtctg agatgcttgc tgctgttccc 1021 acatgtctct ggcatgtgac cgctgtcacc gatccttcat tcacgcccct gtgtctctcc 1081 tgcataactc ctgttcctgc tctgtcattc atgtgcccgg aaaaaaaaat caataaggac 1141 aaatacactt gatctctcct gaactctcca agctctgttc ccattctggc ccccgtgccg 1201 aacatgagaa catgcctgac cactcagggt ccttctgggt gcctcctcca ctgagaaaca 1261 gattctctaa attacccctt cctcctattc aagtggaaac aaggggagag gagaggtccg 1321 tgctgggtat agtgcaggtg caggagagat tgatgcctgc caagcctagg ctaatcccat 1381 tccccccccc gccccttcct cctccaaaat caacaaacaa cctttctctc tgctttcttc 1441 tcggcccttc gtttcctccc caaccggtct cagagtccaa cagcctaaaa tgagatgagc 1501 taattgccac tcatgagctc caacctcacc caatggtcta atgctgggat accatcatgg 1561 gagggaccca tccatcaaag cctgccagca tccacccaag gtccctccca gggtctcaag 1621 gaagccaggg tcagcaagca cctagtgttt atatcagagc ctgctggaac caaatccata 1681 cccagcatgg tgccacactt atgccttcac tcaaacaatt ccagtgacca cctgccgccc 1741 ctctcaaggt ctctctcagg gtaagaatat caccaatatc cttttggtgt caagctccgt 1801 gctaactcct ttttcacctt agtgcttgcc tgggggctat ctcaatgcca accccatcct 1861 cctccctcct gggagcccat cctgcagcta agcttaattc caggtgtctc cagggtgagc 1921 tgagctggat aatgaggagg aaagagaatc aaatccctgc tcctttatga cccggtcatc 1981 tcaatgaggt tggtgatgca ccaactgcac aatgactaag attccacaga tgggatacac 2041 atcatcttcc tgggctacgg gttcatcaaa actgtccaga ttattaaaga agacttacaa 2101 tgtgagagaa cagtagggtc attttccctt ctccatagcc accctcagca tcccagggtc 2161 ctcttctgtt ccccatgctg gacgctcatc taaatctctc cacctcagcc cactgataac 2221 atccaggcat cagcacacat acccaccccc caccaaatat gacctttatt tccccaatgc 2281 tctcttctca ctggtgggaa agatgtctag acagggaaag ctaaaagtat gagcaagagt 2341 agatgttaat gcctctgact ttcgggcttt ccatgtatgt gccccccatg ccacccctat 2401 gttttgttca cgtgcaccca tagaaaacat gtgtaaatta cagcatcatt gagccagtga 2461 ctctgatggc ctcctctaca aatctcgata ggaatccagt ggggctttcg taaattagaa 2521 ctttgctacc atattggagt tgtgattatc atttctcaag acctagacag acccccagat 2581 attggctcta gtactgtgcc aaattccagt gaaatatgac tctggtcttc cctacctttc 2641 tcagatctta tgttccagca atccatccta gactcagaga cttctctggg ttctgagcat 2701 taagagacag catgaagcca taaccaaagt gcatccacgc cggggcacag gtgttgactc 2761 caagatcctc ctgcagtttg tgtccttcca ggcactcccc atccacagcc tttcttgact 2821 caagatccat aacccatagc cctggagtcg ccaattctcg gttccttaca ttcaggaccc 2881 acttcattga aagtctccca gggttggctg tggaagtgta ggtctgtaat cccagcactc 2941 agagcagaca ggtggatctc tgtagtctat gtacagactc cggactaccc agaactacat 3001 agtgaaaact tgtctcaaac aagaaagaaa agtgatcttt caacttcata gccattcatc 3061 ctatttcgaa caaatgaatt atagggttct tctctcctaa gaccctaata tggctatcac 3121 ttgtgccctt tctcaaagga agggtctcta gtatacatgt catgaaggat ccagttctgg 3181 cttctctcta ctgtcctctg taatgcatag ttttggagca cttcctcctt agacaccaca 3241 aactcagtct actccctgtt aatctgtcat ggagccctcc tcctccttga gtgttggcta 3301 tgtcccaaac taggtgcatg acgggggtag tctggaggta aaattagaca gttgcataat 3361 caagaccaaa ggccagcctt ctatctcaga atgtataaaa acaagctaat tcagccatca 3421 ctgtgtctgt catcccatcg ccaccatgct ggcctcagga ctccttctgg tggcctcagt 3481 ggccttcctc agtgtcctgg tcttgatgtc tgtctggaag cagaggaagc tctcagggaa 3541 gctgcctcct ggacccaccc cattgccctt catcgggaac tacctccagc tgaacacaga 3601 gaaaatgtac agctctctca tgaaggtgcc actgggcatg gagatgggtc gaatgggggc 3661 gtggagggat gcccaatttt gattgaggtt ttgtggtagg tattgaccag atgggagtgt 3721 aattttaggc caggggaatt gtgaagaaat cagcactttc taggccagtg gtatcacatc 3781 ttgggacacc cagtctgata actattagaa ctctagagag atgaatccca gaccggatat 3841 caaagaatga accatcccct cttaccttct ctgatcattc tcctccatca gatcagccaa 3901 cgttacggtc ctgtattcac catccacctg ggacctcgcc gagttgtggt gctgtgcgga 3961 caggaggcag tcaaggaggc tctggtggac caagctgagg aattcagtgg tcggggagag 4021 caggccacct tcgactggct tttcaaaggc tatggtgagg ggaagaccca ggggaatgtg 4081 gcagaggaca tttgagggca tcagtttccc cagatttctc cctgactgtc tatcccctcc 4141 aagcaatctt ctccccacat ccctgtctca gactggtctc tctgccagtg tttcctgact 4201 gccccacctc cacccatgcc tccttttcca cctagcaatt gctgtcggaa tcctgccgcc 4261 tctaagggga tccgaagctt ccctgcaacc cacaccatca ttttcagctt ctgcgattgc 4321 tttttctctt cctctcctct gctttccagt gtcagacatc acagaacgtt tgttcctagg 4381 ttctatctac cagcaacgat ttcttctctt tccagactct ctttgtgtct ctctctttat 4441 ttccctctct ttttttgctc aattacactt ctttcacaat tgtatccacc ctcagctctc 4501 tgcagtctcc tgtctctcac ctctgtctct gcacctctgt ctccctcttc ttactgtcca 4561 gtaacttgct cttttacctg tactttcgtg gtgccttctt gtgttttgca tttgcatttc 4621 caaggatctg tgtgcatctt tttccttctt gaccgcctga aggggaacag cgcaagggtg 4681 gtcaatggcc agccacgccc cctgacttct cctggcttct cattcacctt cccaggcgta 4741 gccttcagca gcggggagcg agccaaacag ctaaggcgct tctccatcgc cacgctgcgg 4801 gacttcggcg tgggcaagcg tggcatcgag gagcgtatcc aagaggaggc gggctttctc 4861 atcgagtcat ttcgaaagac gaacggtaag cggagcaccg ttctccggac ctaccaaggg 4921 aaagcagcac catgtcccag gatagaaact aggctaggga agtgtatgct ggtcccctac 4981 tgtgggatct ggtctcagcg tttttggttc agcgctgcag ccagtgccct tcttgagttc 5041 tcagattcaa agtatctgat gctaacaccc aggcgatgct ttttgcctag ttccaatgtc 5101 tccctttctt tcctcctcct cctccccttc ctccccctcc tccccctcct cctcctccct 5161 ctcctcctct tccccatcca caggtgccct cattgacccc accttctatc tgagccggac 5221 agtctccaat gtcattagct caatagtctt cggggaccgc ttcgactatg aggacaaaga 5281 gttcctgtca ctgcttcgaa tgatgctggg aagcttccag ttcacagcta cctccacggg 5341 gcaggtaacc tgtttcaacc ttgtcctgtc atacctcctg accttgactg ccagctcact 5401 cccaaactgc ttccagcagg caccttcaca ctagcccttg cagttttgca agaacaaata 5461 aatgtcaata tgtagacatc ctcaacagct gcccctgaaa tcaaaagctg gcagtctatc 5521 tccctttccc cctccctttt ccccttcccc tttccccctc cccttccccc tcccctctcc 5581 ctcccctccc cttccctctc atgtacacac acacacacac acacacacac acaccacaca 5641 cacacacaca gccactcagc tccttcagct gggcacatgt taccaccaca atttgtttta 5701 atttctaaca caagattgcc ataggaaggg tccgctgaaa acaccacccc tcttcagttc 5761 ctgaacatct ggcactgcag aaaacacaga gctagcccac tggagtatgt agatgagtgc 5821 cccttactca gccacggaac taagcaaacg tctgaggctc caagcgaaat ttctgcacag 5881 atgtccccca aacatccaag taagggtctt tcaacacagc tcattcacac tcctaagcac 5941 gaccatctgt gaagctgttt gataacaaag caggtgaaga tatggatatc agcactactt 6001 aaaatagaaa taataagaca attttggaca agcatacatt gtgtattgag ccttttcagc 6061 cccatacccc ccaccttccc tgttctcaca cttcctcctc ttcctcctcc tcctcttcct 6121 cttcctcctc ttcctcctcc tccttctgtt agtcctgttt gtccctgaga cagttttatt 6181 tctacttcta agtgatacac atgtacatat atatgcatat gtatgtacaa cgtgatttga 6241 attaatacca cagattgaaa gaacacatac acgggttctc taatactggt taaatttatt 6301 tgatatgatc tcccattgtg tctgctttcc tgaaaatttc cataccattg tttttaatgg 6361 ctgaaaaatt ttacacacac acacacacac acatacacat atacatatat acatatacat 6421 atatatctca cattttctct ttcattacct tgtctttgga cagctagttt gagtgcataa 6481 ttcagttact gtgactaccc aggactggtt ggtttaatac tggaaaacct gcgaaccttc 6541 accatccaaa ccctgctagt atttgaacag ctgcctctag ctaatgtcac ccaaatacct 6601 aagcatctac agatgaaact tcagcccaac aactgaatgt cattcaagcc agccatgcgc 6661 aagaactcca aacctatata tgcgaccaaa tccacctcac ttaaaatctt ggacaatagc 6721 cctctcactt aaactataag aaaactatat tgcctccttg ccctaccagg ttacctgaac 6781 ctcaggacag gtgacagaaa agtcccttgc ctgaaaacag ccaccacacc tgttggattt 6841 tatgcagtct tggtttctcc tcctttttcc accctcagct ctatgagatg ttctcttctg 6901 tgatgaaaca cctgccaggc ccccagcaac aggcctttaa ggagctgcag gggctggagg 6961 acttcataac caagaaggtg gaacagaatc agcgcacgct ggatcccaat tccccaaggg 7021 acttcatcga ctctttcctc atccgaatgc tggaggtaaa cccagaagtt agggagcatg 7081 gcgggaggtc caaagccaag caggatgaaa tcacgcttgg ggaatggggg taatggtgga 7141 ccacttaaat cagtcctctc aatataaacc tcacatatta ccactcagca ccactgttcc 7201 tttcacttct cagtctcagg aggatggaca gactgagatc acagccagga tgtggtgctg 7261 ccatatgcca ggcaccattt taaaccattt taaaggctta gtgatatcca cttatttagt 7321 attcacacct tggacgaggt tgtgttatga ctcccatttt acaaatgagt aggctgcagc 7381 tctgagaata taggcaactc gtcctgcatc acagctgtct caggctgaca cacaacttaa 7441 cgttgcctgt cctcattgtc attcacgtct gtggcacagg tgacctagct ttattgtgac 7501 cttctctcac ctatatggaa cagaggcagg ggtgttttgt gataggctgt agccctagtt 7561 ggcctgaaac ttgctctgca caccaggcaa gtctcaaacc tgccgtggtc ctcctgaggc 7621 tgctttctga tggcttggat tatcaacatg tggtatccgg ctgagatgga acttcatgta 7681 gtctgaatgc aaaggattga gggatgagat tcccctagaa aatattaagg ttcgtaatga 7741 agagatggga tttattgcct agcattcatt gtctgagggt tggaaagatg ggtcagtggt 7801 taagagcact ggctgctctt ccagagtgcc agtgggttca atcccagcac atgcatggca 7861 gctcacaacc atgtcatttc agtcccagta gatccaatgc cctctttggg cctccatggt 7921 tactgaacgc atgtagtgct ctgacacaca tgcaggcgaa acactcatac actcaaaata 7981 atttttaaaa agctctgtat caggatgggg atttagctca gtggtagagc acttgcctag 8041 gaagcgcaag gccctgggtt tggtccccag ctccgaaaaa aagaacaaaa aaaaagctct 8101 gtatctacct ctgaaaaacc atctcaagtt ttccctgact tcctccaaca tgacgtcagt 8161 accattgttt cttgggctct cttcagcctt cctttcctaa tttcgaccat taaataactg 8221 acgactaagt tagtaacaat cagacaagca ctaagtcacg aatgacagaa gagcatttat 8281 ggtcccaggt cacttccaaa tgtctgggcc ccggagagaa ctcagcatag ccagggttcc 8341 tcttctaagc cttaggtctg gcccaaagag aagtgagttc aagttgggaa aaaaaagcta 8401 gaagggtacc aagagtgtaa ggaatggatt tgacgtgaat atttccctcc ctgcaggaaa 8461 agaagaaccc caatactgag ttctacatga agaacttggt gctgactacc ctaaatctct 8521 tctttgccgg cacagagacc gtcagcacca ccctgcgtta cggctttctg ttgctcatga 8581 agcacccgga tattgagggt aacgaaggct tcaagggaca ggaaagcagg ggttgagggg 8641 agcataacct ggatatagat tctaaatttg catcttttat ctgccacaat gtcccctacc 8701 tttctcaaga aaccggactc tcaaacagct agctaccccc tgctttgcct agacagagcc 8761 acagctggtg ggcaccagtc tctagttgct taaccaactg gtgtgctgct cgccacactt 8821 cagaggagtt acttccccgt ctcccgggtc tcctcgcaac tcccgtacaa ttatggtatg 8881 atccatccag ctcacgggaa acttcttcac caatgcagga gttgcacata ttgaaaatga 8941 tctacaatgc catacactgt tccatggcct ccatcccgtc ttagtttctg aactatcatc 9001 tctactgtgg cgttcttgga ttcctaaacc ggtccctgtt cttctcagct aaggttcacg 9061 aggagattga ccgggtgatt ggcaggaacc ggcaggccaa gtatgaggac cgaatgaaga 9121 tgccctacac ggaggctgtg atccacgaga tccagagatt tgcagacatg atccccatgg 9181 gcctggctcg cagggtcacc aaggacacca agtttcgaga gttcctcctc cccaaggtgc 9241 tgattcacct agcctagagt gacctctaga cctcttcatg ggcctccagg acctcctgct 9301 cattagaaac cacccggccc ctctcttggt ttcttagtga caggctttct ctgtcaccac 9361 atcccactgc cctcctactt agagactcct atatcttgtc ttcaccccag actttcccac 9421 tcaggaattt taactccatg cctcttaaga agtataaact taatttatct agatcttgac 9481 tgtgttggag ataggagtct catgtctgct atatatcctg tttggggcta cacactttca 9541 atacccccaa atcttgactt gagaggcaca atttgcccca gaagacaaag cctgtattac 9601 tcatctgtcc tggctactag aacatgagtc ccacatcccc catatcttaa ggataacctc 9661 acattcggca aagctcctcc ctcaaaggtc ccctaactca ctatctttac cttcacttgg 9721 ataccctagc tcaccctttt aacccctgtg ggccagcagg acattcatcc aaacatacct 9781 catggagtct cttcttcatc accagggtac tgaagtattt cctatgctgg gctctgtact 9841 gaaagaccct aagttcttct ccaaccccaa cgacttcaac ccaaagcact tcctagatga 9901 caagggacag tttaagaaga gtgatgcctt tgtgcccttt tccattggta agaacccact 9961 gccgagtgcc agaccactgc tcacaccaac agataagctt ctccattgct tctgtctgat 10021 gcagccaaat atttctctct attttggaat attctcttgc atctaactct cggctagtta 10081 gctgtagctc ccataattcc caagcaccat gtatctgaac cctggagacg tgatggtcaa 10141 actttagtga taggtgaagg gaatggaacc tcaaactaag ccagaggtct ttctcaggtc 10201 acacaaccag ttcttaaaaa ggatgtggcc acatggcagc cataccattc cctatagctg 10261 tagggaaagg tgtaaaagta ttttttaatc tcctgatgta aatgtctccg aggcttactg 10321 cctccctctg ctaacctagg cctagccctg gaagcttcta gcctctgaac aatcttatct 10381 agcctagaat gttttcagcc tctgagactt gctgctaaat aagctcacac cttcctagtt 10441 ccctctgatc tctggatggc tggttcaact cagctgtttt ggctcaaact cctctaacag 10501 attcagtctc tctctttgcc tctcctgagc tgctctactt ggctccatgc taactttggc 10561 aatctgttct gatcttctgt ctccttctca ttctctggct tgttctgtct tcacctgggc 10621 ctagctcgct ctctcagtaa cctgcctcta tataactgtc ccagttaaac tgcccccttt 10681 ctccatgttt ctctctcttc actgttctct ttgtctgcca ctcaattaga cgtcattttc 10741 aaacatgggt gctttcttct acaaactaac tttatcttca ttgtttggaa tttaaagtgt 10801 gtaccaaggg tttgcattcc agccagaggg attaaaggtt gtgtgctaag gctgagccac 10861 tccacaccta gaaacaggtt tttcagtaaa taacacaatc tcggggttca cagtgtgatc 10921 gaatatcctg caacagaagg gaatccacat ttctcattgt tacagcctgg tatggatcaa 10981 cccactacaa gtacaaatac aactgaggct caggagttag agcgttcctt aaacgtctca 11041 gcacttcctc ccatgaagaa tgcagccagg gatttgctat cggcaaaaac tggcttcacc 11101 ttcccctgtc ttgtctcttt aggaaaacgg tattgtttcg gggaaggact ggcaaggatg 11161 gaactctttc tcttcctcac aaacatcatg cagaacttct gcttcaaatc cccacaggca 11221 ccccaggaca tcgatgtgtc tcctagactt gtgggctttg ccacaatccc accaaactac 11281 actatgagtt tcttgtcccg ttgagccagc attgcgtgag agtaaaggga agaatggggg 11341 ttgaaactta gaaagggtcg gatgtactga agaaacagag gaaaaagagt agacctgctg 11401 aagatatata tttcaaaggt agagccagag aagggggaaa tatctactat actgtgaata 11461 ggaataataa aataataaaa gtagatatta tttatggcat aagtagtctt tgtcaagttt 11521 atgctaaggg gtcttacaat ctcacttaat actcagaagt gcatgggaag gggaagagag 11581 ttcatgctca ttttattagg gtcacattct tgatcttatg tttccctagt ctaagtttac 11641 caaagtctca cagaatccaa gagctcacta atatttggac aaagatctgt gttcattggc 11701 cttgggcaga ttttcaaata tcaccacctc tcagataaaa catgaaagct caagtctatt 11761 gccacatcat cctgggcaga tctttgggtg ggcaacctgc caggctctac cacttgcctg 11821 gccacactct gatctactta cagagaaaac tcagggcctg caccatgaca tgaatgtgtg 11881 ctgatttatg gatgcccacg taagtgtggt gcagggtttc ctcaagttat gcacgagtat 11941 attttctgtc atgagttggc ttccaaaggc caagtgtttg ctcacagata agagtcagtt 12001 aaagcagttg aaaagatgtg ctgtggtaca ctggtgtaaa tctggtgagt agatgcctgt 12061 gcgatgcagt ctcaacagca gtggacacag cagccagcac tttctaggct tcttactttt 12121 atgatgtcag tattagtcat acccaggagg aagggacaga tataaggata tcaaggtgac 12181 atctttgaga actgtattgc aagccctccc tggtagtaaa tttcaaccct gcaagctgca 12241 taagcttcct gggataatga agagaagaag gagggaacag gggtgaggat ggctggggag 12301 gggatggggt agagtaggga gaatgactca attagaaaag tgcttgccaa gcaataatga 12361 ggacccgagt ttggatcctc agcacccctc aaaagcccag cgtaatggca caagcctgtc 12421 accctgatgc tgtgaggtgc gtgaaaattg cctggtccct gaagatcact gggcagccca 12481 gtccagtcaa atggtgaact cgggatttag agaccctgtc tcaaagaaca aagtggagac 12541 aggaagatca acagagtcaa ctaacctggg ggctagctca ctagaccctt ggggctctca 12601 aagactgagc caccaaccaa ggtgcacacg tggcctggac agaggccccc agcacatacg 12661 tagcatatgt acatctcaat ctttatgtgg gttccccaac aactggagca ggagctatca 12721 taaagctatt gcctgtcagt ggaatccatt cccctaactg ggctgccttg tctcacctcc 12781 gtgggagagg gtgcacttag ccctacagag acttgatgca caaggaaggg aatataccca 12841 ggtgggcccc atcctctcag aagagaaggg gatgggagat gggggagagg tttgtaagga 12901 gggaacagga gttgggcatt gttcaggatg gaaaatgaat aaataaatta attaatgggg 12961 aaaactggag agaaactaaa gaagacatgt actgttgacc tgtggcctct acacacccat 13021 gtacacatgc acctgcatgc acatacaaaa atacacacac acacacacac acacacacac 13081 acacacacac acagactagc tcatttttag caactggcaa gaaagccttt gaaagtgagg 13141 gagggaaaaa aaactaacac attctttaat gctgaaaaac attaaaataa caaacaaatg 13201 aaaagctcct aaagacgtga cctgggctgc tgagacaggc cacattcaac caacgatgtt 13261 gcattacata gcagcagagc ttgcagcctt ctgaagggca tccgtgacct tcttcaggct 13321 aagctgtatg ttagggttca aagtttccca tcactttatg aatattagcc taaccagagt 13381 aactttgatg aacgagatgg ccagaaacca acagcaaagt caaaaccaca aaagccatgg 13441 gtatgtggaa attaagagac gtgccatcaa aattgtttct tttcaaataa aacttaggca 13501 tataccgagg tagtagagag acagttcaat ggttaaaagc acagaatgct catgcagagg 13561 acttgagttg ggtccctggc actcatgtct ggcagcctaa agtcagctgt aactaactaa 13621 ctaactaact aactccaggg gatccaatgc catcctctgg attccatggg gcctgcatgt 13681 atatatgcat atgcacacac acacacacac acacacacac acacaaagac acacacaaag 13741 atacacacaa agacacatga ctttaaaaat aagtcttttt aaaaatataa ctattaatga 13801 aattaatgaa gtgcaactaa tgttttccat acatgtataa attatgcaat gactgaatca 13861 tagcaacaaa aaatgtagac ttatacaaca gtggttcaaa gaaatattac atggtaaatc 13921 acaaaacact taaataatgg aacaaaaata aatacacaca gacttacagg atgagaacag 13981 ccacagcagg agatgcaggt aaactcatag catggttttt aaagaaatgt caaattcata 14041 aacggtgttt taaattttta cccactagat tgaaagactc actaaactga tgtttttttt 14101 tctctttttt gtgattatac atacttattt tactgggtat tttttattta catttcaaat 14161 gtgattccct ttcgaggttt cccggccata tgcactctat cacaactccc tccccttctt 14221 ctataagggt gttccctctc cccaaccacc ccaccttcct gccccccacc gacattcccc 14281 tgcactgggg gtccagcctc agcagggatc aaggg // LOCUS ACARRDX 119 bp ss-rRNA RNA 11-AUG-1990 DEFINITION A.castellani 5S ribosomal RNA. ACCESSION M28984 KEYWORDS 5S ribosomal RNA. SOURCE A.castellani ribosomal RNA from Viluy encephalomyelitis patient KPN.. ORGANISM Acanthamoeba castellanii Eukaryota; Animalia; Protozoa; Sarcomastigophora; Sarcodina; Rhizopoda; Lobosa; Gymnamoeba; Amoebida; Acanthopodina; Acanthamoebidae. REFERENCE 1 (bases 1 to 119) AUTHORS Chumakov,K.M. and Karavanov,A.S. TITLE Preliminary characterization of an organism isolated from a case of Viluy encephalomyelitis indicates a protozoal, rather than viral, aetiology JOURNAL J. Gen. Microbiol. 132, 1127-1133 (1986) STANDARD simple staff_review FEATURES from to/span description rRNA 1 119 5S ribosomal RNA BASE COUNT 26 a 36 c 37 g 20 t ORIGIN 1 ggatacggcc atactgcgca gaaagcaccg cttcccatcc gaacagcgaa gttaagctgc 61 gcgaggcggt gttagtactg gggtgggcga ccacccggga atccaccgtg ccgtatcct // LOCUS BMOCHORB 538 bp ss-mRNA INV 11-AUG-1990 DEFINITION B.mori early chorion protein mRNA, 3' end. ACCESSION M36291 KEYWORDS early chorion protein; eggshell protein. SOURCE B.mori, cDNA to mRNA, clone m2G12. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 538) AUTHORS Eickbush,T.H., Rodakis,G.C., Lecanidou,R. and Kafatos,F.C. TITLE A complex set of early chorion DNA sequences from Bombyx mori JOURNAL Dev. Biol. 112, 368-376 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 482 early chorion protein precursor (AA at 3) sigp < 1 44 early chorion protein (AA at 1) matp 45 479 early chorion protein signal 518 523 poly-A signal BASE COUNT 104 a 128 c 159 g 147 t ORIGIN 1 tcctcatctg tgcttcggca ttattcgtcc agtctgctgt cagtcaatat ctgagtcgcg 61 tcggctgtgg ttgtggaggg gttggcaatg gcttgggcta cggtggtatc ggctataacg 121 gctatgtcgg taacgatatc ggcgctgctg gagctctagg agcttccaat ggagggtgtc 181 tcaacgtcgt cagctcctct gccgctccta ctagtttagg tgtagcctct gagaactctt 241 acgagggaac tgtcggtgtc tgcggtaatc tgcccctctt gggtactgca atcgtcaccg 301 gcgagttctc tactggcggc cttggtggta ttaactacgg ctgcggtaat ggcgctgttg 361 ggatcaccgc tgaagatcgc gccggtatcg gctacgccgg cggcctcggc tacggatcag 421 gctacggact cggctacgga ggctacgccg gcaatggatg tggatgcggt ggcgcctatt 481 gaatgctaga atatatataa aaaaacacgt tttttttaat aaattccttt aaaaaaat // LOCUS BMODNA 586 bp ss-mRNA INV 11-AUG-1990 DEFINITION B.mori early chorion protein mRNA, 3' end. ACCESSION M36290 KEYWORDS early chorion protein; eggshell protein. SOURCE B.mori, cDNA to mRNA, clone m6F6. ORGANISM Bombyx mori Eukaryota; Animalia; Metazoa; Arthropoda; Uniramia; Insecta; Pterygota; Neoptera; Holometabola; Lepidoptera; Ditrysia; Bombycoidea; Bombycidae. REFERENCE 1 (bases 1 to 586) AUTHORS Eickbush,T.H., Rodakis,G.C., Lecanidou,R. and Kafatos,F.C. TITLE A complex set of early chorion DNA sequences from Bombyx mori JOURNAL Dev. Biol. 112, 368-376 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 483 early chorion protein precursor (AA at 1) sigp < 1 39 early chorion protein (AA at 1) matp 40 480 early chorion protein signal 575 580 poly-A signal BASE COUNT 117 a 133 c 166 g 170 t ORIGIN 1 ttgtctggat gtgctctcgt ggctcagttc acagttggcc agtacatcgg ccgcgtgaac 61 aatggttgtg gatgcgggag tttcgacaac cgtggcctcg gttacaccgc tggctgtggt 121 ctcaccgctg ctagttctct tgcagcctcc catggaggag ggttattcgt cgtcacctcc 181 tctgccgcgc ctgctggtct cggcatagct tccgagaaca gatacgaagg cgctgtcgat 241 gtgtgcggca acattccatt cttgggcacc gctgatgtcg caggcgagtt ccccactgcg 301 ggcattggtg agatcaacta cggctgcggc gatggagcag tcgccattac cgctgaaggt 361 ggtctcggct acgctggagg acttgactac actggtggac tcggctatgc tagtggactt 421 ggctacggct taggctatgg agaatacgtt ggatgcagtt gtggttgtgg tgacatctac 481 tagaatggat attgttatat taatcttgta caagtatgta cattcaggtt gtaattttta 541 caagaattaa attttatttt tccgaataaa ctctggtata taattt // LOCUS BTHINSECT 977 bp ds-DNA BCT 11-AUG-1990 DEFINITION B.thuringiensis 27.3 Kd cytolytic insecticidal protein gene, complete cds. ACCESSION M35968 KEYWORDS 27.3 Kd cytolytic insecticidal protein. SOURCE B.thuringiensis morrisoni (mosquitocidal isolate PG-14) DNA. ORGANISM Bacillus thuringiensis Prokaryota; Bacteria; Firmicutes; Endospore-forming rods and cocci; Bacillaceae. REFERENCE 1 (bases 1 to 977) AUTHORS Galjart,N.J., Sivasubramanian,N. and Federici,B.A. TITLE Plasmid location, cloning, and sequence analysis of the gene encoding a 27.3-kilodalton cytolytic protein from Bacillus thuringiensis subsp. morrisoni (PG-14) JOURNAL Curr. Microbiol. 16, 171-177 (1987) STANDARD simple staff_review FEATURES from to/span description pept 67 816 27.3 Kd cytolytic insecticidal protein BASE COUNT 349 a 161 c 149 g 318 t ORIGIN 1 aagtttaaaa tcataattta atgttgaaag gccactattc taattaactt aaggagttgt 61 ttatttatgg aaaatttaaa tcattgtcca ttagaagata taaaggtaaa tccatggaaa 121 acccctcaat caacagcaag ggttattaca ttacgtgttg aggatccaaa tgaaatcaat 181 aatcttcttt ctattaacga aattgataat ccgaattata tattgcaagc aattatgtta 241 gcaaatgcat ttcaaaatgc attagttccc acttctacag attttggtga tgccctacgc 301 tttagtatgg caaaaggttt agaaatcgca aacacaatta caccgatggg tgctgtagtg 361 agttatgttg atcaaaatgt aactcaaacg aataaccaag taagtgttat gattaataaa 421 gtcttagaag tgttaaaaac tgtattagga gttgcattaa gtggatctgt aatagatcaa 481 ttaactgcag cagttacaaa tacgtttaca aatttaaata ctcaaaaaaa tgaagcatgg 541 attttctggg gcaaggaaac tgctaatcaa acaaattaca catacaatgt cctgtttgca 601 atccaaaatg cccaaactgg tggcgttatg tattgtgtac cagttggttt tgaaattaaa 661 gtatcagcag taaaggaaca agttttattt ttcacaattc aagattctgc gagctacaat 721 gttaacatcc aatctttgaa atttgcacaa ccattagtta gctcaagtca gtatccaatt 781 gcagatctta ctagcgctat taatggaacc ctctaatctt agtagctata tttattaaag 841 atggtaatat cacaagtata aatacttgtg gtattaccta ccattcttaa attatatcca 901 aaatcatgcg ttaatctaca ttcccctttc tctaaaattt gttcttcaca catccacatt 961 tttcgataaa acaatct // LOCUS ECOPYRI 133 bp ds-DNA BCT 11-AUG-1990 DEFINITION E.coli ATCase regulatory protein (pyrI) gene, 3' end. ACCESSION M28578 KEYWORDS ATCase regulatory protein. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 133) AUTHORS Cunin,R., Jacobs,A., Charlier,D., Crabeel,M., Herve,G., Glansdorff,N. and Pierard,A. TITLE Structure-function relationship in allosteric aspartate carbamoyltransferase from Escherichia coli: I. Primary structure of a pyrI gene encoding a modified regulatory subunit JOURNAL J. Mol. Biol. 186, 707-713 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 81 ATCase regulatory protein (pyrI) (EC 2.1.3.2) signal 100 129 transcription termination signal BASE COUNT 41 a 27 c 29 g 36 t ORIGIN 1 cgaaaacgcg ccaatgatat cgcgctcaaa tgcaaatact gtgaaaaaga gttttcccat 61 aatgtggtgc tggccaatta attgcggttg gtaataaaag ttggctccct ataatgagcc 121 agacttttta ccg // LOCUS HUMBHD 1269 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human placental 17-beta-hydroxysteroid dehydrogenase mRNA, complete cds. ACCESSION M36263 KEYWORDS placental 17-beta-hydroxysteroid dehydrogenase. SOURCE Human placenta, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1269) AUTHORS Peltoketo,H., Isomaa,V., Maeentausta,O. and Vihko,R. TITLE Complete amino acid sequence of human placental 17-beta- hydroxysteroid dehydrogenase deduced from cDNA JOURNAL FEBS Lett. 239, 73-77 (1988) STANDARD simple staff_review FEATURES from to/span description pept 10 996 placental 17-beta-hydroxysteroid dehydrogenase precursor (EC 1.1.1.62) matp 13 993 placental 17-beta-hydroxysteroid dehydrogenase mRNA < 1 1269 placental 17-beta-hydroxysteroid dehydrogenase mRNA BASE COUNT 220 a 378 c 437 g 234 t ORIGIN 1 agtctcacca tggcccgcac cgtggtgctc atcaccggct gttcctcggg catcggcctg 61 cacttggccg tacgtctggc ttcagatcca tcccagagct tcaaagtgta tgccacgttg 121 agggacctga aaacacaggg ccggctgtgg gaggcggccc gggccctggc atgccctccg 181 ggatccctgg agacgttgca gctggacgta agggactcaa aatccgtggc cgctgcccgg 241 gaacgcgtga ctgagggccg cgtggacgtg ctggtgtgta acgcaggcct gggcctgctg 301 gggccgctgg aggcgctggg ggaggacgcc gtggcctctg tgctggacgt gaatgtagta 361 gggactgtgc ggatgctgca ggccttcctg ccagacatga agaggcgcgg ttcgggacgc 421 gtgttggtga ccgggagcgt gggaggattg atggggctgc ctttcaatga cgtttattgc 481 gccagcaagt tcgcgctcga aggcttatgc gagagtctgg cggttctgct gctgcccttt 541 ggggtccact tgagcctgat cgagtgcggc ccagtgcaca ccgccttcat ggagaaggtg 601 ttgggcagcc cagaggaggt gctggaccgc acggacatcc acaccttcca ccgcttctac 661 caatacctcg cccacagcaa gcaagtcttt cgcgaggcgg cgcagaaccc tgaggaggtg 721 gcggaggtct tcctcaccgc tttgcgcgcc ccgaagccga ccctgcgcta cttcaccacc 781 gagcgcttcc tgcccctgct gcggatgcgc ctggacgacc ccagcggctc caactacgtc 841 accgccatgc accgggaagt gttcggcgac gttccggcaa aggccgaggc tggggccgag 901 gctgggggcg gggccgggcc tggggcagag gacgaggccg ggcgcagtgc ggtgggggac 961 cctgagctcg gcgatcctcc ggccgccccg cagtaaaggc ttcctcagcc gctgtctccc 1021 gcgcccttct ttgtcccctg ggtctgtgtg gtccctgggg atggggcggc ggtagcagct 1081 gtgggtggct aattaagata gatcgcgtta gccagtttta ccagcgcagc taggcgcgat 1141 ggcgtcgcct gtaatgccag cgctttggga ggcggaggca ggaggatcgc tcaagccccg 1201 gagttggaga ccagccagag caacacagtg agacccccat ctctacaaaa ataaagaaaa 1261 tttaaaaat // LOCUS HUMC1Q 84 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human complement component 1, beta-chain (C1QB) mRNA, partial cds. ACCESSION M36278 KEYWORDS complement component 1. SOURCE Human liver, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 84) AUTHORS Reid,K.B.M., Bentley,D.R. and Wood,K.J. TITLE Cloning and characterization of the complementary DNA for the B chain of normal human serum C1q JOURNAL Philos. Trans. R. Soc. Lond., B, Biol. Sci. 306, 345-354 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 84 complement 1-q subcomponent beta-chain (AA at 1) /hgml_locus_uid="LR0054Y" /nomgen="C1QB" /map="1p" BASE COUNT 18 a 25 c 22 g 19 t ORIGIN 1 gccaccgaca agaactcact actgggcatg gagggtgcca acagcatctt ttccgggttc 61 ctgctctttc cagatatgga ggcc // LOCUS HUMGAPDA 197 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human glyceraldehyde-3-phosphate dehydrogenase mRNA, 3' flank. ACCESSION M36164 KEYWORDS glyceraldehyde-3-phosphate dehydrogenase. SOURCE Human pancreatic adenocarcinoma cell line HPAF, cDNA to mRNA, clone pII5B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 197) AUTHORS Schek,N., Hall,B.L. and Finn,O.J. TITLE Increased glyceraldehyde-3-phosphate dehydrogenase gene expression in human pancreatic adenocarcinoma JOURNAL Cancer Res. 48, 6354-6359 (1988) STANDARD simple staff_review FEATURES from to/span description signal 175 180 poly-A signal /hgml_locus_uid="LM0055R" /nomgen="GAPD" /map="12p13" BASE COUNT 50 a 69 c 46 g 31 t 1 others ORIGIN 1 ccctggacca ccagccccag caagagcaca agaggaagag agagaccctc actgctgggg 61 agtccctgcc acactcagtc ccccaccaca ctgaatctcc cctcctcaca gttgccatgt 121 agaccccttg aagaggggag gggcctaggg agccgcacct tgtcatgtac catcaataaa 181 gtaccctgtg ctnaacc // LOCUS HUMGYPAA 445 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human glycophorin A mRNA, partial cds. ACCESSION M36281 KEYWORDS glycophorin A. SOURCE Human, cDNA to mRNA, clone pHGpA-3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 445) AUTHORS Siebert,P.-D. and Fukuda,M. TITLE Molecular biological study of the structure and expression of human glycophorin A JOURNAL Rev. Fr. Transfus. Immunohematol. 24, 251-266 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 445 glycophorin A precursor (AA at 2) /hgml_locus_uid="LW0071X" /nomgen="GYPA" /map="4q28-q31" sigp < 1 67 glycophorin A signal peptide (AA at 1) matp 68 > 445 glycophorin A BASE COUNT 135 a 103 c 84 g 123 t ORIGIN 1 tgatctcagg atgtatggaa aaataatctt tgtattacta ttgtcagcaa ttgtgagcat 61 atcagcatca agtaccactg gtgtggcaat gcacacttca acctcttctt cagtcacaaa 121 gagttacatc tcatcacaga caaatgatac gcacaaacgg gacacatatg cagccactcc 181 tagagctcat gaagtttcag aaatttctgt tagaactgtt taccctccag aagaggaaac 241 cggagaaagg gtacaacttg cccatcattt ctctgaacca gagataacac tcattatttt 301 tggggtgatg gctggtgtta ttggaacgat cctcttaatt tcttacggta ttcgccgact 361 gataaagaaa agcccatctg atgtaaaacc tctcccctca cctgacacag acgtgccttt 421 aagttctgtt gaaattgaga acccc // LOCUS HUMGYPCAA 915 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human glycophorin C mRNA, complete cds. ACCESSION M36284 KEYWORDS glycophorin C; syaloglycoprotein. SOURCE Human reticulocytes and fetal liver, cDNA to mRNA, clones pGCW5 and pGCF22. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 915) AUTHORS Cartron,J.-P., Colin,Y., Le Van Kim,C., Rahuel,C., Blanchard,D., Bloy,C. and London,J. TITLE Structure of human erythrocyte glycophorin C deduced from cDNA analysis JOURNAL Rev. Fr. Transfus. Immunohematol. 24, 267-285 (1986) STANDARD simple staff_review COMMENT The first 6 nucleotides at the 5' end were not present in the cDNA clones but where deduced from the known protein sequence. FEATURES from to/span description pept 1 387 glycophorin C /hgml_locus_uid="LM0084R" /nomgen="GYPC" /map="2q14-q21" mRNA < 1 915 glycophorin C mRNA BASE COUNT 228 a 249 c 262 g 176 t ORIGIN 1 atgtggtcga cgagaagccc caacagcacg gcgtggcctc tcagcctcga gcctgatccg 61 gggatggcct ctgcctccac cacaatgcat actaccacca ttgcagagcc tgatccaggg 121 atgtctggat ggccggatgg cagaatggag acctccaccc ccaccataat ggacattgtc 181 gtcattgcag gtgtgattgc tgctgtggcc atcgtcctag tctccctcct cttcgtcatg 241 ctgcgctaca tgtaccggca caagggcacg taccacacca atgaggccaa gggcacggag 301 tttgctgaga gtgcagatgc agccctgcag ggagaccctg ccctccaaga tgctggtgat 361 agcagcagaa aggagtactt tatttgaggg acaacagact tcacttccct gaatgcctcc 421 cccatctcca tcaggaaaaa tacaccccat cgcccagtat ccccgtcgat accaccagac 481 agagagagag agtacacttg atttcttccc gagatagtta cctagaaaca ctaggtgcct 541 gcccaaggag gaacggagga ggactcgcgc tacaagaggc cactcccagg gacccaggga 601 ggcgatggcc accccagagg ccaccttttg ctccacggag gtgggagaga atctgggcac 661 atggggcccc ctagggcagt gcaggacaac atcagctcac tggcaggaaa gtccttgttg 721 agggtgaggg ggtgctgggg tacccggggg ctggggaagc aaggaaataa gtcatctgta 781 tgctgactgg ggataatggc atcaatgtca gtccttgact ttggggggaa cagcaggtgc 841 cagagctaaa aggtaccttt gtctgccatt gatccagcta agaacgattg gaaataaatt 901 ggaaatgtaa ccgag // LOCUS HUMIGDD 102 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36090 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient AW. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 102) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 102 Ig H-chain VDJ-region (AA at 2) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 26 a 23 c 36 g 17 t ORIGIN 1 gagagctgag gacacggctg tgtattactg tgcgaaagat cgagcagcaa cggacgggct 61 actactacgg tatggacgtc tggggccaag ggaccacggt ca // LOCUS HUMIGDE 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36091 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient AW. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig H-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 22 a 24 c 30 g 24 t ORIGIN 1 cctgcagctg aactctgtaa ctcccgagga cacggctgtg tattactgtg caagaggact 61 gggtggcttt tgatatctgg ggccaaggga caatcgtcac // LOCUS HUMIGDF 101 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36092 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient AW. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 101) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 101 Ig heavy-chain V-D-J region (AA at 3) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 17 a 27 c 34 g 23 t ORIGIN 1 ccgccgcgga cacggccgtg tattactgtg ccagggtagg atattgtact aatggtgttt 61 ccgggatggt tcgacccctg gggccaggga accctggtca c // LOCUS HUMIGDG 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36093 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 22 a 34 c 25 g 19 t ORIGIN 1 ccatcacact cctacctgca tggagcacct aaggcctcgg acaccgccat gtattactgt 61 gcgaccattg actactgggg ccagggaacc ctggtcacgg // LOCUS HUMIGDH 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36094 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 23 a 30 c 24 g 23 t ORIGIN 1 ccatcacact cctacctgca tggagcacct aaggcctcgg acaccgccat gtattactgt 61 gcgagggggg atggttacta ctactactac atggacgtct // LOCUS HUMIGDI 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36095 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept.ps < 1 > 100 Ig heavy-chain V-D-J region (AA at 3) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 25 a 22 c 30 g 23 t ORIGIN 1 aaatgaacag cctgagagct gaggacacgg ctgtgtattg aacccacggt ggtagctgct 61 actctgcttt tgatatctgg ggccaaggga caatggtcac // LOCUS HUMIGDJ 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36096 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 24 a 23 c 30 g 23 t ORIGIN 1 tctgagagcc gaggacacgg ctgtgtatta ctgtgcaaca gggccgtcac tggttattat 61 aaactactac tacggtatgg acgtctgggg ccaagggcca // LOCUS HUMIGDK 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36097 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 2) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 26 a 24 c 33 g 17 t ORIGIN 1 ggcgtatctg caaatgaaca gcctgaaaac cgaggacacg gccgtgtatt actgtactag 61 acgggacggt atggacgtct ggggccaagg gaccacggtc // LOCUS HUMIGDL 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene V-D-J region, partial cds. ACCESSION M36098 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain V-D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 21 a 18 c 33 g 28 t ORIGIN 1 acggccgtgt attactgtgc gagagatagg ggatttctac gagtagtggt ggtagctgct 61 actctgcttt tgatatctgg ggccaaggga caatggtcac // LOCUS HUMIGDM 100 bp ds-DNA PRI 11-AUG-1990 DEFINITION Human Ig rearranged heavy-chain gene D-J region, partial cds. ACCESSION M36099 KEYWORDS diversity region; immunoglobulin heavy-chain; joining exon; processed gene. SOURCE Human acute lymphoblastic leukemia B-cell line DNA, from patient TD. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Bird,J., Galili,N., Link,M., Stites,D. and Sklar,J. TITLE Continuing rearrangement but absence of somatic hypermutation in immunoglobulin genes of human B cell precursor leukemia JOURNAL J. Exp. Med. 168, 229-245 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 100 Ig heavy-chain D-J region (AA at 1) /hgml_locus_uid="LR0220V" /nomgen="IGH@" /map="14q32.33" BASE COUNT 24 a 15 c 29 g 32 t ORIGIN 1 ggtaggttta gaatgaggtc tgtgtcactg tggtattacg atattttgac tggttattat 61 aaactactac tacggtatgg acgtctgggg ccaagggcca // LOCUS LAMPYRIB 135 bp ds-DNA PHG 11-AUG-1990 DEFINITION Bacteriophage lambda mutant truncated regulatory protein (pyrI) gene from E.coli, 3' end. ACCESSION M28579 KEYWORDS ATCase regulatory protein. SOURCE Bacteriophage lambda (strain lambda OTC3) DNA. ORGANISM Bacteriophage lambda Viridae; ds-DNA nonenveloped viruses; Siphoviridae. REFERENCE 1 (bases 1 to 135) AUTHORS Cunin,R., Jacobs,A., Charlier,D., Crabeel,M., Herve,G., Glansdorff,N. and Pierard,A. TITLE Struture-function relationship in allosteric aspartate carbamoyltransferase from Escherichia coli: I. Primary structure of a pyrI gene encoding a modified regulatory subunit JOURNAL J. Mol. Biol. 186, 707-713 (1985) STANDARD simple staff_review FEATURES from to/span description pept < 1 75 ATCase regulatory protein BASE COUNT 54 a 24 c 22 g 35 t ORIGIN 1 cgaaaacgcg ccaatgatat cgcgctcaaa tgcaaatact gtgaaaaaga gttttatact 61 aagttggcat tataaaaaag cattgcttat caatttgttg caagcaacag gtcactatca 121 gtcaaaataa aatca // LOCUS MUSBGALA 240 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse beta-1,4-galactosyltransferase mRNA, 5' end. ACCESSION M36289 KEYWORDS beta-1,4-galactosyltransferase. SOURCE Mouse, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 240) AUTHORS Shaper,J.H., Hollis,G.F. and Shaper,N.L. TITLE Evidence for two forms of murine beta-1,4-galactosyltransferase based on cloning studies JOURNAL Biochimie 70, 1683-1688 (1988) STANDARD simple staff_review FEATURES from to/span description pept 52 > 240 beta-1,4-galactosyltransferase (long form) precursor sigp 52 111 beta-1,4-galactosyltransferase (long form) signal peptide (put.) matp 112 > 240 beta-1,4-galactosyltransferase (long form) pept 91 > 240 beta-1,4-galactosyltransferase (short form) BASE COUNT 27 a 91 c 77 g 45 t ORIGIN Chromosome 4. 1 gggccgtcct ctcagccgta gcccaccccc tcttaaagcc gcggcgggaa gatgaggttt 61 cgtgagcagt tcctgggcgg cagcgccgcg atgccgggcg cgaccctgca gcgggcctgc 121 cgcctgctcg tggccgtctg cgcgctgcac ctcggcgtca ccctcgtcta ttacctctct 181 ggccgcgatc tgagccgcct gccccagttg gtcggagtct cctctacact gcagggcggc // LOCUS MUSIGHADN 348 bp ds-DNA ROD 11-AUG-1990 DEFINITION Mouse rearranged Ig heavy-chain gene V-D-J3 region, partial cds. ACCESSION M36103 KEYWORDS immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma GA733 DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Caton,A.J. TITLE Comparative sequences analysis of CO17-1A antigen-specific mono- clonal antibodies JOURNAL Hybridoma 5, 11-16 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 348 Ig heavy-chain V-J region (AA at 1) recomb 294 303 V-region end/D-region start recomb 303 304 D-region end/J3-region start BASE COUNT 91 a 83 c 91 g 83 t ORIGIN 1 caggttcagc tacagcagtc tggagctgag gtgatgaggc ctggggcctc agtgaagata 61 tcctgcaagg ctactggcta cacattcact aggtactaca tacaatgggg taaaaacaga 121 gggcctggac atggccttga gtggattgga gagattttac ctggaactct tactaattac 181 aatgagaaat tcaagggcaa ggccgcattc actgcagata gatcctccaa cacagcctac 241 atgcaactca gcagccttac atctgaggac tctgccgtct attactgtgc aagagatggt 301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca // LOCUS MUSIGHADO 348 bp ds-DNA ROD 11-AUG-1990 DEFINITION Mouse rearranged Ig heavy-chain gene V-D-J3 region, partial cds. ACCESSION M36104 KEYWORDS immunoglobulin heavy-chain; joining exon; processed gene; variable region. SOURCE Mouse hybridoma CO17A-1A DNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 348) AUTHORS Caton,A.J. TITLE Comparative sequences analysis of CO17-1A antigen-specific mono- clonal antibodies JOURNAL Hybridoma 5, 11-16 (1986) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 348 Ig heavy-chain V-J region (AA at 1) recomb 294 303 V-region end/D-region start recomb 303 304 D-region end/J3-region start BASE COUNT 81 a 79 c 101 g 87 t ORIGIN 1 caggtccagt tgcagcagtc tttagctgag ctggtaaggc ctgggacttc agtgaaggtg 61 tcctgcaagg cttctggata cgccttcact aattacttga tagagtgggt aaagcagagg 121 cctggacagg gccttgagtg gattggggtg attaatcctg gaagtggtgg tactaactac 181 aatgagaagt tcaagggcaa ggcaacactg actgcagaca aatcctccag cactgcctac 241 atgcagctca gcagcctgac atctgatgac tctgcggtct atttctgtgc aagagatggt 301 ccctggtttg cttactgggg ccaagggact ctggtcactg tctctgca // LOCUS RICGICR 524 bp ds-DNA PLN 11-AUG-1990 DEFINITION O.sativa Gly-tRNA gene. ACCESSION M35993 KEYWORDS transfer RNA-Gly. SOURCE O.sativa DNA. ORGANISM Oryza sativa Eukaryota; Plantae; Embryobionta; Magnoliophyta; Liliopsida; Commelinidae; Cyperales; Poaceae. REFERENCE 1 (bases 1 to 524) AUTHORS Reddy,P.S. and Padayatty,J.D. TITLE Effects of 5' flanking sequences and changes in the 5' internal control region on the transcription of rice Gly-tRNA-GCC gene JOURNAL Plant Mol. Biol. 11, 575-583 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA 381 451 Gly-tRNA anticdn 413 415 Gly-tRNA anticodon cgg BASE COUNT 166 a 72 c 98 g 187 t 1 others ORIGIN 1 gcgcatgttt gtttcagctt aagattattg taatntagat tatttaaatc aagattactt 61 taagtttgga ttataataag ctgatataaa ataagctaca agttgtttgt ttttctagat 121 tattaaaggc atctaagggt agtgggtttt tagccacttc aattaatctg gaaaaagctc 181 atatagaaaa gattattgga ttatagtaat ctggcttata gttataataa tctagcatca 241 ataatctact tgtttgtttc agcttacttc taataattta gtttataata attctgaaac 301 tgaattaaac agggtcttag acatatgagt tggatttgtg tagagctgaa attagtttac 361 tcgtagaatc aagaacgaaa gcaccagtgg tctagtggta gaatagtacc ctgccacggt 421 acagacccgg gttcgattcc cggctggtgc attattaatt tttggctctt ccattttaga 481 taaagacaat tggcaacttg atgcatttat gaacaaatct gcgc // LOCUS M27244 1413 bp ds-DNA UNA 11-AUG-1990 DEFINITION Figure 2. The sequence and predicted coding capacity of the HOX-2.5 h14 cDNA. ACCESSION M34857 M27244 REFERENCE 1 (bases 1 to 1413) AUTHORS Bogarad,L.D., Utset,M.F., Awgulewitsch,A., Miki,T., Hart,C.P. and Ruddle,F.H. TITLE the developmental expression pattern of a new murine homeo box gene: hox-2.5 JOURNAL Dev. Biol. 133, 537-549 (1989) STANDARD unannotated staff_entry COMMENT FEATURES from to/span description BASE COUNT 366 a 386 c 311 g 350 t ORIGIN 1 gcgccggggg agctgctcaa acagggcacg cccgagtaca gtttggaaac ttcggcgggc 61 agggaggctg tcctgtctaa tcaaagagct ggctacgggg acaataaaat ttgcgaagga 121 agcgaggaca aagagaggcc gggtcaaacc aacccctctg ccaactggct gcacgcccgc 181 tcttcccgga aaaagcgctg tccctacacc aaataccaga cgctggagct agagaaggag 241 tttctgttca atatgtacct caccagggac cgcaggcatg aagtggccag actcctcaat 301 ctgagtgaga gacaagtcaa aatctggttt cagaaccggc ggatgaaaat gaagaagatg 361 aataacggag caggcaaaga gtaaagatga ccaccccccc cttccccagc tcactcttat 421 tatttatgtg atggtcaaaa agccactgct gtctgggtgt acccaacgtg agtggggaag 481 agtatctcct ctttaaaatc cctcatctgc acctggagcc tgtctcctct cctttgcgcc 541 tctctctctc tctctctctc tctctctctc tctctctctc tctctctcca ctctcctggt 601 gttaagagga aattttatta agttagaagc tagctatact tggttcctag aaaagtgatg 661 agcctcgctc cgaggaaaga gaccctagtc gagctcctag catccctgct aatttttctg 721 gaaaacccca gcccctctct tccagtctgc ctgttccctc cctgtaacct agccagacgc 781 cacccaggta cagtttaact gcacacagcc cagtggatgc ccacagacat agagtttgga 841 gtctatgcta atcatcgtgg agaaaggaca tcaagacgtt gttcatcaaa ccaagcaggg 901 cagctctcag atcaggttca ttttggttgt gccctgatcc aaacacacac taggaagatg 961 tctagagatg ccctcaatga ggcactccag tcccaaagtc tcaacatccc aatatgtatc 1021 ctatggaatt gccagctaga cccatatctg gggaggcctg tgggagatgt ggccattcct 1081 gtccccaagc tgttgtgagc tagaaagggc aagaaaaagc agggagtggt tttatgaagg 1141 taccattagt ggagtcccct cgtattttcc ctgcaggaag tgtttaattc tgcatgtttc 1201 ctttccttcc aactaaagga ggtcaaagtg ggttgtagag ccctgacagt gctgtttccc 1261 cattcataca ttcctatcct catttgacag aatgtaactt ttcttgctga ccacagtctc 1321 agatttctct ctatcctcca actttgaaat caactctcct ttccaccaaa cccatcttgc 1381 accagcaaag acaagtctcc cattgtaaaa aaa // LOCUS CHKBGLOA 168 bp ds-DNA VRT 11-AUG-1990 DEFINITION Chicken beta-3-globin gene, exon N. ACCESSION M33204 KEYWORDS beta-3-globin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 168) AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D. TITLE Chromosomal arrangement of the chicken beta-type globin genes JOURNAL Cell 24, 669-677 (1981) STANDARD simple staff_review FEATURES from to/span description pept / 7 / 168 beta-3-globin, exon N (AA at 7) IVS < 1 6 beta-3-globin intron N (no splice consensus) BASE COUNT 37 a 58 c 38 g 35 t ORIGIN 1 tacaggctgc tgatcgtcta cccctggacc cagaggttct ttgcgtcctt tgggaacctc 61 tccagcccca ctgccatcct tggcaacccc atggtccgcg cccacggcaa gaaagtgctc 121 acctcctttg gggatgctgt gaagaaccta gacaacatca agaacacc // LOCUS CHKBGLOB 90 bp ds-DNA VRT 11-AUG-1990 DEFINITION Chicken beta-1-globin gene, exon N. ACCESSION M33205 KEYWORDS beta-1-globin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 90) AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D. TITLE Chromosomal arrangement of the chicken beta-type globin genes JOURNAL Cell 24, 669-677 (1981) STANDARD simple staff_review FEATURES from to/span description pept / 1 / 90 beta-1-globin, exon N (AA at 1) BASE COUNT 22 a 29 c 20 g 19 t ORIGIN 1 aggttctttg ataacttcgg gaacctctcc agccccaccg ccatcattgg taaccccaag 61 gtccgtgctc acggcaaaaa agtgctgagc // LOCUS CHKBGLOC 234 bp ds-DNA VRT 11-AUG-1990 DEFINITION Chicken beta-2-globin gene, exon N. ACCESSION M33206 KEYWORDS beta-2-globin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 234) AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D. TITLE Chromosomal arrangement of the chicken beta-type globin genes JOURNAL Cell 24, 669-677 (1981) STANDARD simple staff_review FEATURES from to/span description pept / 7 / 228 beta-2-globin, exon N (AA at 7) IVS < 1 6 beta-2-globin intron N (no splice consensus) IVS 229 > 234 beta-2-globin intron N+1 BASE COUNT 53 a 68 c 65 g 48 t ORIGIN 1 tgcaggctgc tgatcgtcta cccctggacc cagaggttct ttgcgtcctt tgggaacctg 61 tccagtgcca ccgccatcat tggcaacccc atggttcgcg cccacggcaa gaaagtgctg 121 agctcctttg gggaggctgt aaagaacctg gacaacatca agaagagctt tgcccagctg 181 agcaaactgc actgtgacaa gctgcacgtg gaccccgaga acttcagggt gagt // LOCUS CHKBGLOD 121 bp ds-DNA VRT 11-AUG-1990 DEFINITION Chicken beta-4-globin gene, exon N. ACCESSION M33207 KEYWORDS beta-4-globin. SOURCE Chicken DNA. ORGANISM Gallus gallus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Aves; Neornithes; Neognathae; Galliformes; Phasianidae. REFERENCE 1 (bases 1 to 121) AUTHORS Dolan,M., Sugarman,B.J., Dodgson,J.B. and Engel,J.D. TITLE Chromosomal arrangement of the chicken beta-type globin genes JOURNAL Cell 24, 669-677 (1981) STANDARD simple staff_review FEATURES from to/span description pept / 1 / 121 beta-4-globin, exon N (AA at 1) BASE COUNT 19 a 43 c 29 g 30 t ORIGIN 1 ctccttgggg acatcctgat catcgtcctg gcctcccact ttcccaggga tttcactcct 61 gcctgccaat ttgcctggca gaagctggtc aacgttgtgg cccatgccca ggcccgcaag 121 t // LOCUS ECOTRYA 87 bp ss-RNA RNA 11-AUG-1990 DEFINITION E.coli Tyr-tRNA. ACCESSION M35994 KEYWORDS transfer RNA-Tyr. SOURCE E.coli. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 87) AUTHORS Doctor,B.P., Loebel,J.E., Sodd,M.A. and Winter,D.B. TITLE Nucleotide sequence of Escherichia coli tyrosine transfer ribonucleic acid JOURNAL Science 163, 693-695 (1969) STANDARD simple staff_review FEATURES from to/span description tRNA 1 87 Tyr-tRNA anticdn 36 38 Tyr-tRNA anticodon rta modified 8 8 p modified 18 18 gm modified 20 20 q modified 41 41 p modified 66 66 p BASE COUNT 19 a 27 c 24 g 16 t 1 others ORIGIN 1 gggtgggttc cccgagcggg caaagggagc agactrtata tctgccggtc acagacttcg 61 aaggttcgaa tccttcccca cccacca // LOCUS HUMMHDCBA 142 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human MHC class II HLA-DC-beta chain mRNA, 5' end. ACCESSION M35982 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human, cDNA to mRNA, clone pII-beta-5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 142) AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A. TITLE Signal sequences distinguish class II histocompatibility antigen beta chains of different loci JOURNAL Scand. J. Immunol. 19, 91-97 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 142 MHC HLA-DC-beta chain precursor (AA at 2) sigp < 1 79 MHC HLA-DC-beta signal peptide matp 80 > 142 MHC HLA-DC-beta BASE COUNT 27 a 40 c 44 g 31 t ORIGIN 1 aggggggatc cccggagacc ttcgggtagc aactgtcacc ttgatgctgg cgatcctgag 61 ctcctcactg gctgagggca gagactctcc cgaggatttc gtgtaccagt ttaagggcct 121 gtgctacttc accaacggga cg // LOCUS HUMMHDRAA 157 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human MHC class II HLA-DR-alpha chain mRNA, 5' end. ACCESSION M35979 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human, cDNA to mRNA, clone pII-alpha-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 157) AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A. TITLE Signal sequences distinguish class II histocompatibility antigen beta chains of different loci JOURNAL Scand. J. Immunol. 19, 91-97 (1984) STANDARD simple staff_review FEATURES from to/span description pept 8 > 157 HLA-DR-alpha chain precursor sigp 8 82 HLA-DR-alpha chain signal peptide matp 83 > 157 HLA-DR-alpha chain BASE COUNT 43 a 31 c 40 g 43 t ORIGIN 1 gaagaaaatg gccattagtg gagtccctgt gctaggattt ttcatcatag ctgtgctgat 61 gagcgctcag gaatcatggg ctatcaaaga agaacatgtg atcatccagg ccgagttcta 121 tctgaatcct gaccaatcag gcgagtttat gtttgac // LOCUS HUMMHDRBB 157 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human MHC class II HLA-DR-beta chain mRNA, 5' end. ACCESSION M35980 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human, cDNA to mRNA, clone pII-beta-3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 157) AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A. TITLE Signal sequences distinguish class II histocompatibility antigen beta chains of different loci JOURNAL Scand. J. Immunol. 19, 91-97 (1984) STANDARD simple staff_review FEATURES from to/span description pept 8 > 157 MHC HLA-DR-beta chain precursor sigp 8 94 MHC HLA-DR-beta chain signal peptide matp 95 > 157 MHC HLA-DR-beta chain BASE COUNT 26 a 42 c 46 g 43 t ORIGIN 1 ctccagcatg gtgtgtctga ggctccctgg aggctcctgc atggcagttc tgacagtgac 61 actgatggtg ctgagctccc cactggcttt ggctggggac accagaccac gtttcttgga 121 gtactctacg tctgagtgtc atttcttcaa tgggacg // LOCUS HUMMHDRBC 154 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human MHC class II HLA-DR-beta chain mRNA, 5' end. ACCESSION M35981 KEYWORDS cell surface glycoprotein; class II gene; integral membrane protein; major histocompatibility complex. SOURCE Human, cDNA to mRNA, clone pII-beta-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 154) AUTHORS Gustafsson,K., Wiman,K., Larhammar,D., Rask,L. and Peterson,P.A. TITLE Signal sequences distinguish class II histocompatibility antigen beta chains of different loci JOURNAL Scand. J. Immunol. 19, 91-97 (1984) STANDARD simple staff_review FEATURES from to/span description pept 5 > 154 MHC HLA-DR-beta chain precursor sigp 5 91 MHC HLA-DR-beta chain signal peptide matp 92 > 154 MHC HLA-DR-beta chain BASE COUNT 27 a 37 c 49 g 41 t ORIGIN 1 cagcatggtg tgtctgaggc tccctggagg ctcctgcatg gcagttctga cagtgacact 61 gatggtgctg agctccccac tggctttggc tggggacacc agaccacgtt tcttggagga 121 ggttaagttt gagtgtcatt tcttcaacgg gacg // LOCUS MUSIAPA 494 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse intracisternal A particle mRNA, 5 'end. ACCESSION M36322 KEYWORDS intracisternal A particle. SOURCE Mouse endogenous A type retrovirus embryonal carcinoma cell line PCC3, cDNA to mRNA, clone EC26. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 494) AUTHORS Morgan,R.A., Christy,R.J. and Huang,R.C.C. TITLE Murine A type retroviruses promote high levels of gene expression in embryonal carcinoma cells JOURNAL Development 102, 23-30 (1988) STANDARD simple staff_review FEATURES from to/span description mRNA 477 > 494 intracisternal A particle mRNA BASE COUNT 111 a 124 c 125 g 134 t ORIGIN 1 ctgcagctag aaaggctctg cggcatctct gagcctattc tagggagaca tgtcatcttt 61 caagaaggtt gagtgtccaa gtgtccttct ctgcaggcaa aacgacaagg gagcaggtca 121 gggttcctct gggtaaaagc ctgtgagcct aagagcctaa tcctgtacat ggctccttaa 181 cctacacatt ggggatttga cctctatctc cactctcatt aatatgggtg gcctatttgc 241 ctcttattaa aaggaaaggg ggagatgttg ggagccgccc ccacattcgc cgttacaaga 301 tggcgctgac agctgtgttc taagtggtaa acaaataatc tgcgcatgtg ccgagggtgg 361 ttctccactc catgtctctg ccttccccgt gacgtcaact cggccatggc tgcagccaat 421 cagggagtga cacgtcctag gcgaaggata attctcctta atagggacgg gttttcgttt 481 tttctctctc tctt // LOCUS MUSIAPB 492 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse intracisternal A particle mRNA, 5 'end. ACCESSION M36323 KEYWORDS intracisternal A particle. SOURCE Mouse endogenous A type retrovirus embryonal carcinoma cell line PCC3, cDNA to mRNA, clone EC32. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 492) AUTHORS Morgan,R.A., Christy,R.J. and Huang,R.C.C. TITLE Murine A type retroviruses promote high levels of gene expression in embryonal carcinoma cells JOURNAL Development 102, 23-30 (1988) STANDARD simple staff_review FEATURES from to/span description mRNA 475 > 492 intracisternal A particle mRNA BASE COUNT 111 a 118 c 129 g 134 t ORIGIN 1 ctgcagtaga aaggctctgc ggcatattga gcctattcta gggagacatg tcatctttca 61 agaaggttga gtgtccaagt gtccttctct gcaggcaaaa cgacacggga gcaggtcagg 121 gttgcctctg ggtaaaagcc tgtgagccta agagctaatc ctgtaatggc tccttaacct 181 acacattggg gatttgacct cttctccact ctcattaata tgggtggcct atttgcctct 241 tattaaaagg aaagggggag atgttgggag ccgcccccac attcgccgtt acaagatggc 301 gtgacagctg tgttctaagt ggtaaacaaa taatctgcgc atgtgccgag ggtggttctc 361 cactccatgt gctctgcctt cccggtgacg tcaactcggc catggctgca gccaatcagg 421 gagtgacacg tcctaggcga aggataattc tccttaatag ggacgggttt tatcgttttt 481 tctcgctctc tt // LOCUS PT4SB 122 bp ds-DNA PHG 11-AUG-1990 DEFINITION Bacteriophage T4 mutant Ser-tRNA suppressor gene. ACCESSION M35988 KEYWORDS serine transfer-RNA. SOURCE Bacteriophage T4 (strain R24) DNA. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 122) AUTHORS McClain,W.H. TITLE Specific duplications fostered by a DNA structure containing adjacent inverted repeat sequences JOURNAL J. Mol. Biol. 204, 27-40 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA < 1 > 122 Ser-tRNA BASE COUNT 25 a 35 c 36 g 26 t ORIGIN 1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ccggagcgac 61 tgccggtctt taaaaccggc agtcgctccg gcgactcata ggttcaaatc ctatcgcctc 121 cg // LOCUS PT4SC 121 bp ds-DNA PHG 11-AUG-1990 DEFINITION Bacteriophage T4 mutant Ser-tRNA suppressor gene. ACCESSION M35989 KEYWORDS serine transfer-RNA. SOURCE Bacteriophage T4 (strain R66) DNA. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 121) AUTHORS McClain,W.H. TITLE Specific duplications fostered by a DNA structure containing adjacent inverted repeat sequences JOURNAL J. Mol. Biol. 204, 27-40 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA < 1 > 121 Ser-tRNA BASE COUNT 24 a 36 c 34 g 27 t ORIGIN 1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ctccggcgac 61 tccggtcttt aaaaccggca gtcgctccgg cgactcatag gttcaaatcc tatcgcctcc 121 g // LOCUS PT4SEA 122 bp ds-DNA PHG 11-AUG-1990 DEFINITION Bacteriophage T4 mutant Ser-tRNA suppressor gene. ACCESSION M35987 KEYWORDS serine transfer-RNA. SOURCE Bacteriophage T4 (strain R51) DNA. ORGANISM Bacteriophage T4 Viridae; ds-DNA nonenveloped viruses; Myoviridae. REFERENCE 1 (bases 1 to 122) AUTHORS McClain,W.H. TITLE Specific duplications fostered by a DNA structure containing adjacent inverted repeat sequences JOURNAL J. Mol. Biol. 204, 27-40 (1988) STANDARD simple staff_review FEATURES from to/span description tRNA < 1 > 122 tRNA-Ser BASE COUNT 24 a 35 c 36 g 27 t ORIGIN 1 ggaggcgtgg cagagtggtt taatgcaccg gtctttaaaa ccggcagtcg ctccggcgac 61 tgccggtttt aaagaccggc agtcgctccg gcgactcata ggttcaaatc ctatcgcctc 121 cg // LOCUS VFARDNAA 1234 bp ds-DNA PLN 11-AUG-1990 DEFINITION V.faba ribosomal spacer DNA repeat. ACCESSION M36318 KEYWORDS direct repeat; spacer DNA. SOURCE V.faba DNA, clone VER 1. ORGANISM Vicia faba Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 1234) AUTHORS Kato,A., Yakura,K. and Tanifuji,S. TITLE Repeated DNA sequences found in the large spacer of Vicia faba rDNA JOURNAL Biochim. Biophys. Acta 825, 411-415 (1985) STANDARD simple staff_review FEATURES from to/span description rpt 78 228 direct repeat copy A rpt 1064 1123 direct repeat copy A BASE COUNT 320 a 329 c 304 g 281 t ORIGIN 1 agacaagtcc atgcaaagga ctaggcaaga acgcattgcg taggccatgg tttgcgttgc 61 ttccgcctca tggcccaccg acacgcgaaa agccgaaaaa catgaaggga tcgcccgaat 121 ggtacgaaac cttgcattcc atctcaattt ggtgtcagat taatggatga aaaaattcag 181 accggaactc attgcgtagc ccgacgttgg cttgctttcc gccccttgga gaaaccgcac 241 acgcacgcgt tccggtcgga aattgaaagt agaccgaggg cttgtcgaaa cgctacgaaa 301 ctccttggac actctcaatg tgttatttgg gaagtgcatg caaaaatcca tcgggagatt 361 attcccctaa ggcgcgtttg cctccgctcc cgtcggaaaa accgcacggg ggcccgttcg 421 ggccggaaat tcaaaattgt cgaaacgcta cgaaactttg cagatccctt ctatgtgtta 481 tttgggactt ccatgcaaaa attcatcggg agattattcc cctaaggcgc gtttgcctcc 541 gctcccgttc atggagaaac cgcacacgca cgcgttccgg tcggaaattg aaagtagacc 601 gagggcttgt cgaaacgcta cgaaactcct tggacactct caatgtgtta tttgggaagt 661 gcatgcaaaa atccatcggg agattattcc cctaaggcgc gtttgcctcc gctcccgtcg 721 gaaaaaccgc acgggggccc gttcgggccg gaaattcaaa attgtcgaaa cgctacgaaa 781 ctttgcagat cccttctatg tgttatttgg gacttccatg caaaaattga tcgggagatt 841 attcccctaa ggcgcgtttg cctccgctcc cgttcatgga gaaaccgcac acgcacgcgt 901 tccggtcgga aattgaaagt agaccgaggg cttgtcgaaa cgctacgaaa ctccttggac 961 actctcaatg tgttatttgg gaagtgcatg caaaaatcca tcgggagatt attcccctaa 1021 ggcgcgtttg cctccgctcc cgtcggaaaa accgcacggg ggcccgacac gcgaaaagcc 1081 gaaatatatc gagggatcgc ccgaatggca tgaaacttgg catgccatct caatttggtg 1141 gcagattaat gcatgaaaaa ttccagaccg gaactcattc cgtagcccga cgtttgtttc 1201 cttccgcccc ttgaaaaaaa ctcaattgca cgcg // LOCUS VFARDNAB 149 bp ds-DNA PLN 11-AUG-1990 DEFINITION V.faba ribosomal spacer DNA repeat. ACCESSION M36319 KEYWORDS . SOURCE V.faba DNA, clone VER 6. ORGANISM Vicia faba Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Rosidae; Rosales; Fabaceaea. REFERENCE 1 (bases 1 to 149) AUTHORS Kato,A., Yakura,K. and Tanifuji,S. TITLE Repeated DNA sequences found in the large spacer of Vicia faba rDNA JOURNAL Biochim. Biophys. Acta 825, 411-415 (1985) STANDARD simple staff_review BASE COUNT 39 a 40 c 35 g 35 t ORIGIN 1 ccgacacgcg aaaagccgaa atatatcgag ggatcgcccg aatggcatga aacttggcat 61 gccatctcaa tttggtggca gattaatgca tgaaaaattc cgaccggaac tcattccgta 121 gcccgacgtt tgtttgcttc cgccccttg // LOCUS XEBEBNA1 185 bp ds-DNA VRT 11-AUG-1990 DEFINITION X.borealis external transcribed spacer and 18S rRNA gene, 5' end. ACCESSION M36006 KEYWORDS 18S ribosomal RNA. SEGMENT 1 of 2 SOURCE X.borealis DNA, clone pXbr101. ORGANISM Xenopus borealis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 185) AUTHORS Furlong,J.C. and Maden,B.E.H. TITLE Patterns of major divergence between the internal transcribed spacers of ribosomal DNA in Xenopus borealis and Xenopus laevis, and of minimal divergence within ribosomal coding regions JOURNAL EMBO J. 2, 443-448 (1983) STANDARD simple staff_review FEATURES from to/span description rRNA 23 > 185 18S ribosomal RNA site 1 22 external transcribed spacer BASE COUNT 43 a 44 c 46 g 52 t ORIGIN 1 ggttcccccc ggagccgagg gctacctggt tgatcctgcc agtagcatat gcttgtctca 61 aagattaagc catgcacgtg taagtacgca cggccggtac agtgaaactg cgaatggctc 121 attaaatcag ttatggttcc tttgatcgct ccatctgtta cttggataac tgtggtaatt 181 ctaga // LOCUS XEBEBNA2 1589 bp ds-DNA VRT 11-AUG-1990 DEFINITION X.borealis 18S rRNA gene, 3' end, 5.8S rRNA gene, complete, ITS2, and 28S ribosomal RNA gene, 5 ' end. ACCESSION M36008 M36010 M36012 M36014 M36016 KEYWORDS 18S ribosomal RNA; 28S ribosomal RNA; 5.8S ribosomal RNA. SEGMENT 2 of 2 SOURCE X.borealis DNA, clone pXbr101. ORGANISM Xenopus borealis Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Amphibia; Lissamphibia; Anura; Archeobatrachia; Pipoidea; Pipidae. REFERENCE 1 (bases 1 to 1589) AUTHORS Furlong,J.C. and Maden,B.E.H. TITLE Patterns of major divergence between the internal transcribed spacers of ribosomal DNA in Xenopus borealis and Xenopus laevis, and of minimal divergence within ribosomal coding regions JOURNAL EMBO J. 2, 443-448 (1983) STANDARD simple staff_review FEATURES from to/span description rRNA < 1 1 18S ribosomal RNA site 401 954 internal transcribed spacer 1 (ITS1) rRNA 955 1116 5.8S ribosomal RNA site 1117 1452 internal transcribed spacer 2 (ITS2) rRNA 1453 > 1589 28S ribosomal RNA BASE COUNT 287 a 560 c 546 g 196 t ORIGIN About 1500 bp after segment 1. 1 gaattcccag taagtgcggg tcataagctc gcgttgatta agtccctgcc ctttgtacac 61 accgcccgtc gctactaccg attggatggt ttagtgaggt cctcggatcg gccccgccgg 121 ggtcggcaac ggccctggcg gagcgccgag aagacgatca aacttgacta tctagaggaa 181 gtaaaagtcg taacaaggtt tccgtaggtg aacctgcgga aggatcatta accgcccgtc 241 gctactaccg attggatggt ttagtgaggt cctcggatcg gccccgccgg ggtcggcaac 301 ggccctggcg gagcgccgag aagacgatca aacttgacta tctagaggaa gtaaaagtcg 361 taacaaggtt tccgtaggtg aacctgcgga aggatcatta acgagagagg gcgagaggct 421 cacggtgcga gagcgcggct ctgccgcacc cccgccaccg tcccggctcc ccccgcggac 481 gacgacaggt ccgggcgcgg gctcccgccc gccccgacac gaggagagag cggaaaagga 541 acaacaacaa aaaaaacgag cccgggtacc tagccggggg ggggcgaggg cggggacggg 601 gggcagaggg ccctccgacc ccccctccct cctcccgggc ggttcgaaga ccccggccca 661 cggccggcgg cggcgacgag gggcgagccg tccggaggcg gggggcgggc gcggaggggg 721 gacggggggg aggcggaggt cacggccgga cgcggccgcg tccctccgcc cgcccccccc 781 ccgtctgtcc gcgccccgcc ccgcgcgccg gtcgcgtcag tccccgccgc cccgtcggcg 841 gcgccccggg gcgcccgggg cccgggggcc cgccccgtcc agcttccacg gcgcccgcgc 901 gcgccggtac ccctgccgcg aggcaacgag agagaaaaac aaacgaaaaa aatctcgcga 961 ctcttagcgg tggatcactc ggctcgtgcg tcgatgaaga acgcagctag ctgcgagaat 1021 tagtgtgaat tgcaggacac attgatcatc gacacttcga acgcaccttg cggccccggg 1081 ttcctcccgg ggctacgcct gtctgagggt cgctccgacg tccatcgccc ccgccgggtc 1141 cccgacccgg ggaaaacggg cggcgcggct ggggccgtcg caggggtcga gggacgcccc 1201 cccaccgccc ttccggggga ggtgaggggg ccccgcgacc ccttcgtccc cccaaggcca 1261 gaccccgacc atcccggcgg ccccccccag ggccccggcg cggctgtctg tgggaaaccg 1321 atttcacggc tgccggcgcc gcgggcgctg gggacccgcc ggggcgggaa gagcccccgg 1381 gcgggccggg gccggggaag gcgcgcttga acccgcgccc ccccgcgtcc cccgcccgcc 1441 cccccgctcg actcagacct cagatcagac gtggcgaccc gctgaattta agcatattac 1501 taagcggagg aaaagaaact aaccaggatt cccccagtaa cggcgagtga agagggaaga 1561 gcccagcgcc gaatccccgc ccgcccggc // LOCUS BOVTGFB 1117 bp ss-mRNA MAM 11-AUG-1990 DEFINITION Bovine transforming growth factor-beta-1 (TGF beta-1) mRNA, 3' end. ACCESSION M36271 KEYWORDS transforming growth factor-beta-1. SOURCE Bovine fibropapilloma, cDNA to mRNA, (library of Okayama and Berg). ORGANISM Bos taurus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Artiodactyla; Ruminantia; Pecora; Bovidae. REFERENCE 1 (bases 1 to 1117) AUTHORS Van Obberghen-Schilling,E., Kondaiah,P., Ludwig,R.L., Sporn,M.B. and Baker,C.C. TITLE Complementary deoxyribonucleic acid cloning of bovine transforming growth factor-beta-1 JOURNAL Mol. Endocrinol. 1, 693-698 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 1 950 transforming growth factor-beta-1 precursor (AA 75 at 3) matp 612 947 transforming growth factor-beta-1 BASE COUNT 244 a 379 c 301 g 193 t ORIGIN 1 aggccatact ggccctttac aacagtaccc gcgaccgggt ggccggggaa agtgccgaaa 61 cggagcctga gccagaggcg gactactacg ccaaggaggt cacccgcgtg ctaatggtgg 121 aatacggcaa caaaatctat gacaaaatga agtctagctc gcacagcata tatatgttct 181 tcaacacgtc cgagctccgg gaagcggtgc ccgaacctgt gttgctctct cgggcagacg 241 tgcgcctgct gaggctcaag ttaaaagtgg agcagcacgt ggagctgtac cagaaatata 301 gcaacaattc ctggcgctac ctcagcaacc ggctgctcgc ccccagcgac tcaccggagt 361 ggctgtcctt tgacgtcact ggagttgtgc ggcagtggct gacccgcaga gaggaaatag 421 agggctttcg cctcagtgcc cactgttcct gtgacagtaa agataacacg cttcaagtgg 481 acattaacgg gttcagttcc ggccgccggg gtgacctcgc caccattcac ggcatgaacc 541 ggcccttcct gctcctcatg gccacccctc tggagagggc ccagcacctg cacagctccc 601 gccaccgccg agccctggac accaactact gcttcagctc cacagaaaag aactgctgtg 661 ttcgtcagct ctacattgac ttccggaagg acctgggctg gaagtggatt catgaaccca 721 aggggtacca cgccaatttc tgcctggggc cctgccctta catctggagc ctggatacac 781 agtacagcaa ggtcctggcc ctgtacaacc agcacaaccc gggcgcttcg gcggcgccgt 841 gctgcgtgcc tcaggcgctg gagcccctgc ccatcgtgta ctacgtgggc cgcaagccca 901 aggtggagca gttgtccaac atgatcgtgc gctcctgcaa gtgcagctga ggccccgtcc 961 caccccaaca gcccccgccc cgtagccccg ccacccggca gcccggcccc gcccccgccc 1021 gcctcaccag gactgtattt aaggacacag caccaccccc ccccccccac tcccatcaag 1081 cccacctggg gtccattaaa ggtggcgaga ggaaaaa // LOCUS FSBPROLA 1148 bp ss-mRNA VRT 11-AUG-1990 DEFINITION O.keta (chum salmon) prolactin mRNA, complete cds. ACCESSION M36267 KEYWORDS prolactin. SOURCE O.keta pituitary gland, cDNA to mRNA. ORGANISM Oncorhynchus keta Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Osteichthyes; Actinopterygii; Salmoniformes; Salmonoidei; Salminidae. REFERENCE 1 (bases 1 to 1148) AUTHORS Song,S., Trinh,K.-Y., Hew,C.L., Hwang,S.-J., Belkhode,S. and Idler,D.R. TITLE Molecular cloning and expression of salmon prolactin cDNA JOURNAL Eur. J. Biochem. 172, 279-285 (1988) STANDARD simple staff_review FEATURES from to/span description pept 19 654 prolactin precursor sigp 19 87 prolactin signal peptide matp 88 651 prolactin mRNA 1 1148 prolactin mRNA BASE COUNT 319 a 284 c 239 g 306 t ORIGIN 1 gtccgaaaag aaggaaagat ggctcgccga tcccagggta ccaaactcca cttagcagtt 61 ctgtgtctag ttgtgtcctg tcatgccatt ggccttagtg acctaatgga gagagcttcc 121 cagcgatcag acaagcttca ctcactcagc acttccctca ccaaggacct ggactctcac 181 ttcccaccaa tgggacgagt gatgatgcca cgcccgtcta tgtgtcacac ctcctcgctc 241 cagacaccca aggacaagga gcaagcactc aaggtatcgg agaatgagct gatctccctg 301 gctcgctacc tcctcctggc ctggaacgat cccctgctgc tgctctcctc agaggcgccg 361 actctgcccc acacgccgtc caatggcgac atcagcagta agatcaggga actgcaggac 421 tactccaaga gcctgggaga tggactggac ataatggtca acaagatggg accctcctcc 481 cagtacattt cttcaatccc cttcaagggt ggagacctcg gcaatgacaa gacctcccgc 541 ctcatcaact tccacttcct catgtcttgc ttccgcaggg actcccacaa aatcgacagt 601 ttcctcaagg tccttaggtg ccgggctaca aatatgcgac cagaaacatg ttaggagaaa 661 atggcaggat tttggttctg gattgttcca ttttcaaact gatagtcaaa atggggtagg 721 aattcaagag agtgattgtc ttgttgtagt ttgattttgt gaaataacta atgctgccac 781 ctacatcaca tttggactat tcatagatct atacagttag tagtcatcct ggttatctga 841 taccgacatg tttacctata catatcttta taggtagttt atgttcggta atgcattatg 901 catcagaaaa atgatgcaac ctaagtattg gtcacttata tgatcatatt caatagattc 961 atatatagat aacaaaggat cctcctagtc attcttagta gatatacagg atggctgata 1021 ggtagatgcc atgaaccgtc atgattaaca atctagaact tgaatatagg taactcagcc 1081 tttgtaaact attgataatt gcttccaaga gttctgttta cacaataaat caagtaagat 1141 tttgcaat // LOCUS PETIRAB 183 bp ds-DNA ORG 11-AUG-1990 DEFINITION P.hybrida chloroplast rps19' and rp12 genes, partial cds. ACCESSION M35955 KEYWORDS inverted repeat; rp12; rps19'. SOURCE Petunia hybrida chloroplast DNA. ORGANISM Chloroplast Petunia hybrida Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae; Petunia hybrida. REFERENCE 1 (bases 1 to 183) AUTHORS Aldrich,J., Cherney,B.W., Williams,C. and Merlin,E. TITLE Sequence analysis of the junction of the large single copy region and the large inverted repeat in the petunia chloroplast genome JOURNAL Curr. Genet. 14, 487-492 (1988) STANDARD simple staff_review FEATURES from to/span description pept > 183 143 (c) rp12 gene product (AA at 181) pept 91 < 1 (c) rps19' gene product (gtg start codon) BASE COUNT 58 a 28 c 26 g 71 t ORIGIN 1 cgcccatacc ataattcaga attcatcaaa tattagataa atttaataaa tgattggcta 61 caaaggggtt tttttttagt gaacgtgtca caagcttact cctatttttt ttttttttgt 121 aaagtttttg taaaacgaag aattaaattc gatttcctct cctatttact atggcgacga 181 aga // LOCUS PETIRABX 162 bp ds-DNA ORG 11-AUG-1990 DEFINITION P.hybrida chloroplast rps19 and rp12 genes, partial cds. ACCESSION M37322 KEYWORDS inverted repeat; rp12; rps19. SOURCE Petunia hybrida chloroplast DNA. ORGANISM Chloroplast Petunia hybrida Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Asteridae; Solanales; Solanaceae; Petunia hybrida. REFERENCE 1 (bases 1 to 162) AUTHORS Aldrich,J., Cherney,B.W., Williams,C. and Merlin,E. TITLE Sequence analysis of the junction of the large single copy region and the large inverted repeat in the petunia chloroplast genome JOURNAL Curr. Genet. 14, 487-492 (1988) STANDARD simple staff_review FEATURES from to/span description pept < 1 41 rp12 gene product (AA at 3) pept 92 > 162 rps 19 gene product (gtg start codon) BASE COUNT 70 a 26 c 21 g 45 t ORIGIN 1 tcttcgtcgc catagtaaat aggagaggaa atcgaattta attcttcgtt ttacaaaaac 61 ttacaaaaaa aaaaaaaata ggagtaagct tgtgacacgt tcactaaaaa aaaacccctt 121 tgtagccaat catttattaa ataaaattga taagcttaac ac // LOCUS RATATROPO 1633 bp ds-DNA ROD 11-AUG-1990 DEFINITION Rat alpha-tropomyosin and N5 genes, 5' flank. ACCESSION J05467 KEYWORDS alpha-tropomyosin. SOURCE Rat (Sprague-Dawley) adult liver DNA. ORGANISM Rattus norvegicus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1633) AUTHORS Herrera,V.L.M. and Ruiz-Opazo,N. TITLE Regulation of alpha-tropomyosin and N5 genes by a shared enhancer JOURNAL J. Biol. Chem. 265, 9555-9562 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by N.Ruiz-Opazo, 04-JUN-1990. FEATURES from to/span description signal 569 772 enhancer BASE COUNT 424 a 509 c 405 g 294 t 1 others ORIGIN 1 bp upstream of HindIII site. 1 aagcttaaag agatgggggg ggggggtgtc ctcctgcctt cagtgcagac ctggctggcc 61 cctttccctc tgcagagttc accctacgga ttgaaccatt tccgtagcta atttgtagca 121 tgttttcagc accagcaccc aagtcgctaa cagagaaaag aaggacnccc accccccacc 181 cccaatagat ccgagcaagg aaaaaaatga aaagctagat ctgaagggat tttaggaatc 241 cactaccctc tgaatttaga tatgaggaaa cctgaactaa caagatggat ggctcagcag 301 gtaacggagc ttgctatcaa gctcaaagac ctgtgcacgg atcgaacacc cacatggtag 361 aaacaactga ttcttaaaga gttgctctct gaccgccaca gcaccaccac gccacgccca 421 cgcccctata agtaaacgac taaagtaagg gaagctacag catggaagct gtttctaatc 481 acagctacac aatgtagctg ttagctagct aatgactgaa gaccagaccc ctcccactcc 541 ccacgatgtt aactgagtct aaactcaaca cacacacaca cacacacaca cacacacaca 601 cacacacctt ggcacactcc ttctaccctt tccaaagcac agcacaggca cataactcag 661 caaaccacac acacacacac acacacacac acacacacac acacacacac acaccttggc 721 acactccttc taccctttcc aaagcacagc acaggcacat aactcagcaa accgggccct 781 aagttcctaa tgtattttac tgcccacctc cagcttgggt gaaacgattg ggagacttcc 841 tgtgtgtatc attatgccca ccatcacggg cagagagaca gggcctcagg aatggccccc 901 acaccttcct ccacctctat ttctctgcgt gggttgaaat ttccagactg tgagagccca 961 ggactatgac tggtgggtgg actcagagct gctagaagga cgcagctagg tactgagaac 1021 tcccaggccc agatcgaagg tgcggtggct gtgactggca gcccccggac cttacagtga 1081 gcaccccagg acccggaggc caagtgagat ggtgcgcaca caccacctgc cctgttcccg 1141 ggattccctg tgactcccga cagactcttc tggggaccct gaccgccagg cgacggctct 1201 gggaggtggc tagagactcc tcgccagtac cgggctcctc tccagcgagg tccgggagct 1261 ctttcccagg cgtgcaggtt tagaaattaa ggggagcctc gcccggccgc ggtctcggag 1321 gctcgggtcc ccatatatag tcatatccac cgtcgactgg gagcggcagg cggcagcgaa 1381 tggcgtgagg ccccgcggga ggagcgagga gggggcatgg ggcggaggca ggagactggg 1441 aaagggggaa ggaggaacaa gcttttccaa aaaagtattg gctgacctaa ggaatgcggt 1501 cgccccctgg gaaagtacat atcagggagc agcaggcacg tccgcgctcg caccttccgg 1561 cctctgccac cctaccgcgc tcgcgtctcc tctggctcct ctcgctaagg gcccctcgcc 1621 accgccacca tgg // LOCUS STMXYLI 1201 bp ds-DNA BCT 11-AUG-1990 DEFINITION S.violaceoniger xylose isomerase gene, complete cds. ACCESSION M36269 KEYWORDS glucose isomerase; xylose isomerase. SOURCE S.violaceoniger DNA. ORGANISM Streptomyces violaceoniger Prokaryota; Bacteria; Firmicutes; Streptomycetaceae. REFERENCE 1 (bases 1 to 1201) AUTHORS Drocourt,D., Bejar,S., Calmels,T., Reynes,J.P. and Tiraby,G. TITLE Nucleotide sequence of the xylose isomerase gene from Streptomyces violaceoniger JOURNAL Nucleic Acids Res. 16, 9337-9337 (1988) STANDARD simple staff_review FEATURES from to/span description pept 24 1193 xylose isomerase BASE COUNT 187 a 455 c 402 g 157 t ORIGIN 1 cgtacggaac aaggagcagc gccatgagct tccagcccac ccccgaggac aagttcacct 61 tcggtctgtg gaccgtcggc tggcagggaa gggacccgtt cggcgacgcc acccgccctg 121 ccctcgaccc ggtcgagacc gtgcagcgcc tggccgagct gggcgcctac ggagtgacct 181 tccacgacga cgacctgatc cccttcgggt cgtccgacac cgagcgcgag tcgcacatca 241 agcggttccg ccaggccctg gacgccactg gcatgacggt gccgatggcc accacgaacc 301 tcttcaccca ccccgtcttc aaggacggcg gtttcaccgc caacgaccgc gacgtgcgcc 361 gctacgcgct gcgcaagacg atccgcaaca tcgacctggc ggccgagctg ggcgccaaga 421 cgtacgtcgc ctggggcggc cgtgagggcg ccgagtccgg tggcgccaag gacgtgcgcg 481 acgccctcga ccgcatgaag gaggcgttcg acctcctcgg cgagtacgtc accgcccagg 541 gctacgacct ccgcttcgcc atcgagccca agcccaacga gccccgcggc gacatcctcc 601 tgcccaccgt cggccacgcc ctggccttca tcgagcgcct ggagcgcccg gagctgtacg 661 gcgtcaaccc ggaggtcggc cacgagcaga tggccggcct gaacttcccg cacggcatcg 721 cgcaggccct gtgggcgggc aagctcttcc acatcgacct caacggccag tccggcatca 781 agtacgacca ggacctgcgg ttcggcgccg gcgacctgcg ggcggcgttc tggctggtcg 841 acctcctgga gagcgccggt tacgagggcc cgcggcactt cgacttcaag ccgccgcgga 901 ccgaggactt cgacggcgtg tgggcctcgg ccgagggctg catgcgcaac tacctgatcc 961 tcaaggagcg cgcggccgcc ttccgcgccg acccggaggt gcaggaggcc ctgcgcgccg 1021 cgcgtctgga ccagctggcc cagccgaccg cggcggacgg cctggaggcc ctgctcgccg 1081 accgcaccgc gttcgaggac ttcgacgtgg aggcggccgc cgcgcgcgcg gcatggccgt 1141 tcgaacgcct cgaccagctg gcgatggacc acctgctggg cgcgcgcggc tgacggtcgc 1201 g // LOCUS BNAALS 3315 bp ds-DNA PLN 11-AUG-1990 DEFINITION B.napus chloroplast acetolactate synthase (ALS) gene, complete cds. ACCESSION M20498 KEYWORDS acetolactate synthase. SOURCE B.napus (cv. Westar) 3 week old leaf DNA. ORGANISM Brassica napus Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae. REFERENCE 1 (bases 1 to 3315) AUTHORS Wiersma,P.A., Schmiemann,M.G., Condie,J.A., Crosby,W.L. and Moloney,M.M. TITLE Isolation, expression and phylogenetic inheritance of an acetolactate synthase gene from Brassica napus JOURNAL Mol. Gen. Genet. 219, 413-420 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by W.L.Crosby, 06-DEC-1988, for release after publication. FEATURES from to/span description pept 784 2697 acetolactate synthase (ALS) precursor (EC 4.1.3.18) sigp 784 1011 acetolactate synthase (ALS) signal peptide matp 1012 2694 acetolactate synthase (ALS) signal 2111 2116 transcriptional stop sequence (put.) signal 446 459 promoter element (put.) signal 665 668 CAAT box signal 705 709 TATA box signal 2894 2899 polyA signal BASE COUNT 954 a 712 c 702 g 947 t ORIGIN 1 tctagatata aaagtatttt attacttaaa aagtatattt ttttgttaat taaagatatc 61 ttttagatgg actattacta ttttgtgaac tttctttttt ttctgaaatc atattatata 121 tacatatttt tttgttttta aattattttt taaaacttta taaactttcc ttttttattt 181 atattgttat ttgaaaaatt ttaagaaagt aaagaataat atttaaatgt acttttttta 241 tttcattagg gatatcatag taatcaacca ccgtgagagt taacgtgagt gcgacaccta 301 aaaaagtgac ttctcaaata atattatgga gatgtatcta caagaaaaat aaacattgtt 361 gtgtaaattt tgaataaatg actgagggca tccacatggt aaaatttact aagagtatct 421 gaacaaaaat attaaaataa aaataaaaag aaagaaagag aaatcagaaa acaagagaag 481 ccattggtta agaagagact cacacattct tttgacagag tttttctacg gatgacattt 541 attgaatgag acaacaaatt ttacagcatt aagttttatt catctaatta acctattaat 601 aattttatgt taagagacat ggataaagaa acttattgct gcagacgctc tgagatgggg 661 ttggcaatat tagttacgag ccaacttgaa gtgtcacatg catgtataaa ttttagttct 721 ctctcacaga gtttatttac tcattcaaat taagcaattt ctcgcaacac tccattttgc 781 accatggctt cgttttcgtt cttcggcacc attccgtcgt ctcccacaaa agcttccgtc 841 ttctccctgc cggtgtcggt aactacgctc ccgtccttcc cgcgccgccg tgctactcgt 901 gtctccgttt ccgccaactc gaagaaagac caagaccgca cagcttcacg tcgagagaat 961 ccgagcacat tcagctccaa atacgctccc aacgtgcccc gcagtggcgc agacatcctg 1021 gtcgaagccc tggagcgtca aggagtggac gtagtcttcg cttacccagg aggcgcatca 1081 atggagatcc atcaagccct aactcgctcc aacacaatcc gaaacgtcct tccccgtcac 1141 gaacaaggag gtatcttcgc cgccgagggt tacgctcgtt cctccggtaa acccggaatc 1201 tgcatcgcca cttccggtcc aggagctatg aatctcgtca gcggattagc cgacgccctg 1261 tttgacagcg tacccctcat cgcaatcaca ggacaggtcc ctcgccggat gattggtacc 1321 atggcgttcc aggagacacc cgttgttgag gtaacgagga ctataacgaa acataactat 1381 cttgttatgg aagttgatga tatacctagg atcgttcgag aagctttctt tctagctact 1441 tcggttagac cgggaccggt tcttatagac gtccccaaag atgttcagca acagtttgcg 1501 attcctaact gggaacagcc tatgcgctta cctctttaca tgtctacgat gcctaaaccc 1561 cccaaagttt ctcacttaga gcagattctt aggttggttt cggagtctaa gaggcctgtc 1621 ttgtacgttg gaggtggttg tctgaactcg agtgaggaac tgcgcagatt tgtggaactt 1681 actggcatcc ctgttgctag tacgttcatg ggacttggat cgtatccttg tgacgatgaa 1741 gagttctctc tgcaaatgct aggaatgcat ggaacagtgt acgctaatta cgctgtcgag 1801 tatagcgatc ttctgcttgc ttttggggtt aggtttgacg accgtgtgac cggaaagctt 1861 gaggcctttg ctagccgggc caagatcgtg catattgata ttgattctac cgaaatcggg 1921 aagaacaaga cacctcatgt gtcggtgtgt tgtgatgttc agctagcctt gcaagggatg 1981 aacgaggttc ttgagaaccg acgagatgtg cttgacttcg gggaatggag atgtgaattg 2041 aacgaacaga gactaaagtt ccctctccgc tacaagacgt ttggggaaga gattcctcca 2101 cagtacgcca ttcaactact tgacgagcta accgacggga aggcaattat cactactggt 2161 gtcgggcaac accagatgtg ggccgcccaa ttctacagat tcaagaaacc ccgccaatgg 2221 ctgtcttcag gaggcctagg agccatgggg ttcggtcttc ctgcagccat gggagccgct 2281 atagccaacc cgggagcagt ggttgtcgac attgatgggg atggtagctt catcatgaac 2341 attcaagaac tggcaaccat cagggttgag aatcttccag tcaaggtttt gctgattaat 2401 aatcagcacc tcggaatggt ccttcagtgg gaagaccact tctacgcagc taacagagcc 2461 gattcttttc tgggagaccc ggcgaaccct gaggcggtat tcccggatat gctgttgttc 2521 gccgcatcgt gcggtatacc agccgccagg gtcaccagaa gggaggacct ccgagaggca 2581 atccagacaa tgctggacac acctggacca ttcttgttgg atgtggtctg tcctcaccag 2641 gaccatgtgt taccactcat ccctagtggc ggcaccttca aggacattat tgtgtagtac 2701 tactttaatg tttgattatt ctatgctctc ttctgttttg taataacaaa gtaaaaccag 2761 tttgcaatat cagaagatct tctgattatc aatatcatgc tactctcata atgaaaaacg 2821 caaacttcac tctttattta tatgagaata ataactgtag tgtttacata acaatttata 2881 agtatgaaag actaataaaa aaaaagttat atacttgata aaaaggcaga agtaagttta 2941 caaatgtagt aggccatcat aacgatctga taaggaattt ttgtataggg actaatacct 3001 ctatgaagga gaacatgtat cgcaacgagt tgacatatgt gtgaacctca gatggactcg 3061 atgttaagtt ctacgttcct gcatgaaaaa aaaaaaacat acaatttttt tcaagcatgg 3121 gtaaccttat tgaatacgaa caatccagac gaaaatgggg caaaactacc ttgagatctg 3181 cacctgaaca gaaaactcca ggaaccagac ctgtgatcat cacaactcta aaagctgaac 3241 tatcctgctg tatggtctca aatgtgttct gcatctcctt atcaatagca tttttcggct 3301 actggcctgt ctaga // LOCUS HUM5AR 2102 bp ss-mRNA PRI 11-AUG-1990 DEFINITION Human steroid 5-alpha-reductase mRNA, complete cds. ACCESSION M32313 KEYWORDS dihydrotestosterone; steroid 5-alpha-reductase. SOURCE Human adult prostate, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2102) AUTHORS Andersson,S. and Russell,D.W. TITLE Structural and biochemical properties of cloned and expressed human and rat steroid 5-alpha-reductases JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 3640-3644 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by D.W. Russell, 23-FEB-1990, for release after publication. FEATURES from to/span description pept 31 810 steroid 5-alpha-reductase (EC 1.3.99.5) mRNA < 1 2102 steroid 5-alpha-reductase mRNA BASE COUNT 470 a 482 c 486 g 664 t ORIGIN 1 gggcatggag cacgctgccc agccctggcg atggcaacgg cgacgggggt ggcggaggag 61 cgcctgctgg ccgcgctcgc ctacctgcag tgcgccgtgg gctgcgcggt cttcgcgcgg 121 aatcgtcaga cgaactcagt gtacggccgc cacgcgctgc ccagccacag gctccgagtg 181 ccggcgcggg ccgcctgggt ggtgcaggag ctgccctcgc tggccctgcc gctctaccag 241 tacgccagcg agtccgcccc gcgtctccgc agcgcgccca actgcatcct cctggccatg 301 ttcctcgtcc actacgggca tcggtgctta atttacccgt ttctgatgcg aggaggaaag 361 cctatgccac tgttggcatg tacaatggcg attatgttct gtacctgtaa cggctatttg 421 caaagcagat acttgagcca ttgtgcagtg tatgctgatg actgggtaac agatccccgt 481 tttctaatag gttttggctt gtggttaaca ggcatgttga taaacatcca ttcagatcat 541 atcctaagga atctcagaaa accaggagat actggataca aaataccaag gggaggctta 601 tttgaatacg taactgcagc caactatttt ggagaaatca tggagtggtg tggctatgcc 661 ctggccagct ggtctgtcca aggcgcggct tttgctttct tcacgttttg ttttttatct 721 ggtagagcaa aagagcatca tgagtggtac ctccggaaat ttgaagagta tccaaagttc 781 agaaaaatta taattccatt tttgttttaa gtgcgttttt catgaaatta tcttcaactt 841 gaagctttcc aatggcgctt ctctatggac tttgtaaata agttatatct ttgtaatttt 901 cctgctactt tatcattttc aagatgtcct ctaggaattt tttttctagt aattttgcaa 961 tctacctaat aagtacctaa atacgctgaa atggaggttg aatatcctac tgtgtaacag 1021 gtcagaattt caagctctgg gtaataactg ctgatatttt ttctaatttc aaatttacct 1081 cttttggcta tgtcttgcca agtgtgtatg agactagact ttacaactgt ctttgatggc 1141 attttcagaa caataaatgt cacaatccct tctatagccc cctacagtga tctcttcaag 1201 gtcaactgca gtgttgcttc cctcccccta tagggctgga atctgtctag gagccctctc 1261 tcggaggcca cagaggctgg gggtagccat tgtgcagtca tggcccgggg gaaacttgcc 1321 aaccttcgtg tcaggtgctg tgtgtaagtg gagaacttgg ggatagagga ggaagctcct 1381 cgtggccctt ccaaggtgag gcaaaggcat ctggacttgt tccagcccag cccaccgggt 1441 gacatcaccg ggcagggagg ggtgctggtg gtggttcata cggagtaagc tgctctgcct 1501 gtgtgagtgg ctcctgggcc ctaaacaggc acctttaggc catgggtcac tcaccgtgag 1561 ccatcaatgt gctctggtct gacatggttt ctctctgtct tctagtctag acctagtttt 1621 tttgttctgt tccccacgta tggatatagt agagattgtt gtctgtgaaa tttctctttt 1681 gtagattttg agttttccct tgtagtgtaa agaatgatca ctttctgtaa caataacaag 1741 accacttttt aagatttatc ctgtttgttc tttgttgatt gaaacataat aattgttaaa 1801 attctctaca gccttctttt tcttccatag ctaatcttcc ttctaatagt ttttgctttc 1861 tgttttgctg ttgttgcttt gcaaagcttt cccctcatag cctgtacctg ttatcaatat 1921 aaaataatct tcctgttgaa tgcttcatga cttgaattct actttgataa aaacattgcc 1981 atactgcttt ttatcttgat gaattcatct ggcattgctt tgccttatca tctcatctgg 2041 agtttttaaa tgccatttgt ttcagttgtc tttaacaaca taataaatag actttgccat 2101 tt // LOCUS SPIBADH 1797 bp ss-mRNA PLN 11-AUG-1990 DEFINITION Spinach betaine-aldehyde dehydrogenase (BADH) mRNA, complete cds. ACCESSION M31480 KEYWORDS betaine-aldehyde dehydrogenase. SOURCE Spinach salt-stressed leaf, cDNA to mRNA. ORGANISM Spinacia oleracea Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Caryophyllidae; Caryophyllales; Chenopodiaceae. REFERENCE 1 (bases 1 to 1797) AUTHORS Weretilnyk,E.A. and Hanson,A.D. TITLE Molecular cloning of a plant betaine-aldehyde dehydrogenase, an enzyme implicated in adaptation to salinity and drought JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87, 2745-2749 (1990) STANDARD full staff_review COMMENT Draft entry and printed sequence for [1] kindly submitted by A.D.Hanson, 27-MAR-1990,, for release after publication. FEATURES from to/span description pept 68 1561 betaine-aldehyde dehydrogenase (BADH) (EC 1.2.1.8) BASE COUNT 504 a 324 c 443 g 526 t ORIGIN 1 cgttgcgtgc tcgccttacc ctctcaactc aatttcttca acccaatttc ttcgcattta 61 accaagaatg gcgttcccaa ttcctgctcg tcagctattc atcgacggag agtggagaga 121 acccattaaa aaaaatcgca tacccgtcat caatccgtcc actgaagaaa tcatcggtga 181 tattccggca gccacggctg aagatgtgga ggttgcggtg gtggcagctc gaagagcctt 241 taggaggaac aattggtcag caacatctgg ggctcatcgt gccacatact tgcgtgctat 301 tgctgctaag ataacagaaa aaaaagatca tttcgttaaa ctggaaacca ttgattctgg 361 gaaacctttt gatgaagcag tgctggacat tgatgacgtt gcttcatgtt ttgaatattt 421 tgccggacaa gcagaagctc ttgatggtaa acaaaaggct ccagtcaccc tgcctatgga 481 aaggttcaaa agtcatgttc tcaggcagcc ccttggtgtt gttggattaa tatccccatg 541 gaattaccca cttctaatgg ctacatggaa aattgctcca gcacttgctg ctgggtgtac 601 agctgtactt aagccatccg agttggcatc tgtgacttgt ctagaattcg gtgaagtttg 661 caacgaagtg ggacttcctc caggcgtgtt gaatatcttg acaggattag gtccagatgc 721 tggtgcacca ttagtatcac accccgatgt tgacaagatt gcctttactg ggagtagtgc 781 cactggaagc aaggttatgg cttctgctgc ccaattggtt aagcctgtta cattagaact 841 tgggggtaaa agtcctattg tagtgtttga agatgttgat attgataaag ttgtggaatg 901 gactattttt ggctgtttct ggacaaatgg tcaaatatgt agtgcaacgt ctagactgct 961 tgtgcatgaa agtattgcag ctgagtttgt tgataagctt gtaaaatgga cgaaaaacat 1021 taaaatttct gacccatttg aagaaggatg ccggcttggc cctgttatta gtaaaggaca 1081 gtacgacaaa attatgaagt tcatatcaac agcaaagagt gagggggcaa ctattttgta 1141 tggaggttcc cgtcctgagc atttgaagaa aggttattac attgaaccca ccattgtaac 1201 tgatatctcc acatccatgc aaatatggaa agaggaagtt tttggccctg tcttgtgtgt 1261 taaaacattt agttccgaag atgaagccat tgcattggca aatgatacag agtacggttt 1321 agctgctgct gtgttttcta atgatcttga aagatgtgag aggataacga aggctctaga 1381 agttggagct gtttgggtta attgctcaca accatgcttt gttcaagctc cttggggagg 1441 catcaagcgt agtggttttg gacgtgaact tggagaatgg ggtatccaga attacttgaa 1501 tatcaagcag gtgactcaag atatttctga tgaaccatgg ggatggtaca agtctccttg 1561 aagctatgat caaatttgaa tgacggtgtt gtttttgtta agtgagcagc ggttggactg 1621 taccttgaaa tggttcgcag agaaggtcga gtttacagta aaaatggatt gaataaaggg 1681 ttggttgatg cagaagtcca acaagcatag cttaattttg ttgtatcatg taatagtgtg 1741 tattatttca gacaattgag ttggatcaga aataagatga tacagagttt gacaatg // LOCUS ECORTSA 1407 bp ds-DNA BCT 11-AUG-1990 DEFINITION E.coli rts gene, complete cds. ACCESSION M36321 KEYWORDS rts gene. SOURCE E.coli DNA. ORGANISM Escherichia coli Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Facultatively anaerobic rods; Enterobacteriaceae. REFERENCE 1 (bases 1 to 1407) AUTHORS Flamm,J.A., Friesen,J.D. and Otsuka,A.J. TITLE The nucleotide sequence of the Escherichia coli rts gene JOURNAL Gene 74, 555-558 (1988) STANDARD simple staff_review FEATURES from to/span description pept 294 1304 rts gene product (gtg start codon) BASE COUNT 383 a 316 c 298 g 410 t ORIGIN 1 gcgcgcattc tatggagaca tgcgagttca tgcaactaaa aaattgcata atttgtttta 61 ttggtcacat tttatgcgac acgatgaaga aacagccgaa cggagggcag attagcacac 121 tttttcaaca tcattgtgct caacaatgcg ctcctgctaa accataattc tttttatcag 181 atggaatatc tgtcacattg cttttcaacg atagcttcct ggcagagatt ttttcttatt 241 attcctcccc atctggtgtt accctcctgc ccattaaccc attcaacaga actgtgacgc 301 gccatggcaa atatcgcttt gccgatagag ctatgaccgc cagaaacatg cttatgagta 361 taaaagagca aacgttaatg acgccttacc tacagtttga ccgcaaccag tgggcagctc 421 tgcgtgattc cgtacctatg acgttatcgg aagatgagat cgcccgtctc aaaggtatta 481 atgaagatct ctcgttagaa gaagttgccg agatctattt acctttgtca cgtttgctga 541 acttctatat aagctcgaat ctgcgccgtc aggcagttct ggaacagttt cttggtacca 601 acgggcaacg cattccttac attatcagta ttgctggcag tgtcgcggtg gggaaaagta 661 caaccgcccg tgtattgcag gcgctattaa gccgttggcc ggaacatcgt cgtgttgaac 721 tgatcactac agatggcttc cttcacccta atcaggttct gaaagaacgt ggtctgatga 781 agaagaaagg cttcccggaa tcgtatgata tgcatcgcct ggtgaagttt gtttccgatc 841 tcaaatccgg cgtgccaaac gttacagcac ctgtttactc acatcttatt tatgatgtga 901 tcccggatgg agataaaacg gttgttcagc ctgatatttt aattcttgaa gggttaaatg 961 tcttacagag cgggatggat tatccacacg atccacatca tgtatttgtt tctgattttg 1021 tcgatttttc gatatatgtt gatgcaccgg aagacttact tcagacatgg tatatcaacc 1081 gttttctgaa attccgcgaa ggggctttta ccgacccgga ttcctatttt cataactacg 1141 cgaaattaac taaagaagaa gcgattaaga ctgccatgac attgtggaaa gagatcaact 1201 ggctgaactt aaagcaaaat attctaccta ctcgtgagcg cgccagttta atcctgacga 1261 aaagtgctaa tcatgcggta gaagaggtca gactacgcaa ataatttgca ggggagcgaa 1321 tactcccctt tcttattttt ctgcactacg cagggatatt tcaccgccca tccagggttt 1381 tattattcca tcctgctcaa gtaataa // LOCUS MUSBCLQ 1056 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse complement component subunit Clq B-chain mRNA, complete cds. ACCESSION M36293 KEYWORDS complement component C1q. SOURCE Mouse Englebreth-Holm-Swarm tumor, cDNA to mRNA. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 1056) AUTHORS Wood,L., Pulaski,S. and Vogeli,G. TITLE cDNA clones coding for the complete murine B chain of complement Clq: Nucleotide and derived amino acid sequences JOURNAL Immunol. Lett. 17, 59-62 (1988) STANDARD simple staff_review FEATURES from to/span description pept 157 918 complement component subunit Clq B-chain precursor sigp 157 231 complement component subunit Clq B-chain signal peptide matp 232 915 complement component subunit Clq B-chain BASE COUNT 236 a 320 c 285 g 215 t ORIGIN 1 acagcaaggg aaggttcttc cttcctctag ggacccagac ttccgctttc tgaggacagc 61 agcaggctct gggctctggg aatccactgc tgtccggcct agaagcatca cagaacacca 121 ggattccata cacaggaagc ccctgaggct gagctgatga agacacagtg gggtgaggtc 181 tggacacacc tgttactgct gcttctaggt tttctccatg tgtcctgggc ccaaagcagc 241 tgcaccgggc cccctggcat ccctggcatc cctggggtcc ctggggttcc tggctctgat 301 ggccaacctg gcactccagg gattaaaggg gagaaagggc tccctggact ggctggagac 361 cttggtgagt ttggagagaa aggggaccca gggatccctg ggactccagg caaagttggc 421 cctaagggtc ccgtcggccc taagggtact ccaggcccct ctggaccccg cggtcccaaa 481 ggcgattctg gggactacgg ggctacacag aaagtcgcct tctctgccct gaggaccatc 541 aacagcccct tgcgaccgaa ccaggtcatt cgcttcgaaa aggtgatcac caacgcgaac 601 gagaactatg agccacgcaa cggcaagttc acctgcaagg tgcctggcct ctactacttc 661 acctatcatg ccagctcccg gggcaacctg tgtgtgaatc tcgttcgtgg ccgcgatcgg 721 gacagcatgc agaaagtagt caccttctgt gactatgccc agaacacctt ccaggtgacc 781 acaggtgggg tagtcttgaa gctagagcaa gaggaggttg ttcacctgca ggccacagac 841 aagaactccc tcctgggcat tgagggtgcc aacagcatct tcactggctt tctgcttttc 901 cctgacatgg atgcgtaatc acggggtcaa attacaccta tccaacacca tcttcctgct 961 cctgcagcaa tcctccctgg acccctgaca tcaccccctt gactgcctga aacccagacc 1021 agagccctgt agatgttaca gaacgaatgg gtcaat // LOCUS MUSHOX23R 3542 bp ss-mRNA ROD 11-AUG-1990 DEFINITION Mouse Hox2.3 protein mRNA, complete cds. ACCESSION X06762 M36183 KEYWORDS Hox2.3 protein; homeo box. SOURCE Mouse plasmacytoma cell line, cDNA to mRNA, clones pRO.8, pR1.2, and genomic clones L13 and L2. ORGANISM Mus musculus Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Rodentia; Myomorpha; Muridae; Murinae. REFERENCE 1 (bases 1 to 3542) AUTHORS Meijlink,F., Verrijzer,P., De Laaf,L., De Graaff,W. and Deschamps,J. TITLE A mouse homeobox containing gene on chromosome 11: Sequence and tissue-specific expression JOURNAL Nucleic Acids Res. 15, 6773-6786 (1987) STANDARD simple staff_review REFERENCE 2 (bases 1184 to 2206; revises [1]) AUTHORS Verrijzer,P., De Graaff,W., Deschamps,J. and Meijlink,F. TITLE Nucleotide sequence of the Hox2.3 gene region JOURNAL Nucleic Acids Res. 16, 2729-2729 (1988) STANDARD simple staff_review FEATURES from to/span description pept 2187 2840 Hox2.3 polypeptide mRNA 2100 3373 Hox2.3 mRNA site 2595 2774 homeo box site 3371 3371 hox2.2 mRNA BASE COUNT 738 a 924 c 951 g 890 t 39 others ORIGIN Chromosome 11. 1 gagctcggcc ccgcgagcga cgcaggagct gggcctccca cagcagcgtc ccccgccgcg 61 ccagtccccg ctagtggtag tatctcgtaa tagcttctgt gtgtgagcta ccgtggatct 121 ccttcccttc tcttgggggt ccggggggaa aaaaagaaaa ggattttaag caaggactcc 181 ctcgtcctgc gagggtgatc gactgcggcc tggcagaacc ccctcgcccc cgccccatgt 241 aaaaaagcct ccttgtgcaa tggtctgttt cctttgaacg tgcttctttg taatgaccga 301 ggtaccgatt tctgctaagt tttcccaaca acatgaaact gcctattcac gccgtaattc 361 tttctgtctc ccgctcactt tctctctttc tctcgctctc tttctctcac cgcgtcccca 421 tctttcctcg caaccccctc tccccgctgc cctccctagc tggctttctc tcttgcttct 481 ctcttttcct cctgtccccc cacccccacc ccctttggtt tgacaatttt gtcttaagtg 541 tttctcaaaa gagattactt tagttagcat gcgcgctgtg agcattgtta aaagtgttct 601 taggtttact gtgaagagaa tgtatcctgt atctgtgaat tgctttatgg gggggaggga 661 gggctaatta tatattttgt tgttcctcta tactttgttc tgttgtctgc gcctgaaaag 721 ggcggaagag ttacaataaa gtttacaagc agagaacccg agnnnnnnnn nnnnnnnnnn 781 nncccgggcc gccgctcctc attcgctcct aggcgccttg cagggctggg ggtggggggg 841 agctggtcag caggctcctg ggctggccta ggctaggtcg ctgagaggag ggggcggggg 901 cgggggctgg aagcaggtgg tgcgagtccc tgggcccagg ggcgcagggg gtgagggagg 961 cggctgaacg tgattggagg agagaggatc gagggagggg agccaagaga aaccccctcc 1021 ccttgcattc tgaggctgaa ggaccaggga gactccagcg cccaggccgc tcttgggaag 1081 agatctaccc aggctggtgg ctagtgtccc ccgccgcttt tctctttgtt tccgtgtgtg 1141 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tttggggnnn nnnnnnnnnn 1201 nnnnnngggg tgagaaagat gcagcgcaag acttctgagt ttgtatttcc attttttcct 1261 tggactcagg ttggggaaac aggagcagag ggaagcggtt cctccctacc tcccctctct 1321 gggacgtcgt cactttctcc gacgtttcta ggcctcggct tgccgcagcc ttgcttcctt 1381 cgttgcttct gccttcctgg cagccacgct ccagtgagtg aggcatccgc cttccggaac 1441 cgggaaagca aagagccgga cccaagcctc cttcctccct tcctttttct cccagccccc 1501 attccattct ttttaaattt tgtatatctt tttattgtca tcagaaatct gagcgtccaa 1561 cgccttattg ggttggatct ctgccttagg gacgccttgg tcacatctag ttactacaac 1621 tggggcacta agacaaccgg gaggccaggc tcgcctcctc ttgggagaag agcagcagct 1681 cggatgaatt aacccaaatt aataaatatt cggccagcac ccaccgacca agttgcgaac 1741 attcaatccc tgcgtctctc tggctctgta accggctggg ggaaatgggt gggggatgac 1801 aacacggttc cctcagaggt tatttatttt ctcttccact caattccttc ttccccaaat 1861 ctcgcctgca agctgcctcc agcccgcggg ggtcgacagc ggcccttaag cccccagccc 1921 caatccgcag agctcggcct tcccattcat tattgatcat attttataaa tcgaacgcga 1981 cacaattttt tccacattac tgggagcctc cgggaggccg tcatagcatt ggccgagggg 2041 atatcacgtg ggccggggtc acgtggtcag aagaggaaaa agggggtcct tttggtgtaa 2101 atctggactc taattctgta atatatcaag gaatctcgta aaaccgacac taaaacgtcc 2161 ccgactacaa atcatccggc caaattatga gttcattgta ttatgcgaat gctttatttt 2221 ctaaatatcc agccgcaagt tcggttttcg ctccaggagc cttccccgaa caaacttctt 2281 gcgcctttgc ttccaacccc cagcgcccgg gctatggagc aggtccgggc gctcctttct 2341 ccgcctcggt gcagggtctg tactccggcg ggggggccat ggcgggccag agcgcggctg 2401 gcgtctatgc ggccggctac gggctcgaac cgagttcctt caacatgcac tgcgcgccct 2461 ttgagcagaa cctctccggg gtgtgtccgg gcgacgccgc caaggccggt ggcgccaagg 2521 agcagaggga ctcggacttg gcggccgaga gtaacttccg gatctacccc tggatgcgaa 2581 gctcagggcc tgaccgaaag cgaggccgcc agacctacac gcgctaccag accctggagc 2641 tggagaaaga atttcactac aatcgctacc tgactcggcg gaggcgcatc gagatcgcgc 2701 acacgctctg cctcaccgaa agacagatca agatctggtt tcagaaccgg cgcatgaagt 2761 ggaaaaagga gaacaaaacc tcaggaccgg gaaccaccgg ccaggacaag gcggaaggcg 2821 aggaggagga ggaagagtga gggacagaga aagccaagag gaggaaagcg aagagaggga 2881 gaacccaatt atgggaactg aagcatgaaa ctcaaataaa ggggcaaact atttaaatga 2941 agaggtctaa aaataataag gaaagagaag ggtgaaattt gggtttctta acactgtaaa 3001 aaaatatact acctatggga aagtgtgttg tctgtttttg tacagtatcg gaaggacatt 3061 atctacctgt tctgtggctt tctggaatgt gcctcccctt ttctatgttg ctagtaaggt 3121 ctttgtaaaa tcttgctgtt ttgtaagccc tctttgaagc tgtctttgtg aactgtggtt 3181 ccagatgaac agattagtgc ggttccttac ctaccccata cttcccagta gcagtactag 3241 aggggttgta tggagccctg aggatccact agcttctgcg tctggtgcat tttggctgct 3301 gtttctaggt cctattcacg agcctctttc tgtatatctg aaggatggaa aataaaacag 3361 gattaaatat caacagacac gctgtgtttt ctgtctccct ctggctgggg gaggtgtaag 3421 ggttggtagt gggcgagtgt cccagagccc ctctgctggt ctttttctgt cggtgctggg 3481 tttgctttta ttttccagat agcctctgct atagagactg ccagctagtg aatccccaac 3541 at // LOCUS POLVP1A 900 bp ss-RNA VRL 11-AUG-1990 DEFINITION Poliovirus type 3 VP1 gene, partial cds. ACCESSION M37321 M36161 M36162 KEYWORDS . SOURCE Poliovirus type 3 (P3/Leon/1947) RNA. ORGANISM Poliovirus type 3 Unclassified. REFERENCE 1 (bases 1 to 900) AUTHORS Minor,P.D., Evans,D.M.A., Schild,G.C., Ferguson,M. and Almond,J.W. TITLE Identification of an antigenic site in the neutralization of type 3 poliovirus JOURNAL Rev Infect Dis 6, 516-518 (1984) STANDARD simple staff_review REFERENCE 2 (bases 34 to 900) AUTHORS Almond,J.W., Cann,A.J., Minor,P.D., Reeve,P., Schild,G.C., Hauptmann,R. and Stanway,G. TITLE Nucleotide sequence from neurovirulent and attenuated strains of type 3 poliovirus JOURNAL Rev Infect Dis 6, 487-493 (1984) STANDARD simple staff_review FEATURES from to/span description pept < 1 > 900 VP1 (AA at 1) BASE COUNT 238 a 250 c 206 g 206 t ORIGIN 1 ggtattgaag atttgatttc tgaagttgca cagggcgccc taactttgtc actcccgaag 61 caacaggata gcttacctga tactaaggcc agtggcccgg cgcattccaa ggaggtacct 121 gcactcactg cagtcgagac tggagccacc aatcctctgg caccatccga cacagttcaa 181 acgcgccacg tagtccaacg acgcagcagg tcagagtcca caatagaatc attcttcgca 241 cgcggggcgt gcgtcgctat tattgaggtg gacaatgaac aaccaaccac ccgggcacag 301 aaactatttg ccatgtggcg cattacatac aaagatacag tgcagttgcg ccgtaagttg 361 gagtttttca catactctcg ttttgacatg gaattcacct tcgtggtaac cgccaacttc 421 accaacgcta ataatgggca tgcactcaac caggtgtacc agataatgta catcccccca 481 ggggcaccca caccaaagtc atgggacgac tacacttggc aaacatcttc caacccgtcc 541 atattttaca cctatggggc tgccccggcg cgaatctcag tgccatacgt ggggttagcc 601 aatgcttact cgcactttta cgacggcttc gccaaggtgc cattgaagac agatgccaat 661 gaccagattg gtgattcctt gtacagcgcc atgacagttg atgactttgg tgtattggca 721 gttcgtgttg tcaatgatca caaccccact aaagtaacct ccaaagtccg catttacatg 781 aaacccaaac acgtacgtgt ctggtgccct agaccgccgc gcgcggtacc ttattatgga 841 ccaggggtgg actataagaa caacttggac cccttatctg agaaaggttt gaccacatat // LOCUS YSCATRP3A 1240 bp ds-DNA PLN 11-AUG-1990 DEFINITION S.cerevisiae glutamine amidotransferase (TRP3) gene, 3' end. ACCESSION M36300 KEYWORDS TRP3 gene; glutamine amidotransferase. SOURCE S.cerevisiae DNA. ORGANISM Saccharomyces cerevisiae Eukaryota; Plantae; Thallobionta; Eumycota; Hemiascomycetes; Endomycetales; Saccharomycetaceae. REFERENCE 1 (bases 1 to 1240) AUTHORS Aebi,M., Furter,R., Prantl,F., Niederberger,P. and Huetter,R. TITLE Structure and function of the TRP3 gene of Saccharomyces cerevisiae: Analysis of transcription, promoter sequence, and sequence coding for a glutamine amidotransferase JOURNAL Curr. Genet. 8, 165-172 (1984) STANDARD simple staff_review FEATURES from to/span description pept 401 > 1240 glutamine amidotransferase (TRP3) mRNA 309 > 1240 TRP3 mRNA (alt.) mRNA 314 > 1240 TRP3 mRNA (alt.) mRNA 320 > 1240 TRP3 mRNA (alt.) BASE COUNT 352 a 267 c 259 g 362 t ORIGIN 1 ttcacttttt ttgcttctta taggcctttt tgaactattt tctgttaggg gcttaaagaa 61 agcgcagcaa atggctgatt ttaaatgctg ttttctcgaa agcaatgctg atgatgtacc 121 acaaagtagt atttgcgctt agcttgtcga tataagtatg cctttgagcc agtattaaac 181 ttcctactat ctcttctgga cctcaccgtg ctattcatat gactgacgac ggaaaatgcc 241 tcttgtagta atcattgaat gatcatcaat ggctcgttga ctcattctaa tctatagcag 301 atattaaggt acgcaaaaag ttcgacaagg agctggtcag tattaacaca ttaccaaatc 361 tgtttggttc tacatagaac gccataaaag taagaaaaaa atgtctgtgc acgctgcaac 421 aaacccaatc aataagcatg tggttctaat tgacaactac gattccttta cctggaacgt 481 ttacgagtac ttgtcccagg agggcgccaa agtgagcgtc taccgtaacg atgcaattac 541 agttccagaa attgccgcct tgaatcccga cacattgctt atctcgcttg gactaggcca 601 cccaaagaca gattctggca tttcaagaga ctgtatccgg tattttactg ggaaaattcc 661 tgtatttgga atctgtatgg gccagcaatg catgtttgac gtatttggtg gtgaagttgc 721 ctacgctggt gagattgtcc acggtaaaac gtccccaatc tctcacgaca actgtggaat 781 tttcaagaac gtgccgcaag gtattgctgt gacaagatac cattcattgg ccgggacaga 841 atcgtcccta ccatcctgct tgaaggttac tgcgagtacc gaaaatggaa ttatcatggg 901 tgtaagatac aagaagtaca ctgtagaagg tgtgcaattt catccggaat ccatcttgac 961 cgaggaaggt catctgatga tcaggaacat tttaaatgtc agtggaggca cttgggagga 1021 aaacaaatca tctccttcaa attctatttt ggaccgtatc tatgctcggc gtaaaataga 1081 cgtcaatgag cagtctaaaa tcccaagttt cacctttcaa gacttacaat ctaactatga 1141 tttaggtctt gccccaccgt tacaggattt ctacacggtg ttgtcatcat cccataaaag 1201 agccgttgtt cttgctgaag tcaagcgtgc ctctccatcg //